Query         lcl|NC_019769.1_cdsid_YP_007151743.1 [gene=F864_gp08] [protein=hypothetical protein] [protein_id=YP_007151743.1] [location=5831..6280]
Match_columns 149
No_of_seqs    136 out of 319
Neff          8.9
Searched_HMMs 1612
Date          Thu Nov 7 16:34:33 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_8 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_8_vs_rec_db.hhr

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 protein:vir:194 Length: 149 #  100.0 1.3E-42   8E-46  250.2  15.1  149     1-149  1-149 (149)
  2 protein:vir:93617 Length: 148  100.0 1.7E-42   1E-45  249.6  15.5  148     1-149  1-148 (148)
  3 protein:vir:4347 Length: 164 # 100.0 4.1E-39 2.6E-42  231.0  13.6  148     1-149  1-156 (164)
  4 protein:vir:1891 Length: 179 # 100.0 1.2E-38 7.7E-42  228.4  13.8  149     1-149  1-171 (179)
  5 protein:vir:102875 Length: 146 100.0 2.5E-36 1.6E-39  215.7  14.6  143     1-147  1-146 (146)
  6 protein:vir:107568 Length: 146 100.0 2.5E-36 1.6E-39  215.7  14.6  143     1-147  1-146 (146)
  7 protein:vir:102085 Length: 146 100.0 2.5E-36 1.6E-39  215.7  14.6  143     1-147  1-146 (146)
  8 protein:vir:105007 Length: 146 100.0 2.5E-36 1.6E-39  215.7  14.6  143     1-147  1-146 (146)
  9 protein:vir:100075 Length: 140 100.0 4.2E-36 2.6E-39  214.5  14.1  138     1-149  1-138 (140)
 10 protein:vir:80362 Length: 140  100.0 6.1E-36 3.8E-39  213.6  14.5  138     1-149  1-138 (140)
 11 protein:vir:1437 Length: 140 # 100.0 1.3E-35 7.8E-39  211.9  14.0  138     1-149  1-138 (140)
 12 protein:vir:1386 Length: 149 # 100.0 1.7E-35   1E-38  211.2  14.3  145     1-149  1-149 (149)
 13 protein:vir:100243 Length: 140 100.0 1.3E-35 8.2E-39  211.8  13.6  138     1-149  1-138 (140)
 14 protein:vir:5745 Length: 135 # 100.0 9.9E-35 6.1E-38  207.0  14.1  130     2-148  1-135 (135)
 15 protein:vir:105089 Length: 133 100.0 4.4E-34 2.7E-37  203.5  13.0  128     1-146  1-133 (133)
 16 protein:vir:3873 Length: 128 # 100.0 5.5E-33 3.4E-36  197.5  12.5  128     4-144  1-128 (128)
 17 protein:vir:1273 Length: 127 # 100.0 5.2E-32 3.2E-35  192.1  12.0  124     1-144  1-127 (127)
 18 protein:vir:94538 Length: 125   99.9 3.3E-31   2E-34  187.7  11.5  124     1-146  1-125 (125)
 19 protein:vir:101594 Length: 173  99.9 6.5E-31   4E-34  186.1  11.3  118     6-146  1-173 (173)
 20 protein:vir:97088 Length: 157   99.9 1.4E-29 8.9E-33  178.7  14.4  144     2-149  1-155 (157)
 21 protein:vir:9708 Length: 125 #  99.9 8.3E-30 5.1E-33  180.0  11.9  121     7-145  1-125 (125)
 22 protein:vir:95789 Length: 114   99.9 8.6E-30 5.4E-33  179.9  10.6  114     4-144  1-114 (114)
 23 protein:vir:3617 Length: 112 #  99.9 1.8E-29 1.1E-32  178.2   9.9  112     2-140  1-112 (112)
 24 protein:vir:79988 Length: 125   99.9 4.3E-28 2.7E-31  170.6  13.4  122     2-144  1-125 (125)
 25 protein:vir:4704 Length: 125 #  99.9 4.3E-28 2.7E-31  170.6  13.4  122     2-144  1-125 (125)
 26 protein:vir:98342 Length: 125   99.9 4.3E-28 2.7E-31  170.6  13.4  122     2-144  1-125 (125)
 27 protein:vir:9414 Length: 125 #  99.9 4.3E-28 2.7E-31  170.6  13.4  122     2-144  1-125 (125)
 28 protein:vir:81106 Length: 125   99.9 4.3E-28 2.7E-31  170.6  13.4  122     2-144  1-125 (125)
 29 protein:vir:9312 Length: 115 #  99.9 1.1E-28 6.7E-32  173.9   9.9  109     6-140  1-115 (115)
 30 protein:vir:97144 Length: 115   99.9 1.1E-28 6.7E-32  173.9   9.9  109     6-140  1-115 (115)
 31 protein:vir:78858 Length: 115   99.9 1.1E-28 6.7E-32  173.9   9.9  109     6-140  1-115 (115)
 32 protein:vir:96225 Length: 115   99.9 1.1E-28 6.7E-32  173.9   9.9  109     6-140  1-115 (115)
 33 protein:vir:96358 Length: 115   99.9 1.1E-28 6.7E-32  173.9   9.9  109     6-140  1-115 (115)
 34 protein:vir:103917 Length: 115  99.9 1.1E-28 6.7E-32  173.9   9.9  109     6-140  1-115 (115)
 35 protein:vir:106623 Length: 115  99.9 1.2E-28 7.2E-32  173.7  10.1  109     6-140  1-115 (115)
 36 protein:vir:99744 Length: 115   99.9   2E-28 1.2E-31  172.5  10.2  109     6-140  1-115 (115)
 37 protein:vir:9930 Length: 108 #  99.9 2.7E-28 1.7E-31  171.7   9.8  108     8-141  1-108 (108)
 38 protein:vir:743 Length: 108 #   99.9 6.9E-28 4.3E-31  169.5   9.8  108     6-140  1-108 (108)
 39 protein:vir:102154 Length: 119  99.9 1.3E-27   8E-31  168.0   8.9  118     1-144  1-119 (119)
 40 protein:vir:98409 Length: 108   99.9 1.7E-27 1.1E-30  167.3   9.4  108     6-140  1-108 (108)
 41 protein:vir:106570 Length: 182  99.9 2.4E-26 1.5E-29  161.0  12.6  147     1-149  1-182 (182)
 42 protein:vir:96486 Length: 112   99.9 1.2E-26 7.6E-30  162.7  10.0  111     1-139  1-112 (112)
 43 protein:vir:2740 Length: 114 #  99.9 9.6E-27 5.9E-30  163.2   8.7  113     1-141  1-114 (114)
 44 protein:vir:4906 Length: 114 #  99.9 9.6E-27 5.9E-30  163.2   8.7  113     1-141  1-114 (114)
 45 protein:vir:5978 Length: 144 #  99.8 1.2E-23 7.3E-27  146.3  10.5  115     1-140  3-144 (144)
 46 protein:vir:97427 Length: 137   99.8 1.1E-23 6.7E-27  146.5   9.9  108     1-136  1-137 (137)
 47 protein:vir:93738 Length: 137   99.8 1.1E-23 6.7E-27  146.5   9.9  108     1-136  1-137 (137)
 48 protein:vir:94490 Length: 137   99.8 1.1E-23 6.7E-27  146.5   9.9  108     1-136  1-137 (137)
 49 protein:vir:4956 Length: 153 #  99.8 2.9E-23 1.8E-26  144.1  12.2  136     1-149  1-140 (153)
 50 protein:vir:107099 Length: 137  99.8 1.3E-23 8.3E-27  146.0   9.7  108     1-136  1-137 (137)
 51 protein:vir:95894 Length: 137   99.8 1.8E-23 1.1E-26  145.3   9.7  108     1-136  1-137 (137)
 52 protein:vir:94108 Length: 149   99.8 1.3E-23 8.1E-27  146.1   8.8  108     1-136  13-149 (149)
 53 protein:vir:94796 Length: 137   99.8 1.9E-23 1.2E-26  145.2   9.3  108     1-136  1-137 (137)
 54 protein:vir:100887 Length: 139  99.8 7.5E-23 4.7E-26  141.9  12.2  136     4-149  1-136 (139)
 55 protein:vir:105916 Length: 149  99.8 1.8E-23 1.1E-26  145.3   8.6  108     1-136  13-149 (149)
 56 protein:vir:105330 Length: 137  99.8 3.7E-23 2.3E-26  143.5   9.6  108     1-136  1-137 (137)
 57 protein:vir:96121 Length: 137   99.8 1.2E-22 7.4E-26  140.8   9.4  108     1-136  1-137 (137)
 58 protein:vir:96829 Length: 135   99.8 1.3E-22 7.9E-26  140.6   9.1  108     1-136  1-135 (135)
 59 protein:vir:5000 Length: 141 #  99.8   1E-21 6.4E-25  135.7  12.1  136     1-149  1-140 (141)
 60 protein:vir:4859 Length: 140 #  99.8 1.4E-21 8.5E-25  135.0  12.3  137     1-149  1-140 (140)
 61 protein:vir:100223 Length: 139  99.8 1.2E-21 7.3E-25  135.3  11.9  136     4-149  1-136 (139)
 62 protein:vir:79034 Length: 141   99.8 1.7E-21 1.1E-24  134.4  11.0  134     1-149  1-137 (141)
 63 protein:vir:94654 Length: 142   99.8 1.6E-21   1E-24  134.5  10.3  115     1-140  1-142 (142)
 64 protein:vir:4833 Length: 140 #  99.7 7.6E-21 4.7E-24  130.9  12.3  137     1-149  1-140 (140)
 65 protein:vir:8669 Length: 142 #  99.7 2.3E-21 1.4E-24  133.8   8.4  113     1-137  1-142 (142)
 66 protein:vir:99101 Length: 142   99.7 2.3E-21 1.4E-24  133.8   8.4  113     1-137  1-142 (142)
 67 protein:vir:81147 Length: 126   99.7 1.2E-19 7.2E-23  124.4  11.1  120     1-143  1-126 (126)
 68 protein:vir:105467 Length: 144  99.6 1.4E-17 8.7E-21  113.0  11.5  124     1-149  1-142 (144)
 69 protein:vir:1243 Length: 116 #  99.6 2.9E-18 1.8E-21  116.7   6.8   87    26-136  1-116 (116)
 70 protein:vir:97327 Length: 116   99.6 2.9E-18 1.8E-21  116.7   6.8   87    26-136  1-116 (116)
 71 protein:vir:95062 Length: 116   99.6 2.8E-18 1.8E-21  116.8   6.7   87    26-136  1-116 (116)
 72 protein:vir:78077 Length: 141   99.6   1E-17 6.2E-21  113.8   9.5  115     1-144  1-141 (141)
 73 protein:vir:3848 Length: 159 #  99.6 3.3E-17   2E-20  110.9  12.1  146     1-149  1-159 (159)
 74 protein:vir:99528 Length: 92 #  99.5 2.3E-17 1.5E-20  111.8   7.7   92     1-116  1-92 (92)
 75 protein:vir:100652 Length: 134  99.5 1.7E-16 1.1E-19  107.0  10.0  121     4-142  1-134 (134)
 76 protein:vir:81067 Length: 119   99.5 3.9E-17 2.4E-20  110.6   5.4   92    41-149  1-117 (119)
 77 protein:vir:10367 Length: 119   99.5 4.7E-17 2.9E-20  110.1   5.4   92    41-149  1-117 (119)
 78 protein:vir:9513 Length: 134 #  99.4 5.9E-16 3.6E-19  104.1   9.8  121     4-142  1-134 (134)
 79 protein:vir:101302 Length: 134  99.4 5.9E-16 3.6E-19  104.1   9.8  121     4-142  1-134 (134)
 80 protein:vir:106041 Length: 137  99.4 4.1E-16 2.5E-19  105.0   6.9  104     2-134  1-137 (137)
 81 protein:vir:966 Length: 123 #   99.4 5.7E-15 3.5E-18   98.7  11.7  117     2-141  1-123 (123)
 82 protein:vir:9879 Length: 127 #  99.4 2.7E-15 1.6E-18  100.5   8.4  109     8-141  1-127 (127)
 83 protein:vir:102441 Length: 137  99.3 3.6E-15 2.2E-18   99.8   6.6  107     2-135  1-137 (137)
 84 protein:vir:107545 Length: 140  99.3 6.6E-15 4.1E-18   98.3   5.9  108     1-134  1-140 (140)
 85 protein:vir:97982 Length: 140   99.3 6.6E-15 4.1E-18   98.3   5.9  108     1-134  1-140 (140)
 86 protein:vir:9647 Length: 132 #  99.2 7.7E-14 4.8E-17   92.5  10.8  125     2-145  1-132 (132)
 87 protein:vir:6216 Length: 125 #  99.2 6.6E-14 4.1E-17   92.8   6.5  119     2-143  1-125 (125)
 88 protein:vir:102963 Length: 163  99.1 5.5E-13 3.4E-16   87.8  10.8  144     2-149  1-160 (163)
 89 protein:vir:6246 Length: 143 #  99.1 6.2E-13 3.9E-16   87.5  10.0  126     1-149  1-143 (143)
 90 protein:vir:1332 Length: 143 #  99.1 6.3E-13 3.9E-16   87.5   9.9  137     1-149  1-143 (143)
 91 protein:vir:98636 Length: 138   99.0 3.1E-12 1.9E-15   83.7  10.5  126     1-145  6-138 (138)
 92 protein:vir:9363 Length: 133 #  98.9 1.2E-11 7.7E-15   80.4  10.2  121     4-141  1-133 (133)
 93 protein:vir:78644 Length: 133   98.9 1.2E-11 7.7E-15   80.4  10.2  121     4-141  1-133 (133)
 94 protein:vir:94419 Length: 133   98.9 1.2E-11 7.7E-15   80.4  10.2  121     4-141  1-133 (133)
 95 protein:vir:96973 Length: 133   98.9 1.2E-11 7.7E-15   80.4  10.2  121     4-141  1-133 (133)
 96 protein:vir:106506 Length: 137  98.9 1.8E-12 1.1E-15   84.9   5.6  108     1-140  1-137 (137)
 97 protein:vir:95372 Length: 124   98.9 1.9E-11 1.1E-14   79.4  10.2  114     1-141  1-124 (124)
 98 protein:vir:78335 Length: 133   98.9 3.4E-11 2.1E-14   78.0  10.6  121     4-143  1-133 (133)
 99 protein:vir:93898 Length: 133   98.8 8.1E-11   5E-14   75.9  10.1  121     4-141  1-133 (133)
100 protein:vir:80116 Length: 127   98.8 5.8E-11 3.6E-14   76.7   8.9  117     1-144  1-127 (127)
101 protein:vir:104347 Length: 145  98.6 9.2E-11 5.7E-14   75.6   5.8  132     1-143  1-145 (145)
102 protein:vir:79638 Length: 146   98.6 3.7E-10 2.3E-13   72.3   7.6  135     1-147  1-146 (146)
103 protein:vir:7412 Length: 168 #  98.6 8.3E-10 5.2E-13   70.4   9.5  137     1-149  1-165 (168)
104 protein:vir:1087 Length: 161 #  98.6 9.2E-10 5.7E-13   70.1   9.3  138     1-149  1-161 (161)
105 protein:vir:103280 Length: 142  98.5   4E-10 2.5E-13   72.1   6.7  134     1-143  1-142 (142)
106 protein:vir:107703 Length: 147  98.5 9.3E-10 5.8E-13   70.1   8.1  139     1-147  1-147 (147)
107 protein:vir:99833 Length: 190   98.5   1E-09 6.4E-13   69.8   7.2  132     1-147  1-190 (190)
108 protein:vir:1028 Length: 168 #  98.5 2.3E-09 1.4E-12   67.9   9.0  135     1-149  1-165 (168)
109 protein:vir:94944 Length: 121   98.4 7.4E-10 4.6E-13   70.6   4.3  113     1-128  1-121 (121)
110 protein:vir:94994 Length: 131   98.3 2.8E-09 1.7E-12   67.5   6.1  124     2-140  1-131 (131)
111 protein:vir:79091 Length: 175   98.3 5.5E-09 3.4E-12   65.8   7.6  136     1-149  1-174 (175)
112 protein:vir:78380 Length: 131   98.3 3.4E-09 2.1E-12   67.0   6.2  126     2-140  1-131 (131)
113 protein:vir:101563 Length: 155  98.3 1.6E-09 9.7E-13   68.9   3.9  103     4-149  1-105 (155)
114 protein:vir:3994 Length: 168 #  98.3 1.3E-08 7.8E-12   63.9   8.8  135     1-149  1-165 (168)
115 protein:vir:1988 Length: 156 #  98.2 3.6E-09 2.2E-12   66.9   5.7  130     2-145  1-156 (156)
116 protein:vir:96012 Length: 133   98.2 2.9E-08 1.8E-11   61.9  10.3  120     1-143  1-133 (133)
117 protein:vir:95157 Length: 144   98.2   4E-09 2.5E-12   66.6   5.0  130     1-144  1-144 (144)
118 protein:vir:5257 Length: 148 #  98.2   2E-09 1.3E-12   68.2   3.4   94     2-149  1-98 (148)
119 protein:vir:3163 Length: 145 #  98.2 1.3E-08 7.9E-12   63.9   6.9  131     1-149  1-145 (145)
120 protein:vir:77650 Length: 155   98.1 2.6E-09 1.6E-12   67.7   2.4  103     4-149  1-105 (155)
121 protein:vir:103841 Length: 155  98.1   3E-08 1.9E-11   61.8   7.3  133     1-147  1-155 (155)
122 protein:vir:4096 Length: 140 #  98.0   8E-08 4.9E-11   59.5   9.2  132     1-149  1-139 (140)
123 protein:vir:80425 Length: 134   98.0 2.5E-08 1.5E-11   62.3   5.0  126     2-149  1-134 (134)
124 protein:vir:97190 Length: 148   97.9   2E-08 1.3E-11   62.7   4.3  133     1-149  1-148 (148)
125 protein:vir:79225 Length: 155   97.9 8.7E-08 5.4E-11   59.3   7.6  133     1-147  1-155 (155)
126 protein:vir:107851 Length: 175  97.9 1.4E-07 8.4E-11   58.2   7.9  137     1-149  1-174 (175)
127 protein:vir:99196 Length: 155   97.9 1.5E-07 9.3E-11   58.0   8.0  129     1-143  1-155 (155)
128 protein:vir:106728 Length: 155  97.9 1.2E-08 7.7E-12   63.9   2.0  103     4-149  1-105 (155)
129 protein:vir:78607 Length: 155   97.9 1.2E-08 7.6E-12   63.9   1.9  103     4-149  1-105 (155)
130 protein:vir:96774 Length: 152   97.8 6.3E-08 3.9E-11   60.0   5.1  126     1-142  10-152 (152)
131 protein:vir:107757 Length: 189  97.7 1.4E-08   9E-12   63.6   0.4   92    21-149  1-100 (189)
132 protein:vir:96105 Length: 193   97.7 1.2E-08 7.4E-12   64.0  -0.3  126     2-149  1-143 (193)
133 protein:vir:102338 Length: 116  97.7 3.2E-07   2E-10   56.2   7.3   94    26-144  1-116 (116)
134 protein:vir:95260 Length: 160   97.7 1.1E-07 6.6E-11   58.8   4.7   91     1-149  1-102 (160)
135 protein:vir:94069 Length: 168   97.4 5.7E-08 3.5E-11   60.3  -0.1  104    26-149  1-108 (168)
136 protein:vir:2688 Length: 123 #  97.4 2.5E-06 1.6E-09   51.3   8.8  111    14-141  1-123 (123)
137 protein:vir:7449 Length: 123 #  97.4 4.6E-06 2.8E-09   49.9  10.0  121     1-147  1-123 (123)
138 protein:vir:99546 Length: 200   97.3 1.7E-06 1.1E-09   52.2   6.3  120     1-149  4-150 (200)
139 protein:vir:97088 Length: 157   97.2 6.8E-06 4.2E-09   48.9   8.8  130     6-149  1-151 (157)
140 protein:vir:105773 Length: 131  97.0 6.2E-06 3.8E-09   49.1   7.3  114     6-141  1-131 (131)
141 protein:vir:6071 Length: 150 #  97.0 6.5E-06   4E-09   49.0   7.4  126     8-141  1-150 (150)
142 protein:vir:80037 Length: 199   97.0 6.4E-07   4E-10   54.6   1.8  135     2-149  1-146 (199)
143 protein:vir:5703 Length: 150 #  97.0 7.9E-06 4.9E-09   48.6   7.4  131     8-141  1-150 (150)
144 protein:vir:80970 Length: 112   96.9 8.1E-06 5.1E-09   48.5   7.1  104     2-143  1-112 (112)
145 protein:vir:98557 Length: 149   96.9 6.3E-06 3.9E-09   49.1   6.4  125     8-141  1-149 (149)
146 protein:vir:101508 Length: 120  96.9 3.8E-05 2.3E-08   44.8  10.4  114     1-149  1-118 (120)
147 protein:vir:79115 Length: 148   96.8 9.4E-06 5.8E-09   48.1   6.7  125     8-141  1-148 (148)
148 protein:vir:1838 Length: 149 #  96.8 9.6E-06 5.9E-09   48.1   6.4  130     8-141  1-149 (149)
149 protein:vir:2026 Length: 150 #  96.7 1.5E-05   9E-09   47.1   7.1  131     8-141  1-150 (150)
150 protein:vir:79179 Length: 155   96.7 1.7E-05 1.1E-08   46.7   7.3  131     1-141  1-155 (155)
151 protein:vir:98892 Length: 108   96.6 1.7E-05 1.1E-08   46.7   6.8  103     1-141  1-108 (108)
152 protein:vir:45 Length: 112 # N  96.6   2E-05 1.3E-08   46.3   6.9  104     2-143  1-112 (112)
153 protein:vir:96288 Length: 100   96.5   6E-06 3.7E-09   49.2   3.7   88     1-135  13-100 (100)
154 protein:vir:100312 Length: 152  96.4 4.2E-05 2.6E-08   44.6   7.6  133     1-142  1-152 (152)
155 protein:vir:396 Length: 184 #   96.3 0.00019 1.2E-07   41.0  10.8  139     6-149  1-184 (184)
156 protein:vir:3427 Length: 192 #  95.3 0.00065 4.1E-07   38.0   9.7  138     6-149  1-184 (192)
157 protein:vir:7993 Length: 108 #  95.2 9.7E-06   6E-09   48.1  -0.6   90     1-126  1-108 (108)
158 protein:vir:96763 Length: 177   94.7  0.0026 1.6E-06   34.7  11.5  142     1-149  4-175 (177)
159 protein:vir:4790 Length: 114 #  94.7 0.00033 2.1E-07   39.7   6.5  104     2-149  1-114 (114)
160 protein:vir:9823 Length: 118 #  93.9 0.00044 2.8E-07   39.0   5.6  101     1-149  1-117 (118)
161 protein:vir:3036 Length: 118 #  93.9 0.00044 2.8E-07   39.0   5.6  101     1-149  1-117 (118)
162 protein:vir:1581 Length: 116 #  93.3 0.00078 4.8E-07   37.6   6.0  105     2-140  1-116 (116)
163 protein:vir:1164 Length: 156 #  92.8  0.0014 8.8E-07   36.2   6.6  134     1-145  1-156 (156)
164 protein:vir:79687 Length: 113   92.2  0.0011   7E-07   36.7   5.3  103    11-147  1-113 (113)
165 protein:vir:6375 Length: 205 #  91.9   0.013 8.1E-06   30.9  10.8  143     2-149  1-202 (205)
166 protein:vir:8106 Length: 150 #  91.4 0.00034 2.1E-07   39.6   1.5  117     1-149  5-144 (150)
167 protein:vir:79555 Length: 192   91.3   0.011 6.6E-06   31.4   9.6  132     8-149  1-184 (192)
168 protein:vir:102190 Length: 93   86.0   0.011 6.6E-06   31.4   5.8   91    30-144  1-93 (93)
169 protein:vir:102608 Length: 108  73.4  0.0081   5E-06   32.0   0.8   90     1-126  11-108 (108)
170 protein:vir:105825 Length: 108  73.4  0.0081   5E-06   32.0   0.8   90     1-126  11-108 (108)
171 protein:vir:4200 Length: 133 #  66.3   0.094 5.8E-05   26.2   5.0  128     1-141  1-133 (133)
172 protein:vir:78894 Length: 105   53.2   0.067 4.1E-05   27.0   1.7  101     2-141  1-105 (105)
173 protein:vir:4162 Length: 133 #  49.1    0.31 0.00019   23.4   4.7  128     1-141  1-133 (133)
174 protein:vir:4460 Length: 170 #  48.0    0.46 0.00029   22.4   5.5  131     1-149  1-168 (170)
175 protein:vir:487 Length: 187 #   44.5    0.42 0.00026   22.6   4.7  133     1-149  14-185 (187)
176 protein:vir:101654 Length: 126  40.9    0.15   9E-05   25.2   1.6  118     8-134  1-126 (126)
177 protein:vir:7859 Length: 126 #  40.9    0.15   9E-05   25.2   1.6  118     8-134  1-126 (126)
178 protein:vir:79034 Length: 141   31.4     1.5 0.00093   19.6   8.1  126     1-149  9-141 (141)
179 protein:vir:5745 Length: 135 #  28.3     1.8  0.0011   19.2  10.3  120     6-149  1-132 (135)
180 protein:vir:3787 Length: 231 #  25.7     1.5 0.00092   19.6   4.5  139     2-148  1-231 (231)
181 protein:vir:105007 Length: 146  21.8     2.5  0.0016   18.4  10.6  129     1-143  5-146 (146)
182 protein:vir:102085 Length: 146  21.8     2.5  0.0016   18.4  10.6  129     1-143  5-146 (146)
183 protein:vir:102875 Length: 146  21.8     2.5  0.0016   18.4  10.6  129     1-143  5-146 (146)
184 protein:vir:107568 Length: 146  21.8     2.5  0.0016   18.4  10.6  129     1-143  5-146 (146)

No 1
>protein:vir:194 Length: 149 # NCBI annotation: Gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037704;genbank:gi:9634169;genbank:GeneID:1262536
Probab=100.00 E-value=1.3e-42 Score=250.24 Aligned_cols=149 Identities=92% Similarity=1.286 Sum_probs=141.6

Q ss_pred             CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeee
Q lcl|NC_019769.    1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR   80 (149)
Q Consensus         1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~   80 (149)
                      ||+++|+|+|||+|++.|++|++++.+++++.||.++|++|+++|+++||+++|.+++||..+.......+++...+.+.
T Consensus         1 mm~~~~~i~Gl~~l~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~si~~~~~~~~~~~~~~~~v~~~   80 (149)
T protein:vir:19    1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIDRAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR   80 (149)
T ss_pred             CcceeeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCchhhhhhccccccccccccceeeccccc
Confidence            99999999999999999999999988889999999999999999999999999999999999988888899999999888

Q ss_pred             cccccccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHHHHHhcC
Q lcl|NC_019769.
81 GVNPRTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAIDEALSK 149 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~k~~~k 149 (149) +.....+...........+++|||+|+||||++|||||||+||+++++++++++|.++|+++|+|+++| T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PF~~pA~~~~k~~~~~~~~~~l~~~l~k~~~k 149 (149) T protein:vir:19 81 GVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAIDEVLSK 149 (149) T ss_pred ccccccccccceeecCCCCccceeeeeccCCCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 888777777777777788899999999999999999999999999999999999999999999999999 No 2 >protein:vir:93617 Length: 148 # NCBI annotation: putative structural component # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449299;genbank:gi:157166047;interpro:IPR010064;interpro:IPR011693;uniprot:Q6H9U2;genbank:GeneID:5580439 Probab=100.00 E-value=1.7e-42 Score=249.59 Aligned_cols=148 Identities=74% Similarity=1.153 Sum_probs=137.0 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) ||+++|+|+|||+|++.|++|++++.+++++.||++||++|+++|+.+||+++|.+.+||..... ....|++...+... T Consensus 1 mm~~~~~i~Gldel~~~l~~L~~~~~~~~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~-~~~~g~~~~~v~~~ 79 (148) T protein:vir:93 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSR-RSRDGGMESGVHIR 79 (148) T ss_pred CcceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcchhhhhceeccc-cccCCceeeeeeec Confidence 99999999999999999999999888899999999999999999999999999999999987654 44577788888888 Q ss_pred cccccccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019769. 
81 GVNPRTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAIDEALSK 149 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~k~~~k 149 (149) .+....+.....+..++..++|||||+||||++|||||||+||+++++++++++|.++++++|+++++| T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~pa~PFl~pA~~~~k~~~~~~~~~~~~~~i~k~~~k 148 (148) T protein:vir:93 80 GVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEVLRR 148 (148) T ss_pred ccccccccccceeecCCCCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHHHHHhcC Confidence 777777777777777888899999999999999999999999999999999999999999999999999 No 3 >protein:vir:4347 Length: 164 # NCBI annotation: Orf14 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061510;genbank:gi:9635606;genbank:GeneID:1262873 Probab=100.00 E-value=4.1e-39 Score=231.02 Aligned_cols=148 Identities=18% Similarity=0.323 Sum_probs=120.4 Q ss_pred Ccc-eeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc-----CCCcccccceec--cccccccCc Q lcl|NC_019769. 1 MIE-TSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPV-----RTGKLKKNVVVV--TQKSRRRGE 72 (149) Q Consensus 1 Mm~-~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~-----~~g~l~~~i~~~--~~~~~~~~~ 72 (149) |++ ++|+|+|||+|+++|++|+.++++++++.||.+||++|+++++.++|+ .+|++.++|... .......+. T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~~~~~~~~~~~ 80 (164) T protein:vir:43 1 MADTVEFSITGLDSLLGKLDSVTDDVKRRGGRAALRKAAMIVVQAAKQGAEKVDDPGTGRSISDNIALRWNGRLFKRTGD 80 (164) T ss_pred CCcceEEeeecHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccCCCccchhhhhhhhhcccCccccccc Confidence 887 899999999999999999999888999999999999999999999997 457888888663 333444454 Q ss_pred cccceeeecccccccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019769. 
73 ISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAIDEALSK 149 (149) Q Consensus 73 ~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~k~~~k 149 (149) +...+.+..+.... ...........+++|||||+||||++|||||||+|||++++++++++|.++|+++|+++++| T Consensus 81 ~~~~vg~~~~~~~~-~~~~~~~~~~~~~~~y~~f~EfGT~km~a~PFlrPA~~~~k~~~~~~~~~~l~~~i~ka~~k 156 (164) T protein:vir:43 81 LGFRIGVLHGAVLP-KKGERSDKTANAPTPHWRLLEFGTEDMRAQPFMRSALADNIAEVTSTFVSEYEKGIDRAIKR 156 (164) T ss_pred eeEEeccccccccc-ccccccccCCCCCcceEEEeecCCCCCCCCcchhhhHHHhHHHHHHHHHHHHHHHHHHHHHH Confidence 44444433222111 12222334566789999999999999999999999999999999999999999999999999 No 4 >protein:vir:1891 Length: 179 # NCBI annotation: gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037671;genbank:gi:9634129;genbank:GeneID:1262520 Probab=100.00 E-value=1.2e-38 Score=228.39 Aligned_cols=149 Identities=28% Similarity=0.476 Sum_probs=117.9 Q ss_pred Ccc-eeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcC-----CCcccccceecccc--ccccCc Q lcl|NC_019769. 1 MIE-TSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVR-----TGKLKKNVVVVTQK--SRRRGE 72 (149) Q Consensus 1 Mm~-~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~-----~g~l~~~i~~~~~~--~~~~~~ 72 (149) |++ |+|+|+||+||+++|++|++++++++++.||.+||++|+++|+.+||+. +|.+.+||...... +...+. T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~~~~~~~~~g~ 80 (179) T protein:vir:18 1 MADSVEVSLTGLESLLGKMEAVSEVTRNKAGRFALRKAANIIRDRARSNASRVDDPLTKEAIHKNIVASFSSKQFRRTGD 80 (179) T ss_pred CCceEEEEeecHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccccchhhhhhheeecccccccccccc Confidence 997 8999999999999999999998889999999999999999999999764 56777888665332 334444 Q ss_pred cccceeeecccccc---------ccc-----ceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHH Q lcl|NC_019769. 
73 ISSGVHIRGVNPRT---------GNS-----DNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRR 138 (149) Q Consensus 73 ~~~~i~~~~~~~~~---------~~~-----~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~ 138 (149) +...+.+....... +.. ..........++|||||+||||++|||||||+|||++++++++++|.++ T Consensus 81 ~~~~vgv~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~y~~fvEfGT~kmpa~PFlrPA~~~~~~~a~~~i~~~ 160 (179) T protein:vir:18 81 LAFRVGVMGGARQYANTKANVRKGRAGKTYKTSGDKGNPGGDTWYWRFLEFGTEHTSARPILRPAMNGVDNDVINVFSTE 160 (179) T ss_pred eeEeeecccccccccccccccccCcccccccccccccCCCCccceeEEeccCCCCCCCCccchhhHHhhHHHHHHHHHHH Confidence 44443332221110 000 1112234556899999999999999999999999999999999999999 Q ss_pred HHHHHHHHhcC Q lcl|NC_019769. 139 MNQAIDEALSK 149 (149) Q Consensus 139 ~~~~l~k~~~k 149 (149) |+++|+|+++| T Consensus 161 l~~~i~k~lk~ 171 (179) T protein:vir:18 161 MGKAIDRAIRL 171 (179) T ss_pred HHHHHHHHHHh Confidence 99999999999 No 5 >protein:vir:102875 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338140;genbank:gi:77020200;genbank:GeneID:3703784 Probab=100.00 E-value=2.5e-36 Score=215.73 Aligned_cols=143 Identities=22% Similarity=0.387 Sum_probs=121.5 Q ss_pred Ccc-eeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceee Q lcl|NC_019769. 1 MIE-TSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 1 Mm~-~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~ 79 (149) |++ |+|+|+||++|+++|++|+.++ ++++++||++||++|+++++.++|+++|.+++++.. 
.....++..+.|.+ T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~-~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~---~~~~~~~~~~~i~~ 76 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRG-EKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSE---PWRTGQHGADQIKV 76 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHH-HHHHHHHHHHHHHHHHHHHHHhCCCccccccccccc---ccccccccccccee Confidence 887 7999999999999999999874 689999999999999999999999999999988643 33445677777776 Q ss_pred ecccccccccceeEec--CCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019769. 80 RGVNPRTGNSDNTMKA--NNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAIDEAL 147 (149) Q Consensus 80 ~~~~~~~~~~~~~~~~--~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~k~~ 147 (149) ...+...+.....+.. ...+++|||+|+||||++|||+|||+||+++++++++++|.++|+++|+++| T Consensus 77 ~~~~~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pa~~~~k~~~~~~~~~~l~~~l~ka~ 146 (146) T protein:vir:10 77 TKAKLEGGIKTVKIGLNKADRSPWFYLKFHEWGTSKMPAHPFIEPGFNASKAEAVRAMTDILKNEMRLDL 146 (146) T ss_pred ccccccccceeEEeeeccCCCCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHHhhcC Confidence 6665554443332222 2346789999999999999999999999999999999999999999999999 No 6 >protein:vir:107568 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338191;genbank:gi:77020147;genbank:GeneID:3703699 Probab=100.00 E-value=2.5e-36 Score=215.73 Aligned_cols=143 Identities=22% Similarity=0.387 Sum_probs=121.5 Q ss_pred Ccc-eeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceee Q lcl|NC_019769. 1 MIE-TSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 1 Mm~-~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~ 79 (149) |++ |+|+|+||++|+++|++|+.++ ++++++||++||++|+++++.++|+++|.+++++.. 
.....++..+.|.+ T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~-~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~---~~~~~~~~~~~i~~ 76 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRG-EKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSE---PWRTGQHGADQIKV 76 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHH-HHHHHHHHHHHHHHHHHHHHHhCCCccccccccccc---ccccccccccccee Confidence 887 7999999999999999999874 689999999999999999999999999999988643 33445677777776 Q ss_pred ecccccccccceeEec--CCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019769. 80 RGVNPRTGNSDNTMKA--NNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAIDEAL 147 (149) Q Consensus 80 ~~~~~~~~~~~~~~~~--~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~k~~ 147 (149) ...+...+.....+.. ...+++|||+|+||||++|||+|||+||+++++++++++|.++|+++|+++| T Consensus 77 ~~~~~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pa~~~~k~~~~~~~~~~l~~~l~ka~ 146 (146) T protein:vir:10 77 TKAKLEGGIKTVKIGLNKADRSPWFYLKFHEWGTSKMPAHPFIEPGFNASKAEAVRAMTDILKNEMRLDL 146 (146) T ss_pred ccccccccceeEEeeeccCCCCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHHhhcC Confidence 6665554443332222 2346789999999999999999999999999999999999999999999999 No 7 >protein:vir:102085 Length: 146 # NCBI annotation: head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512318;genbank:gi:89152487;genbank:GeneID:3953078 Probab=100.00 E-value=2.5e-36 Score=215.73 Aligned_cols=143 Identities=22% Similarity=0.387 Sum_probs=121.5 Q ss_pred Ccc-eeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceee Q lcl|NC_019769. 1 MIE-TSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 1 Mm~-~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~ 79 (149) |++ |+|+|+||++|+++|++|+.++ ++++++||++||++|+++++.++|+++|.+++++.. 
.....++..+.|.+ T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~-~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~---~~~~~~~~~~~i~~ 76 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRG-EKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSE---PWRTGQHGADQIKV 76 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHH-HHHHHHHHHHHHHHHHHHHHHhCCCccccccccccc---ccccccccccccee Confidence 887 7999999999999999999874 689999999999999999999999999999988643 33445677777776 Q ss_pred ecccccccccceeEec--CCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019769. 80 RGVNPRTGNSDNTMKA--NNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAIDEAL 147 (149) Q Consensus 80 ~~~~~~~~~~~~~~~~--~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~k~~ 147 (149) ...+...+.....+.. ...+++|||+|+||||++|||+|||+||+++++++++++|.++|+++|+++| T Consensus 77 ~~~~~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pa~~~~k~~~~~~~~~~l~~~l~ka~ 146 (146) T protein:vir:10 77 TKAKLEGGIKTVKIGLNKADRSPWFYLKFHEWGTSKMPAHPFIEPGFNASKAEAVRAMTDILKNEMRLDL 146 (146) T ss_pred ccccccccceeEEeeeccCCCCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHHhhcC Confidence 6665554443332222 2346789999999999999999999999999999999999999999999999 No 8 >protein:vir:105007 Length: 146 # NCBI annotation: conserved phage protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459972;genbank:gi:85701387;genbank:GeneID:3882148 Probab=100.00 E-value=2.5e-36 Score=215.73 Aligned_cols=143 Identities=22% Similarity=0.387 Sum_probs=121.5 Q ss_pred Ccc-eeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceee Q lcl|NC_019769. 1 MIE-TSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 1 Mm~-~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~ 79 (149) |++ |+|+|+||++|+++|++|+.++ ++++++||++||++|+++++.++|+++|.+++++.. 
.....++..+.|.+ T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~-~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~---~~~~~~~~~~~i~~ 76 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRG-EKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSE---PWRTGQHGADQIKV 76 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHH-HHHHHHHHHHHHHHHHHHHHHhCCCccccccccccc---ccccccccccccee Confidence 887 7999999999999999999874 689999999999999999999999999999988643 33445677777776 Q ss_pred ecccccccccceeEec--CCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019769. 80 RGVNPRTGNSDNTMKA--NNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAIDEAL 147 (149) Q Consensus 80 ~~~~~~~~~~~~~~~~--~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~k~~ 147 (149) ...+...+.....+.. ...+++|||+|+||||++|||+|||+||+++++++++++|.++|+++|+++| T Consensus 77 ~~~~~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pa~~~~k~~~~~~~~~~l~~~l~ka~ 146 (146) T protein:vir:10 77 TKAKLEGGIKTVKIGLNKADRSPWFYLKFHEWGTSKMPAHPFIEPGFNASKAEAVRAMTDILKNEMRLDL 146 (146) T ss_pred ccccccccceeEEeeeccCCCCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHHhhcC Confidence 6665554443332222 2346789999999999999999999999999999999999999999999999 No 9 >protein:vir:100075 Length: 140 # NCBI annotation: gp9 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945039;genbank:gi:38707899;genbank:GeneID:2744122 Probab=100.00 E-value=4.2e-36 Score=214.55 Aligned_cols=138 Identities=41% Similarity=0.690 Sum_probs=114.3 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) |.+ |+|+|||+|++.|++|+++++++++++||+++|++|+++++++||+++|+|++||..+..+....+.... +. 
T Consensus 1 Ma~--~~i~Gld~l~~~l~~L~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~tG~l~~sI~~~~~~~~~~~~~~~-~g-- 75 (140) T protein:vir:10 1 MSS--IQIIGLADLRADFEKLAKSQSTKALRRATVAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGLAT-AG-- 75 (140) T ss_pred Cce--eeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHHhccccccccccccceEE-ee-- Confidence 664 6789999999999999998888899999999999999999999999999999999876544433222211 11 Q ss_pred cccccccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019769. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAIDEALSK 149 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~k~~~k 149 (149) +.... .......+++|||+|+||||++|||||||+||+++++++++++|.+++.++|+|++++ T Consensus 76 -~~~~~-----~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~k~~~~ 138 (140) T protein:vir:10 76 -VRVRT-----KGKADSPNNAFYWRFDEFGTQHMKAQPFMRPAFDASIGEAEGAIRTELARAIDRVLGG 138 (140) T ss_pred -eeecc-----ccccCCCCccceeeeeccCCCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 11000 1123346789999999999999999999999999999999999999999999999999 No 10 >protein:vir:80362 Length: 140 # NCBI annotation: gp10, phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111089;genbank:gi:134288660;genbank:GeneID:4960609 Probab=100.00 E-value=6.1e-36 Score=213.63 Aligned_cols=138 Identities=40% Similarity=0.667 Sum_probs=114.4 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) |. +|+|+|||+|++.|++|+.++.++++++||+++|++|+++++.+||+++|+|++||..+.......+....... 
T Consensus 1 Ma--~~~i~Gld~l~~~l~~l~~~~~~k~~~~a~~~~a~~v~~~ak~~aP~~tG~l~~~i~~~~~~~~~~~~~~~~~~-- 76 (140) T protein:vir:80 1 MS--SIQIVGLADLLADFERLAKSQSTKALRRATVAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGLATAGV-- 76 (140) T ss_pred Cc--eeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhceeeeccccccccceeeeee-- Confidence 55 57789999999999999998888899999999999999999999999999999999876544433322211111 Q ss_pred cccccccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019769. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAIDEALSK 149 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~k~~~k 149 (149) .. +. .......+++|||+|+||||++|||||||+||+++++++++++|.++|.++|++++++ T Consensus 77 --~~--~~---~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~k~~~~ 138 (140) T protein:vir:80 77 --RV--RT---KGKADSPSNAFYWRFDEFGTQHMKAQPFMRPAFDASIGEAEGAIRTELARAIDQALGG 138 (140) T ss_pred --ec--cc---ccccCCCCCcceeeeeccCCCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 10 00 1112346789999999999999999999999999999999999999999999999999 No 11 >protein:vir:1437 Length: 140 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536366;genbank:gi:17975171;genbank:GeneID:929147 Probab=100.00 E-value=1.3e-35 Score=211.93 Aligned_cols=138 Identities=39% Similarity=0.684 Sum_probs=113.7 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) |.+ |+|+|||+|++.|++|+.++.++++++||.++|++++++++.++|+++|+|++||..+............ +.+. 
T Consensus 1 M~~--~~i~Gld~l~~~l~~l~~~~~~~~~~~al~~~a~~v~~~ak~~aP~~tG~l~~sI~~~~~~~~~~~~~~~-vg~~ 77 (140) T protein:vir:14 1 MSS--IQIIGLADLRADFEKLAKSQSAKALRRATLAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGLAT-AGVR 77 (140) T ss_pred Cce--eeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHhhcccccccccccceeEE-eeee Confidence 654 7788999999999999998888899999999999999999999999999999999876544332222111 1111 Q ss_pred cccccccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019769. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAIDEALSK 149 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~k~~~k 149 (149) .+. .......+++|||||+||||++|||||||+||+++++++++++|.+++.++|+|++++ T Consensus 78 -----~~~---~~~~~~~~~~~y~~f~E~GT~~~~a~pFl~pa~~~~~~~~~~~~~~~~~~~l~k~~~~ 138 (140) T protein:vir:14 78 -----VRT---KGKADSPNNAFYWRFDEFGTQHMKAQPFMRPAFDASIGEAEGAIRTELARAIDRVLGG 138 (140) T ss_pred -----ecc---ccccCCCCccceeeeeccccCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 000 0122345689999999999999999999999999999999999999999999999999 No 12 >protein:vir:1386 Length: 149 # NCBI annotation: Gp9 protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612838;genbank:gi:20065972;genbank:GeneID:935787 Probab=100.00 E-value=1.7e-35 Score=211.24 Aligned_cols=145 Identities=19% Similarity=0.306 Sum_probs=117.5 Q ss_pred Ccc-eeeehHhHHHHHHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCcccccee Q lcl|NC_019769. 1 MIE-TSLDFSGLNDIAKDLEALS-RAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVH 78 (149) Q Consensus 1 Mm~-~~~~i~Gl~~l~~~l~~l~-~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~ 78 (149) |++ ++|+|.||+||+++|++|+ +...++++++||++||++|+++++.++|++.+.... ....++..+++.++|. 
T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~~k~~~~Al~~ga~~v~~~~k~~aP~~~~~~~~----~~~~~~~~~~~~d~i~ 76 (149) T protein:vir:13 1 MSDGWEIKFEGLDDLIKTFEQLGTEKENEDVEKSILKECGDLAKKTVAPLIHISDDNSKS----GRKGSRPPGHAANNIP 76 (149) T ss_pred CCceeEEEeecHHHHHHHHHhcccHHHHHHHHHHHHHHHHHHHHHHHHHhCCccCCcccc----ccccccccchhhhcce Confidence 886 7999999999999999996 345688999999999999999999999997643321 1223445677888887 Q ss_pred eecccccccccceeEec--CCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019769. 79 IRGVNPRTGNSDNTMKA--NNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAIDEALSK 149 (149) Q Consensus 79 ~~~~~~~~~~~~~~~~~--~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~k~~~k 149 (149) +..+....+.....+.. ...+++|||||+||||++|||||||+||+++++++++++|.++|++.|++.|+- T Consensus 77 ~~~~~~~~g~~~~~VG~~~~~~~~~~y~~f~E~GT~k~~a~pF~~pa~~~~~~~~~~~~~~~l~k~i~~~lG~ 149 (149) T protein:vir:13 77 EPKIRKKKGNLQCVVGWEKSDNTPFYYMKMEEWGTSERPPHHAFGKTNKILKRVYDNIAQKKYDNFVKEKLGD 149 (149) T ss_pred ecccccccceeEEEeeccCCCCCccceeeeeccCccCCCCCccchHHHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 76555544443322211 123568999999999999999999999999999999999999999999999999 No 13 >protein:vir:100243 Length: 140 # NCBI annotation: gp72 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355408;genbank:gi:77864698;genbank:GeneID:3725965 Probab=100.00 E-value=1.3e-35 Score=211.79 Aligned_cols=138 Identities=45% Similarity=0.692 Sum_probs=113.7 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) |+ +|+|+|||+|++.|++|++++.++++++||++||++|+++++.+||+++|.|++||..+......... 
.+.+ T Consensus 1 Ma--~~~i~Gld~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~---~~~~- 74 (140) T protein:vir:10 1 MS--SVQILGLADLQADFLKLAKAQSTKALRRATVAGANVIRDEARARAPKKTGKLKRNIVTAALKQKDSPG---IATA- 74 (140) T ss_pred Cc--eeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHHhceecccccccccc---eeEE- Confidence 65 47788999999999999998878899999999999999999999999999999999876544332211 1111 Q ss_pred cccccccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019769. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAIDEALSK 149 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~k~~~k 149 (149) .+.... .......+++|||+|+||||++|||||||+||+++++++++++|.++++++|+|++++ T Consensus 75 ~~~~~~-----~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~k~~~~ 138 (140) T protein:vir:10 75 GVRVRT-----KGKADSPNNAFYWRFVELGTQFMKAEPFMRPAFDASIAQAEGAIRTEIARAIDQVVGG 138 (140) T ss_pred eecccc-----ccccCCCCcccccceeccCcCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 111110 0112345789999999999999999999999999999999999999999999999999 No 14 >protein:vir:5745 Length: 135 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892056;genbank:gi:33770519;interpro:IPR010064;interpro:IPR011693;uniprot:Q7Y404;genbank:GeneID:2637451 Probab=100.00 E-value=9.9e-35 Score=207.02 Aligned_cols=130 Identities=21% Similarity=0.371 Sum_probs=108.3 Q ss_pred cceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCC----CcccccceeccccccccCccccce Q lcl|NC_019769. 
2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRT----GKLKKNVVVVTQKSRRRGEISSGV 77 (149) Q Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~----g~l~~~i~~~~~~~~~~~~~~~~i 77 (149) |.++|+|.||++|++.|++|+.++.+++++.||++||++|+++++.++|+++ |+|++||.++..+ T Consensus 1 M~~~~~i~Gl~el~~~l~~L~~~~~~k~~~~Al~~~a~~v~~~~k~~ap~~~~~~~g~l~~~I~i~~~k----------- 69 (135) T protein:vir:57 1 MIPEIEISGLQELERRLIAVGEEVGTKILRDAGRAAMAVVEADMKQNAGYDNSSTNAHMRDSIKIRSSR----------- 69 (135) T ss_pred CceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhHHhhccccccc----------- Confidence 8999999999999999999999988889999999999999999999999975 6777777554322 Q ss_pred eeecccccccccceeEe-cCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019769. 78 HIRGVNPRTGNSDNTMK-ANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAIDEALS 148 (149) Q Consensus 78 ~~~~~~~~~~~~~~~~~-~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~k~~~ 148 (149) ...+.....+. +......||+||+||||++|||||||+|||++++++++++|.++|+++|+|+.. T Consensus 70 ------~~~~~~~v~v~vg~~~~~~~~~~f~E~GT~~~~a~PF~~pa~~~~~~~~~~~~~~~~~~~l~ka~r 135 (135) T protein:vir:57 70 ------GKAGSTVVVLRVGPTRSHYMKALAQEFGTIKQVAKPFIRPALDYNKMQVLRILTVEIRDGLSTLSR 135 (135) T ss_pred ------ccccceeEEEEecCCCCcceeEeecccCCCCCCCCcchhHhHHHhHHHHHHHHHHHHHHHHHHhcC Confidence 11122111111 223445688999999999999999999999999999999999999999999999 No 15 >protein:vir:105089 Length: 133 # NCBI annotation: Gp11 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006591;genbank:gi:46402097;genbank:GeneID:2777955 Probab=100.00 E-value=4.4e-34 Score=203.47 Aligned_cols=128 Identities=26% Similarity=0.392 Sum_probs=101.5 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcc----cccceeccccccccCccccc Q lcl|NC_019769. 
1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKL----KKNVVVVTQKSRRRGEISSG 76 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l----~~~i~~~~~~~~~~~~~~~~ 76 (149) ||+++ |+||++|+++|++|+.++.+++++.||.+||++|+++++.+||+++|.+ ++||.++..... T Consensus 1 M~~~~--i~Gl~el~~~l~~L~~~~~~k~~~~Al~~~a~~i~~~ak~~ap~~~~~~~~~~~~~I~v~~~~~~-------- 70 (133) T protein:vir:10 1 MIRME--VKGLDELERQLTALGEKVATKVLRDAGREALKVVEEDMKQHAGFDETSTGQHMRDSIKIRSSTRK-------- 70 (133) T ss_pred CeeEe--eehHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCcchhhhhhcccccccccc-------- Confidence 77655 6899999999999999888889999999999999999999999998874 444432211100 Q ss_pred eeeeccccccccccee-EecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 77 VHIRGVNPRTGNSDNT-MKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAIDEA 146 (149) Q Consensus 77 i~~~~~~~~~~~~~~~-~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~k~ 146 (149) ..+..... .++.....+|||+|+||||++|||||||+|||++++++++++|.++|+++|+|- T Consensus 71 --------~~~~~~~~v~vg~~~~~~~y~~f~E~GT~k~~a~PF~~pA~~~~~~~~~~~~~~~~~~~l~K~ 133 (133) T protein:vir:10 71 --------AQGNAVVTLRVGPSKQHHMKVLAQEFGTVKQVADPFIRPALDYNVQTVLRVLTVEIRNGIQNR 133 (133) T ss_pred --------cCccceEEEEecCCCCccceEeeeccCCCCCCCCccchHHHHHhHHHHHHHHHHHHHHHhhcC Confidence 00111111 122344567999999999999999999999999999999999999999999988 No 16 >protein:vir:3873 Length: 128 # NCBI annotation: putative head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680490;swissprot:trembl:p94214;genbank:gi:22296530;interpro:IPR010064;uniprot:P94214;genbank:GeneID:951688 Probab=99.96 E-value=5.5e-33 Score=197.46 Aligned_cols=128 Identities=15% Similarity=0.260 Sum_probs=106.5 Q ss_pred eeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeeeccc Q lcl|NC_019769. 
4 TSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVN 83 (149) Q Consensus 4 ~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~~~~ 83 (149) |+|+|+||+||+++|++|+.++ ++++++||++||++++++++.++|+++|.++. .+++.++|.+.... T Consensus 1 m~v~i~Gl~el~~~l~~l~~~~-~k~~~~al~~ga~~~~~~~k~~ap~~~~~~~~-----------~~h~~d~I~~~~~k 68 (128) T protein:vir:38 1 MGVKVTGDAELLANLNKLQFGV-AKEARAAVRDGAQKFADKLKSNTPEWDGETDM-----------SGHLRDDIKLSSVR 68 (128) T ss_pred CccchhhHHHHHHHHHHhHHHH-HHHHHHHHHHHHHHHHHHHHHhCCCcCCCCcc-----------cchhhhhhcccccc Confidence 8889999999999999999875 68999999999999999999999998875433 34566666554443 Q ss_pred ccccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 84 PRTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAID 144 (149) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~ 144 (149) ...+... ..++.+..++|||||+||||++|||+|||+||+++++++++++|.++|+++|- T Consensus 69 ~~~g~~~-~~VG~~k~~~~y~~f~E~GT~k~~a~pF~~pa~~~~~~~~~~~~~~~l~k~i~ 128 (128) T protein:vir:38 69 ETSGLTE-VDVGYGKDTGWRAHFPNSGTSMQDPQHFIEETQEIMRPVVIAAFLSHLKEGGM 128 (128) T ss_pred ccCceeE-EEeeecCCCceEEeeeccCccCCCCCcchhHHHHHhHHHHHHHHHHHHHhhcC Confidence 3333322 23344566789999999999999999999999999999999999999999998 No 17 >protein:vir:1273 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690765;genbank:gi:22855005;genbank:GeneID:955232 Probab=99.95 E-value=5.2e-32 Score=192.10 Aligned_cols=124 Identities=23% Similarity=0.370 Sum_probs=102.1 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcC---CCcccccceeccccccccCccccce Q lcl|NC_019769. 
1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVR---TGKLKKNVVVVTQKSRRRGEISSGV 77 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~---~g~l~~~i~~~~~~~~~~~~~~~~i 77 (149) |.+ |+|+||++|++.|++|+.++ +++++.||++||.+|+++++.++|++ +|++++||..+..+.... T Consensus 1 M~~--~~i~Gl~el~~~l~~l~~~~-~~~~~~al~~~a~~v~~~~k~~ap~~~~~tg~l~~~I~~~~~k~~~~------- 70 (127) T protein:vir:12 1 MAD--MSFDGIDDLTQYFEKIGGDI-EKVEPVALKAGGEIIAERQRSHVNRSDKKQPHMQDNITVSNVRESKD------- 70 (127) T ss_pred Cee--eeehhHHHHHHHHHHhhHHH-HHHHHHHHHHHHHHHHHHHHHhCCCCCCChhHHHHhhhccccccccC------- Confidence 665 67789999999999999876 67899999999999999999999975 688888886543221111 Q ss_pred eeecccccccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 78 HIRGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAID 144 (149) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~ 144 (149) +.....++...+++|||||+||||++|||||||+||+++++++++++|.+.|+++|+ T Consensus 71 ----------g~~~v~Vg~~~~~~~y~~f~E~GT~~~~a~Pf~~pa~~~~~~~~~~~~~~~~~~~lk 127 (127) T protein:vir:12 71 ----------GVRFVAVGPNKKVAYRGRFLEWGTSKMPPQPFIEKGGKEGEGPAVELMERILTAPIK 127 (127) T ss_pred ----------ceeEEEEeeCCCCcceeeeeccCccCCCCCccchHhHHHHHHHHHHHHHHHHHHhcC Confidence 111112334456799999999999999999999999999999999999999999999 No 18 >protein:vir:94538 Length: 125 # NCBI annotation: putative head to tail joining # Family: family:all:180 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223893;genbank:gi:62327105;genbank:GeneID:5075554 Probab=99.95 E-value=3.3e-31 Score=187.73 Aligned_cols=124 Identities=19% Similarity=0.263 Sum_probs=103.9 Q ss_pred Ccc-eeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceee Q lcl|NC_019769. 1 MIE-TSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 1 Mm~-~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~ 79 (149) |++ |+|+|+|||+|.+.|+++++++. 
+.++.|+..+++.+.++++.++|++||.|++||........ .+. T Consensus 1 Ma~~~~i~~~Gld~l~~~L~~~~~~~~-~~v~~al~~~a~~i~~~ak~~ap~~tG~L~~sI~~~~~~~~-~~~------- 71 (125) T protein:vir:94 1 MANDFNIKFKGVDKLLDEFDISRKELV-PYSVEAMKTSLSRAVEKSKGLARVDTGYMRNNIQQDEVKEE-HGV------- 71 (125) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHH-HHHHHHHHHHHHHHHHHHHhhCCCCChhhhhhceecceecc-CCc------- Confidence 777 79999999999999999999875 45688999999999999999999999999999976432111 111 Q ss_pred ecccccccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 80 RGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAIDEA 146 (149) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~k~ 146 (149) +.+....+.+||+|+||||++|||||||+||++++++.+++.|.++|++.|++. T Consensus 72 -------------~~~~v~~~~~Ya~~vEfGT~~~~a~Pfl~pa~~~~~~~~~~~l~~~l~~a~k~~ 125 (125) T protein:vir:94 72 -------------VTGRYVARADYSSYNEYGTYRMSAQPFMAPSVAAMTPFFYKAVRDALNKAAKFS 125 (125) T ss_pred -------------EEEEeeCCCCccceeecccccCCCCcccchhHHHHHHHHHHHHHHHHHHHhccC Confidence 111123457899999999999999999999999999999999999988888888 No 19 >protein:vir:101594 Length: 173 # NCBI annotation: hypothetical protein # Family: family:all:26502 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112510;genbank:gi:53793610;interpro:IPR010064;uniprot:Q5ZGE3;genbank:GeneID:3101702 Probab=99.94 E-value=6.5e-31 Score=186.09 Aligned_cols=118 Identities=18% Similarity=0.345 Sum_probs=102.2 Q ss_pred eehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeeeccccc Q lcl|NC_019769. 6 LDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVNPR 85 (149) Q Consensus 6 ~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~~~~~~ 85 (149) |+|+|||+|+++|++|++.+ +++++.|+.++|++|+++|+.+||++||+|++||..+.... . 
T Consensus 1 i~i~Gld~L~~~L~~l~~~~-~~~~~~a~~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~~~~--~--------------- 62 (173) T protein:vir:10 1 MAVKGVAEVIAELRKIGKDI-DKNINATTEEAANFIEDRAKTLAPKNFGKLAQSISTSDLKA--K--------------- 62 (173) T ss_pred CcchhHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhCCcCchhhhhcceeeeecc--C--------------- Confidence 99999999999999999876 67899999999999999999999999999999997642110 0 Q ss_pred ccccceeEecCCCCCcceeeeeccCcc----------------------------------------------------- Q lcl|NC_019769. 86 TGNSDNTMKANNPRNAFYWRFVEMGTA----------------------------------------------------- 112 (149) Q Consensus 86 ~~~~~~~~~~~~~~~~~y~~~~E~GT~----------------------------------------------------- 112 (149) ..+.+...++++||.|+||||+ T Consensus 63 -----~~~~~~v~~~~~Ya~fvEfGT~~m~a~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~ 137 (173) T protein:vir:10 63 -----DLISKKITVNELYGAYMEFGTGAKVSVPKEFADMAASFKGQKTGSFKDGLESIKAWCRAKGIDEKAAYPIFAKIL 137 (173) T ss_pred -----ceeEEeeCCCcccchhhhcccccccCCCchhhhhhcccccccccccccccccccccccccccchhcccceeeEee Confidence 0122334567899999999997 Q ss_pred --CCCCCcchhHhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 113 --NMTAHPFIRPAFDVRQEQATEVAIRRMNQAIDEA 146 (149) Q Consensus 113 --~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~k~ 146 (149) .|||||||+|||+++++++.++|++.|+++|+|+ T Consensus 138 ~~G~~aqPFl~PA~~~~~~~~~~~i~~~i~~~lrk~ 173 (173) T protein:vir:10 138 GAGINPQPFLYPAWIEGKKQYLKDLENLLKTYNKKI 173 (173) T ss_pred cCCCCCCccchhHHHHhHHHHHHHHHHHHHHHhhcC Confidence 3899999999999999999999999999999999 No 20 >protein:vir:97088 Length: 157 # NCBI annotation: hypothetical protein # Family: family:all:2714 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453568;genbank:gi:84662603;genbank:GeneID:5142503 Probab=99.94 E-value=1.4e-29 Score=178.72 Aligned_cols=144 Identities=20% Similarity=0.275 Sum_probs=109.9 Q ss_pred cceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceecccccccc-Cccccceeee Q lcl|NC_019769. 
2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRR-GEISSGVHIR 80 (149) Q Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~-~~~~~~i~~~ 80 (149) |+++|.-..|++|...|+.|++ .+++++++|+.+||++|+++|+.+||+++|.|++||.......... |.....| T Consensus 1 m~~~~~~~d~s~l~~~l~~l~~-~~~~v~R~A~~~ga~vv~dear~~aP~~tG~LkksI~~~~~~~~s~~g~~~~~V--- 76 (157) T protein:vir:97 1 MKFSIRSVDITGILAGLETVVE-HSSDVVRTMTYESAVAVRESAKAFVNDETGKLRNNLYVAYSPEESVEGIQTYAV--- 76 (157) T ss_pred CeeEeecccHHHHHHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhheeeeeccccCCCceEEEEE--- Confidence 7776655557799999999986 4678999999999999999999999999999999998865444332 3222222 Q ss_pred cccccccccceeE---------ecCCCCCcceeeeeccCccC-CCCCcchhHhHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019769. 81 GVNPRTGNSDNTM---------KANNPRNAFYWRFVEMGTAN-MTAHPFIRPAFDVRQEQATEVAIRRMNQAIDEALSK 149 (149) Q Consensus 81 ~~~~~~~~~~~~~---------~~~~~~~~~y~~~~E~GT~~-~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~k~~~k 149 (149) +++...+...+.+ ........|||+|+|+||.. |||||||||||++.++++.+.|.+++.++|+++++= T Consensus 77 g~~~~~a~~g~~vEfG~~~~~~~~~~~~~~~~~~~~~~~t~~~~Pa~PFlRPA~d~~k~~a~~~~~~~l~k~I~e~l~g 155 (157) T protein:vir:97 77 SWRKKAAPHGHLLEFGHWQTHAAYRDKDGQWYSSKVKLVNPKWIPAKPFLRPGYDSVAMQIPDIARAAGAKKYAELQRG 155 (157) T ss_pred eecCCccceeeeeecCcccccccccCCcccccccccccCCCCcCCCCcccchHHHHhHHHHHHHHHHHHHHHHHHHhcC Confidence 2222222211111 22234456888888888855 999999999999999999999999999999999998 No 21 >protein:vir:9708 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795470;genbank:gi:28876221;genbank:GeneID:1257765 Probab=99.93 E-value=8.3e-30 Score=180.02 Aligned_cols=121 Identities=14% Similarity=0.169 Sum_probs=98.7 Q ss_pred ehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCc----ccccceeccccccccCccccceeeecc Q lcl|NC_019769. 
7 DFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGK----LKKNVVVVTQKSRRRGEISSGVHIRGV 82 (149) Q Consensus 7 ~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~----l~~~i~~~~~~~~~~~~~~~~i~~~~~ 82 (149) =|+||+||+++|++|+.++ +++.++||++||++++++++.++|++++. +++||..+.. T Consensus 1 mv~Gl~el~~~l~~l~~~~-~~~~~~al~~ga~~~~~~~k~~ap~~~~~~~~hl~d~I~~~~~----------------- 62 (125) T protein:vir:97 1 MTKGLDEILANLTKLEVKA-PKTAKAAVTEVAKEFEKALKANTPVYEVETDERLQEDTVISGF----------------- 62 (125) T ss_pred CchhHHHHHHHHHHhhHHH-HHHHHHHHHHHHHHHHHHHHHhCCcCCCCchhhHHhhhhcccc----------------- Confidence 5799999999999999875 68899999999999999999999998875 5555544322 Q ss_pred cccccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 83 NPRTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAIDE 145 (149) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~k 145 (149) +....+.....++.+..++|||||+||||++|||||||+||+++++++++++|.++|+++|.= T Consensus 63 k~~~~g~~~~~VG~~k~~~~y~~f~E~GT~k~~~~pF~~pa~~~~k~~~~~~~~~~~~~~L~l 125 (125) T protein:vir:97 63 KGANVGIVSKEIGYGKATGWRAHYPNDGTIYQRGQDFKERTINQMTPKAKQLYAEKVKEGLGL 125 (125) T ss_pred cccccCceEEEEeecCCCceeEeeeccCccCCCcCccchHhHHHhHHHHHHHHHHHHHHHhcC Confidence 111111112233445667899999999999999999999999999999999999999999876 No 22 >protein:vir:95789 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950593;genbank:gi:119953788;genbank:GeneID:5076859 Probab=99.93 E-value=8.6e-30 Score=179.92 Aligned_cols=114 Identities=19% Similarity=0.228 Sum_probs=98.5 Q ss_pred eeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeeeccc Q lcl|NC_019769. 4 TSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVN 83 (149) Q Consensus 4 ~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~~~~ 83 (149) |+|+|+|||+|.+.|++|++.+.+. ++.+|+++|..++++|+.+||++||.|++||..+.. + . 
T Consensus 1 msi~i~Gld~l~~~l~~~~~~~~~~-v~~al~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~~-----g-~---------- 63 (114) T protein:vir:95 1 MAIKWQGIEKLVATISNAQPKAVEQ-SLQVLKNNGEKGKRIAKQLAPKDTEFLKDHITTSYP-----G-M---------- 63 (114) T ss_pred CeeeeehHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhCCcCchhhhhceeeecC-----c-e---------- Confidence 7788899999999999999987654 588999999999999999999999999999875321 0 0 Q ss_pred ccccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 84 PRTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAID 144 (149) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~ 144 (149) . +....+.+||+|+||||++|||||||+||++++++++.+.|.+.++++|+ T Consensus 64 ------~----~~V~~~~~Ya~yvE~GT~~~~aqPfl~pa~~~~~~~~~~~l~~~l~~~~k 114 (114) T protein:vir:95 64 ------E----AHIHGEAGYDGYQEYGTRFQPGTPHFRPMMEQIQPQFQKDMTDVMKGAFK 114 (114) T ss_pred ------E----EEeecCCCccceeecCccccCCCccchhhHHHHHHHHHHHHHHHHHhhcC Confidence 0 01123568999999999999999999999999999999999999999999 No 23 >protein:vir:3617 Length: 112 # NCBI annotation: ORF40 # Family: family:all:180 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112703;genbank:gi:13786571;genbank:GeneID:921069 Probab=99.93 E-value=1.8e-29 Score=178.23 Aligned_cols=112 Identities=22% Similarity=0.406 Sum_probs=94.2 Q ss_pred cceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeeec Q lcl|NC_019769. 2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRG 81 (149) Q Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~~ 81 (149) |+++|+|+|||+|++.|+++.. .++++.+|++++.+|+++++.++|++||.|++||...... +. 
T Consensus 1 M~~~i~i~Gld~l~~~L~~~~~---~~~~~~al~~~~~~i~~~ak~~aPvdTG~Lr~si~~~~~~----~~--------- 64 (112) T protein:vir:36 1 MKSSLSFKGIDQLVKHLDKAAS---LKGVQQVVKSNTSNMTANMQKLVPVDTGYMKRSIKMELTE----GG--------- 64 (112) T ss_pred CceeeeehhHHHHHHHHHhhhh---HHHHHHHHHHHHHHHHHHHHHhCCCCchhhhhceeeeecC----Cc--------- Confidence 9999999999999999998864 3567999999999999999999999999999999753211 00 Q ss_pred ccccccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 82 VNPRTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMN 140 (149) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~ 140 (149) ..+.+ ..+.+||+|+||||++|||||||+||++++++++.+.|.+.++ T Consensus 65 ---------~~~~V--~~~~~Ya~~vE~GT~k~~a~Pfl~pa~~~~~~~~~~~i~~~lr 112 (112) T protein:vir:36 65 ---------FSGQA--GPHTDYSAYVEYGTRFQSAQPFVKPAYNEQKGVFIKDLERLLK 112 (112) T ss_pred ---------eEEEe--ecCCCccceeeccccccCCCcchhhhHHHHHHHHHHHHHHHcC Confidence 00111 2357899999999999999999999999999999998888877 No 24 >protein:vir:79988 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430006;genbank:gi:156604061;genbank:GeneID:5525448 Probab=99.92 E-value=4.3e-28 Score=170.62 Aligned_cols=122 Identities=13% Similarity=0.144 Sum_probs=91.5 Q ss_pred cceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCc--ccccceeccccccccCccccceee Q lcl|NC_019769. 2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGK--LKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~--l~~~i~~~~~~~~~~~~~~~~i~~ 79 (149) |++++++.||++ .|++|+... +++.+.||++||+++++.++.++|++++. +++||.++. 
T Consensus 1 M~v~v~~~~L~~---~l~~l~~~~-~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~--------------- 61 (125) T protein:vir:79 1 MGARIESNNIEQ---GLKNAVLKM-NLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSN--------------- 61 (125) T ss_pred CeeEeeHHHHHH---HHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecc--------------- Confidence 888887765554 555555553 57789999999999999999999988754 666665543 Q ss_pred ecccccccccceeE-ecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 80 RGVNPRTGNSDNTM-KANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAID 144 (149) Q Consensus 80 ~~~~~~~~~~~~~~-~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~ 144 (149) .+...+.....+ ++.+..++|||||+||||++|||+||++||+++++++++++|.++|++..+ T Consensus 62 --~k~~~~~g~~~v~VG~~k~~~~~a~F~E~GT~k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:79 62 --VKTDRHTSEKIVTIGYAKGVSHRIHATEFGTMYQKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred --cccccccceEEEEeccCCCCceEEEeccCCccCCCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 222222222222 233455689999999999999999999999999999999999999966555 No 25 >protein:vir:4704 Length: 125 # NCBI annotation: phi PVL ORF 11 homologue # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061636;genbank:gi:9635723;genbank:GeneID:1262995 Probab=99.92 E-value=4.3e-28 Score=170.62 Aligned_cols=122 Identities=13% Similarity=0.144 Sum_probs=91.5 Q ss_pred cceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCc--ccccceeccccccccCccccceee Q lcl|NC_019769. 2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGK--LKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~--l~~~i~~~~~~~~~~~~~~~~i~~ 79 (149) |++++++.||++ .|++|+... +++.+.||++||+++++.++.++|++++. +++||.++. 
T Consensus 1 M~v~v~~~~L~~---~l~~l~~~~-~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~--------------- 61 (125) T protein:vir:47 1 MGARIESNNIEQ---GLKNAVLKM-NLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSN--------------- 61 (125) T ss_pred CeeEeeHHHHHH---HHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecc--------------- Confidence 888887765554 555555553 57789999999999999999999988754 666665543 Q ss_pred ecccccccccceeE-ecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 80 RGVNPRTGNSDNTM-KANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAID 144 (149) Q Consensus 80 ~~~~~~~~~~~~~~-~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~ 144 (149) .+...+.....+ ++.+..++|||||+||||++|||+||++||+++++++++++|.++|++..+ T Consensus 62 --~k~~~~~g~~~v~VG~~k~~~~~a~F~E~GT~k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:47 62 --VKTDRHTSEKIVTIGYAKGVSHRIHATEFGTMYQKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred --cccccccceEEEEeccCCCCceEEEeccCCccCCCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 222222222222 233455689999999999999999999999999999999999999966555 No 26 >protein:vir:98342 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918934;genbank:gi:119443696;genbank:GeneID:4594504 Probab=99.92 E-value=4.3e-28 Score=170.62 Aligned_cols=122 Identities=13% Similarity=0.144 Sum_probs=91.5 Q ss_pred cceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCc--ccccceeccccccccCccccceee Q lcl|NC_019769. 2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGK--LKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~--l~~~i~~~~~~~~~~~~~~~~i~~ 79 (149) |++++++.||++ .|++|+... +++.+.||++||+++++.++.++|++++. +++||.++. 
T Consensus 1 M~v~v~~~~L~~---~l~~l~~~~-~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~--------------- 61 (125) T protein:vir:98 1 MGARIESNNIEQ---GLKNAVLKM-NLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSN--------------- 61 (125) T ss_pred CeeEeeHHHHHH---HHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecc--------------- Confidence 888887765554 555555553 57789999999999999999999988754 666665543 Q ss_pred ecccccccccceeE-ecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 80 RGVNPRTGNSDNTM-KANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAID 144 (149) Q Consensus 80 ~~~~~~~~~~~~~~-~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~ 144 (149) .+...+.....+ ++.+..++|||||+||||++|||+||++||+++++++++++|.++|++..+ T Consensus 62 --~k~~~~~g~~~v~VG~~k~~~~~a~F~E~GT~k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:98 62 --VKTDRHTSEKIVTIGYAKGVSHRIHATEFGTMYQKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred --cccccccceEEEEeccCCCCceEEEeccCCccCCCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 222222222222 233455689999999999999999999999999999999999999966555 No 27 >protein:vir:9414 Length: 125 # NCBI annotation: phi PVL orf 11-like protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803392;genbank:gi:29028704;genbank:GeneID:1258141 Probab=99.92 E-value=4.3e-28 Score=170.62 Aligned_cols=122 Identities=13% Similarity=0.144 Sum_probs=91.5 Q ss_pred cceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCc--ccccceeccccccccCccccceee Q lcl|NC_019769. 2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGK--LKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~--l~~~i~~~~~~~~~~~~~~~~i~~ 79 (149) |++++++.||++ .|++|+... +++.+.||++||+++++.++.++|++++. +++||.++. 
T Consensus 1 M~v~v~~~~L~~---~l~~l~~~~-~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~--------------- 61 (125) T protein:vir:94 1 MGARIESNNIEQ---GLKNAVLKM-NLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSN--------------- 61 (125) T ss_pred CeeEeeHHHHHH---HHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecc--------------- Confidence 888887765554 555555553 57789999999999999999999988754 666665543 Q ss_pred ecccccccccceeE-ecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 80 RGVNPRTGNSDNTM-KANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAID 144 (149) Q Consensus 80 ~~~~~~~~~~~~~~-~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~ 144 (149) .+...+.....+ ++.+..++|||||+||||++|||+||++||+++++++++++|.++|++..+ T Consensus 62 --~k~~~~~g~~~v~VG~~k~~~~~a~F~E~GT~k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:94 62 --VKTDRHTSEKIVTIGYAKGVSHRIHATEFGTMYQKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred --cccccccceEEEEeccCCCCceEEEeccCCccCCCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 222222222222 233455689999999999999999999999999999999999999966555 No 28 >protein:vir:81106 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429878;genbank:gi:156603931;genbank:GeneID:5525326 Probab=99.92 E-value=4.3e-28 Score=170.62 Aligned_cols=122 Identities=13% Similarity=0.144 Sum_probs=91.5 Q ss_pred cceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCc--ccccceeccccccccCccccceee Q lcl|NC_019769. 2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGK--LKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~--l~~~i~~~~~~~~~~~~~~~~i~~ 79 (149) |++++++.||++ .|++|+... +++.+.||++||+++++.++.++|++++. +++||.++. 
T Consensus 1 M~v~v~~~~L~~---~l~~l~~~~-~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~--------------- 61 (125) T protein:vir:81 1 MGARIESNNIEQ---GLKNAVLKM-NLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSN--------------- 61 (125) T ss_pred CeeEeeHHHHHH---HHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecc--------------- Confidence 888887765554 555555553 57789999999999999999999988754 666665543 Q ss_pred ecccccccccceeE-ecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 80 RGVNPRTGNSDNTM-KANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAID 144 (149) Q Consensus 80 ~~~~~~~~~~~~~~-~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~ 144 (149) .+...+.....+ ++.+..++|||||+||||++|||+||++||+++++++++++|.++|++..+ T Consensus 62 --~k~~~~~g~~~v~VG~~k~~~~~a~F~E~GT~k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:81 62 --VKTDRHTSEKIVTIGYAKGVSHRIHATEFGTMYQKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred --cccccccceEEEEeccCCCCceEEEeccCCccCCCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 222222222222 233455689999999999999999999999999999999999999966555 No 29 >protein:vir:9312 Length: 115 # NCBI annotation: phi Mu50B-like protein # Family: family:all:180 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803290;genbank:gi:29028600;genbank:GeneID:1258048 Probab=99.92 E-value=1.1e-28 Score=173.90 Aligned_cols=109 Identities=21% Similarity=0.355 Sum_probs=91.9 Q ss_pred eehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CcCCCcccccceeccccccccCccccceee Q lcl|NC_019769. 6 LDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARA------PVRTGKLKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 6 ~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~a------P~~~g~l~~~i~~~~~~~~~~~~~~~~i~~ 79 (149) |+|+|||+|++.|++|++.+. +.++.|+.+++..+.++++.++ |++||.|++||..+. .+. 
T Consensus 1 i~~~Gld~l~~~l~~~~~~~~-~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~-----~g~------- 67 (115) T protein:vir:93 1 MNIDGLDALLNQFHDMKTNID-DDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKK-----TGD------- 67 (115) T ss_pred CcchhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeee-----cCc------- Confidence 999999999999999998875 5568999999999999999998 899999999997531 111 Q ss_pred ecccccccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 80 RGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMN 140 (149) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~ 140 (149) ... ....+.+||+|+||||++|||||||+||++++++.+++.|.+.++ T Consensus 68 ---------~~~----~v~~~~~Ya~~vE~GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:93 68 ---------LQY----TITSHAAYSGFLEFGTRYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred ---------eEE----EeecCccchhhhcccccccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 011 112346899999999999999999999999999999999888777 No 30 >protein:vir:97144 Length: 115 # NCBI annotation: ORF047 # Family: family:all:180 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239729;genbank:gi:66394911;genbank:GeneID:5130877 Probab=99.92 E-value=1.1e-28 Score=173.90 Aligned_cols=109 Identities=21% Similarity=0.355 Sum_probs=91.9 Q ss_pred eehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CcCCCcccccceeccccccccCccccceee Q lcl|NC_019769. 6 LDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARA------PVRTGKLKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 6 ~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~a------P~~~g~l~~~i~~~~~~~~~~~~~~~~i~~ 79 (149) |+|+|||+|++.|++|++.+. +.++.|+.+++..+.++++.++ |++||.|++||..+. .+. 
T Consensus 1 i~~~Gld~l~~~l~~~~~~~~-~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~-----~g~------- 67 (115) T protein:vir:97 1 MNIDGLDALLNQFHDMKTNID-DDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKK-----TGD------- 67 (115) T ss_pred CcchhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeee-----cCc------- Confidence 999999999999999998875 5568999999999999999998 899999999997531 111 Q ss_pred ecccccccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 80 RGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMN 140 (149) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~ 140 (149) ... ....+.+||+|+||||++|||||||+||++++++.+++.|.+.++ T Consensus 68 ---------~~~----~v~~~~~Ya~~vE~GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:97 68 ---------LQY----TITSHAAYSGFLEFGTRYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred ---------eEE----EeecCccchhhhcccccccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 011 112346899999999999999999999999999999999888777 No 31 >protein:vir:78858 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285365;genbank:gi:148717893;genbank:GeneID:5246989 Probab=99.92 E-value=1.1e-28 Score=173.90 Aligned_cols=109 Identities=21% Similarity=0.355 Sum_probs=91.9 Q ss_pred eehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CcCCCcccccceeccccccccCccccceee Q lcl|NC_019769. 6 LDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARA------PVRTGKLKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 6 ~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~a------P~~~g~l~~~i~~~~~~~~~~~~~~~~i~~ 79 (149) |+|+|||+|++.|++|++.+. +.++.|+.+++..+.++++.++ |++||.|++||..+. .+. 
T Consensus 1 i~~~Gld~l~~~l~~~~~~~~-~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~-----~g~------- 67 (115) T protein:vir:78 1 MNIDGLDALLNQFHDMKTNID-DDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKK-----TGD------- 67 (115) T ss_pred CcchhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeee-----cCc------- Confidence 999999999999999998875 5568999999999999999998 899999999997531 111 Q ss_pred ecccccccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 80 RGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMN 140 (149) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~ 140 (149) ... ....+.+||+|+||||++|||||||+||++++++.+++.|.+.++ T Consensus 68 ---------~~~----~v~~~~~Ya~~vE~GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:78 68 ---------LQY----TITSHAAYSGFLEFGTRYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred ---------eEE----EeecCccchhhhcccccccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 011 112346899999999999999999999999999999999888777 No 32 >protein:vir:96225 Length: 115 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239574;genbank:gi:66395330;genbank:GeneID:5132773 Probab=99.92 E-value=1.1e-28 Score=173.90 Aligned_cols=109 Identities=21% Similarity=0.355 Sum_probs=91.9 Q ss_pred eehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CcCCCcccccceeccccccccCccccceee Q lcl|NC_019769. 6 LDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARA------PVRTGKLKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 6 ~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~a------P~~~g~l~~~i~~~~~~~~~~~~~~~~i~~ 79 (149) |+|+|||+|++.|++|++.+. +.++.|+.+++..+.++++.++ |++||.|++||..+. .+. 
T Consensus 1 i~~~Gld~l~~~l~~~~~~~~-~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~-----~g~------- 67 (115) T protein:vir:96 1 MNIDGLDALLNQFHDMKTNID-DDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKK-----TGD------- 67 (115) T ss_pred CcchhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeee-----cCc------- Confidence 999999999999999998875 5568999999999999999998 899999999997531 111 Q ss_pred ecccccccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 80 RGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMN 140 (149) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~ 140 (149) ... ....+.+||+|+||||++|||||||+||++++++.+++.|.+.++ T Consensus 68 ---------~~~----~v~~~~~Ya~~vE~GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:96 68 ---------LQY----TITSHAAYSGFLEFGTRYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred ---------eEE----EeecCccchhhhcccccccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 011 112346899999999999999999999999999999999888777 No 33 >protein:vir:96358 Length: 115 # NCBI annotation: ORF045 # Family: family:all:180 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239651;genbank:gi:66395408;genbank:GeneID:5132834 Probab=99.92 E-value=1.1e-28 Score=173.90 Aligned_cols=109 Identities=21% Similarity=0.355 Sum_probs=91.9 Q ss_pred eehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CcCCCcccccceeccccccccCccccceee Q lcl|NC_019769. 6 LDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARA------PVRTGKLKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 6 ~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~a------P~~~g~l~~~i~~~~~~~~~~~~~~~~i~~ 79 (149) |+|+|||+|++.|++|++.+. +.++.|+.+++..+.++++.++ |++||.|++||..+. .+. 
T Consensus 1 i~~~Gld~l~~~l~~~~~~~~-~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~-----~g~------- 67 (115) T protein:vir:96 1 MNIDGLDALLNQFHDMKTNID-DDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKK-----TGD------- 67 (115) T ss_pred CcchhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeee-----cCc------- Confidence 999999999999999998875 5568999999999999999998 899999999997531 111 Q ss_pred ecccccccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 80 RGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMN 140 (149) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~ 140 (149) ... ....+.+||+|+||||++|||||||+||++++++.+++.|.+.++ T Consensus 68 ---------~~~----~v~~~~~Ya~~vE~GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:96 68 ---------LQY----TITSHAAYSGFLEFGTRYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred ---------eEE----EeecCccchhhhcccccccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 011 112346899999999999999999999999999999999888777 No 34 >protein:vir:103917 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873996;genbank:gi:118430771;genbank:GeneID:4525409 Probab=99.92 E-value=1.1e-28 Score=173.90 Aligned_cols=109 Identities=21% Similarity=0.355 Sum_probs=91.9 Q ss_pred eehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CcCCCcccccceeccccccccCccccceee Q lcl|NC_019769. 6 LDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARA------PVRTGKLKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 6 ~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~a------P~~~g~l~~~i~~~~~~~~~~~~~~~~i~~ 79 (149) |+|+|||+|++.|++|++.+. +.++.|+.+++..+.++++.++ |++||.|++||..+. .+. 
T Consensus 1 i~~~Gld~l~~~l~~~~~~~~-~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~-----~g~------- 67 (115) T protein:vir:10 1 MNIDGLDALLNQFHDMKTNID-DDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKK-----TGD------- 67 (115) T ss_pred CcchhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeee-----cCc------- Confidence 999999999999999998875 5568999999999999999998 899999999997531 111 Q ss_pred ecccccccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 80 RGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMN 140 (149) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~ 140 (149) ... ....+.+||+|+||||++|||||||+||++++++.+++.|.+.++ T Consensus 68 ---------~~~----~v~~~~~Ya~~vE~GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:10 68 ---------LQY----TITSHAAYSGFLEFGTRYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred ---------eEE----EeecCccchhhhcccccccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 011 112346899999999999999999999999999999999888777 No 35 >protein:vir:106623 Length: 115 # NCBI annotation: ORF049 # Family: family:all:180 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239497;genbank:gi:66395260;genbank:GeneID:4555777 Probab=99.92 E-value=1.2e-28 Score=173.72 Aligned_cols=109 Identities=20% Similarity=0.289 Sum_probs=91.8 Q ss_pred eehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CcCCCcccccceeccccccccCccccceee Q lcl|NC_019769. 6 LDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARA------PVRTGKLKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 6 ~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~a------P~~~g~l~~~i~~~~~~~~~~~~~~~~i~~ 79 (149) |+|+|||+|++.|+++++.+. +.++.+|++++..++++++.++ |++||.|++||.... .+. 
T Consensus 1 i~i~Gld~L~~~l~~~~~~~~-~~~~~al~~~~~~i~~~a~~~a~~~~~~pv~TG~Lr~sI~~~~-----~g~------- 67 (115) T protein:vir:10 1 MQSKGLKKLMNHLKVMHDDIE-DDVDDILKNNAKEGVGIAVSNAKEVMNKGYWTGNLASLIEVKK-----IGD------- 67 (115) T ss_pred CeehhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhccccCCCCcchhhhhceeeee-----cCc------- Confidence 999999999999999998864 5669999999999999999998 778999999986531 111 Q ss_pred ecccccccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 80 RGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMN 140 (149) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~ 140 (149) . .+....+++||+|+||||++|||||||+||++++++.+++.|.+.+. T Consensus 68 ---------~----~~~v~~~~~Ya~~vEfGT~km~a~PFl~PA~~~~k~~~~~~i~~~i~ 115 (115) T protein:vir:10 68 ---------L----HYRVISTAHYSGFLEFGTRYMEPAPFMFPTYQTLKKSTINDLKRLLS 115 (115) T ss_pred ---------E----EEEeeCCCccchheecccccCCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 1 11113457899999999999999999999999999999988888887 No 36 >protein:vir:99744 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004311;genbank:gi:122891765;genbank:GeneID:4712299 Probab=99.91 E-value=2e-28 Score=172.48 Aligned_cols=109 Identities=20% Similarity=0.340 Sum_probs=93.1 Q ss_pred eehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CcCCCcccccceeccccccccCccccceee Q lcl|NC_019769. 6 LDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARA------PVRTGKLKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 6 ~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~a------P~~~g~l~~~i~~~~~~~~~~~~~~~~i~~ 79 (149) |+|+|||+|++.|++|++.+. +.++.++++++..+.++++.++ |++||.|++||.... .+. 
T Consensus 1 i~i~Gld~L~~~l~~~~~~~~-~~v~~av~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~SI~~~~-----~g~------- 67 (115) T protein:vir:99 1 MNIDGLDALLNQFHDMKTNID-DDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKK-----TVD------- 67 (115) T ss_pred CcchhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhccccCCCCcchhhhhceeeee-----cCc------- Confidence 999999999999999998875 6679999999999999999998 999999999997532 111 Q ss_pred ecccccccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 80 RGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMN 140 (149) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~ 140 (149) ..+...++++||+|+||||++|+|||||+|||+++++.+++.|.+.++ T Consensus 68 -------------~~~~V~~~~~Ya~~vE~GT~~m~a~PFl~PA~~~~k~~~~~~l~~~~k 115 (115) T protein:vir:99 68 -------------LQYTITSHAAYSGFLEFGTRYMEAEPFMWPVYEVIRKSTVEELKTLFE 115 (115) T ss_pred -------------EEEEecCCccccccccccccccCCCCcchhhHHHHHHHHHHHHHHHhC Confidence 111113457899999999999999999999999999999999888877 No 37 >protein:vir:9930 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795692;genbank:gi:28876456;genbank:GeneID:1257995 Probab=99.91 E-value=2.7e-28 Score=171.71 Aligned_cols=108 Identities=19% Similarity=0.272 Sum_probs=92.0 Q ss_pred hHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeeeccccccc Q lcl|NC_019769. 8 FSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVNPRTG 87 (149) Q Consensus 8 i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~~~~~~~~ 87 (149) |+|||+|++.|+++++.+. +.++.+|.++|..++++++.++|++||.|++||..... +. 
T Consensus 1 i~Gld~l~~~l~~~~~~~~-~~v~~al~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~~-----~~--------------- 59 (108) T protein:vir:99 1 MRGLDRFLRSVERKQKSVR-IAVDKELSKSAARIERQAKILAPVDTGWLRAQIYSEQQ-----RL--------------- 59 (108) T ss_pred CchHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhcCCcCchhhhcceeeeec-----Cc--------------- Confidence 9999999999999999864 56699999999999999999999999999999965321 11 Q ss_pred ccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 88 NSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQ 141 (149) Q Consensus 88 ~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~ 141 (149) .. +...++.+||+|+||||++|||||||+||++++++++.+.|.+.|++ T Consensus 60 -~~----~~v~~~~~Ya~~vE~GT~~m~a~Pf~~pa~~~~~~~~~~~i~~~lrk 108 (108) T protein:vir:99 60 -LH----YRVVSPALYSIYLELGTRKMEAQSFLDPALRKEWPVLMANIKKMFKR 108 (108) T ss_pred -EE----EEeecCcccchhcccCccccCCCcchhhhHHHHHHHHHHHHHHHhcC Confidence 00 11123578999999999999999999999999999998888888877 No 38 >protein:vir:743 Length: 108 # NCBI annotation: unknown # Family: family:all:180 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108720;genbank:gi:13487842;genbank:GeneID:920877 Probab=99.91 E-value=6.9e-28 Score=169.49 Aligned_cols=108 Identities=20% Similarity=0.389 Sum_probs=90.0 Q ss_pred eehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeeeccccc Q lcl|NC_019769. 6 LDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVNPR 85 (149) Q Consensus 6 ~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~~~~~~ 85 (149) |+|+|||+|++.|+++.. ...++.||+++|..|+++|+.++|++||.|++||.+..... .. 
T Consensus 1 i~i~Gld~l~~~l~~~~~---~~~~~~al~~~a~~i~~~ak~~aPv~TG~Lr~si~~~~~~~----~~------------ 61 (108) T protein:vir:74 1 MKITGIDALQKKLRKNAT---LDDVKHVVKSNTASMNKNMQNLAPVDTGNMKRSITSEFTDG----GL------------ 61 (108) T ss_pred CcchhHHHHHHHHHHhhh---HHHHHHHHHHHHHHHHHHHHHhCCCCchhhhccceeeeecC----ce------------ Confidence 999999999999998764 35678999999999999999999999999999997532110 00 Q ss_pred ccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 86 TGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMN 140 (149) Q Consensus 86 ~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~ 140 (149) ... ..++.+||+|+||||++|||||||+||++++++++.+.|.+.++ T Consensus 62 ------~~~--V~~~~~Ya~~vE~GT~km~aqpf~~pa~~~~~~~~~~~i~~~~k 108 (108) T protein:vir:74 62 ------SGT--TGPHTDYAGYVEYGTRFQSAQPFVKPAFNIQKKVFTNDLERLTK 108 (108) T ss_pred ------EEE--eecCCCcccceeccccccCCCcchhhHHHHHHHHHHHHHHHHcC Confidence 011 12356799999999999999999999999999999998887777 No 39 >protein:vir:102154 Length: 119 # NCBI annotation: phage protein, HK97 gp10 family # Family: family:all:10671 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699937;genbank:gi:110804042;genbank:GeneID:4206698 Probab=99.90 E-value=1.3e-27 Score=168.00 Aligned_cols=118 Identities=20% Similarity=0.351 Sum_probs=99.7 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) |+ ++++.|+|+|+..|++|+.. 
.+++.++||++|+++|++++..++|+++|.+++ |..+.+ T Consensus 1 Ma--~iel~G~del~~~l~~~g~~-~~~ie~kAlk~g~e~I~~~~~~n~P~~tg~lkk-ik~~~k--------------- 61 (119) T protein:vir:10 1 MA--SLEIEGFEEFEKFISEDMVL-DESTKRKGIKAGITKIGKAIEKNSPIKSGRLSK-VKIRVK--------------- 61 (119) T ss_pred Cc--eeehhhHHHHHHHHHhhhhh-hHHHHHHHHHHHhHHHHHHHhhcCCcccCCcce-eeeeee--------------- Confidence 54 56779999999999999975 589999999999999999999999999999885 322111 Q ss_pred cccccccccceeEecCCCCCcceeeeeccCccCCCCC-cchhHhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVEMGTANMTAH-PFIRPAFDVRQEQATEVAIRRMNQAID 144 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~-PFl~pA~~~~k~~~~~~~~~~~~~~l~ 144 (149) .. ....++..++..||..|+|||||+|||| |||.||++++++++++.|.++|.+.++ T Consensus 62 -----k~--g~~~VG~~ks~~fy~kF~EFGTSkm~a~~pF~~~a~~~~~~eA~~~~~~el~~~~r 119 (119) T protein:vir:10 62 -----NT--GLATEGTASSSEFYDIFQNFGTSEQKAHVGYFDRAVDETTNEAVEEVAEIIFRKMR 119 (119) T ss_pred -----cC--ceeEeccCCcchhhhhhccccccccCCCCCccccccccChHHHHHHHHHHHHHhcC Confidence 00 1223344556789999999999999999 999999999999999999999999999 No 40 >protein:vir:98409 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:YP_001210363;genbank:gi:146334932;genbank:GeneID:5114801 Probab=99.90 E-value=1.7e-27 Score=167.29 Aligned_cols=108 Identities=23% Similarity=0.394 Sum_probs=89.6 Q ss_pred eehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeeeccccc Q lcl|NC_019769. 6 LDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVNPR 85 (149) Q Consensus 6 ~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~~~~~~ 85 (149) |+|+|||+|++.|+++.. ...++.+|+++|..++++|+.++|++||.|++||...... +.. 
T Consensus 1 i~i~Gld~l~~~l~~~~~---~~~~~~al~~~a~~i~~~ak~~apvdTG~Lr~si~~~~~~----~~~------------ 61 (108) T protein:vir:98 1 MKITGIDALQKKLRKNAT---LNDVKHVVKRNTVSMNKNMQNLAPVDTGNMKRSITSEFTD----GGL------------ 61 (108) T ss_pred CcchhHHHHHHHHHHhhh---HHHHHHHHHHHHHHHHHHHHHhCCCCchhhHhhceeeeec----Cce------------ Confidence 999999999999998754 3456899999999999999999999999999999653211 000 Q ss_pred ccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 86 TGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMN 140 (149) Q Consensus 86 ~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~ 140 (149) . +. ...+.+||+|+||||++|||||||+||++++++++.+.|.+.++ T Consensus 62 ----~--~~--V~~~~~Ya~~vE~GT~~m~aqPFl~pa~~~~~~~~~~~i~~~lr 108 (108) T protein:vir:98 62 ----T--GT--TIPHTDYAGYVEYGTRFQAAQPFVKPAFDVQKKIFTNDLERLTK 108 (108) T ss_pred ----E--EE--eecCCCccceeeccccccCCCcchhhHHHHHHHHHHHHHHHHcC Confidence 0 11 12356899999999999999999999999999999998887777 No 41 >protein:vir:106570 Length: 182 # NCBI annotation: putative protein # Family: family:all:6475 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958588;genbank:gi:41179258;genbank:GeneID:2717106 Probab=99.89 E-value=2.4e-26 Score=161.05 Aligned_cols=147 Identities=18% Similarity=0.231 Sum_probs=100.7 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHH---HHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceecccccc------ccC Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSRAENN---KVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSR------RRG 71 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~---k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~------~~~ 71 (149) ||.+ +|.|+|+|.++|++|++.+.+ +++..++.+++..++++|+.++|++||.|++||........ -.. 
T Consensus 1 m~~v--~i~Gld~L~~kl~~~~~~~~~~v~~a~~~~~~~~a~~v~~~ak~~~PvdtG~Lr~SI~~~~~~~~~~~~g~V~~ 78 (182) T protein:vir:10 1 MIEV--ELKGVNELRAKLKKLPDIMAKATANAQENAIEQAEAYAVDELQSSIKYSTGELTRSFKHEVKVDGDEVIGRWWN 78 (182) T ss_pred CeEE--EEecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCchhhhhceeeeeeecCCeEEEEeec Confidence 7766 779999999999999986543 23455556677788889999999999999999975432211 122 Q ss_pred ccccceeeeccccccccccee-----EecCCCCCc--ceeeeec-------------------cCccCCCCCcchhHhHH Q lcl|NC_019769. 72 EISSGVHIRGVNPRTGNSDNT-----MKANNPRNA--FYWRFVE-------------------MGTANMTAHPFIRPAFD 125 (149) Q Consensus 72 ~~~~~i~~~~~~~~~~~~~~~-----~~~~~~~~~--~y~~~~E-------------------~GT~~~~a~PFl~pA~~ 125 (149) .....++++.+.+..+..... .......+. +|++.++ |+|++|||||||+||++ T Consensus 79 ~~~ya~yvE~GTG~~~~~~~~~~~p~~~~~~~~~~w~~~~~~v~~~~a~~~~~~~~~~~~~~~~~t~G~~aqPFl~pA~~ 158 (182) T protein:vir:10 79 SSMVAVFREFGTGLVGERSHKQLPKNVAIIYRQTPWFFPVDSVDLDLTKIYGIPKIKINGKYFYRTTGQPARQFMTPAAN 158 (182) T ss_pred CCCccceeecCcccccccCccccCccceeeeecCCceeeccccccccccccccceeeecCceEeecCCCCCCcchHHHHH Confidence 233555565554322211110 000001111 1222222 56889999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019769. 126 VRQEQATEVAIRRMNQAIDEALSK 149 (149) Q Consensus 126 ~~k~~~~~~~~~~~~~~l~k~~~k 149 (149) ++++++.++|.+.++++|++.++= T Consensus 159 ~~~~~i~~~i~~~i~~~l~~~~g~ 182 (182) T protein:vir:10 159 KMAKEAPEIIKRSIDQELHDKLGG 182 (182) T ss_pred HhHHHHHHHHHHHHHHHHHHhhcC Confidence 999999999999999999999988 No 42 >protein:vir:96486 Length: 112 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238496;genbank:gi:66391772;genbank:GeneID:5176908 Probab=99.89 E-value=1.2e-26 Score=162.65 Aligned_cols=111 Identities=23% Similarity=0.333 Sum_probs=89.7 Q ss_pred CcceeeehHhHHHHHHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceee Q lcl|NC_019769. 
1 MIETSLDFSGLNDIAKDLEALSR-AENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~-~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~ 79 (149) |.+ |+|+|||+|+++|++++. +..+++++.++.+.+..+++.++.++|++||.|++||..... .. T Consensus 1 Ma~--i~i~Gld~L~~~l~~~~~~~~v~~~v~~~~~~~~~~~~~~a~~~apvdTG~Lr~sI~~~~~------~~------ 66 (112) T protein:vir:96 1 MAT--IEFEGLDEMAQSLLKNASSERRSKVLRKYGAKLKEAAVSKAQFKKGYSTGATRRSITLEAG------SD------ 66 (112) T ss_pred Cce--eeehHHHHHHHHHHhhcCHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCchhhhhceeeecC------ce------ Confidence 554 688999999999999843 234678899999999999999999999999999999965311 10 Q ss_pred ecccccccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHH Q lcl|NC_019769. 80 RGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRM 139 (149) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~ 139 (149) ... ...+.+||+|+||||++|||||||+|||++.+..+++.+.+-- T Consensus 67 ------------~~~--v~~~~~Ya~~vE~GTr~m~AqPF~~PA~~~~~~~~~~~l~~L~ 112 (112) T protein:vir:96 67 ------------RAV--VEALTNYSGYLEVGTRKMEAQPFMRPALDQVVPEMVEEMAKWE 112 (112) T ss_pred ------------EEE--ecCCCCccceeccCccccCCCCchhhhHHHHHHHHHHHHHhcC Confidence 011 1234689999999999999999999999999999888776544 No 43 >protein:vir:2740 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695113;genbank:gi:23455882;genbank:GeneID:955595 Probab=99.88 E-value=9.6e-27 Score=163.23 Aligned_cols=113 Identities=22% Similarity=0.351 Sum_probs=88.4 Q ss_pred CcceeeehHhHHHHHHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceee Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSR-AENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~-~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~ 79 (149) |. +|+|+|||+|++.|+++.. ...+++++.++...++.+++.|+.++|++||+|++||...... +. 
T Consensus 1 Ma--~i~~~Gld~l~~~L~~~~~~~~v~~~~~~~~~~~~~~~~~~a~~~~p~~TG~Lr~sI~~~~~~----~~------- 67 (114) T protein:vir:27 1 MA--TIEFEGLDEMAQSLLKNASPEKRSKVLRKYGSKLKEAAVNRAQFNKGYSTGATRRSITLQVES----DK------- 67 (114) T ss_pred Ce--eeeeehHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcccCCCCCchhhhhceeeeecC----Ce------- Confidence 55 5888999999999999842 2235666777777777777777777899999999999653210 00 Q ss_pred ecccccccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 80 RGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQ 141 (149) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~ 141 (149) .. ...+.+||+|+||||++|||||||+||++++++++++.|.+.++- T Consensus 68 -------------~~--V~~~~~Ya~~vEfGT~km~a~Pfl~PA~~~~~~~~~~~l~~l~k~ 114 (114) T protein:vir:27 68 -------------AT--VEALTSYSGYLEVGTRKMEAQPFMKPALDEVAPKMVEELAKWDET 114 (114) T ss_pred -------------eE--ecCCCCccceecccccccCCCCchhhhHHHHHHHHHHHHHHHhcC Confidence 00 123468999999999999999999999999999999998888877 No 44 >protein:vir:4906 Length: 114 # NCBI annotation: gp114 # Family: family:all:180 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056684;genbank:gi:9635019;genbank:GeneID:1262668 Probab=99.88 E-value=9.6e-27 Score=163.23 Aligned_cols=113 Identities=22% Similarity=0.351 Sum_probs=88.4 Q ss_pred CcceeeehHhHHHHHHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceee Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSR-AENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~-~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~ 79 (149) |. +|+|+|||+|++.|+++.. ...+++++.++...++.+++.|+.++|++||+|++||...... +. 
T Consensus 1 Ma--~i~~~Gld~l~~~L~~~~~~~~v~~~~~~~~~~~~~~~~~~a~~~~p~~TG~Lr~sI~~~~~~----~~------- 67 (114) T protein:vir:49 1 MA--TIEFEGLDEMAQSLLKNASPEKRSKVLRKYGSKLKEAAVNRAQFNKGYSTGATRRSITLQVES----DK------- 67 (114) T ss_pred Ce--eeeeehHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcccCCCCCchhhhhceeeeecC----Ce------- Confidence 55 5888999999999999842 2235666777777777777777777899999999999653210 00 Q ss_pred ecccccccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 80 RGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQ 141 (149) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~ 141 (149) .. ...+.+||+|+||||++|||||||+||++++++++++.|.+.++- T Consensus 68 -------------~~--V~~~~~Ya~~vEfGT~km~a~Pfl~PA~~~~~~~~~~~l~~l~k~ 114 (114) T protein:vir:49 68 -------------AT--VEALTSYSGYLEVGTRKMEAQPFMKPALDEVAPKMVEELAKWDET 114 (114) T ss_pred -------------eE--ecCCCCccceecccccccCCCCchhhhHHHHHHHHHHHHHHHhcC Confidence 00 123468999999999999999999999999999999998888877 No 45 >protein:vir:5978 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690678;genbank:geneid:6329146;genbank:gi:22855072;interpro:IPR011693;uniprot:O48447;genbank:GeneID:955318 Probab=99.83 E-value=1.2e-23 Score=146.29 Aligned_cols=115 Identities=22% Similarity=0.250 Sum_probs=91.9 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) +|+++++++|+++|.+.|+++++.+. +.++.+|..+|+.++++++.++|++||+|++||........ 
+ T Consensus 3 ~ms~~i~~~g~~~l~~~l~~~~~~~~-~~v~~~l~~~a~~i~~~ak~~apv~TG~Lr~SI~~~~~~~g----~------- 70 (144) T protein:vir:59 3 LMSVRIDPSWRRIMSRNVRTFSGHVL-TQVEQVIIKTAEKIAGLAASLAPVDEGNLKNSIQIDYKNNG----L------- 70 (144) T ss_pred cceeeehhHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCccchhhhcCeeEEeecCc----E------- Confidence 56667788999999999999999875 56789999999999999999999999999999975321100 0 Q ss_pred cccccccccceeEecCCCCCcceeeeeccCc---------------------------cCCCCCcchhHhHHHHHHHHHH Q lcl|NC_019769. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVEMGT---------------------------ANMTAHPFIRPAFDVRQEQATE 133 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT---------------------------~~~~a~PFl~pA~~~~k~~~~~ 133 (149) .+....+..|+.|+|||| .+|||||||+||++.+++.+.+ T Consensus 71 -------------~~~V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~~~~ 137 (144) T protein:vir:59 71 -------------TAEITVGAEYAIYVEYGTGIYAVDGNGRKTPWTYYSPKLGRYVRTQGAPAQPFFWPAVEEGGEYFER 137 (144) T ss_pred -------------EEEEecCCCccchhhcCccccccCCCccccccccccccccceecCCCCCCCcchhHHHHHHHHHHHH Confidence 001122457888888887 5699999999999999998888 Q ss_pred HHHHHHH Q lcl|NC_019769. 134 VAIRRMN 140 (149) Q Consensus 134 ~~~~~~~ 140 (149) .|.+.+- T Consensus 138 ~i~~~~g 144 (144) T protein:vir:59 138 EMRRLRG 144 (144) T ss_pred HHHHhcC Confidence 7777766 No 46 >protein:vir:97427 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240753;genbank:gi:66396447;genbank:GeneID:5133783 Probab=99.82 E-value=1.1e-23 Score=146.49 Aligned_cols=108 Identities=23% Similarity=0.297 Sum_probs=88.8 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) |+++ +.|+++|++.|+++++++. +.++.++..++..++++++.++|++||.|++||....... 
T Consensus 1 Ma~~---~~g~~~l~~~l~~~~~~~~-~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~------------- 63 (137) T protein:vir:97 1 MAKV---KYGNWDLVKELENYERDME-RWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDS------------- 63 (137) T ss_pred Cchh---HHhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEeecC------------- Confidence 5555 4699999999999999874 6789999999999999999999999999999996532110 Q ss_pred cccccccccceeEecCCCCCcceeeeeccCc-----------------------------cCCCCCcchhHhHHHHHHHH Q lcl|NC_019769. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVEMGT-----------------------------ANMTAHPFIRPAFDVRQEQA 131 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT-----------------------------~~~~a~PFl~pA~~~~k~~~ 131 (149) ... +....+..|++|+|||| ++|||||||+||++++++.+ T Consensus 64 -------~~~----~~V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~ 132 (137) T protein:vir:97 64 -------GFT----GVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFF 132 (137) T ss_pred -------ceE----EEEecCCCcccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHH Confidence 000 01123467999999998 57999999999999999999 Q ss_pred HHHHH Q lcl|NC_019769. 132 TEVAI 136 (149) Q Consensus 132 ~~~~~ 136 (149) .+.|. T Consensus 133 ~~~l~ 137 (137) T protein:vir:97 133 NKYFS 137 (137) T ss_pred HHhhC Confidence 99999 No 47 >protein:vir:93738 Length: 137 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240463;genbank:gi:66396153;genbank:GeneID:5133507 Probab=99.82 E-value=1.1e-23 Score=146.49 Aligned_cols=108 Identities=23% Similarity=0.297 Sum_probs=88.8 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) |+++ +.|+++|++.|+++++++. +.++.++..++..++++++.++|++||.|++||....... 
T Consensus 1 Ma~~---~~g~~~l~~~l~~~~~~~~-~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~------------- 63 (137) T protein:vir:93 1 MAKV---KYGNWDLVKELENYERDME-RWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDS------------- 63 (137) T ss_pred Cchh---HHhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEeecC------------- Confidence 5555 4699999999999999874 6789999999999999999999999999999996532110 Q ss_pred cccccccccceeEecCCCCCcceeeeeccCc-----------------------------cCCCCCcchhHhHHHHHHHH Q lcl|NC_019769. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVEMGT-----------------------------ANMTAHPFIRPAFDVRQEQA 131 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT-----------------------------~~~~a~PFl~pA~~~~k~~~ 131 (149) ... +....+..|++|+|||| ++|||||||+||++++++.+ T Consensus 64 -------~~~----~~V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~ 132 (137) T protein:vir:93 64 -------GFT----GVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFF 132 (137) T ss_pred -------ceE----EEEecCCCcccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHH Confidence 000 01123467999999998 57999999999999999999 Q ss_pred HHHHH Q lcl|NC_019769. 132 TEVAI 136 (149) Q Consensus 132 ~~~~~ 136 (149) .+.|. T Consensus 133 ~~~l~ 137 (137) T protein:vir:93 133 NKYFS 137 (137) T ss_pred HHhhC Confidence 99999 No 48 >protein:vir:94490 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240680;genbank:gi:66396374;genbank:GeneID:5133754 Probab=99.82 E-value=1.1e-23 Score=146.49 Aligned_cols=108 Identities=23% Similarity=0.297 Sum_probs=88.8 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) |+++ +.|+++|++.|+++++++. +.++.++..++..++++++.++|++||.|++||....... 
T Consensus 1 Ma~~---~~g~~~l~~~l~~~~~~~~-~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~------------- 63 (137) T protein:vir:94 1 MAKV---KYGNWDLVKELENYERDME-RWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDS------------- 63 (137) T ss_pred Cchh---HHhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEeecC------------- Confidence 5555 4699999999999999874 6789999999999999999999999999999996532110 Q ss_pred cccccccccceeEecCCCCCcceeeeeccCc-----------------------------cCCCCCcchhHhHHHHHHHH Q lcl|NC_019769. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVEMGT-----------------------------ANMTAHPFIRPAFDVRQEQA 131 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT-----------------------------~~~~a~PFl~pA~~~~k~~~ 131 (149) ... +....+..|++|+|||| ++|||||||+||++++++.+ T Consensus 64 -------~~~----~~V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~ 132 (137) T protein:vir:94 64 -------GFT----GVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFF 132 (137) T ss_pred -------ceE----EEEecCCCcccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHH Confidence 000 01123467999999998 57999999999999999999 Q ss_pred HHHHH Q lcl|NC_019769. 132 TEVAI 136 (149) Q Consensus 132 ~~~~~ 136 (149) .+.|. T Consensus 133 ~~~l~ 137 (137) T protein:vir:94 133 NKYFS 137 (137) T ss_pred HHhhC Confidence 99999 No 49 >protein:vir:4956 Length: 153 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049932;genbank:gi:9632903;genbank:GeneID:1262079 Probab=99.82 E-value=2.9e-23 Score=144.13 Aligned_cols=136 Identities=18% Similarity=0.212 Sum_probs=104.9 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) |.+++ .||++|+++|++|...+ .++.++|+++||+++++.++.++|++.- +.+.....+|+.++|.+. 
T Consensus 1 M~~~~---~glee~~~~lekL~~~~-~~~~~katkAGA~v~~e~L~~~tp~~h~--------~~~kt~~~~HlaD~I~~s 68 (153) T protein:vir:49 1 MTGLD---EALEGWLKTVASIGDLT-PAEQAKITTAGAKVFKEELAEVTREKHY--------SKKKDLKYGHMADGLAVQ 68 (153) T ss_pred CccHH---HHHHHHHHHHHHhccCC-HHHHHHHHHHHHHHHHHHHHHhccccCC--------CCCCCCCCCcccccceec Confidence 77755 79999999999999864 5677899999999999999999997631 112334456888888876 Q ss_pred cccccccccceeEecC--CCCCcceeeeeccCccCCCCCcchhHhHHHH--HHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019769. 81 GVNPRTGNSDNTMKAN--NPRNAFYWRFVEMGTANMTAHPFIRPAFDVR--QEQATEVAIRRMNQAIDEALSK 149 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~--~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~--k~~~~~~~~~~~~~~l~k~~~k 149 (149) ..+. .|.......++ ....+||+||+||||++|||+||+.++.+++ ++++++++.+++++.|++..+= T Consensus 69 ~~~i-dG~~dG~s~VG~~~~~~a~~a~f~n~GT~km~~~hFie~tr~e~~~k~~vl~A~~~~~~~il~~~~~~ 140 (153) T protein:vir:49 69 STNA-DGRKNGVSTVGWKNNYHAQNARRLNDGTKKYRADHFITNVQNDSTVKNKVLLAEKEEYEKLIRRKGGV 140 (153) T ss_pred cccc-cccccceeeecccCCccceeeeecccCcccCCCChhhHHHHHHhhHHHHHHHHHHHHHHHHHHhcCCe Confidence 4332 22222222333 3445899999999999999999999999876 7889999999999888887665 No 50 >protein:vir:107099 Length: 137 # NCBI annotation: conserved phage protein # Family: family:all:180 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950610;genbank:gi:119953690;genbank:GeneID:4643108 Probab=99.82 E-value=1.3e-23 Score=145.99 Aligned_cols=108 Identities=21% Similarity=0.310 Sum_probs=85.5 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) |++. +.|+|+|++.|+++++.+. 
+.++.+|.+++..++++|+.+||++||.|++||..........+ T Consensus 1 Ma~~---~~Gl~~l~~~l~~~~~~~~-~~~~~al~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~--------- 67 (137) T protein:vir:10 1 MAKV---KYGNWELVKELEDFEKETI-RWAKKGIAKTTTIIHNSIVSNMPVDTGYLRESVSMDFKKGGLTG--------- 67 (137) T ss_pred Cchh---HhhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCeeEEeeCCcEEE--------- Confidence 6666 4699999999999999875 56799999999999999999999999999999965322111000 Q ss_pred cccccccccceeEecCCCCCcceeeeeccCc-----------------------------cCCCCCcchhHhHHHHHHHH Q lcl|NC_019769. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVEMGT-----------------------------ANMTAHPFIRPAFDVRQEQA 131 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT-----------------------------~~~~a~PFl~pA~~~~k~~~ 131 (149) ...++..|++|+|||| ++|||||||+||++++++++ T Consensus 68 ---------------~V~~~~~Ya~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~i 132 (137) T protein:vir:10 68 ---------------VINIGSEYAVYVNYGTGIYAVGPGGSRAKNIPWCYKDADGHWHTTKGQHAQPFWEPAIDEGRAFF 132 (137) T ss_pred ---------------EEecCCCcccccccCccccccCCCccccccccceeeccccceeccCCCCCCcchhHHHHHHHHHH Confidence 0112345677777775 56899999999999999999 Q ss_pred HHHHH Q lcl|NC_019769. 132 TEVAI 136 (149) Q Consensus 132 ~~~~~ 136 (149) .+.|. T Consensus 133 ~k~i~ 137 (137) T protein:vir:10 133 NKYFS 137 (137) T ss_pred HHhcC Confidence 99998 No 51 >protein:vir:95894 Length: 137 # NCBI annotation: ORF046 # Family: family:all:180 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240389;genbank:gi:66396083;genbank:GeneID:5133405 Probab=99.82 E-value=1.8e-23 Score=145.32 Aligned_cols=108 Identities=22% Similarity=0.306 Sum_probs=88.6 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 
1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) |+++ +.|+++|.+.|+++++++ +++++.++..++..++++++.++|++||.|++||....... T Consensus 1 Ma~~---~~G~~~l~~~l~~~~~~~-~~~~~~~~~~~a~~v~~~ak~~aPv~TG~L~~Si~~~~~~~------------- 63 (137) T protein:vir:95 1 MAKV---KYGNWDLVKELENYERDM-ERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDG------------- 63 (137) T ss_pred Cchh---HHhHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhCCccchhhhcCeeeEeeCC------------- Confidence 6655 479999999999999986 57889999999999999999999999999999996432110 Q ss_pred cccccccccceeEecCCCCCcceeeeeccCc-----------------------------cCCCCCcchhHhHHHHHHHH Q lcl|NC_019769. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVEMGT-----------------------------ANMTAHPFIRPAFDVRQEQA 131 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT-----------------------------~~~~a~PFl~pA~~~~k~~~ 131 (149) . ..+...++..|+.|+|||| ++|||||||+||++++++++ T Consensus 64 -------~----~~~~V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~i 132 (137) T protein:vir:95 64 -------G----FTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFF 132 (137) T ss_pred -------c----eEEEEecCCCcccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHH Confidence 0 0011123467899999998 67999999999999999999 Q ss_pred HHHHH Q lcl|NC_019769. 132 TEVAI 136 (149) Q Consensus 132 ~~~~~ 136 (149) .+.|. T Consensus 133 ~k~l~ 137 (137) T protein:vir:95 133 NKYFS 137 (137) T ss_pred HHhhC Confidence 99999 No 52 >protein:vir:94108 Length: 149 # NCBI annotation: ORF029 # Family: family:all:180 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240238;genbank:gi:66395914;genbank:GeneID:5133277 Probab=99.82 E-value=1.3e-23 Score=146.05 Aligned_cols=108 Identities=19% Similarity=0.274 Sum_probs=87.7 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 
1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) |+++. .|+|+|.+.|+++++++ .++++.++..++..++++|+.++|++||.|++||....... . T Consensus 13 Ma~~~---~Gld~l~~~L~~~~~~~-~~~~~~al~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~----g-------- 76 (149) T protein:vir:94 13 MAKVK---YGADSMVVELDKFDKKI-EEWVKKGIAKTTTKIYNTAVALAPVDLGFLEESIDFKYFDG----G-------- 76 (149) T ss_pred HHHHH---HHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhCCcccchhhcCeeEEeeCC----c-------- Confidence 87753 39999999999999987 46889999999999999999999999999999997532110 0 Q ss_pred cccccccccceeEecCCCCCcceeeeeccCc-----------------------------cCCCCCcchhHhHHHHHHHH Q lcl|NC_019769. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVEMGT-----------------------------ANMTAHPFIRPAFDVRQEQA 131 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT-----------------------------~~~~a~PFl~pA~~~~k~~~ 131 (149) ..+....+..|+.|+|||| .+|||||||+||++++++++ T Consensus 77 ------------~~~~V~~~~~YA~~VE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~~~~i 144 (149) T protein:vir:94 77 ------------LSSVISVGADYAIYVEYGTGIYATGPGGSRATKIPWSFKGDDGEWYTTYGQAPQPFWNPAIDAGRKTF 144 (149) T ss_pred ------------EEEEEecCCCcccccccCccccccCCCccccccccceeecCccceecCCCCCCCcchHHHHHHHHHHH Confidence 0111123467899999998 45889999999999999999 Q ss_pred HHHHH Q lcl|NC_019769. 132 TEVAI 136 (149) Q Consensus 132 ~~~~~ 136 (149) .+.|. T Consensus 145 ~~~i~ 149 (149) T protein:vir:94 145 EQYFS 149 (149) T ss_pred HHhhC Confidence 99998 No 53 >protein:vir:94796 Length: 137 # NCBI annotation: ORF050 # Family: family:all:180 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240540;genbank:gi:66396237;genbank:GeneID:5133576 Probab=99.81 E-value=1.9e-23 Score=145.20 Aligned_cols=108 Identities=22% Similarity=0.302 Sum_probs=88.9 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 
1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) |++++ .|+|+|.+.|+++++++. +.++.+|..+|..++++++.++|++||.|++||....... T Consensus 1 Ma~~~---~G~~~l~~~L~~~~~~~~-~~~~~al~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~------------- 63 (137) T protein:vir:94 1 MAKVK---YGNWDLVKELENYERDIE-RWVKRGIAKTTVKIHNTIISLMPVDTGYLRESVTMDFKDG------------- 63 (137) T ss_pred CchhH---HhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCceeEeecC------------- Confidence 77774 499999999999999874 6789999999999999999999999999999996532110 Q ss_pred cccccccccceeEecCCCCCcceeeeeccC-----------------------------ccCCCCCcchhHhHHHHHHHH Q lcl|NC_019769. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVEMG-----------------------------TANMTAHPFIRPAFDVRQEQA 131 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~G-----------------------------T~~~~a~PFl~pA~~~~k~~~ 131 (149) . ..+....+..|+.|+||| |++|||||||+||++++++++ T Consensus 64 -------~----~~~~V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~ 132 (137) T protein:vir:94 64 -------G----FTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRVFF 132 (137) T ss_pred -------c----EEEEEecCCCcccccccCccccccCCCcccccccccceeccCCceeecCCcCCCcchHHHHHHHHHHH Confidence 0 001112346799999999 567999999999999999999 Q ss_pred HHHHH Q lcl|NC_019769. 132 TEVAI 136 (149) Q Consensus 132 ~~~~~ 136 (149) .+.|. T Consensus 133 ~~~l~ 137 (137) T protein:vir:94 133 NKYFS 137 (137) T ss_pred HHhhC Confidence 99999 No 54 >protein:vir:100887 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358767;genbank:gi:77999993;genbank:GeneID:3726158 Probab=99.81 E-value=7.5e-23 Score=141.88 Aligned_cols=136 Identities=16% Similarity=0.222 Sum_probs=105.0 Q ss_pred eeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeeeccc Q lcl|NC_019769. 
4 TSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVN 83 (149) Q Consensus 4 ~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~~~~ 83 (149) ++|+ +||++|+++|++|.... .+..++++++||+++++.++.++|++.-. ..+.....+|+.++|.+...+ T Consensus 1 v~~~-~~lee~l~~i~kl~~~~-~~~~~ki~kaGA~v~~e~L~~~tp~~~~~-------~~~~~~~~~HlaD~I~~s~~~ 71 (139) T protein:vir:10 1 MDMD-EALGQWLKQVSKAAELS-ISDQEKITKAGADVYAKKLAETTKEKHPN-------TKGDGGKYGHLSEDIRSAAGD 71 (139) T ss_pred CCHH-HHHHHHHHHHHHhhccC-HHHHHHHHHHHHHHHHHHHHHhcccccCc-------CCCCCCCCcchhhcceecCcc Confidence 3333 79999999999998643 35567899999999999999999975311 111222345778887776643 Q ss_pred ccccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019769. 84 PRTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAIDEALSK 149 (149) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~k~~~k 149 (149) .. +.......+++...+|++||+||||++|||+||+.++.+++++++++++.++|++.|++..+- T Consensus 72 ~d-g~~~g~~~VG~~k~~~~A~f~n~GT~k~~~~hFie~t~~e~~~evl~a~~~~~k~~l~~~~~~ 136 (139) T protein:vir:10 72 ID-GDHNGSSTVGFHNKAHIARFLNDGTKYIRADHFVDNARDDAKDAVFAAEAEKYQAMIAKANGG 136 (139) T ss_pred cc-cccceeeeeCCCCCcceEeecccCccccCCCchHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC Confidence 32 223333445666679999999999999999999999999999999999999999999987776 No 55 >protein:vir:105916 Length: 149 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004379;genbank:gi:122891834;genbank:GeneID:4712387 Probab=99.81 E-value=1.8e-23 Score=145.29 Aligned_cols=108 Identities=19% Similarity=0.273 Sum_probs=87.3 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) |+++. .|+|+|.+.|+++++++. 
++++.++.+++..++++|+.++|++||.|++||........ T Consensus 13 Ma~v~---~Gld~l~~~l~~~~~~~~-~~~~~~l~~~a~~v~~~ak~~aPvdTG~L~~SI~~~~~~~g------------ 76 (149) T protein:vir:10 13 MAKVK---YGADSMVVELDKFDKKIE-EWVKKGIAKTTTKIYNTAVALAPVDLGFLEESIDFKYFDGG------------ 76 (149) T ss_pred hHHHH---HHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCcccchhhccceEEecCCc------------ Confidence 87753 399999999999999874 68899999999999999999999999999999975321110 Q ss_pred cccccccccceeEecCCCCCcceeeeeccCc-----------------------------cCCCCCcchhHhHHHHHHHH Q lcl|NC_019769. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVEMGT-----------------------------ANMTAHPFIRPAFDVRQEQA 131 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT-----------------------------~~~~a~PFl~pA~~~~k~~~ 131 (149) ..+....+..|+.|+|||| .+|||||||+||++++++++ T Consensus 77 ------------~~~~V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~k~~i 144 (149) T protein:vir:10 77 ------------LSSVISVGADYAIYVEYGTGIYATGPGGSRATKIPWSFKGDDGEWYTTYGQAPQPFWNPAIDAGRKTF 144 (149) T ss_pred ------------EEEEEecCCCcccccccCccccccCCcccccccccceeeccccceecCCCCCCCcchhHHHHHHHHHH Confidence 0011123456888888887 55889999999999999999 Q ss_pred HHHHH Q lcl|NC_019769. 132 TEVAI 136 (149) Q Consensus 132 ~~~~~ 136 (149) .+.|. T Consensus 145 ~~~i~ 149 (149) T protein:vir:10 145 EQYFS 149 (149) T ss_pred HHhhC Confidence 99998 No 56 >protein:vir:105330 Length: 137 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950673;genbank:gi:119967843;genbank:GeneID:4643209 Probab=99.81 E-value=3.7e-23 Score=143.55 Aligned_cols=108 Identities=22% Similarity=0.329 Sum_probs=85.2 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) |+++. .|+|+|.+.|+++++.+. 
+.++.+|..++..|+++++.++|++||.|++||..........+. T Consensus 1 Ma~~~---~G~~~l~~~l~~~~~~~~-~~~~~al~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~-------- 68 (137) T protein:vir:10 1 MAKVK---YGNWDLVKELEEFEKETI-RWAKKGIAKTTTIIHNSIVSNMPVDTGYLRESVSMDFKKGGLTGV-------- 68 (137) T ss_pred Cccch---hCHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCeeeEecCCcEEEE-------- Confidence 77765 499999999999999875 566899999999999999999999999999999654221111100 Q ss_pred cccccccccceeEecCCCCCcceeeeeccCc-----------------------------cCCCCCcchhHhHHHHHHHH Q lcl|NC_019769. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVEMGT-----------------------------ANMTAHPFIRPAFDVRQEQA 131 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT-----------------------------~~~~a~PFl~pA~~~~k~~~ 131 (149) ...+..|+.|+|||| .+|||||||+||++++++++ T Consensus 69 ----------------V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~Pfl~pA~~~~~~~i 132 (137) T protein:vir:10 69 ----------------INIGSEYAVYVNYGTGIYAVGPGGSRAKNIPWRYKDADGHWHTTKGQHAQPFWEPAIDEGRAFF 132 (137) T ss_pred ----------------EecCCccccccccCccccccCCCcccccccceeeeccccccccCCCCCCCcchhHHHHHHHHHH Confidence 112345666666665 56999999999999999999 Q ss_pred HHHHH Q lcl|NC_019769. 132 TEVAI 136 (149) Q Consensus 132 ~~~~~ 136 (149) .+.|. T Consensus 133 ~k~i~ 137 (137) T protein:vir:10 133 NKYFS 137 (137) T ss_pred HHhhC Confidence 99998 No 57 >protein:vir:96121 Length: 137 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240082;genbank:gi:66395767;genbank:GeneID:5133101 Probab=99.79 E-value=1.2e-22 Score=140.76 Aligned_cols=108 Identities=22% Similarity=0.250 Sum_probs=87.9 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) |+++. .|+|+|++.|+++++.+. 
++++.+|..+|..++++|+.++|++||.|++||....... T Consensus 1 Ma~~~---~G~~~l~~~l~~~~~~~~-~~~~~~l~~~a~~~~~~ak~~~pvdTG~L~~Si~~~~~~~------------- 63 (137) T protein:vir:96 1 MAKVK---YGNWDLVAELEDYRDEME-EWVKKGILKTTLAIYNTAVALAPVDLGFLKESIDFKVTDG------------- 63 (137) T ss_pred CchhH---hhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCcCccchhcCceeEeecC------------- Confidence 77663 599999999999998874 6778999999999999999999999999999996532110 Q ss_pred cccccccccceeEecCCCCCcceeeeeccCc-----------------------------cCCCCCcchhHhHHHHHHHH Q lcl|NC_019769. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVEMGT-----------------------------ANMTAHPFIRPAFDVRQEQA 131 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT-----------------------------~~~~a~PFl~pA~~~~k~~~ 131 (149) ... .. ...+..|+.|+|||| .+|||||||+||++++++.+ T Consensus 64 -------g~~--~~--V~~~~~YA~yvE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~pFl~pA~~~~~~~i 132 (137) T protein:vir:96 64 -------GFS--SV--ISVGAEYAIYVEFGTGIYATGPGGSRARKLPWTYKGDDGEWHTTYGQQAQPFWNPAIDEGRKVF 132 (137) T ss_pred -------ceE--EE--EecCCCcccccccCccccccCCCccccccccceeeccCcceeecCCCCCCcchhHHHHHHHHHH Confidence 000 11 123467999999998 55899999999999999999 Q ss_pred HHHHH Q lcl|NC_019769. 132 TEVAI 136 (149) Q Consensus 132 ~~~~~ 136 (149) .+.|. T Consensus 133 ~k~i~ 137 (137) T protein:vir:96 133 NRYFS 137 (137) T ss_pred HHhhC Confidence 99998 No 58 >protein:vir:96829 Length: 135 # NCBI annotation: ORF033 # Family: family:all:180 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240161;genbank:gi:66395838;genbank:GeneID:5133170 Probab=99.79 E-value=1.3e-22 Score=140.62 Aligned_cols=108 Identities=22% Similarity=0.290 Sum_probs=87.5 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 
1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) |++.+ .|||+|.+.|+++++.+ ++.++.++..+++.++++|+.++|++||.|++||........ T Consensus 1 Ma~~~---~Gl~~l~~~l~~~~~~~-~~~~~~al~~~a~~v~~~ak~~apvdTG~Lr~SI~~~~~~~g------------ 64 (135) T protein:vir:96 1 MAKVK---YGADSIVVDLEKYSKDM-EKWVKKGITKTTLKIYNTAIHLMPVDTGFLRQSTTVDFENGG------------ 64 (135) T ss_pred Cchhh---hhHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeEEeecCc------------ Confidence 66643 39999999999999987 567899999999999999999999999999999965321000 Q ss_pred cccccccccceeEecCCCCCcceeeeeccCc---------------------------cCCCCCcchhHhHHHHHHHHHH Q lcl|NC_019769. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVEMGT---------------------------ANMTAHPFIRPAFDVRQEQATE 133 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT---------------------------~~~~a~PFl~pA~~~~k~~~~~ 133 (149) ..+...++..|+.|+|||| .+|||||||+||++.+++++.+ T Consensus 65 ------------~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~a~pfl~~A~~~~~~~~~~ 132 (135) T protein:vir:96 65 ------------FTGVVKIGSNYAVYVNYGTGIYATKGSRAHKIPWTYKDPNGKWHTTYGQMPQPFWEPAIDAGRQTFEQ 132 (135) T ss_pred ------------EEEEEecCCCccchhhcccccccCCCccccccccccccCCcceeecCCcCCCcchhHHHHHHHHHHHH Confidence 0011124567889999988 5699999999999999999888 Q ss_pred HHH Q lcl|NC_019769. 134 VAI 136 (149) Q Consensus 134 ~~~ 136 (149) .|. T Consensus 133 ~i~ 135 (135) T protein:vir:96 133 YFS 135 (135) T ss_pred hcC Confidence 888 No 59 >protein:vir:5000 Length: 141 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049974;genbank:gi:9632946;genbank:GeneID:1262109 Probab=99.78 E-value=1e-21 Score=135.65 Aligned_cols=136 Identities=19% Similarity=0.249 Sum_probs=101.7 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 
1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) |.++. +||++|+++|++|.... .+...+|+++||+++++.++.++|++.- +.+.....+|+.++|.+. T Consensus 1 M~~~~---~gl~e~~~~lekl~~~~-~~~~~katkAGA~v~~~~L~~~tp~~hy--------~~~~~~~~~HlaD~I~~~ 68 (141) T protein:vir:50 1 MVGLA---EALDEWLKTVASIGNLT-PAEQVEITTAGAKVFKKELEEVTREKHY--------SRKKNPKFGHMADGLAIQ 68 (141) T ss_pred CccHH---HHHHHHHHHHHHhcCCC-HHHHHHHHHHHHHHHHHHHHHhcccCCC--------CCCCCCCCCccccceeec Confidence 77755 99999999999999654 4667899999999999999999997531 122334456778877776 Q ss_pred cccccccccceeEecC--CCCCcceeeeeccCccCCCCCcchhHhHHHH--HHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019769. 81 GVNPRTGNSDNTMKAN--NPRNAFYWRFVEMGTANMTAHPFIRPAFDVR--QEQATEVAIRRMNQAIDEALSK 149 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~--~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~--k~~~~~~~~~~~~~~l~k~~~k 149 (149) ..+. .|.......++ ....+|++||+||||++|||+||+.+|.+.+ +++|++++.++|++.|++.=+- T Consensus 69 ~~~~-DG~~dg~s~VG~~~~~~~~~A~f~n~GT~k~~~~hFve~~~~~a~~k~~Vl~A~~~~~k~~l~~~~~~ 140 (141) T protein:vir:50 69 STNA-DGRKNGVSTVGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSTVQKKVLLEKKRNTKNSLEEKEGC 140 (141) T ss_pred cCcc-ccccCCeeeeccCCCccceeeeccccCccccCCCchhHHHHHhhhhHHHHHHHHHHHHHHHHHhccCC Confidence 5432 22222222233 3444899999999999999999999999854 7889999988888887765444 No 60 >protein:vir:4859 Length: 140 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049399;genbank:gi:9632427;genbank:GeneID:1258496 Probab=99.77 E-value=1.4e-21 Score=134.95 Aligned_cols=137 Identities=18% Similarity=0.176 Sum_probs=103.1 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 
1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) |.++. +||++|+++|++|.... .+...+|+++||+++++.++.++|+..- +.......+|+.++|.+. T Consensus 1 M~~~~---d~l~e~~~~lekl~~~~-~~~~~katkAGA~v~~~~L~~~tp~~h~--------~~~~t~~~~HlaD~I~~~ 68 (140) T protein:vir:48 1 MTGLD---EALEGWLKTVASIGDLT-PAEQAKITTAGAKVFKEELAEVTRQKHY--------SNKKHLKYGHMADGLSVQ 68 (140) T ss_pred CccHH---HHHHHHHHHHHHhccCC-HHHHHHHHHHHHHHHHHHHHHhccccCC--------CCCCCCCCCcchhceeec Confidence 87765 69999999999999754 4667899999999999999999997431 112233456888888876 Q ss_pred cccccccccceeEecC-CCCCcceeeeeccCccCCCCCcchhHhHHHH--HHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019769. 81 GVNPRTGNSDNTMKAN-NPRNAFYWRFVEMGTANMTAHPFIRPAFDVR--QEQATEVAIRRMNQAIDEALSK 149 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~-~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~--k~~~~~~~~~~~~~~l~k~~~k 149 (149) ..+..........+++ ....+|++||+||||++|||+||+.+|.+.+ +.++++++.+++++.|++-=+- T Consensus 69 ~~~iDg~~~g~s~VG~~kk~~a~~A~f~n~GT~k~~~~hFve~~~~e~~~k~~vl~A~~~~~~~~l~~~~~~ 140 (140) T protein:vir:48 69 STNVDGRKNGVSTVGWVNRYHAQNARRLNDGTKKYRADHFVTNVQNDSAVQTKVLLAEKEEYEKLIRKKGGE 140 (140) T ss_pred ccccccccCceeeeccCCCcceeeeeccccCccccCCCchhHHHHHhhhhHHHHHHHHHHHHHHHHHhhcCC Confidence 4332221111222332 3446999999999999999999999999965 7889999999998888875555 No 61 >protein:vir:100223 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025034;genbank:gi:48697267;genbank:GeneID:2948321 Probab=99.77 E-value=1.2e-21 Score=135.31 Aligned_cols=136 Identities=18% Similarity=0.243 Sum_probs=104.5 Q ss_pred eeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeeeccc Q lcl|NC_019769. 
4 TSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVN 83 (149) Q Consensus 4 ~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~~~~ 83 (149) |+|+ +||++|+++|++|.... .+...+|+.+||+++++.++.++|...-. ..+.....+|+.+.|.+...+ T Consensus 1 ~~~~-~~l~e~l~~lekl~~~~-~~~~~k~tkaGA~v~~~~L~~~tp~~~~~-------~~~~~~~~~HlaD~I~~~~~~ 71 (139) T protein:vir:10 1 MDMD-EALGQWLKQVSKAAQLS-VSDQEKITKAGADVYAKELAETTKEKHPN-------TKGDGGKYGHLSEDISSAAGD 71 (139) T ss_pred CCHH-HHHHHHHHHHHHhccCC-HHHHHHHHHHHHHHHHHHHHHhccccccc-------CCCCCCCCCcccccceecCcc Confidence 4444 79999999999998643 45567899999999999999999964210 111223346788887776533 Q ss_pred ccccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019769. 84 PRTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAIDEALSK 149 (149) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~k~~~k 149 (149) .. +.......+++...+|.+||+||||++|||+||+..+.+++++++++++.++|++.|++..+- T Consensus 72 id-g~~~g~~~VG~~~~~~~Ahf~n~GT~~~~~~hFie~t~~e~~~ev~~a~~~~~ke~l~~~~~~ 136 (139) T protein:vir:10 72 ID-GDHNGSSTVGFHNKAHIARFLNDGTKNIRADHFVDNARDDAKDAVFAAEAEKYQAMIAKANGG 136 (139) T ss_pred cc-ccccccceeCCCCCceeeeeeccCccccCCCchHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC Confidence 22 333344455666678899999999999999999999999999999999999999999887666 No 62 >protein:vir:79034 Length: 141 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110729;genbank:gi:134287346;genbank:GeneID:4955208 Probab=99.76 E-value=1.7e-21 Score=134.42 Aligned_cols=134 Identities=22% Similarity=0.287 Sum_probs=105.2 Q ss_pred Ccce-eeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceee Q lcl|NC_019769. 
1 MIET-SLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 1 Mm~~-~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~ 79 (149) |..+ +++++||++|.+.|+++......+.++.++++.|..+..+++.++|++||.|++|+.........+ + T Consensus 1 M~~~~~~d~~gl~~~~~~l~~~~~~~~~~~~~~~~~~~a~~l~~~vk~~tPVdTG~Lr~sw~~~~~~~~~~------~-- 72 (141) T protein:vir:79 1 MARWGSVDFREFKRVCKKMEKLTKIDLDKFCKDAARELAARLLGKVIRRTPVDTGFLRQGWNGVAYARSLP------V-- 72 (141) T ss_pred CCCCccCcHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhhcccccccccccccc------e-- Confidence 8886 899999999999999987654577889999999999999999999999999999875432110000 0 Q ss_pred ecccccccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhH--HHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019769. 80 RGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAF--DVRQEQATEVAIRRMNQAIDEALSK 149 (149) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~--~~~k~~~~~~~~~~~~~~l~k~~~k 149 (149) ...+.. +.+....+..||+|+||||+.++++||+.|++ +.+.+++.+.|.+.+.+.|++.+++ T Consensus 73 ----~~~g~~---~~v~v~n~~~YA~~VE~Ghr~~~~~gfV~G~fml~~s~~~~~~~~~~~~~~~l~~~l~~ 137 (141) T protein:vir:79 73 ----YKQGNN---YIIEVVNPTEYASYVNFGHRTKDGKGWVKGQHFLTISEMELQSQVDKIIEKKLLILLKG 137 (141) T ss_pred ----eecCCe---eEEEEecCCcchhhhhcceeecCCcceeCCchhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 000111 11122346789999999999999999999998 7777888888888889999998888 No 63 >protein:vir:94654 Length: 142 # NCBI annotation: tail component protein # Family: family:all:1084 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579211;genbank:gi:93007447;genbank:GeneID:5076773 Probab=99.76 E-value=1.6e-21 Score=134.53 Aligned_cols=115 Identities=25% Similarity=0.306 Sum_probs=89.4 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 
1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) |..++++|. +++|.+.|+.+.+.+. .+++.+|..+|..++++++.++|++||.|++||........ T Consensus 1 Ma~~~~~~~-~~~l~~~l~~~~~~~~-~~~~~~l~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~g------------ 66 (142) T protein:vir:94 1 MAGLNYRVN-STEFQGALRAALDRLT-GAAREATEAAANDMVNMAKGLCPVDTGRLRSSIQAVPSGGR------------ 66 (142) T ss_pred CceeEEEec-HHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeccCC------------ Confidence 888998884 8999999999998864 67899999999999999999999999999999965321110 Q ss_pred cccccccccceeEecCCCCCcceeeeeccCccC---------------------------CCCCcchhHhHHHHHHHHHH Q lcl|NC_019769. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVEMGTAN---------------------------MTAHPFIRPAFDVRQEQATE 133 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~---------------------------~~a~PFl~pA~~~~k~~~~~ 133 (149) ..+.+....+..|+.|+||||.. ++|||||+||++.+++++.+ T Consensus 67 ----------~~~~~~v~~~~~YA~~vE~Gt~~~~i~pk~~k~l~~~~~~~~~~~v~~pG~~~~pfl~~A~~~~~~~i~~ 136 (142) T protein:vir:94 67 ----------FSFSVTIGTNVTYAADVEYGTAPHVIVPKDKKALYWPGAAHPVAKVNHPGTRAQPFMRPAIAAASTFLRN 136 (142) T ss_pred ----------ceEEEEEecCcccchhhhccCCCceeccCCCccceecccceeeeeeeecCCCCCcchhHHHHHHHHHHHH Confidence 00111112357899999999842 78999999999999877755 Q ss_pred HHHHHHH Q lcl|NC_019769. 134 VAIRRMN 140 (149) Q Consensus 134 ~~~~~~~ 140 (149) .+ ++|+ T Consensus 137 ~~-~~~~ 142 (142) T protein:vir:94 137 HA-KGIR 142 (142) T ss_pred HH-HhcC Confidence 44 3444 No 64 >protein:vir:4833 Length: 140 # NCBI annotation: ORF29 # Family: family:all:1029 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038330;genbank:gi:9634656;genbank:GeneID:1262624 Probab=99.75 E-value=7.6e-21 Score=130.88 Aligned_cols=137 Identities=15% Similarity=0.166 Sum_probs=104.8 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 
1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) |.++. .||++|+++|++|...+ .+.-.+|+.+||+++++.++.++|++.-. .+.....+|+.++|.+. T Consensus 1 M~~~~---d~l~e~~~~v~kl~~~~-~~~~~katkAGAkv~~~~L~~~tp~~h~~--------~r~t~~~~HlaD~I~~~ 68 (140) T protein:vir:48 1 MTGLD---EALEGWLKTVASIGDLT-PAEQAKITTAGAKVFKKELAEVTREKHYS--------KKKDLKYGHMADGLAVQ 68 (140) T ss_pred CccHH---HHHHHHHHHHHHhccCC-HHHHHHHHHHhHHHHHHHHHHhcccCCCC--------CCCCCCCCcccccceec Confidence 87766 69999999999999754 35668999999999999999999986411 12334557888888877 Q ss_pred cccccccccceeEecCC-CCCcceeeeeccCccCCCCCcchhHhHHHH--HHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019769. 81 GVNPRTGNSDNTMKANN-PRNAFYWRFVEMGTANMTAHPFIRPAFDVR--QEQATEVAIRRMNQAIDEALSK 149 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~-~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~--k~~~~~~~~~~~~~~l~k~~~k 149 (149) ..+..........+++. ...+|+++|+++||++|||+||+..+.+.+ ++++++++.+++++.|.+--+- T Consensus 69 ~~~idg~~dG~s~VG~~k~~~a~~a~f~NdGT~k~~~~hFve~t~~e~~~~~~vl~A~~~~y~~~l~kk~~~ 140 (140) T protein:vir:48 69 STNVDGRKNGVATVGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSAVRDKVLLAEKEEYEKLIRKKGGE 140 (140) T ss_pred ccccccccccceeecccCCCceeEEeecccCccccCCCchHHHHHHhhhhHHHHHHHHHHHHHHHHHhhcCC Confidence 53322211111223333 335899999999999999999999999854 8899999999999998877666 No 65 >protein:vir:8669 Length: 142 # NCBI annotation: gp27 # Family: family:all:1084 # MgeID: mge:156 # MgeName: Rosebush # Cross-refs: genbank:acc:NP_817788;genbank:gi:29566220;genbank:GeneID:1259476 Probab=99.74 E-value=2.3e-21 Score=133.75 Aligned_cols=113 Identities=22% Similarity=0.291 Sum_probs=85.5 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 
1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) ||++++.++||+. .|+.+...+ ..+++.++...+..++.+|+.++|++||.|++||.......... T Consensus 1 m~~~~~~~~gl~~---~l~~~~~~~-~~~~~~~i~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~~~~---------- 66 (142) T protein:vir:86 1 MVQVSVRYEGFDY---NPVGAAAQV-GPILRRTHSSLTRQIANETRARVPVLTGHLGRSVREDPQVMVTP---------- 66 (142) T ss_pred CceeEEEeeecch---hHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeeeecccccc---------- Confidence 9999999999986 445555554 46789999999999999999999999999999997543211111 Q ss_pred cccccccccceeEecCCCCCcceeeeeccCcc-----------------------------CCCCCcchhHhHHHHHHHH Q lcl|NC_019769. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVEMGTA-----------------------------NMTAHPFIRPAFDVRQEQA 131 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~-----------------------------~~~a~PFl~pA~~~~k~~~ 131 (149) ..+.+...+++.|+.|+||||. .++|||||+||+++++++. T Consensus 67 ----------~~~~~~v~~~a~YA~~ve~GT~ph~i~pk~~~al~f~~~g~~~~~k~v~hpG~~a~Pfl~~A~~~~~~~~ 136 (142) T protein:vir:86 67 ----------FHVSGGVTAHAKYAAAVHEGTRPHVIRAKHAQALHFWWRGREVFVRQVNHPGTRARPYLRNAGEAVVRRD 136 (142) T ss_pred ----------ceEEEEeccCccccceeccCCccceeccccCceeeEecCCceeeeeeeecCCCCCCchhHHHHHHHHhhh Confidence 0111222346789999999984 3669999999999998887 Q ss_pred HHHHHH Q lcl|NC_019769. 132 TEVAIR 137 (149) Q Consensus 132 ~~~~~~ 137 (149) ..+..+ T Consensus 137 ~~~~~r 142 (142) T protein:vir:86 137 RRIRVR 142 (142) T ss_pred hhhccC Confidence 776666 No 66 >protein:vir:99101 Length: 142 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1608 # MgeName: Qyrzula # Cross-refs: genbank:acc:YP_655705;genbank:gi:109521783;genbank:GeneID:4157823 Probab=99.74 E-value=2.3e-21 Score=133.75 Aligned_cols=113 Identities=22% Similarity=0.291 Sum_probs=85.5 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 
1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) ||++++.++||+. .|+.+...+ ..+++.++...+..++.+|+.++|++||.|++||.......... T Consensus 1 m~~~~~~~~gl~~---~l~~~~~~~-~~~~~~~i~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~~~~---------- 66 (142) T protein:vir:99 1 MVQVSVRYEGFDY---NPVGAAAQV-GPILRRTHSSLTRQIANETRARVPVLTGHLGRSVREDPQVMVTP---------- 66 (142) T ss_pred CceeEEEeeecch---hHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeeeecccccc---------- Confidence 9999999999986 445555554 46789999999999999999999999999999997543211111 Q ss_pred cccccccccceeEecCCCCCcceeeeeccCcc-----------------------------CCCCCcchhHhHHHHHHHH Q lcl|NC_019769. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVEMGTA-----------------------------NMTAHPFIRPAFDVRQEQA 131 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~-----------------------------~~~a~PFl~pA~~~~k~~~ 131 (149) ..+.+...+++.|+.|+||||. .++|||||+||+++++++. T Consensus 67 ----------~~~~~~v~~~a~YA~~ve~GT~ph~i~pk~~~al~f~~~g~~~~~k~v~hpG~~a~Pfl~~A~~~~~~~~ 136 (142) T protein:vir:99 67 ----------FHVSGGVTAHAKYAAAVHEGTRPHVIRAKHAQALHFWWRGREVFVRQVNHPGTRARPYLRNAGEAVVRRD 136 (142) T ss_pred ----------ceEEEEeccCccccceeccCCccceeccccCceeeEecCCceeeeeeeecCCCCCCchhHHHHHHHHhhh Confidence 0111222346789999999984 3669999999999998887 Q ss_pred HHHHHH Q lcl|NC_019769. 
132 TEVAIR 137 (149) Q Consensus 132 ~~~~~~ 137 (149) ..+..+ T Consensus 137 ~~~~~r 142 (142) T protein:vir:99 137 RRIRVR 142 (142) T ss_pred hhhccC Confidence 776666 No 67 >protein:vir:81147 Length: 126 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285816;genbank:gi:148747737;genbank:GeneID:5247190 Probab=99.69 E-value=1.2e-19 Score=124.40 Aligned_cols=120 Identities=18% Similarity=0.294 Sum_probs=92.1 Q ss_pred CcceeeehHhH-HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceee Q lcl|NC_019769. 1 MIETSLDFSGL-NDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 1 Mm~~~~~i~Gl-~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~ 79 (149) |.. |+|++| +++.+.|+++++.+ .+.++.++.++|+.++++++.++|+.||.|++++.+.... ..+ T Consensus 1 Ma~--i~id~la~~I~~~L~~y~~~v-~~~v~~~v~~~a~~~~~~ik~~aP~rTG~y~ksw~vk~~~--~~g-------- 67 (126) T protein:vir:81 1 MAN--ITIDRLADELLQAVKEYTDDV-AEGVRKKVDETARKVLKEAQALAPKRTGEYARTFTITKED--GYG-------- 67 (126) T ss_pred Ccc--cchhhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhCCcccchhhccccccccc--cCC-------- Confidence 665 667777 55888899999976 4677999999999999999999999999999998654211 001 Q ss_pred ecccccccccceeEecCCCCCcceeeeeccCccC-----CCCCcchhHhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 
80 RGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTAN-----MTAHPFIRPAFDVRQEQATEVAIRRMNQAI 143 (149) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~-----~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l 143 (149) ....++.+.......|++||||.+ +||+|||+||++...+++.+.|.+.|+..= T Consensus 68 ----------~~~~vv~~~~~~~l~HLLEfGha~r~gGrV~a~Phi~Pa~e~~~~~~~~~i~~~l~~gg 126 (126) T protein:vir:81 68 ----------TTKRIIWNKKHYRRVHLLEFGHAKVNGGRVKEYPHLRPAYDKHGARLPDELKRVIENGG 126 (126) T ss_pred ----------cceEEEeccCCCCceeeeecceecCCCCccCCCcchHHHHHHHHHHHHHHHHHHhhcCC Confidence 111233444556778999999997 899999999999988877777666666433 No 68 >protein:vir:105467 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529877;genbank:gi:90592617;genbank:GeneID:3974531 Probab=99.58 E-value=1.4e-17 Score=112.97 Aligned_cols=124 Identities=15% Similarity=0.137 Sum_probs=90.2 Q ss_pred CcceeeehHhHHHHHHHHHHhHHH-HHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceee Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSRA-ENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~-~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~ 79 (149) |...+|+++||++|.+.|+++... ...+.++.++...|..+.+++++++|++||.|++|+...... T Consensus 1 Ms~~~id~~gl~~~~~~l~~~~~~~~~~~~~~~~l~~~~~~~~~~vk~~tPVdTG~Lr~S~~~~~~~------------- 67 (144) T protein:vir:10 1 MSLGHVDDAQFQQFASRVRQKIDSGYVKQELGKSSRRIGTQSLRILEANTPVKQGNLRRSWTAEGPT------------- 67 (144) T ss_pred CCCCCccHHHHHHHHHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHhCCCCcchhccceeeccee------------- Confidence 666799999999999999998753 235678999999999999999999999999999998653211 Q ss_pred ecccccccccceeEecCCCCCcceeeeeccCccCC-----------------CCCcchhHhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 80 RGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTANM-----------------TAHPFIRPAFDVRQEQATEVAIRRMNQA 142 (149) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~-----------------~a~PFl~pA~~~~k~~~~~~~~~~~~~~ 142 (149) ..+.. 
..+...++..|++|+||||+.+ +.+|||.+|.+..+..+.+. +.+. T Consensus 68 -----~~~~~---~~~~V~n~~~YA~~VE~Ghr~~~G~~v~~~~~~~~~g~V~G~~~~~~a~~~~~~~~~~~----l~k~ 135 (144) T protein:vir:10 68 -----YGCGG---WTIKLINNAEYASYVESGHRQTPGRYVPVLKKRLVRDWVPGQFYMKKSIPQIQRQLPQL----VTEG 135 (144) T ss_pred -----eecCe---eEEEEecCCCcccccccceeecCCcccccCCCccccceecCccchHHHHHHHHHHHHHH----HHHH Confidence 00111 1111235678999999999754 56788888887755554444 4444 Q ss_pred HHHHhcC Q lcl|NC_019769. 143 IDEALSK 149 (149) Q Consensus 143 l~k~~~k 149 (149) |+++... T Consensus 136 l~~l~d~ 142 (144) T protein:vir:10 136 LWGLKDL 142 (144) T ss_pred HHHHhhh Confidence 4555444 No 69 >protein:vir:1243 Length: 116 # NCBI annotation: similar to phage Spp1 gp16.1 # Family: family:all:180 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510942;genbank:gi:17426276;genbank:GeneID:927389 Probab=99.57 E-value=2.9e-18 Score=116.73 Aligned_cols=87 Identities=20% Similarity=0.275 Sum_probs=69.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeeecccccccccceeEecCCCCCcceee Q lcl|NC_019769. 26 NNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWR 105 (149) Q Consensus 26 ~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~y~~ 105 (149) .+++++.++.+++..++++|+.++|++||+|++||........ + .+....+..|+. T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~----~--------------------~~~V~~~~~YA~ 56 (116) T protein:vir:12 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDGG----F--------------------TGVINIGSEYAI 56 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcCcccccccceEEeecCc----E--------------------EEEEecCCCccc Confidence 4678899999999999999999999999999999965321110 0 001123456777 Q ss_pred eeccC-----------------------------ccCCCCCcchhHhHHHHHHHHHHHHH Q lcl|NC_019769. 106 FVEMG-----------------------------TANMTAHPFIRPAFDVRQEQATEVAI 136 (149) Q Consensus 106 ~~E~G-----------------------------T~~~~a~PFl~pA~~~~k~~~~~~~~ 136 (149) |+||| |..|+|||||+||++++++.+.+.|. 
T Consensus 57 yvE~GTg~~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 116 (116) T protein:vir:12 57 YVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred ccccCCcccccCCCcccccccceeeecCCceeeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 77877 78899999999999999999988888 No 70 >protein:vir:97327 Length: 116 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240615;genbank:gi:66396305;genbank:GeneID:5133683 Probab=99.57 E-value=2.9e-18 Score=116.73 Aligned_cols=87 Identities=20% Similarity=0.275 Sum_probs=69.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeeecccccccccceeEecCCCCCcceee Q lcl|NC_019769. 26 NNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWR 105 (149) Q Consensus 26 ~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~y~~ 105 (149) .+++++.++.+++..++++|+.++|++||+|++||........ + .+....+..|+. T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~----~--------------------~~~V~~~~~YA~ 56 (116) T protein:vir:97 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDGG----F--------------------TGVINIGSEYAI 56 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcCcccccccceEEeecCc----E--------------------EEEEecCCCccc Confidence 4678899999999999999999999999999999965321110 0 001123456777 Q ss_pred eeccC-----------------------------ccCCCCCcchhHhHHHHHHHHHHHHH Q lcl|NC_019769. 106 FVEMG-----------------------------TANMTAHPFIRPAFDVRQEQATEVAI 136 (149) Q Consensus 106 ~~E~G-----------------------------T~~~~a~PFl~pA~~~~k~~~~~~~~ 136 (149) |+||| |..|+|||||+||++++++.+.+.|. 
T Consensus 57 yvE~GTg~~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 116 (116) T protein:vir:97 57 YVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred ccccCCcccccCCCcccccccceeeecCCceeeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 77877 78899999999999999999988888 No 71 >protein:vir:95062 Length: 116 # NCBI annotation: ORF044 # Family: family:all:180 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240827;genbank:gi:66394711;genbank:GeneID:5133856 Probab=99.57 E-value=2.8e-18 Score=116.78 Aligned_cols=87 Identities=20% Similarity=0.258 Sum_probs=68.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeeecccccccccceeEecCCCCCcceee Q lcl|NC_019769. 26 NNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWR 105 (149) Q Consensus 26 ~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~y~~ 105 (149) .+++++.++..++..|+.+|+.++|++||+|++||.......... +....+..|+. T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~apv~TG~Lr~SI~~~~~~~~~~------------------------~~V~~~~~Ya~ 56 (116) T protein:vir:95 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDGGFT------------------------GVINIGSEYAI 56 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHhhCCccccccccceeEEeecCcEE------------------------EEEecCCCccc Confidence 467889999999999999999999999999999996532111100 00112345777 Q ss_pred eeccC-----------------------------ccCCCCCcchhHhHHHHHHHHHHHHH Q lcl|NC_019769. 106 FVEMG-----------------------------TANMTAHPFIRPAFDVRQEQATEVAI 136 (149) Q Consensus 106 ~~E~G-----------------------------T~~~~a~PFl~pA~~~~k~~~~~~~~ 136 (149) |+||| |..|+|||||+||++.+++.+.+.|. 
T Consensus 57 yvE~GTg~~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~is 116 (116) T protein:vir:95 57 YVNYGTGIYATGAGGSRAKNIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred eeecCccccccCCCccccccccceeecCccceeeCCCCCCCcchHHHHHHHHHHHHHhhC Confidence 77777 77899999999999999999999888 No 72 >protein:vir:78077 Length: 141 # NCBI annotation: gp9 # Family: family:all:180 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468793;genbank:gi:157325374;genbank:GeneID:5601839 Probab=99.56 E-value=1e-17 Score=113.77 Aligned_cols=115 Identities=17% Similarity=0.179 Sum_probs=80.1 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) |-+++|+ +...+.++++.+.+.+.+...|+..+++.++..|+.++|++||.|++||........ T Consensus 1 ~~~~~f~----~~~~~~~~~~~k~~~~~~~~~a~~~~~~~ie~~ak~~~pvdtG~L~~SI~~~v~~~g------------ 64 (141) T protein:vir:78 1 MNEFEFD----SNIPKARKLIEKKVLQALEDIGEHMTTELAEGGHGVTSNNDTGEYAQKSGYKVRKSS------------ 64 (141) T ss_pred CcchhHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccchhhcceeeeeecCC------------ Confidence 7777776 344444555555544444344688889999999999999999999999965321110 Q ss_pred cccccccccceeEecCCCCCcceeeeeccCc--------------------------cCCCCCcchhHhHHHHHHHHHHH Q lcl|NC_019769. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVEMGT--------------------------ANMTAHPFIRPAFDVRQEQATEV 134 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT--------------------------~~~~a~PFl~pA~~~~k~~~~~~ 134 (149) .. +.+ ..+..|+-|+|||| +.|||||||+||++.+++++.+. 
T Consensus 65 --------~~--~~V--~~~~~YA~yVE~GTG~~~~~~~grk~~w~y~~~~g~~~~t~G~~aqpFl~~A~~~~~~~i~~~ 132 (141) T protein:vir:78 65 --------KE--VIV--GNSSDYAIYYEFGTGEKSERGGGKAGGWFYMDKKGHWHFTRGSQASKRMRYTFRDEQDKVRVF 132 (141) T ss_pred --------cE--EEE--ecCCCccceeecCCcccccCCCCCcCcceeecCCCeeEeccCCCCchhhhhhHHhhHHHHHHH Confidence 00 111 13456888888887 56999999999999999998887 Q ss_pred HHHHHHHHHH Q lcl|NC_019769. 135 AIRRMNQAID 144 (149) Q Consensus 135 ~~~~~~~~l~ 144 (149) |.+.|+ .|+ T Consensus 133 i~~~~~-~l~ 141 (141) T protein:vir:78 133 TERALR-GIN 141 (141) T ss_pred HHHHhh-ccC Confidence 776664 344 No 73 >protein:vir:3848 Length: 159 # NCBI annotation: hypothetical protein # Family: family:all:1029 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050154;swissprot:trembl:q9t1f3;genbank:gi:9633046;uniprot:Q9T1F3;genbank:GeneID:1262148 Probab=99.56 E-value=3.3e-17 Score=110.94 Aligned_cols=146 Identities=12% Similarity=0.170 Sum_probs=109.0 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceec------cccccccCccc Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVV------TQKSRRRGEIS 74 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~------~~~~~~~~~~~ 74 (149) ||. +|+ .+|++++++|+++.... .+.-.++..+||+++++..+..+|++.-..++..... .......+|+. T Consensus 1 mm~-~~~-~~l~~~l~~v~k~~~~~-~~~k~kiTkAGAkv~~e~L~~~Tp~~h~~~~k~~~~~~~~~k~~~~~~~~~Hla 77 (159) T protein:vir:38 1 MAN-DMG-EFYNNWVNEVEKGMKLS-VEDKAKITGEGAEAFSTVLHDHTPRSNEIYRRGRSAGHANAKHHNRNRKTKHLQ 77 (159) T ss_pred Ccc-hHH-HHHHHHHHHHHHhcCCC-HHHHHHHHHHhHHHHHHHHHHhcccCCCccccccccccccccccCcCcCCCccc Confidence 443 355 77999999998865432 2344688999999999999999998754443322211 12345578999 Q ss_pred cceeeecccccccccceeEecCC--CCCcceeeeeccCccCCCCC-----cchhHhHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019769. 
75 SGVHIRGVNPRTGNSDNTMKANN--PRNAFYWRFVEMGTANMTAH-----PFIRPAFDVRQEQATEVAIRRMNQAIDEAL 147 (149) Q Consensus 75 ~~i~~~~~~~~~~~~~~~~~~~~--~~~~~y~~~~E~GT~~~~a~-----PFl~pA~~~~k~~~~~~~~~~~~~~l~k~~ 147 (149) ++|.+..+....|...+...+++ ...+|+++|+++||++|||+ ||+..+.++.+++|++++.+++++-|+..- T Consensus 78 D~I~~~~~~~iDg~~dG~s~VGw~~~~~a~~a~f~NdGT~~m~~k~~~gdHFvekt~~~~k~~Vl~A~~~~~~~il~~~~ 157 (159) T protein:vir:38 78 DSITYKPGYTADKLHTGDTDVGFEGKYYDFLAKIVNNGQHHMSPKRYKNMHFLDKAQQEAKKSVAEAELKAYKEVMNHDS 157 (159) T ss_pred cceeeecCccccccccceeeecccCCccceEeeecccCccccCCCCccCChhHHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 99988665433334333344444 44489999999999999998 899999999999999999999999998888 Q ss_pred cC Q lcl|NC_019769. 148 SK 149 (149) Q Consensus 148 ~k 149 (149) -| T Consensus 158 ~~ 159 (159) T protein:vir:38 158 DK 159 (159) T ss_pred CC Confidence 88 No 74 >protein:vir:99528 Length: 92 # NCBI annotation: putative major tail protein # Family: family:all:180 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958541;genbank:gi:41179323;genbank:GeneID:2717166 Probab=99.52 E-value=2.3e-17 Score=111.76 Aligned_cols=92 Identities=20% Similarity=0.319 Sum_probs=72.4 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) |++++++|+|+|+|++.|++.... ..++.+++..+..++.+|+.+||++||.|++||...... +.+.. T Consensus 1 Ma~~~i~~~Gld~L~~~L~~~~~~---~~v~~vv~~~~~~l~~~ak~~ap~dTG~lrrSI~~~~~~----~g~~~----- 68 (92) T protein:vir:99 1 MADYSISWDGLDALDEALANQQNM---NTVKKVVKKHTANLMTATQQAVPVDTGHLKQSAQIQISR----DGFTG----- 68 (92) T ss_pred CCceeeEeehHHHHHHHHHhhccH---HHHHHHHHHHHHHHHHHHHHhCCCCccccceeeeEEeec----CCeeE----- Confidence 999999999999999999987653 235788999999999999999999999999999764321 11111 Q ss_pred cccccccccceeEecCCCCCcceeeeeccCccCCCC Q lcl|NC_019769. 
81 GVNPRTGNSDNTMKANNPRNAFYWRFVEMGTANMTA 116 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a 116 (149) .+......+.|+.|+||||++|+| T Consensus 69 ------------~v~~~gp~a~Ya~YvE~GTR~M~A 92 (92) T protein:vir:99 69 ------------SVTYGGGLVNYAAYVEFGTRFMDS 92 (92) T ss_pred ------------EEEeccCccccccccccceeecCC Confidence 111112456799999999999999 No 75 >protein:vir:100652 Length: 134 # NCBI annotation: 77ORF029 # Family: family:all:589 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958610;genbank:gi:41189542;genbank:GeneID:2743798 Probab=99.48 E-value=1.7e-16 Score=107.00 Aligned_cols=121 Identities=21% Similarity=0.246 Sum_probs=88.0 Q ss_pred eeeehHhHHHHHHHHHHh--HHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--CCCcccccceeccccccccCccccceee Q lcl|NC_019769. 4 TSLDFSGLNDIAKDLEAL--SRAENNKVLRDATRAGAEVLKEEVIARAPV--RTGKLKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 4 ~~~~i~Gl~~l~~~l~~l--~~~~~~k~~~~al~~~a~~v~~~ak~~aP~--~~g~l~~~i~~~~~~~~~~~~~~~~i~~ 79 (149) |++++.|++||+++|++. +.. ..++..+||.++++.|.++++.+.++ |||.+.+++..+..... T Consensus 1 MsvevkGv~eil~~LE~k~g~~~-~~ri~dkAL~~age~v~~~~K~~~~~fkDTGati~ev~~s~p~~~----------- 68 (134) T protein:vir:10 1 MSVKVTGDKALERELEKHFGIKE-MVKVQDKALIAGAKVIVEEIKKQLKPSEDSGALISEIGRTEPEWI----------- 68 (134) T ss_pred CeEEeecHHHHHHHHHHhhchhh-hhhhhhHHHHHHhHHHHHHHHhhcCccccccceeccEeecCeeec----------- Confidence 899999999999999987 544 57899999999999999999998776 99998888776543211 Q ss_pred ecccccccccceeEec-CCCCCcceeeeeccCccCCCCCcchhH--------hHHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 80 RGVNPRTGNSDNTMKA-NNPRNAFYWRFVEMGTANMTAHPFIRP--------AFDVRQEQATEVAIRRMNQA 142 (149) Q Consensus 80 ~~~~~~~~~~~~~~~~-~~~~~~~y~~~~E~GT~~~~a~PFl~p--------A~~~~k~~~~~~~~~~~~~~ 142 (149) .|.....+.. 
+..+...+-|+.|||+.++...+|++| |+++.+..+.+.++++|++- T Consensus 69 ------~G~r~V~vgW~G~~~R~~ivHLnE~Gyt~~r~Gk~i~PrG~G~i~~a~~~~e~~~~~~ik~eL~kl 134 (134) T protein:vir:10 69 ------KGKRTVTIRWRGPFERFRIVHLIENGHVEKKSGKFVKPKAMGGINRAIRQGQNKYFETLKRELKKL 134 (134) T ss_pred ------CCceEEEEEEEcCCceeeEEEeeecceeecCCCCeeccchhhHHHHHHHhhhHHHHHHHHHHHhcC Confidence 1111111111 224457888999999999999999999 55555555444444444433 No 76 >protein:vir:81067 Length: 119 # NCBI annotation: p12 # Family: family:all:2714 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285682;genbank:gi:156535145;genbank:GeneID:5247112 Probab=99.47 E-value=3.9e-17 Score=110.56 Aligned_cols=92 Identities=24% Similarity=0.285 Sum_probs=76.5 Q ss_pred HHHHHHHhCCcCCCcccccceeccccccccCccccceeeecccccccccceeE-ecCCCCCcceeeeeccC--------- Q lcl|NC_019769. 41 LKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVNPRTGNSDNTM-KANNPRNAFYWRFVEMG--------- 110 (149) Q Consensus 41 v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~-~~~~~~~~~y~~~~E~G--------- 110 (149) ++|+++..+|+.+|.|+++|........+. +....+ ++++...++|||++||| T Consensus 1 ~rDeakarv~~~~G~Lr~sIY~ay~~~~S~-----------------dG~~~Y~Vswn~rkAPhghlvE~Ghw~~~~~~~ 63 (119) T protein:vir:81 1 MRESAKAFVNDETGKLRSNLYVAYSPEEST-----------------NGVQTYAVSWRKKAAPHGHLLEFGHWQTHAAYK 63 (119) T ss_pred CCcccccccCCCccchhhhheeeeccccCC-----------------CCeEEEEeeccCCcCCcccccccceeeeeeeee Confidence 999999999999999999997654332221 111222 34556779999999999 Q ss_pred ---------------ccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019769. 
111 ---------------TANMTAHPFIRPAFDVRQEQATEVAIRRMNQAIDEALSK 149 (149) Q Consensus 111 ---------------T~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~k~~~k 149 (149) |+++||+||||||||+...++.++|.+.+.+.+.+++.= T Consensus 64 ~~dG~w~~~~~~l~~~~~vPa~pFlRpA~da~~~~a~~~~~~r~~~rv~Ev~rg 117 (119) T protein:vir:81 64 GKDGEWYSSSVKLVNPKWIPARPFLRPGYDSVAMQIPDIAKAAGAKKYAELQRG 117 (119) T ss_pred ccCceeeecCccccCceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 899999999999999999999999999999999999987 No 77 >protein:vir:10367 Length: 119 # NCBI annotation: conserved phage protein # Family: family:all:2714 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858959;genbank:gi:32128424;genbank:GeneID:2648366 Probab=99.46 E-value=4.7e-17 Score=110.09 Aligned_cols=92 Identities=24% Similarity=0.287 Sum_probs=76.4 Q ss_pred HHHHHHHhCCcCCCcccccceeccccccccCccccceeeecccccccccceeE-ecCCCCCcceeeeeccC--------- Q lcl|NC_019769. 41 LKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVNPRTGNSDNTM-KANNPRNAFYWRFVEMG--------- 110 (149) Q Consensus 41 v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~-~~~~~~~~~y~~~~E~G--------- 110 (149) ++|+++..+|+.+|.|+++|........+. +....+ ++++...++|||++||| T Consensus 1 ~rDeakarv~~~~G~Lr~sIY~ay~~~~S~-----------------dG~~~Y~Vswn~rkAPhghlvE~Ghw~~~~~~~ 63 (119) T protein:vir:10 1 MRESAKAFVNDETGKLRSNLYVAYSTEEST-----------------NGVQTYAVSWRKKAAPHGHLLEFGHWQTHAAYK 63 (119) T ss_pred CCcccccccCCCccchhhhheeeeccccCC-----------------CCEEEEEeecCCCcCCcccccccceeeeeeeee Confidence 999999999999999999997654332221 112222 34556779999999999 Q ss_pred ---------------ccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019769. 
111 ---------------TANMTAHPFIRPAFDVRQEQATEVAIRRMNQAIDEALSK 149 (149) Q Consensus 111 ---------------T~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~k~~~k 149 (149) ++++||+||||||||+...++.++|.+.+.+.+.+++.= T Consensus 64 ~~dG~w~~~~~~l~~~~~vPa~pFlRpA~da~~~~a~~~~~~r~~~rv~Ev~rg 117 (119) T protein:vir:10 64 GKDGEWYSSSVKLVNPKWIPARPFLRPGYDSVAMQIPDIAKAAGAKKYAELQRG 117 (119) T ss_pred ccCceeeecCccccCceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 889999999999999999999999999999999999987 No 78 >protein:vir:9513 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835560;genbank:gi:30043947;genbank:GeneID:1260542 Probab=99.44 E-value=5.9e-16 Score=104.09 Aligned_cols=121 Identities=24% Similarity=0.258 Sum_probs=88.2 Q ss_pred eeeehHhHHHHHHHHHHh--HHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--CCCcccccceeccccccccCccccceee Q lcl|NC_019769. 4 TSLDFSGLNDIAKDLEAL--SRAENNKVLRDATRAGAEVLKEEVIARAPV--RTGKLKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 4 ~~~~i~Gl~~l~~~l~~l--~~~~~~k~~~~al~~~a~~v~~~ak~~aP~--~~g~l~~~i~~~~~~~~~~~~~~~~i~~ 79 (149) |+|++.|++||+++|++. +.. ..++..+||.++++.|.++++.+.++ |||.+.+.+..+..... T Consensus 1 msvevkGv~eil~~le~k~g~~~-~~ri~nkAL~~age~v~~~~K~~~~~fkDTG~t~~ev~~s~p~~~----------- 68 (134) T protein:vir:95 1 MSVKVIGDKALERELEKRFGIKE-MVKVQDKALIAGAKVIVEEVKKQLKPSKDTGALINEVSFSKPEWI----------- 68 (134) T ss_pred CeEEEecHHHHHHHHHHhhchhh-hhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeccEEecCeeec----------- Confidence 899999999999999987 544 57899999999999999999999986 89998888776543211 Q ss_pred ecccccccccceeEec-CCCCCcceeeeeccCccCCCCCcchhH--------hHHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 80 RGVNPRTGNSDNTMKA-NNPRNAFYWRFVEMGTANMTAHPFIRP--------AFDVRQEQATEVAIRRMNQA 142 (149) Q Consensus 80 ~~~~~~~~~~~~~~~~-~~~~~~~y~~~~E~GT~~~~a~PFl~p--------A~~~~k~~~~~~~~~~~~~~ 142 (149) .|.....+.. 
+..+..++-|+.|||+.+....+|++| |+++.+..+.+.++++|++- T Consensus 69 ------~G~r~V~vgW~G~~~R~~iiHLNE~Gytr~~~Gk~i~PrG~G~i~~a~~~~e~~~~~~ik~eL~kl 134 (134) T protein:vir:95 69 ------NGKRTITVHWRGSKDRYKIVHLIEYGHVQKGTGKFIKPKAMGGVNRAIRQGQNKYFETLKRELKKL 134 (134) T ss_pred ------CCceEEEEEEEcCCceeEEEEeecccceecccCCccCcchhhHHHHHHHhhhHHHHHHHHHHHhcC Confidence 1111111111 224457888999999999999999999 55555555554444444443 No 79 >protein:vir:101302 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908835;genbank:gi:118725099;genbank:GeneID:4555873 Probab=99.44 E-value=5.9e-16 Score=104.09 Aligned_cols=121 Identities=24% Similarity=0.258 Sum_probs=88.2 Q ss_pred eeeehHhHHHHHHHHHHh--HHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--CCCcccccceeccccccccCccccceee Q lcl|NC_019769. 4 TSLDFSGLNDIAKDLEAL--SRAENNKVLRDATRAGAEVLKEEVIARAPV--RTGKLKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 4 ~~~~i~Gl~~l~~~l~~l--~~~~~~k~~~~al~~~a~~v~~~ak~~aP~--~~g~l~~~i~~~~~~~~~~~~~~~~i~~ 79 (149) |+|++.|++||+++|++. +.. ..++..+||.++++.|.++++.+.++ |||.+.+.+..+..... T Consensus 1 msvevkGv~eil~~le~k~g~~~-~~ri~nkAL~~age~v~~~~K~~~~~fkDTG~t~~ev~~s~p~~~----------- 68 (134) T protein:vir:10 1 MSVKVIGDKALERELEKRFGIKE-MVKVQDKALIAGAKVIVEEVKKQLKPSKDTGALINEVSFSKPEWI----------- 68 (134) T ss_pred CeEEEecHHHHHHHHHHhhchhh-hhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeccEEecCeeec----------- Confidence 899999999999999987 544 57899999999999999999999986 89998888776543211 Q ss_pred ecccccccccceeEec-CCCCCcceeeeeccCccCCCCCcchhH--------hHHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 80 RGVNPRTGNSDNTMKA-NNPRNAFYWRFVEMGTANMTAHPFIRP--------AFDVRQEQATEVAIRRMNQA 142 (149) Q Consensus 80 ~~~~~~~~~~~~~~~~-~~~~~~~y~~~~E~GT~~~~a~PFl~p--------A~~~~k~~~~~~~~~~~~~~ 142 (149) .|.....+.. 
+..+..++-|+.|||+.+....+|++| |+++.+..+.+.++++|++- T Consensus 69 ------~G~r~V~vgW~G~~~R~~iiHLNE~Gytr~~~Gk~i~PrG~G~i~~a~~~~e~~~~~~ik~eL~kl 134 (134) T protein:vir:10 69 ------NGKRTITVHWRGSKDRYKIVHLIEYGHVQKGTGKFIKPKAMGGVNRAIRQGQNKYFETLKRELKKL 134 (134) T ss_pred ------CCceEEEEEEEcCCceeEEEEeecccceecccCCccCcchhhHHHHHHHhhhHHHHHHHHHHHhcC Confidence 1111111111 224457888999999999999999999 55555555554444444443 No 80 >protein:vir:106041 Length: 137 # NCBI annotation: gp23 # Family: family:all:1084 # MgeID: mge:1505 # MgeName: Cooper # Cross-refs: genbank:acc:YP_654920;genbank:gi:109392376;genbank:GeneID:4157069 Probab=99.40 E-value=4.1e-16 Score=104.96 Aligned_cols=104 Identities=18% Similarity=0.258 Sum_probs=69.3 Q ss_pred cceeeehH-hHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 2 IETSLDFS-GLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 2 m~~~~~i~-Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) |.++++|+ ..++|++.+ ..+++.+|...+..++.+++.++|++||+|++||........ T Consensus 1 m~~s~~i~i~~~~l~~~v--------~~~~k~~l~~~a~~i~~~ak~~aPv~tG~Lr~SI~~~~~~~~------------ 60 (137) T protein:vir:10 1 MPVTARIHINEPELERQT--------GAIFRGKHRSITRRIATQARADVPVRTGNLGRGIQEMPQTYR------------ 60 (137) T ss_pred CCeeEEEeeCHHHHHHHH--------HHHHHHHHHHHHHHHHHHHHHhCCcccchhhcCceeeeeccc------------ Confidence 88877776 334444333 345677788889999999999999999999999975321100 Q ss_pred cccccccccceeEecCCCCCcceeeeeccCcc-----------------------------CCCCCcchhHhHHHH---H Q lcl|NC_019769. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVEMGTA-----------------------------NMTAHPFIRPAFDVR---Q 128 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~-----------------------------~~~a~PFl~pA~~~~---k 128 (149) ...+.+....+..|+.|+||||. .++|||||+||+++. 
+ T Consensus 61 ---------~~~~~~~v~~~~~YA~~ve~GT~ph~I~pk~~k~l~f~~~G~~v~~k~v~hpG~~a~Pfl~~A~~~~~~~~ 131 (137) T protein:vir:10 61 ---------PFHVGGGVEDNVDYAAPVHEGSRPHRITARHANALHFFWHGREVFRKSVWHPGVRPRPFLRNAARRVVAAD 131 (137) T ss_pred ---------cceEEEEEecCCCceeeeeecCCCceeecccCceeeeeeCCceEEeeeeecCCCCCCchHHHHHHHHhhcc Confidence 00011112345678888888883 345999999999974 3 Q ss_pred HHHHHH Q lcl|NC_019769. 129 EQATEV 134 (149) Q Consensus 129 ~~~~~~ 134 (149) .++.-. T Consensus 132 ~ri~~~ 137 (137) T protein:vir:10 132 PDIHMT 137 (137) T ss_pred ccccCC Confidence 444333 No 81 >protein:vir:966 Length: 123 # NCBI annotation: Orf48 # Family: family:all:970 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076620;genbank:gi:13095728;genbank:GeneID:920248 Probab=99.38 E-value=5.7e-15 Score=98.68 Aligned_cols=117 Identities=15% Similarity=0.193 Sum_probs=89.7 Q ss_pred cceeeehHhHHH-HHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 2 IETSLDFSGLND-IAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 2 m~~~~~i~Gl~~-l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) |..+++|+.|++ +.+.|+.+.+.+. ..+..++.+.|..+.+++++.+|++||.+.++..+.... T Consensus 1 m~~~v~id~L~~~i~~~L~~y~~~v~-~~v~~~v~~~a~~~~~~lk~~sP~~TG~yaksW~~k~~~-------------- 65 (123) T protein:vir:96 1 MANKISIDDLAKTIESEVRNWTKDVV-DDIDDIKKDITKNGVKQLRESSPKRTGDYAKNWTSQKLK-------------- 65 (123) T ss_pred CCcccchhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhCCccccccccceeeeecC-------------- Confidence 888999988866 6899999999875 567999999999999999999999999999987543210 Q ss_pred cccccccccceeEecCCCCCcceeeeeccCcc-----CCCCCcchhHhHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVEMGTA-----NMTAHPFIRPAFDVRQEQATEVAIRRMNQ 141 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~-----~~~a~PFl~pA~~~~k~~~~~~~~~~~~~ 141 (149) .. ..+.+.+.......|++|||.. 
+.+|+|||+||++...+.+.+.+.+.|.+ T Consensus 66 -------~~-~~~v~~~~~~y~l~HLLE~GHa~r~GGrV~a~phI~paee~~~~~l~~~i~r~l~~ 123 (123) T protein:vir:96 66 -------NG-DQVIYQKAPTYRLTHLLENGHAKRNGGRVSPKVHIAPVEEELVSNYISRVEKRLSQ 123 (123) T ss_pred -------Ce-eEEEEEecCCcceEEeeecceeecCCceeCcchhhhHHHHHHHHHHHHHHHHHhcC Confidence 00 1122333333456899999943 47999999999998777777666666666 No 82 >protein:vir:9879 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:2718 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795641;genbank:gi:28876400;genbank:GeneID:1257931 Probab=99.35 E-value=2.7e-15 Score=100.49 Aligned_cols=109 Identities=12% Similarity=0.198 Sum_probs=78.5 Q ss_pred hHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHh--CCc-------CCCcccccceeccccccccCcccccee Q lcl|NC_019769. 8 FSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIAR--APV-------RTGKLKKNVVVVTQKSRRRGEISSGVH 78 (149) Q Consensus 8 i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~--aP~-------~~g~l~~~i~~~~~~~~~~~~~~~~i~ 78 (149) |.|+|+|+++|++...+. + +..++.-...+...++++ +|+ +||.+++||......... T Consensus 1 i~G~~~L~~~Lk~~s~~d---v-k~VVkkN~ael~~r~q~~~~~pv~~~~k~~dTG~lkRSi~l~~~~~g~--------- 67 (127) T protein:vir:98 1 MTGMPALEVKLRSMSEKR---W-DRVANKNLTEMFNRAARPPGTPIGKNTKRHKSGELLRSRRLKKVNSSK--------- 67 (127) T ss_pred CcChHHHHHHHHHhhHHH---H-HHHHhhhhHHHHHHHHhccCCceeccccccCcccceeeeEEEEecCCc--------- Confidence 999999999999885432 3 445555555567777765 888 999999998764321111 Q ss_pred eecccccccccceeEecCCCCCcceeeeeccCccC---------CCCCcchhHhHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 79 IRGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTAN---------MTAHPFIRPAFDVRQEQATEVAIRRMNQ 141 (149) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~---------~~a~PFl~pA~~~~k~~~~~~~~~~~~~ 141 (149) ...++.......|+-++||||+. 
+|+||||.|||+.-+..+++.+.+.+++ T Consensus 68 ------------~~~vgp~g~t~dYapyvEyGTR~m~~~~~~gf~~aqp~l~paf~~Qk~iF~~DL~~l~k~ 127 (127) T protein:vir:98 68 ------------DVITGNFGYIKDYAPHVEYGHRIVRNGKQVGYANGTKYLFNNVKKQREIYRQDMLNELRR 127 (127) T ss_pred ------------eEEeccCcccccccceeecceeeeecccccccccCccccccchHHHhHHHHHHHHHHhcC Confidence 11122223346899999999995 5599999999999988888877777777 No 83 >protein:vir:102441 Length: 137 # NCBI annotation: gp26 # Family: family:all:1084 # MgeID: mge:1618 # MgeName: Pipefish # Cross-refs: genbank:acc:YP_655303;genbank:gi:109521866;genbank:GeneID:4157756 Probab=99.31 E-value=3.6e-15 Score=99.79 Aligned_cols=107 Identities=19% Similarity=0.221 Sum_probs=69.7 Q ss_pred cceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeeec Q lcl|NC_019769. 2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRG 81 (149) Q Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~~ 81 (149) |..++.++ -....|.+.+ ..+++.+++..+..++++|+.++|+++|.|++||..........+. T Consensus 1 ~~~~~~~~------~~~~~~~~~~-~~v~r~~l~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~~~~~~--------- 64 (137) T protein:vir:10 1 MTVTARYE------RNPVGEARQF-QVIARRRLSRITRGTANQARADVPVKTGNLGRSIREDPIVVAGPLR--------- 64 (137) T ss_pred CeeEEEec------cCchhHHHHH-HHHHHHHHHHHHHHHHHHHHhcCCccchhhhcCceeeeeeccccce--------- Confidence 66666654 1122223333 3466788899999999999999999999999999754321111110 Q ss_pred ccccccccceeEecCCCCCcceeeeeccCcc------------------------------CCCCCcchhHhHHHHHHHH Q lcl|NC_019769. 82 VNPRTGNSDNTMKANNPRNAFYWRFVEMGTA------------------------------NMTAHPFIRPAFDVRQEQA 131 (149) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~------------------------------~~~a~PFl~pA~~~~k~~~ 131 (149) +......+..|+.|+||||. .++|+|||+||++.++.+. 
T Consensus 65 -----------~~~~V~~~~~YA~~ve~GT~ph~I~Pk~~k~~l~~~~~g~~vf~k~V~hPG~~a~PfL~~A~~~~~~~~ 133 (137) T protein:vir:10 65 -----------LDSGVTAHADYARYVHDGTRAHVIRPRRPGGVLRFTVGGRVVYARRVNHPGTRARPFLRNAAERVVARE 133 (137) T ss_pred -----------EEEEecCCCccceeeecCCCCceeeccccceeeeEeeCCeeEecceeecCCCCCCchHHHHHHHhhhhh Confidence 11111233455555555552 3569999999999999988 Q ss_pred HHHH Q lcl|NC_019769. 132 TEVA 135 (149) Q Consensus 132 ~~~~ 135 (149) ...- T Consensus 134 ~~~~ 137 (137) T protein:vir:10 134 TATS 137 (137) T ss_pred cccC Confidence 7665 No 84 >protein:vir:107545 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1481 # MgeName: PG1 # Cross-refs: genbank:acc:NP_943803;genbank:gi:38638428;genbank:GeneID:2657225 Probab=99.27 E-value=6.6e-15 Score=98.32 Aligned_cols=108 Identities=22% Similarity=0.235 Sum_probs=66.2 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) |..+.-++. +.-....|.+.+ ..+++.+++..+..++.+++.++|+++|.|++||........ T Consensus 1 ~~~~~~~~~----~~~~~~~~~~~~-~~~~~~~~~~~~~~v~~~ak~~aPvdtG~Lr~SI~~~~~~~~------------ 63 (140) T protein:vir:10 1 MATIRARAR----IEIDEAALERES-GEHLRAFHRSLTRRIANQSRVAVPVRTGNLGRTIGELPQVYT------------ 63 (140) T ss_pred Ceeeeeeee----eeeCHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhcCCccchhhhccceeeeeeCC------------ Confidence 665543332 112222333332 345677788899999999999999999999999975321111 Q ss_pred cccccccccceeEecCCCCCcceeeeeccCcc-----------------------------CCCCCcchhHhHHHH---H Q lcl|NC_019769. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVEMGTA-----------------------------NMTAHPFIRPAFDVR---Q 128 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~-----------------------------~~~a~PFl~pA~~~~---k 128 (149) ...+.+....++.|+.|+||||. .++|||||+||++.. 
+ T Consensus 64 ---------~~~~~~~v~~~a~YA~~Ve~GT~ph~I~pk~~k~L~~~~~G~~~~~k~V~hpG~~a~Pfl~~A~~~~~~~~ 134 (140) T protein:vir:10 64 ---------PFRVRGGVEATADYAAPVHEGSRPHAIRARNAQYLHFWWHGREMFRKSVWHPGTRARPFMRNSAQRVVTND 134 (140) T ss_pred ---------CceEEEEecCCccchhhhccCCCCceeecCCCccceeecCCCEEEeeeeecCCCCCChhHHHHHHHHhhhh Confidence 01111122334556666666662 367999999999974 5 Q ss_pred HHHHHH Q lcl|NC_019769. 129 EQATEV 134 (149) Q Consensus 129 ~~~~~~ 134 (149) +++... T Consensus 135 ~~i~~~ 140 (140) T protein:vir:10 135 PRVRMT 140 (140) T ss_pred hhccCC Confidence 555555 No 85 >protein:vir:97982 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1482 # MgeName: Orion # Cross-refs: genbank:acc:YP_655121;genbank:gi:109391871;genbank:GeneID:4157345 Probab=99.27 E-value=6.6e-15 Score=98.32 Aligned_cols=108 Identities=22% Similarity=0.235 Sum_probs=66.2 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) |..+.-++. +.-....|.+.+ ..+++.+++..+..++.+++.++|+++|.|++||........ T Consensus 1 ~~~~~~~~~----~~~~~~~~~~~~-~~~~~~~~~~~~~~v~~~ak~~aPvdtG~Lr~SI~~~~~~~~------------ 63 (140) T protein:vir:97 1 MATIRARAR----IEIDEAALERES-GEHLRAFHRSLTRRIANQSRVAVPVRTGNLGRTIGELPQVYT------------ 63 (140) T ss_pred Ceeeeeeee----eeeCHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhcCCccchhhhccceeeeeeCC------------ Confidence 665543332 112222333332 345677788899999999999999999999999975321111 Q ss_pred cccccccccceeEecCCCCCcceeeeeccCcc-----------------------------CCCCCcchhHhHHHH---H Q lcl|NC_019769. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVEMGTA-----------------------------NMTAHPFIRPAFDVR---Q 128 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~-----------------------------~~~a~PFl~pA~~~~---k 128 (149) ...+.+....++.|+.|+||||. .++|||||+||++.. 
+ T Consensus 64 ---------~~~~~~~v~~~a~YA~~Ve~GT~ph~I~pk~~k~L~~~~~G~~~~~k~V~hpG~~a~Pfl~~A~~~~~~~~ 134 (140) T protein:vir:97 64 ---------PFRVRGGVEATADYAAPVHEGSRPHAIRARNAQYLHFWWHGREMFRKSVWHPGTRARPFMRNSAQRVVTND 134 (140) T ss_pred ---------CceEEEEecCCccchhhhccCCCCceeecCCCccceeecCCCEEEeeeeecCCCCCChhHHHHHHHHhhhh Confidence 01111122334556666666662 367999999999974 5 Q ss_pred HHHHHH Q lcl|NC_019769. 129 EQATEV 134 (149) Q Consensus 129 ~~~~~~ 134 (149) +++... T Consensus 135 ~~i~~~ 140 (140) T protein:vir:97 135 PRVRMT 140 (140) T ss_pred hhccCC Confidence 555555 No 86 >protein:vir:9647 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795409;genbank:gi:28876182;genbank:GeneID:1257731 Probab=99.25 E-value=7.7e-14 Score=92.46 Aligned_cols=125 Identities=15% Similarity=0.210 Sum_probs=96.3 Q ss_pred cceeeehHhHHHHHHHHHH-hHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--CCCcccccceeccccccccCcccccee Q lcl|NC_019769. 2 IETSLDFSGLNDIAKDLEA-LSRAENNKVLRDATRAGAEVLKEEVIARAPV--RTGKLKKNVVVVTQKSRRRGEISSGVH 78 (149) Q Consensus 2 m~~~~~i~Gl~~l~~~l~~-l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~--~~g~l~~~i~~~~~~~~~~~~~~~~i~ 78 (149) |+.--+|.|++||+++|++ |++.-..++..+||.+||+.+.+.++.+.|+ |||...+.+..+...+. T Consensus 1 ~~~~aevkGv~Eilk~lE~klG~~~v~ri~nkAL~~~ge~v~~~lK~~~~~f~DTG~t~dev~~s~~~~~---------- 70 (132) T protein:vir:96 1 MSGFANLKGVEELLANMEKKLGPAKVNRVVNRSLKEIGKELEPSFKSAISIYKRTGETTESAVVSGVRRE---------- 70 (132) T ss_pred CCccccccCHHHHHHHHHHhhCHHHHHHHhHHHHHHHHHHHHHHHHHhhhhhhhcchhhcceeecCeeec---------- Confidence 6666678899999999999 9986567899999999999999999999996 88888887766543221 Q ss_pred eecccccccccceeEecCCCCCcce-eeeeccCccCCC-CC--cchhHhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 79 IRGVNPRTGNSDNTMKANNPRNAFY-WRFVEMGTANMT-AH--PFIRPAFDVRQEQATEVAIRRMNQAIDE 145 (149) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~y-~~~~E~GT~~~~-a~--PFl~pA~~~~k~~~~~~~~~~~~~~l~k 145 (149) .| ...+.++|.+..|| -|..|||+-++. 
|+ -+++.|++..+..+.+.+.++|++.|+- T Consensus 71 -------~G--~r~V~VgW~GpR~~ivHLNE~GyGk~~~PrG~G~I~~a~~~se~~~~~~~~~elkk~l~~ 132 (132) T protein:vir:96 71 -------DG--IPKVKLGFTTPRWNIVHLQELEYGWKHNRRGVGVIRRYSDILETIYPRGIRDKLKRGFDG 132 (132) T ss_pred -------CC--ceEEEecccCCceeEEeeecccccCCcCCCcchHHHHHHHhhhhHHHHHHHHHHHHHhcC Confidence 11 12233333333444 488899985543 22 3899999999999999999999999988 No 87 >protein:vir:6216 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:10886 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852596;genbank:gi:31415856;genbank:GeneID:1489214 Probab=99.16 E-value=6.6e-14 Score=92.84 Aligned_cols=119 Identities=18% Similarity=0.400 Sum_probs=92.3 Q ss_pred cceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeeec Q lcl|NC_019769. 2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRG 81 (149) Q Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~~ 81 (149) |.+ .=.|+.|++..|..|.+ +.+++....|.+||+.+.+..+.+.|.+-- ...+++++.+.++. T Consensus 1 m~s--NNNGFae~~~~~~tl~k-Vd~kvs~e~L~eAA~~f~~KL~P~Ip~Sl~-------------kkk~HlrD~lkVvv 64 (125) T protein:vir:62 1 MAS--NNNGFAEALEDINTLLR-VNKKVSLDALDEAAKYFASKLKPKINVSNK-------------NKRTHLRDSLKVVV 64 (125) T ss_pred CCC--CchhHHHHHHHhhhhhh-hhhhhhHHHHHHHHHHHHHhhccccChhhh-------------hhhhhcceeeeEEe Confidence 333 44799999999999997 458999999999999999999999996531 11233444443322 Q ss_pred ccccccccceeEecCCCCCcceeeeeccCccCC------CCCcchhHhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 
82 VNPRTGNSDNTMKANNPRNAFYWRFVEMGTANM------TAHPFIRPAFDVRQEQATEVAIRRMNQAI 143 (149) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~------~a~PFl~pA~~~~k~~~~~~~~~~~~~~l 143 (149) - ...+.+.....+|||+|.|.||+++ .+|+|...+|+++++.+.++|.+.+-.++ T Consensus 65 k-------~d~V~V~Fed~a~yW~f~EnGt~~~~~~g~vkaqhf~~~Tf~~nk~kI~~iM~kki~d~m 125 (125) T protein:vir:62 65 K-------DDRVSVEFKDEAWYWYLVEHGHKKAKGKGRVKGKHFVQNTFDAEGDKIADIMAQKIINRM 125 (125) T ss_pred e-------CCeEEEEEcchhhhhhhhhccccccccccccchhhhhhccHHhhHHHHHHHHHHHHHhhC Confidence 1 1223344567899999999999997 89999999999999999999999887777 No 88 >protein:vir:102963 Length: 163 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945289;genbank:gi:39653724;uniprot:Q708M3;genbank:GeneID:2672877 Probab=99.15 E-value=5.5e-13 Score=87.81 Aligned_cols=144 Identities=12% Similarity=0.217 Sum_probs=96.3 Q ss_pred cceeeehHhHHHHHHHHHHhHHH-HHHHHHHHHHHHHHHHHHHHHHHhCCcCCC-cccc-----cc----eecccccccc Q lcl|NC_019769. 2 IETSLDFSGLNDIAKDLEALSRA-ENNKVLRDATRAGAEVLKEEVIARAPVRTG-KLKK-----NV----VVVTQKSRRR 70 (149) Q Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~-~~~k~~~~al~~~a~~v~~~ak~~aP~~~g-~l~~-----~i----~~~~~~~~~~ 70 (149) |+..|++++|+++.++|..+... ..++.++..+.+.|..+...++.+.|+..- .+.. +. .........+ T Consensus 1 m~~~~d~~~l~~f~k~l~~~~~~~~~~~~~~~~~~e~a~~ll~~vk~rtPv~~~~~~~~~~~~~~~k~~k~~~~~~~k~t 80 (163) T protein:vir:10 1 MSGGFDYRSFAKFANNFNRNANHAKVDRFMRQTLNYEGTELKSKVKERTPVGVYTDHWVEFTTKDGKHVKFWASAHGKQG 80 (163) T ss_pred CCCccCHHHHHHHHHHHHHHhhhcchHHHHHHHHHHHHHHHHHHHHHhCCcccchhhhhhhhhcccchhhhhcccccccc Confidence 99999999999999999988542 235667899999999999999999998320 0000 00 0001111234 Q ss_pred CccccceeeecccccccccceeEecCCCCCcceeeeeccCcc-----CCCCCcchhHhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 
71 GEISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTA-----NMTAHPFIRPAFDVRQEQATEVAIRRMNQAIDE 145 (149) Q Consensus 71 ~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~-----~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~k 145 (149) |++..+..+..+.. .++... +....+..|+||+|||.. ..|.+++|..+.+....++-+.+.+.|.+.|++ T Consensus 81 G~lr~swk~~~~~k-~~~~~~---v~v~N~~~YA~~VE~GHR~~~gGfV~G~fml~~s~~~~~~~~~~~~e~~l~~~l~k 156 (163) T protein:vir:10 81 GTLQKGWSKSRIEV-SGRTYK---QKVYNKVYYAPHVEYGHKTVNGGFVPGQFFLHKTVEDTKSDMEKRVRDKYDGFMRK 156 (163) T ss_pred chhhccceecceee-cCCceE---EEEEecCCccchhhcceeecCCceeccchhhHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 45544444433321 112111 122345789999999965 368999999999998888888877777777777 Q ss_pred HhcC Q lcl|NC_019769. 146 ALSK 149 (149) Q Consensus 146 ~~~k 149 (149) .+.= T Consensus 157 ~~~~ 160 (163) T protein:vir:10 157 VVLG 160 (163) T ss_pred hhcC Confidence 7654 No 89 >protein:vir:6246 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813700;swissprot:trembl:q859b7;genbank:gi:29366760;uniprot:Q859B7;genbank:GeneID:1258903 Probab=99.12 E-value=6.2e-13 Score=87.50 Aligned_cols=126 Identities=21% Similarity=0.371 Sum_probs=101.0 Q ss_pred Ccc---eeeehHhHHHHHHHHHHh-HHHHHHHHHHHHHHHHHHHHHHHHHHhCCcC-----------CCcccccceeccc Q lcl|NC_019769. 1 MIE---TSLDFSGLNDIAKDLEAL-SRAENNKVLRDATRAGAEVLKEEVIARAPVR-----------TGKLKKNVVVVTQ 65 (149) Q Consensus 1 Mm~---~~~~i~Gl~~l~~~l~~l-~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~-----------~g~l~~~i~~~~~ 65 (149) |++ ..|.|+|+.++...|+++ +.++ .+.++.+++++|+++...+++.+|+. +|.|..||++.. 
T Consensus 1 ma~~~~~~vrV~Glr~f~~~mrK~~g~dl-~k~lk~a~~~aa~v~~~~ar~~tP~g~r~~~~s~~~r~G~L~~Sir~aa- 78 (143) T protein:vir:62 1 MAQRSAYTIRVDGLREFQRNVRTLRDKEL-NKAVREANKASGEVLIPQAKHESPDGKRDAKSSKKYRPGKLDKSIKVTA- 78 (143) T ss_pred CCcccchheehHHHHHHHHHHHHhhCCch-hHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccCcchhhccccccc- Confidence 766 589999999999999999 6554 56889999999999999999999984 455555554321 Q ss_pred cccccCccccceeeecccccccccceeEecCCCCCcceeeeeccCccCCC--CCcchhHhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 66 KSRRRGEISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTANMT--AHPFIRPAFDVRQEQATEVAIRRMNQAI 143 (149) Q Consensus 66 ~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~--a~PFl~pA~~~~k~~~~~~~~~~~~~~l 143 (149) +.....+..+..+.-+|+.|++||+..+. |+-||..++-++.+++.....+.+.+.| T Consensus 79 ---------------------T~raa~VrAG~~krVPYA~~I~~G~r~r~Isp~rFl~~a~a~te~~~~r~Ye~~i~~vl 137 (143) T protein:vir:62 79 ---------------------SAKGAVIKAGSASRVPYAAAIHFGYRARNISPNRFLFRAMARKSDVVAATYERRIAAVV 137 (143) T ss_pred ---------------------cccceeeeeCCcCCCCcccccccCcccccccchhhhhhhhhccCHHHHHHHHHHHHHHH Confidence 11222233344457899999999998765 9999999999999999999999999999 Q ss_pred HHHhcC Q lcl|NC_019769. 144 DEALSK 149 (149) Q Consensus 144 ~k~~~k 149 (149) ++.|.- T Consensus 138 ~k~l~s 143 (143) T protein:vir:62 138 EKYLES 143 (143) T ss_pred HHHhcC Confidence 988888 No 90 >protein:vir:1332 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047931;swissprot:trembl:q9zxa7;genbank:gi:9631149;uniprot:Q9ZXA7;genbank:GeneID:2715891 Probab=99.12 E-value=6.3e-13 Score=87.46 Aligned_cols=137 Identities=20% Similarity=0.357 Sum_probs=101.4 Q ss_pred Ccc---eeeehHhHHHHHHHHHHh-HHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccc Q lcl|NC_019769. 
1 MIE---TSLDFSGLNDIAKDLEAL-SRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSG 76 (149) Q Consensus 1 Mm~---~~~~i~Gl~~l~~~l~~l-~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~ 76 (149) |++ ..|+|+|+..+...|+++ +.++ .+.++.+++++|+++...+++.+|+..-+-..| +...+|.+... T Consensus 1 ma~~~~~~vkV~Glr~f~~~mrK~~g~dl-~k~lk~a~~~aa~v~~~~ar~~tP~g~~~p~~s------rr~r~G~L~~S 73 (143) T protein:vir:13 1 MAQRSAYTIQVDGLRQFQRNVRALRDKEL-NKAVREANKASGEVLIPQAKHESPDGHRDPKSS------KRYRPGKLDKS 73 (143) T ss_pred CCcccchheehHHHHHHHHHHHHhhCCcc-hHHHHHHHHHHHHHHHHHHHhhcCCcccccccc------cccccchhhcc Confidence 766 589999999999999999 6554 568899999999999999999999972211111 11122333322 Q ss_pred eeeecccccccccceeEecCCCCCcceeeeeccCccCCC--CCcchhHhHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019769. 77 VHIRGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTANMT--AHPFIRPAFDVRQEQATEVAIRRMNQAIDEALSK 149 (149) Q Consensus 77 i~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~--a~PFl~pA~~~~k~~~~~~~~~~~~~~l~k~~~k 149 (149) |.+. .+.....+..+....-+|+.|++||+.++. ++-||+.++-++.+++.....+.+.+.|++.|.- T Consensus 74 ir~a-----aT~raa~VrAGr~arVPYA~~I~~G~r~r~Is~~rFl~~a~a~te~~~~r~Ye~~i~~vl~k~l~s 143 (143) T protein:vir:13 74 IKVT-----ASAKGAVIKAGSAARVPYAAAIHFGYRKRNISANRFLYRAMARKSDVVAATYERRIAAVVEKYLES 143 (143) T ss_pred cccc-----ccccceeeeecCcCCCCcccccccCCcccccchhhhhhhhhhccCHHHHHHHHHHHHHHHHHHhcC Confidence 2211 112222334444556799999999998766 9999999999999999999999999999988888 No 91 >protein:vir:98636 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039927;genbank:gi:126011102;genbank:GeneID:4818472 Probab=99.04 E-value=3.1e-12 Score=83.67 Aligned_cols=126 Identities=15% Similarity=0.194 Sum_probs=93.7 Q ss_pred CcceeeehHhHHHHHHHHHH-hHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--cCCCcccccceeccccccccCccccce Q lcl|NC_019769. 
1 MIETSLDFSGLNDIAKDLEA-LSRAENNKVLRDATRAGAEVLKEEVIARAP--VRTGKLKKNVVVVTQKSRRRGEISSGV 77 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~-l~~~~~~k~~~~al~~~a~~v~~~ak~~aP--~~~g~l~~~i~~~~~~~~~~~~~~~~i 77 (149) -|+.--+|.|++|++++|++ |++....++..+||.++++.+.+..+.+.+ .|||...+.+..+.... T Consensus 6 ~~~~~aevkGv~Eilk~lE~klG~~~~~ri~nkAL~~~ge~v~~~lK~~~~~fkDTGat~dev~~s~p~~---------- 75 (138) T protein:vir:98 6 SMSGFANLKGVEELLANMEKKLGPAKVNRVVNRSLKEIGKELEPSFKSAISIYKRTGETTESAVVSGVRR---------- 75 (138) T ss_pred cccccccccCHHHHHHHHHHhhCHHhhhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeeeeeecCeee---------- Confidence 23333467799999999999 888767899999999999999999999988 57888777665543221 Q ss_pred eeecccccccccceeEecCCCCCcc-eeeeeccCccCCC-CC--cchhHhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 78 HIRGVNPRTGNSDNTMKANNPRNAF-YWRFVEMGTANMT-AH--PFIRPAFDVRQEQATEVAIRRMNQAIDE 145 (149) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~-y~~~~E~GT~~~~-a~--PFl~pA~~~~k~~~~~~~~~~~~~~l~k 145 (149) ..| ...+..+|.+..| .-|..|||+.+++ |+ -+++.|++..+..+.+.++++|++.|+- T Consensus 76 -------~~G--~r~V~igW~GpR~~ivHLNE~GyGk~i~PrG~G~I~ka~~~se~~y~~~vk~el~k~l~~ 138 (138) T protein:vir:98 76 -------EDG--IPKVKLGFTTPRWNIVHLQELEYGWKHNRRGVGVIRRYSDILETIYPRGIRDKLKRGFDG 138 (138) T ss_pred -------cCC--ceEEEEeeecCeeeEEeeecccccCCcCCCcchHHHHHHHhhhHHHHHHHHHHHHHHhcC Confidence 111 1222333333333 3588899986643 22 3899999999999999999999999998 No 92 >protein:vir:9363 Length: 133 # NCBI annotation: SLT orf 123-like protein # Family: family:all:589 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803341;genbank:gi:29028652;genbank:GeneID:1258087 Probab=98.93 E-value=1.2e-11 Score=80.37 Aligned_cols=121 Identities=12% Similarity=0.181 Sum_probs=83.5 Q ss_pred eeeehHhHHHHHHHHHH-hHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--cCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 
4 TSLDFSGLNDIAKDLEA-LSRAENNKVLRDATRAGAEVLKEEVIARAP--VRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 4 ~~~~i~Gl~~l~~~l~~-l~~~~~~k~~~~al~~~a~~v~~~ak~~aP--~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) |++++.|++||+++|++ |++....++..+||.++++.+.+..+.+.. .|||...+.+..+.... T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~------------- 67 (133) T protein:vir:93 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYT------------- 67 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeee------------- Confidence 89999999999999998 888877899999999999999999999887 58888777766543211 Q ss_pred cccccccccceeEecCCC---CCcceeeeeccCccC----CCCCc--chhHhHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 81 GVNPRTGNSDNTMKANNP---RNAFYWRFVEMGTAN----MTAHP--FIRPAFDVRQEQATEVAIRRMNQ 141 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~---~~~~y~~~~E~GT~~----~~a~P--Fl~pA~~~~k~~~~~~~~~~~~~ 141 (149) ..+.....+..+|. +....-|..|||..+ ..|+- -+..|+++.+..+.+.++++|++ T Consensus 68 ----~~g~~~rtV~i~W~gp~~R~~iVHLNE~Gytr~Gk~i~PrG~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:93 68 ----KVGSQERAVLIEWVGPMNRKNIIHLNEHGYTRDGKKYTPRGFGVIAKTLAASERKYREIIKKELAR 133 (133) T ss_pred ----ccCCcceeEEEEeecCCCceeEEEeeccceecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 12222222333332 234456999999533 22333 36666666666666555555555 No 93 >protein:vir:78644 Length: 133 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429946;genbank:gi:156604000;genbank:GeneID:5525390 Probab=98.93 E-value=1.2e-11 Score=80.37 Aligned_cols=121 Identities=12% Similarity=0.181 Sum_probs=83.5 Q ss_pred eeeehHhHHHHHHHHHH-hHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--cCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 
4 TSLDFSGLNDIAKDLEA-LSRAENNKVLRDATRAGAEVLKEEVIARAP--VRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 4 ~~~~i~Gl~~l~~~l~~-l~~~~~~k~~~~al~~~a~~v~~~ak~~aP--~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) |++++.|++||+++|++ |++....++..+||.++++.+.+..+.+.. .|||...+.+..+.... T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~------------- 67 (133) T protein:vir:78 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYT------------- 67 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeee------------- Confidence 89999999999999998 888877899999999999999999999887 58888777766543211 Q ss_pred cccccccccceeEecCCC---CCcceeeeeccCccC----CCCCc--chhHhHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 81 GVNPRTGNSDNTMKANNP---RNAFYWRFVEMGTAN----MTAHP--FIRPAFDVRQEQATEVAIRRMNQ 141 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~---~~~~y~~~~E~GT~~----~~a~P--Fl~pA~~~~k~~~~~~~~~~~~~ 141 (149) ..+.....+..+|. +....-|..|||..+ ..|+- -+..|+++.+..+.+.++++|++ T Consensus 68 ----~~g~~~rtV~i~W~gp~~R~~iVHLNE~Gytr~Gk~i~PrG~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:78 68 ----KVGSQERAVLIEWVGPMNRKNIIHLNEHGYTRDGKKYTPRGFGVIAKTLAASERKYREIIKKELAR 133 (133) T ss_pred ----ccCCcceeEEEEeecCCCceeEEEeeccceecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 12222222333332 234456999999533 22333 36666666666666555555555 No 94 >protein:vir:94419 Length: 133 # NCBI annotation: ORF028 # Family: family:all:589 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240010;genbank:gi:66395683;genbank:GeneID:5133079 Probab=98.93 E-value=1.2e-11 Score=80.37 Aligned_cols=121 Identities=12% Similarity=0.181 Sum_probs=83.5 Q ss_pred eeeehHhHHHHHHHHHH-hHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--cCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 4 TSLDFSGLNDIAKDLEA-LSRAENNKVLRDATRAGAEVLKEEVIARAP--VRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 4 ~~~~i~Gl~~l~~~l~~-l~~~~~~k~~~~al~~~a~~v~~~ak~~aP--~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) |++++.|++||+++|++ |++....++..+||.++++.+.+..+.+.. 
.|||...+.+..+.... T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~------------- 67 (133) T protein:vir:94 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYT------------- 67 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeee------------- Confidence 89999999999999998 888877899999999999999999999887 58888777766543211 Q ss_pred cccccccccceeEecCCC---CCcceeeeeccCccC----CCCCc--chhHhHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 81 GVNPRTGNSDNTMKANNP---RNAFYWRFVEMGTAN----MTAHP--FIRPAFDVRQEQATEVAIRRMNQ 141 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~---~~~~y~~~~E~GT~~----~~a~P--Fl~pA~~~~k~~~~~~~~~~~~~ 141 (149) ..+.....+..+|. +....-|..|||..+ ..|+- -+..|+++.+..+.+.++++|++ T Consensus 68 ----~~g~~~rtV~i~W~gp~~R~~iVHLNE~Gytr~Gk~i~PrG~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:94 68 ----KVGSQERAVLIEWVGPMNRKNIIHLNEHGYTRDGKKYTPRGFGVIAKTLAASERKYREIIKKELAR 133 (133) T ss_pred ----ccCCcceeEEEEeecCCCceeEEEeeccceecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 12222222333332 234456999999533 22333 36666666666666555555555 No 95 >protein:vir:96973 Length: 133 # NCBI annotation: ORF034 # Family: family:all:589 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239864;genbank:gi:66395542;genbank:GeneID:5133006 Probab=98.93 E-value=1.2e-11 Score=80.37 Aligned_cols=121 Identities=12% Similarity=0.181 Sum_probs=83.5 Q ss_pred eeeehHhHHHHHHHHHH-hHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--cCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 4 TSLDFSGLNDIAKDLEA-LSRAENNKVLRDATRAGAEVLKEEVIARAP--VRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 4 ~~~~i~Gl~~l~~~l~~-l~~~~~~k~~~~al~~~a~~v~~~ak~~aP--~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) |++++.|++||+++|++ |++....++..+||.++++.+.+..+.+.. .|||...+.+..+.... 
T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~------------- 67 (133) T protein:vir:96 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYT------------- 67 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeee------------- Confidence 89999999999999998 888877899999999999999999999887 58888777766543211 Q ss_pred cccccccccceeEecCCC---CCcceeeeeccCccC----CCCCc--chhHhHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 81 GVNPRTGNSDNTMKANNP---RNAFYWRFVEMGTAN----MTAHP--FIRPAFDVRQEQATEVAIRRMNQ 141 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~---~~~~y~~~~E~GT~~----~~a~P--Fl~pA~~~~k~~~~~~~~~~~~~ 141 (149) ..+.....+..+|. +....-|..|||..+ ..|+- -+..|+++.+..+.+.++++|++ T Consensus 68 ----~~g~~~rtV~i~W~gp~~R~~iVHLNE~Gytr~Gk~i~PrG~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:96 68 ----KVGSQERAVLIEWVGPMNRKNIIHLNEHGYTRDGKKYTPRGFGVIAKTLAASERKYREIIKKELAR 133 (133) T ss_pred ----ccCCcceeEEEEeecCCCceeEEEeeccceecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 12222222333332 234456999999533 22333 36666666666666555555555 No 96 >protein:vir:106506 Length: 137 # NCBI annotation: Pas21 # Family: family:all:1084 # MgeID: mge:1680 # MgeName: phiAsp2 # Cross-refs: genbank:acc:YP_024807;genbank:gi:48697422;genbank:GeneID:2846163 Probab=98.93 E-value=1.8e-12 Score=84.92 Aligned_cols=108 Identities=19% Similarity=0.182 Sum_probs=65.8 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) |---+.+++ .. .|+.+. ..+++.++...+..++.+++.++|++||+|++||........ 
T Consensus 1 ~~~~~~~l~-~~----~l~~~~----~~~~~~~~~~~a~~ve~~ak~~aPv~TG~Lr~SI~~~~~~~~------------ 59 (137) T protein:vir:10 1 MVAHTLRIE-RA----QLHGLG----MDEARKAVNRVVRRTFTRSQILAPVDTGYLRASGRLVLGRER------------ 59 (137) T ss_pred CcccccccC-hh----hHhhHH----HHHHHHHHHHHHHHHHHHHHhcCCcCchhhhccceeeeeecc------------ Confidence 555566655 12 222222 346678888899999999999999999999999976432110 Q ss_pred cccccccccceeEecCCCCCcceeeeeccCcc-----------------------------CCCCCcchhHhHHHHHHHH Q lcl|NC_019769. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVEMGTA-----------------------------NMTAHPFIRPAFDVRQEQA 131 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~-----------------------------~~~a~PFl~pA~~~~k~~~ 131 (149) ...+......+..|+.|+||||. .++|+|||+||++..+.+. T Consensus 60 ---------g~~v~~~V~~~~~YA~~ve~GT~ph~I~pk~~kaL~f~~~G~~vf~k~V~hPG~k~~PfL~~Al~~~~~~~ 130 (137) T protein:vir:10 60 ---------GAVVIGSVEYTARYAAAVHNGRRALTIRAKGNGRLKFTVEGRTVYARSVHQPARAGRPYLSQALREVAPQE 130 (137) T ss_pred ---------ccEEEEEecCCcccceeeecCCCCceeecCCCccceeecCCeeEeccceecCCCCCChhhHHHHHHhhccc Confidence 00111112234556666666662 2559999999999876553 Q ss_pred HHHHHHHHH Q lcl|NC_019769. 132 TEVAIRRMN 140 (149) Q Consensus 132 ~~~~~~~~~ 140 (149) --.+. |. T Consensus 131 ~~~~~--~~ 137 (137) T protein:vir:10 131 GFRVT--IG 137 (137) T ss_pred ceeEe--eC Confidence 21111 11 No 97 >protein:vir:95372 Length: 124 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764480;genbank:gi:115334634;genbank:GeneID:5179259 Probab=98.91 E-value=1.9e-11 Score=79.42 Aligned_cols=114 Identities=18% Similarity=0.240 Sum_probs=79.5 Q ss_pred CcceeeehHhH-HHHHHHHHHhHHHHHHHHHHHHH----HHHHHHHHHHHHHhCCcCCCcccccceeccccccccCcccc Q lcl|NC_019769. 1 MIETSLDFSGL-NDIAKDLEALSRAENNKVLRDAT----RAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISS 75 (149) Q Consensus 1 Mm~~~~~i~Gl-~~l~~~l~~l~~~~~~k~~~~al----~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~ 75 (149) |.. 
|+|+.| +++.+.|+...+++.+ .++.++ +.+++.++.++++.+|+.||.+.++......... T Consensus 1 M~~--i~id~La~~I~~~L~~Ys~~v~~-~v~~~v~~vak~a~~~lkk~i~~tspkrTG~YaK~W~~kk~~e~------- 70 (124) T protein:vir:95 1 MAK--IKIGRLADEITSQLRKYSQVIAD-DVEQIMDDVTKEAVGRLKSKIQEVGLVQTGDYMRGWTRKRVPNG------- 70 (124) T ss_pred Ccc--ccHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhHhcCcccccchhccceeeeecCc------- Confidence 666 455555 5588889888887643 335555 6666666677778999999999988765432100 Q ss_pred ceeeecccccccccceeEecCCCCCcceeeeeccCccC-----CCCCcchhHhHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 76 GVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTAN-----MTAHPFIRPAFDVRQEQATEVAIRRMNQ 141 (149) Q Consensus 76 ~i~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~-----~~a~PFl~pA~~~~k~~~~~~~~~~~~~ 141 (149) ..+.+.....-.|++|||..+ .+|+|+|+|+.+...+.+.+.|.+.|+. T Consensus 71 -----------------~~V~nk~~yqLtHLLE~GHAkr~GGRV~a~pHI~paee~~~~~l~~~i~~~l~~ 124 (124) T protein:vir:95 71 -----------------WVIHNKTEYRLAHLLEYGHATVDGGRVPGTPHIRPIEDWLEKEFEDRVEKAIKQ 124 (124) T ss_pred -----------------eeEEEcCCCceeeeeecceeccCCcccCCccchhHHHHHHHHHHHHHHHHHhcC Confidence 012222233348999999765 6899999999999888877777777776 No 98 >protein:vir:78335 Length: 133 # NCBI annotation: gp9 # Family: family:all:589 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468648;genbank:gi:157325225;genbank:GeneID:5601681 Probab=98.88 E-value=3.4e-11 Score=78.00 Aligned_cols=121 Identities=12% Similarity=0.225 Sum_probs=89.4 Q ss_pred eeeehHhHHHHHHHHHH-hHHHHHHHHHHHHHHHHHHHHHHHHHHhC--CcCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 4 TSLDFSGLNDIAKDLEA-LSRAENNKVLRDATRAGAEVLKEEVIARA--PVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 4 ~~~~i~Gl~~l~~~l~~-l~~~~~~k~~~~al~~~a~~v~~~ak~~a--P~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) |++++.|++||+++|++ |++....++..+||.++++.+.+..+.+. ..|||...+.+..+.... 
T Consensus 1 msvevkGv~eilk~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~------------- 67 (133) T protein:vir:78 1 MSVEVTGVEELERQLVSLFGRENLPQLVDPALIAGATLVAKTLKSEFVQFKDTGASIDEINIEKPSY------------- 67 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHHhhcchhcccceeeeEEecCeee------------- Confidence 89999999999999998 88887789999999999999999999864 568888877776553221 Q ss_pred cccccccccceeEecCC---CCCcceeeeeccCccC----CCCCc--chhHhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 81 GVNPRTGNSDNTMKANN---PRNAFYWRFVEMGTAN----MTAHP--FIRPAFDVRQEQATEVAIRRMNQAI 143 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~---~~~~~y~~~~E~GT~~----~~a~P--Fl~pA~~~~k~~~~~~~~~~~~~~l 143 (149) ..|. ..+..+| .+....-|..|||..+ ..|+- -+..|+++.+..+.+.++++|.+.| T Consensus 68 ----~~G~--r~V~i~W~gp~~R~~iVHLNE~GYtr~Gk~i~PrG~G~i~~a~~~se~~y~~~vk~el~k~l 133 (133) T protein:vir:78 68 ----DKGV--RSIKIDWKGPKDRYKIIHLNEYGYTRNGKKITPAGTGSVARSLRISERAYRAIVQKKIGDKL 133 (133) T ss_pred ----eCCc--eEEEEEEecCCCceeEEEeeccceecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHHHhhC Confidence 1111 1222222 2234456999999533 33443 5888899988888888888888777 No 99 >protein:vir:93898 Length: 133 # NCBI annotation: ORF028 # Family: family:all:589 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239942;genbank:gi:66395616;genbank:GeneID:5130964 Probab=98.80 E-value=8.1e-11 Score=75.92 Aligned_cols=121 Identities=12% Similarity=0.191 Sum_probs=82.0 Q ss_pred eeeehHhHHHHHHHHHHh-HHHHHHHHHHHHHHHHHHHHHHHHHHhCC--cCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 4 TSLDFSGLNDIAKDLEAL-SRAENNKVLRDATRAGAEVLKEEVIARAP--VRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 4 ~~~~i~Gl~~l~~~l~~l-~~~~~~k~~~~al~~~a~~v~~~ak~~aP--~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) |++++.|++||+++|++. ++....++..+||.++++.+.+..+.+.. .|||...+.+..+.... 
T Consensus 1 msvevkGv~eilk~le~k~G~~~~~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~------------- 67 (133) T protein:vir:93 1 MSVEIKGIPEVLKKLESVYGKQSMQAKSDRALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYT------------- 67 (133) T ss_pred CeEEEecHHHHHHHHHHhhCHhhhHhhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeee------------- Confidence 899999999999999875 55556789999999999999999999987 68888777766543211 Q ss_pred cccccccccceeEecCC---CCCcceeeeeccCccC----CCCCc--chhHhHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 81 GVNPRTGNSDNTMKANN---PRNAFYWRFVEMGTAN----MTAHP--FIRPAFDVRQEQATEVAIRRMNQ 141 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~---~~~~~y~~~~E~GT~~----~~a~P--Fl~pA~~~~k~~~~~~~~~~~~~ 141 (149) ..+.....+..+| .+....-|..|||..+ ..|+- -+..|+++.+..+.+.++++|++ T Consensus 68 ----~~g~~~rtV~i~W~gp~~R~~iVHLNE~Gytr~Gk~i~PrG~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:93 68 ----KVGSQERAVLIEWVGPMNRKNIIHLNEHGYTRDGKKYTPRGFGVIAKTLAANERKYREIIKKELAR 133 (133) T ss_pred ----ccCCcceEEEEEeecCCCceeEEEeeccceecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 1222222233333 2234456999999533 22333 36666766666666666655555 No 100 >protein:vir:80116 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425608;genbank:gi:155042941;genbank:GeneID:5469542 Probab=98.78 E-value=5.8e-11 Score=76.71 Aligned_cols=117 Identities=19% Similarity=0.273 Sum_probs=79.9 Q ss_pred CcceeeehHhH-HHHHHHHHHhHHHHHHHHHHHHH----HHHHHHHHHHHHHhCCcCCCcccccceeccccccccCcccc Q lcl|NC_019769. 1 MIETSLDFSGL-NDIAKDLEALSRAENNKVLRDAT----RAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISS 75 (149) Q Consensus 1 Mm~~~~~i~Gl-~~l~~~l~~l~~~~~~k~~~~al----~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~ 75 (149) |.. |+|+.| +++.+.|++..+++.+ .+..++ +.+++.++++++..+|+.||.+.++........ 
T Consensus 1 M~~--i~id~La~~I~~~L~~y~~~v~~-~v~~~v~evak~a~~~lkk~i~~tsPkrTG~YaK~W~~k~~~~-------- 69 (127) T protein:vir:80 1 MAN--IKIDRLGDEITRQLKRYSQVIAG-DLEQIMDDVSKEAVDRLKAKIEEEGLVQTGDYKRGWTRKRTPG-------- 69 (127) T ss_pred Ccc--ccHhhHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhhhcCccccccccccceeeeccC-------- Confidence 666 556666 5588899888887644 446666 555556666666799999999998875432110 Q ss_pred ceeeecccccccccceeEecCCCCCcceeeeeccCccC-----CCCCcchhHhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 76 GVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTAN-----MTAHPFIRPAFDVRQEQATEVAIRRMNQAID 144 (149) Q Consensus 76 ~i~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~-----~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~ 144 (149) . ..+.+.....-.|++|||..+ .+|+|+|+|+.+...+++.+.+.+.|+..=+ T Consensus 70 -------------~---~~v~nk~~yqLtHLLE~GHAkr~GGRV~a~pHI~paee~~~~~l~~~i~~~l~~~~~ 127 (127) T protein:vir:80 70 -------------G---WVIHNKTEYRLAHLLEYGHATVDGGRVPETPHIRPVEDWLEKEFEDRVERAIKNESR 127 (127) T ss_pred -------------c---eeEeecCCcceeehhhcceeccCCcccCCccchhhHHHHHHHHHHHHHHHHhcCCCC Confidence 0 112222222348999999765 6899999999999887777777776665544 No 101 >protein:vir:104347 Length: 145 # NCBI annotation: conserved phage-related protein # Family: family:all:448 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398975;genbank:gi:81343959;genbank:GeneID:3778879 Probab=98.63 E-value=9.2e-11 Score=75.60 Aligned_cols=132 Identities=17% Similarity=0.141 Sum_probs=74.7 Q ss_pred Ccce-----eeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceecccccc------- Q lcl|NC_019769. 1 MIET-----SLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSR------- 68 (149) Q Consensus 1 Mm~~-----~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~------- 68 (149) |++- +|..+ ++++..+++.- +...++..+..+..++...+|++||.++.|...+..... 
T Consensus 1 ~~~~m~~~~sF~~~-i~~~~~~ve~~--------~~~v~r~~a~~i~~~vv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~ 71 (145) T protein:vir:10 1 MARNIGSVVTFEKS-IADWIDRAEDG--------FGIVVSNTVIKTANAIVDLSPVDTGRFKANWQISANSPAQQSLNEY 71 (145) T ss_pred CCCcccchhccccC-HHHHHHHHHHH--------HHHHHHHHHHHHHHHHHHhCCccchhhccccceeeccccccccccc Confidence 3321 22212 23444444322 234567777778888899999999999999876532111 Q ss_pred -ccCccccceeeecccccccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 69 -RRGEISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAI 143 (149) Q Consensus 69 -~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l 143 (149) +.|........... ................+.+|+.++|||+|.|+|..|.+.++..- .+++.....+++++| T Consensus 72 d~~G~~t~~~~~~~~-~~i~~~k~g~~iyi~Nn~pYA~~LEyG~S~QAP~G~v~~~~~~~-~~~v~~~~~e~k~~~ 145 (145) T protein:vir:10 72 DQTGGQTKTYLARQA-RAVANSKATSVIYITNRLDYAADLEYGASNQAPAGVLGVVQARL-GRYFQEAVEEARRAI 145 (145) T ss_pred CCCCccchhhHHHHH-HHhhcccccceEEEeeCchhhhHhhccccCCCcchHHHHHHHHH-HHHHHHHHHHhhccC Confidence 11111100000000 00001111112233467899999999999999999999999776 445555556666666 No 102 >protein:vir:79638 Length: 146 # NCBI annotation: gp40 # Family: family:all:448 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285529;genbank:gi:148734512;genbank:GeneID:5219996 Probab=98.57 E-value=3.7e-10 Score=72.28 Aligned_cols=135 Identities=16% Similarity=0.131 Sum_probs=79.9 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) |++-++ -++...+.+..+.+ +..+..++++.+..+..++...+|++||.++.|...+..... 
.+......+ T Consensus 1 ma~~~~-----~sFa~~i~~~~~~v-e~~~~~~~r~~a~~i~~~vv~~sPVDTGr~Ranw~vs~~~~~-~~~~~~~dp-- 71 (146) T protein:vir:79 1 MADYSI-----REFHGNVDKWIEQV-ESGLNDVIQIFGEKVHGALVDIAPVDTGRFKANMQITANKPP-LYALNQYDP-- 71 (146) T ss_pred CCcchh-----HHHHHhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhCCCcchhhccccceeecCcc-cccccCCCC-- Confidence 666432 34555666666554 333456778888889999999999999999999877543211 111110000 Q ss_pred ccccc-----------ccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019769. 81 GVNPR-----------TGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAIDEAL 147 (149) Q Consensus 81 ~~~~~-----------~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~k~~ 147 (149) .+... ..............+.+|+.++|||++.|+|..|.+.++.+- .++++....++++ +..| T Consensus 72 ~G~~t~~~~~~~i~~~~~g~~~~~~iyi~NnlpYA~~LEyG~S~QAP~G~v~~~~~~~-~~~v~~a~~e~k~--~~~l 146 (146) T protein:vir:79 72 DGEKIKAEGRRTLYALLHGGGAIKSIYFSNMLIYANALEYGHSKQAPAGVFGIVAIRL-RSYMAEAIREARK--KNAL 146 (146) T ss_pred CCcccHHHHHHHHHHHHhcccccceeEEeeCchhhhhhhccccCCCcchHHHHHHHHH-HHHHHHHHHHHHh--hccC Confidence 00000 000111122333567899999999999999999999999654 3333333444444 2223 No 103 >protein:vir:7412 Length: 168 # NCBI annotation: hypothetical protein # Family: family:all:1029 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839929;genbank:gi:30089899;genbank:GeneID:1260686 Probab=98.57 E-value=8.3e-10 Score=70.36 Aligned_cols=137 Identities=15% Similarity=0.194 Sum_probs=95.9 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) |.++. ..|++++.++++|..+...+--.++..+||+++++.....+|...-. .+.....+|+.+.|.+. 
T Consensus 1 M~~~~---~~l~~~~~~vekl~~~lt~eqkakITkAGAkv~~~~L~~~t~~kHy~--------~k~t~~~~HLaDsI~~~ 69 (168) T protein:vir:74 1 MATFE---EAMQLIINQAESLSTKMTVEDKAEVTKAGAKVFEQALAYEVRNRHYR--------HRDTGEDPHLADSIVMK 69 (168) T ss_pred CccHH---HHHHHHHHHHHhhccCCCHHHHHHHHHhhhHHHHHHHHHHhHHhhcc--------cCCCcccchhhhheeec Confidence 77666 55778888888877542111123677889999999999888854311 12234456777877766 Q ss_pred cccccccccceeEecCCC--------CCcceeeeeccCcc------------------CCCCCcchhHhHHH--HHHHHH Q lcl|NC_019769. 81 GVNPRTGNSDNTMKANNP--------RNAFYWRFVEMGTA------------------NMTAHPFIRPAFDV--RQEQAT 132 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~--------~~~~y~~~~E~GT~------------------~~~a~PFl~pA~~~--~k~~~~ 132 (149) ..+.. +...+...+++. ..++.++|++.||. +|++-||+..+-+. .+++|+ T Consensus 70 ~~niD-g~~dG~s~VGf~~k~~~~~~~kA~iAr~lNDGTk~~~~~~~~~~~~~~~g~v~i~gDHFvd~~r~~~~~k~~V~ 148 (168) T protein:vir:74 70 NKNID-GVKDGQSVVGWERSTEKGTHTKGYIANIINNGSRFPQFTTRSGRKYKKPGEVAVHADHFIEETRMNLIVQQGIL 148 (168) T ss_pred ccccC-cccCCceeecccccccccccchhhhhhhhcccccccccccccccccccccccccccchhHHHHHhhhhhHHHHH Confidence 65433 222222333332 25788999999995 68999999999998 779999 Q ss_pred HHHHHHHHHHHHHHhcC Q lcl|NC_019769. 133 EVAIRRMNQAIDEALSK 149 (149) Q Consensus 133 ~~~~~~~~~~l~k~~~k 149 (149) ++..+++++-|++.-+- T Consensus 149 ~Ae~~~y~eIl~~k~~~ 165 (168) T protein:vir:74 149 KAEAEAMRKIINRKKKE 165 (168) T ss_pred HHHHHHHHHHHHhhcCC Confidence 99999999988887666 No 104 >protein:vir:1087 Length: 161 # NCBI annotation: Orf46 # Family: family:all:1029 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076741;genbank:gi:13095851;genbank:GeneID:920400 Probab=98.55 E-value=9.2e-10 Score=70.12 Aligned_cols=138 Identities=17% Similarity=0.210 Sum_probs=93.3 Q ss_pred Ccceeeeh-HhHHHHHHHHHHhHHHH--HHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccce Q lcl|NC_019769. 
1 MIETSLDF-SGLNDIAKDLEALSRAE--NNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGV 77 (149) Q Consensus 1 Mm~~~~~i-~Gl~~l~~~l~~l~~~~--~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i 77 (149) ||.-+--+ ..|+++++++++|..+. .++ .+...+||+++++.....+|...=. .+..+..+|+.+.| T Consensus 1 ~~~~~~~fdd~L~~~~~~v~klv~~lt~e~k--akIT~AGAkv~a~~L~~~T~~kHy~--------~~kt~k~~HLADsI 70 (161) T protein:vir:10 1 MMEEKQLFEDIMNGIIFQAESVSTSLTVEDK--AKITKAGANAFAIGLEKVTKDKHYR--------IRKTGENPHLADSI 70 (161) T ss_pred CcchhHHHHHHHHHHHHHHHhhcCCCCHHHH--HHHHHHhHHHHHHHHHHHhhhhcCc--------CCCCCCcchhhhhe Confidence 88764333 67888888888877432 223 3677889999999999998874311 12334566888887 Q ss_pred eeecccccccccceeEecCC-CCCcceeeeeccCc-----------------cCCCCCcchhHhHH--HHHHHHHHHHHH Q lcl|NC_019769. 78 HIRGVNPRTGNSDNTMKANN-PRNAFYWRFVEMGT-----------------ANMTAHPFIRPAFD--VRQEQATEVAIR 137 (149) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~-~~~~~y~~~~E~GT-----------------~~~~a~PFl~pA~~--~~k~~~~~~~~~ 137 (149) .+...+.. |...+...+++ ...++.+||++-|| .+|++-||+..+.+ +.+++|+++..+ T Consensus 71 ~~~~~niD-g~~dG~StVGw~~kka~ia~~indGtr~~~~~~~~~~~~n~Gt~~i~gDHFvd~~r~~~~~k~aV~~Ae~~ 149 (161) T protein:vir:10 71 LVQNTNID-GIKDGNSTVGWDYTKSRVGHLIENGTRFPMYSKKGTKYRKGGQVAITSDPFVSTYRDSMEAQVAMFSAEAE 149 (161) T ss_pred eecccccC-cccCCceeccccCchhhhhhhhcccchhhhhhcccccccCCcceeecCcchhHHHHhhhhhHHHHHHHHHH Confidence 77655433 22233333333 34466667666665 66999999999999 577999999999 Q ss_pred HHHHHHHHHhcC Q lcl|NC_019769. 
138 RMNQAIDEALSK 149 (149) Q Consensus 138 ~~~~~l~k~~~k 149 (149) ++++-|++.=+- T Consensus 150 ~y~eil~~k~~~ 161 (161) T protein:vir:10 150 VFSEILKKKGAE 161 (161) T ss_pred HHHHHHHhhcCC Confidence 888877765444 No 105 >protein:vir:103280 Length: 142 # NCBI annotation: phage-related hypothetical protein # Family: family:all:448 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277459;genbank:gi:71834102;genbank:GeneID:3562391 Probab=98.53 E-value=4e-10 Score=72.12 Aligned_cols=134 Identities=13% Similarity=0.152 Sum_probs=76.7 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccc--------cCc Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRR--------RGE 72 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~--------~~~ 72 (149) |++..+.+ ...+.+..+.+ +.....++++.+..+..++...+|++||.++.|...+...... .|. T Consensus 1 Ma~~~~sf------~~~i~~~~~~v-e~~~~~v~r~~a~~i~~~vv~~sPVdTGr~R~nw~vs~~~~~~~~~~~~d~~G~ 73 (142) T protein:vir:10 1 MANDVVSF------RNSINAWIDGV-TEGVELIVEGTLTKATKDIVKLSPVDTGRFRGNWQATGNSPAAQSLNNYDPDGN 73 (142) T ss_pred Cccchhhh------hccHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhCcccchhhcccceeeecCcccccccCcCCCCc Confidence 66532322 22333333333 2233456677777888888889999999999998775332211 111 Q ss_pred cccceeeecccccccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 73 ISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAI 143 (149) Q Consensus 73 ~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l 143 (149) ...+........ 
..............+.+|+.++|||+|.|+|..|++.++.+- .++++....++++.| T Consensus 74 ~t~~~~~~~~~~-i~~~~~g~~iyi~Nn~pYA~~LEyG~S~QAP~G~v~~a~q~~-~~~v~~a~~e~~~~~ 142 (142) T protein:vir:10 74 ETRNSLRRQIYA-LARDANTNVIYISNRLDYAQGLEFGSSNQAPSGVLGVVQKRL-GRYFAEAVQEAKRAL 142 (142) T ss_pred cchhhHHHHHHH-hhhccccceEEEeeCcchhhhhhccccCCCcchHHHHHHHHH-HHHHHHHHHHhhccC Confidence 110000000000 000011122233557899999999999999999999999654 455555556666666 No 106 >protein:vir:107703 Length: 147 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003902;genbank:gi:45686318;genbank:GeneID:2773043 Probab=98.50 E-value=9.3e-10 Score=70.10 Aligned_cols=139 Identities=15% Similarity=0.182 Sum_probs=76.8 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccc--------cCc Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRR--------RGE 72 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~--------~~~ 72 (149) |++-++ .++...+.+..+.+. .-+...+++.+..+...+...+|++||.++.|...+...... .|. T Consensus 1 ma~~~~-----~~F~~~i~~~~~~ve-~~~~~~~r~~a~~i~~~vv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~dp~g~ 74 (147) T protein:vir:10 1 MANYQI-----RRFQGEIDAWINAAE-STLEHAIEIFVRDVHDALVSRSPVDTGRFKGNWQITFNEIPNHALNRYDKTGG 74 (147) T ss_pred CCCcch-----hhhhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCCcchhhccccceeecCccccccCCcCCCcc Confidence 776554 345555666665543 334567788888899999999999999999998765332110 111 Q ss_pred cccceeeecccccccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019769. 73 ISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAIDEAL 147 (149) Q Consensus 73 ~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~k~~ 147 (149) ............-..............+.+|+.++|||++.|+|..|.+-++.+-.. 
++.....++++.=+ ++ T Consensus 75 ~t~a~~~~~~~~~~~~~~~~~~iyi~Nn~pYA~~LEyG~S~QAP~G~V~~t~q~~~~-~v~~~~~e~k~~~~-~~ 147 (147) T protein:vir:10 75 VVRGEEQAKTYGMFSRGGAITSVHFSNMLIYANALEYGHSQQAPSGVVGLVALRLRS-YMADAIKQARRQQN-AL 147 (147) T ss_pred chhhhhhHHHHHHhhhccCcceEEEeeCcchhhhhhccccCCCCchHHHHHHHHHHH-HHHHHHHHHHhhhc-cC Confidence 000000000000000001111223345789999999999999999999988855432 22222222222111 11 No 107 >protein:vir:99833 Length: 190 # NCBI annotation: hypothetical protein # Family: family:all:274 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164071;genbank:gi:56692603;genbank:GeneID:3192561 Probab=98.45 E-value=1e-09 Score=69.85 Aligned_cols=132 Identities=15% Similarity=0.125 Sum_probs=83.8 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhC-----CcCCCcccccce---------ecccc Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARA-----PVRTGKLKKNVV---------VVTQK 66 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~a-----P~~~g~l~~~i~---------~~~~~ 66 (149) ||.++++|+ +++|.+.|+.|.....+ .+..++.-|+.++...+++. |.. .-+.... ..... T Consensus 1 M~~i~i~~d-~~~~~~~L~~l~~~~~~--~~~l~~~ig~~l~~~~~~rf~~~~~PdG--~~W~p~~~~t~~rk~~~~~~~ 75 (190) T protein:vir:99 1 MAGITLEWD-GRRALDVLNAGSAALGD--PSGLLQDIGELLLNIHRRRFQAQVSPDG--TPWQPLSPAYLRRKRKNRDKI 75 (190) T ss_pred CceeEEEec-HHHHHHHHHHHHHHhhh--HHHHHHHHHHHHHHHHHHHHHhcCCCCC--CCCccccHHHHHHhhcCCCcc Confidence 999999997 58899999999876543 35778888888888877653 421 1111110 01111 Q ss_pred ccccCccccceeeecccccccccceeEecCCCCCcceeeeeccC------------------------------------ Q lcl|NC_019769. 67 SRRRGEISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVEMG------------------------------------ 110 (149) Q Consensus 67 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~G------------------------------------ 110 (149) ....|.+...|... .+. 
....+ .++..|+..++|| T Consensus 76 L~~tg~L~~Si~~~-----~~~-~~v~v---Gtn~~yA~iHq~Gg~i~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~ 146 (190) T protein:vir:99 76 LTLDGHLRNLLRYQ-----LDG-SELLF---GSDRPYAAIHHFGGTIQRQARSSTVYFRQNERTGEVGREFVPRRRSNFA 146 (190) T ss_pred ceecHHHHHHHhhe-----ecC-cEEEE---ecCcchhhhhhcCCcccccccchhhhhhhhhhhhhhhcccccccccccc Confidence 22233443333321 111 11111 3467888888888 Q ss_pred --------ccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019769. 111 --------TANMTAHPFIRPAFDVRQEQATEVAIRRMNQAIDEAL 147 (149) Q Consensus 111 --------T~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~k~~ 147 (149) |.++|+||||.-. ++.++++.+.+.+.|.+.|.+.+ T Consensus 147 ~~~~~~~~~v~IPaRpfLG~s-~~d~~~I~~~i~~~l~~~~~~~~ 190 (190) T protein:vir:99 147 QDVQIGPYTIQMPARPWLGTS-SQDDDTILQRVERYLQRALRERA 190 (190) T ss_pred hhcccccceeeecCcccCCCC-HHHHHHHHHHHHHHHHHHHhhcC Confidence 2457999999655 66778888888888888888888 No 108 >protein:vir:1028 Length: 168 # NCBI annotation: Orf48 # Family: family:all:1029 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076682;genbank:gi:13095791;genbank:GeneID:920342 Probab=98.45 E-value=2.3e-09 Score=67.93 Aligned_cols=135 Identities=16% Similarity=0.226 Sum_probs=91.8 Q ss_pred CcceeeehHhHHHHHHHHHHhHH--HHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCcccccee Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSR--AENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVH 78 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~--~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~ 78 (149) |.++.-. |+++++++++|+- .+.++ .++..+||+++++.....+|...-. .+.....+|+.+.|. 
T Consensus 1 M~~~~d~---l~~~~~~vekl~~~ls~eqk--akITkAGAkv~~~~L~~~tk~kHy~--------~k~t~~~~HLaDsI~ 67 (168) T protein:vir:10 1 MVSFYDA---MQLIVDRAEELSTKMSVEDK--AEVTKAGAKVFEQALAYEVRNRHYR--------HRDTGEDPHLADSIV 67 (168) T ss_pred CCcHHHH---HHHHHHHHHHhhcCCCHHHH--HHHhHhhhHHHHHHHHHHhhHhhhc--------cCCCCccchhhhhhe Confidence 7777655 4456666666531 12222 3567889999999999998864321 122344557777777 Q ss_pred eecccccccccceeEecCC-C-------CCcceeeeeccCcc------------------CCCCCcchhHhHHH--HHHH Q lcl|NC_019769. 79 IRGVNPRTGNSDNTMKANN-P-------RNAFYWRFVEMGTA------------------NMTAHPFIRPAFDV--RQEQ 130 (149) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~-~-------~~~~y~~~~E~GT~------------------~~~a~PFl~pA~~~--~k~~ 130 (149) +...+.. +...+...+++ . ..++.++|++.||. +|++-||+..+.+. .+++ T Consensus 68 ~~~~niD-g~~dG~s~VGf~~k~~~~~~~ka~iAr~lNDGTk~~~~~~~~~~~~~~~g~v~i~gDHFvd~~r~d~a~k~~ 146 (168) T protein:vir:10 68 MKNKNID-GVKDGQSVVGWERSTEKGTHTKGYIANIINNGSRFPQFTTRSGRKYKKPGEVAVHADHFIEETRKNPIVQQG 146 (168) T ss_pred ecccccc-cccCCceeecccCccccccccchheeeeccccccccccccccccccccccccccccchhHHHhhhchhhhHH Confidence 6655332 22222233333 2 36788999999995 68999999999996 5899 Q ss_pred HHHHHHHHHHHHHHHHhcC Q lcl|NC_019769. 131 ATEVAIRRMNQAIDEALSK 149 (149) Q Consensus 131 ~~~~~~~~~~~~l~k~~~k 149 (149) |+++..+++++-|++.-+- T Consensus 147 V~~Ae~~~y~eIl~~k~~~ 165 (168) T protein:vir:10 147 ILKAEAEAMRKIINRKKKE 165 (168) T ss_pred HHHHHHHHHHHHHHhhcCC Confidence 9999999999888887666 No 109 >protein:vir:94944 Length: 121 # NCBI annotation: hypothetical protein phage protein # Family: family:all:448 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239282;genbank:gi:66392064;genbank:GeneID:5076589 Probab=98.36 E-value=7.4e-10 Score=70.64 Aligned_cols=113 Identities=16% Similarity=0.177 Sum_probs=72.0 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceecccccc--------ccCc Q lcl|NC_019769. 
1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSR--------RRGE 72 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~--------~~~~ 72 (149) ||.|++..+ ++++..+++.-. ...++..+..+...+...+|+++|.++.|...+..... +.|. T Consensus 1 ~~~~sf~~~-i~~~~~~ve~~~--------~~~~r~~~~~~~~~vv~~sPVdtGrfRanw~vs~~~p~~~~~~~~dp~g~ 71 (121) T protein:vir:94 1 MISMKFNVN-LSRLRSNLREEA--------KKKAIRIAQEIVNGVIARSPVLAGDYRSSWNVSEGSMEFKFNNGGNPANP 71 (121) T ss_pred Cccchhhcc-HHHHHHHHHHHH--------HHHHHHHHHHHHHHHHHhcCCchhhhhccccccccCcccccCCCCCCCcc Confidence 999998866 666666655432 23445556667777888999999999998866532111 1111 Q ss_pred cccceeeecccccccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHH Q lcl|NC_019769. 73 ISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQ 128 (149) Q Consensus 73 ~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k 128 (149) ......+. ...+.. .......+.+|+..+|||+|.|+|..|++.++.+-+ T Consensus 72 ~t~~~~~~---~~~~~~---~~iyi~NnlpYA~~LE~G~S~QAP~G~v~~t~~~~q 121 (121) T protein:vir:94 72 TPAPAIVV---SSNVAL---PHFYITNGAPYAQQLEKGSSTQAPLGIVRVTLASLR 121 (121) T ss_pred hhHHHHHH---HHhhcc---ceEEEeeCcchhhhhhcccCCCCcchHHHHHHHhhC Confidence 11000000 000111 122335678999999999999999999999998766 No 110 >protein:vir:94994 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224022;genbank:gi:62327309;genbank:GeneID:5176822 Probab=98.30 E-value=2.8e-09 Score=67.46 Aligned_cols=124 Identities=16% Similarity=0.176 Sum_probs=70.8 Q ss_pred cceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeeec Q lcl|NC_019769. 2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRG 81 (149) Q Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~~ 81 (149) |++..++. ++..+.+.-. ..++++.+..+..++...+|++||.++.|...+...... +..... ... 
T Consensus 1 msF~~~i~---~~~~~ve~~~--------~~~~r~~a~~~~~~iv~~sPVdTGr~Ranw~vs~~~~~~-~~~~~~--d~~ 66 (131) T protein:vir:94 1 MSFALDVT---RFVEKAKKNP--------EKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGSTPAD-GTTDAT--DKS 66 (131) T ss_pred CCcccCHH---HHHHHHHHHH--------HHHHHHHHHHHHHHHHHhCCCchhhhhccchhccccccc-cccCCC--CCC Confidence 55555543 5555444322 344556666677777889999999999998665321110 000000 000 Q ss_pred ccc-------cccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 82 VNP-------RTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMN 140 (149) Q Consensus 82 ~~~-------~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~ 140 (149) +.. ...............+.+|+.++|||++.|+|..|.+-++..- .++++....+++ T Consensus 67 g~~t~~~~~~~i~~~~~g~~iyi~Nn~pYA~~LEyG~S~QAP~g~v~~~~~~~-~~~v~~~~~e~k 131 (131) T protein:vir:94 67 GNTATGNATSFVLNAADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRF-QQLLNEEASKVK 131 (131) T ss_pred chhhHHHHHHHHhhccccceEEEeeCchhhhhhhccccCCCcchHHHHHHHHH-HHHHHHHHHhcC Confidence 000 0000111112234557899999999999999999999999664 334444444444 No 111 >protein:vir:79091 Length: 175 # NCBI annotation: gp5, phage virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111205;genbank:gi:134288802;genbank:GeneID:4960765 Probab=98.29 E-value=5.5e-09 Score=65.85 Aligned_cols=136 Identities=15% Similarity=0.186 Sum_probs=81.8 Q ss_pred Ccc-eeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CCcCCCccccc-c------------- Q lcl|NC_019769. 1 MIE-TSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIAR-----APVRTGKLKKN-V------------- 60 (149) Q Consensus 1 Mm~-~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~-----aP~~~g~l~~~-i------------- 60 (149) |.. ++|+|++ +++.+.|++|.....+ .+.+++.-|+.++...+.+ .|.++. 
+..+ + T Consensus 1 Ms~~i~i~~d~-~~~~~~L~~l~~~~~d--~~~lm~~Ig~~l~~~t~~rF~~~~~PdW~p-ls~~t~~~r~~~~~~~~~~ 76 (175) T protein:vir:79 1 MSDFVNFQIDD-SALRTRLLQLEQAGHQ--KADAMRKITQALVLVTEDNFAAQGRPRWQA-LSEATIHMRVGGKKAYKKN 76 (175) T ss_pred CceEEEEEech-HHHHHHHHHHHHHhcC--HHHHHHHHHHHHHHHHHHHHHhcCCCCCCC-CChHHHHhhcccccccccc Confidence 555 5777776 7899999999876533 3577888888888877664 343211 0000 0 Q ss_pred -e---------eccccccccCccccceeeecccccccccceeEecCCCCCcceeeeeccCcc-------CCCCCcchhHh Q lcl|NC_019769. 61 -V---------VVTQKSRRRGEISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTA-------NMTAHPFIRPA 123 (149) Q Consensus 61 -~---------~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~-------~~~a~PFl~pA 123 (149) . .........|.+...|... .+. .. +...++..|+.++.||+. ++||||||.=. T Consensus 77 ~~~~~~~~~~~~~~~~L~~tG~L~~Si~~~-----~~~-~~---v~vGtn~~YAaiHqfGg~~~~~~~v~IPARPfLG~s 147 (175) T protein:vir:79 77 GELTAAASRRKAGLMILQDSGQMAASTATD-----SGE-DY---SVIGSNKEYAAIQHFGGQAGRGLKVTIPGRAWLPVT 147 (175) T ss_pred ccchhhHhhhccCCCcceechhhhhhhhhe-----ecC-CE---EEEecCcchhhHhhcccccCCCcccccCcccccCCC Confidence 0 0000111122232222221 111 11 122456789999999986 79999999843 Q ss_pred H-HHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019769. 124 F-DVRQEQATEVAIRRMNQAIDEALSK 149 (149) Q Consensus 124 ~-~~~k~~~~~~~~~~~~~~l~k~~~k 149 (149) - ++-..++.+.|.+.+.+.|++++++ T Consensus 148 ~~de~~~~~~~~I~~~i~~~l~~a~~~ 174 (175) T protein:vir:79 148 ADGELQPEAVEPVLNTILRHLMDAANR 174 (175) T ss_pred cccchhHHHHHHHHHHHHHHHHHHhcc Confidence 3 3445677788888888888888888 No 112 >protein:vir:78380 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110844;genbank:gi:134288605;genbank:GeneID:5179643 Probab=98.28 E-value=3.4e-09 Score=67.02 Aligned_cols=126 Identities=16% Similarity=0.164 Sum_probs=71.3 Q ss_pred cceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCcccccee--- Q lcl|NC_019769. 
2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVH--- 78 (149) Q Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~--- 78 (149) |++..++. ++..+.+.-. ...+++.+..+..++...+|++||.++.|...+...... +....... T Consensus 1 msf~~~i~---~~~~~ve~~~--------~~~~r~~a~~~~~~iv~~sPVdTGr~Ranw~vs~~~~~~-~~~~~~d~~g~ 68 (131) T protein:vir:78 1 MSFALDVS---KFVEKAKKNP--------EKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPAD-GTTDATDKAGT 68 (131) T ss_pred CCcCcCHH---HHHHHHHHHH--------HHHHHHHHHHHHHHHHHhCCCchhhhccccceecccccc-cccCCCCCCch Confidence 55555543 5555444322 345566666777777889999999999998765322111 10000000 Q ss_pred --eecccccccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 79 --IRGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMN 140 (149) Q Consensus 79 --~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~ 140 (149) ...+.................+.+|+..+|||+|.|+|..|.+.++..- .++++....+++ T Consensus 69 ~t~~~~~~~i~~~~~g~~iyi~Nn~pYA~~LEyG~S~QAP~G~v~~~~~~~-~~~v~~~~~e~k 131 (131) T protein:vir:78 69 TATSNAANFVLNAADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRF-QQLLNEEASKVK 131 (131) T ss_pred hhHHHHHHHHhhccCCceEEEeeCchhhhHhhccccCCCcchHHHHHHHHH-HHHHHHHHHhcC Confidence 0000000001111112234567899999999999999999999999654 344444444444 No 113 >protein:vir:101563 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958111;genbank:gi:41057657;genbank:GeneID:2716820 Probab=98.26 E-value=1.6e-09 Score=68.85 Aligned_cols=103 Identities=16% Similarity=0.118 Sum_probs=53.4 Q ss_pred eeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeeeccc Q lcl|NC_019769. 4 TSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVN 83 (149) Q Consensus 4 ~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~~~~ 83 (149) |++.-+||+++++.|+... ++--.|.+.+ +.... 
+..+..+ T Consensus 1 m~v~r~~L~~~~~~l~~~~----------------------V~VGi~~~a~------------y~d~~----g~~~~~g- 41 (155) T protein:vir:10 1 MSVTRRGLTLPKDRYKSMS----------------------VKAGVLAGAT------------YPDES----GKKLADG- 41 (155) T ss_pred CcchHHHHHHHHHHhhCCe----------------------eEEeecCCCC------------CCccc----cchhhhh- Confidence 6666677777766555310 0000111100 00000 0000000 Q ss_pred ccccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHH--HHHhcC Q lcl|NC_019769. 84 PRTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAI--DEALSK 149 (149) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l--~k~~~k 149 (149) .........+.+.+.++.++||||.++||||||||+++.++++..+.+...+...+ ++++.. T Consensus 42 ----~~~~~~~~~G~pva~ia~~~e~G~~~IP~RPFlr~t~~~~~~~~~~~l~~~~~~~~~~~~~L~~ 105 (155) T protein:vir:10 42 ----TILKKDPRAGLPVAMIAMALNYGTSKLPARPFMEKTIADRSAEWIKGLTVMMTMGYDAEVAMGQ 105 (155) T ss_pred ----hhhccccccCcchhhhhhhhhcCCCCCCCcchhHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHH Confidence 00000011123446788899999999999999999999999998877766554432 122222 No 114 >protein:vir:3994 Length: 168 # NCBI annotation: unknown # Family: family:all:1029 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116502;genbank:gi:14251135;genbank:GeneID:921309 Probab=98.26 E-value=1.3e-08 Score=63.90 Aligned_cols=135 Identities=16% Similarity=0.229 Sum_probs=91.9 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHH--HHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCcccccee Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSRAE--NNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVH 78 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~--~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~ 78 (149) |.++.- .|++++++++.|..+. .++ .+...+||+++++.....+|..... .+.....+|+.+.|. 
T Consensus 1 M~~~~d---~l~~~~~~v~kl~~~lt~e~k--akIT~AGAkv~a~~L~~~T~~kHy~--------~rktg~~~HLADsI~ 67 (168) T protein:vir:39 1 MVSFYD---AMQLIINQAESLSTKMTVEDK--AEVTKAGAKVFEQALAYEVRNRHYR--------HRDTGEDPHLADSIV 67 (168) T ss_pred CccHHH---HHHHHHHHHHhccCCCCHHHH--HHHHHHhHHHHHHHHHHHhHHhccc--------CCCCCCCccchhhee Confidence 666654 3567777787776332 222 3567889999999888888853311 123344578888887 Q ss_pred eecccccccccceeEecCCC--------CCcceeeeeccCcc------------------CCCCCcchhHhHHH--HHHH Q lcl|NC_019769. 79 IRGVNPRTGNSDNTMKANNP--------RNAFYWRFVEMGTA------------------NMTAHPFIRPAFDV--RQEQ 130 (149) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~--------~~~~y~~~~E~GT~------------------~~~a~PFl~pA~~~--~k~~ 130 (149) +...+.. +...+...+++. ..++.++|++-||. +|++-||+..+.+. .+++ T Consensus 68 ~~~~niD-g~~dG~StVGw~~k~~~~~~~~a~iAr~lNDGTrf~~~~~~~~~~y~~~g~v~i~gDHFvd~~r~~~a~k~a 146 (168) T protein:vir:39 68 MKNKNID-GVKDGQSVVGWERSTEKGTHTKGYIANIINNGSRFPQFTTRSGRKYKNPGEVAVHADHFIEETRKNPIVQQG 146 (168) T ss_pred ecccccC-cccCCceeccccCccccccccchhheehhccccccchhhhhcccccccccceeecccchhHHHhhhhhhhHH Confidence 7666433 222333333332 36888999999994 68999999999995 5899 Q ss_pred HHHHHHHHHHHHHHHHhcC Q lcl|NC_019769. 131 ATEVAIRRMNQAIDEALSK 149 (149) Q Consensus 131 ~~~~~~~~~~~~l~k~~~k 149 (149) |+++..+++++-|++.=+- T Consensus 147 V~~Ae~e~~~eil~~k~~~ 165 (168) T protein:vir:39 147 ILKAEAEAMRKIINRKKKE 165 (168) T ss_pred HHHHHHHHHHHHHHhcCCC Confidence 9999998888877765444 No 115 >protein:vir:1988 Length: 156 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050635;genbank:gi:9633522;genbank:GeneID:2636282 Probab=98.25 E-value=3.6e-09 Score=66.87 Aligned_cols=130 Identities=15% Similarity=0.228 Sum_probs=76.3 Q ss_pred cceeeehH-hHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhC-----CcCCCcccccceec------------ Q lcl|NC_019769. 
2 IETSLDFS-GLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARA-----PVRTGKLKKNVVVV------------ 63 (149) Q Consensus 2 m~~~~~i~-Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~a-----P~~~g~l~~~i~~~------------ 63 (149) |++.++|+ .+++|.+.|.+|...... +..++.-++.++...+.+- |. .|.-+.....+ T Consensus 1 ms~~i~~~~d~~~l~~~L~~l~~~~~~---~~l~~~Ig~~l~~~~~~rf~~~~~Pd-~G~~W~pls~~t~~~r~~~~~~~ 76 (156) T protein:vir:19 1 MSLDMNVAVDVRRIQLALDELGTVTRD---RAIPRVMAAALLSSTEQAFERQADPD-TGKGWEAWSDSWLAWRQDHGFVP 76 (156) T ss_pred CeEEEEEeecHHHHHHHHHHHHhhhcc---HHHHHHHHHHHHHHHHHHHHhcCCCC-CCCCCcccChHHHHHhhccCCCC Confidence 77877776 677899999988654321 3566777777777766553 32 12222111111 Q ss_pred cccccccCccccceeeecccccccccceeEecCCCCCcceeeeeccCcc--------CCCCCcchhHhHHHHHHHHHHHH Q lcl|NC_019769. 64 TQKSRRRGEISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTA--------NMTAHPFIRPAFDVRQEQATEVA 135 (149) Q Consensus 64 ~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~--------~~~a~PFl~pA~~~~k~~~~~~~ 135 (149) .......|.+...+... .+. ....+ .++..|+..++||+. ++||||||. --+..++++.+.+ T Consensus 77 ~~~L~~tg~L~~Si~~~-----~~~-~~v~v---Gt~~~yA~vHqfG~~~~~~~~~~~iPaRpfLG-~s~~d~~~I~~~i 146 (156) T protein:vir:19 77 GSILTLHGDLARSITTD-----YGQ-DYALI---GSPKIYAAIHQWGGTPDMAPRPAGVPARPYMG-LDKTGEQEIFDAI 146 (156) T ss_pred CcchhhhHHHHHHhhhe-----ecC-CEEEE---ecchhhhHHhhcCcccccCCCccccCCccccC-CCHHHHHHHHHHH Confidence 11112223333333221 111 11111 346799999999975 599999994 5567777777777 Q ss_pred HHHHHHHHHH Q lcl|NC_019769. 
136 IRRMNQAIDE 145 (149) Q Consensus 136 ~~~~~~~l~k 145 (149) .+.|.+.+++ T Consensus 147 ~~~l~~~~~~ 156 (156) T protein:vir:19 147 RKRVSAALRQ 156 (156) T ss_pred HHHHHHHhhC Confidence 7777777666 No 116 >protein:vir:96012 Length: 133 # NCBI annotation: ORF023 # Family: family:all:589 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239805;genbank:gi:66395471;genbank:GeneID:5132929 Probab=98.23 E-value=2.9e-08 Score=61.88 Aligned_cols=120 Identities=16% Similarity=0.143 Sum_probs=84.8 Q ss_pred CcceeeehHhHHHHHHHHHH-hHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--cCCCcccccceeccccccccCccccce Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEA-LSRAENNKVLRDATRAGAEVLKEEVIARAP--VRTGKLKKNVVVVTQKSRRRGEISSGV 77 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~-l~~~~~~k~~~~al~~~a~~v~~~ak~~aP--~~~g~l~~~i~~~~~~~~~~~~~~~~i 77 (149) |. +|.|++||+++|++ |++....+++.+||.++++.+.+..+.+.- .|||...+.+..+... T Consensus 1 m~----evkGv~eilk~lE~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGatidev~~s~p~----------- 65 (133) T protein:vir:96 1 MR----LIYDTKKLERELEKRLSKRALMRITDRALTEAGEVVLEAIRTNLKYFRDTGAEYGEVKLSKPT----------- 65 (133) T ss_pred Cc----cccCHHHHHHHHHHhcCHHHHHHHhhHHHHHHHHHHHHHHHHhhHHHhhccceeeeEEecCce----------- Confidence 33 56999999999975 667777899999999999999999998765 4777776665544321 Q ss_pred eeecccccccccceeEecCC---CCCcceeeeeccCcc-----CCCCCc--chhHhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 78 HIRGVNPRTGNSDNTMKANN---PRNAFYWRFVEMGTA-----NMTAHP--FIRPAFDVRQEQATEVAIRRMNQAI 143 (149) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~---~~~~~y~~~~E~GT~-----~~~a~P--Fl~pA~~~~k~~~~~~~~~~~~~~l 143 (149) ...| ...+..+| .+....-|..|||+. 
+..|+- -+..|+++.+..+.+.++++|++.| T Consensus 66 ------~~~g--~rtV~i~W~gp~~R~~iVHLNE~G~ytr~Gk~i~PrG~G~I~~al~~se~~y~~~vk~el~kll 133 (133) T protein:vir:96 66 ------WENG--KRTIRVYWEGEKHRYSIVHLNEKGFYAKDGKFIRPKGMGAIDKALRASRDKFFKVYAEEVSKLL 133 (133) T ss_pred ------ecCC--ceEEEEEeecCCCceeeEeeecccceecCCceeccchhhHHHHHHHhhhHHHHHHHHHHHHHhC Confidence 1111 12222233 223445689999943 234444 5889999999998888888888888 No 117 >protein:vir:95157 Length: 144 # NCBI annotation: hypothetical protein ORF019 # Family: family:all:448 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293426;genbank:gi:148912847;genbank:GeneID:5228232 Probab=98.20 E-value=4e-09 Score=66.64 Aligned_cols=130 Identities=17% Similarity=0.131 Sum_probs=74.5 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) |++..+++ ...++...+.+ +..+...++..|..+...+...+|++||.++.|...+..... .+......... T Consensus 1 MA~~~~~f------~~~i~~~~~~v-e~~~~~~~r~~a~~v~~~vv~~sPVDTGrfRanw~vs~~~p~-~~~~~~~~~~~ 72 (144) T protein:vir:95 1 MAKSLLDL------ADRLEKKAKAI-DEAASQNAVDTALAIVGDLAYKTPVDTSQALSNWIVTLESPS-GQQIKPHFPGS 72 (144) T ss_pred Cchhhhhh------hhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhCCccchhhccccceeccccc-ccccccccccc Confidence 66643333 34445555444 345567788888888899999999999999999876643211 11000000000 Q ss_pred cc--------------cccccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 81 GV--------------NPRTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAID 144 (149) Q Consensus 81 ~~--------------~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~ 144 (149) .. 
.................+.+|+..+|||+|.|+|..|.+.++.+-..-+.+ ++- ++ T Consensus 73 ~~~t~d~sg~~tl~~~~~vi~~~~~g~~iyi~NnlpYA~~LEyG~S~QAP~G~vr~~~q~~~~~v~~-~~~-----~~ 144 (144) T protein:vir:95 73 QGSTQRASAAETLNSAKLVLRNKKPGQAIFITNNLPYIRRLNDGYSAQAPAGFVERAVLIGRKMRKK-FKI-----KD 144 (144) T ss_pred ccccCCCchhHHHHHHHHHHhhcCccceEEEeeCchhhhhhhccccCCCcchHHHHHHHHHHHHHHh-hcc-----CC Confidence 00 000011111122234567899999999999999999999999665433322 211 00 No 118 >protein:vir:5257 Length: 148 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852762;genbank:gi:31544037;uniprot:Q7Y5T8;genbank:GeneID:2753554 Probab=98.20 E-value=2e-09 Score=68.24 Aligned_cols=94 Identities=19% Similarity=0.370 Sum_probs=55.5 Q ss_pred cceee--ehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceee Q lcl|NC_019769. 2 IETSL--DFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 2 m~~~~--~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~ 79 (149) |++++ +..|+++|++.|++|.+.. + +--.|.+.. T Consensus 1 M~~~~k~~~~~~~~l~~~l~~l~~~~---v----------------~VGi~~~~~------------------------- 36 (148) T protein:vir:52 1 MAVTVTANFSAAKQLIEQMKSLKEKA---V----------------YVGFPAEFD------------------------- 36 (148) T ss_pred CccccccccHHHHHHHHHHHHhhCCe---E----------------EEEeecCcC------------------------- Confidence 66544 4457888888888774310 0 000010000 Q ss_pred ecccccccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHHH--HHhcC Q lcl|NC_019769. 80 RGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAID--EALSK 149 (149) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~--k~~~k 149 (149) .......+.+.+.++.++||||.+.||||||||+++.++++..+.+...+...++ +++.. 
T Consensus 37 ----------~~~~~~~g~~vA~ia~~~E~G~~~IP~Rpflr~t~~~~~~~~~~~~~~~~~~~~~~~~~L~~ 98 (148) T protein:vir:52 37 ----------EKVKGSENFNLASLAAVLEFGNEHIPARPFLRQTLEENQEKYTALFIQWFDQGVPAAQIYER 98 (148) T ss_pred ----------CCCCCCCCCCHHHHHHHHhcCCCCCCCcchhHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHH Confidence 0000011234578899999999999999999999999999888777655543221 11111 No 119 >protein:vir:3163 Length: 145 # NCBI annotation: unknown # Family: family:all:28417 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665934;genbank:gi:22091120;genbank:GeneID:951270 Probab=98.16 E-value=1.3e-08 Score=63.86 Aligned_cols=131 Identities=18% Similarity=0.201 Sum_probs=66.2 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CCcCCCcccccceecc-------cccc Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIAR-----APVRTGKLKKNVVVVT-------QKSR 68 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~-----aP~~~g~l~~~i~~~~-------~~~~ 68 (149) |.... +++.+.|++|.... ..+|...+..+.+.+..+ .|. |.-+.....++ .... T Consensus 1 ~i~~~------~~i~~~l~~l~~~~-----~~~l~~i~~~~~~~~~~rf~~~~~p~--G~~W~pLs~st~a~k~~~~~L~ 67 (145) T protein:vir:31 1 MVEDE------NNIPEAREAIQDGL-----TDGLERLHTITLRELITNMSDGQDAL--GNPWEPLKESTIRAKGSDTPLI 67 (145) T ss_pred CcccH------HHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHhcCCCC--CCCCcccChHHHHHhcCCCCCc Confidence 43332 34445555554332 234444555555554443 332 21121111111 1112 Q ss_pred ccCccccceeeecccccccccceeEecCCCCCcceeeeeccCccC--CCCCcchhHhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 69 RRGEISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTAN--MTAHPFIRPAFDVRQEQATEVAIRRMNQAIDEA 146 (149) Q Consensus 69 ~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~--~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~k~ 146 (149) ..|.+...+.... 
........+...++..|+.+++|||.+ +||||||.|+..-..+++.+++.+.+...|..+ T Consensus 68 ~tG~L~~Si~~~~-----~~~~~~~~a~vGtn~~YA~~hqfG~~~~~IPaRPfLG~~~~~~~~~~~~ii~~~i~~~L~~~ 142 (145) T protein:vir:31 68 DNSRLLTDINAAS-----MMDRANRMAVIGTNLDYAEHHEFGAPEAGIPARPIFGPAGAYASQQAPDVIGDEIDTNLEGA 142 (145) T ss_pred cCHHHHHHHHHHh-----hhcccCceeEecCCchhhhhhccCCcccccCCCCccCCCccchHHHHHHHHHHHHHHHhhhh Confidence 2222222221110 000011112234677999999999976 999999999987777777777776666666655 Q ss_pred hcC Q lcl|NC_019769. 147 LSK 149 (149) Q Consensus 147 ~~k 149 (149) +-- T Consensus 143 ~~~ 145 (145) T protein:vir:31 143 VID 145 (145) T ss_pred ccC Confidence 555 No 120 >protein:vir:77650 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:YP_022741;genbank:gi:47835022;genbank:GeneID:2821447 Probab=98.12 E-value=2.6e-09 Score=67.67 Aligned_cols=103 Identities=17% Similarity=0.114 Sum_probs=52.1 Q ss_pred eeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeeeccc Q lcl|NC_019769. 4 TSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVN 83 (149) Q Consensus 4 ~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~~~~ 83 (149) |++.-.||+.+++.|+... .+--.|.+.+. . +........ T Consensus 1 m~~~r~~l~~~~~~l~~~~----------------------v~VGi~~~a~y------------~------d~~~~~~~~ 40 (155) T protein:vir:77 1 MSVTRRGLTLPKDRYRSMS----------------------VKAGVLAGATY------------P------DESGKKLAD 40 (155) T ss_pred CcchHHHHHHHHHHHhcCc----------------------eEEeecCCCCC------------c------cccchhhhh Confidence 5555566666655544310 01111111100 0 000000000 Q ss_pred ccccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHHH--HHhcC Q lcl|NC_019769. 84 PRTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAID--EALSK 149 (149) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~--k~~~k 149 (149) +.........+.+.+.++.++||||.++||||||||+++.++++..+.+...+...++ +++.. 
T Consensus 41 ---~~~~~~~~~~G~pva~ia~~~e~G~~~IP~RPFlr~t~~~~~~~~~~~l~~~~~~~~~~~~~L~~ 105 (155) T protein:vir:77 41 ---GSILKKDPRAGLPVAMIAMALNYGTSKLPARPFMEKTIADRSAEWIKGLTVMMTMGYDAEVAMGQ 105 (155) T ss_pred ---hhhccccccccccHhhhhhhhhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHccCcHHHHHHH Confidence 0000000111234467889999999999999999999999999988877765544321 12222 No 121 >protein:vir:103841 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938236;genbank:gi:38229141;genbank:GeneID:2648156 Probab=98.07 E-value=3e-08 Score=61.84 Aligned_cols=133 Identities=16% Similarity=0.156 Sum_probs=74.5 Q ss_pred Ccc-eeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhC-CcCCCcccccceec------------ccc Q lcl|NC_019769. 1 MIE-TSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARA-PVRTGKLKKNVVVV------------TQK 66 (149) Q Consensus 1 Mm~-~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~a-P~~~g~l~~~i~~~------------~~~ 66 (149) |.. +++++. .+++.+.|++|.....+ ....++..++.++...+.+- |. |.-.....+. .+. T Consensus 1 Ms~~i~i~~~-~~~~~~~L~~l~~~~~~--~~~l~~~ig~~l~~~~~~rF~p~--G~~W~plsp~t~~~r~k~g~~~~~~ 75 (155) T protein:vir:10 1 MANRIELELV-DREVQERLAALYAAVTD--TLPLMRGIAAELLAETEFAFMDE--GPGWPQLSPVTVAARAAKGRGAHPI 75 (155) T ss_pred CCceEEEEec-hHHHHHHHHHHHHHhhh--HHHHHHHHHHHHHHHHHHHHhhc--CCCCCCCCccchHHHHhccCCCCCc Confidence 442 455555 36799999999876532 36778888888888887765 32 2222111111 111 Q ss_pred ccccCccccceeeecccccccccceeEecCCCCCcceeeeeccCcc-------CCCCCcchh-HhHHHHHHHHHHHHHHH Q lcl|NC_019769. 67 SRRRGEISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTA-------NMTAHPFIR-PAFDVRQEQATEVAIRR 138 (149) Q Consensus 67 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~-------~~~a~PFl~-pA~~~~k~~~~~~~~~~ 138 (149) ....|.+...|.... +. ... ...++..|+..++||+. .+||||||. ..-++-++++.+.|.+. 
T Consensus 76 L~~tG~L~~Si~~~~-----~~-~~v---~vGtn~~YA~iHqfGg~~~~~~~~~iPARPfLG~s~~~e~~~ei~~~I~~~ 146 (155) T protein:vir:10 76 LQVTNALARSITTRA-----DR-DQA---QIGSNLSYAAIQQLGGQAGRGRKVTIPARPYLPVLRNGQLKPSARDAVLDV 146 (155) T ss_pred cccchhhhhhhhcee-----cC-CEE---EEecCcchhhhhhcccccCCCCccccCCccccCCCccccchHHHHHHHHHH Confidence 222344444333221 11 111 12356789999999974 699999997 33333345555555555 Q ss_pred HHHHHHHHh Q lcl|NC_019769. 139 MNQAIDEAL 147 (149) Q Consensus 139 ~~~~l~k~~ 147 (149) +.+.|.+-- T Consensus 147 i~~~l~~~r 155 (155) T protein:vir:10 147 LLAALSQGR 155 (155) T ss_pred HHHHHhhcC Confidence 555554443 No 122 >protein:vir:4096 Length: 140 # NCBI annotation: Gp9 protein # Family: family:all:28682 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510990;swissprot:trembl:q8w600;genbank:gi:17488512;uniprot:Q8W600;genbank:GeneID:1260318 Probab=98.04 E-value=8e-08 Score=59.50 Aligned_cols=132 Identities=16% Similarity=0.242 Sum_probs=92.3 Q ss_pred Ccc-eeeehHhHHHHHHHHHHhHHHHHHHHHHHHHH-HHHHHHHHHHHHhCCcCCCc---ccccceeccccccccCcccc Q lcl|NC_019769. 1 MIE-TSLDFSGLNDIAKDLEALSRAENNKVLRDATR-AGAEVLKEEVIARAPVRTGK---LKKNVVVVTQKSRRRGEISS 75 (149) Q Consensus 1 Mm~-~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~-~~a~~v~~~ak~~aP~~~g~---l~~~i~~~~~~~~~~~~~~~ 75 (149) |.. .++++.++++|++.++++|.+. +++...+|. .|+..+.+.+....|++.+. ++... |+.. T Consensus 1 m~~~~sld~s~~e~L~~~i~r~P~ks-E~~IN~~L~tkg~~~~~~~I~~~iPvS~~~k~~~RnK~-----------HAK~ 68 (140) T protein:vir:40 1 MCAKWSLEFSDVERLSNLISQIPNKS-EAIINKTLETKAVPLVKLNIEKRINLSKNWKGQLLNKN-----------HAQS 68 (140) T ss_pred CCcceecchhhHHHHHHHHHhccchH-HHHHHHHHHhhhhHHHHhhhhhccCcCccchhhhcccc-----------chhh Confidence 655 7999999999999999999875 677777775 56677788888999998532 11111 0000 Q ss_pred ceeeecccccccccceeEecCCCCCcceeeeecc--CccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019769. 
76 GVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVEM--GTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAIDEALSK 149 (149) Q Consensus 76 ~i~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~--GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~k~~~k 149 (149) . .+.+.....-.... ..+...-|-.|... ||++-.||.||+..++..-+.+++.+.+++.++|++.++= T Consensus 69 s---~pl~~~~~NLgf~i--~~k~kf~YLvfPD~G~G~sn~~~q~FmerGl~~~t~~i~E~L~~~l~k~in~~Lgg 139 (140) T protein:vir:40 69 S---GPFNVKMGNLGFEL--LTKPKFNYLIFPDQGIGKHNKTKQDFMQLGVEESSQEIVEMLEQAVFKEINDTLGG 139 (140) T ss_pred h---hhhhhhhhhcceeE--eecCcccccccccccCCCCCcchHHHHHhccccchhHHHHHHHHHHHHHHHHhhcC Confidence 0 00001111111111 11445667778775 6888889999999999999999999999999999999987 No 123 >protein:vir:80425 Length: 134 # NCBI annotation: BcepGomrgp15 # Family: family:all:448 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210235;genbank:gi:146329927;genbank:GeneID:5123534 Probab=97.95 E-value=2.5e-08 Score=62.29 Aligned_cols=126 Identities=15% Similarity=0.198 Sum_probs=67.8 Q ss_pred cceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCcccccee--- Q lcl|NC_019769. 2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVH--- 78 (149) Q Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~--- 78 (149) |++..++. ++.++++.-. ...++..+..+..++...+|++||.++.|...+....... ....... T Consensus 1 msF~~~i~---~~~~~ve~~~--------~~~~r~~a~~~~~~vv~~sPVdTGr~Ranw~vs~~~~~~~-~~~~~d~~g~ 68 (134) T protein:vir:80 1 MSYTDRFN---VIAKGIEDNV--------DNLVKNVALAIGSNVIADTPILTGQARRNWQTELNQMPES-VLDIPESPSE 68 (134) T ss_pred CCcccCHH---HHHHHHHHHH--------HHHHHHHHHHHHHHHHHhCCCcchhhhcccceeecCcccc-cccCcCCCCc Confidence 66655543 5554444322 3445666666777778899999999999987653221110 0000000 Q ss_pred -----eecccccccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019769. 
79 -----IRGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAIDEALSK 149 (149) Q Consensus 79 -----~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~k~~~k 149 (149) .....................+.+|+.++|||+|.|+|..|.+-+..+-..-+.+ + |++-| T Consensus 69 ~~~~~~~~~~~vi~~~k~g~~iyi~Nn~pYA~~LEyG~S~QAP~G~v~~t~~~~~~~v~~-~---------~~~~~ 134 (134) T protein:vir:80 69 GMDEALQVLQQTVGQYKAGDTVHITNNAPYIKELNSGSSQQAPANFVETSIMRATRLIRN-V---------KVVPQ 134 (134) T ss_pred cchhhHHHHHHHHhhccCcceEEEeeCchhhhhhhccccCCCcchHHHHHHHHHHHHHHh-h---------ccCCC Confidence 0000000001111112234567899999999999999999999887554322222 1 11222 No 124 >protein:vir:97190 Length: 148 # NCBI annotation: hypothetical protein ORF030 # Family: family:all:448 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294538;genbank:gi:149408259;genbank:GeneID:5237055 Probab=97.94 E-value=2e-08 Score=62.74 Aligned_cols=133 Identities=13% Similarity=0.082 Sum_probs=69.5 Q ss_pred Ccce-eeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceee Q lcl|NC_019769. 1 MIET-SLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 1 Mm~~-~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~ 79 (149) |.++ +|.-+ ++++.++++. .+...++..+..+...+....|++||.++.|...+...... +.+...... T Consensus 1 m~~~~sFa~~-i~~~~~~ve~--------~~~~~~r~~a~~i~~~vv~~sPVdTGrfRanw~vs~~~p~~-~~~~~~dp~ 70 (148) T protein:vir:97 1 MPSLSEFSRR-ITLRGRKVAE--------GADALTRKVALAADQAVVSGTPVDTGRARSNWIAAIGSAPS-SVIDAYSPG 70 (148) T ss_pred CCccchhccc-HHHHHHHHHH--------HHHHHHHHHHHHHHHHHHHhCCCcchhhhhhhheeeccccc-ccccccCCC Confidence 5554 34333 3344443332 22345566666777778889999999999998765322111 000000000 Q ss_pred ecccc--------------cccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 
80 RGVNP--------------RTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAIDE 145 (149) Q Consensus 80 ~~~~~--------------~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~k 145 (149) ..+.. ...............+.+|+..+|||.+.|+|..|++-++..-.+-+ ++ .+.+++ T Consensus 71 ~~G~~~~~~~~~~i~~~~~vi~~~k~g~~iyi~NnlpYA~~LEyG~S~QAP~G~v~~t~~~~~~~v----~~--~~~~~~ 144 (148) T protein:vir:97 71 EAGSTEAANTQAAIDQAESVIRGYNYGEEIHITNNLPYIQRLNDGYSAQAPANFVEQAVLEAVQVV----QF--GRVVDG 144 (148) T ss_pred CCCcccccchhHHHHHHHHHhhccCCCceEEEeecchhhhHhhccccCCCcchHHHHHHHHHHHHH----Hh--hhhhcC Confidence 00000 00011111122345678999999999999999999999986543322 11 112222 Q ss_pred HhcC Q lcl|NC_019769. 146 ALSK 149 (149) Q Consensus 146 ~~~k 149 (149) .=+- T Consensus 145 ~~~~ 148 (148) T protein:vir:97 145 DPGS 148 (148) T ss_pred CCCC Confidence 2222 No 125 >protein:vir:79225 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469157;genbank:gi:157835000;genbank:GeneID:5648806 Probab=97.93 E-value=8.7e-08 Score=59.28 Aligned_cols=133 Identities=13% Similarity=0.111 Sum_probs=72.0 Q ss_pred Ccc-eeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhC-CcCCCcccccceec------------ccc Q lcl|NC_019769. 1 MIE-TSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARA-PVRTGKLKKNVVVV------------TQK 66 (149) Q Consensus 1 Mm~-~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~a-P~~~g~l~~~i~~~------------~~~ 66 (149) |.. ++|++.+ +++.+.|.+|...+.+ .+..++.-++.++...+.+- |. |.-......+ ... 
T Consensus 1 M~~~i~i~~d~-~~~~~~L~~l~~~~~d--~~~l~~~ig~~l~~~~~~rF~~e--G~~W~pls~~t~~~r~~~g~~~~~i 75 (155) T protein:vir:79 1 MTTRIDVELDD-QEVRQRLAVLMRSVTD--TLPVMRGIAAELLAETEFAFMDE--GPGWPQLSPATVAAREAKGRGPHPI 75 (155) T ss_pred CceEEEEEech-HHHHHHHHHHHHHhhh--HHHHHHHHHHHHHHHHHHHhhcc--CCCCCCCCHHHHHHHhccCCCCCCc Confidence 433 4566665 6899999999876543 36778888888888888775 32 3222111111 111 Q ss_pred ccccCccccceeeecccccccccceeEecCCCCCcceeeeeccCcc-------CCCCCcchhHhHH-HHHHHHHHHHHHH Q lcl|NC_019769. 67 SRRRGEISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTA-------NMTAHPFIRPAFD-VRQEQATEVAIRR 138 (149) Q Consensus 67 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~-------~~~a~PFl~pA~~-~~k~~~~~~~~~~ 138 (149) ....|.+...|.+. .+. .. +...++..|+..++||+. ++||||||.=.-+ ....++.+.|.+. T Consensus 76 L~~tG~L~~Si~~~-----~~~-~~---v~vGt~~~YA~iHqfGg~~~~~~~v~iPaRpfLG~s~~~~l~~~~~~~I~~~ 146 (155) T protein:vir:79 76 LQVTNALARSVTTW-----ADR-NE---AGIGSNLVYAAIHQFGGDAGRGHQVEIPARRYLPFDENGQLAAGARQSILEV 146 (155) T ss_pred cccchhhhhhhhce-----ecC-CE---EEEecCchhhhhhhcccccCCCCccccCCccccCCCCccccchHHHHHHHHH Confidence 22233343333222 111 11 122456799999999975 7999999963322 2223333333333 Q ss_pred HHHHHHHHh Q lcl|NC_019769. 139 MNQAIDEAL 147 (149) Q Consensus 139 ~~~~l~k~~ 147 (149) +.+.|.+.- T Consensus 147 i~~~l~r~r 155 (155) T protein:vir:79 147 VLTALSRNR 155 (155) T ss_pred HHHHHHhcC Confidence 333333333 No 126 >protein:vir:107851 Length: 175 # NCBI annotation: gp31 # Family: family:all:274 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024704;genbank:gi:48696941;genbank:GeneID:2845939 Probab=97.88 E-value=1.4e-07 Score=58.24 Aligned_cols=137 Identities=16% Similarity=0.187 Sum_probs=76.4 Q ss_pred Ccc-eeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CCcCCCcccccce------------- Q lcl|NC_019769. 1 MIE-TSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIAR-----APVRTGKLKKNVV------------- 61 (149) Q Consensus 1 Mm~-~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~-----aP~~~g~l~~~i~------------- 61 (149) |.. ++++|+. 
++|.+.|++|.....+ .+..++.-++.++...+.+ .|.++.--...+. T Consensus 1 Ms~~i~i~~~~-~~l~~~L~~l~~~~~d--~~~l~~~Ig~~l~~~t~~rF~~e~~Pdw~p~~p~t~~~r~~~g~~~~k~~ 77 (175) T protein:vir:10 1 MSDFVNFQIDD-SALRTRLLQLEQAGHQ--KAGAMRKIAQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKNG 77 (175) T ss_pred CceeEEEEecH-HHHHHHHHHHHHHhcc--HHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchhhhhhhhcccccchhhh Confidence 444 3666663 7899999998876532 2566777777777776654 3432211000000 Q ss_pred ----------eccccccccCccccceeeecccccccccceeEecCCCCCcceeeeeccCcc-------CCCCCcchhHhH Q lcl|NC_019769. 62 ----------VVTQKSRRRGEISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTA-------NMTAHPFIRPAF 124 (149) Q Consensus 62 ----------~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~-------~~~a~PFl~pA~ 124 (149) .........|.+...|... .+. . .+...++..|+.++.||+. ++||||||.=.- T Consensus 78 ~~~~~~~~~~~~~~~L~~tG~L~~Si~~~-----~~~-~---~v~vGtn~~YAaiHqfGg~~~~~~~v~iPaRpfLG~s~ 148 (175) T protein:vir:10 78 ELTAAASRRKAGLMILQDSGQMAASVSTD-----HDD-N---SAVIGSNKEYAAIHQFGGQAGRGLKVTIPARPWLPVTA 148 (175) T ss_pred hhhhhhhhhccCCCcceechhhhhhhhee-----ecC-C---EEEEecChhhhhhhhcccccCCCCccccCCccccCCCc Confidence 0000011122222222211 111 1 1122456789999999987 899999998543 Q ss_pred H-HHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019769. 125 D-VRQEQATEVAIRRMNQAIDEALSK 149 (149) Q Consensus 125 ~-~~k~~~~~~~~~~~~~~l~k~~~k 149 (149) + +...+.++.|.+.+.+.|.+++++ T Consensus 149 ~d~~~~e~~~~Il~~~~~~l~~~~~~ 174 (175) T protein:vir:10 149 DGELQPEAVEPVLNTILRHLMDAANR 174 (175) T ss_pred ccccchHHHHHHHHHHHHHHHHHhcc Confidence 2 223356677777777777777777 No 127 >protein:vir:99196 Length: 155 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950453;genbank:gi:119953654;genbank:GeneID:4643056 Probab=97.87 E-value=1.5e-07 Score=57.99 Aligned_cols=129 Identities=13% Similarity=0.140 Sum_probs=69.5 Q ss_pred Ccc-eeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhC-CcCCCcccccceec------------ccc Q lcl|NC_019769. 
1 MIE-TSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARA-PVRTGKLKKNVVVV------------TQK 66 (149) Q Consensus 1 Mm~-~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~a-P~~~g~l~~~i~~~------------~~~ 66 (149) |.. ++++++. ++|.+.|.+|.....+ .+..++.-++.++...+.+- |. |.-+...... ... T Consensus 1 Ms~~i~i~~d~-~~~~~~L~~l~~~~~d--~~~l~~~ig~~l~~~~~~rF~pd--G~~W~pls~~t~~~r~~~g~~~~~i 75 (155) T protein:vir:99 1 MTTRIDVELDD-QEVRQRLALLMRSVTD--TLPVMRGIAAELLAETEFAFMDE--GPGWPQLSPVTVAAREAKGRGPHPI 75 (155) T ss_pred CceEEEEEech-HHHHHHHHHHHHHhhh--HHHHHHHHHHHHHHHHHHHhhcc--CCCCCCCChHHHHHHhccCCCCCCc Confidence 444 3566664 6799999999876543 46778888888888887765 32 2211111111 111 Q ss_pred ccccCccccceeeecccccccccceeEecCCCCCcceeeeeccCcc-------CCCCCcchhHhHH-----HHHHHHHHH Q lcl|NC_019769. 67 SRRRGEISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTA-------NMTAHPFIRPAFD-----VRQEQATEV 134 (149) Q Consensus 67 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~-------~~~a~PFl~pA~~-----~~k~~~~~~ 134 (149) ....|.+...|.+. .+. .. +...++..|+..++||+. .+||||||.=.-+ +.++.|.+. T Consensus 76 L~~tg~L~~Si~~~-----~~~-~~---v~vGtn~~YA~iHqfGg~~~~~~~v~iPaRpfLG~s~~~~l~~e~~~~I~~~ 146 (155) T protein:vir:99 76 LQVTNALARSVTTW-----ADR-NE---AGIGSNLVYAAIHQFGGDAGRGHQVEIPARRYLPFDENGQLAAGARQSILEI 146 (155) T ss_pred chhchhhhhhhhce-----ecC-CE---EEEecCccchhhhhcccccCCCCccccCCccccCCCCccccchHHHHHHHHH Confidence 22233343333221 111 11 122456889999999976 7999999963321 222333333 Q ss_pred HHHHHHHHH Q lcl|NC_019769. 
135 AIRRMNQAI 143 (149) Q Consensus 135 ~~~~~~~~l 143 (149) +.+.|.+.= T Consensus 147 i~~~l~~~~ 155 (155) T protein:vir:99 147 VLTALSRNR 155 (155) T ss_pred HHHHHhccC Confidence 333333222 No 128 >protein:vir:106728 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944315;genbank:gi:38638614;genbank:GeneID:2657357 Probab=97.86 E-value=1.2e-08 Score=63.92 Aligned_cols=103 Identities=16% Similarity=0.146 Sum_probs=48.7 Q ss_pred eeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeeeccc Q lcl|NC_019769. 4 TSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVN 83 (149) Q Consensus 4 ~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~~~~ 83 (149) |++.-+||+.+.+.|+... ++--.|.+.+.. +.. ......+... T Consensus 1 m~v~~k~L~~~~~~l~~~~----------------------v~VGi~~~a~y~-d~~---------~~~~~~~~~~---- 44 (155) T protein:vir:10 1 MSVTRRGLTLPKDRYRSMS----------------------VKAGVLAGATYP-DES---------GKKLADGTIL---- 44 (155) T ss_pred CcchHHHHHHHHHHHhCCe----------------------eEEeecCCCCCc-ccc---------chhhhhhhhc---- Confidence 4555555544443332100 000011110000 000 0000000000 Q ss_pred ccccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHH--HHHhcC Q lcl|NC_019769. 84 PRTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAI--DEALSK 149 (149) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l--~k~~~k 149 (149) . .....+.+.+.++.+.||||.++||||||||++++++++..+.+...+...+ ++++.. 
T Consensus 45 ----~---~~~~~g~~va~ia~~~E~G~~~IP~RPFlr~t~~~~~~~~~~~l~~~~~~~~~~~~~L~~ 105 (155) T protein:vir:10 45 ----T---KDPRAGLPVAMIAMALNYGTSKLPARPFMEKTIADRSAEWIKGLTVMMTMGYDAEVAMGQ 105 (155) T ss_pred ----c---cccccCCcHHHHHHHHhcCCCCCCCcchhHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHH Confidence 0 0001223456788899999999999999999999999998877766554322 111111 No 129 >protein:vir:78607 Length: 155 # NCBI annotation: BcepNY3gp06 # Family: family:all:503 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294843;genbank:gi:149882906;genbank:GeneID:5291078 Probab=97.86 E-value=1.2e-08 Score=63.95 Aligned_cols=103 Identities=17% Similarity=0.151 Sum_probs=48.3 Q ss_pred eeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeeeccc Q lcl|NC_019769. 4 TSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVN 83 (149) Q Consensus 4 ~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~~~~ 83 (149) |++.-+||+.+.+.|+... ++--.|.+.+.. +.. ......+.... T Consensus 1 m~v~~k~L~~~~~~l~~~~----------------------v~VGi~~~a~y~-d~~---------~~~~~~~~~~~--- 45 (155) T protein:vir:78 1 MSVTRRGLTLPKDRYRSMS----------------------VKAGVLAGATYP-DES---------GKKLADGTILT--- 45 (155) T ss_pred CcchHHHHHHHHHHHhCCe----------------------eEEeecCCCCCC-ccc---------chhhhhhhhcc--- Confidence 5555555544443332100 000011110000 000 00000000000 Q ss_pred ccccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHHH--HHhcC Q lcl|NC_019769. 84 PRTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAID--EALSK 149 (149) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~--k~~~k 149 (149) .....+.+.+.++.+.||||.++||||||||+++.++++..+.+...+...++ +++.. 
T Consensus 46 --------~~~~~g~~va~ia~~~E~G~~~IP~RPFlr~t~~~~~~~~~~~l~~~~~~~~~~~~~L~~ 105 (155) T protein:vir:78 46 --------KDPRAGLPVAMIAMALNYGTSKLPARPFMEKTITDRSAEWIKGLTVMMTMGYDAEVAMGQ 105 (155) T ss_pred --------cccccCCcHHHHHHhhhcCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHcCCCHHHHHHH Confidence 00011234567888999999999999999999999999888776655543221 11111 No 130 >protein:vir:96774 Length: 152 # NCBI annotation: hypothetical phage protein # Family: family:all:448 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224253;genbank:gi:62362388;genbank:GeneID:3345713 Probab=97.81 E-value=6.3e-08 Score=60.04 Aligned_cols=126 Identities=17% Similarity=0.110 Sum_probs=70.9 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--------------CCCcccccceecccc Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPV--------------RTGKLKKNVVVVTQK 66 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~--------------~~g~l~~~i~~~~~~ 66 (149) =|++..+|. .+..+.+. -+...+++.+..+...+...+|+ ++|.++.|...+... T Consensus 10 ~msFaa~i~---~~~~~~e~--------~~~~~~R~~~~~i~~~vv~~sPVg~~~~~~~~a~~~ydtGrfRanw~vS~~~ 78 (152) T protein:vir:96 10 PMSWSKSLK---NIIVKNEN--------LTEKQLRAGLFDAANTVILGSPVGAPELWQQPAPNYYRAGSYRSNHRVSISK 78 (152) T ss_pred cccccccHH---HHHHHHHH--------HHHHHHHHHHHHHHHHHHHhhccccccccccccccccchhhhhhhheeeecC Confidence 334444333 33333332 23455666677788888888999 999999988776432 Q ss_pred ccccCccccceeeecc---cccccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 67 SRRRGEISSGVHIRGV---NPRTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQA 142 (149) Q Consensus 67 ~~~~~~~~~~i~~~~~---~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~ 142 (149) .. .+...+....... 
.................+.+|+..+|||+|.|+|..|.+.++..-.+-+.+ +++.+ T Consensus 79 p~-~~~~~~~~~~~t~~~~~~~i~~~~~g~~iyi~NnlPYA~~LEyG~S~QAP~G~vr~t~~~~~~~v~e----a~~~~ 152 (152) T protein:vir:96 79 IT-SFEKGISSQSSIMMDLQSDIAKFKIGETLFMTNPLPYATSIEYGHSSQAPNGVYRPAVRRLVKFLNT----ELKAK 152 (152) T ss_pred CC-cccccCCCCCchHHHHHHHHhhccccceEEEeeCchhhhHhhccccCCCCchHHHHHHHHHHHHHHH----HhccC Confidence 21 1111110000000 000011111122334557899999999999999999999999775544444 44433 No 131 >protein:vir:107757 Length: 189 # NCBI annotation: gp20 # Family: family:all:503 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024868;genbank:gi:48697510;genbank:GeneID:2948378 Probab=97.73 E-value=1.4e-08 Score=63.57 Aligned_cols=92 Identities=18% Similarity=0.243 Sum_probs=48.6 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeeecccccccccceeEecCCCCC Q lcl|NC_019769. 21 LSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVNPRTGNSDNTMKANNPRN 100 (149) Q Consensus 21 l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~ 100 (149) |+..+ +..- .+...+.+.++.+. . ..+.+ ++.. . .. .-.+... T Consensus 1 M~~~i-----~~~~-~~~~~L~~~lk~l~---~----k~V~V-------------Gi~~-----~-~~-----y~dG~~v 43 (189) T protein:vir:10 1 MGRVI-----RKQG-PARVKLNAFIKGMN---D----YSVRI-------------GWFS-----T-AK-----YPDGTPT 43 (189) T ss_pred Cccee-----ccCc-HHHHHHHHHHHHhh---C----CeEEE-------------EecC-----C-CC-----CCCcccH Confidence 32211 1111 11112223333221 0 00111 0100 0 00 0012345 Q ss_pred cceeeeeccCc--cCCCCCcchhHhHHHHHHHHHHHHHHHHHHHH------HHHhcC Q lcl|NC_019769. 101 AFYWRFVEMGT--ANMTAHPFIRPAFDVRQEQATEVAIRRMNQAI------DEALSK 149 (149) Q Consensus 101 ~~y~~~~E~GT--~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l------~k~~~k 149 (149) +.++.++|||+ .++||||||||+++.++++..+.+...+...| ++++.. 
T Consensus 44 A~Ia~~~E~G~p~~~IP~RPFlr~t~~~~~~~~~~~l~~~~~~vl~G~~~~~~~L~~ 100 (189) T protein:vir:10 44 AYVASIHEFGAPSRGIPARSFIRPTIAAQQAAWSQQMRFYAKQIVVGQMNVEQALEG 100 (189) T ss_pred HHHHHHHHhcCcCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHH Confidence 78899999999 56899999999999999999888888877633 444444 No 132 >protein:vir:96105 Length: 193 # NCBI annotation: hypothetical protein ORF028 # Family: family:all:503 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294445;genbank:gi:149408342;genbank:GeneID:5237224 Probab=97.71 E-value=1.2e-08 Score=64.03 Aligned_cols=126 Identities=13% Similarity=0.042 Sum_probs=61.1 Q ss_pred cceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHH-----------HHHHHHHHHhCCcCCCcccccceecccccccc Q lcl|NC_019769. 2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGA-----------EVLKEEVIARAPVRTGKLKKNVVVVTQKSRRR 70 (149) Q Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a-----------~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~ 70 (149) |+++....++++|++.|++|.... +. -...+.+ ..+..-|.-+ ..| ..|..... . T Consensus 1 m~~~~~~~~~~~~~~~l~~l~~~~---v~-vGi~~~~~~~~~~~~~~G~~va~iAai~---EfG---~~I~~~~~----~ 66 (193) T protein:vir:96 1 MSLRRDSELIAAHLQMLRAMRGRS---VS-AGWYSTARYPDKAGGSVGIQVARIARLN---EYG---GTIDHPGG----T 66 (193) T ss_pred CeeccchHHHHHHHHHHHHhcCCe---EE-EEEcCCCCCCCcccccccchHHHHHhHH---HcC---CccccCcc----c Confidence 999999999999999999997532 21 1111111 1111101000 001 00000000 0 Q ss_pred CccccceeeecccccccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHHH------ Q lcl|NC_019769. 71 GEISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAID------ 144 (149) Q Consensus 71 ~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~------ 144 (149) ..+...+. .....+... .. ..-..-+.|.-..|.++|||||||++++.+++++.+.+.+.+...+. 
T Consensus 67 ~~~~~~~~---~g~~~~~~~--~k---~~~~~~~~~~~~~~v~IPaRPFlr~t~~~~~~~~~~~~~~~~~~~~~g~~~~~ 138 (193) T protein:vir:96 67 RYIRDAIV---RGRFVGVRF--VR---NDFPGETEVTKPHRITIPARPFMRYAWNLFSADRAAIQNRIAMRLARGQITPD 138 (193) T ss_pred eeeeeccc---cccccccce--ec---cCcceeeEeecceeccCCCcchhhhhHHHHHHHHHHHHHHHHHHHHhCCCCHH Confidence 00000000 000000000 00 00112345566678999999999999999999888887777665443 Q ss_pred HHhcC Q lcl|NC_019769. 145 EALSK 149 (149) Q Consensus 145 k~~~k 149 (149) +++.+ T Consensus 139 ~~l~~ 143 (193) T protein:vir:96 139 QALAQ 143 (193) T ss_pred HHHHH Confidence 22222 No 133 >protein:vir:102338 Length: 116 # NCBI annotation: hypothetical protein # Family: family:all:26573 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529563;genbank:gi:90592648;genbank:GeneID:3974470 Probab=97.70 E-value=3.2e-07 Score=56.23 Aligned_cols=94 Identities=16% Similarity=0.129 Sum_probs=65.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCcC---CCcccccceeccccccccCccccceeeecccccccccceeEecCCCCCcc Q lcl|NC_019769. 26 NNKVLRDATRAGAEVLKEEVIARAPVR---TGKLKKNVVVVTQKSRRRGEISSGVHIRGVNPRTGNSDNTMKANNPRNAF 102 (149) Q Consensus 26 ~~k~~~~al~~~a~~v~~~ak~~aP~~---~g~l~~~i~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 102 (149) .++.++.++++.|..+...++.++|+. +|.|++|....... . .+. . .-.+.. T Consensus 1 l~~~~~~~~~~~a~~l~~~vk~rTPv~~~d~G~LR~sW~~g~v~-----------------k-~~~----~---v~N~~e 55 (116) T protein:vir:10 1 MSKNLRRAKNNIGNKLLRKVKPKTPVAKIDGGTARKSWKYKELN-----------------L-FDG----V---VSNNVE 55 (116) T ss_pred CchHHHHHHHHHHHHHHHHHHhhCCCCcCCCcccccCceeeeee-----------------c-cCc----e---eecCCc Confidence 345667888899999999999999985 58898887543110 0 000 1 124689 Q ss_pred eeeeeccCccC-------------------CCCCcchhHhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 
103 YWRFVEMGTAN-------------------MTAHPFIRPAFDVRQEQATEVAIRRMNQAID 144 (149) Q Consensus 103 y~~~~E~GT~~-------------------~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~ 144 (149) |++|+|||..- .+.+.||+.+..+.+.++-+.+.+.|.+.++ T Consensus 56 YA~~VE~GHRq~~g~g~~~~~~gkrlk~~~V~G~fml~~s~~e~~~~~~~~~~~~~~~~l~ 116 (116) T protein:vir:10 56 YIHHLEYGHRTRQGTGTSENYRPKPNGISFVPGVFMLARSVDEMSSIIDDELNQIIIDFWN 116 (116) T ss_pred ccccccCCceeeCCcceecccccccccCCccCceehHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 99999999653 3556688888877776666666666666555 No 134 >protein:vir:95260 Length: 160 # NCBI annotation: Phage conserved protein # Family: family:all:31735 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944893;genbank:gi:38707833;genbank:GeneID:2744046 Probab=97.70 E-value=1.1e-07 Score=58.80 Aligned_cols=91 Identities=10% Similarity=0.134 Sum_probs=50.8 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) ||+ .+...|++.|...|++|+... + +-.+ +.+.|.. T Consensus 1 ~~~-~~~~~G~~~L~~~~k~l~~~~---V-~VGi---------------~~d~g~~------------------------ 36 (160) T protein:vir:95 1 MVK-RVIHPARAKLVGAMKNLQTAN---A-QVGY---------------FQEQGQH------------------------ 36 (160) T ss_pred Cce-eechHhHHHHHHHHHHHhCCe---e-EEee---------------ccccccC------------------------ Confidence 544 556688888888777763211 0 0000 0010000 Q ss_pred cccccccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHH----HHHHHHHHHHHHHHHHHHH-------HhcC Q lcl|NC_019769. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDV----RQEQATEVAIRRMNQAIDE-------ALSK 149 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~----~k~~~~~~~~~~~~~~l~k-------~~~k 149 (149) ..+.+-+.++.+.||||.+.|+|||||++|+. .+...+..+...+...+.. .|+. 
T Consensus 37 --------------~dG~sv~~vA~~~EfG~~~iPaRPf~R~tfe~~~~~~~~~~~~~~~~~i~~~~~~g~~~~~~~LG~ 102 (160) T protein:vir:95 37 --------------SSGFSYPALMYLQEVIGVPSASGKVYRRLFEITMMLNKQTLLEQTKKNLYKQLSSLNTDPSNTLEA 102 (160) T ss_pred --------------CCCccHHHHHhhhhcCcccCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhHHHHHHH Confidence 01112345678999999999999999999974 4555555555544444442 2333 No 135 >protein:vir:94069 Length: 168 # NCBI annotation: putative RNA polymerase # Family: family:all:503 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453622;genbank:gi:84662658;genbank:GeneID:5142579 Probab=97.42 E-value=5.7e-08 Score=60.31 Aligned_cols=104 Identities=13% Similarity=0.139 Sum_probs=44.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCC--cCCCcccccceeccccccccCccccceeeecccccccccceeEecCCCCCcce Q lcl|NC_019769. 26 NNKVLRDATRAGAEVLKEEVIARAP--VRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVNPRTGNSDNTMKANNPRNAFY 103 (149) Q Consensus 26 ~~k~~~~al~~~a~~v~~~ak~~aP--~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~y 103 (149) ..-+.++.+ +.....++.+.. +.-|.+...- ...|...... .+.... ..-.+-+.+.+ T Consensus 1 ~~~~~~~g~----~~~~~~~~~l~~~~v~vG~l~~a~-------yp~G~~~~~~--------~~~~~~-~~~~g~~va~I 60 (168) T protein:vir:94 1 MTTIARKGV----KMPPHLEAQFQSGEVKAGVLSGST-------YPQMTYTDQR--------TGKQIE-DARGGMPVAVI 60 (168) T ss_pred Cccccchhh----hhhHHHHHhhhccceeeeccccCc-------ccccccchhh--------cccccc-cccccccHHHH Confidence 111112222 222222222111 0111111000 0000000000 000000 00011234788 Q ss_pred eeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHH--HHHhcC Q lcl|NC_019769. 104 WRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAI--DEALSK 149 (149) Q Consensus 104 ~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l--~k~~~k 149 (149) +.++||||.++||||||||++++++++..+.+...++..+ +.++.. 
T Consensus 61 a~~~E~G~~~IP~RPFlr~t~~~~~~~~~~~~~~~~~~~~~~~~~L~~ 108 (168) T protein:vir:94 61 AQALEYGHGQNHPRPFMQQTYAAQYRAWSRDLTLTLKAGAAADTALRT 108 (168) T ss_pred HHHHhcCCCCCCCchhhHHHHHHHHHHHHHHHHHHHhcCCCHHHHHHH Confidence 9999999999999999999999999888776665554221 111111 No 136 >protein:vir:2688 Length: 123 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075507;genbank:gi:12719436;genbank:GeneID:920156 Probab=97.41 E-value=2.5e-06 Score=51.25 Aligned_cols=111 Identities=12% Similarity=0.142 Sum_probs=69.0 Q ss_pred HHHHHH-HhHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--cCCCcccccceeccccccccCccccceeeecccccccccc Q lcl|NC_019769. 14 IAKDLE-ALSRAENNKVLRDATRAGAEVLKEEVIARAP--VRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVNPRTGNSD 90 (149) Q Consensus 14 l~~~l~-~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP--~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~ 90 (149) |+++|+ .|++....+++.+||.++++.+.+..+.+.- .|||...+.+..+.. ....+... T Consensus 1 ilk~lE~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGatidev~~s~p-----------------~~~~g~~~ 63 (123) T protein:vir:26 1 MLKKLESVYGKQSMQAKSDRALNEASEFFIKALKKEFESFKDTGASIEEMTKSKP-----------------YTKVGSQE 63 (123) T ss_pred ChhhHHHhcCHHHHHHhhhHHHHHHHHHHHHHHHHhhHHhhhccceeeeEEecCe-----------------eeccCCcc Confidence 666665 3566667789999999999999999998764 477776666554321 11122222 Q ss_pred eeEecCCC---CCcceeeeeccCccC----CCCCc--chhHhHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 91 NTMKANNP---RNAFYWRFVEMGTAN----MTAHP--FIRPAFDVRQEQATEVAIRRMNQ 141 (149) Q Consensus 91 ~~~~~~~~---~~~~y~~~~E~GT~~----~~a~P--Fl~pA~~~~k~~~~~~~~~~~~~ 141 (149) ..+..+|. 
+....-|..|||..+ ..|+- -+..|+++.+..+.+.++++|++ T Consensus 64 rtV~i~W~gp~~R~~iVHLNE~GYtr~Gk~i~PRG~G~i~~a~~~se~~y~~~vk~eL~k 123 (123) T protein:vir:26 64 RAVLIEWVGPMNRKNIIHLNEHGYTRDGKKYTPRGFGVIAKTLAANERKYREIIKKELAR 123 (123) T ss_pred ceEEEEeecCCCceeeEeeeccceecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 33333332 234456999999533 22333 46667777666666666655555 No 137 >protein:vir:7449 Length: 123 # NCBI annotation: gp26 # Family: family:all:2713 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818564;genbank:gi:29567001;genbank:GeneID:1260238 Probab=97.39 E-value=4.6e-06 Score=49.86 Aligned_cols=121 Identities=17% Similarity=0.264 Sum_probs=76.5 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--CCCcccccceeccccccccCcccccee Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPV--RTGKLKKNVVVVTQKSRRRGEISSGVH 78 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~--~~g~l~~~i~~~~~~~~~~~~~~~~i~ 78 (149) |++++|+|+ .++|++.++.+..+. ......-...+|..+..+||.+||= .||+-+..|.... T Consensus 1 ~~~~~f~~d-~~~l~~~i~~~~~k~-~~~~~~~~d~~a~~le~~aK~nApW~DRTg~ARqgl~~~~-------------- 64 (123) T protein:vir:74 1 MAKVTFEYD-AQELRTNIRNLDRRM-ESAVDALMDYEAAYATGQLKMRAPWTDRTGAARSGLLAVA-------------- 64 (123) T ss_pred CceeEEEec-HHHHHHHHHhhHHHH-HHHHHHHHHHHHHHHHHHHhcCCCCcccchhhhhhhcccc-------------- Confidence 999999998 789999999887664 2333333455778899999999993 3454433331110 Q ss_pred eecccccccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019769. 79 IRGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAIDEAL 147 (149) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~k~~ 147 (149) .. .|... ....-..+..|.-|+|+++...++ -|+|+.+.--+++++-+..-+. 
+|+++- T Consensus 65 ----~~-~g~~~--~~Iylsh~veYG~~LEla~~~kya--Ii~Ptv~~~~~~im~g~~~ll~-~l~~~~ 123 (123) T protein:vir:74 65 ----NK-LGPGS--HELIMSYSVHYGIWLEIANSGQYA--VIGPFLPVMGRKLMHDLEHLID-RLERAQ 123 (123) T ss_pred ----cc-CCCce--EEEEEecCeeecceeeecCCCCce--eecchHHHHhHHHHHHHHHHHH-HhhccC Confidence 00 01111 111122346899999998876554 6889888877777776655443 344444 No 138 >protein:vir:99546 Length: 200 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039796;genbank:gi:126011046;genbank:GeneID:4818241 Probab=97.25 E-value=1.7e-06 Score=52.16 Aligned_cols=120 Identities=16% Similarity=0.157 Sum_probs=58.5 Q ss_pred CcceeeehHh---HHHHHHHHHHhHHHHHHHHHHHHHHHH-----------HHHHHHHHHHh-------CCcCCCccccc Q lcl|NC_019769. 1 MIETSLDFSG---LNDIAKDLEALSRAENNKVLRDATRAG-----------AEVLKEEVIAR-------APVRTGKLKKN 59 (149) Q Consensus 1 Mm~~~~~i~G---l~~l~~~l~~l~~~~~~k~~~~al~~~-----------a~~v~~~ak~~-------aP~~~g~l~~~ 59 (149) =|.++++|.| +++++++|++|.... + .-.+.+. +..++.-|.-+ .|-.+..++.. T Consensus 4 ~~~~~~k~~~~~~~~~~~~~l~~l~~~~---v-~vGi~~~~~y~~~~~~~dG~~va~IA~~~EfG~~i~~p~~~~~~~~~ 79 (200) T protein:vir:99 4 GFSKSNSVAAPLKHFQMLKQFDALKGKT---V-QAGWFETDRYPAKEGETIGPLVAKIARQLEFGGVINHPGGTKYIKDA 79 (200) T ss_pred CcceeeeeecchHHHHHHHHHHHhhCCe---E-EEEEcCCCCcCCcccccccchHHHHHhHHHcCCeeccCCCccccccc Confidence 1335555555 777777777775431 1 1111111 11222211111 11111111100 Q ss_pred ceeccccccccCccccceeeecccccccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHH Q lcl|NC_019769. 60 VVVVTQKSRRRGEISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRM 139 (149) Q Consensus 60 i~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~ 139 (149) + .. + ...+. 
....++..-|+.|.--.|.++||||||||+++.++++..+.+...+ T Consensus 80 ~-----------------~~-g--~~~g~-----rfv~k~~~~~~~~~~~~~v~IP~RPFlr~t~~~~~~~~~~~~~~~~ 134 (200) T protein:vir:99 80 I-----------------VD-G--RYVGT-----RFVHKSFQGEHEVTKAHQIVIPARPFMRLAWATFNKDKVKIQAQIA 134 (200) T ss_pred c-----------------cc-c--ccccc-----ccccccccceeeeeccccccCCCcchhhHHHHHHHHHHHHHHHHHH Confidence 0 00 0 00011 1112234456666666788999999999999999999888887777 Q ss_pred HHHHH------HHhcC Q lcl|NC_019769. 140 NQAID------EALSK 149 (149) Q Consensus 140 ~~~l~------k~~~k 149 (149) .+.++ +++.+ T Consensus 135 ~~~l~g~~~~~~~L~~ 150 (200) T protein:vir:99 135 RQLLDGTINPEQALAQ 150 (200) T ss_pred HHHHhCCCCHHHHHHH Confidence 65442 33333 No 139 >protein:vir:97088 Length: 157 # NCBI annotation: hypothetical protein # Family: family:all:2714 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453568;genbank:gi:84662603;genbank:GeneID:5142503 Probab=97.18 E-value=6.8e-06 Score=48.91 Aligned_cols=130 Identities=12% Similarity=0.025 Sum_probs=86.5 Q ss_pred eehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeeeccccc Q lcl|NC_019769. 6 LDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVNPR 85 (149) Q Consensus 6 ~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~~~~~~ 85 (149) |+|+=...=+..|.+.-+... +..+++++.++..-...++..|-- . -...+|.+..+|++.....+ T Consensus 1 m~~~~~~~d~s~l~~~l~~l~-~~~~~v~R~A~~~ga~vv~dear~--------~-----aP~~tG~LkksI~~~~~~~~ 66 (157) T protein:vir:97 1 MKFSIRSVDITGILAGLETVV-EHSSDVVRTMTYESAVAVRESAKA--------F-----VNDETGKLRNNLYVAYSPEE 66 (157) T ss_pred CeeEeecccHHHHHHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHH--------h-----CCCCcchhhhheeeeecccc Confidence 888732333444555555564 466778887777777776655421 1 12357999999999887766 Q ss_pred ccccceeE-ecCCCCCcceeeeeccCccCCCC------C-----------c-c--hhHhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 
86 TGNSDNTM-KANNPRNAFYWRFVEMGTANMTA------H-----------P-F--IRPAFDVRQEQATEVAIRRMNQAID 144 (149) Q Consensus 86 ~~~~~~~~-~~~~~~~~~y~~~~E~GT~~~~a------~-----------P-F--l~pA~~~~k~~~~~~~~~~~~~~l~ 144 (149) .+.....+ +++...+++||||+|||+..-.. . + + -+|=+.-.-+...+.+.+.+..+|. T Consensus 67 s~~g~~~~~Vg~~~~~a~~g~~vEfG~~~~~~~~~~~~~~~~~~~~~~~t~~~~Pa~PFlRPA~d~~k~~a~~~~~~~l~ 146 (157) T protein:vir:97 67 SVEGIQTYAVSWRKKAAPHGHLLEFGHWQTHAAYRDKDGQWYSSKVKLVNPKWIPAKPFLRPGYDSVAMQIPDIARAAGA 146 (157) T ss_pred CCCceEEEEEeecCCccceeeeeecCcccccccccCCcccccccccccCCCCcCCCCcccchHHHHhHHHHHHHHHHHHH Confidence 65555544 44557789999999999754211 0 1 1 3566676777778888888888888 Q ss_pred HHhcC Q lcl|NC_019769. 145 EALSK 149 (149) Q Consensus 145 k~~~k 149 (149) +.+.. T Consensus 147 k~I~e 151 (157) T protein:vir:97 147 KKYAE 151 (157) T ss_pred HHHHH Confidence 88877 No 140 >protein:vir:105773 Length: 131 # NCBI annotation: gp14 # Family: family:all:10996 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224152;genbank:gi:62362227;genbank:GeneID:3342526 Probab=97.03 E-value=6.2e-06 Score=49.14 Aligned_cols=114 Identities=12% Similarity=0.105 Sum_probs=75.0 Q ss_pred eehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeeeccccc Q lcl|NC_019769. 6 LDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVNPR 85 (149) Q Consensus 6 ~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~~~~~~ 85 (149) |+|.|+.+....|+++-++++.+-+-.||..+.-+....|--..|+||..|-+|=.... ... T Consensus 1 ikV~Gi~~~~~nl~~~i~~I~~~K~~Ral~~al~~~~~~AA~~TPIDTSTLiNSQfrei-------------~~n----- 62 (131) T protein:vir:10 1 MPVKGIKRIQMNTRRVLSDIAGIRTEKVLYLVMNAGANHAAVITPVKSSTLINSQYKKL-------------EPI----- 62 (131) T ss_pred CCcchHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHhhhhhccccchhhhccccceee-------------ecc----- Confidence 99999999999999999988754445688888888888999999999988876532110 000 Q ss_pred ccccceeEecCCCCCcceeeeecc--CccCCCCCc--------------chhHhHHHH-HHHHHHHHHHHHHH Q lcl|NC_019769. 
86 TGNSDNTMKANNPRNAFYWRFVEM--GTANMTAHP--------------FIRPAFDVR-QEQATEVAIRRMNQ 141 (149) Q Consensus 86 ~~~~~~~~~~~~~~~~~y~~~~E~--GT~~~~a~P--------------Fl~pA~~~~-k~~~~~~~~~~~~~ 141 (149) |. .+.+.....+-|+-++.- |+-+..|+| ||..+|+.+ .+.+-..++++++- T Consensus 63 -gt---ritGRVGYSAnYA~yVHda~Gklkgqprp~gkgn~w~p~ae~eFL~kgfe~~~~d~i~avik~e~k~ 131 (131) T protein:vir:10 63 -PS---GMIGRVGYTANYAAAVNAAKGKLKGKPRPDGSGNYWDPNGEPDFLRKGFERDGLNEIKAIIRQGYKV 131 (131) T ss_pred -Cc---eeEEeeccceeeeeeeecCccccCCCcCCCCCcceecCCCChhhhhhhhhccchHHHHHHHhhhcCC Confidence 00 011112234566666644 554544444 999999776 44455555555554 No 141 >protein:vir:6071 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878212;genbank:gi:33438911;genbank:GeneID:1457746 Probab=97.02 E-value=6.5e-06 Score=49.04 Aligned_cols=126 Identities=13% Similarity=0.121 Sum_probs=61.0 Q ss_pred hHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CCcCCCcccccceecc---------ccccccCcc Q lcl|NC_019769. 8 FSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIAR-----APVRTGKLKKNVVVVT---------QKSRRRGEI 73 (149) Q Consensus 8 i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~-----aP~~~g~l~~~i~~~~---------~~~~~~~~~ 73 (149) +..+++|...|..+-......-.+..++.-|+.++...+.+ .|. |.-........ ......+.+ T Consensus 1 ~~~~~~l~~~L~~~l~~L~~~~~~~l~r~Ig~~l~~~~~~Rf~~q~~Pd--G~~W~p~~~~~~~~k~~~~~~~l~~~~~l 78 (150) T protein:vir:60 1 MNEFKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAPD--GTPYAPRQQQSARKKTGRVKRKMFAKLIT 78 (150) T ss_pred CchHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCC--CCCCcccChHHHHHhhcCCCccchhhhhh Confidence 44555555555554333211112334555566666665544 452 21111110000 000001111 Q ss_pred ccceeeecccccccccceeEecCCCCCcceeeeeccCc----------cCCCCCcchhHhHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 74 SSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVEMGT----------ANMTAHPFIRPAFDVRQEQATEVAIRRMNQ 141 (149) Q Consensus 74 ~~~i~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT----------~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~ 141 (149) ... 
.....+.....+.....++..|+..+-||- ..+|+||||.=. ++.++++++.+.+.|.+ T Consensus 79 ~~s-----l~~~~~~~~a~vg~~~Gt~~~yAaiHQfG~~~~~~~~~~~~~iPaRp~LG~s-~~d~~~i~~~i~~~l~r 150 (150) T protein:vir:60 79 SRF-----LHIRASPEQASMEFYGGKSPKIASVHQFGLSEENRKDGKKIDYPARPLLGFT-GEDVQMIEEIILAHLDR 150 (150) T ss_pred cce-----eeeeeeCcEEEEEeeCCCchhhhhhhhccccccccCCCCceecCCcccCCCC-HHHHHHHHHHHHHHHhC Confidence 111 111222222223333467889999999993 368999999866 45566677777666666 No 142 >protein:vir:80037 Length: 199 # NCBI annotation: gp11 # Family: family:all:503 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468715;genbank:gi:157325295;genbank:GeneID:5601728 Probab=97.01 E-value=6.4e-07 Score=54.55 Aligned_cols=135 Identities=12% Similarity=0.110 Sum_probs=59.6 Q ss_pred cceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeeec Q lcl|NC_019769. 2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRG 81 (149) Q Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~~ 81 (149) |+++-.-.-++++++.|+.|.... +.-..+......+.--|.-+ ..| ..|.+... ...-.+........ T Consensus 1 m~vt~~~~~~~~~~~~l~~L~~k~---v~vGi~~~d~~~~~~Ia~~~---E~G---a~I~~~~~--~l~Ip~~~a~~~k~ 69 (199) T protein:vir:80 1 MKVTTDKSTMNKAIRELDQLDRYS---LQIGLFGEDDSFIQMIAGVH---EFG---LTIRPKGK--YLTIPTPEAGDRRA 69 (199) T ss_pred CcccccHHHHHHHHHHHHHhcCCE---EEEEEecCCCcchhheeehh---hcC---CeeecCCc--eeeecchhhhcccc Confidence 777655566888888888886431 11111111111111111000 001 11111000 00000000000000 Q ss_pred c---cccccccceeEecCCCCCcceeeeeccCcc--CCCCCcchhHhHHHHHHHHHHHHHHHHHHHH------HHHhcC Q lcl|NC_019769. 82 V---NPRTGNSDNTMKANNPRNAFYWRFVEMGTA--NMTAHPFIRPAFDVRQEQATEVAIRRMNQAI------DEALSK 149 (149) Q Consensus 82 ~---~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~--~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l------~k~~~k 149 (149) . .............. .+.--...+|||+. +.||||||||+++.++++..+.+...+.+.| ++++.. 
T Consensus 70 ~~~~~~~~p~g~~~~~~~--~~~~~~~~~e~g~~~~~IP~RPFlr~t~~~~~~~~~~~~~~~~~~vl~g~~~a~~~L~~ 146 (199) T protein:vir:80 70 RDIPGLFKPKGKNILAVA--GPDGKLTVMFYLKTEVNIPERSFLRSTFDEKSNKWGELFEGWIDDVIHGKLSAEQVYNR 146 (199) T ss_pred cccCcccccCCcceeeee--ccccceeeeeeccccccCCCCchhHHHHHHHHHHHHHHHHHHHHHHHhCCCcHHHHHHH Confidence 0 00000111111111 12223566899984 7899999999999999999888888777643 233333 No 143 >protein:vir:5703 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839862;genbank:gi:30065717;genbank:GeneID:1260611 Probab=96.96 E-value=7.9e-06 Score=48.55 Aligned_cols=131 Identities=13% Similarity=0.144 Sum_probs=62.4 Q ss_pred hHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CCcCCCcccccceecc--ccc-cccCccc-ccee Q lcl|NC_019769. 8 FSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIAR-----APVRTGKLKKNVVVVT--QKS-RRRGEIS-SGVH 78 (149) Q Consensus 8 i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~-----aP~~~g~l~~~i~~~~--~~~-~~~~~~~-~~i~ 78 (149) +..+++|...|..+-......-.+..++.-|+.++...+.+ .|.. .-........ .+. +....+. .... T Consensus 1 m~~~~~l~~~L~~~l~~L~~~~~~~l~~~Ig~~l~~~~~~rf~~q~~PdG--~~W~p~k~~~~~~k~~~~~~~l~~~~~l 78 (150) T protein:vir:57 1 MNEFKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAPDG--TPYAPRQQQSARKKTGRVKRKMFAKLIT 78 (150) T ss_pred CchHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCC--CCCcccChHHHHHhccCCCcccchhhhh Confidence 45566666666655433221112344555566666665544 4521 1111100000 000 0000000 0000 Q ss_pred eecccccccccceeEecCCCCCcceeeeeccCcc----------CCCCCcchhHhHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 79 IRGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTA----------NMTAHPFIRPAFDVRQEQATEVAIRRMNQ 141 (149) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~----------~~~a~PFl~pA~~~~k~~~~~~~~~~~~~ 141 (149) ........+.....+.....++..|+..+-||-. .+||||||.=. 
+..+.++++.+.+.|.+ T Consensus 79 ~~sl~~~~~~~~a~vg~~~G~~~~yAaiHQfG~~~r~~~~~~~~~iPaRp~LG~s-~~d~~~i~~~i~~~l~r 150 (150) T protein:vir:57 79 SRFLHIRASPEQASMEFYGGKSPKIASVHQFGLSEETRKDGKKIDYPARPLLGFT-GEDVQMIEEIILAHLDR 150 (150) T ss_pred ccceeeeeeCcEEEEEeecCCchhhhhhhhccccccccCCCceeecCCcccCCCC-HHHHHHHHHHHHHHHhC Confidence 1111122222222232334678899999999933 58999999866 34556666666666666 No 144 >protein:vir:80970 Length: 112 # NCBI annotation: gp10 # Family: family:all:899 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468396;genbank:gi:157324970;genbank:GeneID:5601405 Probab=96.92 E-value=8.1e-06 Score=48.48 Aligned_cols=104 Identities=13% Similarity=0.167 Sum_probs=68.0 Q ss_pred cceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeeec Q lcl|NC_019769. 2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRG 81 (149) Q Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~~ 81 (149) |+++|+|+ +..+.+.|.+ ....|...-+..+..++..-+|.++|.|++|.... T Consensus 1 M~vkV~id-~~~~~~~l~~--------a~~~aq~~~~~ev~~~~~~yVP~~tG~L~~s~~~~------------------ 53 (112) T protein:vir:80 1 MPIKVRVD-LSKAKGSVKK--------AKERGQFALINQAAADIALYVPFLSGDLSNQYVIM------------------ 53 (112) T ss_pred CceeEEee-hHHHHHHHHH--------HHHHHHHHHHHHHHHHhhcCCCcccCccccceeec------------------ Confidence 88888887 3444444332 22345555667777788899999999998863210 Q ss_pred ccccccccceeEecCCCCCcceeeeeccCccC--------CCCCcchhHhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 82 VNPRTGNSDNTMKANNPRNAFYWRFVEMGTAN--------MTAHPFIRPAFDVRQEQATEVAIRRMNQAI 143 (149) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~--------~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l 143 (149) +.. .+ ..+++|++.+-||-.. 
..-..|+..+.....+++++.+.+.+.+.| T Consensus 54 -----~~g--~I----~y~tPYAr~qYY~~~~~~~~~~~p~ag~~W~erak~~~~~~~~~~~~k~~~~~l 112 (112) T protein:vir:80 54 -----NDK--EI----MWTSIYARRLYNGINFNFTLTHHPLAGPKWDQRAKVDKLESWIEVAQKAVEEGL 112 (112) T ss_pred -----cCc--eE----EecCchhhHhhhcccCCCCcCCCCCcchhhHHHHHhhhhHHHHHHHHHHHhhcC Confidence 000 01 2246788777775432 233458888888888888888888888888 No 145 >protein:vir:98557 Length: 149 # NCBI annotation: gp14 # Family: family:all:370 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958069;genbank:gi:41057366;genbank:GeneID:2744228 Probab=96.90 E-value=6.3e-06 Score=49.09 Aligned_cols=125 Identities=16% Similarity=0.152 Sum_probs=60.8 Q ss_pred hHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CCcCCCcccccceecc---cc------ccccCcc Q lcl|NC_019769. 8 FSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIAR-----APVRTGKLKKNVVVVT---QK------SRRRGEI 73 (149) Q Consensus 8 i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~-----aP~~~g~l~~~i~~~~---~~------~~~~~~~ 73 (149) ++.+++|...|..|-......-.+..++.-|+.++...+.+ .|. |.-........ +. ....+.+ T Consensus 1 m~d~~~l~~~L~~ll~~L~~~~~~~ll~~Ig~~l~~~t~~rf~~q~~Pd--G~~W~p~~~~~~~~k~~~~~~~l~~~g~l 78 (149) T protein:vir:98 1 MSELTALQERLTGLIASLSPAARRQMAADIAKKLRASQQQRIRRQQAPD--GTPYAARKRQSVRSKKGRIRREMFARLRT 78 (149) T ss_pred CchHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhhcCCC--CCCCcccchHHHHhccCCCCcccchhhhh Confidence 22345555555555333211112344566666666665544 453 21111111100 00 0000111 Q ss_pred ccceeeecccccccccceeEecCCCCCcceeeeeccCcc----------CCCCCcchhHhHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 74 SSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTA----------NMTAHPFIRPAFDVRQEQATEVAIRRMNQ 141 (149) Q Consensus 74 ~~~i~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~----------~~~a~PFl~pA~~~~k~~~~~~~~~~~~~ 141 (149) .. ......+ .....++...++..|+..+.||.. .+|+||||.=. 
+..++++++.+.+.|.+ T Consensus 79 ~~-----sl~~~~~-~~~~~V~~~Gs~~~yAa~HQfG~~~r~~~~~~~~~iPaRp~LG~s-~~d~~~i~~~i~~~l~~ 149 (149) T protein:vir:98 79 NR-----FMKAKGS-DSAAVVEFTGRVQRMARVHQYGLKDRPNRHSRDVQYAARPLLGFT-RDDEQMIEDIIIRHLGK 149 (149) T ss_pred hh-----hhhheec-CCeeEEEecCcchHHhhHhhccccccccCCCcceeccccccCCCC-HHHHHHHHHHHHHHhhC Confidence 11 1111112 222233334678899999999953 68999999744 45566777777776666 No 146 >protein:vir:101508 Length: 120 # NCBI annotation: gp21 # Family: family:all:2713 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655400;genbank:gi:109522588;genbank:GeneID:4157580 Probab=96.86 E-value=3.8e-05 Score=44.83 Aligned_cols=114 Identities=13% Similarity=0.210 Sum_probs=66.5 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--CCCcccccceeccccccccCcccccee Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPV--RTGKLKKNVVVVTQKSRRRGEISSGVH 78 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~--~~g~l~~~i~~~~~~~~~~~~~~~~i~ 78 (149) |++++|+|+ .++|++.++.+..+. ......-+..+|..+..+||.+||= .||+-+..|.... T Consensus 1 ~~~~~f~~~-~~~l~~~i~~~~~k~-~~~~~~~~d~~a~~le~~aK~nApW~DRTg~ARq~i~~~~-------------- 64 (120) T protein:vir:10 1 MAKIEFKFK-DIELRRGVEDMEAKV-DRAMKATSNYHAVEGTAHMKEHAPWTDRTGAARAGLHAVA-------------- 64 (120) T ss_pred CceEEEEec-HHHHHHHHhhhHHHH-HHHHHHHHHHHHHHHHHHHhcCCCCcccchhhhhhhcccc-------------- Confidence 999999999 589999999887664 2344444566777899999999993 3444433332110 Q ss_pred eecccccccccceeEecCCCCCcceeeeecc--CccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019769. 79 IRGVNPRTGNSDNTMKANNPRNAFYWRFVEM--GTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAIDEALSK 149 (149) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~--GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~k~~~k 149 (149) ....++. 
+...-..+..|.-|+|+ |..++ -|+|+.+.-.+++++- ++..++| T Consensus 65 ----~~~~~~~---~~Iylsh~veYG~~LEla~~~kya----Il~PTi~~~~~~il~g--------~~~ll~~ 118 (120) T protein:vir:10 65 ----STPQPDR---YEIVFAHTVHYGIWLEIANSGRYE----IIMPTVHHEGKLMAQR--------LRGLLGR 118 (120) T ss_pred ----ccCCCce---EEEEEecCeeecceEEeeCCCCcc----cccchHHHHhHHHHHH--------HHHHhhh Confidence 0000111 11111234578888994 44433 4566665544444444 4444444 No 147 >protein:vir:79115 Length: 148 # NCBI annotation: tail completion protein gpS # Family: family:all:370 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165266;genbank:gi:145708091;genbank:GeneID:5247126 Probab=96.82 E-value=9.4e-06 Score=48.14 Aligned_cols=125 Identities=15% Similarity=0.116 Sum_probs=63.1 Q ss_pred hHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CCcCCCcccccceecc--cccc------ccCccc Q lcl|NC_019769. 8 FSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIAR-----APVRTGKLKKNVVVVT--QKSR------RRGEIS 74 (149) Q Consensus 8 i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~-----aP~~~g~l~~~i~~~~--~~~~------~~~~~~ 74 (149) +..|++|...|..|-.......-+..++.-|+.++...+.+ .|..+ -+....... .+.. ..+.+. T Consensus 1 m~~~~~l~~~L~~ll~~l~~~~~~~l~r~Ig~~l~~st~~Rf~~q~~PDG~--~W~p~s~~~~~~~g~~~~~~~~~l~~~ 78 (148) T protein:vir:79 1 MSESRELEAWLAGMLTKLDAPARRMLARAVAAELRRRQAARIAEQRNPDGS--PYVPRKPQLRHRAGRIRRAMFMRLRLA 78 (148) T ss_pred CccHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCC--cCcccchHHHhhcccccccccchhhhh Confidence 44456666666655443322112334555555555555443 45322 111110000 0000 001111 Q ss_pred cceeeecccccccccceeEecCCCCCcceeeeeccC----------ccCCCCCcchhHhHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 75 SGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVEMG----------TANMTAHPFIRPAFDVRQEQATEVAIRRMNQ 141 (149) Q Consensus 75 ~~i~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~G----------T~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~ 141 (149) ..+ ....+ .....++...++..|+..+-|| +..+|+||||.=. +..++++++.+.+.|.. 
T Consensus 79 ~~l-----~~~~~-~~~~~v~~~Gt~~~yAaiHQfG~~~r~~~~~~~v~iPaRp~LG~s-~~d~~~i~~~i~~~l~~ 148 (148) T protein:vir:79 79 RYM-----KTQAD-ANTAVVTFAGNAQRIATVHQFGLRDRVNKAGLTAQYPARELLGMD-GVDMEHITNLLLLHLGA 148 (148) T ss_pred hhe-----eeeee-CCeeeEEeeccchhhhhhhhcCccccccCCCCccccCcccccCCC-HHHHHHHHHHHHHHhcC Confidence 111 11111 2222333346778999999999 4469999999865 45677788888888777 No 148 >protein:vir:1838 Length: 149 # NCBI annotation: O protein # Family: family:all:370 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052262;genbank:gi:9634069;genbank:GeneID:1262457 Probab=96.76 E-value=9.6e-06 Score=48.10 Aligned_cols=130 Identities=15% Similarity=0.151 Sum_probs=58.2 Q ss_pred hHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CCcCCCcccccceecc---ccccccCccccce-e Q lcl|NC_019769. 8 FSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIAR-----APVRTGKLKKNVVVVT---QKSRRRGEISSGV-H 78 (149) Q Consensus 8 i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~-----aP~~~g~l~~~i~~~~---~~~~~~~~~~~~i-~ 78 (149) +..|+++...|..|-.......-+..++.-|+.++...+.+ .|.. .-........ +.......+.... . T Consensus 1 m~~~~~~~~~l~~ll~~L~~~~~~~l~r~Ig~~l~~~t~~rf~~q~~PdG--~~W~p~~~~~~~~~~g~~~~~~~~~l~~ 78 (149) T protein:vir:18 1 MSELTALQERLAGLIASLSPAARRKMAAEIAKKLRTSQQQRIKRQQAPDG--TPYAARKRQPVRSKKGRIKREMFAKLRT 78 (149) T ss_pred CchHHHHHHHHHHHHHhcCCchHHHHHHHHHHHHHHHHHHHHHhhcCCCC--CCCcccchhhhhhccCcccchhhhhhhh Confidence 22344444444443322211111334555555566555543 4532 1111111000 0000000000000 0 Q ss_pred eecccccccccceeEecCCCCCcceeeeeccCcc----------CCCCCcchhHhHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 79 IRGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTA----------NMTAHPFIRPAFDVRQEQATEVAIRRMNQ 141 (149) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~----------~~~a~PFl~pA~~~~k~~~~~~~~~~~~~ 141 (149) .......... ....++...++..|+..+-||.. ++||||||.=. 
++.+.++++.+.+.|.+ T Consensus 79 ~~~l~~~~~~-~~~~v~~~Gtn~~yAaiHQfG~~~r~~~~~~~v~iPaRp~LG~s-~~d~~~I~~~i~~~l~~ 149 (149) T protein:vir:18 79 SRFMKAKGSD-SAAVVEFTGKVQRMARVHQYGLKDRPNRNSRDVQYEARPLLGFT-RDDEQMIEDVIISHLGK 149 (149) T ss_pred hhhhheeecC-ceeEEEecccchhhhhhhhccccccccCCCccccccccccCCCC-HHHHHHHHHHHHHHHhC Confidence 0011111111 22233334678899999999964 68999999865 44566777777776666 No 149 >protein:vir:2026 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046769;genbank:gi:9630340;genbank:GeneID:1261511 Probab=96.72 E-value=1.5e-05 Score=47.11 Aligned_cols=131 Identities=12% Similarity=0.111 Sum_probs=63.0 Q ss_pred hHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CCcCCCcccccceecc--cccc-ccCcccc-cee Q lcl|NC_019769. 8 FSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIAR-----APVRTGKLKKNVVVVT--QKSR-RRGEISS-GVH 78 (149) Q Consensus 8 i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~-----aP~~~g~l~~~i~~~~--~~~~-~~~~~~~-~i~ 78 (149) +..+++|...|..|-..+...-.+..++.-|+.++...+.+ .|.. .-........ .+.. ....+.. ... T Consensus 1 ~~~~~~l~~~L~~ll~~l~~~~~~~l~~~Ig~~l~~~~~~rf~~q~~PdG--~~W~p~k~~~~~~k~g~~~~~l~~~~~l 78 (150) T protein:vir:20 1 MNEFKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAPDG--TPYAPRQQQSVRKKTGRVKRKMFAKLIT 78 (150) T ss_pred CchHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCC--CCCcccchHHHHHhccCCCccccchhhh Confidence 45556666666655433221122344556666666665544 3422 1111111000 0000 0000110 001 Q ss_pred eecccccccccceeEecCCCCCcceeeeeccCc----------cCCCCCcchhHhHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 
79 IRGVNPRTGNSDNTMKANNPRNAFYWRFVEMGT----------ANMTAHPFIRPAFDVRQEQATEVAIRRMNQ 141 (149) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT----------~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~ 141 (149) ........+.....+.....++..|+..+-||- ..+||||||.=.- ..++++++.+.+.|.+ T Consensus 79 ~~sl~~~~~~~~~~vg~~~Gs~~~yAa~HQfG~~~~~~~~~~~~~iPaRp~LG~s~-~d~~~i~~~i~~~l~k 150 (150) T protein:vir:20 79 SRFLHIRASPEQASMEFYGGKSPKIASVHQFGLSEENRKDGKKIDYPARPLLGFTG-EDVQMIEEIILAHLER 150 (150) T ss_pred hhhhheeecCcEEEEEeeCCcchhhhhhhhcccccccccCCCceeccccccCCCCH-HHHHHHHHHHHHHHhC Confidence 111122222222233333467889999999993 3689999998663 4556677766666666 No 150 >protein:vir:79179 Length: 155 # NCBI annotation: gp39, phage virion morphogenesis protein # Family: family:all:370 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111070;genbank:gi:134288746;genbank:GeneID:4960698 Probab=96.68 E-value=1.7e-05 Score=46.68 Aligned_cols=131 Identities=14% Similarity=0.111 Sum_probs=59.5 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CCcCCCcccccceec---cccccccCc Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIAR-----APVRTGKLKKNVVVV---TQKSRRRGE 72 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~-----aP~~~g~l~~~i~~~---~~~~~~~~~ 72 (149) ||+ + +++|...|..|-..+....-+..++.-|+.++...+.+ .|..+. ...-... .......|. T Consensus 1 m~~---~---~~~l~~~l~~ll~~l~~~~~~~l~r~Ig~~l~~~t~~Rf~~q~~PDG~~--W~prk~~~~~~~~~~~~g~ 72 (155) T protein:vir:79 1 MTD---D---LQALERWAGGLLAKLSPAARRQLLRELGRDLRRAQQSRVAAQRNPDGSA--YEPRKVKAGGKRLREKAGR 72 (155) T ss_pred Cch---H---HHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCC--CcccchhhhhhhhhcccCc Confidence 665 2 33444444444332211111233445555555554433 453221 1100000 000011111 Q ss_pred ccccee------eecccccccccceeEecCCCCCcceeeeeccCcc----------CCCCCcchhHhHHHHHHHHHHHHH Q lcl|NC_019769. 
73 ISSGVH------IRGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTA----------NMTAHPFIRPAFDVRQEQATEVAI 136 (149) Q Consensus 73 ~~~~i~------~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~----------~~~a~PFl~pA~~~~k~~~~~~~~ 136 (149) +..... ........+ .....++...++..|+..+-||.. .+||||||.=.- ..++++++.+. T Consensus 73 ~~~~~m~~~l~~a~~l~~~~~-~d~a~Vg~~Gs~~~yAaiHQfG~~~r~~~~~~~v~iPaRp~LGls~-~d~~~I~~~i~ 150 (155) T protein:vir:79 73 VKREAMFRKLRTARYLRIDVD-STGLAIGFDERLSRIARVHQEGQKAPVEPGGPLAQYPVRVVLGFSD-ADRELVRDRLL 150 (155) T ss_pred ccchhhhhhhhhhheeeeeec-CcEEEEEecCcchhhhhhhhcCCcccCCCCCcccccccccccCCCH-HHHHHHHHHHH Confidence 111100 001111111 222233334678899999999954 689999997664 45677777777 Q ss_pred HHHHH Q lcl|NC_019769. 137 RRMNQ 141 (149) Q Consensus 137 ~~~~~ 141 (149) +.|.+ T Consensus 151 ~~l~r 155 (155) T protein:vir:79 151 RELTR 155 (155) T ss_pred HHhhC Confidence 77776 No 151 >protein:vir:98892 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:899 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164422;genbank:gi:56694912;genbank:GeneID:3197282 Probab=96.61 E-value=1.7e-05 Score=46.72 Aligned_cols=103 Identities=15% Similarity=0.075 Sum_probs=61.2 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) ||+++|++.++...+.. +.+..|...-+..+..++..-+|.++|.|++|...... 
.| T Consensus 1 mmkvkv~~~~~~~~~~~----------~~~~~aq~~~~~ev~~~~~~yVP~~~G~L~~s~~~~s~----~g--------- 57 (108) T protein:vir:98 1 MPKIRVELSGAKDKLSP----------QTQRRGQYAMANQMLQDMNQFVPMEEGILRLTGNISSD----AE--------- 57 (108) T ss_pred CceeEeeehHHHHHHHH----------HHHHHHHHHHHHHHHHhhcccCcCcCCccccceeeccC----Cc--------- Confidence 99999999876542211 12223445556667778888999999999987543210 00 Q ss_pred cccccccccceeEecCCCCCcceeeeeccCccCCCC-----CcchhHhHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVEMGTANMTA-----HPFIRPAFDVRQEQATEVAIRRMNQ 141 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a-----~PFl~pA~~~~k~~~~~~~~~~~~~ 141 (149) .+ ..+++|++++=||..+.-. ..|+..|.....+++++.+.+.++= T Consensus 58 -----------~I----~y~tPYAr~qYYg~~~n~~~p~ag~~W~eraka~~~~~~~~~~~k~~k~ 108 (108) T protein:vir:98 58 -----------EI----YYNTPYAKRRFYEPAYNYTTPGTGPRWDMKAKRLFISDWERAYMKGANW 108 (108) T ss_pred -----------eE----EecChhhHHhhhccccCCCCCCCcchhHHHHHhhhhHHHHHHHHHhhcC Confidence 01 1246788877777544332 3466666666655555555444433 No 152 >protein:vir:45 Length: 112 # NCBI annotation: gp10 # Family: family:all:899 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463471;swissprot:trembl:q9t1b3;genbank:gi:16798793;uniprot:Q9T1B3;genbank:GeneID:922369 Probab=96.56 E-value=2e-05 Score=46.33 Aligned_cols=104 Identities=12% Similarity=0.149 Sum_probs=68.0 Q ss_pred cceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeeec Q lcl|NC_019769. 2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRG 81 (149) Q Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~~ 81 (149) |+++|+|.. ..+.++|. +.+.++...-++.+..++..-+|.++|.|++|.... 
T Consensus 1 M~vkv~vn~-~~~~~~l~--------~a~~r~q~~~~~ev~~~~~~yVP~~~G~L~~S~~~~------------------ 53 (112) T protein:vir:45 1 MPIKVRVDL-SKAKGSVK--------KAKERGQFALINQAAADIALYVPFLSGDLSNQYVIM------------------ 53 (112) T ss_pred CceeEEeeh-HHHHHHHH--------HHHHHHHHHHHHHHHHHhhcCCccccCccccceeec------------------ Confidence 888888873 44433332 222345666677778888999999999998863210 Q ss_pred ccccccccceeEecCCCCCcceeeeeccCccC--------CCCCcchhHhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 82 VNPRTGNSDNTMKANNPRNAFYWRFVEMGTAN--------MTAHPFIRPAFDVRQEQATEVAIRRMNQAI 143 (149) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~--------~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l 143 (149) +... + ..+++|++++=||... ..-..|+..|.....+.+++.+.+.+.+.| T Consensus 54 -----~~g~--I----~y~tPYAr~qYY~~~~~~~~~~~p~ag~~W~erak~~~~~~~~~~~~k~~~~gl 112 (112) T protein:vir:45 54 -----NDKE--I----MWTSIYARRLYKGINFNFTLTHHPLAGPEWDQRAKIDKMDVWEKVAQKAVEEGL 112 (112) T ss_pred -----cCCe--E----EecChhhHHhhhccccCCCCCCCCCCchhhHHHHHHhhHHHHHHHHHHHHhhcC Confidence 0000 1 1346788777665432 233468888888888888888888888888 No 153 >protein:vir:96288 Length: 100 # NCBI annotation: ORF049 # Family: family:all:180 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240315;genbank:gi:66396010;genbank:GeneID:5133365 Probab=96.51 E-value=6e-06 Score=49.23 Aligned_cols=88 Identities=18% Similarity=0.237 Sum_probs=57.3 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) |++++. |.++|.+.|+..++++.+.+ +..+.+.|+.|...|..+||+|+|.|+.||....+... +...|.+. 
T Consensus 13 makvky---G~~dmvk~~~~f~~~i~~~v-k~~IakTa~~I~~~Avs~APVD~G~Lk~SI~~dyk~GG----ltavI~vG 84 (100) T protein:vir:96 13 MAKVKY---GADSMVVELDKFDKKIEEWV-KKGIAKTTTKIYNTAVALAPVDLGFLEESIDFKYFDGG----LSSVISVG 84 (100) T ss_pred hhhhee---chHHHHHHHhcchHHHHHHH-HHHHHHHHHHHHhhHHhhccccccccceeeeeeeecCC----eeEEEecc Confidence 777765 88999999999999986555 78889999999999999999999999999977543322 22222110 Q ss_pred cccccccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHH Q lcl|NC_019769. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVA 135 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~ 135 (149) +.|+- ..| .+-.+-.+ T Consensus 85 --------------------AeYAI------krm-------------sqllvtvi 100 (100) T protein:vir:96 85 --------------------ADYAI------KRM-------------SQLLVTVI 100 (100) T ss_pred --------------------hhHHH------HHH-------------HHHHhhcC Confidence 11110 000 00000000 No 154 >protein:vir:100312 Length: 152 # NCBI annotation: tail synthesis protein S # Family: family:all:370 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655481;genbank:gi:109289949;genbank:GeneID:4157355 Probab=96.38 E-value=4.2e-05 Score=44.58 Aligned_cols=133 Identities=14% Similarity=0.107 Sum_probs=61.2 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CCcCCCc--ccccceeccccccccCcc Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIAR-----APVRTGK--LKKNVVVVTQKSRRRGEI 73 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~-----aP~~~g~--l~~~i~~~~~~~~~~~~~ 73 (149) |.+ . +.+|...|..|-......--+..+..-|+.++...+.+ .|..+.. ++..... .+.....+.. 
T Consensus 1 M~~-~-----~~~~~~~L~~ll~~L~~~~r~~l~~~Ig~~l~~~t~~Rf~~q~~PDG~pW~p~k~~~~~-~k~~~~~~~m 73 (152) T protein:vir:10 1 MSE-P-----IEQVKTAFDSLLNNISKPRRRLMYQQIGRELARSQRRRIKAQQNPDGSAYEPRKKPKKG-VKSKIKSGKM 73 (152) T ss_pred Cch-H-----HHHHHHHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchhhhhhhh-hcccccchhH Confidence 333 2 34444444444332211111234555566666555544 4533211 0000000 0000011111 Q ss_pred cccee-eecccccccccceeEecCCCCCcceeeeeccC-----------ccCCCCCcchhHhHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 74 SSGVH-IRGVNPRTGNSDNTMKANNPRNAFYWRFVEMG-----------TANMTAHPFIRPAFDVRQEQATEVAIRRMNQ 141 (149) Q Consensus 74 ~~~i~-~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~G-----------T~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~ 141 (149) ...+. ....+.. .......++...++..|+..+-|| +..+|+||||.=. +..+.++++.+.+.|.. T Consensus 74 ~~~L~~a~~l~~~-a~~~~~~Vg~~Gt~~~yAaiHQfG~~~r~~~~~~~~v~iPaRp~LG~s-~~d~~~I~~~i~~~l~~ 151 (152) T protein:vir:10 74 FDKITQPRFMRLR-LESEGVSLGYEGGDAVIARIHQQGLIGRVRKDWDLKVKYASRELLGFT-DDDLQMIEDYMINILAG 151 (152) T ss_pred HHhhhhcceeeee-ecCcEEEEEecCCchhhhhhhccCccccccCCCCcceeccccccCCCC-HHHHHHHHHHHHHHHhc Confidence 11000 0001111 112233333346788999999998 5569999999766 44567777777777777 Q ss_pred H Q lcl|NC_019769. 142 A 142 (149) Q Consensus 142 ~ 142 (149) + T Consensus 152 a 152 (152) T protein:vir:10 152 S 152 (152) T ss_pred C Confidence 6 No 155 >protein:vir:396 Length: 184 # NCBI annotation: gp11 # Family: family:all:869 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046906;genbank:gi:9630476;genbank:GeneID:1261650 Probab=96.30 E-value=0.00019 Score=40.99 Aligned_cols=139 Identities=12% Similarity=0.173 Sum_probs=62.8 Q ss_pred eehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCc----ccccceeccccccccCccccceeeec Q lcl|NC_019769. 6 LDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGK----LKKNVVVVTQKSRRRGEISSGVHIRG 81 (149) Q Consensus 6 ~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~----l~~~i~~~~~~~~~~~~~~~~i~~~~ 81 (149) |+|+||+++++.|..|++....+++..|+...+.-++..+...+...++- +++.+.... 
...+.+...|.+.+ T Consensus 1 ~~v~~l~~~~~~L~~l~~~~v~kA~~rAiNrt~~~~rt~~~r~v~~~~~i~~~~ir~r~~~~k---as~~~l~a~I~~~~ 77 (184) T protein:vir:39 1 MSLKGLEQAIENLNSISKTAVPRASAQAVNRVANRAVSRSVAVVSKDTRVPRKLVKQRARVKR---ATVNKPRALIRVNR 77 (184) T ss_pred CchHHHHHHHHHHhccCHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHhhheecc---cCCCCeEEEEEEec Confidence 99999999999999998775678888999888888887776665543322 222222110 11112222222111 Q ss_pred c------------------cccccccceeEecCCC--CCccee--------eeeccCccCC-------C-CCcchhHhHH Q lcl|NC_019769. 82 V------------------NPRTGNSDNTMKANNP--RNAFYW--------RFVEMGTANM-------T-AHPFIRPAFD 125 (149) Q Consensus 82 ~------------------~~~~~~~~~~~~~~~~--~~~~y~--------~~~E~GT~~~-------~-a~PFl~pA~~ 125 (149) . ....+. ...+.++.. ..+|.+ -|.--|.... | +.| +..+++ T Consensus 78 ~~i~l~~~g~~~~k~~~~~~~~~~~-~~~~~~g~~~~~gaFia~~~~G~~~Vf~R~gk~R~PI~~~~~~i~~~-~~e~~~ 155 (184) T protein:vir:39 78 GNLPAIKLGTASVRLSRRKRDKKGA-NSVLRIGPFRFPGGFIQQLKNGRWHVMRRTSKPRYPIEVVSIPLAAP-LTTAFK 155 (184) T ss_pred cceeeeeccccccccCccccccccc-cceeeecceecCcceeeecCCCceEEEEEecCcccceeEEEcCchHH-HHHHHH Confidence 0 000010 000100000 012222 1222233322 2 122 233333 Q ss_pred HHH-----HHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019769. 126 VRQ-----EQATEVAIRRMNQAIDEALSK 149 (149) Q Consensus 126 ~~k-----~~~~~~~~~~~~~~l~k~~~k 149 (149) ..- +.+...|..+|..+|+++++| T Consensus 156 ~~~~~~~~~~~~~el~~~l~~~L~~~l~r 184 (184) T protein:vir:39 156 EELPKLMESDMPKELRASLTNQLRLILTR 184 (184) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhcCC Confidence 222 333344444444444445555 No 156 >protein:vir:3427 Length: 192 # NCBI annotation: tail component # Family: family:all:869 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040590;genbank:gi:9626254;genbank:GeneID:2703485 Probab=95.26 E-value=0.00065 Score=38.04 Aligned_cols=138 Identities=14% Similarity=0.175 Sum_probs=60.8 Q ss_pred eehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcc----cccceeccccccccCccccceeeec Q lcl|NC_019769. 
6 LDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKL----KKNVVVVTQKSRRRGEISSGVHIRG 81 (149) Q Consensus 6 ~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l----~~~i~~~~~~~~~~~~~~~~i~~~~ 81 (149) |+|+||+++++.|+.|++....+++..|+...|.-+...+...+...+|-- +..+.... ...+.+...|.+.. T Consensus 1 ~~ik~l~~~~~~L~~i~~~~vp~A~~rAiNrta~~a~t~~~r~v~~e~~I~~k~Ir~r~r~~k---As~~~l~a~I~~~~ 77 (192) T protein:vir:34 1 MAIKGLEQAVENLSRISKTAVPGAAAMAINRVASSAISQSASQVARETKVRRKLVKERARLKR---ATVKNPQARIKVNR 77 (192) T ss_pred CcchhHHHHHHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCHHHHHhhheecc---ccCCCceEEEEEec Confidence 888999999999999998766788888888888877777766655444322 22222110 01111111111111 Q ss_pred cc--------cc----------------ccccceeEecCC--CCCccee-------e-eec-cCccCCC----CCcc--- Q lcl|NC_019769. 82 VN--------PR----------------TGNSDNTMKANN--PRNAFYW-------R-FVE-MGTANMT----AHPF--- 119 (149) Q Consensus 82 ~~--------~~----------------~~~~~~~~~~~~--~~~~~y~-------~-~~E-~GT~~~~----a~PF--- 119 (149) .. .. .......+.++. -..+|++ | |.- .|...-| -=|. T Consensus 78 ~~l~~~~l~~~~~~~~rr~~~~~~~~~~~~~~g~~~k~Gk~~f~gaFia~m~ng~~~Vf~R~~gk~R~PIe~vkIpis~~ 157 (192) T protein:vir:34 78 GDLPVIKLGNARVVLSRRRRRKKGQRSSLKGGGSVLVVGNRRIPGAFIQQLKNGRWHVMQRVAGKNRYPIDVVKIPMAVP 157 (192) T ss_pred cceeeeeecccccccccccccccccccccccccceeeecceecCCcccccCCCCCceeEEEccCCCccceeEEEechhHH Confidence 00 00 000000011100 0112322 1 221 1332211 1122 Q ss_pred hhHhHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019769. 120 IRPAFDVRQEQATEVAIRRMNQAIDEALSK 149 (149) Q Consensus 120 l~pA~~~~k~~~~~~~~~~~~~~l~k~~~k 149 (149) +..||+..- .+.+.+.+.++|.++|+. 
T Consensus 158 l~~af~~~~---~~~~~~~~~~El~~~L~~ 184 (192) T protein:vir:34 158 LTTAFKQNI---ERIRRERLPKELGYALQH 184 (192) T ss_pred HHHHHHHHH---HHHHHHHHHHHHHHHHHH Confidence 255555443 333333333333333333 No 157 >protein:vir:7993 Length: 108 # NCBI annotation: gp9 # Family: family:all:3937 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817347;genbank:gi:29565775;genbank:GeneID:1259013 Probab=95.16 E-value=9.7e-06 Score=48.08 Aligned_cols=90 Identities=19% Similarity=0.264 Sum_probs=51.6 Q ss_pred Cc----------ceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceecccccccc Q lcl|NC_019769. 1 MI----------ETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRR 70 (149) Q Consensus 1 Mm----------~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~ 70 (149) |+ ++.+..+ +| .+|++ + ...+.+=.+.+...++++.|+++|.++.|..+..+. .. T Consensus 1 ma~gpt~kNP~~KFGvs~~---d~----~K~~E-V-----n~GvNeFMdE~~~~~K~~SPV~~G~Y~~S~~V~ers-~N- 65 (108) T protein:vir:79 1 MANGPTRKNPLAKFGVRLD---DF----DKLPE-V-----NQGVNEFMDEVVDAWKNNSPVGTGAYRDSVQVTERS-TN- 65 (108) T ss_pred CCCCcccccchhhhcCChh---hh----hhchh-h-----hhhHHHHHHHHHHHHhhcCCCCchhhHHHHHHHHhh-hc- Confidence 43 3444332 22 22442 2 223333344677888999999999999887653211 00 Q ss_pred CccccceeeecccccccccceeEecCCCCCcceeeeeccCccCC----CC----CcchhHhHHH Q lcl|NC_019769. 71 GEISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTANM----TA----HPFIRPAFDV 126 (149) Q Consensus 71 ~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~----~a----~PFl~pA~~~ 126 (149) +| .+......||+||+||||.+. |+ ..|=..||+. 
T Consensus 66 ---------------kG------RG~~G~~~~~AH~VEFGs~hndeyapaqktakqfggtay~d 108 (108) T protein:vir:79 66 ---------------KG------RGKVGATDPQAHLVEFGSAHNDEYAPAQKTAKQFGGTAYGD 108 (108) T ss_pred ---------------cC------ccccCCcchhhhhhhhhccccccccchhhHHHhhcccccCC Confidence 01 112234679999999999874 33 3466666655 No 158 >protein:vir:96763 Length: 177 # NCBI annotation: putative phage-related protein # Family: family:all:1091 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039824;genbank:gi:126010915;genbank:GeneID:5076273 Probab=94.69 E-value=0.0026 Score=34.74 Aligned_cols=142 Identities=16% Similarity=0.151 Sum_probs=71.6 Q ss_pred CcceeeehHh-HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCC----cccccceeccccccccCcccc Q lcl|NC_019769. 1 MIETSLDFSG-LNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTG----KLKKNVVVVTQKSRRRGEISS 75 (149) Q Consensus 1 Mm~~~~~i~G-l~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g----~l~~~i~~~~~~~~~~~~~~~ 75 (149) =|+++|++++ ++.+...|..++.. ..+++..||...|.-++..+...+...++ .+++.+....... .+.. T Consensus 4 ~~~l~idv~~~l~~i~~~l~~~~~~-~~~A~~rAlNrta~~~rt~~~r~v~~~~~i~~k~ir~r~~~~~a~~----~~~~ 78 (177) T protein:vir:96 4 GFEMKIDVSREAEDIAAMVAATTKQ-LELAAQRAMTKAGQWLRTHSVRELGQQLGIKQEPLKKRFRVYPQRQ----KGEV 78 (177) T ss_pred CceeEEehhHHHHHHHHHHhhcHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHhhheeeccCC----CcEE Confidence 4567888877 44444555555554 46788889988888877777665543332 2223222211111 1111 Q ss_pred ceeeec-------c-cccccccceeEecCC--CCCcceee--------eeccCccC-------CCCCcchhHhHHHHHHH Q lcl|NC_019769. 76 GVHIRG-------V-NPRTGNSDNTMKANN--PRNAFYWR--------FVEMGTAN-------MTAHPFIRPAFDVRQEQ 130 (149) Q Consensus 76 ~i~~~~-------~-~~~~~~~~~~~~~~~--~~~~~y~~--------~~E~GT~~-------~~a~PFl~pA~~~~k~~ 130 (149) .|.... . ..+.+.. .+.++. -..+|.+. |.--|... 
.|--|=+..+++...++ T Consensus 79 ~i~~~~~~i~l~~~~~~r~t~~--Gv~~g~~~~~gaFia~~~~g~~~Vf~R~gk~R~PI~~~~~pi~~~~~~~~e~~~~~ 156 (177) T protein:vir:96 79 RFWVGLDPIGVYRLGTPKVTQK--GVKVNRNEYDGAFISPMKSNYPLVFKRRGKERLPIDLVDEDIDEPAMEVVERWERR 156 (177) T ss_pred EEEEeccceehhhcccCCCCcc--ceEEeeEEcCCceeccCCCCCceEEEEecCCccceEEEEcCchHHHHHHHHHHHHH Confidence 111110 0 0111110 000000 00112211 11123222 33333356778877888 Q ss_pred HHHHHHHHHHHHHHHHhcC Q lcl|NC_019769. 131 ATEVAIRRMNQAIDEALSK 149 (149) Q Consensus 131 ~~~~~~~~~~~~l~k~~~k 149 (149) +.+.|...|.++|+.+|+- T Consensus 157 ~~~~~~~~l~~Ei~~~L~g 175 (177) T protein:vir:96 157 VFQRFKELFEQEARAIING 175 (177) T ss_pred HHHHHHHHHHHHHHHHhcc Confidence 8888999999999999999 No 159 >protein:vir:4790 Length: 114 # NCBI annotation: putative minor capsid protein 3 # Family: family:all:899 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150170;swissprot:trembl:q94m41;genbank:gi:15088781;uniprot:Q94M41;genbank:GeneID:955992 Probab=94.68 E-value=0.00033 Score=39.67 Aligned_cols=104 Identities=15% Similarity=0.169 Sum_probs=57.0 Q ss_pred cceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeeec Q lcl|NC_019769. 2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRG 81 (149) Q Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~~ 81 (149) |+++|+|+ ++.+.+.|.. +.+.++...-+..+..++..-+|.++|.|++|..... T Consensus 1 M~~kVkv~-l~~~~~~l~~-------~~l~r~Q~~~~~ev~~~~~~YVP~~~G~L~~S~~~~~----------------- 55 (114) T protein:vir:47 1 MNIAIKVD-LQKAKQKLSN-------ESMTRGKIAVASKILLDNEQYIPLRGGELRASGRIVG----------------- 55 (114) T ss_pred CceeEEee-hhHHHHHHHH-------HHHHHHHHHHHHHHHHhhccCCcCccCccccceeeee----------------- Confidence 88887777 5555555531 1223344445566777788899999999998753311 Q ss_pred ccccccccceeEecCCCCCcceeeeeccCcc----------CCCCCcchhHhHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019769. 
82 VNPRTGNSDNTMKANNPRNAFYWRFVEMGTA----------NMTAHPFIRPAFDVRQEQATEVAIRRMNQAIDEALSK 149 (149) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~----------~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~k~~~k 149 (149) +... + ..+++|++++=||-. .+.-..|+..|.....+++++.+.+.+ += T Consensus 56 -----~~~~--I----~y~tPYAr~qyYg~~~~~~~~~~~~p~~g~~W~eraka~~~~~~~~~~~k~~--------g~ 114 (114) T protein:vir:47 56 -----QGDA--V----VYGTVYARAQFYGSNGIVTFRRYTTPGTGKRWDQVATSKHAEEWARAFVKGM--------GL 114 (114) T ss_pred -----CCcE--E----EecCchhhHhhhcccCCCCCCccCCCCCcchhHHHHHhhhhHHHHHHHHHhh--------CC Confidence 0000 1 124677777666521 123344666666555555444443322 22 No 160 >protein:vir:9823 Length: 118 # NCBI annotation: putative minor capsid protein # Family: family:all:899 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795585;genbank:gi:28876336;genbank:GeneID:1257873 Probab=93.93 E-value=0.00044 Score=38.96 Aligned_cols=101 Identities=13% Similarity=0.136 Sum_probs=52.7 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) ||+++|++.+++..+. .+.+.++...-+..+..++..-+|.++|.|++|..+... . T Consensus 1 m~kV~vdl~~~~~~ls----------~~~~~k~Q~~~~~ev~~~~~~YVP~~tG~Lk~S~~i~~~----------~---- 56 (118) T protein:vir:98 1 MAKVVVELGGIKRKVS----------PQALAKGKLIMNNQVMMSMNPYVPYRDGALRGSSRANSV----------G---- 56 (118) T ss_pred CceeeechhHHhhhhh----------HHHHHHHHHHHHHHHHHHhhcCCCCccCccccceeecCC----------e---- Confidence 9999999998766441 111223344455567777888999999999987543210 0 Q ss_pred cccccccccceeEecCCCCCcceeeeeccCc------------cCC--CCCcchhHhHHHHH--HHHHHHHHHHHHHHHH Q lcl|NC_019769. 
81 GVNPRTGNSDNTMKANNPRNAFYWRFVEMGT------------ANM--TAHPFIRPAFDVRQ--EQATEVAIRRMNQAID 144 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT------------~~~--~a~PFl~pA~~~~k--~~~~~~~~~~~~~~l~ 144 (149) + ..+++|++.+=||- ... .-..|..++.-..+ +..++++.+ T Consensus 57 ------------I----~Y~tPYAr~qYY~~~~~~~~g~~~~~~~~p~~g~~Wd~R~ka~~~~~~~w~~~~~k------- 113 (118) T protein:vir:98 57 ------------V----TWSGPHARAQFYGGAYNKYKSFKFKKYTTPGTGKRWDKRALANATIVKDWEKSLLR------- 113 (118) T ss_pred ------------e----EECCchhhHhhhccccCCCCccccccccCCCCCCcccchhhcchhhhHHHHHHHHH------- Confidence 1 12356665554432 122 23445554443221 222222222 Q ss_pred HHhcC Q lcl|NC_019769. 145 EALSK 149 (149) Q Consensus 145 k~~~k 149 (149) .++= T Consensus 114 -~~g~ 117 (118) T protein:vir:98 114 -GMGF 117 (118) T ss_pred -hcCC Confidence 2221 No 161 >protein:vir:3036 Length: 118 # NCBI annotation: minor capsid protein # Family: family:all:899 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438149;genbank:gi:16271812;genbank:GeneID:929237 Probab=93.93 E-value=0.00044 Score=38.96 Aligned_cols=101 Identities=13% Similarity=0.136 Sum_probs=52.7 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) ||+++|++.+++..+. .+.+.++...-+..+..++..-+|.++|.|++|..+... . T Consensus 1 m~kV~vdl~~~~~~ls----------~~~~~k~Q~~~~~ev~~~~~~YVP~~tG~Lk~S~~i~~~----------~---- 56 (118) T protein:vir:30 1 MAKVVVELGGIKRKVS----------PQALAKGKLIMNNQVMMSMNPYVPYRDGALRGSSRANSV----------G---- 56 (118) T ss_pred CceeeechhHHhhhhh----------HHHHHHHHHHHHHHHHHHhhcCCCCccCccccceeecCC----------e---- Confidence 9999999998766441 111223344455567777888999999999987543210 0 Q ss_pred cccccccccceeEecCCCCCcceeeeeccCc------------cCC--CCCcchhHhHHHHH--HHHHHHHHHHHHHHHH Q lcl|NC_019769. 
81 GVNPRTGNSDNTMKANNPRNAFYWRFVEMGT------------ANM--TAHPFIRPAFDVRQ--EQATEVAIRRMNQAID 144 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT------------~~~--~a~PFl~pA~~~~k--~~~~~~~~~~~~~~l~ 144 (149) + ..+++|++.+=||- ... .-..|..++.-..+ +..++++.+ T Consensus 57 ------------I----~Y~tPYAr~qYY~~~~~~~~g~~~~~~~~p~~g~~Wd~R~ka~~~~~~~w~~~~~k------- 113 (118) T protein:vir:30 57 ------------V----TWSGPHARAQFYGGAYNKYKSFKFKKYTTPGTGKRWDKRALANATIVKDWEKSLLR------- 113 (118) T ss_pred ------------e----EECCchhhHhhhccccCCCCccccccccCCCCCCcccchhhcchhhhHHHHHHHHH------- Confidence 1 12356665554432 122 23445554443221 222222222 Q ss_pred HHhcC Q lcl|NC_019769. 145 EALSK 149 (149) Q Consensus 145 k~~~k 149 (149) .++= T Consensus 114 -~~g~ 117 (118) T protein:vir:30 114 -GMGF 117 (118) T ss_pred -hcCC Confidence 2221 No 162 >protein:vir:1581 Length: 116 # NCBI annotation: minor capsid protein # Family: family:all:899 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695163;swissprot:trembl:o03933;genbank:gi:23455806;uniprot:O03933;genbank:GeneID:955512 Probab=93.33 E-value=0.00078 Score=37.63 Aligned_cols=105 Identities=10% Similarity=0.024 Sum_probs=57.0 Q ss_pred cceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeeec Q lcl|NC_019769. 2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRG 81 (149) Q Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~~ 81 (149) |.++|+|. ++.+.++|. .+.+.++...-+..+..++..-+|.++|.+..+...... T Consensus 1 M~ikVkv~-l~~~~~~~~-------~~~~~r~Q~~l~~qv~~~m~~YVP~~tg~~~ls~~~~~~---------------- 56 (116) T protein:vir:15 1 MAFRINVD-LDGFMDQTS-------LDNVKRGQYALVNQAMYDMEQFVPKDRPEEPLRQSVHAT---------------- 56 (116) T ss_pred CCceEEee-hhHhhhhhh-------HHHHHHHHHHHHHHHHHhhhccCCcccCCcccccceeee---------------- Confidence 88888876 455544442 122334455556667778888999999875543221110 Q ss_pred ccccccccceeEecCCCCCcceeeeeccCc-----------cCCCCCcchhHhHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 
82 VNPRTGNSDNTMKANNPRNAFYWRFVEMGT-----------ANMTAHPFIRPAFDVRQEQATEVAIRRMN 140 (149) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT-----------~~~~a~PFl~pA~~~~k~~~~~~~~~~~~ 140 (149) .+... + ..+++|++.+=||- ....-..|+..|.....+..++.+.+.++ T Consensus 57 ----~~~~~--I----~y~tPYAr~qyYg~~~~~~~~~~~t~p~ag~~W~eraK~~h~~~w~~~~~k~~~ 116 (116) T protein:vir:15 57 ----SDGSE--I----TYSTPYAKAQFYGIINDKYPVHNYTTPGTTKRWDLKAKSMFMSSWIDTFTKGMK 116 (116) T ss_pred ----cCCce--E----EecCchhHHHhcccccCCCCcccccCCCCCcchhHHHHhhhHHHHHHHHHHhcC Confidence 00000 1 12356666554432 11233457777777776666666666555 No 163 >protein:vir:1164 Length: 156 # NCBI annotation: predicted tail completion # Family: family:all:370 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490613;genbank:gi:17313233;genbank:GeneID:927308 Probab=92.80 E-value=0.0014 Score=36.19 Aligned_cols=134 Identities=13% Similarity=0.068 Sum_probs=55.9 Q ss_pred CcceeeehHhHH-HHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CCcCCCcccccceecccccc-----c Q lcl|NC_019769. 1 MIETSLDFSGLN-DIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIAR-----APVRTGKLKKNVVVVTQKSR-----R 69 (149) Q Consensus 1 Mm~~~~~i~Gl~-~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~-----aP~~~g~l~~~i~~~~~~~~-----~ 69 (149) |-+ +++-|+ .|..-|.+|..... +..++.-|+.++...+.+ .|..+ -............ . T Consensus 1 m~~---~~~~l~~~L~~ll~~L~~~~~----~~l~r~Ig~~l~~~t~~Rf~~q~~PdG~--~W~p~~~~~~~~~~~~~~~ 71 (156) T protein:vir:11 1 MAD---SLEALEDWAGPILRALEPGPR----AALARSLARDLRRSQQKRVMAQRNPDGS--AYEPRKKRELRGKQGRIRR 71 (156) T ss_pred Cch---hHHHHHHHHHHHHHhcCCcch----HHHHHHHHHHHHHHHHHHHHhhcCCCCC--CCcccchHHHhhhcccccc Confidence 333 233222 22223334432211 233455555555555443 45322 1111100000000 0 Q ss_pred cCc-cccceeeecccccccccceeEecCCCCCcceeeeeccCcc----------CCCCCcchhHhHHHHHHHHHHHHHHH Q lcl|NC_019769. 70 RGE-ISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTA----------NMTAHPFIRPAFDVRQEQATEVAIRR 138 (149) Q Consensus 70 ~~~-~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~----------~~~a~PFl~pA~~~~k~~~~~~~~~~ 138 (149) ... +..-.......... 
......++...++..|++.+-||.. .+||||||.=.- +.++++++.+.+. T Consensus 72 ~~~m~~~l~~~~~l~~~~-~~~~a~vg~~Gs~~~yA~iHQfG~~~~~~~~~~~v~iPaRp~LG~s~-~d~~~i~~~i~~~ 149 (156) T protein:vir:11 72 KIKMFQKLRTVRYLRAKG-DAQAITVSFAGRIARIARVHQYGLRDRAEPGAPEVSYAQRLLLGFDS-SDMETIQNGILAH 149 (156) T ss_pred chhhhhhhhhhheeeeee-cCcEEEEEecCCchhhhhhhcccccccccCCCCcccccccccCCCCH-HHHHHHHHHHHHH Confidence 000 00000000111111 1222233334678899999999965 689999997663 4455555555555 Q ss_pred HHHHHHH Q lcl|NC_019769. 139 MNQAIDE 145 (149) Q Consensus 139 ~~~~l~k 145 (149) |....-- T Consensus 150 l~~~~~~ 156 (156) T protein:vir:11 150 IDANSPI 156 (156) T ss_pred HhhcCCC Confidence 4443222 No 164 >protein:vir:79687 Length: 113 # NCBI annotation: hypothetical protein # Family: family:all:899 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285886;genbank:gi:148750843;genbank:GeneID:5220386 Probab=92.20 E-value=0.0011 Score=36.75 Aligned_cols=103 Identities=17% Similarity=0.104 Sum_probs=53.8 Q ss_pred HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeeecccccccccc Q lcl|NC_019769. 11 LNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVNPRTGNSD 90 (149) Q Consensus 11 l~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~ 90 (149) +.+|...-+.| ..+.+.+|-..-+..|..++..-+|.++|.|++|..+.. .. T Consensus 1 ~~dL~~~~~~~----~~~~~~raQ~~l~~ev~~~~~pYVP~~~G~Lk~S~~i~s------------------------~~ 52 (113) T protein:vir:79 1 MSDLSVFSRMA----QSTGSRSVRLQVLNQMHQDMEQYVPKRAGFLRSQSFVND------------------------TG 52 (113) T ss_pred CchHHHHHHhh----chhHHHHHHHHHHHHHHHhhcccCcccccchhccccccC------------------------Ce Confidence 22332222222 333445566666777888899999999999998753210 00 Q ss_pred eeEecCCCCCcceeeeeccCccC----------CCCCcchhHhHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019769. 91 NTMKANNPRNAFYWRFVEMGTAN----------MTAHPFIRPAFDVRQEQATEVAIRRMNQAIDEAL 147 (149) Q Consensus 91 ~~~~~~~~~~~~y~~~~E~GT~~----------~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~k~~ 147 (149) ..-+++|++++=||... 
..-..|+..|.....++.++.+.+.+.+..+-.- T Consensus 53 ------I~y~tPYAr~qyYg~~~~~~~~~~t~p~ag~~W~eraKa~h~~~w~~~~~~a~~~G~~~~~ 113 (113) T protein:vir:79 53 ------IHYTAKYARAQFYGFVNGHRVRNYSTPGTGRRWDLKAKAVYKADWQKVAVAAFLKEAKGEY 113 (113) T ss_pred ------eEecChhhhHhhccccCCCCccccCCCCCCchhhHHHHHHhHHHHHHHHHHHhhccccccC Confidence 12245677776665332 2223466655555555554444443333221111 No 165 >protein:vir:6375 Length: 205 # NCBI annotation: hypothetical protein # Family: family:all:10491 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918988;genbank:gi:34610163;genbank:gi:91214209;genbank:GeneID:2559587 Probab=91.92 E-value=0.013 Score=30.93 Aligned_cols=143 Identities=12% Similarity=0.156 Sum_probs=56.0 Q ss_pred cceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHH-HH----HHHhCCcCCCcccccceecccc----cc---- Q lcl|NC_019769. 2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLK-EE----VIARAPVRTGKLKKNVVVVTQK----SR---- 68 (149) Q Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~-~~----ak~~aP~~~g~l~~~i~~~~~~----~~---- 68 (149) |.+++.++|++++.+.|++|++... ++...|+...|..-. .. +....-...+.+..+......+ .. T Consensus 1 m~i~v~~~G~~~~~~~l~~l~~~~~-~a~~~AIN~ta~~~~~~~A~~~i~~~vn~k~~yv~~~~Rlti~k~As~~~L~A~ 79 (205) T protein:vir:63 1 MSIEIVAEGLGEFRDYVDRLPDISQ-QAAMIAINQTAQRTALPLARTEIGEQVNFPDNYLKDDSRLGVTKKATRNDLEAV 79 (205) T ss_pred CeeeeehhhHHHHHHHHHhcchhhh-HHHHHHHHHHHHHhhHHHHHHhhhhccccchhhhccceeeEEEeecCCCCeeEE Confidence 9999999999999999999998754 445556666544443 22 2222222233333222111000 00 Q ss_pred --------------ccCcccc-----ceeeecccccccccc-----eeEecC--CC---CCcceeeeec----------c Q lcl|NC_019769. 69 --------------RRGEISS-----GVHIRGVNPRTGNSD-----NTMKAN--NP---RNAFYWRFVE----------M 109 (149) Q Consensus 69 --------------~~~~~~~-----~i~~~~~~~~~~~~~-----~~~~~~--~~---~~~~y~~~~E----------~ 109 (149) .++.... +|.+. ++.|... ..+... .. 
++...+-++- - T Consensus 80 I~ar~rpt~LsRF~~p~~~~~~~r~~GVsV~---Vk~G~ak~l~gaF~~~lk~g~~l~e~~~~vgva~R~~~g~~~~~~~ 156 (205) T protein:vir:63 80 IGARQRPTSLARFAEPGQTTKSTRKGGVSVV---VKPGRTKQFKRGFLVRLRAGKTLTEDKYNLGLAVRLSPGETLHATD 156 (205) T ss_pred EecCCCcceeeeccCCCccccccccCCeEEE---EEcCCCeeccCceEEEeeccccccccccceEEEeeecCcccccccc Confidence 0111111 11111 1111110 000000 00 0000111111 1 Q ss_pred CccCCCCCc---chhHhHHHHHH----HHHHHHHHHHHHHHHHHhcC Q lcl|NC_019769. 110 GTANMTAHP---FIRPAFDVRQE----QATEVAIRRMNQAIDEALSK 149 (149) Q Consensus 110 GT~~~~a~P---Fl~pA~~~~k~----~~~~~~~~~~~~~l~k~~~k 149 (149) |-.+. +.+ +..|..++.-. .+...|.+.+.+++++-..+ T Consensus 157 g~~k~-~~~~k~LYGPSV~Qvf~~~~e~I~~~i~~~l~~~f~r~~~~ 202 (205) T protein:vir:63 157 GATKL-SNNVYLLYGPSVDQVFRTVADDITTEVLDALADEFLRQFTR 202 (205) T ss_pred Cceec-CCceEEEEcCcHHHHHhhhhhhhhHHHHHHHHHHHHHhhhh Confidence 11111 122 56666665443 33333333333333333333 No 166 >protein:vir:8106 Length: 150 # NCBI annotation: gp10 # Family: family:all:3937 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817687;genbank:gi:29566118;genbank:GeneID:1259312 Probab=91.38 E-value=0.00034 Score=39.59 Aligned_cols=117 Identities=18% Similarity=0.245 Sum_probs=56.2 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) ..++.+..+ +|.+-++.-++ + +.-+..-+.+- ...-++++.|+++|+++.|..+..+.. 
T Consensus 5 ~~KFGvS~~---e~~K~irns~E-V-~~GiNdFMe~~---A~~~aK~~SPV~~GeY~~S~~V~~ka~------------- 63 (150) T protein:vir:81 5 FEKFGVSDS---ELAKHIRNSAE-V-DAGINDFMENE---AIPYAKSISPVDDGEYAASWAVMKKAK------------- 63 (150) T ss_pred hhhhcCCHH---HHHHhhccchh-h-hhhHHHHHHhh---hhhhhhccCCcccchhHHHHHHHhhcc------------- Confidence 445555544 44444443332 2 22223333322 223468899999999998875532110 Q ss_pred cccccccccceeEecCCCCCcceeeeeccCccC---C------------------CCCcchh--HhHHHHHHHHHHHHHH Q lcl|NC_019769. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVEMGTAN---M------------------TAHPFIR--PAFDVRQEQATEVAIR 137 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~---~------------------~a~PFl~--pA~~~~k~~~~~~~~~ 137 (149) . ..+......||+||+||||-- | .---|-| |----..+-+.+.+.. T Consensus 64 -------N----GRG~~G~~~~~AH~VEFGtgadkkqgrgkkgkrgkdgkrtveiddgefrrvgpdtptkaqgiaqkvas 132 (150) T protein:vir:81 64 -------N----GRGVFGPKAWYAHFVEFGTGADKKQGRGKKGKRGKDGKRTVEIDDGEFRRVGPDTPTKAQGIAQKVAS 132 (150) T ss_pred -------c----CccccCccchhhhhhhhccccccccccccccccCcccceeeeecCccceecCCCCchhhhhHHHHHHH Confidence 0 011224568999999999841 1 1111221 1111222334444444 Q ss_pred HHHHHHHHHhcC Q lcl|NC_019769. 138 RMNQAIDEALSK 149 (149) Q Consensus 138 ~~~~~l~k~~~k 149 (149) .+.-.|+--++| T Consensus 133 hfggslkggisk 144 (150) T protein:vir:81 133 HFGGSLKGGISK 144 (150) T ss_pred hccccccccccc Confidence 444444444444 No 167 >protein:vir:79555 Length: 192 # NCBI annotation: putative tail component # Family: family:all:869 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272521;genbank:gi:148609390;genbank:GeneID:5204391 Probab=91.30 E-value=0.011 Score=31.39 Aligned_cols=132 Identities=14% Similarity=0.215 Sum_probs=50.9 Q ss_pred hHhHHHHHHHHHHhHHHHHHHHHHHH--------HHHHHHHHHHHHHH------hCCcCC-------------Ccccccc Q lcl|NC_019769. 
8 FSGLNDIAKDLEALSRAENNKVLRDA--------TRAGAEVLKEEVIA------RAPVRT-------------GKLKKNV 60 (149) Q Consensus 8 i~Gl~~l~~~l~~l~~~~~~k~~~~a--------l~~~a~~v~~~ak~------~aP~~~-------------g~l~~~i 60 (149) |+||+++++.|+.|+.....++...| +..++.-+..+... .+|... |.+...| T Consensus 1 ~kgl~~a~~nl~~l~~~~vp~A~~~ainrva~ra~~~t~~~v~~~~~~~~~~~~~I~~k~iR~R~r~~ka~~~~~~~~~I 80 (192) T protein:vir:79 1 MKGLENAIRNLNSLDTRMVPQASAWAINRVAQKAVSVATRQVAGNTVAGDNQVKGIPLKLVRQRVRVFKASPSGKMTARI 80 (192) T ss_pred CchHHHHHHHHHhcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhcCcHHHHHhhhhcccccCCCceEEEE Confidence 99999999999998876545544444 44455555554321 122110 1111111 Q ss_pred eeccccccccCccccceeeeccc----------ccccccceeEec-------CCCC--Ccceee-eec-cCccCCC---- Q lcl|NC_019769. 61 VVVTQKSRRRGEISSGVHIRGVN----------PRTGNSDNTMKA-------NNPR--NAFYWR-FVE-MGTANMT---- 115 (149) Q Consensus 61 ~~~~~~~~~~~~~~~~i~~~~~~----------~~~~~~~~~~~~-------~~~~--~~~y~~-~~E-~GT~~~~---- 115 (149) .... .++.+.... ...+.....+.+ ..-. ...+|| |.- -|...-| T Consensus 81 ~v~~----------~~l~ai~lg~~r~r~~rr~~~~~~~~s~~~vGk~~f~gaFia~m~ngr~~V~~R~~gk~R~PIevv 150 (192) T protein:vir:79 81 RVNR----------GNLPAIKLGTARVRLARRGGKLQYRGSVLKVGKYLFRDAFIQQLANGRWHVMRRIDGKNRYPIDVV 150 (192) T ss_pred EEec----------CceeeeeecccccccccccccccccccceEEcceecCchhccccCCCCccceEecCCCccCCeeeE Confidence 1000 011110000 000000001111 1100 111222 222 2432222 Q ss_pred CCcchhHhHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019769. 116 AHPFIRPAFDVRQEQATEVAIRRMNQAIDEALSK 149 (149) Q Consensus 116 a~PFl~pA~~~~k~~~~~~~~~~~~~~l~k~~~k 149 (149) -=|.-.|.-+.-..++...+.+++.++|..+|+. 
T Consensus 151 kIpis~~l~~af~~e~~r~~~~~~~~el~~~L~~ 184 (192) T protein:vir:79 151 KIPLSGPLTQAFEDARDRIIAAEMPKQLGYALKQ 184 (192) T ss_pred eechHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1233444444344455555555555555555554 No 168 >protein:vir:102190 Length: 93 # NCBI annotation: gp21 # Family: family:all:2713 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655217;genbank:gi:109522797;genbank:GeneID:4157429 Probab=86.02 E-value=0.011 Score=31.41 Aligned_cols=91 Identities=12% Similarity=0.151 Sum_probs=52.0 Q ss_pred HHHHHHHHHHHHHHHHHHhCCc--CCCcccccceeccccccccCccccceeeecccccccccceeEecCCCCCcceeeee Q lcl|NC_019769. 30 LRDATRAGAEVLKEEVIARAPV--RTGKLKKNVVVVTQKSRRRGEISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFV 107 (149) Q Consensus 30 ~~~al~~~a~~v~~~ak~~aP~--~~g~l~~~i~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~ 107 (149) +......+|..+..+||.+||= .||+-+..|.... ...|... .......+..|.-|+ T Consensus 1 ~~~~~d~aa~~le~~aK~nApW~DRTg~AR~~l~~~~-------------------~~~g~~~--~~i~lsh~v~Yg~~L 59 (93) T protein:vir:10 1 MKATSNYHAVEGTAHMKEHAPWTDRTGAARAGLHAVA-------------------STPQPDR--YEIVFAHTVHYGIWL 59 (93) T ss_pred CchhhhHHHHHHHHHHhcCCCccccchhhhhhhcccc-------------------cccCCce--EEEEEecCeeccceE Confidence 3445566788899999999993 3454443332110 0011111 111122346789999 Q ss_pred ccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 
108 EMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAID 144 (149) Q Consensus 108 E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~ 144 (149) |.++.+.++ .|+|+.+.--+++++-+..-+.+ |+ T Consensus 60 E~a~~~kya--Il~Ptv~~~~~~i~~g~~~ll~~-l~ 93 (93) T protein:vir:10 60 EIANSGRYE--IIMPTVHHEGKLMAQRLRGLLGR-LR 93 (93) T ss_pred EeecCCCcc--chhhhHHHHHHHHHHHHHHHHHh-cC Confidence 999887765 68888877666666555443332 12 No 169 >protein:vir:102608 Length: 108 # NCBI annotation: gp9 # Family: family:all:3937 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655005;genbank:gi:109392195;genbank:GeneID:4157230 Probab=73.43 E-value=0.0081 Score=32.04 Aligned_cols=90 Identities=18% Similarity=0.249 Sum_probs=49.8 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) ++++.+.+. .+.+|++ + ...+.+=-..+...-+.+.|+++|++++|+.+..... T Consensus 11 lakfgi~ld-------dfdklpe-v-----nqgvnef~dev~aawk~nspv~~g~yrdsvqvterst------------- 64 (108) T protein:vir:10 11 LAKFGVRLD-------DFDKLPE-V-----NQGVNEFIDEVVAAWKNNSPVGTGAYRDSVQVTERST------------- 64 (108) T ss_pred hhhhccchh-------hhhccch-h-----hhhHHHHHHHHHHhhhcCCCccccccccceeeccccc------------- Confidence 445555443 3344553 2 1223333444666678999999999999886532110 Q ss_pred cccccccccceeEecCCCCCcceeeeeccCccCC----CC----CcchhHhHHH Q lcl|NC_019769. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVEMGTANM----TA----HPFIRPAFDV 126 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~----~a----~PFl~pA~~~ 126 (149) .+|.. ....+-+-+|++|||..+. |+ ..|=..||+. 
T Consensus 65 ----nkgrg------kvgatdpqahlvefgs~hndeyapaqktakqfggtay~d 108 (108) T protein:vir:10 65 ----NKGRG------KVGATDPQAHLVEFGSAHNDEYAPAQKTAKQFGGTAYGD 108 (108) T ss_pred ----ccccc------cccCcchhhhhhhhhccccccccchhhhHHhhcccccCC Confidence 00110 1122346689999998763 33 3466666655 No 170 >protein:vir:105825 Length: 108 # NCBI annotation: gp9 # Family: family:all:3937 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655770;genbank:gi:109522093;genbank:GeneID:4157633 Probab=73.43 E-value=0.0081 Score=32.04 Aligned_cols=90 Identities=18% Similarity=0.249 Sum_probs=49.8 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) ++++.+.+. .+.+|++ + ...+.+=-..+...-+.+.|+++|++++|+.+..... T Consensus 11 lakfgi~ld-------dfdklpe-v-----nqgvnef~dev~aawk~nspv~~g~yrdsvqvterst------------- 64 (108) T protein:vir:10 11 LAKFGVRLD-------DFDKLPE-V-----NQGVNEFIDEVVAAWKNNSPVGTGAYRDSVQVTERST------------- 64 (108) T ss_pred hhhhccchh-------hhhccch-h-----hhhHHHHHHHHHHhhhcCCCccccccccceeeccccc------------- Confidence 445555443 3344553 2 1223333444666678999999999999886532110 Q ss_pred cccccccccceeEecCCCCCcceeeeeccCccCC----CC----CcchhHhHHH Q lcl|NC_019769. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVEMGTANM----TA----HPFIRPAFDV 126 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~----~a----~PFl~pA~~~ 126 (149) .+|.. ....+-+-+|++|||..+. |+ ..|=..||+. 
T Consensus 65 ----nkgrg------kvgatdpqahlvefgs~hndeyapaqktakqfggtay~d 108 (108) T protein:vir:10 65 ----NKGRG------KVGATDPQAHLVEFGSAHNDEYAPAQKTAKQFGGTAYGD 108 (108) T ss_pred ----ccccc------cccCcchhhhhhhhhccccccccchhhhHHhhcccccCC Confidence 00110 1122346689999998763 33 3466666655 No 171 >protein:vir:4200 Length: 133 # NCBI annotation: unknown # Family: family:all:11764 # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071825;genbank:gi:11863108;genbank:GeneID:1257610 Probab=66.31 E-value=0.094 Score=26.23 Aligned_cols=128 Identities=15% Similarity=0.234 Sum_probs=64.3 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) |. .+.|+--|.|+.+-.+..+. + ...+..-...++.-+...||+.+|.|+.|...+.. ...|.+...++.. T Consensus 1 mi--~i~idkp~almek~~ev~~~----i-e~t~~~~~~~l~~i~~ntapiktg~lr~sh~~sie--gstgelsn~~~yl 71 (133) T protein:vir:42 1 MI--EIRIDKPDALMEKPHEVQGK----I-EETLEKILNQLQGIAENTAPVKTGNLRDSHIISIE--GSTGELSNLAYYL 71 (133) T ss_pred Ce--eeecCCchhhhcchhhhhhH----H-HHHHHHHHHHHHHHhhhccccccccceeeeeEEee--cCccchhhhhHHh Confidence 54 44455557777766555444 3 22334444456777778899999999987665432 3344444433221 Q ss_pred cccccccccceeEecCCCCCcceeeeeccCc---cCCCCCcchhH--hHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVEMGT---ANMTAHPFIRP--AFDVRQEQATEVAIRRMNQ 141 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT---~~~~a~PFl~p--A~~~~k~~~~~~~~~~~~~ 141 (149) ..-- -| ...+......+.||--+..-. +..||+.||.- ||...+.-+.+.+.+-++. 
T Consensus 72 ~~vl-~g---rgwvfpv~~kal~wpelphpvayarpappndyfsa~vay~~~~give~s~iewlre 133 (133) T protein:vir:42 72 PFVL-HG---RGWVFPVRRKALWWPELPHPVAYARPAPPNDYFSAVVAYSAPEGVVEETLIEWLRE 133 (133) T ss_pred hHhh-hc---ccceeeccccccccCCCCCcccccCCCCCchhhhhhhhhhcccchhHHHHHHHHhC Confidence 1100 00 011222233344442222111 23455557765 4555556666666666655 No 172 >protein:vir:78894 Length: 105 # NCBI annotation: gp10 # Family: family:all:29989 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468850;genbank:gi:157325424;genbank:GeneID:5601891 Probab=53.16 E-value=0.067 Score=27.03 Aligned_cols=101 Identities=14% Similarity=0.082 Sum_probs=49.6 Q ss_pred cc-eeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHH---HHHHHHHHhCCcCCCcccccceeccccccccCccccce Q lcl|NC_019769. 2 IE-TSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAE---VLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGV 77 (149) Q Consensus 2 m~-~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~---~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i 77 (149) |+ .+|+.. +.+-+.+.+|..+++ .+...+...+|.++|.|++|...... T Consensus 1 ~~f~~f~~~---------------~~k~l~kr~L~~~g~vq~EvlR~~~PyvP~~tG~Lk~S~~l~tv------------ 53 (105) T protein:vir:78 1 MSFSSFKDA---------------VIDDIHNKALSTAAKAGGELVELAQPVTPILYGDLRRSSYFKII------------ 53 (105) T ss_pred CCcccccch---------------HHHHHHHhcCCCCchhhHHHHHHhCCCCccccccccccccccee------------ Confidence 33 233322 222223333333332 23345556788999999987432110 Q ss_pred eeecccccccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 78 HIRGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQ 141 (149) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~ 141 (149) .....+++....-++|++.+=|... 
...-|+..+....++.+.+.+...++= T Consensus 54 ----------Igsg~I~y~~~~~aPYAr~qYYe~~--Rg~~WfErm~a~hk~~I~~~vegg~~~ 105 (105) T protein:vir:78 54 ----------IQKNSIVARVFSLTPYARRQYYENR--RNPRWYEMAVSYGIQSINQIVEGGMRL 105 (105) T ss_pred ----------ecCCeeEeeccccCchhhhhhhccc--CCCchhHHhhhcchhHHHHHHhcccCC Confidence 0111223322334788888877543 333488888877776644444322211 No 173 >protein:vir:4162 Length: 133 # NCBI annotation: unknown # Family: family:all:11764 # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046971;genbank:gi:9630541;genbank:GeneID:1261715 Probab=49.07 E-value=0.31 Score=23.35 Aligned_cols=128 Identities=15% Similarity=0.201 Sum_probs=61.7 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeee Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~ 80 (149) |. .+.|+--|.|+.+-.+..+. + ...+..-...++.-+...||+.+|.|+.|...+.. ...|.+...++.. T Consensus 1 mi--~i~idkp~almek~~ev~~~----i-e~t~~~~~~~l~~i~~ntapiktg~lr~sh~~sie--gstgelsn~~~yl 71 (133) T protein:vir:41 1 MI--RINIDKPEALMEKASEVEDR----V-EQTVTLLMIELEEILMNTAPIKTGELRISHTWSVE--GSTGELTNTVPYL 71 (133) T ss_pred Ce--eeecCCchhhhcchhhhhhH----H-HHHHHHHHHHHHHHhhhccccccccceeeeeEEee--cCccchhhhhHHh Confidence 54 44455557777766555444 3 22334444456777778899999999987665432 3344444433221 Q ss_pred cccccccccceeEecCCCCCcceeeeeccCc---cCCCCCcchhHh--HHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVEMGT---ANMTAHPFIRPA--FDVRQEQATEVAIRRMNQ 141 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT---~~~~a~PFl~pA--~~~~k~~~~~~~~~~~~~ 141 (149) .--- -| ...+......+.||--+..-. +..||+.||.-+ |...+.-+.+.+.+-+-. 
T Consensus 72 ~~vl-~g---rgwvfpv~~kal~wpelphpvayarpappndyfsa~vay~~~~give~s~iewlis 133 (133) T protein:vir:41 72 QWVL-FG---RGWVFPVEKKALYWPELPHPVAYARPAPPNDYFSAAVAYIDAKGIVEDSFIEWLIS 133 (133) T ss_pred hHhh-hc---ccceeeecccccccCCCCCcccccCCCCCchhhhhhhhhhcccchhHHHHHHHhcC Confidence 1100 00 011222223344442222111 224555576653 444455555555444433 No 174 >protein:vir:4460 Length: 170 # NCBI annotation: hypothetical protein # Family: family:all:2152 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700383;genbank:gi:23505455;genbank:GeneID:955662 Probab=48.04 E-value=0.46 Score=22.43 Aligned_cols=131 Identities=21% Similarity=0.343 Sum_probs=74.2 Q ss_pred Ccce---eeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhC-----------C-cCCCcccccceeccc Q lcl|NC_019769. 1 MIET---SLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARA-----------P-VRTGKLKKNVVVVTQ 65 (149) Q Consensus 1 Mm~~---~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~a-----------P-~~~g~l~~~i~~~~~ 65 (149) |.+. -|++.-.++ +- .....++.|..+.+++.-.+|+.++ | ..||.|..||..... T Consensus 1 M~~~~~lHvdF~qp~~-------~~--Fnr~r~RraF~~iGq~h~r~Arrlvm~RGrs~pGe~P~~~TGrLa~SIgy~Vp 71 (170) T protein:vir:44 1 MPQKAYLHVDFVQPEE-------LV--FNRARMRRAFVKIGQVHMRDARRLVMKRGRSKPGENPSYRTGQLARSIGYYVP 71 (170) T ss_pred CCCCceeEEeeecCCc-------ee--ecHHHHHHHHHHHhHHHHHHHHHHHHHhcCCCCCCCCcchhhhhhhhhhhccc Confidence 6663 333332232 22 2234557888888888888887543 2 246777777754322 Q ss_pred cc--cccCccccceeeecccccccccceeEecCCCCCcceeeeeccCccC-------------------CCC-CcchhHh Q lcl|NC_019769. 66 KS--RRRGEISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTAN-------------------MTA-HPFIRPA 123 (149) Q Consensus 66 ~~--~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~-------------------~~a-~PFl~pA 123 (149) +. ..+|-+. .+.+ +-+.|.....+ .-.||=.|+-||... 
..| .-||.-+ T Consensus 72 ras~~rpG~mV---kIaP-Nqk~G~g~r~i-----~g~fYPafL~YGVr~gakr~k~hhr~a~ggsgwriaPR~Nym~~~ 142 (170) T protein:vir:44 72 RASKKRPGLMV---KIAP-NQKNGEGNRHI-----NGAFYPAFLFYGVRRGAKRKKGHHRGASGGSGWRVEPRNNYMTEV 142 (170) T ss_pred cccCCCCceeE---EecC-CCCCCCCcccc-----ccccchhhhhhhhhcccccchhhcccccCCCcceeccchhHHHHH Confidence 22 2222111 1111 11111111111 124888888888531 233 4699999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019769. 124 FDVRQEQATEVAIRRMNQAIDEALSK 149 (149) Q Consensus 124 ~~~~k~~~~~~~~~~~~~~l~k~~~k 149 (149) ++..+......+..+|++.|+-.-.| T Consensus 143 l~~~~~wt~~~L~r~L~~sLrp~~r~ 168 (170) T protein:vir:44 143 LDKRRSWTRYVLSRELRKSLRPQRRK 168 (170) T ss_pred HHhhHHHHHHHHHHHHHHhcCccccc Confidence 99999999999999999888755444 No 175 >protein:vir:487 Length: 187 # NCBI annotation: hypothetical protein # Family: family:all:2152 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543094;swissprot:trembl:q8w625;genbank:gi:18249906;uniprot:Q8W625;genbank:GeneID:929690 Probab=44.45 E-value=0.42 Score=22.64 Aligned_cols=133 Identities=21% Similarity=0.339 Sum_probs=72.7 Q ss_pred Ccce---eeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhC-----------C-cCCCcccccceeccc Q lcl|NC_019769. 1 MIET---SLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARA-----------P-VRTGKLKKNVVVVTQ 65 (149) Q Consensus 1 Mm~~---~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~a-----------P-~~~g~l~~~i~~~~~ 65 (149) |.+. -|++.- .+++- .....++.|..+.+++.-.+|+.++ | ..||.|..||..... T Consensus 14 m~~~~~lHvdF~q-------p~~~~--Fnr~riRraF~~iGq~h~r~ArrLvm~RGrs~pge~P~~qTGrLa~SIgy~Vp 84 (187) T protein:vir:48 14 MNQTAFLHVDFKQ-------PKELE--FNRARLRRAFVQIGRVYMRDARRLVIKRGRSGPGENPGYQTGRLARSIGYYVP 84 (187) T ss_pred hhhccceeEeeec-------CCcee--ecHHHHHHHHHHHhHHHHHHHHHHHHhcccCCCCCCCcchhhhhhhhhhhccc Confidence 4332 222221 22221 2234567888888888888887664 2 236777777754322 Q ss_pred --cccccCccccceeeecccccccccceeEecCCCCCcceeeeeccCccC---------------------CCC-Ccchh Q lcl|NC_019769. 
66 --KSRRRGEISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTAN---------------------MTA-HPFIR 121 (149) Q Consensus 66 --~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~---------------------~~a-~PFl~ 121 (149) ....+|-+. . +.+ +-+.|....... -.-.||=.|+-||... ..| .-||. T Consensus 85 kat~~RpG~mV-k--IaP-Nqk~G~g~r~~P---i~gdfYPafL~YGVr~ga~~~~~~~k~~~~~~~sgwriaPR~Nym~ 157 (187) T protein:vir:48 85 KKTTRRPGLMV-K--ISP-NQKNGQGNRRFP---EGAPYYPAFLYYGVRHSAYGMDKKDKRQKKHHSSTFRLAPRNNFMA 157 (187) T ss_pred cccCCCCcceE-E--ecC-CcccCccccccc---ccccchhHHHHhhhhhhhhccchhhhhhhcccCCcceeccchhHHH Confidence 111222111 1 111 111111111100 1125888888888531 233 46999 Q ss_pred HhHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019769. 122 PAFDVRQEQATEVAIRRMNQAIDEALSK 149 (149) Q Consensus 122 pA~~~~k~~~~~~~~~~~~~~l~k~~~k 149 (149) -+++..+......+..+|++.|+-.-.| T Consensus 158 ~~L~~~~~wt~~~L~raL~~sLrp~~r~ 185 (187) T protein:vir:48 158 DVIERRRHWTQELLSRELQRSLRPVKRK 185 (187) T ss_pred HHHHhhHHHHHHHHHHHHHHhcCccccc Confidence 9999999999999999999888755444 No 176 >protein:vir:101654 Length: 126 # NCBI annotation: gp17 # Family: family:all:11115 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654772;genbank:gi:109302770;genbank:GeneID:4156088 Probab=40.86 E-value=0.15 Score=25.17 Aligned_cols=118 Identities=12% Similarity=0.209 Sum_probs=48.6 Q ss_pred hHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHH---hCCc--CCCcccccceeccccccccCccccceeeecc Q lcl|NC_019769. 8 FSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIA---RAPV--RTGKLKKNVVVVTQKSRRRGEISSGVHIRGV 82 (149) Q Consensus 8 i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~---~aP~--~~g~l~~~i~~~~~~~~~~~~~~~~i~~~~~ 82 (149) ++||-.|+.+-+-...-...-++-..|-+-|+.+++--.. ..|. +.-+..++.. 
...+|.....|.+.++ T Consensus 1 mkglgnliskadiaaaiatspavhagliakakevqeywveywnsiphphsrthtlksgy-----venpgdyaksirvsfi 75 (126) T protein:vir:10 1 MKGLGNLISKADIAAAIATSPAVHAGLIAKAKEVQEYWVEYWNSIPHPHSRTHTLKSGY-----VENPGDYAKSIRVSFI 75 (126) T ss_pred CcchhhhhhhhhhhhhhhcccchhhhhhhhhHHHHHHHHHHhhcCCCcccccccccccc-----ccCchhhhhhhheeee Confidence 6777777765443221111112223333334444433221 1221 1111111111 1234555555666666 Q ss_pred cccccccceeEecCCCCCcceeeeeccCccCCCC---CcchhHhHHHHHHHHHHH Q lcl|NC_019769. 83 NPRTGNSDNTMKANNPRNAFYWRFVEMGTANMTA---HPFIRPAFDVRQEQATEV 134 (149) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a---~PFl~pA~~~~k~~~~~~ 134 (149) +.+.|-....+.. +.+-.+|+|||..+||. |-.----|+-...-.+.+ T Consensus 76 ksksglpkarvma----tdykswwieygakhmpefaprahtlahfegggattvsa 126 (126) T protein:vir:10 76 KSKSGLPKARVMA----TDYKSWWIEYGAKHMPEFAPRAHTLAHFEGGGATTVSA 126 (126) T ss_pred ecccCCcccceeh----hhhhHHHHhhhhhhcccccccchhhhhccCCccccccC Confidence 6555554443332 23456789999999973 221111111111111111 No 177 >protein:vir:7859 Length: 126 # NCBI annotation: gp16 # Family: family:all:11115 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817466;genbank:gi:29565895;genbank:GeneID:1259088 Probab=40.86 E-value=0.15 Score=25.17 Aligned_cols=118 Identities=12% Similarity=0.209 Sum_probs=48.6 Q ss_pred hHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHH---hCCc--CCCcccccceeccccccccCccccceeeecc Q lcl|NC_019769. 8 FSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIA---RAPV--RTGKLKKNVVVVTQKSRRRGEISSGVHIRGV 82 (149) Q Consensus 8 i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~---~aP~--~~g~l~~~i~~~~~~~~~~~~~~~~i~~~~~ 82 (149) ++||-.|+.+-+-...-...-++-..|-+-|+.+++--.. ..|. +.-+..++.. 
...+|.....|.+.++ T Consensus 1 mkglgnliskadiaaaiatspavhagliakakevqeywveywnsiphphsrthtlksgy-----venpgdyaksirvsfi 75 (126) T protein:vir:78 1 MKGLGNLISKADIAAAIATSPAVHAGLIAKAKEVQEYWVEYWNSIPHPHSRTHTLKSGY-----VENPGDYAKSIRVSFI 75 (126) T ss_pred CcchhhhhhhhhhhhhhhcccchhhhhhhhhHHHHHHHHHHhhcCCCcccccccccccc-----ccCchhhhhhhheeee Confidence 6777777765443221111112223333334444433221 1221 1111111111 1234555555666666 Q ss_pred cccccccceeEecCCCCCcceeeeeccCccCCCC---CcchhHhHHHHHHHHHHH Q lcl|NC_019769. 83 NPRTGNSDNTMKANNPRNAFYWRFVEMGTANMTA---HPFIRPAFDVRQEQATEV 134 (149) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a---~PFl~pA~~~~k~~~~~~ 134 (149) +.+.|-....+.. +.+-.+|+|||..+||. |-.----|+-...-.+.+ T Consensus 76 ksksglpkarvma----tdykswwieygakhmpefaprahtlahfegggattvsa 126 (126) T protein:vir:78 76 KSKSGLPKARVMA----TDYKSWWIEYGAKHMPEFAPRAHTLAHFEGGGATTVSA 126 (126) T ss_pred ecccCCcccceeh----hhhhHHHHhhhhhhcccccccchhhhhccCCccccccC Confidence 6555554443332 23456789999999973 221111111111111111 No 178 >protein:vir:79034 Length: 141 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110729;genbank:gi:134287346;genbank:GeneID:4955208 Probab=31.43 E-value=1.5 Score=19.62 Aligned_cols=126 Identities=10% Similarity=0.048 Sum_probs=55.8 Q ss_pred CcceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHH-------HHhCCcCCCcccccceeccccccccCcc Q lcl|NC_019769. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEV-------IARAPVRTGKLKKNVVVVTQKSRRRGEI 73 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~a-------k~~aP~~~g~l~~~i~~~~~~~~~~~~~ 73 (149) +...+=-.+-|.++.. ..++.. .+++++.+....-..+.+.+ +..- ..|...+...+... 
.+.+ T Consensus 9 ~~gl~~~~~~l~~~~~--~~~~~~-~~~~~~~~a~~l~~~vk~~tPVdTG~Lr~sw--~~~~~~~~~~~~~~----g~~~ 79 (141) T protein:vir:79 9 FREFKRVCKKMEKLTK--IDLDKF-CKDAARELAARLLGKVIRRTPVDTGFLRQGW--NGVAYARSLPVYKQ----GNNY 79 (141) T ss_pred HHHHHHHHHHHHHHhH--HHHHHH-HHHHHHHHHHHHHHHHHHhCCCcchhhcccc--cccccccccceeec----CCee Confidence 2221111111222211 012221 12233333332233333332 2110 11111111111100 1111 Q ss_pred ccceeeecccccccccceeEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019769. 74 SSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAIDEALSK 149 (149) Q Consensus 74 ~~~i~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~k~~~k 149 (149) . +.+ .+ . . .+-.--.|.|..--|....|++++|+.+.+..+..+.+.+.+.|++.|++.+.- T Consensus 80 ~--v~v--~n-~-~--------~YA~~VE~Ghr~~~~~gfV~G~fml~~s~~~~~~~~~~~~~~~l~~~l~~~~~~ 141 (141) T protein:vir:79 80 I--IEV--VN-P-T--------EYASYVNFGHRTKDGKGWVKGQHFLTISEMELQSQVDKIIEKKLLILLKGVFDA 141 (141) T ss_pred E--EEE--ec-C-C--------cchhhhhcceeecCCcceeCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 1 000 00 0 0 011112344455555567899999999999999999999999999999999999 No 179 >protein:vir:5745 Length: 135 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892056;genbank:gi:33770519;interpro:IPR010064;interpro:IPR011693;uniprot:Q7Y404;genbank:GeneID:2637451 Probab=28.34 E-value=1.8 Score=19.24 Aligned_cols=120 Identities=15% Similarity=0.145 Sum_probs=53.3 Q ss_pred eehH-hHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCcccccceeccccccccCccccceeeecccc Q lcl|NC_019769. 6 LDFS-GLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVNP 84 (149) Q Consensus 6 ~~i~-Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~i~~~~~~~ 84 (149) |.++ -++-|..-++.|.. +..++.+++++.+...-.+.++..+ +.+.... ....+|++.++|.+. . 
T Consensus 1 M~~~~~i~Gl~el~~~l~~-L~~~~~~k~~~~Al~~~a~~v~~~~-------k~~ap~~--~~~~~g~l~~~I~i~---~ 67 (135) T protein:vir:57 1 MIPEIEISGLQELERRLIA-VGEEVGTKILRDAGRAAMAVVEADM-------KQNAGYD--NSSTNAHMRDSIKIR---S 67 (135) T ss_pred CceeeeehhHHHHHHHHHH-hHHHHHHHHHHHHHHHHHHHHHHHH-------HHhCCCC--CCCchhhHHhhcccc---c Confidence 5553 11223333333332 2233334444554444444444433 2222111 122345666655421 1 Q ss_pred cccccceeEecCCCCCcceeeeeccCccC-----------CCCCcchhHhHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019769. 85 RTGNSDNTMKANNPRNAFYWRFVEMGTAN-----------MTAHPFIRPAFDVRQEQATEVAIRRMNQAIDEALSK 149 (149) Q Consensus 85 ~~~~~~~~~~~~~~~~~~y~~~~E~GT~~-----------~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~k~~~k 149 (149) .... ..+ ....+..|..+ =..+-=-+|=+.-.-++..+.+.+.+.++|++.|.| T Consensus 68 ~k~~---------~~~--~~v~v~vg~~~~~~~~~~f~E~GT~~~~a~PF~~pa~~~~~~~~~~~~~~~~~~~l~k 132 (135) T protein:vir:57 68 SRGK---------AGS--TVVVLRVGPTRSHYMKALAQEFGTIKQVAKPFIRPALDYNKMQVLRILTVEIRDGLST 132 (135) T ss_pred cccc---------ccc--eeEEEEecCCCCcceeEeecccCCCCCCCCcchhHhHHHhHHHHHHHHHHHHHHHHHH Confidence 0000 000 11112223221 111112357777888888888888888888888888 No 180 >protein:vir:3787 Length: 231 # NCBI annotation: orf22 # Family: family:all:743 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536827;genbank:gi:17981836;genbank:GeneID:929215 Probab=25.72 E-value=1.5 Score=19.64 Aligned_cols=139 Identities=15% Similarity=0.193 Sum_probs=49.1 Q ss_pred cce--eeehHhHHHHHHHHHHh--HHHHHHHHHHHHHHHHHHHHHHHHHHh-----CCcCCCccc-c----ccee----- Q lcl|NC_019769. 2 IET--SLDFSGLNDIAKDLEAL--SRAENNKVLRDATRAGAEVLKEEVIAR-----APVRTGKLK-K----NVVV----- 62 (149) Q Consensus 2 m~~--~~~i~Gl~~l~~~l~~l--~~~~~~k~~~~al~~~a~~v~~~ak~~-----aP~~~g~l~-~----~i~~----- 62 (149) |.+ +++.+.++.|.+.|.-| +..-. 
+.-|..-|..++..++++ .|..+..-- + .+.- T Consensus 1 m~~~~~~n~~dl~~l~~~L~ll~L~p~kR----rrLl~~iak~lr~~~~~rI~~Q~~PDGs~w~pRK~~~~k~k~~rm~~ 76 (231) T protein:vir:37 1 MQIRLGLKQEDLDAFVRDLRTLNLTGKQK----KKILTWTLGAIKRKSQKNIREQHSPDGTAWEKRKPVDGEIKNKRLLK 76 (231) T ss_pred CCccCCcCHHHHHHHHHHHHHhcCCHHHH----HHHHHHHHHHHHHHHHHHHHhhcCCCCCcCchhcccccchhhHHHHH Confidence 665 55556777777777655 33322 344555566677666655 344321100 0 0000 Q ss_pred ccccccc-cCccccceee---------------eccccccc----------ccce-------------eEecCCCC---- Q lcl|NC_019769. 63 VTQKSRR-RGEISSGVHI---------------RGVNPRTG----------NSDN-------------TMKANNPR---- 99 (149) Q Consensus 63 ~~~~~~~-~~~~~~~i~~---------------~~~~~~~~----------~~~~-------------~~~~~~~~---- 99 (149) .-..... .........+ .+.....+ ++.. .+.+...+ T Consensus 77 kL~~~~~~~~~~~~~~~~~~~~g~~~~IA~vHQ~G~~~rv~~~~~~~~~~~~~~~pATr~QAk~Lr~lGy~v~~~k~k~~ 156 (231) T protein:vir:37 77 KVLRYASILAEERGKGRIYYKNPLTGEIAQKQQDGFTEHFRVFATDKNKNGSGNDRATIRQAQKLRSLGYRKRNGKNRQG 156 (231) T ss_pred HhHHhhccccccCCceEEeeecchHHHHHHHhhcCcccccchhhhhhccCCCCCCCCCHHHHHHHHHhcccccCCCCCCC Confidence 0000000 0000000000 01100000 0000 00000000 Q ss_pred Ccce------e--------------eeecc----------CccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019769. 100 NAFY------W--------------RFVEM----------GTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAIDEALS 148 (149) Q Consensus 100 ~~~y------~--------------~~~E~----------GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l~k~~~ 148 (149) ..-| | +.++- =|...|++|||...-++..+.+...|. 
+.+.-..+ T Consensus 157 k~~~rkps~kwI~~~ls~~qAgliIR~L~~k~~~~~~k~~W~I~~paR~FLG~~~~e~~~~l~~~l~----~i~~~~~~ 231 (231) T protein:vir:37 157 KTKYRLYTIKEIRERLTRTWASMEIRRLENKVNAGNGKTNWEIHVPARPFLDTREKENVDILREITL----KFLSGEYK 231 (231) T ss_pred CCCcCcCCHHHHHHhhhhHHHHHHHHHHhcccccccCcceeeeecCcccccCCCHHHHHHHHHHHHH----HHhcccCC Confidence 0000 0 11111 124579999998776554433333333 22222222 No 181 >protein:vir:105007 Length: 146 # NCBI annotation: conserved phage protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459972;genbank:gi:85701387;genbank:GeneID:3882148 Probab=21.85 E-value=2.5 Score=18.37 Aligned_cols=129 Identities=11% Similarity=0.023 Sum_probs=56.8 Q ss_pred Cc-ceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhC----------CcCCCcccccceeccccccc Q lcl|NC_019769. 1 MI-ETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARA----------PVRTGKLKKNVVVVTQKSRR 69 (149) Q Consensus 1 Mm-~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~a----------P~~~g~l~~~i~~~~~~~~~ 69 (149) |- +++ =++-|..-+..|..-.+.+..++++.+...-.+.++..+-... -..+++++++|........ T Consensus 5 ~~~~i~-Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~- 82 (146) T protein:vir:10 5 IDLDLL-GFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAKLE- 82 (146) T ss_pred eeeeeh-hHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceecccccc- Confidence 32 221 1122333333444444445566666666666666666662211 1133555666654332211 Q ss_pred cCccccceeeecccccccccce--eEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 70 RGEISSGVHIRGVNPRTGNSDN--TMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAI 143 (149) Q Consensus 70 ~~~~~~~i~~~~~~~~~~~~~~--~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l 143 (149) .+... +.+.......+.... -+..+....+. 
-+|+ +|=+.-.-+...+.+.+.+.++|+++| T Consensus 83 ~g~~~--~~vg~~~~~~~~~~y~~f~E~GT~~~~a-~PFl---------~pa~~~~k~~~~~~~~~~l~~~l~ka~ 146 (146) T protein:vir:10 83 GGIKT--VKIGLNKADRSPWFYLKFHEWGTSKMPA-HPFI---------EPGFNASKAEAVRAMTDILKNEMRLDL 146 (146) T ss_pred cccee--EEeeeccCCCCCcceeeeeccCCCCCCC-Ccch---------hHHHHHhHHHHHHHHHHHHHHHHhhcC Confidence 12111 111111111111111 11111111111 1222 567777777788888888888888888 No 182 >protein:vir:102085 Length: 146 # NCBI annotation: head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512318;genbank:gi:89152487;genbank:GeneID:3953078 Probab=21.85 E-value=2.5 Score=18.37 Aligned_cols=129 Identities=11% Similarity=0.023 Sum_probs=56.8 Q ss_pred Cc-ceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhC----------CcCCCcccccceeccccccc Q lcl|NC_019769. 1 MI-ETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARA----------PVRTGKLKKNVVVVTQKSRR 69 (149) Q Consensus 1 Mm-~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~a----------P~~~g~l~~~i~~~~~~~~~ 69 (149) |- +++ =++-|..-+..|..-.+.+..++++.+...-.+.++..+-... -..+++++++|........ T Consensus 5 ~~~~i~-Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~- 82 (146) T protein:vir:10 5 IDLDLL-GFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAKLE- 82 (146) T ss_pred eeeeeh-hHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceecccccc- Confidence 32 221 1122333333444444445566666666666666666662211 1133555666654332211 Q ss_pred cCccccceeeecccccccccce--eEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 70 RGEISSGVHIRGVNPRTGNSDN--TMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAI 143 (149) Q Consensus 70 ~~~~~~~i~~~~~~~~~~~~~~--~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l 143 (149) .+... +.+.......+.... -+..+....+. 
-+|+ +|=+.-.-+...+.+.+.+.++|+++| T Consensus 83 ~g~~~--~~vg~~~~~~~~~~y~~f~E~GT~~~~a-~PFl---------~pa~~~~k~~~~~~~~~~l~~~l~ka~ 146 (146) T protein:vir:10 83 GGIKT--VKIGLNKADRSPWFYLKFHEWGTSKMPA-HPFI---------EPGFNASKAEAVRAMTDILKNEMRLDL 146 (146) T ss_pred cccee--EEeeeccCCCCCcceeeeeccCCCCCCC-Ccch---------hHHHHHhHHHHHHHHHHHHHHHHhhcC Confidence 12111 111111111111111 11111111111 1222 567777777788888888888888888 No 183 >protein:vir:102875 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338140;genbank:gi:77020200;genbank:GeneID:3703784 Probab=21.85 E-value=2.5 Score=18.37 Aligned_cols=129 Identities=11% Similarity=0.023 Sum_probs=56.8 Q ss_pred Cc-ceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhC----------CcCCCcccccceeccccccc Q lcl|NC_019769. 1 MI-ETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARA----------PVRTGKLKKNVVVVTQKSRR 69 (149) Q Consensus 1 Mm-~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~a----------P~~~g~l~~~i~~~~~~~~~ 69 (149) |- +++ =++-|..-+..|..-.+.+..++++.+...-.+.++..+-... -..+++++++|........ T Consensus 5 ~~~~i~-Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~- 82 (146) T protein:vir:10 5 IDLDLL-GFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAKLE- 82 (146) T ss_pred eeeeeh-hHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceecccccc- Confidence 32 221 1122333333444444445566666666666666666662211 1133555666654332211 Q ss_pred cCccccceeeecccccccccce--eEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 70 RGEISSGVHIRGVNPRTGNSDN--TMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAI 143 (149) Q Consensus 70 ~~~~~~~i~~~~~~~~~~~~~~--~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l 143 (149) .+... +.+.......+.... -+..+....+. 
-+|+ +|=+.-.-+...+.+.+.+.++|+++| T Consensus 83 ~g~~~--~~vg~~~~~~~~~~y~~f~E~GT~~~~a-~PFl---------~pa~~~~k~~~~~~~~~~l~~~l~ka~ 146 (146) T protein:vir:10 83 GGIKT--VKIGLNKADRSPWFYLKFHEWGTSKMPA-HPFI---------EPGFNASKAEAVRAMTDILKNEMRLDL 146 (146) T ss_pred cccee--EEeeeccCCCCCcceeeeeccCCCCCCC-Ccch---------hHHHHHhHHHHHHHHHHHHHHHHhhcC Confidence 12111 111111111111111 11111111111 1222 567777777788888888888888888 No 184 >protein:vir:107568 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338191;genbank:gi:77020147;genbank:GeneID:3703699 Probab=21.85 E-value=2.5 Score=18.37 Aligned_cols=129 Identities=11% Similarity=0.023 Sum_probs=56.8 Q ss_pred Cc-ceeeehHhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhC----------CcCCCcccccceeccccccc Q lcl|NC_019769. 1 MI-ETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARA----------PVRTGKLKKNVVVVTQKSRR 69 (149) Q Consensus 1 Mm-~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~a----------P~~~g~l~~~i~~~~~~~~~ 69 (149) |- +++ =++-|..-+..|..-.+.+..++++.+...-.+.++..+-... -..+++++++|........ T Consensus 5 ~~~~i~-Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~- 82 (146) T protein:vir:10 5 IDLDLL-GFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAKLE- 82 (146) T ss_pred eeeeeh-hHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceecccccc- Confidence 32 221 1122333333444444445566666666666666666662211 1133555666654332211 Q ss_pred cCccccceeeecccccccccce--eEecCCCCCcceeeeeccCccCCCCCcchhHhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019769. 70 RGEISSGVHIRGVNPRTGNSDN--TMKANNPRNAFYWRFVEMGTANMTAHPFIRPAFDVRQEQATEVAIRRMNQAI 143 (149) Q Consensus 70 ~~~~~~~i~~~~~~~~~~~~~~--~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~k~~~~~~~~~~~~~~l 143 (149) .+... +.+.......+.... -+..+....+. 
-+|+ +|=+.-.-+...+.+.+.+.++|+++| T Consensus 83 ~g~~~--~~vg~~~~~~~~~~y~~f~E~GT~~~~a-~PFl---------~pa~~~~k~~~~~~~~~~l~~~l~ka~ 146 (146) T protein:vir:10 83 GGIKT--VKIGLNKADRSPWFYLKFHEWGTSKMPA-HPFI---------EPGFNASKAEAVRAMTDILKNEMRLDL 146 (146) T ss_pred cccee--EEeeeccCCCCCcceeeeeccCCCCCCC-Ccch---------hHHHHHhHHHHHHHHHHHHHHHHhhcC Confidence 12111 111111111111111 11111111111 1222 567777777788888888888888888 Done!