Query lcl|NC_019933.2_cdsid_YP_007238082.1 [gene=G176_gp12] [protein=hypothetical protein] [protein_id=YP_007238082.1] [location=8227..8694]
Match_columns 155
No_of_seqs 103 out of 210
Neff 7.4
Searched_HMMs 1612
Date Thu Nov 7 17:35:02 2013
Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_12 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_12_vs_rec_db.hhr

No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 protein:vir:97088 Length: 157 100.0 1.1E-48 6.7E-52 283.5 16.1 155 1-155 1-156 (157)
2 protein:vir:93617 Length: 148 100.0 2.4E-35 1.5E-38 210.4 13.7 146 1-154 2-148 (148)
3 protein:vir:100243 Length: 140 100.0 1.1E-34 6.5E-38 206.9 14.0 131 1-155 1-139 (140)
4 protein:vir:5745 Length: 135 # 100.0 2.4E-34 1.5E-37 204.9 13.5 130 1-153 1-135 (135)
5 protein:vir:100075 Length: 140 100.0 2.7E-34 1.7E-37 204.6 13.5 138 1-155 1-139 (140)
6 protein:vir:1891 Length: 179 # 100.0 8.7E-34 5.4E-37 201.8 15.2 155 1-155 1-172 (179)
7 protein:vir:80362 Length: 140 100.0 2.2E-33 1.4E-36 199.6 13.9 138 1-155 1-139 (140)
8 protein:vir:194 Length: 149 # 100.0 3.8E-33 2.4E-36 198.3 14.1 147 1-154 2-149 (149)
9 protein:vir:1437 Length: 140 # 100.0 5.2E-33 3.2E-36 197.6 13.6 138 1-155 1-139 (140)
10 protein:vir:101594 Length: 173 100.0 6.3E-33 3.9E-36 197.1 12.9 141 5-151 1-173 (173)
11 protein:vir:4347 Length: 164 # 100.0 1.6E-32 9.8E-36 194.9 14.3 148 1-155 1-157 (164)
12 protein:vir:81067 Length: 119 100.0 2.5E-33 1.5E-36 199.3 7.4 117 39-155 1-118 (119)
13 protein:vir:10367 Length: 119 100.0 2.5E-33 1.6E-36 199.3 7.4 117 39-155 1-118 (119)
14 protein:vir:107568 Length: 146 100.0 1E-31 6.3E-35 190.5 12.9 129 1-152 1-146 (146)
15 protein:vir:102875 Length: 146 100.0 1E-31 6.3E-35 190.5 12.9 129 1-152 1-146 (146)
16 protein:vir:102085 Length: 146 100.0 1E-31 6.3E-35 190.5 12.9 129 1-152 1-146 (146)
17 protein:vir:105007 Length: 146 100.0 1E-31 6.3E-35 190.5 12.9 129 1-152 1-146 (146)
18 protein:vir:105089 Length: 133 100.0 8.8E-32 5.5E-35 190.8 12.4 126 1-151 2-133 (133)
19 protein:vir:1386 Length: 149 # 100.0 2E-31 1.2E-34 188.9 13.7 129 1-154 1-149 (149)
20 protein:vir:1273 Length: 127 # 100.0 1.1E-31 7.1E-35 190.2 12.1 124 1-149 1-127 (127)
21 protein:vir:106570 Length: 182 99.9 2.6E-30 1.6E-33 182.8 15.8 147 2-154 1-182 (182)
22 protein:vir:3873 Length: 128 # 99.9 7.5E-31 4.7E-34 185.7 11.5 122 1-149 1-128 (128)
23 protein:vir:94538 Length: 125 99.9 2.7E-30 1.6E-33 182.7 12.3 123 1-151 1-125 (125)
24 protein:vir:95789 Length: 114 99.9 5.3E-30 3.3E-33 181.1 12.4 114 1-149 1-114 (114)
25 protein:vir:81106 Length: 125 99.9 3E-29 1.9E-32 177.0 12.7 123 1-149 1-125 (125)
26 protein:vir:9414 Length: 125 # 99.9 3E-29 1.9E-32 177.0 12.7 123 1-149 1-125 (125)
27 protein:vir:4704 Length: 125 # 99.9 3E-29 1.9E-32 177.0 12.7 123 1-149 1-125 (125)
28 protein:vir:79988 Length: 125 99.9 3E-29 1.9E-32 177.0 12.7 123 1-149 1-125 (125)
29 protein:vir:98342 Length: 125 99.9 3E-29 1.9E-32 177.0 12.7 123 1-149 1-125 (125)
30 protein:vir:3617 Length: 112 # 99.9 2.6E-29 1.6E-32 177.3 11.7 112 1-145 1-112 (112)
31 protein:vir:9708 Length: 125 # 99.9 1E-28 6.2E-32 174.1 11.3 121 5-150 1-125 (125)
32 protein:vir:78858 Length: 115 99.9 1.1E-28 6.8E-32 173.9 11.4 109 5-145 1-115 (115)
33 protein:vir:9312 Length: 115 # 99.9 1.1E-28 6.8E-32 173.9 11.4 109 5-145 1-115 (115)
34 protein:vir:103917 Length: 115 99.9 1.1E-28 6.8E-32 173.9 11.4 109 5-145 1-115 (115)
35 protein:vir:96225 Length: 115 99.9 1.1E-28 6.8E-32 173.9 11.4 109 5-145 1-115 (115)
36 protein:vir:96358 Length: 115 99.9 1.1E-28 6.8E-32 173.9 11.4 109 5-145 1-115 (115)
37 protein:vir:97144 Length: 115 99.9 1.1E-28 6.8E-32 173.9 11.4 109 5-145 1-115 (115)
38 protein:vir:9930 Length: 108 # 99.9 1.7E-28 1E-31 172.9 11.5 108 7-146 1-108 (108)
39 protein:vir:106623 Length: 115 99.9 2E-28 1.3E-31 172.4 11.5 109 5-145 1-115 (115)
40 protein:vir:99744 Length: 115 99.9 3.2E-28 2E-31 171.3 11.2 109 5-145 1-115 (115)
41 protein:vir:5978 Length: 144 # 99.9 1.3E-27 7.8E-31 168.1 12.9 137 1-145 4-144 (144)
42 protein:vir:743 Length: 108 # 99.9 7.8E-28 4.8E-31 169.2 11.0 108 5-145 1-108 (108)
43 protein:vir:94796 Length: 137 99.9 1.3E-27 7.9E-31 168.0 12.2 129 5-141 1-137 (137)
44 protein:vir:107099 Length: 137 99.9 1.7E-27 1E-30 167.4 11.8 131 1-141 1-137 (137)
45 protein:vir:98409 Length: 108 99.9 1.7E-27 1E-30 167.4 10.8 108 5-145 1-108 (108)
46 protein:vir:94490 Length: 137 99.9 2.5E-27 1.5E-30 166.4 11.5 131 1-141 1-137 (137)
47 protein:vir:97427 Length: 137 99.9 2.5E-27 1.5E-30 166.4 11.5 131 1-141 1-137 (137)
48 protein:vir:93738 Length: 137 99.9 2.5E-27 1.5E-30 166.4 11.5 131 1-141 1-137 (137)
49 protein:vir:96486 Length: 112 99.9 3.9E-27 2.4E-30 165.4 11.0 110 1-145 1-112 (112)
50 protein:vir:105330 Length: 137 99.9 6.9E-27 4.3E-30 164.0 12.3 131 1-141 1-137 (137)
51 protein:vir:105916 Length: 149 99.9 4.6E-27 2.9E-30 165.0 11.3 131 1-141 13-149 (149)
52 protein:vir:95894 Length: 137 99.9 6.5E-27 4E-30 164.1 11.6 131 1-141 1-137 (137)
53 protein:vir:4906 Length: 114 # 99.9 5.5E-27 3.4E-30 164.5 10.3 112 1-146 1-114 (114)
54 protein:vir:2740 Length: 114 # 99.9 5.5E-27 3.4E-30 164.5 10.3 112 1-146 1-114 (114)
55 protein:vir:94108 Length: 149 99.9 7.3E-27 4.5E-30 163.9 10.9 131 1-141 13-149 (149)
56 protein:vir:94654 Length: 142 99.9 5.8E-26 3.6E-29 159.0 13.9 137 1-145 1-142 (142)
57 protein:vir:96829 Length: 135 99.9 4.4E-26 2.7E-29 159.6 12.1 130 1-141 1-135 (135)
58 protein:vir:96121 Length: 137 99.9 7.5E-26 4.7E-29 158.3 11.9 131 1-141 1-137 (137)
59 protein:vir:102154 Length: 119 99.9 6.2E-26 3.9E-29 158.8 9.1 118 1-149 1-119 (119)
60 protein:vir:99101 Length: 142 99.9 3.8E-25 2.3E-28 154.5 10.0 134 1-142 2-142 (142)
61 protein:vir:8669 Length: 142 # 99.9 3.8E-25 2.3E-28 154.5 10.0 134 1-142 2-142 (142)
62 protein:vir:81147 Length: 126 99.9 6.7E-25 4.1E-28 153.1 10.5 122 5-148 1-126 (126)
63 protein:vir:95062 Length: 116 99.8 4E-23 2.5E-26 143.4 9.1 110 24-141 1-116 (116)
64 protein:vir:1243 Length: 116 # 99.8 5.1E-23 3.2E-26 142.8 9.3 110 24-141 1-116 (116)
65 protein:vir:97327 Length: 116 99.8 5.1E-23 3.2E-26 142.8 9.3 110 24-141 1-116 (116)
66 protein:vir:78077 Length: 141 99.8 1.7E-22 1.1E-25 139.9 12.2 136 5-149 1-141 (141)
67 protein:vir:79034 Length: 141 99.8 3.9E-21 2.4E-24 132.5 11.7 128 1-155 1-138 (141)
68 protein:vir:105467 Length: 144 99.7 1.5E-20 9.3E-24 129.3 13.2 138 1-155 1-143 (144)
69 protein:vir:102441 Length: 137 99.7 9.1E-21 5.6E-24 130.5 9.3 129 1-140 1-137 (137)
70 protein:vir:106041 Length: 137 99.7 9.3E-21 5.8E-24 130.4 8.7 129 1-143 1-137 (137)
71 protein:vir:97982 Length: 140 99.7 1.1E-20 6.9E-24 130.0 6.9 129 1-139 1-140 (140)
72 protein:vir:107545 Length: 140 99.7 1.1E-20 6.9E-24 130.0 6.9 129 1-139 1-140 (140)
73 protein:vir:966 Length: 123 # 99.7 1.7E-19 1.1E-22 123.4 11.4 122 1-146 1-123 (123)
74 protein:vir:4956 Length: 153 # 99.6 1E-18 6.4E-22 119.2 9.8 123 5-155 1-137 (153)
75 protein:vir:100887 Length: 139 99.6 1.3E-18 8.3E-22 118.6 9.8 127 3-155 1-137 (139)
76 protein:vir:106506 Length: 137 99.6 1.4E-18 9E-22 118.4 7.6 130 1-145 1-137 (137)
77 protein:vir:5000 Length: 141 # 99.6 3.7E-18 2.3E-21 116.1 9.2 121 5-155 1-137 (141)
78 protein:vir:102963 Length: 163 99.6 4.5E-17 2.8E-20 110.2 12.2 132 1-155 1-161 (163)
79 protein:vir:4859 Length: 140 # 99.5 2.3E-17 1.4E-20 111.8 10.2 121 5-155 1-137 (140)
80 protein:vir:95372 Length: 124 99.5 6.5E-17 4E-20 109.3 10.8 116 5-146 1-124 (124)
81 protein:vir:9879 Length: 127 # 99.5 6.6E-17 4.1E-20 109.3 9.3 117 7-146 1-127 (127)
82 protein:vir:100223 Length: 139 99.5 7.9E-17 4.9E-20 108.9 8.9 125 3-155 1-137 (139)
83 protein:vir:99528 Length: 92 # 99.5 1.3E-16 7.9E-20 107.7 9.1 91 1-121 1-92 (92)
84 protein:vir:4833 Length: 140 # 99.5 3.5E-16 2.2E-19 105.3 9.8 120 5-155 1-137 (140)
85 protein:vir:80116 Length: 127 99.4 5.7E-16 3.5E-19 104.2 9.6 119 5-149 1-127 (127)
86 protein:vir:100652 Length: 134 99.4 1.1E-15 7E-19 102.5 10.6 120 1-151 1-134 (134)
87 protein:vir:101302 Length: 134 99.4 2.4E-15 1.5E-18 100.8 10.4 120 1-151 1-134 (134)
88 protein:vir:9513 Length: 134 # 99.4 2.4E-15 1.5E-18 100.8 10.4 120 1-151 1-134 (134)
89 protein:vir:9647 Length: 132 # 99.2 2.8E-13 1.8E-16 89.4 10.0 128 1-150 1-132 (132)
90 protein:vir:3848 Length: 159 # 99.1 3.1E-13 1.9E-16 89.2 9.2 128 1-154 1-159 (159)
91 protein:vir:96486 Length: 112 99.0 4.6E-12 2.8E-15 82.8 11.0 109 5-152 1-112 (112)
92 protein:vir:6246 Length: 143 # 98.9 8.7E-12 5.4E-15 81.2 9.0 127 1-154 1-143 (143)
93 protein:vir:98636 Length: 138 98.9 1.9E-11 1.2E-14 79.4 9.9 128 1-150 7-138 (138)
94 protein:vir:107703 Length: 147 98.9 4.7E-11 2.9E-14 77.2 11.4 116 5-155 1-144 (147)
95 protein:vir:1332 Length: 143 # 98.8 3.2E-11 2E-14 78.2 8.9 127 1-154 1-143 (143)
96 protein:vir:79091 Length: 175 98.8 9.5E-11 5.9E-14 75.5 10.2 128 1-155 1-175 (175)
97 protein:vir:102338 Length: 116 98.8 4.5E-11 2.8E-14 77.3 8.1 113 24-153 1-116 (116)
98 protein:vir:1988 Length: 156 # 98.8 1E-10 6.3E-14 75.4 10.0 122 1-150 1-156 (156)
99 protein:vir:104347 Length: 145 98.8 5.1E-11 3.1E-14 77.0 8.0 119 1-148 1-145 (145)
100 protein:vir:93898 Length: 133 98.8 1.6E-10 9.9E-14 74.3 10.5 127 1-146 1-133 (133)
101 protein:vir:103841 Length: 155 98.7 2E-10 1.2E-13 73.8 10.4 125 1-152 1-155 (155)
102 protein:vir:9363 Length: 133 # 98.7 2.4E-10 1.5E-13 73.4 10.6 127 1-146 1-133 (133)
103 protein:vir:78644 Length: 133 98.7 2.4E-10 1.5E-13 73.4 10.6 127 1-146 1-133 (133)
104 protein:vir:96973 Length: 133 98.7 2.4E-10 1.5E-13 73.4 10.6 127 1-146 1-133 (133)
105 protein:vir:94419 Length: 133 98.7 2.4E-10 1.5E-13 73.4 10.6 127 1-146 1-133 (133)
106 protein:vir:103280 Length: 142 98.7 2.1E-10 1.3E-13 73.6 9.7 116 5-148 1-142 (142)
107 protein:vir:78335 Length: 133 98.7 4.4E-10 2.7E-13 71.9 11.3 127 1-148 1-133 (133)
108 protein:vir:79638 Length: 146 98.7 3.7E-10 2.3E-13 72.3 10.6 117 5-155 1-144 (146)
109 protein:vir:79225 Length: 155 98.7 4.1E-10 2.6E-13 72.0 10.3 125 1-152 1-155 (155)
110 protein:vir:3163 Length: 145 # 98.7 2.4E-10 1.5E-13 73.4 9.0 117 7-154 1-145 (145)
111 protein:vir:107851 Length: 175 98.6 5.7E-10 3.6E-13 71.3 10.3 128 1-155 1-175 (175)
112 protein:vir:6216 Length: 125 # 98.6 3.8E-10 2.4E-13 72.2 9.0 121 1-148 1-125 (125)
113 protein:vir:99196 Length: 155 98.6 7.9E-10 4.9E-13 70.5 10.6 125 1-152 1-155 (155)
114 protein:vir:99833 Length: 190 98.5 2.8E-09 1.7E-12 67.5 11.8 139 1-152 1-190 (190)
115 protein:vir:94994 Length: 131 98.4 1.7E-09 1.1E-12 68.6 8.2 107 1-145 1-131 (131)
116 protein:vir:78380 Length: 131 98.3 6E-09 3.7E-12 65.7 8.3 107 1-145 1-131 (131)
117 protein:vir:96012 Length: 133 98.3 1.4E-08 8.7E-12 63.6 10.3 126 5-148 1-133 (133)
118 protein:vir:1087 Length: 161 # 98.2 2.5E-08 1.6E-11 62.3 10.2 140 4-155 1-158 (161)
119 protein:vir:96105 Length: 193 98.2 1.3E-08 8.1E-12 63.8 8.0 116 1-155 1-134 (193)
120 protein:vir:99546 Length: 200 98.2 1.9E-08 1.2E-11 62.9 8.4 119 1-155 5-141 (200)
121 protein:vir:3994 Length: 168 # 98.1 2.4E-08 1.5E-11 62.4 7.9 138 7-155 1-162 (168)
122 protein:vir:7412 Length: 168 # 98.1 8.1E-08 5E-11 59.5 10.3 138 7-155 1-162 (168)
123 protein:vir:95157 Length: 144 98.1 3.8E-08 2.3E-11 61.3 8.4 111 5-149 1-144 (144)
124 protein:vir:1028 Length: 168 # 98.0 8.5E-08 5.3E-11 59.3 9.4 138 7-155 1-162 (168)
125 protein:vir:97190 Length: 148 98.0 3.3E-08 2.1E-11 61.6 7.1 116 7-154 1-148 (148)
126 protein:vir:107757 Length: 189 98.0 6.1E-08 3.8E-11 60.1 7.1 89 1-155 1-91 (189)
127 protein:vir:96774 Length: 152 97.9 1E-07 6.2E-11 59.0 8.2 117 1-143 1-152 (152)
128 protein:vir:94944 Length: 121 97.9 4.4E-08 2.7E-11 60.9 6.1 98 1-133 2-121 (121)
129 protein:vir:5257 Length: 148 # 97.9 3.2E-08 2E-11 61.7 5.0 96 1-155 1-99 (148)
130 protein:vir:80970 Length: 112 97.9 2.7E-07 1.7E-10 56.6 9.7 112 1-148 1-112 (112)
131 protein:vir:2688 Length: 123 # 97.8 2.6E-07 1.6E-10 56.7 8.2 117 13-146 1-123 (123)
132 protein:vir:80425 Length: 134 97.7 1.3E-07 8.3E-11 58.3 5.6 108 1-154 1-134 (134)
133 protein:vir:101594 Length: 173 97.6 9.9E-07 6.2E-10 53.5 9.2 122 8-155 1-173 (173)
134 protein:vir:45 Length: 112 # N 97.6 1.3E-06 8.3E-10 52.8 9.7 112 1-148 1-112 (112)
135 protein:vir:98557 Length: 149 97.5 1.7E-06 1.1E-09 52.2 8.9 119 5-146 1-149 (149)
136 protein:vir:194 Length: 149 # 97.5 1.3E-06 8E-10 52.9 8.2 129 1-150 6-149 (149)
137 protein:vir:7449 Length: 123 # 97.4 7.4E-06 4.6E-09 48.7 11.7 121 1-155 1-123 (123)
138 protein:vir:1891 Length: 179 # 97.4 3.4E-06 2.1E-09 50.5 9.1 135 1-155 7-176 (179)
139 protein:vir:4096 Length: 140 # 97.3 2.9E-06 1.8E-09 51.0 7.9 133 1-155 1-140 (140)
140 protein:vir:80037 Length: 199 97.3 1.5E-06 9.4E-10 52.5 6.3 102 36-155 1-137 (199)
141 protein:vir:4347 Length: 164 # 97.2 8.5E-06 5.3E-09 48.4 10.0 132 1-154 7-164 (164)
142 protein:vir:100075 Length: 140 97.1 6.9E-06 4.3E-09 48.9 8.5 123 1-152 4-140 (140)
143 protein:vir:101563 Length: 155 97.1 8.3E-07 5.2E-10 53.9 3.4 95 25-155 1-106 (155)
144 protein:vir:2026 Length: 150 # 97.1 1.1E-05 6.8E-09 47.8 9.2 119 5-146 1-150 (150)
145 protein:vir:80362 Length: 140 97.1 1.3E-05 8.3E-09 47.3 9.6 125 1-152 4-140 (140)
146 protein:vir:6071 Length: 150 # 97.0 1.7E-05 1.1E-08 46.7 9.8 119 5-146 1-150 (150)
147 protein:vir:77650 Length: 155 97.0 1.3E-06 7.8E-10 52.9 3.1 95 25-155 1-106 (155)
148 protein:vir:5703 Length: 150 # 97.0 2.1E-05 1.3E-08 46.2 9.6 119 5-146 1-150 (150)
149 protein:vir:94069 Length: 168 96.9 1.5E-06 9.6E-10 52.5 3.0 107 5-155 1-116 (168)
150 protein:vir:8106 Length: 150 # 96.8 2.4E-06 1.5E-09 51.4 3.4 138 1-155 1-149 (150)
151 protein:vir:105089 Length: 133 96.8 2.5E-05 1.6E-08 45.8 9.0 131 5-155 1-133 (133)
152 protein:vir:96288 Length: 100 96.8 1.2E-05 7.5E-09 47.5 6.9 89 1-95 1-100 (100)
153 protein:vir:100312 Length: 152 96.7 3E-05 1.8E-08 45.4 8.7 127 5-147 1-152 (152)
154 protein:vir:79179 Length: 155 96.7 2.2E-05 1.4E-08 46.1 7.9 127 5-146 1-155 (155)
155 protein:vir:95260 Length: 160 96.7 2.1E-05 1.3E-08 46.2 7.7 83 1-155 1-107 (160)
156 protein:vir:7993 Length: 108 # 96.6 1.2E-06 7.5E-10 53.0 0.5 106 1-131 1-108 (108)
157 protein:vir:78607 Length: 155 96.6 2.7E-06 1.7E-09 51.1 2.4 103 5-155 1-106 (155)
158 protein:vir:4790 Length: 114 # 96.6 5.6E-05 3.5E-08 43.9 9.4 113 1-146 1-114 (114)
159 protein:vir:6375 Length: 205 # 96.5 0.00018 1.1E-07 41.1 12.0 152 1-155 1-199 (205)
160 protein:vir:1838 Length: 149 # 96.5 3.4E-05 2.1E-08 45.1 7.9 119 5-146 1-149 (149)
161 protein:vir:106728 Length: 155 96.5 3.7E-06 2.3E-09 50.4 2.5 103 5-155 1-106 (155)
162 protein:vir:2740 Length: 114 # 96.5 4.5E-05 2.8E-08 44.4 8.4 111 5-154 1-114 (114)
163 protein:vir:4906 Length: 114 # 96.5 4.5E-05 2.8E-08 44.4 8.4 111 5-154 1-114 (114)
164 protein:vir:96763 Length: 177 96.2 0.00022 1.3E-07 40.7 10.7 147 1-155 5-176 (177)
165 protein:vir:101508 Length: 120 96.2 0.00022 1.3E-07 40.7 10.6 118 1-152 1-120 (120)
166 protein:vir:1386 Length: 149 # 96.1 8.2E-05 5.1E-08 43.0 7.9 141 5-155 1-146 (149)
167 protein:vir:100243 Length: 140 96.1 0.00014 8.5E-08 41.8 9.0 134 5-155 1-135 (140)
168 protein:vir:1437 Length: 140 # 96.1 0.00011 7.1E-08 42.2 8.5 128 1-152 4-140 (140)
169 protein:vir:1581 Length: 116 # 95.9 0.00015 9.2E-08 41.6 8.6 114 1-145 1-116 (116)
170 protein:vir:79115 Length: 148 95.9 0.00022 1.3E-07 40.7 9.3 119 5-146 1-148 (148)
171 protein:vir:93617 Length: 148 95.8 0.00031 1.9E-07 39.9 9.8 129 1-150 6-148 (148)
172 protein:vir:105773 Length: 131 95.8 0.00016 1E-07 41.3 8.1 129 5-146 1-131 (131)
173 protein:vir:98892 Length: 108 95.7 0.00024 1.5E-07 40.5 8.9 107 1-146 2-108 (108)
174 protein:vir:3427 Length: 192 # 95.7 0.00043 2.7E-07 39.0 10.1 143 5-155 1-185 (192)
175 protein:vir:105007 Length: 146 95.4 0.0007 4.3E-07 37.9 10.3 134 5-155 1-145 (146)
176 protein:vir:107568 Length: 146 95.4 0.0007 4.3E-07 37.9 10.3 134 5-155 1-145 (146)
177 protein:vir:102085 Length: 146 95.4 0.0007 4.3E-07 37.9 10.3 134 5-155 1-145 (146)
178 protein:vir:102875 Length: 146 95.4 0.0007 4.3E-07 37.9 10.3 134 5-155 1-145 (146)
179 protein:vir:1164 Length: 156 # 95.1 0.00023 1.4E-07 40.5 6.8 124 5-150 1-156 (156)
180 protein:vir:396 Length: 184 # 95.1 0.00078 4.8E-07 37.6 9.6 142 5-154 1-184 (184)
181 protein:vir:4460 Length: 170 # 95.0 0.00012 7.2E-08 42.1 5.0 145 1-155 1-169 (170)
182 protein:vir:1273 Length: 127 # 94.5 0.00059 3.6E-07 38.3 7.6 127 5-153 1-127 (127)
183 protein:vir:487 Length: 187 # 94.2 0.00066 4.1E-07 38.0 7.1 151 1-155 1-186 (187)
184 protein:vir:5745 Length: 135 # 93.5 0.0032 2E-06 34.2 9.7 132 4-155 1-133 (135)
185 protein:vir:79687 Length: 113 93.5 0.0014 8.4E-07 36.3 7.5 109 7-155 1-109 (113)
186 protein:vir:102190 Length: 93 93.2 0.0011 6.6E-07 36.9 6.5 91 28-152 1-93 (93)
187 protein:vir:106570 Length: 182 93.0 0.0017 1E-06 35.8 7.2 126 5-155 1-179 (182)
188 protein:vir:4514 Length: 168 # 92.8 0.001 6.5E-07 36.9 5.9 145 1-155 1-168 (168)
189 protein:vir:3873 Length: 128 # 90.2 0.011 6.9E-06 31.3 8.7 127 5-153 1-128 (128)
190 protein:vir:4200 Length: 133 # 89.8 0.0031 1.9E-06 34.4 5.3 125 5-146 1-133 (133)
191 protein:vir:102608 Length: 108 85.2 0.002 1.3E-06 35.4 1.4 107 1-131 1-108 (108)
192 protein:vir:105825 Length: 108 85.2 0.002 1.3E-06 35.4 1.4 107 1-131 1-108 (108)
193 protein:vir:9823 Length: 118 # 84.7 0.043 2.7E-05 28.1 8.5 115 1-155 2-118 (118)
194 protein:vir:3036 Length: 118 # 84.7 0.043 2.7E-05 28.1 8.5 115 1-155 2-118 (118)
195 protein:vir:4162 Length: 133 # 83.6 0.012 7.6E-06 31.1 5.0 125 5-154 1-133 (133)
196 protein:vir:97088 Length: 157 83.2 0.07 4.4E-05 26.9 9.5 131 1-152 6-157 (157)
197 protein:vir:79555 Length: 192 78.2 0.12 7.3E-05 25.7 11.7 133 7-155 1-185 (192)
198 protein:vir:94654 Length: 142 77.6 0.049 3E-05 27.8 6.2 122 5-152 1-142 (142)
199 protein:vir:102154 Length: 119 75.7 0.09 5.6E-05 26.3 7.1 119 5-153 1-119 (119)
200 protein:vir:94538 Length: 125 75.5 0.13 8.3E-05 25.4 8.0 122 5-155 1-125 (125)
201 protein:vir:5978 Length: 144 # 72.7 0.18 0.00011 24.7 9.1 121 1-153 1-144 (144)
202 protein:vir:96121 Length: 137 71.2 0.2 0.00012 24.4 8.1 111 7-153 1-137 (137)
203 protein:vir:396 Length: 184 # 64.0 0.31 0.00019 23.4 7.6 138 8-155 1-177 (184)
204 protein:vir:94108 Length: 149 63.3 0.32 0.0002 23.3 7.5 125 1-153 1-149 (149)
205 protein:vir:107099 Length: 137 60.7 0.37 0.00023 23.0 8.0 115 7-153 1-137 (137)
206 protein:vir:101654 Length: 126 60.6 0.11 7.1E-05 25.7 4.3 108 7-121 1-126 (126)
207 protein:vir:7859 Length: 126 # 60.6 0.11 7.1E-05 25.7 4.3 108 7-121 1-126 (126)
208 protein:vir:102963 Length: 163 59.3 0.39 0.00024 22.8 8.3 138 5-155 1-157 (163)
209 protein:vir:78163 Length: 92 # 59.2 0.062 3.8E-05 27.2 2.6 92 1-134 1-92 (92)
210 protein:vir:95789 Length: 114 56.8 0.45 0.00028 22.5 7.9 112 5-153 1-114 (114)
211 protein:vir:3787 Length: 231 # 56.2 0.46 0.00029 22.4 8.1 141 1-155 1-229 (231)
212 protein:vir:99454 Length: 150 53.8 0.52 0.00032 22.1 8.2 126 5-137 1-150 (150)
213 protein:vir:9708 Length: 125 # 50.2 0.62 0.00038 21.7 8.8 125 9-154 1-125 (125)
214 protein:vir:3750 Length: 227 # 49.7 0.63 0.00039 21.7 9.4 139 1-155 1-226 (227)
215 protein:vir:966 Length: 123 # 43.8 0.83 0.00052 21.0 9.8 120 5-154 1-123 (123)
216 protein:vir:78894 Length: 105 34.6 0.35 0.00022 23.1 2.7 102 1-148 1-105 (105)
217 protein:vir:4230 Length: 111 # 25.8 1.5 0.0009 19.7 4.4 98 1-118 1-111 (111)
218 protein:vir:2435 Length: 111 # 22.1 2 0.0013 18.9 4.5 98 1-118 1-111 (111)

No 1
>protein:vir:97088 Length: 157 # NCBI annotation: hypothetical protein # Family: family:all:2714 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453568;genbank:gi:84662603;genbank:GeneID:5142503
Probab=100.00 E-value=1.1e-48 Score=283.54 Aligned_cols=155 Identities=45% Similarity=0.760 Sum_probs=148.5

Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecC
Q lcl|NC_019933.
1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNK 80 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~ 80 (155) ||++|+.++|++|.+.|++|++.+++++|+|+.+||++|+++||.+||++||+|++||++...++++++|.++|.|||+. T Consensus 1 m~~~~~~~d~s~l~~~l~~l~~~~~~v~R~A~~~ga~vv~dear~~aP~~tG~LkksI~~~~~~~~s~~g~~~~~Vg~~~ 80 (157) T protein:vir:97 1 MKFSIRSVDITGILAGLETVVEHSSDVVRTMTYESAVAVRESAKAFVNDETGKLRNNLYVAYSPEESVEGIQTYAVSWRK 80 (157) T ss_pred CeeEeecccHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhheeeeeccccCCCceEEEEEeecC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CccccchhhhccccccCCCcCCCCceeeeeecccce-eeeCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_019933. 81 KKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATP-VHVPARSFLRPGYDSVKGRLVEVANKAGAKRLAELRSKR 155 (155) Q Consensus 81 ~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt-~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~k~~~k~ 155 (155) +++|||||+||||+.+..........+||.+.++|| ++||||||||||||+.+++++++|.++|+++|+|+|++- T Consensus 81 ~~a~~g~~vEfG~~~~~~~~~~~~~~~~~~~~~~~t~~~~Pa~PFlRPA~d~~k~~a~~~~~~~l~k~I~e~l~g~ 156 (157) T protein:vir:97 81 KAAPHGHLLEFGHWQTHAAYRDKDGQWYSSKVKLVNPKWIPAKPFLRPGYDSVAMQIPDIARAAGAKKYAELQRGD 156 (157) T ss_pred CccceeeeeecCcccccccccCCcccccccccccCCCCcCCCCcccchHHHHhHHHHHHHHHHHHHHHHHHHhcCC Confidence 999999999999998887777777778999988876 679999999999999999999999999999999999999 No 2 >protein:vir:93617 Length: 148 # NCBI annotation: putative structural component # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449299;genbank:gi:157166047;interpro:IPR010064;interpro:IPR011693;uniprot:Q6H9U2;genbank:GeneID:5580439 Probab=100.00 E-value=2.4e-35 Score=210.42 Aligned_cols=146 Identities=14% Similarity=0.198 Sum_probs=114.7 Q ss_pred CceeeeeccHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEec Q lcl|NC_019933. 
1 MSSKITSLDISGVLSALNDLRDD-SDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWN 79 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~-~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~ 79 (155) |+++|++.||++|.+.|++|++. .+++++.|+++||.+|+++|+.+||++||.|++||.+......+ |.....|+++ T Consensus 2 m~~~~~i~Gldel~~~l~~L~~~~~~~~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~--g~~~~~v~~~ 79 (148) T protein:vir:93 2 IETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRRSRD--GGMESGVHIR 79 (148) T ss_pred cceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcchhhhhceeccccccC--Cceeeeeeec Confidence 99999999999999999999765 56899999999999999999999999999999999876554443 3333334333 Q ss_pred CCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_019933. 80 KKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLAELRSK 154 (155) Q Consensus 80 ~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~k~~~k 154 (155) ...... |...........+..+||.|+||||++|||||||+|||+.+++++++.|.++|+++|+|+++| T Consensus 80 ~~~~~~------~~~~~~~~~~~~~~~~y~~f~E~GT~~~pa~PFl~pA~~~~k~~~~~~~~~~~~~~i~k~~~k 148 (148) T protein:vir:93 80 GVNPDT------GNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEVLRR 148 (148) T ss_pred cccccc------ccccceeecCCCCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHHHHHhcC Confidence 221100 000111112234556778888888889999999999999999999999999999999999999 No 3 >protein:vir:100243 Length: 140 # NCBI annotation: gp72 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355408;genbank:gi:77864698;genbank:GeneID:3725965 Probab=100.00 E-value=1.1e-34 Score=206.86 Aligned_cols=131 Identities=22% Similarity=0.293 Sum_probs=111.7 Q ss_pred CceeeeeccHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEec Q lcl|NC_019933. 
1 MSSKITSLDISGVLSALNDLRDD-SDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWN 79 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~-~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~ 79 (155) |+ +|++.||++|.+.|+.|++. .++++++|+.+||.+|+++|+++||++||+|++||.+...+.+++++...+.|+++ T Consensus 1 Ma-~~~i~Gld~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~~~~~~~ 79 (140) T protein:vir:10 1 MS-SVQILGLADLQADFLKLAKAQSTKALRRATVAGANVIRDEARARAPKKTGKLKRNIVTAALKQKDSPGIATAGVRVR 79 (140) T ss_pred Cc-eeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHHhceecccccccccceeEEeeccc Confidence 87 57777999999999999865 46899999999999999999999999999999999988777666666666666543 Q ss_pred C-------CccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019933. 80 K-------KKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLAELR 152 (155) Q Consensus 80 ~-------~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~k~~ 152 (155) . +.++||||+|||| ++|||||||+|||+.++++++++|.+.|++.|+|++ T Consensus 80 ~~~~~~~~~~~~y~~f~E~GT-----------------------~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~k~~ 136 (140) T protein:vir:10 80 TKGKADSPNNAFYWRFVELGT-----------------------QFMKAEPFMRPAFDASIAQAEGAIRTEIARAIDQVV 136 (140) T ss_pred cccccCCCCcccccceeccCc-----------------------CCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHHh Confidence 2 2345555555555 589999999999999999999999999999999999 Q ss_pred ccC Q lcl|NC_019933. 
153 SKR 155 (155) Q Consensus 153 ~k~ 155 (155) +|+ T Consensus 137 ~~~ 139 (140) T protein:vir:10 137 GGG 139 (140) T ss_pred hcC Confidence 999 No 4 >protein:vir:5745 Length: 135 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892056;genbank:gi:33770519;interpro:IPR010064;interpro:IPR011693;uniprot:Q7Y404;genbank:GeneID:2637451 Probab=100.00 E-value=2.4e-34 Score=204.90 Aligned_cols=130 Identities=16% Similarity=0.182 Sum_probs=117.9 Q ss_pred CceeeeeccHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhCCCC----cchhhcceeeeecccccCCceEEEE Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDS-DSVSRTMAFESAAVVRDSAKAHVRSK----TGRLKGAIYAVYVPEESTEVRHVYA 75 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~-~~~~r~a~~~~a~~i~~eak~~aP~~----tG~Lr~sI~~~~~~~~~~~g~~~~~ 75 (155) |+++|++.||++|.+.|+.|+... +++++.|+.+||++|+++++.+||++ +|+|++||.++..+.+.+++...+. T Consensus 1 M~~~~~i~Gl~el~~~l~~L~~~~~~k~~~~Al~~~a~~v~~~~k~~ap~~~~~~~g~l~~~I~i~~~k~~~~~~~v~v~ 80 (135) T protein:vir:57 1 MIPEIEISGLQELERRLIAVGEEVGTKILRDAGRAAMAVVEADMKQNAGYDNSSTNAHMRDSIKIRSSRGKAGSTVVVLR 80 (135) T ss_pred CceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhHHhhcccccccccccceeEEEE Confidence 999999999999999999998764 68999999999999999999999986 4999999988877666666666777 Q ss_pred EEecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019933. 
76 VSWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLAELRS 153 (155) Q Consensus 76 Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~k~~~ 153 (155) ||+.++..+|+||+||||+ +|||||||+|||+.++++++++|.+.|+++|+|+.+ T Consensus 81 vg~~~~~~~~~~f~E~GT~-----------------------~~~a~PF~~pa~~~~~~~~~~~~~~~~~~~l~ka~r 135 (135) T protein:vir:57 81 VGPTRSHYMKALAQEFGTI-----------------------KQVAKPFIRPALDYNKMQVLRILTVEIRDGLSTLSR 135 (135) T ss_pred ecCCCCcceeEeecccCCC-----------------------CCCCCcchhHhHHHhHHHHHHHHHHHHHHHHHHhcC Confidence 8877777778888899985 799999999999999999999999999999999999 No 5 >protein:vir:100075 Length: 140 # NCBI annotation: gp9 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945039;genbank:gi:38707899;genbank:GeneID:2744122 Probab=100.00 E-value=2.7e-34 Score=204.60 Aligned_cols=138 Identities=19% Similarity=0.295 Sum_probs=109.3 Q ss_pred CceeeeeccHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEec Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDD-SDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWN 79 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~-~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~ 79 (155) |+ +|++.|||+|.+.|+.|++. .++++++|+.++|.+|+++|+++||++||+|++||.+...+.+ ++...+.||+. T Consensus 1 Ma-~~~i~Gld~l~~~l~~L~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~tG~l~~sI~~~~~~~~--~~~~~~~~g~~ 77 (140) T protein:vir:10 1 MS-SIQIIGLADLRADFEKLAKSQSTKALRRATVAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQK--DAPGLATAGVR 77 (140) T ss_pred Cc-eeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHHhccccccccc--cccceEEeeee Confidence 87 47777899999999999865 4689999999999999999999999999999999987654443 33444455543 Q ss_pred CCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_019933. 
80 KKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLAELRSKR 155 (155) Q Consensus 80 ~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~k~~~k~ 155 (155) .+... .....+.++||.|.|+||++|||||||+||++.++++++++|.+++.+.|+|++.+| T Consensus 78 ~~~~~--------------~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~k~~~~~ 139 (140) T protein:vir:10 78 VRTKG--------------KADSPNNAFYWRFDEFGTQHMKAQPFMRPAFDASIGEAEGAIRTELARAIDRVLGGR 139 (140) T ss_pred ecccc--------------ccCCCCccceeeeeccCCCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 22110 011223455666666666789999999999999999999999999999999999999 No 6 >protein:vir:1891 Length: 179 # NCBI annotation: gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037671;genbank:gi:9634129;genbank:GeneID:1262520 Probab=100.00 E-value=8.7e-34 Score=201.84 Aligned_cols=155 Identities=14% Similarity=0.173 Sum_probs=123.4 Q ss_pred Cc--eeeeeccHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhCCC-----Ccchhhcceeeeeccc-ccCCce Q lcl|NC_019933. 1 MS--SKITSLDISGVLSALNDLRDD-SDSVSRTMAFESAAVVRDSAKAHVRS-----KTGRLKGAIYAVYVPE-ESTEVR 71 (155) Q Consensus 1 M~--~~m~~~~l~~L~~~l~~l~~~-~~~~~r~a~~~~a~~i~~eak~~aP~-----~tG~Lr~sI~~~~~~~-~~~~g~ 71 (155) |. +++++.||++|++.|+.|++. .++++++|+.+||++|+++|+++||+ ++|.|++||.+..... ...+|. T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~~~~~~~~~g~ 80 (179) T protein:vir:18 1 MADSVEVSLTGLESLLGKMEAVSEVTRNKAGRFALRKAANIIRDRARSNASRVDDPLTKEAIHKNIVASFSSKQFRRTGD 80 (179) T ss_pred CCceEEEEeecHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccccchhhhhhheeecccccccccccc Confidence 77 677788999999999999865 57899999999999999999999965 5799999998765443 344567 Q ss_pred EEEEEEecCCccccchhh------hccc--cccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHH Q lcl|NC_019933. 
72 HVYAVSWNKKKAPHGHLV------EYGH--WRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKA 143 (155) Q Consensus 72 ~~~~Vg~~~~~a~~~~~v------EfGt--~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~ 143 (155) ..+.||+..+..+++.-- ..|. ...+.....+++.+||+|+||||++|||||||||||++++++++++|.++ T Consensus 81 ~~~~vgv~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~y~~fvEfGT~kmpa~PFlrPA~~~~~~~a~~~i~~~ 160 (179) T protein:vir:18 81 LAFRVGVMGGARQYANTKANVRKGRAGKTYKTSGDKGNPGGDTWYWRFLEFGTEHTSARPILRPAMNGVDNDVINVFSTE 160 (179) T ss_pred eeEeeecccccccccccccccccCcccccccccccccCCCCccceeEEeccCCCCCCCCccchhhHHhhHHHHHHHHHHH Confidence 778888766554433210 0110 01112233556789999999999999999999999999999999999999 Q ss_pred HHHHHHHHhccC Q lcl|NC_019933. 144 GAKRLAELRSKR 155 (155) Q Consensus 144 l~~~i~k~~~k~ 155 (155) |+++|+|++++. T Consensus 161 l~~~i~k~lk~~ 172 (179) T protein:vir:18 161 MGKAIDRAIRLA 172 (179) T ss_pred HHHHHHHHHHhh Confidence 999999999988 No 7 >protein:vir:80362 Length: 140 # NCBI annotation: gp10, phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111089;genbank:gi:134288660;genbank:GeneID:4960609 Probab=99.96 E-value=2.2e-33 Score=199.59 Aligned_cols=138 Identities=20% Similarity=0.303 Sum_probs=107.4 Q ss_pred CceeeeeccHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEec Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDD-SDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWN 79 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~-~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~ 79 (155) |+ +|++.||++|.+.|+.|+.. .++++++|+.++|.+|+++|+++||++||+|++||.+...+.. 
++...+.++++ T Consensus 1 Ma-~~~i~Gld~l~~~l~~l~~~~~~k~~~~a~~~~a~~v~~~ak~~aP~~tG~l~~~i~~~~~~~~--~~~~~~~~~~~ 77 (140) T protein:vir:80 1 MS-SIQIVGLADLLADFERLAKSQSTKALRRATVAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQK--DAPGLATAGVR 77 (140) T ss_pred Cc-eeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhceeeeccccc--cccceeeeeee Confidence 88 57777899999999999765 5689999999999999999999999999999999986544333 33344445543 Q ss_pred CCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_019933. 80 KKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLAELRSKR 155 (155) Q Consensus 80 ~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~k~~~k~ 155 (155) .+... + ....+.++||.|.|+||.+|||||||+||++.++++++++|.++|++.|++++++| T Consensus 78 ~~~~~-------~-------~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~k~~~~~ 139 (140) T protein:vir:80 78 VRTKG-------K-------ADSPSNAFYWRFDEFGTQHMKAQPFMRPAFDASIGEAEGAIRTELARAIDQALGGR 139 (140) T ss_pred ccccc-------c-------cCCCCCcceeeeeccCCCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 22100 0 00123344555555555689999999999999999999999999999999999999 No 8 >protein:vir:194 Length: 149 # NCBI annotation: Gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037704;genbank:gi:9634169;genbank:GeneID:1262536 Probab=99.96 E-value=3.8e-33 Score=198.33 Aligned_cols=147 Identities=18% Similarity=0.226 Sum_probs=116.3 Q ss_pred CceeeeeccHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEec Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDD-SDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWN 79 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~-~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~ 79 (155) |+++|++.||++|.+.|+.|+.. .+++++.|+.++|++|+++|+++||++||+|++||.+....... .+.....|++. 
T Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~si~~~~~~~~~-~~~~~~~v~~~ 80 (149) T protein:vir:19 2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIDRAPVRTGKLKKNVVVVTQKSRR-RGEISSGVHIR 80 (149) T ss_pred cceeeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCchhhhhhcccccccccc-ccceeeccccc Confidence 99999999999999999999876 45899999999999999999999999999999999875443322 23333334433 Q ss_pred CCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_019933. 80 KKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLAELRSK 154 (155) Q Consensus 80 ~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~k~~~k 154 (155) ..... -|...........+.++||.|.|+||++|||||||+|||+.++++++++|.++|+++|+|+++| T Consensus 81 ~~~~~------~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PF~~pA~~~~k~~~~~~~~~~l~~~l~k~~~k 149 (149) T protein:vir:19 81 GVNPR------TGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAIDEVLSK 149 (149) T ss_pred ccccc------cccccceeecCCCCccceeeeeccCCCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 22111 0111111222344567888999999999999999999999999999999999999999999999 No 9 >protein:vir:1437 Length: 140 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536366;genbank:gi:17975171;genbank:GeneID:929147 Probab=99.96 E-value=5.2e-33 Score=197.58 Aligned_cols=138 Identities=19% Similarity=0.297 Sum_probs=109.3 Q ss_pred CceeeeeccHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEec Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDD-SDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWN 79 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~-~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~ 79 (155) |+ +|++.||++|.+.|+.|+.. ..+++++|+.++|.+|+++|+++||++||+|++||.+..... .++...+.||+. 
T Consensus 1 M~-~~~i~Gld~l~~~l~~l~~~~~~~~~~~al~~~a~~v~~~ak~~aP~~tG~l~~sI~~~~~~~--~~~~~~~~vg~~ 77 (140) T protein:vir:14 1 MS-SIQIIGLADLRADFEKLAKSQSAKALRRATLAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQ--KDAPGLATAGVR 77 (140) T ss_pred Cc-eeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHhhcccccccc--cccceeEEeeee Confidence 87 47777899999999999765 567999999999999999999999999999999998754433 345555566653 Q ss_pred CCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_019933. 80 KKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLAELRSKR 155 (155) Q Consensus 80 ~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~k~~~k~ 155 (155) .+... .....+.++||.|.|+||++|||||||+||++.+++++++.|.++|++.|++++++| T Consensus 78 ~~~~~--------------~~~~~~~~~y~~f~E~GT~~~~a~pFl~pa~~~~~~~~~~~~~~~~~~~l~k~~~~~ 139 (140) T protein:vir:14 78 VRTKG--------------KADSPNNAFYWRFDEFGTQHMKAQPFMRPAFDASIGEAEGAIRTELARAIDRVLGGR 139 (140) T ss_pred ecccc--------------ccCCCCccceeeeeccccCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 22110 001223455555556666789999999999999999999999999999999999999 No 10 >protein:vir:101594 Length: 173 # NCBI annotation: hypothetical protein # Family: family:all:26502 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112510;genbank:gi:53793610;interpro:IPR010064;uniprot:Q5ZGE3;genbank:GeneID:3101702 Probab=99.96 E-value=6.3e-33 Score=197.10 Aligned_cols=141 Identities=15% Similarity=0.137 Sum_probs=109.7 Q ss_pred eeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecCCccc Q lcl|NC_019933. 5 ITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNKKKAP 84 (155) Q Consensus 5 m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~~a~ 84 (155) |++.||++|.+.|+.|++.+.+++++|+.++|.+|+++|+.+||++||+|++||.+..... 
.+...+.| ...++ T Consensus 1 i~i~Gld~L~~~L~~l~~~~~~~~~~a~~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~~~~---~~~~~~~v---~~~~~ 74 (173) T protein:vir:10 1 MAVKGVAEVIAELRKIGKDIDKNINATTEEAANFIEDRAKTLAPKNFGKLAQSISTSDLKA---KDLISKKI---TVNEL 74 (173) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCchhhhhcceeeeecc---CceeEEee---CCCcc Confidence 8888999999999999998899999999999999999999999999999999997754332 23333333 25678 Q ss_pred cchhhhccccccCCCcCCC-----------------Cce---------------eeeeecccceeeeCCccchhhHHHHH Q lcl|NC_019933. 85 HGHLVEYGHWRTNVVAEVD-----------------GKW---------------LFTKEKLATPVHVPARSFLRPGYDSV 132 (155) Q Consensus 85 ~~~~vEfGt~~~~~~~~~~-----------------~~~---------------~~~~~~~~gt~~~pa~PFlrPA~~~~ 132 (155) |+.||||||+....-.+.. +.| .++.+....+..|||||||+|||+.+ T Consensus 75 Ya~fvEfGT~~m~a~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~G~~aqPFl~PA~~~~ 154 (173) T protein:vir:10 75 YGAYMEFGTGAKVSVPKEFADMAASFKGQKTGSFKDGLESIKAWCRAKGIDEKAAYPIFAKILGAGINPQPFLYPAWIEG 154 (173) T ss_pred cchhhhcccccccCCCchhhhhhcccccccccccccccccccccccccccchhcccceeeEeecCCCCCCccchhHHHHh Confidence 9999999997533211100 000 00111122345699999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 133 KGRLVEVANKAGAKRLAEL 151 (155) Q Consensus 133 ~~~~~~~i~~~l~~~i~k~ 151 (155) ++++.++|+++|+++|.|+ T Consensus 155 ~~~~~~~i~~~i~~~lrk~ 173 (173) T protein:vir:10 155 KKQYLKDLENLLKTYNKKI 173 (173) T ss_pred HHHHHHHHHHHHHHHhhcC Confidence 9999999999999999999 No 11 >protein:vir:4347 Length: 164 # NCBI annotation: Orf14 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061510;genbank:gi:9635606;genbank:GeneID:1262873 Probab=99.96 E-value=1.6e-32 Score=194.93 Aligned_cols=148 Identities=10% Similarity=0.050 Sum_probs=119.1 Q ss_pred Cc--eeeeeccHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhCCC-----Ccchhhcceeeeeccc-ccCCce Q lcl|NC_019933. 
1 MS--SKITSLDISGVLSALNDLRDD-SDSVSRTMAFESAAVVRDSAKAHVRS-----KTGRLKGAIYAVYVPE-ESTEVR 71 (155) Q Consensus 1 M~--~~m~~~~l~~L~~~l~~l~~~-~~~~~r~a~~~~a~~i~~eak~~aP~-----~tG~Lr~sI~~~~~~~-~~~~g~ 71 (155) |+ ++|++.||++|.+.|++|+.. .+++++.|+.+||++|+++|+.+||+ ++|+|++||.+..+.. ...++. T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~~~~~~~~~~~ 80 (164) T protein:vir:43 1 MADTVEFSITGLDSLLGKLDSVTDDVKRRGGRAALRKAAMIVVQAAKQGAEKVDDPGTGRSISDNIALRWNGRLFKRTGD 80 (164) T ss_pred CCcceEEeeecHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccCCCccchhhhhhhhhcccCccccccc Confidence 87 567788999999999999876 56899999999999999999999996 6789999997754322 223455 Q ss_pred EEEEEEecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 72 HVYAVSWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLAEL 151 (155) Q Consensus 72 ~~~~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~k~ 151 (155) ..+.||+..+...... ........+++.+||+|+||||++|||||||||||+.++++++++|.++|+++|+++ T Consensus 81 ~~~~vg~~~~~~~~~~-------~~~~~~~~~~~~~y~~f~EfGT~km~a~PFlrPA~~~~k~~~~~~~~~~l~~~i~ka 153 (164) T protein:vir:43 81 LGFRIGVLHGAVLPKK-------GERSDKTANAPTPHWRLLEFGTEDMRAQPFMRSALADNIAEVTSTFVSEYEKGIDRA 153 (164) T ss_pred eeEEeccccccccccc-------ccccccCCCCCcceEEEeecCCCCCCCCcchhhhHHHhHHHHHHHHHHHHHHHHHHH Confidence 6677777554322111 111222345678999999999999999999999999999999999999999999999 Q ss_pred hccC Q lcl|NC_019933. 152 RSKR 155 (155) Q Consensus 152 ~~k~ 155 (155) +++. 
T Consensus 154 ~~k~ 157 (164) T protein:vir:43 154 IKRA 157 (164) T ss_pred HHHH Confidence 9988 No 12 >protein:vir:81067 Length: 119 # NCBI annotation: p12 # Family: family:all:2714 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285682;genbank:gi:156535145;genbank:GeneID:5247112 Probab=99.96 E-value=2.5e-33 Score=199.34 Aligned_cols=117 Identities=51% Similarity=0.922 Sum_probs=111.0 Q ss_pred HHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecCCccccchhhhccccccCCCcC-CCCceeeeeeccccee Q lcl|NC_019933. 39 VRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNKKKAPHGHLVEYGHWRTNVVAE-VDGKWLFTKEKLATPV 117 (155) Q Consensus 39 i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~~a~~~~~vEfGt~~~~~~~~-~~~~~~~~~~~~~gt~ 117 (155) |+|||+.++|++||+|++||++.+++++|++|.++|.||||..+|||||++|||+++....++ .+|.|+.....+.+|+ T Consensus 1 ~rDeakarv~~~~G~Lr~sIY~ay~~~~S~dG~~~Y~Vswn~rkAPhghlvE~Ghw~~~~~~~~~dG~w~~~~~~l~~~~ 80 (119) T protein:vir:81 1 MRESAKAFVNDETGKLRSNLYVAYSPEESTNGVQTYAVSWRKKAAPHGHLLEFGHWQTHAAYKGKDGEWYSSSVKLVNPK 80 (119) T ss_pred CCcccccccCCCccchhhhheeeeccccCCCCeEEEEeeccCCcCCcccccccceeeeeeeeeccCceeeecCccccCce Confidence 999999999999999999999999999999999999999999999999999999998887765 5566777777889999 Q ss_pred eeCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_019933. 
118 HVPARSFLRPGYDSVKGRLVEVANKAGAKRLAELRSKR 155 (155) Q Consensus 118 ~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~k~~~k~ 155 (155) +|||+||||||||+..++++++|.+++++.+.|++++| T Consensus 81 ~vPa~pFlRpA~da~~~~a~~~~~~r~~~rv~Ev~rg~ 118 (119) T protein:vir:81 81 WIPARPFLRPGYDSVAMQIPDIAKAAGAKKYAELQRGE 118 (119) T ss_pred ecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhccC Confidence 99999999999999999999999999999999999999 No 13 >protein:vir:10367 Length: 119 # NCBI annotation: conserved phage protein # Family: family:all:2714 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858959;genbank:gi:32128424;genbank:GeneID:2648366 Probab=99.96 E-value=2.5e-33 Score=199.31 Aligned_cols=117 Identities=50% Similarity=0.900 Sum_probs=110.9 Q ss_pred HHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecCCccccchhhhccccccCCCcC-CCCceeeeeeccccee Q lcl|NC_019933. 39 VRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNKKKAPHGHLVEYGHWRTNVVAE-VDGKWLFTKEKLATPV 117 (155) Q Consensus 39 i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~~a~~~~~vEfGt~~~~~~~~-~~~~~~~~~~~~~gt~ 117 (155) |+|||+.++|++||+|++||++.+++++|++|.++|.||||..+|||||++|||+++....+. .+|.|+.....+.+++ T Consensus 1 ~rDeakarv~~~~G~Lr~sIY~ay~~~~S~dG~~~Y~Vswn~rkAPhghlvE~Ghw~~~~~~~~~dG~w~~~~~~l~~~~ 80 (119) T protein:vir:10 1 MRESAKAFVNDETGKLRSNLYVAYSTEESTNGVQTYAVSWRKKAAPHGHLLEFGHWQTHAAYKGKDGEWYSSSVKLVNPK 80 (119) T ss_pred CCcccccccCCCccchhhhheeeeccccCCCCEEEEEeecCCCcCCcccccccceeeeeeeeeccCceeeecCccccCce Confidence 999999999999999999999999999999999999999999999999999999998877765 5666777777889999 Q ss_pred eeCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_019933. 
118 HVPARSFLRPGYDSVKGRLVEVANKAGAKRLAELRSKR 155 (155) Q Consensus 118 ~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~k~~~k~ 155 (155) +|||+||||||||+..++++++|.+++++.+.|++++| T Consensus 81 ~vPa~pFlRpA~da~~~~a~~~~~~r~~~rv~Ev~rg~ 118 (119) T protein:vir:10 81 WIPARPFLRPGYDSVAMQIPDIAKAAGAKKYAELQRGE 118 (119) T ss_pred ecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhccC Confidence 99999999999999999999999999999999999999 No 14 >protein:vir:107568 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338191;genbank:gi:77020147;genbank:GeneID:3703699 Probab=99.95 E-value=1e-31 Score=190.51 Aligned_cols=129 Identities=12% Similarity=0.171 Sum_probs=103.8 Q ss_pred Cc--eeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeec------------ccc Q lcl|NC_019933. 1 MS--SKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYV------------PEE 66 (155) Q Consensus 1 M~--~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~------------~~~ 66 (155) |+ ++|++.||++|.+.|+.|+..+++++++|+.+||++|+++|+.++|+++|.|++++..... ..+ T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ 80 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAK 80 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceecccc Confidence 87 4667889999999999999988999999999999999999999999999988887743211 111 Q ss_pred cCCceEEEEEEecCC---ccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHH Q lcl|NC_019933. 67 STEVRHVYAVSWNKK---KAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKA 143 (155) Q Consensus 67 ~~~g~~~~~Vg~~~~---~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~ 143 (155) ..+|...+.||+..+ .++||||+|||| ++|||||||+|||+.++++++++|.+. 
T Consensus 81 ~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT-----------------------~~~~a~PFl~pa~~~~k~~~~~~~~~~ 137 (146) T protein:vir:10 81 LEGGIKTVKIGLNKADRSPWFYLKFHEWGT-----------------------SKMPAHPFIEPGFNASKAEAVRAMTDI 137 (146) T ss_pred ccccceeEEeeeccCCCCCcceeeeeccCC-----------------------CCCCCCcchhHHHHHhHHHHHHHHHHH Confidence 223444555665432 345555555555 589999999999999999999999999 Q ss_pred HHHHHHHHh Q lcl|NC_019933. 144 GAKRLAELR 152 (155) Q Consensus 144 l~~~i~k~~ 152 (155) |+++|+++| T Consensus 138 l~~~l~ka~ 146 (146) T protein:vir:10 138 LKNEMRLDL 146 (146) T ss_pred HHHHHhhcC Confidence 999999999 No 15 >protein:vir:102875 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338140;genbank:gi:77020200;genbank:GeneID:3703784 Probab=99.95 E-value=1e-31 Score=190.51 Aligned_cols=129 Identities=12% Similarity=0.171 Sum_probs=103.8 Q ss_pred Cc--eeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeec------------ccc Q lcl|NC_019933. 1 MS--SKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYV------------PEE 66 (155) Q Consensus 1 M~--~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~------------~~~ 66 (155) |+ ++|++.||++|.+.|+.|+..+++++++|+.+||++|+++|+.++|+++|.|++++..... ..+ T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ 80 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAK 80 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceecccc Confidence 87 4667889999999999999988999999999999999999999999999988887743211 111 Q ss_pred cCCceEEEEEEecCC---ccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHH Q lcl|NC_019933. 
67 STEVRHVYAVSWNKK---KAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKA 143 (155) Q Consensus 67 ~~~g~~~~~Vg~~~~---~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~ 143 (155) ..+|...+.||+..+ .++||||+|||| ++|||||||+|||+.++++++++|.+. T Consensus 81 ~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT-----------------------~~~~a~PFl~pa~~~~k~~~~~~~~~~ 137 (146) T protein:vir:10 81 LEGGIKTVKIGLNKADRSPWFYLKFHEWGT-----------------------SKMPAHPFIEPGFNASKAEAVRAMTDI 137 (146) T ss_pred ccccceeEEeeeccCCCCCcceeeeeccCC-----------------------CCCCCCcchhHHHHHhHHHHHHHHHHH Confidence 223444555665432 345555555555 589999999999999999999999999 Q ss_pred HHHHHHHHh Q lcl|NC_019933. 144 GAKRLAELR 152 (155) Q Consensus 144 l~~~i~k~~ 152 (155) |+++|+++| T Consensus 138 l~~~l~ka~ 146 (146) T protein:vir:10 138 LKNEMRLDL 146 (146) T ss_pred HHHHHhhcC Confidence 999999999 No 16 >protein:vir:102085 Length: 146 # NCBI annotation: head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512318;genbank:gi:89152487;genbank:GeneID:3953078 Probab=99.95 E-value=1e-31 Score=190.51 Aligned_cols=129 Identities=12% Similarity=0.171 Sum_probs=103.8 Q ss_pred Cc--eeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeec------------ccc Q lcl|NC_019933. 1 MS--SKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYV------------PEE 66 (155) Q Consensus 1 M~--~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~------------~~~ 66 (155) |+ ++|++.||++|.+.|+.|+..+++++++|+.+||++|+++|+.++|+++|.|++++..... 
..+ T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ 80 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAK 80 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceecccc Confidence 87 4667889999999999999988999999999999999999999999999988887743211 111 Q ss_pred cCCceEEEEEEecCC---ccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHH Q lcl|NC_019933. 67 STEVRHVYAVSWNKK---KAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKA 143 (155) Q Consensus 67 ~~~g~~~~~Vg~~~~---~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~ 143 (155) ..+|...+.||+..+ .++||||+|||| ++|||||||+|||+.++++++++|.+. T Consensus 81 ~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT-----------------------~~~~a~PFl~pa~~~~k~~~~~~~~~~ 137 (146) T protein:vir:10 81 LEGGIKTVKIGLNKADRSPWFYLKFHEWGT-----------------------SKMPAHPFIEPGFNASKAEAVRAMTDI 137 (146) T ss_pred ccccceeEEeeeccCCCCCcceeeeeccCC-----------------------CCCCCCcchhHHHHHhHHHHHHHHHHH Confidence 223444555665432 345555555555 589999999999999999999999999 Q ss_pred HHHHHHHHh Q lcl|NC_019933. 144 GAKRLAELR 152 (155) Q Consensus 144 l~~~i~k~~ 152 (155) |+++|+++| T Consensus 138 l~~~l~ka~ 146 (146) T protein:vir:10 138 LKNEMRLDL 146 (146) T ss_pred HHHHHhhcC Confidence 999999999 No 17 >protein:vir:105007 Length: 146 # NCBI annotation: conserved phage protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459972;genbank:gi:85701387;genbank:GeneID:3882148 Probab=99.95 E-value=1e-31 Score=190.51 Aligned_cols=129 Identities=12% Similarity=0.171 Sum_probs=103.8 Q ss_pred Cc--eeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeec------------ccc Q lcl|NC_019933. 
1 MS--SKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYV------------PEE 66 (155) Q Consensus 1 M~--~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~------------~~~ 66 (155) |+ ++|++.||++|.+.|+.|+..+++++++|+.+||++|+++|+.++|+++|.|++++..... ..+ T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ 80 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAK 80 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceecccc Confidence 87 4667889999999999999988999999999999999999999999999988887743211 111 Q ss_pred cCCceEEEEEEecCC---ccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHH Q lcl|NC_019933. 67 STEVRHVYAVSWNKK---KAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKA 143 (155) Q Consensus 67 ~~~g~~~~~Vg~~~~---~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~ 143 (155) ..+|...+.||+..+ .++||||+|||| ++|||||||+|||+.++++++++|.+. T Consensus 81 ~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT-----------------------~~~~a~PFl~pa~~~~k~~~~~~~~~~ 137 (146) T protein:vir:10 81 LEGGIKTVKIGLNKADRSPWFYLKFHEWGT-----------------------SKMPAHPFIEPGFNASKAEAVRAMTDI 137 (146) T ss_pred ccccceeEEeeeccCCCCCcceeeeeccCC-----------------------CCCCCCcchhHHHHHhHHHHHHHHHHH Confidence 223444555665432 345555555555 589999999999999999999999999 Q ss_pred HHHHHHHHh Q lcl|NC_019933. 
144 GAKRLAELR 152 (155) Q Consensus 144 l~~~i~k~~ 152 (155) |+++|+++| T Consensus 138 l~~~l~ka~ 146 (146) T protein:vir:10 138 LKNEMRLDL 146 (146) T ss_pred HHHHHhhcC Confidence 999999999 No 18 >protein:vir:105089 Length: 133 # NCBI annotation: Gp11 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006591;genbank:gi:46402097;genbank:GeneID:2777955 Probab=99.95 E-value=8.8e-32 Score=190.83 Aligned_cols=126 Identities=16% Similarity=0.159 Sum_probs=103.9 Q ss_pred CceeeeeccHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhCCCCcch----hhcceeeeecc-cccCCceEEE Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDD-SDSVSRTMAFESAAVVRDSAKAHVRSKTGR----LKGAIYAVYVP-EESTEVRHVY 74 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~-~~~~~r~a~~~~a~~i~~eak~~aP~~tG~----Lr~sI~~~~~~-~~~~~g~~~~ 74 (155) |+++ +.||++|.+.|++|+.. .+++++.|+.+||++|+++|+.+||+++|. |++||.+.... ....++...+ T Consensus 2 ~~~~--i~Gl~el~~~l~~L~~~~~~k~~~~Al~~~a~~i~~~ak~~ap~~~~~~~~~~~~~I~v~~~~~~~~~~~~~~v 79 (133) T protein:vir:10 2 IRME--VKGLDELERQLTALGEKVATKVLRDAGREALKVVEEDMKQHAGFDETSTGQHMRDSIKIRSSTRKAQGNAVVTL 79 (133) T ss_pred eeEe--eehHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCcchhhhhhcccccccccccCccceEEE Confidence 4454 44889999999999875 567999999999999999999999998875 89999765333 3334566677 Q ss_pred EEEecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 
75 AVSWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLAEL 151 (155) Q Consensus 75 ~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~k~ 151 (155) .||..+...+||+|+||||+ +|||||||+|||+.++++++++|.++|+++|+|= T Consensus 80 ~vg~~~~~~~y~~f~E~GT~-----------------------k~~a~PF~~pA~~~~~~~~~~~~~~~~~~~l~K~ 133 (133) T protein:vir:10 80 RVGPSKQHHMKVLAQEFGTV-----------------------KQVADPFIRPALDYNVQTVLRVLTVEIRNGIQNR 133 (133) T ss_pred EecCCCCccceEeeeccCCC-----------------------CCCCCccchHHHHHhHHHHHHHHHHHHHHHhhcC Confidence 78877777788889999885 7999999999999999999999988887777655 No 19 >protein:vir:1386 Length: 149 # NCBI annotation: Gp9 protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612838;genbank:gi:20065972;genbank:GeneID:935787 Probab=99.95 E-value=2e-31 Score=188.93 Aligned_cols=129 Identities=14% Similarity=0.187 Sum_probs=111.4 Q ss_pred Cc--eeeeeccHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHhCCCC-------------cchhhcceeeeec Q lcl|NC_019933. 1 MS--SKITSLDISGVLSALNDLR--DDSDSVSRTMAFESAAVVRDSAKAHVRSK-------------TGRLKGAIYAVYV 63 (155) Q Consensus 1 M~--~~m~~~~l~~L~~~l~~l~--~~~~~~~r~a~~~~a~~i~~eak~~aP~~-------------tG~Lr~sI~~~~~ 63 (155) |+ ++|++.||++|.+.|+.|+ ...+++++.|+++||.+|+++++.++|+. +|+++++|.+... T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~~k~~~~Al~~ga~~v~~~~k~~aP~~~~~~~~~~~~~~~~~~~~d~i~~~~~ 80 (149) T protein:vir:13 1 MSDGWEIKFEGLDDLIKTFEQLGTEKENEDVEKSILKECGDLAKKTVAPLIHISDDNSKSGRKGSRPPGHAANNIPEPKI 80 (149) T ss_pred CCceeEEEeecHHHHHHHHHhcccHHHHHHHHHHHHHHHHHHHHHHHHHhCCccCCccccccccccccchhhhcceeccc Confidence 98 5788889999999999995 57889999999999999999999999963 5689999977433 Q ss_pred ccccCCceEEEEEEecCC---ccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHH Q lcl|NC_019933. 
64 PEESTEVRHVYAVSWNKK---KAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVA 140 (155) Q Consensus 64 ~~~~~~g~~~~~Vg~~~~---~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i 140 (155) + ..+|..++.|||.++ .++||||+|||| ++|||||||+||++.++++++++| T Consensus 81 ~--~~~g~~~~~VG~~~~~~~~~~y~~f~E~GT-----------------------~k~~a~pF~~pa~~~~~~~~~~~~ 135 (149) T protein:vir:13 81 R--KKKGNLQCVVGWEKSDNTPFYYMKMEEWGT-----------------------SERPPHHAFGKTNKILKRVYDNIA 135 (149) T ss_pred c--cccceeEEEeeccCCCCCccceeeeeccCc-----------------------cCCCCCccchHHHHHHHHHHHHHH Confidence 3 345677788998764 347777777777 489999999999999999999999 Q ss_pred HHHHHHHHHHHhcc Q lcl|NC_019933. 141 NKAGAKRLAELRSK 154 (155) Q Consensus 141 ~~~l~~~i~k~~~k 154 (155) .++|.+.|++.|.- T Consensus 136 ~~~l~k~i~~~lG~ 149 (149) T protein:vir:13 136 QKKYDNFVKEKLGD 149 (149) T ss_pred HHHHHHHHHHHhcC Confidence 99999999999999 No 20 >protein:vir:1273 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690765;genbank:gi:22855005;genbank:GeneID:955232 Probab=99.95 E-value=1.1e-31 Score=190.21 Aligned_cols=124 Identities=20% Similarity=0.303 Sum_probs=110.2 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC---cchhhcceeeeecccccCCceEEEEEE Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSK---TGRLKGAIYAVYVPEESTEVRHVYAVS 77 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~---tG~Lr~sI~~~~~~~~~~~g~~~~~Vg 77 (155) |.. 
|++.||++|.+.|++|+....++++.|+.+||.+|.+++++++|++ ||+|++||.+...+ ...+|..++.|| T Consensus 1 M~~-~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~~k~~ap~~~~~tg~l~~~I~~~~~k-~~~~g~~~v~Vg 78 (127) T protein:vir:12 1 MAD-MSFDGIDDLTQYFEKIGGDIEKVEPVALKAGGEIIAERQRSHVNRSDKKQPHMQDNITVSNVR-ESKDGVRFVAVG 78 (127) T ss_pred Cee-eeehhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCChhHHHHhhhccccc-cccCceeEEEEe Confidence 754 7777899999999999988889999999999999999999999975 79999999765433 234677888899 Q ss_pred ecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 78 WNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLA 149 (155) Q Consensus 78 ~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~ 149 (155) |++++++||||+||||+ +|||||||+||++.++++++++|.+.|+++|+ T Consensus 79 ~~~~~~~y~~f~E~GT~-----------------------~~~a~Pf~~pa~~~~~~~~~~~~~~~~~~~lk 127 (127) T protein:vir:12 79 PNKKVAYRGRFLEWGTS-----------------------KMPPQPFIEKGGKEGEGPAVELMERILTAPIK 127 (127) T ss_pred eCCCCcceeeeeccCcc-----------------------CCCCCccchHhHHHHHHHHHHHHHHHHHHhcC Confidence 99999999999999985 79999999999999999999999999988888 No 21 >protein:vir:106570 Length: 182 # NCBI annotation: putative protein # Family: family:all:6475 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958588;genbank:gi:41179258;genbank:GeneID:2717106 Probab=99.94 E-value=2.6e-30 Score=182.75 Aligned_cols=147 Identities=12% Similarity=0.079 Sum_probs=108.2 Q ss_pred ceeeeeccHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEE Q lcl|NC_019933. 2 SSKITSLDISGVLSALNDLRDDSDS----VSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVS 77 (155) Q Consensus 2 ~~~m~~~~l~~L~~~l~~l~~~~~~----~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg 77 (155) =.+|+..|+|.|.+.|+.|++.+++ ++.+++.+++..|+++|+.+||++||+|++||....... 
++..++.|+ T Consensus 1 m~~v~i~Gld~L~~kl~~~~~~~~~~v~~a~~~~~~~~a~~v~~~ak~~~PvdtG~Lr~SI~~~~~~~---~~~~~g~V~ 77 (182) T protein:vir:10 1 MIEVELKGVNELRAKLKKLPDIMAKATANAQENAIEQAEAYAVDELQSSIKYSTGELTRSFKHEVKVD---GDEVIGRWW 77 (182) T ss_pred CeEEEEecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCchhhhhceeeeeeec---CCeEEEEee Confidence 1233555899999999999865554 456666777888899999999999999999997654322 345556666 Q ss_pred ecCCccccchhhhccccccCCCc------------CCCCceeeeeecc-------------------cceeeeCCccchh Q lcl|NC_019933. 78 WNKKKAPHGHLVEYGHWRTNVVA------------EVDGKWLFTKEKL-------------------ATPVHVPARSFLR 126 (155) Q Consensus 78 ~~~~~a~~~~~vEfGt~~~~~~~------------~~~~~~~~~~~~~-------------------~gt~~~pa~PFlr 126 (155) . .++|+.||||||..+.... ...+++++....+ .+|.+|||||||+ T Consensus 78 ~---~~~ya~yvE~GTG~~~~~~~~~~~p~~~~~~~~~~w~~~~~~v~~~~a~~~~~~~~~~~~~~~~~t~G~~aqPFl~ 154 (182) T protein:vir:10 78 N---SSMVAVFREFGTGLVGERSHKQLPKNVAIIYRQTPWFFPVDSVDLDLTKIYGIPKIKINGKYFYRTTGQPARQFMT 154 (182) T ss_pred c---CCCccceeecCcccccccCccccCccceeeeecCCceeeccccccccccccccceeeecCceEeecCCCCCCcchH Confidence 4 4568889999996543211 1122222211111 2467899999999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_019933. 
127 PGYDSVKGRLVEVANKAGAKRLAELRSK 154 (155) Q Consensus 127 PA~~~~~~~~~~~i~~~l~~~i~k~~~k 154 (155) |||+.+++++.+.|+++++++|++++.+ T Consensus 155 pA~~~~~~~i~~~i~~~i~~~l~~~~g~ 182 (182) T protein:vir:10 155 PAANKMAKEAPEIIKRSIDQELHDKLGG 182 (182) T ss_pred HHHHHhHHHHHHHHHHHHHHHHHHhhcC Confidence 9999999999999999999999999999 No 22 >protein:vir:3873 Length: 128 # NCBI annotation: putative head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680490;swissprot:trembl:p94214;genbank:gi:22296530;interpro:IPR010064;uniprot:P94214;genbank:GeneID:951688 Probab=99.94 E-value=7.5e-31 Score=185.73 Aligned_cols=122 Identities=17% Similarity=0.151 Sum_probs=108.0 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCc------chhhcceeeeecccccCCceEEE Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKT------GRLKGAIYAVYVPEESTEVRHVY 74 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~t------G~Lr~sI~~~~~~~~~~~g~~~~ 74 (155) |++++. ||++|.+.|++|+..+.++++.|+.+||.+|+++++.++|+++ |+|+++|.+... +..+|..++ T Consensus 1 m~v~i~--Gl~el~~~l~~l~~~~~k~~~~al~~ga~~~~~~~k~~ap~~~~~~~~~~h~~d~I~~~~~--k~~~g~~~~ 76 (128) T protein:vir:38 1 MGVKVT--GDAELLANLNKLQFGVAKEARAAVRDGAQKFADKLKSNTPEWDGETDMSGHLRDDIKLSSV--RETSGLTEV 76 (128) T ss_pred Cccchh--hHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCCCcccchhhhhhccccc--cccCceeEE Confidence 766654 9999999999999988999999999999999999999999865 468999966433 344667778 Q ss_pred EEEecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 
75 AVSWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLA 149 (155) Q Consensus 75 ~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~ 149 (155) .|||.+.+++||||+||||+ +|||||||+||++.++++++++|.+.|+++|- T Consensus 77 ~VG~~k~~~~y~~f~E~GT~-----------------------k~~a~pF~~pa~~~~~~~~~~~~~~~l~k~i~ 128 (128) T protein:vir:38 77 DVGYGKDTGWRAHFPNSGTS-----------------------MQDPQHFIEETQEIMRPVVIAAFLSHLKEGGM 128 (128) T ss_pred EeeecCCCceEEeeeccCcc-----------------------CCCCCcchhHHHHHhHHHHHHHHHHHHHhhcC Confidence 99999999999999999985 79999999999999999999999999999988 No 23 >protein:vir:94538 Length: 125 # NCBI annotation: putative head to tail joining # Family: family:all:180 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223893;genbank:gi:62327105;genbank:GeneID:5075554 Probab=99.94 E-value=2.7e-30 Score=182.73 Aligned_cols=123 Identities=17% Similarity=0.144 Sum_probs=103.8 Q ss_pred Ccee--eeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEe Q lcl|NC_019933. 1 MSSK--ITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSW 78 (155) Q Consensus 1 M~~~--m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~ 78 (155) |... |...||++|.+.|++|.+...+.+.+|+.+++..|.++|+.++|++||+|++||.+...+ ..++..++.||+ T Consensus 1 Ma~~~~i~~~Gld~l~~~L~~~~~~~~~~v~~al~~~a~~i~~~ak~~ap~~tG~L~~sI~~~~~~--~~~~~~~~~v~~ 78 (125) T protein:vir:94 1 MANDFNIKFKGVDKLLDEFDISRKELVPYSVEAMKTSLSRAVEKSKGLARVDTGYMRNNIQQDEVK--EEHGVVTGRYVA 78 (125) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCChhhhhhceeccee--ccCCcEEEEeeC Confidence 6643 333488889999998888888888999999999999999999999999999999765333 335566667775 Q ss_pred cCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 
79 NKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLAEL 151 (155) Q Consensus 79 ~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~k~ 151 (155) .++|++||||||. +|||||||+||++.+++++++.|.+.|+++|+.. T Consensus 79 ---~~~Ya~~vEfGT~-----------------------~~~a~Pfl~pa~~~~~~~~~~~l~~~l~~a~k~~ 125 (125) T protein:vir:94 79 ---RADYSSYNEYGTY-----------------------RMSAQPFMAPSVAAMTPFFYKAVRDALNKAAKFS 125 (125) T ss_pred ---CCCccceeecccc-----------------------cCCCCcccchhHHHHHHHHHHHHHHHHHHHhccC Confidence 4679999999985 6999999999999999999999999999888887 No 24 >protein:vir:95789 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950593;genbank:gi:119953788;genbank:GeneID:5076859 Probab=99.94 E-value=5.3e-30 Score=181.07 Aligned_cols=114 Identities=15% Similarity=0.123 Sum_probs=100.6 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecC Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNK 80 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~ 80 (155) |++++. |||+|.+.|+.|.+...+.+++++.++|..|.++|+.+||++||+||+||.+. ++...+.|++ T Consensus 1 msi~i~--Gld~l~~~l~~~~~~~~~~v~~al~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~-------~~g~~~~V~~-- 69 (114) T protein:vir:95 1 MAIKWQ--GIEKLVATISNAQPKAVEQSLQVLKNNGEKGKRIAKQLAPKDTEFLKDHITTS-------YPGMEAHIHG-- 69 (114) T ss_pred Ceeeee--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCchhhhhceeee-------cCceEEEeec-- Confidence 888775 89999999999998888888999999999999999999999999999999753 2234456664 Q ss_pred CccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 81 KKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLA 149 (155) Q Consensus 81 ~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~ 149 (155) .++|++||||||. 
+|||||||+|||+.+++++.+.|.+.|++.|+ T Consensus 70 -~~~Ya~yvE~GT~-----------------------~~~aqPfl~pa~~~~~~~~~~~l~~~l~~~~k 114 (114) T protein:vir:95 70 -EAGYDGYQEYGTR-----------------------FQPGTPHFRPMMEQIQPQFQKDMTDVMKGAFK 114 (114) T ss_pred -CCCccceeecCcc-----------------------ccCCCccchhhHHHHHHHHHHHHHHHHHhhcC Confidence 4678999999985 69999999999999999999999999999998 No 25 >protein:vir:81106 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429878;genbank:gi:156603931;genbank:GeneID:5525326 Probab=99.93 E-value=3e-29 Score=176.95 Aligned_cols=123 Identities=11% Similarity=0.125 Sum_probs=107.6 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcc--hhhcceeeeecccccCCceEEEEEEe Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTG--RLKGAIYAVYVPEESTEVRHVYAVSW 78 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG--~Lr~sI~~~~~~~~~~~g~~~~~Vg~ 78 (155) |+++++ +++|++.|+.|+....++.+.|+++||.++++.++.++|+++| +|++||.++..+...++|..++.||+ T Consensus 1 M~v~v~---~~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~ 77 (125) T protein:vir:81 1 MGARIE---SNNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGY 77 (125) T ss_pred CeeEee---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEecc Confidence 888877 4789999999998888999999999999999999999998765 49999988766666667888899999 Q ss_pred cCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 
79 NKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLA 149 (155) Q Consensus 79 ~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~ 149 (155) ++.+++|+||+||||+ +|||||||+||++.++++++++|.+.|++-.+ T Consensus 78 ~k~~~~~a~F~E~GT~-----------------------k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:81 78 AKGVSHRIHATEFGTM-----------------------YQKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred CCCCceEEEeccCCcc-----------------------CCCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 9999999999999985 79999999999999999999999777744333 No 26 >protein:vir:9414 Length: 125 # NCBI annotation: phi PVL orf 11-like protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803392;genbank:gi:29028704;genbank:GeneID:1258141 Probab=99.93 E-value=3e-29 Score=176.95 Aligned_cols=123 Identities=11% Similarity=0.125 Sum_probs=107.6 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcc--hhhcceeeeecccccCCceEEEEEEe Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTG--RLKGAIYAVYVPEESTEVRHVYAVSW 78 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG--~Lr~sI~~~~~~~~~~~g~~~~~Vg~ 78 (155) |+++++ +++|++.|+.|+....++.+.|+++||.++++.++.++|+++| +|++||.++..+...++|..++.||+ T Consensus 1 M~v~v~---~~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~ 77 (125) T protein:vir:94 1 MGARIE---SNNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGY 77 (125) T ss_pred CeeEee---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEecc Confidence 888877 4789999999998888999999999999999999999998765 49999988766666667888899999 Q ss_pred cCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 
79 NKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLA 149 (155) Q Consensus 79 ~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~ 149 (155) ++.+++|+||+||||+ +|||||||+||++.++++++++|.+.|++-.+ T Consensus 78 ~k~~~~~a~F~E~GT~-----------------------k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:94 78 AKGVSHRIHATEFGTM-----------------------YQKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred CCCCceEEEeccCCcc-----------------------CCCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 9999999999999985 79999999999999999999999777744333 No 27 >protein:vir:4704 Length: 125 # NCBI annotation: phi PVL ORF 11 homologue # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061636;genbank:gi:9635723;genbank:GeneID:1262995 Probab=99.93 E-value=3e-29 Score=176.95 Aligned_cols=123 Identities=11% Similarity=0.125 Sum_probs=107.6 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcc--hhhcceeeeecccccCCceEEEEEEe Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTG--RLKGAIYAVYVPEESTEVRHVYAVSW 78 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG--~Lr~sI~~~~~~~~~~~g~~~~~Vg~ 78 (155) |+++++ +++|++.|+.|+....++.+.|+++||.++++.++.++|+++| +|++||.++..+...++|..++.||+ T Consensus 1 M~v~v~---~~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~ 77 (125) T protein:vir:47 1 MGARIE---SNNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGY 77 (125) T ss_pred CeeEee---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEecc Confidence 888877 4789999999998888999999999999999999999998765 49999988766666667888899999 Q ss_pred cCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 
79 NKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLA 149 (155) Q Consensus 79 ~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~ 149 (155) ++.+++|+||+||||+ +|||||||+||++.++++++++|.+.|++-.+ T Consensus 78 ~k~~~~~a~F~E~GT~-----------------------k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:47 78 AKGVSHRIHATEFGTM-----------------------YQKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred CCCCceEEEeccCCcc-----------------------CCCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 9999999999999985 79999999999999999999999777744333 No 28 >protein:vir:79988 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430006;genbank:gi:156604061;genbank:GeneID:5525448 Probab=99.93 E-value=3e-29 Score=176.95 Aligned_cols=123 Identities=11% Similarity=0.125 Sum_probs=107.6 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcc--hhhcceeeeecccccCCceEEEEEEe Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTG--RLKGAIYAVYVPEESTEVRHVYAVSW 78 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG--~Lr~sI~~~~~~~~~~~g~~~~~Vg~ 78 (155) |+++++ +++|++.|+.|+....++.+.|+++||.++++.++.++|+++| +|++||.++..+...++|..++.||+ T Consensus 1 M~v~v~---~~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~ 77 (125) T protein:vir:79 1 MGARIE---SNNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGY 77 (125) T ss_pred CeeEee---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEecc Confidence 888877 4789999999998888999999999999999999999998765 49999988766666667888899999 Q ss_pred cCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 
79 NKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLA 149 (155) Q Consensus 79 ~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~ 149 (155) ++.+++|+||+||||+ +|||||||+||++.++++++++|.+.|++-.+ T Consensus 78 ~k~~~~~a~F~E~GT~-----------------------k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:79 78 AKGVSHRIHATEFGTM-----------------------YQKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred CCCCceEEEeccCCcc-----------------------CCCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 9999999999999985 79999999999999999999999777744333 No 29 >protein:vir:98342 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918934;genbank:gi:119443696;genbank:GeneID:4594504 Probab=99.93 E-value=3e-29 Score=176.95 Aligned_cols=123 Identities=11% Similarity=0.125 Sum_probs=107.6 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcc--hhhcceeeeecccccCCceEEEEEEe Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTG--RLKGAIYAVYVPEESTEVRHVYAVSW 78 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG--~Lr~sI~~~~~~~~~~~g~~~~~Vg~ 78 (155) |+++++ +++|++.|+.|+....++.+.|+++||.++++.++.++|+++| +|++||.++..+...++|..++.||+ T Consensus 1 M~v~v~---~~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~ 77 (125) T protein:vir:98 1 MGARIE---SNNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGY 77 (125) T ss_pred CeeEee---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEecc Confidence 888877 4789999999998888999999999999999999999998765 49999988766666667888899999 Q ss_pred cCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 
79 NKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLA 149 (155) Q Consensus 79 ~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~ 149 (155) ++.+++|+||+||||+ +|||||||+||++.++++++++|.+.|++-.+ T Consensus 78 ~k~~~~~a~F~E~GT~-----------------------k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:98 78 AKGVSHRIHATEFGTM-----------------------YQKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred CCCCceEEEeccCCcc-----------------------CCCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 9999999999999985 79999999999999999999999777744333 No 30 >protein:vir:3617 Length: 112 # NCBI annotation: ORF40 # Family: family:all:180 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112703;genbank:gi:13786571;genbank:GeneID:921069 Probab=99.93 E-value=2.6e-29 Score=177.28 Aligned_cols=112 Identities=19% Similarity=0.224 Sum_probs=95.5 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecC Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNK 80 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~ 80 (155) |+++|...||++|.+.|+++.. .+.+++++.+++.+|+++|+.++|++||+|++||.+.. .++...+.||++ T Consensus 1 M~~~i~i~Gld~l~~~L~~~~~--~~~~~~al~~~~~~i~~~ak~~aPvdTG~Lr~si~~~~-----~~~~~~~~V~~~- 72 (112) T protein:vir:36 1 MKSSLSFKGIDQLVKHLDKAAS--LKGVQQVVKSNTSNMTANMQKLVPVDTGYMKRSIKMEL-----TEGGFSGQAGPH- 72 (112) T ss_pred CceeeeehhHHHHHHHHHhhhh--HHHHHHHHHHHHHHHHHHHHHhCCCCchhhhhceeeee-----cCCceEEEeecC- Confidence 9999998889988888887643 46789999999999999999999999999999997542 234456677754 Q ss_pred CccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 81 KKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGA 145 (155) Q Consensus 81 ~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~ 145 (155) ++|++||||||. 
+|||||||+||++.+++++++.|.+.|+ T Consensus 73 --~~Ya~~vE~GT~-----------------------k~~a~Pfl~pa~~~~~~~~~~~i~~~lr 112 (112) T protein:vir:36 73 --TDYSAYVEYGTR-----------------------FQSAQPFVKPAYNEQKGVFIKDLERLLK 112 (112) T ss_pred --CCccceeecccc-----------------------ccCCCcchhhhHHHHHHHHHHHHHHHcC Confidence 668999999995 6999999999999999999998877777 No 31 >protein:vir:9708 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795470;genbank:gi:28876221;genbank:GeneID:1257765 Probab=99.92 E-value=1e-28 Score=174.10 Aligned_cols=121 Identities=12% Similarity=0.078 Sum_probs=107.2 Q ss_pred eeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcch----hhcceeeeecccccCCceEEEEEEecC Q lcl|NC_019933. 5 ITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGR----LKGAIYAVYVPEESTEVRHVYAVSWNK 80 (155) Q Consensus 5 m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~----Lr~sI~~~~~~~~~~~g~~~~~Vg~~~ 80 (155) |. -||++|.+.|++|+...+++.++|+.+||.+|.++++.++|+++|. |++||.+...+ .+.+|..++.|||.+ T Consensus 1 mv-~Gl~el~~~l~~l~~~~~~~~~~al~~ga~~~~~~~k~~ap~~~~~~~~hl~d~I~~~~~k-~~~~g~~~~~VG~~k 78 (125) T protein:vir:97 1 MT-KGLDEILANLTKLEVKAPKTAKAAVTEVAKEFEKALKANTPVYEVETDERLQEDTVISGFK-GANVGIVSKEIGYGK 78 (125) T ss_pred Cc-hhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCCchhhHHhhhhccccc-ccccCceEEEEeecC Confidence 43 4899999999999998899999999999999999999999998775 99999775443 344577778899999 Q ss_pred CccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 
81 KKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLAE 150 (155) Q Consensus 81 ~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~k 150 (155) .+++||||+||||+ +|||||||+||++.++++++++|.+.|+++|.= T Consensus 79 ~~~~y~~f~E~GT~-----------------------k~~~~pF~~pa~~~~k~~~~~~~~~~~~~~L~l 125 (125) T protein:vir:97 79 ATGWRAHYPNDGTI-----------------------YQRGQDFKERTINQMTPKAKQLYAEKVKEGLGL 125 (125) T ss_pred CCceeEeeeccCcc-----------------------CCCcCccchHhHHHhHHHHHHHHHHHHHHHhcC Confidence 99999999999985 799999999999999999999999999888865 No 32 >protein:vir:78858 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285365;genbank:gi:148717893;genbank:GeneID:5246989 Probab=99.92 E-value=1.1e-28 Score=173.85 Aligned_cols=109 Identities=19% Similarity=0.221 Sum_probs=95.3 Q ss_pred eeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CCCcchhhcceeeeecccccCCceEEEEEEe Q lcl|NC_019933. 5 ITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHV------RSKTGRLKGAIYAVYVPEESTEVRHVYAVSW 78 (155) Q Consensus 5 m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~a------P~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~ 78 (155) |+..|||+|.+.|++|++...+.+++++.++|..|.++|++++ |++||+|++||.+.. +|...+.|++ T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~------~g~~~~~v~~ 74 (115) T protein:vir:78 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKK------TGDLQYTITS 74 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeee------cCceEEEeec Confidence 8888999999999999998889999999999999999999998 899999999997641 3445566764 Q ss_pred cCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 
79 NKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGA 145 (155) Q Consensus 79 ~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~ 145 (155) .++|++||||||+ +|||||||+|||+.+++++++.|++.++ T Consensus 75 ---~~~Ya~~vE~GT~-----------------------km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:78 75 ---HAAYSGFLEFGTR-----------------------YMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred ---Cccchhhhccccc-----------------------ccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 4679999999985 7999999999999999999999887777 No 33 >protein:vir:9312 Length: 115 # NCBI annotation: phi Mu50B-like protein # Family: family:all:180 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803290;genbank:gi:29028600;genbank:GeneID:1258048 Probab=99.92 E-value=1.1e-28 Score=173.85 Aligned_cols=109 Identities=19% Similarity=0.221 Sum_probs=95.3 Q ss_pred eeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CCCcchhhcceeeeecccccCCceEEEEEEe Q lcl|NC_019933. 5 ITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHV------RSKTGRLKGAIYAVYVPEESTEVRHVYAVSW 78 (155) Q Consensus 5 m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~a------P~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~ 78 (155) |+..|||+|.+.|++|++...+.+++++.++|..|.++|++++ |++||+|++||.+.. +|...+.|++ T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~------~g~~~~~v~~ 74 (115) T protein:vir:93 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKK------TGDLQYTITS 74 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeee------cCceEEEeec Confidence 8888999999999999998889999999999999999999998 899999999997641 3445566764 Q ss_pred cCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 
79 NKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGA 145 (155) Q Consensus 79 ~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~ 145 (155) .++|++||||||+ +|||||||+|||+.+++++++.|++.++ T Consensus 75 ---~~~Ya~~vE~GT~-----------------------km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:93 75 ---HAAYSGFLEFGTR-----------------------YMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred ---Cccchhhhccccc-----------------------ccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 4679999999985 7999999999999999999999887777 No 34 >protein:vir:103917 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873996;genbank:gi:118430771;genbank:GeneID:4525409 Probab=99.92 E-value=1.1e-28 Score=173.85 Aligned_cols=109 Identities=19% Similarity=0.221 Sum_probs=95.3 Q ss_pred eeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CCCcchhhcceeeeecccccCCceEEEEEEe Q lcl|NC_019933. 5 ITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHV------RSKTGRLKGAIYAVYVPEESTEVRHVYAVSW 78 (155) Q Consensus 5 m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~a------P~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~ 78 (155) |+..|||+|.+.|++|++...+.+++++.++|..|.++|++++ |++||+|++||.+.. +|...+.|++ T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~------~g~~~~~v~~ 74 (115) T protein:vir:10 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKK------TGDLQYTITS 74 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeee------cCceEEEeec Confidence 8888999999999999998889999999999999999999998 899999999997641 3445566764 Q ss_pred cCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 
79 NKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGA 145 (155) Q Consensus 79 ~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~ 145 (155) .++|++||||||+ +|||||||+|||+.+++++++.|++.++ T Consensus 75 ---~~~Ya~~vE~GT~-----------------------km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:10 75 ---HAAYSGFLEFGTR-----------------------YMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred ---Cccchhhhccccc-----------------------ccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 4679999999985 7999999999999999999999887777 No 35 >protein:vir:96225 Length: 115 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239574;genbank:gi:66395330;genbank:GeneID:5132773 Probab=99.92 E-value=1.1e-28 Score=173.85 Aligned_cols=109 Identities=19% Similarity=0.221 Sum_probs=95.3 Q ss_pred eeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CCCcchhhcceeeeecccccCCceEEEEEEe Q lcl|NC_019933. 5 ITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHV------RSKTGRLKGAIYAVYVPEESTEVRHVYAVSW 78 (155) Q Consensus 5 m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~a------P~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~ 78 (155) |+..|||+|.+.|++|++...+.+++++.++|..|.++|++++ |++||+|++||.+.. +|...+.|++ T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~------~g~~~~~v~~ 74 (115) T protein:vir:96 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKK------TGDLQYTITS 74 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeee------cCceEEEeec Confidence 8888999999999999998889999999999999999999998 899999999997641 3445566764 Q ss_pred cCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 
79 NKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGA 145 (155) Q Consensus 79 ~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~ 145 (155) .++|++||||||+ +|||||||+|||+.+++++++.|++.++ T Consensus 75 ---~~~Ya~~vE~GT~-----------------------km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:96 75 ---HAAYSGFLEFGTR-----------------------YMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred ---Cccchhhhccccc-----------------------ccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 4679999999985 7999999999999999999999887777 No 36 >protein:vir:96358 Length: 115 # NCBI annotation: ORF045 # Family: family:all:180 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239651;genbank:gi:66395408;genbank:GeneID:5132834 Probab=99.92 E-value=1.1e-28 Score=173.85 Aligned_cols=109 Identities=19% Similarity=0.221 Sum_probs=95.3 Q ss_pred eeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CCCcchhhcceeeeecccccCCceEEEEEEe Q lcl|NC_019933. 5 ITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHV------RSKTGRLKGAIYAVYVPEESTEVRHVYAVSW 78 (155) Q Consensus 5 m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~a------P~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~ 78 (155) |+..|||+|.+.|++|++...+.+++++.++|..|.++|++++ |++||+|++||.+.. +|...+.|++ T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~------~g~~~~~v~~ 74 (115) T protein:vir:96 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKK------TGDLQYTITS 74 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeee------cCceEEEeec Confidence 8888999999999999998889999999999999999999998 899999999997641 3445566764 Q ss_pred cCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 
79 NKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGA 145 (155) Q Consensus 79 ~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~ 145 (155) .++|++||||||+ +|||||||+|||+.+++++++.|++.++ T Consensus 75 ---~~~Ya~~vE~GT~-----------------------km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:96 75 ---HAAYSGFLEFGTR-----------------------YMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred ---Cccchhhhccccc-----------------------ccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 4679999999985 7999999999999999999999887777 No 37 >protein:vir:97144 Length: 115 # NCBI annotation: ORF047 # Family: family:all:180 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239729;genbank:gi:66394911;genbank:GeneID:5130877 Probab=99.92 E-value=1.1e-28 Score=173.85 Aligned_cols=109 Identities=19% Similarity=0.221 Sum_probs=95.3 Q ss_pred eeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CCCcchhhcceeeeecccccCCceEEEEEEe Q lcl|NC_019933. 5 ITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHV------RSKTGRLKGAIYAVYVPEESTEVRHVYAVSW 78 (155) Q Consensus 5 m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~a------P~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~ 78 (155) |+..|||+|.+.|++|++...+.+++++.++|..|.++|++++ |++||+|++||.+.. +|...+.|++ T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~------~g~~~~~v~~ 74 (115) T protein:vir:97 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKK------TGDLQYTITS 74 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeee------cCceEEEeec Confidence 8888999999999999998889999999999999999999998 899999999997641 3445566764 Q ss_pred cCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 
79 NKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGA 145 (155) Q Consensus 79 ~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~ 145 (155) .++|++||||||+ +|||||||+|||+.+++++++.|++.++ T Consensus 75 ---~~~Ya~~vE~GT~-----------------------km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:97 75 ---HAAYSGFLEFGTR-----------------------YMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred ---Cccchhhhccccc-----------------------ccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 4679999999985 7999999999999999999999887777 No 38 >protein:vir:9930 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795692;genbank:gi:28876456;genbank:GeneID:1257995 Probab=99.92 E-value=1.7e-28 Score=172.85 Aligned_cols=108 Identities=21% Similarity=0.179 Sum_probs=93.3 Q ss_pred eccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecCCccccc Q lcl|NC_019933. 7 SLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNKKKAPHG 86 (155) Q Consensus 7 ~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~~a~~~ 86 (155) +.|||+|.+.|+++.+.+++.+++++.++|..|+++|+.++|++||+|++||.+.. ++...+.|+. .+.|+ T Consensus 1 i~Gld~l~~~l~~~~~~~~~~v~~al~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~------~~~~~~~v~~---~~~Ya 71 (108) T protein:vir:99 1 MRGLDRFLRSVERKQKSVRIAVDKELSKSAARIERQAKILAPVDTGWLRAQIYSEQ------QRLLHYRVVS---PALYS 71 (108) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCchhhhcceeeee------cCcEEEEeec---Ccccc Confidence 66899999999999988899999999999999999999999999999999997642 1223345553 46799 Q ss_pred hhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 87 HLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAK 146 (155) Q Consensus 87 ~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~ 146 (155) +||||||. 
+|||||||+||++.+++++++.|++.|++ T Consensus 72 ~~vE~GT~-----------------------~m~a~Pf~~pa~~~~~~~~~~~i~~~lrk 108 (108) T protein:vir:99 72 IYLELGTR-----------------------KMEAQSFLDPALRKEWPVLMANIKKMFKR 108 (108) T ss_pred hhcccCcc-----------------------ccCCCcchhhhHHHHHHHHHHHHHHHhcC Confidence 99999985 69999999999999999999998887777 No 39 >protein:vir:106623 Length: 115 # NCBI annotation: ORF049 # Family: family:all:180 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239497;genbank:gi:66395260;genbank:GeneID:4555777 Probab=99.92 E-value=2e-28 Score=172.42 Aligned_cols=109 Identities=17% Similarity=0.180 Sum_probs=94.8 Q ss_pred eeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CCCcchhhcceeeeecccccCCceEEEEEEe Q lcl|NC_019933. 5 ITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHV------RSKTGRLKGAIYAVYVPEESTEVRHVYAVSW 78 (155) Q Consensus 5 m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~a------P~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~ 78 (155) |+..|||+|.+.|+++++...+.+++++.+++..|.++|+++| |++||+|++||.+. + +|...+.|+. T Consensus 1 i~i~Gld~L~~~l~~~~~~~~~~~~~al~~~~~~i~~~a~~~a~~~~~~pv~TG~Lr~sI~~~----~--~g~~~~~v~~ 74 (115) T protein:vir:10 1 MQSKGLKKLMNHLKVMHDDIEDDVDDILKNNAKEGVGIAVSNAKEVMNKGYWTGNLASLIEVK----K--IGDLHYRVIS 74 (115) T ss_pred CeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcchhhhhceeee----e--cCcEEEEeeC Confidence 8888999999999999998889999999999999999999998 78999999999754 1 3444556654 Q ss_pred cCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 79 NKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGA 145 (155) Q Consensus 79 ~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~ 145 (155) .++|++|+||||+ +|||||||+|||+.+++.+++.|++.|. 
T Consensus 75 ---~~~Ya~~vEfGT~-----------------------km~a~PFl~PA~~~~k~~~~~~i~~~i~ 115 (115) T protein:vir:10 75 ---TAHYSGFLEFGTR-----------------------YMEPAPFMFPTYQTLKKSTINDLKRLLS 115 (115) T ss_pred ---CCccchheecccc-----------------------cCCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 4779999999985 7999999999999999999999877777 No 40 >protein:vir:99744 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004311;genbank:gi:122891765;genbank:GeneID:4712299 Probab=99.91 E-value=3.2e-28 Score=171.31 Aligned_cols=109 Identities=20% Similarity=0.243 Sum_probs=95.5 Q ss_pred eeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CCCcchhhcceeeeecccccCCceEEEEEEe Q lcl|NC_019933. 5 ITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHV------RSKTGRLKGAIYAVYVPEESTEVRHVYAVSW 78 (155) Q Consensus 5 m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~a------P~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~ 78 (155) |++.|||+|.+.|++|++...+.+++++.+++..|.++|++++ |++||+|++||.... +|...+.|++ T Consensus 1 i~i~Gld~L~~~l~~~~~~~~~~v~~av~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~SI~~~~------~g~~~~~V~~ 74 (115) T protein:vir:99 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKK------TVDLQYTITS 74 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcchhhhhceeeee------cCcEEEEecC Confidence 8888999999999999988889999999999999999999997 999999999997542 3445566664 Q ss_pred cCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 79 NKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGA 145 (155) Q Consensus 79 ~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~ 145 (155) .++|++||||||. 
+|+|||||+|||+.+++.+++.|++.++ T Consensus 75 ---~~~Ya~~vE~GT~-----------------------~m~a~PFl~PA~~~~k~~~~~~l~~~~k 115 (115) T protein:vir:99 75 ---HAAYSGFLEFGTR-----------------------YMEAEPFMWPVYEVIRKSTVEELKTLFE 115 (115) T ss_pred ---Ccccccccccccc-----------------------ccCCCCcchhhHHHHHHHHHHHHHHHhC Confidence 4679999999984 7999999999999999999999987777 No 41 >protein:vir:5978 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690678;genbank:geneid:6329146;genbank:gi:22855072;interpro:IPR011693;uniprot:O48447;genbank:GeneID:955318 Probab=99.91 E-value=1.3e-27 Score=168.06 Aligned_cols=137 Identities=16% Similarity=0.154 Sum_probs=108.5 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecC Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNK 80 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~ 80 (155) |++.+..-++++|...|+++.+...+++++++.++|..|+++|+.++|++||+|++||..... ++..++.|+.+ T Consensus 4 ms~~i~~~g~~~l~~~l~~~~~~~~~~v~~~l~~~a~~i~~~ak~~apv~TG~Lr~SI~~~~~-----~~g~~~~V~~~- 77 (144) T protein:vir:59 4 MSVRIDPSWRRIMSRNVRTFSGHVLTQVEQVIIKTAEKIAGLAASLAPVDEGNLKNSIQIDYK-----NNGLTAEITVG- 77 (144) T ss_pred ceeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcCeeEEee-----cCcEEEEEecC- Confidence 888888788999999999999999999999999999999999999999999999999976431 22245567654 Q ss_pred CccccchhhhccccccCCCcCCC-Cceeeeeec---ccceeeeCCccchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 81 KKAPHGHLVEYGHWRTNVVAEVD-GKWLFTKEK---LATPVHVPARSFLRPGYDSVKGRLVEVANKAGA 145 (155) Q Consensus 81 ~~a~~~~~vEfGt~~~~~~~~~~-~~~~~~~~~---~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~ 145 (155) +.|+.||||||..+....... ..|.|.... ...|.+|||||||+|||+.+++.+.+.|++.+. 
T Consensus 78 --~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~~~~~i~~~~g 144 (144) T protein:vir:59 78 --AEYAIYVEYGTGIYAVDGNGRKTPWTYYSPKLGRYVRTQGAPAQPFFWPAVEEGGEYFEREMRRLRG 144 (144) T ss_pred --CCccchhhcCccccccCCCccccccccccccccceecCCCCCCCcchhHHHHHHHHHHHHHHHHhcC Confidence 568889999997766543322 233333221 124678999999999999999999998888877 No 42 >protein:vir:743 Length: 108 # NCBI annotation: unknown # Family: family:all:180 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108720;genbank:gi:13487842;genbank:GeneID:920877 Probab=99.91 E-value=7.8e-28 Score=169.21 Aligned_cols=108 Identities=14% Similarity=0.179 Sum_probs=85.3 Q ss_pred eeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecCCccc Q lcl|NC_019933. 5 ITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNKKKAP 84 (155) Q Consensus 5 m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~~a~ 84 (155) |...||++|.+.|+++ ...+.+++++.++|..|+++|+.+||++||+|++||.+... ++...+.|++ .++ T Consensus 1 i~i~Gld~l~~~l~~~--~~~~~~~~al~~~a~~i~~~ak~~aPv~TG~Lr~si~~~~~-----~~~~~~~V~~---~~~ 70 (108) T protein:vir:74 1 MKITGIDALQKKLRKN--ATLDDVKHVVKSNTASMNKNMQNLAPVDTGNMKRSITSEFT-----DGGLSGTTGP---HTD 70 (108) T ss_pred CcchhHHHHHHHHHHh--hhHHHHHHHHHHHHHHHHHHHHHhCCCCchhhhccceeeee-----cCceEEEeec---CCC Confidence 4444555555555543 24567899999999999999999999999999999976432 2334556664 467 Q ss_pred cchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 85 HGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGA 145 (155) Q Consensus 85 ~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~ 145 (155) |++||||||. 
+|||||||+||++.+++++++.|.+.|+ T Consensus 71 Ya~~vE~GT~-----------------------km~aqpf~~pa~~~~~~~~~~~i~~~~k 108 (108) T protein:vir:74 71 YAGYVEYGTR-----------------------FQSAQPFVKPAFNIQKKVFTNDLERLTK 108 (108) T ss_pred cccceecccc-----------------------ccCCCcchhhHHHHHHHHHHHHHHHHcC Confidence 9999999995 6999999999999999999998877777 No 43 >protein:vir:94796 Length: 137 # NCBI annotation: ORF050 # Family: family:all:180 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240540;genbank:gi:66396237;genbank:GeneID:5133576 Probab=99.91 E-value=1.3e-27 Score=168.03 Aligned_cols=129 Identities=12% Similarity=0.143 Sum_probs=100.1 Q ss_pred eeec--cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecCCc Q lcl|NC_019933. 5 ITSL--DISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNKKK 82 (155) Q Consensus 5 m~~~--~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~~ 82 (155) |..+ |+++|.+.|+++.+...+.+++++.++|..|+++|+.++|+|||+|++||..... ++...+.|+.+ T Consensus 1 Ma~~~~G~~~l~~~L~~~~~~~~~~~~~al~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~-----~~~~~~~V~~~--- 72 (137) T protein:vir:94 1 MAKVKYGNWDLVKELENYERDIERWVKRGIAKTTVKIHNTIISLMPVDTGYLRESVTMDFK-----DGGFTGVINIG--- 72 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCceeEee-----cCcEEEEEecC--- Confidence 4443 8999999999999999999999999999999999999999999999999975432 22344567754 Q ss_pred cccchhhhccccccCCCcCCC----Cceeeeee--cccceeeeCCccchhhHHHHHHHHHHHHHH Q lcl|NC_019933. 83 APHGHLVEYGHWRTNVVAEVD----GKWLFTKE--KLATPVHVPARSFLRPGYDSVKGRLVEVAN 141 (155) Q Consensus 83 a~~~~~vEfGt~~~~~~~~~~----~~~~~~~~--~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~ 141 (155) ++|++||||||..+....... +.|+|... ....|.+|||||||+||++.+++++++.|. 
T Consensus 73 ~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:94 73 SEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRVFFNKYFS 137 (137) T ss_pred CCcccccccCccccccCCCcccccccccceeccCCceeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 578899999997765444321 22222221 112367899999999999999999999997 No 44 >protein:vir:107099 Length: 137 # NCBI annotation: conserved phage protein # Family: family:all:180 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950610;genbank:gi:119953690;genbank:GeneID:4643108 Probab=99.91 E-value=1.7e-27 Score=167.37 Aligned_cols=131 Identities=13% Similarity=0.161 Sum_probs=101.4 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecC Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNK 80 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~ 80 (155) |+.-. .|+++|.+.|+.|++...+.+++++.++|..|+++|+.+||+|||+|++||.+... .+...+.|+.+ T Consensus 1 Ma~~~--~Gl~~l~~~l~~~~~~~~~~~~~al~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~-----~~~~~~~V~~~- 72 (137) T protein:vir:10 1 MAKVK--YGNWELVKELEDFEKETIRWAKKGIAKTTTIIHNSIVSNMPVDTGYLRESVSMDFK-----KGGLTGVINIG- 72 (137) T ss_pred CchhH--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCeeEEee-----CCcEEEEEecC- Confidence 66442 48999999999999999999999999999999999999999999999999976422 12244566654 Q ss_pred CccccchhhhccccccCCCcCC----CCceeeee--ecccceeeeCCccchhhHHHHHHHHHHHHHH Q lcl|NC_019933. 81 KKAPHGHLVEYGHWRTNVVAEV----DGKWLFTK--EKLATPVHVPARSFLRPGYDSVKGRLVEVAN 141 (155) Q Consensus 81 ~~a~~~~~vEfGt~~~~~~~~~----~~~~~~~~--~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~ 141 (155) ++|++||||||..+...... ...|.|.. .....|.+|||||||+||++.+++++.+.|. 
T Consensus 73 --~~Ya~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~i~k~i~ 137 (137) T protein:vir:10 73 --SEYAVYVNYGTGIYAVGPGGSRAKNIPWCYKDADGHWHTTKGQHAQPFWEPAIDEGRAFFNKYFS 137 (137) T ss_pred --CCcccccccCccccccCCCccccccccceeeccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Confidence 56899999999766533321 12233322 2234677899999999999999999999887 No 45 >protein:vir:98409 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:YP_001210363;genbank:gi:146334932;genbank:GeneID:5114801 Probab=99.90 E-value=1.7e-27 Score=167.38 Aligned_cols=108 Identities=14% Similarity=0.178 Sum_probs=86.2 Q ss_pred eeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecCCccc Q lcl|NC_019933. 5 ITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNKKKAP 84 (155) Q Consensus 5 m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~~a~ 84 (155) |.+.||++|.+.|+++. ..+.++++++++|..|.++|+.+||++||+|++||.+... ++...+.|++ .++ T Consensus 1 i~i~Gld~l~~~l~~~~--~~~~~~~al~~~a~~i~~~ak~~apvdTG~Lr~si~~~~~-----~~~~~~~V~~---~~~ 70 (108) T protein:vir:98 1 MKITGIDALQKKLRKNA--TLNDVKHVVKRNTVSMNKNMQNLAPVDTGNMKRSITSEFT-----DGGLTGTTIP---HTD 70 (108) T ss_pred CcchhHHHHHHHHHHhh--hHHHHHHHHHHHHHHHHHHHHHhCCCCchhhHhhceeeee-----cCceEEEeec---CCC Confidence 55556666666665442 4567899999999999999999999999999999975421 2334566764 456 Q ss_pred cchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 85 HGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGA 145 (155) Q Consensus 85 ~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~ 145 (155) |++||||||. 
+|||||||+||++.+++++++.|.+.|+ T Consensus 71 Ya~~vE~GT~-----------------------~m~aqPFl~pa~~~~~~~~~~~i~~~lr 108 (108) T protein:vir:98 71 YAGYVEYGTR-----------------------FQAAQPFVKPAFDVQKKIFTNDLERLTK 108 (108) T ss_pred ccceeecccc-----------------------ccCCCcchhhHHHHHHHHHHHHHHHHcC Confidence 8999999995 6999999999999999999998877777 No 46 >protein:vir:94490 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240680;genbank:gi:66396374;genbank:GeneID:5133754 Probab=99.90 E-value=2.5e-27 Score=166.43 Aligned_cols=131 Identities=13% Similarity=0.149 Sum_probs=102.3 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecC Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNK 80 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~ 80 (155) |+..+ .|+++|.+.|+++++...+++++++.++|..|+++|+.++|++||+|++||..... ++...+.|+.+ T Consensus 1 Ma~~~--~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~-----~~~~~~~V~~~- 72 (137) T protein:vir:94 1 MAKVK--YGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFK-----DSGFTGVINIG- 72 (137) T ss_pred CchhH--HhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEee-----cCceEEEEecC- Confidence 77664 48999999999999999999999999999999999999999999999999975432 22344567654 Q ss_pred CccccchhhhccccccCCCcCCC----Cceeeeeec--ccceeeeCCccchhhHHHHHHHHHHHHHH Q lcl|NC_019933. 81 KKAPHGHLVEYGHWRTNVVAEVD----GKWLFTKEK--LATPVHVPARSFLRPGYDSVKGRLVEVAN 141 (155) Q Consensus 81 ~~a~~~~~vEfGt~~~~~~~~~~----~~~~~~~~~--~~gt~~~pa~PFlrPA~~~~~~~~~~~i~ 141 (155) ++|++||||||..+....... ..|+|.... ...|.+|||||||+||++.+++++.+.|. 
T Consensus 73 --~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:94 73 --SEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred --CCcccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 568899999997665444321 223332211 12356799999999999999999999997 No 47 >protein:vir:97427 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240753;genbank:gi:66396447;genbank:GeneID:5133783 Probab=99.90 E-value=2.5e-27 Score=166.43 Aligned_cols=131 Identities=13% Similarity=0.149 Sum_probs=102.3 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecC Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNK 80 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~ 80 (155) |+..+ .|+++|.+.|+++++...+++++++.++|..|+++|+.++|++||+|++||..... ++...+.|+.+ T Consensus 1 Ma~~~--~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~-----~~~~~~~V~~~- 72 (137) T protein:vir:97 1 MAKVK--YGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFK-----DSGFTGVINIG- 72 (137) T ss_pred CchhH--HhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEee-----cCceEEEEecC- Confidence 77664 48999999999999999999999999999999999999999999999999975432 22344567654 Q ss_pred CccccchhhhccccccCCCcCCC----Cceeeeeec--ccceeeeCCccchhhHHHHHHHHHHHHHH Q lcl|NC_019933. 81 KKAPHGHLVEYGHWRTNVVAEVD----GKWLFTKEK--LATPVHVPARSFLRPGYDSVKGRLVEVAN 141 (155) Q Consensus 81 ~~a~~~~~vEfGt~~~~~~~~~~----~~~~~~~~~--~~gt~~~pa~PFlrPA~~~~~~~~~~~i~ 141 (155) ++|++||||||..+....... ..|+|.... ...|.+|||||||+||++.+++++.+.|. 
T Consensus 73 --~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:97 73 --SEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred --CCcccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 568899999997665444321 223332211 12356799999999999999999999997 No 48 >protein:vir:93738 Length: 137 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240463;genbank:gi:66396153;genbank:GeneID:5133507 Probab=99.90 E-value=2.5e-27 Score=166.43 Aligned_cols=131 Identities=13% Similarity=0.149 Sum_probs=102.3 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecC Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNK 80 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~ 80 (155) |+..+ .|+++|.+.|+++++...+++++++.++|..|+++|+.++|++||+|++||..... ++...+.|+.+ T Consensus 1 Ma~~~--~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~-----~~~~~~~V~~~- 72 (137) T protein:vir:93 1 MAKVK--YGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFK-----DSGFTGVINIG- 72 (137) T ss_pred CchhH--HhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEee-----cCceEEEEecC- Confidence 77664 48999999999999999999999999999999999999999999999999975432 22344567654 Q ss_pred CccccchhhhccccccCCCcCCC----Cceeeeeec--ccceeeeCCccchhhHHHHHHHHHHHHHH Q lcl|NC_019933. 81 KKAPHGHLVEYGHWRTNVVAEVD----GKWLFTKEK--LATPVHVPARSFLRPGYDSVKGRLVEVAN 141 (155) Q Consensus 81 ~~a~~~~~vEfGt~~~~~~~~~~----~~~~~~~~~--~~gt~~~pa~PFlrPA~~~~~~~~~~~i~ 141 (155) ++|++||||||..+....... ..|+|.... ...|.+|||||||+||++.+++++.+.|. 
T Consensus 73 --~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:93 73 --SEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred --CCcccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 568899999997665444321 223332211 12356799999999999999999999997 No 49 >protein:vir:96486 Length: 112 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238496;genbank:gi:66391772;genbank:GeneID:5176908 Probab=99.90 E-value=3.9e-27 Score=165.36 Aligned_cols=110 Identities=18% Similarity=0.156 Sum_probs=87.4 Q ss_pred CceeeeeccHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEe Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDL--RDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSW 78 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l--~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~ 78 (155) |+. |+..|||+|.+.|+++ .+..++++++++.+.+..|++.|+.++|++||+|++||.+. ++...+.||+ T Consensus 1 Ma~-i~i~Gld~L~~~l~~~~~~~~v~~~v~~~~~~~~~~~~~~a~~~apvdTG~Lr~sI~~~-------~~~~~~~v~~ 72 (112) T protein:vir:96 1 MAT-IEFEGLDEMAQSLLKNASSERRSKVLRKYGAKLKEAAVSKAQFKKGYSTGATRRSITLE-------AGSDRAVVEA 72 (112) T ss_pred Cce-eeehHHHHHHHHHHhhcCHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCchhhhhceeee-------cCceEEEecC Confidence 542 4444666666666666 35678899999999999999999999999999999999653 3445667775 Q ss_pred cCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 79 NKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGA 145 (155) Q Consensus 79 ~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~ 145 (155) + +.|++||||||. 
+|||||||+|||+.+++.+++.|+ .|+ T Consensus 73 ~---~~Ya~~vE~GTr-----------------------~m~AqPF~~PA~~~~~~~~~~~l~-~L~ 112 (112) T protein:vir:96 73 L---TNYSGYLEVGTR-----------------------KMEAQPFMRPALDQVVPEMVEEMA-KWE 112 (112) T ss_pred C---CCccceeccCcc-----------------------ccCCCCchhhhHHHHHHHHHHHHH-hcC Confidence 4 568999999984 799999999999999999988884 333 No 50 >protein:vir:105330 Length: 137 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950673;genbank:gi:119967843;genbank:GeneID:4643209 Probab=99.90 E-value=6.9e-27 Score=164.00 Aligned_cols=131 Identities=12% Similarity=0.152 Sum_probs=99.8 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecC Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNK 80 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~ 80 (155) |+.-. .|+++|.+.|+.+++...+.+++++.++|..|+++|+.+||++||+|++||.+... .+...+.|+.+ T Consensus 1 Ma~~~--~G~~~l~~~l~~~~~~~~~~~~~al~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~-----~~~~~~~V~~~- 72 (137) T protein:vir:10 1 MAKVK--YGNWDLVKELEEFEKETIRWAKKGIAKTTTIIHNSIVSNMPVDTGYLRESVSMDFK-----KGGLTGVINIG- 72 (137) T ss_pred Cccch--hCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCeeeEec-----CCcEEEEEecC- Confidence 54331 48999999999999999999999999999999999999999999999999975422 12244566654 Q ss_pred CccccchhhhccccccCCCcCCC----Cceeeee--ecccceeeeCCccchhhHHHHHHHHHHHHHH Q lcl|NC_019933. 81 KKAPHGHLVEYGHWRTNVVAEVD----GKWLFTK--EKLATPVHVPARSFLRPGYDSVKGRLVEVAN 141 (155) Q Consensus 81 ~~a~~~~~vEfGt~~~~~~~~~~----~~~~~~~--~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~ 141 (155) ..|++||||||..+....... ..|+|.. .....|.+|||||||+||++.+++++.+.|. 
T Consensus 73 --~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 137 (137) T protein:vir:10 73 --SEYAVYVNYGTGIYAVGPGGSRAKNIPWRYKDADGHWHTTKGQHAQPFWEPAIDEGRAFFNKYFS 137 (137) T ss_pred --CccccccccCccccccCCCcccccccceeeeccccccccCCCCCCCcchhHHHHHHHHHHHHhhC Confidence 568999999997655332211 2233321 1223577899999999999999999999887 No 51 >protein:vir:105916 Length: 149 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004379;genbank:gi:122891834;genbank:GeneID:4712387 Probab=99.90 E-value=4.6e-27 Score=164.95 Aligned_cols=131 Identities=15% Similarity=0.172 Sum_probs=100.2 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecC Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNK 80 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~ 80 (155) |+. +. +|+++|.+.|+.+.+...+++++++.+++..|+++|+.++|++||+|++||.+... .+ ...+.|+.+ T Consensus 13 Ma~-v~-~Gld~l~~~l~~~~~~~~~~~~~~l~~~a~~v~~~ak~~aPvdTG~L~~SI~~~~~----~~-g~~~~V~~~- 84 (149) T protein:vir:10 13 MAK-VK-YGADSMVVELDKFDKKIEEWVKKGIAKTTTKIYNTAVALAPVDLGFLEESIDFKYF----DG-GLSSVISVG- 84 (149) T ss_pred hHH-HH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhhccceEEec----CC-cEEEEEecC- Confidence 643 22 48999999999999999999999999999999999999999999999999975422 12 245567754 Q ss_pred CccccchhhhccccccCCCcCCCC----ceeeee--ecccceeeeCCccchhhHHHHHHHHHHHHHH Q lcl|NC_019933. 81 KKAPHGHLVEYGHWRTNVVAEVDG----KWLFTK--EKLATPVHVPARSFLRPGYDSVKGRLVEVAN 141 (155) Q Consensus 81 ~~a~~~~~vEfGt~~~~~~~~~~~----~~~~~~--~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~ 141 (155) +.|+.||||||..+........ .|+|.. .....|.+|||||||+||++.+++++.+.|. 
T Consensus 85 --~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~k~~i~~~i~ 149 (149) T protein:vir:10 85 --ADYAIYVEYGTGIYATGPGGSRATKIPWSFKGDDGEWYTTYGQAPQPFWNPAIDAGRKTFEQYFS 149 (149) T ss_pred --CCcccccccCccccccCCcccccccccceeeccccceecCCCCCCCcchhHHHHHHHHHHHHhhC Confidence 5688999999976554332221 122111 1223577899999999999999999999997 No 52 >protein:vir:95894 Length: 137 # NCBI annotation: ORF046 # Family: family:all:180 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240389;genbank:gi:66396083;genbank:GeneID:5133405 Probab=99.90 E-value=6.5e-27 Score=164.14 Aligned_cols=131 Identities=14% Similarity=0.149 Sum_probs=102.5 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecC Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNK 80 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~ 80 (155) |+..+ .|+++|.+.|+++.+..++++++++.+++..|+++|+.++|++||+|++||..... .+...+.|+.+ T Consensus 1 Ma~~~--~G~~~l~~~l~~~~~~~~~~~~~~~~~~a~~v~~~ak~~aPv~TG~L~~Si~~~~~-----~~~~~~~V~~~- 72 (137) T protein:vir:95 1 MAKVK--YGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFK-----DGGFTGVINIG- 72 (137) T ss_pred CchhH--HhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcCeeeEee-----CCceEEEEecC- Confidence 77665 48999999999999999999999999999999999999999999999999965422 22344567754 Q ss_pred CccccchhhhccccccCCCcCC----CCceeeeee--cccceeeeCCccchhhHHHHHHHHHHHHHH Q lcl|NC_019933. 81 KKAPHGHLVEYGHWRTNVVAEV----DGKWLFTKE--KLATPVHVPARSFLRPGYDSVKGRLVEVAN 141 (155) Q Consensus 81 ~~a~~~~~vEfGt~~~~~~~~~----~~~~~~~~~--~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~ 141 (155) +.|++||||||..+...... .+.|+|... ....|.+|||||||+||++.+++++.+.|. 
T Consensus 73 --~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~i~k~l~ 137 (137) T protein:vir:95 73 --SEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred --CCcccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 56889999999776554432 122333221 112356799999999999999999999987 No 53 >protein:vir:4906 Length: 114 # NCBI annotation: gp114 # Family: family:all:180 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056684;genbank:gi:9635019;genbank:GeneID:1262668 Probab=99.89 E-value=5.5e-27 Score=164.54 Aligned_cols=112 Identities=18% Similarity=0.182 Sum_probs=86.9 Q ss_pred CceeeeeccHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEe Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDL--RDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSW 78 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l--~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~ 78 (155) |. +|+..|||+|.+.|+++ .+..++++++++.+.+..+++.|+.++|++||+|++||.+... ++. +.|++ T Consensus 1 Ma-~i~~~Gld~l~~~L~~~~~~~~v~~~~~~~~~~~~~~~~~~a~~~~p~~TG~Lr~sI~~~~~-----~~~--~~V~~ 72 (114) T protein:vir:49 1 MA-TIEFEGLDEMAQSLLKNASPEKRSKVLRKYGSKLKEAAVNRAQFNKGYSTGATRRSITLQVE-----SDK--ATVEA 72 (114) T ss_pred Ce-eeeeehHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcccCCCCCchhhhhceeeeec-----CCe--eEecC Confidence 55 35555666666666666 3456778888888888888888888899999999999976432 222 34665 Q ss_pred cCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 79 NKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAK 146 (155) Q Consensus 79 ~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~ 146 (155) + ++|++||||||. 
+|||||||+|||+.+++++++.|.+.++- T Consensus 73 ~---~~Ya~~vEfGT~-----------------------km~a~Pfl~PA~~~~~~~~~~~l~~l~k~ 114 (114) T protein:vir:49 73 L---TSYSGYLEVGTR-----------------------KMEAQPFMKPALDEVAPKMVEELAKWDET 114 (114) T ss_pred C---CCccceeccccc-----------------------ccCCCCchhhhHHHHHHHHHHHHHHHhcC Confidence 4 578999999984 79999999999999999999998887777 No 54 >protein:vir:2740 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695113;genbank:gi:23455882;genbank:GeneID:955595 Probab=99.89 E-value=5.5e-27 Score=164.54 Aligned_cols=112 Identities=18% Similarity=0.182 Sum_probs=86.9 Q ss_pred CceeeeeccHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEe Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDL--RDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSW 78 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l--~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~ 78 (155) |. +|+..|||+|.+.|+++ .+..++++++++.+.+..+++.|+.++|++||+|++||.+... ++. +.|++ T Consensus 1 Ma-~i~~~Gld~l~~~L~~~~~~~~v~~~~~~~~~~~~~~~~~~a~~~~p~~TG~Lr~sI~~~~~-----~~~--~~V~~ 72 (114) T protein:vir:27 1 MA-TIEFEGLDEMAQSLLKNASPEKRSKVLRKYGSKLKEAAVNRAQFNKGYSTGATRRSITLQVE-----SDK--ATVEA 72 (114) T ss_pred Ce-eeeeehHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcccCCCCCchhhhhceeeeec-----CCe--eEecC Confidence 55 35555666666666666 3456778888888888888888888899999999999976432 222 34665 Q ss_pred cCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 79 NKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAK 146 (155) Q Consensus 79 ~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~ 146 (155) + ++|++||||||. 
+|||||||+|||+.+++++++.|.+.++- T Consensus 73 ~---~~Ya~~vEfGT~-----------------------km~a~Pfl~PA~~~~~~~~~~~l~~l~k~ 114 (114) T protein:vir:27 73 L---TSYSGYLEVGTR-----------------------KMEAQPFMKPALDEVAPKMVEELAKWDET 114 (114) T ss_pred C---CCccceeccccc-----------------------ccCCCCchhhhHHHHHHHHHHHHHHHhcC Confidence 4 578999999984 79999999999999999999998887777 No 55 >protein:vir:94108 Length: 149 # NCBI annotation: ORF029 # Family: family:all:180 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240238;genbank:gi:66395914;genbank:GeneID:5133277 Probab=99.89 E-value=7.3e-27 Score=163.89 Aligned_cols=131 Identities=15% Similarity=0.172 Sum_probs=99.6 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecC Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNK 80 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~ 80 (155) |.. +. .|+++|.+.|+.+.+...+++++++.++|..|+++|+.++|++||+|++||.+... .+ ...+.|+.+ T Consensus 13 Ma~-~~-~Gld~l~~~L~~~~~~~~~~~~~al~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~----~~-g~~~~V~~~- 84 (149) T protein:vir:94 13 MAK-VK-YGADSMVVELDKFDKKIEEWVKKGIAKTTTKIYNTAVALAPVDLGFLEESIDFKYF----DG-GLSSVISVG- 84 (149) T ss_pred HHH-HH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhhcCeeEEee----CC-cEEEEEecC- Confidence 643 32 48999999999999999999999999999999999999999999999999975422 12 244566654 Q ss_pred CccccchhhhccccccCCCcCCCCc----eeeee--ecccceeeeCCccchhhHHHHHHHHHHHHHH Q lcl|NC_019933. 81 KKAPHGHLVEYGHWRTNVVAEVDGK----WLFTK--EKLATPVHVPARSFLRPGYDSVKGRLVEVAN 141 (155) Q Consensus 81 ~~a~~~~~vEfGt~~~~~~~~~~~~----~~~~~--~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~ 141 (155) +.|++||||||..+......... |.|.. .....|.+|||||||+||++.+++++.+.|. 
T Consensus 85 --~~YA~~VE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~~~~i~~~i~ 149 (149) T protein:vir:94 85 --ADYAIYVEYGTGIYATGPGGSRATKIPWSFKGDDGEWYTTYGQAPQPFWNPAIDAGRKTFEQYFS 149 (149) T ss_pred --CCcccccccCccccccCCCccccccccceeecCccceecCCCCCCCcchHHHHHHHHHHHHHhhC Confidence 56899999999765533322211 11111 1223467799999999999999999999887 No 56 >protein:vir:94654 Length: 142 # NCBI annotation: tail component protein # Family: family:all:1084 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579211;genbank:gi:93007447;genbank:GeneID:5076773 Probab=99.89 E-value=5.8e-26 Score=158.95 Aligned_cols=137 Identities=22% Similarity=0.188 Sum_probs=100.9 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecC Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNK 80 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~ 80 (155) |+.---.+|+++|.+.|+.+.+...+++++++.++|..|+++|+.++|++||+|++||...... + ....++.|+. T Consensus 1 Ma~~~~~~~~~~l~~~l~~~~~~~~~~~~~~l~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~--~-g~~~~~~v~~-- 75 (142) T protein:vir:94 1 MAGLNYRVNSTEFQGALRAALDRLTGAAREATEAAANDMVNMAKGLCPVDTGRLRSSIQAVPSG--G-RFSFSVTIGT-- 75 (142) T ss_pred CceeEEEecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeecc--C-CceEEEEEec-- Confidence 7744444689999999999999999999999999999999999999999999999999754322 1 2233455553 Q ss_pred CccccchhhhccccccCCCcCCCCceeeeee--ccccee---eeCCccchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 81 KKAPHGHLVEYGHWRTNVVAEVDGKWLFTKE--KLATPV---HVPARSFLRPGYDSVKGRLVEVANKAGA 145 (155) Q Consensus 81 ~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~--~~~gt~---~~pa~PFlrPA~~~~~~~~~~~i~~~l~ 145 (155) .++|+.||||||..+. ..+.+.+.+.|.. 
+..+.+ .+||||||+||++.+++++.+.| ++|+ T Consensus 76 -~~~YA~~vE~Gt~~~~-i~pk~~k~l~~~~~~~~~~~v~~pG~~~~pfl~~A~~~~~~~i~~~~-~~~~ 142 (142) T protein:vir:94 76 -NVTYAADVEYGTAPHV-IVPKDKKALYWPGAAHPVAKVNHPGTRAQPFMRPAIAAASTFLRNHA-KGIR 142 (142) T ss_pred -CcccchhhhccCCCce-eccCCCccceecccceeeeeeeecCCCCCcchhHHHHHHHHHHHHHH-HhcC Confidence 4679999999997654 3444444444422 222233 36799999999999887775554 4455 No 57 >protein:vir:96829 Length: 135 # NCBI annotation: ORF033 # Family: family:all:180 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240161;genbank:gi:66395838;genbank:GeneID:5133170 Probab=99.88 E-value=4.4e-26 Score=159.62 Aligned_cols=130 Identities=12% Similarity=0.085 Sum_probs=100.8 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecC Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNK 80 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~ 80 (155) |+.. .+||++|.+.|+++++..++++++++.++|..|+++|+.++|++||+|++||..... ++..++.||. T Consensus 1 Ma~~--~~Gl~~l~~~l~~~~~~~~~~~~~al~~~a~~v~~~ak~~apvdTG~Lr~SI~~~~~-----~~g~~~~V~~-- 71 (135) T protein:vir:96 1 MAKV--KYGADSIVVDLEKYSKDMEKWVKKGITKTTLKIYNTAIHLMPVDTGFLRQSTTVDFE-----NGGFTGVVKI-- 71 (135) T ss_pred Cchh--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeEEee-----cCcEEEEEec-- Confidence 7752 249999999999999999999999999999999999999999999999999976421 2335566774 Q ss_pred CccccchhhhccccccCCCcCCCCceeeeee-----cccceeeeCCccchhhHHHHHHHHHHHHHH Q lcl|NC_019933. 81 KKAPHGHLVEYGHWRTNVVAEVDGKWLFTKE-----KLATPVHVPARSFLRPGYDSVKGRLVEVAN 141 (155) Q Consensus 81 ~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~-----~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~ 141 (155) ...|+.||||||..+....+ .+....|++ ....|.+|||||||+||++.+++++.+.|. 
T Consensus 72 -~~~YA~~ve~GT~~~~~~~~-~~~~~~~~~~~~~g~~~~~~~~~a~pfl~~A~~~~~~~~~~~i~ 135 (135) T protein:vir:96 72 -GSNYAVYVNYGTGIYATKGS-RAHKIPWTYKDPNGKWHTTYGQMPQPFWEPAIDAGRQTFEQYFS 135 (135) T ss_pred -CCCccchhhcccccccCCCc-cccccccccccCCcceeecCCcCCCcchhHHHHHHHHHHHHhcC Confidence 45688899999977654433 222222211 112467899999999999999999888886 No 58 >protein:vir:96121 Length: 137 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240082;genbank:gi:66395767;genbank:GeneID:5133101 Probab=99.88 E-value=7.5e-26 Score=158.33 Aligned_cols=131 Identities=17% Similarity=0.155 Sum_probs=99.7 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecC Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNK 80 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~ 80 (155) |+.-. .|+++|.+.|+++++..++++++++.++|..|+++|+.++|++||+|++||...... +| ..+.|+.+ T Consensus 1 Ma~~~--~G~~~l~~~l~~~~~~~~~~~~~~l~~~a~~~~~~ak~~~pvdTG~L~~Si~~~~~~----~g-~~~~V~~~- 72 (137) T protein:vir:96 1 MAKVK--YGNWDLVAELEDYRDEMEEWVKKGILKTTLAIYNTAVALAPVDLGFLKESIDFKVTD----GG-FSSVISVG- 72 (137) T ss_pred CchhH--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCccchhcCceeEeec----Cc-eEEEEecC- Confidence 66543 389999999999999999999999999999999999999999999999999654321 22 34567754 Q ss_pred CccccchhhhccccccCCCcCCC----Cceeeeee--cccceeeeCCccchhhHHHHHHHHHHHHHH Q lcl|NC_019933. 81 KKAPHGHLVEYGHWRTNVVAEVD----GKWLFTKE--KLATPVHVPARSFLRPGYDSVKGRLVEVAN 141 (155) Q Consensus 81 ~~a~~~~~vEfGt~~~~~~~~~~----~~~~~~~~--~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~ 141 (155) +.|+.||||||..+....... ..|.|... ....|.+|||||||+||++.+++.+.+.|. 
T Consensus 73 --~~YA~yvE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~pFl~pA~~~~~~~i~k~i~ 137 (137) T protein:vir:96 73 --AEYAIYVEFGTGIYATGPGGSRARKLPWTYKGDDGEWHTTYGQQAQPFWNPAIDEGRKVFNRYFS 137 (137) T ss_pred --CCcccccccCccccccCCCccccccccceeeccCcceeecCCCCCCcchhHHHHHHHHHHHHhhC Confidence 568899999996655333221 22332221 112456799999999999999998888887 No 59 >protein:vir:102154 Length: 119 # NCBI annotation: phage protein, HK97 gp10 family # Family: family:all:10671 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699937;genbank:gi:110804042;genbank:GeneID:4206698 Probab=99.87 E-value=6.2e-26 Score=158.77 Aligned_cols=118 Identities=10% Similarity=0.070 Sum_probs=100.0 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecC Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNK 80 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~ 80 (155) |. ++++.|+++|.+.|++|+...+++.++|+++|+++|++++..|+|++||+|++ |..+ ..+ +| .+.||+++ T Consensus 1 Ma-~iel~G~del~~~l~~~g~~~~~ie~kAlk~g~e~I~~~~~~n~P~~tg~lkk-ik~~--~kk--~g--~~~VG~~k 72 (119) T protein:vir:10 1 MA-SLEIEGFEEFEKFISEDMVLDESTKRKGIKAGITKIGKAIEKNSPIKSGRLSK-VKIR--VKN--TG--LATEGTAS 72 (119) T ss_pred Cc-eeehhhHHHHHHHHHhhhhhhHHHHHHHHHHHhHHHHHHHhhcCCcccCCcce-eeee--eec--Cc--eeEeccCC Confidence 43 44555677777777788888889999999999999999999999999999997 4322 222 23 57899999 Q ss_pred CccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCc-cchhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 
81 KKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPAR-SFLRPGYDSVKGRLVEVANKAGAKRLA 149 (155) Q Consensus 81 ~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~-PFlrPA~~~~~~~~~~~i~~~l~~~i~ 149 (155) ..+||+.|.||||+ +|||| ||+.||+++++++|++.|.+.|.+.++ T Consensus 73 s~~fy~kF~EFGTS-----------------------km~a~~pF~~~a~~~~~~eA~~~~~~el~~~~r 119 (119) T protein:vir:10 73 SSEFYDIFQNFGTS-----------------------EQKAHVGYFDRAVDETTNEAVEEVAEIIFRKMR 119 (119) T ss_pred cchhhhhhcccccc-----------------------ccCCCCCccccccccChHHHHHHHHHHHHHhcC Confidence 99999999999996 79999 999999999999999999999998888 No 60 >protein:vir:99101 Length: 142 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1608 # MgeName: Qyrzula # Cross-refs: genbank:acc:YP_655705;genbank:gi:109521783;genbank:GeneID:4157823 Probab=99.86 E-value=3.8e-25 Score=154.49 Aligned_cols=134 Identities=16% Similarity=0.083 Sum_probs=97.2 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecC Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNK 80 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~ 80 (155) |+.++ .+++|+..|+.+.+...+++++++.+.+..|.++||.+||++||+|++||........ ......+.|+ T Consensus 2 ~~~~~---~~~gl~~~l~~~~~~~~~~~~~~i~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~~-~~~~~~~~v~--- 74 (142) T protein:vir:99 2 VQVSV---RYEGFDYNPVGAAAQVGPILRRTHSSLTRQIANETRARVPVLTGHLGRSVREDPQVMV-TPFHVSGGVT--- 74 (142) T ss_pred ceeEE---EeeecchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeeeecccc-ccceEEEEec--- Confidence 44443 4567778888888888999999999999999999999999999999999965432221 1122334444 Q ss_pred CccccchhhhccccccCCCcCCCCceeeeeecc----cceeee---CCccchhhHHHHHHHHHHHHHHH Q lcl|NC_019933. 81 KKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKL----ATPVHV---PARSFLRPGYDSVKGRLVEVANK 142 (155) Q Consensus 81 ~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~----~gt~~~---pa~PFlrPA~~~~~~~~~~~i~~ 142 (155) ..++|+.||||||.. +...+.++.+++|.+.. 
...+++ +|||||+|||+.+..+......+ T Consensus 75 ~~a~YA~~ve~GT~p-h~i~pk~~~al~f~~~g~~~~~k~v~hpG~~a~Pfl~~A~~~~~~~~~~~~~r 142 (142) T protein:vir:99 75 AHAKYAAAVHEGTRP-HVIRAKHAQALHFWWRGREVFVRQVNHPGTRARPYLRNAGEAVVRRDRRIRVR 142 (142) T ss_pred cCccccceeccCCcc-ceeccccCceeeEecCCceeeeeeeecCCCCCCchhHHHHHHHHhhhhhhccC Confidence 357799999999964 45667777776654321 123454 49999999999988875555444 No 61 >protein:vir:8669 Length: 142 # NCBI annotation: gp27 # Family: family:all:1084 # MgeID: mge:156 # MgeName: Rosebush # Cross-refs: genbank:acc:NP_817788;genbank:gi:29566220;genbank:GeneID:1259476 Probab=99.86 E-value=3.8e-25 Score=154.49 Aligned_cols=134 Identities=16% Similarity=0.083 Sum_probs=97.2 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecC Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNK 80 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~ 80 (155) |+.++ .+++|+..|+.+.+...+++++++.+.+..|.++||.+||++||+|++||........ ......+.|+ T Consensus 2 ~~~~~---~~~gl~~~l~~~~~~~~~~~~~~i~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~~-~~~~~~~~v~--- 74 (142) T protein:vir:86 2 VQVSV---RYEGFDYNPVGAAAQVGPILRRTHSSLTRQIANETRARVPVLTGHLGRSVREDPQVMV-TPFHVSGGVT--- 74 (142) T ss_pred ceeEE---EeeecchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeeeecccc-ccceEEEEec--- Confidence 44443 4567778888888888999999999999999999999999999999999965432221 1122334444 Q ss_pred CccccchhhhccccccCCCcCCCCceeeeeecc----cceeee---CCccchhhHHHHHHHHHHHHHHH Q lcl|NC_019933. 81 KKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKL----ATPVHV---PARSFLRPGYDSVKGRLVEVANK 142 (155) Q Consensus 81 ~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~----~gt~~~---pa~PFlrPA~~~~~~~~~~~i~~ 142 (155) ..++|+.||||||.. +...+.++.+++|.+.. 
...+++ +|||||+|||+.+..+......+ T Consensus 75 ~~a~YA~~ve~GT~p-h~i~pk~~~al~f~~~g~~~~~k~v~hpG~~a~Pfl~~A~~~~~~~~~~~~~r 142 (142) T protein:vir:86 75 AHAKYAAAVHEGTRP-HVIRAKHAQALHFWWRGREVFVRQVNHPGTRARPYLRNAGEAVVRRDRRIRVR 142 (142) T ss_pred cCccccceeccCCcc-ceeccccCceeeEecCCceeeeeeeecCCCCCCchhHHHHHHHHhhhhhhccC Confidence 357799999999964 45667777776654321 123454 49999999999988875555444 No 62 >protein:vir:81147 Length: 126 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285816;genbank:gi:148747737;genbank:GeneID:5247190 Probab=99.86 E-value=6.7e-25 Score=153.12 Aligned_cols=122 Identities=26% Similarity=0.336 Sum_probs=95.7 Q ss_pred eeeccHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecC Q lcl|NC_019933. 5 ITSLDISGV----LSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNK 80 (155) Q Consensus 5 m~~~~l~~L----~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~ 80 (155) |+.+.+|+| .+.|+++.+.+.+.++.++.++|..+++++|+++|++||.|++||.++...+. + ....|.++. T Consensus 1 Ma~i~id~la~~I~~~L~~y~~~v~~~v~~~v~~~a~~~~~~ik~~aP~rTG~y~ksw~vk~~~~~---g-~~~~vv~~~ 76 (126) T protein:vir:81 1 MANITIDRLADELLQAVKEYTDDVAEGVRKKVDETARKVLKEAQALAPKRTGEYARTFTITKEDGY---G-TTKRIIWNK 76 (126) T ss_pred CcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcccchhhccccccccccC---C-cceEEEecc Confidence 555555554 44477777889999999999999999999999999999999999987644322 2 234577888 Q ss_pred CccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 
81 KKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRL 148 (155) Q Consensus 81 ~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i 148 (155) .+..+.|++||||+.+.+ | .|||+|||+||++...+++.+.|++.|+..= T Consensus 77 ~~~~l~HLLEfGha~r~g-----G-------------rV~a~Phi~Pa~e~~~~~~~~~i~~~l~~gg 126 (126) T protein:vir:81 77 KHYRRVHLLEFGHAKVNG-----G-------------RVKEYPHLRPAYDKHGARLPDELKRVIENGG 126 (126) T ss_pred CCCCceeeeecceecCCC-----C-------------ccCCCcchHHHHHHHHHHHHHHHHHHhhcCC Confidence 888899999999985432 2 3899999999999988888877776666332 No 63 >protein:vir:95062 Length: 116 # NCBI annotation: ORF044 # Family: family:all:180 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240827;genbank:gi:66394711;genbank:GeneID:5133856 Probab=99.80 E-value=4e-23 Score=143.37 Aligned_cols=110 Identities=14% Similarity=0.142 Sum_probs=83.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecCCccccchhhhccccccCCCcCCC Q lcl|NC_019933. 24 SDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNKKKAPHGHLVEYGHWRTNVVAEVD 103 (155) Q Consensus 24 ~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~ 103 (155) +++++++++.+++..|+++|+++||++||+|++||..... ++...+.|+.+ +.|+.||||||..+....... T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~apv~TG~Lr~SI~~~~~-----~~~~~~~V~~~---~~Ya~yvE~GTg~~~~~~~~~ 72 (116) T protein:vir:95 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFK-----DGGFTGVINIG---SEYAIYVNYGTGIYATGAGGS 72 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHhhCCccccccccceeEEee-----cCcEEEEEecC---CCccceeecCccccccCCCcc Confidence 8899999999999999999999999999999999965421 23345567654 568888999998766433211 Q ss_pred ----Cceeeeee--cccceeeeCCccchhhHHHHHHHHHHHHHH Q lcl|NC_019933. 104 ----GKWLFTKE--KLATPVHVPARSFLRPGYDSVKGRLVEVAN 141 (155) Q Consensus 104 ----~~~~~~~~--~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~ 141 (155) ..|+|... ....|.+|||||||+||++.+++.+++.|. 
T Consensus 73 ~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~is 116 (116) T protein:vir:95 73 RAKNIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred ccccccceeecCccceeeCCCCCCCcchHHHHHHHHHHHHHhhC Confidence 12332211 122477899999999999999998888887 No 64 >protein:vir:1243 Length: 116 # NCBI annotation: similar to phage Spp1 gp16.1 # Family: family:all:180 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510942;genbank:gi:17426276;genbank:GeneID:927389 Probab=99.80 E-value=5.1e-23 Score=142.80 Aligned_cols=110 Identities=14% Similarity=0.147 Sum_probs=84.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecCCccccchhhhccccccCCCcC-- Q lcl|NC_019933. 24 SDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNKKKAPHGHLVEYGHWRTNVVAE-- 101 (155) Q Consensus 24 ~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~~a~~~~~vEfGt~~~~~~~~-- 101 (155) +++++++++.+++..|+++|+.+||++||+|++||..... ++...+.|+.+ ..|+.||||||..+..... T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~aPv~TG~Lr~SI~~~~~-----~~~~~~~V~~~---~~YA~yvE~GTg~~~~~~~~~ 72 (116) T protein:vir:12 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFK-----DGGFTGVINIG---SEYAIYVNYGTGIYATGAGGS 72 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcCcccccccceEEee-----cCcEEEEEecC---CCcccccccCCcccccCCCcc Confidence 8999999999999999999999999999999999965431 22344567654 5688889999987754443 Q ss_pred --CCCceeeeee--cccceeeeCCccchhhHHHHHHHHHHHHHH Q lcl|NC_019933. 102 --VDGKWLFTKE--KLATPVHVPARSFLRPGYDSVKGRLVEVAN 141 (155) Q Consensus 102 --~~~~~~~~~~--~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~ 141 (155) ....|+|... ....|.+|||||||+||++.+++.+.+.|. 
T Consensus 73 ~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 116 (116) T protein:vir:12 73 RAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred cccccceeeecCCceeeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 2223333321 223477899999999999999998888887 No 65 >protein:vir:97327 Length: 116 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240615;genbank:gi:66396305;genbank:GeneID:5133683 Probab=99.80 E-value=5.1e-23 Score=142.80 Aligned_cols=110 Identities=14% Similarity=0.147 Sum_probs=84.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecCCccccchhhhccccccCCCcC-- Q lcl|NC_019933. 24 SDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNKKKAPHGHLVEYGHWRTNVVAE-- 101 (155) Q Consensus 24 ~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~~a~~~~~vEfGt~~~~~~~~-- 101 (155) +++++++++.+++..|+++|+.+||++||+|++||..... ++...+.|+.+ ..|+.||||||..+..... T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~aPv~TG~Lr~SI~~~~~-----~~~~~~~V~~~---~~YA~yvE~GTg~~~~~~~~~ 72 (116) T protein:vir:97 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFK-----DGGFTGVINIG---SEYAIYVNYGTGIYATGAGGS 72 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcCcccccccceEEee-----cCcEEEEEecC---CCcccccccCCcccccCCCcc Confidence 8999999999999999999999999999999999965431 22344567654 5688889999987754443 Q ss_pred --CCCceeeeee--cccceeeeCCccchhhHHHHHHHHHHHHHH Q lcl|NC_019933. 102 --VDGKWLFTKE--KLATPVHVPARSFLRPGYDSVKGRLVEVAN 141 (155) Q Consensus 102 --~~~~~~~~~~--~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~ 141 (155) ....|+|... ....|.+|||||||+||++.+++.+.+.|. 
T Consensus 73 ~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 116 (116) T protein:vir:97 73 RAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred cccccceeeecCCceeeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 2223333321 223477899999999999999998888887 No 66 >protein:vir:78077 Length: 141 # NCBI annotation: gp9 # Family: family:all:180 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468793;genbank:gi:157325374;genbank:GeneID:5601839 Probab=99.80 E-value=1.7e-22 Score=139.93 Aligned_cols=136 Identities=9% Similarity=0.075 Sum_probs=94.7 Q ss_pred eeeccHHH-HHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecCCc Q lcl|NC_019933. 5 ITSLDISG-VLSALNDLRDDSDSVSRT-MAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNKKK 82 (155) Q Consensus 5 m~~~~l~~-L~~~l~~l~~~~~~~~r~-a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~~ 82 (155) |..++|+. .....+.+.+...+.++. |+..++.+|...|+.++|+|||+|++||...... +| ..+.||.+ T Consensus 1 ~~~~~f~~~~~~~~~~~~k~~~~~~~~~a~~~~~~~ie~~ak~~~pvdtG~L~~SI~~~v~~----~g-~~~~V~~~--- 72 (141) T protein:vir:78 1 MNEFEFDSNIPKARKLIEKKVLQALEDIGEHMTTELAEGGHGVTSNNDTGEYAQKSGYKVRK----SS-KEVIVGNS--- 72 (141) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccchhhcceeeeeec----CC-cEEEEecC--- Confidence 88888875 555556666666655555 6777888999999999999999999999654321 12 23457754 Q ss_pred cccchhhhccccccCCCcCCC-Cceeeeeec--ccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 83 APHGHLVEYGHWRTNVVAEVD-GKWLFTKEK--LATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLA 149 (155) Q Consensus 83 a~~~~~vEfGt~~~~~~~~~~-~~~~~~~~~--~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~ 149 (155) +.|+.||||||..+....... +.|+|.... 
..-|..|||||||+||++.+++++.+.|.+.|+ .|+ T Consensus 73 ~~YA~yVE~GTG~~~~~~~grk~~w~y~~~~g~~~~t~G~~aqpFl~~A~~~~~~~i~~~i~~~~~-~l~ 141 (141) T protein:vir:78 73 SDYAIYYEFGTGEKSERGGGKAGGWFYMDKKGHWHFTRGSQASKRMRYTFRDEQDKVRVFTERALR-GIN 141 (141) T ss_pred CCccceeecCCcccccCCCCCcCcceeecCCCeeEeccCCCCchhhhhhHHhhHHHHHHHHHHHhh-ccC Confidence 457778999998776554322 234432111 112456899999999999999998887766554 344 No 67 >protein:vir:79034 Length: 141 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110729;genbank:gi:134287346;genbank:GeneID:4955208 Probab=99.75 E-value=3.9e-21 Score=132.45 Aligned_cols=128 Identities=18% Similarity=0.155 Sum_probs=87.6 Q ss_pred CceeeeeccHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeec----ccccCCceE Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRD----DSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYV----PEESTEVRH 72 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~----~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~----~~~~~~g~~ 72 (155) |+. |..+++++|++.+++|.+ ...+.+++++++.|..+.++++.++||+||+||+||..... +.....+.. T Consensus 1 M~~-~~~~d~~gl~~~~~~l~~~~~~~~~~~~~~~~~~~a~~l~~~vk~~tPVdTG~Lr~sw~~~~~~~~~~~~~~g~~~ 79 (141) T protein:vir:79 1 MAR-WGSVDFREFKRVCKKMEKLTKIDLDKFCKDAARELAARLLGKVIRRTPVDTGFLRQGWNGVAYARSLPVYKQGNNY 79 (141) T ss_pred CCC-CccCcHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhhcccccccccccccceeecCCee Confidence 764 333555566666665543 45678999999999999999999999999999999954311 111122333 Q ss_pred EEEEEecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHH--HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 73 VYAVSWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGY--DSVKGRLVEVANKAGAKRLAE 150 (155) Q Consensus 73 ~~~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~--~~~~~~~~~~i~~~l~~~i~k 150 (155) ++.|+ .+.+|+|||||||. 
.++++||..|+| +...+++.+.+.+.+.+.|++ T Consensus 80 ~v~v~---n~~~YA~~VE~Ghr-----------------------~~~~~gfV~G~fml~~s~~~~~~~~~~~~~~~l~~ 133 (141) T protein:vir:79 80 IIEVV---NPTEYASYVNFGHR-----------------------TKDGKGWVKGQHFLTISEMELQSQVDKIIEKKLLI 133 (141) T ss_pred EEEEe---cCCcchhhhhccee-----------------------ecCCcceeCCchhHHHHHHHHHHHHHHHHHHHHHH Confidence 44555 34789999999995 245566666665 555566666666666777777 Q ss_pred HhccC Q lcl|NC_019933. 151 LRSKR 155 (155) Q Consensus 151 ~~~k~ 155 (155) +|++- T Consensus 134 ~l~~~ 138 (141) T protein:vir:79 134 LLKGV 138 (141) T ss_pred HHHHh Confidence 66666 No 68 >protein:vir:105467 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529877;genbank:gi:90592617;genbank:GeneID:3974531 Probab=99.74 E-value=1.5e-20 Score=129.27 Aligned_cols=138 Identities=16% Similarity=0.119 Sum_probs=93.1 Q ss_pred CceeeeeccHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEE Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDD-----SDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYA 75 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~-----~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~ 75 (155) || |+.+++++|++.+++|.+. .++.+++++.+.|..++++++.++||+||+||+||.+.... . ..+..++. T Consensus 1 Ms--~~~id~~gl~~~~~~l~~~~~~~~~~~~~~~~l~~~~~~~~~~vk~~tPVdTG~Lr~S~~~~~~~-~-~~~~~~~~ 76 (144) T protein:vir:10 1 MS--LGHVDDAQFQQFASRVRQKIDSGYVKQELGKSSRRIGTQSLRILEANTPVKQGNLRRSWTAEGPT-Y-GCGGWTIK 76 (144) T ss_pred CC--CCCccHHHHHHHHHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHhCCCCcchhccceeeccee-e-ecCeeEEE Confidence 55 3344555666666655442 45788999999999999999999999999999999765332 2 23444556 Q ss_pred EEecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_019933. 
76 VSWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLAELRSKR 155 (155) Q Consensus 76 Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~k~~~k~ 155 (155) |+ .+.+|++||||||... +|.++.+.......-++|.+|||.+|.+..+..+.+ .|.+.|++++... T Consensus 77 V~---n~~~YA~~VE~Ghr~~------~G~~v~~~~~~~~~g~V~G~~~~~~a~~~~~~~~~~----~l~k~l~~l~d~~ 143 (144) T protein:vir:10 77 LI---NNAEYASYVESGHRQT------PGRYVPVLKKRLVRDWVPGQFYMKKSIPQIQRQLPQ----LVTEGLWGLKDLF 143 (144) T ss_pred Ee---cCCCcccccccceeec------CCcccccCCCccccceecCccchHHHHHHHHHHHHH----HHHHHHHHHhhhc Confidence 66 4577899999999532 222333322222334689999999999766555544 4566666666555 No 69 >protein:vir:102441 Length: 137 # NCBI annotation: gp26 # Family: family:all:1084 # MgeID: mge:1618 # MgeName: Pipefish # Cross-refs: genbank:acc:YP_655303;genbank:gi:109521866;genbank:GeneID:4157756 Probab=99.72 E-value=9.1e-21 Score=130.46 Aligned_cols=129 Identities=19% Similarity=0.101 Sum_probs=86.8 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecC Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNK 80 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~ 80 (155) |.+++-.. -....|.+...+++++++.+.+..|.++||.++|++||+|++||....... ...+.....|+. T Consensus 1 ~~~~~~~~------~~~~~~~~~~~~v~r~~l~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~-~~~~~~~~~V~~-- 71 (137) T protein:vir:10 1 MTVTARYE------RNPVGEARQFQVIARRRLSRITRGTANQARADVPVKTGNLGRSIREDPIVV-AGPLRLDSGVTA-- 71 (137) T ss_pred CeeEEEec------cCchhHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccchhhhcCceeeeeec-cccceEEEEecC-- Confidence 55555421 112223345667888888999999999999999999999999997653222 122333444553 Q ss_pred CccccchhhhccccccCCCcCCCCc-eeeeeec---cc-ceeee---CCccchhhHHHHHHHHHHHHH Q lcl|NC_019933. 
81 KKAPHGHLVEYGHWRTNVVAEVDGK-WLFTKEK---LA-TPVHV---PARSFLRPGYDSVKGRLVEVA 140 (155) Q Consensus 81 ~~a~~~~~vEfGt~~~~~~~~~~~~-~~~~~~~---~~-gt~~~---pa~PFlrPA~~~~~~~~~~~i 140 (155) .++|+.||||||..| ..++.+++ ..+|+.. .| ..|++ +|+|||+|||+.++.+....- T Consensus 72 -~~~YA~~ve~GT~ph-~I~Pk~~k~~l~~~~~g~~vf~k~V~hPG~~a~PfL~~A~~~~~~~~~~~~ 137 (137) T protein:vir:10 72 -HADYARYVHDGTRAH-VIRPRRPGGVLRFTVGGRVVYARRVNHPGTRARPFLRNAAERVVARETATS 137 (137) T ss_pred -CCccceeeecCCCCc-eeeccccceeeeEeeCCeeEecceeecCCCCCCchHHHHHHHhhhhhcccC Confidence 467899999999654 45666554 4454322 12 23444 599999999999999877655 No 70 >protein:vir:106041 Length: 137 # NCBI annotation: gp23 # Family: family:all:1084 # MgeID: mge:1505 # MgeName: Cooper # Cross-refs: genbank:acc:YP_654920;genbank:gi:109392376;genbank:GeneID:4157069 Probab=99.72 E-value=9.3e-21 Score=130.40 Aligned_cols=129 Identities=16% Similarity=0.069 Sum_probs=83.1 Q ss_pred Cceeeeec-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEec Q lcl|NC_019933. 1 MSSKITSL-DISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWN 79 (155) Q Consensus 1 M~~~m~~~-~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~ 79 (155) |+++...- +.++| .+...++++.++.+.+..|+.+|+.++|++||+|++||....... +....+..||. T Consensus 1 m~~s~~i~i~~~~l-------~~~v~~~~k~~l~~~a~~i~~~ak~~aPv~tG~Lr~SI~~~~~~~--~~~~~~~~v~~- 70 (137) T protein:vir:10 1 MPVTARIHINEPEL-------ERQTGAIFRGKHRSITRRIATQARADVPVRTGNLGRGIQEMPQTY--RPFHVGGGVED- 70 (137) T ss_pred CCeeEEEeeCHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhhcCceeeeecc--ccceEEEEEec- Confidence 66554432 33333 244566778888889999999999999999999999997643222 22334455664 Q ss_pred CCccccchhhhccccccCCCcCCCCceeeeee----cccceeeeC---CccchhhHHHHHHHHHHHHHHHH Q lcl|NC_019933. 80 KKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKE----KLATPVHVP---ARSFLRPGYDSVKGRLVEVANKA 143 (155) Q Consensus 80 ~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~----~~~gt~~~p---a~PFlrPA~~~~~~~~~~~i~~~ 143 (155) .++|+.||||||.. +...+.+++.+.|.. +-...+++| |||||+|||+.... 
....|+-. T Consensus 71 --~~~YA~~ve~GT~p-h~I~pk~~k~l~f~~~G~~v~~k~v~hpG~~a~Pfl~~A~~~~~~-~~~ri~~~ 137 (137) T protein:vir:10 71 --NVDYAAPVHEGSRP-HRITARHANALHFFWHGREVFRKSVWHPGVRPRPFLRNAARRVVA-ADPDIHMT 137 (137) T ss_pred --CCCceeeeeecCCC-ceeecccCceeeeeeCCceEEeeeeecCCCCCCchHHHHHHHHhh-ccccccCC Confidence 46688899999964 455666666554322 122345666 99999999986421 22223222 No 71 >protein:vir:97982 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1482 # MgeName: Orion # Cross-refs: genbank:acc:YP_655121;genbank:gi:109391871;genbank:GeneID:4157345 Probab=99.70 E-value=1.1e-20 Score=129.96 Aligned_cols=129 Identities=18% Similarity=0.192 Sum_probs=82.6 Q ss_pred CceeeeeccH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEec Q lcl|NC_019933. 1 MSSKITSLDI-SGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWN 79 (155) Q Consensus 1 M~~~m~~~~l-~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~ 79 (155) |. .+.- -.|.-....|.+....++++.+.+.+..|..+|+.+||+|||+||+||...... .++...++.|+ T Consensus 1 ~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~ak~~aPvdtG~Lr~SI~~~~~~--~~~~~~~~~v~-- 72 (140) T protein:vir:97 1 MA----TIRARARIEIDEAALERESGEHLRAFHRSLTRRIANQSRVAVPVRTGNLGRTIGELPQV--YTPFRVRGGVE-- 72 (140) T ss_pred Ce----eeeeeeeeeeCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccchhhhccceeeeee--CCCceEEEEec-- Confidence 33 2221 123334445555667788888889999999999999999999999999754221 22233444454 Q ss_pred CCccccchhhhccccccCCCcCCCCcee--eeeec--ccceee---eCCccchhhHHHHH---HHHHHHH Q lcl|NC_019933. 80 KKKAPHGHLVEYGHWRTNVVAEVDGKWL--FTKEK--LATPVH---VPARSFLRPGYDSV---KGRLVEV 139 (155) Q Consensus 80 ~~~a~~~~~vEfGt~~~~~~~~~~~~~~--~~~~~--~~gt~~---~pa~PFlrPA~~~~---~~~~~~~ 139 (155) ..+.|+.||||||..| ...+.+.+.+ ||... -...++ ++|||||+||++.. ++++... 
T Consensus 73 -~~a~YA~~Ve~GT~ph-~I~pk~~k~L~~~~~G~~~~~k~V~hpG~~a~Pfl~~A~~~~~~~~~~i~~~ 140 (140) T protein:vir:97 73 -ATADYAAPVHEGSRPH-AIRARNAQYLHFWWHGREMFRKSVWHPGTRARPFMRNSAQRVVTNDPRVRMT 140 (140) T ss_pred -CCccchhhhccCCCCc-eeecCCCccceeecCCCEEEeeeeecCCCCCChhHHHHHHHHhhhhhhccCC Confidence 4577899999999754 4555555543 33221 113345 45999999999874 3443333 No 72 >protein:vir:107545 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1481 # MgeName: PG1 # Cross-refs: genbank:acc:NP_943803;genbank:gi:38638428;genbank:GeneID:2657225 Probab=99.70 E-value=1.1e-20 Score=129.96 Aligned_cols=129 Identities=18% Similarity=0.192 Sum_probs=82.6 Q ss_pred CceeeeeccH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEec Q lcl|NC_019933. 1 MSSKITSLDI-SGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWN 79 (155) Q Consensus 1 M~~~m~~~~l-~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~ 79 (155) |. .+.- -.|.-....|.+....++++.+.+.+..|..+|+.+||+|||+||+||...... .++...++.|+ T Consensus 1 ~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~ak~~aPvdtG~Lr~SI~~~~~~--~~~~~~~~~v~-- 72 (140) T protein:vir:10 1 MA----TIRARARIEIDEAALERESGEHLRAFHRSLTRRIANQSRVAVPVRTGNLGRTIGELPQV--YTPFRVRGGVE-- 72 (140) T ss_pred Ce----eeeeeeeeeeCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccchhhhccceeeeee--CCCceEEEEec-- Confidence 33 2221 123334445555667788888889999999999999999999999999754221 22233444454 Q ss_pred CCccccchhhhccccccCCCcCCCCcee--eeeec--ccceee---eCCccchhhHHHHH---HHHHHHH Q lcl|NC_019933. 80 KKKAPHGHLVEYGHWRTNVVAEVDGKWL--FTKEK--LATPVH---VPARSFLRPGYDSV---KGRLVEV 139 (155) Q Consensus 80 ~~~a~~~~~vEfGt~~~~~~~~~~~~~~--~~~~~--~~gt~~---~pa~PFlrPA~~~~---~~~~~~~ 139 (155) ..+.|+.||||||..| ...+.+.+.+ ||... -...++ ++|||||+||++.. ++++... 
T Consensus 73 -~~a~YA~~Ve~GT~ph-~I~pk~~k~L~~~~~G~~~~~k~V~hpG~~a~Pfl~~A~~~~~~~~~~i~~~ 140 (140) T protein:vir:10 73 -ATADYAAPVHEGSRPH-AIRARNAQYLHFWWHGREMFRKSVWHPGTRARPFMRNSAQRVVTNDPRVRMT 140 (140) T ss_pred -CCccchhhhccCCCCc-eeecCCCccceeecCCCEEEeeeeecCCCCCChhHHHHHHHHhhhhhhccCC Confidence 4577899999999754 4555555543 33221 113345 45999999999874 3443333 No 73 >protein:vir:966 Length: 123 # NCBI annotation: Orf48 # Family: family:all:970 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076620;genbank:gi:13095728;genbank:GeneID:920248 Probab=99.68 E-value=1.7e-19 Score=123.43 Aligned_cols=122 Identities=15% Similarity=0.156 Sum_probs=96.6 Q ss_pred CceeeeeccH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEec Q lcl|NC_019933. 1 MSSKITSLDI-SGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWN 79 (155) Q Consensus 1 M~~~m~~~~l-~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~ 79 (155) |+.+|++-+| +.+.+.|+++.+.+.+.+..++.+.|..+.++.++.+|++||.+++||.++. ++++.. .|--+ T Consensus 1 m~~~v~id~L~~~i~~~L~~y~~~v~~~v~~~v~~~a~~~~~~lk~~sP~~TG~yaksW~~k~----~~~~~~--~v~~~ 74 (123) T protein:vir:96 1 MANKISIDDLAKTIESEVRNWTKDVVDDIDDIKKDITKNGVKQLRESSPKRTGDYAKNWTSQK----LKNGDQ--VIYQK 74 (123) T ss_pred CCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCccccccccceeeee----cCCeeE--EEEEe Confidence 9999998887 4679999999999999999999999999999999999999999999997653 223322 12223 Q ss_pred CCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 80 KKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAK 146 (155) Q Consensus 80 ~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~ 146 (155) ....+..|++||||.... 
+| +++|+|||+||++...+.+++.|.+.|++ T Consensus 75 ~~~y~l~HLLE~GHa~r~-----GG-------------rV~a~phI~paee~~~~~l~~~i~r~l~~ 123 (123) T protein:vir:96 75 APTYRLTHLLENGHAKRN-----GG-------------RVSPKVHIAPVEEELVSNYISRVEKRLSQ 123 (123) T ss_pred cCCcceEEeeecceeecC-----Cc-------------eeCcchhhhHHHHHHHHHHHHHHHHHhcC Confidence 334457899999996432 12 48999999999988888777777666666 No 74 >protein:vir:4956 Length: 153 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049932;genbank:gi:9632903;genbank:GeneID:1262079 Probab=99.63 E-value=1e-18 Score=119.21 Aligned_cols=123 Identities=16% Similarity=0.275 Sum_probs=87.6 Q ss_pred eeecc--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC---------cchhhcceeeeecccccCCceEE Q lcl|NC_019933. 5 ITSLD--ISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSK---------TGRLKGAIYAVYVPEESTEVRHV 73 (155) Q Consensus 5 m~~~~--l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~---------tG~Lr~sI~~~~~~~~~~~g~~~ 73 (155) |..++ |++|...|+.|.....+.-++|+++||.++++..+.++|.. .|+|++||.+.. ....+...-. T Consensus 1 M~~~~~glee~~~~lekL~~~~~~~~~katkAGA~v~~e~L~~~tp~~h~~~~kt~~~~HlaD~I~~s~-~~idG~~dG~ 79 (153) T protein:vir:49 1 MTGLDEALEGWLKTVASIGDLTPAEQAKITTAGAKVFKEELAEVTREKHYSKKKDLKYGHMADGLAVQS-TNADGRKNGV 79 (153) T ss_pred CccHHHHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCCCCCCcccccceecc-ccccccccce Confidence 44443 55555555555555667789999999999999999999852 359999998752 2222222225 Q ss_pred EEEEecCC-ccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHH--HHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 74 YAVSWNKK-KAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSV--KGRLVEVANKAGAKRLAE 150 (155) Q Consensus 74 ~~Vg~~~~-~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~--~~~~~~~i~~~l~~~i~k 150 (155) ..|||... +++++||+|+||. +|||+||+.++.+.. 
+.++++++ .+.+.+ T Consensus 80 s~VG~~~~~~a~~a~f~n~GT~-----------------------km~~~hFie~tr~e~~~k~~vl~A~----~~~~~~ 132 (153) T protein:vir:49 80 STVGWKNNYHAQNARRLNDGTK-----------------------KYRADHFITNVQNDSTVKNKVLLAE----KEEYEK 132 (153) T ss_pred eeecccCCccceeeeecccCcc-----------------------cCCCChhhHHHHHHhhHHHHHHHHH----HHHHHH Confidence 57999644 4889999999984 799999999999875 45566555 555666 Q ss_pred HhccC Q lcl|NC_019933. 151 LRSKR 155 (155) Q Consensus 151 ~~~k~ 155 (155) +|+++ T Consensus 133 il~~~ 137 (153) T protein:vir:49 133 LIRRK 137 (153) T ss_pred HHHhc Confidence 66666 No 75 >protein:vir:100887 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358767;genbank:gi:77999993;genbank:GeneID:3726158 Probab=99.62 E-value=1.3e-18 Score=118.57 Aligned_cols=127 Identities=14% Similarity=0.114 Sum_probs=92.5 Q ss_pred eeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC----------cchhhcceeeeecccccCCceE Q lcl|NC_019933. 3 SKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSK----------TGRLKGAIYAVYVPEESTEVRH 72 (155) Q Consensus 3 ~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~----------tG~Lr~sI~~~~~~~~~~~g~~ 72 (155) ++|. .+|++|...|+.|.....+.-.+++.+||+++.+..+.++|.. .++|+++|.+.... ..+.... T Consensus 1 v~~~-~~lee~l~~i~kl~~~~~~~~~ki~kaGA~v~~e~L~~~tp~~~~~~~~~~~~~~HlaD~I~~s~~~-~dg~~~g 78 (139) T protein:vir:10 1 MDMD-EALGQWLKQVSKAAELSISDQEKITKAGADVYAKKLAETTKEKHPNTKGDGGKYGHLSEDIRSAAGD-IDGDHNG 78 (139) T ss_pred CCHH-HHHHHHHHHHHHhhccCHHHHHHHHHHHHHHHHHHHHHhcccccCcCCCCCCCCcchhhcceecCcc-cccccce Confidence 3333 3566666666666555666778899999999999999999962 36899999876432 2222223 Q ss_pred EEEEEecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019933. 
73 VYAVSWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLAELR 152 (155) Q Consensus 73 ~~~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~k~~ 152 (155) ...|||+.. ++++||+||||. +|||+||+..+.+..++++++++.+.+++.|.+-. T Consensus 79 ~~~VG~~k~-~~~A~f~n~GT~-----------------------k~~~~hFie~t~~e~~~evl~a~~~~~k~~l~~~~ 134 (139) T protein:vir:10 79 SSTVGFHNK-AHIARFLNDGTK-----------------------YIRADHFVDNARDDAKDAVFAAEAEKYQAMIAKAN 134 (139) T ss_pred eeeeCCCCC-cceEeecccCcc-----------------------ccCCCchHHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 346899764 788999999984 79999999999999999888888666666555443 Q ss_pred ccC Q lcl|NC_019933. 153 SKR 155 (155) Q Consensus 153 ~k~ 155 (155) -+- T Consensus 135 ~~~ 137 (139) T protein:vir:10 135 GGG 137 (139) T ss_pred CCC Confidence 333 No 76 >protein:vir:106506 Length: 137 # NCBI annotation: Pas21 # Family: family:all:1084 # MgeID: mge:1680 # MgeName: phiAsp2 # Cross-refs: genbank:acc:YP_024807;genbank:gi:48697422;genbank:GeneID:2846163 Probab=99.59 E-value=1.4e-18 Score=118.39 Aligned_cols=130 Identities=15% Similarity=0.041 Sum_probs=84.3 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecC Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNK 80 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~ 80 (155) |-.-+.-++...|+ ....+++++++...+..+..+||.++|++||+|++||........ ....+..|+ T Consensus 1 ~~~~~~~l~~~~l~-------~~~~~~~~~~~~~~a~~ve~~ak~~aPv~TG~Lr~SI~~~~~~~~--g~~v~~~V~--- 68 (137) T protein:vir:10 1 MVAHTLRIERAQLH-------GLGMDEARKAVNRVVRRTFTRSQILAPVDTGYLRASGRLVLGRER--GAVVIGSVE--- 68 (137) T ss_pred CcccccccChhhHh-------hHHHHHHHHHHHHHHHHHHHHHHhcCCcCchhhhccceeeeeecc--ccEEEEEec--- Confidence 55554444444333 345678888889999999999999999999999999976433211 112233344 Q ss_pred CccccchhhhccccccCCCcCCCCceeeeee----cccceeeeC---CccchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 
81 KKAPHGHLVEYGHWRTNVVAEVDGKWLFTKE----KLATPVHVP---ARSFLRPGYDSVKGRLVEVANKAGA 145 (155) Q Consensus 81 ~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~----~~~gt~~~p---a~PFlrPA~~~~~~~~~~~i~~~l~ 145 (155) ..++|+.||||||.. +..++++.+.+.|.. +-...|++| |+|||+||++...+..-=. -.|. T Consensus 69 ~~~~YA~~ve~GT~p-h~I~pk~~kaL~f~~~G~~vf~k~V~hPG~k~~PfL~~Al~~~~~~~~~~--~~~~ 137 (137) T protein:vir:10 69 YTARYAAAVHNGRRA-LTIRAKGNGRLKFTVEGRTVYARSVHQPARAGRPYLSQALREVAPQEGFR--VTIG 137 (137) T ss_pred CCcccceeeecCCCC-ceeecCCCccceeecCCeeEeccceecCCCCCChhhHHHHHHhhccccee--EeeC Confidence 457789999999965 456777776654422 122345555 9999999998666532111 1111 No 77 >protein:vir:5000 Length: 141 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049974;genbank:gi:9632946;genbank:GeneID:1262109 Probab=99.59 E-value=3.7e-18 Score=116.13 Aligned_cols=121 Identities=12% Similarity=0.199 Sum_probs=87.7 Q ss_pred eeecc--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC---------CcchhhcceeeeecccccCCceE- Q lcl|NC_019933. 5 ITSLD--ISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRS---------KTGRLKGAIYAVYVPEESTEVRH- 72 (155) Q Consensus 5 m~~~~--l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~---------~tG~Lr~sI~~~~~~~~~~~g~~- 72 (155) |..+. |++|...|+.|.....+.-.+|+.+||+++.+..+.++|. +.++|++||.+... ..+|.. T Consensus 1 M~~~~~gl~e~~~~lekl~~~~~~~~~katkAGA~v~~~~L~~~tp~~hy~~~~~~~~~HlaD~I~~~~~---~~DG~~d 77 (141) T protein:vir:50 1 MVGLAEALDEWLKTVASIGNLTPAEQVEITTAGAKVFKKELEEVTREKHYSRKKNPKFGHMADGLAIQST---NADGRKN 77 (141) T ss_pred CccHHHHHHHHHHHHHHhcCCCHHHHHHHHHHHHHHHHHHHHHhcccCCCCCCCCCCCCccccceeeccC---ccccccC Confidence 55444 4445555555544556778899999999999999999995 35699999987532 223433 Q ss_pred -EEEEEecCCc-cccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHH--HHHHHHHHHHHHHHHH Q lcl|NC_019933. 
73 -VYAVSWNKKK-APHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSV--KGRLVEVANKAGAKRL 148 (155) Q Consensus 73 -~~~Vg~~~~~-a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~--~~~~~~~i~~~l~~~i 148 (155) +..|||.+.+ ++++||+++||. +|||+||+.++.+.. ++++++++ .+.+ T Consensus 78 g~s~VG~~~~~~~~~A~f~n~GT~-----------------------k~~~~hFve~~~~~a~~k~~Vl~A~----~~~~ 130 (141) T protein:vir:50 78 GVSTVGWKNNYHAQNARRLNDGTK-----------------------KYRADHFVTNVQNDSTVQKKVLLEK----KRNT 130 (141) T ss_pred CeeeeccCCCccceeeeccccCcc-----------------------ccCCCchhHHHHHhhhhHHHHHHHH----HHHH Confidence 5579995555 889999999984 799999999999754 45566655 5566 Q ss_pred HHHhccC Q lcl|NC_019933. 149 AELRSKR 155 (155) Q Consensus 149 ~k~~~k~ 155 (155) .++|+++ T Consensus 131 k~~l~~~ 137 (141) T protein:vir:50 131 KNSLEEK 137 (141) T ss_pred HHHHHhc Confidence 6677777 No 78 >protein:vir:102963 Length: 163 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945289;genbank:gi:39653724;uniprot:Q708M3;genbank:GeneID:2672877 Probab=99.55 E-value=4.5e-17 Score=110.19 Aligned_cols=132 Identities=17% Similarity=0.151 Sum_probs=104.2 Q ss_pred CceeeeeccHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHhCCC---------------------------Cc Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLR--DDSDSVSRTMAFESAAVVRDSAKAHVRS---------------------------KT 51 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~--~~~~~~~r~a~~~~a~~i~~eak~~aP~---------------------------~t 51 (155) |+..+..-+|+.+.+.|..+. 
....+.+++.+.+.|..+..+++.+.|+ +| T Consensus 1 m~~~~d~~~l~~f~k~l~~~~~~~~~~~~~~~~~~e~a~~ll~~vk~rtPv~~~~~~~~~~~~~~~k~~k~~~~~~~k~t 80 (163) T protein:vir:10 1 MSGGFDYRSFAKFANNFNRNANHAKVDRFMRQTLNYEGTELKSKVKERTPVGVYTDHWVEFTTKDGKHVKFWASAHGKQG 80 (163) T ss_pred CCCccCHHHHHHHHHHHHHHhhhcchHHHHHHHHHHHHHHHHHHHHHhCCcccchhhhhhhhhcccchhhhhcccccccc Confidence 888877666666666666553 3456789999999999999999999997 79 Q ss_pred chhhcceeeeecccccCCceEEEEEEecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHH Q lcl|NC_019933. 52 GRLKGAIYAVYVPEESTEVRHVYAVSWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDS 131 (155) Q Consensus 52 G~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~ 131 (155) |+||+||.+..... ++++ -.+.|+ .+.+|+|||||||.... + -++|.++||..|.+. T Consensus 81 G~lr~swk~~~~~k-~~~~-~~v~v~---N~~~YA~~VE~GHR~~~------g------------GfV~G~fml~~s~~~ 137 (163) T protein:vir:10 81 GTLQKGWSKSRIEV-SGRT-YKQKVY---NKVYYAPHVEYGHKTVN------G------------GFVPGQFFLHKTVED 137 (163) T ss_pred chhhccceecceee-cCCc-eEEEEE---ecCCccchhhcceeecC------C------------ceeccchhhHHHHHH Confidence 99999998764433 2332 222343 46789999999995432 1 158999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_019933. 132 VKGRLVEVANKAGAKRLAELRSKR 155 (155) Q Consensus 132 ~~~~~~~~i~~~l~~~i~k~~~k~ 155 (155) ...++.+.+.+.|.+-|++++.++ T Consensus 138 ~~~~~~~~~e~~l~~~l~k~~~~~ 161 (163) T protein:vir:10 138 TKSDMEKRVRDKYDGFMRKVVLGN 161 (163) T ss_pred HHHHHHHHHHHHHHHHHHHhhcCC Confidence 999999999999999999999999 No 79 >protein:vir:4859 Length: 140 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049399;genbank:gi:9632427;genbank:GeneID:1258496 Probab=99.55 E-value=2.3e-17 Score=111.81 Aligned_cols=121 Identities=17% Similarity=0.288 Sum_probs=87.6 Q ss_pred eeecc--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC------C---cchhhcceeeeecccccCCceE- Q lcl|NC_019933. 
5 ITSLD--ISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRS------K---TGRLKGAIYAVYVPEESTEVRH- 72 (155) Q Consensus 5 m~~~~--l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~------~---tG~Lr~sI~~~~~~~~~~~g~~- 72 (155) |..++ |++|...|+.|.....+.-.+++.+||.++++..+.++|. + .++|++||.+.. .+.+|.. T Consensus 1 M~~~~d~l~e~~~~lekl~~~~~~~~~katkAGA~v~~~~L~~~tp~~h~~~~~t~~~~HlaD~I~~~~---~~iDg~~~ 77 (140) T protein:vir:48 1 MTGLDEALEGWLKTVASIGDLTPAEQAKITTAGAKVFKEELAEVTRQKHYSNKKHLKYGHMADGLSVQS---TNVDGRKN 77 (140) T ss_pred CccHHHHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCCCCCCcchhceeecc---cccccccC Confidence 44443 5555555555555556778999999999999999999994 2 458999998752 2223322 Q ss_pred -EEEEEecCC-ccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHH--HHHHHHHHHHHHHHHH Q lcl|NC_019933. 73 -VYAVSWNKK-KAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSV--KGRLVEVANKAGAKRL 148 (155) Q Consensus 73 -~~~Vg~~~~-~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~--~~~~~~~i~~~l~~~i 148 (155) +..|||.+. +++++||+++||. +|||+||+.++.+.. +.++++++ .+.+ T Consensus 78 g~s~VG~~kk~~a~~A~f~n~GT~-----------------------k~~~~hFve~~~~e~~~k~~vl~A~----~~~~ 130 (140) T protein:vir:48 78 GVSTVGWVNRYHAQNARRLNDGTK-----------------------KYRADHFVTNVQNDSAVQTKVLLAE----KEEY 130 (140) T ss_pred ceeeeccCCCcceeeeeccccCcc-----------------------ccCCCchhHHHHHhhhhHHHHHHHH----HHHH Confidence 557999755 5899999999984 799999999999865 55666666 4555 Q ss_pred HHHhccC Q lcl|NC_019933. 
149 AELRSKR 155 (155) Q Consensus 149 ~k~~~k~ 155 (155) .++|+++ T Consensus 131 ~~~l~~~ 137 (140) T protein:vir:48 131 EKLIRKK 137 (140) T ss_pred HHHHHhh Confidence 5566666 No 80 >protein:vir:95372 Length: 124 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764480;genbank:gi:115334634;genbank:GeneID:5179259 Probab=99.52 E-value=6.5e-17 Score=109.32 Aligned_cols=116 Identities=21% Similarity=0.221 Sum_probs=83.3 Q ss_pred eeeccHHHH----HHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEE Q lcl|NC_019933. 5 ITSLDISGV----LSALNDLRDDSDSVS----RTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAV 76 (155) Q Consensus 5 m~~~~l~~L----~~~l~~l~~~~~~~~----r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~V 76 (155) |+.+.||+| .+.|+++.+++.+.+ .++..+++..|+.++++.+|++||.+++||..+... ++. | T Consensus 1 M~~i~id~La~~I~~~L~~Ys~~v~~~v~~~v~~vak~a~~~lkk~i~~tspkrTG~YaK~W~~kk~~----e~~----~ 72 (124) T protein:vir:95 1 MAKIKIGRLADEITSQLRKYSQVIADDVEQIMDDVTKEAVGRLKSKIQEVGLVQTGDYMRGWTRKRVP----NGW----V 72 (124) T ss_pred CccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHhcCcccccchhccceeeeec----Cce----e Confidence 666666655 444555555554444 455567777777788889999999999999776443 222 2 Q ss_pred EecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 77 SWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAK 146 (155) Q Consensus 77 g~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~ 146 (155) -+++......|++||||...++. .++|+|+|+||.+...+.+++.|++.|+. 
T Consensus 73 V~nk~~yqLtHLLE~GHAkr~GG------------------RV~a~pHI~paee~~~~~l~~~i~~~l~~ 124 (124) T protein:vir:95 73 IHNKTEYRLAHLLEYGHATVDGG------------------RVPGTPHIRPIEDWLEKEFEDRVEKAIKQ 124 (124) T ss_pred EEEcCCCceeeeeecceeccCCc------------------ccCCccchhHHHHHHHHHHHHHHHHHhcC Confidence 34555555699999999764432 38899999999999999888888887777 No 81 >protein:vir:9879 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:2718 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795641;genbank:gi:28876400;genbank:GeneID:1257931 Probab=99.50 E-value=6.6e-17 Score=109.31 Aligned_cols=117 Identities=14% Similarity=0.047 Sum_probs=81.8 Q ss_pred eccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--CCC-------CcchhhcceeeeecccccCCceEEEEEE Q lcl|NC_019933. 7 SLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAH--VRS-------KTGRLKGAIYAVYVPEESTEVRHVYAVS 77 (155) Q Consensus 7 ~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~--aP~-------~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg 77 (155) +.|+++|+.+|+... .+-+++-+++=...+...|+++ +|+ +||.||+||....+ ++...+.|| T Consensus 1 i~G~~~L~~~Lk~~s---~~dvk~VVkkN~ael~~r~q~~~~~pv~~~~k~~dTG~lkRSi~l~~~-----~~g~~~~vg 72 (127) T protein:vir:98 1 MTGMPALEVKLRSMS---EKRWDRVANKNLTEMFNRAARPPGTPIGKNTKRHKSGELLRSRRLKKV-----NSSKDVITG 72 (127) T ss_pred CcChHHHHHHHHHhh---HHHHHHHHhhhhHHHHHHHHhccCCceeccccccCcccceeeeEEEEe-----cCCceEEec Confidence 678999999998662 2224555555555566666664 888 99999999976543 344556677 Q ss_pred ecCCccccchhhhccccccCCCcCCCCceeeeeeccccee-eeCCccchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 
78 WNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPV-HVPARSFLRPGYDSVKGRLVEVANKAGAK 146 (155) Q Consensus 78 ~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~-~~pa~PFlrPA~~~~~~~~~~~i~~~l~~ 146 (155) +......|+.||||||.-- .+++ ++ ++||||||.|||+..++...+.+.+.+++ T Consensus 73 p~g~t~dYapyvEyGTR~m-----~~~~----------~~gf~~aqp~l~paf~~Qk~iF~~DL~~l~k~ 127 (127) T protein:vir:98 73 NFGYIKDYAPHVEYGHRIV-----RNGK----------QVGYANGTKYLFNNVKKQREIYRQDMLNELRR 127 (127) T ss_pred cCcccccccceeecceeee-----eccc----------ccccccCccccccchHHHhHHHHHHHHHHhcC Confidence 6444467888899999410 0011 11 48899999999999999888888777776 No 82 >protein:vir:100223 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025034;genbank:gi:48697267;genbank:GeneID:2948321 Probab=99.49 E-value=7.9e-17 Score=108.85 Aligned_cols=125 Identities=14% Similarity=0.129 Sum_probs=89.1 Q ss_pred eeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC----------cchhhcceeeeecccccCCce- Q lcl|NC_019933. 3 SKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSK----------TGRLKGAIYAVYVPEESTEVR- 71 (155) Q Consensus 3 ~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~----------tG~Lr~sI~~~~~~~~~~~g~- 71 (155) +.|. .+|++|.+.|+.|.....+.-.+++.+||.++.+..+.++|.. .++|+++|.+... ..+|. T Consensus 1 ~~~~-~~l~e~l~~lekl~~~~~~~~~k~tkaGA~v~~~~L~~~tp~~~~~~~~~~~~~~HlaD~I~~~~~---~idg~~ 76 (139) T protein:vir:10 1 MDMD-EALGQWLKQVSKAAQLSVSDQEKITKAGADVYAKELAETTKEKHPNTKGDGGKYGHLSEDISSAAG---DIDGDH 76 (139) T ss_pred CCHH-HHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHhcccccccCCCCCCCCCcccccceecCc---cccccc Confidence 2333 3566666666666555666677899999999999999999951 3589999977532 22332 Q ss_pred -EEEEEEecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 72 -HVYAVSWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLAE 150 (155) Q Consensus 72 -~~~~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~k 150 (155) -.+.|||.. 
+++.+||+|+||. +|||+||+..+.+..++++++++.+.+++.|.+ T Consensus 77 ~g~~~VG~~~-~~~~Ahf~n~GT~-----------------------~~~~~hFie~t~~e~~~ev~~a~~~~~ke~l~~ 132 (139) T protein:vir:10 77 NGSSTVGFHN-KAHIARFLNDGTK-----------------------NIRADHFVDNARDDAKDAVFAAEAEKYQAMIAK 132 (139) T ss_pred cccceeCCCC-CceeeeeeccCcc-----------------------ccCCCchHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 235688875 4777899999884 799999999999998888888886666555544 Q ss_pred HhccC Q lcl|NC_019933. 151 LRSKR 155 (155) Q Consensus 151 ~~~k~ 155 (155) -.-+- T Consensus 133 ~~~~~ 137 (139) T protein:vir:10 133 ANGGD 137 (139) T ss_pred cCCCC Confidence 33333 No 83 >protein:vir:99528 Length: 92 # NCBI annotation: putative major tail protein # Family: family:all:180 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958541;genbank:gi:41179323;genbank:GeneID:2717166 Probab=99.48 E-value=1.3e-16 Score=107.72 Aligned_cols=91 Identities=15% Similarity=0.168 Sum_probs=67.3 Q ss_pred Cce-eeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEec Q lcl|NC_019933. 1 MSS-KITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWN 79 (155) Q Consensus 1 M~~-~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~ 79 (155) |+- +|...|+++|.+.|++... ...+++.+.+.+..|+.+|+++||++||+||+||..... ++..++.|++. T Consensus 1 Ma~~~i~~~Gld~L~~~L~~~~~--~~~v~~vv~~~~~~l~~~ak~~ap~dTG~lrrSI~~~~~-----~~g~~~~v~~~ 73 (92) T protein:vir:99 1 MADYSISWDGLDALDEALANQQN--MNTVKKVVKKHTANLMTATQQAVPVDTGHLKQSAQIQIS-----RDGFTGSVTYG 73 (92) T ss_pred CCceeeEeehHHHHHHHHHhhcc--HHHHHHHHHHHHHHHHHHHHHhCCCCccccceeeeEEee-----cCCeeEEEEec Confidence 874 5555556666555554322 356889999999999999999999999999999975432 33445566655 Q ss_pred CCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCC Q lcl|NC_019933. 80 KKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPA 121 (155) Q Consensus 80 ~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa 121 (155) .+.+.|+.||||||. 
+|+| T Consensus 74 gp~a~Ya~YvE~GTR-----------------------~M~A 92 (92) T protein:vir:99 74 GGLVNYAAYVEFGTR-----------------------FMDS 92 (92) T ss_pred cCcccccccccccee-----------------------ecCC Confidence 677789999999984 6888 No 84 >protein:vir:4833 Length: 140 # NCBI annotation: ORF29 # Family: family:all:1029 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038330;genbank:gi:9634656;genbank:GeneID:1262624 Probab=99.45 E-value=3.5e-16 Score=105.33 Aligned_cols=120 Identities=17% Similarity=0.241 Sum_probs=85.2 Q ss_pred eeeccHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHhCCCC---------cchhhcceeeeecccccCCce- Q lcl|NC_019933. 5 ITSLDISGVLSALNDL---RDDSDSVSRTMAFESAAVVRDSAKAHVRSK---------TGRLKGAIYAVYVPEESTEVR- 71 (155) Q Consensus 5 m~~~~l~~L~~~l~~l---~~~~~~~~r~a~~~~a~~i~~eak~~aP~~---------tG~Lr~sI~~~~~~~~~~~g~- 71 (155) |..++ ++|++.|+++ .....+.-.+++.+||+++.+..+.++|.. .|+|++||.+.. . +.+|. T Consensus 1 M~~~~-d~l~e~~~~v~kl~~~~~~~~~katkAGAkv~~~~L~~~tp~~h~~~r~t~~~~HlaD~I~~~~-~--~idg~~ 76 (140) T protein:vir:48 1 MTGLD-EALEGWLKTVASIGDLTPAEQAKITTAGAKVFKKELAEVTREKHYSKKKDLKYGHMADGLAVQS-T--NVDGRK 76 (140) T ss_pred CccHH-HHHHHHHHHHHHhccCCHHHHHHHHHHhHHHHHHHHHHhcccCCCCCCCCCCCCcccccceecc-c--cccccc Confidence 55444 3455555555 444456778999999999999999999842 358999998752 2 22232 Q ss_pred -EEEEEEecCC-ccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHH--HHHHHHHHHHHHHHH Q lcl|NC_019933. 72 -HVYAVSWNKK-KAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSV--KGRLVEVANKAGAKR 147 (155) Q Consensus 72 -~~~~Vg~~~~-~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~--~~~~~~~i~~~l~~~ 147 (155) -...|||.+. +++++||+++||+ +|||+||+-.+.+.. 
++++++++.+.+ T Consensus 77 dG~s~VG~~k~~~a~~a~f~NdGT~-----------------------k~~~~hFve~t~~e~~~~~~vl~A~~~~y--- 130 (140) T protein:vir:48 77 NGVATVGWKNNYHAQNARRLNDGTK-----------------------KYRADHFVTNVQNDSAVRDKVLLAEKEEY--- 130 (140) T ss_pred ccceeecccCCCceeEEeecccCcc-----------------------ccCCCchHHHHHHhhhhHHHHHHHHHHHH--- Confidence 2446999866 4889999999884 799999999999744 677777775555 Q ss_pred HHHHhccC Q lcl|NC_019933. 148 LAELRSKR 155 (155) Q Consensus 148 i~k~~~k~ 155 (155) .++|.++ T Consensus 131 -~~~l~kk 137 (140) T protein:vir:48 131 -EKLIRKK 137 (140) T ss_pred -HHHHHhh Confidence 4555555 No 85 >protein:vir:80116 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425608;genbank:gi:155042941;genbank:GeneID:5469542 Probab=99.43 E-value=5.7e-16 Score=104.16 Aligned_cols=119 Identities=18% Similarity=0.200 Sum_probs=78.1 Q ss_pred eeeccHHHH----HHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEE Q lcl|NC_019933. 5 ITSLDISGV----LSALNDLRDDSDSVSRTMA----FESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAV 76 (155) Q Consensus 5 m~~~~l~~L----~~~l~~l~~~~~~~~r~a~----~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~V 76 (155) |+.+.+|+| .+.|+++.+++.+.+..++ .++++.++.+++...|++||.+++||..+... ++. + T Consensus 1 M~~i~id~La~~I~~~L~~y~~~v~~~v~~~v~evak~a~~~lkk~i~~tsPkrTG~YaK~W~~k~~~----~~~----~ 72 (127) T protein:vir:80 1 MANIKIDRLGDEITRQLKRYSQVIAGDLEQIMDDVSKEAVDRLKAKIEEEGLVQTGDYKRGWTRKRTP----GGW----V 72 (127) T ss_pred CccccHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCccccccccccceeeecc----Cce----e Confidence 666666655 4444455566666666666 55555666666679999999999999765432 221 2 Q ss_pred EecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 
77 SWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLA 149 (155) Q Consensus 77 g~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~ 149 (155) -+++......|++||||....+. .++|+|+|+||.+...+++++.|.+.|+-.=. T Consensus 73 v~nk~~yqLtHLLE~GHAkr~GG------------------RV~a~pHI~paee~~~~~l~~~i~~~l~~~~~ 127 (127) T protein:vir:80 73 IHNKTEYRLAHLLEYGHATVDGG------------------RVPETPHIRPVEDWLEKEFEDRVERAIKNESR 127 (127) T ss_pred EeecCCcceeehhhcceeccCCc------------------ccCCccchhhHHHHHHHHHHHHHHHHhcCCCC Confidence 23444445699999999764432 37899999999988777776666555543222 No 86 >protein:vir:100652 Length: 134 # NCBI annotation: 77ORF029 # Family: family:all:589 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958610;genbank:gi:41189542;genbank:GeneID:2743798 Probab=99.42 E-value=1.1e-15 Score=102.53 Aligned_cols=120 Identities=18% Similarity=0.240 Sum_probs=93.8 Q ss_pred CceeeeeccHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHhCCC--CcchhhcceeeeecccccCCceEEEEE Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDL--RDDSDSVSRTMAFESAAVVRDSAKAHVRS--KTGRLKGAIYAVYVPEESTEVRHVYAV 76 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l--~~~~~~~~r~a~~~~a~~i~~eak~~aP~--~tG~Lr~sI~~~~~~~~~~~g~~~~~V 76 (155) ||++++ |+++|+..|++. +..+.++..+|+.+++++|.++++.+.++ |||.+.+++..+... ..+|.-++.| T Consensus 1 Msvevk--Gv~eil~~LE~k~g~~~~~ri~dkAL~~age~v~~~~K~~~~~fkDTGati~ev~~s~p~--~~~G~r~V~v 76 (134) T protein:vir:10 1 MSVKVT--GDKALERELEKHFGIKEMVKVQDKALIAGAKVIVEEIKKQLKPSEDSGALISEIGRTEPE--WIKGKRTVTI 76 (134) T ss_pred CeEEee--cHHHHHHHHHHhhchhhhhhhhhHHHHHHhHHHHHHHHhhcCccccccceeccEeecCee--ecCCceEEEE Confidence 777766 788998888887 66788999999999999999999998876 999999999775433 3367788999 Q ss_pred EecCCc--cccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhh--------HHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 77 SWNKKK--APHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRP--------GYDSVKGRLVEVANKAGAK 146 (155) Q Consensus 77 g~~~~~--a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrP--------A~~~~~~~~~~~i~~~l~~ 146 (155) ||..+. 
.++.|+.|||+. ++..-||++| |++..+..+.+.++..|+ T Consensus 77 gW~G~~~R~~ivHLnE~Gyt-----------------------~~r~Gk~i~PrG~G~i~~a~~~~e~~~~~~ik~eL~- 132 (134) T protein:vir:10 77 RWRGPFERFRIVHLIENGHV-----------------------EKKSGKFVKPKAMGGINRAIRQGQNKYFETLKRELK- 132 (134) T ss_pred EEEcCCceeeEEEeeeccee-----------------------ecCCCCeeccchhhHHHHHHHhhhHHHHHHHHHHHh- Confidence 997663 568899999984 4567889999 666666655555544444 Q ss_pred HHHHH Q lcl|NC_019933. 147 RLAEL 151 (155) Q Consensus 147 ~i~k~ 151 (155) ++ T Consensus 133 ---kl 134 (134) T protein:vir:10 133 ---KL 134 (134) T ss_pred ---cC Confidence 44 No 87 >protein:vir:101302 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908835;genbank:gi:118725099;genbank:GeneID:4555873 Probab=99.39 E-value=2.4e-15 Score=100.77 Aligned_cols=120 Identities=19% Similarity=0.286 Sum_probs=92.8 Q ss_pred CceeeeeccHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHhCCC--CcchhhcceeeeecccccCCceEEEEE Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDL--RDDSDSVSRTMAFESAAVVRDSAKAHVRS--KTGRLKGAIYAVYVPEESTEVRHVYAV 76 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l--~~~~~~~~r~a~~~~a~~i~~eak~~aP~--~tG~Lr~sI~~~~~~~~~~~g~~~~~V 76 (155) ||++++ |+++|+..|++. +..+.++..+|+.++++.|.++++.+.++ |||.+.+++..+... ..+|.-++.| T Consensus 1 msvevk--Gv~eil~~le~k~g~~~~~ri~nkAL~~age~v~~~~K~~~~~fkDTG~t~~ev~~s~p~--~~~G~r~V~v 76 (134) T protein:vir:10 1 MSVKVI--GDKALERELEKRFGIKEMVKVQDKALIAGAKVIVEEVKKQLKPSKDTGALINEVSFSKPE--WINGKRTITV 76 (134) T ss_pred CeEEEe--cHHHHHHHHHHhhchhhhhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeccEEecCee--ecCCceEEEE Confidence 776665 888999888887 66788999999999999999999999996 999999999765433 3457778899 Q ss_pred EecCC--ccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhh--------HHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 
77 SWNKK--KAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRP--------GYDSVKGRLVEVANKAGAK 146 (155) Q Consensus 77 g~~~~--~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrP--------A~~~~~~~~~~~i~~~l~~ 146 (155) ||..+ ..++.|+.|||+.+. ...||++| |++..+..+.+.++..|+ T Consensus 77 gW~G~~~R~~iiHLNE~Gytr~-----------------------~~Gk~i~PrG~G~i~~a~~~~e~~~~~~ik~eL~- 132 (134) T protein:vir:10 77 HWRGSKDRYKIVHLIEYGHVQK-----------------------GTGKFIKPKAMGGVNRAIRQGQNKYFETLKRELK- 132 (134) T ss_pred EEEcCCceeEEEEeecccceec-----------------------ccCCccCcchhhHHHHHHHhhhHHHHHHHHHHHh- Confidence 99766 357899999997532 25678888 666666655555554444 Q ss_pred HHHHH Q lcl|NC_019933. 147 RLAEL 151 (155) Q Consensus 147 ~i~k~ 151 (155) ++ T Consensus 133 ---kl 134 (134) T protein:vir:10 133 ---KL 134 (134) T ss_pred ---cC Confidence 44 No 88 >protein:vir:9513 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835560;genbank:gi:30043947;genbank:GeneID:1260542 Probab=99.39 E-value=2.4e-15 Score=100.77 Aligned_cols=120 Identities=19% Similarity=0.286 Sum_probs=92.8 Q ss_pred CceeeeeccHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHhCCC--CcchhhcceeeeecccccCCceEEEEE Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDL--RDDSDSVSRTMAFESAAVVRDSAKAHVRS--KTGRLKGAIYAVYVPEESTEVRHVYAV 76 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l--~~~~~~~~r~a~~~~a~~i~~eak~~aP~--~tG~Lr~sI~~~~~~~~~~~g~~~~~V 76 (155) ||++++ |+++|+..|++. +..+.++..+|+.++++.|.++++.+.++ |||.+.+++..+... 
..+|.-++.| T Consensus 1 msvevk--Gv~eil~~le~k~g~~~~~ri~nkAL~~age~v~~~~K~~~~~fkDTG~t~~ev~~s~p~--~~~G~r~V~v 76 (134) T protein:vir:95 1 MSVKVI--GDKALERELEKRFGIKEMVKVQDKALIAGAKVIVEEVKKQLKPSKDTGALINEVSFSKPE--WINGKRTITV 76 (134) T ss_pred CeEEEe--cHHHHHHHHHHhhchhhhhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeccEEecCee--ecCCceEEEE Confidence 776665 888999888887 66788999999999999999999999996 999999999765433 3457778899 Q ss_pred EecCC--ccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhh--------HHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 77 SWNKK--KAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRP--------GYDSVKGRLVEVANKAGAK 146 (155) Q Consensus 77 g~~~~--~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrP--------A~~~~~~~~~~~i~~~l~~ 146 (155) ||..+ ..++.|+.|||+.+. ...||++| |++..+..+.+.++..|+ T Consensus 77 gW~G~~~R~~iiHLNE~Gytr~-----------------------~~Gk~i~PrG~G~i~~a~~~~e~~~~~~ik~eL~- 132 (134) T protein:vir:95 77 HWRGSKDRYKIVHLIEYGHVQK-----------------------GTGKFIKPKAMGGVNRAIRQGQNKYFETLKRELK- 132 (134) T ss_pred EEEcCCceeEEEEeecccceec-----------------------ccCCccCcchhhHHHHHHHhhhHHHHHHHHHHHh- Confidence 99766 357899999997532 25678888 666666655555554444 Q ss_pred HHHHH Q lcl|NC_019933. 147 RLAEL 151 (155) Q Consensus 147 ~i~k~ 151 (155) ++ T Consensus 133 ---kl 134 (134) T protein:vir:95 133 ---KL 134 (134) T ss_pred ---cC Confidence 44 No 89 >protein:vir:9647 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795409;genbank:gi:28876182;genbank:GeneID:1257731 Probab=99.16 E-value=2.8e-13 Score=89.37 Aligned_cols=128 Identities=13% Similarity=0.040 Sum_probs=102.5 Q ss_pred CceeeeeccHHHHHHHHHH-HHH-HHHHHHHHHHHHHHHHHHHHHHHhCCC--CcchhhcceeeeecccccCCceEEEEE Q lcl|NC_019933. 
1 MSSKITSLDISGVLSALND-LRD-DSDSVSRTMAFESAAVVRDSAKAHVRS--KTGRLKGAIYAVYVPEESTEVRHVYAV 76 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~-l~~-~~~~~~r~a~~~~a~~i~~eak~~aP~--~tG~Lr~sI~~~~~~~~~~~g~~~~~V 76 (155) ||..-+..|+++|.+.|++ |++ .+.++..+|+.++|++|.++.+.+.|+ |||.+-++|...... ..+|...+.| T Consensus 1 ~~~~aevkGv~Eilk~lE~klG~~~v~ri~nkAL~~~ge~v~~~lK~~~~~f~DTG~t~dev~~s~~~--~~~G~r~V~V 78 (132) T protein:vir:96 1 MSGFANLKGVEELLANMEKKLGPAKVNRVVNRSLKEIGKELEPSFKSAISIYKRTGETTESAVVSGVR--REDGIPKVKL 78 (132) T ss_pred CCccccccCHHHHHHHHHHhhCHHHHHHHhHHHHHHHHHHHHHHHHHhhhhhhhcchhhcceeecCee--ecCCceEEEe Confidence 8877777789999999998 776 588999999999999999999999996 999999999765433 4568889999 Q ss_pred EecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 77 SWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLAE 150 (155) Q Consensus 77 g~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~k 150 (155) ||+.+-...-|+.|||+.... ++ +..=+++-|++..+..+.+.+++.|++.|+- T Consensus 79 gW~GpR~~ivHLNE~GyGk~~---~P-----------------rG~G~I~~a~~~se~~~~~~~~~elkk~l~~ 132 (132) T protein:vir:96 79 GFTTPRWNIVHLQELEYGWKH---NR-----------------RGVGVIRRYSDILETIYPRGIRDKLKRGFDG 132 (132) T ss_pred cccCCceeEEeeecccccCCc---CC-----------------CcchHHHHHHHhhhhHHHHHHHHHHHHHhcC Confidence 999776667788899983211 11 1234799999999988888887777776665 No 90 >protein:vir:3848 Length: 159 # NCBI annotation: hypothetical protein # Family: family:all:1029 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050154;swissprot:trembl:q9t1f3;genbank:gi:9633046;uniprot:Q9T1F3;genbank:GeneID:1262148 Probab=99.14 E-value=3.1e-13 Score=89.16 Aligned_cols=128 Identities=16% Similarity=0.110 Sum_probs=91.3 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC-----------------------cchhhcc Q lcl|NC_019933. 
1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSK-----------------------TGRLKGA 57 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~-----------------------tG~Lr~s 57 (155) |..+|. .+|+++...++.+.....+.-.++..+||+++++-.+..+|.. .|+|++| T Consensus 1 mm~~~~-~~l~~~l~~v~k~~~~~~~~k~kiTkAGAkv~~e~L~~~Tp~~h~~~~k~~~~~~~~~k~~~~~~~~~HlaD~ 79 (159) T protein:vir:38 1 MANDMG-EFYNNWVNEVEKGMKLSVEDKAKITGEGAEAFSTVLHDHTPRSNEIYRRGRSAGHANAKHHNRNRKTKHLQDS 79 (159) T ss_pred CcchHH-HHHHHHHHHHHHhcCCCHHHHHHHHHHhHHHHHHHHHHhcccCCCccccccccccccccccCcCcCCCccccc Confidence 555554 2244444444443334456678889999999999999999962 3699999 Q ss_pred eeeeecccccCCceE--EEEEEecCC-ccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCc-----cchhhHH Q lcl|NC_019933. 58 IYAVYVPEESTEVRH--VYAVSWNKK-KAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPAR-----SFLRPGY 129 (155) Q Consensus 58 I~~~~~~~~~~~g~~--~~~Vg~~~~-~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~-----PFlrPA~ 129 (155) |.+... ...||.. ...|||.+. ++++++|+..|| ++|||+ +|+--+. T Consensus 80 I~~~~~--~~iDg~~dG~s~VGw~~~~~a~~a~f~NdGT-----------------------~~m~~k~~~gdHFvekt~ 134 (159) T protein:vir:38 80 ITYKPG--YTADKLHTGDTDVGFEGKYYDFLAKIVNNGQ-----------------------HHMSPKRYKNMHFLDKAQ 134 (159) T ss_pred eeeecC--ccccccccceeeecccCCccceEeeecccCc-----------------------cccCCCCccCChhHHHHH Confidence 976532 2334433 467999544 478888888887 478887 8999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_019933. 
130 DSVKGRLVEVANKAGAKRLAELRSK 154 (155) Q Consensus 130 ~~~~~~~~~~i~~~l~~~i~k~~~k 154 (155) +..++++++++.+.+++-|....-+ T Consensus 135 ~~~k~~Vl~A~~~~~~~il~~~~~~ 159 (159) T protein:vir:38 135 QEAKKSVAEAELKAYKEVMNHDSDK 159 (159) T ss_pred HHHHHHHHHHHHHHHHHHhhcccCC Confidence 9888888888877777777666666 No 91 >protein:vir:96486 Length: 112 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238496;genbank:gi:66391772;genbank:GeneID:5176908 Probab=99.02 E-value=4.6e-12 Score=82.76 Aligned_cols=109 Identities=8% Similarity=0.101 Sum_probs=68.5 Q ss_pred eeeccHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecCCcc Q lcl|NC_019933. 5 ITSLDISGVLSALNDLRDDSD-SVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNKKKA 83 (155) Q Consensus 5 m~~~~l~~L~~~l~~l~~~~~-~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~~a 83 (155) |+.+.|+||+++++.|....+ ..+++++++++....+++...+....+ ..+++..-.+.+....-. T Consensus 1 Ma~i~i~Gld~L~~~l~~~~~~~~v~~~v~~~~~~~~~~~~~~a~~~ap------------vdTG~Lr~sI~~~~~~~~- 67 (112) T protein:vir:96 1 MATIEFEGLDEMAQSLLKNASSERRSKVLRKYGAKLKEAAVSKAQFKKG------------YSTGATRRSITLEAGSDR- 67 (112) T ss_pred CceeeehHHHHHHHHHHhhcCHHHHHHHHHHHHHHHHHHHHHHhhhcCC------------CCchhhhhceeeecCceE- Confidence 999999999999999999964 668999999999999999998864432 123333222222111100 Q ss_pred ccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccch--hhHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019933. 84 PHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFL--RPGYDSVKGRLVEVANKAGAKRLAELR 152 (155) Q Consensus 84 ~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFl--rPA~~~~~~~~~~~i~~~l~~~i~k~~ 152 (155) ++.| ....|+.++|+ ..-|| +|=+. 
-+.+..+..+.+.|.++- T Consensus 68 -----~~v~-----------~~~~Ya~~vE~------GTr~m~AqPF~~----PA~~~~~~~~~~~l~~L~ 112 (112) T protein:vir:96 68 -----AVVE-----------ALTNYSGYLEV------GTRKMEAQPFMR----PALDQVVPEMVEEMAKWE 112 (112) T ss_pred -----EEec-----------CCCCccceecc------CccccCCCCchh----hhHHHHHHHHHHHHHhcC Confidence 1111 12247777777 55565 34443 345556666666666666 No 92 >protein:vir:6246 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813700;swissprot:trembl:q859b7;genbank:gi:29366760;uniprot:Q859B7;genbank:GeneID:1258903 Probab=98.93 E-value=8.7e-12 Score=81.22 Aligned_cols=127 Identities=13% Similarity=0.192 Sum_probs=96.2 Q ss_pred Cc-eeeeeccHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHhCCC-----------Ccchhhcceeeeecc Q lcl|NC_019933. 1 MS-SKITSLDISGVLSALNDLR----DDSDSVSRTMAFESAAVVRDSAKAHVRS-----------KTGRLKGAIYAVYVP 64 (155) Q Consensus 1 M~-~~m~~~~l~~L~~~l~~l~----~~~~~~~r~a~~~~a~~i~~eak~~aP~-----------~tG~Lr~sI~~~~~~ 64 (155) |+ -.-..+.++||.+....|. .+..+.++.+..++|.++...+++.+|+ +||.|..||.+.... T Consensus 1 ma~~~~~~vrV~Glr~f~~~mrK~~g~dl~k~lk~a~~~aa~v~~~~ar~~tP~g~r~~~~s~~~r~G~L~~Sir~aaT~ 80 (143) T protein:vir:62 1 MAQRSAYTIRVDGLREFQRNVRTLRDKELNKAVREANKASGEVLIPQAKHESPDGKRDAKSSKKYRPGKLDKSIKVTASA 80 (143) T ss_pred CCcccchheehHHHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccCcchhhccccccccc Confidence 77 3334444555555555554 3567899999999999999999999998 799999999764322 Q ss_pred cccCCceEEEEEEecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 65 EESTEVRHVYAVSWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAG 144 (155) Q Consensus 65 ~~~~~g~~~~~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l 144 (155) . ...+..|-. 
.--||+.||+||+-.+ ++.++-||.-|.-..++++...-.+++ T Consensus 81 r-----aa~VrAG~~-krVPYA~~I~~G~r~r---------------------~Isp~rFl~~a~a~te~~~~r~Ye~~i 133 (143) T protein:vir:62 81 K-----GAVIKAGSA-SRVPYAAAIHFGYRAR---------------------NISPNRFLFRAMARKSDVVAATYERRI 133 (143) T ss_pred c-----ceeeeeCCc-CCCCcccccccCcccc---------------------cccchhhhhhhhhccCHHHHHHHHHHH Confidence 1 233344421 3468999999996432 456889999999999999999999999 Q ss_pred HHHHHHHhcc Q lcl|NC_019933. 145 AKRLAELRSK 154 (155) Q Consensus 145 ~~~i~k~~~k 154 (155) .+.|++.|.. T Consensus 134 ~~vl~k~l~s 143 (143) T protein:vir:62 134 AAVVEKYLES 143 (143) T ss_pred HHHHHHHhcC Confidence 9999999988 No 93 >protein:vir:98636 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039927;genbank:gi:126011102;genbank:GeneID:4818472 Probab=98.90 E-value=1.9e-11 Score=79.41 Aligned_cols=128 Identities=13% Similarity=0.045 Sum_probs=97.2 Q ss_pred CceeeeeccHHHHHHHHHH-HHH-HHHHHHHHHHHHHHHHHHHHHHHhCC--CCcchhhcceeeeecccccCCceEEEEE Q lcl|NC_019933. 1 MSSKITSLDISGVLSALND-LRD-DSDSVSRTMAFESAAVVRDSAKAHVR--SKTGRLKGAIYAVYVPEESTEVRHVYAV 76 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~-l~~-~~~~~~r~a~~~~a~~i~~eak~~aP--~~tG~Lr~sI~~~~~~~~~~~g~~~~~V 76 (155) ||..-+..|+++|++.|+. |+. .+.++..+|+.++++.|.++.+.+.+ .|||..-+++...... ..+|.-.+.| T Consensus 7 ~~~~aevkGv~Eilk~lE~klG~~~~~ri~nkAL~~~ge~v~~~lK~~~~~fkDTGat~dev~~s~p~--~~~G~r~V~i 84 (138) T protein:vir:98 7 MSGFANLKGVEELLANMEKKLGPAKVNRVVNRSLKEIGKELEPSFKSAISIYKRTGETTESAVVSGVR--REDGIPKVKL 84 (138) T ss_pred ccccccccCHHHHHHHHHHhhCHHhhhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeeeeeecCee--ecCCceEEEE Confidence 6654444577777777777 554 47889999999999999999999998 5999988888665333 4467888999 Q ss_pred EecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 
77 SWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLAE 150 (155) Q Consensus 77 g~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~k 150 (155) ||..+-...-|+.|||+.... + -+..=+++-|++..+..+.+.++..|++.|+- T Consensus 85 gW~GpR~~ivHLNE~GyGk~i---~-----------------PrG~G~I~ka~~~se~~y~~~vk~el~k~l~~ 138 (138) T protein:vir:98 85 GFTTPRWNIVHLQELEYGWKH---N-----------------RRGVGVIRRYSDILETIYPRGIRDKLKRGFDG 138 (138) T ss_pred eeecCeeeEEeeecccccCCc---C-----------------CCcchHHHHHHHhhhHHHHHHHHHHHHHHhcC Confidence 998876667788899983211 1 11234799999999999998887777777766 No 94 >protein:vir:107703 Length: 147 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003902;genbank:gi:45686318;genbank:GeneID:2773043 Probab=98.88 E-value=4.7e-11 Score=77.19 Aligned_cols=116 Identities=14% Similarity=0.125 Sum_probs=87.0 Q ss_pred eeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeeccc---------ccC------- Q lcl|NC_019933. 5 ITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPE---------EST------- 68 (155) Q Consensus 5 m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~---------~~~------- 68 (155) |...++.++...++++.+..+..+...+++.+..+..+...+.|||||.+|.|+.+....- +++ T Consensus 1 ma~~~~~~F~~~i~~~~~~ve~~~~~~~r~~a~~i~~~vv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~dp~g~~t~a~~ 80 (147) T protein:vir:10 1 MANYQIRRFQGEIDAWINAAESTLEHAIEIFVRDVHDALVSRSPVDTGRFKGNWQITFNEIPNHALNRYDKTGGVVRGEE 80 (147) T ss_pred CCCcchhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhhccccceeecCccccccCCcCCCccchhhhh Confidence 8888888999999999888888888899999999999999999999999999997652211 111 Q ss_pred ------------CceEEEEEEecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHH Q lcl|NC_019933. 69 ------------EVRHVYAVSWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRL 136 (155) Q Consensus 69 ------------~g~~~~~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~ 136 (155) .+.. 
++++ ...+|+.++||||+ .++|..|.+.++ T Consensus 81 ~~~~~~~~~~~~~~~~-iyi~---Nn~pYA~~LEyG~S-----------------------~QAP~G~V~~t~------- 126 (147) T protein:vir:10 81 QAKTYGMFSRGGAITS-VHFS---NMLIYANALEYGHS-----------------------QQAPSGVVGLVA------- 126 (147) T ss_pred hHHHHHHhhhccCcce-EEEe---eCcchhhhhhcccc-----------------------CCCCchHHHHHH------- Confidence 1111 1122 34678888999986 578999999766 Q ss_pred HHHHHHHHHHHHHHHhccC Q lcl|NC_019933. 137 VEVANKAGAKRLAELRSKR 155 (155) Q Consensus 137 ~~~i~~~l~~~i~k~~~k~ 155 (155) +.+.+.+++++.|+-+.| T Consensus 127 -q~~~~~v~~~~~e~k~~~ 144 (147) T protein:vir:10 127 -LRLRSYMADAIKQARRQQ 144 (147) T ss_pred -HHHHHHHHHHHHHHHhhh Confidence 445555666777777766 No 95 >protein:vir:1332 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047931;swissprot:trembl:q9zxa7;genbank:gi:9631149;uniprot:Q9ZXA7;genbank:GeneID:2715891 Probab=98.83 E-value=3.2e-11 Score=78.16 Aligned_cols=127 Identities=14% Similarity=0.200 Sum_probs=97.5 Q ss_pred Cc-eeeeeccHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHhCCCC-----------cchhhcceeeeecc Q lcl|NC_019933. 1 MS-SKITSLDISGVLSALNDLRD----DSDSVSRTMAFESAAVVRDSAKAHVRSK-----------TGRLKGAIYAVYVP 64 (155) Q Consensus 1 M~-~~m~~~~l~~L~~~l~~l~~----~~~~~~r~a~~~~a~~i~~eak~~aP~~-----------tG~Lr~sI~~~~~~ 64 (155) |+ -.-..+.++||.+....|.. +..+.++.+..++|.++...+++.+|+- ||.|..||.+.... T Consensus 1 ma~~~~~~vkV~Glr~f~~~mrK~~g~dl~k~lk~a~~~aa~v~~~~ar~~tP~g~~~p~~srr~r~G~L~~Sir~aaT~ 80 (143) T protein:vir:13 1 MAQRSAYTIQVDGLRQFQRNVRALRDKELNKAVREANKASGEVLIPQAKHESPDGHRDPKSSKRYRPGKLDKSIKVTASA 80 (143) T ss_pred CCcccchheehHHHHHHHHHHHHhhCCcchHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccchhhccccccccc Confidence 77 34445566676666666654 4568899999999999999999999975 89999999764322 Q ss_pred cccCCceEEEEEEecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 
65 EESTEVRHVYAVSWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAG 144 (155) Q Consensus 65 ~~~~~g~~~~~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l 144 (155) . ...+..|- +.--||+.||+||+-.+ ++.++-||+-|.-..++++...-.+++ T Consensus 81 r-----aa~VrAGr-~arVPYA~~I~~G~r~r---------------------~Is~~rFl~~a~a~te~~~~r~Ye~~i 133 (143) T protein:vir:13 81 K-----GAVIKAGS-AARVPYAAAIHFGYRKR---------------------NISANRFLYRAMARKSDVVAATYERRI 133 (143) T ss_pred c-----ceeeeecC-cCCCCcccccccCCccc---------------------ccchhhhhhhhhhccCHHHHHHHHHHH Confidence 1 23333331 12358999999996432 356889999999999999999999999 Q ss_pred HHHHHHHhcc Q lcl|NC_019933. 145 AKRLAELRSK 154 (155) Q Consensus 145 ~~~i~k~~~k 154 (155) .+.|++.|.. T Consensus 134 ~~vl~k~l~s 143 (143) T protein:vir:13 134 AAVVEKYLES 143 (143) T ss_pred HHHHHHHhcC Confidence 9999999988 No 96 >protein:vir:79091 Length: 175 # NCBI annotation: gp5, phage virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111205;genbank:gi:134288802;genbank:GeneID:4960765 Probab=98.79 E-value=9.5e-11 Score=75.53 Aligned_cols=128 Identities=15% Similarity=0.164 Sum_probs=87.2 Q ss_pred Cceeeee-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-----C-------------------------- Q lcl|NC_019933. 1 MSSKITS-LDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHV-----R-------------------------- 48 (155) Q Consensus 1 M~~~m~~-~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~a-----P-------------------------- 48 (155) ||..|++ ++.+++.+.|+.|.....+ .++.++.-|..++.+...+. 
| T Consensus 1 Ms~~i~i~~d~~~~~~~L~~l~~~~~d-~~~lm~~Ig~~l~~~t~~rF~~~~~PdW~pls~~t~~~r~~~~~~~~~~~~~ 79 (175) T protein:vir:79 1 MSDFVNFQIDDSALRTRLLQLEQAGHQ-KADAMRKITQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKNGEL 79 (175) T ss_pred CceEEEEEechHHHHHHHHHHHHHhcC-HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCChHHHHhhccccccccccccc Confidence 9975443 2446777777777654432 45677777777776665531 2 Q ss_pred --------------CCcchhhcceeeeecccccCCceEEEEEEecCCccccchhhhccccccCCCcCCCCceeeeeeccc Q lcl|NC_019933. 49 --------------SKTGRLKGAIYAVYVPEESTEVRHVYAVSWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLA 114 (155) Q Consensus 49 --------------~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~ 114 (155) .+||.|++||..... ...+.||.+. .|+.+..||+.. ..+ T Consensus 80 ~~~~~~~~~~~~~L~~tG~L~~Si~~~~~-------~~~v~vGtn~---~YAaiHqfGg~~----------------~~~ 133 (175) T protein:vir:79 80 TAAASRRKAGLMILQDSGQMAASTATDSG-------EDYSVIGSNK---EYAAIQHFGGQA----------------GRG 133 (175) T ss_pred hhhHhhhccCCCcceechhhhhhhhheec-------CCEEEEecCc---chhhHhhccccc----------------CCC Confidence 148999999965422 2245788765 577889999631 123 Q ss_pred ceeeeCCccchhhHHH-HHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_019933. 
115 TPVHVPARSFLRPGYD-SVKGRLVEVANKAGAKRLAELRSKR 155 (155) Q Consensus 115 gt~~~pa~PFlrPA~~-~~~~~~~~~i~~~l~~~i~k~~~k~ 155 (155) ..+.+||+|||-=.=+ .-..++.+.|.+.+.+.|+++++++ T Consensus 134 ~~v~IPARPfLG~s~~de~~~~~~~~I~~~i~~~l~~a~~~~ 175 (175) T protein:vir:79 134 LKVTIPGRAWLPVTADGELQPEAVEPVLNTILRHLMDAANRR 175 (175) T ss_pred cccccCcccccCCCcccchhHHHHHHHHHHHHHHHHHHhccC Confidence 4568999999975433 3356788899999999999999999 No 97 >protein:vir:102338 Length: 116 # NCBI annotation: hypothetical protein # Family: family:all:26573 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529563;genbank:gi:90592648;genbank:GeneID:3974470 Probab=98.77 E-value=4.5e-11 Score=77.32 Aligned_cols=113 Identities=14% Similarity=0.058 Sum_probs=75.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCC---CcchhhcceeeeecccccCCceEEEEEEecCCccccchhhhccccccCCCc Q lcl|NC_019933. 24 SDSVSRTMAFESAAVVRDSAKAHVRS---KTGRLKGAIYAVYVPEESTEVRHVYAVSWNKKKAPHGHLVEYGHWRTNVVA 100 (155) Q Consensus 24 ~~~~~r~a~~~~a~~i~~eak~~aP~---~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~~a~~~~~vEfGt~~~~~~~ 100 (155) ..+.+++++.+.|..+...++++.|+ ++|+||+|+.+...... ++. |+ ....|++||||||....+. T Consensus 1 l~~~~~~~~~~~a~~l~~~vk~rTPv~~~d~G~LR~sW~~g~v~k~--~~~----v~---N~~eYA~~VE~GHRq~~g~- 70 (116) T protein:vir:10 1 MSKNLRRAKNNIGNKLLRKVKPKTPVAKIDGGTARKSWKYKELNLF--DGV----VS---NNVEYIHHLEYGHRTRQGT- 70 (116) T ss_pred CchHHHHHHHHHHHHHHHHHHhhCCCCcCCCcccccCceeeeeecc--Cce----ee---cCCcccccccCCceeeCCc- Confidence 77888999999999999999999998 56999999977533221 111 22 4678999999999653321 Q ss_pred CCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019933. 101 EVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLAELRS 153 (155) Q Consensus 101 ~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~k~~~ 153 (155) |.........-...++|.+-||+-+.+.-+.+ |-..+++.|++++. 
T Consensus 71 ---g~~~~~~gkrlk~~~V~G~fml~~s~~e~~~~----~~~~~~~~~~~~l~ 116 (116) T protein:vir:10 71 ---GTSENYRPKPNGISFVPGVFMLARSVDEMSSI----IDDELNQIIIDFWN 116 (116) T ss_pred ---ceecccccccccCCccCceehHHHHHHHHHHH----HHHHHHHHHHHhcC Confidence 11111111112223678888998888655544 44555666666666 No 98 >protein:vir:1988 Length: 156 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050635;genbank:gi:9633522;genbank:GeneID:2636282 Probab=98.77 E-value=1e-10 Score=75.37 Aligned_cols=122 Identities=18% Similarity=0.208 Sum_probs=76.5 Q ss_pred Cceeeeec-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-----CC------------------------- Q lcl|NC_019933. 1 MSSKITSL-DISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHV-----RS------------------------- 49 (155) Q Consensus 1 M~~~m~~~-~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~a-----P~------------------------- 49 (155) ||+.++.. +++.|.+.|..|....+. +..+++.+..++.+.+.+. |. T Consensus 1 ms~~i~~~~d~~~l~~~L~~l~~~~~~--~~l~~~Ig~~l~~~~~~rf~~~~~Pd~G~~W~pls~~t~~~r~~~~~~~~~ 78 (156) T protein:vir:19 1 MSLDMNVAVDVRRIQLALDELGTVTRD--RAIPRVMAAALLSSTEQAFERQADPDTGKGWEAWSDSWLAWRQDHGFVPGS 78 (156) T ss_pred CeEEEEEeecHHHHHHHHHHHHhhhcc--HHHHHHHHHHHHHHHHHHHHhcCCCCCCCCCcccChHHHHHhhccCCCCCc Confidence 88777655 788999999998654332 2455555666666555432 31 Q ss_pred ---CcchhhcceeeeecccccCCceEEEEEEecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchh Q lcl|NC_019933. 50 ---KTGRLKGAIYAVYVPEESTEVRHVYAVSWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLR 126 (155) Q Consensus 50 ---~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlr 126 (155) +||.|++||..... ...+.||.+ .+|+...+||...... 
..++++||+||| T Consensus 79 ~L~~tg~L~~Si~~~~~-------~~~v~vGt~---~~yA~vHqfG~~~~~~---------------~~~~~iPaRpfL- 132 (156) T protein:vir:19 79 ILTLHGDLARSITTDYG-------QDYALIGSP---KIYAAIHQWGGTPDMA---------------PRPAGVPARPYM- 132 (156) T ss_pred chhhhHHHHHHhhheec-------CCEEEEecc---hhhhHHhhcCcccccC---------------CCccccCCcccc- Confidence 47899999964321 224567875 4688889999743211 113579999999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 127 PGYDSVKGRLVEVANKAGAKRLAE 150 (155) Q Consensus 127 PA~~~~~~~~~~~i~~~l~~~i~k 150 (155) +-=+...+++.+.+.+.|...+.+ T Consensus 133 G~s~~d~~~I~~~i~~~l~~~~~~ 156 (156) T protein:vir:19 133 GLDKTGEQEIFDAIRKRVSAALRQ 156 (156) T ss_pred CCCHHHHHHHHHHHHHHHHHHhhC Confidence 444555555555555555555544 No 99 >protein:vir:104347 Length: 145 # NCBI annotation: conserved phage-related protein # Family: family:all:448 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398975;genbank:gi:81343959;genbank:GeneID:3778879 Probab=98.76 E-value=5.1e-11 Score=77.02 Aligned_cols=119 Identities=12% Similarity=0.089 Sum_probs=76.5 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccc-------cCCceEE Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEE-------STEVRHV 73 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~-------~~~g~~~ 73 (155) |.-.|.. +-++...+.++-+..+..+...+++.+..|..+...+.|||||.+|.|+.+....-. ..+|..+ T Consensus 1 ~~~~m~~--~~sF~~~i~~~~~~ve~~~~~v~r~~a~~i~~~vv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~G~~t 78 (145) T protein:vir:10 1 MARNIGS--VVTFEKSIADWIDRAEDGFGIVVSNTVIKTANAIVDLSPVDTGRFKANWQISANSPAQQSLNEYDQTGGQT 78 (145) T ss_pred CCCcccc--hhccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccceeecccccccccccCCCCccc Confidence 6666553 223444445554555555555677777778888888999999999999976531111 0111111 Q ss_pred -------------------EEEEecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHH Q lcl|NC_019933. 
74 -------------------YAVSWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKG 134 (155) Q Consensus 74 -------------------~~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~ 134 (155) ++++ ...||+..+||||+ .++|..|.+.++..-.+ T Consensus 79 ~~~~~~~~~~i~~~k~g~~iyi~---Nn~pYA~~LEyG~S-----------------------~QAP~G~v~~~~~~~~~ 132 (145) T protein:vir:10 79 KTYLARQARAVANSKATSVIYIT---NRLDYAADLEYGAS-----------------------NQAPAGVLGVVQARLGR 132 (145) T ss_pred hhhHHHHHHHhhcccccceEEEe---eCchhhhHhhcccc-----------------------CCCcchHHHHHHHHHHH Confidence 1122 34688888999986 57899999999976644 Q ss_pred HHHHHHHHHHHHHH Q lcl|NC_019933. 135 RLVEVANKAGAKRL 148 (155) Q Consensus 135 ~~~~~i~~~l~~~i 148 (155) +++...++++++| T Consensus 133 -~v~~~~~e~k~~~ 145 (145) T protein:vir:10 133 -YFQEAVEEARRAI 145 (145) T ss_pred -HHHHHHHHhhccC Confidence 5555557777777 No 100 >protein:vir:93898 Length: 133 # NCBI annotation: ORF028 # Family: family:all:589 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239942;genbank:gi:66395616;genbank:GeneID:5130964 Probab=98.75 E-value=1.6e-10 Score=74.29 Aligned_cols=127 Identities=19% Similarity=0.180 Sum_probs=89.9 Q ss_pred CceeeeeccHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHhCC--CCcchhhcceeeeecccccCCceEEEEE Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDL--RDDSDSVSRTMAFESAAVVRDSAKAHVR--SKTGRLKGAIYAVYVPEESTEVRHVYAV 76 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l--~~~~~~~~r~a~~~~a~~i~~eak~~aP--~~tG~Lr~sI~~~~~~~~~~~g~~~~~V 76 (155) ||+++. |+++|++.|+.. +..+.++..+|+.++++.|.++.+.+.. 
.|||..-+++..+......+.+..++.| T Consensus 1 msvevk--Gv~eilk~le~k~G~~~~~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~i 78 (133) T protein:vir:93 1 MSVEIK--GIPEVLKKLESVYGKQSMQAKSDRALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLI 78 (133) T ss_pred CeEEEe--cHHHHHHHHHHhhCHhhhHhhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceEEEE Confidence 777766 778888888775 4567889999999999999999999988 6999999998765333333333567899 Q ss_pred EecCCc--cccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 77 SWNKKK--APHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAK 146 (155) Q Consensus 77 g~~~~~--a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~ 146 (155) ||..+. ...-|+.|||+.+ +|+++.-+ .-=-++-|++..+....+.++++|++ T Consensus 79 ~W~gp~~R~~iVHLNE~Gytr-------~Gk~i~Pr----------G~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:93 79 EWVGPMNRKNIIHLNEHGYTR-------DGKKYTPR----------GFGVIAKTLAANERKYREIIKKELAR 133 (133) T ss_pred EeecCCCceeEEEeeccceec-------CCCeEccc----------hhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 998764 4467889999743 22211100 11136777777777777777666666 No 101 >protein:vir:103841 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938236;genbank:gi:38229141;genbank:GeneID:2648156 Probab=98.73 E-value=2e-10 Score=73.77 Aligned_cols=125 Identities=17% Similarity=0.086 Sum_probs=80.4 Q ss_pred Cceeeeec-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-C---------------------------CCc Q lcl|NC_019933. 1 MSSKITSL-DISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHV-R---------------------------SKT 51 (155) Q Consensus 1 M~~~m~~~-~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~a-P---------------------------~~t 51 (155) |+..|+.. +.+.|.+.|+.|.+..++ .+..++..+..++...+.+. 
| .+| T Consensus 1 Ms~~i~i~~~~~~~~~~L~~l~~~~~~-~~~l~~~ig~~l~~~~~~rF~p~G~~W~plsp~t~~~r~k~g~~~~~~L~~t 79 (155) T protein:vir:10 1 MANRIELELVDREVQERLAALYAAVTD-TLPLMRGIAAELLAETEFAFMDEGPGWPQLSPVTVAARAAKGRGAHPILQVT 79 (155) T ss_pred CCceEEEEechHHHHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHHHHHhhcCCCCCCCCccchHHHHhccCCCCCccccc Confidence 99777644 567788888888765543 45667777777777665543 1 248 Q ss_pred chhhcceeeeecccccCCceEEEEEEecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchh-hHHH Q lcl|NC_019933. 52 GRLKGAIYAVYVPEESTEVRHVYAVSWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLR-PGYD 130 (155) Q Consensus 52 G~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlr-PA~~ 130 (155) |.|++||..... ...+.||.+. +|+...+||.... .++++.+||+|||- ..-+ T Consensus 80 G~L~~Si~~~~~-------~~~v~vGtn~---~YA~iHqfGg~~~----------------~~~~~~iPARPfLG~s~~~ 133 (155) T protein:vir:10 80 NALARSITTRAD-------RDQAQIGSNL---SYAAIQQLGGQAG----------------RGRKVTIPARPYLPVLRNG 133 (155) T ss_pred hhhhhhhhceec-------CCEEEEecCc---chhhhhhcccccC----------------CCCccccCCccccCCCccc Confidence 899999965422 2245678764 5778899996321 13456899999996 3333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019933. 131 SVKGRLVEVANKAGAKRLAELR 152 (155) Q Consensus 131 ~~~~~~~~~i~~~l~~~i~k~~ 152 (155) .-.+++++.|.+.+.+.|++-. T Consensus 134 e~~~ei~~~I~~~i~~~l~~~r 155 (155) T protein:vir:10 134 QLKPSARDAVLDVLLAALSQGR 155 (155) T ss_pred cchHHHHHHHHHHHHHHHhhcC Confidence 3345666666666655554433 No 102 >protein:vir:9363 Length: 133 # NCBI annotation: SLT orf 123-like protein # Family: family:all:589 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803341;genbank:gi:29028652;genbank:GeneID:1258087 Probab=98.73 E-value=2.4e-10 Score=73.37 Aligned_cols=127 Identities=19% Similarity=0.169 Sum_probs=89.3 Q ss_pred CceeeeeccHHHHHHHHHH-HHH-HHHHHHHHHHHHHHHHHHHHHHHhCC--CCcchhhcceeeeecccccCCceEEEEE Q lcl|NC_019933. 
1 MSSKITSLDISGVLSALND-LRD-DSDSVSRTMAFESAAVVRDSAKAHVR--SKTGRLKGAIYAVYVPEESTEVRHVYAV 76 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~-l~~-~~~~~~r~a~~~~a~~i~~eak~~aP--~~tG~Lr~sI~~~~~~~~~~~g~~~~~V 76 (155) ||+++. |+++|++.|+. |+. .+.++..+|+.++++.|.++.+.+.. .|||..-+++..+......+.+..++.| T Consensus 1 msvevk--Gv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~i 78 (133) T protein:vir:93 1 MSVEIK--GIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLI 78 (133) T ss_pred CeEEEe--cHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceeEEE Confidence 777766 78888888887 554 56889999999999999999999988 6999999998765433333334567899 Q ss_pred EecCCc--cccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 77 SWNKKK--APHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAK 146 (155) Q Consensus 77 g~~~~~--a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~ 146 (155) ||..+. ...-|+.|||+.+ +|+++.-+ .-=-++-|++..+....+.++++|++ T Consensus 79 ~W~gp~~R~~iVHLNE~Gytr-------~Gk~i~Pr----------G~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:93 79 EWVGPMNRKNIIHLNEHGYTR-------DGKKYTPR----------GFGVIAKTLAASERKYREIIKKELAR 133 (133) T ss_pred EeecCCCceeEEEeeccceec-------CCCeEccc----------hhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 998764 4467889999743 22211100 11136667777777777666666655 No 103 >protein:vir:78644 Length: 133 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429946;genbank:gi:156604000;genbank:GeneID:5525390 Probab=98.73 E-value=2.4e-10 Score=73.37 Aligned_cols=127 Identities=19% Similarity=0.169 Sum_probs=89.3 Q ss_pred CceeeeeccHHHHHHHHHH-HHH-HHHHHHHHHHHHHHHHHHHHHHHhCC--CCcchhhcceeeeecccccCCceEEEEE Q lcl|NC_019933. 
1 MSSKITSLDISGVLSALND-LRD-DSDSVSRTMAFESAAVVRDSAKAHVR--SKTGRLKGAIYAVYVPEESTEVRHVYAV 76 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~-l~~-~~~~~~r~a~~~~a~~i~~eak~~aP--~~tG~Lr~sI~~~~~~~~~~~g~~~~~V 76 (155) ||+++. |+++|++.|+. |+. .+.++..+|+.++++.|.++.+.+.. .|||..-+++..+......+.+..++.| T Consensus 1 msvevk--Gv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~i 78 (133) T protein:vir:78 1 MSVEIK--GIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLI 78 (133) T ss_pred CeEEEe--cHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceeEEE Confidence 777766 78888888887 554 56889999999999999999999988 6999999998765433333334567899 Q ss_pred EecCCc--cccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 77 SWNKKK--APHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAK 146 (155) Q Consensus 77 g~~~~~--a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~ 146 (155) ||..+. ...-|+.|||+.+ +|+++.-+ .-=-++-|++..+....+.++++|++ T Consensus 79 ~W~gp~~R~~iVHLNE~Gytr-------~Gk~i~Pr----------G~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:78 79 EWVGPMNRKNIIHLNEHGYTR-------DGKKYTPR----------GFGVIAKTLAASERKYREIIKKELAR 133 (133) T ss_pred EeecCCCceeEEEeeccceec-------CCCeEccc----------hhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 998764 4467889999743 22211100 11136667777777777666666655 No 104 >protein:vir:96973 Length: 133 # NCBI annotation: ORF034 # Family: family:all:589 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239864;genbank:gi:66395542;genbank:GeneID:5133006 Probab=98.73 E-value=2.4e-10 Score=73.37 Aligned_cols=127 Identities=19% Similarity=0.169 Sum_probs=89.3 Q ss_pred CceeeeeccHHHHHHHHHH-HHH-HHHHHHHHHHHHHHHHHHHHHHHhCC--CCcchhhcceeeeecccccCCceEEEEE Q lcl|NC_019933. 1 MSSKITSLDISGVLSALND-LRD-DSDSVSRTMAFESAAVVRDSAKAHVR--SKTGRLKGAIYAVYVPEESTEVRHVYAV 76 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~-l~~-~~~~~~r~a~~~~a~~i~~eak~~aP--~~tG~Lr~sI~~~~~~~~~~~g~~~~~V 76 (155) ||+++. |+++|++.|+. |+. 
.+.++..+|+.++++.|.++.+.+.. .|||..-+++..+......+.+..++.| T Consensus 1 msvevk--Gv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~i 78 (133) T protein:vir:96 1 MSVEIK--GIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLI 78 (133) T ss_pred CeEEEe--cHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceeEEE Confidence 777766 78888888887 554 56889999999999999999999988 6999999998765433333334567899 Q ss_pred EecCCc--cccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 77 SWNKKK--APHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAK 146 (155) Q Consensus 77 g~~~~~--a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~ 146 (155) ||..+. ...-|+.|||+.+ +|+++.-+ .-=-++-|++..+....+.++++|++ T Consensus 79 ~W~gp~~R~~iVHLNE~Gytr-------~Gk~i~Pr----------G~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:96 79 EWVGPMNRKNIIHLNEHGYTR-------DGKKYTPR----------GFGVIAKTLAASERKYREIIKKELAR 133 (133) T ss_pred EeecCCCceeEEEeeccceec-------CCCeEccc----------hhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 998764 4467889999743 22211100 11136667777777777666666655 No 105 >protein:vir:94419 Length: 133 # NCBI annotation: ORF028 # Family: family:all:589 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240010;genbank:gi:66395683;genbank:GeneID:5133079 Probab=98.73 E-value=2.4e-10 Score=73.37 Aligned_cols=127 Identities=19% Similarity=0.169 Sum_probs=89.3 Q ss_pred CceeeeeccHHHHHHHHHH-HHH-HHHHHHHHHHHHHHHHHHHHHHHhCC--CCcchhhcceeeeecccccCCceEEEEE Q lcl|NC_019933. 1 MSSKITSLDISGVLSALND-LRD-DSDSVSRTMAFESAAVVRDSAKAHVR--SKTGRLKGAIYAVYVPEESTEVRHVYAV 76 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~-l~~-~~~~~~r~a~~~~a~~i~~eak~~aP--~~tG~Lr~sI~~~~~~~~~~~g~~~~~V 76 (155) ||+++. |+++|++.|+. |+. .+.++..+|+.++++.|.++.+.+.. 
.|||..-+++..+......+.+..++.| T Consensus 1 msvevk--Gv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~i 78 (133) T protein:vir:94 1 MSVEIK--GIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLI 78 (133) T ss_pred CeEEEe--cHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceeEEE Confidence 777766 78888888887 554 56889999999999999999999988 6999999998765433333334567899 Q ss_pred EecCCc--cccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 77 SWNKKK--APHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAK 146 (155) Q Consensus 77 g~~~~~--a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~ 146 (155) ||..+. ...-|+.|||+.+ +|+++.-+ .-=-++-|++..+....+.++++|++ T Consensus 79 ~W~gp~~R~~iVHLNE~Gytr-------~Gk~i~Pr----------G~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:94 79 EWVGPMNRKNIIHLNEHGYTR-------DGKKYTPR----------GFGVIAKTLAASERKYREIIKKELAR 133 (133) T ss_pred EeecCCCceeEEEeeccceec-------CCCeEccc----------hhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 998764 4467889999743 22211100 11136667777777777666666655 No 106 >protein:vir:103280 Length: 142 # NCBI annotation: phage-related hypothetical protein # Family: family:all:448 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277459;genbank:gi:71834102;genbank:GeneID:3562391 Probab=98.70 E-value=2.1e-10 Score=73.65 Aligned_cols=116 Identities=13% Similarity=0.122 Sum_probs=78.0 Q ss_pred eeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeeccccc-------CCceE----- Q lcl|NC_019933. 5 ITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEES-------TEVRH----- 72 (155) Q Consensus 5 m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~-------~~g~~----- 72 (155) |.. ++.++...++.+-+..+..+...+++.+..+..+...+.|||||.+|.|+.++...-.. .+|.. 
T Consensus 1 Ma~-~~~sf~~~i~~~~~~ve~~~~~v~r~~a~~i~~~vv~~sPVdTGr~R~nw~vs~~~~~~~~~~~~d~~G~~t~~~~ 79 (142) T protein:vir:10 1 MAN-DVVSFRNSINAWIDGVTEGVELIVEGTLTKATKDIVKLSPVDTGRFRGNWQATGNSPAAQSLNNYDPDGNETRNSL 79 (142) T ss_pred Ccc-chhhhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCcccchhhcccceeeecCcccccccCcCCCCccchhhH Confidence 543 45566667777777777777777888888888888999999999999999775322110 00110 Q ss_pred --------------EEEEEecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHH Q lcl|NC_019933. 73 --------------VYAVSWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVE 138 (155) Q Consensus 73 --------------~~~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~ 138 (155) +++++ ...+|+..+||||+ .+.|..|.+.++..-.+ +++ T Consensus 80 ~~~~~~i~~~~~g~~iyi~---Nn~pYA~~LEyG~S-----------------------~QAP~G~v~~a~q~~~~-~v~ 132 (142) T protein:vir:10 80 RRQIYALARDANTNVIYIS---NRLDYAQGLEFGSS-----------------------NQAPSGVLGVVQKRLGR-YFA 132 (142) T ss_pred HHHHHHhhhccccceEEEe---eCcchhhhhhcccc-----------------------CCCcchHHHHHHHHHHH-HHH Confidence 11111 34688888999986 47899999999965544 444 Q ss_pred HHHHHHHHHH Q lcl|NC_019933. 139 VANKAGAKRL 148 (155) Q Consensus 139 ~i~~~l~~~i 148 (155) ...++++.+| T Consensus 133 ~a~~e~~~~~ 142 (142) T protein:vir:10 133 EAVQEAKRAL 142 (142) T ss_pred HHHHHhhccC Confidence 4445566666 No 107 >protein:vir:78335 Length: 133 # NCBI annotation: gp9 # Family: family:all:589 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468648;genbank:gi:157325225;genbank:GeneID:5601681 Probab=98.70 E-value=4.4e-10 Score=71.89 Aligned_cols=127 Identities=17% Similarity=0.179 Sum_probs=94.0 Q ss_pred CceeeeeccHHHHHHHHHH-HHH-HHHHHHHHHHHHHHHHHHHHHHHhC--CCCcchhhcceeeeecccccCCceEEEEE Q lcl|NC_019933. 1 MSSKITSLDISGVLSALND-LRD-DSDSVSRTMAFESAAVVRDSAKAHV--RSKTGRLKGAIYAVYVPEESTEVRHVYAV 76 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~-l~~-~~~~~~r~a~~~~a~~i~~eak~~a--P~~tG~Lr~sI~~~~~~~~~~~g~~~~~V 76 (155) ||+++. |+++|++.|+. |+. .+.++..+|+.++++.|.++.+.+. 
..|||..-+++..+... ..+|.-.+.| T Consensus 1 msvevk--Gv~eilk~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~--~~~G~r~V~i 76 (133) T protein:vir:78 1 MSVEVT--GVEELERQLVSLFGRENLPQLVDPALIAGATLVAKTLKSEFVQFKDTGASIDEINIEKPS--YDKGVRSIKI 76 (133) T ss_pred CeEEEe--cHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHHhhcchhcccceeeeEEecCee--eeCCceEEEE Confidence 777766 78888888887 554 5688999999999999999999965 56999999998764332 3467788899 Q ss_pred EecCCc--cccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 77 SWNKKK--APHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRL 148 (155) Q Consensus 77 g~~~~~--a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i 148 (155) ||..++ ...-|+.|||+.+ +|+++.-+ .-==++-|++..+....+.++++|++.| T Consensus 77 ~W~gp~~R~~iVHLNE~GYtr-------~Gk~i~Pr----------G~G~i~~a~~~se~~y~~~vk~el~k~l 133 (133) T protein:vir:78 77 DWKGPKDRYKIIHLNEYGYTR-------NGKKITPA----------GTGSVARSLRISERAYRAIVQKKIGDKL 133 (133) T ss_pred EEecCCCceeEEEeeccceec-------CCCeEccc----------hhhHHHHHHHhhhHHHHHHHHHHHHhhC Confidence 998764 4567889999743 22221100 1124788888888888888877777777 No 108 >protein:vir:79638 Length: 146 # NCBI annotation: gp40 # Family: family:all:448 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285529;genbank:gi:148734512;genbank:GeneID:5219996 Probab=98.69 E-value=3.7e-10 Score=72.32 Aligned_cols=117 Identities=14% Similarity=0.085 Sum_probs=82.7 Q ss_pred eeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeeccccc-------CCce------ Q lcl|NC_019933. 5 ITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEES-------TEVR------ 71 (155) Q Consensus 5 m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~-------~~g~------ 71 (155) |..-.+.++...++.+-+..+..+...+++.+..+..+...+.|||||.+|.|+.++...-.. .+|. 
T Consensus 1 ma~~~~~sFa~~i~~~~~~ve~~~~~~~r~~a~~i~~~vv~~sPVDTGr~Ranw~vs~~~~~~~~~~~~dp~G~~t~~~~ 80 (146) T protein:vir:79 1 MADYSIREFHGNVDKWIEQVESGLNDVIQIFGEKVHGALVDIAPVDTGRFKANMQITANKPPLYALNQYDPDGEKIKAEG 80 (146) T ss_pred CCcchhHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhhccccceeecCcccccccCCCCCCcccHHHH Confidence 777778888999999888888888889999999999999999999999999999775321110 1111 Q ss_pred --------------EEEEEEecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHH Q lcl|NC_019933. 72 --------------HVYAVSWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLV 137 (155) Q Consensus 72 --------------~~~~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~ 137 (155) .+++++ ...||+..+||||+ .++|..|.+.++..-.+ T Consensus 81 ~~~i~~~~~g~~~~~~iyi~---NnlpYA~~LEyG~S-----------------------~QAP~G~v~~~~~~~~~--- 131 (146) T protein:vir:79 81 RRTLYALLHGGGAIKSIYFS---NMLIYANALEYGHS-----------------------KQAPAGVFGIVAIRLRS--- 131 (146) T ss_pred HHHHHHHHhcccccceeEEe---eCchhhhhhhcccc-----------------------CCCcchHHHHHHHHHHH--- Confidence 112222 34688888999986 57899999998854433 Q ss_pred HHHHHHHHHHHHHHhccC Q lcl|NC_019933. 138 EVANKAGAKRLAELRSKR 155 (155) Q Consensus 138 ~~i~~~l~~~i~k~~~k~ 155 (155) .+++++.|+-++- T Consensus 132 -----~v~~a~~e~k~~~ 144 (146) T protein:vir:79 132 -----YMAEAIREARKKN 144 (146) T ss_pred -----HHHHHHHHHHhhc Confidence 2333333333322 No 109 >protein:vir:79225 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469157;genbank:gi:157835000;genbank:GeneID:5648806 Probab=98.67 E-value=4.1e-10 Score=72.04 Aligned_cols=125 Identities=16% Similarity=0.101 Sum_probs=81.8 Q ss_pred Cceeeeec-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC--------C--------------------CCc Q lcl|NC_019933. 
1 MSSKITSL-DISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHV--------R--------------------SKT 51 (155) Q Consensus 1 M~~~m~~~-~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~a--------P--------------------~~t 51 (155) |++.|++. +.+.+.+.|+.|...... .+..++..+..++...+.+. | .+| T Consensus 1 M~~~i~i~~d~~~~~~~L~~l~~~~~d-~~~l~~~ig~~l~~~~~~rF~~eG~~W~pls~~t~~~r~~~g~~~~~iL~~t 79 (155) T protein:vir:79 1 MTTRIDVELDDQEVRQRLAVLMRSVTD-TLPVMRGIAAELLAETEFAFMDEGPGWPQLSPATVAAREAKGRGPHPILQVT 79 (155) T ss_pred CceEEEEEechHHHHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHHHHhhccCCCCCCCCHHHHHHHhccCCCCCCccccc Confidence 99877654 467888888888765542 56677777777777766653 1 368 Q ss_pred chhhcceeeeecccccCCceEEEEEEecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHH- Q lcl|NC_019933. 52 GRLKGAIYAVYVPEESTEVRHVYAVSWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYD- 130 (155) Q Consensus 52 G~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~- 130 (155) |.|++||..... ...+.||.+. +|+...+||+.. ..++++++||+|||-=.-+ T Consensus 80 G~L~~Si~~~~~-------~~~v~vGt~~---~YA~iHqfGg~~----------------~~~~~v~iPaRpfLG~s~~~ 133 (155) T protein:vir:79 80 NALARSVTTWAD-------RNEAGIGSNL---VYAAIHQFGGDA----------------GRGHQVEIPARRYLPFDENG 133 (155) T ss_pred hhhhhhhhceec-------CCEEEEecCc---hhhhhhhccccc----------------CCCCccccCCccccCCCCcc Confidence 999999965422 2245678664 577789999632 1234578999999964332 Q ss_pred HHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019933. 131 SVKGRLVEVANKAGAKRLAELR 152 (155) Q Consensus 131 ~~~~~~~~~i~~~l~~~i~k~~ 152 (155) .-..++.+.|.+.+.+.|++-. 
T Consensus 134 ~l~~~~~~~I~~~i~~~l~r~r 155 (155) T protein:vir:79 134 QLAAGARQSILEVVLTALSRNR 155 (155) T ss_pred ccchHHHHHHHHHHHHHHHhcC Confidence 2234566666666666665544 No 110 >protein:vir:3163 Length: 145 # NCBI annotation: unknown # Family: family:all:28417 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665934;genbank:gi:22091120;genbank:GeneID:951270 Probab=98.67 E-value=2.4e-10 Score=73.36 Aligned_cols=117 Identities=13% Similarity=0.104 Sum_probs=70.4 Q ss_pred ecc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-----C----------------------CCcchhhcce Q lcl|NC_019933. 7 SLD-ISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHV-----R----------------------SKTGRLKGAI 58 (155) Q Consensus 7 ~~~-l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~a-----P----------------------~~tG~Lr~sI 58 (155) ++. .+.+.+.++.|.+... .++...+..+.+++..+. | .+||.|++|| T Consensus 1 ~i~~~~~i~~~l~~l~~~~~----~~l~~i~~~~~~~~~~rf~~~~~p~G~~W~pLs~st~a~k~~~~~L~~tG~L~~Si 76 (145) T protein:vir:31 1 MVEDENNIPEAREAIQDGLT----DGLERLHTITLRELITNMSDGQDALGNPWEPLKESTIRAKGSDTPLIDNSRLLTDI 76 (145) T ss_pred CcccHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccChHHHHHhcCCCCCccCHHHHHHH Confidence 444 4467777777655433 344444544555444321 1 2689999999 Q ss_pred eeeecccccCCceEEEEEEecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHH Q lcl|NC_019933. 59 YAVYVPEESTEVRHVYAVSWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVE 138 (155) Q Consensus 59 ~~~~~~~~~~~g~~~~~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~ 138 (155) ........ ....+.||.+. +|+.+.+||+.+ +++||+|||-|+.+...+++.+ T Consensus 77 ~~~~~~~~---~~~~a~vGtn~---~YA~~hqfG~~~---------------------~~IPaRPfLG~~~~~~~~~~~~ 129 (145) T protein:vir:31 77 NAASMMDR---ANRMAVIGTNL---DYAEHHEFGAPE---------------------AGIPARPIFGPAGAYASQQAPD 129 (145) T ss_pred HHHhhhcc---cCceeEecCCc---hhhhhhccCCcc---------------------cccCCCCccCCCccchHHHHHH Confidence 65432221 12335678665 577889999742 3599999999998776666666 Q ss_pred HHHHHHHHHHHHHhcc Q lcl|NC_019933. 
139 VANKAGAKRLAELRSK 154 (155) Q Consensus 139 ~i~~~l~~~i~k~~~k 154 (155) .|.+.+...|.-++=- T Consensus 130 ii~~~i~~~L~~~~~~ 145 (145) T protein:vir:31 130 VIGDEIDTNLEGAVID 145 (145) T ss_pred HHHHHHHHHhhhhccC Confidence 6655555544433333 No 111 >protein:vir:107851 Length: 175 # NCBI annotation: gp31 # Family: family:all:274 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024704;genbank:gi:48696941;genbank:GeneID:2845939 Probab=98.64 E-value=5.7e-10 Score=71.25 Aligned_cols=128 Identities=16% Similarity=0.161 Sum_probs=84.3 Q ss_pred Cceeeeec-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-----C-------------------------- Q lcl|NC_019933. 1 MSSKITSL-DISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHV-----R-------------------------- 48 (155) Q Consensus 1 M~~~m~~~-~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~a-----P-------------------------- 48 (155) ||..++.. +.++|.+.|+.|.....+ .++.++.-+..++.+...+. | T Consensus 1 Ms~~i~i~~~~~~l~~~L~~l~~~~~d-~~~l~~~Ig~~l~~~t~~rF~~e~~Pdw~p~~p~t~~~r~~~g~~~~k~~~~ 79 (175) T protein:vir:10 1 MSDFVNFQIDDSALRTRLLQLEQAGHQ-KAGAMRKIAQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKNGEL 79 (175) T ss_pred CceeEEEEecHHHHHHHHHHHHHHhcc-HHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchhhhhhhhcccccchhhhhh Confidence 99754432 556788888888654432 34555666666665554432 1 Q ss_pred --------------CCcchhhcceeeeecccccCCceEEEEEEecCCccccchhhhccccccCCCcCCCCceeeeeeccc Q lcl|NC_019933. 49 --------------SKTGRLKGAIYAVYVPEESTEVRHVYAVSWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLA 114 (155) Q Consensus 49 --------------~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~ 114 (155) .+||.|++||..... ...+.||.+. .|+....||+... .. 
T Consensus 80 ~~~~~~~~~~~~~L~~tG~L~~Si~~~~~-------~~~v~vGtn~---~YAaiHqfGg~~~----------------~~ 133 (175) T protein:vir:10 80 TAAASRRKAGLMILQDSGQMAASVSTDHD-------DNSAVIGSNK---EYAAIHQFGGQAG----------------RG 133 (175) T ss_pred hhhhhhhccCCCcceechhhhhhhheeec-------CCEEEEecCh---hhhhhhhcccccC----------------CC Confidence 247889999965422 2245788765 4677788996311 12 Q ss_pred ceeeeCCccchhhHHHHH-HHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_019933. 115 TPVHVPARSFLRPGYDSV-KGRLVEVANKAGAKRLAELRSKR 155 (155) Q Consensus 115 gt~~~pa~PFlrPA~~~~-~~~~~~~i~~~l~~~i~k~~~k~ 155 (155) ..+++||+|||-=.-+.. ..++++.|.+.+.+.|.++++++ T Consensus 134 ~~v~iPaRpfLG~s~~d~~~~e~~~~Il~~~~~~l~~~~~~~ 175 (175) T protein:vir:10 134 LKVTIPARPWLPVTADGELQPEAVEPVLNTILRHLMDAANRR 175 (175) T ss_pred CccccCCccccCCCcccccchHHHHHHHHHHHHHHHHHhccC Confidence 346899999998654332 34677888899999999999999 No 112 >protein:vir:6216 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:10886 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852596;genbank:gi:31415856;genbank:GeneID:1489214 Probab=98.62 E-value=3.8e-10 Score=72.21 Aligned_cols=121 Identities=21% Similarity=0.236 Sum_probs=93.3 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC----CcchhhcceeeeecccccCCceEEEEE Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRS----KTGRLKGAIYAVYVPEESTEVRHVYAV 76 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~----~tG~Lr~sI~~~~~~~~~~~g~~~~~V 76 (155) |+++=. ||.+..+.|..|-+.-+++-...+.++|+-..+--+-+.|+ +.|+||++|.+....+ ... 
| T Consensus 1 m~sNNN--GFae~~~~~~tl~kVd~kvs~e~L~eAA~~f~~KL~P~Ip~Sl~kkk~HlrD~lkVvvk~d----~V~---V 71 (125) T protein:vir:62 1 MASNNN--GFAEALEDINTLLRVNKKVSLDALDEAAKYFASKLKPKINVSNKNKRTHLRDSLKVVVKDD----RVS---V 71 (125) T ss_pred CCCCch--hHHHHHHHhhhhhhhhhhhhHHHHHHHHHHHHHhhccccChhhhhhhhhcceeeeEEeeCC----eEE---E Confidence 665543 67777777777777778999999999999999999998885 4689999998764322 222 2 Q ss_pred EecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 77 SWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRL 148 (155) Q Consensus 77 g~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i 148 (155) - -...++||+|+|.||....+.+ .+.||-|..--|+++++++.+.|.+.+-+++ T Consensus 72 ~-Fed~a~yW~f~EnGt~~~~~~g-----------------~vkaqhf~~~Tf~~nk~kI~~iM~kki~d~m 125 (125) T protein:vir:62 72 E-FKDEAWYWYLVEHGHKKAKGKG-----------------RVKGKHFVQNTFDAEGDKIADIMAQKIINRM 125 (125) T ss_pred E-Ecchhhhhhhhhcccccccccc-----------------ccchhhhhhccHHhhHHHHHHHHHHHHHhhC Confidence 1 2467999999999996432211 1468999999999999999999988877777 No 113 >protein:vir:99196 Length: 155 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950453;genbank:gi:119953654;genbank:GeneID:4643056 Probab=98.62 E-value=7.9e-10 Score=70.47 Aligned_cols=125 Identities=16% Similarity=0.096 Sum_probs=78.2 Q ss_pred Cceeeeec-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC--------C--------------------CCc Q lcl|NC_019933. 1 MSSKITSL-DISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHV--------R--------------------SKT 51 (155) Q Consensus 1 M~~~m~~~-~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~a--------P--------------------~~t 51 (155) ||+.|++. +.++|.+.|+.|....+. .++.++..+..++...+.+. 
| .+| T Consensus 1 Ms~~i~i~~d~~~~~~~L~~l~~~~~d-~~~l~~~ig~~l~~~~~~rF~pdG~~W~pls~~t~~~r~~~g~~~~~iL~~t 79 (155) T protein:vir:99 1 MTTRIDVELDDQEVRQRLALLMRSVTD-TLPVMRGIAAELLAETEFAFMDEGPGWPQLSPVTVAAREAKGRGPHPILQVT 79 (155) T ss_pred CceEEEEEechHHHHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHHHHhhccCCCCCCCChHHHHHHhccCCCCCCcchhc Confidence 99877643 567888888888765543 56777777777777776653 1 258 Q ss_pred chhhcceeeeecccccCCceEEEEEEecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHH- Q lcl|NC_019933. 52 GRLKGAIYAVYVPEESTEVRHVYAVSWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYD- 130 (155) Q Consensus 52 G~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~- 130 (155) |.|++||..... ...+.||.+. .|+...+||.... ....+.+||+|||-=.-+ T Consensus 80 g~L~~Si~~~~~-------~~~v~vGtn~---~YA~iHqfGg~~~----------------~~~~v~iPaRpfLG~s~~~ 133 (155) T protein:vir:99 80 NALARSVTTWAD-------RNEAGIGSNL---VYAAIHQFGGDAG----------------RGHQVEIPARRYLPFDENG 133 (155) T ss_pred hhhhhhhhceec-------CCEEEEecCc---cchhhhhcccccC----------------CCCccccCCccccCCCCcc Confidence 899999965422 2245678664 4777899996321 123468999999963322 Q ss_pred HHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019933. 131 SVKGRLVEVANKAGAKRLAELR 152 (155) Q Consensus 131 ~~~~~~~~~i~~~l~~~i~k~~ 152 (155) .-..+..+.|.+.+.+.|++-. T Consensus 134 ~l~~e~~~~I~~~i~~~l~~~~ 155 (155) T protein:vir:99 134 QLAAGARQSILEIVLTALSRNR 155 (155) T ss_pred ccchHHHHHHHHHHHHHHhccC Confidence 1223444555444444444333 No 114 >protein:vir:99833 Length: 190 # NCBI annotation: hypothetical protein # Family: family:all:274 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164071;genbank:gi:56692603;genbank:GeneID:3192561 Probab=98.55 E-value=2.8e-09 Score=67.49 Aligned_cols=139 Identities=12% Similarity=0.156 Sum_probs=78.8 Q ss_pred Cc-eeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-----C------------------------CC Q lcl|NC_019933. 
1 MS-SKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHV-----R------------------------SK 50 (155) Q Consensus 1 M~-~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~a-----P------------------------~~ 50 (155) |+ ++++ +++++|.+.|+.|.+...+ .++.+++.|..++.+.+++. | .+ T Consensus 1 M~~i~i~-~d~~~~~~~L~~l~~~~~~-~~~l~~~ig~~l~~~~~~rf~~~~~PdG~~W~p~~~~t~~rk~~~~~~~L~~ 78 (190) T protein:vir:99 1 MAGITLE-WDGRRALDVLNAGSAALGD-PSGLLQDIGELLLNIHRRRFQAQVSPDGTPWQPLSPAYLRRKRKNRDKILTL 78 (190) T ss_pred CceeEEE-ecHHHHHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCccccHHHHHHhhcCCCcccee Confidence 44 5555 4778888888888654432 35667777777777666542 2 14 Q ss_pred cchhhcceeeeecccccCCceEEEEEEecCCccccchhhhccccccCCCcCCC---------Cc----eeeee------- Q lcl|NC_019933. 51 TGRLKGAIYAVYVPEESTEVRHVYAVSWNKKKAPHGHLVEYGHWRTNVVAEVD---------GK----WLFTK------- 110 (155) Q Consensus 51 tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~---------~~----~~~~~------- 110 (155) ||.|++||...... ..+.||.+. .|+...+||........... +. ..... T Consensus 79 tg~L~~Si~~~~~~-------~~v~vGtn~---~yA~iHq~Gg~i~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~ 148 (190) T protein:vir:99 79 DGHLRNLLRYQLDG-------SELLFGSDR---PYAAIHHFGGTIQRQARSSTVYFRQNERTGEVGREFVPRRRSNFAQD 148 (190) T ss_pred cHHHHHHHhheecC-------cEEEEecCc---chhhhhhcCCcccccccchhhhhhhhhhhhhhhcccccccccccchh Confidence 78999999654321 235678764 56777889954222111000 00 00000 Q ss_pred -ecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019933. 111 -EKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLAELR 152 (155) Q Consensus 111 -~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~k~~ 152 (155) ....-++++||+|||--. +...+++.+.|.+.|.+.|.+.. 
T Consensus 149 ~~~~~~~v~IPaRpfLG~s-~~d~~~I~~~i~~~l~~~~~~~~ 190 (190) T protein:vir:99 149 VQIGPYTIQMPARPWLGTS-SQDDDTILQRVERYLQRALRERA 190 (190) T ss_pred cccccceeeecCcccCCCC-HHHHHHHHHHHHHHHHHHHhhcC Confidence 011235789999999655 33445555555555555555444 No 115 >protein:vir:94994 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224022;genbank:gi:62327309;genbank:GeneID:5176822 Probab=98.45 E-value=1.7e-09 Score=68.61 Aligned_cols=107 Identities=13% Similarity=0.193 Sum_probs=64.1 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecc---------cccC--- Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVP---------EEST--- 68 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~---------~~~~--- 68 (155) |+|... ++.+ -+..++.+...+++.+..+..+...+.|||||.+|.|+.+.... .+++ T Consensus 1 msF~~~---i~~~-------~~~ve~~~~~~~r~~a~~~~~~iv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~g~~t 70 (131) T protein:vir:94 1 MSFALD---VTRF-------VEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGSTPADGTTDATDKSGNTA 70 (131) T ss_pred CCcccC---HHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHhCCCchhhhhccchhccccccccccCCCCCCchhh Confidence 766443 2222 22333344455556666667777789999999999999665321 1111 Q ss_pred ------------CceEEEEEEecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHH Q lcl|NC_019933. 69 ------------EVRHVYAVSWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRL 136 (155) Q Consensus 69 ------------~g~~~~~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~ 136 (155) .|.. 
+.++ ...+|+..+||||+ .++|..|.+-++..-.+ + T Consensus 71 ~~~~~~~i~~~~~g~~-iyi~---Nn~pYA~~LEyG~S-----------------------~QAP~g~v~~~~~~~~~-~ 122 (131) T protein:vir:94 71 TGNATSFVLNAADWHT-FTLT---NNLPYAQRLEYGWS-----------------------QQAPQGFVRVNVSRFQQ-L 122 (131) T ss_pred HHHHHHHHhhccccce-EEEe---eCchhhhhhhcccc-----------------------CCCcchHHHHHHHHHHH-H Confidence 1222 2232 34688888999986 58899999999865444 3 Q ss_pred HHHHHHHHH Q lcl|NC_019933. 137 VEVANKAGA 145 (155) Q Consensus 137 ~~~i~~~l~ 145 (155) ++...++++ T Consensus 123 v~~~~~e~k 131 (131) T protein:vir:94 123 LNEEASKVK 131 (131) T ss_pred HHHHHHhcC Confidence 333333333 No 116 >protein:vir:78380 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110844;genbank:gi:134288605;genbank:GeneID:5179643 Probab=98.32 E-value=6e-09 Score=65.67 Aligned_cols=107 Identities=13% Similarity=0.195 Sum_probs=62.5 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeeccc---------ccC--- Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPE---------EST--- 68 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~---------~~~--- 68 (155) |+|... ++.+ ++..++. +...+++.+..+..+...+.|||||.+|.|+.+....- +++ T Consensus 1 msf~~~---i~~~---~~~ve~~----~~~~~r~~a~~~~~~iv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~g~~t 70 (131) T protein:vir:78 1 MSFALD---VSKF---VEKAKKN----PEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTA 70 (131) T ss_pred CCcCcC---HHHH---HHHHHHH----HHHHHHHHHHHHHHHHHHhCCCchhhhccccceecccccccccCCCCCCchhh Confidence 776443 2222 2222333 33445555556666667799999999999997653211 110 Q ss_pred ------------CceEEEEEEecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHH Q lcl|NC_019933. 
69 ------------EVRHVYAVSWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRL 136 (155) Q Consensus 69 ------------~g~~~~~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~ 136 (155) .|.. ++++ ...+|+..+||||+ .++|..|.+.++..-.+ + T Consensus 71 ~~~~~~~i~~~~~g~~-iyi~---Nn~pYA~~LEyG~S-----------------------~QAP~G~v~~~~~~~~~-~ 122 (131) T protein:vir:78 71 TSNAANFVLNAADWHT-FTLT---NNLPYAQRLEYGWS-----------------------QQAPQGFVRVNVSRFQQ-L 122 (131) T ss_pred HHHHHHHHhhccCCce-EEEe---eCchhhhHhhcccc-----------------------CCCcchHHHHHHHHHHH-H Confidence 1111 2222 34688888999986 58899999999864443 3 Q ss_pred HHHHHHHHH Q lcl|NC_019933. 137 VEVANKAGA 145 (155) Q Consensus 137 ~~~i~~~l~ 145 (155) ++...++++ T Consensus 123 v~~~~~e~k 131 (131) T protein:vir:78 123 LNEEASKVK 131 (131) T ss_pred HHHHHHhcC Confidence 333333333 No 117 >protein:vir:96012 Length: 133 # NCBI annotation: ORF023 # Family: family:all:589 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239805;genbank:gi:66395471;genbank:GeneID:5132929 Probab=98.31 E-value=1.4e-08 Score=63.62 Aligned_cols=126 Identities=17% Similarity=0.243 Sum_probs=87.5 Q ss_pred eeec-cHHHHHHHHHHH-H-HHHHHHHHHHHHHHHHHHHHHHHHhCC--CCcchhhcceeeeecccccCCceEEEEEEec Q lcl|NC_019933. 5 ITSL-DISGVLSALNDL-R-DDSDSVSRTMAFESAAVVRDSAKAHVR--SKTGRLKGAIYAVYVPEESTEVRHVYAVSWN 79 (155) Q Consensus 5 m~~~-~l~~L~~~l~~l-~-~~~~~~~r~a~~~~a~~i~~eak~~aP--~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~ 79 (155) |+.+ |+++|++.|+.- + ..+.++..+|+.++++.|.++.|.+.- .|||..=+++..+.. ...+|.-++.|+|. 
T Consensus 1 m~evkGv~eilk~lE~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGatidev~~s~p--~~~~g~rtV~i~W~ 78 (133) T protein:vir:96 1 MRLIYDTKKLERELEKRLSKRALMRITDRALTEAGEVVLEAIRTNLKYFRDTGAEYGEVKLSKP--TWENGKRTIRVYWE 78 (133) T ss_pred CccccCHHHHHHHHHHhcCHHHHHHHhhHHHHHHHHHHHHHHHHhhHHHhhccceeeeEEecCc--eecCCceEEEEEee Confidence 5544 566666666553 3 467788999999999999999999875 499998888865432 23457778899998 Q ss_pred CCc--cccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 80 KKK--APHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRL 148 (155) Q Consensus 80 ~~~--a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i 148 (155) .++ ...-|+.||||... +|+++.-+ .-=-++-|++..+....+.+++.|++.| T Consensus 79 gp~~R~~iVHLNE~G~ytr------~Gk~i~Pr----------G~G~I~~al~~se~~y~~~vk~el~kll 133 (133) T protein:vir:96 79 GEKHRYSIVHLNEKGFYAK------DGKFIRPK----------GMGAIDKALRASRDKFFKVYAEEVSKLL 133 (133) T ss_pred cCCCceeeEeeecccceec------CCceeccc----------hhhHHHHHHHhhhHHHHHHHHHHHHHhC Confidence 764 44678899996422 22221100 1124788888888888888877777666 No 118 >protein:vir:1087 Length: 161 # NCBI annotation: Orf46 # Family: family:all:1029 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076741;genbank:gi:13095851;genbank:GeneID:920400 Probab=98.24 E-value=2.5e-08 Score=62.25 Aligned_cols=140 Identities=16% Similarity=0.153 Sum_probs=84.9 Q ss_pred eeeecc-HH-HHHHHHHHHHHHH-H---HHHHHHHHHHHHHHHHHHHHhCCC------Cc---chhhcceeeeeccc-cc Q lcl|NC_019933. 4 KITSLD-IS-GVLSALNDLRDDS-D---SVSRTMAFESAAVVRDSAKAHVRS------KT---GRLKGAIYAVYVPE-ES 67 (155) Q Consensus 4 ~m~~~~-l~-~L~~~l~~l~~~~-~---~~~r~a~~~~a~~i~~eak~~aP~------~t---G~Lr~sI~~~~~~~-~~ 67 (155) .|..-. |+ .|+..|+++...+ . +--.+...+||++..+.....+|. ++ |+|++||....... .. 
T Consensus 1 ~~~~~~~fdd~L~~~~~~v~klv~~lt~e~kakIT~AGAkv~a~~L~~~T~~kHy~~~kt~k~~HLADsI~~~~~niDg~ 80 (161) T protein:vir:10 1 MMEEKQLFEDIMNGIIFQAESVSTSLTVEDKAKITKAGANAFAIGLEKVTKDKHYRIRKTGENPHLADSILVQNTNIDGI 80 (161) T ss_pred CcchhHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHhHHHHHHHHHHHhhhhcCcCCCCCCcchhhhheeecccccCcc Confidence 344333 22 2444444443322 1 112456677888888777777664 34 59999997653211 11 Q ss_pred CCceEEEEEEecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHH--HHHHHHHHHHHHHH Q lcl|NC_019933. 68 TEVRHVYAVSWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDS--VKGRLVEVANKAGA 145 (155) Q Consensus 68 ~~g~~~~~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~--~~~~~~~~i~~~l~ 145 (155) .+|. ..|||...+++.+|||.-||.-..- -.. ..++.-.||++|++-+|+--+-+. .+++++++. . T Consensus 81 ~dG~--StVGw~~kka~ia~~indGtr~~~~--~~~----~~~~~n~Gt~~i~gDHFvd~~r~~~~~k~aV~~Ae----~ 148 (161) T protein:vir:10 81 KDGN--STVGWDYTKSRVGHLIENGTRFPMY--SKK----GTKYRKGGQVAITSDPFVSTYRDSMEAQVAMFSAE----A 148 (161) T ss_pred cCCc--eeccccCchhhhhhhhcccchhhhh--hcc----cccccCCcceeecCcchhHHHHhhhhhHHHHHHHH----H Confidence 2333 4699988899999999999842110 000 123445789999999999888773 445565555 5 Q ss_pred HHHHHHhccC Q lcl|NC_019933. 146 KRLAELRSKR 155 (155) Q Consensus 146 ~~i~k~~~k~ 155 (155) +.++|+++++ T Consensus 149 ~~y~eil~~k 158 (161) T protein:vir:10 149 EVFSEILKKK 158 (161) T ss_pred HHHHHHHHhh Confidence 6667777777 No 119 >protein:vir:96105 Length: 193 # NCBI annotation: hypothetical protein ORF028 # Family: family:all:503 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294445;genbank:gi:149408342;genbank:GeneID:5237224 Probab=98.21 E-value=1.3e-08 Score=63.80 Aligned_cols=116 Identities=21% Similarity=0.200 Sum_probs=56.7 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecC Q lcl|NC_019933. 
1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNK 80 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~ 80 (155) |++++.. +.|.+.+++|.++.++.+ .-|-+..+.+- .. ++ ..+|++ T Consensus 1 m~~~~~~---~~~~~~~~~l~~l~~~~v---------------------~vGi~~~~~~~----~~--~~---~~~G~~- 46 (193) T protein:vir:96 1 MSLRRDS---ELIAAHLQMLRAMRGRSV---------------------SAGWYSTARYP----DK--AG---GSVGIQ- 46 (193) T ss_pred Ceeccch---HHHHHHHHHHHHhcCCeE---------------------EEEEcCCCCCC----Cc--cc---ccccch- Confidence 7777544 344444444433322211 11222221110 00 11 123322 Q ss_pred CccccchhhhccccccCCCcC-------CCCc-----------eeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHH Q lcl|NC_019933. 81 KKAPHGHLVEYGHWRTNVVAE-------VDGK-----------WLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANK 142 (155) Q Consensus 81 ~~a~~~~~vEfGt~~~~~~~~-------~~~~-----------~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~ 142 (155) -+..+.+.|||....+..+. ..+. ..-+.+..-.++++||||||||+++.++++..+.+ T Consensus 47 -va~iAai~EfG~~I~~~~~~~~~~~~~~~g~~~~~~~~k~~~~~~~~~~~~~~v~IPaRPFlr~t~~~~~~~~~~~~-- 123 (193) T protein:vir:96 47 -VARIARLNEYGGTIDHPGGTRYIRDAIVRGRFVGVRFVRNDFPGETEVTKPHRITIPARPFMRYAWNLFSADRAAIQ-- 123 (193) T ss_pred -HHHHHhHHHcCCccccCccceeeeeccccccccccceeccCcceeeEeecceeccCCCcchhhhhHHHHHHHHHHHH-- Confidence 24457778999643221110 0000 01112223357789999999999999988766655 Q ss_pred HHHHHHHHHhccC Q lcl|NC_019933. 143 AGAKRLAELRSKR 155 (155) Q Consensus 143 ~l~~~i~k~~~k~ 155 (155) ++.+..++++. 
T Consensus 124 --~~~~~~~~~g~ 134 (193) T protein:vir:96 124 --NRIAMRLARGQ 134 (193) T ss_pred --HHHHHHHHhCC Confidence 55666666666 No 120 >protein:vir:99546 Length: 200 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039796;genbank:gi:126011046;genbank:GeneID:4818241 Probab=98.18 E-value=1.9e-08 Score=62.88 Aligned_cols=119 Identities=14% Similarity=0.123 Sum_probs=55.3 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecC Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNK 80 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~ 80 (155) |+......|=+.|.+.+++|..+.++.+. -|-+..+-+. .+ ++. ..|+ T Consensus 5 ~~~~~k~~~~~~~~~~~~~l~~l~~~~v~---------------------vGi~~~~~y~----~~--~~~---~dG~-- 52 (200) T protein:vir:99 5 FSKSNSVAAPLKHFQMLKQFDALKGKTVQ---------------------AGWFETDRYP----AK--EGE---TIGP-- 52 (200) T ss_pred cceeeeeecchHHHHHHHHHHHhhCCeEE---------------------EEEcCCCCcC----Cc--ccc---cccc-- Confidence 33333333434566666555443221111 1111111000 00 000 0111 Q ss_pred CccccchhhhccccccCCCcCC------------------CCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHH Q lcl|NC_019933. 81 KKAPHGHLVEYGHWRTNVVAEV------------------DGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANK 142 (155) Q Consensus 81 ~~a~~~~~vEfGt~~~~~~~~~------------------~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~ 142 (155) .-+..+.+.|||+...+..+.. +....++.+..-.++++||+|||||+++.+.++..+.+ T Consensus 53 ~va~IA~~~EfG~~i~~p~~~~~~~~~~~~g~~~g~rfv~k~~~~~~~~~~~~~v~IP~RPFlr~t~~~~~~~~~~~~-- 130 (200) T protein:vir:99 53 LVAKIARQLEFGGVINHPGGTKYIKDAIVDGRYVGTRFVHKSFQGEHEVTKAHQIVIPARPFMRLAWATFNKDKVKIQ-- 130 (200) T ss_pred hHHHHHhHHHcCCeeccCCCccccccccccccccccccccccccceeeeeccccccCCCcchhhHHHHHHHHHHHHHH-- Confidence 1244567789986432211100 00111122222347789999999999999988766655 Q ss_pred HHHHHHHHHhccC Q lcl|NC_019933. 
143 AGAKRLAELRSKR 155 (155) Q Consensus 143 ~l~~~i~k~~~k~ 155 (155) ++.+..++++. T Consensus 131 --~~~~~~~l~g~ 141 (200) T protein:vir:99 131 --AQIARQLLDGT 141 (200) T ss_pred --HHHHHHHHhCC Confidence 55566666666 No 121 >protein:vir:3994 Length: 168 # NCBI annotation: unknown # Family: family:all:1029 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116502;genbank:gi:14251135;genbank:GeneID:921309 Probab=98.13 E-value=2.4e-08 Score=62.38 Aligned_cols=138 Identities=14% Similarity=0.166 Sum_probs=80.7 Q ss_pred eccHHH-HHHHHHHHHHHH-H---HHHHHHHHHHHHHHHHHHHHhCCC------C---cchhhcceeeeeccc-ccCCce Q lcl|NC_019933. 7 SLDISG-VLSALNDLRDDS-D---SVSRTMAFESAAVVRDSAKAHVRS------K---TGRLKGAIYAVYVPE-ESTEVR 71 (155) Q Consensus 7 ~~~l~~-L~~~l~~l~~~~-~---~~~r~a~~~~a~~i~~eak~~aP~------~---tG~Lr~sI~~~~~~~-~~~~g~ 71 (155) +.+|++ |+..++++..++ . +--.+...+||++..+.....+|. + .++|++||....... ...+|. T Consensus 1 M~~~~d~l~~~~~~v~kl~~~lt~e~kakIT~AGAkv~a~~L~~~T~~kHy~~rktg~~~HLADsI~~~~~niDg~~dG~ 80 (168) T protein:vir:39 1 MVSFYDAMQLIINQAESLSTKMTVEDKAEVTKAGAKVFEQALAYEVRNRHYRHRDTGEDPHLADSIVMKNKNIDGVKDGQ 80 (168) T ss_pred CccHHHHHHHHHHHHHhccCCCCHHHHHHHHHHhHHHHHHHHHHHhHHhcccCCCCCCCccchhheeecccccCcccCCc Confidence 445543 444444444322 1 112344556666666655544442 3 378999997653311 112333 Q ss_pred EEEEEEecCC-------ccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHH--HHHHHHHHHH Q lcl|NC_019933. 72 HVYAVSWNKK-------KAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSV--KGRLVEVANK 142 (155) Q Consensus 72 ~~~~Vg~~~~-------~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~--~~~~~~~i~~ 142 (155) ..|||... +++.++||.-||.-+.-..+.+. ...-.|+++|++-+|+--+-+.. +++++++. 
T Consensus 81 --StVGw~~k~~~~~~~~a~iAr~lNDGTrf~~~~~~~~~-----~y~~~g~v~i~gDHFvd~~r~~~a~k~aV~~Ae-- 151 (168) T protein:vir:39 81 --SVVGWERSTEKGTHTKGYIANIINNGSRFPQFTTRSGR-----KYKNPGEVAVHADHFIEETRKNPIVQQGILKAE-- 151 (168) T ss_pred --eeccccCccccccccchhheehhccccccchhhhhccc-----ccccccceeecccchhHHHhhhhhhhHHHHHHH-- Confidence 46899875 78999999999842211011111 12346889999999998888743 55555555 Q ss_pred HHHHHHHHHhccC Q lcl|NC_019933. 143 AGAKRLAELRSKR 155 (155) Q Consensus 143 ~l~~~i~k~~~k~ 155 (155) .++++|+|+++ T Consensus 152 --~e~~~eil~~k 162 (168) T protein:vir:39 152 --AEAMRKIINRK 162 (168) T ss_pred --HHHHHHHHHhc Confidence 56667777777 No 122 >protein:vir:7412 Length: 168 # NCBI annotation: hypothetical protein # Family: family:all:1029 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839929;genbank:gi:30089899;genbank:GeneID:1260686 Probab=98.10 E-value=8.1e-08 Score=59.47 Aligned_cols=138 Identities=15% Similarity=0.165 Sum_probs=82.5 Q ss_pred eccHHH-HHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHhCCC------Cc---chhhcceeeeeccc-ccCCce Q lcl|NC_019933. 7 SLDISG-VLSALNDLRDDSDS----VSRTMAFESAAVVRDSAKAHVRS------KT---GRLKGAIYAVYVPE-ESTEVR 71 (155) Q Consensus 7 ~~~l~~-L~~~l~~l~~~~~~----~~r~a~~~~a~~i~~eak~~aP~------~t---G~Lr~sI~~~~~~~-~~~~g~ 71 (155) +.+|++ |...++++..++-+ --.+...+||++..+.-...+|. ++ ++|++||....... ...+| T Consensus 1 M~~~~~~l~~~~~~vekl~~~lt~eqkakITkAGAkv~~~~L~~~t~~kHy~~k~t~~~~HLaDsI~~~~~niDg~~dG- 79 (168) T protein:vir:74 1 MATFEEAMQLIINQAESLSTKMTVEDKAEVTKAGAKVFEQALAYEVRNRHYRHRDTGEDPHLADSIVMKNKNIDGVKDG- 79 (168) T ss_pred CccHHHHHHHHHHHHHhhccCCCHHHHHHHHHhhhHHHHHHHHHHhHHhhcccCCCcccchhhhheeecccccCcccCC- Confidence 334544 66666666554322 22355666777777766666653 34 49999997653311 11233 Q ss_pred EEEEEEecCC-------ccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHH--HHHHHHHHHHH Q lcl|NC_019933. 
72 HVYAVSWNKK-------KAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDS--VKGRLVEVANK 142 (155) Q Consensus 72 ~~~~Vg~~~~-------~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~--~~~~~~~~i~~ 142 (155) ...|||.+. +|+.++|+.-||.-+ .-....+.. ..-.|.++||+-+|+--+-+. .++++.++. T Consensus 80 -~s~VGf~~k~~~~~~~kA~iAr~lNDGTk~~-~~~~~~~~~----~~~~g~v~i~gDHFvd~~r~~~~~k~~V~~Ae-- 151 (168) T protein:vir:74 80 -QSVVGWERSTEKGTHTKGYIANIINNGSRFP-QFTTRSGRK----YKKPGEVAVHADHFIEETRMNLIVQQGILKAE-- 151 (168) T ss_pred -ceeecccccccccccchhhhhhhhccccccc-ccccccccc----cccccccccccchhHHHHHhhhhhHHHHHHHH-- Confidence 346899865 688999999998421 111111211 124578899999999887765 446666665 Q ss_pred HHHHHHHHHhccC Q lcl|NC_019933. 143 AGAKRLAELRSKR 155 (155) Q Consensus 143 ~l~~~i~k~~~k~ 155 (155) .+.+.|+|+++ T Consensus 152 --~~~y~eIl~~k 162 (168) T protein:vir:74 152 --AEAMRKIINRK 162 (168) T ss_pred --HHHHHHHHHhh Confidence 44555566555 No 123 >protein:vir:95157 Length: 144 # NCBI annotation: hypothetical protein ORF019 # Family: family:all:448 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293426;genbank:gi:148912847;genbank:GeneID:5228232 Probab=98.10 E-value=3.8e-08 Score=61.29 Aligned_cols=111 Identities=8% Similarity=0.022 Sum_probs=74.3 Q ss_pred eeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeeccc------------------c Q lcl|NC_019933. 5 ITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPE------------------E 66 (155) Q Consensus 5 m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~------------------~ 66 (155) |+. .+-++...++.+.+..++.+...+++.|..|..+...+.|||||.+|.|+.+..... . 
T Consensus 1 MA~-~~~~f~~~i~~~~~~ve~~~~~~~r~~a~~v~~~vv~~sPVDTGrfRanw~vs~~~p~~~~~~~~~~~~~~~t~d~ 79 (144) T protein:vir:95 1 MAK-SLLDLADRLEKKAKAIDEAASQNAVDTALAIVGDLAYKTPVDTSQALSNWIVTLESPSGQQIKPHFPGSQGSTQRA 79 (144) T ss_pred Cch-hhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccceeccccccccccccccccccccCCC Confidence 443 344566777777777888888888999999999999999999999999997664310 0 Q ss_pred cC---------------CceEEEEEEecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHH Q lcl|NC_019933. 67 ST---------------EVRHVYAVSWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDS 131 (155) Q Consensus 67 ~~---------------~g~~~~~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~ 131 (155) ++ .|. ++++. ...||+..+||||+ .++|..|.|.++.. T Consensus 80 sg~~tl~~~~~vi~~~~~g~-~iyi~---NnlpYA~~LEyG~S-----------------------~QAP~G~vr~~~q~ 132 (144) T protein:vir:95 80 SAAETLNSAKLVLRNKKPGQ-AIFIT---NNLPYIRRLNDGYS-----------------------AQAPAGFVERAVLI 132 (144) T ss_pred chhHHHHHHHHHHhhcCccc-eEEEe---eCchhhhhhhcccc-----------------------CCCcchHHHHHHHH Confidence 10 011 11222 34678888999986 57899999999965 Q ss_pred HHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 132 VKGRLVEVANKAGAKRLA 149 (155) Q Consensus 132 ~~~~~~~~i~~~l~~~i~ 149 (155) -.. +++..+ +. | T Consensus 133 ~~~-~v~~~~--~~---~ 144 (144) T protein:vir:95 133 GRK-MRKKFK--IK---D 144 (144) T ss_pred HHH-HHHhhc--cC---C Confidence 444 332221 01 1 No 124 >protein:vir:1028 Length: 168 # NCBI annotation: Orf48 # Family: family:all:1029 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076682;genbank:gi:13095791;genbank:GeneID:920342 Probab=98.04 E-value=8.5e-08 Score=59.33 Aligned_cols=138 Identities=15% Similarity=0.188 Sum_probs=81.2 Q ss_pred eccHHH-HHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHhCCC------Ccc---hhhcceeeeeccc-ccCCce Q lcl|NC_019933. 
7 SLDISG-VLSALNDLRDD----SDSVSRTMAFESAAVVRDSAKAHVRS------KTG---RLKGAIYAVYVPE-ESTEVR 71 (155) Q Consensus 7 ~~~l~~-L~~~l~~l~~~----~~~~~r~a~~~~a~~i~~eak~~aP~------~tG---~Lr~sI~~~~~~~-~~~~g~ 71 (155) +.+|++ |+..++++..+ ..+--.+...+||++..+.....+|. ++| +|++||....... ...+| T Consensus 1 M~~~~d~l~~~~~~vekl~~~ls~eqkakITkAGAkv~~~~L~~~tk~kHy~~k~t~~~~HLaDsI~~~~~niDg~~dG- 79 (168) T protein:vir:10 1 MVSFYDAMQLIVDRAEELSTKMSVEDKAEVTKAGAKVFEQALAYEVRNRHYRHRDTGEDPHLADSIVMKNKNIDGVKDG- 79 (168) T ss_pred CCcHHHHHHHHHHHHHHhhcCCCHHHHHHHhHhhhHHHHHHHHHHhhHhhhccCCCCccchhhhhheecccccccccCC- Confidence 344443 44444444332 12223455667777777777776663 344 8999997653211 11233 Q ss_pred EEEEEEecCC-------ccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHH--HHHHHHHHHHH Q lcl|NC_019933. 72 HVYAVSWNKK-------KAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDS--VKGRLVEVANK 142 (155) Q Consensus 72 ~~~~Vg~~~~-------~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~--~~~~~~~~i~~ 142 (155) ...|||.+. +++.++|+.-||.-+ .-....+.. ..-.|.++||+-+|+--+-+. .++++.++. T Consensus 80 -~s~VGf~~k~~~~~~~ka~iAr~lNDGTk~~-~~~~~~~~~----~~~~g~v~i~gDHFvd~~r~d~a~k~~V~~Ae-- 151 (168) T protein:vir:10 80 -QSVVGWERSTEKGTHTKGYIANIINNGSRFP-QFTTRSGRK----YKKPGEVAVHADHFIEETRKNPIVQQGILKAE-- 151 (168) T ss_pred -ceeecccCccccccccchheeeecccccccc-ccccccccc----cccccccccccchhHHHhhhchhhhHHHHHHH-- Confidence 346999875 789999999998421 111111211 124578899999999888765 356666665 Q ss_pred HHHHHHHHHhccC Q lcl|NC_019933. 
143 AGAKRLAELRSKR 155 (155) Q Consensus 143 ~l~~~i~k~~~k~ 155 (155) .+.+.|+|+++ T Consensus 152 --~~~y~eIl~~k 162 (168) T protein:vir:10 152 --AEAMRKIINRK 162 (168) T ss_pred --HHHHHHHHHhh Confidence 44555666655 No 125 >protein:vir:97190 Length: 148 # NCBI annotation: hypothetical protein ORF030 # Family: family:all:448 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294538;genbank:gi:149408259;genbank:GeneID:5237055 Probab=98.04 E-value=3.3e-08 Score=61.57 Aligned_cols=116 Identities=9% Similarity=0.033 Sum_probs=68.8 Q ss_pred eccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeeccc-----------ccCC-----c Q lcl|NC_019933. 7 SLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPE-----------ESTE-----V 70 (155) Q Consensus 7 ~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~-----------~~~~-----g 70 (155) +-++.++...+..+-+..+..+...+++.+..|..+...+.|+|||.+|.|+.+....- .+++ + T Consensus 1 m~~~~sFa~~i~~~~~~ve~~~~~~~r~~a~~i~~~vv~~sPVdTGrfRanw~vs~~~p~~~~~~~~dp~~~G~~~~~~~ 80 (148) T protein:vir:97 1 MPSLSEFSRRITLRGRKVAEGADALTRKVALAADQAVVSGTPVDTGRARSNWIAAIGSAPSSVIDAYSPGEAGSTEAANT 80 (148) T ss_pred CCccchhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhhhhhhheeecccccccccccCCCCCCcccccch Confidence 22344555555555555566666666677777777788899999999999996652211 1110 0 Q ss_pred e----------------EEEEEEecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHH Q lcl|NC_019933. 71 R----------------HVYAVSWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKG 134 (155) Q Consensus 71 ~----------------~~~~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~ 134 (155) . 
.+++++ ...+|+..+|||++ .++|..|.+-++..-.+ T Consensus 81 ~~~i~~~~~vi~~~k~g~~iyi~---NnlpYA~~LEyG~S-----------------------~QAP~G~v~~t~~~~~~ 134 (148) T protein:vir:97 81 QAAIDQAESVIRGYNYGEEIHIT---NNLPYIQRLNDGYS-----------------------AQAPANFVEQAVLEAVQ 134 (148) T ss_pred hHHHHHHHHHhhccCCCceEEEe---ecchhhhHhhcccc-----------------------CCCcchHHHHHHHHHHH Confidence 0 011222 34688888999986 47899999999854333 Q ss_pred HHHHHHHHHHHHHHHHHhcc Q lcl|NC_019933. 135 RLVEVANKAGAKRLAELRSK 154 (155) Q Consensus 135 ~~~~~i~~~l~~~i~k~~~k 154 (155) .++. .+.+++.=.. T Consensus 135 ----~v~~--~~~~~~~~~~ 148 (148) T protein:vir:97 135 ----VVQF--GRVVDGDPGS 148 (148) T ss_pred ----HHHh--hhhhcCCCCC Confidence 2211 2222222222 No 126 >protein:vir:107757 Length: 189 # NCBI annotation: gp20 # Family: family:all:503 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024868;genbank:gi:48697510;genbank:GeneID:2948378 Probab=97.95 E-value=6.1e-08 Score=60.14 Aligned_cols=89 Identities=17% Similarity=0.197 Sum_probs=49.8 Q ss_pred Cceeeeecc--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEe Q lcl|NC_019933. 1 MSSKITSLD--ISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSW 78 (155) Q Consensus 1 M~~~m~~~~--l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~ 78 (155) |+..|..-+ ++.|.+.|+.|. ++. +.-|-+..+ +..||. T Consensus 1 M~~~i~~~~~~~~~L~~~lk~l~---~k~---------------------V~VGi~~~~--------~y~dG~------- 41 (189) T protein:vir:10 1 MGRVIRKQGPARVKLNAFIKGMN---DYS---------------------VRIGWFSTA--------KYPDGT------- 41 (189) T ss_pred CcceeccCcHHHHHHHHHHHHhh---CCe---------------------EEEEecCCC--------CCCCcc------- Confidence 888888533 334444443321 100 011111000 001111 Q ss_pred cCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_019933. 
79 NKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLAELRSKR 155 (155) Q Consensus 79 ~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~k~~~k~ 155 (155) +.+..+.+.|||+-. .++||+|||||+++.++++..+.+ ...++.++.+. T Consensus 42 --~vA~Ia~~~E~G~p~---------------------~~IP~RPFlr~t~~~~~~~~~~~l----~~~~~~vl~G~ 91 (189) T protein:vir:10 42 --PTAYVASIHEFGAPS---------------------RGIPARSFIRPTIAAQQAAWSQQM----RFYAKQIVVGQ 91 (189) T ss_pred --cHHHHHHHHHhcCcC---------------------CCCCCchhhhHHHHHHHHHHHHHH----HHHHHHHHhCC Confidence 236677889999731 148999999999999988766655 56666666666 No 127 >protein:vir:96774 Length: 152 # NCBI annotation: hypothetical phage protein # Family: family:all:448 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224253;genbank:gi:62362388;genbank:GeneID:3345713 Probab=97.95 E-value=1e-07 Score=58.96 Aligned_cols=117 Identities=12% Similarity=0.076 Sum_probs=66.5 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--------------Ccchhhcceeeeecccc Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRS--------------KTGRLKGAIYAVYVPEE 66 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~--------------~tG~Lr~sI~~~~~~~~ 66 (155) |-.-+-.-+.-++...++.+-+..+..+...+++.+..+..+.....|| |||.+|.|+.++...-. T Consensus 1 ~~~~~~~~~~msFaa~i~~~~~~~e~~~~~~~R~~~~~i~~~vv~~sPVg~~~~~~~~a~~~ydtGrfRanw~vS~~~p~ 80 (152) T protein:vir:96 1 MLSCICGGNPMSWSKSLKNIIVKNENLTEKQLRAGLFDAANTVILGSPVGAPELWQQPAPNYYRAGSYRSNHRVSISKIT 80 (152) T ss_pred CcceeeCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccchhhhhhhheeeecCCC Confidence 2222221122222233334434444445555555666666677777999 99999999977633211 Q ss_pred cC-------Cce--------------EEEEEEecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccch Q lcl|NC_019933. 
67 ST-------EVR--------------HVYAVSWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFL 125 (155) Q Consensus 67 ~~-------~g~--------------~~~~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFl 125 (155) .+ .+. .++++. ...||+..+||||+ .++|..|. T Consensus 81 ~~~~~~~~~~~t~~~~~~~i~~~~~g~~iyi~---NnlPYA~~LEyG~S-----------------------~QAP~G~v 134 (152) T protein:vir:96 81 SFEKGISSQSSIMMDLQSDIAKFKIGETLFMT---NPLPYATSIEYGHS-----------------------SQAPNGVY 134 (152) T ss_pred cccccCCCCCchHHHHHHHHhhccccceEEEe---eCchhhhHhhcccc-----------------------CCCCchHH Confidence 11 000 111121 24678888888886 57899999 Q ss_pred hhHHHHHHHHHHHHHHHH Q lcl|NC_019933. 126 RPGYDSVKGRLVEVANKA 143 (155) Q Consensus 126 rPA~~~~~~~~~~~i~~~ 143 (155) |.++..-.+-+.++++.+ T Consensus 135 r~t~~~~~~~v~ea~~~~ 152 (152) T protein:vir:96 135 RPAVRRLVKFLNTELKAK 152 (152) T ss_pred HHHHHHHHHHHHHHhccC Confidence 999976666555555444 No 128 >protein:vir:94944 Length: 121 # NCBI annotation: hypothetical protein phage protein # Family: family:all:448 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239282;genbank:gi:66392064;genbank:GeneID:5076589 Probab=97.94 E-value=4.4e-08 Score=60.93 Aligned_cols=98 Identities=13% Similarity=0.143 Sum_probs=59.6 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeeccc---------ccCC-- Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPE---------ESTE-- 69 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~---------~~~~-- 69 (155) |+++.+. .|.+..+.+ ++.+...+++.+..+.++...+.|||||.+|.|+.+..... 
++++ T Consensus 2 ~~~sf~~----~i~~~~~~v----e~~~~~~~r~~~~~~~~~vv~~sPVdtGrfRanw~vs~~~p~~~~~~~~dp~g~~t 73 (121) T protein:vir:94 2 ISMKFNV----NLSRLRSNL----REEAKKKAIRIAQEIVNGVIARSPVLAGDYRSSWNVSEGSMEFKFNNGGNPANPTP 73 (121) T ss_pred ccchhhc----cHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHhcCCchhhhhccccccccCcccccCCCCCCCcchh Confidence 2222221 233333333 33344455666677777788899999999999996642110 1110 Q ss_pred -----------ceEEEEEEecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHH Q lcl|NC_019933. 70 -----------VRHVYAVSWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVK 133 (155) Q Consensus 70 -----------g~~~~~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~ 133 (155) +. ++++. ...||+..+||||+ .++|..|.+.++..-+ T Consensus 74 ~~~~~~~~~~~~~-~iyi~---NnlpYA~~LE~G~S-----------------------~QAP~G~v~~t~~~~q 121 (121) T protein:vir:94 74 APAIVVSSNVALP-HFYIT---NGAPYAQQLEKGSS-----------------------TQAPLGIVRVTLASLR 121 (121) T ss_pred HHHHHHHHhhccc-eEEEe---eCcchhhhhhcccC-----------------------CCCcchHHHHHHHhhC Confidence 11 11222 34678888999986 5789999999986655 No 129 >protein:vir:5257 Length: 148 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852762;genbank:gi:31544037;uniprot:Q7Y5T8;genbank:GeneID:2753554 Probab=97.92 E-value=3.2e-08 Score=61.68 Aligned_cols=96 Identities=16% Similarity=0.264 Sum_probs=52.1 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeee-cccccCCceEEEEEEec Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVY-VPEESTEVRHVYAVSWN 79 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~-~~~~~~~g~~~~~Vg~~ 79 (155) |+++++. +.++|.+.++.|.++.++ .++=.|.... 
......+| T Consensus 1 M~~~~k~-~~~~~~~l~~~l~~l~~~--------------------------~v~VGi~~~~~~~~~~~~g--------- 44 (148) T protein:vir:52 1 MAVTVTA-NFSAAKQLIEQMKSLKEK--------------------------AVYVGFPAEFDEKVKGSEN--------- 44 (148) T ss_pred Ccccccc-ccHHHHHHHHHHHHhhCC--------------------------eEEEEeecCcCCCCCCCCC--------- Confidence 9998884 666666666666554321 0111111000 00000011 Q ss_pred CCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHH--HHhccC Q lcl|NC_019933. 80 KKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLA--ELRSKR 155 (155) Q Consensus 80 ~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~--k~~~k~ 155 (155) -+.+..+.+.|||+ .++||+|||||+++.++++..+.+...+...++ .+|..= T Consensus 45 ~~vA~ia~~~E~G~-----------------------~~IP~Rpflr~t~~~~~~~~~~~~~~~~~~~~~~~~~L~~~ 99 (148) T protein:vir:52 45 FNLASLAAVLEFGN-----------------------EHIPARPFLRQTLEENQEKYTALFIQWFDQGVPAAQIYERL 99 (148) T ss_pred CCHHHHHHHHhcCC-----------------------CCCCCcchhHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHH Confidence 12356777888886 369999999999999999887776554443221 000000 No 130 >protein:vir:80970 Length: 112 # NCBI annotation: gp10 # Family: family:all:899 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468396;genbank:gi:157324970;genbank:GeneID:5601405 Probab=97.89 E-value=2.7e-07 Score=56.57 Aligned_cols=112 Identities=15% Similarity=0.067 Sum_probs=69.9 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecC Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNK 80 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~ 80 (155) |++++. ++++.+++.|. +...+|....++.|..++..-+|.+||.|++|-.. 
.++|.+ .+ T Consensus 1 M~vkV~-id~~~~~~~l~-------~a~~~aq~~~~~ev~~~~~~yVP~~tG~L~~s~~~------~~~g~I----~y-- 60 (112) T protein:vir:80 1 MPIKVR-VDLSKAKGSVK-------KAKERGQFALINQAAADIALYVPFLSGDLSNQYVI------MNDKEI----MW-- 60 (112) T ss_pred CceeEE-eehHHHHHHHH-------HHHHHHHHHHHHHHHHHhhcCCCcccCccccceee------ccCceE----Ee-- Confidence 999988 47776655443 34456677778888888889999999999998421 123433 22 Q ss_pred CccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 81 KKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRL 148 (155) Q Consensus 81 ~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i 148 (155) .+||++.+-||..-.+......+.. ++=|.| |.....+++++.+.+.+++.| T Consensus 61 -~tPYAr~qYY~~~~~~~~~~~p~ag--------------~~W~er-ak~~~~~~~~~~~~k~~~~~l 112 (112) T protein:vir:80 61 -TSIYARRLYNGINFNFTLTHHPLAG--------------PKWDQR-AKVDKLESWIEVAQKAVEEGL 112 (112) T ss_pred -cCchhhHhhhcccCCCCcCCCCCcc--------------hhhHHH-HHhhhhHHHHHHHHHHHhhcC Confidence 2677887777643211111100111 222444 777777888888888888777 No 131 >protein:vir:2688 Length: 123 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075507;genbank:gi:12719436;genbank:GeneID:920156 Probab=97.80 E-value=2.6e-07 Score=56.68 Aligned_cols=117 Identities=16% Similarity=0.134 Sum_probs=77.4 Q ss_pred HHHHHHH-HH-HHHHHHHHHHHHHHHHHHHHHHHHhCC--CCcchhhcceeeeecccccCCceEEEEEEecCCc--cccc Q lcl|NC_019933. 13 VLSALND-LR-DDSDSVSRTMAFESAAVVRDSAKAHVR--SKTGRLKGAIYAVYVPEESTEVRHVYAVSWNKKK--APHG 86 (155) Q Consensus 13 L~~~l~~-l~-~~~~~~~r~a~~~~a~~i~~eak~~aP--~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~~--a~~~ 86 (155) |+..|+. |+ ..+.++..+|+.++++.|..+.+.+.- .|||..=+++..+......+.+..++.|+|..+. 
...- T Consensus 1 ilk~lE~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGatidev~~s~p~~~~g~~~rtV~i~W~gp~~R~~iV 80 (123) T protein:vir:26 1 MLKKLESVYGKQSMQAKSDRALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEWVGPMNRKNII 80 (123) T ss_pred ChhhHHHhcCHHHHHHhhhHHHHHHHHHHHHHHHHhhHHhhhccceeeeEEecCeeeccCCccceEEEEeecCCCceeeE Confidence 4444443 33 457788999999999999999999875 4999988888654332223333467899997764 4567 Q ss_pred hhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 87 HLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAK 146 (155) Q Consensus 87 ~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~ 146 (155) |+.|||+.+ +|+++.-+ .-==++-|++..+....+.++++|++ T Consensus 81 HLNE~GYtr-------~Gk~i~PR----------G~G~i~~a~~~se~~y~~~vk~eL~k 123 (123) T protein:vir:26 81 HLNEHGYTR-------DGKKYTPR----------GFGVIAKTLAANERKYREIIKKELAR 123 (123) T ss_pred eeeccceec-------CCCeEccc----------hhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 889999743 22211100 11136777777777777777666666 No 132 >protein:vir:80425 Length: 134 # NCBI annotation: BcepGomrgp15 # Family: family:all:448 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210235;genbank:gi:146329927;genbank:GeneID:5123534 Probab=97.72 E-value=1.3e-07 Score=58.25 Aligned_cols=108 Identities=10% Similarity=0.066 Sum_probs=60.0 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccc-------cCCce-- Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEE-------STEVR-- 71 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~-------~~~g~-- 71 (155) |+|... ++. ..+ ..++.+...+++.+..+..+...+.|+|||.+|.|+.+....-. ..+|. 
T Consensus 1 msF~~~---i~~---~~~----~ve~~~~~~~r~~a~~~~~~vv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~g~~~ 70 (134) T protein:vir:80 1 MSYTDR---FNV---IAK----GIEDNVDNLVKNVALAIGSNVIADTPILTGQARRNWQTELNQMPESVLDIPESPSEGM 70 (134) T ss_pred CCcccC---HHH---HHH----HHHHHHHHHHHHHHHHHHHHHHHhCCCcchhhhcccceeecCcccccccCcCCCCccc Confidence 666443 222 233 33344444555556666666777899999999999976532210 00110 Q ss_pred -----------------EEEEEEecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHH Q lcl|NC_019933. 72 -----------------HVYAVSWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKG 134 (155) Q Consensus 72 -----------------~~~~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~ 134 (155) .+++++ ...+|+..+||||+ .++|..|.+-+...-.. T Consensus 71 ~~~~~~~~~vi~~~k~g~~iyi~---Nn~pYA~~LEyG~S-----------------------~QAP~G~v~~t~~~~~~ 124 (134) T protein:vir:80 71 DEALQVLQQTVGQYKAGDTVHIT---NNAPYIKELNSGSS-----------------------QQAPANFVETSIMRATR 124 (134) T ss_pred hhhHHHHHHHHhhccCcceEEEe---eCchhhhhhhcccc-----------------------CCCcchHHHHHHHHHHH Confidence 112222 34678888999986 47899999987743322 Q ss_pred HHHHHHHHHHHHHHHHHhcc Q lcl|NC_019933. 135 RLVEVANKAGAKRLAELRSK 154 (155) Q Consensus 135 ~~~~~i~~~l~~~i~k~~~k 154 (155) +++.. + ++-+ T Consensus 125 -~v~~~--------~-~~~~ 134 (134) T protein:vir:80 125 -LIRNV--------K-VVPQ 134 (134) T ss_pred -HHHhh--------c-cCCC Confidence 22211 1 1112 No 133 >protein:vir:101594 Length: 173 # NCBI annotation: hypothetical protein # Family: family:all:26502 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112510;genbank:gi:53793610;interpro:IPR010064;uniprot:Q5ZGE3;genbank:GeneID:3101702 Probab=97.63 E-value=9.9e-07 Score=53.49 Aligned_cols=122 Identities=11% Similarity=-0.017 Sum_probs=80.0 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecCCccccch Q lcl|NC_019933. 
8 LDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNKKKAPHGH 87 (155) Q Consensus 8 ~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~~a~~~~ 87 (155) +.|+||+++++.|..+.+.+ ..++.++.....+.+...+- .. .+.++++....+.+...... T Consensus 1 i~i~Gld~L~~~L~~l~~~~-~~~~~~a~~~~a~~i~~~ak-------~~-----aPv~TG~Lr~sI~~~~~~~~----- 62 (173) T protein:vir:10 1 MAVKGVAEVIAELRKIGKDI-DKNINATTEEAANFIEDRAK-------TL-----APKNFGKLAQSISTSDLKAK----- 62 (173) T ss_pred CcchhHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHH-------Hh-----CCcCchhhhhcceeeeeccC----- Confidence 99999999999999988665 67888887777777666652 22 23344444443333322111 Q ss_pred hhhccccccCCCcCCCCceeeeeecccceeeeCCccchh----------------------------------------- Q lcl|NC_019933. 88 LVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLR----------------------------------------- 126 (155) Q Consensus 88 ~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlr----------------------------------------- 126 (155) |+ .....+....|+.|+|+||.+|+|||..+ T Consensus 63 ----~~----~~~~v~~~~~Ya~fvEfGT~~m~a~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~ 134 (173) T protein:vir:10 63 ----DL----ISKKITVNELYGAYMEFGTGAKVSVPKEFADMAASFKGQKTGSFKDGLESIKAWCRAKGIDEKAAYPIFA 134 (173) T ss_pred ----ce----eEEeeCCCcccchhhhcccccccCCCchhhhhhcccccccccccccccccccccccccccchhcccceee Confidence 11 11122345689999999999999999633 Q ss_pred ----------hHHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_019933. 
127 ----------PGYDSVKGRLVEVANKAGAKRLAELRSKR 155 (155) Q Consensus 127 ----------PA~~~~~~~~~~~i~~~l~~~i~k~~~k~ 155 (155) |=|--.-++..+.+.+.|.+.|.+.++|= T Consensus 135 ~~~~~G~~aqPFl~PA~~~~~~~~~~~i~~~i~~~lrk~ 173 (173) T protein:vir:10 135 KILGAGINPQPFLYPAWIEGKKQYLKDLENLLKTYNKKI 173 (173) T ss_pred EeecCCCCCCccchhHHHHhHHHHHHHHHHHHHHHhhcC Confidence 12222336667777788888888888877 No 134 >protein:vir:45 Length: 112 # NCBI annotation: gp10 # Family: family:all:899 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463471;swissprot:trembl:q9t1b3;genbank:gi:16798793;uniprot:Q9T1B3;genbank:GeneID:922369 Probab=97.62 E-value=1.3e-06 Score=52.78 Aligned_cols=112 Identities=14% Similarity=0.048 Sum_probs=68.7 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecC Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNK 80 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~ 80 (155) |++++. ++++.++..| .+++.+|....++.|..++..-+|.++|.|++|-.+ .++|.++ + T Consensus 1 M~vkv~-vn~~~~~~~l-------~~a~~r~q~~~~~ev~~~~~~yVP~~~G~L~~S~~~------~~~g~I~----y-- 60 (112) T protein:vir:45 1 MPIKVR-VDLSKAKGSV-------KKAKERGQFALINQAAADIALYVPFLSGDLSNQYVI------MNDKEIM----W-- 60 (112) T ss_pred CceeEE-eehHHHHHHH-------HHHHHHHHHHHHHHHHHHhhcCCccccCccccceee------ccCCeEE----e-- Confidence 999998 4766655433 234455677778888888899999999999998422 1234322 2 Q ss_pred CccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 81 KKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRL 148 (155) Q Consensus 81 ~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i 148 (155) .+||+++.=||..-.+......+. 
.++=|.| |.....+++++.+.+.+++.| T Consensus 61 -~tPYAr~qYY~~~~~~~~~~~p~a--------------g~~W~er-ak~~~~~~~~~~~~k~~~~gl 112 (112) T protein:vir:45 61 -TSIYARRLYKGINFNFTLTHHPLA--------------GPEWDQR-AKIDKMDVWEKVAQKAVEEGL 112 (112) T ss_pred -cChhhHHhhhccccCCCCCCCCCC--------------chhhHHH-HHHhhHHHHHHHHHHHHhhcC Confidence 256777766654321111100011 1222444 777777888888877777777 No 135 >protein:vir:98557 Length: 149 # NCBI annotation: gp14 # Family: family:all:370 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958069;genbank:gi:41057366;genbank:GeneID:2744228 Probab=97.51 E-value=1.7e-06 Score=52.22 Aligned_cols=119 Identities=16% Similarity=0.140 Sum_probs=64.9 Q ss_pred eeeccHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHh-----CCC------------------------Ccchh Q lcl|NC_019933. 5 ITSLDISGVLSALNDLRD-DSDSVSRTMAFESAAVVRDSAKAH-----VRS------------------------KTGRL 54 (155) Q Consensus 5 m~~~~l~~L~~~l~~l~~-~~~~~~r~a~~~~a~~i~~eak~~-----aP~------------------------~tG~L 54 (155) |+ +|+.|++.|+.|-. +.-.-.++-++.-|..++...+.+ .|. .+|.| T Consensus 1 m~--d~~~l~~~L~~ll~~L~~~~~~~ll~~Ig~~l~~~t~~rf~~q~~PdG~~W~p~~~~~~~~k~~~~~~~l~~~g~l 78 (149) T protein:vir:98 1 MS--ELTALQERLTGLIASLSPAARRQMAADIAKKLRASQQQRIRRQQAPDGTPYAARKRQSVRSKKGRIRREMFARLRT 78 (149) T ss_pred Cc--hHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHHhccCCCCcccchhhhh Confidence 44 56666666666532 211123445566666666666554 231 13556 Q ss_pred hcceeeeecccccCCceEEEEEEecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHH Q lcl|NC_019933. 55 KGAIYAVYVPEESTEVRHVYAVSWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKG 134 (155) Q Consensus 55 r~sI~~~~~~~~~~~g~~~~~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~ 134 (155) .+||..... .+ .+.||+......|+....||....... . ...+.+||+|||-=. 
+..++ T Consensus 79 ~~sl~~~~~----~~---~~~V~~~Gs~~~yAa~HQfG~~~r~~~---~----------~~~~~iPaRp~LG~s-~~d~~ 137 (149) T protein:vir:98 79 NRFMKAKGS----DS---AAVVEFTGRVQRMARVHQYGLKDRPNR---H----------SRDVQYAARPLLGFT-RDDEQ 137 (149) T ss_pred hhhhhheec----CC---eeEEEecCcchHHhhHhhccccccccC---C----------CcceeccccccCCCC-HHHHH Confidence 666644321 12 234555556678999999996422111 1 113579999999744 33455 Q ss_pred HHHHHHHHHHHH Q lcl|NC_019933. 135 RLVEVANKAGAK 146 (155) Q Consensus 135 ~~~~~i~~~l~~ 146 (155) ++++.|.+.|.+ T Consensus 138 ~i~~~i~~~l~~ 149 (149) T protein:vir:98 138 MIEDIIIRHLGK 149 (149) T ss_pred HHHHHHHHHhhC Confidence 566666555555 No 136 >protein:vir:194 Length: 149 # NCBI annotation: Gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037704;genbank:gi:9634169;genbank:GeneID:1262536 Probab=97.50 E-value=1.3e-06 Score=52.87 Aligned_cols=129 Identities=16% Similarity=0.037 Sum_probs=73.7 Q ss_pred CceeeeeccH-HHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHH-------HHhCC------CCcchhhcceeeeeccc Q lcl|NC_019933. 1 MSSKITSLDI-SGVLSALNDLR-DDSDSVSRTMAFESAAVVRDSA-------KAHVR------SKTGRLKGAIYAVYVPE 65 (155) Q Consensus 1 M~~~m~~~~l-~~L~~~l~~l~-~~~~~~~r~a~~~~a~~i~~ea-------k~~aP------~~tG~Lr~sI~~~~~~~ 65 (155) |+++-- -+| ..|..+-+++. ...+++++.++......++..| +.+.- ..+|.+...|.+..... T Consensus 6 ~~i~Gl-~~l~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~si~~~~~~~~~~~~~~~~v~~~~~~~ 84 (149) T protein:vir:19 6 LDFSGL-NDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIDRAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVNP 84 (149) T ss_pred eehhhH-HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCchhhhhhccccccccccccceeeccccccccc Confidence 777631 122 33444444444 3456666666666555555544 22221 24566777776544333 Q ss_pred ccCCceEEEEEEecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 
66 ESTEVRHVYAVSWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGA 145 (155) Q Consensus 66 ~~~~g~~~~~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~ 145 (155) ..+... ...++.+...++||||+||||..... . + | -+|=+.=.=+...+.+.+.+.++|. T Consensus 85 ~~~~~~-~~~~~~~~~~~~y~~f~E~GT~~~~a-~---P---------F------~~pA~~~~k~~~~~~~~~~l~~~l~ 144 (149) T protein:vir:19 85 RTGNSD-NTMKANNPRNAFYWRFVELGTANMPA-H---P---------F------VRPAYDTREEEAASVAIARMNQAID 144 (149) T ss_pred cccccc-ceeecCCCCccceeeeeccCCCCCCC-C---c---------c------hhHHHHHHHHHHHHHHHHHHHHHHH Confidence 322222 23344566678999999999964321 1 1 1 3566665556666778888888888 Q ss_pred HHHHH Q lcl|NC_019933. 146 KRLAE 150 (155) Q Consensus 146 ~~i~k 150 (155) +++.| T Consensus 145 k~~~k 149 (149) T protein:vir:19 145 EVLSK 149 (149) T ss_pred HHhcC Confidence 88888 No 137 >protein:vir:7449 Length: 123 # NCBI annotation: gp26 # Family: family:all:2713 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818564;genbank:gi:29567001;genbank:GeneID:1260238 Probab=97.45 E-value=7.4e-06 Score=48.71 Aligned_cols=121 Identities=13% Similarity=0.101 Sum_probs=79.4 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--CCcchhhcceeeeecccccCCceEEEEEEe Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVR--SKTGRLKGAIYAVYVPEESTEVRHVYAVSW 78 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP--~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~ 78 (155) |.--=-.+++++|.+.++++......++--=+.-.|.....+||.||| .+||+-|++|.-.... .|.-.+.|.. 
T Consensus 1 ~~~~~f~~d~~~l~~~i~~~~~k~~~~~~~~~d~~a~~le~~aK~nApW~DRTg~ARqgl~~~~~~----~g~~~~~Iyl 76 (123) T protein:vir:74 1 MAKVTFEYDAQELRTNIRNLDRRMESAVDALMDYEAAYATGQLKMRAPWTDRTGAARSGLLAVANK----LGPGSHELIM 76 (123) T ss_pred CceeEEEecHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccchhhhhhhcccccc----CCCceEEEEE Confidence 552222257788999999887666666666666688899999999999 5799999999533211 2222233333 Q ss_pred cCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_019933. 79 NKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLAELRSKR 155 (155) Q Consensus 79 ~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~k~~~k~ 155 (155) .++ -.|+-|+|.++... ---|+|+.+.-.+++++-+ +.-+.++.+-| T Consensus 77 sh~-veYG~~LEla~~~k-------------------------yaIi~Ptv~~~~~~im~g~----~~ll~~l~~~~ 123 (123) T protein:vir:74 77 SYS-VHYGIWLEIANSGQ-------------------------YAVIGPFLPVMGRKLMHDL----EHLIDRLERAQ 123 (123) T ss_pred ecC-eeecceeeecCCCC-------------------------ceeecchHHHHhHHHHHHH----HHHHHHhhccC Confidence 333 46888899887421 1157899887777777666 44455555555 No 138 >protein:vir:1891 Length: 179 # NCBI annotation: gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037671;genbank:gi:9634129;genbank:GeneID:1262520 Probab=97.38 E-value=3.4e-06 Score=50.53 Aligned_cols=135 Identities=13% Similarity=-0.004 Sum_probs=77.7 Q ss_pred CceeeeeccHH-HHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHH------------HHhCC--------CCcchhhcce Q lcl|NC_019933. 1 MSSKITSLDIS-GVLSALNDLR-DDSDSVSRTMAFESAAVVRDSA------------KAHVR--------SKTGRLKGAI 58 (155) Q Consensus 1 M~~~m~~~~l~-~L~~~l~~l~-~~~~~~~r~a~~~~a~~i~~ea------------k~~aP--------~~tG~Lr~sI 58 (155) ++++--. +|. .|+++-+++. ...+++++.|+.--...++..| +.++. 
..+|.+.-++ T Consensus 7 ~~i~Gl~-eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~~~~~~~~~g~~~~~v 85 (179) T protein:vir:18 7 VSLTGLE-SLLGKMEAVSEVTRNKAGRFALRKAANIIRDRARSNASRVDDPLTKEAIHKNIVASFSSKQFRRTGDLAFRV 85 (179) T ss_pred EEeecHH-HHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccccchhhhhhheeecccccccccccceeEee Confidence 7766432 433 3444445554 3567777777777666666554 22221 2345554444 Q ss_pred eeeeccc---------ccCC-ceEEE---EEEecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccch Q lcl|NC_019933. 59 YAVYVPE---------ESTE-VRHVY---AVSWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFL 125 (155) Q Consensus 59 ~~~~~~~---------~~~~-g~~~~---~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFl 125 (155) .+..... ..+. +.... ..+....++||||||||||+.... .+| -+|=+ T Consensus 86 gv~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~y~~fvEfGT~kmpa-------------~PF------lrPA~ 146 (179) T protein:vir:18 86 GVMGGARQYANTKANVRKGRAGKTYKTSGDKGNPGGDTWYWRFLEFGTEHTSA-------------RPI------LRPAM 146 (179) T ss_pred ecccccccccccccccccCcccccccccccccCCCCccceeEEeccCCCCCCC-------------Ccc------chhhH Confidence 3321000 0000 11111 112234568999999999964221 111 47788 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_019933. 126 RPGYDSVKGRLVEVANKAGAKRLAELRSKR 155 (155) Q Consensus 126 rPA~~~~~~~~~~~i~~~l~~~i~k~~~k~ 155 (155) .=.-+...+.+.+.+.++|.+.|.+.-++= T Consensus 147 ~~~~~~a~~~i~~~l~~~i~k~lk~~~~~~ 176 (179) T protein:vir:18 147 NGVDNDVINVFSTEMGKAIDRAIRLAMKKG 176 (179) T ss_pred HhhHHHHHHHHHHHHHHHHHHHHHhhcccC Confidence 888888889999999999999998887666 No 139 >protein:vir:4096 Length: 140 # NCBI annotation: Gp9 protein # Family: family:all:28682 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510990;swissprot:trembl:q8w600;genbank:gi:17488512;uniprot:Q8W600;genbank:GeneID:1260318 Probab=97.29 E-value=2.9e-06 Score=50.97 Aligned_cols=133 Identities=11% Similarity=0.131 Sum_probs=88.2 Q ss_pred Cc--eeeeeccHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhCCCCc---chhhcceeeeecccccCC-ceEE Q lcl|NC_019933. 
1 MS--SKITSLDISGVLSALNDLRDDSDSVSRTMAFE-SAAVVRDSAKAHVRSKT---GRLKGAIYAVYVPEESTE-VRHV 73 (155) Q Consensus 1 M~--~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~-~a~~i~~eak~~aP~~t---G~Lr~sI~~~~~~~~~~~-g~~~ 73 (155) |+ ++...-+++.|...+.+++..+++++.+.+.. |+.++.+.+-.+.|+.. |.+|+-...+.+..-... .-.. T Consensus 1 m~~~~sld~s~~e~L~~~i~r~P~ksE~~IN~~L~tkg~~~~~~~I~~~iPvS~~~k~~~RnK~HAK~s~pl~~~~~NLg 80 (140) T protein:vir:40 1 MCAKWSLEFSDVERLSNLISQIPNKSEAIINKTLETKAVPLVKLNIEKRINLSKNWKGQLLNKNHAQSSGPFNVKMGNLG 80 (140) T ss_pred CCcceecchhhHHHHHHHHHhccchHHHHHHHHHHhhhhHHHHhhhhhccCcCccchhhhccccchhhhhhhhhhhhhcc Confidence 87 55555567788888888888999999888875 67778888889999863 356666655433321111 0011 Q ss_pred EEEEecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019933. 74 YAVSWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLAELRS 153 (155) Q Consensus 74 ~~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~k~~~ 153 (155) +.+-+ +..|+.++ +...-.|+++--||-||+-.++...+.+++.+.+.|-+.|.+.|. T Consensus 81 f~i~~---k~kf~YLv-------------------fPD~G~G~sn~~~q~FmerGl~~~t~~i~E~L~~~l~k~in~~Lg 138 (140) T protein:vir:40 81 FELLT---KPKFNYLI-------------------FPDQGIGKHNKTKQDFMQLGVEESSQEIVEMLEQAVFKEINDTLG 138 (140) T ss_pred eeEee---cCcccccc-------------------cccccCCCCCcchHHHHHhccccchhHHHHHHHHHHHHHHHHhhc Confidence 11111 22121111 111122444445777999999999999999999999999999999 Q ss_pred cC Q lcl|NC_019933. 
154 KR 155 (155) Q Consensus 154 k~ 155 (155) +| T Consensus 139 g~ 140 (140) T protein:vir:40 139 GK 140 (140) T ss_pred CC Confidence 99 No 140 >protein:vir:80037 Length: 199 # NCBI annotation: gp11 # Family: family:all:503 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468715;genbank:gi:157325295;genbank:GeneID:5601728 Probab=97.29 E-value=1.5e-06 Score=52.48 Aligned_cols=102 Identities=16% Similarity=0.180 Sum_probs=47.8 Q ss_pred HHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecCCc----cccchhhhccccccCCCc----------- Q lcl|NC_019933. 36 AAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNKKK----APHGHLVEYGHWRTNVVA----------- 100 (155) Q Consensus 36 a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~~----a~~~~~vEfGt~~~~~~~----------- 100 (155) ..+..+. .+.+ .|.+.|. . -....+.||+-... +..+...|||....+..+ T Consensus 1 m~vt~~~--~~~~----~~~~~l~-------~-L~~k~v~vGi~~~d~~~~~~Ia~~~E~Ga~I~~~~~~l~Ip~~~a~~ 66 (199) T protein:vir:80 1 MKVTTDK--STMN----KAIRELD-------Q-LDRYSLQIGLFGEDDSFIQMIAGVHEFGLTIRPKGKYLTIPTPEAGD 66 (199) T ss_pred CcccccH--HHHH----HHHHHHH-------H-hcCCEEEEEEecCCCcchhheeehhhcCCeeecCCceeeecchhhhc Confidence 1111110 0000 1111110 0 11234566665432 345556788854221100 Q ss_pred ----------CCCCceee--------eeecccce--eeeCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_019933. 101 ----------EVDGKWLF--------TKEKLATP--VHVPARSFLRPGYDSVKGRLVEVANKAGAKRLAELRSKR 155 (155) Q Consensus 101 ----------~~~~~~~~--------~~~~~~gt--~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~k~~~k~ 155 (155) ..++...+ ....++|. +++||||||||+++.++++..+.+ .+.+..++++. 
T Consensus 67 ~k~~~~~~~~~p~g~~~~~~~~~~~~~~~~e~g~~~~~IP~RPFlr~t~~~~~~~~~~~~----~~~~~~vl~g~ 137 (199) T protein:vir:80 67 RRARDIPGLFKPKGKNILAVAGPDGKLTVMFYLKTEVNIPERSFLRSTFDEKSNKWGELF----EGWIDDVIHGK 137 (199) T ss_pred ccccccCcccccCCcceeeeeccccceeeeeeccccccCCCCchhHHHHHHHHHHHHHHH----HHHHHHHHhCC Confidence 00111111 11134444 478999999999999988876655 55566666666 No 141 >protein:vir:4347 Length: 164 # NCBI annotation: Orf14 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061510;genbank:gi:9635606;genbank:GeneID:1262873 Probab=97.24 E-value=8.5e-06 Score=48.37 Aligned_cols=132 Identities=14% Similarity=0.107 Sum_probs=57.8 Q ss_pred CceeeeeccH-HHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHH------------HhCC--------CCcchhhcce Q lcl|NC_019933. 1 MSSKITSLDI-SGVLSALNDLR-DDSDSVSRTMAFESAAVVRDSAK------------AHVR--------SKTGRLKGAI 58 (155) Q Consensus 1 M~~~m~~~~l-~~L~~~l~~l~-~~~~~~~r~a~~~~a~~i~~eak------------~~aP--------~~tG~Lr~sI 58 (155) |+++--. +| ..|.++-.+.. ...+.+++.++.--...++..|- .+.- ..+|.+...+ T Consensus 7 ~~i~Gl~-eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~~~~~~~~~~~~~~~v 85 (164) T protein:vir:43 7 FSITGLD-SLLGKLDSVTDDVKRRGGRAALRKAAMIVVQAAKQGAEKVDDPGTGRSISDNIALRWNGRLFKRTGDLGFRI 85 (164) T ss_pred EeeecHH-HHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccCCCccchhhhhhhhhcccCccccccceeEEe Confidence 7765332 32 23333334443 34455655555555544444331 1111 1233333333 Q ss_pred eeeecccccCCceEEEEEEecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhh----HHHHHHH Q lcl|NC_019933. 59 YAVYVPEESTEVRHVYAVSWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRP----GYDSVKG 134 (155) Q Consensus 59 ~~~~~~~~~~~g~~~~~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrP----A~~~~~~ 134 (155) .+..... .........++ ..+++|||||+||||+.... . 
+ | -+|=+.= +++.-.+ T Consensus 86 g~~~~~~-~~~~~~~~~~~-~~~~~~y~~f~EfGT~km~a-~---P---------F------lrPA~~~~k~~~~~~~~~ 144 (164) T protein:vir:43 86 GVLHGAV-LPKKGERSDKT-ANAPTPHWRLLEFGTEDMRA-Q---P---------F------MRSALADNIAEVTSTFVS 144 (164) T ss_pred ccccccc-ccccccccccC-CCCCcceEEEeecCCCCCCC-C---c---------c------hhhhHHHhHHHHHHHHHH Confidence 2211100 00001111122 23457999999999964321 1 0 1 3444544 4444455 Q ss_pred HHHHHHHHHHHHHHHHHhcc Q lcl|NC_019933. 135 RLVEVANKAGAKRLAELRSK 154 (155) Q Consensus 135 ~~~~~i~~~l~~~i~k~~~k 154 (155) ++.+.|.+.|+++..|..++ T Consensus 145 ~l~~~i~ka~~k~~~~~~~~ 164 (164) T protein:vir:43 145 EYEKGIDRAIKRAAKKAAQG 164 (164) T ss_pred HHHHHHHHHHHHHHhhhccC Confidence 55555555555555555555 No 142 >protein:vir:100075 Length: 140 # NCBI annotation: gp9 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945039;genbank:gi:38707899;genbank:GeneID:2744122 Probab=97.14 E-value=6.9e-06 Score=48.88 Aligned_cols=123 Identities=14% Similarity=0.019 Sum_probs=67.2 Q ss_pred CceeeeeccH-HHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHH-------HHhCCCCcchhhcceeeeecccccCCce Q lcl|NC_019933. 1 MSSKITSLDI-SGVLSALNDLR-DDSDSVSRTMAFESAAVVRDSA-------KAHVRSKTGRLKGAIYAVYVPEESTEVR 71 (155) Q Consensus 1 M~~~m~~~~l-~~L~~~l~~l~-~~~~~~~r~a~~~~a~~i~~ea-------k~~aP~~tG~Lr~sI~~~~~~~~~~~g~ 71 (155) |+++-- -.| +.|..+-++.. +..+++++.++......++..| +.+..+..+.-+..-... . T Consensus 4 ~~i~Gl-d~l~~~l~~L~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~tG~l~~sI~~~~~~~~~~~~~~---------~ 73 (140) T protein:vir:10 4 IQIIGL-ADLRADFEKLAKSQSTKALRRATVAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGLA---------T 73 (140) T ss_pred eeehhH-HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHHhccccccccccccceE---------E Confidence 776632 233 23444334443 3556666666666665555544 444433222111110000 0 Q ss_pred EEEEEEe-----cCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 
72 HVYAVSW-----NKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAK 146 (155) Q Consensus 72 ~~~~Vg~-----~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~ 146 (155) ..+.+++ +.+.++||+|+||||+.... . +| =+|=+.-.-+...+.+.+.+.+.|.+ T Consensus 74 ~g~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a----~---------PF------l~pA~~~~~~~~~~~~~~~~~~~l~k 134 (140) T protein:vir:10 74 AGVRVRTKGKADSPNNAFYWRFDEFGTQHMKA----Q---------PF------MRPAFDASIGEAEGAIRTELARAIDR 134 (140) T ss_pred eeeeeccccccCCCCccceeeeeccCCCCCCC----C---------cc------hhhhHHHHHHHHHHHHHHHHHHHHHH Confidence 0111111 24568999999999964321 0 01 35666777777788889999999999 Q ss_pred HHHHHh Q lcl|NC_019933. 147 RLAELR 152 (155) Q Consensus 147 ~i~k~~ 152 (155) .+.... T Consensus 135 ~~~~~~ 140 (140) T protein:vir:10 135 VLGGRR 140 (140) T ss_pred HhhccC Confidence 998777 No 143 >protein:vir:101563 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958111;genbank:gi:41057657;genbank:GeneID:2716820 Probab=97.13 E-value=8.3e-07 Score=53.91 Aligned_cols=95 Identities=8% Similarity=-0.046 Sum_probs=42.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecCCcc---------ccchhhhccccc Q lcl|NC_019933. 25 DSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNKKKA---------PHGHLVEYGHWR 95 (155) Q Consensus 25 ~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~~a---------~~~~~vEfGt~~ 95 (155) =++-|+.+.+... .+ .+ ..+.||+-.... .++..+|++.. T Consensus 1 m~v~r~~L~~~~~-------------------~l----------~~-~~V~VGi~~~a~y~d~~g~~~~~g~~~~~~~~- 49 (155) T protein:vir:10 1 MSVTRRGLTLPKD-------------------RY----------KS-MSVKAGVLAGATYPDESGKKLADGTILKKDPR- 49 (155) T ss_pred CcchHHHHHHHHH-------------------Hh----------hC-CeeEEeecCCCCCCccccchhhhhhhhccccc- Confidence 1111211111110 00 00 124455532210 11111222210 Q ss_pred cCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHH--HHHhccC Q lcl|NC_019933. 
96 TNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRL--AELRSKR 155 (155) Q Consensus 96 ~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i--~k~~~k~ 155 (155) .+-+-..+....|+|++++||+|||||+++.++++..+.+...+...+ +++|..= T Consensus 50 -----~G~pva~ia~~~e~G~~~IP~RPFlr~t~~~~~~~~~~~l~~~~~~~~~~~~~L~~~ 106 (155) T protein:vir:10 50 -----AGLPVAMIAMALNYGTSKLPARPFMEKTIADRSAEWIKGLTVMMTMGYDAEVAMGQI 106 (155) T ss_pred -----cCcchhhhhhhhhcCCCCCCCcchhHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHH Confidence 011112234456677789999999999999999998877766554332 1111110 No 144 >protein:vir:2026 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046769;genbank:gi:9630340;genbank:GeneID:1261511 Probab=97.09 E-value=1.1e-05 Score=47.78 Aligned_cols=119 Identities=14% Similarity=0.074 Sum_probs=63.0 Q ss_pred eeeccHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhC-----CC------------------------Ccchh Q lcl|NC_019933. 5 ITSLDISGVLSALNDLRDDS-DSVSRTMAFESAAVVRDSAKAHV-----RS------------------------KTGRL 54 (155) Q Consensus 5 m~~~~l~~L~~~l~~l~~~~-~~~~r~a~~~~a~~i~~eak~~a-----P~------------------------~tG~L 54 (155) |. +|+.|+..|..|-... -.-.++-++.-|..++...+.+. |. .+|.| T Consensus 1 ~~--~~~~l~~~L~~ll~~l~~~~~~~l~~~Ig~~l~~~~~~rf~~q~~PdG~~W~p~k~~~~~~k~g~~~~~l~~~~~l 78 (150) T protein:vir:20 1 MN--EFKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSVRKKTGRVKRKMFAKLIT 78 (150) T ss_pred Cc--hHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHHHhccCCCccccchhhh Confidence 33 4556666665553221 12234456666666666666542 31 24556 Q ss_pred hcceeeeecccccCCceEEEEEEec-CCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHH Q lcl|NC_019933. 55 KGAIYAVYVPEESTEVRHVYAVSWN-KKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVK 133 (155) Q Consensus 55 r~sI~~~~~~~~~~~g~~~~~Vg~~-~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~ 133 (155) .+||..... ...+.||+. .....|+...-||-...... 
..-.+.+||+|||-=.- ..+ T Consensus 79 ~~sl~~~~~-------~~~~~vg~~~Gs~~~yAa~HQfG~~~~~~~-------------~~~~~~iPaRp~LG~s~-~d~ 137 (150) T protein:vir:20 79 SRFLHIRAS-------PEQASMEFYGGKSPKIASVHQFGLSEENRK-------------DGKKIDYPARPLLGFTG-EDV 137 (150) T ss_pred hhhhheeec-------CcEEEEEeeCCcchhhhhhhhccccccccc-------------CCCceeccccccCCCCH-HHH Confidence 666644322 122345543 34567888889985321110 01135799999998663 344 Q ss_pred HHHHHHHHHHHHH Q lcl|NC_019933. 134 GRLVEVANKAGAK 146 (155) Q Consensus 134 ~~~~~~i~~~l~~ 146 (155) +++.+.|.+.|.+ T Consensus 138 ~~i~~~i~~~l~k 150 (150) T protein:vir:20 138 QMIEEIILAHLER 150 (150) T ss_pred HHHHHHHHHHHhC Confidence 5555555555555 No 145 >protein:vir:80362 Length: 140 # NCBI annotation: gp10, phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111089;genbank:gi:134288660;genbank:GeneID:4960609 Probab=97.08 E-value=1.3e-05 Score=47.30 Aligned_cols=125 Identities=13% Similarity=0.020 Sum_probs=64.3 Q ss_pred CceeeeeccHH-HHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHH-------HHhCCCC---cchhhcceeeeecccccC Q lcl|NC_019933. 1 MSSKITSLDIS-GVLSALNDLR-DDSDSVSRTMAFESAAVVRDSA-------KAHVRSK---TGRLKGAIYAVYVPEEST 68 (155) Q Consensus 1 M~~~m~~~~l~-~L~~~l~~l~-~~~~~~~r~a~~~~a~~i~~ea-------k~~aP~~---tG~Lr~sI~~~~~~~~~~ 68 (155) |+++-- -+|. .|..+-++.. +..+++++.++......++..| +.+..+. +......+.+. .... T Consensus 4 ~~i~Gl-d~l~~~l~~l~~~~~~k~~~~a~~~~a~~v~~~ak~~aP~~tG~l~~~i~~~~~~~~~~~~~~~~~-~~~~-- 79 (140) T protein:vir:80 4 IQIVGL-ADLLADFERLAKSQSTKALRRATVAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGLATAG-VRVR-- 79 (140) T ss_pred eeehhH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhceeeeccccccccceeeee-eecc-- Confidence 777643 2332 3333334443 3556777777766666666654 3333222 11111111111 0000 Q ss_pred CceEEEEEEecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 
69 EVRHVYAVSWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRL 148 (155) Q Consensus 69 ~g~~~~~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i 148 (155) .. ..++ ..+.++||+|+||||+.... . +| =+|=+.-.-+...+.+.+.+.+.|.+.| T Consensus 80 --~~-~~~~-~~~~~~y~~f~E~GT~~~~a-~------------PF------l~pA~~~~~~~~~~~~~~~~~~~l~k~~ 136 (140) T protein:vir:80 80 --TK-GKAD-SPSNAFYWRFDEFGTQHMKA-Q------------PF------MRPAFDASIGEAEGAIRTELARAIDQAL 136 (140) T ss_pred --cc-cccC-CCCCcceeeeeccCCCCCCC-C------------cc------hhhhHHHHHHHHHHHHHHHHHHHHHHHh Confidence 00 0011 23568899999999964321 0 00 2445555555566777888888888888 Q ss_pred HHHh Q lcl|NC_019933. 149 AELR 152 (155) Q Consensus 149 ~k~~ 152 (155) .... T Consensus 137 ~~~~ 140 (140) T protein:vir:80 137 GGRR 140 (140) T ss_pred hccC Confidence 7777 No 146 >protein:vir:6071 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878212;genbank:gi:33438911;genbank:GeneID:1457746 Probab=97.03 E-value=1.7e-05 Score=46.72 Aligned_cols=119 Identities=15% Similarity=0.075 Sum_probs=59.6 Q ss_pred eeeccHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHh-----CCC------------------------Ccchh Q lcl|NC_019933. 5 ITSLDISGVLSALNDLRD-DSDSVSRTMAFESAAVVRDSAKAH-----VRS------------------------KTGRL 54 (155) Q Consensus 5 m~~~~l~~L~~~l~~l~~-~~~~~~r~a~~~~a~~i~~eak~~-----aP~------------------------~tG~L 54 (155) |. +|+.|++.|..+-+ +.-...++-+++-|..++...+.+ .|. .+|.| T Consensus 1 ~~--~~~~l~~~L~~~l~~L~~~~~~~l~r~Ig~~l~~~~~~Rf~~q~~PdG~~W~p~~~~~~~~k~~~~~~~l~~~~~l 78 (150) T protein:vir:60 1 MN--EFKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSARKKTGRVKRKMFAKLIT 78 (150) T ss_pred Cc--hHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccChHHHHHhhcCCCccchhhhhh Confidence 33 45555555554422 111122444555666666666554 231 13345 Q ss_pred hcceeeeecccccCCceEEEEEEec-CCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHH Q lcl|NC_019933. 
55 KGAIYAVYVPEESTEVRHVYAVSWN-KKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVK 133 (155) Q Consensus 55 r~sI~~~~~~~~~~~g~~~~~Vg~~-~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~ 133 (155) ..||..... .+ .+.||+. .....|+....||-...... ...++.+||+|||-=.- ... T Consensus 79 ~~sl~~~~~----~~---~a~vg~~~Gt~~~yAaiHQfG~~~~~~~-------------~~~~~~iPaRp~LG~s~-~d~ 137 (150) T protein:vir:60 79 SRFLHIRAS----PE---QASMEFYGGKSPKIASVHQFGLSEENRK-------------DGKKIDYPARPLLGFTG-EDV 137 (150) T ss_pred cceeeeeee----Cc---EEEEEeeCCCchhhhhhhhccccccccC-------------CCCceecCCcccCCCCH-HHH Confidence 555543321 11 2234332 33467888889995322110 01235799999998663 345 Q ss_pred HHHHHHHHHHHHH Q lcl|NC_019933. 134 GRLVEVANKAGAK 146 (155) Q Consensus 134 ~~~~~~i~~~l~~ 146 (155) +++++.|.+.|.+ T Consensus 138 ~~i~~~i~~~l~r 150 (150) T protein:vir:60 138 QMIEEIILAHLDR 150 (150) T ss_pred HHHHHHHHHHHhC Confidence 5566656555555 No 147 >protein:vir:77650 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:YP_022741;genbank:gi:47835022;genbank:GeneID:2821447 Probab=96.97 E-value=1.3e-06 Score=52.94 Aligned_cols=95 Identities=11% Similarity=0.014 Sum_probs=40.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecCCcc-c--------cchhhhccccc Q lcl|NC_019933. 25 DSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNKKKA-P--------HGHLVEYGHWR 95 (155) Q Consensus 25 ~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~~a-~--------~~~~vEfGt~~ 95 (155) =++.|..+. .+.++.. + ..+.||+-.+.. + ++...|+|+.. 
T Consensus 1 m~~~r~~l~----~~~~~l~-------------------------~-~~v~VGi~~~a~y~d~~~~~~~~~~~~~~~~~~ 50 (155) T protein:vir:77 1 MSVTRRGLT----LPKDRYR-------------------------S-MSVKAGVLAGATYPDESGKKLADGSILKKDPRA 50 (155) T ss_pred CcchHHHHH----HHHHHHh-------------------------c-CceEEeecCCCCCccccchhhhhhhhccccccc Confidence 111111111 1111100 0 112344432210 0 11122322210 Q ss_pred cCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHH--HHhccC Q lcl|NC_019933. 96 TNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLA--ELRSKR 155 (155) Q Consensus 96 ~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~--k~~~k~ 155 (155) +-+-..+....|+|++++||+|||||+++.++++..+.+...+...++ ++|..= T Consensus 51 ------G~pva~ia~~~e~G~~~IP~RPFlr~t~~~~~~~~~~~l~~~~~~~~~~~~~L~~l 106 (155) T protein:vir:77 51 ------GLPVAMIAMALNYGTSKLPARPFMEKTIADRSAEWIKGLTVMMTMGYDAEVAMGQI 106 (155) T ss_pred ------cccHhhhhhhhhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHccCcHHHHHHHH Confidence 001112333456677889999999999999999988877665543221 111100 No 148 >protein:vir:5703 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839862;genbank:gi:30065717;genbank:GeneID:1260611 Probab=96.95 E-value=2.1e-05 Score=46.24 Aligned_cols=119 Identities=15% Similarity=0.087 Sum_probs=59.7 Q ss_pred eeeccHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHh-----CCC------------------------Ccchh Q lcl|NC_019933. 5 ITSLDISGVLSALNDLRD-DSDSVSRTMAFESAAVVRDSAKAH-----VRS------------------------KTGRL 54 (155) Q Consensus 5 m~~~~l~~L~~~l~~l~~-~~~~~~r~a~~~~a~~i~~eak~~-----aP~------------------------~tG~L 54 (155) |. +|+.|++.|..+-+ +.-...+..+++-|..++...+.+ .|. 
.+|.| T Consensus 1 m~--~~~~l~~~L~~~l~~L~~~~~~~l~~~Ig~~l~~~~~~rf~~q~~PdG~~W~p~k~~~~~~k~~~~~~~l~~~~~l 78 (150) T protein:vir:57 1 MN--EFKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSARKKTGRVKRKMFAKLIT 78 (150) T ss_pred Cc--hHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccChHHHHHhccCCCcccchhhhh Confidence 33 45555555554422 111223445566666666666554 231 13345 Q ss_pred hcceeeeecccccCCceEEEEEEec-CCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHH Q lcl|NC_019933. 55 KGAIYAVYVPEESTEVRHVYAVSWN-KKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVK 133 (155) Q Consensus 55 r~sI~~~~~~~~~~~g~~~~~Vg~~-~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~ 133 (155) .+||...... + .+.||+. .....|+....||-...... -...+.+||+|||-=.- ... T Consensus 79 ~~sl~~~~~~----~---~a~vg~~~G~~~~yAaiHQfG~~~r~~~-------------~~~~~~iPaRp~LG~s~-~d~ 137 (150) T protein:vir:57 79 SRFLHIRASP----E---QASMEFYGGKSPKIASVHQFGLSEETRK-------------DGKKIDYPARPLLGFTG-EDV 137 (150) T ss_pred ccceeeeeeC----c---EEEEEeecCCchhhhhhhhccccccccC-------------CCceeecCCcccCCCCH-HHH Confidence 5555433211 1 2234432 33467888889985322110 01235799999998663 344 Q ss_pred HHHHHHHHHHHHH Q lcl|NC_019933. 134 GRLVEVANKAGAK 146 (155) Q Consensus 134 ~~~~~~i~~~l~~ 146 (155) +++.+.|.+.|.+ T Consensus 138 ~~i~~~i~~~l~r 150 (150) T protein:vir:57 138 QMIEEIILAHLDR 150 (150) T ss_pred HHHHHHHHHHHhC Confidence 5555555555555 No 149 >protein:vir:94069 Length: 168 # NCBI annotation: putative RNA polymerase # Family: family:all:503 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453622;genbank:gi:84662658;genbank:GeneID:5142579 Probab=96.90 E-value=1.5e-06 Score=52.45 Aligned_cols=107 Identities=13% Similarity=0.023 Sum_probs=43.4 Q ss_pred eeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecCCccc Q lcl|NC_019933. 
5 ITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNKKKAP 84 (155) Q Consensus 5 m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~~a~ 84 (155) |+++-=.+|...+..+.+.....++-.+.+ -..+|.-+..--..+. .....++ | -+.+. T Consensus 1 ~~~~~~~g~~~~~~~~~~l~~~~v~vG~l~---------~a~yp~G~~~~~~~~~----~~~~~~~------g--~~va~ 59 (168) T protein:vir:94 1 MTTIARKGVKMPPHLEAQFQSGEVKAGVLS---------GSTYPQMTYTDQRTGK----QIEDARG------G--MPVAV 59 (168) T ss_pred CccccchhhhhhHHHHHhhhccceeeeccc---------cCcccccccchhhccc----ccccccc------c--ccHHH Confidence 443333333333332222222111111111 1112221111000000 0000000 0 02345 Q ss_pred cchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHH--HHhc-------cC Q lcl|NC_019933. 85 HGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLA--ELRS-------KR 155 (155) Q Consensus 85 ~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~--k~~~-------k~ 155 (155) ++.+.|||+ .++||+|||||+++.++++..+.+...++..++ .+|. +. T Consensus 60 Ia~~~E~G~-----------------------~~IP~RPFlr~t~~~~~~~~~~~~~~~~~~~~~~~~~L~~lG~~~~~~ 116 (168) T protein:vir:94 60 IAQALEYGH-----------------------GQNHPRPFMQQTYAAQYRAWSRDLTLTLKAGAAADTALRTVGQRMAED 116 (168) T ss_pred HHHHHhcCC-----------------------CCCCCchhhHHHHHHHHHHHHHHHHHHHhcCCCHHHHHHHHHHHHHHH Confidence 666777775 479999999999999998877766544432110 0000 00 No 150 >protein:vir:8106 Length: 150 # NCBI annotation: gp10 # Family: family:all:3937 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817687;genbank:gi:29566118;genbank:GeneID:1259312 Probab=96.82 E-value=2.4e-06 Score=51.42 Aligned_cols=138 Identities=14% Similarity=0.079 Sum_probs=64.1 Q ss_pred Cceeeeecc--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEE Q lcl|NC_019933. 
1 MSSKITSLD--ISGVLSALNDLRDDSDSVSRTMAFESAAVV-RDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVS 77 (155) Q Consensus 1 M~~~m~~~~--l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i-~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg 77 (155) |---++-+| +++|...|+.-.+. ...+.+=+..+ ..-|+++.||++|.++.|+.+. +++.+|+-. + T Consensus 1 mgNP~~KFGvS~~e~~K~irns~EV-----~~GiNdFMe~~A~~~aK~~SPV~~GeY~~S~~V~---~ka~NGRG~--~- 69 (150) T protein:vir:81 1 MGNPFEKFGVSDSELAKHIRNSAEV-----DAGINDFMENEAIPYAKSISPVDDGEYAASWAVM---KKAKNGRGV--F- 69 (150) T ss_pred CCCchhhhcCCHHHHHHhhccchhh-----hhhHHHHHHhhhhhhhhccCCcccchhHHHHHHH---hhcccCccc--c- Confidence 665555555 34566655543332 11122211111 1246899999999999999643 455565532 3 Q ss_pred ecCCccccchhhhccccccCCCcC-------CCCce-eeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 78 WNKKKAPHGHLVEYGHWRTNVVAE-------VDGKW-LFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLA 149 (155) Q Consensus 78 ~~~~~a~~~~~vEfGt~~~~~~~~-------~~~~~-~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~ 149 (155) .+++||+|||||||.....-.+ ++++. .-....+|--+ -|-.|--- .--.+++..-+.-.|.-.|. T Consensus 70 --G~~~~~AH~VEFGtgadkkqgrgkkgkrgkdgkrtveiddgefrrv-gpdtptka---qgiaqkvashfggslkggis 143 (150) T protein:vir:81 70 --GPKAWYAHFVEFGTGADKKQGRGKKGKRGKDGKRTVEIDDGEFRRV-GPDTPTKA---QGIAQKVASHFGGSLKGGIS 143 (150) T ss_pred --CccchhhhhhhhccccccccccccccccCcccceeeeecCccceec-CCCCchhh---hhHHHHHHHhcccccccccc Confidence 4789999999999853221111 11111 11111111000 02222111 11122333333344444555 Q ss_pred HHhccC Q lcl|NC_019933. 
150 ELRSKR 155 (155) Q Consensus 150 k~~~k~ 155 (155) +-|..- T Consensus 144 kslsdd 149 (150) T protein:vir:81 144 KSLSDD 149 (150) T ss_pred cccccc Confidence 544444 No 151 >protein:vir:105089 Length: 133 # NCBI annotation: Gp11 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006591;genbank:gi:46402097;genbank:GeneID:2777955 Probab=96.81 E-value=2.5e-05 Score=45.81 Aligned_cols=131 Identities=8% Similarity=0.006 Sum_probs=84.8 Q ss_pred eeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEe--cCCc Q lcl|NC_019933. 5 ITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSW--NKKK 82 (155) Q Consensus 5 m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~--~~~~ 82 (155) |..+.|+||+++++.|..+.+++-+++++.+...-.......+ +.+.-..... ........+.|.. ..+. T Consensus 1 M~~~~i~Gl~el~~~l~~L~~~~~~k~~~~Al~~~a~~i~~~a-------k~~ap~~~~~-~~~~~~~~I~v~~~~~~~~ 72 (133) T protein:vir:10 1 MIRMEVKGLDELERQLTALGEKVATKVLRDAGREALKVVEEDM-------KQHAGFDETS-TGQHMRDSIKIRSSTRKAQ 72 (133) T ss_pred CeeEeeehHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHH-------HHhCCCCCCc-chhhhhhcccccccccccC Confidence 9999999999999999999988888777777655555544444 3333211000 0000111112221 1111 Q ss_pred cccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_019933. 
83 APHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLAELRSKR 155 (155) Q Consensus 83 a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~k~~~k~ 155 (155) .....++..| ..++.++||.|.|| ...-+| =+|=+.-..++..+.+.+.+.+.|.+.|+|| T Consensus 73 ~~~~~~v~vg--------~~~~~~~y~~f~E~---GT~k~~-a~PF~~pA~~~~~~~~~~~~~~~~~~~l~K~ 133 (133) T protein:vir:10 73 GNAVVTLRVG--------PSKQHHMKVLAQEF---GTVKQV-ADPFIRPALDYNVQTVLRVLTVEIRNGIQNR 133 (133) T ss_pred ccceEEEEec--------CCCCccceEeeecc---CCCCCC-CCccchHHHHHhHHHHHHHHHHHHHHHhhcC Confidence 1111223333 23455678888887 777777 5788999999999999999999999999999 No 152 >protein:vir:96288 Length: 100 # NCBI annotation: ORF049 # Family: family:all:180 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240315;genbank:gi:66396010;genbank:GeneID:5133365 Probab=96.76 E-value=1.2e-05 Score=47.54 Aligned_cols=89 Identities=11% Similarity=0.087 Sum_probs=59.2 Q ss_pred Ccee--------eeec--cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCc Q lcl|NC_019933. 1 MSSK--------ITSL--DISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEV 70 (155) Q Consensus 1 M~~~--------m~~~--~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g 70 (155) |+.+ |.-+ |-+.|...++++++....-+++.+.+.|..|...|..++|+|+|+|++||.+.+. +| T Consensus 1 ~~~~~~~~~~~~makvkyG~~dmvk~~~~f~~~i~~~vk~~IakTa~~I~~~Avs~APVD~G~Lk~SI~~dyk-----~G 75 (100) T protein:vir:96 1 MKLNYYDLSRCHMAKVKYGADSMVVELDKFDKKIEEWVKKGIAKTTTKIYNTAVALAPVDLGFLEESIDFKYF-----DG 75 (100) T ss_pred CcccccccchhhhhhheechHHHHHHHhcchHHHHHHHHHHHHHHHHHHHhhHHhhccccccccceeeeeeee-----cC Confidence 3322 2322 5678899999998888888999999999999999999999999999999976432 34 Q ss_pred eEEEEEEecCCccccchhhh-ccccc Q lcl|NC_019933. 71 RHVYAVSWNKKKAPHGHLVE-YGHWR 95 (155) Q Consensus 71 ~~~~~Vg~~~~~a~~~~~vE-fGt~~ 95 (155) .....|.+...+|- .++-. .-|-. 
T Consensus 76 GltavI~vGAeYAI-krmsqllvtvi 100 (100) T protein:vir:96 76 GLSSVISVGADYAI-KRMSQLLVTVI 100 (100) T ss_pred CeeEEEecchhHHH-HHHHHHHhhcC Confidence 34444444444432 11100 00100 No 153 >protein:vir:100312 Length: 152 # NCBI annotation: tail synthesis protein S # Family: family:all:370 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655481;genbank:gi:109289949;genbank:GeneID:4157355 Probab=96.71 E-value=3e-05 Score=45.41 Aligned_cols=127 Identities=13% Similarity=0.075 Sum_probs=64.5 Q ss_pred eeeccHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHh-----CCC------------------Ccchhhcceee Q lcl|NC_019933. 5 ITSLDISGVLSALNDLRDDSD-SVSRTMAFESAAVVRDSAKAH-----VRS------------------KTGRLKGAIYA 60 (155) Q Consensus 5 m~~~~l~~L~~~l~~l~~~~~-~~~r~a~~~~a~~i~~eak~~-----aP~------------------~tG~Lr~sI~~ 60 (155) |+. +|..|++.|+.|-+... ..-+.-++.-|..++...+.+ .|. ++|.+-.++.. T Consensus 1 M~~-~~~~~~~~L~~ll~~L~~~~r~~l~~~Ig~~l~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~k~~~~~~~m~~~L~~ 79 (152) T protein:vir:10 1 MSE-PIEQVKTAFDSLLNNISKPRRRLMYQQIGRELARSQRRRIKAQQNPDGSAYEPRKKPKKGVKSKIKSGKMFDKITQ 79 (152) T ss_pred Cch-HHHHHHHHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchhhhhhhhhcccccchhHHHhhhh Confidence 443 46666666665533221 112345556666666666554 342 12222222211 Q ss_pred e-ecccccCCceEEEEEEecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHH Q lcl|NC_019933. 61 V-YVPEESTEVRHVYAVSWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEV 139 (155) Q Consensus 61 ~-~~~~~~~~g~~~~~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~ 139 (155) . .... ..+...+.||+......|+....||-..... .+ ...++.+||+|||-=.- ...+++.+. 
T Consensus 80 a~~l~~--~a~~~~~~Vg~~Gt~~~yAaiHQfG~~~r~~----~~--------~~~~v~iPaRp~LG~s~-~d~~~I~~~ 144 (152) T protein:vir:10 80 PRFMRL--RLESEGVSLGYEGGDAVIARIHQQGLIGRVR----KD--------WDLKVKYASRELLGFTD-DDLQMIEDY 144 (152) T ss_pred cceeee--eecCcEEEEEecCCchhhhhhhccCcccccc----CC--------CCcceeccccccCCCCH-HHHHHHHHH Confidence 0 0000 0112234566666667888888888532111 01 01146799999997663 455667777 Q ss_pred HHHHHHHH Q lcl|NC_019933. 140 ANKAGAKR 147 (155) Q Consensus 140 i~~~l~~~ 147 (155) |.++|..+ T Consensus 145 i~~~l~~a 152 (152) T protein:vir:10 145 MINILAGS 152 (152) T ss_pred HHHHHhcC Confidence 77777766 No 154 >protein:vir:79179 Length: 155 # NCBI annotation: gp39, phage virion morphogenesis protein # Family: family:all:370 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111070;genbank:gi:134288746;genbank:GeneID:4960698 Probab=96.68 E-value=2.2e-05 Score=46.07 Aligned_cols=127 Identities=12% Similarity=0.079 Sum_probs=65.0 Q ss_pred eeeccHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHh-----CCC------------------Ccchhhcceee Q lcl|NC_019933. 5 ITSLDISGVLSALNDLRDDSD-SVSRTMAFESAAVVRDSAKAH-----VRS------------------KTGRLKGAIYA 60 (155) Q Consensus 5 m~~~~l~~L~~~l~~l~~~~~-~~~r~a~~~~a~~i~~eak~~-----aP~------------------~tG~Lr~sI~~ 60 (155) |+. +|.+|++.|+.|-+... ..-+.-++.-|..++...+.+ .|. ++|.+++++-. T Consensus 1 m~~-~~~~l~~~l~~ll~~l~~~~~~~l~r~Ig~~l~~~t~~Rf~~q~~PDG~~W~prk~~~~~~~~~~~~g~~~~~~m~ 79 (155) T protein:vir:79 1 MTD-DLQALERWAGGLLAKLSPAARRQLLRELGRDLRRAQQSRVAAQRNPDGSAYEPRKVKAGGKRLREKAGRVKREAMF 79 (155) T ss_pred Cch-HHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchhhhhhhhhcccCcccchhhh Confidence 332 56677776666543221 122344566666666665554 341 24544444211 Q ss_pred eecc----cccCCceEEEEEEecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHH Q lcl|NC_019933. 
61 VYVP----EESTEVRHVYAVSWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRL 136 (155) Q Consensus 61 ~~~~----~~~~~g~~~~~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~ 136 (155) .... -+..-+...+.||+......|+...-||....... + .-.+.|||+|||-=.-+ ..+++ T Consensus 80 ~~l~~a~~l~~~~~~d~a~Vg~~Gs~~~yAaiHQfG~~~r~~~---~----------~~~v~iPaRp~LGls~~-d~~~I 145 (155) T protein:vir:79 80 RKLRTARYLRIDVDSTGLAIGFDERLSRIARVHQEGQKAPVEP---G----------GPLAQYPVRVVLGFSDA-DRELV 145 (155) T ss_pred hhhhhhheeeeeecCcEEEEEecCcchhhhhhhhcCCcccCCC---C----------CcccccccccccCCCHH-HHHHH Confidence 0000 00001112345665556677888899996422110 0 12357999999966643 45666 Q ss_pred HHHHHHHHHH Q lcl|NC_019933. 137 VEVANKAGAK 146 (155) Q Consensus 137 ~~~i~~~l~~ 146 (155) .+.+.++|.+ T Consensus 146 ~~~i~~~l~r 155 (155) T protein:vir:79 146 RDRLLRELTR 155 (155) T ss_pred HHHHHHHhhC Confidence 6666666666 No 155 >protein:vir:95260 Length: 160 # NCBI annotation: Phage conserved protein # Family: family:all:31735 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944893;genbank:gi:38707833;genbank:GeneID:2744046 Probab=96.67 E-value=2.1e-05 Score=46.21 Aligned_cols=83 Identities=17% Similarity=0.149 Sum_probs=35.8 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecC Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNK 80 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~ 80 (155) |--++..-|++.|.+.+++|. + ..+.||+-. 
T Consensus 1 ~~~~~~~~G~~~L~~~~k~l~----------------------~---------------------------~~V~VGi~~ 31 (160) T protein:vir:95 1 MVKRVIHPARAKLVGAMKNLQ----------------------T---------------------------ANAQVGYFQ 31 (160) T ss_pred CceeechHhHHHHHHHHHHHh----------------------C---------------------------CeeEEeecc Confidence 322333333333333333220 0 011222222 Q ss_pred Cc---------cccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHH----HHHHHHHHHHHHHHHH Q lcl|NC_019933. 81 KK---------APHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDS----VKGRLVEVANKAGAKR 147 (155) Q Consensus 81 ~~---------a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~----~~~~~~~~i~~~l~~~ 147 (155) .. +..+.|.|||| ++.|++||||+.|+. +.+.++......+... T Consensus 32 d~g~~~dG~sv~~vA~~~EfG~-----------------------~~iPaRPf~R~tfe~~~~~~~~~~~~~~~~~i~~~ 88 (160) T protein:vir:95 32 EQGQHSSGFSYPALMYLQEVIG-----------------------VPSASGKVYRRLFEITMMLNKQTLLEQTKKNLYKQ 88 (160) T ss_pred ccccCCCCccHHHHHhhhhcCc-----------------------ccCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11 22344566665 579999999999974 4444444444433333 Q ss_pred HHH-------Hhc----cC Q lcl|NC_019933. 148 LAE-------LRS----KR 155 (155) Q Consensus 148 i~k-------~~~----k~ 155 (155) +.. +|. +. T Consensus 89 ~~~g~~~~~~~LG~~~~~~ 107 (160) T protein:vir:95 89 LSSLNTDPSNTLEAFAKNA 107 (160) T ss_pred HhhcchhHHHHHHHHHHHH Confidence 321 111 11 No 156 >protein:vir:7993 Length: 108 # NCBI annotation: gp9 # Family: family:all:3937 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817347;genbank:gi:29565775;genbank:GeneID:1259013 Probab=96.62 E-value=1.2e-06 Score=53.02 Aligned_cols=106 Identities=25% Similarity=0.244 Sum_probs=54.4 Q ss_pred Cceeeeecc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCC-ceEEEEEEe Q lcl|NC_019933. 1 MSSKITSLD-ISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTE-VRHVYAVSW 78 (155) Q Consensus 1 M~~~m~~~~-l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~-g~~~~~Vg~ 78 (155) |...-+--. 
|..+--.+++|..+ --+..++.+=+.++.+.++++.||++|.+++|+.+. +++.+ |+- .|| T Consensus 1 ma~gpt~kNP~~KFGvs~~d~~K~--~EVn~GvNeFMdE~~~~~K~~SPV~~G~Y~~S~~V~---ers~NkGRG--~~G- 72 (108) T protein:vir:79 1 MANGPTRKNPLAKFGVRLDDFDKL--PEVNQGVNEFMDEVVDAWKNNSPVGTGAYRDSVQVT---ERSTNKGRG--KVG- 72 (108) T ss_pred CCCCcccccchhhhcCChhhhhhc--hhhhhhHHHHHHHHHHHHhhcCCCCchhhHHHHHHH---HhhhccCcc--ccC- Confidence 543322211 22222222222221 113455566666777889999999999999999653 33333 332 233 Q ss_pred cCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHH Q lcl|NC_019933. 79 NKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDS 131 (155) Q Consensus 79 ~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~ 131 (155) .++||+||||||+. |.....+..+ -|+-|=-.||+. T Consensus 73 --~~~~~AH~VEFGs~-hndeyapaqk--------------takqfggtay~d 108 (108) T protein:vir:79 73 --ATDPQAHLVEFGSA-HNDEYAPAQK--------------TAKQFGGTAYGD 108 (108) T ss_pred --Ccchhhhhhhhhcc-ccccccchhh--------------HHHhhcccccCC Confidence 67899999999984 2211111100 022233333332 No 157 >protein:vir:78607 Length: 155 # NCBI annotation: BcepNY3gp06 # Family: family:all:503 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294843;genbank:gi:149882906;genbank:GeneID:5291078 Probab=96.60 E-value=2.7e-06 Score=51.07 Aligned_cols=103 Identities=12% Similarity=0.145 Sum_probs=36.8 Q ss_pred eeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcch-hhcceeeeecccccCCceEEEEEEecCCcc Q lcl|NC_019933. 5 ITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGR-LKGAIYAVYVPEESTEVRHVYAVSWNKKKA 83 (155) Q Consensus 5 m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~-Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~~a 83 (155) |+. .-.+|...+++|.. +.++-+..+ + ...|..+|. +...... +... 
.-| -+.+ T Consensus 1 m~v-~~k~L~~~~~~l~~---~~v~VGi~~-------~--a~y~d~~~~~~~~~~~~----~~~~------~~g--~~va 55 (155) T protein:vir:78 1 MSV-TRRGLTLPKDRYRS---MSVKAGVLA-------G--ATYPDESGKKLADGTIL----TKDP------RAG--LPVA 55 (155) T ss_pred Ccc-hHHHHHHHHHHHhC---CeeEEeecC-------C--CCCCcccchhhhhhhhc----cccc------ccC--CcHH Confidence 111 11112222221110 000000000 0 001111110 0000000 0000 000 0123 Q ss_pred ccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHH--HHhccC Q lcl|NC_019933. 84 PHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLA--ELRSKR 155 (155) Q Consensus 84 ~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~--k~~~k~ 155 (155) .++.+.||| ++++||+|||||+++.++++..+.+...+...++ ++|..= T Consensus 56 ~ia~~~E~G-----------------------~~~IP~RPFlr~t~~~~~~~~~~~l~~~~~~~~~~~~~L~~~ 106 (155) T protein:vir:78 56 MIAMALNYG-----------------------TSKLPARPFMEKTITDRSAEWIKGLTVMMTMGYDAEVAMGQI 106 (155) T ss_pred HHHHhhhcC-----------------------CCCCCCcchhhHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHH Confidence 334445555 4689999999999999999888777655543221 111000 No 158 >protein:vir:4790 Length: 114 # NCBI annotation: putative minor capsid protein 3 # Family: family:all:899 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150170;swissprot:trembl:q94m41;genbank:gi:15088781;uniprot:Q94M41;genbank:GeneID:955992 Probab=96.58 E-value=5.6e-05 Score=43.88 Aligned_cols=113 Identities=12% Similarity=-0.004 Sum_probs=59.9 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecC Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNK 80 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~ 80 (155) |++++. ++|+.++..|. .+.+.++-..-++.|..+...-+|.++|.|++|..+.. ..+. 
|-+ T Consensus 1 M~~kVk-v~l~~~~~~l~------~~~l~r~Q~~~~~ev~~~~~~YVP~~~G~L~~S~~~~~-----~~~~----I~y-- 62 (114) T protein:vir:47 1 MNIAIK-VDLQKAKQKLS------NESMTRGKIAVASKILLDNEQYIPLRGGELRASGRIVG-----QGDA----VVY-- 62 (114) T ss_pred CceeEE-eehhHHHHHHH------HHHHHHHHHHHHHHHHHhhccCCcCccCccccceeeee-----CCcE----EEe-- Confidence 988888 57777776653 23334455566677777888899999999999975421 1222 222 Q ss_pred CccccchhhhccccccCCCcCC-CCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 81 KKAPHGHLVEYGHWRTNVVAEV-DGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAK 146 (155) Q Consensus 81 ~~a~~~~~vEfGt~~~~~~~~~-~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~ 146 (155) .+||+++.=||....+..... ++. -.++=|.| |.....++.++...+.+.= T Consensus 63 -~tPYAr~qyYg~~~~~~~~~~~~p~-------------~g~~W~er-aka~~~~~~~~~~~k~~g~ 114 (114) T protein:vir:47 63 -GTVYARAQFYGSNGIVTFRRYTTPG-------------TGKRWDQV-ATSKHAEEWARAFVKGMGL 114 (114) T ss_pred -cCchhhHhhhcccCCCCCCccCCCC-------------CcchhHHH-HHhhhhHHHHHHHHHhhCC Confidence 256777666664211111000 000 00121333 4444455444444333322 No 159 >protein:vir:6375 Length: 205 # NCBI annotation: hypothetical protein # Family: family:all:10491 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918988;genbank:gi:34610163;genbank:gi:91214209;genbank:GeneID:2559587 Probab=96.53 E-value=0.00018 Score=41.07 Aligned_cols=152 Identities=13% Similarity=0.022 Sum_probs=68.7 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHhCCCCcchhhcceeeeecccccC-C----- Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVR-----DSAKAHVRSKTGRLKGAIYAVYVPEEST-E----- 69 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~-----~eak~~aP~~tG~Lr~sI~~~~~~~~~~-~----- 69 (155) |++++...|++++.+.|+.|++.+.+++..|+.++|..-. +++..-+-...+.|++++..+..+..+. 
+ T Consensus 1 m~i~v~~~G~~~~~~~l~~l~~~~~~a~~~AIN~ta~~~~~~~A~~~i~~~vn~k~~yv~~~~Rlti~k~As~~~L~A~I 80 (205) T protein:vir:63 1 MSIEIVAEGLGEFRDYVDRLPDISQQAAMIAINQTAQRTALPLARTEIGEQVNFPDNYLKDDSRLGVTKKATRNDLEAVI 80 (205) T ss_pred CeeeeehhhHHHHHHHHHhcchhhhHHHHHHHHHHHHHhhHHHHHHhhhhccccchhhhccceeeEEEeecCCCCeeEEE Confidence 9999999999999999999999887777666666555443 2344444445667775432221110000 0 Q ss_pred ------------------------ceEEEEEEecCCccccchhhhccc--------cccCCC--cCCCCceeeeeecccc Q lcl|NC_019933. 70 ------------------------VRHVYAVSWNKKKAPHGHLVEYGH--------WRTNVV--AEVDGKWLFTKEKLAT 115 (155) Q Consensus 70 ------------------------g~~~~~Vg~~~~~a~~~~~vEfGt--------~~~~~~--~~~~~~~~~~~~~~~g 115 (155) +...+.|-....+-.-+-|+=-+. +.+.+. +...+.++... .| T Consensus 81 ~ar~rpt~LsRF~~p~~~~~~~r~~GVsV~Vk~G~ak~l~gaF~~~lk~g~~l~e~~~~vgva~R~~~g~~~~~~---~g 157 (205) T protein:vir:63 81 GARQRPTSLARFAEPGQTTKSTRKGGVSVVVKPGRTKQFKRGFLVRLRAGKTLTEDKYNLGLAVRLSPGETLHAT---DG 157 (205) T ss_pred ecCCCcceeeeccCCCccccccccCCeEEEEEcCCCeeccCceEEEeeccccccccccceEEEeeecCccccccc---cC Confidence 122222322222111111111110 000000 11112111100 01 Q ss_pred eeeeCC--ccchhhHHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_019933. 
116 PVHVPA--RSFLRPGYDSVKGRLVEVANKAGAKRLAELRSKR 155 (155) Q Consensus 116 t~~~pa--~PFlrPA~~~~~~~~~~~i~~~l~~~i~k~~~k~ 155 (155) ..+.|- .=+.-|+.+..-..+-+.|...+.+.+.+-..++ T Consensus 158 ~~k~~~~~k~LYGPSV~Qvf~~~~e~I~~~i~~~l~~~f~r~ 199 (205) T protein:vir:63 158 ATKLSNNVYLLYGPSVDQVFRTVADDITTEVLDALADEFLRQ 199 (205) T ss_pred ceecCCceEEEEcCcHHHHHhhhhhhhhHHHHHHHHHHHHHh Confidence 111111 1156677765555444444444444444333333 No 160 >protein:vir:1838 Length: 149 # NCBI annotation: O protein # Family: family:all:370 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052262;genbank:gi:9634069;genbank:GeneID:1262457 Probab=96.53 E-value=3.4e-05 Score=45.09 Aligned_cols=119 Identities=20% Similarity=0.136 Sum_probs=58.7 Q ss_pred eeeccHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHh-----CCC-------Cc-----------------chh Q lcl|NC_019933. 5 ITSLDISGVLSALNDLRD-DSDSVSRTMAFESAAVVRDSAKAH-----VRS-------KT-----------------GRL 54 (155) Q Consensus 5 m~~~~l~~L~~~l~~l~~-~~~~~~r~a~~~~a~~i~~eak~~-----aP~-------~t-----------------G~L 54 (155) |. +|+.+.+.|+.|-. +.-..-++.+++.|..++...+.+ .|. .. |.+ T Consensus 1 m~--~~~~~~~~l~~ll~~L~~~~~~~l~r~Ig~~l~~~t~~rf~~q~~PdG~~W~p~~~~~~~~~~g~~~~~~~~~l~~ 78 (149) T protein:vir:18 1 MS--ELTALQERLAGLIASLSPAARRKMAAEIAKKLRTSQQQRIKRQQAPDGTPYAARKRQPVRSKKGRIKREMFAKLRT 78 (149) T ss_pred Cc--hHHHHHHHHHHHHHhcCCchHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchhhhhhccCcccchhhhhhhh Confidence 43 45555555555422 111112445566666666666554 341 11 112 Q ss_pred hcceeeeecccccCCceEEEEEEecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHH Q lcl|NC_019933. 55 KGAIYAVYVPEESTEVRHVYAVSWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKG 134 (155) Q Consensus 55 r~sI~~~~~~~~~~~g~~~~~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~ 134 (155) .++|... .+.++..++.+| ....|+....||....... 
.+ -.+.+||+|||-=.- ..++ T Consensus 79 ~~~l~~~----~~~~~~~v~~~G---tn~~yAaiHQfG~~~r~~~---~~----------~~v~iPaRp~LG~s~-~d~~ 137 (149) T protein:vir:18 79 SRFMKAK----GSDSAAVVEFTG---KVQRMARVHQYGLKDRPNR---NS----------RDVQYEARPLLGFTR-DDEQ 137 (149) T ss_pred hhhhhee----ecCceeEEEecc---cchhhhhhhhccccccccC---CC----------ccccccccccCCCCH-HHHH Confidence 3333221 112333333344 4456888889996422211 11 135799999997553 3455 Q ss_pred HHHHHHHHHHHH Q lcl|NC_019933. 135 RLVEVANKAGAK 146 (155) Q Consensus 135 ~~~~~i~~~l~~ 146 (155) ++++.|.+.|.+ T Consensus 138 ~I~~~i~~~l~~ 149 (149) T protein:vir:18 138 MIEDVIISHLGK 149 (149) T ss_pred HHHHHHHHHHhC Confidence 566666555555 No 161 >protein:vir:106728 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944315;genbank:gi:38638614;genbank:GeneID:2657357 Probab=96.50 E-value=3.7e-06 Score=50.36 Aligned_cols=103 Identities=12% Similarity=0.144 Sum_probs=37.1 Q ss_pred eeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcch-hhcceeeeecccccCCceEEEEEEecCCcc Q lcl|NC_019933. 5 ITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGR-LKGAIYAVYVPEESTEVRHVYAVSWNKKKA 83 (155) Q Consensus 5 m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~-Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~~a 83 (155) |+. .-.+|...+++|.. +.++-+..+ + ...|..+|. +...... +... .-| -+.+ T Consensus 1 m~v-~~k~L~~~~~~l~~---~~v~VGi~~-------~--a~y~d~~~~~~~~~~~~----~~~~------~~g--~~va 55 (155) T protein:vir:10 1 MSV-TRRGLTLPKDRYRS---MSVKAGVLA-------G--ATYPDESGKKLADGTIL----TKDP------RAG--LPVA 55 (155) T ss_pred Ccc-hHHHHHHHHHHHhC---CeeEEeecC-------C--CCCccccchhhhhhhhc----cccc------ccC--CcHH Confidence 111 11112222221110 000000000 0 001111110 0000000 0000 000 0123 Q ss_pred ccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHH--HHhccC Q lcl|NC_019933. 
84 PHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLA--ELRSKR 155 (155) Q Consensus 84 ~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~--k~~~k~ 155 (155) .++.+.||| ++++||+|||||+++.++++..+.+...+...++ ++|..= T Consensus 56 ~ia~~~E~G-----------------------~~~IP~RPFlr~t~~~~~~~~~~~l~~~~~~~~~~~~~L~~l 106 (155) T protein:vir:10 56 MIAMALNYG-----------------------TSKLPARPFMEKTIADRSAEWIKGLTVMMTMGYDAEVAMGQI 106 (155) T ss_pred HHHHHHhcC-----------------------CCCCCCcchhHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHH Confidence 344445555 4679999999999999999988777665543221 111000 No 162 >protein:vir:2740 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695113;genbank:gi:23455882;genbank:GeneID:955595 Probab=96.49 E-value=4.5e-05 Score=44.43 Aligned_cols=111 Identities=9% Similarity=0.113 Sum_probs=74.6 Q ss_pred eeeccHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecCCcc Q lcl|NC_019933. 5 ITSLDISGVLSALNDLRDDS-DSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNKKKA 83 (155) Q Consensus 5 m~~~~l~~L~~~l~~l~~~~-~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~~a 83 (155) |..+.|+||+++++.|.+.. ...+++++++++..+.++++++++...+. .+++....+.+.+..+.+ T Consensus 1 Ma~i~~~Gld~l~~~L~~~~~~~~v~~~~~~~~~~~~~~~~~~a~~~~p~------------~TG~Lr~sI~~~~~~~~~ 68 (114) T protein:vir:27 1 MATIEFEGLDEMAQSLLKNASPEKRSKVLRKYGSKLKEAAVNRAQFNKGY------------STGATRRSITLQVESDKA 68 (114) T ss_pred CeeeeeehHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcccCCCC------------CchhhhhceeeeecCCee Confidence 99999999999999999985 46789999999999999999999866653 222332223333322221 Q ss_pred ccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccch--hhHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_019933. 84 PHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFL--RPGYDSVKGRLVEVANKAGAKRLAELRSK 154 (155) Q Consensus 84 ~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFl--rPA~~~~~~~~~~~i~~~l~~~i~k~~~k 154 (155) +.|. 
...|+.++|| ..-+| +|=+.- +.+..+..+.+.|.++++- T Consensus 69 ------~V~~-----------~~~Ya~~vEf------GT~km~a~Pfl~P----A~~~~~~~~~~~l~~l~k~ 114 (114) T protein:vir:27 69 ------TVEA-----------LTSYSGYLEV------GTRKMEAQPFMKP----ALDEVAPKMVEELAKWDET 114 (114) T ss_pred ------EecC-----------CCCccceecc------cccccCCCCchhh----hHHHHHHHHHHHHHHHhcC Confidence 1121 1247777777 44455 344543 4456666677777777777 No 163 >protein:vir:4906 Length: 114 # NCBI annotation: gp114 # Family: family:all:180 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056684;genbank:gi:9635019;genbank:GeneID:1262668 Probab=96.49 E-value=4.5e-05 Score=44.43 Aligned_cols=111 Identities=9% Similarity=0.113 Sum_probs=74.6 Q ss_pred eeeccHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecCCcc Q lcl|NC_019933. 5 ITSLDISGVLSALNDLRDDS-DSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNKKKA 83 (155) Q Consensus 5 m~~~~l~~L~~~l~~l~~~~-~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~~a 83 (155) |..+.|+||+++++.|.+.. ...+++++++++..+.++++++++...+. .+++....+.+.+..+.+ T Consensus 1 Ma~i~~~Gld~l~~~L~~~~~~~~v~~~~~~~~~~~~~~~~~~a~~~~p~------------~TG~Lr~sI~~~~~~~~~ 68 (114) T protein:vir:49 1 MATIEFEGLDEMAQSLLKNASPEKRSKVLRKYGSKLKEAAVNRAQFNKGY------------STGATRRSITLQVESDKA 68 (114) T ss_pred CeeeeeehHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcccCCCC------------CchhhhhceeeeecCCee Confidence 99999999999999999985 46789999999999999999999866653 222332223333322221 Q ss_pred ccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccch--hhHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_019933. 84 PHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFL--RPGYDSVKGRLVEVANKAGAKRLAELRSK 154 (155) Q Consensus 84 ~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFl--rPA~~~~~~~~~~~i~~~l~~~i~k~~~k 154 (155) +.|. 
...|+.++|| ..-+| +|=+.- +.+..+..+.+.|.++++- T Consensus 69 ------~V~~-----------~~~Ya~~vEf------GT~km~a~Pfl~P----A~~~~~~~~~~~l~~l~k~ 114 (114) T protein:vir:49 69 ------TVEA-----------LTSYSGYLEV------GTRKMEAQPFMKP----ALDEVAPKMVEELAKWDET 114 (114) T ss_pred ------EecC-----------CCCccceecc------cccccCCCCchhh----hHHHHHHHHHHHHHHHhcC Confidence 1121 1247777777 44455 344543 4456666677777777777 No 164 >protein:vir:96763 Length: 177 # NCBI annotation: putative phage-related protein # Family: family:all:1091 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039824;genbank:gi:126010915;genbank:GeneID:5076273 Probab=96.22 E-value=0.00022 Score=40.67 Aligned_cols=147 Identities=7% Similarity=-0.015 Sum_probs=84.9 Q ss_pred Cceeeeecc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC----CcchhhcceeeeecccccCCceEEEE Q lcl|NC_019933. 1 MSSKITSLD-ISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRS----KTGRLKGAIYAVYVPEESTEVRHVYA 75 (155) Q Consensus 1 M~~~m~~~~-l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~----~tG~Lr~sI~~~~~~~~~~~g~~~~~ 75 (155) |..+|+.-+ ++.+...|..++....+++..|+..+|.-++.++...+.. ....+++.+.+... +.++...+. T Consensus 5 ~~l~idv~~~l~~i~~~l~~~~~~~~~A~~rAlNrta~~~rt~~~r~v~~~~~i~~k~ir~r~~~~~a---~~~~~~~i~ 81 (177) T protein:vir:96 5 FEMKIDVSREAEDIAAMVAATTKQLELAAQRAMTKAGQWLRTHSVRELGQQLGIKQEPLKKRFRVYPQ---RQKGEVRFW 81 (177) T ss_pred ceeEEehhHHHHHHHHHHhhcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHhhheeecc---CCCcEEEEE Confidence 555555433 5556666666677778888899999888888877665543 45678888865422 223333333 Q ss_pred EEecCCccccchhhhccccccCCC---------------cCCCCce-eeeee----cccceeeeCCccchhhHHHHHHHH Q lcl|NC_019933. 76 VSWNKKKAPHGHLVEYGHWRTNVV---------------AEVDGKW-LFTKE----KLATPVHVPARSFLRPGYDSVKGR 135 (155) Q Consensus 76 Vg~~~~~a~~~~~vEfGt~~~~~~---------------~~~~~~~-~~~~~----~~~gt~~~pa~PFlrPA~~~~~~~ 135 (155) ++.++ -+. .-||+.+.... 
...+|.+ ++.+- .+---+++|--|=+..+++...++ T Consensus 82 ~~~~~--i~l---~~~~~~r~t~~Gv~~g~~~~~gaFia~~~~g~~~Vf~R~gk~R~PI~~~~~pi~~~~~~~~e~~~~~ 156 (177) T protein:vir:96 82 VGLDP--IGV---YRLGTPKVTQKGVKVNRNEYDGAFISPMKSNYPLVFKRRGKERLPIDLVDEDIDEPAMEVVERWERR 156 (177) T ss_pred Eeccc--eeh---hhcccCCCCccceEEeeEEcCCceeccCCCCCceEEEEecCCccceEEEEcCchHHHHHHHHHHHHH Confidence 33221 111 12333211100 0011111 11110 001123456555567889888899 Q ss_pred HHHHHHHHHHHHHHHHhccC Q lcl|NC_019933. 136 LVEVANKAGAKRLAELRSKR 155 (155) Q Consensus 136 ~~~~i~~~l~~~i~k~~~k~ 155 (155) +.+.+...|.++|..+|+++ T Consensus 157 ~~~~~~~~l~~Ei~~~L~g~ 176 (177) T protein:vir:96 157 VFQRFKELFEQEARAIINGH 176 (177) T ss_pred HHHHHHHHHHHHHHHHhccC Confidence 99999999999999999999 No 165 >protein:vir:101508 Length: 120 # NCBI annotation: gp21 # Family: family:all:2713 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655400;genbank:gi:109522588;genbank:GeneID:4157580 Probab=96.19 E-value=0.00022 Score=40.68 Aligned_cols=118 Identities=14% Similarity=0.142 Sum_probs=73.2 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--CCcchhhcceeeeecccccCCceEEEEEEe Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVR--SKTGRLKGAIYAVYVPEESTEVRHVYAVSW 78 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP--~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~ 78 (155) |.--=-.++.++|.+.+.++......++.-=+.-.|.....+||.||| .+||+-|++|...... .+.-.+.|.. T Consensus 1 ~~~~~f~~~~~~l~~~i~~~~~k~~~~~~~~~d~~a~~le~~aK~nApW~DRTg~ARq~i~~~~~~----~~~~~~~Iyl 76 (120) T protein:vir:10 1 MAKIEFKFKDIELRRGVEDMEAKVDRAMKATSNYHAVEGTAHMKEHAPWTDRTGAARAGLHAVAST----PQPDRYEIVF 76 (120) T ss_pred CceEEEEecHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccchhhhhhhcccccc----CCCceEEEEE Confidence 552222256778888999887666666666666788889999999999 5799999999643221 1222233333 Q ss_pred cCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019933. 
79 NKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLAELR 152 (155) Q Consensus 79 ~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~k~~ 152 (155) ..+ -.|+-|+|+-... -.--|+|..+.-.+++++-+ +.-|.++. T Consensus 77 sh~-veYG~~LEla~~~-------------------------kyaIl~PTi~~~~~~il~g~----~~ll~~l~ 120 (120) T protein:vir:10 77 AHT-VHYGIWLEIANSG-------------------------RYEIIMPTVHHEGKLMAQRL----RGLLGRLR 120 (120) T ss_pred ecC-eeecceEEeeCCC-------------------------CcccccchHHHHhHHHHHHH----HHHhhhcC Confidence 332 4577778843211 11147788877777776665 33444444 No 166 >protein:vir:1386 Length: 149 # NCBI annotation: Gp9 protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612838;genbank:gi:20065972;genbank:GeneID:935787 Probab=96.11 E-value=8.2e-05 Score=42.98 Aligned_cols=141 Identities=9% Similarity=0.046 Sum_probs=85.6 Q ss_pred eee---ccHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccC-CceEEEEEEec Q lcl|NC_019933. 5 ITS---LDISGVLSALNDLRDDSD-SVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEEST-EVRHVYAVSWN 79 (155) Q Consensus 5 m~~---~~l~~L~~~l~~l~~~~~-~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~-~g~~~~~Vg~~ 79 (155) |+. +.|+||+++++.|..+.. +..+++++.+...-...++..+-..+-.+.++-.......+.+ .....+.++ + T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~~k~~~~Al~~ga~~v~~~~k~~aP~~~~~~~~~~~~~~~~~~~~d~i~~~-~ 79 (149) T protein:vir:13 1 MSDGWEIKFEGLDDLIKTFEQLGTEKENEDVEKSILKECGDLAKKTVAPLIHISDDNSKSGRKGSRPPGHAANNIPEP-K 79 (149) T ss_pred CCceeEEEeecHHHHHHHHHhcccHHHHHHHHHHHHHHHHHHHHHHHHHhCCccCCccccccccccccchhhhcceec-c Confidence 985 578999999999999995 6778888888877777766665444443322211000000000 001111111 1 Q ss_pred CCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_019933. 
80 KKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLAELRSKR 155 (155) Q Consensus 80 ~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~k~~~k~ 155 (155) ....-...++.-|.. ...+.+++||+|.| ....-+| =+|=+....++..+.+.+.+.+.|.+++++. T Consensus 80 ~~~~~g~~~~~VG~~-----~~~~~~~~y~~f~E---~GT~k~~-a~pF~~pa~~~~~~~~~~~~~~~l~k~i~~~ 146 (149) T protein:vir:13 80 IRKKKGNLQCVVGWE-----KSDNTPFYYMKMEE---WGTSERP-PHHAFGKTNKILKRVYDNIAQKKYDNFVKEK 146 (149) T ss_pred cccccceeEEEeecc-----CCCCCccceeeeec---cCccCCC-CCccchHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 011111112333321 11234568888765 5566666 4688999999999999999999999999999 No 167 >protein:vir:100243 Length: 140 # NCBI annotation: gp72 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355408;genbank:gi:77864698;genbank:GeneID:3725965 Probab=96.09 E-value=0.00014 Score=41.77 Aligned_cols=134 Identities=11% Similarity=0.041 Sum_probs=83.4 Q ss_pred eeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecCCccc Q lcl|NC_019933. 5 ITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNKKKAP 84 (155) Q Consensus 5 m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~~a~ 84 (155) |+.+.|+||+++++.|..+.+++.+++++.+...-.+.....+- ..+- ..++.....+.+.+..... T Consensus 1 Ma~~~i~Gld~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak-------~~ap-----~~tG~l~~sI~~~~~~~~~- 67 (140) T protein:vir:10 1 MSSVQILGLADLQADFLKLAKAQSTKALRRATVAGANVIRDEAR-------ARAP-----KKTGKLKRNIVTAALKQKD- 67 (140) T ss_pred CceeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HhCC-----CChhhHHHhceeccccccc- Confidence 99999999999999999999999888988888877776666543 2221 1222222222222111110 Q ss_pred cchhhhccccc-cCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_019933. 
85 HGHLVEYGHWR-TNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLAELRSKR 155 (155) Q Consensus 85 ~~~~vEfGt~~-~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~k~~~k~ 155 (155) -...+.+|... ........+..+||+|.| .....+|= +|=+.-..++..+.+.+.+.+.|.+.+++- T Consensus 68 ~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E---~GT~~~~a-~PFl~pA~~~~~~~~~~~~~~~~~~~l~k~ 135 (140) T protein:vir:10 68 SPGIATAGVRVRTKGKADSPNNAFYWRFVE---LGTQFMKA-EPFMRPAFDASIAQAEGAIRTEIARAIDQV 135 (140) T ss_pred ccceeEEeeccccccccCCCCcccccceec---cCcCCCCC-CcchhhhHHHHHHHHHHHHHHHHHHHHHHH Confidence 00011122111 111122334567777655 55566665 677788888888889999999999999888 No 168 >protein:vir:1437 Length: 140 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536366;genbank:gi:17975171;genbank:GeneID:929147 Probab=96.08 E-value=0.00011 Score=42.20 Aligned_cols=128 Identities=13% Similarity=-0.005 Sum_probs=67.9 Q ss_pred CceeeeeccH-HHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHH-------HHhCCCCcchhhcceeeeecccccCCce Q lcl|NC_019933. 1 MSSKITSLDI-SGVLSALNDLRD-DSDSVSRTMAFESAAVVRDSA-------KAHVRSKTGRLKGAIYAVYVPEESTEVR 71 (155) Q Consensus 1 M~~~m~~~~l-~~L~~~l~~l~~-~~~~~~r~a~~~~a~~i~~ea-------k~~aP~~tG~Lr~sI~~~~~~~~~~~g~ 71 (155) |+++-- -+| +.|..+-++..+ ..+++++.++......++..| +.+.....+.-.............+... T Consensus 4 ~~i~Gl-d~l~~~l~~l~~~~~~~~~~~al~~~a~~v~~~ak~~aP~~tG~l~~sI~~~~~~~~~~~~~~~vg~~~~~~~ 82 (140) T protein:vir:14 4 IQIIGL-ADLRADFEKLAKSQSAKALRRATLAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGLATAGVRVRTKG 82 (140) T ss_pred eeehhH-HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHhhcccccccccccceeEEeeeeecccc Confidence 777732 233 234443344433 456777777666666666644 4444443322222111110000000000 Q ss_pred EEEEEEecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 
72 HVYAVSWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLAEL 151 (155) Q Consensus 72 ~~~~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~k~ 151 (155) .. -..+.++||||+||||+.... . .| =+|=+.-.-+...+.+.+.+.+.|.+.+... T Consensus 83 ---~~-~~~~~~~y~~f~E~GT~~~~a----~---------pF------l~pa~~~~~~~~~~~~~~~~~~~l~k~~~~~ 139 (140) T protein:vir:14 83 ---KA-DSPNNAFYWRFDEFGTQHMKA----Q---------PF------MRPAFDASIGEAEGAIRTELARAIDRVLGGR 139 (140) T ss_pred ---cc-CCCCccceeeeeccccCCCCC----C---------cc------hhHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 01 134568999999999964321 0 00 2555666666667788888888889888877 Q ss_pred h Q lcl|NC_019933. 152 R 152 (155) Q Consensus 152 ~ 152 (155) . T Consensus 140 ~ 140 (140) T protein:vir:14 140 R 140 (140) T ss_pred C Confidence 7 No 169 >protein:vir:1581 Length: 116 # NCBI annotation: minor capsid protein # Family: family:all:899 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695163;swissprot:trembl:o03933;genbank:gi:23455806;uniprot:O03933;genbank:GeneID:955512 Probab=95.94 E-value=0.00015 Score=41.57 Aligned_cols=114 Identities=11% Similarity=0.051 Sum_probs=58.4 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecC Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNK 80 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~ 80 (155) |++++. +++++++..|. .+.++++-..-++.|..+...-+|.+||.|..|-..... 
++.|.++ ++ T Consensus 1 M~ikVk-v~l~~~~~~~~------~~~~~r~Q~~l~~qv~~~m~~YVP~~tg~~~ls~~~~~~---~~~~~I~----y~- 65 (116) T protein:vir:15 1 MAFRIN-VDLDGFMDQTS------LDNVKRGQYALVNQAMYDMEQFVPKDRPEEPLRQSVHAT---SDGSEIT----YS- 65 (116) T ss_pred CCceEE-eehhHhhhhhh------HHHHHHHHHHHHHHHHHhhhccCCcccCCcccccceeee---cCCceEE----ec- Confidence 998888 56777776653 234444555666677777888999999885544322111 1122222 23 Q ss_pred Cccccchhhhcccc--ccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 81 KKAPHGHLVEYGHW--RTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGA 145 (155) Q Consensus 81 ~~a~~~~~vEfGt~--~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~ 145 (155) +||++..=||.. .+...+-.++ .-.++=|.| |-....+..++.+.+.++ T Consensus 66 --tPYAr~qyYg~~~~~~~~~~~t~p-------------~ag~~W~er-aK~~h~~~w~~~~~k~~~ 116 (116) T protein:vir:15 66 --TPYAKAQFYGIINDKYPVHNYTTP-------------GTTKRWDLK-AKSMFMSSWIDTFTKGMK 116 (116) T ss_pred --CchhHHHhcccccCCCCcccccCC-------------CCCcchhHH-HHhhhHHHHHHHHHHhcC Confidence 456665544431 1110000000 011222444 555555555555555555 No 170 >protein:vir:79115 Length: 148 # NCBI annotation: tail completion protein gpS # Family: family:all:370 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165266;genbank:gi:145708091;genbank:GeneID:5247126 Probab=95.89 E-value=0.00022 Score=40.66 Aligned_cols=119 Identities=13% Similarity=0.082 Sum_probs=59.7 Q ss_pred eeeccHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHh-----CCC-----------------------Ccchhh Q lcl|NC_019933. 5 ITSLDISGVLSALNDLRDDSD-SVSRTMAFESAAVVRDSAKAH-----VRS-----------------------KTGRLK 55 (155) Q Consensus 5 m~~~~l~~L~~~l~~l~~~~~-~~~r~a~~~~a~~i~~eak~~-----aP~-----------------------~tG~Lr 55 (155) |+ +|++|++.|..|-+... ..-++.+++-|..++...+++ .|. +++.+. 
T Consensus 1 m~--~~~~l~~~L~~ll~~l~~~~~~~l~r~Ig~~l~~st~~Rf~~q~~PDG~~W~p~s~~~~~~~g~~~~~~~~~l~~~ 78 (148) T protein:vir:79 1 MS--ESRELEAWLAGMLTKLDAPARRMLARAVAAELRRRQAARIAEQRNPDGSPYVPRKPQLRHRAGRIRRAMFMRLRLA 78 (148) T ss_pred Cc--cHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCcCcccchHHHhhcccccccccchhhhh Confidence 44 46666666665532221 112344555666666655544 342 112334 Q ss_pred cceeeeecccccCCceEEEEEEecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHH Q lcl|NC_019933. 56 GAIYAVYVPEESTEVRHVYAVSWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGR 135 (155) Q Consensus 56 ~sI~~~~~~~~~~~g~~~~~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~ 135 (155) .+|.... +.++. .||+......|+...-||-..... + -.-++.|||+|||-=.- ...++ T Consensus 79 ~~l~~~~----~~~~~---~v~~~Gt~~~yAaiHQfG~~~r~~-----~--------~~~~v~iPaRp~LG~s~-~d~~~ 137 (148) T protein:vir:79 79 RYMKTQA----DANTA---VVTFAGNAQRIATVHQFGLRDRVN-----K--------AGLTAQYPARELLGMDG-VDMEH 137 (148) T ss_pred hheeeee----eCCee---eEEeeccchhhhhhhhcCcccccc-----C--------CCCccccCcccccCCCH-HHHHH Confidence 4443221 12222 343333445677778888432110 0 01245799999997553 35556 Q ss_pred HHHHHHHHHHH Q lcl|NC_019933. 136 LVEVANKAGAK 146 (155) Q Consensus 136 ~~~~i~~~l~~ 146 (155) +++.|.++|.- T Consensus 138 i~~~i~~~l~~ 148 (148) T protein:vir:79 138 ITNLLLLHLGA 148 (148) T ss_pred HHHHHHHHhcC Confidence 66666666655 No 171 >protein:vir:93617 Length: 148 # NCBI annotation: putative structural component # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449299;genbank:gi:157166047;interpro:IPR010064;interpro:IPR011693;uniprot:Q6H9U2;genbank:GeneID:5580439 Probab=95.81 E-value=0.00031 Score=39.86 Aligned_cols=129 Identities=14% Similarity=0.057 Sum_probs=82.0 Q ss_pred Ccee-eeeccHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHH-------HHhC-----CCCcchhhcceeeeecccc Q lcl|NC_019933. 
1 MSSK-ITSLDISGVLSALNDLR-DDSDSVSRTMAFESAAVVRDSA-------KAHV-----RSKTGRLKGAIYAVYVPEE 66 (155) Q Consensus 1 M~~~-m~~~~l~~L~~~l~~l~-~~~~~~~r~a~~~~a~~i~~ea-------k~~a-----P~~tG~Lr~sI~~~~~~~~ 66 (155) |+++ +..+. ..|.++-++.. ...+++++.++.-....++..| +.+. ...+|+++.+|........ T Consensus 6 ~~i~Gldel~-~~l~~L~~~~~~~~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~g~~~~~v~~~~~~~~ 84 (148) T protein:vir:93 6 LDFSGLEDIS-RDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRRSRDGGMESGVHIRGVNPD 84 (148) T ss_pred eeehhHHHHH-HHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcchhhhhceeccccccCCceeeeeeecccccc Confidence 7776 44331 23333334443 3556677766666666666554 2222 2357889999877654444 Q ss_pred cCCceEEEEEEecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 67 STEVRHVYAVSWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAK 146 (155) Q Consensus 67 ~~~g~~~~~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~ 146 (155) ++... ...++++...++||||+||||+.... .+ | -+|=+.=.-+...+.+.+.+.++|.+ T Consensus 85 ~~~~~-~~~~~~~~~~~~y~~f~E~GT~~~pa----~P---------F------l~pA~~~~k~~~~~~~~~~~~~~i~k 144 (148) T protein:vir:93 85 TGNSD-NTMKADNPRNAFYWRFVEMGTVNMPP----HP---------F------VRPAFDVRSEQAAQVAIARMNRAIDE 144 (148) T ss_pred ccccc-ceeecCCCCCcceeeeeccCCCCCCC----Cc---------c------hhHHHHHhHHHHHHHHHHHHHHHHHH Confidence 33322 22345677789999999999974321 11 1 46777777778888899999999999 Q ss_pred HHHH Q lcl|NC_019933. 
147 RLAE 150 (155) Q Consensus 147 ~i~k 150 (155) .|.| T Consensus 145 ~~~k 148 (148) T protein:vir:93 145 VLRR 148 (148) T ss_pred HhcC Confidence 9999 No 172 >protein:vir:105773 Length: 131 # NCBI annotation: gp14 # Family: family:all:10996 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224152;genbank:gi:62362227;genbank:GeneID:3342526 Probab=95.77 E-value=0.00016 Score=41.34 Aligned_cols=129 Identities=12% Similarity=0.073 Sum_probs=72.8 Q ss_pred eeeccHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecCCcc Q lcl|NC_019933. 5 ITSLDISGVLSALNDLR-DDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNKKKA 83 (155) Q Consensus 5 m~~~~l~~L~~~l~~l~-~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~~a 83 (155) |..-|+.....-|+++= +....-+-+||..+..++...|-...|+||+.|=+|=..... ....+.+.+||-. | T Consensus 1 ikV~Gi~~~~~nl~~~i~~I~~~K~~Ral~~al~~~~~~AA~~TPIDTSTLiNSQfrei~---~ngtritGRVGYS---A 74 (131) T protein:vir:10 1 MPVKGIKRIQMNTRRVLSDIAGIRTEKVLYLVMNAGANHAAVITPVKSSTLINSQYKKLE---PIPSGMIGRVGYT---A 74 (131) T ss_pred CCcchHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHhhhhhccccchhhhccccceeee---ccCceeEEeeccc---e Confidence 55556666666666653 333344556777777777778889999999999999765432 2233566777754 4 Q ss_pred ccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHH-HHHHHH Q lcl|NC_019933. 84 PHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVA-NKAGAK 146 (155) Q Consensus 84 ~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i-~~~l~~ 146 (155) .|+-||+--.....+..+++++.-||... 
.-.-||+-+|+.+....++++ +++++- T Consensus 75 nYA~yVHda~Gklkgqprp~gkgn~w~p~-------ae~eFL~kgfe~~~~d~i~avik~e~k~ 131 (131) T protein:vir:10 75 NYAAAVNAAKGKLKGKPRPDGSGNYWDPN-------GEPDFLRKGFERDGLNEIKAIIRQGYKV 131 (131) T ss_pred eeeeeeecCccccCCCcCCCCCcceecCC-------CChhhhhhhhhccchHHHHHHHhhhcCC Confidence 55555644111122222333444455421 122499999976644444433 333332 No 173 >protein:vir:98892 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:899 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164422;genbank:gi:56694912;genbank:GeneID:3197282 Probab=95.74 E-value=0.00024 Score=40.47 Aligned_cols=107 Identities=12% Similarity=0.031 Sum_probs=56.5 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecC Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNK 80 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~ 80 (155) |+++++. +.+.+.|. .+.+.+|-..-++.|..+...-+|.++|.|++|-.+.. ..|.++ + T Consensus 2 mkvkv~~---~~~~~~~~------~~~~~~aq~~~~~ev~~~~~~yVP~~~G~L~~s~~~~s-----~~g~I~----y-- 61 (108) T protein:vir:98 2 PKIRVEL---SGAKDKLS------PQTQRRGQYAMANQMLQDMNQFVPMEEGILRLTGNISS-----DAEEIY----Y-- 61 (108) T ss_pred ceeEeee---hHHHHHHH------HHHHHHHHHHHHHHHHHhhcccCcCcCCccccceeecc-----CCceEE----e-- Confidence 7788664 33333332 12334456667778888888899999999999943321 122222 2 Q ss_pred CccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 81 KKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAK 146 (155) Q Consensus 81 ~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~ 146 (155) .+||+++.=||...... 
.+..| ++=|.| |.....+++++.+.+.++= T Consensus 62 -~tPYAr~qYYg~~~n~~-~p~ag----------------~~W~er-aka~~~~~~~~~~~k~~k~ 108 (108) T protein:vir:98 62 -NTPYAKRRFYEPAYNYT-TPGTG----------------PRWDMK-AKRLFISDWERAYMKGANW 108 (108) T ss_pred -cChhhHHhhhccccCCC-CCCCc----------------chhHHH-HHhhhhHHHHHHHHHhhcC Confidence 25677777676432111 11111 122443 5545555555544433333 No 174 >protein:vir:3427 Length: 192 # NCBI annotation: tail component # Family: family:all:869 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040590;genbank:gi:9626254;genbank:GeneID:2703485 Probab=95.68 E-value=0.00043 Score=39.02 Aligned_cols=143 Identities=11% Similarity=0.135 Sum_probs=67.1 Q ss_pred eeeccHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhCCCCcc----hhhcceeeeecccccCCceEEEEEEec Q lcl|NC_019933. 5 ITSLDISGVLSALNDLRD-DSDSVSRTMAFESAAVVRDSAKAHVRSKTG----RLKGAIYAVYVPEESTEVRHVYAVSWN 79 (155) Q Consensus 5 m~~~~l~~L~~~l~~l~~-~~~~~~r~a~~~~a~~i~~eak~~aP~~tG----~Lr~sI~~~~~~~~~~~g~~~~~Vg~~ 79 (155) |++-||+.+.+.|+.|.+ ...+++..|+..+|.-+...+...+...+| .+++.+.... .+.+ ...+.|.++ T Consensus 1 ~~ik~l~~~~~~L~~i~~~~vp~A~~rAiNrta~~a~t~~~r~v~~e~~I~~k~Ir~r~r~~k---As~~-~l~a~I~~~ 76 (192) T protein:vir:34 1 MAIKGLEQAVENLSRISKTAVPGAAAMAINRVASSAISQSASQVARETKVRRKLVKERARLKR---ATVK-NPQARIKVN 76 (192) T ss_pred CcchhHHHHHHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCHHHHHhhheecc---ccCC-CceEEEEEe Confidence 666678888888877754 467788888888888887777766654433 5666665532 1111 123333333 Q ss_pred CCccccchhhhc---------------------cccc---------cCCCcCCCCceeeeeeccc------ceeeeC-Cc Q lcl|NC_019933. 80 KKKAPHGHLVEY---------------------GHWR---------TNVVAEVDGKWLFTKEKLA------TPVHVP-AR 122 (155) Q Consensus 80 ~~~a~~~~~vEf---------------------Gt~~---------~~~~~~~~~~~~~~~~~~~------gt~~~p-a~ 122 (155) .+.-+--.|..+ |+.- ......+++.|-.+....+ --+++| .. 
T Consensus 77 ~~~l~~~~l~~~~~~~~rr~~~~~~~~~~~~~~g~~~k~Gk~~f~gaFia~m~ng~~~Vf~R~~gk~R~PIe~vkIpis~ 156 (192) T protein:vir:34 77 RGDLPVIKLGNARVVLSRRRRRKKGQRSSLKGGGSVLVVGNRRIPGAFIQQLKNGRWHVMQRVAGKNRYPIDVVKIPMAV 156 (192) T ss_pred ccceeeeeecccccccccccccccccccccccccceeeecceecCCcccccCCCCCceeEEEccCCCccceeEEEechhH Confidence 222110011111 0000 0001112222211111111 113334 23 Q ss_pred cchhhHHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_019933. 123 SFLRPGYDSVKGRLVEVANKAGAKRLAELRSKR 155 (155) Q Consensus 123 PFlrPA~~~~~~~~~~~i~~~l~~~i~k~~~k~ 155 (155) |. ..||+. ++.+.+.+.+.++|.++|+.| T Consensus 157 ~l-~~af~~---~~~~~~~~~~~~El~~~L~~~ 185 (192) T protein:vir:34 157 PL-TTAFKQ---NIERIRRERLPKELGYALQHQ 185 (192) T ss_pred HH-HHHHHH---HHHHHHHHHHHHHHHHHHHHH Confidence 33 556654 444444455555555555555 No 175 >protein:vir:105007 Length: 146 # NCBI annotation: conserved phage protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459972;genbank:gi:85701387;genbank:GeneID:3882148 Probab=95.39 E-value=0.0007 Score=37.88 Aligned_cols=134 Identities=13% Similarity=0.055 Sum_probs=78.0 Q ss_pred eee---ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccc------cCCceE-EE Q lcl|NC_019933. 5 ITS---LDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEE------STEVRH-VY 74 (155) Q Consensus 5 m~~---~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~------~~~g~~-~~ 74 (155) |+. +.|+||+++++.|..+..+ .+++++++...-.+.++..+ +.++-......+ ..++.+ .. 
T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~-~~~~~~~al~~ga~~i~~~a-------k~~ap~~~~~~~~~~~~~~~~~~~~~~ 72 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLR-GEKIEDKALAAGGEPIRKAI-------AERAPRSPSPKKRSKSEPWRTGQHGAD 72 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHH-HHHHHHHHHHHHHHHHHHHH-------HHhCCCccccccccccccccccccccc Confidence 773 4589999999999999887 56677777766666655543 444422211111 111111 00 Q ss_pred EEEecCCccc-cchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019933. 75 AVSWNKKKAP-HGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLAELRS 153 (155) Q Consensus 75 ~Vg~~~~~a~-~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~k~~~ 153 (155) .|.+...+.. -..++..|. ....++.++||+|.| ....-+| =+|-+....++..+.+.+.+.+.|.+.++ T Consensus 73 ~i~~~~~~~~~g~~~~~vg~-----~~~~~~~~~y~~f~E---~GT~~~~-a~PFl~pa~~~~k~~~~~~~~~~l~~~l~ 143 (146) T protein:vir:10 73 QIKVTKAKLEGGIKTVKIGL-----NKADRSPWFYLKFHE---WGTSKMP-AHPFIEPGFNASKAEAVRAMTDILKNEMR 143 (146) T ss_pred cceeccccccccceeEEeee-----ccCCCCCcceeeeec---cCCCCCC-CCcchhHHHHHhHHHHHHHHHHHHHHHHh Confidence 1111110000 001111111 112345678888765 5567777 57888888888889999999999999887 Q ss_pred cC Q lcl|NC_019933. 154 KR 155 (155) Q Consensus 154 k~ 155 (155) |= T Consensus 144 ka 145 (146) T protein:vir:10 144 LD 145 (146) T ss_pred hc Confidence 77 No 176 >protein:vir:107568 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338191;genbank:gi:77020147;genbank:GeneID:3703699 Probab=95.39 E-value=0.0007 Score=37.88 Aligned_cols=134 Identities=13% Similarity=0.055 Sum_probs=78.0 Q ss_pred eee---ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccc------cCCceE-EE Q lcl|NC_019933. 
5 ITS---LDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEE------STEVRH-VY 74 (155) Q Consensus 5 m~~---~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~------~~~g~~-~~ 74 (155) |+. +.|+||+++++.|..+..+ .+++++++...-.+.++..+ +.++-......+ ..++.+ .. T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~-~~~~~~~al~~ga~~i~~~a-------k~~ap~~~~~~~~~~~~~~~~~~~~~~ 72 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLR-GEKIEDKALAAGGEPIRKAI-------AERAPRSPSPKKRSKSEPWRTGQHGAD 72 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHH-HHHHHHHHHHHHHHHHHHHH-------HHhCCCccccccccccccccccccccc Confidence 773 4589999999999999887 56677777766666655543 444422211111 111111 00 Q ss_pred EEEecCCccc-cchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019933. 75 AVSWNKKKAP-HGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLAELRS 153 (155) Q Consensus 75 ~Vg~~~~~a~-~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~k~~~ 153 (155) .|.+...+.. -..++..|. ....++.++||+|.| ....-+| =+|-+....++..+.+.+.+.+.|.+.++ T Consensus 73 ~i~~~~~~~~~g~~~~~vg~-----~~~~~~~~~y~~f~E---~GT~~~~-a~PFl~pa~~~~k~~~~~~~~~~l~~~l~ 143 (146) T protein:vir:10 73 QIKVTKAKLEGGIKTVKIGL-----NKADRSPWFYLKFHE---WGTSKMP-AHPFIEPGFNASKAEAVRAMTDILKNEMR 143 (146) T ss_pred cceeccccccccceeEEeee-----ccCCCCCcceeeeec---cCCCCCC-CCcchhHHHHHhHHHHHHHHHHHHHHHHh Confidence 1111110000 001111111 112345678888765 5567777 57888888888889999999999999887 Q ss_pred cC Q lcl|NC_019933. 
154 KR 155 (155) Q Consensus 154 k~ 155 (155) |= T Consensus 144 ka 145 (146) T protein:vir:10 144 LD 145 (146) T ss_pred hc Confidence 77 No 177 >protein:vir:102085 Length: 146 # NCBI annotation: head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512318;genbank:gi:89152487;genbank:GeneID:3953078 Probab=95.39 E-value=0.0007 Score=37.88 Aligned_cols=134 Identities=13% Similarity=0.055 Sum_probs=78.0 Q ss_pred eee---ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccc------cCCceE-EE Q lcl|NC_019933. 5 ITS---LDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEE------STEVRH-VY 74 (155) Q Consensus 5 m~~---~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~------~~~g~~-~~ 74 (155) |+. +.|+||+++++.|..+..+ .+++++++...-.+.++..+ +.++-......+ ..++.+ .. T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~-~~~~~~~al~~ga~~i~~~a-------k~~ap~~~~~~~~~~~~~~~~~~~~~~ 72 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLR-GEKIEDKALAAGGEPIRKAI-------AERAPRSPSPKKRSKSEPWRTGQHGAD 72 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHH-HHHHHHHHHHHHHHHHHHHH-------HHhCCCccccccccccccccccccccc Confidence 773 4589999999999999887 56677777766666655543 444422211111 111111 00 Q ss_pred EEEecCCccc-cchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019933. 75 AVSWNKKKAP-HGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLAELRS 153 (155) Q Consensus 75 ~Vg~~~~~a~-~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~k~~~ 153 (155) .|.+...+.. -..++..|. 
....++.++||+|.| ....-+| =+|-+....++..+.+.+.+.+.|.+.++ T Consensus 73 ~i~~~~~~~~~g~~~~~vg~-----~~~~~~~~~y~~f~E---~GT~~~~-a~PFl~pa~~~~k~~~~~~~~~~l~~~l~ 143 (146) T protein:vir:10 73 QIKVTKAKLEGGIKTVKIGL-----NKADRSPWFYLKFHE---WGTSKMP-AHPFIEPGFNASKAEAVRAMTDILKNEMR 143 (146) T ss_pred cceeccccccccceeEEeee-----ccCCCCCcceeeeec---cCCCCCC-CCcchhHHHHHhHHHHHHHHHHHHHHHHh Confidence 1111110000 001111111 112345678888765 5567777 57888888888889999999999999887 Q ss_pred cC Q lcl|NC_019933. 154 KR 155 (155) Q Consensus 154 k~ 155 (155) |= T Consensus 144 ka 145 (146) T protein:vir:10 144 LD 145 (146) T ss_pred hc Confidence 77 No 178 >protein:vir:102875 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338140;genbank:gi:77020200;genbank:GeneID:3703784 Probab=95.39 E-value=0.0007 Score=37.88 Aligned_cols=134 Identities=13% Similarity=0.055 Sum_probs=78.0 Q ss_pred eee---ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccc------cCCceE-EE Q lcl|NC_019933. 5 ITS---LDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEE------STEVRH-VY 74 (155) Q Consensus 5 m~~---~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~------~~~g~~-~~ 74 (155) |+. +.|+||+++++.|..+..+ .+++++++...-.+.++..+ +.++-......+ ..++.+ .. T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~-~~~~~~~al~~ga~~i~~~a-------k~~ap~~~~~~~~~~~~~~~~~~~~~~ 72 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLR-GEKIEDKALAAGGEPIRKAI-------AERAPRSPSPKKRSKSEPWRTGQHGAD 72 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHH-HHHHHHHHHHHHHHHHHHHH-------HHhCCCccccccccccccccccccccc Confidence 773 4589999999999999887 56677777766666655543 444422211111 111111 00 Q ss_pred EEEecCCccc-cchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019933. 
75 AVSWNKKKAP-HGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLAELRS 153 (155) Q Consensus 75 ~Vg~~~~~a~-~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~k~~~ 153 (155) .|.+...+.. -..++..|. ....++.++||+|.| ....-+| =+|-+....++..+.+.+.+.+.|.+.++ T Consensus 73 ~i~~~~~~~~~g~~~~~vg~-----~~~~~~~~~y~~f~E---~GT~~~~-a~PFl~pa~~~~k~~~~~~~~~~l~~~l~ 143 (146) T protein:vir:10 73 QIKVTKAKLEGGIKTVKIGL-----NKADRSPWFYLKFHE---WGTSKMP-AHPFIEPGFNASKAEAVRAMTDILKNEMR 143 (146) T ss_pred cceeccccccccceeEEeee-----ccCCCCCcceeeeec---cCCCCCC-CCcchhHHHHHhHHHHHHHHHHHHHHHHh Confidence 1111110000 001111111 112345678888765 5567777 57888888888889999999999999887 Q ss_pred cC Q lcl|NC_019933. 154 KR 155 (155) Q Consensus 154 k~ 155 (155) |= T Consensus 144 ka 145 (146) T protein:vir:10 144 LD 145 (146) T ss_pred hc Confidence 77 No 179 >protein:vir:1164 Length: 156 # NCBI annotation: predicted tail completion # Family: family:all:370 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490613;genbank:gi:17313233;genbank:GeneID:927308 Probab=95.12 E-value=0.00023 Score=40.53 Aligned_cols=124 Identities=15% Similarity=0.081 Sum_probs=61.4 Q ss_pred eeeccHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHh-----CCC-------C-------cchhhc-------- Q lcl|NC_019933. 5 ITSLDISGVLSALNDLRDDSD-SVSRTMAFESAAVVRDSAKAH-----VRS-------K-------TGRLKG-------- 56 (155) Q Consensus 5 m~~~~l~~L~~~l~~l~~~~~-~~~r~a~~~~a~~i~~eak~~-----aP~-------~-------tG~Lr~-------- 56 (155) |+. +|+.|++.|+.|-.... ..-++-+++-|..++...+.+ .|. 
+ .|.+++ T Consensus 1 m~~-~~~~l~~~L~~ll~~L~~~~~~~l~r~Ig~~l~~~t~~Rf~~q~~PdG~~W~p~~~~~~~~~~~~~~~~~~m~~~l 79 (156) T protein:vir:11 1 MAD-SLEALEDWAGPILRALEPGPRAALARSLARDLRRSQQKRVMAQRNPDGSAYEPRKKRELRGKQGRIRRKIKMFQKL 79 (156) T ss_pred Cch-hHHHHHHHHHHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHhhhccccccchhhhhhh Confidence 553 56677766666532211 112334566666666666554 242 1 122111 Q ss_pred --c--eeeeecccccCCceEEEEEEecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHH Q lcl|NC_019933. 57 --A--IYAVYVPEESTEVRHVYAVSWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSV 132 (155) Q Consensus 57 --s--I~~~~~~~~~~~g~~~~~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~ 132 (155) + |... .+...+.||+......|++..-||....... . .-.+.|||+|||-=.- .. T Consensus 80 ~~~~~l~~~-------~~~~~a~vg~~Gs~~~yA~iHQfG~~~~~~~---~----------~~~v~iPaRp~LG~s~-~d 138 (156) T protein:vir:11 80 RTVRYLRAK-------GDAQAITVSFAGRIARIARVHQYGLRDRAEP---G----------APEVSYAQRLLLGFDS-SD 138 (156) T ss_pred hhhheeeee-------ecCcEEEEEecCCchhhhhhhcccccccccC---C----------CCcccccccccCCCCH-HH Confidence 1 2111 1112345666556677888888996422110 1 1135799999996653 34 Q ss_pred HHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 133 KGRLVEVANKAGAKRLAE 150 (155) Q Consensus 133 ~~~~~~~i~~~l~~~i~k 150 (155) ++++++.|.++|.....= T Consensus 139 ~~~i~~~i~~~l~~~~~~ 156 (156) T protein:vir:11 139 METIQNGILAHIDANSPI 156 (156) T ss_pred HHHHHHHHHHHHhhcCCC Confidence 444555554444433222 No 180 >protein:vir:396 Length: 184 # NCBI annotation: gp11 # Family: family:all:869 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046906;genbank:gi:9630476;genbank:GeneID:1261650 Probab=95.09 E-value=0.00078 Score=37.62 Aligned_cols=142 Identities=12% Similarity=0.151 Sum_probs=59.1 Q ss_pred eeeccHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhCCC----CcchhhcceeeeecccccCCceEEEEEEec Q lcl|NC_019933. 
5 ITSLDISGVLSALNDLRD-DSDSVSRTMAFESAAVVRDSAKAHVRS----KTGRLKGAIYAVYVPEESTEVRHVYAVSWN 79 (155) Q Consensus 5 m~~~~l~~L~~~l~~l~~-~~~~~~r~a~~~~a~~i~~eak~~aP~----~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~ 79 (155) |++-||+.+++.|..|.+ ...+++..|+..++.-++.++...+.. ....+++.+.+.. .+ .+...+.|-++ T Consensus 1 ~~v~~l~~~~~~L~~l~~~~v~kA~~rAiNrt~~~~rt~~~r~v~~~~~i~~~~ir~r~~~~k---as-~~~l~a~I~~~ 76 (184) T protein:vir:39 1 MSLKGLEQAIENLNSISKTAVPRASAQAVNRVANRAVSRSVAVVSKDTRVPRKLVKQRARVKR---AT-VNKPRALIRVN 76 (184) T ss_pred CchHHHHHHHHHHhccCHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHhhheecc---cC-CCCeEEEEEEe Confidence 554555556666666633 467888888888888888777665543 3456777765421 11 11222222222 Q ss_pred CCccccchhhhcccccc--------------------------CCCcCCCCce-eeeee----cccceeeeC-Cccchhh Q lcl|NC_019933. 80 KKKAPHGHLVEYGHWRT--------------------------NVVAEVDGKW-LFTKE----KLATPVHVP-ARSFLRP 127 (155) Q Consensus 80 ~~~a~~~~~vEfGt~~~--------------------------~~~~~~~~~~-~~~~~----~~~gt~~~p-a~PFlrP 127 (155) .+.-+. +-||+... ......+|.+ ++.+- .+..-+++| +.| +.. T Consensus 77 ~~~i~l---~~~g~~~~k~~~~~~~~~~~~~~~~~g~~~~~gaFia~~~~G~~~Vf~R~gk~R~PI~~~~~~i~~~-~~e 152 (184) T protein:vir:39 77 RGNLPA---IKLGTASVRLSRRKRDKKGANSVLRIGPFRFPGGFIQQLKNGRWHVMRRTSKPRYPIEVVSIPLAAP-LTT 152 (184) T ss_pred ccceee---eeccccccccCccccccccccceeeecceecCcceeeecCCCceEEEEEecCcccceeEEEcCchHH-HHH Confidence 211110 11221100 0000011111 11110 001122334 233 344 Q ss_pred HHHHHH-----HHHHHHHHHHHHHHHHHHhcc Q lcl|NC_019933. 128 GYDSVK-----GRLVEVANKAGAKRLAELRSK 154 (155) Q Consensus 128 A~~~~~-----~~~~~~i~~~l~~~i~k~~~k 154 (155) +++... 
+.+.+.|..+|...|..++++ T Consensus 153 ~~~~~~~~~~~~~~~~el~~~l~~~L~~~l~r 184 (184) T protein:vir:39 153 AFKEELPKLMESDMPKELRASLTNQLRLILTR 184 (184) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCC Confidence 444322 233334444444444444444 No 181 >protein:vir:4460 Length: 170 # NCBI annotation: hypothetical protein # Family: family:all:2152 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700383;genbank:gi:23505455;genbank:GeneID:955662 Probab=95.05 E-value=0.00012 Score=42.14 Aligned_cols=145 Identities=17% Similarity=0.101 Sum_probs=86.0 Q ss_pred Cceeee-eccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-----------CC-Ccchhhcceeeeeccccc Q lcl|NC_019933. 1 MSSKIT-SLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHV-----------RS-KTGRLKGAIYAVYVPEES 67 (155) Q Consensus 1 M~~~m~-~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~a-----------P~-~tG~Lr~sI~~~~~~~~~ 67 (155) |.-+-. -++|+..++. .-.+..+|.|..+.+.+.-.+|+.++ |. .||.|..||.....+..+ T Consensus 1 M~~~~~lHvdF~qp~~~-----~Fnr~r~RraF~~iGq~h~r~Arrlvm~RGrs~pGe~P~~~TGrLa~SIgy~Vpras~ 75 (170) T protein:vir:44 1 MPQKAYLHVDFVQPEEL-----VFNRARMRRAFVKIGQVHMRDARRLVMKRGRSKPGENPSYRTGQLARSIGYYVPRASK 75 (170) T ss_pred CCCCceeEEeeecCCce-----eecHHHHHHHHHHHhHHHHHHHHHHHHHhcCCCCCCCCcchhhhhhhhhhhccccccC Confidence 442211 1122222111 01245689999999999999998654 32 689999999655433322 Q ss_pred CCceEEEEEEecCCcc---------ccchhhhccccccCCCcCCCCceeeeeecccceee--eCCccchhhHHHHHHHHH Q lcl|NC_019933. 68 TEVRHVYAVSWNKKKA---------PHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVH--VPARSFLRPGYDSVKGRL 136 (155) Q Consensus 68 ~~g~~~~~Vg~~~~~a---------~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~--~pa~PFlrPA~~~~~~~~ 136 (155) ......+.|.+|.... +|-.|+.||-.+ +..+.+. -++...+|+=| -|-.-||.-+++..+... 
T Consensus 76 ~rpG~mVkIaPNqk~G~g~r~i~g~fYPafL~YGVr~-gakr~k~----hhr~a~ggsgwriaPR~Nym~~~l~~~~~wt 150 (170) T protein:vir:44 76 KRPGLMVKIAPNQKNGEGNRHINGAFYPAFLFYGVRR-GAKRKKG----HHRGASGGSGWRVEPRNNYMTEVLDKRRSWT 150 (170) T ss_pred CCCceeEEecCCCCCCCCccccccccchhhhhhhhhc-ccccchh----hcccccCCCcceeccchhHHHHHHHhhHHHH Confidence 2234556777776543 788899999632 2222110 01111112111 366789999999999999 Q ss_pred HHHHHHHHHHHHHHHhccC Q lcl|NC_019933. 137 VEVANKAGAKRLAELRSKR 155 (155) Q Consensus 137 ~~~i~~~l~~~i~k~~~k~ 155 (155) -..+.++|+..|.-.-.+. T Consensus 151 ~~~L~r~L~~sLrp~~r~~ 169 (170) T protein:vir:44 151 RYVLSRELRKSLRPQRRKK 169 (170) T ss_pred HHHHHHHHHHhcCcccccC Confidence 8888888888875443333 No 182 >protein:vir:1273 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690765;genbank:gi:22855005;genbank:GeneID:955232 Probab=94.54 E-value=0.00059 Score=38.29 Aligned_cols=127 Identities=7% Similarity=-0.048 Sum_probs=80.6 Q ss_pred eeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecCCccc Q lcl|NC_019933. 5 ITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNKKKAP 84 (155) Q Consensus 5 m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~~a~ 84 (155) |+.+.++||++.++.|..+..++ .+++..+...-.+.....+- .+.... ....+.....+.++-.+.... T Consensus 1 M~~~~i~Gl~el~~~l~~l~~~~-~~~~~~al~~~a~~v~~~~k-------~~ap~~--~~~tg~l~~~I~~~~~k~~~~ 70 (127) T protein:vir:12 1 MADMSFDGIDDLTQYFEKIGGDI-EKVEPVALKAGGEIIAERQR-------SHVNRS--DKKQPHMQDNITVSNVRESKD 70 (127) T ss_pred CeeeeehhHHHHHHHHHHhhHHH-HHHHHHHHHHHHHHHHHHHH-------HhCCCC--CCChhHHHHhhhccccccccC Confidence 99999999999999999988765 66778877777777666553 332110 001111111111111111111 Q ss_pred cchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019933. 
85 HGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLAELRS 153 (155) Q Consensus 85 ~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~k~~~ 153 (155) -..+++.|+ ..+..+||.|.|+ ...-+|= +|=+....++..+.+.+.+.+.|.+.++ T Consensus 71 g~~~v~Vg~--------~~~~~~y~~f~E~---GT~~~~a-~Pf~~pa~~~~~~~~~~~~~~~~~~~lk 127 (127) T protein:vir:12 71 GVRFVAVGP--------NKKVAYRGRFLEW---GTSKMPP-QPFIEKGGKEGEGPAVELMERILTAPIK 127 (127) T ss_pred ceeEEEEee--------CCCCcceeeeecc---CccCCCC-CccchHhHHHHHHHHHHHHHHHHHHhcC Confidence 112344443 2345788988888 5555553 5778888889999999999999999999 No 183 >protein:vir:487 Length: 187 # NCBI annotation: hypothetical protein # Family: family:all:2152 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543094;swissprot:trembl:q8w625;genbank:gi:18249906;uniprot:Q8W625;genbank:GeneID:929690 Probab=94.21 E-value=0.00066 Score=38.02 Aligned_cols=151 Identities=15% Similarity=0.138 Sum_probs=84.8 Q ss_pred CceeeeeccHHHHHHH---------HHHHHHH--HHHHHHHHHHHHHHHHHHHHHHhC-----------C-CCcchhhcc Q lcl|NC_019933. 1 MSSKITSLDISGVLSA---------LNDLRDD--SDSVSRTMAFESAAVVRDSAKAHV-----------R-SKTGRLKGA 57 (155) Q Consensus 1 M~~~m~~~~l~~L~~~---------l~~l~~~--~~~~~r~a~~~~a~~i~~eak~~a-----------P-~~tG~Lr~s 57 (155) |+--+.-- |+.... +++.++. -+..+|.|..+.+.+.-.+|+.++ | ..||.|..| T Consensus 1 ~~~~~~~~--~~~nam~~~~~lHvdF~qp~~~~Fnr~riRraF~~iGq~h~r~ArrLvm~RGrs~pge~P~~qTGrLa~S 78 (187) T protein:vir:48 1 MKNCVQRD--DGVNAMNQTAFLHVDFKQPKELEFNRARLRRAFVQIGRVYMRDARRLVIKRGRSGPGENPGYQTGRLARS 78 (187) T ss_pred Cccccccc--cchhhhhhccceeEeeecCCceeecHHHHHHHHHHHhHHHHHHHHHHHHhcccCCCCCCCcchhhhhhhh Confidence 33222111 111111 1111121 245689999999999999998765 3 268999999 Q ss_pred eeeeecccccCCceEEEEEEecCCc-----------cccchhhhccccccCCCcCCCCceeeeeecccceee-eCCccch Q lcl|NC_019933. 
58 IYAVYVPEESTEVRHVYAVSWNKKK-----------APHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVH-VPARSFL 125 (155) Q Consensus 58 I~~~~~~~~~~~g~~~~~Vg~~~~~-----------a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~-~pa~PFl 125 (155) |.....+..+.-....+.|.+|... .+|-.|+.||-. .+..+...... ....+..++-. -|-.-|| T Consensus 79 Igy~Vpkat~~RpG~mVkIaPNqk~G~g~r~~Pi~gdfYPafL~YGVr-~ga~~~~~~~k-~~~~~~~sgwriaPR~Nym 156 (187) T protein:vir:48 79 IGYYVPKKTTRRPGLMVKISPNQKNGQGNRRFPEGAPYYPAFLYYGVR-HSAYGMDKKDK-RQKKHHSSTFRLAPRNNFM 156 (187) T ss_pred hhhccccccCCCCcceEEecCCcccCcccccccccccchhHHHHhhhh-hhhhccchhhh-hhhcccCCcceeccchhHH Confidence 9654332222223345567776321 377888999953 22222111000 00011111112 3667899 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_019933. 126 RPGYDSVKGRLVEVANKAGAKRLAELRSKR 155 (155) Q Consensus 126 rPA~~~~~~~~~~~i~~~l~~~i~k~~~k~ 155 (155) .-+++..+...-..+.++|+..|.-.-.+. T Consensus 157 ~~~L~~~~~wt~~~L~raL~~sLrp~~r~~ 186 (187) T protein:vir:48 157 ADVIERRRHWTQELLSRELQRSLRPVKRKH 186 (187) T ss_pred HHHHHhhHHHHHHHHHHHHHHhcCcccccC Confidence 999999999999988888888885444444 No 184 >protein:vir:5745 Length: 135 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892056;genbank:gi:33770519;interpro:IPR010064;interpro:IPR011693;uniprot:Q7Y404;genbank:GeneID:2637451 Probab=93.54 E-value=0.0032 Score=34.24 Aligned_cols=132 Identities=10% Similarity=0.068 Sum_probs=83.7 Q ss_pred eeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecCCc- Q lcl|NC_019933. 4 KITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNKKK- 82 (155) Q Consensus 4 ~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~~- 82 (155) .|..+.++||+++++.|..+..++-+++++.+...-.+.++..+-..... . ....++.....+.|+..+.. 
T Consensus 1 M~~~~~i~Gl~el~~~l~~L~~~~~~k~~~~Al~~~a~~v~~~~k~~ap~-------~-~~~~~g~l~~~I~i~~~k~~~ 72 (135) T protein:vir:57 1 MIPEIEISGLQELERRLIAVGEEVGTKILRDAGRAAMAVVEADMKQNAGY-------D-NSSTNAHMRDSIKIRSSRGKA 72 (135) T ss_pred CceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC-------C-CCCchhhHHhhcccccccccc Confidence 88889999999999999999999988888888877777766665333221 1 00111111222222211111 Q ss_pred cccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_019933. 83 APHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLAELRSKR 155 (155) Q Consensus 83 a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~k~~~k~ 155 (155) ......+..| +..+.++|+.|.|+ ...-+|= +|=+.-..++..+.+.+.+.+.|.+.|+|= T Consensus 73 ~~~~v~v~vg--------~~~~~~~~~~f~E~---GT~~~~a-~PF~~pa~~~~~~~~~~~~~~~~~~~l~ka 133 (135) T protein:vir:57 73 GSTVVVLRVG--------PTRSHYMKALAQEF---GTIKQVA-KPFIRPALDYNKMQVLRILTVEIRDGLSTL 133 (135) T ss_pred cceeEEEEec--------CCCCcceeEeeccc---CCCCCCC-CcchhHhHHHhHHHHHHHHHHHHHHHHHHh Confidence 0000111122 22333444555566 7777774 688998999999999999999999999988 No 185 >protein:vir:79687 Length: 113 # NCBI annotation: hypothetical protein # Family: family:all:899 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285886;genbank:gi:148750843;genbank:GeneID:5220386 Probab=93.47 E-value=0.0014 Score=36.30 Aligned_cols=109 Identities=16% Similarity=0.150 Sum_probs=52.5 Q ss_pred eccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecCCccccc Q lcl|NC_019933. 7 SLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNKKKAPHG 86 (155) Q Consensus 7 ~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~~a~~~ 86 (155) +-+|+++.++|+ .+.+++|-..-++.|..+...-+|.++|.|++|..+ +++. 
|-++ +||+ T Consensus 1 ~~dL~~~~~~~~------~~~~~raQ~~l~~ev~~~~~pYVP~~~G~Lk~S~~i-------~s~~----I~y~---tPYA 60 (113) T protein:vir:79 1 MSDLSVFSRMAQ------STGSRSVRLQVLNQMHQDMEQYVPKRAGFLRSQSFV-------NDTG----IHYT---AKYA 60 (113) T ss_pred CchHHHHHHhhc------hhHHHHHHHHHHHHHHHhhcccCcccccchhccccc-------cCCe----eEec---Chhh Confidence 235555554443 345556666677788888999999999999999632 1232 2222 5677 Q ss_pred hhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_019933. 87 HLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLAELRSKR 155 (155) Q Consensus 87 ~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~k~~~k~ 155 (155) ++.=||......... ..|-.-.++=|.| |....+++.++...+. .+++. T Consensus 61 r~qyYg~~~~~~~~~------------~t~p~ag~~W~er-aKa~h~~~w~~~~~~a-------~~~G~ 109 (113) T protein:vir:79 61 RAQFYGFVNGHRVRN------------YSTPGTGRRWDLK-AKAVYKADWQKVAVAA-------FLKEA 109 (113) T ss_pred hHhhccccCCCCccc------------cCCCCCCchhhHH-HHHHhHHHHHHHHHHH-------hhccc Confidence 666555321110000 0000001122332 3333333333332222 33333 No 186 >protein:vir:102190 Length: 93 # NCBI annotation: gp21 # Family: family:all:2713 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655217;genbank:gi:109522797;genbank:GeneID:4157429 Probab=93.19 E-value=0.0011 Score=36.87 Aligned_cols=91 Identities=15% Similarity=0.141 Sum_probs=58.3 Q ss_pred HHHHHHHHHHHHHHHHHHhCC--CCcchhhcceeeeecccccCCceEEEEEEecCCccccchhhhccccccCCCcCCCCc Q lcl|NC_019933. 28 SRTMAFESAAVVRDSAKAHVR--SKTGRLKGAIYAVYVPEESTEVRHVYAVSWNKKKAPHGHLVEYGHWRTNVVAEVDGK 105 (155) Q Consensus 28 ~r~a~~~~a~~i~~eak~~aP--~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~ 105 (155) +..-..-+|.....+||.||| .+||+-|++|...... .++... .|.... .-.|+-|+|.++... 
T Consensus 1 ~~~~~d~aa~~le~~aK~nApW~DRTg~AR~~l~~~~~~--~g~~~~--~i~lsh-~v~Yg~~LE~a~~~k--------- 66 (93) T protein:vir:10 1 MKATSNYHAVEGTAHMKEHAPWTDRTGAARAGLHAVAST--PQPDRY--EIVFAH-TVHYGIWLEIANSGR--------- 66 (93) T ss_pred CchhhhHHHHHHHHHHhcCCCccccchhhhhhhcccccc--cCCceE--EEEEec-CeeccceEEeecCCC--------- Confidence 555556678889999999999 5799999999543221 122223 333332 246888899987521 Q ss_pred eeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019933. 106 WLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLAELR 152 (155) Q Consensus 106 ~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~k~~ 152 (155) ---|+|+.+.-.+++++-+. .-+.++. T Consensus 67 ----------------yaIl~Ptv~~~~~~i~~g~~----~ll~~l~ 93 (93) T protein:vir:10 67 ----------------YEIIMPTVHHEGKLMAQRLR----GLLGRLR 93 (93) T ss_pred ----------------ccchhhhHHHHHHHHHHHHH----HHHHhcC Confidence 12688999877777777663 3333333 No 187 >protein:vir:106570 Length: 182 # NCBI annotation: putative protein # Family: family:all:6475 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958588;genbank:gi:41179258;genbank:GeneID:2717106 Probab=92.98 E-value=0.0017 Score=35.80 Aligned_cols=126 Identities=8% Similarity=0.028 Sum_probs=74.8 Q ss_pred eeeccHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecCC Q lcl|NC_019933. 5 ITSLDISGVLSALNDLRDDSDSVSR---TMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNKK 81 (155) Q Consensus 5 m~~~~l~~L~~~l~~l~~~~~~~~r---~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~ 81 (155) |..+.+.+|+++.+.|....+.+.+ .++..+++.+..++..++- .. .+.++++...++.+-+... 
T Consensus 1 m~~v~i~Gld~L~~kl~~~~~~~~~~v~~a~~~~~~~~a~~v~~~ak-------~~-----~PvdtG~Lr~SI~~~~~~~ 68 (182) T protein:vir:10 1 MIEVELKGVNELRAKLKKLPDIMAKATANAQENAIEQAEAYAVDELQ-------SS-----IKYSTGELTRSFKHEVKVD 68 (182) T ss_pred CeEEEEecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------hh-----CCCCchhhhhceeeeeeec Confidence 9999999999999999999877765 3455555666666666653 22 2334444443332211111 Q ss_pred ccccchhhhccccccCCCcCCCCceeeeeecccceeee--C----Cccc------------------------------- Q lcl|NC_019933. 82 KAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHV--P----ARSF------------------------------- 124 (155) Q Consensus 82 ~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~--p----a~PF------------------------------- 124 (155) ....- |+ -....-|..++|+||--+ . -.|. T Consensus 69 ~~~~~-----g~--------V~~~~~ya~yvE~GTG~~~~~~~~~~~p~~~~~~~~~~w~~~~~~v~~~~a~~~~~~~~~ 135 (182) T protein:vir:10 69 GDEVI-----GR--------WWNSSMVAVFREFGTGLVGERSHKQLPKNVAIIYRQTPWFFPVDSVDLDLTKIYGIPKIK 135 (182) T ss_pred CCeEE-----EE--------eecCCCccceeecCcccccccCccccCccceeeeecCCceeeccccccccccccccceee Confidence 10000 10 111123555666666110 0 0011 Q ss_pred -------------hhhHHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_019933. 125 -------------LRPGYDSVKGRLVEVANKAGAKRLAELRSKR 155 (155) Q Consensus 125 -------------lrPA~~~~~~~~~~~i~~~l~~~i~k~~~k~ 155 (155) =+|=|...-.+..+.+.+.|+++|++++++. T Consensus 136 ~~~~~~~~t~G~~aqPFl~pA~~~~~~~i~~~i~~~i~~~l~~~ 179 (182) T protein:vir:10 136 INGKYFYRTTGQPARQFMTPAANKMAKEAPEIIKRSIDQELHDK 179 (182) T ss_pred ecCceEeecCCCCCCcchHHHHHHhHHHHHHHHHHHHHHHHHHh Confidence 1366777788889999999999999999999 No 188 >protein:vir:4514 Length: 168 # NCBI annotation: unknown # Family: family:all:2152 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599040;genbank:gi:19548998;genbank:GeneID:935228 Probab=92.84 E-value=0.001 Score=36.91 Aligned_cols=145 Identities=13% Similarity=0.092 Sum_probs=79.0 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHHHHh---CC-CCcchhhcceeeeecccccC Q lcl|NC_019933. 
1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMA--------FESAAVVRDSAKAH---VR-SKTGRLKGAIYAVYVPEEST 68 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~--------~~~a~~i~~eak~~---aP-~~tG~Lr~sI~~~~~~~~~~ 68 (155) |....--++|+...+.. -.+..+|.|- +.+...|-.-++.. .| -.||.|..||.....+..+. T Consensus 1 m~~~~lHvdF~qp~~~~-----Fnr~riRraFv~igq~hmr~ArrlV~rrgrs~pGe~P~~qTGrLa~SIgy~Vpras~~ 75 (168) T protein:vir:45 1 MTTSFLHVDFQQPAEMR-----FNRARVRRAFVTIGQRHMRDARRLVMRHARSAPGENPGYQTGRLARSIGYMVPRASKH 75 (168) T ss_pred CCccceeeeeecCCcee-----ecHHHHHHHHHHHhHHHHHHHHHHHhhcccccCCCCCcchhhhhhhhhhhccccccCC Confidence 55433334444322111 1123355553 33344443333321 13 36999999996543333222 Q ss_pred CceEEEEEEecCCc---------cccchhhhccccccCCCcCCCCceeeeeeccccee--eeCCccchhhHHHHHHHHHH Q lcl|NC_019933. 69 EVRHVYAVSWNKKK---------APHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPV--HVPARSFLRPGYDSVKGRLV 137 (155) Q Consensus 69 ~g~~~~~Vg~~~~~---------a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~--~~pa~PFlrPA~~~~~~~~~ 137 (155) .....+.|.+|... .+|-.|+-||-. .+...... -.+...+|+= --|-.-||.-+++..+...- T Consensus 76 rpG~mvkIaPNqk~G~g~r~i~gdfYPafL~YGVr-~gakr~r~----h~rga~ggsgwriaPR~Nym~~~l~~~~~wt~ 150 (168) T protein:vir:45 76 RPGFMARIAPNQRNGEGNRRITGDFYPAFLFYGVR-GGAKRRRS----HHRGASGGSGWRLAPRNNFMVETLEKNRSWTR 150 (168) T ss_pred CCceEEEecCCCCCCCCCCccccccchhhhhhhhh-cchhhhhh----hhccccCCCcceeccchhhHHHHHHhhHHHHH Confidence 23455677776543 378888999963 22222111 0111122221 24677899999999999998 Q ss_pred HHHHHHHHHHHHHHhccC Q lcl|NC_019933. 
138 EVANKAGAKRLAELRSKR 155 (155) Q Consensus 138 ~~i~~~l~~~i~k~~~k~ 155 (155) ..+.++|++.|.-.-.++ T Consensus 151 ~~L~r~L~~sLrp~rr~~ 168 (168) T protein:vir:45 151 YFLARELRKSLKPERRRR 168 (168) T ss_pred HHHHHHHHHhcCcccccC Confidence 888888888886655555 No 189 >protein:vir:3873 Length: 128 # NCBI annotation: putative head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680490;swissprot:trembl:p94214;genbank:gi:22296530;interpro:IPR010064;uniprot:P94214;genbank:GeneID:951688 Probab=90.20 E-value=0.011 Score=31.29 Aligned_cols=127 Identities=13% Similarity=-0.053 Sum_probs=68.5 Q ss_pred eeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeeccc-ccCCceEEEEEEecCCcc Q lcl|NC_019933. 5 ITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPE-ESTEVRHVYAVSWNKKKA 83 (155) Q Consensus 5 m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~-~~~~g~~~~~Vg~~~~~a 83 (155) |+ +.++||+++++.|..+.+++ .++.+.+...-.+..... ++.++....... .++.....+.|+-...+ T Consensus 1 m~-v~i~Gl~el~~~l~~l~~~~-~k~~~~al~~ga~~~~~~-------~k~~ap~~~~~~~~~~h~~d~I~~~~~k~~- 70 (128) T protein:vir:38 1 MG-VKVTGDAELLANLNKLQFGV-AKEARAAVRDGAQKFADK-------LKSNTPEWDGETDMSGHLRDDIKLSSVRET- 70 (128) T ss_pred Cc-cchhhHHHHHHHHHHhHHHH-HHHHHHHHHHHHHHHHHH-------HHHhCCCcCCCCcccchhhhhhcccccccc- Confidence 76 69999999999999998775 566666655544444433 444442221110 01111111112111111 Q ss_pred ccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019933. 
84 PHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLAELRS 153 (155) Q Consensus 84 ~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~k~~~ 153 (155) ....++.-| ...+..+||.|.|+ ...-+|= +|=++...++..+.+.+.+.+.|.+.+= T Consensus 71 ~g~~~~~VG--------~~k~~~~y~~f~E~---GT~k~~a-~pF~~pa~~~~~~~~~~~~~~~l~k~i~ 128 (128) T protein:vir:38 71 SGLTEVDVG--------YGKDTGWRAHFPNS---GTSMQDP-QHFIEETQEIMRPVVIAAFLSHLKEGGM 128 (128) T ss_pred CceeEEEee--------ecCCCceEEeeecc---CccCCCC-CcchhHHHHHhHHHHHHHHHHHHHhhcC Confidence 011112233 23345689999998 5555553 5666666666666666666666655555 No 190 >protein:vir:4200 Length: 133 # NCBI annotation: unknown # Family: family:all:11764 # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071825;genbank:gi:11863108;genbank:GeneID:1257610 Probab=89.79 E-value=0.0031 Score=34.35 Aligned_cols=125 Identities=14% Similarity=0.081 Sum_probs=61.3 Q ss_pred eeeccHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecCC Q lcl|NC_019933. 5 ITSLDIS---GVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNKK 81 (155) Q Consensus 5 m~~~~l~---~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~ 81 (155) |..+.+| .|.++-.+.++.. .+.+..-...++.-+-.-||++||+||.|=..+... | .|.. .. T Consensus 1 mi~i~idkp~almek~~ev~~~i----e~t~~~~~~~l~~i~~ntapiktg~lr~sh~~sieg--s-tgel-------sn 66 (133) T protein:vir:42 1 MIEIRIDKPDALMEKPHEVQGKI----EETLEKILNQLQGIAENTAPVKTGNLRDSHIISIEG--S-TGEL-------SN 66 (133) T ss_pred CeeeecCCchhhhcchhhhhhHH----HHHHHHHHHHHHHHhhhccccccccceeeeeEEeec--C-ccch-------hh Confidence 4445554 4544444444333 334444455566667777999999999996543211 1 1221 24 Q ss_pred ccccchhhhccccccCCCcCCCCceeeeeeccccee---eeCCccchhhH--HHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 82 KAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPV---HVPARSFLRPG--YDSVKGRLVEVANKAGAK 146 (155) Q Consensus 82 ~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~---~~pa~PFlrPA--~~~~~~~~~~~i~~~l~~ 146 (155) .++|-.||=||-.=.. +-..+..||....--.. 
--||.-|+.-+ |-..+.-+++.+..-|++ T Consensus 67 ~~~yl~~vl~grgwvf---pv~~kal~wpelphpvayarpappndyfsa~vay~~~~give~s~iewlre 133 (133) T protein:vir:42 67 LAYYLPFVLHGRGWVF---PVRRKALWWPELPHPVAYARPAPPNDYFSAVVAYSAPEGVVEETLIEWLRE 133 (133) T ss_pred hhHHhhHhhhccccee---eccccccccCCCCCcccccCCCCCchhhhhhhhhhcccchhHHHHHHHHhC Confidence 5789999999853222 22233444433221100 11233343322 333333344444444444 No 191 >protein:vir:102608 Length: 108 # NCBI annotation: gp9 # Family: family:all:3937 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655005;genbank:gi:109392195;genbank:GeneID:4157230 Probab=85.20 E-value=0.002 Score=35.36 Aligned_cols=107 Identities=22% Similarity=0.188 Sum_probs=49.7 Q ss_pred Cceeeeecc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEec Q lcl|NC_019933. 1 MSSKITSLD-ISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWN 79 (155) Q Consensus 1 M~~~m~~~~-l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~ 79 (155) |...-+--. |..+--.|+++..+. -+.+++.+=..+|...=++|.|+.||.+|+|+.+. +++.+- -...||. T Consensus 1 ma~gpt~knplakfgi~lddfdklp--evnqgvnef~dev~aawk~nspv~~g~yrdsvqvt---erstnk-grgkvga- 73 (108) T protein:vir:10 1 MANGPTRKNPLAKFGVRLDDFDKLP--EVNQGVNEFIDEVVAAWKNNSPVGTGAYRDSVQVT---ERSTNK-GRGKVGA- 73 (108) T ss_pred CCCCCccccchhhhccchhhhhccc--hhhhhHHHHHHHHHHhhhcCCCccccccccceeec---cccccc-ccccccC- Confidence 443222111 111111222221111 12344455555566667899999999999999654 333321 1123553 Q ss_pred CCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHH Q lcl|NC_019933. 80 KKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDS 131 (155) Q Consensus 80 ~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~ 131 (155) +-|-+|+||||.. |.....+..+ -|+-|=-.||+. 
T Consensus 74 --tdpqahlvefgs~-hndeyapaqk--------------takqfggtay~d 108 (108) T protein:vir:10 74 --TDPQAHLVEFGSA-HNDEYAPAQK--------------TAKQFGGTAYGD 108 (108) T ss_pred --cchhhhhhhhhcc-ccccccchhh--------------hHHhhcccccCC Confidence 4467899999963 2221111110 022233333333 No 192 >protein:vir:105825 Length: 108 # NCBI annotation: gp9 # Family: family:all:3937 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655770;genbank:gi:109522093;genbank:GeneID:4157633 Probab=85.20 E-value=0.002 Score=35.36 Aligned_cols=107 Identities=22% Similarity=0.188 Sum_probs=49.7 Q ss_pred Cceeeeecc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEec Q lcl|NC_019933. 1 MSSKITSLD-ISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWN 79 (155) Q Consensus 1 M~~~m~~~~-l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~ 79 (155) |...-+--. |..+--.|+++..+. -+.+++.+=..+|...=++|.|+.||.+|+|+.+. +++.+- -...||. T Consensus 1 ma~gpt~knplakfgi~lddfdklp--evnqgvnef~dev~aawk~nspv~~g~yrdsvqvt---erstnk-grgkvga- 73 (108) T protein:vir:10 1 MANGPTRKNPLAKFGVRLDDFDKLP--EVNQGVNEFIDEVVAAWKNNSPVGTGAYRDSVQVT---ERSTNK-GRGKVGA- 73 (108) T ss_pred CCCCCccccchhhhccchhhhhccc--hhhhhHHHHHHHHHHhhhcCCCccccccccceeec---cccccc-ccccccC- Confidence 443222111 111111222221111 12344455555566667899999999999999654 333321 1123553 Q ss_pred CCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHH Q lcl|NC_019933. 80 KKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDS 131 (155) Q Consensus 80 ~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~ 131 (155) +-|-+|+||||.. |.....+..+ -|+-|=-.||+. 
T Consensus 74 --tdpqahlvefgs~-hndeyapaqk--------------takqfggtay~d 108 (108) T protein:vir:10 74 --TDPQAHLVEFGSA-HNDEYAPAQK--------------TAKQFGGTAYGD 108 (108) T ss_pred --cchhhhhhhhhcc-ccccccchhh--------------hHHhhcccccCC Confidence 4467899999963 2221111110 022233333333 No 193 >protein:vir:9823 Length: 118 # NCBI annotation: putative minor capsid protein # Family: family:all:899 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795585;genbank:gi:28876336;genbank:GeneID:1257873 Probab=84.70 E-value=0.043 Score=28.09 Aligned_cols=115 Identities=17% Similarity=0.132 Sum_probs=49.0 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecC Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNK 80 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~ 80 (155) |.+++. |+++...|. .+.+.++-...++.|..+...-||.+||.|++|..+. ++.+ -++ T Consensus 2 ~kV~vd---l~~~~~~ls------~~~~~k~Q~~~~~ev~~~~~~YVP~~tG~Lk~S~~i~-------~~~I----~Y~- 60 (118) T protein:vir:98 2 AKVVVE---LGGIKRKVS------PQALAKGKLIMNNQVMMSMNPYVPYRDGALRGSSRAN-------SVGV----TWS- 60 (118) T ss_pred ceeeec---hhHHhhhhh------HHHHHHHHHHHHHHHHHHhhcCCCCccCccccceeec-------CCee----EEC- Confidence 556554 445544442 2334455666677788888889999999999996432 1222 222 Q ss_pred CccccchhhhccccccCCCcCCCCceeeeeecccceeeeC-Cccch-hhHHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_019933. 81 KKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVP-ARSFL-RPGYDSVKGRLVEVANKAGAKRLAELRSKR 155 (155) Q Consensus 81 ~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~p-a~PFl-rPA~~~~~~~~~~~i~~~l~~~i~k~~~k~ 155 (155) +||++..=||... ... .+. -+.+...| +.+.. .++. 
+-..-.....+...+.+--+ T Consensus 61 --tPYAr~qYY~~~~-~~~---~g~-------~~~~~~~p~~g~~Wd~R~k------a~~~~~~~w~~~~~k~~g~k 118 (118) T protein:vir:98 61 --GPHARAQFYGGAY-NKY---KSF-------KFKKYTTPGTGKRWDKRAL------ANATIVKDWEKSLLRGMGFK 118 (118) T ss_pred --CchhhHhhhcccc-CCC---Ccc-------ccccccCCCCCCcccchhh------cchhhhHHHHHHHHHhcCCC Confidence 4566554444210 000 000 00000111 11111 1111 00011111222233333333 No 194 >protein:vir:3036 Length: 118 # NCBI annotation: minor capsid protein # Family: family:all:899 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438149;genbank:gi:16271812;genbank:GeneID:929237 Probab=84.70 E-value=0.043 Score=28.09 Aligned_cols=115 Identities=17% Similarity=0.132 Sum_probs=49.0 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecC Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNK 80 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~ 80 (155) |.+++. |+++...|. .+.+.++-...++.|..+...-||.+||.|++|..+. ++.+ -++ T Consensus 2 ~kV~vd---l~~~~~~ls------~~~~~k~Q~~~~~ev~~~~~~YVP~~tG~Lk~S~~i~-------~~~I----~Y~- 60 (118) T protein:vir:30 2 AKVVVE---LGGIKRKVS------PQALAKGKLIMNNQVMMSMNPYVPYRDGALRGSSRAN-------SVGV----TWS- 60 (118) T ss_pred ceeeec---hhHHhhhhh------HHHHHHHHHHHHHHHHHHhhcCCCCccCccccceeec-------CCee----EEC- Confidence 556554 445544442 2334455666677788888889999999999996432 1222 222 Q ss_pred CccccchhhhccccccCCCcCCCCceeeeeecccceeeeC-Cccch-hhHHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_019933. 81 KKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVP-ARSFL-RPGYDSVKGRLVEVANKAGAKRLAELRSKR 155 (155) Q Consensus 81 ~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~p-a~PFl-rPA~~~~~~~~~~~i~~~l~~~i~k~~~k~ 155 (155) +||++..=||... ... .+. -+.+...| +.+.. .++. 
+-..-.....+...+.+--+ T Consensus 61 --tPYAr~qYY~~~~-~~~---~g~-------~~~~~~~p~~g~~Wd~R~k------a~~~~~~~w~~~~~k~~g~k 118 (118) T protein:vir:30 61 --GPHARAQFYGGAY-NKY---KSF-------KFKKYTTPGTGKRWDKRAL------ANATIVKDWEKSLLRGMGFK 118 (118) T ss_pred --CchhhHhhhcccc-CCC---Ccc-------ccccccCCCCCCcccchhh------cchhhhHHHHHHHHHhcCCC Confidence 4566554444210 000 000 00000111 11111 1111 00011111222233333333 No 195 >protein:vir:4162 Length: 133 # NCBI annotation: unknown # Family: family:all:11764 # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046971;genbank:gi:9630541;genbank:GeneID:1261715 Probab=83.62 E-value=0.012 Score=31.06 Aligned_cols=125 Identities=14% Similarity=0.104 Sum_probs=58.3 Q ss_pred eeeccHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecCC Q lcl|NC_019933. 5 ITSLDIS---GVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNKK 81 (155) Q Consensus 5 m~~~~l~---~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~ 81 (155) |..+.+| .|.++-.+.++.. .+.+..-...++.-+-.-||++||+||.|=..+... +.|.. .. T Consensus 1 mi~i~idkp~almek~~ev~~~i----e~t~~~~~~~l~~i~~ntapiktg~lr~sh~~sieg---stgel-------sn 66 (133) T protein:vir:41 1 MIRINIDKPEALMEKASEVEDRV----EQTVTLLMIELEEILMNTAPIKTGELRISHTWSVEG---STGEL-------TN 66 (133) T ss_pred CeeeecCCchhhhcchhhhhhHH----HHHHHHHHHHHHHHhhhccccccccceeeeeEEeec---Cccch-------hh Confidence 4445554 4544444444333 334444455566667777999999999996543211 11221 24 Q ss_pred ccccchhhhccccccCCCcCCCCceeeeeeccccee---eeCCccchhhH--HHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_019933. 82 KAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPV---HVPARSFLRPG--YDSVKGRLVEVANKAGAKRLAELRSK 154 (155) Q Consensus 82 ~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~---~~pa~PFlrPA--~~~~~~~~~~~i~~~l~~~i~k~~~k 154 (155) .++|-.||-||-.=.. +-..+..||....--.. --||.-|+.-+ |-..+.-+++.+..-| -. 
T Consensus 67 ~~~yl~~vl~grgwvf---pv~~kal~wpelphpvayarpappndyfsa~vay~~~~give~s~iewl--------is 133 (133) T protein:vir:41 67 TVPYLQWVLFGRGWVF---PVEKKALYWPELPHPVAYARPAPPNDYFSAAVAYIDAKGIVEDSFIEWL--------IS 133 (133) T ss_pred hhHHhhHhhhccccee---eecccccccCCCCCcccccCCCCCchhhhhhhhhhcccchhHHHHHHHh--------cC Confidence 5789999999853222 22233344433221000 11233333322 2222222333332222 22 No 196 >protein:vir:97088 Length: 157 # NCBI annotation: hypothetical protein # Family: family:all:2714 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453568;genbank:gi:84662603;genbank:GeneID:5142503 Probab=83.23 E-value=0.07 Score=26.91 Aligned_cols=131 Identities=10% Similarity=-0.067 Sum_probs=74.8 Q ss_pred CceeeeeccHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHH-------Hh-----CCCCcchhhcceeeeeccccc Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLR-DDSDSVSRTMAFESAAVVRDSAK-------AH-----VRSKTGRLKGAIYAVYVPEES 67 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~-~~~~~~~r~a~~~~a~~i~~eak-------~~-----aP~~tG~Lr~sI~~~~~~~~~ 67 (155) =++.++ +|....+.|.+.. +..+++++.++..--+.++.-|- .+ .+.++|+......+.... +. T Consensus 6 ~~~d~s--~l~~~l~~l~~~~~~v~R~A~~~ga~vv~dear~~aP~~tG~LkksI~~~~~~~~s~~g~~~~~Vg~~~-~~ 82 (157) T protein:vir:97 6 RSVDIT--GILAGLETVVEHSSDVVRTMTYESAVAVRESAKAFVNDETGKLRNNLYVAYSPEESVEGIQTYAVSWRK-KA 82 (157) T ss_pred ecccHH--HHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhheeeeeccccCCCceEEEEEeecC-Cc Confidence 044444 6665555555554 45677777776666666555442 11 255666665443232211 11 Q ss_pred CCceEEEEEE--------ecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHH Q lcl|NC_019933. 68 TEVRHVYAVS--------WNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEV 139 (155) Q Consensus 68 ~~g~~~~~Vg--------~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~ 139 (155) ....+.+.-| .++...+||+|+|+||....... 
+| -+|=+.-.-+...+.+.++ T Consensus 83 a~~g~~vEfG~~~~~~~~~~~~~~~~~~~~~~~t~~~~Pa~------------PF------lRPA~d~~k~~a~~~~~~~ 144 (157) T protein:vir:97 83 APHGHLLEFGHWQTHAAYRDKDGQWYSSKVKLVNPKWIPAK------------PF------LRPGYDSVAMQIPDIARAA 144 (157) T ss_pred cceeeeeecCcccccccccCCcccccccccccCCCCcCCCC------------cc------cchHHHHhHHHHHHHHHHH Confidence 1111111112 22334689999999995322111 11 4688888999999999999 Q ss_pred HHHHHHHHHHHHh Q lcl|NC_019933. 140 ANKAGAKRLAELR 152 (155) Q Consensus 140 i~~~l~~~i~k~~ 152 (155) |.++|.+.|.==. T Consensus 145 l~k~I~e~l~g~~ 157 (157) T protein:vir:97 145 GAKKYAELQRGDT 157 (157) T ss_pred HHHHHHHHhcCCC Confidence 9999988884444 No 197 >protein:vir:79555 Length: 192 # NCBI annotation: putative tail component # Family: family:all:869 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272521;genbank:gi:148609390;genbank:GeneID:5204391 Probab=78.17 E-value=0.12 Score=25.69 Aligned_cols=133 Identities=14% Similarity=0.169 Sum_probs=55.2 Q ss_pred eccHHHHHHHHHHHHH---------HHHHHHHHHHHHHHHHHHHHHHH------hCCCC-------------cchhhcce Q lcl|NC_019933. 7 SLDISGVLSALNDLRD---------DSDSVSRTMAFESAAVVRDSAKA------HVRSK-------------TGRLKGAI 58 (155) Q Consensus 7 ~~~l~~L~~~l~~l~~---------~~~~~~r~a~~~~a~~i~~eak~------~aP~~-------------tG~Lr~sI 58 (155) +-||+.+++.|++|.. ...++...|+..++..|..+..+ .+|.+ +|.+...| T Consensus 1 ~kgl~~a~~nl~~l~~~~vp~A~~~ainrva~ra~~~t~~~v~~~~~~~~~~~~~I~~k~iR~R~r~~ka~~~~~~~~~I 80 (192) T protein:vir:79 1 MKGLENAIRNLNSLDTRMVPQASAWAINRVAQKAVSVATRQVAGNTVAGDNQVKGIPLKLVRQRVRVFKASPSGKMTARI 80 (192) T ss_pred CchHHHHHHHHHhcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhcCcHHHHHhhhhcccccCCCceEEEE Confidence 4567777777777642 22345667777788888877532 22311 13332222 Q ss_pred eeeeccc-------------c-cC---CceEEEEEEecCCccccchhhhccccccCCCcCCCCceeeeeecc------cc Q lcl|NC_019933. 
59 YAVYVPE-------------E-ST---EVRHVYAVSWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKL------AT 115 (155) Q Consensus 59 ~~~~~~~-------------~-~~---~g~~~~~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~------~g 115 (155) .+...+- + .+ .+...+.|| ++-+-+-|+ ....+|.|-.+.... .. T Consensus 81 ~v~~~~l~ai~lg~~r~r~~rr~~~~~~~~s~~~vG---k~~f~gaFi---------a~m~ngr~~V~~R~~gk~R~PIe 148 (192) T protein:vir:79 81 RVNRGNLPAIKLGTARVRLARRGGKLQYRGSVLKVG---KYLFRDAFI---------QQLANGRWHVMRRIDGKNRYPID 148 (192) T ss_pred EEecCceeeeeecccccccccccccccccccceEEc---ceecCchhc---------cccCCCCccceEecCCCccCCee Confidence 1110000 0 00 001111111 111111121 111222222222211 12 Q ss_pred eeeeC-CccchhhHHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_019933. 116 PVHVP-ARSFLRPGYDSVKGRLVEVANKAGAKRLAELRSKR 155 (155) Q Consensus 116 t~~~p-a~PFlrPA~~~~~~~~~~~i~~~l~~~i~k~~~k~ 155 (155) -+++| ++|. ..||+. ++...+...+.+.|..+|+.| T Consensus 149 vvkIpis~~l-~~af~~---e~~r~~~~~~~~el~~~L~~q 185 (192) T protein:vir:79 149 VVKIPLSGPL-TQAFED---ARDRIIAAEMPKQLGYALKQQ 185 (192) T ss_pred eEeechHHHH-HHHHHH---HHHHHHHHHHHHHHHHHHHHH Confidence 23444 3443 355554 455555566666666666666 No 198 >protein:vir:94654 Length: 142 # NCBI annotation: tail component protein # Family: family:all:1084 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579211;genbank:gi:93007447;genbank:GeneID:5076773 Probab=77.64 E-value=0.049 Score=27.77 Aligned_cols=122 Identities=11% Similarity=0.065 Sum_probs=63.3 Q ss_pred eeeccHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecCCcc Q lcl|NC_019933. 5 ITSLDIS-GVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNKKKA 83 (155) Q Consensus 5 m~~~~l~-~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~~a 83 (155) |+.+.+. +++++.+.|.+..+.+ ..++..+-..+..+....|- .. .+.++++....+.+.+..... 
T Consensus 1 Ma~~~~~~~~~~l~~~l~~~~~~~-~~~~~~~l~~~a~~i~~~ak-------~~-----aPv~TG~Lr~SI~~~~~~~g~ 67 (142) T protein:vir:94 1 MAGLNYRVNSTEFQGALRAALDRL-TGAAREATEAAANDMVNMAK-------GL-----CPVDTGRLRSSIQAVPSGGRF 67 (142) T ss_pred CceeEEEecHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHH-------Hh-----CCccchhhhccceeeeccCCc Confidence 9988887 8888888777776654 45666666666555555542 11 223344444443333322211 Q ss_pred ccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHH-------------------HHHHHHHHHHHHHH Q lcl|NC_019933. 84 PHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYD-------------------SVKGRLVEVANKAG 144 (155) Q Consensus 84 ~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~-------------------~~~~~~~~~i~~~l 144 (155) .+ -++.|+ ...|..++|+||..+.-.|-.++++. .--..+++.-+..| T Consensus 68 ~~--~~~v~~-----------~~~YA~~vE~Gt~~~~i~pk~~k~l~~~~~~~~~~~v~~pG~~~~pfl~~A~~~~~~~i 134 (142) T protein:vir:94 68 SF--SVTIGT-----------NVTYAADVEYGTAPHVIVPKDKKALYWPGAAHPVAKVNHPGTRAQPFMRPAIAAASTFL 134 (142) T ss_pred eE--EEEEec-----------CcccchhhhccCCCceeccCCCccceecccceeeeeeeecCCCCCcchhHHHHHHHHHH Confidence 11 123333 23477788888854322232222210 11123444556666 Q ss_pred HHHHHHHh Q lcl|NC_019933. 145 AKRLAELR 152 (155) Q Consensus 145 ~~~i~k~~ 152 (155) .+.|+++. T Consensus 135 ~~~~~~~~ 142 (142) T protein:vir:94 135 RNHAKGIR 142 (142) T ss_pred HHHHHhcC Confidence 77777776 No 199 >protein:vir:102154 Length: 119 # NCBI annotation: phage protein, HK97 gp10 family # Family: family:all:10671 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699937;genbank:gi:110804042;genbank:GeneID:4206698 Probab=75.74 E-value=0.09 Score=26.31 Aligned_cols=119 Identities=12% Similarity=0.033 Sum_probs=70.1 Q ss_pred eeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecCCccc Q lcl|NC_019933. 5 ITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNKKKAP 84 (155) Q Consensus 5 m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~~a~ 84 (155) |..+.|+|+++.++.|.++.... 
..--++|-..-.+ |+.. .+..+. +.+++.... |.....+. T Consensus 1 Ma~iel~G~del~~~l~~~g~~~-~~ie~kAlk~g~e------~I~~-~~~~n~-----P~~tg~lkk---ik~~~kk~- 63 (119) T protein:vir:10 1 MASLEIEGFEEFEKFISEDMVLD-ESTKRKGIKAGIT------KIGK-AIEKNS-----PIKSGRLSK---VKIRVKNT- 63 (119) T ss_pred CceeehhhHHHHHHHHHhhhhhh-HHHHHHHHHHHhH------HHHH-HHhhcC-----CcccCCcce---eeeeeecC- Confidence 99999999999999999887432 2222222211111 2111 333333 333333332 22222221 Q ss_pred cchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019933. 85 HGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLAELRS 153 (155) Q Consensus 85 ~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~k~~~ 153 (155) .|+.-|+ ....+||..|.+| ...-+|-=-|=++..-+.-.+.....+.+.|.+-++ T Consensus 64 --g~~~VG~--------~ks~~fy~kF~EF---GTSkm~a~~pF~~~a~~~~~~eA~~~~~~el~~~~r 119 (119) T protein:vir:10 64 --GLATEGT--------ASSSEFYDIFQNF---GTSEQKAHVGYFDRAVDETTNEAVEEVAEIIFRKMR 119 (119) T ss_pred --ceeEecc--------CCcchhhhhhccc---cccccCCCCCccccccccChHHHHHHHHHHHHHhcC Confidence 1455554 2356688899998 888999744888877777777777777777766666 No 200 >protein:vir:94538 Length: 125 # NCBI annotation: putative head to tail joining # Family: family:all:180 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223893;genbank:gi:62327105;genbank:GeneID:5075554 Probab=75.48 E-value=0.13 Score=25.37 Aligned_cols=122 Identities=11% Similarity=0.019 Sum_probs=71.1 Q ss_pred eee---ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecCC Q lcl|NC_019933. 5 ITS---LDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNKK 81 (155) Q Consensus 5 m~~---~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~ 81 (155) |.. +.++||+++++.|....+++.+. +..+...-.......+...+ +..+++....+.+..... 
T Consensus 1 Ma~~~~i~~~Gld~l~~~L~~~~~~~~~~-v~~al~~~a~~i~~~ak~~a------------p~~tG~L~~sI~~~~~~~ 67 (125) T protein:vir:94 1 MANDFNIKFKGVDKLLDEFDISRKELVPY-SVEAMKTSLSRAVEKSKGLA------------RVDTGYMRNNIQQDEVKE 67 (125) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHhhC------------CCCChhhhhhceecceec Confidence 888 57889999999999887777654 44444433333333332121 222333332222221111 Q ss_pred ccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_019933. 82 KAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLAELRSKR 155 (155) Q Consensus 82 ~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~k~~~k~ 155 (155) +.. +-.++.| ....|+.++|| .+.-+| =+|=|....++....+.+.|++.|.++++.- T Consensus 68 ~~~-~~~~~v~-----------~~~~Ya~~vEf---GT~~~~-a~Pfl~pa~~~~~~~~~~~l~~~l~~a~k~~ 125 (125) T protein:vir:94 68 EHG-VVTGRYV-----------ARADYSSYNEY---GTYRMS-AQPFMAPSVAAMTPFFYKAVRDALNKAAKFS 125 (125) T ss_pred cCC-cEEEEee-----------CCCCccceeec---ccccCC-CCcccchhHHHHHHHHHHHHHHHHHHHhccC Confidence 100 0001112 12346777777 444444 3677888888899999999999999999888 No 201 >protein:vir:5978 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690678;genbank:geneid:6329146;genbank:gi:22855072;interpro:IPR011693;uniprot:O48447;genbank:GeneID:955318 Probab=72.73 E-value=0.18 Score=24.68 Aligned_cols=121 Identities=12% Similarity=0.003 Sum_probs=68.7 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecC Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNK 80 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~ 80 (155) |+..-..+++++|.+..+.|....++ +++++.++.....+++...+-.. .+.++++....+.+.+.. 
T Consensus 1 m~~ms~~i~~~g~~~l~~~l~~~~~~-~~~~v~~~l~~~a~~i~~~ak~~------------apv~TG~Lr~SI~~~~~~ 67 (144) T protein:vir:59 1 MALMSVRIDPSWRRIMSRNVRTFSGH-VLTQVEQVIIKTAEKIAGLAASL------------APVDEGNLKNSIQIDYKN 67 (144) T ss_pred CCcceeeehhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHh------------CCccchhhhcCeeEEeec Confidence 88877777889999999999776655 56777777777766666544311 223344444433332221 Q ss_pred CccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCcc---------------------ch--hhHHHHHHHHHH Q lcl|NC_019933. 81 KKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARS---------------------FL--RPGYDSVKGRLV 137 (155) Q Consensus 81 ~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~P---------------------Fl--rPA~~~~~~~~~ 137 (155) +. .. .+.|+ ..-|..++|+||..+...| ++ +|-|. .++ T Consensus 68 ~g-~~---~~V~~-----------~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~----pA~ 128 (144) T protein:vir:59 68 NG-LT---AEITV-----------GAEYAIYVEYGTGIYAVDGNGRKTPWTYYSPKLGRYVRTQGAPAQPFFW----PAV 128 (144) T ss_pred Cc-EE---EEEec-----------CCCccchhhcCccccccCCCccccccccccccccceecCCCCCCCcchh----HHH Confidence 11 00 12222 1237778888885543332 33 33343 344 Q ss_pred HHHHHHHHHHHHHHhc Q lcl|NC_019933. 138 EVANKAGAKRLAELRS 153 (155) Q Consensus 138 ~~i~~~l~~~i~k~~~ 153 (155) +.-+..+.+.|.++.- T Consensus 129 ~~~~~~~~~~i~~~~g 144 (144) T protein:vir:59 129 EEGGEYFEREMRRLRG 144 (144) T ss_pred HHHHHHHHHHHHHhcC Confidence 4556666666777766 No 202 >protein:vir:96121 Length: 137 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240082;genbank:gi:66395767;genbank:GeneID:5133101 Probab=71.21 E-value=0.2 Score=24.43 Aligned_cols=111 Identities=6% Similarity=-0.088 Sum_probs=58.4 Q ss_pred eccH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecCCcccc Q lcl|NC_019933. 
7 SLDI-SGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNKKKAPH 85 (155) Q Consensus 7 ~~~l-~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~~a~~ 85 (155) +-.+ .+|+++++.|.+..++ +++++.++......++...+-.. .+..+++......+.+..+ ... T Consensus 1 Ma~~~~G~~~l~~~l~~~~~~-~~~~~~~~l~~~a~~~~~~ak~~------------~pvdTG~L~~Si~~~~~~~-g~~ 66 (137) T protein:vir:96 1 MAKVKYGNWDLVAELEDYRDE-MEEWVKKGILKTTLAIYNTAVAL------------APVDLGFLKESIDFKVTDG-GFS 66 (137) T ss_pred CchhHhhHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHh------------CCcCccchhcCceeEeecC-ceE Confidence 4464 4999999999887544 46777777766666665543311 1223444333333322211 111 Q ss_pred chhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhh-------------------------HHHHHHHHHHHHH Q lcl|NC_019933. 86 GHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRP-------------------------GYDSVKGRLVEVA 140 (155) Q Consensus 86 ~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrP-------------------------A~~~~~~~~~~~i 140 (155) .+.|+ ..-|..++|+||..+.++|...+ -|... ++.. T Consensus 67 ---~~V~~-----------~~~YA~yvE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~pFl~pA----~~~~ 128 (137) T protein:vir:96 67 ---SVISV-----------GAEYAIYVEFGTGIYATGPGGSRARKLPWTYKGDDGEWHTTYGQQAQPFWNPA----IDEG 128 (137) T ss_pred ---EEEec-----------CCCcccccccCccccccCCCccccccccceeeccCcceeecCCCCCCcchhHH----HHHH Confidence 12222 12378899999988877776533 33322 2333 Q ss_pred HHHHHHHHHHHhc Q lcl|NC_019933. 141 NKAGAKRLAELRS 153 (155) Q Consensus 141 ~~~l~~~i~k~~~ 153 (155) +..|.+.+. 
T Consensus 129 ----~~~i~k~i~ 137 (137) T protein:vir:96 129 ----RKVFNRYFS 137 (137) T ss_pred ----HHHHHHhhC Confidence 333333333 No 203 >protein:vir:396 Length: 184 # NCBI annotation: gp11 # Family: family:all:869 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046906;genbank:gi:9630476;genbank:GeneID:1261650 Probab=63.98 E-value=0.31 Score=23.39 Aligned_cols=138 Identities=12% Similarity=0.071 Sum_probs=73.4 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEE-ecCCccccc Q lcl|NC_019933. 8 LDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVS-WNKKKAPHG 86 (155) Q Consensus 8 ~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg-~~~~~a~~~ 86 (155) +++++|++.+..|.....+++..|+..+-+-....++..+-. .+.+...+....-+ ....+. ...+ ...+ T Consensus 1 ~~v~~l~~~~~~L~~l~~~~v~kA~~rAiNrt~~~~rt~~~r---~v~~~~~i~~~~ir-----~r~~~~kas~~-~l~a 71 (184) T protein:vir:39 1 MSLKGLEQAIENLNSISKTAVPRASAQAVNRVANRAVSRSVA---VVSKDTRVPRKLVK-----QRARVKRATVN-KPRA 71 (184) T ss_pred CchHHHHHHHHHHhccCHhHHHHHHHHHHHHHHHHHHHHHHH---HHHHHhcCCHHHHH-----hhheecccCCC-CeEE Confidence 999999999999999999999999999999988888877653 34444332211111 111121 1111 1111 Q ss_pred hhhhcccccc-----CCCcCCC----------------Cce-eeeeec---ccceee---------eCC---c-cchhhH Q lcl|NC_019933. 87 HLVEYGHWRT-----NVVAEVD----------------GKW-LFTKEK---LATPVH---------VPA---R-SFLRPG 128 (155) Q Consensus 87 ~~vEfGt~~~-----~~~~~~~----------------~~~-~~~~~~---~~gt~~---------~pa---~-PFlrPA 128 (155) . |-.+.... +...... |+. +..-|. ..|-.+ .|= . |.--|+ T Consensus 72 ~-I~~~~~~i~l~~~g~~~~k~~~~~~~~~~~~~~~~~g~~~~~gaFia~~~~G~~~Vf~R~gk~R~PI~~~~~~i~~~~ 150 (184) T protein:vir:39 72 L-IRVNRGNLPAIKLGTASVRLSRRKRDKKGANSVLRIGPFRFPGGFIQQLKNGRWHVMRRTSKPRYPIEVVSIPLAAPL 150 (184) T ss_pred E-EEEeccceeeeeccccccccCccccccccccceeeecceecCcceeeecCCCceEEEEEecCcccceeEEEcCchHHH Confidence 1 21111111 1111000 000 001110 011111 121 1 545565 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_019933. 
129 YDSVKGRLVEVANKAGAKRLAELRSKR 155 (155) Q Consensus 129 ~~~~~~~~~~~i~~~l~~~i~k~~~k~ 155 (155) -+...+++.+.+...+.+.|++.|..+ T Consensus 151 ~e~~~~~~~~~~~~~~~~el~~~l~~~ 177 (184) T protein:vir:39 151 TTAFKEELPKLMESDMPKELRASLTNQ 177 (184) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 567777888888888888888888888 No 204 >protein:vir:94108 Length: 149 # NCBI annotation: ORF029 # Family: family:all:180 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240238;genbank:gi:66395914;genbank:GeneID:5133277 Probab=63.34 E-value=0.32 Score=23.31 Aligned_cols=125 Identities=9% Similarity=-0.107 Sum_probs=59.7 Q ss_pred Cceee------eecc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEE Q lcl|NC_019933. 1 MSSKI------TSLD-ISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHV 73 (155) Q Consensus 1 M~~~m------~~~~-l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~ 73 (155) |...- -+.. +.||+++.+.|.+..++ +.+++.++......++...|-.. .+.++++.... T Consensus 1 ~~~~~~~~~~~~Ma~~~~Gld~l~~~L~~~~~~-~~~~~~~al~~~a~~v~~~ak~~------------aPvdTG~Lr~S 67 (149) T protein:vir:94 1 MKLSYYDLSRCHMAKVKYGADSMVVELDKFDKK-IEEWVKKGIAKTTTKIYNTAVAL------------APVDLGFLEES 67 (149) T ss_pred CeeeeeecchhhHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHh------------CCcccchhhcC Confidence 32211 1334 34788888888776665 46788888877777776665321 12234444333 Q ss_pred EEEEecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHH-----------------HHHH Q lcl|NC_019933. 74 YAVSWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSV-----------------KGRL 136 (155) Q Consensus 74 ~~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~-----------------~~~~ 136 (155) +.+.+..+. .. .+.|+ ..-|..++|+||..+..+|..+.+.... .+-. 
T Consensus 68 I~~~~~~~g-~~---~~V~~-----------~~~YA~~VE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PF 132 (149) T protein:vir:94 68 IDFKYFDGG-LS---SVISV-----------GADYAIYVEYGTGIYATGPGGSRATKIPWSFKGDDGEWYTTYGQAPQPF 132 (149) T ss_pred eeEEeeCCc-EE---EEEec-----------CCCcccccccCccccccCCCccccccccceeecCccceecCCCCCCCcc Confidence 333222111 00 11121 1237888999998877766555332100 0001 Q ss_pred HHHHHHHHHHHHHHHhc Q lcl|NC_019933. 137 VEVANKAGAKRLAELRS 153 (155) Q Consensus 137 ~~~i~~~l~~~i~k~~~ 153 (155) +.--.+.-+..|.+.++ T Consensus 133 l~pA~~~~~~~i~~~i~ 149 (149) T protein:vir:94 133 WNPAIDAGRKTFEQYFS 149 (149) T ss_pred hHHHHHHHHHHHHHhhC Confidence 11111223334444444 No 205 >protein:vir:107099 Length: 137 # NCBI annotation: conserved phage protein # Family: family:all:180 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950610;genbank:gi:119953690;genbank:GeneID:4643108 Probab=60.75 E-value=0.37 Score=22.97 Aligned_cols=115 Identities=9% Similarity=-0.048 Sum_probs=61.4 Q ss_pred eccH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecCCcccc Q lcl|NC_019933. 7 SLDI-SGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNKKKAPH 85 (155) Q Consensus 7 ~~~l-~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~~a~~ 85 (155) +..+ .||+++.+.|.+..+ .+.+++..+......++...|-.. .+.++++......+-+..+. .. T Consensus 1 Ma~~~~Gl~~l~~~l~~~~~-~~~~~~~~al~~~a~~i~~~ak~~------------aPvdTG~Lr~SI~~~~~~~~-~~ 66 (137) T protein:vir:10 1 MAKVKYGNWELVKELEDFEK-ETIRWAKKGIAKTTTIIHNSIVSN------------MPVDTGYLRESVSMDFKKGG-LT 66 (137) T ss_pred CchhHhhHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHh------------CCcCcchhhcCeeEEeeCCc-EE Confidence 4464 599999999977665 446677777766666666544322 23344554444333322211 00 Q ss_pred chhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHH---------------------HHHHHHHHHHHH Q lcl|NC_019933. 
86 GHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSV---------------------KGRLVEVANKAG 144 (155) Q Consensus 86 ~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~---------------------~~~~~~~i~~~l 144 (155) .+.|+ ..-|..++|+||..+..+|..+++.... -..+++.- T Consensus 67 ---~~V~~-----------~~~Ya~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~---- 128 (137) T protein:vir:10 67 ---GVINI-----------GSEYAVYVNYGTGIYAVGPGGSRAKNIPWCYKDADGHWHTTKGQHAQPFWEPAIDEG---- 128 (137) T ss_pred ---EEEec-----------CCCcccccccCccccccCCCccccccccceeeccccceeccCCCCCCcchhHHHHHH---- Confidence 11121 1237788999998888888877654221 11222333 Q ss_pred HHHHHHHhc Q lcl|NC_019933. 145 AKRLAELRS 153 (155) Q Consensus 145 ~~~i~k~~~ 153 (155) +..|.+.+. T Consensus 129 ~~~i~k~i~ 137 (137) T protein:vir:10 129 RAFFNKYFS 137 (137) T ss_pred HHHHHHhcC Confidence 333333344 No 206 >protein:vir:101654 Length: 126 # NCBI annotation: gp17 # Family: family:all:11115 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654772;genbank:gi:109302770;genbank:GeneID:4156088 Probab=60.56 E-value=0.11 Score=25.74 Aligned_cols=108 Identities=12% Similarity=0.040 Sum_probs=51.7 Q ss_pred eccHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHH---hCC--------------CCcchhhcceeeeecccccC Q lcl|NC_019933. 7 SLDISGVLSALNDLRDDSD-SVSRTMAFESAAVVRDSAKA---HVR--------------SKTGRLKGAIYAVYVPEEST 68 (155) Q Consensus 7 ~~~l~~L~~~l~~l~~~~~-~~~r~a~~~~a~~i~~eak~---~aP--------------~~tG~Lr~sI~~~~~~~~~~ 68 (155) +.||..|..+-+--..... -++-.++-+-|++|++-=.. -.| .++|+..+||.+.+.+.+++ T Consensus 1 mkglgnliskadiaaaiatspavhagliakakevqeywveywnsiphphsrthtlksgyvenpgdyaksirvsfiksksg 80 (126) T protein:vir:10 1 MKGLGNLISKADIAAAIATSPAVHAGLIAKAKEVQEYWVEYWNSIPHPHSRTHTLKSGYVENPGDYAKSIRVSFIKSKSG 80 (126) T ss_pred CcchhhhhhhhhhhhhhhcccchhhhhhhhhHHHHHHHHHHhhcCCCccccccccccccccCchhhhhhhheeeeecccC Confidence 2344444444332222221 23455555566666664322 123 36899999999998887766 Q ss_pred CceEEEEEEecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCC Q lcl|NC_019933. 
69 EVRHVYAVSWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPA 121 (155) Q Consensus 69 ~g~~~~~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa 121 (155) -..+ +|.. +-+..+|+|||.. +-..-.+......+ |-.+|...+.| T Consensus 81 lpka--rvma---tdykswwieygak-hmpefaprahtlah-fegggattvsa 126 (126) T protein:vir:10 81 LPKA--RVMA---TDYKSWWIEYGAK-HMPEFAPRAHTLAH-FEGGGATTVSA 126 (126) T ss_pred Cccc--ceeh---hhhhHHHHhhhhh-hcccccccchhhhh-ccCCccccccC Confidence 4443 3332 2345678999963 21111111111111 11223233444 No 207 >protein:vir:7859 Length: 126 # NCBI annotation: gp16 # Family: family:all:11115 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817466;genbank:gi:29565895;genbank:GeneID:1259088 Probab=60.56 E-value=0.11 Score=25.74 Aligned_cols=108 Identities=12% Similarity=0.040 Sum_probs=51.7 Q ss_pred eccHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHH---hCC--------------CCcchhhcceeeeecccccC Q lcl|NC_019933. 7 SLDISGVLSALNDLRDDSD-SVSRTMAFESAAVVRDSAKA---HVR--------------SKTGRLKGAIYAVYVPEEST 68 (155) Q Consensus 7 ~~~l~~L~~~l~~l~~~~~-~~~r~a~~~~a~~i~~eak~---~aP--------------~~tG~Lr~sI~~~~~~~~~~ 68 (155) +.||..|..+-+--..... -++-.++-+-|++|++-=.. -.| .++|+..+||.+.+.+.+++ T Consensus 1 mkglgnliskadiaaaiatspavhagliakakevqeywveywnsiphphsrthtlksgyvenpgdyaksirvsfiksksg 80 (126) T protein:vir:78 1 MKGLGNLISKADIAAAIATSPAVHAGLIAKAKEVQEYWVEYWNSIPHPHSRTHTLKSGYVENPGDYAKSIRVSFIKSKSG 80 (126) T ss_pred CcchhhhhhhhhhhhhhhcccchhhhhhhhhHHHHHHHHHHhhcCCCccccccccccccccCchhhhhhhheeeeecccC Confidence 2344444444332222221 23455555566666664322 123 36899999999998887766 Q ss_pred CceEEEEEEecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCC Q lcl|NC_019933. 69 EVRHVYAVSWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPA 121 (155) Q Consensus 69 ~g~~~~~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa 121 (155) -..+ +|.. +-+..+|+|||.. 
+-..-.+......+ |-.+|...+.| T Consensus 81 lpka--rvma---tdykswwieygak-hmpefaprahtlah-fegggattvsa 126 (126) T protein:vir:78 81 LPKA--RVMA---TDYKSWWIEYGAK-HMPEFAPRAHTLAH-FEGGGATTVSA 126 (126) T ss_pred Cccc--ceeh---hhhhHHHHhhhhh-hcccccccchhhhh-ccCCccccccC Confidence 4443 3332 2345678999963 21111111111111 11223233444 No 208 >protein:vir:102963 Length: 163 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945289;genbank:gi:39653724;uniprot:Q708M3;genbank:GeneID:2672877 Probab=59.32 E-value=0.39 Score=22.80 Aligned_cols=138 Identities=8% Similarity=0.015 Sum_probs=85.6 Q ss_pred eee-ccHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhCCCCcch-hhcceee--------------eeccccc Q lcl|NC_019933. 5 ITS-LDISGVLSALNDLRDDSDS-VSRTMAFESAAVVRDSAKAHVRSKTGR-LKGAIYA--------------VYVPEES 67 (155) Q Consensus 5 m~~-~~l~~L~~~l~~l~~~~~~-~~r~a~~~~a~~i~~eak~~aP~~tG~-Lr~sI~~--------------~~~~~~~ 67 (155) |+. +++++|++..++|...+.. -+...+.+.++.+..+..+.|-.+|-. ..+.-.. ...+..+ T Consensus 1 m~~~~d~~~l~~f~k~l~~~~~~~~~~~~~~~~~~e~a~~ll~~vk~rtPv~~~~~~~~~~~~~~~k~~k~~~~~~~k~t 80 (163) T protein:vir:10 1 MSGGFDYRSFAKFANNFNRNANHAKVDRFMRQTLNYEGTELKSKVKERTPVGVYTDHWVEFTTKDGKHVKFWASAHGKQG 80 (163) T ss_pred CCCccCHHHHHHHHHHHHHHhhhcchHHHHHHHHHHHHHHHHHHHHHhCCcccchhhhhhhhhcccchhhhhcccccccc Confidence 663 4688999999999999975 578889999999999998888877663 1111000 0011122 Q ss_pred CCceEEEEEEecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccch--hhHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 68 TEVRHVYAVSWNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFL--RPGYDSVKGRLVEVANKAGA 145 (155) Q Consensus 68 ~~g~~~~~Vg~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFl--rPA~~~~~~~~~~~i~~~l~ 145 (155) ++....+.+|....+. -..-||- .....|+...|.|....+ .=|. 
++=|+...+++.+.+-+.|+ T Consensus 81 G~lr~swk~~~~~k~~-~~~~v~v-----------~N~~~YA~~VE~GHR~~~-gGfV~G~fml~~s~~~~~~~~~~~~e 147 (163) T protein:vir:10 81 GTLQKGWSKSRIEVSG-RTYKQKV-----------YNKVYYAPHVEYGHKTVN-GGFVPGQFFLHKTVEDTKSDMEKRVR 147 (163) T ss_pred chhhccceecceeecC-CceEEEE-----------EecCCccchhhcceeecC-CceeccchhhHHHHHHHHHHHHHHHH Confidence 2223333332211000 0000111 112235667788865555 3465 34568889999999999999 Q ss_pred HHHHHHhccC Q lcl|NC_019933. 146 KRLAELRSKR 155 (155) Q Consensus 146 ~~i~k~~~k~ 155 (155) +.|+++++|= T Consensus 148 ~~l~~~l~k~ 157 (163) T protein:vir:10 148 DKYDGFMRKV 157 (163) T ss_pred HHHHHHHHHh Confidence 9999999999 No 209 >protein:vir:78163 Length: 92 # NCBI annotation: hypothetical protein # Family: family:all:29889 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294805;genbank:gi:149882826;genbank:GeneID:5309191 Probab=59.16 E-value=0.062 Score=27.22 Aligned_cols=92 Identities=17% Similarity=0.097 Sum_probs=44.5 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecC Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNK 80 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~ 80 (155) |.--++.-. .-+++.|+.- -++.-..-+|+.-...|+.++|||||.+|+.+.+...+.++.. ....||... T Consensus 1 madaftpNp-~~FDqIl~s~------~VrALt~gaAe~aLa~AKAsAPVDTGAYRDGL~iE~~q~~~Rt--T~MVVG~D~ 71 (92) T protein:vir:78 1 MADAFTPNP-TWFDQIMRTP------KVRALVDGVAEETLADAKASAPVDTGAYRDGLHIEHRQGRSRE--TAMVVGSDE 71 (92) T ss_pred CCCccCCCh-hHHHHhhccc------chhhhhhhhhhhhhhhhcccCcccccccccccchhhhhccccc--eeEEeecCc Confidence 544333211 1122222211 1222233345566677999999999999999977655544322 234566554 Q ss_pred CccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHH Q lcl|NC_019933. 81 KKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKG 134 (155) Q Consensus 81 ~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~ 134 (155) ++ -+||.-|.. |+-|+...+. 
T Consensus 72 KT----lLvESrTGN-----------------------------Lakalk~~rs 92 (92) T protein:vir:78 72 KT----LLIESRTGN-----------------------------LARSVKRRRS 92 (92) T ss_pred ce----eeeecccch-----------------------------HHHHHhhhcC Confidence 33 346655431 1111111111 No 210 >protein:vir:95789 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950593;genbank:gi:119953788;genbank:GeneID:5076859 Probab=56.82 E-value=0.45 Score=22.50 Aligned_cols=112 Identities=8% Similarity=0.122 Sum_probs=64.1 Q ss_pred eeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecCCccc Q lcl|NC_019933. 5 ITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNKKKAP 84 (155) Q Consensus 5 m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~~a~ 84 (155) |+ +.++||+++++.|....+.+.+. +..+..........-+ +.. .+..++.....+.+...... T Consensus 1 ms-i~i~Gld~l~~~l~~~~~~~~~~-v~~al~~~a~~i~~~a-------k~~-----aPv~TG~Lr~sI~~~~~g~~-- 64 (114) T protein:vir:95 1 MA-IKWQGIEKLVATISNAQPKAVEQ-SLQVLKNNGEKGKRIA-------KQL-----APKDTEFLKDHITTSYPGME-- 64 (114) T ss_pred Ce-eeeehHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHH-------HHh-----CCcCchhhhhceeeecCceE-- Confidence 77 59999999999998888777544 4554444333333322 222 23334444433333221111 Q ss_pred cchhhhccccccCCCcCCCCceeeeeecccceeeeCCccch--hhHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019933. 
85 HGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFL--RPGYDSVKGRLVEVANKAGAKRLAELRS 153 (155) Q Consensus 85 ~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFl--rPA~~~~~~~~~~~i~~~l~~~i~k~~~ 153 (155) .+.| ....|+.++|+ ...++ +|-+.-.-++....+.+.|.+.|.+.++ T Consensus 65 ----~~V~-----------~~~~Ya~yvE~------GT~~~~aqPfl~pa~~~~~~~~~~~l~~~l~~~~k 114 (114) T protein:vir:95 65 ----AHIH-----------GEAGYDGYQEY------GTRFQPGTPHFRPMMEQIQPQFQKDMTDVMKGAFK 114 (114) T ss_pred ----EEee-----------cCCCccceeec------CccccCCCccchhhHHHHHHHHHHHHHHHHHhhcC Confidence 1111 12246777777 44454 5667777777777777777777777777 No 211 >protein:vir:3787 Length: 231 # NCBI annotation: orf22 # Family: family:all:743 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536827;genbank:gi:17981836;genbank:GeneID:929215 Probab=56.25 E-value=0.46 Score=22.43 Aligned_cols=141 Identities=15% Similarity=0.116 Sum_probs=61.8 Q ss_pred CceeeeeccHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHhC-----CC-------C--cch---------hh Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRD--DSDSVSRTMAFESAAVVRDSAKAHV-----RS-------K--TGR---------LK 55 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~--~~~~~~r~a~~~~a~~i~~eak~~a-----P~-------~--tG~---------Lr 55 (155) |++.|+ ++-++|.++.+.|.. +.-+.=+.-+...|..++..+++++ |. + .|. |. T Consensus 1 m~~~~~-~n~~dl~~l~~~L~ll~L~p~kRrrLl~~iak~lr~~~~~rI~~Q~~PDGs~w~pRK~~~~k~k~~rm~~kL~ 79 (231) T protein:vir:37 1 MQIRLG-LKQEDLDAFVRDLRTLNLTGKQKKKILTWTLGAIKRKSQKNIREQHSPDGTAWEKRKPVDGEIKNKRLLKKVL 79 (231) T ss_pred CCccCC-cCHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCcCchhcccccchhhHHHHHHhH Confidence 998888 455555555555542 2223334455666667777777664 32 2 222 22 Q ss_pred cceeeeecccccCCceEEEEEEecCCccccchhhhccc----------------------------------cccCCCcC Q lcl|NC_019933. 56 GAIYAVYVPEESTEVRHVYAVSWNKKKAPHGHLVEYGH----------------------------------WRTNVVAE 101 (155) Q Consensus 56 ~sI~~~~~~~~~~~g~~~~~Vg~~~~~a~~~~~vEfGt----------------------------------~~~~~~~~ 101 (155) +...+. .. 
++...+.++.+...+..++...||- .-+.+..+ T Consensus 80 ~~~~~~---~~--~~~~~~~~~~~g~~~~IA~vHQ~G~~~rv~~~~~~~~~~~~~~~pATr~QAk~Lr~lGy~v~~~k~k 154 (231) T protein:vir:37 80 RYASIL---AE--ERGKGRIYYKNPLTGEIAQKQQDGFTEHFRVFATDKNKNGSGNDRATIRQAQKLRSLGYRKRNGKNR 154 (231) T ss_pred Hhhccc---cc--cCCceEEeeecchHHHHHHHhhcCcccccchhhhhhccCCCCCCCCCHHHHHHHHHhcccccCCCCC Confidence 222111 11 1122223333433333444444442 11111100 Q ss_pred C-CCcee----ee--------------eeccc----------ceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019933. 102 V-DGKWL----FT--------------KEKLA----------TPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLAELR 152 (155) Q Consensus 102 ~-~~~~~----~~--------------~~~~~----------gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~k~~ 152 (155) . .+.+. .| +..++ =++.+|++|||-..=+. +.+.|...|++++ T Consensus 155 ~~k~~~rkps~kwI~~~ls~~qAgliIR~L~~k~~~~~~k~~W~I~~paR~FLG~~~~e--------~~~~l~~~l~~i~ 226 (231) T protein:vir:37 155 QGKTKYRLYTIKEIRERLTRTWASMEIRRLENKVNAGNGKTNWEIHVPARPFLDTREKE--------NVDILREITLKFL 226 (231) T ss_pred CCCCCcCcCCHHHHHHhhhhHHHHHHHHHHhcccccccCcceeeeecCcccccCCCHHH--------HHHHHHHHHHHHh Confidence 0 00000 00 00111 12457999999875432 2334455566666 Q ss_pred ccC Q lcl|NC_019933. 153 SKR 155 (155) Q Consensus 153 ~k~ 155 (155) .+. T Consensus 227 ~~~ 229 (231) T protein:vir:37 227 SGE 229 (231) T ss_pred ccc Confidence 666 No 212 >protein:vir:99454 Length: 150 # NCBI annotation: hypothetical protein # Family: family:all:32760 # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919085;genbank:gi:119757043;genbank:GeneID:4606107 Probab=53.79 E-value=0.52 Score=22.14 Aligned_cols=126 Identities=15% Similarity=0.116 Sum_probs=68.1 Q ss_pred eeec---cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecCC Q lcl|NC_019933. 
5 ITSL---DISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNKK 81 (155) Q Consensus 5 m~~~---~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~ 81 (155) |+++ .-|.-++.|++|.+-+++-+-.++.+-|-.|.+.--+.--.+ ...-|.....+-....|..++.-||-.+ T Consensus 1 mt~l~~f~~d~re~lld~le~~areeiap~vq~~ahdile~yg~~hdyd---v~~iiea~et~v~rr~~rvvvr~gwpep 77 (150) T protein:vir:99 1 MTTLAGFEADAREALLDELEDHAREEIAPAVQQHAHDILEAYGRENDYD---VQSIIDAAETRVERRKGSVVVRWGWPEP 77 (150) T ss_pred CCccchhhHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHhccccccc---hhhhhhhhhhheeecCCeEEEEecCCCc Confidence 5554 234566778888888888888888887777776532211111 1111211111222234667777777655 Q ss_pred ccccchhhhccccccCCCcCCCC---------------------ceeeeeecccceeeeCCccchhhHHHHHHHHHH Q lcl|NC_019933. 82 KAPHGHLVEYGHWRTNVVAEVDG---------------------KWLFTKEKLATPVHVPARSFLRPGYDSVKGRLV 137 (155) Q Consensus 82 ~a~~~~~vEfGt~~~~~~~~~~~---------------------~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~ 137 (155) . -|.|-||..|.+-.++.+ +.+..+..+......|---|+|-.|.--..+.. T Consensus 78 a----iyfergt~dhvvea~nad~lsfvwedpp~wvre~fe~e~~g~rvfl~e~~v~glpesrfirdtln~lr~~fa 150 (150) T protein:vir:99 78 A----IFFERGTVDHVVEATNADVLSFIWEDPPRWVRQGYEREGGGWRVFLPEVEVSGLPESRFIRDTLNWLRRRFA 150 (150) T ss_pred c----eeeeccchhhhhhccccchhhhhhcCchhHhHhhcCcCCCceEEEeecccccCCcchhhHHHHHHHHHHhcC Confidence 3 347889876654433221 111111222223346777899988876655554 No 213 >protein:vir:9708 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795470;genbank:gi:28876221;genbank:GeneID:1257765 Probab=50.18 E-value=0.62 Score=21.73 Aligned_cols=125 Identities=11% Similarity=-0.077 Sum_probs=68.6 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecCCccccchh Q lcl|NC_019933. 
9 DISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNKKKAPHGHL 88 (155) Q Consensus 9 ~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~~a~~~~~ 88 (155) =++||++.++.|..+.+++ .++..++...-.+.....+ +.+.-...... .......+.|+..+....-..+ T Consensus 1 mv~Gl~el~~~l~~l~~~~-~~~~~~al~~ga~~~~~~~-------k~~ap~~~~~~-~~hl~d~I~~~~~k~~~~g~~~ 71 (125) T protein:vir:97 1 MTKGLDEILANLTKLEVKA-PKTAKAAVTEVAKEFEKAL-------KANTPVYEVET-DERLQEDTVISGFKGANVGIVS 71 (125) T ss_pred CchhHHHHHHHHHHhhHHH-HHHHHHHHHHHHHHHHHHH-------HHhCCcCCCCc-hhhHHhhhhcccccccccCceE Confidence 7788888888888887654 5566665555544444433 33332111000 0001111112111111111112 Q ss_pred hhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_019933. 89 VEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLAELRSK 154 (155) Q Consensus 89 vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~k~~~k 154 (155) ++-|. .....+||.|.|+ ...-+|= +|=++...++..+.+.+.+.+.|.+.|+= T Consensus 72 ~~VG~--------~k~~~~y~~f~E~---GT~k~~~-~pF~~pa~~~~k~~~~~~~~~~~~~~L~l 125 (125) T protein:vir:97 72 KEIGY--------GKATGWRAHYPND---GTIYQRG-QDFKERTINQMTPKAKQLYAEKVKEGLGL 125 (125) T ss_pred EEEee--------cCCCceeEeeecc---CccCCCc-CccchHhHHHhHHHHHHHHHHHHHHHhcC Confidence 33332 2345689999998 6666663 57778888888888888888888888877 No 214 >protein:vir:3750 Length: 227 # NCBI annotation: hypothetical protein # Family: family:all:743 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043491;genbank:gi:9628626;genbank:GeneID:1261131 Probab=49.70 E-value=0.63 Score=21.68 Aligned_cols=139 Identities=17% Similarity=0.139 Sum_probs=65.4 Q ss_pred CceeeeeccHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHhC-----CC-------Ccc------hhhcceee Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRD--DSDSVSRTMAFESAAVVRDSAKAHV-----RS-------KTG------RLKGAIYA 60 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~--~~~~~~r~a~~~~a~~i~~eak~~a-----P~-------~tG------~Lr~sI~~ 60 (155) |+++|+ ++.++|.+..++|.- +.-+.=+.-+...|..++..+++++ |. 
+.| .|.+-+.+ T Consensus 1 M~i~~~-~n~~~~~~l~~~L~ll~L~p~~Rr~ll~~iak~lr~~~k~rIr~Q~~PDGs~~~pRKr~k~KM~~kL~k~l~~ 79 (227) T protein:vir:37 1 MNIRMG-IDKEDLKKFLKDLEIISLPDKKKREILIRSLQMIKRQAVKSAANQRNPMGGSWKKRKNGTAKMLRRIAKLANS 79 (227) T ss_pred Cccccc-CCHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCchhcchhHHHHhhhHHHcce Confidence 988888 577777777777652 2233344555666777777777764 32 222 24444433 Q ss_pred eecccccCCceEEEEEEec-CCccccchhhhcc-----------------------------------ccccCCCcCC-C Q lcl|NC_019933. 61 VYVPEESTEVRHVYAVSWN-KKKAPHGHLVEYG-----------------------------------HWRTNVVAEV-D 103 (155) Q Consensus 61 ~~~~~~~~~g~~~~~Vg~~-~~~a~~~~~vEfG-----------------------------------t~~~~~~~~~-~ 103 (155) .. +.+.. .|++. ...+..++...|| ..-+.+..+. . T Consensus 80 ~~----~~~~a---~v~f~~g~~~~IA~vHq~G~~~~v~~~~~~~~~~~~~~~~paTr~QAk~Lr~lGy~v~~~k~k~~k 152 (227) T protein:vir:37 80 KA----EKAQG---TLFYKQKRTGEIAQEHQEGIPHLFKKTEFTGKNKGGIGADPCTLRQAKKLKDLGYTVANGKTKNGK 152 (227) T ss_pred ee----cccce---EEEecCcchHHHHHHhhcCcccccchhhhhhhhcCCccccCCCHHHHHHHHHhcccccCCCCCCcC Confidence 21 11111 23332 2222233333333 2211111110 0 Q ss_pred Cce----eeee--------------eccc------------ceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019933. 104 GKW----LFTK--------------EKLA------------TPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRLAELRS 153 (155) Q Consensus 104 ~~~----~~~~--------------~~~~------------gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i~k~~~ 153 (155) +.+ +-|. ..++ =++..|++|||-..= +.+.+.|...|..+.. T Consensus 153 ~~~rkps~kwI~~nls~~qAgliIR~L~~k~~~~~~~~k~~W~I~~PaR~FLG~~~--------~e~~~~l~r~l~~~~~ 224 (227) T protein:vir:37 153 AKRRKPTLSEIRSTLSRAKASLIIRKLEEKNGMNPSRHLTQWIIPTEKRSFLDTRE--------EENAKIILAEIQKYTQ 224 (227) T ss_pred CccccCCHHHHHHhhhHHHHHHHHHHHhcccccccccCccceeeecCcccccCCCH--------HHHHHHHHHHHHHHhh Confidence 000 0000 0111 123479999998743 3344455666777777 Q ss_pred cC Q lcl|NC_019933. 
154 KR 155 (155) Q Consensus 154 k~ 155 (155) +| T Consensus 225 ~~ 226 (227) T protein:vir:37 225 KQ 226 (227) T ss_pred hc Confidence 77 No 215 >protein:vir:966 Length: 123 # NCBI annotation: Orf48 # Family: family:all:970 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076620;genbank:gi:13095728;genbank:GeneID:920248 Probab=43.78 E-value=0.83 Score=21.02 Aligned_cols=120 Identities=12% Similarity=0.102 Sum_probs=62.0 Q ss_pred eee-ccHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEEecCCc Q lcl|NC_019933. 5 ITS-LDISGVLSALND-LRDDSDSVSRTMAFESAAVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVSWNKKK 82 (155) Q Consensus 5 m~~-~~l~~L~~~l~~-l~~~~~~~~r~a~~~~a~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg~~~~~ 82 (155) |.. +.+|+|.+.|.+ |.+-... +...+.+..+.+.+++...... ..+.+++.....+.+...... T Consensus 1 m~~~v~id~L~~~i~~~L~~y~~~-v~~~v~~~v~~~a~~~~~~lk~------------~sP~~TG~yaksW~~k~~~~~ 67 (123) T protein:vir:96 1 MANKISIDDLAKTIESEVRNWTKD-VVDDIDDIKKDITKNGVKQLRE------------SSPKRTGDYAKNWTSQKLKNG 67 (123) T ss_pred CCcccchhhHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHh------------hCCccccccccceeeeecCCe Confidence 887 789999998665 4555444 5578888888888887765552 234455544444444322111 Q ss_pred cccchhhhccccccCCCcCCCCceeeeeecccceeeeC-CccchhhHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_019933. 83 APHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVP-ARSFLRPGYDSVKGRLVEVANKAGAKRLAELRSK 154 (155) Q Consensus 83 a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~p-a~PFlrPA~~~~~~~~~~~i~~~l~~~i~k~~~k 154 (155) . . .....+..+...--+|+|..+-. .+===+|=+....+.+.+.|.+.+++.|. 
+ T Consensus 68 -~-----------~-~v~~~~~~y~l~HLLE~GHa~r~GGrV~a~phI~paee~~~~~l~~~i~r~l~----~ 123 (123) T protein:vir:96 68 -D-----------Q-VIYQKAPTYRLTHLLENGHAKRNGGRVSPKVHIAPVEEELVSNYISRVEKRLS----Q 123 (123) T ss_pred -e-----------E-EEEEecCCcceEEeeecceeecCCceeCcchhhhHHHHHHHHHHHHHHHHHhc----C Confidence 0 0 01111112222234567754432 32323455555555555555555555554 4 No 216 >protein:vir:78894 Length: 105 # NCBI annotation: gp10 # Family: family:all:29989 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468850;genbank:gi:157325424;genbank:GeneID:5601891 Probab=34.63 E-value=0.35 Score=23.05 Aligned_cols=102 Identities=14% Similarity=0.052 Sum_probs=38.9 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHhCCCCcchhhcceeeeecccccCCceEEEEEE Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESA---AVVRDSAKAHVRSKTGRLKGAIYAVYVPEESTEVRHVYAVS 77 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a---~~i~~eak~~aP~~tG~Lr~sI~~~~~~~~~~~g~~~~~Vg 77 (155) |||. .++.+-+.. | .++++..++ .+|..-..-.+|.+||.|++|=.-.+ --+.|..+|.+- T Consensus 1 ~~f~--~f~~~~~k~----l-------~kr~L~~~g~vq~EvlR~~~PyvP~~tG~Lk~S~~l~t---vIgsg~I~y~~~ 64 (105) T protein:vir:78 1 MSFS--SFKDAVIDD----I-------HNKALSTAAKAGGELVELAQPVTPILYGDLRRSSYFKI---IIQKNSIVARVF 64 (105) T ss_pred CCcc--cccchHHHH----H-------HHhcCCCCchhhHHHHHHhCCCCcccccccccccccce---eecCCeeEeecc Confidence 5432 233221111 1 111121111 14444455678999999999832111 123445544321 Q ss_pred ecCCccccchhhhccccccCCCcCCCCceeeeeecccceeeeCCccchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 78 WNKKKAPHGHLVEYGHWRTNVVAEVDGKWLFTKEKLATPVHVPARSFLRPGYDSVKGRLVEVANKAGAKRL 148 (155) Q Consensus 78 ~~~~~a~~~~~vEfGt~~~~~~~~~~~~~~~~~~~~~gt~~~pa~PFlrPA~~~~~~~~~~~i~~~l~~~i 148 (155) .-+||++..=|... 
.| |+ ++.-+.-..++.+.+.....++ | T Consensus 65 ---~~aPYAr~qYYe~~--------Rg----------------~~-WfErm~a~hk~~I~~~vegg~~--~ 105 (105) T protein:vir:78 65 ---SLTPYARRQYYENR--------RN----------------PR-WYEMAVSYGIQSINQIVEGGMR--L 105 (105) T ss_pred ---ccCchhhhhhhccc--------CC----------------Cc-hhHHhhhcchhHHHHHHhcccC--C Confidence 12566665544321 01 11 2222333333322222111111 1 No 217 >protein:vir:4230 Length: 111 # NCBI annotation: predicted 12.0Kd protein # Family: family:all:2819 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039685;swissprot:sw:q05227;genbank:gi:9625451;uniprot:Q05227;genbank:GeneID:2942925 Probab=25.77 E-value=1.5 Score=19.69 Aligned_cols=98 Identities=16% Similarity=0.247 Sum_probs=45.1 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--------CCcchhhcceeeeecccccCCceE Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVR--------SKTGRLKGAIYAVYVPEESTEVRH 72 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP--------~~tG~Lr~sI~~~~~~~~~~~g~~ 72 (155) |.---.. .......|. -.+.+++.-+..+-+.||.|-- ..+|.+-++|.. .+|.. T Consensus 1 makvyan-----aN~v~a~~~-----~~k~avr~E~~~v~~RAraNLA~a~astri~~~g~~p~~it~-------~~gdv 63 (111) T protein:vir:42 1 MAKVYAN-----ANKVAARYV-----ETRDAVRDERNKVTRRAKANLARQNSTTRITDEGYFPATITE-------QDGDV 63 (111) T ss_pred Ccceecc-----hhhhhhhch-----hHHHHHHHHHhhhhhhHHHhHHHhhhccccccccccCceeec-------ccCCc Confidence 2111110 011111111 1356666666666666665432 235666666632 34444 Q ss_pred EEEEEecCCccccchhhhccccccCCC-----cCCCCceeeeeecccceee Q lcl|NC_019933. 73 VYAVSWNKKKAPHGHLVEYGHWRTNVV-----AEVDGKWLFTKEKLATPVH 118 (155) Q Consensus 73 ~~~Vg~~~~~a~~~~~vEfGt~~~~~~-----~~~~~~~~~~~~~~~gt~~ 118 (155) -..+... +|-.--+||||.+.+.- +.+.+-++..+--.+||+. 
T Consensus 64 D~~~~l~---APnamAiEfGH~PSG~F~g~dTKaPe~~YILt~AAiggt~~ 111 (111) T protein:vir:42 64 DFHTILN---APNALALEFGHAPSGFFAGTDTKPPEATYILTRAAIGGTVS 111 (111) T ss_pred ceEEEec---CCChhhhhcccCCcceecccccCCCCceeeeeccccccccC Confidence 4444433 44555699999654422 2233333333444566655 No 218 >protein:vir:2435 Length: 111 # NCBI annotation: gp21 # Family: family:all:2819 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046837;genbank:gi:9630405;genbank:GeneID:1261628 Probab=22.14 E-value=2 Score=18.89 Aligned_cols=98 Identities=17% Similarity=0.235 Sum_probs=44.2 Q ss_pred CceeeeeccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--------CCcchhhcceeeeecccccCCceE Q lcl|NC_019933. 1 MSSKITSLDISGVLSALNDLRDDSDSVSRTMAFESAAVVRDSAKAHVR--------SKTGRLKGAIYAVYVPEESTEVRH 72 (155) Q Consensus 1 M~~~m~~~~l~~L~~~l~~l~~~~~~~~r~a~~~~a~~i~~eak~~aP--------~~tG~Lr~sI~~~~~~~~~~~g~~ 72 (155) |.---. ........|. -++.++++-+..+-+.||.|-- ...|.+=.+|... +|.. T Consensus 1 makvya-----naN~v~ahl~-----~vk~avr~Ea~ev~~RAr~NLA~arastri~k~g~~P~~I~~~-------~gdv 63 (111) T protein:vir:24 1 MAKVYA-----NANKVAARHV-----DVRKRVKEERDGVTRRARTNLARANKTTRITKEGYFPASIEEV-------DGDV 63 (111) T ss_pred Cccccc-----chhhHhhhch-----hHHHHHHHHHhhhhhhHHHhHHHhhhcceecccccCccccccc-------cCCc Confidence 211111 0111111111 1356666666777766666532 2456776776322 2333 Q ss_pred EEEEEecCCccccchhhhccccccCCC-----cCCCCceeeeeecccceee Q lcl|NC_019933. 73 VYAVSWNKKKAPHGHLVEYGHWRTNVV-----AEVDGKWLFTKEKLATPVH 118 (155) Q Consensus 73 ~~~Vg~~~~~a~~~~~vEfGt~~~~~~-----~~~~~~~~~~~~~~~gt~~ 118 (155) -..|.. .+|-.--+||||.+.+.- +.+.+-++....-.+|++. T Consensus 64 D~~~~l---~APnamAiEfGH~PSG~F~g~dTKaP~glYILt~AA~~g~~~ 111 (111) T protein:vir:24 64 DFHTVL---HAPNAFALEFGHAPSGFFAGTDTKPPDPEYILTRAAIGGTVS 111 (111) T ss_pred ceEEEe---cCCChhhhhccCCCcceecccccCCCCCceeeeccccccccC Confidence 333332 345555699999654422 2233333333334556654 Done!