Query lcl|NC_018285.1_cdsid_YP_006561269.1 [gene=Ssal_phage00055] [protein=putative tail component protein] [protein_id=YP_006561269.1] [location=32082..32510] Match_columns 142 No_of_seqs 72 out of 76 Neff 5.7 Searched_HMMs 1612 Date Thu Nov 7 13:05:41 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_48 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_48_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:4833 Length: 140 # 100.0 1.5E-60 9.1E-64 348.6 15.0 140 3-142 1-140 (140) 2 protein:vir:4859 Length: 140 # 100.0 1.6E-60 1E-63 348.4 15.0 140 3-142 1-140 (140) 3 protein:vir:5000 Length: 141 # 100.0 1E-59 6.2E-63 344.0 15.0 140 3-142 1-140 (141) 4 protein:vir:4956 Length: 153 # 100.0 1E-58 6.3E-62 338.5 15.0 140 3-142 1-140 (153) 5 protein:vir:100223 Length: 139 100.0 1.4E-53 8.9E-57 310.3 14.7 135 4-142 1-136 (139) 6 protein:vir:3994 Length: 168 # 100.0 7.7E-54 4.8E-57 311.8 12.5 140 3-142 1-165 (168) 7 protein:vir:100887 Length: 139 100.0 9.4E-53 5.8E-56 305.8 14.8 135 4-142 1-136 (139) 8 protein:vir:7412 Length: 168 # 100.0 5.2E-53 3.2E-56 307.2 12.1 140 3-142 1-165 (168) 9 protein:vir:1028 Length: 168 # 100.0 8.1E-53 5E-56 306.2 12.5 140 3-142 1-165 (168) 10 protein:vir:3848 Length: 159 # 100.0 3.3E-52 2E-55 302.8 15.0 139 2-142 1-159 (159) 11 protein:vir:1087 Length: 161 # 100.0 9.2E-51 5.7E-54 294.9 12.6 140 2-142 1-161 (161) 12 protein:vir:81106 Length: 125 100.0 8.3E-35 5.1E-38 207.4 11.2 123 1-137 2-125 (125) 13 protein:vir:98342 Length: 125 100.0 8.3E-35 5.1E-38 207.4 11.2 123 1-137 2-125 (125) 14 protein:vir:9414 Length: 125 # 100.0 8.3E-35 5.1E-38 207.4 11.2 123 1-137 2-125 (125) 15 protein:vir:79988 Length: 125 100.0 8.3E-35 5.1E-38 207.4 11.2 123 1-137 2-125 (125) 16 protein:vir:4704 Length: 125 # 100.0 8.3E-35 5.1E-38 207.4 11.2 123 1-137 2-125 (125) 17 protein:vir:9708 Length: 125 # 100.0 6.8E-34 4.2E-37 202.4 11.3 125 6-138 1-125 (125) 18 protein:vir:1273 Length: 127 # 99.9 6.4E-32 4E-35 191.6 10.6 126 3-137 1-127 (127) 19 protein:vir:3873 Length: 128 # 99.9 1.8E-31 1.1E-34 189.2 12.0 127 3-137 1-128 (128) 20 protein:vir:1386 Length: 149 # 99.9 4.8E-31 3E-34 186.8 11.7 139 1-142 1-149 (149) 21 protein:vir:105089 Length: 133 99.9 2.9E-28 1.8E-31 171.6 11.1 129 3-139 1-133 (133) 22 protein:vir:107568 Length: 146 99.9 1.4E-27 8.6E-31 167.8 12.1 137 1-140 1-146 (146) 23 protein:vir:102085 Length: 146 99.9 1.4E-27 8.6E-31 167.8 12.1 137 1-140 1-146 (146) 24 protein:vir:102875 Length: 146 99.9 1.4E-27 8.6E-31 167.8 12.1 137 1-140 1-146 (146) 25 protein:vir:105007 Length: 146 99.9 1.4E-27 8.6E-31 167.8 12.1 137 1-140 1-146 (146) 26 protein:vir:5745 Length: 135 # 99.9 1.7E-27 1E-30 167.4 11.5 132 2-141 1-135 (135) 27 protein:vir:102154 Length: 119 99.9 2.7E-26 1.7E-29 160.8 7.7 117 3-137 1-119 (119) 28 protein:vir:100075 Length: 140 99.9 4.4E-25 2.7E-28 154.1 10.6 128 3-142 1-138 (140) 29 protein:vir:1437 Length: 140 # 99.9 1.1E-24 6.7E-28 152.0 10.8 127 3-142 1-138 (140) 30 protein:vir:80362 Length: 140 99.8 2.4E-24 1.5E-27 150.1 11.2 128 3-142 1-138 (140) 31 protein:vir:100243 Length: 140 99.8 3.3E-24 2E-27 149.3 10.6 129 3-142 1-138 (140) 32 protein:vir:93617 Length: 148 99.8 9.1E-24 5.6E-27 146.9 11.3 139 1-142 1-148 (148) 33 protein:vir:4347 Length: 164 # 99.8 7E-24 4.4E-27 147.5 9.2 134 1-142 1-156 (164) 34 protein:vir:194 Length: 149 # 99.8 3.5E-23 2.2E-26 143.7 11.6 130 1-142 1-149 (149) 35 protein:vir:1891 Length: 179 # 99.8 5.7E-22 3.5E-25 137.1 9.5 136 1-142 1-175 (179) 36 protein:vir:94538 Length: 125 99.7 3.7E-20 2.3E-23 127.1 9.9 122 1-139 1-125 (125) 37 protein:vir:96358 Length: 115 99.6 9.9E-19 6.2E-22 119.3 9.4 114 6-133 1-115 (115) 38 protein:vir:78858 Length: 115 99.6 9.9E-19 6.2E-22 119.3 9.4 114 6-133 1-115 (115) 39 protein:vir:97144 Length: 115 99.6 9.9E-19 6.2E-22 119.3 9.4 114 6-133 1-115 (115) 40 protein:vir:103917 Length: 115 99.6 9.9E-19 6.2E-22 119.3 9.4 114 6-133 1-115 (115) 41 protein:vir:96225 Length: 115 99.6 9.9E-19 6.2E-22 119.3 9.4 114 6-133 1-115 (115) 42 protein:vir:9312 Length: 115 # 99.6 9.9E-19 6.2E-22 119.3 9.4 114 6-133 1-115 (115) 43 protein:vir:106623 Length: 115 99.6 3.6E-18 2.2E-21 116.2 9.3 114 6-137 1-115 (115) 44 protein:vir:2740 Length: 114 # 99.6 1.1E-17 6.6E-21 113.7 9.4 113 3-134 1-114 (114) 45 protein:vir:4906 Length: 114 # 99.6 1.1E-17 6.6E-21 113.7 9.4 113 3-134 1-114 (114) 46 protein:vir:99744 Length: 115 99.5 1.7E-17 1E-20 112.5 9.5 114 6-133 1-115 (115) 47 protein:vir:3617 Length: 112 # 99.5 1.9E-17 1.2E-20 112.2 9.3 111 1-133 1-112 (112) 48 protein:vir:95789 Length: 114 99.5 3.7E-17 2.3E-20 110.7 9.2 114 1-137 1-114 (114) 49 protein:vir:101594 Length: 173 99.5 7.1E-17 4.4E-20 109.1 10.0 117 6-139 1-173 (173) 50 protein:vir:743 Length: 108 # 99.5 7.4E-17 4.6E-20 109.0 9.6 107 6-133 1-108 (108) 51 protein:vir:9930 Length: 108 # 99.5 6.9E-17 4.3E-20 109.2 8.4 108 7-134 1-108 (108) 52 protein:vir:98409 Length: 108 99.5 3.5E-16 2.2E-19 105.3 9.6 107 6-137 1-108 (108) 53 protein:vir:96486 Length: 112 99.4 2.8E-16 1.7E-19 105.9 8.8 111 3-133 1-112 (112) 54 protein:vir:97088 Length: 157 99.4 1.4E-15 8.8E-19 102.0 10.4 126 1-142 3-156 (157) 55 protein:vir:9647 Length: 132 # 99.4 1.8E-15 1.1E-18 101.4 10.3 123 3-138 1-132 (132) 56 protein:vir:105467 Length: 144 99.3 1E-13 6.3E-17 91.8 11.6 124 1-140 1-144 (144) 57 protein:vir:6216 Length: 125 # 99.2 1.2E-14 7.5E-18 96.9 6.2 119 3-136 1-125 (125) 58 protein:vir:100652 Length: 134 99.2 1.4E-13 8.5E-17 91.1 8.5 122 3-135 1-134 (134) 59 protein:vir:107099 Length: 137 99.2 2.4E-13 1.5E-16 89.7 9.2 108 3-129 1-137 (137) 60 protein:vir:98636 Length: 138 99.1 4E-13 2.5E-16 88.5 10.3 126 1-138 5-138 (138) 61 protein:vir:95894 Length: 137 99.1 3.6E-13 2.2E-16 88.8 9.4 108 3-129 1-137 (137) 62 protein:vir:94796 Length: 137 99.1 3.4E-13 2.1E-16 89.0 9.2 108 3-129 1-137 (137) 63 protein:vir:94490 Length: 137 99.1 5E-13 3.1E-16 88.0 9.2 108 3-129 1-137 (137) 64 protein:vir:97427 Length: 137 99.1 5E-13 3.1E-16 88.0 9.2 108 3-129 1-137 (137) 65 protein:vir:93738 Length: 137 99.1 5E-13 3.1E-16 88.0 9.2 108 3-129 1-137 (137) 66 protein:vir:81147 Length: 126 99.1 2.1E-12 1.3E-15 84.6 10.7 121 1-141 1-126 (126) 67 protein:vir:96121 Length: 137 99.1 1.2E-12 7.7E-16 85.9 9.4 108 3-129 1-137 (137) 68 protein:vir:101302 Length: 134 99.1 8.3E-13 5.1E-16 86.8 8.1 122 3-135 1-134 (134) 69 protein:vir:9513 Length: 134 # 99.1 8.3E-13 5.1E-16 86.8 8.1 122 3-135 1-134 (134) 70 protein:vir:105330 Length: 137 99.1 1.1E-12 6.8E-16 86.2 8.7 108 3-129 1-137 (137) 71 protein:vir:94108 Length: 149 99.0 1.9E-12 1.2E-15 84.9 9.2 110 1-129 11-149 (149) 72 protein:vir:105916 Length: 149 99.0 2.5E-12 1.6E-15 84.1 8.9 110 1-129 11-149 (149) 73 protein:vir:96829 Length: 135 99.0 3.1E-12 1.9E-15 83.7 8.8 108 3-137 1-135 (135) 74 protein:vir:106570 Length: 182 99.0 5.2E-12 3.2E-15 82.4 9.0 123 3-142 1-182 (182) 75 protein:vir:94654 Length: 142 98.9 9.4E-12 5.8E-15 81.0 7.4 115 1-133 1-142 (142) 76 protein:vir:79034 Length: 141 98.8 4E-11 2.5E-14 77.6 8.7 125 1-138 1-141 (141) 77 protein:vir:5978 Length: 144 # 98.8 7.6E-11 4.7E-14 76.1 10.1 114 1-137 1-144 (144) 78 protein:vir:102963 Length: 163 98.7 3.9E-10 2.4E-13 72.1 11.5 134 3-141 1-163 (163) 79 protein:vir:96012 Length: 133 98.6 5.7E-10 3.6E-13 71.2 9.2 122 3-136 1-133 (133) 80 protein:vir:78335 Length: 133 98.5 2.1E-09 1.3E-12 68.1 9.3 123 3-136 1-133 (133) 81 protein:vir:78077 Length: 141 98.4 8.3E-09 5.2E-12 64.9 10.4 115 1-136 1-141 (141) 82 protein:vir:96973 Length: 133 98.3 4.8E-09 3E-12 66.2 8.3 123 3-134 1-133 (133) 83 protein:vir:9363 Length: 133 # 98.3 4.8E-09 3E-12 66.2 8.3 123 3-134 1-133 (133) 84 protein:vir:78644 Length: 133 98.3 4.8E-09 3E-12 66.2 8.3 123 3-134 1-133 (133) 85 protein:vir:94419 Length: 133 98.3 4.8E-09 3E-12 66.2 8.3 123 3-134 1-133 (133) 86 protein:vir:99528 Length: 92 # 98.3 3.3E-09 2.1E-12 67.1 6.5 91 1-107 1-92 (92) 87 protein:vir:93898 Length: 133 98.3 9.5E-09 5.9E-12 64.6 8.2 123 3-134 1-133 (133) 88 protein:vir:99101 Length: 142 98.1 7.3E-09 4.5E-12 65.2 4.3 112 1-130 1-142 (142) 89 protein:vir:8669 Length: 142 # 98.1 7.3E-09 4.5E-12 65.2 4.3 112 1-130 1-142 (142) 90 protein:vir:97327 Length: 116 98.0 5.5E-08 3.4E-11 60.4 7.8 87 24-129 1-116 (116) 91 protein:vir:1243 Length: 116 # 98.0 5.5E-08 3.4E-11 60.4 7.8 87 24-129 1-116 (116) 92 protein:vir:966 Length: 123 # 98.0 2.4E-07 1.5E-10 56.9 11.1 117 1-138 1-123 (123) 93 protein:vir:95062 Length: 116 98.0 6E-08 3.7E-11 60.2 7.8 87 24-129 1-116 (116) 94 protein:vir:1332 Length: 143 # 97.9 4.5E-07 2.8E-10 55.4 10.9 134 1-142 1-143 (143) 95 protein:vir:6246 Length: 143 # 97.8 6.8E-07 4.2E-10 54.4 10.9 134 1-142 1-143 (143) 96 protein:vir:81067 Length: 119 97.8 9.8E-08 6.1E-11 59.0 5.5 88 43-140 1-119 (119) 97 protein:vir:2688 Length: 123 # 97.7 3.1E-07 1.9E-10 56.3 7.8 113 13-134 1-123 (123) 98 protein:vir:10367 Length: 119 97.7 1.2E-07 7.7E-11 58.4 5.6 88 43-140 1-119 (119) 99 protein:vir:9879 Length: 127 # 97.6 6.3E-07 3.9E-10 54.6 7.6 116 7-138 1-127 (127) 100 protein:vir:4096 Length: 140 # 97.4 2E-06 1.2E-09 51.9 8.3 131 1-142 1-139 (140) 101 protein:vir:106041 Length: 137 97.0 1.6E-06 1E-09 52.3 4.0 106 1-127 1-137 (137) 102 protein:vir:102338 Length: 116 97.0 1.2E-05 7.6E-09 47.5 8.8 97 24-137 1-116 (116) 103 protein:vir:95372 Length: 124 97.0 2.9E-05 1.8E-08 45.4 10.8 119 1-138 1-124 (124) 104 protein:vir:102441 Length: 137 96.9 9.5E-06 5.9E-09 48.1 7.2 105 1-126 2-137 (137) 105 protein:vir:80116 Length: 127 96.8 4.5E-05 2.8E-08 44.4 10.4 122 1-141 1-127 (127) 106 protein:vir:104347 Length: 145 96.8 1.6E-05 1E-08 46.9 7.9 118 1-136 1-145 (145) 107 protein:vir:107851 Length: 175 96.8 1.8E-05 1.1E-08 46.6 8.1 128 1-139 1-175 (175) 108 protein:vir:79091 Length: 175 96.7 2.4E-05 1.5E-08 45.9 8.2 128 1-139 1-175 (175) 109 protein:vir:103280 Length: 142 96.7 2.9E-05 1.8E-08 45.5 8.4 113 1-136 1-142 (142) 110 protein:vir:107545 Length: 140 96.6 2.6E-06 1.6E-09 51.2 2.0 108 3-127 1-140 (140) 111 protein:vir:97982 Length: 140 96.6 2.6E-06 1.6E-09 51.2 2.0 108 3-127 1-140 (140) 112 protein:vir:79638 Length: 146 96.5 8E-05 5E-08 43.0 10.1 117 1-138 1-146 (146) 113 protein:vir:107703 Length: 147 96.2 8.2E-05 5.1E-08 43.0 8.2 116 1-142 1-147 (147) 114 protein:vir:3163 Length: 145 # 96.1 3.2E-05 2E-08 45.2 5.6 125 3-142 1-145 (145) 115 protein:vir:2026 Length: 150 # 95.7 0.00015 9.4E-08 41.5 7.6 120 3-134 1-150 (150) 116 protein:vir:94994 Length: 131 95.6 0.00011 6.5E-08 42.4 6.6 107 4-133 1-131 (131) 117 protein:vir:106506 Length: 137 95.6 2E-05 1.2E-08 46.4 2.5 108 1-137 1-137 (137) 118 protein:vir:1988 Length: 156 # 95.6 6.7E-05 4.1E-08 43.5 5.4 123 1-138 1-156 (156) 119 protein:vir:6071 Length: 150 # 95.6 0.0002 1.2E-07 40.9 7.9 120 3-134 1-150 (150) 120 protein:vir:5703 Length: 150 # 95.4 0.00025 1.5E-07 40.3 7.9 120 3-134 1-150 (150) 121 protein:vir:99833 Length: 190 95.3 0.00018 1.1E-07 41.1 6.7 126 1-140 1-190 (190) 122 protein:vir:78380 Length: 131 94.6 0.00012 7.3E-08 42.1 4.0 107 4-133 1-131 (131) 123 protein:vir:100312 Length: 152 93.9 0.00096 5.9E-07 37.1 7.4 125 3-135 1-152 (152) 124 protein:vir:107757 Length: 189 93.5 0.00019 1.2E-07 41.0 2.8 85 43-142 1-94 (189) 125 protein:vir:96105 Length: 193 93.4 0.00088 5.5E-07 37.3 6.3 124 1-142 1-138 (193) 126 protein:vir:98557 Length: 149 93.2 0.0013 7.9E-07 36.5 7.0 118 3-134 1-149 (149) 127 protein:vir:101563 Length: 155 92.7 0.00024 1.5E-07 40.4 2.2 93 36-142 1-100 (155) 128 protein:vir:99546 Length: 200 92.5 0.00046 2.8E-07 38.9 3.5 90 39-142 1-145 (200) 129 protein:vir:77650 Length: 155 92.2 0.00033 2E-07 39.7 2.4 89 36-142 1-98 (155) 130 protein:vir:79225 Length: 155 92.2 0.0024 1.5E-06 35.0 7.0 125 1-139 2-155 (155) 131 protein:vir:103841 Length: 155 91.8 0.0024 1.5E-06 35.0 6.5 128 1-140 1-155 (155) 132 protein:vir:99196 Length: 155 91.7 0.0026 1.6E-06 34.7 6.7 125 1-140 1-155 (155) 133 protein:vir:1838 Length: 149 # 91.0 0.0036 2.2E-06 34.0 6.7 119 3-134 1-149 (149) 134 protein:vir:80425 Length: 134 90.6 0.0022 1.4E-06 35.1 5.2 107 4-138 1-134 (134) 135 protein:vir:78607 Length: 155 90.4 0.0009 5.6E-07 37.3 2.8 95 29-142 1-98 (155) 136 protein:vir:5257 Length: 148 # 90.2 0.00032 2E-07 39.7 0.3 80 25-142 1-91 (148) 137 protein:vir:79115 Length: 148 89.8 0.0076 4.7E-06 32.2 7.5 119 3-138 1-148 (148) 138 protein:vir:106728 Length: 155 89.8 0.001 6.4E-07 37.0 2.7 95 29-142 1-98 (155) 139 protein:vir:1164 Length: 156 # 89.4 0.0064 4E-06 32.6 6.7 123 3-138 1-156 (156) 140 protein:vir:79179 Length: 155 89.3 0.0078 4.8E-06 32.1 7.1 124 3-134 1-155 (155) 141 protein:vir:80037 Length: 199 89.0 0.0025 1.6E-06 34.8 4.2 126 1-142 1-139 (199) 142 protein:vir:94069 Length: 168 88.3 0.0018 1.1E-06 35.7 2.9 97 28-142 1-101 (168) 143 protein:vir:97190 Length: 148 86.5 0.013 8.3E-06 30.8 6.6 114 1-142 1-148 (148) 144 protein:vir:95157 Length: 144 85.2 0.0096 6E-06 31.6 5.2 110 1-137 1-144 (144) 145 protein:vir:94944 Length: 121 83.5 0.022 1.4E-05 29.7 6.3 101 1-119 1-121 (121) 146 protein:vir:96774 Length: 152 71.9 0.028 1.7E-05 29.1 3.3 116 1-139 1-152 (152) 147 protein:vir:95260 Length: 160 71.2 0.033 2E-05 28.7 3.5 85 46-142 1-102 (160) 148 protein:vir:8106 Length: 150 # 68.8 0.017 1.1E-05 30.3 1.4 119 3-142 1-142 (150) 149 protein:vir:3750 Length: 227 # 65.2 0.11 7.1E-05 25.7 5.2 135 1-142 1-152 (227) 150 protein:vir:7449 Length: 123 # 53.4 0.53 0.00033 22.1 7.9 123 1-139 1-123 (123) 151 protein:vir:105773 Length: 131 44.7 0.8 0.00049 21.1 6.4 110 1-134 1-131 (131) 152 protein:vir:101508 Length: 120 42.1 0.9 0.00056 20.8 8.4 118 1-140 1-120 (120) 153 protein:vir:3787 Length: 231 # 24.9 1.2 0.00072 20.2 3.7 135 1-142 3-157 (231) 154 protein:vir:7993 Length: 108 # 21.8 0.21 0.00013 24.3 -1.0 103 1-142 1-104 (108) 155 protein:vir:3427 Length: 192 # 20.8 2.7 0.0017 18.2 8.1 132 1-138 1-192 (192) No 1 >protein:vir:4833 Length: 140 # NCBI annotation: ORF29 # Family: family:all:1029 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038330;genbank:gi:9634656;genbank:GeneID:1262624 Probab=100.00 E-value=1.5e-60 Score=348.60 Aligned_cols=140 Identities=90% Similarity=1.305 Sum_probs=138.3 Q ss_pred cchHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCcccccccccee Q lcl|NC_018285. 3 MVGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNGVA 82 (142) Q Consensus 3 m~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G~~ 82 (142) |++|+++|++|+++|++|++++++++++||+|||+||++.|+++||++||++|+|+++|||||+|+++++++||.++|++ T Consensus 1 M~~~~d~l~e~~~~v~kl~~~~~~~~~katkAGAkv~~~~L~~~tp~~h~~~r~t~~~~HlaD~I~~~~~~idg~~dG~s 80 (140) T protein:vir:48 1 MTGLDEALEGWLKTVASIGDLTPAEQAKITTAGAKVFKKELAEVTREKHYSKKKDLKYGHMADGLAVQSTNVDGRKNGVA 80 (140) T ss_pred CccHHHHHHHHHHHHHHhccCCHHHHHHHHHHhHHHHHHHHHHhcccCCCCCCCCCCCCcccccceecccccccccccce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EeccCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC Q lcl|NC_018285. 83 TVGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIRRKGGK 142 (142) Q Consensus 83 ~VG~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~~k~g~ 142 (142) +|||++++++||||||||||++||||||||+||++++++++||+||.++|+++|++|||+ T Consensus 81 ~VG~~k~~~a~~a~f~NdGT~k~~~~hFve~t~~e~~~~~~vl~A~~~~y~~~l~kk~~~ 140 (140) T protein:vir:48 81 TVGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSAVRDKVLLAEKEEYEKLIRKKGGE 140 (140) T ss_pred eecccCCCceeEEeecccCccccCCCchHHHHHHhhhhHHHHHHHHHHHHHHHHHhhcCC Confidence 999999999999999999999999999999999999888999999999999999999999 No 2 >protein:vir:4859 Length: 140 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049399;genbank:gi:9632427;genbank:GeneID:1258496 Probab=100.00 E-value=1.6e-60 Score=348.37 Aligned_cols=140 Identities=88% Similarity=1.274 Sum_probs=138.2 Q ss_pred cchHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCcccccccccee Q lcl|NC_018285. 3 MVGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNGVA 82 (142) Q Consensus 3 m~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G~~ 82 (142) |++|++||+||+++|++|++++++++++||+|||+||++.|+++||++||++++|+++|||+|+|+++++++||..+|++ T Consensus 1 M~~~~d~l~e~~~~lekl~~~~~~~~~katkAGA~v~~~~L~~~tp~~h~~~~~t~~~~HlaD~I~~~~~~iDg~~~g~s 80 (140) T protein:vir:48 1 MTGLDEALEGWLKTVASIGDLTPAEQAKITTAGAKVFKEELAEVTRQKHYSNKKHLKYGHMADGLSVQSTNVDGRKNGVS 80 (140) T ss_pred CccHHHHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCCCCCCcchhceeecccccccccCcee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EeccCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC Q lcl|NC_018285. 83 TVGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIRRKGGK 142 (142) Q Consensus 83 ~VG~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~~k~g~ 142 (142) +|||++++++||||||||||++||||||||+||++++++++||+||+++|+++|++|||+ T Consensus 81 ~VG~~kk~~a~~A~f~n~GT~k~~~~hFve~~~~e~~~k~~vl~A~~~~~~~~l~~~~~~ 140 (140) T protein:vir:48 81 TVGWVNRYHAQNARRLNDGTKKYRADHFVTNVQNDSAVQTKVLLAEKEEYEKLIRKKGGE 140 (140) T ss_pred eeccCCCcceeeeeccccCccccCCCchhHHHHHhhhhHHHHHHHHHHHHHHHHHhhcCC Confidence 999999999999999999999999999999999999888999999999999999999999 No 3 >protein:vir:5000 Length: 141 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049974;genbank:gi:9632946;genbank:GeneID:1262109 Probab=100.00 E-value=1e-59 Score=344.04 Aligned_cols=140 Identities=79% Similarity=1.143 Sum_probs=138.0 Q ss_pred cchHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCcccccccccee Q lcl|NC_018285. 3 MVGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNGVA 82 (142) Q Consensus 3 m~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G~~ 82 (142) |++|++||+||+++|++|++++++++++||+|||+||++.|+++||++||++++++++|||+|+|+++++++||.++|++ T Consensus 1 M~~~~~gl~e~~~~lekl~~~~~~~~~katkAGA~v~~~~L~~~tp~~hy~~~~~~~~~HlaD~I~~~~~~~DG~~dg~s 80 (141) T protein:vir:50 1 MVGLAEALDEWLKTVASIGNLTPAEQVEITTAGAKVFKKELEEVTREKHYSRKKNPKFGHMADGLAIQSTNADGRKNGVS 80 (141) T ss_pred CccHHHHHHHHHHHHHHhcCCCHHHHHHHHHHHHHHHHHHHHHhcccCCCCCCCCCCCCccccceeeccCccccccCCee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EeccCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC Q lcl|NC_018285. 83 TVGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIRRKGGK 142 (142) Q Consensus 83 ~VG~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~~k~g~ 142 (142) +|||++++++||||||||||++|||||||++||++++++++||+||+++|+++|++|||. T Consensus 81 ~VG~~~~~~~~~A~f~n~GT~k~~~~hFve~~~~~a~~k~~Vl~A~~~~~k~~l~~~~~~ 140 (141) T protein:vir:50 81 TVGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSTVQKKVLLEKKRNTKNSLEEKEGC 140 (141) T ss_pred eeccCCCccceeeeccccCccccCCCchhHHHHHhhhhHHHHHHHHHHHHHHHHHhccCC Confidence 999999999999999999999999999999999998888999999999999999999999 No 4 >protein:vir:4956 Length: 153 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049932;genbank:gi:9632903;genbank:GeneID:1262079 Probab=100.00 E-value=1e-58 Score=338.53 Aligned_cols=140 Identities=89% Similarity=1.301 Sum_probs=138.0 Q ss_pred cchHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCcccccccccee Q lcl|NC_018285. 3 MVGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNGVA 82 (142) Q Consensus 3 m~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G~~ 82 (142) |++|++||+||+++|++|++++++++++||+|||+||++.|+++||++||++++++++|||+|+|+++++++||..+|++ T Consensus 1 M~~~~~glee~~~~lekL~~~~~~~~~katkAGA~v~~e~L~~~tp~~h~~~~kt~~~~HlaD~I~~s~~~idG~~dG~s 80 (153) T protein:vir:49 1 MTGLDEALEGWLKTVASIGDLTPAEQAKITTAGAKVFKEELAEVTREKHYSKKKDLKYGHMADGLAVQSTNADGRKNGVS 80 (153) T ss_pred CccHHHHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCCCCCCcccccceecccccccccccee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EeccCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC Q lcl|NC_018285. 83 TVGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIRRKGGK 142 (142) Q Consensus 83 ~VG~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~~k~g~ 142 (142) +|||++++++|+|||+||||++||||||||++|++++++++||+||.++|+++|++|+|= T Consensus 81 ~VG~~~~~~a~~a~f~n~GT~km~~~hFie~tr~e~~~k~~vl~A~~~~~~~il~~~~~~ 140 (153) T protein:vir:49 81 TVGWKNNYHAQNARRLNDGTKKYRADHFITNVQNDSTVKNKVLLAEKEEYEKLIRRKGGV 140 (153) T ss_pred eecccCCccceeeeecccCcccCCCChhhHHHHHHhhHHHHHHHHHHHHHHHHHHhcCCe Confidence 999999999999999999999999999999999999888999999999999999999998 No 5 >protein:vir:100223 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025034;genbank:gi:48697267;genbank:GeneID:2948321 Probab=100.00 E-value=1.4e-53 Score=310.30 Aligned_cols=135 Identities=44% Similarity=0.730 Sum_probs=129.2 Q ss_pred chHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCC-CCCcccchhcceecCcccccccccee Q lcl|NC_018285. 4 VGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKK-DLKYGHMADGLSVQSTNADGRKNGVA 82 (142) Q Consensus 4 ~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~-~~k~~HlaD~I~~~~~~~dg~~~G~~ 82 (142) .+|+++|++|+++|++|++++++++++||+|||+||++.|+++||++||+++. +++++||+|+|+++++++||..+|++ T Consensus 1 ~~~~~~l~e~l~~lekl~~~~~~~~~k~tkaGA~v~~~~L~~~tp~~~~~~~~~~~~~~HlaD~I~~~~~~idg~~~g~~ 80 (139) T protein:vir:10 1 MDMDEALGQWLKQVSKAAQLSVSDQEKITKAGADVYAKELAETTKEKHPNTKGDGGKYGHLSEDISSAAGDIDGDHNGSS 80 (139) T ss_pred CCHHHHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHhcccccccCCCCCCCCCcccccceecCccccccccccc Confidence 89999999999999999999999999999999999999999999999998666 56899999999999999999999999 Q ss_pred EeccCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC Q lcl|NC_018285. 83 TVGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIRRKGGK 142 (142) Q Consensus 83 ~VG~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~~k~g~ 142 (142) +|||+++ +|+||||||||++||||||||+||+++ +++||+||.++|+++|++|+|. T Consensus 81 ~VG~~~~--~~~Ahf~n~GT~~~~~~hFie~t~~e~--~~ev~~a~~~~~ke~l~~~~~~ 136 (139) T protein:vir:10 81 TVGFHNK--AHIARFLNDGTKNIRADHFVDNARDDA--KDAVFAAEAEKYQAMIAKANGG 136 (139) T ss_pred eeCCCCC--ceeeeeeccCccccCCCchHHHHHHHH--HHHHHHHHHHHHHHHHhhcCCC Confidence 9999854 899999999999999999999999987 5899999999999999999999 No 6 >protein:vir:3994 Length: 168 # NCBI annotation: unknown # Family: family:all:1029 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116502;genbank:gi:14251135;genbank:GeneID:921309 Probab=100.00 E-value=7.7e-54 Score=311.78 Aligned_cols=140 Identities=31% Similarity=0.541 Sum_probs=135.9 Q ss_pred cchHHHHHHHHHHHHHHhc-cccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCccccccccce Q lcl|NC_018285. 3 MVGLDEALEGWLETVASIG-DITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNGV 81 (142) Q Consensus 3 m~~~~~~l~e~~~~l~kl~-~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G~ 81 (142) |++|+++|++|++++++|+ +++++++++||+|||+||++.|+++||++||++|+|+++|||||+|+++++||||..||+ T Consensus 1 M~~~~d~l~~~~~~v~kl~~~lt~e~kakIT~AGAkv~a~~L~~~T~~kHy~~rktg~~~HLADsI~~~~~niDg~~dG~ 80 (168) T protein:vir:39 1 MVSFYDAMQLIINQAESLSTKMTVEDKAEVTKAGAKVFEQALAYEVRNRHYRHRDTGEDPHLADSIVMKNKNIDGVKDGQ 80 (168) T ss_pred CccHHHHHHHHHHHHHhccCCCCHHHHHHHHHHhHHHHHHHHHHHhHHhcccCCCCCCCccchhheeecccccCcccCCc Confidence 9999999999999999998 568999999999999999999999999999999999999999999999999999999999 Q ss_pred eEeccCCC------CceeEEeecccCc------------------cccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018285. 82 ATVGWKNN------YHAQNARRLNDGT------------------KKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIR 137 (142) Q Consensus 82 ~~VG~~k~------~~a~~A~f~n~GT------------------~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~ 137 (142) |+|||+++ +|+|||||+|||| ++|+++|||+++|++++++++||+|++++|++||+ T Consensus 81 StVGw~~k~~~~~~~~a~iAr~lNDGTrf~~~~~~~~~~y~~~g~v~i~gDHFvd~~r~~~a~k~aV~~Ae~e~~~eil~ 160 (168) T protein:vir:39 81 SVVGWERSTEKGTHTKGYIANIINNGSRFPQFTTRSGRKYKNPGEVAVHADHFIEETRKNPIVQQGILKAEAEAMRKIIN 160 (168) T ss_pred eeccccCccccccccchhheehhccccccchhhhhcccccccccceeecccchhHHHhhhhhhhHHHHHHHHHHHHHHHH Confidence 99999976 6999999999999 47999999999999998899999999999999999 Q ss_pred hcCCC Q lcl|NC_018285. 138 RKGGK 142 (142) Q Consensus 138 ~k~g~ 142 (142) +|||+ T Consensus 161 ~k~~~ 165 (168) T protein:vir:39 161 RKKKE 165 (168) T ss_pred hcCCC Confidence 99999 No 7 >protein:vir:100887 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358767;genbank:gi:77999993;genbank:GeneID:3726158 Probab=100.00 E-value=9.4e-53 Score=305.82 Aligned_cols=135 Identities=43% Similarity=0.714 Sum_probs=129.2 Q ss_pred chHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCC-CCCcccchhcceecCcccccccccee Q lcl|NC_018285. 4 VGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKK-DLKYGHMADGLSVQSTNADGRKNGVA 82 (142) Q Consensus 4 ~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~-~~k~~HlaD~I~~~~~~~dg~~~G~~ 82 (142) .+|+++|++|+++|++|++++++++++||+|||+||++.|+++||++||++++ +++++||+|+|+++++++||+.+|++ T Consensus 1 v~~~~~lee~l~~i~kl~~~~~~~~~ki~kaGA~v~~e~L~~~tp~~~~~~~~~~~~~~HlaD~I~~s~~~~dg~~~g~~ 80 (139) T protein:vir:10 1 MDMDEALGQWLKQVSKAAELSISDQEKITKAGADVYAKKLAETTKEKHPNTKGDGGKYGHLSEDIRSAAGDIDGDHNGSS 80 (139) T ss_pred CCHHHHHHHHHHHHHHhhccCHHHHHHHHHHHHHHHHHHHHHhcccccCcCCCCCCCCcchhhcceecCcccccccceee Confidence 89999999999999999999999999999999999999999999999998764 67899999999999999999999999 Q ss_pred EeccCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC Q lcl|NC_018285. 83 TVGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIRRKGGK 142 (142) Q Consensus 83 ~VG~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~~k~g~ 142 (142) +|||+++ +|+|||+||||++|||||||++|++++ +.+||+||.++|+++|++|+|. T Consensus 81 ~VG~~k~--~~~A~f~n~GT~k~~~~hFie~t~~e~--~~evl~a~~~~~k~~l~~~~~~ 136 (139) T protein:vir:10 81 TVGFHNK--AHIARFLNDGTKYIRADHFVDNARDDA--KDAVFAAEAEKYQAMIAKANGG 136 (139) T ss_pred eeCCCCC--cceEeecccCccccCCCchHHHHHHHH--HHHHHHHHHHHHHHHHhhcCCC Confidence 9999864 899999999999999999999999987 5899999999999999999998 No 8 >protein:vir:7412 Length: 168 # NCBI annotation: hypothetical protein # Family: family:all:1029 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839929;genbank:gi:30089899;genbank:GeneID:1260686 Probab=100.00 E-value=5.2e-53 Score=307.21 Aligned_cols=140 Identities=31% Similarity=0.542 Sum_probs=135.1 Q ss_pred cchHHHHHHHHHHHHHHhc-cccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCccccccccce Q lcl|NC_018285. 3 MVGLDEALEGWLETVASIG-DITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNGV 81 (142) Q Consensus 3 m~~~~~~l~e~~~~l~kl~-~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G~ 81 (142) |++|+++|++|+++|++|+ .++++++.+||+|||+||++.|+++||++||++|++++++||||+|+++++||||..||+ T Consensus 1 M~~~~~~l~~~~~~vekl~~~lt~eqkakITkAGAkv~~~~L~~~t~~kHy~~k~t~~~~HLaDsI~~~~~niDg~~dG~ 80 (168) T protein:vir:74 1 MATFEEAMQLIINQAESLSTKMTVEDKAEVTKAGAKVFEQALAYEVRNRHYRHRDTGEDPHLADSIVMKNKNIDGVKDGQ 80 (168) T ss_pred CccHHHHHHHHHHHHHhhccCCCHHHHHHHHHhhhHHHHHHHHHHhHHhhcccCCCcccchhhhheeecccccCcccCCc Confidence 9999999999999999998 579999999999999999999999999999999999999999999999999999999999 Q ss_pred eEeccCCC------CceeEEeecccCcc------------------ccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018285. 82 ATVGWKNN------YHAQNARRLNDGTK------------------KYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIR 137 (142) Q Consensus 82 ~~VG~~k~------~~a~~A~f~n~GT~------------------k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~ 137 (142) |+|||+++ +|||+|||+||||+ +||++|||+++|++.+++++||+|++++|++||+ T Consensus 81 s~VGf~~k~~~~~~~kA~iAr~lNDGTk~~~~~~~~~~~~~~~g~v~i~gDHFvd~~r~~~~~k~~V~~Ae~~~y~eIl~ 160 (168) T protein:vir:74 81 SVVGWERSTEKGTHTKGYIANIINNGSRFPQFTTRSGRKYKKPGEVAVHADHFIEETRMNLIVQQGILKAEAEAMRKIIN 160 (168) T ss_pred eeecccccccccccchhhhhhhhcccccccccccccccccccccccccccchhHHHHHhhhhhHHHHHHHHHHHHHHHHH Confidence 99999975 48999999999994 7999999999999987789999999999999999 Q ss_pred hcCCC Q lcl|NC_018285. 138 RKGGK 142 (142) Q Consensus 138 ~k~g~ 142 (142) +|||+ T Consensus 161 ~k~~~ 165 (168) T protein:vir:74 161 RKKKE 165 (168) T ss_pred hhcCC Confidence 99999 No 9 >protein:vir:1028 Length: 168 # NCBI annotation: Orf48 # Family: family:all:1029 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076682;genbank:gi:13095791;genbank:GeneID:920342 Probab=100.00 E-value=8.1e-53 Score=306.16 Aligned_cols=140 Identities=30% Similarity=0.546 Sum_probs=135.7 Q ss_pred cchHHHHHHHHHHHHHHhc-cccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCccccccccce Q lcl|NC_018285. 3 MVGLDEALEGWLETVASIG-DITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNGV 81 (142) Q Consensus 3 m~~~~~~l~e~~~~l~kl~-~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G~ 81 (142) |++|+++|++|++++++|+ .++.+++.+||+|||+||++.|+++||++||++|+++++|||||+|+++++||||..||+ T Consensus 1 M~~~~d~l~~~~~~vekl~~~ls~eqkakITkAGAkv~~~~L~~~tk~kHy~~k~t~~~~HLaDsI~~~~~niDg~~dG~ 80 (168) T protein:vir:10 1 MVSFYDAMQLIVDRAEELSTKMSVEDKAEVTKAGAKVFEQALAYEVRNRHYRHRDTGEDPHLADSIVMKNKNIDGVKDGQ 80 (168) T ss_pred CCcHHHHHHHHHHHHHHhhcCCCHHHHHHHhHhhhHHHHHHHHHHhhHhhhccCCCCccchhhhhheecccccccccCCc Confidence 9999999999999999985 789999999999999999999999999999999999999999999999999999999999 Q ss_pred eEeccCCC------CceeEEeecccCcc------------------ccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018285. 82 ATVGWKNN------YHAQNARRLNDGTK------------------KYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIR 137 (142) Q Consensus 82 ~~VG~~k~------~~a~~A~f~n~GT~------------------k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~ 137 (142) |+|||+++ +|||+|||+||||+ +||++||||++|++++++++||+|++++|++||+ T Consensus 81 s~VGf~~k~~~~~~~ka~iAr~lNDGTk~~~~~~~~~~~~~~~g~v~i~gDHFvd~~r~d~a~k~~V~~Ae~~~y~eIl~ 160 (168) T protein:vir:10 81 SVVGWERSTEKGTHTKGYIANIINNGSRFPQFTTRSGRKYKKPGEVAVHADHFIEETRKNPIVQQGILKAEAEAMRKIIN 160 (168) T ss_pred eeecccCccccccccchheeeeccccccccccccccccccccccccccccchhHHHhhhchhhhHHHHHHHHHHHHHHHH Confidence 99999976 69999999999994 7999999999999998899999999999999999 Q ss_pred hcCCC Q lcl|NC_018285. 138 RKGGK 142 (142) Q Consensus 138 ~k~g~ 142 (142) +|+|+ T Consensus 161 ~k~~~ 165 (168) T protein:vir:10 161 RKKKE 165 (168) T ss_pred hhcCC Confidence 99999 No 10 >protein:vir:3848 Length: 159 # NCBI annotation: hypothetical protein # Family: family:all:1029 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050154;swissprot:trembl:q9t1f3;genbank:gi:9633046;uniprot:Q9T1F3;genbank:GeneID:1262148 Probab=100.00 E-value=3.3e-52 Score=302.84 Aligned_cols=139 Identities=24% Similarity=0.403 Sum_probs=131.6 Q ss_pred ccchHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCC--------------CCCcccchhcc Q lcl|NC_018285. 2 AMVGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKK--------------DLKYGHMADGL 67 (142) Q Consensus 2 ~m~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~--------------~~k~~HlaD~I 67 (142) +|++|+++|++|+++|+++..++++++++||+|||+||++.|+++||++||++|+ +++++||||+| T Consensus 1 mm~~~~~~l~~~l~~v~k~~~~~~~~k~kiTkAGAkv~~e~L~~~Tp~~h~~~~k~~~~~~~~~k~~~~~~~~~HlaD~I 80 (159) T protein:vir:38 1 MANDMGEFYNNWVNEVEKGMKLSVEDKAKITGEGAEAFSTVLHDHTPRSNEIYRRGRSAGHANAKHHNRNRKTKHLQDSI 80 (159) T ss_pred CcchHHHHHHHHHHHHHHhcCCCHHHHHHHHHHhHHHHHHHHHHhcccCCCccccccccccccccccCcCcCCCccccce Confidence 9999999999999999999999999999999999999999999999999999876 78899999999 Q ss_pred eecC-ccccccccceeEeccCCCCceeEEeecccCccccCCC-----chhhHHHHHHHHHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_018285. 68 SVQS-TNADGRKNGVATVGWKNNYHAQNARRLNDGTKKYRAD-----HFVTNVQNDSSVQKKVLLAEKAEYEKLIRRKGG 141 (142) Q Consensus 68 ~~~~-~~~dg~~~G~~~VG~~k~~~a~~A~f~n~GT~k~~~~-----hFie~t~~e~~~~~~vl~A~~~~~k~~l~~k~g 141 (142) +|++ ++|||..||+|+|||++++++|||||+||||++|||+ ||||+||+++ +.+||+||.++|++||+.--- T Consensus 81 ~~~~~~~iDg~~dG~s~VGw~~~~~a~~a~f~NdGT~~m~~k~~~gdHFvekt~~~~--k~~Vl~A~~~~~~~il~~~~~ 158 (159) T protein:vir:38 81 TYKPGYTADKLHTGDTDVGFEGKYYDFLAKIVNNGQHHMSPKRYKNMHFLDKAQQEA--KKSVAEAELKAYKEVMNHDSD 158 (159) T ss_pred eeecCccccccccceeeecccCCccceEeeecccCccccCCCCccCChhHHHHHHHH--HHHHHHHHHHHHHHHhhcccC Confidence 9988 6999999999999999889999999999999999875 9999999987 589999999999999987666 Q ss_pred C Q lcl|NC_018285. 142 K 142 (142) Q Consensus 142 ~ 142 (142) + T Consensus 159 ~ 159 (159) T protein:vir:38 159 K 159 (159) T ss_pred C Confidence 6 No 11 >protein:vir:1087 Length: 161 # NCBI annotation: Orf46 # Family: family:all:1029 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076741;genbank:gi:13095851;genbank:GeneID:920400 Probab=100.00 E-value=9.2e-51 Score=294.90 Aligned_cols=140 Identities=32% Similarity=0.554 Sum_probs=130.9 Q ss_pred ccch---HHHHHHHHHHHHHHhc-cccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCcccccc Q lcl|NC_018285. 2 AMVG---LDEALEGWLETVASIG-DITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGR 77 (142) Q Consensus 2 ~m~~---~~~~l~e~~~~l~kl~-~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~ 77 (142) .|.+ |+++|++|+++|+||+ +++++++++||+|||+||++.|+++||++||++|+|+++|||||+|+++++||||. T Consensus 1 ~~~~~~~fdd~L~~~~~~v~klv~~lt~e~kakIT~AGAkv~a~~L~~~T~~kHy~~~kt~k~~HLADsI~~~~~niDg~ 80 (161) T protein:vir:10 1 MMEEKQLFEDIMNGIIFQAESVSTSLTVEDKAKITKAGANAFAIGLEKVTKDKHYRIRKTGENPHLADSILVQNTNIDGI 80 (161) T ss_pred CcchhHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHhHHHHHHHHHHHhhhhcCcCCCCCCcchhhhheeecccccCcc Confidence 2322 7899999999999998 56899999999999999999999999999999999999999999999999999999 Q ss_pred ccceeEeccCCCCceeEEeecccCc-----------------cccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhhcC Q lcl|NC_018285. 78 KNGVATVGWKNNYHAQNARRLNDGT-----------------KKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIRRKG 140 (142) Q Consensus 78 ~~G~~~VG~~k~~~a~~A~f~n~GT-----------------~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~~k~ 140 (142) .||+|+|||+++ ++|||||+|||| ++|+++|||+++|+..+++++||+|++++|++||++|| T Consensus 81 ~dG~StVGw~~k-ka~ia~~indGtr~~~~~~~~~~~~n~Gt~~i~gDHFvd~~r~~~~~k~aV~~Ae~~~y~eil~~k~ 159 (161) T protein:vir:10 81 KDGNSTVGWDYT-KSRVGHLIENGTRFPMYSKKGTKYRKGGQVAITSDPFVSTYRDSMEAQVAMFSAEAEVFSEILKKKG 159 (161) T ss_pred cCCceeccccCc-hhhhhhhhcccchhhhhhcccccccCCcceeecCcchhHHHHhhhhhHHHHHHHHHHHHHHHHHhhc Confidence 999999999865 589999998887 67999999999999877789999999999999999999 Q ss_pred CC Q lcl|NC_018285. 141 GK 142 (142) Q Consensus 141 g~ 142 (142) |+ T Consensus 160 ~~ 161 (161) T protein:vir:10 160 AE 161 (161) T ss_pred CC Confidence 99 No 12 >protein:vir:81106 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429878;genbank:gi:156603931;genbank:GeneID:5525326 Probab=100.00 E-value=8.3e-35 Score=207.44 Aligned_cols=123 Identities=16% Similarity=0.160 Sum_probs=101.6 Q ss_pred CccchHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCc-ccccccc Q lcl|NC_018285. 1 MAMVGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQST-NADGRKN 79 (142) Q Consensus 1 m~m~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~-~~dg~~~ 79 (142) =+..+| ++|...|++|+..+.+..++||++||++|++.|+++||++. ..+||+|+|++|+. +.++..+ T Consensus 2 ~v~v~~----~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~-------~~~hl~d~I~vs~~k~~~~~g~ 70 (125) T protein:vir:81 2 GARIES----NNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNTPFAN-------TKKHARDHIAVSNVKTDRHTSE 70 (125) T ss_pred eeEeeH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCC-------CCchhhhheeecccccccccce Confidence 122334 34555666666667788999999999999999999999742 23699999999984 4445667 Q ss_pred ceeEeccCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018285. 80 GVATVGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIR 137 (142) Q Consensus 80 G~~~VG~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~ 137 (142) .++.|||++. ++|||||+||||++||||||+++|++++ +++|++++.++|+++++ T Consensus 71 ~~v~VG~~k~-~~~~a~F~E~GT~k~~a~pF~~~a~~~~--~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:81 71 KIVTIGYAKG-VSHRIHATEFGTMYQKPQLFITKTEKQG--KNKVLKTMLDTAKRLQK 125 (125) T ss_pred EEEEeccCCC-CceEEEeccCCccCCCCCchhhHHHHHh--HHHHHHHHHHHHHHHhC Confidence 7899999864 6899999999999999999999999998 48999999999999998 No 13 >protein:vir:98342 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918934;genbank:gi:119443696;genbank:GeneID:4594504 Probab=100.00 E-value=8.3e-35 Score=207.44 Aligned_cols=123 Identities=16% Similarity=0.160 Sum_probs=101.6 Q ss_pred CccchHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCc-ccccccc Q lcl|NC_018285. 1 MAMVGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQST-NADGRKN 79 (142) Q Consensus 1 m~m~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~-~~dg~~~ 79 (142) =+..+| ++|...|++|+..+.+..++||++||++|++.|+++||++. ..+||+|+|++|+. +.++..+ T Consensus 2 ~v~v~~----~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~-------~~~hl~d~I~vs~~k~~~~~g~ 70 (125) T protein:vir:98 2 GARIES----NNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNTPFAN-------TKKHARDHIAVSNVKTDRHTSE 70 (125) T ss_pred eeEeeH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCC-------CCchhhhheeecccccccccce Confidence 122334 34555666666667788999999999999999999999742 23699999999984 4445667 Q ss_pred ceeEeccCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018285. 80 GVATVGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIR 137 (142) Q Consensus 80 G~~~VG~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~ 137 (142) .++.|||++. ++|||||+||||++||||||+++|++++ +++|++++.++|+++++ T Consensus 71 ~~v~VG~~k~-~~~~a~F~E~GT~k~~a~pF~~~a~~~~--~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:98 71 KIVTIGYAKG-VSHRIHATEFGTMYQKPQLFITKTEKQG--KNKVLKTMLDTAKRLQK 125 (125) T ss_pred EEEEeccCCC-CceEEEeccCCccCCCCCchhhHHHHHh--HHHHHHHHHHHHHHHhC Confidence 7899999864 6899999999999999999999999998 48999999999999998 No 14 >protein:vir:9414 Length: 125 # NCBI annotation: phi PVL orf 11-like protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803392;genbank:gi:29028704;genbank:GeneID:1258141 Probab=100.00 E-value=8.3e-35 Score=207.44 Aligned_cols=123 Identities=16% Similarity=0.160 Sum_probs=101.6 Q ss_pred CccchHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCc-ccccccc Q lcl|NC_018285. 1 MAMVGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQST-NADGRKN 79 (142) Q Consensus 1 m~m~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~-~~dg~~~ 79 (142) =+..+| ++|...|++|+..+.+..++||++||++|++.|+++||++. ..+||+|+|++|+. +.++..+ T Consensus 2 ~v~v~~----~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~-------~~~hl~d~I~vs~~k~~~~~g~ 70 (125) T protein:vir:94 2 GARIES----NNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNTPFAN-------TKKHARDHIAVSNVKTDRHTSE 70 (125) T ss_pred eeEeeH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCC-------CCchhhhheeecccccccccce Confidence 122334 34555666666667788999999999999999999999742 23699999999984 4445667 Q ss_pred ceeEeccCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018285. 80 GVATVGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIR 137 (142) Q Consensus 80 G~~~VG~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~ 137 (142) .++.|||++. ++|||||+||||++||||||+++|++++ +++|++++.++|+++++ T Consensus 71 ~~v~VG~~k~-~~~~a~F~E~GT~k~~a~pF~~~a~~~~--~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:94 71 KIVTIGYAKG-VSHRIHATEFGTMYQKPQLFITKTEKQG--KNKVLKTMLDTAKRLQK 125 (125) T ss_pred EEEEeccCCC-CceEEEeccCCccCCCCCchhhHHHHHh--HHHHHHHHHHHHHHHhC Confidence 7899999864 6899999999999999999999999998 48999999999999998 No 15 >protein:vir:79988 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430006;genbank:gi:156604061;genbank:GeneID:5525448 Probab=100.00 E-value=8.3e-35 Score=207.44 Aligned_cols=123 Identities=16% Similarity=0.160 Sum_probs=101.6 Q ss_pred CccchHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCc-ccccccc Q lcl|NC_018285. 1 MAMVGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQST-NADGRKN 79 (142) Q Consensus 1 m~m~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~-~~dg~~~ 79 (142) =+..+| ++|...|++|+..+.+..++||++||++|++.|+++||++. ..+||+|+|++|+. +.++..+ T Consensus 2 ~v~v~~----~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~-------~~~hl~d~I~vs~~k~~~~~g~ 70 (125) T protein:vir:79 2 GARIES----NNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNTPFAN-------TKKHARDHIAVSNVKTDRHTSE 70 (125) T ss_pred eeEeeH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCC-------CCchhhhheeecccccccccce Confidence 122334 34555666666667788999999999999999999999742 23699999999984 4445667 Q ss_pred ceeEeccCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018285. 80 GVATVGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIR 137 (142) Q Consensus 80 G~~~VG~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~ 137 (142) .++.|||++. ++|||||+||||++||||||+++|++++ +++|++++.++|+++++ T Consensus 71 ~~v~VG~~k~-~~~~a~F~E~GT~k~~a~pF~~~a~~~~--~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:79 71 KIVTIGYAKG-VSHRIHATEFGTMYQKPQLFITKTEKQG--KNKVLKTMLDTAKRLQK 125 (125) T ss_pred EEEEeccCCC-CceEEEeccCCccCCCCCchhhHHHHHh--HHHHHHHHHHHHHHHhC Confidence 7899999864 6899999999999999999999999998 48999999999999998 No 16 >protein:vir:4704 Length: 125 # NCBI annotation: phi PVL ORF 11 homologue # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061636;genbank:gi:9635723;genbank:GeneID:1262995 Probab=100.00 E-value=8.3e-35 Score=207.44 Aligned_cols=123 Identities=16% Similarity=0.160 Sum_probs=101.6 Q ss_pred CccchHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCc-ccccccc Q lcl|NC_018285. 1 MAMVGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQST-NADGRKN 79 (142) Q Consensus 1 m~m~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~-~~dg~~~ 79 (142) =+..+| ++|...|++|+..+.+..++||++||++|++.|+++||++. ..+||+|+|++|+. +.++..+ T Consensus 2 ~v~v~~----~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~-------~~~hl~d~I~vs~~k~~~~~g~ 70 (125) T protein:vir:47 2 GARIES----NNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNTPFAN-------TKKHARDHIAVSNVKTDRHTSE 70 (125) T ss_pred eeEeeH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCC-------CCchhhhheeecccccccccce Confidence 122334 34555666666667788999999999999999999999742 23699999999984 4445667 Q ss_pred ceeEeccCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018285. 80 GVATVGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIR 137 (142) Q Consensus 80 G~~~VG~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~ 137 (142) .++.|||++. ++|||||+||||++||||||+++|++++ +++|++++.++|+++++ T Consensus 71 ~~v~VG~~k~-~~~~a~F~E~GT~k~~a~pF~~~a~~~~--~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:47 71 KIVTIGYAKG-VSHRIHATEFGTMYQKPQLFITKTEKQG--KNKVLKTMLDTAKRLQK 125 (125) T ss_pred EEEEeccCCC-CceEEEeccCCccCCCCCchhhHHHHHh--HHHHHHHHHHHHHHHhC Confidence 7899999864 6899999999999999999999999998 48999999999999998 No 17 >protein:vir:9708 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795470;genbank:gi:28876221;genbank:GeneID:1257765 Probab=99.96 E-value=6.8e-34 Score=202.40 Aligned_cols=125 Identities=16% Similarity=0.145 Sum_probs=113.6 Q ss_pred HHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCccccccccceeEec Q lcl|NC_018285. 6 LDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNGVATVG 85 (142) Q Consensus 6 ~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G~~~VG 85 (142) |-+||+||+++|++|+....++.++|+++||+++++.|++++|+++... .+||+|+|++++++.++..++++.|| T Consensus 1 mv~Gl~el~~~l~~l~~~~~~~~~~al~~ga~~~~~~~k~~ap~~~~~~-----~~hl~d~I~~~~~k~~~~g~~~~~VG 75 (125) T protein:vir:97 1 MTKGLDEILANLTKLEVKAPKTAKAAVTEVAKEFEKALKANTPVYEVET-----DERLQEDTVISGFKGANVGIVSKEIG 75 (125) T ss_pred CchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCCc-----hhhHHhhhhcccccccccCceEEEEe Confidence 7899999999999999888899999999999999999999999865332 35999999999987777677799999 Q ss_pred cCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_018285. 86 WKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIRR 138 (142) Q Consensus 86 ~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~~ 138 (142) |++. ++|++||+||||++|||+||++++++++ +.+|++++.++|++.|+= T Consensus 76 ~~k~-~~~y~~f~E~GT~k~~~~pF~~pa~~~~--k~~~~~~~~~~~~~~L~l 125 (125) T protein:vir:97 76 YGKA-TGWRAHYPNDGTIYQRGQDFKERTINQM--TPKAKQLYAEKVKEGLGL 125 (125) T ss_pred ecCC-CceeEeeeccCccCCCcCccchHhHHHh--HHHHHHHHHHHHHHHhcC Confidence 9864 6899999999999999999999999987 489999999999999988 No 18 >protein:vir:1273 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690765;genbank:gi:22855005;genbank:GeneID:955232 Probab=99.95 E-value=6.4e-32 Score=191.60 Aligned_cols=126 Identities=15% Similarity=0.187 Sum_probs=113.1 Q ss_pred cchHH-HHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCccccccccce Q lcl|NC_018285. 3 MVGLD-EALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNGV 81 (142) Q Consensus 3 m~~~~-~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G~ 81 (142) |++++ +||+||++.|++|.....+..++|+++||+++++.++.++|+++++ .+||+|+|.+++...++....+ T Consensus 1 M~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~~k~~ap~~~~~------tg~l~~~I~~~~~k~~~~g~~~ 74 (127) T protein:vir:12 1 MADMSFDGIDDLTQYFEKIGGDIEKVEPVALKAGGEIIAERQRSHVNRSDKK------QPHMQDNITVSNVRESKDGVRF 74 (127) T ss_pred CeeeeehhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCC------hhHHHHhhhccccccccCceeE Confidence 89999 8999999999999988888999999999999999999999985433 2699999999887665544568 Q ss_pred eEeccCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018285. 82 ATVGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIR 137 (142) Q Consensus 82 ~~VG~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~ 137 (142) +.|||++ .++|++||+|+||++|||+||+.++.+++ +.+|++++.++|++.|+ T Consensus 75 v~Vg~~~-~~~~y~~f~E~GT~~~~a~Pf~~pa~~~~--~~~~~~~~~~~~~~~lk 127 (127) T protein:vir:12 75 VAVGPNK-KVAYRGRFLEWGTSKMPPQPFIEKGGKEG--EGPAVELMERILTAPIK 127 (127) T ss_pred EEEeeCC-CCcceeeeeccCccCCCCCccchHhHHHH--HHHHHHHHHHHHHHhcC Confidence 9999986 46899999999999999999999999987 47899999999999999 No 19 >protein:vir:3873 Length: 128 # NCBI annotation: putative head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680490;swissprot:trembl:p94214;genbank:gi:22296530;interpro:IPR010064;uniprot:P94214;genbank:GeneID:951688 Probab=99.95 E-value=1.8e-31 Score=189.17 Aligned_cols=127 Identities=20% Similarity=0.160 Sum_probs=109.5 Q ss_pred cchHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCcc-ccccccce Q lcl|NC_018285. 3 MVGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTN-ADGRKNGV 81 (142) Q Consensus 3 m~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~-~dg~~~G~ 81 (142) |.--=+||+||+++|++|+.+..++.++||++||+++++.|++++|++..+. ..++||+|+|.+++.. .+| .++ T Consensus 1 m~v~i~Gl~el~~~l~~l~~~~~k~~~~al~~ga~~~~~~~k~~ap~~~~~~---~~~~h~~d~I~~~~~k~~~g--~~~ 75 (128) T protein:vir:38 1 MGVKVTGDAELLANLNKLQFGVAKEARAAVRDGAQKFADKLKSNTPEWDGET---DMSGHLRDDIKLSSVRETSG--LTE 75 (128) T ss_pred CccchhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCCC---cccchhhhhhccccccccCc--eeE Confidence 3322389999999999999988899999999999999999999999865443 3457999999998743 344 468 Q ss_pred eEeccCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018285. 82 ATVGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIR 137 (142) Q Consensus 82 ~~VG~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~ 137 (142) +.|||++. ++||+||+||||++|||+||++++++++ +.++++++.++|++.|= T Consensus 76 ~~VG~~k~-~~~y~~f~E~GT~k~~a~pF~~pa~~~~--~~~~~~~~~~~l~k~i~ 128 (128) T protein:vir:38 76 VDVGYGKD-TGWRAHFPNSGTSMQDPQHFIEETQEIM--RPVVIAAFLSHLKEGGM 128 (128) T ss_pred EEeeecCC-CceEEeeeccCccCCCCCcchhHHHHHh--HHHHHHHHHHHHHhhcC Confidence 99999865 6899999999999999999999999987 48999999999998877 No 20 >protein:vir:1386 Length: 149 # NCBI annotation: Gp9 protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612838;genbank:gi:20065972;genbank:GeneID:935787 Probab=99.94 E-value=4.8e-31 Score=186.81 Aligned_cols=139 Identities=15% Similarity=0.200 Sum_probs=114.1 Q ss_pred Ccc-chHH-HHHHHHHHHHHHhcc--ccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCC----CCCCcccchhcceecCc Q lcl|NC_018285. 1 MAM-VGLD-EALEGWLETVASIGD--ITPAEQAKITTAGAKVFQKELEEVTREKHYSNK----KDLKYGHMADGLSVQST 72 (142) Q Consensus 1 m~m-~~~~-~~l~e~~~~l~kl~~--~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~----~~~k~~HlaD~I~~~~~ 72 (142) ||. ++++ +||+||+++|++|+. ...+..++|+++||+++++.++.++|++..+.. .+...+||+|+|++++. T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~~k~~~~Al~~ga~~v~~~~k~~aP~~~~~~~~~~~~~~~~~~~~d~i~~~~~ 80 (149) T protein:vir:13 1 MSDGWEIKFEGLDDLIKTFEQLGTEKENEDVEKSILKECGDLAKKTVAPLIHISDDNSKSGRKGSRPPGHAANNIPEPKI 80 (149) T ss_pred CCceeEEEeecHHHHHHHHHhcccHHHHHHHHHHHHHHHHHHHHHHHHHhCCccCCccccccccccccchhhhcceeccc Confidence 443 3477 899999999999973 456778899999999999999999998633221 11224699999999986 Q ss_pred cccccccceeEeccCCC--CceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC Q lcl|NC_018285. 73 NADGRKNGVATVGWKNN--YHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIRRKGGK 142 (142) Q Consensus 73 ~~dg~~~G~~~VG~~k~--~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~~k~g~ 142 (142) ..++. .-++.|||.+. ..+|++||+||||++|||+||+.++.+++ ++++++++.++|++.|++..|+ T Consensus 81 ~~~~g-~~~~~VG~~~~~~~~~~y~~f~E~GT~k~~a~pF~~pa~~~~--~~~~~~~~~~~l~k~i~~~lG~ 149 (149) T protein:vir:13 81 RKKKG-NLQCVVGWEKSDNTPFYYMKMEEWGTSERPPHHAFGKTNKIL--KRVYDNIAQKKYDNFVKEKLGD 149 (149) T ss_pred ccccc-eeEEEeeccCCCCCccceeeeeccCccCCCCCccchHHHHHH--HHHHHHHHHHHHHHHHHHHhcC Confidence 65442 22688999764 34599999999999999999999999987 4899999999999999999999 No 21 >protein:vir:105089 Length: 133 # NCBI annotation: Gp11 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006591;genbank:gi:46402097;genbank:GeneID:2777955 Probab=99.91 E-value=2.9e-28 Score=171.58 Aligned_cols=129 Identities=19% Similarity=0.242 Sum_probs=107.0 Q ss_pred cchHH-HHHHHHHHHHHHhcccc-HHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecC-cccc-ccc Q lcl|NC_018285. 3 MVGLD-EALEGWLETVASIGDIT-PAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQS-TNAD-GRK 78 (142) Q Consensus 3 m~~~~-~~l~e~~~~l~kl~~~~-~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~-~~~d-g~~ 78 (142) |++|+ +||++|+++|++|.... .+..++|+.+||+++++.++.++|++ +..+ .+||+|+|.++. .+.+ +.. T Consensus 1 M~~~~i~Gl~el~~~l~~L~~~~~~k~~~~Al~~~a~~i~~~ak~~ap~~---~~~~--~~~~~~~I~v~~~~~~~~~~~ 75 (133) T protein:vir:10 1 MIRMEVKGLDELERQLTALGEKVATKVLRDAGREALKVVEEDMKQHAGFD---ETST--GQHMRDSIKIRSSTRKAQGNA 75 (133) T ss_pred CeeEeeehHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC---CCcc--hhhhhhcccccccccccCccc Confidence 99999 99999999999998764 34668999999999999999999975 2223 359999999854 2222 122 Q ss_pred cceeEeccCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_018285. 79 NGVATVGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIRRK 139 (142) Q Consensus 79 ~G~~~VG~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~~k 139 (142) .-++.||+++. ..|++||+||||++|||+||+.++.+++ ++++++++.++|++.|.+| T Consensus 76 ~~~v~vg~~~~-~~~y~~f~E~GT~k~~a~PF~~pA~~~~--~~~~~~~~~~~~~~~l~K~ 133 (133) T protein:vir:10 76 VVTLRVGPSKQ-HHMKVLAQEFGTVKQVADPFIRPALDYN--VQTVLRVLTVEIRNGIQNR 133 (133) T ss_pred eEEEEecCCCC-ccceEeeeccCCCCCCCCccchHHHHHh--HHHHHHHHHHHHHHHhhcC Confidence 23567888754 4699999999999999999999999987 4899999999999999999 No 22 >protein:vir:107568 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338191;genbank:gi:77020147;genbank:GeneID:3703699 Probab=99.91 E-value=1.4e-27 Score=167.83 Aligned_cols=137 Identities=19% Similarity=0.160 Sum_probs=112.0 Q ss_pred Ccc-chHH-HHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCC-----CCCcccchhcceecCcc Q lcl|NC_018285. 1 MAM-VGLD-EALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKK-----DLKYGHMADGLSVQSTN 73 (142) Q Consensus 1 m~m-~~~~-~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~-----~~k~~HlaD~I~~~~~~ 73 (142) ||. .+++ +||++|+++|++|.....+..++|+++||+++++.++.++|++....+. ....+|++|+|.+++.. T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ 80 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAK 80 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceecccc Confidence 553 2345 8999999999999988888999999999999999999999975433221 12346999999998855 Q ss_pred ccccccceeEeccCCC--CceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhhcC Q lcl|NC_018285. 74 ADGRKNGVATVGWKNN--YHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIRRKG 140 (142) Q Consensus 74 ~dg~~~G~~~VG~~k~--~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~~k~ 140 (142) .++. ..++.|||++. +.+|++||+|+||++|||+||+.++.+++ +++|++++.++|++.|++=- T Consensus 81 ~~~g-~~~~~vg~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pa~~~~--k~~~~~~~~~~l~~~l~ka~ 146 (146) T protein:vir:10 81 LEGG-IKTVKIGLNKADRSPWFYLKFHEWGTSKMPAHPFIEPGFNAS--KAEAVRAMTDILKNEMRLDL 146 (146) T ss_pred cccc-ceeEEeeeccCCCCCcceeeeeccCCCCCCCCcchhHHHHHh--HHHHHHHHHHHHHHHHhhcC Confidence 5432 34678999753 56799999999999999999999999988 47899999999999987766 No 23 >protein:vir:102085 Length: 146 # NCBI annotation: head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512318;genbank:gi:89152487;genbank:GeneID:3953078 Probab=99.91 E-value=1.4e-27 Score=167.83 Aligned_cols=137 Identities=19% Similarity=0.160 Sum_probs=112.0 Q ss_pred Ccc-chHH-HHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCC-----CCCcccchhcceecCcc Q lcl|NC_018285. 1 MAM-VGLD-EALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKK-----DLKYGHMADGLSVQSTN 73 (142) Q Consensus 1 m~m-~~~~-~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~-----~~k~~HlaD~I~~~~~~ 73 (142) ||. .+++ +||++|+++|++|.....+..++|+++||+++++.++.++|++....+. ....+|++|+|.+++.. T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ 80 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAK 80 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceecccc Confidence 553 2345 8999999999999988888999999999999999999999975433221 12346999999998855 Q ss_pred ccccccceeEeccCCC--CceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhhcC Q lcl|NC_018285. 74 ADGRKNGVATVGWKNN--YHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIRRKG 140 (142) Q Consensus 74 ~dg~~~G~~~VG~~k~--~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~~k~ 140 (142) .++. ..++.|||++. +.+|++||+|+||++|||+||+.++.+++ +++|++++.++|++.|++=- T Consensus 81 ~~~g-~~~~~vg~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pa~~~~--k~~~~~~~~~~l~~~l~ka~ 146 (146) T protein:vir:10 81 LEGG-IKTVKIGLNKADRSPWFYLKFHEWGTSKMPAHPFIEPGFNAS--KAEAVRAMTDILKNEMRLDL 146 (146) T ss_pred cccc-ceeEEeeeccCCCCCcceeeeeccCCCCCCCCcchhHHHHHh--HHHHHHHHHHHHHHHHhhcC Confidence 5432 34678999753 56799999999999999999999999988 47899999999999987766 No 24 >protein:vir:102875 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338140;genbank:gi:77020200;genbank:GeneID:3703784 Probab=99.91 E-value=1.4e-27 Score=167.83 Aligned_cols=137 Identities=19% Similarity=0.160 Sum_probs=112.0 Q ss_pred Ccc-chHH-HHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCC-----CCCcccchhcceecCcc Q lcl|NC_018285. 1 MAM-VGLD-EALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKK-----DLKYGHMADGLSVQSTN 73 (142) Q Consensus 1 m~m-~~~~-~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~-----~~k~~HlaD~I~~~~~~ 73 (142) ||. .+++ +||++|+++|++|.....+..++|+++||+++++.++.++|++....+. ....+|++|+|.+++.. T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ 80 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAK 80 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceecccc Confidence 553 2345 8999999999999988888999999999999999999999975433221 12346999999998855 Q ss_pred ccccccceeEeccCCC--CceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhhcC Q lcl|NC_018285. 74 ADGRKNGVATVGWKNN--YHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIRRKG 140 (142) Q Consensus 74 ~dg~~~G~~~VG~~k~--~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~~k~ 140 (142) .++. ..++.|||++. +.+|++||+|+||++|||+||+.++.+++ +++|++++.++|++.|++=- T Consensus 81 ~~~g-~~~~~vg~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pa~~~~--k~~~~~~~~~~l~~~l~ka~ 146 (146) T protein:vir:10 81 LEGG-IKTVKIGLNKADRSPWFYLKFHEWGTSKMPAHPFIEPGFNAS--KAEAVRAMTDILKNEMRLDL 146 (146) T ss_pred cccc-ceeEEeeeccCCCCCcceeeeeccCCCCCCCCcchhHHHHHh--HHHHHHHHHHHHHHHHhhcC Confidence 5432 34678999753 56799999999999999999999999988 47899999999999987766 No 25 >protein:vir:105007 Length: 146 # NCBI annotation: conserved phage protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459972;genbank:gi:85701387;genbank:GeneID:3882148 Probab=99.91 E-value=1.4e-27 Score=167.83 Aligned_cols=137 Identities=19% Similarity=0.160 Sum_probs=112.0 Q ss_pred Ccc-chHH-HHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCC-----CCCcccchhcceecCcc Q lcl|NC_018285. 1 MAM-VGLD-EALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKK-----DLKYGHMADGLSVQSTN 73 (142) Q Consensus 1 m~m-~~~~-~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~-----~~k~~HlaD~I~~~~~~ 73 (142) ||. .+++ +||++|+++|++|.....+..++|+++||+++++.++.++|++....+. ....+|++|+|.+++.. T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ 80 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAK 80 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceecccc Confidence 553 2345 8999999999999988888999999999999999999999975433221 12346999999998855 Q ss_pred ccccccceeEeccCCC--CceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhhcC Q lcl|NC_018285. 74 ADGRKNGVATVGWKNN--YHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIRRKG 140 (142) Q Consensus 74 ~dg~~~G~~~VG~~k~--~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~~k~ 140 (142) .++. ..++.|||++. +.+|++||+|+||++|||+||+.++.+++ +++|++++.++|++.|++=- T Consensus 81 ~~~g-~~~~~vg~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pa~~~~--k~~~~~~~~~~l~~~l~ka~ 146 (146) T protein:vir:10 81 LEGG-IKTVKIGLNKADRSPWFYLKFHEWGTSKMPAHPFIEPGFNAS--KAEAVRAMTDILKNEMRLDL 146 (146) T ss_pred cccc-ceeEEeeeccCCCCCcceeeeeccCCCCCCCCcchhHHHHHh--HHHHHHHHHHHHHHHHhhcC Confidence 5432 34678999753 56799999999999999999999999988 47899999999999987766 No 26 >protein:vir:5745 Length: 135 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892056;genbank:gi:33770519;interpro:IPR010064;interpro:IPR011693;uniprot:Q7Y404;genbank:GeneID:2637451 Probab=99.90 E-value=1.7e-27 Score=167.38 Aligned_cols=132 Identities=17% Similarity=0.186 Sum_probs=109.0 Q ss_pred ccchHH-HHHHHHHHHHHHhcccc-HHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCc-cccccc Q lcl|NC_018285. 2 AMVGLD-EALEGWLETVASIGDIT-PAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQST-NADGRK 78 (142) Q Consensus 2 ~m~~~~-~~l~e~~~~l~kl~~~~-~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~-~~dg~~ 78 (142) -|.+++ +||+||++.|++|.... .+..++|+++||+++++.++.++|++..+ + .+||+|||.++.. ..+|.. T Consensus 1 M~~~~~i~Gl~el~~~l~~L~~~~~~k~~~~Al~~~a~~v~~~~k~~ap~~~~~---~--~g~l~~~I~i~~~k~~~~~~ 75 (135) T protein:vir:57 1 MIPEIEISGLQELERRLIAVGEEVGTKILRDAGRAAMAVVEADMKQNAGYDNSS---T--NAHMRDSIKIRSSRGKAGST 75 (135) T ss_pred CceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCC---c--hhhHHhhcccccccccccce Confidence 567777 89999999999998764 45668999999999999999999975322 1 2699999999873 334544 Q ss_pred cceeEeccCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_018285. 79 NGVATVGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIRRKGG 141 (142) Q Consensus 79 ~G~~~VG~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~~k~g 141 (142) ..++.||+++. ..|++||+||||++|||+||+.++.++++ ++|++++.++|++.|.+=.- T Consensus 76 ~v~v~vg~~~~-~~~~~~f~E~GT~~~~a~PF~~pa~~~~~--~~~~~~~~~~~~~~l~ka~r 135 (135) T protein:vir:57 76 VVVLRVGPTRS-HYMKALAQEFGTIKQVAKPFIRPALDYNK--MQVLRILTVEIRDGLSTLSR 135 (135) T ss_pred eEEEEecCCCC-cceeEeecccCCCCCCCCcchhHhHHHhH--HHHHHHHHHHHHHHHHHhcC Confidence 44566887755 46999999999999999999999999884 78999999999999887655 No 27 >protein:vir:102154 Length: 119 # NCBI annotation: phage protein, HK97 gp10 family # Family: family:all:10671 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699937;genbank:gi:110804042;genbank:GeneID:4206698 Probab=99.87 E-value=2.7e-26 Score=160.76 Aligned_cols=117 Identities=16% Similarity=0.193 Sum_probs=101.4 Q ss_pred cchHH-HHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCccccccccce Q lcl|NC_018285. 3 MVGLD-EALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNGV 81 (142) Q Consensus 3 m~~~~-~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G~ 81 (142) |++++ +||||+++.|++|+....++.++|+++||+++++.++.++|.+ ||+..| |+.+- .++|+ T Consensus 1 Ma~iel~G~del~~~l~~~g~~~~~ie~kAlk~g~e~I~~~~~~n~P~~------tg~lkk----ik~~~-----kk~g~ 65 (119) T protein:vir:10 1 MASLEIEGFEEFEKFISEDMVLDESTKRKGIKAGITKIGKAIEKNSPIK------SGRLSK----VKIRV-----KNTGL 65 (119) T ss_pred CceeehhhHHHHHHHHHhhhhhhHHHHHHHHHHHhHHHHHHHhhcCCcc------cCCcce----eeeee-----ecCce Confidence 99999 8999999999999999999999999999999999999999962 333333 33221 24569 Q ss_pred eEeccCCCCceeEEeecccCccccCCC-chhhHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018285. 82 ATVGWKNNYHAQNARRLNDGTKKYRAD-HFVTNVQNDSSVQKKVLLAEKAEYEKLIR 137 (142) Q Consensus 82 ~~VG~~k~~~a~~A~f~n~GT~k~~~~-hFie~t~~e~~~~~~vl~A~~~~~k~~l~ 137 (142) ++||+++ ...||.+|+|||||+||++ ||++++.+++. +++++.+.++|.+-|+ T Consensus 66 ~~VG~~k-s~~fy~kF~EFGTSkm~a~~pF~~~a~~~~~--~eA~~~~~~el~~~~r 119 (119) T protein:vir:10 66 ATEGTAS-SSEFYDIFQNFGTSEQKAHVGYFDRAVDETT--NEAVEEVAEIIFRKMR 119 (119) T ss_pred eEeccCC-cchhhhhhccccccccCCCCCccccccccCh--HHHHHHHHHHHHHhcC Confidence 9999986 4579999999999999999 89999999884 8999999999999988 No 28 >protein:vir:100075 Length: 140 # NCBI annotation: gp9 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945039;genbank:gi:38707899;genbank:GeneID:2744122 Probab=99.86 E-value=4.4e-25 Score=154.14 Aligned_cols=128 Identities=21% Similarity=0.220 Sum_probs=104.0 Q ss_pred cchHH-HHHHHHHHHHHHhcccc-HHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCccccccccc Q lcl|NC_018285. 3 MVGLD-EALEGWLETVASIGDIT-PAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNG 80 (142) Q Consensus 3 m~~~~-~~l~e~~~~l~kl~~~~-~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G 80 (142) |++|+ +||++|++.|++|.... .+.-.+|+++||+++++.++.++|+. .+||+|+|.++.... ...++ T Consensus 1 Ma~~~i~Gld~l~~~l~~L~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~---------tG~l~~sI~~~~~~~-~~~~~ 70 (140) T protein:vir:10 1 MSSIQIIGLADLRADFEKLAKSQSTKALRRATVAGAKVIRDEARKRAPKK---------TGKLRRNIVSAALRQ-KDAPG 70 (140) T ss_pred CceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC---------hhhHHHhcccccccc-ccccc Confidence 88888 89999999999998754 34678999999999999999999963 259999999876322 12244 Q ss_pred eeEeccC--------CCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC Q lcl|NC_018285. 81 VATVGWK--------NNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIRRKGGK 142 (142) Q Consensus 81 ~~~VG~~--------k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~~k~g~ 142 (142) ++.||+. ....+|++||+|+||++|||+||+.++.+++ ++++++++.++|++.|.+--|. T Consensus 71 ~~~~g~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~--~~~~~~~~~~~~~~~l~k~~~~ 138 (140) T protein:vir:10 71 LATAGVRVRTKGKADSPNNAFYWRFDEFGTQHMKAQPFMRPAFDAS--IGEAEGAIRTELARAIDRVLGG 138 (140) T ss_pred eEEeeeeeccccccCCCCccceeeeeccCCCCCCCCcchhhhHHHH--HHHHHHHHHHHHHHHHHHHhhc Confidence 5666652 1245699999999999999999999999987 4789999998888888777666 No 29 >protein:vir:1437 Length: 140 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536366;genbank:gi:17975171;genbank:GeneID:929147 Probab=99.85 E-value=1.1e-24 Score=151.97 Aligned_cols=127 Identities=22% Similarity=0.237 Sum_probs=102.6 Q ss_pred cchHH-HHHHHHHHHHHHhccccH-HHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCccc-ccccc Q lcl|NC_018285. 3 MVGLD-EALEGWLETVASIGDITP-AEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNA-DGRKN 79 (142) Q Consensus 3 m~~~~-~~l~e~~~~l~kl~~~~~-~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~-dg~~~ 79 (142) |++++ +||++|++.|++|..... +...+|+++||+++++.++.++|+. .+||+|||.++.... ++ . T Consensus 1 M~~~~i~Gld~l~~~l~~l~~~~~~~~~~~al~~~a~~v~~~ak~~aP~~---------tG~l~~sI~~~~~~~~~~--~ 69 (140) T protein:vir:14 1 MSSIQIIGLADLRADFEKLAKSQSAKALRRATLAGAKVIRDEARKRAPKK---------TGKLRRNIVSAALRQKDA--P 69 (140) T ss_pred CceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC---------hhhHHhhccccccccccc--c Confidence 88888 899999999999987643 4568999999999999999999963 259999999976332 33 3 Q ss_pred ceeEeccC--------CCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC Q lcl|NC_018285. 80 GVATVGWK--------NNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIRRKGGK 142 (142) Q Consensus 80 G~~~VG~~--------k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~~k~g~ 142 (142) +++.||+. ....+|++||+|+||++|||+||+.++.++++ .++.+++.++|++.|.+--|. T Consensus 70 ~~~~vg~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~pFl~pa~~~~~--~~~~~~~~~~~~~~l~k~~~~ 138 (140) T protein:vir:14 70 GLATAGVRVRTKGKADSPNNAFYWRFDEFGTQHMKAQPFMRPAFDASI--GEAEGAIRTELARAIDRVLGG 138 (140) T ss_pred eeEEeeeeeccccccCCCCccceeeeeccccCCCCCCcchhHHHHHHH--HHHHHHHHHHHHHHHHHHhhc Confidence 35566642 12346999999999999999999999999874 678888888888877776666 No 30 >protein:vir:80362 Length: 140 # NCBI annotation: gp10, phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111089;genbank:gi:134288660;genbank:GeneID:4960609 Probab=99.85 E-value=2.4e-24 Score=150.05 Aligned_cols=128 Identities=21% Similarity=0.231 Sum_probs=103.2 Q ss_pred cchHH-HHHHHHHHHHHHhcccc-HHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCccccccccc Q lcl|NC_018285. 3 MVGLD-EALEGWLETVASIGDIT-PAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNG 80 (142) Q Consensus 3 m~~~~-~~l~e~~~~l~kl~~~~-~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G 80 (142) |++++ +||++|++.|++|..+. .+.-.+|+++||+++++.++.++|+. .+||+|+|.++.... ...++ T Consensus 1 Ma~~~i~Gld~l~~~l~~l~~~~~~k~~~~a~~~~a~~v~~~ak~~aP~~---------tG~l~~~i~~~~~~~-~~~~~ 70 (140) T protein:vir:80 1 MSSIQIVGLADLLADFERLAKSQSTKALRRATVAGAKVIRDEARKRAPKK---------TGKLRRNIVSAALRQ-KDAPG 70 (140) T ss_pred CceeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC---------cchhhhceeeecccc-ccccc Confidence 88998 89999999999998654 34558899999999999999999963 259999999876322 12244 Q ss_pred eeEeccC--------CCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC Q lcl|NC_018285. 81 VATVGWK--------NNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIRRKGGK 142 (142) Q Consensus 81 ~~~VG~~--------k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~~k~g~ 142 (142) ++.||+. ....+|++||+|+||++|||+||+.++.+++ ++++.+++.++|++.|.+.-|. T Consensus 71 ~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~--~~~~~~~~~~~~~~~l~k~~~~ 138 (140) T protein:vir:80 71 LATAGVRVRTKGKADSPSNAFYWRFDEFGTQHMKAQPFMRPAFDAS--IGEAEGAIRTELARAIDQALGG 138 (140) T ss_pred eeeeeeecccccccCCCCCcceeeeeccCCCCCCCCcchhhhHHHH--HHHHHHHHHHHHHHHHHHHhhc Confidence 5566653 1234699999999999999999999999987 4788899888888888777666 No 31 >protein:vir:100243 Length: 140 # NCBI annotation: gp72 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355408;genbank:gi:77864698;genbank:GeneID:3725965 Probab=99.84 E-value=3.3e-24 Score=149.35 Aligned_cols=129 Identities=19% Similarity=0.184 Sum_probs=103.6 Q ss_pred cchHH-HHHHHHHHHHHHhccccH-HHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCcc-cccccc Q lcl|NC_018285. 3 MVGLD-EALEGWLETVASIGDITP-AEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTN-ADGRKN 79 (142) Q Consensus 3 m~~~~-~~l~e~~~~l~kl~~~~~-~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~-~dg~~~ 79 (142) |++|+ +||++|++.|++|..... +.-.+|+++||+++++.++.++|+. .+||+++|.++... .++... T Consensus 1 Ma~~~i~Gld~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~ap~~---------tG~l~~sI~~~~~~~~~~~~~ 71 (140) T protein:vir:10 1 MSSVQILGLADLQADFLKLAKAQSTKALRRATVAGANVIRDEARARAPKK---------TGKLKRNIVTAALKQKDSPGI 71 (140) T ss_pred CceeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC---------hhhHHHhceecccccccccce Confidence 88988 899999999999987653 4668999999999999999999963 25999999987632 233333 Q ss_pred ceeEeccCC------CCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC Q lcl|NC_018285. 80 GVATVGWKN------NYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIRRKGGK 142 (142) Q Consensus 80 G~~~VG~~k------~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~~k~g~ 142 (142) ..+.|++.. ...+|++||+|+||++|||+||+.++.+++ +++|++++.++|++.|.+--+. T Consensus 72 ~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~--~~~~~~~~~~~~~~~l~k~~~~ 138 (140) T protein:vir:10 72 ATAGVRVRTKGKADSPNNAFYWRFVELGTQFMKAEPFMRPAFDAS--IAQAEGAIRTEIARAIDQVVGG 138 (140) T ss_pred eEEeeccccccccCCCCcccccceeccCcCCCCCCcchhhhHHHH--HHHHHHHHHHHHHHHHHHHhhc Confidence 334444421 234799999999999999999999999987 4789999999998888876655 No 32 >protein:vir:93617 Length: 148 # NCBI annotation: putative structural component # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449299;genbank:gi:157166047;interpro:IPR010064;interpro:IPR011693;uniprot:Q6H9U2;genbank:GeneID:5580439 Probab=99.83 E-value=9.1e-24 Score=146.91 Aligned_cols=139 Identities=17% Similarity=0.144 Sum_probs=99.6 Q ss_pred CccchHH-HHHHHHHHHHHHhcccc-HHHHHHHHHHHHHHHHHHHHHhcCcCCC-------CCCCCCCcccchhcceecC Q lcl|NC_018285. 1 MAMVGLD-EALEGWLETVASIGDIT-PAEQAKITTAGAKVFQKELEEVTREKHY-------SNKKDLKYGHMADGLSVQS 71 (142) Q Consensus 1 m~m~~~~-~~l~e~~~~l~kl~~~~-~~~~~ka~~AGA~v~~~~L~~~tp~~~~-------~~~~~~k~~HlaD~I~~~~ 71 (142) |...+|+ +||++|++.|++|.... .+...+|+++||+++++.++.++|.+.- ........+|+.+.|.+.. T Consensus 1 mm~~~~~i~Gldel~~~l~~L~~~~~~~~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~g~~~~~v~~~~ 80 (148) T protein:vir:93 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRRSRDGGMESGVHIRG 80 (148) T ss_pred CcceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcchhhhhceeccccccCCceeeeeeecc Confidence 6556677 89999999999997653 3567889999999999999999996311 0001112245555555444 Q ss_pred ccccccccceeEeccCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC Q lcl|NC_018285. 72 TNADGRKNGVATVGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIRRKGGK 142 (142) Q Consensus 72 ~~~dg~~~G~~~VG~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~~k~g~ 142 (142) .+.+....+...++|+ +..+|++||+||||++|||+||+.++.+++ +++|++++.++|++.|.+--.+ T Consensus 81 ~~~~~~~~~~~~~~~~-~~~~~y~~f~E~GT~~~pa~PFl~pA~~~~--k~~~~~~~~~~~~~~i~k~~~k 148 (148) T protein:vir:93 81 VNPDTGNSDNTMKADN-PRNAFYWRFVEMGTVNMPPHPFVRPAFDVR--SEQAAQVAIARMNRAIDEVLRR 148 (148) T ss_pred cccccccccceeecCC-CCCcceeeeeccCCCCCCCCcchhHHHHHh--HHHHHHHHHHHHHHHHHHHhcC Confidence 2222222223334454 445799999999999999999999999987 4778888888888887776666 No 33 >protein:vir:4347 Length: 164 # NCBI annotation: Orf14 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061510;genbank:gi:9635606;genbank:GeneID:1262873 Probab=99.83 E-value=7e-24 Score=147.51 Aligned_cols=134 Identities=20% Similarity=0.194 Sum_probs=98.0 Q ss_pred Ccc-chHH-HHHHHHHHHHHHhcccc-HHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecC-ccc-- Q lcl|NC_018285. 1 MAM-VGLD-EALEGWLETVASIGDIT-PAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQS-TNA-- 74 (142) Q Consensus 1 m~m-~~~~-~~l~e~~~~l~kl~~~~-~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~-~~~-- 74 (142) |+. .+|+ +||++|+++|+.|.... .+..++|+++||+++++.++.++|+...+. ..+||+|+|.++. ... T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~----~~~~l~~~i~~~~~~~~~~ 76 (164) T protein:vir:43 1 MADTVEFSITGLDSLLGKLDSVTDDVKRRGGRAALRKAAMIVVQAAKQGAEKVDDPG----TGRSISDNIALRWNGRLFK 76 (164) T ss_pred CCcceEEeeecHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccCCC----ccchhhhhhhhhcccCccc Confidence 552 3466 89999999999998654 356789999999999999999999853322 1259999998842 111 Q ss_pred -cccccceeEecc---------------CCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_018285. 75 -DGRKNGVATVGW---------------KNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIRR 138 (142) Q Consensus 75 -dg~~~G~~~VG~---------------~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~~ 138 (142) .|.. ...||. ......|++||+||||++|||+||+.++.+++ +++|++++.++|++.|.+ T Consensus 77 ~~~~~--~~~vg~~~~~~~~~~~~~~~~~~~~~~~y~~f~EfGT~km~a~PFlrPA~~~~--k~~~~~~~~~~l~~~i~k 152 (164) T protein:vir:43 77 RTGDL--GFRIGVLHGAVLPKKGERSDKTANAPTPHWRLLEFGTEDMRAQPFMRSALADN--IAEVTSTFVSEYEKGIDR 152 (164) T ss_pred cccce--eEEecccccccccccccccccCCCCCcceEEEeecCCCCCCCCcchhhhHHHh--HHHHHHHHHHHHHHHHHH Confidence 1110 111221 11233699999999999999999999999987 478999999888888865 Q ss_pred cCCC Q lcl|NC_018285. 139 KGGK 142 (142) Q Consensus 139 k~g~ 142 (142) --.+ T Consensus 153 a~~k 156 (164) T protein:vir:43 153 AIKR 156 (164) T ss_pred HHHH Confidence 4333 No 34 >protein:vir:194 Length: 149 # NCBI annotation: Gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037704;genbank:gi:9634169;genbank:GeneID:1262536 Probab=99.82 E-value=3.5e-23 Score=143.68 Aligned_cols=130 Identities=18% Similarity=0.177 Sum_probs=98.7 Q ss_pred CccchHH-HHHHHHHHHHHHhccccH-HHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecC------- Q lcl|NC_018285. 1 MAMVGLD-EALEGWLETVASIGDITP-AEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQS------- 71 (142) Q Consensus 1 m~m~~~~-~~l~e~~~~l~kl~~~~~-~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~------- 71 (142) |.-.+++ +||++|++.|++|..... +.-++|+.+||+++++.++.++|+. .++|++||.++. T Consensus 1 mm~~~~~i~Gl~~l~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~---------~g~l~~si~~~~~~~~~~~ 71 (149) T protein:vir:19 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIDRAPVR---------TGKLKKNVVVVTQKSRRRG 71 (149) T ss_pred CcceeeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCC---------chhhhhhcccccccccccc Confidence 4444566 899999999999987644 4568999999999999999999963 135566655433 Q ss_pred ----------ccccccccceeEeccCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_018285. 72 ----------TNADGRKNGVATVGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIRRKGG 141 (142) Q Consensus 72 ----------~~~dg~~~G~~~VG~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~~k~g 141 (142) .+.+........++++ +..+|++||+|+||++|||+||+.++.+++ +++|++++.++|++.|++--+ T Consensus 72 ~~~~~v~~~~~~~~~~~~~~~~~~~~-~~~~~y~~f~E~GT~~~~a~PF~~pA~~~~--k~~~~~~~~~~l~~~l~k~~~ 148 (149) T protein:vir:19 72 EISSGVHIRGVNPRTGNSDNTMKANN-PRNAFYWRFVELGTANMPAHPFVRPAYDTR--EEEAASVAIARMNQAIDEVLS 148 (149) T ss_pred ceeecccccccccccccccceeecCC-CCccceeeeeccCCCCCCCCcchhHHHHHH--HHHHHHHHHHHHHHHHHHHhc Confidence 2222111222333443 345799999999999999999999999987 478999999999998888888 Q ss_pred C Q lcl|NC_018285. 142 K 142 (142) Q Consensus 142 ~ 142 (142) + T Consensus 149 k 149 (149) T protein:vir:19 149 K 149 (149) T ss_pred C Confidence 8 No 35 >protein:vir:1891 Length: 179 # NCBI annotation: gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037671;genbank:gi:9634129;genbank:GeneID:1262520 Probab=99.77 E-value=5.7e-22 Score=137.06 Aligned_cols=136 Identities=18% Similarity=0.156 Sum_probs=95.0 Q ss_pred Ccc-chHH-HHHHHHHHHHHHhcccc-HHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCcccccc Q lcl|NC_018285. 1 MAM-VGLD-EALEGWLETVASIGDIT-PAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGR 77 (142) Q Consensus 1 m~m-~~~~-~~l~e~~~~l~kl~~~~-~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~ 77 (142) ||. .+++ +||+||.+.|+.|.... .+.-++|+.+||+++++.++.++|+.... ...+||+++|.+........ T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~----~~~~~l~~~i~~~~~~~~~~ 76 (179) T protein:vir:18 1 MADSVEVSLTGLESLLGKMEAVSEVTRNKAGRFALRKAANIIRDRARSNASRVDDP----LTKEAIHKNIVASFSSKQFR 76 (179) T ss_pred CCceEEEEeecHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccc----cchhhhhhheeecccccccc Confidence 653 4556 89999999999998654 34668999999999999999999975211 11269999999865211111 Q ss_pred ccc--eeEecc----------------------------C--CCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHH Q lcl|NC_018285. 78 KNG--VATVGW----------------------------K--NNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVL 125 (142) Q Consensus 78 ~~G--~~~VG~----------------------------~--k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl 125 (142) .+| +..||. . .....|++||+||||++|||+||+.++.++++ ++++ T Consensus 77 ~~g~~~~~vgv~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~y~~fvEfGT~kmpa~PFlrPA~~~~~--~~a~ 154 (179) T protein:vir:18 77 RTGDLAFRVGVMGGARQYANTKANVRKGRAGKTYKTSGDKGNPGGDTWYWRFLEFGTEHTSARPILRPAMNGVD--NDVI 154 (179) T ss_pred cccceeEeeecccccccccccccccccCcccccccccccccCCCCccceeEEeccCCCCCCCCccchhhHHhhH--HHHH Confidence 111 122221 0 01246999999999999999999999999874 6777 Q ss_pred HHHHHHHHHHHh----hcCCC Q lcl|NC_018285. 126 LAEKAEYEKLIR----RKGGK 142 (142) Q Consensus 126 ~A~~~~~k~~l~----~k~g~ 142 (142) +++.++|++.|. +-+++ T Consensus 155 ~~i~~~l~~~i~k~lk~~~~~ 175 (179) T protein:vir:18 155 NVFSTEMGKAIDRAIRLAMKK 175 (179) T ss_pred HHHHHHHHHHHHHHHHhhccc Confidence 777766655554 43444 No 36 >protein:vir:94538 Length: 125 # NCBI annotation: putative head to tail joining # Family: family:all:180 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223893;genbank:gi:62327105;genbank:GeneID:5075554 Probab=99.70 E-value=3.7e-20 Score=127.10 Aligned_cols=122 Identities=11% Similarity=0.076 Sum_probs=100.8 Q ss_pred Cccc-hHH-HHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCccc-ccc Q lcl|NC_018285. 1 MAMV-GLD-EALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNA-DGR 77 (142) Q Consensus 1 m~m~-~~~-~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~-dg~ 77 (142) ||-. +++ +||++|++.|+++.....+.-.+++..+|+.+++..+.++|+. | |+|+|||.++.... +|. T Consensus 1 Ma~~~~i~~~Gld~l~~~L~~~~~~~~~~v~~al~~~a~~i~~~ak~~ap~~------t---G~L~~sI~~~~~~~~~~~ 71 (125) T protein:vir:94 1 MANDFNIKFKGVDKLLDEFDISRKELVPYSVEAMKTSLSRAVEKSKGLARVD------T---GYMRNNIQQDEVKEEHGV 71 (125) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhhCCCC------C---hhhhhhceecceeccCCc Confidence 4332 133 6999999999999887778888999999999999999999952 3 58999999876433 332 Q ss_pred ccceeEeccCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_018285. 78 KNGVATVGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIRRK 139 (142) Q Consensus 78 ~~G~~~VG~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~~k 139 (142) . +..||++ +++|+|+|+||++|||+||+.++.++. +.++.+.+.+++++.++.- T Consensus 72 ~--~~~v~~~----~~Ya~~vEfGT~~~~a~Pfl~pa~~~~--~~~~~~~l~~~l~~a~k~~ 125 (125) T protein:vir:94 72 V--TGRYVAR----ADYSSYNEYGTYRMSAQPFMAPSVAAM--TPFFYKAVRDALNKAAKFS 125 (125) T ss_pred E--EEEeeCC----CCccceeecccccCCCCcccchhHHHH--HHHHHHHHHHHHHHHhccC Confidence 2 5778865 357999999999999999999999987 4789999999999999998 No 37 >protein:vir:96358 Length: 115 # NCBI annotation: ORF045 # Family: family:all:180 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239651;genbank:gi:66395408;genbank:GeneID:5132834 Probab=99.62 E-value=9.9e-19 Score=119.29 Aligned_cols=114 Identities=18% Similarity=0.194 Sum_probs=92.1 Q ss_pred HH-HHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCccccccccceeEe Q lcl|NC_018285. 6 LD-EALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNGVATV 84 (142) Q Consensus 6 ~~-~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G~~~V 84 (142) ++ +|||+|++.|++|.+...+.-.+|++.+|..+.+..+.++|+......+| ++|+|||+++. +| +..+.| T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~T---G~Lr~sI~~~~---~g--~~~~~v 72 (115) T protein:vir:96 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWT---GNLSRNIRYKK---TG--DLQYTI 72 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCc---hhhhhcceeee---cC--ceEEEe Confidence 66 89999999999998777777788999999999999999998764334445 58999999874 22 346778 Q ss_pred ccCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 85 GWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYE 133 (142) Q Consensus 85 G~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k 133 (142) |.+ +++|+|+|+||++|||+||+.++.+.. +.++.+.+.+.++ T Consensus 73 ~~~----~~Ya~~vE~GT~km~a~Pfl~PA~~~~--~~~~~~~i~~~~k 115 (115) T protein:vir:96 73 TSH----AAYSGFLEFGTRYMEAEPFMWPVYEVI--RKSTVEELKALFE 115 (115) T ss_pred ecC----ccchhhhcccccccCCCCchhhhHHHH--HHHHHHHHHHHhC Confidence 854 578999999999999999999999876 4667777766666 No 38 >protein:vir:78858 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285365;genbank:gi:148717893;genbank:GeneID:5246989 Probab=99.62 E-value=9.9e-19 Score=119.29 Aligned_cols=114 Identities=18% Similarity=0.194 Sum_probs=92.1 Q ss_pred HH-HHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCccccccccceeEe Q lcl|NC_018285. 6 LD-EALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNGVATV 84 (142) Q Consensus 6 ~~-~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G~~~V 84 (142) ++ +|||+|++.|++|.+...+.-.+|++.+|..+.+..+.++|+......+| ++|+|||+++. +| +..+.| T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~T---G~Lr~sI~~~~---~g--~~~~~v 72 (115) T protein:vir:78 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWT---GNLSRNIRYKK---TG--DLQYTI 72 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCc---hhhhhcceeee---cC--ceEEEe Confidence 66 89999999999998777777788999999999999999998764334445 58999999874 22 346778 Q ss_pred ccCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 85 GWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYE 133 (142) Q Consensus 85 G~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k 133 (142) |.+ +++|+|+|+||++|||+||+.++.+.. +.++.+.+.+.++ T Consensus 73 ~~~----~~Ya~~vE~GT~km~a~Pfl~PA~~~~--~~~~~~~i~~~~k 115 (115) T protein:vir:78 73 TSH----AAYSGFLEFGTRYMEAEPFMWPVYEVI--RKSTVEELKALFE 115 (115) T ss_pred ecC----ccchhhhcccccccCCCCchhhhHHHH--HHHHHHHHHHHhC Confidence 854 578999999999999999999999876 4667777766666 No 39 >protein:vir:97144 Length: 115 # NCBI annotation: ORF047 # Family: family:all:180 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239729;genbank:gi:66394911;genbank:GeneID:5130877 Probab=99.62 E-value=9.9e-19 Score=119.29 Aligned_cols=114 Identities=18% Similarity=0.194 Sum_probs=92.1 Q ss_pred HH-HHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCccccccccceeEe Q lcl|NC_018285. 6 LD-EALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNGVATV 84 (142) Q Consensus 6 ~~-~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G~~~V 84 (142) ++ +|||+|++.|++|.+...+.-.+|++.+|..+.+..+.++|+......+| ++|+|||+++. +| +..+.| T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~T---G~Lr~sI~~~~---~g--~~~~~v 72 (115) T protein:vir:97 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWT---GNLSRNIRYKK---TG--DLQYTI 72 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCc---hhhhhcceeee---cC--ceEEEe Confidence 66 89999999999998777777788999999999999999998764334445 58999999874 22 346778 Q ss_pred ccCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 85 GWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYE 133 (142) Q Consensus 85 G~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k 133 (142) |.+ +++|+|+|+||++|||+||+.++.+.. +.++.+.+.+.++ T Consensus 73 ~~~----~~Ya~~vE~GT~km~a~Pfl~PA~~~~--~~~~~~~i~~~~k 115 (115) T protein:vir:97 73 TSH----AAYSGFLEFGTRYMEAEPFMWPVYEVI--RKSTVEELKALFE 115 (115) T ss_pred ecC----ccchhhhcccccccCCCCchhhhHHHH--HHHHHHHHHHHhC Confidence 854 578999999999999999999999876 4667777766666 No 40 >protein:vir:103917 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873996;genbank:gi:118430771;genbank:GeneID:4525409 Probab=99.62 E-value=9.9e-19 Score=119.29 Aligned_cols=114 Identities=18% Similarity=0.194 Sum_probs=92.1 Q ss_pred HH-HHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCccccccccceeEe Q lcl|NC_018285. 6 LD-EALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNGVATV 84 (142) Q Consensus 6 ~~-~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G~~~V 84 (142) ++ +|||+|++.|++|.+...+.-.+|++.+|..+.+..+.++|+......+| ++|+|||+++. +| +..+.| T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~T---G~Lr~sI~~~~---~g--~~~~~v 72 (115) T protein:vir:10 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWT---GNLSRNIRYKK---TG--DLQYTI 72 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCc---hhhhhcceeee---cC--ceEEEe Confidence 66 89999999999998777777788999999999999999998764334445 58999999874 22 346778 Q ss_pred ccCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 85 GWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYE 133 (142) Q Consensus 85 G~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k 133 (142) |.+ +++|+|+|+||++|||+||+.++.+.. +.++.+.+.+.++ T Consensus 73 ~~~----~~Ya~~vE~GT~km~a~Pfl~PA~~~~--~~~~~~~i~~~~k 115 (115) T protein:vir:10 73 TSH----AAYSGFLEFGTRYMEAEPFMWPVYEVI--RKSTVEELKALFE 115 (115) T ss_pred ecC----ccchhhhcccccccCCCCchhhhHHHH--HHHHHHHHHHHhC Confidence 854 578999999999999999999999876 4667777766666 No 41 >protein:vir:96225 Length: 115 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239574;genbank:gi:66395330;genbank:GeneID:5132773 Probab=99.62 E-value=9.9e-19 Score=119.29 Aligned_cols=114 Identities=18% Similarity=0.194 Sum_probs=92.1 Q ss_pred HH-HHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCccccccccceeEe Q lcl|NC_018285. 6 LD-EALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNGVATV 84 (142) Q Consensus 6 ~~-~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G~~~V 84 (142) ++ +|||+|++.|++|.+...+.-.+|++.+|..+.+..+.++|+......+| ++|+|||+++. +| +..+.| T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~T---G~Lr~sI~~~~---~g--~~~~~v 72 (115) T protein:vir:96 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWT---GNLSRNIRYKK---TG--DLQYTI 72 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCc---hhhhhcceeee---cC--ceEEEe Confidence 66 89999999999998777777788999999999999999998764334445 58999999874 22 346778 Q ss_pred ccCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 85 GWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYE 133 (142) Q Consensus 85 G~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k 133 (142) |.+ +++|+|+|+||++|||+||+.++.+.. +.++.+.+.+.++ T Consensus 73 ~~~----~~Ya~~vE~GT~km~a~Pfl~PA~~~~--~~~~~~~i~~~~k 115 (115) T protein:vir:96 73 TSH----AAYSGFLEFGTRYMEAEPFMWPVYEVI--RKSTVEELKALFE 115 (115) T ss_pred ecC----ccchhhhcccccccCCCCchhhhHHHH--HHHHHHHHHHHhC Confidence 854 578999999999999999999999876 4667777766666 No 42 >protein:vir:9312 Length: 115 # NCBI annotation: phi Mu50B-like protein # Family: family:all:180 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803290;genbank:gi:29028600;genbank:GeneID:1258048 Probab=99.62 E-value=9.9e-19 Score=119.29 Aligned_cols=114 Identities=18% Similarity=0.194 Sum_probs=92.1 Q ss_pred HH-HHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCccccccccceeEe Q lcl|NC_018285. 6 LD-EALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNGVATV 84 (142) Q Consensus 6 ~~-~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G~~~V 84 (142) ++ +|||+|++.|++|.+...+.-.+|++.+|..+.+..+.++|+......+| ++|+|||+++. +| +..+.| T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~T---G~Lr~sI~~~~---~g--~~~~~v 72 (115) T protein:vir:93 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWT---GNLSRNIRYKK---TG--DLQYTI 72 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCc---hhhhhcceeee---cC--ceEEEe Confidence 66 89999999999998777777788999999999999999998764334445 58999999874 22 346778 Q ss_pred ccCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 85 GWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYE 133 (142) Q Consensus 85 G~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k 133 (142) |.+ +++|+|+|+||++|||+||+.++.+.. +.++.+.+.+.++ T Consensus 73 ~~~----~~Ya~~vE~GT~km~a~Pfl~PA~~~~--~~~~~~~i~~~~k 115 (115) T protein:vir:93 73 TSH----AAYSGFLEFGTRYMEAEPFMWPVYEVI--RKSTVEELKALFE 115 (115) T ss_pred ecC----ccchhhhcccccccCCCCchhhhHHHH--HHHHHHHHHHHhC Confidence 854 578999999999999999999999876 4667777766666 No 43 >protein:vir:106623 Length: 115 # NCBI annotation: ORF049 # Family: family:all:180 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239497;genbank:gi:66395260;genbank:GeneID:4555777 Probab=99.59 E-value=3.6e-18 Score=116.25 Aligned_cols=114 Identities=16% Similarity=0.100 Sum_probs=89.6 Q ss_pred HH-HHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCccccccccceeEe Q lcl|NC_018285. 6 LD-EALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNGVATV 84 (142) Q Consensus 6 ~~-~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G~~~V 84 (142) ++ +|||+|++.|+.+.+...+.-.++++.+|..+++..+.++|.+.....+| +.|++||+++. +| +.+..| T Consensus 1 i~i~Gld~L~~~l~~~~~~~~~~~~~al~~~~~~i~~~a~~~a~~~~~~pv~T---G~Lr~sI~~~~---~g--~~~~~v 72 (115) T protein:vir:10 1 MQSKGLKKLMNHLKVMHDDIEDDVDDILKNNAKEGVGIAVSNAKEVMNKGYWT---GNLASLIEVKK---IG--DLHYRV 72 (115) T ss_pred CeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcc---hhhhhceeeee---cC--cEEEEe Confidence 66 89999999999998777777789999999999999999999764334455 47999999863 23 235667 Q ss_pred ccCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018285. 85 GWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIR 137 (142) Q Consensus 85 G~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~ 137 (142) |.+ +++|+|+|+||++|||+||+.++.++.+ ..+.+.+ +++|. T Consensus 73 ~~~----~~Ya~~vEfGT~km~a~PFl~PA~~~~k--~~~~~~i----~~~i~ 115 (115) T protein:vir:10 73 IST----AHYSGFLEFGTRYMEPAPFMFPTYQTLK--KSTINDL----KRLLS 115 (115) T ss_pred eCC----CccchheecccccCCCCCchhhhHHHHH--HHHHHHH----HHHhC Confidence 743 5799999999999999999999998763 4555554 44454 No 44 >protein:vir:2740 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695113;genbank:gi:23455882;genbank:GeneID:955595 Probab=99.56 E-value=1.1e-17 Score=113.65 Aligned_cols=113 Identities=16% Similarity=0.184 Sum_probs=85.5 Q ss_pred cchHH-HHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCccccccccce Q lcl|NC_018285. 3 MVGLD-EALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNGV 81 (142) Q Consensus 3 m~~~~-~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G~ 81 (142) |++++ +|||+|++.|++++. .++-.++++.++..+++....++|...+ .+| ++|++||.++.. ++. T Consensus 1 Ma~i~~~Gld~l~~~L~~~~~--~~~v~~~~~~~~~~~~~~~~~~a~~~~p--~~T---G~Lr~sI~~~~~------~~~ 67 (114) T protein:vir:27 1 MATIEFEGLDEMAQSLLKNAS--PEKRSKVLRKYGSKLKEAAVNRAQFNKG--YST---GATRRSITLQVE------SDK 67 (114) T ss_pred CeeeeeehHHHHHHHHHHhcC--HHHHHHHHHHHHHHHHHHHHHhcccCCC--CCc---hhhhhceeeeec------CCe Confidence 88998 899999999999863 4555778888888888888888764322 234 589999998642 334 Q ss_pred eEeccCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 82 ATVGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYEK 134 (142) Q Consensus 82 ~~VG~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~ 134 (142) ..||++ +++|+|+|+||++|||+||+.++.++.+ .++.+.+.+.|+- T Consensus 68 ~~V~~~----~~Ya~~vEfGT~km~a~Pfl~PA~~~~~--~~~~~~l~~l~k~ 114 (114) T protein:vir:27 68 ATVEAL----TSYSGYLEVGTRKMEAQPFMKPALDEVA--PKMVEELAKWDET 114 (114) T ss_pred eEecCC----CCccceecccccccCCCCchhhhHHHHH--HHHHHHHHHHhcC Confidence 679975 3578999999999999999999998763 4555555555544 No 45 >protein:vir:4906 Length: 114 # NCBI annotation: gp114 # Family: family:all:180 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056684;genbank:gi:9635019;genbank:GeneID:1262668 Probab=99.56 E-value=1.1e-17 Score=113.65 Aligned_cols=113 Identities=16% Similarity=0.184 Sum_probs=85.5 Q ss_pred cchHH-HHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCccccccccce Q lcl|NC_018285. 3 MVGLD-EALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNGV 81 (142) Q Consensus 3 m~~~~-~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G~ 81 (142) |++++ +|||+|++.|++++. .++-.++++.++..+++....++|...+ .+| ++|++||.++.. ++. T Consensus 1 Ma~i~~~Gld~l~~~L~~~~~--~~~v~~~~~~~~~~~~~~~~~~a~~~~p--~~T---G~Lr~sI~~~~~------~~~ 67 (114) T protein:vir:49 1 MATIEFEGLDEMAQSLLKNAS--PEKRSKVLRKYGSKLKEAAVNRAQFNKG--YST---GATRRSITLQVE------SDK 67 (114) T ss_pred CeeeeeehHHHHHHHHHHhcC--HHHHHHHHHHHHHHHHHHHHHhcccCCC--CCc---hhhhhceeeeec------CCe Confidence 88998 899999999999863 4555778888888888888888764322 234 589999998642 334 Q ss_pred eEeccCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 82 ATVGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYEK 134 (142) Q Consensus 82 ~~VG~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~ 134 (142) ..||++ +++|+|+|+||++|||+||+.++.++.+ .++.+.+.+.|+- T Consensus 68 ~~V~~~----~~Ya~~vEfGT~km~a~Pfl~PA~~~~~--~~~~~~l~~l~k~ 114 (114) T protein:vir:49 68 ATVEAL----TSYSGYLEVGTRKMEAQPFMKPALDEVA--PKMVEELAKWDET 114 (114) T ss_pred eEecCC----CCccceecccccccCCCCchhhhHHHHH--HHHHHHHHHHhcC Confidence 679975 3578999999999999999999998763 4555555555544 No 46 >protein:vir:99744 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004311;genbank:gi:122891765;genbank:GeneID:4712299 Probab=99.55 E-value=1.7e-17 Score=112.54 Aligned_cols=114 Identities=17% Similarity=0.160 Sum_probs=91.2 Q ss_pred HH-HHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCccccccccceeEe Q lcl|NC_018285. 6 LD-EALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNGVATV 84 (142) Q Consensus 6 ~~-~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G~~~V 84 (142) ++ +|||+|++.|+.|.+...+.-.++++.+|..+++..+..++.+.....+| +.|++||+++. +| +..+.| T Consensus 1 i~i~Gld~L~~~l~~~~~~~~~~v~~av~~~~~~i~~~a~~~a~~~~~~p~~T---G~Lr~SI~~~~---~g--~~~~~V 72 (115) T protein:vir:99 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWT---GNLSRNIRYKK---TV--DLQYTI 72 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcc---hhhhhceeeee---cC--cEEEEe Confidence 67 89999999999998877777899999999999999999987654334455 47999999864 33 236778 Q ss_pred ccCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 85 GWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYE 133 (142) Q Consensus 85 G~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k 133 (142) |.+ +++|+|+|+||++|+|+||+.++.+.. +.++.+.+.+.++ T Consensus 73 ~~~----~~Ya~~vE~GT~~m~a~PFl~PA~~~~--k~~~~~~l~~~~k 115 (115) T protein:vir:99 73 TSH----AAYSGFLEFGTRYMEAEPFMWPVYEVI--RKSTVEELKTLFE 115 (115) T ss_pred cCC----ccccccccccccccCCCCcchhhHHHH--HHHHHHHHHHHhC Confidence 854 578999999999999999999999876 4566666666555 No 47 >protein:vir:3617 Length: 112 # NCBI annotation: ORF40 # Family: family:all:180 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112703;genbank:gi:13786571;genbank:GeneID:921069 Probab=99.54 E-value=1.9e-17 Score=112.20 Aligned_cols=111 Identities=12% Similarity=0.179 Sum_probs=88.2 Q ss_pred CccchHH-HHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCcccccccc Q lcl|NC_018285. 1 MAMVGLD-EALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKN 79 (142) Q Consensus 1 m~m~~~~-~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~ 79 (142) |.|. ++ +||++|+++|+++.. .....++++.+|..+++.++.++|.. | ++|++||.++.. +|. T Consensus 1 M~~~-i~i~Gld~l~~~L~~~~~--~~~~~~al~~~~~~i~~~ak~~aPvd------T---G~Lr~si~~~~~--~~~-- 64 (112) T protein:vir:36 1 MKSS-LSFKGIDQLVKHLDKAAS--LKGVQQVVKSNTSNMTANMQKLVPVD------T---GYMKRSIKMELT--EGG-- 64 (112) T ss_pred Ccee-eeehhHHHHHHHHHhhhh--HHHHHHHHHHHHHHHHHHHHHhCCCC------c---hhhhhceeeeec--CCc-- Confidence 8885 65 899999999998864 35578899999999999999999962 3 589999997642 121 Q ss_pred ceeEeccCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 80 GVATVGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYE 133 (142) Q Consensus 80 G~~~VG~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k 133 (142) -++.||+. +++|+|+|+||++|||+||+.++.++.+ .++.+.+.+.++ T Consensus 65 ~~~~V~~~----~~Ya~~vE~GT~k~~a~Pfl~pa~~~~~--~~~~~~i~~~lr 112 (112) T protein:vir:36 65 FSGQAGPH----TDYSAYVEYGTRFQSAQPFVKPAYNEQK--GVFIKDLERLLK 112 (112) T ss_pred eEEEeecC----CCccceeeccccccCCCcchhhhHHHHH--HHHHHHHHHHcC Confidence 26789975 4579999999999999999999998764 566666655555 No 48 >protein:vir:95789 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950593;genbank:gi:119953788;genbank:GeneID:5076859 Probab=99.52 E-value=3.7e-17 Score=110.66 Aligned_cols=114 Identities=10% Similarity=0.093 Sum_probs=94.9 Q ss_pred CccchHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCccccccccc Q lcl|NC_018285. 1 MAMVGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNG 80 (142) Q Consensus 1 m~m~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G 80 (142) |.+ .+ +||++|.+.|+++.....+.-..+++.+|..+.+..+..+|.. | ++|++||.++.. | . T Consensus 1 msi-~i-~Gld~l~~~l~~~~~~~~~~v~~al~~~a~~i~~~ak~~aPv~------T---G~Lr~sI~~~~~---g-~-- 63 (114) T protein:vir:95 1 MAI-KW-QGIEKLVATISNAQPKAVEQSLQVLKNNGEKGKRIAKQLAPKD------T---EFLKDHITTSYP---G-M-- 63 (114) T ss_pred Cee-ee-ehHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcC------c---hhhhhceeeecC---c-e-- Confidence 222 22 6999999999999887777778999999999999999999963 3 589999998652 2 2 Q ss_pred eeEeccCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018285. 81 VATVGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIR 137 (142) Q Consensus 81 ~~~VG~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~ 137 (142) +..||+. +++|+|+|+||++|||+||+.++.++. +.++.+.+.+++++.|+ T Consensus 64 ~~~V~~~----~~Ya~yvE~GT~~~~aqPfl~pa~~~~--~~~~~~~l~~~l~~~~k 114 (114) T protein:vir:95 64 EAHIHGE----AGYDGYQEYGTRFQPGTPHFRPMMEQI--QPQFQKDMTDVMKGAFK 114 (114) T ss_pred EEEeecC----CCccceeecCccccCCCccchhhHHHH--HHHHHHHHHHHHHhhcC Confidence 4568864 467999999999999999999999987 47899999999999998 No 49 >protein:vir:101594 Length: 173 # NCBI annotation: hypothetical protein # Family: family:all:26502 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112510;genbank:gi:53793610;interpro:IPR010064;uniprot:Q5ZGE3;genbank:GeneID:3101702 Probab=99.51 E-value=7.1e-17 Score=109.12 Aligned_cols=117 Identities=11% Similarity=0.134 Sum_probs=95.5 Q ss_pred HH-HHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCccccccccceeEe Q lcl|NC_018285. 6 LD-EALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNGVATV 84 (142) Q Consensus 6 ~~-~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G~~~V 84 (142) ++ +||++|++.|++|.....+...+|+.++|+++++..+.++|.. | +||++||.++..... |.+.+ T Consensus 1 i~i~Gld~L~~~L~~l~~~~~~~~~~a~~~~a~~i~~~ak~~aPv~------T---G~Lr~sI~~~~~~~~----~~~~~ 67 (173) T protein:vir:10 1 MAVKGVAEVIAELRKIGKDIDKNINATTEEAANFIEDRAKTLAPKN------F---GKLAQSISTSDLKAK----DLISK 67 (173) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcC------c---hhhhhcceeeeeccC----ceeEE Confidence 77 8999999999999888888999999999999999999999962 3 599999998764322 23444 Q ss_pred ccCCCCceeEEeecccCccc-------------------------------------------------------cCCCc Q lcl|NC_018285. 85 GWKNNYHAQNARRLNDGTKK-------------------------------------------------------YRADH 109 (142) Q Consensus 85 G~~k~~~a~~A~f~n~GT~k-------------------------------------------------------~~~~h 109 (142) +.. ..+++|+|+|+||++ ||||| T Consensus 68 ~v~--~~~~Ya~fvEfGT~~m~a~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~G~~aqP 145 (173) T protein:vir:10 68 KIT--VNELYGAYMEFGTGAKVSVPKEFADMAASFKGQKTGSFKDGLESIKAWCRAKGIDEKAAYPIFAKILGAGINPQP 145 (173) T ss_pred eeC--CCcccchhhhcccccccCCCchhhhhhcccccccccccccccccccccccccccchhcccceeeEeecCCCCCCc Confidence 443 347999999999984 88999 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_018285. 110 FVTNVQNDSSVQKKVLLAEKAEYEKLIRRK 139 (142) Q Consensus 110 Fie~t~~e~~~~~~vl~A~~~~~k~~l~~k 139 (142) |+-++.++++ +++.+.+.+.+++.|++= T Consensus 146 Fl~PA~~~~~--~~~~~~i~~~i~~~lrk~ 173 (173) T protein:vir:10 146 FLYPAWIEGK--KQYLKDLENLLKTYNKKI 173 (173) T ss_pred cchhHHHHhH--HHHHHHHHHHHHHHhhcC Confidence 9999998874 678888887777777766 No 50 >protein:vir:743 Length: 108 # NCBI annotation: unknown # Family: family:all:180 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108720;genbank:gi:13487842;genbank:GeneID:920877 Probab=99.50 E-value=7.4e-17 Score=109.01 Aligned_cols=107 Identities=15% Similarity=0.181 Sum_probs=83.4 Q ss_pred HH-HHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCccccccccceeEe Q lcl|NC_018285. 6 LD-EALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNGVATV 84 (142) Q Consensus 6 ~~-~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G~~~V 84 (142) ++ +||++|++.|++++. ...-.++++.+|..+++..+.++|.. | |+|++||.++-. +| ...+.| T Consensus 1 i~i~Gld~l~~~l~~~~~--~~~~~~al~~~a~~i~~~ak~~aPv~------T---G~Lr~si~~~~~--~~--~~~~~V 65 (108) T protein:vir:74 1 MKITGIDALQKKLRKNAT--LDDVKHVVKSNTASMNKNMQNLAPVD------T---GNMKRSITSEFT--DG--GLSGTT 65 (108) T ss_pred CcchhHHHHHHHHHHhhh--HHHHHHHHHHHHHHHHHHHHHhCCCC------c---hhhhccceeeee--cC--ceEEEe Confidence 66 789999999998763 46677999999999999999999962 3 589999987532 12 225778 Q ss_pred ccCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 85 GWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYE 133 (142) Q Consensus 85 G~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k 133 (142) |+. +++|+|+|+||++|||+||+.++.+..+ .++.+.+.+.++ T Consensus 66 ~~~----~~Ya~~vE~GT~km~aqpf~~pa~~~~~--~~~~~~i~~~~k 108 (108) T protein:vir:74 66 GPH----TDYAGYVEYGTRFQSAQPFVKPAFNIQK--KVFTNDLERLTK 108 (108) T ss_pred ecC----CCcccceeccccccCCCcchhhHHHHHH--HHHHHHHHHHcC Confidence 864 3579999999999999999999998763 455555544444 No 51 >protein:vir:9930 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795692;genbank:gi:28876456;genbank:GeneID:1257995 Probab=99.49 E-value=6.9e-17 Score=109.18 Aligned_cols=108 Identities=14% Similarity=0.104 Sum_probs=87.5 Q ss_pred HHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCccccccccceeEecc Q lcl|NC_018285. 7 DEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNGVATVGW 86 (142) Q Consensus 7 ~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G~~~VG~ 86 (142) =+|||+|++.|+++.......-.+++..+|..+....+..+|.. | |+|++||.++.. | ++.+.||- T Consensus 1 i~Gld~l~~~l~~~~~~~~~~v~~al~~~a~~i~~~ak~~aPv~------T---G~Lr~sI~~~~~---~--~~~~~v~~ 66 (108) T protein:vir:99 1 MRGLDRFLRSVERKQKSVRIAVDKELSKSAARIERQAKILAPVD------T---GWLRAQIYSEQQ---R--LLHYRVVS 66 (108) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcC------c---hhhhcceeeeec---C--cEEEEeec Confidence 67899999999999877666778999999999999999999962 3 489999998652 2 33667773 Q ss_pred CCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 87 KNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYEK 134 (142) Q Consensus 87 ~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~ 134 (142) + +.+|+|+|+||++|||+||+.++.+..+ ..+.+.+.+.+++ T Consensus 67 ~----~~Ya~~vE~GT~~m~a~Pf~~pa~~~~~--~~~~~~i~~~lrk 108 (108) T protein:vir:99 67 P----ALYSIYLELGTRKMEAQSFLDPALRKEW--PVLMANIKKMFKR 108 (108) T ss_pred C----cccchhcccCccccCCCcchhhhHHHHH--HHHHHHHHHHhcC Confidence 2 4789999999999999999999998763 5666766666666 No 52 >protein:vir:98409 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:YP_001210363;genbank:gi:146334932;genbank:GeneID:5114801 Probab=99.45 E-value=3.5e-16 Score=105.33 Aligned_cols=107 Identities=12% Similarity=0.140 Sum_probs=82.6 Q ss_pred HH-HHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCccccccccceeEe Q lcl|NC_018285. 6 LD-EALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNGVATV 84 (142) Q Consensus 6 ~~-~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G~~~V 84 (142) ++ +||++|++.|++++. ...-.++++.+|..+++..+..+|. +| ++|++||.++.. .+ .....| T Consensus 1 i~i~Gld~l~~~l~~~~~--~~~~~~al~~~a~~i~~~ak~~apv------dT---G~Lr~si~~~~~-~~---~~~~~V 65 (108) T protein:vir:98 1 MKITGIDALQKKLRKNAT--LNDVKHVVKRNTVSMNKNMQNLAPV------DT---GNMKRSITSEFT-DG---GLTGTT 65 (108) T ss_pred CcchhHHHHHHHHHHhhh--HHHHHHHHHHHHHHHHHHHHHhCCC------Cc---hhhHhhceeeee-cC---ceEEEe Confidence 66 799999999998763 4556789999999999999999996 23 589999987532 12 236778 Q ss_pred ccCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018285. 85 GWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIR 137 (142) Q Consensus 85 G~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~ 137 (142) |.+ +++|+|+|+||++|||+||+.++.+... ..+.+.+. ++|+ T Consensus 66 ~~~----~~Ya~~vE~GT~~m~aqPFl~pa~~~~~--~~~~~~i~----~~lr 108 (108) T protein:vir:98 66 IPH----TDYAGYVEYGTRFQAAQPFVKPAFDVQK--KIFTNDLE----RLTK 108 (108) T ss_pred ecC----CCccceeeccccccCCCcchhhHHHHHH--HHHHHHHH----HHcC Confidence 864 3579999999999999999999998763 44555444 4444 No 53 >protein:vir:96486 Length: 112 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238496;genbank:gi:66391772;genbank:GeneID:5176908 Probab=99.45 E-value=2.8e-16 Score=105.87 Aligned_cols=111 Identities=16% Similarity=0.220 Sum_probs=73.8 Q ss_pred cchHH-HHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCccccccccce Q lcl|NC_018285. 3 MVGLD-EALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNGV 81 (142) Q Consensus 3 m~~~~-~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G~ 81 (142) |++++ +|||+|++.|++++.. +.-.++++..+.-+++.+.+.++...+ .+| ++|++||.++. |. .+ T Consensus 1 Ma~i~i~Gld~L~~~l~~~~~~--~~v~~~v~~~~~~~~~~~~~~a~~~ap--vdT---G~Lr~sI~~~~----~~--~~ 67 (112) T protein:vir:96 1 MATIEFEGLDEMAQSLLKNASS--ERRSKVLRKYGAKLKEAAVSKAQFKKG--YST---GATRRSITLEA----GS--DR 67 (112) T ss_pred CceeeehHHHHHHHHHHhhcCH--HHHHHHHHHHHHHHHHHHHHHhhhcCC--CCc---hhhhhceeeec----Cc--eE Confidence 99999 8999999999998753 233345554444444444454432211 123 58999998753 22 25 Q ss_pred eEeccCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 82 ATVGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYE 133 (142) Q Consensus 82 ~~VG~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k 133 (142) ..||.+ ..+|+|+|+||++|+|+||+.++.++.+ +.|....+.+. T Consensus 68 ~~v~~~----~~Ya~~vE~GTr~m~AqPF~~PA~~~~~---~~~~~~l~~L~ 112 (112) T protein:vir:96 68 AVVEAL----TNYSGYLEVGTRKMEAQPFMRPALDQVV---PEMVEEMAKWE 112 (112) T ss_pred EEecCC----CCccceeccCccccCCCCchhhhHHHHH---HHHHHHHHhcC Confidence 778865 3579999999999999999999998753 33333333333 No 54 >protein:vir:97088 Length: 157 # NCBI annotation: hypothetical protein # Family: family:all:2714 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453568;genbank:gi:84662603;genbank:GeneID:5142503 Probab=99.41 E-value=1.4e-15 Score=101.97 Aligned_cols=126 Identities=17% Similarity=0.250 Sum_probs=85.2 Q ss_pred CccchHH-HHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCcccccccc Q lcl|NC_018285. 1 MAMVGLD-EALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKN 79 (142) Q Consensus 1 m~m~~~~-~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~ 79 (142) -.|.+++ .+|.+.++.|.+. ..+.-++|+.+||.++.+..+..+|.+ + +.|+++|.+.....+ ..+ T Consensus 3 ~~~~~~d~s~l~~~l~~l~~~---~~~v~R~A~~~ga~vv~dear~~aP~~------t---G~LkksI~~~~~~~~-s~~ 69 (157) T protein:vir:97 3 FSIRSVDITGILAGLETVVEH---SSDVVRTMTYESAVAVRESAKAFVNDE------T---GKLRNNLYVAYSPEE-SVE 69 (157) T ss_pred eEeecccHHHHHHHHHHhHHH---HHHHHHHHHHHHHHHHHHHHHHhCCCC------c---chhhhheeeeecccc-CCC Confidence 2333333 3355555555443 446788999999999999999999963 2 579999988542211 123 Q ss_pred ce--eEeccCCCCceeEEeecccC------------------------ccccCCCchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 80 GV--ATVGWKNNYHAQNARRLNDG------------------------TKKYRADHFVTNVQNDSSVQKKVLLAEKAEYE 133 (142) Q Consensus 80 G~--~~VG~~k~~~a~~A~f~n~G------------------------T~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k 133 (142) |. ..|||.++ +++++||+++| |++|||+||+.++.+.. ++++.+++.+.+. T Consensus 70 g~~~~~Vg~~~~-~a~~g~~vEfG~~~~~~~~~~~~~~~~~~~~~~~t~~~~Pa~PFlRPA~d~~--k~~a~~~~~~~l~ 146 (157) T protein:vir:97 70 GIQTYAVSWRKK-AAPHGHLLEFGHWQTHAAYRDKDGQWYSSKVKLVNPKWIPAKPFLRPGYDSV--AMQIPDIARAAGA 146 (157) T ss_pred ceEEEEEeecCC-ccceeeeeecCcccccccccCCcccccccccccCCCCcCCCCcccchHHHHh--HHHHHHHHHHHHH Confidence 33 45999755 58999999999 67799999999999876 3566666555444 Q ss_pred HHHhh-cCCC Q lcl|NC_018285. 134 KLIRR-KGGK 142 (142) Q Consensus 134 ~~l~~-k~g~ 142 (142) +.|.+ -+|+ T Consensus 147 k~I~e~l~g~ 156 (157) T protein:vir:97 147 KKYAELQRGD 156 (157) T ss_pred HHHHHHhcCC Confidence 33332 2333 No 55 >protein:vir:9647 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795409;genbank:gi:28876182;genbank:GeneID:1257731 Probab=99.40 E-value=1.8e-15 Score=101.40 Aligned_cols=123 Identities=10% Similarity=0.076 Sum_probs=102.0 Q ss_pred cchHH--HHHHHHHHHHHH-hcc-ccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecC-cccccc Q lcl|NC_018285. 3 MVGLD--EALEGWLETVAS-IGD-ITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQS-TNADGR 77 (142) Q Consensus 3 m~~~~--~~l~e~~~~l~k-l~~-~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~-~~~dg~ 77 (142) |..|. .|++|++++|++ |++ ...+..++||++||+++.+.|+.+.|. -++| +++-|.|++|+ +..+|. T Consensus 1 ~~~~aevkGv~Eilk~lE~klG~~~v~ri~nkAL~~~ge~v~~~lK~~~~~----f~DT---G~t~dev~~s~~~~~~G~ 73 (132) T protein:vir:96 1 MSGFANLKGVEELLANMEKKLGPAKVNRVVNRSLKEIGKELEPSFKSAISI----YKRT---GETTESAVVSGVRREDGI 73 (132) T ss_pred CCccccccCHHHHHHHHHHhhCHHHHHHHhHHHHHHHHHHHHHHHHHhhhh----hhhc---chhhcceeecCeeecCCc Confidence 77777 499999999997 887 578999999999999999999999993 3455 47999999999 557787 Q ss_pred ccceeEeccCCCCcee-EEeecccCccccC---CCchhhHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_018285. 78 KNGVATVGWKNNYHAQ-NARRLNDGTKKYR---ADHFVTNVQNDSSVQKKVLLAEKAEYEKLIRR 138 (142) Q Consensus 78 ~~G~~~VG~~k~~~a~-~A~f~n~GT~k~~---~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~~ 138 (142) + ++.|||..+ .| |.|.-+||+.+++ +--+|+++.+.++ ...++-+.+++++.|.+ T Consensus 74 r--~V~VgW~Gp--R~~ivHLNE~GyGk~~~PrG~G~I~~a~~~se--~~~~~~~~~elkk~l~~ 132 (132) T protein:vir:96 74 P--KVKLGFTTP--RWNIVHLQELEYGWKHNRRGVGVIRRYSDILE--TIYPRGIRDKLKRGFDG 132 (132) T ss_pred e--EEEecccCC--ceeEEeeecccccCCcCCCcchHHHHHHHhhh--hHHHHHHHHHHHHHhcC Confidence 6 899999754 45 6666669986543 3348999999985 56889999999999998 No 56 >protein:vir:105467 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529877;genbank:gi:90592617;genbank:GeneID:3974531 Probab=99.25 E-value=1e-13 Score=91.80 Aligned_cols=124 Identities=12% Similarity=0.066 Sum_probs=92.7 Q ss_pred CccchHH-HHHHHHHHHHHHhccc--cHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCcccccc Q lcl|NC_018285. 1 MAMVGLD-EALEGWLETVASIGDI--TPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGR 77 (142) Q Consensus 1 m~m~~~~-~~l~e~~~~l~kl~~~--~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~ 77 (142) |+|++++ .+|++|+++|++++.. ..+.-.++++.-|..+...++++||.. | ++|+.|+.+++....+. T Consensus 1 Ms~~~id~~gl~~~~~~l~~~~~~~~~~~~~~~~l~~~~~~~~~~vk~~tPVd------T---G~Lr~S~~~~~~~~~~~ 71 (144) T protein:vir:10 1 MSLGHVDDAQFQQFASRVRQKIDSGYVKQELGKSSRRIGTQSLRILEANTPVK------Q---GNLRRSWTAEGPTYGCG 71 (144) T ss_pred CCCCCccHHHHHHHHHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHhCCCC------c---chhccceeecceeeecC Confidence 9999999 8999999999987643 345667777777777777889999952 3 59999999987655432 Q ss_pred ccceeEeccCCCCceeEEeecccCccc-----------------cCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhhcC Q lcl|NC_018285. 78 KNGVATVGWKNNYHAQNARRLNDGTKK-----------------YRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIRRKG 140 (142) Q Consensus 78 ~~G~~~VG~~k~~~a~~A~f~n~GT~k-----------------~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~~k~ 140 (142) ..+..||-. ..+|||+|+||.. .+++||++++..+.+ ..+-+-+.+.+.++++-+- T Consensus 72 -~~~~~V~n~----~~YA~~VE~Ghr~~~G~~v~~~~~~~~~g~V~G~~~~~~a~~~~~--~~~~~~l~k~l~~l~d~~~ 144 (144) T protein:vir:10 72 -GWTIKLINN----AEYASYVESGHRQTPGRYVPVLKKRLVRDWVPGQFYMKKSIPQIQ--RQLPQLVTEGLWGLKDLFE 144 (144) T ss_pred -eeEEEEecC----CCcccccccceeecCCcccccCCCccccceecCccchHHHHHHHH--HHHHHHHHHHHHHHhhhcC Confidence 224668832 4679999999964 478899999998764 4455666666777776666 No 57 >protein:vir:6216 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:10886 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852596;genbank:gi:31415856;genbank:GeneID:1489214 Probab=99.24 E-value=1.2e-14 Score=96.89 Aligned_cols=119 Identities=21% Similarity=0.241 Sum_probs=100.4 Q ss_pred cchHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCcccccccccee Q lcl|NC_018285. 3 MVGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNGVA 82 (142) Q Consensus 3 m~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G~~ 82 (142) |++=..|+-|+++.+.+|.+-..+.....+.++|+.|++.|..+.|.+ .. .+++||+|+|.|.-+ |-.+ T Consensus 1 m~sNNNGFae~~~~~~tl~kVd~kvs~e~L~eAA~~f~~KL~P~Ip~S---l~--kkk~HlrD~lkVvvk------~d~V 69 (125) T protein:vir:62 1 MASNNNGFAEALEDINTLLRVNKKVSLDALDEAAKYFASKLKPKINVS---NK--NKRTHLRDSLKVVVK------DDRV 69 (125) T ss_pred CCCCchhHHHHHHHhhhhhhhhhhhhHHHHHHHHHHHHHhhccccChh---hh--hhhhhcceeeeEEee------CCeE Confidence 888788999999999999887789999999999999999999999954 22 235799999999753 4467 Q ss_pred EeccCCCCceeEEeecccCcccc------CCCchhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 83 TVGWKNNYHAQNARRLNDGTKKY------RADHFVTNVQNDSSVQKKVLLAEKAEYEKLI 136 (142) Q Consensus 83 ~VG~~k~~~a~~A~f~n~GT~k~------~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l 136 (142) .|-|.. .+|+=+|++.||+++ +++||+..|.++. ++.|.+.|.+.+-..| T Consensus 70 ~V~Fed--~a~yW~f~EnGt~~~~~~g~vkaqhf~~~Tf~~n--k~kI~~iM~kki~d~m 125 (125) T protein:vir:62 70 SVEFKD--EAWYWYLVEHGHKKAKGKGRVKGKHFVQNTFDAE--GDKIADIMAQKIINRM 125 (125) T ss_pred EEEEcc--hhhhhhhhhccccccccccccchhhhhhccHHhh--HHHHHHHHHHHHHhhC Confidence 788973 489999999999997 9999999999887 4778888888777666 No 58 >protein:vir:100652 Length: 134 # NCBI annotation: 77ORF029 # Family: family:all:589 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958610;genbank:gi:41189542;genbank:GeneID:2743798 Probab=99.17 E-value=1.4e-13 Score=91.11 Aligned_cols=122 Identities=16% Similarity=0.217 Sum_probs=94.2 Q ss_pred cchHHHHHHHHHHHHHHh--ccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCc-ccccccc Q lcl|NC_018285. 3 MVGLDEALEGWLETVASI--GDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQST-NADGRKN 79 (142) Q Consensus 3 m~~~~~~l~e~~~~l~kl--~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~-~~dg~~~ 79 (142) |.---.|++|++++|++. .....+..++||.++|+++.+.|+.+++. -++|| -+-|.|++|++ .++|.+ T Consensus 1 MsvevkGv~eil~~LE~k~g~~~~~ri~dkAL~~age~v~~~~K~~~~~----fkDTG---ati~ev~~s~p~~~~G~r- 72 (134) T protein:vir:10 1 MSVKVTGDKALERELEKHFGIKEMVKVQDKALIAGAKVIVEEIKKQLKP----SEDSG---ALISEIGRTEPEWIKGKR- 72 (134) T ss_pred CeEEeecHHHHHHHHHHhhchhhhhhhhhHHHHHHhHHHHHHHHhhcCc----ccccc---ceeccEeecCeeecCCce- Confidence 433337999999999876 45678999999999999999999999973 44555 47899999994 466765 Q ss_pred ceeEeccCCCCce-eEEeecccCccccCCCchhhH--------HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 80 GVATVGWKNNYHA-QNARRLNDGTKKYRADHFVTN--------VQNDSSVQKKVLLAEKAEYEKL 135 (142) Q Consensus 80 G~~~VG~~k~~~a-~~A~f~n~GT~k~~~~hFie~--------t~~e~~~~~~vl~A~~~~~k~~ 135 (142) ++.|||.-+..- +|.|+.||||.+++.-.||.+ +.++++ ...++-+.++++++ T Consensus 73 -~V~vgW~G~~~R~~ivHLnE~Gyt~~r~Gk~i~PrG~G~i~~a~~~~e--~~~~~~ik~eL~kl 134 (134) T protein:vir:10 73 -TVTIRWRGPFERFRIVHLIENGHVEKKSGKFVKPKAMGGINRAIRQGQ--NKYFETLKRELKKL 134 (134) T ss_pred -EEEEEEEcCCceeeEEEeeecceeecCCCCeeccchhhHHHHHHHhhh--HHHHHHHHHHHhcC Confidence 899999433222 599999999998887766655 777663 56777777777777 No 59 >protein:vir:107099 Length: 137 # NCBI annotation: conserved phage protein # Family: family:all:180 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950610;genbank:gi:119953690;genbank:GeneID:4643108 Probab=99.15 E-value=2.4e-13 Score=89.74 Aligned_cols=108 Identities=12% Similarity=0.053 Sum_probs=84.9 Q ss_pred cchHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCcccccccccee Q lcl|NC_018285. 3 MVGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNGVA 82 (142) Q Consensus 3 m~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G~~ 82 (142) |+....||++|.+.|+.+.+...+...+++..+|..++...+..+|. +| ++|++||.+.- ..+| ... T Consensus 1 Ma~~~~Gl~~l~~~l~~~~~~~~~~~~~al~~~a~~i~~~ak~~aPv------dT---G~Lr~SI~~~~-~~~~---~~~ 67 (137) T protein:vir:10 1 MAKVKYGNWELVKELEDFEKETIRWAKKGIAKTTTIIHNSIVSNMPV------DT---GYLRESVSMDF-KKGG---LTG 67 (137) T ss_pred CchhHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc------Cc---chhhcCeeEEe-eCCc---EEE Confidence 88888899999999999988888888999999999999999999996 23 58999998742 2222 356 Q ss_pred EeccCCCCceeEEeecccCc-----------------------------cccCCCchhhHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 83 TVGWKNNYHAQNARRLNDGT-----------------------------KKYRADHFVTNVQNDSSVQKKVLLAEK 129 (142) Q Consensus 83 ~VG~~k~~~a~~A~f~n~GT-----------------------------~k~~~~hFie~t~~e~~~~~~vl~A~~ 129 (142) .||.+. -+|+|+|+|| +.|||+||+.++.++++ ..|.+-+. T Consensus 68 ~V~~~~----~Ya~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~--~~i~k~i~ 137 (137) T protein:vir:10 68 VINIGS----EYAVYVNYGTGIYAVGPGGSRAKNIPWCYKDADGHWHTTKGQHAQPFWEPAIDEGR--AFFNKYFS 137 (137) T ss_pred EEecCC----CcccccccCccccccCCCccccccccceeeccccceeccCCCCCCcchhHHHHHHH--HHHHHhcC Confidence 788543 3689999997 34899999999998864 34444443 No 60 >protein:vir:98636 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039927;genbank:gi:126011102;genbank:GeneID:4818472 Probab=99.15 E-value=4e-13 Score=88.53 Aligned_cols=126 Identities=9% Similarity=0.057 Sum_probs=100.3 Q ss_pred CccchHH--HHHHHHHHHHHH-hccc-cHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecC-cccc Q lcl|NC_018285. 1 MAMVGLD--EALEGWLETVAS-IGDI-TPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQS-TNAD 75 (142) Q Consensus 1 m~m~~~~--~~l~e~~~~l~k-l~~~-~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~-~~~d 75 (142) ..|..|. .|++|++++|++ |++. ..++.++||..|++++.+.|+.+.+. -++||. .-|.|++|+ +..+ T Consensus 5 ~~~~~~aevkGv~Eilk~lE~klG~~~~~ri~nkAL~~~ge~v~~~lK~~~~~----fkDTGa---t~dev~~s~p~~~~ 77 (138) T protein:vir:98 5 VSMSGFANLKGVEELLANMEKKLGPAKVNRVVNRSLKEIGKELEPSFKSAISI----YKRTGE---TTESAVVSGVRRED 77 (138) T ss_pred ecccccccccCHHHHHHHHHHhhCHHhhhhhhhHHHHHHHHHHHHHHHhhhhh----hhhccc---eeeeeeecCeeecC Confidence 4566666 499999999987 7753 67899999999999999999999984 456664 779999999 4467 Q ss_pred ccccceeEeccCCCCceeEEeecccCccccC-C--CchhhHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_018285. 76 GRKNGVATVGWKNNYHAQNARRLNDGTKKYR-A--DHFVTNVQNDSSVQKKVLLAEKAEYEKLIRR 138 (142) Q Consensus 76 g~~~G~~~VG~~k~~~a~~A~f~n~GT~k~~-~--~hFie~t~~e~~~~~~vl~A~~~~~k~~l~~ 138 (142) |.+ ++.|||.-+ .-+|.|.-+||+.+++ | --+|+++.+.++ ..-++-+.+++++.|.+ T Consensus 78 G~r--~V~igW~Gp-R~~ivHLNE~GyGk~i~PrG~G~I~ka~~~se--~~y~~~vk~el~k~l~~ 138 (138) T protein:vir:98 78 GIP--KVKLGFTTP-RWNIVHLQELEYGWKHNRRGVGVIRRYSDILE--TIYPRGIRDKLKRGFDG 138 (138) T ss_pred Cce--EEEEeeecC-eeeEEeeecccccCCcCCCcchHHHHHHHhhh--HHHHHHHHHHHHHHhcC Confidence 766 899999754 3346666669986543 3 348999999885 56889999999999999 No 61 >protein:vir:95894 Length: 137 # NCBI annotation: ORF046 # Family: family:all:180 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240389;genbank:gi:66396083;genbank:GeneID:5133405 Probab=99.14 E-value=3.6e-13 Score=88.81 Aligned_cols=108 Identities=8% Similarity=-0.006 Sum_probs=85.8 Q ss_pred cchHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCcccccccccee Q lcl|NC_018285. 3 MVGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNGVA 82 (142) Q Consensus 3 m~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G~~ 82 (142) |+....||++|.+.|+++.+...+.-..++..+|..++...+..+|. +| ++|++||.+.-. .+ .... T Consensus 1 Ma~~~~G~~~l~~~l~~~~~~~~~~~~~~~~~~a~~v~~~ak~~aPv------~T---G~L~~Si~~~~~-~~---~~~~ 67 (137) T protein:vir:95 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPV------DT---GYLRESVTMDFK-DG---GFTG 67 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc------cc---hhhhcCeeeEee-CC---ceEE Confidence 88888899999999999988888888899999999999999999995 23 589999986432 12 2356 Q ss_pred EeccCCCCceeEEeecccCc-----------------------------cccCCCchhhHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 83 TVGWKNNYHAQNARRLNDGT-----------------------------KKYRADHFVTNVQNDSSVQKKVLLAEK 129 (142) Q Consensus 83 ~VG~~k~~~a~~A~f~n~GT-----------------------------~k~~~~hFie~t~~e~~~~~~vl~A~~ 129 (142) .||.+. -+|+|+|+|| +.|||+||+.++.++.+ ..|.+.+. T Consensus 68 ~V~~~~----~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~--~~i~k~l~ 137 (137) T protein:vir:95 68 VINIGS----EYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGR--AFFNKYFS 137 (137) T ss_pred EEecCC----CcccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHH--HHHHHhhC Confidence 788543 4689999999 67999999999988763 44544444 No 62 >protein:vir:94796 Length: 137 # NCBI annotation: ORF050 # Family: family:all:180 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240540;genbank:gi:66396237;genbank:GeneID:5133576 Probab=99.13 E-value=3.4e-13 Score=88.96 Aligned_cols=108 Identities=8% Similarity=-0.013 Sum_probs=85.9 Q ss_pred cchHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCcccccccccee Q lcl|NC_018285. 3 MVGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNGVA 82 (142) Q Consensus 3 m~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G~~ 82 (142) |+....|+++|.+.|+++.....+.-.+++..+|..++...+..+|.. | ++|++||.++-. .+ .... T Consensus 1 Ma~~~~G~~~l~~~L~~~~~~~~~~~~~al~~~a~~v~~~ak~~aPvd------T---G~Lr~SI~~~~~-~~---~~~~ 67 (137) T protein:vir:94 1 MAKVKYGNWDLVKELENYERDIERWVKRGIAKTTVKIHNTIISLMPVD------T---GYLRESVTMDFK-DG---GFTG 67 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcC------c---chhhcCceeEee-cC---cEEE Confidence 888778999999999999888788888999999999999999999962 3 589999987532 12 2356 Q ss_pred EeccCCCCceeEEeecccC-----------------------------ccccCCCchhhHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 83 TVGWKNNYHAQNARRLNDG-----------------------------TKKYRADHFVTNVQNDSSVQKKVLLAEK 129 (142) Q Consensus 83 ~VG~~k~~~a~~A~f~n~G-----------------------------T~k~~~~hFie~t~~e~~~~~~vl~A~~ 129 (142) .||.+ +.+|+|+|+| |+.|||+||+.++.++.+ ..|.+.+. T Consensus 68 ~V~~~----~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~--~~~~~~l~ 137 (137) T protein:vir:94 68 VINIG----SEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGR--VFFNKYFS 137 (137) T ss_pred EEecC----CCcccccccCccccccCCCcccccccccceeccCCceeecCCcCCCcchHHHHHHHH--HHHHHhhC Confidence 78854 3578999999 567999999999988763 44544444 No 63 >protein:vir:94490 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240680;genbank:gi:66396374;genbank:GeneID:5133754 Probab=99.11 E-value=5e-13 Score=88.02 Aligned_cols=108 Identities=9% Similarity=0.017 Sum_probs=85.9 Q ss_pred cchHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCcccccccccee Q lcl|NC_018285. 3 MVGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNGVA 82 (142) Q Consensus 3 m~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G~~ 82 (142) |+....||++|.+.|+++.....+.-.+++..+|..++...+..+|. +| ++|++||.+.- ..+| .+. T Consensus 1 Ma~~~~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPv------dT---G~Lr~SI~~~~-~~~~---~~~ 67 (137) T protein:vir:94 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPV------DT---GYLRESVTMDF-KDSG---FTG 67 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc------cc---cchhccceeEe-ecCc---eEE Confidence 88888899999999999988888888899999999999999999995 23 58999998753 2122 256 Q ss_pred EeccCCCCceeEEeecccCc-----------------------------cccCCCchhhHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 83 TVGWKNNYHAQNARRLNDGT-----------------------------KKYRADHFVTNVQNDSSVQKKVLLAEK 129 (142) Q Consensus 83 ~VG~~k~~~a~~A~f~n~GT-----------------------------~k~~~~hFie~t~~e~~~~~~vl~A~~ 129 (142) .||.. +-+|+|+|+|| +.|||+||+.++.++.+ ..|.+-+. T Consensus 68 ~V~~~----~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~--~~~~~~l~ 137 (137) T protein:vir:94 68 VINIG----SEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGR--AFFNKYFS 137 (137) T ss_pred EEecC----CCcccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHH--HHHHHhhC Confidence 78854 34789999999 57999999999988763 44444444 No 64 >protein:vir:97427 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240753;genbank:gi:66396447;genbank:GeneID:5133783 Probab=99.11 E-value=5e-13 Score=88.02 Aligned_cols=108 Identities=9% Similarity=0.017 Sum_probs=85.9 Q ss_pred cchHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCcccccccccee Q lcl|NC_018285. 3 MVGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNGVA 82 (142) Q Consensus 3 m~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G~~ 82 (142) |+....||++|.+.|+++.....+.-.+++..+|..++...+..+|. +| ++|++||.+.- ..+| .+. T Consensus 1 Ma~~~~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPv------dT---G~Lr~SI~~~~-~~~~---~~~ 67 (137) T protein:vir:97 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPV------DT---GYLRESVTMDF-KDSG---FTG 67 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc------cc---cchhccceeEe-ecCc---eEE Confidence 88888899999999999988888888899999999999999999995 23 58999998753 2122 256 Q ss_pred EeccCCCCceeEEeecccCc-----------------------------cccCCCchhhHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 83 TVGWKNNYHAQNARRLNDGT-----------------------------KKYRADHFVTNVQNDSSVQKKVLLAEK 129 (142) Q Consensus 83 ~VG~~k~~~a~~A~f~n~GT-----------------------------~k~~~~hFie~t~~e~~~~~~vl~A~~ 129 (142) .||.. +-+|+|+|+|| +.|||+||+.++.++.+ ..|.+-+. T Consensus 68 ~V~~~----~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~--~~~~~~l~ 137 (137) T protein:vir:97 68 VINIG----SEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGR--AFFNKYFS 137 (137) T ss_pred EEecC----CCcccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHH--HHHHHhhC Confidence 78854 34789999999 57999999999988763 44444444 No 65 >protein:vir:93738 Length: 137 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240463;genbank:gi:66396153;genbank:GeneID:5133507 Probab=99.11 E-value=5e-13 Score=88.02 Aligned_cols=108 Identities=9% Similarity=0.017 Sum_probs=85.9 Q ss_pred cchHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCcccccccccee Q lcl|NC_018285. 3 MVGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNGVA 82 (142) Q Consensus 3 m~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G~~ 82 (142) |+....||++|.+.|+++.....+.-.+++..+|..++...+..+|. +| ++|++||.+.- ..+| .+. T Consensus 1 Ma~~~~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPv------dT---G~Lr~SI~~~~-~~~~---~~~ 67 (137) T protein:vir:93 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPV------DT---GYLRESVTMDF-KDSG---FTG 67 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc------cc---cchhccceeEe-ecCc---eEE Confidence 88888899999999999988888888899999999999999999995 23 58999998753 2122 256 Q ss_pred EeccCCCCceeEEeecccCc-----------------------------cccCCCchhhHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 83 TVGWKNNYHAQNARRLNDGT-----------------------------KKYRADHFVTNVQNDSSVQKKVLLAEK 129 (142) Q Consensus 83 ~VG~~k~~~a~~A~f~n~GT-----------------------------~k~~~~hFie~t~~e~~~~~~vl~A~~ 129 (142) .||.. +-+|+|+|+|| +.|||+||+.++.++.+ ..|.+-+. T Consensus 68 ~V~~~----~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~--~~~~~~l~ 137 (137) T protein:vir:93 68 VINIG----SEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGR--AFFNKYFS 137 (137) T ss_pred EEecC----CCcccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHH--HHHHHhhC Confidence 78854 34789999999 57999999999988763 44444444 No 66 >protein:vir:81147 Length: 126 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285816;genbank:gi:148747737;genbank:GeneID:5247190 Probab=99.07 E-value=2.1e-12 Score=84.63 Aligned_cols=121 Identities=18% Similarity=0.268 Sum_probs=94.0 Q ss_pred CccchHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCccccccccc Q lcl|NC_018285. 1 MAMVGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNG 80 (142) Q Consensus 1 m~m~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G 80 (142) |+-.++++.-+++.+.|+.+...+.++-+++++..|+.+.+.|+..+|.. | +.+++++.+......| + T Consensus 1 Ma~i~id~la~~I~~~L~~y~~~v~~~v~~~v~~~a~~~~~~ik~~aP~r------T---G~y~ksw~vk~~~~~g---~ 68 (126) T protein:vir:81 1 MANITIDRLADELLQAVKEYTDDVAEGVRKKVDETARKVLKEAQALAPKR------T---GEYARTFTITKEDGYG---T 68 (126) T ss_pred CcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcc------c---chhhccccccccccCC---c Confidence 66656666557789999999888899999999999999999999999952 3 4799999987643222 2 Q ss_pred eeEeccCCCCceeEEeecccCccc-----cCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_018285. 81 VATVGWKNNYHAQNARRLNDGTKK-----YRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIRRKGG 141 (142) Q Consensus 81 ~~~VG~~k~~~a~~A~f~n~GT~k-----~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~~k~g 141 (142) ...|-|.++ +..++|+||+||.+ +||.|||.++.+. +.+.+.+..+++|.+ || T Consensus 69 ~~~vv~~~~-~~~l~HLLEfGha~r~gGrV~a~Phi~Pa~e~------~~~~~~~~i~~~l~~-gg 126 (126) T protein:vir:81 69 TKRIIWNKK-HYRRVHLLEFGHAKVNGGRVKEYPHLRPAYDK------HGARLPDELKRVIEN-GG 126 (126) T ss_pred ceEEEeccC-CCCceeeeecceecCCCCccCCCcchHHHHHH------HHHHHHHHHHHHhhc-CC Confidence 455777654 56799999999997 7999999999654 345566777777764 44 No 67 >protein:vir:96121 Length: 137 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240082;genbank:gi:66395767;genbank:GeneID:5133101 Probab=99.06 E-value=1.2e-12 Score=85.86 Aligned_cols=108 Identities=12% Similarity=0.014 Sum_probs=84.6 Q ss_pred cchHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCcccccccccee Q lcl|NC_018285. 3 MVGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNGVA 82 (142) Q Consensus 3 m~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G~~ 82 (142) |+....||++|++.|+++.....+.-.++++..|..++...+..+|.. | ++|++||.++-. .+| ... T Consensus 1 Ma~~~~G~~~l~~~l~~~~~~~~~~~~~~l~~~a~~~~~~ak~~~pvd------T---G~L~~Si~~~~~-~~g---~~~ 67 (137) T protein:vir:96 1 MAKVKYGNWDLVAELEDYRDEMEEWVKKGILKTTLAIYNTAVALAPVD------L---GFLKESIDFKVT-DGG---FSS 67 (137) T ss_pred CchhHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcC------c---cchhcCceeEee-cCc---eEE Confidence 888778999999999999888888888899999999999999999952 3 589999987532 122 356 Q ss_pred EeccCCCCceeEEeecccCc-----------------------------cccCCCchhhHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 83 TVGWKNNYHAQNARRLNDGT-----------------------------KKYRADHFVTNVQNDSSVQKKVLLAEK 129 (142) Q Consensus 83 ~VG~~k~~~a~~A~f~n~GT-----------------------------~k~~~~hFie~t~~e~~~~~~vl~A~~ 129 (142) .||.+ +-+|.|+|+|| ..|||+||+.++.++.+ ..|.+-+. T Consensus 68 ~V~~~----~~YA~yvE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~pFl~pA~~~~~--~~i~k~i~ 137 (137) T protein:vir:96 68 VISVG----AEYAIYVEFGTGIYATGPGGSRARKLPWTYKGDDGEWHTTYGQQAQPFWNPAIDEGR--KVFNRYFS 137 (137) T ss_pred EEecC----CCcccccccCccccccCCCccccccccceeeccCcceeecCCCCCCcchhHHHHHHH--HHHHHhhC Confidence 78854 34789999999 55899999999998763 34444444 No 68 >protein:vir:101302 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908835;genbank:gi:118725099;genbank:GeneID:4555873 Probab=99.05 E-value=8.3e-13 Score=86.83 Aligned_cols=122 Identities=18% Similarity=0.170 Sum_probs=92.4 Q ss_pred cchHHHHHHHHHHHHHHh--ccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCcc-cccccc Q lcl|NC_018285. 3 MVGLDEALEGWLETVASI--GDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTN-ADGRKN 79 (142) Q Consensus 3 m~~~~~~l~e~~~~l~kl--~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~-~dg~~~ 79 (142) |.---.|++|++++|++. .....+..++||..+++++.+.|+.+++. -++|| -.-|.|++|++. .+|.+ T Consensus 1 msvevkGv~eil~~le~k~g~~~~~ri~nkAL~~age~v~~~~K~~~~~----fkDTG---~t~~ev~~s~p~~~~G~r- 72 (134) T protein:vir:10 1 MSVKVIGDKALERELEKRFGIKEMVKVQDKALIAGAKVIVEEVKKQLKP----SKDTG---ALINEVSFSKPEWINGKR- 72 (134) T ss_pred CeEEEecHHHHHHHHHHhhchhhhhhhhhHHHHHHHHHHHHHHHhhhhh----hhhcc---ceeccEEecCeeecCCce- Confidence 433338999999999876 45677999999999999999999999983 34455 588999999955 56765 Q ss_pred ceeEeccCCCCce-eEEeecccCccccCCCc--------hhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 80 GVATVGWKNNYHA-QNARRLNDGTKKYRADH--------FVTNVQNDSSVQKKVLLAEKAEYEKL 135 (142) Q Consensus 80 G~~~VG~~k~~~a-~~A~f~n~GT~k~~~~h--------Fie~t~~e~~~~~~vl~A~~~~~k~~ 135 (142) ++.|||.-+..- +|.|+.|||+.+...-. =|+++.++++ ...++-+.++++++ T Consensus 73 -~V~vgW~G~~~R~~iiHLNE~Gytr~~~Gk~i~PrG~G~i~~a~~~~e--~~~~~~ik~eL~kl 134 (134) T protein:vir:10 73 -TITVHWRGSKDRYKIVHLIEYGHVQKGTGKFIKPKAMGGVNRAIRQGQ--NKYFETLKRELKKL 134 (134) T ss_pred -EEEEEEEcCCceeEEEEeecccceecccCCccCcchhhHHHHHHHhhh--HHHHHHHHHHHhcC Confidence 899999433222 58999999977754333 3566887774 56777788888877 No 69 >protein:vir:9513 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835560;genbank:gi:30043947;genbank:GeneID:1260542 Probab=99.05 E-value=8.3e-13 Score=86.83 Aligned_cols=122 Identities=18% Similarity=0.170 Sum_probs=92.4 Q ss_pred cchHHHHHHHHHHHHHHh--ccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCcc-cccccc Q lcl|NC_018285. 3 MVGLDEALEGWLETVASI--GDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTN-ADGRKN 79 (142) Q Consensus 3 m~~~~~~l~e~~~~l~kl--~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~-~dg~~~ 79 (142) |.---.|++|++++|++. .....+..++||..+++++.+.|+.+++. -++|| -.-|.|++|++. .+|.+ T Consensus 1 msvevkGv~eil~~le~k~g~~~~~ri~nkAL~~age~v~~~~K~~~~~----fkDTG---~t~~ev~~s~p~~~~G~r- 72 (134) T protein:vir:95 1 MSVKVIGDKALERELEKRFGIKEMVKVQDKALIAGAKVIVEEVKKQLKP----SKDTG---ALINEVSFSKPEWINGKR- 72 (134) T ss_pred CeEEEecHHHHHHHHHHhhchhhhhhhhhHHHHHHHHHHHHHHHhhhhh----hhhcc---ceeccEEecCeeecCCce- Confidence 433338999999999876 45677999999999999999999999983 34455 588999999955 56765 Q ss_pred ceeEeccCCCCce-eEEeecccCccccCCCc--------hhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 80 GVATVGWKNNYHA-QNARRLNDGTKKYRADH--------FVTNVQNDSSVQKKVLLAEKAEYEKL 135 (142) Q Consensus 80 G~~~VG~~k~~~a-~~A~f~n~GT~k~~~~h--------Fie~t~~e~~~~~~vl~A~~~~~k~~ 135 (142) ++.|||.-+..- +|.|+.|||+.+...-. =|+++.++++ ...++-+.++++++ T Consensus 73 -~V~vgW~G~~~R~~iiHLNE~Gytr~~~Gk~i~PrG~G~i~~a~~~~e--~~~~~~ik~eL~kl 134 (134) T protein:vir:95 73 -TITVHWRGSKDRYKIVHLIEYGHVQKGTGKFIKPKAMGGVNRAIRQGQ--NKYFETLKRELKKL 134 (134) T ss_pred -EEEEEEEcCCceeEEEEeecccceecccCCccCcchhhHHHHHHHhhh--HHHHHHHHHHHhcC Confidence 899999433222 58999999977754333 3566887774 56777788888877 No 70 >protein:vir:105330 Length: 137 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950673;genbank:gi:119967843;genbank:GeneID:4643209 Probab=99.05 E-value=1.1e-12 Score=86.15 Aligned_cols=108 Identities=12% Similarity=0.056 Sum_probs=83.0 Q ss_pred cchHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCcccccccccee Q lcl|NC_018285. 3 MVGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNGVA 82 (142) Q Consensus 3 m~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G~~ 82 (142) |+.++.||++|.+.|+++.....+...+++...|..++...+.++|. +| ++|++||.+.- ..+| ... T Consensus 1 Ma~~~~G~~~l~~~l~~~~~~~~~~~~~al~~~a~~i~~~ak~~aPv------~T---G~Lr~SI~~~~-~~~~---~~~ 67 (137) T protein:vir:10 1 MAKVKYGNWDLVKELEEFEKETIRWAKKGIAKTTTIIHNSIVSNMPV------DT---GYLRESVSMDF-KKGG---LTG 67 (137) T ss_pred CccchhCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc------Cc---chhhcCeeeEe-cCCc---EEE Confidence 88888899999999999987777777888998888999999999995 23 58999998753 2222 246 Q ss_pred EeccCCCCceeEEeecccCc-----------------------------cccCCCchhhHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 83 TVGWKNNYHAQNARRLNDGT-----------------------------KKYRADHFVTNVQNDSSVQKKVLLAEK 129 (142) Q Consensus 83 ~VG~~k~~~a~~A~f~n~GT-----------------------------~k~~~~hFie~t~~e~~~~~~vl~A~~ 129 (142) .||.+ .-+|+|+|+|| ..|||+||+.++.++++ ..|.+-++ T Consensus 68 ~V~~~----~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~Pfl~pA~~~~~--~~i~k~i~ 137 (137) T protein:vir:10 68 VINIG----SEYAVYVNYGTGIYAVGPGGSRAKNIPWRYKDADGHWHTTKGQHAQPFWEPAIDEGR--AFFNKYFS 137 (137) T ss_pred EEecC----CccccccccCccccccCCCcccccccceeeeccccccccCCCCCCCcchhHHHHHHH--HHHHHhhC Confidence 67754 34789999998 34899999999998864 33444333 No 71 >protein:vir:94108 Length: 149 # NCBI annotation: ORF029 # Family: family:all:180 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240238;genbank:gi:66395914;genbank:GeneID:5133277 Probab=99.03 E-value=1.9e-12 Score=84.86 Aligned_cols=110 Identities=9% Similarity=0.007 Sum_probs=84.7 Q ss_pred CccchHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCccccccccc Q lcl|NC_018285. 1 MAMVGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNG 80 (142) Q Consensus 1 m~m~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G 80 (142) --|+++..||++|.+.|+++.....+.-.+++...|..++...+..+|. +| ++|++||.++-. .+| - T Consensus 11 ~~Ma~~~~Gld~l~~~L~~~~~~~~~~~~~al~~~a~~v~~~ak~~aPv------dT---G~Lr~SI~~~~~-~~g---~ 77 (149) T protein:vir:94 11 CHMAKVKYGADSMVVELDKFDKKIEEWVKKGIAKTTTKIYNTAVALAPV------DL---GFLEESIDFKYF-DGG---L 77 (149) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc------cc---chhhcCeeEEee-CCc---E Confidence 2367777799999999999988888889999999999999999999995 23 589999987532 122 2 Q ss_pred eeEeccCCCCceeEEeecccCc-----------------------------cccCCCchhhHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 81 VATVGWKNNYHAQNARRLNDGT-----------------------------KKYRADHFVTNVQNDSSVQKKVLLAEK 129 (142) Q Consensus 81 ~~~VG~~k~~~a~~A~f~n~GT-----------------------------~k~~~~hFie~t~~e~~~~~~vl~A~~ 129 (142) ...||.+ .-+|+|+|+|| ..|||+||+.++.++.+ ..|.+.+. T Consensus 78 ~~~V~~~----~~YA~~VE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~~--~~i~~~i~ 149 (149) T protein:vir:94 78 SSVISVG----ADYAIYVEYGTGIYATGPGGSRATKIPWSFKGDDGEWYTTYGQAPQPFWNPAIDAGR--KTFEQYFS 149 (149) T ss_pred EEEEecC----CCcccccccCccccccCCCccccccccceeecCccceecCCCCCCCcchHHHHHHHH--HHHHHhhC Confidence 4567754 34799999999 44789999999998763 44444444 No 72 >protein:vir:105916 Length: 149 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004379;genbank:gi:122891834;genbank:GeneID:4712387 Probab=99.01 E-value=2.5e-12 Score=84.15 Aligned_cols=110 Identities=9% Similarity=0.007 Sum_probs=84.8 Q ss_pred CccchHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCccccccccc Q lcl|NC_018285. 1 MAMVGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNG 80 (142) Q Consensus 1 m~m~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G 80 (142) --|++...||++|.+.|+++.....+.-.+++..+|..++...+..+|. +| ++|++||.++-. .+| - T Consensus 11 ~~Ma~v~~Gld~l~~~l~~~~~~~~~~~~~~l~~~a~~v~~~ak~~aPv------dT---G~L~~SI~~~~~-~~g---~ 77 (149) T protein:vir:10 11 CHMAKVKYGADSMVVELDKFDKKIEEWVKKGIAKTTTKIYNTAVALAPV------DL---GFLEESIDFKYF-DGG---L 77 (149) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc------cc---chhhccceEEec-CCc---E Confidence 2356666799999999999988888889999999999999999999995 23 589999987532 122 2 Q ss_pred eeEeccCCCCceeEEeecccCc-----------------------------cccCCCchhhHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 81 VATVGWKNNYHAQNARRLNDGT-----------------------------KKYRADHFVTNVQNDSSVQKKVLLAEK 129 (142) Q Consensus 81 ~~~VG~~k~~~a~~A~f~n~GT-----------------------------~k~~~~hFie~t~~e~~~~~~vl~A~~ 129 (142) ...||.+ .-+|+|+|+|| ..|||+||+.++.++.+ ..|.+.++ T Consensus 78 ~~~V~~~----~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~k--~~i~~~i~ 149 (149) T protein:vir:10 78 SSVISVG----ADYAIYVEYGTGIYATGPGGSRATKIPWSFKGDDGEWYTTYGQAPQPFWNPAIDAGR--KTFEQYFS 149 (149) T ss_pred EEEEecC----CCcccccccCccccccCCcccccccccceeeccccceecCCCCCCCcchhHHHHHHH--HHHHHhhC Confidence 4668854 34689999999 44789999999998864 44544444 No 73 >protein:vir:96829 Length: 135 # NCBI annotation: ORF033 # Family: family:all:180 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240161;genbank:gi:66395838;genbank:GeneID:5133170 Probab=98.99 E-value=3.1e-12 Score=83.68 Aligned_cols=108 Identities=12% Similarity=0.048 Sum_probs=83.1 Q ss_pred cchHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCcccccccccee Q lcl|NC_018285. 3 MVGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNGVA 82 (142) Q Consensus 3 m~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G~~ 82 (142) |+...-||++|.+.|+++.....+.-.+++..+|..++...+..+|. +| +.|++||.+.- +.+| -.. T Consensus 1 Ma~~~~Gl~~l~~~l~~~~~~~~~~~~~al~~~a~~v~~~ak~~apv------dT---G~Lr~SI~~~~-~~~g---~~~ 67 (135) T protein:vir:96 1 MAKVKYGADSIVVDLEKYSKDMEKWVKKGITKTTLKIYNTAIHLMPV------DT---GFLRQSTTVDF-ENGG---FTG 67 (135) T ss_pred CchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc------cc---hhhhcceeEEe-ecCc---EEE Confidence 77777799999999999988778888889999999999999999984 23 58999998742 2122 246 Q ss_pred EeccCCCCceeEEeecccCc---------------------------cccCCCchhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 83 TVGWKNNYHAQNARRLNDGT---------------------------KKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKL 135 (142) Q Consensus 83 ~VG~~k~~~a~~A~f~n~GT---------------------------~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~ 135 (142) .||.+ .-+|.|+|+|| ..|||+||+.++.++.+ .++ .++ T Consensus 68 ~V~~~----~~YA~~ve~GT~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~a~pfl~~A~~~~~--~~~--------~~~ 133 (135) T protein:vir:96 68 VVKIG----SNYAVYVNYGTGIYATKGSRAHKIPWTYKDPNGKWHTTYGQMPQPFWEPAIDAGR--QTF--------EQY 133 (135) T ss_pred EEecC----CCccchhhcccccccCCCccccccccccccCCcceeecCCcCCCcchhHHHHHHH--HHH--------HHh Confidence 68843 35889999999 56999999999988763 333 333 Q ss_pred Hh Q lcl|NC_018285. 136 IR 137 (142) Q Consensus 136 l~ 137 (142) |. T Consensus 134 i~ 135 (135) T protein:vir:96 134 FS 135 (135) T ss_pred cC Confidence 33 No 74 >protein:vir:106570 Length: 182 # NCBI annotation: putative protein # Family: family:all:6475 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958588;genbank:gi:41179258;genbank:GeneID:2717106 Probab=98.96 E-value=5.2e-12 Score=82.44 Aligned_cols=123 Identities=15% Similarity=0.134 Sum_probs=71.7 Q ss_pred cchHH-HHHHHHHHHHHHhccccHHHHHHHH----HHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCcccccc Q lcl|NC_018285. 3 MVGLD-EALEGWLETVASIGDITPAEQAKIT----TAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGR 77 (142) Q Consensus 3 m~~~~-~~l~e~~~~l~kl~~~~~~~~~ka~----~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~ 77 (142) |+.++ .|||+|.+.|+++.....+.-.+++ ..++..+...++..+|. +| +.|++||...-....+. T Consensus 1 m~~v~i~Gld~L~~kl~~~~~~~~~~v~~a~~~~~~~~a~~v~~~ak~~~Pv------dt---G~Lr~SI~~~~~~~~~~ 71 (182) T protein:vir:10 1 MIEVELKGVNELRAKLKKLPDIMAKATANAQENAIEQAEAYAVDELQSSIKY------ST---GELTRSFKHEVKVDGDE 71 (182) T ss_pred CeEEEEecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCC------Cc---hhhhhceeeeeeecCCe Confidence 89999 8999999999988654333333444 44445555556666664 23 57999997533221221 Q ss_pred ccceeEeccCCCCceeEEeecccC------------------------------------------------------cc Q lcl|NC_018285. 78 KNGVATVGWKNNYHAQNARRLNDG------------------------------------------------------TK 103 (142) Q Consensus 78 ~~G~~~VG~~k~~~a~~A~f~n~G------------------------------------------------------T~ 103 (142) . +..||-+ +-+|.|+|+| |. T Consensus 72 ~--~g~V~~~----~~ya~yvE~GTG~~~~~~~~~~~p~~~~~~~~~~w~~~~~~v~~~~a~~~~~~~~~~~~~~~~~t~ 145 (182) T protein:vir:10 72 V--IGRWWNS----SMVAVFREFGTGLVGERSHKQLPKNVAIIYRQTPWFFPVDSVDLDLTKIYGIPKIKINGKYFYRTT 145 (182) T ss_pred E--EEEeecC----CCccceeecCcccccccCccccCccceeeeecCCceeeccccccccccccccceeeecCceEeecC Confidence 1 1223321 1123333333 56 Q ss_pred ccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC Q lcl|NC_018285. 104 KYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIRRKGGK 142 (142) Q Consensus 104 k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~~k~g~ 142 (142) .|||+||+.++.++++ ++|.+.+.+..++.|++.-|- T Consensus 146 G~~aqPFl~pA~~~~~--~~i~~~i~~~i~~~l~~~~g~ 182 (182) T protein:vir:10 146 GQPARQFMTPAANKMA--KEAPEIIKRSIDQELHDKLGG 182 (182) T ss_pred CCCCCcchHHHHHHhH--HHHHHHHHHHHHHHHHHhhcC Confidence 7999999999999874 445555555555544443333 No 75 >protein:vir:94654 Length: 142 # NCBI annotation: tail component protein # Family: family:all:1084 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579211;genbank:gi:93007447;genbank:GeneID:5076773 Probab=98.87 E-value=9.4e-12 Score=81.04 Aligned_cols=115 Identities=14% Similarity=0.109 Sum_probs=85.9 Q ss_pred CccchHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCccccccccc Q lcl|NC_018285. 1 MAMVGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNG 80 (142) Q Consensus 1 m~m~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G 80 (142) |+..++.-++++|.+.|+.+.......-.+++...|..++...+.++|.. | ++|++||.+.. ..+|. .+ T Consensus 1 Ma~~~~~~~~~~l~~~l~~~~~~~~~~~~~~l~~~a~~i~~~ak~~aPv~------T---G~Lr~SI~~~~-~~~g~-~~ 69 (142) T protein:vir:94 1 MAGLNYRVNSTEFQGALRAALDRLTGAAREATEAAANDMVNMAKGLCPVD------T---GRLRSSIQAVP-SGGRF-SF 69 (142) T ss_pred CceeEEEecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcc------c---hhhhccceeee-ccCCc-eE Confidence 55554445789999999988776677788899999999999999999952 3 58999998653 22332 23 Q ss_pred eeEeccCCCCceeEEeecccCccc---------------------------cCCCchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 81 VATVGWKNNYHAQNARRLNDGTKK---------------------------YRADHFVTNVQNDSSVQKKVLLAEKAEYE 133 (142) Q Consensus 81 ~~~VG~~k~~~a~~A~f~n~GT~k---------------------------~~~~hFie~t~~e~~~~~~vl~A~~~~~k 133 (142) ...||. ...+|.|+|+||.- ++|+||+.++.+++ .+.++...++++ T Consensus 70 ~~~v~~----~~~YA~~vE~Gt~~~~i~pk~~k~l~~~~~~~~~~~v~~pG~~~~pfl~~A~~~~---~~~i~~~~~~~~ 142 (142) T protein:vir:94 70 SVTIGT----NVTYAADVEYGTAPHVIVPKDKKALYWPGAAHPVAKVNHPGTRAQPFMRPAIAAA---STFLRNHAKGIR 142 (142) T ss_pred EEEEec----CcccchhhhccCCCceeccCCCccceecccceeeeeeeecCCCCCcchhHHHHHH---HHHHHHHHHhcC Confidence 556763 25789999999943 67999999999876 345566666666 No 76 >protein:vir:79034 Length: 141 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110729;genbank:gi:134287346;genbank:GeneID:4955208 Probab=98.80 E-value=4e-11 Score=77.57 Aligned_cols=125 Identities=12% Similarity=0.027 Sum_probs=83.3 Q ss_pred Ccc-chHH-HHHHHHHHHHHHhcc-ccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCcc---- Q lcl|NC_018285. 1 MAM-VGLD-EALEGWLETVASIGD-ITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTN---- 73 (142) Q Consensus 1 m~m-~~~~-~~l~e~~~~l~kl~~-~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~---- 73 (142) |+= +.++ .+|++|.+.|++++. ...+.-.++++..|..+....++.||.. | ++|+.++.++... T Consensus 1 M~~~~~~d~~gl~~~~~~l~~~~~~~~~~~~~~~~~~~a~~l~~~vk~~tPVd------T---G~Lr~sw~~~~~~~~~~ 71 (141) T protein:vir:79 1 MARWGSVDFREFKRVCKKMEKLTKIDLDKFCKDAARELAARLLGKVIRRTPVD------T---GFLRQGWNGVAYARSLP 71 (141) T ss_pred CCCCccCcHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCc------c---hhhcccccccccccccc Confidence 432 3445 699999999999876 4566778899999999999999999952 3 5899998775421 Q ss_pred ccccccc-eeEeccCCCCceeEEeecccCccccCCCchh------hHHHHHHHH--HHHHHHHHHHHHHHHHhh Q lcl|NC_018285. 74 ADGRKNG-VATVGWKNNYHAQNARRLNDGTKKYRADHFV------TNVQNDSSV--QKKVLLAEKAEYEKLIRR 138 (142) Q Consensus 74 ~dg~~~G-~~~VG~~k~~~a~~A~f~n~GT~k~~~~hFi------e~t~~e~~~--~~~vl~A~~~~~k~~l~~ 138 (142) +....++ ++.||- ...+|||+|+||+.++|++|| +++..+.+. .+.|-++..+.|++++++ T Consensus 72 ~~~~g~~~~v~v~n----~~~YA~~VE~Ghr~~~~~gfV~G~fml~~s~~~~~~~~~~~~~~~l~~~l~~~~~~ 141 (141) T protein:vir:79 72 VYKQGNNYIIEVVN----PTEYASYVNFGHRTKDGKGWVKGQHFLTISEMELQSQVDKIIEKKLLILLKGVFDA 141 (141) T ss_pred eeecCCeeEEEEec----CCcchhhhhcceeecCCcceeCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 1111122 345772 257899999999988776654 565544321 123445555566666666 No 77 >protein:vir:5978 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690678;genbank:geneid:6329146;genbank:gi:22855072;interpro:IPR011693;uniprot:O48447;genbank:GeneID:955318 Probab=98.80 E-value=7.6e-11 Score=76.06 Aligned_cols=114 Identities=14% Similarity=0.157 Sum_probs=83.0 Q ss_pred Cccch--HH-HHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCcccccc Q lcl|NC_018285. 1 MAMVG--LD-EALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGR 77 (142) Q Consensus 1 m~m~~--~~-~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~ 77 (142) |+.-. ++ .++++|.+.|+.+.....+.-.+++...|+.++...+..+|.. | ++|++||.+.- +.+| T Consensus 1 m~~ms~~i~~~g~~~l~~~l~~~~~~~~~~v~~~l~~~a~~i~~~ak~~apv~------T---G~Lr~SI~~~~-~~~g- 69 (144) T protein:vir:59 1 MALMSVRIDPSWRRIMSRNVRTFSGHVLTQVEQVIIKTAEKIAGLAASLAPVD------E---GNLKNSIQIDY-KNNG- 69 (144) T ss_pred CCcceeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcc------c---hhhhcCeeEEe-ecCc- Confidence 43322 23 6899999999988776677777899999999999999999852 3 58999998752 2222 Q ss_pred ccceeEeccCCCCceeEEeecccCc---------------------------cccCCCchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 78 KNGVATVGWKNNYHAQNARRLNDGT---------------------------KKYRADHFVTNVQNDSSVQKKVLLAEKA 130 (142) Q Consensus 78 ~~G~~~VG~~k~~~a~~A~f~n~GT---------------------------~k~~~~hFie~t~~e~~~~~~vl~A~~~ 130 (142) .+..||.+ .-+|.|+|+|| ..|||+||+.++.++.+ .. ..+ T Consensus 70 --~~~~V~~~----~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~--~~----~~~ 137 (144) T protein:vir:59 70 --LTAEITVG----AEYAIYVEYGTGIYAVDGNGRKTPWTYYSPKLGRYVRTQGAPAQPFFWPAVEEGG--EY----FER 137 (144) T ss_pred --EEEEEecC----CCccchhhcCccccccCCCccccccccccccccceecCCCCCCCcchhHHHHHHH--HH----HHH Confidence 25678854 34789999998 46999999999998753 22 234 Q ss_pred HHHHHHh Q lcl|NC_018285. 131 EYEKLIR 137 (142) Q Consensus 131 ~~k~~l~ 137 (142) .++++.+ T Consensus 138 ~i~~~~g 144 (144) T protein:vir:59 138 EMRRLRG 144 (144) T ss_pred HHHHhcC Confidence 4666666 No 78 >protein:vir:102963 Length: 163 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945289;genbank:gi:39653724;uniprot:Q708M3;genbank:GeneID:2672877 Probab=98.71 E-value=3.9e-10 Score=72.15 Aligned_cols=134 Identities=16% Similarity=0.173 Sum_probs=90.4 Q ss_pred cch-HH-HHHHHHHHHHHHhccc--cHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCC------------------CCCCc Q lcl|NC_018285. 3 MVG-LD-EALEGWLETVASIGDI--TPAEQAKITTAGAKVFQKELEEVTREKHYSNK------------------KDLKY 60 (142) Q Consensus 3 m~~-~~-~~l~e~~~~l~kl~~~--~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~------------------~~~k~ 60 (142) |.+ ++ .+|++|.++|.+++.. ..+.-.+.++.-|.-+...+++.||..-|... ...+. T Consensus 1 m~~~~d~~~l~~f~k~l~~~~~~~~~~~~~~~~~~e~a~~ll~~vk~rtPv~~~~~~~~~~~~~~~k~~k~~~~~~~k~t 80 (163) T protein:vir:10 1 MSGGFDYRSFAKFANNFNRNANHAKVDRFMRQTLNYEGTELKSKVKERTPVGVYTDHWVEFTTKDGKHVKFWASAHGKQG 80 (163) T ss_pred CCCccCHHHHHHHHHHHHHHhhhcchHHHHHHHHHHHHHHHHHHHHHhCCcccchhhhhhhhhcccchhhhhcccccccc Confidence 554 55 5899999999987642 23334566666666667778889996432211 11224 Q ss_pred ccchhcceecCccccccccceeEeccCCCCceeEEeecccCc-----cccCCCchhhHHHHHHHHH--HHHHHHHHHHHH Q lcl|NC_018285. 61 GHMADGLSVQSTNADGRKNGVATVGWKNNYHAQNARRLNDGT-----KKYRADHFVTNVQNDSSVQ--KKVLLAEKAEYE 133 (142) Q Consensus 61 ~HlaD~I~~~~~~~dg~~~G~~~VG~~k~~~a~~A~f~n~GT-----~k~~~~hFie~t~~e~~~~--~~vl~A~~~~~k 133 (142) +||+.+..+++....|. ...+.|+-. ..+|||+|+|- ...|++|+++++.++...+ ..+-+...+.++ T Consensus 81 G~lr~swk~~~~~k~~~-~~~v~v~N~----~~YA~~VE~GHR~~~gGfV~G~fml~~s~~~~~~~~~~~~e~~l~~~l~ 155 (163) T protein:vir:10 81 GTLQKGWSKSRIEVSGR-TYKQKVYNK----VYYAPHVEYGHKTVNGGFVPGQFFLHKTVEDTKSDMEKRVRDKYDGFMR 155 (163) T ss_pred chhhccceecceeecCC-ceEEEEEec----CCccchhhcceeecCCceeccchhhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 69999999988655442 334667733 35699999994 4579999999999886432 244455666778 Q ss_pred HHHhhcCC Q lcl|NC_018285. 134 KLIRRKGG 141 (142) Q Consensus 134 ~~l~~k~g 141 (142) +++++++. T Consensus 156 k~~~~~~~ 163 (163) T protein:vir:10 156 KVVLGNGK 163 (163) T ss_pred HhhcCCCC Confidence 88888887 No 79 >protein:vir:96012 Length: 133 # NCBI annotation: ORF023 # Family: family:all:589 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239805;genbank:gi:66395471;genbank:GeneID:5132929 Probab=98.60 E-value=5.7e-10 Score=71.24 Aligned_cols=122 Identities=12% Similarity=0.112 Sum_probs=94.3 Q ss_pred cchHHHHHHHHHHHHHH-hcc-ccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCc-ccccccc Q lcl|NC_018285. 3 MVGLDEALEGWLETVAS-IGD-ITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQST-NADGRKN 79 (142) Q Consensus 3 m~~~~~~l~e~~~~l~k-l~~-~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~-~~dg~~~ 79 (142) |.+. .|++|++++|++ +++ ...+...+||.+|++.+.+.|+.+.. +-++|| --=|.+++|++ .++|.. T Consensus 1 m~ev-kGv~eilk~lE~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~----~fkDTG---atidev~~s~p~~~~g~r- 71 (133) T protein:vir:96 1 MRLI-YDTKKLERELEKRLSKRALMRITDRALTEAGEVVLEAIRTNLK----YFRDTG---AEYGEVKLSKPTWENGKR- 71 (133) T ss_pred Cccc-cCHHHHHHHHHHhcCHHHHHHHhhHHHHHHHHHHHHHHHHhhH----HHhhcc---ceeeeEEecCceecCCce- Confidence 5544 799999999985 553 55688999999999999999999986 345565 36688999984 666754 Q ss_pred ceeEeccCCCCce-eEEeecccCc-----cccCCCc--hhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 80 GVATVGWKNNYHA-QNARRLNDGT-----KKYRADH--FVTNVQNDSSVQKKVLLAEKAEYEKLI 136 (142) Q Consensus 80 G~~~VG~~k~~~a-~~A~f~n~GT-----~k~~~~h--Fie~t~~e~~~~~~vl~A~~~~~k~~l 136 (142) ++.|||.-+.+- .|.|.-++|+ .++.|.- =|+++.++++ ..-++-+.++++++| T Consensus 72 -tV~i~W~gp~~R~~iVHLNE~G~ytr~Gk~i~PrG~G~I~~al~~se--~~y~~~vk~el~kll 133 (133) T protein:vir:96 72 -TIRVYWEGEKHRYSIVHLNEKGFYAKDGKFIRPKGMGAIDKALRASR--DKFFKVYAEEVSKLL 133 (133) T ss_pred -EEEEEeecCCCceeeEeeecccceecCCceeccchhhHHHHHHHhhh--HHHHHHHHHHHHHhC Confidence 899999533233 4889999994 3445554 5899998885 568889999999999 No 80 >protein:vir:78335 Length: 133 # NCBI annotation: gp9 # Family: family:all:589 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468648;genbank:gi:157325225;genbank:GeneID:5601681 Probab=98.47 E-value=2.1e-09 Score=68.11 Aligned_cols=123 Identities=16% Similarity=0.116 Sum_probs=94.0 Q ss_pred cchHHHHHHHHHHHHHH-hcc-ccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCc-ccccccc Q lcl|NC_018285. 3 MVGLDEALEGWLETVAS-IGD-ITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQST-NADGRKN 79 (142) Q Consensus 3 m~~~~~~l~e~~~~l~k-l~~-~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~-~~dg~~~ 79 (142) |.=--.|++|++++|++ |++ ...+...+||.+|++.+.+.|+.+.. +-++|| -.-|.|++|++ ..+|.+ T Consensus 1 msvevkGv~eilk~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~----~fkDTG---ati~ev~~s~p~~~~G~r- 72 (133) T protein:vir:78 1 MSVEVTGVEELERQLVSLFGRENLPQLVDPALIAGATLVAKTLKSEFV----QFKDTG---ASIDEINIEKPSYDKGVR- 72 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHHhhc----chhccc---ceeeeEEecCeeeeCCce- Confidence 43333799999999986 665 34789999999999999999999886 345565 46789999995 467765 Q ss_pred ceeEeccCCCCce-eEEeecccCc----cccCCCc--hhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 80 GVATVGWKNNYHA-QNARRLNDGT----KKYRADH--FVTNVQNDSSVQKKVLLAEKAEYEKLI 136 (142) Q Consensus 80 G~~~VG~~k~~~a-~~A~f~n~GT----~k~~~~h--Fie~t~~e~~~~~~vl~A~~~~~k~~l 136 (142) ++.|||.-+.+- .|.|.-|+|+ .++.|.- =|+++.++++ ..-++-+.+++++.| T Consensus 73 -~V~i~W~gp~~R~~iVHLNE~GYtr~Gk~i~PrG~G~i~~a~~~se--~~y~~~vk~el~k~l 133 (133) T protein:vir:78 73 -SIKIDWKGPKDRYKIIHLNEYGYTRNGKKITPAGTGSVARSLRISE--RAYRAIVQKKIGDKL 133 (133) T ss_pred -EEEEEEecCCCceeEEEeeccceecCCCeEccchhhHHHHHHHhhh--HHHHHHHHHHHHhhC Confidence 889999533233 4889999996 3344554 5889988875 567788888899888 No 81 >protein:vir:78077 Length: 141 # NCBI annotation: gp9 # Family: family:all:180 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468793;genbank:gi:157325374;genbank:GeneID:5601839 Probab=98.38 E-value=8.3e-09 Score=64.87 Aligned_cols=115 Identities=13% Similarity=0.039 Sum_probs=80.3 Q ss_pred CccchHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCccccccccc Q lcl|NC_018285. 1 MAMVGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNG 80 (142) Q Consensus 1 m~m~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G 80 (142) |...+|+.-++.+.++|++.... ..+..++...+.+++...+..+|.. | +.|+.||.+.- ...| . T Consensus 1 ~~~~~f~~~~~~~~~~~~k~~~~--~~~~~a~~~~~~~ie~~ak~~~pvd------t---G~L~~SI~~~v-~~~g---~ 65 (141) T protein:vir:78 1 MNEFEFDSNIPKARKLIEKKVLQ--ALEDIGEHMTTELAEGGHGVTSNND------T---GEYAQKSGYKV-RKSS---K 65 (141) T ss_pred CcchhHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHhhhhccccc------c---chhhcceeeee-ecCC---c Confidence 77777777788888888765432 2233346666777777777788842 3 47999998753 2222 3 Q ss_pred eeEeccCCCCceeEEeecccCc--------------------------cccCCCchhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 81 VATVGWKNNYHAQNARRLNDGT--------------------------KKYRADHFVTNVQNDSSVQKKVLLAEKAEYEK 134 (142) Q Consensus 81 ~~~VG~~k~~~a~~A~f~n~GT--------------------------~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~ 134 (142) .+.||.+. =+|-|+|+|| +-||||||+.++.++. +.+|-+.+.+.++. T Consensus 66 ~~~V~~~~----~YA~yVE~GTG~~~~~~~grk~~w~y~~~~g~~~~t~G~~aqpFl~~A~~~~--~~~i~~~i~~~~~~ 139 (141) T protein:vir:78 66 EVIVGNSS----DYAIYYEFGTGEKSERGGGKAGGWFYMDKKGHWHFTRGSQASKRMRYTFRDE--QDKVRVFTERALRG 139 (141) T ss_pred EEEEecCC----CccceeecCCcccccCCCCCcCcceeecCCCeeEeccCCCCchhhhhhHHhh--HHHHHHHHHHHhhc Confidence 56688543 3778889998 4599999999999987 46777777777776 Q ss_pred HH Q lcl|NC_018285. 135 LI 136 (142) Q Consensus 135 ~l 136 (142) += T Consensus 140 l~ 141 (141) T protein:vir:78 140 IN 141 (141) T ss_pred cC Confidence 64 No 82 >protein:vir:96973 Length: 133 # NCBI annotation: ORF034 # Family: family:all:589 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239864;genbank:gi:66395542;genbank:GeneID:5133006 Probab=98.34 E-value=4.8e-09 Score=66.18 Aligned_cols=123 Identities=15% Similarity=0.174 Sum_probs=88.9 Q ss_pred cchHHHHHHHHHHHHHH-hcc-ccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCc-ccccccc Q lcl|NC_018285. 3 MVGLDEALEGWLETVAS-IGD-ITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQST-NADGRKN 79 (142) Q Consensus 3 m~~~~~~l~e~~~~l~k-l~~-~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~-~~dg~~~ 79 (142) |.=--.|++|++++|++ |++ ...+...+||.++++.+.+.|+.+.. +-++|| -.-|.|++|++ ..+|..- T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~----~fkDTG---ati~ev~~s~p~~~~g~~~ 73 (133) T protein:vir:96 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFE----SFKDTG---ASIEEMTKSKPYTKVGSQE 73 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhh----hhhccc---ceeeeEEecCeeeccCCcc Confidence 43333899999999986 665 44789999999999999999999997 345565 47789999995 4677655 Q ss_pred ceeEeccCCCCce-eEEeecccCccc----cCCCc--hhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 80 GVATVGWKNNYHA-QNARRLNDGTKK----YRADH--FVTNVQNDSSVQKKVLLAEKAEYEK 134 (142) Q Consensus 80 G~~~VG~~k~~~a-~~A~f~n~GT~k----~~~~h--Fie~t~~e~~~~~~vl~A~~~~~k~ 134 (142) -++.|||.-+.+- .|.|.-|+|+.+ +.|.- =|+++.++++ ..-++-+.+++++ T Consensus 74 rtV~i~W~gp~~R~~iVHLNE~Gytr~Gk~i~PrG~G~i~~a~~~se--~~y~~~vk~eL~k 133 (133) T protein:vir:96 74 RAVLIEWVGPMNRKNIIHLNEHGYTRDGKKYTPRGFGVIAKTLAASE--RKYREIIKKELAR 133 (133) T ss_pred eeEEEEeecCCCceeEEEeeccceecCCCeEccchhhHHHHHHHhhh--HHHHHHHHHHhcC Confidence 5789999533233 488999999633 33444 4788887774 3445555555554 No 83 >protein:vir:9363 Length: 133 # NCBI annotation: SLT orf 123-like protein # Family: family:all:589 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803341;genbank:gi:29028652;genbank:GeneID:1258087 Probab=98.34 E-value=4.8e-09 Score=66.18 Aligned_cols=123 Identities=15% Similarity=0.174 Sum_probs=88.9 Q ss_pred cchHHHHHHHHHHHHHH-hcc-ccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCc-ccccccc Q lcl|NC_018285. 3 MVGLDEALEGWLETVAS-IGD-ITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQST-NADGRKN 79 (142) Q Consensus 3 m~~~~~~l~e~~~~l~k-l~~-~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~-~~dg~~~ 79 (142) |.=--.|++|++++|++ |++ ...+...+||.++++.+.+.|+.+.. +-++|| -.-|.|++|++ ..+|..- T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~----~fkDTG---ati~ev~~s~p~~~~g~~~ 73 (133) T protein:vir:93 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFE----SFKDTG---ASIEEMTKSKPYTKVGSQE 73 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhh----hhhccc---ceeeeEEecCeeeccCCcc Confidence 43333899999999986 665 44789999999999999999999997 345565 47789999995 4677655 Q ss_pred ceeEeccCCCCce-eEEeecccCccc----cCCCc--hhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 80 GVATVGWKNNYHA-QNARRLNDGTKK----YRADH--FVTNVQNDSSVQKKVLLAEKAEYEK 134 (142) Q Consensus 80 G~~~VG~~k~~~a-~~A~f~n~GT~k----~~~~h--Fie~t~~e~~~~~~vl~A~~~~~k~ 134 (142) -++.|||.-+.+- .|.|.-|+|+.+ +.|.- =|+++.++++ ..-++-+.+++++ T Consensus 74 rtV~i~W~gp~~R~~iVHLNE~Gytr~Gk~i~PrG~G~i~~a~~~se--~~y~~~vk~eL~k 133 (133) T protein:vir:93 74 RAVLIEWVGPMNRKNIIHLNEHGYTRDGKKYTPRGFGVIAKTLAASE--RKYREIIKKELAR 133 (133) T ss_pred eeEEEEeecCCCceeEEEeeccceecCCCeEccchhhHHHHHHHhhh--HHHHHHHHHHhcC Confidence 5789999533233 488999999633 33444 4788887774 3445555555554 No 84 >protein:vir:78644 Length: 133 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429946;genbank:gi:156604000;genbank:GeneID:5525390 Probab=98.34 E-value=4.8e-09 Score=66.18 Aligned_cols=123 Identities=15% Similarity=0.174 Sum_probs=88.9 Q ss_pred cchHHHHHHHHHHHHHH-hcc-ccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCc-ccccccc Q lcl|NC_018285. 3 MVGLDEALEGWLETVAS-IGD-ITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQST-NADGRKN 79 (142) Q Consensus 3 m~~~~~~l~e~~~~l~k-l~~-~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~-~~dg~~~ 79 (142) |.=--.|++|++++|++ |++ ...+...+||.++++.+.+.|+.+.. +-++|| -.-|.|++|++ ..+|..- T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~----~fkDTG---ati~ev~~s~p~~~~g~~~ 73 (133) T protein:vir:78 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFE----SFKDTG---ASIEEMTKSKPYTKVGSQE 73 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhh----hhhccc---ceeeeEEecCeeeccCCcc Confidence 43333899999999986 665 44789999999999999999999997 345565 47789999995 4677655 Q ss_pred ceeEeccCCCCce-eEEeecccCccc----cCCCc--hhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 80 GVATVGWKNNYHA-QNARRLNDGTKK----YRADH--FVTNVQNDSSVQKKVLLAEKAEYEK 134 (142) Q Consensus 80 G~~~VG~~k~~~a-~~A~f~n~GT~k----~~~~h--Fie~t~~e~~~~~~vl~A~~~~~k~ 134 (142) -++.|||.-+.+- .|.|.-|+|+.+ +.|.- =|+++.++++ ..-++-+.+++++ T Consensus 74 rtV~i~W~gp~~R~~iVHLNE~Gytr~Gk~i~PrG~G~i~~a~~~se--~~y~~~vk~eL~k 133 (133) T protein:vir:78 74 RAVLIEWVGPMNRKNIIHLNEHGYTRDGKKYTPRGFGVIAKTLAASE--RKYREIIKKELAR 133 (133) T ss_pred eeEEEEeecCCCceeEEEeeccceecCCCeEccchhhHHHHHHHhhh--HHHHHHHHHHhcC Confidence 5789999533233 488999999633 33444 4788887774 3445555555554 No 85 >protein:vir:94419 Length: 133 # NCBI annotation: ORF028 # Family: family:all:589 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240010;genbank:gi:66395683;genbank:GeneID:5133079 Probab=98.34 E-value=4.8e-09 Score=66.18 Aligned_cols=123 Identities=15% Similarity=0.174 Sum_probs=88.9 Q ss_pred cchHHHHHHHHHHHHHH-hcc-ccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCc-ccccccc Q lcl|NC_018285. 3 MVGLDEALEGWLETVAS-IGD-ITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQST-NADGRKN 79 (142) Q Consensus 3 m~~~~~~l~e~~~~l~k-l~~-~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~-~~dg~~~ 79 (142) |.=--.|++|++++|++ |++ ...+...+||.++++.+.+.|+.+.. +-++|| -.-|.|++|++ ..+|..- T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~----~fkDTG---ati~ev~~s~p~~~~g~~~ 73 (133) T protein:vir:94 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFE----SFKDTG---ASIEEMTKSKPYTKVGSQE 73 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhh----hhhccc---ceeeeEEecCeeeccCCcc Confidence 43333899999999986 665 44789999999999999999999997 345565 47789999995 4677655 Q ss_pred ceeEeccCCCCce-eEEeecccCccc----cCCCc--hhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 80 GVATVGWKNNYHA-QNARRLNDGTKK----YRADH--FVTNVQNDSSVQKKVLLAEKAEYEK 134 (142) Q Consensus 80 G~~~VG~~k~~~a-~~A~f~n~GT~k----~~~~h--Fie~t~~e~~~~~~vl~A~~~~~k~ 134 (142) -++.|||.-+.+- .|.|.-|+|+.+ +.|.- =|+++.++++ ..-++-+.+++++ T Consensus 74 rtV~i~W~gp~~R~~iVHLNE~Gytr~Gk~i~PrG~G~i~~a~~~se--~~y~~~vk~eL~k 133 (133) T protein:vir:94 74 RAVLIEWVGPMNRKNIIHLNEHGYTRDGKKYTPRGFGVIAKTLAASE--RKYREIIKKELAR 133 (133) T ss_pred eeEEEEeecCCCceeEEEeeccceecCCCeEccchhhHHHHHHHhhh--HHHHHHHHHHhcC Confidence 5789999533233 488999999633 33444 4788887774 3445555555554 No 86 >protein:vir:99528 Length: 92 # NCBI annotation: putative major tail protein # Family: family:all:180 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958541;genbank:gi:41179323;genbank:GeneID:2717166 Probab=98.30 E-value=3.3e-09 Score=67.06 Aligned_cols=91 Identities=16% Similarity=0.208 Sum_probs=63.4 Q ss_pred CccchHH-HHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCcccccccc Q lcl|NC_018285. 1 MAMVGLD-EALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKN 79 (142) Q Consensus 1 m~m~~~~-~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~ 79 (142) |+-.++. +|||+|++.|++...+ ...+ ++++.=+.-++...+.++|. +| ++|++||.++-. +|.-. T Consensus 1 Ma~~~i~~~Gld~L~~~L~~~~~~-~~v~-~vv~~~~~~l~~~ak~~ap~------dT---G~lrrSI~~~~~--~~g~~ 67 (92) T protein:vir:99 1 MADYSISWDGLDALDEALANQQNM-NTVK-KVVKKHTANLMTATQQAVPV------DT---GHLKQSAQIQIS--RDGFT 67 (92) T ss_pred CCceeeEeehHHHHHHHHHhhccH-HHHH-HHHHHHHHHHHHHHHHhCCC------Cc---cccceeeeEEee--cCCee Confidence 5554456 8999999999988765 3444 55554445558888999985 24 589999998642 23233 Q ss_pred ceeEeccCCCCceeEEeecccCccccCC Q lcl|NC_018285. 80 GVATVGWKNNYHAQNARRLNDGTKKYRA 107 (142) Q Consensus 80 G~~~VG~~k~~~a~~A~f~n~GT~k~~~ 107 (142) +.+.+|.+ ++-++-|+|+||.+|++ T Consensus 68 ~~v~~~gp---~a~Ya~YvE~GTR~M~A 92 (92) T protein:vir:99 68 GSVTYGGG---LVNYAAYVEFGTRFMDS 92 (92) T ss_pred EEEEeccC---ccccccccccceeecCC Confidence 44555433 13478999999999999 No 87 >protein:vir:93898 Length: 133 # NCBI annotation: ORF028 # Family: family:all:589 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239942;genbank:gi:66395616;genbank:GeneID:5130964 Probab=98.26 E-value=9.5e-09 Score=64.56 Aligned_cols=123 Identities=14% Similarity=0.168 Sum_probs=88.1 Q ss_pred cchHHHHHHHHHHHHHH-hc-cccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCc-ccccccc Q lcl|NC_018285. 3 MVGLDEALEGWLETVAS-IG-DITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQST-NADGRKN 79 (142) Q Consensus 3 m~~~~~~l~e~~~~l~k-l~-~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~-~~dg~~~ 79 (142) |.=--.|++|++++|++ |+ ....+...+||.++++.+.+.|+.+.. +-++|| -.-|.|++|++ ..+|..- T Consensus 1 msvevkGv~eilk~le~k~G~~~~~ri~dkAL~~~g~~v~~~lK~~~~----~fkDTG---ati~ev~~s~p~~~~g~~~ 73 (133) T protein:vir:93 1 MSVEIKGIPEVLKKLESVYGKQSMQAKSDRALNEASEFFIKALKKEFE----SFKDTG---ASIEEMTKSKPYTKVGSQE 73 (133) T ss_pred CeEEEecHHHHHHHHHHhhCHhhhHhhhhHHHHHHHHHHHHHHHhhhh----hhhccc---ceeeeEEecCeeeccCCcc Confidence 43333899999999986 45 356789999999999999999999997 345565 47789999985 4567655 Q ss_pred ceeEeccCCCCce-eEEeecccCccc----cCCCc--hhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 80 GVATVGWKNNYHA-QNARRLNDGTKK----YRADH--FVTNVQNDSSVQKKVLLAEKAEYEK 134 (142) Q Consensus 80 G~~~VG~~k~~~a-~~A~f~n~GT~k----~~~~h--Fie~t~~e~~~~~~vl~A~~~~~k~ 134 (142) -++.|||.-+.+- .|.|.-|+|+.+ +.|.- =|+++.++++ ..-++-+.+++++ T Consensus 74 rtV~i~W~gp~~R~~iVHLNE~Gytr~Gk~i~PrG~G~i~~a~~~se--~~y~~~vk~eL~k 133 (133) T protein:vir:93 74 RAVLIEWVGPMNRKNIIHLNEHGYTRDGKKYTPRGFGVIAKTLAANE--RKYREIIKKELAR 133 (133) T ss_pred eEEEEEeecCCCceeEEEeeccceecCCCeEccchhhHHHHHHHhhh--HHHHHHHHHHhcC Confidence 5789999533233 488999999633 33444 4788887764 3445555555554 No 88 >protein:vir:99101 Length: 142 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1608 # MgeName: Qyrzula # Cross-refs: genbank:acc:YP_655705;genbank:gi:109521783;genbank:GeneID:4157823 Probab=98.09 E-value=7.3e-09 Score=65.19 Aligned_cols=112 Identities=14% Similarity=0.104 Sum_probs=74.1 Q ss_pred CccchHH-HHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCcccccccc Q lcl|NC_018285. 1 MAMVGLD-EALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKN 79 (142) Q Consensus 1 m~m~~~~-~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~ 79 (142) |++.++. +||+..+..+.+.. .+.-.+++.+.+..++...+..+|.. | ++|++||..+.. .++. . T Consensus 1 m~~~~~~~~gl~~~l~~~~~~~---~~~~~~~i~~~a~~v~~~Ak~~aPv~------t---G~Lr~SI~~~~~-~~~~-~ 66 (142) T protein:vir:99 1 MVQVSVRYEGFDYNPVGAAAQV---GPILRRTHSSLTRQIANETRARVPVL------T---GHLGRSVREDPQ-VMVT-P 66 (142) T ss_pred CceeEEEeeecchhHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHhCCcc------c---hhhhcceeeeec-cccc-c Confidence 8877777 78877777776543 35567778888888999999999952 2 589999986532 1221 1 Q ss_pred ceeEeccCCCCceeEEeecccCcc--------------------------c---cCCCchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 80 GVATVGWKNNYHAQNARRLNDGTK--------------------------K---YRADHFVTNVQNDSSVQKKVLLAEKA 130 (142) Q Consensus 80 G~~~VG~~k~~~a~~A~f~n~GT~--------------------------k---~~~~hFie~t~~e~~~~~~vl~A~~~ 130 (142) .++.+|-. +.+.+|.|+|+||. + ++|+||+.++.++.. .+....... T Consensus 67 ~~~~~~v~--~~a~YA~~ve~GT~ph~i~pk~~~al~f~~~g~~~~~k~v~hpG~~a~Pfl~~A~~~~~--~~~~~~~~r 142 (142) T protein:vir:99 67 FHVSGGVT--AHAKYAAAVHEGTRPHVIRAKHAQALHFWWRGREVFVRQVNHPGTRARPYLRNAGEAVV--RRDRRIRVR 142 (142) T ss_pred ceEEEEec--cCccccceeccCCccceeccccCceeeEecCCceeeeeeeecCCCCCCchhHHHHHHHH--hhhhhhccC Confidence 23333322 23678999999994 2 559999999987653 222222222 No 89 >protein:vir:8669 Length: 142 # NCBI annotation: gp27 # Family: family:all:1084 # MgeID: mge:156 # MgeName: Rosebush # Cross-refs: genbank:acc:NP_817788;genbank:gi:29566220;genbank:GeneID:1259476 Probab=98.09 E-value=7.3e-09 Score=65.19 Aligned_cols=112 Identities=14% Similarity=0.104 Sum_probs=74.1 Q ss_pred CccchHH-HHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCcccccccc Q lcl|NC_018285. 1 MAMVGLD-EALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKN 79 (142) Q Consensus 1 m~m~~~~-~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~ 79 (142) |++.++. +||+..+..+.+.. .+.-.+++.+.+..++...+..+|.. | ++|++||..+.. .++. . T Consensus 1 m~~~~~~~~gl~~~l~~~~~~~---~~~~~~~i~~~a~~v~~~Ak~~aPv~------t---G~Lr~SI~~~~~-~~~~-~ 66 (142) T protein:vir:86 1 MVQVSVRYEGFDYNPVGAAAQV---GPILRRTHSSLTRQIANETRARVPVL------T---GHLGRSVREDPQ-VMVT-P 66 (142) T ss_pred CceeEEEeeecchhHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHhCCcc------c---hhhhcceeeeec-cccc-c Confidence 8877777 78877777776543 35567778888888999999999952 2 589999986532 1221 1 Q ss_pred ceeEeccCCCCceeEEeecccCcc--------------------------c---cCCCchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 80 GVATVGWKNNYHAQNARRLNDGTK--------------------------K---YRADHFVTNVQNDSSVQKKVLLAEKA 130 (142) Q Consensus 80 G~~~VG~~k~~~a~~A~f~n~GT~--------------------------k---~~~~hFie~t~~e~~~~~~vl~A~~~ 130 (142) .++.+|-. +.+.+|.|+|+||. + ++|+||+.++.++.. .+....... T Consensus 67 ~~~~~~v~--~~a~YA~~ve~GT~ph~i~pk~~~al~f~~~g~~~~~k~v~hpG~~a~Pfl~~A~~~~~--~~~~~~~~r 142 (142) T protein:vir:86 67 FHVSGGVT--AHAKYAAAVHEGTRPHVIRAKHAQALHFWWRGREVFVRQVNHPGTRARPYLRNAGEAVV--RRDRRIRVR 142 (142) T ss_pred ceEEEEec--cCccccceeccCCccceeccccCceeeEecCCceeeeeeeecCCCCCCchhHHHHHHHH--hhhhhhccC Confidence 23333322 23678999999994 2 559999999987653 222222222 No 90 >protein:vir:97327 Length: 116 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240615;genbank:gi:66396305;genbank:GeneID:5133683 Probab=98.01 E-value=5.5e-08 Score=60.38 Aligned_cols=87 Identities=10% Similarity=0.043 Sum_probs=61.3 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCccccccccceeEeccCCCCceeEEeecccC-- Q lcl|NC_018285. 24 TPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNGVATVGWKNNYHAQNARRLNDG-- 101 (142) Q Consensus 24 ~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G~~~VG~~k~~~a~~A~f~n~G-- 101 (142) ..+.-.+++...+..++..++..+|.. | ++|++||.++-.+ +| ....||-+ .-+|.|+|+| T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~aPv~------T---G~Lr~SI~~~~~~-~~---~~~~V~~~----~~YA~yvE~GTg 63 (116) T protein:vir:97 1 MERWVKRGIAKTTAKIHNTIISLMPVD------T---GYLRESVTMDFKD-GG---FTGVINIG----SEYAIYVNYGTG 63 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcC------c---ccccccceEEeec-Cc---EEEEEecC----CCcccccccCCc Confidence 344455667777888899999999852 3 5899999875321 22 24567743 3478999999 Q ss_pred ---------------------------ccccCCCchhhHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 102 ---------------------------TKKYRADHFVTNVQNDSSVQKKVLLAEK 129 (142) Q Consensus 102 ---------------------------T~k~~~~hFie~t~~e~~~~~~vl~A~~ 129 (142) |..|+|+||+.++.++.+ ..|.+.+. T Consensus 64 ~~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~--~~i~k~i~ 116 (116) T protein:vir:97 64 IYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGR--AFFNKYFS 116 (116) T ss_pred ccccCCCcccccccceeeecCCceeeecCCcCCCcchHHHHHHHH--HHHHHhhC Confidence 888999999999988763 44444444 No 91 >protein:vir:1243 Length: 116 # NCBI annotation: similar to phage Spp1 gp16.1 # Family: family:all:180 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510942;genbank:gi:17426276;genbank:GeneID:927389 Probab=98.01 E-value=5.5e-08 Score=60.38 Aligned_cols=87 Identities=10% Similarity=0.043 Sum_probs=61.3 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCccccccccceeEeccCCCCceeEEeecccC-- Q lcl|NC_018285. 24 TPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNGVATVGWKNNYHAQNARRLNDG-- 101 (142) Q Consensus 24 ~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G~~~VG~~k~~~a~~A~f~n~G-- 101 (142) ..+.-.+++...+..++..++..+|.. | ++|++||.++-.+ +| ....||-+ .-+|.|+|+| T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~aPv~------T---G~Lr~SI~~~~~~-~~---~~~~V~~~----~~YA~yvE~GTg 63 (116) T protein:vir:12 1 MERWVKRGIAKTTAKIHNTIISLMPVD------T---GYLRESVTMDFKD-GG---FTGVINIG----SEYAIYVNYGTG 63 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcC------c---ccccccceEEeec-Cc---EEEEEecC----CCcccccccCCc Confidence 344455667777888899999999852 3 5899999875321 22 24567743 3478999999 Q ss_pred ---------------------------ccccCCCchhhHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 102 ---------------------------TKKYRADHFVTNVQNDSSVQKKVLLAEK 129 (142) Q Consensus 102 ---------------------------T~k~~~~hFie~t~~e~~~~~~vl~A~~ 129 (142) |..|+|+||+.++.++.+ ..|.+.+. T Consensus 64 ~~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~--~~i~k~i~ 116 (116) T protein:vir:12 64 IYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGR--AFFNKYFS 116 (116) T ss_pred ccccCCCcccccccceeeecCCceeeecCCcCCCcchHHHHHHHH--HHHHHhhC Confidence 888999999999988763 44444444 No 92 >protein:vir:966 Length: 123 # NCBI annotation: Orf48 # Family: family:all:970 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076620;genbank:gi:13095728;genbank:GeneID:920248 Probab=98.00 E-value=2.4e-07 Score=56.86 Aligned_cols=117 Identities=17% Similarity=0.197 Sum_probs=83.8 Q ss_pred Ccc-chHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCcccccccc Q lcl|NC_018285. 1 MAM-VGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKN 79 (142) Q Consensus 1 m~m-~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~ 79 (142) |+- +++++.-+++.+.|+.....+.++=.++++--|+-..+.|++.+|. +| |.++.+..+... .+ T Consensus 1 m~~~v~id~L~~~i~~~L~~y~~~v~~~v~~~v~~~a~~~~~~lk~~sP~------~T---G~yaksW~~k~~-----~~ 66 (123) T protein:vir:96 1 MANKISIDDLAKTIESEVRNWTKDVVDDIDDIKKDITKNGVKQLRESSPK------RT---GDYAKNWTSQKL-----KN 66 (123) T ss_pred CCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCc------cc---cccccceeeeec-----CC Confidence 665 4566666667788887776667777788888888889999999984 23 468888877542 13 Q ss_pred ceeEeccCCCCceeEEeecccC-----ccccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_018285. 80 GVATVGWKNNYHAQNARRLNDG-----TKKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIRR 138 (142) Q Consensus 80 G~~~VG~~k~~~a~~A~f~n~G-----T~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~~ 138 (142) |+..| |-+.+...++|.|++| --+.+|.|||.++.+ .+.+.+.+..++.|.+ T Consensus 67 ~~~~v-~~~~~~y~l~HLLE~GHa~r~GGrV~a~phI~paee------~~~~~l~~~i~r~l~~ 123 (123) T protein:vir:96 67 GDQVI-YQKAPTYRLTHLLENGHAKRNGGRVSPKVHIAPVEE------ELVSNYISRVEKRLSQ 123 (123) T ss_pred eeEEE-EEecCCcceEEeeecceeecCCceeCcchhhhHHHH------HHHHHHHHHHHHHhcC Confidence 33334 3333333589999999 455799999988864 4667777888888877 No 93 >protein:vir:95062 Length: 116 # NCBI annotation: ORF044 # Family: family:all:180 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240827;genbank:gi:66394711;genbank:GeneID:5133856 Probab=97.99 E-value=6e-08 Score=60.16 Aligned_cols=87 Identities=10% Similarity=0.043 Sum_probs=60.5 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCccccccccceeEeccCCCCceeEEeecccC-- Q lcl|NC_018285. 24 TPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNGVATVGWKNNYHAQNARRLNDG-- 101 (142) Q Consensus 24 ~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G~~~VG~~k~~~a~~A~f~n~G-- 101 (142) ..+.-.+++...+..++..++..+|.. | ++|++||.+.-.+ +| ....||-+ .-+|.|+|+| T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~apv~------T---G~Lr~SI~~~~~~-~~---~~~~V~~~----~~Ya~yvE~GTg 63 (116) T protein:vir:95 1 MERWVKRGIAKTTAKIHNTIISLMPVD------T---GYLRESVTMDFKD-GG---FTGVINIG----SEYAIYVNYGTG 63 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHhhCCcc------c---cccccceeEEeec-Cc---EEEEEecC----CCccceeecCcc Confidence 344455667777778888899999852 3 5899999875321 22 24567743 2477888999 Q ss_pred ---------------------------ccccCCCchhhHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 102 ---------------------------TKKYRADHFVTNVQNDSSVQKKVLLAEK 129 (142) Q Consensus 102 ---------------------------T~k~~~~hFie~t~~e~~~~~~vl~A~~ 129 (142) |..|+|+||+.++.++.+ ..|.+.+. T Consensus 64 ~~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~--~~i~k~is 116 (116) T protein:vir:95 64 IYATGAGGSRAKNIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGR--AFFNKYFS 116 (116) T ss_pred ccccCCCccccccccceeecCccceeeCCCCCCCcchHHHHHHHH--HHHHHhhC Confidence 778999999999988763 44444444 No 94 >protein:vir:1332 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047931;swissprot:trembl:q9zxa7;genbank:gi:9631149;uniprot:Q9ZXA7;genbank:GeneID:2715891 Probab=97.89 E-value=4.5e-07 Score=55.40 Aligned_cols=134 Identities=15% Similarity=0.183 Sum_probs=98.7 Q ss_pred CccchHH----HHHHHHHHHHHHh-ccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCC--CcccchhcceecCcc Q lcl|NC_018285. 1 MAMVGLD----EALEGWLETVASI-GDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDL--KYGHMADGLSVQSTN 73 (142) Q Consensus 1 m~m~~~~----~~l~e~~~~l~kl-~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~--k~~HlaD~I~~~~~~ 73 (142) |+.-+.- +||-.+..++.++ +.+..++=+.+++..|+|+...++..||.+|..-+.+. ..+-|+-+|.+.++- T Consensus 1 ma~~~~~~vkV~Glr~f~~~mrK~~g~dl~k~lk~a~~~aa~v~~~~ar~~tP~g~~~p~~srr~r~G~L~~Sir~aaT~ 80 (143) T protein:vir:13 1 MAQRSAYTIQVDGLRQFQRNVRALRDKELNKAVREANKASGEVLIPQAKHESPDGHRDPKSSKRYRPGKLDKSIKVTASA 80 (143) T ss_pred CCcccchheehHHHHHHHHHHHHhhCCcchHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccchhhccccccccc Confidence 4444432 6889999999999 65556888899999999999999999999865322221 235799999998753 Q ss_pred ccccccceeEeccCCCCceeEEeecccCccccC--CCchhhHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC Q lcl|NC_018285. 74 ADGRKNGVATVGWKNNYHAQNARRLNDGTKKYR--ADHFVTNVQNDSSVQKKVLLAEKAEYEKLIRRKGGK 142 (142) Q Consensus 74 ~dg~~~G~~~VG~~k~~~a~~A~f~n~GT~k~~--~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~~k~g~ 142 (142) +.+-+-.|= +--=.+|-|+|||+.++. |+-|+-.+...++ .++..-..+.+.++|.+-.|- T Consensus 81 ----raa~VrAGr--~arVPYA~~I~~G~r~r~Is~~rFl~~a~a~te--~~~~r~Ye~~i~~vl~k~l~s 143 (143) T protein:vir:13 81 ----KGAVIKAGS--AARVPYAAAIHFGYRKRNISANRFLYRAMARKS--DVVAATYERRIAAVVEKYLES 143 (143) T ss_pred ----cceeeeecC--cCCCCcccccccCCcccccchhhhhhhhhhccC--HHHHHHHHHHHHHHHHHHhcC Confidence 333455662 211157889999997766 9999988877665 677777777788887777777 No 95 >protein:vir:6246 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813700;swissprot:trembl:q859b7;genbank:gi:29366760;uniprot:Q859B7;genbank:GeneID:1258903 Probab=97.82 E-value=6.8e-07 Score=54.40 Aligned_cols=134 Identities=13% Similarity=0.164 Sum_probs=97.3 Q ss_pred CccchHH----HHHHHHHHHHHHh-ccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCC--CcccchhcceecCcc Q lcl|NC_018285. 1 MAMVGLD----EALEGWLETVASI-GDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDL--KYGHMADGLSVQSTN 73 (142) Q Consensus 1 m~m~~~~----~~l~e~~~~l~kl-~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~--k~~HlaD~I~~~~~~ 73 (142) |+.-+.- +||-++..++.++ +.+..++=+.++++.|+|+...++..||+....-+.+. ..+-|+-+|.+.++- T Consensus 1 ma~~~~~~vrV~Glr~f~~~mrK~~g~dl~k~lk~a~~~aa~v~~~~ar~~tP~g~r~~~~s~~~r~G~L~~Sir~aaT~ 80 (143) T protein:vir:62 1 MAQRSAYTIRVDGLREFQRNVRTLRDKELNKAVREANKASGEVLIPQAKHESPDGKRDAKSSKKYRPGKLDKSIKVTASA 80 (143) T ss_pred CCcccchheehHHHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccCcchhhccccccccc Confidence 4444432 6999999999999 65556888899999999999999999998532111111 125799999998753 Q ss_pred ccccccceeEeccCCCCceeEEeecccCccccC--CCchhhHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC Q lcl|NC_018285. 74 ADGRKNGVATVGWKNNYHAQNARRLNDGTKKYR--ADHFVTNVQNDSSVQKKVLLAEKAEYEKLIRRKGGK 142 (142) Q Consensus 74 ~dg~~~G~~~VG~~k~~~a~~A~f~n~GT~k~~--~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~~k~g~ 142 (142) +.+-+-.|=-+. =.+|-|+|||+..+. |+-|+-.+....+ .++..-..+.+.++|.+-.|- T Consensus 81 ----raa~VrAG~~kr--VPYA~~I~~G~r~r~Isp~rFl~~a~a~te--~~~~r~Ye~~i~~vl~k~l~s 143 (143) T protein:vir:62 81 ----KGAVIKAGSASR--VPYAAAIHFGYRARNISPNRFLFRAMARKS--DVVAATYERRIAAVVEKYLES 143 (143) T ss_pred ----cceeeeeCCcCC--CCcccccccCcccccccchhhhhhhhhccC--HHHHHHHHHHHHHHHHHHhcC Confidence 334455663222 257789999987655 9999988877664 677777777777777777777 No 96 >protein:vir:81067 Length: 119 # NCBI annotation: p12 # Family: family:all:2714 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285682;genbank:gi:156535145;genbank:GeneID:5247112 Probab=97.77 E-value=9.8e-08 Score=59.02 Aligned_cols=88 Identities=20% Similarity=0.323 Sum_probs=60.1 Q ss_pred HHHhcCcCCCCCCCCCCcccchhcceec--C-ccccccccceeEeccCCCCceeEEeecccC------------------ Q lcl|NC_018285. 43 LEEVTREKHYSNKKDLKYGHMADGLSVQ--S-TNADGRKNGVATVGWKNNYHAQNARRLNDG------------------ 101 (142) Q Consensus 43 L~~~tp~~~~~~~~~~k~~HlaD~I~~~--~-~~~dg~~~G~~~VG~~k~~~a~~A~f~n~G------------------ 101 (142) |++..+.. -...+ +-|+++|-+. . .+.||.. +=.|||.++ +|.-.|++|+| T Consensus 1 ~rDeakar--v~~~~---G~Lr~sIY~ay~~~~S~dG~~--~Y~Vswn~r-kAPhghlvE~Ghw~~~~~~~~~dG~w~~~ 72 (119) T protein:vir:81 1 MRESAKAF--VNDET---GKLRSNLYVAYSPEESTNGVQ--TYAVSWRKK-AAPHGHLLEFGHWQTHAAYKGKDGEWYSS 72 (119) T ss_pred CCcccccc--cCCCc---cchhhhheeeeccccCCCCeE--EEEeeccCC-cCCcccccccceeeeeeeeeccCceeeec Confidence 55555533 22333 4699999653 3 4445543 344999754 68899999999 Q ss_pred ------ccccCCCchhhHHHHHHHHHHHHHHHHH----HHHHHHHhhcC Q lcl|NC_018285. 102 ------TKKYRADHFVTNVQNDSSVQKKVLLAEK----AEYEKLIRRKG 140 (142) Q Consensus 102 ------T~k~~~~hFie~t~~e~~~~~~vl~A~~----~~~k~~l~~k~ 140 (142) |+++||+|||.++.+... .++..+|. +.+.+++.++- T Consensus 73 ~~~l~~~~~vPa~pFlRpA~da~~--~~a~~~~~~r~~~rv~Ev~rg~~ 119 (119) T protein:vir:81 73 SVKLVNPKWIPARPFLRPGYDSVA--MQIPDIAKAAGAKKYAELQRGEQ 119 (119) T ss_pred CccccCceecCCCCccchhHHHHH--HHHHHHHHHHHHHHHHHHhccCC Confidence 999999999999988543 33444444 44888887777 No 97 >protein:vir:2688 Length: 123 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075507;genbank:gi:12719436;genbank:GeneID:920156 Probab=97.74 E-value=3.1e-07 Score=56.25 Aligned_cols=113 Identities=14% Similarity=0.196 Sum_probs=79.2 Q ss_pred HHHHHHH-hcc-ccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCc-cccccccceeEeccCCC Q lcl|NC_018285. 13 WLETVAS-IGD-ITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQST-NADGRKNGVATVGWKNN 89 (142) Q Consensus 13 ~~~~l~k-l~~-~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~-~~dg~~~G~~~VG~~k~ 89 (142) |+++|++ |++ ...+...+||.+|++.+.+.|+.+.. +-++|| --=|.|++|++ ..+|..--++.|||.-+ T Consensus 1 ilk~lE~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~----~fkDTG---atidev~~s~p~~~~g~~~rtV~i~W~gp 73 (123) T protein:vir:26 1 MLKKLESVYGKQSMQAKSDRALNEASEFFIKALKKEFE----SFKDTG---ASIEEMTKSKPYTKVGSQERAVLIEWVGP 73 (123) T ss_pred ChhhHHHhcCHHHHHHhhhHHHHHHHHHHHHHHHHhhH----Hhhhcc---ceeeeEEecCeeeccCCccceEEEEeecC Confidence 7777764 553 45688999999999999999999986 345565 36688999985 45666445789999533 Q ss_pred Cce-eEEeecccCccc----cCCCc--hhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 90 YHA-QNARRLNDGTKK----YRADH--FVTNVQNDSSVQKKVLLAEKAEYEK 134 (142) Q Consensus 90 ~~a-~~A~f~n~GT~k----~~~~h--Fie~t~~e~~~~~~vl~A~~~~~k~ 134 (142) .+- .|.|.-|+|+.+ +.|.- =|+++.++++ ..-++-+.+++++ T Consensus 74 ~~R~~iVHLNE~GYtr~Gk~i~PRG~G~i~~a~~~se--~~y~~~vk~eL~k 123 (123) T protein:vir:26 74 MNRKNIIHLNEHGYTRDGKKYTPRGFGVIAKTLAANE--RKYREIIKKELAR 123 (123) T ss_pred CCceeeEeeeccceecCCCeEccchhhHHHHHHHhhh--HHHHHHHHHHhcC Confidence 233 488999999633 33444 4788887774 3445555555554 No 98 >protein:vir:10367 Length: 119 # NCBI annotation: conserved phage protein # Family: family:all:2714 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858959;genbank:gi:32128424;genbank:GeneID:2648366 Probab=97.73 E-value=1.2e-07 Score=58.44 Aligned_cols=88 Identities=20% Similarity=0.328 Sum_probs=60.0 Q ss_pred HHHhcCcCCCCCCCCCCcccchhcceec--C-ccccccccceeEeccCCCCceeEEeecccC------------------ Q lcl|NC_018285. 43 LEEVTREKHYSNKKDLKYGHMADGLSVQ--S-TNADGRKNGVATVGWKNNYHAQNARRLNDG------------------ 101 (142) Q Consensus 43 L~~~tp~~~~~~~~~~k~~HlaD~I~~~--~-~~~dg~~~G~~~VG~~k~~~a~~A~f~n~G------------------ 101 (142) |++..+.. -...+ +-|+++|-+. . .+.||.. +=.|||.++ +|.-.|++|+| T Consensus 1 ~rDeakar--v~~~~---G~Lr~sIY~ay~~~~S~dG~~--~Y~Vswn~r-kAPhghlvE~Ghw~~~~~~~~~dG~w~~~ 72 (119) T protein:vir:10 1 MRESAKAF--VNDET---GKLRSNLYVAYSTEESTNGVQ--TYAVSWRKK-AAPHGHLLEFGHWQTHAAYKGKDGEWYSS 72 (119) T ss_pred CCcccccc--cCCCc---cchhhhheeeeccccCCCCEE--EEEeecCCC-cCCcccccccceeeeeeeeeccCceeeec Confidence 55555533 22333 4699999653 3 4445543 344999754 68999999999 Q ss_pred ------ccccCCCchhhHHHHHHHHHHHHHHHHH----HHHHHHHhhcC Q lcl|NC_018285. 102 ------TKKYRADHFVTNVQNDSSVQKKVLLAEK----AEYEKLIRRKG 140 (142) Q Consensus 102 ------T~k~~~~hFie~t~~e~~~~~~vl~A~~----~~~k~~l~~k~ 140 (142) |+++||+|||.++.+... .++..+|. +.+.+++.++- T Consensus 73 ~~~l~~~~~vPa~pFlRpA~da~~--~~a~~~~~~r~~~rv~Ev~rg~~ 119 (119) T protein:vir:10 73 SVKLVNPKWIPARPFLRPGYDSVA--MQIPDIAKAAGAKKYAELQRGEQ 119 (119) T ss_pred CccccCceecCCCCccchhHHHHH--HHHHHHHHHHHHHHHHHHhccCC Confidence 899999999999988643 33444444 44888887777 No 99 >protein:vir:9879 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:2718 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795641;genbank:gi:28876400;genbank:GeneID:1257931 Probab=97.59 E-value=6.3e-07 Score=54.56 Aligned_cols=116 Identities=10% Similarity=0.129 Sum_probs=74.6 Q ss_pred HHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHH-hcCcC-CCCCCCCCCcccchhcceecCccccccccceeEe Q lcl|NC_018285. 7 DEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEE-VTREK-HYSNKKDLKYGHMADGLSVQSTNADGRKNGVATV 84 (142) Q Consensus 7 ~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~-~tp~~-~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G~~~V 84 (142) =.|++.|.+.|++... ...++-+-+.||+......+. .+|.. +...++| ++|+-||.++-.+ +. .++.| T Consensus 1 i~G~~~L~~~Lk~~s~--~dvk~VVkkN~ael~~r~q~~~~~pv~~~~k~~dT---G~lkRSi~l~~~~--~g--~~~~v 71 (127) T protein:vir:98 1 MTGMPALEVKLRSMSE--KRWDRVANKNLTEMFNRAARPPGTPIGKNTKRHKS---GELLRSRRLKKVN--SS--KDVIT 71 (127) T ss_pred CcChHHHHHHHHHhhH--HHHHHHHhhhhHHHHHHHHhccCCceeccccccCc---ccceeeeEEEEec--CC--ceEEe Confidence 3578888888887632 235555556888877766654 35542 2223344 5899999986542 22 25668 Q ss_pred ccCCCCceeEEeecccCcccc---------CCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_018285. 85 GWKNNYHAQNARRLNDGTKKY---------RADHFVTNVQNDSSVQKKVLLAEKAEYEKLIRR 138 (142) Q Consensus 85 G~~k~~~a~~A~f~n~GT~k~---------~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~~ 138 (142) |+.-.+ .=+|-++|+||..| ++|||+-++.++. ++|| .+.+++++++ T Consensus 72 gp~g~t-~dYapyvEyGTR~m~~~~~~gf~~aqp~l~paf~~Q---k~iF---~~DL~~l~k~ 127 (127) T protein:vir:98 72 GNFGYI-KDYAPHVEYGHRIVRNGKQVGYANGTKYLFNNVKKQ---REIY---RQDMLNELRR 127 (127) T ss_pred ccCccc-ccccceeecceeeeecccccccccCccccccchHHH---hHHH---HHHHHHHhcC Confidence 864111 23568899999955 4999999998764 4555 4556666666 No 100 >protein:vir:4096 Length: 140 # NCBI annotation: Gp9 protein # Family: family:all:28682 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510990;swissprot:trembl:q8w600;genbank:gi:17488512;uniprot:Q8W600;genbank:GeneID:1260318 Probab=97.43 E-value=2e-06 Score=51.87 Aligned_cols=131 Identities=13% Similarity=0.106 Sum_probs=89.3 Q ss_pred Ccc-chHH-HHHHHHHHHHHHhccccHHHHHHHHH-HHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCcccccc Q lcl|NC_018285. 1 MAM-VGLD-EALEGWLETVASIGDITPAEQAKITT-AGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGR 77 (142) Q Consensus 1 m~m-~~~~-~~l~e~~~~l~kl~~~~~~~~~ka~~-AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~ 77 (142) |+- -+++ ..++.|++.++++-..+++.-++.++ .|+.+..+.+....|.|......--+..|.+++=.+... T Consensus 1 m~~~~sld~s~~e~L~~~i~r~P~ksE~~IN~~L~tkg~~~~~~~I~~~iPvS~~~k~~~RnK~HAK~s~pl~~~----- 75 (140) T protein:vir:40 1 MCAKWSLEFSDVERLSNLISQIPNKSEAIINKTLETKAVPLVKLNIEKRINLSKNWKGQLLNKNHAQSSGPFNVK----- 75 (140) T ss_pred CCcceecchhhHHHHHHHHHhccchHHHHHHHHHHhhhhHHHHhhhhhccCcCccchhhhccccchhhhhhhhhh----- Confidence 321 1233 46999999999998889999999998 799999999999999764322111223588876554332 Q ss_pred ccceeEeccC---CCCceeEEeecccC--ccccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC Q lcl|NC_018285. 78 KNGVATVGWK---NNYHAQNARRLNDG--TKKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIRRKGGK 142 (142) Q Consensus 78 ~~G~~~VG~~---k~~~a~~A~f~n~G--T~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~~k~g~ 142 (142) .-..||. +...+|+ -||+-| |++-.+|+|.++...+.. +.|++...+++.+.++.--|- T Consensus 76 ---~~NLgf~i~~k~kf~YL-vfPD~G~G~sn~~~q~FmerGl~~~t--~~i~E~L~~~l~k~in~~Lgg 139 (140) T protein:vir:40 76 ---MGNLGFELLTKPKFNYL-IFPDQGIGKHNKTKQDFMQLGVEESS--QEIVEMLEQAVFKEINDTLGG 139 (140) T ss_pred ---hhhcceeEeecCccccc-ccccccCCCCCcchHHHHHhccccch--hHHHHHHHHHHHHHHHHhhcC Confidence 2235553 4445676 899987 888899999999988763 566666666555555444333 No 101 >protein:vir:106041 Length: 137 # NCBI annotation: gp23 # Family: family:all:1084 # MgeID: mge:1505 # MgeName: Cooper # Cross-refs: genbank:acc:YP_654920;genbank:gi:109392376;genbank:GeneID:4157069 Probab=97.01 E-value=1.6e-06 Score=52.29 Aligned_cols=106 Identities=9% Similarity=0.084 Sum_probs=63.0 Q ss_pred Ccc-chHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCcccccccc Q lcl|NC_018285. 1 MAM-VGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKN 79 (142) Q Consensus 1 m~m-~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~ 79 (142) |-| +.++--+..+++++...+ ++++...|..++...+.++|.. | ++|++||.....+. +... T Consensus 1 m~~s~~i~i~~~~l~~~v~~~~-------k~~l~~~a~~i~~~ak~~aPv~------t---G~Lr~SI~~~~~~~-~~~~ 63 (137) T protein:vir:10 1 MPVTARIHINEPELERQTGAIF-------RGKHRSITRRIATQARADVPVR------T---GNLGRGIQEMPQTY-RPFH 63 (137) T ss_pred CCeeEEEeeCHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHhCCcc------c---chhhcCceeeeecc-ccce Confidence 433 234433444555554443 2346666777788888999852 2 69999999765332 2122 Q ss_pred ceeEeccCCCCceeEEeecccCcc--------------------------c---cCCCchhhHHHHHHHHHH-HHHHH Q lcl|NC_018285. 80 GVATVGWKNNYHAQNARRLNDGTK--------------------------K---YRADHFVTNVQNDSSVQK-KVLLA 127 (142) Q Consensus 80 G~~~VG~~k~~~a~~A~f~n~GT~--------------------------k---~~~~hFie~t~~e~~~~~-~vl~A 127 (142) .+..||.. +-+|.|+++||. + ++|+||++++.++..++. .|--. T Consensus 64 ~~~~v~~~----~~YA~~ve~GT~ph~I~pk~~k~l~f~~~G~~v~~k~v~hpG~~a~Pfl~~A~~~~~~~~~ri~~~ 137 (137) T protein:vir:10 64 VGGGVEDN----VDYAAPVHEGSRPHRITARHANALHFFWHGREVFRKSVWHPGVRPRPFLRNAARRVVAADPDIHMT 137 (137) T ss_pred EEEEEecC----CCceeeeeecCCCceeecccCceeeeeeCCceEEeeeeecCCCCCCchHHHHHHHHhhccccccCC Confidence 24557743 357899999983 2 349999999987642221 11111 No 102 >protein:vir:102338 Length: 116 # NCBI annotation: hypothetical protein # Family: family:all:26573 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529563;genbank:gi:90592648;genbank:GeneID:3974470 Probab=97.01 E-value=1.2e-05 Score=47.51 Aligned_cols=97 Identities=7% Similarity=0.008 Sum_probs=66.1 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCccccccccceeEeccCCCCceeEEeecccCcc Q lcl|NC_018285. 24 TPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNGVATVGWKNNYHAQNARRLNDGTK 103 (142) Q Consensus 24 ~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G~~~VG~~k~~~a~~A~f~n~GT~ 103 (142) ..+.-.++++.-|..+....++.||.+. .++ +||+.+..++..+..+. +|+ +..-+|+|+|+|-. T Consensus 1 l~~~~~~~~~~~a~~l~~~vk~rTPv~~---~d~---G~LR~sW~~g~v~k~~~-----~v~----N~~eYA~~VE~GHR 65 (116) T protein:vir:10 1 MSKNLRRAKNNIGNKLLRKVKPKTPVAK---IDG---GTARKSWKYKELNLFDG-----VVS----NNVEYIHHLEYGHR 65 (116) T ss_pred CchHHHHHHHHHHHHHHHHHHhhCCCCc---CCC---cccccCceeeeeeccCc-----eee----cCCcccccccCCce Confidence 1233334455556666677788999642 222 59999999987543322 254 23578999999932 Q ss_pred -------------------ccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018285. 104 -------------------KYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIR 137 (142) Q Consensus 104 -------------------k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~ 137 (142) ..+++||++++..+.+ ..+-....+.+.++|+ T Consensus 66 q~~g~g~~~~~~gkrlk~~~V~G~fml~~s~~e~~--~~~~~~~~~~~~~~l~ 116 (116) T protein:vir:10 66 TRQGTGTSENYRPKPNGISFVPGVFMLARSVDEMS--SIIDDELNQIIIDFWN 116 (116) T ss_pred eeCCcceecccccccccCCccCceehHHHHHHHHH--HHHHHHHHHHHHHhcC Confidence 5688899999988764 4566777778888888 No 103 >protein:vir:95372 Length: 124 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764480;genbank:gi:115334634;genbank:GeneID:5179259 Probab=97.00 E-value=2.9e-05 Score=45.42 Aligned_cols=119 Identities=13% Similarity=0.097 Sum_probs=83.0 Q ss_pred CccchHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCccccccccc Q lcl|NC_018285. 1 MAMVGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNG 80 (142) Q Consensus 1 m~m~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G 80 (142) |+-..+++.-+++.+.|+.-+..+.++-.++++.-|+-..+.|++.+|... -+.||+ .+.+..+... .++ T Consensus 1 M~~i~id~La~~I~~~L~~Ys~~v~~~v~~~v~~vak~a~~~lkk~i~~ts--pkrTG~---YaK~W~~kk~-----~e~ 70 (124) T protein:vir:95 1 MAKIKIGRLADEITSQLRKYSQVIADDVEQIMDDVTKEAVGRLKSKIQEVG--LVQTGD---YMRGWTRKRV-----PNG 70 (124) T ss_pred CccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHhcC--cccccc---hhccceeeee-----cCc Confidence 777777777778888888877777788888888888888888877655421 134554 6677776543 233 Q ss_pred eeEeccCCCCceeEEeecccCccc-----cCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_018285. 81 VATVGWKNNYHAQNARRLNDGTKK-----YRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIRR 138 (142) Q Consensus 81 ~~~VG~~k~~~a~~A~f~n~GT~k-----~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~~ 138 (142) + |=|. +++..++|.|++|--+ .++.|+|.++.+. +.+.+.+..+++|.. T Consensus 71 ~--~V~n-k~~yqLtHLLE~GHAkr~GGRV~a~pHI~paee~------~~~~l~~~i~~~l~~ 124 (124) T protein:vir:95 71 W--VIHN-KTEYRLAHLLEYGHATVDGGRVPGTPHIRPIEDW------LEKEFEDRVEKAIKQ 124 (124) T ss_pred e--eEEE-cCCCceeeeeecceeccCCcccCCccchhHHHHH------HHHHHHHHHHHHhcC Confidence 3 3344 3455699999999644 6889999887543 556677777778777 No 104 >protein:vir:102441 Length: 137 # NCBI annotation: gp26 # Family: family:all:1084 # MgeID: mge:1618 # MgeName: Pipefish # Cross-refs: genbank:acc:YP_655303;genbank:gi:109521866;genbank:GeneID:4157756 Probab=96.87 E-value=9.5e-06 Score=48.12 Aligned_cols=105 Identities=14% Similarity=0.110 Sum_probs=57.8 Q ss_pred CccchHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCccccccc-c Q lcl|NC_018285. 1 MAMVGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRK-N 79 (142) Q Consensus 1 m~m~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~-~ 79 (142) -|-+.++-..-.+.+++... -+.++++.+..++...+.++|.. | +||++||.++.. .++.. . T Consensus 2 ~~~~~~~~~~~~~~~~~~~v-------~r~~l~~~a~~v~~~Ak~~aPv~------t---G~Lr~SI~~~~~-~~~~~~~ 64 (137) T protein:vir:10 2 TVTARYERNPVGEARQFQVI-------ARRRLSRITRGTANQARADVPVK------T---GNLGRSIREDPI-VVAGPLR 64 (137) T ss_pred eeEEEeccCchhHHHHHHHH-------HHHHHHHHHHHHHHHHHhcCCcc------c---hhhhcCceeeee-eccccce Confidence 11111111111122233222 23356667777888888999852 2 589999987542 22221 1 Q ss_pred ceeEeccCCCCceeEEeecccCccc------------------------------cCCCchhhHHHHHHHHHHHHHH Q lcl|NC_018285. 80 GVATVGWKNNYHAQNARRLNDGTKK------------------------------YRADHFVTNVQNDSSVQKKVLL 126 (142) Q Consensus 80 G~~~VG~~k~~~a~~A~f~n~GT~k------------------------------~~~~hFie~t~~e~~~~~~vl~ 126 (142) .+..||-+ ..+|.|+++||.- ++|+||+.++.++.+.++-... T Consensus 65 ~~~~V~~~----~~YA~~ve~GT~ph~I~Pk~~k~~l~~~~~g~~vf~k~V~hPG~~a~PfL~~A~~~~~~~~~~~~ 137 (137) T protein:vir:10 65 LDSGVTAH----ADYARYVHDGTRAHVIRPRRPGGVLRFTVGGRVVYARRVNHPGTRARPFLRNAAERVVARETATS 137 (137) T ss_pred EEEEecCC----CccceeeecCCCCceeeccccceeeeEeeCCeeEecceeecCCCCCCchHHHHHHHhhhhhcccC Confidence 13446532 4567888888741 4689999999988764432222 No 105 >protein:vir:80116 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425608;genbank:gi:155042941;genbank:GeneID:5469542 Probab=96.81 E-value=4.5e-05 Score=44.40 Aligned_cols=122 Identities=14% Similarity=0.108 Sum_probs=82.1 Q ss_pred CccchHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCccccccccc Q lcl|NC_018285. 1 MAMVGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNG 80 (142) Q Consensus 1 m~m~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G 80 (142) |+-.++++.-+++.+.|+..+..+...-.+++..-|+-..+.|+...++.. -+.||+ .+.+.++... .++ T Consensus 1 M~~i~id~La~~I~~~L~~y~~~v~~~v~~~v~evak~a~~~lkk~i~~ts--PkrTG~---YaK~W~~k~~-----~~~ 70 (127) T protein:vir:80 1 MANIKIDRLGDEITRQLKRYSQVIAGDLEQIMDDVSKEAVDRLKAKIEEEG--LVQTGD---YKRGWTRKRT-----PGG 70 (127) T ss_pred CccccHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcC--cccccc---ccccceeeec-----cCc Confidence 777777777788888888888778888888887777777777776655431 123444 5666665432 122 Q ss_pred eeEeccCCCCceeEEeecccCccc-----cCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_018285. 81 VATVGWKNNYHAQNARRLNDGTKK-----YRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIRRKGG 141 (142) Q Consensus 81 ~~~VG~~k~~~a~~A~f~n~GT~k-----~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~~k~g 141 (142) + |=|. +++..++|.|++|--+ .++.|+|.++.+ .+.+.+.+..+++|..-|- T Consensus 71 ~--~v~n-k~~yqLtHLLE~GHAkr~GGRV~a~pHI~paee------~~~~~l~~~i~~~l~~~~~ 127 (127) T protein:vir:80 71 W--VIHN-KTEYRLAHLLEYGHATVDGGRVPETPHIRPVED------WLEKEFEDRVERAIKNESR 127 (127) T ss_pred e--eEee-cCCcceeehhhcceeccCCcccCCccchhhHHH------HHHHHHHHHHHHHhcCCCC Confidence 2 3344 3444699999999644 688999988754 3556777788888765444 No 106 >protein:vir:104347 Length: 145 # NCBI annotation: conserved phage-related protein # Family: family:all:448 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398975;genbank:gi:81343959;genbank:GeneID:3778879 Probab=96.81 E-value=1.6e-05 Score=46.86 Aligned_cols=118 Identities=11% Similarity=0.112 Sum_probs=71.7 Q ss_pred Cccc-----hHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccch--hccee---c Q lcl|NC_018285. 1 MAMV-----GLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMA--DGLSV---Q 70 (142) Q Consensus 1 m~m~-----~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~Hla--D~I~~---~ 70 (142) ||-. +|...+.+|.++++.-. ..+++..|.-+...|...+|..--.-|. -|+. ++... . T Consensus 1 ~~~~m~~~~sF~~~i~~~~~~ve~~~-------~~v~r~~a~~i~~~vv~~sPVdTGr~Ra----nw~vs~~~~~~~~~~ 69 (145) T protein:vir:10 1 MARNIGSVVTFEKSIADWIDRAEDGF-------GIVVSNTVIKTANAIVDLSPVDTGRFKA----NWQISANSPAQQSLN 69 (145) T ss_pred CCCcccchhccccCHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHhCCccchhhcc----ccceeeccccccccc Confidence 6644 57778888988886643 3346666666777777788853111000 0211 11111 1 Q ss_pred Ccccccc----------------ccceeE-eccCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 71 STNADGR----------------KNGVAT-VGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYE 133 (142) Q Consensus 71 ~~~~dg~----------------~~G~~~-VG~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k 133 (142) .++.+|. +.|.+. ++ +..-+|.+||+|++.|.|..|++-+..+- .++++....+.| T Consensus 70 ~~d~~G~~t~~~~~~~~~~i~~~k~g~~iyi~----Nn~pYA~~LEyG~S~QAP~G~v~~~~~~~---~~~v~~~~~e~k 142 (145) T protein:vir:10 70 EYDQTGGQTKTYLARQARAVANSKATSVIYIT----NRLDYAADLEYGASNQAPAGVLGVVQARL---GRYFQEAVEEAR 142 (145) T ss_pred ccCCCCccchhhHHHHHHHhhcccccceEEEe----eCchhhhHhhccccCCCcchHHHHHHHHH---HHHHHHHHHHhh Confidence 1111121 111111 12 23468899999999999999999998764 678888889999 Q ss_pred HHH Q lcl|NC_018285. 134 KLI 136 (142) Q Consensus 134 ~~l 136 (142) +.| T Consensus 143 ~~~ 145 (145) T protein:vir:10 143 RAI 145 (145) T ss_pred ccC Confidence 998 No 107 >protein:vir:107851 Length: 175 # NCBI annotation: gp31 # Family: family:all:274 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024704;genbank:gi:48696941;genbank:GeneID:2845939 Probab=96.79 E-value=1.8e-05 Score=46.64 Aligned_cols=128 Identities=15% Similarity=0.121 Sum_probs=76.5 Q ss_pred Cc-cchHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhc-----CcCCCC--------------------- Q lcl|NC_018285. 1 MA-MVGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVT-----REKHYS--------------------- 53 (142) Q Consensus 1 m~-m~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~t-----p~~~~~--------------------- 53 (142) |. |.+++--.+++...|+.|+... ...+.++..-|+.+...-+++- |...+- T Consensus 1 Ms~~i~i~~~~~~l~~~L~~l~~~~-~d~~~l~~~Ig~~l~~~t~~rF~~e~~Pdw~p~~p~t~~~r~~~g~~~~k~~~~ 79 (175) T protein:vir:10 1 MSDFVNFQIDDSALRTRLLQLEQAG-HQKAGAMRKIAQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKNGEL 79 (175) T ss_pred CceeEEEEecHHHHHHHHHHHHHHh-ccHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchhhhhhhhcccccchhhhhh Confidence 32 2233211245566666655432 2345667776766666555432 221110 Q ss_pred ------CCCCC----CcccchhcceecCccccccccceeEeccCCCCceeEEeecccCcc-------ccCCCchhhHHHH Q lcl|NC_018285. 54 ------NKKDL----KYGHMADGLSVQSTNADGRKNGVATVGWKNNYHAQNARRLNDGTK-------KYRADHFVTNVQN 116 (142) Q Consensus 54 ------~~~~~----k~~HlaD~I~~~~~~~dg~~~G~~~VG~~k~~~a~~A~f~n~GT~-------k~~~~hFie~t~~ 116 (142) .+..+ ..+.|+++|.+.. .+.++.||.+. -+|.+-+||+. ++|+.||+-=+.+ T Consensus 80 ~~~~~~~~~~~~~L~~tG~L~~Si~~~~------~~~~v~vGtn~----~YAaiHqfGg~~~~~~~v~iPaRpfLG~s~~ 149 (175) T protein:vir:10 80 TAAASRRKAGLMILQDSGQMAASVSTDH------DDNSAVIGSNK----EYAAIHQFGGQAGRGLKVTIPARPWLPVTAD 149 (175) T ss_pred hhhhhhhccCCCcceechhhhhhhheee------cCCEEEEecCh----hhhhhhhcccccCCCCccccCCccccCCCcc Confidence 00111 1357999999764 23378899653 35788889977 8999999953322 Q ss_pred ---HHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_018285. 117 ---DSSVQKKVLLAEKAEYEKLIRRK 139 (142) Q Consensus 117 ---e~~~~~~vl~A~~~~~k~~l~~k 139 (142) +.+..++|+....+-+.+.+.++ T Consensus 150 d~~~~e~~~~Il~~~~~~l~~~~~~~ 175 (175) T protein:vir:10 150 GELQPEAVEPVLNTILRHLMDAANRR 175 (175) T ss_pred cccchHHHHHHHHHHHHHHHHHhccC Confidence 22334789999999999999999 No 108 >protein:vir:79091 Length: 175 # NCBI annotation: gp5, phage virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111205;genbank:gi:134288802;genbank:GeneID:4960765 Probab=96.70 E-value=2.4e-05 Score=45.89 Aligned_cols=128 Identities=13% Similarity=0.106 Sum_probs=77.3 Q ss_pred Cc-cchHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhc-----CcCCC---------------------- Q lcl|NC_018285. 1 MA-MVGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVT-----REKHY---------------------- 52 (142) Q Consensus 1 m~-m~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~t-----p~~~~---------------------- 52 (142) |. |.+++--.+++...|..|.... ...+.+++.-|+.+....+++- |...+ T Consensus 1 Ms~~i~i~~d~~~~~~~L~~l~~~~-~d~~~lm~~Ig~~l~~~t~~rF~~~~~PdW~pls~~t~~~r~~~~~~~~~~~~~ 79 (175) T protein:vir:79 1 MSDFVNFQIDDSALRTRLLQLEQAG-HQKADAMRKITQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKNGEL 79 (175) T ss_pred CceEEEEEechHHHHHHHHHHHHHh-cCHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCChHHHHhhccccccccccccc Confidence 33 2222211144555555554332 3445677777777766555532 21100 Q ss_pred -----CCCCCC----CcccchhcceecCccccccccceeEeccCCCCceeEEeecccCcc-------ccCCCchhhHHHH Q lcl|NC_018285. 53 -----SNKKDL----KYGHMADGLSVQSTNADGRKNGVATVGWKNNYHAQNARRLNDGTK-------KYRADHFVTNVQN 116 (142) Q Consensus 53 -----~~~~~~----k~~HlaD~I~~~~~~~dg~~~G~~~VG~~k~~~a~~A~f~n~GT~-------k~~~~hFie~t~~ 116 (142) ..+..+ ..++|+++|.+... +.++.||.+. -||.+-+||+. .+|+.||+-=+.+ T Consensus 80 ~~~~~~~~~~~~~L~~tG~L~~Si~~~~~------~~~v~vGtn~----~YAaiHqfGg~~~~~~~v~IPARPfLG~s~~ 149 (175) T protein:vir:79 80 TAAASRRKAGLMILQDSGQMAASTATDSG------EDYSVIGSNK----EYAAIQHFGGQAGRGLKVTIPGRAWLPVTAD 149 (175) T ss_pred hhhHhhhccCCCcceechhhhhhhhheec------CCEEEEecCc----chhhHhhcccccCCCcccccCcccccCCCcc Confidence 000011 14689999998642 3378899653 46788899975 7999999953332 Q ss_pred ---HHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_018285. 117 ---DSSVQKKVLLAEKAEYEKLIRRK 139 (142) Q Consensus 117 ---e~~~~~~vl~A~~~~~k~~l~~k 139 (142) +.++.++|++...+-+++.+.++ T Consensus 150 de~~~~~~~~I~~~i~~~l~~a~~~~ 175 (175) T protein:vir:79 150 GELQPEAVEPVLNTILRHLMDAANRR 175 (175) T ss_pred cchhHHHHHHHHHHHHHHHHHHhccC Confidence 23445789999999999999999 No 109 >protein:vir:103280 Length: 142 # NCBI annotation: phage-related hypothetical protein # Family: family:all:448 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277459;genbank:gi:71834102;genbank:GeneID:3562391 Probab=96.67 E-value=2.9e-05 Score=45.48 Aligned_cols=113 Identities=13% Similarity=0.177 Sum_probs=70.0 Q ss_pred Cccc--hHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceec-------- Q lcl|NC_018285. 1 MAMV--GLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQ-------- 70 (142) Q Consensus 1 m~m~--~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~-------- 70 (142) ||-. .|...+++|.++++.-. ..+++..|.-+...|...+|.. || -++-|..+| T Consensus 1 Ma~~~~sf~~~i~~~~~~ve~~~-------~~v~r~~a~~i~~~vv~~sPVd------TG---r~R~nw~vs~~~~~~~~ 64 (142) T protein:vir:10 1 MANDVVSFRNSINAWIDGVTEGV-------ELIVEGTLTKATKDIVKLSPVD------TG---RFRGNWQATGNSPAAQS 64 (142) T ss_pred CccchhhhhccHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHhCccc------ch---hhcccceeeecCccccc Confidence 6554 57778889988886543 3446666666666667788853 11 122222221 Q ss_pred --Cccccccc----------------cce-eEeccCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 71 --STNADGRK----------------NGV-ATVGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAE 131 (142) Q Consensus 71 --~~~~dg~~----------------~G~-~~VG~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~ 131 (142) .++..|.. .|. .-++ +..-+|.+||+|++.|.|..|++-+..+- .++++.-..+ T Consensus 65 ~~~~d~~G~~t~~~~~~~~~~i~~~~~g~~iyi~----Nn~pYA~~LEyG~S~QAP~G~v~~a~q~~---~~~v~~a~~e 137 (142) T protein:vir:10 65 LNNYDPDGNETRNSLRRQIYALARDANTNVIYIS----NRLDYAQGLEFGSSNQAPSGVLGVVQKRL---GRYFAEAVQE 137 (142) T ss_pred ccCcCCCCccchhhHHHHHHHhhhccccceEEEe----eCcchhhhhhccccCCCcchHHHHHHHHH---HHHHHHHHHH Confidence 01111111 011 1122 23467899999999999999999998764 6788888888 Q ss_pred HHHHH Q lcl|NC_018285. 132 YEKLI 136 (142) Q Consensus 132 ~k~~l 136 (142) .|+.| T Consensus 138 ~~~~~ 142 (142) T protein:vir:10 138 AKRAL 142 (142) T ss_pred hhccC Confidence 88888 No 110 >protein:vir:107545 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1481 # MgeName: PG1 # Cross-refs: genbank:acc:NP_943803;genbank:gi:38638428;genbank:GeneID:2657225 Probab=96.56 E-value=2.6e-06 Score=51.19 Aligned_cols=108 Identities=11% Similarity=0.081 Sum_probs=58.4 Q ss_pred cchHHH--HHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCccccccccc Q lcl|NC_018285. 3 MVGLDE--ALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNG 80 (142) Q Consensus 3 m~~~~~--~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G 80 (142) |+.|+. -|+-=...|.++ ....-.+++++.+..++...+.++|. +| +||++||.....+. |...- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~v~~~ak~~aPv------dt---G~Lr~SI~~~~~~~-~~~~~ 67 (140) T protein:vir:10 1 MATIRARARIEIDEAALERE---SGEHLRAFHRSLTRRIANQSRVAVPV------RT---GNLGRTIGELPQVY-TPFRV 67 (140) T ss_pred CeeeeeeeeeeeCHHHHHHH---HHHHHHHHHHHHHHHHHHHHHhcCCc------cc---hhhhccceeeeeeC-CCceE Confidence 333331 111111122222 22344555667777888888888985 23 59999998755321 11111 Q ss_pred eeEeccCCCCceeEEeecccCcc--------------------------c---cCCCchhhHHHHHHHHH-HHHHHH Q lcl|NC_018285. 81 VATVGWKNNYHAQNARRLNDGTK--------------------------K---YRADHFVTNVQNDSSVQ-KKVLLA 127 (142) Q Consensus 81 ~~~VG~~k~~~a~~A~f~n~GT~--------------------------k---~~~~hFie~t~~e~~~~-~~vl~A 127 (142) ...|| ..+-+|.|+++||. + ++|+||++++.++..++ ..|-.- T Consensus 68 ~~~v~----~~a~YA~~Ve~GT~ph~I~pk~~k~L~~~~~G~~~~~k~V~hpG~~a~Pfl~~A~~~~~~~~~~i~~~ 140 (140) T protein:vir:10 68 RGGVE----ATADYAAPVHEGSRPHAIRARNAQYLHFWWHGREMFRKSVWHPGTRARPFMRNSAQRVVTNDPRVRMT 140 (140) T ss_pred EEEec----CCccchhhhccCCCCceeecCCCccceeecCCCEEEeeeeecCCCCCChhHHHHHHHHhhhhhhccCC Confidence 22344 22467788888883 1 56999999998874322 222222 No 111 >protein:vir:97982 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1482 # MgeName: Orion # Cross-refs: genbank:acc:YP_655121;genbank:gi:109391871;genbank:GeneID:4157345 Probab=96.56 E-value=2.6e-06 Score=51.19 Aligned_cols=108 Identities=11% Similarity=0.081 Sum_probs=58.4 Q ss_pred cchHHH--HHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCccccccccc Q lcl|NC_018285. 3 MVGLDE--ALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNG 80 (142) Q Consensus 3 m~~~~~--~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G 80 (142) |+.|+. -|+-=...|.++ ....-.+++++.+..++...+.++|. +| +||++||.....+. |...- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~v~~~ak~~aPv------dt---G~Lr~SI~~~~~~~-~~~~~ 67 (140) T protein:vir:97 1 MATIRARARIEIDEAALERE---SGEHLRAFHRSLTRRIANQSRVAVPV------RT---GNLGRTIGELPQVY-TPFRV 67 (140) T ss_pred CeeeeeeeeeeeCHHHHHHH---HHHHHHHHHHHHHHHHHHHHHhcCCc------cc---hhhhccceeeeeeC-CCceE Confidence 333331 111111122222 22344555667777888888888985 23 59999998755321 11111 Q ss_pred eeEeccCCCCceeEEeecccCcc--------------------------c---cCCCchhhHHHHHHHHH-HHHHHH Q lcl|NC_018285. 81 VATVGWKNNYHAQNARRLNDGTK--------------------------K---YRADHFVTNVQNDSSVQ-KKVLLA 127 (142) Q Consensus 81 ~~~VG~~k~~~a~~A~f~n~GT~--------------------------k---~~~~hFie~t~~e~~~~-~~vl~A 127 (142) ...|| ..+-+|.|+++||. + ++|+||++++.++..++ ..|-.- T Consensus 68 ~~~v~----~~a~YA~~Ve~GT~ph~I~pk~~k~L~~~~~G~~~~~k~V~hpG~~a~Pfl~~A~~~~~~~~~~i~~~ 140 (140) T protein:vir:97 68 RGGVE----ATADYAAPVHEGSRPHAIRARNAQYLHFWWHGREMFRKSVWHPGTRARPFMRNSAQRVVTNDPRVRMT 140 (140) T ss_pred EEEec----CCccchhhhccCCCCceeecCCCccceeecCCCEEEeeeeecCCCCCChhHHHHHHHHhhhhhhccCC Confidence 22344 22467788888883 1 56999999998874322 222222 No 112 >protein:vir:79638 Length: 146 # NCBI annotation: gp40 # Family: family:all:448 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285529;genbank:gi:148734512;genbank:GeneID:5219996 Probab=96.54 E-value=8e-05 Score=43.04 Aligned_cols=117 Identities=15% Similarity=0.162 Sum_probs=69.6 Q ss_pred Cccc---hHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceec------- Q lcl|NC_018285. 1 MAMV---GLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQ------- 70 (142) Q Consensus 1 m~m~---~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~------- 70 (142) ||.. +|...+++|+++++.-. ..+++..|.-+...|...+|.. || -++-|-.+| T Consensus 1 ma~~~~~sFa~~i~~~~~~ve~~~-------~~~~r~~a~~i~~~vv~~sPVD------TG---r~Ranw~vs~~~~~~~ 64 (146) T protein:vir:79 1 MADYSIREFHGNVDKWIEQVESGL-------NDVIQIFGEKVHGALVDIAPVD------TG---RFKANMQITANKPPLY 64 (146) T ss_pred CCcchhHHHHHhHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHhCCCc------ch---hhccccceeecCcccc Confidence 7775 56678888888886543 3456666666666777788853 11 122221111 Q ss_pred ---Cccccccc-------------cc---eeEeccCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 71 ---STNADGRK-------------NG---VATVGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAE 131 (142) Q Consensus 71 ---~~~~dg~~-------------~G---~~~VG~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~ 131 (142) .++.+|.. .| ..+|=+. +..-+|.+||+|++.|.|..|++-+..+- .++++.-..+ T Consensus 65 ~~~~~dp~G~~t~~~~~~~i~~~~~g~~~~~~iyi~--NnlpYA~~LEyG~S~QAP~G~v~~~~~~~---~~~v~~a~~e 139 (146) T protein:vir:79 65 ALNQYDPDGEKIKAEGRRTLYALLHGGGAIKSIYFS--NMLIYANALEYGHSKQAPAGVFGIVAIRL---RSYMAEAIRE 139 (146) T ss_pred cccCCCCCCcccHHHHHHHHHHHHhcccccceeEEe--eCchhhhhhhccccCCCcchHHHHHHHHH---HHHHHHHHHH Confidence 11111110 00 0011111 23467899999999999999999998764 5677777777 Q ss_pred HHHHHhh Q lcl|NC_018285. 132 YEKLIRR 138 (142) Q Consensus 132 ~k~~l~~ 138 (142) .|+.+.= T Consensus 140 ~k~~~~l 146 (146) T protein:vir:79 140 ARKKNAL 146 (146) T ss_pred HHhhccC Confidence 7765444 No 113 >protein:vir:107703 Length: 147 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003902;genbank:gi:45686318;genbank:GeneID:2773043 Probab=96.17 E-value=8.2e-05 Score=42.99 Aligned_cols=116 Identities=9% Similarity=0.067 Sum_probs=63.6 Q ss_pred Cccch---HHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceec------- Q lcl|NC_018285. 1 MAMVG---LDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQ------- 70 (142) Q Consensus 1 m~m~~---~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~------- 70 (142) ||.-+ |...+++|.++++.- ..++++..|.-+...|...+|.. || -++-|-.++ T Consensus 1 ma~~~~~~F~~~i~~~~~~ve~~-------~~~~~r~~a~~i~~~vv~~sPVd------TG---r~Ranw~vs~~~~~~~ 64 (147) T protein:vir:10 1 MANYQIRRFQGEIDAWINAAEST-------LEHAIEIFVRDVHDALVSRSPVD------TG---RFKGNWQITFNEIPNH 64 (147) T ss_pred CCCcchhhhhhhHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHhCCCc------ch---hhccccceeecCcccc Confidence 55433 345677777776543 34456666666667777788852 11 122222111 Q ss_pred ---Ccccccc-----------------ccce-eEeccCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 71 ---STNADGR-----------------KNGV-ATVGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEK 129 (142) Q Consensus 71 ---~~~~dg~-----------------~~G~-~~VG~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~ 129 (142) .++..|. ..|. .-+++ ..-+|.+||+|++.|+|..|++-+..+- +.+++.-. T Consensus 65 ~~~~~dp~g~~t~a~~~~~~~~~~~~~~~~~~iyi~N----n~pYA~~LEyG~S~QAP~G~V~~t~q~~---~~~v~~~~ 137 (147) T protein:vir:10 65 ALNRYDKTGGVVRGEEQAKTYGMFSRGGAITSVHFSN----MLIYANALEYGHSQQAPSGVVGLVALRL---RSYMADAI 137 (147) T ss_pred ccCCcCCCccchhhhhhHHHHHHhhhccCcceEEEee----CcchhhhhhccccCCCCchHHHHHHHHH---HHHHHHHH Confidence 0111110 1111 11222 2467899999999999999999887664 45666555 Q ss_pred HHHHHHHhhcCCC Q lcl|NC_018285. 130 AEYEKLIRRKGGK 142 (142) Q Consensus 130 ~~~k~~l~~k~g~ 142 (142) .+.|+- |..- T Consensus 138 ~e~k~~---~~~~ 147 (147) T protein:vir:10 138 KQARRQ---QNAL 147 (147) T ss_pred HHHHhh---hccC Confidence 555542 2222 No 114 >protein:vir:3163 Length: 145 # NCBI annotation: unknown # Family: family:all:28417 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665934;genbank:gi:22091120;genbank:GeneID:951270 Probab=96.08 E-value=3.2e-05 Score=45.19 Aligned_cols=125 Identities=10% Similarity=0.012 Sum_probs=69.0 Q ss_pred cchHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHH---HHHHHH-hcCcCCCC---------CCCCC----Ccccchh Q lcl|NC_018285. 3 MVGLDEALEGWLETVASIGDITPAEQAKITTAGAKVF---QKELEE-VTREKHYS---------NKKDL----KYGHMAD 65 (142) Q Consensus 3 m~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~---~~~L~~-~tp~~~~~---------~~~~~----k~~HlaD 65 (142) |+.-+..|++++..|..... .....+|+.++ .+.+.+ ..|..... .+..+ ..++|++ T Consensus 1 ~i~~~~~i~~~l~~l~~~~~------~~l~~i~~~~~~~~~~rf~~~~~p~G~~W~pLs~st~a~k~~~~~L~~tG~L~~ 74 (145) T protein:vir:31 1 MVEDENNIPEAREAIQDGLT------DGLERLHTITLRELITNMSDGQDALGNPWEPLKESTIRAKGSDTPLIDNSRLLT 74 (145) T ss_pred CcccHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccChHHHHHhcCCCCCccCHHHHH Confidence 88877778888887765321 11222333222 222332 22221110 01111 1358999 Q ss_pred cceecC-ccccccccceeEeccCCCCceeEEeecccCccc--cCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC Q lcl|NC_018285. 66 GLSVQS-TNADGRKNGVATVGWKNNYHAQNARRLNDGTKK--YRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIRRKGGK 142 (142) Q Consensus 66 ~I~~~~-~~~dg~~~G~~~VG~~k~~~a~~A~f~n~GT~k--~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~~k~g~ 142 (142) +|..+- .+.+ +.++.||-+ --+|.+-++||.+ +||.||+-.+.... .+++...+.+...+-|+++-=+ T Consensus 75 Si~~~~~~~~~---~~~a~vGtn----~~YA~~hqfG~~~~~IPaRPfLG~~~~~~--~~~~~~ii~~~i~~~L~~~~~~ 145 (145) T protein:vir:31 75 DINAASMMDRA---NRMAVIGTN----LDYAEHHEFGAPEAGIPARPIFGPAGAYA--SQQAPDVIGDEIDTNLEGAVID 145 (145) T ss_pred HHHHHhhhccc---CceeEecCC----chhhhhhccCCcccccCCCCccCCCccch--HHHHHHHHHHHHHHHhhhhccC Confidence 998654 2221 235778832 2578899999976 99999996554332 2455556666666666665555 No 115 >protein:vir:2026 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046769;genbank:gi:9630340;genbank:GeneID:1261511 Probab=95.67 E-value=0.00015 Score=41.52 Aligned_cols=120 Identities=13% Similarity=0.097 Sum_probs=72.1 Q ss_pred cchHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhc-----CcCCC---CC-----CCCC-------Cccc Q lcl|NC_018285. 3 MVGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVT-----REKHY---SN-----KKDL-------KYGH 62 (142) Q Consensus 3 m~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~t-----p~~~~---~~-----~~~~-------k~~H 62 (142) |++|++..+.|-.-|.+| ++..+.++++.-|+.+...-+++. |.... .+ ++.+ .... T Consensus 1 ~~~~~~l~~~L~~ll~~l---~~~~~~~l~~~Ig~~l~~~~~~rf~~q~~PdG~~W~p~k~~~~~~k~g~~~~~l~~~~~ 77 (150) T protein:vir:20 1 MNEFKRFEDRLTGLIESL---SPSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSVRKKTGRVKRKMFAKLI 77 (150) T ss_pred CchHHHHHHHHHHHHHhc---CChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHHHhccCCCccccchhh Confidence 888765333333333444 334566677776666666655532 31110 01 0110 1236 Q ss_pred chhcceecCccccccccceeEeccCCCCceeEEeecccC----------ccccCCCchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 63 MADGLSVQSTNADGRKNGVATVGWKNNYHAQNARRLNDG----------TKKYRADHFVTNVQNDSSVQKKVLLAEKAEY 132 (142) Q Consensus 63 laD~I~~~~~~~dg~~~G~~~VG~~k~~~a~~A~f~n~G----------T~k~~~~hFie~t~~e~~~~~~vl~A~~~~~ 132 (142) |+.+|.++.. .-++.|||...+...||..-.+| ++.+|+.||+-=+.++ .++|+....+-+ T Consensus 78 l~~sl~~~~~------~~~~~vg~~~Gs~~~yAa~HQfG~~~~~~~~~~~~~iPaRp~LG~s~~d---~~~i~~~i~~~l 148 (150) T protein:vir:20 78 TSRFLHIRAS------PEQASMEFYGGKSPKIASVHQFGLSEENRKDGKKIDYPARPLLGFTGED---VQMIEEIILAHL 148 (150) T ss_pred hhhhhheeec------CcEEEEEeeCCcchhhhhhhhcccccccccCCCceeccccccCCCCHHH---HHHHHHHHHHHH Confidence 7777876541 22677998656667889988999 3579999999655443 367888888888 Q ss_pred HH Q lcl|NC_018285. 133 EK 134 (142) Q Consensus 133 k~ 134 (142) .+ T Consensus 149 ~k 150 (150) T protein:vir:20 149 ER 150 (150) T ss_pred hC Confidence 87 No 116 >protein:vir:94994 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224022;genbank:gi:62327309;genbank:GeneID:5176822 Probab=95.63 E-value=0.00011 Score=42.39 Aligned_cols=107 Identities=10% Similarity=0.058 Sum_probs=63.9 Q ss_pred chHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecC----------cc Q lcl|NC_018285. 4 VGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQS----------TN 73 (142) Q Consensus 4 ~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~----------~~ 73 (142) -.|...+.+|.+++++-.. ++++..+.-+...|...+|.. || -++-|..++- ++ T Consensus 1 msF~~~i~~~~~~ve~~~~-------~~~r~~a~~~~~~iv~~sPVd------TG---r~Ranw~vs~~~~~~~~~~~~d 64 (131) T protein:vir:94 1 MSFALDVTRFVEKAKKNPE-------KVIRQVSIKLFSAIIKASPVD------TG---RFRMNWMASGSTPADGTTDATD 64 (131) T ss_pred CCcccCHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHhCCCc------hh---hhhccchhccccccccccCCCC Confidence 3456778889988876432 445555555555666677752 11 1222221110 11 Q ss_pred -------------ccccccceeE-eccCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 74 -------------ADGRKNGVAT-VGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYE 133 (142) Q Consensus 74 -------------~dg~~~G~~~-VG~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k 133 (142) +.+-+.|.+. +++ ..-+|.+||+|+++|.|..|++-+..+- ..+++....+.| T Consensus 65 ~~g~~t~~~~~~~i~~~~~g~~iyi~N----n~pYA~~LEyG~S~QAP~g~v~~~~~~~---~~~v~~~~~e~k 131 (131) T protein:vir:94 65 KSGNTATGNATSFVLNAADWHTFTLTN----NLPYAQRLEYGWSQQAPQGFVRVNVSRF---QQLLNEEASKVK 131 (131) T ss_pred CCchhhHHHHHHHHhhccccceEEEee----CchhhhhhhccccCCCcchHHHHHHHHH---HHHHHHHHHhcC Confidence 1111222221 332 2467899999999999999999888764 677877777777 No 117 >protein:vir:106506 Length: 137 # NCBI annotation: Pas21 # Family: family:all:1084 # MgeID: mge:1680 # MgeName: phiAsp2 # Cross-refs: genbank:acc:YP_024807;genbank:gi:48697422;genbank:GeneID:2846163 Probab=95.62 E-value=2e-05 Score=46.36 Aligned_cols=108 Identities=12% Similarity=0.068 Sum_probs=57.9 Q ss_pred CccchHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCccccccccc Q lcl|NC_018285. 1 MAMVGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNG 80 (142) Q Consensus 1 m~m~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G 80 (142) |+-+...--.-.|..++ -..-.+++.+.+.-++...+.++|. + .+||++||.+...+.+|. .. T Consensus 1 ~~~~~~~l~~~~l~~~~-------~~~~~~~~~~~a~~ve~~ak~~aPv------~---TG~Lr~SI~~~~~~~~g~-~v 63 (137) T protein:vir:10 1 MVAHTLRIERAQLHGLG-------MDEARKAVNRVVRRTFTRSQILAPV------D---TGYLRASGRLVLGRERGA-VV 63 (137) T ss_pred CcccccccChhhHhhHH-------HHHHHHHHHHHHHHHHHHHHhcCCc------C---chhhhccceeeeeecccc-EE Confidence 55444331111222222 2334455666677777777888884 2 259999999866432221 11 Q ss_pred eeEeccCCCCceeEEeecccCccc-----------------------------cCCCchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 81 VATVGWKNNYHAQNARRLNDGTKK-----------------------------YRADHFVTNVQNDSSVQKKVLLAEKAE 131 (142) Q Consensus 81 ~~~VG~~k~~~a~~A~f~n~GT~k-----------------------------~~~~hFie~t~~e~~~~~~vl~A~~~~ 131 (142) ...|| +..-+|.|+++||.- ++|+||++++.++...+ +.|.. T Consensus 64 ~~~V~----~~~~YA~~ve~GT~ph~I~pk~~kaL~f~~~G~~vf~k~V~hPG~k~~PfL~~Al~~~~~~-~~~~~---- 134 (137) T protein:vir:10 64 IGSVE----YTARYAAAVHNGRRALTIRAKGNGRLKFTVEGRTVYARSVHQPARAGRPYLSQALREVAPQ-EGFRV---- 134 (137) T ss_pred EEEec----CCcccceeeecCCCCceeecCCCccceeecCCeeEeccceecCCCCCChhhHHHHHHhhcc-cceeE---- Confidence 22244 234577888888731 45889999888765322 22221 Q ss_pred HHHHHh Q lcl|NC_018285. 132 YEKLIR 137 (142) Q Consensus 132 ~k~~l~ 137 (142) .|+ T Consensus 135 ---~~~ 137 (137) T protein:vir:10 135 ---TIG 137 (137) T ss_pred ---eeC Confidence 111 No 118 >protein:vir:1988 Length: 156 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050635;genbank:gi:9633522;genbank:GeneID:2636282 Probab=95.60 E-value=6.7e-05 Score=43.47 Aligned_cols=123 Identities=12% Similarity=0.107 Sum_probs=74.3 Q ss_pred Cccc-hHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhc-----Cc-CCCC----------CC----CCCC Q lcl|NC_018285. 1 MAMV-GLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVT-----RE-KHYS----------NK----KDLK 59 (142) Q Consensus 1 m~m~-~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~t-----p~-~~~~----------~~----~~~k 59 (142) |.|. +++-.++++...|..|.... . ...+++.-|+.+....+++- |. .... .+ ..++ T Consensus 1 ms~~i~~~~d~~~l~~~L~~l~~~~-~-~~~l~~~Ig~~l~~~~~~rf~~~~~Pd~G~~W~pls~~t~~~r~~~~~~~~~ 78 (156) T protein:vir:19 1 MSLDMNVAVDVRRIQLALDELGTVT-R-DRAIPRVMAAALLSSTEQAFERQADPDTGKGWEAWSDSWLAWRQDHGFVPGS 78 (156) T ss_pred CeEEEEEeecHHHHHHHHHHHHhhh-c-cHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCCcccChHHHHHhhccCCCCCc Confidence 7766 34433445555555554321 1 22456655666655554433 42 2111 00 1111 Q ss_pred ----cccchhcceecCccccccccceeEeccCCCCceeEEeecccCcc--------ccCCCchhhHHHHHHHHHHHHHHH Q lcl|NC_018285. 60 ----YGHMADGLSVQSTNADGRKNGVATVGWKNNYHAQNARRLNDGTK--------KYRADHFVTNVQNDSSVQKKVLLA 127 (142) Q Consensus 60 ----~~HlaD~I~~~~~~~dg~~~G~~~VG~~k~~~a~~A~f~n~GT~--------k~~~~hFie~t~~e~~~~~~vl~A 127 (142) .++|+++|.+... ...+.||.+ ..||..-++|+. ++|+.||+-=+.++ .++|.+. T Consensus 79 ~L~~tg~L~~Si~~~~~------~~~v~vGt~----~~yA~vHqfG~~~~~~~~~~~iPaRpfLG~s~~d---~~~I~~~ 145 (156) T protein:vir:19 79 ILTLHGDLARSITTDYG------QDYALIGSP----KIYAAIHQWGGTPDMAPRPAGVPARPYMGLDKTG---EQEIFDA 145 (156) T ss_pred chhhhHHHHHHhhheec------CCEEEEecc----hhhhHHhhcCcccccCCCccccCCccccCCCHHH---HHHHHHH Confidence 3699999997541 226789964 357888999965 69999999544333 3689999 Q ss_pred HHHHHHHHHhh Q lcl|NC_018285. 128 EKAEYEKLIRR 138 (142) Q Consensus 128 ~~~~~k~~l~~ 138 (142) ..+-+++++++ T Consensus 146 i~~~l~~~~~~ 156 (156) T protein:vir:19 146 IRKRVSAALRQ 156 (156) T ss_pred HHHHHHHHhhC Confidence 99999999998 No 119 >protein:vir:6071 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878212;genbank:gi:33438911;genbank:GeneID:1457746 Probab=95.57 E-value=0.0002 Score=40.88 Aligned_cols=120 Identities=11% Similarity=0.080 Sum_probs=71.7 Q ss_pred cchHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHh-----cCcCCC---CCC-----CC---CC----ccc Q lcl|NC_018285. 3 MVGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEV-----TREKHY---SNK-----KD---LK----YGH 62 (142) Q Consensus 3 m~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~-----tp~~~~---~~~-----~~---~k----~~H 62 (142) |++|++..+.|-.-|.+|.+ ..+.++++.-|+.+...-+++ .|.... ... +. ++ ... T Consensus 1 ~~~~~~l~~~L~~~l~~L~~---~~~~~l~r~Ig~~l~~~~~~Rf~~q~~PdG~~W~p~~~~~~~~k~~~~~~~l~~~~~ 77 (150) T protein:vir:60 1 MNEFKRFEDRLTGLIESLSP---SGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSARKKTGRVKRKMFAKLI 77 (150) T ss_pred CchHHHHHHHHHHHHHhcCC---hhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccChHHHHHhhcCCCccchhhhh Confidence 88888755555555666643 345667776666665555443 232111 010 00 00 124 Q ss_pred chhcceecCccccccccceeEeccCCCCceeEEeecccC----------ccccCCCchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 63 MADGLSVQSTNADGRKNGVATVGWKNNYHAQNARRLNDG----------TKKYRADHFVTNVQNDSSVQKKVLLAEKAEY 132 (142) Q Consensus 63 laD~I~~~~~~~dg~~~G~~~VG~~k~~~a~~A~f~n~G----------T~k~~~~hFie~t~~e~~~~~~vl~A~~~~~ 132 (142) +..+|.++.. .-.+.|||...+...||..-.+| ++.+|+.||+-=+.++ ..+|++.+.+-+ T Consensus 78 l~~sl~~~~~------~~~a~vg~~~Gt~~~yAaiHQfG~~~~~~~~~~~~~iPaRp~LG~s~~d---~~~i~~~i~~~l 148 (150) T protein:vir:60 78 TSRFLHIRAS------PEQASMEFYGGKSPKIASVHQFGLSEENRKDGKKIDYPARPLLGFTGED---VQMIEEIILAHL 148 (150) T ss_pred hcceeeeeee------CcEEEEEeeCCCchhhhhhhhccccccccCCCCceecCCcccCCCCHHH---HHHHHHHHHHHH Confidence 4555554331 22577998656667899999999 4478999999655443 257877777777 Q ss_pred HH Q lcl|NC_018285. 133 EK 134 (142) Q Consensus 133 k~ 134 (142) .+ T Consensus 149 ~r 150 (150) T protein:vir:60 149 DR 150 (150) T ss_pred hC Confidence 76 No 120 >protein:vir:5703 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839862;genbank:gi:30065717;genbank:GeneID:1260611 Probab=95.41 E-value=0.00025 Score=40.34 Aligned_cols=120 Identities=11% Similarity=0.076 Sum_probs=70.8 Q ss_pred cchHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHh-----cCcCCCC---CC-----CC---CC----ccc Q lcl|NC_018285. 3 MVGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEV-----TREKHYS---NK-----KD---LK----YGH 62 (142) Q Consensus 3 m~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~-----tp~~~~~---~~-----~~---~k----~~H 62 (142) |++|++..+.|-.-|.+|.+ .....+++.-|+.+...-+++ .|..... .. +. ++ ... T Consensus 1 m~~~~~l~~~L~~~l~~L~~---~~~~~l~~~Ig~~l~~~~~~rf~~q~~PdG~~W~p~k~~~~~~k~~~~~~~l~~~~~ 77 (150) T protein:vir:57 1 MNEFKRFEDRLTGLIESLSP---SGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSARKKTGRVKRKMFAKLI 77 (150) T ss_pred CchHHHHHHHHHHHHHhcCC---hhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccChHHHHHhccCCCcccchhhh Confidence 88887655555555566643 335566776666665555443 3321110 00 00 00 124 Q ss_pred chhcceecCccccccccceeEeccCCCCceeEEeecccC----------ccccCCCchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 63 MADGLSVQSTNADGRKNGVATVGWKNNYHAQNARRLNDG----------TKKYRADHFVTNVQNDSSVQKKVLLAEKAEY 132 (142) Q Consensus 63 laD~I~~~~~~~dg~~~G~~~VG~~k~~~a~~A~f~n~G----------T~k~~~~hFie~t~~e~~~~~~vl~A~~~~~ 132 (142) ++.+|.++.. .-.+.|||...+...||+.-.+| .+.+|+.||+-=+.++ ..+|++.+.+-+ T Consensus 78 l~~sl~~~~~------~~~a~vg~~~G~~~~yAaiHQfG~~~r~~~~~~~~~iPaRp~LG~s~~d---~~~i~~~i~~~l 148 (150) T protein:vir:57 78 TSRFLHIRAS------PEQASMEFYGGKSPKIASVHQFGLSEETRKDGKKIDYPARPLLGFTGED---VQMIEEIILAHL 148 (150) T ss_pred hccceeeeee------CcEEEEEeecCCchhhhhhhhccccccccCCCceeecCCcccCCCCHHH---HHHHHHHHHHHH Confidence 4555655431 22577998655667899999998 3368999999655443 257888887777 Q ss_pred HH Q lcl|NC_018285. 133 EK 134 (142) Q Consensus 133 k~ 134 (142) .+ T Consensus 149 ~r 150 (150) T protein:vir:57 149 DR 150 (150) T ss_pred hC Confidence 77 No 121 >protein:vir:99833 Length: 190 # NCBI annotation: hypothetical protein # Family: family:all:274 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164071;genbank:gi:56692603;genbank:GeneID:3192561 Probab=95.28 E-value=0.00018 Score=41.09 Aligned_cols=126 Identities=13% Similarity=0.085 Sum_probs=71.6 Q ss_pred CccchHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhc-----CcCCCC---C--------CCCC----Cc Q lcl|NC_018285. 1 MAMVGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVT-----REKHYS---N--------KKDL----KY 60 (142) Q Consensus 1 m~m~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~t-----p~~~~~---~--------~~~~----k~ 60 (142) |+-+.++-=++++.+.|..|.... ...+.+++.-|..+....+++- |..+.. + +..+ .. T Consensus 1 M~~i~i~~d~~~~~~~L~~l~~~~-~~~~~l~~~ig~~l~~~~~~rf~~~~~PdG~~W~p~~~~t~~rk~~~~~~~L~~t 79 (190) T protein:vir:99 1 MAGITLEWDGRRALDVLNAGSAAL-GDPSGLLQDIGELLLNIHRRRFQAQVSPDGTPWQPLSPAYLRRKRKNRDKILTLD 79 (190) T ss_pred CceeEEEecHHHHHHHHHHHHHHh-hhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCccccHHHHHHhhcCCCccceec Confidence 554443311223333333332211 2344667766666666555433 322110 0 0111 14 Q ss_pred ccchhcceecCccccccccceeEeccCCCCceeEEeecccC--------------------------------------- Q lcl|NC_018285. 61 GHMADGLSVQSTNADGRKNGVATVGWKNNYHAQNARRLNDG--------------------------------------- 101 (142) Q Consensus 61 ~HlaD~I~~~~~~~dg~~~G~~~VG~~k~~~a~~A~f~n~G--------------------------------------- 101 (142) ++|+++|.+... ...+.||.+. -+|..-++| T Consensus 80 g~L~~Si~~~~~------~~~v~vGtn~----~yA~iHq~Gg~i~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~ 149 (190) T protein:vir:99 80 GHLRNLLRYQLD------GSELLFGSDR----PYAAIHHFGGTIQRQARSSTVYFRQNERTGEVGREFVPRRRSNFAQDV 149 (190) T ss_pred HHHHHHHhheec------CcEEEEecCc----chhhhhhcCCcccccccchhhhhhhhhhhhhhhcccccccccccchhc Confidence 699999996531 2367888642 345556666 Q ss_pred -----ccccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhhcC Q lcl|NC_018285. 102 -----TKKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIRRKG 140 (142) Q Consensus 102 -----T~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~~k~ 140 (142) |+++|+.+|+--+.++ .++|...+.+-+++++.++- T Consensus 150 ~~~~~~v~IPaRpfLG~s~~d---~~~I~~~i~~~l~~~~~~~~ 190 (190) T protein:vir:99 150 QIGPYTIQMPARPWLGTSSQD---DDTILQRVERYLQRALRERA 190 (190) T ss_pred ccccceeeecCcccCCCCHHH---HHHHHHHHHHHHHHHHhhcC Confidence 4678999999544333 36899999999999999888 No 122 >protein:vir:78380 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110844;genbank:gi:134288605;genbank:GeneID:5179643 Probab=94.65 E-value=0.00012 Score=42.14 Aligned_cols=107 Identities=10% Similarity=0.077 Sum_probs=63.8 Q ss_pred chHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceec----------Ccc Q lcl|NC_018285. 4 VGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQ----------STN 73 (142) Q Consensus 4 ~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~----------~~~ 73 (142) -.|...+..|+++++.-.. ++++..+.-+...|...+|.. || -++-|..++ .++ T Consensus 1 msf~~~i~~~~~~ve~~~~-------~~~r~~a~~~~~~iv~~sPVd------TG---r~Ranw~vs~~~~~~~~~~~~d 64 (131) T protein:vir:78 1 MSFALDVSKFVEKAKKNPE-------KVIRQVSIKLFSAIIKASPVD------TG---RFRMNWMASGGTPADGTTDATD 64 (131) T ss_pred CCcCcCHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHhCCCc------hh---hhccccceecccccccccCCCC Confidence 3455678888888866432 445555566666666677852 11 122222111 111 Q ss_pred cc-------------ccccceeE-eccCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 74 AD-------------GRKNGVAT-VGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYE 133 (142) Q Consensus 74 ~d-------------g~~~G~~~-VG~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k 133 (142) .+ +-+.|.+. ++ +..-+|.+||+|+++|+|..|++-+..+- .++++....+.| T Consensus 65 ~~g~~t~~~~~~~i~~~~~g~~iyi~----Nn~pYA~~LEyG~S~QAP~G~v~~~~~~~---~~~v~~~~~e~k 131 (131) T protein:vir:78 65 KAGTTATSNAANFVLNAADWHTFTLT----NNLPYAQRLEYGWSQQAPQGFVRVNVSRF---QQLLNEEASKVK 131 (131) T ss_pred CCchhhHHHHHHHHhhccCCceEEEe----eCchhhhHhhccccCCCcchHHHHHHHHH---HHHHHHHHHhcC Confidence 11 11112211 23 22467899999999999999999888764 577777777777 No 123 >protein:vir:100312 Length: 152 # NCBI annotation: tail synthesis protein S # Family: family:all:370 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655481;genbank:gi:109289949;genbank:GeneID:4157355 Probab=93.89 E-value=0.00096 Score=37.13 Aligned_cols=125 Identities=14% Similarity=0.120 Sum_probs=63.4 Q ss_pred cchHHHHHHHHHHHHHHh-ccccHHHHHHHHHHHHHHHHHHHHHh-----cCcCCCC---------CCCCCCcccchhcc Q lcl|NC_018285. 3 MVGLDEALEGWLETVASI-GDITPAEQAKITTAGAKVFQKELEEV-----TREKHYS---------NKKDLKYGHMADGL 67 (142) Q Consensus 3 m~~~~~~l~e~~~~l~kl-~~~~~~~~~ka~~AGA~v~~~~L~~~-----tp~~~~~---------~~~~~k~~HlaD~I 67 (142) |++ -|.++...|+.| ..+++..++..++.-|+.+...-+++ .|..... .+...+.+.|-..+ T Consensus 1 M~~---~~~~~~~~L~~ll~~L~~~~r~~l~~~Ig~~l~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~k~~~~~~~m~~~L 77 (152) T protein:vir:10 1 MSE---PIEQVKTAFDSLLNNISKPRRRLMYQQIGRELARSQRRRIKAQQNPDGSAYEPRKKPKKGVKSKIKSGKMFDKI 77 (152) T ss_pred Cch---HHHHHHHHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchhhhhhhhhcccccchhHHHhh Confidence 555 333444444332 23344455567766666655444432 3321111 11111222343333 Q ss_pred eecC-ccccccccceeEeccCCCCceeEEeecccC-----------ccccCCCchhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 68 SVQS-TNADGRKNGVATVGWKNNYHAQNARRLNDG-----------TKKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKL 135 (142) Q Consensus 68 ~~~~-~~~dg~~~G~~~VG~~k~~~a~~A~f~n~G-----------T~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~ 135 (142) .-+. .+..-. .-.+.|||. .+...||+.-.+| ++.+|+.||+-=+.++. .+|++.+.+-+... T Consensus 78 ~~a~~l~~~a~-~~~~~Vg~~-Gt~~~yAaiHQfG~~~r~~~~~~~~v~iPaRp~LG~s~~d~---~~I~~~i~~~l~~a 152 (152) T protein:vir:10 78 TQPRFMRLRLE-SEGVSLGYE-GGDAVIARIHQQGLIGRVRKDWDLKVKYASRELLGFTDDDL---QMIEDYMINILAGS 152 (152) T ss_pred hhcceeeeeec-CcEEEEEec-CCchhhhhhhccCccccccCCCCcceeccccccCCCCHHHH---HHHHHHHHHHHhcC Confidence 2111 111111 225779984 4556788777777 67899999996555442 56777777776665 No 124 >protein:vir:107757 Length: 189 # NCBI annotation: gp20 # Family: family:all:503 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024868;genbank:gi:48697510;genbank:GeneID:2948378 Probab=93.45 E-value=0.00019 Score=41.00 Aligned_cols=85 Identities=13% Similarity=0.076 Sum_probs=52.6 Q ss_pred HHHhcCcCCCCCCCCCCcccchhcceecCccccccccceeEeccCCC-------CceeEEeecccCc--cccCCCchhhH Q lcl|NC_018285. 43 LEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNGVATVGWKNN-------YHAQNARRLNDGT--KKYRADHFVTN 113 (142) Q Consensus 43 L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G~~~VG~~k~-------~~a~~A~f~n~GT--~k~~~~hFie~ 113 (142) +...+. +.+....+|..-+.- .....+.|||.-. ..++||-+.|+|| ..+||.||+.. T Consensus 1 M~~~i~------~~~~~~~~L~~~lk~-------l~~k~V~VGi~~~~~y~dG~~vA~Ia~~~E~G~p~~~IP~RPFlr~ 67 (189) T protein:vir:10 1 MGRVIR------KQGPARVKLNAFIKG-------MNDYSVRIGWFSTAKYPDGTPTAYVASIHEFGAPSRGIPARSFIRP 67 (189) T ss_pred Ccceec------cCcHHHHHHHHHHHH-------hhCCeEEEEecCCCCCCCcccHHHHHHHHHhcCcCCCCCCchhhhH Confidence 111111 011111233322221 1123567887521 2568999999998 56999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcCCC Q lcl|NC_018285. 114 VQNDSSVQKKVLLAEKAEYEKLIRRKGGK 142 (142) Q Consensus 114 t~~e~~~~~~vl~A~~~~~k~~l~~k~g~ 142 (142) +.++. +.++.+.+...++.+|.+...= T Consensus 68 t~~~~--~~~~~~~l~~~~~~vl~G~~~~ 94 (189) T protein:vir:10 68 TIAAQ--QAAWSQQMRFYAKQIVVGQMNV 94 (189) T ss_pred HHHHH--HHHHHHHHHHHHHHHHhCCCCH Confidence 99987 5788899999999998776322 No 125 >protein:vir:96105 Length: 193 # NCBI annotation: hypothetical protein ORF028 # Family: family:all:503 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294445;genbank:gi:149408342;genbank:GeneID:5237224 Probab=93.36 E-value=0.00088 Score=37.33 Aligned_cols=124 Identities=13% Similarity=0.048 Sum_probs=69.5 Q ss_pred CccchHHHHHHHHHHHHHHhccccHHHHHHHH-----------HHHHHHH--HHHHHHhcCcCCCCCCCCCCcccchhcc Q lcl|NC_018285. 1 MAMVGLDEALEGWLETVASIGDITPAEQAKIT-----------TAGAKVF--QKELEEVTREKHYSNKKDLKYGHMADGL 67 (142) Q Consensus 1 m~m~~~~~~l~e~~~~l~kl~~~~~~~~~ka~-----------~AGA~v~--~~~L~~~tp~~~~~~~~~~k~~HlaD~I 67 (142) |.|-.-...|+++++.|+.|......+- +. .-|..+. +-..+-=.+-.|... ...+...+ T Consensus 1 m~~~~~~~~~~~~~~~l~~l~~~~v~vG--i~~~~~~~~~~~~~~G~~va~iAai~EfG~~I~~~~~-----~~~~~~~~ 73 (193) T protein:vir:96 1 MSLRRDSELIAAHLQMLRAMRGRSVSAG--WYSTARYPDKAGGSVGIQVARIARLNEYGGTIDHPGG-----TRYIRDAI 73 (193) T ss_pred CeeccchHHHHHHHHHHHHhcCCeEEEE--EcCCCCCCCcccccccchHHHHHhHHHcCCccccCcc-----ceeeeecc Confidence 7777666789999999998875432110 00 0000000 000000000000000 00111111 Q ss_pred eecCccccccccceeEeccCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhhcC-CC Q lcl|NC_018285. 68 SVQSTNADGRKNGVATVGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIRRKG-GK 142 (142) Q Consensus 68 ~~~~~~~dg~~~G~~~VG~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~~k~-g~ 142 (142) . .++.+-+.|.+...+..+.|.--.|+++|+.||+..+.++.. +++.+...+.++.++.+.. .+ T Consensus 74 ~---------~g~~~~~~~~k~~~~~~~~~~~~~~v~IPaRPFlr~t~~~~~--~~~~~~~~~~~~~~~~g~~~~~ 138 (193) T protein:vir:96 74 V---------RGRFVGVRFVRNDFPGETEVTKPHRITIPARPFMRYAWNLFS--ADRAAIQNRIAMRLARGQITPD 138 (193) T ss_pred c---------cccccccceeccCcceeeEeecceeccCCCcchhhhhHHHHH--HHHHHHHHHHHHHHHhCCCCHH Confidence 1 122344555554445677888888999999999999998874 6788999999999998764 33 No 126 >protein:vir:98557 Length: 149 # NCBI annotation: gp14 # Family: family:all:370 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958069;genbank:gi:41057366;genbank:GeneID:2744228 Probab=93.24 E-value=0.0013 Score=36.47 Aligned_cols=118 Identities=17% Similarity=0.180 Sum_probs=67.7 Q ss_pred cchHHHHHHHHHH-HHHHhccccHHHHHHHHHHHHHHHHHHHHHh-----cCcCC---CCCC-----CCCC-------cc Q lcl|NC_018285. 3 MVGLDEALEGWLE-TVASIGDITPAEQAKITTAGAKVFQKELEEV-----TREKH---YSNK-----KDLK-------YG 61 (142) Q Consensus 3 m~~~~~~l~e~~~-~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~-----tp~~~---~~~~-----~~~k-------~~ 61 (142) |.||.+ |++.|. -|.+|. +..++.+++.-|+.+...-+++ .|... ..+. +.+. .+ T Consensus 1 m~d~~~-l~~~L~~ll~~L~---~~~~~~ll~~Ig~~l~~~t~~rf~~q~~PdG~~W~p~~~~~~~~k~~~~~~~l~~~g 76 (149) T protein:vir:98 1 MSELTA-LQERLTGLIASLS---PAARRQMAADIAKKLRASQQQRIRRQQAPDGTPYAARKRQSVRSKKGRIRREMFARL 76 (149) T ss_pred CchHHH-HHHHHHHHHHhcC---chhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHHhccCCCCcccchhh Confidence 887765 333333 334553 3456677777666666655543 34211 1111 1111 13 Q ss_pred cchhcceecCccccccccceeEeccCCCCceeEEeecccCc----------cccCCCchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 62 HMADGLSVQSTNADGRKNGVATVGWKNNYHAQNARRLNDGT----------KKYRADHFVTNVQNDSSVQKKVLLAEKAE 131 (142) Q Consensus 62 HlaD~I~~~~~~~dg~~~G~~~VG~~k~~~a~~A~f~n~GT----------~k~~~~hFie~t~~e~~~~~~vl~A~~~~ 131 (142) .|+.+|.+.. ....+.|||. .+...||..-.+|. +++|+.+|+-=+.++ .++|+..+.+- T Consensus 77 ~l~~sl~~~~------~~~~~~V~~~-Gs~~~yAa~HQfG~~~r~~~~~~~~~iPaRp~LG~s~~d---~~~i~~~i~~~ 146 (149) T protein:vir:98 77 RTNRFMKAKG------SDSAAVVEFT-GRVQRMARVHQYGLKDRPNRHSRDVQYAARPLLGFTRDD---EQMIEDIIIRH 146 (149) T ss_pred hhhhhhhhee------cCCeeEEEec-CcchHHhhHhhccccccccCCCcceeccccccCCCCHHH---HHHHHHHHHHH Confidence 4566766543 1225779884 45578999999994 379999999543332 25777777777 Q ss_pred HHH Q lcl|NC_018285. 132 YEK 134 (142) Q Consensus 132 ~k~ 134 (142) +.| T Consensus 147 l~~ 149 (149) T protein:vir:98 147 LGK 149 (149) T ss_pred hhC Confidence 776 No 127 >protein:vir:101563 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958111;genbank:gi:41057657;genbank:GeneID:2716820 Probab=92.67 E-value=0.00024 Score=40.44 Aligned_cols=93 Identities=22% Similarity=0.182 Sum_probs=41.8 Q ss_pred HHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCccccccccceeEecc----C---CCCceeEEeecccCccccCCC Q lcl|NC_018285. 36 AKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNGVATVGW----K---NNYHAQNARRLNDGTKKYRAD 108 (142) Q Consensus 36 A~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G~~~VG~----~---k~~~a~~A~f~n~GT~k~~~~ 108 (142) -+|-.+.|+.-..+ .+.. +-+.+=++++ .-.+.+|.. ..+|. + .-..+.+|-+.|+||.++||. T Consensus 1 m~v~r~~L~~~~~~--l~~~-~V~VGi~~~a---~y~d~~g~~---~~~g~~~~~~~~~G~pva~ia~~~e~G~~~IP~R 71 (155) T protein:vir:10 1 MSVTRRGLTLPKDR--YKSM-SVKAGVLAGA---TYPDESGKK---LADGTILKKDPRAGLPVAMIAMALNYGTSKLPAR 71 (155) T ss_pred CcchHHHHHHHHHH--hhCC-eeEEeecCCC---CCCccccch---hhhhhhhccccccCcchhhhhhhhhcCCCCCCCc Confidence 44444444332211 0000 0000000000 000000100 00000 0 112367899999999999999 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC Q lcl|NC_018285. 109 HFVTNVQNDSSVQKKVLLAEKAEYEKLIRRKGGK 142 (142) Q Consensus 109 hFie~t~~e~~~~~~vl~A~~~~~k~~l~~k~g~ 142 (142) ||+..+.++.. .++.++..+.++..+ .|+ T Consensus 72 PFlr~t~~~~~--~~~~~~l~~~~~~~~---~~~ 100 (155) T protein:vir:10 72 PFMEKTIADRS--AEWIKGLTVMMTMGY---DAE 100 (155) T ss_pred chhHHHHHHHH--HHHHHHHHHHHHcCC---CHH Confidence 99999998874 566666555443211 112 No 128 >protein:vir:99546 Length: 200 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039796;genbank:gi:126011046;genbank:GeneID:4818241 Probab=92.53 E-value=0.00046 Score=38.90 Aligned_cols=90 Identities=17% Similarity=0.124 Sum_probs=49.4 Q ss_pred HHHHHHHhcCcCCCCCCCCCCcccchhcceecCccccccccceeEeccCCC-------------CceeEEeecccC---- Q lcl|NC_018285. 39 FQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNGVATVGWKNN-------------YHAQNARRLNDG---- 101 (142) Q Consensus 39 ~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G~~~VG~~k~-------------~~a~~A~f~n~G---- 101 (142) +.+.|.-.... ..+ ..+.+-+. .++--....+.|||... ..++||.+.|+| T Consensus 1 ~~~~~~~~~k~------~~~--~~~~~~~~----~l~~l~~~~v~vGi~~~~~y~~~~~~~dG~~va~IA~~~EfG~~i~ 68 (200) T protein:vir:99 1 MKKGFSKSNSV------AAP--LKHFQMLK----QFDALKGKTVQAGWFETDRYPAKEGETIGPLVAKIARQLEFGGVIN 68 (200) T ss_pred CCcCcceeeee------ecc--hHHHHHHH----HHHHhhCCeEEEEEcCCCCcCCcccccccchHHHHHhHHHcCCeec Confidence 00000000000 000 01111110 01111123567888411 245788888888 Q ss_pred -------------------------------------ccccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhhcCC-C Q lcl|NC_018285. 102 -------------------------------------TKKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIRRKGG-K 142 (142) Q Consensus 102 -------------------------------------T~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~~k~g-~ 142 (142) |+++||.||+..+.++. ++++.+...+.++.+|.+... + T Consensus 69 ~p~~~~~~~~~~~~g~~~g~rfv~k~~~~~~~~~~~~~v~IP~RPFlr~t~~~~--~~~~~~~~~~~~~~~l~g~~~~~ 145 (200) T protein:vir:99 69 HPGGTKYIKDAIVDGRYVGTRFVHKSFQGEHEVTKAHQIVIPARPFMRLAWATF--NKDKVKIQAQIARQLLDGTINPE 145 (200) T ss_pred cCCCccccccccccccccccccccccccceeeeeccccccCCCcchhhHHHHHH--HHHHHHHHHHHHHHHHhCCCCHH Confidence 45789999999999987 467888898999999887643 2 No 129 >protein:vir:77650 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:YP_022741;genbank:gi:47835022;genbank:GeneID:2821447 Probab=92.22 E-value=0.00033 Score=39.67 Aligned_cols=89 Identities=25% Similarity=0.217 Sum_probs=40.4 Q ss_pred HHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCccccc--cccc-------eeEeccCCCCceeEEeecccCccccC Q lcl|NC_018285. 36 AKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADG--RKNG-------VATVGWKNNYHAQNARRLNDGTKKYR 106 (142) Q Consensus 36 A~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg--~~~G-------~~~VG~~k~~~a~~A~f~n~GT~k~~ 106 (142) -++-...|+..-.+ .+.+. -+.+=+.+ .+.-|| .... .-..|. ..+.+|-+.|+||.++| T Consensus 1 m~~~r~~l~~~~~~--l~~~~-v~VGi~~~-----a~y~d~~~~~~~~~~~~~~~~~~G~---pva~ia~~~e~G~~~IP 69 (155) T protein:vir:77 1 MSVTRRGLTLPKDR--YRSMS-VKAGVLAG-----ATYPDESGKKLADGSILKKDPRAGL---PVAMIAMALNYGTSKLP 69 (155) T ss_pred CcchHHHHHHHHHH--HhcCc-eEEeecCC-----CCCccccchhhhhhhhccccccccc---cHhhhhhhhhcCCCCCC Confidence 33333333222111 01000 00000000 000010 0000 000121 23578999999999999 Q ss_pred CCchhhHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC Q lcl|NC_018285. 107 ADHFVTNVQNDSSVQKKVLLAEKAEYEKLIRRKGGK 142 (142) Q Consensus 107 ~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~~k~g~ 142 (142) |.||+..+.++.. .++.++..+.++. ++. T Consensus 70 ~RPFlr~t~~~~~--~~~~~~l~~~~~~-----~~~ 98 (155) T protein:vir:77 70 ARPFMEKTIADRS--AEWIKGLTVMMTM-----GYD 98 (155) T ss_pred CCchhhHHHHHHH--HHHHHHHHHHHHc-----cCc Confidence 9999999998874 5666666555432 233 No 130 >protein:vir:79225 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469157;genbank:gi:157835000;genbank:GeneID:5648806 Probab=92.19 E-value=0.0024 Score=34.96 Aligned_cols=125 Identities=10% Similarity=0.033 Sum_probs=65.5 Q ss_pred CccchHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhc-CcCCC-----------------CCCCC-CCcc Q lcl|NC_018285. 1 MAMVGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVT-REKHY-----------------SNKKD-LKYG 61 (142) Q Consensus 1 m~m~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~t-p~~~~-----------------~~~~~-~k~~ 61 (142) =+|.+++--.+++.+.|..|.... +....+++.-|+.+....+++- |.... +..+. -..+ T Consensus 2 ~~~i~i~~d~~~~~~~L~~l~~~~-~d~~~l~~~ig~~l~~~~~~rF~~eG~~W~pls~~t~~~r~~~g~~~~~iL~~tG 80 (155) T protein:vir:79 2 TTRIDVELDDQEVRQRLAVLMRSV-TDTLPVMRGIAAELLAETEFAFMDEGPGWPQLSPATVAAREAKGRGPHPILQVTN 80 (155) T ss_pred ceEEEEEechHHHHHHHHHHHHHh-hhHHHHHHHHHHHHHHHHHHHhhccCCCCCCCCHHHHHHHhccCCCCCCccccch Confidence 122322211134555555554332 3456677766666666555543 32111 00000 1236 Q ss_pred cchhcceecCccccccccceeEeccCCCCceeEEeecccCcc-------ccCCCchhhHHHHH---HHHHHHHHHHHHHH Q lcl|NC_018285. 62 HMADGLSVQSTNADGRKNGVATVGWKNNYHAQNARRLNDGTK-------KYRADHFVTNVQND---SSVQKKVLLAEKAE 131 (142) Q Consensus 62 HlaD~I~~~~~~~dg~~~G~~~VG~~k~~~a~~A~f~n~GT~-------k~~~~hFie~t~~e---~~~~~~vl~A~~~~ 131 (142) +|+++|.+.. ....+.||-+ -.+|.+-+||+. .+|+.||+--+.+. .+..++|++...+- T Consensus 81 ~L~~Si~~~~------~~~~v~vGt~----~~YA~iHqfGg~~~~~~~v~iPaRpfLG~s~~~~l~~~~~~~I~~~i~~~ 150 (155) T protein:vir:79 81 ALARSVTTWA------DRNEAGIGSN----LVYAAIHQFGGDAGRGHQVEIPARRYLPFDENGQLAAGARQSILEVVLTA 150 (155) T ss_pred hhhhhhhcee------cCCEEEEecC----chhhhhhhcccccCCCCccccCCccccCCCCccccchHHHHHHHHHHHHH Confidence 8999999764 1336778853 256888999965 79999999432221 12234555555555 Q ss_pred HHHHHhhc Q lcl|NC_018285. 132 YEKLIRRK 139 (142) Q Consensus 132 ~k~~l~~k 139 (142) +++- + T Consensus 151 l~r~---r 155 (155) T protein:vir:79 151 LSRN---R 155 (155) T ss_pred HHhc---C Confidence 5422 2 No 131 >protein:vir:103841 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938236;genbank:gi:38229141;genbank:GeneID:2648156 Probab=91.78 E-value=0.0024 Score=34.99 Aligned_cols=128 Identities=13% Similarity=0.051 Sum_probs=70.1 Q ss_pred Ccc-chHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhc-CcCCCC--------------CCCC----CCc Q lcl|NC_018285. 1 MAM-VGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVT-REKHYS--------------NKKD----LKY 60 (142) Q Consensus 1 m~m-~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~t-p~~~~~--------------~~~~----~k~ 60 (142) |.. .+++--..++...|+.|.... ++...++..-|+.+....+++- |..... .+.. ... T Consensus 1 Ms~~i~i~~~~~~~~~~L~~l~~~~-~~~~~l~~~ig~~l~~~~~~rF~p~G~~W~plsp~t~~~r~k~g~~~~~~L~~t 79 (155) T protein:vir:10 1 MANRIELELVDREVQERLAALYAAV-TDTLPLMRGIAAELLAETEFAFMDEGPGWPQLSPVTVAARAAKGRGAHPILQVT 79 (155) T ss_pred CCceEEEEechHHHHHHHHHHHHHh-hhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCCccchHHHHhccCCCCCccccc Confidence 542 112211233444444444322 3445666666666665554433 321110 0111 114 Q ss_pred ccchhcceecCccccccccceeEeccCCCCceeEEeecccCcc-------ccCCCchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 61 GHMADGLSVQSTNADGRKNGVATVGWKNNYHAQNARRLNDGTK-------KYRADHFVTNVQNDSSVQKKVLLAEKAEYE 133 (142) Q Consensus 61 ~HlaD~I~~~~~~~dg~~~G~~~VG~~k~~~a~~A~f~n~GT~-------k~~~~hFie~t~~e~~~~~~vl~A~~~~~k 133 (142) ++|+++|.+.. ....+.||.+. -+|.+-+||+. .+|+.||+-=.... +-+.+|.+.+.+.+. T Consensus 80 G~L~~Si~~~~------~~~~v~vGtn~----~YA~iHqfGg~~~~~~~~~iPARPfLG~s~~~-e~~~ei~~~I~~~i~ 148 (155) T protein:vir:10 80 NALARSITTRA------DRDQAQIGSNL----SYAAIQQLGGQAGRGRKVTIPARPYLPVLRNG-QLKPSARDAVLDVLL 148 (155) T ss_pred hhhhhhhhcee------cCCEEEEecCc----chhhhhhcccccCCCCccccCCccccCCCccc-cchHHHHHHHHHHHH Confidence 69999999764 23367899642 46888899964 69999999522211 123577788888888 Q ss_pred HHHhhcC Q lcl|NC_018285. 134 KLIRRKG 140 (142) Q Consensus 134 ~~l~~k~ 140 (142) +.|.+-. T Consensus 149 ~~l~~~r 155 (155) T protein:vir:10 149 AALSQGR 155 (155) T ss_pred HHHhhcC Confidence 8885433 No 132 >protein:vir:99196 Length: 155 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950453;genbank:gi:119953654;genbank:GeneID:4643056 Probab=91.68 E-value=0.0026 Score=34.73 Aligned_cols=125 Identities=11% Similarity=0.036 Sum_probs=65.0 Q ss_pred Cc-cchHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhc-CcCCC-----------------CCCC-CCCc Q lcl|NC_018285. 1 MA-MVGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVT-REKHY-----------------SNKK-DLKY 60 (142) Q Consensus 1 m~-m~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~t-p~~~~-----------------~~~~-~~k~ 60 (142) |. |.+++--.+++.+.|..|.... +....+++.-|+.+....+++- |.... +... -... T Consensus 1 Ms~~i~i~~d~~~~~~~L~~l~~~~-~d~~~l~~~ig~~l~~~~~~rF~pdG~~W~pls~~t~~~r~~~g~~~~~iL~~t 79 (155) T protein:vir:99 1 MTTRIDVELDDQEVRQRLALLMRSV-TDTLPVMRGIAAELLAETEFAFMDEGPGWPQLSPVTVAAREAKGRGPHPILQVT 79 (155) T ss_pred CceEEEEEechHHHHHHHHHHHHHh-hhHHHHHHHHHHHHHHHHHHHhhccCCCCCCCChHHHHHHhccCCCCCCcchhc Confidence 32 2222211134444444444322 3456677766666666555543 32111 0000 0124 Q ss_pred ccchhcceecCccccccccceeEeccCCCCceeEEeecccCcc-------ccCCCchhhHHHHH---HHHHHHHHHHHHH Q lcl|NC_018285. 61 GHMADGLSVQSTNADGRKNGVATVGWKNNYHAQNARRLNDGTK-------KYRADHFVTNVQND---SSVQKKVLLAEKA 130 (142) Q Consensus 61 ~HlaD~I~~~~~~~dg~~~G~~~VG~~k~~~a~~A~f~n~GT~-------k~~~~hFie~t~~e---~~~~~~vl~A~~~ 130 (142) ++|+++|.+.. .+..+.||.+. .+|..-+||+. .+|+.||+--+.+. .+..++|++...+ T Consensus 80 g~L~~Si~~~~------~~~~v~vGtn~----~YA~iHqfGg~~~~~~~v~iPaRpfLG~s~~~~l~~e~~~~I~~~i~~ 149 (155) T protein:vir:99 80 NALARSVTTWA------DRNEAGIGSNL----VYAAIHQFGGDAGRGHQVEIPARRYLPFDENGQLAAGARQSILEIVLT 149 (155) T ss_pred hhhhhhhhcee------cCCEEEEecCc----cchhhhhcccccCCCCccccCCccccCCCCccccchHHHHHHHHHHHH Confidence 68999999764 13367899542 46888999975 79999999432211 1222455555555 Q ss_pred HHHHHHhhcC Q lcl|NC_018285. 131 EYEKLIRRKG 140 (142) Q Consensus 131 ~~k~~l~~k~ 140 (142) -+++ .. T Consensus 150 ~l~~----~~ 155 (155) T protein:vir:99 150 ALSR----NR 155 (155) T ss_pred HHhc----cC Confidence 5553 33 No 133 >protein:vir:1838 Length: 149 # NCBI annotation: O protein # Family: family:all:370 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052262;genbank:gi:9634069;genbank:GeneID:1262457 Probab=90.99 E-value=0.0036 Score=33.99 Aligned_cols=119 Identities=18% Similarity=0.185 Sum_probs=65.8 Q ss_pred cchHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHh-----cCcCC---CCCC-----CCCC-cccc----- Q lcl|NC_018285. 3 MVGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEV-----TREKH---YSNK-----KDLK-YGHM----- 63 (142) Q Consensus 3 m~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~-----tp~~~---~~~~-----~~~k-~~Hl----- 63 (142) |.+|++..+.|-.-|.+|.+ ..+..+++.-|+.+...-+++ .|... +.+. +.+. ...| T Consensus 1 m~~~~~~~~~l~~ll~~L~~---~~~~~l~r~Ig~~l~~~t~~rf~~q~~PdG~~W~p~~~~~~~~~~g~~~~~~~~~l~ 77 (149) T protein:vir:18 1 MSELTALQERLAGLIASLSP---AARRKMAAEIAKKLRTSQQQRIKRQQAPDGTPYAARKRQPVRSKKGRIKREMFAKLR 77 (149) T ss_pred CchHHHHHHHHHHHHHhcCC---chHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchhhhhhccCcccchhhhhhh Confidence 88877654444444555543 335567777777776665553 34211 1111 1111 1112 Q ss_pred -hhcceecCccccccccceeEeccCCCCceeEEeecccCcc----------ccCCCchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 64 -ADGLSVQSTNADGRKNGVATVGWKNNYHAQNARRLNDGTK----------KYRADHFVTNVQNDSSVQKKVLLAEKAEY 132 (142) Q Consensus 64 -aD~I~~~~~~~dg~~~G~~~VG~~k~~~a~~A~f~n~GT~----------k~~~~hFie~t~~e~~~~~~vl~A~~~~~ 132 (142) +.+|.+.. + .-.+.|||- .+...||..-.+|.. ++|+.||+-=+.++ ..+|++...+-+ T Consensus 78 ~~~~l~~~~-~-----~~~~~v~~~-Gtn~~yAaiHQfG~~~r~~~~~~~v~iPaRp~LG~s~~d---~~~I~~~i~~~l 147 (149) T protein:vir:18 78 TSRFMKAKG-S-----DSAAVVEFT-GKVQRMARVHQYGLKDRPNRNSRDVQYEARPLLGFTRDD---EQMIEDVIISHL 147 (149) T ss_pred hhhhhheee-c-----CceeEEEec-ccchhhhhhhhccccccccCCCccccccccccCCCCHHH---HHHHHHHHHHHH Confidence 23333221 1 124667774 445688899999954 79999999644433 256777777777 Q ss_pred HH Q lcl|NC_018285. 133 EK 134 (142) Q Consensus 133 k~ 134 (142) .| T Consensus 148 ~~ 149 (149) T protein:vir:18 148 GK 149 (149) T ss_pred hC Confidence 76 No 134 >protein:vir:80425 Length: 134 # NCBI annotation: BcepGomrgp15 # Family: family:all:448 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210235;genbank:gi:146329927;genbank:GeneID:5123534 Probab=90.64 E-value=0.0022 Score=35.14 Aligned_cols=107 Identities=10% Similarity=0.107 Sum_probs=58.1 Q ss_pred chHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceec----------Ccc Q lcl|NC_018285. 4 VGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQ----------STN 73 (142) Q Consensus 4 ~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~----------~~~ 73 (142) -+|...++.|.++++.... .+++..|.-+...|...+|.. || -++-|-.++ ..+ T Consensus 1 msF~~~i~~~~~~ve~~~~-------~~~r~~a~~~~~~vv~~sPVd------TG---r~Ranw~vs~~~~~~~~~~~~d 64 (134) T protein:vir:80 1 MSYTDRFNVIAKGIEDNVD-------NLVKNVALAIGSNVIADTPIL------TG---QARRNWQTELNQMPESVLDIPE 64 (134) T ss_pred CCcccCHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHhCCCc------ch---hhhcccceeecCcccccccCcC Confidence 4677889999888876432 345555555666666678852 11 122222222 111 Q ss_pred ccc----------------cccceeE-eccCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 74 ADG----------------RKNGVAT-VGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLI 136 (142) Q Consensus 74 ~dg----------------~~~G~~~-VG~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l 136 (142) .+| -+-|.+. ++ +..-+|.+||+|++.|+|..|++-+..+- ..+++- .+.+ T Consensus 65 ~~g~~~~~~~~~~~~vi~~~k~g~~iyi~----Nn~pYA~~LEyG~S~QAP~G~v~~t~~~~---~~~v~~----~~~~- 132 (134) T protein:vir:80 65 SPSEGMDEALQVLQQTVGQYKAGDTVHIT----NNAPYIKELNSGSSQQAPANFVETSIMRA---TRLIRN----VKVV- 132 (134) T ss_pred CCCccchhhHHHHHHHHhhccCcceEEEe----eCchhhhhhhccccCCCcchHHHHHHHHH---HHHHHh----hccC- Confidence 111 1111111 22 22467899999999999999998776553 233332 2222 Q ss_pred hh Q lcl|NC_018285. 137 RR 138 (142) Q Consensus 137 ~~ 138 (142) -+ T Consensus 133 ~~ 134 (134) T protein:vir:80 133 PQ 134 (134) T ss_pred CC Confidence 11 No 135 >protein:vir:78607 Length: 155 # NCBI annotation: BcepNY3gp06 # Family: family:all:503 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294843;genbank:gi:149882906;genbank:GeneID:5291078 Probab=90.39 E-value=0.0009 Score=37.29 Aligned_cols=95 Identities=20% Similarity=0.168 Sum_probs=43.9 Q ss_pred HHHHHHHHHHHHHHHHHhcC-cCCCC--CCCCCCcccchhcceecCccccccccceeEeccCCCCceeEEeecccCcccc Q lcl|NC_018285. 29 AKITTAGAKVFQKELEEVTR-EKHYS--NKKDLKYGHMADGLSVQSTNADGRKNGVATVGWKNNYHAQNARRLNDGTKKY 105 (142) Q Consensus 29 ~ka~~AGA~v~~~~L~~~tp-~~~~~--~~~~~k~~HlaD~I~~~~~~~dg~~~G~~~VG~~k~~~a~~A~f~n~GT~k~ 105 (142) =++.+-|-+.+.+.|....= ..... ...++...-+.+...... +. .-|. ..+.+|-+.|+||.++ T Consensus 1 m~v~~k~L~~~~~~l~~~~v~VGi~~~a~y~d~~~~~~~~~~~~~~----~~-----~~g~---~va~ia~~~E~G~~~I 68 (155) T protein:vir:78 1 MSVTRRGLTLPKDRYRSMSVKAGVLAGATYPDESGKKLADGTILTK----DP-----RAGL---PVAMIAMALNYGTSKL 68 (155) T ss_pred CcchHHHHHHHHHHHhCCeeEEeecCCCCCCcccchhhhhhhhccc----cc-----ccCC---cHHHHHHhhhcCCCCC Confidence 44444554444444321100 00000 000000000111111000 00 0122 1357888999999999 Q ss_pred CCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC Q lcl|NC_018285. 106 RADHFVTNVQNDSSVQKKVLLAEKAEYEKLIRRKGGK 142 (142) Q Consensus 106 ~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~~k~g~ 142 (142) ||.||+..+.++.. .++.+...+.++. +.. T Consensus 69 P~RPFlr~t~~~~~--~~~~~~l~~~~~~-----~~~ 98 (155) T protein:vir:78 69 PARPFMEKTITDRS--AEWIKGLTVMMTM-----GYD 98 (155) T ss_pred CCcchhhHHHHHHH--HHHHHHHHHHHHc-----CCC Confidence 99999999998874 5566655544432 233 No 136 >protein:vir:5257 Length: 148 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852762;genbank:gi:31544037;uniprot:Q7Y5T8;genbank:GeneID:2753554 Probab=90.20 E-value=0.00032 Score=39.71 Aligned_cols=80 Identities=16% Similarity=0.155 Sum_probs=42.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCccccccccceeEeccCC-----------CCcee Q lcl|NC_018285. 25 PAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNGVATVGWKN-----------NYHAQ 93 (142) Q Consensus 25 ~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G~~~VG~~k-----------~~~a~ 93 (142) -...-+.-..|.+.+.+.|+.-. .-.+.|||.- -..+. T Consensus 1 M~~~~k~~~~~~~~l~~~l~~l~-------------------------------~~~v~VGi~~~~~~~~~~~~g~~vA~ 49 (148) T protein:vir:52 1 MAVTVTANFSAAKQLIEQMKSLK-------------------------------EKAVYVGFPAEFDEKVKGSENFNLAS 49 (148) T ss_pred CccccccccHHHHHHHHHHHHhh-------------------------------CCeEEEEeecCcCCCCCCCCCCCHHH Confidence 01111111112222222222111 1246677731 12468 Q ss_pred EEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC Q lcl|NC_018285. 94 NARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIRRKGGK 142 (142) Q Consensus 94 ~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~~k~g~ 142 (142) +|.+.|+||.++|+.||+..+.++.. +++.+.+.+.++ + +.. T Consensus 50 ia~~~E~G~~~IP~Rpflr~t~~~~~--~~~~~~~~~~~~----~-~~~ 91 (148) T protein:vir:52 50 LAAVLEFGNEHIPARPFLRQTLEENQ--EKYTALFIQWFD----Q-GVP 91 (148) T ss_pred HHHHHhcCCCCCCCcchhHHHHHHHH--HHHHHHHHHHHH----c-CCC Confidence 99999999999999999999998863 556555544433 2 122 No 137 >protein:vir:79115 Length: 148 # NCBI annotation: tail completion protein gpS # Family: family:all:370 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165266;genbank:gi:145708091;genbank:GeneID:5247126 Probab=89.82 E-value=0.0076 Score=32.20 Aligned_cols=119 Identities=13% Similarity=0.051 Sum_probs=60.2 Q ss_pred cchHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHh-----cCcCCCC--------CCCCC------Ccccc Q lcl|NC_018285. 3 MVGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEV-----TREKHYS--------NKKDL------KYGHM 63 (142) Q Consensus 3 m~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~-----tp~~~~~--------~~~~~------k~~Hl 63 (142) |++|++..+.|-.-|.+|. +..+..+++.-|+.+...-+++ .|..... .++.. ....+ T Consensus 1 m~~~~~l~~~L~~ll~~l~---~~~~~~l~r~Ig~~l~~st~~Rf~~q~~PDG~~W~p~s~~~~~~~g~~~~~~~~~l~~ 77 (148) T protein:vir:79 1 MSESRELEAWLAGMLTKLD---APARRMLARAVAAELRRRQAARIAEQRNPDGSPYVPRKPQLRHRAGRIRRAMFMRLRL 77 (148) T ss_pred CccHHHHHHHHHHHHHhcC---ChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCcCcccchHHHhhcccccccccchhhh Confidence 8887663333333344553 3444566665555555444432 3321110 01100 01122 Q ss_pred hhcceecCccccccccceeEeccCCCCceeEEeecccC----------ccccCCCchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 64 ADGLSVQSTNADGRKNGVATVGWKNNYHAQNARRLNDG----------TKKYRADHFVTNVQNDSSVQKKVLLAEKAEYE 133 (142) Q Consensus 64 aD~I~~~~~~~dg~~~G~~~VG~~k~~~a~~A~f~n~G----------T~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k 133 (142) +-++..+. + .-.+.|||. .+...||..-.+| ++.||+.||+-=+.++. ++|+..+.+- T Consensus 78 ~~~l~~~~---~---~~~~~v~~~-Gt~~~yAaiHQfG~~~r~~~~~~~v~iPaRp~LG~s~~d~---~~i~~~i~~~-- 145 (148) T protein:vir:79 78 ARYMKTQA---D---ANTAVVTFA-GNAQRIATVHQFGLRDRVNKAGLTAQYPARELLGMDGVDM---EHITNLLLLH-- 145 (148) T ss_pred hhheeeee---e---CCeeeEEee-ccchhhhhhhhcCccccccCCCCccccCcccccCCCHHHH---HHHHHHHHHH-- Confidence 33343322 1 225778884 4556788888888 56799999997554432 4555544443 Q ss_pred HHHhh Q lcl|NC_018285. 134 KLIRR 138 (142) Q Consensus 134 ~~l~~ 138 (142) |.+ T Consensus 146 --l~~ 148 (148) T protein:vir:79 146 --LGA 148 (148) T ss_pred --hcC Confidence 444 No 138 >protein:vir:106728 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944315;genbank:gi:38638614;genbank:GeneID:2657357 Probab=89.79 E-value=0.001 Score=36.97 Aligned_cols=95 Identities=20% Similarity=0.165 Sum_probs=43.8 Q ss_pred HHHHHHHHHHHHHHHHHhcC-cCCCC--CCCCCCcccchhcceecCccccccccceeEeccCCCCceeEEeecccCcccc Q lcl|NC_018285. 29 AKITTAGAKVFQKELEEVTR-EKHYS--NKKDLKYGHMADGLSVQSTNADGRKNGVATVGWKNNYHAQNARRLNDGTKKY 105 (142) Q Consensus 29 ~ka~~AGA~v~~~~L~~~tp-~~~~~--~~~~~k~~HlaD~I~~~~~~~dg~~~G~~~VG~~k~~~a~~A~f~n~GT~k~ 105 (142) =++.+-|-+.+.+.|....= ..... ...++...-+.+...... +. .-|. ..+.+|-+.|+||.++ T Consensus 1 m~v~~k~L~~~~~~l~~~~v~VGi~~~a~y~d~~~~~~~~~~~~~~----~~-----~~g~---~va~ia~~~E~G~~~I 68 (155) T protein:vir:10 1 MSVTRRGLTLPKDRYRSMSVKAGVLAGATYPDESGKKLADGTILTK----DP-----RAGL---PVAMIAMALNYGTSKL 68 (155) T ss_pred CcchHHHHHHHHHHHhCCeeEEeecCCCCCccccchhhhhhhhccc----cc-----ccCC---cHHHHHHHHhcCCCCC Confidence 44444554444444321100 00000 000000000111111000 00 0122 1357888999999999 Q ss_pred CCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC Q lcl|NC_018285. 106 RADHFVTNVQNDSSVQKKVLLAEKAEYEKLIRRKGGK 142 (142) Q Consensus 106 ~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~~k~g~ 142 (142) ||.||+..+.++.. +++.+...+.++. +.. T Consensus 69 P~RPFlr~t~~~~~--~~~~~~l~~~~~~-----~~~ 98 (155) T protein:vir:10 69 PARPFMEKTIADRS--AEWIKGLTVMMTM-----GYD 98 (155) T ss_pred CCcchhHHHHHHHH--HHHHHHHHHHHHc-----CCC Confidence 99999999998874 5666655544432 222 No 139 >protein:vir:1164 Length: 156 # NCBI annotation: predicted tail completion # Family: family:all:370 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490613;genbank:gi:17313233;genbank:GeneID:927308 Probab=89.41 E-value=0.0064 Score=32.60 Aligned_cols=123 Identities=17% Similarity=0.189 Sum_probs=66.9 Q ss_pred cchHHHHHHHHHHHH-HHhccccHHHHHHHHHHHHHHHHHHHHHh-----cCcCCC---CCC------------CCCCcc Q lcl|NC_018285. 3 MVGLDEALEGWLETV-ASIGDITPAEQAKITTAGAKVFQKELEEV-----TREKHY---SNK------------KDLKYG 61 (142) Q Consensus 3 m~~~~~~l~e~~~~l-~kl~~~~~~~~~ka~~AGA~v~~~~L~~~-----tp~~~~---~~~------------~~~k~~ 61 (142) |.+--+.|++.|..| .+|. +.....+++.-|+.+...-+++ .|.... .+. ...... T Consensus 1 m~~~~~~l~~~L~~ll~~L~---~~~~~~l~r~Ig~~l~~~t~~Rf~~q~~PdG~~W~p~~~~~~~~~~~~~~~~~~m~~ 77 (156) T protein:vir:11 1 MADSLEALEDWAGPILRALE---PGPRAALARSLARDLRRSQQKRVMAQRNPDGSAYEPRKKRELRGKQGRIRRKIKMFQ 77 (156) T ss_pred CchhHHHHHHHHHHHHHhcC---CcchHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHhhhccccccchhhhh Confidence 665444555544443 3443 3345567776666666555443 332111 010 001111 Q ss_pred cchhc--ceecCccccccccceeEeccCCCCceeEEeecccCcc----------ccCCCchhhHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 62 HMADG--LSVQSTNADGRKNGVATVGWKNNYHAQNARRLNDGTK----------KYRADHFVTNVQNDSSVQKKVLLAEK 129 (142) Q Consensus 62 HlaD~--I~~~~~~~dg~~~G~~~VG~~k~~~a~~A~f~n~GT~----------k~~~~hFie~t~~e~~~~~~vl~A~~ 129 (142) .|+.+ |.++. ..-.+.|||. .+.+.||+.-.+|.. .||+.||+-=+.++. ++|+.... T Consensus 78 ~l~~~~~l~~~~------~~~~a~vg~~-Gs~~~yA~iHQfG~~~~~~~~~~~v~iPaRp~LG~s~~d~---~~i~~~i~ 147 (156) T protein:vir:11 78 KLRTVRYLRAKG------DAQAITVSFA-GRIARIARVHQYGLRDRAEPGAPEVSYAQRLLLGFDSSDM---ETIQNGIL 147 (156) T ss_pred hhhhhheeeeee------cCcEEEEEec-CCchhhhhhhcccccccccCCCCcccccccccCCCCHHHH---HHHHHHHH Confidence 23333 22221 1226779984 455788999999964 699999996555442 57877777 Q ss_pred HHHHHHHhh Q lcl|NC_018285. 130 AEYEKLIRR 138 (142) Q Consensus 130 ~~~k~~l~~ 138 (142) +-+++..-= T Consensus 148 ~~l~~~~~~ 156 (156) T protein:vir:11 148 AHIDANSPI 156 (156) T ss_pred HHHhhcCCC Confidence 777765444 No 140 >protein:vir:79179 Length: 155 # NCBI annotation: gp39, phage virion morphogenesis protein # Family: family:all:370 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111070;genbank:gi:134288746;genbank:GeneID:4960698 Probab=89.32 E-value=0.0078 Score=32.14 Aligned_cols=124 Identities=17% Similarity=0.201 Sum_probs=64.2 Q ss_pred cchHHHHHHHHHHHH-HHhccccHHHHHHHHHHHHHHHHHHHHHh-----cCcCCCC---CCCC------CCcccchh-- Q lcl|NC_018285. 3 MVGLDEALEGWLETV-ASIGDITPAEQAKITTAGAKVFQKELEEV-----TREKHYS---NKKD------LKYGHMAD-- 65 (142) Q Consensus 3 m~~~~~~l~e~~~~l-~kl~~~~~~~~~ka~~AGA~v~~~~L~~~-----tp~~~~~---~~~~------~k~~HlaD-- 65 (142) |.+--+.|++++..| .+| ++..++..++.=|..+...-+++ .|..... +..+ .+.+++.. T Consensus 1 m~~~~~~l~~~l~~ll~~l---~~~~~~~l~r~Ig~~l~~~t~~Rf~~q~~PDG~~W~prk~~~~~~~~~~~~g~~~~~~ 77 (155) T protein:vir:79 1 MTDDLQALERWAGGLLAKL---SPAARRQLLRELGRDLRRAQQSRVAAQRNPDGSAYEPRKVKAGGKRLREKAGRVKREA 77 (155) T ss_pred CchHHHHHHHHHHHHHHhc---CChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchhhhhhhhhcccCcccchh Confidence 444334455555544 233 33455667777777666555443 2321110 0000 01223322 Q ss_pred ---cceecC-ccccccccceeEeccCCCCceeEEeecccCc----------cccCCCchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 66 ---GLSVQS-TNADGRKNGVATVGWKNNYHAQNARRLNDGT----------KKYRADHFVTNVQNDSSVQKKVLLAEKAE 131 (142) Q Consensus 66 ---~I~~~~-~~~dg~~~G~~~VG~~k~~~a~~A~f~n~GT----------~k~~~~hFie~t~~e~~~~~~vl~A~~~~ 131 (142) .+..+. .+..- ..-++.|||. .+...||..-.+|. +.||+.||+-=+.++ .++|+....+- T Consensus 78 m~~~l~~a~~l~~~~-~~d~a~Vg~~-Gs~~~yAaiHQfG~~~r~~~~~~~v~iPaRp~LGls~~d---~~~I~~~i~~~ 152 (155) T protein:vir:79 78 MFRKLRTARYLRIDV-DSTGLAIGFD-ERLSRIARVHQEGQKAPVEPGGPLAQYPVRVVLGFSDAD---RELVRDRLLRE 152 (155) T ss_pred hhhhhhhhheeeeee-cCcEEEEEec-CcchhhhhhhhcCCcccCCCCCcccccccccccCCCHHH---HHHHHHHHHHH Confidence 221111 01111 1225779984 45567888888883 479999999655544 25788877777 Q ss_pred HHH Q lcl|NC_018285. 132 YEK 134 (142) Q Consensus 132 ~k~ 134 (142) +.| T Consensus 153 l~r 155 (155) T protein:vir:79 153 LTR 155 (155) T ss_pred hhC Confidence 776 No 141 >protein:vir:80037 Length: 199 # NCBI annotation: gp11 # Family: family:all:503 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468715;genbank:gi:157325295;genbank:GeneID:5601728 Probab=89.05 E-value=0.0025 Score=34.84 Aligned_cols=126 Identities=7% Similarity=0.024 Sum_probs=66.2 Q ss_pred CccchHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHH---HHHhcCcCCCC---CCCCC-CcccchhcceecCcc Q lcl|NC_018285. 1 MAMVGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKE---LEEVTREKHYS---NKKDL-KYGHMADGLSVQSTN 73 (142) Q Consensus 1 m~m~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~---L~~~tp~~~~~---~~~~~-k~~HlaD~I~~~~~~ 73 (142) |-+.+-.+-++++++.|+.|.... +..|-- .+. |.-..-...|. .++.+ ..-++.+.+.....+ T Consensus 1 m~vt~~~~~~~~~~~~l~~L~~k~-------v~vGi~--~~d~~~~~~Ia~~~E~Ga~I~~~~~~l~Ip~~~a~~~k~~~ 71 (199) T protein:vir:80 1 MKVTTDKSTMNKAIRELDQLDRYS-------LQIGLF--GEDDSFIQMIAGVHEFGLTIRPKGKYLTIPTPEAGDRRARD 71 (199) T ss_pred CcccccHHHHHHHHHHHHHhcCCE-------EEEEEe--cCCCcchhheeehhhcCCeeecCCceeeecchhhhcccccc Confidence 877665567888888888875433 233321 110 00000000011 00000 011233333322222 Q ss_pred ccccc----cceeEeccCCCCceeEEeecccCc--cccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC Q lcl|NC_018285. 74 ADGRK----NGVATVGWKNNYHAQNARRLNDGT--KKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIRRKGGK 142 (142) Q Consensus 74 ~dg~~----~G~~~VG~~k~~~a~~A~f~n~GT--~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~~k~g~ 142 (142) .+|.. .....++.. +-++..++.|+ .++||.||+..+.++. ++++.+.+.+.++.+|.+. .. T Consensus 72 ~~~~~~p~g~~~~~~~~~----~~~~~~~e~g~~~~~IP~RPFlr~t~~~~--~~~~~~~~~~~~~~vl~g~-~~ 139 (199) T protein:vir:80 72 IPGLFKPKGKNILAVAGP----DGKLTVMFYLKTEVNIPERSFLRSTFDEK--SNKWGELFEGWIDDVIHGK-LS 139 (199) T ss_pred cCcccccCCcceeeeecc----ccceeeeeeccccccCCCCchhHHHHHHH--HHHHHHHHHHHHHHHHhCC-Cc Confidence 22211 112223332 22456678887 5899999999999987 4789999999999998774 33 No 142 >protein:vir:94069 Length: 168 # NCBI annotation: putative RNA polymerase # Family: family:all:503 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453622;genbank:gi:84662658;genbank:GeneID:5142579 Probab=88.33 E-value=0.0018 Score=35.69 Aligned_cols=97 Identities=15% Similarity=0.177 Sum_probs=43.2 Q ss_pred HHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceec----CccccccccceeEeccCCCCceeEEeecccCcc Q lcl|NC_018285. 28 QAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQ----STNADGRKNGVATVGWKNNYHAQNARRLNDGTK 103 (142) Q Consensus 28 ~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~----~~~~dg~~~G~~~VG~~k~~~a~~A~f~n~GT~ 103 (142) =.-+++.|-+...+.+.+-.... .+.+=|. +..+. ....++..+....-|+ ..+.+|.+.|+||. T Consensus 1 ~~~~~~~g~~~~~~~~~~l~~~~-------v~vG~l~-~a~yp~G~~~~~~~~~~~~~~~~g~---~va~Ia~~~E~G~~ 69 (168) T protein:vir:94 1 MTTIARKGVKMPPHLEAQFQSGE-------VKAGVLS-GSTYPQMTYTDQRTGKQIEDARGGM---PVAVIAQALEYGHG 69 (168) T ss_pred CccccchhhhhhHHHHHhhhccc-------eeeeccc-cCcccccccchhhcccccccccccc---cHHHHHHHHhcCCC Confidence 11122333333333332222110 0001111 11110 0111111111111232 24689999999999 Q ss_pred ccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC Q lcl|NC_018285. 104 KYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIRRKGGK 142 (142) Q Consensus 104 k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~~k~g~ 142 (142) ++||.||+..+.++.. .++. +.+.++|.+ +.. T Consensus 70 ~IP~RPFlr~t~~~~~--~~~~----~~~~~~~~~-~~~ 101 (168) T protein:vir:94 70 QNHPRPFMQQTYAAQY--RAWS----RDLTLTLKA-GAA 101 (168) T ss_pred CCCCchhhHHHHHHHH--HHHH----HHHHHHHhc-CCC Confidence 9999999999998763 3433 344555554 223 No 143 >protein:vir:97190 Length: 148 # NCBI annotation: hypothetical protein ORF030 # Family: family:all:448 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294538;genbank:gi:149408259;genbank:GeneID:5237055 Probab=86.46 E-value=0.013 Score=30.84 Aligned_cols=114 Identities=16% Similarity=0.144 Sum_probs=58.4 Q ss_pred Cccc-hHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceec--------- Q lcl|NC_018285. 1 MAMV-GLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQ--------- 70 (142) Q Consensus 1 m~m~-~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~--------- 70 (142) |+-. +|...++.|.++++.-.. ..++.-|..+...|...+|.. || -++-|-.++ T Consensus 1 m~~~~sFa~~i~~~~~~ve~~~~-------~~~r~~a~~i~~~vv~~sPVd------TG---rfRanw~vs~~~p~~~~~ 64 (148) T protein:vir:97 1 MPSLSEFSRRITLRGRKVAEGAD-------ALTRKVALAADQAVVSGTPVD------TG---RARSNWIAAIGSAPSSVI 64 (148) T ss_pred CCccchhcccHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHhCCCc------ch---hhhhhhheeecccccccc Confidence 5433 466889999999876432 223333333444455567742 11 122222222 Q ss_pred -Cccc----------------------cccccceeE-eccCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHH Q lcl|NC_018285. 71 -STNA----------------------DGRKNGVAT-VGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLL 126 (142) Q Consensus 71 -~~~~----------------------dg~~~G~~~-VG~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~ 126 (142) ..+- .+-+-|.+. ++ +..-+|.+||+|.+.|.|..|++-+..+- ..+++ T Consensus 65 ~~~dp~~~G~~~~~~~~~~i~~~~~vi~~~k~g~~iyi~----NnlpYA~~LEyG~S~QAP~G~v~~t~~~~---~~~v~ 137 (148) T protein:vir:97 65 DAYSPGEAGSTEAANTQAAIDQAESVIRGYNYGEEIHIT----NNLPYIQRLNDGYSAQAPANFVEQAVLEA---VQVVQ 137 (148) T ss_pred cccCCCCCCcccccchhHHHHHHHHHhhccCCCceEEEe----ecchhhhHhhccccCCCcchHHHHHHHHH---HHHHH Confidence 0110 011111111 22 22467899999999999999999887653 33332 Q ss_pred HHHHHHHHHHhhcCCC Q lcl|NC_018285. 127 AEKAEYEKLIRRKGGK 142 (142) Q Consensus 127 A~~~~~k~~l~~k~g~ 142 (142) - . +++++--|. T Consensus 138 ~-~----~~~~~~~~~ 148 (148) T protein:vir:97 138 F-G----RVVDGDPGS 148 (148) T ss_pred h-h----hhhcCCCCC Confidence 1 2 233344444 No 144 >protein:vir:95157 Length: 144 # NCBI annotation: hypothetical protein ORF019 # Family: family:all:448 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293426;genbank:gi:148912847;genbank:GeneID:5228232 Probab=85.20 E-value=0.0096 Score=31.64 Aligned_cols=110 Identities=19% Similarity=0.169 Sum_probs=54.3 Q ss_pred Cccc--hHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceec--C----- Q lcl|NC_018285. 1 MAMV--GLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQ--S----- 71 (142) Q Consensus 1 m~m~--~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~--~----- 71 (142) ||-. +|...++.|.++++.-.. .+++.-|..+-..+...+|..-- -++-|-.++ . T Consensus 1 MA~~~~~f~~~i~~~~~~ve~~~~-------~~~r~~a~~v~~~vv~~sPVDTG---------rfRanw~vs~~~p~~~~ 64 (144) T protein:vir:95 1 MAKSLLDLADRLEKKAKAIDEAAS-------QNAVDTALAIVGDLAYKTPVDTS---------QALSNWIVTLESPSGQQ 64 (144) T ss_pred CchhhhhhhhhHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHhCCccch---------hhccccceecccccccc Confidence 6643 577888899888876432 22333333334445556774211 111111111 0 Q ss_pred ---c----------------------cccccccceeEeccCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHH Q lcl|NC_018285. 72 ---T----------------------NADGRKNGVATVGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLL 126 (142) Q Consensus 72 ---~----------------------~~dg~~~G~~~VG~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~ 126 (142) . .+.+-+-|. +|=+. +..-+|.+||+|++.|+|..|+.-+..+- ..+. T Consensus 65 ~~~~~~~~~~~t~d~sg~~tl~~~~~vi~~~~~g~-~iyi~--NnlpYA~~LEyG~S~QAP~G~vr~~~q~~---~~~v- 137 (144) T protein:vir:95 65 IKPHFPGSQGSTQRASAAETLNSAKLVLRNKKPGQ-AIFIT--NNLPYIRRLNDGYSAQAPAGFVERAVLIG---RKMR- 137 (144) T ss_pred ccccccccccccCCCchhHHHHHHHHHHhhcCccc-eEEEe--eCchhhhhhhccccCCCcchHHHHHHHHH---HHHH- Confidence 0 011111111 11112 22467899999999999999998776543 1221 Q ss_pred HHHHHHHHHHh Q lcl|NC_018285. 127 AEKAEYEKLIR 137 (142) Q Consensus 127 A~~~~~k~~l~ 137 (142) +..| +.+ T Consensus 138 ---~~~~-~~~ 144 (144) T protein:vir:95 138 ---KKFK-IKD 144 (144) T ss_pred ---Hhhc-cCC Confidence 1111 222 No 145 >protein:vir:94944 Length: 121 # NCBI annotation: hypothetical protein phage protein # Family: family:all:448 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239282;genbank:gi:66392064;genbank:GeneID:5076589 Probab=83.47 E-value=0.022 Score=29.67 Aligned_cols=101 Identities=11% Similarity=0.005 Sum_probs=52.9 Q ss_pred CccchHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceec---------- Q lcl|NC_018285. 1 MAMVGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQ---------- 70 (142) Q Consensus 1 m~m~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~---------- 70 (142) |+-=+|...|..|.+++++.... .++..+--+-..|...+|..- | -++-|-.++ T Consensus 1 ~~~~sf~~~i~~~~~~ve~~~~~-------~~r~~~~~~~~~vv~~sPVdt------G---rfRanw~vs~~~p~~~~~~ 64 (121) T protein:vir:94 1 MISMKFNVNLSRLRSNLREEAKK-------KAIRIAQEIVNGVIARSPVLA------G---DYRSSWNVSEGSMEFKFNN 64 (121) T ss_pred CccchhhccHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHhcCCch------h---hhhccccccccCcccccCC Confidence 44345668999999999775322 233333333345556777421 0 111111111 Q ss_pred Cccccccc----------cceeEeccCCCCceeEEeecccCccccCCCchhhHHHHHHH Q lcl|NC_018285. 71 STNADGRK----------NGVATVGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSS 119 (142) Q Consensus 71 ~~~~dg~~----------~G~~~VG~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~ 119 (142) .++..|.. ....+|=+.| ..-+|.+||+|+++|+|..|++-+..+-. T Consensus 65 ~~dp~g~~t~~~~~~~~~~~~~~iyi~N--nlpYA~~LE~G~S~QAP~G~v~~t~~~~q 121 (121) T protein:vir:94 65 GGNPANPTPAPAIVVSSNVALPHFYITN--GAPYAQQLEKGSSTQAPLGIVRVTLASLR 121 (121) T ss_pred CCCCCcchhHHHHHHHHhhccceEEEee--CcchhhhhhcccCCCCcchHHHHHHHhhC Confidence 01111100 0001111222 24678999999999999999998876642 No 146 >protein:vir:96774 Length: 152 # NCBI annotation: hypothetical phage protein # Family: family:all:448 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224253;genbank:gi:62362388;genbank:GeneID:3345713 Probab=71.87 E-value=0.028 Score=29.09 Aligned_cols=116 Identities=13% Similarity=0.021 Sum_probs=56.6 Q ss_pred Ccc-------chHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCC--------CCCCCCcccchh Q lcl|NC_018285. 1 MAM-------VGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYS--------NKKDLKYGHMAD 65 (142) Q Consensus 1 m~m-------~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~--------~~~~~k~~HlaD 65 (142) |-- -.|...+..|.++++.- ..++++..+.-+...|-..+|..... .=+| +-++- T Consensus 1 ~~~~~~~~~~msFaa~i~~~~~~~e~~-------~~~~~R~~~~~i~~~vv~~sPVg~~~~~~~~a~~~ydt---GrfRa 70 (152) T protein:vir:96 1 MLSCICGGNPMSWSKSLKNIIVKNENL-------TEKQLRAGLFDAANTVILGSPVGAPELWQQPAPNYYRA---GSYRS 70 (152) T ss_pred CcceeeCCCcccccccHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHhhccccccccccccccccch---hhhhh Confidence 110 13556677787777653 33456666666666666667741100 0011 12222 Q ss_pred cceecC--------ccccc-------------cccceeEeccCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHH Q lcl|NC_018285. 66 GLSVQS--------TNADG-------------RKNGVATVGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKV 124 (142) Q Consensus 66 ~I~~~~--------~~~dg-------------~~~G~~~VG~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~v 124 (142) |-.+|- .+.|+ -+-|. +|=+. +..-+|..||+|++.|+|..|+.-+..+- .++ T Consensus 71 nw~vS~~~p~~~~~~~~~~~~t~~~~~~~i~~~~~g~-~iyi~--NnlPYA~~LEyG~S~QAP~G~vr~t~~~~---~~~ 144 (152) T protein:vir:96 71 NHRVSISKITSFEKGISSQSSIMMDLQSDIAKFKIGE-TLFMT--NPLPYATSIEYGHSSQAPNGVYRPAVRRL---VKF 144 (152) T ss_pred hheeeecCCCcccccCCCCCchHHHHHHHHhhccccc-eEEEe--eCchhhhHhhccccCCCCchHHHHHHHHH---HHH Confidence 222221 11111 01111 11111 22467899999999999999999887653 233 Q ss_pred HHHHHHHHHHHHhhc Q lcl|NC_018285. 125 LLAEKAEYEKLIRRK 139 (142) Q Consensus 125 l~A~~~~~k~~l~~k 139 (142) ++ + .++.+ T Consensus 145 v~---e----a~~~~ 152 (152) T protein:vir:96 145 LN---T----ELKAK 152 (152) T ss_pred HH---H----HhccC Confidence 32 2 22223 No 147 >protein:vir:95260 Length: 160 # NCBI annotation: Phage conserved protein # Family: family:all:31735 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944893;genbank:gi:38707833;genbank:GeneID:2744046 Probab=71.22 E-value=0.033 Score=28.72 Aligned_cols=85 Identities=7% Similarity=-0.019 Sum_probs=39.9 Q ss_pred hcCcCCCCCCCCCCcccchhcceecCccccccccceeEeccCCCC--------ceeEEeecccCccccCCCchhhHHHHH Q lcl|NC_018285. 46 VTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNGVATVGWKNNY--------HAQNARRLNDGTKKYRADHFVTNVQND 117 (142) Q Consensus 46 ~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G~~~VG~~k~~--------~a~~A~f~n~GT~k~~~~hFie~t~~e 117 (142) ..++ ..++ .-.+|..-+.- +++ ..+.|||.-+. ...+|-+.++||.++|+.||+.++.+. T Consensus 1 ~~~~--~~~~---G~~~L~~~~k~----l~~---~~V~VGi~~d~g~~~dG~sv~~vA~~~EfG~~~iPaRPf~R~tfe~ 68 (160) T protein:vir:95 1 MVKR--VIHP---ARAKLVGAMKN----LQT---ANAQVGYFQEQGQHSSGFSYPALMYLQEVIGVPSASGKVYRRLFEI 68 (160) T ss_pred Ccee--echH---hHHHHHHHHHH----HhC---CeeEEeeccccccCCCCccHHHHHhhhhcCcccCCCcchhHHHHHH Confidence 1111 0110 01134443332 222 24668875311 225888999999999999999998863 Q ss_pred HHH--HHHHHHHHHHHHHHHHh-hc------CCC Q lcl|NC_018285. 118 SSV--QKKVLLAEKAEYEKLIR-RK------GGK 142 (142) Q Consensus 118 ~~~--~~~vl~A~~~~~k~~l~-~k------~g~ 142 (142) ... +...+++....+.+.+. +- -|. T Consensus 69 ~~~~~~~~~~~~~~~~i~~~~~~g~~~~~~~LG~ 102 (160) T protein:vir:95 69 TMMLNKQTLLEQTKKNLYKQLSSLNTDPSNTLEA 102 (160) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcchhHHHHHHH Confidence 211 12333333333322222 10 111 No 148 >protein:vir:8106 Length: 150 # NCBI annotation: gp10 # Family: family:all:3937 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817687;genbank:gi:29566118;genbank:GeneID:1259312 Probab=68.84 E-value=0.017 Score=30.28 Aligned_cols=119 Identities=17% Similarity=0.112 Sum_probs=55.4 Q ss_pred cch-HHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCccccccccce Q lcl|NC_018285. 3 MVG-LDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNGV 81 (142) Q Consensus 3 m~~-~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G~ 81 (142) |.+ |+. +.--+..|.|+.+++++..+-+..-=-++.+.-.+++.|. ++ +.-+|++.+..+.. +|- T Consensus 1 mgNP~~K-FGvS~~e~~K~irns~EV~~GiNdFMe~~A~~~aK~~SPV------~~---GeY~~S~~V~~ka~----NGR 66 (150) T protein:vir:81 1 MGNPFEK-FGVSDSELAKHIRNSAEVDAGINDFMENEAIPYAKSISPV------DD---GEYAASWAVMKKAK----NGR 66 (150) T ss_pred CCCchhh-hcCCHHHHHHhhccchhhhhhHHHHHHhhhhhhhhccCCc------cc---chhHHHHHHHhhcc----cCc Confidence 555 332 2223455566667776654333332222222233444443 22 35799999877543 344 Q ss_pred eEeccCCCCceeEEeecccCccc---c------------------CCCchhhHHHHHHHH-HHHHHHHHHHHHHHHHhhc Q lcl|NC_018285. 82 ATVGWKNNYHAQNARRLNDGTKK---Y------------------RADHFVTNVQNDSSV-QKKVLLAEKAEYEKLIRRK 139 (142) Q Consensus 82 ~~VG~~k~~~a~~A~f~n~GT~k---~------------------~~~hFie~t~~e~~~-~~~vl~A~~~~~k~~l~~k 139 (142) -.||. ++|+|||+++||-- | .--.|-.-. -+..+ .+-|.+..+.-|---| | T Consensus 67 G~~G~----~~~~AH~VEFGtgadkkqgrgkkgkrgkdgkrtveiddgefrrvg-pdtptkaqgiaqkvashfggsl--k 139 (150) T protein:vir:81 67 GVFGP----KAWYAHFVEFGTGADKKQGRGKKGKRGKDGKRTVEIDDGEFRRVG-PDTPTKAQGIAQKVASHFGGSL--K 139 (150) T ss_pred cccCc----cchhhhhhhhccccccccccccccccCcccceeeeecCccceecC-CCCchhhhhHHHHHHHhccccc--c Confidence 55884 26999999999732 1 111232100 00111 1234444444332222 2 Q ss_pred CCC Q lcl|NC_018285. 140 GGK 142 (142) Q Consensus 140 ~g~ 142 (142) ||- T Consensus 140 ggi 142 (150) T protein:vir:81 140 GGI 142 (150) T ss_pred ccc Confidence 222 No 149 >protein:vir:3750 Length: 227 # NCBI annotation: hypothetical protein # Family: family:all:743 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043491;genbank:gi:9628626;genbank:GeneID:1261131 Probab=65.19 E-value=0.11 Score=25.75 Aligned_cols=135 Identities=14% Similarity=0.193 Sum_probs=74.6 Q ss_pred Cccch-HH-HHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcC-----c-CCCCCCCCCC---cccchhccee Q lcl|NC_018285. 1 MAMVG-LD-EALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTR-----E-KHYSNKKDLK---YGHMADGLSV 69 (142) Q Consensus 1 m~m~~-~~-~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp-----~-~~~~~~~~~k---~~HlaD~I~~ 69 (142) |-|.- ++ .++..|..+|. |..+++..+..++..-|.-+...-+++.. . +.|.-++-++ ..=|+-...+ T Consensus 1 M~i~~~~n~~~~~~l~~~L~-ll~L~p~~Rr~ll~~iak~lr~~~k~rIr~Q~~PDGs~~~pRKr~k~KM~~kL~k~l~~ 79 (227) T protein:vir:37 1 MNIRMGIDKEDLKKFLKDLE-IISLPDKKKREILIRSLQMIKRQAVKSAANQRNPMGGSWKKRKNGTAKMLRRIAKLANS 79 (227) T ss_pred CcccccCCHHHHHHHHHHHH-HhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCchhcchhHHHHhhhHHHcce Confidence 44332 33 78999999998 55789999999999888878777666543 2 2222222221 1234433333 Q ss_pred cCccccccccceeEeccCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHH------hhcCCC Q lcl|NC_018285. 70 QSTNADGRKNGVATVGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLI------RRKGGK 142 (142) Q Consensus 70 ~~~~~dg~~~G~~~VG~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l------~~k~g~ 142 (142) .. ..+.++|||.+...++||+.--+|-.-...-..++.-...-..+...-..++..++++= ++|+|+ T Consensus 80 ~~------~~~~a~v~f~~g~~~~IA~vHq~G~~~~v~~~~~~~~~~~~~~~~paTr~QAk~Lr~lGy~v~~~k~k~~k 152 (227) T protein:vir:37 80 KA------EKAQGTLFYKQKRTGEIAQEHQEGIPHLFKKTEFTGKNKGGIGADPCTLRQAKKLKDLGYTVANGKTKNGK 152 (227) T ss_pred ee------cccceEEEecCcchHHHHHHhhcCcccccchhhhhhhhcCCccccCCCHHHHHHHHHhcccccCCCCCCcC Confidence 21 12356799986667899988888854443322222111000111224455666666653 223444 No 150 >protein:vir:7449 Length: 123 # NCBI annotation: gp26 # Family: family:all:2713 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818564;genbank:gi:29567001;genbank:GeneID:1260238 Probab=53.36 E-value=0.53 Score=22.09 Aligned_cols=123 Identities=11% Similarity=0.043 Sum_probs=62.2 Q ss_pred CccchHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCccccccccc Q lcl|NC_018285. 1 MAMVGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNG 80 (142) Q Consensus 1 m~m~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G 80 (142) ||-..|+-=.++++++|+..-+ ....++.+=+...+..|+...+..-+....| +|.+-.|.-. ....|...- T Consensus 1 ~~~~~f~~d~~~l~~~i~~~~~----k~~~~~~~~~d~~a~~le~~aK~nApW~DRT---g~ARqgl~~~-~~~~g~~~~ 72 (123) T protein:vir:74 1 MAKVTFEYDAQELRTNIRNLDR----RMESAVDALMDYEAAYATGQLKMRAPWTDRT---GAARSGLLAV-ANKLGPGSH 72 (123) T ss_pred CceeEEEecHHHHHHHHHhhHH----HHHHHHHHHHHHHHHHHHHHHhcCCCCcccc---hhhhhhhccc-cccCCCceE Confidence 8877787557888888887633 3334444444444444444444333344444 4788777532 222221112 Q ss_pred eeEeccCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_018285. 81 VATVGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIRRK 139 (142) Q Consensus 81 ~~~VG~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~~k 139 (142) ...+-.+.+ +.-|||.++...+ |-+.+|.+... .+|++-+...+-++-+.. T Consensus 73 ~Iylsh~ve----YG~~LEla~~~ky--aIi~Ptv~~~~--~~im~g~~~ll~~l~~~~ 123 (123) T protein:vir:74 73 ELIMSYSVH----YGIWLEIANSGQY--AVIGPFLPVMG--RKLMHDLEHLIDRLERAQ 123 (123) T ss_pred EEEEecCee----ecceeeecCCCCc--eeecchHHHHh--HHHHHHHHHHHHHhhccC Confidence 233443322 2356774443222 33444444432 467777776666664444 No 151 >protein:vir:105773 Length: 131 # NCBI annotation: gp14 # Family: family:all:10996 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224152;genbank:gi:62362227;genbank:GeneID:3342526 Probab=44.69 E-value=0.8 Score=21.12 Aligned_cols=110 Identities=15% Similarity=0.164 Sum_probs=53.2 Q ss_pred CccchHHHHHHHHHHHHHHhcc-----ccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCcccc Q lcl|NC_018285. 1 MAMVGLDEALEGWLETVASIGD-----ITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNAD 75 (142) Q Consensus 1 m~m~~~~~~l~e~~~~l~kl~~-----~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~d 75 (142) |-- .|+.+...++..+.. +.++.=..|+-.|| ..-.-.||-.-. |.= .--=..|.+.++.+ T Consensus 1 ikV----~Gi~~~~~nl~~~i~~I~~~K~~Ral~~al~~~~----~~AA~~TPIDTS----TLi-NSQfrei~~ngtri- 66 (131) T protein:vir:10 1 MPV----KGIKRIQMNTRRVLSDIAGIRTEKVLYLVMNAGA----NHAAVITPVKSS----TLI-NSQYKKLEPIPSGM- 66 (131) T ss_pred CCc----chHHHHHHHHHHHHHhhccchHHHHHHHHHHHHH----hhhhhccccchh----hhc-cccceeeeccCcee- Confidence 333 345554555554432 33333333343333 333456774211 100 00001233333322 Q ss_pred ccccceeEeccCCCCceeEEeeccc--CccccCCC--------------chhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 76 GRKNGVATVGWKNNYHAQNARRLND--GTKKYRAD--------------HFVTNVQNDSSVQKKVLLAEKAEYEK 134 (142) Q Consensus 76 g~~~G~~~VG~~k~~~a~~A~f~n~--GT~k~~~~--------------hFie~t~~e~~~~~~vl~A~~~~~k~ 134 (142) +-.|||+- -+|-++.| |+.+..|. .|+.+..++++ .+.|-..+.++|+- T Consensus 67 -----tGRVGYSA----nYA~yVHda~Gklkgqprp~gkgn~w~p~ae~eFL~kgfe~~~-~d~i~avik~e~k~ 131 (131) T protein:vir:10 67 -----IGRVGYTA----NYAAAVNAAKGKLKGKPRPDGSGNYWDPNGEPDFLRKGFERDG-LNEIKAIIRQGYKV 131 (131) T ss_pred -----EEeeccce----eeeeeeecCccccCCCcCCCCCcceecCCCChhhhhhhhhccc-hHHHHHHHhhhcCC Confidence 34489874 44555655 55444333 48988887764 36677778888876 No 152 >protein:vir:101508 Length: 120 # NCBI annotation: gp21 # Family: family:all:2713 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655400;genbank:gi:109522588;genbank:GeneID:4157580 Probab=42.07 E-value=0.9 Score=20.83 Aligned_cols=118 Identities=13% Similarity=0.036 Sum_probs=62.4 Q ss_pred CccchHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCccccccccc Q lcl|NC_018285. 1 MAMVGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKNG 80 (142) Q Consensus 1 m~m~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~G 80 (142) ||-..|+--.++++++|+..- .....++.+=+...+..|+...+..-+....| +|.+-.|.-+. +..|...- T Consensus 1 ~~~~~f~~~~~~l~~~i~~~~----~k~~~~~~~~~d~~a~~le~~aK~nApW~DRT---g~ARq~i~~~~-~~~~~~~~ 72 (120) T protein:vir:10 1 MAKIEFKFKDIELRRGVEDME----AKVDRAMKATSNYHAVEGTAHMKEHAPWTDRT---GAARAGLHAVA-STPQPDRY 72 (120) T ss_pred CceEEEEecHHHHHHHHhhhH----HHHHHHHHHHHHHHHHHHHHHHhcCCCCcccc---hhhhhhhcccc-ccCCCceE Confidence 888888866788888887653 23344444444555555555444433454444 47777776422 22221111 Q ss_pred eeEeccCCCCceeEEeecc--cCccccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhhcC Q lcl|NC_018285. 81 VATVGWKNNYHAQNARRLN--DGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIRRKG 140 (142) Q Consensus 81 ~~~VG~~k~~~a~~A~f~n--~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~~k~ 140 (142) ...+-.+.++ .-||| .|.+...=.|+|.+-- .+|++-+...+-++ . T Consensus 73 ~Iylsh~veY----G~~LEla~~~kyaIl~PTi~~~~------~~il~g~~~ll~~l----~ 120 (120) T protein:vir:10 73 EIVFAHTVHY----GIWLEIANSGRYEIIMPTVHHEG------KLMAQRLRGLLGRL----R 120 (120) T ss_pred EEEEecCeee----cceEEeeCCCCcccccchHHHHh------HHHHHHHHHHhhhc----C Confidence 2334433332 24566 6777766667775432 34655555444443 3 No 153 >protein:vir:3787 Length: 231 # NCBI annotation: orf22 # Family: family:all:743 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536827;genbank:gi:17981836;genbank:GeneID:929215 Probab=24.90 E-value=1.2 Score=20.22 Aligned_cols=135 Identities=14% Similarity=0.218 Sum_probs=64.9 Q ss_pred CccchHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhc-----Cc-CCCCCCC--CCCcc---c---chhc Q lcl|NC_018285. 1 MAMVGLDEALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVT-----RE-KHYSNKK--DLKYG---H---MADG 66 (142) Q Consensus 1 m~m~~~~~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~t-----p~-~~~~~~~--~~k~~---H---laD~ 66 (142) +-|.-..++|..|.++|.-| .+++..+..++..-|.-+...-+++. |. +.|.-++ .++.. . |.-. T Consensus 3 ~~~~~n~~dl~~l~~~L~ll-~L~p~kRrrLl~~iak~lr~~~~~rI~~Q~~PDGs~w~pRK~~~~k~k~~rm~~kL~~~ 81 (231) T protein:vir:37 3 IRLGLKQEDLDAFVRDLRTL-NLTGKQKKKILTWTLGAIKRKSQKNIREQHSPDGTAWEKRKPVDGEIKNKRLLKKVLRY 81 (231) T ss_pred ccCCcCHHHHHHHHHHHHHh-cCCHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCcCchhcccccchhhHHHHHHhHHh Confidence 33333446777777777744 78899999999987777776666543 32 2222222 11111 1 2222 Q ss_pred ceecCccccccccceeEeccCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHH------hhcC Q lcl|NC_018285. 67 LSVQSTNADGRKNGVATVGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLI------RRKG 140 (142) Q Consensus 67 I~~~~~~~dg~~~G~~~VG~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l------~~k~ 140 (142) ..+.. .+ .+.+.|+|-+...+.||+.--+|-.....-..... .....+....-..++.+++++= ++|+ T Consensus 82 ~~~~~---~~--~~~~~~~~~~g~~~~IA~vHQ~G~~~rv~~~~~~~-~~~~~~~~pATr~QAk~Lr~lGy~v~~~k~k~ 155 (231) T protein:vir:37 82 ASILA---EE--RGKGRIYYKNPLTGEIAQKQQDGFTEHFRVFATDK-NKNGSGNDRATIRQAQKLRSLGYRKRNGKNRQ 155 (231) T ss_pred hcccc---cc--CCceEEeeecchHHHHHHHhhcCcccccchhhhhh-ccCCCCCCCCCHHHHHHHHHhcccccCCCCCC Confidence 22111 11 23345555445567888888888432221111110 0000111234445566665442 2233 Q ss_pred CC Q lcl|NC_018285. 141 GK 142 (142) Q Consensus 141 g~ 142 (142) |+ T Consensus 156 ~k 157 (231) T protein:vir:37 156 GK 157 (231) T ss_pred CC Confidence 33 No 154 >protein:vir:7993 Length: 108 # NCBI annotation: gp9 # Family: family:all:3937 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817347;genbank:gi:29565775;genbank:GeneID:1259013 Probab=21.79 E-value=0.21 Score=24.30 Aligned_cols=103 Identities=17% Similarity=0.191 Sum_probs=44.3 Q ss_pred CccchHH-HHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHhcCcCCCCCCCCCCcccchhcceecCcccccccc Q lcl|NC_018285. 1 MAMVGLD-EALEGWLETVASIGDITPAEQAKITTAGAKVFQKELEEVTREKHYSNKKDLKYGHMADGLSVQSTNADGRKN 79 (142) Q Consensus 1 m~m~~~~-~~l~e~~~~l~kl~~~~~~~~~ka~~AGA~v~~~~L~~~tp~~~~~~~~~~k~~HlaD~I~~~~~~~dg~~~ 79 (142) |+-..-. .-|.-+--.+..+ .+..-+.+|..-|.++.-...+ ..|--.+ +--+|++.+..+..+. T Consensus 1 ma~gpt~kNP~~KFGvs~~d~------~K~~EVn~GvNeFMdE~~~~~K--~~SPV~~---G~Y~~S~~V~ers~Nk--- 66 (108) T protein:vir:79 1 MANGPTRKNPLAKFGVRLDDF------DKLPEVNQGVNEFMDEVVDAWK--NNSPVGT---GAYRDSVQVTERSTNK--- 66 (108) T ss_pred CCCCcccccchhhhcCChhhh------hhchhhhhhHHHHHHHHHHHHh--hcCCCCc---hhhHHHHHHHHhhhcc--- Confidence 3322211 1111111111111 1112245566555553322211 1122223 4689999997754331 Q ss_pred ceeEeccCCCCceeEEeecccCccccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC Q lcl|NC_018285. 80 GVATVGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSSVQKKVLLAEKAEYEKLIRRKGGK 142 (142) Q Consensus 80 G~~~VG~~k~~~a~~A~f~n~GT~k~~~~hFie~t~~e~~~~~~vl~A~~~~~k~~l~~k~g~ 142 (142) |--.||. ++|.|||+++||.+..--.=.++|..+ -||- T Consensus 67 GRG~~G~----~~~~AH~VEFGs~hndeyapaqktakq---------------------fggt 104 (108) T protein:vir:79 67 GRGKVGA----TDPQAHLVEFGSAHNDEYAPAQKTAKQ---------------------FGGT 104 (108) T ss_pred CccccCC----cchhhhhhhhhccccccccchhhHHHh---------------------hccc Confidence 3344784 269999999999885322111222111 1222 No 155 >protein:vir:3427 Length: 192 # NCBI annotation: tail component # Family: family:all:869 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040590;genbank:gi:9626254;genbank:GeneID:2703485 Probab=20.80 E-value=2.7 Score=18.21 Aligned_cols=132 Identities=17% Similarity=0.173 Sum_probs=56.7 Q ss_pred CccchHHHHHHHHHHHHHHhccc-cH-HHHHHHHHHHHHHHHHHHHHhcCcCCC----------CCCCCCCcccchhcce Q lcl|NC_018285. 1 MAMVGLDEALEGWLETVASIGDI-TP-AEQAKITTAGAKVFQKELEEVTREKHY----------SNKKDLKYGHMADGLS 68 (142) Q Consensus 1 m~m~~~~~~l~e~~~~l~kl~~~-~~-~~~~ka~~AGA~v~~~~L~~~tp~~~~----------~~~~~~k~~HlaD~I~ 68 (142) |.| ++|++++.+|..|.+. .+ .....+.++|..+.....++.+..-.. ..+.+. .++.=.|. T Consensus 1 ~~i----k~l~~~~~~L~~i~~~~vp~A~~rAiNrta~~a~t~~~r~v~~e~~I~~k~Ir~r~r~~kAs~--~~l~a~I~ 74 (192) T protein:vir:34 1 MAI----KGLEQAVENLSRISKTAVPGAAAMAINRVASSAISQSASQVARETKVRRKLVKERARLKRATV--KNPQARIK 74 (192) T ss_pred Ccc----hhHHHHHHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCHHHHHhhheeccccC--CCceEEEE Confidence 555 4666667777777653 22 334556667776665555544332100 001111 12222222 Q ss_pred ecCccc----------c-------------c--cccceeEeccCCCCceeEEeecccCc-cccC---------------- Q lcl|NC_018285. 69 VQSTNA----------D-------------G--RKNGVATVGWKNNYHAQNARRLNDGT-KKYR---------------- 106 (142) Q Consensus 69 ~~~~~~----------d-------------g--~~~G~~~VG~~k~~~a~~A~f~n~GT-~k~~---------------- 106 (142) +...++ . + ..+....||--.-..||+++-.|.++ ++++ T Consensus 75 ~~~~~l~~~~l~~~~~~~~rr~~~~~~~~~~~~~~g~~~k~Gk~~f~gaFia~m~ng~~~Vf~R~~gk~R~PIe~vkIpi 154 (192) T protein:vir:34 75 VNRGDLPVIKLGNARVVLSRRRRRKKGQRSSLKGGGSVLVVGNRRIPGAFIQQLKNGRWHVMQRVAGKNRYPIDVVKIPM 154 (192) T ss_pred EeccceeeeeecccccccccccccccccccccccccceeeecceecCCcccccCCCCCceeEEEccCCCccceeEEEech Confidence 211110 0 0 00112234422112357766555431 1111 Q ss_pred ---CCchhhHHHHH---HHHHHHHHHHHHHHHHHHHhh Q lcl|NC_018285. 107 ---ADHFVTNVQND---SSVQKKVLLAEKAEYEKLIRR 138 (142) Q Consensus 107 ---~~hFie~t~~e---~~~~~~vl~A~~~~~k~~l~~ 138 (142) -..-++...+. .....|+..+...+++-+|++ T Consensus 155 s~~l~~af~~~~~~~~~~~~~~El~~~L~~~lr~~~k~ 192 (192) T protein:vir:34 155 AVPLTTAFKQNIERIRRERLPKELGYALQHQLRMVIKR 192 (192) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccC Confidence 11112222211 122357888888888888888 Done!