Query lcl|NC_011356.1_cdsid_YP_002274193.1 [gene=YYZ_gp57] [protein=hypothetical protein] [protein_id=YP_002274193.1] [location=35509..35955] Match_columns 148 No_of_seqs 138 out of 331 Neff 8.9 Searched_HMMs 1612 Date Thu Nov 7 13:29:08 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_57 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_57_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:93617 Length: 148 100.0 2.7E-44 1.7E-47 259.4 16.2 148 1-148 1-148 (148) 2 protein:vir:194 Length: 149 # 100.0 4.7E-42 2.9E-45 247.1 15.4 148 1-148 1-149 (149) 3 protein:vir:1891 Length: 179 # 100.0 1.8E-39 1.1E-42 233.0 14.9 148 1-148 1-171 (179) 4 protein:vir:4347 Length: 164 # 100.0 3.5E-39 2.2E-42 231.4 14.8 147 1-148 1-156 (164) 5 protein:vir:100075 Length: 140 100.0 1E-36 6.4E-40 217.9 15.6 138 1-148 1-138 (140) 6 protein:vir:100243 Length: 140 100.0 1.2E-36 7.2E-40 217.6 15.1 138 1-148 1-138 (140) 7 protein:vir:80362 Length: 140 100.0 2.3E-36 1.4E-39 216.0 15.3 138 1-148 1-138 (140) 8 protein:vir:1437 Length: 140 # 100.0 3.8E-36 2.3E-39 214.8 15.4 138 1-148 1-138 (140) 9 protein:vir:5745 Length: 135 # 100.0 4.7E-36 2.9E-39 214.3 15.1 131 2-147 1-135 (135) 10 protein:vir:107568 Length: 146 100.0 2.4E-36 1.5E-39 215.8 13.5 143 1-146 1-146 (146) 11 protein:vir:102085 Length: 146 100.0 2.4E-36 1.5E-39 215.8 13.5 143 1-146 1-146 (146) 12 protein:vir:105007 Length: 146 100.0 2.4E-36 1.5E-39 215.8 13.5 143 1-146 1-146 (146) 13 protein:vir:102875 Length: 146 100.0 2.4E-36 1.5E-39 215.8 13.5 143 1-146 1-146 (146) 14 protein:vir:1386 Length: 149 # 100.0 1.7E-34 1.1E-37 205.7 14.4 145 1-148 1-149 (149) 15 protein:vir:105089 Length: 133 100.0 1.3E-34 7.8E-38 206.4 13.3 129 1-145 1-133 (133) 16 protein:vir:94538 Length: 125 100.0 1E-32 6.3E-36 196.0 12.5 124 1-145 1-125 (125) 17 protein:vir:3873 Length: 128 # 100.0 1.6E-32 1E-35 194.8 12.7 122 4-143 1-128 (128) 18 protein:vir:1273 Length: 127 # 100.0 3.5E-32 2.1E-35 193.1 12.7 123 1-143 1-127 (127) 19 protein:vir:101594 Length: 173 99.9 1.9E-31 1.2E-34 189.0 11.7 118 6-145 1-173 (173) 20 protein:vir:97088 Length: 157 99.9 1E-30 6.5E-34 184.9 14.4 145 2-148 1-155 (157) 21 protein:vir:95789 Length: 114 99.9 2.3E-30 1.4E-33 183.0 11.0 114 4-143 1-114 (114) 22 protein:vir:3617 Length: 112 # 99.9 3.2E-30 2E-33 182.3 10.8 112 2-139 1-112 (112) 23 protein:vir:9708 Length: 125 # 99.9 2.1E-29 1.3E-32 177.8 12.0 120 7-144 1-125 (125) 24 protein:vir:106623 Length: 115 99.9 1.9E-29 1.2E-32 178.1 10.6 109 6-139 1-115 (115) 25 protein:vir:103917 Length: 115 99.9 1.8E-29 1.1E-32 178.2 10.3 109 6-139 1-115 (115) 26 protein:vir:9312 Length: 115 # 99.9 1.8E-29 1.1E-32 178.2 10.3 109 6-139 1-115 (115) 27 protein:vir:78858 Length: 115 99.9 1.8E-29 1.1E-32 178.2 10.3 109 6-139 1-115 (115) 28 protein:vir:97144 Length: 115 99.9 1.8E-29 1.1E-32 178.2 10.3 109 6-139 1-115 (115) 29 protein:vir:96225 Length: 115 99.9 1.8E-29 1.1E-32 178.2 10.3 109 6-139 1-115 (115) 30 protein:vir:96358 Length: 115 99.9 1.8E-29 1.1E-32 178.2 10.3 109 6-139 1-115 (115) 31 protein:vir:98342 Length: 125 99.9 1E-28 6.5E-32 174.0 13.2 123 2-143 1-125 (125) 32 protein:vir:4704 Length: 125 # 99.9 1E-28 6.5E-32 174.0 13.2 123 2-143 1-125 (125) 33 protein:vir:9414 Length: 125 # 99.9 1E-28 6.5E-32 174.0 13.2 123 2-143 1-125 (125) 34 protein:vir:79988 Length: 125 99.9 1E-28 6.5E-32 174.0 13.2 123 2-143 1-125 (125) 35 protein:vir:81106 Length: 125 99.9 1E-28 6.5E-32 174.0 13.2 123 2-143 1-125 (125) 36 protein:vir:9930 Length: 108 # 99.9 3.1E-29 2E-32 176.8 10.0 108 8-140 1-108 (108) 37 protein:vir:99744 Length: 115 99.9 3.5E-29 2.2E-32 176.6 10.2 109 6-139 1-115 (115) 38 protein:vir:743 Length: 108 # 99.9 1.6E-28 1E-31 172.9 10.2 108 6-139 1-108 (108) 39 protein:vir:98409 Length: 108 99.9 3.8E-28 2.3E-31 170.9 10.4 108 6-139 1-108 (108) 40 protein:vir:2740 Length: 114 # 99.9 1.7E-27 1.1E-30 167.3 9.6 113 1-140 1-114 (114) 41 protein:vir:4906 Length: 114 # 99.9 1.7E-27 1.1E-30 167.3 9.6 113 1-140 1-114 (114) 42 protein:vir:106570 Length: 182 99.9 1E-26 6.3E-30 163.1 13.3 145 1-148 1-182 (182) 43 protein:vir:102154 Length: 119 99.9 1.6E-27 9.7E-31 167.5 8.4 118 1-143 1-119 (119) 44 protein:vir:96486 Length: 112 99.9 8.9E-27 5.5E-30 163.4 11.0 111 1-138 1-112 (112) 45 protein:vir:5978 Length: 144 # 99.9 1.6E-24 1E-27 151.0 11.3 115 1-139 3-144 (144) 46 protein:vir:97427 Length: 137 99.8 2.2E-24 1.3E-27 150.3 10.1 108 1-135 1-137 (137) 47 protein:vir:94490 Length: 137 99.8 2.2E-24 1.3E-27 150.3 10.1 108 1-135 1-137 (137) 48 protein:vir:93738 Length: 137 99.8 2.2E-24 1.3E-27 150.3 10.1 108 1-135 1-137 (137) 49 protein:vir:107099 Length: 137 99.8 2.6E-24 1.6E-27 149.9 9.9 108 1-135 1-137 (137) 50 protein:vir:94108 Length: 149 99.8 2.2E-24 1.4E-27 150.3 9.0 108 1-135 13-149 (149) 51 protein:vir:94796 Length: 137 99.8 3.1E-24 1.9E-27 149.5 9.8 108 1-135 1-137 (137) 52 protein:vir:105916 Length: 149 99.8 2.7E-24 1.7E-27 149.8 9.0 108 1-135 13-149 (149) 53 protein:vir:95894 Length: 137 99.8 4.1E-24 2.6E-27 148.8 10.0 108 1-135 1-137 (137) 54 protein:vir:105330 Length: 137 99.8 5.4E-24 3.3E-27 148.1 10.1 108 1-135 1-137 (137) 55 protein:vir:96121 Length: 137 99.8 2.1E-23 1.3E-26 144.9 9.8 108 1-135 1-137 (137) 56 protein:vir:96829 Length: 135 99.8 2.2E-23 1.4E-26 144.8 9.6 108 1-135 1-135 (135) 57 protein:vir:94654 Length: 142 99.8 2.9E-22 1.8E-25 138.6 11.6 115 1-139 1-142 (142) 58 protein:vir:79034 Length: 141 99.8 1.1E-21 6.7E-25 135.5 11.4 134 1-148 1-137 (141) 59 protein:vir:4956 Length: 153 # 99.8 1.5E-21 9.1E-25 134.8 12.0 137 1-148 1-140 (153) 60 protein:vir:100887 Length: 139 99.8 1.2E-21 7.7E-25 135.2 11.6 126 2-148 1-136 (139) 61 protein:vir:99101 Length: 142 99.8 5.7E-22 3.5E-25 137.1 9.6 113 1-136 1-142 (142) 62 protein:vir:8669 Length: 142 # 99.8 5.7E-22 3.5E-25 137.1 9.6 113 1-136 1-142 (142) 63 protein:vir:100223 Length: 139 99.7 1.3E-20 8E-24 129.6 11.5 136 2-148 1-136 (139) 64 protein:vir:4859 Length: 140 # 99.7 4.1E-20 2.6E-23 126.8 11.6 128 1-148 1-140 (140) 65 protein:vir:5000 Length: 141 # 99.7 6.7E-20 4.2E-23 125.7 11.6 128 1-148 1-140 (141) 66 protein:vir:81147 Length: 126 99.7 8.6E-20 5.4E-23 125.1 10.9 120 1-142 1-126 (126) 67 protein:vir:4833 Length: 140 # 99.7 3E-19 1.8E-22 122.1 11.8 137 1-148 1-140 (140) 68 protein:vir:105467 Length: 144 99.7 6.3E-19 3.9E-22 120.4 11.8 124 1-148 1-142 (144) 69 protein:vir:78077 Length: 141 99.6 1.5E-18 9.6E-22 118.2 9.8 115 1-143 1-141 (141) 70 protein:vir:95062 Length: 116 99.6 7.3E-19 4.5E-22 120.0 6.7 87 26-135 1-116 (116) 71 protein:vir:97327 Length: 116 99.6 7.6E-19 4.7E-22 119.9 6.7 87 26-135 1-116 (116) 72 protein:vir:1243 Length: 116 # 99.6 7.6E-19 4.7E-22 119.9 6.7 87 26-135 1-116 (116) 73 protein:vir:99528 Length: 92 # 99.6 4.5E-18 2.8E-21 115.7 7.5 92 1-115 1-92 (92) 74 protein:vir:100652 Length: 134 99.5 4.6E-17 2.9E-20 110.1 10.2 122 4-141 1-134 (134) 75 protein:vir:106041 Length: 137 99.5 2.2E-17 1.4E-20 111.9 8.0 104 2-133 1-137 (137) 76 protein:vir:3848 Length: 159 # 99.5 1.7E-16 1.1E-19 107.0 11.3 145 1-148 1-159 (159) 77 protein:vir:101302 Length: 134 99.5 1.5E-16 9.6E-20 107.3 10.1 122 4-141 1-134 (134) 78 protein:vir:9513 Length: 134 # 99.5 1.5E-16 9.6E-20 107.3 10.1 122 4-141 1-134 (134) 79 protein:vir:81067 Length: 119 99.5 2.3E-17 1.4E-20 111.8 5.5 93 41-148 1-117 (119) 80 protein:vir:10367 Length: 119 99.5 2.8E-17 1.7E-20 111.4 5.5 93 41-148 1-117 (119) 81 protein:vir:102441 Length: 137 99.4 8.8E-16 5.5E-19 103.1 7.8 106 2-134 1-137 (137) 82 protein:vir:97982 Length: 140 99.4 8.6E-16 5.3E-19 103.2 6.8 108 1-133 1-140 (140) 83 protein:vir:107545 Length: 140 99.4 8.6E-16 5.3E-19 103.2 6.8 108 1-133 1-140 (140) 84 protein:vir:966 Length: 123 # 99.4 6E-15 3.7E-18 98.5 10.9 117 2-140 1-123 (123) 85 protein:vir:9879 Length: 127 # 99.3 3E-15 1.9E-18 100.2 7.8 109 8-140 1-127 (127) 86 protein:vir:102963 Length: 163 99.3 3.3E-14 2.1E-17 94.5 11.9 127 2-148 1-160 (163) 87 protein:vir:9647 Length: 132 # 99.2 2.7E-13 1.6E-16 89.5 11.1 126 2-144 1-132 (132) 88 protein:vir:6246 Length: 143 # 99.1 3.7E-13 2.3E-16 88.7 9.7 126 1-148 1-143 (143) 89 protein:vir:1332 Length: 143 # 99.1 5.3E-13 3.3E-16 87.9 9.6 126 1-148 1-143 (143) 90 protein:vir:106506 Length: 137 99.1 2.2E-13 1.4E-16 90.0 5.1 108 1-139 1-137 (137) 91 protein:vir:6216 Length: 125 # 99.0 6.5E-13 4E-16 87.4 6.5 115 1-142 1-125 (125) 92 protein:vir:98636 Length: 138 99.0 8.4E-12 5.2E-15 81.3 10.5 127 1-144 6-138 (138) 93 protein:vir:78644 Length: 133 98.9 1.5E-11 9.5E-15 79.9 10.9 121 4-148 1-133 (133) 94 protein:vir:9363 Length: 133 # 98.9 1.5E-11 9.5E-15 79.9 10.9 121 4-148 1-133 (133) 95 protein:vir:94419 Length: 133 98.9 1.5E-11 9.5E-15 79.9 10.9 121 4-148 1-133 (133) 96 protein:vir:96973 Length: 133 98.9 1.5E-11 9.5E-15 79.9 10.9 121 4-148 1-133 (133) 97 protein:vir:78335 Length: 133 98.9 1.8E-11 1.1E-14 79.5 11.0 124 4-142 1-133 (133) 98 protein:vir:95372 Length: 124 98.9 3.6E-11 2.2E-14 77.8 9.9 114 1-140 1-124 (124) 99 protein:vir:93898 Length: 133 98.8 1.4E-10 8.9E-14 74.5 11.0 121 4-140 1-133 (133) 100 protein:vir:104347 Length: 145 98.7 6.8E-11 4.2E-14 76.3 7.0 131 1-142 5-145 (145) 101 protein:vir:80116 Length: 127 98.7 1.5E-10 9E-14 74.5 8.5 117 1-143 1-127 (127) 102 protein:vir:103280 Length: 142 98.6 1.8E-10 1.1E-13 74.0 7.8 132 1-142 1-142 (142) 103 protein:vir:107703 Length: 147 98.6 4.3E-10 2.7E-13 71.9 9.5 134 1-148 1-143 (147) 104 protein:vir:79638 Length: 146 98.6 4.5E-10 2.8E-13 71.8 9.3 137 1-146 1-146 (146) 105 protein:vir:94944 Length: 121 98.6 1.6E-10 9.8E-14 74.3 5.3 115 1-127 1-121 (121) 106 protein:vir:99833 Length: 190 98.5 1.1E-09 7E-13 69.6 9.0 117 1-146 1-190 (190) 107 protein:vir:79091 Length: 175 98.5 6.9E-10 4.3E-13 70.8 7.7 136 1-148 1-174 (175) 108 protein:vir:78380 Length: 131 98.4 9.9E-10 6.1E-13 70.0 6.7 125 2-139 1-131 (131) 109 protein:vir:1988 Length: 156 # 98.4 1.9E-09 1.2E-12 68.4 7.6 130 2-148 1-156 (156) 110 protein:vir:94994 Length: 131 98.4 8.3E-10 5.2E-13 70.4 5.4 125 2-139 1-131 (131) 111 protein:vir:96012 Length: 133 98.4 8E-09 5E-12 65.0 10.8 123 1-142 1-133 (133) 112 protein:vir:7412 Length: 168 # 98.3 6.7E-09 4.1E-12 65.4 8.9 127 1-148 1-165 (168) 113 protein:vir:3163 Length: 145 # 98.3 7.1E-09 4.4E-12 65.3 9.0 116 1-148 1-145 (145) 114 protein:vir:5257 Length: 148 # 98.2 1.9E-09 1.2E-12 68.5 4.0 94 2-148 1-98 (148) 115 protein:vir:1028 Length: 168 # 98.2 1.8E-08 1.1E-11 63.0 8.8 127 1-148 1-165 (168) 116 protein:vir:102338 Length: 116 98.2 1.3E-08 7.8E-12 63.9 7.7 94 26-143 1-116 (116) 117 protein:vir:101563 Length: 155 98.2 6E-09 3.7E-12 65.7 5.3 103 4-148 1-105 (155) 118 protein:vir:79225 Length: 155 98.2 1.9E-08 1.2E-11 62.9 8.0 134 1-146 1-155 (155) 119 protein:vir:107851 Length: 175 98.1 1.9E-08 1.2E-11 62.9 7.8 136 1-148 1-174 (175) 120 protein:vir:1087 Length: 161 # 98.1 2.7E-08 1.7E-11 62.0 8.2 130 1-148 1-161 (161) 121 protein:vir:95157 Length: 144 98.1 2.8E-08 1.8E-11 62.0 7.8 130 1-143 1-144 (144) 122 protein:vir:99196 Length: 155 98.0 7E-08 4.4E-11 59.8 8.7 132 1-148 1-153 (155) 123 protein:vir:77650 Length: 155 98.0 1.3E-08 8.2E-12 63.8 4.7 103 4-148 1-105 (155) 124 protein:vir:96774 Length: 152 98.0 2.3E-08 1.4E-11 62.5 5.8 126 1-141 10-152 (152) 125 protein:vir:3994 Length: 168 # 97.9 1.1E-07 6.8E-11 58.7 8.3 136 1-148 1-165 (168) 126 protein:vir:97190 Length: 148 97.9 7.4E-08 4.6E-11 59.7 6.9 128 1-148 1-144 (148) 127 protein:vir:4096 Length: 140 # 97.8 2.6E-07 1.6E-10 56.7 8.8 128 1-148 1-139 (140) 128 protein:vir:78607 Length: 155 97.8 5.2E-09 3.2E-12 66.0 -0.6 103 4-148 1-105 (155) 129 protein:vir:106728 Length: 155 97.8 5.3E-09 3.3E-12 65.9 -0.5 103 4-148 1-105 (155) 130 protein:vir:80425 Length: 134 97.8 4.4E-08 2.7E-11 60.9 4.2 125 2-148 1-134 (134) 131 protein:vir:103841 Length: 155 97.8 1E-07 6.4E-11 58.9 6.3 131 2-148 1-153 (155) 132 protein:vir:107757 Length: 189 97.8 7.4E-09 4.6E-12 65.2 -0.3 92 2-148 1-100 (189) 133 protein:vir:96105 Length: 193 97.8 5.8E-08 3.6E-11 60.3 4.0 130 2-148 1-143 (193) 134 protein:vir:96288 Length: 100 97.7 1.1E-07 6.9E-11 58.7 4.7 88 1-134 13-100 (100) 135 protein:vir:7449 Length: 123 # 97.6 2.5E-06 1.6E-09 51.3 11.1 121 1-146 1-123 (123) 136 protein:vir:105773 Length: 131 97.5 1.1E-06 6.5E-10 53.3 7.2 114 6-140 1-131 (131) 137 protein:vir:2688 Length: 123 # 97.4 3.1E-06 2E-09 50.8 8.9 111 14-140 1-123 (123) 138 protein:vir:94069 Length: 168 97.3 2.1E-07 1.3E-10 57.1 1.7 106 2-148 1-108 (168) 139 protein:vir:95260 Length: 160 97.3 1.3E-06 8.2E-10 52.8 5.9 91 1-148 1-102 (160) 140 protein:vir:99546 Length: 200 97.2 1.5E-06 9E-10 52.6 5.6 137 1-148 4-150 (200) 141 protein:vir:80970 Length: 112 97.2 4.3E-06 2.7E-09 50.0 8.1 104 2-142 1-112 (112) 142 protein:vir:7993 Length: 108 # 97.2 3.1E-07 2E-10 56.2 1.3 100 1-125 1-108 (108) 143 protein:vir:98892 Length: 108 97.1 5.2E-06 3.2E-09 49.6 7.7 103 1-140 1-108 (108) 144 protein:vir:101508 Length: 120 96.9 3.5E-05 2.1E-08 45.0 10.7 116 1-142 1-120 (120) 145 protein:vir:5703 Length: 150 # 96.9 7.7E-06 4.8E-09 48.6 7.0 132 8-140 1-150 (150) 146 protein:vir:6071 Length: 150 # 96.9 8.2E-06 5.1E-09 48.5 7.0 132 8-140 1-150 (150) 147 protein:vir:396 Length: 184 # 96.9 3.8E-05 2.4E-08 44.8 10.6 140 6-148 1-184 (184) 148 protein:vir:45 Length: 112 # N 96.8 1.4E-05 8.4E-09 47.3 7.7 104 2-142 1-112 (112) 149 protein:vir:80037 Length: 199 96.8 1.7E-06 1E-09 52.2 2.5 126 2-148 1-146 (199) 150 protein:vir:79179 Length: 155 96.7 1.4E-05 8.7E-09 47.2 6.9 132 1-140 1-155 (155) 151 protein:vir:2026 Length: 150 # 96.7 1.3E-05 7.8E-09 47.4 6.6 132 8-140 1-150 (150) 152 protein:vir:100312 Length: 152 96.2 4.7E-05 2.9E-08 44.3 7.0 133 2-141 1-152 (152) 153 protein:vir:4790 Length: 114 # 96.1 6.1E-05 3.8E-08 43.7 7.3 104 2-140 1-114 (114) 154 protein:vir:1838 Length: 149 # 96.1 3.5E-05 2.2E-08 45.0 6.0 131 8-140 1-149 (149) 155 protein:vir:3427 Length: 192 # 96.1 0.00046 2.9E-07 38.9 12.0 141 6-148 1-188 (192) 156 protein:vir:98557 Length: 149 96.1 2.7E-05 1.7E-08 45.6 5.0 126 8-140 1-149 (149) 157 protein:vir:79115 Length: 148 96.0 5E-05 3.1E-08 44.2 6.0 131 8-140 1-148 (148) 158 protein:vir:1581 Length: 116 # 95.0 0.00032 2E-07 39.8 7.2 105 2-139 1-116 (116) 159 protein:vir:6375 Length: 205 # 95.0 0.0012 7.2E-07 36.7 10.2 145 2-148 1-202 (205) 160 protein:vir:96763 Length: 177 94.9 0.0022 1.3E-06 35.2 11.5 144 1-148 4-175 (177) 161 protein:vir:79687 Length: 113 93.8 0.00047 2.9E-07 38.9 5.6 103 15-146 1-113 (113) 162 protein:vir:8106 Length: 150 # 93.8 9.1E-05 5.7E-08 42.7 1.6 117 1-148 5-148 (150) 163 protein:vir:9823 Length: 118 # 93.4 0.00053 3.3E-07 38.6 5.2 102 1-141 1-118 (118) 164 protein:vir:3036 Length: 118 # 93.4 0.00053 3.3E-07 38.6 5.2 102 1-141 1-118 (118) 165 protein:vir:102608 Length: 108 92.4 0.00018 1.1E-07 41.1 1.2 90 1-125 11-108 (108) 166 protein:vir:105825 Length: 108 92.4 0.00018 1.1E-07 41.1 1.2 90 1-125 11-108 (108) 167 protein:vir:1164 Length: 156 # 91.6 0.002 1.2E-06 35.4 5.9 136 2-144 1-156 (156) 168 protein:vir:102190 Length: 93 89.7 0.0054 3.4E-06 33.0 6.6 91 30-142 1-93 (93) 169 protein:vir:79555 Length: 192 88.3 0.033 2.1E-05 28.7 11.1 140 8-148 1-184 (192) 170 protein:vir:4460 Length: 170 # 85.7 0.018 1.1E-05 30.1 6.9 130 1-148 1-168 (170) 171 protein:vir:487 Length: 187 # 84.5 0.016 1E-05 30.4 6.1 132 1-148 14-185 (187) 172 protein:vir:4200 Length: 133 # 77.6 0.032 2E-05 28.8 5.1 127 1-140 1-133 (133) 173 protein:vir:4162 Length: 133 # 64.8 0.1 6.4E-05 26.0 4.9 127 1-140 1-133 (133) 174 protein:vir:97088 Length: 157 64.0 0.31 0.00019 23.4 9.7 130 6-148 1-151 (157) 175 protein:vir:4514 Length: 168 # 57.9 0.42 0.00026 22.6 7.2 138 1-148 1-167 (168) 176 protein:vir:101654 Length: 126 57.1 0.024 1.5E-05 29.5 -0.0 117 8-133 1-126 (126) 177 protein:vir:7859 Length: 126 # 57.1 0.024 1.5E-05 29.5 -0.0 117 8-133 1-126 (126) 178 protein:vir:79034 Length: 141 50.4 0.61 0.00038 21.8 9.6 128 1-148 9-141 (141) 179 protein:vir:78163 Length: 92 # 47.7 0.11 6.7E-05 25.9 2.0 91 1-128 1-92 (92) 180 protein:vir:102875 Length: 146 41.1 0.94 0.00059 20.7 10.4 130 1-142 6-146 (146) 181 protein:vir:105007 Length: 146 41.1 0.94 0.00059 20.7 10.4 130 1-142 6-146 (146) 182 protein:vir:107568 Length: 146 41.1 0.94 0.00059 20.7 10.4 130 1-142 6-146 (146) 183 protein:vir:102085 Length: 146 41.1 0.94 0.00059 20.7 10.4 130 1-142 6-146 (146) 184 protein:vir:78894 Length: 105 38.8 0.17 0.0001 24.8 1.6 101 2-140 1-105 (105) 185 protein:vir:1386 Length: 149 # 33.6 1.3 0.00083 19.9 8.4 129 1-144 6-149 (149) 186 protein:vir:5745 Length: 135 # 32.1 1.4 0.0009 19.7 8.9 129 6-148 1-132 (135) 187 protein:vir:98342 Length: 125 24.3 2.2 0.0014 18.7 9.9 124 6-147 1-125 (125) 188 protein:vir:4704 Length: 125 # 24.3 2.2 0.0014 18.7 9.9 124 6-147 1-125 (125) 189 protein:vir:79988 Length: 125 24.3 2.2 0.0014 18.7 9.9 124 6-147 1-125 (125) 190 protein:vir:9414 Length: 125 # 24.3 2.2 0.0014 18.7 9.9 124 6-147 1-125 (125) 191 protein:vir:81106 Length: 125 24.3 2.2 0.0014 18.7 9.9 124 6-147 1-125 (125) 192 protein:vir:6154 Length: 119 # 23.7 0.09 5.6E-05 26.3 -2.6 118 2-148 1-118 (119) 193 protein:vir:3787 Length: 231 # 21.7 1.7 0.0011 19.3 4.0 137 2-148 1-228 (231) 194 protein:vir:3750 Length: 227 # 21.7 2.3 0.0014 18.6 4.7 135 2-148 1-225 (227) No 1 >protein:vir:93617 Length: 148 # NCBI annotation: putative structural component # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449299;genbank:gi:157166047;interpro:IPR010064;interpro:IPR011693;uniprot:Q6H9U2;genbank:GeneID:5580439 Probab=100.00 E-value=2.7e-44 Score=259.41 Aligned_cols=148 Identities=99% Similarity=1.407 Sum_probs=141.8 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) ||+|+|+|+|||+|++.|++|++++.++++++||++||++|+++|+.+||+++|.|++||.++.....+|+....+++.. T Consensus 1 mm~~~~~i~Gldel~~~l~~L~~~~~~~~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~g~~~~~v~~~~ 80 (148) T protein:vir:93 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRRSRDGGMESGVHIRG 80 (148) T ss_pred CcceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcchhhhhceeccccccCCceeeeeeecc Confidence 99999999999999999999998888889999999999999999999999999999999999999999999999999988 Q ss_pred eccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011356. 81 VNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEVLRR 148 (148) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~k~~kk 148 (148) .....+.....+...+..++|||||+||||++|||||||+||++++++++++.|.++++++|+++++| T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~pa~PFl~pA~~~~k~~~~~~~~~~~~~~i~k~~~k 148 (148) T protein:vir:93 81 VNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEVLRR 148 (148) T ss_pred cccccccccceeecCCCCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHHHHHhcC Confidence 88887777777778888999999999999999999999999999999999999999999999999999 No 2 >protein:vir:194 Length: 149 # NCBI annotation: Gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037704;genbank:gi:9634169;genbank:GeneID:1262536 Probab=100.00 E-value=4.7e-42 Score=247.14 Aligned_cols=148 Identities=76% Similarity=1.166 Sum_probs=134.0 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceecccccc-ccceeeeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSR-DGGMESGVHIR 79 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~-~~~~~~~~~~~ 79 (148) ||+|+|+|+|||+|++.|+.|++++.++++++|+.++|++|+++|+++||+++|.|++||.+++.... .+++...+++. T Consensus 1 mm~~~~~i~Gl~~l~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~si~~~~~~~~~~~~~~~~v~~~ 80 (149) T protein:vir:19 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIDRAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) T ss_pred CcceeeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCchhhhhhccccccccccccceeeccccc Confidence 99999999999999999999999988889999999999999999999999999999999998766544 44556666666 Q ss_pred eeccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011356. 80 GVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEVLRR 148 (148) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~k~~kk 148 (148) ......+.....+...+.+++|||||+||||++|||||||+||++++++++++.|.++|+++|+|+++| T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PF~~pA~~~~k~~~~~~~~~~l~~~l~k~~~k 149 (149) T protein:vir:19 81 GVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAIDEVLSK 149 (149) T ss_pred ccccccccccceeecCCCCccceeeeeccCCCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 666666666666777788899999999999999999999999999999999999999999999999999 No 3 >protein:vir:1891 Length: 179 # NCBI annotation: gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037671;genbank:gi:9634129;genbank:GeneID:1262520 Probab=100.00 E-value=1.8e-39 Score=233.05 Aligned_cols=148 Identities=25% Similarity=0.454 Sum_probs=117.2 Q ss_pred Ccc-ceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC-----Ccchhhhhceecccccc---ccc Q lcl|NC_011356. 1 MIE-TLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPV-----RRGKLRRNVVVLSRCSR---DGG 71 (148) Q Consensus 1 Mm~-~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~-----~~g~l~~~i~~~~~~~~---~~~ 71 (148) |++ |+|+|+||+||+++|++|+++++++++++||.+||++|+++|+.+||+ .+|.+++||.+.+.... .+. T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~~~~~~~~~g~ 80 (179) T protein:vir:18 1 MADSVEVSLTGLESLLGKMEAVSEVTRNKAGRFALRKAANIIRDRARSNASRVDDPLTKEAIHKNIVASFSSKQFRRTGD 80 (179) T ss_pred CCceEEEEeecHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccccchhhhhhheeecccccccccccc Confidence 886 899999999999999999999988999999999999999999999976 46888999987654432 333 Q ss_pred eeeeeeeeeeccc------------cccc--cceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 72 MESGVHIRGVNPD------------TGNS--DNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIAR 137 (148) Q Consensus 72 ~~~~~~~~~~~~~------------~~~~--~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~ 137 (148) ....+++...... .+.. ........+.++|||||+||||++|||||||+|||+++++++++.|.++ T Consensus 81 ~~~~vgv~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~y~~fvEfGT~kmpa~PFlrPA~~~~~~~a~~~i~~~ 160 (179) T protein:vir:18 81 LAFRVGVMGGARQYANTKANVRKGRAGKTYKTSGDKGNPGGDTWYWRFLEFGTEHTSARPILRPAMNGVDNDVINVFSTE 160 (179) T ss_pred eeEeeecccccccccccccccccCcccccccccccccCCCCccceeEEeccCCCCCCCCccchhhHHhhHHHHHHHHHHH Confidence 2222222110000 0000 0111233566899999999999999999999999999999999999999 Q ss_pred HHHHHHHHhcC Q lcl|NC_011356. 138 MNRAIDEVLRR 148 (148) Q Consensus 138 ~~~~i~k~~kk 148 (148) |+++|+|+|+| T Consensus 161 l~~~i~k~lk~ 171 (179) T protein:vir:18 161 MGKAIDRAIRL 171 (179) T ss_pred HHHHHHHHHHh Confidence 99999999999 No 4 >protein:vir:4347 Length: 164 # NCBI annotation: Orf14 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061510;genbank:gi:9635606;genbank:GeneID:1262873 Probab=100.00 E-value=3.5e-39 Score=231.40 Aligned_cols=147 Identities=17% Similarity=0.269 Sum_probs=117.6 Q ss_pred Ccc-ceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC-----Ccchhhhhceecccccc---ccc Q lcl|NC_011356. 1 MIE-TLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPV-----RRGKLRRNVVVLSRCSR---DGG 71 (148) Q Consensus 1 Mm~-~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~-----~~g~l~~~i~~~~~~~~---~~~ 71 (148) |++ |+|+|+|||+|+++|++|+++++++++++||.+||++|+++|+.++|+ .+|+|++||.+++.... ++. T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~~~~~~~~~~~ 80 (164) T protein:vir:43 1 MADTVEFSITGLDSLLGKLDSVTDDVKRRGGRAALRKAAMIVVQAAKQGAEKVDDPGTGRSISDNIALRWNGRLFKRTGD 80 (164) T ss_pred CCcceEEeeecHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccCCCccchhhhhhhhhcccCccccccc Confidence 887 899999999999999999999888999999999999999999999997 56899999987543222 222 Q ss_pred eeeeeeeeeeccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011356. 72 MESGVHIRGVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEVLRR 148 (148) Q Consensus 72 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~k~~kk 148 (148) +...+.+.... .............++++|||||+||||++|||||||+|||+++++++++.|.++|+++|+++++| T Consensus 81 ~~~~vg~~~~~-~~~~~~~~~~~~~~~~~~y~~f~EfGT~km~a~PFlrPA~~~~k~~~~~~~~~~l~~~i~ka~~k 156 (164) T protein:vir:43 81 LGFRIGVLHGA-VLPKKGERSDKTANAPTPHWRLLEFGTEDMRAQPFMRSALADNIAEVTSTFVSEYEKGIDRAIKR 156 (164) T ss_pred eeEEecccccc-cccccccccccCCCCCcceEEEeecCCCCCCCCcchhhhHHHhHHHHHHHHHHHHHHHHHHHHHH Confidence 22222211111 01111222233456788999999999999999999999999999999999999999999999999 No 5 >protein:vir:100075 Length: 140 # NCBI annotation: gp9 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945039;genbank:gi:38707899;genbank:GeneID:2744122 Probab=100.00 E-value=1e-36 Score=217.86 Aligned_cols=138 Identities=41% Similarity=0.645 Sum_probs=115.9 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) |. +|+|+|||+|++.|+.|+++++++++++||.++|++|+++|+++||+++|+|++||.++......+.....+.... T Consensus 1 Ma--~~~i~Gld~l~~~l~~L~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~tG~l~~sI~~~~~~~~~~~~~~~~g~~~ 78 (140) T protein:vir:10 1 MS--SIQIIGLADLRADFEKLAKSQSTKALRRATVAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGLATAGVRV 78 (140) T ss_pred Cc--eeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHHhccccccccccccceEEeeeee Confidence 55 4678899999999999998888889999999999999999999999999999999998766554443322221110 Q ss_pred eccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011356. 81 VNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEVLRR 148 (148) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~k~~kk 148 (148) .. .....+.+++|||+|+||||++|||||||+||++++++++++.|.++++++|+|++.+ T Consensus 79 -~~-------~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~k~~~~ 138 (140) T protein:vir:10 79 -RT-------KGKADSPNNAFYWRFDEFGTQHMKAQPFMRPAFDASIGEAEGAIRTELARAIDRVLGG 138 (140) T ss_pred -cc-------ccccCCCCccceeeeeccCCCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 00 1123346789999999999999999999999999999999999999999999999999 No 6 >protein:vir:100243 Length: 140 # NCBI annotation: gp72 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355408;genbank:gi:77864698;genbank:GeneID:3725965 Probab=100.00 E-value=1.2e-36 Score=217.60 Aligned_cols=138 Identities=43% Similarity=0.665 Sum_probs=115.2 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) |. +|+|+|||+|++.|+.|++++.++++++|+.++|++|+++|+++||+++|+|++||.++......+.....+ . T Consensus 1 Ma--~~~i~Gld~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~~---~ 75 (140) T protein:vir:10 1 MS--SVQILGLADLQADFLKLAKAQSTKALRRATVAGANVIRDEARARAPKKTGKLKRNIVTAALKQKDSPGIATA---G 75 (140) T ss_pred Cc--eeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHHhceecccccccccceeEE---e Confidence 55 577889999999999999888888999999999999999999999999999999999876654443322211 1 Q ss_pred eccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011356. 81 VNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEVLRR 148 (148) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~k~~kk 148 (148) +..... ....+.+++|||||+||||++|||||||+||++++++++++.|.++++++|+|++++ T Consensus 76 ~~~~~~-----~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~k~~~~ 138 (140) T protein:vir:10 76 VRVRTK-----GKADSPNNAFYWRFVELGTQFMKAEPFMRPAFDASIAQAEGAIRTEIARAIDQVVGG 138 (140) T ss_pred eccccc-----cccCCCCcccccceeccCcCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 111111 112345788999999999999999999999999999999999999999999999999 No 7 >protein:vir:80362 Length: 140 # NCBI annotation: gp10, phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111089;genbank:gi:134288660;genbank:GeneID:4960609 Probab=100.00 E-value=2.3e-36 Score=215.99 Aligned_cols=138 Identities=40% Similarity=0.639 Sum_probs=114.3 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) |. +|+|+|||+|++.|+.|++++.++++++|+.++|++|+++|+++||+++|+|++||.++......+.....+.. . T Consensus 1 Ma--~~~i~Gld~l~~~l~~l~~~~~~k~~~~a~~~~a~~v~~~ak~~aP~~tG~l~~~i~~~~~~~~~~~~~~~~~~-~ 77 (140) T protein:vir:80 1 MS--SIQIVGLADLLADFERLAKSQSTKALRRATVAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGLATAGV-R 77 (140) T ss_pred Cc--eeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhceeeeccccccccceeeeee-e Confidence 44 67888999999999999988888899999999999999999999999999999999876544433322111111 1 Q ss_pred eccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011356. 81 VNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEVLRR 148 (148) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~k~~kk 148 (148) +... ....+.+++|||+|+||||++|||||||+||++++++++++.|.++++++|++++.+ T Consensus 78 ~~~~-------~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~k~~~~ 138 (140) T protein:vir:80 78 VRTK-------GKADSPSNAFYWRFDEFGTQHMKAQPFMRPAFDASIGEAEGAIRTELARAIDQALGG 138 (140) T ss_pred cccc-------cccCCCCCcceeeeeccCCCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 1111 112345789999999999999999999999999999999999999999999999999 No 8 >protein:vir:1437 Length: 140 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536366;genbank:gi:17975171;genbank:GeneID:929147 Probab=100.00 E-value=3.8e-36 Score=214.78 Aligned_cols=138 Identities=41% Similarity=0.648 Sum_probs=115.7 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) |. +|+|+|||+|++.|+.|++++.++++++|+.++|++|+++++++||+++|+|++||.++......+.....+... T Consensus 1 M~--~~~i~Gld~l~~~l~~l~~~~~~~~~~~al~~~a~~v~~~ak~~aP~~tG~l~~sI~~~~~~~~~~~~~~~vg~~- 77 (140) T protein:vir:14 1 MS--SIQIIGLADLRADFEKLAKSQSAKALRRATLAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGLATAGVR- 77 (140) T ss_pred Cc--eeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHhhcccccccccccceeEEeeee- Confidence 55 477889999999999999888888999999999999999999999999999999999876555444332222211 Q ss_pred eccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011356. 81 VNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEVLRR 148 (148) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~k~~kk 148 (148) .+. .....+.+++|||||+||||++|||||||+||++++++++++.|.++++++|+|++.+ T Consensus 78 ----~~~---~~~~~~~~~~~y~~f~E~GT~~~~a~pFl~pa~~~~~~~~~~~~~~~~~~~l~k~~~~ 138 (140) T protein:vir:14 78 ----VRT---KGKADSPNNAFYWRFDEFGTQHMKAQPFMRPAFDASIGEAEGAIRTELARAIDRVLGG 138 (140) T ss_pred ----ecc---ccccCCCCccceeeeeccccCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 111 0122345789999999999999999999999999999999999999999999999999 No 9 >protein:vir:5745 Length: 135 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892056;genbank:gi:33770519;interpro:IPR010064;interpro:IPR011693;uniprot:Q7Y404;genbank:GeneID:2637451 Probab=100.00 E-value=4.7e-36 Score=214.25 Aligned_cols=131 Identities=20% Similarity=0.345 Sum_probs=112.8 Q ss_pred ccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCc----chhhhhceeccccccccceeeeee Q lcl|NC_011356. 2 IETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRR----GKLRRNVVVLSRCSRDGGMESGVH 77 (148) Q Consensus 2 m~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~----g~l~~~i~~~~~~~~~~~~~~~~~ 77 (148) |.++|+|+||+||++.|++|+.++.++++++||++||++|+++++.++|+++ |+|++||.++......+.....+. T Consensus 1 M~~~~~i~Gl~el~~~l~~L~~~~~~k~~~~Al~~~a~~v~~~~k~~ap~~~~~~~g~l~~~I~i~~~k~~~~~~~v~v~ 80 (135) T protein:vir:57 1 MIPEIEISGLQELERRLIAVGEEVGTKILRDAGRAAMAVVEADMKQNAGYDNSSTNAHMRDSIKIRSSRGKAGSTVVVLR 80 (135) T ss_pred CceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhHHhhcccccccccccceeEEEE Confidence 9999999999999999999999988889999999999999999999999964 899999987765444333222111 Q ss_pred eeeeccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_011356. 78 IRGVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEVLR 147 (148) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~k~~k 147 (148) ++......||+||+||||++|||||||+||+++++++++++|.++|+++|+|+.+ T Consensus 81 ---------------vg~~~~~~~~~~f~E~GT~~~~a~PF~~pa~~~~~~~~~~~~~~~~~~~l~ka~r 135 (135) T protein:vir:57 81 ---------------VGPTRSHYMKALAQEFGTIKQVAKPFIRPALDYNKMQVLRILTVEIRDGLSTLSR 135 (135) T ss_pred ---------------ecCCCCcceeEeecccCCCCCCCCcchhHhHHHhHHHHHHHHHHHHHHHHHHhcC Confidence 1122345688999999999999999999999999999999999999999999999 No 10 >protein:vir:107568 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338191;genbank:gi:77020147;genbank:GeneID:3703699 Probab=100.00 E-value=2.4e-36 Score=215.84 Aligned_cols=143 Identities=20% Similarity=0.364 Sum_probs=118.7 Q ss_pred Ccc-ceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeee Q lcl|NC_011356. 1 MIE-TLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIR 79 (148) Q Consensus 1 Mm~-~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~ 79 (148) |++ |+|+|+||++|+++|++|+.+. ++++++||.+||++|+++++.++|+++|.+++++..... ..++..+.+.+. T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~-~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~--~~~~~~~~i~~~ 77 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRG-EKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWR--TGQHGADQIKVT 77 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHH-HHHHHHHHHHHHHHHHHHHHHhCCCcccccccccccccc--ccccccccceec Confidence 887 7999999999999999999874 679999999999999999999999999999988754332 345556666555 Q ss_pred eecccccccccee--ecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_011356. 80 GVNPDTGNSDNTM--KADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEVL 146 (148) Q Consensus 80 ~~~~~~~~~~~~~--~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~k~~ 146 (148) ......+.....+ .....+++|||||+||||++|||||||+||+++++++++++|.++|+++|+++| T Consensus 78 ~~~~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pa~~~~k~~~~~~~~~~l~~~l~ka~ 146 (146) T protein:vir:10 78 KAKLEGGIKTVKIGLNKADRSPWFYLKFHEWGTSKMPAHPFIEPGFNASKAEAVRAMTDILKNEMRLDL 146 (146) T ss_pred cccccccceeEEeeeccCCCCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHHhhcC Confidence 5444433322221 123356789999999999999999999999999999999999999999999999 No 11 >protein:vir:102085 Length: 146 # NCBI annotation: head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512318;genbank:gi:89152487;genbank:GeneID:3953078 Probab=100.00 E-value=2.4e-36 Score=215.84 Aligned_cols=143 Identities=20% Similarity=0.364 Sum_probs=118.7 Q ss_pred Ccc-ceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeee Q lcl|NC_011356. 1 MIE-TLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIR 79 (148) Q Consensus 1 Mm~-~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~ 79 (148) |++ |+|+|+||++|+++|++|+.+. ++++++||.+||++|+++++.++|+++|.+++++..... ..++..+.+.+. T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~-~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~--~~~~~~~~i~~~ 77 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRG-EKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWR--TGQHGADQIKVT 77 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHH-HHHHHHHHHHHHHHHHHHHHHhCCCcccccccccccccc--ccccccccceec Confidence 887 7999999999999999999874 679999999999999999999999999999988754332 345556666555 Q ss_pred eecccccccccee--ecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_011356. 80 GVNPDTGNSDNTM--KADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEVL 146 (148) Q Consensus 80 ~~~~~~~~~~~~~--~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~k~~ 146 (148) ......+.....+ .....+++|||||+||||++|||||||+||+++++++++++|.++|+++|+++| T Consensus 78 ~~~~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pa~~~~k~~~~~~~~~~l~~~l~ka~ 146 (146) T protein:vir:10 78 KAKLEGGIKTVKIGLNKADRSPWFYLKFHEWGTSKMPAHPFIEPGFNASKAEAVRAMTDILKNEMRLDL 146 (146) T ss_pred cccccccceeEEeeeccCCCCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHHhhcC Confidence 5444433322221 123356789999999999999999999999999999999999999999999999 No 12 >protein:vir:105007 Length: 146 # NCBI annotation: conserved phage protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459972;genbank:gi:85701387;genbank:GeneID:3882148 Probab=100.00 E-value=2.4e-36 Score=215.84 Aligned_cols=143 Identities=20% Similarity=0.364 Sum_probs=118.7 Q ss_pred Ccc-ceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeee Q lcl|NC_011356. 1 MIE-TLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIR 79 (148) Q Consensus 1 Mm~-~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~ 79 (148) |++ |+|+|+||++|+++|++|+.+. ++++++||.+||++|+++++.++|+++|.+++++..... ..++..+.+.+. T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~-~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~--~~~~~~~~i~~~ 77 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRG-EKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWR--TGQHGADQIKVT 77 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHH-HHHHHHHHHHHHHHHHHHHHHhCCCcccccccccccccc--ccccccccceec Confidence 887 7999999999999999999874 679999999999999999999999999999988754332 345556666555 Q ss_pred eecccccccccee--ecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_011356. 80 GVNPDTGNSDNTM--KADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEVL 146 (148) Q Consensus 80 ~~~~~~~~~~~~~--~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~k~~ 146 (148) ......+.....+ .....+++|||||+||||++|||||||+||+++++++++++|.++|+++|+++| T Consensus 78 ~~~~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pa~~~~k~~~~~~~~~~l~~~l~ka~ 146 (146) T protein:vir:10 78 KAKLEGGIKTVKIGLNKADRSPWFYLKFHEWGTSKMPAHPFIEPGFNASKAEAVRAMTDILKNEMRLDL 146 (146) T ss_pred cccccccceeEEeeeccCCCCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHHhhcC Confidence 5444433322221 123356789999999999999999999999999999999999999999999999 No 13 >protein:vir:102875 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338140;genbank:gi:77020200;genbank:GeneID:3703784 Probab=100.00 E-value=2.4e-36 Score=215.84 Aligned_cols=143 Identities=20% Similarity=0.364 Sum_probs=118.7 Q ss_pred Ccc-ceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeee Q lcl|NC_011356. 1 MIE-TLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIR 79 (148) Q Consensus 1 Mm~-~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~ 79 (148) |++ |+|+|+||++|+++|++|+.+. ++++++||.+||++|+++++.++|+++|.+++++..... ..++..+.+.+. T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~-~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~--~~~~~~~~i~~~ 77 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRG-EKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWR--TGQHGADQIKVT 77 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHH-HHHHHHHHHHHHHHHHHHHHHhCCCcccccccccccccc--ccccccccceec Confidence 887 7999999999999999999874 679999999999999999999999999999988754332 345556666555 Q ss_pred eecccccccccee--ecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_011356. 80 GVNPDTGNSDNTM--KADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEVL 146 (148) Q Consensus 80 ~~~~~~~~~~~~~--~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~k~~ 146 (148) ......+.....+ .....+++|||||+||||++|||||||+||+++++++++++|.++|+++|+++| T Consensus 78 ~~~~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pa~~~~k~~~~~~~~~~l~~~l~ka~ 146 (146) T protein:vir:10 78 KAKLEGGIKTVKIGLNKADRSPWFYLKFHEWGTSKMPAHPFIEPGFNASKAEAVRAMTDILKNEMRLDL 146 (146) T ss_pred cccccccceeEEeeeccCCCCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHHhhcC Confidence 5444433322221 123356789999999999999999999999999999999999999999999999 No 14 >protein:vir:1386 Length: 149 # NCBI annotation: Gp9 protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612838;genbank:gi:20065972;genbank:GeneID:935787 Probab=100.00 E-value=1.7e-34 Score=205.72 Aligned_cols=145 Identities=17% Similarity=0.275 Sum_probs=108.8 Q ss_pred Ccc-ceeeehhHHHHHHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeee Q lcl|NC_011356. 1 MIE-TLLDFSGLEDISRDLQLLS-GAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHI 78 (148) Q Consensus 1 Mm~-~~~~i~Gl~el~~~l~~l~-~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~ 78 (148) |.+ ++|+|.||+||+++|++|+ +...++++++||++||++|++++++++|++.+..... .......++..+.+.+ T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~~k~~~~Al~~ga~~v~~~~k~~aP~~~~~~~~~---~~~~~~~~~~~d~i~~ 77 (149) T protein:vir:13 1 MSDGWEIKFEGLDDLIKTFEQLGTEKENEDVEKSILKECGDLAKKTVAPLIHISDDNSKSG---RKGSRPPGHAANNIPE 77 (149) T ss_pred CCceeEEEeecHHHHHHHHHhcccHHHHHHHHHHHHHHHHHHHHHHHHHhCCccCCccccc---cccccccchhhhccee Confidence 886 7999999999999999996 3456789999999999999999999999864322111 1112223344444444 Q ss_pred eeecccccccccee--ecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011356. 79 RGVNPDTGNSDNTM--KADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEVLRR 148 (148) Q Consensus 79 ~~~~~~~~~~~~~~--~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~k~~kk 148 (148) ..+....+.....+ ....++++|||||+||||++|||||||+||+++++++++++|.++|+++|+++|-- T Consensus 78 ~~~~~~~g~~~~~VG~~~~~~~~~~y~~f~E~GT~k~~a~pF~~pa~~~~~~~~~~~~~~~l~k~i~~~lG~ 149 (149) T protein:vir:13 78 PKIRKKKGNLQCVVGWEKSDNTPFYYMKMEEWGTSERPPHHAFGKTNKILKRVYDNIAQKKYDNFVKEKLGD 149 (149) T ss_pred cccccccceeEEEeeccCCCCCccceeeeeccCccCCCCCccchHHHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 33333222211111 11224578999999999999999999999999999999999999999999999999 No 15 >protein:vir:105089 Length: 133 # NCBI annotation: Gp11 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006591;genbank:gi:46402097;genbank:GeneID:2777955 Probab=100.00 E-value=1.3e-34 Score=206.43 Aligned_cols=129 Identities=24% Similarity=0.378 Sum_probs=103.7 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcch----hhhhceeccccccccceeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGK----LRRNVVVLSRCSRDGGMESGV 76 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~----l~~~i~~~~~~~~~~~~~~~~ 76 (148) ||+++ |+||++|+++|++|+.++.++++++||.+||++|+++|+.+||+++|. +++||.++........ .+. T Consensus 1 M~~~~--i~Gl~el~~~l~~L~~~~~~k~~~~Al~~~a~~i~~~ak~~ap~~~~~~~~~~~~~I~v~~~~~~~~~--~~~ 76 (133) T protein:vir:10 1 MIRME--VKGLDELERQLTALGEKVATKVLRDAGREALKVVEEDMKQHAGFDETSTGQHMRDSIKIRSSTRKAQG--NAV 76 (133) T ss_pred CeeEe--eehHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCcchhhhhhcccccccccccCc--cce Confidence 77655 679999999999999988888999999999999999999999999876 6777765432222111 011 Q ss_pred eeeeeccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 77 HIRGVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEV 145 (148) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~k~ 145 (148) ..+ .++.+...+|||||+||||++|||||||+|||++++++++++|.++++++|+|- T Consensus 77 ~~v------------~vg~~~~~~~y~~f~E~GT~k~~a~PF~~pA~~~~~~~~~~~~~~~~~~~l~K~ 133 (133) T protein:vir:10 77 VTL------------RVGPSKQHHMKVLAQEFGTVKQVADPFIRPALDYNVQTVLRVLTVEIRNGIQNR 133 (133) T ss_pred EEE------------EecCCCCccceEeeeccCCCCCCCCccchHHHHHhHHHHHHHHHHHHHHHhhcC Confidence 111 112334567999999999999999999999999999999999999999999887 No 16 >protein:vir:94538 Length: 125 # NCBI annotation: putative head to tail joining # Family: family:all:180 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223893;genbank:gi:62327105;genbank:GeneID:5075554 Probab=99.96 E-value=1e-32 Score=195.97 Aligned_cols=124 Identities=19% Similarity=0.212 Sum_probs=104.5 Q ss_pred Ccc-ceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeee Q lcl|NC_011356. 1 MIE-TLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIR 79 (148) Q Consensus 1 Mm~-~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~ 79 (148) |.+ |+|+|+|+|+|.+.|+++++++. +.+++|+.++++.|.++|+.+||++||+|++||.++......+++...+ T Consensus 1 Ma~~~~i~~~Gld~l~~~L~~~~~~~~-~~v~~al~~~a~~i~~~ak~~ap~~tG~L~~sI~~~~~~~~~~~~~~~v--- 76 (125) T protein:vir:94 1 MANDFNIKFKGVDKLLDEFDISRKELV-PYSVEAMKTSLSRAVEKSKGLARVDTGYMRNNIQQDEVKEEHGVVTGRY--- 76 (125) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHH-HHHHHHHHHHHHHHHHHHHhhCCCCChhhhhhceecceeccCCcEEEEe--- Confidence 666 79999999999999999998876 5668999999999999999999999999999998765444444332211 Q ss_pred eeccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 80 GVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEV 145 (148) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~k~ 145 (148) +++.+||+|+||||++|||||||+||++++++.+.+.|.++|+++|++. T Consensus 77 -----------------~~~~~Ya~~vEfGT~~~~a~Pfl~pa~~~~~~~~~~~l~~~l~~a~k~~ 125 (125) T protein:vir:94 77 -----------------VARADYSSYNEYGTYRMSAQPFMAPSVAAMTPFFYKAVRDALNKAAKFS 125 (125) T ss_pred -----------------eCCCCccceeecccccCCCCcccchhHHHHHHHHHHHHHHHHHHHhccC Confidence 2445799999999999999999999999999988887777777777776 No 17 >protein:vir:3873 Length: 128 # NCBI annotation: putative head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680490;swissprot:trembl:p94214;genbank:gi:22296530;interpro:IPR010064;uniprot:P94214;genbank:GeneID:951688 Probab=99.96 E-value=1.6e-32 Score=194.85 Aligned_cols=122 Identities=17% Similarity=0.216 Sum_probs=101.2 Q ss_pred ceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcch------hhhhceeccccccccceeeeee Q lcl|NC_011356. 4 TLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGK------LRRNVVVLSRCSRDGGMESGVH 77 (148) Q Consensus 4 ~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~------l~~~i~~~~~~~~~~~~~~~~~ 77 (148) |+|+|+||+||+++|++|+.++ ++++++||.+||++|+++++.++|+++|. ++++|.++......+... T Consensus 1 m~v~i~Gl~el~~~l~~l~~~~-~k~~~~al~~ga~~~~~~~k~~ap~~~~~~~~~~h~~d~I~~~~~k~~~g~~~---- 75 (128) T protein:vir:38 1 MGVKVTGDAELLANLNKLQFGV-AKEARAAVRDGAQKFADKLKSNTPEWDGETDMSGHLRDDIKLSSVRETSGLTE---- 75 (128) T ss_pred CccchhhHHHHHHHHHHhHHHH-HHHHHHHHHHHHHHHHHHHHHhCCCcCCCCcccchhhhhhccccccccCceeE---- Confidence 8888999999999999999875 57899999999999999999999997654 666665543322222111 Q ss_pred eeeeccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 78 IRGVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAID 143 (148) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~ 143 (148) ..++.+..++|||||+||||++|||||||+||+++++++++++|.++|+++|- T Consensus 76 -------------~~VG~~k~~~~y~~f~E~GT~k~~a~pF~~pa~~~~~~~~~~~~~~~l~k~i~ 128 (128) T protein:vir:38 76 -------------VDVGYGKDTGWRAHFPNSGTSMQDPQHFIEETQEIMRPVVIAAFLSHLKEGGM 128 (128) T ss_pred -------------EEeeecCCCceEEeeeccCccCCCCCcchhHHHHHhHHHHHHHHHHHHHhhcC Confidence 12233456789999999999999999999999999999999999999999988 No 18 >protein:vir:1273 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690765;genbank:gi:22855005;genbank:GeneID:955232 Probab=99.95 E-value=3.5e-32 Score=193.07 Aligned_cols=123 Identities=24% Similarity=0.396 Sum_probs=102.1 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC---cchhhhhceeccccc-cccceeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVR---RGKLRRNVVVLSRCS-RDGGMESGV 76 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~---~g~l~~~i~~~~~~~-~~~~~~~~~ 76 (148) |.+ |+|+||+||++.|++|+.++ ++++++||.+||++|++++++++|++ +|+|++||.++.... ..+.... T Consensus 1 M~~--~~i~Gl~el~~~l~~l~~~~-~~~~~~al~~~a~~v~~~~k~~ap~~~~~tg~l~~~I~~~~~k~~~~g~~~v-- 75 (127) T protein:vir:12 1 MAD--MSFDGIDDLTQYFEKIGGDI-EKVEPVALKAGGEIIAERQRSHVNRSDKKQPHMQDNITVSNVRESKDGVRFV-- 75 (127) T ss_pred Cee--eeehhHHHHHHHHHHhhHHH-HHHHHHHHHHHHHHHHHHHHHhCCCCCCChhHHHHhhhccccccccCceeEE-- Confidence 555 77889999999999999876 57899999999999999999999975 799999997654322 1121111 Q ss_pred eeeeeccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 77 HIRGVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAID 143 (148) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~ 143 (148) .++.+.+++|||||+||||++|||||||+||++++++++++.|.++|+++|+ T Consensus 76 ---------------~Vg~~~~~~~y~~f~E~GT~~~~a~Pf~~pa~~~~~~~~~~~~~~~~~~~lk 127 (127) T protein:vir:12 76 ---------------AVGPNKKVAYRGRFLEWGTSKMPPQPFIEKGGKEGEGPAVELMERILTAPIK 127 (127) T ss_pred ---------------EEeeCCCCcceeeeeccCccCCCCCccchHhHHHHHHHHHHHHHHHHHHhcC Confidence 1222346789999999999999999999999999999999999999999998 No 19 >protein:vir:101594 Length: 173 # NCBI annotation: hypothetical protein # Family: family:all:26502 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112510;genbank:gi:53793610;interpro:IPR010064;uniprot:Q5ZGE3;genbank:GeneID:3101702 Probab=99.95 E-value=1.9e-31 Score=189.03 Aligned_cols=118 Identities=18% Similarity=0.311 Sum_probs=101.3 Q ss_pred eeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeeeecccc Q lcl|NC_011356. 6 LDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRGVNPDT 85 (148) Q Consensus 6 ~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 85 (148) |+|+|||+|+++|++|++.+ ++++++|+.++|++|+++|+.+||++||+|++||.++..... +. T Consensus 1 i~i~Gld~L~~~L~~l~~~~-~~~~~~a~~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~~~~~-~~-------------- 64 (173) T protein:vir:10 1 MAVKGVAEVIAELRKIGKDI-DKNINATTEEAANFIEDRAKTLAPKNFGKLAQSISTSDLKAK-DL-------------- 64 (173) T ss_pred CcchhHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhCCcCchhhhhcceeeeeccC-ce-------------- Confidence 99999999999999999876 578999999999999999999999999999999987533221 11 Q ss_pred ccccceeecCCCCCcceeeecccCcc------------------------------------------------------ Q lcl|NC_011356. 86 GNSDNTMKADNPRNAFYWRFVEMGTV------------------------------------------------------ 111 (148) Q Consensus 86 ~~~~~~~~~~~~~~~~y~~~~E~GT~------------------------------------------------------ 111 (148) ..+...++.+||.|+||||+ T Consensus 65 ------~~~~v~~~~~Ya~fvEfGT~~m~a~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~ 138 (173) T protein:vir:10 65 ------ISKKITVNELYGAYMEFGTGAKVSVPKEFADMAASFKGQKTGSFKDGLESIKAWCRAKGIDEKAAYPIFAKILG 138 (173) T ss_pred ------eEEeeCCCcccchhhhcccccccCCCchhhhhhcccccccccccccccccccccccccccchhcccceeeEeec Confidence 11223466789999999986 Q ss_pred -CCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 112 -NMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEV 145 (148) Q Consensus 112 -~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~k~ 145 (148) .|||||||+|||+++++++.+.|.+.|+++|+|+ T Consensus 139 ~G~~aqPFl~PA~~~~~~~~~~~i~~~i~~~lrk~ 173 (173) T protein:vir:10 139 AGINPQPFLYPAWIEGKKQYLKDLENLLKTYNKKI 173 (173) T ss_pred CCCCCCccchhHHHHhHHHHHHHHHHHHHHHhhcC Confidence 3899999999999999999999999999999999 No 20 >protein:vir:97088 Length: 157 # NCBI annotation: hypothetical protein # Family: family:all:2714 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453568;genbank:gi:84662603;genbank:GeneID:5142503 Probab=99.95 E-value=1e-30 Score=184.95 Aligned_cols=145 Identities=18% Similarity=0.208 Sum_probs=110.3 Q ss_pred ccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeeee Q lcl|NC_011356. 2 IETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRGV 81 (148) Q Consensus 2 m~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~~ 81 (148) |+++|.-..|++|...|+.|++ .+++++++|+.+||++|+++|+.+||+.+|.|++||.+......++...... .+.+ T Consensus 1 m~~~~~~~d~s~l~~~l~~l~~-~~~~v~R~A~~~ga~vv~dear~~aP~~tG~LkksI~~~~~~~~s~~g~~~~-~Vg~ 78 (157) T protein:vir:97 1 MKFSIRSVDITGILAGLETVVE-HSSDVVRTMTYESAVAVRESAKAFVNDETGKLRNNLYVAYSPEESVEGIQTY-AVSW 78 (157) T ss_pred CeeEeecccHHHHHHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhheeeeeccccCCCceEEE-EEee Confidence 6666654557799999999985 5678999999999999999999999999999999998876655544332211 1122 Q ss_pred ccccc---------cccceeecCCCCCcceeeecccCccC-CCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011356. 82 NPDTG---------NSDNTMKADNPRNAFYWRFVEMGTVN-MPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEVLRR 148 (148) Q Consensus 82 ~~~~~---------~~~~~~~~~~~~~~~y~~~~E~GT~~-~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~k~~kk 148 (148) ....+ .........+....|||+|+|+||.. |||||||||||++.++++.+.+.+++.++|+++++= T Consensus 79 ~~~~a~~g~~vEfG~~~~~~~~~~~~~~~~~~~~~~~t~~~~Pa~PFlRPA~d~~k~~a~~~~~~~l~k~I~e~l~g 155 (157) T protein:vir:97 79 RKKAAPHGHLLEFGHWQTHAAYRDKDGQWYSSKVKLVNPKWIPAKPFLRPGYDSVAMQIPDIARAAGAKKYAELQRG 155 (157) T ss_pred cCCccceeeeeecCcccccccccCCcccccccccccCCCCcCCCCcccchHHHHhHHHHHHHHHHHHHHHHHHHhcC Confidence 11111 11111122334556888888888855 999999999999999999999999999999999998 No 21 >protein:vir:95789 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950593;genbank:gi:119953788;genbank:GeneID:5076859 Probab=99.94 E-value=2.3e-30 Score=183.04 Aligned_cols=114 Identities=18% Similarity=0.188 Sum_probs=97.6 Q ss_pred ceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeeeecc Q lcl|NC_011356. 4 TLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRGVNP 83 (148) Q Consensus 4 ~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~~~~ 83 (148) |+|+|+|+|+|.+.|+.|++.+. +.++.+|.++|..+.++|+.+||++||.|++||.++.. +... T Consensus 1 msi~i~Gld~l~~~l~~~~~~~~-~~v~~al~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~~-----g~~~--------- 65 (114) T protein:vir:95 1 MAIKWQGIEKLVATISNAQPKAV-EQSLQVLKNNGEKGKRIAKQLAPKDTEFLKDHITTSYP-----GMEA--------- 65 (114) T ss_pred CeeeeehHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCcCchhhhhceeeecC-----ceEE--------- Confidence 77888899999999999998876 45689999999999999999999999999999976421 1110 Q ss_pred ccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 84 DTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAID 143 (148) Q Consensus 84 ~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~ 143 (148) ....+.+||+|+||||++|||||||+||++++++++.+.|.+.++++|+ T Consensus 66 -----------~V~~~~~Ya~yvE~GT~~~~aqPfl~pa~~~~~~~~~~~l~~~l~~~~k 114 (114) T protein:vir:95 66 -----------HIHGEAGYDGYQEYGTRFQPGTPHFRPMMEQIQPQFQKDMTDVMKGAFK 114 (114) T ss_pred -----------EeecCCCccceeecCccccCCCccchhhHHHHHHHHHHHHHHHHHhhcC Confidence 0123457999999999999999999999999999999988888888888 No 22 >protein:vir:3617 Length: 112 # NCBI annotation: ORF40 # Family: family:all:180 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112703;genbank:gi:13786571;genbank:GeneID:921069 Probab=99.94 E-value=3.2e-30 Score=182.30 Aligned_cols=112 Identities=18% Similarity=0.367 Sum_probs=93.5 Q ss_pred ccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeeee Q lcl|NC_011356. 2 IETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRGV 81 (148) Q Consensus 2 m~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~~ 81 (148) |+++|+|+|||+|++.|+++.. ++.+++++.+++.+|+++++.++|++||+|++||.+.... ++... T Consensus 1 M~~~i~i~Gld~l~~~L~~~~~---~~~~~~al~~~~~~i~~~ak~~aPvdTG~Lr~si~~~~~~---~~~~~------- 67 (112) T protein:vir:36 1 MKSSLSFKGIDQLVKHLDKAAS---LKGVQQVVKSNTSNMTANMQKLVPVDTGYMKRSIKMELTE---GGFSG------- 67 (112) T ss_pred CceeeeehhHHHHHHHHHhhhh---HHHHHHHHHHHHHHHHHHHHHhCCCCchhhhhceeeeecC---CceEE------- Confidence 9999999999999999998753 3567999999999999999999999999999999764321 11110 Q ss_pred ccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 82 NPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMN 139 (148) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~ 139 (148) ..+++.+||+|+||||++|||||||+||++++++++.+.|.+.++ T Consensus 68 -------------~V~~~~~Ya~~vE~GT~k~~a~Pfl~pa~~~~~~~~~~~i~~~lr 112 (112) T protein:vir:36 68 -------------QAGPHTDYSAYVEYGTRFQSAQPFVKPAYNEQKGVFIKDLERLLK 112 (112) T ss_pred -------------EeecCCCccceeeccccccCCCcchhhhHHHHHHHHHHHHHHHcC Confidence 112456799999999999999999999999999988887776666 No 23 >protein:vir:9708 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795470;genbank:gi:28876221;genbank:GeneID:1257765 Probab=99.93 E-value=2.1e-29 Score=177.79 Aligned_cols=120 Identities=14% Similarity=0.172 Sum_probs=99.6 Q ss_pred eehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcch----hhhhceecccccc-ccceeeeeeeeee Q lcl|NC_011356. 7 DFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGK----LRRNVVVLSRCSR-DGGMESGVHIRGV 81 (148) Q Consensus 7 ~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~----l~~~i~~~~~~~~-~~~~~~~~~~~~~ 81 (148) =|+||+||+++|++|+.+. +++.++|+++||++|.++++.++|++++. |++||.++..... .|... T Consensus 1 mv~Gl~el~~~l~~l~~~~-~~~~~~al~~ga~~~~~~~k~~ap~~~~~~~~hl~d~I~~~~~k~~~~g~~~-------- 71 (125) T protein:vir:97 1 MTKGLDEILANLTKLEVKA-PKTAKAAVTEVAKEFEKALKANTPVYEVETDERLQEDTVISGFKGANVGIVS-------- 71 (125) T ss_pred CchhHHHHHHHHHHhhHHH-HHHHHHHHHHHHHHHHHHHHHhCCcCCCCchhhHHhhhhcccccccccCceE-------- Confidence 5899999999999999875 57899999999999999999999998876 8889876543221 11111 Q ss_pred ccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 82 NPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDE 144 (148) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~k 144 (148) ..++.+..++|||||+||||++|||||||+||+++++++++++|.++|+++|.= T Consensus 72 ---------~~VG~~k~~~~y~~f~E~GT~k~~~~pF~~pa~~~~k~~~~~~~~~~~~~~L~l 125 (125) T protein:vir:97 72 ---------KEIGYGKATGWRAHYPNDGTIYQRGQDFKERTINQMTPKAKQLYAEKVKEGLGL 125 (125) T ss_pred ---------EEEeecCCCceeEeeeccCccCCCcCccchHhHHHhHHHHHHHHHHHHHHHhcC Confidence 112234567899999999999999999999999999999999999999988876 No 24 >protein:vir:106623 Length: 115 # NCBI annotation: ORF049 # Family: family:all:180 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239497;genbank:gi:66395260;genbank:GeneID:4555777 Probab=99.93 E-value=1.9e-29 Score=178.07 Aligned_cols=109 Identities=20% Similarity=0.271 Sum_probs=91.9 Q ss_pred eeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CCCcchhhhhceeccccccccceeeeeeee Q lcl|NC_011356. 6 LDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRA------PVRRGKLRRNVVVLSRCSRDGGMESGVHIR 79 (148) Q Consensus 6 ~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~a------P~~~g~l~~~i~~~~~~~~~~~~~~~~~~~ 79 (148) |+|+|||+|++.|+.+++.+. +.+++++.+++..|+++|+++| |++||.|++||.+.. .+.... T Consensus 1 i~i~Gld~L~~~l~~~~~~~~-~~~~~al~~~~~~i~~~a~~~a~~~~~~pv~TG~Lr~sI~~~~----~g~~~~----- 70 (115) T protein:vir:10 1 MQSKGLKKLMNHLKVMHDDIE-DDVDDILKNNAKEGVGIAVSNAKEVMNKGYWTGNLASLIEVKK----IGDLHY----- 70 (115) T ss_pred CeehhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhccccCCCCcchhhhhceeeee----cCcEEE----- Confidence 999999999999999998765 5679999999999999999998 789999999997642 121111 Q ss_pred eeccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 80 GVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMN 139 (148) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~ 139 (148) ....+++||+|+||||++|||||||+||++++++.+++.|.+.++ T Consensus 71 ---------------~v~~~~~Ya~~vEfGT~km~a~PFl~PA~~~~k~~~~~~i~~~i~ 115 (115) T protein:vir:10 71 ---------------RVISTAHYSGFLEFGTRYMEPAPFMFPTYQTLKKSTINDLKRLLS 115 (115) T ss_pred ---------------EeeCCCccchheecccccCCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 112456899999999999999999999999999988887777777 No 25 >protein:vir:103917 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873996;genbank:gi:118430771;genbank:GeneID:4525409 Probab=99.93 E-value=1.8e-29 Score=178.20 Aligned_cols=109 Identities=19% Similarity=0.312 Sum_probs=91.5 Q ss_pred eeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CCCcchhhhhceeccccccccceeeeeeee Q lcl|NC_011356. 6 LDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRA------PVRRGKLRRNVVVLSRCSRDGGMESGVHIR 79 (148) Q Consensus 6 ~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~a------P~~~g~l~~~i~~~~~~~~~~~~~~~~~~~ 79 (148) |+|+|||+|++.|++|++.+. +.+++++.+++..|.++|+++| |++||+|++||.++. .++.... T Consensus 1 i~~~Gld~l~~~l~~~~~~~~-~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~----~g~~~~~---- 71 (115) T protein:vir:10 1 MNIDGLDALLNQFHDMKTNID-DDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKK----TGDLQYT---- 71 (115) T ss_pred CcchhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeee----cCceEEE---- Confidence 999999999999999998875 5679999999999999999998 899999999997642 1211111 Q ss_pred eeccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 80 GVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMN 139 (148) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~ 139 (148) ..+..+||+|+||||++|||||||+||++++++.+.+.|.+.++ T Consensus 72 ----------------v~~~~~Ya~~vE~GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:10 72 ----------------ITSHAAYSGFLEFGTRYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred ----------------eecCccchhhhcccccccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 12345799999999999999999999999999988887776666 No 26 >protein:vir:9312 Length: 115 # NCBI annotation: phi Mu50B-like protein # Family: family:all:180 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803290;genbank:gi:29028600;genbank:GeneID:1258048 Probab=99.93 E-value=1.8e-29 Score=178.20 Aligned_cols=109 Identities=19% Similarity=0.312 Sum_probs=91.5 Q ss_pred eeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CCCcchhhhhceeccccccccceeeeeeee Q lcl|NC_011356. 6 LDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRA------PVRRGKLRRNVVVLSRCSRDGGMESGVHIR 79 (148) Q Consensus 6 ~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~a------P~~~g~l~~~i~~~~~~~~~~~~~~~~~~~ 79 (148) |+|+|||+|++.|++|++.+. +.+++++.+++..|.++|+++| |++||+|++||.++. .++.... T Consensus 1 i~~~Gld~l~~~l~~~~~~~~-~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~----~g~~~~~---- 71 (115) T protein:vir:93 1 MNIDGLDALLNQFHDMKTNID-DDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKK----TGDLQYT---- 71 (115) T ss_pred CcchhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeee----cCceEEE---- Confidence 999999999999999998875 5679999999999999999998 899999999997642 1211111 Q ss_pred eeccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 80 GVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMN 139 (148) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~ 139 (148) ..+..+||+|+||||++|||||||+||++++++.+.+.|.+.++ T Consensus 72 ----------------v~~~~~Ya~~vE~GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:93 72 ----------------ITSHAAYSGFLEFGTRYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred ----------------eecCccchhhhcccccccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 12345799999999999999999999999999988887776666 No 27 >protein:vir:78858 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285365;genbank:gi:148717893;genbank:GeneID:5246989 Probab=99.93 E-value=1.8e-29 Score=178.20 Aligned_cols=109 Identities=19% Similarity=0.312 Sum_probs=91.5 Q ss_pred eeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CCCcchhhhhceeccccccccceeeeeeee Q lcl|NC_011356. 6 LDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRA------PVRRGKLRRNVVVLSRCSRDGGMESGVHIR 79 (148) Q Consensus 6 ~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~a------P~~~g~l~~~i~~~~~~~~~~~~~~~~~~~ 79 (148) |+|+|||+|++.|++|++.+. +.+++++.+++..|.++|+++| |++||+|++||.++. .++.... T Consensus 1 i~~~Gld~l~~~l~~~~~~~~-~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~----~g~~~~~---- 71 (115) T protein:vir:78 1 MNIDGLDALLNQFHDMKTNID-DDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKK----TGDLQYT---- 71 (115) T ss_pred CcchhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeee----cCceEEE---- Confidence 999999999999999998875 5679999999999999999998 899999999997642 1211111 Q ss_pred eeccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 80 GVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMN 139 (148) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~ 139 (148) ..+..+||+|+||||++|||||||+||++++++.+.+.|.+.++ T Consensus 72 ----------------v~~~~~Ya~~vE~GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:78 72 ----------------ITSHAAYSGFLEFGTRYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred ----------------eecCccchhhhcccccccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 12345799999999999999999999999999988887776666 No 28 >protein:vir:97144 Length: 115 # NCBI annotation: ORF047 # Family: family:all:180 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239729;genbank:gi:66394911;genbank:GeneID:5130877 Probab=99.93 E-value=1.8e-29 Score=178.20 Aligned_cols=109 Identities=19% Similarity=0.312 Sum_probs=91.5 Q ss_pred eeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CCCcchhhhhceeccccccccceeeeeeee Q lcl|NC_011356. 6 LDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRA------PVRRGKLRRNVVVLSRCSRDGGMESGVHIR 79 (148) Q Consensus 6 ~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~a------P~~~g~l~~~i~~~~~~~~~~~~~~~~~~~ 79 (148) |+|+|||+|++.|++|++.+. +.+++++.+++..|.++|+++| |++||+|++||.++. .++.... T Consensus 1 i~~~Gld~l~~~l~~~~~~~~-~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~----~g~~~~~---- 71 (115) T protein:vir:97 1 MNIDGLDALLNQFHDMKTNID-DDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKK----TGDLQYT---- 71 (115) T ss_pred CcchhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeee----cCceEEE---- Confidence 999999999999999998875 5679999999999999999998 899999999997642 1211111 Q ss_pred eeccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 80 GVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMN 139 (148) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~ 139 (148) ..+..+||+|+||||++|||||||+||++++++.+.+.|.+.++ T Consensus 72 ----------------v~~~~~Ya~~vE~GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:97 72 ----------------ITSHAAYSGFLEFGTRYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred ----------------eecCccchhhhcccccccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 12345799999999999999999999999999988887776666 No 29 >protein:vir:96225 Length: 115 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239574;genbank:gi:66395330;genbank:GeneID:5132773 Probab=99.93 E-value=1.8e-29 Score=178.20 Aligned_cols=109 Identities=19% Similarity=0.312 Sum_probs=91.5 Q ss_pred eeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CCCcchhhhhceeccccccccceeeeeeee Q lcl|NC_011356. 6 LDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRA------PVRRGKLRRNVVVLSRCSRDGGMESGVHIR 79 (148) Q Consensus 6 ~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~a------P~~~g~l~~~i~~~~~~~~~~~~~~~~~~~ 79 (148) |+|+|||+|++.|++|++.+. +.+++++.+++..|.++|+++| |++||+|++||.++. .++.... T Consensus 1 i~~~Gld~l~~~l~~~~~~~~-~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~----~g~~~~~---- 71 (115) T protein:vir:96 1 MNIDGLDALLNQFHDMKTNID-DDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKK----TGDLQYT---- 71 (115) T ss_pred CcchhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeee----cCceEEE---- Confidence 999999999999999998875 5679999999999999999998 899999999997642 1211111 Q ss_pred eeccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 80 GVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMN 139 (148) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~ 139 (148) ..+..+||+|+||||++|||||||+||++++++.+.+.|.+.++ T Consensus 72 ----------------v~~~~~Ya~~vE~GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:96 72 ----------------ITSHAAYSGFLEFGTRYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred ----------------eecCccchhhhcccccccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 12345799999999999999999999999999988887776666 No 30 >protein:vir:96358 Length: 115 # NCBI annotation: ORF045 # Family: family:all:180 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239651;genbank:gi:66395408;genbank:GeneID:5132834 Probab=99.93 E-value=1.8e-29 Score=178.20 Aligned_cols=109 Identities=19% Similarity=0.312 Sum_probs=91.5 Q ss_pred eeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CCCcchhhhhceeccccccccceeeeeeee Q lcl|NC_011356. 6 LDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRA------PVRRGKLRRNVVVLSRCSRDGGMESGVHIR 79 (148) Q Consensus 6 ~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~a------P~~~g~l~~~i~~~~~~~~~~~~~~~~~~~ 79 (148) |+|+|||+|++.|++|++.+. +.+++++.+++..|.++|+++| |++||+|++||.++. .++.... T Consensus 1 i~~~Gld~l~~~l~~~~~~~~-~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~----~g~~~~~---- 71 (115) T protein:vir:96 1 MNIDGLDALLNQFHDMKTNID-DDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKK----TGDLQYT---- 71 (115) T ss_pred CcchhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeee----cCceEEE---- Confidence 999999999999999998875 5679999999999999999998 899999999997642 1211111 Q ss_pred eeccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 80 GVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMN 139 (148) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~ 139 (148) ..+..+||+|+||||++|||||||+||++++++.+.+.|.+.++ T Consensus 72 ----------------v~~~~~Ya~~vE~GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:96 72 ----------------ITSHAAYSGFLEFGTRYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred ----------------eecCccchhhhcccccccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 12345799999999999999999999999999988887776666 No 31 >protein:vir:98342 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918934;genbank:gi:119443696;genbank:GeneID:4594504 Probab=99.92 E-value=1e-28 Score=173.99 Aligned_cols=123 Identities=15% Similarity=0.109 Sum_probs=93.0 Q ss_pred ccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcch--hhhhceeccccccccceeeeeeee Q lcl|NC_011356. 2 IETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGK--LRRNVVVLSRCSRDGGMESGVHIR 79 (148) Q Consensus 2 m~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~--l~~~i~~~~~~~~~~~~~~~~~~~ 79 (148) |++++++.||++.+++ |+.+. +++.++|+++||+++++.++.++|++++. |++||.++......+.....+ T Consensus 1 M~v~v~~~~L~~~l~~---l~~~~-~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v--- 73 (125) T protein:vir:98 1 MGARIESNNIEQGLKN---AVLKM-NLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIV--- 73 (125) T ss_pred CeeEeeHHHHHHHHHH---HHHHH-HHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEE--- Confidence 8888887665555554 54443 46778999999999999999999998765 999998875443322211111 Q ss_pred eeccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 80 GVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAID 143 (148) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~ 143 (148) .++++....|||||+||||++||||||++||+++++++++++|.++|++..+ T Consensus 74 ------------~VG~~k~~~~~a~F~E~GT~k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:98 74 ------------TIGYAKGVSHRIHATEFGTMYQKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred ------------EeccCCCCceEEEeccCCccCCCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 1123345569999999999999999999999999999999999988865544 No 32 >protein:vir:4704 Length: 125 # NCBI annotation: phi PVL ORF 11 homologue # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061636;genbank:gi:9635723;genbank:GeneID:1262995 Probab=99.92 E-value=1e-28 Score=173.99 Aligned_cols=123 Identities=15% Similarity=0.109 Sum_probs=93.0 Q ss_pred ccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcch--hhhhceeccccccccceeeeeeee Q lcl|NC_011356. 2 IETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGK--LRRNVVVLSRCSRDGGMESGVHIR 79 (148) Q Consensus 2 m~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~--l~~~i~~~~~~~~~~~~~~~~~~~ 79 (148) |++++++.||++.+++ |+.+. +++.++|+++||+++++.++.++|++++. |++||.++......+.....+ T Consensus 1 M~v~v~~~~L~~~l~~---l~~~~-~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v--- 73 (125) T protein:vir:47 1 MGARIESNNIEQGLKN---AVLKM-NLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIV--- 73 (125) T ss_pred CeeEeeHHHHHHHHHH---HHHHH-HHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEE--- Confidence 8888887665555554 54443 46778999999999999999999998765 999998875443322211111 Q ss_pred eeccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 80 GVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAID 143 (148) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~ 143 (148) .++++....|||||+||||++||||||++||+++++++++++|.++|++..+ T Consensus 74 ------------~VG~~k~~~~~a~F~E~GT~k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:47 74 ------------TIGYAKGVSHRIHATEFGTMYQKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred ------------EeccCCCCceEEEeccCCccCCCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 1123345569999999999999999999999999999999999988865544 No 33 >protein:vir:9414 Length: 125 # NCBI annotation: phi PVL orf 11-like protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803392;genbank:gi:29028704;genbank:GeneID:1258141 Probab=99.92 E-value=1e-28 Score=173.99 Aligned_cols=123 Identities=15% Similarity=0.109 Sum_probs=93.0 Q ss_pred ccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcch--hhhhceeccccccccceeeeeeee Q lcl|NC_011356. 2 IETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGK--LRRNVVVLSRCSRDGGMESGVHIR 79 (148) Q Consensus 2 m~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~--l~~~i~~~~~~~~~~~~~~~~~~~ 79 (148) |++++++.||++.+++ |+.+. +++.++|+++||+++++.++.++|++++. |++||.++......+.....+ T Consensus 1 M~v~v~~~~L~~~l~~---l~~~~-~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v--- 73 (125) T protein:vir:94 1 MGARIESNNIEQGLKN---AVLKM-NLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIV--- 73 (125) T ss_pred CeeEeeHHHHHHHHHH---HHHHH-HHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEE--- Confidence 8888887665555554 54443 46778999999999999999999998765 999998875443322211111 Q ss_pred eeccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 80 GVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAID 143 (148) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~ 143 (148) .++++....|||||+||||++||||||++||+++++++++++|.++|++..+ T Consensus 74 ------------~VG~~k~~~~~a~F~E~GT~k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:94 74 ------------TIGYAKGVSHRIHATEFGTMYQKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred ------------EeccCCCCceEEEeccCCccCCCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 1123345569999999999999999999999999999999999988865544 No 34 >protein:vir:79988 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430006;genbank:gi:156604061;genbank:GeneID:5525448 Probab=99.92 E-value=1e-28 Score=173.99 Aligned_cols=123 Identities=15% Similarity=0.109 Sum_probs=93.0 Q ss_pred ccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcch--hhhhceeccccccccceeeeeeee Q lcl|NC_011356. 2 IETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGK--LRRNVVVLSRCSRDGGMESGVHIR 79 (148) Q Consensus 2 m~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~--l~~~i~~~~~~~~~~~~~~~~~~~ 79 (148) |++++++.||++.+++ |+.+. +++.++|+++||+++++.++.++|++++. |++||.++......+.....+ T Consensus 1 M~v~v~~~~L~~~l~~---l~~~~-~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v--- 73 (125) T protein:vir:79 1 MGARIESNNIEQGLKN---AVLKM-NLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIV--- 73 (125) T ss_pred CeeEeeHHHHHHHHHH---HHHHH-HHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEE--- Confidence 8888887665555554 54443 46778999999999999999999998765 999998875443322211111 Q ss_pred eeccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 80 GVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAID 143 (148) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~ 143 (148) .++++....|||||+||||++||||||++||+++++++++++|.++|++..+ T Consensus 74 ------------~VG~~k~~~~~a~F~E~GT~k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:79 74 ------------TIGYAKGVSHRIHATEFGTMYQKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred ------------EeccCCCCceEEEeccCCccCCCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 1123345569999999999999999999999999999999999988865544 No 35 >protein:vir:81106 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429878;genbank:gi:156603931;genbank:GeneID:5525326 Probab=99.92 E-value=1e-28 Score=173.99 Aligned_cols=123 Identities=15% Similarity=0.109 Sum_probs=93.0 Q ss_pred ccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcch--hhhhceeccccccccceeeeeeee Q lcl|NC_011356. 2 IETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGK--LRRNVVVLSRCSRDGGMESGVHIR 79 (148) Q Consensus 2 m~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~--l~~~i~~~~~~~~~~~~~~~~~~~ 79 (148) |++++++.||++.+++ |+.+. +++.++|+++||+++++.++.++|++++. |++||.++......+.....+ T Consensus 1 M~v~v~~~~L~~~l~~---l~~~~-~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v--- 73 (125) T protein:vir:81 1 MGARIESNNIEQGLKN---AVLKM-NLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIV--- 73 (125) T ss_pred CeeEeeHHHHHHHHHH---HHHHH-HHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEE--- Confidence 8888887665555554 54443 46778999999999999999999998765 999998875443322211111 Q ss_pred eeccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 80 GVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAID 143 (148) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~ 143 (148) .++++....|||||+||||++||||||++||+++++++++++|.++|++..+ T Consensus 74 ------------~VG~~k~~~~~a~F~E~GT~k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:81 74 ------------TIGYAKGVSHRIHATEFGTMYQKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred ------------EeccCCCCceEEEeccCCccCCCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 1123345569999999999999999999999999999999999988865544 No 36 >protein:vir:9930 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795692;genbank:gi:28876456;genbank:GeneID:1257995 Probab=99.92 E-value=3.1e-29 Score=176.84 Aligned_cols=108 Identities=19% Similarity=0.234 Sum_probs=90.8 Q ss_pred ehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeeeecccccc Q lcl|NC_011356. 8 FSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRGVNPDTGN 87 (148) Q Consensus 8 i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 87 (148) |+|||+|++.|+++++++. +.++++|.++|..|.++|+.++|++||+|++||.++.. +..... T Consensus 1 i~Gld~l~~~l~~~~~~~~-~~v~~al~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~~----~~~~~~------------ 63 (108) T protein:vir:99 1 MRGLDRFLRSVERKQKSVR-IAVDKELSKSAARIERQAKILAPVDTGWLRAQIYSEQQ----RLLHYR------------ 63 (108) T ss_pred CchHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhcCCcCchhhhcceeeeec----CcEEEE------------ Confidence 9999999999999998765 67799999999999999999999999999999976432 111111 Q ss_pred ccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 88 SDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNR 140 (148) Q Consensus 88 ~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~ 140 (148) ..++.+||+|+||||++|||||||+||++++++++.+.|.+.|++ T Consensus 64 --------v~~~~~Ya~~vE~GT~~m~a~Pf~~pa~~~~~~~~~~~i~~~lrk 108 (108) T protein:vir:99 64 --------VVSPALYSIYLELGTRKMEAQSFLDPALRKEWPVLMANIKKMFKR 108 (108) T ss_pred --------eecCcccchhcccCccccCCCcchhhhHHHHHHHHHHHHHHHhcC Confidence 124568999999999999999999999999999877776666666 No 37 >protein:vir:99744 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004311;genbank:gi:122891765;genbank:GeneID:4712299 Probab=99.92 E-value=3.5e-29 Score=176.60 Aligned_cols=109 Identities=17% Similarity=0.278 Sum_probs=92.4 Q ss_pred eeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CCCcchhhhhceeccccccccceeeeeeee Q lcl|NC_011356. 6 LDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRA------PVRRGKLRRNVVVLSRCSRDGGMESGVHIR 79 (148) Q Consensus 6 ~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~a------P~~~g~l~~~i~~~~~~~~~~~~~~~~~~~ 79 (148) |+|+|||+|++.|++|++++. +.+++++.+++..|.++|+.+| |++||.|++||.+... +++. T Consensus 1 i~i~Gld~L~~~l~~~~~~~~-~~v~~av~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~SI~~~~~----g~~~------ 69 (115) T protein:vir:99 1 MNIDGLDALLNQFHDMKTNID-DDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKT----VDLQ------ 69 (115) T ss_pred CcchhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhccccCCCCcchhhhhceeeeec----CcEE------ Confidence 999999999999999998765 6779999999999999999998 9999999999976431 2111 Q ss_pred eeccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 80 GVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMN 139 (148) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~ 139 (148) +...++++||+|+||||++|+|||||+|||+++++.+++.|.+.++ T Consensus 70 --------------~~V~~~~~Ya~~vE~GT~~m~a~PFl~PA~~~~k~~~~~~l~~~~k 115 (115) T protein:vir:99 70 --------------YTITSHAAYSGFLEFGTRYMEAEPFMWPVYEVIRKSTVEELKTLFE 115 (115) T ss_pred --------------EEecCCccccccccccccccCCCCcchhhHHHHHHHHHHHHHHHhC Confidence 1112456899999999999999999999999999988887776666 No 38 >protein:vir:743 Length: 108 # NCBI annotation: unknown # Family: family:all:180 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108720;genbank:gi:13487842;genbank:GeneID:920877 Probab=99.92 E-value=1.6e-28 Score=172.95 Aligned_cols=108 Identities=18% Similarity=0.357 Sum_probs=89.1 Q ss_pred eeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeeeecccc Q lcl|NC_011356. 6 LDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRGVNPDT 85 (148) Q Consensus 6 ~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 85 (148) |+|+|||+|++.|+++.. ...++++|.++|..|+++|+.+||++||+|++||.+..... +.. T Consensus 1 i~i~Gld~l~~~l~~~~~---~~~~~~al~~~a~~i~~~ak~~aPv~TG~Lr~si~~~~~~~---~~~------------ 62 (108) T protein:vir:74 1 MKITGIDALQKKLRKNAT---LDDVKHVVKSNTASMNKNMQNLAPVDTGNMKRSITSEFTDG---GLS------------ 62 (108) T ss_pred CcchhHHHHHHHHHHhhh---HHHHHHHHHHHHHHHHHHHHHhCCCCchhhhccceeeeecC---ceE------------ Confidence 999999999999998753 45679999999999999999999999999999997643211 111 Q ss_pred ccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 86 GNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMN 139 (148) Q Consensus 86 ~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~ 139 (148) +...+..+||+|+||||++|||||||+||++++++++.+.|.+.++ T Consensus 63 --------~~V~~~~~Ya~~vE~GT~km~aqpf~~pa~~~~~~~~~~~i~~~~k 108 (108) T protein:vir:74 63 --------GTTGPHTDYAGYVEYGTRFQSAQPFVKPAFNIQKKVFTNDLERLTK 108 (108) T ss_pred --------EEeecCCCcccceeccccccCCCcchhhHHHHHHHHHHHHHHHHcC Confidence 0012445799999999999999999999999999988887766666 No 39 >protein:vir:98409 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:YP_001210363;genbank:gi:146334932;genbank:GeneID:5114801 Probab=99.91 E-value=3.8e-28 Score=170.92 Aligned_cols=108 Identities=20% Similarity=0.369 Sum_probs=88.6 Q ss_pred eeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeeeecccc Q lcl|NC_011356. 6 LDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRGVNPDT 85 (148) Q Consensus 6 ~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 85 (148) |+|+|||+|++.|+.+.. ...+++++++++..|+++|+.+||++||+|++||.+.... ++... T Consensus 1 i~i~Gld~l~~~l~~~~~---~~~~~~al~~~a~~i~~~ak~~apvdTG~Lr~si~~~~~~---~~~~~----------- 63 (108) T protein:vir:98 1 MKITGIDALQKKLRKNAT---LNDVKHVVKRNTVSMNKNMQNLAPVDTGNMKRSITSEFTD---GGLTG----------- 63 (108) T ss_pred CcchhHHHHHHHHHHhhh---HHHHHHHHHHHHHHHHHHHHHhCCCCchhhHhhceeeeec---CceEE----------- Confidence 999999999999998753 3567899999999999999999999999999999754321 11110 Q ss_pred ccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 86 GNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMN 139 (148) Q Consensus 86 ~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~ 139 (148) ...+..+||+|+||||++|||||||+||++++++++.+.|.+.++ T Consensus 64 ---------~V~~~~~Ya~~vE~GT~~m~aqPFl~pa~~~~~~~~~~~i~~~lr 108 (108) T protein:vir:98 64 ---------TTIPHTDYAGYVEYGTRFQAAQPFVKPAFDVQKKIFTNDLERLTK 108 (108) T ss_pred ---------EeecCCCccceeeccccccCCCcchhhHHHHHHHHHHHHHHHHcC Confidence 012445799999999999999999999999999988887766666 No 40 >protein:vir:2740 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695113;genbank:gi:23455882;genbank:GeneID:955595 Probab=99.90 E-value=1.7e-27 Score=167.28 Aligned_cols=113 Identities=19% Similarity=0.291 Sum_probs=88.7 Q ss_pred CccceeeehhHHHHHHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLS-GAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIR 79 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~-~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~ 79 (148) |. +|+|+|||+|++.|+++. ....++++++++.+.++.+++.|+.++|++||+|++||.++... ++.. T Consensus 1 Ma--~i~~~Gld~l~~~L~~~~~~~~v~~~~~~~~~~~~~~~~~~a~~~~p~~TG~Lr~sI~~~~~~---~~~~------ 69 (114) T protein:vir:27 1 MA--TIEFEGLDEMAQSLLKNASPEKRSKVLRKYGSKLKEAAVNRAQFNKGYSTGATRRSITLQVES---DKAT------ 69 (114) T ss_pred Ce--eeeeehHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcccCCCCCchhhhhceeeeecC---CeeE------ Confidence 44 588899999999999984 22235677777777777777777778899999999999765321 1110 Q ss_pred eeccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 80 GVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNR 140 (148) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~ 140 (148) ...+.+||+|+||||++|||||||+||++++++.+++.|.+.++. T Consensus 70 ----------------V~~~~~Ya~~vEfGT~km~a~Pfl~PA~~~~~~~~~~~l~~l~k~ 114 (114) T protein:vir:27 70 ----------------VEALTSYSGYLEVGTRKMEAQPFMKPALDEVAPKMVEELAKWDET 114 (114) T ss_pred ----------------ecCCCCccceecccccccCCCCchhhhHHHHHHHHHHHHHHHhcC Confidence 123457999999999999999999999999999988888877777 No 41 >protein:vir:4906 Length: 114 # NCBI annotation: gp114 # Family: family:all:180 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056684;genbank:gi:9635019;genbank:GeneID:1262668 Probab=99.90 E-value=1.7e-27 Score=167.28 Aligned_cols=113 Identities=19% Similarity=0.291 Sum_probs=88.7 Q ss_pred CccceeeehhHHHHHHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLS-GAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIR 79 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~-~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~ 79 (148) |. +|+|+|||+|++.|+++. ....++++++++.+.++.+++.|+.++|++||+|++||.++... ++.. T Consensus 1 Ma--~i~~~Gld~l~~~L~~~~~~~~v~~~~~~~~~~~~~~~~~~a~~~~p~~TG~Lr~sI~~~~~~---~~~~------ 69 (114) T protein:vir:49 1 MA--TIEFEGLDEMAQSLLKNASPEKRSKVLRKYGSKLKEAAVNRAQFNKGYSTGATRRSITLQVES---DKAT------ 69 (114) T ss_pred Ce--eeeeehHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcccCCCCCchhhhhceeeeecC---CeeE------ Confidence 44 588899999999999984 22235677777777777777777778899999999999765321 1110 Q ss_pred eeccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 80 GVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNR 140 (148) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~ 140 (148) ...+.+||+|+||||++|||||||+||++++++.+++.|.+.++. T Consensus 70 ----------------V~~~~~Ya~~vEfGT~km~a~Pfl~PA~~~~~~~~~~~l~~l~k~ 114 (114) T protein:vir:49 70 ----------------VEALTSYSGYLEVGTRKMEAQPFMKPALDEVAPKMVEELAKWDET 114 (114) T ss_pred ----------------ecCCCCccceecccccccCCCCchhhhHHHHHHHHHHHHHHHhcC Confidence 123457999999999999999999999999999988888877777 No 42 >protein:vir:106570 Length: 182 # NCBI annotation: putative protein # Family: family:all:6475 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958588;genbank:gi:41179258;genbank:GeneID:2717106 Probab=99.90 E-value=1e-26 Score=163.07 Aligned_cols=145 Identities=17% Similarity=0.191 Sum_probs=93.1 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHH---HHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeee- Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENN---RVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGV- 76 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~---~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~- 76 (148) ||.+ +|.|+|+|.++|+++++.+.+ +++.+++.+++..|+++|+.++|++||+|++||.+..... .+.+...+ T Consensus 1 m~~v--~i~Gld~L~~kl~~~~~~~~~~v~~a~~~~~~~~a~~v~~~ak~~~PvdtG~Lr~SI~~~~~~~-~~~~~g~V~ 77 (182) T protein:vir:10 1 MIEV--ELKGVNELRAKLKKLPDIMAKATANAQENAIEQAEAYAVDELQSSIKYSTGELTRSFKHEVKVD-GDEVIGRWW 77 (182) T ss_pred CeEE--EEecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCchhhhhceeeeeeec-CCeEEEEee Confidence 7766 777999999999999976543 3456666777788889999999999999999997643322 11111111 Q ss_pred ----eeeeecccccc---------ccceeecCCCCCc-ceeeecc-------------------cCccCCCCCchhHHHH Q lcl|NC_011356. 77 ----HIRGVNPDTGN---------SDNTMKADNPRNA-FYWRFVE-------------------MGTVNMPPHPFVRPAF 123 (148) Q Consensus 77 ----~~~~~~~~~~~---------~~~~~~~~~~~~~-~y~~~~E-------------------~GT~~~~a~PFl~pA~ 123 (148) +...+..++|. .........+.++ +|++.++ |+|..|||||||+||+ T Consensus 78 ~~~~ya~yvE~GTG~~~~~~~~~~~p~~~~~~~~~~w~~~~~~v~~~~a~~~~~~~~~~~~~~~~~t~G~~aqPFl~pA~ 157 (182) T protein:vir:10 78 NSSMVAVFREFGTGLVGERSHKQLPKNVAIIYRQTPWFFPVDSVDLDLTKIYGIPKIKINGKYFYRTTGQPARQFMTPAA 157 (182) T ss_pred cCCCccceeecCcccccccCccccCccceeeeecCCceeeccccccccccccccceeeecCceEeecCCCCCCcchHHHH Confidence 11112212211 0001111111111 1223322 5688999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011356. 124 DVRSEQAAQVAIARMNRAIDEVLRR 148 (148) Q Consensus 124 ~~~~~~~~~~~~~~~~~~i~k~~kk 148 (148) +++++++.+.|.+.++++|++.+-= T Consensus 158 ~~~~~~i~~~i~~~i~~~l~~~~g~ 182 (182) T protein:vir:10 158 NKMAKEAPEIIKRSIDQELHDKLGG 182 (182) T ss_pred HHhHHHHHHHHHHHHHHHHHHhhcC Confidence 9999988888777777777766655 No 43 >protein:vir:102154 Length: 119 # NCBI annotation: phage protein, HK97 gp10 family # Family: family:all:10671 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699937;genbank:gi:110804042;genbank:GeneID:4206698 Probab=99.90 E-value=1.6e-27 Score=167.54 Aligned_cols=118 Identities=19% Similarity=0.336 Sum_probs=98.4 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) |. +++|.|+|||...|++|+.. .+++.++||++|+++|++++..++|++||.|+. |+.+. ...| T Consensus 1 Ma--~iel~G~del~~~l~~~g~~-~~~ie~kAlk~g~e~I~~~~~~n~P~~tg~lkk-ik~~~--kk~g---------- 64 (119) T protein:vir:10 1 MA--SLEIEGFEEFEKFISEDMVL-DESTKRKGIKAGITKIGKAIEKNSPIKSGRLSK-VKIRV--KNTG---------- 64 (119) T ss_pred Cc--eeehhhHHHHHHHHHhhhhh-hHHHHHHHHHHHhHHHHHHHhhcCCcccCCcce-eeeee--ecCc---------- Confidence 43 67788999999999999965 578999999999999999999999999999885 33221 1111 Q ss_pred eccccccccceeecCCCCCcceeeecccCccCCCCC-chhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 81 VNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPH-PFVRPAFDVRSEQAAQVAIARMNRAID 143 (148) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~-PFl~pA~~~~~~~~~~~~~~~~~~~i~ 143 (148) ...++.+.+..||..|+|||||+|||| |||.||++++++++++.|.++|.+.++ T Consensus 65 ---------~~~VG~~ks~~fy~kF~EFGTSkm~a~~pF~~~a~~~~~~eA~~~~~~el~~~~r 119 (119) T protein:vir:10 65 ---------LATEGTASSSEFYDIFQNFGTSEQKAHVGYFDRAVDETTNEAVEEVAEIIFRKMR 119 (119) T ss_pred ---------eeEeccCCcchhhhhhccccccccCCCCCccccccccChHHHHHHHHHHHHHhcC Confidence 112233456789999999999999999 999999999999999999999999998 No 44 >protein:vir:96486 Length: 112 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238496;genbank:gi:66391772;genbank:GeneID:5176908 Probab=99.89 E-value=8.9e-27 Score=163.39 Aligned_cols=111 Identities=22% Similarity=0.331 Sum_probs=88.1 Q ss_pred CccceeeehhHHHHHHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSG-AENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIR 79 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~-~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~ 79 (148) |.+ |+|+|||+|+++|+.++. +..++++++++.+.+..+++.|+.++|++||+|++||.++. ++.. T Consensus 1 Ma~--i~i~Gld~L~~~l~~~~~~~~v~~~v~~~~~~~~~~~~~~a~~~apvdTG~Lr~sI~~~~-----~~~~------ 67 (112) T protein:vir:96 1 MAT--IEFEGLDEMAQSLLKNASSERRSKVLRKYGAKLKEAAVSKAQFKKGYSTGATRRSITLEA-----GSDR------ 67 (112) T ss_pred Cce--eeehHHHHHHHHHHhhcCHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCchhhhhceeeec-----CceE------ Confidence 444 788899999999999842 23467899999999999999999999999999999997532 1111 Q ss_pred eeccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 80 GVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARM 138 (148) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~ 138 (148) +...++.+||+|+||||++|||||||+|||+++++.+++.+++-- T Consensus 68 --------------~~v~~~~~Ya~~vE~GTr~m~AqPF~~PA~~~~~~~~~~~l~~L~ 112 (112) T protein:vir:96 68 --------------AVVEALTNYSGYLEVGTRKMEAQPFMRPALDQVVPEMVEEMAKWE 112 (112) T ss_pred --------------EEecCCCCccceeccCccccCCCCchhhhHHHHHHHHHHHHHhcC Confidence 011244579999999999999999999999999997766655422 No 45 >protein:vir:5978 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690678;genbank:geneid:6329146;genbank:gi:22855072;interpro:IPR011693;uniprot:O48447;genbank:GeneID:955318 Probab=99.85 E-value=1.6e-24 Score=150.99 Aligned_cols=115 Identities=23% Similarity=0.278 Sum_probs=92.6 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) +|+++++++|+++|.+.|+++++.+. +.+++++.++|+.++++|+.++|++||+|++||.+..... + +... T Consensus 3 ~ms~~i~~~g~~~l~~~l~~~~~~~~-~~v~~~l~~~a~~i~~~ak~~apv~TG~Lr~SI~~~~~~~--g-~~~~----- 73 (144) T protein:vir:59 3 LMSVRIDPSWRRIMSRNVRTFSGHVL-TQVEQVIIKTAEKIAGLAASLAPVDEGNLKNSIQIDYKNN--G-LTAE----- 73 (144) T ss_pred cceeeehhHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCccchhhhcCeeEEeecC--c-EEEE----- Confidence 56667778999999999999998865 6779999999999999999999999999999998653211 1 1100 Q ss_pred eccccccccceeecCCCCCcceeeecccCc---------------------------cCCCCCchhHHHHHHHHHHHHHH Q lcl|NC_011356. 81 VNPDTGNSDNTMKADNPRNAFYWRFVEMGT---------------------------VNMPPHPFVRPAFDVRSEQAAQV 133 (148) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT---------------------------~~~~a~PFl~pA~~~~~~~~~~~ 133 (148) ..++..|+.|+|||| .+|||||||+||++.+++.+.+. T Consensus 74 ---------------V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~~~~~ 138 (144) T protein:vir:59 74 ---------------ITVGAEYAIYVEYGTGIYAVDGNGRKTPWTYYSPKLGRYVRTQGAPAQPFFWPAVEEGGEYFERE 138 (144) T ss_pred ---------------EecCCCccchhhcCccccccCCCccccccccccccccceecCCCCCCCcchhHHHHHHHHHHHHH Confidence 012345888888887 56999999999999999988887 Q ss_pred HHHHHH Q lcl|NC_011356. 134 AIARMN 139 (148) Q Consensus 134 ~~~~~~ 139 (148) |++.+- T Consensus 139 i~~~~g 144 (144) T protein:vir:59 139 MRRLRG 144 (144) T ss_pred HHHhcC Confidence 777777 No 46 >protein:vir:97427 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240753;genbank:gi:66396447;genbank:GeneID:5133783 Probab=99.84 E-value=2.2e-24 Score=150.31 Aligned_cols=108 Identities=21% Similarity=0.257 Sum_probs=88.3 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) |.++ +.|+++|++.|+++++++. +.+++++.+++..|+++|+.++|++||.|++||.+..... +... T Consensus 1 Ma~~---~~g~~~l~~~l~~~~~~~~-~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~---~~~~------ 67 (137) T protein:vir:97 1 MAKV---KYGNWDLVKELENYERDME-RWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDS---GFTG------ 67 (137) T ss_pred Cchh---HHhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEeecC---ceEE------ Confidence 5554 4599999999999998875 6889999999999999999999999999999997643211 1100 Q ss_pred eccccccccceeecCCCCCcceeeecccCc-----------------------------cCCCCCchhHHHHHHHHHHHH Q lcl|NC_011356. 81 VNPDTGNSDNTMKADNPRNAFYWRFVEMGT-----------------------------VNMPPHPFVRPAFDVRSEQAA 131 (148) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT-----------------------------~~~~a~PFl~pA~~~~~~~~~ 131 (148) ...++..|++|+|||| ++|||||||+||++++++.+. T Consensus 68 --------------~V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~ 133 (137) T protein:vir:97 68 --------------VINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFN 133 (137) T ss_pred --------------EEecCCCcccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHH Confidence 0123456899999998 679999999999999999999 Q ss_pred HHHH Q lcl|NC_011356. 132 QVAI 135 (148) Q Consensus 132 ~~~~ 135 (148) +.|. T Consensus 134 ~~l~ 137 (137) T protein:vir:97 134 KYFS 137 (137) T ss_pred HhhC Confidence 9888 No 47 >protein:vir:94490 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240680;genbank:gi:66396374;genbank:GeneID:5133754 Probab=99.84 E-value=2.2e-24 Score=150.31 Aligned_cols=108 Identities=21% Similarity=0.257 Sum_probs=88.3 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) |.++ +.|+++|++.|+++++++. +.+++++.+++..|+++|+.++|++||.|++||.+..... +... T Consensus 1 Ma~~---~~g~~~l~~~l~~~~~~~~-~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~---~~~~------ 67 (137) T protein:vir:94 1 MAKV---KYGNWDLVKELENYERDME-RWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDS---GFTG------ 67 (137) T ss_pred Cchh---HHhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEeecC---ceEE------ Confidence 5554 4599999999999998875 6889999999999999999999999999999997643211 1100 Q ss_pred eccccccccceeecCCCCCcceeeecccCc-----------------------------cCCCCCchhHHHHHHHHHHHH Q lcl|NC_011356. 81 VNPDTGNSDNTMKADNPRNAFYWRFVEMGT-----------------------------VNMPPHPFVRPAFDVRSEQAA 131 (148) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT-----------------------------~~~~a~PFl~pA~~~~~~~~~ 131 (148) ...++..|++|+|||| ++|||||||+||++++++.+. T Consensus 68 --------------~V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~ 133 (137) T protein:vir:94 68 --------------VINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFN 133 (137) T ss_pred --------------EEecCCCcccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHH Confidence 0123456899999998 679999999999999999999 Q ss_pred HHHH Q lcl|NC_011356. 132 QVAI 135 (148) Q Consensus 132 ~~~~ 135 (148) +.|. T Consensus 134 ~~l~ 137 (137) T protein:vir:94 134 KYFS 137 (137) T ss_pred HhhC Confidence 9888 No 48 >protein:vir:93738 Length: 137 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240463;genbank:gi:66396153;genbank:GeneID:5133507 Probab=99.84 E-value=2.2e-24 Score=150.31 Aligned_cols=108 Identities=21% Similarity=0.257 Sum_probs=88.3 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) |.++ +.|+++|++.|+++++++. +.+++++.+++..|+++|+.++|++||.|++||.+..... +... T Consensus 1 Ma~~---~~g~~~l~~~l~~~~~~~~-~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~---~~~~------ 67 (137) T protein:vir:93 1 MAKV---KYGNWDLVKELENYERDME-RWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDS---GFTG------ 67 (137) T ss_pred Cchh---HHhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEeecC---ceEE------ Confidence 5554 4599999999999998875 6889999999999999999999999999999997643211 1100 Q ss_pred eccccccccceeecCCCCCcceeeecccCc-----------------------------cCCCCCchhHHHHHHHHHHHH Q lcl|NC_011356. 81 VNPDTGNSDNTMKADNPRNAFYWRFVEMGT-----------------------------VNMPPHPFVRPAFDVRSEQAA 131 (148) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT-----------------------------~~~~a~PFl~pA~~~~~~~~~ 131 (148) ...++..|++|+|||| ++|||||||+||++++++.+. T Consensus 68 --------------~V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~ 133 (137) T protein:vir:93 68 --------------VINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFN 133 (137) T ss_pred --------------EEecCCCcccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHH Confidence 0123456899999998 679999999999999999999 Q ss_pred HHHH Q lcl|NC_011356. 132 QVAI 135 (148) Q Consensus 132 ~~~~ 135 (148) +.|. T Consensus 134 ~~l~ 137 (137) T protein:vir:93 134 KYFS 137 (137) T ss_pred HhhC Confidence 9888 No 49 >protein:vir:107099 Length: 137 # NCBI annotation: conserved phage protein # Family: family:all:180 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950610;genbank:gi:119953690;genbank:GeneID:4643108 Probab=99.84 E-value=2.6e-24 Score=149.89 Aligned_cols=108 Identities=20% Similarity=0.277 Sum_probs=84.9 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) |.++ +.|+|+|++.|+.+++.+. +.++++|.+++..|+++|+.+||++||.|++||.+..... ++...+ T Consensus 1 Ma~~---~~Gl~~l~~~l~~~~~~~~-~~~~~al~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~---~~~~~V---- 69 (137) T protein:vir:10 1 MAKV---KYGNWELVKELEDFEKETI-RWAKKGIAKTTTIIHNSIVSNMPVDTGYLRESVSMDFKKG---GLTGVI---- 69 (137) T ss_pred Cchh---HhhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCeeEEeeCC---cEEEEE---- Confidence 6666 3599999999999998875 6779999999999999999999999999999997643211 111000 Q ss_pred eccccccccceeecCCCCCcceeeecccCc-----------------------------cCCCCCchhHHHHHHHHHHHH Q lcl|NC_011356. 81 VNPDTGNSDNTMKADNPRNAFYWRFVEMGT-----------------------------VNMPPHPFVRPAFDVRSEQAA 131 (148) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT-----------------------------~~~~a~PFl~pA~~~~~~~~~ 131 (148) .++..|++|+|||| ++|||||||+||++++++++. T Consensus 70 ----------------~~~~~Ya~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~i~ 133 (137) T protein:vir:10 70 ----------------NIGSEYAVYVNYGTGIYAVGPGGSRAKNIPWCYKDADGHWHTTKGQHAQPFWEPAIDEGRAFFN 133 (137) T ss_pred ----------------ecCCCcccccccCccccccCCCccccccccceeeccccceeccCCCCCCcchhHHHHHHHHHHH Confidence 12234666666665 568999999999999999999 Q ss_pred HHHH Q lcl|NC_011356. 132 QVAI 135 (148) Q Consensus 132 ~~~~ 135 (148) +.|. T Consensus 134 k~i~ 137 (137) T protein:vir:10 134 KYFS 137 (137) T ss_pred HhcC Confidence 9888 No 50 >protein:vir:94108 Length: 149 # NCBI annotation: ORF029 # Family: family:all:180 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240238;genbank:gi:66395914;genbank:GeneID:5133277 Probab=99.84 E-value=2.2e-24 Score=150.25 Aligned_cols=108 Identities=22% Similarity=0.269 Sum_probs=87.5 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) |.++.+ |+|+|.+.|+++++++. +++++++.+++..|+++|+.++|++||.|++||.++...+ ++.. T Consensus 13 Ma~~~~---Gld~l~~~L~~~~~~~~-~~~~~al~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~---g~~~------ 79 (149) T protein:vir:94 13 MAKVKY---GADSMVVELDKFDKKIE-EWVKKGIAKTTTKIYNTAVALAPVDLGFLEESIDFKYFDG---GLSS------ 79 (149) T ss_pred HHHHHH---HHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCcccchhhcCeeEEeeCC---cEEE------ Confidence 777643 99999999999998875 6889999999999999999999999999999997643211 1110 Q ss_pred eccccccccceeecCCCCCcceeeecccCc-----------------------------cCCCCCchhHHHHHHHHHHHH Q lcl|NC_011356. 81 VNPDTGNSDNTMKADNPRNAFYWRFVEMGT-----------------------------VNMPPHPFVRPAFDVRSEQAA 131 (148) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT-----------------------------~~~~a~PFl~pA~~~~~~~~~ 131 (148) ...++..|++|+|||| ..|||||||+||++++++++. T Consensus 80 --------------~V~~~~~YA~~VE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~~~~i~ 145 (149) T protein:vir:94 80 --------------VISVGADYAIYVEYGTGIYATGPGGSRATKIPWSFKGDDGEWYTTYGQAPQPFWNPAIDAGRKTFE 145 (149) T ss_pred --------------EEecCCCcccccccCccccccCCCccccccccceeecCccceecCCCCCCCcchHHHHHHHHHHHH Confidence 0123346888888888 457899999999999999888 Q ss_pred HHHH Q lcl|NC_011356. 132 QVAI 135 (148) Q Consensus 132 ~~~~ 135 (148) +.|. T Consensus 146 ~~i~ 149 (149) T protein:vir:94 146 QYFS 149 (149) T ss_pred HhhC Confidence 8888 No 51 >protein:vir:94796 Length: 137 # NCBI annotation: ORF050 # Family: family:all:180 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240540;genbank:gi:66396237;genbank:GeneID:5133576 Probab=99.84 E-value=3.1e-24 Score=149.45 Aligned_cols=108 Identities=20% Similarity=0.246 Sum_probs=88.9 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) |.++++ |+|+|.+.|+++++++. +.+++++.+++..|+++|+.++|++||+|++||.+..... +.... T Consensus 1 Ma~~~~---G~~~l~~~L~~~~~~~~-~~~~~al~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~---~~~~~----- 68 (137) T protein:vir:94 1 MAKVKY---GNWDLVKELENYERDIE-RWVKRGIAKTTVKIHNTIISLMPVDTGYLRESVTMDFKDG---GFTGV----- 68 (137) T ss_pred CchhHH---hHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCceeEeecC---cEEEE----- Confidence 777753 99999999999999875 6889999999999999999999999999999997643211 11100 Q ss_pred eccccccccceeecCCCCCcceeeecccC-----------------------------ccCCCCCchhHHHHHHHHHHHH Q lcl|NC_011356. 81 VNPDTGNSDNTMKADNPRNAFYWRFVEMG-----------------------------TVNMPPHPFVRPAFDVRSEQAA 131 (148) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~y~~~~E~G-----------------------------T~~~~a~PFl~pA~~~~~~~~~ 131 (148) ..++..|++|+||| |++|||||||+||++++++++. T Consensus 69 ---------------V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~ 133 (137) T protein:vir:94 69 ---------------INIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRVFFN 133 (137) T ss_pred ---------------EecCCCcccccccCccccccCCCcccccccccceeccCCceeecCCcCCCcchHHHHHHHHHHHH Confidence 11345688888888 5679999999999999999999 Q ss_pred HHHH Q lcl|NC_011356. 132 QVAI 135 (148) Q Consensus 132 ~~~~ 135 (148) +.|. T Consensus 134 ~~l~ 137 (137) T protein:vir:94 134 KYFS 137 (137) T ss_pred HhhC Confidence 9888 No 52 >protein:vir:105916 Length: 149 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004379;genbank:gi:122891834;genbank:GeneID:4712387 Probab=99.84 E-value=2.7e-24 Score=149.80 Aligned_cols=108 Identities=22% Similarity=0.267 Sum_probs=87.2 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) |.++.+ |+|+|.+.|+++++++. +++++++.+++..|+++|+.++|++||.|++||.+....+ ++... T Consensus 13 Ma~v~~---Gld~l~~~l~~~~~~~~-~~~~~~l~~~a~~v~~~ak~~aPvdTG~L~~SI~~~~~~~---g~~~~----- 80 (149) T protein:vir:10 13 MAKVKY---GADSMVVELDKFDKKIE-EWVKKGIAKTTTKIYNTAVALAPVDLGFLEESIDFKYFDG---GLSSV----- 80 (149) T ss_pred hHHHHH---HHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCcccchhhccceEEecCC---cEEEE----- Confidence 777643 99999999999998875 6889999999999999999999999999999997643211 11100 Q ss_pred eccccccccceeecCCCCCcceeeecccCc-----------------------------cCCCCCchhHHHHHHHHHHHH Q lcl|NC_011356. 81 VNPDTGNSDNTMKADNPRNAFYWRFVEMGT-----------------------------VNMPPHPFVRPAFDVRSEQAA 131 (148) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT-----------------------------~~~~a~PFl~pA~~~~~~~~~ 131 (148) ..++..|+.|+|||| .+|||||||+||++++++++. T Consensus 81 ---------------V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~k~~i~ 145 (149) T protein:vir:10 81 ---------------ISVGADYAIYVEYGTGIYATGPGGSRATKIPWSFKGDDGEWYTTYGQAPQPFWNPAIDAGRKTFE 145 (149) T ss_pred ---------------EecCCCcccccccCccccccCCcccccccccceeeccccceecCCCCCCCcchhHHHHHHHHHHH Confidence 113345888888887 558899999999999999988 Q ss_pred HHHH Q lcl|NC_011356. 132 QVAI 135 (148) Q Consensus 132 ~~~~ 135 (148) +.|. T Consensus 146 ~~i~ 149 (149) T protein:vir:10 146 QYFS 149 (149) T ss_pred HhhC Confidence 8888 No 53 >protein:vir:95894 Length: 137 # NCBI annotation: ORF046 # Family: family:all:180 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240389;genbank:gi:66396083;genbank:GeneID:5133405 Probab=99.84 E-value=4.1e-24 Score=148.77 Aligned_cols=108 Identities=21% Similarity=0.269 Sum_probs=88.0 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) |.++ +.|+++|.+.|+++++++. ++++.++.+++..++++|+.++|++||.|++||.+.... ++.... T Consensus 1 Ma~~---~~G~~~l~~~l~~~~~~~~-~~~~~~~~~~a~~v~~~ak~~aPv~TG~L~~Si~~~~~~---~~~~~~----- 68 (137) T protein:vir:95 1 MAKV---KYGNWDLVKELENYERDME-RWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKD---GGFTGV----- 68 (137) T ss_pred Cchh---HHhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCccchhhhcCeeeEeeC---CceEEE----- Confidence 5555 4699999999999998864 788999999999999999999999999999999754321 111100 Q ss_pred eccccccccceeecCCCCCcceeeecccCc-----------------------------cCCCCCchhHHHHHHHHHHHH Q lcl|NC_011356. 81 VNPDTGNSDNTMKADNPRNAFYWRFVEMGT-----------------------------VNMPPHPFVRPAFDVRSEQAA 131 (148) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT-----------------------------~~~~a~PFl~pA~~~~~~~~~ 131 (148) ..++..|++|+|||| ++|||||||+||++++++++. T Consensus 69 ---------------V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~i~ 133 (137) T protein:vir:95 69 ---------------INIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFN 133 (137) T ss_pred ---------------EecCCCcccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHH Confidence 123456888899888 679999999999999999999 Q ss_pred HHHH Q lcl|NC_011356. 132 QVAI 135 (148) Q Consensus 132 ~~~~ 135 (148) +.|. T Consensus 134 k~l~ 137 (137) T protein:vir:95 134 KYFS 137 (137) T ss_pred HhhC Confidence 9888 No 54 >protein:vir:105330 Length: 137 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950673;genbank:gi:119967843;genbank:GeneID:4643209 Probab=99.83 E-value=5.4e-24 Score=148.15 Aligned_cols=108 Identities=21% Similarity=0.296 Sum_probs=84.7 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) |.++.+ |+|+|.+.|+++++.+. +.++.+|..++..|+++|+.+||++||+|++||.+..... ++...+ T Consensus 1 Ma~~~~---G~~~l~~~l~~~~~~~~-~~~~~al~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~---~~~~~V---- 69 (137) T protein:vir:10 1 MAKVKY---GNWDLVKELEEFEKETI-RWAKKGIAKTTTIIHNSIVSNMPVDTGYLRESVSMDFKKG---GLTGVI---- 69 (137) T ss_pred Cccchh---CHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCeeeEecCC---cEEEEE---- Confidence 666653 99999999999998875 6779999999999999999999999999999997643211 111100 Q ss_pred eccccccccceeecCCCCCcceeeecccCc-----------------------------cCCCCCchhHHHHHHHHHHHH Q lcl|NC_011356. 81 VNPDTGNSDNTMKADNPRNAFYWRFVEMGT-----------------------------VNMPPHPFVRPAFDVRSEQAA 131 (148) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT-----------------------------~~~~a~PFl~pA~~~~~~~~~ 131 (148) .++..|++|+|||| ++|||||||+||++++++++. T Consensus 70 ----------------~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~Pfl~pA~~~~~~~i~ 133 (137) T protein:vir:10 70 ----------------NIGSEYAVYVNYGTGIYAVGPGGSRAKNIPWRYKDADGHWHTTKGQHAQPFWEPAIDEGRAFFN 133 (137) T ss_pred ----------------ecCCccccccccCccccccCCCcccccccceeeeccccccccCCCCCCCcchhHHHHHHHHHHH Confidence 12234555666555 569999999999999999999 Q ss_pred HHHH Q lcl|NC_011356. 132 QVAI 135 (148) Q Consensus 132 ~~~~ 135 (148) +.|. T Consensus 134 k~i~ 137 (137) T protein:vir:10 134 KYFS 137 (137) T ss_pred HhhC Confidence 9888 No 55 >protein:vir:96121 Length: 137 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240082;genbank:gi:66395767;genbank:GeneID:5133101 Probab=99.82 E-value=2.1e-23 Score=144.93 Aligned_cols=108 Identities=21% Similarity=0.221 Sum_probs=87.7 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) |.++. .|+|+|++.|+.+++.+. ++++++|.+++..++++|+.++|++||.|++||.+....+ +.... T Consensus 1 Ma~~~---~G~~~l~~~l~~~~~~~~-~~~~~~l~~~a~~~~~~ak~~~pvdTG~L~~Si~~~~~~~---g~~~~----- 68 (137) T protein:vir:96 1 MAKVK---YGNWDLVAELEDYRDEME-EWVKKGILKTTLAIYNTAVALAPVDLGFLKESIDFKVTDG---GFSSV----- 68 (137) T ss_pred CchhH---hhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCcCccchhcCceeEeecC---ceEEE----- Confidence 66664 499999999999998875 6789999999999999999999999999999997543211 11100 Q ss_pred eccccccccceeecCCCCCcceeeecccCc-----------------------------cCCCCCchhHHHHHHHHHHHH Q lcl|NC_011356. 81 VNPDTGNSDNTMKADNPRNAFYWRFVEMGT-----------------------------VNMPPHPFVRPAFDVRSEQAA 131 (148) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT-----------------------------~~~~a~PFl~pA~~~~~~~~~ 131 (148) ..++..|+.|+|||| ..|||||||+||++++++.+. T Consensus 69 ---------------V~~~~~YA~yvE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~pFl~pA~~~~~~~i~ 133 (137) T protein:vir:96 69 ---------------ISVGAEYAIYVEFGTGIYATGPGGSRARKLPWTYKGDDGEWHTTYGQQAQPFWNPAIDEGRKVFN 133 (137) T ss_pred ---------------EecCCCcccccccCccccccCCCccccccccceeeccCcceeecCCCCCCcchhHHHHHHHHHHH Confidence 113346999999998 558999999999999999888 Q ss_pred HHHH Q lcl|NC_011356. 132 QVAI 135 (148) Q Consensus 132 ~~~~ 135 (148) +.|. T Consensus 134 k~i~ 137 (137) T protein:vir:96 134 RYFS 137 (137) T ss_pred HhhC Confidence 8888 No 56 >protein:vir:96829 Length: 135 # NCBI annotation: ORF033 # Family: family:all:180 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240161;genbank:gi:66395838;genbank:GeneID:5133170 Probab=99.81 E-value=2.2e-23 Score=144.79 Aligned_cols=108 Identities=23% Similarity=0.314 Sum_probs=87.1 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) |.+++ + |||+|.+.|+++++.+. +.+++++.++++.|+++|+.++|++||.|++||.+.... ++... T Consensus 1 Ma~~~--~-Gl~~l~~~l~~~~~~~~-~~~~~al~~~a~~v~~~ak~~apvdTG~Lr~SI~~~~~~---~g~~~------ 67 (135) T protein:vir:96 1 MAKVK--Y-GADSIVVDLEKYSKDME-KWVKKGITKTTLKIYNTAIHLMPVDTGFLRQSTTVDFEN---GGFTG------ 67 (135) T ss_pred Cchhh--h-hHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeEEeec---CcEEE------ Confidence 66553 3 99999999999998874 788999999999999999999999999999999764311 11110 Q ss_pred eccccccccceeecCCCCCcceeeecccCc---------------------------cCCCCCchhHHHHHHHHHHHHHH Q lcl|NC_011356. 81 VNPDTGNSDNTMKADNPRNAFYWRFVEMGT---------------------------VNMPPHPFVRPAFDVRSEQAAQV 133 (148) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT---------------------------~~~~a~PFl~pA~~~~~~~~~~~ 133 (148) ...++..|+.|+|||| .+|||||||+||++++++++.+. T Consensus 68 --------------~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~a~pfl~~A~~~~~~~~~~~ 133 (135) T protein:vir:96 68 --------------VVKIGSNYAVYVNYGTGIYATKGSRAHKIPWTYKDPNGKWHTTYGQMPQPFWEPAIDAGRQTFEQY 133 (135) T ss_pred --------------EEecCCCccchhhcccccccCCCccccccccccccCCcceeecCCcCCCcchhHHHHHHHHHHHHh Confidence 0124456888888888 66999999999999999988887 Q ss_pred HH Q lcl|NC_011356. 134 AI 135 (148) Q Consensus 134 ~~ 135 (148) |. T Consensus 134 i~ 135 (135) T protein:vir:96 134 FS 135 (135) T ss_pred cC Confidence 77 No 57 >protein:vir:94654 Length: 142 # NCBI annotation: tail component protein # Family: family:all:1084 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579211;genbank:gi:93007447;genbank:GeneID:5076773 Probab=99.79 E-value=2.9e-22 Score=138.64 Aligned_cols=115 Identities=24% Similarity=0.250 Sum_probs=89.2 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) |..|+++| |+++|.+.|+.+.+.+. .+++.++..++..++++|+.++|++||.|++||.+....... .... T Consensus 1 Ma~~~~~~-~~~~l~~~l~~~~~~~~-~~~~~~l~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~g~-~~~~------ 71 (142) T protein:vir:94 1 MAGLNYRV-NSTEFQGALRAALDRLT-GAAREATEAAANDMVNMAKGLCPVDTGRLRSSIQAVPSGGRF-SFSV------ 71 (142) T ss_pred CceeEEEe-cHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeccCCc-eEEE------ Confidence 88899888 58999999999998765 688999999999999999999999999999999764322211 0100 Q ss_pred eccccccccceeecCCCCCcceeeecccCccC---------------------------CCCCchhHHHHHHHHHHHHHH Q lcl|NC_011356. 81 VNPDTGNSDNTMKADNPRNAFYWRFVEMGTVN---------------------------MPPHPFVRPAFDVRSEQAAQV 133 (148) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~---------------------------~~a~PFl~pA~~~~~~~~~~~ 133 (148) ...++..|+.|+||||.. ++|||||+||++.+++.+.+. T Consensus 72 --------------~v~~~~~YA~~vE~Gt~~~~i~pk~~k~l~~~~~~~~~~~v~~pG~~~~pfl~~A~~~~~~~i~~~ 137 (142) T protein:vir:94 72 --------------TIGTNVTYAADVEYGTAPHVIVPKDKKALYWPGAAHPVAKVNHPGTRAQPFMRPAIAAASTFLRNH 137 (142) T ss_pred --------------EEecCcccchhhhccCCCceeccCCCccceecccceeeeeeeecCCCCCcchhHHHHHHHHHHHHH Confidence 012345799999999843 689999999999998876544 Q ss_pred HHHHHH Q lcl|NC_011356. 134 AIARMN 139 (148) Q Consensus 134 ~~~~~~ 139 (148) ++ +|+ T Consensus 138 ~~-~~~ 142 (142) T protein:vir:94 138 AK-GIR 142 (142) T ss_pred HH-hcC Confidence 43 333 No 58 >protein:vir:79034 Length: 141 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110729;genbank:gi:134287346;genbank:GeneID:4955208 Probab=99.77 E-value=1.1e-21 Score=135.53 Aligned_cols=134 Identities=15% Similarity=0.208 Sum_probs=104.2 Q ss_pred Cccc-eeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeee Q lcl|NC_011356. 1 MIET-LLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIR 79 (148) Q Consensus 1 Mm~~-~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~ 79 (148) |.++ +|++.||++|.++|+.+.+....+.+++++++.|..+...++.++|++||.|++|+......+... + T Consensus 1 M~~~~~~d~~gl~~~~~~l~~~~~~~~~~~~~~~~~~~a~~l~~~vk~~tPVdTG~Lr~sw~~~~~~~~~~-~------- 72 (141) T protein:vir:79 1 MARWGSVDFREFKRVCKKMEKLTKIDLDKFCKDAARELAARLLGKVIRRTPVDTGFLRQGWNGVAYARSLP-V------- 72 (141) T ss_pred CCCCccCcHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhhcccccccccccccc-e------- Confidence 8886 899999999999999887644567889999999999999999999999999999975432111000 0 Q ss_pred eeccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHH--HHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011356. 80 GVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAF--DVRSEQAAQVAIARMNRAIDEVLRR 148 (148) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~--~~~~~~~~~~~~~~~~~~i~k~~kk 148 (148) ...... ......++..||+||||||+.++++||+.+++ +...+++.+.|.+.+.+.|++.+++ T Consensus 73 ----~~~g~~--~~v~v~n~~~YA~~VE~Ghr~~~~~gfV~G~fml~~s~~~~~~~~~~~~~~~l~~~l~~ 137 (141) T protein:vir:79 73 ----YKQGNN--YIIEVVNPTEYASYVNFGHRTKDGKGWVKGQHFLTISEMELQSQVDKIIEKKLLILLKG 137 (141) T ss_pred ----eecCCe--eEEEEecCCcchhhhhcceeecCCcceeCCchhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 000000 01111345689999999999999999999998 7777888888888889999999988 No 59 >protein:vir:4956 Length: 153 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049932;genbank:gi:9632903;genbank:GeneID:1262079 Probab=99.77 E-value=1.5e-21 Score=134.79 Aligned_cols=137 Identities=17% Similarity=0.132 Sum_probs=92.7 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) |.+|+ .||+||+++|++|.+.. .+.-++|+++||+++++..+.++|+..-.- ......+++.+.|.+.. T Consensus 1 M~~~~---~glee~~~~lekL~~~~-~~~~~katkAGA~v~~e~L~~~tp~~h~~~-------~kt~~~~HlaD~I~~s~ 69 (153) T protein:vir:49 1 MTGLD---EALEGWLKTVASIGDLT-PAEQAKITTAGAKVFKEELAEVTREKHYSK-------KKDLKYGHMADGLAVQS 69 (153) T ss_pred CccHH---HHHHHHHHHHHHhccCC-HHHHHHHHHHHHHHHHHHHHHhccccCCCC-------CCCCCCCcccccceecc Confidence 77766 69999999999999764 356789999999999999999999853100 01111233344433322 Q ss_pred eccccccccceeecC-CCCCcceeeecccCccCCCCCchhHHHHHHH--HHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011356. 81 VNPDTGNSDNTMKAD-NPRNAFYWRFVEMGTVNMPPHPFVRPAFDVR--SEQAAQVAIARMNRAIDEVLRR 148 (148) Q Consensus 81 ~~~~~~~~~~~~~~~-~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~--~~~~~~~~~~~~~~~i~k~~kk 148 (148) ............+++ ....+||+||+||||++|||+||+.++.+++ ++++++++.++|++.|++..-= T Consensus 70 ~~idG~~dG~s~VG~~~~~~a~~a~f~n~GT~km~~~hFie~tr~e~~~k~~vl~A~~~~~~~il~~~~~~ 140 (153) T protein:vir:49 70 TNADGRKNGVSTVGWKNNYHAQNARRLNDGTKKYRADHFITNVQNDSTVKNKVLLAEKEEYEKLIRRKGGV 140 (153) T ss_pred ccccccccceeeecccCCccceeeeecccCcccCCCChhhHHHHHHhhHHHHHHHHHHHHHHHHHHhcCCe Confidence 111111111122333 2445899999999999999999999999876 6778887777777666654332 No 60 >protein:vir:100887 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358767;genbank:gi:77999993;genbank:GeneID:3726158 Probab=99.77 E-value=1.2e-21 Score=135.19 Aligned_cols=126 Identities=16% Similarity=0.201 Sum_probs=93.3 Q ss_pred ccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCc----------chhhhhceeccccccccc Q lcl|NC_011356. 2 IETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRR----------GKLRRNVVVLSRCSRDGG 71 (148) Q Consensus 2 m~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~----------g~l~~~i~~~~~~~~~~~ 71 (148) |+|. +||++++++|++|.... .+.-.+++.+||+++++..+.++|... ++|+++|.++.. +..+ T Consensus 1 v~~~---~~lee~l~~i~kl~~~~-~~~~~ki~kaGA~v~~e~L~~~tp~~~~~~~~~~~~~~HlaD~I~~s~~-~~dg- 74 (139) T protein:vir:10 1 MDMD---EALGQWLKQVSKAAELS-ISDQEKITKAGADVYAKKLAETTKEKHPNTKGDGGKYGHLSEDIRSAAG-DIDG- 74 (139) T ss_pred CCHH---HHHHHHHHHHHHhhccC-HHHHHHHHHHHHHHHHHHHHHhcccccCcCCCCCCCCcchhhcceecCc-cccc- Confidence 3333 69999999999997532 345578999999999999999999742 456666655432 1111 Q ss_pred eeeeeeeeeeccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011356. 72 MESGVHIRGVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEVLRR 148 (148) Q Consensus 72 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~k~~kk 148 (148) .......+++...+|++||+||||++|||+||+.++.+++++++++++.++|++.|++..-= T Consensus 75 ---------------~~~g~~~VG~~k~~~~A~f~n~GT~k~~~~hFie~t~~e~~~evl~a~~~~~k~~l~~~~~~ 136 (139) T protein:vir:10 75 ---------------DHNGSSTVGFHNKAHIARFLNDGTKYIRADHFVDNARDDAKDAVFAAEAEKYQAMIAKANGG 136 (139) T ss_pred ---------------ccceeeeeCCCCCcceEeecccCccccCCCchHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC Confidence 11111122333458999999999999999999999999999999999888888877764433 No 61 >protein:vir:99101 Length: 142 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1608 # MgeName: Qyrzula # Cross-refs: genbank:acc:YP_655705;genbank:gi:109521783;genbank:GeneID:4157823 Probab=99.77 E-value=5.7e-22 Score=137.06 Aligned_cols=113 Identities=18% Similarity=0.231 Sum_probs=83.7 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) ||+++|.++||+..++.+. +.+. +++++++.+.+..|+++|+.++|++||+|++||.........+. T Consensus 1 m~~~~~~~~gl~~~l~~~~---~~~~-~~~~~~i~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~~~~~--------- 67 (142) T protein:vir:99 1 MVQVSVRYEGFDYNPVGAA---AQVG-PILRRTHSSLTRQIANETRARVPVLTGHLGRSVREDPQVMVTPF--------- 67 (142) T ss_pred CceeEEEeeecchhHHHHH---HHHH-HHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeeeeccccccc--------- Confidence 9999999999987555544 4554 67899999999999999999999999999999976543221110 Q ss_pred eccccccccceeecCCCCCcceeeecccCcc-----------------------------CCCCCchhHHHHHHHHHHHH Q lcl|NC_011356. 81 VNPDTGNSDNTMKADNPRNAFYWRFVEMGTV-----------------------------NMPPHPFVRPAFDVRSEQAA 131 (148) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~-----------------------------~~~a~PFl~pA~~~~~~~~~ 131 (148) .+.....+++.|+.|+||||. .++|||||+||++++.++.. T Consensus 68 ----------~~~~~v~~~a~YA~~ve~GT~ph~i~pk~~~al~f~~~g~~~~~k~v~hpG~~a~Pfl~~A~~~~~~~~~ 137 (142) T protein:vir:99 68 ----------HVSGGVTAHAKYAAAVHEGTRPHVIRAKHAQALHFWWRGREVFVRQVNHPGTRARPYLRNAGEAVVRRDR 137 (142) T ss_pred ----------eEEEEeccCccccceeccCCccceeccccCceeeEecCCceeeeeeeecCCCCCCchhHHHHHHHHhhhh Confidence 001111245668888888884 35699999999999888765 Q ss_pred HHHHH Q lcl|NC_011356. 132 QVAIA 136 (148) Q Consensus 132 ~~~~~ 136 (148) ..... T Consensus 138 ~~~~r 142 (142) T protein:vir:99 138 RIRVR 142 (142) T ss_pred hhccC Confidence 55444 No 62 >protein:vir:8669 Length: 142 # NCBI annotation: gp27 # Family: family:all:1084 # MgeID: mge:156 # MgeName: Rosebush # Cross-refs: genbank:acc:NP_817788;genbank:gi:29566220;genbank:GeneID:1259476 Probab=99.77 E-value=5.7e-22 Score=137.06 Aligned_cols=113 Identities=18% Similarity=0.231 Sum_probs=83.7 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) ||+++|.++||+..++.+. +.+. +++++++.+.+..|+++|+.++|++||+|++||.........+. T Consensus 1 m~~~~~~~~gl~~~l~~~~---~~~~-~~~~~~i~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~~~~~--------- 67 (142) T protein:vir:86 1 MVQVSVRYEGFDYNPVGAA---AQVG-PILRRTHSSLTRQIANETRARVPVLTGHLGRSVREDPQVMVTPF--------- 67 (142) T ss_pred CceeEEEeeecchhHHHHH---HHHH-HHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeeeeccccccc--------- Confidence 9999999999987555544 4554 67899999999999999999999999999999976543221110 Q ss_pred eccccccccceeecCCCCCcceeeecccCcc-----------------------------CCCCCchhHHHHHHHHHHHH Q lcl|NC_011356. 81 VNPDTGNSDNTMKADNPRNAFYWRFVEMGTV-----------------------------NMPPHPFVRPAFDVRSEQAA 131 (148) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~-----------------------------~~~a~PFl~pA~~~~~~~~~ 131 (148) .+.....+++.|+.|+||||. .++|||||+||++++.++.. T Consensus 68 ----------~~~~~v~~~a~YA~~ve~GT~ph~i~pk~~~al~f~~~g~~~~~k~v~hpG~~a~Pfl~~A~~~~~~~~~ 137 (142) T protein:vir:86 68 ----------HVSGGVTAHAKYAAAVHEGTRPHVIRAKHAQALHFWWRGREVFVRQVNHPGTRARPYLRNAGEAVVRRDR 137 (142) T ss_pred ----------eEEEEeccCccccceeccCCccceeccccCceeeEecCCceeeeeeeecCCCCCCchhHHHHHHHHhhhh Confidence 001111245668888888884 35699999999999888765 Q ss_pred HHHHH Q lcl|NC_011356. 132 QVAIA 136 (148) Q Consensus 132 ~~~~~ 136 (148) ..... T Consensus 138 ~~~~r 142 (142) T protein:vir:86 138 RIRVR 142 (142) T ss_pred hhccC Confidence 55444 No 63 >protein:vir:100223 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025034;genbank:gi:48697267;genbank:GeneID:2948321 Probab=99.73 E-value=1.3e-20 Score=129.61 Aligned_cols=136 Identities=16% Similarity=0.205 Sum_probs=91.5 Q ss_pred ccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeeee Q lcl|NC_011356. 2 IETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRGV 81 (148) Q Consensus 2 m~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~~ 81 (148) |+ |+ .||++++++|++|.... .+.-.+++.+||+++++..+.++|... +.........+++.+.|.+... T Consensus 1 ~~--~~-~~l~e~l~~lekl~~~~-~~~~~k~tkaGA~v~~~~L~~~tp~~~------~~~~~~~~~~~HlaD~I~~~~~ 70 (139) T protein:vir:10 1 MD--MD-EALGQWLKQVSKAAQLS-VSDQEKITKAGADVYAKELAETTKEKH------PNTKGDGGKYGHLSEDISSAAG 70 (139) T ss_pred CC--HH-HHHHHHHHHHHHhccCC-HHHHHHHHHHHHHHHHHHHHHhccccc------ccCCCCCCCCCcccccceecCc Confidence 44 44 69999999999997643 345578999999999999999999631 0011111122333333333221 Q ss_pred ccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011356. 82 NPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEVLRR 148 (148) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~k~~kk 148 (148) ... +.......+++...+|.+||+|+||++|||+||+..+.++.++++++++.++|++.|++..-- T Consensus 71 ~id-g~~~g~~~VG~~~~~~~Ahf~n~GT~~~~~~hFie~t~~e~~~ev~~a~~~~~ke~l~~~~~~ 136 (139) T protein:vir:10 71 DID-GDHNGSSTVGFHNKAHIARFLNDGTKNIRADHFVDNARDDAKDAVFAAEAEKYQAMIAKANGG 136 (139) T ss_pred ccc-ccccccceeCCCCCceeeeeeccCccccCCCchHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC Confidence 111 111122223333457889999999999999999999999999999998888887777654332 No 64 >protein:vir:4859 Length: 140 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049399;genbank:gi:9632427;genbank:GeneID:1258496 Probab=99.71 E-value=4.1e-20 Score=126.84 Aligned_cols=128 Identities=18% Similarity=0.133 Sum_probs=94.1 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC---------cchhhhhceeccccccccc Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVR---------RGKLRRNVVVLSRCSRDGG 71 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~---------~g~l~~~i~~~~~~~~~~~ 71 (148) |.+|. +||++|+++|++|.+... +.-.+++++||+++++..+..+|.. .++|++||.++. .+..|. T Consensus 1 M~~~~---d~l~e~~~~lekl~~~~~-~~~~katkAGA~v~~~~L~~~tp~~h~~~~~t~~~~HlaD~I~~~~-~~iDg~ 75 (140) T protein:vir:48 1 MTGLD---EALEGWLKTVASIGDLTP-AEQAKITTAGAKVFKEELAEVTRQKHYSNKKHLKYGHMADGLSVQS-TNVDGR 75 (140) T ss_pred CccHH---HHHHHHHHHHHHhccCCH-HHHHHHHHHHHHHHHHHHHHhccccCCCCCCCCCCCcchhceeecc-cccccc Confidence 77766 699999999999987543 5668999999999999999999963 234666665532 221111 Q ss_pred eeeeeeeeeeccccccccceeecCC-CCCcceeeecccCccCCCCCchhHHHHHHH--HHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011356. 72 MESGVHIRGVNPDTGNSDNTMKADN-PRNAFYWRFVEMGTVNMPPHPFVRPAFDVR--SEQAAQVAIARMNRAIDEVLRR 148 (148) Q Consensus 72 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~--~~~~~~~~~~~~~~~i~k~~kk 148 (148) ......+++. ...+|++||+++||++|||+||+.+|.+++ +.++++++.++|++.|++.-.- T Consensus 76 ---------------~~g~s~VG~~kk~~a~~A~f~n~GT~k~~~~hFve~~~~e~~~k~~vl~A~~~~~~~~l~~~~~~ 140 (140) T protein:vir:48 76 ---------------KNGVSTVGWVNRYHAQNARRLNDGTKKYRADHFVTNVQNDSAVQTKVLLAEKEEYEKLIRKKGGE 140 (140) T ss_pred ---------------cCceeeeccCCCcceeeeeccccCccccCCCchhHHHHHhhhhHHHHHHHHHHHHHHHHHhhcCC Confidence 1111223332 346899999999999999999999999966 7788888888888777664433 No 65 >protein:vir:5000 Length: 141 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049974;genbank:gi:9632946;genbank:GeneID:1262109 Probab=99.70 E-value=6.7e-20 Score=125.69 Aligned_cols=128 Identities=17% Similarity=0.166 Sum_probs=90.4 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC---------cchhhhhceeccccccccc Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVR---------RGKLRRNVVVLSRCSRDGG 71 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~---------~g~l~~~i~~~~~~~~~~~ 71 (148) |.+|. .||+||+++|++|.... .+.-.+++.+||+++++..+..+|++ .++|++||.++.. +..|. T Consensus 1 M~~~~---~gl~e~~~~lekl~~~~-~~~~~katkAGA~v~~~~L~~~tp~~hy~~~~~~~~~HlaD~I~~~~~-~~DG~ 75 (141) T protein:vir:50 1 MVGLA---EALDEWLKTVASIGNLT-PAEQVEITTAGAKVFKKELEEVTREKHYSRKKNPKFGHMADGLAIQST-NADGR 75 (141) T ss_pred CccHH---HHHHHHHHHHHHhcCCC-HHHHHHHHHHHHHHHHHHHHHhcccCCCCCCCCCCCCccccceeeccC-ccccc Confidence 77765 89999999999998543 35568999999999999999999964 3356666655331 11111 Q ss_pred eeeeeeeeeeccccccccceeecC-CCCCcceeeecccCccCCCCCchhHHHHHHH--HHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011356. 72 MESGVHIRGVNPDTGNSDNTMKAD-NPRNAFYWRFVEMGTVNMPPHPFVRPAFDVR--SEQAAQVAIARMNRAIDEVLRR 148 (148) Q Consensus 72 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~--~~~~~~~~~~~~~~~i~k~~kk 148 (148) ......+++ +...+|++||+||||++|||+||+.+|.+.+ +++|++++.++|++.|++.--- T Consensus 76 ---------------~dg~s~VG~~~~~~~~~A~f~n~GT~k~~~~hFve~~~~~a~~k~~Vl~A~~~~~k~~l~~~~~~ 140 (141) T protein:vir:50 76 ---------------KNGVSTVGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSTVQKKVLLEKKRNTKNSLEEKEGC 140 (141) T ss_pred ---------------cCCeeeeccCCCccceeeeccccCccccCCCchhHHHHHhhhhHHHHHHHHHHHHHHHHHhccCC Confidence 111112232 3345899999999999999999999999864 6778887777666655542111 No 66 >protein:vir:81147 Length: 126 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285816;genbank:gi:148747737;genbank:GeneID:5247190 Probab=99.69 E-value=8.6e-20 Score=125.10 Aligned_cols=120 Identities=16% Similarity=0.232 Sum_probs=89.2 Q ss_pred CccceeeehhH-HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeee Q lcl|NC_011356. 1 MIETLLDFSGL-EDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIR 79 (148) Q Consensus 1 Mm~~~~~i~Gl-~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~ 79 (148) |.. |+|.+| +++.+.|+.+.+.+. +.++.++.++|+.+.++++.++|+.||.|++|++++..... +. T Consensus 1 Ma~--i~id~la~~I~~~L~~y~~~v~-~~v~~~v~~~a~~~~~~ik~~aP~rTG~y~ksw~vk~~~~~-g~-------- 68 (126) T protein:vir:81 1 MAN--ITIDRLADELLQAVKEYTDDVA-EGVRKKVDETARKVLKEAQALAPKRTGEYARTFTITKEDGY-GT-------- 68 (126) T ss_pred Ccc--cchhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhCCcccchhhccccccccccC-Cc-------- Confidence 555 666787 568888999998765 67899999999999999999999999999999876542211 11 Q ss_pred eeccccccccceeecCCCCCcceeeecccCccC-----CCCCchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 80 GVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVN-----MPPHPFVRPAFDVRSEQAAQVAIARMNRAI 142 (148) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~-----~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i 142 (148) ......+...+.+.|+|||||.+ +||+|||+||++...+++.+.|.+.|+..= T Consensus 69 ----------~~~vv~~~~~~~l~HLLEfGha~r~gGrV~a~Phi~Pa~e~~~~~~~~~i~~~l~~gg 126 (126) T protein:vir:81 69 ----------TKRIIWNKKHYRRVHLLEFGHAKVNGGRVKEYPHLRPAYDKHGARLPDELKRVIENGG 126 (126) T ss_pred ----------ceEEEeccCCCCceeeeecceecCCCCccCCCcchHHHHHHHHHHHHHHHHHHhhcCC Confidence 11123334555679999999987 899999999999877765554444443222 No 67 >protein:vir:4833 Length: 140 # NCBI annotation: ORF29 # Family: family:all:1029 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038330;genbank:gi:9634656;genbank:GeneID:1262624 Probab=99.67 E-value=3e-19 Score=122.15 Aligned_cols=137 Identities=16% Similarity=0.130 Sum_probs=94.4 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) |.+|. .||++|++.|++|.+.+. +.-.+++.+||+++++..+.++|+..-..+ .....+++.+.|.+.. T Consensus 1 M~~~~---d~l~e~~~~v~kl~~~~~-~~~~katkAGAkv~~~~L~~~tp~~h~~~r-------~t~~~~HlaD~I~~~~ 69 (140) T protein:vir:48 1 MTGLD---EALEGWLKTVASIGDLTP-AEQAKITTAGAKVFKKELAEVTREKHYSKK-------KDLKYGHMADGLAVQS 69 (140) T ss_pred CccHH---HHHHHHHHHHHHhccCCH-HHHHHHHHHhHHHHHHHHHHhcccCCCCCC-------CCCCCCcccccceecc Confidence 77766 699999999999997543 455899999999999999999997531100 0111233333333332 Q ss_pred eccccccccceeecCC-CCCcceeeecccCccCCCCCchhHHHHHHH--HHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011356. 81 VNPDTGNSDNTMKADN-PRNAFYWRFVEMGTVNMPPHPFVRPAFDVR--SEQAAQVAIARMNRAIDEVLRR 148 (148) Q Consensus 81 ~~~~~~~~~~~~~~~~-~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~--~~~~~~~~~~~~~~~i~k~~kk 148 (148) ............+++. ...+|++||+++||++|||+||+..+.++. +++++.++.++|++.|.+-..- T Consensus 70 ~~idg~~dG~s~VG~~k~~~a~~a~f~NdGT~k~~~~hFve~t~~e~~~~~~vl~A~~~~y~~~l~kk~~~ 140 (140) T protein:vir:48 70 TNVDGRKNGVATVGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSAVRDKVLLAEKEEYEKLIRKKGGE 140 (140) T ss_pred cccccccccceeecccCCCceeEEeecccCccccCCCchHHHHHHhhhhHHHHHHHHHHHHHHHHHhhcCC Confidence 1111111111223333 335899999999999999999999999854 8899999988888877665444 No 68 >protein:vir:105467 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529877;genbank:gi:90592617;genbank:GeneID:3974531 Probab=99.66 E-value=6.3e-19 Score=120.37 Aligned_cols=124 Identities=16% Similarity=0.153 Sum_probs=92.4 Q ss_pred CccceeeehhHHHHHHHHHHhHHH-HHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGA-ENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIR 79 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~-~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~ 79 (148) |...+|+++||++|.+.|+++... ..++.+++++.+.+..+.+.++.++|++||.|++|+.........++... T Consensus 1 Ms~~~id~~gl~~~~~~l~~~~~~~~~~~~~~~~l~~~~~~~~~~vk~~tPVdTG~Lr~S~~~~~~~~~~~~~~~----- 75 (144) T protein:vir:10 1 MSLGHVDDAQFQQFASRVRQKIDSGYVKQELGKSSRRIGTQSLRILEANTPVKQGNLRRSWTAEGPTYGCGGWTI----- 75 (144) T ss_pred CCCCCccHHHHHHHHHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHhCCCCcchhccceeecceeeecCeeEE----- Confidence 555699999999999999998643 23568899999999999999999999999999999875432211111111 Q ss_pred eeccccccccceeecCCCCCcceeeecccCccCC-----------------CCCchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 80 GVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNM-----------------PPHPFVRPAFDVRSEQAAQVAIARMNRAI 142 (148) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~-----------------~a~PFl~pA~~~~~~~~~~~~~~~~~~~i 142 (148) ...++..|++||||||+.+ +.+|||++|.+..+..+ .+.+++.| T Consensus 76 ---------------~V~n~~~YA~~VE~Ghr~~~G~~v~~~~~~~~~g~V~G~~~~~~a~~~~~~~~----~~~l~k~l 136 (144) T protein:vir:10 76 ---------------KLINNAEYASYVESGHRQTPGRYVPVLKKRLVRDWVPGQFYMKKSIPQIQRQL----PQLVTEGL 136 (144) T ss_pred ---------------EEecCCCcccccccceeecCCcccccCCCccccceecCccchHHHHHHHHHHH----HHHHHHHH Confidence 1135667999999999644 67889999998777754 44555555 Q ss_pred HHHhcC Q lcl|NC_011356. 143 DEVLRR 148 (148) Q Consensus 143 ~k~~kk 148 (148) ++++.. T Consensus 137 ~~l~d~ 142 (144) T protein:vir:10 137 WGLKDL 142 (144) T ss_pred HHHhhh Confidence 555555 No 69 >protein:vir:78077 Length: 141 # NCBI annotation: gp9 # Family: family:all:180 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468793;genbank:gi:157325374;genbank:GeneID:5601839 Probab=99.62 E-value=1.5e-18 Score=118.23 Aligned_cols=115 Identities=14% Similarity=0.086 Sum_probs=79.3 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) |-+++|+ +...+.++++.+.+.+.+...++..++++|+..|+.++|++||.|++||....... + ...- T Consensus 1 ~~~~~f~----~~~~~~~~~~~k~~~~~~~~~a~~~~~~~ie~~ak~~~pvdtG~L~~SI~~~v~~~--g-~~~~----- 68 (141) T protein:vir:78 1 MNEFEFD----SNIPKARKLIEKKVLQALEDIGEHMTTELAEGGHGVTSNNDTGEYAQKSGYKVRKS--S-KEVI----- 68 (141) T ss_pred CcchhHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccchhhcceeeeeecC--C-cEEE----- Confidence 6667776 44555555555555544433468888999999999999999999999997542211 1 1100 Q ss_pred eccccccccceeecCCCCCcceeeecccCc--------------------------cCCCCCchhHHHHHHHHHHHHHHH Q lcl|NC_011356. 81 VNPDTGNSDNTMKADNPRNAFYWRFVEMGT--------------------------VNMPPHPFVRPAFDVRSEQAAQVA 134 (148) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT--------------------------~~~~a~PFl~pA~~~~~~~~~~~~ 134 (148) ..++..|+.|+|||| +.|||||||+||++.+++++.+.| T Consensus 69 ---------------V~~~~~YA~yVE~GTG~~~~~~~grk~~w~y~~~~g~~~~t~G~~aqpFl~~A~~~~~~~i~~~i 133 (141) T protein:vir:78 69 ---------------VGNSSDYAIYYEFGTGEKSERGGGKAGGWFYMDKKGHWHFTRGSQASKRMRYTFRDEQDKVRVFT 133 (141) T ss_pred ---------------EecCCCccceeecCCcccccCCCCCcCcceeecCCCeeEeccCCCCchhhhhhHHhhHHHHHHHH Confidence 013345777777776 569999999999999999887766 Q ss_pred HHHHHHHHH Q lcl|NC_011356. 135 IARMNRAID 143 (148) Q Consensus 135 ~~~~~~~i~ 143 (148) .+.|+. |+ T Consensus 134 ~~~~~~-l~ 141 (141) T protein:vir:78 134 ERALRG-IN 141 (141) T ss_pred HHHhhc-cC Confidence 665543 33 No 70 >protein:vir:95062 Length: 116 # NCBI annotation: ORF044 # Family: family:all:180 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240827;genbank:gi:66394711;genbank:GeneID:5133856 Probab=99.60 E-value=7.3e-19 Score=120.01 Aligned_cols=87 Identities=21% Similarity=0.284 Sum_probs=67.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeeeeccccccccceeecCCCCCcceeee Q lcl|NC_011356. 26 NNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRGVNPDTGNSDNTMKADNPRNAFYWRF 105 (148) Q Consensus 26 ~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~ 105 (148) .++++++++.+++..|+.+|+.++|++||+|++||.+..... ++...+ ..+..|+.| T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~apv~TG~Lr~SI~~~~~~~---~~~~~V--------------------~~~~~Ya~y 57 (116) T protein:vir:95 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDG---GFTGVI--------------------NIGSEYAIY 57 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHhhCCccccccccceeEEeecC---cEEEEE--------------------ecCCCccce Confidence 457889999999999999999999999999999997543211 111100 122346666 Q ss_pred cccC-----------------------------ccCCCCCchhHHHHHHHHHHHHHHHH Q lcl|NC_011356. 106 VEMG-----------------------------TVNMPPHPFVRPAFDVRSEQAAQVAI 135 (148) Q Consensus 106 ~E~G-----------------------------T~~~~a~PFl~pA~~~~~~~~~~~~~ 135 (148) +||| |..|+|||||+||++++++.+.+.|. T Consensus 58 vE~GTg~~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~is 116 (116) T protein:vir:95 58 VNYGTGIYATGAGGSRAKNIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred eecCccccccCCCccccccccceeecCccceeeCCCCCCCcchHHHHHHHHHHHHHhhC Confidence 6666 77899999999999999998888888 No 71 >protein:vir:97327 Length: 116 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240615;genbank:gi:66396305;genbank:GeneID:5133683 Probab=99.60 E-value=7.6e-19 Score=119.92 Aligned_cols=87 Identities=21% Similarity=0.284 Sum_probs=68.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeeeeccccccccceeecCCCCCcceeee Q lcl|NC_011356. 26 NNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRGVNPDTGNSDNTMKADNPRNAFYWRF 105 (148) Q Consensus 26 ~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~ 105 (148) .++++++++.+++..|+++|+.++|++||+|++||.+..... ++...+ .++..|+.| T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~---~~~~~V--------------------~~~~~YA~y 57 (116) T protein:vir:97 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDG---GFTGVI--------------------NIGSEYAIY 57 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcCcccccccceEEeecC---cEEEEE--------------------ecCCCcccc Confidence 457889999999999999999999999999999997543211 111000 123457777 Q ss_pred cccC-----------------------------ccCCCCCchhHHHHHHHHHHHHHHHH Q lcl|NC_011356. 106 VEMG-----------------------------TVNMPPHPFVRPAFDVRSEQAAQVAI 135 (148) Q Consensus 106 ~E~G-----------------------------T~~~~a~PFl~pA~~~~~~~~~~~~~ 135 (148) +||| |..|+|||||+||++++++.+.+.|. T Consensus 58 vE~GTg~~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 116 (116) T protein:vir:97 58 VNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred cccCCcccccCCCcccccccceeeecCCceeeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 7777 88899999999999999998888887 No 72 >protein:vir:1243 Length: 116 # NCBI annotation: similar to phage Spp1 gp16.1 # Family: family:all:180 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510942;genbank:gi:17426276;genbank:GeneID:927389 Probab=99.60 E-value=7.6e-19 Score=119.92 Aligned_cols=87 Identities=21% Similarity=0.284 Sum_probs=68.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeeeeccccccccceeecCCCCCcceeee Q lcl|NC_011356. 26 NNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRGVNPDTGNSDNTMKADNPRNAFYWRF 105 (148) Q Consensus 26 ~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~ 105 (148) .++++++++.+++..|+++|+.++|++||+|++||.+..... ++...+ .++..|+.| T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~---~~~~~V--------------------~~~~~YA~y 57 (116) T protein:vir:12 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDG---GFTGVI--------------------NIGSEYAIY 57 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcCcccccccceEEeecC---cEEEEE--------------------ecCCCcccc Confidence 457889999999999999999999999999999997543211 111000 123457777 Q ss_pred cccC-----------------------------ccCCCCCchhHHHHHHHHHHHHHHHH Q lcl|NC_011356. 106 VEMG-----------------------------TVNMPPHPFVRPAFDVRSEQAAQVAI 135 (148) Q Consensus 106 ~E~G-----------------------------T~~~~a~PFl~pA~~~~~~~~~~~~~ 135 (148) +||| |..|+|||||+||++++++.+.+.|. T Consensus 58 vE~GTg~~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 116 (116) T protein:vir:12 58 VNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred cccCCcccccCCCcccccccceeeecCCceeeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 7777 88899999999999999998888887 No 73 >protein:vir:99528 Length: 92 # NCBI annotation: putative major tail protein # Family: family:all:180 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958541;genbank:gi:41179323;genbank:GeneID:2717166 Probab=99.56 E-value=4.5e-18 Score=115.69 Aligned_cols=92 Identities=17% Similarity=0.278 Sum_probs=72.9 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) |..++|+|+|+|+|++.|++.... ..+++++.+.+..++..|+++||++||.|++||....... ++... T Consensus 1 Ma~~~i~~~Gld~L~~~L~~~~~~---~~v~~vv~~~~~~l~~~ak~~ap~dTG~lrrSI~~~~~~~---g~~~~----- 69 (92) T protein:vir:99 1 MADYSISWDGLDALDEALANQQNM---NTVKKVVKKHTANLMTATQQAVPVDTGHLKQSAQIQISRD---GFTGS----- 69 (92) T ss_pred CCceeeEeehHHHHHHHHHhhccH---HHHHHHHHHHHHHHHHHHHHhCCCCccccceeeeEEeecC---CeeEE----- Confidence 888999999999999999987542 3468999999999999999999999999999998653322 11111 Q ss_pred eccccccccceeecCCCCCcceeeecccCccCCCC Q lcl|NC_011356. 81 VNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPP 115 (148) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a 115 (148) .......+.|+.|+||||++|+| T Consensus 70 ------------v~~~gp~a~Ya~YvE~GTR~M~A 92 (92) T protein:vir:99 70 ------------VTYGGGLVNYAAYVEFGTRFMDS 92 (92) T ss_pred ------------EEeccCccccccccccceeecCC Confidence 11112445699999999999999 No 74 >protein:vir:100652 Length: 134 # NCBI annotation: 77ORF029 # Family: family:all:589 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958610;genbank:gi:41189542;genbank:GeneID:2743798 Probab=99.53 E-value=4.6e-17 Score=110.13 Aligned_cols=122 Identities=17% Similarity=0.154 Sum_probs=89.9 Q ss_pred ceeeehhHHHHHHHHHHh--HHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--Ccchhhhhceeccccccccceeeeeeee Q lcl|NC_011356. 4 TLLDFSGLEDISRDLQLL--SGAENNRVLREATRAGANVLKEEVVSRAPV--RRGKLRRNVVVLSRCSRDGGMESGVHIR 79 (148) Q Consensus 4 ~~~~i~Gl~el~~~l~~l--~~~~~~~~~~~al~~~a~~i~~~ak~~aP~--~~g~l~~~i~~~~~~~~~~~~~~~~~~~ 79 (148) |++++.|++||+++|++. +..+ .++.++||.++++.|.+++|.+.++ |||.+.+++..+......|.....+ T Consensus 1 MsvevkGv~eil~~LE~k~g~~~~-~ri~dkAL~~age~v~~~~K~~~~~fkDTGati~ev~~s~p~~~~G~r~V~v--- 76 (134) T protein:vir:10 1 MSVKVTGDKALERELEKHFGIKEM-VKVQDKALIAGAKVIVEEIKKQLKPSEDSGALISEIGRTEPEWIKGKRTVTI--- 76 (134) T ss_pred CeEEeecHHHHHHHHHHhhchhhh-hhhhhHHHHHHhHHHHHHHHhhcCccccccceeccEeecCeeecCCceEEEE--- Confidence 889999999999999987 5554 5789999999999999999998776 9999988887765443333211111 Q ss_pred eeccccccccceeecCCCCCcceeeecccCccCCCCCchhHH--------HHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 80 GVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRP--------AFDVRSEQAAQVAIARMNRA 141 (148) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~p--------A~~~~~~~~~~~~~~~~~~~ 141 (148) .+.+..+.+++.|+.|||+.++...+|+.| |+++.+..+.+.++++|++. T Consensus 77 ------------gW~G~~~R~~ivHLnE~Gyt~~r~Gk~i~PrG~G~i~~a~~~~e~~~~~~ik~eL~kl 134 (134) T protein:vir:10 77 ------------RWRGPFERFRIVHLIENGHVEKKSGKFVKPKAMGGINRAIRQGQNKYFETLKRELKKL 134 (134) T ss_pred ------------EEEcCCceeeEEEeeecceeecCCCCeeccchhhHHHHHHHhhhHHHHHHHHHHHhcC Confidence 122334567899999999999999999999 55555554444444333333 No 75 >protein:vir:106041 Length: 137 # NCBI annotation: gp23 # Family: family:all:1084 # MgeID: mge:1505 # MgeName: Cooper # Cross-refs: genbank:acc:YP_654920;genbank:gi:109392376;genbank:GeneID:4157069 Probab=99.52 E-value=2.2e-17 Score=111.91 Aligned_cols=104 Identities=19% Similarity=0.272 Sum_probs=70.2 Q ss_pred ccceeeeh-hHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 2 IETLLDFS-GLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 2 m~~~~~i~-Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) |+++++|+ ..++|++ .+ ..+++.++.+.+..++.+|+.++|++||+|++||...........+... T Consensus 1 m~~s~~i~i~~~~l~~-------~v-~~~~k~~l~~~a~~i~~~ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~----- 67 (137) T protein:vir:10 1 MPVTARIHINEPELER-------QT-GAIFRGKHRSITRRIATQARADVPVRTGNLGRGIQEMPQTYRPFHVGGG----- 67 (137) T ss_pred CCeeEEEeeCHHHHHH-------HH-HHHHHHHHHHHHHHHHHHHHHhCCcccchhhcCceeeeeccccceEEEE----- Confidence 88888877 2333333 22 2456788899999999999999999999999999865432221111111 Q ss_pred eccccccccceeecCCCCCcceeeecccCcc-----------------------------CCCCCchhHHHHHHH---HH Q lcl|NC_011356. 81 VNPDTGNSDNTMKADNPRNAFYWRFVEMGTV-----------------------------NMPPHPFVRPAFDVR---SE 128 (148) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~-----------------------------~~~a~PFl~pA~~~~---~~ 128 (148) ..++..|+.|+||||. .++|||||+||+++. ++ T Consensus 68 ---------------v~~~~~YA~~ve~GT~ph~I~pk~~k~l~f~~~G~~v~~k~v~hpG~~a~Pfl~~A~~~~~~~~~ 132 (137) T protein:vir:10 68 ---------------VEDNVDYAAPVHEGSRPHRITARHANALHFFWHGREVFRKSVWHPGVRPRPFLRNAARRVVAADP 132 (137) T ss_pred ---------------EecCCCceeeeeecCCCceeecccCceeeeeeCCceEEeeeeecCCCCCCchHHHHHHHHhhccc Confidence 1234568888888872 245999999999973 45 Q ss_pred HHHHH Q lcl|NC_011356. 129 QAAQV 133 (148) Q Consensus 129 ~~~~~ 133 (148) ++.-. T Consensus 133 ri~~~ 137 (137) T protein:vir:10 133 DIHMT 137 (137) T ss_pred cccCC Confidence 44333 No 76 >protein:vir:3848 Length: 159 # NCBI annotation: hypothetical protein # Family: family:all:1029 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050154;swissprot:trembl:q9t1f3;genbank:gi:9633046;uniprot:Q9T1F3;genbank:GeneID:1262148 Probab=99.50 E-value=1.7e-16 Score=107.02 Aligned_cols=145 Identities=12% Similarity=0.108 Sum_probs=101.6 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceec-------ccccccccee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVL-------SRCSRDGGME 73 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~-------~~~~~~~~~~ 73 (148) ||. +|+ .+|++++++|+++.... .+.-.+++.+||+++++..+..+|+..-..+...... ......++++ T Consensus 1 mm~-~~~-~~l~~~l~~v~k~~~~~-~~~k~kiTkAGAkv~~e~L~~~Tp~~h~~~~k~~~~~~~~~k~~~~~~~~~Hla 77 (159) T protein:vir:38 1 MAN-DMG-EFYNNWVNEVEKGMKLS-VEDKAKITGEGAEAFSTVLHDHTPRSNEIYRRGRSAGHANAKHHNRNRKTKHLQ 77 (159) T ss_pred Ccc-hHH-HHHHHHHHHHHHhcCCC-HHHHHHHHHHhHHHHHHHHHHhcccCCCccccccccccccccccCcCcCCCccc Confidence 443 455 78999999998854321 2333689999999999999999999654433322211 1233345666 Q ss_pred eeeeeeeecccccccc-ceeecC-CCCCcceeeecccCccCCCCC-----chhHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_011356. 74 SGVHIRGVNPDTGNSD-NTMKAD-NPRNAFYWRFVEMGTVNMPPH-----PFVRPAFDVRSEQAAQVAIARMNRAIDEVL 146 (148) Q Consensus 74 ~~~~~~~~~~~~~~~~-~~~~~~-~~~~~~y~~~~E~GT~~~~a~-----PFl~pA~~~~~~~~~~~~~~~~~~~i~k~~ 146 (148) ++|.+......-|... ...+++ +...+|+++|+++||++|||+ ||+..+.++.+++|++++.++|++-|+..- T Consensus 78 D~I~~~~~~~iDg~~dG~s~VGw~~~~~a~~a~f~NdGT~~m~~k~~~gdHFvekt~~~~k~~Vl~A~~~~~~~il~~~~ 157 (159) T protein:vir:38 78 DSITYKPGYTADKLHTGDTDVGFEGKYYDFLAKIVNNGQHHMSPKRYKNMHFLDKAQQEAKKSVAEAELKAYKEVMNHDS 157 (159) T ss_pred cceeeecCccccccccceeeecccCCccceEeeecccCccccCCCCccCChhHHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 6665544322222221 223344 344579999999999999998 899999999999999999999988888777 Q ss_pred cC Q lcl|NC_011356. 147 RR 148 (148) Q Consensus 147 kk 148 (148) -| T Consensus 158 ~~ 159 (159) T protein:vir:38 158 DK 159 (159) T ss_pred CC Confidence 77 No 77 >protein:vir:101302 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908835;genbank:gi:118725099;genbank:GeneID:4555873 Probab=99.49 E-value=1.5e-16 Score=107.27 Aligned_cols=122 Identities=19% Similarity=0.174 Sum_probs=90.3 Q ss_pred ceeeehhHHHHHHHHHHh--HHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--Ccchhhhhceeccccccccceeeeeeee Q lcl|NC_011356. 4 TLLDFSGLEDISRDLQLL--SGAENNRVLREATRAGANVLKEEVVSRAPV--RRGKLRRNVVVLSRCSRDGGMESGVHIR 79 (148) Q Consensus 4 ~~~~i~Gl~el~~~l~~l--~~~~~~~~~~~al~~~a~~i~~~ak~~aP~--~~g~l~~~i~~~~~~~~~~~~~~~~~~~ 79 (148) |+++++|++||+++|++. +..+ .++.++||.++++.|.+++|.+.++ |||.+.+.+..+......|.....+ T Consensus 1 msvevkGv~eil~~le~k~g~~~~-~ri~nkAL~~age~v~~~~K~~~~~fkDTG~t~~ev~~s~p~~~~G~r~V~v--- 76 (134) T protein:vir:10 1 MSVKVIGDKALERELEKRFGIKEM-VKVQDKALIAGAKVIVEEVKKQLKPSKDTGALINEVSFSKPEWINGKRTITV--- 76 (134) T ss_pred CeEEEecHHHHHHHHHHhhchhhh-hhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeccEEecCeeecCCceEEEE--- Confidence 899999999999999987 5554 5789999999999999999999987 9999998887765543333211111 Q ss_pred eeccccccccceeecCCCCCcceeeecccCccCCCCCchhHH--------HHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 80 GVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRP--------AFDVRSEQAAQVAIARMNRA 141 (148) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~p--------A~~~~~~~~~~~~~~~~~~~ 141 (148) .+.+..+.+++.|+.|||+.+....+|+.| |+++.+..+.+.++++|++. T Consensus 77 ------------gW~G~~~R~~iiHLNE~Gytr~~~Gk~i~PrG~G~i~~a~~~~e~~~~~~ik~eL~kl 134 (134) T protein:vir:10 77 ------------HWRGSKDRYKIVHLIEYGHVQKGTGKFIKPKAMGGVNRAIRQGQNKYFETLKRELKKL 134 (134) T ss_pred ------------EEEcCCceeEEEEeecccceecccCCccCcchhhHHHHHHHhhhHHHHHHHHHHHhcC Confidence 122334567899999999999989999999 55555554444444443333 No 78 >protein:vir:9513 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835560;genbank:gi:30043947;genbank:GeneID:1260542 Probab=99.49 E-value=1.5e-16 Score=107.27 Aligned_cols=122 Identities=19% Similarity=0.174 Sum_probs=90.3 Q ss_pred ceeeehhHHHHHHHHHHh--HHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--Ccchhhhhceeccccccccceeeeeeee Q lcl|NC_011356. 4 TLLDFSGLEDISRDLQLL--SGAENNRVLREATRAGANVLKEEVVSRAPV--RRGKLRRNVVVLSRCSRDGGMESGVHIR 79 (148) Q Consensus 4 ~~~~i~Gl~el~~~l~~l--~~~~~~~~~~~al~~~a~~i~~~ak~~aP~--~~g~l~~~i~~~~~~~~~~~~~~~~~~~ 79 (148) |+++++|++||+++|++. +..+ .++.++||.++++.|.+++|.+.++ |||.+.+.+..+......|.....+ T Consensus 1 msvevkGv~eil~~le~k~g~~~~-~ri~nkAL~~age~v~~~~K~~~~~fkDTG~t~~ev~~s~p~~~~G~r~V~v--- 76 (134) T protein:vir:95 1 MSVKVIGDKALERELEKRFGIKEM-VKVQDKALIAGAKVIVEEVKKQLKPSKDTGALINEVSFSKPEWINGKRTITV--- 76 (134) T ss_pred CeEEEecHHHHHHHHHHhhchhhh-hhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeccEEecCeeecCCceEEEE--- Confidence 899999999999999987 5554 5789999999999999999999987 9999998887765543333211111 Q ss_pred eeccccccccceeecCCCCCcceeeecccCccCCCCCchhHH--------HHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 80 GVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRP--------AFDVRSEQAAQVAIARMNRA 141 (148) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~p--------A~~~~~~~~~~~~~~~~~~~ 141 (148) .+.+..+.+++.|+.|||+.+....+|+.| |+++.+..+.+.++++|++. T Consensus 77 ------------gW~G~~~R~~iiHLNE~Gytr~~~Gk~i~PrG~G~i~~a~~~~e~~~~~~ik~eL~kl 134 (134) T protein:vir:95 77 ------------HWRGSKDRYKIVHLIEYGHVQKGTGKFIKPKAMGGVNRAIRQGQNKYFETLKRELKKL 134 (134) T ss_pred ------------EEEcCCceeEEEEeecccceecccCCccCcchhhHHHHHHHhhhHHHHHHHHHHHhcC Confidence 122334567899999999999989999999 55555554444444443333 No 79 >protein:vir:81067 Length: 119 # NCBI annotation: p12 # Family: family:all:2714 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285682;genbank:gi:156535145;genbank:GeneID:5247112 Probab=99.49 E-value=2.3e-17 Score=111.83 Aligned_cols=93 Identities=23% Similarity=0.263 Sum_probs=78.2 Q ss_pred HHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeeeeccccccccceeecCCCCCcceeeecccC----------- Q lcl|NC_011356. 41 LKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRGVNPDTGNSDNTMKADNPRNAFYWRFVEMG----------- 109 (148) Q Consensus 41 i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~G----------- 109 (148) ++|+++..+|+.+|.|++||-.......+.... . ...++++...++|||++||| T Consensus 1 ~rDeakarv~~~~G~Lr~sIY~ay~~~~S~dG~-~--------------~Y~Vswn~rkAPhghlvE~Ghw~~~~~~~~~ 65 (119) T protein:vir:81 1 MRESAKAFVNDETGKLRSNLYVAYSPEESTNGV-Q--------------TYAVSWRKKAAPHGHLLEFGHWQTHAAYKGK 65 (119) T ss_pred CCcccccccCCCccchhhhheeeeccccCCCCe-E--------------EEEeeccCCcCCcccccccceeeeeeeeecc Confidence 999999999999999999997665443332211 1 12245566788999999999 Q ss_pred -------------ccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011356. 110 -------------TVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEVLRR 148 (148) Q Consensus 110 -------------T~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~k~~kk 148 (148) |+.+||+|||+|||++...++.++|.+.+++.+.++++= T Consensus 66 dG~w~~~~~~l~~~~~vPa~pFlRpA~da~~~~a~~~~~~r~~~rv~Ev~rg 117 (119) T protein:vir:81 66 DGEWYSSSVKLVNPKWIPARPFLRPGYDSVAMQIPDIAKAAGAKKYAELQRG 117 (119) T ss_pred CceeeecCccccCceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 899999999999999999999999999999999999988 No 80 >protein:vir:10367 Length: 119 # NCBI annotation: conserved phage protein # Family: family:all:2714 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858959;genbank:gi:32128424;genbank:GeneID:2648366 Probab=99.48 E-value=2.8e-17 Score=111.37 Aligned_cols=93 Identities=23% Similarity=0.272 Sum_probs=78.0 Q ss_pred HHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeeeeccccccccceeecCCCCCcceeeecccC----------- Q lcl|NC_011356. 41 LKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRGVNPDTGNSDNTMKADNPRNAFYWRFVEMG----------- 109 (148) Q Consensus 41 i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~G----------- 109 (148) ++|+++..+|+.+|.|++||-.......+.... . ...++.+...++|||++||| T Consensus 1 ~rDeakarv~~~~G~Lr~sIY~ay~~~~S~dG~-~--------------~Y~Vswn~rkAPhghlvE~Ghw~~~~~~~~~ 65 (119) T protein:vir:10 1 MRESAKAFVNDETGKLRSNLYVAYSTEESTNGV-Q--------------TYAVSWRKKAAPHGHLLEFGHWQTHAAYKGK 65 (119) T ss_pred CCcccccccCCCccchhhhheeeeccccCCCCE-E--------------EEEeecCCCcCCcccccccceeeeeeeeecc Confidence 999999999999999999997664443332211 1 12245566788999999999 Q ss_pred -------------ccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011356. 110 -------------TVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEVLRR 148 (148) Q Consensus 110 -------------T~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~k~~kk 148 (148) ++.+||+|||+|||++...++.++|.+.+++.+.++++= T Consensus 66 dG~w~~~~~~l~~~~~vPa~pFlRpA~da~~~~a~~~~~~r~~~rv~Ev~rg 117 (119) T protein:vir:10 66 DGEWYSSSVKLVNPKWIPARPFLRPGYDSVAMQIPDIAKAAGAKKYAELQRG 117 (119) T ss_pred CceeeecCccccCceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 889999999999999999999999999999999999988 No 81 >protein:vir:102441 Length: 137 # NCBI annotation: gp26 # Family: family:all:1084 # MgeID: mge:1618 # MgeName: Pipefish # Cross-refs: genbank:acc:YP_655303;genbank:gi:109521866;genbank:GeneID:4157756 Probab=99.39 E-value=8.8e-16 Score=103.12 Aligned_cols=106 Identities=18% Similarity=0.197 Sum_probs=69.5 Q ss_pred ccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccc-cceeeeeeeee Q lcl|NC_011356. 2 IETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRD-GGMESGVHIRG 80 (148) Q Consensus 2 m~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~-~~~~~~~~~~~ 80 (148) |.+++.++ -....|.+.+ ..+++.+++..+..+.++|+.++|++||+|++||..+...... +.+... T Consensus 1 ~~~~~~~~------~~~~~~~~~~-~~v~r~~l~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~~----- 68 (137) T protein:vir:10 1 MTVTARYE------RNPVGEARQF-QVIARRRLSRITRGTANQARADVPVKTGNLGRSIREDPIVVAGPLRLDSG----- 68 (137) T ss_pred CeeEEEec------cCchhHHHHH-HHHHHHHHHHHHHHHHHHHHhcCCccchhhhcCceeeeeeccccceEEEE----- Confidence 77777665 1222233333 3567889999999999999999999999999999865332211 111111 Q ss_pred eccccccccceeecCCCCCcceeeecccCcc------------------------------CCCCCchhHHHHHHHHHHH Q lcl|NC_011356. 81 VNPDTGNSDNTMKADNPRNAFYWRFVEMGTV------------------------------NMPPHPFVRPAFDVRSEQA 130 (148) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~------------------------------~~~a~PFl~pA~~~~~~~~ 130 (148) ..++..|+.|+||||. .++|+|||+||+++++.+. T Consensus 69 ---------------V~~~~~YA~~ve~GT~ph~I~Pk~~k~~l~~~~~g~~vf~k~V~hPG~~a~PfL~~A~~~~~~~~ 133 (137) T protein:vir:10 69 ---------------VTAHADYARYVHDGTRAHVIRPRRPGGVLRFTVGGRVVYARRVNHPGTRARPFLRNAAERVVARE 133 (137) T ss_pred ---------------ecCCCccceeeecCCCCceeeccccceeeeEeeCCeeEecceeecCCCCCCchHHHHHHHhhhhh Confidence 1123334444444441 3579999999999999988 Q ss_pred HHHH Q lcl|NC_011356. 131 AQVA 134 (148) Q Consensus 131 ~~~~ 134 (148) ...- T Consensus 134 ~~~~ 137 (137) T protein:vir:10 134 TATS 137 (137) T ss_pred cccC Confidence 7765 No 82 >protein:vir:97982 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1482 # MgeName: Orion # Cross-refs: genbank:acc:YP_655121;genbank:gi:109391871;genbank:GeneID:4157345 Probab=99.37 E-value=8.6e-16 Score=103.19 Aligned_cols=108 Identities=21% Similarity=0.215 Sum_probs=66.9 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) |..+.-+++ |.-....|.+.+ ..++++.+...+..++.+|+.+||++||+|++||...........+...+ T Consensus 1 ~~~~~~~~~----~~~~~~~~~~~~-~~~~~~~~~~~~~~v~~~ak~~aPvdtG~Lr~SI~~~~~~~~~~~~~~~v---- 71 (140) T protein:vir:97 1 MATIRARAR----IEIDEAALERES-GEHLRAFHRSLTRRIANQSRVAVPVRTGNLGRTIGELPQVYTPFRVRGGV---- 71 (140) T ss_pred Ceeeeeeee----eeeCHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhcCCccchhhhccceeeeeeCCCceEEEEe---- Confidence 665533322 112233333333 35678888999999999999999999999999997543322222111111 Q ss_pred eccccccccceeecCCCCCcceeeecccCcc-----------------------------CCCCCchhHHHHHH---HHH Q lcl|NC_011356. 81 VNPDTGNSDNTMKADNPRNAFYWRFVEMGTV-----------------------------NMPPHPFVRPAFDV---RSE 128 (148) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~-----------------------------~~~a~PFl~pA~~~---~~~ 128 (148) .+++.|+.|+||||. .++|||||+||++. +++ T Consensus 72 ----------------~~~a~YA~~Ve~GT~ph~I~pk~~k~L~~~~~G~~~~~k~V~hpG~~a~Pfl~~A~~~~~~~~~ 135 (140) T protein:vir:97 72 ----------------EATADYAAPVHEGSRPHAIRARNAQYLHFWWHGREMFRKSVWHPGTRARPFMRNSAQRVVTNDP 135 (140) T ss_pred ----------------cCCccchhhhccCCCCceeecCCCccceeecCCCEEEeeeeecCCCCCChhHHHHHHHHhhhhh Confidence 122345555555541 36799999999997 566 Q ss_pred HHHHH Q lcl|NC_011356. 129 QAAQV 133 (148) Q Consensus 129 ~~~~~ 133 (148) ++... T Consensus 136 ~i~~~ 140 (140) T protein:vir:97 136 RVRMT 140 (140) T ss_pred hccCC Confidence 66655 No 83 >protein:vir:107545 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1481 # MgeName: PG1 # Cross-refs: genbank:acc:NP_943803;genbank:gi:38638428;genbank:GeneID:2657225 Probab=99.37 E-value=8.6e-16 Score=103.19 Aligned_cols=108 Identities=21% Similarity=0.215 Sum_probs=66.9 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) |..+.-+++ |.-....|.+.+ ..++++.+...+..++.+|+.+||++||+|++||...........+...+ T Consensus 1 ~~~~~~~~~----~~~~~~~~~~~~-~~~~~~~~~~~~~~v~~~ak~~aPvdtG~Lr~SI~~~~~~~~~~~~~~~v---- 71 (140) T protein:vir:10 1 MATIRARAR----IEIDEAALERES-GEHLRAFHRSLTRRIANQSRVAVPVRTGNLGRTIGELPQVYTPFRVRGGV---- 71 (140) T ss_pred Ceeeeeeee----eeeCHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhcCCccchhhhccceeeeeeCCCceEEEEe---- Confidence 665533322 112233333333 35678888999999999999999999999999997543322222111111 Q ss_pred eccccccccceeecCCCCCcceeeecccCcc-----------------------------CCCCCchhHHHHHH---HHH Q lcl|NC_011356. 81 VNPDTGNSDNTMKADNPRNAFYWRFVEMGTV-----------------------------NMPPHPFVRPAFDV---RSE 128 (148) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~-----------------------------~~~a~PFl~pA~~~---~~~ 128 (148) .+++.|+.|+||||. .++|||||+||++. +++ T Consensus 72 ----------------~~~a~YA~~Ve~GT~ph~I~pk~~k~L~~~~~G~~~~~k~V~hpG~~a~Pfl~~A~~~~~~~~~ 135 (140) T protein:vir:10 72 ----------------EATADYAAPVHEGSRPHAIRARNAQYLHFWWHGREMFRKSVWHPGTRARPFMRNSAQRVVTNDP 135 (140) T ss_pred ----------------cCCccchhhhccCCCCceeecCCCccceeecCCCEEEeeeeecCCCCCChhHHHHHHHHhhhhh Confidence 122345555555541 36799999999997 566 Q ss_pred HHHHH Q lcl|NC_011356. 129 QAAQV 133 (148) Q Consensus 129 ~~~~~ 133 (148) ++... T Consensus 136 ~i~~~ 140 (140) T protein:vir:10 136 RVRMT 140 (140) T ss_pred hccCC Confidence 66655 No 84 >protein:vir:966 Length: 123 # NCBI annotation: Orf48 # Family: family:all:970 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076620;genbank:gi:13095728;genbank:GeneID:920248 Probab=99.36 E-value=6e-15 Score=98.54 Aligned_cols=117 Identities=10% Similarity=0.135 Sum_probs=87.4 Q ss_pred ccceeeehhHH-HHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 2 IETLLDFSGLE-DISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 2 m~~~~~i~Gl~-el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) |+.+++|..|+ ++.+.|+.+.+.+. ..+..++.+.|+.+.+++++.+|++||.+++|..+.... .+ T Consensus 1 m~~~v~id~L~~~i~~~L~~y~~~v~-~~v~~~v~~~a~~~~~~lk~~sP~~TG~yaksW~~k~~~--~~---------- 67 (123) T protein:vir:96 1 MANKISIDDLAKTIESEVRNWTKDVV-DDIDDIKKDITKNGVKQLRESSPKRTGDYAKNWTSQKLK--NG---------- 67 (123) T ss_pred CCcccchhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhCCccccccccceeeeecC--Ce---------- Confidence 88889988885 45889999998876 578999999999999999999999999999996543211 00 Q ss_pred eccccccccceeecCCCCCcceeeecccCc-----cCCCCCchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 81 VNPDTGNSDNTMKADNPRNAFYWRFVEMGT-----VNMPPHPFVRPAFDVRSEQAAQVAIARMNR 140 (148) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT-----~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~ 140 (148) ..+...+.......|.||||. -+.+|+|||+||++...+.+.+.+.+.|++ T Consensus 68 ---------~~~v~~~~~~y~l~HLLE~GHa~r~GGrV~a~phI~paee~~~~~l~~~i~r~l~~ 123 (123) T protein:vir:96 68 ---------DQVIYQKAPTYRLTHLLENGHAKRNGGRVSPKVHIAPVEEELVSNYISRVEKRLSQ 123 (123) T ss_pred ---------eEEEEEecCCcceEEeeecceeecCCceeCcchhhhHHHHHHHHHHHHHHHHHhcC Confidence 011122223334689999994 347999999999998887666655555554 No 85 >protein:vir:9879 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:2718 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795641;genbank:gi:28876400;genbank:GeneID:1257931 Probab=99.34 E-value=3e-15 Score=100.19 Aligned_cols=109 Identities=14% Similarity=0.134 Sum_probs=75.9 Q ss_pred ehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHh--CCC-------Ccchhhhhceeccccccccceeeeeee Q lcl|NC_011356. 8 FSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSR--APV-------RRGKLRRNVVVLSRCSRDGGMESGVHI 78 (148) Q Consensus 8 i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~--aP~-------~~g~l~~~i~~~~~~~~~~~~~~~~~~ 78 (148) |.|+|+|+++|++... + -+++.+++-+..+...|++. +|+ +||.++.||+...... +.. T Consensus 1 i~G~~~L~~~Lk~~s~---~-dvk~VVkkN~ael~~r~q~~~~~pv~~~~k~~dTG~lkRSi~l~~~~~---g~~----- 68 (127) T protein:vir:98 1 MTGMPALEVKLRSMSE---K-RWDRVANKNLTEMFNRAARPPGTPIGKNTKRHKSGELLRSRRLKKVNS---SKD----- 68 (127) T ss_pred CcChHHHHHHHHHhhH---H-HHHHHHhhhhHHHHHHHHhccCCceeccccccCcccceeeeEEEEecC---Cce----- Confidence 9999999999997732 2 24777877777788888875 888 9999999987543211 110 Q ss_pred eeeccccccccceeecCCCCCcceeeecccCccC---------CCCCchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 79 RGVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVN---------MPPHPFVRPAFDVRSEQAAQVAIARMNR 140 (148) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~---------~~a~PFl~pA~~~~~~~~~~~~~~~~~~ 140 (148) +.+ +.......|+-++||||+. +|+||||.|||+..++..++.+.+.+++ T Consensus 69 ~~v------------gp~g~t~dYapyvEyGTR~m~~~~~~gf~~aqp~l~paf~~Qk~iF~~DL~~l~k~ 127 (127) T protein:vir:98 69 VIT------------GNFGYIKDYAPHVEYGHRIVRNGKQVGYANGTKYLFNNVKKQREIYRQDMLNELRR 127 (127) T ss_pred EEe------------ccCcccccccceeecceeeeecccccccccCccccccchHHHhHHHHHHHHHHhcC Confidence 001 1111234699999999994 5699999999999998555544444443 No 86 >protein:vir:102963 Length: 163 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945289;genbank:gi:39653724;uniprot:Q708M3;genbank:GeneID:2672877 Probab=99.31 E-value=3.3e-14 Score=94.47 Aligned_cols=127 Identities=15% Similarity=0.230 Sum_probs=91.4 Q ss_pred ccceeeehhHHHHHHHHHHhHHH-HHHHHHHHHHHHHHHHHHHHHHHhCCC---------------------------Cc Q lcl|NC_011356. 2 IETLLDFSGLEDISRDLQLLSGA-ENNRVLREATRAGANVLKEEVVSRAPV---------------------------RR 53 (148) Q Consensus 2 m~~~~~i~Gl~el~~~l~~l~~~-~~~~~~~~al~~~a~~i~~~ak~~aP~---------------------------~~ 53 (148) |+..|++.+|+++.++|..+..+ ..++.++..+.+.|..+...++.++|+ +| T Consensus 1 m~~~~d~~~l~~f~k~l~~~~~~~~~~~~~~~~~~e~a~~ll~~vk~rtPv~~~~~~~~~~~~~~~k~~k~~~~~~~k~t 80 (163) T protein:vir:10 1 MSGGFDYRSFAKFANNFNRNANHAKVDRFMRQTLNYEGTELKSKVKERTPVGVYTDHWVEFTTKDGKHVKFWASAHGKQG 80 (163) T ss_pred CCCccCHHHHHHHHHHHHHHhhhcchHHHHHHHHHHHHHHHHHHHHHhCCcccchhhhhhhhhcccchhhhhcccccccc Confidence 99999999999999999987532 234578999999999999999999997 45 Q ss_pred chhhhhceeccccccccceeeeeeeeeeccccccccceeecCCCCCcceeeecccCcc-----CCCCCchhHHHHHHHHH Q lcl|NC_011356. 54 GKLRRNVVVLSRCSRDGGMESGVHIRGVNPDTGNSDNTMKADNPRNAFYWRFVEMGTV-----NMPPHPFVRPAFDVRSE 128 (148) Q Consensus 54 g~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~-----~~~a~PFl~pA~~~~~~ 128 (148) |+|++|..+.......+.+ .+.. .++..|+|+||||+. +.|.+|+|..|.+.... T Consensus 81 G~lr~swk~~~~~k~~~~~------------------~v~v--~N~~~YA~~VE~GHR~~~gGfV~G~fml~~s~~~~~~ 140 (163) T protein:vir:10 81 GTLQKGWSKSRIEVSGRTY------------------KQKV--YNKVYYAPHVEYGHKTVNGGFVPGQFFLHKTVEDTKS 140 (163) T ss_pred chhhccceecceeecCCce------------------EEEE--EecCCccchhhcceeecCCceeccchhhHHHHHHHHH Confidence 6666665432211111111 1111 245579999999965 36899999999999888 Q ss_pred HHHHHHHHHHHHHHHHHhcC Q lcl|NC_011356. 129 QAAQVAIARMNRAIDEVLRR 148 (148) Q Consensus 129 ~~~~~~~~~~~~~i~k~~kk 148 (148) ++.+.+.+.|.+.|++.+.= T Consensus 141 ~~~~~~e~~l~~~l~k~~~~ 160 (163) T protein:vir:10 141 DMEKRVRDKYDGFMRKVVLG 160 (163) T ss_pred HHHHHHHHHHHHHHHHhhcC Confidence 77776666666666666554 No 87 >protein:vir:9647 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795409;genbank:gi:28876182;genbank:GeneID:1257731 Probab=99.19 E-value=2.7e-13 Score=89.53 Aligned_cols=126 Identities=16% Similarity=0.158 Sum_probs=96.1 Q ss_pred ccceeeehhHHHHHHHHHH-hHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--Ccchhhhhceeccccccccceeeeeee Q lcl|NC_011356. 2 IETLLDFSGLEDISRDLQL-LSGAENNRVLREATRAGANVLKEEVVSRAPV--RRGKLRRNVVVLSRCSRDGGMESGVHI 78 (148) Q Consensus 2 m~~~~~i~Gl~el~~~l~~-l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~--~~g~l~~~i~~~~~~~~~~~~~~~~~~ 78 (148) |+.--+++|++||+++|++ |++.-..++.++||.++++.+.++.|.+.|+ |||...+.+.++......|..... T Consensus 1 ~~~~aevkGv~Eilk~lE~klG~~~v~ri~nkAL~~~ge~v~~~lK~~~~~f~DTG~t~dev~~s~~~~~~G~r~V~--- 77 (132) T protein:vir:96 1 MSGFANLKGVEELLANMEKKLGPAKVNRVVNRSLKEIGKELEPSFKSAISIYKRTGETTESAVVSGVRREDGIPKVK--- 77 (132) T ss_pred CCccccccCHHHHHHHHHHhhCHHHHHHHhHHHHHHHHHHHHHHHHHhhhhhhhcchhhcceeecCeeecCCceEEE--- Confidence 5555678899999999999 9986556899999999999999999999996 899998888776554433321111 Q ss_pred eeeccccccccceeecCCCCCcceeeecccCccCCC-CC--chhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 79 RGVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMP-PH--PFVRPAFDVRSEQAAQVAIARMNRAIDE 144 (148) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~-a~--PFl~pA~~~~~~~~~~~~~~~~~~~i~k 144 (148) .++.++.+..-|..|||+-+++ |+ -++..|++..+..+...+.++|++.|+= T Consensus 78 --------------VgW~GpR~~ivHLNE~GyGk~~~PrG~G~I~~a~~~se~~~~~~~~~elkk~l~~ 132 (132) T protein:vir:96 78 --------------LGFTTPRWNIVHLQELEYGWKHNRRGVGVIRRYSDILETIYPRGIRDKLKRGFDG 132 (132) T ss_pred --------------ecccCCceeEEeeecccccCCcCCCcchHHHHHHHhhhhHHHHHHHHHHHHHhcC Confidence 1222233345677889985432 22 3899999999999999999988888888 No 88 >protein:vir:6246 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813700;swissprot:trembl:q859b7;genbank:gi:29366760;uniprot:Q859B7;genbank:GeneID:1258903 Probab=99.14 E-value=3.7e-13 Score=88.74 Aligned_cols=126 Identities=23% Similarity=0.367 Sum_probs=98.8 Q ss_pred Ccc---ceeeehhHHHHHHHHHHh-HHHHHHHHHHHHHHHHHHHHHHHHHHhCCC-----------Ccchhhhhceeccc Q lcl|NC_011356. 1 MIE---TLLDFSGLEDISRDLQLL-SGAENNRVLREATRAGANVLKEEVVSRAPV-----------RRGKLRRNVVVLSR 65 (148) Q Consensus 1 Mm~---~~~~i~Gl~el~~~l~~l-~~~~~~~~~~~al~~~a~~i~~~ak~~aP~-----------~~g~l~~~i~~~~~ 65 (148) |.+ ..|+|+|+.+++..|..+ +.++ .+.++.+..++|+++...+++.+|+ .+|.|..||++.. T Consensus 1 ma~~~~~~vrV~Glr~f~~~mrK~~g~dl-~k~lk~a~~~aa~v~~~~ar~~tP~g~r~~~~s~~~r~G~L~~Sir~aa- 78 (143) T protein:vir:62 1 MAQRSAYTIRVDGLREFQRNVRTLRDKEL-NKAVREANKASGEVLIPQAKHESPDGKRDAKSSKKYRPGKLDKSIKVTA- 78 (143) T ss_pred CCcccchheehHHHHHHHHHHHHhhCCch-hHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccCcchhhccccccc- Confidence 665 589999999999999999 5554 4788999999999999999999999 4677777765421 Q ss_pred cccccceeeeeeeeeeccccccccceeecCCCCCcceeeecccCccCCC--CCchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 66 CSRDGGMESGVHIRGVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMP--PHPFVRPAFDVRSEQAAQVAIARMNRAID 143 (148) Q Consensus 66 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~--a~PFl~pA~~~~~~~~~~~~~~~~~~~i~ 143 (148) +.....++.+.-+..+|+.|++||+..+. |+-||+.|+-+.++++.......|++-|+ T Consensus 79 --------------------T~raa~VrAG~~krVPYA~~I~~G~r~r~Isp~rFl~~a~a~te~~~~r~Ye~~i~~vl~ 138 (143) T protein:vir:62 79 --------------------SAKGAVIKAGSASRVPYAAAIHFGYRARNISPNRFLFRAMARKSDVVAATYERRIAAVVE 138 (143) T ss_pred --------------------cccceeeeeCCcCCCCcccccccCcccccccchhhhhhhhhccCHHHHHHHHHHHHHHHH Confidence 11122233333356789999999998766 89999999999999988877777777777 Q ss_pred HHhcC Q lcl|NC_011356. 144 EVLRR 148 (148) Q Consensus 144 k~~kk 148 (148) +.|-- T Consensus 139 k~l~s 143 (143) T protein:vir:62 139 KYLES 143 (143) T ss_pred HHhcC Confidence 76666 No 89 >protein:vir:1332 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047931;swissprot:trembl:q9zxa7;genbank:gi:9631149;uniprot:Q9ZXA7;genbank:GeneID:2715891 Probab=99.12 E-value=5.3e-13 Score=87.91 Aligned_cols=126 Identities=21% Similarity=0.337 Sum_probs=98.8 Q ss_pred Ccc---ceeeehhHHHHHHHHHHh-HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC-----------cchhhhhceeccc Q lcl|NC_011356. 1 MIE---TLLDFSGLEDISRDLQLL-SGAENNRVLREATRAGANVLKEEVVSRAPVR-----------RGKLRRNVVVLSR 65 (148) Q Consensus 1 Mm~---~~~~i~Gl~el~~~l~~l-~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~-----------~g~l~~~i~~~~~ 65 (148) |.+ ..|+|+|+..++..|.++ +.++ .+.++.+..++|+++...+++.+|+. +|.|..||++.. T Consensus 1 ma~~~~~~vkV~Glr~f~~~mrK~~g~dl-~k~lk~a~~~aa~v~~~~ar~~tP~g~~~p~~srr~r~G~L~~Sir~aa- 78 (143) T protein:vir:13 1 MAQRSAYTIQVDGLRQFQRNVRALRDKEL-NKAVREANKASGEVLIPQAKHESPDGHRDPKSSKRYRPGKLDKSIKVTA- 78 (143) T ss_pred CCcccchheehHHHHHHHHHHHHhhCCcc-hHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccchhhccccccc- Confidence 665 589999999999999999 5554 46889999999999999999999997 566666654321 Q ss_pred cccccceeeeeeeeeeccccccccceeecCCCCCcceeeecccCccCCC--CCchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 66 CSRDGGMESGVHIRGVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMP--PHPFVRPAFDVRSEQAAQVAIARMNRAID 143 (148) Q Consensus 66 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~--a~PFl~pA~~~~~~~~~~~~~~~~~~~i~ 143 (148) +.....++.+.....+|+.|++||+.++. ++-|++.|+-+.++++......+|++-|+ T Consensus 79 --------------------T~raa~VrAGr~arVPYA~~I~~G~r~r~Is~~rFl~~a~a~te~~~~r~Ye~~i~~vl~ 138 (143) T protein:vir:13 79 --------------------SAKGAVIKAGSAARVPYAAAIHFGYRKRNISANRFLYRAMARKSDVVAATYERRIAAVVE 138 (143) T ss_pred --------------------cccceeeeecCcCCCCcccccccCCcccccchhhhhhhhhhccCHHHHHHHHHHHHHHHH Confidence 11222333334455789999999998876 99999999999999988877777777777 Q ss_pred HHhcC Q lcl|NC_011356. 144 EVLRR 148 (148) Q Consensus 144 k~~kk 148 (148) +.|-- T Consensus 139 k~l~s 143 (143) T protein:vir:13 139 KYLES 143 (143) T ss_pred HHhcC Confidence 77666 No 90 >protein:vir:106506 Length: 137 # NCBI annotation: Pas21 # Family: family:all:1084 # MgeID: mge:1680 # MgeName: phiAsp2 # Cross-refs: genbank:acc:YP_024807;genbank:gi:48697422;genbank:GeneID:2846163 Probab=99.06 E-value=2.2e-13 Score=89.97 Aligned_cols=108 Identities=19% Similarity=0.138 Sum_probs=63.7 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) |-.-+++|+ .. .|+.+ ...++++++...+..++.++|.++|++||+|++||...........+... T Consensus 1 ~~~~~~~l~-~~----~l~~~----~~~~~~~~~~~~a~~ve~~ak~~aPv~TG~Lr~SI~~~~~~~~g~~v~~~----- 66 (137) T protein:vir:10 1 MVAHTLRIE-RA----QLHGL----GMDEARKAVNRVVRRTFTRSQILAPVDTGYLRASGRLVLGRERGAVVIGS----- 66 (137) T ss_pred CcccccccC-hh----hHhhH----HHHHHHHHHHHHHHHHHHHHHhcCCcCchhhhccceeeeeeccccEEEEE----- Confidence 444444444 11 23322 24577889999999999999999999999999999764322111111111 Q ss_pred eccccccccceeecCCCCCcceeeecccCc--------------------------c---CCCCCchhHHHHHHHHHHHH Q lcl|NC_011356. 81 VNPDTGNSDNTMKADNPRNAFYWRFVEMGT--------------------------V---NMPPHPFVRPAFDVRSEQAA 131 (148) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT--------------------------~---~~~a~PFl~pA~~~~~~~~~ 131 (148) ..++..|+.|+|||| + .++|+|||+||++..+++.- T Consensus 67 ---------------V~~~~~YA~~ve~GT~ph~I~pk~~kaL~f~~~G~~vf~k~V~hPG~k~~PfL~~Al~~~~~~~~ 131 (137) T protein:vir:10 67 ---------------VEYTARYAAAVHNGRRALTIRAKGNGRLKFTVEGRTVYARSVHQPARAGRPYLSQALREVAPQEG 131 (137) T ss_pred ---------------ecCCcccceeeecCCCCceeecCCCccceeecCCeeEeccceecCCCCCChhhHHHHHHhhcccc Confidence 112234444444444 1 24699999999997776421 Q ss_pred HHHHHHHH Q lcl|NC_011356. 132 QVAIARMN 139 (148) Q Consensus 132 ~~~~~~~~ 139 (148) -.+ .|. T Consensus 132 ~~~--~~~ 137 (137) T protein:vir:10 132 FRV--TIG 137 (137) T ss_pred eeE--eeC Confidence 111 111 No 91 >protein:vir:6216 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:10886 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852596;genbank:gi:31415856;genbank:GeneID:1489214 Probab=99.03 E-value=6.5e-13 Score=87.41 Aligned_cols=115 Identities=21% Similarity=0.379 Sum_probs=92.3 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC----cchhhhhceeccccccccceeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVR----RGKLRRNVVVLSRCSRDGGMESGV 76 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~----~g~l~~~i~~~~~~~~~~~~~~~~ 76 (148) |.+- =.|+.|.+..|..|.+ +.+++....|.+||+-..+..+.+.|.+ .|++|+++++.... T Consensus 1 m~sN---NNGFae~~~~~~tl~k-Vd~kvs~e~L~eAA~~f~~KL~P~Ip~Sl~kkk~HlrD~lkVvvk~---------- 66 (125) T protein:vir:62 1 MASN---NNGFAEALEDINTLLR-VNKKVSLDALDEAAKYFASKLKPKINVSNKNKRTHLRDSLKVVVKD---------- 66 (125) T ss_pred CCCC---chhHHHHHHHhhhhhh-hhhhhhHHHHHHHHHHHHHhhccccChhhhhhhhhcceeeeEEeeC---------- Confidence 3333 3499999999999875 5688999999999999999999999975 46888887653221 Q ss_pred eeeeeccccccccceeecCCCCCcceeeecccCccCC------CCCchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 77 HIRGVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNM------PPHPFVRPAFDVRSEQAAQVAIARMNRAI 142 (148) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~------~a~PFl~pA~~~~~~~~~~~~~~~~~~~i 142 (148) ..+.+.-...+|||+|+|.||+++ .+|+|...+|+++++.+.++|.+.+-.++ T Consensus 67 -------------d~V~V~Fed~a~yW~f~EnGt~~~~~~g~vkaqhf~~~Tf~~nk~kI~~iM~kki~d~m 125 (125) T protein:vir:62 67 -------------DRVSVEFKDEAWYWYLVEHGHKKAKGKGRVKGKHFVQNTFDAEGDKIADIMAQKIINRM 125 (125) T ss_pred -------------CeEEEEEcchhhhhhhhhccccccccccccchhhhhhccHHhhHHHHHHHHHHHHHhhC Confidence 112222346789999999999997 99999999999999999999988777666 No 92 >protein:vir:98636 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039927;genbank:gi:126011102;genbank:GeneID:4818472 Probab=98.97 E-value=8.4e-12 Score=81.31 Aligned_cols=127 Identities=16% Similarity=0.140 Sum_probs=92.5 Q ss_pred CccceeeehhHHHHHHHHHH-hHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--CCcchhhhhceeccccccccceeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQL-LSGAENNRVLREATRAGANVLKEEVVSRAP--VRRGKLRRNVVVLSRCSRDGGMESGVH 77 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~-l~~~~~~~~~~~al~~~a~~i~~~ak~~aP--~~~g~l~~~i~~~~~~~~~~~~~~~~~ 77 (148) -|+.--+++|++|++++|++ |+++...++.++||.++++.|.++.+.+.+ .|||...+.+..+......|..... T Consensus 6 ~~~~~aevkGv~Eilk~lE~klG~~~~~ri~nkAL~~~ge~v~~~lK~~~~~fkDTGat~dev~~s~p~~~~G~r~V~-- 83 (138) T protein:vir:98 6 SMSGFANLKGVEELLANMEKKLGPAKVNRVVNRSLKEIGKELEPSFKSAISIYKRTGETTESAVVSGVRREDGIPKVK-- 83 (138) T ss_pred cccccccccCHHHHHHHHHHhhCHHhhhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeeeeeecCeeecCCceEEE-- Confidence 23333467799999999999 887766789999999999999999999998 4888877777655443322221111 Q ss_pred eeeeccccccccceeecCCCCCcceeeecccCccCCC-CC--chhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 78 IRGVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMP-PH--PFVRPAFDVRSEQAAQVAIARMNRAIDE 144 (148) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~-a~--PFl~pA~~~~~~~~~~~~~~~~~~~i~k 144 (148) ..+.++.+..-|..|||+.+++ |+ -++..|++..+..+...+++++++.|+- T Consensus 84 ---------------igW~GpR~~ivHLNE~GyGk~i~PrG~G~I~ka~~~se~~y~~~vk~el~k~l~~ 138 (138) T protein:vir:98 84 ---------------LGFTTPRWNIVHLQELEYGWKHNRRGVGVIRRYSDILETIYPRGIRDKLKRGFDG 138 (138) T ss_pred ---------------EeeecCeeeEEeeecccccCCcCCCcchHHHHHHHhhhHHHHHHHHHHHHHHhcC Confidence 1112233345677889985532 22 3899999999999999999999888888 No 93 >protein:vir:78644 Length: 133 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429946;genbank:gi:156604000;genbank:GeneID:5525390 Probab=98.94 E-value=1.5e-11 Score=79.86 Aligned_cols=121 Identities=7% Similarity=0.113 Sum_probs=81.8 Q ss_pred ceeeehhHHHHHHHHHH-hHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--CCcchhhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 4 TLLDFSGLEDISRDLQL-LSGAENNRVLREATRAGANVLKEEVVSRAP--VRRGKLRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 4 ~~~~i~Gl~el~~~l~~-l~~~~~~~~~~~al~~~a~~i~~~ak~~aP--~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) |+++++|++||+++|++ |+++...++.++||.++++.+.+..|.+.. .|||..-+.+..+......| T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g---------- 70 (133) T protein:vir:78 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVG---------- 70 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccC---------- Confidence 89999999999999998 888776789999999999999999999987 58888777766544322222 Q ss_pred ecccccccccee---ecCCCCCcceeeecccCccCC----CCCc--hhHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011356. 81 VNPDTGNSDNTM---KADNPRNAFYWRFVEMGTVNM----PPHP--FVRPAFDVRSEQAAQVAIARMNRAIDEVLRR 148 (148) Q Consensus 81 ~~~~~~~~~~~~---~~~~~~~~~y~~~~E~GT~~~----~a~P--Fl~pA~~~~~~~~~~~~~~~~~~~i~k~~kk 148 (148) .....+ +.+..+.+...|..|||..+- .|+- -+..|+++.+..+.+.++++|+ | T Consensus 71 ------~~~rtV~i~W~gp~~R~~iVHLNE~Gytr~Gk~i~PrG~G~i~~a~~~se~~y~~~vk~eL~--------k 133 (133) T protein:vir:78 71 ------SQERAVLIEWVGPMNRKNIIHLNEHGYTRDGKKYTPRGFGVIAKTLAASERKYREIIKKELA--------R 133 (133) T ss_pred ------CcceeEEEEeecCCCceeEEEeeccceecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHhc--------C Confidence 111111 222233456689999995332 2332 3666666666655544444444 4 No 94 >protein:vir:9363 Length: 133 # NCBI annotation: SLT orf 123-like protein # Family: family:all:589 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803341;genbank:gi:29028652;genbank:GeneID:1258087 Probab=98.94 E-value=1.5e-11 Score=79.86 Aligned_cols=121 Identities=7% Similarity=0.113 Sum_probs=81.8 Q ss_pred ceeeehhHHHHHHHHHH-hHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--CCcchhhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 4 TLLDFSGLEDISRDLQL-LSGAENNRVLREATRAGANVLKEEVVSRAP--VRRGKLRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 4 ~~~~i~Gl~el~~~l~~-l~~~~~~~~~~~al~~~a~~i~~~ak~~aP--~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) |+++++|++||+++|++ |+++...++.++||.++++.+.+..|.+.. .|||..-+.+..+......| T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g---------- 70 (133) T protein:vir:93 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVG---------- 70 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccC---------- Confidence 89999999999999998 888776789999999999999999999987 58888777766544322222 Q ss_pred ecccccccccee---ecCCCCCcceeeecccCccCC----CCCc--hhHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011356. 81 VNPDTGNSDNTM---KADNPRNAFYWRFVEMGTVNM----PPHP--FVRPAFDVRSEQAAQVAIARMNRAIDEVLRR 148 (148) Q Consensus 81 ~~~~~~~~~~~~---~~~~~~~~~y~~~~E~GT~~~----~a~P--Fl~pA~~~~~~~~~~~~~~~~~~~i~k~~kk 148 (148) .....+ +.+..+.+...|..|||..+- .|+- -+..|+++.+..+.+.++++|+ | T Consensus 71 ------~~~rtV~i~W~gp~~R~~iVHLNE~Gytr~Gk~i~PrG~G~i~~a~~~se~~y~~~vk~eL~--------k 133 (133) T protein:vir:93 71 ------SQERAVLIEWVGPMNRKNIIHLNEHGYTRDGKKYTPRGFGVIAKTLAASERKYREIIKKELA--------R 133 (133) T ss_pred ------CcceeEEEEeecCCCceeEEEeeccceecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHhc--------C Confidence 111111 222233456689999995332 2332 3666666666655544444444 4 No 95 >protein:vir:94419 Length: 133 # NCBI annotation: ORF028 # Family: family:all:589 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240010;genbank:gi:66395683;genbank:GeneID:5133079 Probab=98.94 E-value=1.5e-11 Score=79.86 Aligned_cols=121 Identities=7% Similarity=0.113 Sum_probs=81.8 Q ss_pred ceeeehhHHHHHHHHHH-hHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--CCcchhhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 4 TLLDFSGLEDISRDLQL-LSGAENNRVLREATRAGANVLKEEVVSRAP--VRRGKLRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 4 ~~~~i~Gl~el~~~l~~-l~~~~~~~~~~~al~~~a~~i~~~ak~~aP--~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) |+++++|++||+++|++ |+++...++.++||.++++.+.+..|.+.. .|||..-+.+..+......| T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g---------- 70 (133) T protein:vir:94 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVG---------- 70 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccC---------- Confidence 89999999999999998 888776789999999999999999999987 58888777766544322222 Q ss_pred ecccccccccee---ecCCCCCcceeeecccCccCC----CCCc--hhHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011356. 81 VNPDTGNSDNTM---KADNPRNAFYWRFVEMGTVNM----PPHP--FVRPAFDVRSEQAAQVAIARMNRAIDEVLRR 148 (148) Q Consensus 81 ~~~~~~~~~~~~---~~~~~~~~~y~~~~E~GT~~~----~a~P--Fl~pA~~~~~~~~~~~~~~~~~~~i~k~~kk 148 (148) .....+ +.+..+.+...|..|||..+- .|+- -+..|+++.+..+.+.++++|+ | T Consensus 71 ------~~~rtV~i~W~gp~~R~~iVHLNE~Gytr~Gk~i~PrG~G~i~~a~~~se~~y~~~vk~eL~--------k 133 (133) T protein:vir:94 71 ------SQERAVLIEWVGPMNRKNIIHLNEHGYTRDGKKYTPRGFGVIAKTLAASERKYREIIKKELA--------R 133 (133) T ss_pred ------CcceeEEEEeecCCCceeEEEeeccceecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHhc--------C Confidence 111111 222233456689999995332 2332 3666666666655544444444 4 No 96 >protein:vir:96973 Length: 133 # NCBI annotation: ORF034 # Family: family:all:589 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239864;genbank:gi:66395542;genbank:GeneID:5133006 Probab=98.94 E-value=1.5e-11 Score=79.86 Aligned_cols=121 Identities=7% Similarity=0.113 Sum_probs=81.8 Q ss_pred ceeeehhHHHHHHHHHH-hHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--CCcchhhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 4 TLLDFSGLEDISRDLQL-LSGAENNRVLREATRAGANVLKEEVVSRAP--VRRGKLRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 4 ~~~~i~Gl~el~~~l~~-l~~~~~~~~~~~al~~~a~~i~~~ak~~aP--~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) |+++++|++||+++|++ |+++...++.++||.++++.+.+..|.+.. .|||..-+.+..+......| T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g---------- 70 (133) T protein:vir:96 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVG---------- 70 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccC---------- Confidence 89999999999999998 888776789999999999999999999987 58888777766544322222 Q ss_pred ecccccccccee---ecCCCCCcceeeecccCccCC----CCCc--hhHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011356. 81 VNPDTGNSDNTM---KADNPRNAFYWRFVEMGTVNM----PPHP--FVRPAFDVRSEQAAQVAIARMNRAIDEVLRR 148 (148) Q Consensus 81 ~~~~~~~~~~~~---~~~~~~~~~y~~~~E~GT~~~----~a~P--Fl~pA~~~~~~~~~~~~~~~~~~~i~k~~kk 148 (148) .....+ +.+..+.+...|..|||..+- .|+- -+..|+++.+..+.+.++++|+ | T Consensus 71 ------~~~rtV~i~W~gp~~R~~iVHLNE~Gytr~Gk~i~PrG~G~i~~a~~~se~~y~~~vk~eL~--------k 133 (133) T protein:vir:96 71 ------SQERAVLIEWVGPMNRKNIIHLNEHGYTRDGKKYTPRGFGVIAKTLAASERKYREIIKKELA--------R 133 (133) T ss_pred ------CcceeEEEEeecCCCceeEEEeeccceecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHhc--------C Confidence 111111 222233456689999995332 2332 3666666666655544444444 4 No 97 >protein:vir:78335 Length: 133 # NCBI annotation: gp9 # Family: family:all:589 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468648;genbank:gi:157325225;genbank:GeneID:5601681 Probab=98.93 E-value=1.8e-11 Score=79.54 Aligned_cols=124 Identities=12% Similarity=0.145 Sum_probs=89.2 Q ss_pred ceeeehhHHHHHHHHHH-hHHHHHHHHHHHHHHHHHHHHHHHHHHhC--CCCcchhhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 4 TLLDFSGLEDISRDLQL-LSGAENNRVLREATRAGANVLKEEVVSRA--PVRRGKLRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 4 ~~~~i~Gl~el~~~l~~-l~~~~~~~~~~~al~~~a~~i~~~ak~~a--P~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) |+++++|++||+++|++ |+++...++.++||.++++.+.+..|.+. ..|||..-+.+..+......|.....+ T Consensus 1 msvevkGv~eilk~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~G~r~V~i---- 76 (133) T protein:vir:78 1 MSVEVTGVEELERQLVSLFGRENLPQLVDPALIAGATLVAKTLKSEFVQFKDTGASIDEINIEKPSYDKGVRSIKI---- 76 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHHhhcchhcccceeeeEEecCeeeeCCceEEEE---- Confidence 89999999999999998 88876678999999999999999999964 568998877776654432222111111 Q ss_pred eccccccccceeecCCCCCcceeeecccCccC----CCCCc--hhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 81 VNPDTGNSDNTMKADNPRNAFYWRFVEMGTVN----MPPHP--FVRPAFDVRSEQAAQVAIARMNRAI 142 (148) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~----~~a~P--Fl~pA~~~~~~~~~~~~~~~~~~~i 142 (148) .+.+..+.+...|..|||..+ ..|+- -+..|+++.+..+.+.++++|++.| T Consensus 77 -----------~W~gp~~R~~iVHLNE~GYtr~Gk~i~PrG~G~i~~a~~~se~~y~~~vk~el~k~l 133 (133) T protein:vir:78 77 -----------DWKGPKDRYKIIHLNEYGYTRNGKKITPAGTGSVARSLRISERAYRAIVQKKIGDKL 133 (133) T ss_pred -----------EEecCCCceeEEEeeccceecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHHHhhC Confidence 122223345678999999533 23443 5888899888877777777766666 No 98 >protein:vir:95372 Length: 124 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764480;genbank:gi:115334634;genbank:GeneID:5179259 Probab=98.85 E-value=3.6e-11 Score=77.84 Aligned_cols=114 Identities=17% Similarity=0.207 Sum_probs=77.4 Q ss_pred CccceeeehhH-HHHHHHHHHhHHHHHHHHHHHHH----HHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeee Q lcl|NC_011356. 1 MIETLLDFSGL-EDISRDLQLLSGAENNRVLREAT----RAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESG 75 (148) Q Consensus 1 Mm~~~~~i~Gl-~el~~~l~~l~~~~~~~~~~~al----~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~ 75 (148) |..++ |..| +++.+.|+...+++.+ .+++++ ..+++.++.+++..+|+.||.+.++.+...... +. T Consensus 1 M~~i~--id~La~~I~~~L~~Ys~~v~~-~v~~~v~~vak~a~~~lkk~i~~tspkrTG~YaK~W~~kk~~e--~~---- 71 (124) T protein:vir:95 1 MAKIK--IGRLADEITSQLRKYSQVIAD-DVEQIMDDVTKEAVGRLKSKIQEVGLVQTGDYMRGWTRKRVPN--GW---- 71 (124) T ss_pred Ccccc--HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhHhcCcccccchhccceeeeecC--ce---- Confidence 66544 4455 6778888888776643 345555 666666667777899999999998876543321 10 Q ss_pred eeeeeeccccccccceeecCCCCCcceeeecccCccC-----CCCCchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 76 VHIRGVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVN-----MPPHPFVRPAFDVRSEQAAQVAIARMNR 140 (148) Q Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~-----~~a~PFl~pA~~~~~~~~~~~~~~~~~~ 140 (148) ...+...+--.|.||||..+ .+++|+|+|+.+...+.+.+.|.+.|+. T Consensus 72 -----------------~V~nk~~yqLtHLLE~GHAkr~GGRV~a~pHI~paee~~~~~l~~~i~~~l~~ 124 (124) T protein:vir:95 72 -----------------VIHNKTEYRLAHLLEYGHATVDGGRVPGTPHIRPIEDWLEKEFEDRVEKAIKQ 124 (124) T ss_pred -----------------eEEEcCCCceeeeeecceeccCCcccCCccchhHHHHHHHHHHHHHHHHHhcC Confidence 12222333359999999754 6999999999998888666666655555 No 99 >protein:vir:93898 Length: 133 # NCBI annotation: ORF028 # Family: family:all:589 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239942;genbank:gi:66395616;genbank:GeneID:5130964 Probab=98.78 E-value=1.4e-10 Score=74.55 Aligned_cols=121 Identities=7% Similarity=0.125 Sum_probs=80.3 Q ss_pred ceeeehhHHHHHHHHHHh-HHHHHHHHHHHHHHHHHHHHHHHHHHhCC--CCcchhhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 4 TLLDFSGLEDISRDLQLL-SGAENNRVLREATRAGANVLKEEVVSRAP--VRRGKLRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 4 ~~~~i~Gl~el~~~l~~l-~~~~~~~~~~~al~~~a~~i~~~ak~~aP--~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) |+++++|++||+++|++. ++....++.++||.++++.+.+..|.+.. .|||..-+.+..+......| T Consensus 1 msvevkGv~eilk~le~k~G~~~~~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g---------- 70 (133) T protein:vir:93 1 MSVEIKGIPEVLKKLESVYGKQSMQAKSDRALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVG---------- 70 (133) T ss_pred CeEEEecHHHHHHHHHHhhCHhhhHhhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccC---------- Confidence 899999999999999875 44445689999999999999999999987 58888777766544322122 Q ss_pred ecccccccccee---ecCCCCCcceeeecccCccCC----CCCc--hhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 81 VNPDTGNSDNTM---KADNPRNAFYWRFVEMGTVNM----PPHP--FVRPAFDVRSEQAAQVAIARMNR 140 (148) Q Consensus 81 ~~~~~~~~~~~~---~~~~~~~~~y~~~~E~GT~~~----~a~P--Fl~pA~~~~~~~~~~~~~~~~~~ 140 (148) .....+ +.+..+.+...|..|||..+- .|+- -+..|+++.+..+.+.++++|++ T Consensus 71 ------~~~rtV~i~W~gp~~R~~iVHLNE~Gytr~Gk~i~PrG~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:93 71 ------SQERAVLIEWVGPMNRKNIIHLNEHGYTRDGKKYTPRGFGVIAKTLAANERKYREIIKKELAR 133 (133) T ss_pred ------CcceEEEEEeecCCCceeEEEeeccceecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 111111 222233456689999995332 2332 46666666666555554444444 No 100 >protein:vir:104347 Length: 145 # NCBI annotation: conserved phage-related protein # Family: family:all:448 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398975;genbank:gi:81343959;genbank:GeneID:3778879 Probab=98.70 E-value=6.8e-11 Score=76.32 Aligned_cols=131 Identities=15% Similarity=0.094 Sum_probs=75.4 Q ss_pred Ccc-ceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeee Q lcl|NC_011356. 1 MIE-TLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIR 79 (148) Q Consensus 1 Mm~-~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~ 79 (148) |-+ ++|..+ ++++..+++. -+...+++.+..+..++...+|++||.+|.|-.++...-..+.......-. T Consensus 5 m~~~~sF~~~-i~~~~~~ve~--------~~~~v~r~~a~~i~~~vv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~G 75 (145) T protein:vir:10 5 IGSVVTFEKS-IADWIDRAED--------GFGIVVSNTVIKTANAIVDLSPVDTGRFKANWQISANSPAQQSLNEYDQTG 75 (145) T ss_pred ccchhccccC-HHHHHHHHHH--------HHHHHHHHHHHHHHHHHHHhCCccchhhccccceeecccccccccccCCCC Confidence 222 233222 3344444443 224467777788888888899999999999976654332222111110000 Q ss_pred e---------eccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 80 G---------VNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAI 142 (148) Q Consensus 80 ~---------~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i 142 (148) . ...-.+.+. ....+-..+..|+.++|||+|.|+|..|++-++.+- .+++.....+++++| T Consensus 76 ~~t~~~~~~~~~~i~~~k~-g~~iyi~Nn~pYA~~LEyG~S~QAP~G~v~~~~~~~-~~~v~~~~~e~k~~~ 145 (145) T protein:vir:10 76 GQTKTYLARQARAVANSKA-TSVIYITNRLDYAADLEYGASNQAPAGVLGVVQARL-GRYFQEAVEEARRAI 145 (145) T ss_pred ccchhhHHHHHHHhhcccc-cceEEEeeCchhhhHhhccccCCCcchHHHHHHHHH-HHHHHHHHHHhhccC Confidence 0 000000000 111223456789999999999999999999999765 555555556666666 No 101 >protein:vir:80116 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425608;genbank:gi:155042941;genbank:GeneID:5469542 Probab=98.69 E-value=1.5e-10 Score=74.51 Aligned_cols=117 Identities=18% Similarity=0.260 Sum_probs=75.6 Q ss_pred CccceeeehhH-HHHHHHHHHhHHHHHHHHHHHHH----HHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeee Q lcl|NC_011356. 1 MIETLLDFSGL-EDISRDLQLLSGAENNRVLREAT----RAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESG 75 (148) Q Consensus 1 Mm~~~~~i~Gl-~el~~~l~~l~~~~~~~~~~~al----~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~ 75 (148) |.. |+|..| +++.+.|+...+++. ..+.+++ ..+++.++++++..+|+.||.+.++.+...... + T Consensus 1 M~~--i~id~La~~I~~~L~~y~~~v~-~~v~~~v~evak~a~~~lkk~i~~tsPkrTG~YaK~W~~k~~~~--~----- 70 (127) T protein:vir:80 1 MAN--IKIDRLGDEITRQLKRYSQVIA-GDLEQIMDDVSKEAVDRLKAKIEEEGLVQTGDYKRGWTRKRTPG--G----- 70 (127) T ss_pred Ccc--ccHhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhhhcCccccccccccceeeeccC--c----- Confidence 655 555555 667888888887764 3446666 555556666666799999999998865433211 0 Q ss_pred eeeeeeccccccccceeecCCCCCcceeeecccCccC-----CCCCchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 76 VHIRGVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVN-----MPPHPFVRPAFDVRSEQAAQVAIARMNRAID 143 (148) Q Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~-----~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~ 143 (148) ....+...+--.|.||||..+ .+|+|+|+|+.+...+++.+.+.+.|+-.=+ T Consensus 71 ----------------~~v~nk~~yqLtHLLE~GHAkr~GGRV~a~pHI~paee~~~~~l~~~i~~~l~~~~~ 127 (127) T protein:vir:80 71 ----------------WVIHNKTEYRLAHLLEYGHATVDGGRVPETPHIRPVEDWLEKEFEDRVERAIKNESR 127 (127) T ss_pred ----------------eeEeecCCcceeehhhcceeccCCcccCCccchhhHHHHHHHHHHHHHHHHhcCCCC Confidence 011222222249999999764 6899999999998777555554444443333 No 102 >protein:vir:103280 Length: 142 # NCBI annotation: phage-related hypothetical protein # Family: family:all:448 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277459;genbank:gi:71834102;genbank:GeneID:3562391 Probab=98.64 E-value=1.8e-10 Score=73.98 Aligned_cols=132 Identities=14% Similarity=0.102 Sum_probs=76.2 Q ss_pred Cccc--eeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeee Q lcl|NC_011356. 1 MIET--LLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHI 78 (148) Q Consensus 1 Mm~~--~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~ 78 (148) |.+- +|.- .+....+++. ..+...+++.+..+..++..++|++||.+|.|..++...-..+.......- T Consensus 1 Ma~~~~sf~~--------~i~~~~~~ve-~~~~~v~r~~a~~i~~~vv~~sPVdTGr~R~nw~vs~~~~~~~~~~~~d~~ 71 (142) T protein:vir:10 1 MANDVVSFRN--------SINAWIDGVT-EGVELIVEGTLTKATKDIVKLSPVDTGRFRGNWQATGNSPAAQSLNNYDPD 71 (142) T ss_pred Cccchhhhhc--------cHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCcccchhhcccceeeecCcccccccCcCCC Confidence 5532 3332 3333333332 233556777778888888889999999999998776544332221111100 Q ss_pred eeeccccc--------cccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 79 RGVNPDTG--------NSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAI 142 (148) Q Consensus 79 ~~~~~~~~--------~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i 142 (148) .......+ ........+-..+..|+.++|||+|.|+|..|++-++. .-.++++....++++.| T Consensus 72 G~~t~~~~~~~~~~i~~~~~g~~iyi~Nn~pYA~~LEyG~S~QAP~G~v~~a~q-~~~~~v~~a~~e~~~~~ 142 (142) T protein:vir:10 72 GNETRNSLRRQIYALARDANTNVIYISNRLDYAQGLEFGSSNQAPSGVLGVVQK-RLGRYFAEAVQEAKRAL 142 (142) T ss_pred CccchhhHHHHHHHhhhccccceEEEeeCcchhhhhhccccCCCcchHHHHHHH-HHHHHHHHHHHHhhccC Confidence 00000000 00011122234567899999999999999999999996 44555555555566655 No 103 >protein:vir:107703 Length: 147 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003902;genbank:gi:45686318;genbank:GeneID:2773043 Probab=98.63 E-value=4.3e-10 Score=71.95 Aligned_cols=134 Identities=16% Similarity=0.163 Sum_probs=78.3 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) |.+-++ .++...+....+.+. ..+...+++.+..+...+...+|++||.+|.|-.++...-..+.......... T Consensus 1 ma~~~~-----~~F~~~i~~~~~~ve-~~~~~~~r~~a~~i~~~vv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~dp~g~ 74 (147) T protein:vir:10 1 MANYQI-----RRFQGEIDAWINAAE-STLEHAIEIFVRDVHDALVSRSPVDTGRFKGNWQITFNEIPNHALNRYDKTGG 74 (147) T ss_pred CCCcch-----hhhhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCCcchhhccccceeecCccccccCCcCCCcc Confidence 666444 355556666665543 34467888888889999999999999999999876544322222111110000 Q ss_pred eccccc---------cccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011356. 81 VNPDTG---------NSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEVLRR 148 (148) Q Consensus 81 ~~~~~~---------~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~k~~kk 148 (148) .....+ ........+-..+..|+.++|||+|.|+|..|++-++.+-.. .+...+.++-+. T Consensus 75 ~t~a~~~~~~~~~~~~~~~~~~iyi~Nn~pYA~~LEyG~S~QAP~G~V~~t~q~~~~--------~v~~~~~e~k~~ 143 (147) T protein:vir:10 75 VVRGEEQAKTYGMFSRGGAITSVHFSNMLIYANALEYGHSQQAPSGVVGLVALRLRS--------YMADAIKQARRQ 143 (147) T ss_pred chhhhhhHHHHHHhhhccCcceEEEeeCcchhhhhhccccCCCCchHHHHHHHHHHH--------HHHHHHHHHHhh Confidence 000000 000111223345678999999999999999999988864333 333333333222 No 104 >protein:vir:79638 Length: 146 # NCBI annotation: gp40 # Family: family:all:448 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285529;genbank:gi:148734512;genbank:GeneID:5219996 Probab=98.62 E-value=4.5e-10 Score=71.83 Aligned_cols=137 Identities=11% Similarity=0.042 Sum_probs=79.9 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) |.+-+ +.++...+....+.+. ..+..++++.+..+..++...+|+|||.+|.|-.++...-..+.......-.. T Consensus 1 ma~~~-----~~sFa~~i~~~~~~ve-~~~~~~~r~~a~~i~~~vv~~sPVDTGr~Ranw~vs~~~~~~~~~~~~dp~G~ 74 (146) T protein:vir:79 1 MADYS-----IREFHGNVDKWIEQVE-SGLNDVIQIFGEKVHGALVDIAPVDTGRFKANMQITANKPPLYALNQYDPDGE 74 (146) T ss_pred CCcch-----hHHHHHhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCCcchhhccccceeecCcccccccCCCCCCc Confidence 55533 2355566666665553 34567888888899999999999999999999876654332221111110000 Q ss_pred e---------ccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_011356. 81 V---------NPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEVL 146 (148) Q Consensus 81 ~---------~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~k~~ 146 (148) . ..-..........+-..+..|+.++|||++.|+|..|++.++.+-. ++++....++++ +..| T Consensus 75 ~t~~~~~~~i~~~~~g~~~~~~iyi~NnlpYA~~LEyG~S~QAP~G~v~~~~~~~~-~~v~~a~~e~k~--~~~l 146 (146) T protein:vir:79 75 KIKAEGRRTLYALLHGGGAIKSIYFSNMLIYANALEYGHSKQAPAGVFGIVAIRLR-SYMAEAIREARK--KNAL 146 (146) T ss_pred ccHHHHHHHHHHHHhcccccceeEEeeCchhhhhhhccccCCCcchHHHHHHHHHH-HHHHHHHHHHHh--hccC Confidence 0 0000000001122334567899999999999999999999997443 333333333333 2222 No 105 >protein:vir:94944 Length: 121 # NCBI annotation: hypothetical protein phage protein # Family: family:all:448 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239282;genbank:gi:66392064;genbank:GeneID:5076589 Probab=98.56 E-value=1.6e-10 Score=74.33 Aligned_cols=115 Identities=18% Similarity=0.139 Sum_probs=72.6 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) ||+|+|... ++++..+++.- +...+++.+..+.+.+...+|++||.+|.|-.++......+.......... T Consensus 1 ~~~~sf~~~-i~~~~~~ve~~--------~~~~~r~~~~~~~~~vv~~sPVdtGrfRanw~vs~~~p~~~~~~~~dp~g~ 71 (121) T protein:vir:94 1 MISMKFNVN-LSRLRSNLREE--------AKKKAIRIAQEIVNGVIARSPVLAGDYRSSWNVSEGSMEFKFNNGGNPANP 71 (121) T ss_pred Cccchhhcc-HHHHHHHHHHH--------HHHHHHHHHHHHHHHHHHhcCCchhhhhccccccccCcccccCCCCCCCcc Confidence 999999866 55665555432 234556666777777888999999999988765433222111111000000 Q ss_pred ------eccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHH Q lcl|NC_011356. 81 ------VNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRS 127 (148) Q Consensus 81 ------~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~ 127 (148) +....+ .....+-..+..|+..+|||+|.|+|..|++.++.+-+ T Consensus 72 ~t~~~~~~~~~~---~~~~iyi~NnlpYA~~LE~G~S~QAP~G~v~~t~~~~q 121 (121) T protein:vir:94 72 TPAPAIVVSSNV---ALPHFYITNGAPYAQQLEKGSSTQAPLGIVRVTLASLR 121 (121) T ss_pred hhHHHHHHHHhh---ccceEEEeeCcchhhhhhcccCCCCcchHHHHHHHhhC Confidence 000000 01122334677899999999999999999999998777 No 106 >protein:vir:99833 Length: 190 # NCBI annotation: hypothetical protein # Family: family:all:274 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164071;genbank:gi:56692603;genbank:GeneID:3192561 Probab=98.52 E-value=1.1e-09 Score=69.63 Aligned_cols=117 Identities=19% Similarity=0.152 Sum_probs=74.9 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CCC------------------------ Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSR-----APV------------------------ 51 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~-----aP~------------------------ 51 (148) ||.++++|. +++|.+.|+.|...+.+ .+..+...|+.++...+.+ .|. T Consensus 1 M~~i~i~~d-~~~~~~~L~~l~~~~~~--~~~l~~~ig~~l~~~~~~rf~~~~~PdG~~W~p~~~~t~~rk~~~~~~~L~ 77 (190) T protein:vir:99 1 MAGITLEWD-GRRALDVLNAGSAALGD--PSGLLQDIGELLLNIHRRRFQAQVSPDGTPWQPLSPAYLRRKRKNRDKILT 77 (190) T ss_pred CceeEEEec-HHHHHHHHHHHHHHhhh--HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCccccHHHHHHhhcCCCccce Confidence 999999998 48899999999876543 3577788888887777765 232 Q ss_pred CcchhhhhceeccccccccceeeeeeeeeeccccccccceeecCCCCCcceeeecccC---------------------- Q lcl|NC_011356. 52 RRGKLRRNVVVLSRCSRDGGMESGVHIRGVNPDTGNSDNTMKADNPRNAFYWRFVEMG---------------------- 109 (148) Q Consensus 52 ~~g~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~G---------------------- 109 (148) ++|.|++||......+ . ...+++..|+...+|| T Consensus 78 ~tg~L~~Si~~~~~~~----------------------~---v~vGtn~~yA~iHq~Gg~i~~~~~~~~~~~~~~~~~g~ 132 (190) T protein:vir:99 78 LDGHLRNLLRYQLDGS----------------------E---LLFGSDRPYAAIHHFGGTIQRQARSSTVYFRQNERTGE 132 (190) T ss_pred ecHHHHHHHhheecCc----------------------E---EEEecCcchhhhhhcCCcccccccchhhhhhhhhhhhh Confidence 2345555544321111 0 0012334455555555 Q ss_pred ----------------------ccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_011356. 110 ----------------------TVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEVL 146 (148) Q Consensus 110 ----------------------T~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~k~~ 146 (148) |.++|++|||.-. ++.++++.+.+.+.|.+.|++.. T Consensus 133 ~~~~~~~~~~~~~~~~~~~~~~~v~IPaRpfLG~s-~~d~~~I~~~i~~~l~~~~~~~~ 190 (190) T protein:vir:99 133 VGREFVPRRRSNFAQDVQIGPYTIQMPARPWLGTS-SQDDDTILQRVERYLQRALRERA 190 (190) T ss_pred hhcccccccccccchhcccccceeeecCcccCCCC-HHHHHHHHHHHHHHHHHHHhhcC Confidence 3568999999655 56677777777777777777766 No 107 >protein:vir:79091 Length: 175 # NCBI annotation: gp5, phage virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111205;genbank:gi:134288802;genbank:GeneID:4960765 Probab=98.52 E-value=6.9e-10 Score=70.82 Aligned_cols=136 Identities=14% Similarity=0.152 Sum_probs=80.2 Q ss_pred Ccc-ceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CCCCcchhhhhceeccc--------- Q lcl|NC_011356. 1 MIE-TLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSR-----APVRRGKLRRNVVVLSR--------- 65 (148) Q Consensus 1 Mm~-~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~-----aP~~~g~l~~~i~~~~~--------- 65 (148) |.. ++|+|.+ +++.+.|++|.....+ .+..+..-++.+....+.+ .|.++.--...+..+.+ T Consensus 1 Ms~~i~i~~d~-~~~~~~L~~l~~~~~d--~~~lm~~Ig~~l~~~t~~rF~~~~~PdW~pls~~t~~~r~~~~~~~~~~~ 77 (175) T protein:vir:79 1 MSDFVNFQIDD-SALRTRLLQLEQAGHQ--KADAMRKITQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKNG 77 (175) T ss_pred CceEEEEEech-HHHHHHHHHHHHHhcC--HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCChHHHHhhccccccccccc Confidence 554 5788886 8899999999766543 3567888888888777764 34321110000000000 Q ss_pred ---------------cccccceeeeeeeeeeccccccccceeecCCCCCcceeeecccCcc-------CCCCCchhHHHH Q lcl|NC_011356. 66 ---------------CSRDGGMESGVHIRGVNPDTGNSDNTMKADNPRNAFYWRFVEMGTV-------NMPPHPFVRPAF 123 (148) Q Consensus 66 ---------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~-------~~~a~PFl~pA~ 123 (148) ....|.+..++... .+. . ....+++..|+.+..||+. .+||+|||.=.- T Consensus 78 ~~~~~~~~~~~~~~~L~~tG~L~~Si~~~-----~~~-~---~v~vGtn~~YAaiHqfGg~~~~~~~v~IPARPfLG~s~ 148 (175) T protein:vir:79 78 ELTAAASRRKAGLMILQDSGQMAASTATD-----SGE-D---YSVIGSNKEYAAIQHFGGQAGRGLKVTIPGRAWLPVTA 148 (175) T ss_pred cchhhHhhhccCCCcceechhhhhhhhhe-----ecC-C---EEEEecCcchhhHhhcccccCCCcccccCcccccCCCc Confidence 00011111111110 000 1 1112456689999999975 799999998433 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011356. 124 -DVRSEQAAQVAIARMNRAIDEVLRR 148 (148) Q Consensus 124 -~~~~~~~~~~~~~~~~~~i~k~~kk 148 (148) ++-..++.+.|.+.+.+.|++++++ T Consensus 149 ~de~~~~~~~~I~~~i~~~l~~a~~~ 174 (175) T protein:vir:79 149 DGELQPEAVEPVLNTILRHLMDAANR 174 (175) T ss_pred ccchhHHHHHHHHHHHHHHHHHHhcc Confidence 3445677788888899999999988 No 108 >protein:vir:78380 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110844;genbank:gi:134288605;genbank:GeneID:5179643 Probab=98.44 E-value=9.9e-10 Score=69.96 Aligned_cols=125 Identities=16% Similarity=0.168 Sum_probs=71.7 Q ss_pred ccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeee- Q lcl|NC_011356. 2 IETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRG- 80 (148) Q Consensus 2 m~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~- 80 (148) |++..+ ++++..+.+.- +...+++.+..+...+...+|++||.+|.|..++...-..+.......-.. T Consensus 1 msf~~~---i~~~~~~ve~~--------~~~~~r~~a~~~~~~iv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~g~~ 69 (131) T protein:vir:78 1 MSFALD---VSKFVEKAKKN--------PEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTT 69 (131) T ss_pred CCcCcC---HHHHHHHHHHH--------HHHHHHHHHHHHHHHHHHhCCCchhhhccccceecccccccccCCCCCCchh Confidence 555544 33555554432 234566666777777778999999999999876654332222111110000 Q ss_pred -ecc----ccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 81 -VNP----DTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMN 139 (148) Q Consensus 81 -~~~----~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~ 139 (148) ... -.+.+ .....+-..+..|+..+|||+|.|+|..|++-++..- .++++....+++ T Consensus 70 t~~~~~~~i~~~~-~g~~iyi~Nn~pYA~~LEyG~S~QAP~G~v~~~~~~~-~~~v~~~~~e~k 131 (131) T protein:vir:78 70 ATSNAANFVLNAA-DWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRF-QQLLNEEASKVK 131 (131) T ss_pred hHHHHHHHHhhcc-CCceEEEeeCchhhhHhhccccCCCcchHHHHHHHHH-HHHHHHHHHhcC Confidence 000 00000 0112223456789999999999999999999999744 444444444444 No 109 >protein:vir:1988 Length: 156 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050635;genbank:gi:9633522;genbank:GeneID:2636282 Probab=98.41 E-value=1.9e-09 Score=68.39 Aligned_cols=130 Identities=13% Similarity=0.180 Sum_probs=68.6 Q ss_pred ccceeeeh-hHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhC-----CC-Cc-c-hhhhh-ceecc------- Q lcl|NC_011356. 2 IETLLDFS-GLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRA-----PV-RR-G-KLRRN-VVVLS------- 64 (148) Q Consensus 2 m~~~~~i~-Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~a-----P~-~~-g-~l~~~-i~~~~------- 64 (148) |+|.++|+ .+++|.+.|++|..... . +..++.-++.++...+.+- |. .+ . .|..+ +..+. T Consensus 1 ms~~i~~~~d~~~l~~~L~~l~~~~~-~--~~l~~~Ig~~l~~~~~~rf~~~~~Pd~G~~W~pls~~t~~~r~~~~~~~~ 77 (156) T protein:vir:19 1 MSLDMNVAVDVRRIQLALDELGTVTR-D--RAIPRVMAAALLSSTEQAFERQADPDTGKGWEAWSDSWLAWRQDHGFVPG 77 (156) T ss_pred CeEEEEEeecHHHHHHHHHHHHhhhc-c--HHHHHHHHHHHHHHHHHHHHhcCCCCCCCCCcccChHHHHHhhccCCCCC Confidence 88888877 67889999998864322 1 3456666666666666542 42 10 0 01000 00000 Q ss_pred -ccccccceeeeeeeeeeccccccccceeecCCCCCcceeeecccCcc--------CCCCCchhHHHHHHHHHHHHHHHH Q lcl|NC_011356. 65 -RCSRDGGMESGVHIRGVNPDTGNSDNTMKADNPRNAFYWRFVEMGTV--------NMPPHPFVRPAFDVRSEQAAQVAI 135 (148) Q Consensus 65 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~--------~~~a~PFl~pA~~~~~~~~~~~~~ 135 (148) .....|.+..++.. ..+.. ... .+++..|+...+||+. ++|++|||. --++.++++.+.+. T Consensus 78 ~~L~~tg~L~~Si~~-----~~~~~-~v~---vGt~~~yA~vHqfG~~~~~~~~~~~iPaRpfLG-~s~~d~~~I~~~i~ 147 (156) T protein:vir:19 78 SILTLHGDLARSITT-----DYGQD-YAL---IGSPKIYAAIHQWGGTPDMAPRPAGVPARPYMG-LDKTGEQEIFDAIR 147 (156) T ss_pred cchhhhHHHHHHhhh-----eecCC-EEE---EecchhhhHHhhcCcccccCCCccccCCccccC-CCHHHHHHHHHHHH Confidence 00011111111111 00111 111 1356789999999975 599999995 44566665555555 Q ss_pred HHHHHHHHHHhcC Q lcl|NC_011356. 136 ARMNRAIDEVLRR 148 (148) Q Consensus 136 ~~~~~~i~k~~kk 148 (148) +.|...++ + T Consensus 148 ~~l~~~~~----~ 156 (156) T protein:vir:19 148 KRVSAALR----Q 156 (156) T ss_pred HHHHHHhh----C Confidence 55555554 4 No 110 >protein:vir:94994 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224022;genbank:gi:62327309;genbank:GeneID:5176822 Probab=98.40 E-value=8.3e-10 Score=70.36 Aligned_cols=125 Identities=16% Similarity=0.161 Sum_probs=71.7 Q ss_pred ccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeee- Q lcl|NC_011356. 2 IETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRG- 80 (148) Q Consensus 2 m~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~- 80 (148) |++..+ ++++..+.+.- +..++++.+..+...+...+|++||.+|.|..++...-..+.......-.. T Consensus 1 msF~~~---i~~~~~~ve~~--------~~~~~r~~a~~~~~~iv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~g~~ 69 (131) T protein:vir:94 1 MSFALD---VTRFVEKAKKN--------PEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGSTPADGTTDATDKSGNT 69 (131) T ss_pred CCcccC---HHHHHHHHHHH--------HHHHHHHHHHHHHHHHHHhCCCchhhhhccchhccccccccccCCCCCCchh Confidence 555444 33555554432 234566666777777778999999999999876644333222111110000 Q ss_pred -----eccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 81 -----VNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMN 139 (148) Q Consensus 81 -----~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~ 139 (148) ...-.+.+ .....+-..+..|+.++|||++.|+|..|++-++..- .++++....+++ T Consensus 70 t~~~~~~~i~~~~-~g~~iyi~Nn~pYA~~LEyG~S~QAP~g~v~~~~~~~-~~~v~~~~~e~k 131 (131) T protein:vir:94 70 ATGNATSFVLNAA-DWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRF-QQLLNEEASKVK 131 (131) T ss_pred hHHHHHHHHhhcc-ccceEEEeeCchhhhhhhccccCCCcchHHHHHHHHH-HHHHHHHHHhcC Confidence 00000000 0111223456789999999999999999999999744 444444444444 No 111 >protein:vir:96012 Length: 133 # NCBI annotation: ORF023 # Family: family:all:589 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239805;genbank:gi:66395471;genbank:GeneID:5132929 Probab=98.40 E-value=8e-09 Score=64.97 Aligned_cols=123 Identities=15% Similarity=0.122 Sum_probs=84.0 Q ss_pred CccceeeehhHHHHHHHHHH-hHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--Ccchhhhhceeccccccccceeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQL-LSGAENNRVLREATRAGANVLKEEVVSRAPV--RRGKLRRNVVVLSRCSRDGGMESGVH 77 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~-l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~--~~g~l~~~i~~~~~~~~~~~~~~~~~ 77 (148) |. +|.|++||+++|++ |++....++.++||.++++.+.+..|.+.-+ |||..-+.+..+......|.....+ T Consensus 1 m~----evkGv~eilk~lE~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGatidev~~s~p~~~~g~rtV~i- 75 (133) T protein:vir:96 1 MR----LIYDTKKLERELEKRLSKRALMRITDRALTEAGEVVLEAIRTNLKYFRDTGAEYGEVKLSKPTWENGKRTIRV- 75 (133) T ss_pred Cc----cccCHHHHHHHHHHhcCHHHHHHHhhHHHHHHHHHHHHHHHHhhHHHhhccceeeeEEecCceecCCceEEEE- Confidence 43 56899999999985 5666667899999999999999999998654 7887766665544332222111111 Q ss_pred eeeeccccccccceeecCCCCCcceeeecccCcc-----CCCCCc--hhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 78 IRGVNPDTGNSDNTMKADNPRNAFYWRFVEMGTV-----NMPPHP--FVRPAFDVRSEQAAQVAIARMNRAI 142 (148) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~-----~~~a~P--Fl~pA~~~~~~~~~~~~~~~~~~~i 142 (148) .+.+..+.+...|..|||+. +..|+- -+..|+++.+..+.+.+++++++.| T Consensus 76 --------------~W~gp~~R~~iVHLNE~G~ytr~Gk~i~PrG~G~I~~al~~se~~y~~~vk~el~kll 133 (133) T protein:vir:96 76 --------------YWEGEKHRYSIVHLNEKGFYAKDGKFIRPKGMGAIDKALRASRDKFFKVYAEEVSKLL 133 (133) T ss_pred --------------EeecCCCceeeEeeecccceecCCceeccchhhHHHHHHHhhhHHHHHHHHHHHHHhC Confidence 11122334556889999943 234443 5889999999977777777776666 No 112 >protein:vir:7412 Length: 168 # NCBI annotation: hypothetical protein # Family: family:all:1029 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839929;genbank:gi:30089899;genbank:GeneID:1260686 Probab=98.33 E-value=6.7e-09 Score=65.40 Aligned_cols=127 Identities=15% Similarity=0.192 Sum_probs=86.0 Q ss_pred CccceeeehhHHHHHHHHHHhHHHH--HHHHHHHHHHHHHHHHHHHHHHhCCCCc---------chhhhhceeccccccc Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAE--NNRVLREATRAGANVLKEEVVSRAPVRR---------GKLRRNVVVLSRCSRD 69 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~--~~~~~~~al~~~a~~i~~~ak~~aP~~~---------g~l~~~i~~~~~~~~~ 69 (148) |.+|. ..|++.+.++++|..++ .++ .+...+||+++++.....+|... ++|++||.... T Consensus 1 M~~~~---~~l~~~~~~vekl~~~lt~eqk--akITkAGAkv~~~~L~~~t~~kHy~~k~t~~~~HLaDsI~~~~----- 70 (168) T protein:vir:74 1 MATFE---EAMQLIINQAESLSTKMTVEDK--AEVTKAGAKVFEQALAYEVRNRHYRHRDTGEDPHLADSIVMKN----- 70 (168) T ss_pred CccHH---HHHHHHHHHHHhhccCCCHHHH--HHHHHhhhHHHHHHHHHHhHHhhcccCCCcccchhhhheeecc----- Confidence 76666 55778888888877543 333 46788999999998888887532 25555554332 Q ss_pred cceeeeeeeeeeccccccccceeecCCC-------CCcceeeecccCcc------------------CCCCCchhHHHHH Q lcl|NC_011356. 70 GGMESGVHIRGVNPDTGNSDNTMKADNP-------RNAFYWRFVEMGTV------------------NMPPHPFVRPAFD 124 (148) Q Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~y~~~~E~GT~------------------~~~a~PFl~pA~~ 124 (148) ............+++.+ ..++.++|++.||. +|++-||+..+-+ T Consensus 71 -----------~niDg~~dG~s~VGf~~k~~~~~~~kA~iAr~lNDGTk~~~~~~~~~~~~~~~g~v~i~gDHFvd~~r~ 139 (168) T protein:vir:74 71 -----------KNIDGVKDGQSVVGWERSTEKGTHTKGYIANIINNGSRFPQFTTRSGRKYKKPGEVAVHADHFIEETRM 139 (168) T ss_pred -----------cccCcccCCceeecccccccccccchhhhhhhhcccccccccccccccccccccccccccchhHHHHHh Confidence 21111111222223322 35788999999994 6899999999999 Q ss_pred H--HHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011356. 125 V--RSEQAAQVAIARMNRAIDEVLRR 148 (148) Q Consensus 125 ~--~~~~~~~~~~~~~~~~i~k~~kk 148 (148) . .++.|+++..+++++-|++.-+- T Consensus 140 ~~~~k~~V~~Ae~~~y~eIl~~k~~~ 165 (168) T protein:vir:74 140 NLIVQQGILKAEAEAMRKIINRKKKE 165 (168) T ss_pred hhhhHHHHHHHHHHHHHHHHHhhcCC Confidence 8 67988898888888777766555 No 113 >protein:vir:3163 Length: 145 # NCBI annotation: unknown # Family: family:all:28417 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665934;genbank:gi:22091120;genbank:GeneID:951270 Probab=98.33 E-value=7.1e-09 Score=65.26 Aligned_cols=116 Identities=16% Similarity=0.212 Sum_probs=62.1 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CCC----------------------Cc Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSR-----APV----------------------RR 53 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~-----aP~----------------------~~ 53 (148) |... .+++.+.|++|...+. .++...+..+.+.+..+ .|. ++ T Consensus 1 ~i~~------~~~i~~~l~~l~~~~~-----~~l~~i~~~~~~~~~~rf~~~~~p~G~~W~pLs~st~a~k~~~~~L~~t 69 (145) T protein:vir:31 1 MVED------ENNIPEAREAIQDGLT-----DGLERLHTITLRELITNMSDGQDALGNPWEPLKESTIRAKGSDTPLIDN 69 (145) T ss_pred Cccc------HHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccChHHHHHhcCCCCCccC Confidence 4333 2345555555543332 34444455455544442 232 22 Q ss_pred chhhhhceeccccccccceeeeeeeeeeccccccccceeecCCCCCcceeeecccCccC--CCCCchhHHHHHHHHHHHH Q lcl|NC_011356. 54 GKLRRNVVVLSRCSRDGGMESGVHIRGVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVN--MPPHPFVRPAFDVRSEQAA 131 (148) Q Consensus 54 g~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~--~~a~PFl~pA~~~~~~~~~ 131 (148) |.|++||..+.... ........+++..|+.+.+|||.+ +||||||.++..-..+++. T Consensus 70 G~L~~Si~~~~~~~---------------------~~~~~a~vGtn~~YA~~hqfG~~~~~IPaRPfLG~~~~~~~~~~~ 128 (145) T protein:vir:31 70 SRLLTDINAASMMD---------------------RANRMAVIGTNLDYAEHHEFGAPEAGIPARPIFGPAGAYASQQAP 128 (145) T ss_pred HHHHHHHHHHhhhc---------------------ccCceeEecCCchhhhhhccCCcccccCCCCccCCCccchHHHHH Confidence 33333332211100 001112234667899999999976 9999999999876677666 Q ss_pred HHHHHHHHHHHHHHhcC Q lcl|NC_011356. 132 QVAIARMNRAIDEVLRR 148 (148) Q Consensus 132 ~~~~~~~~~~i~k~~kk 148 (148) +.+.+.+...|..++-- T Consensus 129 ~ii~~~i~~~L~~~~~~ 145 (145) T protein:vir:31 129 DVIGDEIDTNLEGAVID 145 (145) T ss_pred HHHHHHHHHHhhhhccC Confidence 66666666555555544 No 114 >protein:vir:5257 Length: 148 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852762;genbank:gi:31544037;uniprot:Q7Y5T8;genbank:GeneID:2753554 Probab=98.24 E-value=1.9e-09 Score=68.45 Aligned_cols=94 Identities=17% Similarity=0.342 Sum_probs=55.2 Q ss_pred cccee--eehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeee Q lcl|NC_011356. 2 IETLL--DFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIR 79 (148) Q Consensus 2 m~~~~--~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~ 79 (148) |++++ +..|+++|++.|++|.+.. + +-..|.+.. T Consensus 1 M~~~~k~~~~~~~~l~~~l~~l~~~~---v----------------~VGi~~~~~------------------------- 36 (148) T protein:vir:52 1 MAVTVTANFSAAKQLIEQMKSLKEKA---V----------------YVGFPAEFD------------------------- 36 (148) T ss_pred CccccccccHHHHHHHHHHHHhhCCe---E----------------EEEeecCcC------------------------- Confidence 77544 4557888888888774310 0 000010000 Q ss_pred eeccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHH--HHhcC Q lcl|NC_011356. 80 GVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAID--EVLRR 148 (148) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~--k~~kk 148 (148) .......+.+.+.++.+.||||.+.||||||+|+++.++++..+.+...++..++ .+|.. T Consensus 37 ---------~~~~~~~g~~vA~ia~~~E~G~~~IP~Rpflr~t~~~~~~~~~~~~~~~~~~~~~~~~~L~~ 98 (148) T protein:vir:52 37 ---------EKVKGSENFNLASLAAVLEFGNEHIPARPFLRQTLEENQEKYTALFIQWFDQGVPAAQIYER 98 (148) T ss_pred ---------CCCCCCCCCCHHHHHHHHhcCCCCCCCcchhHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHH Confidence 0000011235577999999999999999999999999999887766654443221 11111 No 115 >protein:vir:1028 Length: 168 # NCBI annotation: Orf48 # Family: family:all:1029 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076682;genbank:gi:13095791;genbank:GeneID:920342 Probab=98.21 E-value=1.8e-08 Score=62.99 Aligned_cols=127 Identities=15% Similarity=0.199 Sum_probs=82.4 Q ss_pred CccceeeehhHHHHHHHHHHhH--HHHHHHHHHHHHHHHHHHHHHHHHHhCCCCc---------chhhhhceeccccccc Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLS--GAENNRVLREATRAGANVLKEEVVSRAPVRR---------GKLRRNVVVLSRCSRD 69 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~--~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~---------g~l~~~i~~~~~~~~~ 69 (148) |-+|.-- |+.++..++.|+ ..+.++ .+...+||+++++.....+|... ++|++||... T Consensus 1 M~~~~d~---l~~~~~~vekl~~~ls~eqk--akITkAGAkv~~~~L~~~tk~kHy~~k~t~~~~HLaDsI~~~------ 69 (168) T protein:vir:10 1 MVSFYDA---MQLIVDRAEELSTKMSVEDK--AEVTKAGAKVFEQALAYEVRNRHYRHRDTGEDPHLADSIVMK------ 69 (168) T ss_pred CCcHHHH---HHHHHHHHHHhhcCCCHHHH--HHHhHhhhHHHHHHHHHHhhHhhhccCCCCccchhhhhheec------ Confidence 7666554 445566666652 122333 36788999999999999888632 2455555433 Q ss_pred cceeeeeeeeeeccccccccceeecCCC-------CCcceeeecccCcc------------------CCCCCchhHHHHH Q lcl|NC_011356. 70 GGMESGVHIRGVNPDTGNSDNTMKADNP-------RNAFYWRFVEMGTV------------------NMPPHPFVRPAFD 124 (148) Q Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~y~~~~E~GT~------------------~~~a~PFl~pA~~ 124 (148) .............+++.+ ..++.++|++.||. +|++-||+..+-+ T Consensus 70 ----------~~niDg~~dG~s~VGf~~k~~~~~~~ka~iAr~lNDGTk~~~~~~~~~~~~~~~g~v~i~gDHFvd~~r~ 139 (168) T protein:vir:10 70 ----------NKNIDGVKDGQSVVGWERSTEKGTHTKGYIANIINNGSRFPQFTTRSGRKYKKPGEVAVHADHFIEETRK 139 (168) T ss_pred ----------ccccccccCCceeecccCccccccccchheeeeccccccccccccccccccccccccccccchhHHHhhh Confidence 221111122222233332 36788999999994 6899999999999 Q ss_pred H--HHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011356. 125 V--RSEQAAQVAIARMNRAIDEVLRR 148 (148) Q Consensus 125 ~--~~~~~~~~~~~~~~~~i~k~~kk 148 (148) . .++.|+++..+++++-|++.-+- T Consensus 140 d~a~k~~V~~Ae~~~y~eIl~~k~~~ 165 (168) T protein:vir:10 140 NPIVQQGILKAEAEAMRKIINRKKKE 165 (168) T ss_pred chhhhHHHHHHHHHHHHHHHHhhcCC Confidence 6 57888888888888777765555 No 116 >protein:vir:102338 Length: 116 # NCBI annotation: hypothetical protein # Family: family:all:26573 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529563;genbank:gi:90592648;genbank:GeneID:3974470 Probab=98.20 E-value=1.3e-08 Score=63.89 Aligned_cols=94 Identities=19% Similarity=0.133 Sum_probs=65.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCC---CcchhhhhceeccccccccceeeeeeeeeeccccccccceeecCCCCCcce Q lcl|NC_011356. 26 NNRVLREATRAGANVLKEEVVSRAPV---RRGKLRRNVVVLSRCSRDGGMESGVHIRGVNPDTGNSDNTMKADNPRNAFY 102 (148) Q Consensus 26 ~~~~~~~al~~~a~~i~~~ak~~aP~---~~g~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y 102 (148) .++.+++++++.|..+...++.++|+ ++|+||+|..+..... .+ + . ..++..| T Consensus 1 l~~~~~~~~~~~a~~l~~~vk~rTPv~~~d~G~LR~sW~~g~v~k-~~---------------~----~----v~N~~eY 56 (116) T protein:vir:10 1 MSKNLRRAKNNIGNKLLRKVKPKTPVAKIDGGTARKSWKYKELNL-FD---------------G----V----VSNNVEY 56 (116) T ss_pred CchHHHHHHHHHHHHHHHHHHhhCCCCcCCCcccccCceeeeeec-cC---------------c----e----eecCCcc Confidence 34667888999999999999999998 4699999865432111 00 0 0 1256689 Q ss_pred eeecccCcc-------------------CCCCCchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 103 WRFVEMGTV-------------------NMPPHPFVRPAFDVRSEQAAQVAIARMNRAID 143 (148) Q Consensus 103 ~~~~E~GT~-------------------~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~ 143 (148) ++|||||.. +.|.+.||..+..+.+.++...+++.|.+.++ T Consensus 57 A~~VE~GHRq~~g~g~~~~~~gkrlk~~~V~G~fml~~s~~e~~~~~~~~~~~~~~~~l~ 116 (116) T protein:vir:10 57 IHHLEYGHRTRQGTGTSENYRPKPNGISFVPGVFMLARSVDEMSSIIDDELNQIIIDFWN 116 (116) T ss_pred cccccCCceeeCCcceecccccccccCCccCceehHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 999999964 34667788888888877666655555555555 No 117 >protein:vir:101563 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958111;genbank:gi:41057657;genbank:GeneID:2716820 Probab=98.17 E-value=6e-09 Score=65.66 Aligned_cols=103 Identities=13% Similarity=0.080 Sum_probs=53.1 Q ss_pred ceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeeeecc Q lcl|NC_011356. 4 TLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRGVNP 83 (148) Q Consensus 4 ~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~~~~ 83 (148) |++.-+||++|++.|+.. + ++-..|.+.+. .+ ..+... .+ .... T Consensus 1 m~v~r~~L~~~~~~l~~~--~--------------------V~VGi~~~a~y-~d---------~~g~~~-~~--g~~~- 44 (155) T protein:vir:10 1 MSVTRRGLTLPKDRYKSM--S--------------------VKAGVLAGATY-PD---------ESGKKL-AD--GTIL- 44 (155) T ss_pred CcchHHHHHHHHHHhhCC--e--------------------eEEeecCCCCC-Cc---------cccchh-hh--hhhh- Confidence 666667777776655531 0 00011111100 00 000000 00 0000 Q ss_pred ccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHH--HHHhcC Q lcl|NC_011356. 84 DTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAI--DEVLRR 148 (148) Q Consensus 84 ~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i--~k~~kk 148 (148) . .....+.+-+.++.++||||.+.||||||||++++++++..+.+...++..+ +++|.. T Consensus 45 ~------~~~~~G~pva~ia~~~e~G~~~IP~RPFlr~t~~~~~~~~~~~l~~~~~~~~~~~~~L~~ 105 (155) T protein:vir:10 45 K------KDPRAGLPVAMIAMALNYGTSKLPARPFMEKTIADRSAEWIKGLTVMMTMGYDAEVAMGQ 105 (155) T ss_pred c------cccccCcchhhhhhhhhcCCCCCCCcchhHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHH Confidence 0 0001123446788999999999999999999999999988877666554432 222222 No 118 >protein:vir:79225 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469157;genbank:gi:157835000;genbank:GeneID:5648806 Probab=98.16 E-value=1.9e-08 Score=62.88 Aligned_cols=134 Identities=12% Similarity=0.083 Sum_probs=67.6 Q ss_pred Ccc-ceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhC-CCC-------cchhhhhce----eccccc Q lcl|NC_011356. 1 MIE-TLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRA-PVR-------RGKLRRNVV----VLSRCS 67 (148) Q Consensus 1 Mm~-~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~a-P~~-------~g~l~~~i~----~~~~~~ 67 (148) |.. ++|++.+ +++.+.|.+|...+.+ ....++..++.+....+.+- |.. ...++..-. ...... T Consensus 1 M~~~i~i~~d~-~~~~~~L~~l~~~~~d--~~~l~~~ig~~l~~~~~~rF~~eG~~W~pls~~t~~~r~~~g~~~~~iL~ 77 (155) T protein:vir:79 1 MTTRIDVELDD-QEVRQRLAVLMRSVTD--TLPVMRGIAAELLAETEFAFMDEGPGWPQLSPATVAAREAKGRGPHPILQ 77 (155) T ss_pred CceEEEEEech-HHHHHHHHHHHHHhhh--HHHHHHHHHHHHHHHHHHHhhccCCCCCCCCHHHHHHHhccCCCCCCccc Confidence 333 3666665 7899999999876543 36778888888888887764 221 111110000 000001 Q ss_pred cccceeeeeeeeeeccccccccceeecCCCCCcceeeecccCcc-------CCCCCchhHHHHH-HHHHHHHHHHHHHHH Q lcl|NC_011356. 68 RDGGMESGVHIRGVNPDTGNSDNTMKADNPRNAFYWRFVEMGTV-------NMPPHPFVRPAFD-VRSEQAAQVAIARMN 139 (148) Q Consensus 68 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~-------~~~a~PFl~pA~~-~~~~~~~~~~~~~~~ 139 (148) ..|.+..++... .+.. .+..+++..|+...+||+. .+|++|||.=.-+ ...+++.+.|.+.+. T Consensus 78 ~tG~L~~Si~~~-----~~~~----~v~vGt~~~YA~iHqfGg~~~~~~~v~iPaRpfLG~s~~~~l~~~~~~~I~~~i~ 148 (155) T protein:vir:79 78 VTNALARSVTTW-----ADRN----EAGIGSNLVYAAIHQFGGDAGRGHQVEIPARRYLPFDENGQLAAGARQSILEVVL 148 (155) T ss_pred cchhhhhhhhce-----ecCC----EEEEecCchhhhhhhcccccCCCCccccCCccccCCCCccccchHHHHHHHHHHH Confidence 112222222111 1111 1112466789999999975 7999999963322 222333333333333 Q ss_pred HHHHHHh Q lcl|NC_011356. 140 RAIDEVL 146 (148) Q Consensus 140 ~~i~k~~ 146 (148) +.|++-- T Consensus 149 ~~l~r~r 155 (155) T protein:vir:79 149 TALSRNR 155 (155) T ss_pred HHHHhcC Confidence 3333332 No 119 >protein:vir:107851 Length: 175 # NCBI annotation: gp31 # Family: family:all:274 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024704;genbank:gi:48696941;genbank:GeneID:2845939 Probab=98.15 E-value=1.9e-08 Score=62.87 Aligned_cols=136 Identities=16% Similarity=0.164 Sum_probs=76.0 Q ss_pred Ccc-ceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CCCCcchhhhhceecc---------- Q lcl|NC_011356. 1 MIE-TLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSR-----APVRRGKLRRNVVVLS---------- 64 (148) Q Consensus 1 Mm~-~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~-----aP~~~g~l~~~i~~~~---------- 64 (148) |.. ++++|.. ++|.+.|++|.....+ .+..+..-++.++.....+ .|..+.--...+..+. T Consensus 1 Ms~~i~i~~~~-~~l~~~L~~l~~~~~d--~~~l~~~Ig~~l~~~t~~rF~~e~~Pdw~p~~p~t~~~r~~~g~~~~k~~ 77 (175) T protein:vir:10 1 MSDFVNFQIDD-SALRTRLLQLEQAGHQ--KAGAMRKIAQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKNG 77 (175) T ss_pred CceeEEEEecH-HHHHHHHHHHHHHhcc--HHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchhhhhhhhcccccchhhh Confidence 444 3666664 7899999998766543 2566777777777777664 3432211000000000 Q ss_pred --------------ccccccceeeeeeeeeeccccccccceeecCCCCCcceeeecccCcc-------CCCCCchhHHHH Q lcl|NC_011356. 65 --------------RCSRDGGMESGVHIRGVNPDTGNSDNTMKADNPRNAFYWRFVEMGTV-------NMPPHPFVRPAF 123 (148) Q Consensus 65 --------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~-------~~~a~PFl~pA~ 123 (148) .....|.+..++... .+. .....+++..|+....||+. ++||+|||.=.- T Consensus 78 ~~~~~~~~~~~~~~~L~~tG~L~~Si~~~-----~~~----~~v~vGtn~~YAaiHqfGg~~~~~~~v~iPaRpfLG~s~ 148 (175) T protein:vir:10 78 ELTAAASRRKAGLMILQDSGQMAASVSTD-----HDD----NSAVIGSNKEYAAIHQFGGQAGRGLKVTIPARPWLPVTA 148 (175) T ss_pred hhhhhhhhhccCCCcceechhhhhhhhee-----ecC----CEEEEecChhhhhhhhcccccCCCCccccCCccccCCCc Confidence 000011111111110 000 11122456689999999987 899999998533 Q ss_pred H-HHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011356. 124 D-VRSEQAAQVAIARMNRAIDEVLRR 148 (148) Q Consensus 124 ~-~~~~~~~~~~~~~~~~~i~k~~kk 148 (148) + +...+.++.|.+.+.+.|.+++++ T Consensus 149 ~d~~~~e~~~~Il~~~~~~l~~~~~~ 174 (175) T protein:vir:10 149 DGELQPEAVEPVLNTILRHLMDAANR 174 (175) T ss_pred ccccchHHHHHHHHHHHHHHHHHhcc Confidence 2 223456677788888888888888 No 120 >protein:vir:1087 Length: 161 # NCBI annotation: Orf46 # Family: family:all:1029 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076741;genbank:gi:13095851;genbank:GeneID:920400 Probab=98.13 E-value=2.7e-08 Score=62.05 Aligned_cols=130 Identities=17% Similarity=0.156 Sum_probs=81.1 Q ss_pred Cccce-eeehhHHHHHHHHHHhHHH--HHHHHHHHHHHHHHHHHHHHHHHhCCCC------c---chhhhhceecccccc Q lcl|NC_011356. 1 MIETL-LDFSGLEDISRDLQLLSGA--ENNRVLREATRAGANVLKEEVVSRAPVR------R---GKLRRNVVVLSRCSR 68 (148) Q Consensus 1 Mm~~~-~~i~Gl~el~~~l~~l~~~--~~~~~~~~al~~~a~~i~~~ak~~aP~~------~---g~l~~~i~~~~~~~~ 68 (148) ||.-+ +==..|+++++++++|..+ +.++ .+...+||+++++.....+|.. + |+|++||..... +- T Consensus 1 ~~~~~~~fdd~L~~~~~~v~klv~~lt~e~k--akIT~AGAkv~a~~L~~~T~~kHy~~~kt~k~~HLADsI~~~~~-ni 77 (161) T protein:vir:10 1 MMEEKQLFEDIMNGIIFQAESVSTSLTVEDK--AKITKAGANAFAIGLEKVTKDKHYRIRKTGENPHLADSILVQNT-NI 77 (161) T ss_pred CcchhHHHHHHHHHHHHHHHhhcCCCCHHHH--HHHHHHhHHHHHHHHHHHhhhhcCcCCCCCCcchhhhheeeccc-cc Confidence 88753 3234577777788776643 2333 4678899999999999888763 2 366666654421 21 Q ss_pred ccceeeeeeeeeeccccccccceeecCCCCCcceeeecc-----------------cCccCCCCCchhHHHHH--HHHHH Q lcl|NC_011356. 69 DGGMESGVHIRGVNPDTGNSDNTMKADNPRNAFYWRFVE-----------------MGTVNMPPHPFVRPAFD--VRSEQ 129 (148) Q Consensus 69 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E-----------------~GT~~~~a~PFl~pA~~--~~~~~ 129 (148) .| .......+++.+..++.+||++ .||.+|++-||+..+-+ +.++. T Consensus 78 Dg---------------~~dG~StVGw~~kka~ia~~indGtr~~~~~~~~~~~~n~Gt~~i~gDHFvd~~r~~~~~k~a 142 (161) T protein:vir:10 78 DG---------------IKDGNSTVGWDYTKSRVGHLIENGTRFPMYSKKGTKYRKGGQVAITSDPFVSTYRDSMEAQVA 142 (161) T ss_pred Cc---------------ccCCceeccccCchhhhhhhhcccchhhhhhcccccccCCcceeecCcchhHHHHhhhhhHHH Confidence 11 1112222333333345555554 45577999999999999 57788 Q ss_pred HHHHHHHHHHHHHHHHhcC Q lcl|NC_011356. 130 AAQVAIARMNRAIDEVLRR 148 (148) Q Consensus 130 ~~~~~~~~~~~~i~k~~kk 148 (148) |+++..+++++-|++.-.- T Consensus 143 V~~Ae~~~y~eil~~k~~~ 161 (161) T protein:vir:10 143 MFSAEAEVFSEILKKKGAE 161 (161) T ss_pred HHHHHHHHHHHHHHhhcCC Confidence 8888877777666543333 No 121 >protein:vir:95157 Length: 144 # NCBI annotation: hypothetical protein ORF019 # Family: family:all:448 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293426;genbank:gi:148912847;genbank:GeneID:5228232 Probab=98.10 E-value=2.8e-08 Score=61.96 Aligned_cols=130 Identities=15% Similarity=0.132 Sum_probs=73.2 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeee-- Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHI-- 78 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~-- 78 (148) |.+-.+ ++-..++...+.+ +..+...+++.+..+.......+|++||.+|.|-.++......+.....+.- T Consensus 1 MA~~~~------~f~~~i~~~~~~v-e~~~~~~~r~~a~~v~~~vv~~sPVDTGrfRanw~vs~~~p~~~~~~~~~~~~~ 73 (144) T protein:vir:95 1 MAKSLL------DLADRLEKKAKAI-DEAASQNAVDTALAIVGDLAYKTPVDTSQALSNWIVTLESPSGQQIKPHFPGSQ 73 (144) T ss_pred Cchhhh------hhhhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhCCccchhhccccceeccccccccccccccccc Confidence 554222 2333444444444 3455778888888888999999999999999997765443221111110000 Q ss_pred eeecccccc------------ccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 79 RGVNPDTGN------------SDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAID 143 (148) Q Consensus 79 ~~~~~~~~~------------~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~ 143 (148) .......+. .......+-..+..|+..+|||+|.|+|..|++-++.+...-+.+ ++- ++ T Consensus 74 ~~t~d~sg~~tl~~~~~vi~~~~~g~~iyi~NnlpYA~~LEyG~S~QAP~G~vr~~~q~~~~~v~~-~~~-----~~ 144 (144) T protein:vir:95 74 GSTQRASAAETLNSAKLVLRNKKPGQAIFITNNLPYIRRLNDGYSAQAPAGFVERAVLIGRKMRKK-FKI-----KD 144 (144) T ss_pred cccCCCchhHHHHHHHHHHhhcCccceEEEeeCchhhhhhhccccCCCcchHHHHHHHHHHHHHHh-hcc-----CC Confidence 000000000 000112233456789999999999999999999999755442222 110 01 No 122 >protein:vir:99196 Length: 155 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950453;genbank:gi:119953654;genbank:GeneID:4643056 Probab=98.03 E-value=7e-08 Score=59.80 Aligned_cols=132 Identities=14% Similarity=0.099 Sum_probs=66.2 Q ss_pred Ccc-ceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhC-CCCc-------chhhhhcee----ccccc Q lcl|NC_011356. 1 MIE-TLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRA-PVRR-------GKLRRNVVV----LSRCS 67 (148) Q Consensus 1 Mm~-~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~a-P~~~-------g~l~~~i~~----~~~~~ 67 (148) |.. ++|++.. ++|.+.|++|.....+ .+..++.-++.+....+.+- |..+ ..++..... ..-.. T Consensus 1 Ms~~i~i~~d~-~~~~~~L~~l~~~~~d--~~~l~~~ig~~l~~~~~~rF~pdG~~W~pls~~t~~~r~~~g~~~~~iL~ 77 (155) T protein:vir:99 1 MTTRIDVELDD-QEVRQRLALLMRSVTD--TLPVMRGIAAELLAETEFAFMDEGPGWPQLSPVTVAAREAKGRGPHPILQ 77 (155) T ss_pred CceEEEEEech-HHHHHHHHHHHHHhhh--HHHHHHHHHHHHHHHHHHHhhccCCCCCCCChHHHHHHhccCCCCCCcch Confidence 444 3666665 7899999999876543 36778888888888777764 3211 111100000 00011 Q ss_pred cccceeeeeeeeeeccccccccceeecCCCCCcceeeecccCcc-------CCCCCchhHHHHH-HHHHHHHHHHHHHHH Q lcl|NC_011356. 68 RDGGMESGVHIRGVNPDTGNSDNTMKADNPRNAFYWRFVEMGTV-------NMPPHPFVRPAFD-VRSEQAAQVAIARMN 139 (148) Q Consensus 68 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~-------~~~a~PFl~pA~~-~~~~~~~~~~~~~~~ 139 (148) ..|.+..++... .+.. .+..+++..|+...+||+. .+|++|||.=.-+ .-.++..+ +|. T Consensus 78 ~tg~L~~Si~~~-----~~~~----~v~vGtn~~YA~iHqfGg~~~~~~~v~iPaRpfLG~s~~~~l~~e~~~----~I~ 144 (155) T protein:vir:99 78 VTNALARSVTTW-----ADRN----EAGIGSNLVYAAIHQFGGDAGRGHQVEIPARRYLPFDENGQLAAGARQ----SIL 144 (155) T ss_pred hchhhhhhhhce-----ecCC----EEEEecCccchhhhhcccccCCCCccccCCccccCCCCccccchHHHH----HHH Confidence 112222222111 1111 1112456789999999975 7999999963221 11122222 333 Q ss_pred HHHHHHhcC Q lcl|NC_011356. 140 RAIDEVLRR 148 (148) Q Consensus 140 ~~i~k~~kk 148 (148) ..|.+-++| T Consensus 145 ~~i~~~l~~ 153 (155) T protein:vir:99 145 EIVLTALSR 153 (155) T ss_pred HHHHHHHhc Confidence 333333334 No 123 >protein:vir:77650 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:YP_022741;genbank:gi:47835022;genbank:GeneID:2821447 Probab=98.03 E-value=1.3e-08 Score=63.77 Aligned_cols=103 Identities=13% Similarity=0.064 Sum_probs=51.7 Q ss_pred ceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeeeecc Q lcl|NC_011356. 4 TLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRGVNP 83 (148) Q Consensus 4 ~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~~~~ 83 (148) |++.-.||+.+.+.|+... + +-..|.+++. .+ ..+. ..+ ...... T Consensus 1 m~~~r~~l~~~~~~l~~~~--v--------------------~VGi~~~a~y-~d---------~~~~--~~~-~~~~~~ 45 (155) T protein:vir:77 1 MSVTRRGLTLPKDRYRSMS--V--------------------KAGVLAGATY-PD---------ESGK--KLA-DGSILK 45 (155) T ss_pred CcchHHHHHHHHHHHhcCc--e--------------------EEeecCCCCC-cc---------ccch--hhh-hhhhcc Confidence 5555556666655544310 0 1111111110 00 0000 000 000000 Q ss_pred ccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHH--HHHhcC Q lcl|NC_011356. 84 DTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAI--DEVLRR 148 (148) Q Consensus 84 ~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i--~k~~kk 148 (148) . ....+.+.+.++.++||||.+.||||||||++++++++..+.+...++..+ +++|.. T Consensus 46 ~-------~~~~G~pva~ia~~~e~G~~~IP~RPFlr~t~~~~~~~~~~~l~~~~~~~~~~~~~L~~ 105 (155) T protein:vir:77 46 K-------DPRAGLPVAMIAMALNYGTSKLPARPFMEKTIADRSAEWIKGLTVMMTMGYDAEVAMGQ 105 (155) T ss_pred c-------cccccccHhhhhhhhhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHccCcHHHHHHH Confidence 0 001123456788999999999999999999999999988877766554332 122222 No 124 >protein:vir:96774 Length: 152 # NCBI annotation: hypothetical phage protein # Family: family:all:448 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224253;genbank:gi:62362388;genbank:GeneID:3345713 Probab=98.01 E-value=2.3e-08 Score=62.47 Aligned_cols=126 Identities=15% Similarity=0.042 Sum_probs=71.0 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--------------Ccchhhhhceecccc Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPV--------------RRGKLRRNVVVLSRC 66 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~--------------~~g~l~~~i~~~~~~ 66 (148) =|++..+| +.+..+.+. -+...++..+..+...+...+|+ +||.+|.|-.++... T Consensus 10 ~msFaa~i---~~~~~~~e~--------~~~~~~R~~~~~i~~~vv~~sPVg~~~~~~~~a~~~ydtGrfRanw~vS~~~ 78 (152) T protein:vir:96 10 PMSWSKSL---KNIIVKNEN--------LTEKQLRAGLFDAANTVILGSPVGAPELWQQPAPNYYRAGSYRSNHRVSISK 78 (152) T ss_pred cccccccH---HHHHHHHHH--------HHHHHHHHHHHHHHHHHHHhhccccccccccccccccchhhhhhhheeeecC Confidence 23333332 233333332 23456666677777777788899 999999998776444 Q ss_pred ccccceeeeeeeeeeccccc---cccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 67 SRDGGMESGVHIRGVNPDTG---NSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRA 141 (148) Q Consensus 67 ~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~ 141 (148) -..+...+...........+ ........+-..+..|+..+|||+|.|+|..|++.++..-.+ .+.++++.+ T Consensus 79 p~~~~~~~~~~~~t~~~~~~~i~~~~~g~~iyi~NnlPYA~~LEyG~S~QAP~G~vr~t~~~~~~----~v~ea~~~~ 152 (152) T protein:vir:96 79 ITSFEKGISSQSSIMMDLQSDIAKFKIGETLFMTNPLPYATSIEYGHSSQAPNGVYRPAVRRLVK----FLNTELKAK 152 (152) T ss_pred CCcccccCCCCCchHHHHHHHHhhccccceEEEeeCchhhhHhhccccCCCCchHHHHHHHHHHH----HHHHHhccC Confidence 33322211111110000000 000011223345678999999999999999999999986665 444444444 No 125 >protein:vir:3994 Length: 168 # NCBI annotation: unknown # Family: family:all:1029 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116502;genbank:gi:14251135;genbank:GeneID:921309 Probab=97.94 E-value=1.1e-07 Score=58.75 Aligned_cols=136 Identities=14% Similarity=0.204 Sum_probs=80.3 Q ss_pred CccceeeehhHHHHHHHHHHhHHHH--HHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAE--NNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHI 78 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~--~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~ 78 (148) |-+|.- .|+.++.+++.|..+. .++ .+...+||+++++.....+|......+ .....+++.++|.. T Consensus 1 M~~~~d---~l~~~~~~v~kl~~~lt~e~k--akIT~AGAkv~a~~L~~~T~~kHy~~r-------ktg~~~HLADsI~~ 68 (168) T protein:vir:39 1 MVSFYD---AMQLIINQAESLSTKMTVEDK--AEVTKAGAKVFEQALAYEVRNRHYRHR-------DTGEDPHLADSIVM 68 (168) T ss_pred CccHHH---HHHHHHHHHHhccCCCCHHHH--HHHHHHhHHHHHHHHHHHhHHhcccCC-------CCCCCccchhheee Confidence 666644 4566777777766432 333 367888999999888887775321111 11122333333333 Q ss_pred eeeccccccccceeecCCC-------CCcceeeecccCcc------------------CCCCCchhHHHHHH--HHHHHH Q lcl|NC_011356. 79 RGVNPDTGNSDNTMKADNP-------RNAFYWRFVEMGTV------------------NMPPHPFVRPAFDV--RSEQAA 131 (148) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~-------~~~~y~~~~E~GT~------------------~~~a~PFl~pA~~~--~~~~~~ 131 (148) ..............+++.+ ..++.++|++-||. +|++-||+..+-+. .++.|+ T Consensus 69 ~~~niDg~~dG~StVGw~~k~~~~~~~~a~iAr~lNDGTrf~~~~~~~~~~y~~~g~v~i~gDHFvd~~r~~~a~k~aV~ 148 (168) T protein:vir:39 69 KNKNIDGVKDGQSVVGWERSTEKGTHTKGYIANIINNGSRFPQFTTRSGRKYKNPGEVAVHADHFIEETRKNPIVQQGIL 148 (168) T ss_pred cccccCcccCCceeccccCccccccccchhheehhccccccchhhhhcccccccccceeecccchhHHHhhhhhhhHHHH Confidence 3322221122222233332 36778999999984 68999999999995 478888 Q ss_pred HHHHHHHHHHHHHHhcC Q lcl|NC_011356. 132 QVAIARMNRAIDEVLRR 148 (148) Q Consensus 132 ~~~~~~~~~~i~k~~kk 148 (148) ++..+++++-|++.-.- T Consensus 149 ~Ae~e~~~eil~~k~~~ 165 (168) T protein:vir:39 149 KAEAEAMRKIINRKKKE 165 (168) T ss_pred HHHHHHHHHHHHhcCCC Confidence 87777776655543332 No 126 >protein:vir:97190 Length: 148 # NCBI annotation: hypothetical protein ORF030 # Family: family:all:448 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294538;genbank:gi:149408259;genbank:GeneID:5237055 Probab=97.91 E-value=7.4e-08 Score=59.68 Aligned_cols=128 Identities=16% Similarity=0.096 Sum_probs=69.2 Q ss_pred Cccc-eeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeee Q lcl|NC_011356. 1 MIET-LLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIR 79 (148) Q Consensus 1 Mm~~-~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~ 79 (148) |.++ +|.-. ++++.++++. .+...++..+..+...+....|++||.+|.|-.++...-..+...... .. T Consensus 1 m~~~~sFa~~-i~~~~~~ve~--------~~~~~~r~~a~~i~~~vv~~sPVdTGrfRanw~vs~~~p~~~~~~~~d-p~ 70 (148) T protein:vir:97 1 MPSLSEFSRR-ITLRGRKVAE--------GADALTRKVALAADQAVVSGTPVDTGRARSNWIAAIGSAPSSVIDAYS-PG 70 (148) T ss_pred CCccchhccc-HHHHHHHHHH--------HHHHHHHHHHHHHHHHHHHhCCCcchhhhhhhheeecccccccccccC-CC Confidence 5554 34433 3344333332 234556666677777778899999999999987654332222111100 00 Q ss_pred eecccc---c------------cccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 80 GVNPDT---G------------NSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDE 144 (148) Q Consensus 80 ~~~~~~---~------------~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~k 144 (148) ..+... + ........+-..+..|+..+|||+|.|+|..|++-++..-..-+ ++ .+ T Consensus 71 ~~G~~~~~~~~~~i~~~~~vi~~~k~g~~iyi~NnlpYA~~LEyG~S~QAP~G~v~~t~~~~~~~v--------~~--~~ 140 (148) T protein:vir:97 71 EAGSTEAANTQAAIDQAESVIRGYNYGEEIHITNNLPYIQRLNDGYSAQAPANFVEQAVLEAVQVV--------QF--GR 140 (148) T ss_pred CCCcccccchhHHHHHHHHHhhccCCCceEEEeecchhhhHhhccccCCCcchHHHHHHHHHHHHH--------Hh--hh Confidence 000000 0 00000122334567899999999999999999999997444322 11 12 Q ss_pred HhcC Q lcl|NC_011356. 145 VLRR 148 (148) Q Consensus 145 ~~kk 148 (148) +++- T Consensus 141 ~~~~ 144 (148) T protein:vir:97 141 VVDG 144 (148) T ss_pred hhcC Confidence 2222 No 127 >protein:vir:4096 Length: 140 # NCBI annotation: Gp9 protein # Family: family:all:28682 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510990;swissprot:trembl:q8w600;genbank:gi:17488512;uniprot:Q8W600;genbank:GeneID:1260318 Probab=97.84 E-value=2.6e-07 Score=56.72 Aligned_cols=128 Identities=14% Similarity=0.234 Sum_probs=86.3 Q ss_pred Ccc-ceeeehhHHHHHHHHHHhHHHHHHHHHHHHHH-HHHHHHHHHHHHhCCCCcc---hhhhhceeccccc----cccc Q lcl|NC_011356. 1 MIE-TLLDFSGLEDISRDLQLLSGAENNRVLREATR-AGANVLKEEVVSRAPVRRG---KLRRNVVVLSRCS----RDGG 71 (148) Q Consensus 1 Mm~-~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~-~~a~~i~~~ak~~aP~~~g---~l~~~i~~~~~~~----~~~~ 71 (148) |.. .++++.++++|.++++++|.+. ++++.++|. +++..+.+.+....|++.+ .+|+..- ...++ ...+ T Consensus 1 m~~~~sld~s~~e~L~~~i~r~P~ks-E~~IN~~L~tkg~~~~~~~I~~~iPvS~~~k~~~RnK~H-AK~s~pl~~~~~N 78 (140) T protein:vir:40 1 MCAKWSLEFSDVERLSNLISQIPNKS-EAIINKTLETKAVPLVKLNIEKRINLSKNWKGQLLNKNH-AQSSGPFNVKMGN 78 (140) T ss_pred CCcceecchhhHHHHHHHHHhccchH-HHHHHHHHHhhhhHHHHhhhhhccCcCccchhhhccccc-hhhhhhhhhhhhh Confidence 554 6999999999999999999874 688888884 5888889999999999843 2222110 00000 0111 Q ss_pred eeeeeeeeeeccccccccceeecCCCCCcceeeecc--cCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011356. 72 MESGVHIRGVNPDTGNSDNTMKADNPRNAFYWRFVE--MGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEVLRR 148 (148) Q Consensus 72 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E--~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~k~~kk 148 (148) +.-.+ ......-|-.|.. .||++-.||.||+..++...+.+++.+.+++.++|.++|-= T Consensus 79 Lgf~i------------------~~k~kf~YLvfPD~G~G~sn~~~q~FmerGl~~~t~~i~E~L~~~l~k~in~~Lgg 139 (140) T protein:vir:40 79 LGFEL------------------LTKPKFNYLIFPDQGIGKHNKTKQDFMQLGVEESSQEIVEMLEQAVFKEINDTLGG 139 (140) T ss_pred cceeE------------------eecCcccccccccccCCCCCcchHHHHHhccccchhHHHHHHHHHHHHHHHHhhcC Confidence 10000 0123445777765 47888888999999999988877776666666666666654 No 128 >protein:vir:78607 Length: 155 # NCBI annotation: BcepNY3gp06 # Family: family:all:503 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294843;genbank:gi:149882906;genbank:GeneID:5291078 Probab=97.84 E-value=5.2e-09 Score=66.01 Aligned_cols=103 Identities=12% Similarity=0.073 Sum_probs=48.1 Q ss_pred ceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeeeecc Q lcl|NC_011356. 4 TLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRGVNP 83 (148) Q Consensus 4 ~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~~~~ 83 (148) |++.-+||+.+.+.|+.. ++ +-..|.+.+. ..+.... ........ T Consensus 1 m~v~~k~L~~~~~~l~~~--~v--------------------~VGi~~~a~y------------~d~~~~~-~~~~~~~~ 45 (155) T protein:vir:78 1 MSVTRRGLTLPKDRYRSM--SV--------------------KAGVLAGATY------------PDESGKK-LADGTILT 45 (155) T ss_pred CcchHHHHHHHHHHHhCC--ee--------------------EEeecCCCCC------------Ccccchh-hhhhhhcc Confidence 555555555544333211 00 0001111100 0000000 00000000 Q ss_pred ccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHH--HHHhcC Q lcl|NC_011356. 84 DTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAI--DEVLRR 148 (148) Q Consensus 84 ~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i--~k~~kk 148 (148) .. ...+.+.+.++.+.||||.+.||||||||++++++++..+.+...+...+ +++|.. T Consensus 46 ~~-------~~~g~~va~ia~~~E~G~~~IP~RPFlr~t~~~~~~~~~~~l~~~~~~~~~~~~~L~~ 105 (155) T protein:vir:78 46 KD-------PRAGLPVAMIAMALNYGTSKLPARPFMEKTITDRSAEWIKGLTVMMTMGYDAEVAMGQ 105 (155) T ss_pred cc-------cccCCcHHHHHHhhhcCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHcCCCHHHHHHH Confidence 00 01123456788899999999999999999999999988776655543321 111111 No 129 >protein:vir:106728 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944315;genbank:gi:38638614;genbank:GeneID:2657357 Probab=97.83 E-value=5.3e-09 Score=65.94 Aligned_cols=103 Identities=12% Similarity=0.064 Sum_probs=48.4 Q ss_pred ceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeeeecc Q lcl|NC_011356. 4 TLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRGVNP 83 (148) Q Consensus 4 ~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~~~~ 83 (148) |++.-+||+.+.+.|+.. ++ +-..|.+.+. ..+.. .......... T Consensus 1 m~v~~k~L~~~~~~l~~~--~v--------------------~VGi~~~a~y------------~d~~~-~~~~~~~~~~ 45 (155) T protein:vir:10 1 MSVTRRGLTLPKDRYRSM--SV--------------------KAGVLAGATY------------PDESG-KKLADGTILT 45 (155) T ss_pred CcchHHHHHHHHHHHhCC--ee--------------------EEeecCCCCC------------ccccc-hhhhhhhhcc Confidence 555555555444333211 00 0001111100 00000 0000000000 Q ss_pred ccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHH--HHHhcC Q lcl|NC_011356. 84 DTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAI--DEVLRR 148 (148) Q Consensus 84 ~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i--~k~~kk 148 (148) .. ...+.+.+.++.+.||||.+.||||||||++++++++..+.+...++..+ +++|.. T Consensus 46 ~~-------~~~g~~va~ia~~~E~G~~~IP~RPFlr~t~~~~~~~~~~~l~~~~~~~~~~~~~L~~ 105 (155) T protein:vir:10 46 KD-------PRAGLPVAMIAMALNYGTSKLPARPFMEKTIADRSAEWIKGLTVMMTMGYDAEVAMGQ 105 (155) T ss_pred cc-------cccCCcHHHHHHHHhcCCCCCCCcchhHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHH Confidence 00 01123456788899999999999999999999999988776665544322 111111 No 130 >protein:vir:80425 Length: 134 # NCBI annotation: BcepGomrgp15 # Family: family:all:448 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210235;genbank:gi:146329927;genbank:GeneID:5123534 Probab=97.82 E-value=4.4e-08 Score=60.94 Aligned_cols=125 Identities=13% Similarity=0.107 Sum_probs=68.1 Q ss_pred ccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeeee Q lcl|NC_011356. 2 IETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRGV 81 (148) Q Consensus 2 m~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~~ 81 (148) |++..+ ++++..+++.- +...++..+..+..++...+|++||.+|.|-.++...-..+........... T Consensus 1 msF~~~---i~~~~~~ve~~--------~~~~~r~~a~~~~~~vv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~g~~ 69 (134) T protein:vir:80 1 MSYTDR---FNVIAKGIEDN--------VDNLVKNVALAIGSNVIADTPILTGQARRNWQTELNQMPESVLDIPESPSEG 69 (134) T ss_pred CCcccC---HHHHHHHHHHH--------HHHHHHHHHHHHHHHHHHhCCCcchhhhcccceeecCcccccccCcCCCCcc Confidence 555554 33555444432 2345566667777777779999999999998665443222211111000000 Q ss_pred ---------ccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011356. 82 ---------NPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEVLRR 148 (148) Q Consensus 82 ---------~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~k~~kk 148 (148) ..-.+.+. ....+-..+..|+.++|||+|.|+|..|++-+..+-.. +++.++ ++-| T Consensus 70 ~~~~~~~~~~vi~~~k~-g~~iyi~Nn~pYA~~LEyG~S~QAP~G~v~~t~~~~~~-~v~~~~---------~~~~ 134 (134) T protein:vir:80 70 MDEALQVLQQTVGQYKA-GDTVHITNNAPYIKELNSGSSQQAPANFVETSIMRATR-LIRNVK---------VVPQ 134 (134) T ss_pred chhhHHHHHHHHhhccC-cceEEEeeCchhhhhhhccccCCCcchHHHHHHHHHHH-HHHhhc---------cCCC Confidence 00000000 01122345678999999999999999999988864433 222111 1111 No 131 >protein:vir:103841 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938236;genbank:gi:38229141;genbank:GeneID:2648156 Probab=97.81 E-value=1e-07 Score=58.87 Aligned_cols=131 Identities=16% Similarity=0.151 Sum_probs=67.6 Q ss_pred cc--ceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhC-CCCcc--hhhhh-ceec--------cccc Q lcl|NC_011356. 2 IE--TLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRA-PVRRG--KLRRN-VVVL--------SRCS 67 (148) Q Consensus 2 m~--~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~a-P~~~g--~l~~~-i~~~--------~~~~ 67 (148) |+ +++++. .++|.+.|++|.....+ ....+...++.+....+.+- |..+. .|..+ +..+ .-.. T Consensus 1 Ms~~i~i~~~-~~~~~~~L~~l~~~~~~--~~~l~~~ig~~l~~~~~~rF~p~G~~W~plsp~t~~~r~k~g~~~~~~L~ 77 (155) T protein:vir:10 1 MANRIELELV-DREVQERLAALYAAVTD--TLPLMRGIAAELLAETEFAFMDEGPGWPQLSPVTVAARAAKGRGAHPILQ 77 (155) T ss_pred CCceEEEEec-hHHHHHHHHHHHHHhhh--HHHHHHHHHHHHHHHHHHHHhhcCCCCCCCCccchHHHHhccCCCCCccc Confidence 55 455554 36789999999765532 35777888888888777664 22110 00000 0000 0011 Q ss_pred cccceeeeeeeeeeccccccccceeecCCCCCcceeeecccCcc-------CCCCCchhH-HHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 68 RDGGMESGVHIRGVNPDTGNSDNTMKADNPRNAFYWRFVEMGTV-------NMPPHPFVR-PAFDVRSEQAAQVAIARMN 139 (148) Q Consensus 68 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~-------~~~a~PFl~-pA~~~~~~~~~~~~~~~~~ 139 (148) ..|.+..++... .+. .. +..+++..|+...+||+. .+||+|||. ..-++-.+++.+ .+. T Consensus 78 ~tG~L~~Si~~~-----~~~-~~---v~vGtn~~YA~iHqfGg~~~~~~~~~iPARPfLG~s~~~e~~~ei~~----~I~ 144 (155) T protein:vir:10 78 VTNALARSITTR-----ADR-DQ---AQIGSNLSYAAIQQLGGQAGRGRKVTIPARPYLPVLRNGQLKPSARD----AVL 144 (155) T ss_pred cchhhhhhhhce-----ecC-CE---EEEecCcchhhhhhcccccCCCCccccCCccccCCCccccchHHHHH----HHH Confidence 112222222111 011 11 112456789999999974 699999997 333333444444 444 Q ss_pred HHHHHHhcC Q lcl|NC_011356. 140 RAIDEVLRR 148 (148) Q Consensus 140 ~~i~k~~kk 148 (148) +.|.+.+++ T Consensus 145 ~~i~~~l~~ 153 (155) T protein:vir:10 145 DVLLAALSQ 153 (155) T ss_pred HHHHHHHhh Confidence 455555555 No 132 >protein:vir:107757 Length: 189 # NCBI annotation: gp20 # Family: family:all:503 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024868;genbank:gi:48697510;genbank:GeneID:2948378 Probab=97.80 E-value=7.4e-09 Score=65.16 Aligned_cols=92 Identities=14% Similarity=0.200 Sum_probs=48.6 Q ss_pred ccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeeee Q lcl|NC_011356. 2 IETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRGV 81 (148) Q Consensus 2 m~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~~ 81 (148) |+.. |.+-+...++|.+.-+.+..+ .+.+. +.. T Consensus 1 M~~~--i~~~~~~~~~L~~~lk~l~~k------------------------------~V~VG------------i~~--- 33 (189) T protein:vir:10 1 MGRV--IRKQGPARVKLNAFIKGMNDY------------------------------SVRIG------------WFS--- 33 (189) T ss_pred Ccce--eccCcHHHHHHHHHHHHhhCC------------------------------eEEEE------------ecC--- Confidence 4444 343333333333211110000 00110 000 Q ss_pred ccccccccceeecCCCCCcceeeecccCc--cCCCCCchhHHHHHHHHHHHHHHHHHHHHHH------HHHHhcC Q lcl|NC_011356. 82 NPDTGNSDNTMKADNPRNAFYWRFVEMGT--VNMPPHPFVRPAFDVRSEQAAQVAIARMNRA------IDEVLRR 148 (148) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~y~~~~E~GT--~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~------i~k~~kk 148 (148) .... ..+.+.+.++.+.|||+ .+.||||||+|++++++++..+.+...+... .+++|.. T Consensus 34 --~~~y------~dG~~vA~Ia~~~E~G~p~~~IP~RPFlr~t~~~~~~~~~~~l~~~~~~vl~G~~~~~~~L~~ 100 (189) T protein:vir:10 34 --TAKY------PDGTPTAYVASIHEFGAPSRGIPARSFIRPTIAAQQAAWSQQMRFYAKQIVVGQMNVEQALEG 100 (189) T ss_pred --CCCC------CCcccHHHHHHHHHhcCcCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHH Confidence 0000 01124567899999998 4589999999999999998888777776653 2344444 No 133 >protein:vir:96105 Length: 193 # NCBI annotation: hypothetical protein ORF028 # Family: family:all:503 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294445;genbank:gi:149408342;genbank:GeneID:5237224 Probab=97.76 E-value=5.8e-08 Score=60.26 Aligned_cols=130 Identities=15% Similarity=0.104 Sum_probs=60.7 Q ss_pred ccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHH-HHHHHHHhCCCCcchhhhhceecccccccc------ceee Q lcl|NC_011356. 2 IETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANV-LKEEVVSRAPVRRGKLRRNVVVLSRCSRDG------GMES 74 (148) Q Consensus 2 m~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~-i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~------~~~~ 74 (148) |+|+....++++|++.|++|.... + .-++.+.+.- =.+......|+.. |. ..+..| .... T Consensus 1 m~~~~~~~~~~~~~~~l~~l~~~~---v-~vGi~~~~~~~~~~~~~~G~~va~------iA---ai~EfG~~I~~~~~~~ 67 (193) T protein:vir:96 1 MSLRRDSELIAAHLQMLRAMRGRS---V-SAGWYSTARYPDKAGGSVGIQVAR------IA---RLNEYGGTIDHPGGTR 67 (193) T ss_pred CeeccchHHHHHHHHHHHHhcCCe---E-EEEEcCCCCCCCcccccccchHHH------HH---hHHHcCCccccCccce Confidence 999999999999999999986431 1 1122211100 0000000011100 00 000111 1011 Q ss_pred eeeeeeeccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHH------HHhcC Q lcl|NC_011356. 75 GVHIRGVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAID------EVLRR 148 (148) Q Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~------k~~kk 148 (148) .+. .....+.......++. +-..-++|.-..|.++||||||+++++.+++++.+.+.+.+...+. ++|.. T Consensus 68 ~~~-~~~~~g~~~~~~~~k~---~~~~~~~~~~~~~v~IPaRPFlr~t~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~ 143 (193) T protein:vir:96 68 YIR-DAIVRGRFVGVRFVRN---DFPGETEVTKPHRITIPARPFMRYAWNLFSADRAAIQNRIAMRLARGQITPDQALAQ 143 (193) T ss_pred eee-eccccccccccceecc---CcceeeEeecceeccCCCcchhhhhHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHH Confidence 111 1111111111111111 1112345555679999999999999999999877776666554333 22222 No 134 >protein:vir:96288 Length: 100 # NCBI annotation: ORF049 # Family: family:all:180 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240315;genbank:gi:66396010;genbank:GeneID:5133365 Probab=97.69 E-value=1.1e-07 Score=58.70 Aligned_cols=88 Identities=17% Similarity=0.202 Sum_probs=60.8 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) |.+++. |-++|.+.|+..++++.+. +++.+.+.++.|...|..+||+|+|.|++||..... .|++...|.+.. T Consensus 13 makvky---G~~dmvk~~~~f~~~i~~~-vk~~IakTa~~I~~~Avs~APVD~G~Lk~SI~~dyk---~GGltavI~vGA 85 (100) T protein:vir:96 13 MAKVKY---GADSMVVELDKFDKKIEEW-VKKGIAKTTTKIYNTAVALAPVDLGFLEESIDFKYF---DGGLSSVISVGA 85 (100) T ss_pred hhhhee---chHHHHHHHhcchHHHHHH-HHHHHHHHHHHHHhhHHhhccccccccceeeeeeee---cCCeeEEEecch Confidence 888887 9999999999999998755 599999999999999999999999999999987653 344433332221 Q ss_pred eccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHH Q lcl|NC_011356. 81 VNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVA 134 (148) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~ 134 (148) .-..+.-. +-.+..+ T Consensus 86 eYAIkrms---------------------------------------qllvtvi 100 (100) T protein:vir:96 86 DYAIKRMS---------------------------------------QLLVTVI 100 (100) T ss_pred hHHHHHHH---------------------------------------HHHhhcC Confidence 11000000 0000000 No 135 >protein:vir:7449 Length: 123 # NCBI annotation: gp26 # Family: family:all:2713 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818564;genbank:gi:29567001;genbank:GeneID:1260238 Probab=97.61 E-value=2.5e-06 Score=51.30 Aligned_cols=121 Identities=13% Similarity=0.202 Sum_probs=79.3 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--Ccchhhhhceeccccccccceeeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPV--RRGKLRRNVVVLSRCSRDGGMESGVHI 78 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~--~~g~l~~~i~~~~~~~~~~~~~~~~~~ 78 (148) |..|+|++. .++|+++++.+..+.. .++.--...+|..+..+||.+||= .||+=|.+|............... T Consensus 1 ~~~~~f~~d-~~~l~~~i~~~~~k~~-~~~~~~~d~~a~~le~~aK~nApW~DRTg~ARqgl~~~~~~~g~~~~~Iy--- 75 (123) T protein:vir:74 1 MAKVTFEYD-AQELRTNIRNLDRRME-SAVDALMDYEAAYATGQLKMRAPWTDRTGAARSGLLAVANKLGPGSHELI--- 75 (123) T ss_pred CceeEEEec-HHHHHHHHHhhHHHHH-HHHHHHHHHHHHHHHHHHhcCCCCcccchhhhhhhccccccCCCceEEEE--- Confidence 999999998 7899999999876643 343333455888899999999995 577766666332221111111111 Q ss_pred eeeccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_011356. 79 RGVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEVL 146 (148) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~k~~ 146 (148) -.....|.-|+|.++...++ -+.|+++..-+++++-+...+.+ |+++- T Consensus 76 -----------------lsh~veYG~~LEla~~~kya--Ii~Ptv~~~~~~im~g~~~ll~~-l~~~~ 123 (123) T protein:vir:74 76 -----------------MSYSVHYGIWLEIANSGQYA--VIGPFLPVMGRKLMHDLEHLIDR-LERAQ 123 (123) T ss_pred -----------------EecCeeecceeeecCCCCce--eecchHHHHhHHHHHHHHHHHHH-hhccC Confidence 12344799999998865443 68899998888887766654443 33332 No 136 >protein:vir:105773 Length: 131 # NCBI annotation: gp14 # Family: family:all:10996 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224152;genbank:gi:62362227;genbank:GeneID:3342526 Probab=97.46 E-value=1.1e-06 Score=53.35 Aligned_cols=114 Identities=14% Similarity=0.112 Sum_probs=75.4 Q ss_pred eeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeeeecccc Q lcl|NC_011356. 6 LDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRGVNPDT 85 (148) Q Consensus 6 ~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 85 (148) |+++|+.+....|+++-++++.+-+-.||..+.-++...|--.+|+||..|-+|=-.....+ | . T Consensus 1 ikV~Gi~~~~~nl~~~i~~I~~~K~~Ral~~al~~~~~~AA~~TPIDTSTLiNSQfrei~~n--g-t------------- 64 (131) T protein:vir:10 1 MPVKGIKRIQMNTRRVLSDIAGIRTEKVLYLVMNAGANHAAVITPVKSSTLINSQYKKLEPI--P-S------------- 64 (131) T ss_pred CCcchHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHhhhhhccccchhhhccccceeeecc--C-c------------- Confidence 89999999999999999988764456888888889999999999999999987721110000 0 0 Q ss_pred ccccceeecCCCCCcceeeeccc--CccCCCCCc--------------hhHHHHHHH-HHHHHHHHHHHHHH Q lcl|NC_011356. 86 GNSDNTMKADNPRNAFYWRFVEM--GTVNMPPHP--------------FVRPAFDVR-SEQAAQVAIARMNR 140 (148) Q Consensus 86 ~~~~~~~~~~~~~~~~y~~~~E~--GT~~~~a~P--------------Fl~pA~~~~-~~~~~~~~~~~~~~ 140 (148) ...+..+-.+-|+-+|+- |+-+..|+| ||..+|+.+ .+.+...++++++- T Consensus 65 -----ritGRVGYSAnYA~yVHda~Gklkgqprp~gkgn~w~p~ae~eFL~kgfe~~~~d~i~avik~e~k~ 131 (131) T protein:vir:10 65 -----GMIGRVGYTANYAAAVNAAKGKLKGKPRPDGSGNYWDPNGEPDFLRKGFERDGLNEIKAIIRQGYKV 131 (131) T ss_pred -----eeEEeeccceeeeeeeecCccccCCCcCCCCCcceecCCCChhhhhhhhhccchHHHHHHHhhhcCC Confidence 011111223345555543 444444444 999999765 44555556666655 No 137 >protein:vir:2688 Length: 123 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075507;genbank:gi:12719436;genbank:GeneID:920156 Probab=97.37 E-value=3.1e-06 Score=50.75 Aligned_cols=111 Identities=7% Similarity=0.097 Sum_probs=66.5 Q ss_pred HHHHHHH-hHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--Ccchhhhhceeccccccccceeeeeeeeeeccccccccc Q lcl|NC_011356. 14 ISRDLQL-LSGAENNRVLREATRAGANVLKEEVVSRAPV--RRGKLRRNVVVLSRCSRDGGMESGVHIRGVNPDTGNSDN 90 (148) Q Consensus 14 l~~~l~~-l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~--~~g~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 90 (148) |+++|+. |++....++.++||.++++.+.+..|.+.-+ |||..-+.+..+..... .+.... T Consensus 1 ilk~lE~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGatidev~~s~p~~~----------------~g~~~r 64 (123) T protein:vir:26 1 MLKKLESVYGKQSMQAKSDRALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTK----------------VGSQER 64 (123) T ss_pred ChhhHHHhcCHHHHHHhhhHHHHHHHHHHHHHHHHhhHHhhhccceeeeEEecCeeec----------------cCCccc Confidence 6666653 4555556889999999999999999998654 77876666554432211 111111 Q ss_pred eeec---CCCCCcceeeecccCccCC----CCCc--hhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 91 TMKA---DNPRNAFYWRFVEMGTVNM----PPHP--FVRPAFDVRSEQAAQVAIARMNR 140 (148) Q Consensus 91 ~~~~---~~~~~~~y~~~~E~GT~~~----~a~P--Fl~pA~~~~~~~~~~~~~~~~~~ 140 (148) .+.. +..+.+...|..|||..+- .|+- -+..|+++.+..+.+.++++|++ T Consensus 65 tV~i~W~gp~~R~~iVHLNE~GYtr~Gk~i~PRG~G~i~~a~~~se~~y~~~vk~eL~k 123 (123) T protein:vir:26 65 AVLIEWVGPMNRKNIIHLNEHGYTRDGKKYTPRGFGVIAKTLAANERKYREIIKKELAR 123 (123) T ss_pred eEEEEeecCCCceeeEeeeccceecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 2222 2233456689999995332 2332 46666766666555555444444 No 138 >protein:vir:94069 Length: 168 # NCBI annotation: putative RNA polymerase # Family: family:all:503 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453622;genbank:gi:84662658;genbank:GeneID:5142579 Probab=97.29 E-value=2.1e-07 Score=57.14 Aligned_cols=106 Identities=12% Similarity=0.107 Sum_probs=44.7 Q ss_pred ccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeeee Q lcl|NC_011356. 2 IETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRGV 81 (148) Q Consensus 2 m~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~~ 81 (148) |+ ++.=.|++-+...+..|... .++-.... -...|..+. ... .. .. T Consensus 1 ~~-~~~~~g~~~~~~~~~~l~~~----~v~vG~l~---------~a~yp~G~~--~~~----------------~~--~~ 46 (168) T protein:vir:94 1 MT-TIARKGVKMPPHLEAQFQSG----EVKAGVLS---------GSTYPQMTY--TDQ----------------RT--GK 46 (168) T ss_pred Cc-cccchhhhhhHHHHHhhhcc----ceeeeccc---------cCccccccc--chh----------------hc--cc Confidence 21 11122333333322222110 00000000 000011000 000 00 00 Q ss_pred ccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHH--HHHhcC Q lcl|NC_011356. 82 NPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAI--DEVLRR 148 (148) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i--~k~~kk 148 (148) .......+.+.+.++.++||||.+.||||||||++++++++..+.+...++..+ +.+|.. T Consensus 47 -------~~~~~~~g~~va~Ia~~~E~G~~~IP~RPFlr~t~~~~~~~~~~~~~~~~~~~~~~~~~L~~ 108 (168) T protein:vir:94 47 -------QIEDARGGMPVAVIAQALEYGHGQNHPRPFMQQTYAAQYRAWSRDLTLTLKAGAAADTALRT 108 (168) T ss_pred -------ccccccccccHHHHHHHHhcCCCCCCCchhhHHHHHHHHHHHHHHHHHHHhcCCCHHHHHHH Confidence 000001112456789999999999999999999999999887766655443211 111111 No 139 >protein:vir:95260 Length: 160 # NCBI annotation: Phage conserved protein # Family: family:all:31735 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944893;genbank:gi:38707833;genbank:GeneID:2744046 Probab=97.28 E-value=1.3e-06 Score=52.81 Aligned_cols=91 Identities=10% Similarity=0.136 Sum_probs=44.6 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) ||+ ++...|++.|...|++| ++.. +. .++.. T Consensus 1 ~~~-~~~~~G~~~L~~~~k~l-----------------------~~~~-----------V~------------VGi~~-- 31 (160) T protein:vir:95 1 MVK-RVIHPARAKLVGAMKNL-----------------------QTAN-----------AQ------------VGYFQ-- 31 (160) T ss_pred Cce-eechHhHHHHHHHHHHH-----------------------hCCe-----------eE------------Eeecc-- Confidence 443 34445555555555443 0000 11 11100 Q ss_pred eccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHH----HHHHHHHHHHHHHHHHHHH-------HhcC Q lcl|NC_011356. 81 VNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDV----RSEQAAQVAIARMNRAIDE-------VLRR 148 (148) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~----~~~~~~~~~~~~~~~~i~k-------~~kk 148 (148) ..+. ...+.+-+..+.|.||||.+.|++||||++|+. ++...+..+...+...+.. .|-. T Consensus 32 ---d~g~-----~~dG~sv~~vA~~~EfG~~~iPaRPf~R~tfe~~~~~~~~~~~~~~~~~i~~~~~~g~~~~~~~LG~ 102 (160) T protein:vir:95 32 ---EQGQ-----HSSGFSYPALMYLQEVIGVPSASGKVYRRLFEITMMLNKQTLLEQTKKNLYKQLSSLNTDPSNTLEA 102 (160) T ss_pred ---cccc-----CCCCccHHHHHhhhhcCcccCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhHHHHHHH Confidence 0000 011123446788999999999999999999974 4444444444433333331 1111 No 140 >protein:vir:99546 Length: 200 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039796;genbank:gi:126011046;genbank:GeneID:4818241 Probab=97.22 E-value=1.5e-06 Score=52.59 Aligned_cols=137 Identities=10% Similarity=0.072 Sum_probs=54.2 Q ss_pred Cccceeeehh---HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHH-HHHHHHhCCCCcchhhhhceeccccccccceeeee Q lcl|NC_011356. 1 MIETLLDFSG---LEDISRDLQLLSGAENNRVLREATRAGANVL-KEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGV 76 (148) Q Consensus 1 Mm~~~~~i~G---l~el~~~l~~l~~~~~~~~~~~al~~~a~~i-~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~ 76 (148) =|+|++++.| ++++++.|++|... .+.-++.+.+..= .+......|+..-..-+.. ......+.+. ... T Consensus 4 ~~~~~~k~~~~~~~~~~~~~l~~l~~~----~v~vGi~~~~~y~~~~~~~dG~~va~IA~~~Ef--G~~i~~p~~~-~~~ 76 (200) T protein:vir:99 4 GFSKSNSVAAPLKHFQMLKQFDALKGK----TVQAGWFETDRYPAKEGETIGPLVAKIARQLEF--GGVINHPGGT-KYI 76 (200) T ss_pred CcceeeeeecchHHHHHHHHHHHhhCC----eEEEEEcCCCCcCCcccccccchHHHHHhHHHc--CCeeccCCCc-ccc Confidence 2346666776 55555555555321 1111221111000 0000000011000000000 0000011111 000 Q ss_pred eeeeeccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHH------HHhcC Q lcl|NC_011356. 77 HIRGVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAID------EVLRR 148 (148) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~------k~~kk 148 (148) ......+....... ..++..-|+.|.--.|.++||||||+|+++.+++++.+.+...+.+.|+ ++|.+ T Consensus 77 -~~~~~~g~~~g~rf---v~k~~~~~~~~~~~~~v~IP~RPFlr~t~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~ 150 (200) T protein:vir:99 77 -KDAIVDGRYVGTRF---VHKSFQGEHEVTKAHQIVIPARPFMRLAWATFNKDKVKIQAQIARQLLDGTINPEQALAQ 150 (200) T ss_pred -cccccccccccccc---ccccccceeeeeccccccCCCcchhhHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHH Confidence 00011000000011 1123334555555568899999999999999999888877666654332 33333 No 141 >protein:vir:80970 Length: 112 # NCBI annotation: gp10 # Family: family:all:899 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468396;genbank:gi:157324970;genbank:GeneID:5601405 Probab=97.22 E-value=4.3e-06 Score=49.98 Aligned_cols=104 Identities=14% Similarity=0.149 Sum_probs=69.3 Q ss_pred ccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeeee Q lcl|NC_011356. 2 IETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRGV 81 (148) Q Consensus 2 m~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~~ 81 (148) |+++|+|. +..+.++|. ++..+|...-++.|..++..-+|.++|.|++|-... .++. | T Consensus 1 M~vkV~id-~~~~~~~l~--------~a~~~aq~~~~~ev~~~~~~yVP~~tG~L~~s~~~~----~~g~----I----- 58 (112) T protein:vir:80 1 MPIKVRVD-LSKAKGSVK--------KAKERGQFALINQAAADIALYVPFLSGDLSNQYVIM----NDKE----I----- 58 (112) T ss_pred CceeEEee-hHHHHHHHH--------HHHHHHHHHHHHHHHHHhhcCCCcccCccccceeec----cCce----E----- Confidence 99888887 344444332 233456677778888888899999999999873210 0110 0 Q ss_pred ccccccccceeecCCCCCcceeeecccCccC--------CCCCchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 82 NPDTGNSDNTMKADNPRNAFYWRFVEMGTVN--------MPPHPFVRPAFDVRSEQAAQVAIARMNRAI 142 (148) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~--------~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i 142 (148) .-+++|++.+-||-.. .....|+..|.....+++++.+.+.+.+.| T Consensus 59 ---------------~y~tPYAr~qYY~~~~~~~~~~~p~ag~~W~erak~~~~~~~~~~~~k~~~~~l 112 (112) T protein:vir:80 59 ---------------MWTSIYARRLYNGINFNFTLTHHPLAGPKWDQRAKVDKLESWIEVAQKAVEEGL 112 (112) T ss_pred ---------------EecCchhhHhhhcccCCCCcCCCCCcchhhHHHHHhhhhHHHHHHHHHHHhhcC Confidence 1234577777665432 233468888999998988888888888888 No 142 >protein:vir:7993 Length: 108 # NCBI annotation: gp9 # Family: family:all:3937 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817347;genbank:gi:29565775;genbank:GeneID:1259013 Probab=97.15 E-value=3.1e-07 Score=56.23 Aligned_cols=100 Identities=17% Similarity=0.195 Sum_probs=57.3 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) |.+---+-.-|..+--.|..|.+. ..+.+.+.+=.+.+.+.++++.|+++|.+++|..+..+....|. T Consensus 1 ma~gpt~kNP~~KFGvs~~d~~K~---~EVn~GvNeFMdE~~~~~K~~SPV~~G~Y~~S~~V~ers~NkGR--------- 68 (108) T protein:vir:79 1 MANGPTRKNPLAKFGVRLDDFDKL---PEVNQGVNEFMDEVVDAWKNNSPVGTGAYRDSVQVTERSTNKGR--------- 68 (108) T ss_pred CCCCcccccchhhhcCChhhhhhc---hhhhhhHHHHHHHHHHHHhhcCCCCchhhHHHHHHHHhhhccCc--------- Confidence 333211111122222222222221 12345666767788899999999999999999876544332221 Q ss_pred eccccccccceeecCCCCCcceeeecccCccCC----CCC----chhHHHHHH Q lcl|NC_011356. 81 VNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNM----PPH----PFVRPAFDV 125 (148) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~----~a~----PFl~pA~~~ 125 (148) +..+...||+||+||||.+. |+| -|=..||.. T Consensus 69 -------------G~~G~~~~~AH~VEFGs~hndeyapaqktakqfggtay~d 108 (108) T protein:vir:79 69 -------------GKVGATDPQAHLVEFGSAHNDEYAPAQKTAKQFGGTAYGD 108 (108) T ss_pred -------------cccCCcchhhhhhhhhccccccccchhhHHHhhcccccCC Confidence 11235679999999999874 433 244555544 No 143 >protein:vir:98892 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:899 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164422;genbank:gi:56694912;genbank:GeneID:3197282 Probab=97.13 E-value=5.2e-06 Score=49.57 Aligned_cols=103 Identities=17% Similarity=0.077 Sum_probs=62.8 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) ||+++|++.++...+.. +.+.+|...-+..|..+...-+|.++|.|++|-.+... .|. | T Consensus 1 mmkvkv~~~~~~~~~~~----------~~~~~aq~~~~~ev~~~~~~yVP~~~G~L~~s~~~~s~---~g~----I---- 59 (108) T protein:vir:98 1 MPKIRVELSGAKDKLSP----------QTQRRGQYAMANQMLQDMNQFVPMEEGILRLTGNISSD---AEE----I---- 59 (108) T ss_pred CceeEeeehHHHHHHHH----------HHHHHHHHHHHHHHHHhhcccCcCcCCccccceeeccC---Cce----E---- Confidence 99999998875442211 12234555666777788888999999999998543321 110 0 Q ss_pred eccccccccceeecCCCCCcceeeecccCccCC-----CCCchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 81 VNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNM-----PPHPFVRPAFDVRSEQAAQVAIARMNR 140 (148) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~-----~a~PFl~pA~~~~~~~~~~~~~~~~~~ 140 (148) .-+++|++++=||..+. ....|+..|.....+++++.+.+.++= T Consensus 60 ----------------~y~tPYAr~qYYg~~~n~~~p~ag~~W~eraka~~~~~~~~~~~k~~k~ 108 (108) T protein:vir:98 60 ----------------YYNTPYAKRRFYEPAYNYTTPGTGPRWDMKAKRLFISDWERAYMKGANW 108 (108) T ss_pred ----------------EecChhhHHhhhccccCCCCCCCcchhHHHHHhhhhHHHHHHHHHhhcC Confidence 12335777776664433 334577777777776666555444443 No 144 >protein:vir:101508 Length: 120 # NCBI annotation: gp21 # Family: family:all:2713 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655400;genbank:gi:109522588;genbank:GeneID:4157580 Probab=96.93 E-value=3.5e-05 Score=45.04 Aligned_cols=116 Identities=15% Similarity=0.160 Sum_probs=71.0 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--Ccchhhhhceeccccccccceeeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPV--RRGKLRRNVVVLSRCSRDGGMESGVHI 78 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~--~~g~l~~~i~~~~~~~~~~~~~~~~~~ 78 (148) |..|+|++. .++|+++++.+..+.. ..+.--+..+|..+..+||.+||= .||+=|..|............... T Consensus 1 ~~~~~f~~~-~~~l~~~i~~~~~k~~-~~~~~~~d~~a~~le~~aK~nApW~DRTg~ARq~i~~~~~~~~~~~~~Iy--- 75 (120) T protein:vir:10 1 MAKIEFKFK-DIELRRGVEDMEAKVD-RAMKATSNYHAVEGTAHMKEHAPWTDRTGAARAGLHAVASTPQPDRYEIV--- 75 (120) T ss_pred CceEEEEec-HHHHHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHhcCCCCcccchhhhhhhccccccCCCceEEEE--- Confidence 999999999 5899999999876542 344444566788899999999995 477766665432211111111111 Q ss_pred eeeccccccccceeecCCCCCcceeeecc--cCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 79 RGVNPDTGNSDNTMKADNPRNAFYWRFVE--MGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAI 142 (148) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~y~~~~E--~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i 142 (148) -.....|.-|+| -|..++ -++|+++..-+++++-+..-+.+-= T Consensus 76 -----------------lsh~veYG~~LEla~~~kya----Il~PTi~~~~~~il~g~~~ll~~l~ 120 (120) T protein:vir:10 76 -----------------FAHTVHYGIWLEIANSGRYE----IIMPTVHHEGKLMAQRLRGLLGRLR 120 (120) T ss_pred -----------------EecCeeecceEEeeCCCCcc----cccchHHHHhHHHHHHHHHHhhhcC Confidence 113346888999 444333 4666666666666655544433321 No 145 >protein:vir:5703 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839862;genbank:gi:30065717;genbank:GeneID:1260611 Probab=96.92 E-value=7.7e-06 Score=48.63 Aligned_cols=132 Identities=11% Similarity=0.115 Sum_probs=60.6 Q ss_pred ehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CCCCcc--hhh-hhceeccccccccceeeeeeee Q lcl|NC_011356. 8 FSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSR-----APVRRG--KLR-RNVVVLSRCSRDGGMESGVHIR 79 (148) Q Consensus 8 i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~-----aP~~~g--~l~-~~i~~~~~~~~~~~~~~~~~~~ 79 (148) +..+++|...|..+-..+.....+..++.-|+.++...+.+ .|..+- .++ ..+................... T Consensus 1 m~~~~~l~~~L~~~l~~L~~~~~~~l~~~Ig~~l~~~~~~rf~~q~~PdG~~W~p~k~~~~~~k~~~~~~~l~~~~~l~~ 80 (150) T protein:vir:57 1 MNEFKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSARKKTGRVKRKMFAKLITSR 80 (150) T ss_pred CchHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccChHHHHHhccCCCcccchhhhhcc Confidence 55666777666655433221112334555566666665554 342110 010 0000000000000000000011 Q ss_pred eeccccccccceeecCCCCCcceeeecccCc----------cCCCCCchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 80 GVNPDTGNSDNTMKADNPRNAFYWRFVEMGT----------VNMPPHPFVRPAFDVRSEQAAQVAIARMNR 140 (148) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT----------~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~ 140 (148) ......+.....+....+++..|+....||- ..+|++|||.=. ++.++++.+.+.+.|.+ T Consensus 81 sl~~~~~~~~a~vg~~~G~~~~yAaiHQfG~~~r~~~~~~~~~iPaRp~LG~s-~~d~~~i~~~i~~~l~r 150 (150) T protein:vir:57 81 FLHIRASPEQASMEFYGGKSPKIASVHQFGLSEETRKDGKKIDYPARPLLGFT-GEDVQMIEEIILAHLDR 150 (150) T ss_pred ceeeeeeCcEEEEEeecCCchhhhhhhhccccccccCCCceeecCCcccCCCC-HHHHHHHHHHHHHHHhC Confidence 1111111222222223467889999999993 258999999866 34566666666666665 No 146 >protein:vir:6071 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878212;genbank:gi:33438911;genbank:GeneID:1457746 Probab=96.90 E-value=8.2e-06 Score=48.47 Aligned_cols=132 Identities=11% Similarity=0.111 Sum_probs=59.5 Q ss_pred ehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CCCCcch--hh-hhceeccccccccceeeeeeee Q lcl|NC_011356. 8 FSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSR-----APVRRGK--LR-RNVVVLSRCSRDGGMESGVHIR 79 (148) Q Consensus 8 i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~-----aP~~~g~--l~-~~i~~~~~~~~~~~~~~~~~~~ 79 (148) +..+++|...|..+-..+.....+..++.-|+.++...+.+ .|..+-. ++ ..+................... T Consensus 1 ~~~~~~l~~~L~~~l~~L~~~~~~~l~r~Ig~~l~~~~~~Rf~~q~~PdG~~W~p~~~~~~~~k~~~~~~~l~~~~~l~~ 80 (150) T protein:vir:60 1 MNEFKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSARKKTGRVKRKMFAKLITSR 80 (150) T ss_pred CchHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccChHHHHHhhcCCCccchhhhhhcc Confidence 45556666655554333211112334555566666655554 3421110 10 0000000000000000000011 Q ss_pred eeccccccccceeecCCCCCcceeeecccCc----------cCCCCCchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 80 GVNPDTGNSDNTMKADNPRNAFYWRFVEMGT----------VNMPPHPFVRPAFDVRSEQAAQVAIARMNR 140 (148) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT----------~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~ 140 (148) ......+.....+....+++..|+....||- ..+|++|||.=. ++.++++++.+.+.|.+ T Consensus 81 sl~~~~~~~~a~vg~~~Gt~~~yAaiHQfG~~~~~~~~~~~~~iPaRp~LG~s-~~d~~~i~~~i~~~l~r 150 (150) T protein:vir:60 81 FLHIRASPEQASMEFYGGKSPKIASVHQFGLSEENRKDGKKIDYPARPLLGFT-GEDVQMIEEIILAHLDR 150 (150) T ss_pred eeeeeeeCcEEEEEeeCCCchhhhhhhhccccccccCCCCceecCCcccCCCC-HHHHHHHHHHHHHHHhC Confidence 1111111122222223467889999999993 368999999866 34566666666666666 No 147 >protein:vir:396 Length: 184 # NCBI annotation: gp11 # Family: family:all:869 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046906;genbank:gi:9630476;genbank:GeneID:1261650 Probab=96.89 E-value=3.8e-05 Score=44.82 Aligned_cols=140 Identities=14% Similarity=0.176 Sum_probs=65.5 Q ss_pred eeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC----cchhhhhceeccccccccceeeeeeeee- Q lcl|NC_011356. 6 LDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVR----RGKLRRNVVVLSRCSRDGGMESGVHIRG- 80 (148) Q Consensus 6 ~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~----~g~l~~~i~~~~~~~~~~~~~~~~~~~~- 80 (148) |+|+||+++++.|..|++....+++..|+..++.-+...+.+.+... ...+++.+++.... .+.....+.... T Consensus 1 ~~v~~l~~~~~~L~~l~~~~v~kA~~rAiNrt~~~~rt~~~r~v~~~~~i~~~~ir~r~~~~kas--~~~l~a~I~~~~~ 78 (184) T protein:vir:39 1 MSLKGLEQAIENLNSISKTAVPRASAQAVNRVANRAVSRSVAVVSKDTRVPRKLVKQRARVKRAT--VNKPRALIRVNRG 78 (184) T ss_pred CchHHHHHHHHHHhccCHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHhhheecccC--CCCeEEEEEEecc Confidence 99999999999999997775568889999998888888877766543 34555555442211 122121221110 Q ss_pred ------e---ccccc-----c--ccceeecCC--CCCccee--------eecccCccCC-------C-CCchhHHHHHHH Q lcl|NC_011356. 81 ------V---NPDTG-----N--SDNTMKADN--PRNAFYW--------RFVEMGTVNM-------P-PHPFVRPAFDVR 126 (148) Q Consensus 81 ------~---~~~~~-----~--~~~~~~~~~--~~~~~y~--------~~~E~GT~~~-------~-a~PFl~pA~~~~ 126 (148) . ....+ . ....++.+. -..+|.+ -|.--|.... | +.| +..+++.. T Consensus 79 ~i~l~~~g~~~~k~~~~~~~~~~~~~~~~~g~~~~~gaFia~~~~G~~~Vf~R~gk~R~PI~~~~~~i~~~-~~e~~~~~ 157 (184) T protein:vir:39 79 NLPAIKLGTASVRLSRRKRDKKGANSVLRIGPFRFPGGFIQQLKNGRWHVMRRTSKPRYPIEVVSIPLAAP-LTTAFKEE 157 (184) T ss_pred ceeeeeccccccccCccccccccccceeeecceecCcceeeecCCCceEEEEEecCcccceeEEEcCchHH-HHHHHHHH Confidence 0 00000 0 000011110 0112211 1222232221 2 122 23333322 Q ss_pred H-----HHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011356. 127 S-----EQAAQVAIARMNRAIDEVLRR 148 (148) Q Consensus 127 ~-----~~~~~~~~~~~~~~i~k~~kk 148 (148) - +.+...|..+|..+|++.++| T Consensus 158 ~~~~~~~~~~~el~~~l~~~L~~~l~r 184 (184) T protein:vir:39 158 LPKLMESDMPKELRASLTNQLRLILTR 184 (184) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhcCC Confidence 2 333334444444444444455 No 148 >protein:vir:45 Length: 112 # NCBI annotation: gp10 # Family: family:all:899 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463471;swissprot:trembl:q9t1b3;genbank:gi:16798793;uniprot:Q9T1B3;genbank:GeneID:922369 Probab=96.83 E-value=1.4e-05 Score=47.27 Aligned_cols=104 Identities=13% Similarity=0.118 Sum_probs=69.0 Q ss_pred ccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeeee Q lcl|NC_011356. 2 IETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRGV 81 (148) Q Consensus 2 m~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~~ 81 (148) |+++|+|.. ..+..+|. +++.++...-++.|..++..-+|.++|.|++|-.+. .++. | T Consensus 1 M~vkv~vn~-~~~~~~l~--------~a~~r~q~~~~~ev~~~~~~yVP~~~G~L~~S~~~~----~~g~----I----- 58 (112) T protein:vir:45 1 MPIKVRVDL-SKAKGSVK--------KAKERGQFALINQAAADIALYVPFLSGDLSNQYVIM----NDKE----I----- 58 (112) T ss_pred CceeEEeeh-HHHHHHHH--------HHHHHHHHHHHHHHHHHhhcCCccccCccccceeec----cCCe----E----- Confidence 999888874 33333322 233456777778888888999999999999873210 0110 0 Q ss_pred ccccccccceeecCCCCCcceeeecccCccC--------CCCCchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 82 NPDTGNSDNTMKADNPRNAFYWRFVEMGTVN--------MPPHPFVRPAFDVRSEQAAQVAIARMNRAI 142 (148) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~--------~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i 142 (148) .-+++|++++=||... .....|+..|.....+++++.+.+.+++.| T Consensus 59 ---------------~y~tPYAr~qYY~~~~~~~~~~~p~ag~~W~erak~~~~~~~~~~~~k~~~~gl 112 (112) T protein:vir:45 59 ---------------MWTSIYARRLYKGINFNFTLTHHPLAGPEWDQRAKIDKMDVWEKVAQKAVEEGL 112 (112) T ss_pred ---------------EecChhhHHhhhccccCCCCCCCCCCchhhHHHHHHhhHHHHHHHHHHHHhhcC Confidence 1233577776665432 334568888999888888888888888888 No 149 >protein:vir:80037 Length: 199 # NCBI annotation: gp11 # Family: family:all:503 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468715;genbank:gi:157325295;genbank:GeneID:5601728 Probab=96.80 E-value=1.7e-06 Score=52.24 Aligned_cols=126 Identities=12% Similarity=0.123 Sum_probs=55.3 Q ss_pred ccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceecc----ccccc-cceeeee Q lcl|NC_011356. 2 IETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLS----RCSRD-GGMESGV 76 (148) Q Consensus 2 m~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~----~~~~~-~~~~~~~ 76 (148) |+++-.-.-++++++.|+.|.... + .-++. .+.|..-..|.... ....+ +.+. | T Consensus 1 m~vt~~~~~~~~~~~~l~~L~~k~---v-~vGi~---------------~~d~~~~~~Ia~~~E~Ga~I~~~~~~l~--I 59 (199) T protein:vir:80 1 MKVTTDKSTMNKAIRELDQLDRYS---L-QIGLF---------------GEDDSFIQMIAGVHEFGLTIRPKGKYLT--I 59 (199) T ss_pred CcccccHHHHHHHHHHHHHhcCCE---E-EEEEe---------------cCCCcchhheeehhhcCCeeecCCceee--e Confidence 777655555778888888775321 1 11111 01111101111000 00000 1100 0 Q ss_pred eeeeecccc-cc------ccceeecCCCCCcceeeecccCcc--CCCCCchhHHHHHHHHHHHHHHHHHHHHHHH----- Q lcl|NC_011356. 77 HIRGVNPDT-GN------SDNTMKADNPRNAFYWRFVEMGTV--NMPPHPFVRPAFDVRSEQAAQVAIARMNRAI----- 142 (148) Q Consensus 77 ~~~~~~~~~-~~------~~~~~~~~~~~~~~y~~~~E~GT~--~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i----- 142 (148) ......... .. ...........+.--...+|||+. +.||||||||++++++++..+.+...+...| T Consensus 60 p~~~a~~~k~~~~~~~~~p~g~~~~~~~~~~~~~~~~e~g~~~~~IP~RPFlr~t~~~~~~~~~~~~~~~~~~vl~g~~~ 139 (199) T protein:vir:80 60 PTPEAGDRRARDIPGLFKPKGKNILAVAGPDGKLTVMFYLKTEVNIPERSFLRSTFDEKSNKWGELFEGWIDDVIHGKLS 139 (199) T ss_pred cchhhhcccccccCcccccCCcceeeeeccccceeeeeeccccccCCCCchhHHHHHHHHHHHHHHHHHHHHHHHhCCCc Confidence 000000000 00 000000000112223556899974 7899999999999999988877766666533 Q ss_pred -HHHhcC Q lcl|NC_011356. 143 -DEVLRR 148 (148) Q Consensus 143 -~k~~kk 148 (148) +++|.. T Consensus 140 a~~~L~~ 146 (199) T protein:vir:80 140 AEQVYNR 146 (199) T ss_pred HHHHHHH Confidence 223333 No 150 >protein:vir:79179 Length: 155 # NCBI annotation: gp39, phage virion morphogenesis protein # Family: family:all:370 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111070;genbank:gi:134288746;genbank:GeneID:4960698 Probab=96.69 E-value=1.4e-05 Score=47.18 Aligned_cols=132 Identities=15% Similarity=0.071 Sum_probs=60.4 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CCCCcch--hhhhceecccccccccee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSR-----APVRRGK--LRRNVVVLSRCSRDGGME 73 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~-----aP~~~g~--l~~~i~~~~~~~~~~~~~ 73 (148) ||+ .+++|...|+.|-..+....-+..++.-|+.++...+.+ .|..+.. ++..-.........|... T Consensus 1 m~~------~~~~l~~~l~~ll~~l~~~~~~~l~r~Ig~~l~~~t~~Rf~~q~~PDG~~W~prk~~~~~~~~~~~~g~~~ 74 (155) T protein:vir:79 1 MTD------DLQALERWAGGLLAKLSPAARRQLLRELGRDLRRAQQSRVAAQRNPDGSAYEPRKVKAGGKRLREKAGRVK 74 (155) T ss_pred Cch------HHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchhhhhhhhhcccCccc Confidence 775 244555555444333211111334555555555555543 4532211 110000000011111111 Q ss_pred eeee------eeeeccccccccceeecCCCCCcceeeecccCcc----------CCCCCchhHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 74 SGVH------IRGVNPDTGNSDNTMKADNPRNAFYWRFVEMGTV----------NMPPHPFVRPAFDVRSEQAAQVAIAR 137 (148) Q Consensus 74 ~~~~------~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~----------~~~a~PFl~pA~~~~~~~~~~~~~~~ 137 (148) .... ...+.... ......++..+++..|+...-||.. .+|++|||.=.- +.++++.+.+.+. T Consensus 75 ~~~m~~~l~~a~~l~~~~-~~d~a~Vg~~Gs~~~yAaiHQfG~~~r~~~~~~~v~iPaRp~LGls~-~d~~~I~~~i~~~ 152 (155) T protein:vir:79 75 REAMFRKLRTARYLRIDV-DSTGLAIGFDERLSRIARVHQEGQKAPVEPGGPLAQYPVRVVLGFSD-ADRELVRDRLLRE 152 (155) T ss_pred chhhhhhhhhhheeeeee-cCcEEEEEecCcchhhhhhhhcCCcccCCCCCcccccccccccCCCH-HHHHHHHHHHHHH Confidence 1100 00111111 1222333445678889999999943 689999997664 4667777777766 Q ss_pred HHH Q lcl|NC_011356. 138 MNR 140 (148) Q Consensus 138 ~~~ 140 (148) |.+ T Consensus 153 l~r 155 (155) T protein:vir:79 153 LTR 155 (155) T ss_pred hhC Confidence 666 No 151 >protein:vir:2026 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046769;genbank:gi:9630340;genbank:GeneID:1261511 Probab=96.69 E-value=1.3e-05 Score=47.44 Aligned_cols=132 Identities=12% Similarity=0.103 Sum_probs=61.0 Q ss_pred ehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CCCCcch--hh-hhceeccccccccceeeeeeee Q lcl|NC_011356. 8 FSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSR-----APVRRGK--LR-RNVVVLSRCSRDGGMESGVHIR 79 (148) Q Consensus 8 i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~-----aP~~~g~--l~-~~i~~~~~~~~~~~~~~~~~~~ 79 (148) +.-+++|...|..|-..+...-.+..+..-|+.+....+.+ .|..+-. ++ ..+....+.....-........ T Consensus 1 ~~~~~~l~~~L~~ll~~l~~~~~~~l~~~Ig~~l~~~~~~rf~~q~~PdG~~W~p~k~~~~~~k~g~~~~~l~~~~~l~~ 80 (150) T protein:vir:20 1 MNEFKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSVRKKTGRVKRKMFAKLITSR 80 (150) T ss_pred CchHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHHHhccCCCccccchhhhhh Confidence 45556666666655433211122344556666666666554 3421110 01 1111000000000000000111 Q ss_pred eeccccccccceeecCCCCCcceeeecccCc----------cCCCCCchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 80 GVNPDTGNSDNTMKADNPRNAFYWRFVEMGT----------VNMPPHPFVRPAFDVRSEQAAQVAIARMNR 140 (148) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT----------~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~ 140 (148) ......+.....+....+++..|+...-||- +.+|++|||.=.- ..++++++.+.+.|.+ T Consensus 81 sl~~~~~~~~~~vg~~~Gs~~~yAa~HQfG~~~~~~~~~~~~~iPaRp~LG~s~-~d~~~i~~~i~~~l~k 150 (150) T protein:vir:20 81 FLHIRASPEQASMEFYGGKSPKIASVHQFGLSEENRKDGKKIDYPARPLLGFTG-EDVQMIEEIILAHLER 150 (150) T ss_pred hhheeecCcEEEEEeeCCcchhhhhhhhcccccccccCCCceeccccccCCCCH-HHHHHHHHHHHHHHhC Confidence 1111112222222223467889999999993 4689999998653 4566666666666665 No 152 >protein:vir:100312 Length: 152 # NCBI annotation: tail synthesis protein S # Family: family:all:370 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655481;genbank:gi:109289949;genbank:GeneID:4157355 Probab=96.20 E-value=4.7e-05 Score=44.29 Aligned_cols=133 Identities=12% Similarity=0.052 Sum_probs=61.1 Q ss_pred ccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CCCCcch--hhhhceeccccccccceee Q lcl|NC_011356. 2 IETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSR-----APVRRGK--LRRNVVVLSRCSRDGGMES 74 (148) Q Consensus 2 m~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~-----aP~~~g~--l~~~i~~~~~~~~~~~~~~ 74 (148) |+-. +.+|...|+.|-..+....-+..+..-|+.+....+.+ +|..+.. ++............+..-. T Consensus 1 M~~~-----~~~~~~~L~~ll~~L~~~~r~~l~~~Ig~~l~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~k~~~~~~~m~~ 75 (152) T protein:vir:10 1 MSEP-----IEQVKTAFDSLLNNISKPRRRLMYQQIGRELARSQRRRIKAQQNPDGSAYEPRKKPKKGVKSKIKSGKMFD 75 (152) T ss_pred CchH-----HHHHHHHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchhhhhhhhhcccccchhHHH Confidence 4332 44555555544333211111334555666666655554 4533221 1111100000001111100 Q ss_pred eee-eeeeccccccccceeecCCCCCcceeeecccC-----------ccCCCCCchhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 75 GVH-IRGVNPDTGNSDNTMKADNPRNAFYWRFVEMG-----------TVNMPPHPFVRPAFDVRSEQAAQVAIARMNRA 141 (148) Q Consensus 75 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~G-----------T~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~ 141 (148) ... ..... .........++..+++..|+...-|| ++.+|++|||.=. +..++++.+.+.+.|..+ T Consensus 76 ~L~~a~~l~-~~a~~~~~~Vg~~Gt~~~yAaiHQfG~~~r~~~~~~~~v~iPaRp~LG~s-~~d~~~I~~~i~~~l~~a 152 (152) T protein:vir:10 76 KITQPRFMR-LRLESEGVSLGYEGGDAVIARIHQQGLIGRVRKDWDLKVKYASRELLGFT-DDDLQMIEDYMINILAGS 152 (152) T ss_pred hhhhcceee-eeecCcEEEEEecCCchhhhhhhccCccccccCCCCcceeccccccCCCC-HHHHHHHHHHHHHHHhcC Confidence 000 00001 11112223344446788999999998 5669999999766 346666666666666666 No 153 >protein:vir:4790 Length: 114 # NCBI annotation: putative minor capsid protein 3 # Family: family:all:899 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150170;swissprot:trembl:q94m41;genbank:gi:15088781;uniprot:Q94M41;genbank:GeneID:955992 Probab=96.13 E-value=6.1e-05 Score=43.69 Aligned_cols=104 Identities=16% Similarity=0.176 Sum_probs=61.8 Q ss_pred ccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeeee Q lcl|NC_011356. 2 IETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRGV 81 (148) Q Consensus 2 m~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~~ 81 (148) |+|+|+|. ++.+.+.|.. +.+.++...-+..+..+...-+|.++|.|++|..+... .+. | T Consensus 1 M~~kVkv~-l~~~~~~l~~-------~~l~r~Q~~~~~ev~~~~~~YVP~~~G~L~~S~~~~~~---~~~----I----- 60 (114) T protein:vir:47 1 MNIAIKVD-LQKAKQKLSN-------ESMTRGKIAVASKILLDNEQYIPLRGGELRASGRIVGQ---GDA----V----- 60 (114) T ss_pred CceeEEee-hhHHHHHHHH-------HHHHHHHHHHHHHHHHhhccCCcCccCccccceeeeeC---CcE----E----- Confidence 99888887 5555555532 22234555566777788888999999999998543210 010 0 Q ss_pred ccccccccceeecCCCCCcceeeecccCc----------cCCCCCchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 82 NPDTGNSDNTMKADNPRNAFYWRFVEMGT----------VNMPPHPFVRPAFDVRSEQAAQVAIARMNR 140 (148) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~y~~~~E~GT----------~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~ 140 (148) .-+++|++++=||. ..+....|+..|.....+++++.+.+.+.- T Consensus 61 ---------------~y~tPYAr~qyYg~~~~~~~~~~~~p~~g~~W~eraka~~~~~~~~~~~k~~g~ 114 (114) T protein:vir:47 61 ---------------VYGTVYARAQFYGSNGIVTFRRYTTPGTGKRWDQVATSKHAEEWARAFVKGMGL 114 (114) T ss_pred ---------------EecCchhhHhhhcccCCCCCCccCCCCCcchhHHHHHhhhhHHHHHHHHHhhCC Confidence 12335666665542 123445677777777776666655544333 No 154 >protein:vir:1838 Length: 149 # NCBI annotation: O protein # Family: family:all:370 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052262;genbank:gi:9634069;genbank:GeneID:1262457 Probab=96.13 E-value=3.5e-05 Score=45.02 Aligned_cols=131 Identities=15% Similarity=0.108 Sum_probs=58.3 Q ss_pred ehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CCCCc---chhhhhceeccccccccceeeeeeee Q lcl|NC_011356. 8 FSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSR-----APVRR---GKLRRNVVVLSRCSRDGGMESGVHIR 79 (148) Q Consensus 8 i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~-----aP~~~---g~l~~~i~~~~~~~~~~~~~~~~~~~ 79 (148) +..|+++...|+.|-..+....-+..++.-|+.+....+.+ .|..+ ..-+..+................... T Consensus 1 m~~~~~~~~~l~~ll~~L~~~~~~~l~r~Ig~~l~~~t~~rf~~q~~PdG~~W~p~~~~~~~~~~g~~~~~~~~~l~~~~ 80 (149) T protein:vir:18 1 MSELTALQERLAGLIASLSPAARRKMAAEIAKKLRTSQQQRIKRQQAPDGTPYAARKRQPVRSKKGRIKREMFAKLRTSR 80 (149) T ss_pred CchHHHHHHHHHHHHHhcCCchHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchhhhhhccCcccchhhhhhhhhh Confidence 22345555555444322211111344555666666655554 45311 00001111100000000000000000 Q ss_pred eeccccccccceeecCCCCCcceeeecccCcc----------CCCCCchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 80 GVNPDTGNSDNTMKADNPRNAFYWRFVEMGTV----------NMPPHPFVRPAFDVRSEQAAQVAIARMNR 140 (148) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~----------~~~a~PFl~pA~~~~~~~~~~~~~~~~~~ 140 (148) .... .........+..+++..|+....||.. .+|++|||.=. ++.++++++.+.+.|.+ T Consensus 81 ~l~~-~~~~~~~~v~~~Gtn~~yAaiHQfG~~~r~~~~~~~v~iPaRp~LG~s-~~d~~~I~~~i~~~l~~ 149 (149) T protein:vir:18 81 FMKA-KGSDSAAVVEFTGKVQRMARVHQYGLKDRPNRNSRDVQYEARPLLGFT-RDDEQMIEDVIISHLGK 149 (149) T ss_pred hhhe-eecCceeEEEecccchhhhhhhhccccccccCCCccccccccccCCCC-HHHHHHHHHHHHHHHhC Confidence 0111 111122233345678899999999954 68999999855 44666666666666665 No 155 >protein:vir:3427 Length: 192 # NCBI annotation: tail component # Family: family:all:869 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040590;genbank:gi:9626254;genbank:GeneID:2703485 Probab=96.12 E-value=0.00046 Score=38.88 Aligned_cols=141 Identities=14% Similarity=0.207 Sum_probs=66.0 Q ss_pred eeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcc----hhhhhceeccccccccceeeeeeee-- Q lcl|NC_011356. 6 LDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRG----KLRRNVVVLSRCSRDGGMESGVHIR-- 79 (148) Q Consensus 6 ~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g----~l~~~i~~~~~~~~~~~~~~~~~~~-- 79 (148) |+|+||+++++.|+.|++....+++..|+..+|.-+...+...+...+| .++..++...... .+....|... T Consensus 1 ~~ik~l~~~~~~L~~i~~~~vp~A~~rAiNrta~~a~t~~~r~v~~e~~I~~k~Ir~r~r~~kAs~--~~l~a~I~~~~~ 78 (192) T protein:vir:34 1 MAIKGLEQAVENLSRISKTAVPGAAAMAINRVASSAISQSASQVARETKVRRKLVKERARLKRATV--KNPQARIKVNRG 78 (192) T ss_pred CcchhHHHHHHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCHHHHHhhheeccccC--CCceEEEEEecc Confidence 7788999999999999887667888899998888888888777665443 4555555432211 1211111110 Q ss_pred -----eecccc-----------------ccccceee-cC-CCCCcce-------ee-ecc-cCccCCC----CCch---h Q lcl|NC_011356. 80 -----GVNPDT-----------------GNSDNTMK-AD-NPRNAFY-------WR-FVE-MGTVNMP----PHPF---V 119 (148) Q Consensus 80 -----~~~~~~-----------------~~~~~~~~-~~-~~~~~~y-------~~-~~E-~GT~~~~----a~PF---l 119 (148) ...... ........ +. .-..+|+ || |.- .|...-| -=|. + T Consensus 79 ~l~~~~l~~~~~~~~rr~~~~~~~~~~~~~~g~~~k~Gk~~f~gaFia~m~ng~~~Vf~R~~gk~R~PIe~vkIpis~~l 158 (192) T protein:vir:34 79 DLPVIKLGNARVVLSRRRRRKKGQRSSLKGGGSVLVVGNRRIPGAFIQQLKNGRWHVMQRVAGKNRYPIDVVKIPMAVPL 158 (192) T ss_pred ceeeeeecccccccccccccccccccccccccceeeecceecCCcccccCCCCCceeEEEccCCCccceeEEEechhHHH Confidence 000000 00000000 00 0011222 22 111 2332111 1122 2 Q ss_pred HHHHHHHHHHHH-HHHHHHHHHHHHHHhcC Q lcl|NC_011356. 120 RPAFDVRSEQAA-QVAIARMNRAIDEVLRR 148 (148) Q Consensus 120 ~pA~~~~~~~~~-~~~~~~~~~~i~k~~kk 148 (148) ..||+..-++++ +.|..+|..+|...++- T Consensus 159 ~~af~~~~~~~~~~~~~~El~~~L~~~lr~ 188 (192) T protein:vir:34 159 TTAFKQNIERIRRERLPKELGYALQHQLRM 188 (192) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 555554443332 33333444433333333 No 156 >protein:vir:98557 Length: 149 # NCBI annotation: gp14 # Family: family:all:370 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958069;genbank:gi:41057366;genbank:GeneID:2744228 Probab=96.05 E-value=2.7e-05 Score=45.64 Aligned_cols=126 Identities=14% Similarity=0.101 Sum_probs=59.0 Q ss_pred ehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CCCCcch--hhh-hceeccccc-----cccceee Q lcl|NC_011356. 8 FSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSR-----APVRRGK--LRR-NVVVLSRCS-----RDGGMES 74 (148) Q Consensus 8 i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~-----aP~~~g~--l~~-~i~~~~~~~-----~~~~~~~ 74 (148) +..+++|...|+.|-..+...--+..+..-|+.+....+.+ .|..+-. +.. .+....... ..+.+.. T Consensus 1 m~d~~~l~~~L~~ll~~L~~~~~~~ll~~Ig~~l~~~t~~rf~~q~~PdG~~W~p~~~~~~~~k~~~~~~~l~~~g~l~~ 80 (149) T protein:vir:98 1 MSELTALQERLTGLIASLSPAARRQMAADIAKKLRASQQQRIRRQQAPDGTPYAARKRQSVRSKKGRIRREMFARLRTNR 80 (149) T ss_pred CchHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHHhccCCCCcccchhhhhhh Confidence 22345666665555333211112344566666666666554 3431100 100 010000000 0011111 Q ss_pred eeeeeeeccccccccceeecCCCCCcceeeecccCcc----------CCCCCchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 75 GVHIRGVNPDTGNSDNTMKADNPRNAFYWRFVEMGTV----------NMPPHPFVRPAFDVRSEQAAQVAIARMNR 140 (148) Q Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~----------~~~a~PFl~pA~~~~~~~~~~~~~~~~~~ 140 (148) .+ ... ........+..+++..|+....||.. .+|++|||.=. ++.++++++.+.+.|.+ T Consensus 81 sl-----~~~-~~~~~~~V~~~Gs~~~yAa~HQfG~~~r~~~~~~~~~iPaRp~LG~s-~~d~~~i~~~i~~~l~~ 149 (149) T protein:vir:98 81 FM-----KAK-GSDSAAVVEFTGRVQRMARVHQYGLKDRPNRHSRDVQYAARPLLGFT-RDDEQMIEDIIIRHLGK 149 (149) T ss_pred hh-----hhe-ecCCeeEEEecCcchHHhhHhhccccccccCCCcceeccccccCCCC-HHHHHHHHHHHHHHhhC Confidence 11 111 11122233344678899999999953 58999999743 45566666666666665 No 157 >protein:vir:79115 Length: 148 # NCBI annotation: tail completion protein gpS # Family: family:all:370 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165266;genbank:gi:145708091;genbank:GeneID:5247126 Probab=95.95 E-value=5e-05 Score=44.17 Aligned_cols=131 Identities=12% Similarity=0.069 Sum_probs=63.9 Q ss_pred ehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CCCCcch--hhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 8 FSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSR-----APVRRGK--LRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 8 i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~-----aP~~~g~--l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) +..+++|...|..|-..+....-++.++.-|+.++...+.+ .|..+.. ++....................... T Consensus 1 m~~~~~l~~~L~~ll~~l~~~~~~~l~r~Ig~~l~~st~~Rf~~q~~PDG~~W~p~s~~~~~~~g~~~~~~~~~l~~~~~ 80 (148) T protein:vir:79 1 MSESRELEAWLAGMLTKLDAPARRMLARAVAAELRRRQAARIAEQRNPDGSPYVPRKPQLRHRAGRIRRAMFMRLRLARY 80 (148) T ss_pred CccHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCcCcccchHHHhhcccccccccchhhhhhh Confidence 44566777766665444322222344555556555555544 3532211 1111000000000000000001111 Q ss_pred eccccccccceeecCCCCCcceeeecccC----------ccCCCCCchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 81 VNPDTGNSDNTMKADNPRNAFYWRFVEMG----------TVNMPPHPFVRPAFDVRSEQAAQVAIARMNR 140 (148) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~y~~~~E~G----------T~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~ 140 (148) ..... ......++..+++..|+....|| ++.+|++|||.=. ++.++++++.+.+.|.- T Consensus 81 l~~~~-~~~~~~v~~~Gt~~~yAaiHQfG~~~r~~~~~~~v~iPaRp~LG~s-~~d~~~i~~~i~~~l~~ 148 (148) T protein:vir:79 81 MKTQA-DANTAVVTFAGNAQRIATVHQFGLRDRVNKAGLTAQYPARELLGMD-GVDMEHITNLLLLHLGA 148 (148) T ss_pred eeeee-eCCeeeEEeeccchhhhhhhhcCccccccCCCCccccCcccccCCC-HHHHHHHHHHHHHHhcC Confidence 11111 11223333446778899999999 4569999999855 45777788877777777 No 158 >protein:vir:1581 Length: 116 # NCBI annotation: minor capsid protein # Family: family:all:899 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695163;swissprot:trembl:o03933;genbank:gi:23455806;uniprot:O03933;genbank:GeneID:955512 Probab=94.98 E-value=0.00032 Score=39.75 Aligned_cols=105 Identities=10% Similarity=0.006 Sum_probs=57.6 Q ss_pred ccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeeee Q lcl|NC_011356. 2 IETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRGV 81 (148) Q Consensus 2 m~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~~ 81 (148) |+++|+|. ++.+.++|. .+.+.++-..-+..+..++..-+|.++|.+..+........ .+.+ T Consensus 1 M~ikVkv~-l~~~~~~~~-------~~~~~r~Q~~l~~qv~~~m~~YVP~~tg~~~ls~~~~~~~~-~~~I--------- 62 (116) T protein:vir:15 1 MAFRINVD-LDGFMDQTS-------LDNVKRGQYALVNQAMYDMEQFVPKDRPEEPLRQSVHATSD-GSEI--------- 62 (116) T ss_pred CCceEEee-hhHhhhhhh-------HHHHHHHHHHHHHHHHHhhhccCCcccCCcccccceeeecC-CceE--------- Confidence 99888877 455544442 22334556666677888888899999987543322111100 0000 Q ss_pred ccccccccceeecCCCCCcceeeecccC-----------ccCCCCCchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 82 NPDTGNSDNTMKADNPRNAFYWRFVEMG-----------TVNMPPHPFVRPAFDVRSEQAAQVAIARMN 139 (148) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~y~~~~E~G-----------T~~~~a~PFl~pA~~~~~~~~~~~~~~~~~ 139 (148) .-+++|++.+=|| |.......|+..|.....+...+.+.++++ T Consensus 63 ---------------~y~tPYAr~qyYg~~~~~~~~~~~t~p~ag~~W~eraK~~h~~~w~~~~~k~~~ 116 (116) T protein:vir:15 63 ---------------TYSTPYAKAQFYGIINDKYPVHNYTTPGTTKRWDLKAKSMFMSSWIDTFTKGMK 116 (116) T ss_pred ---------------EecCchhHHHhcccccCCCCcccccCCCCCcchhHHHHhhhHHHHHHHHHHhcC Confidence 0122344444332 222334557777877777766665555555 No 159 >protein:vir:6375 Length: 205 # NCBI annotation: hypothetical protein # Family: family:all:10491 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918988;genbank:gi:34610163;genbank:gi:91214209;genbank:GeneID:2559587 Probab=94.96 E-value=0.0012 Score=36.66 Aligned_cols=145 Identities=12% Similarity=0.128 Sum_probs=58.0 Q ss_pred ccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHH-H----HHHHhCCCCcchhhhhceecccc-ccccceeee Q lcl|NC_011356. 2 IETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLK-E----EVVSRAPVRRGKLRRNVVVLSRC-SRDGGMESG 75 (148) Q Consensus 2 m~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~-~----~ak~~aP~~~g~l~~~i~~~~~~-~~~~~~~~~ 75 (148) |.+++.++|++++.+.|+.|++... +++..|+.++|..-. . .+........+.+.++.+.+--. ..++.+... T Consensus 1 m~i~v~~~G~~~~~~~l~~l~~~~~-~a~~~AIN~ta~~~~~~~A~~~i~~~vn~k~~yv~~~~Rlti~k~As~~~L~A~ 79 (205) T protein:vir:63 1 MSIEIVAEGLGEFRDYVDRLPDISQ-QAAMIAINQTAQRTALPLARTEIGEQVNFPDNYLKDDSRLGVTKKATRNDLEAV 79 (205) T ss_pred CeeeeehhhHHHHHHHHHhcchhhh-HHHHHHHHHHHHHhhHHHHHHhhhhccccchhhhccceeeEEEeecCCCCeeEE Confidence 9999999999999999999998654 566667766554443 2 23333444455555432211000 011111110 Q ss_pred eee--------eeeccc-cc-----cccceeecCCCCCcceeeec--------------------c----------cCcc Q lcl|NC_011356. 76 VHI--------RGVNPD-TG-----NSDNTMKADNPRNAFYWRFV--------------------E----------MGTV 111 (148) Q Consensus 76 ~~~--------~~~~~~-~~-----~~~~~~~~~~~~~~~y~~~~--------------------E----------~GT~ 111 (148) |.. .+..+. .. ....+.+..+....+-..|+ - -|.. T Consensus 80 I~ar~rpt~LsRF~~p~~~~~~~r~~GVsV~Vk~G~ak~l~gaF~~~lk~g~~l~e~~~~vgva~R~~~g~~~~~~~g~~ 159 (205) T protein:vir:63 80 IGARQRPTSLARFAEPGQTTKSTRKGGVSVVVKPGRTKQFKRGFLVRLRAGKTLTEDKYNLGLAVRLSPGETLHATDGAT 159 (205) T ss_pred EecCCCcceeeeccCCCccccccccCCeEEEEEcCCCeeccCceEEEeeccccccccccceEEEeeecCccccccccCce Confidence 000 000000 00 00000001111111111111 0 0111 Q ss_pred CCCCCc---hhHHHHHHHHH----HHHHHHHHHHHHHHHHHhcC Q lcl|NC_011356. 112 NMPPHP---FVRPAFDVRSE----QAAQVAIARMNRAIDEVLRR 148 (148) Q Consensus 112 ~~~a~P---Fl~pA~~~~~~----~~~~~~~~~~~~~i~k~~kk 148 (148) +. +.+ |..|.+++.-. .+...+.+.+.+++++-.-+ T Consensus 160 k~-~~~~k~LYGPSV~Qvf~~~~e~I~~~i~~~l~~~f~r~~~~ 202 (205) T protein:vir:63 160 KL-SNNVYLLYGPSVDQVFRTVADDITTEVLDALADEFLRQFTR 202 (205) T ss_pred ec-CCceEEEEcCcHHHHHhhhhhhhhHHHHHHHHHHHHHhhhh Confidence 11 112 45565555433 33333333333333333322 No 160 >protein:vir:96763 Length: 177 # NCBI annotation: putative phage-related protein # Family: family:all:1091 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039824;genbank:gi:126010915;genbank:GeneID:5076273 Probab=94.89 E-value=0.0022 Score=35.19 Aligned_cols=144 Identities=14% Similarity=0.098 Sum_probs=75.8 Q ss_pred Cccceeeehh-HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC----cchhhhhceeccccccccceeee Q lcl|NC_011356. 1 MIETLLDFSG-LEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVR----RGKLRRNVVVLSRCSRDGGMESG 75 (148) Q Consensus 1 Mm~~~~~i~G-l~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~----~g~l~~~i~~~~~~~~~~~~~~~ 75 (148) =|+++|++++ ++.+.+.|..+++. ..+++..||..++.-+...+.+.+... ...+++.+.+....+ ... .. T Consensus 4 ~~~l~idv~~~l~~i~~~l~~~~~~-~~~A~~rAlNrta~~~rt~~~r~v~~~~~i~~k~ir~r~~~~~a~~-~~~--~~ 79 (177) T protein:vir:96 4 GFEMKIDVSREAEDIAAMVAATTKQ-LELAAQRAMTKAGQWLRTHSVRELGQQLGIKQEPLKKRFRVYPQRQ-KGE--VR 79 (177) T ss_pred CceeEEehhHHHHHHHHHHhhcHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHhhheeeccCC-CcE--EE Confidence 5667888877 44444555555544 467888899888888887777765443 345666665543221 111 11 Q ss_pred eeeee----e-cccc-ccccceeecC--CCCCccee-------e-ecccCccC-------CCCCchhHHHHHHHHHHHHH Q lcl|NC_011356. 76 VHIRG----V-NPDT-GNSDNTMKAD--NPRNAFYW-------R-FVEMGTVN-------MPPHPFVRPAFDVRSEQAAQ 132 (148) Q Consensus 76 ~~~~~----~-~~~~-~~~~~~~~~~--~~~~~~y~-------~-~~E~GT~~-------~~a~PFl~pA~~~~~~~~~~ 132 (148) +.... . .-.. ......+... .-..+|.+ + |.--|... .|--|=+..+++...+++.+ T Consensus 80 i~~~~~~i~l~~~~~~r~t~~Gv~~g~~~~~gaFia~~~~g~~~Vf~R~gk~R~PI~~~~~pi~~~~~~~~e~~~~~~~~ 159 (177) T protein:vir:96 80 FWVGLDPIGVYRLGTPKVTQKGVKVNRNEYDGAFISPMKSNYPLVFKRRGKERLPIDLVDEDIDEPAMEVVERWERRVFQ 159 (177) T ss_pred EEEeccceehhhcccCCCCccceEEeeEEcCCceeccCCCCCceEEEEecCCccceEEEEcCchHHHHHHHHHHHHHHHH Confidence 11110 0 0000 0000011100 00111111 1 11112222 23344456778877788888 Q ss_pred HHHHHHHHHHHHHhcC Q lcl|NC_011356. 133 VAIARMNRAIDEVLRR 148 (148) Q Consensus 133 ~~~~~~~~~i~k~~kk 148 (148) .|...|.++|+.+|+- T Consensus 160 ~~~~~l~~Ei~~~L~g 175 (177) T protein:vir:96 160 RFKELFEQEARAIING 175 (177) T ss_pred HHHHHHHHHHHHHhcc Confidence 8899999999999999 No 161 >protein:vir:79687 Length: 113 # NCBI annotation: hypothetical protein # Family: family:all:899 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285886;genbank:gi:148750843;genbank:GeneID:5220386 Probab=93.82 E-value=0.00047 Score=38.85 Aligned_cols=103 Identities=22% Similarity=0.180 Sum_probs=59.7 Q ss_pred HHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeeeeccccccccceeec Q lcl|NC_011356. 15 SRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRGVNPDTGNSDNTMKA 94 (148) Q Consensus 15 ~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 94 (148) +..|..+.+.+..+.+.+|-..-+..|..++..-+|.++|.|++|..+.. +. | T Consensus 1 ~~dL~~~~~~~~~~~~~raQ~~l~~ev~~~~~pYVP~~~G~Lk~S~~i~s-----~~----I------------------ 53 (113) T protein:vir:79 1 MSDLSVFSRMAQSTGSRSVRLQVLNQMHQDMEQYVPKRAGFLRSQSFVND-----TG----I------------------ 53 (113) T ss_pred CchHHHHHHhhchhHHHHHHHHHHHHHHHhhcccCcccccchhccccccC-----Ce----e------------------ Confidence 44444444444445566777788888999999999999999999853210 10 0 Q ss_pred CCCCCcceeeecccCccC----------CCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_011356. 95 DNPRNAFYWRFVEMGTVN----------MPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEVL 146 (148) Q Consensus 95 ~~~~~~~y~~~~E~GT~~----------~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~k~~ 146 (148) .-+++|++++=||... .....|+..|.....++.++.+.+.+.++-+=.- T Consensus 54 --~y~tPYAr~qyYg~~~~~~~~~~t~p~ag~~W~eraKa~h~~~w~~~~~~a~~~G~~~~~ 113 (113) T protein:vir:79 54 --HYTAKYARAQFYGFVNGHRVRNYSTPGTGRRWDLKAKAVYKADWQKVAVAAFLKEAKGEY 113 (113) T ss_pred --EecChhhhHhhccccCCCCccccCCCCCCchhhHHHHHHhHHHHHHHHHHHhhccccccC Confidence 1123466666555322 3345577777776666555554444433321111 No 162 >protein:vir:8106 Length: 150 # NCBI annotation: gp10 # Family: family:all:3937 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817687;genbank:gi:29566118;genbank:GeneID:1259312 Probab=93.78 E-value=9.1e-05 Score=42.73 Aligned_cols=117 Identities=17% Similarity=0.193 Sum_probs=55.2 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) +.++.+... +|.+-++.-+ ++ +.-+..=+.. +...-|+++.|+++|.+++|..+..+.. .| T Consensus 5 ~~KFGvS~~---e~~K~irns~-EV-~~GiNdFMe~---~A~~~aK~~SPV~~GeY~~S~~V~~ka~-NG---------- 65 (150) T protein:vir:81 5 FEKFGVSDS---ELAKHIRNSA-EV-DAGINDFMEN---EAIPYAKSISPVDDGEYAASWAVMKKAK-NG---------- 65 (150) T ss_pred hhhhcCCHH---HHHHhhccch-hh-hhhHHHHHHh---hhhhhhhccCCcccchhHHHHHHHhhcc-cC---------- Confidence 334455444 4433333322 11 1112222222 2234568999999999999987644321 11 Q ss_pred eccccccccceeecCCCCCcceeeecccCccCC---------------------CCCchhH-----HHHH-HHHHHHHHH Q lcl|NC_011356. 81 VNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNM---------------------PPHPFVR-----PAFD-VRSEQAAQV 133 (148) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~---------------------~a~PFl~-----pA~~-~~~~~~~~~ 133 (148) ++..+..+||+||+||||--- .---|-| |+-. -..+.+..- T Consensus 66 ------------RG~~G~~~~~AH~VEFGtgadkkqgrgkkgkrgkdgkrtveiddgefrrvgpdtptkaqgiaqkvash 133 (150) T protein:vir:81 66 ------------RGVFGPKAWYAHFVEFGTGADKKQGRGKKGKRGKDGKRTVEIDDGEFRRVGPDTPTKAQGIAQKVASH 133 (150) T ss_pred ------------ccccCccchhhhhhhhccccccccccccccccCcccceeeeecCccceecCCCCchhhhhHHHHHHHh Confidence 111245679999999997321 0001111 1111 112233334 Q ss_pred HHHHHHHHHHHHhcC Q lcl|NC_011356. 134 AIARMNRAIDEVLRR 148 (148) Q Consensus 134 ~~~~~~~~i~k~~kk 148 (148) |.-.|+-.|.+.|.- T Consensus 134 fggslkggiskslsd 148 (150) T protein:vir:81 134 FGGSLKGGISKSLSD 148 (150) T ss_pred ccccccccccccccc Confidence 444455555555555 No 163 >protein:vir:9823 Length: 118 # NCBI annotation: putative minor capsid protein # Family: family:all:899 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795585;genbank:gi:28876336;genbank:GeneID:1257873 Probab=93.45 E-value=0.00053 Score=38.56 Aligned_cols=102 Identities=16% Similarity=0.137 Sum_probs=52.1 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) ||+++|++.+++..+. .+.+.++...-+..+..++..-+|.++|.|++|..+... .|. T Consensus 1 m~kV~vdl~~~~~~ls----------~~~~~k~Q~~~~~ev~~~~~~YVP~~tG~Lk~S~~i~~~---------~I~--- 58 (118) T protein:vir:98 1 MAKVVVELGGIKRKVS----------PQALAKGKLIMNNQVMMSMNPYVPYRDGALRGSSRANSV---------GVT--- 58 (118) T ss_pred CceeeechhHHhhhhh----------HHHHHHHHHHHHHHHHHHhhcCCCCccCccccceeecCC---------eeE--- Confidence 9999999888765441 112233455556677778888999999999998543211 010 Q ss_pred eccccccccceeecCCCCCcceeeecccC------------ccC--CCCCchhHHHHHHHH--HHHHHHHHHHHHHH Q lcl|NC_011356. 81 VNPDTGNSDNTMKADNPRNAFYWRFVEMG------------TVN--MPPHPFVRPAFDVRS--EQAAQVAIARMNRA 141 (148) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~y~~~~E~G------------T~~--~~a~PFl~pA~~~~~--~~~~~~~~~~~~~~ 141 (148) -+++|++.+=|| |.. +....|..++.-..+ +...+.+...+.-. T Consensus 59 -----------------Y~tPYAr~qYY~~~~~~~~g~~~~~~~~p~~g~~Wd~R~ka~~~~~~~w~~~~~k~~g~k 118 (118) T protein:vir:98 59 -----------------WSGPHARAQFYGGAYNKYKSFKFKKYTTPGTGKRWDKRALANATIVKDWEKSLLRGMGFK 118 (118) T ss_pred -----------------ECCchhhHhhhccccCCCCccccccccCCCCCCcccchhhcchhhhHHHHHHHHHhcCCC Confidence 122344433332 111 223445555443222 22233222222111 No 164 >protein:vir:3036 Length: 118 # NCBI annotation: minor capsid protein # Family: family:all:899 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438149;genbank:gi:16271812;genbank:GeneID:929237 Probab=93.45 E-value=0.00053 Score=38.56 Aligned_cols=102 Identities=16% Similarity=0.137 Sum_probs=52.1 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) ||+++|++.+++..+. .+.+.++...-+..+..++..-+|.++|.|++|..+... .|. T Consensus 1 m~kV~vdl~~~~~~ls----------~~~~~k~Q~~~~~ev~~~~~~YVP~~tG~Lk~S~~i~~~---------~I~--- 58 (118) T protein:vir:30 1 MAKVVVELGGIKRKVS----------PQALAKGKLIMNNQVMMSMNPYVPYRDGALRGSSRANSV---------GVT--- 58 (118) T ss_pred CceeeechhHHhhhhh----------HHHHHHHHHHHHHHHHHHhhcCCCCccCccccceeecCC---------eeE--- Confidence 9999999888765441 112233455556677778888999999999998543211 010 Q ss_pred eccccccccceeecCCCCCcceeeecccC------------ccC--CCCCchhHHHHHHHH--HHHHHHHHHHHHHH Q lcl|NC_011356. 81 VNPDTGNSDNTMKADNPRNAFYWRFVEMG------------TVN--MPPHPFVRPAFDVRS--EQAAQVAIARMNRA 141 (148) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~y~~~~E~G------------T~~--~~a~PFl~pA~~~~~--~~~~~~~~~~~~~~ 141 (148) -+++|++.+=|| |.. +....|..++.-..+ +...+.+...+.-. T Consensus 59 -----------------Y~tPYAr~qYY~~~~~~~~g~~~~~~~~p~~g~~Wd~R~ka~~~~~~~w~~~~~k~~g~k 118 (118) T protein:vir:30 59 -----------------WSGPHARAQFYGGAYNKYKSFKFKKYTTPGTGKRWDKRALANATIVKDWEKSLLRGMGFK 118 (118) T ss_pred -----------------ECCchhhHhhhccccCCCCccccccccCCCCCCcccchhhcchhhhHHHHHHHHHhcCCC Confidence 122344433332 111 223445555443222 22233222222111 No 165 >protein:vir:102608 Length: 108 # NCBI annotation: gp9 # Family: family:all:3937 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655005;genbank:gi:109392195;genbank:GeneID:4157230 Probab=92.36 E-value=0.00018 Score=41.05 Aligned_cols=90 Identities=17% Similarity=0.220 Sum_probs=54.4 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) +.++.+.+.- +++|++ +.+.+.+=...+..+-+.+.|+.+|.+|+|+.+..++...|.. T Consensus 11 lakfgi~ldd-------fdklpe------vnqgvnef~dev~aawk~nspv~~g~yrdsvqvterstnkgrg-------- 69 (108) T protein:vir:10 11 LAKFGVRLDD-------FDKLPE------VNQGVNEFIDEVVAAWKNNSPVGTGAYRDSVQVTERSTNKGRG-------- 69 (108) T ss_pred hhhhccchhh-------hhccch------hhhhHHHHHHHHHHhhhcCCCccccccccceeecccccccccc-------- Confidence 4444444332 334442 2455666667777888999999999999998876544332211 Q ss_pred eccccccccceeecCCCCCcceeeecccCccCC----CCC----chhHHHHHH Q lcl|NC_011356. 81 VNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNM----PPH----PFVRPAFDV 125 (148) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~----~a~----PFl~pA~~~ 125 (148) ..+...+.+|.+|||..+. |+| -|=..||.. T Consensus 70 --------------kvgatdpqahlvefgs~hndeyapaqktakqfggtay~d 108 (108) T protein:vir:10 70 --------------KVGATDPQAHLVEFGSAHNDEYAPAQKTAKQFGGTAYGD 108 (108) T ss_pred --------------cccCcchhhhhhhhhccccccccchhhhHHhhcccccCC Confidence 1112336899999998763 443 255555554 No 166 >protein:vir:105825 Length: 108 # NCBI annotation: gp9 # Family: family:all:3937 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655770;genbank:gi:109522093;genbank:GeneID:4157633 Probab=92.36 E-value=0.00018 Score=41.05 Aligned_cols=90 Identities=17% Similarity=0.220 Sum_probs=54.4 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRG 80 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~ 80 (148) +.++.+.+.- +++|++ +.+.+.+=...+..+-+.+.|+.+|.+|+|+.+..++...|.. T Consensus 11 lakfgi~ldd-------fdklpe------vnqgvnef~dev~aawk~nspv~~g~yrdsvqvterstnkgrg-------- 69 (108) T protein:vir:10 11 LAKFGVRLDD-------FDKLPE------VNQGVNEFIDEVVAAWKNNSPVGTGAYRDSVQVTERSTNKGRG-------- 69 (108) T ss_pred hhhhccchhh-------hhccch------hhhhHHHHHHHHHHhhhcCCCccccccccceeecccccccccc-------- Confidence 4444444332 334442 2455666667777888999999999999998876544332211 Q ss_pred eccccccccceeecCCCCCcceeeecccCccCC----CCC----chhHHHHHH Q lcl|NC_011356. 81 VNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNM----PPH----PFVRPAFDV 125 (148) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~----~a~----PFl~pA~~~ 125 (148) ..+...+.+|.+|||..+. |+| -|=..||.. T Consensus 70 --------------kvgatdpqahlvefgs~hndeyapaqktakqfggtay~d 108 (108) T protein:vir:10 70 --------------KVGATDPQAHLVEFGSAHNDEYAPAQKTAKQFGGTAYGD 108 (108) T ss_pred --------------cccCcchhhhhhhhhccccccccchhhhHHhhcccccCC Confidence 1112336899999998763 443 255555554 No 167 >protein:vir:1164 Length: 156 # NCBI annotation: predicted tail completion # Family: family:all:370 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490613;genbank:gi:17313233;genbank:GeneID:927308 Probab=91.61 E-value=0.002 Score=35.44 Aligned_cols=136 Identities=13% Similarity=-0.001 Sum_probs=54.0 Q ss_pred ccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CCCCcch--hh-hhceecccc-ccccce Q lcl|NC_011356. 2 IETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSR-----APVRRGK--LR-RNVVVLSRC-SRDGGM 72 (148) Q Consensus 2 m~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~-----aP~~~g~--l~-~~i~~~~~~-~~~~~~ 72 (148) |+- ++ .+|...|..|-..+....-+..++.-|+.+....+.+ .|..+.. ++ ..+...... ...... T Consensus 1 m~~--~~---~~l~~~L~~ll~~L~~~~~~~l~r~Ig~~l~~~t~~Rf~~q~~PdG~~W~p~~~~~~~~~~~~~~~~~~m 75 (156) T protein:vir:11 1 MAD--SL---EALEDWAGPILRALEPGPRAALARSLARDLRRSQQKRVMAQRNPDGSAYEPRKKRELRGKQGRIRRKIKM 75 (156) T ss_pred Cch--hH---HHHHHHHHHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHhhhccccccchhh Confidence 332 22 3333333333222110111234555555555555543 3532110 00 000000000 000000 Q ss_pred ee-eeeeeeeccccccccceeecCCCCCcceeeecccCcc----------CCCCCchhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 73 ES-GVHIRGVNPDTGNSDNTMKADNPRNAFYWRFVEMGTV----------NMPPHPFVRPAFDVRSEQAAQVAIARMNRA 141 (148) Q Consensus 73 ~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~----------~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~ 141 (148) .. ........... ......++..+++..|++..-||.. .+|++|||.=.- +.++++++.+.+.|... T Consensus 76 ~~~l~~~~~l~~~~-~~~~a~vg~~Gs~~~yA~iHQfG~~~~~~~~~~~v~iPaRp~LG~s~-~d~~~i~~~i~~~l~~~ 153 (156) T protein:vir:11 76 FQKLRTVRYLRAKG-DAQAITVSFAGRIARIARVHQYGLRDRAEPGAPEVSYAQRLLLGFDS-SDMETIQNGILAHIDAN 153 (156) T ss_pred hhhhhhhheeeeee-cCcEEEEEecCCchhhhhhhcccccccccCCCCcccccccccCCCCH-HHHHHHHHHHHHHHhhc Confidence 00 00000011111 1222233344688899999999965 689999997653 34555555444444332 Q ss_pred HHH Q lcl|NC_011356. 142 IDE 144 (148) Q Consensus 142 i~k 144 (148) --- T Consensus 154 ~~~ 156 (156) T protein:vir:11 154 SPI 156 (156) T ss_pred CCC Confidence 221 No 168 >protein:vir:102190 Length: 93 # NCBI annotation: gp21 # Family: family:all:2713 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655217;genbank:gi:109522797;genbank:GeneID:4157429 Probab=89.71 E-value=0.0054 Score=33.01 Aligned_cols=91 Identities=14% Similarity=0.145 Sum_probs=56.6 Q ss_pred HHHHHHHHHHHHHHHHHHhCCC--CcchhhhhceeccccccccceeeeeeeeeeccccccccceeecCCCCCcceeeecc Q lcl|NC_011356. 30 LREATRAGANVLKEEVVSRAPV--RRGKLRRNVVVLSRCSRDGGMESGVHIRGVNPDTGNSDNTMKADNPRNAFYWRFVE 107 (148) Q Consensus 30 ~~~al~~~a~~i~~~ak~~aP~--~~g~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E 107 (148) +..-..-+|..+..+||.+||= .||+-|..|.............. .-.....|.-|+| T Consensus 1 ~~~~~d~aa~~le~~aK~nApW~DRTg~AR~~l~~~~~~~g~~~~~i--------------------~lsh~v~Yg~~LE 60 (93) T protein:vir:10 1 MKATSNYHAVEGTAHMKEHAPWTDRTGAARAGLHAVASTPQPDRYEI--------------------VFAHTVHYGIWLE 60 (93) T ss_pred CchhhhHHHHHHHHHHhcCCCccccchhhhhhhcccccccCCceEEE--------------------EEecCeeccceEE Confidence 4445566788999999999995 47776666643221111111111 1123446999999 Q ss_pred cCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 108 MGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAI 142 (148) Q Consensus 108 ~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i 142 (148) .++...++ .++|+++..-+++++-+...+.+.= T Consensus 61 ~a~~~kya--Il~Ptv~~~~~~i~~g~~~ll~~l~ 93 (93) T protein:vir:10 61 IANSGRYE--IIMPTVHHEGKLMAQRLRGLLGRLR 93 (93) T ss_pred eecCCCcc--chhhhHHHHHHHHHHHHHHHHHhcC Confidence 99876654 7889998888877776655444322 No 169 >protein:vir:79555 Length: 192 # NCBI annotation: putative tail component # Family: family:all:869 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272521;genbank:gi:148609390;genbank:GeneID:5204391 Probab=88.26 E-value=0.033 Score=28.68 Aligned_cols=140 Identities=16% Similarity=0.267 Sum_probs=53.0 Q ss_pred ehhHHHHHHHHHHhHHHHHHHHH--------HHHHHHHHHHHHHHHHHhCCCCcc----hhhhhceeccccccccceeee Q lcl|NC_011356. 8 FSGLEDISRDLQLLSGAENNRVL--------REATRAGANVLKEEVVSRAPVRRG----KLRRNVVVLSRCSRDGGMESG 75 (148) Q Consensus 8 i~Gl~el~~~l~~l~~~~~~~~~--------~~al~~~a~~i~~~ak~~aP~~~g----~l~~~i~~~~~~~~~~~~~~~ 75 (148) |+||+++++.|+.|++....++. ..|+..++.-|..+.....+...+ .++.-++...... ++..... T Consensus 1 ~kgl~~a~~nl~~l~~~~vp~A~~~ainrva~ra~~~t~~~v~~~~~~~~~~~~~I~~k~iR~R~r~~ka~~-~~~~~~~ 79 (192) T protein:vir:79 1 MKGLENAIRNLNSLDTRMVPQASAWAINRVAQKAVSVATRQVAGNTVAGDNQVKGIPLKLVRQRVRVFKASP-SGKMTAR 79 (192) T ss_pred CchHHHHHHHHHhcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhcCcHHHHHhhhhcccccC-CCceEEE Confidence 99999999999998876443443 344444555555554322222111 1222222111110 1111100 Q ss_pred eee--e-----eecc-------ccc----cccceeecCC-------CC-Ccceeee-cc-cCccCCC----CCchhHHHH Q lcl|NC_011356. 76 VHI--R-----GVNP-------DTG----NSDNTMKADN-------PR-NAFYWRF-VE-MGTVNMP----PHPFVRPAF 123 (148) Q Consensus 76 ~~~--~-----~~~~-------~~~----~~~~~~~~~~-------~~-~~~y~~~-~E-~GT~~~~----a~PFl~pA~ 123 (148) +.+ . ..+. ..+ .......+.. .. +-.+||- .- .|...-| -=|.-.|.- T Consensus 80 I~v~~~~l~ai~lg~~r~r~~rr~~~~~~~~s~~~vGk~~f~gaFia~m~ngr~~V~~R~~gk~R~PIevvkIpis~~l~ 159 (192) T protein:vir:79 80 IRVNRGNLPAIKLGTARVRLARRGGKLQYRGSVLKVGKYLFRDAFIQQLANGRWHVMRRIDGKNRYPIDVVKIPLSGPLT 159 (192) T ss_pred EEEecCceeeeeecccccccccccccccccccceEEcceecCchhccccCCCCccceEecCCCccCCeeeEeechHHHHH Confidence 000 0 0000 000 0000011000 00 1112332 22 2432221 124444444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011356. 124 DVRSEQAAQVAIARMNRAIDEVLRR 148 (148) Q Consensus 124 ~~~~~~~~~~~~~~~~~~i~k~~kk 148 (148) ++-+.++...+.+++.++|..+|+. T Consensus 160 ~af~~e~~r~~~~~~~~el~~~L~~ 184 (192) T protein:vir:79 160 QAFEDARDRIIAAEMPKQLGYALKQ 184 (192) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH Confidence 4444455555555555555555444 No 170 >protein:vir:4460 Length: 170 # NCBI annotation: hypothetical protein # Family: family:all:2152 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700383;genbank:gi:23505455;genbank:GeneID:955662 Probab=85.67 E-value=0.018 Score=30.09 Aligned_cols=130 Identities=21% Similarity=0.286 Sum_probs=76.4 Q ss_pred Ccc---ceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhC-----------CC-Ccchhhhhceeccc Q lcl|NC_011356. 1 MIE---TLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRA-----------PV-RRGKLRRNVVVLSR 65 (148) Q Consensus 1 Mm~---~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~a-----------P~-~~g~l~~~i~~~~~ 65 (148) |.+ +-|+++-.+++. ..+.-++.|..+.+++.-.+|+.++ |. .||.|..||..-.. T Consensus 1 M~~~~~lHvdF~qp~~~~---------Fnr~r~RraF~~iGq~h~r~Arrlvm~RGrs~pGe~P~~~TGrLa~SIgy~Vp 71 (170) T protein:vir:44 1 MPQKAYLHVDFVQPEELV---------FNRARMRRAFVKIGQVHMRDARRLVMKRGRSKPGENPSYRTGQLARSIGYYVP 71 (170) T ss_pred CCCCceeEEeeecCCcee---------ecHHHHHHHHHHHhHHHHHHHHHHHHHhcCCCCCCCCcchhhhhhhhhhhccc Confidence 555 344444333321 2344578899999999999998653 33 48899999864322 Q ss_pred c---ccccceeeeeeeeeeccccccccceeecCCCCCcceeeecccCccC-------------------C-CCCchhHHH Q lcl|NC_011356. 66 C---SRDGGMESGVHIRGVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVN-------------------M-PPHPFVRPA 122 (148) Q Consensus 66 ~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~-------------------~-~a~PFl~pA 122 (148) . +.+|-+ ..| ... .+.|.... .-...||-.|+.||-.. . |-.-||.-+ T Consensus 72 ras~~rpG~m-VkI--aPN-qk~G~g~r-----~i~g~fYPafL~YGVr~gakr~k~hhr~a~ggsgwriaPR~Nym~~~ 142 (170) T protein:vir:44 72 RASKKRPGLM-VKI--APN-QKNGEGNR-----HINGAFYPAFLFYGVRRGAKRKKGHHRGASGGSGWRVEPRNNYMTEV 142 (170) T ss_pred cccCCCCcee-EEe--cCC-CCCCCCcc-----ccccccchhhhhhhhhcccccchhhcccccCCCcceeccchhHHHHH Confidence 1 222211 111 110 01111111 11235899999998421 1 335699999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011356. 123 FDVRSEQAAQVAIARMNRAIDEVLRR 148 (148) Q Consensus 123 ~~~~~~~~~~~~~~~~~~~i~k~~kk 148 (148) ++..+..+...+..+|++.|+-.-.| T Consensus 143 l~~~~~wt~~~L~r~L~~sLrp~~r~ 168 (170) T protein:vir:44 143 LDKRRSWTRYVLSRELRKSLRPQRRK 168 (170) T ss_pred HHhhHHHHHHHHHHHHHHhcCccccc Confidence 99999998888888888777655444 No 171 >protein:vir:487 Length: 187 # NCBI annotation: hypothetical protein # Family: family:all:2152 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543094;swissprot:trembl:q8w625;genbank:gi:18249906;uniprot:Q8W625;genbank:GeneID:929690 Probab=84.48 E-value=0.016 Score=30.39 Aligned_cols=132 Identities=20% Similarity=0.301 Sum_probs=75.8 Q ss_pred Ccc---ceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhC-----------CC-Ccchhhhhceeccc Q lcl|NC_011356. 1 MIE---TLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRA-----------PV-RRGKLRRNVVVLSR 65 (148) Q Consensus 1 Mm~---~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~a-----------P~-~~g~l~~~i~~~~~ 65 (148) |.+ +-|+++-.++ +. ..+.-++.|..+.+++.-.+|+.++ |. .||.|..||..-.. T Consensus 14 m~~~~~lHvdF~qp~~-------~~--Fnr~riRraF~~iGq~h~r~ArrLvm~RGrs~pge~P~~qTGrLa~SIgy~Vp 84 (187) T protein:vir:48 14 MNQTAFLHVDFKQPKE-------LE--FNRARLRRAFVQIGRVYMRDARRLVIKRGRSGPGENPGYQTGRLARSIGYYVP 84 (187) T ss_pred hhhccceeEeeecCCc-------ee--ecHHHHHHHHHHHhHHHHHHHHHHHHhcccCCCCCCCcchhhhhhhhhhhccc Confidence 433 2233322222 21 2344578899999999999998764 33 48889988864322 Q ss_pred ---cccccceeeeeeeeeeccccccccceeecCCCCCcceeeecccCccC---------------------C-CCCchhH Q lcl|NC_011356. 66 ---CSRDGGMESGVHIRGVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVN---------------------M-PPHPFVR 120 (148) Q Consensus 66 ---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~---------------------~-~a~PFl~ 120 (148) .+++|-+ ..|. . ..+.|..... ..-...||-.|+.||-.. . |-.-||. T Consensus 85 kat~~RpG~m-VkIa--P-Nqk~G~g~r~---~Pi~gdfYPafL~YGVr~ga~~~~~~~k~~~~~~~sgwriaPR~Nym~ 157 (187) T protein:vir:48 85 KKTTRRPGLM-VKIS--P-NQKNGQGNRR---FPEGAPYYPAFLYYGVRHSAYGMDKKDKRQKKHHSSTFRLAPRNNFMA 157 (187) T ss_pred cccCCCCcce-EEec--C-CcccCccccc---ccccccchhHHHHhhhhhhhhccchhhhhhhcccCCcceeccchhHHH Confidence 1222221 1111 0 0111111110 011235899999998421 1 3346999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011356. 121 PAFDVRSEQAAQVAIARMNRAIDEVLRR 148 (148) Q Consensus 121 pA~~~~~~~~~~~~~~~~~~~i~k~~kk 148 (148) -+++..+..+...+..+|++.|+-.-.+ T Consensus 158 ~~L~~~~~wt~~~L~raL~~sLrp~~r~ 185 (187) T protein:vir:48 158 DVIERRRHWTQELLSRELQRSLRPVKRK 185 (187) T ss_pred HHHHhhHHHHHHHHHHHHHHhcCccccc Confidence 9999999999888888888877655555 No 172 >protein:vir:4200 Length: 133 # NCBI annotation: unknown # Family: family:all:11764 # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071825;genbank:gi:11863108;genbank:GeneID:1257610 Probab=77.55 E-value=0.032 Score=28.81 Aligned_cols=127 Identities=17% Similarity=0.247 Sum_probs=60.5 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeee-ee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVH-IR 79 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~-~~ 79 (148) |. .+.|+--|.|+.+-....+.+ .+.+..-...++..+..-||+.||+||+|-.++...+ .|.+...+. .. T Consensus 1 mi--~i~idkp~almek~~ev~~~i-----e~t~~~~~~~l~~i~~ntapiktg~lr~sh~~siegs-tgelsn~~~yl~ 72 (133) T protein:vir:42 1 MI--EIRIDKPDALMEKPHEVQGKI-----EETLEKILNQLQGIAENTAPVKTGNLRDSHIISIEGS-TGELSNLAYYLP 72 (133) T ss_pred Ce--eeecCCchhhhcchhhhhhHH-----HHHHHHHHHHHHHHhhhccccccccceeeeeEEeecC-ccchhhhhHHhh Confidence 43 555566688887766554443 3445555566777777889999999998866543322 122211111 00 Q ss_pred eeccccccccceeecCCCCCcceeeecccC---ccCCCCCchhHHH--HHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 80 GVNPDTGNSDNTMKADNPRNAFYWRFVEMG---TVNMPPHPFVRPA--FDVRSEQAAQVAIARMNR 140 (148) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~G---T~~~~a~PFl~pA--~~~~~~~~~~~~~~~~~~ 140 (148) .+-.+.| ++ ......+.||-=+-.- .+..||..||.-+ |...+.-+.+.+.+-++. T Consensus 73 ~vl~grg----wv-fpv~~kal~wpelphpvayarpappndyfsa~vay~~~~give~s~iewlre 133 (133) T protein:vir:42 73 FVLHGRG----WV-FPVRRKALWWPELPHPVAYARPAPPNDYFSAVVAYSAPEGVVEETLIEWLRE 133 (133) T ss_pred Hhhhccc----ce-eeccccccccCCCCCcccccCCCCCchhhhhhhhhhcccchhHHHHHHHHhC Confidence 1111111 00 0011222333211111 1234666677654 344444444444444444 No 173 >protein:vir:4162 Length: 133 # NCBI annotation: unknown # Family: family:all:11764 # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046971;genbank:gi:9630541;genbank:GeneID:1261715 Probab=64.77 E-value=0.1 Score=25.97 Aligned_cols=127 Identities=17% Similarity=0.202 Sum_probs=58.7 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeee-ee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVH-IR 79 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~-~~ 79 (148) |. .+.|+--|.|+.+-.+..+.+ .+.+..-...++.-+..-||+.||+||+|-.++...+ .|.+...+. .. T Consensus 1 mi--~i~idkp~almek~~ev~~~i-----e~t~~~~~~~l~~i~~ntapiktg~lr~sh~~siegs-tgelsn~~~yl~ 72 (133) T protein:vir:41 1 MI--RINIDKPEALMEKASEVEDRV-----EQTVTLLMIELEEILMNTAPIKTGELRISHTWSVEGS-TGELTNTVPYLQ 72 (133) T ss_pred Ce--eeecCCchhhhcchhhhhhHH-----HHHHHHHHHHHHHHhhhccccccccceeeeeEEeecC-ccchhhhhHHhh Confidence 43 555566688887766554443 3445555566777777889999999998866543322 122211111 00 Q ss_pred eeccccccccceeecCCCCCcceeeecccC---ccCCCCCchhHHHH--HHHHHHHHHHHHHHHHH Q lcl|NC_011356. 80 GVNPDTGNSDNTMKADNPRNAFYWRFVEMG---TVNMPPHPFVRPAF--DVRSEQAAQVAIARMNR 140 (148) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~G---T~~~~a~PFl~pA~--~~~~~~~~~~~~~~~~~ 140 (148) .+-.+.| ++ ......+.||-=+-.- .+..||..||.-+. ...+.-+.+.+.+-+-. T Consensus 73 ~vl~grg----wv-fpv~~kal~wpelphpvayarpappndyfsa~vay~~~~give~s~iewlis 133 (133) T protein:vir:41 73 WVLFGRG----WV-FPVEKKALYWPELPHPVAYARPAPPNDYFSAAVAYIDAKGIVEDSFIEWLIS 133 (133) T ss_pred Hhhhccc----ce-eeecccccccCCCCCcccccCCCCCchhhhhhhhhhcccchhHHHHHHHhcC Confidence 1111111 00 0011222333211111 12345666776543 33344344443333322 No 174 >protein:vir:97088 Length: 157 # NCBI annotation: hypothetical protein # Family: family:all:2714 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453568;genbank:gi:84662603;genbank:GeneID:5142503 Probab=64.04 E-value=0.31 Score=23.40 Aligned_cols=130 Identities=11% Similarity=-0.001 Sum_probs=80.7 Q ss_pred eeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeeeecccc Q lcl|NC_011356. 6 LDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRGVNPDT 85 (148) Q Consensus 6 ~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 85 (148) |+++=...=+..|.+.-+++. +..+++++.++..-....+..|- .. -...+|.+..+|.+....... T Consensus 1 m~~~~~~~d~s~l~~~l~~l~-~~~~~v~R~A~~~ga~vv~dear-----------~~-aP~~tG~LkksI~~~~~~~~s 67 (157) T protein:vir:97 1 MKFSIRSVDITGILAGLETVV-EHSSDVVRTMTYESAVAVRESAK-----------AF-VNDETGKLRNNLYVAYSPEES 67 (157) T ss_pred CeeEeecccHHHHHHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHH-----------Hh-CCCCcchhhhheeeeeccccC Confidence 888742322333433334554 45566667666666655555442 11 134678899999887766554 Q ss_pred cccccee-ecCCCCCcceeeecccCccCCCC------C-----------c-h--hHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 86 GNSDNTM-KADNPRNAFYWRFVEMGTVNMPP------H-----------P-F--VRPAFDVRSEQAAQVAIARMNRAIDE 144 (148) Q Consensus 86 ~~~~~~~-~~~~~~~~~y~~~~E~GT~~~~a------~-----------P-F--l~pA~~~~~~~~~~~~~~~~~~~i~k 144 (148) +...... ++.+...++||||+|||+..-.. . + + =+|=+.-.-+...+.+.+.+.++|.+ T Consensus 68 ~~g~~~~~Vg~~~~~a~~g~~vEfG~~~~~~~~~~~~~~~~~~~~~~~t~~~~Pa~PFlRPA~d~~k~~a~~~~~~~l~k 147 (157) T protein:vir:97 68 VEGIQTYAVSWRKKAAPHGHLLEFGHWQTHAAYRDKDGQWYSSKVKLVNPKWIPAKPFLRPGYDSVAMQIPDIARAAGAK 147 (157) T ss_pred CCceEEEEEeecCCccceeeeeecCcccccccccCCcccccccccccCCCCcCCCCcccchHHHHhHHHHHHHHHHHHHH Confidence 4333332 45556788999999999754211 1 1 1 35667777777888888888888888 Q ss_pred HhcC Q lcl|NC_011356. 145 VLRR 148 (148) Q Consensus 145 ~~kk 148 (148) .+.. T Consensus 148 ~I~e 151 (157) T protein:vir:97 148 KYAE 151 (157) T ss_pred HHHH Confidence 8877 No 175 >protein:vir:4514 Length: 168 # NCBI annotation: unknown # Family: family:all:2152 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599040;genbank:gi:19548998;genbank:GeneID:935228 Probab=57.92 E-value=0.42 Score=22.63 Aligned_cols=138 Identities=16% Similarity=0.230 Sum_probs=69.2 Q ss_pred Ccc--ceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhC---CC-Ccchhhhhceecccc---ccccc Q lcl|NC_011356. 1 MIE--TLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRA---PV-RRGKLRRNVVVLSRC---SRDGG 71 (148) Q Consensus 1 Mm~--~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~a---P~-~~g~l~~~i~~~~~~---~~~~~ 71 (148) |-. +-|+++-.+++.=+=..|-... -++-..=++.+...|-.-++... |. .||.|..||..-... +.+|- T Consensus 1 m~~~~lHvdF~qp~~~~Fnr~riRraF-v~igq~hmr~ArrlV~rrgrs~pGe~P~~qTGrLa~SIgy~Vpras~~rpG~ 79 (168) T protein:vir:45 1 MTTSFLHVDFQQPAEMRFNRARVRRAF-VTIGQRHMRDARRLVMRHARSAPGENPGYQTGRLARSIGYMVPRASKHRPGF 79 (168) T ss_pred CCccceeeeeecCCceeecHHHHHHHH-HHHhHHHHHHHHHHHhhcccccCCCCCcchhhhhhhhhhhccccccCCCCce Confidence 433 3444444444322211111111 11222224555555555554321 32 589999998643222 22222 Q ss_pred eeeeeeeeeeccccccccceeecCCCCCcceeeecccCccC--------------------CCCCchhHHHHHHHHHHHH Q lcl|NC_011356. 72 MESGVHIRGVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVN--------------------MPPHPFVRPAFDVRSEQAA 131 (148) Q Consensus 72 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~--------------------~~a~PFl~pA~~~~~~~~~ 131 (148) + ..| ... ...|..... -...||-.|+.||-.. -|-.-||.-+++..+.... T Consensus 80 m-vkI--aPN-qk~G~g~r~-----i~gdfYPafL~YGVr~gakr~r~h~rga~ggsgwriaPR~Nym~~~l~~~~~wt~ 150 (168) T protein:vir:45 80 M-ARI--APN-QRNGEGNRR-----ITGDFYPAFLFYGVRGGAKRRRSHHRGASGGSGWRLAPRNNFMVETLEKNRSWTR 150 (168) T ss_pred E-EEe--cCC-CCCCCCCCc-----cccccchhhhhhhhhcchhhhhhhhccccCCCcceeccchhhHHHHHHhhHHHHH Confidence 1 111 111 011111111 1245899999998421 1445699999999999988 Q ss_pred HHHHHHHHHHHHHHhcC Q lcl|NC_011356. 132 QVAIARMNRAIDEVLRR 148 (148) Q Consensus 132 ~~~~~~~~~~i~k~~kk 148 (148) ..+..+|++.|+-.-.+ T Consensus 151 ~~L~r~L~~sLrp~rr~ 167 (168) T protein:vir:45 151 YFLARELRKSLKPERRR 167 (168) T ss_pred HHHHHHHHHhcCccccc Confidence 88888877777655444 No 176 >protein:vir:101654 Length: 126 # NCBI annotation: gp17 # Family: family:all:11115 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654772;genbank:gi:109302770;genbank:GeneID:4156088 Probab=57.08 E-value=0.024 Score=29.46 Aligned_cols=117 Identities=13% Similarity=0.231 Sum_probs=51.0 Q ss_pred ehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHH---hCCCC---cchhhhhceeccccccccceeeeeeeeee Q lcl|NC_011356. 8 FSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVS---RAPVR---RGKLRRNVVVLSRCSRDGGMESGVHIRGV 81 (148) Q Consensus 8 i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~---~aP~~---~g~l~~~i~~~~~~~~~~~~~~~~~~~~~ 81 (148) ++||-.|+.+-+-...-....++-..|-+-++.+++.-.. ..|.- +-.|+.. -...+|.....+.+..+ T Consensus 1 mkglgnliskadiaaaiatspavhagliakakevqeywveywnsiphphsrthtlksg-----yvenpgdyaksirvsfi 75 (126) T protein:vir:10 1 MKGLGNLISKADIAAAIATSPAVHAGLIAKAKEVQEYWVEYWNSIPHPHSRTHTLKSG-----YVENPGDYAKSIRVSFI 75 (126) T ss_pred CcchhhhhhhhhhhhhhhcccchhhhhhhhhHHHHHHHHHHhhcCCCccccccccccc-----cccCchhhhhhhheeee Confidence 6788888876553321112234455555666666654433 22321 1111111 11123334444444444 Q ss_pred ccccccccceeecCCCCCcceeeecccCccCCCC---CchhHHHHHHHHHHHHHH Q lcl|NC_011356. 82 NPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPP---HPFVRPAFDVRSEQAAQV 133 (148) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a---~PFl~pA~~~~~~~~~~~ 133 (148) +.+.|-....+ -...|-.+|+|||..+||. +-.--.-|+-.-...+.+ T Consensus 76 ksksglpkarv----matdykswwieygakhmpefaprahtlahfegggattvsa 126 (126) T protein:vir:10 76 KSKSGLPKARV----MATDYKSWWIEYGAKHMPEFAPRAHTLAHFEGGGATTVSA 126 (126) T ss_pred ecccCCcccce----ehhhhhHHHHhhhhhhcccccccchhhhhccCCccccccC Confidence 44433332222 2233667889999999864 211111122222211111 No 177 >protein:vir:7859 Length: 126 # NCBI annotation: gp16 # Family: family:all:11115 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817466;genbank:gi:29565895;genbank:GeneID:1259088 Probab=57.08 E-value=0.024 Score=29.46 Aligned_cols=117 Identities=13% Similarity=0.231 Sum_probs=51.0 Q ss_pred ehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHH---hCCCC---cchhhhhceeccccccccceeeeeeeeee Q lcl|NC_011356. 8 FSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVS---RAPVR---RGKLRRNVVVLSRCSRDGGMESGVHIRGV 81 (148) Q Consensus 8 i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~---~aP~~---~g~l~~~i~~~~~~~~~~~~~~~~~~~~~ 81 (148) ++||-.|+.+-+-...-....++-..|-+-++.+++.-.. ..|.- +-.|+.. -...+|.....+.+..+ T Consensus 1 mkglgnliskadiaaaiatspavhagliakakevqeywveywnsiphphsrthtlksg-----yvenpgdyaksirvsfi 75 (126) T protein:vir:78 1 MKGLGNLISKADIAAAIATSPAVHAGLIAKAKEVQEYWVEYWNSIPHPHSRTHTLKSG-----YVENPGDYAKSIRVSFI 75 (126) T ss_pred CcchhhhhhhhhhhhhhhcccchhhhhhhhhHHHHHHHHHHhhcCCCccccccccccc-----cccCchhhhhhhheeee Confidence 6788888876553321112234455555666666654433 22321 1111111 11123334444444444 Q ss_pred ccccccccceeecCCCCCcceeeecccCccCCCC---CchhHHHHHHHHHHHHHH Q lcl|NC_011356. 82 NPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPP---HPFVRPAFDVRSEQAAQV 133 (148) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a---~PFl~pA~~~~~~~~~~~ 133 (148) +.+.|-....+ -...|-.+|+|||..+||. +-.--.-|+-.-...+.+ T Consensus 76 ksksglpkarv----matdykswwieygakhmpefaprahtlahfegggattvsa 126 (126) T protein:vir:78 76 KSKSGLPKARV----MATDYKSWWIEYGAKHMPEFAPRAHTLAHFEGGGATTVSA 126 (126) T ss_pred ecccCCcccce----ehhhhhHHHHhhhhhhcccccccchhhhhccCCccccccC Confidence 44433332222 2233667889999999864 211111122222211111 No 178 >protein:vir:79034 Length: 141 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110729;genbank:gi:134287346;genbank:GeneID:4955208 Probab=50.37 E-value=0.61 Score=21.75 Aligned_cols=128 Identities=9% Similarity=-0.015 Sum_probs=62.3 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-----CCcchhhhhceeccccccccceeee Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAP-----VRRGKLRRNVVVLSRCSRDGGMESG 75 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP-----~~~g~l~~~i~~~~~~~~~~~~~~~ 75 (148) +..+.=-..-|.++.. ..++.. .++++++........|.+.+--..- -..|...+.+.+.. ..+.+... T Consensus 9 ~~gl~~~~~~l~~~~~--~~~~~~-~~~~~~~~a~~l~~~vk~~tPVdTG~Lr~sw~~~~~~~~~~~~~---~g~~~~v~ 82 (141) T protein:vir:79 9 FREFKRVCKKMEKLTK--IDLDKF-CKDAARELAARLLGKVIRRTPVDTGFLRQGWNGVAYARSLPVYK---QGNNYIIE 82 (141) T ss_pred HHHHHHHHHHHHHHhH--HHHHHH-HHHHHHHHHHHHHHHHHHhCCCcchhhcccccccccccccceee---cCCeeEEE Confidence 3322111122222211 123222 2344444444444444444321111 11111111111111 11111111 Q ss_pred eeeeeeccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011356. 76 VHIRGVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEVLRR 148 (148) Q Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~k~~kk 148 (148) + . . ...+-+...|+|..--|....|++++|..+.+..+.++.+.+.+.|++.|++.+.- T Consensus 83 v--~-----n-------~~~YA~~VE~Ghr~~~~~gfV~G~fml~~s~~~~~~~~~~~~~~~l~~~l~~~~~~ 141 (141) T protein:vir:79 83 V--V-----N-------PTEYASYVNFGHRTKDGKGWVKGQHFLTISEMELQSQVDKIIEKKLLILLKGVFDA 141 (141) T ss_pred E--e-----c-------CCcchhhhhcceeecCCcceeCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 1 0 0 01112233456666666678899999999999999999999999999999999999 No 179 >protein:vir:78163 Length: 92 # NCBI annotation: hypothetical protein # Family: family:all:29889 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294805;genbank:gi:149882826;genbank:GeneID:5309191 Probab=47.75 E-value=0.11 Score=25.90 Aligned_cols=91 Identities=19% Similarity=0.163 Sum_probs=39.9 Q ss_pred Ccc-ceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeee Q lcl|NC_011356. 1 MIE-TLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIR 79 (148) Q Consensus 1 Mm~-~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~ 79 (148) |.+ ++-.-.-+|.+++. . -++.-..-+|+.-...||.+||+++|.+|+.+.+....+.......-+ T Consensus 1 madaftpNp~~FDqIl~s-------~---~VrALt~gaAe~aLa~AKAsAPVDTGAYRDGL~iE~~q~~~RtT~MVV--- 67 (92) T protein:vir:78 1 MADAFTPNPTWFDQIMRT-------P---KVRALVDGVAEETLADAKASAPVDTGAYRDGLHIEHRQGRSRETAMVV--- 67 (92) T ss_pred CCCccCCChhHHHHhhcc-------c---chhhhhhhhhhhhhhhhcccCcccccccccccchhhhhccccceeEEe--- Confidence 444 33333333333221 1 112233456677778899999999999999987765444333222111 Q ss_pred eeccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHH Q lcl|NC_011356. 80 GVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSE 128 (148) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~ 128 (148) .... ---.+|--|- -|..|+...+. T Consensus 68 G~D~------------------KTlLvESrTG------NLakalk~~rs 92 (92) T protein:vir:78 68 GSDE------------------KTLLIESRTG------NLARSVKRRRS 92 (92) T ss_pred ecCc------------------ceeeeecccc------hHHHHHhhhcC Confidence 1000 0011111110 01222221111 No 180 >protein:vir:102875 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338140;genbank:gi:77020200;genbank:GeneID:3703784 Probab=41.06 E-value=0.94 Score=20.72 Aligned_cols=130 Identities=9% Similarity=-0.000 Sum_probs=64.4 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhC----------CCCcchhhhhceecccccccc Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRA----------PVRRGKLRRNVVVLSRCSRDG 70 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~a----------P~~~g~l~~~i~~~~~~~~~~ 70 (148) =++++= ++-|..-+..|..-.+++.+++++.+.....+.++..+-... -..+++++++|.+.......+ T Consensus 6 ~~~i~G-l~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~g 84 (146) T protein:vir:10 6 DLDLLG-FDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAKLEGG 84 (146) T ss_pred eeeehh-HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceecccccccc Confidence 022211 112333333444444556677778888888888877774321 124567788887654444333 Q ss_pred ceeeeeeeeeeccccccccce-eecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 71 GMESGVHIRGVNPDTGNSDNT-MKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAI 142 (148) Q Consensus 71 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i 142 (148) .....+.. .....+....+ ..-++.....=-+|+ +|=+.-.-+...+.+.+.+.++|+++| T Consensus 85 ~~~~~vg~--~~~~~~~~~y~~f~E~GT~~~~a~PFl---------~pa~~~~k~~~~~~~~~~l~~~l~ka~ 146 (146) T protein:vir:10 85 IKTVKIGL--NKADRSPWFYLKFHEWGTSKMPAHPFI---------EPGFNASKAEAVRAMTDILKNEMRLDL 146 (146) T ss_pred ceeEEeee--ccCCCCCcceeeeeccCCCCCCCCcch---------hHHHHHhHHHHHHHHHHHHHHHHhhcC Confidence 32222211 11111111101 011111110111222 567777777888888888999999999 No 181 >protein:vir:105007 Length: 146 # NCBI annotation: conserved phage protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459972;genbank:gi:85701387;genbank:GeneID:3882148 Probab=41.06 E-value=0.94 Score=20.72 Aligned_cols=130 Identities=9% Similarity=-0.000 Sum_probs=64.4 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhC----------CCCcchhhhhceecccccccc Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRA----------PVRRGKLRRNVVVLSRCSRDG 70 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~a----------P~~~g~l~~~i~~~~~~~~~~ 70 (148) =++++= ++-|..-+..|..-.+++.+++++.+.....+.++..+-... -..+++++++|.+.......+ T Consensus 6 ~~~i~G-l~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~g 84 (146) T protein:vir:10 6 DLDLLG-FDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAKLEGG 84 (146) T ss_pred eeeehh-HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceecccccccc Confidence 022211 112333333444444556677778888888888877774321 124567788887654444333 Q ss_pred ceeeeeeeeeeccccccccce-eecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 71 GMESGVHIRGVNPDTGNSDNT-MKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAI 142 (148) Q Consensus 71 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i 142 (148) .....+.. .....+....+ ..-++.....=-+|+ +|=+.-.-+...+.+.+.+.++|+++| T Consensus 85 ~~~~~vg~--~~~~~~~~~y~~f~E~GT~~~~a~PFl---------~pa~~~~k~~~~~~~~~~l~~~l~ka~ 146 (146) T protein:vir:10 85 IKTVKIGL--NKADRSPWFYLKFHEWGTSKMPAHPFI---------EPGFNASKAEAVRAMTDILKNEMRLDL 146 (146) T ss_pred ceeEEeee--ccCCCCCcceeeeeccCCCCCCCCcch---------hHHHHHhHHHHHHHHHHHHHHHHhhcC Confidence 32222211 11111111101 011111110111222 567777777888888888999999999 No 182 >protein:vir:107568 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338191;genbank:gi:77020147;genbank:GeneID:3703699 Probab=41.06 E-value=0.94 Score=20.72 Aligned_cols=130 Identities=9% Similarity=-0.000 Sum_probs=64.4 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhC----------CCCcchhhhhceecccccccc Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRA----------PVRRGKLRRNVVVLSRCSRDG 70 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~a----------P~~~g~l~~~i~~~~~~~~~~ 70 (148) =++++= ++-|..-+..|..-.+++.+++++.+.....+.++..+-... -..+++++++|.+.......+ T Consensus 6 ~~~i~G-l~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~g 84 (146) T protein:vir:10 6 DLDLLG-FDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAKLEGG 84 (146) T ss_pred eeeehh-HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceecccccccc Confidence 022211 112333333444444556677778888888888877774321 124567788887654444333 Q ss_pred ceeeeeeeeeeccccccccce-eecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 71 GMESGVHIRGVNPDTGNSDNT-MKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAI 142 (148) Q Consensus 71 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i 142 (148) .....+.. .....+....+ ..-++.....=-+|+ +|=+.-.-+...+.+.+.+.++|+++| T Consensus 85 ~~~~~vg~--~~~~~~~~~y~~f~E~GT~~~~a~PFl---------~pa~~~~k~~~~~~~~~~l~~~l~ka~ 146 (146) T protein:vir:10 85 IKTVKIGL--NKADRSPWFYLKFHEWGTSKMPAHPFI---------EPGFNASKAEAVRAMTDILKNEMRLDL 146 (146) T ss_pred ceeEEeee--ccCCCCCcceeeeeccCCCCCCCCcch---------hHHHHHhHHHHHHHHHHHHHHHHhhcC Confidence 32222211 11111111101 011111110111222 567777777888888888999999999 No 183 >protein:vir:102085 Length: 146 # NCBI annotation: head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512318;genbank:gi:89152487;genbank:GeneID:3953078 Probab=41.06 E-value=0.94 Score=20.72 Aligned_cols=130 Identities=9% Similarity=-0.000 Sum_probs=64.4 Q ss_pred CccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhC----------CCCcchhhhhceecccccccc Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRA----------PVRRGKLRRNVVVLSRCSRDG 70 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~a----------P~~~g~l~~~i~~~~~~~~~~ 70 (148) =++++= ++-|..-+..|..-.+++.+++++.+.....+.++..+-... -..+++++++|.+.......+ T Consensus 6 ~~~i~G-l~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~g 84 (146) T protein:vir:10 6 DLDLLG-FDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAKLEGG 84 (146) T ss_pred eeeehh-HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceecccccccc Confidence 022211 112333333444444556677778888888888877774321 124567788887654444333 Q ss_pred ceeeeeeeeeeccccccccce-eecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 71 GMESGVHIRGVNPDTGNSDNT-MKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAI 142 (148) Q Consensus 71 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i 142 (148) .....+.. .....+....+ ..-++.....=-+|+ +|=+.-.-+...+.+.+.+.++|+++| T Consensus 85 ~~~~~vg~--~~~~~~~~~y~~f~E~GT~~~~a~PFl---------~pa~~~~k~~~~~~~~~~l~~~l~ka~ 146 (146) T protein:vir:10 85 IKTVKIGL--NKADRSPWFYLKFHEWGTSKMPAHPFI---------EPGFNASKAEAVRAMTDILKNEMRLDL 146 (146) T ss_pred ceeEEeee--ccCCCCCcceeeeeccCCCCCCCCcch---------hHHHHHhHHHHHHHHHHHHHHHHhhcC Confidence 32222211 11111111101 011111110111222 567777777888888888999999999 No 184 >protein:vir:78894 Length: 105 # NCBI annotation: gp10 # Family: family:all:29989 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468850;genbank:gi:157325424;genbank:GeneID:5601891 Probab=38.83 E-value=0.17 Score=24.83 Aligned_cols=101 Identities=16% Similarity=0.075 Sum_probs=48.3 Q ss_pred ccc-eeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHH---HHHHHHHHhCCCCcchhhhhceeccccccccceeeeee Q lcl|NC_011356. 2 IET-LLDFSGLEDISRDLQLLSGAENNRVLREATRAGAN---VLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVH 77 (148) Q Consensus 2 m~~-~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~---~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~ 77 (148) |++ +|+.. ..+.| .+.+|..+++ .+......-+|.++|.|++|-...+... +| T Consensus 1 ~~f~~f~~~----~~k~l-----------~kr~L~~~g~vq~EvlR~~~PyvP~~tG~Lk~S~~l~tvIg-sg------- 57 (105) T protein:vir:78 1 MSFSSFKDA----VIDDI-----------HNKALSTAAKAGGELVELAQPVTPILYGDLRRSSYFKIIIQ-KN------- 57 (105) T ss_pred CCcccccch----HHHHH-----------HHhcCCCCchhhHHHHHHhCCCCcccccccccccccceeec-CC------- Confidence 443 44322 22222 2222222221 3445566678899999998733211100 00 Q ss_pred eeeeccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 78 IRGVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNR 140 (148) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~ 140 (148) .+......-++|++.+=|... ...-|+..+...+++.+.+.+..-++- T Consensus 58 -------------~I~y~~~~~aPYAr~qYYe~~--Rg~~WfErm~a~hk~~I~~~vegg~~~ 105 (105) T protein:vir:78 58 -------------SIVARVFSLTPYARRQYYENR--RNPRWYEMAVSYGIQSINQIVEGGMRL 105 (105) T ss_pred -------------eeEeeccccCchhhhhhhccc--CCCchhHHhhhcchhHHHHHHhcccCC Confidence 111111223567777777543 333488888888877654444322221 No 185 >protein:vir:1386 Length: 149 # NCBI annotation: Gp9 protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612838;genbank:gi:20065972;genbank:GeneID:935787 Probab=33.63 E-value=1.3 Score=19.88 Aligned_cols=129 Identities=9% Similarity=-0.008 Sum_probs=60.7 Q ss_pred CccceeeehhHHHHHHHH--HHhHHHHHHHHHHHHHHHHHHHHHHHHHHh-CC--------CCcchhhhhceeccccccc Q lcl|NC_011356. 1 MIETLLDFSGLEDISRDL--QLLSGAENNRVLREATRAGANVLKEEVVSR-AP--------VRRGKLRRNVVVLSRCSRD 69 (148) Q Consensus 1 Mm~~~~~i~Gl~el~~~l--~~l~~~~~~~~~~~al~~~a~~i~~~ak~~-aP--------~~~g~l~~~i~~~~~~~~~ 69 (148) =++++ =|+-|..-+..| +...+++.+++++.+..-..+.++..+-.. -| ..+++++++|.++...... T Consensus 6 ~~~i~-Gl~eL~~~l~~L~~~~~~~k~~~~Al~~ga~~v~~~~k~~aP~~~~~~~~~~~~~~~~~~~~d~i~~~~~~~~~ 84 (149) T protein:vir:13 6 EIKFE-GLDDLIKTFEQLGTEKENEDVEKSILKECGDLAKKTVAPLIHISDDNSKSGRKGSRPPGHAANNIPEPKIRKKK 84 (149) T ss_pred EEEee-cHHHHHHHHHhcccHHHHHHHHHHHHHHHHHHHHHHHHHhCCccCCccccccccccccchhhhcceeccccccc Confidence 11211 012233333344 223334455667777777777776666432 12 2356888888876544433 Q ss_pred cceeeeeeeeeeccccccc-ccee---ecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011356. 70 GGMESGVHIRGVNPDTGNS-DNTM---KADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDE 144 (148) Q Consensus 70 ~~~~~~~~~~~~~~~~~~~-~~~~---~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~k 144 (148) +..... ++...+.. .... .-.+.+...--+|+ +|=+.-+-++..+.+.+.+.+.|+++|-. T Consensus 85 g~~~~~-----VG~~~~~~~~~~y~~f~E~GT~k~~a~pF~---------~pa~~~~~~~~~~~~~~~l~k~i~~~lG~ 149 (149) T protein:vir:13 85 GNLQCV-----VGWEKSDNTPFYYMKMEEWGTSERPPHHAF---------GKTNKILKRVYDNIAQKKYDNFVKEKLGD 149 (149) T ss_pred ceeEEE-----eeccCCCCCccceeeeeccCccCCCCCccc---------hHHHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 333222 22221111 1111 11111111111332 35555566666777777777777777777 No 186 >protein:vir:5745 Length: 135 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892056;genbank:gi:33770519;interpro:IPR010064;interpro:IPR011693;uniprot:Q7Y404;genbank:GeneID:2637451 Probab=32.12 E-value=1.4 Score=19.70 Aligned_cols=129 Identities=12% Similarity=0.063 Sum_probs=69.2 Q ss_pred eeehhHHHHHHHHHHhHHHH---HHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeeeec Q lcl|NC_011356. 6 LDFSGLEDISRDLQLLSGAE---NNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRGVN 82 (148) Q Consensus 6 ~~i~Gl~el~~~l~~l~~~~---~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~~~ 82 (148) |.++ ++ +.-|++|...+ ..++.+++++.+...-.+.++..+ +.+..+. ....+|++..+|.+.... T Consensus 1 M~~~-~~--i~Gl~el~~~l~~L~~~~~~k~~~~Al~~~a~~v~~~~-------k~~ap~~-~~~~~g~l~~~I~i~~~k 69 (135) T protein:vir:57 1 MIPE-IE--ISGLQELERRLIAVGEEVGTKILRDAGRAAMAVVEADM-------KQNAGYD-NSSTNAHMRDSIKIRSSR 69 (135) T ss_pred Ccee-ee--ehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHH-------HHhCCCC-CCCchhhHHhhccccccc Confidence 4443 11 22334443332 223333444444444444444433 3333222 123346677776654433 Q ss_pred cccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011356. 83 PDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEVLRR 148 (148) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~k~~kk 148 (148) ...+ ...+....+....|.++..|--.....+| =+|=+.-..++..+.+.+.+..+|++.|+| T Consensus 70 ~~~~--~~~v~v~vg~~~~~~~~~~f~E~GT~~~~-a~PF~~pa~~~~~~~~~~~~~~~~~~~l~k 132 (135) T protein:vir:57 70 GKAG--STVVVLRVGPTRSHYMKALAQEFGTIKQV-AKPFIRPALDYNKMQVLRILTVEIRDGLST 132 (135) T ss_pred cccc--ceeEEEEecCCCCcceeEeecccCCCCCC-CCcchhHhHHHhHHHHHHHHHHHHHHHHHH Confidence 2221 12233444445555555555555555555 468899999999999999999999999999 No 187 >protein:vir:98342 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918934;genbank:gi:119443696;genbank:GeneID:4594504 Probab=24.34 E-value=2.2 Score=18.72 Aligned_cols=124 Identities=6% Similarity=-0.063 Sum_probs=55.4 Q ss_pred eeehh-HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeeeeccc Q lcl|NC_011356. 6 LDFSG-LEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRGVNPD 84 (148) Q Consensus 6 ~~i~G-l~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 84 (148) |.++. +++ |++.-+++.... .+..+++ .+..+.+-.-.|+.+...+ ...+++.+.|.+...... T Consensus 1 M~v~v~~~~----L~~~l~~l~~~~-~k~~~~A-------l~aga~~~~e~l~~~aP~~---~~~~hl~d~I~vs~~k~~ 65 (125) T protein:vir:98 1 MGARIESNN----IEQGLKNAVLKM-NLNSNVI-------VKAGAMSLVPLLKSNTPFA---NTKKHARDHIAVSNVKTD 65 (125) T ss_pred CeeEeeHHH----HHHHHHHHHHHH-HHHHHHH-------HHHHHHHHHHHHHHhCCCC---CCCchhhhheeecccccc Confidence 88886 443 333223333222 2222222 2222222223344443322 223456666655332211 Q ss_pred cccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_011356. 85 TGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEVLR 147 (148) Q Consensus 85 ~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~k~~k 147 (148) .+ ........+.+-...++..|-...-.-+| =.|=++-..++..+.+.+.+..+|+++.| T Consensus 66 ~~--~g~~~v~VG~~k~~~~~a~F~E~GT~k~~-a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:98 66 RH--TSEKIVTIGYAKGVSHRIHATEFGTMYQK-PQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred cc--cceEEEEeccCCCCceEEEeccCCccCCC-CCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 11 11111112211222222222222223332 24778888888888888888888888888 No 188 >protein:vir:4704 Length: 125 # NCBI annotation: phi PVL ORF 11 homologue # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061636;genbank:gi:9635723;genbank:GeneID:1262995 Probab=24.34 E-value=2.2 Score=18.72 Aligned_cols=124 Identities=6% Similarity=-0.063 Sum_probs=55.4 Q ss_pred eeehh-HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeeeeccc Q lcl|NC_011356. 6 LDFSG-LEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRGVNPD 84 (148) Q Consensus 6 ~~i~G-l~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 84 (148) |.++. +++ |++.-+++.... .+..+++ .+..+.+-.-.|+.+...+ ...+++.+.|.+...... T Consensus 1 M~v~v~~~~----L~~~l~~l~~~~-~k~~~~A-------l~aga~~~~e~l~~~aP~~---~~~~hl~d~I~vs~~k~~ 65 (125) T protein:vir:47 1 MGARIESNN----IEQGLKNAVLKM-NLNSNVI-------VKAGAMSLVPLLKSNTPFA---NTKKHARDHIAVSNVKTD 65 (125) T ss_pred CeeEeeHHH----HHHHHHHHHHHH-HHHHHHH-------HHHHHHHHHHHHHHhCCCC---CCCchhhhheeecccccc Confidence 88886 443 333223333222 2222222 2222222223344443322 223456666655332211 Q ss_pred cccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_011356. 85 TGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEVLR 147 (148) Q Consensus 85 ~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~k~~k 147 (148) .+ ........+.+-...++..|-...-.-+| =.|=++-..++..+.+.+.+..+|+++.| T Consensus 66 ~~--~g~~~v~VG~~k~~~~~a~F~E~GT~k~~-a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:47 66 RH--TSEKIVTIGYAKGVSHRIHATEFGTMYQK-PQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred cc--cceEEEEeccCCCCceEEEeccCCccCCC-CCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 11 11111112211222222222222223332 24778888888888888888888888888 No 189 >protein:vir:79988 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430006;genbank:gi:156604061;genbank:GeneID:5525448 Probab=24.34 E-value=2.2 Score=18.72 Aligned_cols=124 Identities=6% Similarity=-0.063 Sum_probs=55.4 Q ss_pred eeehh-HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeeeeccc Q lcl|NC_011356. 6 LDFSG-LEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRGVNPD 84 (148) Q Consensus 6 ~~i~G-l~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 84 (148) |.++. +++ |++.-+++.... .+..+++ .+..+.+-.-.|+.+...+ ...+++.+.|.+...... T Consensus 1 M~v~v~~~~----L~~~l~~l~~~~-~k~~~~A-------l~aga~~~~e~l~~~aP~~---~~~~hl~d~I~vs~~k~~ 65 (125) T protein:vir:79 1 MGARIESNN----IEQGLKNAVLKM-NLNSNVI-------VKAGAMSLVPLLKSNTPFA---NTKKHARDHIAVSNVKTD 65 (125) T ss_pred CeeEeeHHH----HHHHHHHHHHHH-HHHHHHH-------HHHHHHHHHHHHHHhCCCC---CCCchhhhheeecccccc Confidence 88886 443 333223333222 2222222 2222222223344443322 223456666655332211 Q ss_pred cccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_011356. 85 TGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEVLR 147 (148) Q Consensus 85 ~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~k~~k 147 (148) .+ ........+.+-...++..|-...-.-+| =.|=++-..++..+.+.+.+..+|+++.| T Consensus 66 ~~--~g~~~v~VG~~k~~~~~a~F~E~GT~k~~-a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:79 66 RH--TSEKIVTIGYAKGVSHRIHATEFGTMYQK-PQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred cc--cceEEEEeccCCCCceEEEeccCCccCCC-CCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 11 11111112211222222222222223332 24778888888888888888888888888 No 190 >protein:vir:9414 Length: 125 # NCBI annotation: phi PVL orf 11-like protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803392;genbank:gi:29028704;genbank:GeneID:1258141 Probab=24.34 E-value=2.2 Score=18.72 Aligned_cols=124 Identities=6% Similarity=-0.063 Sum_probs=55.4 Q ss_pred eeehh-HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeeeeccc Q lcl|NC_011356. 6 LDFSG-LEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRGVNPD 84 (148) Q Consensus 6 ~~i~G-l~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 84 (148) |.++. +++ |++.-+++.... .+..+++ .+..+.+-.-.|+.+...+ ...+++.+.|.+...... T Consensus 1 M~v~v~~~~----L~~~l~~l~~~~-~k~~~~A-------l~aga~~~~e~l~~~aP~~---~~~~hl~d~I~vs~~k~~ 65 (125) T protein:vir:94 1 MGARIESNN----IEQGLKNAVLKM-NLNSNVI-------VKAGAMSLVPLLKSNTPFA---NTKKHARDHIAVSNVKTD 65 (125) T ss_pred CeeEeeHHH----HHHHHHHHHHHH-HHHHHHH-------HHHHHHHHHHHHHHhCCCC---CCCchhhhheeecccccc Confidence 88886 443 333223333222 2222222 2222222223344443322 223456666655332211 Q ss_pred cccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_011356. 85 TGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEVLR 147 (148) Q Consensus 85 ~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~k~~k 147 (148) .+ ........+.+-...++..|-...-.-+| =.|=++-..++..+.+.+.+..+|+++.| T Consensus 66 ~~--~g~~~v~VG~~k~~~~~a~F~E~GT~k~~-a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:94 66 RH--TSEKIVTIGYAKGVSHRIHATEFGTMYQK-PQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred cc--cceEEEEeccCCCCceEEEeccCCccCCC-CCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 11 11111112211222222222222223332 24778888888888888888888888888 No 191 >protein:vir:81106 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429878;genbank:gi:156603931;genbank:GeneID:5525326 Probab=24.34 E-value=2.2 Score=18.72 Aligned_cols=124 Identities=6% Similarity=-0.063 Sum_probs=55.4 Q ss_pred eeehh-HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeeeeccc Q lcl|NC_011356. 6 LDFSG-LEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRGVNPD 84 (148) Q Consensus 6 ~~i~G-l~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 84 (148) |.++. +++ |++.-+++.... .+..+++ .+..+.+-.-.|+.+...+ ...+++.+.|.+...... T Consensus 1 M~v~v~~~~----L~~~l~~l~~~~-~k~~~~A-------l~aga~~~~e~l~~~aP~~---~~~~hl~d~I~vs~~k~~ 65 (125) T protein:vir:81 1 MGARIESNN----IEQGLKNAVLKM-NLNSNVI-------VKAGAMSLVPLLKSNTPFA---NTKKHARDHIAVSNVKTD 65 (125) T ss_pred CeeEeeHHH----HHHHHHHHHHHH-HHHHHHH-------HHHHHHHHHHHHHHhCCCC---CCCchhhhheeecccccc Confidence 88886 443 333223333222 2222222 2222222223344443322 223456666655332211 Q ss_pred cccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_011356. 85 TGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEVLR 147 (148) Q Consensus 85 ~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~k~~k 147 (148) .+ ........+.+-...++..|-...-.-+| =.|=++-..++..+.+.+.+..+|+++.| T Consensus 66 ~~--~g~~~v~VG~~k~~~~~a~F~E~GT~k~~-a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:81 66 RH--TSEKIVTIGYAKGVSHRIHATEFGTMYQK-PQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred cc--cceEEEEeccCCCCceEEEeccCCccCCC-CCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 11 11111112211222222222222223332 24778888888888888888888888888 No 192 >protein:vir:6154 Length: 119 # NCBI annotation: hypothetical protein # Family: family:all:10918 # MgeID: mge:127 # MgeName: phBC6A51 # Cross-refs: genbank:acc:NP_852533;genbank:gi:31415793;genbank:GeneID:1489145 Probab=23.69 E-value=0.09 Score=26.32 Aligned_cols=118 Identities=10% Similarity=0.170 Sum_probs=70.6 Q ss_pred ccceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhhceeccccccccceeeeeeeeee Q lcl|NC_011356. 2 IETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRCSRDGGMESGVHIRGV 81 (148) Q Consensus 2 m~~~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~~i~~~~~~~~~~~~~~~~~~~~~ 81 (148) |.+.+-++|-..+++.-+- .-.+--+.+.+.+-...-...|..+||.-.|.|..||..+..-- + T Consensus 1 mrirvvvkgksnvlkahnp---nryktpieqtvekhtrlqanqasnrapilhgplsesipasvkmv--------v----- 64 (119) T protein:vir:61 1 MRIRVVVKGKSNVLKAHNP---NRYKTPIEQTVEKHTRLQANQASNRAPILHGPLSESIPASVKMV--------V----- 64 (119) T ss_pred CeeEEEeecccceecccCC---ccccccHHHHHHHhhhhhcccccccCceeecccccccchhhhhh--------h----- Confidence 9999999998887765441 11112334555555566667778889999999998885432211 0 Q ss_pred ccccccccceeecCCCCCcceeeecccCccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011356. 82 NPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEVLRR 148 (148) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~y~~~~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~k~~kk 148 (148) .....+.++++-.|+...||-. -...-||+.+.=+.++-.++ .+.+-+.++.|- T Consensus 65 -------gariigtygspliyaavqefth--ktkkgfmrktafegeqpfve----disktvqrvakg 118 (119) T protein:vir:61 65 -------GARIIGTYGSPLIYAAVQEFTH--KTKKGFMRKTAFEGEQPFVE----DISKTVQRVAKG 118 (119) T ss_pred -------hhhhcccccchHHHHHHHHHhh--hhhhhhhhhhcccCCcchHH----HHHHHHHHhhcC Confidence 0112233456778999999852 23445777655555554333 444555555555 No 193 >protein:vir:3787 Length: 231 # NCBI annotation: orf22 # Family: family:all:743 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536827;genbank:gi:17981836;genbank:GeneID:929215 Probab=21.70 E-value=1.7 Score=19.26 Aligned_cols=137 Identities=15% Similarity=0.175 Sum_probs=50.4 Q ss_pred ccc--eeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhC-----CCCc---------chhh--hhce-- Q lcl|NC_011356. 2 IET--LLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRA-----PVRR---------GKLR--RNVV-- 61 (148) Q Consensus 2 m~~--~~~i~Gl~el~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~a-----P~~~---------g~l~--~~i~-- 61 (148) |.+ ++....++.|...|..|. +.-+.=+.-|...|..+...+++++ |..+ |..+ ..+. T Consensus 1 m~~~~~~n~~dl~~l~~~L~ll~--L~p~kRrrLl~~iak~lr~~~~~rI~~Q~~PDGs~w~pRK~~~~k~k~~rm~~kL 78 (231) T protein:vir:37 1 MQIRLGLKQEDLDAFVRDLRTLN--LTGKQKKKILTWTLGAIKRKSQKNIREQHSPDGTAWEKRKPVDGEIKNKRLLKKV 78 (231) T ss_pred CCccCCcCHHHHHHHHHHHHHhc--CCHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCcCchhcccccchhhHHHHHHh Confidence 664 555667777777777553 2111223445666677777777653 4321 1100 0000 Q ss_pred --eccccccccceee------------eeeeeeeccccc--------ccc-c-e-------------eecC--------- Q lcl|NC_011356. 62 --VLSRCSRDGGMES------------GVHIRGVNPDTG--------NSD-N-T-------------MKAD--------- 95 (148) Q Consensus 62 --~~~~~~~~~~~~~------------~~~~~~~~~~~~--------~~~-~-~-------------~~~~--------- 95 (148) ........++... .+|..+.....+ ... . . .... T Consensus 79 ~~~~~~~~~~~~~~~~~~~~g~~~~IA~vHQ~G~~~rv~~~~~~~~~~~~~~~pATr~QAk~Lr~lGy~v~~~k~k~~k~ 158 (231) T protein:vir:37 79 LRYASILAEERGKGRIYYKNPLTGEIAQKQQDGFTEHFRVFATDKNKNGSGNDRATIRQAQKLRSLGYRKRNGKNRQGKT 158 (231) T ss_pred HHhhccccccCCceEEeeecchHHHHHHHhhcCcccccchhhhhhccCCCCCCCCCHHHHHHHHHhcccccCCCCCCCCC Confidence 0000000000000 001100000000 000 0 0 0000 Q ss_pred -CCCC--cce------------eeeccc----------CccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011356. 96 -NPRN--AFY------------WRFVEM----------GTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEVLRR 148 (148) Q Consensus 96 -~~~~--~~y------------~~~~E~----------GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~k~~kk 148 (148) +..+ .|. -+.++- =|...|++|||...-++..+ .+.+.|++++.. T Consensus 159 ~~rkps~kwI~~~ls~~qAgliIR~L~~k~~~~~~k~~W~I~~paR~FLG~~~~e~~~--------~l~~~l~~i~~~ 228 (231) T protein:vir:37 159 KYRLYTIKEIRERLTRTWASMEIRRLENKVNAGNGKTNWEIHVPARPFLDTREKENVD--------ILREITLKFLSG 228 (231) T ss_pred CcCcCCHHHHHHhhhhHHHHHHHHHHhcccccccCcceeeeecCcccccCCCHHHHHH--------HHHHHHHHHhcc Confidence 0000 000 011111 13567999999877665444 333333333333 No 194 >protein:vir:3750 Length: 227 # NCBI annotation: hypothetical protein # Family: family:all:743 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043491;genbank:gi:9628626;genbank:GeneID:1261131 Probab=21.69 E-value=2.3 Score=18.62 Aligned_cols=135 Identities=14% Similarity=0.181 Sum_probs=52.6 Q ss_pred cc--ceeeehhHHHHHHHHHHhH--HHHHHHHHHHHHHHHHHHHHHHHHHhC-----CCCcc-------------hhhhh Q lcl|NC_011356. 2 IE--TLLDFSGLEDISRDLQLLS--GAENNRVLREATRAGANVLKEEVVSRA-----PVRRG-------------KLRRN 59 (148) Q Consensus 2 m~--~~~~i~Gl~el~~~l~~l~--~~~~~~~~~~al~~~a~~i~~~ak~~a-----P~~~g-------------~l~~~ 59 (148) |. ++++..+++.|...|..|. ..-. +.-+...|..+...+++++ |..+. .|... T Consensus 1 M~i~~~~n~~~~~~l~~~L~ll~L~p~~R----r~ll~~iak~lr~~~k~rIr~Q~~PDGs~~~pRKr~k~KM~~kL~k~ 76 (227) T protein:vir:37 1 MNIRMGIDKEDLKKFLKDLEIISLPDKKK----REILIRSLQMIKRQAVKSAANQRNPMGGSWKKRKNGTAKMLRRIAKL 76 (227) T ss_pred CcccccCCHHHHHHHHHHHHHhcCCHHHH----HHHHHHHHHHHHHHHHHHHHhhcCCCCCCCchhcchhHHHHhhhHHH Confidence 55 5777888888888876553 3322 4455666666777777653 43221 11222 Q ss_pred ceecccccc------cc--ceeeeeeeeeeccccc---------c-ccce--------------eecCC----------C Q lcl|NC_011356. 60 VVVLSRCSR------DG--GMESGVHIRGVNPDTG---------N-SDNT--------------MKADN----------P 97 (148) Q Consensus 60 i~~~~~~~~------~~--~~~~~~~~~~~~~~~~---------~-~~~~--------------~~~~~----------~ 97 (148) +.+...... .| +....+|..+.....+ . .... ..... . T Consensus 77 l~~~~~~~~a~v~f~~g~~~~IA~vHq~G~~~~v~~~~~~~~~~~~~~~~paTr~QAk~Lr~lGy~v~~~k~k~~k~~~r 156 (227) T protein:vir:37 77 ANSKAEKAQGTLFYKQKRTGEIAQEHQEGIPHLFKKTEFTGKNKGGIGADPCTLRQAKKLKDLGYTVANGKTKNGKAKRR 156 (227) T ss_pred cceeecccceEEEecCcchHHHHHHhhcCcccccchhhhhhhhcCCccccCCCHHHHHHHHHhcccccCCCCCCcCCccc Confidence 221100000 00 0000111111000000 0 0000 00000 0 Q ss_pred CC--cce------------eeeccc------------CccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011356. 98 RN--AFY------------WRFVEM------------GTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEVLRR 148 (148) Q Consensus 98 ~~--~~y------------~~~~E~------------GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~i~k~~kk 148 (148) .+ .|. -+.++- =+...|++|||...-+ + +.+.|...|.++..+ T Consensus 157 kps~kwI~~nls~~qAgliIR~L~~k~~~~~~~~k~~W~I~~PaR~FLG~~~~----e----~~~~l~r~l~~~~~~ 225 (227) T protein:vir:37 157 KPTLSEIRSTLSRAKASLIIRKLEEKNGMNPSRHLTQWIIPTEKRSFLDTREE----E----NAKIILAEIQKYTQK 225 (227) T ss_pred cCCHHHHHHhhhHHHHHHHHHHHhcccccccccCccceeeecCcccccCCCHH----H----HHHHHHHHHHHHhhh Confidence 00 000 001110 0345799999998443 2 233444444444444 Done!