Query lcl|NC_011308.1_cdsid_YP_002261425.1 [gene=P40_gp9] [protein=gp9] [protein_id=YP_002261425.1] [location=7394..7858]
Match_columns 154
No_of_seqs 112 out of 241
Neff 7.3
Searched_HMMs 1612
Date Thu Nov 7 13:14:07 2013
Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_9 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_9_vs_rec_db.hhr

No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 protein:vir:78077 Length: 141 100.0 1.9E-46 1.2E-49 271.3 15.2 141 13-154 1-141 (141)
2 protein:vir:94490 Length: 137 100.0 5.2E-38 3.2E-41 225.0 13.9 134 13-147 1-137 (137)
3 protein:vir:93738 Length: 137 100.0 5.2E-38 3.2E-41 225.0 13.9 134 13-147 1-137 (137)
4 protein:vir:97427 Length: 137 100.0 5.2E-38 3.2E-41 225.0 13.9 134 13-147 1-137 (137)
5 protein:vir:5978 Length: 144 # 100.0 7.2E-38 4.5E-41 224.2 14.7 141 10-151 1-144 (144)
6 protein:vir:95894 Length: 137 100.0 7.4E-38 4.6E-41 224.2 13.9 134 13-147 1-137 (137)
7 protein:vir:106570 Length: 182 100.0 1E-37 6.5E-41 223.3 13.7 143 12-154 1-177 (182)
8 protein:vir:96121 Length: 137 100.0 1.5E-37 9.6E-41 222.4 14.1 134 13-147 1-137 (137)
9 protein:vir:107099 Length: 137 100.0 1.5E-37 9.5E-41 222.4 13.8 134 13-147 1-137 (137)
10 protein:vir:94108 Length: 149 100.0 1.7E-37 1.1E-40 222.2 13.8 145 1-147 1-149 (149)
11 protein:vir:94796 Length: 137 100.0 1.8E-37 1.1E-40 222.1 13.9 134 13-147 1-137 (137)
12 protein:vir:105330 Length: 137 100.0 1.5E-37 9.5E-41 222.4 13.5 134 13-147 1-137 (137)
13 protein:vir:96829 Length: 135 100.0 1.1E-36 7E-40 217.6 13.8 134 13-147 1-135 (135)
14 protein:vir:105916 Length: 149 100.0 1.4E-36 8.8E-40 217.1 13.8 145 1-147 1-149 (149)
15 protein:vir:1243 Length: 116 # 100.0 6.1E-36 3.8E-39 213.6 10.9 113 34-147 1-116 (116)
16 protein:vir:97327 Length: 116 100.0 6.1E-36 3.8E-39 213.6 10.9 113 34-147 1-116 (116)
17 protein:vir:95062 Length: 116 100.0 7.1E-36 4.4E-39 213.3 10.9 113 34-147 1-116 (116)
18 protein:vir:101594 Length: 173 100.0 1.4E-31 8.6E-35 189.8 13.0 138 15-153 1-173 (173)
19 protein:vir:94654 Length: 142 100.0 1.6E-31 9.9E-35 189.4 13.2 137 13-151 1-142 (142)
20 protein:vir:107545 Length: 140 99.9 6.7E-30 4.1E-33 180.5 9.3 131 13-148 1-140 (140)
21 protein:vir:97982 Length: 140 99.9 6.7E-30 4.1E-33 180.5 9.3 131 13-148 1-140 (140)
22 protein:vir:99101 Length: 142 99.9 1E-29 6.5E-33 179.5 9.8 132 13-148 1-142 (142)
23 protein:vir:8669 Length: 142 # 99.9 1E-29 6.5E-33 179.5 9.8 132 13-148 1-142 (142)
24 protein:vir:3617 Length: 112 # 99.9 7.9E-29 4.9E-32 174.6 11.7 110 13-151 1-112 (112)
25 protein:vir:95789 Length: 114 99.9 1.5E-28 9.3E-32 173.1 12.1 113 13-154 1-113 (114)
26 protein:vir:9930 Length: 108 # 99.9 2.6E-28 1.6E-31 171.8 12.4 108 17-152 1-108 (108)
27 protein:vir:106041 Length: 137 99.9 5.5E-29 3.4E-32 175.5 8.0 126 13-145 1-137 (137)
28 protein:vir:99744 Length: 115 99.9 5.7E-28 3.5E-31 170.0 11.9 109 15-151 1-115 (115)
29 protein:vir:743 Length: 108 # 99.9 7.3E-28 4.6E-31 169.3 12.1 108 15-151 1-108 (108)
30 protein:vir:98409 Length: 108 99.9 6.8E-28 4.2E-31 169.5 11.7 108 15-151 1-108 (108)
31 protein:vir:9312 Length: 115 # 99.9 9.7E-28 6E-31 168.7 12.1 109 15-151 1-115 (115)
32 protein:vir:96358 Length: 115 99.9 9.7E-28 6E-31 168.7 12.1 109 15-151 1-115 (115)
33 protein:vir:78858 Length: 115 99.9 9.7E-28 6E-31 168.7 12.1 109 15-151 1-115 (115)
34 protein:vir:96225 Length: 115 99.9 9.7E-28 6E-31 168.7 12.1 109 15-151 1-115 (115)
35 protein:vir:97144 Length: 115 99.9 9.7E-28 6E-31 168.7 12.1 109 15-151 1-115 (115)
36 protein:vir:103917 Length: 115 99.9 9.7E-28 6E-31 168.7 12.1 109 15-151 1-115 (115)
37 protein:vir:94538 Length: 125 99.9 1.1E-27 6.9E-31 168.4 11.5 119 1-154 1-122 (125)
38 protein:vir:106623 Length: 115 99.9 2.1E-27 1.3E-30 166.8 12.1 109 15-151 1-115 (115)
39 protein:vir:96486 Length: 112 99.9 2.2E-27 1.4E-30 166.7 11.3 109 13-150 1-112 (112)
40 protein:vir:2740 Length: 114 # 99.9 9E-27 5.6E-30 163.4 10.8 111 13-152 1-114 (114)
41 protein:vir:4906 Length: 114 # 99.9 9E-27 5.6E-30 163.4 10.8 111 13-152 1-114 (114)
42 protein:vir:102441 Length: 137 99.9 4.7E-27 2.9E-30 164.9 8.5 129 13-146 1-137 (137)
43 protein:vir:106506 Length: 137 99.9 3.7E-26 2.3E-29 160.0 7.7 129 16-151 1-137 (137)
44 protein:vir:105467 Length: 144 99.8 5.4E-24 3.3E-27 148.2 12.4 131 13-154 1-140 (144)
45 protein:vir:100075 Length: 140 99.8 2E-23 1.3E-26 145.0 10.7 115 13-154 1-133 (140)
46 protein:vir:1437 Length: 140 # 99.8 2.9E-23 1.8E-26 144.1 10.9 116 13-154 1-133 (140)
47 protein:vir:100243 Length: 140 99.8 3.6E-23 2.2E-26 143.7 10.9 116 13-154 1-133 (140)
48 protein:vir:97088 Length: 157 99.8 6.3E-23 3.9E-26 142.3 11.8 139 13-154 1-150 (157)
49 protein:vir:80362 Length: 140 99.8 1E-22 6.2E-26 141.2 11.4 116 13-154 1-133 (140)
50 protein:vir:93617 Length: 148 99.8 6.4E-22 4E-25 136.8 11.2 117 12-154 1-143 (148)
51 protein:vir:194 Length: 149 # 99.8 4.6E-22 2.8E-25 137.6 10.2 119 10-154 1-144 (149)
52 protein:vir:105089 Length: 133 99.7 4.4E-21 2.7E-24 132.2 11.3 116 13-154 1-131 (133)
53 protein:vir:9879 Length: 127 # 99.7 3.5E-21 2.2E-24 132.7 10.0 115 17-152 1-127 (127)
54 protein:vir:79034 Length: 141 99.7 1.3E-20 8.1E-24 129.6 12.1 121 13-154 1-135 (141)
55 protein:vir:1273 Length: 127 # 99.7 1.9E-20 1.2E-23 128.7 10.9 115 13-154 1-126 (127)
56 protein:vir:4347 Length: 164 # 99.7 1.8E-20 1.1E-23 128.8 10.1 116 13-154 1-151 (164)
57 protein:vir:1891 Length: 179 # 99.7 2.4E-20 1.5E-23 128.2 10.6 140 13-154 1-166 (179)
58 protein:vir:102085 Length: 146 99.7 3.8E-20 2.4E-23 127.0 11.4 115 13-154 1-143 (146)
59 protein:vir:107568 Length: 146 99.7 3.8E-20 2.4E-23 127.0 11.4 115 13-154 1-143 (146)
60 protein:vir:105007 Length: 146 99.7 3.8E-20 2.4E-23 127.0 11.4 115 13-154 1-143 (146)
61 protein:vir:102875 Length: 146 99.7 3.8E-20 2.4E-23 127.0 11.4 115 13-154 1-143 (146)
62 protein:vir:5745 Length: 135 # 99.7 3.1E-20 1.9E-23 127.5 10.8 116 13-154 1-131 (135)
63 protein:vir:4704 Length: 125 # 99.7 5.4E-20 3.4E-23 126.2 11.3 114 13-154 1-124 (125)
64 protein:vir:9414 Length: 125 # 99.7 5.4E-20 3.4E-23 126.2 11.3 114 13-154 1-124 (125)
65 protein:vir:81106 Length: 125 99.7 5.4E-20 3.4E-23 126.2 11.3 114 13-154 1-124 (125)
66 protein:vir:98342 Length: 125 99.7 5.4E-20 3.4E-23 126.2 11.3 114 13-154 1-124 (125)
67 protein:vir:79988 Length: 125 99.7 5.4E-20 3.4E-23 126.2 11.3 114 13-154 1-124 (125)
68 protein:vir:3873 Length: 128 # 99.7 8.2E-20 5.1E-23 125.2 12.0 115 13-154 1-127 (128)
69 protein:vir:1386 Length: 149 # 99.6 5.9E-19 3.7E-22 120.5 10.6 116 13-154 1-144 (149)
70 protein:vir:99528 Length: 92 # 99.6 3.2E-19 2E-22 122.0 8.0 86 13-127 1-92 (92)
71 protein:vir:9708 Length: 125 # 99.6 4.3E-18 2.6E-21 115.8 11.1 112 16-154 1-123 (125)
72 protein:vir:102963 Length: 163 99.6 7E-18 4.3E-21 114.6 12.1 119 13-154 1-154 (163)
73 protein:vir:966 Length: 123 # 99.6 2.8E-17 1.7E-20 111.4 12.8 118 13-152 1-123 (123)
74 protein:vir:81147 Length: 126 99.6 3.3E-17 2.1E-20 110.9 12.4 120 13-154 1-126 (126)
75 protein:vir:102154 Length: 119 99.5 3.1E-17 1.9E-20 111.1 9.2 113 13-154 1-118 (119)
76 protein:vir:102338 Length: 116 99.4 2E-15 1.2E-18 101.2 8.8 111 34-154 1-115 (116)
77 protein:vir:103280 Length: 142 99.3 3.4E-14 2.1E-17 94.4 10.8 110 13-153 1-142 (142)
78 protein:vir:95372 Length: 124 99.3 1.1E-13 7E-17 91.6 12.4 116 13-152 1-124 (124)
79 protein:vir:107703 Length: 147 99.2 1.3E-13 8.2E-17 91.2 11.4 107 13-154 1-141 (147)
80 protein:vir:79638 Length: 146 99.2 1.5E-13 9.3E-17 90.9 11.3 111 13-154 1-145 (146)
81 protein:vir:104347 Length: 145 99.2 1.4E-13 8.5E-17 91.1 10.2 114 9-153 1-145 (145)
82 protein:vir:80116 Length: 127 99.2 5E-13 3.1E-16 88.0 12.4 118 13-154 1-126 (127)
83 protein:vir:105773 Length: 131 99.2 1.8E-13 1.1E-16 90.5 9.5 120 15-152 1-131 (131)
84 protein:vir:97190 Length: 148 99.1 3.2E-13 2E-16 89.1 9.6 109 13-154 1-146 (148)
85 protein:vir:94994 Length: 131 99.1 4.9E-13 3.1E-16 88.1 9.3 103 13-154 1-131 (131)
86 protein:vir:78380 Length: 131 99.1 1.5E-12 9.1E-16 85.5 9.6 103 13-154 1-131 (131)
87 protein:vir:10367 Length: 119 99.1 2.3E-13 1.4E-16 89.9 5.2 96 50-154 1-115 (119)
88 protein:vir:3163 Length: 145 # 99.0 4.7E-13 2.9E-16 88.2 6.4 115 16-154 1-143 (145)
89 protein:vir:95157 Length: 144 99.0 2.4E-12 1.5E-15 84.2 9.1 106 13-154 1-144 (144)
90 protein:vir:81067 Length: 119 99.0 6E-13 3.7E-16 87.6 5.2 96 50-154 1-115 (119)
91 protein:vir:80425 Length: 134 99.0 4.8E-12 3E-15 82.6 9.5 102 13-154 1-133 (134)
92 protein:vir:100887 Length: 139 98.9 1.5E-11 9.5E-15 79.9 10.6 112 17-154 1-130 (139)
93 protein:vir:96288 Length: 100 98.9 3.8E-12 2.4E-15 83.2 6.7 99 1-101 1-100 (100)
94 protein:vir:94944 Length: 121 98.9 4.3E-12 2.7E-15 82.9 6.9 95 13-139 1-121 (121)
95 protein:vir:4956 Length: 153 # 98.8 6.6E-11 4.1E-14 76.4 9.3 115 13-154 1-134 (153)
96 protein:vir:96774 Length: 152 98.8 1.2E-10 7.2E-14 75.0 9.9 112 1-149 1-152 (152)
97 protein:vir:100223 Length: 139 98.7 2.2E-10 1.4E-13 73.6 10.0 112 17-154 1-130 (139)
98 protein:vir:5000 Length: 141 # 98.7 2.3E-10 1.5E-13 73.4 8.9 115 13-154 1-134 (141)
99 protein:vir:1988 Length: 156 # 98.7 9E-11 5.6E-14 75.6 6.4 119 13-152 1-156 (156)
100 protein:vir:100652 Length: 134 98.6 8.1E-10 5E-13 70.4 10.7 123 13-153 1-134 (134)
101 protein:vir:6246 Length: 143 # 98.6 2E-10 1.2E-13 73.8 7.2 115 13-152 1-143 (143)
102 protein:vir:103841 Length: 155 98.6 2.2E-10 1.4E-13 73.5 6.2 121 13-154 1-155 (155)
103 protein:vir:79091 Length: 175 98.6 3.6E-10 2.2E-13 72.4 7.1 120 13-153 1-175 (175)
104 protein:vir:99196 Length: 155 98.5 8.4E-10 5.2E-13 70.3 8.9 121 13-154 1-155 (155)
105 protein:vir:107851 Length: 175 98.5 8.9E-10 5.5E-13 70.2 8.6 120 13-153 1-175 (175)
106 protein:vir:9513 Length: 134 # 98.5 1.9E-09 1.2E-12 68.4 10.1 123 13-153 1-134 (134)
107 protein:vir:101302 Length: 134 98.5 1.9E-09 1.2E-12 68.4 10.1 123 13-153 1-134 (134)
108 protein:vir:4859 Length: 140 # 98.5 1.4E-09 8.4E-13 69.2 9.2 115 13-154 1-134 (140)
109 protein:vir:1332 Length: 143 # 98.5 6.5E-10 4.1E-13 70.9 7.1 115 13-152 1-143 (143)
110 protein:vir:99833 Length: 190 98.5 1.7E-09 1.1E-12 68.6 9.1 138 13-154 1-186 (190)
111 protein:vir:79225 Length: 155 98.5 9.1E-10 5.7E-13 70.1 7.1 121 13-154 1-155 (155)
112 protein:vir:4833 Length: 140 # 98.4 2.6E-09 1.6E-12 67.6 8.9 115 13-154 1-134 (140)
113 protein:vir:80970 Length: 112 98.0 1.7E-07 1E-10 57.7 9.7 111 13-154 1-112 (112)
114 protein:vir:3848 Length: 159 # 97.9 3.1E-07 1.9E-10 56.2 10.7 114 13-154 1-153 (159)
115 protein:vir:7449 Length: 123 # 97.9 1.5E-07 9.3E-11 58.0 8.3 113 13-154 1-120 (123)
116 protein:vir:93898 Length: 133 97.9 3.9E-07 2.4E-10 55.7 10.3 122 13-152 1-133 (133)
117 protein:vir:9647 Length: 132 # 97.9 4.3E-07 2.7E-10 55.5 10.4 120 13-152 1-132 (132)
118 protein:vir:45 Length: 112 # N 97.8 3.9E-07 2.4E-10 55.7 9.7 111 13-153 1-112 (112)
119 protein:vir:94419 Length: 133 97.8 6E-07 3.7E-10 54.7 10.4 120 13-152 1-133 (133)
120 protein:vir:78644 Length: 133 97.8 6E-07 3.7E-10 54.7 10.4 120 13-152 1-133 (133)
121 protein:vir:9363 Length: 133 # 97.8 6E-07 3.7E-10 54.7 10.4 120 13-152 1-133 (133)
122 protein:vir:96973 Length: 133 97.8 6E-07 3.7E-10 54.7 10.4 120 13-152 1-133 (133)
123 protein:vir:4200 Length: 133 # 97.7 1.7E-07 1.1E-10 57.7 6.1 129 14-152 1-133 (133)
124 protein:vir:6216 Length: 125 # 97.7 7.3E-07 4.5E-10 54.2 9.5 120 13-153 1-125 (125)
125 protein:vir:78335 Length: 133 97.7 7E-07 4.4E-10 54.3 9.3 122 13-154 1-133 (133)
126 protein:vir:98557 Length: 149 97.7 8.2E-07 5.1E-10 54.0 9.1 120 13-152 1-149 (149)
127 protein:vir:8106 Length: 150 # 97.7 9E-08 5.6E-11 59.2 3.8 132 13-154 1-143 (150)
128 protein:vir:4790 Length: 114 # 97.6 1.3E-06 8E-10 52.9 9.4 114 13-152 1-114 (114)
129 protein:vir:101508 Length: 120 97.5 1.9E-06 1.2E-09 52.0 9.1 112 13-154 1-120 (120)
130 protein:vir:99546 Length: 200 97.5 1.1E-06 6.9E-10 53.2 7.6 118 7-154 1-143 (200)
131 protein:vir:96105 Length: 193 97.5 5.3E-08 3.3E-11 60.5 0.2 112 13-154 1-136 (193)
132 protein:vir:1581 Length: 116 # 97.4 2.3E-06 1.4E-09 51.5 8.3 116 13-151 1-116 (116)
133 protein:vir:96012 Length: 133 97.4 6.5E-06 4E-09 49.0 10.8 124 13-154 1-133 (133)
134 protein:vir:79179 Length: 155 97.4 5.3E-06 3.3E-09 49.5 10.2 118 13-152 1-155 (155)
135 protein:vir:94069 Length: 168 97.4 1.8E-07 1.1E-10 57.5 2.1 91 27-154 1-101 (168)
136 protein:vir:4162 Length: 133 # 97.3 1E-06 6.5E-10 53.4 5.9 129 14-152 1-133 (133)
137 protein:vir:2026 Length: 150 # 97.3 5.7E-06 3.5E-09 49.3 9.7 122 13-152 1-150 (150)
138 protein:vir:6071 Length: 150 # 97.3 9.7E-06 6E-09 48.1 10.5 122 13-152 1-150 (150)
139 protein:vir:1164 Length: 156 # 97.2 9.1E-06 5.7E-09 48.2 10.0 120 13-154 1-154 (156)
140 protein:vir:79115 Length: 148 97.2 8.6E-06 5.3E-09 48.4 9.7 120 13-152 1-148 (148)
141 protein:vir:5703 Length: 150 # 97.2 1.3E-05 7.8E-09 47.4 10.4 119 13-152 1-150 (150)
142 protein:vir:100312 Length: 152 97.1 1.8E-05 1.1E-08 46.6 10.3 120 13-153 1-152 (152)
143 protein:vir:98636 Length: 138 97.1 2.8E-05 1.7E-08 45.6 11.1 127 4-154 1-137 (138)
144 protein:vir:7993 Length: 108 # 97.1 4E-07 2.5E-10 55.6 1.0 100 13-137 1-108 (108)
145 protein:vir:1838 Length: 149 # 97.0 1.2E-05 7.7E-09 47.5 8.9 119 13-152 1-149 (149)
146 protein:vir:98892 Length: 108 97.0 1.3E-05 8.3E-09 47.3 8.9 107 13-152 1-108 (108)
147 protein:vir:77650 Length: 155 96.8 3.6E-06 2.2E-09 50.4 4.6 97 13-154 1-98 (155)
148 protein:vir:101563 Length: 155 96.8 3E-06 1.9E-09 50.9 4.1 95 13-154 1-98 (155)
149 protein:vir:5257 Length: 148 # 96.7 3.2E-06 2E-09 50.7 3.5 89 13-154 1-91 (148)
150 protein:vir:107757 Length: 189 96.6 6.1E-06 3.8E-09 49.2 4.4 88 13-154 1-93 (189)
151 protein:vir:79687 Length: 113 96.5 4.5E-05 2.8E-08 44.4 8.3 109 13-154 1-110 (113)
152 protein:vir:102190 Length: 93 96.2 3.6E-05 2.2E-08 45.0 6.3 88 38-154 1-93 (93)
153 protein:vir:78894 Length: 105 96.2 8.3E-06 5.1E-09 48.4 2.7 99 13-152 1-105 (105)
154 protein:vir:78607 Length: 155 96.1 8.7E-06 5.4E-09 48.3 2.6 96 13-154 1-97 (155)
155 protein:vir:106728 Length: 155 96.1 8.4E-06 5.2E-09 48.4 2.5 95 13-154 1-97 (155)
156 protein:vir:3036 Length: 118 # 95.4 0.00028 1.7E-07 40.1 8.2 112 13-154 1-116 (118)
157 protein:vir:9823 Length: 118 # 95.4 0.00028 1.7E-07 40.1 8.2 112 13-154 1-116 (118)
158 protein:vir:4096 Length: 140 # 94.8 0.00026 1.6E-07 40.3 6.2 118 13-154 1-133 (140)
159 protein:vir:2688 Length: 123 # 94.8 0.001 6.2E-07 37.0 9.4 111 19-152 1-123 (123)
160 protein:vir:1087 Length: 161 # 94.3 0.0007 4.4E-07 37.9 7.6 131 12-154 1-155 (161)
161 protein:vir:4460 Length: 170 # 93.6 0.00085 5.3E-07 37.4 6.6 126 13-154 1-162 (170)
162 protein:vir:80037 Length: 199 93.3 0.00014 8.7E-08 41.7 1.9 106 13-154 1-139 (199)
163 protein:vir:7412 Length: 168 # 93.1 0.0022 1.4E-06 35.1 8.0 128 13-154 1-167 (168)
164 protein:vir:1028 Length: 168 # 91.2 0.0043 2.7E-06 33.6 7.4 128 13-154 1-167 (168)
165 protein:vir:95260 Length: 160 91.2 0.00077 4.8E-07 37.6 3.3 85 13-154 1-93 (160)
166 protein:vir:3994 Length: 168 # 90.4 0.0062 3.8E-06 32.7 7.4 128 13-154 1-159 (168)
167 protein:vir:102608 Length: 108 89.8 0.00097 6E-07 37.1 2.5 100 13-137 1-108 (108)
168 protein:vir:105825 Length: 108 89.8 0.00097 6E-07 37.1 2.5 100 13-137 1-108 (108)
169 protein:vir:487 Length: 187 # 89.0 0.0076 4.7E-06 32.2 6.8 144 1-154 1-179 (187)
170 protein:vir:6375 Length: 205 # 88.9 0.025 1.5E-05 29.4 9.6 142 13-154 1-197 (205)
171 protein:vir:8432 Length: 149 # 88.1 0.016 1E-05 30.4 8.0 130 1-153 1-149 (149)
172 protein:vir:96763 Length: 177 88.0 0.035 2.2E-05 28.5 10.1 147 1-154 1-170 (177)
173 protein:vir:78163 Length: 92 # 86.3 0.0025 1.6E-06 34.8 2.5 88 13-140 1-92 (92)
174 protein:vir:396 Length: 184 # 85.0 0.053 3.3E-05 27.6 9.1 138 15-154 1-182 (184)
175 protein:vir:4514 Length: 168 # 84.8 0.027 1.7E-05 29.2 7.4 138 1-154 1-161 (168)
176 protein:vir:99454 Length: 150 84.2 0.063 3.9E-05 27.2 9.2 129 13-143 1-150 (150)
177 protein:vir:79034 Length: 141 75.6 0.1 6.4E-05 26.0 7.4 113 16-154 1-132 (141)
178 protein:vir:78503 Length: 131 73.4 0.17 0.00011 24.8 8.8 121 13-138 1-131 (131)
179 protein:vir:2347 Length: 131 # 73.4 0.17 0.00011 24.8 8.8 121 13-138 1-131 (131)
180 protein:vir:78298 Length: 131 73.4 0.17 0.00011 24.8 8.8 121 13-138 1-131 (131)
181 protein:vir:7776 Length: 119 # 72.6 0.14 8.8E-05 25.2 7.4 110 13-127 1-119 (119)
182 protein:vir:6154 Length: 119 # 70.0 0.0043 2.7E-06 33.6 -1.6 113 1-154 1-116 (119)
183 protein:vir:3427 Length: 192 # 58.5 0.41 0.00026 22.7 10.8 138 13-154 1-186 (192)
184 protein:vir:8330 Length: 141 # 24.5 0.097 6E-05 26.1 -2.3 129 1-154 3-139 (141)

No 1 >protein:vir:78077 Length: 141 # NCBI annotation: gp9 # Family: family:all:180 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468793;genbank:gi:157325374;genbank:GeneID:5601839 Probab=100.00 E-value=1.9e-46 Score=271.29 Aligned_cols=141 Identities=60% Similarity=1.074 Sum_probs=137.5 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCceEEEEecCCCccccc Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRKQEVKIGNSSDYAIYY 92 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~yV 92 (154) |+ +|+|.++++++++++++++.++++.++++.+.++++..|+.++|||||+|++||++++..++++++|+++++||+|| T Consensus 1 ~~-~~~f~~~~~~~~~~~~k~~~~~~~~~a~~~~~~~ie~~ak~~~pvdtG~L~~SI~~~v~~~g~~~~V~~~~~YA~yV 79 (141) T protein:vir:78 1 MN-EFEFDSNIPKARKLIEKKVLQALEDIGEHMTTELAEGGHGVTSNNDTGEYAQKSGYKVRKSSKEVIVGNSSDYAIYY 79 (141) T ss_pred Cc-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccchhhcceeeeeecCCcEEEEecCCCcccee Confidence 85 79999999999999999999999999999999999999999999999999999999998888999999999999999 Q ss_pred ccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_011308.
93 EFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGGLD 154 (154) Q Consensus 93 E~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~l~ 154 (154) |||||+|+.+++++++||+|++++|+||+|+||+|||||+||++++++++.++|+++|++|| T Consensus 80 E~GTG~~~~~~~grk~~w~y~~~~g~~~~t~G~~aqpFl~~A~~~~~~~i~~~i~~~~~~l~ 141 (141) T protein:vir:78 80 EFGTGEKSERGGGKAGGWFYMDKKGHWHFTRGSQASKRMRYTFRDEQDKVRVFTERALRGIN 141 (141) T ss_pred ecCCcccccCCCCCcCcceeecCCCeeEeccCCCCchhhhhhHHhhHHHHHHHHHHHhhccC Confidence 99999999999999999999999999999999999999999999999999999999999999 No 2 >protein:vir:94490 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240680;genbank:gi:66396374;genbank:GeneID:5133754 Probab=100.00 E-value=5.2e-38 Score=224.98 Aligned_cols=134 Identities=24% Similarity=0.302 Sum_probs=124.3 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCceEEEEecCCCccccc Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRKQEVKIGNSSDYAIYY 92 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~yV 92 (154) ||..+++.++|.+.++.+.+.+.+.+.+++.+.+ ..+++.++.++|||||+|++||++++..++++++|+++++||+|| T Consensus 1 Ma~~~~g~~~l~~~l~~~~~~~~~~~~~~~~~~a-~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~v 79 (137) T protein:vir:94 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTT-AKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYV 79 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhCCccccchhccceeEeecCceEEEEecCCCccccc Confidence 9988889999999999999999988888877665 456778999999999999999999999999999999999999999 Q ss_pred ccCccccccCCCC---cccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHH Q lcl|NC_011308. 93 EFGTGEKSEKGGG---RAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVI 147 (154) Q Consensus 93 E~GTg~~~~~~~~---~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~ 147 (154) |||||+|...+.+ ++.+|+++.+++.|++|+||||||||+||+++++++|.++|. 
T Consensus 80 E~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:94 80 NYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred ccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 9999999998875 667999999999999999999999999999999999999999 No 3 >protein:vir:93738 Length: 137 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240463;genbank:gi:66396153;genbank:GeneID:5133507 Probab=100.00 E-value=5.2e-38 Score=224.98 Aligned_cols=134 Identities=24% Similarity=0.302 Sum_probs=124.3 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCceEEEEecCCCccccc Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRKQEVKIGNSSDYAIYY 92 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~yV 92 (154) ||..+++.++|.+.++.+.+.+.+.+.+++.+.+ ..+++.++.++|||||+|++||++++..++++++|+++++||+|| T Consensus 1 Ma~~~~g~~~l~~~l~~~~~~~~~~~~~~~~~~a-~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~v 79 (137) T protein:vir:93 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTT-AKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYV 79 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhCCccccchhccceeEeecCceEEEEecCCCccccc Confidence 9988889999999999999999988888877665 456778999999999999999999999999999999999999999 Q ss_pred ccCccccccCCCC---cccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHH Q lcl|NC_011308. 93 EFGTGEKSEKGGG---RAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVI 147 (154) Q Consensus 93 E~GTg~~~~~~~~---~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~ 147 (154) |||||+|...+.+ ++.+|+++.+++.|++|+||||||||+||+++++++|.++|. 
T Consensus 80 E~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:93 80 NYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred ccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 9999999998875 667999999999999999999999999999999999999999 No 4 >protein:vir:97427 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240753;genbank:gi:66396447;genbank:GeneID:5133783 Probab=100.00 E-value=5.2e-38 Score=224.98 Aligned_cols=134 Identities=24% Similarity=0.302 Sum_probs=124.3 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCceEEEEecCCCccccc Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRKQEVKIGNSSDYAIYY 92 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~yV 92 (154) ||..+++.++|.+.++.+.+.+.+.+.+++.+.+ ..+++.++.++|||||+|++||++++..++++++|+++++||+|| T Consensus 1 Ma~~~~g~~~l~~~l~~~~~~~~~~~~~~~~~~a-~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~v 79 (137) T protein:vir:97 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTT-AKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYV 79 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhCCccccchhccceeEeecCceEEEEecCCCccccc Confidence 9988889999999999999999988888877665 456778999999999999999999999999999999999999999 Q ss_pred ccCccccccCCCC---cccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHH Q lcl|NC_011308. 93 EFGTGEKSEKGGG---RAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVI 147 (154) Q Consensus 93 E~GTg~~~~~~~~---~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~ 147 (154) |||||+|...+.+ ++.+|+++.+++.|++|+||||||||+||+++++++|.++|. 
T Consensus 80 E~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:97 80 NYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred ccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 9999999998875 667999999999999999999999999999999999999999 No 5 >protein:vir:5978 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690678;genbank:geneid:6329146;genbank:gi:22855072;interpro:IPR011693;uniprot:O48447;genbank:GeneID:955318 Probab=100.00 E-value=7.2e-38 Score=224.21 Aligned_cols=141 Identities=21% Similarity=0.290 Sum_probs=126.5 Q ss_pred CCccch--hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCceEEEEecCCC Q lcl|NC_011308. 10 GGHMAD--DIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRKQEVKIGNSSD 87 (154) Q Consensus 10 ~~~Ma~--~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~ 87 (154) ---||+ ++++..+|.+.++.+++.+.+.+++++.+.+.+ +++.++.++|||||+|++||++++..++++++|+++++ T Consensus 1 m~~ms~~i~~~g~~~l~~~l~~~~~~~~~~v~~~l~~~a~~-i~~~ak~~apv~TG~Lr~SI~~~~~~~g~~~~V~~~~~ 79 (144) T protein:vir:59 1 MALMSVRIDPSWRRIMSRNVRTFSGHVLTQVEQVIIKTAEK-IAGLAASLAPVDEGNLKNSIQIDYKNNGLTAEITVGAE 79 (144) T ss_pred CCcceeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhCCccchhhhcCeeEEeecCcEEEEEecCCC Confidence 112333 456888999999999999999999988777655 56789999999999999999999998999999999999 Q ss_pred cccccccCccccccCCCCcccccceec-ccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhh Q lcl|NC_011308. 88 YAIYYEFGTGEKSEKGGGRAGGWSYMD-KNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFG 151 (154) Q Consensus 88 YA~yVE~GTg~~~~~~~~~~~~~~~~~-~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~ 151 (154) ||+||||||++|+++|++++++|++.. 
..|.|++|+||||||||+||++.+++.|.+.|++.++ T Consensus 80 YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~~~~~i~~~~g 144 (144) T protein:vir:59 80 YAIYVEYGTGIYAVDGNGRKTPWTYYSPKLGRYVRTQGAPAQPFFWPAVEEGGEYFEREMRRLRG 144 (144) T ss_pred ccchhhcCccccccCCCccccccccccccccceecCCCCCCCcchhHHHHHHHHHHHHHHHHhcC Confidence 999999999999999999999998866 5689999999999999999999999999999999999 No 6 >protein:vir:95894 Length: 137 # NCBI annotation: ORF046 # Family: family:all:180 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240389;genbank:gi:66396083;genbank:GeneID:5133405 Probab=100.00 E-value=7.4e-38 Score=224.15 Aligned_cols=134 Identities=24% Similarity=0.301 Sum_probs=124.6 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCceEEEEecCCCccccc Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRKQEVKIGNSSDYAIYY 92 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~yV 92 (154) ||..+++.++|.+.++.+.+.+.+.+.+++.+.+. .+++.++.++|||||+|++||++++..++++++|+++++||+|| T Consensus 1 Ma~~~~G~~~l~~~l~~~~~~~~~~~~~~~~~~a~-~v~~~ak~~aPv~TG~L~~Si~~~~~~~~~~~~V~~~~~YA~~v 79 (137) T protein:vir:95 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTA-KIHNTIISLMPVDTGYLRESVTMDFKDGGFTGVINIGSEYAIYV 79 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhCCccchhhhcCeeeEeeCCceEEEEecCCCccccc Confidence 99988999999999999999999999988877655 55678999999999999999999999899999999999999999 Q ss_pred ccCccccccCCCC---cccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHH Q lcl|NC_011308. 93 EFGTGEKSEKGGG---RAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVI 147 (154) Q Consensus 93 E~GTg~~~~~~~~---~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~ 147 (154) |||||+|...+.+ ++.+|++..+.|.|++|+||||||||+||+++++++|.++|. 
T Consensus 80 E~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~i~k~l~ 137 (137) T protein:vir:95 80 NYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred ccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 9999999998875 567999999999999999999999999999999999999999 No 7 >protein:vir:106570 Length: 182 # NCBI annotation: putative protein # Family: family:all:6475 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958588;genbank:gi:41179258;genbank:GeneID:2717106 Probab=100.00 E-value=1e-37 Score=223.33 Aligned_cols=143 Identities=21% Similarity=0.175 Sum_probs=115.0 Q ss_pred ccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHhcCCccccccccceeEEe--ecCceEEEEecCC Q lcl|NC_011308. 12 HMADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKT---EIAEGGHGDSSNNVTGEYANKTDFEV--DKRKQEVKIGNSS 86 (154) Q Consensus 12 ~Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~---~i~~~~ak~~aPvdTG~Lr~SI~~~~--~~~~~~~~V~~~~ 86 (154) =|+++|++.++|.+.++.+++.+++++.+++.+++. ..++..|+.++|||||+|++||++++ .+++++++|++++ T Consensus 1 m~~v~i~Gld~L~~kl~~~~~~~~~~v~~a~~~~~~~~a~~v~~~ak~~~PvdtG~Lr~SI~~~~~~~~~~~~g~V~~~~ 80 (182) T protein:vir:10 1 MIEVELKGVNELRAKLKKLPDIMAKATANAQENAIEQAEAYAVDELQSSIKYSTGELTRSFKHEVKVDGDEVIGRWWNSS 80 (182) T ss_pred CeEEEEecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCchhhhhceeeeeeecCCeEEEEeecCC Confidence 333588999999999998887777666665544432 23456789999999999999998665 4567899999999 Q ss_pred CcccccccCccccccCC----------CCcccccceecc------------------cCceeecCCCCCCchhHHHHHHH Q lcl|NC_011308. 87 DYAIYYEFGTGEKSEKG----------GGRAGGWSYMDK------------------NGKWHFTRGSKASKRMRYTFRDE 138 (154) Q Consensus 87 ~YA~yVE~GTg~~~~~~----------~~~~~~~~~~~~------------------~g~~~~t~g~~a~PFl~pA~~~~ 138 (154) +||+|||||||+|+... ..++++|+++.. 
.|.||+|.||||||||+||++++ T Consensus 81 ~ya~yvE~GTG~~~~~~~~~~~p~~~~~~~~~~w~~~~~~v~~~~a~~~~~~~~~~~~~~~~~t~G~~aqPFl~pA~~~~ 160 (182) T protein:vir:10 81 MVAVFREFGTGLVGERSHKQLPKNVAIIYRQTPWFFPVDSVDLDLTKIYGIPKIKINGKYFYRTTGQPARQFMTPAANKM 160 (182) T ss_pred CccceeecCcccccccCccccCccceeeeecCCceeeccccccccccccccceeeecCceEeecCCCCCCcchHHHHHHh Confidence 99999999999987533 347889976432 37889999999999999999999 Q ss_pred HHHHHHHHHHHhhc-cC Q lcl|NC_011308. 139 KSKVKDYVIKVFGG-LD 154 (154) Q Consensus 139 ~~~i~~~i~~~l~~-l~ 154 (154) ++++.++|.+++++ |. T Consensus 161 ~~~i~~~i~~~i~~~l~ 177 (182) T protein:vir:10 161 AKEAPEIIKRSIDQELH 177 (182) T ss_pred HHHHHHHHHHHHHHHHH Confidence 99999999887776 33 No 8 >protein:vir:96121 Length: 137 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240082;genbank:gi:66395767;genbank:GeneID:5133101 Probab=100.00 E-value=1.5e-37 Score=222.40 Aligned_cols=134 Identities=24% Similarity=0.297 Sum_probs=123.4 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCceEEEEecCCCccccc Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRKQEVKIGNSSDYAIYY 92 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~yV 92 (154) ||.-+++.++|.+.++.+.+.+.+.+.+++.+.+.+ +++.|+.++|||||+|++||++++..++++++|+++++||+|| T Consensus 1 Ma~~~~G~~~l~~~l~~~~~~~~~~~~~~l~~~a~~-~~~~ak~~~pvdTG~L~~Si~~~~~~~g~~~~V~~~~~YA~yv 79 (137) T protein:vir:96 1 MAKVKYGNWDLVAELEDYRDEMEEWVKKGILKTTLA-IYNTAVALAPVDLGFLKESIDFKVTDGGFSSVISVGAEYAIYV 79 (137) T ss_pred CchhHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhCCcCccchhcCceeEeecCceEEEEecCCCccccc Confidence 997667888898889888899999999888777655 5678999999999999999999998899999999999999999 Q ss_pred ccCccccccCCCC---cccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHH Q lcl|NC_011308. 
93 EFGTGEKSEKGGG---RAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVI 147 (154) Q Consensus 93 E~GTg~~~~~~~~---~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~ 147 (154) |||||+|..+|.+ ++.+|++....|.|++|+||+|||||+||+++++++|.++|. T Consensus 80 E~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~pFl~pA~~~~~~~i~k~i~ 137 (137) T protein:vir:96 80 EFGTGIYATGPGGSRARKLPWTYKGDDGEWHTTYGQQAQPFWNPAIDEGRKVFNRYFS 137 (137) T ss_pred ccCccccccCCCccccccccceeeccCcceeecCCCCCCcchhHHHHHHHHHHHHhhC Confidence 9999999998886 667999999999999999999999999999999999999999 No 9 >protein:vir:107099 Length: 137 # NCBI annotation: conserved phage protein # Family: family:all:180 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950610;genbank:gi:119953690;genbank:GeneID:4643108 Probab=100.00 E-value=1.5e-37 Score=222.42 Aligned_cols=134 Identities=22% Similarity=0.292 Sum_probs=122.7 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCceEEEEecCCCccccc Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRKQEVKIGNSSDYAIYY 92 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~yV 92 (154) ||.-+.+.++|.+.++.+++.+.+.+++++.+.+ ..+++.|+.++|||||+|++||++.+..++++++|+++++||+|| T Consensus 1 Ma~~~~Gl~~l~~~l~~~~~~~~~~~~~al~~~a-~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~Ya~~v 79 (137) T protein:vir:10 1 MAKVKYGNWELVKELEDFEKETIRWAKKGIAKTT-TIIHNSIVSNMPVDTGYLRESVSMDFKKGGLTGVINIGSEYAVYV 79 (137) T ss_pred CchhHhhHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhCCcCcchhhcCeeEEeeCCcEEEEEecCCCccccc Confidence 9976678899999999999999999988876655 456788999999999999999999988889999999999999999 Q ss_pred ccCccccccCCCCc---ccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHH Q lcl|NC_011308. 93 EFGTGEKSEKGGGR---AGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVI 147 (154) Q Consensus 93 E~GTg~~~~~~~~~---~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~ 147 (154) |||||+|...|.++ +.+|+++..++.|++|+||+|||||+||+++++++|+++|. 
T Consensus 80 E~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~i~k~i~ 137 (137) T protein:vir:10 80 NYGTGIYAVGPGGSRAKNIPWCYKDADGHWHTTKGQHAQPFWEPAIDEGRAFFNKYFS 137 (137) T ss_pred ccCccccccCCCccccccccceeeccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Confidence 99999999888755 56799999999999999999999999999999999999999 No 10 >protein:vir:94108 Length: 149 # NCBI annotation: ORF029 # Family: family:all:180 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240238;genbank:gi:66395914;genbank:GeneID:5133277 Probab=100.00 E-value=1.7e-37 Score=222.17 Aligned_cols=145 Identities=23% Similarity=0.242 Sum_probs=126.1 Q ss_pred CC-cceeeecCCccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCceE Q lcl|NC_011308. 1 MR-SRLLIDRGGHMADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRKQE 79 (154) Q Consensus 1 ~~-~~~~~~~~~~Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~ 79 (154) |. |-...-|.+ ||+-..|.++|.+.++.+.+.+.+.+.+++.+.+. .+++.|+.++|||||+|++||++++..++++ T Consensus 1 ~~~~~~~~~~~~-Ma~~~~Gld~l~~~L~~~~~~~~~~~~~al~~~a~-~v~~~ak~~aPvdTG~Lr~SI~~~~~~~g~~ 78 (149) T protein:vir:94 1 MKLSYYDLSRCH-MAKVKYGADSMVVELDKFDKKIEEWVKKGIAKTTT-KIYNTAVALAPVDLGFLEESIDFKYFDGGLS 78 (149) T ss_pred Ceeeeeecchhh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhCCcccchhhcCeeEEeeCCcEE Confidence 22 222334554 98755588889999999999999999988877665 4567899999999999999999999888999 Q ss_pred EEEecCCCcccccccCccccccCCCCc---ccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHH Q lcl|NC_011308. 80 VKIGNSSDYAIYYEFGTGEKSEKGGGR---AGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVI 147 (154) Q Consensus 80 ~~V~~~~~YA~yVE~GTg~~~~~~~~~---~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~ 147 (154) ++|+++++||+|||||||+|+..|.++ +.+|+|.+..|.+++|+||||||||+||+++++++|+++|. 
T Consensus 79 ~~V~~~~~YA~~VE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~~~~i~~~i~ 149 (149) T protein:vir:94 79 SVISVGADYAIYVEYGTGIYATGPGGSRATKIPWSFKGDDGEWYTTYGQAPQPFWNPAIDAGRKTFEQYFS 149 (149) T ss_pred EEEecCCCcccccccCccccccCCCccccccccceeecCccceecCCCCCCCcchHHHHHHHHHHHHHhhC Confidence 999999999999999999999888765 56799999999999999999999999999999999999999 No 11 >protein:vir:94796 Length: 137 # NCBI annotation: ORF050 # Family: family:all:180 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240540;genbank:gi:66396237;genbank:GeneID:5133576 Probab=100.00 E-value=1.8e-37 Score=222.06 Aligned_cols=134 Identities=24% Similarity=0.294 Sum_probs=122.5 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCceEEEEecCCCccccc Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRKQEVKIGNSSDYAIYY 92 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~yV 92 (154) ||.-..+.++|.+.++.+.+.+.+.+.+++.+.+. .+++.++.++|||||+|++||++++..++++++|+++++||+|| T Consensus 1 Ma~~~~G~~~l~~~L~~~~~~~~~~~~~al~~~a~-~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~v 79 (137) T protein:vir:94 1 MAKVKYGNWDLVKELENYERDIERWVKRGIAKTTV-KIHNTIISLMPVDTGYLRESVTMDFKDGGFTGVINIGSEYAIYV 79 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhCCcCcchhhcCceeEeecCcEEEEEecCCCccccc Confidence 99655588899999999999998888888776655 45778999999999999999999999999999999999999999 Q ss_pred ccCccccccCCCC---cccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHH Q lcl|NC_011308. 93 EFGTGEKSEKGGG---RAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVI 147 (154) Q Consensus 93 E~GTg~~~~~~~~---~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~ 147 (154) |||||+|.+.+++ ++.+|+++.+.|.|++|+||||||||+||+++++++|.++|. 
T Consensus 80 E~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:94 80 NYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRVFFNKYFS 137 (137) T ss_pred ccCccccccCCCcccccccccceeccCCceeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 9999999998875 667899999999999999999999999999999999999999 No 12 >protein:vir:105330 Length: 137 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950673;genbank:gi:119967843;genbank:GeneID:4643209 Probab=100.00 E-value=1.5e-37 Score=222.42 Aligned_cols=134 Identities=22% Similarity=0.288 Sum_probs=122.9 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCceEEEEecCCCccccc Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRKQEVKIGNSSDYAIYY 92 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~yV 92 (154) ||+...+.++|.+.++.+++.+.+.+.+++.+.+.+ +++.++.++|||||+|++||++++..++++++|+++++||+|| T Consensus 1 Ma~~~~G~~~l~~~l~~~~~~~~~~~~~al~~~a~~-i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~v 79 (137) T protein:vir:10 1 MAKVKYGNWDLVKELEEFEKETIRWAKKGIAKTTTI-IHNSIVSNMPVDTGYLRESVSMDFKKGGLTGVINIGSEYAVYV 79 (137) T ss_pred CccchhCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhCCcCcchhhcCeeeEecCCcEEEEEecCCcccccc Confidence 997656888999999999999999998888776655 5778999999999999999999998889999999999999999 Q ss_pred ccCccccccCCCCc---ccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHH Q lcl|NC_011308. 93 EFGTGEKSEKGGGR---AGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVI 147 (154) Q Consensus 93 E~GTg~~~~~~~~~---~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~ 147 (154) |||||+|...|.++ +.+|+|+...|.|++|+||||||||+||+++++++|.++|. 
T Consensus 80 E~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 137 (137) T protein:vir:10 80 NYGTGIYAVGPGGSRAKNIPWRYKDADGHWHTTKGQHAQPFWEPAIDEGRAFFNKYFS 137 (137) T ss_pred ccCccccccCCCcccccccceeeeccccccccCCCCCCCcchhHHHHHHHHHHHHhhC Confidence 99999999888765 56899999999999999999999999999999999999999 No 13 >protein:vir:96829 Length: 135 # NCBI annotation: ORF033 # Family: family:all:180 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240161;genbank:gi:66395838;genbank:GeneID:5133170 Probab=100.00 E-value=1.1e-36 Score=217.65 Aligned_cols=134 Identities=21% Similarity=0.229 Sum_probs=122.4 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCceEEEEecCCCccccc Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRKQEVKIGNSSDYAIYY 92 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~yV 92 (154) ||....+.++|.+.++.+.+.+.+.+.+++.+.+. .+++.|+.++|||||+|++||++++..++++++|+++++||+|| T Consensus 1 Ma~~~~Gl~~l~~~l~~~~~~~~~~~~~al~~~a~-~v~~~ak~~apvdTG~Lr~SI~~~~~~~g~~~~V~~~~~YA~~v 79 (135) T protein:vir:96 1 MAKVKYGADSIVVDLEKYSKDMEKWVKKGITKTTL-KIYNTAIHLMPVDTGFLRQSTTVDFENGGFTGVVKIGSNYAVYV 79 (135) T ss_pred CchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhCCccchhhhcceeEEeecCcEEEEEecCCCccchh Confidence 99754588889998999999998888888776654 45778999999999999999999999999999999999999999 Q ss_pred ccCccccccCCC-CcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHH Q lcl|NC_011308. 93 EFGTGEKSEKGG-GRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVI 147 (154) Q Consensus 93 E~GTg~~~~~~~-~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~ 147 (154) |||||+|.+.+. +++.+|++.++.|.|++|+||||||||+||+++++++|+++|. 
T Consensus 80 e~GT~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~a~pfl~~A~~~~~~~~~~~i~ 135 (135) T protein:vir:96 80 NYGTGIYATKGSRAHKIPWTYKDPNGKWHTTYGQMPQPFWEPAIDAGRQTFEQYFS 135 (135) T ss_pred hcccccccCCCccccccccccccCCcceeecCCcCCCcchhHHHHHHHHHHHHhcC Confidence 999999998776 6888999999999999999999999999999999999999999 No 14 >protein:vir:105916 Length: 149 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004379;genbank:gi:122891834;genbank:GeneID:4712387 Probab=100.00 E-value=1.4e-36 Score=217.11 Aligned_cols=145 Identities=23% Similarity=0.254 Sum_probs=124.4 Q ss_pred CCcce-eeecCCccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCceE Q lcl|NC_011308. 1 MRSRL-LIDRGGHMADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRKQE 79 (154) Q Consensus 1 ~~~~~-~~~~~~~Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~ 79 (154) |.-.. ..-|. |||+-..+.++|.+.++.+.+.+.+.+++++.+.+. .+++.|+.++|||||+|++||++++..++++ T Consensus 1 ~~~~~~~~~~~-~Ma~v~~Gld~l~~~l~~~~~~~~~~~~~~l~~~a~-~v~~~ak~~aPvdTG~L~~SI~~~~~~~g~~ 78 (149) T protein:vir:10 1 MKLNYYDLSRC-HMAKVKYGADSMVVELDKFDKKIEEWVKKGIAKTTT-KIYNTAVALAPVDLGFLEESIDFKYFDGGLS 78 (149) T ss_pred Ceeeeeccchh-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhCCcccchhhccceEEecCCcEE Confidence 22111 12233 498644578889998999999999999888877654 5577899999999999999999999888999 Q ss_pred EEEecCCCcccccccCccccccCCCCc---ccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHH Q lcl|NC_011308. 80 VKIGNSSDYAIYYEFGTGEKSEKGGGR---AGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVI 147 (154) Q Consensus 80 ~~V~~~~~YA~yVE~GTg~~~~~~~~~---~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~ 147 (154) ++|+++++||+|||||||+|+..|.++ +.+|++.+..+.|++|+||||||||+||+++++++|+++|. 
T Consensus 79 ~~V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~k~~i~~~i~ 149 (149) T protein:vir:10 79 SVISVGADYAIYVEYGTGIYATGPGGSRATKIPWSFKGDDGEWYTTYGQAPQPFWNPAIDAGRKTFEQYFS 149 (149) T ss_pred EEEecCCCcccccccCccccccCCcccccccccceeeccccceecCCCCCCCcchhHHHHHHHHHHHHhhC Confidence 999999999999999999999888765 56799999999999999999999999999999999999999 No 15 >protein:vir:1243 Length: 116 # NCBI annotation: similar to phage Spp1 gp16.1 # Family: family:all:180 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510942;genbank:gi:17426276;genbank:GeneID:927389 Probab=100.00 E-value=6.1e-36 Score=213.64 Aligned_cols=113 Identities=25% Similarity=0.344 Sum_probs=101.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCceEEEEecCCCcccccccCccccccCCCC---ccccc Q lcl|NC_011308. 34 AEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRKQEVKIGNSSDYAIYYEFGTGEKSEKGGG---RAGGW 110 (154) Q Consensus 34 ~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~yVE~GTg~~~~~~~~---~~~~~ 110 (154) +++.+++++.+.+.+ +++.|+.++|||||+|++||++++..++++++|+++++||+|||||||+|+.++.+ ++.+| T Consensus 1 v~~~v~~~~~~~~~~-i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~V~~~~~YA~yvE~GTg~~~~~~~~~~~~~~~~ 79 (116) T protein:vir:12 1 MERWVKRGIAKTTAK-IHNTIISLMPVDTGYLRESVTMDFKDGGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPW 79 (116) T ss_pred ChHHHHHHHHHHHHH-HHHHHHHhCCcCcccccccceEEeecCcEEEEEecCCCcccccccCCcccccCCCcccccccce Confidence 555555555555444 46678999999999999999999999999999999999999999999999999986 88899 Q ss_pred ceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHH Q lcl|NC_011308. 111 SYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVI 147 (154) Q Consensus 111 ~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~ 147 (154) +|++..|+|++|+||+|||||+||+++++++|.++|. 
T Consensus 80 ~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 116 (116) T protein:vir:12 80 SYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred eeecCCceeeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 9999999999999999999999999999999999988 No 16 >protein:vir:97327 Length: 116 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240615;genbank:gi:66396305;genbank:GeneID:5133683 Probab=100.00 E-value=6.1e-36 Score=213.64 Aligned_cols=113 Identities=25% Similarity=0.344 Sum_probs=101.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCceEEEEecCCCcccccccCccccccCCCC---ccccc Q lcl|NC_011308. 34 AEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRKQEVKIGNSSDYAIYYEFGTGEKSEKGGG---RAGGW 110 (154) Q Consensus 34 ~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~yVE~GTg~~~~~~~~---~~~~~ 110 (154) +++.+++++.+.+.+ +++.|+.++|||||+|++||++++..++++++|+++++||+|||||||+|+.++.+ ++.+| T Consensus 1 v~~~v~~~~~~~~~~-i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~V~~~~~YA~yvE~GTg~~~~~~~~~~~~~~~~ 79 (116) T protein:vir:97 1 MERWVKRGIAKTTAK-IHNTIISLMPVDTGYLRESVTMDFKDGGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPW 79 (116) T ss_pred ChHHHHHHHHHHHHH-HHHHHHHhCCcCcccccccceEEeecCcEEEEEecCCCcccccccCCcccccCCCcccccccce Confidence 555555555555444 46678999999999999999999999999999999999999999999999999986 88899 Q ss_pred ceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHH Q lcl|NC_011308. 111 SYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVI 147 (154) Q Consensus 111 ~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~ 147 (154) +|++..|+|++|+||+|||||+||+++++++|.++|. 
T Consensus 80 ~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 116 (116) T protein:vir:97 80 SYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred eeecCCceeeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 9999999999999999999999999999999999988 No 17 >protein:vir:95062 Length: 116 # NCBI annotation: ORF044 # Family: family:all:180 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240827;genbank:gi:66394711;genbank:GeneID:5133856 Probab=100.00 E-value=7.1e-36 Score=213.29 Aligned_cols=113 Identities=25% Similarity=0.336 Sum_probs=101.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCceEEEEecCCCcccccccCccccccCCCCc---cccc Q lcl|NC_011308. 34 AEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRKQEVKIGNSSDYAIYYEFGTGEKSEKGGGR---AGGW 110 (154) Q Consensus 34 ~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~yVE~GTg~~~~~~~~~---~~~~ 110 (154) +++.+++++.+.+.+ +++.|+.++|||||+|++||++.+..++++++|+++++||+|||||||+|+++|+++ +.+| T Consensus 1 v~~~v~~~~~~~~~~-i~~~ak~~apv~TG~Lr~SI~~~~~~~~~~~~V~~~~~Ya~yvE~GTg~~~~~~~~~~~~~~~~ 79 (116) T protein:vir:95 1 MERWVKRGIAKTTAK-IHNTIISLMPVDTGYLRESVTMDFKDGGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKNIPW 79 (116) T ss_pred ChHHHHHHHHHHHHH-HHHHHHhhCCccccccccceeEEeecCcEEEEEecCCCccceeecCccccccCCCccccccccc Confidence 555555555555444 467889999999999999999999999999999999999999999999999999766 6789 Q ss_pred ceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHH Q lcl|NC_011308. 
111 SYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVI 147 (154) Q Consensus 111 ~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~ 147 (154) +|++..|+||+|+||+|||||+||++++++.|.++|- T Consensus 80 ~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~is 116 (116) T protein:vir:95 80 SYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred eeecCccceeeCCCCCCCcchHHHHHHHHHHHHHhhC Confidence 9999999999999999999999999999999999998 No 18 >protein:vir:101594 Length: 173 # NCBI annotation: hypothetical protein # Family: family:all:26502 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112510;genbank:gi:53793610;interpro:IPR010064;uniprot:Q5ZGE3;genbank:GeneID:3101702 Probab=99.95 E-value=1.4e-31 Score=189.76 Aligned_cols=138 Identities=15% Similarity=0.075 Sum_probs=108.4 Q ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEe--ecCceEEEEecCCCccccc Q lcl|NC_011308. 15 DDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEV--DKRKQEVKIGNSSDYAIYY 92 (154) Q Consensus 15 ~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~--~~~~~~~~V~~~~~YA~yV 92 (154) ++|+|.++|.+.++.+.+.+.+.+.+++.. +++.++..|+.+||||||+|++||+++. .++++++.|+++++||.|| T Consensus 1 i~i~Gld~L~~~L~~l~~~~~~~~~~a~~~-~a~~i~~~ak~~aPv~TG~Lr~sI~~~~~~~~~~~~~~v~~~~~Ya~fv 79 (173) T protein:vir:10 1 MAVKGVAEVIAELRKIGKDIDKNINATTEE-AANFIEDRAKTLAPKNFGKLAQSISTSDLKAKDLISKKITVNELYGAYM 79 (173) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhCCcCchhhhhcceeeeeccCceeEEeeCCCcccchhh Confidence 678899999999999998888777776654 5566778899999999999999998765 3455888999999999999 Q ss_pred ccCccccccCCC----------Ccccccceec-------------------ccCceeecCCCCCCchhHHHHHHHHHHHH Q lcl|NC_011308. 93 EFGTGEKSEKGG----------GRAGGWSYMD-------------------KNGKWHFTRGSKASKRMRYTFRDEKSKVK 143 (154) Q Consensus 93 E~GTg~~~~~~~----------~~~~~~~~~~-------------------~~g~~~~t~g~~a~PFl~pA~~~~~~~i~ 143 (154) ||||+.+...|+ +..++|+... ....++.|.||+|||||+||++++++++. 
T Consensus 80 EfGT~~m~a~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~G~~aqPFl~PA~~~~~~~~~ 159 (173) T protein:vir:10 80 EFGTGAKVSVPKEFADMAASFKGQKTGSFKDGLESIKAWCRAKGIDEKAAYPIFAKILGAGINPQPFLYPAWIEGKKQYL 159 (173) T ss_pred hcccccccCCCchhhhhhcccccccccccccccccccccccccccchhcccceeeEeecCCCCCCccchhHHHHhHHHHH Confidence 999998766554 3344443321 12235667899999999999999997755 Q ss_pred HH----HHHHhhcc Q lcl|NC_011308. 144 DY----VIKVFGGL 153 (154) Q Consensus 144 ~~----i~~~l~~l 153 (154) ++ |.++|++| T Consensus 160 ~~i~~~i~~~lrk~ 173 (173) T protein:vir:10 160 KDLENLLKTYNKKI 173 (173) T ss_pred HHHHHHHHHHhhcC Confidence 55 56667777 No 19 >protein:vir:94654 Length: 142 # NCBI annotation: tail component protein # Family: family:all:1084 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579211;genbank:gi:93007447;genbank:GeneID:5076773 Probab=99.95 E-value=1.6e-31 Score=189.44 Aligned_cols=137 Identities=14% Similarity=0.065 Sum_probs=109.5 Q ss_pred cch-hhh-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCc--eEEEEecCCCc Q lcl|NC_011308. 13 MAD-DIK-FEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRK--QEVKIGNSSDY 88 (154) Q Consensus 13 Ma~-~v~-~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~--~~~~V~~~~~Y 88 (154) ||. +++ ..+++.+.++.+.+.+.+.+.+++.+.+.+ +++.++.++|||||+|++||++.+..++ ++++|+++++| T Consensus 1 Ma~~~~~~~~~~l~~~l~~~~~~~~~~~~~~l~~~a~~-i~~~ak~~aPv~TG~Lr~SI~~~~~~~g~~~~~~v~~~~~Y 79 (142) T protein:vir:94 1 MAGLNYRVNSTEFQGALRAALDRLTGAAREATEAAAND-MVNMAKGLCPVDTGRLRSSIQAVPSGGRFSFSVTIGTNVTY 79 (142) T ss_pred CceeEEEecHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhCCccchhhhccceeeeccCCceEEEEEecCccc Confidence 884 333 335567777777777777788777766555 4678999999999999999998876554 67899999999 Q ss_pred ccccccCccccccCCCCcccccceec-ccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhh Q lcl|NC_011308. 
89 AIYYEFGTGEKSEKGGGRAGGWSYMD-KNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFG 151 (154) Q Consensus 89 A~yVE~GTg~~~~~~~~~~~~~~~~~-~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~ 151 (154) |.||||||+||.+.|+++++.|+... ....++.+.|++|||||+||+++++++|.+.|++. + T Consensus 80 A~~vE~Gt~~~~i~pk~~k~l~~~~~~~~~~~v~~pG~~~~pfl~~A~~~~~~~i~~~~~~~-~ 142 (142) T protein:vir:94 80 AADVEYGTAPHVIVPKDKKALYWPGAAHPVAKVNHPGTRAQPFMRPAIAAASTFLRNHAKGI-R 142 (142) T ss_pred chhhhccCCCceeccCCCccceecccceeeeeeeecCCCCCcchhHHHHHHHHHHHHHHHhc-C Confidence 99999999999999999999877433 34567778899999999999999998886665543 3 No 20 >protein:vir:107545 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1481 # MgeName: PG1 # Cross-refs: genbank:acc:NP_943803;genbank:gi:38638428;genbank:GeneID:2657225 Probab=99.93 E-value=6.7e-30 Score=180.53 Aligned_cols=131 Identities=11% Similarity=0.062 Sum_probs=97.1 Q ss_pred cch---hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecC---ceEEEEecCC Q lcl|NC_011308. 13 MAD---DIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKR---KQEVKIGNSS 86 (154) Q Consensus 13 Ma~---~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~---~~~~~V~~~~ 86 (154) |+. ++++..+.+.+.+.+...+++.+++++ ..+++.++.++|||||+|++||.+....+ .+++.|++++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~v~~~ak~~aPvdtG~Lr~SI~~~~~~~~~~~~~~~v~~~a 75 (140) T protein:vir:10 1 MATIRARARIEIDEAALERESGEHLRAFHRSLT-----RRIANQSRVAVPVRTGNLGRTIGELPQVYTPFRVRGGVEATA 75 (140) T ss_pred CeeeeeeeeeeeCHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHhcCCccchhhhccceeeeeeCCCceEEEEecCCc Confidence 763 355555555555555555555555433 35567789999999999999998765433 3678899999 Q ss_pred CcccccccCccccccCCCCcccccceeccc---CceeecCCCCCCchhHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 87 DYAIYYEFGTGEKSEKGGGRAGGWSYMDKN---GKWHFTRGSKASKRMRYTFRDEKSKVKDYVIK 148 (154) Q Consensus 87 ~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~---g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~ 148 (154) +||+||||||+||.+.|++++.+||+.+.. 
.+++++.|++|+|||+||++......+++-.- T Consensus 76 ~YA~~Ve~GT~ph~I~pk~~k~L~~~~~G~~~~~k~V~hpG~~a~Pfl~~A~~~~~~~~~~i~~~ 140 (140) T protein:vir:10 76 DYAAPVHEGSRPHAIRARNAQYLHFWWHGREMFRKSVWHPGTRARPFMRNSAQRVVTNDPRVRMT 140 (140) T ss_pred cchhhhccCCCCceeecCCCccceeecCCCEEEeeeeecCCCCCChhHHHHHHHHhhhhhhccCC Confidence 999999999999999999999999886533 24567789999999999999864443333322 No 21 >protein:vir:97982 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1482 # MgeName: Orion # Cross-refs: genbank:acc:YP_655121;genbank:gi:109391871;genbank:GeneID:4157345 Probab=99.93 E-value=6.7e-30 Score=180.53 Aligned_cols=131 Identities=11% Similarity=0.062 Sum_probs=97.1 Q ss_pred cch---hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecC---ceEEEEecCC Q lcl|NC_011308. 13 MAD---DIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKR---KQEVKIGNSS 86 (154) Q Consensus 13 Ma~---~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~---~~~~~V~~~~ 86 (154) |+. ++++..+.+.+.+.+...+++.+++++ ..+++.++.++|||||+|++||.+....+ .+++.|++++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~v~~~ak~~aPvdtG~Lr~SI~~~~~~~~~~~~~~~v~~~a 75 (140) T protein:vir:97 1 MATIRARARIEIDEAALERESGEHLRAFHRSLT-----RRIANQSRVAVPVRTGNLGRTIGELPQVYTPFRVRGGVEATA 75 (140) T ss_pred CeeeeeeeeeeeCHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHhcCCccchhhhccceeeeeeCCCceEEEEecCCc Confidence 763 355555555555555555555555433 35567789999999999999998765433 3678899999 Q ss_pred CcccccccCccccccCCCCcccccceeccc---CceeecCCCCCCchhHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 87 DYAIYYEFGTGEKSEKGGGRAGGWSYMDKN---GKWHFTRGSKASKRMRYTFRDEKSKVKDYVIK 148 (154) Q Consensus 87 ~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~---g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~ 148 (154) +||+||||||+||.+.|++++.+||+.+.. 
.+++++.|++|+|||+||++......+++-.- T Consensus 76 ~YA~~Ve~GT~ph~I~pk~~k~L~~~~~G~~~~~k~V~hpG~~a~Pfl~~A~~~~~~~~~~i~~~ 140 (140) T protein:vir:97 76 DYAAPVHEGSRPHAIRARNAQYLHFWWHGREMFRKSVWHPGTRARPFMRNSAQRVVTNDPRVRMT 140 (140) T ss_pred cchhhhccCCCCceeecCCCccceeecCCCEEEeeeeecCCCCCChhHHHHHHHHhhhhhhccCC Confidence 999999999999999999999999886533 24567789999999999999864443333322 No 22 >protein:vir:99101 Length: 142 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1608 # MgeName: Qyrzula # Cross-refs: genbank:acc:YP_655705;genbank:gi:109521783;genbank:GeneID:4157823 Probab=99.93 E-value=1e-29 Score=179.46 Aligned_cols=132 Identities=10% Similarity=0.035 Sum_probs=95.3 Q ss_pred cch-hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecC----ceEEEEecCCC Q lcl|NC_011308. 13 MAD-DIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKR----KQEVKIGNSSD 87 (154) Q Consensus 13 Ma~-~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~----~~~~~V~~~~~ 87 (154) |+. .+++.. +++.++.+.+.+...+++++...+. .+++.|+.++|||||+|++||.+++... ++++.|+++++ T Consensus 1 m~~~~~~~~g-l~~~l~~~~~~~~~~~~~~i~~~a~-~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~~v~~~a~ 78 (142) T protein:vir:99 1 MVQVSVRYEG-FDYNPVGAAAQVGPILRRTHSSLTR-QIANETRARVPVLTGHLGRSVREDPQVMVTPFHVSGGVTAHAK 78 (142) T ss_pred CceeEEEeee-cchhHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhCCccchhhhcceeeeeccccccceEEEEeccCcc Confidence 653 444432 3344455555555555555544443 4567889999999999999998765332 46788999999 Q ss_pred cccccccCccccccCCCCcccccceecccCcee-----ecCCCCCCchhHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 88 YAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWH-----FTRGSKASKRMRYTFRDEKSKVKDYVIK 148 (154) Q Consensus 88 YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~-----~t~g~~a~PFl~pA~~~~~~~i~~~i~~ 148 (154) ||+||||||+||.+.|...+.. 
.+...|.++ ++.|++|+|||+||++.+.++..++..+ T Consensus 79 YA~~ve~GT~ph~i~pk~~~al--~f~~~g~~~~~k~v~hpG~~a~Pfl~~A~~~~~~~~~~~~~r 142 (142) T protein:vir:99 79 YAAAVHEGTRPHVIRAKHAQAL--HFWWRGREVFVRQVNHPGTRARPYLRNAGEAVVRRDRRIRVR 142 (142) T ss_pred ccceeccCCccceeccccCcee--eEecCCceeeeeeeecCCCCCCchhHHHHHHHHhhhhhhccC Confidence 9999999999999998765543 333344444 5569999999999999999887777666 No 23 >protein:vir:8669 Length: 142 # NCBI annotation: gp27 # Family: family:all:1084 # MgeID: mge:156 # MgeName: Rosebush # Cross-refs: genbank:acc:NP_817788;genbank:gi:29566220;genbank:GeneID:1259476 Probab=99.93 E-value=1e-29 Score=179.46 Aligned_cols=132 Identities=10% Similarity=0.035 Sum_probs=95.3 Q ss_pred cch-hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecC----ceEEEEecCCC Q lcl|NC_011308. 13 MAD-DIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKR----KQEVKIGNSSD 87 (154) Q Consensus 13 Ma~-~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~----~~~~~V~~~~~ 87 (154) |+. .+++.. +++.++.+.+.+...+++++...+. .+++.|+.++|||||+|++||.+++... ++++.|+++++ T Consensus 1 m~~~~~~~~g-l~~~l~~~~~~~~~~~~~~i~~~a~-~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~~v~~~a~ 78 (142) T protein:vir:86 1 MVQVSVRYEG-FDYNPVGAAAQVGPILRRTHSSLTR-QIANETRARVPVLTGHLGRSVREDPQVMVTPFHVSGGVTAHAK 78 (142) T ss_pred CceeEEEeee-cchhHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhCCccchhhhcceeeeeccccccceEEEEeccCcc Confidence 653 444432 3344455555555555555544443 4567889999999999999998765332 46788999999 Q ss_pred cccccccCccccccCCCCcccccceecccCcee-----ecCCCCCCchhHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 88 YAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWH-----FTRGSKASKRMRYTFRDEKSKVKDYVIK 148 (154) Q Consensus 88 YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~-----~t~g~~a~PFl~pA~~~~~~~i~~~i~~ 148 (154) ||+||||||+||.+.|...+.. 
.+...|.++ ++.|++|+|||+||++.+.++..++..+ T Consensus 79 YA~~ve~GT~ph~i~pk~~~al--~f~~~g~~~~~k~v~hpG~~a~Pfl~~A~~~~~~~~~~~~~r 142 (142) T protein:vir:86 79 YAAAVHEGTRPHVIRAKHAQAL--HFWWRGREVFVRQVNHPGTRARPYLRNAGEAVVRRDRRIRVR 142 (142) T ss_pred ccceeccCCccceeccccCcee--eEecCCceeeeeeeecCCCCCCchhHHHHHHHHhhhhhhccC Confidence 9999999999999998765543 333344444 5569999999999999999887777666 No 24 >protein:vir:3617 Length: 112 # NCBI annotation: ORF40 # Family: family:all:180 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112703;genbank:gi:13786571;genbank:GeneID:921069 Probab=99.92 E-value=7.9e-29 Score=174.65 Aligned_cols=110 Identities=17% Similarity=0.182 Sum_probs=89.5 Q ss_pred cchhh--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCceEEEEecCCCccc Q lcl|NC_011308. 13 MADDI--KFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRKQEVKIGNSSDYAI 90 (154) Q Consensus 13 Ma~~v--~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~ 90 (154) |+..| +|.++|.+.++.+. ..+.+++++.+.+ ..++..++.++|||||+|++||+++...++++++|+++++||+ T Consensus 1 M~~~i~i~Gld~l~~~L~~~~--~~~~~~~al~~~~-~~i~~~ak~~aPvdTG~Lr~si~~~~~~~~~~~~V~~~~~Ya~ 77 (112) T protein:vir:36 1 MKSSLSFKGIDQLVKHLDKAA--SLKGVQQVVKSNT-SNMTANMQKLVPVDTGYMKRSIKMELTEGGFSGQAGPHTDYSA 77 (112) T ss_pred CceeeeehhHHHHHHHHHhhh--hHHHHHHHHHHHH-HHHHHHHHHhCCCCchhhhhceeeeecCCceEEEeecCCCccc Confidence 88755 45555544444332 2344555555544 4556789999999999999999999888899999999999999 Q ss_pred ccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhh Q lcl|NC_011308. 
91 YYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFG 151 (154) Q Consensus 91 yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~ 151 (154) |||||| +.|||||||+||++.+++++.+.|++.|+ T Consensus 78 ~vE~GT--------------------------~k~~a~Pfl~pa~~~~~~~~~~~i~~~lr 112 (112) T protein:vir:36 78 YVEYGT--------------------------RFQSAQPFVKPAYNEQKGVFIKDLERLLK 112 (112) T ss_pred eeeccc--------------------------cccCCCcchhhhHHHHHHHHHHHHHHHcC Confidence 999998 47999999999999999999999999999 No 25 >protein:vir:95789 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950593;genbank:gi:119953788;genbank:GeneID:5076859 Probab=99.92 E-value=1.5e-28 Score=173.12 Aligned_cols=113 Identities=14% Similarity=0.061 Sum_probs=99.2 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCceEEEEecCCCccccc Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRKQEVKIGNSSDYAIYY 92 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~yV 92 (154) |+++|+|.++|.+.++.+.+.+.+.+.+++.+.+.. ++..++.+||||||+|++||+++ .++++++|+++++||+|| T Consensus 1 msi~i~Gld~l~~~l~~~~~~~~~~v~~al~~~a~~-i~~~ak~~aPv~TG~Lr~sI~~~--~~g~~~~V~~~~~Ya~yv 77 (114) T protein:vir:95 1 MAIKWQGIEKLVATISNAQPKAVEQSLQVLKNNGEK-GKRIAKQLAPKDTEFLKDHITTS--YPGMEAHIHGEAGYDGYQ 77 (114) T ss_pred CeeeeehHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhCCcCchhhhhceeee--cCceEEEeecCCCcccee Confidence 999999999999999999998888898887766554 56789999999999999999875 467899999999999999 Q ss_pred ccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_011308. 
93 EFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGGLD 154 (154) Q Consensus 93 E~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~l~ 154 (154) |||| ++|+|||||+||++.+++++.+.|.+.|++-= T Consensus 78 E~GT--------------------------~~~~aqPfl~pa~~~~~~~~~~~l~~~l~~~~ 113 (114) T protein:vir:95 78 EYGT--------------------------RFQPGTPHFRPMMEQIQPQFQKDMTDVMKGAF 113 (114) T ss_pred ecCc--------------------------cccCCCccchhhHHHHHHHHHHHHHHHHHhhc Confidence 9998 47999999999999999998877777766544 No 26 >protein:vir:9930 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795692;genbank:gi:28876456;genbank:GeneID:1257995 Probab=99.92 E-value=2.6e-28 Score=171.83 Aligned_cols=108 Identities=17% Similarity=0.113 Sum_probs=98.5 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCceEEEEecCCCcccccccCc Q lcl|NC_011308. 17 IKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRKQEVKIGNSSDYAIYYEFGT 96 (154) Q Consensus 17 v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~yVE~GT 96 (154) |+|.++|.+.++.+.+.+.+.+++++.+.+.. ++..++.++|||||+|++||+++.. +++++.|+++++||+|||||| T Consensus 1 i~Gld~l~~~l~~~~~~~~~~v~~al~~~a~~-i~~~ak~~aPv~TG~Lr~sI~~~~~-~~~~~~v~~~~~Ya~~vE~GT 78 (108) T protein:vir:99 1 MRGLDRFLRSVERKQKSVRIAVDKELSKSAAR-IERQAKILAPVDTGWLRAQIYSEQQ-RLLHYRVVSPALYSIYLELGT 78 (108) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHhcCCcCchhhhcceeeeec-CcEEEEeecCcccchhcccCc Confidence 89999999999999999999998888776655 5668999999999999999998865 457899999999999999998 Q ss_pred cccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_011308. 97 GEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGG 152 (154) Q Consensus 97 g~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~ 152 (154) ++|+|||||+||++.+++++.+.|++.|+. 
T Consensus 79 --------------------------~~m~a~Pf~~pa~~~~~~~~~~~i~~~lrk 108 (108) T protein:vir:99 79 --------------------------RKMEAQSFLDPALRKEWPVLMANIKKMFKR 108 (108) T ss_pred --------------------------cccCCCcchhhhHHHHHHHHHHHHHHHhcC Confidence 479999999999999999999999999999 No 27 >protein:vir:106041 Length: 137 # NCBI annotation: gp23 # Family: family:all:1084 # MgeID: mge:1505 # MgeName: Cooper # Cross-refs: genbank:acc:YP_654920;genbank:gi:109392376;genbank:GeneID:4157069 Probab=99.92 E-value=5.5e-29 Score=175.51 Aligned_cols=126 Identities=10% Similarity=0.054 Sum_probs=97.7 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecC---ceEEEEecCCCcc Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKR---KQEVKIGNSSDYA 89 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~---~~~~~V~~~~~YA 89 (154) |+..+++..+...+.+.+...+++.+++++. .+++.|+.++|||||+|++||.+.+..+ ++++.|+++++|| T Consensus 1 m~~s~~i~i~~~~l~~~v~~~~k~~l~~~a~-----~i~~~ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA 75 (137) T protein:vir:10 1 MPVTARIHINEPELERQTGAIFRGKHRSITR-----RIATQARADVPVRTGNLGRGIQEMPQTYRPFHVGGGVEDNVDYA 75 (137) T ss_pred CCeeEEEeeCHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHhCCcccchhhcCceeeeeccccceEEEEEecCCCce Confidence 9988888877777777777776666666544 3456789999999999999999876443 4678999999999 Q ss_pred cccccCccccccCCCCcccccceecccCcee-----ecCCCCCCchhHHHHHHH---HHHHHHH Q lcl|NC_011308. 90 IYYEFGTGEKSEKGGGRAGGWSYMDKNGKWH-----FTRGSKASKRMRYTFRDE---KSKVKDY 145 (154) Q Consensus 90 ~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~-----~t~g~~a~PFl~pA~~~~---~~~i~~~ 145 (154) +||||||+||.+.|++.+..-|+. .|.|+ ++.|++|+|||+||+++. +++|.-. 
T Consensus 76 ~~ve~GT~ph~I~pk~~k~l~f~~--~G~~v~~k~v~hpG~~a~Pfl~~A~~~~~~~~~ri~~~ 137 (137) T protein:vir:10 76 APVHEGSRPHRITARHANALHFFW--HGREVFRKSVWHPGVRPRPFLRNAARRVVAADPDIHMT 137 (137) T ss_pred eeeeecCCCceeecccCceeeeee--CCceEEeeeeecCCCCCCchHHHHHHHHhhccccccCC Confidence 999999999999999877754332 24444 456999999999999985 4444322 No 28 >protein:vir:99744 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004311;genbank:gi:122891765;genbank:GeneID:4712299 Probab=99.91 E-value=5.7e-28 Score=169.96 Aligned_cols=109 Identities=14% Similarity=0.120 Sum_probs=97.7 Q ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC------CccccccccceeEEeecCceEEEEecCCCc Q lcl|NC_011308. 15 DDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSS------NNVTGEYANKTDFEVDKRKQEVKIGNSSDY 88 (154) Q Consensus 15 ~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~a------PvdTG~Lr~SI~~~~~~~~~~~~V~~~~~Y 88 (154) ++|+|.+.|.+.++.+.+.+.+.+.+++.+.+.++. ..++.++ |+|||+|++||.++. .++++++|+++++| T Consensus 1 i~i~Gld~L~~~l~~~~~~~~~~v~~av~~~~~~i~-~~a~~~a~~~~~~p~~TG~Lr~SI~~~~-~g~~~~~V~~~~~Y 78 (115) T protein:vir:99 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYV-VRAKLKAREVMNKGYWTGNLSRNIRYKK-TVDLQYTITSHAAY 78 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHhhccccCCCCcchhhhhceeeee-cCcEEEEecCCccc Confidence 678899999999999999988889888887776664 4566665 999999999998875 46699999999999 Q ss_pred ccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhh Q lcl|NC_011308. 
89 AIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFG 151 (154) Q Consensus 89 A~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~ 151 (154) |+|||||| ++|+|||||+||++.+++.+.+.|.++|+ T Consensus 79 a~~vE~GT--------------------------~~m~a~PFl~PA~~~~k~~~~~~l~~~~k 115 (115) T protein:vir:99 79 SGFLEFGT--------------------------RYMEAEPFMWPVYEVIRKSTVEELKTLFE 115 (115) T ss_pred cccccccc--------------------------cccCCCCcchhhHHHHHHHHHHHHHHHhC Confidence 99999999 48999999999999999999999999999 No 29 >protein:vir:743 Length: 108 # NCBI annotation: unknown # Family: family:all:180 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108720;genbank:gi:13487842;genbank:GeneID:920877 Probab=99.91 E-value=7.3e-28 Score=169.34 Aligned_cols=108 Identities=17% Similarity=0.104 Sum_probs=92.4 Q ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCceEEEEecCCCccccccc Q lcl|NC_011308. 15 DDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRKQEVKIGNSSDYAIYYEF 94 (154) Q Consensus 15 ~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~yVE~ 94 (154) ++|+|.++|.+.++... ..+.+++++.+.+ ..+++.++.++|||||+|++||++++..++++++|+++++||+|||| T Consensus 1 i~i~Gld~l~~~l~~~~--~~~~~~~al~~~a-~~i~~~ak~~aPv~TG~Lr~si~~~~~~~~~~~~V~~~~~Ya~~vE~ 77 (108) T protein:vir:74 1 MKITGIDALQKKLRKNA--TLDDVKHVVKSNT-ASMNKNMQNLAPVDTGNMKRSITSEFTDGGLSGTTGPHTDYAGYVEY 77 (108) T ss_pred CcchhHHHHHHHHHHhh--hHHHHHHHHHHHH-HHHHHHHHHhCCCCchhhhccceeeeecCceEEEeecCCCcccceec Confidence 67888887776666432 3455666665554 45567899999999999999999999889999999999999999999 Q ss_pred CccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhh Q lcl|NC_011308. 
95 GTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFG 151 (154) Q Consensus 95 GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~ 151 (154) || +.|+|||||+||++.+++++.+.|++.|+ T Consensus 78 GT--------------------------~km~aqpf~~pa~~~~~~~~~~~i~~~~k 108 (108) T protein:vir:74 78 GT--------------------------RFQSAQPFVKPAFNIQKKVFTNDLERLTK 108 (108) T ss_pred cc--------------------------cccCCCcchhhHHHHHHHHHHHHHHHHcC Confidence 98 47999999999999999999999999999 No 30 >protein:vir:98409 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:YP_001210363;genbank:gi:146334932;genbank:GeneID:5114801 Probab=99.91 E-value=6.8e-28 Score=169.53 Aligned_cols=108 Identities=16% Similarity=0.071 Sum_probs=92.3 Q ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCceEEEEecCCCccccccc Q lcl|NC_011308. 15 DDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRKQEVKIGNSSDYAIYYEF 94 (154) Q Consensus 15 ~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~yVE~ 94 (154) ++|+|.++|.+.++... ....+++++.+.+. .++..|+.++|||||+|++||.+++..++++++|+++++||+|||| T Consensus 1 i~i~Gld~l~~~l~~~~--~~~~~~~al~~~a~-~i~~~ak~~apvdTG~Lr~si~~~~~~~~~~~~V~~~~~Ya~~vE~ 77 (108) T protein:vir:98 1 MKITGIDALQKKLRKNA--TLNDVKHVVKRNTV-SMNKNMQNLAPVDTGNMKRSITSEFTDGGLTGTTIPHTDYAGYVEY 77 (108) T ss_pred CcchhHHHHHHHHHHhh--hHHHHHHHHHHHHH-HHHHHHHHhCCCCchhhHhhceeeeecCceEEEeecCCCccceeec Confidence 67888888877766542 34556666655544 5567899999999999999999998888999999999999999999 Q ss_pred CccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhh Q lcl|NC_011308. 
95 GTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFG 151 (154) Q Consensus 95 GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~ 151 (154) || +.|+|||||+||++.+++++.++|++.|+ T Consensus 78 GT--------------------------~~m~aqPFl~pa~~~~~~~~~~~i~~~lr 108 (108) T protein:vir:98 78 GT--------------------------RFQAAQPFVKPAFDVQKKIFTNDLERLTK 108 (108) T ss_pred cc--------------------------cccCCCcchhhHHHHHHHHHHHHHHHHcC Confidence 99 47999999999999999999999999999 No 31 >protein:vir:9312 Length: 115 # NCBI annotation: phi Mu50B-like protein # Family: family:all:180 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803290;genbank:gi:29028600;genbank:GeneID:1258048 Probab=99.91 E-value=9.7e-28 Score=168.68 Aligned_cols=109 Identities=14% Similarity=0.122 Sum_probs=98.4 Q ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC------CccccccccceeEEeecCceEEEEecCCCc Q lcl|NC_011308. 15 DDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSS------NNVTGEYANKTDFEVDKRKQEVKIGNSSDY 88 (154) Q Consensus 15 ~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~a------PvdTG~Lr~SI~~~~~~~~~~~~V~~~~~Y 88 (154) ++|+|.+.|.+.++.+.+.+.+.+++++.+.+.+++ ..++.++ |+|||+|++||.++. .++++++|+++++| T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~-~~a~~~a~~~~~~p~~TG~Lr~sI~~~~-~g~~~~~v~~~~~Y 78 (115) T protein:vir:93 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYV-VRAKLKAREVMNKGYWTGNLSRNIRYKK-TGDLQYTITSHAAY 78 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHhccccCCCCCCchhhhhcceeee-cCceEEEeecCccc Confidence 678899999999999999999999988888777664 4677776 999999999998875 46789999999999 Q ss_pred ccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhh Q lcl|NC_011308. 
89 AIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFG 151 (154) Q Consensus 89 A~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~ 151 (154) |+|||||| +.|+|||||+||++.+++.+.+.|+++|+ T Consensus 79 a~~vE~GT--------------------------~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:93 79 SGFLEFGT--------------------------RYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred hhhhcccc--------------------------cccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 99999999 47999999999999999999999999999 No 32 >protein:vir:96358 Length: 115 # NCBI annotation: ORF045 # Family: family:all:180 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239651;genbank:gi:66395408;genbank:GeneID:5132834 Probab=99.91 E-value=9.7e-28 Score=168.68 Aligned_cols=109 Identities=14% Similarity=0.122 Sum_probs=98.4 Q ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC------CccccccccceeEEeecCceEEEEecCCCc Q lcl|NC_011308. 15 DDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSS------NNVTGEYANKTDFEVDKRKQEVKIGNSSDY 88 (154) Q Consensus 15 ~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~a------PvdTG~Lr~SI~~~~~~~~~~~~V~~~~~Y 88 (154) ++|+|.+.|.+.++.+.+.+.+.+++++.+.+.+++ ..++.++ |+|||+|++||.++. .++++++|+++++| T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~-~~a~~~a~~~~~~p~~TG~Lr~sI~~~~-~g~~~~~v~~~~~Y 78 (115) T protein:vir:96 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYV-VRAKLKAREVMNKGYWTGNLSRNIRYKK-TGDLQYTITSHAAY 78 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHhccccCCCCCCchhhhhcceeee-cCceEEEeecCccc Confidence 678899999999999999999999988888777664 4677776 999999999998875 46789999999999 Q ss_pred ccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhh Q lcl|NC_011308. 
89 AIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFG 151 (154) Q Consensus 89 A~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~ 151 (154) |+|||||| +.|+|||||+||++.+++.+.+.|+++|+ T Consensus 79 a~~vE~GT--------------------------~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:96 79 SGFLEFGT--------------------------RYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred hhhhcccc--------------------------cccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 99999999 47999999999999999999999999999 No 33 >protein:vir:78858 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285365;genbank:gi:148717893;genbank:GeneID:5246989 Probab=99.91 E-value=9.7e-28 Score=168.68 Aligned_cols=109 Identities=14% Similarity=0.122 Sum_probs=98.4 Q ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC------CccccccccceeEEeecCceEEEEecCCCc Q lcl|NC_011308. 15 DDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSS------NNVTGEYANKTDFEVDKRKQEVKIGNSSDY 88 (154) Q Consensus 15 ~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~a------PvdTG~Lr~SI~~~~~~~~~~~~V~~~~~Y 88 (154) ++|+|.+.|.+.++.+.+.+.+.+++++.+.+.+++ ..++.++ |+|||+|++||.++. .++++++|+++++| T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~-~~a~~~a~~~~~~p~~TG~Lr~sI~~~~-~g~~~~~v~~~~~Y 78 (115) T protein:vir:78 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYV-VRAKLKAREVMNKGYWTGNLSRNIRYKK-TGDLQYTITSHAAY 78 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHhccccCCCCCCchhhhhcceeee-cCceEEEeecCccc Confidence 678899999999999999999999988888777664 4677776 999999999998875 46789999999999 Q ss_pred ccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhh Q lcl|NC_011308. 
89 AIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFG 151 (154) Q Consensus 89 A~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~ 151 (154) |+|||||| +.|+|||||+||++.+++.+.+.|+++|+ T Consensus 79 a~~vE~GT--------------------------~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:78 79 SGFLEFGT--------------------------RYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred hhhhcccc--------------------------cccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 99999999 47999999999999999999999999999 No 34 >protein:vir:96225 Length: 115 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239574;genbank:gi:66395330;genbank:GeneID:5132773 Probab=99.91 E-value=9.7e-28 Score=168.68 Aligned_cols=109 Identities=14% Similarity=0.122 Sum_probs=98.4 Q ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC------CccccccccceeEEeecCceEEEEecCCCc Q lcl|NC_011308. 15 DDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSS------NNVTGEYANKTDFEVDKRKQEVKIGNSSDY 88 (154) Q Consensus 15 ~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~a------PvdTG~Lr~SI~~~~~~~~~~~~V~~~~~Y 88 (154) ++|+|.+.|.+.++.+.+.+.+.+++++.+.+.+++ ..++.++ |+|||+|++||.++. .++++++|+++++| T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~-~~a~~~a~~~~~~p~~TG~Lr~sI~~~~-~g~~~~~v~~~~~Y 78 (115) T protein:vir:96 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYV-VRAKLKAREVMNKGYWTGNLSRNIRYKK-TGDLQYTITSHAAY 78 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHhccccCCCCCCchhhhhcceeee-cCceEEEeecCccc Confidence 678899999999999999999999988888777664 4677776 999999999998875 46789999999999 Q ss_pred ccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhh Q lcl|NC_011308. 
89 AIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFG 151 (154) Q Consensus 89 A~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~ 151 (154) |+|||||| +.|+|||||+||++.+++.+.+.|+++|+ T Consensus 79 a~~vE~GT--------------------------~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:96 79 SGFLEFGT--------------------------RYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred hhhhcccc--------------------------cccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 99999999 47999999999999999999999999999 No 35 >protein:vir:97144 Length: 115 # NCBI annotation: ORF047 # Family: family:all:180 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239729;genbank:gi:66394911;genbank:GeneID:5130877 Probab=99.91 E-value=9.7e-28 Score=168.68 Aligned_cols=109 Identities=14% Similarity=0.122 Sum_probs=98.4 Q ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC------CccccccccceeEEeecCceEEEEecCCCc Q lcl|NC_011308. 15 DDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSS------NNVTGEYANKTDFEVDKRKQEVKIGNSSDY 88 (154) Q Consensus 15 ~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~a------PvdTG~Lr~SI~~~~~~~~~~~~V~~~~~Y 88 (154) ++|+|.+.|.+.++.+.+.+.+.+++++.+.+.+++ ..++.++ |+|||+|++||.++. .++++++|+++++| T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~-~~a~~~a~~~~~~p~~TG~Lr~sI~~~~-~g~~~~~v~~~~~Y 78 (115) T protein:vir:97 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYV-VRAKLKAREVMNKGYWTGNLSRNIRYKK-TGDLQYTITSHAAY 78 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHhccccCCCCCCchhhhhcceeee-cCceEEEeecCccc Confidence 678899999999999999999999988888777664 4677776 999999999998875 46789999999999 Q ss_pred ccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhh Q lcl|NC_011308. 
89 AIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFG 151 (154) Q Consensus 89 A~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~ 151 (154) |+|||||| +.|+|||||+||++.+++.+.+.|+++|+ T Consensus 79 a~~vE~GT--------------------------~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:97 79 SGFLEFGT--------------------------RYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred hhhhcccc--------------------------cccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 99999999 47999999999999999999999999999 No 36 >protein:vir:103917 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873996;genbank:gi:118430771;genbank:GeneID:4525409 Probab=99.91 E-value=9.7e-28 Score=168.68 Aligned_cols=109 Identities=14% Similarity=0.122 Sum_probs=98.4 Q ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC------CccccccccceeEEeecCceEEEEecCCCc Q lcl|NC_011308. 15 DDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSS------NNVTGEYANKTDFEVDKRKQEVKIGNSSDY 88 (154) Q Consensus 15 ~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~a------PvdTG~Lr~SI~~~~~~~~~~~~V~~~~~Y 88 (154) ++|+|.+.|.+.++.+.+.+.+.+++++.+.+.+++ ..++.++ |+|||+|++||.++. .++++++|+++++| T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~-~~a~~~a~~~~~~p~~TG~Lr~sI~~~~-~g~~~~~v~~~~~Y 78 (115) T protein:vir:10 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYV-VRAKLKAREVMNKGYWTGNLSRNIRYKK-TGDLQYTITSHAAY 78 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHhccccCCCCCCchhhhhcceeee-cCceEEEeecCccc Confidence 678899999999999999999999988888777664 4677776 999999999998875 46789999999999 Q ss_pred ccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhh Q lcl|NC_011308. 
89 AIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFG 151 (154) Q Consensus 89 A~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~ 151 (154) |+|||||| +.|+|||||+||++.+++.+.+.|+++|+ T Consensus 79 a~~vE~GT--------------------------~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:10 79 SGFLEFGT--------------------------RYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred hhhhcccc--------------------------cccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 99999999 47999999999999999999999999999 No 37 >protein:vir:94538 Length: 125 # NCBI annotation: putative head to tail joining # Family: family:all:180 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223893;genbank:gi:62327105;genbank:GeneID:5075554 Probab=99.91 E-value=1.1e-27 Score=168.36 Aligned_cols=119 Identities=13% Similarity=0.002 Sum_probs=101.3 Q ss_pred CCcceeeecCCccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEE---eecCc Q lcl|NC_011308. 1 MRSRLLIDRGGHMADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFE---VDKRK 77 (154) Q Consensus 1 ~~~~~~~~~~~~Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~---~~~~~ 77 (154) |-++ |+++|+|.++|.+.++.+.+.+.+.+.+++...+ +.++..++.++|+|||+|++||++. .+.++ T Consensus 1 Ma~~--------~~i~~~Gld~l~~~L~~~~~~~~~~v~~al~~~a-~~i~~~ak~~ap~~tG~L~~sI~~~~~~~~~~~ 71 (125) T protein:vir:94 1 MAND--------FNIKFKGVDKLLDEFDISRKELVPYSVEAMKTSL-SRAVEKSKGLARVDTGYMRNNIQQDEVKEEHGV 71 (125) T ss_pred CCCc--------eeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHH-HHHHHHHHhhCCCCChhhhhhceecceeccCCc Confidence 2221 3446788899999999999888888888776654 5567789999999999999999753 46678 Q ss_pred eEEEEecCCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_011308. 
78 QEVKIGNSSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGGLD 154 (154) Q Consensus 78 ~~~~V~~~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~l~ 154 (154) ++++|+++++||+|||||| ++|+|||||+||++.+++.+.+.|.++|++.= T Consensus 72 ~~~~v~~~~~Ya~~vEfGT--------------------------~~~~a~Pfl~pa~~~~~~~~~~~l~~~l~~a~ 122 (125) T protein:vir:94 72 VTGRYVARADYSSYNEYGT--------------------------YRMSAQPFMAPSVAAMTPFFYKAVRDALNKAA 122 (125) T ss_pred EEEEeeCCCCccceeeccc--------------------------ccCCCCcccchhHHHHHHHHHHHHHHHHHHHh Confidence 9999999999999999998 47999999999999999999999999998887 No 38 >protein:vir:106623 Length: 115 # NCBI annotation: ORF049 # Family: family:all:180 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239497;genbank:gi:66395260;genbank:GeneID:4555777 Probab=99.90 E-value=2.1e-27 Score=166.84 Aligned_cols=109 Identities=17% Similarity=0.174 Sum_probs=97.1 Q ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC------CccccccccceeEEeecCceEEEEecCCCc Q lcl|NC_011308. 15 DDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSS------NNVTGEYANKTDFEVDKRKQEVKIGNSSDY 88 (154) Q Consensus 15 ~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~a------PvdTG~Lr~SI~~~~~~~~~~~~V~~~~~Y 88 (154) ++|+|.++|.+.++.+.+.+.+.+.+++.+.+.. ++..++.++ |||||+|++||.++ ..++++++|+++++| T Consensus 1 i~i~Gld~L~~~l~~~~~~~~~~~~~al~~~~~~-i~~~a~~~a~~~~~~pv~TG~Lr~sI~~~-~~g~~~~~v~~~~~Y 78 (115) T protein:vir:10 1 MQSKGLKKLMNHLKVMHDDIEDDVDDILKNNAKE-GVGIAVSNAKEVMNKGYWTGNLASLIEVK-KIGDLHYRVISTAHY 78 (115) T ss_pred CeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhhccccCCCCcchhhhhceeee-ecCcEEEEeeCCCcc Confidence 6788999999999999998888888887776655 456677776 89999999999876 456789999999999 Q ss_pred ccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhh Q lcl|NC_011308. 89 AIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFG 151 (154) Q Consensus 89 A~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~ 151 (154) |+|||||| +.|+|||||+||++.+++.+.+.|+++|. 
T Consensus 79 a~~vEfGT--------------------------~km~a~PFl~PA~~~~k~~~~~~i~~~i~ 115 (115) T protein:vir:10 79 SGFLEFGT--------------------------RYMEPAPFMFPTYQTLKKSTINDLKRLLS 115 (115) T ss_pred chheeccc--------------------------ccCCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 99999998 58999999999999999999999999999 No 39 >protein:vir:96486 Length: 112 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238496;genbank:gi:66391772;genbank:GeneID:5176908 Probab=99.90 E-value=2.2e-27 Score=166.74 Aligned_cols=109 Identities=17% Similarity=0.178 Sum_probs=92.2 Q ss_pred cch-hhhhHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCceEEEEecCCCcc Q lcl|NC_011308. 13 MAD-DIKFEMDMSKIKDMF--DDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRKQEVKIGNSSDYA 89 (154) Q Consensus 13 Ma~-~v~~~~~l~~~~~~l--~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA 89 (154) ||+ +|+|.++|.+.++.+ .+.+++.+++++.+.+.++ +..++.++|||||+|++||++ ..++++++|+++++|| T Consensus 1 Ma~i~i~Gld~L~~~l~~~~~~~~v~~~v~~~~~~~~~~~-~~~a~~~apvdTG~Lr~sI~~--~~~~~~~~v~~~~~Ya 77 (112) T protein:vir:96 1 MATIEFEGLDEMAQSLLKNASSERRSKVLRKYGAKLKEAA-VSKAQFKKGYSTGATRRSITL--EAGSDRAVVEALTNYS 77 (112) T ss_pred CceeeehHHHHHHHHHHhhcCHHHHHHHHHHHHHHHHHHH-HHHhhhcCCCCchhhhhceee--ecCceEEEecCCCCcc Confidence 994 888999998888876 4567778887777766554 567899999999999999975 5677899999999999 Q ss_pred cccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHh Q lcl|NC_011308. 
90 IYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVF 150 (154) Q Consensus 90 ~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l 150 (154) +|||||| +.|+|||||+||++.+++.+.+.|++.- T Consensus 78 ~~vE~GT--------------------------r~m~AqPF~~PA~~~~~~~~~~~l~~L~ 112 (112) T protein:vir:96 78 GYLEVGT--------------------------RKMEAQPFMRPALDQVVPEMVEEMAKWE 112 (112) T ss_pred ceeccCc--------------------------cccCCCCchhhhHHHHHHHHHHHHHhcC Confidence 9999998 5899999999999999999777666544 No 40 >protein:vir:2740 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695113;genbank:gi:23455882;genbank:GeneID:955595 Probab=99.89 E-value=9e-27 Score=163.36 Aligned_cols=111 Identities=19% Similarity=0.202 Sum_probs=95.2 Q ss_pred cch-hhhhHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCceEEEEecCCCcc Q lcl|NC_011308. 13 MAD-DIKFEMDMSKIKDMF--DDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRKQEVKIGNSSDYA 89 (154) Q Consensus 13 Ma~-~v~~~~~l~~~~~~l--~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA 89 (154) |++ +++|.+.|.+.++++ .+.+++.+++++.+.+.++ ++.|+.++||+||+|++||.+++..++ ++|+++++|| T Consensus 1 Ma~i~~~Gld~l~~~L~~~~~~~~v~~~~~~~~~~~~~~~-~~~a~~~~p~~TG~Lr~sI~~~~~~~~--~~V~~~~~Ya 77 (114) T protein:vir:27 1 MATIEFEGLDEMAQSLLKNASPEKRSKVLRKYGSKLKEAA-VNRAQFNKGYSTGATRRSITLQVESDK--ATVEALTSYS 77 (114) T ss_pred CeeeeeehHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHH-HHhcccCCCCCchhhhhceeeeecCCe--eEecCCCCcc Confidence 984 788888888877766 4567778888777776554 456888999999999999998876665 6799999999 Q ss_pred cccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_011308. 
90 IYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGG 152 (154) Q Consensus 90 ~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~ 152 (154) +|||||| +.|+|||||+||++.+++.+.+.|++.++- T Consensus 78 ~~vEfGT--------------------------~km~a~Pfl~PA~~~~~~~~~~~l~~l~k~ 114 (114) T protein:vir:27 78 GYLEVGT--------------------------RKMEAQPFMKPALDEVAPKMVEELAKWDET 114 (114) T ss_pred ceecccc--------------------------cccCCCCchhhhHHHHHHHHHHHHHHHhcC Confidence 9999998 479999999999999999999999999999 No 41 >protein:vir:4906 Length: 114 # NCBI annotation: gp114 # Family: family:all:180 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056684;genbank:gi:9635019;genbank:GeneID:1262668 Probab=99.89 E-value=9e-27 Score=163.36 Aligned_cols=111 Identities=19% Similarity=0.202 Sum_probs=95.2 Q ss_pred cch-hhhhHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCceEEEEecCCCcc Q lcl|NC_011308. 13 MAD-DIKFEMDMSKIKDMF--DDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRKQEVKIGNSSDYA 89 (154) Q Consensus 13 Ma~-~v~~~~~l~~~~~~l--~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA 89 (154) |++ +++|.+.|.+.++++ .+.+++.+++++.+.+.++ ++.|+.++||+||+|++||.+++..++ ++|+++++|| T Consensus 1 Ma~i~~~Gld~l~~~L~~~~~~~~v~~~~~~~~~~~~~~~-~~~a~~~~p~~TG~Lr~sI~~~~~~~~--~~V~~~~~Ya 77 (114) T protein:vir:49 1 MATIEFEGLDEMAQSLLKNASPEKRSKVLRKYGSKLKEAA-VNRAQFNKGYSTGATRRSITLQVESDK--ATVEALTSYS 77 (114) T ss_pred CeeeeeehHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHH-HHhcccCCCCCchhhhhceeeeecCCe--eEecCCCCcc Confidence 984 788888888877766 4567778888777776554 456888999999999999998876665 6799999999 Q ss_pred cccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_011308. 
90 IYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGG 152 (154) Q Consensus 90 ~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~ 152 (154) +|||||| +.|+|||||+||++.+++.+.+.|++.++- T Consensus 78 ~~vEfGT--------------------------~km~a~Pfl~PA~~~~~~~~~~~l~~l~k~ 114 (114) T protein:vir:49 78 GYLEVGT--------------------------RKMEAQPFMKPALDEVAPKMVEELAKWDET 114 (114) T ss_pred ceecccc--------------------------cccCCCCchhhhHHHHHHHHHHHHHHHhcC Confidence 9999998 479999999999999999999999999999 No 42 >protein:vir:102441 Length: 137 # NCBI annotation: gp26 # Family: family:all:1084 # MgeID: mge:1618 # MgeName: Pipefish # Cross-refs: genbank:acc:YP_655303;genbank:gi:109521866;genbank:GeneID:4157756 Probab=99.89 E-value=4.7e-27 Score=164.93 Aligned_cols=129 Identities=14% Similarity=0.102 Sum_probs=96.2 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeec----CceEEEEecCCCc Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDK----RKQEVKIGNSSDY 88 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~----~~~~~~V~~~~~Y 88 (154) |-..+.+..+...+.+++...+++.+++++. .+++.|+.++|||||+|++||.++... ..+++.|+++++| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~v~r~~l~~~a~-----~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~~V~~~~~Y 75 (137) T protein:vir:10 1 MTVTARYERNPVGEARQFQVIARRRLSRITR-----GTANQARADVPVKTGNLGRSIREDPIVVAGPLRLDSGVTAHADY 75 (137) T ss_pred CeeEEEeccCchhHHHHHHHHHHHHHHHHHH-----HHHHHHHhcCCccchhhhcCceeeeeeccccceEEEEecCCCcc Confidence 5555666666666667777776666665443 345678999999999999999876432 2357889999999 Q ss_pred ccccccCccccccCCCCcccccceecc----cCceeecCCCCCCchhHHHHHHHHHHHHHHH Q lcl|NC_011308. 89 AIYYEFGTGEKSEKGGGRAGGWSYMDK----NGKWHFTRGSKASKRMRYTFRDEKSKVKDYV 146 (154) Q Consensus 89 A~yVE~GTg~~~~~~~~~~~~~~~~~~----~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i 146 (154) |+||||||+||.+.|+.++..++|... 
.++.+.+.|++|+|||+||+++++++....- T Consensus 76 A~~ve~GT~ph~I~Pk~~k~~l~~~~~g~~vf~k~V~hPG~~a~PfL~~A~~~~~~~~~~~~ 137 (137) T protein:vir:10 76 ARYVHDGTRAHVIRPRRPGGVLRFTVGGRVVYARRVNHPGTRARPFLRNAAERVVARETATS 137 (137) T ss_pred ceeeecCCCCceeeccccceeeeEeeCCeeEecceeecCCCCCCchHHHHHHHhhhhhcccC Confidence 999999999999999876643333321 2334456799999999999999999865544 No 43 >protein:vir:106506 Length: 137 # NCBI annotation: Pas21 # Family: family:all:1084 # MgeID: mge:1680 # MgeName: phiAsp2 # Cross-refs: genbank:acc:YP_024807;genbank:gi:48697422;genbank:GeneID:2846163 Probab=99.87 E-value=3.7e-26 Score=159.98 Aligned_cols=129 Identities=7% Similarity=0.023 Sum_probs=90.1 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEee---cCceEEEEecCCCccccc Q lcl|NC_011308. 16 DIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVD---KRKQEVKIGNSSDYAIYY 92 (154) Q Consensus 16 ~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~---~~~~~~~V~~~~~YA~yV 92 (154) .|....+|++ ..+.+...+.+++++...+. -+++.++.++|||||+|++||.+.+. +..+.+.|+++++||+|| T Consensus 1 ~~~~~~~l~~--~~l~~~~~~~~~~~~~~~a~-~ve~~ak~~aPv~TG~Lr~SI~~~~~~~~g~~v~~~V~~~~~YA~~v 77 (137) T protein:vir:10 1 MVAHTLRIER--AQLHGLGMDEARKAVNRVVR-RTFTRSQILAPVDTGYLRASGRLVLGRERGAVVIGSVEYTARYAAAV 77 (137) T ss_pred CcccccccCh--hhHhhHHHHHHHHHHHHHHH-HHHHHHHhcCCcCchhhhccceeeeeeccccEEEEEecCCcccceee Confidence 2333333333 24444445555555544433 44667899999999999999987653 234677899999999999 Q ss_pred ccCccccccCCCCcccccceecccCceee-----cCCCCCCchhHHHHHHHHHHHHHHHHHHhh Q lcl|NC_011308. 
93 EFGTGEKSEKGGGRAGGWSYMDKNGKWHF-----TRGSKASKRMRYTFRDEKSKVKDYVIKVFG 151 (154) Q Consensus 93 E~GTg~~~~~~~~~~~~~~~~~~~g~~~~-----t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~ 151 (154) ||||+||.++|+.++..+|+.+ |.+++ ..|++|+|||+||++..+++.- +.-.|+ T Consensus 78 e~GT~ph~I~pk~~kaL~f~~~--G~~vf~k~V~hPG~k~~PfL~~Al~~~~~~~~--~~~~~~ 137 (137) T protein:vir:10 78 HNGRRALTIRAKGNGRLKFTVE--GRTVYARSVHQPARAGRPYLSQALREVAPQEG--FRVTIG 137 (137) T ss_pred ecCCCCceeecCCCccceeecC--CeeEeccceecCCCCCChhhHHHHHHhhcccc--eeEeeC Confidence 9999999999999998877644 55554 4599999999999998887611 000111 No 44 >protein:vir:105467 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529877;genbank:gi:90592617;genbank:GeneID:3974531 Probab=99.84 E-value=5.4e-24 Score=148.16 Aligned_cols=131 Identities=14% Similarity=0.127 Sum_probs=94.3 Q ss_pred cch-hhhhHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeE---EeecCceEEEEe Q lcl|NC_011308. 13 MAD-DIKFEMDMSKIKDMFDDT-----AEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDF---EVDKRKQEVKIG 83 (154) Q Consensus 13 Ma~-~v~~~~~l~~~~~~l~~~-----~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~---~~~~~~~~~~V~ 83 (154) ||. .|++ +.|+++.+.|.+. +.+.+++++.+++.++ .+.++.++|||||+||+||+. ...+++++++|+ T Consensus 1 Ms~~~id~-~gl~~~~~~l~~~~~~~~~~~~~~~~l~~~~~~~-~~~vk~~tPVdTG~Lr~S~~~~~~~~~~~~~~~~V~ 78 (144) T protein:vir:10 1 MSLGHVDD-AQFQQFASRVRQKIDSGYVKQELGKSSRRIGTQS-LRILEANTPVKQGNLRRSWTAEGPTYGCGGWTIKLI 78 (144) T ss_pred CCCCCccH-HHHHHHHHHHHHHHhhcchHHHHHHHHHHHHHHH-HHHHHHhCCCCcchhccceeecceeeecCeeEEEEe Confidence 884 3443 2344444444432 3456677776666555 456899999999999999974 456788999999 Q ss_pred cCCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_011308. 84 NSSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGGLD 154 (154) Q Consensus 84 ~~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~l~ 154 (154) ++++||+||||||+... |+-.||........| .+++|||.+|++..+..+.++|++.|.+|. 
T Consensus 79 n~~~YA~~VE~Ghr~~~----G~~v~~~~~~~~~g~-----V~G~~~~~~a~~~~~~~~~~~l~k~l~~l~ 140 (144) T protein:vir:10 79 NNAEYASYVESGHRQTP----GRYVPVLKKRLVRDW-----VPGQFYMKKSIPQIQRQLPQLVTEGLWGLK 140 (144) T ss_pred cCCCcccccccceeecC----CcccccCCCccccce-----ecCccchHHHHHHHHHHHHHHHHHHHHHHh Confidence 99999999999985321 222233322222333 478899999999999999999999999998 No 45 >protein:vir:100075 Length: 140 # NCBI annotation: gp9 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945039;genbank:gi:38707899;genbank:GeneID:2744122 Probab=99.82 E-value=2e-23 Score=144.98 Aligned_cols=115 Identities=17% Similarity=0.170 Sum_probs=89.2 Q ss_pred cc-hhhhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeec---CceEEEEe---- Q lcl|NC_011308. 13 MA-DDIKFEMDMSKIKDMFDDTAEK-ALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDK---RKQEVKIG---- 83 (154) Q Consensus 13 Ma-~~v~~~~~l~~~~~~l~~~~~~-~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~---~~~~~~V~---- 83 (154) |+ .+++|.++|.+.++.|.+.+.+ .+.++ +..++++++..++.++|++||.|++||.+.... +...+.|+ T Consensus 1 Ma~~~i~Gld~l~~~l~~L~~~~~~k~~~~a-l~~~a~~v~~~ak~~aP~~tG~l~~sI~~~~~~~~~~~~~~~~g~~~~ 79 (140) T protein:vir:10 1 MSSIQIIGLADLRADFEKLAKSQSTKALRRA-TVAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGLATAGVRVR 79 (140) T ss_pred CceeeehhHHHHHHHHHHhHHHHHHHHHHHH-HHHHHHHHHHHHHHhCCCChhhHHHhccccccccccccceEEeeeeec Confidence 98 4788889999999999877655 45555 455677888899999999999999999764321 12222332 Q ss_pred --------cCCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhc-cC Q lcl|NC_011308. 84 --------NSSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGG-LD 154 (154) Q Consensus 84 --------~~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~-l~ 154 (154) ++..|+.|+|||| ++|||||||+||++.+++++.+.|.++++. 
|+ T Consensus 80 ~~~~~~~~~~~~y~~f~E~GT--------------------------~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~ 133 (140) T protein:vir:10 80 TKGKADSPNNAFYWRFDEFGT--------------------------QHMKAQPFMRPAFDASIGEAEGAIRTELARAID 133 (140) T ss_pred cccccCCCCccceeeeeccCC--------------------------CCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHH Confidence 4467999999998 589999999999999999988888877744 44 No 46 >protein:vir:1437 Length: 140 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536366;genbank:gi:17975171;genbank:GeneID:929147 Probab=99.82 E-value=2.9e-23 Score=144.12 Aligned_cols=116 Identities=16% Similarity=0.133 Sum_probs=90.6 Q ss_pred cch-hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEee---cCceEEEEe----- Q lcl|NC_011308. 13 MAD-DIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVD---KRKQEVKIG----- 83 (154) Q Consensus 13 Ma~-~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~---~~~~~~~V~----- 83 (154) |+. +++|.++|.+.++.|++.+.+.+.+.++..++++++..++.++|++||.|++||.+... .+.....|+ T Consensus 1 M~~~~i~Gld~l~~~l~~l~~~~~~~~~~~al~~~a~~v~~~ak~~aP~~tG~l~~sI~~~~~~~~~~~~~~~vg~~~~~ 80 (140) T protein:vir:14 1 MSSIQIIGLADLRADFEKLAKSQSAKALRRATLAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGLATAGVRVRT 80 (140) T ss_pred CceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHhhcccccccccccceeEEeeeeecc Confidence 984 78888999999999987766555444455667788889999999999999999976432 112223332 Q ss_pred -------cCCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhc-cC Q lcl|NC_011308. 84 -------NSSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGG-LD 154 (154) Q Consensus 84 -------~~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~-l~ 154 (154) .+..|+.|+|||| ++|||||||+||++.+++++.+.|.++|+. 
|+ T Consensus 81 ~~~~~~~~~~~y~~f~E~GT--------------------------~~~~a~pFl~pa~~~~~~~~~~~~~~~~~~~l~ 133 (140) T protein:vir:14 81 KGKADSPNNAFYWRFDEFGT--------------------------QHMKAQPFMRPAFDASIGEAEGAIRTELARAID 133 (140) T ss_pred ccccCCCCccceeeeecccc--------------------------CCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHH Confidence 3567999999998 589999999999999999999888888854 44 No 47 >protein:vir:100243 Length: 140 # NCBI annotation: gp72 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355408;genbank:gi:77864698;genbank:GeneID:3725965 Probab=99.81 E-value=3.6e-23 Score=143.66 Aligned_cols=116 Identities=15% Similarity=0.091 Sum_probs=90.2 Q ss_pred cch-hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeec-----CceEEEEe--- Q lcl|NC_011308. 13 MAD-DIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDK-----RKQEVKIG--- 83 (154) Q Consensus 13 Ma~-~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~-----~~~~~~V~--- 83 (154) ||. +|+|.++|.+.++.|.+...+.+.+.++..++++++..++.++|++||.|++||.+.... +..++.|+ T Consensus 1 Ma~~~i~Gld~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~~~~~~~~ 80 (140) T protein:vir:10 1 MSSVQILGLADLQADFLKLAKAQSTKALRRATVAGANVIRDEARARAPKKTGKLKRNIVTAALKQKDSPGIATAGVRVRT 80 (140) T ss_pred CceeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHHhceecccccccccceeEEeecccc Confidence 994 788989999999999877765554444556778888999999999999999999765321 12233332 Q ss_pred -------cCCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhc-cC Q lcl|NC_011308. 84 -------NSSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGG-LD 154 (154) Q Consensus 84 -------~~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~-l~ 154 (154) ++..|+.|+|||| ++|||||||+||++++++++.+.|.+.|+. 
|+ T Consensus 81 ~~~~~~~~~~~y~~f~E~GT--------------------------~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~ 133 (140) T protein:vir:10 81 KGKADSPNNAFYWRFVELGT--------------------------QFMKAEPFMRPAFDASIAQAEGAIRTEIARAID 133 (140) T ss_pred ccccCCCCcccccceeccCc--------------------------CCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHH Confidence 3467999999998 479999999999999999998888877732 33 No 48 >protein:vir:97088 Length: 157 # NCBI annotation: hypothetical protein # Family: family:all:2714 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453568;genbank:gi:84662603;genbank:GeneID:5142503 Probab=99.81 E-value=6.3e-23 Score=142.28 Aligned_cols=139 Identities=11% Similarity=0.046 Sum_probs=98.2 Q ss_pred cchhhhhH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeec-----CceEEEEecC Q lcl|NC_011308. 13 MADDIKFE--MDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDK-----RKQEVKIGNS 85 (154) Q Consensus 13 Ma~~v~~~--~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~-----~~~~~~V~~~ 85 (154) |+.++... +.|...++.|.+...+.+.++ +..++++++++|+.++|++||.|++||.+.... +..++.|+.+ T Consensus 1 m~~~~~~~d~s~l~~~l~~l~~~~~~v~R~A-~~~ga~vv~dear~~aP~~tG~LkksI~~~~~~~~s~~g~~~~~Vg~~ 79 (157) T protein:vir:97 1 MKFSIRSVDITGILAGLETVVEHSSDVVRTM-TYESAVAVRESAKAFVNDETGKLRNNLYVAYSPEESVEGIQTYAVSWR 79 (157) T ss_pred CeeEeecccHHHHHHHHHHhHHHHHHHHHHH-HHHHHHHHHHHHHHhCCCCcchhhhheeeeeccccCCCceEEEEEeec Confidence 98776543 467888888887766655555 556678889999999999999999999765421 2234446654 Q ss_pred ---CCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHh-hccC Q lcl|NC_011308. 86 ---SDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVF-GGLD 154 (154) Q Consensus 86 ---~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l-~~l~ 154 (154) ++|+.|||||+..+.........+|++.... 
.-.+.+|||||||+|||+..++++.+.+.+.| ++|+ T Consensus 80 ~~~a~~g~~vEfG~~~~~~~~~~~~~~~~~~~~~--~~t~~~~Pa~PFlRPA~d~~k~~a~~~~~~~l~k~I~ 150 (157) T protein:vir:97 80 KKAAPHGHLLEFGHWQTHAAYRDKDGQWYSSKVK--LVNPKWIPAKPFLRPGYDSVAMQIPDIARAAGAKKYA 150 (157) T ss_pred CCccceeeeeecCcccccccccCCcccccccccc--cCCCCcCCCCcccchHHHHhHHHHHHHHHHHHHHHHH Confidence 5789999999876655555555666543322 22256799999999999999999888764433 2333 No 49 >protein:vir:80362 Length: 140 # NCBI annotation: gp10, phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111089;genbank:gi:134288660;genbank:GeneID:4960609 Probab=99.80 E-value=1e-22 Score=141.18 Aligned_cols=116 Identities=14% Similarity=0.092 Sum_probs=89.4 Q ss_pred cc-hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeec---CceEEEE------ Q lcl|NC_011308. 13 MA-DDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDK---RKQEVKI------ 82 (154) Q Consensus 13 Ma-~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~---~~~~~~V------ 82 (154) || ++|+|.++|.+.++.|.+.+.+.+.+.++..++++++..++.++|++||.|++||.+.... ....+.+ T Consensus 1 Ma~~~i~Gld~l~~~l~~l~~~~~~k~~~~a~~~~a~~v~~~ak~~aP~~tG~l~~~i~~~~~~~~~~~~~~~~~~~~~~ 80 (140) T protein:vir:80 1 MSSIQIVGLADLLADFERLAKSQSTKALRRATVAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGLATAGVRVRT 80 (140) T ss_pred CceeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhceeeeccccccccceeeeeeeccc Confidence 98 4788889999999999877655444444455677888899999999999999999764321 1112222 Q ss_pred ------ecCCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhc-cC Q lcl|NC_011308. 83 ------GNSSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGG-LD 154 (154) Q Consensus 83 ------~~~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~-l~ 154 (154) .++..|+.|+|||| +.|||||||+||++.+++++.+.|.++|+. 
|+ T Consensus 81 ~~~~~~~~~~~y~~f~E~GT--------------------------~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~ 133 (140) T protein:vir:80 81 KGKADSPSNAFYWRFDEFGT--------------------------QHMKAQPFMRPAFDASIGEAEGAIRTELARAID 133 (140) T ss_pred ccccCCCCCcceeeeeccCC--------------------------CCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHH Confidence 23467999999998 589999999999999999999998888754 44 No 50 >protein:vir:93617 Length: 148 # NCBI annotation: putative structural component # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449299;genbank:gi:157166047;interpro:IPR010064;interpro:IPR011693;uniprot:Q6H9U2;genbank:GeneID:5580439 Probab=99.78 E-value=6.4e-22 Score=136.79 Aligned_cols=117 Identities=11% Similarity=0.089 Sum_probs=88.0 Q ss_pred ccc--hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEee---cCceEEEE---- Q lcl|NC_011308. 12 HMA--DDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVD---KRKQEVKI---- 82 (154) Q Consensus 12 ~Ma--~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~---~~~~~~~V---- 82 (154) =|+ .+|+|.++|.+.++.|++.+.+.+.+.++..++++++..++.++|++||.|++||.+... .+.+...| T Consensus 1 mm~~~~~i~Gldel~~~l~~L~~~~~~~~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~g~~~~~v~~~~ 80 (148) T protein:vir:93 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRRSRDGGMESGVHIRG 80 (148) T ss_pred CcceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcchhhhhceeccccccCCceeeeeeecc Confidence 222 356677899999999988877777777778888999999999999999999999975421 11111111 Q ss_pred ----------------ecCCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHH Q lcl|NC_011308. 
83 ----------------GNSSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYV 146 (154) Q Consensus 83 ----------------~~~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i 146 (154) ..+..|+.|+|||| ..|||||||+||++++++++.+.| T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT--------------------------~~~pa~PFl~pA~~~~k~~~~~~~ 134 (148) T protein:vir:93 81 VNPDTGNSDNTMKADNPRNAFYWRFVEMGT--------------------------VNMPPHPFVRPAFDVRSEQAAQVA 134 (148) T ss_pred cccccccccceeecCCCCCcceeeeeccCC--------------------------CCCCCCcchhHHHHHhHHHHHHHH Confidence 13356899999998 479999999999999999877776 Q ss_pred HHHhhc-cC Q lcl|NC_011308. 147 IKVFGG-LD 154 (154) Q Consensus 147 ~~~l~~-l~ 154 (154) .+.|++ |+ T Consensus 135 ~~~~~~~i~ 143 (148) T protein:vir:93 135 IARMNRAID 143 (148) T ss_pred HHHHHHHHH Confidence 665543 33 No 51 >protein:vir:194 Length: 149 # NCBI annotation: Gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037704;genbank:gi:9634169;genbank:GeneID:1262536 Probab=99.78 E-value=4.6e-22 Score=137.58 Aligned_cols=119 Identities=13% Similarity=0.095 Sum_probs=86.0 Q ss_pred CCccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCc----eEEE---- Q lcl|NC_011308. 10 GGHMADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRK----QEVK---- 81 (154) Q Consensus 10 ~~~Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~----~~~~---- 81 (154) =-.|+++|+|.++|.+.++.|++.+.+.+.+.++..++++++..++.++|++||.|++||++...... +... 
T Consensus 1 mm~~~~~i~Gl~~l~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~si~~~~~~~~~~~~~~~~v~~~ 80 (149) T protein:vir:19 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIDRAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) T ss_pred CcceeeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCchhhhhhccccccccccccceeeccccc Confidence 11122456788999999999998877666666666778888999999999999999999975422111 1111 Q ss_pred ----------------EecCCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHH Q lcl|NC_011308. 82 ----------------IGNSSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDY 145 (154) Q Consensus 82 ----------------V~~~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~ 145 (154) -.++..|+.|+|||| .+|||||||+||++++++++.+. T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT--------------------------~~~~a~PF~~pA~~~~k~~~~~~ 134 (149) T protein:vir:19 81 GVNPRTGNSDNTMKANNPRNAFYWRFVELGT--------------------------ANMPAHPFVRPAYDTREEEAASV 134 (149) T ss_pred ccccccccccceeecCCCCccceeeeeccCC--------------------------CCCCCCcchhHHHHHHHHHHHHH Confidence 112345777888877 58999999999999999987777 Q ss_pred HHHHhhc-cC Q lcl|NC_011308. 146 VIKVFGG-LD 154 (154) Q Consensus 146 i~~~l~~-l~ 154 (154) |.+.|++ |+ T Consensus 135 ~~~~l~~~l~ 144 (149) T protein:vir:19 135 AIARMNQAID 144 (149) T ss_pred HHHHHHHHHH Confidence 7666653 33 No 52 >protein:vir:105089 Length: 133 # NCBI annotation: Gp11 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006591;genbank:gi:46402097;genbank:GeneID:2777955 Probab=99.75 E-value=4.4e-21 Score=132.18 Aligned_cols=116 Identities=11% Similarity=0.104 Sum_probs=93.1 Q ss_pred cc-hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccc----cccceeEEe--e----cCceEEE Q lcl|NC_011308. 
13 MA-DDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGE----YANKTDFEV--D----KRKQEVK 81 (154) Q Consensus 13 Ma-~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~----Lr~SI~~~~--~----~~~~~~~ 81 (154) |+ .+|+|.++|.+.++.|.+.+.+.+.+.++..++++++..++.++|++||. |++||.++. . .+.+.+. T Consensus 1 M~~~~i~Gl~el~~~l~~L~~~~~~k~~~~Al~~~a~~i~~~ak~~ap~~~~~~~~~~~~~I~v~~~~~~~~~~~~~~v~ 80 (133) T protein:vir:10 1 MIRMEVKGLDELERQLTALGEKVATKVLRDAGREALKVVEEDMKQHAGFDETSTGQHMRDSIKIRSSTRKAQGNAVVTLR 80 (133) T ss_pred CeeEeeehHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCcchhhhhhcccccccccccCccceEEEE Confidence 65 47899999999999999888777766677788889999999999999998 788886532 1 1224456 Q ss_pred EecCC---CcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHh-hccC Q lcl|NC_011308. 82 IGNSS---DYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVF-GGLD 154 (154) Q Consensus 82 V~~~~---~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l-~~l~ 154 (154) |+.+. -|+.|+|||| ..|||||||+||++.+++++.+.|.+.| ++|+ T Consensus 81 vg~~~~~~~y~~f~E~GT--------------------------~k~~a~PF~~pA~~~~~~~~~~~~~~~~~~~l~ 131 (133) T protein:vir:10 81 VGPSKQHHMKVLAQEFGT--------------------------VKQVADPFIRPALDYNVQTVLRVLTVEIRNGIQ 131 (133) T ss_pred ecCCCCccceEeeeccCC--------------------------CCCCCCccchHHHHHhHHHHHHHHHHHHHHHhh Confidence 66553 4899999998 4789999999999999999988888775 4566 No 53 >protein:vir:9879 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:2718 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795641;genbank:gi:28876400;genbank:GeneID:1257931 Probab=99.74 E-value=3.5e-21 Score=132.70 Aligned_cols=115 Identities=16% Similarity=0.172 Sum_probs=93.1 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--cCCc-------cccccccceeEEeecCceEEEEecC-- Q lcl|NC_011308. 
17 IKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGD--SSNN-------VTGEYANKTDFEVDKRKQEVKIGNS-- 85 (154) Q Consensus 17 v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~--~aPv-------dTG~Lr~SI~~~~~~~~~~~~V~~~-- 85 (154) |.|.+.|.+.++.. .++.+++++...++++..+ +.+ .+|| |||+|++||..+...++++++|+.. T Consensus 1 i~G~~~L~~~Lk~~---s~~dvk~VVkkN~ael~~r-~q~~~~~pv~~~~k~~dTG~lkRSi~l~~~~~g~~~~vgp~g~ 76 (127) T protein:vir:98 1 MTGMPALEVKLRSM---SEKRWDRVANKNLTEMFNR-AARPPGTPIGKNTKRHKSGELLRSRRLKKVNSSKDVITGNFGY 76 (127) T ss_pred CcChHHHHHHHHHh---hHHHHHHHHhhhhHHHHHH-HHhccCCceeccccccCcccceeeeEEEEecCCceEEeccCcc Confidence 77888887766533 4566888888888887554 344 4788 9999999999999999999999985 Q ss_pred -CCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_011308. 86 -SDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGG 152 (154) Q Consensus 86 -~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~ 152 (154) .+||+||||||+... .|+.+ .+++|||||.|||+..++.|.++|.+.+++ T Consensus 77 t~dYapyvEyGTR~m~---------------~~~~~--gf~~aqp~l~paf~~Qk~iF~~DL~~l~k~ 127 (127) T protein:vir:98 77 IKDYAPHVEYGHRIVR---------------NGKQV--GYANGTKYLFNNVKKQREIYRQDMLNELRR 127 (127) T ss_pred cccccceeecceeeee---------------ccccc--ccccCccccccchHHHhHHHHHHHHHHhcC Confidence 999999999996332 11111 258999999999999999999999999999 No 54 >protein:vir:79034 Length: 141 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110729;genbank:gi:134287346;genbank:GeneID:4955208 Probab=99.74 E-value=1.3e-20 Score=129.59 Aligned_cols=121 Identities=18% Similarity=0.140 Sum_probs=86.7 Q ss_pred cch----hhhhHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeE---------EeecCce Q lcl|NC_011308. 13 MAD----DIKFEMDMSKIKDMF-DDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDF---------EVDKRKQ 78 (154) Q Consensus 13 Ma~----~v~~~~~l~~~~~~l-~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~---------~~~~~~~ 78 (154) ||. 
++++.+++.+.++.+ +..+.+.+++++.+++.++ .+.++..+|||||+||+||+. ...++++ T Consensus 1 M~~~~~~d~~gl~~~~~~l~~~~~~~~~~~~~~~~~~~a~~l-~~~vk~~tPVdTG~Lr~sw~~~~~~~~~~~~~~g~~~ 79 (141) T protein:vir:79 1 MARWGSVDFREFKRVCKKMEKLTKIDLDKFCKDAARELAARL-LGKVIRRTPVDTGFLRQGWNGVAYARSLPVYKQGNNY 79 (141) T ss_pred CCCCccCcHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHH-HHHHHHhCCCcchhhcccccccccccccceeecCCee Confidence 665 344444444444433 3355666677766666554 457889999999999999843 3356678 Q ss_pred EEEEecCCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_011308. 79 EVKIGNSSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGGLD 154 (154) Q Consensus 79 ~~~V~~~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~l~ 154 (154) +++|+++++||+|||+||.... ++ +..+++.+|..|+++.+..+.++|++.|+++= T Consensus 80 ~v~v~n~~~YA~~VE~Ghr~~~----~~----------------gfV~G~fml~~s~~~~~~~~~~~~~~~l~~~l 135 (141) T protein:vir:79 80 IIEVVNPTEYASYVNFGHRTKD----GK----------------GWVKGQHFLTISEMELQSQVDKIIEKKLLILL 135 (141) T ss_pred EEEEecCCcchhhhhcceeecC----Cc----------------ceeCCchhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 8999999999999999984221 10 13478889999999999998888888877754 No 55 >protein:vir:1273 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690765;genbank:gi:22855005;genbank:GeneID:955232 Probab=99.72 E-value=1.9e-20 Score=128.70 Aligned_cols=115 Identities=12% Similarity=0.097 Sum_probs=91.0 Q ss_pred cch-hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcc---ccccccceeEE---e-ecCceEEEEec Q lcl|NC_011308. 13 MAD-DIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNV---TGEYANKTDFE---V-DKRKQEVKIGN 84 (154) Q Consensus 13 Ma~-~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvd---TG~Lr~SI~~~---~-~~~~~~~~V~~ 84 (154) |+. +|+|.++|.+.++.|...+.+.+.++ ++.+++++...++.++|++ ||+|++||.+. . ..+..+++|+. 
T Consensus 1 M~~~~i~Gl~el~~~l~~l~~~~~~~~~~a-l~~~a~~v~~~~k~~ap~~~~~tg~l~~~I~~~~~k~~~~g~~~v~Vg~ 79 (127) T protein:vir:12 1 MADMSFDGIDDLTQYFEKIGGDIEKVEPVA-LKAGGEIIAERQRSHVNRSDKKQPHMQDNITVSNVRESKDGVRFVAVGP 79 (127) T ss_pred CeeeeehhHHHHHHHHHHhhHHHHHHHHHH-HHHHHHHHHHHHHHhCCCCCCChhHHHHhhhccccccccCceeEEEEee Confidence 885 67888999999999988887655554 5566778888999999975 89999999642 1 23446778886 Q ss_pred C---CCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_011308. 85 S---SDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGGLD 154 (154) Q Consensus 85 ~---~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~l~ 154 (154) + +.|+.|+|||| ..|+|||||+||++++++++.+.|.+.|++-= T Consensus 80 ~~~~~~y~~f~E~GT--------------------------~~~~a~Pf~~pa~~~~~~~~~~~~~~~~~~~l 126 (127) T protein:vir:12 80 NKKVAYRGRFLEWGT--------------------------SKMPPQPFIEKGGKEGEGPAVELMERILTAPI 126 (127) T ss_pred CCCCcceeeeeccCc--------------------------cCCCCCccchHhHHHHHHHHHHHHHHHHHHhc Confidence 5 45888899998 47899999999999999998887776665433 No 56 >protein:vir:4347 Length: 164 # NCBI annotation: Orf14 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061510;genbank:gi:9635606;genbank:GeneID:1262873 Probab=99.72 E-value=1.8e-20 Score=128.79 Aligned_cols=116 Identities=17% Similarity=0.198 Sum_probs=86.4 Q ss_pred cch----hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc-----cccccccceeEEeec------Cc Q lcl|NC_011308. 13 MAD----DIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNN-----VTGEYANKTDFEVDK------RK 77 (154) Q Consensus 13 Ma~----~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPv-----dTG~Lr~SI~~~~~~------~~ 77 (154) ||+ +|+|.++|.+.++.|.+.+.+.+.+.++..+++++++.++.++|+ ++|.|++||.+.... +. 
T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~~~~~~~~~~~ 80 (164) T protein:vir:43 1 MADTVEFSITGLDSLLGKLDSVTDDVKRRGGRAALRKAAMIVVQAAKQGAEKVDDPGTGRSISDNIALRWNGRLFKRTGD 80 (164) T ss_pred CCcceEEeeecHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccCCCccchhhhhhhhhcccCccccccc Confidence 886 456889999999999988877666667777889999999999997 678999999664311 11 Q ss_pred eEEEEe-------------------cCCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHH Q lcl|NC_011308. 78 QEVKIG-------------------NSSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDE 138 (154) Q Consensus 78 ~~~~V~-------------------~~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~ 138 (154) +...|+ .+..|+.|||||| .+|||||||+||++++ T Consensus 81 ~~~~vg~~~~~~~~~~~~~~~~~~~~~~~y~~f~EfGT--------------------------~km~a~PFlrPA~~~~ 134 (164) T protein:vir:43 81 LGFRIGVLHGAVLPKKGERSDKTANAPTPHWRLLEFGT--------------------------EDMRAQPFMRSALADN 134 (164) T ss_pred eeEEecccccccccccccccccCCCCCcceEEEeecCC--------------------------CCCCCCcchhhhHHHh Confidence 222221 2345888888887 5799999999999999 Q ss_pred HHHHHHHHHHHhh-ccC Q lcl|NC_011308. 139 KSKVKDYVIKVFG-GLD 154 (154) Q Consensus 139 ~~~i~~~i~~~l~-~l~ 154 (154) ++++.+.|.+.|+ +|+ T Consensus 135 k~~~~~~~~~~l~~~i~ 151 (164) T protein:vir:43 135 IAEVTSTFVSEYEKGID 151 (164) T ss_pred HHHHHHHHHHHHHHHHH Confidence 9998766555443 233 No 57 >protein:vir:1891 Length: 179 # NCBI annotation: gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037671;genbank:gi:9634129;genbank:GeneID:1262520 Probab=99.71 E-value=2.4e-20 Score=128.19 Aligned_cols=140 Identities=9% Similarity=0.098 Sum_probs=88.3 Q ss_pred cch----hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc-----cccccccceeEEeecCceEEEEe Q lcl|NC_011308. 
13 MAD----DIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNN-----VTGEYANKTDFEVDKRKQEVKIG 83 (154) Q Consensus 13 Ma~----~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPv-----dTG~Lr~SI~~~~~~~~~~~~V~ 83 (154) ||+ +|+|.++|.+.++.|++.+.+.+.+.++..+++++++.|+.+||+ ++|.|++||.+...... ..-. T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~~~~~--~~~~ 78 (179) T protein:vir:18 1 MADSVEVSLTGLESLLGKMEAVSEVTRNKAGRFALRKAANIIRDRARSNASRVDDPLTKEAIHKNIVASFSSKQ--FRRT 78 (179) T ss_pred CCceEEEEeecHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccccchhhhhhheeecccccc--cccc Confidence 885 456889999999999988876665666677789999999999965 57888999865432110 1112 Q ss_pred cCCCcccccccCccccccCCCCccc--cc-ce-------ecccCcee------ecCCCCCCchhHHHHHHHHHHHHHHHH Q lcl|NC_011308. 84 NSSDYAIYYEFGTGEKSEKGGGRAG--GW-SY-------MDKNGKWH------FTRGSKASKRMRYTFRDEKSKVKDYVI 147 (154) Q Consensus 84 ~~~~YA~yVE~GTg~~~~~~~~~~~--~~-~~-------~~~~g~~~------~t~g~~a~PFl~pA~~~~~~~i~~~i~ 147 (154) .+..|.++|+.||+++......... .+ .+ .+....|| -|.+|||||||+||++++++++.+.|. T Consensus 79 g~~~~~vgv~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~y~~fvEfGT~kmpa~PFlrPA~~~~~~~a~~~i~ 158 (179) T protein:vir:18 79 GDLAFRVGVMGGARQYANTKANVRKGRAGKTYKTSGDKGNPGGDTWYWRFLEFGTEHTSARPILRPAMNGVDNDVINVFS 158 (179) T ss_pred cceeEeeecccccccccccccccccCcccccccccccccCCCCccceeEEeccCCCCCCCCccchhhHHhhHHHHHHHHH Confidence 2333555566666554332211000 00 00 00000111 157899999999999999998887777 Q ss_pred HHhh-ccC Q lcl|NC_011308. 
148 KVFG-GLD 154 (154) Q Consensus 148 ~~l~-~l~ 154 (154) +.|+ +|+ T Consensus 159 ~~l~~~i~ 166 (179) T protein:vir:18 159 TEMGKAID 166 (179) T ss_pred HHHHHHHH Confidence 6664 345 No 58 >protein:vir:102085 Length: 146 # NCBI annotation: head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512318;genbank:gi:89152487;genbank:GeneID:3953078 Probab=99.71 E-value=3.8e-20 Score=127.02 Aligned_cols=115 Identities=17% Similarity=0.109 Sum_probs=88.5 Q ss_pred cch----hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEE---------------- Q lcl|NC_011308. 13 MAD----DIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFE---------------- 72 (154) Q Consensus 13 Ma~----~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~---------------- 72 (154) ||+ +|+|.++|.+.+++|.+...+.+.+++ ..++++++..++.++|+++|.|++++... T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~~~~~~~al-~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~ 79 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRGEKIEDKAL-AAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKA 79 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHH-HHHHHHHHHHHHHhCCCccccccccccccccccccccccceeccc Confidence 886 477889999999999988776555554 56678888899999999999998876421 Q ss_pred -eecCceEEEEec------CCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHH Q lcl|NC_011308. 73 -VDKRKQEVKIGN------SSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDY 145 (154) Q Consensus 73 -~~~~~~~~~V~~------~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~ 145 (154) ...+...+.|+. +..||.|+|||| .+|||+|||+||++++++++.+. 
T Consensus 80 ~~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT--------------------------~~~~a~PFl~pa~~~~k~~~~~~ 133 (146) T protein:vir:10 80 KLEGGIKTVKIGLNKADRSPWFYLKFHEWGT--------------------------SKMPAHPFIEPGFNASKAEAVRA 133 (146) T ss_pred cccccceeEEeeeccCCCCCcceeeeeccCC--------------------------CCCCCCcchhHHHHHhHHHHHHH Confidence 122334455653 346999999998 47999999999999999998888 Q ss_pred HHHHhhc-cC Q lcl|NC_011308. 146 VIKVFGG-LD 154 (154) Q Consensus 146 i~~~l~~-l~ 154 (154) |.++|++ |+ T Consensus 134 ~~~~l~~~l~ 143 (146) T protein:vir:10 134 MTDILKNEMR 143 (146) T ss_pred HHHHHHHHHh Confidence 7776653 44 No 59 >protein:vir:107568 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338191;genbank:gi:77020147;genbank:GeneID:3703699 Probab=99.71 E-value=3.8e-20 Score=127.02 Aligned_cols=115 Identities=17% Similarity=0.109 Sum_probs=88.5 Q ss_pred cch----hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEE---------------- Q lcl|NC_011308. 13 MAD----DIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFE---------------- 72 (154) Q Consensus 13 Ma~----~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~---------------- 72 (154) ||+ +|+|.++|.+.+++|.+...+.+.+++ ..++++++..++.++|+++|.|++++... T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~~~~~~~al-~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~ 79 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRGEKIEDKAL-AAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKA 79 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHH-HHHHHHHHHHHHHhCCCccccccccccccccccccccccceeccc Confidence 886 477889999999999988776555554 56678888899999999999998876421 Q ss_pred -eecCceEEEEec------CCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHH Q lcl|NC_011308. 
73 -VDKRKQEVKIGN------SSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDY 145 (154) Q Consensus 73 -~~~~~~~~~V~~------~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~ 145 (154) ...+...+.|+. +..||.|+|||| .+|||+|||+||++++++++.+. T Consensus 80 ~~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT--------------------------~~~~a~PFl~pa~~~~k~~~~~~ 133 (146) T protein:vir:10 80 KLEGGIKTVKIGLNKADRSPWFYLKFHEWGT--------------------------SKMPAHPFIEPGFNASKAEAVRA 133 (146) T ss_pred cccccceeEEeeeccCCCCCcceeeeeccCC--------------------------CCCCCCcchhHHHHHhHHHHHHH Confidence 122334455653 346999999998 47999999999999999998888 Q ss_pred HHHHhhc-cC Q lcl|NC_011308. 146 VIKVFGG-LD 154 (154) Q Consensus 146 i~~~l~~-l~ 154 (154) |.++|++ |+ T Consensus 134 ~~~~l~~~l~ 143 (146) T protein:vir:10 134 MTDILKNEMR 143 (146) T ss_pred HHHHHHHHHh Confidence 7776653 44 No 60 >protein:vir:105007 Length: 146 # NCBI annotation: conserved phage protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459972;genbank:gi:85701387;genbank:GeneID:3882148 Probab=99.71 E-value=3.8e-20 Score=127.02 Aligned_cols=115 Identities=17% Similarity=0.109 Sum_probs=88.5 Q ss_pred cch----hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEE---------------- Q lcl|NC_011308. 13 MAD----DIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFE---------------- 72 (154) Q Consensus 13 Ma~----~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~---------------- 72 (154) ||+ +|+|.++|.+.+++|.+...+.+.+++ ..++++++..++.++|+++|.|++++... 
T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~~~~~~~al-~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~ 79 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRGEKIEDKAL-AAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKA 79 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHH-HHHHHHHHHHHHHhCCCccccccccccccccccccccccceeccc Confidence 886 477889999999999988776555554 56678888899999999999998876421 Q ss_pred -eecCceEEEEec------CCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHH Q lcl|NC_011308. 73 -VDKRKQEVKIGN------SSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDY 145 (154) Q Consensus 73 -~~~~~~~~~V~~------~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~ 145 (154) ...+...+.|+. +..||.|+|||| .+|||+|||+||++++++++.+. T Consensus 80 ~~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT--------------------------~~~~a~PFl~pa~~~~k~~~~~~ 133 (146) T protein:vir:10 80 KLEGGIKTVKIGLNKADRSPWFYLKFHEWGT--------------------------SKMPAHPFIEPGFNASKAEAVRA 133 (146) T ss_pred cccccceeEEeeeccCCCCCcceeeeeccCC--------------------------CCCCCCcchhHHHHHhHHHHHHH Confidence 122334455653 346999999998 47999999999999999998888 Q ss_pred HHHHhhc-cC Q lcl|NC_011308. 146 VIKVFGG-LD 154 (154) Q Consensus 146 i~~~l~~-l~ 154 (154) |.++|++ |+ T Consensus 134 ~~~~l~~~l~ 143 (146) T protein:vir:10 134 MTDILKNEMR 143 (146) T ss_pred HHHHHHHHHh Confidence 7776653 44 No 61 >protein:vir:102875 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338140;genbank:gi:77020200;genbank:GeneID:3703784 Probab=99.71 E-value=3.8e-20 Score=127.02 Aligned_cols=115 Identities=17% Similarity=0.109 Sum_probs=88.5 Q ss_pred cch----hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEE---------------- Q lcl|NC_011308. 
13 MAD----DIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFE---------------- 72 (154) Q Consensus 13 Ma~----~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~---------------- 72 (154) ||+ +|+|.++|.+.+++|.+...+.+.+++ ..++++++..++.++|+++|.|++++... T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~~~~~~~al-~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~ 79 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRGEKIEDKAL-AAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKA 79 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHH-HHHHHHHHHHHHHhCCCccccccccccccccccccccccceeccc Confidence 886 477889999999999988776555554 56678888899999999999998876421 Q ss_pred -eecCceEEEEec------CCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHH Q lcl|NC_011308. 73 -VDKRKQEVKIGN------SSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDY 145 (154) Q Consensus 73 -~~~~~~~~~V~~------~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~ 145 (154) ...+...+.|+. +..||.|+|||| .+|||+|||+||++++++++.+. T Consensus 80 ~~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT--------------------------~~~~a~PFl~pa~~~~k~~~~~~ 133 (146) T protein:vir:10 80 KLEGGIKTVKIGLNKADRSPWFYLKFHEWGT--------------------------SKMPAHPFIEPGFNASKAEAVRA 133 (146) T ss_pred cccccceeEEeeeccCCCCCcceeeeeccCC--------------------------CCCCCCcchhHHHHHhHHHHHHH Confidence 122334455653 346999999998 47999999999999999998888 Q ss_pred HHHHhhc-cC Q lcl|NC_011308. 
146 VIKVFGG-LD 154 (154) Q Consensus 146 i~~~l~~-l~ 154 (154) |.++|++ |+ T Consensus 134 ~~~~l~~~l~ 143 (146) T protein:vir:10 134 MTDILKNEMR 143 (146) T ss_pred HHHHHHHHHh Confidence 7776653 44 No 62 >protein:vir:5745 Length: 135 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892056;genbank:gi:33770519;interpro:IPR010064;interpro:IPR011693;uniprot:Q7Y404;genbank:GeneID:2637451 Probab=99.71 E-value=3.1e-20 Score=127.52 Aligned_cols=116 Identities=13% Similarity=0.134 Sum_probs=90.0 Q ss_pred cchh--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccc----cccccceeEEe---e--cCceEEE Q lcl|NC_011308. 13 MADD--IKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVT----GEYANKTDFEV---D--KRKQEVK 81 (154) Q Consensus 13 Ma~~--v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdT----G~Lr~SI~~~~---~--~~~~~~~ 81 (154) |+.+ |+|.++|.+.++.|...+.+.+.+.++..++++++..++.++|+++ |.|++||.+.- . ....++. T Consensus 1 M~~~~~i~Gl~el~~~l~~L~~~~~~k~~~~Al~~~a~~v~~~~k~~ap~~~~~~~g~l~~~I~i~~~k~~~~~~~v~v~ 80 (135) T protein:vir:57 1 MIPEIEISGLQELERRLIAVGEEVGTKILRDAGRAAMAVVEADMKQNAGYDNSSTNAHMRDSIKIRSSRGKAGSTVVVLR 80 (135) T ss_pred CceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhHHhhcccccccccccceeEEEE Confidence 8865 5688899999999998887777666667788889999999999975 99999997542 1 1223455 Q ss_pred EecCCC---cccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhh-ccC Q lcl|NC_011308. 82 IGNSSD---YAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFG-GLD 154 (154) Q Consensus 82 V~~~~~---YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~-~l~ 154 (154) |+.+.. 
|+.|+|||| ..|||||||+||++++++++.+.|.+.|+ +|+ T Consensus 81 vg~~~~~~~~~~f~E~GT--------------------------~~~~a~PF~~pa~~~~~~~~~~~~~~~~~~~l~ 131 (135) T protein:vir:57 81 VGPTRSHYMKALAQEFGT--------------------------IKQVAKPFIRPALDYNKMQVLRILTVEIRDGLS 131 (135) T ss_pred ecCCCCcceeEeecccCC--------------------------CCCCCCcchhHhHHHhHHHHHHHHHHHHHHHHH Confidence 666554 478889998 47899999999999999988777766663 344 No 63 >protein:vir:4704 Length: 125 # NCBI annotation: phi PVL ORF 11 homologue # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061636;genbank:gi:9635723;genbank:GeneID:1262995 Probab=99.70 E-value=5.4e-20 Score=126.21 Aligned_cols=114 Identities=13% Similarity=0.006 Sum_probs=94.5 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccc--cccceeEEe-e----cCceEEEEecC Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGE--YANKTDFEV-D----KRKQEVKIGNS 85 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~--Lr~SI~~~~-~----~~~~~~~V~~~ 85 (154) |+.+|+| ++|...+++|.....+. .+.++..++++++..++.++|++++. |++||.++- . .+...+.|+.+ T Consensus 1 M~v~v~~-~~L~~~l~~l~~~~~k~-~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~~ 78 (125) T protein:vir:47 1 MGARIES-NNIEQGLKNAVLKMNLN-SNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGYA 78 (125) T ss_pred CeeEeeH-HHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEeccC Confidence 9999887 56888888888766544 45566788889999999999998887 999997642 1 12345678877 Q ss_pred CC---cccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_011308. 
86 SD---YAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGGLD 154 (154) Q Consensus 86 ~~---YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~l~ 154 (154) .+ ||.|+|||| ..|||+|||+||++++++++.+.|.+.|++|+ T Consensus 79 k~~~~~a~F~E~GT--------------------------~k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~ 124 (125) T protein:vir:47 79 KGVSHRIHATEFGT--------------------------MYQKPQLFITKTEKQGKNKVLKTMLDTAKRLQ 124 (125) T ss_pred CCCceEEEeccCCc--------------------------cCCCCCchhhHHHHHhHHHHHHHHHHHHHHHh Confidence 65 899999999 47899999999999999999999999999999 No 64 >protein:vir:9414 Length: 125 # NCBI annotation: phi PVL orf 11-like protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803392;genbank:gi:29028704;genbank:GeneID:1258141 Probab=99.70 E-value=5.4e-20 Score=126.21 Aligned_cols=114 Identities=13% Similarity=0.006 Sum_probs=94.5 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccc--cccceeEEe-e----cCceEEEEecC Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGE--YANKTDFEV-D----KRKQEVKIGNS 85 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~--Lr~SI~~~~-~----~~~~~~~V~~~ 85 (154) |+.+|+| ++|...+++|.....+. .+.++..++++++..++.++|++++. |++||.++- . .+...+.|+.+ T Consensus 1 M~v~v~~-~~L~~~l~~l~~~~~k~-~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~~ 78 (125) T protein:vir:94 1 MGARIES-NNIEQGLKNAVLKMNLN-SNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGYA 78 (125) T ss_pred CeeEeeH-HHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEeccC Confidence 9999887 56888888888766544 45566788889999999999998887 999997642 1 12345678877 Q ss_pred CC---cccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_011308. 
86 SD---YAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGGLD 154 (154) Q Consensus 86 ~~---YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~l~ 154 (154) .+ ||.|+|||| ..|||+|||+||++++++++.+.|.+.|++|+ T Consensus 79 k~~~~~a~F~E~GT--------------------------~k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~ 124 (125) T protein:vir:94 79 KGVSHRIHATEFGT--------------------------MYQKPQLFITKTEKQGKNKVLKTMLDTAKRLQ 124 (125) T ss_pred CCCceEEEeccCCc--------------------------cCCCCCchhhHHHHHhHHHHHHHHHHHHHHHh Confidence 65 899999999 47899999999999999999999999999999 No 65 >protein:vir:81106 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429878;genbank:gi:156603931;genbank:GeneID:5525326 Probab=99.70 E-value=5.4e-20 Score=126.21 Aligned_cols=114 Identities=13% Similarity=0.006 Sum_probs=94.5 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccc--cccceeEEe-e----cCceEEEEecC Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGE--YANKTDFEV-D----KRKQEVKIGNS 85 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~--Lr~SI~~~~-~----~~~~~~~V~~~ 85 (154) |+.+|+| ++|...+++|.....+. .+.++..++++++..++.++|++++. |++||.++- . .+...+.|+.+ T Consensus 1 M~v~v~~-~~L~~~l~~l~~~~~k~-~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~~ 78 (125) T protein:vir:81 1 MGARIES-NNIEQGLKNAVLKMNLN-SNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGYA 78 (125) T ss_pred CeeEeeH-HHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEeccC Confidence 9999887 56888888888766544 45566788889999999999998887 999997642 1 12345678877 Q ss_pred CC---cccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_011308. 
86 SD---YAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGGLD 154 (154) Q Consensus 86 ~~---YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~l~ 154 (154) .+ ||.|+|||| ..|||+|||+||++++++++.+.|.+.|++|+ T Consensus 79 k~~~~~a~F~E~GT--------------------------~k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~ 124 (125) T protein:vir:81 79 KGVSHRIHATEFGT--------------------------MYQKPQLFITKTEKQGKNKVLKTMLDTAKRLQ 124 (125) T ss_pred CCCceEEEeccCCc--------------------------cCCCCCchhhHHHHHhHHHHHHHHHHHHHHHh Confidence 65 899999999 47899999999999999999999999999999 No 66 >protein:vir:98342 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918934;genbank:gi:119443696;genbank:GeneID:4594504 Probab=99.70 E-value=5.4e-20 Score=126.21 Aligned_cols=114 Identities=13% Similarity=0.006 Sum_probs=94.5 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccc--cccceeEEe-e----cCceEEEEecC Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGE--YANKTDFEV-D----KRKQEVKIGNS 85 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~--Lr~SI~~~~-~----~~~~~~~V~~~ 85 (154) |+.+|+| ++|...+++|.....+. .+.++..++++++..++.++|++++. |++||.++- . .+...+.|+.+ T Consensus 1 M~v~v~~-~~L~~~l~~l~~~~~k~-~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~~ 78 (125) T protein:vir:98 1 MGARIES-NNIEQGLKNAVLKMNLN-SNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGYA 78 (125) T ss_pred CeeEeeH-HHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEeccC Confidence 9999887 56888888888766544 45566788889999999999998887 999997642 1 12345678877 Q ss_pred CC---cccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_011308. 
86 SD---YAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGGLD 154 (154) Q Consensus 86 ~~---YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~l~ 154 (154) .+ ||.|+|||| ..|||+|||+||++++++++.+.|.+.|++|+ T Consensus 79 k~~~~~a~F~E~GT--------------------------~k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~ 124 (125) T protein:vir:98 79 KGVSHRIHATEFGT--------------------------MYQKPQLFITKTEKQGKNKVLKTMLDTAKRLQ 124 (125) T ss_pred CCCceEEEeccCCc--------------------------cCCCCCchhhHHHHHhHHHHHHHHHHHHHHHh Confidence 65 899999999 47899999999999999999999999999999 No 67 >protein:vir:79988 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430006;genbank:gi:156604061;genbank:GeneID:5525448 Probab=99.70 E-value=5.4e-20 Score=126.21 Aligned_cols=114 Identities=13% Similarity=0.006 Sum_probs=94.5 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccc--cccceeEEe-e----cCceEEEEecC Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGE--YANKTDFEV-D----KRKQEVKIGNS 85 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~--Lr~SI~~~~-~----~~~~~~~V~~~ 85 (154) |+.+|+| ++|...+++|.....+. .+.++..++++++..++.++|++++. |++||.++- . .+...+.|+.+ T Consensus 1 M~v~v~~-~~L~~~l~~l~~~~~k~-~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~~ 78 (125) T protein:vir:79 1 MGARIES-NNIEQGLKNAVLKMNLN-SNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGYA 78 (125) T ss_pred CeeEeeH-HHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEeccC Confidence 9999887 56888888888766544 45566788889999999999998887 999997642 1 12345678877 Q ss_pred CC---cccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_011308. 
86 SD---YAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGGLD 154 (154) Q Consensus 86 ~~---YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~l~ 154 (154) .+ ||.|+|||| ..|||+|||+||++++++++.+.|.+.|++|+ T Consensus 79 k~~~~~a~F~E~GT--------------------------~k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~ 124 (125) T protein:vir:79 79 KGVSHRIHATEFGT--------------------------MYQKPQLFITKTEKQGKNKVLKTMLDTAKRLQ 124 (125) T ss_pred CCCceEEEeccCCc--------------------------cCCCCCchhhHHHHHhHHHHHHHHHHHHHHHh Confidence 65 899999999 47899999999999999999999999999999 No 68 >protein:vir:3873 Length: 128 # NCBI annotation: putative head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680490;swissprot:trembl:p94214;genbank:gi:22296530;interpro:IPR010064;uniprot:P94214;genbank:GeneID:951688 Probab=99.70 E-value=8.2e-20 Score=125.21 Aligned_cols=115 Identities=10% Similarity=0.025 Sum_probs=92.5 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccc------cccceeEE---eecCceEEEEe Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGE------YANKTDFE---VDKRKQEVKIG 83 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~------Lr~SI~~~---~~~~~~~~~V~ 83 (154) |+.+|+|.++|.+.+++|.+.+.+.+.+ ++..++++++..++.++|+++|. |+++|.+. ...+..++.|+ T Consensus 1 m~v~i~Gl~el~~~l~~l~~~~~k~~~~-al~~ga~~~~~~~k~~ap~~~~~~~~~~h~~d~I~~~~~k~~~g~~~~~VG 79 (128) T protein:vir:38 1 MGVKVTGDAELLANLNKLQFGVAKEARA-AVRDGAQKFADKLKSNTPEWDGETDMSGHLRDDIKLSSVRETSGLTEVDVG 79 (128) T ss_pred CccchhhHHHHHHHHHHhHHHHHHHHHH-HHHHHHHHHHHHHHHhCCCcCCCCcccchhhhhhccccccccCceeEEEee Confidence 9999999999999999999887665554 45667788888999999998765 66777542 23344567787 Q ss_pred cC---CCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_011308. 
84 NS---SDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGGLD 154 (154) Q Consensus 84 ~~---~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~l~ 154 (154) .+ .-|+.|+|||| ..|+|+|||+||++.+++++.+.|.+.|++-= T Consensus 80 ~~k~~~~y~~f~E~GT--------------------------~k~~a~pF~~pa~~~~~~~~~~~~~~~l~k~i 127 (128) T protein:vir:38 80 YGKDTGWRAHFPNSGT--------------------------SMQDPQHFIEETQEIMRPVVIAAFLSHLKEGG 127 (128) T ss_pred ecCCCceEEeeeccCc--------------------------cCCCCCcchhHHHHHhHHHHHHHHHHHHHhhc Confidence 65 45999999998 47899999999999999999988888877644 No 69 >protein:vir:1386 Length: 149 # NCBI annotation: Gp9 protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612838;genbank:gi:20065972;genbank:GeneID:935787 Probab=99.65 E-value=5.9e-19 Score=120.52 Aligned_cols=116 Identities=15% Similarity=0.106 Sum_probs=85.5 Q ss_pred cch----hhhhHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhcCCcc-------------ccccccceeEE-e Q lcl|NC_011308. 13 MAD----DIKFEMDMSKIKDMFD-DTAEKALKQIGEHMKTEIAEGGHGDSSNNV-------------TGEYANKTDFE-V 73 (154) Q Consensus 13 Ma~----~v~~~~~l~~~~~~l~-~~~~~~v~~a~~~~~~~i~~~~ak~~aPvd-------------TG~Lr~SI~~~-~ 73 (154) ||+ +|+|.++|.+.++.|. ....+.+.+.++..++++++..++.++|+. +|.++++|.+. + T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~~k~~~~Al~~ga~~v~~~~k~~aP~~~~~~~~~~~~~~~~~~~~d~i~~~~~ 80 (149) T protein:vir:13 1 MSDGWEIKFEGLDDLIKTFEQLGTEKENEDVEKSILKECGDLAKKTVAPLIHISDDNSKSGRKGSRPPGHAANNIPEPKI 80 (149) T ss_pred CCceeEEEeecHHHHHHHHHhcccHHHHHHHHHHHHHHHHHHHHHHHHHhCCccCCccccccccccccchhhhcceeccc Confidence 996 5667788998898884 333344555566777889999999999974 56888888652 2 Q ss_pred --ecCceEEEEec------CCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHH Q lcl|NC_011308. 
74 --DKRKQEVKIGN------SSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDY 145 (154) Q Consensus 74 --~~~~~~~~V~~------~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~ 145 (154) ..+...+.|+. +.-|+.|+|||| ..|||||||+||++.+++++.+. T Consensus 81 ~~~~g~~~~~VG~~~~~~~~~~y~~f~E~GT--------------------------~k~~a~pF~~pa~~~~~~~~~~~ 134 (149) T protein:vir:13 81 RKKKGNLQCVVGWEKSDNTPFYYMKMEEWGT--------------------------SERPPHHAFGKTNKILKRVYDNI 134 (149) T ss_pred ccccceeEEEeeccCCCCCccceeeeeccCc--------------------------cCCCCCccchHHHHHHHHHHHHH Confidence 33344567763 456999999998 47899999999999999987766 Q ss_pred HHHHhhc-cC Q lcl|NC_011308. 146 VIKVFGG-LD 154 (154) Q Consensus 146 i~~~l~~-l~ 154 (154) |.+.|+. |+ T Consensus 135 ~~~~l~k~i~ 144 (149) T protein:vir:13 135 AQKKYDNFVK 144 (149) T ss_pred HHHHHHHHHH Confidence 6554432 22 No 70 >protein:vir:99528 Length: 92 # NCBI annotation: putative major tail protein # Family: family:all:180 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958541;genbank:gi:41179323;genbank:GeneID:2717166 Probab=99.64 E-value=3.2e-19 Score=122.00 Aligned_cols=86 Identities=15% Similarity=0.167 Sum_probs=68.9 Q ss_pred cch---hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCceEEEEe---cCC Q lcl|NC_011308. 13 MAD---DIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRKQEVKIG---NSS 86 (154) Q Consensus 13 Ma~---~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~---~~~ 86 (154) ||. +|+|.+.|.+.++... -.+.+++++.+.++++ ++.|+.++|||||+|++||.+++..+++++.|. 
..+ T Consensus 1 Ma~~~i~~~Gld~L~~~L~~~~--~~~~v~~vv~~~~~~l-~~~ak~~ap~dTG~lrrSI~~~~~~~g~~~~v~~~gp~a 77 (92) T protein:vir:99 1 MADYSISWDGLDALDEALANQQ--NMNTVKKVVKKHTANL-MTATQQAVPVDTGHLKQSAQIQISRDGFTGSVTYGGGLV 77 (92) T ss_pred CCceeeEeehHHHHHHHHHhhc--cHHHHHHHHHHHHHHH-HHHHHHhCCCCccccceeeeEEeecCCeeEEEEeccCcc Confidence 775 6677777776655432 2366788888887776 678999999999999999999999999999884 679 Q ss_pred CcccccccCccccccCCCCcccccceecccCceeecCCCCC Q lcl|NC_011308. 87 DYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKA 127 (154) Q Consensus 87 ~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a 127 (154) +||+||||||+ .|+| T Consensus 78 ~Ya~YvE~GTR--------------------------~M~A 92 (92) T protein:vir:99 78 NYAAYVEFGTR--------------------------FMDS 92 (92) T ss_pred cccccccccee--------------------------ecCC Confidence 99999999994 5665 No 71 >protein:vir:9708 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795470;genbank:gi:28876221;genbank:GeneID:1257765 Probab=99.60 E-value=4.3e-18 Score=115.82 Aligned_cols=112 Identities=14% Similarity=-0.055 Sum_probs=90.4 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccc----cccceeEEe---ec-CceEEEEecC-- Q lcl|NC_011308. 16 DIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGE----YANKTDFEV---DK-RKQEVKIGNS-- 85 (154) Q Consensus 16 ~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~----Lr~SI~~~~---~~-~~~~~~V~~~-- 85 (154) .|+|.++|.+.++.|.+...+..+ .++..++++++..++.++|+++|. |++||.+.- .. 
+..+++|+.+ T Consensus 1 mv~Gl~el~~~l~~l~~~~~~~~~-~al~~ga~~~~~~~k~~ap~~~~~~~~hl~d~I~~~~~k~~~~g~~~~~VG~~k~ 79 (125) T protein:vir:97 1 MTKGLDEILANLTKLEVKAPKTAK-AAVTEVAKEFEKALKANTPVYEVETDERLQEDTVISGFKGANVGIVSKEIGYGKA 79 (125) T ss_pred CchhHHHHHHHHHHhhHHHHHHHH-HHHHHHHHHHHHHHHHhCCcCCCCchhhHHhhhhcccccccccCceEEEEeecCC Confidence 478999999999999887766554 556677888899999999999988 999997532 22 2345677764 Q ss_pred -CCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_011308. 86 -SDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGGLD 154 (154) Q Consensus 86 -~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~l~ 154 (154) +.|+.|+|||| ..|||+|||+||++++++++.+.|.+.|++-= T Consensus 80 ~~~y~~f~E~GT--------------------------~k~~~~pF~~pa~~~~k~~~~~~~~~~~~~~L 123 (125) T protein:vir:97 80 TGWRAHYPNDGT--------------------------IYQRGQDFKERTINQMTPKAKQLYAEKVKEGL 123 (125) T ss_pred CceeEeeeccCc--------------------------cCCCcCccchHhHHHhHHHHHHHHHHHHHHHh Confidence 45999999998 47999999999999999999999888887644 No 72 >protein:vir:102963 Length: 163 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945289;genbank:gi:39653724;uniprot:Q708M3;genbank:GeneID:2672877 Probab=99.60 E-value=7e-18 Score=114.64 Aligned_cols=119 Identities=19% Similarity=0.231 Sum_probs=90.6 Q ss_pred cchhhhhHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHhcCCc--------------------------- Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDT-----AEKALKQIGEHMKTEIAEGGHGDSSNN--------------------------- 60 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~-----~~~~v~~a~~~~~~~i~~~~ak~~aPv--------------------------- 60 (154) |++.|.+. .++++.++|.+. 
+.+.+.+.+.+++.+++ +.++..+|| T Consensus 1 m~~~~d~~-~l~~f~k~l~~~~~~~~~~~~~~~~~~e~a~~ll-~~vk~rtPv~~~~~~~~~~~~~~~k~~k~~~~~~~k 78 (163) T protein:vir:10 1 MSGGFDYR-SFAKFANNFNRNANHAKVDRFMRQTLNYEGTELK-SKVKERTPVGVYTDHWVEFTTKDGKHVKFWASAHGK 78 (163) T ss_pred CCCccCHH-HHHHHHHHHHHHhhhcchHHHHHHHHHHHHHHHH-HHHHHhCCcccchhhhhhhhhcccchhhhhcccccc Confidence 99877654 466666666433 34456666666666654 356777786 Q ss_pred cccccccceeE---EeecCceEEEEecCCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHH Q lcl|NC_011308. 61 VTGEYANKTDF---EVDKRKQEVKIGNSSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRD 137 (154) Q Consensus 61 dTG~Lr~SI~~---~~~~~~~~~~V~~~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~ 137 (154) +||+||+||+. ...++.++++|+++++||+|||||+.. .+ |. ..|++++|..|.++ T Consensus 79 ~tG~lr~swk~~~~~k~~~~~~v~v~N~~~YA~~VE~GHR~---~~-------------gG-----fV~G~fml~~s~~~ 137 (163) T protein:vir:10 79 QGGTLQKGWSKSRIEVSGRTYKQKVYNKVYYAPHVEYGHKT---VN-------------GG-----FVPGQFFLHKTVED 137 (163) T ss_pred ccchhhccceecceeecCCceEEEEEecCCccchhhcceee---cC-------------Cc-----eeccchhhHHHHHH Confidence 89999999975 446777899999999999999999532 21 22 24789999999999 Q ss_pred HHHHHHHHHHHHhhccC Q lcl|NC_011308. 138 EKSKVKDYVIKVFGGLD 154 (154) Q Consensus 138 ~~~~i~~~i~~~l~~l~ 154 (154) .+.++.++|++.|.++- T Consensus 138 ~~~~~~~~~e~~l~~~l 154 (163) T protein:vir:10 138 TKSDMEKRVRDKYDGFM 154 (163) T ss_pred HHHHHHHHHHHHHHHHH Confidence 99999999999999987 No 73 >protein:vir:966 Length: 123 # NCBI annotation: Orf48 # Family: family:all:970 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076620;genbank:gi:13095728;genbank:GeneID:920248 Probab=99.57 E-value=2.8e-17 Score=111.35 Aligned_cols=118 Identities=17% Similarity=0.183 Sum_probs=94.1 Q ss_pred cchhhhhH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCceEEEEecCCCc- Q lcl|NC_011308. 
13 MADDIKFE---MDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRKQEVKIGNSSDY- 88 (154) Q Consensus 13 Ma~~v~~~---~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~Y- 88 (154) ||+.|+.. +.+.+.+..+.+.+.+.++.++.+.+.+++. ..+..+|++||.|++||++...+++..++|+++..| T Consensus 1 m~~~v~id~L~~~i~~~L~~y~~~v~~~v~~~v~~~a~~~~~-~lk~~sP~~TG~yaksW~~k~~~~~~~~v~~~~~~y~ 79 (123) T protein:vir:96 1 MANKISIDDLAKTIESEVRNWTKDVVDDIDDIKKDITKNGVK-QLRESSPKRTGDYAKNWTSQKLKNGDQVIYQKAPTYR 79 (123) T ss_pred CCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHhhCCccccccccceeeeecCCeeEEEEEecCCcc Confidence 99876643 4456666666777778888888887777654 567899999999999999988777777788888777 Q ss_pred -ccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_011308. 89 -AIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGG 152 (154) Q Consensus 89 -A~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~ 152 (154) ++.+|||... + .|. ..+|+||+.||.+...+.+++.|.++|++ T Consensus 80 l~HLLE~GHa~---r-------------~GG-----rV~a~phI~paee~~~~~l~~~i~r~l~~ 123 (123) T protein:vir:96 80 LTHLLENGHAK---R-------------NGG-----RVSPKVHIAPVEEELVSNYISRVEKRLSQ 123 (123) T ss_pred eEEeeecceee---c-------------CCc-----eeCcchhhhHHHHHHHHHHHHHHHHHhcC Confidence 7999999531 1 122 24899999999999999999999999999 No 74 >protein:vir:81147 Length: 126 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285816;genbank:gi:148747737;genbank:GeneID:5247190 Probab=99.56 E-value=3.3e-17 Score=110.93 Aligned_cols=120 Identities=18% Similarity=0.246 Sum_probs=90.1 Q ss_pred cch-hhhh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEee--cCceEEEEecCCCc Q lcl|NC_011308. 13 MAD-DIKF-EMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVD--KRKQEVKIGNSSDY 88 (154) Q Consensus 13 Ma~-~v~~-~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~--~~~~~~~V~~~~~Y 88 (154) ||+ +++. 
.+.+.+.++.+.+.+.+.+++++.+++.++ ...++.++|++||.|++||++... .+....+|+++..| T Consensus 1 Ma~i~id~la~~I~~~L~~y~~~v~~~v~~~v~~~a~~~-~~~ik~~aP~rTG~y~ksw~vk~~~~~g~~~~vv~~~~~~ 79 (126) T protein:vir:81 1 MANITIDRLADELLQAVKEYTDDVAEGVRKKVDETARKV-LKEAQALAPKRTGEYARTFTITKEDGYGTTKRIIWNKKHY 79 (126) T ss_pred CcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHhhCCcccchhhccccccccccCCcceEEEeccCCC Confidence 985 3332 344666677778888888888888776655 557899999999999999976542 22344566776666 Q ss_pred --ccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_011308. 89 --AIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGGLD 154 (154) Q Consensus 89 --A~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~l~ 154 (154) +..+||||-. . +| ..++|+|||+||++...+++++.|.++|++=- T Consensus 80 ~l~HLLEfGha~---r-------------~g-----GrV~a~Phi~Pa~e~~~~~~~~~i~~~l~~gg 126 (126) T protein:vir:81 80 RRVHLLEFGHAK---V-------------NG-----GRVKEYPHLRPAYDKHGARLPDELKRVIENGG 126 (126) T ss_pred Cceeeeecceec---C-------------CC-----CccCCCcchHHHHHHHHHHHHHHHHHHhhcCC Confidence 7899999731 1 11 13799999999999999999999999998544 No 75 >protein:vir:102154 Length: 119 # NCBI annotation: phage protein, HK97 gp10 family # Family: family:all:10671 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699937;genbank:gi:110804042;genbank:GeneID:4206698 Probab=99.53 E-value=3.1e-17 Score=111.09 Aligned_cols=113 Identities=17% Similarity=0.139 Sum_probs=90.8 Q ss_pred cch-hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCceEEEEecCC---Cc Q lcl|NC_011308. 13 MAD-DIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRKQEVKIGNSS---DY 88 (154) Q Consensus 13 Ma~-~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~---~Y 88 (154) ||+ +++|.+++...++++.. ..+.+++.++..+++++...+..++|++||+|.. |+..+...+ .++|+.+. 
=| T Consensus 1 Ma~iel~G~del~~~l~~~g~-~~~~ie~kAlk~g~e~I~~~~~~n~P~~tg~lkk-ik~~~kk~g-~~~VG~~ks~~fy 77 (119) T protein:vir:10 1 MASLEIEGFEEFEKFISEDMV-LDESTKRKGIKAGITKIGKAIEKNSPIKSGRLSK-VKIRVKNTG-LATEGTASSSEFY 77 (119) T ss_pred CceeehhhHHHHHHHHHhhhh-hhHHHHHHHHHHHhHHHHHHHhhcCCcccCCcce-eeeeeecCc-eeEeccCCcchhh Confidence 985 77888888888877774 4456666667788899999999999999999997 666676666 47777765 49 Q ss_pred ccccccCccccccCCCCcccccceecccCceeecCCCCCC-chhHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_011308. 89 AIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKAS-KRMRYTFRDEKSKVKDYVIKVFGGLD 154 (154) Q Consensus 89 A~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~-PFl~pA~~~~~~~i~~~i~~~l~~l~ 154 (154) +.|.|||| ..|||| |||.||++.+++++...|.+.|.+== T Consensus 78 ~kF~EFGT--------------------------Skm~a~~pF~~~a~~~~~~eA~~~~~~el~~~~ 118 (119) T protein:vir:10 78 DIFQNFGT--------------------------SEQKAHVGYFDRAVDETTNEAVEEVAEIIFRKM 118 (119) T ss_pred hhhccccc--------------------------cccCCCCCccccccccChHHHHHHHHHHHHHhc Confidence 99999999 478999 99999999999998888777664332 No 76 >protein:vir:102338 Length: 116 # NCBI annotation: hypothetical protein # Family: family:all:26573 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529563;genbank:gi:90592648;genbank:GeneID:3974470 Probab=99.37 E-value=2e-15 Score=101.21 Aligned_cols=111 Identities=10% Similarity=0.054 Sum_probs=77.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCc---cccccccceeE-EeecCceEEEEecCCCcccccccCccccccCCCCcccc Q lcl|NC_011308. 34 AEKALKQIGEHMKTEIAEGGHGDSSNN---VTGEYANKTDF-EVDKRKQEVKIGNSSDYAIYYEFGTGEKSEKGGGRAGG 109 (154) Q Consensus 34 ~~~~v~~a~~~~~~~i~~~~ak~~aPv---dTG~Lr~SI~~-~~~~~~~~~~V~~~~~YA~yVE~GTg~~~~~~~~~~~~ 109 (154) +.+++++++.+.+.+. .+.++.++|| |||+||+||++ ++.. 
..++|+++++||+|||||.+...-+ T Consensus 1 l~~~~~~~~~~~a~~l-~~~vk~rTPv~~~d~G~LR~sW~~g~v~k--~~~~v~N~~eYA~~VE~GHRq~~g~------- 70 (116) T protein:vir:10 1 MSKNLRRAKNNIGNKL-LRKVKPKTPVAKIDGGTARKSWKYKELNL--FDGVVSNNVEYIHHLEYGHRTRQGT------- 70 (116) T ss_pred CchHHHHHHHHHHHHH-HHHHHhhCCCCcCCCcccccCceeeeeec--cCceeecCCcccccccCCceeeCCc------- Confidence 4444555555555444 3457888998 67999999987 3333 3357999999999999996422111 Q ss_pred cceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_011308. 110 WSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGGLD 154 (154) Q Consensus 110 ~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~l~ 154 (154) ..+....|+.......+++.||..|+++.+..+.+++++.|.++= T Consensus 71 g~~~~~~gkrlk~~~V~G~fml~~s~~e~~~~~~~~~~~~~~~~l 115 (116) T protein:vir:10 71 GTSENYRPKPNGISFVPGVFMLARSVDEMSSIIDDELNQIIIDFW 115 (116) T ss_pred ceecccccccccCCccCceehHHHHHHHHHHHHHHHHHHHHHHhc Confidence 112223444444556789999999999999999999999997655 No 77 >protein:vir:103280 Length: 142 # NCBI annotation: phage-related hypothetical protein # Family: family:all:448 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277459;genbank:gi:71834102;genbank:GeneID:3562391 Probab=99.29 E-value=3.4e-14 Score=94.39 Aligned_cols=110 Identities=19% Similarity=0.197 Sum_probs=88.1 Q ss_pred cchh-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeec---------------- Q lcl|NC_011308. 13 MADD-IKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDK---------------- 75 (154) Q Consensus 13 Ma~~-v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~---------------- 75 (154) ||++ ..|..++.+..++.+..++..++++++++..+++. .+|||||+||+||.+++.. 
T Consensus 1 Ma~~~~sf~~~i~~~~~~ve~~~~~v~r~~a~~i~~~vv~-----~sPVdTGr~R~nw~vs~~~~~~~~~~~~d~~G~~t 75 (142) T protein:vir:10 1 MANDVVSFRNSINAWIDGVTEGVELIVEGTLTKATKDIVK-----LSPVDTGRFRGNWQATGNSPAAQSLNNYDPDGNET 75 (142) T ss_pred CccchhhhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----hCcccchhhcccceeeecCcccccccCcCCCCccc Confidence 9965 57888899999999888888888888877766643 6899999999999654321 Q ss_pred ---------------CceEEEEecCCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHH Q lcl|NC_011308. 76 ---------------RKQEVKIGNSSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKS 140 (154) Q Consensus 76 ---------------~~~~~~V~~~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~ 140 (154) .+.++.|.++++||.++|||+ .+|.|..|.+-+++.... T Consensus 76 ~~~~~~~~~~i~~~~~g~~iyi~Nn~pYA~~LEyG~--------------------------S~QAP~G~v~~a~q~~~~ 129 (142) T protein:vir:10 76 RNSLRRQIYALARDANTNVIYISNRLDYAQGLEFGS--------------------------SNQAPSGVLGVVQKRLGR 129 (142) T ss_pred hhhHHHHHHHhhhccccceEEEeeCcchhhhhhccc--------------------------cCCCcchHHHHHHHHHHH Confidence 133466889999999999997 479999999999998888 Q ss_pred HHHHHHHHHhhcc Q lcl|NC_011308. 141 KVKDYVIKVFGGL 153 (154) Q Consensus 141 ~i~~~i~~~l~~l 153 (154) .+.+.+.+.=+.| T Consensus 130 ~v~~a~~e~~~~~ 142 (142) T protein:vir:10 130 YFAEAVQEAKRAL 142 (142) T ss_pred HHHHHHHHhhccC Confidence 7777666666666 No 78 >protein:vir:95372 Length: 124 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764480;genbank:gi:115334634;genbank:GeneID:5179259 Probab=99.26 E-value=1.1e-13 Score=91.56 Aligned_cols=116 Identities=19% Similarity=0.212 Sum_probs=87.4 Q ss_pred cchhhh---hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHhcCCccccccccceeEEeecCceEEEEecCC Q lcl|NC_011308. 13 MADDIK---FEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAE---GGHGDSSNNVTGEYANKTDFEVDKRKQEVKIGNSS 86 (154) Q Consensus 13 Ma~~v~---~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~---~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~ 86 (154) ||+ |. 
+.+.+.+.+..+.+.+.+.|++++.+.+.+++. ...+..+|++||.++.||.......+ .+|++.. T Consensus 1 M~~-i~id~La~~I~~~L~~Ys~~v~~~v~~~v~~vak~a~~~lkk~i~~tspkrTG~YaK~W~~kk~~e~--~~V~nk~ 77 (124) T protein:vir:95 1 MAK-IKIGRLADEITSQLRKYSQVIADDVEQIMDDVTKEAVGRLKSKIQEVGLVQTGDYMRGWTRKRVPNG--WVIHNKT 77 (124) T ss_pred Ccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHhcCcccccchhccceeeeecCc--eeEEEcC Confidence 985 43 445577777777776766666666555555443 23446899999999999988776655 3799988 Q ss_pred Cc--ccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_011308. 87 DY--AIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGG 152 (154) Q Consensus 87 ~Y--A~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~ 152 (154) +| +...|||.- .+..| ..+|+|+++|+.+...+.+++.|++.|+. T Consensus 78 ~yqLtHLLE~GHA---kr~GG------------------RV~a~pHI~paee~~~~~l~~~i~~~l~~ 124 (124) T protein:vir:95 78 EYRLAHLLEYGHA---TVDGG------------------RVPGTPHIRPIEDWLEKEFEDRVEKAIKQ 124 (124) T ss_pred CCceeeeeeccee---ccCCc------------------ccCCccchhHHHHHHHHHHHHHHHHHhcC Confidence 99 999999952 22111 25899999999999999999999999999 No 79 >protein:vir:107703 Length: 147 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003902;genbank:gi:45686318;genbank:GeneID:2773043 Probab=99.23 E-value=1.3e-13 Score=91.18 Aligned_cols=107 Identities=17% Similarity=0.196 Sum_probs=81.7 Q ss_pred cch-hh-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEee---------------- Q lcl|NC_011308. 13 MAD-DI-KFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVD---------------- 74 (154) Q Consensus 13 Ma~-~v-~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~---------------- 74 (154) ||+ ++ .|..++.+..++++..++..++.+++++..+++. .+|||||+||+||.+.+. 
T Consensus 1 ma~~~~~~F~~~i~~~~~~ve~~~~~~~r~~a~~i~~~vv~-----~sPVdTGr~Ranw~vs~~~~~~~~~~~~dp~g~~ 75 (147) T protein:vir:10 1 MANYQIRRFQGEIDAWINAAESTLEHAIEIFVRDVHDALVS-----RSPVDTGRFKGNWQITFNEIPNHALNRYDKTGGV 75 (147) T ss_pred CCCcchhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----hCCCcchhhccccceeecCccccccCCcCCCccc Confidence 997 44 7888999999988888888888888877766643 689999999999965421 Q ss_pred ----------------cCceEEEEecCCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHH Q lcl|NC_011308. 75 ----------------KRKQEVKIGNSSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDE 138 (154) Q Consensus 75 ----------------~~~~~~~V~~~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~ 138 (154) +.+.++.|.++++||.++|||+ .+|.|..|.+-+++.. T Consensus 76 t~a~~~~~~~~~~~~~~~~~~iyi~Nn~pYA~~LEyG~--------------------------S~QAP~G~V~~t~q~~ 129 (147) T protein:vir:10 76 VRGEEQAKTYGMFSRGGAITSVHFSNMLIYANALEYGH--------------------------SQQAPSGVVGLVALRL 129 (147) T ss_pred hhhhhhHHHHHHhhhccCcceEEEeeCcchhhhhhccc--------------------------cCCCCchHHHHHHHHH Confidence 1233567889999999999997 3789999999888766 Q ss_pred HHHHHHHHHHHhhccC Q lcl|NC_011308. 139 KSKVKDYVIKVFGGLD 154 (154) Q Consensus 139 ~~~i~~~i~~~l~~l~ 154 (154) ..-+ .+++.++- T Consensus 130 ~~~v----~~~~~e~k 141 (147) T protein:vir:10 130 RSYM----ADAIKQAR 141 (147) T ss_pred HHHH----HHHHHHHH Confidence 5443 33333333 No 80 >protein:vir:79638 Length: 146 # NCBI annotation: gp40 # Family: family:all:448 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285529;genbank:gi:148734512;genbank:GeneID:5219996 Probab=99.22 E-value=1.5e-13 Score=90.89 Aligned_cols=111 Identities=14% Similarity=0.179 Sum_probs=84.9 Q ss_pred cchh--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeec------------C-- Q lcl|NC_011308. 13 MADD--IKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDK------------R-- 76 (154) Q Consensus 13 Ma~~--v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~------------~-- 76 (154) ||+. 
..|.+++.+..++.+..++..++++++++..+++. .+|||||+||.||.+.+.. + T Consensus 1 ma~~~~~sFa~~i~~~~~~ve~~~~~~~r~~a~~i~~~vv~-----~sPVDTGr~Ranw~vs~~~~~~~~~~~~dp~G~~ 75 (146) T protein:vir:79 1 MADYSIREFHGNVDKWIEQVESGLNDVIQIFGEKVHGALVD-----IAPVDTGRFKANMQITANKPPLYALNQYDPDGEK 75 (146) T ss_pred CCcchhHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----hCCCcchhhccccceeecCcccccccCCCCCCcc Confidence 9983 48999999999999888888888888877766643 6899999999999665311 0 Q ss_pred ------------------ceEEEEecCCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHH Q lcl|NC_011308. 77 ------------------KQEVKIGNSSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDE 138 (154) Q Consensus 77 ------------------~~~~~V~~~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~ 138 (154) +.++-|.++++||.++|||+ .+|.|..|.+.+++.. T Consensus 76 t~~~~~~~i~~~~~g~~~~~~iyi~NnlpYA~~LEyG~--------------------------S~QAP~G~v~~~~~~~ 129 (146) T protein:vir:79 76 IKAEGRRTLYALLHGGGAIKSIYFSNMLIYANALEYGH--------------------------SKQAPAGVFGIVAIRL 129 (146) T ss_pred cHHHHHHHHHHHHhcccccceeEEeeCchhhhhhhccc--------------------------cCCCcchHHHHHHHHH Confidence 12455779999999999996 4789999999999987 Q ss_pred HHHHHHHHHHHhhccC Q lcl|NC_011308. 139 KSKVKDYVIKVFGGLD 154 (154) Q Consensus 139 ~~~i~~~i~~~l~~l~ 154 (154) ..-+.+.+.+.=+.+- T Consensus 130 ~~~v~~a~~e~k~~~~ 145 (146) T protein:vir:79 130 RSYMAEAIREARKKNA 145 (146) T ss_pred HHHHHHHHHHHHhhcc Confidence 7666554444433333 No 81 >protein:vir:104347 Length: 145 # NCBI annotation: conserved phage-related protein # Family: family:all:448 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398975;genbank:gi:81343959;genbank:GeneID:3778879 Probab=99.20 E-value=1.4e-13 Score=91.10 Aligned_cols=114 Identities=15% Similarity=0.154 Sum_probs=82.0 Q ss_pred cCCccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeec------------C Q lcl|NC_011308. 
9 RGGHMADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDK------------R 76 (154) Q Consensus 9 ~~~~Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~------------~ 76 (154) =.=+|++...|..++.+..++++..++..++++++++..+++ ..+|||||+||+||.+.++. + T Consensus 1 ~~~~m~~~~sF~~~i~~~~~~ve~~~~~v~r~~a~~i~~~vv-----~~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~G 75 (145) T protein:vir:10 1 MARNIGSVVTFEKSIADWIDRAEDGFGIVVSNTVIKTANAIV-----DLSPVDTGRFKANWQISANSPAQQSLNEYDQTG 75 (145) T ss_pred CCCcccchhccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HhCCccchhhccccceeecccccccccccCCCC Confidence 011233335688889999999988888888888887776664 36899999999999664311 1 Q ss_pred -------------------ceEEEEecCCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHH Q lcl|NC_011308. 77 -------------------KQEVKIGNSSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRD 137 (154) Q Consensus 77 -------------------~~~~~V~~~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~ 137 (154) +.++-|.++++||.++|||+ .+|.|..|.+-+++. T Consensus 76 ~~t~~~~~~~~~~i~~~k~g~~iyi~Nn~pYA~~LEyG~--------------------------S~QAP~G~v~~~~~~ 129 (145) T protein:vir:10 76 GQTKTYLARQARAVANSKATSVIYITNRLDYAADLEYGA--------------------------SNQAPAGVLGVVQAR 129 (145) T ss_pred ccchhhHHHHHHHhhcccccceEEEeeCchhhhHhhccc--------------------------cCCCcchHHHHHHHH Confidence 12245779999999999997 479999999999999 Q ss_pred HHHHHHHHHHHHhhcc Q lcl|NC_011308. 
138 EKSKVKDYVIKVFGGL 153 (154) Q Consensus 138 ~~~~i~~~i~~~l~~l 153 (154) ....+.+.+.+.=+-| T Consensus 130 ~~~~v~~~~~e~k~~~ 145 (145) T protein:vir:10 130 LGRYFQEAVEEARRAI 145 (145) T ss_pred HHHHHHHHHHHhhccC Confidence 8755544443333333 No 82 >protein:vir:80116 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425608;genbank:gi:155042941;genbank:GeneID:5469542 Probab=99.19 E-value=5e-13 Score=88.02 Aligned_cols=118 Identities=19% Similarity=0.233 Sum_probs=88.1 Q ss_pred cchhhh---hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHhcCCccccccccceeEEeecCceEEEEecCC Q lcl|NC_011308. 13 MADDIK---FEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEG---GHGDSSNNVTGEYANKTDFEVDKRKQEVKIGNSS 86 (154) Q Consensus 13 Ma~~v~---~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~---~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~ 86 (154) ||+ |. +.+.+.+.+..+.+.+.+.|++++.+.+.+++.. ..+..+|++||.++.||+......+ .+|++.. T Consensus 1 M~~-i~id~La~~I~~~L~~y~~~v~~~v~~~v~evak~a~~~lkk~i~~tsPkrTG~YaK~W~~k~~~~~--~~v~nk~ 77 (127) T protein:vir:80 1 MAN-IKIDRLGDEITRQLKRYSQVIAGDLEQIMDDVSKEAVDRLKAKIEEEGLVQTGDYKRGWTRKRTPGG--WVIHNKT 77 (127) T ss_pred Ccc-ccHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCccccccccccceeeeccCc--eeEeecC Confidence 885 44 3345666676676777777777776665555432 2346899999999999987765554 4799988 Q ss_pred Cc--ccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_011308. 
87 DY--AIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGGLD 154 (154) Q Consensus 87 ~Y--A~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~l~ 154 (154) +| +...|||.- .+..| ..+|+|+++|+.+...+.+++.|++.|+.=- T Consensus 78 ~yqLtHLLE~GHA---kr~GG------------------RV~a~pHI~paee~~~~~l~~~i~~~l~~~~ 126 (127) T protein:vir:80 78 EYRLAHLLEYGHA---TVDGG------------------RVPETPHIRPVEDWLEKEFEDRVERAIKNES 126 (127) T ss_pred Ccceeehhhccee---ccCCc------------------ccCCccchhhHHHHHHHHHHHHHHHHhcCCC Confidence 99 999999952 22211 2589999999999999999999999998777 No 83 >protein:vir:105773 Length: 131 # NCBI annotation: gp14 # Family: family:all:10996 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224152;genbank:gi:62362227;genbank:GeneID:3342526 Probab=99.17 E-value=1.8e-13 Score=90.45 Aligned_cols=120 Identities=13% Similarity=0.121 Sum_probs=82.4 Q ss_pred hhhhhHHHHHH----HHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccce--eEEeecCceEEEEecCCC Q lcl|NC_011308. 15 DDIKFEMDMSK----IKDMFD-DTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKT--DFEVDKRKQEVKIGNSSD 87 (154) Q Consensus 15 ~~v~~~~~l~~----~~~~l~-~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI--~~~~~~~~~~~~V~~~~~ 87 (154) ++|+|+.+... +++.+. +++.+++.++....+. .|-..+|+||+.|-||. ++.+++.+++|+||.++. T Consensus 1 ikV~Gi~~~~~nl~~~i~~I~~~K~~Ral~~al~~~~~-----~AA~~TPIDTSTLiNSQfrei~~ngtritGRVGYSAn 75 (131) T protein:vir:10 1 MPVKGIKRIQMNTRRVLSDIAGIRTEKVLYLVMNAGAN-----HAAVITPVKSSTLINSQYKKLEPIPSGMIGRVGYTAN 75 (131) T ss_pred CCcchHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHh-----hhhhccccchhhhccccceeeeccCceeEEeecccee Confidence 56777765444 444443 3555666665544332 34456899999999998 466788889999999999 Q ss_pred cccccccCcccccc--CCCCcccccceecccCceeecCCCCCCc-hhHHHHHHH-HHHHHHHHHHHhhc Q lcl|NC_011308. 88 YAIYYEFGTGEKSE--KGGGRAGGWSYMDKNGKWHFTRGSKASK-RMRYTFRDE-KSKVKDYVIKVFGG 152 (154) Q Consensus 88 YA~yVE~GTg~~~~--~~~~~~~~~~~~~~~g~~~~t~g~~a~P-Fl~pA~~~~-~~~i~~~i~~~l~~ 152 (154) ||.|||.-.|..-. +|++++.-|. -.|.| ||..++++. 
...+..+|.++++- T Consensus 76 YA~yVHda~Gklkgqprp~gkgn~w~-------------p~ae~eFL~kgfe~~~~d~i~avik~e~k~ 131 (131) T protein:vir:10 76 YAAAVNAAKGKLKGKPRPDGSGNYWD-------------PNGEPDFLRKGFERDGLNEIKAIIRQGYKV 131 (131) T ss_pred eeeeeecCccccCCCcCCCCCcceec-------------CCCChhhhhhhhhccchHHHHHHHhhhcCC Confidence 99999997776532 2333333221 12334 999999886 55688889998888 No 84 >protein:vir:97190 Length: 148 # NCBI annotation: hypothetical protein ORF030 # Family: family:all:448 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294538;genbank:gi:149408259;genbank:GeneID:5237055 Probab=99.15 E-value=3.2e-13 Score=89.10 Aligned_cols=109 Identities=12% Similarity=0.088 Sum_probs=89.4 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeec----------------- Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDK----------------- 75 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~----------------- 75 (154) |++...|..++.+..++++..+...++++++++...++. .+|||||+||.||.+.+.. T Consensus 1 m~~~~sFa~~i~~~~~~ve~~~~~~~r~~a~~i~~~vv~-----~sPVdTGrfRanw~vs~~~p~~~~~~~~dp~~~G~~ 75 (148) T protein:vir:97 1 MPSLSEFSRRITLRGRKVAEGADALTRKVALAADQAVVS-----GTPVDTGRARSNWIAAIGSAPSSVIDAYSPGEAGST 75 (148) T ss_pred CCccchhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----hCCCcchhhhhhhheeecccccccccccCCCCCCcc Confidence 999889999999999999999988888888877766643 5899999999999655321 Q ss_pred --------------------CceEEEEecCCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHH Q lcl|NC_011308. 
76 --------------------RKQEVKIGNSSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTF 135 (154) Q Consensus 76 --------------------~~~~~~V~~~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~ 135 (154) -+.++-|.++++||.+.|||+ .+|.|..|.+-++ T Consensus 76 ~~~~~~~~i~~~~~vi~~~k~g~~iyi~NnlpYA~~LEyG~--------------------------S~QAP~G~v~~t~ 129 (148) T protein:vir:97 76 EAANTQAAIDQAESVIRGYNYGEEIHITNNLPYIQRLNDGY--------------------------SAQAPANFVEQAV 129 (148) T ss_pred cccchhHHHHHHHHHhhccCCCceEEEeecchhhhHhhccc--------------------------cCCCcchHHHHHH Confidence 013456889999999999996 4799999999999 Q ss_pred HHHHHHHHHHHHHHhhccC Q lcl|NC_011308. 136 RDEKSKVKDYVIKVFGGLD 154 (154) Q Consensus 136 ~~~~~~i~~~i~~~l~~l~ 154 (154) +....-+++ .++++|.. T Consensus 130 ~~~~~~v~~--~~~~~~~~ 146 (148) T protein:vir:97 130 LEAVQVVQF--GRVVDGDP 146 (148) T ss_pred HHHHHHHHh--hhhhcCCC Confidence 988877765 67777777 No 85 >protein:vir:94994 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224022;genbank:gi:62327309;genbank:GeneID:5176822 Probab=99.12 E-value=4.9e-13 Score=88.06 Aligned_cols=103 Identities=16% Similarity=0.189 Sum_probs=78.3 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEee------------------ Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVD------------------ 74 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~------------------ 74 (154) |+ |..++.+..+++++.++..++++++++..+++ ..+|||||+||+||.+.+. 
T Consensus 1 ms----F~~~i~~~~~~ve~~~~~~~r~~a~~~~~~iv-----~~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~g~~t~ 71 (131) T protein:vir:94 1 MS----FALDVTRFVEKAKKNPEKVIRQVSIKLFSAII-----KASPVDTGRFRMNWMASGSTPADGTTDATDKSGNTAT 71 (131) T ss_pred CC----cccCHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HhCCCchhhhhccchhccccccccccCCCCCCchhhH Confidence 65 67778888888888888888888887776665 3679999999999965431 Q ss_pred ----------cCceEEEEecCCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHH Q lcl|NC_011308. 75 ----------KRKQEVKIGNSSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKD 144 (154) Q Consensus 75 ----------~~~~~~~V~~~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~ 144 (154) ..+.++.|.++++||.++|||+ .+|.|..|.+-+++..... T Consensus 72 ~~~~~~i~~~~~g~~iyi~Nn~pYA~~LEyG~--------------------------S~QAP~g~v~~~~~~~~~~--- 122 (131) T protein:vir:94 72 GNATSFVLNAADWHTFTLTNNLPYAQRLEYGW--------------------------SQQAPQGFVRVNVSRFQQL--- 122 (131) T ss_pred HHHHHHHhhccccceEEEeeCchhhhhhhccc--------------------------cCCCcchHHHHHHHHHHHH--- Confidence 1234567999999999999997 4789999999998876655 Q ss_pred HHHHHhhccC Q lcl|NC_011308. 145 YVIKVFGGLD 154 (154) Q Consensus 145 ~i~~~l~~l~ 154 (154) ++++.+++- T Consensus 123 -v~~~~~e~k 131 (131) T protein:vir:94 123 -LNEEASKVK 131 (131) T ss_pred -HHHHHHhcC Confidence 444444444 No 86 >protein:vir:78380 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110844;genbank:gi:134288605;genbank:GeneID:5179643 Probab=99.06 E-value=1.5e-12 Score=85.47 Aligned_cols=103 Identities=17% Similarity=0.193 Sum_probs=78.1 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeec----------------- Q lcl|NC_011308. 
13 MADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDK----------------- 75 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~----------------- 75 (154) |+ |..++.+..++.+..++..++++++++..+++ ..+|||||+||+||.+.++. T Consensus 1 ms----f~~~i~~~~~~ve~~~~~~~r~~a~~~~~~iv-----~~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~g~~t~ 71 (131) T protein:vir:78 1 MS----FALDVSKFVEKAKKNPEKVIRQVSIKLFSAII-----KASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTAT 71 (131) T ss_pred CC----cCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HhCCCchhhhccccceecccccccccCCCCCCchhhH Confidence 65 67778888888888888888888887776665 36899999999999665421 Q ss_pred -----------CceEEEEecCCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHH Q lcl|NC_011308. 76 -----------RKQEVKIGNSSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKD 144 (154) Q Consensus 76 -----------~~~~~~V~~~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~ 144 (154) .+.++.|.++++||.++|||+ .+|.|..|.+-+++..... T Consensus 72 ~~~~~~i~~~~~g~~iyi~Nn~pYA~~LEyG~--------------------------S~QAP~G~v~~~~~~~~~~--- 122 (131) T protein:vir:78 72 SNAANFVLNAADWHTFTLTNNLPYAQRLEYGW--------------------------SQQAPQGFVRVNVSRFQQL--- 122 (131) T ss_pred HHHHHHHhhccCCceEEEeeCchhhhHhhccc--------------------------cCCCcchHHHHHHHHHHHH--- Confidence 124566899999999999997 3789999999998876655 Q ss_pred HHHHHhhccC Q lcl|NC_011308. 145 YVIKVFGGLD 154 (154) Q Consensus 145 ~i~~~l~~l~ 154 (154) ++++.+++- T Consensus 123 -v~~~~~e~k 131 (131) T protein:vir:78 123 -LNEEASKVK 131 (131) T ss_pred -HHHHHHhcC Confidence 444444444 No 87 >protein:vir:10367 Length: 119 # NCBI annotation: conserved phage protein # Family: family:all:2714 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858959;genbank:gi:32128424;genbank:GeneID:2648366 Probab=99.06 E-value=2.3e-13 Score=89.86 Aligned_cols=96 Identities=16% Similarity=0.207 Sum_probs=68.5 Q ss_pred HHHHHHhcCCccccccccceeEEee-----cCceEEEEecC---CCcccccccCccccccCCCCcccccceecccCceee Q lcl|NC_011308. 
50 AEGGHGDSSNNVTGEYANKTDFEVD-----KRKQEVKIGNS---SDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHF 121 (154) Q Consensus 50 ~~~~ak~~aPvdTG~Lr~SI~~~~~-----~~~~~~~V~~~---~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~ 121 (154) ++++++..+|++||.|++||..-+. .+.-++.|+-| ++|...||||. +.... ....++|.|+. T Consensus 1 ~rDeakarv~~~~G~Lr~sIY~ay~~~~S~dG~~~Y~Vswn~rkAPhghlvE~Gh---w~~~~------~~~~~dG~w~~ 71 (119) T protein:vir:10 1 MRESAKAFVNDETGKLRSNLYVAYSTEESTNGVQTYAVSWRKKAAPHGHLLEFGH---WQTHA------AYKGKDGEWYS 71 (119) T ss_pred CCcccccccCCCccchhhhheeeeccccCCCCEEEEEeecCCCcCCcccccccce---eeeee------eeeccCceeee Confidence 7788899999999999999965432 12345567554 67888999992 22211 12334566654 Q ss_pred -------cCCCCCCchhHHHHHHHHHHHHHHHHHH----hhccC Q lcl|NC_011308. 122 -------TRGSKASKRMRYTFRDEKSKVKDYVIKV----FGGLD 154 (154) Q Consensus 122 -------t~g~~a~PFl~pA~~~~~~~i~~~i~~~----l~~l~ 154 (154) ++.+||+|||+|||+....+++.++.+. +.||- T Consensus 72 ~~~~l~~~~~vPa~pFlRpA~da~~~~a~~~~~~r~~~rv~Ev~ 115 (119) T protein:vir:10 72 SSVKLVNPKWIPARPFLRPGYDSVAMQIPDIAKAAGAKKYAELQ 115 (119) T ss_pred cCccccCceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHh Confidence 5689999999999999999888887777 55555 No 88 >protein:vir:3163 Length: 145 # NCBI annotation: unknown # Family: family:all:28417 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665934;genbank:gi:22091120;genbank:GeneID:951270 Probab=99.05 E-value=4.7e-13 Score=88.16 Aligned_cols=115 Identities=14% Similarity=0.112 Sum_probs=75.3 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc----------------------CCccccccccceeEEe Q lcl|NC_011308. 16 DIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDS----------------------SNNVTGEYANKTDFEV 73 (154) Q Consensus 16 ~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~----------------------aPvdTG~Lr~SI~~~~ 73 (154) .|+...++.+++..+.+.+...+.+++..+..++.++..... 
.-++||.|++||..++ T Consensus 1 ~i~~~~~i~~~l~~l~~~~~~~l~~i~~~~~~~~~~rf~~~~~p~G~~W~pLs~st~a~k~~~~~L~~tG~L~~Si~~~~ 80 (145) T protein:vir:31 1 MVEDENNIPEAREAIQDGLTDGLERLHTITLRELITNMSDGQDALGNPWEPLKESTIRAKGSDTPLIDNSRLLTDINAAS 80 (145) T ss_pred CcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccChHHHHHhcCCCCCccCHHHHHHHHHHh Confidence 355666677777766665555555544433333322211111 1259999999998765 Q ss_pred e--cCceEEEEecCCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHH----HHHHH Q lcl|NC_011308. 74 D--KRKQEVKIGNSSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKV----KDYVI 147 (154) Q Consensus 74 ~--~~~~~~~V~~~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i----~~~i~ 147 (154) . .++..+.||+|.+||.+.+||+. ...+||||||-++....+.++ .+.+. T Consensus 81 ~~~~~~~~a~vGtn~~YA~~hqfG~~------------------------~~~IPaRPfLG~~~~~~~~~~~~ii~~~i~ 136 (145) T protein:vir:31 81 MMDRANRMAVIGTNLDYAEHHEFGAP------------------------EAGIPARPIFGPAGAYASQQAPDVIGDEID 136 (145) T ss_pred hhcccCceeEecCCchhhhhhccCCc------------------------ccccCCCCccCCCccchHHHHHHHHHHHHH Confidence 3 34567899999999999999972 024899999999876655544 45556 Q ss_pred HHhhccC Q lcl|NC_011308. 148 KVFGGLD 154 (154) Q Consensus 148 ~~l~~l~ 154 (154) ..|+++. T Consensus 137 ~~L~~~~ 143 (145) T protein:vir:31 137 TNLEGAV 143 (145) T ss_pred HHhhhhc Confidence 6677766 No 89 >protein:vir:95157 Length: 144 # NCBI annotation: hypothetical protein ORF019 # Family: family:all:448 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293426;genbank:gi:148912847;genbank:GeneID:5228232 Probab=99.01 E-value=2.4e-12 Score=84.24 Aligned_cols=106 Identities=14% Similarity=0.092 Sum_probs=80.3 Q ss_pred cchh-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeec---------------- Q lcl|NC_011308. 13 MADD-IKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDK---------------- 75 (154) Q Consensus 13 Ma~~-v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~---------------- 75 (154) ||+. 
..|...+.+..+.++..+...++++++++...++ ..+|||||++|.||.+.+.. T Consensus 1 MA~~~~~f~~~i~~~~~~ve~~~~~~~r~~a~~v~~~vv-----~~sPVDTGrfRanw~vs~~~p~~~~~~~~~~~~~~~ 75 (144) T protein:vir:95 1 MAKSLLDLADRLEKKAKAIDEAASQNAVDTALAIVGDLA-----YKTPVDTSQALSNWIVTLESPSGQQIKPHFPGSQGS 75 (144) T ss_pred CchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HhCCccchhhccccceeccccccccccccccccccc Confidence 9975 3688888888888888888888887776665554 35899999999999766431 Q ss_pred ---------------------CceEEEEecCCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHH Q lcl|NC_011308. 76 ---------------------RKQEVKIGNSSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYT 134 (154) Q Consensus 76 ---------------------~~~~~~V~~~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA 134 (154) .+.++.|.++++||.+.|||+ .+|.|..|.+-+ T Consensus 76 t~d~sg~~tl~~~~~vi~~~~~g~~iyi~NnlpYA~~LEyG~--------------------------S~QAP~G~vr~~ 129 (144) T protein:vir:95 76 TQRASAAETLNSAKLVLRNKKPGQAIFITNNLPYIRRLNDGY--------------------------SAQAPAGFVERA 129 (144) T ss_pred cCCCchhHHHHHHHHHHhhcCccceEEEeeCchhhhhhhccc--------------------------cCCCcchHHHHH Confidence 113455889999999999997 378999999999 Q ss_pred HHHHHHHHHHHHHHHhhccC Q lcl|NC_011308. 135 FRDEKSKVKDYVIKVFGGLD 154 (154) Q Consensus 135 ~~~~~~~i~~~i~~~l~~l~ 154 (154) ++....-+++ ++=+| T Consensus 130 ~q~~~~~v~~-----~~~~~ 144 (144) T protein:vir:95 130 VLIGRKMRKK-----FKIKD 144 (144) T ss_pred HHHHHHHHHh-----hccCC Confidence 9887765443 34444 No 90 >protein:vir:81067 Length: 119 # NCBI annotation: p12 # Family: family:all:2714 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285682;genbank:gi:156535145;genbank:GeneID:5247112 Probab=99.00 E-value=6e-13 Score=87.57 Aligned_cols=96 Identities=16% Similarity=0.205 Sum_probs=67.6 Q ss_pred HHHHHHhcCCccccccccceeEEee-----cCceEEEEecC---CCcccccccCccccccCCCCcccccceecccCceee Q lcl|NC_011308. 
50 AEGGHGDSSNNVTGEYANKTDFEVD-----KRKQEVKIGNS---SDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHF 121 (154) Q Consensus 50 ~~~~ak~~aPvdTG~Lr~SI~~~~~-----~~~~~~~V~~~---~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~ 121 (154) ++++++..+|++||.|++||..-+. .+.-++.|+-| ++|...||||. +..... ...++|.|+. T Consensus 1 ~rDeakarv~~~~G~Lr~sIY~ay~~~~S~dG~~~Y~Vswn~rkAPhghlvE~Gh---w~~~~~------~~~~dG~w~~ 71 (119) T protein:vir:81 1 MRESAKAFVNDETGKLRSNLYVAYSPEESTNGVQTYAVSWRKKAAPHGHLLEFGH---WQTHAA------YKGKDGEWYS 71 (119) T ss_pred CCcccccccCCCccchhhhheeeeccccCCCCeEEEEeeccCCcCCcccccccce---eeeeee------eeccCceeee Confidence 7788899999999999999965432 12345567554 67888999992 222111 2234455542 Q ss_pred -------cCCCCCCchhHHHHHHHHHHHHHHHHHH----hhccC Q lcl|NC_011308. 122 -------TRGSKASKRMRYTFRDEKSKVKDYVIKV----FGGLD 154 (154) Q Consensus 122 -------t~g~~a~PFl~pA~~~~~~~i~~~i~~~----l~~l~ 154 (154) ++.+||+|||+|||+....+++.++.+. +.||- T Consensus 72 ~~~~l~~~~~vPa~pFlRpA~da~~~~a~~~~~~r~~~rv~Ev~ 115 (119) T protein:vir:81 72 SSVKLVNPKWIPARPFLRPGYDSVAMQIPDIAKAAGAKKYAELQ 115 (119) T ss_pred cCccccCceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHh Confidence 4678999999999999999888887777 55555 No 91 >protein:vir:80425 Length: 134 # NCBI annotation: BcepGomrgp15 # Family: family:all:448 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210235;genbank:gi:146329927;genbank:GeneID:5123534 Probab=98.98 E-value=4.8e-12 Score=82.63 Aligned_cols=102 Identities=12% Similarity=0.215 Sum_probs=78.4 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeec-----------C----- Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDK-----------R----- 76 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~-----------~----- 76 (154) |+ |..++.+..++++..++..++++++++..+++. .+|||||+||+||.+.+.. 
+ T Consensus 1 ms----F~~~i~~~~~~ve~~~~~~~r~~a~~~~~~vv~-----~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~g~~~~ 71 (134) T protein:vir:80 1 MS----YTDRFNVIAKGIEDNVDNLVKNVALAIGSNVIA-----DTPILTGQARRNWQTELNQMPESVLDIPESPSEGMD 71 (134) T ss_pred CC----cccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----hCCCcchhhhcccceeecCcccccccCcCCCCccch Confidence 55 778888888888888888888888877776653 5899999999999655421 0 Q ss_pred ---------------ceEEEEecCCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHH Q lcl|NC_011308. 77 ---------------KQEVKIGNSSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSK 141 (154) Q Consensus 77 ---------------~~~~~V~~~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~ 141 (154) +.++-|.++++||.++|||+ .+|.|..|.+-+++..... T Consensus 72 ~~~~~~~~vi~~~k~g~~iyi~Nn~pYA~~LEyG~--------------------------S~QAP~G~v~~t~~~~~~~ 125 (134) T protein:vir:80 72 EALQVLQQTVGQYKAGDTVHITNNAPYIKELNSGS--------------------------SQQAPANFVETSIMRATRL 125 (134) T ss_pred hhHHHHHHHHhhccCcceEEEeeCchhhhhhhccc--------------------------cCCCcchHHHHHHHHHHHH Confidence 13355789999999999997 4799999999888876655 Q ss_pred HHHHHHHHhhccC Q lcl|NC_011308. 142 VKDYVIKVFGGLD 154 (154) Q Consensus 142 i~~~i~~~l~~l~ 154 (154) +++ ++.|. T Consensus 126 v~~-----~~~~~ 133 (134) T protein:vir:80 126 IRN-----VKVVP 133 (134) T ss_pred HHh-----hccCC Confidence 443 55555 No 92 >protein:vir:100887 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358767;genbank:gi:77999993;genbank:GeneID:3726158 Probab=98.93 E-value=1.5e-11 Score=79.88 Aligned_cols=112 Identities=11% Similarity=0.102 Sum_probs=80.1 Q ss_pred hhhHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHhcCCcc----------ccccccceeEEe-ecCc---eEE Q lcl|NC_011308. 17 IKFEMDMSKIKDMFDDTAE--KALKQIGEHMKTEIAEGGHGDSSNNV----------TGEYANKTDFEV-DKRK---QEV 80 (154) Q Consensus 17 v~~~~~l~~~~~~l~~~~~--~~v~~a~~~~~~~i~~~~ak~~aPvd----------TG~Lr~SI~~~~-~~~~---~~~ 80 (154) |++...|..+++.|.+... 
...++.+..+++++.+...+..+|.. .+.|+++|.++- +.++ -.. T Consensus 1 v~~~~~lee~l~~i~kl~~~~~~~~~ki~kaGA~v~~e~L~~~tp~~~~~~~~~~~~~~HlaD~I~~s~~~~dg~~~g~~ 80 (139) T protein:vir:10 1 MDMDEALGQWLKQVSKAAELSISDQEKITKAGADVYAKKLAETTKEKHPNTKGDGGKYGHLSEDIRSAAGDIDGDHNGSS 80 (139) T ss_pred CCHHHHHHHHHHHHHHhhccCHHHHHHHHHHHHHHHHHHHHHhcccccCcCCCCCCCCcchhhcceecCcccccccceee Confidence 4555556666666655542 33445566777788887788888862 367999997642 1111 223 Q ss_pred EEecCC--CcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_011308. 81 KIGNSS--DYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGGLD 154 (154) Q Consensus 81 ~V~~~~--~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~l~ 154 (154) .||-+. -+|.|+|+|| ..|||+||+.++.++.++++.+.+.++++++= T Consensus 81 ~VG~~k~~~~A~f~n~GT--------------------------~k~~~~hFie~t~~e~~~evl~a~~~~~k~~l 130 (139) T protein:vir:10 81 TVGFHNKAHIARFLNDGT--------------------------KYIRADHFVDNARDDAKDAVFAAEAEKYQAMI 130 (139) T ss_pred eeCCCCCcceEeecccCc--------------------------cccCCCchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 566654 3789999998 47999999999999999999999999997765 No 93 >protein:vir:96288 Length: 100 # NCBI annotation: ORF049 # Family: family:all:180 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240315;genbank:gi:66396010;genbank:GeneID:5133365 Probab=98.92 E-value=3.8e-12 Score=83.20 Aligned_cols=99 Identities=19% Similarity=0.015 Sum_probs=75.0 Q ss_pred CCcceeeecCCccchhhh-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCceE Q lcl|NC_011308. 
1 MRSRLLIDRGGHMADDIK-FEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRKQE 79 (154) Q Consensus 1 ~~~~~~~~~~~~Ma~~v~-~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~ 79 (154) |.-...-=---||| +|+ |.++|.+.++++++.+.+++++.+.+++ +.+.+.|..++|||||+|++||.+.+..++++ T Consensus 1 ~~~~~~~~~~~~ma-kvkyG~~dmvk~~~~f~~~i~~~vk~~IakTa-~~I~~~Avs~APVD~G~Lk~SI~~dyk~GGlt 78 (100) T protein:vir:96 1 MKLNYYDLSRCHMA-KVKYGADSMVVELDKFDKKIEEWVKKGIAKTT-TKIYNTAVALAPVDLGFLEESIDFKYFDGGLS 78 (100) T ss_pred Ccccccccchhhhh-hheechHHHHHHHhcchHHHHHHHHHHHHHHH-HHHHhhHHhhccccccccceeeeeeeecCCee Confidence 43322222235998 465 5678999999999999999999998876 45678999999999999999999999999999 Q ss_pred EEEecCCCcccccccCcccccc Q lcl|NC_011308. 80 VKIGNSSDYAIYYEFGTGEKSE 101 (154) Q Consensus 80 ~~V~~~~~YA~yVE~GTg~~~~ 101 (154) +.|..+++||+--=---=...+ T Consensus 79 avI~vGAeYAIkrmsqllvtvi 100 (100) T protein:vir:96 79 SVISVGADYAIKRMSQLLVTVI 100 (100) T ss_pred EEEecchhHHHHHHHHHHhhcC Confidence 9999999999811000000000 No 94 >protein:vir:94944 Length: 121 # NCBI annotation: hypothetical protein phage protein # Family: family:all:448 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239282;genbank:gi:66392064;genbank:GeneID:5076589 Probab=98.91 E-value=4.3e-12 Score=82.88 Aligned_cols=95 Identities=19% Similarity=0.309 Sum_probs=76.6 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeec----------------- Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDK----------------- 75 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~----------------- 75 (154) |+ ...|..++.+..+++++.++..+++++++....++ ..+|||||++|.||.+.+.. 
T Consensus 1 ~~-~~sf~~~i~~~~~~ve~~~~~~~r~~~~~~~~~vv-----~~sPVdtGrfRanw~vs~~~p~~~~~~~~dp~g~~t~ 74 (121) T protein:vir:94 1 MI-SMKFNVNLSRLRSNLREEAKKKAIRIAQEIVNGVI-----ARSPVLAGDYRSSWNVSEGSMEFKFNNGGNPANPTPA 74 (121) T ss_pred Cc-cchhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HhcCCchhhhhccccccccCcccccCCCCCCCcchhH Confidence 65 47899999999999988888888888877666554 46899999999999664321 Q ss_pred ---------CceEEEEecCCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHH Q lcl|NC_011308. 76 ---------RKQEVKIGNSSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEK 139 (154) Q Consensus 76 ---------~~~~~~V~~~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~ 139 (154) .+.++-|.++++||.+.|+|+ .+|.|..|.+.++...+ T Consensus 75 ~~~~~~~~~~~~~iyi~NnlpYA~~LE~G~--------------------------S~QAP~G~v~~t~~~~q 121 (121) T protein:vir:94 75 PAIVVSSNVALPHFYITNGAPYAQQLEKGS--------------------------STQAPLGIVRVTLASLR 121 (121) T ss_pred HHHHHHHhhccceEEEeeCcchhhhhhccc--------------------------CCCCcchHHHHHHHhhC Confidence 123456889999999999997 48999999999999888 No 95 >protein:vir:4956 Length: 153 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049932;genbank:gi:9632903;genbank:GeneID:1262079 Probab=98.78 E-value=6.6e-11 Score=76.41 Aligned_cols=115 Identities=14% Similarity=0.034 Sum_probs=76.5 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc---------cccccccceeEEe-ecCc---eE Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNN---------VTGEYANKTDFEV-DKRK---QE 79 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPv---------dTG~Lr~SI~~~~-~~~~---~~ 79 (154) |++--++..++.+.+.+|.+.. ...++.+..+++++.+...+..+|. 
..|.|++||.++- +.++ -+ T Consensus 1 M~~~~~glee~~~~lekL~~~~-~~~~~katkAGA~v~~e~L~~~tp~~h~~~~kt~~~~HlaD~I~~s~~~idG~~dG~ 79 (153) T protein:vir:49 1 MTGLDEALEGWLKTVASIGDLT-PAEQAKITTAGAKVFKEELAEVTREKHYSKKKDLKYGHMADGLAVQSTNADGRKNGV 79 (153) T ss_pred CccHHHHHHHHHHHHHHhccCC-HHHHHHHHHHHHHHHHHHHHHhccccCCCCCCCCCCCcccccceeccccccccccce Confidence 9863345555655555554322 3445556667888888777777775 2358999997642 1111 13 Q ss_pred EEEecCCC----cccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHH--HHHHHHHHHHHhhcc Q lcl|NC_011308. 80 VKIGNSSD----YAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDE--KSKVKDYVIKVFGGL 153 (154) Q Consensus 80 ~~V~~~~~----YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~--~~~i~~~i~~~l~~l 153 (154) ..||-+.+ ||.|+|+|| ..|||+||+.++.++. +..+.+.+.++++++ T Consensus 80 s~VG~~~~~~a~~a~f~n~GT--------------------------~km~~~hFie~tr~e~~~k~~vl~A~~~~~~~i 133 (153) T protein:vir:49 80 STVGWKNNYHAQNARRLNDGT--------------------------KKYRADHFITNVQNDSTVKNKVLLAEKEEYEKL 133 (153) T ss_pred eeecccCCccceeeeecccCc--------------------------ccCCCChhhHHHHHHhhHHHHHHHHHHHHHHHH Confidence 46776533 589999998 5799999999999886 566776666666665 Q ss_pred C Q lcl|NC_011308. 154 D 154 (154) Q Consensus 154 ~ 154 (154) = T Consensus 134 l 134 (153) T protein:vir:49 134 I 134 (153) T ss_pred H Confidence 4 No 96 >protein:vir:96774 Length: 152 # NCBI annotation: hypothetical phage protein # Family: family:all:448 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224253;genbank:gi:62362388;genbank:GeneID:3345713 Probab=98.76 E-value=1.2e-10 Score=75.04 Aligned_cols=112 Identities=18% Similarity=0.159 Sum_probs=84.0 Q ss_pred CCcceeeecCCccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc--------------cccccc Q lcl|NC_011308. 
1 MRSRLLIDRGGHMADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNN--------------VTGEYA 66 (154) Q Consensus 1 ~~~~~~~~~~~~Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPv--------------dTG~Lr 66 (154) |-|- |-.|..|+ |..++.+..++.+..++..++.+++++...++ ..+|| |||++| T Consensus 1 ~~~~--~~~~~~ms----Faa~i~~~~~~~e~~~~~~~R~~~~~i~~~vv-----~~sPVg~~~~~~~~a~~~ydtGrfR 69 (152) T protein:vir:96 1 MLSC--ICGGNPMS----WSKSLKNIIVKNENLTEKQLRAGLFDAANTVI-----LGSPVGAPELWQQPAPNYYRAGSYR 69 (152) T ss_pred Ccce--eeCCCccc----ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----Hhhccccccccccccccccchhhhh Confidence 5553 34555555 77788888888888888888888777666554 34799 999999 Q ss_pred cceeEEeec--------------------------CceEEEEecCCCcccccccCccccccCCCCcccccceecccCcee Q lcl|NC_011308. 67 NKTDFEVDK--------------------------RKQEVKIGNSSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWH 120 (154) Q Consensus 67 ~SI~~~~~~--------------------------~~~~~~V~~~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~ 120 (154) .||.+++.. -+.++.|.++++||...|||+ T Consensus 70 anw~vS~~~p~~~~~~~~~~~~t~~~~~~~i~~~~~g~~iyi~NnlPYA~~LEyG~------------------------ 125 (152) T protein:vir:96 70 SNHRVSISKITSFEKGISSQSSIMMDLQSDIAKFKIGETLFMTNPLPYATSIEYGH------------------------ 125 (152) T ss_pred hhheeeecCCCcccccCCCCCchHHHHHHHHhhccccceEEEeeCchhhhHhhccc------------------------ Confidence 999765421 124557889999999999996 Q ss_pred ecCCCCCCchhHHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 121 FTRGSKASKRMRYTFRDEKSKVKDYVIKV 149 (154) Q Consensus 121 ~t~g~~a~PFl~pA~~~~~~~i~~~i~~~ 149 (154) .+|.|.-|.+.+++....-+.+.++.. 
T Consensus 126 --S~QAP~G~vr~t~~~~~~~v~ea~~~~ 152 (152) T protein:vir:96 126 --SSQAPNGVYRPAVRRLVKFLNTELKAK 152 (152) T ss_pred --cCCCCchHHHHHHHHHHHHHHHHhccC Confidence 478999999999998777766665555 No 97 >protein:vir:100223 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025034;genbank:gi:48697267;genbank:GeneID:2948321 Probab=98.71 E-value=2.2e-10 Score=73.55 Aligned_cols=112 Identities=10% Similarity=0.113 Sum_probs=79.6 Q ss_pred hhhHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHhcCCc-------cc---cccccceeEEe-ecCc---eEE Q lcl|NC_011308. 17 IKFEMDMSKIKDMFDDTAE--KALKQIGEHMKTEIAEGGHGDSSNN-------VT---GEYANKTDFEV-DKRK---QEV 80 (154) Q Consensus 17 v~~~~~l~~~~~~l~~~~~--~~v~~a~~~~~~~i~~~~ak~~aPv-------dT---G~Lr~SI~~~~-~~~~---~~~ 80 (154) |.|...|..+++.|++... ...++.+..+++++.+...+..+|. ++ +.|+++|.+.. +.++ -++ T Consensus 1 ~~~~~~l~e~l~~lekl~~~~~~~~~k~tkaGA~v~~~~L~~~tp~~~~~~~~~~~~~~HlaD~I~~~~~~idg~~~g~~ 80 (139) T protein:vir:10 1 MDMDEALGQWLKQVSKAAQLSVSDQEKITKAGADVYAKELAETTKEKHPNTKGDGGKYGHLSEDISSAAGDIDGDHNGSS 80 (139) T ss_pred CCHHHHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHhcccccccCCCCCCCCCcccccceecCccccccccccc Confidence 4555556666666665542 3344566777888888888888884 23 46999997653 1111 124 Q ss_pred EEecCC--CcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_011308. 81 KIGNSS--DYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGGLD 154 (154) Q Consensus 81 ~V~~~~--~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~l~ 154 (154) .||-+. 
-.|.|+|+|| ..|||+||+..+.++.++++.+.+.++++++= T Consensus 81 ~VG~~~~~~~Ahf~n~GT--------------------------~~~~~~hFie~t~~e~~~ev~~a~~~~~ke~l 130 (139) T protein:vir:10 81 TVGFHNKAHIARFLNDGT--------------------------KNIRADHFVDNARDDAKDAVFAAEAEKYQAMI 130 (139) T ss_pred eeCCCCCceeeeeeccCc--------------------------cccCCCchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 566543 3478999998 47999999999999999999999999998775 No 98 >protein:vir:5000 Length: 141 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049974;genbank:gi:9632946;genbank:GeneID:1262109 Probab=98.66 E-value=2.3e-10 Score=73.38 Aligned_cols=115 Identities=11% Similarity=-0.022 Sum_probs=76.6 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc---------cccccccceeEEee-cCc---eE Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNN---------VTGEYANKTDFEVD-KRK---QE 79 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPv---------dTG~Lr~SI~~~~~-~~~---~~ 79 (154) |++--++..++.+.+.+|.+. ....++.+..+++++.+...+..+|. ..+.|++||.++-. .++ -. T Consensus 1 M~~~~~gl~e~~~~lekl~~~-~~~~~~katkAGA~v~~~~L~~~tp~~hy~~~~~~~~~HlaD~I~~~~~~~DG~~dg~ 79 (141) T protein:vir:50 1 MVGLAEALDEWLKTVASIGNL-TPAEQVEITTAGAKVFKKELEEVTREKHYSRKKNPKFGHMADGLAIQSTNADGRKNGV 79 (141) T ss_pred CccHHHHHHHHHHHHHHhcCC-CHHHHHHHHHHHHHHHHHHHHHhcccCCCCCCCCCCCCccccceeeccCccccccCCe Confidence 986435555555555555422 23345555667888888778888874 46699999977421 111 13 Q ss_pred EEEecCCC----cccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHH--HHHHHHHHHHHhhcc Q lcl|NC_011308. 
80 VKIGNSSD----YAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDE--KSKVKDYVIKVFGGL 153 (154) Q Consensus 80 ~~V~~~~~----YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~--~~~i~~~i~~~l~~l 153 (154) ..||-+.+ +|.|+|+|| ..|||+||+.++.+++ ++.|.+.+.+.++++ T Consensus 80 s~VG~~~~~~~~~A~f~n~GT--------------------------~k~~~~hFve~~~~~a~~k~~Vl~A~~~~~k~~ 133 (141) T protein:vir:50 80 STVGWKNNYHAQNARRLNDGT--------------------------KKYRADHFVTNVQNDSTVQKKVLLEKKRNTKNS 133 (141) T ss_pred eeeccCCCccceeeeccccCc--------------------------cccCCCchhHHHHHhhhhHHHHHHHHHHHHHHH Confidence 36775444 688999998 5799999999999875 667777777776644 Q ss_pred C Q lcl|NC_011308. 154 D 154 (154) Q Consensus 154 ~ 154 (154) = T Consensus 134 l 134 (141) T protein:vir:50 134 L 134 (141) T ss_pred H Confidence 3 No 99 >protein:vir:1988 Length: 156 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050635;genbank:gi:9633522;genbank:GeneID:2636282 Probab=98.66 E-value=9e-11 Score=75.65 Aligned_cols=119 Identities=13% Similarity=0.179 Sum_probs=63.5 Q ss_pred cchhhhhHHH---HHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHhcCC---------------------------- Q lcl|NC_011308. 13 MADDIKFEMD---MSKIKDMFDDTAE--KALKQIGEHMKTEIAEGGHGDSSN---------------------------- 59 (154) Q Consensus 13 Ma~~v~~~~~---l~~~~~~l~~~~~--~~v~~a~~~~~~~i~~~~ak~~aP---------------------------- 59 (154) |+..|++..+ +.+.+..|....+ ..+..++..+.....++......| T Consensus 1 ms~~i~~~~d~~~l~~~L~~l~~~~~~~~l~~~Ig~~l~~~~~~rf~~~~~Pd~G~~W~pls~~t~~~r~~~~~~~~~~L 80 (156) T protein:vir:19 1 MSLDMNVAVDVRRIQLALDELGTVTRDRAIPRVMAAALLSSTEQAFERQADPDTGKGWEAWSDSWLAWRQDHGFVPGSIL 80 (156) T ss_pred CeEEEEEeecHHHHHHHHHHHHhhhccHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCCcccChHHHHHhhccCCCCCcch Confidence 8876555433 3333433322211 112222222222222222111111 Q ss_pred ccccccccceeEEeecCceEEEEecCCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHH Q lcl|NC_011308. 
60 NVTGEYANKTDFEVDKRKQEVKIGNSSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEK 139 (154) Q Consensus 60 vdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~ 139 (154) .+||+|++||.+.+..+ .+.||+|..||.+.+||+..-. ++ ....+|++|||-=+ ++.+ T Consensus 81 ~~tg~L~~Si~~~~~~~--~v~vGt~~~yA~vHqfG~~~~~---~~---------------~~~~iPaRpfLG~s-~~d~ 139 (156) T protein:vir:19 81 TLHGDLARSITTDYGQD--YALIGSPKIYAAIHQWGGTPDM---AP---------------RPAGVPARPYMGLD-KTGE 139 (156) T ss_pred hhhHHHHHHhhheecCC--EEEEecchhhhHHhhcCccccc---CC---------------CccccCCccccCCC-HHHH Confidence 48899999998876544 5789999999999999963211 01 11258999999432 3344 Q ss_pred HHHHH----HHHHHhhc Q lcl|NC_011308. 140 SKVKD----YVIKVFGG 152 (154) Q Consensus 140 ~~i~~----~i~~~l~~ 152 (154) ..|.+ .|.++|+. T Consensus 140 ~~I~~~i~~~l~~~~~~ 156 (156) T protein:vir:19 140 QEIFDAIRKRVSAALRQ 156 (156) T ss_pred HHHHHHHHHHHHHHhhC Confidence 44444 45555555 No 100 >protein:vir:100652 Length: 134 # NCBI annotation: 77ORF029 # Family: family:all:589 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958610;genbank:gi:41189542;genbank:GeneID:2743798 Probab=98.62 E-value=8.1e-10 Score=70.42 Aligned_cols=123 Identities=12% Similarity=0.077 Sum_probs=83.7 Q ss_pred cchhhhhHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc--cccccccceeEEe---ecCceEEEEecCC Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDM-FDDTAEKALKQIGEHMKTEIAEGGHGDSSNN--VTGEYANKTDFEV---DKRKQEVKIGNSS 86 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~-l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPv--dTG~Lr~SI~~~~---~~~~~~~~V~~~~ 86 (154) ||.++++.+++.+.+.+ +..+....+.+-++..+++.+....+.+.++ |||.+.+++.++- ..+.-++.|+=.. 
T Consensus 1 MsvevkGv~eil~~LE~k~g~~~~~ri~dkAL~~age~v~~~~K~~~~~fkDTGati~ev~~s~p~~~~G~r~V~vgW~G 80 (134) T protein:vir:10 1 MSVKVTGDKALERELEKHFGIKEMVKVQDKALIAGAKVIVEEIKKQLKPSEDSGALISEIGRTEPEWIKGKRTVTIRWRG 80 (134) T ss_pred CeEEeecHHHHHHHHHHhhchhhhhhhhhHHHHHHhHHHHHHHHhhcCccccccceeccEeecCeeecCCceEEEEEEEc Confidence 99999999988777664 4444445555555555666667778877666 9999999996531 2222344554322 Q ss_pred C-----cccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_011308. 87 D-----YAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGGL 153 (154) Q Consensus 87 ~-----YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~l 153 (154) + +-+..|||+ .....|+|+.++||- -+..|++..++.+.+.++..|+.| T Consensus 81 ~~~R~~ivHLnE~Gy---------------t~~r~Gk~i~PrG~G---~i~~a~~~~e~~~~~~ik~eL~kl 134 (134) T protein:vir:10 81 PFERFRIVHLIENGH---------------VEKKSGKFVKPKAMG---GINRAIRQGQNKYFETLKRELKKL 134 (134) T ss_pred CCceeeEEEeeecce---------------eecCCCCeeccchhh---HHHHHHHhhhHHHHHHHHHHHhcC Confidence 2 344456664 122456666666664 477799999999999999999999 No 101 >protein:vir:6246 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813700;swissprot:trembl:q859b7;genbank:gi:29366760;uniprot:Q859B7;genbank:GeneID:1258903 Probab=98.61 E-value=2e-10 Score=73.81 Aligned_cols=115 Identities=14% Similarity=0.193 Sum_probs=76.2 Q ss_pred cch------hhhhHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhcCCc-----------cccccccceeEEee Q lcl|NC_011308. 13 MAD------DIKFEMDMSKIKDMF-DDTAEKALKQIGEHMKTEIAEGGHGDSSNN-----------VTGEYANKTDFEVD 74 (154) Q Consensus 13 Ma~------~v~~~~~l~~~~~~l-~~~~~~~v~~a~~~~~~~i~~~~ak~~aPv-----------dTG~Lr~SI~~~~~ 74 (154) ||+ +|+|...+..-++++ +..+.+.++.+.. .+++++-..++..+|+ +||.|..||++--. 
T Consensus 1 ma~~~~~~vrV~Glr~f~~~mrK~~g~dl~k~lk~a~~-~aa~v~~~~ar~~tP~g~r~~~~s~~~r~G~L~~Sir~aaT 79 (143) T protein:vir:62 1 MAQRSAYTIRVDGLREFQRNVRTLRDKELNKAVREANK-ASGEVLIPQAKHESPDGKRDAKSSKKYRPGKLDKSIKVTAS 79 (143) T ss_pred CCcccchheehHHHHHHHHHHHHhhCCchhHHHHHHHH-HHHHHHHHHHHhhcCCcccccccccccCcchhhcccccccc Confidence 664 466666666666666 4445555554433 3456777788999999 79999999987655 Q ss_pred cCceEEEEec--CCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHH--------HHHHHHH Q lcl|NC_011308. 75 KRKQEVKIGN--SSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRD--------EKSKVKD 144 (154) Q Consensus 75 ~~~~~~~V~~--~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~--------~~~~i~~ 144 (154) .....++.|. .++||+++|||+.-+ ...|+.||..++-. ++.+|.+ T Consensus 80 ~raa~VrAG~~krVPYA~~I~~G~r~r------------------------~Isp~rFl~~a~a~te~~~~r~Ye~~i~~ 135 (143) T protein:vir:62 80 AKGAVIKAGSASRVPYAAAIHFGYRAR------------------------NISPNRFLFRAMARKSDVVAATYERRIAA 135 (143) T ss_pred ccceeeeeCCcCCCCcccccccCcccc------------------------cccchhhhhhhhhccCHHHHHHHHHHHHH Confidence 4445556665 579999999996322 22356677666554 4445666 Q ss_pred HHHHHhhc Q lcl|NC_011308. 145 YVIKVFGG 152 (154) Q Consensus 145 ~i~~~l~~ 152 (154) .+++.|.. T Consensus 136 vl~k~l~s 143 (143) T protein:vir:62 136 VVEKYLES 143 (143) T ss_pred HHHHHhcC Confidence 66666666 No 102 >protein:vir:103841 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938236;genbank:gi:38229141;genbank:GeneID:2648156 Probab=98.57 E-value=2.2e-10 Score=73.53 Aligned_cols=121 Identities=14% Similarity=0.166 Sum_probs=62.3 Q ss_pred cchh--hhhHH-HHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHH------------------Hh---c--CCcccc Q lcl|NC_011308. 13 MADD--IKFEM-DMSKIKDMFDDTAE---KALKQIGEHMKTEIAEGGH------------------GD---S--SNNVTG 63 (154) Q Consensus 13 Ma~~--v~~~~-~l~~~~~~l~~~~~---~~v~~a~~~~~~~i~~~~a------------------k~---~--aPvdTG 63 (154) ||.. |++.+ .+.+.+..|...+. 
..+..++..+.....++.. +. . .=++|| T Consensus 1 Ms~~i~i~~~~~~~~~~L~~l~~~~~~~~~l~~~ig~~l~~~~~~rF~p~G~~W~plsp~t~~~r~k~g~~~~~~L~~tG 80 (155) T protein:vir:10 1 MANRIELELVDREVQERLAALYAAVTDTLPLMRGIAAELLAETEFAFMDEGPGWPQLSPVTVAARAAKGRGAHPILQVTN 80 (155) T ss_pred CCceEEEEechHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCCccchHHHHhccCCCCCccccch Confidence 8854 44432 34444544433321 1222222222111111111 00 0 015899 Q ss_pred ccccceeEEeecCceEEEEecCCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhH-HHHHH----H Q lcl|NC_011308. 64 EYANKTDFEVDKRKQEVKIGNSSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMR-YTFRD----E 138 (154) Q Consensus 64 ~Lr~SI~~~~~~~~~~~~V~~~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~-pA~~~----~ 138 (154) .|++||.+.+.. ..+.||+|..||.+.+||+.. +. + ....+||+|||- +.-++ . T Consensus 81 ~L~~Si~~~~~~--~~v~vGtn~~YA~iHqfGg~~-~~---~---------------~~~~iPARPfLG~s~~~e~~~ei 139 (155) T protein:vir:10 81 ALARSITTRADR--DQAQIGSNLSYAAIQQLGGQA-GR---G---------------RKVTIPARPYLPVLRNGQLKPSA 139 (155) T ss_pred hhhhhhhceecC--CEEEEecCcchhhhhhccccc-CC---C---------------CccccCCccccCCCccccchHHH Confidence 999999988754 457899999999999999521 00 0 013689999995 32222 2 Q ss_pred HHHHHHHHHHHhhccC Q lcl|NC_011308. 139 KSKVKDYVIKVFGGLD 154 (154) Q Consensus 139 ~~~i~~~i~~~l~~l~ 154 (154) ...|.+.+.+.|..=- T Consensus 140 ~~~I~~~i~~~l~~~r 155 (155) T protein:vir:10 140 RDAVLDVLLAALSQGR 155 (155) T ss_pred HHHHHHHHHHHHhhcC Confidence 3444444444442212 No 103 >protein:vir:79091 Length: 175 # NCBI annotation: gp5, phage virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111205;genbank:gi:134288802;genbank:GeneID:4960765 Probab=98.56 E-value=3.6e-10 Score=72.38 Aligned_cols=120 Identities=13% Similarity=0.179 Sum_probs=63.2 Q ss_pred cch--hhhhHH-HHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHhcCC--------------------------- Q lcl|NC_011308. 
13 MAD--DIKFEM-DMSKIKDMFDDTA---EKALKQIGEHMKTEIAEGGHGDSSN--------------------------- 59 (154) Q Consensus 13 Ma~--~v~~~~-~l~~~~~~l~~~~---~~~v~~a~~~~~~~i~~~~ak~~aP--------------------------- 59 (154) ||. +|++.+ .+.+.++.+...+ ...+..++..+.....++-.....| T Consensus 1 Ms~~i~i~~d~~~~~~~L~~l~~~~~d~~~lm~~Ig~~l~~~t~~rF~~~~~PdW~pls~~t~~~r~~~~~~~~~~~~~~ 80 (175) T protein:vir:79 1 MSDFVNFQIDDSALRTRLLQLEQAGHQKADAMRKITQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKNGELT 80 (175) T ss_pred CceEEEEEechHHHHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCChHHHHhhccccccccccccch Confidence 884 444432 2444444443222 1112222222221111111111111 Q ss_pred -------------ccccccccceeEEeecCceEEEEecCCCcccccccCccccccCCCCcccccceecccCceeecCCCC Q lcl|NC_011308. 60 -------------NVTGEYANKTDFEVDKRKQEVKIGNSSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSK 126 (154) Q Consensus 60 -------------vdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~ 126 (154) ++||.|++||.+++..+ .+.||||..||.+.+||+-. +.++ ...+| T Consensus 81 ~~~~~~~~~~~~L~~tG~L~~Si~~~~~~~--~v~vGtn~~YAaiHqfGg~~----~~~~---------------~v~IP 139 (175) T protein:vir:79 81 AAASRRKAGLMILQDSGQMAASTATDSGED--YSVIGSNKEYAAIQHFGGQA----GRGL---------------KVTIP 139 (175) T ss_pred hhHhhhccCCCcceechhhhhhhhheecCC--EEEEecCcchhhHhhccccc----CCCc---------------ccccC Confidence 37999999999887544 57899999999999999521 0111 12689 Q ss_pred CCchhHHHH---------HHHHHHHHHHHHHHhhcc Q lcl|NC_011308. 
127 ASKRMRYTF---------RDEKSKVKDYVIKVFGGL 153 (154) Q Consensus 127 a~PFl~pA~---------~~~~~~i~~~i~~~l~~l 153 (154) |+|||-=+- +...+.+.+.+.++|+.= T Consensus 140 ARPfLG~s~~de~~~~~~~~I~~~i~~~l~~a~~~~ 175 (175) T protein:vir:79 140 GRAWLPVTADGELQPEAVEPVLNTILRHLMDAANRR 175 (175) T ss_pred cccccCCCcccchhHHHHHHHHHHHHHHHHHHhccC Confidence 999996432 334444444455555444 No 104 >protein:vir:99196 Length: 155 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950453;genbank:gi:119953654;genbank:GeneID:4643056 Probab=98.55 E-value=8.4e-10 Score=70.34 Aligned_cols=121 Identities=16% Similarity=0.219 Sum_probs=67.6 Q ss_pred cchhhhhH---HHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHH----------------HHhc-------CCcccc Q lcl|NC_011308. 13 MADDIKFE---MDMSKIKDMFDDTAE---KALKQIGEHMKTEIAEGG----------------HGDS-------SNNVTG 63 (154) Q Consensus 13 Ma~~v~~~---~~l~~~~~~l~~~~~---~~v~~a~~~~~~~i~~~~----------------ak~~-------aPvdTG 63 (154) ||..|++. +.+.+.+..|...+. ..+..++..+...+.++- ++.. .=.+|| T Consensus 1 Ms~~i~i~~d~~~~~~~L~~l~~~~~d~~~l~~~ig~~l~~~~~~rF~pdG~~W~pls~~t~~~r~~~g~~~~~iL~~tg 80 (155) T protein:vir:99 1 MTTRIDVELDDQEVRQRLALLMRSVTDTLPVMRGIAAELLAETEFAFMDEGPGWPQLSPVTVAAREAKGRGPHPILQVTN 80 (155) T ss_pred CceEEEEEechHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhhccCCCCCCCChHHHHHHhccCCCCCCcchhch Confidence 88654433 234444444433221 111111111111111111 0100 115999 Q ss_pred ccccceeEEeecCceEEEEecCCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHH-----HHHH Q lcl|NC_011308. 64 EYANKTDFEVDKRKQEVKIGNSSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYT-----FRDE 138 (154) Q Consensus 64 ~Lr~SI~~~~~~~~~~~~V~~~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA-----~~~~ 138 (154) .|++||.+.+.. ..+.|++|..||.+.+||+.. 
+ + ++ ...+|++|||--+ ..+- T Consensus 81 ~L~~Si~~~~~~--~~v~vGtn~~YA~iHqfGg~~-~--~-~~---------------~v~iPaRpfLG~s~~~~l~~e~ 139 (155) T protein:vir:99 81 ALARSVTTWADR--NEAGIGSNLVYAAIHQFGGDA-G--R-GH---------------QVEIPARRYLPFDENGQLAAGA 139 (155) T ss_pred hhhhhhhceecC--CEEEEecCccchhhhhccccc-C--C-CC---------------ccccCCccccCCCCccccchHH Confidence 999999988754 458899999999999999521 0 0 00 1268999999532 2345 Q ss_pred HHHHHHHHHHHhhccC Q lcl|NC_011308. 139 KSKVKDYVIKVFGGLD 154 (154) Q Consensus 139 ~~~i~~~i~~~l~~l~ 154 (154) +..|.++|.+.|+.-- T Consensus 140 ~~~I~~~i~~~l~~~~ 155 (155) T protein:vir:99 140 RQSILEIVLTALSRNR 155 (155) T ss_pred HHHHHHHHHHHHhccC Confidence 6677777887777666 No 105 >protein:vir:107851 Length: 175 # NCBI annotation: gp31 # Family: family:all:274 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024704;genbank:gi:48696941;genbank:GeneID:2845939 Probab=98.53 E-value=8.9e-10 Score=70.20 Aligned_cols=120 Identities=13% Similarity=0.149 Sum_probs=63.4 Q ss_pred cch--hhhhHH-HHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHH----------------------------- Q lcl|NC_011308. 13 MAD--DIKFEM-DMSKIKDMFDDTA------EKALKQIGEHMKTEIAEGGH----------------------------- 54 (154) Q Consensus 13 Ma~--~v~~~~-~l~~~~~~l~~~~------~~~v~~a~~~~~~~i~~~~a----------------------------- 54 (154) ||. +|++.+ .+.+.+..+...+ ...|...+...+.+-.+.+. T Consensus 1 Ms~~i~i~~~~~~l~~~L~~l~~~~~d~~~l~~~Ig~~l~~~t~~rF~~e~~Pdw~p~~p~t~~~r~~~g~~~~k~~~~~ 80 (175) T protein:vir:10 1 MSDFVNFQIDDSALRTRLLQLEQAGHQKAGAMRKIAQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKNGELT 80 (175) T ss_pred CceeEEEEecHHHHHHHHHHHHHHhccHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchhhhhhhhcccccchhhhhhh Confidence 884 444432 3444444443222 12222222222111111110 Q ss_pred -----Hhc---CCccccccccceeEEeecCceEEEEecCCCcccccccCccccccCCCCcccccceecccCceeecCCCC Q lcl|NC_011308. 
55 -----GDS---SNNVTGEYANKTDFEVDKRKQEVKIGNSSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSK 126 (154) Q Consensus 55 -----k~~---aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~ 126 (154) +.. .=++||.|++||.+.+..+ .+.||||..||.+.+||+.. +.++ ...+| T Consensus 81 ~~~~~~~~~~~~L~~tG~L~~Si~~~~~~~--~v~vGtn~~YAaiHqfGg~~----~~~~---------------~v~iP 139 (175) T protein:vir:10 81 AAASRRKAGLMILQDSGQMAASVSTDHDDN--SAVIGSNKEYAAIHQFGGQA----GRGL---------------KVTIP 139 (175) T ss_pred hhhhhhccCCCcceechhhhhhhheeecCC--EEEEecChhhhhhhhccccc----CCCC---------------ccccC Confidence 000 0147999999999887544 57899999999999999521 0010 13689 Q ss_pred CCchhHHHH---------HHHHHHHHHHHHHHhhcc Q lcl|NC_011308. 127 ASKRMRYTF---------RDEKSKVKDYVIKVFGGL 153 (154) Q Consensus 127 a~PFl~pA~---------~~~~~~i~~~i~~~l~~l 153 (154) |+|||-=+- +.....+.+.|.++|+.= T Consensus 140 aRpfLG~s~~d~~~~e~~~~Il~~~~~~l~~~~~~~ 175 (175) T protein:vir:10 140 ARPWLPVTADGELQPEAVEPVLNTILRHLMDAANRR 175 (175) T ss_pred CccccCCCcccccchHHHHHHHHHHHHHHHHHhccC Confidence 999996532 333444444444444444 No 106 >protein:vir:9513 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835560;genbank:gi:30043947;genbank:GeneID:1260542 Probab=98.52 E-value=1.9e-09 Score=68.44 Aligned_cols=123 Identities=11% Similarity=0.110 Sum_probs=82.4 Q ss_pred cchhhhhHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc--cccccccceeEEe---ecCceEEEEecCC Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDM-FDDTAEKALKQIGEHMKTEIAEGGHGDSSNN--VTGEYANKTDFEV---DKRKQEVKIGNSS 86 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~-l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPv--dTG~Lr~SI~~~~---~~~~~~~~V~~~~ 86 (154) ||.+++|.+++.+.+.+ +.++....+.+-++..+++.+....+...++ |||.+.+++.++- ..+.-++.|+=.. 
T Consensus 1 msvevkGv~eil~~le~k~g~~~~~ri~nkAL~~age~v~~~~K~~~~~fkDTG~t~~ev~~s~p~~~~G~r~V~vgW~G 80 (134) T protein:vir:95 1 MSVKVIGDKALERELEKRFGIKEMVKVQDKALIAGAKVIVEEVKKQLKPSKDTGALINEVSFSKPEWINGKRTITVHWRG 80 (134) T ss_pred CeEEEecHHHHHHHHHHhhchhhhhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeccEEecCeeecCCceEEEEEEEc Confidence 99999999988777664 4444445555555555666677778888886 9999999996531 2222234554322 Q ss_pred C-----cccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_011308. 87 D-----YAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGGL 153 (154) Q Consensus 87 ~-----YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~l 153 (154) + +-+.-|||+- ....|+|+.++||- -+..|++..++.+.+.|+..|++| T Consensus 81 ~~~R~~iiHLNE~Gyt---------------r~~~Gk~i~PrG~G---~i~~a~~~~e~~~~~~ik~eL~kl 134 (134) T protein:vir:95 81 SKDRYKIVHLIEYGHV---------------QKGTGKFIKPKAMG---GVNRAIRQGQNKYFETLKRELKKL 134 (134) T ss_pred CCceeEEEEeecccce---------------ecccCCccCcchhh---HHHHHHHhhhHHHHHHHHHHHhcC Confidence 2 3445556531 11245555555553 477799999999999999999999 No 107 >protein:vir:101302 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908835;genbank:gi:118725099;genbank:GeneID:4555873 Probab=98.52 E-value=1.9e-09 Score=68.44 Aligned_cols=123 Identities=11% Similarity=0.110 Sum_probs=82.4 Q ss_pred cchhhhhHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc--cccccccceeEEe---ecCceEEEEecCC Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDM-FDDTAEKALKQIGEHMKTEIAEGGHGDSSNN--VTGEYANKTDFEV---DKRKQEVKIGNSS 86 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~-l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPv--dTG~Lr~SI~~~~---~~~~~~~~V~~~~ 86 (154) ||.+++|.+++.+.+.+ +.++....+.+-++..+++.+....+...++ |||.+.+++.++- ..+.-++.|+=.. 
T Consensus 1 msvevkGv~eil~~le~k~g~~~~~ri~nkAL~~age~v~~~~K~~~~~fkDTG~t~~ev~~s~p~~~~G~r~V~vgW~G 80 (134) T protein:vir:10 1 MSVKVIGDKALERELEKRFGIKEMVKVQDKALIAGAKVIVEEVKKQLKPSKDTGALINEVSFSKPEWINGKRTITVHWRG 80 (134) T ss_pred CeEEEecHHHHHHHHHHhhchhhhhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeccEEecCeeecCCceEEEEEEEc Confidence 99999999988777664 4444445555555555666677778888886 9999999996531 2222234554322 Q ss_pred C-----cccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_011308. 87 D-----YAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGGL 153 (154) Q Consensus 87 ~-----YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~l 153 (154) + +-+.-|||+- ....|+|+.++||- -+..|++..++.+.+.|+..|++| T Consensus 81 ~~~R~~iiHLNE~Gyt---------------r~~~Gk~i~PrG~G---~i~~a~~~~e~~~~~~ik~eL~kl 134 (134) T protein:vir:10 81 SKDRYKIVHLIEYGHV---------------QKGTGKFIKPKAMG---GVNRAIRQGQNKYFETLKRELKKL 134 (134) T ss_pred CCceeEEEEeecccce---------------ecccCCccCcchhh---HHHHHHHhhhHHHHHHHHHHHhcC Confidence 2 3445556531 11245555555553 477799999999999999999999 No 108 >protein:vir:4859 Length: 140 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049399;genbank:gi:9632427;genbank:GeneID:1258496 Probab=98.51 E-value=1.4e-09 Score=69.20 Aligned_cols=115 Identities=12% Similarity=-0.008 Sum_probs=74.6 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc------ccc---ccccceeEEe-ecCc---eE Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNN------VTG---EYANKTDFEV-DKRK---QE 79 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPv------dTG---~Lr~SI~~~~-~~~~---~~ 79 (154) |++--++..++.+.+.+|.+.. ...++.+..+++++.+...+..+|. 
+|| .|++||.++- +.++ -+ T Consensus 1 M~~~~d~l~e~~~~lekl~~~~-~~~~~katkAGA~v~~~~L~~~tp~~h~~~~~t~~~~HlaD~I~~~~~~iDg~~~g~ 79 (140) T protein:vir:48 1 MTGLDEALEGWLKTVASIGDLT-PAEQAKITTAGAKVFKEELAEVTRQKHYSNKKHLKYGHMADGLSVQSTNVDGRKNGV 79 (140) T ss_pred CccHHHHHHHHHHHHHHhccCC-HHHHHHHHHHHHHHHHHHHHHhccccCCCCCCCCCCCcchhceeecccccccccCce Confidence 9863344444444444443221 3455566678888888888888883 344 6999998642 1111 13 Q ss_pred EEEecC----CCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHH--HHHHHHHHHHHhhcc Q lcl|NC_011308. 80 VKIGNS----SDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDE--KSKVKDYVIKVFGGL 153 (154) Q Consensus 80 ~~V~~~----~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~--~~~i~~~i~~~l~~l 153 (154) ..||-+ +-+|.|+|+|| ..|||+||+.++.+++ +..+.+.+.+.++++ T Consensus 80 s~VG~~kk~~a~~A~f~n~GT--------------------------~k~~~~hFve~~~~e~~~k~~vl~A~~~~~~~~ 133 (140) T protein:vir:48 80 STVGWVNRYHAQNARRLNDGT--------------------------KKYRADHFVTNVQNDSAVQTKVLLAEKEEYEKL 133 (140) T ss_pred eeeccCCCcceeeeeccccCc--------------------------cccCCCchhHHHHHhhhhHHHHHHHHHHHHHHH Confidence 357754 34789999998 4799999999999976 666766666665554 Q ss_pred C Q lcl|NC_011308. 154 D 154 (154) Q Consensus 154 ~ 154 (154) = T Consensus 134 l 134 (140) T protein:vir:48 134 I 134 (140) T ss_pred H Confidence 3 No 109 >protein:vir:1332 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047931;swissprot:trembl:q9zxa7;genbank:gi:9631149;uniprot:Q9ZXA7;genbank:GeneID:2715891 Probab=98.50 E-value=6.5e-10 Score=70.94 Aligned_cols=115 Identities=15% Similarity=0.204 Sum_probs=74.5 Q ss_pred cch------hhhhHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhcCCcc-----------ccccccceeEEee Q lcl|NC_011308. 13 MAD------DIKFEMDMSKIKDMF-DDTAEKALKQIGEHMKTEIAEGGHGDSSNNV-----------TGEYANKTDFEVD 74 (154) Q Consensus 13 Ma~------~v~~~~~l~~~~~~l-~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvd-----------TG~Lr~SI~~~~~ 74 (154) ||+ .|+|...+..-++++ +..+.+.++.+.. 
.+++++-..++..+|+- ||.|..||++--. T Consensus 1 ma~~~~~~vkV~Glr~f~~~mrK~~g~dl~k~lk~a~~-~aa~v~~~~ar~~tP~g~~~p~~srr~r~G~L~~Sir~aaT 79 (143) T protein:vir:13 1 MAQRSAYTIQVDGLRQFQRNVRALRDKELNKAVREANK-ASGEVLIPQAKHESPDGHRDPKSSKRYRPGKLDKSIKVTAS 79 (143) T ss_pred CCcccchheehHHHHHHHHHHHHhhCCcchHHHHHHHH-HHHHHHHHHHHhhcCCcccccccccccccchhhcccccccc Confidence 764 456666666666665 4444555554433 34567777789999986 8999999987654 Q ss_pred cCceEEEEecC--CCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHH--------HHHHHHH Q lcl|NC_011308. 75 KRKQEVKIGNS--SDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRD--------EKSKVKD 144 (154) Q Consensus 75 ~~~~~~~V~~~--~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~--------~~~~i~~ 144 (154) .....++.|.. ++||+++|||+..+ ...++.||+.++-. ++.+|.+ T Consensus 80 ~raa~VrAGr~arVPYA~~I~~G~r~r------------------------~Is~~rFl~~a~a~te~~~~r~Ye~~i~~ 135 (143) T protein:vir:13 80 AKGAVIKAGSAARVPYAAAIHFGYRKR------------------------NISANRFLYRAMARKSDVVAATYERRIAA 135 (143) T ss_pred ccceeeeecCcCCCCcccccccCCccc------------------------ccchhhhhhhhhhccCHHHHHHHHHHHHH Confidence 44445556643 79999999996322 22366677666655 4445666 Q ss_pred HHHHHhhc Q lcl|NC_011308. 145 YVIKVFGG 152 (154) Q Consensus 145 ~i~~~l~~ 152 (154) .+++.|.. T Consensus 136 vl~k~l~s 143 (143) T protein:vir:13 136 VVEKYLES 143 (143) T ss_pred HHHHHhcC Confidence 66666666 No 110 >protein:vir:99833 Length: 190 # NCBI annotation: hypothetical protein # Family: family:all:274 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164071;genbank:gi:56692603;genbank:GeneID:3192561 Probab=98.48 E-value=1.7e-09 Score=68.59 Aligned_cols=138 Identities=14% Similarity=0.161 Sum_probs=63.9 Q ss_pred cc-hhhhhHH-HHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHhc------------------------CCcccc Q lcl|NC_011308. 13 MA-DDIKFEM-DMSKIKDMFDDTA---EKALKQIGEHMKTEIAEGGHGDS------------------------SNNVTG 63 (154) Q Consensus 13 Ma-~~v~~~~-~l~~~~~~l~~~~---~~~v~~a~~~~~~~i~~~~ak~~------------------------aPvdTG 63 (154) |+ +.|++.. 
.+.+.++.+...+ ..-+..++..+.....++..... .=.+|| T Consensus 1 M~~i~i~~d~~~~~~~L~~l~~~~~~~~~l~~~ig~~l~~~~~~rf~~~~~PdG~~W~p~~~~t~~rk~~~~~~~L~~tg 80 (190) T protein:vir:99 1 MAGITLEWDGRRALDVLNAGSAALGDPSGLLQDIGELLLNIHRRRFQAQVSPDGTPWQPLSPAYLRRKRKNRDKILTLDG 80 (190) T ss_pred CceeEEEecHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCccccHHHHHHhhcCCCccceecH Confidence 54 3344321 2333333332221 11122222222121212111111 114899 Q ss_pred ccccceeEEeecCceEEEEecCCCcccccccCccccccCCCCcc--------cccc----eecccCce-------eecCC Q lcl|NC_011308. 64 EYANKTDFEVDKRKQEVKIGNSSDYAIYYEFGTGEKSEKGGGRA--------GGWS----YMDKNGKW-------HFTRG 124 (154) Q Consensus 64 ~Lr~SI~~~~~~~~~~~~V~~~~~YA~yVE~GTg~~~~~~~~~~--------~~~~----~~~~~g~~-------~~t~g 124 (154) .|++||.+.+..+ .++|++|..||...+||.-. ...+.... .++. .....+.+ ..+-. T Consensus 81 ~L~~Si~~~~~~~--~v~vGtn~~yA~iHq~Gg~i-~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~v~ 157 (190) T protein:vir:99 81 HLRNLLRYQLDGS--ELLFGSDRPYAAIHHFGGTI-QRQARSSTVYFRQNERTGEVGREFVPRRRSNFAQDVQIGPYTIQ 157 (190) T ss_pred HHHHHHhheecCc--EEEEecCcchhhhhhcCCcc-cccccchhhhhhhhhhhhhhhcccccccccccchhcccccceee Confidence 9999999887654 58899999999999999432 11111110 0110 00110000 11224 Q ss_pred CCCCchhHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_011308. 
125 SKASKRMRYTFRDEKSKVKDYVIKVFGGLD 154 (154) Q Consensus 125 ~~a~PFl~pA~~~~~~~i~~~i~~~l~~l~ 154 (154) +|++|||--+ ++.+.+|.+.|.+.|+++= T Consensus 158 IPaRpfLG~s-~~d~~~I~~~i~~~l~~~~ 186 (190) T protein:vir:99 158 MPARPWLGTS-SQDDDTILQRVERYLQRAL 186 (190) T ss_pred ecCcccCCCC-HHHHHHHHHHHHHHHHHHH Confidence 6999999443 3344555555555555544 No 111 >protein:vir:79225 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469157;genbank:gi:157835000;genbank:GeneID:5648806 Probab=98.46 E-value=9.1e-10 Score=70.14 Aligned_cols=121 Identities=16% Similarity=0.206 Sum_probs=66.2 Q ss_pred cchhhhhH---HHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHH----------------HHhc-------CCcccc Q lcl|NC_011308. 13 MADDIKFE---MDMSKIKDMFDDTAE---KALKQIGEHMKTEIAEGG----------------HGDS-------SNNVTG 63 (154) Q Consensus 13 Ma~~v~~~---~~l~~~~~~l~~~~~---~~v~~a~~~~~~~i~~~~----------------ak~~-------aPvdTG 63 (154) |+..|++. ..+.+.+..|...+. ..+..++..+...+.++- ++.. .=++|| T Consensus 1 M~~~i~i~~d~~~~~~~L~~l~~~~~d~~~l~~~ig~~l~~~~~~rF~~eG~~W~pls~~t~~~r~~~g~~~~~iL~~tG 80 (155) T protein:vir:79 1 MTTRIDVELDDQEVRQRLAVLMRSVTDTLPVMRGIAAELLAETEFAFMDEGPGWPQLSPATVAAREAKGRGPHPILQVTN 80 (155) T ss_pred CceEEEEEechHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhhccCCCCCCCCHHHHHHHhccCCCCCCccccch Confidence 88654433 234444444433322 111222211111111111 1110 116999 Q ss_pred ccccceeEEeecCceEEEEecCCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHH-----HHH Q lcl|NC_011308. 64 EYANKTDFEVDKRKQEVKIGNSSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTF-----RDE 138 (154) Q Consensus 64 ~Lr~SI~~~~~~~~~~~~V~~~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~-----~~~ 138 (154) .|++||.+++.. ..+.||++..||.+.+||+-.. 
+ ++ ...+|++|||--+- .+- T Consensus 81 ~L~~Si~~~~~~--~~v~vGt~~~YA~iHqfGg~~~---~-~~---------------~v~iPaRpfLG~s~~~~l~~~~ 139 (155) T protein:vir:79 81 ALARSVTTWADR--NEAGIGSNLVYAAIHQFGGDAG---R-GH---------------QVEIPARRYLPFDENGQLAAGA 139 (155) T ss_pred hhhhhhhceecC--CEEEEecCchhhhhhhcccccC---C-CC---------------ccccCCccccCCCCccccchHH Confidence 999999887654 4688999999999999996210 0 00 12589999994322 233 Q ss_pred HHHHHHHHHHHhhccC Q lcl|NC_011308. 139 KSKVKDYVIKVFGGLD 154 (154) Q Consensus 139 ~~~i~~~i~~~l~~l~ 154 (154) ++.|.++|.+.|+.-- T Consensus 140 ~~~I~~~i~~~l~r~r 155 (155) T protein:vir:79 140 RQSILEVVLTALSRNR 155 (155) T ss_pred HHHHHHHHHHHHHhcC Confidence 4567777777775444 No 112 >protein:vir:4833 Length: 140 # NCBI annotation: ORF29 # Family: family:all:1029 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038330;genbank:gi:9634656;genbank:GeneID:1262624 Probab=98.43 E-value=2.6e-09 Score=67.63 Aligned_cols=115 Identities=13% Similarity=0.029 Sum_probs=74.9 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc------cc---cccccceeEEee-cCce---E Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNN------VT---GEYANKTDFEVD-KRKQ---E 79 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPv------dT---G~Lr~SI~~~~~-~~~~---~ 79 (154) |++--++..++.+.+.+|.+.. ...++.+..+++++.+...+..+|. .| |.|++||.++-. .++. + T Consensus 1 M~~~~d~l~e~~~~v~kl~~~~-~~~~~katkAGAkv~~~~L~~~tp~~h~~~r~t~~~~HlaD~I~~~~~~idg~~dG~ 79 (140) T protein:vir:48 1 MTGLDEALEGWLKTVASIGDLT-PAEQAKITTAGAKVFKKELAEVTREKHYSKKKDLKYGHMADGLAVQSTNVDGRKNGV 79 (140) T ss_pred CccHHHHHHHHHHHHHHhccCC-HHHHHHHHHHhHHHHHHHHHHhcccCCCCCCCCCCCCcccccceecccccccccccc Confidence 8863334444444344443221 3455666778888888777888874 24 479999987521 1111 2 Q ss_pred EEEecCC----CcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHH--HHHHHHHHHHHhhcc Q lcl|NC_011308. 
80 VKIGNSS----DYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDE--KSKVKDYVIKVFGGL 153 (154) Q Consensus 80 ~~V~~~~----~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~--~~~i~~~i~~~l~~l 153 (154) ..||-+. -+|.|+|+|| +.|||+||+.++.++. ++.+.+.+.+++++| T Consensus 80 s~VG~~k~~~a~~a~f~NdGT--------------------------~k~~~~hFve~t~~e~~~~~~vl~A~~~~y~~~ 133 (140) T protein:vir:48 80 ATVGWKNNYHAQNARRLNDGT--------------------------KKYRADHFVTNVQNDSAVRDKVLLAEKEEYEKL 133 (140) T ss_pred eeecccCCCceeEEeecccCc--------------------------cccCCCchHHHHHHhhhhHHHHHHHHHHHHHHH Confidence 3566553 4689999998 4799999999999865 677777777666554 Q ss_pred C Q lcl|NC_011308. 154 D 154 (154) Q Consensus 154 ~ 154 (154) = T Consensus 134 l 134 (140) T protein:vir:48 134 I 134 (140) T ss_pred H Confidence 3 No 113 >protein:vir:80970 Length: 112 # NCBI annotation: gp10 # Family: family:all:899 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468396;genbank:gi:157324970;genbank:GeneID:5601405 Probab=97.96 E-value=1.7e-07 Score=57.73 Aligned_cols=111 Identities=14% Similarity=-0.002 Sum_probs=63.7 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCceEEEEecCCCccccc Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRKQEVKIGNSSDYAIYY 92 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~yV 92 (154) |+++|++. +..+.+.+.++ .+++...++.++++ .....+|.+||+|++|- +.+.+ +.|..+++||.++ T Consensus 1 M~vkV~id--~~~~~~~l~~a----~~~aq~~~~~ev~~-~~~~yVP~~tG~L~~s~-~~~~~----g~I~y~tPYAr~q 68 (112) T protein:vir:80 1 MPIKVRVD--LSKAKGSVKKA----KERGQFALINQAAA-DIALYVPFLSGDLSNQY-VIMND----KEIMWTSIYARRL 68 (112) T ss_pred CceeEEee--hHHHHHHHHHH----HHHHHHHHHHHHHH-HhhcCCCcccCccccce-eeccC----ceEEecCchhhHh Confidence 99887764 33333334433 33443334445544 34678999999999995 22222 4678889999999 Q ss_pred ccCc-cccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_011308. 
93 EFGT-GEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGGLD 154 (154) Q Consensus 93 E~GT-g~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~l~ 154 (154) -||- +.+ + .+.+ .....-++..|.....+.+.+.+.+++++== T Consensus 69 YY~~~~~~------~------------~~~~-p~ag~~W~erak~~~~~~~~~~~~k~~~~~l 112 (112) T protein:vir:80 69 YNGINFNF------T------------LTHH-PLAGPKWDQRAKVDKLESWIEVAQKAVEEGL 112 (112) T ss_pred hhcccCCC------C------------cCCC-CCcchhhHHHHHhhhhHHHHHHHHHHHhhcC Confidence 9973 111 0 0111 1122335566777777776666666554322 No 114 >protein:vir:3848 Length: 159 # NCBI annotation: hypothetical protein # Family: family:all:1029 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050154;swissprot:trembl:q9t1f3;genbank:gi:9633046;uniprot:Q9T1F3;genbank:GeneID:1262148 Probab=97.94 E-value=3.1e-07 Score=56.24 Aligned_cols=114 Identities=12% Similarity=0.060 Sum_probs=77.9 Q ss_pred cchhhhhHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHhcCCcc-----------------------cccccc Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDTAE--KALKQIGEHMKTEIAEGGHGDSSNNV-----------------------TGEYAN 67 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~~--~~v~~a~~~~~~~i~~~~ak~~aPvd-----------------------TG~Lr~ 67 (154) |..+ |.+.|...++++.+... ..-+..+..+++++.+...+..+|.. +|.|++ T Consensus 1 mm~~--~~~~l~~~l~~v~k~~~~~~~~k~kiTkAGAkv~~e~L~~~Tp~~h~~~~k~~~~~~~~~k~~~~~~~~~HlaD 78 (159) T protein:vir:38 1 MAND--MGEFYNNWVNEVEKGMKLSVEDKAKITGEGAEAFSTVLHDHTPRSNEIYRRGRSAGHANAKHHNRNRKTKHLQD 78 (159) T ss_pred Ccch--HHHHHHHHHHHHHHhcCCCHHHHHHHHHHhHHHHHHHHHHhcccCCCccccccccccccccccCcCcCCCcccc Confidence 6544 44556666666655322 23344555667777777777777762 369999 Q ss_pred ceeEEe--ecCce---EEEEecCC----CcccccccCccccccCCCCcccccceecccCceeecCCCCCC-----chhHH Q lcl|NC_011308. 68 KTDFEV--DKRKQ---EVKIGNSS----DYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKAS-----KRMRY 133 (154) Q Consensus 68 SI~~~~--~~~~~---~~~V~~~~----~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~-----PFl~p 133 (154) ||.++- +.++. +..||-+. -+|.|++.||. .|||+ +|+.. 
T Consensus 79 ~I~~~~~~~iDg~~dG~s~VGw~~~~~a~~a~f~NdGT~--------------------------~m~~k~~~gdHFvek 132 (159) T protein:vir:38 79 SITYKPGYTADKLHTGDTDVGFEGKYYDFLAKIVNNGQH--------------------------HMSPKRYKNMHFLDK 132 (159) T ss_pred ceeeecCccccccccceeeecccCCccceEeeecccCcc--------------------------ccCCCCccCChhHHH Confidence 997742 22221 34577633 46789999983 56655 89999 Q ss_pred HHHHHHHHHHHHHHHHhhccC Q lcl|NC_011308. 134 TFRDEKSKVKDYVIKVFGGLD 154 (154) Q Consensus 134 A~~~~~~~i~~~i~~~l~~l~ 154 (154) +.++.++.|.+.+.+++++|= T Consensus 133 t~~~~k~~Vl~A~~~~~~~il 153 (159) T protein:vir:38 133 AQQEAKKSVAEAELKAYKEVM 153 (159) T ss_pred HHHHHHHHHHHHHHHHHHHHh Confidence 999999999999999998886 No 115 >protein:vir:7449 Length: 123 # NCBI annotation: gp26 # Family: family:all:2713 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818564;genbank:gi:29567001;genbank:GeneID:1260238 Probab=97.89 E-value=1.5e-07 Score=57.99 Aligned_cols=113 Identities=15% Similarity=0.172 Sum_probs=78.2 Q ss_pred cch-hhhhH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC--cccccccccee--EEeec-CceEEEEecC Q lcl|NC_011308. 13 MAD-DIKFE-MDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSN--NVTGEYANKTD--FEVDK-RKQEVKIGNS 85 (154) Q Consensus 13 Ma~-~v~~~-~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aP--vdTG~Lr~SI~--~~~~~-~~~~~~V~~~ 85 (154) ||. +|+|. ..+.+.+..+..+....+.--+. ..+...+..||.+|| -+||.-|.+|. .+..+ +.+++.+.-+ T Consensus 1 ~~~~~f~~d~~~l~~~i~~~~~k~~~~~~~~~d-~~a~~le~~aK~nApW~DRTg~ARqgl~~~~~~~g~~~~~Iylsh~ 79 (123) T protein:vir:74 1 MAKVTFEYDAQELRTNIRNLDRRMESAVDALMD-YEAAYATGQLKMRAPWTDRTGAARSGLLAVANKLGPGSHELIMSYS 79 (123) T ss_pred CceeEEEecHHHHHHHHHhhHHHHHHHHHHHHH-HHHHHHHHHHhcCCCCcccchhhhhhhccccccCCCceEEEEEecC Confidence 884 34442 22333344444444444443333 334445788999999 49999999994 33333 3577788889 Q ss_pred CCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_011308. 
86 SDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGGLD 154 (154) Q Consensus 86 ~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~l~ 154 (154) ++|.+|.|.+++ |.+ --+.|+++..-++|.+-+...|.+|. T Consensus 80 veYG~~LEla~~--------------------------~ky--aIi~Ptv~~~~~~im~g~~~ll~~l~ 120 (123) T protein:vir:74 80 VHYGIWLEIANS--------------------------GQY--AVIGPFLPVMGRKLMHDLEHLIDRLE 120 (123) T ss_pred eeecceeeecCC--------------------------CCc--eeecchHHHHhHHHHHHHHHHHHHhh Confidence 999999998874 111 14799999999999999999999999 No 116 >protein:vir:93898 Length: 133 # NCBI annotation: ORF028 # Family: family:all:589 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239942;genbank:gi:66395616;genbank:GeneID:5130964 Probab=97.87 E-value=3.9e-07 Score=55.72 Aligned_cols=122 Identities=15% Similarity=0.094 Sum_probs=78.3 Q ss_pred cchhhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC--ccccccccceeEEe--ecCc---eEEEEec Q lcl|NC_011308. 13 MADDIKFEMDMSKIKD-MFDDTAEKALKQIGEHMKTEIAEGGHGDSSN--NVTGEYANKTDFEV--DKRK---QEVKIGN 84 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~-~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aP--vdTG~Lr~SI~~~~--~~~~---~~~~V~~ 84 (154) ||.+++|.+++.+.+. .|.++..+.+.+-++..+++.+....+.... .|||..-+++.++- ...+ -++.|+= T Consensus 1 msvevkGv~eilk~le~k~G~~~~~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~i~W 80 (133) T protein:vir:93 1 MSVEIKGIPEVLKKLESVYGKQSMQAKSDRALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEW 80 (133) T ss_pred CeEEEecHHHHHHHHHHhhCHhhhHhhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceEEEEEe Confidence 9999999998777665 5555544555444455555555555666655 59999999986541 1111 2333332 Q ss_pred CCC---cccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_011308. 85 SSD---YAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGG 152 (154) Q Consensus 85 ~~~---YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~ 152 (154) ..+ |. -||. .-|- +..+|+|+.++||- -+..|++..++.+.+.+++.|+. 
T Consensus 81 ~gp~~R~~-iVHL-------------NE~G-ytr~Gk~i~PrG~G---~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:93 81 VGPMNRKN-IIHL-------------NEHG-YTRDGKKYTPRGFG---VIAKTLAANERKYREIIKKELAR 133 (133) T ss_pred ecCCCcee-EEEe-------------eccc-eecCCCeEccchhh---HHHHHHHhhhHHHHHHHHHHhcC Confidence 211 11 1111 0122 35678888888885 58999999999999999999999 No 117 >protein:vir:9647 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795409;genbank:gi:28876182;genbank:GeneID:1257731 Probab=97.87 E-value=4.3e-07 Score=55.48 Aligned_cols=120 Identities=15% Similarity=0.104 Sum_probs=74.3 Q ss_pred cch--hhhhHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc--cccccccceeEEe---ecCceEEEEec Q lcl|NC_011308. 13 MAD--DIKFEMDMSKIKDM-FDDTAEKALKQIGEHMKTEIAEGGHGDSSNN--VTGEYANKTDFEV---DKRKQEVKIGN 84 (154) Q Consensus 13 Ma~--~v~~~~~l~~~~~~-l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPv--dTG~Lr~SI~~~~---~~~~~~~~V~~ 84 (154) ||. +|++.+++.+.+.+ +.++..+.+.+.++..+++.++...+...|+ |||.+-+++.++- ..+--++.|+- T Consensus 1 ~~~~aevkGv~Eilk~lE~klG~~~v~ri~nkAL~~~ge~v~~~lK~~~~~f~DTG~t~dev~~s~~~~~~G~r~V~VgW 80 (132) T protein:vir:96 1 MSGFANLKGVEELLANMEKKLGPAKVNRVVNRSLKEIGKELEPSFKSAISIYKRTGETTESAVVSGVRREDGIPKVKLGF 80 (132) T ss_pred CCccccccCHHHHHHHHHHhhCHHHHHHHhHHHHHHHHHHHHHHHHHhhhhhhhcchhhcceeecCeeecCCceEEEecc Confidence 553 79999988777765 7776556666666677778888888999997 9999999986531 11112333433 Q ss_pred CCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHH----HHHHHHHHhhc Q lcl|NC_011308. 85 SSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSK----VKDYVIKVFGG 152 (154) Q Consensus 85 ~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~----i~~~i~~~l~~ 152 (154) +-+--.-||. .-|-| |+++.++|+- ++..|++..++. 
+...|++.|.+ T Consensus 81 ~GpR~~ivHL-------------NE~Gy----Gk~~~PrG~G---~I~~a~~~se~~~~~~~~~elkk~l~~ 132 (132) T protein:vir:96 81 TTPRWNIVHL-------------QELEY----GWKHNRRGVG---VIRRYSDILETIYPRGIRDKLKRGFDG 132 (132) T ss_pred cCCceeEEee-------------ecccc----cCCcCCCcch---HHHHHHHhhhhHHHHHHHHHHHHHhcC Confidence 2210000100 01112 6666666664 789999999954 55555555555 No 118 >protein:vir:45 Length: 112 # NCBI annotation: gp10 # Family: family:all:899 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463471;swissprot:trembl:q9t1b3;genbank:gi:16798793;uniprot:Q9T1B3;genbank:GeneID:922369 Probab=97.84 E-value=3.9e-07 Score=55.75 Aligned_cols=111 Identities=14% Similarity=0.039 Sum_probs=64.8 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCceEEEEecCCCccccc Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRKQEVKIGNSSDYAIYY 92 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~yV 92 (154) |+++|++.. .++...+.++ ++++...++.++++ .....+|.+||+|++|-. .+.+ +.|..+++||.+. T Consensus 1 M~vkv~vn~--~~~~~~l~~a----~~r~q~~~~~ev~~-~~~~yVP~~~G~L~~S~~-~~~~----g~I~y~tPYAr~q 68 (112) T protein:vir:45 1 MPIKVRVDL--SKAKGSVKKA----KERGQFALINQAAA-DIALYVPFLSGDLSNQYV-IMND----KEIMWTSIYARRL 68 (112) T ss_pred CceeEEeeh--HHHHHHHHHH----HHHHHHHHHHHHHH-HhhcCCccccCcccccee-eccC----CeEEecChhhHHh Confidence 998877643 3333334333 33333333444444 346789999999999953 2332 3577889999999 Q ss_pred ccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhh-cc Q lcl|NC_011308. 
93 EFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFG-GL 153 (154) Q Consensus 93 E~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~-~l 153 (154) -||.--+ +++ ..+ .....-++..|.....+.+.+.+.++++ +| T Consensus 69 YY~~~~~-----~~~------------~~~-p~ag~~W~erak~~~~~~~~~~~~k~~~~gl 112 (112) T protein:vir:45 69 YKGINFN-----FTL------------THH-PLAGPEWDQRAKIDKMDVWEKVAQKAVEEGL 112 (112) T ss_pred hhccccC-----CCC------------CCC-CCCchhhHHHHHHhhHHHHHHHHHHHHhhcC Confidence 8874211 110 011 1223345666777777777777666654 34 No 119 >protein:vir:94419 Length: 133 # NCBI annotation: ORF028 # Family: family:all:589 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240010;genbank:gi:66395683;genbank:GeneID:5133079 Probab=97.81 E-value=6e-07 Score=54.70 Aligned_cols=120 Identities=16% Similarity=0.096 Sum_probs=79.5 Q ss_pred cchhhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC--ccccccccceeEEe--ecCc---eEEEEec Q lcl|NC_011308. 13 MADDIKFEMDMSKIKD-MFDDTAEKALKQIGEHMKTEIAEGGHGDSSN--NVTGEYANKTDFEV--DKRK---QEVKIGN 84 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~-~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aP--vdTG~Lr~SI~~~~--~~~~---~~~~V~~ 84 (154) ||.+++|.+++.+.+. .+.++....+.+-++..+++.+....+.... .|||..-+++.++- ...+ -++.|+= T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~i~W 80 (133) T protein:vir:94 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEW 80 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceeEEEEe Confidence 9999999998877765 4666655555555555555666656666655 59999999886531 1111 2233332 Q ss_pred CCCcccccccCccccccCCCCccc-----ccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_011308. 85 SSDYAIYYEFGTGEKSEKGGGRAG-----GWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGG 152 (154) Q Consensus 85 ~~~YA~yVE~GTg~~~~~~~~~~~-----~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~ 152 (154) .. |+.|.. -|- +..+|+|+.++||- -+..|++..++.+.+.+++.|+. 
T Consensus 81 ~g----------------p~~R~~iVHLNE~G-ytr~Gk~i~PrG~G---~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:94 81 VG----------------PMNRKNIIHLNEHG-YTRDGKKYTPRGFG---VIAKTLAASERKYREIIKKELAR 133 (133) T ss_pred ec----------------CCCceeEEEeeccc-eecCCCeEccchhh---HHHHHHHhhhHHHHHHHHHHhcC Confidence 11 111111 122 35678888888885 58999999999999999999999 No 120 >protein:vir:78644 Length: 133 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429946;genbank:gi:156604000;genbank:GeneID:5525390 Probab=97.81 E-value=6e-07 Score=54.70 Aligned_cols=120 Identities=16% Similarity=0.096 Sum_probs=79.5 Q ss_pred cchhhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC--ccccccccceeEEe--ecCc---eEEEEec Q lcl|NC_011308. 13 MADDIKFEMDMSKIKD-MFDDTAEKALKQIGEHMKTEIAEGGHGDSSN--NVTGEYANKTDFEV--DKRK---QEVKIGN 84 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~-~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aP--vdTG~Lr~SI~~~~--~~~~---~~~~V~~ 84 (154) ||.+++|.+++.+.+. .+.++....+.+-++..+++.+....+.... .|||..-+++.++- ...+ -++.|+= T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~i~W 80 (133) T protein:vir:78 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEW 80 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceeEEEEe Confidence 9999999998877765 4666655555555555555666656666655 59999999886531 1111 2233332 Q ss_pred CCCcccccccCccccccCCCCccc-----ccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_011308. 85 SSDYAIYYEFGTGEKSEKGGGRAG-----GWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGG 152 (154) Q Consensus 85 ~~~YA~yVE~GTg~~~~~~~~~~~-----~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~ 152 (154) .. |+.|.. -|- +..+|+|+.++||- -+..|++..++.+.+.+++.|+. 
T Consensus 81 ~g----------------p~~R~~iVHLNE~G-ytr~Gk~i~PrG~G---~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:78 81 VG----------------PMNRKNIIHLNEHG-YTRDGKKYTPRGFG---VIAKTLAASERKYREIIKKELAR 133 (133) T ss_pred ec----------------CCCceeEEEeeccc-eecCCCeEccchhh---HHHHHHHhhhHHHHHHHHHHhcC Confidence 11 111111 122 35678888888885 58999999999999999999999 No 121 >protein:vir:9363 Length: 133 # NCBI annotation: SLT orf 123-like protein # Family: family:all:589 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803341;genbank:gi:29028652;genbank:GeneID:1258087 Probab=97.81 E-value=6e-07 Score=54.70 Aligned_cols=120 Identities=16% Similarity=0.096 Sum_probs=79.5 Q ss_pred cchhhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC--ccccccccceeEEe--ecCc---eEEEEec Q lcl|NC_011308. 13 MADDIKFEMDMSKIKD-MFDDTAEKALKQIGEHMKTEIAEGGHGDSSN--NVTGEYANKTDFEV--DKRK---QEVKIGN 84 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~-~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aP--vdTG~Lr~SI~~~~--~~~~---~~~~V~~ 84 (154) ||.+++|.+++.+.+. .+.++....+.+-++..+++.+....+.... .|||..-+++.++- ...+ -++.|+= T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~i~W 80 (133) T protein:vir:93 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEW 80 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceeEEEEe Confidence 9999999998877765 4666655555555555555666656666655 59999999886531 1111 2233332 Q ss_pred CCCcccccccCccccccCCCCccc-----ccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_011308. 85 SSDYAIYYEFGTGEKSEKGGGRAG-----GWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGG 152 (154) Q Consensus 85 ~~~YA~yVE~GTg~~~~~~~~~~~-----~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~ 152 (154) .. |+.|.. -|- +..+|+|+.++||- -+..|++..++.+.+.+++.|+. 
T Consensus 81 ~g----------------p~~R~~iVHLNE~G-ytr~Gk~i~PrG~G---~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:93 81 VG----------------PMNRKNIIHLNEHG-YTRDGKKYTPRGFG---VIAKTLAASERKYREIIKKELAR 133 (133) T ss_pred ec----------------CCCceeEEEeeccc-eecCCCeEccchhh---HHHHHHHhhhHHHHHHHHHHhcC Confidence 11 111111 122 35678888888885 58999999999999999999999 No 122 >protein:vir:96973 Length: 133 # NCBI annotation: ORF034 # Family: family:all:589 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239864;genbank:gi:66395542;genbank:GeneID:5133006 Probab=97.81 E-value=6e-07 Score=54.70 Aligned_cols=120 Identities=16% Similarity=0.096 Sum_probs=79.5 Q ss_pred cchhhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC--ccccccccceeEEe--ecCc---eEEEEec Q lcl|NC_011308. 13 MADDIKFEMDMSKIKD-MFDDTAEKALKQIGEHMKTEIAEGGHGDSSN--NVTGEYANKTDFEV--DKRK---QEVKIGN 84 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~-~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aP--vdTG~Lr~SI~~~~--~~~~---~~~~V~~ 84 (154) ||.+++|.+++.+.+. .+.++....+.+-++..+++.+....+.... .|||..-+++.++- ...+ -++.|+= T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~i~W 80 (133) T protein:vir:96 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEW 80 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceeEEEEe Confidence 9999999998877765 4666655555555555555666656666655 59999999886531 1111 2233332 Q ss_pred CCCcccccccCccccccCCCCccc-----ccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_011308. 85 SSDYAIYYEFGTGEKSEKGGGRAG-----GWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGG 152 (154) Q Consensus 85 ~~~YA~yVE~GTg~~~~~~~~~~~-----~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~ 152 (154) .. |+.|.. -|- +..+|+|+.++||- -+..|++..++.+.+.+++.|+. 
T Consensus 81 ~g----------------p~~R~~iVHLNE~G-ytr~Gk~i~PrG~G---~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:96 81 VG----------------PMNRKNIIHLNEHG-YTRDGKKYTPRGFG---VIAKTLAASERKYREIIKKELAR 133 (133) T ss_pred ec----------------CCCceeEEEeeccc-eecCCCeEccchhh---HHHHHHHhhhHHHHHHHHHHhcC Confidence 11 111111 122 35678888888885 58999999999999999999999 No 123 >protein:vir:4200 Length: 133 # NCBI annotation: unknown # Family: family:all:11764 # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071825;genbank:gi:11863108;genbank:GeneID:1257610 Probab=97.72 E-value=1.7e-07 Score=57.67 Aligned_cols=129 Identities=12% Similarity=0.142 Sum_probs=75.4 Q ss_pred chhhhhH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCceEEEEecCCCcccc Q lcl|NC_011308. 14 ADDIKFE--MDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRKQEVKIGNSSDYAIY 91 (154) Q Consensus 14 a~~v~~~--~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~y 91 (154) -+++.+. +.|+..--.+...+++-+.....+. +..+...+|+.||+||.|-.+++.+ .+|...+.+.|.+| T Consensus 1 mi~i~idkp~almek~~ev~~~ie~t~~~~~~~l-----~~i~~ntapiktg~lr~sh~~sieg--stgelsn~~~yl~~ 73 (133) T protein:vir:42 1 MIEIRIDKPDALMEKPHEVQGKIEETLEKILNQL-----QGIAENTAPVKTGNLRDSHIISIEG--STGELSNLAYYLPF 73 (133) T ss_pred CeeeecCCchhhhcchhhhhhHHHHHHHHHHHHH-----HHHhhhccccccccceeeeeEEeec--CccchhhhhHHhhH Confidence 0333322 2222222233444444444433322 3455667899999999998888764 47888999999999 Q ss_pred cccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHH--HHHHHHHHHHHHHhhc Q lcl|NC_011308. 92 YEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFR--DEKSKVKDYVIKVFGG 152 (154) Q Consensus 92 VE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~--~~~~~i~~~i~~~l~~ 152 (154) |=+|-| ++-|..+++.|+-.-+.-.- ..+-.||.-||.-++. 
+.+.-+++-+..-|++ T Consensus 74 vl~grg--wvfpv~~kal~wpelphpva-yarpappndyfsa~vay~~~~give~s~iewlre 133 (133) T protein:vir:42 74 VLHGRG--WVFPVRRKALWWPELPHPVA-YARPAPPNDYFSAVVAYSAPEGVVEETLIEWLRE 133 (133) T ss_pred hhhccc--ceeeccccccccCCCCCccc-ccCCCCCchhhhhhhhhhcccchhHHHHHHHHhC Confidence 999976 44455555555422111111 1233455557765544 4455577888888888 No 124 >protein:vir:6216 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:10886 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852596;genbank:gi:31415856;genbank:GeneID:1489214 Probab=97.71 E-value=7.3e-07 Score=54.22 Aligned_cols=120 Identities=14% Similarity=0.104 Sum_probs=85.0 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc----cccccccceeEEeecCceEEEEecCCCc Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNN----VTGEYANKTDFEVDKRKQEVKIGNSSDY 88 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPv----dTG~Lr~SI~~~~~~~~~~~~V~~~~~Y 88 (154) |+.+=-|..++..-+..+.+.- +.|..-.++.+++......+...|+ ..|.||+++++.+.++.+.++....+=| T Consensus 1 m~sNNNGFae~~~~~~tl~kVd-~kvs~e~L~eAA~~f~~KL~P~Ip~Sl~kkk~HlrD~lkVvvk~d~V~V~Fed~a~y 79 (125) T protein:vir:62 1 MASNNNGFAEALEDINTLLRVN-KKVSLDALDEAAKYFASKLKPKINVSNKNKRTHLRDSLKVVVKDDRVSVEFKDEAWY 79 (125) T ss_pred CCCCchhHHHHHHHhhhhhhhh-hhhhHHHHHHHHHHHHHhhccccChhhhhhhhhcceeeeEEeeCCeEEEEEcchhhh Confidence 9876556655555555554332 4455555555555554455555554 5689999999999999988888888899 Q ss_pred ccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHh-hcc Q lcl|NC_011308. 89 AIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVF-GGL 153 (154) Q Consensus 89 A~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l-~~l 153 (154) =.|+|.||.+.. 
++| ..+||.|....|++++.+|+++|.+-| ..| T Consensus 80 W~f~EnGt~~~~--~~g------------------~vkaqhf~~~Tf~~nk~kI~~iM~kki~d~m 125 (125) T protein:vir:62 80 WYLVEHGHKKAK--GKG------------------RVKGKHFVQNTFDAEGDKIADIMAQKIINRM 125 (125) T ss_pred hhhhhccccccc--ccc------------------ccchhhhhhccHHhhHHHHHHHHHHHHHhhC Confidence 999999985321 111 258899999999999999999986644 444 No 125 >protein:vir:78335 Length: 133 # NCBI annotation: gp9 # Family: family:all:589 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468648;genbank:gi:157325225;genbank:GeneID:5601681 Probab=97.70 E-value=7e-07 Score=54.32 Aligned_cols=122 Identities=16% Similarity=0.138 Sum_probs=77.5 Q ss_pred cchhhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhc--CCccccccccceeEEe---ecCceEEEEecCC Q lcl|NC_011308. 13 MADDIKFEMDMSKIKD-MFDDTAEKALKQIGEHMKTEIAEGGHGDS--SNNVTGEYANKTDFEV---DKRKQEVKIGNSS 86 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~-~l~~~~~~~v~~a~~~~~~~i~~~~ak~~--aPvdTG~Lr~SI~~~~---~~~~~~~~V~~~~ 86 (154) ||.+|+|.+++.+.+. .+.++....+.+-++..+++.+....+.. +..|||..-+++.++- ..+.-++.|+=.. T Consensus 1 msvevkGv~eilk~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~G~r~V~i~W~g 80 (133) T protein:vir:78 1 MSVEVTGVEELERQLVSLFGRENLPQLVDPALIAGATLVAKTLKSEFVQFKDTGASIDEINIEKPSYDKGVRSIKIDWKG 80 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHHhhcchhcccceeeeEEecCeeeeCCceEEEEEEec Confidence 9999999998877765 46665555555555555555555555553 4569999999886531 1222233333211 Q ss_pred CcccccccCccccccCCCCccc-----ccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_011308. 87 DYAIYYEFGTGEKSEKGGGRAG-----GWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGGLD 154 (154) Q Consensus 87 ~YA~yVE~GTg~~~~~~~~~~~-----~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~l~ 154 (154) |+.|.. 
-|- +..+|+|+.++||- -+..|++..++.+.+.+++.|+++= T Consensus 81 ----------------p~~R~~iVHLNE~G-Ytr~Gk~i~PrG~G---~i~~a~~~se~~y~~~vk~el~k~l 133 (133) T protein:vir:78 81 ----------------PKDRYKIIHLNEYG-YTRNGKKITPAGTG---SVARSLRISERAYRAIVQKKIGDKL 133 (133) T ss_pred ----------------CCCceeEEEeeccc-eecCCCeEccchhh---HHHHHHHhhhHHHHHHHHHHHHhhC Confidence 111111 122 35688888888885 4788888888888888887777766 No 126 >protein:vir:98557 Length: 149 # NCBI annotation: gp14 # Family: family:all:370 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958069;genbank:gi:41057366;genbank:GeneID:2744228 Probab=97.67 E-value=8.2e-07 Score=53.95 Aligned_cols=120 Identities=10% Similarity=0.096 Sum_probs=71.4 Q ss_pred cchhhhhHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHhcCC------------------------ccccc Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDT----AEKALKQIGEHMKTEIAEGGHGDSSN------------------------NVTGE 64 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~----~~~~v~~a~~~~~~~i~~~~ak~~aP------------------------vdTG~ 64 (154) |++--++...|..+++.|..+ +...|...+...+.+ +......| .++|+ T Consensus 1 m~d~~~l~~~L~~ll~~L~~~~~~~ll~~Ig~~l~~~t~~---rf~~q~~PdG~~W~p~~~~~~~~k~~~~~~~l~~~g~ 77 (149) T protein:vir:98 1 MSELTALQERLTGLIASLSPAARRQMAADIAKKLRASQQQ---RIRRQQAPDGTPYAARKRQSVRSKKGRIRREMFARLR 77 (149) T ss_pred CchHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHH---HHHhhcCCCCCCCcccchHHHHhccCCCCcccchhhh Confidence 875334666777777766422 222222222222222 21112122 25588 Q ss_pred cccceeEEeecCceEE-EEecCCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHH Q lcl|NC_011308. 65 YANKTDFEVDKRKQEV-KIGNSSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVK 143 (154) Q Consensus 65 Lr~SI~~~~~~~~~~~-~V~~~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~ 143 (154) |.+||.+....++.++ .+|++..||...+||..... ++++ +| ..+|++|||-=+ ++.+..|. 
T Consensus 78 l~~sl~~~~~~~~~~V~~~Gs~~~yAa~HQfG~~~r~-~~~~---~~------------~~iPaRp~LG~s-~~d~~~i~ 140 (149) T protein:vir:98 78 TNRFMKAKGSDSAAVVEFTGRVQRMARVHQYGLKDRP-NRHS---RD------------VQYAARPLLGFT-RDDEQMIE 140 (149) T ss_pred hhhhhhheecCCeeEEEecCcchHHhhHhhccccccc-cCCC---cc------------eeccccccCCCC-HHHHHHHH Confidence 9999988887765432 24899999999999953211 1111 11 247999999544 35578889 Q ss_pred HHHHHHhhc Q lcl|NC_011308. 144 DYVIKVFGG 152 (154) Q Consensus 144 ~~i~~~l~~ 152 (154) +.|.+.|.+ T Consensus 141 ~~i~~~l~~ 149 (149) T protein:vir:98 141 DIIIRHLGK 149 (149) T ss_pred HHHHHHhhC Confidence 999999999 No 127 >protein:vir:8106 Length: 150 # NCBI annotation: gp10 # Family: family:all:3937 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817687;genbank:gi:29566118;genbank:GeneID:1259312 Probab=97.66 E-value=9e-08 Score=59.20 Aligned_cols=132 Identities=20% Similarity=0.232 Sum_probs=65.5 Q ss_pred cchhh-hhH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCceEEEEecCCCc Q lcl|NC_011308. 13 MADDI-KFE---MDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRKQEVKIGNSSDY 88 (154) Q Consensus 13 Ma~~v-~~~---~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~Y 88 (154) |.+-+ +|- +++++.++.-. .+...|..-+.+ .+...|+.++||++|+.++||.+.-....-.+.++..+.| T Consensus 1 mgNP~~KFGvS~~e~~K~irns~-EV~~GiNdFMe~----~A~~~aK~~SPV~~GeY~~S~~V~~ka~NGRG~~G~~~~~ 75 (150) T protein:vir:81 1 MGNPFEKFGVSDSELAKHIRNSA-EVDAGINDFMEN----EAIPYAKSISPVDDGEYAASWAVMKKAKNGRGVFGPKAWY 75 (150) T ss_pred CCCchhhhcCCHHHHHHhhccch-hhhhhHHHHHHh----hhhhhhhccCCcccchhHHHHHHHhhcccCccccCccchh Confidence 77654 232 34444433222 334444444332 3334689999999999999997644333346789999999 Q ss_pred ccccccCccccccCCCCccccc------ceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhc-cC Q lcl|NC_011308. 89 AIYYEFGTGEKSEKGGGRAGGW------SYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGG-LD 154 (154) Q Consensus 89 A~yVE~GTg~~~~~~~~~~~~~------~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~-l~ 154 (154) |.|||||||..-..+.+++... 
-..-.+|+|... -|..|--.. -.-..+..-+.-.|++ +. T Consensus 76 AH~VEFGtgadkkqgrgkkgkrgkdgkrtveiddgefrrv--gpdtptkaq---giaqkvashfggslkggis 143 (150) T protein:vir:81 76 AHFVEFGTGADKKQGRGKKGKRGKDGKRTVEIDDGEFRRV--GPDTPTKAQ---GIAQKVASHFGGSLKGGIS 143 (150) T ss_pred hhhhhhccccccccccccccccCcccceeeeecCccceec--CCCCchhhh---hHHHHHHHhcccccccccc Confidence 9999999986433333322211 111123444321 122221111 1111222222222221 11 No 128 >protein:vir:4790 Length: 114 # NCBI annotation: putative minor capsid protein 3 # Family: family:all:899 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150170;swissprot:trembl:q94m41;genbank:gi:15088781;uniprot:Q94M41;genbank:GeneID:955992 Probab=97.60 E-value=1.3e-06 Score=52.88 Aligned_cols=114 Identities=13% Similarity=0.084 Sum_probs=67.3 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCceEEEEecCCCccccc Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRKQEVKIGNSSDYAIYY 92 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~yV 92 (154) |+.+|++. +..+.+.|.+. .++++-..++.++.+ .....+|.+||+|++|..+... + +.|..+++||.+. T Consensus 1 M~~kVkv~--l~~~~~~l~~~---~l~r~Q~~~~~ev~~-~~~~YVP~~~G~L~~S~~~~~~-~---~~I~y~tPYAr~q 70 (114) T protein:vir:47 1 MNIAIKVD--LQKAKQKLSNE---SMTRGKIAVASKILL-DNEQYIPLRGGELRASGRIVGQ-G---DAVVYGTVYARAQ 70 (114) T ss_pred CceeEEee--hhHHHHHHHHH---HHHHHHHHHHHHHHH-hhccCCcCccCccccceeeeeC-C---cEEEecCchhhHh Confidence 99877764 34433444322 222332233344443 3467899999999999765432 2 4688899999999 Q ss_pred ccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_011308. 93 EFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGG 152 (154) Q Consensus 93 E~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~ 152 (154) -||-- +.++.. 
..+......-++..|..+..+++.+.+.+.++= T Consensus 71 yYg~~-----~~~~~~-----------~~~~p~~g~~W~eraka~~~~~~~~~~~k~~g~ 114 (114) T protein:vir:47 71 FYGSN-----GIVTFR-----------RYTTPGTGKRWDQVATSKHAEEWARAFVKGMGL 114 (114) T ss_pred hhccc-----CCCCCC-----------ccCCCCCcchhHHHHHhhhhHHHHHHHHHhhCC Confidence 88731 111100 011233444566778888888887777765544 No 129 >protein:vir:101508 Length: 120 # NCBI annotation: gp21 # Family: family:all:2713 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655400;genbank:gi:109522588;genbank:GeneID:4157580 Probab=97.51 E-value=1.9e-06 Score=52.00 Aligned_cols=112 Identities=15% Similarity=0.250 Sum_probs=77.2 Q ss_pred cch-hhhhHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC--ccccccccceeEEe---ecCceEEEEecC Q lcl|NC_011308. 13 MAD-DIKFEM-DMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSN--NVTGEYANKTDFEV---DKRKQEVKIGNS 85 (154) Q Consensus 13 Ma~-~v~~~~-~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aP--vdTG~Lr~SI~~~~---~~~~~~~~V~~~ 85 (154) ||. +|+|.. ++.+.+..+..+....+.-.+.. .+...+..||.+|| -+||.-|.+|.-.+ ..+.+++.+.-+ T Consensus 1 ~~~~~f~~~~~~l~~~i~~~~~k~~~~~~~~~d~-~a~~le~~aK~nApW~DRTg~ARq~i~~~~~~~~~~~~~Iylsh~ 79 (120) T protein:vir:10 1 MAKIEFKFKDIELRRGVEDMEAKVDRAMKATSNY-HAVEGTAHMKEHAPWTDRTGAARAGLHAVASTPQPDRYEIVFAHT 79 (120) T ss_pred CceEEEEecHHHHHHHHhhhHHHHHHHHHHHHHH-HHHHHHHHHhcCCCCcccchhhhhhhccccccCCCceEEEEEecC Confidence 884 344432 23333444445555555544443 34445788999999 49999999996544 233467778889 Q ss_pred CCcccccccC-ccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_011308. 
86 SDYAIYYEFG-TGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGGLD 154 (154) Q Consensus 86 ~~YA~yVE~G-Tg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~l~ 154 (154) ++|.+|.|.- .|.| --+.|+++..-+++.+-+...|.+|- T Consensus 80 veYG~~LEla~~~ky-----------------------------aIl~PTi~~~~~~il~g~~~ll~~l~ 120 (120) T protein:vir:10 80 VHYGIWLEIANSGRY-----------------------------EIIMPTVHHEGKLMAQRLRGLLGRLR 120 (120) T ss_pred eeecceEEeeCCCCc-----------------------------ccccchHHHHhHHHHHHHHHHhhhcC Confidence 9999999943 1211 13789999999999999999999999 No 130 >protein:vir:99546 Length: 200 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039796;genbank:gi:126011046;genbank:GeneID:4818241 Probab=97.48 E-value=1.1e-06 Score=53.23 Aligned_cols=118 Identities=15% Similarity=0.153 Sum_probs=63.1 Q ss_pred eecCCccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCceEEEEecCC Q lcl|NC_011308. 7 IDRGGHMADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRKQEVKIGNSS 86 (154) Q Consensus 7 ~~~~~~Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~ 86 (154) .+.|=.|+++|++-..+.+++++|... .. . -|.-|-+..+- + .+......+++. T Consensus 1 ~~~~~~~~~k~~~~~~~~~~~~~l~~l-----~~---------------~--~v~vGi~~~~~---y-~~~~~~~dG~~v 54 (200) T protein:vir:99 1 MKKGFSKSNSVAAPLKHFQMLKQFDAL-----KG---------------K--TVQAGWFETDR---Y-PAKEGETIGPLV 54 (200) T ss_pred CCcCcceeeeeecchHHHHHHHHHHHh-----hC---------------C--eEEEEEcCCCC---c-CCcccccccchH Confidence 567778888888766666655544321 00 0 01122222111 0 000112344554 Q ss_pred -CcccccccCccccccCCCCccccccee-cc--------------cCceee----cCCCCCCchhHHHHHHHHHHHHHHH Q lcl|NC_011308. 87 -DYAIYYEFGTGEKSEKGGGRAGGWSYM-DK--------------NGKWHF----TRGSKASKRMRYTFRDEKSKVKDYV 146 (154) Q Consensus 87 -~YA~yVE~GTg~~~~~~~~~~~~~~~~-~~--------------~g~~~~----t~g~~a~PFl~pA~~~~~~~i~~~i 146 (154) ..|.+.|||+-... |.+. .|+++ +. 
.+.+++ +...||||||+|+++++++++.+.+ T Consensus 55 a~IA~~~EfG~~i~~--p~~~--~~~~~~~~~g~~~g~rfv~k~~~~~~~~~~~~~v~IP~RPFlr~t~~~~~~~~~~~~ 130 (200) T protein:vir:99 55 AKIARQLEFGGVINH--PGGT--KYIKDAIVDGRYVGTRFVHKSFQGEHEVTKAHQIVIPARPFMRLAWATFNKDKVKIQ 130 (200) T ss_pred HHHHhHHHcCCeecc--CCCc--cccccccccccccccccccccccceeeeeccccccCCCcchhhHHHHHHHHHHHHHH Confidence 44899999964221 1111 11111 11 122222 2357999999999999999999988 Q ss_pred HHHhhc-----cC Q lcl|NC_011308. 147 IKVFGG-----LD 154 (154) Q Consensus 147 ~~~l~~-----l~ 154 (154) ++.++. +| T Consensus 131 ~~~~~~~l~g~~~ 143 (200) T protein:vir:99 131 AQIARQLLDGTIN 143 (200) T ss_pred HHHHHHHHhCCCC Confidence 888874 44 No 131 >protein:vir:96105 Length: 193 # NCBI annotation: hypothetical protein ORF028 # Family: family:all:503 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294445;genbank:gi:149408342;genbank:GeneID:5237224 Probab=97.47 E-value=5.3e-08 Score=60.45 Aligned_cols=112 Identities=15% Similarity=0.066 Sum_probs=55.2 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCceEEEEecCCCc-ccc Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRKQEVKIGNSSDY-AIY 91 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~Y-A~y 91 (154) |+++-. .+.+.++++.|.. + .+. -|.-|-+..+.. .+...+.++++..| |.+ T Consensus 1 m~~~~~-~~~~~~~~~~l~~----------------l----~~~--~v~vGi~~~~~~----~~~~~~~~G~~va~iAai 53 (193) T protein:vir:96 1 MSLRRD-SELIAAHLQMLRA----------------M----RGR--SVSAGWYSTARY----PDKAGGSVGIQVARIARL 53 (193) T ss_pred Ceeccc-hHHHHHHHHHHHH----------------h----cCC--eEEEEEcCCCCC----CCcccccccchHHHHHhH Confidence 764411 1122222221111 0 000 122344433221 11223567888777 999 Q ss_pred cccCccccccCCCCcccccceecccC--------------ceee----cCCCCCCchhHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_011308. 
92 YEFGTGEKSEKGGGRAGGWSYMDKNG--------------KWHF----TRGSKASKRMRYTFRDEKSKVKDYVIKVFGGL 153 (154) Q Consensus 92 VE~GTg~~~~~~~~~~~~~~~~~~~g--------------~~~~----t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~l 153 (154) -|||+-.. +..++..+......| ..++ +..+||||||++++++++.++.+.+++.++.+ T Consensus 54 ~EfG~~I~---~~~~~~~~~~~~~~g~~~~~~~~k~~~~~~~~~~~~~~v~IPaRPFlr~t~~~~~~~~~~~~~~~~~~~ 130 (193) T protein:vir:96 54 NEYGGTID---HPGGTRYIRDAIVRGRFVGVRFVRNDFPGETEVTKPHRITIPARPFMRYAWNLFSADRAAIQNRIAMRL 130 (193) T ss_pred HHcCCccc---cCccceeeeeccccccccccceeccCcceeeEeecceeccCCCcchhhhhHHHHHHHHHHHHHHHHHHH Confidence 99996322 111111111111111 1111 33689999999999999999888877777654 Q ss_pred -----C Q lcl|NC_011308. 154 -----D 154 (154) Q Consensus 154 -----~ 154 (154) | T Consensus 131 ~~g~~~ 136 (193) T protein:vir:96 131 ARGQIT 136 (193) T ss_pred HhCCCC Confidence 4 No 132 >protein:vir:1581 Length: 116 # NCBI annotation: minor capsid protein # Family: family:all:899 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695163;swissprot:trembl:o03933;genbank:gi:23455806;uniprot:O03933;genbank:GeneID:955512 Probab=97.39 E-value=2.3e-06 Score=51.51 Aligned_cols=116 Identities=14% Similarity=-0.032 Sum_probs=68.3 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCceEEEEecCCCccccc Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRKQEVKIGNSSDYAIYY 92 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~yV 92 (154) |+++|+.. +..+.+.+.. +.++++-..++.++.. .....+|.+||+|..|-...+..++ +.|..+++||.+. 
T Consensus 1 M~ikVkv~--l~~~~~~~~~---~~~~r~Q~~l~~qv~~-~m~~YVP~~tg~~~ls~~~~~~~~~--~~I~y~tPYAr~q 72 (116) T protein:vir:15 1 MAFRINVD--LDGFMDQTSL---DNVKRGQYALVNQAMY-DMEQFVPKDRPEEPLRQSVHATSDG--SEITYSTPYAKAQ 72 (116) T ss_pred CCceEEee--hhHhhhhhhH---HHHHHHHHHHHHHHHH-hhhccCCcccCCcccccceeeecCC--ceEEecCchhHHH Confidence 99887754 5555555532 2223332333344433 3467899999886666554444432 5788899999998 Q ss_pred ccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhh Q lcl|NC_011308. 93 EFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFG 151 (154) Q Consensus 93 E~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~ 151 (154) =||- .+. .. ...-+.+.+.- ..++..|.......+.+.+.++++ T Consensus 73 yYg~-~~~---~~----------~~~~~t~p~ag-~~W~eraK~~h~~~w~~~~~k~~~ 116 (116) T protein:vir:15 73 FYGI-IND---KY----------PVHNYTTPGTT-KRWDLKAKSMFMSSWIDTFTKGMK 116 (116) T ss_pred hccc-ccC---CC----------CcccccCCCCC-cchhHHHHhhhHHHHHHHHHHhcC Confidence 7762 110 00 00011122332 235566888889999999999988 No 133 >protein:vir:96012 Length: 133 # NCBI annotation: ORF023 # Family: family:all:589 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239805;genbank:gi:66395471;genbank:GeneID:5132929 Probab=97.39 E-value=6.5e-06 Score=49.02 Aligned_cols=124 Identities=12% Similarity=0.117 Sum_probs=74.7 Q ss_pred cchhhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhcC--CccccccccceeEEe---ecCceEEEEecCC Q lcl|NC_011308. 13 MADDIKFEMDMSKIKD-MFDDTAEKALKQIGEHMKTEIAEGGHGDSS--NNVTGEYANKTDFEV---DKRKQEVKIGNSS 86 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~-~l~~~~~~~v~~a~~~~~~~i~~~~ak~~a--PvdTG~Lr~SI~~~~---~~~~~~~~V~~~~ 86 (154) |+ +|++.+++.+.+. .+.++..+.+.+-++..+++.+....|... -.|||..-+++.++- ..+.-++.|+=.. 
T Consensus 1 m~-evkGv~eilk~lE~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGatidev~~s~p~~~~g~rtV~i~W~g 79 (133) T protein:vir:96 1 MR-LIYDTKKLERELEKRLSKRALMRITDRALTEAGEVVLEAIRTNLKYFRDTGAEYGEVKLSKPTWENGKRTIRVYWEG 79 (133) T ss_pred Cc-cccCHHHHHHHHHHhcCHHHHHHHhhHHHHHHHHHHHHHHHHhhHHHhhccceeeeEEecCceecCCceEEEEEeec Confidence 86 7999988766654 565554555544444555555544444443 349999888875531 1122233443321 Q ss_pred C---cccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_011308. 87 D---YAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGGLD 154 (154) Q Consensus 87 ~---YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~l~ 154 (154) + |. -||. .-|.+++.+|+|+.++||- -+..|++..++.+.+.+++.|++|= T Consensus 80 p~~R~~-iVHL-------------NE~G~ytr~Gk~i~PrG~G---~I~~al~~se~~y~~~vk~el~kll 133 (133) T protein:vir:96 80 EKHRYS-IVHL-------------NEKGFYAKDGKFIRPKGMG---AIDKALRASRDKFFKVYAEEVSKLL 133 (133) T ss_pred CCCcee-eEee-------------ecccceecCCceeccchhh---HHHHHHHhhhHHHHHHHHHHHHHhC Confidence 1 11 1110 1233356688899888885 5788888888888888888887777 No 134 >protein:vir:79179 Length: 155 # NCBI annotation: gp39, phage virion morphogenesis protein # Family: family:all:370 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111070;genbank:gi:134288746;genbank:GeneID:4960698 Probab=97.38 E-value=5.3e-06 Score=49.49 Aligned_cols=118 Identities=14% Similarity=0.142 Sum_probs=67.2 Q ss_pred cchhhh-hHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHhcCC------------------cccccccc-- Q lcl|NC_011308. 13 MADDIK-FEMDMSKIKDMFDDT----AEKALKQIGEHMKTEIAEGGHGDSSN------------------NVTGEYAN-- 67 (154) Q Consensus 13 Ma~~v~-~~~~l~~~~~~l~~~----~~~~v~~a~~~~~~~i~~~~ak~~aP------------------vdTG~Lr~-- 67 (154) |+++++ +...|..+++.|... 
+...|...+...+.+ +......| -.+|.+++ T Consensus 1 m~~~~~~l~~~l~~ll~~l~~~~~~~l~r~Ig~~l~~~t~~---Rf~~q~~PDG~~W~prk~~~~~~~~~~~~g~~~~~~ 77 (155) T protein:vir:79 1 MTDDLQALERWAGGLLAKLSPAARRQLLRELGRDLRRAQQS---RVAAQRNPDGSAYEPRKVKAGGKRLREKAGRVKREA 77 (155) T ss_pred CchHHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHH---HHHhhcCCCCCCCcccchhhhhhhhhcccCcccchh Confidence 998864 445577777766432 333333333332222 22222233 13565544 Q ss_pred ---------ceeEEeecCceEEEE---ecCCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHH Q lcl|NC_011308. 68 ---------KTDFEVDKRKQEVKI---GNSSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTF 135 (154) Q Consensus 68 ---------SI~~~~~~~~~~~~V---~~~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~ 135 (154) ||.+....+. +.| |++..||....||....+ ++. +. ...+|++|||-=+- T Consensus 78 m~~~l~~a~~l~~~~~~d~--a~Vg~~Gs~~~yAaiHQfG~~~r~-~~~------------~~---~v~iPaRp~LGls~ 139 (155) T protein:vir:79 78 MFRKLRTARYLRIDVDSTG--LAIGFDERLSRIARVHQEGQKAPV-EPG------------GP---LAQYPVRVVLGFSD 139 (155) T ss_pred hhhhhhhhheeeeeecCcE--EEEEecCcchhhhhhhhcCCcccC-CCC------------Cc---ccccccccccCCCH Confidence 3555555444 556 999999999999942111 010 11 12579999995553 Q ss_pred HHHHHHHHHHHHHHhhc Q lcl|NC_011308. 136 RDEKSKVKDYVIKVFGG 152 (154) Q Consensus 136 ~~~~~~i~~~i~~~l~~ 152 (154) +.+.+|..+|...|.. T Consensus 140 -~d~~~I~~~i~~~l~r 155 (155) T protein:vir:79 140 -ADRELVRDRLLRELTR 155 (155) T ss_pred -HHHHHHHHHHHHHhhC Confidence 4567788888888888 No 135 >protein:vir:94069 Length: 168 # NCBI annotation: putative RNA polymerase # Family: family:all:503 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453622;genbank:gi:84662658;genbank:GeneID:5142579 Probab=97.37 E-value=1.8e-07 Score=57.52 Aligned_cols=91 Identities=12% Similarity=0.164 Sum_probs=44.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEe-----ecCce---EEEEe-cCCCcccccccCcc Q lcl|NC_011308. 
27 KDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEV-----DKRKQ---EVKIG-NSSDYAIYYEFGTG 97 (154) Q Consensus 27 ~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~-----~~~~~---~~~V~-~~~~YA~yVE~GTg 97 (154) +.. +.+-.++...++...... . -+.-|-|...- ... ...+. ...-+ +.+.+|.|.||||. T Consensus 1 ~~~--------~~~~g~~~~~~~~~~l~~-~-~v~vG~l~~a~-yp~G~~~~~~~~~~~~~~~~g~~va~Ia~~~E~G~~ 69 (168) T protein:vir:94 1 MTT--------IARKGVKMPPHLEAQFQS-G-EVKAGVLSGST-YPQMTYTDQRTGKQIEDARGGMPVAVIAQALEYGHG 69 (168) T ss_pred Ccc--------ccchhhhhhHHHHHhhhc-c-ceeeeccccCc-ccccccchhhcccccccccccccHHHHHHHHhcCCC Confidence 111 111111111111111100 0 01223332110 000 00000 00001 33578889999972 Q ss_pred ccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhc-cC Q lcl|NC_011308. 98 EKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGG-LD 154 (154) Q Consensus 98 ~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~-l~ 154 (154) ..||||||+|++++++.++.+.+.+.|++ +| T Consensus 70 --------------------------~IP~RPFlr~t~~~~~~~~~~~~~~~~~~~~~ 101 (168) T protein:vir:94 70 --------------------------QNHPRPFMQQTYAAQYRAWSRDLTLTLKAGAA 101 (168) T ss_pred --------------------------CCCCchhhHHHHHHHHHHHHHHHHHHHhcCCC Confidence 57999999999999999999999999884 44 No 136 >protein:vir:4162 Length: 133 # NCBI annotation: unknown # Family: family:all:11764 # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046971;genbank:gi:9630541;genbank:GeneID:1261715 Probab=97.34 E-value=1e-06 Score=53.38 Aligned_cols=129 Identities=14% Similarity=0.089 Sum_probs=68.9 Q ss_pred chhhhhH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCceEEEEecCCCcccc Q lcl|NC_011308. 14 ADDIKFE--MDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRKQEVKIGNSSDYAIY 91 (154) Q Consensus 14 a~~v~~~--~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~y 91 (154) -+++.+. +.|+..--.+...+++-+.....+. 
+..+...+|+.||+||.|-.+++.+ .+|...+.+.|-+| T Consensus 1 mi~i~idkp~almek~~ev~~~ie~t~~~~~~~l-----~~i~~ntapiktg~lr~sh~~sieg--stgelsn~~~yl~~ 73 (133) T protein:vir:41 1 MIRINIDKPEALMEKASEVEDRVEQTVTLLMIEL-----EEILMNTAPIKTGELRISHTWSVEG--STGELTNTVPYLQW 73 (133) T ss_pred CeeeecCCchhhhcchhhhhhHHHHHHHHHHHHH-----HHHhhhccccccccceeeeeEEeec--CccchhhhhHHhhH Confidence 0333322 2222222233444444444433322 3445667899999999998888764 47888999999999 Q ss_pred cccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHH--HHHHHHHHHHHhhc Q lcl|NC_011308. 92 YEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDE--KSKVKDYVIKVFGG 152 (154) Q Consensus 92 VE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~--~~~i~~~i~~~l~~ 152 (154) |=+|-| ++-|-.+++.|+-.-+.-.- ..+-.||.-||.-++... +.-+++-+..-|-. T Consensus 74 vl~grg--wvfpv~~kal~wpelphpva-yarpappndyfsa~vay~~~~give~s~iewlis 133 (133) T protein:vir:41 74 VLFGRG--WVFPVEKKALYWPELPHPVA-YARPAPPNDYFSAAVAYIDAKGIVEDSFIEWLIS 133 (133) T ss_pred hhhccc--ceeeecccccccCCCCCccc-ccCCCCCchhhhhhhhhhcccchhHHHHHHHhcC Confidence 999976 44455555555421111111 123345555776655443 33343333332222 No 137 >protein:vir:2026 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046769;genbank:gi:9630340;genbank:GeneID:1261511 Probab=97.31 E-value=5.7e-06 Score=49.35 Aligned_cols=122 Identities=11% Similarity=0.117 Sum_probs=71.0 Q ss_pred cchhh-hhHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHH---------------H------HhcCCccccccc Q lcl|NC_011308. 13 MADDI-KFEMDMSKIKDMFDDT----AEKALKQIGEHMKTEIAEGG---------------H------GDSSNNVTGEYA 66 (154) Q Consensus 13 Ma~~v-~~~~~l~~~~~~l~~~----~~~~v~~a~~~~~~~i~~~~---------------a------k~~aPvdTG~Lr 66 (154) |. ++ ++...|..+++.|... +...|...+...+.+-.+.. + ....-.++|.|. 
T Consensus 1 ~~-~~~~l~~~L~~ll~~l~~~~~~~l~~~Ig~~l~~~~~~rf~~q~~PdG~~W~p~k~~~~~~k~g~~~~~l~~~~~l~ 79 (150) T protein:vir:20 1 MN-EFKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSVRKKTGRVKRKMFAKLITS 79 (150) T ss_pred Cc-hHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHHHhccCCCccccchhhhh Confidence 75 44 3445677777666422 23333333332222222211 0 011235778899 Q ss_pred cceeEEeecCceEEE--EecCCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHH Q lcl|NC_011308. 67 NKTDFEVDKRKQEVK--IGNSSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKD 144 (154) Q Consensus 67 ~SI~~~~~~~~~~~~--V~~~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~ 144 (154) +||.++...+...+. ++++..||...+||-..- +.++ .+| ..+|++|||-=+ ++.+..|.+ T Consensus 80 ~sl~~~~~~~~~~vg~~~Gs~~~yAa~HQfG~~~~---~~~~-~~~------------~~iPaRp~LG~s-~~d~~~i~~ 142 (150) T protein:vir:20 80 RFLHIRASPEQASMEFYGGKSPKIASVHQFGLSEE---NRKD-GKK------------IDYPARPLLGFT-GEDVQMIEE 142 (150) T ss_pred hhhheeecCcEEEEEeeCCcchhhhhhhhcccccc---cccC-CCc------------eeccccccCCCC-HHHHHHHHH Confidence 999988877665432 489999999999994211 1110 111 247999999655 345677888 Q ss_pred HHHHHhhc Q lcl|NC_011308. 145 YVIKVFGG 152 (154) Q Consensus 145 ~i~~~l~~ 152 (154) .|.+.|.. T Consensus 143 ~i~~~l~k 150 (150) T protein:vir:20 143 IILAHLER 150 (150) T ss_pred HHHHHHhC Confidence 88888888 No 138 >protein:vir:6071 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878212;genbank:gi:33438911;genbank:GeneID:1457746 Probab=97.26 E-value=9.7e-06 Score=48.07 Aligned_cols=122 Identities=11% Similarity=0.127 Sum_probs=69.6 Q ss_pred cchhh-hhHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHH---------------------HhcCCccccccc Q lcl|NC_011308. 13 MADDI-KFEMDMSKIKDMFDDT----AEKALKQIGEHMKTEIAEGGH---------------------GDSSNNVTGEYA 66 (154) Q Consensus 13 Ma~~v-~~~~~l~~~~~~l~~~----~~~~v~~a~~~~~~~i~~~~a---------------------k~~aPvdTG~Lr 66 (154) |. ++ ++...|..+++.|+.. 
+...|...+...+.+-.+... +...-..+|.|. T Consensus 1 ~~-~~~~l~~~L~~~l~~L~~~~~~~l~r~Ig~~l~~~~~~Rf~~q~~PdG~~W~p~~~~~~~~k~~~~~~~l~~~~~l~ 79 (150) T protein:vir:60 1 MN-EFKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSARKKTGRVKRKMFAKLITS 79 (150) T ss_pred Cc-hHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccChHHHHHhhcCCCccchhhhhhc Confidence 75 44 3555677766666422 222222222222222211110 001123577889 Q ss_pred cceeEEeecCceEE--EEecCCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHH Q lcl|NC_011308. 67 NKTDFEVDKRKQEV--KIGNSSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKD 144 (154) Q Consensus 67 ~SI~~~~~~~~~~~--~V~~~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~ 144 (154) +||++++..+..++ .++++..||...+||-..- +.++ .+ ...+|++|||-=+ ++.+..|.+ T Consensus 80 ~sl~~~~~~~~a~vg~~~Gt~~~yAaiHQfG~~~~---~~~~-~~------------~~~iPaRp~LG~s-~~d~~~i~~ 142 (150) T protein:vir:60 80 RFLHIRASPEQASMEFYGGKSPKIASVHQFGLSEE---NRKD-GK------------KIDYPARPLLGFT-GEDVQMIEE 142 (150) T ss_pred ceeeeeeeCcEEEEEeeCCCchhhhhhhhcccccc---ccCC-CC------------ceecCCcccCCCC-HHHHHHHHH Confidence 99998887776554 2499999999999994221 1111 01 1247999999655 344677888 Q ss_pred HHHHHhhc Q lcl|NC_011308. 145 YVIKVFGG 152 (154) Q Consensus 145 ~i~~~l~~ 152 (154) .|.+.|.. T Consensus 143 ~i~~~l~r 150 (150) T protein:vir:60 143 IILAHLDR 150 (150) T ss_pred HHHHHHhC Confidence 88888888 No 139 >protein:vir:1164 Length: 156 # NCBI annotation: predicted tail completion # Family: family:all:370 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490613;genbank:gi:17313233;genbank:GeneID:927308 Probab=97.23 E-value=9.1e-06 Score=48.21 Aligned_cols=120 Identities=10% Similarity=0.150 Sum_probs=68.7 Q ss_pred cchhhh-hHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHhcCCc--------------ccc---------- Q lcl|NC_011308. 
13 MADDIK-FEMDMSKIKDMFDDT----AEKALKQIGEHMKTEIAEGGHGDSSNN--------------VTG---------- 63 (154) Q Consensus 13 Ma~~v~-~~~~l~~~~~~l~~~----~~~~v~~a~~~~~~~i~~~~ak~~aPv--------------dTG---------- 63 (154) |+++++ +...|..++..|... +...|...+...+.+ +...+..|- ..| T Consensus 1 m~~~~~~l~~~L~~ll~~L~~~~~~~l~r~Ig~~l~~~t~~---Rf~~q~~PdG~~W~p~~~~~~~~~~~~~~~~~~m~~ 77 (156) T protein:vir:11 1 MADSLEALEDWAGPILRALEPGPRAALARSLARDLRRSQQK---RVMAQRNPDGSAYEPRKKRELRGKQGRIRRKIKMFQ 77 (156) T ss_pred CchhHHHHHHHHHHHHHhcCCcchHHHHHHHHHHHHHHHHH---HHHhhcCCCCCCCcccchHHHhhhccccccchhhhh Confidence 998864 556777777776432 233333333322222 222222331 011 Q ss_pred ccccc--eeEEeecCceEEEE---ecCCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHH Q lcl|NC_011308. 64 EYANK--TDFEVDKRKQEVKI---GNSSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDE 138 (154) Q Consensus 64 ~Lr~S--I~~~~~~~~~~~~V---~~~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~ 138 (154) .|+.| |..+...+ .+.| +++..||...+||..... .+.+ .. ..+|++|||-=+ ++. T Consensus 78 ~l~~~~~l~~~~~~~--~a~vg~~Gs~~~yA~iHQfG~~~~~-~~~~------------~~---v~iPaRp~LG~s-~~d 138 (156) T protein:vir:11 78 KLRTVRYLRAKGDAQ--AITVSFAGRIARIARVHQYGLRDRA-EPGA------------PE---VSYAQRLLLGFD-SSD 138 (156) T ss_pred hhhhhheeeeeecCc--EEEEEecCCchhhhhhhcccccccc-cCCC------------Cc---ccccccccCCCC-HHH Confidence 13333 55554444 3455 899999999999953211 1111 11 247999999555 356 Q ss_pred HHHHHHHHHHHhhccC Q lcl|NC_011308. 139 KSKVKDYVIKVFGGLD 154 (154) Q Consensus 139 ~~~i~~~i~~~l~~l~ 154 (154) +.+|.+.|.+.|.++. 
T Consensus 139 ~~~i~~~i~~~l~~~~ 154 (156) T protein:vir:11 139 METIQNGILAHIDANS 154 (156) T ss_pred HHHHHHHHHHHHhhcC Confidence 7789999999999999 No 140 >protein:vir:79115 Length: 148 # NCBI annotation: tail completion protein gpS # Family: family:all:370 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165266;genbank:gi:145708091;genbank:GeneID:5247126 Probab=97.21 E-value=8.6e-06 Score=48.35 Aligned_cols=120 Identities=13% Similarity=0.138 Sum_probs=69.6 Q ss_pred cchhhhhHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHhcCC-----------------------cccccc Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDT----AEKALKQIGEHMKTEIAEGGHGDSSN-----------------------NVTGEY 65 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~----~~~~v~~a~~~~~~~i~~~~ak~~aP-----------------------vdTG~L 65 (154) |++--++...|..+++.+... +.+.|...+...+.+-. .....| .+++.| T Consensus 1 m~~~~~l~~~L~~ll~~l~~~~~~~l~r~Ig~~l~~st~~Rf---~~q~~PDG~~W~p~s~~~~~~~g~~~~~~~~~l~~ 77 (148) T protein:vir:79 1 MSESRELEAWLAGMLTKLDAPARRMLARAVAAELRRRQAARI---AEQRNPDGSPYVPRKPQLRHRAGRIRRAMFMRLRL 77 (148) T ss_pred CccHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHH---HhhcCCCCCcCcccchHHHhhcccccccccchhhh Confidence 875345566677777666432 23333333332222222 222223 245667 Q ss_pred ccceeEEeecCceEE-EEecCCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHH Q lcl|NC_011308. 66 ANKTDFEVDKRKQEV-KIGNSSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKD 144 (154) Q Consensus 66 r~SI~~~~~~~~~~~-~V~~~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~ 144 (154) ..|+..+...++..+ -+|++..||...+||-.. ++.++ +. +..+|++|||-=+ ++.+..|.. 
T Consensus 78 ~~~l~~~~~~~~~~v~~~Gt~~~yAaiHQfG~~~---r~~~~----------~~---~v~iPaRp~LG~s-~~d~~~i~~ 140 (148) T protein:vir:79 78 ARYMKTQADANTAVVTFAGNAQRIATVHQFGLRD---RVNKA----------GL---TAQYPARELLGMD-GVDMEHITN 140 (148) T ss_pred hhheeeeeeCCeeeEEeeccchhhhhhhhcCccc---cccCC----------CC---ccccCcccccCCC-HHHHHHHHH Confidence 778877776654332 249999999999999311 11110 11 1247999999655 345677888 Q ss_pred HHHHHhhc Q lcl|NC_011308. 145 YVIKVFGG 152 (154) Q Consensus 145 ~i~~~l~~ 152 (154) .|.+.|.+ T Consensus 141 ~i~~~l~~ 148 (148) T protein:vir:79 141 LLLLHLGA 148 (148) T ss_pred HHHHHhcC Confidence 88888888 No 141 >protein:vir:5703 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839862;genbank:gi:30065717;genbank:GeneID:1260611 Probab=97.19 E-value=1.3e-05 Score=47.44 Aligned_cols=119 Identities=11% Similarity=0.154 Sum_probs=70.0 Q ss_pred cchhh-hhHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHhcCC------------------------cccc Q lcl|NC_011308. 13 MADDI-KFEMDMSKIKDMFDDT----AEKALKQIGEHMKTEIAEGGHGDSSN------------------------NVTG 63 (154) Q Consensus 13 Ma~~v-~~~~~l~~~~~~l~~~----~~~~v~~a~~~~~~~i~~~~ak~~aP------------------------vdTG 63 (154) |. ++ +....|..+++.|+.. +...|...+...+.+-.+ .+..| ..+| T Consensus 1 m~-~~~~l~~~L~~~l~~L~~~~~~~l~~~Ig~~l~~~~~~rf~---~q~~PdG~~W~p~k~~~~~~k~~~~~~~l~~~~ 76 (150) T protein:vir:57 1 MN-EFKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVM---AQKAPDGTPYAPRQQQSARKKTGRVKRKMFAKL 76 (150) T ss_pred Cc-hHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHH---hhcCCCCCCCcccChHHHHHhccCCCcccchhh Confidence 75 44 3555666666666422 223333222222222221 12122 3667 Q ss_pred ccccceeEEeecCceEE--EEecCCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHH Q lcl|NC_011308. 64 EYANKTDFEVDKRKQEV--KIGNSSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSK 141 (154) Q Consensus 64 ~Lr~SI~~~~~~~~~~~--~V~~~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~ 141 (154) .|.+||.++.+.+..++ .+|++..||...+||-.... 
++++ + + .-+|++|||-=+ ++.+.. T Consensus 77 ~l~~sl~~~~~~~~a~vg~~~G~~~~yAaiHQfG~~~r~-~~~~---~---------~---~~iPaRp~LG~s-~~d~~~ 139 (150) T protein:vir:57 77 ITSRFLHIRASPEQASMEFYGGKSPKIASVHQFGLSEET-RKDG---K---------K---IDYPARPLLGFT-GEDVQM 139 (150) T ss_pred hhccceeeeeeCcEEEEEeecCCchhhhhhhhccccccc-cCCC---c---------e---eecCCcccCCCC-HHHHHH Confidence 88889988887776554 34999999999999942211 1111 1 1 247999999655 345677 Q ss_pred HHHHHHHHhhc Q lcl|NC_011308. 142 VKDYVIKVFGG 152 (154) Q Consensus 142 i~~~i~~~l~~ 152 (154) |.+.|.+.|.. T Consensus 140 i~~~i~~~l~r 150 (150) T protein:vir:57 140 IEEIILAHLDR 150 (150) T ss_pred HHHHHHHHHhC Confidence 88888888888 No 142 >protein:vir:100312 Length: 152 # NCBI annotation: tail synthesis protein S # Family: family:all:370 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655481;genbank:gi:109289949;genbank:GeneID:4157355 Probab=97.08 E-value=1.8e-05 Score=46.62 Aligned_cols=120 Identities=12% Similarity=0.189 Sum_probs=63.4 Q ss_pred cchhhh-hHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHhcCC------------------ccccc----c Q lcl|NC_011308. 13 MADDIK-FEMDMSKIKDMFDDT----AEKALKQIGEHMKTEIAEGGHGDSSN------------------NVTGE----Y 65 (154) Q Consensus 13 Ma~~v~-~~~~l~~~~~~l~~~----~~~~v~~a~~~~~~~i~~~~ak~~aP------------------vdTG~----L 65 (154) |+++++ +...|..+++.|... +...|...+...+ .++......| .++|. | T Consensus 1 M~~~~~~~~~~L~~ll~~L~~~~r~~l~~~Ig~~l~~~t---~~Rf~~q~~PDG~pW~p~k~~~~~~k~~~~~~~m~~~L 77 (152) T protein:vir:10 1 MSEPIEQVKTAFDSLLNNISKPRRRLMYQQIGRELARSQ---RRRIKAQQNPDGSAYEPRKKPKKGVKSKIKSGKMFDKI 77 (152) T ss_pred CchHHHHHHHHHHHHHHhcCcchHHHHHHHHHHHHHHHH---HHHHHhccCCCCCCCchhhhhhhhhcccccchhHHHhh Confidence 998875 445577777776532 2233333222222 2222222233 12333 3 Q ss_pred ccc--eeEEeecCceEEEE---ecCCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHH Q lcl|NC_011308. 
66 ANK--TDFEVDKRKQEVKI---GNSSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKS 140 (154) Q Consensus 66 r~S--I~~~~~~~~~~~~V---~~~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~ 140 (154) +.| +.++...++ ++| +++..||...+||-......++.. +.-+|++|||-=+ ++.+. T Consensus 78 ~~a~~l~~~a~~~~--~~Vg~~Gt~~~yAaiHQfG~~~r~~~~~~~---------------~v~iPaRp~LG~s-~~d~~ 139 (152) T protein:vir:10 78 TQPRFMRLRLESEG--VSLGYEGGDAVIARIHQQGLIGRVRKDWDL---------------KVKYASRELLGFT-DDDLQ 139 (152) T ss_pred hhcceeeeeecCcE--EEEEecCCchhhhhhhccCccccccCCCCc---------------ceeccccccCCCC-HHHHH Confidence 333 444444443 455 999999999999932211111110 0147999999554 33455 Q ss_pred HHHHHHHHHhhcc Q lcl|NC_011308. 141 KVKDYVIKVFGGL 153 (154) Q Consensus 141 ~i~~~i~~~l~~l 153 (154) .|.+.|.+.|.+= T Consensus 140 ~I~~~i~~~l~~a 152 (152) T protein:vir:10 140 MIEDYMINILAGS 152 (152) T ss_pred HHHHHHHHHHhcC Confidence 6777777777766 No 143 >protein:vir:98636 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039927;genbank:gi:126011102;genbank:GeneID:4818472 Probab=97.06 E-value=2.8e-05 Score=45.58 Aligned_cols=127 Identities=14% Similarity=0.015 Sum_probs=74.2 Q ss_pred ceeeecCCccchhhhhHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc--cccccccceeEEe---ecCc Q lcl|NC_011308. 4 RLLIDRGGHMADDIKFEMDMSKIKDM-FDDTAEKALKQIGEHMKTEIAEGGHGDSSNN--VTGEYANKTDFEV---DKRK 77 (154) Q Consensus 4 ~~~~~~~~~Ma~~v~~~~~l~~~~~~-l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPv--dTG~Lr~SI~~~~---~~~~ 77 (154) -||------|| +|+|.+++.+.+.. |.++..+.+.+.++..+++.++...+...++ |||..-+++.++- ..+. 
T Consensus 1 ~~~~~~~~~~a-evkGv~Eilk~lE~klG~~~~~ri~nkAL~~~ge~v~~~lK~~~~~fkDTGat~dev~~s~p~~~~G~ 79 (138) T protein:vir:98 1 MLLEVSMSGFA-NLKGVEELLANMEKKLGPAKVNRVVNRSLKEIGKELEPSFKSAISIYKRTGETTESAVVSGVRREDGI 79 (138) T ss_pred Ceeeecccccc-cccCHHHHHHHHHHhhCHHhhhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeeeeeecCeeecCCc Confidence 23322223365 68999887776654 7766555565556666667777777777774 9999888875431 1122 Q ss_pred eEEEEecCCCcccc---cccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhc-c Q lcl|NC_011308. 78 QEVKIGNSSDYAIY---YEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGG-L 153 (154) Q Consensus 78 ~~~~V~~~~~YA~y---VE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~-l 153 (154) -++.|+=.-+-..- -|+|+ |+++.++|+- ++..|++..++.+.+.|+..|++ | T Consensus 80 r~V~igW~GpR~~ivHLNE~Gy--------------------Gk~i~PrG~G---~I~ka~~~se~~y~~~vk~el~k~l 136 (138) T protein:vir:98 80 PKVKLGFTTPRWNIVHLQELEY--------------------GWKHNRRGVG---VIRRYSDILETIYPRGIRDKLKRGF 136 (138) T ss_pred eEEEEeeecCeeeEEeeecccc--------------------cCCcCCCcch---HHHHHHHhhhHHHHHHHHHHHHHHh Confidence 23333322111011 12332 5566666663 78899999888877777555543 3 Q ss_pred C Q lcl|NC_011308. 154 D 154 (154) Q Consensus 154 ~ 154 (154) | T Consensus 137 ~ 137 (138) T protein:vir:98 137 D 137 (138) T ss_pred c Confidence 3 No 144 >protein:vir:7993 Length: 108 # NCBI annotation: gp9 # Family: family:all:3937 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817347;genbank:gi:29565775;genbank:GeneID:1259013 Probab=97.06 E-value=4e-07 Score=55.64 Aligned_cols=100 Identities=18% Similarity=0.280 Sum_probs=50.0 Q ss_pred cchhhh-------hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeec-CceEEEEec Q lcl|NC_011308. 13 MADDIK-------FEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDK-RKQEVKIGN 84 (154) Q Consensus 13 Ma~~v~-------~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~-~~~~~~V~~ 84 (154) |++-.. |--.++.+ +++. .+.+.|..-+. +++ ..+|.++||++|..|+||.+.-.. +.-.+.|+. 
T Consensus 1 ma~gpt~kNP~~KFGvs~~d~-~K~~-EVn~GvNeFMd----E~~-~~~K~~SPV~~G~Y~~S~~V~ers~NkGRG~~G~ 73 (108) T protein:vir:79 1 MANGPTRKNPLAKFGVRLDDF-DKLP-EVNQGVNEFMD----EVV-DAWKNNSPVGTGAYRDSVQVTERSTNKGRGKVGA 73 (108) T ss_pred CCCCcccccchhhhcCChhhh-hhch-hhhhhHHHHHH----HHH-HHHhhcCCCCchhhHHHHHHHHhhhccCccccCC Confidence 765322 22122221 1121 23334443332 333 357899999999999999653222 223578999 Q ss_pred CCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHH Q lcl|NC_011308. 85 SSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRD 137 (154) Q Consensus 85 ~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~ 137 (154) ...||.+|||||-....-....++.- .|=--|+.+ T Consensus 74 ~~~~AH~VEFGs~hndeyapaqktak------------------qfggtay~d 108 (108) T protein:vir:79 74 TDPQAHLVEFGSAHNDEYAPAQKTAK------------------QFGGTAYGD 108 (108) T ss_pred cchhhhhhhhhccccccccchhhHHH------------------hhcccccCC Confidence 99999999999853221111111100 000001111 No 145 >protein:vir:1838 Length: 149 # NCBI annotation: O protein # Family: family:all:370 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052262;genbank:gi:9634069;genbank:GeneID:1262457 Probab=97.02 E-value=1.2e-05 Score=47.47 Aligned_cols=119 Identities=9% Similarity=0.092 Sum_probs=68.6 Q ss_pred cchhhhhH-HHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHhcCCc------------------------ccc Q lcl|NC_011308. 13 MADDIKFE-MDMSKIKDMFDDT----AEKALKQIGEHMKTEIAEGGHGDSSNN------------------------VTG 63 (154) Q Consensus 13 Ma~~v~~~-~~l~~~~~~l~~~----~~~~v~~a~~~~~~~i~~~~ak~~aPv------------------------dTG 63 (154) |. +++.. ..|..+++.|... +...|...+... 
..++...+..|- .+| T Consensus 1 m~-~~~~~~~~l~~ll~~L~~~~~~~l~r~Ig~~l~~~---t~~rf~~q~~PdG~~W~p~~~~~~~~~~g~~~~~~~~~l 76 (149) T protein:vir:18 1 MS-ELTALQERLAGLIASLSPAARRKMAAEIAKKLRTS---QQQRIKRQQAPDGTPYAARKRQPVRSKKGRIKREMFAKL 76 (149) T ss_pred Cc-hHHHHHHHHHHHHHhcCCchHHHHHHHHHHHHHHH---HHHHHHhhcCCCCCCCcccchhhhhhccCcccchhhhhh Confidence 86 45433 4566666666432 223333322222 222222333341 224 Q ss_pred ccccceeEEeecCceE-EEEecCCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHH Q lcl|NC_011308. 64 EYANKTDFEVDKRKQE-VKIGNSSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKV 142 (154) Q Consensus 64 ~Lr~SI~~~~~~~~~~-~~V~~~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i 142 (154) .|.+|+......++.+ +.++++..||...+||..... ++++ +| ..+|++|||-=+ ++.+..| T Consensus 77 ~~~~~l~~~~~~~~~~v~~~Gtn~~yAaiHQfG~~~r~-~~~~---~~------------v~iPaRp~LG~s-~~d~~~I 139 (149) T protein:vir:18 77 RTSRFMKAKGSDSAAVVEFTGKVQRMARVHQYGLKDRP-NRNS---RD------------VQYEARPLLGFT-RDDEQMI 139 (149) T ss_pred hhhhhhheeecCceeEEEecccchhhhhhhhccccccc-cCCC---cc------------ccccccccCCCC-HHHHHHH Confidence 5567777766665543 346999999999999953221 1111 11 257999999655 4557889 Q ss_pred HHHHHHHhhc Q lcl|NC_011308. 143 KDYVIKVFGG 152 (154) Q Consensus 143 ~~~i~~~l~~ 152 (154) .+.|.+.|.+ T Consensus 140 ~~~i~~~l~~ 149 (149) T protein:vir:18 140 EDVIISHLGK 149 (149) T ss_pred HHHHHHHHhC Confidence 9999999999 No 146 >protein:vir:98892 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:899 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164422;genbank:gi:56694912;genbank:GeneID:3197282 Probab=97.00 E-value=1.3e-05 Score=47.29 Aligned_cols=107 Identities=10% Similarity=-0.030 Sum_probs=63.5 Q ss_pred cc-hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCceEEEEecCCCcccc Q lcl|NC_011308. 
13 MA-DDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRKQEVKIGNSSDYAIY 91 (154) Q Consensus 13 Ma-~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~y 91 (154) |+ ++|++.. ..+.+.+. .++++-.-++.++.+ .....+|.+||+|++|-.+. ++ .+.|..+++||.+ T Consensus 1 mmkvkv~~~~----~~~~~~~~---~~~~aq~~~~~ev~~-~~~~yVP~~~G~L~~s~~~~--s~--~g~I~y~tPYAr~ 68 (108) T protein:vir:98 1 MPKIRVELSG----AKDKLSPQ---TQRRGQYAMANQMLQ-DMNQFVPMEEGILRLTGNIS--SD--AEEIYYNTPYAKR 68 (108) T ss_pred CceeEeeehH----HHHHHHHH---HHHHHHHHHHHHHHH-hhcccCcCcCCccccceeec--cC--CceEEecChhhHH Confidence 54 4444322 22222222 222222223344443 34678999999999995333 22 2578889999999 Q ss_pred cccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_011308. 92 YEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGG 152 (154) Q Consensus 92 VE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~ 152 (154) .-||..-+..+ .....-++..|.....+++.+.+.++++= T Consensus 69 qYYg~~~n~~~---------------------p~ag~~W~eraka~~~~~~~~~~~k~~k~ 108 (108) T protein:vir:98 69 RFYEPAYNYTT---------------------PGTGPRWDMKAKRLFISDWERAYMKGANW 108 (108) T ss_pred hhhccccCCCC---------------------CCCcchhHHHHHhhhhHHHHHHHHHhhcC Confidence 99985321111 22333456678888888888888888877 No 147 >protein:vir:77650 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:YP_022741;genbank:gi:47835022;genbank:GeneID:2821447 Probab=96.84 E-value=3.6e-06 Score=50.43 Aligned_cols=97 Identities=14% Similarity=0.061 Sum_probs=49.1 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCceEEEEecCCCccccc Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRKQEVKIGNSSDYAIYY 92 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~yV 92 (154) |+++-++...+. +.|.. ..++ .+ +.+ ...-|..+|..-..-... ..++.. --+.+.+|.|. 
T Consensus 1 m~~~r~~l~~~~---~~l~~---~~v~-----VG--i~~---~a~y~d~~~~~~~~~~~~-~~~~~~--G~pva~ia~~~ 61 (155) T protein:vir:77 1 MSVTRRGLTLPK---DRYRS---MSVK-----AG--VLA---GATYPDESGKKLADGSIL-KKDPRA--GLPVAMIAMAL 61 (155) T ss_pred CcchHHHHHHHH---HHHhc---CceE-----Ee--ecC---CCCCccccchhhhhhhhc-cccccc--cccHhhhhhhh Confidence 776655432222 11111 0000 00 000 001122233222211110 111111 12335799999 Q ss_pred ccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhcc-C Q lcl|NC_011308. 93 EFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGGL-D 154 (154) Q Consensus 93 E~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~l-~ 154 (154) ||||. ..||||||+|+++++++++.+.+.+.++.- | T Consensus 62 e~G~~--------------------------~IP~RPFlr~t~~~~~~~~~~~l~~~~~~~~~ 98 (155) T protein:vir:77 62 NYGTS--------------------------KLPARPFMEKTIADRSAEWIKGLTVMMTMGYD 98 (155) T ss_pred hcCCC--------------------------CCCCCchhhHHHHHHHHHHHHHHHHHHHccCc Confidence 99972 579999999999999999998888877653 3 No 148 >protein:vir:101563 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958111;genbank:gi:41057657;genbank:GeneID:2716820 Probab=96.84 E-value=3e-06 Score=50.87 Aligned_cols=95 Identities=17% Similarity=0.141 Sum_probs=47.3 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHhcCCccccccccceeEEeecCceEEEEe-cCCCccc Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGG-HGDSSNNVTGEYANKTDFEVDKRKQEVKIG-NSSDYAI 90 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~-ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~-~~~~YA~ 90 (154) |+++.+.... +++.|... ++.-.. +...-|..+|..-..... ...+ ..-+ +.+.+|. 
T Consensus 1 m~v~r~~L~~---~~~~l~~~--------------~V~VGi~~~a~y~d~~g~~~~~g~~-~~~~---~~~G~pva~ia~ 59 (155) T protein:vir:10 1 MSVTRRGLTL---PKDRYKSM--------------SVKAGVLAGATYPDESGKKLADGTI-LKKD---PRAGLPVAMIAM 59 (155) T ss_pred CcchHHHHHH---HHHHhhCC--------------eeEEeecCCCCCCccccchhhhhhh-hccc---cccCcchhhhhh Confidence 7765443222 11222110 000000 000001111211111100 0101 1112 2356899 Q ss_pred ccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhcc-C Q lcl|NC_011308. 91 YYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGGL-D 154 (154) Q Consensus 91 yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~l-~ 154 (154) |.||||. ..||||||+|+++++++++.+.+.+.++.- | T Consensus 60 ~~e~G~~--------------------------~IP~RPFlr~t~~~~~~~~~~~l~~~~~~~~~ 98 (155) T protein:vir:10 60 ALNYGTS--------------------------KLPARPFMEKTIADRSAEWIKGLTVMMTMGYD 98 (155) T ss_pred hhhcCCC--------------------------CCCCcchhHHHHHHHHHHHHHHHHHHHHcCCC Confidence 9999973 579999999999999999999988888763 3 No 149 >protein:vir:5257 Length: 148 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852762;genbank:gi:31544037;uniprot:Q7Y5T8;genbank:GeneID:2753554 Probab=96.72 E-value=3.2e-06 Score=50.68 Aligned_cols=89 Identities=17% Similarity=0.164 Sum_probs=50.1 Q ss_pred cchhhhhHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCceEEEEecCCCcccc Q lcl|NC_011308. 13 MADDIKFEM-DMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRKQEVKIGNSSDYAIY 91 (154) Q Consensus 13 Ma~~v~~~~-~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~y 91 (154) |++.+++.. .+.++++.+... .+. 
-|.-|-+...=.-....++ .+.+..|.+ T Consensus 1 M~~~~k~~~~~~~~l~~~l~~l--------------------~~~--~v~VGi~~~~~~~~~~~~g-----~~vA~ia~~ 53 (148) T protein:vir:52 1 MAVTVTANFSAAKQLIEQMKSL--------------------KEK--AVYVGFPAEFDEKVKGSEN-----FNLASLAAV 53 (148) T ss_pred CccccccccHHHHHHHHHHHHh--------------------hCC--eEEEEeecCcCCCCCCCCC-----CCHHHHHHH Confidence 998877643 222222222110 000 0112222111000001111 244678999 Q ss_pred cccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhc-cC Q lcl|NC_011308. 92 YEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGG-LD 154 (154) Q Consensus 92 VE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~-l~ 154 (154) .||||. ..||||||+|+++++++++.+.+.+.+++ +| T Consensus 54 ~E~G~~--------------------------~IP~Rpflr~t~~~~~~~~~~~~~~~~~~~~~ 91 (148) T protein:vir:52 54 LEFGNE--------------------------HIPARPFLRQTLEENQEKYTALFIQWFDQGVP 91 (148) T ss_pred HhcCCC--------------------------CCCCcchhHHHHHHHHHHHHHHHHHHHHcCCC Confidence 999973 57999999999999999999988887764 23 No 150 >protein:vir:107757 Length: 189 # NCBI annotation: gp20 # Family: family:all:503 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024868;genbank:gi:48697510;genbank:GeneID:2948378 Probab=96.62 E-value=6.1e-06 Score=49.19 Aligned_cols=88 Identities=17% Similarity=0.131 Sum_probs=50.6 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCceEEEEecCCCccccc Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRKQEVKIGNSSDYAIYY 92 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~yV 92 (154) |++.|++.+...+.++.. ++. +. +. -|.-|-+..+ . ..++ ...+..|.+. 
T Consensus 1 M~~~i~~~~~~~~~L~~~-------lk~----l~--------~k--~V~VGi~~~~---~-y~dG-----~~vA~Ia~~~ 50 (189) T protein:vir:10 1 MGRVIRKQGPARVKLNAF-------IKG----MN--------DY--SVRIGWFSTA---K-YPDG-----TPTAYVASIH 50 (189) T ss_pred CcceeccCcHHHHHHHHH-------HHH----hh--------CC--eEEEEecCCC---C-CCCc-----ccHHHHHHHH Confidence 988887654332222211 110 00 00 0122333211 0 1112 1246789999 Q ss_pred ccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhc-----cC Q lcl|NC_011308. 93 EFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGG-----LD 154 (154) Q Consensus 93 E~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~-----l~ 154 (154) |||+- + ...||||||+|+++++++++.+.+.+.++. +| T Consensus 51 E~G~p-~-----------------------~~IP~RPFlr~t~~~~~~~~~~~l~~~~~~vl~G~~~ 93 (189) T protein:vir:10 51 EFGAP-S-----------------------RGIPARSFIRPTIAAQQAAWSQQMRFYAKQIVVGQMN 93 (189) T ss_pred HhcCc-C-----------------------CCCCCchhhhHHHHHHHHHHHHHHHHHHHHHHhCCCC Confidence 99972 1 247999999999999999999998888874 44 No 151 >protein:vir:79687 Length: 113 # NCBI annotation: hypothetical protein # Family: family:all:899 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285886;genbank:gi:148750843;genbank:GeneID:5220386 Probab=96.48 E-value=4.5e-05 Score=44.41 Aligned_cols=109 Identities=12% Similarity=0.128 Sum_probs=61.7 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCceEEEEecCCCccccc Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRKQEVKIGNSSDYAIYY 92 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~yV 92 (154) |++ |..+.+.++....+..+.+ +..++++ .....+|.+||.|++|-. +.++ .|..+++||.+. 
T Consensus 1 ~~d-------L~~~~~~~~~~~~~raQ~~---l~~ev~~-~~~pYVP~~~G~Lk~S~~--i~s~----~I~y~tPYAr~q 63 (113) T protein:vir:79 1 MSD-------LSVFSRMAQSTGSRSVRLQ---VLNQMHQ-DMEQYVPKRAGFLRSQSF--VNDT----GIHYTAKYARAQ 63 (113) T ss_pred Cch-------HHHHHHhhchhHHHHHHHH---HHHHHHH-hhcccCcccccchhcccc--ccCC----eeEecChhhhHh Confidence 542 3444444444333333332 2334443 457889999999999964 3333 377888999999 Q ss_pred ccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhc-cC Q lcl|NC_011308. 93 EFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGG-LD 154 (154) Q Consensus 93 E~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~-l~ 154 (154) =||. ..+-+.+. +|.......++..|.....+++.+.+.+++.. .- T Consensus 64 yYg~-~~~~~~~~---------------~t~p~ag~~W~eraKa~h~~~w~~~~~~a~~~G~~ 110 (113) T protein:vir:79 64 FYGF-VNGHRVRN---------------YSTPGTGRRWDLKAKAVYKADWQKVAVAAFLKEAK 110 (113) T ss_pred hccc-cCCCCccc---------------cCCCCCCchhhHHHHHHhHHHHHHHHHHHhhcccc Confidence 8872 11111000 11112333466778888888888887776543 22 No 152 >protein:vir:102190 Length: 93 # NCBI annotation: gp21 # Family: family:all:2713 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655217;genbank:gi:109522797;genbank:GeneID:4157429 Probab=96.19 E-value=3.6e-05 Score=44.96 Aligned_cols=88 Identities=14% Similarity=0.184 Sum_probs=66.7 Q ss_pred HHHHHHHHHHHHHHHHHHhcCC--ccccccccceeEEe--ec-CceEEEEecCCCcccccccCccccccCCCCcccccce Q lcl|NC_011308. 
38 LKQIGEHMKTEIAEGGHGDSSN--NVTGEYANKTDFEV--DK-RKQEVKIGNSSDYAIYYEFGTGEKSEKGGGRAGGWSY 112 (154) Q Consensus 38 v~~a~~~~~~~i~~~~ak~~aP--vdTG~Lr~SI~~~~--~~-~~~~~~V~~~~~YA~yVE~GTg~~~~~~~~~~~~~~~ 112 (154) +.-.+..+ +.-.+..||.+|| -+||.-|.+|.-.+ .+ +.+++.+.-+++|.+|.|.+++ T Consensus 1 ~~~~~d~a-a~~le~~aK~nApW~DRTg~AR~~l~~~~~~~g~~~~~i~lsh~v~Yg~~LE~a~~--------------- 64 (93) T protein:vir:10 1 MKATSNYH-AVEGTAHMKEHAPWTDRTGAARAGLHAVASTPQPDRYEIVFAHTVHYGIWLEIANS--------------- 64 (93) T ss_pred CchhhhHH-HHHHHHHHhcCCCccccchhhhhhhcccccccCCceEEEEEecCeeccceEEeecC--------------- Confidence 33333322 3334678999999 49999999995433 33 4577888889999999999874 Q ss_pred ecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_011308. 113 MDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGGLD 154 (154) Q Consensus 113 ~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~l~ 154 (154) |++ --+.|+++..-++|.+.+...|.+|- T Consensus 65 -----------~ky--aIl~Ptv~~~~~~i~~g~~~ll~~l~ 93 (93) T protein:vir:10 65 -----------GRY--EIIMPTVHHEGKLMAQRLRGLLGRLR 93 (93) T ss_pred -----------CCc--cchhhhHHHHHHHHHHHHHHHHHhcC Confidence 111 25899999999999999999999999 No 153 >protein:vir:78894 Length: 105 # NCBI annotation: gp10 # Family: family:all:29989 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468850;genbank:gi:157325424;genbank:GeneID:5601891 Probab=96.17 E-value=8.3e-06 Score=48.45 Aligned_cols=99 Identities=8% Similarity=0.044 Sum_probs=58.7 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHhcCCccccccccce--eEEeecCceEEEEecCC Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDTAEKALKQIGEHM----KTEIAEGGHGDSSNNVTGEYANKT--DFEVDKRKQEVKIGNSS 86 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~----~~~i~~~~ak~~aPvdTG~Lr~SI--~~~~~~~~~~~~V~~~~ 86 (154) |+.+ +|.+ .+.+.+.+..+.. 
..|++ +.....+|.+||.|++|= .+.+.++.+...+..-+ T Consensus 1 ~~f~-~f~~-----------~~~k~l~kr~L~~~g~vq~Evl-R~~~PyvP~~tG~Lk~S~~l~tvIgsg~I~y~~~~~a 67 (105) T protein:vir:78 1 MSFS-SFKD-----------AVIDDIHNKALSTAAKAGGELV-ELAQPVTPILYGDLRRSSYFKIIIQKNSIVARVFSLT 67 (105) T ss_pred CCcc-cccc-----------hHHHHHHHhcCCCCchhhHHHH-HHhCCCCcccccccccccccceeecCCeeEeeccccC Confidence 7632 2222 1222222222221 12333 345677899999999984 45556666666666679 Q ss_pred CcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_011308. 87 DYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGG 152 (154) Q Consensus 87 ~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~ 152 (154) +||.+.=|.. +....|+..++..+++.|.++++..++= T Consensus 68 PYAr~qYYe~----------------------------~Rg~~WfErm~a~hk~~I~~~vegg~~~ 105 (105) T protein:vir:78 68 PYARRQYYEN----------------------------RRNPRWYEMAVSYGIQSINQIVEGGMRL 105 (105) T ss_pred chhhhhhhcc----------------------------cCCCchhHHhhhcchhHHHHHHhcccCC Confidence 9999876542 1222366777788888898888855544 No 154 >protein:vir:78607 Length: 155 # NCBI annotation: BcepNY3gp06 # Family: family:all:503 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294843;genbank:gi:149882906;genbank:GeneID:5291078 Probab=96.13 E-value=8.7e-06 Score=48.33 Aligned_cols=96 Identities=16% Similarity=0.108 Sum_probs=46.7 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCceEEEEe-cCCCcccc Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRKQEVKIG-NSSDYAIY 91 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~-~~~~YA~y 91 (154) |.++-++. ...++.|.. ..|+ .+ +.. ...-|..||.--..-... 
..+ +.-+ +.+.+|.| T Consensus 1 m~v~~k~L---~~~~~~l~~---~~v~-----VG--i~~---~a~y~d~~~~~~~~~~~~-~~~---~~~g~~va~ia~~ 60 (155) T protein:vir:78 1 MSVTRRGL---TLPKDRYRS---MSVK-----AG--VLA---GATYPDESGKKLADGTIL-TKD---PRAGLPVAMIAMA 60 (155) T ss_pred CcchHHHH---HHHHHHHhC---CeeE-----Ee--ecC---CCCCCcccchhhhhhhhc-ccc---cccCCcHHHHHHh Confidence 65543331 111111110 0000 00 000 001122222111110000 000 1112 23568889 Q ss_pred cccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_011308. 92 YEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGGLD 154 (154) Q Consensus 92 VE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~l~ 154 (154) .||||. +.||||||+|++++++.++.+.+.+.++.-. T Consensus 61 ~E~G~~--------------------------~IP~RPFlr~t~~~~~~~~~~~l~~~~~~~~ 97 (155) T protein:vir:78 61 LNYGTS--------------------------KLPARPFMEKTITDRSAEWIKGLTVMMTMGY 97 (155) T ss_pred hhcCCC--------------------------CCCCcchhhHHHHHHHHHHHHHHHHHHHcCC Confidence 999972 6799999999999999999999888887644 No 155 >protein:vir:106728 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944315;genbank:gi:38638614;genbank:GeneID:2657357 Probab=96.12 E-value=8.4e-06 Score=48.41 Aligned_cols=95 Identities=16% Similarity=0.103 Sum_probs=46.4 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccc-eeEEeecCceEEEEe-cCCCccc Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANK-TDFEVDKRKQEVKIG-NSSDYAI 90 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~S-I~~~~~~~~~~~~V~-~~~~YA~ 90 (154) |.++-++. ...++.|.. ..|+ .+ +.. ...-|..+|.--.. .-... + ..-+ +.+.+|. 
T Consensus 1 m~v~~k~L---~~~~~~l~~---~~v~-----VG--i~~---~a~y~d~~~~~~~~~~~~~~--~---~~~g~~va~ia~ 59 (155) T protein:vir:10 1 MSVTRRGL---TLPKDRYRS---MSVK-----AG--VLA---GATYPDESGKKLADGTILTK--D---PRAGLPVAMIAM 59 (155) T ss_pred CcchHHHH---HHHHHHHhC---CeeE-----Ee--ecC---CCCCccccchhhhhhhhccc--c---cccCCcHHHHHH Confidence 55543331 111111110 0000 00 000 00112222211110 00110 0 1111 2456888 Q ss_pred ccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_011308. 91 YYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGGLD 154 (154) Q Consensus 91 yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~l~ 154 (154) |.||||. ..||||||+|++++++.++.+.+.+.++.-. T Consensus 60 ~~E~G~~--------------------------~IP~RPFlr~t~~~~~~~~~~~l~~~~~~~~ 97 (155) T protein:vir:10 60 ALNYGTS--------------------------KLPARPFMEKTIADRSAEWIKGLTVMMTMGY 97 (155) T ss_pred HHhcCCC--------------------------CCCCcchhHHHHHHHHHHHHHHHHHHHHcCC Confidence 9999972 6799999999999999999999888887644 No 156 >protein:vir:3036 Length: 118 # NCBI annotation: minor capsid protein # Family: family:all:899 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438149;genbank:gi:16271812;genbank:GeneID:929237 Probab=95.43 E-value=0.00028 Score=40.05 Aligned_cols=112 Identities=17% Similarity=0.102 Sum_probs=49.5 Q ss_pred cc-hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCceEEEEecCCCcccc Q lcl|NC_011308. 13 MA-DDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRKQEVKIGNSSDYAIY 91 (154) Q Consensus 13 Ma-~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~y 91 (154) |+ ++|+ +..+.+.|.+. .++++-..++.++.. .....+|.+||+|++|-. 
+.+++ |..+++||.+ T Consensus 1 m~kV~vd----l~~~~~~ls~~---~~~k~Q~~~~~ev~~-~~~~YVP~~tG~Lk~S~~--i~~~~----I~Y~tPYAr~ 66 (118) T protein:vir:30 1 MAKVVVE----LGGIKRKVSPQ---ALAKGKLIMNNQVMM-SMNPYVPYRDGALRGSSR--ANSVG----VTWSGPHARA 66 (118) T ss_pred Cceeeec----hhHHhhhhhHH---HHHHHHHHHHHHHHH-HhhcCCCCccCcccccee--ecCCe----eEECCchhhH Confidence 65 3333 33333444322 222222223334433 346789999999999964 33443 5677899988 Q ss_pred cccCc---cccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_011308. 92 YEFGT---GEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGGLD 154 (154) Q Consensus 92 VE~GT---g~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~l~ 154 (154) .=||- +..+.+.+. +.+. ....-+..++. .....+..-.+-++++++ T Consensus 67 qYY~~~~~~~~g~~~~~--------------~~~p-~~g~~Wd~R~k-a~~~~~~~w~~~~~k~~g 116 (118) T protein:vir:30 67 QFYGGAYNKYKSFKFKK--------------YTTP-GTGKRWDKRAL-ANATIVKDWEKSLLRGMG 116 (118) T ss_pred hhhccccCCCCcccccc--------------ccCC-CCCCcccchhh-cchhhhHHHHHHHHHhcC Confidence 76652 111111000 0000 11111222222 222233333444444445 No 157 >protein:vir:9823 Length: 118 # NCBI annotation: putative minor capsid protein # Family: family:all:899 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795585;genbank:gi:28876336;genbank:GeneID:1257873 Probab=95.43 E-value=0.00028 Score=40.05 Aligned_cols=112 Identities=17% Similarity=0.102 Sum_probs=49.5 Q ss_pred cc-hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCceEEEEecCCCcccc Q lcl|NC_011308. 13 MA-DDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRKQEVKIGNSSDYAIY 91 (154) Q Consensus 13 Ma-~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~y 91 (154) |+ ++|+ +..+.+.|.+. .++++-..++.++.. .....+|.+||+|++|-. 
+.+++ |..+++||.+ T Consensus 1 m~kV~vd----l~~~~~~ls~~---~~~k~Q~~~~~ev~~-~~~~YVP~~tG~Lk~S~~--i~~~~----I~Y~tPYAr~ 66 (118) T protein:vir:98 1 MAKVVVE----LGGIKRKVSPQ---ALAKGKLIMNNQVMM-SMNPYVPYRDGALRGSSR--ANSVG----VTWSGPHARA 66 (118) T ss_pred Cceeeec----hhHHhhhhhHH---HHHHHHHHHHHHHHH-HhhcCCCCccCcccccee--ecCCe----eEECCchhhH Confidence 65 3333 33333444322 222222223334433 346789999999999964 33443 5677899988 Q ss_pred cccCc---cccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_011308. 92 YEFGT---GEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGGLD 154 (154) Q Consensus 92 VE~GT---g~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~l~ 154 (154) .=||- +..+.+.+. +.+. ....-+..++. .....+..-.+-++++++ T Consensus 67 qYY~~~~~~~~g~~~~~--------------~~~p-~~g~~Wd~R~k-a~~~~~~~w~~~~~k~~g 116 (118) T protein:vir:98 67 QFYGGAYNKYKSFKFKK--------------YTTP-GTGKRWDKRAL-ANATIVKDWEKSLLRGMG 116 (118) T ss_pred hhhccccCCCCcccccc--------------ccCC-CCCCcccchhh-cchhhhHHHHHHHHHhcC Confidence 76652 111111000 0000 11111222222 222233333444444445 No 158 >protein:vir:4096 Length: 140 # NCBI annotation: Gp9 protein # Family: family:all:28682 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510990;swissprot:trembl:q8w600;genbank:gi:17488512;uniprot:Q8W600;genbank:GeneID:1260318 Probab=94.80 E-value=0.00026 Score=40.27 Aligned_cols=118 Identities=12% Similarity=0.147 Sum_probs=71.8 Q ss_pred cchhhhh----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccc---cccccce--------eEEeecCc Q lcl|NC_011308. 13 MADDIKF----EMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVT---GEYANKT--------DFEVDKRK 77 (154) Q Consensus 13 Ma~~v~~----~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdT---G~Lr~SI--------~~~~~~~~ 77 (154) |+.++.. .+.|.+.+.+++...++.|.++...-+..++........||.- |.+|+-. 
..+...=+ T Consensus 1 m~~~~sld~s~~e~L~~~i~r~P~ksE~~IN~~L~tkg~~~~~~~I~~~iPvS~~~k~~~RnK~HAK~s~pl~~~~~NLg 80 (140) T protein:vir:40 1 MCAKWSLEFSDVERLSNLISQIPNKSEAIINKTLETKAVPLVKLNIEKRINLSKNWKGQLLNKNHAQSSGPFNVKMGNLG 80 (140) T ss_pred CCcceecchhhHHHHHHHHHhccchHHHHHHHHHHhhhhHHHHhhhhhccCcCccchhhhccccchhhhhhhhhhhhhcc Confidence 8876442 3467777778888889999988877776666666667789853 3555544 22211112 Q ss_pred eEEEEecCCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_011308. 78 QEVKIGNSSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGGLD 154 (154) Q Consensus 78 ~~~~V~~~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~l~ 154 (154) ++.+-...=.|-.|-..|-|.|. --+|-||...++..-+.+.+.+.++|-++= T Consensus 81 f~i~~k~kf~YLvfPD~G~G~sn------------------------~~~q~FmerGl~~~t~~i~E~L~~~l~k~i 133 (140) T protein:vir:40 81 FELLTKPKFNYLIFPDQGIGKHN------------------------KTKQDFMQLGVEESSQEIVEMLEQAVFKEI 133 (140) T ss_pred eeEeecCcccccccccccCCCCC------------------------cchHHHHHhccccchhHHHHHHHHHHHHHH Confidence 33222233457777777765442 145569998888877776665555443332 No 159 >protein:vir:2688 Length: 123 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075507;genbank:gi:12719436;genbank:GeneID:920156 Probab=94.77 E-value=0.001 Score=37.03 Aligned_cols=111 Identities=13% Similarity=0.079 Sum_probs=61.7 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC--CccccccccceeEEe---ecC--ceEEEEecCCCcccc Q lcl|NC_011308. 19 FEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSS--NNVTGEYANKTDFEV---DKR--KQEVKIGNSSDYAIY 91 (154) Q Consensus 19 ~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~a--PvdTG~Lr~SI~~~~---~~~--~~~~~V~~~~~YA~y 91 (154) .+.+|+ +.|.++....+.+-++..+++.+....|... -.|||..-+++.++- ..+ .-++.|+=.. 
T Consensus 1 ilk~lE---~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGatidev~~s~p~~~~g~~~rtV~i~W~g----- 72 (123) T protein:vir:26 1 MLKKLE---SVYGKQSMQAKSDRALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEWVG----- 72 (123) T ss_pred ChhhHH---HhcCHHHHHHhhhHHHHHHHHHHHHHHHHhhHHhhhccceeeeEEecCeeeccCCccceEEEEeec----- Confidence 333333 3444443333333333344444443344433 349999888886541 111 1223333221 Q ss_pred cccCccccccCCCCccc-----ccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_011308. 92 YEFGTGEKSEKGGGRAG-----GWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGG 152 (154) Q Consensus 92 VE~GTg~~~~~~~~~~~-----~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~ 152 (154) |+.|.. -|- +..+|+|+.++||- -+..|++..++.+.+.|++.|+. T Consensus 73 -----------p~~R~~iVHLNE~G-Ytr~Gk~i~PRG~G---~i~~a~~~se~~y~~~vk~eL~k 123 (123) T protein:vir:26 73 -----------PMNRKNIIHLNEHG-YTRDGKKYTPRGFG---VIAKTLAANERKYREIIKKELAR 123 (123) T ss_pred -----------CCCceeeEeeeccc-eecCCCeEccchhh---HHHHHHHhhhHHHHHHHHHHhcC Confidence 111111 122 35678888888885 58999999999999999999999 No 160 >protein:vir:1087 Length: 161 # NCBI annotation: Orf46 # Family: family:all:1029 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076741;genbank:gi:13095851;genbank:GeneID:920400 Probab=94.34 E-value=0.0007 Score=37.88 Aligned_cols=131 Identities=15% Similarity=-0.015 Sum_probs=67.7 Q ss_pred ccchhhhhHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHH------HhcCCcccc---ccccceeEEee-cCce Q lcl|NC_011308. 12 HMADDIKFEMDMSKIKDMFDDTAEK---ALKQIGEHMKTEIAEGGH------GDSSNNVTG---EYANKTDFEVD-KRKQ 78 (154) Q Consensus 12 ~Ma~~v~~~~~l~~~~~~l~~~~~~---~v~~a~~~~~~~i~~~~a------k~~aPvdTG---~Lr~SI~~~~~-~~~~ 78 (154) -|-.+--|.+.|...++++.+..-+ +-+..+..+++++.+... +-..+..|| .|++||..+-. .++. 
T Consensus 1 ~~~~~~~fdd~L~~~~~~v~klv~~lt~e~kakIT~AGAkv~a~~L~~~T~~kHy~~~kt~k~~HLADsI~~~~~niDg~ 80 (161) T protein:vir:10 1 MMEEKQLFEDIMNGIIFQAESVSTSLTVEDKAKITKAGANAFAIGLEKVTKDKHYRIRKTGENPHLADSILVQNTNIDGI 80 (161) T ss_pred CcchhHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHhHHHHHHHHHHHhhhhcCcCCCCCCcchhhhheeecccccCcc Confidence 3332333556666666666543211 112233334444433322 223344666 99999976421 1111 Q ss_pred ---EEEEecC---CCcccccccCccccccCCCCcccccceecccCcee---ecCCCCCCchhHHHHHH--HHHHHHHHHH Q lcl|NC_011308. 79 ---EVKIGNS---SDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWH---FTRGSKASKRMRYTFRD--EKSKVKDYVI 147 (154) Q Consensus 79 ---~~~V~~~---~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~---~t~g~~a~PFl~pA~~~--~~~~i~~~i~ 147 (154) +.+||-+ +--|.|++.||..-..- ..|..+ .|..+++-+|+..+-+. .++.+-+... T Consensus 81 ~dG~StVGw~~kka~ia~~indGtr~~~~~------------~~~~~~~n~Gt~~i~gDHFvd~~r~~~~~k~aV~~Ae~ 148 (161) T protein:vir:10 81 KDGNSTVGWDYTKSRVGHLIENGTRFPMYS------------KKGTKYRKGGQVAITSDPFVSTYRDSMEAQVAMFSAEA 148 (161) T ss_pred cCCceeccccCchhhhhhhhcccchhhhhh------------cccccccCCcceeecCcchhHHHHhhhhhHHHHHHHHH Confidence 2345554 33588899998531110 111111 13467889999998884 5566666666 Q ss_pred HHhhccC Q lcl|NC_011308. 148 KVFGGLD 154 (154) Q Consensus 148 ~~l~~l~ 154 (154) .++++|= T Consensus 149 ~~y~eil 155 (161) T protein:vir:10 149 EVFSEIL 155 (161) T ss_pred HHHHHHH Confidence 6666554 No 161 >protein:vir:4460 Length: 170 # NCBI annotation: hypothetical protein # Family: family:all:2152 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700383;genbank:gi:23505455;genbank:GeneID:955662 Probab=93.60 E-value=0.00085 Score=37.42 Aligned_cols=126 Identities=17% Similarity=0.251 Sum_probs=64.9 Q ss_pred cchh----hhhH---------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecC--- Q lcl|NC_011308. 13 MADD----IKFE---------MDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKR--- 76 (154) Q Consensus 13 Ma~~----v~~~---------~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~--- 76 (154) |++. |+|. 
..+.+...++.+.......+.++..+ +.....+.--.||.|..||.+.|... T Consensus 1 M~~~~~lHvdF~qp~~~~Fnr~r~RraF~~iGq~h~r~Arrlvm~RG----rs~pGe~P~~~TGrLa~SIgy~Vpras~~ 76 (170) T protein:vir:44 1 MPQKAYLHVDFVQPEELVFNRARMRRAFVKIGQVHMRDARRLVMKRG----RSKPGENPSYRTGQLARSIGYYVPRASKK 76 (170) T ss_pred CCCCceeEEeeecCCceeecHHHHHHHHHHHhHHHHHHHHHHHHHhc----CCCCCCCCcchhhhhhhhhhhccccccCC Confidence 5542 2222 12222222222222222221111111 11112232348999999998766433 Q ss_pred --ceEEEEecCC------------CcccccccCcc--ccccCCCCcc----cccceecccCceeecCCCCCCchhHHHHH Q lcl|NC_011308. 77 --KQEVKIGNSS------------DYAIYYEFGTG--EKSEKGGGRA----GGWSYMDKNGKWHFTRGSKASKRMRYTFR 136 (154) Q Consensus 77 --~~~~~V~~~~------------~YA~yVE~GTg--~~~~~~~~~~----~~~~~~~~~g~~~~t~g~~a~PFl~pA~~ 136 (154) |+-+.|-.|. -|-.|.+||-. ....+.+.+. .+|.. .|-.-||..+++ T Consensus 77 rpG~mVkIaPNqk~G~g~r~i~g~fYPafL~YGVr~gakr~k~hhr~a~ggsgwri------------aPR~Nym~~~l~ 144 (170) T protein:vir:44 77 RPGLMVKIAPNQKNGEGNRHINGAFYPAFLFYGVRRGAKRKKGHHRGASGGSGWRV------------EPRNNYMTEVLD 144 (170) T ss_pred CCceeEEecCCCCCCCCccccccccchhhhhhhhhcccccchhhcccccCCCccee------------ccchhHHHHHHH Confidence 6666665542 47888888842 2222111111 12332 255569999999 Q ss_pred HHHHHHHHHHHHHhhccC Q lcl|NC_011308. 137 DEKSKVKDYVIKVFGGLD 154 (154) Q Consensus 137 ~~~~~i~~~i~~~l~~l~ 154 (154) +.+......+..+|+.-= T Consensus 145 ~~~~wt~~~L~r~L~~sL 162 (170) T protein:vir:44 145 KRRSWTRYVLSRELRKSL 162 (170) T ss_pred hhHHHHHHHHHHHHHHhc Confidence 999998888888876533 No 162 >protein:vir:80037 Length: 199 # NCBI annotation: gp11 # Family: family:all:503 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468715;genbank:gi:157325295;genbank:GeneID:5601728 Probab=93.35 E-value=0.00014 Score=41.69 Aligned_cols=106 Identities=17% Similarity=0.056 Sum_probs=41.6 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCceEEEEecCCCccccc Q lcl|NC_011308. 
13 MADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRKQEVKIGNSSDYAIYY 92 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~yV 92 (154) |++- +..+++.++++.|...-.+.|+ --.+-+.|.+-..| |..- T Consensus 1 m~vt-~~~~~~~~~~~~l~~L~~k~v~----------------vGi~~~d~~~~~~I-------------------a~~~ 44 (199) T protein:vir:80 1 MKVT-TDKSTMNKAIRELDQLDRYSLQ----------------IGLFGEDDSFIQMI-------------------AGVH 44 (199) T ss_pred Cccc-ccHHHHHHHHHHHHHhcCCEEE----------------EEEecCCCcchhhe-------------------eehh Confidence 7642 2334444444433211000000 00011111111111 1222 Q ss_pred ccCccccc-----------------------cCCCCccccccee-cccCceee--c--CCCCCCchhHHHHHHHHHHHHH Q lcl|NC_011308. 93 EFGTGEKS-----------------------EKGGGRAGGWSYM-DKNGKWHF--T--RGSKASKRMRYTFRDEKSKVKD 144 (154) Q Consensus 93 E~GTg~~~-----------------------~~~~~~~~~~~~~-~~~g~~~~--t--~g~~a~PFl~pA~~~~~~~i~~ 144 (154) |||..... ..|+++...-+.. ...+.++. . ...||||||+|+++++++++.+ T Consensus 45 E~Ga~I~~~~~~l~Ip~~~a~~~k~~~~~~~~~p~g~~~~~~~~~~~~~~~~e~g~~~~~IP~RPFlr~t~~~~~~~~~~ 124 (199) T protein:vir:80 45 EFGLTIRPKGKYLTIPTPEAGDRRARDIPGLFKPKGKNILAVAGPDGKLTVMFYLKTEVNIPERSFLRSTFDEKSNKWGE 124 (199) T ss_pred hcCCeeecCCceeeecchhhhcccccccCcccccCCcceeeeeccccceeeeeeccccccCCCCchhHHHHHHHHHHHHH Confidence 33321100 0011110000000 00000000 0 2469999999999999999999 Q ss_pred HHHHHhhc-----cC Q lcl|NC_011308. 145 YVIKVFGG-----LD 154 (154) Q Consensus 145 ~i~~~l~~-----l~ 154 (154) .+.+.++. 
+| T Consensus 125 ~~~~~~~~vl~g~~~ 139 (199) T protein:vir:80 125 LFEGWIDDVIHGKLS 139 (199) T ss_pred HHHHHHHHHHhCCCc Confidence 88888874 44 No 163 >protein:vir:7412 Length: 168 # NCBI annotation: hypothetical protein # Family: family:all:1029 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839929;genbank:gi:30089899;genbank:GeneID:1260686 Probab=93.07 E-value=0.0022 Score=35.15 Aligned_cols=128 Identities=13% Similarity=0.073 Sum_probs=61.4 Q ss_pred cchhhhhHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHH--HHhcCCcccc---ccccceeEEee-cCc-- Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDT-------AEKALKQIGEHMKTEIAEGG--HGDSSNNVTG---EYANKTDFEVD-KRK-- 77 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~-------~~~~v~~a~~~~~~~i~~~~--ak~~aPvdTG---~Lr~SI~~~~~-~~~-- 77 (154) |++ |.+.|...++++.+. ....|..|..+.-++..... .+-..-..|| .|++||..+-. .++ T Consensus 1 M~~---~~~~l~~~~~~vekl~~~lt~eqkakITkAGAkv~~~~L~~~t~~kHy~~k~t~~~~HLaDsI~~~~~niDg~~ 77 (168) T protein:vir:74 1 MAT---FEEAMQLIINQAESLSTKMTVEDKAEVTKAGAKVFEQALAYEVRNRHYRHRDTGEDPHLADSIVMKNKNIDGVK 77 (168) T ss_pred Ccc---HHHHHHHHHHHHHhhccCCCHHHHHHHHHhhhHHHHHHHHHHhHHhhcccCCCcccchhhhheeecccccCccc Confidence 864 445455555555432 23333333333222211111 1222334566 89999975421 111 Q ss_pred -eEEEEecCC----------CcccccccCccccccCCCCcccccceecccCceeec---CCCCCCchhHHHHHH--HHHH Q lcl|NC_011308. 78 -QEVKIGNSS----------DYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFT---RGSKASKRMRYTFRD--EKSK 141 (154) Q Consensus 78 -~~~~V~~~~----------~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t---~g~~a~PFl~pA~~~--~~~~ 141 (154) =+.+||-+. --|.|++.||.-|.. ....|+-|.- ..+++-+|+..+=++ .++. 
T Consensus 78 dG~s~VGf~~k~~~~~~~kA~iAr~lNDGTk~~~~-----------~~~~~~~~~~~g~v~i~gDHFvd~~r~~~~~k~~ 146 (168) T protein:vir:74 78 DGQSVVGWERSTEKGTHTKGYIANIINNGSRFPQF-----------TTRSGRKYKKPGEVAVHADHFIEETRMNLIVQQG 146 (168) T ss_pred CCceeecccccccccccchhhhhhhhccccccccc-----------ccccccccccccccccccchhHHHHHhhhhhHHH Confidence 123566543 368999999863321 1112222221 246888999887776 3455 Q ss_pred HHHHHHHHhhcc--------C Q lcl|NC_011308. 142 VKDYVIKVFGGL--------D 154 (154) Q Consensus 142 i~~~i~~~l~~l--------~ 154 (154) |-+...+++++| | T Consensus 147 V~~Ae~~~y~eIl~~k~~~~~ 167 (168) T protein:vir:74 147 ILKAEAEAMRKIINRKKKENN 167 (168) T ss_pred HHHHHHHHHHHHHHhhcCCCC Confidence 444443333333 3 No 164 >protein:vir:1028 Length: 168 # NCBI annotation: Orf48 # Family: family:all:1029 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076682;genbank:gi:13095791;genbank:GeneID:920342 Probab=91.21 E-value=0.0043 Score=33.55 Aligned_cols=128 Identities=12% Similarity=0.059 Sum_probs=60.4 Q ss_pred cchhhhhHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHH--HhcCCcccc---ccccceeEEee-cCc-- Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDD-------TAEKALKQIGEHMKTEIAEGGH--GDSSNNVTG---EYANKTDFEVD-KRK-- 77 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~-------~~~~~v~~a~~~~~~~i~~~~a--k~~aPvdTG---~Lr~SI~~~~~-~~~-- 77 (154) |.+ |.+.|...++++++ .....|..|..+.-++...... +-..-.+|| .|++||..+-. .++ T Consensus 1 M~~---~~d~l~~~~~~vekl~~~ls~eqkakITkAGAkv~~~~L~~~tk~kHy~~k~t~~~~HLaDsI~~~~~niDg~~ 77 (168) T protein:vir:10 1 MVS---FYDAMQLIVDRAEELSTKMSVEDKAEVTKAGAKVFEQALAYEVRNRHYRHRDTGEDPHLADSIVMKNKNIDGVK 77 (168) T ss_pred CCc---HHHHHHHHHHHHHHhhcCCCHHHHHHHhHhhhHHHHHHHHHHhhHhhhccCCCCccchhhhhheeccccccccc Confidence 754 44444444443333 3334444443333222222221 122234666 89999975421 111 Q ss_pred -eEEEEecC----------CCcccccccCccccccCCCCcccccceecccCceeec---CCCCCCchhHHHHHHH--HHH Q lcl|NC_011308. 
78 -QEVKIGNS----------SDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFT---RGSKASKRMRYTFRDE--KSK 141 (154) Q Consensus 78 -~~~~V~~~----------~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t---~g~~a~PFl~pA~~~~--~~~ 141 (154) =+.+||-+ +--|.|++.||.-|.. ....|+-|.- ..+++-+|+..+-++. ++. T Consensus 78 dG~s~VGf~~k~~~~~~~ka~iAr~lNDGTk~~~~-----------~~~~~~~~~~~g~v~i~gDHFvd~~r~d~a~k~~ 146 (168) T protein:vir:10 78 DGQSVVGWERSTEKGTHTKGYIANIINNGSRFPQF-----------TTRSGRKYKKPGEVAVHADHFIEETRKNPIVQQG 146 (168) T ss_pred CCceeecccCccccccccchheeeecccccccccc-----------ccccccccccccccccccchhHHHhhhchhhhHH Confidence 12345543 3458899999963321 1111222221 2468888998877753 444 Q ss_pred HH--------HHHHHHhhccC Q lcl|NC_011308. 142 VK--------DYVIKVFGGLD 154 (154) Q Consensus 142 i~--------~~i~~~l~~l~ 154 (154) |- ++|.+.-++=| T Consensus 147 V~~Ae~~~y~eIl~~k~~~~~ 167 (168) T protein:vir:10 147 ILKAEAEAMRKIINRKKKESN 167 (168) T ss_pred HHHHHHHHHHHHHHhhcCCCC Confidence 43 34444433333 No 165 >protein:vir:95260 Length: 160 # NCBI annotation: Phage conserved protein # Family: family:all:31735 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944893;genbank:gi:38707833;genbank:GeneID:2744046 Probab=91.21 E-value=0.00077 Score=37.64 Aligned_cols=85 Identities=13% Similarity=0.082 Sum_probs=33.7 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCceEEEEecCCCccccc Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRKQEVKIGNSSDYAIYY 92 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~yV 92 (154) |- +.+....++.-.+.+++.++. -|.-|-..+.=.. +++. 
+-+..|.|- T Consensus 1 ~~---------------------~~~~~~G~~~L~~~~k~l~~~--~V~VGi~~d~g~~---~dG~-----sv~~vA~~~ 49 (160) T protein:vir:95 1 MV---------------------KRVIHPARAKLVGAMKNLQTA--NAQVGYFQEQGQH---SSGF-----SYPALMYLQ 49 (160) T ss_pred Cc---------------------eeechHhHHHHHHHHHHHhCC--eeEEeeccccccC---CCCc-----cHHHHHhhh Confidence 11 111111111111111111111 1333443322001 1111 224688999 Q ss_pred ccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHH-----HHHHHH---HHHHHHhhccC Q lcl|NC_011308. 93 EFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRD-----EKSKVK---DYVIKVFGGLD 154 (154) Q Consensus 93 E~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~-----~~~~i~---~~i~~~l~~l~ 154 (154) ||||. ..|++|||+++++. +...+. +.+...+.+.. T Consensus 50 EfG~~--------------------------~iPaRPf~R~tfe~~~~~~~~~~~~~~~~~i~~~~~~g~ 93 (160) T protein:vir:95 50 EVIGV--------------------------PSASGKVYRRLFEITMMLNKQTLLEQTKKNLYKQLSSLN 93 (160) T ss_pred hcCcc--------------------------cCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 99973 35899999999973 222222 22223333222 No 166 >protein:vir:3994 Length: 168 # NCBI annotation: unknown # Family: family:all:1029 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116502;genbank:gi:14251135;genbank:GeneID:921309 Probab=90.39 E-value=0.0062 Score=32.71 Aligned_cols=128 Identities=10% Similarity=0.031 Sum_probs=61.1 Q ss_pred cchhhhhHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHH--HHhcCCccc---cccccceeEEee-cCce- Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDTA-------EKALKQIGEHMKTEIAEGG--HGDSSNNVT---GEYANKTDFEVD-KRKQ- 78 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~-------~~~v~~a~~~~~~~i~~~~--ak~~aPvdT---G~Lr~SI~~~~~-~~~~- 78 (154) |++ |.+.|...++++.+.. ...|..+..+.-++..... .+-..-.+| +.|++||..+-. .++. 
T Consensus 1 M~~---~~d~l~~~~~~v~kl~~~lt~e~kakIT~AGAkv~a~~L~~~T~~kHy~~rktg~~~HLADsI~~~~~niDg~~ 77 (168) T protein:vir:39 1 MVS---FYDAMQLIINQAESLSTKMTVEDKAEVTKAGAKVFEQALAYEVRNRHYRHRDTGEDPHLADSIVMKNKNIDGVK 77 (168) T ss_pred Ccc---HHHHHHHHHHHHHhccCCCCHHHHHHHHHHhHHHHHHHHHHHhHHhcccCCCCCCCccchhheeecccccCccc Confidence 754 5555555555554422 2333333322222211111 122223345 689999975421 1110 Q ss_pred --EEEEec----------CCCcccccccCccccccCCCCcccccceecccCceee---cCCCCCCchhHHHHHHH--HHH Q lcl|NC_011308. 79 --EVKIGN----------SSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHF---TRGSKASKRMRYTFRDE--KSK 141 (154) Q Consensus 79 --~~~V~~----------~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~---t~g~~a~PFl~pA~~~~--~~~ 141 (154) +.+||- .+--|.|++.||.-+.. ....|.-|. +..+++-+|+..+-++. ++. T Consensus 78 dG~StVGw~~k~~~~~~~~a~iAr~lNDGTrf~~~-----------~~~~~~~y~~~g~v~i~gDHFvd~~r~~~a~k~a 146 (168) T protein:vir:39 78 DGQSVVGWERSTEKGTHTKGYIANIINNGSRFPQF-----------TTRSGRKYKNPGEVAVHADHFIEETRKNPIVQQG 146 (168) T ss_pred CCceeccccCccccccccchhheehhccccccchh-----------hhhcccccccccceeecccchhHHHhhhhhhhHH Confidence 234554 34458999999953211 011111111 12468889998888864 555 Q ss_pred HHHHHHHHhhccC Q lcl|NC_011308. 142 VKDYVIKVFGGLD 154 (154) Q Consensus 142 i~~~i~~~l~~l~ 154 (154) +-+....++++|= T Consensus 147 V~~Ae~e~~~eil 159 (168) T protein:vir:39 147 ILKAEAEAMRKII 159 (168) T ss_pred HHHHHHHHHHHHH Confidence 5555544444442 No 167 >protein:vir:102608 Length: 108 # NCBI annotation: gp9 # Family: family:all:3937 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655005;genbank:gi:109392195;genbank:GeneID:4157230 Probab=89.78 E-value=0.00097 Score=37.11 Aligned_cols=100 Identities=18% Similarity=0.265 Sum_probs=49.7 Q ss_pred cchhhh-------hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeec-CceEEEEec Q lcl|NC_011308. 
13 MADDIK-------FEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDK-RKQEVKIGN 84 (154) Q Consensus 13 Ma~~v~-------~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~-~~~~~~V~~ 84 (154) ||+-.. |--.|+. .++|+ .+.+.+..-+ .+++. .-+.++||.||..|+|..+...+ +.-.+.|++ T Consensus 1 ma~gpt~knplakfgi~ldd-fdklp-evnqgvnef~----dev~a-awk~nspv~~g~yrdsvqvterstnkgrgkvga 73 (108) T protein:vir:10 1 MANGPTRKNPLAKFGVRLDD-FDKLP-EVNQGVNEFI----DEVVA-AWKNNSPVGTGAYRDSVQVTERSTNKGRGKVGA 73 (108) T ss_pred CCCCCccccchhhhccchhh-hhccc-hhhhhHHHHH----HHHHH-hhhcCCCccccccccceeecccccccccccccC Confidence 765322 2222222 12222 2333333333 33332 34788999999999999764332 334578999 Q ss_pred CCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHH Q lcl|NC_011308. 85 SSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRD 137 (154) Q Consensus 85 ~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~ 137 (154) ..+.|..||||+-....-....++. +.|=--|+.+ T Consensus 74 tdpqahlvefgs~hndeyapaqkta------------------kqfggtay~d 108 (108) T protein:vir:10 74 TDPQAHLVEFGSAHNDEYAPAQKTA------------------KQFGGTAYGD 108 (108) T ss_pred cchhhhhhhhhccccccccchhhhH------------------HhhcccccCC Confidence 9999999999974322111111110 0000001111 No 168 >protein:vir:105825 Length: 108 # NCBI annotation: gp9 # Family: family:all:3937 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655770;genbank:gi:109522093;genbank:GeneID:4157633 Probab=89.78 E-value=0.00097 Score=37.11 Aligned_cols=100 Identities=18% Similarity=0.265 Sum_probs=49.7 Q ss_pred cchhhh-------hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeec-CceEEEEec Q lcl|NC_011308. 13 MADDIK-------FEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDK-RKQEVKIGN 84 (154) Q Consensus 13 Ma~~v~-------~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~-~~~~~~V~~ 84 (154) ||+-.. |--.|+. .++|+ .+.+.+..-+ .+++. 
.-+.++||.||..|+|..+...+ +.-.+.|++ T Consensus 1 ma~gpt~knplakfgi~ldd-fdklp-evnqgvnef~----dev~a-awk~nspv~~g~yrdsvqvterstnkgrgkvga 73 (108) T protein:vir:10 1 MANGPTRKNPLAKFGVRLDD-FDKLP-EVNQGVNEFI----DEVVA-AWKNNSPVGTGAYRDSVQVTERSTNKGRGKVGA 73 (108) T ss_pred CCCCCccccchhhhccchhh-hhccc-hhhhhHHHHH----HHHHH-hhhcCCCccccccccceeecccccccccccccC Confidence 765322 2222222 12222 2333333333 33332 34788999999999999764332 334578999 Q ss_pred CCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHH Q lcl|NC_011308. 85 SSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRD 137 (154) Q Consensus 85 ~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~ 137 (154) ..+.|..||||+-....-....++. +.|=--|+.+ T Consensus 74 tdpqahlvefgs~hndeyapaqkta------------------kqfggtay~d 108 (108) T protein:vir:10 74 TDPQAHLVEFGSAHNDEYAPAQKTA------------------KQFGGTAYGD 108 (108) T ss_pred cchhhhhhhhhccccccccchhhhH------------------HhhcccccCC Confidence 9999999999974322111111110 0000001111 No 169 >protein:vir:487 Length: 187 # NCBI annotation: hypothetical protein # Family: family:all:2152 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543094;swissprot:trembl:q8w625;genbank:gi:18249906;uniprot:Q8W625;genbank:GeneID:929690 Probab=88.97 E-value=0.0076 Score=32.20 Aligned_cols=144 Identities=12% Similarity=0.186 Sum_probs=70.3 Q ss_pred CCcceeeecCCc-cchh----hhhH--H-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccc Q lcl|NC_011308. 1 MRSRLLIDRGGH-MADD----IKFE--M-------DMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYA 66 (154) Q Consensus 1 ~~~~~~~~~~~~-Ma~~----v~~~--~-------~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr 66 (154) |..-+--|.|-+ |++. |+|. + .+.+...++.+.......+.++..+ +.....+.--.||+|. 
T Consensus 1 ~~~~~~~~~~~nam~~~~~lHvdF~qp~~~~Fnr~riRraF~~iGq~h~r~ArrLvm~RG----rs~pge~P~~qTGrLa 76 (187) T protein:vir:48 1 MKNCVQRDDGVNAMNQTAFLHVDFKQPKELEFNRARLRRAFVQIGRVYMRDARRLVIKRG----RSGPGENPGYQTGRLA 76 (187) T ss_pred CccccccccchhhhhhccceeEeeecCCceeecHHHHHHHHHHHhHHHHHHHHHHHHhcc----cCCCCCCCcchhhhhh Confidence 665555555433 3321 2221 1 1222222222222111111111100 1111223234899999 Q ss_pred cceeEEee-----cCceEEEEecC--------------CCcccccccCcc--ccccCCCCcccccceecccCceeecCCC Q lcl|NC_011308. 67 NKTDFEVD-----KRKQEVKIGNS--------------SDYAIYYEFGTG--EKSEKGGGRAGGWSYMDKNGKWHFTRGS 125 (154) Q Consensus 67 ~SI~~~~~-----~~~~~~~V~~~--------------~~YA~yVE~GTg--~~~~~~~~~~~~~~~~~~~g~~~~t~g~ 125 (154) .||.+.|. ..|+-+.|-.| .-|-.|.+||-. ..+...+.+... ......|.. . T Consensus 77 ~SIgy~Vpkat~~RpG~mVkIaPNqk~G~g~r~~Pi~gdfYPafL~YGVr~ga~~~~~~~k~~~---~~~~sgwri---a 150 (187) T protein:vir:48 77 RSIGYYVPKKTTRRPGLMVKISPNQKNGQGNRRFPEGAPYYPAFLYYGVRHSAYGMDKKDKRQK---KHHSSTFRL---A 150 (187) T ss_pred hhhhhccccccCCCCcceEEecCCcccCcccccccccccchhHHHHhhhhhhhhccchhhhhhh---cccCCccee---c Confidence 99987764 45666666655 248889999852 222211111000 000122222 2 Q ss_pred CCCchhHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_011308. 126 KASKRMRYTFRDEKSKVKDYVIKVFGGLD 154 (154) Q Consensus 126 ~a~PFl~pA~~~~~~~i~~~i~~~l~~l~ 154 (154) |-.-||..+++..+......+..+|+.-= T Consensus 151 PR~Nym~~~L~~~~~wt~~~L~raL~~sL 179 (187) T protein:vir:48 151 PRNNFMADVIERRRHWTQELLSRELQRSL 179 (187) T ss_pred cchhHHHHHHHhhHHHHHHHHHHHHHHhc Confidence 55569999999999998888888876533 No 170 >protein:vir:6375 Length: 205 # NCBI annotation: hypothetical protein # Family: family:all:10491 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918988;genbank:gi:34610163;genbank:gi:91214209;genbank:GeneID:2559587 Probab=88.90 E-value=0.025 Score=29.39 Aligned_cols=142 Identities=13% Similarity=0.115 Sum_probs=64.1 Q ss_pred cchhh--hhHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHhcCCccccccccceeEEe----ecCceEEEE Q lcl|NC_011308. 
13 MADDI--KFEMDMSKIKDMFDDTAEKA----LKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEV----DKRKQEVKI 82 (154) Q Consensus 13 Ma~~v--~~~~~l~~~~~~l~~~~~~~----v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~----~~~~~~~~V 82 (154) |++++ ++.+++.+.++.+++...+. |++.+.+...+..+...........+.|..++.+.+ +.+.+++.| T Consensus 1 m~i~v~~~G~~~~~~~l~~l~~~~~~a~~~AIN~ta~~~~~~~A~~~i~~~vn~k~~yv~~~~Rlti~k~As~~~L~A~I 80 (205) T protein:vir:63 1 MSIEIVAEGLGEFRDYVDRLPDISQQAAMIAINQTAQRTALPLARTEIGEQVNFPDNYLKDDSRLGVTKKATRNDLEAVI 80 (205) T ss_pred CeeeeehhhHHHHHHHHHhcchhhhHHHHHHHHHHHHHhhHHHHHHhhhhccccchhhhccceeeEEEeecCCCCeeEEE Confidence 99875 45566777777776555444 444444444344443333445566889997554432 556789999 Q ss_pred ecCCCcccccccCccccccCCC----------Ccccccc------------------------eecccCceee-cCCCC- Q lcl|NC_011308. 83 GNSSDYAIYYEFGTGEKSEKGG----------GRAGGWS------------------------YMDKNGKWHF-TRGSK- 126 (154) Q Consensus 83 ~~~~~YA~yVE~GTg~~~~~~~----------~~~~~~~------------------------~~~~~g~~~~-t~g~~- 126 (154) .+...=-..--|++.....+++ ...+.|+ +....|++.. ..|+. T Consensus 81 ~ar~rpt~LsRF~~p~~~~~~~r~~GVsV~Vk~G~ak~l~gaF~~~lk~g~~l~e~~~~vgva~R~~~g~~~~~~~g~~k 160 (205) T protein:vir:63 81 GARQRPTSLARFAEPGQTTKSTRKGGVSVVVKPGRTKQFKRGFLVRLRAGKTLTEDKYNLGLAVRLSPGETLHATDGATK 160 (205) T ss_pred ecCCCcceeeeccCCCccccccccCCeEEEEEcCCCeeccCceEEEeeccccccccccceEEEeeecCccccccccCcee Confidence 8865422233334322221111 0011111 0111122111 12221 Q ss_pred -CCc---hhHHHHHHHHH----HHHHHHHHHh-hccC Q lcl|NC_011308. 127 -ASK---RMRYTFRDEKS----KVKDYVIKVF-GGLD 154 (154) Q Consensus 127 -a~P---Fl~pA~~~~~~----~i~~~i~~~l-~~l~ 154 (154) +.+ +.-|++.+.-. 
.|...|...| ++.+ T Consensus 161 ~~~~~k~LYGPSV~Qvf~~~~e~I~~~i~~~l~~~f~ 197 (205) T protein:vir:63 161 LSNNVYLLYGPSVDQVFRTVADDITTEVLDALADEFL 197 (205) T ss_pred cCCceEEEEcCcHHHHHhhhhhhhhHHHHHHHHHHHH Confidence 223 45666655433 2322222221 2222 No 171 >protein:vir:8432 Length: 149 # NCBI annotation: gp27 # Family: family:all:30885 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818328;genbank:gi:29566764;genbank:GeneID:1260117 Probab=88.09 E-value=0.016 Score=30.40 Aligned_cols=130 Identities=17% Similarity=0.242 Sum_probs=57.6 Q ss_pred CCcceeee---cCCccchhhhhHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHhcCCccccccccceeEE-- Q lcl|NC_011308. 1 MRSRLLID---RGGHMADDIKFEMDMSKIKDMFDDTAEKA---LKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFE-- 72 (154) Q Consensus 1 ~~~~~~~~---~~~~Ma~~v~~~~~l~~~~~~l~~~~~~~---v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~-- 72 (154) |-.-++|+ =-|..-+-|.-..+|.++.+.+-+.+++- +-++.+- +-+++. ..---|..||+.+..|+-. T Consensus 1 mprdvvvklrdvrgalldgvsssrdlrrivqrfindveqtwhdvwdvsml--gvlaqq-tgvphpyqtgdykahikkkkl 77 (149) T protein:vir:84 1 MPRDVVVKLRDVRGALLDGVSSSRDLRRIVQRFINDVEQTWHDVWDVSML--GVLAQQ-TGVPHPYQTGDYKAHIKKKKL 77 (149) T ss_pred CCchheehhhhhhhhhhhccccchHHHHHHHHHHHHHHHHHHhHhhHHHH--HHHHhh-cCCCCCccccchhhhhhhhhH Confidence 33222221 11111011111223444444333333222 2222221 112221 1222477999999988421 Q ss_pred ----------eecCc-eEEEEecCCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHH Q lcl|NC_011308. 73 ----------VDKRK-QEVKIGNSSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSK 141 (154) Q Consensus 73 ----------~~~~~-~~~~V~~~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~ 141 (154) +-.++ -.+-|++|.+-|.|+||||.- .+| |...||. 
..+ | .|||+-.+ + T Consensus 78 tamqkirikkflkggmpiglvynndekahwieygtkr--drp-gsrspwg-----------pnt-p----tpafeimq-r 137 (149) T protein:vir:84 78 TAMQKIRIKKFLKGGMPIGLVYNNDEKAHWIEYGTKR--DRP-GSRSPWG-----------PNT-P----TPAFEIMQ-R 137 (149) T ss_pred HHHHHHHHHHHhhcCCceeEEecCCcchhhhhhcccc--CCC-CCCCCCC-----------CCC-C----ChhHHHHH-H Confidence 12334 457899999999999999842 122 2334443 111 2 35555433 3 Q ss_pred HHHHHHHHhhcc Q lcl|NC_011308. 142 VKDYVIKVFGGL 153 (154) Q Consensus 142 i~~~i~~~l~~l 153 (154) +.++|++-++=- T Consensus 138 varimnedvryr 149 (149) T protein:vir:84 138 VARIMNEDVRYR 149 (149) T ss_pred HHHHhhhhcccC Confidence 455555433322 No 172 >protein:vir:96763 Length: 177 # NCBI annotation: putative phage-related protein # Family: family:all:1091 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039824;genbank:gi:126010915;genbank:GeneID:5076273 Probab=87.96 E-value=0.035 Score=28.54 Aligned_cols=147 Identities=14% Similarity=0.094 Sum_probs=60.8 Q ss_pred CCcceeeecCCccchhhhh-HHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeec Q lcl|NC_011308. 1 MRSRLLIDRGGHMADDIKF-EMDMSKIKD----MFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDK 75 (154) Q Consensus 1 ~~~~~~~~~~~~Ma~~v~~-~~~l~~~~~----~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~ 75 (154) |- ++=.|.++++. ...+...+. .+++++..+|++++..+-.++++..++... +....+++.+.+.-.+ T Consensus 1 ~~------~~~~l~idv~~~l~~i~~~l~~~~~~~~~A~~rAlNrta~~~rt~~~r~v~~~~~-i~~k~ir~r~~~~~a~ 73 (177) T protein:vir:96 1 MA------HGFEMKIDVSREAEDIAAMVAATTKQLELAAQRAMTKAGQWLRTHSVRELGQQLG-IKQEPLKKRFRVYPQR 73 (177) T ss_pred CC------cCceeEEehhHHHHHHHHHHhhcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-CCHHHHHhhheeeccC Confidence 22 22223334432 223333344 334445555555555544455554444443 3467788888776555 Q ss_pred CceEEEEecCCCcccccccCccccccCC---C-----------CcccccceecccCce---eecCCCCCCchhHHHHHHH Q lcl|NC_011308. 
76 RKQEVKIGNSSDYAIYYEFGTGEKSEKG---G-----------GRAGGWSYMDKNGKW---HFTRGSKASKRMRYTFRDE 138 (154) Q Consensus 76 ~~~~~~V~~~~~YA~yVE~GTg~~~~~~---~-----------~~~~~~~~~~~~g~~---~~t~g~~a~PFl~pA~~~~ 138 (154) +.+++.|.++..--+...||+......+ . .+.++|..+...|+. +..-..|--|=|..+++.. T Consensus 74 ~~~~~~i~~~~~~i~l~~~~~~r~t~~Gv~~g~~~~~gaFia~~~~g~~~Vf~R~gk~R~PI~~~~~pi~~~~~~~~e~~ 153 (177) T protein:vir:96 74 QKGEVRFWVGLDPIGVYRLGTPKVTQKGVKVNRNEYDGAFISPMKSNYPLVFKRRGKERLPIDLVDEDIDEPAMEVVERW 153 (177) T ss_pred CCcEEEEEEeccceehhhcccCCCCccceEEeeEEcCCceeccCCCCCceEEEEecCCccceEEEEcCchHHHHHHHHHH Confidence 6677777776543344456642111100 0 011122222211110 0000122223344555554 Q ss_pred HHHHHHHHHHH-hhccC Q lcl|NC_011308. 139 KSKVKDYVIKV-FGGLD 154 (154) Q Consensus 139 ~~~i~~~i~~~-l~~l~ 154 (154) .+++.+.+.+. -+||+ T Consensus 154 ~~~~~~~~~~~l~~Ei~ 170 (177) T protein:vir:96 154 ERRVFQRFKELFEQEAR 170 (177) T ss_pred HHHHHHHHHHHHHHHHH Confidence 44443322221 12333 No 173 >protein:vir:78163 Length: 92 # NCBI annotation: hypothetical protein # Family: family:all:29889 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294805;genbank:gi:149882826;genbank:GeneID:5309191 Probab=86.29 E-value=0.0025 Score=34.81 Aligned_cols=88 Identities=19% Similarity=0.235 Sum_probs=42.8 Q ss_pred cchhhhhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEe--ecCceEE-EEecCCCc Q lcl|NC_011308. 13 MADDIKFEMD-MSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEV--DKRKQEV-KIGNSSDY 88 (154) Q Consensus 13 Ma~~v~~~~~-l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~--~~~~~~~-~V~~~~~Y 88 (154) |++-+.-... .+.+++. .--+++.+ .+++-.-..||.++|||||..|+...++- ....-+. .|+++. 
- T Consensus 1 madaftpNp~~FDqIl~s---~~VrALt~----gaAe~aLa~AKAsAPVDTGAYRDGL~iE~~q~~~RtT~MVVG~D~-K 72 (92) T protein:vir:78 1 MADAFTPNPTWFDQIMRT---PKVRALVD----GVAEETLADAKASAPVDTGAYRDGLHIEHRQGRSRETAMVVGSDE-K 72 (92) T ss_pred CCCccCCChhHHHHhhcc---cchhhhhh----hhhhhhhhhhcccCcccccccccccchhhhhccccceeEEeecCc-c Confidence 8875442211 1111111 11122222 22333345689999999999999986543 1222233 455553 4 Q ss_pred ccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHH Q lcl|NC_011308. 89 AIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKS 140 (154) Q Consensus 89 A~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~ 140 (154) -..||--||.-... +...+. T Consensus 73 TlLvESrTGNLaka--------------------------------lk~~rs 92 (92) T protein:vir:78 73 TLLIESRTGNLARS--------------------------------VKRRRS 92 (92) T ss_pred eeeeecccchHHHH--------------------------------HhhhcC Confidence 56788877633221 111111 No 174 >protein:vir:396 Length: 184 # NCBI annotation: gp11 # Family: family:all:869 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046906;genbank:gi:9630476;genbank:GeneID:1261650 Probab=85.03 E-value=0.053 Score=27.57 Aligned_cols=138 Identities=16% Similarity=0.138 Sum_probs=54.8 Q ss_pred hhhhhHHHHHHHHHHHH-HHHHHHHHHHHHHHH----HHHHHHHHHhcCCccccccccceeEE-eecCceEEEEecCCCc Q lcl|NC_011308. 15 DDIKFEMDMSKIKDMFD-DTAEKALKQIGEHMK----TEIAEGGHGDSSNNVTGEYANKTDFE-VDKRKQEVKIGNSSDY 88 (154) Q Consensus 15 ~~v~~~~~l~~~~~~l~-~~~~~~v~~a~~~~~----~~i~~~~ak~~aPvdTG~Lr~SI~~~-~~~~~~~~~V~~~~~Y 88 (154) ++|++.+++.+.+..++ +++.+++.+|+..++ .++++...+.. -+.-..+++.+++. 
-+.+.+++.|.++..- T Consensus 1 ~~v~~l~~~~~~L~~l~~~~v~kA~~rAiNrt~~~~rt~~~r~v~~~~-~i~~~~ir~r~~~~kas~~~l~a~I~~~~~~ 79 (184) T protein:vir:39 1 MSLKGLEQAIENLNSISKTAVPRASAQAVNRVANRAVSRSVAVVSKDT-RVPRKLVKQRARVKRATVNKPRALIRVNRGN 79 (184) T ss_pred CchHHHHHHHHHHhccCHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-cCCHHHHHhhheecccCCCCeEEEEEEeccc Confidence 56888888877777763 334555555544444 33333333332 34456677777653 2345566776655432 Q ss_pred ccccccCcccc-------------ccCCCCcc---cccceecccCceee-cC-----------CCC-CCchhHHHHHHHH Q lcl|NC_011308. 89 AIYYEFGTGEK-------------SEKGGGRA---GGWSYMDKNGKWHF-TR-----------GSK-ASKRMRYTFRDEK 139 (154) Q Consensus 89 A~yVE~GTg~~-------------~~~~~~~~---~~~~~~~~~g~~~~-t~-----------g~~-a~PFl~pA~~~~~ 139 (154) -+...+|+... +..-.|+. ..|....++|.++. .+ ..| +.| +..++++.- T Consensus 80 i~l~~~g~~~~k~~~~~~~~~~~~~~~~~g~~~~~gaFia~~~~G~~~Vf~R~gk~R~PI~~~~~~i~~~-~~e~~~~~~ 158 (184) T protein:vir:39 80 LPAIKLGTASVRLSRRKRDKKGANSVLRIGPFRFPGGFIQQLKNGRWHVMRRTSKPRYPIEVVSIPLAAP-LTTAFKEEL 158 (184) T ss_pred eeeeeccccccccCccccccccccceeeecceecCcceeeecCCCceEEEEEecCcccceeEEEcCchHH-HHHHHHHHH Confidence 22233443110 00000110 01111112333211 11 112 222 233333221 Q ss_pred -----HH----HHHHHHHHhhccC Q lcl|NC_011308. 140 -----SK----VKDYVIKVFGGLD 154 (154) Q Consensus 140 -----~~----i~~~i~~~l~~l~ 154 (154) +. +.+.|..+|+.+= T Consensus 159 ~~~~~~~~~~el~~~l~~~L~~~l 182 (184) T protein:vir:39 159 PKLMESDMPKELRASLTNQLRLIL 182 (184) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhc Confidence 12 2222222222222 No 175 >protein:vir:4514 Length: 168 # NCBI annotation: unknown # Family: family:all:2152 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599040;genbank:gi:19548998;genbank:GeneID:935228 Probab=84.76 E-value=0.027 Score=29.15 Aligned_cols=138 Identities=12% Similarity=0.217 Sum_probs=65.4 Q ss_pred CCc-ceeeecCCccchhhhhHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecC-- Q lcl|NC_011308. 
1 MRS-RLLIDRGGHMADDIKFEM-DMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKR-- 76 (154) Q Consensus 1 ~~~-~~~~~~~~~Ma~~v~~~~-~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~-- 76 (154) |-+ -|-|| =+--.++.|.. .+.+..-++...-.+...+.++..+ +.....+.--.||.|..||...|... T Consensus 1 m~~~~lHvd--F~qp~~~~Fnr~riRraFv~igq~hmr~ArrlV~rrg----rs~pGe~P~~qTGrLa~SIgy~Vpras~ 74 (168) T protein:vir:45 1 MTTSFLHVD--FQQPAEMRFNRARVRRAFVTIGQRHMRDARRLVMRHA----RSAPGENPGYQTGRLARSIGYMVPRASK 74 (168) T ss_pred CCccceeee--eecCCceeecHHHHHHHHHHHhHHHHHHHHHHHhhcc----cccCCCCCcchhhhhhhhhhhccccccC Confidence 322 22222 00000111221 2222222222222222222222111 11222332348999999998766433 Q ss_pred ---ceEEEEecCC------------CcccccccCcccc--ccCCCCcccccceecccC--ceeecCCCCCCchhHHHHHH Q lcl|NC_011308. 77 ---KQEVKIGNSS------------DYAIYYEFGTGEK--SEKGGGRAGGWSYMDKNG--KWHFTRGSKASKRMRYTFRD 137 (154) Q Consensus 77 ---~~~~~V~~~~------------~YA~yVE~GTg~~--~~~~~~~~~~~~~~~~~g--~~~~t~g~~a~PFl~pA~~~ 137 (154) |+-+.|-.|. -|-.|.+||-.-- ..+.+. ....| .|.. .|-.-||..++++ T Consensus 75 ~rpG~mvkIaPNqk~G~g~r~i~gdfYPafL~YGVr~gakr~r~h~-------rga~ggsgwri---aPR~Nym~~~l~~ 144 (168) T protein:vir:45 75 HRPGFMARIAPNQRNGEGNRRITGDFYPAFLFYGVRGGAKRRRSHH-------RGASGGSGWRL---APRNNFMVETLEK 144 (168) T ss_pred CCCceEEEecCCCCCCCCCCccccccchhhhhhhhhcchhhhhhhh-------ccccCCCccee---ccchhhHHHHHHh Confidence 6666766553 3778888884211 111111 11111 2222 2555699999999 Q ss_pred HHHHHHHHHHHHhhccC Q lcl|NC_011308. 
138 EKSKVKDYVIKVFGGLD 154 (154) Q Consensus 138 ~~~~i~~~i~~~l~~l~ 154 (154) .+......+..+|+.-= T Consensus 145 ~~~wt~~~L~r~L~~sL 161 (168) T protein:vir:45 145 NRSWTRYFLARELRKSL 161 (168) T ss_pred hHHHHHHHHHHHHHHhc Confidence 99998888888876533 No 176 >protein:vir:99454 Length: 150 # NCBI annotation: hypothetical protein # Family: family:all:32760 # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919085;genbank:gi:119757043;genbank:GeneID:4606107 Probab=84.16 E-value=0.063 Score=27.18 Aligned_cols=129 Identities=20% Similarity=0.252 Sum_probs=76.8 Q ss_pred cchhhhhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEe--ecCceEEEEecCCCcc Q lcl|NC_011308. 13 MADDIKFEMDM-SKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEV--DKRKQEVKIGNSSDYA 89 (154) Q Consensus 13 Ma~~v~~~~~l-~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~--~~~~~~~~V~~~~~YA 89 (154) |..--.|+.++ +.++++++....+.|...+.+.+-+|.+..-...- .|--.|-..=.+++ .++...++-+- -+=| T Consensus 1 mt~l~~f~~d~re~lld~le~~areeiap~vq~~ahdile~yg~~hd-ydv~~iiea~et~v~rr~~rvvvr~gw-pepa 78 (150) T protein:vir:99 1 MTTLAGFEADAREALLDELEDHAREEIAPAVQQHAHDILEAYGREND-YDVQSIIDAAETRVERRKGSVVVRWGW-PEPA 78 (150) T ss_pred CCccchhhHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHhccccc-cchhhhhhhhhhheeecCCeEEEEecC-CCcc Confidence 77656677654 55678888888888888888888888765432221 11111111112222 33333333333 4568 Q ss_pred cccccCccccccCCCCcccccc------------eecccCcee-e-----cCCCCCCchhHHHHHHHHHHHH Q lcl|NC_011308. 90 IYYEFGTGEKSEKGGGRAGGWS------------YMDKNGKWH-F-----TRGSKASKRMRYTFRDEKSKVK 143 (154) Q Consensus 90 ~yVE~GTg~~~~~~~~~~~~~~------------~~~~~g~~~-~-----t~g~~a~PFl~pA~~~~~~~i~ 143 (154) +|.|-||-.|.+.-++-....| |....+.|. + ..|.|...|++.+++--+.++. 
T Consensus 79 iyfergt~dhvvea~nad~lsfvwedpp~wvre~fe~e~~g~rvfl~e~~v~glpesrfirdtln~lr~~fa 150 (150) T protein:vir:99 79 IFFERGTVDHVVEATNADVLSFIWEDPPRWVRQGYEREGGGWRVFLPEVEVSGLPESRFIRDTLNWLRRRFA 150 (150) T ss_pred eeeeccchhhhhhccccchhhhhhcCchhHhHhhcCcCCCceEEEeecccccCCcchhhHHHHHHHHHHhcC Confidence 9999999988775544332211 111122221 1 3589999999999999998887 No 177 >protein:vir:79034 Length: 141 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110729;genbank:gi:134287346;genbank:GeneID:4955208 Probab=75.63 E-value=0.1 Score=26.00 Aligned_cols=113 Identities=12% Similarity=0.104 Sum_probs=47.0 Q ss_pred hhhhH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEeecCceEE--EE------- Q lcl|NC_011308. 16 DIKFE----MDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVDKRKQEV--KI------- 82 (154) Q Consensus 16 ~v~~~----~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~--~V------- 82 (154) .=... +.|+++.+.|.+.....+.+.+.....++... ....+-..| -++.+.+.- .+ T Consensus 1 M~~~~~~d~~gl~~~~~~l~~~~~~~~~~~~~~~~~~~a~~-l~~~vk~~t---------PVdTG~Lr~sw~~~~~~~~~ 70 (141) T protein:vir:79 1 MARWGSVDFREFKRVCKKMEKLTKIDLDKFCKDAARELAAR-LLGKVIRRT---------PVDTGFLRQGWNGVAYARSL 70 (141) T ss_pred CCCCccCcHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHH-HHHHHHHhC---------CCcchhhccccccccccccc Confidence 11100 24555556665555555555555554444332 122221111 112221110 00 Q ss_pred ---ecCCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHH--HHHHHHHHHHHHHHhhc-cC Q lcl|NC_011308. 
83 ---GNSSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTF--RDEKSKVKDYVIKVFGG-LD 154 (154) Q Consensus 83 ---~~~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~--~~~~~~i~~~i~~~l~~-l~ 154 (154) .++..|.+ +-|+ ..++..+-..| ++-++++||..+++ +....++++.+.+.|++ || T Consensus 71 ~~~~~g~~~~v--~v~n----------~~~YA~~VE~G----hr~~~~~gfV~G~fml~~s~~~~~~~~~~~~~~~l~ 132 (141) T protein:vir:79 71 PVYKQGNNYII--EVVN----------PTEYASYVNFG----HRTKDGKGWVKGQHFLTISEMELQSQVDKIIEKKLL 132 (141) T ss_pred ceeecCCeeEE--EEec----------CCcchhhhhcc----eeecCCcceeCCchhHHHHHHHHHHHHHHHHHHHHH Confidence 01111211 1111 11222222233 23467788888877 66666777777666543 44 No 178 >protein:vir:78503 Length: 131 # NCBI annotation: gp18 # Family: family:all:2819 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491589;genbank:gi:157786412;genbank:GeneID:5625655 Probab=73.37 E-value=0.17 Score=24.79 Aligned_cols=121 Identities=8% Similarity=-0.017 Sum_probs=55.0 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc--CCccccccc---cceeEEeecCceEEEEecCCC Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDS--SNNVTGEYA---NKTDFEVDKRKQEVKIGNSSD 87 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~--aPvdTG~Lr---~SI~~~~~~~~~~~~V~~~~~ 87 (154) |+. +-+...|.++...++. +..+|.+...+.......+.+... .-+..|.-- .||.. ..+.+..-|+-+++ T Consensus 1 ma~-~~~~~~ln~vvA~l~~-vk~~v~~E~~~v~~RA~~NLa~Arast~~~k~~gp~hla~I~~--~~gdvD~~v~Ldap 76 (131) T protein:vir:78 1 MPL-YYGRSGLNKVVSHLPG-VVHEMRSEADEVADRAKANLAAARASTQWEKIHGPDHLTKITR--TNGSVDAYVNMEAP 76 (131) T ss_pred Ccc-cccchhhhhhhhhchh-HHHHHHHHHHhhhHHHHHHHHHHhhcCcccccccCCCcceeee--eeCCcceEEeecCC Confidence 873 4444555555554442 223333333333222222222222 111112222 34432 33345566778888 Q ss_pred cccccccCccccccCCCCcccccceecccCceeecC-----CCCCCchhHHHHHHH Q lcl|NC_011308. 88 YAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTR-----GSKASKRMRYTFRDE 138 (154) Q Consensus 88 YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~-----g~~a~PFl~pA~~~~ 138 (154) -|--+|||..|+++... 
.+.+...+.+.|.++.|+ |+-.-|-+..--... T Consensus 77 na~aIEfGhapsgvf~p-~k~G~~tkapeglYILTrAA~lgg~~~~~t~~~RGkr~ 131 (131) T protein:vir:78 77 SPESIEYGHYPSGVFDP-EKYGRVTKAPQGLYILTGAAGFGGQTAISTGAKRGKRG 131 (131) T ss_pred CchhheeccccccccCC-cccCcccCCCCcceeeecccccccccccccccccCCCC Confidence 89999999999888654 333334566677666553 111111111000000 No 179 >protein:vir:2347 Length: 131 # NCBI annotation: gp17 # Family: family:all:2819 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075284;genbank:gi:12657871;genbank:GeneID:920131 Probab=73.37 E-value=0.17 Score=24.79 Aligned_cols=121 Identities=8% Similarity=-0.017 Sum_probs=55.0 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc--CCccccccc---cceeEEeecCceEEEEecCCC Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDS--SNNVTGEYA---NKTDFEVDKRKQEVKIGNSSD 87 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~--aPvdTG~Lr---~SI~~~~~~~~~~~~V~~~~~ 87 (154) |+. +-+...|.++...++. +..+|.+...+.......+.+... .-+..|.-- .||.. ..+.+..-|+-+++ T Consensus 1 ma~-~~~~~~ln~vvA~l~~-vk~~v~~E~~~v~~RA~~NLa~Arast~~~k~~gp~hla~I~~--~~gdvD~~v~Ldap 76 (131) T protein:vir:23 1 MPL-YYGRSGLNKVVSHLPG-VVHEMRSEADEVADRAKANLAAARASTQWEKIHGPDHLTKITR--TNGSVDAYVNMEAP 76 (131) T ss_pred Ccc-cccchhhhhhhhhchh-HHHHHHHHHHhhhHHHHHHHHHHhhcCcccccccCCCcceeee--eeCCcceEEeecCC Confidence 873 4444555555554442 223333333333222222222222 111112222 34432 33345566778888 Q ss_pred cccccccCccccccCCCCcccccceecccCceeecC-----CCCCCchhHHHHHHH Q lcl|NC_011308. 88 YAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTR-----GSKASKRMRYTFRDE 138 (154) Q Consensus 88 YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~-----g~~a~PFl~pA~~~~ 138 (154) -|--+|||..|+++... .+.+...+.+.|.++.|+ |+-.-|-+..--... 
T Consensus 77 na~aIEfGhapsgvf~p-~k~G~~tkapeglYILTrAA~lgg~~~~~t~~~RGkr~ 131 (131) T protein:vir:23 77 SPESIEYGHYPSGVFDP-EKYGRVTKAPQGLYILTGAAGFGGQTAISTGAKRGKRG 131 (131) T ss_pred CchhheeccccccccCC-cccCcccCCCCcceeeecccccccccccccccccCCCC Confidence 89999999999888654 333334566677666553 111111111000000 No 180 >protein:vir:78298 Length: 131 # NCBI annotation: gp18 # Family: family:all:2819 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491670;genbank:gi:157786494;genbank:GeneID:5625776 Probab=73.37 E-value=0.17 Score=24.79 Aligned_cols=121 Identities=8% Similarity=-0.017 Sum_probs=55.0 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc--CCccccccc---cceeEEeecCceEEEEecCCC Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDS--SNNVTGEYA---NKTDFEVDKRKQEVKIGNSSD 87 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~--aPvdTG~Lr---~SI~~~~~~~~~~~~V~~~~~ 87 (154) |+. +-+...|.++...++. +..+|.+...+.......+.+... .-+..|.-- .||.. ..+.+..-|+-+++ T Consensus 1 ma~-~~~~~~ln~vvA~l~~-vk~~v~~E~~~v~~RA~~NLa~Arast~~~k~~gp~hla~I~~--~~gdvD~~v~Ldap 76 (131) T protein:vir:78 1 MPL-YYGRSGLNKVVSHLPG-VVHEMRSEADEVADRAKANLAAARASTQWEKIHGPDHLTKITR--TNGSVDAYVNMEAP 76 (131) T ss_pred Ccc-cccchhhhhhhhhchh-HHHHHHHHHHhhhHHHHHHHHHHhhcCcccccccCCCcceeee--eeCCcceEEeecCC Confidence 873 4444555555554442 223333333333222222222222 111112222 34432 33345566778888 Q ss_pred cccccccCccccccCCCCcccccceecccCceeecC-----CCCCCchhHHHHHHH Q lcl|NC_011308. 88 YAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTR-----GSKASKRMRYTFRDE 138 (154) Q Consensus 88 YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~-----g~~a~PFl~pA~~~~ 138 (154) -|--+|||..|+++... .+.+...+.+.|.++.|+ |+-.-|-+..--... 
T Consensus 77 na~aIEfGhapsgvf~p-~k~G~~tkapeglYILTrAA~lgg~~~~~t~~~RGkr~ 131 (131) T protein:vir:78 77 SPESIEYGHYPSGVFDP-EKYGRVTKAPQGLYILTGAAGFGGQTAISTGAKRGKRG 131 (131) T ss_pred CchhheeccccccccCC-cccCcccCCCCcceeeecccccccccccccccccCCCC Confidence 89999999999888654 333334566677666553 111111111000000 No 181 >protein:vir:7776 Length: 119 # NCBI annotation: gp21 # Family: family:all:2819 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817610;genbank:gi:29566040;genbank:GeneID:1259234 Probab=72.59 E-value=0.14 Score=25.23 Aligned_cols=110 Identities=16% Similarity=0.141 Sum_probs=52.3 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC-----Cc-cccccccceeEEeecCceEEEEecCC Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSS-----NN-VTGEYANKTDFEVDKRKQEVKIGNSS 86 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~a-----Pv-dTG~Lr~SI~~~~~~~~~~~~V~~~~ 86 (154) |+.-+ ....|.++...+.. +..+|.+...+...+...+.+..++ +. -.|-| .+| .+..+.+..-|+-++ T Consensus 1 Ma~~y-~~~~ln~vvA~l~~-v~~~v~~E~~~v~~RA~~NLa~Arast~~~k~~gp~~~-~~I--d~a~gdvD~~v~l~a 75 (119) T protein:vir:77 1 MARLI-GQKAMNHVISHLDG-VKDAVYAEAKERGRKAEANLAQARASTRWHKIFGPDHL-TRV--TVTRGDVDSFINLEA 75 (119) T ss_pred Ccccc-cccchhhhhhhchh-HHHHHHHHHHhhhHHHHHHHHHHhhcccccceecCCCc-cee--ccccCCcceEEeecC Confidence 87433 33444555444432 2233333333222222222222211 10 11112 222 223334455677788 Q ss_pred CcccccccCccccccCCCCcccccc-eecccCceeecC--CCCC Q lcl|NC_011308. 87 DYAIYYEFGTGEKSEKGGGRAGGWS-YMDKNGKWHFTR--GSKA 127 (154) Q Consensus 87 ~YA~yVE~GTg~~~~~~~~~~~~~~-~~~~~g~~~~t~--g~~a 127 (154) +-|--+|||..|.++...|.+.+.. 
-+.+.|.++.|+ |..+ T Consensus 76 pna~aIEfGhapsgvf~pG~~yg~vdtkapeglYILTrAA~l~g 119 (119) T protein:vir:77 76 PNAMAIEFGHQPSGVFGPGGMFGHLDTKAPEGLYIITSAAGLRG 119 (119) T ss_pred CCchhhcccccccceecccccccccCCCCCCCeEeeecccccCC Confidence 8899999999998887655544333 223577777765 2222 No 182 >protein:vir:6154 Length: 119 # NCBI annotation: hypothetical protein # Family: family:all:10918 # MgeID: mge:127 # MgeName: phBC6A51 # Cross-refs: genbank:acc:NP_852533;genbank:gi:31415793;genbank:GeneID:1489145 Probab=70.02 E-value=0.0043 Score=33.55 Aligned_cols=113 Identities=15% Similarity=0.151 Sum_probs=61.4 Q ss_pred CCcceeeecCCccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeEEee---cCc Q lcl|NC_011308. 1 MRSRLLIDRGGHMADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDFEVD---KRK 77 (154) Q Consensus 1 ~~~~~~~~~~~~Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~~~~---~~~ 77 (154) ||-|++++.-.+.- + +-+ ++.-..-+++.+...+ .+..+.|...+|.-.|.|..||--++. +.. T Consensus 1 mrirvvvkgksnvl---k-------ahn--pnryktpieqtvekht-rlqanqasnrapilhgplsesipasvkmvvgar 67 (119) T protein:vir:61 1 MRIRVVVKGKSNVL---K-------AHN--PNRYKTPIEQTVEKHT-RLQANQASNRAPILHGPLSESIPASVKMVVGAR 67 (119) T ss_pred CeeEEEeeccccee---c-------ccC--CccccccHHHHHHHhh-hhhcccccccCceeecccccccchhhhhhhhhh Confidence 98888887554221 1 100 1122222333333322 344566778899999999999955442 344 Q ss_pred eEEEEecCCCcccccccCccccccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_011308. 78 QEVKIGNSSDYAIYYEFGTGEKSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGGLD 154 (154) Q Consensus 78 ~~~~V~~~~~YA~yVE~GTg~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~l~ 154 (154) +.++-++..-||.-.||-.. .++|-+..|.+.-.|||.. .|.+.++++. 
T Consensus 68 iigtygspliyaavqefthk----------------tkkgfmrktafegeqpfve------------disktvqrva 116 (119) T protein:vir:61 68 IIGTYGSPLIYAAVQEFTHK----------------TKKGFMRKTAFEGEQPFVE------------DISKTVQRVA 116 (119) T ss_pred hcccccchHHHHHHHHHhhh----------------hhhhhhhhhcccCCcchHH------------HHHHHHHHhh Confidence 56667777789999998631 1122222333344556653 3333444443 No 183 >protein:vir:3427 Length: 192 # NCBI annotation: tail component # Family: family:all:869 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040590;genbank:gi:9626254;genbank:GeneID:2703485 Probab=58.50 E-value=0.41 Score=22.70 Aligned_cols=138 Identities=15% Similarity=0.167 Sum_probs=53.9 Q ss_pred cchhhhhHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHH----HHHHHHhcCCccccccccceeEEe-ecCceEEEEecCC Q lcl|NC_011308. 13 MADDIKFEMDMSKIKDMFDDT-AEKALKQIGEHMKTEI----AEGGHGDSSNNVTGEYANKTDFEV-DKRKQEVKIGNSS 86 (154) Q Consensus 13 Ma~~v~~~~~l~~~~~~l~~~-~~~~v~~a~~~~~~~i----~~~~ak~~aPvdTG~Lr~SI~~~~-~~~~~~~~V~~~~ 86 (154) |+ |++++++.+.++.|++. +.+++.+|+..++..+ +..-++.. -+....++..+.+.- +.+.+.++|..+. T Consensus 1 ~~--ik~l~~~~~~L~~i~~~~vp~A~~rAiNrta~~a~t~~~r~v~~e~-~I~~k~Ir~r~r~~kAs~~~l~a~I~~~~ 77 (192) T protein:vir:34 1 MA--IKGLEQAVENLSRISKTAVPGAAAMAINRVASSAISQSASQVARET-KVRRKLVKERARLKRATVKNPQARIKVNR 77 (192) T ss_pred Cc--chhHHHHHHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHh-CCCHHHHHhhheeccccCCCceEEEEEec Confidence 76 56888888888776443 4555555444444333 33222222 234456666665532 3344566666543 Q ss_pred CcccccccCccccccCCC--------------------Cc-c--cccceecccCcee-ecC--CCC-----------CCc Q lcl|NC_011308. 87 DYAIYYEFGTGEKSEKGG--------------------GR-A--GGWSYMDKNGKWH-FTR--GSK-----------ASK 129 (154) Q Consensus 87 ~YA~yVE~GTg~~~~~~~--------------------~~-~--~~~~~~~~~g~~~-~t~--g~~-----------a~P 129 (154) .--+-.-+|+........ |+ . ..+....++|.|+ |.+ |.. 
..| T Consensus 78 ~~l~~~~l~~~~~~~~rr~~~~~~~~~~~~~~g~~~k~Gk~~f~gaFia~m~ng~~~Vf~R~~gk~R~PIe~vkIpis~~ 157 (192) T protein:vir:34 78 GDLPVIKLGNARVVLSRRRRRKKGQRSSLKGGGSVLVVGNRRIPGAFIQQLKNGRWHVMQRVAGKNRYPIDVVKIPMAVP 157 (192) T ss_pred cceeeeeecccccccccccccccccccccccccceeeecceecCCcccccCCCCCceeEEEccCCCccceeEEEechhHH Confidence 322222233321110000 00 0 0011111222221 111 111 122 Q ss_pred hhHHHHHHHHHH-HHHHHHHHh----hccC Q lcl|NC_011308. 130 RMRYTFRDEKSK-VKDYVIKVF----GGLD 154 (154) Q Consensus 130 Fl~pA~~~~~~~-i~~~i~~~l----~~l~ 154 (154) +..||+...++ +++.|.++| ...= T Consensus 158 -l~~af~~~~~~~~~~~~~~El~~~L~~~l 186 (192) T protein:vir:34 158 -LTTAFKQNIERIRRERLPKELGYALQHQL 186 (192) T ss_pred -HHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 35666554432 223333222 2211 No 184 >protein:vir:8330 Length: 141 # NCBI annotation: gp47 # Family: family:all:30831 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817898;genbank:gi:29566331;genbank:GeneID:1259526 Probab=24.45 E-value=0.097 Score=26.14 Aligned_cols=129 Identities=21% Similarity=0.320 Sum_probs=52.8 Q ss_pred CCcceeeecCCccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceeE-EeecCceE Q lcl|NC_011308. 1 MRSRLLIDRGGHMADDIKFEMDMSKIKDMFDDTAEKALKQIGEHMKTEIAEGGHGDSSNNVTGEYANKTDF-EVDKRKQE 79 (154) Q Consensus 1 ~~~~~~~~~~~~Ma~~v~~~~~l~~~~~~l~~~~~~~v~~a~~~~~~~i~~~~ak~~aPvdTG~Lr~SI~~-~~~~~~~~ 79 (154) .|.|+|...- +|+++.+-+-.++.|. ..+.++. -=..|.-+.--..-|..||..++||.- .-++.+-. T Consensus 3 vrrrvlfnel-----evkiendpevklqtla--faervke----ywqdiapdagdpghpyatghykdsiqriaskgrgpr 71 (141) T protein:vir:83 3 VRRRVLFNEL-----EVKIENDPEVKLQTLA--FAERVKE----YWQDIAPDAGDPGHPYATGHYKDSIQRIASKGRGPR 71 (141) T ss_pred cchhhhhhhe-----eeeecCCCccchhhhH--HHHHHHH----HHHhhCCCCCCCCCCccccchhHHHHHHHhccCCCc Confidence 5555554321 2333332222222211 1111111 111111111122346789999999942 22333344 Q ss_pred EE------EecCCCcccccccCccc-cccCCCCcccccceecccCceeecCCCCCCchhHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_011308. 
80 VK------IGNSSDYAIYYEFGTGE-KSEKGGGRAGGWSYMDKNGKWHFTRGSKASKRMRYTFRDEKSKVKDYVIKVFGG 152 (154) Q Consensus 80 ~~------V~~~~~YA~yVE~GTg~-~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~PFl~pA~~~~~~~i~~~i~~~l~~ 152 (154) ++ |++-.--|-..|||||- |+.. .++-|..-+ |+.+|.-.+ |.|-|.-|. ++..-+.- - T Consensus 72 grwiwhhwvgtfdtvaglleygtgydygpe---skgnwigld--gerhwgwkt-ptpafayaa-----rvaahfgg---t 137 (141) T protein:vir:83 72 GRWIWHHWVGTFDTVAGLLEYGTGYDYGPE---SKGNWIGLD--GERHWGWKT-PTPAFAYAA-----RVAAHFGG---T 137 (141) T ss_pred ccchhhhhhhhhHHHHHHHhhccCcccCCC---CCCceeecc--CcccccccC-CCchhhHHH-----HHHhhhcc---c Confidence 43 66666678889999982 2221 233344333 333333233 334332221 22222221 2 Q ss_pred cC Q lcl|NC_011308. 153 LD 154 (154) Q Consensus 153 l~ 154 (154) +| T Consensus 138 md 139 (141) T protein:vir:83 138 MD 139 (141) T ss_pred cc Confidence 33 Done!