Query lcl|NC_015262.1_cdsid_YP_004306110.1 [gene=phiCD6356_09] [protein=putative head-tail joining protein] [protein_id=YP_004306110.1] [location=6383..6826] Match_columns 147 No_of_seqs 136 out of 399 Neff 8.7 Searched_HMMs 1612 Date Thu Nov 7 13:12:14 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_9 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_9_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:101594 Length: 173 100.0 7E-35 4.3E-38 207.8 12.8 136 5-147 1-172 (173) 2 protein:vir:102875 Length: 146 100.0 1.2E-33 7.2E-37 201.1 13.1 128 1-147 1-144 (146) 3 protein:vir:107568 Length: 146 100.0 1.2E-33 7.2E-37 201.1 13.1 128 1-147 1-144 (146) 4 protein:vir:102085 Length: 146 100.0 1.2E-33 7.2E-37 201.1 13.1 128 1-147 1-144 (146) 5 protein:vir:105007 Length: 146 100.0 1.2E-33 7.2E-37 201.1 13.1 128 1-147 1-144 (146) 6 protein:vir:93617 Length: 148 100.0 1.4E-33 8.7E-37 200.7 12.7 128 1-147 2-144 (148) 7 protein:vir:100075 Length: 140 100.0 1.9E-33 1.2E-36 200.0 12.6 127 1-147 1-134 (140) 8 protein:vir:1386 Length: 149 # 100.0 2.2E-33 1.3E-36 199.7 12.5 128 1-147 1-145 (149) 9 protein:vir:5745 Length: 135 # 100.0 6E-33 3.8E-36 197.2 13.0 127 1-147 1-132 (135) 10 protein:vir:1273 Length: 127 # 100.0 6.7E-33 4.2E-36 197.0 13.0 123 1-146 1-127 (127) 11 protein:vir:94538 Length: 125 100.0 1.5E-32 9.1E-36 195.1 13.1 122 1-147 1-124 (125) 12 protein:vir:80362 Length: 140 100.0 1.3E-32 8.1E-36 195.4 12.8 127 1-147 1-134 (140) 13 protein:vir:1437 Length: 140 # 100.0 1.3E-32 7.8E-36 195.5 12.6 127 1-147 1-134 (140) 14 protein:vir:100243 Length: 140 100.0 1.6E-32 9.7E-36 194.9 13.0 127 1-147 1-134 (140) 15 protein:vir:194 Length: 149 # 100.0 1.9E-32 1.2E-35 194.4 12.2 128 1-147 2-145 (149) 16 protein:vir:3873 Length: 128 # 100.0 5.3E-32 3.3E-35 192.1 12.3 122 3-146 1-128 (128) 17 protein:vir:97088 Length: 157 100.0 1.1E-31 6.8E-35 190.3 13.4 144 1-147 1-151 (157) 18 protein:vir:1891 Length: 179 # 100.0 7.7E-32 4.7E-35 191.2 12.1 128 1-147 1-167 (179) 19 protein:vir:4347 Length: 164 # 99.9 2.6E-31 1.6E-34 188.2 10.6 128 1-147 1-152 (164) 20 protein:vir:105089 Length: 133 99.9 1.5E-30 9.1E-34 184.1 13.1 124 2-147 1-132 (133) 21 protein:vir:106570 Length: 182 99.9 1.8E-30 1.1E-33 183.6 13.5 139 1-147 1-178 (182) 22 protein:vir:95789 Length: 114 99.9 2.4E-30 1.5E-33 183.0 12.5 114 3-146 1-114 (114) 23 protein:vir:9708 Length: 125 # 99.9 3E-30 1.9E-33 182.4 12.4 120 6-147 1-125 (125) 24 protein:vir:3617 Length: 112 # 99.9 2.2E-30 1.4E-33 183.1 11.6 112 1-142 1-112 (112) 25 protein:vir:5978 Length: 144 # 99.9 8.4E-30 5.2E-33 180.0 12.8 133 1-142 4-144 (144) 26 protein:vir:9930 Length: 108 # 99.9 2.2E-29 1.4E-32 177.7 11.4 108 7-143 1-108 (108) 27 protein:vir:78858 Length: 115 99.9 2.5E-29 1.6E-32 177.4 11.6 109 5-142 1-115 (115) 28 protein:vir:96225 Length: 115 99.9 2.5E-29 1.6E-32 177.4 11.6 109 5-142 1-115 (115) 29 protein:vir:9312 Length: 115 # 99.9 2.5E-29 1.6E-32 177.4 11.6 109 5-142 1-115 (115) 30 protein:vir:96358 Length: 115 99.9 2.5E-29 1.6E-32 177.4 11.6 109 5-142 1-115 (115) 31 protein:vir:103917 Length: 115 99.9 2.5E-29 1.6E-32 177.4 11.6 109 5-142 1-115 (115) 32 protein:vir:97144 Length: 115 99.9 2.5E-29 1.6E-32 177.4 11.6 109 5-142 1-115 (115) 33 protein:vir:106623 Length: 115 99.9 5.1E-29 3.2E-32 175.7 11.6 109 5-142 1-115 (115) 34 protein:vir:99744 Length: 115 99.9 4.9E-29 3.1E-32 175.8 11.2 109 5-142 1-115 (115) 35 protein:vir:94108 Length: 149 99.9 4.2E-29 2.6E-32 176.2 10.5 127 1-138 13-149 (149) 36 protein:vir:107099 Length: 137 99.9 9E-29 5.6E-32 174.3 11.6 127 1-138 1-137 (137) 37 protein:vir:105330 Length: 137 99.9 1.3E-28 8.1E-32 173.4 11.9 127 1-138 1-137 (137) 38 protein:vir:94796 Length: 137 99.9 1.2E-28 7.3E-32 173.7 11.4 127 1-138 1-137 (137) 39 protein:vir:105916 Length: 149 99.9 8.1E-29 5E-32 174.6 10.4 127 1-138 13-149 (149) 40 protein:vir:94654 Length: 142 99.9 2.8E-28 1.7E-31 171.6 12.5 133 1-142 1-142 (142) 41 protein:vir:95894 Length: 137 99.9 3.4E-28 2.1E-31 171.1 11.2 127 1-138 1-137 (137) 42 protein:vir:96121 Length: 137 99.9 3.7E-28 2.3E-31 171.0 11.4 127 1-138 1-137 (137) 43 protein:vir:743 Length: 108 # 99.9 3.7E-28 2.3E-31 171.0 10.8 108 5-142 1-108 (108) 44 protein:vir:79988 Length: 125 99.9 9.8E-28 6E-31 168.7 12.5 121 1-146 1-125 (125) 45 protein:vir:9414 Length: 125 # 99.9 9.8E-28 6E-31 168.7 12.5 121 1-146 1-125 (125) 46 protein:vir:81106 Length: 125 99.9 9.8E-28 6E-31 168.7 12.5 121 1-146 1-125 (125) 47 protein:vir:98342 Length: 125 99.9 9.8E-28 6E-31 168.7 12.5 121 1-146 1-125 (125) 48 protein:vir:4704 Length: 125 # 99.9 9.8E-28 6E-31 168.7 12.5 121 1-146 1-125 (125) 49 protein:vir:94490 Length: 137 99.9 6.4E-28 4E-31 169.7 11.5 127 1-138 1-137 (137) 50 protein:vir:93738 Length: 137 99.9 6.4E-28 4E-31 169.7 11.5 127 1-138 1-137 (137) 51 protein:vir:97427 Length: 137 99.9 6.4E-28 4E-31 169.7 11.5 127 1-138 1-137 (137) 52 protein:vir:96486 Length: 112 99.9 5.4E-28 3.3E-31 170.1 10.9 110 1-141 1-112 (112) 53 protein:vir:96829 Length: 135 99.9 7.9E-28 4.9E-31 169.2 11.4 127 1-138 1-135 (135) 54 protein:vir:98409 Length: 108 99.9 8.3E-28 5.2E-31 169.0 10.7 108 5-142 1-108 (108) 55 protein:vir:102154 Length: 119 99.9 7.2E-28 4.4E-31 169.4 9.8 118 1-146 1-119 (119) 56 protein:vir:4906 Length: 114 # 99.9 1.3E-27 7.9E-31 168.0 10.2 112 1-143 1-114 (114) 57 protein:vir:2740 Length: 114 # 99.9 1.3E-27 7.9E-31 168.0 10.2 112 1-143 1-114 (114) 58 protein:vir:8669 Length: 142 # 99.9 3.3E-27 2.1E-30 165.8 9.9 131 1-139 2-142 (142) 59 protein:vir:99101 Length: 142 99.9 3.3E-27 2.1E-30 165.8 9.9 131 1-139 2-142 (142) 60 protein:vir:105467 Length: 144 99.8 3.3E-24 2.1E-27 149.3 12.4 139 1-147 1-142 (144) 61 protein:vir:81147 Length: 126 99.8 6.7E-24 4.2E-27 147.6 11.7 125 1-145 1-126 (126) 62 protein:vir:106041 Length: 137 99.8 1.5E-24 9.4E-28 151.2 7.5 123 1-136 1-137 (137) 63 protein:vir:95062 Length: 116 99.8 1.3E-23 7.9E-27 146.1 8.3 106 24-138 1-116 (116) 64 protein:vir:97327 Length: 116 99.8 1.9E-23 1.2E-26 145.1 8.6 106 24-138 1-116 (116) 65 protein:vir:1243 Length: 116 # 99.8 1.9E-23 1.2E-26 145.1 8.6 106 24-138 1-116 (116) 66 protein:vir:102441 Length: 137 99.8 2.2E-23 1.4E-26 144.8 8.0 125 1-137 1-137 (137) 67 protein:vir:107545 Length: 140 99.8 5.4E-23 3.4E-26 142.7 6.5 127 1-136 1-140 (140) 68 protein:vir:97982 Length: 140 99.8 5.4E-23 3.4E-26 142.7 6.5 127 1-136 1-140 (140) 69 protein:vir:4956 Length: 153 # 99.8 4.6E-22 2.9E-25 137.6 9.2 124 1-147 1-136 (153) 70 protein:vir:78077 Length: 141 99.8 1.3E-21 7.9E-25 135.1 11.2 133 1-146 1-141 (141) 71 protein:vir:100887 Length: 139 99.8 7.6E-22 4.7E-25 136.4 9.6 121 3-147 1-132 (139) 72 protein:vir:79034 Length: 141 99.8 2.3E-21 1.4E-24 133.7 10.9 128 1-147 1-137 (141) 73 protein:vir:5000 Length: 141 # 99.7 4.3E-21 2.7E-24 132.3 8.9 123 1-147 1-136 (141) 74 protein:vir:106506 Length: 137 99.7 3E-21 1.9E-24 133.1 7.2 127 6-146 1-137 (137) 75 protein:vir:4859 Length: 140 # 99.7 1.6E-20 9.7E-24 129.2 9.2 124 1-147 1-136 (140) 76 protein:vir:100223 Length: 139 99.7 3.3E-20 2.1E-23 127.4 8.7 121 1-147 1-132 (139) 77 protein:vir:100652 Length: 134 99.7 7.2E-20 4.5E-23 125.5 9.9 122 3-144 1-134 (134) 78 protein:vir:4833 Length: 140 # 99.7 1.4E-19 8.4E-23 124.0 8.7 124 1-147 1-136 (140) 79 protein:vir:9513 Length: 134 # 99.7 2.4E-19 1.5E-22 122.7 9.8 128 3-144 1-134 (134) 80 protein:vir:101302 Length: 134 99.7 2.4E-19 1.5E-22 122.7 9.8 128 3-144 1-134 (134) 81 protein:vir:102963 Length: 163 99.6 8.6E-19 5.4E-22 119.6 11.1 127 1-147 1-156 (163) 82 protein:vir:966 Length: 123 # 99.6 8.3E-19 5.1E-22 119.7 9.9 122 1-143 1-123 (123) 83 protein:vir:9647 Length: 132 # 99.6 1.5E-18 9.4E-22 118.3 10.3 128 1-147 1-132 (132) 84 protein:vir:9879 Length: 127 # 99.6 5.2E-18 3.2E-21 115.3 8.2 118 7-143 1-127 (127) 85 protein:vir:99528 Length: 92 # 99.6 5.4E-18 3.4E-21 115.2 7.7 91 1-118 1-92 (92) 86 protein:vir:10367 Length: 119 99.5 2.8E-18 1.8E-21 116.8 4.9 106 39-147 1-113 (119) 87 protein:vir:81067 Length: 119 99.5 2.9E-18 1.8E-21 116.7 4.9 106 39-147 1-113 (119) 88 protein:vir:98636 Length: 138 99.5 4.9E-17 3E-20 110.0 10.2 128 1-147 7-138 (138) 89 protein:vir:3848 Length: 159 # 99.5 6.6E-17 4.1E-20 109.3 9.2 130 1-147 1-155 (159) 90 protein:vir:95372 Length: 124 99.4 5.4E-16 3.4E-19 104.3 9.7 119 1-143 1-124 (124) 91 protein:vir:78335 Length: 133 99.4 2.1E-15 1.3E-18 101.1 10.5 127 3-145 1-133 (133) 92 protein:vir:80116 Length: 127 99.4 2.6E-15 1.6E-18 100.5 9.9 122 1-146 1-127 (127) 93 protein:vir:94419 Length: 133 99.3 5.5E-15 3.4E-18 98.8 9.6 125 3-143 1-133 (133) 94 protein:vir:96973 Length: 133 99.3 5.5E-15 3.4E-18 98.8 9.6 125 3-143 1-133 (133) 95 protein:vir:78644 Length: 133 99.3 5.5E-15 3.4E-18 98.8 9.6 125 3-143 1-133 (133) 96 protein:vir:9363 Length: 133 # 99.3 5.5E-15 3.4E-18 98.8 9.6 125 3-143 1-133 (133) 97 protein:vir:93898 Length: 133 99.3 7.7E-15 4.8E-18 98.0 9.9 125 3-143 1-133 (133) 98 protein:vir:102338 Length: 116 99.2 3.4E-14 2.1E-17 94.5 8.8 113 24-146 1-116 (116) 99 protein:vir:96012 Length: 133 99.1 4.2E-13 2.6E-16 88.5 10.1 129 1-145 1-133 (133) 100 protein:vir:6246 Length: 143 # 99.1 3.1E-13 1.9E-16 89.1 7.9 123 1-147 1-139 (143) 101 protein:vir:6216 Length: 125 # 99.0 1.2E-12 7.2E-16 86.0 8.7 121 1-145 1-125 (125) 102 protein:vir:1332 Length: 143 # 99.0 8.1E-13 5E-16 86.9 7.8 123 1-147 1-139 (143) 103 protein:vir:79638 Length: 146 98.8 6.6E-11 4.1E-14 76.4 10.5 123 1-147 1-146 (146) 104 protein:vir:104347 Length: 145 98.8 3.5E-11 2.1E-14 77.9 8.1 123 1-145 1-145 (145) 105 protein:vir:1988 Length: 156 # 98.7 3.3E-10 2.1E-13 72.5 10.6 122 1-147 1-156 (156) 106 protein:vir:7412 Length: 168 # 98.6 4.4E-10 2.7E-13 71.9 9.3 144 1-147 1-161 (168) 107 protein:vir:3163 Length: 145 # 98.6 5.6E-10 3.5E-13 71.3 9.6 114 6-147 1-141 (145) 108 protein:vir:107703 Length: 147 98.6 1.3E-09 7.8E-13 69.4 11.3 124 1-147 1-147 (147) 109 protein:vir:103280 Length: 142 98.6 4.8E-10 3E-13 71.7 8.9 120 1-145 1-142 (142) 110 protein:vir:2688 Length: 123 # 98.6 3.8E-10 2.4E-13 72.2 8.2 117 13-143 1-123 (123) 111 protein:vir:79091 Length: 175 98.6 9.7E-10 6E-13 70.0 9.6 122 1-147 1-174 (175) 112 protein:vir:103841 Length: 155 98.6 1.2E-09 7.2E-13 69.6 10.0 123 1-147 1-153 (155) 113 protein:vir:94994 Length: 131 98.5 1.1E-09 7E-13 69.6 7.6 112 1-142 1-131 (131) 114 protein:vir:78380 Length: 131 98.4 3.6E-09 2.2E-12 66.9 9.3 112 1-142 1-131 (131) 115 protein:vir:79225 Length: 155 98.4 8.7E-09 5.4E-12 64.8 11.3 121 1-145 1-155 (155) 116 protein:vir:1028 Length: 168 # 98.4 5.6E-09 3.5E-12 65.8 9.5 144 1-147 1-161 (168) 117 protein:vir:99196 Length: 155 98.3 1.7E-08 1E-11 63.2 11.4 121 1-145 1-155 (155) 118 protein:vir:99546 Length: 200 98.3 9.5E-09 5.9E-12 64.6 8.9 103 1-147 5-140 (200) 119 protein:vir:99833 Length: 190 98.3 1.9E-08 1.2E-11 63.0 10.5 133 1-147 2-188 (190) 120 protein:vir:5257 Length: 148 # 98.3 3.5E-09 2.2E-12 66.9 6.5 81 1-147 1-92 (148) 121 protein:vir:107851 Length: 175 98.2 2.3E-08 1.4E-11 62.5 10.1 122 1-147 1-174 (175) 122 protein:vir:7449 Length: 123 # 98.2 5.7E-08 3.5E-11 60.3 11.5 119 1-147 1-122 (123) 123 protein:vir:94944 Length: 121 98.2 1.2E-08 7.6E-12 64.0 7.6 103 1-130 2-121 (121) 124 protein:vir:1087 Length: 161 # 98.2 1.8E-08 1.1E-11 63.1 7.8 142 1-147 1-157 (161) 125 protein:vir:101508 Length: 120 98.2 6.8E-08 4.2E-11 59.9 10.8 115 1-146 1-120 (120) 126 protein:vir:80425 Length: 134 98.1 1.1E-08 6.5E-12 64.3 6.3 112 1-143 1-134 (134) 127 protein:vir:95157 Length: 144 98.1 1.8E-08 1.1E-11 63.1 7.4 116 1-146 1-144 (144) 128 protein:vir:97190 Length: 148 98.1 1.7E-08 1.1E-11 63.1 7.1 120 1-147 1-148 (148) 129 protein:vir:3994 Length: 168 # 98.1 2.3E-08 1.4E-11 62.5 7.7 144 1-147 1-161 (168) 130 protein:vir:101563 Length: 155 98.1 6.9E-09 4.3E-12 65.3 4.1 95 3-147 1-99 (155) 131 protein:vir:96774 Length: 152 98.1 2.1E-08 1.3E-11 62.7 6.8 111 1-140 11-152 (152) 132 protein:vir:77650 Length: 155 98.1 1.6E-08 9.8E-12 63.3 5.9 93 3-147 1-99 (155) 133 protein:vir:96105 Length: 193 98.0 7.4E-08 4.6E-11 59.7 8.9 111 1-147 1-133 (193) 134 protein:vir:107757 Length: 189 97.9 1E-07 6.2E-11 58.9 8.1 83 1-147 1-90 (189) 135 protein:vir:80970 Length: 112 97.9 3.6E-07 2.2E-10 55.9 9.9 112 1-145 1-112 (112) 136 protein:vir:4096 Length: 140 # 97.8 1.1E-07 6.6E-11 58.8 6.3 129 1-147 1-135 (140) 137 protein:vir:105773 Length: 131 97.8 1.7E-07 1.1E-10 57.7 7.4 126 5-143 1-131 (131) 138 protein:vir:106728 Length: 155 97.7 1.1E-07 7.1E-11 58.6 4.6 94 25-147 1-99 (155) 139 protein:vir:78607 Length: 155 97.7 1.2E-07 7.2E-11 58.6 4.6 94 25-147 1-99 (155) 140 protein:vir:7993 Length: 108 # 97.7 7.4E-08 4.6E-11 59.7 3.5 100 1-128 1-108 (108) 141 protein:vir:45 Length: 112 # N 97.6 1.3E-06 8.3E-10 52.8 10.1 112 1-145 1-112 (112) 142 protein:vir:95260 Length: 160 97.6 8.7E-07 5.4E-10 53.8 8.1 87 1-147 1-91 (160) 143 protein:vir:94069 Length: 168 97.5 2.5E-07 1.5E-10 56.8 3.9 102 1-147 1-102 (168) 144 protein:vir:96288 Length: 100 97.4 4.2E-07 2.6E-10 55.6 4.8 85 1-116 13-100 (100) 145 protein:vir:80037 Length: 199 96.8 6.3E-07 3.9E-10 54.6 0.3 124 1-147 1-136 (199) 146 protein:vir:4790 Length: 114 # 96.8 4.6E-05 2.8E-08 44.4 10.5 114 1-147 1-114 (114) 147 protein:vir:98892 Length: 108 96.8 2.6E-05 1.6E-08 45.7 9.1 107 1-143 2-108 (108) 148 protein:vir:8106 Length: 150 # 96.7 3.8E-06 2.4E-09 50.3 4.1 134 1-147 1-144 (150) 149 protein:vir:98557 Length: 149 96.6 4.5E-05 2.8E-08 44.4 9.3 119 1-143 1-149 (149) 150 protein:vir:5703 Length: 150 # 96.4 9.5E-05 5.9E-08 42.6 9.7 120 7-143 1-150 (150) 151 protein:vir:2026 Length: 150 # 96.3 0.00011 6.9E-08 42.3 9.7 120 7-143 1-150 (150) 152 protein:vir:96763 Length: 177 96.1 0.00022 1.4E-07 40.6 10.0 138 1-147 5-171 (177) 153 protein:vir:6071 Length: 150 # 96.1 0.00017 1E-07 41.3 9.3 120 7-143 1-150 (150) 154 protein:vir:1581 Length: 116 # 95.9 0.00015 9.6E-08 41.5 8.6 116 1-142 1-116 (116) 155 protein:vir:396 Length: 184 # 95.9 0.00016 9.8E-08 41.4 8.7 135 5-147 1-180 (184) 156 protein:vir:9823 Length: 118 # 95.9 0.00019 1.2E-07 41.0 9.1 116 1-147 2-117 (118) 157 protein:vir:3036 Length: 118 # 95.9 0.00019 1.2E-07 41.0 9.1 116 1-147 2-117 (118) 158 protein:vir:102190 Length: 93 95.7 8.7E-05 5.4E-08 42.8 6.5 91 28-146 1-93 (93) 159 protein:vir:79179 Length: 155 95.7 0.0003 1.9E-07 39.9 9.4 125 1-143 1-155 (155) 160 protein:vir:4514 Length: 168 # 95.7 0.00026 1.6E-07 40.2 8.8 139 1-147 1-163 (168) 161 protein:vir:97088 Length: 157 95.6 0.00039 2.4E-07 39.3 9.4 132 1-147 5-155 (157) 162 protein:vir:3427 Length: 192 # 95.5 0.00028 1.8E-07 40.0 8.6 132 5-147 1-188 (192) 163 protein:vir:4460 Length: 170 # 95.5 0.00035 2.2E-07 39.5 8.9 138 1-147 1-164 (170) 164 protein:vir:6375 Length: 205 # 95.5 0.00096 5.9E-07 37.1 11.2 147 1-147 1-198 (205) 165 protein:vir:100312 Length: 152 95.4 0.00052 3.2E-07 38.6 9.6 127 1-144 1-152 (152) 166 protein:vir:1838 Length: 149 # 94.9 0.00079 4.9E-07 37.6 9.1 119 7-143 1-149 (149) 167 protein:vir:102608 Length: 108 94.3 0.00013 7.9E-08 41.9 3.5 104 1-128 1-108 (108) 168 protein:vir:105825 Length: 108 94.3 0.00013 7.9E-08 41.9 3.5 104 1-128 1-108 (108) 169 protein:vir:1164 Length: 156 # 94.0 0.0018 1.1E-06 35.7 9.0 129 1-147 1-156 (156) 170 protein:vir:79687 Length: 113 93.8 0.0016 1E-06 35.9 8.4 110 14-147 1-111 (113) 171 protein:vir:487 Length: 187 # 93.4 0.0021 1.3E-06 35.3 8.4 147 1-147 1-181 (187) 172 protein:vir:79115 Length: 148 93.3 0.0025 1.6E-06 34.8 8.7 119 7-143 1-148 (148) 173 protein:vir:79034 Length: 141 90.8 0.0041 2.5E-06 33.7 6.8 127 5-147 1-133 (141) 174 protein:vir:4200 Length: 133 # 88.9 0.0086 5.4E-06 31.9 7.1 126 2-143 1-133 (133) 175 protein:vir:78163 Length: 92 # 84.5 0.0035 2.2E-06 34.0 2.4 92 1-131 1-92 (92) 176 protein:vir:8432 Length: 149 # 81.7 0.071 4.4E-05 26.9 8.4 120 1-147 1-148 (149) 177 protein:vir:4162 Length: 133 # 80.5 0.043 2.7E-05 28.1 6.8 126 2-143 1-133 (133) 178 protein:vir:78894 Length: 105 75.9 0.025 1.6E-05 29.3 4.1 102 1-147 1-105 (105) 179 protein:vir:79555 Length: 192 74.6 0.16 9.7E-05 25.0 9.8 136 7-147 1-188 (192) 180 protein:vir:6154 Length: 119 # 56.0 0.017 1.1E-05 30.3 -1.0 118 1-147 1-118 (119) 181 protein:vir:7859 Length: 126 # 42.8 0.33 0.0002 23.2 3.8 106 7-118 1-126 (126) 182 protein:vir:101654 Length: 126 42.8 0.33 0.0002 23.2 3.8 106 7-118 1-126 (126) 183 protein:vir:99454 Length: 150 38.7 1.1 0.00065 20.5 7.5 123 1-134 1-150 (150) 184 protein:vir:3787 Length: 231 # 32.0 1.5 0.0009 19.7 8.1 142 1-146 1-231 (231) No 1 >protein:vir:101594 Length: 173 # NCBI annotation: hypothetical protein # Family: family:all:26502 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112510;genbank:gi:53793610;interpro:IPR010064;uniprot:Q5ZGE3;genbank:GeneID:3101702 Probab=100.00 E-value=7e-35 Score=207.85 Aligned_cols=136 Identities=26% Similarity=0.340 Sum_probs=117.0 Q ss_pred eeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeeccCCccc Q lcl|NC_015262. 5 ITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKEDNSKI 84 (147) Q Consensus 5 ~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~~~~~ 84 (147) |+|+|||+|+++|++|++.+++++++|+.++|..|+++|++++|++||+|++||.++.....++ ..+ ..++++ T Consensus 1 i~i~Gld~L~~~L~~l~~~~~~~~~~a~~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~~~~~~~-~~~------~v~~~~ 73 (173) T protein:vir:10 1 MAVKGVAEVIAELRKIGKDIDKNINATTEEAANFIEDRAKTLAPKNFGKLAQSISTSDLKAKDL-ISK------KITVNE 73 (173) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCchhhhhcceeeeeccCce-eEE------eeCCCc Confidence 9999999999999999999999999999999999999999999999999999998876654432 222 234678 Q ss_pred chhhhhhcccccccccccccc------------------------------------cccccccCCCCCCCcchhhHHHH Q lcl|NC_015262. 85 FYGKFLEFGASAHKIPIKKGK------------------------------------KKGRIINHPGVSPKPFLAPAYES 128 (147) Q Consensus 85 ~y~~~vE~GT~~~~~~~~~~~------------------------------------~~~~~~~~~~~~a~PFl~pA~~~ 128 (147) +|+.|+||||+++...|.... .....+.||||+|||||+|||++ T Consensus 74 ~Ya~fvEfGT~~m~a~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~G~~aqPFl~PA~~~ 153 (173) T protein:vir:10 74 LYGAYMEFGTGAKVSVPKEFADMAASFKGQKTGSFKDGLESIKAWCRAKGIDEKAAYPIFAKILGAGINPQPFLYPAWIE 153 (173) T ss_pred ccchhhhcccccccCCCchhhhhhcccccccccccccccccccccccccccchhcccceeeEeecCCCCCCccchhHHHH Confidence 999999999998876664221 11235778999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 129 KKDEAKNVMKEILKRGLGL 147 (147) Q Consensus 129 ~~~~~~~~i~~~l~~~i~~ 147 (147) +++++.++|.+.|+++|+= T Consensus 154 ~~~~~~~~i~~~i~~~lrk 172 (173) T protein:vir:10 154 GKKQYLKDLENLLKTYNKK 172 (173) T ss_pred hHHHHHHHHHHHHHHHhhc Confidence 9999999999999999998 No 2 >protein:vir:102875 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338140;genbank:gi:77020200;genbank:GeneID:3703784 Probab=100.00 E-value=1.2e-33 Score=201.13 Aligned_cols=128 Identities=34% Similarity=0.625 Sum_probs=117.0 Q ss_pred Cc--eeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcch--------------hhhcceecccc Q lcl|NC_015262. 1 MS--VEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGK--------------LKDGLKVSGVK 64 (147) Q Consensus 1 M~--~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~--------------l~~sI~~~~~~ 64 (147) |+ ++|+|+||++|+++|++|++++++++++|++++|++|+++++.++|+++|. ++++|.+...+ T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ 80 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAK 80 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceecccc Confidence 88 688999999999999999999999999999999999999999999987664 45566666677 Q ss_pred cCCCceEEEEeeeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015262. 65 KKGGTKYVLVGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRG 144 (147) Q Consensus 65 ~~~~~~~~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~ 144 (147) ..++..++.||++...+..+|||||+||||++ |||||||+||+++++++++++|.++|+++ T Consensus 81 ~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT~~-------------------~~a~PFl~pa~~~~k~~~~~~~~~~l~~~ 141 (146) T protein:vir:10 81 LEGGIKTVKIGLNKADRSPWFYLKFHEWGTSK-------------------MPAHPFIEPGFNASKAEAVRAMTDILKNE 141 (146) T ss_pred ccccceeEEeeeccCCCCCcceeeeeccCCCC-------------------CCCCcchhHHHHHhHHHHHHHHHHHHHHH Confidence 77788889999999888899999999999986 89999999999999999999999999999 Q ss_pred hcC Q lcl|NC_015262. 145 LGL 147 (147) Q Consensus 145 i~~ 147 (147) |++ T Consensus 142 l~k 144 (146) T protein:vir:10 142 MRL 144 (146) T ss_pred Hhh Confidence 999 No 3 >protein:vir:107568 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338191;genbank:gi:77020147;genbank:GeneID:3703699 Probab=100.00 E-value=1.2e-33 Score=201.13 Aligned_cols=128 Identities=34% Similarity=0.625 Sum_probs=117.0 Q ss_pred Cc--eeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcch--------------hhhcceecccc Q lcl|NC_015262. 1 MS--VEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGK--------------LKDGLKVSGVK 64 (147) Q Consensus 1 M~--~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~--------------l~~sI~~~~~~ 64 (147) |+ ++|+|+||++|+++|++|++++++++++|++++|++|+++++.++|+++|. ++++|.+...+ T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ 80 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAK 80 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceecccc Confidence 88 688999999999999999999999999999999999999999999987664 45566666677 Q ss_pred cCCCceEEEEeeeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015262. 65 KKGGTKYVLVGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRG 144 (147) Q Consensus 65 ~~~~~~~~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~ 144 (147) ..++..++.||++...+..+|||||+||||++ |||||||+||+++++++++++|.++|+++ T Consensus 81 ~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT~~-------------------~~a~PFl~pa~~~~k~~~~~~~~~~l~~~ 141 (146) T protein:vir:10 81 LEGGIKTVKIGLNKADRSPWFYLKFHEWGTSK-------------------MPAHPFIEPGFNASKAEAVRAMTDILKNE 141 (146) T ss_pred ccccceeEEeeeccCCCCCcceeeeeccCCCC-------------------CCCCcchhHHHHHhHHHHHHHHHHHHHHH Confidence 77788889999999888899999999999986 89999999999999999999999999999 Q ss_pred hcC Q lcl|NC_015262. 145 LGL 147 (147) Q Consensus 145 i~~ 147 (147) |++ T Consensus 142 l~k 144 (146) T protein:vir:10 142 MRL 144 (146) T ss_pred Hhh Confidence 999 No 4 >protein:vir:102085 Length: 146 # NCBI annotation: head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512318;genbank:gi:89152487;genbank:GeneID:3953078 Probab=100.00 E-value=1.2e-33 Score=201.13 Aligned_cols=128 Identities=34% Similarity=0.625 Sum_probs=117.0 Q ss_pred Cc--eeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcch--------------hhhcceecccc Q lcl|NC_015262. 1 MS--VEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGK--------------LKDGLKVSGVK 64 (147) Q Consensus 1 M~--~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~--------------l~~sI~~~~~~ 64 (147) |+ ++|+|+||++|+++|++|++++++++++|++++|++|+++++.++|+++|. ++++|.+...+ T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ 80 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAK 80 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceecccc Confidence 88 688999999999999999999999999999999999999999999987664 45566666677 Q ss_pred cCCCceEEEEeeeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015262. 65 KKGGTKYVLVGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRG 144 (147) Q Consensus 65 ~~~~~~~~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~ 144 (147) ..++..++.||++...+..+|||||+||||++ |||||||+||+++++++++++|.++|+++ T Consensus 81 ~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT~~-------------------~~a~PFl~pa~~~~k~~~~~~~~~~l~~~ 141 (146) T protein:vir:10 81 LEGGIKTVKIGLNKADRSPWFYLKFHEWGTSK-------------------MPAHPFIEPGFNASKAEAVRAMTDILKNE 141 (146) T ss_pred ccccceeEEeeeccCCCCCcceeeeeccCCCC-------------------CCCCcchhHHHHHhHHHHHHHHHHHHHHH Confidence 77788889999999888899999999999986 89999999999999999999999999999 Q ss_pred hcC Q lcl|NC_015262. 145 LGL 147 (147) Q Consensus 145 i~~ 147 (147) |++ T Consensus 142 l~k 144 (146) T protein:vir:10 142 MRL 144 (146) T ss_pred Hhh Confidence 999 No 5 >protein:vir:105007 Length: 146 # NCBI annotation: conserved phage protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459972;genbank:gi:85701387;genbank:GeneID:3882148 Probab=100.00 E-value=1.2e-33 Score=201.13 Aligned_cols=128 Identities=34% Similarity=0.625 Sum_probs=117.0 Q ss_pred Cc--eeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcch--------------hhhcceecccc Q lcl|NC_015262. 1 MS--VEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGK--------------LKDGLKVSGVK 64 (147) Q Consensus 1 M~--~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~--------------l~~sI~~~~~~ 64 (147) |+ ++|+|+||++|+++|++|++++++++++|++++|++|+++++.++|+++|. ++++|.+...+ T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ 80 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAK 80 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceecccc Confidence 88 688999999999999999999999999999999999999999999987664 45566666677 Q ss_pred cCCCceEEEEeeeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015262. 65 KKGGTKYVLVGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRG 144 (147) Q Consensus 65 ~~~~~~~~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~ 144 (147) ..++..++.||++...+..+|||||+||||++ |||||||+||+++++++++++|.++|+++ T Consensus 81 ~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT~~-------------------~~a~PFl~pa~~~~k~~~~~~~~~~l~~~ 141 (146) T protein:vir:10 81 LEGGIKTVKIGLNKADRSPWFYLKFHEWGTSK-------------------MPAHPFIEPGFNASKAEAVRAMTDILKNE 141 (146) T ss_pred ccccceeEEeeeccCCCCCcceeeeeccCCCC-------------------CCCCcchhHHHHHhHHHHHHHHHHHHHHH Confidence 77788889999999888899999999999986 89999999999999999999999999999 Q ss_pred hcC Q lcl|NC_015262. 145 LGL 147 (147) Q Consensus 145 i~~ 147 (147) |++ T Consensus 142 l~k 144 (146) T protein:vir:10 142 MRL 144 (146) T ss_pred Hhh Confidence 999 No 6 >protein:vir:93617 Length: 148 # NCBI annotation: putative structural component # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449299;genbank:gi:157166047;interpro:IPR010064;interpro:IPR011693;uniprot:Q6H9U2;genbank:GeneID:5580439 Probab=99.96 E-value=1.4e-33 Score=200.71 Aligned_cols=128 Identities=23% Similarity=0.496 Sum_probs=112.9 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEee--- Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGD-KLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGI--- 76 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~-~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~--- 76 (147) |+++|+|+|||+|++.|++|+.++. ++++.||+++|++|+++|+.++|++||+|++||.++......|.....|+. T Consensus 2 m~~~~~i~Gldel~~~l~~L~~~~~~~~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~g~~~~~v~~~~~ 81 (148) T protein:vir:93 2 IETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRRSRDGGMESGVHIRGV 81 (148) T ss_pred cceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcchhhhhceeccccccCCceeeeeeeccc Confidence 8899999999999999999998874 688999999999999999999999999999999887666655555444432 Q ss_pred -----------eccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015262. 77 -----------TKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGL 145 (147) Q Consensus 77 -----------~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i 145 (147) ..+...++|||||+||||++ |||||||+|||++++++++++|.++++++| T Consensus 82 ~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~-------------------~pa~PFl~pA~~~~k~~~~~~~~~~~~~~i 142 (148) T protein:vir:93 82 NPDTGNSDNTMKADNPRNAFYWRFVEMGTVN-------------------MPPHPFVRPAFDVRSEQAAQVAIARMNRAI 142 (148) T ss_pred ccccccccceeecCCCCCcceeeeeccCCCC-------------------CCCCcchhHHHHHhHHHHHHHHHHHHHHHH Confidence 23445678999999999996 899999999999999999999999999999 Q ss_pred cC Q lcl|NC_015262. 146 GL 147 (147) Q Consensus 146 ~~ 147 (147) += T Consensus 143 ~k 144 (148) T protein:vir:93 143 DE 144 (148) T ss_pred HH Confidence 87 No 7 >protein:vir:100075 Length: 140 # NCBI annotation: gp9 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945039;genbank:gi:38707899;genbank:GeneID:2744122 Probab=99.96 E-value=1.9e-33 Score=199.99 Aligned_cols=127 Identities=28% Similarity=0.518 Sum_probs=115.0 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeee-- Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGD-KLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGIT-- 77 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~-~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~-- 77 (147) |+ +|+|+|||+|++.|++|++++. +++++|++++|++|++++++++|++||+|++||.++..+...+...+.||+. T Consensus 1 Ma-~~~i~Gld~l~~~l~~L~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~tG~l~~sI~~~~~~~~~~~~~~~~g~~~~ 79 (140) T protein:vir:10 1 MS-SIQIIGLADLRADFEKLAKSQSTKALRRATVAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGLATAGVRVR 79 (140) T ss_pred Cc-eeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHHhccccccccccccceEEeeeeec Confidence 87 7999999999999999998875 6899999999999999999999999999999999888777777777777653 Q ss_pred ----ccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 78 ----KEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGLGL 147 (147) Q Consensus 78 ----~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~~ 147 (147) ....++++||+|+||||++ |||||||+||++++++++.++|.++++++|+= T Consensus 80 ~~~~~~~~~~~~y~~f~E~GT~~-------------------~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~k 134 (140) T protein:vir:10 80 TKGKADSPNNAFYWRFDEFGTQH-------------------MKAQPFMRPAFDASIGEAEGAIRTELARAIDR 134 (140) T ss_pred cccccCCCCccceeeeeccCCCC-------------------CCCCcchhhhHHHHHHHHHHHHHHHHHHHHHH Confidence 3455778999999999985 99999999999999999999999999999987 No 8 >protein:vir:1386 Length: 149 # NCBI annotation: Gp9 protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612838;genbank:gi:20065972;genbank:GeneID:935787 Probab=99.96 E-value=2.2e-33 Score=199.66 Aligned_cols=128 Identities=23% Similarity=0.376 Sum_probs=117.8 Q ss_pred Cc--eeeeehhHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHhCCCC-------------cchhhhcceeccc Q lcl|NC_015262. 1 MS--VEITTEGFDAVLSKIESMG--KSGDKLLNEAVKAGGNVILQDALPRVSKR-------------SGKLKDGLKVSGV 63 (147) Q Consensus 1 M~--~~~~i~Gl~el~~~l~~l~--~~~~~~~~~al~~~a~~v~~~ak~~ap~~-------------tG~l~~sI~~~~~ 63 (147) |+ ++|+|+||+||+++|++|+ .+.++++++||+++|++|++++++++|+. +||++++|.++.+ T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~~k~~~~Al~~ga~~v~~~~k~~aP~~~~~~~~~~~~~~~~~~~~d~i~~~~~ 80 (149) T protein:vir:13 1 MSDGWEIKFEGLDDLIKTFEQLGTEKENEDVEKSILKECGDLAKKTVAPLIHISDDNSKSGRKGSRPPGHAANNIPEPKI 80 (149) T ss_pred CCceeEEEeecHHHHHHHHHhcccHHHHHHHHHHHHHHHHHHHHHHHHHhCCccCCccccccccccccchhhhcceeccc Confidence 98 5689999999999999995 46789999999999999999999999963 6799999999999 Q ss_pred ccCCCceEEEEeeeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_015262. 64 KKKGGTKYVLVGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKR 143 (147) Q Consensus 64 ~~~~~~~~~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~ 143 (147) +..++..++.||+.+..++++|||||+||||++ |||||||+||++++++++.++|.++|++ T Consensus 81 ~~~~g~~~~~VG~~~~~~~~~~y~~f~E~GT~k-------------------~~a~pF~~pa~~~~~~~~~~~~~~~l~k 141 (149) T protein:vir:13 81 RKKKGNLQCVVGWEKSDNTPFYYMKMEEWGTSE-------------------RPPHHAFGKTNKILKRVYDNIAQKKYDN 141 (149) T ss_pred ccccceeEEEeeccCCCCCccceeeeeccCccC-------------------CCCCccchHHHHHHHHHHHHHHHHHHHH Confidence 988899999999999888899999999999997 8999999999999999999999998877 Q ss_pred HhcC Q lcl|NC_015262. 144 GLGL 147 (147) Q Consensus 144 ~i~~ 147 (147) +|+= T Consensus 142 ~i~~ 145 (149) T protein:vir:13 142 FVKE 145 (149) T ss_pred HHHH Confidence 7775 No 9 >protein:vir:5745 Length: 135 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892056;genbank:gi:33770519;interpro:IPR010064;interpro:IPR011693;uniprot:Q7Y404;genbank:GeneID:2637451 Probab=99.96 E-value=6e-33 Score=197.22 Aligned_cols=127 Identities=22% Similarity=0.375 Sum_probs=112.4 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCCCc----chhhhcceecccccCCCceEEEEe Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGD-KLLNEAVKAGGNVILQDALPRVSKRS----GKLKDGLKVSGVKKKGGTKYVLVG 75 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~-~~~~~al~~~a~~v~~~ak~~ap~~t----G~l~~sI~~~~~~~~~~~~~~~Vg 75 (147) |+++|+|+||+||++.|++|+.++. +++++|++++|++|+++++.++|+++ |+|++||.++..+...+...+.|+ T Consensus 1 M~~~~~i~Gl~el~~~l~~L~~~~~~k~~~~Al~~~a~~v~~~~k~~ap~~~~~~~g~l~~~I~i~~~k~~~~~~~v~v~ 80 (135) T protein:vir:57 1 MIPEIEISGLQELERRLIAVGEEVGTKILRDAGRAAMAVVEADMKQNAGYDNSSTNAHMRDSIKIRSSRGKAGSTVVVLR 80 (135) T ss_pred CceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhHHhhcccccccccccceeEEEE Confidence 9999999999999999999999985 68899999999999999999999864 999999999888777776666665 Q ss_pred eeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 76 ITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGLGL 147 (147) Q Consensus 76 ~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~~ 147 (147) ++.. ....||+||+||||++ |||||||+|||++++++++++|.++|+++|+= T Consensus 81 vg~~-~~~~~~~~f~E~GT~~-------------------~~a~PF~~pa~~~~~~~~~~~~~~~~~~~l~k 132 (135) T protein:vir:57 81 VGPT-RSHYMKALAQEFGTIK-------------------QVAKPFIRPALDYNKMQVLRILTVEIRDGLST 132 (135) T ss_pred ecCC-CCcceeEeecccCCCC-------------------CCCCcchhHhHHHhHHHHHHHHHHHHHHHHHH Confidence 5433 2446889999999997 89999999999999999999999998888877 No 10 >protein:vir:1273 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690765;genbank:gi:22855005;genbank:GeneID:955232 Probab=99.96 E-value=6.7e-33 Score=196.97 Aligned_cols=123 Identities=29% Similarity=0.559 Sum_probs=113.3 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC---cchhhhcceeccccc-CCCceEEEEee Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKR---SGKLKDGLKVSGVKK-KGGTKYVLVGI 76 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~---tG~l~~sI~~~~~~~-~~~~~~~~Vg~ 76 (147) |+ +|+|+||+||++.|++|+.++++++++||+++|.+|.+++++++|++ ||+|++||.++.++. .++..++.||+ T Consensus 1 M~-~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~~k~~ap~~~~~tg~l~~~I~~~~~k~~~~g~~~v~Vg~ 79 (127) T protein:vir:12 1 MA-DMSFDGIDDLTQYFEKIGGDIEKVEPVALKAGGEIIAERQRSHVNRSDKKQPHMQDNITVSNVRESKDGVRFVAVGP 79 (127) T ss_pred Ce-eeeehhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCChhHHHHhhhccccccccCceeEEEEee Confidence 76 79999999999999999999999999999999999999999999964 899999998877654 45777889998 Q ss_pred eccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_015262. 77 TKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGLG 146 (147) Q Consensus 77 ~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~ 146 (147) ++ +.+|||||+||||++ |||||||+||+++++++++++|.++|+++|+ T Consensus 80 ~~---~~~~y~~f~E~GT~~-------------------~~a~Pf~~pa~~~~~~~~~~~~~~~~~~~lk 127 (127) T protein:vir:12 80 NK---KVAYRGRFLEWGTSK-------------------MPPQPFIEKGGKEGEGPAVELMERILTAPIK 127 (127) T ss_pred CC---CCcceeeeeccCccC-------------------CCCCccchHhHHHHHHHHHHHHHHHHHHhcC Confidence 64 468999999999996 8999999999999999999999999999999 No 11 >protein:vir:94538 Length: 125 # NCBI annotation: putative head to tail joining # Family: family:all:180 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223893;genbank:gi:62327105;genbank:GeneID:5075554 Probab=99.96 E-value=1.5e-32 Score=195.12 Aligned_cols=122 Identities=18% Similarity=0.268 Sum_probs=113.8 Q ss_pred Cc--eeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeec Q lcl|NC_015262. 1 MS--VEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITK 78 (147) Q Consensus 1 M~--~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~ 78 (147) |+ |+|+|+|+|+|.+.|+++++++.+.+.+++.++++.++++++.++|++||+|++||..+.++..++...+.||. T Consensus 1 Ma~~~~i~~~Gld~l~~~L~~~~~~~~~~v~~al~~~a~~i~~~ak~~ap~~tG~L~~sI~~~~~~~~~~~~~~~v~~-- 78 (125) T protein:vir:94 1 MANDFNIKFKGVDKLLDEFDISRKELVPYSVEAMKTSLSRAVEKSKGLARVDTGYMRNNIQQDEVKEEHGVVTGRYVA-- 78 (125) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCChhhhhhceecceeccCCcEEEEeeC-- Confidence 87 56778899999999999999999999999999999999999999999999999999988888888888888874 Q ss_pred cCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 79 EDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGLGL 147 (147) Q Consensus 79 ~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~~ 147 (147) .++|++|+||||++ |||||||+||++++++.+.+.|.++|+++|+- T Consensus 79 ----~~~Ya~~vEfGT~~-------------------~~a~Pfl~pa~~~~~~~~~~~l~~~l~~a~k~ 124 (125) T protein:vir:94 79 ----RADYSSYNEYGTYR-------------------MSAQPFMAPSVAAMTPFFYKAVRDALNKAAKF 124 (125) T ss_pred ----CCCccceeeccccc-------------------CCCCcccchhHHHHHHHHHHHHHHHHHHHhcc Confidence 46899999999986 89999999999999999999999999999999 No 12 >protein:vir:80362 Length: 140 # NCBI annotation: gp10, phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111089;genbank:gi:134288660;genbank:GeneID:4960609 Probab=99.96 E-value=1.3e-32 Score=195.39 Aligned_cols=127 Identities=30% Similarity=0.541 Sum_probs=113.1 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeee-- Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGD-KLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGIT-- 77 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~-~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~-- 77 (147) |+ +|+|+|||+|++.|++|+.++. +++++|++++|.+|++++++++|++||+|++||.++..+...+...+.+|+. T Consensus 1 Ma-~~~i~Gld~l~~~l~~l~~~~~~k~~~~a~~~~a~~v~~~ak~~aP~~tG~l~~~i~~~~~~~~~~~~~~~~~~~~~ 79 (140) T protein:vir:80 1 MS-SIQIVGLADLLADFERLAKSQSTKALRRATVAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGLATAGVRVR 79 (140) T ss_pred Cc-eeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhceeeeccccccccceeeeeeecc Confidence 88 8999999999999999998874 6889999999999999999999999999999999877766666666666653 Q ss_pred ----ccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 78 ----KEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGLGL 147 (147) Q Consensus 78 ----~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~~ 147 (147) ....+++|||+|+||||++ |||||||+||++++++++.++|.++++++|+= T Consensus 80 ~~~~~~~~~~~~y~~f~E~GT~~-------------------~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~k 134 (140) T protein:vir:80 80 TKGKADSPSNAFYWRFDEFGTQH-------------------MKAQPFMRPAFDASIGEAEGAIRTELARAIDQ 134 (140) T ss_pred cccccCCCCCcceeeeeccCCCC-------------------CCCCcchhhhHHHHHHHHHHHHHHHHHHHHHH Confidence 2345678999999999986 89999999999999999999999999999987 No 13 >protein:vir:1437 Length: 140 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536366;genbank:gi:17975171;genbank:GeneID:929147 Probab=99.96 E-value=1.3e-32 Score=195.47 Aligned_cols=127 Identities=28% Similarity=0.517 Sum_probs=115.1 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeee-- Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGD-KLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGIT-- 77 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~-~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~-- 77 (147) |+ +|+|+|||+|++.|++|++++. +++++|++++|.++++++++++|++||+|++||.++..+...+...+.||+. T Consensus 1 M~-~~~i~Gld~l~~~l~~l~~~~~~~~~~~al~~~a~~v~~~ak~~aP~~tG~l~~sI~~~~~~~~~~~~~~~vg~~~~ 79 (140) T protein:vir:14 1 MS-SIQIIGLADLRADFEKLAKSQSAKALRRATLAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGLATAGVRVR 79 (140) T ss_pred Cc-eeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHhhcccccccccccceeEEeeeeec Confidence 77 7999999999999999998875 5789999999999999999999999999999999988888778777777753 Q ss_pred ----ccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 78 ----KEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGLGL 147 (147) Q Consensus 78 ----~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~~ 147 (147) ....+++|||||+||||++ |||||||+||++++++++.++|.++++++|+= T Consensus 80 ~~~~~~~~~~~~y~~f~E~GT~~-------------------~~a~pFl~pa~~~~~~~~~~~~~~~~~~~l~k 134 (140) T protein:vir:14 80 TKGKADSPNNAFYWRFDEFGTQH-------------------MKAQPFMRPAFDASIGEAEGAIRTELARAIDR 134 (140) T ss_pred cccccCCCCccceeeeeccccCC-------------------CCCCcchhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3345678999999999986 89999999999999999999999999999987 No 14 >protein:vir:100243 Length: 140 # NCBI annotation: gp72 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355408;genbank:gi:77864698;genbank:GeneID:3725965 Probab=99.96 E-value=1.6e-32 Score=194.94 Aligned_cols=127 Identities=26% Similarity=0.501 Sum_probs=110.5 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeee-- Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGD-KLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGIT-- 77 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~-~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~-- 77 (147) |+ +|+|+|+|+|++.|++|++++. +++++|++++|.++++++++++|++||+|++||.++..+...+...+.+++. T Consensus 1 Ma-~~~i~Gld~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~~~~~~~ 79 (140) T protein:vir:10 1 MS-SVQILGLADLQADFLKLAKAQSTKALRRATVAGANVIRDEARARAPKKTGKLKRNIVTAALKQKDSPGIATAGVRVR 79 (140) T ss_pred Cc-eeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHHhceecccccccccceeEEeeccc Confidence 88 7999999999999999998874 6899999999999999999999999999999998876655555444444432 Q ss_pred ----ccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 78 ----KEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGLGL 147 (147) Q Consensus 78 ----~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~~ 147 (147) ....++++||||+||||++ |||||||+||+++++++++++|.++++++|+= T Consensus 80 ~~~~~~~~~~~~y~~f~E~GT~~-------------------~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~k 134 (140) T protein:vir:10 80 TKGKADSPNNAFYWRFVELGTQF-------------------MKAEPFMRPAFDASIAQAEGAIRTEIARAIDQ 134 (140) T ss_pred cccccCCCCcccccceeccCcCC-------------------CCCCcchhhhHHHHHHHHHHHHHHHHHHHHHH Confidence 2344678999999999986 89999999999999999999999999988876 No 15 >protein:vir:194 Length: 149 # NCBI annotation: Gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037704;genbank:gi:9634169;genbank:GeneID:1262536 Probab=99.96 E-value=1.9e-32 Score=194.44 Aligned_cols=128 Identities=23% Similarity=0.502 Sum_probs=107.6 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCC-ceEEEEe--- Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGD-KLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGG-TKYVLVG--- 75 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~-~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~-~~~~~Vg--- 75 (147) |+++|+|+|||+|++.|+.|++++. ++++.|++++|++|+++|++++|++||+|++||.++..+...+ .....|+ T Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~si~~~~~~~~~~~~~~~~v~~~~ 81 (149) T protein:vir:19 2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIDRAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRG 81 (149) T ss_pred cceeeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCchhhhhhccccccccccccceeecccccc Confidence 8899999999999999999999875 6889999999999999999999999999999998754433221 1111111 Q ss_pred -----------eeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015262. 76 -----------ITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRG 144 (147) Q Consensus 76 -----------~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~ 144 (147) ........++||||+||||++ |||||||+||+++++++++++|.++|+++ T Consensus 82 ~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~-------------------~~a~PF~~pA~~~~k~~~~~~~~~~l~~~ 142 (149) T protein:vir:19 82 VNPRTGNSDNTMKANNPRNAFYWRFVELGTAN-------------------MPAHPFVRPAYDTREEEAASVAIARMNQA 142 (149) T ss_pred cccccccccceeecCCCCccceeeeeccCCCC-------------------CCCCcchhHHHHHHHHHHHHHHHHHHHHH Confidence 122334568999999999986 89999999999999999999999999999 Q ss_pred hcC Q lcl|NC_015262. 145 LGL 147 (147) Q Consensus 145 i~~ 147 (147) |+= T Consensus 143 l~k 145 (149) T protein:vir:19 143 IDE 145 (149) T ss_pred HHH Confidence 988 No 16 >protein:vir:3873 Length: 128 # NCBI annotation: putative head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680490;swissprot:trembl:p94214;genbank:gi:22296530;interpro:IPR010064;uniprot:P94214;genbank:GeneID:951688 Probab=99.95 E-value=5.3e-32 Score=192.06 Aligned_cols=122 Identities=21% Similarity=0.285 Sum_probs=113.5 Q ss_pred eeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCc------chhhhcceecccccCCCceEEEEee Q lcl|NC_015262. 3 VEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRS------GKLKDGLKVSGVKKKGGTKYVLVGI 76 (147) Q Consensus 3 ~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~t------G~l~~sI~~~~~~~~~~~~~~~Vg~ 76 (147) |+|+|+||+||+++|++|+.++++++++||+++|++++++++.++|+++ |||+++|.++.++..++..++.||| T Consensus 1 m~v~i~Gl~el~~~l~~l~~~~~k~~~~al~~ga~~~~~~~k~~ap~~~~~~~~~~h~~d~I~~~~~k~~~g~~~~~VG~ 80 (128) T protein:vir:38 1 MGVKVTGDAELLANLNKLQFGVAKEARAAVRDGAQKFADKLKSNTPEWDGETDMSGHLRDDIKLSSVRETSGLTEVDVGY 80 (128) T ss_pred CccchhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCCCcccchhhhhhccccccccCceeEEEeee Confidence 6678999999999999999999999999999999999999999999864 5799999998888888889999998 Q ss_pred eccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_015262. 77 TKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGLG 146 (147) Q Consensus 77 ~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~ 146 (147) ++ +.+|||||+||||++ |||||||+||+++++++++++|.++|+++|= T Consensus 81 ~k---~~~~y~~f~E~GT~k-------------------~~a~pF~~pa~~~~~~~~~~~~~~~l~k~i~ 128 (128) T protein:vir:38 81 GK---DTGWRAHFPNSGTSM-------------------QDPQHFIEETQEIMRPVVIAAFLSHLKEGGM 128 (128) T ss_pred cC---CCceEEeeeccCccC-------------------CCCCcchhHHHHHhHHHHHHHHHHHHHhhcC Confidence 65 457999999999986 8999999999999999999999999999998 No 17 >protein:vir:97088 Length: 157 # NCBI annotation: hypothetical protein # Family: family:all:2714 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453568;genbank:gi:84662603;genbank:GeneID:5142503 Probab=99.95 E-value=1.1e-31 Score=190.32 Aligned_cols=144 Identities=18% Similarity=0.263 Sum_probs=118.6 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccC--CCceEEEEeeec Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKK--GGTKYVLVGITK 78 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~--~~~~~~~Vg~~~ 78 (147) ||++|.--.|++|.+.|+.|++...+++++|+.++|++|+++|+.++|++||+|++||.+...+.. .|..++.|||+. T Consensus 1 m~~~~~~~d~s~l~~~l~~l~~~~~~v~R~A~~~ga~vv~dear~~aP~~tG~LkksI~~~~~~~~s~~g~~~~~Vg~~~ 80 (157) T protein:vir:97 1 MKFSIRSVDITGILAGLETVVEHSSDVVRTMTYESAVAVRESAKAFVNDETGKLRNNLYVAYSPEESVEGIQTYAVSWRK 80 (157) T ss_pred CeeEeecccHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhheeeeeccccCCCceEEEEEeecC Confidence 888887777889999999999988999999999999999999999999999999999988654433 466677788865 Q ss_pred cCCcccchhhhhhcccccccccccc----cccccccccC-CCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 79 EDNSKIFYGKFLEFGASAHKIPIKK----GKKKGRIINH-PGVSPKPFLAPAYESKKDEAKNVMKEILKRGLGL 147 (147) Q Consensus 79 ~~~~~~~y~~~vE~GT~~~~~~~~~----~~~~~~~~~~-~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~~ 147 (147) . +++|+||+||||..+...... +.....++++ .+||||||||||||+.++++.++|.++++++|+= T Consensus 81 ~---~a~~g~~vEfG~~~~~~~~~~~~~~~~~~~~~~~t~~~~Pa~PFlRPA~d~~k~~a~~~~~~~l~k~I~e 151 (157) T protein:vir:97 81 K---AAPHGHLLEFGHWQTHAAYRDKDGQWYSSKVKLVNPKWIPAKPFLRPGYDSVAMQIPDIARAAGAKKYAE 151 (157) T ss_pred C---ccceeeeeecCcccccccccCCcccccccccccCCCCcCCCCcccchHHHHhHHHHHHHHHHHHHHHHHH Confidence 4 579999999998654433221 1223334444 5799999999999999999999999999999986 No 18 >protein:vir:1891 Length: 179 # NCBI annotation: gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037671;genbank:gi:9634129;genbank:GeneID:1262520 Probab=99.95 E-value=7.7e-32 Score=191.17 Aligned_cols=128 Identities=23% Similarity=0.416 Sum_probs=107.7 Q ss_pred Cc--eeeeehhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCC-----Ccchhhhcceecccc---cCCCc Q lcl|NC_015262. 1 MS--VEITTEGFDAVLSKIESMGKSGD-KLLNEAVKAGGNVILQDALPRVSK-----RSGKLKDGLKVSGVK---KKGGT 69 (147) Q Consensus 1 M~--~~~~i~Gl~el~~~l~~l~~~~~-~~~~~al~~~a~~v~~~ak~~ap~-----~tG~l~~sI~~~~~~---~~~~~ 69 (147) |+ ++|+|+||+||.++|++|++++. ++++.||+++|++|+++|++++|+ ++|+|.++|.+.... ...+. T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~~~~~~~~~g~ 80 (179) T protein:vir:18 1 MADSVEVSLTGLESLLGKMEAVSEVTRNKAGRFALRKAANIIRDRARSNASRVDDPLTKEAIHKNIVASFSSKQFRRTGD 80 (179) T ss_pred CCceEEEEeecHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccccchhhhhhheeecccccccccccc Confidence 87 89999999999999999999884 688999999999999999999965 578999999775433 33444 Q ss_pred eEEEEeeec----------------------------cCCcccchhhhhhcccccccccccccccccccccCCCCCCCcc Q lcl|NC_015262. 70 KYVLVGITK----------------------------EDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPF 121 (147) Q Consensus 70 ~~~~Vg~~~----------------------------~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PF 121 (147) ..+.||+.. ....++|||||+||||++ |||||| T Consensus 81 ~~~~vgv~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~y~~fvEfGT~k-------------------mpa~PF 141 (179) T protein:vir:18 81 LAFRVGVMGGARQYANTKANVRKGRAGKTYKTSGDKGNPGGDTWYWRFLEFGTEH-------------------TSARPI 141 (179) T ss_pred eeEeeecccccccccccccccccCcccccccccccccCCCCccceeEEeccCCCC-------------------CCCCcc Confidence 455555321 233468999999999986 999999 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 122 LAPAYESKKDEAKNVMKEILKRGLGL 147 (147) Q Consensus 122 l~pA~~~~~~~~~~~i~~~l~~~i~~ 147 (147) |+|||++++++++++|.++|+++|+= T Consensus 142 lrPA~~~~~~~a~~~i~~~l~~~i~k 167 (179) T protein:vir:18 142 LRPAMNGVDNDVINVFSTEMGKAIDR 167 (179) T ss_pred chhhHHhhHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999998887 No 19 >protein:vir:4347 Length: 164 # NCBI annotation: Orf14 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061510;genbank:gi:9635606;genbank:GeneID:1262873 Probab=99.94 E-value=2.6e-31 Score=188.23 Aligned_cols=128 Identities=21% Similarity=0.347 Sum_probs=106.8 Q ss_pred Cc--eeeeehhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCC-----Ccchhhhcceeccc---ccCCCc Q lcl|NC_015262. 1 MS--VEITTEGFDAVLSKIESMGKSGD-KLLNEAVKAGGNVILQDALPRVSK-----RSGKLKDGLKVSGV---KKKGGT 69 (147) Q Consensus 1 M~--~~~~i~Gl~el~~~l~~l~~~~~-~~~~~al~~~a~~v~~~ak~~ap~-----~tG~l~~sI~~~~~---~~~~~~ 69 (147) |+ ++|+|+|||+|.++|++|+.++. ++++.||+++|++|++++++++|+ ++|+|+++|.+... ....+. T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~~~~~~~~~~~ 80 (164) T protein:vir:43 1 MADTVEFSITGLDSLLGKLDSVTDDVKRRGGRAALRKAAMIVVQAAKQGAEKVDDPGTGRSISDNIALRWNGRLFKRTGD 80 (164) T ss_pred CCcceEEeeecHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccCCCccchhhhhhhhhcccCccccccc Confidence 87 68999999999999999999985 689999999999999999999996 56899999976432 222333 Q ss_pred eEEEEee-------------eccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHH Q lcl|NC_015262. 70 KYVLVGI-------------TKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNV 136 (147) Q Consensus 70 ~~~~Vg~-------------~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~ 136 (147) ....||+ ....++++|||||+||||++ |||||||+|||++++++++++ T Consensus 81 ~~~~vg~~~~~~~~~~~~~~~~~~~~~~~y~~f~EfGT~k-------------------m~a~PFlrPA~~~~k~~~~~~ 141 (164) T protein:vir:43 81 LGFRIGVLHGAVLPKKGERSDKTANAPTPHWRLLEFGTED-------------------MRAQPFMRSALADNIAEVTST 141 (164) T ss_pred eeEEecccccccccccccccccCCCCCcceEEEeecCCCC-------------------CCCCcchhhhHHHhHHHHHHH Confidence 3444443 22334568999999999986 999999999999999999999 Q ss_pred HHHHHHHHhcC Q lcl|NC_015262. 137 MKEILKRGLGL 147 (147) Q Consensus 137 i~~~l~~~i~~ 147 (147) |.++|+++|+= T Consensus 142 ~~~~l~~~i~k 152 (164) T protein:vir:43 142 FVSEYEKGIDR 152 (164) T ss_pred HHHHHHHHHHH Confidence 99999999876 No 20 >protein:vir:105089 Length: 133 # NCBI annotation: Gp11 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006591;genbank:gi:46402097;genbank:GeneID:2777955 Probab=99.94 E-value=1.5e-30 Score=184.15 Aligned_cols=124 Identities=19% Similarity=0.318 Sum_probs=105.3 Q ss_pred ceeeeehhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCCCcch----hhhcceecccc-cC--CCceEEE Q lcl|NC_015262. 2 SVEITTEGFDAVLSKIESMGKSGD-KLLNEAVKAGGNVILQDALPRVSKRSGK----LKDGLKVSGVK-KK--GGTKYVL 73 (147) Q Consensus 2 ~~~~~i~Gl~el~~~l~~l~~~~~-~~~~~al~~~a~~v~~~ak~~ap~~tG~----l~~sI~~~~~~-~~--~~~~~~~ 73 (147) .++|+|+||++|+++|++|+.++. ++++.||+++|++|+++++.++|+++|. |++||.++... .. .+..++. T Consensus 1 M~~~~i~Gl~el~~~l~~L~~~~~~k~~~~Al~~~a~~i~~~ak~~ap~~~~~~~~~~~~~I~v~~~~~~~~~~~~~~v~ 80 (133) T protein:vir:10 1 MIRMEVKGLDELERQLTALGEKVATKVLRDAGREALKVVEEDMKQHAGFDETSTGQHMRDSIKIRSSTRKAQGNAVVTLR 80 (133) T ss_pred CeeEeeehHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCcchhhhhhcccccccccccCccceEEEE Confidence 348999999999999999999875 5789999999999999999999998875 89999764322 22 3334455 Q ss_pred EeeeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 74 VGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGLGL 147 (147) Q Consensus 74 Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~~ 147 (147) ||.++ ..+|||+|+||||++ |||||||+|||++++++++++|.++++++|+= T Consensus 81 vg~~~---~~~~y~~f~E~GT~k-------------------~~a~PF~~pA~~~~~~~~~~~~~~~~~~~l~K 132 (133) T protein:vir:10 81 VGPSK---QHHMKVLAQEFGTVK-------------------QVADPFIRPALDYNVQTVLRVLTVEIRNGIQN 132 (133) T ss_pred ecCCC---CccceEeeeccCCCC-------------------CCCCccchHHHHHhHHHHHHHHHHHHHHHhhc Confidence 55433 456899999999997 89999999999999999999999999999999 No 21 >protein:vir:106570 Length: 182 # NCBI annotation: putative protein # Family: family:all:6475 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958588;genbank:gi:41179258;genbank:GeneID:2717106 Probab=99.94 E-value=1.8e-30 Score=183.64 Aligned_cols=139 Identities=17% Similarity=0.214 Sum_probs=108.7 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEee Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAV----KAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGI 76 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al----~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~ 76 (147) |. +|+|+|+|+|.++|+++++.+++.+++++ ++++..++++|+.++|++||+|++||.+.. ...++.....|+ T Consensus 1 m~-~v~i~Gld~L~~kl~~~~~~~~~~v~~a~~~~~~~~a~~v~~~ak~~~PvdtG~Lr~SI~~~~-~~~~~~~~g~V~- 77 (182) T protein:vir:10 1 MI-EVELKGVNELRAKLKKLPDIMAKATANAQENAIEQAEAYAVDELQSSIKYSTGELTRSFKHEV-KVDGDEVIGRWW- 77 (182) T ss_pred Ce-EEEEecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCchhhhhceeeee-eecCCeEEEEee- Confidence 54 89999999999999999987776555555 666777888899999999999999998654 344555556665 Q ss_pred eccCCcccchhhhhhccccccccccccc-----------c------------------------cccccccCCCCCCCcc Q lcl|NC_015262. 77 TKEDNSKIFYGKFLEFGASAHKIPIKKG-----------K------------------------KKGRIINHPGVSPKPF 121 (147) Q Consensus 77 ~~~~~~~~~y~~~vE~GT~~~~~~~~~~-----------~------------------------~~~~~~~~~~~~a~PF 121 (147) +++.|+.|+||||.++....... . ..+..+.+++|||||| T Consensus 78 -----~~~~ya~yvE~GTG~~~~~~~~~~~p~~~~~~~~~~w~~~~~~v~~~~a~~~~~~~~~~~~~~~~~t~G~~aqPF 152 (182) T protein:vir:10 78 -----NSSMVAVFREFGTGLVGERSHKQLPKNVAIIYRQTPWFFPVDSVDLDLTKIYGIPKIKINGKYFYRTTGQPARQF 152 (182) T ss_pred -----cCCCccceeecCcccccccCccccCccceeeeecCCceeeccccccccccccccceeeecCceEeecCCCCCCcc Confidence 34689999999997654322110 0 0133466899999999 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 122 LAPAYESKKDEAKNVMKEILKRGLGL 147 (147) Q Consensus 122 l~pA~~~~~~~~~~~i~~~l~~~i~~ 147 (147) |+||++++++++.+.|.++|+++|+= T Consensus 153 l~pA~~~~~~~i~~~i~~~i~~~l~~ 178 (182) T protein:vir:10 153 MTPAANKMAKEAPEIIKRSIDQELHD 178 (182) T ss_pred hHHHHHHhHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999998887 No 22 >protein:vir:95789 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950593;genbank:gi:119953788;genbank:GeneID:5076859 Probab=99.94 E-value=2.4e-30 Score=182.98 Aligned_cols=114 Identities=17% Similarity=0.209 Sum_probs=102.7 Q ss_pred eeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeeccCCc Q lcl|NC_015262. 3 VEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKEDNS 82 (147) Q Consensus 3 ~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~~~ 82 (147) |+|+|+|+|+|.+.|+++++.+.+.++++|+++|..++++|+.++|++||+|++||.++. ++..+.|| + T Consensus 1 msi~i~Gld~l~~~l~~~~~~~~~~v~~al~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~-----~g~~~~V~------~ 69 (114) T protein:vir:95 1 MAIKWQGIEKLVATISNAQPKAVEQSLQVLKNNGEKGKRIAKQLAPKDTEFLKDHITTSY-----PGMEAHIH------G 69 (114) T ss_pred CeeeeehHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCchhhhhceeeec-----CceEEEee------c Confidence 566778999999999999999988899999999999999999999999999999997643 23445555 3 Q ss_pred ccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_015262. 83 KIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGLG 146 (147) Q Consensus 83 ~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~ 146 (147) .++|++||||||++ |||||||+||++++++++.+.|.++|+++|+ T Consensus 70 ~~~Ya~yvE~GT~~-------------------~~aqPfl~pa~~~~~~~~~~~l~~~l~~~~k 114 (114) T protein:vir:95 70 EAGYDGYQEYGTRF-------------------QPGTPHFRPMMEQIQPQFQKDMTDVMKGAFK 114 (114) T ss_pred CCCccceeecCccc-------------------cCCCccchhhHHHHHHHHHHHHHHHHHhhcC Confidence 46899999999985 8999999999999999999999999999999 No 23 >protein:vir:9708 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795470;genbank:gi:28876221;genbank:GeneID:1257765 Probab=99.94 E-value=3e-30 Score=182.43 Aligned_cols=120 Identities=20% Similarity=0.248 Sum_probs=109.8 Q ss_pred eehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcch----hhhcceeccccc-CCCceEEEEeeeccC Q lcl|NC_015262. 6 TTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGK----LKDGLKVSGVKK-KGGTKYVLVGITKED 80 (147) Q Consensus 6 ~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~----l~~sI~~~~~~~-~~~~~~~~Vg~~~~~ 80 (147) =|+||+||+++|++|+.++.++.++|++++|++++++++.++|+++|. |++||.++.++. ..|..++.|||++ T Consensus 1 mv~Gl~el~~~l~~l~~~~~~~~~~al~~ga~~~~~~~k~~ap~~~~~~~~hl~d~I~~~~~k~~~~g~~~~~VG~~k-- 78 (125) T protein:vir:97 1 MTKGLDEILANLTKLEVKAPKTAKAAVTEVAKEFEKALKANTPVYEVETDERLQEDTVISGFKGANVGIVSKEIGYGK-- 78 (125) T ss_pred CchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCCchhhHHhhhhcccccccccCceEEEEeecC-- Confidence 489999999999999999999999999999999999999999998764 999999877654 4577788999864 Q ss_pred CcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 81 NSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGLGL 147 (147) Q Consensus 81 ~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~~ 147 (147) ..+|||||+||||++ |||||||+||+++++++++++|.++++++|.| T Consensus 79 -~~~~y~~f~E~GT~k-------------------~~~~pF~~pa~~~~k~~~~~~~~~~~~~~L~l 125 (125) T protein:vir:97 79 -ATGWRAHYPNDGTIY-------------------QRGQDFKERTINQMTPKAKQLYAEKVKEGLGL 125 (125) T ss_pred -CCceeEeeeccCccC-------------------CCcCccchHhHHHhHHHHHHHHHHHHHHHhcC Confidence 357999999999996 89999999999999999999999999999999 No 24 >protein:vir:3617 Length: 112 # NCBI annotation: ORF40 # Family: family:all:180 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112703;genbank:gi:13786571;genbank:GeneID:921069 Probab=99.94 E-value=2.2e-30 Score=183.13 Aligned_cols=112 Identities=21% Similarity=0.357 Sum_probs=98.9 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeeccC Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKED 80 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~ 80 (147) |+++|+|+|||+|++.|+++.. .+.+++++++++..++++++.++|++||+|++||.++. ..+...+.||+ T Consensus 1 M~~~i~i~Gld~l~~~L~~~~~--~~~~~~al~~~~~~i~~~ak~~aPvdTG~Lr~si~~~~---~~~~~~~~V~~---- 71 (112) T protein:vir:36 1 MKSSLSFKGIDQLVKHLDKAAS--LKGVQQVVKSNTSNMTANMQKLVPVDTGYMKRSIKMEL---TEGGFSGQAGP---- 71 (112) T ss_pred CceeeeehhHHHHHHHHHhhhh--HHHHHHHHHHHHHHHHHHHHHhCCCCchhhhhceeeee---cCCceEEEeec---- Confidence 9999999999999999998754 36689999999999999999999999999999997543 33445677764 Q ss_pred CcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_015262. 81 NSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILK 142 (147) Q Consensus 81 ~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~ 142 (147) .++|++||||||++ |||||||+||++.+++++.+.|.+.|+ T Consensus 72 --~~~Ya~~vE~GT~k-------------------~~a~Pfl~pa~~~~~~~~~~~i~~~lr 112 (112) T protein:vir:36 72 --HTDYSAYVEYGTRF-------------------QSAQPFVKPAYNEQKGVFIKDLERLLK 112 (112) T ss_pred --CCCccceeeccccc-------------------cCCCcchhhhHHHHHHHHHHHHHHHcC Confidence 46899999999986 899999999999999999999988888 No 25 >protein:vir:5978 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690678;genbank:geneid:6329146;genbank:gi:22855072;interpro:IPR011693;uniprot:O48447;genbank:GeneID:955318 Probab=99.93 E-value=8.4e-30 Score=179.98 Aligned_cols=133 Identities=18% Similarity=0.244 Sum_probs=112.8 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeeccC Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKED 80 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~ 80 (147) |+++++++|+++|.+.|+.+++.+.+.+++++.++|..++++++.++|++||+|++||.+.. ..+..++.||. T Consensus 4 ms~~i~~~g~~~l~~~l~~~~~~~~~~v~~~l~~~a~~i~~~ak~~apv~TG~Lr~SI~~~~---~~~g~~~~V~~---- 76 (144) T protein:vir:59 4 MSVRIDPSWRRIMSRNVRTFSGHVLTQVEQVIIKTAEKIAGLAASLAPVDEGNLKNSIQIDY---KNNGLTAEITV---- 76 (144) T ss_pred ceeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcCeeEEe---ecCcEEEEEec---- Confidence 88999999999999999999999999999999999999999999999999999999997643 22334556653 Q ss_pred Ccccchhhhhhcccccccccccccc--------cccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_015262. 81 NSKIFYGKFLEFGASAHKIPIKKGK--------KKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILK 142 (147) Q Consensus 81 ~~~~~y~~~vE~GT~~~~~~~~~~~--------~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~ 142 (147) ...|+.||||||.+|...+...+ ..+..+.+++|||||||+||++.+++.+.+.|++.+- T Consensus 77 --~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~~~~~i~~~~g 144 (144) T protein:vir:59 77 --GAEYAIYVEYGTGIYAVDGNGRKTPWTYYSPKLGRYVRTQGAPAQPFFWPAVEEGGEYFEREMRRLRG 144 (144) T ss_pred --CCCccchhhcCccccccCCCccccccccccccccceecCCCCCCCcchhHHHHHHHHHHHHHHHHhcC Confidence 46899999999999887765433 2355667899999999999999999988877666665 No 26 >protein:vir:9930 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795692;genbank:gi:28876456;genbank:GeneID:1257995 Probab=99.93 E-value=2.2e-29 Score=177.67 Aligned_cols=108 Identities=21% Similarity=0.262 Sum_probs=98.1 Q ss_pred ehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeeccCCcccch Q lcl|NC_015262. 7 TEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKEDNSKIFY 86 (147) Q Consensus 7 i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~~~~~~y 86 (147) |+|||+|++.|+++++.+.+.+++++.++|..++++++.++|++||+|++||.++.. +...+.|+ ++++| T Consensus 1 i~Gld~l~~~l~~~~~~~~~~v~~al~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~~----~~~~~~v~------~~~~Y 70 (108) T protein:vir:99 1 MRGLDRFLRSVERKQKSVRIAVDKELSKSAARIERQAKILAPVDTGWLRAQIYSEQQ----RLLHYRVV------SPALY 70 (108) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCchhhhcceeeeec----CcEEEEee------cCccc Confidence 999999999999999999999999999999999999999999999999999976542 33445554 45689 Q ss_pred hhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_015262. 87 GKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKR 143 (147) Q Consensus 87 ~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~ 143 (147) ++|+||||++ |+|||||+||++.+++++.+.|.+.|+| T Consensus 71 a~~vE~GT~~-------------------m~a~Pf~~pa~~~~~~~~~~~i~~~lrk 108 (108) T protein:vir:99 71 SIYLELGTRK-------------------MEAQSFLDPALRKEWPVLMANIKKMFKR 108 (108) T ss_pred chhcccCccc-------------------cCCCcchhhhHHHHHHHHHHHHHHHhcC Confidence 9999999985 9999999999999999999999999999 No 27 >protein:vir:78858 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285365;genbank:gi:148717893;genbank:GeneID:5246989 Probab=99.93 E-value=2.5e-29 Score=177.37 Aligned_cols=109 Identities=20% Similarity=0.402 Sum_probs=98.7 Q ss_pred eeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CCCcchhhhcceecccccCCCceEEEEeeec Q lcl|NC_015262. 5 ITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRV------SKRSGKLKDGLKVSGVKKKGGTKYVLVGITK 78 (147) Q Consensus 5 ~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~a------p~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~ 78 (147) |+|+|||+|++.|++|++.+.+.++.++++++..+.+++++++ |++||+|++||.++. .+...+.||. T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~----~g~~~~~v~~-- 74 (115) T protein:vir:78 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKK----TGDLQYTITS-- 74 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeee----cCceEEEeec-- Confidence 9999999999999999999999999999999999999999998 889999999998653 2445556653 Q ss_pred cCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_015262. 79 EDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILK 142 (147) Q Consensus 79 ~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~ 142 (147) .++||+|+||||++ |||||||+|||+.+++++.+.|.+.++ T Consensus 75 ----~~~Ya~~vE~GT~k-------------------m~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:78 75 ----HAAYSGFLEFGTRY-------------------MEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred ----Cccchhhhcccccc-------------------cCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 46899999999986 999999999999999999999999999 No 28 >protein:vir:96225 Length: 115 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239574;genbank:gi:66395330;genbank:GeneID:5132773 Probab=99.93 E-value=2.5e-29 Score=177.37 Aligned_cols=109 Identities=20% Similarity=0.402 Sum_probs=98.7 Q ss_pred eeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CCCcchhhhcceecccccCCCceEEEEeeec Q lcl|NC_015262. 5 ITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRV------SKRSGKLKDGLKVSGVKKKGGTKYVLVGITK 78 (147) Q Consensus 5 ~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~a------p~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~ 78 (147) |+|+|||+|++.|++|++.+.+.++.++++++..+.+++++++ |++||+|++||.++. .+...+.||. T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~----~g~~~~~v~~-- 74 (115) T protein:vir:96 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKK----TGDLQYTITS-- 74 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeee----cCceEEEeec-- Confidence 9999999999999999999999999999999999999999998 889999999998653 2445556653 Q ss_pred cCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_015262. 79 EDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILK 142 (147) Q Consensus 79 ~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~ 142 (147) .++||+|+||||++ |||||||+|||+.+++++.+.|.+.++ T Consensus 75 ----~~~Ya~~vE~GT~k-------------------m~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:96 75 ----HAAYSGFLEFGTRY-------------------MEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred ----Cccchhhhcccccc-------------------cCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 46899999999986 999999999999999999999999999 No 29 >protein:vir:9312 Length: 115 # NCBI annotation: phi Mu50B-like protein # Family: family:all:180 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803290;genbank:gi:29028600;genbank:GeneID:1258048 Probab=99.93 E-value=2.5e-29 Score=177.37 Aligned_cols=109 Identities=20% Similarity=0.402 Sum_probs=98.7 Q ss_pred eeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CCCcchhhhcceecccccCCCceEEEEeeec Q lcl|NC_015262. 5 ITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRV------SKRSGKLKDGLKVSGVKKKGGTKYVLVGITK 78 (147) Q Consensus 5 ~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~a------p~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~ 78 (147) |+|+|||+|++.|++|++.+.+.++.++++++..+.+++++++ |++||+|++||.++. .+...+.||. T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~----~g~~~~~v~~-- 74 (115) T protein:vir:93 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKK----TGDLQYTITS-- 74 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeee----cCceEEEeec-- Confidence 9999999999999999999999999999999999999999998 889999999998653 2445556653 Q ss_pred cCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_015262. 79 EDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILK 142 (147) Q Consensus 79 ~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~ 142 (147) .++||+|+||||++ |||||||+|||+.+++++.+.|.+.++ T Consensus 75 ----~~~Ya~~vE~GT~k-------------------m~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:93 75 ----HAAYSGFLEFGTRY-------------------MEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred ----Cccchhhhcccccc-------------------cCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 46899999999986 999999999999999999999999999 No 30 >protein:vir:96358 Length: 115 # NCBI annotation: ORF045 # Family: family:all:180 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239651;genbank:gi:66395408;genbank:GeneID:5132834 Probab=99.93 E-value=2.5e-29 Score=177.37 Aligned_cols=109 Identities=20% Similarity=0.402 Sum_probs=98.7 Q ss_pred eeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CCCcchhhhcceecccccCCCceEEEEeeec Q lcl|NC_015262. 5 ITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRV------SKRSGKLKDGLKVSGVKKKGGTKYVLVGITK 78 (147) Q Consensus 5 ~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~a------p~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~ 78 (147) |+|+|||+|++.|++|++.+.+.++.++++++..+.+++++++ |++||+|++||.++. .+...+.||. T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~----~g~~~~~v~~-- 74 (115) T protein:vir:96 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKK----TGDLQYTITS-- 74 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeee----cCceEEEeec-- Confidence 9999999999999999999999999999999999999999998 889999999998653 2445556653 Q ss_pred cCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_015262. 79 EDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILK 142 (147) Q Consensus 79 ~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~ 142 (147) .++||+|+||||++ |||||||+|||+.+++++.+.|.+.++ T Consensus 75 ----~~~Ya~~vE~GT~k-------------------m~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:96 75 ----HAAYSGFLEFGTRY-------------------MEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred ----Cccchhhhcccccc-------------------cCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 46899999999986 999999999999999999999999999 No 31 >protein:vir:103917 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873996;genbank:gi:118430771;genbank:GeneID:4525409 Probab=99.93 E-value=2.5e-29 Score=177.37 Aligned_cols=109 Identities=20% Similarity=0.402 Sum_probs=98.7 Q ss_pred eeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CCCcchhhhcceecccccCCCceEEEEeeec Q lcl|NC_015262. 5 ITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRV------SKRSGKLKDGLKVSGVKKKGGTKYVLVGITK 78 (147) Q Consensus 5 ~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~a------p~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~ 78 (147) |+|+|||+|++.|++|++.+.+.++.++++++..+.+++++++ |++||+|++||.++. .+...+.||. T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~----~g~~~~~v~~-- 74 (115) T protein:vir:10 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKK----TGDLQYTITS-- 74 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeee----cCceEEEeec-- Confidence 9999999999999999999999999999999999999999998 889999999998653 2445556653 Q ss_pred cCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_015262. 79 EDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILK 142 (147) Q Consensus 79 ~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~ 142 (147) .++||+|+||||++ |||||||+|||+.+++++.+.|.+.++ T Consensus 75 ----~~~Ya~~vE~GT~k-------------------m~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:10 75 ----HAAYSGFLEFGTRY-------------------MEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred ----Cccchhhhcccccc-------------------cCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 46899999999986 999999999999999999999999999 No 32 >protein:vir:97144 Length: 115 # NCBI annotation: ORF047 # Family: family:all:180 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239729;genbank:gi:66394911;genbank:GeneID:5130877 Probab=99.93 E-value=2.5e-29 Score=177.37 Aligned_cols=109 Identities=20% Similarity=0.402 Sum_probs=98.7 Q ss_pred eeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CCCcchhhhcceecccccCCCceEEEEeeec Q lcl|NC_015262. 5 ITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRV------SKRSGKLKDGLKVSGVKKKGGTKYVLVGITK 78 (147) Q Consensus 5 ~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~a------p~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~ 78 (147) |+|+|||+|++.|++|++.+.+.++.++++++..+.+++++++ |++||+|++||.++. .+...+.||. T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~----~g~~~~~v~~-- 74 (115) T protein:vir:97 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKK----TGDLQYTITS-- 74 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeee----cCceEEEeec-- Confidence 9999999999999999999999999999999999999999998 889999999998653 2445556653 Q ss_pred cCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_015262. 79 EDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILK 142 (147) Q Consensus 79 ~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~ 142 (147) .++||+|+||||++ |||||||+|||+.+++++.+.|.+.++ T Consensus 75 ----~~~Ya~~vE~GT~k-------------------m~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:97 75 ----HAAYSGFLEFGTRY-------------------MEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred ----Cccchhhhcccccc-------------------cCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 46899999999986 999999999999999999999999999 No 33 >protein:vir:106623 Length: 115 # NCBI annotation: ORF049 # Family: family:all:180 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239497;genbank:gi:66395260;genbank:GeneID:4555777 Probab=99.92 E-value=5.1e-29 Score=175.69 Aligned_cols=109 Identities=23% Similarity=0.391 Sum_probs=97.4 Q ss_pred eeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CCCcchhhhcceecccccCCCceEEEEeeec Q lcl|NC_015262. 5 ITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRV------SKRSGKLKDGLKVSGVKKKGGTKYVLVGITK 78 (147) Q Consensus 5 ~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~a------p~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~ 78 (147) |+|+|||+|++.|+++++.+.+.+++++++++..+++++++++ |++||+|++||.++. .+...+.|+ T Consensus 1 i~i~Gld~L~~~l~~~~~~~~~~~~~al~~~~~~i~~~a~~~a~~~~~~pv~TG~Lr~sI~~~~----~g~~~~~v~--- 73 (115) T protein:vir:10 1 MQSKGLKKLMNHLKVMHDDIEDDVDDILKNNAKEGVGIAVSNAKEVMNKGYWTGNLASLIEVKK----IGDLHYRVI--- 73 (115) T ss_pred CeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcchhhhhceeeee----cCcEEEEee--- Confidence 9999999999999999999999999999999999999999988 789999999997642 344445554 Q ss_pred cCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_015262. 79 EDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILK 142 (147) Q Consensus 79 ~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~ 142 (147) +.++|++|+||||++ |+|||||+|||+++++.+++.|.+.|. T Consensus 74 ---~~~~Ya~~vEfGT~k-------------------m~a~PFl~PA~~~~k~~~~~~i~~~i~ 115 (115) T protein:vir:10 74 ---STAHYSGFLEFGTRY-------------------MEPAPFMFPTYQTLKKSTINDLKRLLS 115 (115) T ss_pred ---CCCccchheeccccc-------------------CCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 457899999999986 999999999999999999999888888 No 34 >protein:vir:99744 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004311;genbank:gi:122891765;genbank:GeneID:4712299 Probab=99.92 E-value=4.9e-29 Score=175.78 Aligned_cols=109 Identities=20% Similarity=0.384 Sum_probs=98.7 Q ss_pred eeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CCCcchhhhcceecccccCCCceEEEEeeec Q lcl|NC_015262. 5 ITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRV------SKRSGKLKDGLKVSGVKKKGGTKYVLVGITK 78 (147) Q Consensus 5 ~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~a------p~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~ 78 (147) |+|+|||+|++.|+++++.+.+.+++++++++..+++++++++ |++||+|++||.++. ++...+.|+ T Consensus 1 i~i~Gld~L~~~l~~~~~~~~~~v~~av~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~SI~~~~----~g~~~~~V~--- 73 (115) T protein:vir:99 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKK----TVDLQYTIT--- 73 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcchhhhhceeeee----cCcEEEEec--- Confidence 9999999999999999999999999999999999999999997 999999999998653 344555564 Q ss_pred cCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_015262. 79 EDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILK 142 (147) Q Consensus 79 ~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~ 142 (147) +.++|++|+||||++ |+|||||+|||+++++.+++.|.+.++ T Consensus 74 ---~~~~Ya~~vE~GT~~-------------------m~a~PFl~PA~~~~k~~~~~~l~~~~k 115 (115) T protein:vir:99 74 ---SHAAYSGFLEFGTRY-------------------MEAEPFMWPVYEVIRKSTVEELKTLFE 115 (115) T ss_pred ---CCccccccccccccc-------------------cCCCCcchhhHHHHHHHHHHHHHHHhC Confidence 457899999999986 999999999999999999999999888 No 35 >protein:vir:94108 Length: 149 # NCBI annotation: ORF029 # Family: family:all:180 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240238;genbank:gi:66395914;genbank:GeneID:5133277 Probab=99.92 E-value=4.2e-29 Score=176.18 Aligned_cols=127 Identities=15% Similarity=0.304 Sum_probs=106.7 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeeccC Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKED 80 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~ 80 (147) |+ +++ .|+|+|.+.|+++++++.+.+++++.+++..|+++|+.++|++||+|++||.++.. .++..+.|+ T Consensus 13 Ma-~~~-~Gld~l~~~L~~~~~~~~~~~~~al~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~---~~g~~~~V~----- 82 (149) T protein:vir:94 13 MA-KVK-YGADSMVVELDKFDKKIEEWVKKGIAKTTTKIYNTAVALAPVDLGFLEESIDFKYF---DGGLSSVIS----- 82 (149) T ss_pred HH-HHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhhcCeeEEee---CCcEEEEEe----- Confidence 85 454 39999999999999999999999999999999999999999999999999976432 233455555 Q ss_pred Ccccchhhhhhcccccccccccccc----------cccccccCCCCCCCcchhhHHHHHHHHHHHHHH Q lcl|NC_015262. 81 NSKIFYGKFLEFGASAHKIPIKKGK----------KKGRIINHPGVSPKPFLAPAYESKKDEAKNVMK 138 (147) Q Consensus 81 ~~~~~y~~~vE~GT~~~~~~~~~~~----------~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~ 138 (147) +...|++||||||++|...+.... ..+..+.|++|||||||+||++.+++++.+.|. T Consensus 83 -~~~~YA~~VE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~~~~i~~~i~ 149 (149) T protein:vir:94 83 -VGADYAIYVEYGTGIYATGPGGSRATKIPWSFKGDDGEWYTTYGQAPQPFWNPAIDAGRKTFEQYFS 149 (149) T ss_pred -cCCCcccccccCccccccCCCccccccccceeecCccceecCCCCCCCcchHHHHHHHHHHHHHhhC Confidence 346899999999998876654322 234677899999999999999999999999999 No 36 >protein:vir:107099 Length: 137 # NCBI annotation: conserved phage protein # Family: family:all:180 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950610;genbank:gi:119953690;genbank:GeneID:4643108 Probab=99.92 E-value=9e-29 Score=174.34 Aligned_cols=127 Identities=13% Similarity=0.248 Sum_probs=106.7 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeeccC Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKED 80 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~ 80 (147) |+- + +.|+|+|.+.|+++++.+.+.+++++++++..++++|++++|++||+|++||.+.. ..++..+.|| T Consensus 1 Ma~-~-~~Gl~~l~~~l~~~~~~~~~~~~~al~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~---~~~~~~~~V~----- 70 (137) T protein:vir:10 1 MAK-V-KYGNWELVKELEDFEKETIRWAKKGIAKTTTIIHNSIVSNMPVDTGYLRESVSMDF---KKGGLTGVIN----- 70 (137) T ss_pred Cch-h-HhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCeeEEe---eCCcEEEEEe----- Confidence 874 3 46999999999999999999999999999999999999999999999999997542 2233445555 Q ss_pred Ccccchhhhhhcccccccccccccc----------cccccccCCCCCCCcchhhHHHHHHHHHHHHHH Q lcl|NC_015262. 81 NSKIFYGKFLEFGASAHKIPIKKGK----------KKGRIINHPGVSPKPFLAPAYESKKDEAKNVMK 138 (147) Q Consensus 81 ~~~~~y~~~vE~GT~~~~~~~~~~~----------~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~ 138 (147) +...|++||||||++|...+.... ..+..+.|++|||||||+||++++++++.+.|. T Consensus 71 -~~~~Ya~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~i~k~i~ 137 (137) T protein:vir:10 71 -IGSEYAVYVNYGTGIYAVGPGGSRAKNIPWCYKDADGHWHTTKGQHAQPFWEPAIDEGRAFFNKYFS 137 (137) T ss_pred -cCCCcccccccCccccccCCCccccccccceeeccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Confidence 346899999999998876654332 245567899999999999999999999999999 No 37 >protein:vir:105330 Length: 137 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950673;genbank:gi:119967843;genbank:GeneID:4643209 Probab=99.92 E-value=1.3e-28 Score=173.45 Aligned_cols=127 Identities=13% Similarity=0.254 Sum_probs=106.0 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeeccC Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKED 80 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~ 80 (147) |+ +++ .|+|+|.+.|+++++.+++.+++++.+++..|++++++++|++||+|++||.+.. ..+.....|| T Consensus 1 Ma-~~~-~G~~~l~~~l~~~~~~~~~~~~~al~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~---~~~~~~~~V~----- 70 (137) T protein:vir:10 1 MA-KVK-YGNWDLVKELEEFEKETIRWAKKGIAKTTTIIHNSIVSNMPVDTGYLRESVSMDF---KKGGLTGVIN----- 70 (137) T ss_pred Cc-cch-hCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCeeeEe---cCCcEEEEEe----- Confidence 77 343 4999999999999999999999999999999999999999999999999997542 2233444554 Q ss_pred Ccccchhhhhhcccccccccccccc----------cccccccCCCCCCCcchhhHHHHHHHHHHHHHH Q lcl|NC_015262. 81 NSKIFYGKFLEFGASAHKIPIKKGK----------KKGRIINHPGVSPKPFLAPAYESKKDEAKNVMK 138 (147) Q Consensus 81 ~~~~~y~~~vE~GT~~~~~~~~~~~----------~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~ 138 (147) +...|++||||||++|...+.... ..+..+.|++|||||||+||++++++++.+.|. T Consensus 71 -~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 137 (137) T protein:vir:10 71 -IGSEYAVYVNYGTGIYAVGPGGSRAKNIPWRYKDADGHWHTTKGQHAQPFWEPAIDEGRAFFNKYFS 137 (137) T ss_pred -cCCccccccccCccccccCCCcccccccceeeeccccccccCCCCCCCcchhHHHHHHHHHHHHhhC Confidence 346899999999998876655432 234567889999999999999999999999999 No 38 >protein:vir:94796 Length: 137 # NCBI annotation: ORF050 # Family: family:all:180 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240540;genbank:gi:66396237;genbank:GeneID:5133576 Probab=99.92 E-value=1.2e-28 Score=173.69 Aligned_cols=127 Identities=13% Similarity=0.235 Sum_probs=106.9 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeeccC Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKED 80 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~ 80 (147) |+ +++ .|+|+|.+.|+++++++++.+++++.++|..+++++++++|++||+|++||.+.. ..++..+.|| T Consensus 1 Ma-~~~-~G~~~l~~~L~~~~~~~~~~~~~al~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~---~~~~~~~~V~----- 70 (137) T protein:vir:94 1 MA-KVK-YGNWDLVKELENYERDIERWVKRGIAKTTVKIHNTIISLMPVDTGYLRESVTMDF---KDGGFTGVIN----- 70 (137) T ss_pred Cc-hhH-HhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCceeEe---ecCcEEEEEe----- Confidence 77 454 3999999999999999999999999999999999999999999999999997543 2233455665 Q ss_pred Ccccchhhhhhcccccccccccccc----------cccccccCCCCCCCcchhhHHHHHHHHHHHHHH Q lcl|NC_015262. 81 NSKIFYGKFLEFGASAHKIPIKKGK----------KKGRIINHPGVSPKPFLAPAYESKKDEAKNVMK 138 (147) Q Consensus 81 ~~~~~y~~~vE~GT~~~~~~~~~~~----------~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~ 138 (147) +...|++||||||.+|...+.... ..+..+.+.+|||||||+||++.+++++.+.|. T Consensus 71 -~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:94 71 -IGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRVFFNKYFS 137 (137) T ss_pred -cCCCcccccccCccccccCCCcccccccccceeccCCceeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 346899999999999887765422 234556688999999999999999999999999 No 39 >protein:vir:105916 Length: 149 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004379;genbank:gi:122891834;genbank:GeneID:4712387 Probab=99.92 E-value=8.1e-29 Score=174.59 Aligned_cols=127 Identities=15% Similarity=0.295 Sum_probs=106.7 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeeccC Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKED 80 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~ 80 (147) |+ +++ .|+|+|.+.|+++++++.+.+++++.+++..|+++|+.++|++||+|++||.++. ..+...+.|| T Consensus 13 Ma-~v~-~Gld~l~~~l~~~~~~~~~~~~~~l~~~a~~v~~~ak~~aPvdTG~L~~SI~~~~---~~~g~~~~V~----- 82 (149) T protein:vir:10 13 MA-KVK-YGADSMVVELDKFDKKIEEWVKKGIAKTTTKIYNTAVALAPVDLGFLEESIDFKY---FDGGLSSVIS----- 82 (149) T ss_pred hH-HHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhhccceEEe---cCCcEEEEEe----- Confidence 85 454 4999999999999999999999999999999999999999999999999997653 2233455665 Q ss_pred Ccccchhhhhhcccccccccccccc----------cccccccCCCCCCCcchhhHHHHHHHHHHHHHH Q lcl|NC_015262. 81 NSKIFYGKFLEFGASAHKIPIKKGK----------KKGRIINHPGVSPKPFLAPAYESKKDEAKNVMK 138 (147) Q Consensus 81 ~~~~~y~~~vE~GT~~~~~~~~~~~----------~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~ 138 (147) +...|++||||||+.+...+.... .....+.|++|||||||+||++.+++++.+.|. T Consensus 83 -~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~k~~i~~~i~ 149 (149) T protein:vir:10 83 -VGADYAIYVEYGTGIYATGPGGSRATKIPWSFKGDDGEWYTTYGQAPQPFWNPAIDAGRKTFEQYFS 149 (149) T ss_pred -cCCCcccccccCccccccCCcccccccccceeeccccceecCCCCCCCcchhHHHHHHHHHHHHhhC Confidence 346899999999998876654322 234567899999999999999999999999999 No 40 >protein:vir:94654 Length: 142 # NCBI annotation: tail component protein # Family: family:all:1084 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579211;genbank:gi:93007447;genbank:GeneID:5076773 Probab=99.92 E-value=2.8e-28 Score=171.65 Aligned_cols=133 Identities=20% Similarity=0.288 Sum_probs=108.5 Q ss_pred Cceeeeeh-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeecc Q lcl|NC_015262. 1 MSVEITTE-GFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKE 79 (147) Q Consensus 1 M~~~~~i~-Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~ 79 (147) |+ +++++ |+++|.+.|+.+.+.+++.+++++..+|..++.+|++++|++||+|++||.+. +...+....+.|| T Consensus 1 Ma-~~~~~~~~~~l~~~l~~~~~~~~~~~~~~l~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~-~~~~g~~~~~~v~---- 74 (142) T protein:vir:94 1 MA-GLNYRVNSTEFQGALRAALDRLTGAAREATEAAANDMVNMAKGLCPVDTGRLRSSIQAV-PSGGRFSFSVTIG---- 74 (142) T ss_pred Cc-eeEEEecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceee-eccCCceEEEEEe---- Confidence 77 45544 89999999999999999999999999999999999999999999999999754 3344444556665 Q ss_pred CCcccchhhhhhcccccccccccccc--------cccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_015262. 80 DNSKIFYGKFLEFGASAHKIPIKKGK--------KKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILK 142 (147) Q Consensus 80 ~~~~~~y~~~vE~GT~~~~~~~~~~~--------~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~ 142 (147) +.++|+.||||||++|.+.|+..+ .+...+.|||++|||||+||++.+++++.+.|++ |+ T Consensus 75 --~~~~YA~~vE~Gt~~~~i~pk~~k~l~~~~~~~~~~~v~~pG~~~~pfl~~A~~~~~~~i~~~~~~-~~ 142 (142) T protein:vir:94 75 --TNVTYAADVEYGTAPHVIVPKDKKALYWPGAAHPVAKVNHPGTRAQPFMRPAIAAASTFLRNHAKG-IR 142 (142) T ss_pred --cCcccchhhhccCCCceeccCCCccceecccceeeeeeeecCCCCCcchhHHHHHHHHHHHHHHHh-cC Confidence 356899999999999877765433 2455678899999999999999999888665544 44 No 41 >protein:vir:95894 Length: 137 # NCBI annotation: ORF046 # Family: family:all:180 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240389;genbank:gi:66396083;genbank:GeneID:5133405 Probab=99.91 E-value=3.4e-28 Score=171.14 Aligned_cols=127 Identities=13% Similarity=0.229 Sum_probs=107.4 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeeccC Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKED 80 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~ 80 (147) |+-. ++|+++|.+.|+++++++++.+++++.+++..++++++.++|++||+|++||.+.. ..+...+.|| T Consensus 1 Ma~~--~~G~~~l~~~l~~~~~~~~~~~~~~~~~~a~~v~~~ak~~aPv~TG~L~~Si~~~~---~~~~~~~~V~----- 70 (137) T protein:vir:95 1 MAKV--KYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDF---KDGGFTGVIN----- 70 (137) T ss_pred Cchh--HHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcCeeeEe---eCCceEEEEe----- Confidence 7754 47999999999999999999999999999999999999999999999999997543 2233455665 Q ss_pred Ccccchhhhhhcccccccccccccc----------cccccccCCCCCCCcchhhHHHHHHHHHHHHHH Q lcl|NC_015262. 81 NSKIFYGKFLEFGASAHKIPIKKGK----------KKGRIINHPGVSPKPFLAPAYESKKDEAKNVMK 138 (147) Q Consensus 81 ~~~~~y~~~vE~GT~~~~~~~~~~~----------~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~ 138 (147) +...|++||||||.+|...+...+ ..+..+.+.+|||||||+||++.+++++.+.|. T Consensus 71 -~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~i~k~l~ 137 (137) T protein:vir:95 71 -IGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred -cCCCcccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 346899999999998887765432 234566778999999999999999999999999 No 42 >protein:vir:96121 Length: 137 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240082;genbank:gi:66395767;genbank:GeneID:5133101 Probab=99.91 E-value=3.7e-28 Score=170.96 Aligned_cols=127 Identities=15% Similarity=0.240 Sum_probs=106.9 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeeccC Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKED 80 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~ 80 (147) |+. +. .|+|+|++.|+++++.+++.++++|.++|..++++|+.++|+|||+|++||.+... .++..+.||. T Consensus 1 Ma~-~~-~G~~~l~~~l~~~~~~~~~~~~~~l~~~a~~~~~~ak~~~pvdTG~L~~Si~~~~~---~~g~~~~V~~---- 71 (137) T protein:vir:96 1 MAK-VK-YGNWDLVAELEDYRDEMEEWVKKGILKTTLAIYNTAVALAPVDLGFLKESIDFKVT---DGGFSSVISV---- 71 (137) T ss_pred Cch-hH-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCccchhcCceeEee---cCceEEEEec---- Confidence 873 43 69999999999999999999999999999999999999999999999999975432 2234556653 Q ss_pred Ccccchhhhhhcccccccccccccc----------cccccccCCCCCCCcchhhHHHHHHHHHHHHHH Q lcl|NC_015262. 81 NSKIFYGKFLEFGASAHKIPIKKGK----------KKGRIINHPGVSPKPFLAPAYESKKDEAKNVMK 138 (147) Q Consensus 81 ~~~~~y~~~vE~GT~~~~~~~~~~~----------~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~ 138 (147) .+.|+.||||||++|...+.... ..+..+.|++|||||||+||++.+++.+.+.|. T Consensus 72 --~~~YA~yvE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~pFl~pA~~~~~~~i~k~i~ 137 (137) T protein:vir:96 72 --GAEYAIYVEFGTGIYATGPGGSRARKLPWTYKGDDGEWHTTYGQQAQPFWNPAIDEGRKVFNRYFS 137 (137) T ss_pred --CCCcccccccCccccccCCCccccccccceeeccCcceeecCCCCCCcchhHHHHHHHHHHHHhhC Confidence 46899999999998877665322 245567889999999999999999999999999 No 43 >protein:vir:743 Length: 108 # NCBI annotation: unknown # Family: family:all:180 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108720;genbank:gi:13487842;genbank:GeneID:920877 Probab=99.91 E-value=3.7e-28 Score=170.98 Aligned_cols=108 Identities=19% Similarity=0.317 Sum_probs=94.0 Q ss_pred eeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeeccCCccc Q lcl|NC_015262. 5 ITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKEDNSKI 84 (147) Q Consensus 5 ~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~~~~~ 84 (147) |+|+|||+|++.|+++.. .+.++++++++|..|+++|+.++|++||+|++||.++. ..+...+.|+ +.+ T Consensus 1 i~i~Gld~l~~~l~~~~~--~~~~~~al~~~a~~i~~~ak~~aPv~TG~Lr~si~~~~---~~~~~~~~V~------~~~ 69 (108) T protein:vir:74 1 MKITGIDALQKKLRKNAT--LDDVKHVVKSNTASMNKNMQNLAPVDTGNMKRSITSEF---TDGGLSGTTG------PHT 69 (108) T ss_pred CcchhHHHHHHHHHHhhh--HHHHHHHHHHHHHHHHHHHHHhCCCCchhhhccceeee---ecCceEEEee------cCC Confidence 999999999999998653 46688999999999999999999999999999997543 2334456665 456 Q ss_pred chhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_015262. 85 FYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILK 142 (147) Q Consensus 85 ~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~ 142 (147) +|++||||||++ |||||||+||++++++++.+.|.+.|+ T Consensus 70 ~Ya~~vE~GT~k-------------------m~aqpf~~pa~~~~~~~~~~~i~~~~k 108 (108) T protein:vir:74 70 DYAGYVEYGTRF-------------------QSAQPFVKPAFNIQKKVFTNDLERLTK 108 (108) T ss_pred Ccccceeccccc-------------------cCCCcchhhHHHHHHHHHHHHHHHHcC Confidence 899999999986 999999999999999999999988888 No 44 >protein:vir:79988 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430006;genbank:gi:156604061;genbank:GeneID:5525448 Probab=99.91 E-value=9.8e-28 Score=168.67 Aligned_cols=121 Identities=21% Similarity=0.205 Sum_probs=103.2 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcc--hhhhcceecccccCC--CceEEEEee Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSG--KLKDGLKVSGVKKKG--GTKYVLVGI 76 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG--~l~~sI~~~~~~~~~--~~~~~~Vg~ 76 (147) |+++++++||++ .|+.|+.++++..+.|+++||+++++.+++++|++++ ||+|+|.++..+..+ +..++.||+ T Consensus 1 M~v~v~~~~L~~---~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~ 77 (125) T protein:vir:79 1 MGARIESNNIEQ---GLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGY 77 (125) T ss_pred CeeEeeHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEecc Confidence 998888876555 5556666777888999999999999999999998755 599999998777653 566788887 Q ss_pred eccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_015262. 77 TKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGLG 146 (147) Q Consensus 77 ~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~ 146 (147) ++. ..||+||+||||++ ||||||++||+++++++++++|.++|++-.- T Consensus 78 ~k~---~~~~a~F~E~GT~k-------------------~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:79 78 AKG---VSHRIHATEFGTMY-------------------QKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred CCC---CceEEEeccCCccC-------------------CCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 654 35999999999997 8999999999999999999999999988777 No 45 >protein:vir:9414 Length: 125 # NCBI annotation: phi PVL orf 11-like protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803392;genbank:gi:29028704;genbank:GeneID:1258141 Probab=99.91 E-value=9.8e-28 Score=168.67 Aligned_cols=121 Identities=21% Similarity=0.205 Sum_probs=103.2 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcc--hhhhcceecccccCC--CceEEEEee Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSG--KLKDGLKVSGVKKKG--GTKYVLVGI 76 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG--~l~~sI~~~~~~~~~--~~~~~~Vg~ 76 (147) |+++++++||++ .|+.|+.++++..+.|+++||+++++.+++++|++++ ||+|+|.++..+..+ +..++.||+ T Consensus 1 M~v~v~~~~L~~---~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~ 77 (125) T protein:vir:94 1 MGARIESNNIEQ---GLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGY 77 (125) T ss_pred CeeEeeHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEecc Confidence 998888876555 5556666777888999999999999999999998755 599999998777653 566788887 Q ss_pred eccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_015262. 77 TKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGLG 146 (147) Q Consensus 77 ~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~ 146 (147) ++. ..||+||+||||++ ||||||++||+++++++++++|.++|++-.- T Consensus 78 ~k~---~~~~a~F~E~GT~k-------------------~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:94 78 AKG---VSHRIHATEFGTMY-------------------QKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred CCC---CceEEEeccCCccC-------------------CCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 654 35999999999997 8999999999999999999999999988777 No 46 >protein:vir:81106 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429878;genbank:gi:156603931;genbank:GeneID:5525326 Probab=99.91 E-value=9.8e-28 Score=168.67 Aligned_cols=121 Identities=21% Similarity=0.205 Sum_probs=103.2 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcc--hhhhcceecccccCC--CceEEEEee Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSG--KLKDGLKVSGVKKKG--GTKYVLVGI 76 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG--~l~~sI~~~~~~~~~--~~~~~~Vg~ 76 (147) |+++++++||++ .|+.|+.++++..+.|+++||+++++.+++++|++++ ||+|+|.++..+..+ +..++.||+ T Consensus 1 M~v~v~~~~L~~---~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~ 77 (125) T protein:vir:81 1 MGARIESNNIEQ---GLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGY 77 (125) T ss_pred CeeEeeHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEecc Confidence 998888876555 5556666777888999999999999999999998755 599999998777653 566788887 Q ss_pred eccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_015262. 77 TKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGLG 146 (147) Q Consensus 77 ~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~ 146 (147) ++. ..||+||+||||++ ||||||++||+++++++++++|.++|++-.- T Consensus 78 ~k~---~~~~a~F~E~GT~k-------------------~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:81 78 AKG---VSHRIHATEFGTMY-------------------QKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred CCC---CceEEEeccCCccC-------------------CCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 654 35999999999997 8999999999999999999999999988777 No 47 >protein:vir:98342 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918934;genbank:gi:119443696;genbank:GeneID:4594504 Probab=99.91 E-value=9.8e-28 Score=168.67 Aligned_cols=121 Identities=21% Similarity=0.205 Sum_probs=103.2 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcc--hhhhcceecccccCC--CceEEEEee Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSG--KLKDGLKVSGVKKKG--GTKYVLVGI 76 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG--~l~~sI~~~~~~~~~--~~~~~~Vg~ 76 (147) |+++++++||++ .|+.|+.++++..+.|+++||+++++.+++++|++++ ||+|+|.++..+..+ +..++.||+ T Consensus 1 M~v~v~~~~L~~---~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~ 77 (125) T protein:vir:98 1 MGARIESNNIEQ---GLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGY 77 (125) T ss_pred CeeEeeHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEecc Confidence 998888876555 5556666777888999999999999999999998755 599999998777653 566788887 Q ss_pred eccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_015262. 77 TKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGLG 146 (147) Q Consensus 77 ~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~ 146 (147) ++. ..||+||+||||++ ||||||++||+++++++++++|.++|++-.- T Consensus 78 ~k~---~~~~a~F~E~GT~k-------------------~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:98 78 AKG---VSHRIHATEFGTMY-------------------QKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred CCC---CceEEEeccCCccC-------------------CCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 654 35999999999997 8999999999999999999999999988777 No 48 >protein:vir:4704 Length: 125 # NCBI annotation: phi PVL ORF 11 homologue # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061636;genbank:gi:9635723;genbank:GeneID:1262995 Probab=99.91 E-value=9.8e-28 Score=168.67 Aligned_cols=121 Identities=21% Similarity=0.205 Sum_probs=103.2 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcc--hhhhcceecccccCC--CceEEEEee Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSG--KLKDGLKVSGVKKKG--GTKYVLVGI 76 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG--~l~~sI~~~~~~~~~--~~~~~~Vg~ 76 (147) |+++++++||++ .|+.|+.++++..+.|+++||+++++.+++++|++++ ||+|+|.++..+..+ +..++.||+ T Consensus 1 M~v~v~~~~L~~---~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~ 77 (125) T protein:vir:47 1 MGARIESNNIEQ---GLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGY 77 (125) T ss_pred CeeEeeHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEecc Confidence 998888876555 5556666777888999999999999999999998755 599999998777653 566788887 Q ss_pred eccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_015262. 77 TKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGLG 146 (147) Q Consensus 77 ~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~ 146 (147) ++. ..||+||+||||++ ||||||++||+++++++++++|.++|++-.- T Consensus 78 ~k~---~~~~a~F~E~GT~k-------------------~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:47 78 AKG---VSHRIHATEFGTMY-------------------QKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred CCC---CceEEEeccCCccC-------------------CCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 654 35999999999997 8999999999999999999999999988777 No 49 >protein:vir:94490 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240680;genbank:gi:66396374;genbank:GeneID:5133754 Probab=99.91 E-value=6.4e-28 Score=169.65 Aligned_cols=127 Identities=12% Similarity=0.213 Sum_probs=106.7 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeeccC Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKED 80 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~ 80 (147) |+-. ++|+++|.+.|+++++++.+.+++++++++..++++++.++|++||+|++||.+.. ..+...+.|| T Consensus 1 Ma~~--~~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~---~~~~~~~~V~----- 70 (137) T protein:vir:94 1 MAKV--KYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDF---KDSGFTGVIN----- 70 (137) T ss_pred Cchh--HHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEe---ecCceEEEEe----- Confidence 7744 47999999999999999999999999999999999999999999999999997542 2233455565 Q ss_pred Ccccchhhhhhcccccccccccccc----------cccccccCCCCCCCcchhhHHHHHHHHHHHHHH Q lcl|NC_015262. 81 NSKIFYGKFLEFGASAHKIPIKKGK----------KKGRIINHPGVSPKPFLAPAYESKKDEAKNVMK 138 (147) Q Consensus 81 ~~~~~y~~~vE~GT~~~~~~~~~~~----------~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~ 138 (147) +...|++||||||.+|...+.... ..+..+.+.+|||||||+||++.+++.+.+.|. T Consensus 71 -~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:94 71 -IGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred -cCCCcccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 346899999999998877765322 234556678999999999999999999999999 No 50 >protein:vir:93738 Length: 137 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240463;genbank:gi:66396153;genbank:GeneID:5133507 Probab=99.91 E-value=6.4e-28 Score=169.65 Aligned_cols=127 Identities=12% Similarity=0.213 Sum_probs=106.7 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeeccC Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKED 80 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~ 80 (147) |+-. ++|+++|.+.|+++++++.+.+++++++++..++++++.++|++||+|++||.+.. ..+...+.|| T Consensus 1 Ma~~--~~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~---~~~~~~~~V~----- 70 (137) T protein:vir:93 1 MAKV--KYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDF---KDSGFTGVIN----- 70 (137) T ss_pred Cchh--HHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEe---ecCceEEEEe----- Confidence 7744 47999999999999999999999999999999999999999999999999997542 2233455565 Q ss_pred Ccccchhhhhhcccccccccccccc----------cccccccCCCCCCCcchhhHHHHHHHHHHHHHH Q lcl|NC_015262. 81 NSKIFYGKFLEFGASAHKIPIKKGK----------KKGRIINHPGVSPKPFLAPAYESKKDEAKNVMK 138 (147) Q Consensus 81 ~~~~~y~~~vE~GT~~~~~~~~~~~----------~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~ 138 (147) +...|++||||||.+|...+.... ..+..+.+.+|||||||+||++.+++.+.+.|. T Consensus 71 -~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:93 71 -IGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred -cCCCcccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 346899999999998877765322 234556678999999999999999999999999 No 51 >protein:vir:97427 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240753;genbank:gi:66396447;genbank:GeneID:5133783 Probab=99.91 E-value=6.4e-28 Score=169.65 Aligned_cols=127 Identities=12% Similarity=0.213 Sum_probs=106.7 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeeccC Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKED 80 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~ 80 (147) |+-. ++|+++|.+.|+++++++.+.+++++++++..++++++.++|++||+|++||.+.. ..+...+.|| T Consensus 1 Ma~~--~~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~---~~~~~~~~V~----- 70 (137) T protein:vir:97 1 MAKV--KYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDF---KDSGFTGVIN----- 70 (137) T ss_pred Cchh--HHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEe---ecCceEEEEe----- Confidence 7744 47999999999999999999999999999999999999999999999999997542 2233455565 Q ss_pred Ccccchhhhhhcccccccccccccc----------cccccccCCCCCCCcchhhHHHHHHHHHHHHHH Q lcl|NC_015262. 81 NSKIFYGKFLEFGASAHKIPIKKGK----------KKGRIINHPGVSPKPFLAPAYESKKDEAKNVMK 138 (147) Q Consensus 81 ~~~~~y~~~vE~GT~~~~~~~~~~~----------~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~ 138 (147) +...|++||||||.+|...+.... ..+..+.+.+|||||||+||++.+++.+.+.|. T Consensus 71 -~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:97 71 -IGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred -cCCCcccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 346899999999998877765322 234556678999999999999999999999999 No 52 >protein:vir:96486 Length: 112 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238496;genbank:gi:66391772;genbank:GeneID:5176908 Probab=99.91 E-value=5.4e-28 Score=170.08 Aligned_cols=110 Identities=19% Similarity=0.294 Sum_probs=96.5 Q ss_pred CceeeeehhHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeec Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESM--GKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITK 78 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l--~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~ 78 (147) |+ +|+|+|||+|+++|+++ ++++++++++++.+.+..+++.++.++|++||+|++||.++ .++..+.||+ T Consensus 1 Ma-~i~i~Gld~L~~~l~~~~~~~~v~~~v~~~~~~~~~~~~~~a~~~apvdTG~Lr~sI~~~-----~~~~~~~v~~-- 72 (112) T protein:vir:96 1 MA-TIEFEGLDEMAQSLLKNASSERRSKVLRKYGAKLKEAAVSKAQFKKGYSTGATRRSITLE-----AGSDRAVVEA-- 72 (112) T ss_pred Cc-eeeehHHHHHHHHHHhhcCHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCchhhhhceeee-----cCceEEEecC-- Confidence 77 79999999999999988 56788999999999999999999999999999999999753 2344566653 Q ss_pred cCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHH Q lcl|NC_015262. 79 EDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEIL 141 (147) Q Consensus 79 ~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l 141 (147) ..+|++|+||||++ |+|||||+|||+++++.+.+.|++.- T Consensus 73 ----~~~Ya~~vE~GTr~-------------------m~AqPF~~PA~~~~~~~~~~~l~~L~ 112 (112) T protein:vir:96 73 ----LTNYSGYLEVGTRK-------------------MEAQPFMRPALDQVVPEMVEEMAKWE 112 (112) T ss_pred ----CCCccceeccCccc-------------------cCCCCchhhhHHHHHHHHHHHHHhcC Confidence 46899999999986 99999999999999999999988744 No 53 >protein:vir:96829 Length: 135 # NCBI annotation: ORF033 # Family: family:all:180 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240161;genbank:gi:66395838;genbank:GeneID:5133170 Probab=99.91 E-value=7.9e-28 Score=169.16 Aligned_cols=127 Identities=17% Similarity=0.292 Sum_probs=106.5 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeeccC Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKED 80 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~ 80 (147) |+. +++ |+|+|.+.|+++++.+++.+++++.+++..++++|+.++|++||+|++||.+.. ..+...+.|| T Consensus 1 Ma~-~~~-Gl~~l~~~l~~~~~~~~~~~~~al~~~a~~v~~~ak~~apvdTG~Lr~SI~~~~---~~~g~~~~V~----- 70 (135) T protein:vir:96 1 MAK-VKY-GADSIVVDLEKYSKDMEKWVKKGITKTTLKIYNTAIHLMPVDTGFLRQSTTVDF---ENGGFTGVVK----- 70 (135) T ss_pred Cch-hhh-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeEEe---ecCcEEEEEe----- Confidence 884 444 999999999999999999999999999999999999999999999999997542 2333455565 Q ss_pred Ccccchhhhhhcccccccccccccc--------cccccccCCCCCCCcchhhHHHHHHHHHHHHHH Q lcl|NC_015262. 81 NSKIFYGKFLEFGASAHKIPIKKGK--------KKGRIINHPGVSPKPFLAPAYESKKDEAKNVMK 138 (147) Q Consensus 81 ~~~~~y~~~vE~GT~~~~~~~~~~~--------~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~ 138 (147) +...|+.||||||.+|...+..+. ..+..+.|++|||||||+||++.+++++.+.|. T Consensus 71 -~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~a~pfl~~A~~~~~~~~~~~i~ 135 (135) T protein:vir:96 71 -IGSNYAVYVNYGTGIYATKGSRAHKIPWTYKDPNGKWHTTYGQMPQPFWEPAIDAGRQTFEQYFS 135 (135) T ss_pred -cCCCccchhhcccccccCCCccccccccccccCCcceeecCCcCCCcchhHHHHHHHHHHHHhcC Confidence 456899999999998876654322 235567889999999999999999999988888 No 54 >protein:vir:98409 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:YP_001210363;genbank:gi:146334932;genbank:GeneID:5114801 Probab=99.91 E-value=8.3e-28 Score=169.04 Aligned_cols=108 Identities=18% Similarity=0.275 Sum_probs=93.6 Q ss_pred eeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeeccCCccc Q lcl|NC_015262. 5 ITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKEDNSKI 84 (147) Q Consensus 5 ~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~~~~~ 84 (147) |+|+|||+|++.|+++.. ...++++++++|..++++|+.++|++||+|++||.+.. ..+...+.|| +.+ T Consensus 1 i~i~Gld~l~~~l~~~~~--~~~~~~al~~~a~~i~~~ak~~apvdTG~Lr~si~~~~---~~~~~~~~V~------~~~ 69 (108) T protein:vir:98 1 MKITGIDALQKKLRKNAT--LNDVKHVVKRNTVSMNKNMQNLAPVDTGNMKRSITSEF---TDGGLTGTTI------PHT 69 (108) T ss_pred CcchhHHHHHHHHHHhhh--HHHHHHHHHHHHHHHHHHHHHhCCCCchhhHhhceeee---ecCceEEEee------cCC Confidence 999999999999998653 45688999999999999999999999999999997542 3344556665 346 Q ss_pred chhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_015262. 85 FYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILK 142 (147) Q Consensus 85 ~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~ 142 (147) +|++||||||+. |+|||||+||++.+++++.+.|.+.|+ T Consensus 70 ~Ya~~vE~GT~~-------------------m~aqPFl~pa~~~~~~~~~~~i~~~lr 108 (108) T protein:vir:98 70 DYAGYVEYGTRF-------------------QAAQPFVKPAFDVQKKIFTNDLERLTK 108 (108) T ss_pred Cccceeeccccc-------------------cCCCcchhhHHHHHHHHHHHHHHHHcC Confidence 899999999985 999999999999999999999988888 No 55 >protein:vir:102154 Length: 119 # NCBI annotation: phage protein, HK97 gp10 family # Family: family:all:10671 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699937;genbank:gi:110804042;genbank:GeneID:4206698 Probab=99.91 E-value=7.2e-28 Score=169.40 Aligned_cols=118 Identities=25% Similarity=0.345 Sum_probs=106.8 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeeccC Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKED 80 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~ 80 (147) |+ ++++.|+|+|++.|++|+...+++.++||++|+++|++++..++|++||+|+. |..+ ....| ++.||+++ T Consensus 1 Ma-~iel~G~del~~~l~~~g~~~~~ie~kAlk~g~e~I~~~~~~n~P~~tg~lkk-ik~~--~kk~g--~~~VG~~k-- 72 (119) T protein:vir:10 1 MA-SLEIEGFEEFEKFISEDMVLDESTKRKGIKAGITKIGKAIEKNSPIKSGRLSK-VKIR--VKNTG--LATEGTAS-- 72 (119) T ss_pred Cc-eeehhhHHHHHHHHHhhhhhhHHHHHHHHHHHhHHHHHHHhhcCCcccCCcce-eeee--eecCc--eeEeccCC-- Confidence 76 89999999999999999999999999999999999999999999999999996 4322 22223 78999865 Q ss_pred CcccchhhhhhcccccccccccccccccccccCCCCCCC-cchhhHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_015262. 81 NSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPK-PFLAPAYESKKDEAKNVMKEILKRGLG 146 (147) Q Consensus 81 ~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~-PFl~pA~~~~~~~~~~~i~~~l~~~i~ 146 (147) +.+||..|+||||++ |||| |||.||++++++++++.|.++|.+.+| T Consensus 73 -s~~fy~kF~EFGTSk-------------------m~a~~pF~~~a~~~~~~eA~~~~~~el~~~~r 119 (119) T protein:vir:10 73 -SSEFYDIFQNFGTSE-------------------QKAHVGYFDRAVDETTNEAVEEVAEIIFRKMR 119 (119) T ss_pred -cchhhhhhccccccc-------------------cCCCCCccccccccChHHHHHHHHHHHHHhcC Confidence 678999999999997 8999 999999999999999999999999999 No 56 >protein:vir:4906 Length: 114 # NCBI annotation: gp114 # Family: family:all:180 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056684;genbank:gi:9635019;genbank:GeneID:1262668 Probab=99.90 E-value=1.3e-27 Score=168.03 Aligned_cols=112 Identities=16% Similarity=0.226 Sum_probs=95.9 Q ss_pred CceeeeehhHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeec Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESM--GKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITK 78 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l--~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~ 78 (147) |. +|+|+|||+|++.|+++ +.++++++++++.+.++.+++.|+.++|++||+|++||.++.. ++. +.||+ T Consensus 1 Ma-~i~~~Gld~l~~~L~~~~~~~~v~~~~~~~~~~~~~~~~~~a~~~~p~~TG~Lr~sI~~~~~---~~~--~~V~~-- 72 (114) T protein:vir:49 1 MA-TIEFEGLDEMAQSLLKNASPEKRSKVLRKYGSKLKEAAVNRAQFNKGYSTGATRRSITLQVE---SDK--ATVEA-- 72 (114) T ss_pred Ce-eeeeehHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcccCCCCCchhhhhceeeeec---CCe--eEecC-- Confidence 77 79999999999999988 4567788888888888888888888899999999999976532 222 34553 Q ss_pred cCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_015262. 79 EDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKR 143 (147) Q Consensus 79 ~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~ 143 (147) ..+||+|+||||++ |+|||||+||++.+++++.+.|.+.++- T Consensus 73 ----~~~Ya~~vEfGT~k-------------------m~a~Pfl~PA~~~~~~~~~~~l~~l~k~ 114 (114) T protein:vir:49 73 ----LTSYSGYLEVGTRK-------------------MEAQPFMKPALDEVAPKMVEELAKWDET 114 (114) T ss_pred ----CCCccceecccccc-------------------cCCCCchhhhHHHHHHHHHHHHHHHhcC Confidence 46899999999986 8999999999999999999999998888 No 57 >protein:vir:2740 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695113;genbank:gi:23455882;genbank:GeneID:955595 Probab=99.90 E-value=1.3e-27 Score=168.03 Aligned_cols=112 Identities=16% Similarity=0.226 Sum_probs=95.9 Q ss_pred CceeeeehhHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeec Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESM--GKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITK 78 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l--~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~ 78 (147) |. +|+|+|||+|++.|+++ +.++++++++++.+.++.+++.|+.++|++||+|++||.++.. ++. +.||+ T Consensus 1 Ma-~i~~~Gld~l~~~L~~~~~~~~v~~~~~~~~~~~~~~~~~~a~~~~p~~TG~Lr~sI~~~~~---~~~--~~V~~-- 72 (114) T protein:vir:27 1 MA-TIEFEGLDEMAQSLLKNASPEKRSKVLRKYGSKLKEAAVNRAQFNKGYSTGATRRSITLQVE---SDK--ATVEA-- 72 (114) T ss_pred Ce-eeeeehHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcccCCCCCchhhhhceeeeec---CCe--eEecC-- Confidence 77 79999999999999988 4567788888888888888888888899999999999976532 222 34553 Q ss_pred cCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_015262. 79 EDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKR 143 (147) Q Consensus 79 ~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~ 143 (147) ..+||+|+||||++ |+|||||+||++.+++++.+.|.+.++- T Consensus 73 ----~~~Ya~~vEfGT~k-------------------m~a~Pfl~PA~~~~~~~~~~~l~~l~k~ 114 (114) T protein:vir:27 73 ----LTSYSGYLEVGTRK-------------------MEAQPFMKPALDEVAPKMVEELAKWDET 114 (114) T ss_pred ----CCCccceecccccc-------------------cCCCCchhhhHHHHHHHHHHHHHHHhcC Confidence 46899999999986 8999999999999999999999998888 No 58 >protein:vir:8669 Length: 142 # NCBI annotation: gp27 # Family: family:all:1084 # MgeID: mge:156 # MgeName: Rosebush # Cross-refs: genbank:acc:NP_817788;genbank:gi:29566220;genbank:GeneID:1259476 Probab=99.90 E-value=3.3e-27 Score=165.75 Aligned_cols=131 Identities=21% Similarity=0.257 Sum_probs=103.9 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeeccC Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKED 80 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~ 80 (147) |+.++.++||+. .|+.+...+..++++++.+.+..++.+||+++|++||+|++||...... ......+.+++ T Consensus 2 ~~~~~~~~gl~~---~l~~~~~~~~~~~~~~i~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~-~~~~~~~~~~v---- 73 (142) T protein:vir:86 2 VQVSVRYEGFDY---NPVGAAAQVGPILRRTHSSLTRQIANETRARVPVLTGHLGRSVREDPQV-MVTPFHVSGGV---- 73 (142) T ss_pred ceeEEEeeecch---hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeeeecc-ccccceEEEEe---- Confidence 889999999876 6666667788899999999999999999999999999999999754322 22222233332 Q ss_pred Ccccchhhhhhccccccccccccccc----------ccccccCCCCCCCcchhhHHHHHHHHHHHHHHH Q lcl|NC_015262. 81 NSKIFYGKFLEFGASAHKIPIKKGKK----------KGRIINHPGVSPKPFLAPAYESKKDEAKNVMKE 139 (147) Q Consensus 81 ~~~~~y~~~vE~GT~~~~~~~~~~~~----------~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~ 139 (147) .+.+.|+.||||||.+|.+.|+..+. +...+.|||++|||||+||++.+.++......+ T Consensus 74 ~~~a~YA~~ve~GT~ph~i~pk~~~al~f~~~g~~~~~k~v~hpG~~a~Pfl~~A~~~~~~~~~~~~~r 142 (142) T protein:vir:86 74 TAHAKYAAAVHEGTRPHVIRAKHAQALHFWWRGREVFVRQVNHPGTRARPYLRNAGEAVVRRDRRIRVR 142 (142) T ss_pred ccCccccceeccCCccceeccccCceeeEecCCceeeeeeeecCCCCCCchhHHHHHHHHhhhhhhccC Confidence 34678999999999999888876542 446689999999999999999998876655555 No 59 >protein:vir:99101 Length: 142 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1608 # MgeName: Qyrzula # Cross-refs: genbank:acc:YP_655705;genbank:gi:109521783;genbank:GeneID:4157823 Probab=99.90 E-value=3.3e-27 Score=165.75 Aligned_cols=131 Identities=21% Similarity=0.257 Sum_probs=103.9 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeeccC Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKED 80 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~ 80 (147) |+.++.++||+. .|+.+...+..++++++.+.+..++.+||+++|++||+|++||...... ......+.+++ T Consensus 2 ~~~~~~~~gl~~---~l~~~~~~~~~~~~~~i~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~-~~~~~~~~~~v---- 73 (142) T protein:vir:99 2 VQVSVRYEGFDY---NPVGAAAQVGPILRRTHSSLTRQIANETRARVPVLTGHLGRSVREDPQV-MVTPFHVSGGV---- 73 (142) T ss_pred ceeEEEeeecch---hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeeeecc-ccccceEEEEe---- Confidence 889999999876 6666667788899999999999999999999999999999999754322 22222233332 Q ss_pred Ccccchhhhhhccccccccccccccc----------ccccccCCCCCCCcchhhHHHHHHHHHHHHHHH Q lcl|NC_015262. 81 NSKIFYGKFLEFGASAHKIPIKKGKK----------KGRIINHPGVSPKPFLAPAYESKKDEAKNVMKE 139 (147) Q Consensus 81 ~~~~~y~~~vE~GT~~~~~~~~~~~~----------~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~ 139 (147) .+.+.|+.||||||.+|.+.|+..+. +...+.|||++|||||+||++.+.++......+ T Consensus 74 ~~~a~YA~~ve~GT~ph~i~pk~~~al~f~~~g~~~~~k~v~hpG~~a~Pfl~~A~~~~~~~~~~~~~r 142 (142) T protein:vir:99 74 TAHAKYAAAVHEGTRPHVIRAKHAQALHFWWRGREVFVRQVNHPGTRARPYLRNAGEAVVRRDRRIRVR 142 (142) T ss_pred ccCccccceeccCCccceeccccCceeeEecCCceeeeeeeecCCCCCCchhHHHHHHHHhhhhhhccC Confidence 34678999999999999888876542 446689999999999999999998876655555 No 60 >protein:vir:105467 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529877;genbank:gi:90592617;genbank:GeneID:3974531 Probab=99.85 E-value=3.3e-24 Score=149.28 Aligned_cols=139 Identities=13% Similarity=0.081 Sum_probs=117.6 Q ss_pred Cc-eeeeehhHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeee Q lcl|NC_015262. 1 MS-VEITTEGFDAVLSKIESMGK--SGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGIT 77 (147) Q Consensus 1 M~-~~~~i~Gl~el~~~l~~l~~--~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~ 77 (147) || .+|+++||++|++.|+++.. .+.+.+++++++.+..++.++++++|++||+|++||....+...+++.++.|+ T Consensus 1 Ms~~~id~~gl~~~~~~l~~~~~~~~~~~~~~~~l~~~~~~~~~~vk~~tPVdTG~Lr~S~~~~~~~~~~~~~~~~V~-- 78 (144) T protein:vir:10 1 MSLGHVDDAQFQQFASRVRQKIDSGYVKQELGKSSRRIGTQSLRILEANTPVKQGNLRRSWTAEGPTYGCGGWTIKLI-- 78 (144) T ss_pred CCCCCccHHHHHHHHHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHhCCCCcchhccceeecceeeecCeeEEEEe-- Confidence 88 59999999999999998864 46788999999999999999999999999999999998888777777777775 Q ss_pred ccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 78 KEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGLGL 147 (147) Q Consensus 78 ~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~~ 147 (147) ++.+|||||||||+....+ .-..........+++++|||.+|.+..+..+.+.|.+.|.+-+|| T Consensus 79 ----n~~~YA~~VE~Ghr~~~G~--~v~~~~~~~~~g~V~G~~~~~~a~~~~~~~~~~~l~k~l~~l~d~ 142 (144) T protein:vir:10 79 ----NNAEYASYVESGHRQTPGR--YVPVLKKRLVRDWVPGQFYMKKSIPQIQRQLPQLVTEGLWGLKDL 142 (144) T ss_pred ----cCCCcccccccceeecCCc--ccccCCCccccceecCccchHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 4679999999999764322 111122233446789999999999999999999999999999999 No 61 >protein:vir:81147 Length: 126 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285816;genbank:gi:148747737;genbank:GeneID:5247190 Probab=99.84 E-value=6.7e-24 Score=147.63 Aligned_cols=125 Identities=22% Similarity=0.329 Sum_probs=105.7 Q ss_pred CceeeeehhH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeecc Q lcl|NC_015262. 1 MSVEITTEGF-DAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKE 79 (147) Q Consensus 1 M~~~~~i~Gl-~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~ 79 (147) |+ +|+|.+| ++|.+.|+++.+++.+.++++++++|..+++++|+++|++||.|++||+++.....++..++ .++ T Consensus 1 Ma-~i~id~la~~I~~~L~~y~~~v~~~v~~~v~~~a~~~~~~ik~~aP~rTG~y~ksw~vk~~~~~g~~~~v--v~~-- 75 (126) T protein:vir:81 1 MA-NITIDRLADELLQAVKEYTDDVAEGVRKKVDETARKVLKEAQALAPKRTGEYARTFTITKEDGYGTTKRI--IWN-- 75 (126) T ss_pred Cc-ccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcccchhhccccccccccCCcceEE--Eec-- Confidence 88 6999999 45888899999999999999999999999999999999999999999988877655544332 232 Q ss_pred CCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015262. 80 DNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGL 145 (147) Q Consensus 80 ~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i 145 (147) +....++|++||||.+. ++..+||+|||+||++...+++.+.|++.|+.+= T Consensus 76 -~~~~~l~HLLEfGha~r--------------~gGrV~a~Phi~Pa~e~~~~~~~~~i~~~l~~gg 126 (126) T protein:vir:81 76 -KKHYRRVHLLEFGHAKV--------------NGGRVKEYPHLRPAYDKHGARLPDELKRVIENGG 126 (126) T ss_pred -cCCCCceeeeecceecC--------------CCCccCCCcchHHHHHHHHHHHHHHHHHHhhcCC Confidence 34567799999999752 3345999999999999999999999999888766 No 62 >protein:vir:106041 Length: 137 # NCBI annotation: gp23 # Family: family:all:1084 # MgeID: mge:1505 # MgeName: Cooper # Cross-refs: genbank:acc:YP_654920;genbank:gi:109392376;genbank:GeneID:4157069 Probab=99.83 E-value=1.5e-24 Score=151.18 Aligned_cols=123 Identities=19% Similarity=0.227 Sum_probs=93.7 Q ss_pred Cceeeeeh-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeecc Q lcl|NC_015262. 1 MSVEITTE-GFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKE 79 (147) Q Consensus 1 M~~~~~i~-Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~ 79 (147) |+++++|+ ..++|. +.+...+++++.+.+..++.+++.++|++||+|++||........+...++.|| T Consensus 1 m~~s~~i~i~~~~l~-------~~v~~~~k~~l~~~a~~i~~~ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~v~---- 69 (137) T protein:vir:10 1 MPVTARIHINEPELE-------RQTGAIFRGKHRSITRRIATQARADVPVRTGNLGRGIQEMPQTYRPFHVGGGVE---- 69 (137) T ss_pred CCeeEEEeeCHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhhcCceeeeeccccceEEEEEe---- Confidence 88888887 344433 345567888899999999999999999999999999987665544444555665 Q ss_pred CCcccchhhhhhccccccccccccccc----------ccccccCCCCCCCcchhhHHHHH---HHHHHHH Q lcl|NC_015262. 80 DNSKIFYGKFLEFGASAHKIPIKKGKK----------KGRIINHPGVSPKPFLAPAYESK---KDEAKNV 136 (147) Q Consensus 80 ~~~~~~y~~~vE~GT~~~~~~~~~~~~----------~~~~~~~~~~~a~PFl~pA~~~~---~~~~~~~ 136 (147) +.+.|+.||||||++|.+.++.++. +.+.+.|||++|||||+||++.. +++|.-. T Consensus 70 --~~~~YA~~ve~GT~ph~I~pk~~k~l~f~~~G~~v~~k~v~hpG~~a~Pfl~~A~~~~~~~~~ri~~~ 137 (137) T protein:vir:10 70 --DNVDYAAPVHEGSRPHRITARHANALHFFWHGREVFRKSVWHPGVRPRPFLRNAARRVVAADPDIHMT 137 (137) T ss_pred --cCCCceeeeeecCCCceeecccCceeeeeeCCceEEeeeeecCCCCCCchHHHHHHHHhhccccccCC Confidence 3568999999999999887765432 34568899999999999999974 3333222 No 63 >protein:vir:95062 Length: 116 # NCBI annotation: ORF044 # Family: family:all:180 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240827;genbank:gi:66394711;genbank:GeneID:5133856 Probab=99.81 E-value=1.3e-23 Score=146.09 Aligned_cols=106 Identities=12% Similarity=0.237 Sum_probs=88.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeeccCCcccchhhhhhcccccccccccc Q lcl|NC_015262. 24 GDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKEDNSKIFYGKFLEFGASAHKIPIKK 103 (147) Q Consensus 24 ~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~ 103 (147) +++++++++.+++..|+.+|++++|++||+|++||.+.. ..++..+.|+ +...|+.|+||||..|...++. T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~apv~TG~Lr~SI~~~~---~~~~~~~~V~------~~~~Ya~yvE~GTg~~~~~~~~ 71 (116) T protein:vir:95 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDF---KDGGFTGVIN------IGSEYAIYVNYGTGIYATGAGG 71 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHhhCCccccccccceeEEe---ecCcEEEEEe------cCCCccceeecCccccccCCCc Confidence 888999999999999999999999999999999997543 2233455565 3468999999999999877654 Q ss_pred cc----------cccccccCCCCCCCcchhhHHHHHHHHHHHHHH Q lcl|NC_015262. 104 GK----------KKGRIINHPGVSPKPFLAPAYESKKDEAKNVMK 138 (147) Q Consensus 104 ~~----------~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~ 138 (147) .+ ..+..+.|++|+|||||+||++.+++.+.+.|. T Consensus 72 ~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~is 116 (116) T protein:vir:95 72 SRAKNIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred cccccccceeecCccceeeCCCCCCCcchHHHHHHHHHHHHHhhC Confidence 32 134567799999999999999999999999999 No 64 >protein:vir:97327 Length: 116 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240615;genbank:gi:66396305;genbank:GeneID:5133683 Probab=99.81 E-value=1.9e-23 Score=145.12 Aligned_cols=106 Identities=12% Similarity=0.237 Sum_probs=88.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeeccCCcccchhhhhhcccccccccccc Q lcl|NC_015262. 24 GDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKEDNSKIFYGKFLEFGASAHKIPIKK 103 (147) Q Consensus 24 ~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~ 103 (147) +++++++++.+++..++.+|++++|++||+|++||.+.. ..+...+.|+ +...|+.||||||..|...+.. T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~aPv~TG~Lr~SI~~~~---~~~~~~~~V~------~~~~YA~yvE~GTg~~~~~~~~ 71 (116) T protein:vir:97 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDF---KDGGFTGVIN------IGSEYAIYVNYGTGIYATGAGG 71 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcCcccccccceEEe---ecCcEEEEEe------cCCCcccccccCCcccccCCCc Confidence 888999999999999999999999999999999997543 2233455565 3468999999999999877764 Q ss_pred cc----------cccccccCCCCCCCcchhhHHHHHHHHHHHHHH Q lcl|NC_015262. 104 GK----------KKGRIINHPGVSPKPFLAPAYESKKDEAKNVMK 138 (147) Q Consensus 104 ~~----------~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~ 138 (147) .. ..+..+.|++|+|||||+||++.+++.+.+.|. T Consensus 72 ~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 116 (116) T protein:vir:97 72 SRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred ccccccceeeecCCceeeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 22 234566789999999999999999999999998 No 65 >protein:vir:1243 Length: 116 # NCBI annotation: similar to phage Spp1 gp16.1 # Family: family:all:180 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510942;genbank:gi:17426276;genbank:GeneID:927389 Probab=99.81 E-value=1.9e-23 Score=145.12 Aligned_cols=106 Identities=12% Similarity=0.237 Sum_probs=88.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeeccCCcccchhhhhhcccccccccccc Q lcl|NC_015262. 24 GDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKEDNSKIFYGKFLEFGASAHKIPIKK 103 (147) Q Consensus 24 ~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~ 103 (147) +++++++++.+++..++.+|++++|++||+|++||.+.. ..+...+.|+ +...|+.||||||..|...+.. T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~aPv~TG~Lr~SI~~~~---~~~~~~~~V~------~~~~YA~yvE~GTg~~~~~~~~ 71 (116) T protein:vir:12 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDF---KDGGFTGVIN------IGSEYAIYVNYGTGIYATGAGG 71 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcCcccccccceEEe---ecCcEEEEEe------cCCCcccccccCCcccccCCCc Confidence 888999999999999999999999999999999997543 2233455565 3468999999999999877764 Q ss_pred cc----------cccccccCCCCCCCcchhhHHHHHHHHHHHHHH Q lcl|NC_015262. 104 GK----------KKGRIINHPGVSPKPFLAPAYESKKDEAKNVMK 138 (147) Q Consensus 104 ~~----------~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~ 138 (147) .. ..+..+.|++|+|||||+||++.+++.+.+.|. T Consensus 72 ~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 116 (116) T protein:vir:12 72 SRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred ccccccceeeecCCceeeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 22 234566789999999999999999999999998 No 66 >protein:vir:102441 Length: 137 # NCBI annotation: gp26 # Family: family:all:1084 # MgeID: mge:1618 # MgeName: Pipefish # Cross-refs: genbank:acc:YP_655303;genbank:gi:109521866;genbank:GeneID:4157756 Probab=99.81 E-value=2.2e-23 Score=144.79 Aligned_cols=125 Identities=18% Similarity=0.194 Sum_probs=94.6 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccC-CCceEEEEeeecc Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKK-GGTKYVLVGITKE 79 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~-~~~~~~~Vg~~~~ 79 (147) |.+++.+. .....+.+.+..+++++++..+..++.+||.++|++||+|++||........ .+..++.|| T Consensus 1 ~~~~~~~~------~~~~~~~~~~~~v~r~~l~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~~V~---- 70 (137) T protein:vir:10 1 MTVTARYE------RNPVGEARQFQVIARRRLSRITRGTANQARADVPVKTGNLGRSIREDPIVVAGPLRLDSGVT---- 70 (137) T ss_pred CeeEEEec------cCchhHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccchhhhcCceeeeeeccccceEEEEec---- Confidence 77776665 1222234456667888999999999999999999999999999986533221 122233343 Q ss_pred CCcccchhhhhhcccccccccccccc-----------cccccccCCCCCCCcchhhHHHHHHHHHHHHH Q lcl|NC_015262. 80 DNSKIFYGKFLEFGASAHKIPIKKGK-----------KKGRIINHPGVSPKPFLAPAYESKKDEAKNVM 137 (147) Q Consensus 80 ~~~~~~y~~~vE~GT~~~~~~~~~~~-----------~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i 137 (147) +.+.|+.||||||++|.+.|+.++ .+++.++|||++|+|||+||++.++++....- T Consensus 71 --~~~~YA~~ve~GT~ph~I~Pk~~k~~l~~~~~g~~vf~k~V~hPG~~a~PfL~~A~~~~~~~~~~~~ 137 (137) T protein:vir:10 71 --AHADYARYVHDGTRAHVIRPRRPGGVLRFTVGGRVVYARRVNHPGTRARPFLRNAAERVVARETATS 137 (137) T ss_pred --CCCccceeeecCCCCceeeccccceeeeEeeCCeeEecceeecCCCCCCchHHHHHHHhhhhhcccC Confidence 457999999999999988886533 24577889999999999999999999886554 No 67 >protein:vir:107545 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1481 # MgeName: PG1 # Cross-refs: genbank:acc:NP_943803;genbank:gi:38638428;genbank:GeneID:2657225 Probab=99.79 E-value=5.4e-23 Score=142.66 Aligned_cols=127 Identities=14% Similarity=0.144 Sum_probs=93.9 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeeccC Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKED 80 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~ 80 (147) |. .|..+. .|.-....+.+.+...++++++..+..++.+||.++|++||+|++||.............+.|+ T Consensus 1 ~~-~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~ak~~aPvdtG~Lr~SI~~~~~~~~~~~~~~~v~----- 72 (140) T protein:vir:10 1 MA-TIRARA--RIEIDEAALERESGEHLRAFHRSLTRRIANQSRVAVPVRTGNLGRTIGELPQVYTPFRVRGGVE----- 72 (140) T ss_pred Ce-eeeeee--eeeeCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccchhhhccceeeeeeCCCceEEEEec----- Confidence 54 222221 2333444555566778888899999999999999999999999999986555444444444444 Q ss_pred Ccccchhhhhhccccccccccccccc----------ccccccCCCCCCCcchhhHHHHH---HHHHHHH Q lcl|NC_015262. 81 NSKIFYGKFLEFGASAHKIPIKKGKK----------KGRIINHPGVSPKPFLAPAYESK---KDEAKNV 136 (147) Q Consensus 81 ~~~~~y~~~vE~GT~~~~~~~~~~~~----------~~~~~~~~~~~a~PFl~pA~~~~---~~~~~~~ 136 (147) +.+.|+.||||||++|.+.|+..+. +.+.++|||++|||||+||++.. +++|... T Consensus 73 -~~a~YA~~Ve~GT~ph~I~pk~~k~L~~~~~G~~~~~k~V~hpG~~a~Pfl~~A~~~~~~~~~~i~~~ 140 (140) T protein:vir:10 73 -ATADYAAPVHEGSRPHAIRARNAQYLHFWWHGREMFRKSVWHPGTRARPFMRNSAQRVVTNDPRVRMT 140 (140) T ss_pred -CCccchhhhccCCCCceeecCCCccceeecCCCEEEeeeeecCCCCCChhHHHHHHHHhhhhhhccCC Confidence 4578999999999999888775543 45678999999999999999984 5555544 No 68 >protein:vir:97982 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1482 # MgeName: Orion # Cross-refs: genbank:acc:YP_655121;genbank:gi:109391871;genbank:GeneID:4157345 Probab=99.79 E-value=5.4e-23 Score=142.66 Aligned_cols=127 Identities=14% Similarity=0.144 Sum_probs=93.9 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeeccC Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKED 80 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~ 80 (147) |. .|..+. .|.-....+.+.+...++++++..+..++.+||.++|++||+|++||.............+.|+ T Consensus 1 ~~-~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~ak~~aPvdtG~Lr~SI~~~~~~~~~~~~~~~v~----- 72 (140) T protein:vir:97 1 MA-TIRARA--RIEIDEAALERESGEHLRAFHRSLTRRIANQSRVAVPVRTGNLGRTIGELPQVYTPFRVRGGVE----- 72 (140) T ss_pred Ce-eeeeee--eeeeCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccchhhhccceeeeeeCCCceEEEEec----- Confidence 54 222221 2333444555566778888899999999999999999999999999986555444444444444 Q ss_pred Ccccchhhhhhccccccccccccccc----------ccccccCCCCCCCcchhhHHHHH---HHHHHHH Q lcl|NC_015262. 81 NSKIFYGKFLEFGASAHKIPIKKGKK----------KGRIINHPGVSPKPFLAPAYESK---KDEAKNV 136 (147) Q Consensus 81 ~~~~~y~~~vE~GT~~~~~~~~~~~~----------~~~~~~~~~~~a~PFl~pA~~~~---~~~~~~~ 136 (147) +.+.|+.||||||++|.+.|+..+. +.+.++|||++|||||+||++.. +++|... T Consensus 73 -~~a~YA~~Ve~GT~ph~I~pk~~k~L~~~~~G~~~~~k~V~hpG~~a~Pfl~~A~~~~~~~~~~i~~~ 140 (140) T protein:vir:97 73 -ATADYAAPVHEGSRPHAIRARNAQYLHFWWHGREMFRKSVWHPGTRARPFMRNSAQRVVTNDPRVRMT 140 (140) T ss_pred -CCccchhhhccCCCCceeecCCCccceeecCCCEEEeeeeecCCCCCChhHHHHHHHHhhhhhhccCC Confidence 4578999999999999888775543 45678999999999999999984 5555544 No 69 >protein:vir:4956 Length: 153 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049932;genbank:gi:9632903;genbank:GeneID:1262079 Probab=99.77 E-value=4.6e-22 Score=137.56 Aligned_cols=124 Identities=18% Similarity=0.249 Sum_probs=102.8 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC---------cchhhhcceecccccCC-Cce Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKR---------SGKLKDGLKVSGVKKKG-GTK 70 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~---------tG~l~~sI~~~~~~~~~-~~~ 70 (147) |. +|+ .||++|+++|++|...+.+..++|+++||+++++.++..+|+. .+||+|+|.++.....+ ... T Consensus 1 M~-~~~-~glee~~~~lekL~~~~~~~~~katkAGA~v~~e~L~~~tp~~h~~~~kt~~~~HlaD~I~~s~~~idG~~dG 78 (153) T protein:vir:49 1 MT-GLD-EALEGWLKTVASIGDLTPAEQAKITTAGAKVFKEELAEVTREKHYSKKKDLKYGHMADGLAVQSTNADGRKNG 78 (153) T ss_pred Cc-cHH-HHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCCCCCCcccccceeccccccccccc Confidence 76 577 8999999999999998889999999999999999999999863 36999999987543332 245 Q ss_pred EEEEeeeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHH--HHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 71 YVLVGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESK--KDEAKNVMKEILKRGLGL 147 (147) Q Consensus 71 ~~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~--~~~~~~~i~~~l~~~i~~ 147 (147) +..|||... ..+||+||+|+||++ |||+||+.++.+++ ++++++++.+++++.|+= T Consensus 79 ~s~VG~~~~--~~a~~a~f~n~GT~k-------------------m~~~hFie~tr~e~~~k~~vl~A~~~~~~~il~~ 136 (153) T protein:vir:49 79 VSTVGWKNN--YHAQNARRLNDGTKK-------------------YRADHFITNVQNDSTVKNKVLLAEKEEYEKLIRR 136 (153) T ss_pred eeeecccCC--ccceeeeecccCccc-------------------CCCChhhHHHHHHhhHHHHHHHHHHHHHHHHHHh Confidence 779999753 357999999999996 99999999999986 678998777777666654 No 70 >protein:vir:78077 Length: 141 # NCBI annotation: gp9 # Family: family:all:180 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468793;genbank:gi:157325374;genbank:GeneID:5601839 Probab=99.77 E-value=1.3e-21 Score=135.14 Aligned_cols=133 Identities=16% Similarity=0.190 Sum_probs=97.3 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeecc Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNE-AVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKE 79 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~-al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~ 79 (147) |. +|+|+. ...+.++.+.+.+.+.++. ++..++..++..|+.++|++||+|++||.... ... +..+.||. T Consensus 1 ~~-~~~f~~--~~~~~~~~~~k~~~~~~~~~a~~~~~~~ie~~ak~~~pvdtG~L~~SI~~~v-~~~--g~~~~V~~--- 71 (141) T protein:vir:78 1 MN-EFEFDS--NIPKARKLIEKKVLQALEDIGEHMTTELAEGGHGVTSNNDTGEYAQKSGYKV-RKS--SKEVIVGN--- 71 (141) T ss_pred Cc-chhHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccchhhcceeeee-ecC--CcEEEEec--- Confidence 65 566653 3444445555555555554 57777888999999999999999999997543 222 23445653 Q ss_pred CCcccchhhhhhcccccccccccccc-------cccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_015262. 80 DNSKIFYGKFLEFGASAHKIPIKKGK-------KKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGLG 146 (147) Q Consensus 80 ~~~~~~y~~~vE~GT~~~~~~~~~~~-------~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~ 146 (147) .+.||.||||||..+...+..++ ..+..+.+.||||||||+||++.+++++.+.|.++|+ +|| T Consensus 72 ---~~~YA~yVE~GTG~~~~~~~grk~~w~y~~~~g~~~~t~G~~aqpFl~~A~~~~~~~i~~~i~~~~~-~l~ 141 (141) T protein:vir:78 72 ---SSDYAIYYEFGTGEKSERGGGKAGGWFYMDKKGHWHFTRGSQASKRMRYTFRDEQDKVRVFTERALR-GIN 141 (141) T ss_pred ---CCCccceeecCCcccccCCCCCcCcceeecCCCeeEeccCCCCchhhhhhHHhhHHHHHHHHHHHhh-ccC Confidence 46899999999988776654333 2345677889999999999999999999998888765 566 No 71 >protein:vir:100887 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358767;genbank:gi:77999993;genbank:GeneID:3726158 Probab=99.77 E-value=7.6e-22 Score=136.36 Aligned_cols=121 Identities=17% Similarity=0.250 Sum_probs=104.6 Q ss_pred eeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC----------cchhhhcceecccccCCC-ceE Q lcl|NC_015262. 3 VEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKR----------SGKLKDGLKVSGVKKKGG-TKY 71 (147) Q Consensus 3 ~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~----------tG~l~~sI~~~~~~~~~~-~~~ 71 (147) ++|+ .||++|+++|+.|.....+.-++++++||+++++.++.++|+. .+||+|+|.++..+..+. ... T Consensus 1 v~~~-~~lee~l~~i~kl~~~~~~~~~ki~kaGA~v~~e~L~~~tp~~~~~~~~~~~~~~HlaD~I~~s~~~~dg~~~g~ 79 (139) T protein:vir:10 1 MDMD-EALGQWLKQVSKAAELSISDQEKITKAGADVYAKKLAETTKEKHPNTKGDGGKYGHLSEDIRSAAGDIDGDHNGS 79 (139) T ss_pred CCHH-HHHHHHHHHHHHhhccCHHHHHHHHHHHHHHHHHHHHHhcccccCcCCCCCCCCcchhhcceecCccccccccee Confidence 3333 6999999999999877777788999999999999999999962 369999999987655443 445 Q ss_pred EEEeeeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 72 VLVGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGLGL 147 (147) Q Consensus 72 ~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~~ 147 (147) ..|||.+ .+|++||+||||.+ |||+||+.++.+++++++++++.+++++.|+= T Consensus 80 ~~VG~~k----~~~~A~f~n~GT~k-------------------~~~~hFie~t~~e~~~evl~a~~~~~k~~l~~ 132 (139) T protein:vir:10 80 STVGFHN----KAHIARFLNDGTKY-------------------IRADHFVDNARDDAKDAVFAAEAEKYQAMIAK 132 (139) T ss_pred eeeCCCC----CcceEeecccCccc-------------------cCCCchHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 6789864 36899999999986 99999999999999999999999999999998 No 72 >protein:vir:79034 Length: 141 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110729;genbank:gi:134287346;genbank:GeneID:4955208 Probab=99.76 E-value=2.3e-21 Score=133.70 Aligned_cols=128 Identities=20% Similarity=0.273 Sum_probs=103.9 Q ss_pred Cce--eeeehhHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecc------cccCCCceE Q lcl|NC_015262. 1 MSV--EITTEGFDAVLSKIESMGK-SGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSG------VKKKGGTKY 71 (147) Q Consensus 1 M~~--~~~i~Gl~el~~~l~~l~~-~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~------~~~~~~~~~ 71 (147) |+- +|+++||++|.+.|+++.+ ++.+.+++++++.|..+..+++.++|++||+|++||.... +...++... T Consensus 1 M~~~~~~d~~gl~~~~~~l~~~~~~~~~~~~~~~~~~~a~~l~~~vk~~tPVdTG~Lr~sw~~~~~~~~~~~~~~g~~~~ 80 (141) T protein:vir:79 1 MARWGSVDFREFKRVCKKMEKLTKIDLDKFCKDAARELAARLLGKVIRRTPVDTGFLRQGWNGVAYARSLPVYKQGNNYI 80 (141) T ss_pred CCCCccCcHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhhcccccccccccccceeecCCeeE Confidence 884 8999999999999998755 7889999999999999999999999999999999996542 233444455 Q ss_pred EEEeeeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 72 VLVGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGLGL 147 (147) Q Consensus 72 ~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~~ 147 (147) +.|+ ++.+||||||+||+... +.+++|+++||..|.+..+..+.+.+.+.|.+-|+= T Consensus 81 v~v~------n~~~YA~~VE~Ghr~~~-------------~~gfV~G~fml~~s~~~~~~~~~~~~~~~l~~~l~~ 137 (141) T protein:vir:79 81 IEVV------NPTEYASYVNFGHRTKD-------------GKGWVKGQHFLTISEMELQSQVDKIIEKKLLILLKG 137 (141) T ss_pred EEEe------cCCcchhhhhcceeecC-------------CcceeCCchhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 5554 45799999999997532 225788888898888888888887777777776665 No 73 >protein:vir:5000 Length: 141 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049974;genbank:gi:9632946;genbank:GeneID:1262109 Probab=99.73 E-value=4.3e-21 Score=132.25 Aligned_cols=123 Identities=19% Similarity=0.231 Sum_probs=106.2 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC---------CcchhhhcceecccccCCC--c Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSK---------RSGKLKDGLKVSGVKKKGG--T 69 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~---------~tG~l~~sI~~~~~~~~~~--~ 69 (147) |. +|+ .||++|+++|++|...+.+.-.+|+++||+++++.++..+|+ ..+||+|||.++... .+| . T Consensus 1 M~-~~~-~gl~e~~~~lekl~~~~~~~~~katkAGA~v~~~~L~~~tp~~hy~~~~~~~~~HlaD~I~~~~~~-~DG~~d 77 (141) T protein:vir:50 1 MV-GLA-EALDEWLKTVASIGNLTPAEQVEITTAGAKVFKKELEEVTREKHYSRKKNPKFGHMADGLAIQSTN-ADGRKN 77 (141) T ss_pred Cc-cHH-HHHHHHHHHHHHhcCCCHHHHHHHHHHHHHHHHHHHHHhcccCCCCCCCCCCCCccccceeeccCc-cccccC Confidence 76 687 999999999999998888889999999999999999999985 357999999987754 333 3 Q ss_pred eEEEEeeeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHH--HHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 70 KYVLVGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESK--KDEAKNVMKEILKRGLGL 147 (147) Q Consensus 70 ~~~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~--~~~~~~~i~~~l~~~i~~ 147 (147) .+..|||... ..+|++||+++||++ |||+||+.++.+.+ ++++++++.+++++.|+= T Consensus 78 g~s~VG~~~~--~~~~~A~f~n~GT~k-------------------~~~~hFve~~~~~a~~k~~Vl~A~~~~~k~~l~~ 136 (141) T protein:vir:50 78 GVSTVGWKNN--YHAQNARRLNDGTKK-------------------YRADHFVTNVQNDSTVQKKVLLEKKRNTKNSLEE 136 (141) T ss_pred CeeeeccCCC--ccceeeeccccCccc-------------------cCCCchhHHHHHhhhhHHHHHHHHHHHHHHHHHh Confidence 4678999643 357999999999996 89999999999865 789999999999999887 No 74 >protein:vir:106506 Length: 137 # NCBI annotation: Pas21 # Family: family:all:1084 # MgeID: mge:1680 # MgeName: phiAsp2 # Cross-refs: genbank:acc:YP_024807;genbank:gi:48697422;genbank:GeneID:2846163 Probab=99.73 E-value=3e-21 Score=133.10 Aligned_cols=127 Identities=14% Similarity=0.118 Sum_probs=91.8 Q ss_pred eehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeeccCCcccc Q lcl|NC_015262. 6 TTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKEDNSKIF 85 (147) Q Consensus 6 ~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~~~~~~ 85 (147) -|.+..+|.+. .|.....++++++++..+..++.++|.++|++||+|++||........+....+.|+ +.+. T Consensus 1 ~~~~~~~l~~~--~l~~~~~~~~~~~~~~~a~~ve~~ak~~aPv~TG~Lr~SI~~~~~~~~g~~v~~~V~------~~~~ 72 (137) T protein:vir:10 1 MVAHTLRIERA--QLHGLGMDEARKAVNRVVRRTFTRSQILAPVDTGYLRASGRLVLGRERGAVVIGSVE------YTAR 72 (137) T ss_pred CcccccccChh--hHhhHHHHHHHHHHHHHHHHHHHHHHhcCCcCchhhhccceeeeeeccccEEEEEec------CCcc Confidence 13333333332 344445678889999999999999999999999999999987655444443344443 4579 Q ss_pred hhhhhhcccccccccccccc----------cccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_015262. 86 YGKFLEFGASAHKIPIKKGK----------KKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGLG 146 (147) Q Consensus 86 y~~~vE~GT~~~~~~~~~~~----------~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~ 146 (147) |+.|+||||++|.+.|+..+ .+.+.++|||++|+|||+||++...+.. -++=-|+ T Consensus 73 YA~~ve~GT~ph~I~pk~~kaL~f~~~G~~vf~k~V~hPG~k~~PfL~~Al~~~~~~~------~~~~~~~ 137 (137) T protein:vir:10 73 YAAAVHNGRRALTIRAKGNGRLKFTVEGRTVYARSVHQPARAGRPYLSQALREVAPQE------GFRVTIG 137 (137) T ss_pred cceeeecCCCCceeecCCCccceeecCCeeEeccceecCCCCCChhhHHHHHHhhccc------ceeEeeC Confidence 99999999999999887644 2466789999999999999999877632 1111122 No 75 >protein:vir:4859 Length: 140 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049399;genbank:gi:9632427;genbank:GeneID:1258496 Probab=99.71 E-value=1.6e-20 Score=129.16 Aligned_cols=124 Identities=17% Similarity=0.233 Sum_probs=106.1 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC------C---cchhhhcceecccccCC-Cce Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSK------R---SGKLKDGLKVSGVKKKG-GTK 70 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~------~---tG~l~~sI~~~~~~~~~-~~~ 70 (147) |. +|+ .||++|+++|++|...+.+.-.+++++||+++++.++..+|+ . .+||+|||.++.....+ ... T Consensus 1 M~-~~~-d~l~e~~~~lekl~~~~~~~~~katkAGA~v~~~~L~~~tp~~h~~~~~t~~~~HlaD~I~~~~~~iDg~~~g 78 (140) T protein:vir:48 1 MT-GLD-EALEGWLKTVASIGDLTPAEQAKITTAGAKVFKEELAEVTRQKHYSNKKHLKYGHMADGLSVQSTNVDGRKNG 78 (140) T ss_pred Cc-cHH-HHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCCCCCCcchhceeecccccccccCc Confidence 76 677 799999999999998888889999999999999999999994 2 45899999987543222 244 Q ss_pred EEEEeeeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHH--HHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 71 YVLVGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESK--KDEAKNVMKEILKRGLGL 147 (147) Q Consensus 71 ~~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~--~~~~~~~i~~~l~~~i~~ 147 (147) +..|||.+. ..+|++||+++||++ |||+||+.++.+.+ +.++++++.+++++.|+= T Consensus 79 ~s~VG~~kk--~~a~~A~f~n~GT~k-------------------~~~~hFve~~~~e~~~k~~vl~A~~~~~~~~l~~ 136 (140) T protein:vir:48 79 VSTVGWVNR--YHAQNARRLNDGTKK-------------------YRADHFVTNVQNDSAVQTKVLLAEKEEYEKLIRK 136 (140) T ss_pred eeeeccCCC--cceeeeeccccCccc-------------------cCCCchhHHHHHhhhhHHHHHHHHHHHHHHHHHh Confidence 778999753 357999999999986 99999999999966 789999999999999987 No 76 >protein:vir:100223 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025034;genbank:gi:48697267;genbank:GeneID:2948321 Probab=99.69 E-value=3.3e-20 Score=127.38 Aligned_cols=121 Identities=17% Similarity=0.255 Sum_probs=102.5 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC----------cchhhhcceecccccCC-Cc Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKR----------SGKLKDGLKVSGVKKKG-GT 69 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~----------tG~l~~sI~~~~~~~~~-~~ 69 (147) |. |+ .||++|+++|++|.....+.-.+++++||+++++.++.++|+. .+||+++|.++.....+ .. T Consensus 1 ~~--~~-~~l~e~l~~lekl~~~~~~~~~k~tkaGA~v~~~~L~~~tp~~~~~~~~~~~~~~HlaD~I~~~~~~idg~~~ 77 (139) T protein:vir:10 1 MD--MD-EALGQWLKQVSKAAQLSVSDQEKITKAGADVYAKELAETTKEKHPNTKGDGGKYGHLSEDISSAAGDIDGDHN 77 (139) T ss_pred CC--HH-HHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHhcccccccCCCCCCCCCcccccceecCcccccccc Confidence 43 33 6999999999999887777788999999999999999999952 36899999987643332 23 Q ss_pred eEEEEeeeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 70 KYVLVGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGLGL 147 (147) Q Consensus 70 ~~~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~~ 147 (147) ..+.|||.. .+|.|||+|+||.+ |||+||+..+.++.++++.+++.+++++.|+= T Consensus 78 g~~~VG~~~----~~~~Ahf~n~GT~~-------------------~~~~hFie~t~~e~~~ev~~a~~~~~ke~l~~ 132 (139) T protein:vir:10 78 GSSTVGFHN----KAHIARFLNDGTKN-------------------IRADHFVDNARDDAKDAVFAAEAEKYQAMIAK 132 (139) T ss_pred ccceeCCCC----CceeeeeeccCccc-------------------cCCCchHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 457788853 35789999999986 99999999999999999999999999999988 No 77 >protein:vir:100652 Length: 134 # NCBI annotation: 77ORF029 # Family: family:all:589 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958610;genbank:gi:41189542;genbank:GeneID:2743798 Probab=99.69 E-value=7.2e-20 Score=125.54 Aligned_cols=122 Identities=19% Similarity=0.217 Sum_probs=106.8 Q ss_pred eeeeehhHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHhCCC--CcchhhhcceecccccCCCceEEEEeeec Q lcl|NC_015262. 3 VEITTEGFDAVLSKIESM--GKSGDKLLNEAVKAGGNVILQDALPRVSK--RSGKLKDGLKVSGVKKKGGTKYVLVGITK 78 (147) Q Consensus 3 ~~~~i~Gl~el~~~l~~l--~~~~~~~~~~al~~~a~~v~~~ak~~ap~--~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~ 78 (147) |+++++|++||+++|++. +..+.++.++||.++++.|.+++|.++.+ |||.+.+++..+.+...+|...+.|||. T Consensus 1 MsvevkGv~eil~~LE~k~g~~~~~ri~dkAL~~age~v~~~~K~~~~~fkDTGati~ev~~s~p~~~~G~r~V~vgW~- 79 (134) T protein:vir:10 1 MSVKVTGDKALERELEKHFGIKEMVKVQDKALIAGAKVIVEEIKKQLKPSEDSGALISEIGRTEPEWIKGKRTVTIRWR- 79 (134) T ss_pred CeEEeecHHHHHHHHHHhhchhhhhhhhhHHHHHHhHHHHHHHHhhcCccccccceeccEeecCeeecCCceEEEEEEE- Confidence 678899999999999876 67789999999999999999999998765 9999999999999998899999999994 Q ss_pred cCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhh--------HHHHHHHHHHHHHHHHHHHH Q lcl|NC_015262. 79 EDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAP--------AYESKKDEAKNVMKEILKRG 144 (147) Q Consensus 79 ~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~p--------A~~~~~~~~~~~i~~~l~~~ 144 (147) ...+.+.+.||.|||+.+ ++.-+|+.| |+++.+..+.+.++++|++- T Consensus 80 G~~~R~~ivHLnE~Gyt~-------------------~r~Gk~i~PrG~G~i~~a~~~~e~~~~~~ik~eL~kl 134 (134) T protein:vir:10 80 GPFERFRIVHLIENGHVE-------------------KKSGKFVKPKAMGGINRAIRQGQNKYFETLKRELKKL 134 (134) T ss_pred cCCceeeEEEeeecceee-------------------cCCCCeeccchhhHHHHHHHhhhHHHHHHHHHHHhcC Confidence 344678999999999964 234455555 99999999999999999887 No 78 >protein:vir:4833 Length: 140 # NCBI annotation: ORF29 # Family: family:all:1029 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038330;genbank:gi:9634656;genbank:GeneID:1262624 Probab=99.66 E-value=1.4e-19 Score=124.02 Aligned_cols=124 Identities=18% Similarity=0.257 Sum_probs=105.2 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC------c---chhhhcceecccccCC-Cce Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKR------S---GKLKDGLKVSGVKKKG-GTK 70 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~------t---G~l~~sI~~~~~~~~~-~~~ 70 (147) |. +|+ .||++|+.+|++|...+.+.-.+++++||+++++.++..+|+. | +||+|+|.++.....+ ... T Consensus 1 M~-~~~-d~l~e~~~~v~kl~~~~~~~~~katkAGAkv~~~~L~~~tp~~h~~~r~t~~~~HlaD~I~~~~~~idg~~dG 78 (140) T protein:vir:48 1 MT-GLD-EALEGWLKTVASIGDLTPAEQAKITTAGAKVFKKELAEVTREKHYSKKKDLKYGHMADGLAVQSTNVDGRKNG 78 (140) T ss_pred Cc-cHH-HHHHHHHHHHHHhccCCHHHHHHHHHHhHHHHHHHHHHhcccCCCCCCCCCCCCcccccceeccccccccccc Confidence 76 577 7999999999999988888899999999999999999999842 3 5999999987543322 245 Q ss_pred EEEEeeeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHH--HHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 71 YVLVGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESK--KDEAKNVMKEILKRGLGL 147 (147) Q Consensus 71 ~~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~--~~~~~~~i~~~l~~~i~~ 147 (147) +..|||.+. ..+|+++|+++||++ |||+||+..+.+.+ ++++++++.+++++.|.= T Consensus 79 ~s~VG~~k~--~~a~~a~f~NdGT~k-------------------~~~~hFve~t~~e~~~~~~vl~A~~~~y~~~l~k 136 (140) T protein:vir:48 79 VATVGWKNN--YHAQNARRLNDGTKK-------------------YRADHFVTNVQNDSAVRDKVLLAEKEEYEKLIRK 136 (140) T ss_pred ceeecccCC--CceeEEeecccCccc-------------------cCCCchHHHHHHhhhhHHHHHHHHHHHHHHHHHh Confidence 778999864 357999999999986 99999999999965 889999999999988855 No 79 >protein:vir:9513 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835560;genbank:gi:30043947;genbank:GeneID:1260542 Probab=99.66 E-value=2.4e-19 Score=122.69 Aligned_cols=128 Identities=18% Similarity=0.214 Sum_probs=105.8 Q ss_pred eeeeehhHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHhCCC--CcchhhhcceecccccCCCceEEEEeeec Q lcl|NC_015262. 3 VEITTEGFDAVLSKIESM--GKSGDKLLNEAVKAGGNVILQDALPRVSK--RSGKLKDGLKVSGVKKKGGTKYVLVGITK 78 (147) Q Consensus 3 ~~~~i~Gl~el~~~l~~l--~~~~~~~~~~al~~~a~~v~~~ak~~ap~--~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~ 78 (147) |+++++|++||+++|++. +..+.++.++||.++++.+.+++|.++++ |||.+.+++..+.+...+|..++.|||. T Consensus 1 msvevkGv~eil~~le~k~g~~~~~ri~nkAL~~age~v~~~~K~~~~~fkDTG~t~~ev~~s~p~~~~G~r~V~vgW~- 79 (134) T protein:vir:95 1 MSVKVIGDKALERELEKRFGIKEMVKVQDKALIAGAKVIVEEVKKQLKPSKDTGALINEVSFSKPEWINGKRTITVHWR- 79 (134) T ss_pred CeEEEecHHHHHHHHHHhhchhhhhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeccEEecCeeecCCceEEEEEEE- Confidence 678999999999999876 66789999999999999999999999985 9999999999999998889999999994 Q ss_pred cCCcccchhhhhhcccccccccccccccccccccCCCCCCCc--chhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015262. 79 EDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKP--FLAPAYESKKDEAKNVMKEILKRG 144 (147) Q Consensus 79 ~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~P--Fl~pA~~~~~~~~~~~i~~~l~~~ 144 (147) ...+.+.+.||.|||+..... ++++.|+- -+..|++..++.+.+.++++|++- T Consensus 80 G~~~R~~iiHLNE~Gytr~~~-------------Gk~i~PrG~G~i~~a~~~~e~~~~~~ik~eL~kl 134 (134) T protein:vir:95 80 GSKDRYKIVHLIEYGHVQKGT-------------GKFIKPKAMGGVNRAIRQGQNKYFETLKRELKKL 134 (134) T ss_pred cCCceeEEEEeecccceeccc-------------CCccCcchhhHHHHHHHhhhHHHHHHHHHHHhcC Confidence 344678999999999754110 01111111 145599999999999999999987 No 80 >protein:vir:101302 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908835;genbank:gi:118725099;genbank:GeneID:4555873 Probab=99.66 E-value=2.4e-19 Score=122.69 Aligned_cols=128 Identities=18% Similarity=0.214 Sum_probs=105.8 Q ss_pred eeeeehhHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHhCCC--CcchhhhcceecccccCCCceEEEEeeec Q lcl|NC_015262. 3 VEITTEGFDAVLSKIESM--GKSGDKLLNEAVKAGGNVILQDALPRVSK--RSGKLKDGLKVSGVKKKGGTKYVLVGITK 78 (147) Q Consensus 3 ~~~~i~Gl~el~~~l~~l--~~~~~~~~~~al~~~a~~v~~~ak~~ap~--~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~ 78 (147) |+++++|++||+++|++. +..+.++.++||.++++.+.+++|.++++ |||.+.+++..+.+...+|..++.|||. T Consensus 1 msvevkGv~eil~~le~k~g~~~~~ri~nkAL~~age~v~~~~K~~~~~fkDTG~t~~ev~~s~p~~~~G~r~V~vgW~- 79 (134) T protein:vir:10 1 MSVKVIGDKALERELEKRFGIKEMVKVQDKALIAGAKVIVEEVKKQLKPSKDTGALINEVSFSKPEWINGKRTITVHWR- 79 (134) T ss_pred CeEEEecHHHHHHHHHHhhchhhhhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeccEEecCeeecCCceEEEEEEE- Confidence 678999999999999876 66789999999999999999999999985 9999999999999998889999999994 Q ss_pred cCCcccchhhhhhcccccccccccccccccccccCCCCCCCc--chhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015262. 79 EDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKP--FLAPAYESKKDEAKNVMKEILKRG 144 (147) Q Consensus 79 ~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~P--Fl~pA~~~~~~~~~~~i~~~l~~~ 144 (147) ...+.+.+.||.|||+..... ++++.|+- -+..|++..++.+.+.++++|++- T Consensus 80 G~~~R~~iiHLNE~Gytr~~~-------------Gk~i~PrG~G~i~~a~~~~e~~~~~~ik~eL~kl 134 (134) T protein:vir:10 80 GSKDRYKIVHLIEYGHVQKGT-------------GKFIKPKAMGGVNRAIRQGQNKYFETLKRELKKL 134 (134) T ss_pred cCCceeEEEEeecccceeccc-------------CCccCcchhhHHHHHHHhhhHHHHHHHHHHHhcC Confidence 344678999999999754110 01111111 145599999999999999999987 No 81 >protein:vir:102963 Length: 163 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945289;genbank:gi:39653724;uniprot:Q708M3;genbank:GeneID:2672877 Probab=99.64 E-value=8.6e-19 Score=119.61 Aligned_cols=127 Identities=17% Similarity=0.252 Sum_probs=111.8 Q ss_pred CceeeeehhHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHhCCC---------------------------Cc Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMG--KSGDKLLNEAVKAGGNVILQDALPRVSK---------------------------RS 51 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~--~~~~~~~~~al~~~a~~v~~~ak~~ap~---------------------------~t 51 (147) |+..|++++|+++.+.|..+. ..+.+.+++.+.+.|..+...++.++|+ +| T Consensus 1 m~~~~d~~~l~~f~k~l~~~~~~~~~~~~~~~~~~e~a~~ll~~vk~rtPv~~~~~~~~~~~~~~~k~~k~~~~~~~k~t 80 (163) T protein:vir:10 1 MSGGFDYRSFAKFANNFNRNANHAKVDRFMRQTLNYEGTELKSKVKERTPVGVYTDHWVEFTTKDGKHVKFWASAHGKQG 80 (163) T ss_pred CCCccCHHHHHHHHHHHHHHhhhcchHHHHHHHHHHHHHHHHHHHHHhCCcccchhhhhhhhhcccchhhhhcccccccc Confidence 999999999999999998874 3467789999999999999999999996 79 Q ss_pred chhhhcceecccccCCCceEEEEeeeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHH Q lcl|NC_015262. 52 GKLKDGLKVSGVKKKGGTKYVLVGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKD 131 (147) Q Consensus 52 G~l~~sI~~~~~~~~~~~~~~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~ 131 (147) |+|++|+.+..++..++...+.|+ +..+|||||||||+... ..+.|++++|..|.+.... T Consensus 81 G~lr~swk~~~~~k~~~~~~v~v~------N~~~YA~~VE~GHR~~~--------------gGfV~G~fml~~s~~~~~~ 140 (163) T protein:vir:10 81 GTLQKGWSKSRIEVSGRTYKQKVY------NKVYYAPHVEYGHKTVN--------------GGFVPGQFFLHKTVEDTKS 140 (163) T ss_pred chhhccceecceeecCCceEEEEE------ecCCccchhhcceeecC--------------CceeccchhhHHHHHHHHH Confidence 999999999998888887777775 46799999999987642 2469999999999999999 Q ss_pred HHHHHHHHHHHHHhcC Q lcl|NC_015262. 132 EAKNVMKEILKRGLGL 147 (147) Q Consensus 132 ~~~~~i~~~l~~~i~~ 147 (147) ++.+.+++.|.+-|+= T Consensus 141 ~~~~~~e~~l~~~l~k 156 (163) T protein:vir:10 141 DMEKRVRDKYDGFMRK 156 (163) T ss_pred HHHHHHHHHHHHHHHH Confidence 9999999999888876 No 82 >protein:vir:966 Length: 123 # NCBI annotation: Orf48 # Family: family:all:970 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076620;genbank:gi:13095728;genbank:GeneID:920248 Probab=99.63 E-value=8.3e-19 Score=119.71 Aligned_cols=122 Identities=18% Similarity=0.193 Sum_probs=101.1 Q ss_pred CceeeeehhHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeecc Q lcl|NC_015262. 1 MSVEITTEGFD-AVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKE 79 (147) Q Consensus 1 M~~~~~i~Gl~-el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~ 79 (147) |+.+++|..|+ ++.+.|+++.+++.+.+..++++.|..+.+++++.+|++||.+++|+.++... .+..+.+. T Consensus 1 m~~~v~id~L~~~i~~~L~~y~~~v~~~v~~~v~~~a~~~~~~lk~~sP~~TG~yaksW~~k~~~---~~~~~v~~---- 73 (123) T protein:vir:96 1 MANKISIDDLAKTIESEVRNWTKDVVDDIDDIKKDITKNGVKQLRESSPKRTGDYAKNWTSQKLK---NGDQVIYQ---- 73 (123) T ss_pred CCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCccccccccceeeeecC---CeeEEEEE---- Confidence 99999999995 57999999999999999999999999999999999999999999999765422 22222221 Q ss_pred CCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_015262. 80 DNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKR 143 (147) Q Consensus 80 ~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~ 143 (147) .+.....+|++|||+.+. +..+++|+|||+||++...+.+.+.+.+.|++ T Consensus 74 ~~~~y~l~HLLE~GHa~r--------------~GGrV~a~phI~paee~~~~~l~~~i~r~l~~ 123 (123) T protein:vir:96 74 KAPTYRLTHLLENGHAKR--------------NGGRVSPKVHIAPVEEELVSNYISRVEKRLSQ 123 (123) T ss_pred ecCCcceEEeeecceeec--------------CCceeCcchhhhHHHHHHHHHHHHHHHHHhcC Confidence 122334689999998753 22358999999999999999999999999999 No 83 >protein:vir:9647 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795409;genbank:gi:28876182;genbank:GeneID:1257731 Probab=99.62 E-value=1.5e-18 Score=118.27 Aligned_cols=128 Identities=20% Similarity=0.248 Sum_probs=113.4 Q ss_pred CceeeeehhHHHHHHHHHH-HHH-HHHHHHHHHHHHHHHHHHHHHHHhCCC--CcchhhhcceecccccCCCceEEEEee Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIES-MGK-SGDKLLNEAVKAGGNVILQDALPRVSK--RSGKLKDGLKVSGVKKKGGTKYVLVGI 76 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~-l~~-~~~~~~~~al~~~a~~v~~~ak~~ap~--~tG~l~~sI~~~~~~~~~~~~~~~Vg~ 76 (147) ||.-.+++|++||+++|++ |++ .+.++.++||.++++.+++.+|.++|+ |||+..++|.++.++..+|...+.||| T Consensus 1 ~~~~aevkGv~Eilk~lE~klG~~~v~ri~nkAL~~~ge~v~~~lK~~~~~f~DTG~t~dev~~s~~~~~~G~r~V~VgW 80 (132) T protein:vir:96 1 MSGFANLKGVEELLANMEKKLGPAKVNRVVNRSLKEIGKELEPSFKSAISIYKRTGETTESAVVSGVRREDGIPKVKLGF 80 (132) T ss_pred CCccccccCHHHHHHHHHHhhCHHHHHHHhHHHHHHHHHHHHHHHHHhhhhhhhcchhhcceeecCeeecCCceEEEecc Confidence 9988899999999999987 988 589999999999999999999999995 999999999999999999999999999 Q ss_pred eccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 77 TKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGLGL 147 (147) Q Consensus 77 ~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~~ 147 (147) .. +.+..-|+.|||+.+.+.+.. .-++..|++..+..+.+.++++|++.|+= T Consensus 81 ~G---pR~~ivHLNE~GyGk~~~PrG----------------~G~I~~a~~~se~~~~~~~~~elkk~l~~ 132 (132) T protein:vir:96 81 TT---PRWNIVHLQELEYGWKHNRRG----------------VGVIRRYSDILETIYPRGIRDKLKRGFDG 132 (132) T ss_pred cC---CceeEEeeecccccCCcCCCc----------------chHHHHHHHhhhhHHHHHHHHHHHHHhcC Confidence 63 456667999999865432222 23699999999999999999999999999 No 84 >protein:vir:9879 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:2718 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795641;genbank:gi:28876400;genbank:GeneID:1257931 Probab=99.57 E-value=5.2e-18 Score=115.33 Aligned_cols=118 Identities=18% Similarity=0.201 Sum_probs=90.5 Q ss_pred ehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--CCC-------CcchhhhcceecccccCCCceEEEEeee Q lcl|NC_015262. 7 TEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPR--VSK-------RSGKLKDGLKVSGVKKKGGTKYVLVGIT 77 (147) Q Consensus 7 i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~--ap~-------~tG~l~~sI~~~~~~~~~~~~~~~Vg~~ 77 (147) |.|+|+|.++|++... +.+++.++.-...+...+++. +|+ +||+|++||....+ .++..+.||+. T Consensus 1 i~G~~~L~~~Lk~~s~---~dvk~VVkkN~ael~~r~q~~~~~pv~~~~k~~dTG~lkRSi~l~~~---~~g~~~~vgp~ 74 (127) T protein:vir:98 1 MTGMPALEVKLRSMSE---KRWDRVANKNLTEMFNRAARPPGTPIGKNTKRHKSGELLRSRRLKKV---NSSKDVITGNF 74 (127) T ss_pred CcChHHHHHHHHHhhH---HHHHHHHhhhhHHHHHHHHhccCCceeccccccCcccceeeeEEEEe---cCCceEEeccC Confidence 9999999999987622 336777777777788888775 677 99999999976543 34455566653 Q ss_pred ccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_015262. 78 KEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKR 143 (147) Q Consensus 78 ~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~ 143 (147) . ....|+.||||||+-...+.. ..+++|||||.|||+..++.+.+.|.+.+++ T Consensus 75 g---~t~dYapyvEyGTR~m~~~~~----------~gf~~aqp~l~paf~~Qk~iF~~DL~~l~k~ 127 (127) T protein:vir:98 75 G---YIKDYAPHVEYGHRIVRNGKQ----------VGYANGTKYLFNNVKKQREIYRQDMLNELRR 127 (127) T ss_pred c---ccccccceeecceeeeecccc----------cccccCccccccchHHHhHHHHHHHHHHhcC Confidence 2 235899999999985321111 2358899999999999999999999999999 No 85 >protein:vir:99528 Length: 92 # NCBI annotation: putative major tail protein # Family: family:all:180 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958541;genbank:gi:41179323;genbank:GeneID:2717166 Probab=99.56 E-value=5.4e-18 Score=115.24 Aligned_cols=91 Identities=18% Similarity=0.264 Sum_probs=71.4 Q ss_pred Cc-eeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeecc Q lcl|NC_015262. 1 MS-VEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKE 79 (147) Q Consensus 1 M~-~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~ 79 (147) |+ ++|+|.|+|+|++.|++.... +.+++++++.+..++.+|++++|++||+|++||...... ++....|+.. T Consensus 1 Ma~~~i~~~Gld~L~~~L~~~~~~--~~v~~vv~~~~~~l~~~ak~~ap~dTG~lrrSI~~~~~~---~g~~~~v~~~-- 73 (92) T protein:vir:99 1 MADYSISWDGLDALDEALANQQNM--NTVKKVVKKHTANLMTATQQAVPVDTGHLKQSAQIQISR---DGFTGSVTYG-- 73 (92) T ss_pred CCceeeEeehHHHHHHHHHhhccH--HHHHHHHHHHHHHHHHHHHHhCCCCccccceeeeEEeec---CCeeEEEEec-- Confidence 98 689999999999999876542 457899999999999999999999999999999755322 2222333221 Q ss_pred CCcccchhhhhhcccccccccccccccccccccCCCCCC Q lcl|NC_015262. 80 DNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSP 118 (147) Q Consensus 80 ~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a 118 (147) .+.+.|+.|+||||+. |+| T Consensus 74 -gp~a~Ya~YvE~GTR~-------------------M~A 92 (92) T protein:vir:99 74 -GGLVNYAAYVEFGTRF-------------------MDS 92 (92) T ss_pred -cCccccccccccceee-------------------cCC Confidence 2456899999999996 666 No 86 >protein:vir:10367 Length: 119 # NCBI annotation: conserved phage protein # Family: family:all:2714 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858959;genbank:gi:32128424;genbank:GeneID:2648366 Probab=99.54 E-value=2.8e-18 Score=116.78 Aligned_cols=106 Identities=23% Similarity=0.316 Sum_probs=82.3 Q ss_pred HHHHHHHhCCCCcchhhhcceec--ccccCCCceEEEEeeeccCCcccchhhhhhcccccccccccc----c-ccccccc Q lcl|NC_015262. 39 ILQDALPRVSKRSGKLKDGLKVS--GVKKKGGTKYVLVGITKEDNSKIFYGKFLEFGASAHKIPIKK----G-KKKGRII 111 (147) Q Consensus 39 v~~~ak~~ap~~tG~l~~sI~~~--~~~~~~~~~~~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~----~-~~~~~~~ 111 (147) |+++++..+|++||.|++||... ..+..+|..++.|||+. .+++|+|++|||+......... + ....... T Consensus 1 ~rDeakarv~~~~G~Lr~sIY~ay~~~~S~dG~~~Y~Vswn~---rkAPhghlvE~Ghw~~~~~~~~~dG~w~~~~~~l~ 77 (119) T protein:vir:10 1 MRESAKAFVNDETGKLRSNLYVAYSTEESTNGVQTYAVSWRK---KAAPHGHLLEFGHWQTHAAYKGKDGEWYSSSVKLV 77 (119) T ss_pred CCcccccccCCCccchhhhheeeeccccCCCCEEEEEeecCC---CcCCcccccccceeeeeeeeeccCceeeecCcccc Confidence 99999999999999999999765 55566778888888875 4579999999996432211110 0 0112244 Q ss_pred cCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 112 NHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGLGL 147 (147) Q Consensus 112 ~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~~ 147 (147) ++..+||+||||||||+...++.++|.+.+++.+.= T Consensus 78 ~~~~vPa~pFlRpA~da~~~~a~~~~~~r~~~rv~E 113 (119) T protein:vir:10 78 NPKWIPARPFLRPGYDSVAMQIPDIAKAAGAKKYAE 113 (119) T ss_pred CceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHH Confidence 567899999999999999999999999998888765 No 87 >protein:vir:81067 Length: 119 # NCBI annotation: p12 # Family: family:all:2714 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285682;genbank:gi:156535145;genbank:GeneID:5247112 Probab=99.54 E-value=2.9e-18 Score=116.71 Aligned_cols=106 Identities=23% Similarity=0.313 Sum_probs=82.7 Q ss_pred HHHHHHHhCCCCcchhhhcceec--ccccCCCceEEEEeeeccCCcccchhhhhhccccccccccc---ccc--cccccc Q lcl|NC_015262. 39 ILQDALPRVSKRSGKLKDGLKVS--GVKKKGGTKYVLVGITKEDNSKIFYGKFLEFGASAHKIPIK---KGK--KKGRII 111 (147) Q Consensus 39 v~~~ak~~ap~~tG~l~~sI~~~--~~~~~~~~~~~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~---~~~--~~~~~~ 111 (147) |+++++..+|++||.|++||... ..+..+|..++.|||+. .+++|+|++|||+........ ... ...... T Consensus 1 ~rDeakarv~~~~G~Lr~sIY~ay~~~~S~dG~~~Y~Vswn~---rkAPhghlvE~Ghw~~~~~~~~~dG~w~~~~~~l~ 77 (119) T protein:vir:81 1 MRESAKAFVNDETGKLRSNLYVAYSPEESTNGVQTYAVSWRK---KAAPHGHLLEFGHWQTHAAYKGKDGEWYSSSVKLV 77 (119) T ss_pred CCcccccccCCCccchhhhheeeeccccCCCCeEEEEeeccC---CcCCcccccccceeeeeeeeeccCceeeecCcccc Confidence 99999999999999999999765 55566778888888875 457999999999643221111 000 112345 Q ss_pred cCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 112 NHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGLGL 147 (147) Q Consensus 112 ~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~~ 147 (147) .+..+||+||||||||+...++.++|.+.+++.+.= T Consensus 78 ~~~~vPa~pFlRpA~da~~~~a~~~~~~r~~~rv~E 113 (119) T protein:vir:81 78 NPKWIPARPFLRPGYDSVAMQIPDIAKAAGAKKYAE 113 (119) T ss_pred CceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHH Confidence 567899999999999999999999999998888765 No 88 >protein:vir:98636 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039927;genbank:gi:126011102;genbank:GeneID:4818472 Probab=99.53 E-value=4.9e-17 Score=110.00 Aligned_cols=128 Identities=20% Similarity=0.263 Sum_probs=112.1 Q ss_pred CceeeeehhHHHHHHHHHH-HHHH-HHHHHHHHHHHHHHHHHHHHHHhCC--CCcchhhhcceecccccCCCceEEEEee Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIES-MGKS-GDKLLNEAVKAGGNVILQDALPRVS--KRSGKLKDGLKVSGVKKKGGTKYVLVGI 76 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~-l~~~-~~~~~~~al~~~a~~v~~~ak~~ap--~~tG~l~~sI~~~~~~~~~~~~~~~Vg~ 76 (147) ||.-.+++|++|++++|++ |++. ++++.++||.++++.+++..|.+.+ .|||...+++..+.+...+|...+.||| T Consensus 7 ~~~~aevkGv~Eilk~lE~klG~~~~~ri~nkAL~~~ge~v~~~lK~~~~~fkDTGat~dev~~s~p~~~~G~r~V~igW 86 (138) T protein:vir:98 7 MSGFANLKGVEELLANMEKKLGPAKVNRVVNRSLKEIGKELEPSFKSAISIYKRTGETTESAVVSGVRREDGIPKVKLGF 86 (138) T ss_pred ccccccccCHHHHHHHHHHhhCHHhhhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeeeeeecCeeecCCceEEEEee Confidence 8877889999999999987 7765 8999999999999999999999998 6999999999999999899999999999 Q ss_pred eccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 77 TKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGLGL 147 (147) Q Consensus 77 ~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~~ 147 (147) .. +.+..-|+.|||+.+.+.+. ..-++..|++..+..+.+.|+++|++.|+= T Consensus 87 ~G---pR~~ivHLNE~GyGk~i~Pr----------------G~G~I~ka~~~se~~y~~~vk~el~k~l~~ 138 (138) T protein:vir:98 87 TT---PRWNIVHLQELEYGWKHNRR----------------GVGVIRRYSDILETIYPRGIRDKLKRGFDG 138 (138) T ss_pred ec---CeeeEEeeecccccCCcCCC----------------cchHHHHHHHhhhHHHHHHHHHHHHHHhcC Confidence 64 35666799999986543222 223699999999999999999999999999 No 89 >protein:vir:3848 Length: 159 # NCBI annotation: hypothetical protein # Family: family:all:1029 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050154;swissprot:trembl:q9t1f3;genbank:gi:9633046;uniprot:Q9T1F3;genbank:GeneID:1262148 Probab=99.50 E-value=6.6e-17 Score=109.28 Aligned_cols=130 Identities=13% Similarity=0.076 Sum_probs=105.4 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC-----------------------cchhhhc Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKR-----------------------SGKLKDG 57 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~-----------------------tG~l~~s 57 (147) |..+|+ .+|++++++|+.+.....+.-.+++++||+++++.++..+|+. +|||+|+ T Consensus 1 mm~~~~-~~l~~~l~~v~k~~~~~~~~k~kiTkAGAkv~~e~L~~~Tp~~h~~~~k~~~~~~~~~k~~~~~~~~~HlaD~ 79 (159) T protein:vir:38 1 MANDMG-EFYNNWVNEVEKGMKLSVEDKAKITGEGAEAFSTVLHDHTPRSNEIYRRGRSAGHANAKHHNRNRKTKHLQDS 79 (159) T ss_pred CcchHH-HHHHHHHHHHHHhcCCCHHHHHHHHHHhHHHHHHHHHHhcccCCCccccccccccccccccCcCcCCCccccc Confidence 888998 7899999999887666666778999999999999999999862 3699999 Q ss_pred ceecccccCCCc--eEEEEeeeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHH Q lcl|NC_015262. 58 LKVSGVKKKGGT--KYVLVGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKN 135 (147) Q Consensus 58 I~~~~~~~~~~~--~~~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~ 135 (147) |.++.....+|. ...+|||.... .+|+++|+..||+++++.+ +...+|+..+.++.++++++ T Consensus 80 I~~~~~~~iDg~~dG~s~VGw~~~~--~a~~a~f~NdGT~~m~~k~--------------~~gdHFvekt~~~~k~~Vl~ 143 (159) T protein:vir:38 80 ITYKPGYTADKLHTGDTDVGFEGKY--YDFLAKIVNNGQHHMSPKR--------------YKNMHFLDKAQQEAKKSVAE 143 (159) T ss_pred eeeecCccccccccceeeecccCCc--cceEeeecccCccccCCCC--------------ccCChhHHHHHHHHHHHHHH Confidence 988766444443 37899997543 4699999999998732110 22348999999999999999 Q ss_pred HHHHHHHHHhcC Q lcl|NC_015262. 136 VMKEILKRGLGL 147 (147) Q Consensus 136 ~i~~~l~~~i~~ 147 (147) ++.+++++-|.= T Consensus 144 A~~~~~~~il~~ 155 (159) T protein:vir:38 144 AELKAYKEVMNH 155 (159) T ss_pred HHHHHHHHHhhc Confidence 999999999988 No 90 >protein:vir:95372 Length: 124 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764480;genbank:gi:115334634;genbank:GeneID:5179259 Probab=99.44 E-value=5.4e-16 Score=104.27 Aligned_cols=119 Identities=13% Similarity=0.198 Sum_probs=91.0 Q ss_pred CceeeeehhH-HHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEe Q lcl|NC_015262. 1 MSVEITTEGF-DAVLSKIESMGKSGDKLLNEAV----KAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVG 75 (147) Q Consensus 1 M~~~~~i~Gl-~el~~~l~~l~~~~~~~~~~al----~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg 75 (147) |+ .|+|.+| +++.+.|+.+.+++.+.+++++ ++++..+.+++++.+|++||.++.++..+..... ..| T Consensus 1 M~-~i~id~La~~I~~~L~~Ys~~v~~~v~~~v~~vak~a~~~lkk~i~~tspkrTG~YaK~W~~kk~~e~-----~~V- 73 (124) T protein:vir:95 1 MA-KIKIGRLADEITSQLRKYSQVIADDVEQIMDDVTKEAVGRLKSKIQEVGLVQTGDYMRGWTRKRVPNG-----WVI- 73 (124) T ss_pred Cc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHhcCcccccchhccceeeeecCc-----eeE- Confidence 88 5999999 5788999999988877776666 4455555566667899999999999987665432 122 Q ss_pred eeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_015262. 76 ITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKR 143 (147) Q Consensus 76 ~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~ 143 (147) ++ .......|++|||+.+.. +..++|+|||+|+.+...+.+.+.|++.|++ T Consensus 74 ~n---k~~yqLtHLLE~GHAkr~--------------GGRV~a~pHI~paee~~~~~l~~~i~~~l~~ 124 (124) T protein:vir:95 74 HN---KTEYRLAHLLEYGHATVD--------------GGRVPGTPHIRPIEDWLEKEFEDRVEKAIKQ 124 (124) T ss_pred EE---cCCCceeeeeecceeccC--------------CcccCCccchhHHHHHHHHHHHHHHHHHhcC Confidence 22 122334899999997632 2358999999999999999999999999999 No 91 >protein:vir:78335 Length: 133 # NCBI annotation: gp9 # Family: family:all:589 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468648;genbank:gi:157325225;genbank:GeneID:5601681 Probab=99.40 E-value=2.1e-15 Score=101.06 Aligned_cols=127 Identities=13% Similarity=0.274 Sum_probs=106.2 Q ss_pred eeeeehhHHHHHHHHHH-HHHH-HHHHHHHHHHHHHHHHHHHHHHhC--CCCcchhhhcceecccccCCCceEEEEeeec Q lcl|NC_015262. 3 VEITTEGFDAVLSKIES-MGKS-GDKLLNEAVKAGGNVILQDALPRV--SKRSGKLKDGLKVSGVKKKGGTKYVLVGITK 78 (147) Q Consensus 3 ~~~~i~Gl~el~~~l~~-l~~~-~~~~~~~al~~~a~~v~~~ak~~a--p~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~ 78 (147) |+++++|++||+++|++ |++. +.++.++||.++++.+++..|.+. ..|||...+++..+.+...+|...+.|||.. T Consensus 1 msvevkGv~eilk~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~G~r~V~i~W~g 80 (133) T protein:vir:78 1 MSVEVTGVEELERQLVSLFGRENLPQLVDPALIAGATLVAKTLKSEFVQFKDTGASIDEINIEKPSYDKGVRSIKIDWKG 80 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHHhhcchhcccceeeeEEecCeeeeCCceEEEEEEec Confidence 67899999999999975 7764 789999999999999999999975 5799999999999999988899999999965 Q ss_pred cCCcccchhhhhhcccccccccccccccccccccCCCCCCCcc--hhhHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015262. 79 EDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPF--LAPAYESKKDEAKNVMKEILKRGL 145 (147) Q Consensus 79 ~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PF--l~pA~~~~~~~~~~~i~~~l~~~i 145 (147) .. +.+..-|+.|||+... +. ++.|+-| +..|+++.+..+.+.++++|++.| T Consensus 81 p~-~R~~iVHLNE~GYtr~----------Gk-----~i~PrG~G~i~~a~~~se~~y~~~vk~el~k~l 133 (133) T protein:vir:78 81 PK-DRYKIIHLNEYGYTRN----------GK-----KITPAGTGSVARSLRISERAYRAIVQKKIGDKL 133 (133) T ss_pred CC-CceeEEEeeccceecC----------CC-----eEccchhhHHHHHHHhhhHHHHHHHHHHHHhhC Confidence 33 3456689999996331 11 1223333 999999999999999999999999 No 92 >protein:vir:80116 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425608;genbank:gi:155042941;genbank:GeneID:5469542 Probab=99.38 E-value=2.6e-15 Score=100.54 Aligned_cols=122 Identities=13% Similarity=0.193 Sum_probs=92.7 Q ss_pred CceeeeehhH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHhCCCCcchhhhcceecccccCCCceEEEEe Q lcl|NC_015262. 1 MSVEITTEGF-DAVLSKIESMGKSGDKLLNEAVKAGGNVIL----QDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVG 75 (147) Q Consensus 1 M~~~~~i~Gl-~el~~~l~~l~~~~~~~~~~al~~~a~~v~----~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg 75 (147) |+ .|+|.+| +++.+.|+++.+++...+.+++.+.++.+. ++++...|++||.++.++..+..... ..| T Consensus 1 M~-~i~id~La~~I~~~L~~y~~~v~~~v~~~v~evak~a~~~lkk~i~~tsPkrTG~YaK~W~~k~~~~~-----~~v- 73 (127) T protein:vir:80 1 MA-NIKIDRLGDEITRQLKRYSQVIAGDLEQIMDDVSKEAVDRLKAKIEEEGLVQTGDYKRGWTRKRTPGG-----WVI- 73 (127) T ss_pred Cc-cccHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCccccccccccceeeeccCc-----eeE- Confidence 88 4999999 578899999999888888877755555555 55556899999999999976554321 122 Q ss_pred eeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_015262. 76 ITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGLG 146 (147) Q Consensus 76 ~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~ 146 (147) +++ ......|++|||+.+. +...++|+|||+|+.+...+++.+.|++.|+.+=+ T Consensus 74 ~nk---~~yqLtHLLE~GHAkr--------------~GGRV~a~pHI~paee~~~~~l~~~i~~~l~~~~~ 127 (127) T protein:vir:80 74 HNK---TEYRLAHLLEYGHATV--------------DGGRVPETPHIRPVEDWLEKEFEDRVERAIKNESR 127 (127) T ss_pred eec---CCcceeehhhcceecc--------------CCcccCCccchhhHHHHHHHHHHHHHHHHhcCCCC Confidence 221 1123489999999763 22358999999999999999999999999988888 No 93 >protein:vir:94419 Length: 133 # NCBI annotation: ORF028 # Family: family:all:589 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240010;genbank:gi:66395683;genbank:GeneID:5133079 Probab=99.34 E-value=5.5e-15 Score=98.77 Aligned_cols=125 Identities=19% Similarity=0.294 Sum_probs=102.1 Q ss_pred eeeeehhHHHHHHHHHH-HHHH-HHHHHHHHHHHHHHHHHHHHHHhCC--CCcchhhhcceecccccCCCc--eEEEEee Q lcl|NC_015262. 3 VEITTEGFDAVLSKIES-MGKS-GDKLLNEAVKAGGNVILQDALPRVS--KRSGKLKDGLKVSGVKKKGGT--KYVLVGI 76 (147) Q Consensus 3 ~~~~i~Gl~el~~~l~~-l~~~-~~~~~~~al~~~a~~v~~~ak~~ap--~~tG~l~~sI~~~~~~~~~~~--~~~~Vg~ 76 (147) |+++++|++||+++|++ |++. +.++.++||.++++.+++..|.+.. .|||...+++..+.+....+. ..+.||| T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~i~W 80 (133) T protein:vir:94 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEW 80 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceeEEEEe Confidence 67899999999999976 7764 6899999999999999999999987 799999999999988765554 7789999 Q ss_pred eccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcc--hhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_015262. 77 TKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPF--LAPAYESKKDEAKNVMKEILKR 143 (147) Q Consensus 77 ~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PF--l~pA~~~~~~~~~~~i~~~l~~ 143 (147) .... +.+..-|+.|||+.... . ++.|+-| +..|++..+..+.+.++++|++ T Consensus 81 ~gp~-~R~~iVHLNE~Gytr~G----------k-----~i~PrG~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:94 81 VGPM-NRKNIIHLNEHGYTRDG----------K-----KYTPRGFGVIAKTLAASERKYREIIKKELAR 133 (133) T ss_pred ecCC-CceeEEEeeccceecCC----------C-----eEccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 6533 34566899999963311 1 1233333 9999999999999999999999 No 94 >protein:vir:96973 Length: 133 # NCBI annotation: ORF034 # Family: family:all:589 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239864;genbank:gi:66395542;genbank:GeneID:5133006 Probab=99.34 E-value=5.5e-15 Score=98.77 Aligned_cols=125 Identities=19% Similarity=0.294 Sum_probs=102.1 Q ss_pred eeeeehhHHHHHHHHHH-HHHH-HHHHHHHHHHHHHHHHHHHHHHhCC--CCcchhhhcceecccccCCCc--eEEEEee Q lcl|NC_015262. 3 VEITTEGFDAVLSKIES-MGKS-GDKLLNEAVKAGGNVILQDALPRVS--KRSGKLKDGLKVSGVKKKGGT--KYVLVGI 76 (147) Q Consensus 3 ~~~~i~Gl~el~~~l~~-l~~~-~~~~~~~al~~~a~~v~~~ak~~ap--~~tG~l~~sI~~~~~~~~~~~--~~~~Vg~ 76 (147) |+++++|++||+++|++ |++. +.++.++||.++++.+++..|.+.. .|||...+++..+.+....+. ..+.||| T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~i~W 80 (133) T protein:vir:96 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEW 80 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceeEEEEe Confidence 67899999999999976 7764 6899999999999999999999987 799999999999988765554 7789999 Q ss_pred eccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcc--hhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_015262. 77 TKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPF--LAPAYESKKDEAKNVMKEILKR 143 (147) Q Consensus 77 ~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PF--l~pA~~~~~~~~~~~i~~~l~~ 143 (147) .... +.+..-|+.|||+.... . ++.|+-| +..|++..+..+.+.++++|++ T Consensus 81 ~gp~-~R~~iVHLNE~Gytr~G----------k-----~i~PrG~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:96 81 VGPM-NRKNIIHLNEHGYTRDG----------K-----KYTPRGFGVIAKTLAASERKYREIIKKELAR 133 (133) T ss_pred ecCC-CceeEEEeeccceecCC----------C-----eEccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 6533 34566899999963311 1 1233333 9999999999999999999999 No 95 >protein:vir:78644 Length: 133 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429946;genbank:gi:156604000;genbank:GeneID:5525390 Probab=99.34 E-value=5.5e-15 Score=98.77 Aligned_cols=125 Identities=19% Similarity=0.294 Sum_probs=102.1 Q ss_pred eeeeehhHHHHHHHHHH-HHHH-HHHHHHHHHHHHHHHHHHHHHHhCC--CCcchhhhcceecccccCCCc--eEEEEee Q lcl|NC_015262. 3 VEITTEGFDAVLSKIES-MGKS-GDKLLNEAVKAGGNVILQDALPRVS--KRSGKLKDGLKVSGVKKKGGT--KYVLVGI 76 (147) Q Consensus 3 ~~~~i~Gl~el~~~l~~-l~~~-~~~~~~~al~~~a~~v~~~ak~~ap--~~tG~l~~sI~~~~~~~~~~~--~~~~Vg~ 76 (147) |+++++|++||+++|++ |++. +.++.++||.++++.+++..|.+.. .|||...+++..+.+....+. ..+.||| T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~i~W 80 (133) T protein:vir:78 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEW 80 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceeEEEEe Confidence 67899999999999976 7764 6899999999999999999999987 799999999999988765554 7789999 Q ss_pred eccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcc--hhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_015262. 77 TKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPF--LAPAYESKKDEAKNVMKEILKR 143 (147) Q Consensus 77 ~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PF--l~pA~~~~~~~~~~~i~~~l~~ 143 (147) .... +.+..-|+.|||+.... . ++.|+-| +..|++..+..+.+.++++|++ T Consensus 81 ~gp~-~R~~iVHLNE~Gytr~G----------k-----~i~PrG~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:78 81 VGPM-NRKNIIHLNEHGYTRDG----------K-----KYTPRGFGVIAKTLAASERKYREIIKKELAR 133 (133) T ss_pred ecCC-CceeEEEeeccceecCC----------C-----eEccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 6533 34566899999963311 1 1233333 9999999999999999999999 No 96 >protein:vir:9363 Length: 133 # NCBI annotation: SLT orf 123-like protein # Family: family:all:589 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803341;genbank:gi:29028652;genbank:GeneID:1258087 Probab=99.34 E-value=5.5e-15 Score=98.77 Aligned_cols=125 Identities=19% Similarity=0.294 Sum_probs=102.1 Q ss_pred eeeeehhHHHHHHHHHH-HHHH-HHHHHHHHHHHHHHHHHHHHHHhCC--CCcchhhhcceecccccCCCc--eEEEEee Q lcl|NC_015262. 3 VEITTEGFDAVLSKIES-MGKS-GDKLLNEAVKAGGNVILQDALPRVS--KRSGKLKDGLKVSGVKKKGGT--KYVLVGI 76 (147) Q Consensus 3 ~~~~i~Gl~el~~~l~~-l~~~-~~~~~~~al~~~a~~v~~~ak~~ap--~~tG~l~~sI~~~~~~~~~~~--~~~~Vg~ 76 (147) |+++++|++||+++|++ |++. +.++.++||.++++.+++..|.+.. .|||...+++..+.+....+. ..+.||| T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~i~W 80 (133) T protein:vir:93 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEW 80 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceeEEEEe Confidence 67899999999999976 7764 6899999999999999999999987 799999999999988765554 7789999 Q ss_pred eccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcc--hhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_015262. 77 TKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPF--LAPAYESKKDEAKNVMKEILKR 143 (147) Q Consensus 77 ~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PF--l~pA~~~~~~~~~~~i~~~l~~ 143 (147) .... +.+..-|+.|||+.... . ++.|+-| +..|++..+..+.+.++++|++ T Consensus 81 ~gp~-~R~~iVHLNE~Gytr~G----------k-----~i~PrG~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:93 81 VGPM-NRKNIIHLNEHGYTRDG----------K-----KYTPRGFGVIAKTLAASERKYREIIKKELAR 133 (133) T ss_pred ecCC-CceeEEEeeccceecCC----------C-----eEccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 6533 34566899999963311 1 1233333 9999999999999999999999 No 97 >protein:vir:93898 Length: 133 # NCBI annotation: ORF028 # Family: family:all:589 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239942;genbank:gi:66395616;genbank:GeneID:5130964 Probab=99.34 E-value=7.7e-15 Score=97.96 Aligned_cols=125 Identities=19% Similarity=0.289 Sum_probs=101.5 Q ss_pred eeeeehhHHHHHHHHHH-HH-HHHHHHHHHHHHHHHHHHHHHHHHhCC--CCcchhhhcceecccccCCC--ceEEEEee Q lcl|NC_015262. 3 VEITTEGFDAVLSKIES-MG-KSGDKLLNEAVKAGGNVILQDALPRVS--KRSGKLKDGLKVSGVKKKGG--TKYVLVGI 76 (147) Q Consensus 3 ~~~~i~Gl~el~~~l~~-l~-~~~~~~~~~al~~~a~~v~~~ak~~ap--~~tG~l~~sI~~~~~~~~~~--~~~~~Vg~ 76 (147) |+++++|++||+++|++ |+ ..+.++.++||.++++.+++..|.+.. .|||...+++..+.+....+ ...+.||| T Consensus 1 msvevkGv~eilk~le~k~G~~~~~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~i~W 80 (133) T protein:vir:93 1 MSVEIKGIPEVLKKLESVYGKQSMQAKSDRALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEW 80 (133) T ss_pred CeEEEecHHHHHHHHHHhhCHhhhHhhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceEEEEEe Confidence 67899999999999975 55 568899999999999999999999987 79999999999998875555 47789999 Q ss_pred eccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcc--hhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_015262. 77 TKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPF--LAPAYESKKDEAKNVMKEILKR 143 (147) Q Consensus 77 ~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PF--l~pA~~~~~~~~~~~i~~~l~~ 143 (147) .... +.+..-|+.|||+.... . ++.|+-| +..|++..+..+.+.++++|++ T Consensus 81 ~gp~-~R~~iVHLNE~Gytr~G----------k-----~i~PrG~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:93 81 VGPM-NRKNIIHLNEHGYTRDG----------K-----KYTPRGFGVIAKTLAANERKYREIIKKELAR 133 (133) T ss_pred ecCC-CceeEEEeeccceecCC----------C-----eEccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 6533 34566899999963311 1 1233333 9999999999999999999999 No 98 >protein:vir:102338 Length: 116 # NCBI annotation: hypothetical protein # Family: family:all:26573 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529563;genbank:gi:90592648;genbank:GeneID:3974470 Probab=99.25 E-value=3.4e-14 Score=94.45 Aligned_cols=113 Identities=17% Similarity=0.165 Sum_probs=94.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCC---CcchhhhcceecccccCCCceEEEEeeeccCCcccchhhhhhccccccccc Q lcl|NC_015262. 24 GDKLLNEAVKAGGNVILQDALPRVSK---RSGKLKDGLKVSGVKKKGGTKYVLVGITKEDNSKIFYGKFLEFGASAHKIP 100 (147) Q Consensus 24 ~~~~~~~al~~~a~~v~~~ak~~ap~---~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~ 100 (147) +.+.+++++++.|..+...++.++|+ ++|+|++|+.+..+...++. | .++..||+||||||+..... T Consensus 1 l~~~~~~~~~~~a~~l~~~vk~rTPv~~~d~G~LR~sW~~g~v~k~~~~----v------~N~~eYA~~VE~GHRq~~g~ 70 (116) T protein:vir:10 1 MSKNLRRAKNNIGNKLLRKVKPKTPVAKIDGGTARKSWKYKELNLFDGV----V------SNNVEYIHHLEYGHRTRQGT 70 (116) T ss_pred CchHHHHHHHHHHHHHHHHHHhhCCCCcCCCcccccCceeeeeeccCce----e------ecCCcccccccCCceeeCCc Confidence 67788899999999999999999997 56999999998777665542 2 25689999999999876554 Q ss_pred ccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_015262. 101 IKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGLG 146 (147) Q Consensus 101 ~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~ 146 (147) +......++....++.+.+.||+.|.++.+..+.+.+++.|.+-|+ T Consensus 71 g~~~~~~gkrlk~~~V~G~fml~~s~~e~~~~~~~~~~~~~~~~l~ 116 (116) T protein:vir:10 71 GTSENYRPKPNGISFVPGVFMLARSVDEMSSIIDDELNQIIIDFWN 116 (116) T ss_pred ceecccccccccCCccCceehHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 4434445556667889999999999999999999999999999999 No 99 >protein:vir:96012 Length: 133 # NCBI annotation: ORF023 # Family: family:all:589 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239805;genbank:gi:66395471;genbank:GeneID:5132929 Probab=99.14 E-value=4.2e-13 Score=88.47 Aligned_cols=129 Identities=19% Similarity=0.233 Sum_probs=104.9 Q ss_pred CceeeeehhHHHHHHHHH-HHHH-HHHHHHHHHHHHHHHHHHHHHHHhCC--CCcchhhhcceecccccCCCceEEEEee Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIE-SMGK-SGDKLLNEAVKAGGNVILQDALPRVS--KRSGKLKDGLKVSGVKKKGGTKYVLVGI 76 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~-~l~~-~~~~~~~~al~~~a~~v~~~ak~~ap--~~tG~l~~sI~~~~~~~~~~~~~~~Vg~ 76 (147) |+ +|+|++||+++|+ .|++ .+.++.++||.++++.+.+..|.+.- .|||...+++..+.+....|...+.||| T Consensus 1 m~---evkGv~eilk~lE~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGatidev~~s~p~~~~g~rtV~i~W 77 (133) T protein:vir:96 1 MR---LIYDTKKLERELEKRLSKRALMRITDRALTEAGEVVLEAIRTNLKYFRDTGAEYGEVKLSKPTWENGKRTIRVYW 77 (133) T ss_pred Cc---cccCHHHHHHHHHHhcCHHHHHHHhhHHHHHHHHHHHHHHHHhhHHHhhccceeeeEEecCceecCCceEEEEEe Confidence 76 5799999999996 5765 57899999999999999999999875 5999999999999998888999999999 Q ss_pred eccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015262. 77 TKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGL 145 (147) Q Consensus 77 ~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i 145 (147) .... +.+...|+.|||+.... +..+++.|+- -+..|+++.++.+.+.++++|++.| T Consensus 78 ~gp~-~R~~iVHLNE~G~ytr~---------Gk~i~PrG~G---~I~~al~~se~~y~~~vk~el~kll 133 (133) T protein:vir:96 78 EGEK-HRYSIVHLNEKGFYAKD---------GKFIRPKGMG---AIDKALRASRDKFFKVYAEEVSKLL 133 (133) T ss_pred ecCC-CceeeEeeecccceecC---------Cceeccchhh---HHHHHHHhhhHHHHHHHHHHHHHhC Confidence 6543 34566899999965321 1111222222 3999999999999999999999999 No 100 >protein:vir:6246 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813700;swissprot:trembl:q859b7;genbank:gi:29366760;uniprot:Q859B7;genbank:GeneID:1258903 Probab=99.11 E-value=3.1e-13 Score=89.13 Aligned_cols=123 Identities=26% Similarity=0.405 Sum_probs=98.2 Q ss_pred Cc----eeeeehhHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhCCC-----------Ccchhhhcceecccc Q lcl|NC_015262. 1 MS----VEITTEGFDAVLSKIESM-GKSGDKLLNEAVKAGGNVILQDALPRVSK-----------RSGKLKDGLKVSGVK 64 (147) Q Consensus 1 M~----~~~~i~Gl~el~~~l~~l-~~~~~~~~~~al~~~a~~v~~~ak~~ap~-----------~tG~l~~sI~~~~~~ 64 (147) |+ ..|+|+|+.++.+.|..+ +.++.+.++.+.+.+|+++...+++.+|+ +||.|..||++..+. T Consensus 1 ma~~~~~~vrV~Glr~f~~~mrK~~g~dl~k~lk~a~~~aa~v~~~~ar~~tP~g~r~~~~s~~~r~G~L~~Sir~aaT~ 80 (143) T protein:vir:62 1 MAQRSAYTIRVDGLREFQRNVRTLRDKELNKAVREANKASGEVLIPQAKHESPDGKRDAKSSKKYRPGKLDKSIKVTASA 80 (143) T ss_pred CCcccchheehHHHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccCcchhhccccccccc Confidence 76 679999999999999999 88999999999999999999999999998 699999999876544 Q ss_pred cCCCceEEEEeeeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015262. 65 KKGGTKYVLVGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRG 144 (147) Q Consensus 65 ~~~~~~~~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~ 144 (147) . ...+..| ..+..+|+.|++||+..+. +.|+-|++.++-+.++++.+.-++.|.+- T Consensus 81 r---aa~VrAG----~~krVPYA~~I~~G~r~r~-----------------Isp~rFl~~a~a~te~~~~r~Ye~~i~~v 136 (143) T protein:vir:62 81 K---GAVIKAG----SASRVPYAAAIHFGYRARN-----------------ISPNRFLFRAMARKSDVVAATYERRIAAV 136 (143) T ss_pred c---ceeeeeC----CcCCCCcccccccCccccc-----------------ccchhhhhhhhhccCHHHHHHHHHHHHHH Confidence 3 2223333 2246799999999987543 56888999999888887776655555555 Q ss_pred hcC Q lcl|NC_015262. 145 LGL 147 (147) Q Consensus 145 i~~ 147 (147) |+- T Consensus 137 l~k 139 (143) T protein:vir:62 137 VEK 139 (143) T ss_pred HHH Confidence 544 No 101 >protein:vir:6216 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:10886 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852596;genbank:gi:31415856;genbank:GeneID:1489214 Probab=99.05 E-value=1.2e-12 Score=86.01 Aligned_cols=121 Identities=20% Similarity=0.348 Sum_probs=97.6 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC----CcchhhhcceecccccCCCceEEEEee Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSK----RSGKLKDGLKVSGVKKKGGTKYVLVGI 76 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~----~tG~l~~sI~~~~~~~~~~~~~~~Vg~ 76 (147) |++ .=.|+.|.++.|..|.+--+++..+.|.++|+...+..+.+.|. ..|||+|+|++-.. +.. +.|-. T Consensus 1 m~s--NNNGFae~~~~~~tl~kVd~kvs~e~L~eAA~~f~~KL~P~Ip~Sl~kkk~HlrD~lkVvvk---~d~--V~V~F 73 (125) T protein:vir:62 1 MAS--NNNGFAEALEDINTLLRVNKKVSLDALDEAAKYFASKLKPKINVSNKNKRTHLRDSLKVVVK---DDR--VSVEF 73 (125) T ss_pred CCC--CchhHHHHHHHhhhhhhhhhhhhHHHHHHHHHHHHHhhccccChhhhhhhhhcceeeeEEee---CCe--EEEEE Confidence 665 45699999999999877667888999999999999999999986 46899999975322 222 22222 Q ss_pred eccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015262. 77 TKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGL 145 (147) Q Consensus 77 ~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i 145 (147) ...+|||+|+|.||.+....++ ..||.|....|+++++.|.+.|.+.|-.++ T Consensus 74 ----ed~a~yW~f~EnGt~~~~~~g~-------------vkaqhf~~~Tf~~nk~kI~~iM~kki~d~m 125 (125) T protein:vir:62 74 ----KDEAWYWYLVEHGHKKAKGKGR-------------VKGKHFVQNTFDAEGDKIADIMAQKIINRM 125 (125) T ss_pred ----cchhhhhhhhhccccccccccc-------------cchhhhhhccHHhhHHHHHHHHHHHHHhhC Confidence 2468999999999987532222 789999999999999999999999999999 No 102 >protein:vir:1332 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047931;swissprot:trembl:q9zxa7;genbank:gi:9631149;uniprot:Q9ZXA7;genbank:GeneID:2715891 Probab=99.05 E-value=8.1e-13 Score=86.87 Aligned_cols=123 Identities=23% Similarity=0.359 Sum_probs=97.8 Q ss_pred Cc----eeeeehhHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhCCCC-----------cchhhhcceecccc Q lcl|NC_015262. 1 MS----VEITTEGFDAVLSKIESM-GKSGDKLLNEAVKAGGNVILQDALPRVSKR-----------SGKLKDGLKVSGVK 64 (147) Q Consensus 1 M~----~~~~i~Gl~el~~~l~~l-~~~~~~~~~~al~~~a~~v~~~ak~~ap~~-----------tG~l~~sI~~~~~~ 64 (147) |+ ..|+|+|+..+.+.|..+ +.++.+.++.+.+.+|+++...+++.+|+. +|.|..||++..+. T Consensus 1 ma~~~~~~vkV~Glr~f~~~mrK~~g~dl~k~lk~a~~~aa~v~~~~ar~~tP~g~~~p~~srr~r~G~L~~Sir~aaT~ 80 (143) T protein:vir:13 1 MAQRSAYTIQVDGLRQFQRNVRALRDKELNKAVREANKASGEVLIPQAKHESPDGHRDPKSSKRYRPGKLDKSIKVTASA 80 (143) T ss_pred CCcccchheehHHHHHHHHHHHHhhCCcchHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccchhhccccccccc Confidence 76 679999999999999999 889999999999999999999999999975 89999999876554 Q ss_pred cCCCceEEEEeeeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015262. 65 KKGGTKYVLVGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRG 144 (147) Q Consensus 65 ~~~~~~~~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~ 144 (147) +. ..+..| .....+|+.|++||+..+. +.++-|++.++-+.+++..+.-++.|.+- T Consensus 81 ra---a~VrAG----r~arVPYA~~I~~G~r~r~-----------------Is~~rFl~~a~a~te~~~~r~Ye~~i~~v 136 (143) T protein:vir:13 81 KG---AVIKAG----SAARVPYAAAIHFGYRKRN-----------------ISANRFLYRAMARKSDVVAATYERRIAAV 136 (143) T ss_pred cc---eeeeec----CcCCCCcccccccCCcccc-----------------cchhhhhhhhhhccCHHHHHHHHHHHHHH Confidence 32 223333 2334799999999987643 56888999999888887776655555555 Q ss_pred hcC Q lcl|NC_015262. 145 LGL 147 (147) Q Consensus 145 i~~ 147 (147) |+- T Consensus 137 l~k 139 (143) T protein:vir:13 137 VEK 139 (143) T ss_pred HHH Confidence 544 No 103 >protein:vir:79638 Length: 146 # NCBI annotation: gp40 # Family: family:all:448 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285529;genbank:gi:148734512;genbank:GeneID:5219996 Probab=98.82 E-value=6.6e-11 Score=76.41 Aligned_cols=123 Identities=13% Similarity=0.142 Sum_probs=90.6 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceeccccc---------CCCceE Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKK---------KGGTKY 71 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~---------~~~~~~ 71 (147) |+- .++.++.+.+....+.++..+..++++.+..+..++..+.|+|||.++.|+.++.... .+|... T Consensus 1 ma~----~~~~sFa~~i~~~~~~ve~~~~~~~r~~a~~i~~~vv~~sPVDTGr~Ranw~vs~~~~~~~~~~~~dp~G~~t 76 (146) T protein:vir:79 1 MAD----YSIREFHGNVDKWIEQVESGLNDVIQIFGEKVHGALVDIAPVDTGRFKANMQITANKPPLYALNQYDPDGEKI 76 (146) T ss_pred CCc----chhHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhhccccceeecCcccccccCCCCCCccc Confidence 662 3455678888888888888899999999999999999999999999999986642111 112211 Q ss_pred EE--------------EeeeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHH Q lcl|NC_015262. 72 VL--------------VGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVM 137 (147) Q Consensus 72 ~~--------------Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i 137 (147) .. .|-.-....+.+|+.++|||++. |.|..|.+.++.+-.. +.+.. T Consensus 77 ~~~~~~~i~~~~~g~~~~~~iyi~NnlpYA~~LEyG~S~-------------------QAP~G~v~~~~~~~~~-~v~~a 136 (146) T protein:vir:79 77 KAEGRRTLYALLHGGGAIKSIYFSNMLIYANALEYGHSK-------------------QAPAGVFGIVAIRLRS-YMAEA 136 (146) T ss_pred HHHHHHHHHHHHhcccccceeEEeeCchhhhhhhccccC-------------------CCcchHHHHHHHHHHH-HHHHH Confidence 10 01111223568999999999986 8999999999987754 55555 Q ss_pred HHHHHHHhcC Q lcl|NC_015262. 138 KEILKRGLGL 147 (147) Q Consensus 138 ~~~l~~~i~~ 147 (147) ..++++.+.| T Consensus 137 ~~e~k~~~~l 146 (146) T protein:vir:79 137 IREARKKNAL 146 (146) T ss_pred HHHHHhhccC Confidence 5679999999 No 104 >protein:vir:104347 Length: 145 # NCBI annotation: conserved phage-related protein # Family: family:all:448 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398975;genbank:gi:81343959;genbank:GeneID:3778879 Probab=98.80 E-value=3.5e-11 Score=77.93 Aligned_cols=123 Identities=15% Similarity=0.102 Sum_probs=83.2 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccc---------cCCCceE Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVK---------KKGGTKY 71 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~---------~~~~~~~ 71 (147) |+-.+ -++-.+...+..+.+.++..+...+++.+..+..++..+.|+|||.+|.|+.++... ..+|..+ T Consensus 1 ~~~~m--~~~~sF~~~i~~~~~~ve~~~~~v~r~~a~~i~~~vv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~G~~t 78 (145) T protein:vir:10 1 MARNI--GSVVTFEKSIADWIDRAEDGFGIVVSNTVIKTANAIVDLSPVDTGRFKANWQISANSPAQQSLNEYDQTGGQT 78 (145) T ss_pred CCCcc--cchhccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccceeecccccccccccCCCCccc Confidence 32110 012223445555666666677778999999999999999999999999998664211 1122211 Q ss_pred E-------------EEeeeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHH Q lcl|NC_015262. 72 V-------------LVGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMK 138 (147) Q Consensus 72 ~-------------~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~ 138 (147) . .+|-.-....+.+|+.++||||+. |+|..|.+.++..- .++.+... T Consensus 79 ~~~~~~~~~~i~~~k~g~~iyi~Nn~pYA~~LEyG~S~-------------------QAP~G~v~~~~~~~-~~~v~~~~ 138 (145) T protein:vir:10 79 KTYLARQARAVANSKATSVIYITNRLDYAADLEYGASN-------------------QAPAGVLGVVQARL-GRYFQEAV 138 (145) T ss_pred hhhHHHHHHHhhcccccceEEEeeCchhhhHhhccccC-------------------CCcchHHHHHHHHH-HHHHHHHH Confidence 1 112222223568999999999986 89999999999988 45555556 Q ss_pred HHHHHHh Q lcl|NC_015262. 139 EILKRGL 145 (147) Q Consensus 139 ~~l~~~i 145 (147) ++++++| T Consensus 139 ~e~k~~~ 145 (145) T protein:vir:10 139 EEARRAI 145 (145) T ss_pred HHhhccC Confidence 7899999 No 105 >protein:vir:1988 Length: 156 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050635;genbank:gi:9633522;genbank:GeneID:2636282 Probab=98.70 E-value=3.3e-10 Score=72.53 Aligned_cols=122 Identities=14% Similarity=0.317 Sum_probs=84.2 Q ss_pred Cceeeeeh-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-----C-------------------------- Q lcl|NC_015262. 1 MSVEITTE-GFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRV-----S-------------------------- 48 (147) Q Consensus 1 M~~~~~i~-Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~a-----p-------------------------- 48 (147) |++.++|+ .+++|.+.|.+|....+. +..++..++.++...+.+. | T Consensus 1 ms~~i~~~~d~~~l~~~L~~l~~~~~~--~~l~~~Ig~~l~~~~~~rf~~~~~Pd~G~~W~pls~~t~~~r~~~~~~~~~ 78 (156) T protein:vir:19 1 MSLDMNVAVDVRRIQLALDELGTVTRD--RAIPRVMAAALLSSTEQAFERQADPDTGKGWEAWSDSWLAWRQDHGFVPGS 78 (156) T ss_pred CeEEEEEeecHHHHHHHHHHHHhhhcc--HHHHHHHHHHHHHHHHHHHHhcCCCCCCCCCcccChHHHHHhhccCCCCCc Confidence 99999988 678899999888654332 2445555555555554332 2 Q ss_pred --CCcchhhhcceecccccCCCceEEEEeeeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHH Q lcl|NC_015262. 49 --KRSGKLKDGLKVSGVKKKGGTKYVLVGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAY 126 (147) Q Consensus 49 --~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~ 126 (147) .+||+|++||.... +...+.||. +..||.+.+||+..... .....+||+|||- -= T Consensus 79 ~L~~tg~L~~Si~~~~-----~~~~v~vGt------~~~yA~vHqfG~~~~~~-----------~~~~~iPaRpfLG-~s 135 (156) T protein:vir:19 79 ILTLHGDLARSITTDY-----GQDYALIGS------PKIYAAIHQWGGTPDMA-----------PRPAGVPARPYMG-LD 135 (156) T ss_pred chhhhHHHHHHhhhee-----cCCEEEEec------chhhhHHhhcCcccccC-----------CCccccCCccccC-CC Confidence 24689999996432 233566664 46899999999764321 1234699999995 44 Q ss_pred HHHHHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 127 ESKKDEAKNVMKEILKRGLGL 147 (147) Q Consensus 127 ~~~~~~~~~~i~~~l~~~i~~ 147 (147) +..++++.+.+.+.|.+.++= T Consensus 136 ~~d~~~I~~~i~~~l~~~~~~ 156 (156) T protein:vir:19 136 KTGEQEIFDAIRKRVSAALRQ 156 (156) T ss_pred HHHHHHHHHHHHHHHHHHhhC Confidence 677888888888888888888 No 106 >protein:vir:7412 Length: 168 # NCBI annotation: hypothetical protein # Family: family:all:1029 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839929;genbank:gi:30089899;genbank:GeneID:1260686 Probab=98.62 E-value=4.4e-10 Score=71.89 Aligned_cols=144 Identities=16% Similarity=0.233 Sum_probs=99.8 Q ss_pred CceeeeehhHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhCC------CCcc---hhhhcceecccccCC-Cc Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSG-DKLLNEAVKAGGNVILQDALPRVS------KRSG---KLKDGLKVSGVKKKG-GT 69 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~-~~~~~~al~~~a~~v~~~ak~~ap------~~tG---~l~~sI~~~~~~~~~-~~ 69 (147) |. +|+ ..|++++.++++|..++ .+.-.++..+||+++++.....+| +.|| ||+|||..+...-.+ .. T Consensus 1 M~-~~~-~~l~~~~~~vekl~~~lt~eqkakITkAGAkv~~~~L~~~t~~kHy~~k~t~~~~HLaDsI~~~~~niDg~~d 78 (168) T protein:vir:74 1 MA-TFE-EAMQLIINQAESLSTKMTVEDKAEVTKAGAKVFEQALAYEVRNRHYRHRDTGEDPHLADSIVMKNKNIDGVKD 78 (168) T ss_pred Cc-cHH-HHHHHHHHHHHhhccCCCHHHHHHHHHhhhHHHHHHHHHHhHHhhcccCCCcccchhhhheeecccccCcccC Confidence 55 333 44677777777766443 233456778888888888777665 2344 999999876553322 24 Q ss_pred eEEEEeeeccCC----cccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHH--HHHHHHHHHHHHHHH Q lcl|NC_015262. 70 KYVLVGITKEDN----SKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYES--KKDEAKNVMKEILKR 143 (147) Q Consensus 70 ~~~~Vg~~~~~~----~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~--~~~~~~~~i~~~l~~ 143 (147) ...+|||..... ..++.|+|+.-||+.++.... ........+.+.|++.+|+..+-+. .++.+.++..+++++ T Consensus 79 G~s~VGf~~k~~~~~~~kA~iAr~lNDGTk~~~~~~~-~~~~~~~~g~v~i~gDHFvd~~r~~~~~k~~V~~Ae~~~y~e 157 (168) T protein:vir:74 79 GQSVVGWERSTEKGTHTKGYIANIINNGSRFPQFTTR-SGRKYKKPGEVAVHADHFIEETRMNLIVQQGILKAEAEAMRK 157 (168) T ss_pred Cceeecccccccccccchhhhhhhhcccccccccccc-cccccccccccccccchhHHHHHhhhhhHHHHHHHHHHHHHH Confidence 567899987542 367889999999975433222 2223344456679999999999998 679999999999888 Q ss_pred HhcC Q lcl|NC_015262. 144 GLGL 147 (147) Q Consensus 144 ~i~~ 147 (147) -|+= T Consensus 158 Il~~ 161 (168) T protein:vir:74 158 IINR 161 (168) T ss_pred HHHh Confidence 8876 No 107 >protein:vir:3163 Length: 145 # NCBI annotation: unknown # Family: family:all:28417 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665934;genbank:gi:22091120;genbank:GeneID:951270 Probab=98.61 E-value=5.6e-10 Score=71.32 Aligned_cols=114 Identities=15% Similarity=0.328 Sum_probs=71.4 Q ss_pred eehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-----C----------------------CCcchhhhcc Q lcl|NC_015262. 6 TTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRV-----S----------------------KRSGKLKDGL 58 (147) Q Consensus 6 ~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~a-----p----------------------~~tG~l~~sI 58 (147) =|+-.+++.+.|++|...+.. ++...+..+.+++..+. | .+||.|++|| T Consensus 1 ~i~~~~~i~~~l~~l~~~~~~----~l~~i~~~~~~~~~~rf~~~~~p~G~~W~pLs~st~a~k~~~~~L~~tG~L~~Si 76 (145) T protein:vir:31 1 MVEDENNIPEAREAIQDGLTD----GLERLHTITLRELITNMSDGQDALGNPWEPLKESTIRAKGSDTPLIDNSRLLTDI 76 (145) T ss_pred CcccHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHhcCCCCCCCCcccChHHHHHhcCCCCCccCHHHHHHH Confidence 244445566666666554433 34444444444433221 1 3689999999 Q ss_pred eecccccCCCceEEEEeeeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHH Q lcl|NC_015262. 59 KVSGVKKKGGTKYVLVGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMK 138 (147) Q Consensus 59 ~~~~~~~~~~~~~~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~ 138 (147) ..+..... ....+.|| ++..|+.+.+||+.+. .+||+|||-++.+...+++.+.+. T Consensus 77 ~~~~~~~~-~~~~a~vG------tn~~YA~~hqfG~~~~-----------------~IPaRPfLG~~~~~~~~~~~~ii~ 132 (145) T protein:vir:31 77 NAASMMDR-ANRMAVIG------TNLDYAEHHEFGAPEA-----------------GIPARPIFGPAGAYASQQAPDVIG 132 (145) T ss_pred HHHhhhcc-cCceeEec------CCchhhhhhccCCccc-----------------ccCCCCccCCCccchHHHHHHHHH Confidence 75433222 23345565 4568999999998642 389999999998777777777777 Q ss_pred HHHHHHhcC Q lcl|NC_015262. 139 EILKRGLGL 147 (147) Q Consensus 139 ~~l~~~i~~ 147 (147) +.+.+.|.= T Consensus 133 ~~i~~~L~~ 141 (145) T protein:vir:31 133 DEIDTNLEG 141 (145) T ss_pred HHHHHHhhh Confidence 777766544 No 108 >protein:vir:107703 Length: 147 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003902;genbank:gi:45686318;genbank:GeneID:2773043 Probab=98.60 E-value=1.3e-09 Score=69.38 Aligned_cols=124 Identities=14% Similarity=0.164 Sum_probs=86.0 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccc---------cCCCceE Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVK---------KKGGTKY 71 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~---------~~~~~~~ 71 (147) |+ +. .+.++...+....+.++..+...+++.+..+..++..+.|+|||.+|.|+.++... ...|... T Consensus 1 ma-~~---~~~~F~~~i~~~~~~ve~~~~~~~r~~a~~i~~~vv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~dp~g~~t 76 (147) T protein:vir:10 1 MA-NY---QIRRFQGEIDAWINAAESTLEHAIEIFVRDVHDALVSRSPVDTGRFKGNWQITFNEIPNHALNRYDKTGGVV 76 (147) T ss_pred CC-Cc---chhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhhccccceeecCccccccCCcCCCccch Confidence 66 33 34467778888888888889999999999999999999999999999998654111 1111111 Q ss_pred EEE--------------eeeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHH Q lcl|NC_015262. 72 VLV--------------GITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVM 137 (147) Q Consensus 72 ~~V--------------g~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i 137 (147) ... |..-....+.+|+.++|||++. |+|..|.+.++..-..-+.+++ T Consensus 77 ~a~~~~~~~~~~~~~~~~~~iyi~Nn~pYA~~LEyG~S~-------------------QAP~G~V~~t~q~~~~~v~~~~ 137 (147) T protein:vir:10 77 RGEEQAKTYGMFSRGGAITSVHFSNMLIYANALEYGHSQ-------------------QAPSGVVGLVALRLRSYMADAI 137 (147) T ss_pred hhhhhHHHHHHhhhccCcceEEEeeCcchhhhhhccccC-------------------CCCchHHHHHHHHHHHHHHHHH Confidence 110 1111223568999999999986 8999999999988766555555 Q ss_pred HHHHHHHhcC Q lcl|NC_015262. 138 KEILKRGLGL 147 (147) Q Consensus 138 ~~~l~~~i~~ 147 (147) .+.=+..=-| T Consensus 138 ~e~k~~~~~~ 147 (147) T protein:vir:10 138 KQARRQQNAL 147 (147) T ss_pred HHHHhhhccC Confidence 5433323334 No 109 >protein:vir:103280 Length: 142 # NCBI annotation: phage-related hypothetical protein # Family: family:all:448 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277459;genbank:gi:71834102;genbank:GeneID:3562391 Probab=98.60 E-value=4.8e-10 Score=71.66 Aligned_cols=120 Identities=15% Similarity=0.105 Sum_probs=84.1 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceeccccc---------CCCceE Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKK---------KGGTKY 71 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~---------~~~~~~ 71 (147) |.- ++ -.+...+....++++......+++.+..+.+++..+.|+|||.++.|+.++.... .+|... T Consensus 1 Ma~--~~---~sf~~~i~~~~~~ve~~~~~v~r~~a~~i~~~vv~~sPVdTGr~R~nw~vs~~~~~~~~~~~~d~~G~~t 75 (142) T protein:vir:10 1 MAN--DV---VSFRNSINAWIDGVTEGVELIVEGTLTKATKDIVKLSPVDTGRFRGNWQATGNSPAAQSLNNYDPDGNET 75 (142) T ss_pred Ccc--ch---hhhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCcccchhhcccceeeecCcccccccCcCCCCccc Confidence 663 22 2355566677777777888889999999999999999999999999987642111 112211 Q ss_pred EE-------------EeeeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHH Q lcl|NC_015262. 72 VL-------------VGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMK 138 (147) Q Consensus 72 ~~-------------Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~ 138 (147) .. .|..-....+.+|+.++|||++. |.|..|.+.++.+-.. +.+... T Consensus 76 ~~~~~~~~~~i~~~~~g~~iyi~Nn~pYA~~LEyG~S~-------------------QAP~G~v~~a~q~~~~-~v~~a~ 135 (142) T protein:vir:10 76 RNSLRRQIYALARDANTNVIYISNRLDYAQGLEFGSSN-------------------QAPSGVLGVVQKRLGR-YFAEAV 135 (142) T ss_pred hhhHHHHHHHhhhccccceEEEeeCcchhhhhhccccC-------------------CCcchHHHHHHHHHHH-HHHHHH Confidence 11 11112223568999999999985 8999999999988755 445555 Q ss_pred HHHHHHh Q lcl|NC_015262. 139 EILKRGL 145 (147) Q Consensus 139 ~~l~~~i 145 (147) +++++.| T Consensus 136 ~e~~~~~ 142 (142) T protein:vir:10 136 QEAKRAL 142 (142) T ss_pred HHhhccC Confidence 6688888 No 110 >protein:vir:2688 Length: 123 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075507;genbank:gi:12719436;genbank:GeneID:920156 Probab=98.59 E-value=3.8e-10 Score=72.20 Aligned_cols=117 Identities=19% Similarity=0.256 Sum_probs=90.6 Q ss_pred HHHHHH-HHHH-HHHHHHHHHHHHHHHHHHHHHHHhCC--CCcchhhhcceecccccCCCc--eEEEEeeeccCCcccch Q lcl|NC_015262. 13 VLSKIE-SMGK-SGDKLLNEAVKAGGNVILQDALPRVS--KRSGKLKDGLKVSGVKKKGGT--KYVLVGITKEDNSKIFY 86 (147) Q Consensus 13 l~~~l~-~l~~-~~~~~~~~al~~~a~~v~~~ak~~ap--~~tG~l~~sI~~~~~~~~~~~--~~~~Vg~~~~~~~~~~y 86 (147) |+++|+ .|++ .+.++.++||.++++.+.+..|.++- .|||..-+++..+.+....+. ..+.|||.... +.+.. T Consensus 1 ilk~lE~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGatidev~~s~p~~~~g~~~rtV~i~W~gp~-~R~~i 79 (123) T protein:vir:26 1 MLKKLESVYGKQSMQAKSDRALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEWVGPM-NRKNI 79 (123) T ss_pred ChhhHHHhcCHHHHHHhhhHHHHHHHHHHHHHHHHhhHHhhhccceeeeEEecCeeeccCCccceEEEEeecCC-Cceee Confidence 888885 5665 57899999999999999999999875 699999999999888655554 78999996533 34566 Q ss_pred hhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_015262. 87 GKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKR 143 (147) Q Consensus 87 ~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~ 143 (147) .|+.|||+... +..+++.|+- -+..|++..+..+.+.++++|++ T Consensus 80 VHLNE~GYtr~----------Gk~i~PRG~G---~i~~a~~~se~~y~~~vk~eL~k 123 (123) T protein:vir:26 80 IHLNEHGYTRD----------GKKYTPRGFG---VIAKTLAANERKYREIIKKELAR 123 (123) T ss_pred EeeeccceecC----------CCeEccchhh---HHHHHHHhhhHHHHHHHHHHhcC Confidence 89999996331 1111112222 29999999999999999999999 No 111 >protein:vir:79091 Length: 175 # NCBI annotation: gp5, phage virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111205;genbank:gi:134288802;genbank:GeneID:4960765 Probab=98.56 E-value=9.7e-10 Score=70.00 Aligned_cols=122 Identities=15% Similarity=0.259 Sum_probs=80.5 Q ss_pred Cc--eeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-----C------------------------- Q lcl|NC_015262. 1 MS--VEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRV-----S------------------------- 48 (147) Q Consensus 1 M~--~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~a-----p------------------------- 48 (147) || ++|+|.+ +++.+.|++|...+.. .+..++..++.++...+.+. | T Consensus 1 Ms~~i~i~~d~-~~~~~~L~~l~~~~~d-~~~lm~~Ig~~l~~~t~~rF~~~~~PdW~pls~~t~~~r~~~~~~~~~~~~ 78 (175) T protein:vir:79 1 MSDFVNFQIDD-SALRTRLLQLEQAGHQ-KADAMRKITQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKNGE 78 (175) T ss_pred CceEEEEEech-HHHHHHHHHHHHHhcC-HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCChHHHHhhcccccccccccc Confidence 88 5666665 7899999888776542 44566666666666654431 2 Q ss_pred ---------------CCcchhhhcceecccccCCCceEEEEeeeccCCcccchhhhhhcccccccccccccccccccccC Q lcl|NC_015262. 49 ---------------KRSGKLKDGLKVSGVKKKGGTKYVLVGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINH 113 (147) Q Consensus 49 ---------------~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~ 113 (147) .+||+|++||.... +...+.||. +..||.+..||+.... ... T Consensus 79 ~~~~~~~~~~~~~~L~~tG~L~~Si~~~~-----~~~~v~vGt------n~~YAaiHqfGg~~~~------------~~~ 135 (175) T protein:vir:79 79 LTAAASRRKAGLMILQDSGQMAASTATDS-----GEDYSVIGS------NKEYAAIQHFGGQAGR------------GLK 135 (175) T ss_pred chhhHhhhccCCCcceechhhhhhhhhee-----cCCEEEEec------CcchhhHhhcccccCC------------Ccc Confidence 24889999997543 223566764 4689999999975311 123 Q ss_pred CCCCCCcchhhHHHH-----HHHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 114 PGVSPKPFLAPAYES-----KKDEAKNVMKEILKRGLGL 147 (147) Q Consensus 114 ~~~~a~PFl~pA~~~-----~~~~~~~~i~~~l~~~i~~ 147 (147) ..+||+|||--+-+. ..+.|.+.+.+.|.++|.= T Consensus 136 v~IPARPfLG~s~~de~~~~~~~~I~~~i~~~l~~a~~~ 174 (175) T protein:vir:79 136 VTIPGRAWLPVTADGELQPEAVEPVLNTILRHLMDAANR 174 (175) T ss_pred cccCcccccCCCcccchhHHHHHHHHHHHHHHHHHHhcc Confidence 469999999854432 2456777777777777766 No 112 >protein:vir:103841 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938236;genbank:gi:38229141;genbank:GeneID:2648156 Probab=98.56 E-value=1.2e-09 Score=69.56 Aligned_cols=123 Identities=13% Similarity=0.186 Sum_probs=81.8 Q ss_pred Cceeeeehh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-----C-----------------------CCc Q lcl|NC_015262. 1 MSVEITTEG-FDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRV-----S-----------------------KRS 51 (147) Q Consensus 1 M~~~~~i~G-l~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~a-----p-----------------------~~t 51 (147) |+..|+|+. .++|.+.|++|...+.+ ....++..++.++...+.+. | .+| T Consensus 1 Ms~~i~i~~~~~~~~~~L~~l~~~~~~-~~~l~~~ig~~l~~~~~~rF~p~G~~W~plsp~t~~~r~k~g~~~~~~L~~t 79 (155) T protein:vir:10 1 MANRIELELVDREVQERLAALYAAVTD-TLPLMRGIAAELLAETEFAFMDEGPGWPQLSPVTVAARAAKGRGAHPILQVT 79 (155) T ss_pred CCceEEEEechHHHHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHHHHHhhcCCCCCCCCccchHHHHhccCCCCCccccc Confidence 997777775 35688888888776543 45666666666666655443 1 258 Q ss_pred chhhhcceecccccCCCceEEEEeeeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhh-HHHHHH Q lcl|NC_015262. 52 GKLKDGLKVSGVKKKGGTKYVLVGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAP-AYESKK 130 (147) Q Consensus 52 G~l~~sI~~~~~~~~~~~~~~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~p-A~~~~~ 130 (147) |+|++||.... +...+.||. +..||.+.+||+.... .+...+||+|||-- .-++-. T Consensus 80 G~L~~Si~~~~-----~~~~v~vGt------n~~YA~iHqfGg~~~~------------~~~~~iPARPfLG~s~~~e~~ 136 (155) T protein:vir:10 80 NALARSITTRA-----DRDQAQIGS------NLSYAAIQQLGGQAGR------------GRKVTIPARPYLPVLRNGQLK 136 (155) T ss_pred hhhhhhhhcee-----cCCEEEEec------CcchhhhhhcccccCC------------CCccccCCccccCCCccccch Confidence 89999997442 223466663 5689999999975321 12346999999973 334445 Q ss_pred HHHHHHHHHHHHHHhcC Q lcl|NC_015262. 131 DEAKNVMKEILKRGLGL 147 (147) Q Consensus 131 ~~~~~~i~~~l~~~i~~ 147 (147) +++.+.|.+.+.+.|.- T Consensus 137 ~ei~~~I~~~i~~~l~~ 153 (155) T protein:vir:10 137 PSARDAVLDVLLAALSQ 153 (155) T ss_pred HHHHHHHHHHHHHHHhh Confidence 67777777777777766 No 113 >protein:vir:94994 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224022;genbank:gi:62327309;genbank:GeneID:5176822 Probab=98.47 E-value=1.1e-09 Score=69.65 Aligned_cols=112 Identities=15% Similarity=0.178 Sum_probs=73.1 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccc---------cCCCce- Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVK---------KKGGTK- 70 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~---------~~~~~~- 70 (147) |++..++. + ..++++..+..++++.+..+..++....|+|||.++.|+.++... ..+|.. T Consensus 1 msF~~~i~---~-------~~~~ve~~~~~~~r~~a~~~~~~iv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~g~~t 70 (131) T protein:vir:94 1 MSFALDVT---R-------FVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGSTPADGTTDATDKSGNTA 70 (131) T ss_pred CCcccCHH---H-------HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCchhhhhccchhccccccccccCCCCCCchhh Confidence 77654433 3 334555566778888888888889999999999999998654211 111110 Q ss_pred ---------EEEEeeeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHH Q lcl|NC_015262. 71 ---------YVLVGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEIL 141 (147) Q Consensus 71 ---------~~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l 141 (147) ....|-.-....+.+|+.++|||++. |+|..|.+.++..-...+.++ .+++ T Consensus 71 ~~~~~~~i~~~~~g~~iyi~Nn~pYA~~LEyG~S~-------------------QAP~g~v~~~~~~~~~~v~~~-~~e~ 130 (131) T protein:vir:94 71 TGNATSFVLNAADWHTFTLTNNLPYAQRLEYGWSQ-------------------QAPQGFVRVNVSRFQQLLNEE-ASKV 130 (131) T ss_pred HHHHHHHHhhccccceEEEeeCchhhhhhhccccC-------------------CCcchHHHHHHHHHHHHHHHH-HHhc Confidence 00111111223568999999999986 899999999998875544443 3345 Q ss_pred H Q lcl|NC_015262. 142 K 142 (147) Q Consensus 142 ~ 142 (147) + T Consensus 131 k 131 (131) T protein:vir:94 131 K 131 (131) T ss_pred C Confidence 5 No 114 >protein:vir:78380 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110844;genbank:gi:134288605;genbank:GeneID:5179643 Probab=98.42 E-value=3.6e-09 Score=66.88 Aligned_cols=112 Identities=16% Similarity=0.174 Sum_probs=73.4 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceeccccc---------CCCceE Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKK---------KGGTKY 71 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~---------~~~~~~ 71 (147) |++..++. + ...+++..+...+++.+..+..++....|+|||.++.|+.++.... ..|... T Consensus 1 msf~~~i~---~-------~~~~ve~~~~~~~r~~a~~~~~~iv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~g~~t 70 (131) T protein:vir:78 1 MSFALDVS---K-------FVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTA 70 (131) T ss_pred CCcCcCHH---H-------HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCchhhhccccceecccccccccCCCCCCchhh Confidence 77655433 3 3345555667778888888888889999999999999986542111 111100 Q ss_pred ----------EEEeeeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHH Q lcl|NC_015262. 72 ----------VLVGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEIL 141 (147) Q Consensus 72 ----------~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l 141 (147) ...|-.-....+.+|+.++|||++. |+|..|.+.++..-...+.++ .+++ T Consensus 71 ~~~~~~~i~~~~~g~~iyi~Nn~pYA~~LEyG~S~-------------------QAP~G~v~~~~~~~~~~v~~~-~~e~ 130 (131) T protein:vir:78 71 TSNAANFVLNAADWHTFTLTNNLPYAQRLEYGWSQ-------------------QAPQGFVRVNVSRFQQLLNEE-ASKV 130 (131) T ss_pred HHHHHHHHhhccCCceEEEeeCchhhhHhhccccC-------------------CCcchHHHHHHHHHHHHHHHH-HHhc Confidence 0111111223568999999999986 899999999998875544443 3345 Q ss_pred H Q lcl|NC_015262. 142 K 142 (147) Q Consensus 142 ~ 142 (147) + T Consensus 131 k 131 (131) T protein:vir:78 131 K 131 (131) T ss_pred C Confidence 5 No 115 >protein:vir:79225 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469157;genbank:gi:157835000;genbank:GeneID:5648806 Probab=98.41 E-value=8.7e-09 Score=64.76 Aligned_cols=121 Identities=14% Similarity=0.182 Sum_probs=79.0 Q ss_pred Cceeeeehh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC--------C--------------------CCc Q lcl|NC_015262. 1 MSVEITTEG-FDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRV--------S--------------------KRS 51 (147) Q Consensus 1 M~~~~~i~G-l~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~a--------p--------------------~~t 51 (147) |++.|+|+- .+++.+.|.+|...+.. ....++..+..++...+.+. | .+| T Consensus 1 M~~~i~i~~d~~~~~~~L~~l~~~~~d-~~~l~~~ig~~l~~~~~~rF~~eG~~W~pls~~t~~~r~~~g~~~~~iL~~t 79 (155) T protein:vir:79 1 MTTRIDVELDDQEVRQRLAVLMRSVTD-TLPVMRGIAAELLAETEFAFMDEGPGWPQLSPATVAAREAKGRGPHPILQVT 79 (155) T ss_pred CceEEEEEechHHHHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHHHHhhccCCCCCCCCHHHHHHHhccCCCCCCccccc Confidence 986666652 36889999998776653 45666777777777665543 1 368 Q ss_pred chhhhcceecccccCCCceEEEEeeeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHH---- Q lcl|NC_015262. 52 GKLKDGLKVSGVKKKGGTKYVLVGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYE---- 127 (147) Q Consensus 52 G~l~~sI~~~~~~~~~~~~~~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~---- 127 (147) |+|++||.... +...+.|| ++..||.+.+||+.... .+...+||+|||--+-+ T Consensus 80 G~L~~Si~~~~-----~~~~v~vG------t~~~YA~iHqfGg~~~~------------~~~v~iPaRpfLG~s~~~~l~ 136 (155) T protein:vir:79 80 NALARSVTTWA-----DRNEAGIG------SNLVYAAIHQFGGDAGR------------GHQVEIPARRYLPFDENGQLA 136 (155) T ss_pred hhhhhhhhcee-----cCCEEEEe------cCchhhhhhhcccccCC------------CCccccCCccccCCCCccccc Confidence 99999997442 22345666 35689999999975321 12235899999964432 Q ss_pred -HHHHHHHHHHHHHHHHHh Q lcl|NC_015262. 128 -SKKDEAKNVMKEILKRGL 145 (147) Q Consensus 128 -~~~~~~~~~i~~~l~~~i 145 (147) +-+++|.+.+.+.|.++= T Consensus 137 ~~~~~~I~~~i~~~l~r~r 155 (155) T protein:vir:79 137 AGARQSILEVVLTALSRNR 155 (155) T ss_pred hHHHHHHHHHHHHHHHhcC Confidence 334566666666665555 No 116 >protein:vir:1028 Length: 168 # NCBI annotation: Orf48 # Family: family:all:1029 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076682;genbank:gi:13095791;genbank:GeneID:920342 Probab=98.38 E-value=5.6e-09 Score=65.82 Aligned_cols=144 Identities=15% Similarity=0.220 Sum_probs=96.1 Q ss_pred CceeeeehhHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhCCC------Ccc---hhhhcceecccccCC-Cc Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSG-DKLLNEAVKAGGNVILQDALPRVSK------RSG---KLKDGLKVSGVKKKG-GT 69 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~-~~~~~~al~~~a~~v~~~ak~~ap~------~tG---~l~~sI~~~~~~~~~-~~ 69 (147) |. +|. ..|++++.+++.|.... .+.-.+...+||+++++.+...+|. .|| ||+|||..+...-.+ .. T Consensus 1 M~-~~~-d~l~~~~~~vekl~~~ls~eqkakITkAGAkv~~~~L~~~tk~kHy~~k~t~~~~HLaDsI~~~~~niDg~~d 78 (168) T protein:vir:10 1 MV-SFY-DAMQLIVDRAEELSTKMSVEDKAEVTKAGAKVFEQALAYEVRNRHYRHRDTGEDPHLADSIVMKNKNIDGVKD 78 (168) T ss_pred CC-cHH-HHHHHHHHHHHHhhcCCCHHHHHHHhHhhhHHHHHHHHHHhhHhhhccCCCCccchhhhhheecccccccccC Confidence 43 221 12555666666664222 2234567778888888888877763 444 999999876543322 24 Q ss_pred eEEEEeeeccCC----cccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHH--HHHHHHHHHHHHHHH Q lcl|NC_015262. 70 KYVLVGITKEDN----SKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYES--KKDEAKNVMKEILKR 143 (147) Q Consensus 70 ~~~~Vg~~~~~~----~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~--~~~~~~~~i~~~l~~ 143 (147) ...+|||....- ..++.|+|+.-||+.++...+ ........+.+.|++.+|+..+-+. .++.+.++..+++++ T Consensus 79 G~s~VGf~~k~~~~~~~ka~iAr~lNDGTk~~~~~~~-~~~~~~~~g~v~i~gDHFvd~~r~d~a~k~~V~~Ae~~~y~e 157 (168) T protein:vir:10 79 GQSVVGWERSTEKGTHTKGYIANIINNGSRFPQFTTR-SGRKYKKPGEVAVHADHFIEETRKNPIVQQGILKAEAEAMRK 157 (168) T ss_pred CceeecccCccccccccchheeeeccccccccccccc-cccccccccccccccchhHHHhhhchhhhHHHHHHHHHHHHH Confidence 567899976532 367889999999975433222 2223344456679999999999997 479999999988888 Q ss_pred HhcC Q lcl|NC_015262. 144 GLGL 147 (147) Q Consensus 144 ~i~~ 147 (147) -|+= T Consensus 158 Il~~ 161 (168) T protein:vir:10 158 IINR 161 (168) T ss_pred HHHh Confidence 8876 No 117 >protein:vir:99196 Length: 155 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950453;genbank:gi:119953654;genbank:GeneID:4643056 Probab=98.35 E-value=1.7e-08 Score=63.23 Aligned_cols=121 Identities=12% Similarity=0.165 Sum_probs=79.4 Q ss_pred Cceeeeehh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC--------C--------------------CCc Q lcl|NC_015262. 1 MSVEITTEG-FDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRV--------S--------------------KRS 51 (147) Q Consensus 1 M~~~~~i~G-l~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~a--------p--------------------~~t 51 (147) ||..|+|+. .++|.+.|.+|...+.. .+..++..++.++...+.+. | .+| T Consensus 1 Ms~~i~i~~d~~~~~~~L~~l~~~~~d-~~~l~~~ig~~l~~~~~~rF~pdG~~W~pls~~t~~~r~~~g~~~~~iL~~t 79 (155) T protein:vir:99 1 MTTRIDVELDDQEVRQRLALLMRSVTD-TLPVMRGIAAELLAETEFAFMDEGPGWPQLSPVTVAAREAKGRGPHPILQVT 79 (155) T ss_pred CceEEEEEechHHHHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHHHHhhccCCCCCCCChHHHHHHhccCCCCCCcchhc Confidence 997666652 47889999998777653 45666777777777665543 1 358 Q ss_pred chhhhcceecccccCCCceEEEEeeeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHH----- Q lcl|NC_015262. 52 GKLKDGLKVSGVKKKGGTKYVLVGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAY----- 126 (147) Q Consensus 52 G~l~~sI~~~~~~~~~~~~~~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~----- 126 (147) |+|++||.... +...+.||. +..||.+.+||+.... .....+|++|||--+- T Consensus 80 g~L~~Si~~~~-----~~~~v~vGt------n~~YA~iHqfGg~~~~------------~~~v~iPaRpfLG~s~~~~l~ 136 (155) T protein:vir:99 80 NALARSVTTWA-----DRNEAGIGS------NLVYAAIHQFGGDAGR------------GHQVEIPARRYLPFDENGQLA 136 (155) T ss_pred hhhhhhhhcee-----cCCEEEEec------CccchhhhhcccccCC------------CCccccCCccccCCCCccccc Confidence 89999997542 223466663 5689999999975321 1123599999996433 Q ss_pred HHHHHHHHHHHHHHHHHHh Q lcl|NC_015262. 127 ESKKDEAKNVMKEILKRGL 145 (147) Q Consensus 127 ~~~~~~~~~~i~~~l~~~i 145 (147) .+.+++|.+.+.+.|.+.= T Consensus 137 ~e~~~~I~~~i~~~l~~~~ 155 (155) T protein:vir:99 137 AGARQSILEIVLTALSRNR 155 (155) T ss_pred hHHHHHHHHHHHHHHhccC Confidence 2345566666666666655 No 118 >protein:vir:99546 Length: 200 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039796;genbank:gi:126011046;genbank:GeneID:4818241 Probab=98.29 E-value=9.5e-09 Score=64.56 Aligned_cols=103 Identities=17% Similarity=0.151 Sum_probs=58.2 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeec-- Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITK-- 78 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~-- 78 (147) |+++++++|-+.|.+.+++|... . + ..+.||+-. T Consensus 5 ~~~~~k~~~~~~~~~~~~~l~~l-~--------------------------~-----------------~~v~vGi~~~~ 40 (200) T protein:vir:99 5 FSKSNSVAAPLKHFQMLKQFDAL-K--------------------------G-----------------KTVQAGWFETD 40 (200) T ss_pred cceeeeeecchHHHHHHHHHHHh-h--------------------------C-----------------CeEEEEEcCCC Confidence 89999999844444433333110 0 0 011122110 Q ss_pred ---------cCCcccchhhhhhcccccccccccccccc----c------------------ccccCCCCCCCcchhhHHH Q lcl|NC_015262. 79 ---------EDNSKIFYGKFLEFGASAHKIPIKKGKKK----G------------------RIINHPGVSPKPFLAPAYE 127 (147) Q Consensus 79 ---------~~~~~~~y~~~vE~GT~~~~~~~~~~~~~----~------------------~~~~~~~~~a~PFl~pA~~ 127 (147) ...+-+..|.+.|||+.-..+....+... + ....+..+||+|||||+++ T Consensus 41 ~y~~~~~~~dG~~va~IA~~~EfG~~i~~p~~~~~~~~~~~~g~~~g~rfv~k~~~~~~~~~~~~~v~IP~RPFlr~t~~ 120 (200) T protein:vir:99 41 RYPAKEGETIGPLVAKIARQLEFGGVINHPGGTKYIKDAIVDGRYVGTRFVHKSFQGEHEVTKAHQIVIPARPFMRLAWA 120 (200) T ss_pred CcCCcccccccchHHHHHhHHHcCCeeccCCCccccccccccccccccccccccccceeeeeccccccCCCcchhhHHHH Confidence 01123456888999964221111111000 0 0112346899999999999 Q ss_pred HHHHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 128 SKKDEAKNVMKEILKRGLGL 147 (147) Q Consensus 128 ~~~~~~~~~i~~~l~~~i~~ 147 (147) .+++++.+.++..+.+.|+- T Consensus 121 ~~~~~~~~~~~~~~~~~l~g 140 (200) T protein:vir:99 121 TFNKDKVKIQAQIARQLLDG 140 (200) T ss_pred HHHHHHHHHHHHHHHHHHhC Confidence 99999999999999887755 No 119 >protein:vir:99833 Length: 190 # NCBI annotation: hypothetical protein # Family: family:all:274 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164071;genbank:gi:56692603;genbank:GeneID:3192561 Probab=98.29 E-value=1.9e-08 Score=62.96 Aligned_cols=133 Identities=14% Similarity=0.179 Sum_probs=83.9 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-----C------------------------CCc Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRV-----S------------------------KRS 51 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~a-----p------------------------~~t 51 (147) |.++++|. +++|.+.|+.|...+.. .+..++..++.++...+.+. | .+| T Consensus 2 ~~i~i~~d-~~~~~~~L~~l~~~~~~-~~~l~~~ig~~l~~~~~~rf~~~~~PdG~~W~p~~~~t~~rk~~~~~~~L~~t 79 (190) T protein:vir:99 2 AGITLEWD-GRRALDVLNAGSAALGD-PSGLLQDIGELLLNIHRRRFQAQVSPDGTPWQPLSPAYLRRKRKNRDKILTLD 79 (190) T ss_pred ceeEEEec-HHHHHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCccccHHHHHHhhcCCCccceec Confidence 55777775 57888888888766543 35667777777777665442 2 247 Q ss_pred chhhhcceecccccCCCceEEEEeeeccCCcccchhhhhhcccccccccccc----------cccccc------------ Q lcl|NC_015262. 52 GKLKDGLKVSGVKKKGGTKYVLVGITKEDNSKIFYGKFLEFGASAHKIPIKK----------GKKKGR------------ 109 (147) Q Consensus 52 G~l~~sI~~~~~~~~~~~~~~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~----------~~~~~~------------ 109 (147) |+|++||..... ...+.||. +..|+...+||.......... +..... T Consensus 80 g~L~~Si~~~~~-----~~~v~vGt------n~~yA~iHq~Gg~i~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~ 148 (190) T protein:vir:99 80 GHLRNLLRYQLD-----GSELLFGS------DRPYAAIHHFGGTIQRQARSSTVYFRQNERTGEVGREFVPRRRSNFAQD 148 (190) T ss_pred HHHHHHHhheec-----CcEEEEec------CcchhhhhhcCCcccccccchhhhhhhhhhhhhhhcccccccccccchh Confidence 899999974322 22466663 478999999995432221110 000000 Q ss_pred ---cccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 110 ---IINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGLGL 147 (147) Q Consensus 110 ---~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~~ 147 (147) ...+..+|++|||--+ ++.++++.+.|.+.|.+.|.= T Consensus 149 ~~~~~~~v~IPaRpfLG~s-~~d~~~I~~~i~~~l~~~~~~ 188 (190) T protein:vir:99 149 VQIGPYTIQMPARPWLGTS-SQDDDTILQRVERYLQRALRE 188 (190) T ss_pred cccccceeeecCcccCCCC-HHHHHHHHHHHHHHHHHHHhh Confidence 0112247999999655 566788888888888888877 No 120 >protein:vir:5257 Length: 148 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852762;genbank:gi:31544037;uniprot:Q7Y5T8;genbank:GeneID:2753554 Probab=98.29 E-value=3.5e-09 Score=66.92 Aligned_cols=81 Identities=21% Similarity=0.425 Sum_probs=57.7 Q ss_pred Cceeeeeh--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeee- Q lcl|NC_015262. 1 MSVEITTE--GFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGIT- 77 (147) Q Consensus 1 M~~~~~i~--Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~- 77 (147) |+++++.. |+++|++.|+.|... .+.||+- T Consensus 1 M~~~~k~~~~~~~~l~~~l~~l~~~-----------------------------------------------~v~VGi~~ 33 (148) T protein:vir:52 1 MAVTVTANFSAAKQLIEQMKSLKEK-----------------------------------------------AVYVGFPA 33 (148) T ss_pred CccccccccHHHHHHHHHHHHhhCC-----------------------------------------------eEEEEeec Confidence 88766664 677777777766321 1122221 Q ss_pred --------ccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 78 --------KEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGLGL 147 (147) Q Consensus 78 --------~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~~ 147 (147) ....+.+..+.+.|||+. .+||+|||||+++.+++++.+.+...++..++- T Consensus 34 ~~~~~~~~~~g~~vA~ia~~~E~G~~-------------------~IP~Rpflr~t~~~~~~~~~~~~~~~~~~~~~~ 92 (148) T protein:vir:52 34 EFDEKVKGSENFNLASLAAVLEFGNE-------------------HIPARPFLRQTLEENQEKYTALFIQWFDQGVPA 92 (148) T ss_pred CcCCCCCCCCCCCHHHHHHHHhcCCC-------------------CCCCcchhHHHHHHHHHHHHHHHHHHHHcCCCH Confidence 011245778999999975 399999999999999999999999888876665 No 121 >protein:vir:107851 Length: 175 # NCBI annotation: gp31 # Family: family:all:274 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024704;genbank:gi:48696941;genbank:GeneID:2845939 Probab=98.25 E-value=2.3e-08 Score=62.47 Aligned_cols=122 Identities=13% Similarity=0.208 Sum_probs=75.9 Q ss_pred Cce--eeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-----C------------------------- Q lcl|NC_015262. 1 MSV--EITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRV-----S------------------------- 48 (147) Q Consensus 1 M~~--~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~a-----p------------------------- 48 (147) ||. +|+|. .++|.+.|++|...+.. .+..++..++.++.....+. | T Consensus 1 Ms~~i~i~~~-~~~l~~~L~~l~~~~~d-~~~l~~~Ig~~l~~~t~~rF~~e~~Pdw~p~~p~t~~~r~~~g~~~~k~~~ 78 (175) T protein:vir:10 1 MSDFVNFQID-DSALRTRLLQLEQAGHQ-KAGAMRKIAQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKNGE 78 (175) T ss_pred CceeEEEEec-HHHHHHHHHHHHHHhcc-HHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchhhhhhhhcccccchhhhh Confidence 884 45555 47788899888766542 33455555555555544332 1 Q ss_pred ---------------CCcchhhhcceecccccCCCceEEEEeeeccCCcccchhhhhhcccccccccccccccccccccC Q lcl|NC_015262. 49 ---------------KRSGKLKDGLKVSGVKKKGGTKYVLVGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINH 113 (147) Q Consensus 49 ---------------~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~ 113 (147) .+||.|++||..... ...+.||. +..||.+..||+.... ... T Consensus 79 ~~~~~~~~~~~~~~L~~tG~L~~Si~~~~~-----~~~v~vGt------n~~YAaiHqfGg~~~~------------~~~ 135 (175) T protein:vir:10 79 LTAAASRRKAGLMILQDSGQMAASVSTDHD-----DNSAVIGS------NKEYAAIHQFGGQAGR------------GLK 135 (175) T ss_pred hhhhhhhhccCCCcceechhhhhhhheeec-----CCEEEEec------ChhhhhhhhcccccCC------------CCc Confidence 247889999964432 23566664 4689999999975321 122 Q ss_pred CCCCCCcchhhHHHH-----HHHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 114 PGVSPKPFLAPAYES-----KKDEAKNVMKEILKRGLGL 147 (147) Q Consensus 114 ~~~~a~PFl~pA~~~-----~~~~~~~~i~~~l~~~i~~ 147 (147) ..+||+|||--+-+. ..+.+++.+.+.|.+++.= T Consensus 136 v~iPaRpfLG~s~~d~~~~e~~~~Il~~~~~~l~~~~~~ 174 (175) T protein:vir:10 136 VTIPARPWLPVTADGELQPEAVEPVLNTILRHLMDAANR 174 (175) T ss_pred cccCCccccCCCcccccchHHHHHHHHHHHHHHHHHhcc Confidence 369999999865322 2456666666666666666 No 122 >protein:vir:7449 Length: 123 # NCBI annotation: gp26 # Family: family:all:2713 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818564;genbank:gi:29567001;genbank:GeneID:1260238 Probab=98.21 E-value=5.7e-08 Score=60.30 Aligned_cols=119 Identities=14% Similarity=0.202 Sum_probs=86.5 Q ss_pred Cc-eeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--CcchhhhcceecccccCCCceEEEEeee Q lcl|NC_015262. 1 MS-VEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSK--RSGKLKDGLKVSGVKKKGGTKYVLVGIT 77 (147) Q Consensus 1 M~-~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~--~tG~l~~sI~~~~~~~~~~~~~~~Vg~~ 77 (147) |+ ++|+|. .++|++.++.+..+....+.--....|..+..+||.+||. +||+.|.+|.-... .. |...+.|.+. T Consensus 1 ~~~~~f~~d-~~~l~~~i~~~~~k~~~~~~~~~d~~a~~le~~aK~nApW~DRTg~ARqgl~~~~~-~~-g~~~~~Iyls 77 (123) T protein:vir:74 1 MAKVTFEYD-AQELRTNIRNLDRRMESAVDALMDYEAAYATGQLKMRAPWTDRTGAARSGLLAVAN-KL-GPGSHELIMS 77 (123) T ss_pred CceeEEEec-HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccchhhhhhhccccc-cC-CCceEEEEEe Confidence 65 556654 6788999999988888888777888999999999999996 79999999953322 22 2222333332 Q ss_pred ccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 78 KEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGLGL 147 (147) Q Consensus 78 ~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~~ 147 (147) -...|+.|+|.++... + .-+.|+++...+++++-++..+.+-=+- T Consensus 78 ----h~veYG~~LEla~~~k-------------------y--aIi~Ptv~~~~~~im~g~~~ll~~l~~~ 122 (123) T protein:vir:74 78 ----YSVHYGIWLEIANSGQ-------------------Y--AVIGPFLPVMGRKLMHDLEHLIDRLERA 122 (123) T ss_pred ----cCeeecceeeecCCCC-------------------c--eeecchHHHHhHHHHHHHHHHHHHhhcc Confidence 2358999999887531 1 2689999999999999988876654444 No 123 >protein:vir:94944 Length: 121 # NCBI annotation: hypothetical protein phage protein # Family: family:all:448 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239282;genbank:gi:66392064;genbank:GeneID:5076589 Probab=98.20 E-value=1.2e-08 Score=63.96 Aligned_cols=103 Identities=13% Similarity=0.090 Sum_probs=68.0 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceeccccc---------CCCceE Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKK---------KGGTKY 71 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~---------~~~~~~ 71 (147) |+++|.- .|....+++++.+...+++.+..+.+.+....|+|||.+|.|+.++.... ..|... T Consensus 2 ~~~sf~~--------~i~~~~~~ve~~~~~~~r~~~~~~~~~vv~~sPVdtGrfRanw~vs~~~p~~~~~~~~dp~g~~t 73 (121) T protein:vir:94 2 ISMKFNV--------NLSRLRSNLREEAKKKAIRIAQEIVNGVIARSPVLAGDYRSSWNVSEGSMEFKFNNGGNPANPTP 73 (121) T ss_pred ccchhhc--------cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCchhhhhccccccccCcccccCCCCCCCcchh Confidence 5554433 34444445555666778888888889999999999999999986642111 111100 Q ss_pred EE--------EeeeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHH Q lcl|NC_015262. 72 VL--------VGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKK 130 (147) Q Consensus 72 ~~--------Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~ 130 (147) .. .|..-....+.+|+..+|||++. |+|..|.+.++.+-+ T Consensus 74 ~~~~~~~~~~~~~~iyi~NnlpYA~~LE~G~S~-------------------QAP~G~v~~t~~~~q 121 (121) T protein:vir:94 74 APAIVVSSNVALPHFYITNGAPYAQQLEKGSST-------------------QAPLGIVRVTLASLR 121 (121) T ss_pred HHHHHHHHhhccceEEEeeCcchhhhhhcccCC-------------------CCcchHHHHHHHhhC Confidence 00 00011123567999999999986 899999999988776 No 124 >protein:vir:1087 Length: 161 # NCBI annotation: Orf46 # Family: family:all:1029 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076741;genbank:gi:13095851;genbank:GeneID:920400 Probab=98.16 E-value=1.8e-08 Score=63.07 Aligned_cols=142 Identities=18% Similarity=0.160 Sum_probs=93.3 Q ss_pred Cceeeee--hhHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhCCC------Cc---chhhhcceecccccCC- Q lcl|NC_015262. 1 MSVEITT--EGFDAVLSKIESMGKSG-DKLLNEAVKAGGNVILQDALPRVSK------RS---GKLKDGLKVSGVKKKG- 67 (147) Q Consensus 1 M~~~~~i--~Gl~el~~~l~~l~~~~-~~~~~~al~~~a~~v~~~ak~~ap~------~t---G~l~~sI~~~~~~~~~- 67 (147) |.-+=.+ ..|++++.+++++..+. .+.-.+...+||+++++.+...+|. .| |||+|||..+...-.+ T Consensus 1 ~~~~~~~fdd~L~~~~~~v~klv~~lt~e~kakIT~AGAkv~a~~L~~~T~~kHy~~~kt~k~~HLADsI~~~~~niDg~ 80 (161) T protein:vir:10 1 MMEEKQLFEDIMNGIIFQAESVSTSLTVEDKAKITKAGANAFAIGLEKVTKDKHYRIRKTGENPHLADSILVQNTNIDGI 80 (161) T ss_pred CcchhHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHhHHHHHHHHHHHhhhhcCcCCCCCCcchhhhheeecccccCcc Confidence 3322222 24455555555554332 2334567788899988888877763 34 5999999877553322 Q ss_pred CceEEEEeeeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHH--HHHHHHHHHHHHHHHHHh Q lcl|NC_015262. 68 GTKYVLVGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYE--SKKDEAKNVMKEILKRGL 145 (147) Q Consensus 68 ~~~~~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~--~~~~~~~~~i~~~l~~~i 145 (147) .....+|||... +++.++|++-||+-.... .-......-++..|++.+|+..+-+ +.++.+.++..+++++-| T Consensus 81 ~dG~StVGw~~k---ka~ia~~indGtr~~~~~--~~~~~~~n~Gt~~i~gDHFvd~~r~~~~~k~aV~~Ae~~~y~eil 155 (161) T protein:vir:10 81 KDGNSTVGWDYT---KSRVGHLIENGTRFPMYS--KKGTKYRKGGQVAITSDPFVSTYRDSMEAQVAMFSAEAEVFSEIL 155 (161) T ss_pred cCCceeccccCc---hhhhhhhhcccchhhhhh--cccccccCCcceeecCcchhHHHHhhhhhHHHHHHHHHHHHHHHH Confidence 234578898643 478899999999631111 1122233456677999999999999 577999999999998888 Q ss_pred cC Q lcl|NC_015262. 146 GL 147 (147) Q Consensus 146 ~~ 147 (147) .= T Consensus 156 ~~ 157 (161) T protein:vir:10 156 KK 157 (161) T ss_pred Hh Confidence 77 No 125 >protein:vir:101508 Length: 120 # NCBI annotation: gp21 # Family: family:all:2713 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655400;genbank:gi:109522588;genbank:GeneID:4157580 Probab=98.15 E-value=6.8e-08 Score=59.87 Aligned_cols=115 Identities=15% Similarity=0.192 Sum_probs=84.9 Q ss_pred Cc-eeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--CcchhhhcceecccccCCCceEEEEeee Q lcl|NC_015262. 1 MS-VEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSK--RSGKLKDGLKVSGVKKKGGTKYVLVGIT 77 (147) Q Consensus 1 M~-~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~--~tG~l~~sI~~~~~~~~~~~~~~~Vg~~ 77 (147) |+ ++|+|. .++|++.++.+..+....+.--....|..+..+||.+||. +||+.|..|.........+ .+.|.+. T Consensus 1 ~~~~~f~~~-~~~l~~~i~~~~~k~~~~~~~~~d~~a~~le~~aK~nApW~DRTg~ARq~i~~~~~~~~~~--~~~Iyls 77 (120) T protein:vir:10 1 MAKIEFKFK-DIELRRGVEDMEAKVDRAMKATSNYHAVEGTAHMKEHAPWTDRTGAARAGLHAVASTPQPD--RYEIVFA 77 (120) T ss_pred CceEEEEec-HHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccchhhhhhhccccccCCCc--eEEEEEe Confidence 66 677776 5789999999988888877777788899999999999996 6999999996433222222 2233222 Q ss_pred ccCCcccchhhhhh--cccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_015262. 78 KEDNSKIFYGKFLE--FGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGLG 146 (147) Q Consensus 78 ~~~~~~~~y~~~vE--~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~ 146 (147) -...|+.|+| .|..++ -+.|+++...+++++-++..|.+ || T Consensus 78 ----h~veYG~~LEla~~~kya-----------------------Il~PTi~~~~~~il~g~~~ll~~-l~ 120 (120) T protein:vir:10 78 ----HTVHYGIWLEIANSGRYE-----------------------IIMPTVHHEGKLMAQRLRGLLGR-LR 120 (120) T ss_pred ----cCeeecceEEeeCCCCcc-----------------------cccchHHHHhHHHHHHHHHHhhh-cC Confidence 2358899999 454332 48899999999999999886654 44 No 126 >protein:vir:80425 Length: 134 # NCBI annotation: BcepGomrgp15 # Family: family:all:448 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210235;genbank:gi:146329927;genbank:GeneID:5123534 Probab=98.15 E-value=1.1e-08 Score=64.31 Aligned_cols=112 Identities=15% Similarity=0.115 Sum_probs=71.3 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceeccccc---------CCCce- Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKK---------KGGTK- 70 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~---------~~~~~- 70 (147) |++..++ ....+.++..+...+++.+..+..++....|+|||.+|.|+.++.... ..+.. T Consensus 1 msF~~~i----------~~~~~~ve~~~~~~~r~~a~~~~~~vv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~g~~~ 70 (134) T protein:vir:80 1 MSYTDRF----------NVIAKGIEDNVDNLVKNVALAIGSNVIADTPILTGQARRNWQTELNQMPESVLDIPESPSEGM 70 (134) T ss_pred CCcccCH----------HHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhhhcccceeecCcccccccCcCCCCccc Confidence 7655443 233445555677788888888888899999999999999986552110 01110 Q ss_pred ------------EEEEeeeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHH Q lcl|NC_015262. 71 ------------YVLVGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMK 138 (147) Q Consensus 71 ------------~~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~ 138 (147) ....|-.-....+.+|+.++|||++. |+|..|.+-+..+-..-+.+ . T Consensus 71 ~~~~~~~~~vi~~~k~g~~iyi~Nn~pYA~~LEyG~S~-------------------QAP~G~v~~t~~~~~~~v~~--~ 129 (134) T protein:vir:80 71 DEALQVLQQTVGQYKAGDTVHITNNAPYIKELNSGSSQ-------------------QAPANFVETSIMRATRLIRN--V 129 (134) T ss_pred hhhHHHHHHHHhhccCcceEEEeeCchhhhhhhccccC-------------------CCcchHHHHHHHHHHHHHHh--h Confidence 00111111223568999999999985 89999999887766554433 2 Q ss_pred HHHHH Q lcl|NC_015262. 139 EILKR 143 (147) Q Consensus 139 ~~l~~ 143 (147) +.+-+ T Consensus 130 ~~~~~ 134 (134) T protein:vir:80 130 KVVPQ 134 (134) T ss_pred ccCCC Confidence 22333 No 127 >protein:vir:95157 Length: 144 # NCBI annotation: hypothetical protein ORF019 # Family: family:all:448 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293426;genbank:gi:148912847;genbank:GeneID:5228232 Probab=98.14 E-value=1.8e-08 Score=63.09 Aligned_cols=116 Identities=15% Similarity=0.077 Sum_probs=78.7 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceeccccc--------------- Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKK--------------- 65 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~--------------- 65 (147) |+- .+-++...++...+.++..+...+++.|..+...+..+.|+|||.+|.|+.++.... T Consensus 1 MA~-----~~~~f~~~i~~~~~~ve~~~~~~~r~~a~~v~~~vv~~sPVDTGrfRanw~vs~~~p~~~~~~~~~~~~~~~ 75 (144) T protein:vir:95 1 MAK-----SLLDLADRLEKKAKAIDEAASQNAVDTALAIVGDLAYKTPVDTSQALSNWIVTLESPSGQQIKPHFPGSQGS 75 (144) T ss_pred Cch-----hhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccceeccccccccccccccccccc Confidence 663 233456677777778888899999999999999999999999999999987653211 Q ss_pred ---CCCceE----------EEEeeeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHH Q lcl|NC_015262. 66 ---KGGTKY----------VLVGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDE 132 (147) Q Consensus 66 ---~~~~~~----------~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~ 132 (147) .++... ..+|-.-....+.+|+..+|||++. |+|..|.+.++.+-..- T Consensus 76 t~d~sg~~tl~~~~~vi~~~~~g~~iyi~NnlpYA~~LEyG~S~-------------------QAP~G~vr~~~q~~~~~ 136 (144) T protein:vir:95 76 TQRASAAETLNSAKLVLRNKKPGQAIFITNNLPYIRRLNDGYSA-------------------QAPAGFVERAVLIGRKM 136 (144) T ss_pred cCCCchhHHHHHHHHHHhhcCccceEEEeeCchhhhhhhccccC-------------------CCcchHHHHHHHHHHHH Confidence 011000 0111111223568999999999986 89999999998877554 Q ss_pred HHHHHHHHHHHHhc Q lcl|NC_015262. 133 AKNVMKEILKRGLG 146 (147) Q Consensus 133 ~~~~i~~~l~~~i~ 146 (147) +.+. |-+| T Consensus 137 v~~~------~~~~ 144 (144) T protein:vir:95 137 RKKF------KIKD 144 (144) T ss_pred HHhh------ccCC Confidence 3321 1112 No 128 >protein:vir:97190 Length: 148 # NCBI annotation: hypothetical protein ORF030 # Family: family:all:448 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294538;genbank:gi:149408259;genbank:GeneID:5237055 Probab=98.12 E-value=1.7e-08 Score=63.14 Aligned_cols=120 Identities=13% Similarity=0.096 Sum_probs=78.6 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceeccccc-----------CCCc Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKK-----------KGGT 69 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~-----------~~~~ 69 (147) |.. +-++...+....+.++..+...+++.+..+..++....|+|||.+|.|+.++.... ..|. T Consensus 1 m~~------~~sFa~~i~~~~~~ve~~~~~~~r~~a~~i~~~vv~~sPVdTGrfRanw~vs~~~p~~~~~~~~dp~~~G~ 74 (148) T protein:vir:97 1 MPS------LSEFSRRITLRGRKVAEGADALTRKVALAADQAVVSGTPVDTGRARSNWIAAIGSAPSSVIDAYSPGEAGS 74 (148) T ss_pred CCc------cchhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhhhhhhheeecccccccccccCCCCCCc Confidence 664 33455566677777777888889999999999999999999999999986651111 0010 Q ss_pred -------eE----------EEEeeeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHH Q lcl|NC_015262. 70 -------KY----------VLVGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDE 132 (147) Q Consensus 70 -------~~----------~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~ 132 (147) .. ..+|..-....+.+|+..+|||++. |.|..|.+.++..-..- T Consensus 75 ~~~~~~~~~i~~~~~vi~~~k~g~~iyi~NnlpYA~~LEyG~S~-------------------QAP~G~v~~t~~~~~~~ 135 (148) T protein:vir:97 75 TEAANTQAAIDQAESVIRGYNYGEEIHITNNLPYIQRLNDGYSA-------------------QAPANFVEQAVLEAVQV 135 (148) T ss_pred ccccchhHHHHHHHHHhhccCCCceEEEeecchhhhHhhccccC-------------------CCcchHHHHHHHHHHHH Confidence 00 0111112223568999999999985 89999999998776554 Q ss_pred HHHHHHHHHHHHhcC Q lcl|NC_015262. 133 AKNVMKEILKRGLGL 147 (147) Q Consensus 133 ~~~~i~~~l~~~i~~ 147 (147) +.+ .++++.-=+- T Consensus 136 v~~--~~~~~~~~~~ 148 (148) T protein:vir:97 136 VQF--GRVVDGDPGS 148 (148) T ss_pred HHh--hhhhcCCCCC Confidence 432 2222222222 No 129 >protein:vir:3994 Length: 168 # NCBI annotation: unknown # Family: family:all:1029 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116502;genbank:gi:14251135;genbank:GeneID:921309 Probab=98.12 E-value=2.3e-08 Score=62.48 Aligned_cols=144 Identities=15% Similarity=0.203 Sum_probs=92.9 Q ss_pred CceeeeehhHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhCCC------C---cchhhhcceecccccCC-Cc Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSG-DKLLNEAVKAGGNVILQDALPRVSK------R---SGKLKDGLKVSGVKKKG-GT 69 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~-~~~~~~al~~~a~~v~~~ak~~ap~------~---tG~l~~sI~~~~~~~~~-~~ 69 (147) |. +|. ..|++++.++++|..++ .+.-.+...+||+++++.....+|. . .+||+|||..+...-.+ .. T Consensus 1 M~-~~~-d~l~~~~~~v~kl~~~lt~e~kakIT~AGAkv~a~~L~~~T~~kHy~~rktg~~~HLADsI~~~~~niDg~~d 78 (168) T protein:vir:39 1 MV-SFY-DAMQLIINQAESLSTKMTVEDKAEVTKAGAKVFEQALAYEVRNRHYRHRDTGEDPHLADSIVMKNKNIDGVKD 78 (168) T ss_pred Cc-cHH-HHHHHHHHHHHhccCCCCHHHHHHHHHHhHHHHHHHHHHHhHHhcccCCCCCCCccchhheeecccccCcccC Confidence 43 221 13555666666665332 2334567778888888877766652 3 37999999876653322 23 Q ss_pred eEEEEeeeccC----CcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHH--HHHHHHHHHHHHHHH Q lcl|NC_015262. 70 KYVLVGITKED----NSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYES--KKDEAKNVMKEILKR 143 (147) Q Consensus 70 ~~~~Vg~~~~~----~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~--~~~~~~~~i~~~l~~ 143 (147) ...+|||.... ...++.|+|+.-||+-+.... +........+...|++.+|+..+-+. .++.+.++..+++++ T Consensus 79 G~StVGw~~k~~~~~~~~a~iAr~lNDGTrf~~~~~-~~~~~y~~~g~v~i~gDHFvd~~r~~~a~k~aV~~Ae~e~~~e 157 (168) T protein:vir:39 79 GQSVVGWERSTEKGTHTKGYIANIINNGSRFPQFTT-RSGRKYKNPGEVAVHADHFIEETRKNPIVQQGILKAEAEAMRK 157 (168) T ss_pred CceeccccCccccccccchhheehhccccccchhhh-hcccccccccceeecccchhHHHhhhhhhhHHHHHHHHHHHHH Confidence 46789997542 235778999999996321111 11122233455579999999999996 479999999999988 Q ss_pred HhcC Q lcl|NC_015262. 144 GLGL 147 (147) Q Consensus 144 ~i~~ 147 (147) -|.= T Consensus 158 il~~ 161 (168) T protein:vir:39 158 IINR 161 (168) T ss_pred HHHh Confidence 7765 No 130 >protein:vir:101563 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958111;genbank:gi:41057657;genbank:GeneID:2716820 Probab=98.08 E-value=6.9e-09 Score=65.32 Aligned_cols=95 Identities=16% Similarity=0.237 Sum_probs=56.0 Q ss_pred eeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhccee----cccccCCCceEEEEeeec Q lcl|NC_015262. 3 VEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKV----SGVKKKGGTKYVLVGITK 78 (147) Q Consensus 3 ~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~----~~~~~~~~~~~~~Vg~~~ 78 (147) |++.-+||+++++.|+.. +++--.+.+.+. -|.... ......+ .. T Consensus 1 m~v~r~~L~~~~~~l~~~---------------------~V~VGi~~~a~y-~d~~g~~~~~g~~~~~~---------~~ 49 (155) T protein:vir:10 1 MSVTRRGLTLPKDRYKSM---------------------SVKAGVLAGATY-PDESGKKLADGTILKKD---------PR 49 (155) T ss_pred CcchHHHHHHHHHHhhCC---------------------eeEEeecCCCCC-Cccccchhhhhhhhccc---------cc Confidence 556667877777655541 011111111111 010000 0000000 01 Q ss_pred cCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 79 EDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGLGL 147 (147) Q Consensus 79 ~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~~ 147 (147) ...+.+.++.|+||||.. +||+|||||+++.+++++.+.+...++.+++- T Consensus 50 ~G~pva~ia~~~e~G~~~-------------------IP~RPFlr~t~~~~~~~~~~~l~~~~~~~~~~ 99 (155) T protein:vir:10 50 AGLPVAMIAMALNYGTSK-------------------LPARPFMEKTIADRSAEWIKGLTVMMTMGYDA 99 (155) T ss_pred cCcchhhhhhhhhcCCCC-------------------CCCcchhHHHHHHHHHHHHHHHHHHHHcCCCH Confidence 112346678899999853 89999999999999999999999999887776 No 131 >protein:vir:96774 Length: 152 # NCBI annotation: hypothetical phage protein # Family: family:all:448 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224253;genbank:gi:62362388;genbank:GeneID:3345713 Probab=98.08 E-value=2.1e-08 Score=62.68 Aligned_cols=111 Identities=18% Similarity=0.190 Sum_probs=73.9 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--------------CcchhhhcceecccccC Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSK--------------RSGKLKDGLKVSGVKKK 66 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~--------------~tG~l~~sI~~~~~~~~ 66 (147) |++..++. ..-++++..+...+++.+..+...+-...|+ |||.+|.|+.++..... T Consensus 11 msFaa~i~----------~~~~~~e~~~~~~~R~~~~~i~~~vv~~sPVg~~~~~~~~a~~~ydtGrfRanw~vS~~~p~ 80 (152) T protein:vir:96 11 MSWSKSLK----------NIIVKNENLTEKQLRAGLFDAANTVILGSPVGAPELWQQPAPNYYRAGSYRSNHRVSISKIT 80 (152) T ss_pred ccccccHH----------HHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccchhhhhhhheeeecCCC Confidence 77665544 3444555567777888888888888888998 99999999876522211 Q ss_pred -------CCceEE----------EEeeeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHH Q lcl|NC_015262. 67 -------GGTKYV----------LVGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESK 129 (147) Q Consensus 67 -------~~~~~~----------~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~ 129 (147) ++...+ .+|-.-....+.+|+..+|||++. |+|..|.+.++..- T Consensus 81 ~~~~~~~~~~~t~~~~~~~i~~~~~g~~iyi~NnlPYA~~LEyG~S~-------------------QAP~G~vr~t~~~~ 141 (152) T protein:vir:96 81 SFEKGISSQSSIMMDLQSDIAKFKIGETLFMTNPLPYATSIEYGHSS-------------------QAPNGVYRPAVRRL 141 (152) T ss_pred cccccCCCCCchHHHHHHHHhhccccceEEEeeCchhhhHhhccccC-------------------CCCchHHHHHHHHH Confidence 111111 112122223567999999999886 89999999988776 Q ss_pred HHHHHHHHHHH Q lcl|NC_015262. 130 KDEAKNVMKEI 140 (147) Q Consensus 130 ~~~~~~~i~~~ 140 (147) ..-+.++++.+ T Consensus 142 ~~~v~ea~~~~ 152 (152) T protein:vir:96 142 VKFLNTELKAK 152 (152) T ss_pred HHHHHHHhccC Confidence 66555544444 No 132 >protein:vir:77650 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:YP_022741;genbank:gi:47835022;genbank:GeneID:2821447 Probab=98.07 E-value=1.6e-08 Score=63.34 Aligned_cols=93 Identities=19% Similarity=0.265 Sum_probs=54.4 Q ss_pred eeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC------cchhhhcceecccccCCCceEEEEee Q lcl|NC_015262. 3 VEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKR------SGKLKDGLKVSGVKKKGGTKYVLVGI 76 (147) Q Consensus 3 ~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~------tG~l~~sI~~~~~~~~~~~~~~~Vg~ 76 (147) |++.-.||+.+.+.|+... ++--++.+ +|...... . ....++ T Consensus 1 m~~~r~~l~~~~~~l~~~~---------------------v~VGi~~~a~y~d~~~~~~~~~--~-~~~~~~-------- 48 (155) T protein:vir:77 1 MSVTRRGLTLPKDRYRSMS---------------------VKAGVLAGATYPDESGKKLADG--S-ILKKDP-------- 48 (155) T ss_pred CcchHHHHHHHHHHHhcCc---------------------eEEeecCCCCCccccchhhhhh--h-hccccc-------- Confidence 4455567776666554311 11112211 11111000 0 000000 Q ss_pred eccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 77 TKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGLGL 147 (147) Q Consensus 77 ~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~~ 147 (147) ....+.+.++.+.||||. .+||+|||||+++++++++.+.+...++.+++- T Consensus 49 -~~G~pva~ia~~~e~G~~-------------------~IP~RPFlr~t~~~~~~~~~~~l~~~~~~~~~~ 99 (155) T protein:vir:77 49 -RAGLPVAMIAMALNYGTS-------------------KLPARPFMEKTIADRSAEWIKGLTVMMTMGYDA 99 (155) T ss_pred -cccccHhhhhhhhhcCCC-------------------CCCCCchhhHHHHHHHHHHHHHHHHHHHccCcH Confidence 011234667889999985 399999999999999999999999988887666 No 133 >protein:vir:96105 Length: 193 # NCBI annotation: hypothetical protein ORF028 # Family: family:all:503 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294445;genbank:gi:149408342;genbank:GeneID:5237224 Probab=98.03 E-value=7.4e-08 Score=59.66 Aligned_cols=111 Identities=19% Similarity=0.196 Sum_probs=63.3 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeeccC Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKED 80 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~ 80 (147) |+++...+++++|++.|+.|....-. -|=+.++.. ...++. .+|+ T Consensus 1 m~~~~~~~~~~~~~~~l~~l~~~~v~------------------------vGi~~~~~~----~~~~~~---~~G~---- 45 (193) T protein:vir:96 1 MSLRRDSELIAAHLQMLRAMRGRSVS------------------------AGWYSTARY----PDKAGG---SVGI---- 45 (193) T ss_pred CeeccchHHHHHHHHHHHHhcCCeEE------------------------EEEcCCCCC----CCcccc---cccc---- Confidence 99998888888888888776432100 011111000 000000 1111 Q ss_pred Ccccchhhhhhcccc-cccccccc-------cccc--------------cccccCCCCCCCcchhhHHHHHHHHHHHHHH Q lcl|NC_015262. 81 NSKIFYGKFLEFGAS-AHKIPIKK-------GKKK--------------GRIINHPGVSPKPFLAPAYESKKDEAKNVMK 138 (147) Q Consensus 81 ~~~~~y~~~vE~GT~-~~~~~~~~-------~~~~--------------~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~ 138 (147) +.+.++.+.|||.. .++.+... +... .....+..+||+|||+++++.+++++.+.++ T Consensus 46 -~va~iAai~EfG~~I~~~~~~~~~~~~~~~g~~~~~~~~k~~~~~~~~~~~~~~v~IPaRPFlr~t~~~~~~~~~~~~~ 124 (193) T protein:vir:96 46 -QVARIARLNEYGGTIDHPGGTRYIRDAIVRGRFVGVRFVRNDFPGETEVTKPHRITIPARPFMRYAWNLFSADRAAIQN 124 (193) T ss_pred -hHHHHHhHHHcCCccccCccceeeeeccccccccccceeccCcceeeEeecceeccCCCcchhhhhHHHHHHHHHHHHH Confidence 23455888999953 22111110 0000 0011234689999999999999999999999 Q ss_pred HHHHHHhcC Q lcl|NC_015262. 139 EILKRGLGL 147 (147) Q Consensus 139 ~~l~~~i~~ 147 (147) +.+++.+.- T Consensus 125 ~~~~~~~~g 133 (193) T protein:vir:96 125 RIAMRLARG 133 (193) T ss_pred HHHHHHHhC Confidence 998887776 No 134 >protein:vir:107757 Length: 189 # NCBI annotation: gp20 # Family: family:all:503 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024868;genbank:gi:48697510;genbank:GeneID:2948378 Probab=97.94 E-value=1e-07 Score=58.95 Aligned_cols=83 Identities=18% Similarity=0.245 Sum_probs=50.7 Q ss_pred Cceeeee--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeec Q lcl|NC_015262. 1 MSVEITT--EGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITK 78 (147) Q Consensus 1 M~~~~~i--~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~ 78 (147) |+..|+- .+++.|.+.|++|.. ..+.||+-. T Consensus 1 M~~~i~~~~~~~~~L~~~lk~l~~-----------------------------------------------k~V~VGi~~ 33 (189) T protein:vir:10 1 MGRVIRKQGPARVKLNAFIKGMND-----------------------------------------------YSVRIGWFS 33 (189) T ss_pred CcceeccCcHHHHHHHHHHHHhhC-----------------------------------------------CeEEEEecC Confidence 6655543 222333322222210 011112110 Q ss_pred -----cCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 79 -----EDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGLGL 147 (147) Q Consensus 79 -----~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~~ 147 (147) ...+.+..+.+.|||+-. ..+||+|||+|++++++++..+.+...+++.|+= T Consensus 34 ~~~y~dG~~vA~Ia~~~E~G~p~-----------------~~IP~RPFlr~t~~~~~~~~~~~l~~~~~~vl~G 90 (189) T protein:vir:10 34 TAKYPDGTPTAYVASIHEFGAPS-----------------RGIPARSFIRPTIAAQQAAWSQQMRFYAKQIVVG 90 (189) T ss_pred CCCCCCcccHHHHHHHHHhcCcC-----------------CCCCCchhhhHHHHHHHHHHHHHHHHHHHHHHhC Confidence 012346778899999742 1389999999999999999999999999987744 No 135 >protein:vir:80970 Length: 112 # NCBI annotation: gp10 # Family: family:all:899 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468396;genbank:gi:157324970;genbank:GeneID:5601405 Probab=97.86 E-value=3.6e-07 Score=55.93 Aligned_cols=112 Identities=17% Similarity=0.156 Sum_probs=76.7 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeeccC Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKED 80 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~ 80 (147) |+++|+|. +..+.+.| .+...++...-++.|..++..-+|.+||.|++|-.. ..++. |.+ T Consensus 1 M~vkV~id-~~~~~~~l-------~~a~~~aq~~~~~ev~~~~~~yVP~~tG~L~~s~~~----~~~g~----I~y---- 60 (112) T protein:vir:80 1 MPIKVRVD-LSKAKGSV-------KKAKERGQFALINQAAADIALYVPFLSGDLSNQYVI----MNDKE----IMW---- 60 (112) T ss_pred CceeEEee-hHHHHHHH-------HHHHHHHHHHHHHHHHHHhhcCCCcccCccccceee----ccCce----EEe---- Confidence 99999886 33333332 334556777778888888889999999999998421 11221 222 Q ss_pred CcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015262. 81 NSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGL 145 (147) Q Consensus 81 ~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i 145 (147) ..+||+++-||...+. ..+.+|+ ...-|+..+.....+++++.+.+.+.+.| T Consensus 61 --~tPYAr~qYY~~~~~~----------~~~~~p~-ag~~W~erak~~~~~~~~~~~~k~~~~~l 112 (112) T protein:vir:80 61 --TSIYARRLYNGINFNF----------TLTHHPL-AGPKWDQRAKVDKLESWIEVAQKAVEEGL 112 (112) T ss_pred --cCchhhHhhhcccCCC----------CcCCCCC-cchhhHHHHHhhhhHHHHHHHHHHHhhcC Confidence 3589988888753221 1233443 33457777999999999999999999999 No 136 >protein:vir:4096 Length: 140 # NCBI annotation: Gp9 protein # Family: family:all:28682 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510990;swissprot:trembl:q8w600;genbank:gi:17488512;uniprot:Q8W600;genbank:GeneID:1260318 Probab=97.81 E-value=1.1e-07 Score=58.81 Aligned_cols=129 Identities=12% Similarity=0.179 Sum_probs=88.2 Q ss_pred Cc--eeeeehhHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhCCCCc---chhhhcceecccccCCCceEEEE Q lcl|NC_015262. 1 MS--VEITTEGFDAVLSKIESMGKSGDKLLNEAVKA-GGNVILQDALPRVSKRS---GKLKDGLKVSGVKKKGGTKYVLV 74 (147) Q Consensus 1 M~--~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~-~a~~v~~~ak~~ap~~t---G~l~~sI~~~~~~~~~~~~~~~V 74 (147) |+ .++++.++++|.+.+.++|.+++++++++|.. ++..+.+.+....|++. |.+++.......+.-... .... T Consensus 1 m~~~~sld~s~~e~L~~~i~r~P~ksE~~IN~~L~tkg~~~~~~~I~~~iPvS~~~k~~~RnK~HAK~s~pl~~~-~~NL 79 (140) T protein:vir:40 1 MCAKWSLEFSDVERLSNLISQIPNKSEAIINKTLETKAVPLVKLNIEKRINLSKNWKGQLLNKNHAQSSGPFNVK-MGNL 79 (140) T ss_pred CCcceecchhhHHHHHHHHHhccchHHHHHHHHHHhhhhHHHHhhhhhccCcCccchhhhccccchhhhhhhhhh-hhhc Confidence 87 67778899999999999999999999999987 67777788889999863 333433322111111111 2222 Q ss_pred eeeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 75 GITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGLGL 147 (147) Q Consensus 75 g~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~~ 147 (147) |..-...+...|..|...|-..+. --||-||+..++...+.+.+.+.++|-+.|.= T Consensus 80 gf~i~~k~kf~YLvfPD~G~G~sn-----------------~~~q~FmerGl~~~t~~i~E~L~~~l~k~in~ 135 (140) T protein:vir:40 80 GFELLTKPKFNYLIFPDQGIGKHN-----------------KTKQDFMQLGVEESSQEIVEMLEQAVFKEIND 135 (140) T ss_pred ceeEeecCcccccccccccCCCCC-----------------cchHHHHHhccccchhHHHHHHHHHHHHHHHH Confidence 332223456678888877754431 24666999999999998888877777777654 No 137 >protein:vir:105773 Length: 131 # NCBI annotation: gp14 # Family: family:all:10996 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224152;genbank:gi:62362227;genbank:GeneID:3342526 Probab=97.81 E-value=1.7e-07 Score=57.68 Aligned_cols=126 Identities=15% Similarity=0.114 Sum_probs=89.7 Q ss_pred eeehhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeeccCCcc Q lcl|NC_015262. 5 ITTEGFDAVLSKIESMGKSGD-KLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKEDNSK 83 (147) Q Consensus 5 ~~i~Gl~el~~~l~~l~~~~~-~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~~~~ 83 (147) |+++|+.+....|+++-+++. +..-+||..+.......|--.+|.||..|-+|= .......++.....||++ T Consensus 1 ikV~Gi~~~~~nl~~~i~~I~~~K~~Ral~~al~~~~~~AA~~TPIDTSTLiNSQ-frei~~ngtritGRVGYS------ 73 (131) T protein:vir:10 1 MPVKGIKRIQMNTRRVLSDIAGIRTEKVLYLVMNAGANHAAVITPVKSSTLINSQ-YKKLEPIPSGMIGRVGYT------ 73 (131) T ss_pred CCcchHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHhhhhhccccchhhhcccc-ceeeeccCceeEEeeccc------ Confidence 899999999999999988876 567778888888888888889999999999985 456667777788888864 Q ss_pred cchhhhhhc--ccccccccccccccccccccCCCCCCCc-chhhHHHHHH-HHHHHHHHHHHHH Q lcl|NC_015262. 84 IFYGKFLEF--GASAHKIPIKKGKKKGRIINHPGVSPKP-FLAPAYESKK-DEAKNVMKEILKR 143 (147) Q Consensus 84 ~~y~~~vE~--GT~~~~~~~~~~~~~~~~~~~~~~~a~P-Fl~pA~~~~~-~~~~~~i~~~l~~ 143 (147) +.||-||.- |+-+..+.+..... |.-| .|.| ||..+|+.+. +.+...|+++++- T Consensus 74 AnYA~yVHda~Gklkgqprp~gkgn----~w~p--~ae~eFL~kgfe~~~~d~i~avik~e~k~ 131 (131) T protein:vir:10 74 ANYAAAVNAAKGKLKGKPRPDGSGN----YWDP--NGEPDFLRKGFERDGLNEIKAIIRQGYKV 131 (131) T ss_pred eeeeeeeecCccccCCCcCCCCCcc----eecC--CCChhhhhhhhhccchHHHHHHHhhhcCC Confidence 678888865 55443333332111 1112 2334 9999998764 4455555555554 No 138 >protein:vir:106728 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944315;genbank:gi:38638614;genbank:GeneID:2657357 Probab=97.68 E-value=1.1e-07 Score=58.63 Aligned_cols=94 Identities=12% Similarity=0.155 Sum_probs=47.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEe-----eeccCCcccchhhhhhcccccccc Q lcl|NC_015262. 25 DKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVG-----ITKEDNSKIFYGKFLEFGASAHKI 99 (147) Q Consensus 25 ~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg-----~~~~~~~~~~y~~~vE~GT~~~~~ 99 (147) =++.++.|++. .+++.... +.-|=+.++ ....+......+| ......+.+.++.+.||||. T Consensus 1 m~v~~k~L~~~----~~~l~~~~-v~VGi~~~a-----~y~d~~~~~~~~~~~~~~~~~~g~~va~ia~~~E~G~~---- 66 (155) T protein:vir:10 1 MSVTRRGLTLP----KDRYRSMS-VKAGVLAGA-----TYPDESGKKLADGTILTKDPRAGLPVAMIAMALNYGTS---- 66 (155) T ss_pred CcchHHHHHHH----HHHHhCCe-eEEeecCCC-----CCccccchhhhhhhhcccccccCCcHHHHHHHHhcCCC---- Confidence 11122222222 22221100 000100000 0000000000000 00112244667888999975 Q ss_pred cccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 100 PIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGLGL 147 (147) Q Consensus 100 ~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~~ 147 (147) .+||+|||||+++++++++.+.+...++.+++- T Consensus 67 ---------------~IP~RPFlr~t~~~~~~~~~~~l~~~~~~~~~~ 99 (155) T protein:vir:10 67 ---------------KLPARPFMEKTIADRSAEWIKGLTVMMTMGYDA 99 (155) T ss_pred ---------------CCCCcchhHHHHHHHHHHHHHHHHHHHHcCCCH Confidence 399999999999999999999999999887776 No 139 >protein:vir:78607 Length: 155 # NCBI annotation: BcepNY3gp06 # Family: family:all:503 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294843;genbank:gi:149882906;genbank:GeneID:5291078 Probab=97.68 E-value=1.2e-07 Score=58.60 Aligned_cols=94 Identities=12% Similarity=0.155 Sum_probs=46.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEe-----eeccCCcccchhhhhhcccccccc Q lcl|NC_015262. 25 DKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVG-----ITKEDNSKIFYGKFLEFGASAHKI 99 (147) Q Consensus 25 ~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg-----~~~~~~~~~~y~~~vE~GT~~~~~ 99 (147) -++.++.|++. .+++.... +.-|=+.++ ....+......+| ......+.+.++.+.||||. T Consensus 1 m~v~~k~L~~~----~~~l~~~~-v~VGi~~~a-----~y~d~~~~~~~~~~~~~~~~~~g~~va~ia~~~E~G~~---- 66 (155) T protein:vir:78 1 MSVTRRGLTLP----KDRYRSMS-VKAGVLAGA-----TYPDESGKKLADGTILTKDPRAGLPVAMIAMALNYGTS---- 66 (155) T ss_pred CcchHHHHHHH----HHHHhCCe-eEEeecCCC-----CCCcccchhhhhhhhcccccccCCcHHHHHHhhhcCCC---- Confidence 11122222222 22221100 000100000 0000000000000 00112234567788899975 Q ss_pred cccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 100 PIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGLGL 147 (147) Q Consensus 100 ~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~~ 147 (147) .+||+|||||++++++++..+.+...++.+++- T Consensus 67 ---------------~IP~RPFlr~t~~~~~~~~~~~l~~~~~~~~~~ 99 (155) T protein:vir:78 67 ---------------KLPARPFMEKTITDRSAEWIKGLTVMMTMGYDA 99 (155) T ss_pred ---------------CCCCcchhhHHHHHHHHHHHHHHHHHHHcCCCH Confidence 399999999999999999999999999887776 No 140 >protein:vir:7993 Length: 108 # NCBI annotation: gp9 # Family: family:all:3937 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817347;genbank:gi:29565775;genbank:GeneID:1259013 Probab=97.67 E-value=7.4e-08 Score=59.66 Aligned_cols=100 Identities=17% Similarity=0.190 Sum_probs=55.6 Q ss_pred Cceeeeeh------h--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEE Q lcl|NC_015262. 1 MSVEITTE------G--FDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYV 72 (147) Q Consensus 1 M~~~~~i~------G--l~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~ 72 (147) |+..-.-+ | +++|. +++ .++..+.+=.+++...+|++.|+++|.+++|+.+.+.....|.. T Consensus 1 ma~gpt~kNP~~KFGvs~~d~~----K~~-----EVn~GvNeFMdE~~~~~K~~SPV~~G~Y~~S~~V~ers~NkGRG-- 69 (108) T protein:vir:79 1 MANGPTRKNPLAKFGVRLDDFD----KLP-----EVNQGVNEFMDEVVDAWKNNSPVGTGAYRDSVQVTERSTNKGRG-- 69 (108) T ss_pred CCCCcccccchhhhcCChhhhh----hch-----hhhhhHHHHHHHHHHHHhhcCCCCchhhHHHHHHHHhhhccCcc-- Confidence 54321111 1 22222 122 23444444455777889999999999999999876555544432 Q ss_pred EEeeeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHH Q lcl|NC_015262. 73 LVGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYES 128 (147) Q Consensus 73 ~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~ 128 (147) .| +...||+|||||||..........+....+.++ |++. T Consensus 70 ~~------G~~~~~AH~VEFGs~hndeyapaqktakqfggt-----------ay~d 108 (108) T protein:vir:79 70 KV------GATDPQAHLVEFGSAHNDEYAPAQKTAKQFGGT-----------AYGD 108 (108) T ss_pred cc------CCcchhhhhhhhhccccccccchhhHHHhhccc-----------ccCC Confidence 22 356799999999998665443332222222222 2222 No 141 >protein:vir:45 Length: 112 # NCBI annotation: gp10 # Family: family:all:899 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463471;swissprot:trembl:q9t1b3;genbank:gi:16798793;uniprot:Q9T1B3;genbank:GeneID:922369 Probab=97.65 E-value=1.3e-06 Score=52.78 Aligned_cols=112 Identities=19% Similarity=0.133 Sum_probs=75.9 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeeccC Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKED 80 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~ 80 (147) |+++|+|.. ..+.. .+.+.+.++....++.|..++..-+|.+||.|++|-.+ ..+|. |-+ T Consensus 1 M~vkv~vn~-~~~~~-------~l~~a~~r~q~~~~~ev~~~~~~yVP~~~G~L~~S~~~----~~~g~----I~y---- 60 (112) T protein:vir:45 1 MPIKVRVDL-SKAKG-------SVKKAKERGQFALINQAAADIALYVPFLSGDLSNQYVI----MNDKE----IMW---- 60 (112) T ss_pred CceeEEeeh-HHHHH-------HHHHHHHHHHHHHHHHHHHHhhcCCccccCccccceee----ccCCe----EEe---- Confidence 999999864 33322 22334556777778888888888999999999998421 11221 111 Q ss_pred CcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015262. 81 NSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGL 145 (147) Q Consensus 81 ~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i 145 (147) ..+||+++=||...+. ....+|+ ...-|+..+.....+++.+.+.+.+++.| T Consensus 61 --~tPYAr~qYY~~~~~~----------~~~~~p~-ag~~W~erak~~~~~~~~~~~~k~~~~gl 112 (112) T protein:vir:45 61 --TSIYARRLYKGINFNF----------TLTHHPL-AGPEWDQRAKIDKMDVWEKVAQKAVEEGL 112 (112) T ss_pred --cChhhHHhhhccccCC----------CCCCCCC-CchhhHHHHHHhhHHHHHHHHHHHHhhcC Confidence 3578888777654321 1223343 33457777999999999999999999999 No 142 >protein:vir:95260 Length: 160 # NCBI annotation: Phage conserved protein # Family: family:all:31735 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944893;genbank:gi:38707833;genbank:GeneID:2744046 Probab=97.57 E-value=8.7e-07 Score=53.82 Aligned_cols=87 Identities=11% Similarity=0.096 Sum_probs=50.8 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeeccC Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKED 80 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~ 80 (147) |--.+...|++.|...|++|+..- +.-|-..+. ..-.+| T Consensus 1 ~~~~~~~~G~~~L~~~~k~l~~~~------------------------V~VGi~~d~-----g~~~dG------------ 39 (160) T protein:vir:95 1 MVKRVIHPARAKLVGAMKNLQTAN------------------------AQVGYFQEQ-----GQHSSG------------ 39 (160) T ss_pred CceeechHhHHHHHHHHHHHhCCe------------------------eEEeecccc-----ccCCCC------------ Confidence 655777777777777666652210 111111110 000011 Q ss_pred CcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHH----HHHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 81 NSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYES----KKDEAKNVMKEILKRGLGL 147 (147) Q Consensus 81 ~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~----~~~~~~~~i~~~l~~~i~~ 147 (147) .+-+..+.|.||||.. +|++||||++|+. ....+++.....+.+.+.. T Consensus 40 ~sv~~vA~~~EfG~~~-------------------iPaRPf~R~tfe~~~~~~~~~~~~~~~~~i~~~~~~ 91 (160) T protein:vir:95 40 FSYPALMYLQEVIGVP-------------------SASGKVYRRLFEITMMLNKQTLLEQTKKNLYKQLSS 91 (160) T ss_pred ccHHHHHhhhhcCccc-------------------CCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 1234568899999864 8999999999973 5566666666666666663 No 143 >protein:vir:94069 Length: 168 # NCBI annotation: putative RNA polymerase # Family: family:all:503 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453622;genbank:gi:84662658;genbank:GeneID:5142579 Probab=97.47 E-value=2.5e-07 Score=56.78 Aligned_cols=102 Identities=18% Similarity=0.220 Sum_probs=54.0 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeeccC Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKED 80 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~ 80 (147) |+ .+.=+|+.-+...+..|... .++.+.-. ....| .|............. .... T Consensus 1 ~~-~~~~~g~~~~~~~~~~l~~~-------~v~vG~l~-----~a~yp--~G~~~~~~~~~~~~~-----------~~~g 54 (168) T protein:vir:94 1 MT-TIARKGVKMPPHLEAQFQSG-------EVKAGVLS-----GSTYP--QMTYTDQRTGKQIED-----------ARGG 54 (168) T ss_pred Cc-cccchhhhhhHHHHHhhhcc-------ceeeeccc-----cCccc--ccccchhhccccccc-----------cccc Confidence 65 35555655544444443211 01100000 01111 111111110000000 0111 Q ss_pred CcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 81 NSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGLGL 147 (147) Q Consensus 81 ~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~~ 147 (147) -+.+.++.++|||+.. +||+||||++++++++++.+.+...++..++. T Consensus 55 ~~va~Ia~~~E~G~~~-------------------IP~RPFlr~t~~~~~~~~~~~~~~~~~~~~~~ 102 (168) T protein:vir:94 55 MPVAVIAQALEYGHGQ-------------------NHPRPFMQQTYAAQYRAWSRDLTLTLKAGAAA 102 (168) T ss_pred ccHHHHHHHHhcCCCC-------------------CCCchhhHHHHHHHHHHHHHHHHHHHhcCCCH Confidence 2346788899999853 89999999999999999999998888776665 No 144 >protein:vir:96288 Length: 100 # NCBI annotation: ORF049 # Family: family:all:180 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240315;genbank:gi:66396010;genbank:GeneID:5133365 Probab=97.44 E-value=4.2e-07 Score=55.56 Aligned_cols=85 Identities=14% Similarity=0.225 Sum_probs=61.7 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeeccC Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKED 80 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~ 80 (147) |+ .++- |-++|.+.|+++++++.+.+++.+.+.|+.|...|..++|+|+|.|++||...- +..+-...++|| T Consensus 13 ma-kvky-G~~dmvk~~~~f~~~i~~~vk~~IakTa~~I~~~Avs~APVD~G~Lk~SI~~dy-k~GGltavI~vG----- 84 (100) T protein:vir:96 13 MA-KVKY-GADSMVVELDKFDKKIEEWVKKGIAKTTTKIYNTAVALAPVDLGFLEESIDFKY-FDGGLSSVISVG----- 84 (100) T ss_pred hh-hhee-chHHHHHHHhcchHHHHHHHHHHHHHHHHHHHhhHHhhccccccccceeeeeee-ecCCeeEEEecc----- Confidence 55 3433 889999999999999999999999999999999999999999999999997532 233334456665 Q ss_pred Ccccchhhh--hh-cccccccccccccccccccccCCCC Q lcl|NC_015262. 81 NSKIFYGKF--LE-FGASAHKIPIKKGKKKGRIINHPGV 116 (147) Q Consensus 81 ~~~~~y~~~--vE-~GT~~~~~~~~~~~~~~~~~~~~~~ 116 (147) +.|+.- -. .=|. + T Consensus 85 ---AeYAIkrmsqllvtv--------------------i 100 (100) T protein:vir:96 85 ---ADYAIKRMSQLLVTV--------------------I 100 (100) T ss_pred ---hhHHHHHHHHHHhhc--------------------C Confidence 345430 00 0000 0 No 145 >protein:vir:80037 Length: 199 # NCBI annotation: gp11 # Family: family:all:503 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468715;genbank:gi:157325295;genbank:GeneID:5601728 Probab=96.81 E-value=6.3e-07 Score=54.57 Aligned_cols=124 Identities=10% Similarity=0.118 Sum_probs=55.9 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHH--HHH-----HHHHHHHHHHHHHHHhCCCCcchhhhcceecccc-----cCCC Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKL--LNE-----AVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVK-----KKGG 68 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~--~~~-----al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~-----~~~~ 68 (147) |+++-.-.-++++++.|+.|....-++ ..+ ++-+++.+.=..++...+.-+-.+.+.+...... ...+ T Consensus 1 m~vt~~~~~~~~~~~~l~~L~~k~v~vGi~~~d~~~~~~Ia~~~E~Ga~I~~~~~~l~Ip~~~a~~~k~~~~~~~~~p~g 80 (199) T protein:vir:80 1 MKVTTDKSTMNKAIRELDQLDRYSLQIGLFGEDDSFIQMIAGVHEFGLTIRPKGKYLTIPTPEAGDRRARDIPGLFKPKG 80 (199) T ss_pred CcccccHHHHHHHHHHHHHhcCCEEEEEEecCCCcchhheeehhhcCCeeecCCceeeecchhhhcccccccCcccccCC Confidence 887655556788888887775421111 100 0000000000000000000000011111100000 0001 Q ss_pred ceEEEEeeeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 69 TKYVLVGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGLGL 147 (147) Q Consensus 69 ~~~~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~~ 147 (147) .....++ ..+--+..+|||+.. ..+||+|||||+++.++++..+.++..+++.|+- T Consensus 81 ~~~~~~~------~~~~~~~~~e~g~~~-----------------~~IP~RPFlr~t~~~~~~~~~~~~~~~~~~vl~g 136 (199) T protein:vir:80 81 KNILAVA------GPDGKLTVMFYLKTE-----------------VNIPERSFLRSTFDEKSNKWGELFEGWIDDVIHG 136 (199) T ss_pred cceeeee------ccccceeeeeecccc-----------------ccCCCCchhHHHHHHHHHHHHHHHHHHHHHHHhC Confidence 0000111 001112335666532 2489999999999999999999999999987654 No 146 >protein:vir:4790 Length: 114 # NCBI annotation: putative minor capsid protein 3 # Family: family:all:899 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150170;swissprot:trembl:q94m41;genbank:gi:15088781;uniprot:Q94M41;genbank:GeneID:955992 Probab=96.81 E-value=4.6e-05 Score=44.38 Aligned_cols=114 Identities=18% Similarity=0.267 Sum_probs=66.4 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeeccC Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKED 80 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~ 80 (147) |++.|+|. ++.+.+.|.. +.++++...-+..+..++..-+|.+||.|++|..+. ..++. |-+ T Consensus 1 M~~kVkv~-l~~~~~~l~~------~~l~r~Q~~~~~ev~~~~~~YVP~~~G~L~~S~~~~---~~~~~----I~y---- 62 (114) T protein:vir:47 1 MNIAIKVD-LQKAKQKLSN------ESMTRGKIAVASKILLDNEQYIPLRGGELRASGRIV---GQGDA----VVY---- 62 (114) T ss_pred CceeEEee-hhHHHHHHHH------HHHHHHHHHHHHHHHHhhccCCcCccCccccceeee---eCCcE----EEe---- Confidence 88888876 5554444421 233455566677788888889999999999986432 11221 111 Q ss_pred CcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 81 NSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGLGL 147 (147) Q Consensus 81 ~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~~ 147 (147) ..+||+++=||...... ...+.+| .....|+..|.....+ .+.+.+.+..+| T Consensus 63 --~tPYAr~qyYg~~~~~~--------~~~~~~p-~~g~~W~eraka~~~~----~~~~~~~k~~g~ 114 (114) T protein:vir:47 63 --GTVYARAQFYGSNGIVT--------FRRYTTP-GTGKRWDQVATSKHAE----EWARAFVKGMGL 114 (114) T ss_pred --cCchhhHhhhcccCCCC--------CCccCCC-CCcchhHHHHHhhhhH----HHHHHHHHhhCC Confidence 35788877776321100 0112233 2334466656655544 456667788899 No 147 >protein:vir:98892 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:899 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164422;genbank:gi:56694912;genbank:GeneID:3197282 Probab=96.80 E-value=2.6e-05 Score=45.68 Aligned_cols=107 Identities=16% Similarity=0.079 Sum_probs=65.9 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeeccC Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKED 80 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~ 80 (147) |.++|++.++...+. .+.+.++...-++.|..++..-+|.+||.|++|-.+.. .+|. |-+ T Consensus 2 mkvkv~~~~~~~~~~---------~~~~~~aq~~~~~ev~~~~~~yVP~~~G~L~~s~~~~s---~~g~----I~y---- 61 (108) T protein:vir:98 2 PKIRVELSGAKDKLS---------PQTQRRGQYAMANQMLQDMNQFVPMEEGILRLTGNISS---DAEE----IYY---- 61 (108) T ss_pred ceeEeeehHHHHHHH---------HHHHHHHHHHHHHHHHHhhcccCcCcCCccccceeecc---CCce----EEe---- Confidence 778888877654221 12334566677788888888899999999999843322 1221 211 Q ss_pred CcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_015262. 81 NSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKR 143 (147) Q Consensus 81 ~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~ 143 (147) ..+||+++=||...+.. +|+ ...-++..|.....+++++.+.++++= T Consensus 62 --~tPYAr~qYYg~~~n~~-------------~p~-ag~~W~eraka~~~~~~~~~~~k~~k~ 108 (108) T protein:vir:98 62 --NTPYAKRRFYEPAYNYT-------------TPG-TGPRWDMKAKRLFISDWERAYMKGANW 108 (108) T ss_pred --cChhhHHhhhccccCCC-------------CCC-CcchhHHHHHhhhhHHHHHHHHHhhcC Confidence 35789888888654322 222 233466667777766666665555444 No 148 >protein:vir:8106 Length: 150 # NCBI annotation: gp10 # Family: family:all:3937 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817687;genbank:gi:29566118;genbank:GeneID:1259312 Probab=96.75 E-value=3.8e-06 Score=50.30 Aligned_cols=134 Identities=19% Similarity=0.230 Sum_probs=59.0 Q ss_pred Cceeeeehh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeec Q lcl|NC_015262. 1 MSVEITTEG--FDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITK 78 (147) Q Consensus 1 M~~~~~i~G--l~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~ 78 (147) |.--+.--| +++|.+-+++ ..+++.-++.-+.+.|.+ -||++.|+++|++++|+.+.+.... |.. .+ T Consensus 1 mgNP~~KFGvS~~e~~K~irn-s~EV~~GiNdFMe~~A~~---~aK~~SPV~~GeY~~S~~V~~ka~N-GRG--~~---- 69 (150) T protein:vir:81 1 MGNPFEKFGVSDSELAKHIRN-SAEVDAGINDFMENEAIP---YAKSISPVDDGEYAASWAVMKKAKN-GRG--VF---- 69 (150) T ss_pred CCCchhhhcCCHHHHHHhhcc-chhhhhhHHHHHHhhhhh---hhhccCCcccchhHHHHHHHhhccc-Ccc--cc---- Confidence 653333223 3444443333 234455555555554443 3688999999999999977554433 321 22 Q ss_pred cCCcccchhhhhhccccccccccc--ccccccccccCCCCCCCcchh-----hHH-HHHHHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 79 EDNSKIFYGKFLEFGASAHKIPIK--KGKKKGRIINHPGVSPKPFLA-----PAY-ESKKDEAKNVMKEILKRGLGL 147 (147) Q Consensus 79 ~~~~~~~y~~~vE~GT~~~~~~~~--~~~~~~~~~~~~~~~a~PFl~-----pA~-~~~~~~~~~~i~~~l~~~i~~ 147 (147) ...+||+|||||||..-...+. +++...-...+..+.---|-| |.- .-..+.+.+-+.-.|+-+|-- T Consensus 70 --G~~~~~AH~VEFGtgadkkqgrgkkgkrgkdgkrtveiddgefrrvgpdtptkaqgiaqkvashfggslkggisk 144 (150) T protein:vir:81 70 --GPKAWYAHFVEFGTGADKKQGRGKKGKRGKDGKRTVEIDDGEFRRVGPDTPTKAQGIAQKVASHFGGSLKGGISK 144 (150) T ss_pred --CccchhhhhhhhccccccccccccccccCcccceeeeecCccceecCCCCchhhhhHHHHHHHhccccccccccc Confidence 3568999999999964332221 111110000110000000100 000 001111222222222222222 No 149 >protein:vir:98557 Length: 149 # NCBI annotation: gp14 # Family: family:all:370 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958069;genbank:gi:41057366;genbank:GeneID:2744228 Probab=96.65 E-value=4.5e-05 Score=44.41 Aligned_cols=119 Identities=11% Similarity=0.133 Sum_probs=69.4 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhC-----CC------------------------C Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGD-KLLNEAVKAGGNVILQDALPRV-----SK------------------------R 50 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~-~~~~~al~~~a~~v~~~ak~~a-----p~------------------------~ 50 (147) |+ .+++|.+.|+.|-..+. ...+..++..++.++...+.+. |. . T Consensus 1 m~------d~~~l~~~L~~ll~~L~~~~~~~ll~~Ig~~l~~~t~~rf~~q~~PdG~~W~p~~~~~~~~k~~~~~~~l~~ 74 (149) T protein:vir:98 1 MS------ELTALQERLTGLIASLSPAARRQMAADIAKKLRASQQQRIRRQQAPDGTPYAARKRQSVRSKKGRIRREMFA 74 (149) T ss_pred Cc------hHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHHhccCCCCcccch Confidence 43 45666666666544332 1234456666666666665443 31 1 Q ss_pred cchhhhcceecccccCCCceEEEEeeeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHH Q lcl|NC_015262. 51 SGKLKDGLKVSGVKKKGGTKYVLVGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKK 130 (147) Q Consensus 51 tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~ 130 (147) +|.|.+||.... +...+.||+. +++..||....||..-.+.+.. ....+|++|||-=+ +..+ T Consensus 75 ~g~l~~sl~~~~-----~~~~~~V~~~---Gs~~~yAa~HQfG~~~r~~~~~---------~~~~iPaRp~LG~s-~~d~ 136 (149) T protein:vir:98 75 RLRTNRFMKAKG-----SDSAAVVEFT---GRVQRMARVHQYGLKDRPNRHS---------RDVQYAARPLLGFT-RDDE 136 (149) T ss_pred hhhhhhhhhhee-----cCCeeEEEec---CcchHHhhHhhccccccccCCC---------cceeccccccCCCC-HHHH Confidence 245556664321 1223455542 4567999999999653221111 12358999998754 5667 Q ss_pred HHHHHHHHHHHHH Q lcl|NC_015262. 131 DEAKNVMKEILKR 143 (147) Q Consensus 131 ~~~~~~i~~~l~~ 143 (147) .++.+.+.+.|.+ T Consensus 137 ~~i~~~i~~~l~~ 149 (149) T protein:vir:98 137 QMIEDIIIRHLGK 149 (149) T ss_pred HHHHHHHHHHhhC Confidence 8899999999999 No 150 >protein:vir:5703 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839862;genbank:gi:30065717;genbank:GeneID:1260611 Probab=96.41 E-value=9.5e-05 Score=42.63 Aligned_cols=120 Identities=11% Similarity=0.049 Sum_probs=70.0 Q ss_pred ehhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhC-----C----C--------------------Ccchhhh Q lcl|NC_015262. 7 TEGFDAVLSKIESMGKSGD-KLLNEAVKAGGNVILQDALPRV-----S----K--------------------RSGKLKD 56 (147) Q Consensus 7 i~Gl~el~~~l~~l~~~~~-~~~~~al~~~a~~v~~~ak~~a-----p----~--------------------~tG~l~~ 56 (147) +..+++|...|..+-..+. ...+..++..++.++...+.+. | + .+|.+.. T Consensus 1 m~~~~~l~~~L~~~l~~L~~~~~~~l~~~Ig~~l~~~~~~rf~~q~~PdG~~W~p~k~~~~~~k~~~~~~~l~~~~~l~~ 80 (150) T protein:vir:57 1 MNEFKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSARKKTGRVKRKMFAKLITSR 80 (150) T ss_pred CchHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccChHHHHHhccCCCcccchhhhhcc Confidence 5567777777766544432 2234456666777777666543 3 1 1123333 Q ss_pred cceecccccCCCceEEEEeeeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHH Q lcl|NC_015262. 57 GLKVSGVKKKGGTKYVLVGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNV 136 (147) Q Consensus 57 sI~~~~~~~~~~~~~~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~ 136 (147) ||... .+...+.||+. .+++..||....||-...+.+.. ....+||+|||-=+ +..+.++.+. T Consensus 81 sl~~~-----~~~~~a~vg~~--~G~~~~yAaiHQfG~~~r~~~~~---------~~~~iPaRp~LG~s-~~d~~~i~~~ 143 (150) T protein:vir:57 81 FLHIR-----ASPEQASMEFY--GGKSPKIASVHQFGLSEETRKDG---------KKIDYPARPLLGFT-GEDVQMIEEI 143 (150) T ss_pred ceeee-----eeCcEEEEEee--cCCchhhhhhhhccccccccCCC---------ceeecCCcccCCCC-HHHHHHHHHH Confidence 33321 12233455542 23567999999999543221111 12248999999876 5557889999 Q ss_pred HHHHHHH Q lcl|NC_015262. 137 MKEILKR 143 (147) Q Consensus 137 i~~~l~~ 143 (147) +.+.|.+ T Consensus 144 i~~~l~r 150 (150) T protein:vir:57 144 ILAHLDR 150 (150) T ss_pred HHHHHhC Confidence 9999999 No 151 >protein:vir:2026 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046769;genbank:gi:9630340;genbank:GeneID:1261511 Probab=96.34 E-value=0.00011 Score=42.27 Aligned_cols=120 Identities=11% Similarity=0.050 Sum_probs=71.2 Q ss_pred ehhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhC-----C----C--------------------Ccchhhh Q lcl|NC_015262. 7 TEGFDAVLSKIESMGKSGD-KLLNEAVKAGGNVILQDALPRV-----S----K--------------------RSGKLKD 56 (147) Q Consensus 7 i~Gl~el~~~l~~l~~~~~-~~~~~al~~~a~~v~~~ak~~a-----p----~--------------------~tG~l~~ 56 (147) +..+++|...|..+-..+. ...+..++..++.++...+.+. | . .+|.|.+ T Consensus 1 ~~~~~~l~~~L~~ll~~l~~~~~~~l~~~Ig~~l~~~~~~rf~~q~~PdG~~W~p~k~~~~~~k~g~~~~~l~~~~~l~~ 80 (150) T protein:vir:20 1 MNEFKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSVRKKTGRVKRKMFAKLITSR 80 (150) T ss_pred CchHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHHHhccCCCccccchhhhhh Confidence 5566677777766644332 2234456666666766665543 2 1 1345555 Q ss_pred cceecccccCCCceEEEEeeeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHH Q lcl|NC_015262. 57 GLKVSGVKKKGGTKYVLVGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNV 136 (147) Q Consensus 57 sI~~~~~~~~~~~~~~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~ 136 (147) ||.... +...+.||+.. +++..||...-||-........ ....+||+|||-=+ +..++++.+. T Consensus 81 sl~~~~-----~~~~~~vg~~~--Gs~~~yAa~HQfG~~~~~~~~~---------~~~~iPaRp~LG~s-~~d~~~i~~~ 143 (150) T protein:vir:20 81 FLHIRA-----SPEQASMEFYG--GKSPKIASVHQFGLSEENRKDG---------KKIDYPARPLLGFT-GEDVQMIEEI 143 (150) T ss_pred hhheee-----cCcEEEEEeeC--CcchhhhhhhhcccccccccCC---------CceeccccccCCCC-HHHHHHHHHH Confidence 554321 22345555432 3567899999999542211110 12358999999866 4557899999 Q ss_pred HHHHHHH Q lcl|NC_015262. 137 MKEILKR 143 (147) Q Consensus 137 i~~~l~~ 143 (147) +.+.|.| T Consensus 144 i~~~l~k 150 (150) T protein:vir:20 144 ILAHLER 150 (150) T ss_pred HHHHHhC Confidence 9999999 No 152 >protein:vir:96763 Length: 177 # NCBI annotation: putative phage-related protein # Family: family:all:1091 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039824;genbank:gi:126010915;genbank:GeneID:5076273 Probab=96.06 E-value=0.00022 Score=40.62 Aligned_cols=138 Identities=11% Similarity=0.085 Sum_probs=75.0 Q ss_pred Cceeeeehh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC----CcchhhhcceecccccCCCceEEEEe Q lcl|NC_015262. 1 MSVEITTEG-FDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSK----RSGKLKDGLKVSGVKKKGGTKYVLVG 75 (147) Q Consensus 1 M~~~~~i~G-l~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~----~tG~l~~sI~~~~~~~~~~~~~~~Vg 75 (147) |.++|++++ ++.+.+.|..++..+.+++..|+..++..+..++...+.. ....++..+.+.... ......+.++ T Consensus 5 ~~l~idv~~~l~~i~~~l~~~~~~~~~A~~rAlNrta~~~rt~~~r~v~~~~~i~~k~ir~r~~~~~a~-~~~~~~i~~~ 83 (177) T protein:vir:96 5 FEMKIDVSREAEDIAAMVAATTKQLELAAQRAMTKAGQWLRTHSVRELGQQLGIKQEPLKKRFRVYPQR-QKGEVRFWVG 83 (177) T ss_pred ceeEEehhHHHHHHHHHHhhcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHhhheeeccC-CCcEEEEEEe Confidence 888899887 5556666677788888888888888888887777665533 346677777655433 2233333322 Q ss_pred eeccCCcccchhhhhhccccccccccc------ccccc-----------cccccC-------CCCCCCcchhhHHHHHHH Q lcl|NC_015262. 76 ITKEDNSKIFYGKFLEFGASAHKIPIK------KGKKK-----------GRIINH-------PGVSPKPFLAPAYESKKD 131 (147) Q Consensus 76 ~~~~~~~~~~y~~~vE~GT~~~~~~~~------~~~~~-----------~~~~~~-------~~~~a~PFl~pA~~~~~~ 131 (147) .+ . -.. .-||+......+- ...+| ..+.+. ...|-.|=+..+++...+ T Consensus 84 ~~--~---i~l---~~~~~~r~t~~Gv~~g~~~~~gaFia~~~~g~~~Vf~R~gk~R~PI~~~~~pi~~~~~~~~e~~~~ 155 (177) T protein:vir:96 84 LD--P---IGV---YRLGTPKVTQKGVKVNRNEYDGAFISPMKSNYPLVFKRRGKERLPIDLVDEDIDEPAMEVVERWER 155 (177) T ss_pred cc--c---eeh---hhcccCCCCccceEEeeEEcCCceeccCCCCCceEEEEecCCccceEEEEcCchHHHHHHHHHHHH Confidence 11 1 111 1123211110000 00000 000000 112223335677777777 Q ss_pred HHHHHHHHHHHHHhcC Q lcl|NC_015262. 132 EAKNVMKEILKRGLGL 147 (147) Q Consensus 132 ~~~~~i~~~l~~~i~~ 147 (147) ++.+.+...|.++|+- T Consensus 156 ~~~~~~~~~l~~Ei~~ 171 (177) T protein:vir:96 156 RVFQRFKELFEQEARA 171 (177) T ss_pred HHHHHHHHHHHHHHHH Confidence 7878788888888877 No 153 >protein:vir:6071 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878212;genbank:gi:33438911;genbank:GeneID:1457746 Probab=96.05 E-value=0.00017 Score=41.28 Aligned_cols=120 Identities=11% Similarity=0.049 Sum_probs=68.8 Q ss_pred ehhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhC-----CC------------------------Ccchhhh Q lcl|NC_015262. 7 TEGFDAVLSKIESMGKSGD-KLLNEAVKAGGNVILQDALPRV-----SK------------------------RSGKLKD 56 (147) Q Consensus 7 i~Gl~el~~~l~~l~~~~~-~~~~~al~~~a~~v~~~ak~~a-----p~------------------------~tG~l~~ 56 (147) +..+++|...|..+-..+. ...+..++..|+.++...+.+. |. .+|.+.. T Consensus 1 ~~~~~~l~~~L~~~l~~L~~~~~~~l~r~Ig~~l~~~~~~Rf~~q~~PdG~~W~p~~~~~~~~k~~~~~~~l~~~~~l~~ 80 (150) T protein:vir:60 1 MNEFKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSARKKTGRVKRKMFAKLITSR 80 (150) T ss_pred CchHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccChHHHHHhhcCCCccchhhhhhcc Confidence 5566666666665544432 1234455666777766665543 31 1223333 Q ss_pred cceecccccCCCceEEEEeeeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHH Q lcl|NC_015262. 57 GLKVSGVKKKGGTKYVLVGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNV 136 (147) Q Consensus 57 sI~~~~~~~~~~~~~~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~ 136 (147) +|... .+...+.||+. .+++..||....||-.....+. .....+|++|||-=+ ++.+.++.+. T Consensus 81 sl~~~-----~~~~~a~vg~~--~Gt~~~yAaiHQfG~~~~~~~~---------~~~~~iPaRp~LG~s-~~d~~~i~~~ 143 (150) T protein:vir:60 81 FLHIR-----ASPEQASMEFY--GGKSPKIASVHQFGLSEENRKD---------GKKIDYPARPLLGFT-GEDVQMIEEI 143 (150) T ss_pred eeeee-----eeCcEEEEEee--CCCchhhhhhhhccccccccCC---------CCceecCCcccCCCC-HHHHHHHHHH Confidence 33321 12233455542 2356799999999954321111 112358999999866 5557889999 Q ss_pred HHHHHHH Q lcl|NC_015262. 137 MKEILKR 143 (147) Q Consensus 137 i~~~l~~ 143 (147) +.+.|.+ T Consensus 144 i~~~l~r 150 (150) T protein:vir:60 144 ILAHLDR 150 (150) T ss_pred HHHHHhC Confidence 9999999 No 154 >protein:vir:1581 Length: 116 # NCBI annotation: minor capsid protein # Family: family:all:899 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695163;swissprot:trembl:o03933;genbank:gi:23455806;uniprot:O03933;genbank:GeneID:955512 Probab=95.93 E-value=0.00015 Score=41.49 Aligned_cols=116 Identities=16% Similarity=0.090 Sum_probs=65.1 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeeccC Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKED 80 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~ 80 (147) |++.|+|. ++.+.++|. .+.++++-..-+..+..++..-+|.+||++..|.... +...++.+. + T Consensus 1 M~ikVkv~-l~~~~~~~~------~~~~~r~Q~~l~~qv~~~m~~YVP~~tg~~~ls~~~~-~~~~~~~I~----y---- 64 (116) T protein:vir:15 1 MAFRINVD-LDGFMDQTS------LDNVKRGQYALVNQAMYDMEQFVPKDRPEEPLRQSVH-ATSDGSEIT----Y---- 64 (116) T ss_pred CCceEEee-hhHhhhhhh------HHHHHHHHHHHHHHHHHhhhccCCcccCCccccccee-eecCCceEE----e---- Confidence 88888876 444444332 1244455666677788888889999998855443211 111122221 1 Q ss_pred CcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_015262. 81 NSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILK 142 (147) Q Consensus 81 ~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~ 142 (147) ..+||+++=||.... ......+.+|+..++ |+..|-....+...+.+.++++ T Consensus 65 --~tPYAr~qyYg~~~~-------~~~~~~~t~p~ag~~-W~eraK~~h~~~w~~~~~k~~~ 116 (116) T protein:vir:15 65 --STPYAKAQFYGIIND-------KYPVHNYTTPGTTKR-WDLKAKSMFMSSWIDTFTKGMK 116 (116) T ss_pred --cCchhHHHhcccccC-------CCCcccccCCCCCcc-hhHHHHhhhHHHHHHHHHHhcC Confidence 357887665654211 111223445555554 5555777777777666666666 No 155 >protein:vir:396 Length: 184 # NCBI annotation: gp11 # Family: family:all:869 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046906;genbank:gi:9630476;genbank:GeneID:1261650 Probab=95.93 E-value=0.00016 Score=41.42 Aligned_cols=135 Identities=16% Similarity=0.256 Sum_probs=65.1 Q ss_pred eeehhHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhCCC----CcchhhhcceecccccCCCceEEEEeeecc Q lcl|NC_015262. 5 ITTEGFDAVLSKIESMGKS-GDKLLNEAVKAGGNVILQDALPRVSK----RSGKLKDGLKVSGVKKKGGTKYVLVGITKE 79 (147) Q Consensus 5 ~~i~Gl~el~~~l~~l~~~-~~~~~~~al~~~a~~v~~~ak~~ap~----~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~ 79 (147) ++|+||+++++.|..+++. +.+++..|+..++..+..++...+.. ....++..+.+...... ...+.|..... T Consensus 1 ~~v~~l~~~~~~L~~l~~~~v~kA~~rAiNrt~~~~rt~~~r~v~~~~~i~~~~ir~r~~~~kas~~--~l~a~I~~~~~ 78 (184) T protein:vir:39 1 MSLKGLEQAIENLNSISKTAVPRASAQAVNRVANRAVSRSVAVVSKDTRVPRKLVKQRARVKRATVN--KPRALIRVNRG 78 (184) T ss_pred CchHHHHHHHHHHhccCHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHhhheecccCCC--CeEEEEEEecc Confidence 9999999999999999665 68888888888888887777665543 34456666654333222 22222211110 Q ss_pred CC---------------------------------cccchhhhhhcccccccccccc-cccccccccCCCCC-CCcchhh Q lcl|NC_015262. 80 DN---------------------------------SKIFYGKFLEFGASAHKIPIKK-GKKKGRIINHPGVS-PKPFLAP 124 (147) Q Consensus 80 ~~---------------------------------~~~~y~~~vE~GT~~~~~~~~~-~~~~~~~~~~~~~~-a~PFl~p 124 (147) .- ..+|.+. .-.|+......-.+ .... ..+. .| +.| +.. T Consensus 79 ~i~l~~~g~~~~k~~~~~~~~~~~~~~~~~g~~~~~gaFia~-~~~G~~~Vf~R~gk~R~PI-~~~~---~~i~~~-~~e 152 (184) T protein:vir:39 79 NLPAIKLGTASVRLSRRKRDKKGANSVLRIGPFRFPGGFIQQ-LKNGRWHVMRRTSKPRYPI-EVVS---IPLAAP-LTT 152 (184) T ss_pred ceeeeeccccccccCccccccccccceeeecceecCcceeee-cCCCceEEEEEecCcccce-eEEE---cCchHH-HHH Confidence 00 0000000 00111000000000 0000 0111 11 122 233 Q ss_pred HHHHH-----HHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 125 AYESK-----KDEAKNVMKEILKRGLGL 147 (147) Q Consensus 125 A~~~~-----~~~~~~~i~~~l~~~i~~ 147 (147) +++.. .+.+.+.|..+|..+|++ T Consensus 153 ~~~~~~~~~~~~~~~~el~~~l~~~L~~ 180 (184) T protein:vir:39 153 AFKEELPKLMESDMPKELRASLTNQLRL 180 (184) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 44332 245566677777777777 No 156 >protein:vir:9823 Length: 118 # NCBI annotation: putative minor capsid protein # Family: family:all:899 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795585;genbank:gi:28876336;genbank:GeneID:1257873 Probab=95.92 E-value=0.00019 Score=40.99 Aligned_cols=116 Identities=18% Similarity=0.166 Sum_probs=66.4 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeeccC Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKED 80 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~ 80 (147) |.+.|++.++...+. .+.++++...-++.|..++..-+|.+||.|++|..+. ++. |-+ T Consensus 2 ~kV~vdl~~~~~~ls---------~~~~~k~Q~~~~~ev~~~~~~YVP~~tG~Lk~S~~i~-----~~~----I~Y---- 59 (118) T protein:vir:98 2 AKVVVELGGIKRKVS---------PQALAKGKLIMNNQVMMSMNPYVPYRDGALRGSSRAN-----SVG----VTW---- 59 (118) T ss_pred ceeeechhHHhhhhh---------HHHHHHHHHHHHHHHHHHhhcCCCCccCccccceeec-----CCe----eEE---- Confidence 778888887765332 1233455666677888888899999999999986432 121 111 Q ss_pred CcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 81 NSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGLGL 147 (147) Q Consensus 81 ~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~~ 147 (147) ..+||+.+=||...+.. .......+.||+..++=|.| +. ......+...+.+.++++| T Consensus 60 --~tPYAr~qYY~~~~~~~----~g~~~~~~~~p~~g~~Wd~R-~k--a~~~~~~~w~~~~~k~~g~ 117 (118) T protein:vir:98 60 --SGPHARAQFYGGAYNKY----KSFKFKKYTTPGTGKRWDKR-AL--ANATIVKDWEKSLLRGMGF 117 (118) T ss_pred --CCchhhHhhhccccCCC----CccccccccCCCCCCcccch-hh--cchhhhHHHHHHHHHhcCC Confidence 34788766565321111 11222333444433333322 22 1123445677888899999 No 157 >protein:vir:3036 Length: 118 # NCBI annotation: minor capsid protein # Family: family:all:899 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438149;genbank:gi:16271812;genbank:GeneID:929237 Probab=95.92 E-value=0.00019 Score=40.99 Aligned_cols=116 Identities=18% Similarity=0.166 Sum_probs=66.4 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeeccC Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKED 80 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~ 80 (147) |.+.|++.++...+. .+.++++...-++.|..++..-+|.+||.|++|..+. ++. |-+ T Consensus 2 ~kV~vdl~~~~~~ls---------~~~~~k~Q~~~~~ev~~~~~~YVP~~tG~Lk~S~~i~-----~~~----I~Y---- 59 (118) T protein:vir:30 2 AKVVVELGGIKRKVS---------PQALAKGKLIMNNQVMMSMNPYVPYRDGALRGSSRAN-----SVG----VTW---- 59 (118) T ss_pred ceeeechhHHhhhhh---------HHHHHHHHHHHHHHHHHHhhcCCCCccCccccceeec-----CCe----eEE---- Confidence 778888887765332 1233455666677888888899999999999986432 121 111 Q ss_pred CcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 81 NSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGLGL 147 (147) Q Consensus 81 ~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~~ 147 (147) ..+||+.+=||...+.. .......+.||+..++=|.| +. ......+...+.+.++++| T Consensus 60 --~tPYAr~qYY~~~~~~~----~g~~~~~~~~p~~g~~Wd~R-~k--a~~~~~~~w~~~~~k~~g~ 117 (118) T protein:vir:30 60 --SGPHARAQFYGGAYNKY----KSFKFKKYTTPGTGKRWDKR-AL--ANATIVKDWEKSLLRGMGF 117 (118) T ss_pred --CCchhhHhhhccccCCC----CccccccccCCCCCCcccch-hh--cchhhhHHHHHHHHHhcCC Confidence 34788766565321111 11222333444433333322 22 1123445677888899999 No 158 >protein:vir:102190 Length: 93 # NCBI annotation: gp21 # Family: family:all:2713 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655217;genbank:gi:109522797;genbank:GeneID:4157429 Probab=95.75 E-value=8.7e-05 Score=42.84 Aligned_cols=91 Identities=14% Similarity=0.207 Sum_probs=64.5 Q ss_pred HHHHHHHHHHHHHHHHHHhCCC--CcchhhhcceecccccCCCceEEEEeeeccCCcccchhhhhhcccccccccccccc Q lcl|NC_015262. 28 LNEAVKAGGNVILQDALPRVSK--RSGKLKDGLKVSGVKKKGGTKYVLVGITKEDNSKIFYGKFLEFGASAHKIPIKKGK 105 (147) Q Consensus 28 ~~~al~~~a~~v~~~ak~~ap~--~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~ 105 (147) +......+|..+..+||.+||. +||+.|..|.-..... |...+.|.+. -...|..|+|.++... T Consensus 1 ~~~~~d~aa~~le~~aK~nApW~DRTg~AR~~l~~~~~~~--g~~~~~i~ls----h~v~Yg~~LE~a~~~k-------- 66 (93) T protein:vir:10 1 MKATSNYHAVEGTAHMKEHAPWTDRTGAARAGLHAVASTP--QPDRYEIVFA----HTVHYGIWLEIANSGR-------- 66 (93) T ss_pred CchhhhHHHHHHHHHHhcCCCccccchhhhhhhccccccc--CCceEEEEEe----cCeeccceEEeecCCC-------- Confidence 5555666788999999999996 7999999995432212 1122333332 2358999999987631 Q ss_pred cccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_015262. 106 KKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGLG 146 (147) Q Consensus 106 ~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~ 146 (147) -.-+.|+++...+++++-++..+.+ || T Consensus 67 -------------yaIl~Ptv~~~~~~i~~g~~~ll~~-l~ 93 (93) T protein:vir:10 67 -------------YEIIMPTVHHEGKLMAQRLRGLLGR-LR 93 (93) T ss_pred -------------ccchhhhHHHHHHHHHHHHHHHHHh-cC Confidence 1269999999999999999886654 55 No 159 >protein:vir:79179 Length: 155 # NCBI annotation: gp39, phage virion morphogenesis protein # Family: family:all:370 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111070;genbank:gi:134288746;genbank:GeneID:4960698 Probab=95.73 E-value=0.0003 Score=39.87 Aligned_cols=125 Identities=11% Similarity=0.085 Sum_probs=68.8 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhC-----C----C--------------Ccchhhh Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDK-LLNEAVKAGGNVILQDALPRV-----S----K--------------RSGKLKD 56 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~-~~~~al~~~a~~v~~~ak~~a-----p----~--------------~tG~l~~ 56 (147) |+- .+.+|.+.|+.|-..+.. .-+..++.-++.++...+.+. | + .+|.+.+ T Consensus 1 m~~-----~~~~l~~~l~~ll~~l~~~~~~~l~r~Ig~~l~~~t~~Rf~~q~~PDG~~W~prk~~~~~~~~~~~~g~~~~ 75 (155) T protein:vir:79 1 MTD-----DLQALERWAGGLLAKLSPAARRQLLRELGRDLRRAQQSRVAAQRNPDGSAYEPRKVKAGGKRLREKAGRVKR 75 (155) T ss_pred Cch-----HHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchhhhhhhhhcccCcccc Confidence 663 455566666555443321 223445566666666655442 3 1 1344333 Q ss_pred cceec------ccccCCCceEEEEeeeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHH Q lcl|NC_015262. 57 GLKVS------GVKKKGGTKYVLVGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKK 130 (147) Q Consensus 57 sI~~~------~~~~~~~~~~~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~ 130 (147) ++-.. ..+...+...+.||+. +++..||...-||....+... .....+||+|||-=+ +..+ T Consensus 76 ~~m~~~l~~a~~l~~~~~~d~a~Vg~~---Gs~~~yAaiHQfG~~~r~~~~---------~~~v~iPaRp~LGls-~~d~ 142 (155) T protein:vir:79 76 EAMFRKLRTARYLRIDVDSTGLAIGFD---ERLSRIARVHQEGQKAPVEPG---------GPLAQYPVRVVLGFS-DADR 142 (155) T ss_pred hhhhhhhhhhheeeeeecCcEEEEEec---CcchhhhhhhhcCCcccCCCC---------CcccccccccccCCC-HHHH Confidence 22110 0111223334566552 356789999999964322111 112359999999766 4467 Q ss_pred HHHHHHHHHHHHH Q lcl|NC_015262. 131 DEAKNVMKEILKR 143 (147) Q Consensus 131 ~~~~~~i~~~l~~ 143 (147) .+|.+.+.+.|.| T Consensus 143 ~~I~~~i~~~l~r 155 (155) T protein:vir:79 143 ELVRDRLLRELTR 155 (155) T ss_pred HHHHHHHHHHhhC Confidence 8999999999999 No 160 >protein:vir:4514 Length: 168 # NCBI annotation: unknown # Family: family:all:2152 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599040;genbank:gi:19548998;genbank:GeneID:935228 Probab=95.66 E-value=0.00026 Score=40.23 Aligned_cols=139 Identities=11% Similarity=0.166 Sum_probs=78.1 Q ss_pred Cc---eeeeehhHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHHHHhC---C-CCcchhhhcceeccccc Q lcl|NC_015262. 1 MS---VEITTEGFDAVLSKIESMGKSGDKLLNEAV--------KAGGNVILQDALPRV---S-KRSGKLKDGLKVSGVKK 65 (147) Q Consensus 1 M~---~~~~i~Gl~el~~~l~~l~~~~~~~~~~al--------~~~a~~v~~~ak~~a---p-~~tG~l~~sI~~~~~~~ 65 (147) |. +-|+++-.+++.=. ...++.+. +.+...|...+++.. | ..||.|..||....++. T Consensus 1 m~~~~lHvdF~qp~~~~Fn--------r~riRraFv~igq~hmr~ArrlV~rrgrs~pGe~P~~qTGrLa~SIgy~Vpra 72 (168) T protein:vir:45 1 MTTSFLHVDFQQPAEMRFN--------RARVRRAFVTIGQRHMRDARRLVMRHARSAPGENPGYQTGRLARSIGYMVPRA 72 (168) T ss_pred CCccceeeeeecCCceeec--------HHHHHHHHHHHhHHHHHHHHHHHhhcccccCCCCCcchhhhhhhhhhhccccc Confidence 43 22233322222111 12344443 444444444443321 2 46999999997654443 Q ss_pred CC--CceEEEEeeeccCC------cccchhhhhhcccccccccccccccccccccCCCCCC-CcchhhHHHHHHHHHHHH Q lcl|NC_015262. 66 KG--GTKYVLVGITKEDN------SKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSP-KPFLAPAYESKKDEAKNV 136 (147) Q Consensus 66 ~~--~~~~~~Vg~~~~~~------~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a-~PFl~pA~~~~~~~~~~~ 136 (147) .. -+--+.|-++...+ ...+|=-|+.||........+..........+..+.| +-||..+++..+...... T Consensus 73 s~~rpG~mvkIaPNqk~G~g~r~i~gdfYPafL~YGVr~gakr~r~h~rga~ggsgwriaPR~Nym~~~l~~~~~wt~~~ 152 (168) T protein:vir:45 73 SKHRPGFMARIAPNQRNGEGNRRITGDFYPAFLFYGVRGGAKRRRSHHRGASGGSGWRLAPRNNFMVETLEKNRSWTRYF 152 (168) T ss_pred cCCCCceEEEecCCCCCCCCCCccccccchhhhhhhhhcchhhhhhhhccccCCCcceeccchhhHHHHHHhhHHHHHHH Confidence 22 23445666655554 5678989999997543211111110111111223444 469999999999999999 Q ss_pred HHHHHHHHhcC Q lcl|NC_015262. 137 MKEILKRGLGL 147 (147) Q Consensus 137 i~~~l~~~i~~ 147 (147) +..+|++.|+. T Consensus 153 L~r~L~~sLrp 163 (168) T protein:vir:45 153 LARELRKSLKP 163 (168) T ss_pred HHHHHHHhcCc Confidence 99999999999 No 161 >protein:vir:97088 Length: 157 # NCBI annotation: hypothetical protein # Family: family:all:2714 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453568;genbank:gi:84662603;genbank:GeneID:5142503 Probab=95.55 E-value=0.00039 Score=39.26 Aligned_cols=132 Identities=13% Similarity=0.008 Sum_probs=82.9 Q ss_pred CceeeeehhHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHH-------HHhC-----CCCcchhhhcceecccccCC Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSG-DKLLNEAVKAGGNVILQDA-------LPRV-----SKRSGKLKDGLKVSGVKKKG 67 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~-~~~~~~al~~~a~~v~~~a-------k~~a-----p~~tG~l~~sI~~~~~~~~~ 67 (147) |. +++++||.+.++.|.....++ .+++..+...--+.++..+ +.+. +.++|.......++...... T Consensus 5 ~~-~~d~s~l~~~l~~l~~~~~~v~R~A~~~ga~vv~dear~~aP~~tG~LkksI~~~~~~~~s~~g~~~~~Vg~~~~~a 83 (157) T protein:vir:97 5 IR-SVDITGILAGLETVVEHSSDVVRTMTYESAVAVRESAKAFVNDETGKLRNNLYVAYSPEESVEGIQTYAVSWRKKAA 83 (157) T ss_pred ee-cccHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhheeeeeccccCCCceEEEEEeecCCcc Confidence 97 699999999999998887765 5677776666666665554 2222 33444443332222211110 Q ss_pred C-ceEEEEee-----eccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHH Q lcl|NC_015262. 68 G-TKYVLVGI-----TKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEIL 141 (147) Q Consensus 68 ~-~~~~~Vg~-----~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l 141 (147) - ...+.-|. ........||+.|+|+||..+.+ .+|++. |=|.-.-++..+.+.++|.++| T Consensus 84 ~~g~~vEfG~~~~~~~~~~~~~~~~~~~~~~~t~~~~P------------a~PFlR--PA~d~~k~~a~~~~~~~l~k~I 149 (157) T protein:vir:97 84 PHGHLLEFGHWQTHAAYRDKDGQWYSSKVKLVNPKWIP------------AKPFLR--PGYDSVAMQIPDIARAAGAKKY 149 (157) T ss_pred ceeeeeecCcccccccccCCcccccccccccCCCCcCC------------CCcccc--hHHHHhHHHHHHHHHHHHHHHH Confidence 0 00111121 11222346999999999965322 124443 6799999999999999999999 Q ss_pred HHHhcC Q lcl|NC_015262. 142 KRGLGL 147 (147) Q Consensus 142 ~~~i~~ 147 (147) .+.|+= T Consensus 150 ~e~l~g 155 (157) T protein:vir:97 150 AELQRG 155 (157) T ss_pred HHHhcC Confidence 999988 No 162 >protein:vir:3427 Length: 192 # NCBI annotation: tail component # Family: family:all:869 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040590;genbank:gi:9626254;genbank:GeneID:2703485 Probab=95.54 E-value=0.00028 Score=40.03 Aligned_cols=132 Identities=11% Similarity=0.204 Sum_probs=66.3 Q ss_pred eeehhHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhCCCCc----chhhhcceecccccCCCceEEEEeee-- Q lcl|NC_015262. 5 ITTEGFDAVLSKIESMGKS-GDKLLNEAVKAGGNVILQDALPRVSKRS----GKLKDGLKVSGVKKKGGTKYVLVGIT-- 77 (147) Q Consensus 5 ~~i~Gl~el~~~l~~l~~~-~~~~~~~al~~~a~~v~~~ak~~ap~~t----G~l~~sI~~~~~~~~~~~~~~~Vg~~-- 77 (147) ++|+||+++++.|+.|++. +.++...|+...|..+..++...+...+ ..++..++............+.+.-+ T Consensus 1 ~~ik~l~~~~~~L~~i~~~~vp~A~~rAiNrta~~a~t~~~r~v~~e~~I~~k~Ir~r~r~~kAs~~~l~a~I~~~~~~l 80 (192) T protein:vir:34 1 MAIKGLEQAVENLSRISKTAVPGAAAMAINRVASSAISQSASQVARETKVRRKLVKERARLKRATVKNPQARIKVNRGDL 80 (192) T ss_pred CcchhHHHHHHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCHHHHHhhheeccccCCCceEEEEEeccce Confidence 6778999999999999776 6788888888888888777766665443 35555555543322221111111100 Q ss_pred ------------------------------------ccCCcccchhhhhhcccccccccccccccccccccCC------- Q lcl|NC_015262. 78 ------------------------------------KEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHP------- 114 (147) Q Consensus 78 ------------------------------------~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~------- 114 (147) +.....+|++.. -.|. +..+....+.. T Consensus 81 ~~~~l~~~~~~~~rr~~~~~~~~~~~~~~g~~~k~Gk~~f~gaFia~m-~ng~---------~~Vf~R~~gk~R~PIe~v 150 (192) T protein:vir:34 81 PVIKLGNARVVLSRRRRRKKGQRSSLKGGGSVLVVGNRRIPGAFIQQL-KNGR---------WHVMQRVAGKNRYPIDVV 150 (192) T ss_pred eeeeecccccccccccccccccccccccccceeeecceecCCcccccC-CCCC---------ceeEEEccCCCccceeEE Confidence 000001111110 0111 11111100110 Q ss_pred CCC-CCcchhhHHHHHHH-----HHHHHHHHHHHHHhcC Q lcl|NC_015262. 115 GVS-PKPFLAPAYESKKD-----EAKNVMKEILKRGLGL 147 (147) Q Consensus 115 ~~~-a~PFl~pA~~~~~~-----~~~~~i~~~l~~~i~~ 147 (147) .+| ..| +..+|+...+ ++...|..+|...|+| T Consensus 151 kIpis~~-l~~af~~~~~~~~~~~~~~El~~~L~~~lr~ 188 (192) T protein:vir:34 151 KIPMAVP-LTTAFKQNIERIRRERLPKELGYALQHQLRM 188 (192) T ss_pred EechhHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 011 122 4667766554 3445666667777777 No 163 >protein:vir:4460 Length: 170 # NCBI annotation: hypothetical protein # Family: family:all:2152 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700383;genbank:gi:23505455;genbank:GeneID:955662 Probab=95.48 E-value=0.00035 Score=39.50 Aligned_cols=138 Identities=16% Similarity=0.183 Sum_probs=85.4 Q ss_pred Cce----eeeehhHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhC-----------C-CCcchhhhcceeccc Q lcl|NC_015262. 1 MSV----EITTEGFDAVLSKIESMGKSG-DKLLNEAVKAGGNVILQDALPRV-----------S-KRSGKLKDGLKVSGV 63 (147) Q Consensus 1 M~~----~~~i~Gl~el~~~l~~l~~~~-~~~~~~al~~~a~~v~~~ak~~a-----------p-~~tG~l~~sI~~~~~ 63 (147) |.- -|++.-.+++ .. ...++.+..+.++..-.+|+.++ | ..||.|..||....+ T Consensus 1 M~~~~~lHvdF~qp~~~---------~Fnr~r~RraF~~iGq~h~r~Arrlvm~RGrs~pGe~P~~~TGrLa~SIgy~Vp 71 (170) T protein:vir:44 1 MPQKAYLHVDFVQPEEL---------VFNRARMRRAFVKIGQVHMRDARRLVMKRGRSKPGENPSYRTGQLARSIGYYVP 71 (170) T ss_pred CCCCceeEEeeecCCce---------eecHHHHHHHHHHHhHHHHHHHHHHHHHhcCCCCCCCCcchhhhhhhhhhhccc Confidence 542 2223222221 11 24578888888888888888654 3 369999999976544 Q ss_pred ccCC--CceEEEEeeeccCCc------ccchhhhhhcccccccccccccccccccccCCCCCC-CcchhhHHHHHHHHHH Q lcl|NC_015262. 64 KKKG--GTKYVLVGITKEDNS------KIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSP-KPFLAPAYESKKDEAK 134 (147) Q Consensus 64 ~~~~--~~~~~~Vg~~~~~~~------~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a-~PFl~pA~~~~~~~~~ 134 (147) +... -+--+.|-++...+. ..+|=-|+.||.+......+.....+....+..+.| +-||..+++..+.... T Consensus 72 ras~~rpG~mVkIaPNqk~G~g~r~i~g~fYPafL~YGVr~gakr~k~hhr~a~ggsgwriaPR~Nym~~~l~~~~~wt~ 151 (170) T protein:vir:44 72 RASKKRPGLMVKIAPNQKNGEGNRHINGAFYPAFLFYGVRRGAKRKKGHHRGASGGSGWRVEPRNNYMTEVLDKRRSWTR 151 (170) T ss_pred cccCCCCceeEEecCCCCCCCCccccccccchhhhhhhhhcccccchhhcccccCCCcceeccchhHHHHHHHhhHHHHH Confidence 4322 234456655554442 248888999997532211111111111112233444 4699999999999999 Q ss_pred HHHHHHHHHHhcC Q lcl|NC_015262. 135 NVMKEILKRGLGL 147 (147) Q Consensus 135 ~~i~~~l~~~i~~ 147 (147) ..+..+|++.|+- T Consensus 152 ~~L~r~L~~sLrp 164 (170) T protein:vir:44 152 YVLSRELRKSLRP 164 (170) T ss_pred HHHHHHHHHhcCc Confidence 9999999999999 No 164 >protein:vir:6375 Length: 205 # NCBI annotation: hypothetical protein # Family: family:all:10491 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918988;genbank:gi:34610163;genbank:gi:91214209;genbank:GeneID:2559587 Probab=95.46 E-value=0.00096 Score=37.14 Aligned_cols=147 Identities=16% Similarity=0.180 Sum_probs=64.4 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HH-HHHHhCCCCcchhhhcceecccc-cCCCceEEEE Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVI----LQ-DALPRVSKRSGKLKDGLKVSGVK-KKGGTKYVLV 74 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v----~~-~ak~~ap~~tG~l~~sI~~~~~~-~~~~~~~~~V 74 (147) |++++.++|++++.+.|++|++...+.+..|+.++|..- .. .+...+....+.+.+++..+-++ -..+..+..| T Consensus 1 m~i~v~~~G~~~~~~~l~~l~~~~~~a~~~AIN~ta~~~~~~~A~~~i~~~vn~k~~yv~~~~Rlti~k~As~~~L~A~I 80 (205) T protein:vir:63 1 MSIEIVAEGLGEFRDYVDRLPDISQQAAMIAINQTAQRTALPLARTEIGEQVNFPDNYLKDDSRLGVTKKATRNDLEAVI 80 (205) T ss_pred CeeeeehhhHHHHHHHHHhcchhhhHHHHHHHHHHHHHhhHHHHHHhhhhccccchhhhccceeeEEEeecCCCCeeEEE Confidence 999999999999999999999988776655555544443 33 24555555677777543322222 1111222222 Q ss_pred eeeccCCc----------------------------ccchhhh-hhcccccc--cccccccccccccccC-----CCC-- Q lcl|NC_015262. 75 GITKEDNS----------------------------KIFYGKF-LEFGASAH--KIPIKKGKKKGRIINH-----PGV-- 116 (147) Q Consensus 75 g~~~~~~~----------------------------~~~y~~~-vE~GT~~~--~~~~~~~~~~~~~~~~-----~~~-- 116 (147) .......+ ..+-.-| ++.=++.. ......+-+....-++ .|+ T Consensus 81 ~ar~rpt~LsRF~~p~~~~~~~r~~GVsV~Vk~G~ak~l~gaF~~~lk~g~~l~e~~~~vgva~R~~~g~~~~~~~g~~k 160 (205) T protein:vir:63 81 GARQRPTSLARFAEPGQTTKSTRKGGVSVVVKPGRTKQFKRGFLVRLRAGKTLTEDKYNLGLAVRLSPGETLHATDGATK 160 (205) T ss_pred ecCCCcceeeeccCCCccccccccCCeEEEEEcCCCeeccCceEEEeeccccccccccceEEEeeecCccccccccCcee Confidence 21111110 0111111 11100000 0000000000000000 000 Q ss_pred CCCc---chhhHHHHH----HHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 117 SPKP---FLAPAYESK----KDEAKNVMKEILKRGLGL 147 (147) Q Consensus 117 ~a~P---Fl~pA~~~~----~~~~~~~i~~~l~~~i~~ 147 (147) -+.+ +.-|++++. ++.+...|.+.+.+.++= T Consensus 161 ~~~~~k~LYGPSV~Qvf~~~~e~I~~~i~~~l~~~f~r 198 (205) T protein:vir:63 161 LSNNVYLLYGPSVDQVFRTVADDITTEVLDALADEFLR 198 (205) T ss_pred cCCceEEEEcCcHHHHHhhhhhhhhHHHHHHHHHHHHH Confidence 0112 455665554 444555555555555554 No 165 >protein:vir:100312 Length: 152 # NCBI annotation: tail synthesis protein S # Family: family:all:370 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655481;genbank:gi:109289949;genbank:GeneID:4157355 Probab=95.41 E-value=0.00052 Score=38.59 Aligned_cols=127 Identities=14% Similarity=0.110 Sum_probs=71.1 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhC-----C----C--------------Ccchhhh Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDK-LLNEAVKAGGNVILQDALPRV-----S----K--------------RSGKLKD 56 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~-~~~~al~~~a~~v~~~ak~~a-----p----~--------------~tG~l~~ 56 (147) |+- .+.+|.+.|+.+-..+.. .-+..++..++.++...+.+. | + ++|.+-. T Consensus 1 M~~-----~~~~~~~~L~~ll~~L~~~~r~~l~~~Ig~~l~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~k~~~~~~~m~~ 75 (152) T protein:vir:10 1 MSE-----PIEQVKTAFDSLLNNISKPRRRLMYQQIGRELARSQRRRIKAQQNPDGSAYEPRKKPKKGVKSKIKSGKMFD 75 (152) T ss_pred Cch-----HHHHHHHHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchhhhhhhhhcccccchhHHH Confidence 764 355666666655444321 223455666777766665543 3 1 1233333 Q ss_pred cceec-ccccCCCceEEEEeeeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHH Q lcl|NC_015262. 57 GLKVS-GVKKKGGTKYVLVGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKN 135 (147) Q Consensus 57 sI~~~-~~~~~~~~~~~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~ 135 (147) ++... ..+...+...+.||+. +++..||....||-......++.. ...+|++|||-=+ +..+.+|.+ T Consensus 76 ~L~~a~~l~~~a~~~~~~Vg~~---Gt~~~yAaiHQfG~~~r~~~~~~~--------~v~iPaRp~LG~s-~~d~~~I~~ 143 (152) T protein:vir:10 76 KITQPRFMRLRLESEGVSLGYE---GGDAVIARIHQQGLIGRVRKDWDL--------KVKYASRELLGFT-DDDLQMIED 143 (152) T ss_pred hhhhcceeeeeecCcEEEEEec---CCchhhhhhhccCccccccCCCCc--------ceeccccccCCCC-HHHHHHHHH Confidence 33211 1111222334666653 356789999999854222111111 1248999999766 456688999 Q ss_pred HHHHHHHHH Q lcl|NC_015262. 136 VMKEILKRG 144 (147) Q Consensus 136 ~i~~~l~~~ 144 (147) .+.+.|..+ T Consensus 144 ~i~~~l~~a 152 (152) T protein:vir:10 144 YMINILAGS 152 (152) T ss_pred HHHHHHhcC Confidence 999999999 No 166 >protein:vir:1838 Length: 149 # NCBI annotation: O protein # Family: family:all:370 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052262;genbank:gi:9634069;genbank:GeneID:1262457 Probab=94.88 E-value=0.00079 Score=37.60 Aligned_cols=119 Identities=10% Similarity=0.118 Sum_probs=65.5 Q ss_pred ehhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhC-----CC-------Cc-----------------chhhh Q lcl|NC_015262. 7 TEGFDAVLSKIESMGKSGDK-LLNEAVKAGGNVILQDALPRV-----SK-------RS-----------------GKLKD 56 (147) Q Consensus 7 i~Gl~el~~~l~~l~~~~~~-~~~~al~~~a~~v~~~ak~~a-----p~-------~t-----------------G~l~~ 56 (147) +..|+++.+.|+.+-..+.. .-+..++..++.++...+.+. |. .. |.+.+ T Consensus 1 m~~~~~~~~~l~~ll~~L~~~~~~~l~r~Ig~~l~~~t~~rf~~q~~PdG~~W~p~~~~~~~~~~g~~~~~~~~~l~~~~ 80 (149) T protein:vir:18 1 MSELTALQERLAGLIASLSPAARRKMAAEIAKKLRTSQQQRIKRQQAPDGTPYAARKRQPVRSKKGRIKREMFAKLRTSR 80 (149) T ss_pred CchHHHHHHHHHHHHHhcCCchHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchhhhhhccCcccchhhhhhhhhh Confidence 33466666666655433321 123456666677766665543 31 11 11222 Q ss_pred cceecccccCCCceEEEEeeeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHH Q lcl|NC_015262. 57 GLKVSGVKKKGGTKYVLVGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNV 136 (147) Q Consensus 57 sI~~~~~~~~~~~~~~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~ 136 (147) +|... .+...+.||+. +++..||....||......+.. ....+||+|||-=+ ++.+.+|.+. T Consensus 81 ~l~~~-----~~~~~~~v~~~---Gtn~~yAaiHQfG~~~r~~~~~---------~~v~iPaRp~LG~s-~~d~~~I~~~ 142 (149) T protein:vir:18 81 FMKAK-----GSDSAAVVEFT---GKVQRMARVHQYGLKDRPNRNS---------RDVQYEARPLLGFT-RDDEQMIEDV 142 (149) T ss_pred hhhee-----ecCceeEEEec---ccchhhhhhhhccccccccCCC---------ccccccccccCCCC-HHHHHHHHHH Confidence 23211 11112344332 4567899999999653221111 12358999998866 5567889999 Q ss_pred HHHHHHH Q lcl|NC_015262. 137 MKEILKR 143 (147) Q Consensus 137 i~~~l~~ 143 (147) +.+.|.+ T Consensus 143 i~~~l~~ 149 (149) T protein:vir:18 143 IISHLGK 149 (149) T ss_pred HHHHHhC Confidence 9999999 No 167 >protein:vir:102608 Length: 108 # NCBI annotation: gp9 # Family: family:all:3937 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655005;genbank:gi:109392195;genbank:GeneID:4157230 Probab=94.35 E-value=0.00013 Score=41.94 Aligned_cols=104 Identities=17% Similarity=0.228 Sum_probs=54.7 Q ss_pred Cceeeeehh-HHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEee Q lcl|NC_015262. 1 MSVEITTEG-FDA---VLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGI 76 (147) Q Consensus 1 M~~~~~i~G-l~e---l~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~ 76 (147) |+..-.-+. |.. -++.|+.|++ +++.+.+-..+|...-|++.|+.||.+++|+.++......|.. .|| T Consensus 1 ma~gpt~knplakfgi~lddfdklpe-----vnqgvnef~dev~aawk~nspv~~g~yrdsvqvterstnkgrg--kvg- 72 (108) T protein:vir:10 1 MANGPTRKNPLAKFGVRLDDFDKLPE-----VNQGVNEFIDEVVAAWKNNSPVGTGAYRDSVQVTERSTNKGRG--KVG- 72 (108) T ss_pred CCCCCccccchhhhccchhhhhccch-----hhhhHHHHHHHHHHhhhcCCCccccccccceeecccccccccc--ccc- Confidence 442211110 000 1122333432 3445555556667777899999999999999877655544432 333 Q ss_pred eccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHH Q lcl|NC_015262. 77 TKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYES 128 (147) Q Consensus 77 ~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~ 128 (147) ...+-+|++|||...........+.... |=.-||+. T Consensus 73 -----atdpqahlvefgs~hndeyapaqktakq-----------fggtay~d 108 (108) T protein:vir:10 73 -----ATDPQAHLVEFGSAHNDEYAPAQKTAKQ-----------FGGTAYGD 108 (108) T ss_pred -----CcchhhhhhhhhccccccccchhhhHHh-----------hcccccCC Confidence 2346799999998765443332222222 22222222 No 168 >protein:vir:105825 Length: 108 # NCBI annotation: gp9 # Family: family:all:3937 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655770;genbank:gi:109522093;genbank:GeneID:4157633 Probab=94.35 E-value=0.00013 Score=41.94 Aligned_cols=104 Identities=17% Similarity=0.228 Sum_probs=54.7 Q ss_pred Cceeeeehh-HHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEee Q lcl|NC_015262. 1 MSVEITTEG-FDA---VLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGI 76 (147) Q Consensus 1 M~~~~~i~G-l~e---l~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~ 76 (147) |+..-.-+. |.. -++.|+.|++ +++.+.+-..+|...-|++.|+.||.+++|+.++......|.. .|| T Consensus 1 ma~gpt~knplakfgi~lddfdklpe-----vnqgvnef~dev~aawk~nspv~~g~yrdsvqvterstnkgrg--kvg- 72 (108) T protein:vir:10 1 MANGPTRKNPLAKFGVRLDDFDKLPE-----VNQGVNEFIDEVVAAWKNNSPVGTGAYRDSVQVTERSTNKGRG--KVG- 72 (108) T ss_pred CCCCCccccchhhhccchhhhhccch-----hhhhHHHHHHHHHHhhhcCCCccccccccceeecccccccccc--ccc- Confidence 442211110 000 1122333432 3445555556667777899999999999999877655544432 333 Q ss_pred eccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHH Q lcl|NC_015262. 77 TKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYES 128 (147) Q Consensus 77 ~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~ 128 (147) ...+-+|++|||...........+.... |=.-||+. T Consensus 73 -----atdpqahlvefgs~hndeyapaqktakq-----------fggtay~d 108 (108) T protein:vir:10 73 -----ATDPQAHLVEFGSAHNDEYAPAQKTAKQ-----------FGGTAYGD 108 (108) T ss_pred -----CcchhhhhhhhhccccccccchhhhHHh-----------hcccccCC Confidence 2346799999998765443332222222 22222222 No 169 >protein:vir:1164 Length: 156 # NCBI annotation: predicted tail completion # Family: family:all:370 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490613;genbank:gi:17313233;genbank:GeneID:927308 Probab=93.97 E-value=0.0018 Score=35.68 Aligned_cols=129 Identities=11% Similarity=0.157 Sum_probs=67.6 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhC-----CC-------C-------cchhhhcc-- Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDK-LLNEAVKAGGNVILQDALPRV-----SK-------R-------SGKLKDGL-- 58 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~-~~~~al~~~a~~v~~~ak~~a-----p~-------~-------tG~l~~sI-- 58 (147) |+- .+.+|.+.|+.|-..+.. .-++.++..++.++...+.+. |. . +|...... T Consensus 1 m~~-----~~~~l~~~L~~ll~~L~~~~~~~l~r~Ig~~l~~~t~~Rf~~q~~PdG~~W~p~~~~~~~~~~~~~~~~~~m 75 (156) T protein:vir:11 1 MAD-----SLEALEDWAGPILRALEPGPRAALARSLARDLRRSQQKRVMAQRNPDGSAYEPRKKRELRGKQGRIRRKIKM 75 (156) T ss_pred Cch-----hHHHHHHHHHHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHhhhccccccchhh Confidence 653 234444444443322211 123345666666666665443 31 1 11111100 Q ss_pred ----e-ecccccCCCceEEEEeeeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHH Q lcl|NC_015262. 59 ----K-VSGVKKKGGTKYVLVGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEA 133 (147) Q Consensus 59 ----~-~~~~~~~~~~~~~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~ 133 (147) . ....+...+...+.||+. +++..||...-||..-..... .....+||+|||-=+ ++.+.++ T Consensus 76 ~~~l~~~~~l~~~~~~~~a~vg~~---Gs~~~yA~iHQfG~~~~~~~~---------~~~v~iPaRp~LG~s-~~d~~~i 142 (156) T protein:vir:11 76 FQKLRTVRYLRAKGDAQAITVSFA---GRIARIARVHQYGLRDRAEPG---------APEVSYAQRLLLGFD-SSDMETI 142 (156) T ss_pred hhhhhhhheeeeeecCcEEEEEec---CCchhhhhhhcccccccccCC---------CCcccccccccCCCC-HHHHHHH Confidence 0 000111222334566653 356789999999964221111 011248999999766 4667889 Q ss_pred HHHHHHHHHHHhcC Q lcl|NC_015262. 134 KNVMKEILKRGLGL 147 (147) Q Consensus 134 ~~~i~~~l~~~i~~ 147 (147) .+.+.+.|.+..-+ T Consensus 143 ~~~i~~~l~~~~~~ 156 (156) T protein:vir:11 143 QNGILAHIDANSPI 156 (156) T ss_pred HHHHHHHHhhcCCC Confidence 99999999998888 No 170 >protein:vir:79687 Length: 113 # NCBI annotation: hypothetical protein # Family: family:all:899 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285886;genbank:gi:148750843;genbank:GeneID:5220386 Probab=93.76 E-value=0.0016 Score=35.89 Aligned_cols=110 Identities=16% Similarity=0.116 Sum_probs=68.1 Q ss_pred HHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeeccCCcccchhhhhhc Q lcl|NC_015262. 14 LSKIESMGKSGD-KLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKEDNSKIFYGKFLEF 92 (147) Q Consensus 14 ~~~l~~l~~~~~-~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~~~~~~y~~~vE~ 92 (147) +..|+.+.+.+. +.+.++-..-++.|..++..-+|.+||.|++|..+. ++. |-+ ..+||+++=| T Consensus 1 ~~dL~~~~~~~~~~~~~raQ~~l~~ev~~~~~pYVP~~~G~Lk~S~~i~-----s~~----I~y------~tPYAr~qyY 65 (113) T protein:vir:79 1 MSDLSVFSRMAQSTGSRSVRLQVLNQMHQDMEQYVPKRAGFLRSQSFVN-----DTG----IHY------TAKYARAQFY 65 (113) T ss_pred CchHHHHHHhhchhHHHHHHHHHHHHHHHhhcccCcccccchhcccccc-----CCe----eEe------cChhhhHhhc Confidence 333333433332 345566777788888999999999999999986421 121 111 3578887777 Q ss_pred ccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 93 GASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGLGL 147 (147) Q Consensus 93 GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~~ 147 (147) |..... ....+.+|+ ....|+..|.....++..+.+.+++-++-.- T Consensus 66 g~~~~~--------~~~~~t~p~-ag~~W~eraKa~h~~~w~~~~~~a~~~G~~~ 111 (113) T protein:vir:79 66 GFVNGH--------RVRNYSTPG-TGRRWDLKAKAVYKADWQKVAVAAFLKEAKG 111 (113) T ss_pred cccCCC--------CccccCCCC-CCchhhHHHHHHhHHHHHHHHHHHhhccccc Confidence 643211 011122333 3344667788888888888888877777777 No 171 >protein:vir:487 Length: 187 # NCBI annotation: hypothetical protein # Family: family:all:2152 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543094;swissprot:trembl:q8w625;genbank:gi:18249906;uniprot:Q8W625;genbank:GeneID:929690 Probab=93.42 E-value=0.0021 Score=35.27 Aligned_cols=147 Identities=19% Similarity=0.261 Sum_probs=84.1 Q ss_pred Cceeeee-hhHHHHHHH------HHHHHH-HH-HHHHHHHHHHHHHHHHHHHHHhC-----------C-CCcchhhhcce Q lcl|NC_015262. 1 MSVEITT-EGFDAVLSK------IESMGK-SG-DKLLNEAVKAGGNVILQDALPRV-----------S-KRSGKLKDGLK 59 (147) Q Consensus 1 M~~~~~i-~Gl~el~~~------l~~l~~-~~-~~~~~~al~~~a~~v~~~ak~~a-----------p-~~tG~l~~sI~ 59 (147) |+--+.- .|...+... |++... .. ...++.+..+.++..-.+|+.++ | ..||.|..||. T Consensus 1 ~~~~~~~~~~~nam~~~~~lHvdF~qp~~~~Fnr~riRraF~~iGq~h~r~ArrLvm~RGrs~pge~P~~qTGrLa~SIg 80 (187) T protein:vir:48 1 MKNCVQRDDGVNAMNQTAFLHVDFKQPKELEFNRARLRRAFVQIGRVYMRDARRLVIKRGRSGPGENPGYQTGRLARSIG 80 (187) T ss_pred CccccccccchhhhhhccceeEeeecCCceeecHHHHHHHHHHHhHHHHHHHHHHHHhcccCCCCCCCcchhhhhhhhhh Confidence 2211110 011111100 111111 11 24688889999999988888765 2 36999999997 Q ss_pred ecccccCC--CceEEEEeeeccCC--------cccchhhhhhcccccccccccccc--cccccccCCCCCCC-cchhhHH Q lcl|NC_015262. 60 VSGVKKKG--GTKYVLVGITKEDN--------SKIFYGKFLEFGASAHKIPIKKGK--KKGRIINHPGVSPK-PFLAPAY 126 (147) Q Consensus 60 ~~~~~~~~--~~~~~~Vg~~~~~~--------~~~~y~~~vE~GT~~~~~~~~~~~--~~~~~~~~~~~~a~-PFl~pA~ 126 (147) ...++... .+--+.|-++...+ ...+|=-|+.||.+.....-.... ..+....+..+.|+ -||.-++ T Consensus 81 y~Vpkat~~RpG~mVkIaPNqk~G~g~r~~Pi~gdfYPafL~YGVr~ga~~~~~~~k~~~~~~~sgwriaPR~Nym~~~L 160 (187) T protein:vir:48 81 YYVPKKTTRRPGLMVKISPNQKNGQGNRRFPEGAPYYPAFLYYGVRHSAYGMDKKDKRQKKHHSSTFRLAPRNNFMADVI 160 (187) T ss_pred hccccccCCCCcceEEecCCcccCcccccccccccchhHHHHhhhhhhhhccchhhhhhhcccCCcceeccchhHHHHHH Confidence 65443222 22334554443222 224888899999754322211111 11111233335554 5999999 Q ss_pred HHHHHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 127 ESKKDEAKNVMKEILKRGLGL 147 (147) Q Consensus 127 ~~~~~~~~~~i~~~l~~~i~~ 147 (147) +..+......+..+|++.|+. T Consensus 161 ~~~~~wt~~~L~raL~~sLrp 181 (187) T protein:vir:48 161 ERRRHWTQELLSRELQRSLRP 181 (187) T ss_pred HhhHHHHHHHHHHHHHHhcCc Confidence 999999999999999999999 No 172 >protein:vir:79115 Length: 148 # NCBI annotation: tail completion protein gpS # Family: family:all:370 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165266;genbank:gi:145708091;genbank:GeneID:5247126 Probab=93.33 E-value=0.0025 Score=34.85 Aligned_cols=119 Identities=10% Similarity=0.028 Sum_probs=65.4 Q ss_pred ehhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhC-----CC-----------------------Ccchhhhc Q lcl|NC_015262. 7 TEGFDAVLSKIESMGKSGDK-LLNEAVKAGGNVILQDALPRV-----SK-----------------------RSGKLKDG 57 (147) Q Consensus 7 i~Gl~el~~~l~~l~~~~~~-~~~~al~~~a~~v~~~ak~~a-----p~-----------------------~tG~l~~s 57 (147) ...+++|.+.|+.|-..+.. .-+..++.-++.++...+.+. |. .++.+..+ T Consensus 1 m~~~~~l~~~L~~ll~~l~~~~~~~l~r~Ig~~l~~st~~Rf~~q~~PDG~~W~p~s~~~~~~~g~~~~~~~~~l~~~~~ 80 (148) T protein:vir:79 1 MSESRELEAWLAGMLTKLDAPARRMLARAVAAELRRRQAARIAEQRNPDGSPYVPRKPQLRHRAGRIRRAMFMRLRLARY 80 (148) T ss_pred CccHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCcCcccchHHHhhcccccccccchhhhhhh Confidence 44577777777766555432 223455666777766665543 31 01122233 Q ss_pred ceecccccCCCceEEEEeeeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHH Q lcl|NC_015262. 58 LKVSGVKKKGGTKYVLVGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVM 137 (147) Q Consensus 58 I~~~~~~~~~~~~~~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i 137 (147) |... .+.....||+. +++..||...-||-.....+. .....+|++|||-=+ +..+.++.+.+ T Consensus 81 l~~~-----~~~~~~~v~~~---Gt~~~yAaiHQfG~~~r~~~~---------~~~v~iPaRp~LG~s-~~d~~~i~~~i 142 (148) T protein:vir:79 81 MKTQ-----ADANTAVVTFA---GNAQRIATVHQFGLRDRVNKA---------GLTAQYPARELLGMD-GVDMEHITNLL 142 (148) T ss_pred eeee-----eeCCeeeEEee---ccchhhhhhhhcCccccccCC---------CCccccCcccccCCC-HHHHHHHHHHH Confidence 3211 11223455442 356789999999943221111 112348999999765 44566777777 Q ss_pred HHHHHH Q lcl|NC_015262. 138 KEILKR 143 (147) Q Consensus 138 ~~~l~~ 143 (147) .+.|.- T Consensus 143 ~~~l~~ 148 (148) T protein:vir:79 143 LLHLGA 148 (148) T ss_pred HHHhcC Confidence 777766 No 173 >protein:vir:79034 Length: 141 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110729;genbank:gi:134287346;genbank:GeneID:4955208 Probab=90.78 E-value=0.0041 Score=33.68 Aligned_cols=127 Identities=9% Similarity=-0.004 Sum_probs=69.5 Q ss_pred eee-hhH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeeccCC Q lcl|NC_015262. 5 ITT-EGF--DAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKEDN 81 (147) Q Consensus 5 ~~i-~Gl--~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~~ 81 (147) +.= .++ ++|.+-.++|.+.....+.+.++..++.+..++...+...|. +.....+. .+.+|.... T Consensus 1 M~~~~~~d~~gl~~~~~~l~~~~~~~~~~~~~~~~~~~a~~l~~~vk~~tP-----VdTG~Lr~-----sw~~~~~~~-- 68 (141) T protein:vir:79 1 MARWGSVDFREFKRVCKKMEKLTKIDLDKFCKDAARELAARLLGKVIRRTP-----VDTGFLRQ-----GWNGVAYAR-- 68 (141) T ss_pred CCCCccCcHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCC-----Ccchhhcc-----ccccccccc-- Confidence 332 122 245444455544444456666777777777766655532221 11111111 111110000 Q ss_pred cccchhh-hhhcccccccccccccccccccccCCCCCCCcchhhHH--HHHHHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 82 SKIFYGK-FLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAY--ESKKDEAKNVMKEILKRGLGL 147 (147) Q Consensus 82 ~~~~y~~-~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~--~~~~~~~~~~i~~~l~~~i~~ 147 (147) ++. ..+-++......+....+...++||.-++++||..+++ +...+++.+.+.+.+.+.|+= T Consensus 69 ----~~~~~~~g~~~~v~v~n~~~YA~~VE~Ghr~~~~~gfV~G~fml~~s~~~~~~~~~~~~~~~l~~ 133 (141) T protein:vir:79 69 ----SLPVYKQGNNYIIEVVNPTEYASYVNFGHRTKDGKGWVKGQHFLTISEMELQSQVDKIIEKKLLI 133 (141) T ss_pred ----ccceeecCCeeEEEEecCCcchhhhhcceeecCCcceeCCchhHHHHHHHHHHHHHHHHHHHHHH Confidence 000 00111111112223345677899999999999999998 888899999999999999988 No 174 >protein:vir:4200 Length: 133 # NCBI annotation: unknown # Family: family:all:11764 # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071825;genbank:gi:11863108;genbank:GeneID:1257610 Probab=88.89 E-value=0.0086 Score=31.89 Aligned_cols=126 Identities=18% Similarity=0.223 Sum_probs=74.6 Q ss_pred ceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeeccCC Q lcl|NC_015262. 2 SVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKEDN 81 (147) Q Consensus 2 ~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~~ 81 (147) .+++.|.--|.|+.+-.... ..+.+.+..-...++.-+...+|..||+|++|-.++..- +.| .-. T Consensus 1 mi~i~idkp~almek~~ev~----~~ie~t~~~~~~~l~~i~~ntapiktg~lr~sh~~sieg-stg----------els 65 (133) T protein:vir:42 1 MIEIRIDKPDALMEKPHEVQ----GKIEETLEKILNQLQGIAENTAPVKTGNLRDSHIISIEG-STG----------ELS 65 (133) T ss_pred CeeeecCCchhhhcchhhhh----hHHHHHHHHHHHHHHHHhhhccccccccceeeeeEEeec-Ccc----------chh Confidence 45888888788777655444 445556666667778888889999999999986544221 111 223 Q ss_pred cccchhhhhhcccccccccccccccccccccCC-----CCCCCcchhhH--HHHHHHHHHHHHHHHHHH Q lcl|NC_015262. 82 SKIFYGKFLEFGASAHKIPIKKGKKKGRIINHP-----GVSPKPFLAPA--YESKKDEAKNVMKEILKR 143 (147) Q Consensus 82 ~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~-----~~~a~PFl~pA--~~~~~~~~~~~i~~~l~~ 143 (147) ..++|..||=||-.-..+. .+..-+..+.-|| ..||.-||.-+ +-..+.-+.+.+.+-|++ T Consensus 66 n~~~yl~~vl~grgwvfpv-~~kal~wpelphpvayarpappndyfsa~vay~~~~give~s~iewlre 133 (133) T protein:vir:42 66 NLAYYLPFVLHGRGWVFPV-RRKALWWPELPHPVAYARPAPPNDYFSAVVAYSAPEGVVEETLIEWLRE 133 (133) T ss_pred hhhHHhhHhhhcccceeec-cccccccCCCCCcccccCCCCCchhhhhhhhhhcccchhHHHHHHHHhC Confidence 4578999999986432222 2222233333333 13455566644 444455555666666665 No 175 >protein:vir:78163 Length: 92 # NCBI annotation: hypothetical protein # Family: family:all:29889 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294805;genbank:gi:149882826;genbank:GeneID:5309191 Probab=84.49 E-value=0.0035 Score=34.04 Aligned_cols=92 Identities=17% Similarity=0.252 Sum_probs=52.5 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeeccC Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKED 80 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~ 80 (147) |.--|.-. +.+.+++-+-+ -++.-+.-+|+.....||+++|+|||.++|.+.+..++.........||.+.. T Consensus 1 madaftpN--p~~FDqIl~s~-----~VrALt~gaAe~aLa~AKAsAPVDTGAYRDGL~iE~~q~~~RtT~MVVG~D~K- 72 (92) T protein:vir:78 1 MADAFTPN--PTWFDQIMRTP-----KVRALVDGVAEETLADAKASAPVDTGAYRDGLHIEHRQGRSRETAMVVGSDEK- 72 (92) T ss_pred CCCccCCC--hhHHHHhhccc-----chhhhhhhhhhhhhhhhcccCcccccccccccchhhhhccccceeEEeecCcc- Confidence 65433321 12222221111 12222334577778889999999999999999988888777666677775432 Q ss_pred CcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHH Q lcl|NC_015262. 81 NSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKD 131 (147) Q Consensus 81 ~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~ 131 (147) -.++|.-|.. |+.|+...+. T Consensus 73 ------TlLvESrTGN-------------------------Lakalk~~rs 92 (92) T protein:vir:78 73 ------TLLIESRTGN-------------------------LARSVKRRRS 92 (92) T ss_pred ------eeeeecccch-------------------------HHHHHhhhcC Confidence 2457766643 3333333222 No 176 >protein:vir:8432 Length: 149 # NCBI annotation: gp27 # Family: family:all:30885 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818328;genbank:gi:29566764;genbank:GeneID:1260117 Probab=81.74 E-value=0.071 Score=26.89 Aligned_cols=120 Identities=15% Similarity=0.235 Sum_probs=53.3 Q ss_pred CceeeeehhHHHHHHHHH-HH--HHHHHHHHHHHHHHHHHHHHHH-------------HHH-h--CCCCcchhhhcceec Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIE-SM--GKSGDKLLNEAVKAGGNVILQD-------------ALP-R--VSKRSGKLKDGLKVS 61 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~-~l--~~~~~~~~~~al~~~a~~v~~~-------------ak~-~--ap~~tG~l~~sI~~~ 61 (147) |--++-++ |.++.-.|- -. +.++.+++. .-.+.+.+. |++ . .|..||+++.+|+.+ T Consensus 1 mprdvvvk-lrdvrgalldgvsssrdlrrivq----rfindveqtwhdvwdvsmlgvlaqqtgvphpyqtgdykahikkk 75 (149) T protein:vir:84 1 MPRDVVVK-LRDVRGALLDGVSSSRDLRRIVQ----RFINDVEQTWHDVWDVSMLGVLAQQTGVPHPYQTGDYKAHIKKK 75 (149) T ss_pred CCchheeh-hhhhhhhhhhccccchHHHHHHH----HHHHHHHHHHHhHhhHHHHHHHHhhcCCCCCccccchhhhhhhh Confidence 76655443 333332221 11 112222222 211222111 111 2 267899999999754 Q ss_pred cccc---------CCCceEEEEeeeccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHH Q lcl|NC_015262. 62 GVKK---------KGGTKYVLVGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDE 132 (147) Q Consensus 62 ~~~~---------~~~~~~~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~ 132 (147) +... -.|+ ..+|... .+..-+||+||||..-.++. ..|+-|-.| .|||+- T Consensus 76 kltamqkirikkflkgg--mpiglvy---nndekahwieygtkrdrpgs----------rspwgpntp--tpafei---- 134 (149) T protein:vir:84 76 KLTAMQKIRIKKFLKGG--MPIGLVY---NNDEKAHWIEYGTKRDRPGS----------RSPWGPNTP--TPAFEI---- 134 (149) T ss_pred hHHHHHHHHHHHHhhcC--CceeEEe---cCCcchhhhhhccccCCCCC----------CCCCCCCCC--ChhHHH---- Confidence 3211 1122 2233322 23466899999996432221 122333333 356654 Q ss_pred HHHHHHHHHHHHhcC Q lcl|NC_015262. 133 AKNVMKEILKRGLGL 147 (147) Q Consensus 133 ~~~~i~~~l~~~i~~ 147 (147) .+.+.+.+++.++. T Consensus 135 -mqrvarimnedvry 148 (149) T protein:vir:84 135 -MQRVARIMNEDVRY 148 (149) T ss_pred -HHHHHHHhhhhccc Confidence 34455555555555 No 177 >protein:vir:4162 Length: 133 # NCBI annotation: unknown # Family: family:all:11764 # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046971;genbank:gi:9630541;genbank:GeneID:1261715 Probab=80.52 E-value=0.043 Score=28.09 Aligned_cols=126 Identities=16% Similarity=0.154 Sum_probs=70.1 Q ss_pred ceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeeccCC Q lcl|NC_015262. 2 SVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKEDN 81 (147) Q Consensus 2 ~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~~ 81 (147) .+++.|.--|.|+.+-... +..+.+.+..-...++.-+...+|..||+|++|-.++..- ..| .-. T Consensus 1 mi~i~idkp~almek~~ev----~~~ie~t~~~~~~~l~~i~~ntapiktg~lr~sh~~sieg-stg----------els 65 (133) T protein:vir:41 1 MIRINIDKPEALMEKASEV----EDRVEQTVTLLMIELEEILMNTAPIKTGELRISHTWSVEG-STG----------ELT 65 (133) T ss_pred CeeeecCCchhhhcchhhh----hhHHHHHHHHHHHHHHHHhhhccccccccceeeeeEEeec-Ccc----------chh Confidence 4588888878877765544 4445556666667778888889999999999986544221 111 223 Q ss_pred cccchhhhhhcccccccccccccccccccccCC-----CCCCCcchhhHH--HHHHHHHHHHHHHHHHH Q lcl|NC_015262. 82 SKIFYGKFLEFGASAHKIPIKKGKKKGRIINHP-----GVSPKPFLAPAY--ESKKDEAKNVMKEILKR 143 (147) Q Consensus 82 ~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~-----~~~a~PFl~pA~--~~~~~~~~~~i~~~l~~ 143 (147) ..++|..||=||-.-..+. .+..-+..+.-|| ..||.-||.-+. -..+.-+.+.+.+-|-- T Consensus 66 n~~~yl~~vl~grgwvfpv-~~kal~wpelphpvayarpappndyfsa~vay~~~~give~s~iewlis 133 (133) T protein:vir:41 66 NTVPYLQWVLFGRGWVFPV-EKKALYWPELPHPVAYARPAPPNDYFSAAVAYIDAKGIVEDSFIEWLIS 133 (133) T ss_pred hhhHHhhHhhhcccceeee-cccccccCCCCCcccccCCCCCchhhhhhhhhhcccchhHHHHHHHhcC Confidence 4578999999986432222 2222233333333 134455666443 33333333333332222 No 178 >protein:vir:78894 Length: 105 # NCBI annotation: gp10 # Family: family:all:29989 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468850;genbank:gi:157325424;genbank:GeneID:5601891 Probab=75.91 E-value=0.025 Score=29.32 Aligned_cols=102 Identities=19% Similarity=0.205 Sum_probs=49.5 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeee Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGN---VILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGIT 77 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~---~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~ 77 (147) |++ +--+ +...+.|. +.+|..+++ ++.+.+..-+|.+||+|++|-....+ -..|...+.+ T Consensus 1 ~~f--~~f~-~~~~k~l~----------kr~L~~~g~vq~EvlR~~~PyvP~~tG~Lk~S~~l~tv-Igsg~I~y~~--- 63 (105) T protein:vir:78 1 MSF--SSFK-DAVIDDIH----------NKALSTAAKAGGELVELAQPVTPILYGDLRRSSYFKII-IQKNSIVARV--- 63 (105) T ss_pred CCc--cccc-chHHHHHH----------HhcCCCCchhhHHHHHHhCCCCccccccccccccccee-ecCCeeEeec--- Confidence 552 1111 12222222 223332222 44445555679999999998432211 1122222211 Q ss_pred ccCCcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 78 KEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGLGL 147 (147) Q Consensus 78 ~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~~ 147 (147) .+-++||+++=|... ...-|+..+.-..++. |.+.+..+++| T Consensus 64 ---~~~aPYAr~qYYe~~---------------------Rg~~WfErm~a~hk~~----I~~~vegg~~~ 105 (105) T protein:vir:78 64 ---FSLTPYARRQYYENR---------------------RNPRWYEMAVSYGIQS----INQIVEGGMRL 105 (105) T ss_pred ---cccCchhhhhhhccc---------------------CCCchhHHhhhcchhH----HHHHHhcccCC Confidence 123578876655432 1222677777777665 45555588888 No 179 >protein:vir:79555 Length: 192 # NCBI annotation: putative tail component # Family: family:all:869 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272521;genbank:gi:148609390;genbank:GeneID:5204391 Probab=74.57 E-value=0.16 Score=25.00 Aligned_cols=136 Identities=16% Similarity=0.288 Sum_probs=49.7 Q ss_pred ehhHHHHHHHHHHHHHH-HHHH--------HHHHHHHHHHHHHHHHHHhCCCCc----chhhhcceecccccCCCceEEE Q lcl|NC_015262. 7 TEGFDAVLSKIESMGKS-GDKL--------LNEAVKAGGNVILQDALPRVSKRS----GKLKDGLKVSGVKKKGGTKYVL 73 (147) Q Consensus 7 i~Gl~el~~~l~~l~~~-~~~~--------~~~al~~~a~~v~~~ak~~ap~~t----G~l~~sI~~~~~~~~~~~~~~~ 73 (147) |+||+++++.|+.|+.. +.++ ...|+..++..|..+.....+... -.++.-+........ +..++. T Consensus 1 ~kgl~~a~~nl~~l~~~~vp~A~~~ainrva~ra~~~t~~~v~~~~~~~~~~~~~I~~k~iR~R~r~~ka~~~-~~~~~~ 79 (192) T protein:vir:79 1 MKGLENAIRNLNSLDTRMVPQASAWAINRVAQKAVSVATRQVAGNTVAGDNQVKGIPLKLVRQRVRVFKASPS-GKMTAR 79 (192) T ss_pred CchHHHHHHHHHhcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhcCcHHHHHhhhhcccccCC-CceEEE Confidence 99999999999998664 3333 334444455555555422221111 112222221111111 111111 Q ss_pred EeeeccC-----------------------------Ccccchhhhh---hccccccccccccccccc--ccccCCCCCCC Q lcl|NC_015262. 74 VGITKED-----------------------------NSKIFYGKFL---EFGASAHKIPIKKGKKKG--RIINHPGVSPK 119 (147) Q Consensus 74 Vg~~~~~-----------------------------~~~~~y~~~v---E~GT~~~~~~~~~~~~~~--~~~~~~~~~a~ 119 (147) |.++-.+ +...+-.-|+ -.|. .|......++.+. ..+.-|- .+ T Consensus 80 I~v~~~~l~ai~lg~~r~r~~rr~~~~~~~~s~~~vGk~~f~gaFia~m~ngr-~~V~~R~~gk~R~PIevvkIpi--s~ 156 (192) T protein:vir:79 80 IRVNRGNLPAIKLGTARVRLARRGGKLQYRGSVLKVGKYLFRDAFIQQLANGR-WHVMRRIDGKNRYPIDVVKIPL--SG 156 (192) T ss_pred EEEecCceeeeeecccccccccccccccccccceEEcceecCchhccccCCCC-ccceEecCCCccCCeeeEeech--HH Confidence 1110000 0000001111 1111 0000000000000 0011110 12 Q ss_pred cchhhHHHHHH-----HHHHHHHHHHHHHHhcC Q lcl|NC_015262. 120 PFLAPAYESKK-----DEAKNVMKEILKRGLGL 147 (147) Q Consensus 120 PFl~pA~~~~~-----~~~~~~i~~~l~~~i~~ 147 (147) | +..+|+... +++.+.+..+|...|++ T Consensus 157 ~-l~~af~~e~~r~~~~~~~~el~~~L~~qlr~ 188 (192) T protein:vir:79 157 P-LTQAFEDARDRIIAAEMPKQLGYALKQQLRL 188 (192) T ss_pred H-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 2 335555544 45556666677777777 No 180 >protein:vir:6154 Length: 119 # NCBI annotation: hypothetical protein # Family: family:all:10918 # MgeID: mge:127 # MgeName: phBC6A51 # Cross-refs: genbank:acc:NP_852533;genbank:gi:31415793;genbank:GeneID:1489145 Probab=56.05 E-value=0.017 Score=30.27 Aligned_cols=118 Identities=16% Similarity=0.197 Sum_probs=67.1 Q ss_pred CceeeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeeccC Q lcl|NC_015262. 1 MSVEITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITKED 80 (147) Q Consensus 1 M~~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~~~ 80 (147) |.+.+-++|-..+++.-+ +..-+.-+.+.+++-...-..+|..++|--.|-|..||..+..-..+.. .+| .. T Consensus 1 mrirvvvkgksnvlkahn--pnryktpieqtvekhtrlqanqasnrapilhgplsesipasvkmvvgar---iig---ty 72 (119) T protein:vir:61 1 MRIRVVVKGKSNVLKAHN--PNRYKTPIEQTVEKHTRLQANQASNRAPILHGPLSESIPASVKMVVGAR---IIG---TY 72 (119) T ss_pred CeeEEEeecccceecccC--CccccccHHHHHHHhhhhhcccccccCceeecccccccchhhhhhhhhh---hcc---cc Confidence 999999999887765442 1111223445566666666677888899999999999965432222211 122 33 Q ss_pred CcccchhhhhhcccccccccccccccccccccCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_015262. 81 NSKIFYGKFLEFGASAHKIPIKKGKKKGRIINHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGLGL 147 (147) Q Consensus 81 ~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~~ 147 (147) ++...|+...||-+.. -+-|||...=+.++.+.+.|.+.+++--.= T Consensus 73 gspliyaavqefthkt---------------------kkgfmrktafegeqpfvedisktvqrvakg 118 (119) T protein:vir:61 73 GSPLIYAAVQEFTHKT---------------------KKGFMRKTAFEGEQPFVEDISKTVQRVAKG 118 (119) T ss_pred cchHHHHHHHHHhhhh---------------------hhhhhhhhcccCCcchHHHHHHHHHHhhcC Confidence 5667899999997542 122444333333333344444444332222 No 181 >protein:vir:7859 Length: 126 # NCBI annotation: gp16 # Family: family:all:11115 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817466;genbank:gi:29565895;genbank:GeneID:1259088 Probab=42.81 E-value=0.33 Score=23.22 Aligned_cols=106 Identities=15% Similarity=0.200 Sum_probs=52.0 Q ss_pred ehhHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHH---hCC--------------CCcchhhhcceecccccCCC Q lcl|NC_015262. 7 TEGFDAVLSKIESMGK-SGDKLLNEAVKAGGNVILQDALP---RVS--------------KRSGKLKDGLKVSGVKKKGG 68 (147) Q Consensus 7 i~Gl~el~~~l~~l~~-~~~~~~~~al~~~a~~v~~~ak~---~ap--------------~~tG~l~~sI~~~~~~~~~~ 68 (147) ++||..|+.+-+-... ....++-..|-.-|.+|++---. ..| ...|++..||+++..+..+| T Consensus 1 mkglgnliskadiaaaiatspavhagliakakevqeywveywnsiphphsrthtlksgyvenpgdyaksirvsfiksksg 80 (126) T protein:vir:78 1 MKGLGNLISKADIAAAIATSPAVHAGLIAKAKEVQEYWVEYWNSIPHPHSRTHTLKSGYVENPGDYAKSIRVSFIKSKSG 80 (126) T ss_pred CcchhhhhhhhhhhhhhhcccchhhhhhhhhHHHHHHHHHHhhcCCCccccccccccccccCchhhhhhhheeeeecccC Confidence 6688877766543221 11223444555556666554322 222 24688999999988887776 Q ss_pred ceEEEEeeeccCCcccchhhhhhcccccccccccccccccccc--cCCCCCC Q lcl|NC_015262. 69 TKYVLVGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRII--NHPGVSP 118 (147) Q Consensus 69 ~~~~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~--~~~~~~a 118 (147) --...|-. ..+-.+|+|||....+.-..+.....-+- +.--..+ T Consensus 81 lpkarvma------tdykswwieygakhmpefaprahtlahfegggattvsa 126 (126) T protein:vir:78 81 LPKARVMA------TDYKSWWIEYGAKHMPEFAPRAHTLAHFEGGGATTVSA 126 (126) T ss_pred Ccccceeh------hhhhHHHHhhhhhhcccccccchhhhhccCCccccccC Confidence 54444421 12335679999764433222111110000 1111112 No 182 >protein:vir:101654 Length: 126 # NCBI annotation: gp17 # Family: family:all:11115 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654772;genbank:gi:109302770;genbank:GeneID:4156088 Probab=42.81 E-value=0.33 Score=23.22 Aligned_cols=106 Identities=15% Similarity=0.200 Sum_probs=52.0 Q ss_pred ehhHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHH---hCC--------------CCcchhhhcceecccccCCC Q lcl|NC_015262. 7 TEGFDAVLSKIESMGK-SGDKLLNEAVKAGGNVILQDALP---RVS--------------KRSGKLKDGLKVSGVKKKGG 68 (147) Q Consensus 7 i~Gl~el~~~l~~l~~-~~~~~~~~al~~~a~~v~~~ak~---~ap--------------~~tG~l~~sI~~~~~~~~~~ 68 (147) ++||..|+.+-+-... ....++-..|-.-|.+|++---. ..| ...|++..||+++..+..+| T Consensus 1 mkglgnliskadiaaaiatspavhagliakakevqeywveywnsiphphsrthtlksgyvenpgdyaksirvsfiksksg 80 (126) T protein:vir:10 1 MKGLGNLISKADIAAAIATSPAVHAGLIAKAKEVQEYWVEYWNSIPHPHSRTHTLKSGYVENPGDYAKSIRVSFIKSKSG 80 (126) T ss_pred CcchhhhhhhhhhhhhhhcccchhhhhhhhhHHHHHHHHHHhhcCCCccccccccccccccCchhhhhhhheeeeecccC Confidence 6688877766543221 11223444555556666554322 222 24688999999988887776 Q ss_pred ceEEEEeeeccCCcccchhhhhhcccccccccccccccccccc--cCCCCCC Q lcl|NC_015262. 69 TKYVLVGITKEDNSKIFYGKFLEFGASAHKIPIKKGKKKGRII--NHPGVSP 118 (147) Q Consensus 69 ~~~~~Vg~~~~~~~~~~y~~~vE~GT~~~~~~~~~~~~~~~~~--~~~~~~a 118 (147) --...|-. ..+-.+|+|||....+.-..+.....-+- +.--..+ T Consensus 81 lpkarvma------tdykswwieygakhmpefaprahtlahfegggattvsa 126 (126) T protein:vir:10 81 LPKARVMA------TDYKSWWIEYGAKHMPEFAPRAHTLAHFEGGGATTVSA 126 (126) T ss_pred Ccccceeh------hhhhHHHHhhhhhhcccccccchhhhhccCCccccccC Confidence 54444421 12335679999764433222111110000 1111112 No 183 >protein:vir:99454 Length: 150 # NCBI annotation: hypothetical protein # Family: family:all:32760 # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919085;genbank:gi:119757043;genbank:GeneID:4606107 Probab=38.74 E-value=1.1 Score=20.46 Aligned_cols=123 Identities=15% Similarity=0.148 Sum_probs=65.5 Q ss_pred Ccee--eeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhcceecccccCCCceEEEEeeec Q lcl|NC_015262. 1 MSVE--ITTEGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRVSKRSGKLKDGLKVSGVKKKGGTKYVLVGITK 78 (147) Q Consensus 1 M~~~--~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~Vg~~~ 78 (147) |..= |.-.--++|+++ |.+.....+..++.+-|..|.+.--+.--.+-..+.+.= .+.+.+.+|..++.-||. T Consensus 1 mt~l~~f~~d~re~lld~---le~~areeiap~vq~~ahdile~yg~~hdydv~~iiea~-et~v~rr~~rvvvr~gwp- 75 (150) T protein:vir:99 1 MTTLAGFEADAREALLDE---LEDHAREEIAPAVQQHAHDILEAYGRENDYDVQSIIDAA-ETRVERRKGSVVVRWGWP- 75 (150) T ss_pred CCccchhhHHHHHHHHHH---HHHHHHHhhhHHHHHHHHHHHHHhccccccchhhhhhhh-hhheeecCCeEEEEecCC- Confidence 6532 221222233333 333333445555666666665554433333322222221 234445566655555553 Q ss_pred cCCcccchhhhhhccccccccccc-------------------------ccccccccccCCCCCCCcchhhHHHHHHHHH Q lcl|NC_015262. 79 EDNSKIFYGKFLEFGASAHKIPIK-------------------------KGKKKGRIINHPGVSPKPFLAPAYESKKDEA 133 (147) Q Consensus 79 ~~~~~~~y~~~vE~GT~~~~~~~~-------------------------~~~~~~~~~~~~~~~a~PFl~pA~~~~~~~~ 133 (147) .-+.|.|-||..|....+ .+..+...+...|.|---|+|.++.--+.++ T Consensus 76 ------epaiyfergt~dhvvea~nad~lsfvwedpp~wvre~fe~e~~g~rvfl~e~~v~glpesrfirdtln~lr~~f 149 (150) T protein:vir:99 76 ------EPAIFFERGTVDHVVEATNADVLSFIWEDPPRWVRQGYEREGGGWRVFLPEVEVSGLPESRFIRDTLNWLRRRF 149 (150) T ss_pred ------CcceeeeccchhhhhhccccchhhhhhcCchhHhHhhcCcCCCceEEEeecccccCCcchhhHHHHHHHHHHhc Confidence 225677888876654432 3445667778888999999999887766666 Q ss_pred H Q lcl|NC_015262. 134 K 134 (147) Q Consensus 134 ~ 134 (147) . T Consensus 150 a 150 (150) T protein:vir:99 150 A 150 (150) T ss_pred C Confidence 5 No 184 >protein:vir:3787 Length: 231 # NCBI annotation: orf22 # Family: family:all:743 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536827;genbank:gi:17981836;genbank:GeneID:929215 Probab=32.00 E-value=1.5 Score=19.69 Aligned_cols=142 Identities=13% Similarity=0.112 Sum_probs=61.3 Q ss_pred Cceeeee--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-----C-------CC--cch--hhhcc---- Q lcl|NC_015262. 1 MSVEITT--EGFDAVLSKIESMGKSGDKLLNEAVKAGGNVILQDALPRV-----S-------KR--SGK--LKDGL---- 58 (147) Q Consensus 1 M~~~~~i--~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~ak~~a-----p-------~~--tG~--l~~sI---- 58 (147) |++.+++ +++..|.+.|..|. .....-+.-+...|..+...+++++ | +. .|. ...-+ T Consensus 1 m~~~~~~n~~dl~~l~~~L~ll~-L~p~kRrrLl~~iak~lr~~~~~rI~~Q~~PDGs~w~pRK~~~~k~k~~rm~~kL~ 79 (231) T protein:vir:37 1 MQIRLGLKQEDLDAFVRDLRTLN-LTGKQKKKILTWTLGAIKRKSQKNIREQHSPDGTAWEKRKPVDGEIKNKRLLKKVL 79 (231) T ss_pred CCccCCcCHHHHHHHHHHHHHhc-CCHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCcCchhcccccchhhHHHHHHhH Confidence 7755554 47777877777541 1112233455666667777776654 2 21 222 11111 Q ss_pred eecccccCCCceEEEEeeeccCCcccchhhhhhcccccccc----------------------------------cccc- Q lcl|NC_015262. 59 KVSGVKKKGGTKYVLVGITKEDNSKIFYGKFLEFGASAHKI----------------------------------PIKK- 103 (147) Q Consensus 59 ~~~~~~~~~~~~~~~Vg~~~~~~~~~~y~~~vE~GT~~~~~----------------------------------~~~~- 103 (147) .........+...+.++... .....|....||-..... .++. T Consensus 80 ~~~~~~~~~~~~~~~~~~~g---~~~~IA~vHQ~G~~~rv~~~~~~~~~~~~~~~pATr~QAk~Lr~lGy~v~~~k~k~~ 156 (231) T protein:vir:37 80 RYASILAEERGKGRIYYKNP---LTGEIAQKQQDGFTEHFRVFATDKNKNGSGNDRATIRQAQKLRSLGYRKRNGKNRQG 156 (231) T ss_pred HhhccccccCCceEEeeecc---hHHHHHHHhhcCcccccchhhhhhccCCCCCCCCCHHHHHHHHHhcccccCCCCCCC Confidence 11112222222222222111 122333344444211110 0000 Q ss_pred ---ccc-------------------cccc--c--------cCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_015262. 104 ---GKK-------------------KGRI--I--------NHPGVSPKPFLAPAYESKKDEAKNVMKEILKRGLG 146 (147) Q Consensus 104 ---~~~-------------------~~~~--~--------~~~~~~a~PFl~pA~~~~~~~~~~~i~~~l~~~i~ 146 (147) ++. +... . =.-..|++|||-..-++....+...|...+...-- T Consensus 157 k~~~rkps~kwI~~~ls~~qAgliIR~L~~k~~~~~~k~~W~I~~paR~FLG~~~~e~~~~l~~~l~~i~~~~~~ 231 (231) T protein:vir:37 157 KTKYRLYTIKEIRERLTRTWASMEIRRLENKVNAGNGKTNWEIHVPARPFLDTREKENVDILREITLKFLSGEYK 231 (231) T ss_pred CCCcCcCCHHHHHHhhhhHHHHHHHHHHhcccccccCcceeeeecCcccccCCCHHHHHHHHHHHHHHHhcccCC Confidence 000 0000 0 01246999999977776666655555555544444 Done!