Query lcl|NC_011044.1_cdsid_YP_002003860.1 [gene=Nigel_21] [protein=gp21] [protein_id=YP_002003860.1] [location=18187..18600] Match_columns 137 No_of_seqs 107 out of 279 Neff 7.6 Searched_HMMs 1612 Date Thu Nov 7 13:58:54 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_21 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_21_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:106041 Length: 137 100.0 6.1E-48 3.8E-51 279.4 11.9 137 1-137 1-137 (137) 2 protein:vir:97982 Length: 140 100.0 7.7E-45 4.7E-48 262.4 12.6 137 1-137 1-140 (140) 3 protein:vir:107545 Length: 140 100.0 7.7E-45 4.7E-48 262.4 12.6 137 1-137 1-140 (140) 4 protein:vir:99101 Length: 142 100.0 1.6E-42 1E-45 249.7 12.9 137 1-137 2-142 (142) 5 protein:vir:8669 Length: 142 # 100.0 1.6E-42 1E-45 249.7 12.9 137 1-137 2-142 (142) 6 protein:vir:102441 Length: 137 100.0 1.2E-41 7.6E-45 244.9 11.2 135 1-135 1-137 (137) 7 protein:vir:106506 Length: 137 100.0 3.3E-41 2E-44 242.5 10.9 136 2-137 1-136 (137) 8 protein:vir:94654 Length: 142 100.0 3.4E-35 2.1E-38 209.6 13.0 132 1-135 4-142 (142) 9 protein:vir:94108 Length: 149 100.0 4.8E-33 3E-36 197.8 10.9 133 1-136 17-149 (149) 10 protein:vir:96121 Length: 137 100.0 1.3E-32 7.8E-36 195.5 11.5 133 1-136 1-137 (137) 11 protein:vir:107099 Length: 137 100.0 1.8E-32 1.1E-35 194.6 11.9 133 1-136 1-137 (137) 12 protein:vir:105916 Length: 149 100.0 1.6E-32 9.7E-36 194.9 10.9 133 1-136 17-149 (149) 13 protein:vir:105330 Length: 137 100.0 6.9E-32 4.3E-35 191.4 11.2 133 1-136 1-137 (137) 14 protein:vir:94796 Length: 137 99.9 9.4E-32 5.8E-35 190.7 11.6 133 1-136 5-137 (137) 15 protein:vir:95894 Length: 137 99.9 1.4E-31 8.5E-35 189.8 11.5 133 1-136 1-137 (137) 16 protein:vir:94490 Length: 137 99.9 1.8E-31 1.1E-34 189.2 11.4 133 1-136 1-137 (137) 17 protein:vir:97427 Length: 137 99.9 1.8E-31 1.1E-34 189.2 11.4 133 1-136 1-137 (137) 18 protein:vir:93738 Length: 137 99.9 1.8E-31 1.1E-34 189.2 11.4 133 1-136 1-137 (137) 19 protein:vir:96829 Length: 135 99.9 8.4E-31 5.2E-34 185.5 12.1 131 1-136 1-135 (135) 20 protein:vir:5978 Length: 144 # 99.9 6.3E-31 3.9E-34 186.2 11.1 132 1-137 4-142 (144) 21 protein:vir:101594 Length: 173 99.9 1.1E-30 7.1E-34 184.7 11.2 135 1-137 3-165 (173) 22 protein:vir:95062 Length: 116 99.9 1.1E-30 7E-34 184.8 9.4 116 18-136 1-116 (116) 23 protein:vir:97327 Length: 116 99.9 7.4E-30 4.6E-33 180.3 9.7 116 18-136 1-116 (116) 24 protein:vir:1243 Length: 116 # 99.9 7.4E-30 4.6E-33 180.3 9.7 116 18-136 1-116 (116) 25 protein:vir:106570 Length: 182 99.9 2.1E-29 1.3E-32 177.8 11.1 136 1-137 6-171 (182) 26 protein:vir:78077 Length: 141 99.9 2.2E-28 1.4E-31 172.2 10.8 128 2-137 1-135 (141) 27 protein:vir:9930 Length: 108 # 99.8 3.3E-24 2E-27 149.3 9.8 104 1-137 1-105 (108) 28 protein:vir:94538 Length: 125 99.8 9.9E-24 6.1E-27 146.7 8.8 108 1-137 5-117 (125) 29 protein:vir:95789 Length: 114 99.8 1.1E-23 6.7E-27 146.5 8.7 103 1-137 1-108 (114) 30 protein:vir:98409 Length: 108 99.8 5.8E-23 3.6E-26 142.5 9.0 103 1-137 3-106 (108) 31 protein:vir:743 Length: 108 # 99.8 7.6E-23 4.7E-26 141.9 9.2 103 1-137 3-106 (108) 32 protein:vir:99744 Length: 115 99.8 8.9E-23 5.5E-26 141.5 9.3 104 1-137 3-113 (115) 33 protein:vir:3617 Length: 112 # 99.8 7.4E-23 4.6E-26 141.9 8.8 105 1-137 1-110 (112) 34 protein:vir:106623 Length: 115 99.8 2.4E-22 1.5E-25 139.1 9.3 104 1-137 3-113 (115) 35 protein:vir:103917 Length: 115 99.8 3E-22 1.9E-25 138.5 9.7 104 1-137 3-113 (115) 36 protein:vir:96358 Length: 115 99.8 3E-22 1.9E-25 138.5 9.7 104 1-137 3-113 (115) 37 protein:vir:96225 Length: 115 99.8 3E-22 1.9E-25 138.5 9.7 104 1-137 3-113 (115) 38 protein:vir:9312 Length: 115 # 99.8 3E-22 1.9E-25 138.5 9.7 104 1-137 3-113 (115) 39 protein:vir:97144 Length: 115 99.8 3E-22 1.9E-25 138.5 9.7 104 1-137 3-113 (115) 40 protein:vir:78858 Length: 115 99.8 3E-22 1.9E-25 138.5 9.7 104 1-137 3-113 (115) 41 protein:vir:96486 Length: 112 99.8 3.2E-22 2E-25 138.4 8.9 103 1-137 6-111 (112) 42 protein:vir:105467 Length: 144 99.8 1E-21 6.3E-25 135.7 8.7 123 1-137 1-134 (144) 43 protein:vir:97088 Length: 157 99.7 3.9E-21 2.4E-24 132.4 10.0 132 1-137 1-152 (157) 44 protein:vir:2740 Length: 114 # 99.7 2.2E-21 1.4E-24 133.9 7.6 103 1-137 6-114 (114) 45 protein:vir:4906 Length: 114 # 99.7 2.2E-21 1.4E-24 133.9 7.6 103 1-137 6-114 (114) 46 protein:vir:100075 Length: 140 99.7 2.3E-20 1.4E-23 128.3 8.5 108 1-137 6-127 (140) 47 protein:vir:100243 Length: 140 99.7 6.9E-20 4.3E-23 125.6 8.8 108 1-137 6-127 (140) 48 protein:vir:1437 Length: 140 # 99.7 1.8E-19 1.1E-22 123.3 8.6 108 1-137 6-127 (140) 49 protein:vir:80362 Length: 140 99.7 2.2E-19 1.4E-22 122.9 8.6 108 1-137 6-127 (140) 50 protein:vir:79034 Length: 141 99.6 8.1E-19 5E-22 119.8 8.7 114 1-137 1-129 (141) 51 protein:vir:93617 Length: 148 99.6 1.2E-18 7.2E-22 118.9 8.4 108 1-137 2-137 (148) 52 protein:vir:194 Length: 149 # 99.6 2.8E-18 1.7E-21 116.8 7.9 108 1-137 2-138 (149) 53 protein:vir:1273 Length: 127 # 99.6 5.7E-18 3.5E-21 115.1 8.6 108 1-137 6-121 (127) 54 protein:vir:1891 Length: 179 # 99.5 1.7E-17 1E-20 112.6 8.3 132 1-137 9-160 (179) 55 protein:vir:5745 Length: 135 # 99.5 4.3E-17 2.6E-20 110.3 8.9 108 1-137 7-125 (135) 56 protein:vir:102963 Length: 163 99.5 3.5E-17 2.2E-20 110.8 8.2 113 1-137 1-148 (163) 57 protein:vir:105089 Length: 133 99.5 6E-17 3.7E-20 109.5 8.8 108 1-137 6-125 (133) 58 protein:vir:105007 Length: 146 99.5 6.1E-17 3.8E-20 109.5 8.6 108 1-137 9-137 (146) 59 protein:vir:107568 Length: 146 99.5 6.1E-17 3.8E-20 109.5 8.6 108 1-137 9-137 (146) 60 protein:vir:102085 Length: 146 99.5 6.1E-17 3.8E-20 109.5 8.6 108 1-137 9-137 (146) 61 protein:vir:102875 Length: 146 99.5 6.1E-17 3.8E-20 109.5 8.6 108 1-137 9-137 (146) 62 protein:vir:99528 Length: 92 # 99.5 3.1E-17 2E-20 111.1 6.4 82 1-87 4-92 (92) 63 protein:vir:4347 Length: 164 # 99.5 6.8E-17 4.2E-20 109.2 8.1 108 1-137 9-153 (164) 64 protein:vir:9879 Length: 127 # 99.5 1.4E-16 8.7E-20 107.5 7.4 109 1-136 1-127 (127) 65 protein:vir:81147 Length: 126 99.4 3.9E-16 2.4E-19 105.0 8.8 112 1-137 4-121 (126) 66 protein:vir:3873 Length: 128 # 99.4 3.7E-16 2.3E-19 105.2 8.5 108 1-137 1-122 (128) 67 protein:vir:1386 Length: 149 # 99.4 7.7E-16 4.7E-19 103.5 8.2 108 1-137 9-146 (149) 68 protein:vir:9708 Length: 125 # 99.4 1.8E-15 1.1E-18 101.5 8.0 108 1-137 2-118 (125) 69 protein:vir:79988 Length: 125 99.3 1.8E-14 1.1E-17 95.9 9.0 108 1-137 1-123 (125) 70 protein:vir:4704 Length: 125 # 99.3 1.8E-14 1.1E-17 95.9 9.0 108 1-137 1-123 (125) 71 protein:vir:81106 Length: 125 99.3 1.8E-14 1.1E-17 95.9 9.0 108 1-137 1-123 (125) 72 protein:vir:9414 Length: 125 # 99.3 1.8E-14 1.1E-17 95.9 9.0 108 1-137 1-123 (125) 73 protein:vir:98342 Length: 125 99.3 1.8E-14 1.1E-17 95.9 9.0 108 1-137 1-123 (125) 74 protein:vir:102338 Length: 116 99.2 2E-14 1.3E-17 95.7 6.2 102 18-137 1-109 (116) 75 protein:vir:966 Length: 123 # 99.1 2.7E-13 1.7E-16 89.5 9.1 110 1-137 1-120 (123) 76 protein:vir:102154 Length: 119 99.1 1.8E-13 1.1E-16 90.5 5.2 106 1-137 6-113 (119) 77 protein:vir:104347 Length: 145 99.0 4.6E-12 2.9E-15 82.7 9.3 108 1-137 5-144 (145) 78 protein:vir:94994 Length: 131 98.9 6.2E-12 3.8E-15 82.0 8.6 102 1-135 1-131 (131) 79 protein:vir:78380 Length: 131 98.9 1.1E-11 6.8E-15 80.7 9.4 102 1-135 1-131 (131) 80 protein:vir:107703 Length: 147 98.9 1.2E-11 7.3E-15 80.5 9.5 108 1-137 1-146 (147) 81 protein:vir:80425 Length: 134 98.9 1.3E-11 8E-15 80.3 9.7 104 1-137 1-132 (134) 82 protein:vir:103280 Length: 142 98.9 1.1E-11 6.7E-15 80.7 9.1 108 1-137 1-141 (142) 83 protein:vir:95157 Length: 144 98.8 1E-10 6.3E-14 75.4 10.0 108 1-137 1-143 (144) 84 protein:vir:79638 Length: 146 98.8 1.1E-10 6.6E-14 75.3 9.8 108 1-137 1-143 (146) 85 protein:vir:3163 Length: 145 # 98.8 2.2E-11 1.4E-14 79.0 6.0 105 1-137 2-134 (145) 86 protein:vir:97190 Length: 148 98.7 1.2E-10 7.3E-14 75.0 9.0 108 1-137 1-145 (148) 87 protein:vir:81067 Length: 119 98.7 1.2E-11 7.7E-15 80.4 3.6 97 33-137 1-106 (119) 88 protein:vir:10367 Length: 119 98.7 1.2E-11 7.6E-15 80.4 3.6 98 33-137 1-106 (119) 89 protein:vir:94944 Length: 121 98.7 6.7E-11 4.1E-14 76.4 7.2 97 1-128 2-121 (121) 90 protein:vir:99833 Length: 190 98.7 2E-10 1.2E-13 73.8 8.2 125 1-137 2-181 (190) 91 protein:vir:80116 Length: 127 98.6 7.1E-10 4.4E-13 70.7 8.6 108 1-137 1-121 (127) 92 protein:vir:95372 Length: 124 98.5 6.8E-10 4.2E-13 70.8 8.1 108 1-137 1-121 (124) 93 protein:vir:7449 Length: 123 # 98.5 5.6E-10 3.5E-13 71.3 7.2 113 1-135 4-123 (123) 94 protein:vir:79225 Length: 155 98.5 4.1E-10 2.5E-13 72.0 6.5 109 1-137 1-146 (155) 95 protein:vir:101508 Length: 120 98.5 4.7E-10 2.9E-13 71.7 6.6 110 1-135 4-120 (120) 96 protein:vir:107851 Length: 175 98.5 4.2E-10 2.6E-13 72.0 6.1 110 1-137 1-163 (175) 97 protein:vir:96774 Length: 152 98.4 3.1E-09 1.9E-12 67.2 9.7 104 1-137 11-152 (152) 98 protein:vir:79091 Length: 175 98.4 8.6E-10 5.3E-13 70.3 6.6 110 1-137 1-163 (175) 99 protein:vir:99196 Length: 155 98.4 8.4E-10 5.2E-13 70.3 6.2 109 1-137 1-146 (155) 100 protein:vir:6246 Length: 143 # 98.4 4.3E-10 2.6E-13 72.0 4.4 107 1-137 11-136 (143) 101 protein:vir:1988 Length: 156 # 98.4 1.9E-09 1.2E-12 68.4 7.0 111 1-137 1-149 (156) 102 protein:vir:103841 Length: 155 98.3 3.5E-09 2.2E-12 66.9 6.3 109 1-137 1-146 (155) 103 protein:vir:1332 Length: 143 # 98.3 1.7E-09 1.1E-12 68.6 4.3 107 1-137 11-136 (143) 104 protein:vir:4200 Length: 133 # 98.2 3.5E-09 2.2E-12 66.9 5.6 123 2-136 1-133 (133) 105 protein:vir:96288 Length: 100 98.2 2.5E-09 1.6E-12 67.7 4.5 84 1-99 17-100 (100) 106 protein:vir:105773 Length: 131 98.2 1.6E-08 9.7E-12 63.4 7.7 121 1-137 3-127 (131) 107 protein:vir:4162 Length: 133 # 98.1 9E-09 5.6E-12 64.7 4.8 123 2-137 1-126 (133) 108 protein:vir:4956 Length: 153 # 98.1 1.4E-08 8.4E-12 63.7 5.5 108 1-137 4-133 (153) 109 protein:vir:100887 Length: 139 98.0 3.6E-08 2.3E-11 61.4 6.0 104 5-137 1-129 (139) 110 protein:vir:102190 Length: 93 97.8 2.4E-08 1.5E-11 62.4 3.2 89 22-135 1-93 (93) 111 protein:vir:98557 Length: 149 97.7 3.6E-07 2.3E-10 55.9 7.5 116 1-137 1-146 (149) 112 protein:vir:100223 Length: 139 97.6 2.1E-07 1.3E-10 57.2 5.5 108 1-137 1-129 (139) 113 protein:vir:5703 Length: 150 # 97.5 8.8E-07 5.4E-10 53.8 7.8 117 1-137 1-147 (150) 114 protein:vir:79115 Length: 148 97.5 1.2E-06 7.5E-10 53.0 8.2 116 1-137 1-145 (148) 115 protein:vir:6071 Length: 150 # 97.5 1.2E-06 7.4E-10 53.0 7.8 115 1-137 1-147 (150) 116 protein:vir:5000 Length: 141 # 97.5 2.9E-07 1.8E-10 56.4 4.4 108 1-137 1-133 (141) 117 protein:vir:2026 Length: 150 # 97.4 1.4E-06 8.4E-10 52.8 7.3 115 1-137 1-147 (150) 118 protein:vir:7993 Length: 108 # 97.4 1.4E-07 9E-11 58.1 1.6 98 1-107 1-108 (108) 119 protein:vir:1838 Length: 149 # 97.3 1.9E-06 1.2E-09 52.0 7.2 116 1-137 1-146 (149) 120 protein:vir:80970 Length: 112 97.3 1.1E-06 6.8E-10 53.3 5.9 106 1-137 1-111 (112) 121 protein:vir:79179 Length: 155 97.3 2.3E-06 1.4E-09 51.5 7.6 116 1-137 1-152 (155) 122 protein:vir:45 Length: 112 # N 97.3 1.1E-06 7E-10 53.2 5.8 103 1-137 1-103 (112) 123 protein:vir:4859 Length: 140 # 97.3 9.7E-07 6E-10 53.6 5.3 108 1-137 1-133 (140) 124 protein:vir:100652 Length: 134 97.1 3.4E-06 2.1E-09 50.5 6.6 113 1-134 1-134 (134) 125 protein:vir:100312 Length: 152 97.0 7E-06 4.3E-09 48.8 7.7 119 1-137 1-148 (152) 126 protein:vir:6216 Length: 125 # 96.9 6.5E-06 4E-09 49.0 6.7 110 1-137 1-119 (125) 127 protein:vir:101302 Length: 134 96.9 7.5E-06 4.6E-09 48.7 7.0 109 1-134 1-134 (134) 128 protein:vir:9513 Length: 134 # 96.9 7.5E-06 4.6E-09 48.7 7.0 109 1-134 1-134 (134) 129 protein:vir:4833 Length: 140 # 96.9 3.9E-06 2.4E-09 50.3 5.0 104 1-137 1-133 (140) 130 protein:vir:96105 Length: 193 96.8 3.2E-06 2E-09 50.7 4.2 106 1-137 1-125 (193) 131 protein:vir:4790 Length: 114 # 96.6 1.4E-05 9E-09 47.1 6.5 106 1-137 1-107 (114) 132 protein:vir:1164 Length: 156 # 96.5 4E-05 2.5E-08 44.7 8.4 115 1-137 1-149 (156) 133 protein:vir:8106 Length: 150 # 96.5 4.3E-06 2.6E-09 50.0 2.8 124 1-137 1-145 (150) 134 protein:vir:1581 Length: 116 # 96.5 2.8E-05 1.8E-08 45.5 7.1 112 1-137 1-114 (116) 135 protein:vir:9647 Length: 132 # 96.3 4.6E-06 2.8E-09 49.9 1.9 111 1-137 1-129 (132) 136 protein:vir:99546 Length: 200 96.2 4E-05 2.5E-08 44.7 6.5 107 1-137 5-132 (200) 137 protein:vir:9823 Length: 118 # 95.7 0.00013 7.8E-08 42.0 7.2 109 1-135 2-118 (118) 138 protein:vir:3036 Length: 118 # 95.7 0.00013 7.8E-08 42.0 7.2 109 1-135 2-118 (118) 139 protein:vir:98892 Length: 108 95.3 0.00014 8.9E-08 41.6 6.3 105 1-133 2-108 (108) 140 protein:vir:3848 Length: 159 # 95.3 0.00025 1.6E-07 40.3 7.5 111 1-137 1-152 (159) 141 protein:vir:98636 Length: 138 94.7 0.00036 2.2E-07 39.4 6.8 111 1-137 7-131 (138) 142 protein:vir:79687 Length: 113 94.0 0.00053 3.3E-07 38.5 6.2 102 1-137 1-104 (113) 143 protein:vir:105825 Length: 108 93.9 0.0001 6.2E-08 42.5 2.1 98 1-107 1-108 (108) 144 protein:vir:102608 Length: 108 93.9 0.0001 6.2E-08 42.5 2.1 98 1-107 1-108 (108) 145 protein:vir:99454 Length: 150 91.9 0.0049 3E-06 33.3 8.4 127 1-132 1-150 (150) 146 protein:vir:93898 Length: 133 91.6 0.0028 1.7E-06 34.6 6.8 108 1-133 1-133 (133) 147 protein:vir:5257 Length: 148 # 91.1 0.0006 3.7E-07 38.2 2.6 83 1-137 1-85 (148) 148 protein:vir:78163 Length: 92 # 91.1 0.0005 3.1E-07 38.7 2.1 91 1-108 1-92 (92) 149 protein:vir:94069 Length: 168 90.8 0.00048 3E-07 38.8 1.7 84 1-137 1-95 (168) 150 protein:vir:80037 Length: 199 90.3 0.00099 6.1E-07 37.1 3.0 98 1-137 1-133 (199) 151 protein:vir:78894 Length: 105 89.9 0.0013 7.8E-07 36.5 3.3 97 1-137 1-101 (105) 152 protein:vir:1087 Length: 161 # 88.6 0.0084 5.2E-06 32.0 6.8 125 1-137 2-157 (161) 153 protein:vir:96973 Length: 133 88.4 0.01 6.2E-06 31.5 7.1 108 1-133 1-133 (133) 154 protein:vir:78644 Length: 133 88.4 0.01 6.2E-06 31.5 7.1 108 1-133 1-133 (133) 155 protein:vir:9363 Length: 133 # 88.4 0.01 6.2E-06 31.5 7.1 108 1-133 1-133 (133) 156 protein:vir:94419 Length: 133 88.4 0.01 6.2E-06 31.5 7.1 108 1-133 1-133 (133) 157 protein:vir:78607 Length: 155 87.6 0.0014 8.7E-07 36.2 1.9 85 1-137 1-92 (155) 158 protein:vir:487 Length: 187 # 87.1 0.0042 2.6E-06 33.6 4.2 129 1-137 14-173 (187) 159 protein:vir:101563 Length: 155 87.0 0.00086 5.4E-07 37.4 0.4 85 1-137 1-92 (155) 160 protein:vir:78335 Length: 133 86.8 0.013 7.8E-06 31.0 6.7 112 1-137 1-128 (133) 161 protein:vir:106728 Length: 155 86.7 0.0017 1.1E-06 35.8 1.8 85 1-137 1-92 (155) 162 protein:vir:77650 Length: 155 86.3 0.0011 6.9E-07 36.8 0.6 85 1-137 1-92 (155) 163 protein:vir:95260 Length: 160 85.7 0.0028 1.7E-06 34.6 2.4 76 21-137 1-81 (160) 164 protein:vir:1028 Length: 168 # 81.8 0.037 2.3E-05 28.5 6.9 123 1-137 1-161 (168) 165 protein:vir:3994 Length: 168 # 80.4 0.038 2.4E-05 28.4 6.5 123 1-137 1-161 (168) 166 protein:vir:96012 Length: 133 78.4 0.041 2.6E-05 28.2 6.0 110 1-135 1-133 (133) 167 protein:vir:7412 Length: 168 # 78.1 0.069 4.3E-05 27.0 7.1 123 1-137 1-161 (168) 168 protein:vir:8432 Length: 149 # 77.5 0.071 4.4E-05 26.9 7.0 114 1-137 20-144 (149) 169 protein:vir:4514 Length: 168 # 73.0 0.1 6.2E-05 26.1 6.6 126 1-137 1-155 (168) 170 protein:vir:107757 Length: 189 69.2 0.062 3.9E-05 27.2 4.6 61 1-61 88-189 (189) 171 protein:vir:80037 Length: 199 68.8 0.035 2.2E-05 28.6 3.1 59 1-59 134-199 (199) 172 protein:vir:4096 Length: 140 # 64.5 0.1 6.3E-05 26.0 4.8 110 1-137 1-136 (140) 173 protein:vir:96105 Length: 193 63.1 0.041 2.6E-05 28.2 2.4 57 1-57 108-193 (193) 174 protein:vir:2688 Length: 123 # 62.5 0.17 0.0001 24.8 5.6 102 7-133 1-123 (123) 175 protein:vir:4460 Length: 170 # 54.9 0.093 5.7E-05 26.3 2.8 126 1-137 1-156 (170) 176 protein:vir:3787 Length: 231 # 53.6 0.52 0.00033 22.1 7.3 136 1-137 1-228 (231) 177 protein:vir:6375 Length: 205 # 45.4 0.77 0.00048 21.2 9.7 137 1-137 1-191 (205) 178 protein:vir:3427 Length: 192 # 34.4 0.99 0.00062 20.6 5.1 132 1-137 1-165 (192) No 1 >protein:vir:106041 Length: 137 # NCBI annotation: gp23 # Family: family:all:1084 # MgeID: mge:1505 # MgeName: Cooper # Cross-refs: genbank:acc:YP_654920;genbank:gi:109392376;genbank:GeneID:4157069 Probab=100.00 E-value=6.1e-48 Score=279.42 Aligned_cols=137 Identities=86% Similarity=1.407 Sum_probs=133.7 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccchhhhhc Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAAAVHE 80 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~vE~ 80 (137) ||||++|+++...+.+++++.+++.+++++..++++||.++|||||+||+||+......++.++++.|+++++||+|||| T Consensus 1 m~~s~~i~i~~~~l~~~v~~~~k~~l~~~a~~i~~~ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~ve~ 80 (137) T protein:vir:10 1 MPVTARIHINEPELERQTGAIFRGKHRSITRRIATQARADVPVRTGNLGRGIQEMPQTYRPFHVGGGVEDNVDYAAPVHE 80 (137) T ss_pred CCeeEEEeeCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhhcCceeeeeccccceEEEEEecCCCceeeeee Confidence 99999999999999999999999999999999999999999999999999999887777777889999999999999999 Q ss_pred CCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHhHhhccC Q lcl|NC_011044. 81 GSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAADPDIHMT 137 (137) Q Consensus 81 GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~~~i~~~ 137 (137) ||+||.|+|+++++|+|+|.|+++|+|+|+|||++|+|||+|||+++++++++|||| T Consensus 81 GT~ph~I~pk~~k~l~f~~~G~~v~~k~v~hpG~~a~Pfl~~A~~~~~~~~~ri~~~ 137 (137) T protein:vir:10 81 GSRPHRITARHANALHFFWHGREVFRKSVWHPGVRPRPFLRNAARRVVAADPDIHMT 137 (137) T ss_pred cCCCceeecccCceeeeeeCCceEEeeeeecCCCCCCchHHHHHHHHhhccccccCC Confidence 999999999999999999999999999999999999999999999999999999999 No 2 >protein:vir:97982 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1482 # MgeName: Orion # Cross-refs: genbank:acc:YP_655121;genbank:gi:109391871;genbank:GeneID:4157345 Probab=100.00 E-value=7.7e-45 Score=262.44 Aligned_cols=137 Identities=71% Similarity=1.166 Sum_probs=130.5 Q ss_pred Cce---ehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccchhh Q lcl|NC_011044. 1 MPV---TARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAAA 77 (137) Q Consensus 1 msv---~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~ 77 (137) |.- .++|+.+...+.+++.+.+++.+++++..++++||.++|||||+||+||+......++.++++.|+++++||+| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~ak~~aPvdtG~Lr~SI~~~~~~~~~~~~~~~v~~~a~YA~~ 80 (140) T protein:vir:97 1 MATIRARARIEIDEAALERESGEHLRAFHRSLTRRIANQSRVAVPVRTGNLGRTIGELPQVYTPFRVRGGVEATADYAAP 80 (140) T ss_pred CeeeeeeeeeeeCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccchhhhccceeeeeeCCCceEEEEecCCccchhh Confidence 443 46789999999999999999999999999999999999999999999999887777777889999999999999 Q ss_pred hhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHhHhhccC Q lcl|NC_011044. 78 VHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAADPDIHMT 137 (137) Q Consensus 78 vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~~~i~~~ 137 (137) |||||+||.|+|+++++|+|+|.|+++|+|+|+|||++|||||+||++++++++++|+|| T Consensus 81 Ve~GT~ph~I~pk~~k~L~~~~~G~~~~~k~V~hpG~~a~Pfl~~A~~~~~~~~~~i~~~ 140 (140) T protein:vir:97 81 VHEGSRPHAIRARNAQYLHFWWHGREMFRKSVWHPGTRARPFMRNSAQRVVTNDPRVRMT 140 (140) T ss_pred hccCCCCceeecCCCccceeecCCCEEEeeeeecCCCCCChhHHHHHHHHhhhhhhccCC Confidence 999999999999999999999999999999999999999999999999999999999999 No 3 >protein:vir:107545 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1481 # MgeName: PG1 # Cross-refs: genbank:acc:NP_943803;genbank:gi:38638428;genbank:GeneID:2657225 Probab=100.00 E-value=7.7e-45 Score=262.44 Aligned_cols=137 Identities=71% Similarity=1.166 Sum_probs=130.5 Q ss_pred Cce---ehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccchhh Q lcl|NC_011044. 1 MPV---TARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAAA 77 (137) Q Consensus 1 msv---~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~ 77 (137) |.- .++|+.+...+.+++.+.+++.+++++..++++||.++|||||+||+||+......++.++++.|+++++||+| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~ak~~aPvdtG~Lr~SI~~~~~~~~~~~~~~~v~~~a~YA~~ 80 (140) T protein:vir:10 1 MATIRARARIEIDEAALERESGEHLRAFHRSLTRRIANQSRVAVPVRTGNLGRTIGELPQVYTPFRVRGGVEATADYAAP 80 (140) T ss_pred CeeeeeeeeeeeCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccchhhhccceeeeeeCCCceEEEEecCCccchhh Confidence 443 46789999999999999999999999999999999999999999999999887777777889999999999999 Q ss_pred hhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHhHhhccC Q lcl|NC_011044. 78 VHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAADPDIHMT 137 (137) Q Consensus 78 vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~~~i~~~ 137 (137) |||||+||.|+|+++++|+|+|.|+++|+|+|+|||++|||||+||++++++++++|+|| T Consensus 81 Ve~GT~ph~I~pk~~k~L~~~~~G~~~~~k~V~hpG~~a~Pfl~~A~~~~~~~~~~i~~~ 140 (140) T protein:vir:10 81 VHEGSRPHAIRARNAQYLHFWWHGREMFRKSVWHPGTRARPFMRNSAQRVVTNDPRVRMT 140 (140) T ss_pred hccCCCCceeecCCCccceeecCCCEEEeeeeecCCCCCChhHHHHHHHHhhhhhhccCC Confidence 999999999999999999999999999999999999999999999999999999999999 No 4 >protein:vir:99101 Length: 142 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1608 # MgeName: Qyrzula # Cross-refs: genbank:acc:YP_655705;genbank:gi:109521783;genbank:GeneID:4157823 Probab=100.00 E-value=1.6e-42 Score=249.68 Aligned_cols=137 Identities=55% Similarity=0.887 Sum_probs=127.3 Q ss_pred Cceehhh---hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecc-cCcEEEEEEecCccchh Q lcl|NC_011044. 1 MPVTARI---HINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRY-RPFHVDGGVEATADYAA 76 (137) Q Consensus 1 msv~~~l---~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~-~~~~~~~~v~~~~~YA~ 76 (137) |+++.++ +++++.+.++++..+++.+++++..++++||.++|||||+|++||+...... ...++++.|+++++||+ T Consensus 2 ~~~~~~~~gl~~~l~~~~~~~~~~~~~~i~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~~v~~~a~YA~ 81 (142) T protein:vir:99 2 VQVSVRYEGFDYNPVGAAAQVGPILRRTHSSLTRQIANETRARVPVLTGHLGRSVREDPQVMVTPFHVSGGVTAHAKYAA 81 (142) T ss_pred ceeEEEeeecchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeeeeccccccceEEEEeccCccccc Confidence 8887665 5899999999999999999999999999999999999999999998765433 34467888999999999 Q ss_pred hhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHhHhhccC Q lcl|NC_011044. 77 AVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAADPDIHMT 137 (137) Q Consensus 77 ~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~~~i~~~ 137 (137) ||||||+||.|+|+++++|.|+|.|+++|+|+|+||||+|||||+||+++++.++++|.+- T Consensus 82 ~ve~GT~ph~i~pk~~~al~f~~~g~~~~~k~v~hpG~~a~Pfl~~A~~~~~~~~~~~~~r 142 (142) T protein:vir:99 82 AVHEGTRPHVIRAKHAQALHFWWRGREVFVRQVNHPGTRARPYLRNAGEAVVRRDRRIRVR 142 (142) T ss_pred eeccCCccceeccccCceeeEecCCceeeeeeeecCCCCCCchhHHHHHHHHhhhhhhccC Confidence 9999999999999999999999999999999999999999999999999999999999999 No 5 >protein:vir:8669 Length: 142 # NCBI annotation: gp27 # Family: family:all:1084 # MgeID: mge:156 # MgeName: Rosebush # Cross-refs: genbank:acc:NP_817788;genbank:gi:29566220;genbank:GeneID:1259476 Probab=100.00 E-value=1.6e-42 Score=249.68 Aligned_cols=137 Identities=55% Similarity=0.887 Sum_probs=127.3 Q ss_pred Cceehhh---hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecc-cCcEEEEEEecCccchh Q lcl|NC_011044. 1 MPVTARI---HINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRY-RPFHVDGGVEATADYAA 76 (137) Q Consensus 1 msv~~~l---~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~-~~~~~~~~v~~~~~YA~ 76 (137) |+++.++ +++++.+.++++..+++.+++++..++++||.++|||||+|++||+...... ...++++.|+++++||+ T Consensus 2 ~~~~~~~~gl~~~l~~~~~~~~~~~~~~i~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~~v~~~a~YA~ 81 (142) T protein:vir:86 2 VQVSVRYEGFDYNPVGAAAQVGPILRRTHSSLTRQIANETRARVPVLTGHLGRSVREDPQVMVTPFHVSGGVTAHAKYAA 81 (142) T ss_pred ceeEEEeeecchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeeeeccccccceEEEEeccCccccc Confidence 8887665 5899999999999999999999999999999999999999999998765433 34467888999999999 Q ss_pred hhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHhHhhccC Q lcl|NC_011044. 77 AVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAADPDIHMT 137 (137) Q Consensus 77 ~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~~~i~~~ 137 (137) ||||||+||.|+|+++++|.|+|.|+++|+|+|+||||+|||||+||+++++.++++|.+- T Consensus 82 ~ve~GT~ph~i~pk~~~al~f~~~g~~~~~k~v~hpG~~a~Pfl~~A~~~~~~~~~~~~~r 142 (142) T protein:vir:86 82 AVHEGTRPHVIRAKHAQALHFWWRGREVFVRQVNHPGTRARPYLRNAGEAVVRRDRRIRVR 142 (142) T ss_pred eeccCCccceeccccCceeeEecCCceeeeeeeecCCCCCCchhHHHHHHHHhhhhhhccC Confidence 9999999999999999999999999999999999999999999999999999999999999 No 6 >protein:vir:102441 Length: 137 # NCBI annotation: gp26 # Family: family:all:1084 # MgeID: mge:1618 # MgeName: Pipefish # Cross-refs: genbank:acc:YP_655303;genbank:gi:109521866;genbank:GeneID:4157756 Probab=100.00 E-value=1.2e-41 Score=244.88 Aligned_cols=135 Identities=47% Similarity=0.693 Sum_probs=125.1 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecc-cCcEEEEEEecCccchhhhh Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRY-RPFHVDGGVEATADYAAAVH 79 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~-~~~~~~~~v~~~~~YA~~vE 79 (137) |.|+++++.+...+.++++.++++++++++..++++||.++|||||+|++||+...... ...++++.|+++++||+||| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~v~r~~l~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~~V~~~~~YA~~ve 80 (137) T protein:vir:10 1 MTVTARYERNPVGEARQFQVIARRRLSRITRGTANQARADVPVKTGNLGRSIREDPIVVAGPLRLDSGVTAHADYARYVH 80 (137) T ss_pred CeeEEEeccCchhHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccchhhhcCceeeeeeccccceEEEEecCCCccceeee Confidence 99999999999999999999999999999999999999999999999999998765433 34557889999999999999 Q ss_pred cCCCCCccccccCC-cceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHhHhhc Q lcl|NC_011044. 80 EGSRPHRIVARHAQ-ALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAADPDIH 135 (137) Q Consensus 80 ~GT~ph~i~pk~~k-~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~~~i~ 135 (137) |||+||.|+|++++ +|.|.+.|+++|+|+|+||||+|+|||+|||+++++++..-- T Consensus 81 ~GT~ph~I~Pk~~k~~l~~~~~g~~vf~k~V~hPG~~a~PfL~~A~~~~~~~~~~~~ 137 (137) T protein:vir:10 81 DGTRAHVIRPRRPGGVLRFTVGGRVVYARRVNHPGTRARPFLRNAAERVVARETATS 137 (137) T ss_pred cCCCCceeeccccceeeeEeeCCeeEecceeecCCCCCCchHHHHHHHhhhhhcccC Confidence 99999999999977 999999999999999999999999999999999998776544 No 7 >protein:vir:106506 Length: 137 # NCBI annotation: Pas21 # Family: family:all:1084 # MgeID: mge:1680 # MgeName: phiAsp2 # Cross-refs: genbank:acc:YP_024807;genbank:gi:48697422;genbank:GeneID:2846163 Probab=100.00 E-value=3.3e-41 Score=242.51 Aligned_cols=136 Identities=31% Similarity=0.362 Sum_probs=125.9 Q ss_pred ceehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccchhhhhcC Q lcl|NC_011044. 2 PVTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAAAVHEG 81 (137) Q Consensus 2 sv~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~vE~G 81 (137) -|++.++++...|.+++.+.+++++++++..++++||.++|+|||+|++||+.......+..+++.|+++++||+||||| T Consensus 1 ~~~~~~~l~~~~l~~~~~~~~~~~~~~~a~~ve~~ak~~aPv~TG~Lr~SI~~~~~~~~g~~v~~~V~~~~~YA~~ve~G 80 (137) T protein:vir:10 1 MVAHTLRIERAQLHGLGMDEARKAVNRVVRRTFTRSQILAPVDTGYLRASGRLVLGRERGAVVIGSVEYTARYAAAVHNG 80 (137) T ss_pred CcccccccChhhHhhHHHHHHHHHHHHHHHHHHHHHHhcCCcCchhhhccceeeeeeccccEEEEEecCCcccceeeecC Confidence 57778888888999999999999999999999999999999999999999998887777778999999999999999999 Q ss_pred CCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHhHhhccC Q lcl|NC_011044. 82 SRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAADPDIHMT 137 (137) Q Consensus 82 T~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~~~i~~~ 137 (137) |+||.|+|+++++|+|+|.|+++|+|+|+|||++|+|||+|||+++++++-+---. T Consensus 81 T~ph~I~pk~~kaL~f~~~G~~vf~k~V~hPG~k~~PfL~~Al~~~~~~~~~~~~~ 136 (137) T protein:vir:10 81 RRALTIRAKGNGRLKFTVEGRTVYARSVHQPARAGRPYLSQALREVAPQEGFRVTI 136 (137) T ss_pred CCCceeecCCCccceeecCCeeEeccceecCCCCCChhhHHHHHHhhcccceeEee Confidence 99999999999999999999999999999999999999999999988876643222 No 8 >protein:vir:94654 Length: 142 # NCBI annotation: tail component protein # Family: family:all:1084 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579211;genbank:gi:93007447;genbank:GeneID:5076773 Probab=100.00 E-value=3.4e-35 Score=209.55 Aligned_cols=132 Identities=23% Similarity=0.335 Sum_probs=113.4 Q ss_pred Cce---ehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccchhh Q lcl|NC_011044. 1 MPV---TARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAAA 77 (137) Q Consensus 1 msv---~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~ 77 (137) |++ +.+|...++.+.+++++.+++++.+++..++++|+.++|||||+|++||++.+. ..+..+++.|+++++||.| T Consensus 4 ~~~~~~~~~l~~~l~~~~~~~~~~~~~~l~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~-~~g~~~~~~v~~~~~YA~~ 82 (142) T protein:vir:94 4 LNYRVNSTEFQGALRAALDRLTGAAREATEAAANDMVNMAKGLCPVDTGRLRSSIQAVPS-GGRFSFSVTIGTNVTYAAD 82 (142) T ss_pred eEEEecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeec-cCCceEEEEEecCcccchh Confidence 433 356778889999999999999999999999999999999999999999987654 4456788999999999999 Q ss_pred hhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHH----HhHhhc Q lcl|NC_011044. 78 VHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAA----ADPDIH 135 (137) Q Consensus 78 vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~----~~~~i~ 135 (137) |||||+||.|+|+++++|+|. +..+++++|+|||++|+|||+||+++++. .+++|+ T Consensus 83 vE~Gt~~~~i~pk~~k~l~~~--~~~~~~~~v~~pG~~~~pfl~~A~~~~~~~i~~~~~~~~ 142 (142) T protein:vir:94 83 VEYGTAPHVIVPKDKKALYWP--GAAHPVAKVNHPGTRAQPFMRPAIAAASTFLRNHAKGIR 142 (142) T ss_pred hhccCCCceeccCCCccceec--ccceeeeeeeecCCCCCcchhHHHHHHHHHHHHHHHhcC Confidence 999999999999999999875 44567899999999999999999987544 223333 No 9 >protein:vir:94108 Length: 149 # NCBI annotation: ORF029 # Family: family:all:180 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240238;genbank:gi:66395914;genbank:GeneID:5133277 Probab=99.96 E-value=4.8e-33 Score=197.77 Aligned_cols=133 Identities=15% Similarity=0.054 Sum_probs=117.7 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccchhhhhc Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAAAVHE 80 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~vE~ 80 (137) -.+..+|...|+.+.+++++.+++++.+++..++++||.++|||||+|++||.+.+.. .++++.|+++++||+|||| T Consensus 17 ~~Gld~l~~~L~~~~~~~~~~~~~al~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~---~g~~~~V~~~~~YA~~VE~ 93 (149) T protein:vir:94 17 KYGADSMVVELDKFDKKIEEWVKKGIAKTTTKIYNTAVALAPVDLGFLEESIDFKYFD---GGLSSVISVGADYAIYVEY 93 (149) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhhcCeeEEeeC---CcEEEEEecCCCccccccc Confidence 1245678899999999999999999999999999999999999999999999876432 3588999999999999999 Q ss_pred CCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHhHhhcc Q lcl|NC_011044. 81 GSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAADPDIHM 136 (137) Q Consensus 81 GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~~~i~~ 136 (137) ||++|.++|+..+++.+.|.+.....+.+.|||++|||||+||+++..+...++=| T Consensus 94 GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~~~~i~~~i~ 149 (149) T protein:vir:94 94 GTGIYATGPGGSRATKIPWSFKGDDGEWYTTYGQAPQPFWNPAIDAGRKTFEQYFS 149 (149) T ss_pred CccccccCCCccccccccceeecCccceecCCCCCCCcchHHHHHHHHHHHHHhhC Confidence 99999999999998888777666667788899999999999999998887777777 No 10 >protein:vir:96121 Length: 137 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240082;genbank:gi:66395767;genbank:GeneID:5133101 Probab=99.96 E-value=1.3e-32 Score=195.46 Aligned_cols=133 Identities=15% Similarity=0.034 Sum_probs=118.0 Q ss_pred Cc----eehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccchh Q lcl|NC_011044. 1 MP----VTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAA 76 (137) Q Consensus 1 ms----v~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~ 76 (137) |. +..+|...|+.+.+.++..++++|.+++..++++|+.++|||||+|++||...+..+ ++++.|+++++||+ T Consensus 1 Ma~~~~G~~~l~~~l~~~~~~~~~~~~~~l~~~a~~~~~~ak~~~pvdTG~L~~Si~~~~~~~---g~~~~V~~~~~YA~ 77 (137) T protein:vir:96 1 MAKVKYGNWDLVAELEDYRDEMEEWVKKGILKTTLAIYNTAVALAPVDLGFLKESIDFKVTDG---GFSSVISVGAEYAI 77 (137) T ss_pred CchhHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCccchhcCceeEeecC---ceEEEEecCCCccc Confidence 54 467888999999999999999999999999999999999999999999998765432 47889999999999 Q ss_pred hhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHhHhhcc Q lcl|NC_011044. 77 AVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAADPDIHM 136 (137) Q Consensus 77 ~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~~~i~~ 136 (137) ||||||++|.+.|...+++.+.|.+.+.+.+.+.|||++|||||+||+++..+...++=- T Consensus 78 yvE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~pFl~pA~~~~~~~i~k~i~ 137 (137) T protein:vir:96 78 YVEFGTGIYATGPGGSRARKLPWTYKGDDGEWHTTYGQQAQPFWNPAIDEGRKVFNRYFS 137 (137) T ss_pred ccccCccccccCCCccccccccceeeccCcceeecCCCCCCcchhHHHHHHHHHHHHhhC Confidence 999999999999999999999999888888999999999999999999977765444333 No 11 >protein:vir:107099 Length: 137 # NCBI annotation: conserved phage protein # Family: family:all:180 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950610;genbank:gi:119953690;genbank:GeneID:4643108 Probab=99.96 E-value=1.8e-32 Score=194.62 Aligned_cols=133 Identities=17% Similarity=0.132 Sum_probs=117.3 Q ss_pred Cce----ehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccchh Q lcl|NC_011044. 1 MPV----TARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAA 76 (137) Q Consensus 1 msv----~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~ 76 (137) |+- ..+|...|+.+.+.+.+.+++++++++..++++||.++|||||+|++||...... .++++.|+++++||+ T Consensus 1 Ma~~~~Gl~~l~~~l~~~~~~~~~~~~~al~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~---~~~~~~V~~~~~Ya~ 77 (137) T protein:vir:10 1 MAKVKYGNWELVKELEDFEKETIRWAKKGIAKTTTIIHNSIVSNMPVDTGYLRESVSMDFKK---GGLTGVINIGSEYAV 77 (137) T ss_pred CchhHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCeeEEeeC---CcEEEEEecCCCccc Confidence 765 4588899999999999999999999999999999999999999999999865422 357889999999999 Q ss_pred hhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHhHhhcc Q lcl|NC_011044. 77 AVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAADPDIHM 136 (137) Q Consensus 77 ~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~~~i~~ 136 (137) ||||||++|.+.|...+.+.+.|.+.+...+.+.++|++|||||+||++++++...++=- T Consensus 78 ~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~i~k~i~ 137 (137) T protein:vir:10 78 YVNYGTGIYAVGPGGSRAKNIPWCYKDADGHWHTTKGQHAQPFWEPAIDEGRAFFNKYFS 137 (137) T ss_pred ccccCccccccCCCccccccccceeeccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Confidence 999999999999999999998888888888888899999999999999987765544443 No 12 >protein:vir:105916 Length: 149 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004379;genbank:gi:122891834;genbank:GeneID:4712387 Probab=99.95 E-value=1.6e-32 Score=194.94 Aligned_cols=133 Identities=15% Similarity=0.054 Sum_probs=117.6 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccchhhhhc Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAAAVHE 80 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~vE~ 80 (137) -.+..+|...|+.+.+++++.+++++.+.+..++++||.++|+|||+|++||.+.+.. .++++.|+++++||+|||| T Consensus 17 ~~Gld~l~~~l~~~~~~~~~~~~~~l~~~a~~v~~~ak~~aPvdTG~L~~SI~~~~~~---~g~~~~V~~~~~YA~~vE~ 93 (149) T protein:vir:10 17 KYGADSMVVELDKFDKKIEEWVKKGIAKTTTKIYNTAVALAPVDLGFLEESIDFKYFD---GGLSSVISVGADYAIYVEY 93 (149) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhhccceEEecC---CcEEEEEecCCCccccccc Confidence 1245678899999999999999999999999999999999999999999999876432 3588999999999999999 Q ss_pred CCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHhHhhcc Q lcl|NC_011044. 81 GSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAADPDIHM 136 (137) Q Consensus 81 GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~~~i~~ 136 (137) ||++|.++|...+++.+.|.......+.+.|||++|||||+||++++++...++=| T Consensus 94 GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~k~~i~~~i~ 149 (149) T protein:vir:10 94 GTGIYATGPGGSRATKIPWSFKGDDGEWYTTYGQAPQPFWNPAIDAGRKTFEQYFS 149 (149) T ss_pred CccccccCCcccccccccceeeccccceecCCCCCCCcchhHHHHHHHHHHHHhhC Confidence 99999999988888887777665666778899999999999999999998888877 No 13 >protein:vir:105330 Length: 137 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950673;genbank:gi:119967843;genbank:GeneID:4643209 Probab=99.95 E-value=6.9e-32 Score=191.43 Aligned_cols=133 Identities=17% Similarity=0.125 Sum_probs=115.3 Q ss_pred Cc----eehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccchh Q lcl|NC_011044. 1 MP----VTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAA 76 (137) Q Consensus 1 ms----v~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~ 76 (137) |. +..+|...|+.+++.++..+++++.+++..|+++||.++|||||+|++||+.... ..++++.|+++++||+ T Consensus 1 Ma~~~~G~~~l~~~l~~~~~~~~~~~~~al~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~---~~~~~~~V~~~~~YA~ 77 (137) T protein:vir:10 1 MAKVKYGNWDLVKELEEFEKETIRWAKKGIAKTTTIIHNSIVSNMPVDTGYLRESVSMDFK---KGGLTGVINIGSEYAV 77 (137) T ss_pred CccchhCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCeeeEec---CCcEEEEEecCCcccc Confidence 55 4568999999999999999999999999999999999999999999999986542 2357889999999999 Q ss_pred hhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHhHhhcc Q lcl|NC_011044. 77 AVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAADPDIHM 136 (137) Q Consensus 77 ~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~~~i~~ 136 (137) ||||||++|.++|+..+++.+.|.+.....+.+.++|++|||||+||++++++...++=- T Consensus 78 ~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 137 (137) T protein:vir:10 78 YVNYGTGIYAVGPGGSRAKNIPWRYKDADGHWHTTKGQHAQPFWEPAIDEGRAFFNKYFS 137 (137) T ss_pred ccccCccccccCCCcccccccceeeeccccccccCCCCCCCcchhHHHHHHHHHHHHhhC Confidence 999999999999999999888777666666667889999999999999987775555444 No 14 >protein:vir:94796 Length: 137 # NCBI annotation: ORF050 # Family: family:all:180 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240540;genbank:gi:66396237;genbank:GeneID:5133576 Probab=99.95 E-value=9.4e-32 Score=190.68 Aligned_cols=133 Identities=17% Similarity=0.077 Sum_probs=115.9 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccchhhhhc Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAAAVHE 80 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~vE~ 80 (137) -.+..+|...|+.+.+.++..+++++.+.+..++++|+.++|||||+|++||+..+.. .++++.|+++++||+|||| T Consensus 5 ~~G~~~l~~~L~~~~~~~~~~~~~al~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~---~~~~~~V~~~~~YA~~vE~ 81 (137) T protein:vir:94 5 KYGNWDLVKELENYERDIERWVKRGIAKTTVKIHNTIISLMPVDTGYLRESVTMDFKD---GGFTGVINIGSEYAIYVNY 81 (137) T ss_pred HHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCceeEeec---CcEEEEEecCCCccccccc Confidence 1257889999999999999999999999999999999999999999999999876532 3578899999999999999 Q ss_pred CCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHhHhhcc Q lcl|NC_011044. 81 GSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAADPDIHM 136 (137) Q Consensus 81 GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~~~i~~ 136 (137) ||+||.+.++..+.+.+.|.+.+...+.+.++|++|||||+||++++++...++=- T Consensus 82 GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:94 82 GTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRVFFNKYFS 137 (137) T ss_pred CccccccCCCcccccccccceeccCCceeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 99999999999998888888777777777888999999999999987765444333 No 15 >protein:vir:95894 Length: 137 # NCBI annotation: ORF046 # Family: family:all:180 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240389;genbank:gi:66396083;genbank:GeneID:5133405 Probab=99.95 E-value=1.4e-31 Score=189.78 Aligned_cols=133 Identities=19% Similarity=0.110 Sum_probs=115.3 Q ss_pred Cc----eehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccchh Q lcl|NC_011044. 1 MP----VTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAA 76 (137) Q Consensus 1 ms----v~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~ 76 (137) |. +..+|...|+.+.+.+++.+++++.+++..++++|+.++|||||+|++||++.+.. .++++.|+++++||+ T Consensus 1 Ma~~~~G~~~l~~~l~~~~~~~~~~~~~~~~~~a~~v~~~ak~~aPv~TG~L~~Si~~~~~~---~~~~~~V~~~~~YA~ 77 (137) T protein:vir:95 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKD---GGFTGVINIGSEYAI 77 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcCeeeEeeC---CceEEEEecCCCccc Confidence 44 57889999999999999999999999999999999999999999999999865432 357889999999999 Q ss_pred hhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHhHhhcc Q lcl|NC_011044. 77 AVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAADPDIHM 136 (137) Q Consensus 77 ~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~~~i~~ 136 (137) ||||||++|.+.+...+.+.+.|.+.+...+.+.++|++|||||+||+++.++...++=- T Consensus 78 ~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~i~k~l~ 137 (137) T protein:vir:95 78 YVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred ccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 999999999999999888888777777777777788999999999999977664444333 No 16 >protein:vir:94490 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240680;genbank:gi:66396374;genbank:GeneID:5133754 Probab=99.95 E-value=1.8e-31 Score=189.17 Aligned_cols=133 Identities=19% Similarity=0.116 Sum_probs=114.4 Q ss_pred Cc----eehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccchh Q lcl|NC_011044. 1 MP----VTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAA 76 (137) Q Consensus 1 ms----v~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~ 76 (137) |. +..+|...|+.+.+.+++.+++++.+++..++++|+.++|||||+|++||+..... .++++.|+++++||+ T Consensus 1 Ma~~~~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~---~~~~~~V~~~~~YA~ 77 (137) T protein:vir:94 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKD---SGFTGVINIGSEYAI 77 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEeec---CceEEEEecCCCccc Confidence 44 56789999999999999999999999999999999999999999999999876432 357889999999999 Q ss_pred hhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHhHhhcc Q lcl|NC_011044. 77 AVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAADPDIHM 136 (137) Q Consensus 77 ~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~~~i~~ 136 (137) ||||||+||.+.+...+...+.|.+.+...+.+.++|++|||||+||++++++...++=- T Consensus 78 ~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:94 78 YVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred ccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 999999999999998888877777666666667778999999999999987775444433 No 17 >protein:vir:97427 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240753;genbank:gi:66396447;genbank:GeneID:5133783 Probab=99.95 E-value=1.8e-31 Score=189.17 Aligned_cols=133 Identities=19% Similarity=0.116 Sum_probs=114.4 Q ss_pred Cc----eehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccchh Q lcl|NC_011044. 1 MP----VTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAA 76 (137) Q Consensus 1 ms----v~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~ 76 (137) |. +..+|...|+.+.+.+++.+++++.+++..++++|+.++|||||+|++||+..... .++++.|+++++||+ T Consensus 1 Ma~~~~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~---~~~~~~V~~~~~YA~ 77 (137) T protein:vir:97 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKD---SGFTGVINIGSEYAI 77 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEeec---CceEEEEecCCCccc Confidence 44 56789999999999999999999999999999999999999999999999876432 357889999999999 Q ss_pred hhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHhHhhcc Q lcl|NC_011044. 77 AVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAADPDIHM 136 (137) Q Consensus 77 ~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~~~i~~ 136 (137) ||||||+||.+.+...+...+.|.+.+...+.+.++|++|||||+||++++++...++=- T Consensus 78 ~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:97 78 YVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred ccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 999999999999998888877777666666667778999999999999987775444433 No 18 >protein:vir:93738 Length: 137 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240463;genbank:gi:66396153;genbank:GeneID:5133507 Probab=99.95 E-value=1.8e-31 Score=189.17 Aligned_cols=133 Identities=19% Similarity=0.116 Sum_probs=114.4 Q ss_pred Cc----eehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccchh Q lcl|NC_011044. 1 MP----VTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAA 76 (137) Q Consensus 1 ms----v~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~ 76 (137) |. +..+|...|+.+.+.+++.+++++.+++..++++|+.++|||||+|++||+..... .++++.|+++++||+ T Consensus 1 Ma~~~~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~---~~~~~~V~~~~~YA~ 77 (137) T protein:vir:93 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKD---SGFTGVINIGSEYAI 77 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEeec---CceEEEEecCCCccc Confidence 44 56789999999999999999999999999999999999999999999999876432 357889999999999 Q ss_pred hhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHhHhhcc Q lcl|NC_011044. 77 AVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAADPDIHM 136 (137) Q Consensus 77 ~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~~~i~~ 136 (137) ||||||+||.+.+...+...+.|.+.+...+.+.++|++|||||+||++++++...++=- T Consensus 78 ~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:93 78 YVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred ccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 999999999999998888877777666666667778999999999999987775444433 No 19 >protein:vir:96829 Length: 135 # NCBI annotation: ORF033 # Family: family:all:180 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240161;genbank:gi:66395838;genbank:GeneID:5133170 Probab=99.94 E-value=8.4e-31 Score=185.46 Aligned_cols=131 Identities=17% Similarity=0.098 Sum_probs=112.3 Q ss_pred Cc----eehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccchh Q lcl|NC_011044. 1 MP----VTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAA 76 (137) Q Consensus 1 ms----v~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~ 76 (137) |. +..+|..+|+.+.+.++..+++++.+++..++++|+.++|+|||+|++||.+... ..++++.|+++++||+ T Consensus 1 Ma~~~~Gl~~l~~~l~~~~~~~~~~~~~al~~~a~~v~~~ak~~apvdTG~Lr~SI~~~~~---~~g~~~~V~~~~~YA~ 77 (135) T protein:vir:96 1 MAKVKYGADSIVVDLEKYSKDMEKWVKKGITKTTLKIYNTAIHLMPVDTGFLRQSTTVDFE---NGGFTGVVKIGSNYAV 77 (135) T ss_pred CchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeEEee---cCcEEEEEecCCCccc Confidence 77 4568889999999999999999999999999999999999999999999987542 2347889999999999 Q ss_pred hhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHhHhhcc Q lcl|NC_011044. 77 AVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAADPDIHM 136 (137) Q Consensus 77 ~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~~~i~~ 136 (137) ||||||+||.+.|..++.+.|...+. ..+.+.++|++|||||+||++++.++..++=+ T Consensus 78 ~ve~GT~~~~~~~~~~~~~~~~~~~~--~g~~~~~~~~~a~pfl~~A~~~~~~~~~~~i~ 135 (135) T protein:vir:96 78 YVNYGTGIYATKGSRAHKIPWTYKDP--NGKWHTTYGQMPQPFWEPAIDAGRQTFEQYFS 135 (135) T ss_pred hhhcccccccCCCccccccccccccC--CcceeecCCcCCCcchhHHHHHHHHHHHHhcC Confidence 99999999999888777766644332 24567889999999999999998887777766 No 20 >protein:vir:5978 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690678;genbank:geneid:6329146;genbank:gi:22855072;interpro:IPR011693;uniprot:O48447;genbank:GeneID:955318 Probab=99.94 E-value=6.3e-31 Score=186.15 Aligned_cols=132 Identities=16% Similarity=0.153 Sum_probs=107.8 Q ss_pred Cce------ehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccc Q lcl|NC_011044. 1 MPV------TARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADY 74 (137) Q Consensus 1 msv------~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~Y 74 (137) ||+ ..+|..+++.+.+.+.+.+++++.++|..++++|+.++|||||+|++||.+... ..++++.|+++++| T Consensus 4 ms~~i~~~g~~~l~~~l~~~~~~~~~~v~~~l~~~a~~i~~~ak~~apv~TG~Lr~SI~~~~~---~~g~~~~V~~~~~Y 80 (144) T protein:vir:59 4 MSVRIDPSWRRIMSRNVRTFSGHVLTQVEQVIIKTAEKIAGLAASLAPVDEGNLKNSIQIDYK---NNGLTAEITVGAEY 80 (144) T ss_pred ceeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcCeeEEee---cCcEEEEEecCCCc Confidence 555 356777888889999999999999999999999999999999999999987653 23578999999999 Q ss_pred hhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHh-HhhccC Q lcl|NC_011044. 75 AAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAAD-PDIHMT 137 (137) Q Consensus 75 A~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~-~~i~~~ 137 (137) |+||||||+||.+.|..++..+++.... ..+.+.++|++|||||+||++++++.. .+|++. T Consensus 81 A~~vE~GT~~~~~~~~~~~~~~~~~~~~--~g~~~~t~g~~a~Pfl~pA~~~~~~~~~~~i~~~ 142 (144) T protein:vir:59 81 AIYVEYGTGIYAVDGNGRKTPWTYYSPK--LGRYVRTQGAPAQPFFWPAVEEGGEYFEREMRRL 142 (144) T ss_pred cchhhcCccccccCCCcccccccccccc--ccceecCCCCCCCcchhHHHHHHHHHHHHHHHHh Confidence 9999999999999999888877654322 233456789999999999999876533 334444 No 21 >protein:vir:101594 Length: 173 # NCBI annotation: hypothetical protein # Family: family:all:26502 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112510;genbank:gi:53793610;interpro:IPR010064;uniprot:Q5ZGE3;genbank:GeneID:3101702 Probab=99.94 E-value=1.1e-30 Score=184.73 Aligned_cols=135 Identities=14% Similarity=0.167 Sum_probs=108.5 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccchhhhhc Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAAAVHE 80 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~vE~ 80 (137) ++|..+|..+|+.|.+.++..+++++.+++..|+++|+.++|||||+|++||.+...... .++++.|+++.+||.|||| T Consensus 3 i~Gld~L~~~L~~l~~~~~~~~~~a~~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~~~~~-~~~~~~v~~~~~Ya~fvEf 81 (173) T protein:vir:10 3 VKGVAEVIAELRKIGKDIDKNINATTEEAANFIEDRAKTLAPKNFGKLAQSISTSDLKAK-DLISKKITVNELYGAYMEF 81 (173) T ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCchhhhhcceeeeeccC-ceeEEeeCCCcccchhhhc Confidence 777899999999999999999999999999999999999999999999999987765443 4688899999999999999 Q ss_pred CCCCCccccccCCcce---------eecC------------------CeeEEeeeEecCCCCCCchhhhhHHHHHH-HhH Q lcl|NC_011044. 81 GSRPHRIVARHAQALH---------FFWH------------------GREIFRKSVWHPGVRSRPFLRNAAQRIAA-ADP 132 (137) Q Consensus 81 GT~ph~i~pk~~k~l~---------~~~~------------------g~~~~~k~V~~pG~~a~pfl~~A~~~~~~-~~~ 132 (137) ||+.|...|+...... ++.. +...|+ .++||||+|||||+||+++++. ... T Consensus 82 GT~~m~a~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~-~~~~~G~~aqPFl~PA~~~~~~~~~~ 160 (173) T protein:vir:10 82 GTGAKVSVPKEFADMAASFKGQKTGSFKDGLESIKAWCRAKGIDEKAAYPIFA-KILGAGINPQPFLYPAWIEGKKQYLK 160 (173) T ss_pred ccccccCCCchhhhhhcccccccccccccccccccccccccccchhcccceee-EeecCCCCCCccchhHHHHhHHHHHH Confidence 9998877775221100 0000 112333 4789999999999999999876 445 Q ss_pred hhccC Q lcl|NC_011044. 133 DIHMT 137 (137) Q Consensus 133 ~i~~~ 137 (137) +|.+. T Consensus 161 ~i~~~ 165 (173) T protein:vir:10 161 DLENL 165 (173) T ss_pred HHHHH Confidence 55555 No 22 >protein:vir:95062 Length: 116 # NCBI annotation: ORF044 # Family: family:all:180 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240827;genbank:gi:66394711;genbank:GeneID:5133856 Probab=99.94 E-value=1.1e-30 Score=184.76 Aligned_cols=116 Identities=19% Similarity=0.115 Sum_probs=101.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccchhhhhcCCCCCccccccCCccee Q lcl|NC_011044. 18 SGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAAAVHEGSRPHRIVARHAQALHF 97 (137) Q Consensus 18 ~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~vE~GT~ph~i~pk~~k~l~~ 97 (137) +++++++++.+++..++++||.++|||||+|++||...... .++++.|+++++||+||||||++|.++|+..+++.+ T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~apv~TG~Lr~SI~~~~~~---~~~~~~V~~~~~Ya~yvE~GTg~~~~~~~~~~~~~~ 77 (116) T protein:vir:95 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKD---GGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKNI 77 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHhhCCccccccccceeEEeec---CcEEEEEecCCCccceeecCccccccCCCccccccc Confidence 88999999999999999999999999999999999876533 358899999999999999999999999999999988 Q ss_pred ecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHhHhhcc Q lcl|NC_011044. 98 FWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAADPDIHM 136 (137) Q Consensus 98 ~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~~~i~~ 136 (137) .|.+.....+.+.++||+|||||+||+++.+....++=- T Consensus 78 ~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~is 116 (116) T protein:vir:95 78 PWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred cceeecCccceeeCCCCCCCcchHHHHHHHHHHHHHhhC Confidence 887777777778899999999999999987764433333 No 23 >protein:vir:97327 Length: 116 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240615;genbank:gi:66396305;genbank:GeneID:5133683 Probab=99.93 E-value=7.4e-30 Score=180.30 Aligned_cols=116 Identities=19% Similarity=0.113 Sum_probs=98.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccchhhhhcCCCCCccccccCCccee Q lcl|NC_011044. 18 SGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAAAVHEGSRPHRIVARHAQALHF 97 (137) Q Consensus 18 ~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~vE~GT~ph~i~pk~~k~l~~ 97 (137) +++++++++.+++..++++||.++|||||+|++||...... .++++.|+++++||+||||||++|.++|+..+.+.+ T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~aPv~TG~Lr~SI~~~~~~---~~~~~~V~~~~~YA~yvE~GTg~~~~~~~~~~~~~~ 77 (116) T protein:vir:97 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKD---GGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKI 77 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcCcccccccceEEeec---CcEEEEEecCCCcccccccCCcccccCCCccccccc Confidence 88999999999999999999999999999999999865532 358899999999999999999999999998888877 Q ss_pred ecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHhHhhcc Q lcl|NC_011044. 98 FWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAADPDIHM 136 (137) Q Consensus 98 ~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~~~i~~ 136 (137) .|.......+.+.++||+|||||+||+++.+....++=- T Consensus 78 ~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 116 (116) T protein:vir:97 78 PWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred ceeeecCCceeeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 776555555666788999999999999987765433322 No 24 >protein:vir:1243 Length: 116 # NCBI annotation: similar to phage Spp1 gp16.1 # Family: family:all:180 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510942;genbank:gi:17426276;genbank:GeneID:927389 Probab=99.93 E-value=7.4e-30 Score=180.30 Aligned_cols=116 Identities=19% Similarity=0.113 Sum_probs=98.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccchhhhhcCCCCCccccccCCccee Q lcl|NC_011044. 18 SGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAAAVHEGSRPHRIVARHAQALHF 97 (137) Q Consensus 18 ~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~vE~GT~ph~i~pk~~k~l~~ 97 (137) +++++++++.+++..++++||.++|||||+|++||...... .++++.|+++++||+||||||++|.++|+..+.+.+ T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~aPv~TG~Lr~SI~~~~~~---~~~~~~V~~~~~YA~yvE~GTg~~~~~~~~~~~~~~ 77 (116) T protein:vir:12 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKD---GGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKI 77 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcCcccccccceEEeec---CcEEEEEecCCCcccccccCCcccccCCCccccccc Confidence 88999999999999999999999999999999999865532 358899999999999999999999999998888877 Q ss_pred ecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHhHhhcc Q lcl|NC_011044. 98 FWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAADPDIHM 136 (137) Q Consensus 98 ~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~~~i~~ 136 (137) .|.......+.+.++||+|||||+||+++.+....++=- T Consensus 78 ~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 116 (116) T protein:vir:12 78 PWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred ceeeecCCceeeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 776555555666788999999999999987765433322 No 25 >protein:vir:106570 Length: 182 # NCBI annotation: putative protein # Family: family:all:6475 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958588;genbank:gi:41179258;genbank:GeneID:2717106 Probab=99.93 E-value=2.1e-29 Score=177.82 Aligned_cols=136 Identities=15% Similarity=0.062 Sum_probs=97.2 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccchh Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKH----RSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAA 76 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~----~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~ 76 (137) +.+..+|.++|+.+++.+++.+++++ ++++..++++||.++|||||+|++||++++... +.++++.|+++++||+ T Consensus 6 i~Gld~L~~kl~~~~~~~~~~v~~a~~~~~~~~a~~v~~~ak~~~PvdtG~Lr~SI~~~~~~~-~~~~~g~V~~~~~ya~ 84 (182) T protein:vir:10 6 LKGVNELRAKLKKLPDIMAKATANAQENAIEQAEAYAVDELQSSIKYSTGELTRSFKHEVKVD-GDEVIGRWWNSSMVAV 84 (182) T ss_pred EecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCchhhhhceeeeeeec-CCeEEEEeecCCCccc Confidence 44568899999999887776665555 566677788999999999999999999877554 5579999999999999 Q ss_pred hhhcCCCCCc----------cccccCCcceeecCC--------e--eEE-----eeeEecCCCCCCchhhhhHHHHHHHh Q lcl|NC_011044. 77 AVHEGSRPHR----------IVARHAQALHFFWHG--------R--EIF-----RKSVWHPGVRSRPFLRNAAQRIAAAD 131 (137) Q Consensus 77 ~vE~GT~ph~----------i~pk~~k~l~~~~~g--------~--~~~-----~k~V~~pG~~a~pfl~~A~~~~~~~~ 131 (137) ||||||+||. +.|...+.-||.... . +.. ......+||+|||||+||+++++... T Consensus 85 yvE~GTG~~~~~~~~~~~p~~~~~~~~~~w~~~~~~v~~~~a~~~~~~~~~~~~~~~~~t~G~~aqPFl~pA~~~~~~~i 164 (182) T protein:vir:10 85 FREFGTGLVGERSHKQLPKNVAIIYRQTPWFFPVDSVDLDLTKIYGIPKIKINGKYFYRTTGQPARQFMTPAANKMAKEA 164 (182) T ss_pred eeecCcccccccCccccCccceeeeecCCceeeccccccccccccccceeeecCceEeecCCCCCCcchHHHHHHhHHHH Confidence 9999999874 233334433331110 0 000 11134589999999999999987744 Q ss_pred H-hhccC Q lcl|NC_011044. 132 P-DIHMT 137 (137) Q Consensus 132 ~-~i~~~ 137 (137) . +|+.. T Consensus 165 ~~~i~~~ 171 (182) T protein:vir:10 165 PEIIKRS 171 (182) T ss_pred HHHHHHH Confidence 3 33333 No 26 >protein:vir:78077 Length: 141 # NCBI annotation: gp9 # Family: family:all:180 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468793;genbank:gi:157325374;genbank:GeneID:5601839 Probab=99.92 E-value=2.2e-28 Score=172.19 Aligned_cols=128 Identities=13% Similarity=0.049 Sum_probs=99.9 Q ss_pred ceehhhhhhHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccchh Q lcl|NC_011044. 2 PVTARIHINEPELERQSGAIFRGKHRSL-----TRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAA 76 (137) Q Consensus 2 sv~~~l~~~~~~l~~~~~~~~~~~~~~~-----a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~ 76 (137) =-..+|+.+..++.+++...+.+.++.+ ++.+++.|+.++|||||+|++||...+... +..+.|+++++||+ T Consensus 1 ~~~~~f~~~~~~~~~~~~k~~~~~~~~~a~~~~~~~ie~~ak~~~pvdtG~L~~SI~~~v~~~---g~~~~V~~~~~YA~ 77 (141) T protein:vir:78 1 MNEFEFDSNIPKARKLIEKKVLQALEDIGEHMTTELAEGGHGVTSNNDTGEYAQKSGYKVRKS---SKEVIVGNSSDYAI 77 (141) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccchhhcceeeeeecC---CcEEEEecCCCccc Confidence 2245777777777777766666655544 455789999999999999999998776443 35678999999999 Q ss_pred hhhcCCCCCccccccCCcceeecC--CeeEEeeeEecCCCCCCchhhhhHHHHHHHhHhhccC Q lcl|NC_011044. 77 AVHEGSRPHRIVARHAQALHFFWH--GREIFRKSVWHPGVRSRPFLRNAAQRIAAADPDIHMT 137 (137) Q Consensus 77 ~vE~GT~ph~i~pk~~k~l~~~~~--g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~~~i~~~ 137 (137) ||||||++|...|..++..||+.. |.|++ +.|++|||||+||+++..+..++|=+- T Consensus 78 yVE~GTG~~~~~~~grk~~w~y~~~~g~~~~-----t~G~~aqpFl~~A~~~~~~~i~~~i~~ 135 (141) T protein:vir:78 78 YYEFGTGEKSERGGGKAGGWFYMDKKGHWHF-----TRGSQASKRMRYTFRDEQDKVRVFTER 135 (141) T ss_pred eeecCCcccccCCCCCcCcceeecCCCeeEe-----ccCCCCchhhhhhHHhhHHHHHHHHHH Confidence 999999999999998888888763 55554 459999999999999887765554433 No 27 >protein:vir:9930 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795692;genbank:gi:28876456;genbank:GeneID:1257995 Probab=99.84 E-value=3.3e-24 Score=149.33 Aligned_cols=104 Identities=20% Similarity=0.160 Sum_probs=91.5 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccchhhhhc Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAAAVHE 80 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~vE~ 80 (137) ++|..+|...|+.+.+.+...+++++.+++..++++||.++|||||+|++||.+... .+..+.|+++++||+|||| T Consensus 1 i~Gld~l~~~l~~~~~~~~~~v~~al~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~~----~~~~~~v~~~~~Ya~~vE~ 76 (108) T protein:vir:99 1 MRGLDRFLRSVERKQKSVRIAVDKELSKSAARIERQAKILAPVDTGWLRAQIYSEQQ----RLLHYRVVSPALYSIYLEL 76 (108) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCchhhhcceeeeec----CcEEEEeecCcccchhccc Confidence 999999999999999999999999999999999999999999999999999986542 2357889999999999999 Q ss_pred CCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHH-hHhhccC Q lcl|NC_011044. 81 GSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAA-DPDIHMT 137 (137) Q Consensus 81 GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~-~~~i~~~ 137 (137) ||+ .++|||||+||++.+.+. +.+|++. T Consensus 77 GT~-----------------------------~m~a~Pf~~pa~~~~~~~~~~~i~~~ 105 (108) T protein:vir:99 77 GTR-----------------------------KMEAQSFLDPALRKEWPVLMANIKKM 105 (108) T ss_pred Ccc-----------------------------ccCCCcchhhhHHHHHHHHHHHHHHH Confidence 984 367999999999988664 5555555 No 28 >protein:vir:94538 Length: 125 # NCBI annotation: putative head to tail joining # Family: family:all:180 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223893;genbank:gi:62327105;genbank:GeneID:5075554 Probab=99.82 E-value=9.9e-24 Score=146.70 Aligned_cols=108 Identities=12% Similarity=0.100 Sum_probs=90.3 Q ss_pred Cc----eehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccchh Q lcl|NC_011044. 1 MP----VTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAA 76 (137) Q Consensus 1 ms----v~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~ 76 (137) || |..+|..+|+.+.+.+...+.+++...+..++++++.++|+|||+|++||....+...+.++++.|+++++||+ T Consensus 5 ~~i~~~Gld~l~~~L~~~~~~~~~~v~~al~~~a~~i~~~ak~~ap~~tG~L~~sI~~~~~~~~~~~~~~~v~~~~~Ya~ 84 (125) T protein:vir:94 5 FNIKFKGVDKLLDEFDISRKELVPYSVEAMKTSLSRAVEKSKGLARVDTGYMRNNIQQDEVKEEHGVVTGRYVARADYSS 84 (125) T ss_pred eeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCChhhhhhceecceeccCCcEEEEeeCCCCccc Confidence 44 45788899999999988999999999999999999999999999999999876666666789999999999999 Q ss_pred hhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHH-hHhhccC Q lcl|NC_011044. 77 AVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAA-DPDIHMT 137 (137) Q Consensus 77 ~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~-~~~i~~~ 137 (137) ||||||. .++|||||+||++.+.+. ...|.+. T Consensus 85 ~vEfGT~-----------------------------~~~a~Pfl~pa~~~~~~~~~~~l~~~ 117 (125) T protein:vir:94 85 YNEYGTY-----------------------------RMSAQPFMAPSVAAMTPFFYKAVRDA 117 (125) T ss_pred eeecccc-----------------------------cCCCCcccchhHHHHHHHHHHHHHHH Confidence 9999984 267999999999976542 2222222 No 29 >protein:vir:95789 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950593;genbank:gi:119953788;genbank:GeneID:5076859 Probab=99.82 E-value=1.1e-23 Score=146.49 Aligned_cols=103 Identities=13% Similarity=0.104 Sum_probs=87.5 Q ss_pred Cce----ehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccchh Q lcl|NC_011044. 1 MPV----TARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAA 76 (137) Q Consensus 1 msv----~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~ 76 (137) ||+ ..+|...|+.+.+.+...+++++.+.+..++++|+.++|+|||+||+||.+.. .+.++.|+++++||+ T Consensus 1 msi~i~Gld~l~~~l~~~~~~~~~~v~~al~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~-----~g~~~~V~~~~~Ya~ 75 (114) T protein:vir:95 1 MAIKWQGIEKLVATISNAQPKAVEQSLQVLKNNGEKGKRIAKQLAPKDTEFLKDHITTSY-----PGMEAHIHGEAGYDG 75 (114) T ss_pred CeeeeehHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCchhhhhceeeec-----CceEEEeecCCCccc Confidence 665 57788888999998888899999999999999999999999999999997542 356889999999999 Q ss_pred hhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHH-hHhhccC Q lcl|NC_011044. 77 AVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAA-DPDIHMT 137 (137) Q Consensus 77 ~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~-~~~i~~~ 137 (137) ||||||+ .++|||||+||++.+++. ...|++. T Consensus 76 yvE~GT~-----------------------------~~~aqPfl~pa~~~~~~~~~~~l~~~ 108 (114) T protein:vir:95 76 YQEYGTR-----------------------------FQPGTPHFRPMMEQIQPQFQKDMTDV 108 (114) T ss_pred eeecCcc-----------------------------ccCCCccchhhHHHHHHHHHHHHHHH Confidence 9999994 267999999999988774 4444444 No 30 >protein:vir:98409 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:YP_001210363;genbank:gi:146334932;genbank:GeneID:5114801 Probab=99.80 E-value=5.8e-23 Score=142.50 Aligned_cols=103 Identities=17% Similarity=0.160 Sum_probs=83.6 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccchhhhhc Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAAAVHE 80 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~vE~ 80 (137) ++|..+|...|+.+. ....+++++++++..++++|+.++|||||+|++||..... ..++++.|+++++||+|||| T Consensus 3 i~Gld~l~~~l~~~~--~~~~~~~al~~~a~~i~~~ak~~apvdTG~Lr~si~~~~~---~~~~~~~V~~~~~Ya~~vE~ 77 (108) T protein:vir:98 3 ITGIDALQKKLRKNA--TLNDVKHVVKRNTVSMNKNMQNLAPVDTGNMKRSITSEFT---DGGLTGTTIPHTDYAGYVEY 77 (108) T ss_pred chhHHHHHHHHHHhh--hHHHHHHHHHHHHHHHHHHHHHhCCCCchhhHhhceeeee---cCceEEEeecCCCccceeec Confidence 888888888887654 3456788999999999999999999999999999975432 23578999999999999999 Q ss_pred CCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHH-HhHhhccC Q lcl|NC_011044. 81 GSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAA-ADPDIHMT 137 (137) Q Consensus 81 GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~-~~~~i~~~ 137 (137) ||+ .++|||||+||++.+.. ...+|+.. T Consensus 78 GT~-----------------------------~m~aqPFl~pa~~~~~~~~~~~i~~~ 106 (108) T protein:vir:98 78 GTR-----------------------------FQAAQPFVKPAFDVQKKIFTNDLERL 106 (108) T ss_pred ccc-----------------------------ccCCCcchhhHHHHHHHHHHHHHHHH Confidence 995 26799999999998755 33444444 No 31 >protein:vir:743 Length: 108 # NCBI annotation: unknown # Family: family:all:180 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108720;genbank:gi:13487842;genbank:GeneID:920877 Probab=99.80 E-value=7.6e-23 Score=141.85 Aligned_cols=103 Identities=18% Similarity=0.179 Sum_probs=83.9 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccchhhhhc Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAAAVHE 80 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~vE~ 80 (137) ++|..+|...|+... ....+++++++++..++++|+.++|+|||+|++||..... ..+..+.|+++++||+|||| T Consensus 3 i~Gld~l~~~l~~~~--~~~~~~~al~~~a~~i~~~ak~~aPv~TG~Lr~si~~~~~---~~~~~~~V~~~~~Ya~~vE~ 77 (108) T protein:vir:74 3 ITGIDALQKKLRKNA--TLDDVKHVVKSNTASMNKNMQNLAPVDTGNMKRSITSEFT---DGGLSGTTGPHTDYAGYVEY 77 (108) T ss_pred chhHHHHHHHHHHhh--hHHHHHHHHHHHHHHHHHHHHHhCCCCchhhhccceeeee---cCceEEEeecCCCcccceec Confidence 777788888777653 3567889999999999999999999999999999986542 23578899999999999999 Q ss_pred CCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHH-HhHhhccC Q lcl|NC_011044. 81 GSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAA-ADPDIHMT 137 (137) Q Consensus 81 GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~-~~~~i~~~ 137 (137) ||+ .++|||||+||++.+.. ...+|.+. T Consensus 78 GT~-----------------------------km~aqpf~~pa~~~~~~~~~~~i~~~ 106 (108) T protein:vir:74 78 GTR-----------------------------FQSAQPFVKPAFNIQKKVFTNDLERL 106 (108) T ss_pred ccc-----------------------------ccCCCcchhhHHHHHHHHHHHHHHHH Confidence 994 26799999999998755 34444444 No 32 >protein:vir:99744 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004311;genbank:gi:122891765;genbank:GeneID:4712299 Probab=99.79 E-value=8.9e-23 Score=141.47 Aligned_cols=104 Identities=13% Similarity=0.158 Sum_probs=89.6 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CccchhhhccceeeeecccCcEEEEEEecCccc Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADV------PVRTGNLGRTVGELPQRYRPFHVDGGVEATADY 74 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~a------Pv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~Y 74 (137) ++|..+|.+.|+.+.+.+...+++++++.+..++++++.++ |+|||+|++||.... +.++.+.|+++++| T Consensus 3 i~Gld~L~~~l~~~~~~~~~~v~~av~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~SI~~~~----~g~~~~~V~~~~~Y 78 (115) T protein:vir:99 3 IDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKK----TVDLQYTITSHAAY 78 (115) T ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcchhhhhceeeee----cCcEEEEecCCccc Confidence 77789999999999999999999999999999999999997 999999999998653 23578899999999 Q ss_pred hhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHH-HhHhhccC Q lcl|NC_011044. 75 AAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAA-ADPDIHMT 137 (137) Q Consensus 75 A~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~-~~~~i~~~ 137 (137) |+||||||+ .++|||||+||++.+++ -+.+|++. T Consensus 79 a~~vE~GT~-----------------------------~m~a~PFl~PA~~~~k~~~~~~l~~~ 113 (115) T protein:vir:99 79 SGFLEFGTR-----------------------------YMEAEPFMWPVYEVIRKSTVEELKTL 113 (115) T ss_pred ccccccccc-----------------------------ccCCCCcchhhHHHHHHHHHHHHHHH Confidence 999999993 27799999999997755 34444444 No 33 >protein:vir:3617 Length: 112 # NCBI annotation: ORF40 # Family: family:all:180 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112703;genbank:gi:13786571;genbank:GeneID:921069 Probab=99.79 E-value=7.4e-23 Score=141.92 Aligned_cols=105 Identities=21% Similarity=0.214 Sum_probs=80.2 Q ss_pred Cceehhhh---hhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccchh Q lcl|NC_011044. 1 MPVTARIH---INEPELERQ-SGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAA 76 (137) Q Consensus 1 msv~~~l~---~~~~~l~~~-~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~ 76 (137) ||.+.+|+ .-++.|.+. ....+++++++++..++++++.++|||||+|++||.... ...+.++.|+++++||+ T Consensus 1 M~~~i~i~Gld~l~~~L~~~~~~~~~~~al~~~~~~i~~~ak~~aPvdTG~Lr~si~~~~---~~~~~~~~V~~~~~Ya~ 77 (112) T protein:vir:36 1 MKSSLSFKGIDQLVKHLDKAASLKGVQQVVKSNTSNMTANMQKLVPVDTGYMKRSIKMEL---TEGGFSGQAGPHTDYSA 77 (112) T ss_pred CceeeeehhHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhCCCCchhhhhceeeee---cCCceEEEeecCCCccc Confidence 76654433 223333332 236678899999999999999999999999999997543 23357899999999999 Q ss_pred hhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHH-hHhhccC Q lcl|NC_011044. 77 AVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAA-DPDIHMT 137 (137) Q Consensus 77 ~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~-~~~i~~~ 137 (137) ||||||+ .++|||||+||++.+++. ..+|++. T Consensus 78 ~vE~GT~-----------------------------k~~a~Pfl~pa~~~~~~~~~~~i~~~ 110 (112) T protein:vir:36 78 YVEYGTR-----------------------------FQSAQPFVKPAYNEQKGVFIKDLERL 110 (112) T ss_pred eeecccc-----------------------------ccCCCcchhhhHHHHHHHHHHHHHHH Confidence 9999994 267999999999988664 4445444 No 34 >protein:vir:106623 Length: 115 # NCBI annotation: ORF049 # Family: family:all:180 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239497;genbank:gi:66395260;genbank:GeneID:4555777 Probab=99.78 E-value=2.4e-22 Score=139.13 Aligned_cols=104 Identities=14% Similarity=0.083 Sum_probs=88.5 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CccchhhhccceeeeecccCcEEEEEEecCccc Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADV------PVRTGNLGRTVGELPQRYRPFHVDGGVEATADY 74 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~a------Pv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~Y 74 (137) +.|..+|...|+.+++.+...+++++++.+..++++++.++ |||||+|++||.... . .+..+.|+++++| T Consensus 3 i~Gld~L~~~l~~~~~~~~~~~~~al~~~~~~i~~~a~~~a~~~~~~pv~TG~Lr~sI~~~~--~--g~~~~~v~~~~~Y 78 (115) T protein:vir:10 3 SKGLKKLMNHLKVMHDDIEDDVDDILKNNAKEGVGIAVSNAKEVMNKGYWTGNLASLIEVKK--I--GDLHYRVISTAHY 78 (115) T ss_pred ehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcchhhhhceeeee--c--CcEEEEeeCCCcc Confidence 77789999999999999999999999999999999999997 889999999997542 2 3578899999999 Q ss_pred hhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHH-HhHhhccC Q lcl|NC_011044. 75 AAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAA-ADPDIHMT 137 (137) Q Consensus 75 A~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~-~~~~i~~~ 137 (137) |+||||||+ -++|||||+||++.+++ -+.+|++. T Consensus 79 a~~vEfGT~-----------------------------km~a~PFl~PA~~~~k~~~~~~i~~~ 113 (115) T protein:vir:10 79 SGFLEFGTR-----------------------------YMEPAPFMFPTYQTLKKSTINDLKRL 113 (115) T ss_pred chheecccc-----------------------------cCCCCCchhhhHHHHHHHHHHHHHHH Confidence 999999994 27799999999998755 33344444 No 35 >protein:vir:103917 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873996;genbank:gi:118430771;genbank:GeneID:4525409 Probab=99.78 E-value=3e-22 Score=138.55 Aligned_cols=104 Identities=13% Similarity=0.135 Sum_probs=88.0 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CccchhhhccceeeeecccCcEEEEEEecCccc Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADV------PVRTGNLGRTVGELPQRYRPFHVDGGVEATADY 74 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~a------Pv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~Y 74 (137) ++|..+|...|+.+++.+...+++++.+.+..++++++.++ |+|||+|++||.... . .+..+.|+++++| T Consensus 3 ~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~--~--g~~~~~v~~~~~Y 78 (115) T protein:vir:10 3 IDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKK--T--GDLQYTITSHAAY 78 (115) T ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeee--c--CceEEEeecCccc Confidence 77889999999999999999999999999999999999998 999999999998653 2 2467899999999 Q ss_pred hhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHH-HhHhhccC Q lcl|NC_011044. 75 AAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAA-ADPDIHMT 137 (137) Q Consensus 75 A~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~-~~~~i~~~ 137 (137) |+||||||+ -++|||||+||++.++. -...|++. T Consensus 79 a~~vE~GT~-----------------------------km~a~Pfl~PA~~~~~~~~~~~i~~~ 113 (115) T protein:vir:10 79 SGFLEFGTR-----------------------------YMEAEPFMWPVYEVIRKSTVEELKAL 113 (115) T ss_pred hhhhccccc-----------------------------ccCCCCchhhhHHHHHHHHHHHHHHH Confidence 999999994 16799999999997654 23333333 No 36 >protein:vir:96358 Length: 115 # NCBI annotation: ORF045 # Family: family:all:180 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239651;genbank:gi:66395408;genbank:GeneID:5132834 Probab=99.78 E-value=3e-22 Score=138.55 Aligned_cols=104 Identities=13% Similarity=0.135 Sum_probs=88.0 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CccchhhhccceeeeecccCcEEEEEEecCccc Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADV------PVRTGNLGRTVGELPQRYRPFHVDGGVEATADY 74 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~a------Pv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~Y 74 (137) ++|..+|...|+.+++.+...+++++.+.+..++++++.++ |+|||+|++||.... . .+..+.|+++++| T Consensus 3 ~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~--~--g~~~~~v~~~~~Y 78 (115) T protein:vir:96 3 IDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKK--T--GDLQYTITSHAAY 78 (115) T ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeee--c--CceEEEeecCccc Confidence 77889999999999999999999999999999999999998 999999999998653 2 2467899999999 Q ss_pred hhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHH-HhHhhccC Q lcl|NC_011044. 75 AAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAA-ADPDIHMT 137 (137) Q Consensus 75 A~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~-~~~~i~~~ 137 (137) |+||||||+ -++|||||+||++.++. -...|++. T Consensus 79 a~~vE~GT~-----------------------------km~a~Pfl~PA~~~~~~~~~~~i~~~ 113 (115) T protein:vir:96 79 SGFLEFGTR-----------------------------YMEAEPFMWPVYEVIRKSTVEELKAL 113 (115) T ss_pred hhhhccccc-----------------------------ccCCCCchhhhHHHHHHHHHHHHHHH Confidence 999999994 16799999999997654 23333333 No 37 >protein:vir:96225 Length: 115 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239574;genbank:gi:66395330;genbank:GeneID:5132773 Probab=99.78 E-value=3e-22 Score=138.55 Aligned_cols=104 Identities=13% Similarity=0.135 Sum_probs=88.0 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CccchhhhccceeeeecccCcEEEEEEecCccc Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADV------PVRTGNLGRTVGELPQRYRPFHVDGGVEATADY 74 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~a------Pv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~Y 74 (137) ++|..+|...|+.+++.+...+++++.+.+..++++++.++ |+|||+|++||.... . .+..+.|+++++| T Consensus 3 ~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~--~--g~~~~~v~~~~~Y 78 (115) T protein:vir:96 3 IDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKK--T--GDLQYTITSHAAY 78 (115) T ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeee--c--CceEEEeecCccc Confidence 77889999999999999999999999999999999999998 999999999998653 2 2467899999999 Q ss_pred hhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHH-HhHhhccC Q lcl|NC_011044. 75 AAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAA-ADPDIHMT 137 (137) Q Consensus 75 A~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~-~~~~i~~~ 137 (137) |+||||||+ -++|||||+||++.++. -...|++. T Consensus 79 a~~vE~GT~-----------------------------km~a~Pfl~PA~~~~~~~~~~~i~~~ 113 (115) T protein:vir:96 79 SGFLEFGTR-----------------------------YMEAEPFMWPVYEVIRKSTVEELKAL 113 (115) T ss_pred hhhhccccc-----------------------------ccCCCCchhhhHHHHHHHHHHHHHHH Confidence 999999994 16799999999997654 23333333 No 38 >protein:vir:9312 Length: 115 # NCBI annotation: phi Mu50B-like protein # Family: family:all:180 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803290;genbank:gi:29028600;genbank:GeneID:1258048 Probab=99.78 E-value=3e-22 Score=138.55 Aligned_cols=104 Identities=13% Similarity=0.135 Sum_probs=88.0 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CccchhhhccceeeeecccCcEEEEEEecCccc Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADV------PVRTGNLGRTVGELPQRYRPFHVDGGVEATADY 74 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~a------Pv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~Y 74 (137) ++|..+|...|+.+++.+...+++++.+.+..++++++.++ |+|||+|++||.... . .+..+.|+++++| T Consensus 3 ~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~--~--g~~~~~v~~~~~Y 78 (115) T protein:vir:93 3 IDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKK--T--GDLQYTITSHAAY 78 (115) T ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeee--c--CceEEEeecCccc Confidence 77889999999999999999999999999999999999998 999999999998653 2 2467899999999 Q ss_pred hhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHH-HhHhhccC Q lcl|NC_011044. 75 AAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAA-ADPDIHMT 137 (137) Q Consensus 75 A~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~-~~~~i~~~ 137 (137) |+||||||+ -++|||||+||++.++. -...|++. T Consensus 79 a~~vE~GT~-----------------------------km~a~Pfl~PA~~~~~~~~~~~i~~~ 113 (115) T protein:vir:93 79 SGFLEFGTR-----------------------------YMEAEPFMWPVYEVIRKSTVEELKAL 113 (115) T ss_pred hhhhccccc-----------------------------ccCCCCchhhhHHHHHHHHHHHHHHH Confidence 999999994 16799999999997654 23333333 No 39 >protein:vir:97144 Length: 115 # NCBI annotation: ORF047 # Family: family:all:180 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239729;genbank:gi:66394911;genbank:GeneID:5130877 Probab=99.78 E-value=3e-22 Score=138.55 Aligned_cols=104 Identities=13% Similarity=0.135 Sum_probs=88.0 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CccchhhhccceeeeecccCcEEEEEEecCccc Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADV------PVRTGNLGRTVGELPQRYRPFHVDGGVEATADY 74 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~a------Pv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~Y 74 (137) ++|..+|...|+.+++.+...+++++.+.+..++++++.++ |+|||+|++||.... . .+..+.|+++++| T Consensus 3 ~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~--~--g~~~~~v~~~~~Y 78 (115) T protein:vir:97 3 IDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKK--T--GDLQYTITSHAAY 78 (115) T ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeee--c--CceEEEeecCccc Confidence 77889999999999999999999999999999999999998 999999999998653 2 2467899999999 Q ss_pred hhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHH-HhHhhccC Q lcl|NC_011044. 75 AAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAA-ADPDIHMT 137 (137) Q Consensus 75 A~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~-~~~~i~~~ 137 (137) |+||||||+ -++|||||+||++.++. -...|++. T Consensus 79 a~~vE~GT~-----------------------------km~a~Pfl~PA~~~~~~~~~~~i~~~ 113 (115) T protein:vir:97 79 SGFLEFGTR-----------------------------YMEAEPFMWPVYEVIRKSTVEELKAL 113 (115) T ss_pred hhhhccccc-----------------------------ccCCCCchhhhHHHHHHHHHHHHHHH Confidence 999999994 16799999999997654 23333333 No 40 >protein:vir:78858 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285365;genbank:gi:148717893;genbank:GeneID:5246989 Probab=99.78 E-value=3e-22 Score=138.55 Aligned_cols=104 Identities=13% Similarity=0.135 Sum_probs=88.0 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CccchhhhccceeeeecccCcEEEEEEecCccc Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADV------PVRTGNLGRTVGELPQRYRPFHVDGGVEATADY 74 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~a------Pv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~Y 74 (137) ++|..+|...|+.+++.+...+++++.+.+..++++++.++ |+|||+|++||.... . .+..+.|+++++| T Consensus 3 ~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~--~--g~~~~~v~~~~~Y 78 (115) T protein:vir:78 3 IDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKK--T--GDLQYTITSHAAY 78 (115) T ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeee--c--CceEEEeecCccc Confidence 77889999999999999999999999999999999999998 999999999998653 2 2467899999999 Q ss_pred hhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHH-HhHhhccC Q lcl|NC_011044. 75 AAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAA-ADPDIHMT 137 (137) Q Consensus 75 A~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~-~~~~i~~~ 137 (137) |+||||||+ -++|||||+||++.++. -...|++. T Consensus 79 a~~vE~GT~-----------------------------km~a~Pfl~PA~~~~~~~~~~~i~~~ 113 (115) T protein:vir:78 79 SGFLEFGTR-----------------------------YMEAEPFMWPVYEVIRKSTVEELKAL 113 (115) T ss_pred hhhhccccc-----------------------------ccCCCCchhhhHHHHHHHHHHHHHHH Confidence 999999994 16799999999997654 23333333 No 41 >protein:vir:96486 Length: 112 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238496;genbank:gi:66391772;genbank:GeneID:5176908 Probab=99.77 E-value=3.2e-22 Score=138.44 Aligned_cols=103 Identities=16% Similarity=0.187 Sum_probs=85.1 Q ss_pred CceehhhhhhHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccchhhh Q lcl|NC_011044. 1 MPVTARIHINEPEL--ERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAAAV 78 (137) Q Consensus 1 msv~~~l~~~~~~l--~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~v 78 (137) +.|..+|.++|+.+ .+.++.++++.+.+.+..+++.|+.++|||||+|++||... ..+.++.|+++++||+|| T Consensus 6 i~Gld~L~~~l~~~~~~~~v~~~v~~~~~~~~~~~~~~a~~~apvdTG~Lr~sI~~~-----~~~~~~~v~~~~~Ya~~v 80 (112) T protein:vir:96 6 FEGLDEMAQSLLKNASSERRSKVLRKYGAKLKEAAVSKAQFKKGYSTGATRRSITLE-----AGSDRAVVEALTNYSGYL 80 (112) T ss_pred ehHHHHHHHHHHhhcCHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCchhhhhceeee-----cCceEEEecCCCCcccee Confidence 55678888888877 45788889999999999999999999999999999999743 235688999999999999 Q ss_pred hcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHH-HhHhhccC Q lcl|NC_011044. 79 HEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAA-ADPDIHMT 137 (137) Q Consensus 79 E~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~-~~~~i~~~ 137 (137) ||||+ -++|||||+||++..++ -..+|+.. T Consensus 81 E~GTr-----------------------------~m~AqPF~~PA~~~~~~~~~~~l~~L 111 (112) T protein:vir:96 81 EVGTR-----------------------------KMEAQPFMRPALDQVVPEMVEEMAKW 111 (112) T ss_pred ccCcc-----------------------------ccCCCCchhhhHHHHHHHHHHHHHhc Confidence 99994 16799999999998755 34444444 No 42 >protein:vir:105467 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529877;genbank:gi:90592617;genbank:GeneID:3974531 Probab=99.75 E-value=1e-21 Score=135.69 Aligned_cols=123 Identities=17% Similarity=0.142 Sum_probs=88.4 Q ss_pred Cc-------eehhhhhhHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecC Q lcl|NC_011044. 1 MP-------VTARIHINEPELER--QSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEAT 71 (137) Q Consensus 1 ms-------v~~~l~~~~~~l~~--~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~ 71 (137) || +..+|..+|+.+.. .+.+.+++.+++++..++++++.++|||||+||+||....+...+.++++.|+++ T Consensus 1 Ms~~~id~~gl~~~~~~l~~~~~~~~~~~~~~~~l~~~~~~~~~~vk~~tPVdTG~Lr~S~~~~~~~~~~~~~~~~V~n~ 80 (144) T protein:vir:10 1 MSLGHVDDAQFQQFASRVRQKIDSGYVKQELGKSSRRIGTQSLRILEANTPVKQGNLRRSWTAEGPTYGCGGWTIKLINN 80 (144) T ss_pred CCCCCccHHHHHHHHHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHhCCCCcchhccceeecceeeecCeeEEEEecC Confidence 44 33445555555433 3567888999999999999999999999999999998776666677889999999 Q ss_pred ccchhhhhcCCCCCc--cccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHhHhhccC Q lcl|NC_011044. 72 ADYAAAVHEGSRPHR--IVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAADPDIHMT 137 (137) Q Consensus 72 ~~YA~~vE~GT~ph~--i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~~~i~~~ 137 (137) ++||+||||||+... ..|...+.+.. +. .++++||.+|.++...+.+.+=+- T Consensus 81 ~~YA~~VE~Ghr~~~G~~v~~~~~~~~~---------g~-----V~G~~~~~~a~~~~~~~~~~~l~k 134 (144) T protein:vir:10 81 AEYASYVESGHRQTPGRYVPVLKKRLVR---------DW-----VPGQFYMKKSIPQIQRQLPQLVTE 134 (144) T ss_pred CCcccccccceeecCCcccccCCCcccc---------ce-----ecCccchHHHHHHHHHHHHHHHHH Confidence 999999999996431 12322222211 11 236889999998877765554443 No 43 >protein:vir:97088 Length: 157 # NCBI annotation: hypothetical protein # Family: family:all:2714 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453568;genbank:gi:84662603;genbank:GeneID:5142503 Probab=99.74 E-value=3.9e-21 Score=132.45 Aligned_cols=132 Identities=16% Similarity=0.055 Sum_probs=91.5 Q ss_pred Cceeh------hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeec--ccCcEEEEEEec-- Q lcl|NC_011044. 1 MPVTA------RIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQR--YRPFHVDGGVEA-- 70 (137) Q Consensus 1 msv~~------~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~--~~~~~~~~~v~~-- 70 (137) |||+- .|...++.|.+..+.+++.++.+.|+.+.++|+.++|++||+|++||...... +.+...+..|+. T Consensus 1 m~~~~~~~d~s~l~~~l~~l~~~~~~v~R~A~~~ga~vv~dear~~aP~~tG~LkksI~~~~~~~~s~~g~~~~~Vg~~~ 80 (157) T protein:vir:97 1 MKFSIRSVDITGILAGLETVVEHSSDVVRTMTYESAVAVRESAKAFVNDETGKLRNNLYVAYSPEESVEGIQTYAVSWRK 80 (157) T ss_pred CeeEeecccHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhheeeeeccccCCCceEEEEEeecC Confidence 88853 56677788888888899999999999999999999999999999999764322 222333333433 Q ss_pred -CccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHhH-h--------hccC Q lcl|NC_011044. 71 -TADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAADP-D--------IHMT 137 (137) Q Consensus 71 -~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~~-~--------i~~~ 137 (137) ..+|++|||||+..+.+ +...+...|+|.+.. ..+..++||+|||+||+|..+++.. . |.+. T Consensus 81 ~~a~~g~~vEfG~~~~~~-~~~~~~~~~~~~~~~----~~t~~~~Pa~PFlRPA~d~~k~~a~~~~~~~l~k~I~e~ 152 (157) T protein:vir:97 81 KAAPHGHLLEFGHWQTHA-AYRDKDGQWYSSKVK----LVNPKWIPAKPFLRPGYDSVAMQIPDIARAAGAKKYAEL 152 (157) T ss_pred CccceeeeeecCcccccc-cccCCcccccccccc----cCCCCcCCCCcccchHHHHhHHHHHHHHHHHHHHHHHHH Confidence 57899999999865443 333334445555432 2223457799999999998755332 2 2222 No 44 >protein:vir:2740 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695113;genbank:gi:23455882;genbank:GeneID:955595 Probab=99.73 E-value=2.2e-21 Score=133.86 Aligned_cols=103 Identities=17% Similarity=0.206 Sum_probs=80.9 Q ss_pred CceehhhhhhHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccchhhh Q lcl|NC_011044. 1 MPVTARIHINEPEL--ERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAAAV 78 (137) Q Consensus 1 msv~~~l~~~~~~l--~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~v 78 (137) ++|..+|..+|+.+ .+.+++++++.+..++..+++.|+.++|++||+|++||...... .+ +.|+++++||+|| T Consensus 6 ~~Gld~l~~~L~~~~~~~~v~~~~~~~~~~~~~~~~~~a~~~~p~~TG~Lr~sI~~~~~~---~~--~~V~~~~~Ya~~v 80 (114) T protein:vir:27 6 FEGLDEMAQSLLKNASPEKRSKVLRKYGSKLKEAAVNRAQFNKGYSTGATRRSITLQVES---DK--ATVEALTSYSGYL 80 (114) T ss_pred eehHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcccCCCCCchhhhhceeeeecC---Ce--eEecCCCCcccee Confidence 56678888888776 45677888888888888888888889999999999999865422 22 4689999999999 Q ss_pred hcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHH----HhHhhccC Q lcl|NC_011044. 79 HEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAA----ADPDIHMT 137 (137) Q Consensus 79 E~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~----~~~~i~~~ 137 (137) ||||+ -++|||||+||++.+++ .+.+|-+| T Consensus 81 EfGT~-----------------------------km~a~Pfl~PA~~~~~~~~~~~l~~l~k~ 114 (114) T protein:vir:27 81 EVGTR-----------------------------KMEAQPFMKPALDEVAPKMVEELAKWDET 114 (114) T ss_pred ccccc-----------------------------ccCCCCchhhhHHHHHHHHHHHHHHHhcC Confidence 99994 25699999999998755 34444455 No 45 >protein:vir:4906 Length: 114 # NCBI annotation: gp114 # Family: family:all:180 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056684;genbank:gi:9635019;genbank:GeneID:1262668 Probab=99.73 E-value=2.2e-21 Score=133.86 Aligned_cols=103 Identities=17% Similarity=0.206 Sum_probs=80.9 Q ss_pred CceehhhhhhHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccchhhh Q lcl|NC_011044. 1 MPVTARIHINEPEL--ERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAAAV 78 (137) Q Consensus 1 msv~~~l~~~~~~l--~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~v 78 (137) ++|..+|..+|+.+ .+.+++++++.+..++..+++.|+.++|++||+|++||...... .+ +.|+++++||+|| T Consensus 6 ~~Gld~l~~~L~~~~~~~~v~~~~~~~~~~~~~~~~~~a~~~~p~~TG~Lr~sI~~~~~~---~~--~~V~~~~~Ya~~v 80 (114) T protein:vir:49 6 FEGLDEMAQSLLKNASPEKRSKVLRKYGSKLKEAAVNRAQFNKGYSTGATRRSITLQVES---DK--ATVEALTSYSGYL 80 (114) T ss_pred eehHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcccCCCCCchhhhhceeeeecC---Ce--eEecCCCCcccee Confidence 56678888888776 45677888888888888888888889999999999999865422 22 4689999999999 Q ss_pred hcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHH----HhHhhccC Q lcl|NC_011044. 79 HEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAA----ADPDIHMT 137 (137) Q Consensus 79 E~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~----~~~~i~~~ 137 (137) ||||+ -++|||||+||++.+++ .+.+|-+| T Consensus 81 EfGT~-----------------------------km~a~Pfl~PA~~~~~~~~~~~l~~l~k~ 114 (114) T protein:vir:49 81 EVGTR-----------------------------KMEAQPFMKPALDEVAPKMVEELAKWDET 114 (114) T ss_pred ccccc-----------------------------ccCCCCchhhhHHHHHHHHHHHHHHHhcC Confidence 99994 25699999999998755 34444455 No 46 >protein:vir:100075 Length: 140 # NCBI annotation: gp9 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945039;genbank:gi:38707899;genbank:GeneID:2744122 Probab=99.70 E-value=2.3e-20 Score=128.27 Aligned_cols=108 Identities=19% Similarity=0.217 Sum_probs=82.9 Q ss_pred CceehhhhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEe---------- Q lcl|NC_011044. 1 MPVTARIHINEPELERQSG-AIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVE---------- 69 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~-~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~---------- 69 (137) +.|..+|...|+.|.+.+. +++++++..++..++++||.++|++||+|++||...............++ T Consensus 6 i~Gld~l~~~l~~L~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~tG~l~~sI~~~~~~~~~~~~~~~~g~~~~~~~~~~ 85 (140) T protein:vir:10 6 IIGLADLRADFEKLAKSQSTKALRRATVAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGLATAGVRVRTKGKAD 85 (140) T ss_pred ehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHHhccccccccccccceEEeeeeeccccccC Confidence 6677889999999988774 68899999999999999999999999999999976443332222222222 Q ss_pred --cCccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHH-HhHhhccC Q lcl|NC_011044. 70 --ATADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAA-ADPDIHMT 137 (137) Q Consensus 70 --~~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~-~~~~i~~~ 137 (137) ++..|+.|+||||. .++|||||+||++.+.+ ..+.|.+. T Consensus 86 ~~~~~~y~~f~E~GT~-----------------------------~~~a~PFl~pA~~~~~~~~~~~~~~~ 127 (140) T protein:vir:10 86 SPNNAFYWRFDEFGTQ-----------------------------HMKAQPFMRPAFDASIGEAEGAIRTE 127 (140) T ss_pred CCCccceeeeeccCCC-----------------------------CCCCCcchhhhHHHHHHHHHHHHHHH Confidence 55789999999983 47799999999998766 33333333 No 47 >protein:vir:100243 Length: 140 # NCBI annotation: gp72 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355408;genbank:gi:77864698;genbank:GeneID:3725965 Probab=99.68 E-value=6.9e-20 Score=125.65 Aligned_cols=108 Identities=24% Similarity=0.285 Sum_probs=83.3 Q ss_pred CceehhhhhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecc--cCcEEEEEEe-------- Q lcl|NC_011044. 1 MPVTARIHINEPELERQS-GAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRY--RPFHVDGGVE-------- 69 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~-~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~--~~~~~~~~v~-------- 69 (137) +++..+|...|+.|.+.. .+++++++...+..+++++|.++|++||+|++||....... ......+.++ T Consensus 6 i~Gld~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~~~~~~~~~~~~~ 85 (140) T protein:vir:10 6 ILGLADLQADFLKLAKAQSTKALRRATVAGANVIRDEARARAPKKTGKLKRNIVTAALKQKDSPGIATAGVRVRTKGKAD 85 (140) T ss_pred ehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHHhceecccccccccceeEEeeccccccccC Confidence 667789999999998876 46889999999999999999999999999999997643322 2222333332 Q ss_pred --cCccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHH-HhHhhccC Q lcl|NC_011044. 70 --ATADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAA-ADPDIHMT 137 (137) Q Consensus 70 --~~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~-~~~~i~~~ 137 (137) ++..|+.|+||||.. ++|+|||+||++.+++ ..+.|.+. T Consensus 86 ~~~~~~y~~f~E~GT~~-----------------------------~~a~PFl~pA~~~~~~~~~~~~~~~ 127 (140) T protein:vir:10 86 SPNNAFYWRFVELGTQF-----------------------------MKAEPFMRPAFDASIAQAEGAIRTE 127 (140) T ss_pred CCCcccccceeccCcCC-----------------------------CCCCcchhhhHHHHHHHHHHHHHHH Confidence 457899999999842 5799999999998865 44444444 No 48 >protein:vir:1437 Length: 140 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536366;genbank:gi:17975171;genbank:GeneID:929147 Probab=99.66 E-value=1.8e-19 Score=123.32 Aligned_cols=108 Identities=19% Similarity=0.209 Sum_probs=83.2 Q ss_pred CceehhhhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEe---------- Q lcl|NC_011044. 1 MPVTARIHINEPELERQSG-AIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVE---------- 69 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~-~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~---------- 69 (137) +.|..+|...|+.|++... +++++++...+..++++++.++|++||+|++||...............|+ T Consensus 6 i~Gld~l~~~l~~l~~~~~~~~~~~al~~~a~~v~~~ak~~aP~~tG~l~~sI~~~~~~~~~~~~~~~vg~~~~~~~~~~ 85 (140) T protein:vir:14 6 IIGLADLRADFEKLAKSQSAKALRRATLAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGLATAGVRVRTKGKAD 85 (140) T ss_pred ehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHhhcccccccccccceeEEeeeeeccccccC Confidence 6667889999999988765 57799999999999999999999999999999987544332222222222 Q ss_pred --cCccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHH-hHhhccC Q lcl|NC_011044. 70 --ATADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAA-DPDIHMT 137 (137) Q Consensus 70 --~~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~-~~~i~~~ 137 (137) .+..|+.|+||||. .++|||||+||++.+.++ .+.|.+. T Consensus 86 ~~~~~~y~~f~E~GT~-----------------------------~~~a~pFl~pa~~~~~~~~~~~~~~~ 127 (140) T protein:vir:14 86 SPNNAFYWRFDEFGTQ-----------------------------HMKAQPFMRPAFDASIGEAEGAIRTE 127 (140) T ss_pred CCCccceeeeeccccC-----------------------------CCCCCcchhHHHHHHHHHHHHHHHHH Confidence 45789999999983 367999999999988653 3344444 No 49 >protein:vir:80362 Length: 140 # NCBI annotation: gp10, phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111089;genbank:gi:134288660;genbank:GeneID:4960609 Probab=99.65 E-value=2.2e-19 Score=122.88 Aligned_cols=108 Identities=19% Similarity=0.215 Sum_probs=81.6 Q ss_pred CceehhhhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEE----------- Q lcl|NC_011044. 1 MPVTARIHINEPELERQSG-AIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGV----------- 68 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~-~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v----------- 68 (137) ++|..+|...|+.|.+.+. +++++++...+..++++|+.++|++||+|++||.............+.+ T Consensus 6 i~Gld~l~~~l~~l~~~~~~k~~~~a~~~~a~~v~~~ak~~aP~~tG~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 85 (140) T protein:vir:80 6 IVGLADLLADFERLAKSQSTKALRRATVAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGLATAGVRVRTKGKAD 85 (140) T ss_pred ehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhceeeeccccccccceeeeeeecccccccC Confidence 6667889999999987764 5789999999999999999999999999999997644332211111122 Q ss_pred -ecCccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHH-HhHhhccC Q lcl|NC_011044. 69 -EATADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAA-ADPDIHMT 137 (137) Q Consensus 69 -~~~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~-~~~~i~~~ 137 (137) .++..|+.|+||||. .++|||||+||++.+.+ ..+.|.+. T Consensus 86 ~~~~~~y~~f~E~GT~-----------------------------~~~a~PFl~pA~~~~~~~~~~~~~~~ 127 (140) T protein:vir:80 86 SPSNAFYWRFDEFGTQ-----------------------------HMKAQPFMRPAFDASIGEAEGAIRTE 127 (140) T ss_pred CCCCcceeeeeccCCC-----------------------------CCCCCcchhhhHHHHHHHHHHHHHHH Confidence 245789999999984 25699999999998866 33444444 No 50 >protein:vir:79034 Length: 141 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110729;genbank:gi:134287346;genbank:GeneID:4955208 Probab=99.62 E-value=8.1e-19 Score=119.76 Aligned_cols=114 Identities=19% Similarity=0.201 Sum_probs=82.5 Q ss_pred Cc--------eehhhhhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceee------eecccCcEEE Q lcl|NC_011044. 1 MP--------VTARIHINEPELER-QSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGEL------PQRYRPFHVD 65 (137) Q Consensus 1 ms--------v~~~l~~~~~~l~~-~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~------~~~~~~~~~~ 65 (137) || +..+|..+|+.+.+ .+.+.+++++++++..+++.++.++|||||+||+||... .+...+.+++ T Consensus 1 M~~~~~~d~~gl~~~~~~l~~~~~~~~~~~~~~~~~~~a~~l~~~vk~~tPVdTG~Lr~sw~~~~~~~~~~~~~~g~~~~ 80 (141) T protein:vir:79 1 MARWGSVDFREFKRVCKKMEKLTKIDLDKFCKDAARELAARLLGKVIRRTPVDTGFLRQGWNGVAYARSLPVYKQGNNYI 80 (141) T ss_pred CCCCccCcHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhhcccccccccccccceeecCCeeE Confidence 33 34556666665544 677888999999999999999999999999999999643 2233556788 Q ss_pred EEEecCccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHhHhhccC Q lcl|NC_011044. 66 GGVEATADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAADPDIHMT 137 (137) Q Consensus 66 ~~v~~~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~~~i~~~ 137 (137) +.|+++++||+||||||+-. +.. +| .+++.||..|.++...+.+.+=+. T Consensus 81 v~v~n~~~YA~~VE~Ghr~~---~~~----------gf----------V~G~fml~~s~~~~~~~~~~~~~~ 129 (141) T protein:vir:79 81 IEVVNPTEYASYVNFGHRTK---DGK----------GW----------VKGQHFLTISEMELQSQVDKIIEK 129 (141) T ss_pred EEEecCCcchhhhhcceeec---CCc----------ce----------eCCchhHHHHHHHHHHHHHHHHHH Confidence 99999999999999998521 111 01 136777888887766655444333 No 51 >protein:vir:93617 Length: 148 # NCBI annotation: putative structural component # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449299;genbank:gi:157166047;interpro:IPR010064;interpro:IPR011693;uniprot:Q6H9U2;genbank:GeneID:5580439 Probab=99.61 E-value=1.2e-18 Score=118.91 Aligned_cols=108 Identities=23% Similarity=0.252 Sum_probs=77.0 Q ss_pred Cc------eehhhhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEE----- Q lcl|NC_011044. 1 MP------VTARIHINEPELERQSG-AIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGV----- 68 (137) Q Consensus 1 ms------v~~~l~~~~~~l~~~~~-~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v----- 68 (137) |+ |..+|...|+.|++.+. .+.+++|...+..|+++|+.++|++||.|++||........+......| T Consensus 2 m~~~~~i~Gldel~~~l~~L~~~~~~~~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~g~~~~~v~~~~~ 81 (148) T protein:vir:93 2 IETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRRSRDGGMESGVHIRGV 81 (148) T ss_pred cceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcchhhhhceeccccccCCceeeeeeeccc Confidence 33 34666788888887765 5778999999999999999999999999999997543222211111111 Q ss_pred ---------------ecCccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHH-HhH Q lcl|NC_011044. 69 ---------------EATADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAA-ADP 132 (137) Q Consensus 69 ---------------~~~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~-~~~ 132 (137) ..+..|+.|+||||. .++|||||+||++.+++ ... T Consensus 82 ~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~-----------------------------~~pa~PFl~pA~~~~k~~~~~ 132 (148) T protein:vir:93 82 NPDTGNSDNTMKADNPRNAFYWRFVEMGTV-----------------------------NMPPHPFVRPAFDVRSEQAAQ 132 (148) T ss_pred ccccccccceeecCCCCCcceeeeeccCCC-----------------------------CCCCCcchhHHHHHhHHHHHH Confidence 234578888888873 25799999999998755 344 Q ss_pred hhccC Q lcl|NC_011044. 133 DIHMT 137 (137) Q Consensus 133 ~i~~~ 137 (137) .|.+. T Consensus 133 ~~~~~ 137 (148) T protein:vir:93 133 VAIAR 137 (148) T ss_pred HHHHH Confidence 44444 No 52 >protein:vir:194 Length: 149 # NCBI annotation: Gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037704;genbank:gi:9634169;genbank:GeneID:1262536 Probab=99.58 E-value=2.8e-18 Score=116.83 Aligned_cols=108 Identities=21% Similarity=0.240 Sum_probs=75.5 Q ss_pred Cc------eehhhhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccC------------ Q lcl|NC_011044. 1 MP------VTARIHINEPELERQSG-AIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRP------------ 61 (137) Q Consensus 1 ms------v~~~l~~~~~~l~~~~~-~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~------------ 61 (137) |+ |..+|...|+.|++.+. .+++.++...|+.|+++|+.++|++||.|++||......... T Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~si~~~~~~~~~~~~~~~~v~~~~ 81 (149) T protein:vir:19 2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIDRAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRG 81 (149) T ss_pred cceeeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCchhhhhhccccccccccccceeecccccc Confidence 33 34667788888887765 577899999999999999999999999999999643222110 Q ss_pred ---------cEEEEEEecCccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHH-Hh Q lcl|NC_011044. 62 ---------FHVDGGVEATADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAA-AD 131 (137) Q Consensus 62 ---------~~~~~~v~~~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~-~~ 131 (137) ......-..+..|+.|+||||. .++|||||+||++.+++ .. T Consensus 82 ~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~-----------------------------~~~a~PF~~pA~~~~k~~~~ 132 (149) T protein:vir:19 82 VNPRTGNSDNTMKANNPRNAFYWRFVELGTA-----------------------------NMPAHPFVRPAYDTREEEAA 132 (149) T ss_pred cccccccccceeecCCCCccceeeeeccCCC-----------------------------CCCCCcchhHHHHHHHHHHH Confidence 0001111234567777777763 36799999999998866 44 Q ss_pred HhhccC Q lcl|NC_011044. 132 PDIHMT 137 (137) Q Consensus 132 ~~i~~~ 137 (137) +.|.+. T Consensus 133 ~~~~~~ 138 (149) T protein:vir:19 133 SVAIAR 138 (149) T ss_pred HHHHHH Confidence 444444 No 53 >protein:vir:1273 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690765;genbank:gi:22855005;genbank:GeneID:955232 Probab=99.57 E-value=5.7e-18 Score=115.13 Aligned_cols=108 Identities=9% Similarity=-0.007 Sum_probs=85.8 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcc---chhhhccceeeeec-ccCcEEEEEEec---Ccc Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVR---TGNLGRTVGELPQR-YRPFHVDGGVEA---TAD 73 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~---TG~Lr~SI~~~~~~-~~~~~~~~~v~~---~~~ 73 (137) ++|..+|...|+.|.+.+.+.++++++..|..+.++++.++|++ ||+|++||.....+ ..+....+.||. +.. T Consensus 6 i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~~k~~ap~~~~~tg~l~~~I~~~~~k~~~~g~~~v~Vg~~~~~~~ 85 (127) T protein:vir:12 6 FDGIDDLTQYFEKIGGDIEKVEPVALKAGGEIIAERQRSHVNRSDKKQPHMQDNITVSNVRESKDGVRFVAVGPNKKVAY 85 (127) T ss_pred ehhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCChhHHHHhhhccccccccCceeEEEEeeCCCCcc Confidence 77779999999999999999999999999999999999999985 89999999754332 233445667764 467 Q ss_pred chhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHH-hHhhccC Q lcl|NC_011044. 74 YAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAA-DPDIHMT 137 (137) Q Consensus 74 YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~-~~~i~~~ 137 (137) |+.|+||||.. ++|||||+||++.+++. ...+.+. T Consensus 86 y~~f~E~GT~~-----------------------------~~a~Pf~~pa~~~~~~~~~~~~~~~ 121 (127) T protein:vir:12 86 RGRFLEWGTSK-----------------------------MPPQPFIEKGGKEGEGPAVELMERI 121 (127) T ss_pred eeeeeccCccC-----------------------------CCCCccchHhHHHHHHHHHHHHHHH Confidence 99999999953 36999999999987663 3333333 No 54 >protein:vir:1891 Length: 179 # NCBI annotation: gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037671;genbank:gi:9634129;genbank:GeneID:1262520 Probab=99.53 E-value=1.7e-17 Score=112.55 Aligned_cols=132 Identities=11% Similarity=0.048 Sum_probs=81.3 Q ss_pred CceehhhhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCc-----cchhhhccceeeeecccCcEEEEEEecCccc Q lcl|NC_011044. 1 MPVTARIHINEPELERQSG-AIFRGKHRSLTRRIATQARADVPV-----RTGNLGRTVGELPQRYRPFHVDGGVEATADY 74 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~-~~~~~~~~~~a~~i~~~ak~~aPv-----~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~Y 74 (137) +.+..+|...|+.|++.+. ++++++|.+.|+.|+++|+.++|+ ++|.|++||........ ..-..++.| T Consensus 9 i~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~~~~~-----~~~~g~~~~ 83 (179) T protein:vir:18 9 LTGLESLLGKMEAVSEVTRNKAGRFALRKAANIIRDRARSNASRVDDPLTKEAIHKNIVASFSSKQ-----FRRTGDLAF 83 (179) T ss_pred eecHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccccchhhhhhheeecccccc-----cccccceeE Confidence 4467889999999998875 678999999999999999999976 57889999864322110 111223446 Q ss_pred hhhhhcCCCCCcccc------ccCCccee-----ecCCeeEEeeeEe--cCCCCCCchhhhhHHHHHH-HhHhhccC Q lcl|NC_011044. 75 AAAVHEGSRPHRIVA------RHAQALHF-----FWHGREIFRKSVW--HPGVRSRPFLRNAAQRIAA-ADPDIHMT 137 (137) Q Consensus 75 A~~vE~GT~ph~i~p------k~~k~l~~-----~~~g~~~~~k~V~--~pG~~a~pfl~~A~~~~~~-~~~~i~~~ 137 (137) ..+++.||.++.... ........ ...+..++.+-|. ...++|||||+||++.++. ..+.|.+. T Consensus 84 ~vgv~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~y~~fvEfGT~kmpa~PFlrPA~~~~~~~a~~~i~~~ 160 (179) T protein:vir:18 84 RVGVMGGARQYANTKANVRKGRAGKTYKTSGDKGNPGGDTWYWRFLEFGTEHTSARPILRPAMNGVDNDVINVFSTE 160 (179) T ss_pred eeecccccccccccccccccCcccccccccccccCCCCccceeEEeccCCCCCCCCccchhhHHhhHHHHHHHHHHH Confidence 666777776654221 11111000 0001111111111 1357899999999998765 44555554 No 55 >protein:vir:5745 Length: 135 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892056;genbank:gi:33770519;interpro:IPR010064;interpro:IPR011693;uniprot:Q7Y404;genbank:GeneID:2637451 Probab=99.51 E-value=4.3e-17 Score=110.33 Aligned_cols=108 Identities=14% Similarity=0.103 Sum_probs=79.1 Q ss_pred CceehhhhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCccc----hhhhccceeeeecc--cCcEEEEEEecCcc Q lcl|NC_011044. 1 MPVTARIHINEPELERQSG-AIFRGKHRSLTRRIATQARADVPVRT----GNLGRTVGELPQRY--RPFHVDGGVEATAD 73 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~-~~~~~~~~~~a~~i~~~ak~~aPv~T----G~Lr~SI~~~~~~~--~~~~~~~~v~~~~~ 73 (137) ++|..+|...|+.|.+.+. ++.+.++.+.+..++++++.++|+++ |+|++||....... +...+.+.|+.+.. T Consensus 7 i~Gl~el~~~l~~L~~~~~~k~~~~Al~~~a~~v~~~~k~~ap~~~~~~~g~l~~~I~i~~~k~~~~~~~v~v~vg~~~~ 86 (135) T protein:vir:57 7 ISGLQELERRLIAVGEEVGTKILRDAGRAAMAVVEADMKQNAGYDNSSTNAHMRDSIKIRSSRGKAGSTVVVLRVGPTRS 86 (135) T ss_pred ehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhHHhhcccccccccccceeEEEEecCCCC Confidence 4445677888899988875 56789999999999999999999974 99999997654332 22334555665443 Q ss_pred ---chhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHH-HhHhhccC Q lcl|NC_011044. 74 ---YAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAA-ADPDIHMT 137 (137) Q Consensus 74 ---YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~-~~~~i~~~ 137 (137) |+.|+||||.. ++|||||+||++.++. ....|.+. T Consensus 87 ~~~~~~f~E~GT~~-----------------------------~~a~PF~~pa~~~~~~~~~~~~~~~ 125 (135) T protein:vir:57 87 HYMKALAQEFGTIK-----------------------------QVAKPFIRPALDYNKMQVLRILTVE 125 (135) T ss_pred cceeEeecccCCCC-----------------------------CCCCcchhHhHHHhHHHHHHHHHHH Confidence 47778999853 3699999999998766 33333333 No 56 >protein:vir:102963 Length: 163 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945289;genbank:gi:39653724;uniprot:Q708M3;genbank:GeneID:2672877 Probab=99.51 E-value=3.5e-17 Score=110.77 Aligned_cols=113 Identities=14% Similarity=0.137 Sum_probs=85.2 Q ss_pred Ccee---hhhhhhHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHhCCc---------------------------cc Q lcl|NC_011044. 1 MPVT---ARIHINEPELER-----QSGAIFRGKHRSLTRRIATQARADVPV---------------------------RT 45 (137) Q Consensus 1 msv~---~~l~~~~~~l~~-----~~~~~~~~~~~~~a~~i~~~ak~~aPv---------------------------~T 45 (137) ||+. .+|+.-.+.|.+ .+...+++.++++|..+.+.++.++|| +| T Consensus 1 m~~~~d~~~l~~f~k~l~~~~~~~~~~~~~~~~~~e~a~~ll~~vk~rtPv~~~~~~~~~~~~~~~k~~k~~~~~~~k~t 80 (163) T protein:vir:10 1 MSGGFDYRSFAKFANNFNRNANHAKVDRFMRQTLNYEGTELKSKVKERTPVGVYTDHWVEFTTKDGKHVKFWASAHGKQG 80 (163) T ss_pred CCCccCHHHHHHHHHHHHHHhhhcchHHHHHHHHHHHHHHHHHHHHHhCCcccchhhhhhhhhcccchhhhhcccccccc Confidence 8884 223333333322 345678899999999999999999998 79 Q ss_pred hhhhccceeeeecccCcEEEEEEecCccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHH Q lcl|NC_011044. 46 GNLGRTVGELPQRYRPFHVDGGVEATADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQ 125 (137) Q Consensus 46 G~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~ 125 (137) |+||+||+...+...+.++++.|+++++||+||||||+- ++ |+|+ +.+++|..|.+ T Consensus 81 G~lr~swk~~~~~k~~~~~~v~v~N~~~YA~~VE~GHR~---~~-----------gGfV----------~G~fml~~s~~ 136 (163) T protein:vir:10 81 GTLQKGWSKSRIEVSGRTYKQKVYNKVYYAPHVEYGHKT---VN-----------GGFV----------PGQFFLHKTVE 136 (163) T ss_pred chhhccceecceeecCCceEEEEEecCCccchhhcceee---cC-----------Ccee----------ccchhhHHHHH Confidence 999999998777777778999999999999999999742 22 2332 27888999988 Q ss_pred HHHHHhHhhccC Q lcl|NC_011044. 126 RIAAADPDIHMT 137 (137) Q Consensus 126 ~~~~~~~~i~~~ 137 (137) +...+.+.+=+- T Consensus 137 ~~~~~~~~~~e~ 148 (163) T protein:vir:10 137 DTKSDMEKRVRD 148 (163) T ss_pred HHHHHHHHHHHH Confidence 887766665444 No 57 >protein:vir:105089 Length: 133 # NCBI annotation: Gp11 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006591;genbank:gi:46402097;genbank:GeneID:2777955 Probab=99.50 E-value=6e-17 Score=109.52 Aligned_cols=108 Identities=10% Similarity=0.058 Sum_probs=78.9 Q ss_pred CceehhhhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCccchh----hhccceeeeec---ccCcEEEEEEecC- Q lcl|NC_011044. 1 MPVTARIHINEPELERQSG-AIFRGKHRSLTRRIATQARADVPVRTGN----LGRTVGELPQR---YRPFHVDGGVEAT- 71 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~-~~~~~~~~~~a~~i~~~ak~~aPv~TG~----Lr~SI~~~~~~---~~~~~~~~~v~~~- 71 (137) ++|..+|...|+.|.+.++ ++++.++...|..++++++.++|+++|. |++||...... .....+.+.||.+ T Consensus 6 i~Gl~el~~~l~~L~~~~~~k~~~~Al~~~a~~i~~~ak~~ap~~~~~~~~~~~~~I~v~~~~~~~~~~~~~~v~vg~~~ 85 (133) T protein:vir:10 6 VKGLDELERQLTALGEKVATKVLRDAGREALKVVEEDMKQHAGFDETSTGQHMRDSIKIRSSTRKAQGNAVVTLRVGPSK 85 (133) T ss_pred eehHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCcchhhhhhcccccccccccCccceEEEEecCCC Confidence 5556889999999998875 5778999999999999999999999887 78888542211 1122344555543 Q ss_pred --ccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHH-HhHhhccC Q lcl|NC_011044. 72 --ADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAA-ADPDIHMT 137 (137) Q Consensus 72 --~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~-~~~~i~~~ 137 (137) ..|+.|+||||.. ++|||||+||++.+.. ..+.|.+. T Consensus 86 ~~~~y~~f~E~GT~k-----------------------------~~a~PF~~pA~~~~~~~~~~~~~~~ 125 (133) T protein:vir:10 86 QHHMKVLAQEFGTVK-----------------------------QVADPFIRPALDYNVQTVLRVLTVE 125 (133) T ss_pred CccceEeeeccCCCC-----------------------------CCCCccchHHHHHhHHHHHHHHHHH Confidence 3588999999843 3699999999998765 33333333 No 58 >protein:vir:105007 Length: 146 # NCBI annotation: conserved phage protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459972;genbank:gi:85701387;genbank:GeneID:3882148 Probab=99.50 E-value=6.1e-17 Score=109.47 Aligned_cols=108 Identities=10% Similarity=-0.013 Sum_probs=79.8 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeee--------------ecccCcEEEE Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELP--------------QRYRPFHVDG 66 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~--------------~~~~~~~~~~ 66 (137) ++|..+|...|+.|.+...+.+++++...|..++++++.++|+++|.|++++.... .........+ T Consensus 9 i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~g~~~~ 88 (146) T protein:vir:10 9 LLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAKLEGGIKTV 88 (146) T ss_pred ehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceeccccccccceeE Confidence 44568889999999999889999999999999999999999999999888764311 1111222334 Q ss_pred EEe------cCccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHH-hHhhccC Q lcl|NC_011044. 67 GVE------ATADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAA-DPDIHMT 137 (137) Q Consensus 67 ~v~------~~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~-~~~i~~~ 137 (137) .|+ ++..|+.|+||||. .++|+|||+||++.+++. ...|.+. T Consensus 89 ~vg~~~~~~~~~~y~~f~E~GT~-----------------------------~~~a~PFl~pa~~~~k~~~~~~~~~~ 137 (146) T protein:vir:10 89 KIGLNKADRSPWFYLKFHEWGTS-----------------------------KMPAHPFIEPGFNASKAEAVRAMTDI 137 (146) T ss_pred EeeeccCCCCCcceeeeeccCCC-----------------------------CCCCCcchhHHHHHhHHHHHHHHHHH Confidence 443 44679999999983 256999999999987653 2333333 No 59 >protein:vir:107568 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338191;genbank:gi:77020147;genbank:GeneID:3703699 Probab=99.50 E-value=6.1e-17 Score=109.47 Aligned_cols=108 Identities=10% Similarity=-0.013 Sum_probs=79.8 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeee--------------ecccCcEEEE Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELP--------------QRYRPFHVDG 66 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~--------------~~~~~~~~~~ 66 (137) ++|..+|...|+.|.+...+.+++++...|..++++++.++|+++|.|++++.... .........+ T Consensus 9 i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~g~~~~ 88 (146) T protein:vir:10 9 LLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAKLEGGIKTV 88 (146) T ss_pred ehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceeccccccccceeE Confidence 44568889999999999889999999999999999999999999999888764311 1111222334 Q ss_pred EEe------cCccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHH-hHhhccC Q lcl|NC_011044. 67 GVE------ATADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAA-DPDIHMT 137 (137) Q Consensus 67 ~v~------~~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~-~~~i~~~ 137 (137) .|+ ++..|+.|+||||. .++|+|||+||++.+++. ...|.+. T Consensus 89 ~vg~~~~~~~~~~y~~f~E~GT~-----------------------------~~~a~PFl~pa~~~~k~~~~~~~~~~ 137 (146) T protein:vir:10 89 KIGLNKADRSPWFYLKFHEWGTS-----------------------------KMPAHPFIEPGFNASKAEAVRAMTDI 137 (146) T ss_pred EeeeccCCCCCcceeeeeccCCC-----------------------------CCCCCcchhHHHHHhHHHHHHHHHHH Confidence 443 44679999999983 256999999999987653 2333333 No 60 >protein:vir:102085 Length: 146 # NCBI annotation: head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512318;genbank:gi:89152487;genbank:GeneID:3953078 Probab=99.50 E-value=6.1e-17 Score=109.47 Aligned_cols=108 Identities=10% Similarity=-0.013 Sum_probs=79.8 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeee--------------ecccCcEEEE Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELP--------------QRYRPFHVDG 66 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~--------------~~~~~~~~~~ 66 (137) ++|..+|...|+.|.+...+.+++++...|..++++++.++|+++|.|++++.... .........+ T Consensus 9 i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~g~~~~ 88 (146) T protein:vir:10 9 LLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAKLEGGIKTV 88 (146) T ss_pred ehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceeccccccccceeE Confidence 44568889999999999889999999999999999999999999999888764311 1111222334 Q ss_pred EEe------cCccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHH-hHhhccC Q lcl|NC_011044. 67 GVE------ATADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAA-DPDIHMT 137 (137) Q Consensus 67 ~v~------~~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~-~~~i~~~ 137 (137) .|+ ++..|+.|+||||. .++|+|||+||++.+++. ...|.+. T Consensus 89 ~vg~~~~~~~~~~y~~f~E~GT~-----------------------------~~~a~PFl~pa~~~~k~~~~~~~~~~ 137 (146) T protein:vir:10 89 KIGLNKADRSPWFYLKFHEWGTS-----------------------------KMPAHPFIEPGFNASKAEAVRAMTDI 137 (146) T ss_pred EeeeccCCCCCcceeeeeccCCC-----------------------------CCCCCcchhHHHHHhHHHHHHHHHHH Confidence 443 44679999999983 256999999999987653 2333333 No 61 >protein:vir:102875 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338140;genbank:gi:77020200;genbank:GeneID:3703784 Probab=99.50 E-value=6.1e-17 Score=109.47 Aligned_cols=108 Identities=10% Similarity=-0.013 Sum_probs=79.8 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeee--------------ecccCcEEEE Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELP--------------QRYRPFHVDG 66 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~--------------~~~~~~~~~~ 66 (137) ++|..+|...|+.|.+...+.+++++...|..++++++.++|+++|.|++++.... .........+ T Consensus 9 i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~g~~~~ 88 (146) T protein:vir:10 9 LLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAKLEGGIKTV 88 (146) T ss_pred ehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceeccccccccceeE Confidence 44568889999999999889999999999999999999999999999888764311 1111222334 Q ss_pred EEe------cCccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHH-hHhhccC Q lcl|NC_011044. 67 GVE------ATADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAA-DPDIHMT 137 (137) Q Consensus 67 ~v~------~~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~-~~~i~~~ 137 (137) .|+ ++..|+.|+||||. .++|+|||+||++.+++. ...|.+. T Consensus 89 ~vg~~~~~~~~~~y~~f~E~GT~-----------------------------~~~a~PFl~pa~~~~k~~~~~~~~~~ 137 (146) T protein:vir:10 89 KIGLNKADRSPWFYLKFHEWGTS-----------------------------KMPAHPFIEPGFNASKAEAVRAMTDI 137 (146) T ss_pred EeeeccCCCCCcceeeeeccCCC-----------------------------CCCCCcchhHHHHHhHHHHHHHHHHH Confidence 443 44679999999983 256999999999987653 2333333 No 62 >protein:vir:99528 Length: 92 # NCBI annotation: putative major tail protein # Family: family:all:180 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958541;genbank:gi:41179323;genbank:GeneID:2717166 Probab=99.49 E-value=3.1e-17 Score=111.05 Aligned_cols=82 Identities=21% Similarity=0.193 Sum_probs=61.3 Q ss_pred Cc----eehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEE---ecCcc Q lcl|NC_011044. 1 MP----VTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGV---EATAD 73 (137) Q Consensus 1 ms----v~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v---~~~~~ 73 (137) |+ |..+|.++|+...+ ...+++.+++.+..++++|+.++|+|||+||+||.......+ +.+.| +.+++ T Consensus 4 ~~i~~~Gld~L~~~L~~~~~--~~~v~~vv~~~~~~l~~~ak~~ap~dTG~lrrSI~~~~~~~g---~~~~v~~~gp~a~ 78 (92) T protein:vir:99 4 YSISWDGLDALDEALANQQN--MNTVKKVVKKHTANLMTATQQAVPVDTGHLKQSAQIQISRDG---FTGSVTYGGGLVN 78 (92) T ss_pred eeeEeehHHHHHHHHHhhcc--HHHHHHHHHHHHHHHHHHHHHhCCCCccccceeeeEEeecCC---eeEEEEeccCccc Confidence 33 34555555544332 356788999999999999999999999999999987654432 33344 58899 Q ss_pred chhhhhcCCCCCcc Q lcl|NC_011044. 74 YAAAVHEGSRPHRI 87 (137) Q Consensus 74 YA~~vE~GT~ph~i 87 (137) ||+||||||+-+.. T Consensus 79 Ya~YvE~GTR~M~A 92 (92) T protein:vir:99 79 YAAYVEFGTRFMDS 92 (92) T ss_pred cccccccceeecCC Confidence 99999999987654 No 63 >protein:vir:4347 Length: 164 # NCBI annotation: Orf14 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061510;genbank:gi:9635606;genbank:GeneID:1262873 Probab=99.49 E-value=6.8e-17 Score=109.22 Aligned_cols=108 Identities=12% Similarity=0.055 Sum_probs=76.6 Q ss_pred CceehhhhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCc-----cchhhhccceeeeec---ccCcEEEEEE--- Q lcl|NC_011044. 1 MPVTARIHINEPELERQSG-AIFRGKHRSLTRRIATQARADVPV-----RTGNLGRTVGELPQR---YRPFHVDGGV--- 68 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~-~~~~~~~~~~a~~i~~~ak~~aPv-----~TG~Lr~SI~~~~~~---~~~~~~~~~v--- 68 (137) +.|..+|...|+.|.+.+. ++++.+|...++.|+++++.++|+ ++|.|++||.+.... .....+...| T Consensus 9 i~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~~~~~~~~~~~~~~~vg~~ 88 (164) T protein:vir:43 9 ITGLDSLLGKLDSVTDDVKRRGGRAALRKAAMIVVQAAKQGAEKVDDPGTGRSISDNIALRWNGRLFKRTGDLGFRIGVL 88 (164) T ss_pred eecHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccCCCccchhhhhhhhhcccCccccccceeEEeccc Confidence 4456789999999999875 688999999999999999999997 578999998542100 0111111111 Q ss_pred ----------------ecCccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHh- Q lcl|NC_011044. 69 ----------------EATADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAAD- 131 (137) Q Consensus 69 ----------------~~~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~- 131 (137) +.+..|+.|+||||. .++|+|||+||++.++.+. T Consensus 89 ~~~~~~~~~~~~~~~~~~~~~y~~f~EfGT~-----------------------------km~a~PFlrPA~~~~k~~~~ 139 (164) T protein:vir:43 89 HGAVLPKKGERSDKTANAPTPHWRLLEFGTE-----------------------------DMRAQPFMRSALADNIAEVT 139 (164) T ss_pred ccccccccccccccCCCCCcceEEEeecCCC-----------------------------CCCCCcchhhhHHHhHHHHH Confidence 234578888888872 4779999999999887643 Q ss_pred H--------hhccC Q lcl|NC_011044. 132 P--------DIHMT 137 (137) Q Consensus 132 ~--------~i~~~ 137 (137) + +|+.+ T Consensus 140 ~~~~~~l~~~i~ka 153 (164) T protein:vir:43 140 STFVSEYEKGIDRA 153 (164) T ss_pred HHHHHHHHHHHHHH Confidence 2 33333 No 64 >protein:vir:9879 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:2718 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795641;genbank:gi:28876400;genbank:GeneID:1257931 Probab=99.45 E-value=1.4e-16 Score=107.49 Aligned_cols=109 Identities=17% Similarity=0.223 Sum_probs=78.9 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--CCc-------cchhhhccceeeeecccCcEEEEEEec- Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSLTRRIATQARAD--VPV-------RTGNLGRTVGELPQRYRPFHVDGGVEA- 70 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~--aPv-------~TG~Lr~SI~~~~~~~~~~~~~~~v~~- 70 (137) |.+.+.|...|+.. ....+++.++.-..++.+.++.+ +|| |||.|++||+....+++ .++.+++ T Consensus 1 i~G~~~L~~~Lk~~---s~~dvk~VVkkN~ael~~r~q~~~~~pv~~~~k~~dTG~lkRSi~l~~~~~g---~~~~vgp~ 74 (127) T protein:vir:98 1 MTGMPALEVKLRSM---SEKRWDRVANKNLTEMFNRAARPPGTPIGKNTKRHKSGELLRSRRLKKVNSS---KDVITGNF 74 (127) T ss_pred CcChHHHHHHHHHh---hHHHHHHHHhhhhHHHHHHHHhccCCceeccccccCcccceeeeEEEEecCC---ceEEeccC Confidence 88888888888755 33446777777777778878776 899 99999999998776553 5566666 Q ss_pred --CccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCC-CCCCchhhhhHHHH-----HHHhHhhcc Q lcl|NC_011044. 71 --TADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPG-VRSRPFLRNAAQRI-----AAADPDIHM 136 (137) Q Consensus 71 --~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG-~~a~pfl~~A~~~~-----~~~~~~i~~ 136 (137) ..+||+|||||||-.. +|+.+ | +++||||.||++.. ++.++..+. T Consensus 75 g~t~dYapyvEyGTR~m~-------------~~~~~--------gf~~aqp~l~paf~~Qk~iF~~DL~~l~k~ 127 (127) T protein:vir:98 75 GYIKDYAPHVEYGHRIVR-------------NGKQV--------GYANGTKYLFNNVKKQREIYRQDMLNELRR 127 (127) T ss_pred cccccccceeecceeeee-------------ccccc--------ccccCccccccchHHHhHHHHHHHHHHhcC Confidence 4999999999997431 12222 3 66999999999864 334444555 No 65 >protein:vir:81147 Length: 126 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285816;genbank:gi:148747737;genbank:GeneID:5247190 Probab=99.44 E-value=3.9e-16 Score=105.05 Aligned_cols=112 Identities=19% Similarity=0.198 Sum_probs=84.5 Q ss_pred Cce---ehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccc--h Q lcl|NC_011044. 1 MPV---TARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADY--A 75 (137) Q Consensus 1 msv---~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~Y--A 75 (137) |++ .++|...|+.+.+.+...+++++++++..+..++|.++|++||.|++||........+. ....+.++..| + T Consensus 4 i~id~la~~I~~~L~~y~~~v~~~v~~~v~~~a~~~~~~ik~~aP~rTG~y~ksw~vk~~~~~g~-~~~vv~~~~~~~l~ 82 (126) T protein:vir:81 4 ITIDRLADELLQAVKEYTDDVAEGVRKKVDETARKVLKEAQALAPKRTGEYARTFTITKEDGYGT-TKRIIWNKKHYRRV 82 (126) T ss_pred cchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcccchhhccccccccccCCc-ceEEEeccCCCCce Confidence 444 24577778899999999999999999999999999999999999999998665444333 33455666666 7 Q ss_pred hhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHH-HhHhhccC Q lcl|NC_011044. 76 AAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAA-ADPDIHMT 137 (137) Q Consensus 76 ~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~-~~~~i~~~ 137 (137) +++||||. . ..|+. ++|+|||+||.+.... .+.+|++. T Consensus 83 HLLEfGha---~-----------r~gGr----------V~a~Phi~Pa~e~~~~~~~~~i~~~ 121 (126) T protein:vir:81 83 HLLEFGHA---K-----------VNGGR----------VKEYPHLRPAYDKHGARLPDELKRV 121 (126) T ss_pred eeeeccee---c-----------CCCCc----------cCCCcchHHHHHHHHHHHHHHHHHH Confidence 89999973 1 12221 5699999999998765 44555555 No 66 >protein:vir:3873 Length: 128 # NCBI annotation: putative head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680490;swissprot:trembl:p94214;genbank:gi:22296530;interpro:IPR010064;uniprot:P94214;genbank:GeneID:951688 Probab=99.43 E-value=3.7e-16 Score=105.21 Aligned_cols=108 Identities=16% Similarity=0.112 Sum_probs=81.7 Q ss_pred Cce----ehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccch------hhhccceeeeecccCcEEEEEEec Q lcl|NC_011044. 1 MPV----TARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTG------NLGRTVGELPQRYRPFHVDGGVEA 70 (137) Q Consensus 1 msv----~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG------~Lr~SI~~~~~~~~~~~~~~~v~~ 70 (137) ||+ ..+|...|+.|.+.+.+..++++.+.|..++.+++.++|+++| +|+++|........+....+.||. T Consensus 1 m~v~i~Gl~el~~~l~~l~~~~~k~~~~al~~ga~~~~~~~k~~ap~~~~~~~~~~h~~d~I~~~~~k~~~g~~~~~VG~ 80 (128) T protein:vir:38 1 MGVKVTGDAELLANLNKLQFGVAKEARAAVRDGAQKFADKLKSNTPEWDGETDMSGHLRDDIKLSSVRETSGLTEVDVGY 80 (128) T ss_pred CccchhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCCCcccchhhhhhccccccccCceeEEEeee Confidence 555 5778899999999988899999999999999999999999765 578887654444444445566754 Q ss_pred ---CccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHH-hHhhccC Q lcl|NC_011044. 71 ---TADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAA-DPDIHMT 137 (137) Q Consensus 71 ---~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~-~~~i~~~ 137 (137) +..|+.|+||||.. ++|+|||+||++.+++. ...+.+. T Consensus 81 ~k~~~~y~~f~E~GT~k-----------------------------~~a~pF~~pa~~~~~~~~~~~~~~~ 122 (128) T protein:vir:38 81 GKDTGWRAHFPNSGTSM-----------------------------QDPQHFIEETQEIMRPVVIAAFLSH 122 (128) T ss_pred cCCCceEEeeeccCccC-----------------------------CCCCcchhHHHHHhHHHHHHHHHHH Confidence 46799999999831 46999999999987542 2222222 No 67 >protein:vir:1386 Length: 149 # NCBI annotation: Gp9 protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612838;genbank:gi:20065972;genbank:GeneID:935787 Probab=99.40 E-value=7.7e-16 Score=103.46 Aligned_cols=108 Identities=6% Similarity=0.009 Sum_probs=81.3 Q ss_pred CceehhhhhhHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHhCCcc-------------chhhhccceeeeecccCcEEE Q lcl|NC_011044. 1 MPVTARIHINEPELE--RQSGAIFRGKHRSLTRRIATQARADVPVR-------------TGNLGRTVGELPQRYRPFHVD 65 (137) Q Consensus 1 msv~~~l~~~~~~l~--~~~~~~~~~~~~~~a~~i~~~ak~~aPv~-------------TG~Lr~SI~~~~~~~~~~~~~ 65 (137) +++..+|...|+.|. +..+++.+.+|+..|..++++++.++|+. +|+++++|....+...+.... T Consensus 9 i~Gl~eL~~~l~~L~~~~~~~k~~~~Al~~ga~~v~~~~k~~aP~~~~~~~~~~~~~~~~~~~~d~i~~~~~~~~~g~~~ 88 (149) T protein:vir:13 9 FEGLDDLIKTFEQLGTEKENEDVEKSILKECGDLAKKTVAPLIHISDDNSKSGRKGSRPPGHAANNIPEPKIRKKKGNLQ 88 (149) T ss_pred eecHHHHHHHHHhcccHHHHHHHHHHHHHHHHHHHHHHHHHhCCccCCccccccccccccchhhhcceecccccccceeE Confidence 555688889999984 56788899999999999999999999974 568999987655544444445 Q ss_pred EEEe------cCccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHhHh------ Q lcl|NC_011044. 66 GGVE------ATADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAADPD------ 133 (137) Q Consensus 66 ~~v~------~~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~~~------ 133 (137) +.|| ++..|+.|+||||. + ++|||||+||++.++.+... T Consensus 89 ~~VG~~~~~~~~~~y~~f~E~GT~--------------------------k---~~a~pF~~pa~~~~~~~~~~~~~~~l 139 (149) T protein:vir:13 89 CVVGWEKSDNTPFYYMKMEEWGTS--------------------------E---RPPHHAFGKTNKILKRVYDNIAQKKY 139 (149) T ss_pred EEeeccCCCCCccceeeeeccCcc--------------------------C---CCCCccchHHHHHHHHHHHHHHHHHH Confidence 5665 35689999999983 1 46999999999987654322 Q ss_pred ---hccC Q lcl|NC_011044. 134 ---IHMT 137 (137) Q Consensus 134 ---i~~~ 137 (137) |+.. T Consensus 140 ~k~i~~~ 146 (149) T protein:vir:13 140 DNFVKEK 146 (149) T ss_pred HHHHHHH Confidence 2222 No 68 >protein:vir:9708 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795470;genbank:gi:28876221;genbank:GeneID:1257765 Probab=99.36 E-value=1.8e-15 Score=101.45 Aligned_cols=108 Identities=13% Similarity=0.035 Sum_probs=84.2 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchh----hhccceeeeeccc-CcEEEEEEec---Cc Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGN----LGRTVGELPQRYR-PFHVDGGVEA---TA 72 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~----Lr~SI~~~~~~~~-~~~~~~~v~~---~~ 72 (137) .+|..+|..+|+.|.+......++++.++|..++++++.++|+++|. |++||.....+.. .......||. +. T Consensus 2 v~Gl~el~~~l~~l~~~~~~~~~~al~~ga~~~~~~~k~~ap~~~~~~~~hl~d~I~~~~~k~~~~g~~~~~VG~~k~~~ 81 (125) T protein:vir:97 2 TKGLDEILANLTKLEVKAPKTAKAAVTEVAKEFEKALKANTPVYEVETDERLQEDTVISGFKGANVGIVSKEIGYGKATG 81 (125) T ss_pred chhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCCchhhHHhhhhcccccccccCceEEEEeecCCCc Confidence 78999999999999999999999999999999999999999998876 9999976443322 2223345543 46 Q ss_pred cchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHh-HhhccC Q lcl|NC_011044. 73 DYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAAD-PDIHMT 137 (137) Q Consensus 73 ~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~-~~i~~~ 137 (137) .|+.|+||||.. ++|+|||+||++..+++. ..+.+. T Consensus 82 ~y~~f~E~GT~k-----------------------------~~~~pF~~pa~~~~k~~~~~~~~~~ 118 (125) T protein:vir:97 82 WRAHYPNDGTIY-----------------------------QRGQDFKERTINQMTPKAKQLYAEK 118 (125) T ss_pred eeEeeeccCccC-----------------------------CCcCccchHhHHHhHHHHHHHHHHH Confidence 799999999831 569999999999876533 333333 No 69 >protein:vir:79988 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430006;genbank:gi:156604061;genbank:GeneID:5525448 Probab=99.28 E-value=1.8e-14 Score=95.88 Aligned_cols=108 Identities=6% Similarity=-0.041 Sum_probs=78.5 Q ss_pred Cceehh---hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchh--hhccceeeeeccc--CcEEEEEEecCc- Q lcl|NC_011044. 1 MPVTAR---IHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGN--LGRTVGELPQRYR--PFHVDGGVEATA- 72 (137) Q Consensus 1 msv~~~---l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~--Lr~SI~~~~~~~~--~~~~~~~v~~~~- 72 (137) |+|..+ |+..++.|........+.+++..|..+++.++.++|+++|. |++||.....+.. .....+.||.+. T Consensus 1 M~v~v~~~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~~k~ 80 (125) T protein:vir:79 1 MGARIESNNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGYAKG 80 (125) T ss_pred CeeEeeHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEeccCCC Confidence 999654 45667777777777778899999999999999999997655 9999976543322 234456677654 Q ss_pred --cchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHhH-----hhccC Q lcl|NC_011044. 73 --DYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAADP-----DIHMT 137 (137) Q Consensus 73 --~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~~-----~i~~~ 137 (137) -||.|+||||- + ++|+||++||++++++..- .|+.. T Consensus 81 ~~~~a~F~E~GT~--------------------------k---~~a~pF~~~a~~~~~~ev~~~~~~~lrk~ 123 (125) T protein:vir:79 81 VSHRIHATEFGTM--------------------------Y---QKPQLFITKTEKQGKNKVLKTMLDTAKRL 123 (125) T ss_pred CceEEEeccCCcc--------------------------C---CCCCchhhHHHHHhHHHHHHHHHHHHHHH Confidence 48889999993 1 4699999999998766332 22222 No 70 >protein:vir:4704 Length: 125 # NCBI annotation: phi PVL ORF 11 homologue # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061636;genbank:gi:9635723;genbank:GeneID:1262995 Probab=99.28 E-value=1.8e-14 Score=95.88 Aligned_cols=108 Identities=6% Similarity=-0.041 Sum_probs=78.5 Q ss_pred Cceehh---hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchh--hhccceeeeeccc--CcEEEEEEecCc- Q lcl|NC_011044. 1 MPVTAR---IHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGN--LGRTVGELPQRYR--PFHVDGGVEATA- 72 (137) Q Consensus 1 msv~~~---l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~--Lr~SI~~~~~~~~--~~~~~~~v~~~~- 72 (137) |+|..+ |+..++.|........+.+++..|..+++.++.++|+++|. |++||.....+.. .....+.||.+. T Consensus 1 M~v~v~~~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~~k~ 80 (125) T protein:vir:47 1 MGARIESNNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGYAKG 80 (125) T ss_pred CeeEeeHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEeccCCC Confidence 999654 45667777777777778899999999999999999997655 9999976543322 234456677654 Q ss_pred --cchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHhH-----hhccC Q lcl|NC_011044. 73 --DYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAADP-----DIHMT 137 (137) Q Consensus 73 --~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~~-----~i~~~ 137 (137) -||.|+||||- + ++|+||++||++++++..- .|+.. T Consensus 81 ~~~~a~F~E~GT~--------------------------k---~~a~pF~~~a~~~~~~ev~~~~~~~lrk~ 123 (125) T protein:vir:47 81 VSHRIHATEFGTM--------------------------Y---QKPQLFITKTEKQGKNKVLKTMLDTAKRL 123 (125) T ss_pred CceEEEeccCCcc--------------------------C---CCCCchhhHHHHHhHHHHHHHHHHHHHHH Confidence 48889999993 1 4699999999998766332 22222 No 71 >protein:vir:81106 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429878;genbank:gi:156603931;genbank:GeneID:5525326 Probab=99.28 E-value=1.8e-14 Score=95.88 Aligned_cols=108 Identities=6% Similarity=-0.041 Sum_probs=78.5 Q ss_pred Cceehh---hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchh--hhccceeeeeccc--CcEEEEEEecCc- Q lcl|NC_011044. 1 MPVTAR---IHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGN--LGRTVGELPQRYR--PFHVDGGVEATA- 72 (137) Q Consensus 1 msv~~~---l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~--Lr~SI~~~~~~~~--~~~~~~~v~~~~- 72 (137) |+|..+ |+..++.|........+.+++..|..+++.++.++|+++|. |++||.....+.. .....+.||.+. T Consensus 1 M~v~v~~~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~~k~ 80 (125) T protein:vir:81 1 MGARIESNNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGYAKG 80 (125) T ss_pred CeeEeeHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEeccCCC Confidence 999654 45667777777777778899999999999999999997655 9999976543322 234456677654 Q ss_pred --cchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHhH-----hhccC Q lcl|NC_011044. 73 --DYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAADP-----DIHMT 137 (137) Q Consensus 73 --~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~~-----~i~~~ 137 (137) -||.|+||||- + ++|+||++||++++++..- .|+.. T Consensus 81 ~~~~a~F~E~GT~--------------------------k---~~a~pF~~~a~~~~~~ev~~~~~~~lrk~ 123 (125) T protein:vir:81 81 VSHRIHATEFGTM--------------------------Y---QKPQLFITKTEKQGKNKVLKTMLDTAKRL 123 (125) T ss_pred CceEEEeccCCcc--------------------------C---CCCCchhhHHHHHhHHHHHHHHHHHHHHH Confidence 48889999993 1 4699999999998766332 22222 No 72 >protein:vir:9414 Length: 125 # NCBI annotation: phi PVL orf 11-like protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803392;genbank:gi:29028704;genbank:GeneID:1258141 Probab=99.28 E-value=1.8e-14 Score=95.88 Aligned_cols=108 Identities=6% Similarity=-0.041 Sum_probs=78.5 Q ss_pred Cceehh---hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchh--hhccceeeeeccc--CcEEEEEEecCc- Q lcl|NC_011044. 1 MPVTAR---IHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGN--LGRTVGELPQRYR--PFHVDGGVEATA- 72 (137) Q Consensus 1 msv~~~---l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~--Lr~SI~~~~~~~~--~~~~~~~v~~~~- 72 (137) |+|..+ |+..++.|........+.+++..|..+++.++.++|+++|. |++||.....+.. .....+.||.+. T Consensus 1 M~v~v~~~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~~k~ 80 (125) T protein:vir:94 1 MGARIESNNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGYAKG 80 (125) T ss_pred CeeEeeHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEeccCCC Confidence 999654 45667777777777778899999999999999999997655 9999976543322 234456677654 Q ss_pred --cchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHhH-----hhccC Q lcl|NC_011044. 73 --DYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAADP-----DIHMT 137 (137) Q Consensus 73 --~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~~-----~i~~~ 137 (137) -||.|+||||- + ++|+||++||++++++..- .|+.. T Consensus 81 ~~~~a~F~E~GT~--------------------------k---~~a~pF~~~a~~~~~~ev~~~~~~~lrk~ 123 (125) T protein:vir:94 81 VSHRIHATEFGTM--------------------------Y---QKPQLFITKTEKQGKNKVLKTMLDTAKRL 123 (125) T ss_pred CceEEEeccCCcc--------------------------C---CCCCchhhHHHHHhHHHHHHHHHHHHHHH Confidence 48889999993 1 4699999999998766332 22222 No 73 >protein:vir:98342 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918934;genbank:gi:119443696;genbank:GeneID:4594504 Probab=99.28 E-value=1.8e-14 Score=95.88 Aligned_cols=108 Identities=6% Similarity=-0.041 Sum_probs=78.5 Q ss_pred Cceehh---hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchh--hhccceeeeeccc--CcEEEEEEecCc- Q lcl|NC_011044. 1 MPVTAR---IHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGN--LGRTVGELPQRYR--PFHVDGGVEATA- 72 (137) Q Consensus 1 msv~~~---l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~--Lr~SI~~~~~~~~--~~~~~~~v~~~~- 72 (137) |+|..+ |+..++.|........+.+++..|..+++.++.++|+++|. |++||.....+.. .....+.||.+. T Consensus 1 M~v~v~~~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~~k~ 80 (125) T protein:vir:98 1 MGARIESNNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGYAKG 80 (125) T ss_pred CeeEeeHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEeccCCC Confidence 999654 45667777777777778899999999999999999997655 9999976543322 234456677654 Q ss_pred --cchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHhH-----hhccC Q lcl|NC_011044. 73 --DYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAADP-----DIHMT 137 (137) Q Consensus 73 --~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~~-----~i~~~ 137 (137) -||.|+||||- + ++|+||++||++++++..- .|+.. T Consensus 81 ~~~~a~F~E~GT~--------------------------k---~~a~pF~~~a~~~~~~ev~~~~~~~lrk~ 123 (125) T protein:vir:98 81 VSHRIHATEFGTM--------------------------Y---QKPQLFITKTEKQGKNKVLKTMLDTAKRL 123 (125) T ss_pred CceEEEeccCCcc--------------------------C---CCCCchhhHHHHHhHHHHHHHHHHHHHHH Confidence 48889999993 1 4699999999998766332 22222 No 74 >protein:vir:102338 Length: 116 # NCBI annotation: hypothetical protein # Family: family:all:26573 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529563;genbank:gi:90592648;genbank:GeneID:3974470 Probab=99.22 E-value=2e-14 Score=95.66 Aligned_cols=102 Identities=13% Similarity=0.149 Sum_probs=72.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCc---cchhhhccceeeeecccCcEEEEEEecCccchhhhhcCCCC--Cc--cccc Q lcl|NC_011044. 18 SGAIFRGKHRSLTRRIATQARADVPV---RTGNLGRTVGELPQRYRPFHVDGGVEATADYAAAVHEGSRP--HR--IVAR 90 (137) Q Consensus 18 ~~~~~~~~~~~~a~~i~~~ak~~aPv---~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~vE~GT~p--h~--i~pk 90 (137) +.+.++++++++|..+.+.++.++|| |||+||+||....+... .+.|+++++||.|||||++. +. ..|. T Consensus 1 l~~~~~~~~~~~a~~l~~~vk~rTPv~~~d~G~LR~sW~~g~v~k~----~~~v~N~~eYA~~VE~GHRq~~g~g~~~~~ 76 (116) T protein:vir:10 1 MSKNLRRAKNNIGNKLLRKVKPKTPVAKIDGGTARKSWKYKELNLF----DGVVSNNVEYIHHLEYGHRTRQGTGTSENY 76 (116) T ss_pred CchHHHHHHHHHHHHHHHHHHhhCCCCcCCCcccccCceeeeeecc----CceeecCCcccccccCCceeeCCcceeccc Confidence 88888999999999999999999999 56999999987544332 24689999999999999863 32 2233 Q ss_pred cCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHhHhhccC Q lcl|NC_011044. 91 HAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAADPDIHMT 137 (137) Q Consensus 91 ~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~~~i~~~ 137 (137) .++.|.-.| .+.+-||..+.++...+.+.+=+. T Consensus 77 ~gkrlk~~~--------------V~G~fml~~s~~e~~~~~~~~~~~ 109 (116) T protein:vir:10 77 RPKPNGISF--------------VPGVFMLARSVDEMSSIIDDELNQ 109 (116) T ss_pred ccccccCCc--------------cCceehHHHHHHHHHHHHHHHHHH Confidence 333332211 225556777777776666655444 No 75 >protein:vir:966 Length: 123 # NCBI annotation: Orf48 # Family: family:all:970 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076620;genbank:gi:13095728;genbank:GeneID:920248 Probab=99.14 E-value=2.7e-13 Score=89.51 Aligned_cols=110 Identities=11% Similarity=0.141 Sum_probs=80.2 Q ss_pred Ccee-------hhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCcc Q lcl|NC_011044. 1 MPVT-------ARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATAD 73 (137) Q Consensus 1 msv~-------~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~ 73 (137) |+-+ ..|.+.|+.+.+.+...+++.+++++..+..+.+..+|++||.+++||...... .+..+.+.++.. T Consensus 1 m~~~v~id~L~~~i~~~L~~y~~~v~~~v~~~v~~~a~~~~~~lk~~sP~~TG~yaksW~~k~~~---~~~~~v~~~~~~ 77 (123) T protein:vir:96 1 MANKISIDDLAKTIESEVRNWTKDVVDDIDDIKKDITKNGVKQLRESSPKRTGDYAKNWTSQKLK---NGDQVIYQKAPT 77 (123) T ss_pred CCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCccccccccceeeeecC---CeeEEEEEecCC Confidence 5543 334566777777778889999999999999999999999999999999765432 234566666666 Q ss_pred c--hhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHH-HhHhhccC Q lcl|NC_011044. 74 Y--AAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAA-ADPDIHMT 137 (137) Q Consensus 74 Y--A~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~-~~~~i~~~ 137 (137) | ++.+|||+.- ..|+++ +|+||++||.+.... .+.+|+.- T Consensus 78 y~l~HLLE~GHa~--------------r~GGrV----------~a~phI~paee~~~~~l~~~i~r~ 120 (123) T protein:vir:96 78 YRLTHLLENGHAK--------------RNGGRV----------SPKVHIAPVEEELVSNYISRVEKR 120 (123) T ss_pred cceEEeeecceee--------------cCCcee----------CcchhhhHHHHHHHHHHHHHHHHH Confidence 6 7999999631 134332 599999999998654 44444444 No 76 >protein:vir:102154 Length: 119 # NCBI annotation: phage protein, HK97 gp10 family # Family: family:all:10671 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699937;genbank:gi:110804042;genbank:GeneID:4206698 Probab=99.07 E-value=1.8e-13 Score=90.46 Aligned_cols=106 Identities=11% Similarity=0.130 Sum_probs=79.5 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccchhhhhc Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAAAVHE 80 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~vE~ 80 (137) +.|..+|...+.++.......-+++|+++++.|+.+++.++|++||+|+. |...+ ..++...+|..-+..-|+.|.|| T Consensus 6 l~G~del~~~l~~~g~~~~~ie~kAlk~g~e~I~~~~~~n~P~~tg~lkk-ik~~~-kk~g~~~VG~~ks~~fy~kF~EF 83 (119) T protein:vir:10 6 IEGFEEFEKFISEDMVLDESTKRKGIKAGITKIGKAIEKNSPIKSGRLSK-VKIRV-KNTGLATEGTASSSEFYDIFQNF 83 (119) T ss_pred hhhHHHHHHHHHhhhhhhHHHHHHHHHHHhHHHHHHHhhcCCcccCCcce-eeeee-ecCceeEeccCCcchhhhhhccc Confidence 77888999999999988888999999999999999999999999999998 44443 33444333443445689999999 Q ss_pred CCCCCccccccCCcceeecCCeeEEeeeEecCCCCCC-chhhhhHHHHHH-HhHhhccC Q lcl|NC_011044. 81 GSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSR-PFLRNAAQRIAA-ADPDIHMT 137 (137) Q Consensus 81 GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~-pfl~~A~~~~~~-~~~~i~~~ 137 (137) ||.- +||| ||+.||++..++ +...|.+. T Consensus 84 GTSk-----------------------------m~a~~pF~~~a~~~~~~eA~~~~~~e 113 (119) T protein:vir:10 84 GTSE-----------------------------QKAHVGYFDRAVDETTNEAVEEVAEI 113 (119) T ss_pred cccc-----------------------------cCCCCCccccccccChHHHHHHHHHH Confidence 9931 4588 999999875433 22222222 No 77 >protein:vir:104347 Length: 145 # NCBI annotation: conserved phage-related protein # Family: family:all:448 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398975;genbank:gi:81343959;genbank:GeneID:3778879 Probab=98.98 E-value=4.6e-12 Score=82.74 Aligned_cols=108 Identities=13% Similarity=0.037 Sum_probs=80.3 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeec---------------------- Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQR---------------------- 58 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~---------------------- 58 (137) |.....|..++..+.+.+...+...+++++..+.+.....+|||||+||.||...... T Consensus 5 m~~~~sF~~~i~~~~~~ve~~~~~v~r~~a~~i~~~vv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~G~~t~~~~~~ 84 (145) T protein:vir:10 5 IGSVVTFEKSIADWIDRAEDGFGIVVSNTVIKTANAIVDLSPVDTGRFKANWQISANSPAQQSLNEYDQTGGQTKTYLAR 84 (145) T ss_pred ccchhccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccceeecccccccccccCCCCccchhhHHH Confidence 7777778888888999999999999999999999999999999999999999653211 Q ss_pred ------ccCcEEEEEEecCccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHH----H Q lcl|NC_011044. 59 ------YRPFHVDGGVEATADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRI----A 128 (137) Q Consensus 59 ------~~~~~~~~~v~~~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~----~ 128 (137) ....+-++.+.++++||.++|||+. +|.|..|.+.++++. + T Consensus 85 ~~~~i~~~k~g~~iyi~Nn~pYA~~LEyG~S-----------------------------~QAP~G~v~~~~~~~~~~v~ 135 (145) T protein:vir:10 85 QARAVANSKATSVIYITNRLDYAADLEYGAS-----------------------------NQAPAGVLGVVQARLGRYFQ 135 (145) T ss_pred HHHHhhcccccceEEEeeCchhhhHhhcccc-----------------------------CCCcchHHHHHHHHHHHHHH Confidence 0112334678899999999999962 466777777776553 2 Q ss_pred HHhHhhccC Q lcl|NC_011044. 129 AADPDIHMT 137 (137) Q Consensus 129 ~~~~~i~~~ 137 (137) .+..+++.. T Consensus 136 ~~~~e~k~~ 144 (145) T protein:vir:10 136 EAVEEARRA 144 (145) T ss_pred HHHHHhhcc Confidence 233333333 No 78 >protein:vir:94994 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224022;genbank:gi:62327309;genbank:GeneID:5176822 Probab=98.94 E-value=6.2e-12 Score=82.04 Aligned_cols=102 Identities=12% Similarity=0.090 Sum_probs=73.8 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeec---------------------- Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQR---------------------- 58 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~---------------------- 58 (137) ||-+.+++. +.+.+...+...+++++.++.++....+|||||+||.||...... T Consensus 1 msF~~~i~~----~~~~ve~~~~~~~r~~a~~~~~~iv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~g~~t~~~~~~ 76 (131) T protein:vir:94 1 MSFALDVTR----FVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGSTPADGTTDATDKSGNTATGNATS 76 (131) T ss_pred CCcccCHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCchhhhhccchhccccccccccCCCCCCchhhHHHHHH Confidence 887655555 666666666677777777788888889999999999999543210 Q ss_pred ---ccCcEEEEEEecCccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHH----HHHHh Q lcl|NC_011044. 59 ---YRPFHVDGGVEATADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQR----IAAAD 131 (137) Q Consensus 59 ---~~~~~~~~~v~~~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~----~~~~~ 131 (137) ....+-++.+.++++||.++|||+. +|.|..|++.++++ +..+. T Consensus 77 ~i~~~~~g~~iyi~Nn~pYA~~LEyG~S-----------------------------~QAP~g~v~~~~~~~~~~v~~~~ 127 (131) T protein:vir:94 77 FVLNAADWHTFTLTNNLPYAQRLEYGWS-----------------------------QQAPQGFVRVNVSRFQQLLNEEA 127 (131) T ss_pred HHhhccccceEEEeeCchhhhhhhcccc-----------------------------CCCcchHHHHHHHHHHHHHHHHH Confidence 1123456789999999999999962 57788888888765 34455 Q ss_pred Hhhc Q lcl|NC_011044. 132 PDIH 135 (137) Q Consensus 132 ~~i~ 135 (137) .++| T Consensus 128 ~e~k 131 (131) T protein:vir:94 128 SKVK 131 (131) T ss_pred HhcC Confidence 5555 No 79 >protein:vir:78380 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110844;genbank:gi:134288605;genbank:GeneID:5179643 Probab=98.92 E-value=1.1e-11 Score=80.69 Aligned_cols=102 Identities=12% Similarity=0.088 Sum_probs=73.2 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeec---------------------- Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQR---------------------- 58 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~---------------------- 58 (137) ||-+..++. +.+.+...+...+++++.++.++....+|||||+||.||...... T Consensus 1 msf~~~i~~----~~~~ve~~~~~~~r~~a~~~~~~iv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~g~~t~~~~~~ 76 (131) T protein:vir:78 1 MSFALDVSK----FVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNAAN 76 (131) T ss_pred CCcCcCHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCchhhhccccceecccccccccCCCCCCchhhHHHHHH Confidence 887666555 555566666666677777777777889999999999999643210 Q ss_pred ---ccCcEEEEEEecCccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHH----HHHHh Q lcl|NC_011044. 59 ---YRPFHVDGGVEATADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQR----IAAAD 131 (137) Q Consensus 59 ---~~~~~~~~~v~~~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~----~~~~~ 131 (137) ....+-++.+.++++||.++|||+. +|.|..|++.++++ +..+. T Consensus 77 ~i~~~~~g~~iyi~Nn~pYA~~LEyG~S-----------------------------~QAP~G~v~~~~~~~~~~v~~~~ 127 (131) T protein:vir:78 77 FVLNAADWHTFTLTNNLPYAQRLEYGWS-----------------------------QQAPQGFVRVNVSRFQQLLNEEA 127 (131) T ss_pred HHhhccCCceEEEeeCchhhhHhhcccc-----------------------------CCCcchHHHHHHHHHHHHHHHHH Confidence 1122456789999999999999973 57788888888876 34455 Q ss_pred Hhhc Q lcl|NC_011044. 132 PDIH 135 (137) Q Consensus 132 ~~i~ 135 (137) .++| T Consensus 128 ~e~k 131 (131) T protein:vir:78 128 SKVK 131 (131) T ss_pred HhcC Confidence 5555 No 80 >protein:vir:107703 Length: 147 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003902;genbank:gi:45686318;genbank:GeneID:2773043 Probab=98.92 E-value=1.2e-11 Score=80.50 Aligned_cols=108 Identities=14% Similarity=0.037 Sum_probs=80.2 Q ss_pred Cce--ehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeec-------------------- Q lcl|NC_011044. 1 MPV--TARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQR-------------------- 58 (137) Q Consensus 1 msv--~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~-------------------- 58 (137) |-- ...|..++..+.+.+...+...+++++..+.+.....+|||||.||.||...... T Consensus 1 ma~~~~~~F~~~i~~~~~~ve~~~~~~~r~~a~~i~~~vv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~dp~g~~t~a~~ 80 (147) T protein:vir:10 1 MANYQIRRFQGEIDAWINAAESTLEHAIEIFVRDVHDALVSRSPVDTGRFKGNWQITFNEIPNHALNRYDKTGGVVRGEE 80 (147) T ss_pred CCCcchhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhhccccceeecCccccccCCcCCCccchhhhh Confidence 655 2368888888999999999999999999999999999999999999999642111 Q ss_pred ---------ccCcEEEEEEecCccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHH-- Q lcl|NC_011044. 59 ---------YRPFHVDGGVEATADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRI-- 127 (137) Q Consensus 59 ---------~~~~~~~~~v~~~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~-- 127 (137) ..+.+-++.+.++++||.++|||+. ++.|..|.+-++++. T Consensus 81 ~~~~~~~~~~~~~~~~iyi~Nn~pYA~~LEyG~S-----------------------------~QAP~G~V~~t~q~~~~ 131 (147) T protein:vir:10 81 QAKTYGMFSRGGAITSVHFSNMLIYANALEYGHS-----------------------------QQAPSGVVGLVALRLRS 131 (147) T ss_pred hHHHHHHhhhccCcceEEEeeCcchhhhhhcccc-----------------------------CCCCchHHHHHHHHHHH Confidence 0123447889999999999999973 466777777777542 Q ss_pred --HHHhHhhc---cC Q lcl|NC_011044. 128 --AAADPDIH---MT 137 (137) Q Consensus 128 --~~~~~~i~---~~ 137 (137) .....+.+ |. T Consensus 132 ~v~~~~~e~k~~~~~ 146 (147) T protein:vir:10 132 YMADAIKQARRQQNA 146 (147) T ss_pred HHHHHHHHHHhhhcc Confidence 23333333 33 No 81 >protein:vir:80425 Length: 134 # NCBI annotation: BcepGomrgp15 # Family: family:all:448 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210235;genbank:gi:146329927;genbank:GeneID:5123534 Probab=98.92 E-value=1.3e-11 Score=80.28 Aligned_cols=104 Identities=15% Similarity=0.189 Sum_probs=75.9 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeec---------------------- Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQR---------------------- 58 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~---------------------- 58 (137) ||-+++++. +.+.++..+...+++++..+.+.....+|||||+||.||...... T Consensus 1 msF~~~i~~----~~~~ve~~~~~~~r~~a~~~~~~vv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~g~~~~~~~~~ 76 (134) T protein:vir:80 1 MSYTDRFNV----IAKGIEDNVDNLVKNVALAIGSNVIADTPILTGQARRNWQTELNQMPESVLDIPESPSEGMDEALQV 76 (134) T ss_pred CCcccCHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhhhcccceeecCcccccccCcCCCCccchhhHHH Confidence 888666655 566666666667777777777778889999999999999543211 Q ss_pred ------ccCcEEEEEEecCccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHhH Q lcl|NC_011044. 59 ------YRPFHVDGGVEATADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAADP 132 (137) Q Consensus 59 ------~~~~~~~~~v~~~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~~ 132 (137) ....+-++.+.++++||.++|||+. ++.|..|.+-++++.-.... T Consensus 77 ~~~vi~~~k~g~~iyi~Nn~pYA~~LEyG~S-----------------------------~QAP~G~v~~t~~~~~~~v~ 127 (134) T protein:vir:80 77 LQQTVGQYKAGDTVHITNNAPYIKELNSGSS-----------------------------QQAPANFVETSIMRATRLIR 127 (134) T ss_pred HHHHHhhccCcceEEEeeCchhhhhhhcccc-----------------------------CCCcchHHHHHHHHHHHHHH Confidence 0012245778999999999999963 47788888888877666666 Q ss_pred hhccC Q lcl|NC_011044. 133 DIHMT 137 (137) Q Consensus 133 ~i~~~ 137 (137) +.+.. T Consensus 128 ~~~~~ 132 (134) T protein:vir:80 128 NVKVV 132 (134) T ss_pred hhccC Confidence 66665 No 82 >protein:vir:103280 Length: 142 # NCBI annotation: phage-related hypothetical protein # Family: family:all:448 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277459;genbank:gi:71834102;genbank:GeneID:3562391 Probab=98.91 E-value=1.1e-11 Score=80.71 Aligned_cols=108 Identities=13% Similarity=0.008 Sum_probs=78.3 Q ss_pred Ccee-hhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeec--------------------- Q lcl|NC_011044. 1 MPVT-ARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQR--------------------- 58 (137) Q Consensus 1 msv~-~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~--------------------- 58 (137) |-.+ ..|..++..+.+.+...+...+++++..+.++....+|||||+||.||...... T Consensus 1 Ma~~~~sf~~~i~~~~~~ve~~~~~v~r~~a~~i~~~vv~~sPVdTGr~R~nw~vs~~~~~~~~~~~~d~~G~~t~~~~~ 80 (142) T protein:vir:10 1 MANDVVSFRNSINAWIDGVTEGVELIVEGTLTKATKDIVKLSPVDTGRFRGNWQATGNSPAAQSLNNYDPDGNETRNSLR 80 (142) T ss_pred CccchhhhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCcccchhhcccceeeecCcccccccCcCCCCccchhhHH Confidence 7653 457778888888888888889999999999999999999999999999642111 Q ss_pred -------ccCcEEEEEEecCccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHH---- Q lcl|NC_011044. 59 -------YRPFHVDGGVEATADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRI---- 127 (137) Q Consensus 59 -------~~~~~~~~~v~~~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~---- 127 (137) ....+-++.+.++++||.++|||+. |+.|..|++.++++. T Consensus 81 ~~~~~i~~~~~g~~iyi~Nn~pYA~~LEyG~S-----------------------------~QAP~G~v~~a~q~~~~~v 131 (142) T protein:vir:10 81 RQIYALARDANTNVIYISNRLDYAQGLEFGSS-----------------------------NQAPSGVLGVVQKRLGRYF 131 (142) T ss_pred HHHHHhhhccccceEEEeeCcchhhhhhcccc-----------------------------CCCcchHHHHHHHHHHHHH Confidence 0113456788999999999999973 366777777776542 Q ss_pred HHHhHhhccC Q lcl|NC_011044. 128 AAADPDIHMT 137 (137) Q Consensus 128 ~~~~~~i~~~ 137 (137) ..+..++|-- T Consensus 132 ~~a~~e~~~~ 141 (142) T protein:vir:10 132 AEAVQEAKRA 141 (142) T ss_pred HHHHHHhhcc Confidence 2223333332 No 83 >protein:vir:95157 Length: 144 # NCBI annotation: hypothetical protein ORF019 # Family: family:all:448 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293426;genbank:gi:148912847;genbank:GeneID:5228232 Probab=98.77 E-value=1e-10 Score=75.37 Aligned_cols=108 Identities=8% Similarity=-0.032 Sum_probs=81.9 Q ss_pred Ccee-hhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecc-------------------- Q lcl|NC_011044. 1 MPVT-ARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRY-------------------- 59 (137) Q Consensus 1 msv~-~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~-------------------- 59 (137) |.-+ ..|...+..+.+.++..+...++++|..+.......+|||||++|.||....... T Consensus 1 MA~~~~~f~~~i~~~~~~ve~~~~~~~r~~a~~v~~~vv~~sPVDTGrfRanw~vs~~~p~~~~~~~~~~~~~~~t~d~s 80 (144) T protein:vir:95 1 MAKSLLDLADRLEKKAKAIDEAASQNAVDTALAIVGDLAYKTPVDTSQALSNWIVTLESPSGQQIKPHFPGSQGSTQRAS 80 (144) T ss_pred CchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccceeccccccccccccccccccccCCCc Confidence 6642 3567777778888999999999999999999999999999999999996543210 Q ss_pred --------------cCcEEEEEEecCccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHH Q lcl|NC_011044. 60 --------------RPFHVDGGVEATADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQ 125 (137) Q Consensus 60 --------------~~~~~~~~v~~~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~ 125 (137) ...+-++.+.++++||.++|||+. +|.|..|++.++. T Consensus 81 g~~tl~~~~~vi~~~~~g~~iyi~NnlpYA~~LEyG~S-----------------------------~QAP~G~vr~~~q 131 (144) T protein:vir:95 81 AAETLNSAKLVLRNKKPGQAIFITNNLPYIRRLNDGYS-----------------------------AQAPAGFVERAVL 131 (144) T ss_pred hhHHHHHHHHHHhhcCccceEEEeeCchhhhhhhcccc-----------------------------CCCcchHHHHHHH Confidence 012356778999999999999963 5778888888877 Q ss_pred HHHHHhHhhccC Q lcl|NC_011044. 126 RIAAADPDIHMT 137 (137) Q Consensus 126 ~~~~~~~~i~~~ 137 (137) +.-.-.++.+-. T Consensus 132 ~~~~~v~~~~~~ 143 (144) T protein:vir:95 132 IGRKMRKKFKIK 143 (144) T ss_pred HHHHHHHhhccC Confidence 665544444444 No 84 >protein:vir:79638 Length: 146 # NCBI annotation: gp40 # Family: family:all:448 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285529;genbank:gi:148734512;genbank:GeneID:5219996 Probab=98.76 E-value=1.1e-10 Score=75.26 Aligned_cols=108 Identities=14% Similarity=0.042 Sum_probs=80.4 Q ss_pred Cce-e-hhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeec-----------c-------- Q lcl|NC_011044. 1 MPV-T-ARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQR-----------Y-------- 59 (137) Q Consensus 1 msv-~-~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~-----------~-------- 59 (137) |-- + .+|..++..+.+.++..+...+++++..+.+.....+|||||.||.||...... . T Consensus 1 ma~~~~~sFa~~i~~~~~~ve~~~~~~~r~~a~~i~~~vv~~sPVDTGr~Ranw~vs~~~~~~~~~~~~dp~G~~t~~~~ 80 (146) T protein:vir:79 1 MADYSIREFHGNVDKWIEQVESGLNDVIQIFGEKVHGALVDIAPVDTGRFKANMQITANKPPLYALNQYDPDGEKIKAEG 80 (146) T ss_pred CCcchhHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhhccccceeecCcccccccCCCCCCcccHHHH Confidence 766 2 478899999999999999999999999999999999999999999999643211 0 Q ss_pred ----------cCcEEEEEEecCccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHH- Q lcl|NC_011044. 60 ----------RPFHVDGGVEATADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIA- 128 (137) Q Consensus 60 ----------~~~~~~~~v~~~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~- 128 (137) ...+-++.+.++++||.++|||+ .++.|..|.+.++++.. T Consensus 81 ~~~i~~~~~g~~~~~~iyi~NnlpYA~~LEyG~-----------------------------S~QAP~G~v~~~~~~~~~ 131 (146) T protein:vir:79 81 RRTLYALLHGGGAIKSIYFSNMLIYANALEYGH-----------------------------SKQAPAGVFGIVAIRLRS 131 (146) T ss_pred HHHHHHHHhcccccceeEEeeCchhhhhhhccc-----------------------------cCCCcchHHHHHHHHHHH Confidence 01124678889999999999996 25677777877776532 Q ss_pred ---HHhHhhccC Q lcl|NC_011044. 129 ---AADPDIHMT 137 (137) Q Consensus 129 ---~~~~~i~~~ 137 (137) ....+++-- T Consensus 132 ~v~~a~~e~k~~ 143 (146) T protein:vir:79 132 YMAEAIREARKK 143 (146) T ss_pred HHHHHHHHHHhh Confidence 222222222 No 85 >protein:vir:3163 Length: 145 # NCBI annotation: unknown # Family: family:all:28417 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665934;genbank:gi:22091120;genbank:GeneID:951270 Probab=98.76 E-value=2.2e-11 Score=79.02 Aligned_cols=105 Identities=12% Similarity=0.060 Sum_probs=60.6 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh------------C---------------Cccchhhhccce Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSLTRRIATQARAD------------V---------------PVRTGNLGRTVG 53 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~------------a---------------Pv~TG~Lr~SI~ 53 (137) +.....++..++++.+ .+...+.+++..+.+.+..+ . -++||.|++||. T Consensus 2 i~~~~~i~~~l~~l~~----~~~~~l~~i~~~~~~~~~~rf~~~~~p~G~~W~pLs~st~a~k~~~~~L~~tG~L~~Si~ 77 (145) T protein:vir:31 2 VEDENNIPEAREAIQD----GLTDGLERLHTITLRELITNMSDGQDALGNPWEPLKESTIRAKGSDTPLIDNSRLLTDIN 77 (145) T ss_pred cccHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccChHHHHHhcCCCCCccCHHHHHHHH Confidence 3333334444444444 34444444444443332222 1 237999999998 Q ss_pred eeeecccCcEEEEEEecCccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHH-HhH Q lcl|NC_011044. 54 ELPQRYRPFHVDGGVEATADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAA-ADP 132 (137) Q Consensus 54 ~~~~~~~~~~~~~~v~~~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~-~~~ 132 (137) ....... ....+.||++..||.+++||++ +++ +||+|||-++.++... .++ T Consensus 78 ~~~~~~~-~~~~a~vGtn~~YA~~hqfG~~------------------------~~~---IPaRPfLG~~~~~~~~~~~~ 129 (145) T protein:vir:31 78 AASMMDR-ANRMAVIGTNLDYAEHHEFGAP------------------------EAG---IPARPIFGPAGAYASQQAPD 129 (145) T ss_pred HHhhhcc-cCceeEecCCchhhhhhccCCc------------------------ccc---cCCCCccCCCccchHHHHHH Confidence 7654332 2345789999999999999973 243 5599999988654332 222 Q ss_pred hhccC Q lcl|NC_011044. 133 DIHMT 137 (137) Q Consensus 133 ~i~~~ 137 (137) -|..+ T Consensus 130 ii~~~ 134 (145) T protein:vir:31 130 VIGDE 134 (145) T ss_pred HHHHH Confidence 33333 No 86 >protein:vir:97190 Length: 148 # NCBI annotation: hypothetical protein ORF030 # Family: family:all:448 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294538;genbank:gi:149408259;genbank:GeneID:5237055 Probab=98.73 E-value=1.2e-10 Score=75.01 Aligned_cols=108 Identities=11% Similarity=0.063 Sum_probs=81.3 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecc--------------------- Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRY--------------------- 59 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~--------------------- 59 (137) |--...|..++..+.+.++..+...+++++..+.+.....+|||||+||.||....... T Consensus 1 m~~~~sFa~~i~~~~~~ve~~~~~~~r~~a~~i~~~vv~~sPVdTGrfRanw~vs~~~p~~~~~~~~dp~~~G~~~~~~~ 80 (148) T protein:vir:97 1 MPSLSEFSRRITLRGRKVAEGADALTRKVALAADQAVVSGTPVDTGRARSNWIAAIGSAPSSVIDAYSPGEAGSTEAANT 80 (148) T ss_pred CCccchhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhhhhhhheeecccccccccccCCCCCCcccccch Confidence 77777788888889999999999999999999999999999999999999996431110 Q ss_pred -------------cCcEEEEEEecCccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHH Q lcl|NC_011044. 60 -------------RPFHVDGGVEATADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQR 126 (137) Q Consensus 60 -------------~~~~~~~~v~~~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~ 126 (137) -..+-++.+.++++||..+|||+. +|.|..|++.++.+ T Consensus 81 ~~~i~~~~~vi~~~k~g~~iyi~NnlpYA~~LEyG~S-----------------------------~QAP~G~v~~t~~~ 131 (148) T protein:vir:97 81 QAAIDQAESVIRGYNYGEEIHITNNLPYIQRLNDGYS-----------------------------AQAPANFVEQAVLE 131 (148) T ss_pred hHHHHHHHHHhhccCCCceEEEeecchhhhHhhcccc-----------------------------CCCcchHHHHHHHH Confidence 011236788999999999999963 46677777777765 Q ss_pred HHH---HhHhhccC Q lcl|NC_011044. 127 IAA---ADPDIHMT 137 (137) Q Consensus 127 ~~~---~~~~i~~~ 137 (137) ... +.+-+++- T Consensus 132 ~~~~v~~~~~~~~~ 145 (148) T protein:vir:97 132 AVQVVQFGRVVDGD 145 (148) T ss_pred HHHHHHhhhhhcCC Confidence 433 23334444 No 87 >protein:vir:81067 Length: 119 # NCBI annotation: p12 # Family: family:all:2714 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285682;genbank:gi:156535145;genbank:GeneID:5247112 Probab=98.72 E-value=1.2e-11 Score=80.37 Aligned_cols=97 Identities=23% Similarity=0.123 Sum_probs=57.9 Q ss_pred HHHHHHHhCCccchhhhccceee--eecccCcEEEEEEec---CccchhhhhcCCCCCccccccCCcceeecCCeeEEee Q lcl|NC_011044. 33 IATQARADVPVRTGNLGRTVGEL--PQRYRPFHVDGGVEA---TADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRK 107 (137) Q Consensus 33 i~~~ak~~aPv~TG~Lr~SI~~~--~~~~~~~~~~~~v~~---~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k 107 (137) +.++++..+|++||+|++||-.. ..++.+...+..|+- .++|++.+|||+ +-. +.++-...|.|+-.+ T Consensus 1 ~rDeakarv~~~~G~Lr~sIY~ay~~~~S~dG~~~Y~Vswn~rkAPhghlvE~Gh---w~~----~~~~~~~dG~w~~~~ 73 (119) T protein:vir:81 1 MRESAKAFVNDETGKLRSNLYVAYSPEESTNGVQTYAVSWRKKAAPHGHLLEFGH---WQT----HAAYKGKDGEWYSSS 73 (119) T ss_pred CCcccccccCCCccchhhhheeeeccccCCCCeEEEEeeccCCcCCcccccccce---eee----eeeeeccCceeeecC Confidence 88999999999999999999432 223333345556643 468999999995 211 112112234444222 Q ss_pred eEec---CCCCCCchhhhhHHHHHHHhHhh-ccC Q lcl|NC_011044. 108 SVWH---PGVRSRPFLRNAAQRIAAADPDI-HMT 137 (137) Q Consensus 108 ~V~~---pG~~a~pfl~~A~~~~~~~~~~i-~~~ 137 (137) +.. -=+||+|||+||+|-.+++.+.| +.. T Consensus 74 -~~l~~~~~vPa~pFlRpA~da~~~~a~~~~~~r 106 (119) T protein:vir:81 74 -VKLVNPKWIPARPFLRPGYDSVAMQIPDIAKAA 106 (119) T ss_pred -ccccCceecCCCCccchhHHHHHHHHHHHHHHH Confidence 111 12669999999999544432222 222 No 88 >protein:vir:10367 Length: 119 # NCBI annotation: conserved phage protein # Family: family:all:2714 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858959;genbank:gi:32128424;genbank:GeneID:2648366 Probab=98.72 E-value=1.2e-11 Score=80.40 Aligned_cols=98 Identities=21% Similarity=0.094 Sum_probs=58.1 Q ss_pred HHHHHHHhCCccchhhhccceee--eecccCcEEEEEEec---CccchhhhhcCCCCCccccccCCcceeecCCeeEEee Q lcl|NC_011044. 33 IATQARADVPVRTGNLGRTVGEL--PQRYRPFHVDGGVEA---TADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRK 107 (137) Q Consensus 33 i~~~ak~~aPv~TG~Lr~SI~~~--~~~~~~~~~~~~v~~---~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k 107 (137) +.++++..+|++||+|++||-.. ..++.+...+..|+- .++|++.+|||+ +- .+.++-...|.|+-.+ T Consensus 1 ~rDeakarv~~~~G~Lr~sIY~ay~~~~S~dG~~~Y~Vswn~rkAPhghlvE~Gh---w~----~~~~~~~~dG~w~~~~ 73 (119) T protein:vir:10 1 MRESAKAFVNDETGKLRSNLYVAYSTEESTNGVQTYAVSWRKKAAPHGHLLEFGH---WQ----THAAYKGKDGEWYSSS 73 (119) T ss_pred CCcccccccCCCccchhhhheeeeccccCCCCEEEEEeecCCCcCCcccccccce---ee----eeeeeeccCceeeecC Confidence 88999999999999999999432 223333345556643 468999999995 21 1112212234444221 Q ss_pred --eEecCCCCCCchhhhhHHHHHHHhHhh-ccC Q lcl|NC_011044. 108 --SVWHPGVRSRPFLRNAAQRIAAADPDI-HMT 137 (137) Q Consensus 108 --~V~~pG~~a~pfl~~A~~~~~~~~~~i-~~~ 137 (137) ..+.-=+||+|||+||+|-.+++.+.| +.. T Consensus 74 ~~l~~~~~vPa~pFlRpA~da~~~~a~~~~~~r 106 (119) T protein:vir:10 74 VKLVNPKWIPARPFLRPGYDSVAMQIPDIAKAA 106 (119) T ss_pred ccccCceecCCCCccchhHHHHHHHHHHHHHHH Confidence 001112669999999999554433222 222 No 89 >protein:vir:94944 Length: 121 # NCBI annotation: hypothetical protein phage protein # Family: family:all:448 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239282;genbank:gi:66392064;genbank:GeneID:5076589 Probab=98.71 E-value=6.7e-11 Score=76.37 Aligned_cols=97 Identities=14% Similarity=0.137 Sum_probs=68.3 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecc--------------------- Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRY--------------------- 59 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~--------------------- 59 (137) |+- +|..++..+.+.+...++..+++++..+.+.....+|||||.+|.||....... T Consensus 2 ~~~--sf~~~i~~~~~~ve~~~~~~~r~~~~~~~~~vv~~sPVdtGrfRanw~vs~~~p~~~~~~~~dp~g~~t~~~~~~ 79 (121) T protein:vir:94 2 ISM--KFNVNLSRLRSNLREEAKKKAIRIAQEIVNGVIARSPVLAGDYRSSWNVSEGSMEFKFNNGGNPANPTPAPAIVV 79 (121) T ss_pred ccc--hhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCchhhhhccccccccCcccccCCCCCCCcchhHHHHHH Confidence 222 345555556666666666777777888888888999999999999996532110 Q ss_pred --cCcEEEEEEecCccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHH Q lcl|NC_011044. 60 --RPFHVDGGVEATADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIA 128 (137) Q Consensus 60 --~~~~~~~~v~~~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~ 128 (137) ...+-++.+.++++||.++|||+. +|.|..|++.++.+.. T Consensus 80 ~~~~~~~~iyi~NnlpYA~~LE~G~S-----------------------------~QAP~G~v~~t~~~~q 121 (121) T protein:vir:94 80 SSNVALPHFYITNGAPYAQQLEKGSS-----------------------------TQAPLGIVRVTLASLR 121 (121) T ss_pred HHhhccceEEEeeCcchhhhhhcccC-----------------------------CCCcchHHHHHHHhhC Confidence 111345688999999999999962 5667777777776555 No 90 >protein:vir:99833 Length: 190 # NCBI annotation: hypothetical protein # Family: family:all:274 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164071;genbank:gi:56692603;genbank:GeneID:3192561 Probab=98.65 E-value=2e-10 Score=73.81 Aligned_cols=125 Identities=17% Similarity=0.088 Sum_probs=68.7 Q ss_pred Cceehhhh-----hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-----C----C--------------------ccch Q lcl|NC_011044. 1 MPVTARIH-----INEPELERQSGAIFRGKHRSLTRRIATQARAD-----V----P--------------------VRTG 46 (137) Q Consensus 1 msv~~~l~-----~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~-----a----P--------------------v~TG 46 (137) |.++.++| ..|..|...+. ..+..++++++.+.+..+.+ . | .+|| T Consensus 2 ~~i~i~~d~~~~~~~L~~l~~~~~-~~~~l~~~ig~~l~~~~~~rf~~~~~PdG~~W~p~~~~t~~rk~~~~~~~L~~tg 80 (190) T protein:vir:99 2 AGITLEWDGRRALDVLNAGSAALG-DPSGLLQDIGELLLNIHRRRFQAQVSPDGTPWQPLSPAYLRRKRKNRDKILTLDG 80 (190) T ss_pred ceeEEEecHHHHHHHHHHHHHHhh-hHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCccccHHHHHHhhcCCCccceecH Confidence 44444544 33444444332 34567778887777655444 1 2 2579 Q ss_pred hhhccceeeeecccCcEEEEEEecCccchhhhhcCCCCCccccccCCccee-------ecCCee--------------EE Q lcl|NC_011044. 47 NLGRTVGELPQRYRPFHVDGGVEATADYAAAVHEGSRPHRIVARHAQALHF-------FWHGRE--------------IF 105 (137) Q Consensus 47 ~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~vE~GT~ph~i~pk~~k~l~~-------~~~g~~--------------~~ 105 (137) .|++||......+ .+.||+++.||.+++||.. |.+...+.+.+ -..++. +. T Consensus 81 ~L~~Si~~~~~~~-----~v~vGtn~~yA~iHq~Gg~---i~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~ 152 (190) T protein:vir:99 81 HLRNLLRYQLDGS-----ELLFGSDRPYAAIHHFGGT---IQRQARSSTVYFRQNERTGEVGREFVPRRRSNFAQDVQIG 152 (190) T ss_pred HHHHHHhheecCc-----EEEEecCcchhhhhhcCCc---ccccccchhhhhhhhhhhhhhhcccccccccccchhcccc Confidence 9999998665332 4678999999999999963 33322222211 011110 11 Q ss_pred eeeEecCCCCCCchhhhhHHHHHHHhHhhccC Q lcl|NC_011044. 106 RKSVWHPGVRSRPFLRNAAQRIAAADPDIHMT 137 (137) Q Consensus 106 ~k~V~~pG~~a~pfl~~A~~~~~~~~~~i~~~ 137 (137) ...|+ +|++|||.-.-++...-++.|.+- T Consensus 153 ~~~v~---IPaRpfLG~s~~d~~~I~~~i~~~ 181 (190) T protein:vir:99 153 PYTIQ---MPARPWLGTSSQDDDTILQRVERY 181 (190) T ss_pred cceee---ecCcccCCCCHHHHHHHHHHHHHH Confidence 11233 469999977655544333333333 No 91 >protein:vir:80116 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425608;genbank:gi:155042941;genbank:GeneID:5469542 Probab=98.55 E-value=7.1e-10 Score=70.75 Aligned_cols=108 Identities=12% Similarity=0.136 Sum_probs=76.2 Q ss_pred Cce------ehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HhCCccchhhhccceeeeecccCcEEEEEEec Q lcl|NC_011044. 1 MPV------TARIHINEPELERQSGAIFRGKHRSLTRRIATQAR----ADVPVRTGNLGRTVGELPQRYRPFHVDGGVEA 70 (137) Q Consensus 1 msv------~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak----~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~ 70 (137) |+- ..+|...|+...+.+...+++.+.+++..+.++.+ ..+|++||.+.+||.......+ ..|.+ T Consensus 1 M~~i~id~La~~I~~~L~~y~~~v~~~v~~~v~evak~a~~~lkk~i~~tsPkrTG~YaK~W~~k~~~~~-----~~v~n 75 (127) T protein:vir:80 1 MANIKIDRLGDEITRQLKRYSQVIAGDLEQIMDDVSKEAVDRLKAKIEEEGLVQTGDYKRGWTRKRTPGG-----WVIHN 75 (127) T ss_pred CccccHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCccccccccccceeeeccCc-----eeEee Confidence 443 34567777777888888888888666666665555 6899999999999986554332 35777 Q ss_pred Cccc--hhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHH-HHHhHhhccC Q lcl|NC_011044. 71 TADY--AAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRI-AAADPDIHMT 137 (137) Q Consensus 71 ~~~Y--A~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~-~~~~~~i~~~ 137 (137) ...| ++.+|||+.- + .|+ + .+|+|+++||.+.. ...+++|+.- T Consensus 76 k~~yqLtHLLE~GHAk------r--------~GG-----R-----V~a~pHI~paee~~~~~l~~~i~~~ 121 (127) T protein:vir:80 76 KTEYRLAHLLEYGHAT------V--------DGG-----R-----VPETPHIRPVEDWLEKEFEDRVERA 121 (127) T ss_pred cCCcceeehhhcceec------c--------CCc-----c-----cCCccchhhHHHHHHHHHHHHHHHH Confidence 7788 9999999731 1 122 1 34899999999984 4456666655 No 92 >protein:vir:95372 Length: 124 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764480;genbank:gi:115334634;genbank:GeneID:5179259 Probab=98.54 E-value=6.8e-10 Score=70.84 Aligned_cols=108 Identities=13% Similarity=0.119 Sum_probs=73.7 Q ss_pred Cce------ehhhhhhHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHhCCccchhhhccceeeeecccCcEEEEEEec Q lcl|NC_011044. 1 MPV------TARIHINEPELERQSGAIFRGKHRSLTRRIA----TQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEA 70 (137) Q Consensus 1 msv------~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~----~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~ 70 (137) |+- .++|...|+...+.+...+++.+++++.... ..++..+|++||...+||.......+ .+|.+ T Consensus 1 M~~i~id~La~~I~~~L~~Ys~~v~~~v~~~v~~vak~a~~~lkk~i~~tspkrTG~YaK~W~~kk~~e~-----~~V~n 75 (124) T protein:vir:95 1 MAKIKIGRLADEITSQLRKYSQVIADDVEQIMDDVTKEAVGRLKSKIQEVGLVQTGDYMRGWTRKRVPNG-----WVIHN 75 (124) T ss_pred CccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHhcCcccccchhccceeeeecCc-----eeEEE Confidence 443 3556677777777777777777755555554 45556899999999999987665432 25777 Q ss_pred Cccc--hhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHH-HhHhhccC Q lcl|NC_011044. 71 TADY--AAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAA-ADPDIHMT 137 (137) Q Consensus 71 ~~~Y--A~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~-~~~~i~~~ 137 (137) ..+| ++.+|||+.- -+|+ + .+++|+++||.+.... .+++|+.- T Consensus 76 k~~yqLtHLLE~GHAk--------------r~GG-----R-----V~a~pHI~paee~~~~~l~~~i~~~ 121 (124) T protein:vir:95 76 KTEYRLAHLLEYGHAT--------------VDGG-----R-----VPGTPHIRPIEDWLEKEFEDRVEKA 121 (124) T ss_pred cCCCceeeeeecceec--------------cCCc-----c-----cCCccchhHHHHHHHHHHHHHHHHH Confidence 7788 9999999731 1122 1 3499999999988654 44444443 No 93 >protein:vir:7449 Length: 123 # NCBI annotation: gp26 # Family: family:all:2713 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818564;genbank:gi:29567001;genbank:GeneID:1260238 Probab=98.52 E-value=5.6e-10 Score=71.31 Aligned_cols=113 Identities=12% Similarity=0.060 Sum_probs=76.5 Q ss_pred Ccee---hhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--cchhhhccceeeeecccCcEEEEEEecCccch Q lcl|NC_011044. 1 MPVT---ARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPV--RTGNLGRTVGELPQRYRPFHVDGGVEATADYA 75 (137) Q Consensus 1 msv~---~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv--~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA 75 (137) |+.- .+|...+..+.......+.-.+..+|..++++||.+||| +||+-|++|+..+...+.+..++.+..+++|. T Consensus 4 ~~f~~d~~~l~~~i~~~~~k~~~~~~~~~d~~a~~le~~aK~nApW~DRTg~ARqgl~~~~~~~g~~~~~Iylsh~veYG 83 (123) T protein:vir:74 4 VTFEYDAQELRTNIRNLDRRMESAVDALMDYEAAYATGQLKMRAPWTDRTGAARSGLLAVANKLGPGSHELIMSYSVHYG 83 (123) T ss_pred eEEEecHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccchhhhhhhccccccCCCceEEEEEecCeeec Confidence 4442 233445555555666666667788899999999999999 69999999987665555567999999999999 Q ss_pred hhhh--cCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHhHhhc Q lcl|NC_011044. 76 AAVH--EGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAADPDIH 135 (137) Q Consensus 76 ~~vE--~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~~~i~ 135 (137) +|+| .|.+||.|.|-..+.. .- |-..++..-.+.++-| T Consensus 84 ~~LEla~~~kyaIi~Ptv~~~~---------------------~~-im~g~~~ll~~l~~~~ 123 (123) T protein:vir:74 84 IWLEIANSGQYAVIGPFLPVMG---------------------RK-LMHDLEHLIDRLERAQ 123 (123) T ss_pred ceeeecCCCCceeecchHHHHh---------------------HH-HHHHHHHHHHHhhccC Confidence 9999 5667888877544311 11 1122333333333333 No 94 >protein:vir:79225 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469157;genbank:gi:157835000;genbank:GeneID:5648806 Probab=98.52 E-value=4.1e-10 Score=72.04 Aligned_cols=109 Identities=18% Similarity=0.161 Sum_probs=63.3 Q ss_pred Cceehhhh-------hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC--------C--------------------ccc Q lcl|NC_011044. 1 MPVTARIH-------INEPELERQSGAIFRGKHRSLTRRIATQARADV--------P--------------------VRT 45 (137) Q Consensus 1 msv~~~l~-------~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~a--------P--------------------v~T 45 (137) ||+...|. ..|..|...+. ..+..++.++..+++..+.+. | ++| T Consensus 1 M~~~i~i~~d~~~~~~~L~~l~~~~~-d~~~l~~~ig~~l~~~~~~rF~~eG~~W~pls~~t~~~r~~~g~~~~~iL~~t 79 (155) T protein:vir:79 1 MTTRIDVELDDQEVRQRLAVLMRSVT-DTLPVMRGIAAELLAETEFAFMDEGPGWPQLSPATVAAREAKGRGPHPILQVT 79 (155) T ss_pred CceEEEEEechHHHHHHHHHHHHHhh-hHHHHHHHHHHHHHHHHHHHhhccCCCCCCCCHHHHHHHhccCCCCCCccccc Confidence 87744433 33333333332 245667777777776554441 1 479 Q ss_pred hhhhccceeeeecccCcEEEEEEecCccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHH Q lcl|NC_011044. 46 GNLGRTVGELPQRYRPFHVDGGVEATADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQ 125 (137) Q Consensus 46 G~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~ 125 (137) |.|++||...... -.+.||++..||.+++||+.. .+ .++|+ +||+|||--.-+ T Consensus 80 G~L~~Si~~~~~~-----~~v~vGt~~~YA~iHqfGg~~---~~----------------~~~v~---iPaRpfLG~s~~ 132 (155) T protein:vir:79 80 NALARSVTTWADR-----NEAGIGSNLVYAAIHQFGGDA---GR----------------GHQVE---IPARRYLPFDEN 132 (155) T ss_pred hhhhhhhhceecC-----CEEEEecCchhhhhhhccccc---CC----------------CCccc---cCCccccCCCCc Confidence 9999999866432 245789999999999999742 11 12344 559999953321 Q ss_pred HH--HHHhHhhccC Q lcl|NC_011044. 126 RI--AAADPDIHMT 137 (137) Q Consensus 126 ~~--~~~~~~i~~~ 137 (137) +. ...++.|.++ T Consensus 133 ~~l~~~~~~~I~~~ 146 (155) T protein:vir:79 133 GQLAAGARQSILEV 146 (155) T ss_pred cccchHHHHHHHHH Confidence 11 1122233333 No 95 >protein:vir:101508 Length: 120 # NCBI annotation: gp21 # Family: family:all:2713 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655400;genbank:gi:109522588;genbank:GeneID:4157580 Probab=98.51 E-value=4.7e-10 Score=71.73 Aligned_cols=110 Identities=12% Similarity=0.128 Sum_probs=77.6 Q ss_pred Ccee---hhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--cchhhhccceeeeecccCcEEEEEEecCccch Q lcl|NC_011044. 1 MPVT---ARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPV--RTGNLGRTVGELPQRYRPFHVDGGVEATADYA 75 (137) Q Consensus 1 msv~---~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv--~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA 75 (137) |+.- .++...+..+..+....+.-.+..+|..++++||.+||| +||+-|++|+..+...+++..++.+..+++|. T Consensus 4 ~~f~~~~~~l~~~i~~~~~k~~~~~~~~~d~~a~~le~~aK~nApW~DRTg~ARq~i~~~~~~~~~~~~~Iylsh~veYG 83 (120) T protein:vir:10 4 IEFKFKDIELRRGVEDMEAKVDRAMKATSNYHAVEGTAHMKEHAPWTDRTGAARAGLHAVASTPQPDRYEIVFAHTVHYG 83 (120) T ss_pred EEEEecHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccchhhhhhhccccccCCCceEEEEEecCeeec Confidence 5552 233455566666667777778889999999999999999 69999999987665566777899999999999 Q ss_pred hhhh--cCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHhHhhc Q lcl|NC_011044. 76 AAVH--EGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAADPDIH 135 (137) Q Consensus 76 ~~vE--~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~~~i~ 135 (137) +|+| .|.++|.|.|...+... - | ++.+..-..+++ T Consensus 84 ~~LEla~~~kyaIl~PTi~~~~~---------------------~-i---l~g~~~ll~~l~ 120 (120) T protein:vir:10 84 IWLEIANSGRYEIIMPTVHHEGK---------------------L-M---AQRLRGLLGRLR 120 (120) T ss_pred ceEEeeCCCCcccccchHHHHhH---------------------H-H---HHHHHHHhhhcC Confidence 9999 89999998885443111 0 0 111111222222 No 96 >protein:vir:107851 Length: 175 # NCBI annotation: gp31 # Family: family:all:274 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024704;genbank:gi:48696941;genbank:GeneID:2845939 Probab=98.50 E-value=4.2e-10 Score=72.02 Aligned_cols=110 Identities=16% Similarity=0.198 Sum_probs=65.0 Q ss_pred CceehhhhhhHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHh-----C---------------------------- Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAI------FRGKHRSLTRRIATQARAD-----V---------------------------- 41 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~------~~~~~~~~a~~i~~~ak~~-----a---------------------------- 41 (137) ||+..+|..+...+.+.+... .+..+..+++.+++....+ . T Consensus 1 Ms~~i~i~~~~~~l~~~L~~l~~~~~d~~~l~~~Ig~~l~~~t~~rF~~e~~Pdw~p~~p~t~~~r~~~g~~~~k~~~~~ 80 (175) T protein:vir:10 1 MSDFVNFQIDDSALRTRLLQLEQAGHQKAGAMRKIAQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKNGELT 80 (175) T ss_pred CceeEEEEecHHHHHHHHHHHHHHhccHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchhhhhhhhcccccchhhhhhh Confidence 997555544433333333222 2456777777766554433 1 Q ss_pred ------------CccchhhhccceeeeecccCcEEEEEEecCccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeE Q lcl|NC_011044. 42 ------------PVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSV 109 (137) Q Consensus 42 ------------Pv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V 109 (137) -.+||.|++||.+....+ .+.||+|..||.+++||+..- + .+.| T Consensus 81 ~~~~~~~~~~~~L~~tG~L~~Si~~~~~~~-----~v~vGtn~~YAaiHqfGg~~~--~-----------------~~~v 136 (175) T protein:vir:10 81 AAASRRKAGLMILQDSGQMAASVSTDHDDN-----SAVIGSNKEYAAIHQFGGQAG--R-----------------GLKV 136 (175) T ss_pred hhhhhhccCCCcceechhhhhhhheeecCC-----EEEEecChhhhhhhhcccccC--C-----------------CCcc Confidence 125899999998765322 467999999999999997421 0 1124 Q ss_pred ecCCCCCCchhhhhHHHH--HHHhHhhccC Q lcl|NC_011044. 110 WHPGVRSRPFLRNAAQRI--AAADPDIHMT 137 (137) Q Consensus 110 ~~pG~~a~pfl~~A~~~~--~~~~~~i~~~ 137 (137) + +||+|||--.-++. ...++.|-++ T Consensus 137 ~---iPaRpfLG~s~~d~~~~e~~~~Il~~ 163 (175) T protein:vir:10 137 T---IPARPWLPVTADGELQPEAVEPVLNT 163 (175) T ss_pred c---cCCccccCCCcccccchHHHHHHHHH Confidence 3 56999997653322 1233444444 No 97 >protein:vir:96774 Length: 152 # NCBI annotation: hypothetical phage protein # Family: family:all:448 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224253;genbank:gi:62362388;genbank:GeneID:3345713 Probab=98.45 E-value=3.1e-09 Score=67.22 Aligned_cols=104 Identities=13% Similarity=0.114 Sum_probs=73.6 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--------------cchhhhccceeeeecc------- Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPV--------------RTGNLGRTVGELPQRY------- 59 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv--------------~TG~Lr~SI~~~~~~~------- 59 (137) ||-++.++. +.+.++..++..+++++..+.......+|| |||++|.||....... T Consensus 11 msFaa~i~~----~~~~~e~~~~~~~R~~~~~i~~~vv~~sPVg~~~~~~~~a~~~ydtGrfRanw~vS~~~p~~~~~~~ 86 (152) T protein:vir:96 11 MSWSKSLKN----IIVKNENLTEKQLRAGLFDAANTVILGSPVGAPELWQQPAPNYYRAGSYRSNHRVSISKITSFEKGI 86 (152) T ss_pred ccccccHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccchhhhhhhheeeecCCCcccccC Confidence 887666665 555566666666677777777777899999 9999999997542211 Q ss_pred ----------------cCcEEEEEEecCccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhh Q lcl|NC_011044. 60 ----------------RPFHVDGGVEATADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNA 123 (137) Q Consensus 60 ----------------~~~~~~~~v~~~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A 123 (137) -..+-++.+.++++||..+|||+ .+|.|.-|.+.+ T Consensus 87 ~~~~~t~~~~~~~i~~~~~g~~iyi~NnlPYA~~LEyG~-----------------------------S~QAP~G~vr~t 137 (152) T protein:vir:96 87 SSQSSIMMDLQSDIAKFKIGETLFMTNPLPYATSIEYGH-----------------------------SSQAPNGVYRPA 137 (152) T ss_pred CCCCchHHHHHHHHhhccccceEEEeeCchhhhHhhccc-----------------------------cCCCCchHHHHH Confidence 01234678899999999999994 257788888888 Q ss_pred HHHHHH-HhHhhccC Q lcl|NC_011044. 124 AQRIAA-ADPDIHMT 137 (137) Q Consensus 124 ~~~~~~-~~~~i~~~ 137 (137) +.+... .++.||-. T Consensus 138 ~~~~~~~v~ea~~~~ 152 (152) T protein:vir:96 138 VRRLVKFLNTELKAK 152 (152) T ss_pred HHHHHHHHHHHhccC Confidence 876543 34444444 No 98 >protein:vir:79091 Length: 175 # NCBI annotation: gp5, phage virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111205;genbank:gi:134288802;genbank:GeneID:4960765 Probab=98.45 E-value=8.6e-10 Score=70.29 Aligned_cols=110 Identities=15% Similarity=0.151 Sum_probs=64.0 Q ss_pred CceehhhhhhHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHh-----C---------------------------- Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGA------IFRGKHRSLTRRIATQARAD-----V---------------------------- 41 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~------~~~~~~~~~a~~i~~~ak~~-----a---------------------------- 41 (137) ||+.-+|..+...+.+.+.. ..+..+..+++.++...+.+ . T Consensus 1 Ms~~i~i~~d~~~~~~~L~~l~~~~~d~~~lm~~Ig~~l~~~t~~rF~~~~~PdW~pls~~t~~~r~~~~~~~~~~~~~~ 80 (175) T protein:vir:79 1 MSDFVNFQIDDSALRTRLLQLEQAGHQKADAMRKITQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKNGELT 80 (175) T ss_pred CceEEEEEechHHHHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCChHHHHhhccccccccccccch Confidence 88643332222222222211 23556777887777655442 1 Q ss_pred ------------CccchhhhccceeeeecccCcEEEEEEecCccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeE Q lcl|NC_011044. 42 ------------PVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSV 109 (137) Q Consensus 42 ------------Pv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V 109 (137) -.+||.|++||......+ .+.||+|..||.+++||+... + .+.| T Consensus 81 ~~~~~~~~~~~~L~~tG~L~~Si~~~~~~~-----~v~vGtn~~YAaiHqfGg~~~-------~------------~~~v 136 (175) T protein:vir:79 81 AAASRRKAGLMILQDSGQMAASTATDSGED-----YSVIGSNKEYAAIQHFGGQAG-------R------------GLKV 136 (175) T ss_pred hhHhhhccCCCcceechhhhhhhhheecCC-----EEEEecCcchhhHhhcccccC-------C------------Cccc Confidence 125999999998765332 457999999999999997421 0 1234 Q ss_pred ecCCCCCCchhhhhHHH--HHHHhHhhccC Q lcl|NC_011044. 110 WHPGVRSRPFLRNAAQR--IAAADPDIHMT 137 (137) Q Consensus 110 ~~pG~~a~pfl~~A~~~--~~~~~~~i~~~ 137 (137) + +||+|||--.-++ ....++.|.++ T Consensus 137 ~---IPARPfLG~s~~de~~~~~~~~I~~~ 163 (175) T protein:vir:79 137 T---IPGRAWLPVTADGELQPEAVEPVLNT 163 (175) T ss_pred c---cCcccccCCCcccchhHHHHHHHHHH Confidence 4 5599999754332 12234445555 No 99 >protein:vir:99196 Length: 155 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950453;genbank:gi:119953654;genbank:GeneID:4643056 Probab=98.43 E-value=8.4e-10 Score=70.33 Aligned_cols=109 Identities=18% Similarity=0.153 Sum_probs=62.8 Q ss_pred Cceehhh-------hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC----------------------------Cccc Q lcl|NC_011044. 1 MPVTARI-------HINEPELERQSGAIFRGKHRSLTRRIATQARADV----------------------------PVRT 45 (137) Q Consensus 1 msv~~~l-------~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~a----------------------------Pv~T 45 (137) ||+-.+| ...|..|...++ ..+..++.+++.+.+..+.+- -.+| T Consensus 1 Ms~~i~i~~d~~~~~~~L~~l~~~~~-d~~~l~~~ig~~l~~~~~~rF~pdG~~W~pls~~t~~~r~~~g~~~~~iL~~t 79 (155) T protein:vir:99 1 MTTRIDVELDDQEVRQRLALLMRSVT-DTLPVMRGIAAELLAETEFAFMDEGPGWPQLSPVTVAAREAKGRGPHPILQVT 79 (155) T ss_pred CceEEEEEechHHHHHHHHHHHHHhh-hHHHHHHHHHHHHHHHHHHHhhccCCCCCCCChHHHHHHhccCCCCCCcchhc Confidence 8874333 333344433332 245667777777766554441 1479 Q ss_pred hhhhccceeeeecccCcEEEEEEecCccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHH Q lcl|NC_011044. 46 GNLGRTVGELPQRYRPFHVDGGVEATADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQ 125 (137) Q Consensus 46 G~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~ 125 (137) |.|++||......+ .+.||++..||.+++||+.. .+ .++|. +|++|||--.-+ T Consensus 80 g~L~~Si~~~~~~~-----~v~vGtn~~YA~iHqfGg~~---~~----------------~~~v~---iPaRpfLG~s~~ 132 (155) T protein:vir:99 80 NALARSVTTWADRN-----EAGIGSNLVYAAIHQFGGDA---GR----------------GHQVE---IPARRYLPFDEN 132 (155) T ss_pred hhhhhhhhceecCC-----EEEEecCccchhhhhccccc---CC----------------CCccc---cCCccccCCCCc Confidence 99999998664322 45789999999999999732 11 11244 569999953221 Q ss_pred HH--HHHhHhhccC Q lcl|NC_011044. 126 RI--AAADPDIHMT 137 (137) Q Consensus 126 ~~--~~~~~~i~~~ 137 (137) +. ...++.|.++ T Consensus 133 ~~l~~e~~~~I~~~ 146 (155) T protein:vir:99 133 GQLAAGARQSILEI 146 (155) T ss_pred cccchHHHHHHHHH Confidence 11 1122233333 No 100 >protein:vir:6246 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813700;swissprot:trembl:q859b7;genbank:gi:29366760;uniprot:Q859B7;genbank:GeneID:1258903 Probab=98.43 E-value=4.3e-10 Score=71.96 Aligned_cols=107 Identities=21% Similarity=0.252 Sum_probs=76.0 Q ss_pred CceehhhhhhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhCCc-----------cchhhhccceeeeecccCcEEEEEE Q lcl|NC_011044. 1 MPVTARIHINEPEL-ERQSGAIFRGKHRSLTRRIATQARADVPV-----------RTGNLGRTVGELPQRYRPFHVDGGV 68 (137) Q Consensus 1 msv~~~l~~~~~~l-~~~~~~~~~~~~~~~a~~i~~~ak~~aPv-----------~TG~Lr~SI~~~~~~~~~~~~~~~v 68 (137) ..+..+|...++++ ...+.+.++.+.+.+|+.+...+++.+|+ +||.|..||+...+. ....+.. T Consensus 11 V~Glr~f~~~mrK~~g~dl~k~lk~a~~~aa~v~~~~ar~~tP~g~r~~~~s~~~r~G~L~~Sir~aaT~---raa~VrA 87 (143) T protein:vir:62 11 VDGLREFQRNVRTLRDKELNKAVREANKASGEVLIPQAKHESPDGKRDAKSSKKYRPGKLDKSIKVTASA---KGAVIKA 87 (143) T ss_pred hHHHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccCcchhhccccccccc---cceeeee Confidence 44678888999988 77888999999999999999999999999 799999999865432 2345666 Q ss_pred ec--CccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHH-----HHHhHhhccC Q lcl|NC_011044. 69 EA--TADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRI-----AAADPDIHMT 137 (137) Q Consensus 69 ~~--~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~-----~~~~~~i~~~ 137 (137) |. .++||+++|||++.+.|.| +.||..|.-+- +--+.+|..+ T Consensus 88 G~~krVPYA~~I~~G~r~r~Isp---------------------------~rFl~~a~a~te~~~~r~Ye~~i~~v 136 (143) T protein:vir:62 88 GSASRVPYAAAIHFGYRARNISP---------------------------NRFLFRAMARKSDVVAATYERRIAAV 136 (143) T ss_pred CCcCCCCcccccccCcccccccc---------------------------hhhhhhhhhccCHHHHHHHHHHHHHH Confidence 66 7999999999987665543 33333332111 1112222222 No 101 >protein:vir:1988 Length: 156 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050635;genbank:gi:9633522;genbank:GeneID:2636282 Probab=98.38 E-value=1.9e-09 Score=68.37 Aligned_cols=111 Identities=15% Similarity=0.211 Sum_probs=64.1 Q ss_pred CceehhhhhhHHHHHHHHHH---H--HHHHHHHHHHHHHHHHHHh-----CC---------------------------- Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGA---I--FRGKHRSLTRRIATQARAD-----VP---------------------------- 42 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~---~--~~~~~~~~a~~i~~~ak~~-----aP---------------------------- 42 (137) ||+..+|..+.+.+.+.+.. . .+..++.++..+.+..+.+ .| T Consensus 1 ms~~i~~~~d~~~l~~~L~~l~~~~~~~~l~~~Ig~~l~~~~~~rf~~~~~Pd~G~~W~pls~~t~~~r~~~~~~~~~~L 80 (156) T protein:vir:19 1 MSLDMNVAVDVRRIQLALDELGTVTRDRAIPRVMAAALLSSTEQAFERQADPDTGKGWEAWSDSWLAWRQDHGFVPGSIL 80 (156) T ss_pred CeEEEEEeecHHHHHHHHHHHHhhhccHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCCcccChHHHHHhhccCCCCCcch Confidence 77765555444444443322 1 1235666666665544333 12 Q ss_pred ccchhhhccceeeeecccCcEEEEEEecCccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhh Q lcl|NC_011044. 43 VRTGNLGRTVGELPQRYRPFHVDGGVEATADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRN 122 (137) Q Consensus 43 v~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~ 122 (137) .+||.|++||......+ .+.||++..||.+++||+... .. .+.| .+|++|||-- T Consensus 81 ~~tg~L~~Si~~~~~~~-----~v~vGt~~~yA~vHqfG~~~~---~~---------------~~~~---~iPaRpfLG~ 134 (156) T protein:vir:19 81 TLHGDLARSITTDYGQD-----YALIGSPKIYAAIHQWGGTPD---MA---------------PRPA---GVPARPYMGL 134 (156) T ss_pred hhhHHHHHHhhheecCC-----EEEEecchhhhHHhhcCcccc---cC---------------CCcc---ccCCccccCC Confidence 26799999998654322 457899999999999997421 11 1223 3669999976 Q ss_pred hHHHHHHHhHhhccC Q lcl|NC_011044. 123 AAQRIAAADPDIHMT 137 (137) Q Consensus 123 A~~~~~~~~~~i~~~ 137 (137) .-++...-.+.|.+- T Consensus 135 s~~d~~~I~~~i~~~ 149 (156) T protein:vir:19 135 DKTGEQEIFDAIRKR 149 (156) T ss_pred CHHHHHHHHHHHHHH Confidence 554433322222222 No 102 >protein:vir:103841 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938236;genbank:gi:38229141;genbank:GeneID:2648156 Probab=98.28 E-value=3.5e-09 Score=66.95 Aligned_cols=109 Identities=15% Similarity=0.163 Sum_probs=60.7 Q ss_pred Cceehhhhh-------hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-----C-----------------------ccc Q lcl|NC_011044. 1 MPVTARIHI-------NEPELERQSGAIFRGKHRSLTRRIATQARADV-----P-----------------------VRT 45 (137) Q Consensus 1 msv~~~l~~-------~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~a-----P-----------------------v~T 45 (137) ||+..+|.. .|..|...+. .....+..+++.+....+.+. | .+| T Consensus 1 Ms~~i~i~~~~~~~~~~L~~l~~~~~-~~~~l~~~ig~~l~~~~~~rF~p~G~~W~plsp~t~~~r~k~g~~~~~~L~~t 79 (155) T protein:vir:10 1 MANRIELELVDREVQERLAALYAAVT-DTLPLMRGIAAELLAETEFAFMDEGPGWPQLSPVTVAARAAKGRGAHPILQVT 79 (155) T ss_pred CCceEEEEechHHHHHHHHHHHHHhh-hHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCCccchHHHHhccCCCCCccccc Confidence 875433332 2333333222 245567777777766554331 1 368 Q ss_pred hhhhccceeeeecccCcEEEEEEecCccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhh-H Q lcl|NC_011044. 46 GNLGRTVGELPQRYRPFHVDGGVEATADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNA-A 124 (137) Q Consensus 46 G~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A-~ 124 (137) |.|++||......+ .+.||++..||.+++||+... + .+++. +||+|||--. - T Consensus 80 G~L~~Si~~~~~~~-----~v~vGtn~~YA~iHqfGg~~~---~----------------~~~~~---iPARPfLG~s~~ 132 (155) T protein:vir:10 80 NALARSITTRADRD-----QAQIGSNLSYAAIQQLGGQAG---R----------------GRKVT---IPARPYLPVLRN 132 (155) T ss_pred hhhhhhhhceecCC-----EEEEecCcchhhhhhcccccC---C----------------CCccc---cCCccccCCCcc Confidence 99999998664322 457899999999999997421 0 12244 5699999632 1 Q ss_pred HH-HHHHhHhhccC Q lcl|NC_011044. 125 QR-IAAADPDIHMT 137 (137) Q Consensus 125 ~~-~~~~~~~i~~~ 137 (137) ++ -..-++.|.+. T Consensus 133 ~e~~~ei~~~I~~~ 146 (155) T protein:vir:10 133 GQLKPSARDAVLDV 146 (155) T ss_pred ccchHHHHHHHHHH Confidence 11 11122223332 No 103 >protein:vir:1332 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047931;swissprot:trembl:q9zxa7;genbank:gi:9631149;uniprot:Q9ZXA7;genbank:GeneID:2715891 Probab=98.27 E-value=1.7e-09 Score=68.64 Aligned_cols=107 Identities=20% Similarity=0.242 Sum_probs=75.1 Q ss_pred CceehhhhhhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhCCcc-----------chhhhccceeeeecccCcEEEEEE Q lcl|NC_011044. 1 MPVTARIHINEPEL-ERQSGAIFRGKHRSLTRRIATQARADVPVR-----------TGNLGRTVGELPQRYRPFHVDGGV 68 (137) Q Consensus 1 msv~~~l~~~~~~l-~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~-----------TG~Lr~SI~~~~~~~~~~~~~~~v 68 (137) ..+..+|...++++ ...+.+.++.+.+.+|+.+...+++.+|+- ||.|..||+...+.. ...+.. T Consensus 11 V~Glr~f~~~mrK~~g~dl~k~lk~a~~~aa~v~~~~ar~~tP~g~~~p~~srr~r~G~L~~Sir~aaT~r---aa~VrA 87 (143) T protein:vir:13 11 VDGLRQFQRNVRALRDKELNKAVREANKASGEVLIPQAKHESPDGHRDPKSSKRYRPGKLDKSIKVTASAK---GAVIKA 87 (143) T ss_pred hHHHHHHHHHHHHhhCCcchHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccchhhcccccccccc---ceeeee Confidence 44567888899888 778889999999999999999999999995 899999998654332 345666 Q ss_pred ec--CccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHH-----HHhHhhccC Q lcl|NC_011044. 69 EA--TADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIA-----AADPDIHMT 137 (137) Q Consensus 69 ~~--~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~-----~~~~~i~~~ 137 (137) |. .++||+++|||++-+.|. ++-||+.|.-+-. --+.+|..+ T Consensus 88 Gr~arVPYA~~I~~G~r~r~Is---------------------------~~rFl~~a~a~te~~~~r~Ye~~i~~v 136 (143) T protein:vir:13 88 GSAARVPYAAAIHFGYRKRNIS---------------------------ANRFLYRAMARKSDVVAATYERRIAAV 136 (143) T ss_pred cCcCCCCcccccccCCcccccc---------------------------hhhhhhhhhhccCHHHHHHHHHHHHHH Confidence 64 489999999998655443 3344444432111 122223222 No 104 >protein:vir:4200 Length: 133 # NCBI annotation: unknown # Family: family:all:11764 # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071825;genbank:gi:11863108;genbank:GeneID:1257610 Probab=98.25 E-value=3.5e-09 Score=66.95 Aligned_cols=123 Identities=15% Similarity=0.153 Sum_probs=88.0 Q ss_pred ceehhhhhhHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccchhhhh Q lcl|NC_011044. 2 PVTARIHINEPELER--QSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAAAVH 79 (137) Q Consensus 2 sv~~~l~~~~~~l~~--~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~vE 79 (137) -+.-++|+-...+.+ .++..+++.+..+..+++..+...+|++||+||.|-.+.+. +.+|...+.++|-.||- T Consensus 1 mi~i~idkp~almek~~ev~~~ie~t~~~~~~~l~~i~~ntapiktg~lr~sh~~sie-----gstgelsn~~~yl~~vl 75 (133) T protein:vir:42 1 MIEIRIDKPDALMEKPHEVQGKIEETLEKILNQLQGIAENTAPVKTGNLRDSHIISIE-----GSTGELSNLAYYLPFVL 75 (133) T ss_pred CeeeecCCchhhhcchhhhhhHHHHHHHHHHHHHHHHhhhccccccccceeeeeEEee-----cCccchhhhhHHhhHhh Confidence 233355654444444 67778888899999999999999999999999999655443 23578889999999999 Q ss_pred cCCCCCccccccCCcceeecCCeeE-EeeeEecCCCCCCchhhhhHHHHH-------HHhHhhcc Q lcl|NC_011044. 80 EGSRPHRIVARHAQALHFFWHGREI-FRKSVWHPGVRSRPFLRNAAQRIA-------AADPDIHM 136 (137) Q Consensus 80 ~GT~ph~i~pk~~k~l~~~~~g~~~-~~k~V~~pG~~a~pfl~~A~~~~~-------~~~~~i~~ 136 (137) ||.+ ++.|..+|+|+|.---.++ +++ -.+|+.|+.-++-..+ .-++++++ T Consensus 76 ~grg--wvfpv~~kal~wpelphpvayar-----pappndyfsa~vay~~~~give~s~iewlre 133 (133) T protein:vir:42 76 HGRG--WVFPVRRKALWWPELPHPVAYAR-----PAPPNDYFSAVVAYSAPEGVVEETLIEWLRE 133 (133) T ss_pred hccc--ceeeccccccccCCCCCcccccC-----CCCCchhhhhhhhhhcccchhHHHHHHHHhC Confidence 9986 8899999999875221111 111 2567888887765432 24566666 No 105 >protein:vir:96288 Length: 100 # NCBI annotation: ORF049 # Family: family:all:180 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240315;genbank:gi:66396010;genbank:GeneID:5133365 Probab=98.23 E-value=2.5e-09 Score=67.72 Aligned_cols=84 Identities=19% Similarity=0.158 Sum_probs=64.3 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccchhhhhc Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAAAVHE 80 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~vE~ 80 (137) --|...+.+++..+++.+...+++.+.+.|..|...|..++|||||+|++||... ...+++++.|..+++||+ T Consensus 17 kyG~~dmvk~~~~f~~~i~~~vk~~IakTa~~I~~~Avs~APVD~G~Lk~SI~~d---yk~GGltavI~vGAeYAI---- 89 (100) T protein:vir:96 17 KYGADSMVVELDKFDKKIEEWVKKGIAKTTTKIYNTAVALAPVDLGFLEESIDFK---YFDGGLSSVISVGADYAI---- 89 (100) T ss_pred eechHHHHHHHhcchHHHHHHHHHHHHHHHHHHHhhHHhhccccccccceeeeee---eecCCeeEEEecchhHHH---- Confidence 1235667788899999999999999999999999999999999999999999754 334467888999999998 Q ss_pred CCCCCccccccCCcceeec Q lcl|NC_011044. 81 GSRPHRIVARHAQALHFFW 99 (137) Q Consensus 81 GT~ph~i~pk~~k~l~~~~ 99 (137) + +..+.|...+ T Consensus 90 --k------rmsqllvtvi 100 (100) T protein:vir:96 90 --K------RMSQLLVTVI 100 (100) T ss_pred --H------HHHHHHhhcC Confidence 1 1122222111 No 106 >protein:vir:105773 Length: 131 # NCBI annotation: gp14 # Family: family:all:10996 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224152;genbank:gi:62362227;genbank:GeneID:3342526 Probab=98.17 E-value=1.6e-08 Score=63.39 Aligned_cols=121 Identities=21% Similarity=0.149 Sum_probs=85.7 Q ss_pred CceehhhhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccchhhhh Q lcl|NC_011044. 1 MPVTARIHINEPELERQSG-AIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAAAVH 79 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~-~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~vE 79 (137) ..+..++...+.++.+++. ....++|.++...+...|-..+|+||..|-+|== ......+..++|.||..+.||.||| T Consensus 3 V~Gi~~~~~nl~~~i~~I~~~K~~Ral~~al~~~~~~AA~~TPIDTSTLiNSQf-rei~~ngtritGRVGYSAnYA~yVH 81 (131) T protein:vir:10 3 VKGIKRIQMNTRRVLSDIAGIRTEKVLYLVMNAGANHAAVITPVKSSTLINSQY-KKLEPIPSGMIGRVGYTANYAAAVN 81 (131) T ss_pred cchHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHhhhhhccccchhhhccccc-eeeeccCceeEEeeccceeeeeeee Confidence 5556888888988888877 4777888888888899999999999999999943 4455667789999999999999999 Q ss_pred cCCCCCc--cccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHH-HHHhHhhccC Q lcl|NC_011044. 80 EGSRPHR--IVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRI-AAADPDIHMT 137 (137) Q Consensus 80 ~GT~ph~--i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~-~~~~~~i~~~ 137 (137) .-.+-.. .+|..+.. .+.| +.-.-||..++++. ......|..- T Consensus 82 da~Gklkgqprp~gkgn--------------~w~p-~ae~eFL~kgfe~~~~d~i~avik~ 127 (131) T protein:vir:10 82 AAKGKLKGKPRPDGSGN--------------YWDP-NGEPDFLRKGFERDGLNEIKAIIRQ 127 (131) T ss_pred cCccccCCCcCCCCCcc--------------eecC-CCChhhhhhhhhccchHHHHHHHhh Confidence 8433221 22322221 1223 22346899999875 3344444433 No 107 >protein:vir:4162 Length: 133 # NCBI annotation: unknown # Family: family:all:11764 # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046971;genbank:gi:9630541;genbank:GeneID:1261715 Probab=98.08 E-value=9e-09 Score=64.68 Aligned_cols=123 Identities=15% Similarity=0.131 Sum_probs=85.3 Q ss_pred ceehhhhhhHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccchhhhh Q lcl|NC_011044. 2 PVTARIHINEPELER--QSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAAAVH 79 (137) Q Consensus 2 sv~~~l~~~~~~l~~--~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~vE 79 (137) -+.-++|+-...+.+ .++..+++.+..+..+++..+...+|++||+||.|-.+.+. +.+|...+.++|-.||- T Consensus 1 mi~i~idkp~almek~~ev~~~ie~t~~~~~~~l~~i~~ntapiktg~lr~sh~~sie-----gstgelsn~~~yl~~vl 75 (133) T protein:vir:41 1 MIRINIDKPEALMEKASEVEDRVEQTVTLLMIELEEILMNTAPIKTGELRISHTWSVE-----GSTGELTNTVPYLQWVL 75 (133) T ss_pred CeeeecCCchhhhcchhhhhhHHHHHHHHHHHHHHHHhhhccccccccceeeeeEEee-----cCccchhhhhHHhhHhh Confidence 233355654444444 67788888899999999999999999999999999655443 23578889999999999 Q ss_pred cCCCCCccccccCCcceeecCCeeE-EeeeEecCCCCCCchhhhhHHHHHHHhHhhccC Q lcl|NC_011044. 80 EGSRPHRIVARHAQALHFFWHGREI-FRKSVWHPGVRSRPFLRNAAQRIAAADPDIHMT 137 (137) Q Consensus 80 ~GT~ph~i~pk~~k~l~~~~~g~~~-~~k~V~~pG~~a~pfl~~A~~~~~~~~~~i~~~ 137 (137) ||.+ ++.|..+|+|+|.---.++ +++ -.+|+.|+.-++-..++ +.-+.+| T Consensus 76 ~grg--wvfpv~~kal~wpelphpvayar-----pappndyfsa~vay~~~-~give~s 126 (133) T protein:vir:41 76 FGRG--WVFPVEKKALYWPELPHPVAYAR-----PAPPNDYFSAAVAYIDA-KGIVEDS 126 (133) T ss_pred hccc--ceeeecccccccCCCCCcccccC-----CCCCchhhhhhhhhhcc-cchhHHH Confidence 9986 8899999999875221111 111 25678888877654322 2222222 No 108 >protein:vir:4956 Length: 153 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049932;genbank:gi:9632903;genbank:GeneID:1262079 Probab=98.07 E-value=1.4e-08 Score=63.72 Aligned_cols=108 Identities=8% Similarity=0.037 Sum_probs=70.7 Q ss_pred Cc-eehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcc------c---hhhhccceeeeecccC-cEEEEEEe Q lcl|NC_011044. 1 MP-VTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVR------T---GNLGRTVGELPQRYRP-FHVDGGVE 69 (137) Q Consensus 1 ms-v~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~------T---G~Lr~SI~~~~~~~~~-~~~~~~v~ 69 (137) |+ +..+|..++++|.+.....-+++++..|..++...+..+|.. | |+|++||........+ ......|| T Consensus 4 ~~~glee~~~~lekL~~~~~~~~~katkAGA~v~~e~L~~~tp~~h~~~~kt~~~~HlaD~I~~s~~~idG~~dG~s~VG 83 (153) T protein:vir:49 4 LDEALEGWLKTVASIGDLTPAEQAKITTAGAKVFKEELAEVTREKHYSKKKDLKYGHMADGLAVQSTNADGRKNGVSTVG 83 (153) T ss_pred HHHHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCCCCCCcccccceeccccccccccceeeec Confidence 33 345566666666665566677889999999999999999873 3 5899999865433221 11122455 Q ss_pred cC----ccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHH-------HhHhhccC Q lcl|NC_011044. 70 AT----ADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAA-------ADPDIHMT 137 (137) Q Consensus 70 ~~----~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~-------~~~~i~~~ 137 (137) -+ ..||.|+|+||.- ++|+||+.++.++.+. +.+.+++. T Consensus 84 ~~~~~~a~~a~f~n~GT~k-----------------------------m~~~hFie~tr~e~~~k~~vl~A~~~~~~~i 133 (153) T protein:vir:49 84 WKNNYHAQNARRLNDGTKK-----------------------------YRADHFITNVQNDSTVKNKVLLAEKEEYEKL 133 (153) T ss_pred ccCCccceeeeecccCccc-----------------------------CCCChhhHHHHHHhhHHHHHHHHHHHHHHHH Confidence 33 5678999999831 5699999999876432 22344433 No 109 >protein:vir:100887 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358767;genbank:gi:77999993;genbank:GeneID:3726158 Probab=97.96 E-value=3.6e-08 Score=61.36 Aligned_cols=104 Identities=12% Similarity=0.105 Sum_probs=63.9 Q ss_pred hhhhhhHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHhCCcc----------chhhhccceeeeecccCc-EEEE Q lcl|NC_011044. 5 ARIHINEPELERQSG-------AIFRGKHRSLTRRIATQARADVPVR----------TGNLGRTVGELPQRYRPF-HVDG 66 (137) Q Consensus 5 ~~l~~~~~~l~~~~~-------~~~~~~~~~~a~~i~~~ak~~aPv~----------TG~Lr~SI~~~~~~~~~~-~~~~ 66 (137) ..|+..++.+.+++. ..-..++...|..++...+.++|.. .++|+++|.....+..+. .... T Consensus 1 v~~~~~lee~l~~i~kl~~~~~~~~~ki~kaGA~v~~e~L~~~tp~~~~~~~~~~~~~~HlaD~I~~s~~~~dg~~~g~~ 80 (139) T protein:vir:10 1 MDMDEALGQWLKQVSKAAELSISDQEKITKAGADVYAKKLAETTKEKHPNTKGDGGKYGHLSEDIRSAAGDIDGDHNGSS 80 (139) T ss_pred CCHHHHHHHHHHHHHHhhccCHHHHHHHHHHHHHHHHHHHHHhcccccCcCCCCCCCCcchhhcceecCcccccccceee Confidence 233333444433333 3345578888899999999999972 368999998655332221 1222 Q ss_pred EEec--CccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHH-----HhHhhccC Q lcl|NC_011044. 67 GVEA--TADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAA-----ADPDIHMT 137 (137) Q Consensus 67 ~v~~--~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~-----~~~~i~~~ 137 (137) .||- ...+|.|+|+||. + ++|+||+.++.++++. +.+.+|+. T Consensus 81 ~VG~~k~~~~A~f~n~GT~--------------------------k---~~~~hFie~t~~e~~~evl~a~~~~~k~~ 129 (139) T protein:vir:10 81 TVGFHNKAHIARFLNDGTK--------------------------Y---IRADHFVDNARDDAKDAVFAAEAEKYQAM 129 (139) T ss_pred eeCCCCCcceEeecccCcc--------------------------c---cCCCchHHHHHHHHHHHHHHHHHHHHHHH Confidence 3443 3457788899982 1 6699999999987643 23333333 No 110 >protein:vir:102190 Length: 93 # NCBI annotation: gp21 # Family: family:all:2713 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655217;genbank:gi:109522797;genbank:GeneID:4157429 Probab=97.85 E-value=2.4e-08 Score=62.39 Aligned_cols=89 Identities=13% Similarity=0.114 Sum_probs=63.5 Q ss_pred HHHHHHHHHHHHHHHHHHhCCc--cchhhhccceeeeecccCcEEEEEEecCccchhhhhcCC--CCCccccccCCccee Q lcl|NC_011044. 22 FRGKHRSLTRRIATQARADVPV--RTGNLGRTVGELPQRYRPFHVDGGVEATADYAAAVHEGS--RPHRIVARHAQALHF 97 (137) Q Consensus 22 ~~~~~~~~a~~i~~~ak~~aPv--~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~vE~GT--~ph~i~pk~~k~l~~ 97 (137) +...++..|..++.+||.+||| +||+-|++|+..+...+....++.+..+++|.+|+|.++ ++|.|.|...+... T Consensus 1 ~~~~~d~aa~~le~~aK~nApW~DRTg~AR~~l~~~~~~~g~~~~~i~lsh~v~Yg~~LE~a~~~kyaIl~Ptv~~~~~- 79 (93) T protein:vir:10 1 MKATSNYHAVEGTAHMKEHAPWTDRTGAARAGLHAVASTPQPDRYEIVFAHTVHYGIWLEIANSGRYEIIMPTVHHEGK- 79 (93) T ss_pred CchhhhHHHHHHHHHHhcCCCccccchhhhhhhcccccccCCceEEEEEecCeeccceEEeecCCCccchhhhHHHHHH- Confidence 6777888899999999999999 699999999876655555678999999999999999965 67777775433110 Q ss_pred ecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHhHhhc Q lcl|NC_011044. 98 FWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAADPDIH 135 (137) Q Consensus 98 ~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~~~i~ 135 (137) - | ++.+..-..+++ T Consensus 80 --------------------~-i---~~g~~~ll~~l~ 93 (93) T protein:vir:10 80 --------------------L-M---AQRLRGLLGRLR 93 (93) T ss_pred --------------------H-H---HHHHHHHHHhcC Confidence 0 0 111222222233 No 111 >protein:vir:98557 Length: 149 # NCBI annotation: gp14 # Family: family:all:370 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958069;genbank:gi:41057366;genbank:GeneID:2744228 Probab=97.69 E-value=3.6e-07 Score=55.89 Aligned_cols=116 Identities=12% Similarity=0.038 Sum_probs=70.1 Q ss_pred CceehhhhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHh-----CC----c--------------------cchhhhc Q lcl|NC_011044. 1 MPVTARIHINEPELERQSG-AIFRGKHRSLTRRIATQARAD-----VP----V--------------------RTGNLGR 50 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~-~~~~~~~~~~a~~i~~~ak~~-----aP----v--------------------~TG~Lr~ 50 (137) ||=..+|+..|..+...+. ...+..+.++++.+....+.+ .| | ++|.|.+ T Consensus 1 m~d~~~l~~~L~~ll~~L~~~~~~~ll~~Ig~~l~~~t~~rf~~q~~PdG~~W~p~~~~~~~~k~~~~~~~l~~~g~l~~ 80 (149) T protein:vir:98 1 MSELTALQERLTGLIASLSPAARRQMAADIAKKLRASQQQRIRRQQAPDGTPYAARKRQSVRSKKGRIRREMFARLRTNR 80 (149) T ss_pred CchHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHHhccCCCCcccchhhhhhh Confidence 9988888888877766654 234567778887777665554 23 2 2377889 Q ss_pred cceeeeecccCcEEEEEEecCccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHH Q lcl|NC_011044. 51 TVGELPQRYRPFHVDGGVEATADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAA 130 (137) Q Consensus 51 SI~~~~~~~~~~~~~~~v~~~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~ 130 (137) ||+.....++ ..++.+|++..||..++||.. +++..+. ..|. +|++|||-=.-++-..- T Consensus 81 sl~~~~~~~~--~~V~~~Gs~~~yAa~HQfG~~---~r~~~~~-------------~~~~---iPaRp~LG~s~~d~~~i 139 (149) T protein:vir:98 81 FMKAKGSDSA--AVVEFTGRVQRMARVHQYGLK---DRPNRHS-------------RDVQ---YAARPLLGFTRDDEQMI 139 (149) T ss_pred hhhheecCCe--eEEEecCcchHHhhHhhcccc---ccccCCC-------------ccee---ccccccCCCCHHHHHHH Confidence 9876543332 223345999999999999974 3332211 1344 45999996554432222 Q ss_pred hHhhccC Q lcl|NC_011044. 131 DPDIHMT 137 (137) Q Consensus 131 ~~~i~~~ 137 (137) .+-|.+- T Consensus 140 ~~~i~~~ 146 (149) T protein:vir:98 140 EDIIIRH 146 (149) T ss_pred HHHHHHH Confidence 2222221 No 112 >protein:vir:100223 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025034;genbank:gi:48697267;genbank:GeneID:2948321 Probab=97.64 E-value=2.1e-07 Score=57.24 Aligned_cols=108 Identities=12% Similarity=0.100 Sum_probs=62.3 Q ss_pred CceehhhhhhHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHhCCc-------c---chhhhccceeeeecccC-cEEEE Q lcl|NC_011044. 1 MPVTARIHINEPELERQ---SGAIFRGKHRSLTRRIATQARADVPV-------R---TGNLGRTVGELPQRYRP-FHVDG 66 (137) Q Consensus 1 msv~~~l~~~~~~l~~~---~~~~~~~~~~~~a~~i~~~ak~~aPv-------~---TG~Lr~SI~~~~~~~~~-~~~~~ 66 (137) |+-..-|+--+..|.+. ....-..++...|..++...+.++|. + .++|+++|........+ ..... T Consensus 1 ~~~~~~l~e~l~~lekl~~~~~~~~~k~tkaGA~v~~~~L~~~tp~~~~~~~~~~~~~~HlaD~I~~~~~~idg~~~g~~ 80 (139) T protein:vir:10 1 MDMDEALGQWLKQVSKAAQLSVSDQEKITKAGADVYAKELAETTKEKHPNTKGDGGKYGHLSEDISSAAGDIDGDHNGSS 80 (139) T ss_pred CCHHHHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHhcccccccCCCCCCCCCcccccceecCccccccccccc Confidence 44332232222333222 23334567888899999999999996 2 35799999765422221 11123 Q ss_pred EEecC--ccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHH-----HhHhhccC Q lcl|NC_011044. 67 GVEAT--ADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAA-----ADPDIHMT 137 (137) Q Consensus 67 ~v~~~--~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~-----~~~~i~~~ 137 (137) .||-+ ...|.|+|+||. + ++|+||+.++.+++.+ +.+.+|+. T Consensus 81 ~VG~~~~~~~Ahf~n~GT~--------------------------~---~~~~hFie~t~~e~~~ev~~a~~~~~ke~ 129 (139) T protein:vir:10 81 TVGFHNKAHIARFLNDGTK--------------------------N---IRADHFVDNARDDAKDAVFAAEAEKYQAM 129 (139) T ss_pred eeCCCCCceeeeeeccCcc--------------------------c---cCCCchHHHHHHHHHHHHHHHHHHHHHHH Confidence 34422 334678888872 1 6799999999887533 33444444 No 113 >protein:vir:5703 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839862;genbank:gi:30065717;genbank:GeneID:1260611 Probab=97.54 E-value=8.8e-07 Score=53.79 Aligned_cols=117 Identities=14% Similarity=0.038 Sum_probs=69.8 Q ss_pred CceehhhhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHh-----C----Cc--------------------cchhhhc Q lcl|NC_011044. 1 MPVTARIHINEPELERQSG-AIFRGKHRSLTRRIATQARAD-----V----PV--------------------RTGNLGR 50 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~-~~~~~~~~~~a~~i~~~ak~~-----a----Pv--------------------~TG~Lr~ 50 (137) |+-..+++..|..+...+. ...+..+.++++.+....+.+ . || .+|.|.+ T Consensus 1 m~~~~~l~~~L~~~l~~L~~~~~~~l~~~Ig~~l~~~~~~rf~~q~~PdG~~W~p~k~~~~~~k~~~~~~~l~~~~~l~~ 80 (150) T protein:vir:57 1 MNEFKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSARKKTGRVKRKMFAKLITSR 80 (150) T ss_pred CchHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccChHHHHHhccCCCcccchhhhhcc Confidence 8888888888877776654 334556777777776665554 2 33 3456777 Q ss_pred cceeeeecccCcEEEEEEecCccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHH Q lcl|NC_011044. 51 TVGELPQRYRPFHVDGGVEATADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAA 130 (137) Q Consensus 51 SI~~~~~~~~~~~~~~~v~~~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~ 130 (137) ||+.....+. ..+...+|++..||..++||.. +++..++ ..|. +|++|||-=.-++...- T Consensus 81 sl~~~~~~~~-a~vg~~~G~~~~yAaiHQfG~~---~r~~~~~-------------~~~~---iPaRp~LG~s~~d~~~i 140 (150) T protein:vir:57 81 FLHIRASPEQ-ASMEFYGGKSPKIASVHQFGLS---EETRKDG-------------KKID---YPARPLLGFTGEDVQMI 140 (150) T ss_pred ceeeeeeCcE-EEEEeecCCchhhhhhhhcccc---ccccCCC-------------ceee---cCCcccCCCCHHHHHHH Confidence 8875443221 1122235999999999999963 2322211 1355 45999998765543322 Q ss_pred hHhhccC Q lcl|NC_011044. 131 DPDIHMT 137 (137) Q Consensus 131 ~~~i~~~ 137 (137) .+-|.+- T Consensus 141 ~~~i~~~ 147 (150) T protein:vir:57 141 EEIILAH 147 (150) T ss_pred HHHHHHH Confidence 2222222 No 114 >protein:vir:79115 Length: 148 # NCBI annotation: tail completion protein gpS # Family: family:all:370 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165266;genbank:gi:145708091;genbank:GeneID:5247126 Probab=97.52 E-value=1.2e-06 Score=53.02 Aligned_cols=116 Identities=13% Similarity=0.013 Sum_probs=71.8 Q ss_pred CceehhhhhhHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHh-----C----Cc-------------------cchhhhcc Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAI-FRGKHRSLTRRIATQARAD-----V----PV-------------------RTGNLGRT 51 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~-~~~~~~~~a~~i~~~ak~~-----a----Pv-------------------~TG~Lr~S 51 (137) |+-..+|+..|..+...+... .+..++++++.+....+.+ . || +++.|.+| T Consensus 1 m~~~~~l~~~L~~ll~~l~~~~~~~l~r~Ig~~l~~st~~Rf~~q~~PDG~~W~p~s~~~~~~~g~~~~~~~~~l~~~~~ 80 (148) T protein:vir:79 1 MSESRELEAWLAGMLTKLDAPARRMLARAVAAELRRRQAARIAEQRNPDGSPYVPRKPQLRHRAGRIRRAMFMRLRLARY 80 (148) T ss_pred CccHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCcCcccchHHHhhcccccccccchhhhhhh Confidence 999999999998888777543 3456777777776665544 2 33 23456677 Q ss_pred ceeeeecccCcEEEEEEecCccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHh Q lcl|NC_011044. 52 VGELPQRYRPFHVDGGVEATADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAAD 131 (137) Q Consensus 52 I~~~~~~~~~~~~~~~v~~~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~ 131 (137) ++.....+ ...++-+|+++.||..++||-. +++..+ .+.|++ |++|||-=.-++...-. T Consensus 81 l~~~~~~~--~~~v~~~Gt~~~yAaiHQfG~~---~r~~~~-------------~~~v~i---PaRp~LG~s~~d~~~i~ 139 (148) T protein:vir:79 81 MKTQADAN--TAVVTFAGNAQRIATVHQFGLR---DRVNKA-------------GLTAQY---PARELLGMDGVDMEHIT 139 (148) T ss_pred eeeeeeCC--eeeEEeeccchhhhhhhhcCcc---ccccCC-------------CCcccc---CcccccCCCHHHHHHHH Confidence 76554322 2233346999999999999953 232211 123554 59999987755443333 Q ss_pred HhhccC Q lcl|NC_011044. 132 PDIHMT 137 (137) Q Consensus 132 ~~i~~~ 137 (137) +-|.+- T Consensus 140 ~~i~~~ 145 (148) T protein:vir:79 140 NLLLLH 145 (148) T ss_pred HHHHHH Confidence 333333 No 115 >protein:vir:6071 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878212;genbank:gi:33438911;genbank:GeneID:1457746 Probab=97.48 E-value=1.2e-06 Score=53.05 Aligned_cols=115 Identities=14% Similarity=0.035 Sum_probs=70.6 Q ss_pred CceehhhhhhHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHh-----C----Cc--------------------cchhhhc Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGA-IFRGKHRSLTRRIATQARAD-----V----PV--------------------RTGNLGR 50 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~-~~~~~~~~~a~~i~~~ak~~-----a----Pv--------------------~TG~Lr~ 50 (137) |+-..+++..|..+...+.. ..+..+.++++.+....+.+ . || ++|.|.+ T Consensus 1 ~~~~~~l~~~L~~~l~~L~~~~~~~l~r~Ig~~l~~~~~~Rf~~q~~PdG~~W~p~~~~~~~~k~~~~~~~l~~~~~l~~ 80 (150) T protein:vir:60 1 MNEFKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSARKKTGRVKRKMFAKLITSR 80 (150) T ss_pred CchHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccChHHHHHhhcCCCccchhhhhhcc Confidence 88888888888777766643 34556777777776655444 2 23 2566788 Q ss_pred cceeeeecccCcEEEE--EEecCccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHH Q lcl|NC_011044. 51 TVGELPQRYRPFHVDG--GVEATADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIA 128 (137) Q Consensus 51 SI~~~~~~~~~~~~~~--~v~~~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~ 128 (137) ||+.....+ ++.+ .+|++..||..++||.. ++++.++ +.|. +|++|||-=.-++.. T Consensus 81 sl~~~~~~~---~a~vg~~~Gt~~~yAaiHQfG~~---~~~~~~~-------------~~~~---iPaRp~LG~s~~d~~ 138 (150) T protein:vir:60 81 FLHIRASPE---QASMEFYGGKSPKIASVHQFGLS---EENRKDG-------------KKID---YPARPLLGFTGEDVQ 138 (150) T ss_pred eeeeeeeCc---EEEEEeeCCCchhhhhhhhcccc---ccccCCC-------------Ccee---cCCcccCCCCHHHHH Confidence 887654322 2333 35999999999999963 3332221 1355 449999987755543 Q ss_pred HHhHhhccC Q lcl|NC_011044. 129 AADPDIHMT 137 (137) Q Consensus 129 ~~~~~i~~~ 137 (137) .-.+.|.+- T Consensus 139 ~i~~~i~~~ 147 (150) T protein:vir:60 139 MIEEIILAH 147 (150) T ss_pred HHHHHHHHH Confidence 333333332 No 116 >protein:vir:5000 Length: 141 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049974;genbank:gi:9632946;genbank:GeneID:1262109 Probab=97.48 E-value=2.9e-07 Score=56.41 Aligned_cols=108 Identities=9% Similarity=0.021 Sum_probs=64.7 Q ss_pred Cce----ehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc---------cchhhhccceeeeecccCcE-EEE Q lcl|NC_011044. 1 MPV----TARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPV---------RTGNLGRTVGELPQRYRPFH-VDG 66 (137) Q Consensus 1 msv----~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv---------~TG~Lr~SI~~~~~~~~~~~-~~~ 66 (137) |-- ..+|...+++|.+.....-.+++...|..++...+..+|. ..++|++||.....+..+.. ... T Consensus 1 M~~~~~gl~e~~~~lekl~~~~~~~~~katkAGA~v~~~~L~~~tp~~hy~~~~~~~~~HlaD~I~~~~~~~DG~~dg~s 80 (141) T protein:vir:50 1 MVGLAEALDEWLKTVASIGNLTPAEQVEITTAGAKVFKKELEEVTREKHYSRKKNPKFGHMADGLAIQSTNADGRKNGVS 80 (141) T ss_pred CccHHHHHHHHHHHHHHhcCCCHHHHHHHHHHHHHHHHHHHHHhcccCCCCCCCCCCCCccccceeeccCccccccCCee Confidence 432 2223333333333333445677888899999999999996 35689999976543322110 112 Q ss_pred EEec----CccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHH-------HHhHhhc Q lcl|NC_011044. 67 GVEA----TADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIA-------AADPDIH 135 (137) Q Consensus 67 ~v~~----~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~-------~~~~~i~ 135 (137) .||- ...+|.|+|+||.- ++|+||+.+|.++.+ ++.+.++ T Consensus 81 ~VG~~~~~~~~~A~f~n~GT~k-----------------------------~~~~hFve~~~~~a~~k~~Vl~A~~~~~k 131 (141) T protein:vir:50 81 TVGWKNNYHAQNARRLNDGTKK-----------------------------YRADHFVTNVQNDSTVQKKVLLEKKRNTK 131 (141) T ss_pred eeccCCCccceeeeccccCccc-----------------------------cCCCchhHHHHHhhhhHHHHHHHHHHHHH Confidence 3442 24578888999831 569999999997642 2344444 Q ss_pred cC Q lcl|NC_011044. 136 MT 137 (137) Q Consensus 136 ~~ 137 (137) +. T Consensus 132 ~~ 133 (141) T protein:vir:50 132 NS 133 (141) T ss_pred HH Confidence 33 No 117 >protein:vir:2026 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046769;genbank:gi:9630340;genbank:GeneID:1261511 Probab=97.41 E-value=1.4e-06 Score=52.75 Aligned_cols=115 Identities=14% Similarity=0.037 Sum_probs=69.3 Q ss_pred CceehhhhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHh-----C----Cc--------------------cchhhhc Q lcl|NC_011044. 1 MPVTARIHINEPELERQSG-AIFRGKHRSLTRRIATQARAD-----V----PV--------------------RTGNLGR 50 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~-~~~~~~~~~~a~~i~~~ak~~-----a----Pv--------------------~TG~Lr~ 50 (137) |+=..+|+..|..+...+. ...+..+.++++.+....+.+ . || ++|.|.+ T Consensus 1 ~~~~~~l~~~L~~ll~~l~~~~~~~l~~~Ig~~l~~~~~~rf~~q~~PdG~~W~p~k~~~~~~k~g~~~~~l~~~~~l~~ 80 (150) T protein:vir:20 1 MNEFKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSVRKKTGRVKRKMFAKLITSR 80 (150) T ss_pred CchHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHHHhccCCCccccchhhhhh Confidence 8877788888777776654 344556777777776655444 2 22 3577888 Q ss_pred cceeeeecccCcEEEE--EEecCccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHH Q lcl|NC_011044. 51 TVGELPQRYRPFHVDG--GVEATADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIA 128 (137) Q Consensus 51 SI~~~~~~~~~~~~~~--~v~~~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~ 128 (137) ||+.....+ ++.+ .+|++..||..++||.. +++..+. +.|+ +|++|||-=.-++.. T Consensus 81 sl~~~~~~~---~~~vg~~~Gs~~~yAa~HQfG~~---~~~~~~~-------------~~~~---iPaRp~LG~s~~d~~ 138 (150) T protein:vir:20 81 FLHIRASPE---QASMEFYGGKSPKIASVHQFGLS---EENRKDG-------------KKID---YPARPLLGFTGEDVQ 138 (150) T ss_pred hhheeecCc---EEEEEeeCCcchhhhhhhhcccc---cccccCC-------------Ccee---ccccccCCCCHHHHH Confidence 887654322 2333 24999999999999963 3333221 1344 459999977655432 Q ss_pred HHhHhhccC Q lcl|NC_011044. 129 AADPDIHMT 137 (137) Q Consensus 129 ~~~~~i~~~ 137 (137) .-.+-|.+- T Consensus 139 ~i~~~i~~~ 147 (150) T protein:vir:20 139 MIEEIILAH 147 (150) T ss_pred HHHHHHHHH Confidence 222222222 No 118 >protein:vir:7993 Length: 108 # NCBI annotation: gp9 # Family: family:all:3937 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817347;genbank:gi:29565775;genbank:GeneID:1259013 Probab=97.38 E-value=1.4e-07 Score=58.08 Aligned_cols=98 Identities=18% Similarity=0.193 Sum_probs=56.6 Q ss_pred Cc-e------ehhhhhhHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecC Q lcl|NC_011044. 1 MP-V------TARIHINEPELER--QSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEAT 71 (137) Q Consensus 1 ms-v------~~~l~~~~~~l~~--~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~ 71 (137) |. + .++|-..+..|.+ ++++-+++++. ++...+|.++||++|..|+||..- ...-+.-.|.++.. T Consensus 1 ma~gpt~kNP~~KFGvs~~d~~K~~EVn~GvNeFMd----E~~~~~K~~SPV~~G~Y~~S~~V~--ers~NkGRG~~G~~ 74 (108) T protein:vir:79 1 MANGPTRKNPLAKFGVRLDDFDKLPEVNQGVNEFMD----EVVDAWKNNSPVGTGAYRDSVQVT--ERSTNKGRGKVGAT 74 (108) T ss_pred CCCCcccccchhhhcCChhhhhhchhhhhhHHHHHH----HHHHHHhhcCCCCchhhHHHHHHH--HhhhccCccccCCc Confidence 32 2 2445444545544 44445555554 567789999999999999999632 22222235789999 Q ss_pred ccchhhhhcCCCCCc-cccccCCcceeecCCeeEEee Q lcl|NC_011044. 72 ADYAAAVHEGSRPHR-IVARHAQALHFFWHGREIFRK 107 (137) Q Consensus 72 ~~YA~~vE~GT~ph~-i~pk~~k~l~~~~~g~~~~~k 107 (137) ..||++|||||--.. .-|..+-+-.| |++.+.. T Consensus 75 ~~~AH~VEFGs~hndeyapaqktakqf---ggtay~d 108 (108) T protein:vir:79 75 DPQAHLVEFGSAHNDEYAPAQKTAKQF---GGTAYGD 108 (108) T ss_pred chhhhhhhhhccccccccchhhHHHhh---cccccCC Confidence 999999999984221 11221111111 3333222 No 119 >protein:vir:1838 Length: 149 # NCBI annotation: O protein # Family: family:all:370 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052262;genbank:gi:9634069;genbank:GeneID:1262457 Probab=97.33 E-value=1.9e-06 Score=52.00 Aligned_cols=116 Identities=12% Similarity=0.041 Sum_probs=67.7 Q ss_pred CceehhhhhhHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHh-----CC----c--------------------cchhhhc Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGA-IFRGKHRSLTRRIATQARAD-----VP----V--------------------RTGNLGR 50 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~-~~~~~~~~~a~~i~~~ak~~-----aP----v--------------------~TG~Lr~ 50 (137) |+-..+++..|..+...+.. .-+..+.++++.+....+.+ .| | ++|.+.+ T Consensus 1 m~~~~~~~~~l~~ll~~L~~~~~~~l~r~Ig~~l~~~t~~rf~~q~~PdG~~W~p~~~~~~~~~~g~~~~~~~~~l~~~~ 80 (149) T protein:vir:18 1 MSELTALQERLAGLIASLSPAARRKMAAEIAKKLRTSQQQRIKRQQAPDGTPYAARKRQPVRSKKGRIKREMFAKLRTSR 80 (149) T ss_pred CchHHHHHHHHHHHHHhcCCchHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchhhhhhccCcccchhhhhhhhhh Confidence 88777777777776665543 23456777777776665554 33 3 1133456 Q ss_pred cceeeeecccCcEEEEEEecCccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHH Q lcl|NC_011044. 51 TVGELPQRYRPFHVDGGVEATADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAA 130 (137) Q Consensus 51 SI~~~~~~~~~~~~~~~v~~~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~ 130 (137) |++.....+ ...++.+|++..||..++||.. +++..+. +.|+ +|++|||--.-++...- T Consensus 81 ~l~~~~~~~--~~~v~~~Gtn~~yAaiHQfG~~---~r~~~~~-------------~~v~---iPaRp~LG~s~~d~~~I 139 (149) T protein:vir:18 81 FMKAKGSDS--AAVVEFTGKVQRMARVHQYGLK---DRPNRNS-------------RDVQ---YEARPLLGFTRDDEQMI 139 (149) T ss_pred hhheeecCc--eeEEEecccchhhhhhhhcccc---ccccCCC-------------cccc---ccccccCCCCHHHHHHH Confidence 665443322 2334557999999999999974 3332211 1355 45999998765554333 Q ss_pred hHhhccC Q lcl|NC_011044. 131 DPDIHMT 137 (137) Q Consensus 131 ~~~i~~~ 137 (137) ++.|.+- T Consensus 140 ~~~i~~~ 146 (149) T protein:vir:18 140 EDVIISH 146 (149) T ss_pred HHHHHHH Confidence 3333222 No 120 >protein:vir:80970 Length: 112 # NCBI annotation: gp10 # Family: family:all:899 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468396;genbank:gi:157324970;genbank:GeneID:5601405 Probab=97.33 E-value=1.1e-06 Score=53.27 Aligned_cols=106 Identities=17% Similarity=0.127 Sum_probs=61.8 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccchhhhhc Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAAAVHE 80 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~vE~ 80 (137) |+|..+++ +..+...+.+...++...++.++.+.....+|.+||.|++|-. . .+ ++.|..+++||.++.| T Consensus 1 M~vkV~id--~~~~~~~l~~a~~~aq~~~~~ev~~~~~~yVP~~tG~L~~s~~---~-~~----~g~I~y~tPYAr~qYY 70 (112) T protein:vir:80 1 MPIKVRVD--LSKAKGSVKKAKERGQFALINQAAADIALYVPFLSGDLSNQYV---I-MN----DKEIMWTSIYARRLYN 70 (112) T ss_pred CceeEEee--hHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccCcccccee---e-cc----CceEEecCchhhHhhh Confidence 99966665 5556666666667777788888888889999999999999932 1 11 2367888999999999 Q ss_pred CCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHH-----HHhHhhccC Q lcl|NC_011044. 81 GSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIA-----AADPDIHMT 137 (137) Q Consensus 81 GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~-----~~~~~i~~~ 137 (137) |..-+. + .-++||+-++=|. +|..+-+ .....+.+- T Consensus 71 ~~~~~~--~------------------~~~~p~ag~~W~e-rak~~~~~~~~~~~~k~~~~~ 111 (112) T protein:vir:80 71 GINFNF--T------------------LTHHPLAGPKWDQ-RAKVDKLESWIEVAQKAVEEG 111 (112) T ss_pred cccCCC--C------------------cCCCCCcchhhHH-HHHhhhhHHHHHHHHHHHhhc Confidence 842110 0 0123333222222 1221110 001111111 No 121 >protein:vir:79179 Length: 155 # NCBI annotation: gp39, phage virion morphogenesis protein # Family: family:all:370 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111070;genbank:gi:134288746;genbank:GeneID:4960698 Probab=97.32 E-value=2.3e-06 Score=51.50 Aligned_cols=116 Identities=13% Similarity=0.007 Sum_probs=64.4 Q ss_pred Cc-eehhhhhhHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHh-----C----Cc--------------cchhhhcc---- Q lcl|NC_011044. 1 MP-VTARIHINEPELERQSGA-IFRGKHRSLTRRIATQARAD-----V----PV--------------RTGNLGRT---- 51 (137) Q Consensus 1 ms-v~~~l~~~~~~l~~~~~~-~~~~~~~~~a~~i~~~ak~~-----a----Pv--------------~TG~Lr~S---- 51 (137) |+ -..+|+..+..|...+.. ..+..++++++.+....+.+ . || ++|.++++ T Consensus 1 m~~~~~~l~~~l~~ll~~l~~~~~~~l~r~Ig~~l~~~t~~Rf~~q~~PDG~~W~prk~~~~~~~~~~~~g~~~~~~m~~ 80 (155) T protein:vir:79 1 MTDDLQALERWAGGLLAKLSPAARRQLLRELGRDLRRAQQSRVAAQRNPDGSAYEPRKVKAGGKRLREKAGRVKREAMFR 80 (155) T ss_pred CchHHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchhhhhhhhhcccCcccchhhhh Confidence 54 345566666666665542 33556788888777665544 2 34 24555443 Q ss_pred -------ceeeeecccCcEEEEEEecCccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhH Q lcl|NC_011044. 52 -------VGELPQRYRPFHVDGGVEATADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAA 124 (137) Q Consensus 52 -------I~~~~~~~~~~~~~~~v~~~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~ 124 (137) |+..... +...++..|++..||..++||.. +++..+ .+.|+ +|++|||--.- T Consensus 81 ~l~~a~~l~~~~~~--d~a~Vg~~Gs~~~yAaiHQfG~~---~r~~~~-------------~~~v~---iPaRp~LGls~ 139 (155) T protein:vir:79 81 KLRTARYLRIDVDS--TGLAIGFDERLSRIARVHQEGQK---APVEPG-------------GPLAQ---YPVRVVLGFSD 139 (155) T ss_pred hhhhhheeeeeecC--cEEEEEecCcchhhhhhhhcCCc---ccCCCC-------------Ccccc---cccccccCCCH Confidence 4433221 11122234999999999999973 222111 12344 55999997776 Q ss_pred HHHHHHhHhhccC Q lcl|NC_011044. 125 QRIAAADPDIHMT 137 (137) Q Consensus 125 ~~~~~~~~~i~~~ 137 (137) ++...-++-|.+- T Consensus 140 ~d~~~I~~~i~~~ 152 (155) T protein:vir:79 140 ADRELVRDRLLRE 152 (155) T ss_pred HHHHHHHHHHHHH Confidence 6543333333333 No 122 >protein:vir:45 Length: 112 # NCBI annotation: gp10 # Family: family:all:899 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463471;swissprot:trembl:q9t1b3;genbank:gi:16798793;uniprot:Q9T1B3;genbank:GeneID:922369 Probab=97.31 E-value=1.1e-06 Score=53.20 Aligned_cols=103 Identities=17% Similarity=0.097 Sum_probs=62.7 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccchhhhhc Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAAAVHE 80 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~vE~ 80 (137) |+|..+++ +..+...+.+...++...+++++.......+|.+||.|++|-. .. + ++.|..+++||.++-| T Consensus 1 M~vkv~vn--~~~~~~~l~~a~~r~q~~~~~ev~~~~~~yVP~~~G~L~~S~~--~~-~-----~g~I~y~tPYAr~qYY 70 (112) T protein:vir:45 1 MPIKVRVD--LSKAKGSVKKAKERGQFALINQAAADIALYVPFLSGDLSNQYV--IM-N-----DKEIMWTSIYARRLYK 70 (112) T ss_pred CceeEEee--hHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCccccCcccccee--ec-c-----CCeEEecChhhHHhhh Confidence 99965554 4556556666677777888888988889999999999999942 11 1 1357788999999999 Q ss_pred CCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHhHhhccC Q lcl|NC_011044. 81 GSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAADPDIHMT 137 (137) Q Consensus 81 GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~~~i~~~ 137 (137) |..-+ ++ .-++||+-++=|.+ |.-+ ..+.|.+. T Consensus 71 ~~~~~---~~-----------------~~~~p~ag~~W~er-ak~~---~~~~~~~~ 103 (112) T protein:vir:45 71 GINFN---FT-----------------LTHHPLAGPEWDQR-AKID---KMDVWEKV 103 (112) T ss_pred ccccC---CC-----------------CCCCCCCchhhHHH-HHHh---hHHHHHHH Confidence 85322 10 01234433333332 2211 11122221 No 123 >protein:vir:4859 Length: 140 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049399;genbank:gi:9632427;genbank:GeneID:1258496 Probab=97.30 E-value=9.7e-07 Score=53.56 Aligned_cols=108 Identities=10% Similarity=0.036 Sum_probs=63.8 Q ss_pred Cce----ehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc------cc---hhhhccceeeeecccC-cEEEE Q lcl|NC_011044. 1 MPV----TARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPV------RT---GNLGRTVGELPQRYRP-FHVDG 66 (137) Q Consensus 1 msv----~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv------~T---G~Lr~SI~~~~~~~~~-~~~~~ 66 (137) |-- ..+|..++++|.+.....-.++++..|..++...+..+|. +| ++|++||.....+..+ ..... T Consensus 1 M~~~~d~l~e~~~~lekl~~~~~~~~~katkAGA~v~~~~L~~~tp~~h~~~~~t~~~~HlaD~I~~~~~~iDg~~~g~s 80 (140) T protein:vir:48 1 MTGLDEALEGWLKTVASIGDLTPAEQAKITTAGAKVFKEELAEVTRQKHYSNKKHLKYGHMADGLSVQSTNVDGRKNGVS 80 (140) T ss_pred CccHHHHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCCCCCCcchhceeecccccccccCcee Confidence 422 2223333333333334455667888899999999999995 33 4699999864332211 00112 Q ss_pred EEec----CccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHH-------hHhhc Q lcl|NC_011044. 67 GVEA----TADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAA-------DPDIH 135 (137) Q Consensus 67 ~v~~----~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~-------~~~i~ 135 (137) .||- ...+|.|+++||. -++|+||+.+|.++.+.+ .+.++ T Consensus 81 ~VG~~kk~~a~~A~f~n~GT~-----------------------------k~~~~hFve~~~~e~~~k~~vl~A~~~~~~ 131 (140) T protein:vir:48 81 TVGWVNRYHAQNARRLNDGTK-----------------------------KYRADHFVTNVQNDSAVQTKVLLAEKEEYE 131 (140) T ss_pred eeccCCCcceeeeeccccCcc-----------------------------ccCCCchhHHHHHhhhhHHHHHHHHHHHHH Confidence 3432 3667889999982 166999999999865433 22222 Q ss_pred cC Q lcl|NC_011044. 136 MT 137 (137) Q Consensus 136 ~~ 137 (137) .. T Consensus 132 ~~ 133 (140) T protein:vir:48 132 KL 133 (140) T ss_pred HH Confidence 22 No 124 >protein:vir:100652 Length: 134 # NCBI annotation: 77ORF029 # Family: family:all:589 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958610;genbank:gi:41189542;genbank:GeneID:2743798 Probab=97.11 E-value=3.4e-06 Score=50.55 Aligned_cols=113 Identities=7% Similarity=0.067 Sum_probs=63.8 Q ss_pred Ccee----hhhhhhHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHhCCc--cchhhhccceeeeecccCcEEEEEEe--- Q lcl|NC_011044. 1 MPVT----ARIHINEPEL--ERQSGAIFRGKHRSLTRRIATQARADVPV--RTGNLGRTVGELPQRYRPFHVDGGVE--- 69 (137) Q Consensus 1 msv~----~~l~~~~~~l--~~~~~~~~~~~~~~~a~~i~~~ak~~aPv--~TG~Lr~SI~~~~~~~~~~~~~~~v~--- 69 (137) |||. .+|.++|+.. ...++.+.+++|.++++.++.++|.+..+ |||.+.+++........+..-.+.|+ T Consensus 1 MsvevkGv~eil~~LE~k~g~~~~~ri~dkAL~~age~v~~~~K~~~~~fkDTGati~ev~~s~p~~~~G~r~V~vgW~G 80 (134) T protein:vir:10 1 MSVKVTGDKALERELEKHFGIKEMVKVQDKALIAGAKVIVEEIKKQLKPSEDSGALISEIGRTEPEWIKGKRTVTIRWRG 80 (134) T ss_pred CeEEeecHHHHHHHHHHhhchhhhhhhhhHHHHHHhHHHHHHHHhhcCccccccceeccEeecCeeecCCceEEEEEEEc Confidence 8874 5677777666 56778888999999999999999998887 99999988865443322222233332 Q ss_pred c--CccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHH--------HHHHhHhh Q lcl|NC_011044. 70 A--TADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQR--------IAAADPDI 134 (137) Q Consensus 70 ~--~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~--------~~~~~~~i 134 (137) + --.+-++.|||+- ..++++.+.- .|+=+ +..|++. ++...+++ T Consensus 81 ~~~R~~ivHLnE~Gyt----~~r~Gk~i~P--------------rG~G~---i~~a~~~~e~~~~~~ik~eL~kl 134 (134) T protein:vir:10 81 PFERFRIVHLIENGHV----EKKSGKFVKP--------------KAMGG---INRAIRQGQNKYFETLKRELKKL 134 (134) T ss_pred CCceeeEEEeeeccee----ecCCCCeecc--------------chhhH---HHHHHHhhhHHHHHHHHHHHhcC Confidence 1 1235566777751 0111111000 01111 2334432 22233333 No 125 >protein:vir:100312 Length: 152 # NCBI annotation: tail synthesis protein S # Family: family:all:370 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655481;genbank:gi:109289949;genbank:GeneID:4157355 Probab=97.04 E-value=7e-06 Score=48.85 Aligned_cols=119 Identities=16% Similarity=0.068 Sum_probs=62.6 Q ss_pred Cce-ehhhhhhHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHh---------CCc--------------cchhhhccceee Q lcl|NC_011044. 1 MPV-TARIHINEPELERQSGA-IFRGKHRSLTRRIATQARAD---------VPV--------------RTGNLGRTVGEL 55 (137) Q Consensus 1 msv-~~~l~~~~~~l~~~~~~-~~~~~~~~~a~~i~~~ak~~---------aPv--------------~TG~Lr~SI~~~ 55 (137) ||= +.+++..|..+...+.. .-++.+.++++.+....+.+ .|| ++|.+-+++... T Consensus 1 M~~~~~~~~~~L~~ll~~L~~~~r~~l~~~Ig~~l~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~k~~~~~~~m~~~L~~a 80 (152) T protein:vir:10 1 MSEPIEQVKTAFDSLLNNISKPRRRLMYQQIGRELARSQRRRIKAQQNPDGSAYEPRKKPKKGVKSKIKSGKMFDKITQP 80 (152) T ss_pred CchHHHHHHHHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchhhhhhhhhcccccchhHHHhhhhc Confidence 776 33344555544444432 33456778888877666555 244 233332222110 Q ss_pred ---eecccCcEE-EEEEecCccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHh Q lcl|NC_011044. 56 ---PQRYRPFHV-DGGVEATADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAAD 131 (137) Q Consensus 56 ---~~~~~~~~~-~~~v~~~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~ 131 (137) .....+.++ ++.+|++..||..++||-. +++..++.+ +|++ |++|||--+-++...-+ T Consensus 81 ~~l~~~a~~~~~~Vg~~Gt~~~yAaiHQfG~~---~r~~~~~~~------------~v~i---PaRp~LG~s~~d~~~I~ 142 (152) T protein:vir:10 81 RFMRLRLESEGVSLGYEGGDAVIARIHQQGLI---GRVRKDWDL------------KVKY---ASRELLGFTDDDLQMIE 142 (152) T ss_pred ceeeeeecCcEEEEEecCCchhhhhhhccCcc---ccccCCCCc------------ceec---cccccCCCCHHHHHHHH Confidence 011222223 2334999999999999963 444433322 3454 49999977755543333 Q ss_pred HhhccC Q lcl|NC_011044. 132 PDIHMT 137 (137) Q Consensus 132 ~~i~~~ 137 (137) +-|.+- T Consensus 143 ~~i~~~ 148 (152) T protein:vir:10 143 DYMINI 148 (152) T ss_pred HHHHHH Confidence 333322 No 126 >protein:vir:6216 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:10886 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852596;genbank:gi:31415856;genbank:GeneID:1489214 Probab=96.94 E-value=6.5e-06 Score=49.04 Aligned_cols=110 Identities=16% Similarity=0.177 Sum_probs=81.0 Q ss_pred Ccee----hhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc----cchhhhccceeeeecccCcEEEEEEecCc Q lcl|NC_011044. 1 MPVT----ARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPV----RTGNLGRTVGELPQRYRPFHVDGGVEATA 72 (137) Q Consensus 1 msv~----~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv----~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~ 72 (137) |+-. ++.-..+..|.+--.++-.+.|.++|+.-.+..+-+.|+ ..|+||++++..+. +..+.+.....+ T Consensus 1 m~sNNNGFae~~~~~~tl~kVd~kvs~e~L~eAA~~f~~KL~P~Ip~Sl~kkk~HlrD~lkVvvk---~d~V~V~Fed~a 77 (125) T protein:vir:62 1 MASNNNGFAEALEDINTLLRVNKKVSLDALDEAAKYFASKLKPKINVSNKNKRTHLRDSLKVVVK---DDRVSVEFKDEA 77 (125) T ss_pred CCCCchhHHHHHHHhhhhhhhhhhhhHHHHHHHHHHHHHhhccccChhhhhhhhhcceeeeEEee---CCeEEEEEcchh Confidence 7653 344455555555556677889999999988888888887 46899999985443 335778889999 Q ss_pred cchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCC-CCCCchhhhhHHHHHHHhHhhccC Q lcl|NC_011044. 73 DYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPG-VRSRPFLRNAAQRIAAADPDIHMT 137 (137) Q Consensus 73 ~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG-~~a~pfl~~A~~~~~~~~~~i~~~ 137 (137) -|..|+|.||.+ .+++ | .++|.|....++.++.++..|--. T Consensus 78 ~yW~f~EnGt~~-----~~~~-------------------g~vkaqhf~~~Tf~~nk~kI~~iM~k 119 (125) T protein:vir:62 78 WYWYLVEHGHKK-----AKGK-------------------GRVKGKHFVQNTFDAEGDKIADIMAQ 119 (125) T ss_pred hhhhhhhccccc-----cccc-------------------cccchhhhhhccHHhhHHHHHHHHHH Confidence 999999999943 3332 3 569999999999888877776322 No 127 >protein:vir:101302 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908835;genbank:gi:118725099;genbank:GeneID:4555873 Probab=96.93 E-value=7.5e-06 Score=48.69 Aligned_cols=109 Identities=10% Similarity=0.039 Sum_probs=65.8 Q ss_pred Ccee----hhhhhhHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHhCCc--cchhhhccceeeeecccCcEEEEEEe--- Q lcl|NC_011044. 1 MPVT----ARIHINEPEL--ERQSGAIFRGKHRSLTRRIATQARADVPV--RTGNLGRTVGELPQRYRPFHVDGGVE--- 69 (137) Q Consensus 1 msv~----~~l~~~~~~l--~~~~~~~~~~~~~~~a~~i~~~ak~~aPv--~TG~Lr~SI~~~~~~~~~~~~~~~v~--- 69 (137) |||. .+|.++|+.. ...++.+.+++|.++++.++.++|.+.++ |||.+.+++........+..-.+.|+ T Consensus 1 msvevkGv~eil~~le~k~g~~~~~ri~nkAL~~age~v~~~~K~~~~~fkDTG~t~~ev~~s~p~~~~G~r~V~vgW~G 80 (134) T protein:vir:10 1 MSVKVIGDKALERELEKRFGIKEMVKVQDKALIAGAKVIVEEVKKQLKPSKDTGALINEVSFSKPEWINGKRTITVHWRG 80 (134) T ss_pred CeEEEecHHHHHHHHHHhhchhhhhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeccEEecCeeecCCceEEEEEEEc Confidence 8884 5566777665 56778888999999999999999999998 99999999875544432222233342 Q ss_pred c--CccchhhhhcCC-C---CCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHH--------HHHHhHhh Q lcl|NC_011044. 70 A--TADYAAAVHEGS-R---PHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQR--------IAAADPDI 134 (137) Q Consensus 70 ~--~~~YA~~vE~GT-~---ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~--------~~~~~~~i 134 (137) + --.+-++.|||+ + ...|+|+.-.. +..|++. ++...+++ T Consensus 81 ~~~R~~iiHLNE~Gytr~~~Gk~i~PrG~G~-------------------------i~~a~~~~e~~~~~~ik~eL~kl 134 (134) T protein:vir:10 81 SKDRYKIVHLIEYGHVQKGTGKFIKPKAMGG-------------------------VNRAIRQGQNKYFETLKRELKKL 134 (134) T ss_pred CCceeEEEEeecccceecccCCccCcchhhH-------------------------HHHHHHhhhHHHHHHHHHHHhcC Confidence 2 234566778884 1 11223321111 2333332 33333333 No 128 >protein:vir:9513 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835560;genbank:gi:30043947;genbank:GeneID:1260542 Probab=96.93 E-value=7.5e-06 Score=48.69 Aligned_cols=109 Identities=10% Similarity=0.039 Sum_probs=65.8 Q ss_pred Ccee----hhhhhhHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHhCCc--cchhhhccceeeeecccCcEEEEEEe--- Q lcl|NC_011044. 1 MPVT----ARIHINEPEL--ERQSGAIFRGKHRSLTRRIATQARADVPV--RTGNLGRTVGELPQRYRPFHVDGGVE--- 69 (137) Q Consensus 1 msv~----~~l~~~~~~l--~~~~~~~~~~~~~~~a~~i~~~ak~~aPv--~TG~Lr~SI~~~~~~~~~~~~~~~v~--- 69 (137) |||. .+|.++|+.. ...++.+.+++|.++++.++.++|.+.++ |||.+.+++........+..-.+.|+ T Consensus 1 msvevkGv~eil~~le~k~g~~~~~ri~nkAL~~age~v~~~~K~~~~~fkDTG~t~~ev~~s~p~~~~G~r~V~vgW~G 80 (134) T protein:vir:95 1 MSVKVIGDKALERELEKRFGIKEMVKVQDKALIAGAKVIVEEVKKQLKPSKDTGALINEVSFSKPEWINGKRTITVHWRG 80 (134) T ss_pred CeEEEecHHHHHHHHHHhhchhhhhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeccEEecCeeecCCceEEEEEEEc Confidence 8884 5566777665 56778888999999999999999999998 99999999875544432222233342 Q ss_pred c--CccchhhhhcCC-C---CCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHH--------HHHHhHhh Q lcl|NC_011044. 70 A--TADYAAAVHEGS-R---PHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQR--------IAAADPDI 134 (137) Q Consensus 70 ~--~~~YA~~vE~GT-~---ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~--------~~~~~~~i 134 (137) + --.+-++.|||+ + ...|+|+.-.. +..|++. ++...+++ T Consensus 81 ~~~R~~iiHLNE~Gytr~~~Gk~i~PrG~G~-------------------------i~~a~~~~e~~~~~~ik~eL~kl 134 (134) T protein:vir:95 81 SKDRYKIVHLIEYGHVQKGTGKFIKPKAMGG-------------------------VNRAIRQGQNKYFETLKRELKKL 134 (134) T ss_pred CCceeEEEEeecccceecccCCccCcchhhH-------------------------HHHHHHhhhHHHHHHHHHHHhcC Confidence 2 234566778884 1 11223321111 2333332 33333333 No 129 >protein:vir:4833 Length: 140 # NCBI annotation: ORF29 # Family: family:all:1029 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038330;genbank:gi:9634656;genbank:GeneID:1262624 Probab=96.88 E-value=3.9e-06 Score=50.26 Aligned_cols=104 Identities=13% Similarity=0.082 Sum_probs=62.9 Q ss_pred CceehhhhhhHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHhCCcc------c---hhhhccceeeeecccC--c Q lcl|NC_011044. 1 MPVTARIHINEPELERQS-------GAIFRGKHRSLTRRIATQARADVPVR------T---GNLGRTVGELPQRYRP--F 62 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~-------~~~~~~~~~~~a~~i~~~ak~~aPv~------T---G~Lr~SI~~~~~~~~~--~ 62 (137) |- .|+..|..|.+++ ...-.+++...|..++...+..+|.. | |+|++||........+ . T Consensus 1 M~---~~~d~l~e~~~~v~kl~~~~~~~~~katkAGAkv~~~~L~~~tp~~h~~~r~t~~~~HlaD~I~~~~~~idg~~d 77 (140) T protein:vir:48 1 MT---GLDEALEGWLKTVASIGDLTPAEQAKITTAGAKVFKKELAEVTREKHYSKKKDLKYGHMADGLAVQSTNVDGRKN 77 (140) T ss_pred Cc---cHHHHHHHHHHHHHHhccCCHHHHHHHHHHhHHHHHHHHHHhcccCCCCCCCCCCCCcccccceecccccccccc Confidence 42 3444444444444 34445678888899999999999863 4 4799999865332111 1 Q ss_pred EEEEEEec----CccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHH-------HHh Q lcl|NC_011044. 63 HVDGGVEA----TADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIA-------AAD 131 (137) Q Consensus 63 ~~~~~v~~----~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~-------~~~ 131 (137) +. ..||- .+.+|.|+++||. -++|+||+.++.++.+ ++. T Consensus 78 G~-s~VG~~k~~~a~~a~f~NdGT~-----------------------------k~~~~hFve~t~~e~~~~~~vl~A~~ 127 (140) T protein:vir:48 78 GV-ATVGWKNNYHAQNARRLNDGTK-----------------------------KYRADHFVTNVQNDSAVRDKVLLAEK 127 (140) T ss_pred cc-eeecccCCCceeEEeecccCcc-----------------------------ccCCCchHHHHHHhhhhHHHHHHHHH Confidence 11 12332 3567888888882 1669999999987542 233 Q ss_pred HhhccC Q lcl|NC_011044. 132 PDIHMT 137 (137) Q Consensus 132 ~~i~~~ 137 (137) +..+.. T Consensus 128 ~~y~~~ 133 (140) T protein:vir:48 128 EEYEKL 133 (140) T ss_pred HHHHHH Confidence 333333 No 130 >protein:vir:96105 Length: 193 # NCBI annotation: hypothetical protein ORF028 # Family: family:all:503 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294445;genbank:gi:149408342;genbank:GeneID:5237224 Probab=96.83 E-value=3.2e-06 Score=50.73 Aligned_cols=106 Identities=16% Similarity=0.072 Sum_probs=49.0 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccc-hhhhh Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADY-AAAVH 79 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~Y-A~~vE 79 (137) ||+...+. .+.++.+++. .- ..--|.-|-+..+... + ...+.+++++.| |.++| T Consensus 1 m~~~~~~~-~~~~~~~~l~---------------~l--~~~~v~vGi~~~~~~~----~---~~~~~~G~~va~iAai~E 55 (193) T protein:vir:96 1 MSLRRDSE-LIAAHLQMLR---------------AM--RGRSVSAGWYSTARYP----D---KAGGSVGIQVARIARLNE 55 (193) T ss_pred CeeccchH-HHHHHHHHHH---------------Hh--cCCeEEEEEcCCCCCC----C---cccccccchHHHHHhHHH Confidence 88743321 1111111111 00 0111233544332211 0 112356777777 99999 Q ss_pred cCCCCCccccccCCcceee--cCCeeEEe----------------eeEecCCCCCCchhhhhHHHHHHHhHhhccC Q lcl|NC_011044. 80 EGSRPHRIVARHAQALHFF--WHGREIFR----------------KSVWHPGVRSRPFLRNAAQRIAAADPDIHMT 137 (137) Q Consensus 80 ~GT~ph~i~pk~~k~l~~~--~~g~~~~~----------------k~V~~pG~~a~pfl~~A~~~~~~~~~~i~~~ 137 (137) ||+ +|+++....+... ..|..++. +.| .+||||||++++++++.....+..- T Consensus 56 fG~---~I~~~~~~~~~~~~~~~g~~~~~~~~k~~~~~~~~~~~~~~v---~IPaRPFlr~t~~~~~~~~~~~~~~ 125 (193) T protein:vir:96 56 YGG---TIDHPGGTRYIRDAIVRGRFVGVRFVRNDFPGETEVTKPHRI---TIPARPFMRYAWNLFSADRAAIQNR 125 (193) T ss_pred cCC---ccccCccceeeeeccccccccccceeccCcceeeEeecceec---cCCCcchhhhhHHHHHHHHHHHHHH Confidence 997 4444333332211 11222222 233 3569999999999887654443322 No 131 >protein:vir:4790 Length: 114 # NCBI annotation: putative minor capsid protein 3 # Family: family:all:899 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150170;swissprot:trembl:q94m41;genbank:gi:15088781;uniprot:Q94M41;genbank:GeneID:955992 Probab=96.64 E-value=1.4e-05 Score=47.12 Aligned_cols=106 Identities=16% Similarity=0.146 Sum_probs=60.4 Q ss_pred CceehhhhhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccchhhhh Q lcl|NC_011044. 1 MPVTARIHINEPELERQS-GAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAAAVH 79 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~-~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~vE 79 (137) |++..+++ +..+.+.+ ...+++.....+.++.+.....+|.+||.|++|.... .+ .+.|..+++||.++- T Consensus 1 M~~kVkv~--l~~~~~~l~~~~l~r~Q~~~~~ev~~~~~~YVP~~~G~L~~S~~~~---~~----~~~I~y~tPYAr~qy 71 (114) T protein:vir:47 1 MNIAIKVD--LQKAKQKLSNESMTRGKIAVASKILLDNEQYIPLRGGELRASGRIV---GQ----GDAVVYGTVYARAQF 71 (114) T ss_pred CceeEEee--hhHHHHHHHHHHHHHHHHHHHHHHHHhhccCCcCccCccccceeee---eC----CcEEEecCchhhHhh Confidence 88855544 55565554 3455666677788888888899999999999996432 11 135888899999999 Q ss_pred cCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHhHhhccC Q lcl|NC_011044. 80 EGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAADPDIHMT 137 (137) Q Consensus 80 ~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~~~i~~~ 137 (137) ||..+-. +.. .-++|++-++=|. +|.-+ -.+.|... T Consensus 72 Yg~~~~~-~~~-----------------~~~~p~~g~~W~e-raka~---~~~~~~~~ 107 (114) T protein:vir:47 72 YGSNGIV-TFR-----------------RYTTPGTGKRWDQ-VATSK---HAEEWARA 107 (114) T ss_pred hcccCCC-CCC-----------------ccCCCCCcchhHH-HHHhh---hhHHHHHH Confidence 9842210 000 0112333332222 23221 11112222 No 132 >protein:vir:1164 Length: 156 # NCBI annotation: predicted tail completion # Family: family:all:370 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490613;genbank:gi:17313233;genbank:GeneID:927308 Probab=96.53 E-value=4e-05 Score=44.68 Aligned_cols=115 Identities=15% Similarity=0.049 Sum_probs=60.9 Q ss_pred Ccee-hhhhhhHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHh---------CCcc----------chh----------hh Q lcl|NC_011044. 1 MPVT-ARIHINEPELERQSGA-IFRGKHRSLTRRIATQARAD---------VPVR----------TGN----------LG 49 (137) Q Consensus 1 msv~-~~l~~~~~~l~~~~~~-~~~~~~~~~a~~i~~~ak~~---------aPv~----------TG~----------Lr 49 (137) ||=. .+|+..|..|...+.. .-++.+.++++.+....+.+ .||. +|. |+ T Consensus 1 m~~~~~~l~~~L~~ll~~L~~~~~~~l~r~Ig~~l~~~t~~Rf~~q~~PdG~~W~p~~~~~~~~~~~~~~~~~~m~~~l~ 80 (156) T protein:vir:11 1 MADSLEALEDWAGPILRALEPGPRAALARSLARDLRRSQQKRVMAQRNPDGSAYEPRKKRELRGKQGRIRRKIKMFQKLR 80 (156) T ss_pred CchhHHHHHHHHHHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHhhhccccccchhhhhhhh Confidence 7662 3355555555544432 23456888888877766655 2331 111 22 Q ss_pred c--cceeeeecccCcEEE-EEEecCccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHH Q lcl|NC_011044. 50 R--TVGELPQRYRPFHVD-GGVEATADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQR 126 (137) Q Consensus 50 ~--SI~~~~~~~~~~~~~-~~v~~~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~ 126 (137) . +|+... .+.++. +..|++..||..++||.. +++..+ .+.|+ +|++|||.-.-++ T Consensus 81 ~~~~l~~~~---~~~~a~vg~~Gs~~~yA~iHQfG~~---~~~~~~-------------~~~v~---iPaRp~LG~s~~d 138 (156) T protein:vir:11 81 TVRYLRAKG---DAQAITVSFAGRIARIARVHQYGLR---DRAEPG-------------APEVS---YAQRLLLGFDSSD 138 (156) T ss_pred hhheeeeee---cCcEEEEEecCCchhhhhhhccccc---ccccCC-------------CCccc---ccccccCCCCHHH Confidence 2 243322 222222 233899999999999973 333222 12355 4599999777555 Q ss_pred HHHHhHhhccC Q lcl|NC_011044. 127 IAAADPDIHMT 137 (137) Q Consensus 127 ~~~~~~~i~~~ 137 (137) ...-.+-|.+- T Consensus 139 ~~~i~~~i~~~ 149 (156) T protein:vir:11 139 METIQNGILAH 149 (156) T ss_pred HHHHHHHHHHH Confidence 43333333222 No 133 >protein:vir:8106 Length: 150 # NCBI annotation: gp10 # Family: family:all:3937 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817687;genbank:gi:29566118;genbank:GeneID:1259312 Probab=96.50 E-value=4.3e-06 Score=50.02 Aligned_cols=124 Identities=19% Similarity=0.194 Sum_probs=61.0 Q ss_pred Cce-ehhhhhhHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCcc Q lcl|NC_011044. 1 MPV-TARIHINEPELERQ------SGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATAD 73 (137) Q Consensus 1 msv-~~~l~~~~~~l~~~------~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~ 73 (137) |-- .++|-..+..|.+. +++-++++|.+.| ...||.+.||++|+.+.||........ -.+.++.... T Consensus 1 mgNP~~KFGvS~~e~~K~irns~EV~~GiNdFMe~~A---~~~aK~~SPV~~GeY~~S~~V~~ka~N---GRG~~G~~~~ 74 (150) T protein:vir:81 1 MGNPFEKFGVSDSELAKHIRNSAEVDAGINDFMENEA---IPYAKSISPVDDGEYAASWAVMKKAKN---GRGVFGPKAW 74 (150) T ss_pred CCCchhhhcCCHHHHHHhhccchhhhhhHHHHHHhhh---hhhhhccCCcccchhHHHHHHHhhccc---CccccCccch Confidence 432 34454455555543 4455556655543 356799999999999999964322121 3578999999 Q ss_pred chhhhhcCCCCCccccccCCccee-------ecCCeeEEeeeEecCC--CCCCchhhhhHHHHHHHh-----HhhccC Q lcl|NC_011044. 74 YAAAVHEGSRPHRIVARHAQALHF-------FWHGREIFRKSVWHPG--VRSRPFLRNAAQRIAAAD-----PDIHMT 137 (137) Q Consensus 74 YA~~vE~GT~ph~i~pk~~k~l~~-------~~~g~~~~~k~V~~pG--~~a~pfl~~A~~~~~~~~-----~~i~~~ 137 (137) ||+||||||+...-..+.++.-+- .+..+. |.+ | -|- |++|-.- +.++.-- .-|-.. T Consensus 75 ~AH~VEFGtgadkkqgrgkkgkrgkdgkrtveiddge-frr-v-gpdtptkaqgia----qkvashfggslkggisks 145 (150) T protein:vir:81 75 YAHFVEFGTGADKKQGRGKKGKRGKDGKRTVEIDDGE-FRR-V-GPDTPTKAQGIA----QKVASHFGGSLKGGISKS 145 (150) T ss_pred hhhhhhhccccccccccccccccCcccceeeeecCcc-cee-c-CCCCchhhhhHH----HHHHHhcccccccccccc Confidence 999999999865432222221111 111111 111 1 121 2222111 1111100 000000 No 134 >protein:vir:1581 Length: 116 # NCBI annotation: minor capsid protein # Family: family:all:899 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695163;swissprot:trembl:o03933;genbank:gi:23455806;uniprot:O03933;genbank:GeneID:955512 Probab=96.46 E-value=2.8e-05 Score=45.52 Aligned_cols=112 Identities=13% Similarity=0.032 Sum_probs=64.4 Q ss_pred CceehhhhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccchhhhh Q lcl|NC_011044. 1 MPVTARIHINEPELERQSG-AIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAAAVH 79 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~-~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~vE 79 (137) |++..+++ +..+.+.+. +.++++....+.++.......+|.|||+|..|.......+ .+.|..+++||.+.- T Consensus 1 M~ikVkv~--l~~~~~~~~~~~~~r~Q~~l~~qv~~~m~~YVP~~tg~~~ls~~~~~~~~-----~~~I~y~tPYAr~qy 73 (116) T protein:vir:15 1 MAFRINVD--LDGFMDQTSLDNVKRGQYALVNQAMYDMEQFVPKDRPEEPLRQSVHATSD-----GSEITYSTPYAKAQF 73 (116) T ss_pred CCceEEee--hhHhhhhhhHHHHHHHHHHHHHHHHHhhhccCCcccCCcccccceeeecC-----CceEEecCchhHHHh Confidence 99855554 777887664 6777778888888999999999999988666643333222 246888899999999 Q ss_pred cCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHH-HHhHhhccC Q lcl|NC_011044. 80 EGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIA-AADPDIHMT 137 (137) Q Consensus 80 ~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~-~~~~~i~~~ 137 (137) ||.- .+.. .+. .-.+||+-++=|++ |..+-+ .=.+-.+.. T Consensus 74 Yg~~----~~~~------------~~~-~~t~p~ag~~W~er-aK~~h~~~w~~~~~k~ 114 (116) T protein:vir:15 74 YGII----NDKY------------PVH-NYTTPGTTKRWDLK-AKSMFMSSWIDTFTKG 114 (116) T ss_pred cccc----cCCC------------Ccc-cccCCCCCcchhHH-HHhhhHHHHHHHHHHh Confidence 9851 1100 011 12235554443332 221111 000000111 No 135 >protein:vir:9647 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795409;genbank:gi:28876182;genbank:GeneID:1257731 Probab=96.30 E-value=4.6e-06 Score=49.86 Aligned_cols=111 Identities=9% Similarity=0.061 Sum_probs=63.1 Q ss_pred Ccee------hhhhhhHHH-HHH-HHHHHHHHHHHHHHHHHHHHHHHhCCc--cchhhhccceeeeecccCcEEEEEEec Q lcl|NC_011044. 1 MPVT------ARIHINEPE-LER-QSGAIFRGKHRSLTRRIATQARADVPV--RTGNLGRTVGELPQRYRPFHVDGGVEA 70 (137) Q Consensus 1 msv~------~~l~~~~~~-l~~-~~~~~~~~~~~~~a~~i~~~ak~~aPv--~TG~Lr~SI~~~~~~~~~~~~~~~v~~ 70 (137) ||+- .+|.++|+. |.+ .++.+.+++|+++++.++...|.+.|+ |||.+-+++........+..-.+.|+- T Consensus 1 ~~~~aevkGv~Eilk~lE~klG~~~v~ri~nkAL~~~ge~v~~~lK~~~~~f~DTG~t~dev~~s~~~~~~G~r~V~VgW 80 (132) T protein:vir:96 1 MSGFANLKGVEELLANMEKKLGPAKVNRVVNRSLKEIGKELEPSFKSAISIYKRTGETTESAVVSGVRREDGIPKVKLGF 80 (132) T ss_pred CCccccccCHHHHHHHHHHhhCHHHHHHHhHHHHHHHHHHHHHHHHHhhhhhhhcchhhcceeecCeeecCCceEEEecc Confidence 6664 455666655 665 577888999999999999999999998 999999998765444333223344432 Q ss_pred C-ccch--hhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHH-----HHhHhhccC Q lcl|NC_011044. 71 T-ADYA--AAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIA-----AADPDIHMT 137 (137) Q Consensus 71 ~-~~YA--~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~-----~~~~~i~~~ 137 (137) . ..|. +..|||+ ...|+|+... ++..|++... ...++++.. T Consensus 81 ~GpR~~ivHLNE~Gy-Gk~~~PrG~G-------------------------~I~~a~~~se~~~~~~~~~elkk~ 129 (132) T protein:vir:96 81 TTPRWNIVHLQELEY-GWKHNRRGVG-------------------------VIRRYSDILETIYPRGIRDKLKRG 129 (132) T ss_pred cCCceeEEeeecccc-cCCcCCCcch-------------------------HHHHHHHhhhhHHHHHHHHHHHHH Confidence 1 1221 2234554 2223333222 2444444321 122222222 No 136 >protein:vir:99546 Length: 200 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039796;genbank:gi:126011046;genbank:GeneID:4818241 Probab=96.17 E-value=4e-05 Score=44.68 Aligned_cols=107 Identities=13% Similarity=0.127 Sum_probs=45.8 Q ss_pred Cceehhhhh--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCc-cchhh Q lcl|NC_011044. 1 MPVTARIHI--NEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATA-DYAAA 77 (137) Q Consensus 1 msv~~~l~~--~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~-~YA~~ 77 (137) ||-+.++.. .+.++.+++.. | ...-|.-|-+..+-... .+ ....++++ ..|.+ T Consensus 5 ~~~~~k~~~~~~~~~~~~~l~~-----l------------~~~~v~vGi~~~~~y~~--~~-----~~~dG~~va~IA~~ 60 (200) T protein:vir:99 5 FSKSNSVAAPLKHFQMLKQFDA-----L------------KGKTVQAGWFETDRYPA--KE-----GETIGPLVAKIARQ 60 (200) T ss_pred cceeeeeecchHHHHHHHHHHH-----h------------hCCeEEEEEcCCCCcCC--cc-----cccccchHHHHHhH Confidence 444334432 23333222211 0 11112223322211000 00 11234444 55999 Q ss_pred hhcCCCCCccc-cccCCcceeec---------------CCeeEEe--eeEecCCCCCCchhhhhHHHHHHHhHhhccC Q lcl|NC_011044. 78 VHEGSRPHRIV-ARHAQALHFFW---------------HGREIFR--KSVWHPGVRSRPFLRNAAQRIAAADPDIHMT 137 (137) Q Consensus 78 vE~GT~ph~i~-pk~~k~l~~~~---------------~g~~~~~--k~V~~pG~~a~pfl~~A~~~~~~~~~~i~~~ 137 (137) +|||+ +|+ |...+.+++.. .+..+|. +.|+ +||||||+++++++......+-.- T Consensus 61 ~EfG~---~i~~p~~~~~~~~~~~~g~~~g~rfv~k~~~~~~~~~~~~~v~---IP~RPFlr~t~~~~~~~~~~~~~~ 132 (200) T protein:vir:99 61 LEFGG---VINHPGGTKYIKDAIVDGRYVGTRFVHKSFQGEHEVTKAHQIV---IPARPFMRLAWATFNKDKVKIQAQ 132 (200) T ss_pred HHcCC---eeccCCCccccccccccccccccccccccccceeeeecccccc---CCCcchhhHHHHHHHHHHHHHHHH Confidence 99997 333 33333333321 1222332 3454 459999999999876644332211 No 137 >protein:vir:9823 Length: 118 # NCBI annotation: putative minor capsid protein # Family: family:all:899 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795585;genbank:gi:28876336;genbank:GeneID:1257873 Probab=95.70 E-value=0.00013 Score=41.97 Aligned_cols=109 Identities=15% Similarity=0.124 Sum_probs=57.0 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccchhhhhc Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAAAVHE 80 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~vE~ 80 (137) |.|...|+.-...|++ +.+++.....+.++.......+|.+||.|++|... .++ .|..+++||...-| T Consensus 2 ~kV~vdl~~~~~~ls~---~~~~k~Q~~~~~ev~~~~~~YVP~~tG~Lk~S~~i---~~~------~I~Y~tPYAr~qYY 69 (118) T protein:vir:98 2 AKVVVELGGIKRKVSP---QALAKGKLIMNNQVMMSMNPYVPYRDGALRGSSRA---NSV------GVTWSGPHARAQFY 69 (118) T ss_pred ceeeechhHHhhhhhH---HHHHHHHHHHHHHHHHHhhcCCCCccCccccceee---cCC------eeEECCchhhHhhh Confidence 7775555543333332 33445566667788888889999999999999542 111 25677899999998 Q ss_pred CCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchh--------hhhHHHHHHHhHhhc Q lcl|NC_011044. 81 GSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFL--------RNAAQRIAAADPDIH 135 (137) Q Consensus 81 GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl--------~~A~~~~~~~~~~i~ 135 (137) |..-... +...| +.-+|||+-++=|+ ...|.+...+-=-|+ T Consensus 70 ~~~~~~~-----~g~~~---------~~~~~p~~g~~Wd~R~ka~~~~~~~w~~~~~k~~g~k 118 (118) T protein:vir:98 70 GGAYNKY-----KSFKF---------KKYTTPGTGKRWDKRALANATIVKDWEKSLLRGMGFK 118 (118) T ss_pred ccccCCC-----Ccccc---------ccccCCCCCCcccchhhcchhhhHHHHHHHHHhcCCC Confidence 8421100 00111 01123333332222 122333322222223 No 138 >protein:vir:3036 Length: 118 # NCBI annotation: minor capsid protein # Family: family:all:899 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438149;genbank:gi:16271812;genbank:GeneID:929237 Probab=95.70 E-value=0.00013 Score=41.97 Aligned_cols=109 Identities=15% Similarity=0.124 Sum_probs=57.0 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccchhhhhc Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAAAVHE 80 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~vE~ 80 (137) |.|...|+.-...|++ +.+++.....+.++.......+|.+||.|++|... .++ .|..+++||...-| T Consensus 2 ~kV~vdl~~~~~~ls~---~~~~k~Q~~~~~ev~~~~~~YVP~~tG~Lk~S~~i---~~~------~I~Y~tPYAr~qYY 69 (118) T protein:vir:30 2 AKVVVELGGIKRKVSP---QALAKGKLIMNNQVMMSMNPYVPYRDGALRGSSRA---NSV------GVTWSGPHARAQFY 69 (118) T ss_pred ceeeechhHHhhhhhH---HHHHHHHHHHHHHHHHHhhcCCCCccCccccceee---cCC------eeEECCchhhHhhh Confidence 7775555543333332 33445566667788888889999999999999542 111 25677899999998 Q ss_pred CCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchh--------hhhHHHHHHHhHhhc Q lcl|NC_011044. 81 GSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFL--------RNAAQRIAAADPDIH 135 (137) Q Consensus 81 GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl--------~~A~~~~~~~~~~i~ 135 (137) |..-... +...| +.-+|||+-++=|+ ...|.+...+-=-|+ T Consensus 70 ~~~~~~~-----~g~~~---------~~~~~p~~g~~Wd~R~ka~~~~~~~w~~~~~k~~g~k 118 (118) T protein:vir:30 70 GGAYNKY-----KSFKF---------KKYTTPGTGKRWDKRALANATIVKDWEKSLLRGMGFK 118 (118) T ss_pred ccccCCC-----Ccccc---------ccccCCCCCCcccchhhcchhhhHHHHHHHHHhcCCC Confidence 8421100 00111 01123333332222 122333322222223 No 139 >protein:vir:98892 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:899 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164422;genbank:gi:56694912;genbank:GeneID:3197282 Probab=95.32 E-value=0.00014 Score=41.65 Aligned_cols=105 Identities=10% Similarity=-0.012 Sum_probs=56.7 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccchhhhhc Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAAAVHE 80 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~vE~ 80 (137) |.|...++.....+.+ ..++++...++.++.+.....+|.+||.|++|-... .+ .+.|..+++||.++-| T Consensus 2 mkvkv~~~~~~~~~~~---~~~~~aq~~~~~ev~~~~~~yVP~~~G~L~~s~~~~---s~----~g~I~y~tPYAr~qYY 71 (108) T protein:vir:98 2 PKIRVELSGAKDKLSP---QTQRRGQYAMANQMLQDMNQFVPMEEGILRLTGNIS---SD----AEEIYYNTPYAKRRFY 71 (108) T ss_pred ceeEeeehHHHHHHHH---HHHHHHHHHHHHHHHHhhcccCcCcCCccccceeec---cC----CceEEecChhhHHhhh Confidence 7776666654444433 233445566677788888889999999999994321 21 1468888999999999 Q ss_pred CCCCCccccccCCcceeecCCeeEEeee-EecCCCCCCchhhhhHHHHH-HHhHh Q lcl|NC_011044. 81 GSRPHRIVARHAQALHFFWHGREIFRKS-VWHPGVRSRPFLRNAAQRIA-AADPD 133 (137) Q Consensus 81 GT~ph~i~pk~~k~l~~~~~g~~~~~k~-V~~pG~~a~pfl~~A~~~~~-~~~~~ 133 (137) |..-+--.|..+ ..+|-+. ..| ...+.+.. ....+ T Consensus 72 g~~~n~~~p~ag---------~~W~eraka~~---------~~~~~~~~~k~~k~ 108 (108) T protein:vir:98 72 EPAYNYTTPGTG---------PRWDMKAKRLF---------ISDWERAYMKGANW 108 (108) T ss_pred ccccCCCCCCCc---------chhHHHHHhhh---------hHHHHHHHHHhhcC Confidence 953221122211 1111110 001 11122211 12222 No 140 >protein:vir:3848 Length: 159 # NCBI annotation: hypothetical protein # Family: family:all:1029 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050154;swissprot:trembl:q9t1f3;genbank:gi:9633046;uniprot:Q9T1F3;genbank:GeneID:1262148 Probab=95.28 E-value=0.00025 Score=40.32 Aligned_cols=111 Identities=14% Similarity=0.029 Sum_probs=65.0 Q ss_pred CceehhhhhhHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHhCCcc-----------------------chhhhc Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGA-------IFRGKHRSLTRRIATQARADVPVR-----------------------TGNLGR 50 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~-------~~~~~~~~~a~~i~~~ak~~aPv~-----------------------TG~Lr~ 50 (137) |.+ +|+..|..+.+++.+ .-.+++...|...+...+..+|.. +|+|++ T Consensus 1 mm~--~~~~~l~~~l~~v~k~~~~~~~~k~kiTkAGAkv~~e~L~~~Tp~~h~~~~k~~~~~~~~~k~~~~~~~~~HlaD 78 (159) T protein:vir:38 1 MAN--DMGEFYNNWVNEVEKGMKLSVEDKAKITGEGAEAFSTVLHDHTPRSNEIYRRGRSAGHANAKHHNRNRKTKHLQD 78 (159) T ss_pred Ccc--hHHHHHHHHHHHHHHhcCCCHHHHHHHHHHhHHHHHHHHHHhcccCCCccccccccccccccccCcCcCCCcccc Confidence 555 677777777777743 223456777888888889999982 369999 Q ss_pred cceeeee-cccC-cEEEEEEec----CccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhH Q lcl|NC_011044. 51 TVGELPQ-RYRP-FHVDGGVEA----TADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAA 124 (137) Q Consensus 51 SI~~~~~-~~~~-~~~~~~v~~----~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~ 124 (137) ||..... ...+ ..-...||- .+.+|.|++.||.-+. |+ -++..+|+..+. T Consensus 79 ~I~~~~~~~iDg~~dG~s~VGw~~~~~a~~a~f~NdGT~~m~--~k----------------------~~~gdHFvekt~ 134 (159) T protein:vir:38 79 SITYKPGYTADKLHTGDTDVGFEGKYYDFLAKIVNNGQHHMS--PK----------------------RYKNMHFLDKAQ 134 (159) T ss_pred ceeeecCccccccccceeeecccCCccceEeeecccCccccC--CC----------------------CccCChhHHHHH Confidence 9975432 1110 000123332 3577889999985431 11 134567888887 Q ss_pred HHHH-----HHhHhhccC Q lcl|NC_011044. 125 QRIA-----AADPDIHMT 137 (137) Q Consensus 125 ~~~~-----~~~~~i~~~ 137 (137) ++.+ ++.+..++. T Consensus 135 ~~~k~~Vl~A~~~~~~~i 152 (159) T protein:vir:38 135 QEAKKSVAEAELKAYKEV 152 (159) T ss_pred HHHHHHHHHHHHHHHHHH Confidence 6542 222333332 No 141 >protein:vir:98636 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039927;genbank:gi:126011102;genbank:GeneID:4818472 Probab=94.68 E-value=0.00036 Score=39.45 Aligned_cols=111 Identities=10% Similarity=0.068 Sum_probs=58.7 Q ss_pred Ccee------hhhhhhHHH-HHH-HHHHHHHHHHHHHHHHHHHHHHHhCCc--cchhhhccceeeeecccCcEEEEEEe- Q lcl|NC_011044. 1 MPVT------ARIHINEPE-LER-QSGAIFRGKHRSLTRRIATQARADVPV--RTGNLGRTVGELPQRYRPFHVDGGVE- 69 (137) Q Consensus 1 msv~------~~l~~~~~~-l~~-~~~~~~~~~~~~~a~~i~~~ak~~aPv--~TG~Lr~SI~~~~~~~~~~~~~~~v~- 69 (137) ||+- .++.++|+. |.+ .++.+.+++|+++++.++...|.+.++ |||..-+++........+..-++.|+ T Consensus 7 ~~~~aevkGv~Eilk~lE~klG~~~~~ri~nkAL~~~ge~v~~~lK~~~~~fkDTGat~dev~~s~p~~~~G~r~V~igW 86 (138) T protein:vir:98 7 MSGFANLKGVEELLANMEKKLGPAKVNRVVNRSLKEIGKELEPSFKSAISIYKRTGETTESAVVSGVRREDGIPKVKLGF 86 (138) T ss_pred ccccccccCHHHHHHHHHHhhCHHhhhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeeeeeecCeeecCCceEEEEee Confidence 6654 455566654 444 477888899999999999999999996 99997777654433322222223331 Q ss_pred cCccch--hhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHH-HhHhhccC Q lcl|NC_011044. 70 ATADYA--AAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAA-ADPDIHMT 137 (137) Q Consensus 70 ~~~~YA--~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~-~~~~i~~~ 137 (137) .+..|. +..|||++ ..|+|+... ++..|++.... ....|++- T Consensus 87 ~GpR~~ivHLNE~GyG-k~i~PrG~G-------------------------~I~ka~~~se~~y~~~vk~e 131 (138) T protein:vir:98 87 TTPRWNIVHLQELEYG-WKHNRRGVG-------------------------VIRRYSDILETIYPRGIRDK 131 (138) T ss_pred ecCeeeEEeeeccccc-CCcCCCcch-------------------------HHHHHHHhhhHHHHHHHHHH Confidence 111222 22345551 233333222 24444443221 12222222 No 142 >protein:vir:79687 Length: 113 # NCBI annotation: hypothetical protein # Family: family:all:899 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285886;genbank:gi:148750843;genbank:GeneID:5220386 Probab=93.99 E-value=0.00053 Score=38.53 Aligned_cols=102 Identities=22% Similarity=0.147 Sum_probs=54.5 Q ss_pred CceehhhhhhHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccchhhhh Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGA-IFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAAAVH 79 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~-~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~vE 79 (137) || +|..+.+.+.. .++++-..++.+|.......+|.|||.|++|... . ++.|..+++||.+.- T Consensus 1 ~~-------dL~~~~~~~~~~~~~raQ~~l~~ev~~~~~pYVP~~~G~Lk~S~~i---~------s~~I~y~tPYAr~qy 64 (113) T protein:vir:79 1 MS-------DLSVFSRMAQSTGSRSVRLQVLNQMHQDMEQYVPKRAGFLRSQSFV---N------DTGIHYTAKYARAQF 64 (113) T ss_pred Cc-------hHHHHHHhhchhHHHHHHHHHHHHHHHhhcccCcccccchhccccc---c------CCeeEecChhhhHhh Confidence 44 34444444432 4455566667888888899999999999999641 1 123777889999999 Q ss_pred cCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHH-HhHhhccC Q lcl|NC_011044. 80 EGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAA-ADPDIHMT 137 (137) Q Consensus 80 ~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~-~~~~i~~~ 137 (137) ||.-+. .+. +.-.+||+-++=|.+ |.-+-+. -++-.++. T Consensus 65 Yg~~~~--~~~----------------~~~t~p~ag~~W~er-aKa~h~~~w~~~~~~a 104 (113) T protein:vir:79 65 YGFVNG--HRV----------------RNYSTPGTGRRWDLK-AKAVYKADWQKVAVAA 104 (113) T ss_pred ccccCC--CCc----------------cccCCCCCCchhhHH-HHHHhHHHHHHHHHHH Confidence 984221 000 001124444433332 2221111 11111221 No 143 >protein:vir:105825 Length: 108 # NCBI annotation: gp9 # Family: family:all:3937 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655770;genbank:gi:109522093;genbank:GeneID:4157633 Probab=93.94 E-value=0.0001 Score=42.52 Aligned_cols=98 Identities=18% Similarity=0.192 Sum_probs=56.4 Q ss_pred Cc-e------ehhhhhhHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecC Q lcl|NC_011044. 1 MP-V------TARIHINEPELER--QSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEAT 71 (137) Q Consensus 1 ms-v------~~~l~~~~~~l~~--~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~ 71 (137) |. + .++|-..|..+.+ .+++-+.+++ .++....|.+.||.||..|+|.... ...-+.-.+.|+.. T Consensus 1 ma~gpt~knplakfgi~lddfdklpevnqgvnef~----dev~aawk~nspv~~g~yrdsvqvt--erstnkgrgkvgat 74 (108) T protein:vir:10 1 MANGPTRKNPLAKFGVRLDDFDKLPEVNQGVNEFI----DEVVAAWKNNSPVGTGAYRDSVQVT--ERSTNKGRGKVGAT 74 (108) T ss_pred CCCCCccccchhhhccchhhhhccchhhhhHHHHH----HHHHHhhhcCCCccccccccceeec--ccccccccccccCc Confidence 32 2 3455555555554 3444444444 4556678999999999999998642 22223346789999 Q ss_pred ccchhhhhcCCCCCc-cccccCCcceeecCCeeEEee Q lcl|NC_011044. 72 ADYAAAVHEGSRPHR-IVARHAQALHFFWHGREIFRK 107 (137) Q Consensus 72 ~~YA~~vE~GT~ph~-i~pk~~k~l~~~~~g~~~~~k 107 (137) -+.|+.||||.--.. .-|..+-+-.| |++.+.. T Consensus 75 dpqahlvefgs~hndeyapaqktakqf---ggtay~d 108 (108) T protein:vir:10 75 DPQAHLVEFGSAHNDEYAPAQKTAKQF---GGTAYGD 108 (108) T ss_pred chhhhhhhhhccccccccchhhhHHhh---cccccCC Confidence 999999999974211 11221111111 3333222 No 144 >protein:vir:102608 Length: 108 # NCBI annotation: gp9 # Family: family:all:3937 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655005;genbank:gi:109392195;genbank:GeneID:4157230 Probab=93.94 E-value=0.0001 Score=42.52 Aligned_cols=98 Identities=18% Similarity=0.192 Sum_probs=56.4 Q ss_pred Cc-e------ehhhhhhHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecC Q lcl|NC_011044. 1 MP-V------TARIHINEPELER--QSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEAT 71 (137) Q Consensus 1 ms-v------~~~l~~~~~~l~~--~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~ 71 (137) |. + .++|-..|..+.+ .+++-+.+++ .++....|.+.||.||..|+|.... ...-+.-.+.|+.. T Consensus 1 ma~gpt~knplakfgi~lddfdklpevnqgvnef~----dev~aawk~nspv~~g~yrdsvqvt--erstnkgrgkvgat 74 (108) T protein:vir:10 1 MANGPTRKNPLAKFGVRLDDFDKLPEVNQGVNEFI----DEVVAAWKNNSPVGTGAYRDSVQVT--ERSTNKGRGKVGAT 74 (108) T ss_pred CCCCCccccchhhhccchhhhhccchhhhhHHHHH----HHHHHhhhcCCCccccccccceeec--ccccccccccccCc Confidence 32 2 3455555555554 3444444444 4556678999999999999998642 22223346789999 Q ss_pred ccchhhhhcCCCCCc-cccccCCcceeecCCeeEEee Q lcl|NC_011044. 72 ADYAAAVHEGSRPHR-IVARHAQALHFFWHGREIFRK 107 (137) Q Consensus 72 ~~YA~~vE~GT~ph~-i~pk~~k~l~~~~~g~~~~~k 107 (137) -+.|+.||||.--.. .-|..+-+-.| |++.+.. T Consensus 75 dpqahlvefgs~hndeyapaqktakqf---ggtay~d 108 (108) T protein:vir:10 75 DPQAHLVEFGSAHNDEYAPAQKTAKQF---GGTAYGD 108 (108) T ss_pred chhhhhhhhhccccccccchhhhHHhh---cccccCC Confidence 999999999974211 11221111111 3333222 No 145 >protein:vir:99454 Length: 150 # NCBI annotation: hypothetical protein # Family: family:all:32760 # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919085;genbank:gi:119757043;genbank:GeneID:4606107 Probab=91.89 E-value=0.0049 Score=33.26 Aligned_cols=127 Identities=16% Similarity=0.172 Sum_probs=75.1 Q ss_pred CceehhhhhhH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHh-------CCccchhhhccceeeeecccCcEEEEEEecCc Q lcl|NC_011044. 1 MPVTARIHINE-PELERQSGAIFRGKHRSLTRRIATQARAD-------VPVRTGNLGRTVGELPQRYRPFHVDGGVEATA 72 (137) Q Consensus 1 msv~~~l~~~~-~~l~~~~~~~~~~~~~~~a~~i~~~ak~~-------aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~ 72 (137) |+-...|..++ +.+.+.+.+.. -.++|-.+|..|-.. ..-+--.|-..-.+++.+..+ .++..-|- - T Consensus 1 mt~l~~f~~d~re~lld~le~~a---reeiap~vq~~ahdile~yg~~hdydv~~iiea~et~v~rr~~-rvvvr~gw-p 75 (150) T protein:vir:99 1 MTTLAGFEADAREALLDELEDHA---REEIAPAVQQHAHDILEAYGRENDYDVQSIIDAAETRVERRKG-SVVVRWGW-P 75 (150) T ss_pred CCccchhhHHHHHHHHHHHHHHH---HHhhhHHHHHHHHHHHHHhccccccchhhhhhhhhhheeecCC-eEEEEecC-C Confidence 88877777665 33334444332 333444444433221 111111111111333333332 33334343 4 Q ss_pred cchhhhhcCCCCCccccccCCcceeecC---------------CeeEEeeeEecCCCCCCchhhhhHHHHHHHhH Q lcl|NC_011044. 73 DYAAAVHEGSRPHRIVARHAQALHFFWH---------------GREIFRKSVWHPGVRSRPFLRNAAQRIAAADP 132 (137) Q Consensus 73 ~YA~~vE~GT~ph~i~pk~~k~l~~~~~---------------g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~~ 132 (137) +-|+|.|-||--|.+..+++-.|.|.|. |-.+|-..|...|-|-..|++.+|.-...+-- T Consensus 76 epaiyfergt~dhvvea~nad~lsfvwedpp~wvre~fe~e~~g~rvfl~e~~v~glpesrfirdtln~lr~~fa 150 (150) T protein:vir:99 76 EPAIFFERGTVDHVVEATNADVLSFIWEDPPRWVRQGYEREGGGWRVFLPEVEVSGLPESRFIRDTLNWLRRRFA 150 (150) T ss_pred CcceeeeccchhhhhhccccchhhhhhcCchhHhHhhcCcCCCceEEEeecccccCCcchhhHHHHHHHHHHhcC Confidence 6799999999999999999999999664 22356677888899999999999875544333 No 146 >protein:vir:93898 Length: 133 # NCBI annotation: ORF028 # Family: family:all:589 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239942;genbank:gi:66395616;genbank:GeneID:5130964 Probab=91.61 E-value=0.0028 Score=34.57 Aligned_cols=108 Identities=8% Similarity=-0.021 Sum_probs=58.4 Q ss_pred Ccee----hhhhhhHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHhCCc--cchhhhccceeeeecccCc--EEEEEEe- Q lcl|NC_011044. 1 MPVT----ARIHINEPEL--ERQSGAIFRGKHRSLTRRIATQARADVPV--RTGNLGRTVGELPQRYRPF--HVDGGVE- 69 (137) Q Consensus 1 msv~----~~l~~~~~~l--~~~~~~~~~~~~~~~a~~i~~~ak~~aPv--~TG~Lr~SI~~~~~~~~~~--~~~~~v~- 69 (137) |||. .+|.++|+.- ...++.+.+++|..+++.++...|.+..+ |||..-+++.......... .-.+.|+ T Consensus 1 msvevkGv~eilk~le~k~G~~~~~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~i~W 80 (133) T protein:vir:93 1 MSVEIKGIPEVLKKLESVYGKQSMQAKSDRALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEW 80 (133) T ss_pred CeEEEecHHHHHHHHHHhhCHhhhHhhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceEEEEEe Confidence 8885 4556666443 34677888899999999999999999885 9998888776443321111 1223332 Q ss_pred --cCccch--hhhhcCC--CCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHH--------HHHHhHh Q lcl|NC_011044. 70 --ATADYA--AAVHEGS--RPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQR--------IAAADPD 133 (137) Q Consensus 70 --~~~~YA--~~vE~GT--~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~--------~~~~~~~ 133 (137) +.-.|. +..|||. .++.|+|+.-.. +..|++. ++....+ T Consensus 81 ~gp~~R~~iVHLNE~Gytr~Gk~i~PrG~G~-------------------------i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:93 81 VGPMNRKNIIHLNEHGYTRDGKKYTPRGFGV-------------------------IAKTLAANERKYREIIKKELAR 133 (133) T ss_pred ecCCCceeEEEeeccceecCCCeEccchhhH-------------------------HHHHHHhhhHHHHHHHHHHhcC Confidence 222333 3346663 122333332221 2233322 2222222 No 147 >protein:vir:5257 Length: 148 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852762;genbank:gi:31544037;uniprot:Q7Y5T8;genbank:GeneID:2753554 Probab=91.13 E-value=0.0006 Score=38.25 Aligned_cols=83 Identities=19% Similarity=0.119 Sum_probs=39.2 Q ss_pred Cceehhhhh-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccchhhhh Q lcl|NC_011044. 1 MPVTARIHI-NEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAAAVH 79 (137) Q Consensus 1 msv~~~l~~-~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~vE 79 (137) |++..+.+. .+++|.+.+... .+. -|.-|-+...= +...+. . =.+.+..|.++| T Consensus 1 M~~~~k~~~~~~~~l~~~l~~l---------------~~~--~v~VGi~~~~~--~~~~~~-~-----g~~vA~ia~~~E 55 (148) T protein:vir:52 1 MAVTVTANFSAAKQLIEQMKSL---------------KEK--AVYVGFPAEFD--EKVKGS-E-----NFNLASLAAVLE 55 (148) T ss_pred CccccccccHHHHHHHHHHHHh---------------hCC--eEEEEeecCcC--CCCCCC-C-----CCCHHHHHHHHh Confidence 999766543 233332222211 111 11223221100 000010 0 134578999999 Q ss_pred cCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHhH-hhccC Q lcl|NC_011044. 80 EGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAADP-DIHMT 137 (137) Q Consensus 80 ~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~~-~i~~~ 137 (137) ||+. .+|++|||+++++++..... .|... T Consensus 56 ~G~~-----------------------------~IP~Rpflr~t~~~~~~~~~~~~~~~ 85 (148) T protein:vir:52 56 FGNE-----------------------------HIPARPFLRQTLEENQEKYTALFIQW 85 (148) T ss_pred cCCC-----------------------------CCCCcchhHHHHHHHHHHHHHHHHHH Confidence 9951 36699999999998654221 12111 No 148 >protein:vir:78163 Length: 92 # NCBI annotation: hypothetical protein # Family: family:all:29889 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294805;genbank:gi:149882826;genbank:GeneID:5309191 Probab=91.09 E-value=0.0005 Score=38.70 Aligned_cols=91 Identities=14% Similarity=0.103 Sum_probs=57.3 Q ss_pred CceehhhhhhHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccchhhhh Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGA-IFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAAAVH 79 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~-~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~vE 79 (137) |. .-|.-+-..+...++. -++..+.-+|+.....||.++|||||..|+.++.+.+......-...||++ +--..|| T Consensus 1 ma--daftpNp~~FDqIl~s~~VrALt~gaAe~aLa~AKAsAPVDTGAYRDGL~iE~~q~~~RtT~MVVG~D-~KTlLvE 77 (92) T protein:vir:78 1 MA--DAFTPNPTWFDQIMRTPKVRALVDGVAEETLADAKASAPVDTGAYRDGLHIEHRQGRSRETAMVVGSD-EKTLLIE 77 (92) T ss_pred CC--CccCCChhHHHHhhcccchhhhhhhhhhhhhhhhcccCcccccccccccchhhhhccccceeEEeecC-cceeeee Confidence 33 3456666666665553 455666777888899999999999999999998877666665555566665 3445566 Q ss_pred cCCCCCccccccCCcceeecCCeeEEeee Q lcl|NC_011044. 80 EGSRPHRIVARHAQALHFFWHGREIFRKS 108 (137) Q Consensus 80 ~GT~ph~i~pk~~k~l~~~~~g~~~~~k~ 108 (137) --|+-. +++|+-. ++ T Consensus 78 SrTGNL------akalk~~--------rs 92 (92) T protein:vir:78 78 SRTGNL------ARSVKRR--------RS 92 (92) T ss_pred cccchH------HHHHhhh--------cC Confidence 655421 1111100 00 No 149 >protein:vir:94069 Length: 168 # NCBI annotation: putative RNA polymerase # Family: family:all:503 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453622;genbank:gi:84662658;genbank:GeneID:5142579 Probab=90.76 E-value=0.00048 Score=38.76 Aligned_cols=84 Identities=18% Similarity=0.104 Sum_probs=33.1 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceee----------eecccCcEEEEEEec Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGEL----------PQRYRPFHVDGGVEA 70 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~----------~~~~~~~~~~~~v~~ 70 (137) |+...+ .-++..... +.. .... -++-|-|...--.. .......+ .+ T Consensus 1 ~~~~~~--~g~~~~~~~------------~~~----l~~~-~v~vG~l~~a~yp~G~~~~~~~~~~~~~~~~g-----~~ 56 (168) T protein:vir:94 1 MTTIAR--KGVKMPPHL------------EAQ----FQSG-EVKAGVLSGSTYPQMTYTDQRTGKQIEDARGG-----MP 56 (168) T ss_pred Cccccc--hhhhhhHHH------------HHh----hhcc-ceeeeccccCcccccccchhhccccccccccc-----cc Confidence 444221 111111111 111 1111 11223222110000 00000000 13 Q ss_pred CccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHhHh-hccC Q lcl|NC_011044. 71 TADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAADPD-IHMT 137 (137) Q Consensus 71 ~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~~~-i~~~ 137 (137) .+.+|.|+|||+. .+|+||||+++++++...... |... T Consensus 57 va~Ia~~~E~G~~-----------------------------~IP~RPFlr~t~~~~~~~~~~~~~~~ 95 (168) T protein:vir:94 57 VAVIAQALEYGHG-----------------------------QNHPRPFMQQTYAAQYRAWSRDLTLT 95 (168) T ss_pred HHHHHHHHhcCCC-----------------------------CCCCchhhHHHHHHHHHHHHHHHHHH Confidence 4688999999952 366999999999876543221 1111 No 150 >protein:vir:80037 Length: 199 # NCBI annotation: gp11 # Family: family:all:503 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468715;genbank:gi:157325295;genbank:GeneID:5601728 Probab=90.31 E-value=0.00099 Score=37.06 Aligned_cols=98 Identities=15% Similarity=0.039 Sum_probs=41.5 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccchhhhhc Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAAAVHE 80 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~vE~ 80 (137) |+|+.--+ .+.++.+.+ +.+ ++ .++.......++ . .-+.-|...|| T Consensus 1 m~vt~~~~-~~~~~~~~l-------------------~~L----~~---k~v~vGi~~~d~-~------~~~~Ia~~~E~ 46 (199) T protein:vir:80 1 MKVTTDKS-TMNKAIREL-------------------DQL----DR---YSLQIGLFGEDD-S------FIQMIAGVHEF 46 (199) T ss_pred CcccccHH-HHHHHHHHH-------------------HHh----cC---CEEEEEEecCCC-c------chhheeehhhc Confidence 88853211 112111111 112 11 222221111111 0 12567888899 Q ss_pred CCCCCccccccCCccee-----------------ecCCeeE-------------EeeeEecCCCCCCchhhhhHHHHHHH Q lcl|NC_011044. 81 GSRPHRIVARHAQALHF-----------------FWHGREI-------------FRKSVWHPGVRSRPFLRNAAQRIAAA 130 (137) Q Consensus 81 GT~ph~i~pk~~k~l~~-----------------~~~g~~~-------------~~k~V~~pG~~a~pfl~~A~~~~~~~ 130 (137) |. +|+|++ +.|.. +..|... +..+ .+-.+|+||||+++++++... T Consensus 47 Ga---~I~~~~-~~l~Ip~~~a~~~k~~~~~~~~~p~g~~~~~~~~~~~~~~~~e~g~-~~~~IP~RPFlr~t~~~~~~~ 121 (199) T protein:vir:80 47 GL---TIRPKG-KYLTIPTPEAGDRRARDIPGLFKPKGKNILAVAGPDGKLTVMFYLK-TEVNIPERSFLRSTFDEKSNK 121 (199) T ss_pred CC---eeecCC-ceeeecchhhhcccccccCcccccCCcceeeeeccccceeeeeecc-ccccCCCCchhHHHHHHHHHH Confidence 97 566664 34332 1111111 1110 011457999999999987554 Q ss_pred hHhh-----ccC Q lcl|NC_011044. 131 DPDI-----HMT 137 (137) Q Consensus 131 ~~~i-----~~~ 137 (137) .... +.+ T Consensus 122 ~~~~~~~~~~~v 133 (199) T protein:vir:80 122 WGELFEGWIDDV 133 (199) T ss_pred HHHHHHHHHHHH Confidence 3221 111 No 151 >protein:vir:78894 Length: 105 # NCBI annotation: gp10 # Family: family:all:29989 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468850;genbank:gi:157325424;genbank:GeneID:5601891 Probab=89.91 E-value=0.0013 Score=36.48 Aligned_cols=97 Identities=12% Similarity=0.009 Sum_probs=43.0 Q ss_pred CceehhhhhhHHHHHHHHHHHHHH-HHH---HHHHHHHHHHHHhCCccchhhhccceeeeecccCcEEEEEEecCccchh Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRG-KHR---SLTRRIATQARADVPVRTGNLGRTVGELPQRYRPFHVDGGVEATADYAA 76 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~-~~~---~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~ 76 (137) ||.+. |+ ..+.+.+.+ .|. ++-.++.+.+.-.+|.+||.|++|-.... ..+...++..++.-++||. T Consensus 1 ~~f~~-f~-------~~~~k~l~kr~L~~~g~vq~EvlR~~~PyvP~~tG~Lk~S~~l~t-vIgsg~I~y~~~~~aPYAr 71 (105) T protein:vir:78 1 MSFSS-FK-------DAVIDDIHNKALSTAAKAGGELVELAQPVTPILYGDLRRSSYFKI-IIQKNSIVARVFSLTPYAR 71 (105) T ss_pred CCccc-cc-------chHHHHHHHhcCCCCchhhHHHHHHhCCCCcccccccccccccce-eecCCeeEeeccccCchhh Confidence 76532 22 222222211 111 11123444555668999999999954332 2333345555566689999 Q ss_pred hhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHhHhhccC Q lcl|NC_011044. 77 AVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAADPDIHMT 137 (137) Q Consensus 77 ~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~~~i~~~ 137 (137) +.-|..+ +++ .|| .++.-+-++-+.+|-+- T Consensus 72 ~qYYe~~-------Rg~-~Wf-----------------------Erm~a~hk~~I~~~veg 101 (105) T protein:vir:78 72 RQYYENR-------RNP-RWY-----------------------EMAVSYGIQSINQIVEG 101 (105) T ss_pred hhhhccc-------CCC-chh-----------------------HHhhhcchhHHHHHHhc Confidence 9988652 111 111 11111111111111111 No 152 >protein:vir:1087 Length: 161 # NCBI annotation: Orf46 # Family: family:all:1029 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076741;genbank:gi:13095851;genbank:GeneID:920400 Probab=88.59 E-value=0.0084 Score=31.97 Aligned_cols=125 Identities=13% Similarity=0.070 Sum_probs=66.9 Q ss_pred CceehhhhhhHHHHHHHHHHHH--------HHHHHHHHHHHHHHHHHhCCc------cch---hhhccceeeeecc---- Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIF--------RGKHRSLTRRIATQARADVPV------RTG---NLGRTVGELPQRY---- 59 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~--------~~~~~~~a~~i~~~ak~~aPv------~TG---~Lr~SI~~~~~~~---- 59 (137) |.--.=|+..|..+.+++...+ .+.....|...+......+|. +|| +|++||....... T Consensus 2 ~~~~~~fdd~L~~~~~~v~klv~~lt~e~kakIT~AGAkv~a~~L~~~T~~kHy~~~kt~k~~HLADsI~~~~~niDg~~ 81 (161) T protein:vir:10 2 MEEKQLFEDIMNGIIFQAESVSTSLTVEDKAKITKAGANAFAIGLEKVTKDKHYRIRKTGENPHLADSILVQNTNIDGIK 81 (161) T ss_pred cchhHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHhHHHHHHHHHHHhhhhcCcCCCCCCcchhhhheeecccccCccc Confidence 4443337777777777775422 123445555555555555554 455 9999997544322 Q ss_pred cCcEEEEEEecCccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHH-------Hh- Q lcl|NC_011044. 60 RPFHVDGGVEATADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAA-------AD- 131 (137) Q Consensus 60 ~~~~~~~~v~~~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~-------~~- 131 (137) .+.+.+|--+.-+.-|.|++.||+--.+...+.+.+. +++ |. +++.+|+..+-+++.+ +. T Consensus 82 dG~StVGw~~kka~ia~~indGtr~~~~~~~~~~~~n----~Gt-----~~---i~gDHFvd~~r~~~~~k~aV~~Ae~~ 149 (161) T protein:vir:10 82 DGNSTVGWDYTKSRVGHLIENGTRFPMYSKKGTKYRK----GGQ-----VA---ITSDPFVSTYRDSMEAQVAMFSAEAE 149 (161) T ss_pred CCceeccccCchhhhhhhhcccchhhhhhcccccccC----Ccc-----ee---ecCcchhHHHHhhhhhHHHHHHHHHH Confidence 1222333323446779999999974333333333322 222 22 5588999887664322 11 Q ss_pred --HhhccC Q lcl|NC_011044. 132 --PDIHMT 137 (137) Q Consensus 132 --~~i~~~ 137 (137) ++|-+- T Consensus 150 ~y~eil~~ 157 (161) T protein:vir:10 150 VFSEILKK 157 (161) T ss_pred HHHHHHHh Confidence 233222 No 153 >protein:vir:96973 Length: 133 # NCBI annotation: ORF034 # Family: family:all:589 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239864;genbank:gi:66395542;genbank:GeneID:5133006 Probab=88.38 E-value=0.01 Score=31.55 Aligned_cols=108 Identities=8% Similarity=-0.039 Sum_probs=58.0 Q ss_pred Ccee----hhhhhhHHH-HHH-HHHHHHHHHHHHHHHHHHHHHHHhCCc--cchhhhccceeeeecccCc--EEEEEEe- Q lcl|NC_011044. 1 MPVT----ARIHINEPE-LER-QSGAIFRGKHRSLTRRIATQARADVPV--RTGNLGRTVGELPQRYRPF--HVDGGVE- 69 (137) Q Consensus 1 msv~----~~l~~~~~~-l~~-~~~~~~~~~~~~~a~~i~~~ak~~aPv--~TG~Lr~SI~~~~~~~~~~--~~~~~v~- 69 (137) |||. .+|.++|+. |.+ .++.+.+++|..+++.++...|.+..+ |||..-+++.......... .-.+.|+ T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~i~W 80 (133) T protein:vir:96 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEW 80 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceeEEEEe Confidence 8885 455565543 443 557788899999999999999999875 9998888876443322111 1223332 Q ss_pred --cCccch--hhhhcCC--CCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHH--------HHHHHhHh Q lcl|NC_011044. 70 --ATADYA--AAVHEGS--RPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQ--------RIAAADPD 133 (137) Q Consensus 70 --~~~~YA--~~vE~GT--~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~--------~~~~~~~~ 133 (137) +.-.|. +..|||. .++.|+|+.-.. +..|++ .++....+ T Consensus 81 ~gp~~R~~iVHLNE~Gytr~Gk~i~PrG~G~-------------------------i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:96 81 VGPMNRKNIIHLNEHGYTRDGKKYTPRGFGV-------------------------IAKTLAASERKYREIIKKELAR 133 (133) T ss_pred ecCCCceeEEEeeccceecCCCeEccchhhH-------------------------HHHHHHhhhHHHHHHHHHHhcC Confidence 222333 3346663 122333332221 222222 22222222 No 154 >protein:vir:78644 Length: 133 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429946;genbank:gi:156604000;genbank:GeneID:5525390 Probab=88.38 E-value=0.01 Score=31.55 Aligned_cols=108 Identities=8% Similarity=-0.039 Sum_probs=58.0 Q ss_pred Ccee----hhhhhhHHH-HHH-HHHHHHHHHHHHHHHHHHHHHHHhCCc--cchhhhccceeeeecccCc--EEEEEEe- Q lcl|NC_011044. 1 MPVT----ARIHINEPE-LER-QSGAIFRGKHRSLTRRIATQARADVPV--RTGNLGRTVGELPQRYRPF--HVDGGVE- 69 (137) Q Consensus 1 msv~----~~l~~~~~~-l~~-~~~~~~~~~~~~~a~~i~~~ak~~aPv--~TG~Lr~SI~~~~~~~~~~--~~~~~v~- 69 (137) |||. .+|.++|+. |.+ .++.+.+++|..+++.++...|.+..+ |||..-+++.......... .-.+.|+ T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~i~W 80 (133) T protein:vir:78 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEW 80 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceeEEEEe Confidence 8885 455565543 443 557788899999999999999999875 9998888876443322111 1223332 Q ss_pred --cCccch--hhhhcCC--CCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHH--------HHHHHhHh Q lcl|NC_011044. 70 --ATADYA--AAVHEGS--RPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQ--------RIAAADPD 133 (137) Q Consensus 70 --~~~~YA--~~vE~GT--~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~--------~~~~~~~~ 133 (137) +.-.|. +..|||. .++.|+|+.-.. +..|++ .++....+ T Consensus 81 ~gp~~R~~iVHLNE~Gytr~Gk~i~PrG~G~-------------------------i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:78 81 VGPMNRKNIIHLNEHGYTRDGKKYTPRGFGV-------------------------IAKTLAASERKYREIIKKELAR 133 (133) T ss_pred ecCCCceeEEEeeccceecCCCeEccchhhH-------------------------HHHHHHhhhHHHHHHHHHHhcC Confidence 222333 3346663 122333332221 222222 22222222 No 155 >protein:vir:9363 Length: 133 # NCBI annotation: SLT orf 123-like protein # Family: family:all:589 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803341;genbank:gi:29028652;genbank:GeneID:1258087 Probab=88.38 E-value=0.01 Score=31.55 Aligned_cols=108 Identities=8% Similarity=-0.039 Sum_probs=58.0 Q ss_pred Ccee----hhhhhhHHH-HHH-HHHHHHHHHHHHHHHHHHHHHHHhCCc--cchhhhccceeeeecccCc--EEEEEEe- Q lcl|NC_011044. 1 MPVT----ARIHINEPE-LER-QSGAIFRGKHRSLTRRIATQARADVPV--RTGNLGRTVGELPQRYRPF--HVDGGVE- 69 (137) Q Consensus 1 msv~----~~l~~~~~~-l~~-~~~~~~~~~~~~~a~~i~~~ak~~aPv--~TG~Lr~SI~~~~~~~~~~--~~~~~v~- 69 (137) |||. .+|.++|+. |.+ .++.+.+++|..+++.++...|.+..+ |||..-+++.......... .-.+.|+ T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~i~W 80 (133) T protein:vir:93 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEW 80 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceeEEEEe Confidence 8885 455565543 443 557788899999999999999999875 9998888876443322111 1223332 Q ss_pred --cCccch--hhhhcCC--CCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHH--------HHHHHhHh Q lcl|NC_011044. 70 --ATADYA--AAVHEGS--RPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQ--------RIAAADPD 133 (137) Q Consensus 70 --~~~~YA--~~vE~GT--~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~--------~~~~~~~~ 133 (137) +.-.|. +..|||. .++.|+|+.-.. +..|++ .++....+ T Consensus 81 ~gp~~R~~iVHLNE~Gytr~Gk~i~PrG~G~-------------------------i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:93 81 VGPMNRKNIIHLNEHGYTRDGKKYTPRGFGV-------------------------IAKTLAASERKYREIIKKELAR 133 (133) T ss_pred ecCCCceeEEEeeccceecCCCeEccchhhH-------------------------HHHHHHhhhHHHHHHHHHHhcC Confidence 222333 3346663 122333332221 222222 22222222 No 156 >protein:vir:94419 Length: 133 # NCBI annotation: ORF028 # Family: family:all:589 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240010;genbank:gi:66395683;genbank:GeneID:5133079 Probab=88.38 E-value=0.01 Score=31.55 Aligned_cols=108 Identities=8% Similarity=-0.039 Sum_probs=58.0 Q ss_pred Ccee----hhhhhhHHH-HHH-HHHHHHHHHHHHHHHHHHHHHHHhCCc--cchhhhccceeeeecccCc--EEEEEEe- Q lcl|NC_011044. 1 MPVT----ARIHINEPE-LER-QSGAIFRGKHRSLTRRIATQARADVPV--RTGNLGRTVGELPQRYRPF--HVDGGVE- 69 (137) Q Consensus 1 msv~----~~l~~~~~~-l~~-~~~~~~~~~~~~~a~~i~~~ak~~aPv--~TG~Lr~SI~~~~~~~~~~--~~~~~v~- 69 (137) |||. .+|.++|+. |.+ .++.+.+++|..+++.++...|.+..+ |||..-+++.......... .-.+.|+ T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~i~W 80 (133) T protein:vir:94 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEW 80 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceeEEEEe Confidence 8885 455565543 443 557788899999999999999999875 9998888876443322111 1223332 Q ss_pred --cCccch--hhhhcCC--CCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHH--------HHHHHhHh Q lcl|NC_011044. 70 --ATADYA--AAVHEGS--RPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQ--------RIAAADPD 133 (137) Q Consensus 70 --~~~~YA--~~vE~GT--~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~--------~~~~~~~~ 133 (137) +.-.|. +..|||. .++.|+|+.-.. +..|++ .++....+ T Consensus 81 ~gp~~R~~iVHLNE~Gytr~Gk~i~PrG~G~-------------------------i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:94 81 VGPMNRKNIIHLNEHGYTRDGKKYTPRGFGV-------------------------IAKTLAASERKYREIIKKELAR 133 (133) T ss_pred ecCCCceeEEEeeccceecCCCeEccchhhH-------------------------HHHHHHhhhHHHHHHHHHHhcC Confidence 222333 3346663 122333332221 222222 22222222 No 157 >protein:vir:78607 Length: 155 # NCBI annotation: BcepNY3gp06 # Family: family:all:503 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294843;genbank:gi:149882906;genbank:GeneID:5291078 Probab=87.62 E-value=0.0014 Score=36.23 Aligned_cols=85 Identities=19% Similarity=0.160 Sum_probs=33.6 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeee----ccc-CcEEEEEE-ecCccc Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQ----RYR-PFHVDGGV-EATADY 74 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~----~~~-~~~~~~~v-~~~~~Y 74 (137) |+|..+ .|+.+.++++ . .-|.-|-+..+=.-... ... .......- .+.+.+ T Consensus 1 m~v~~k---~L~~~~~~l~---------------~-----~~v~VGi~~~a~y~d~~~~~~~~~~~~~~~~~~g~~va~i 57 (155) T protein:vir:78 1 MSVTRR---GLTLPKDRYR---------------S-----MSVKAGVLAGATYPDESGKKLADGTILTKDPRAGLPVAMI 57 (155) T ss_pred CcchHH---HHHHHHHHHh---------------C-----CeeEEeecCCCCCCcccchhhhhhhhcccccccCCcHHHH Confidence 666332 1222211111 0 01222222221000000 000 00000001 224568 Q ss_pred hhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHhH-hhccC Q lcl|NC_011044. 75 AAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAADP-DIHMT 137 (137) Q Consensus 75 A~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~~-~i~~~ 137 (137) |.|.|||+ + ++||||||+++++++..... .+... T Consensus 58 a~~~E~G~--------------------------~---~IP~RPFlr~t~~~~~~~~~~~l~~~ 92 (155) T protein:vir:78 58 AMALNYGT--------------------------S---KLPARPFMEKTITDRSAEWIKGLTVM 92 (155) T ss_pred HHhhhcCC--------------------------C---CCCCcchhhHHHHHHHHHHHHHHHHH Confidence 88889884 2 36699999999998766321 11111 No 158 >protein:vir:487 Length: 187 # NCBI annotation: hypothetical protein # Family: family:all:2152 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543094;swissprot:trembl:q8w625;genbank:gi:18249906;uniprot:Q8W625;genbank:GeneID:929690 Probab=87.11 E-value=0.0042 Score=33.62 Aligned_cols=129 Identities=17% Similarity=0.147 Sum_probs=72.5 Q ss_pred CceehhhhhhHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHh-----------CCc-cchhhhccceeeeecc--cCcEE Q lcl|NC_011044. 1 MPVTARIHINEPELERQ--SGAIFRGKHRSLTRRIATQARAD-----------VPV-RTGNLGRTVGELPQRY--RPFHV 64 (137) Q Consensus 1 msv~~~l~~~~~~l~~~--~~~~~~~~~~~~a~~i~~~ak~~-----------aPv-~TG~Lr~SI~~~~~~~--~~~~~ 64 (137) |+.+.-|+.+..+..+. -...++++..+++......|+.+ .|. +||.|.+||...+.+. .-.++ T Consensus 14 m~~~~~lHvdF~qp~~~~Fnr~riRraF~~iGq~h~r~ArrLvm~RGrs~pge~P~~qTGrLa~SIgy~Vpkat~~RpG~ 93 (187) T protein:vir:48 14 MNQTAFLHVDFKQPKELEFNRARLRRAFVQIGRVYMRDARRLVIKRGRSGPGENPGYQTGRLARSIGYYVPKKTTRRPGL 93 (187) T ss_pred hhhccceeEeeecCCceeecHHHHHHHHHHHhHHHHHHHHHHHHhcccCCCCCCCcchhhhhhhhhhhccccccCCCCcc Confidence 88777777766655542 23566677777766666666654 344 7999999998655421 12234 Q ss_pred EEEEecC--------------ccchhhhhcCCCCCcccccc-CCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHH Q lcl|NC_011044. 65 DGGVEAT--------------ADYAAAVHEGSRPHRIVARH-AQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAA 129 (137) Q Consensus 65 ~~~v~~~--------------~~YA~~vE~GT~ph~i~pk~-~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~ 129 (137) -+.|-.| ..|=.|++||-+.-...-.. .+.-.-.-.++|-. .|-+-||..+|++.+. T Consensus 94 mVkIaPNqk~G~g~r~~Pi~gdfYPafL~YGVr~ga~~~~~~~k~~~~~~~sgwri--------aPR~Nym~~~L~~~~~ 165 (187) T protein:vir:48 94 MVKISPNQKNGQGNRRFPEGAPYYPAFLYYGVRHSAYGMDKKDKRQKKHHSSTFRL--------APRNNFMADVIERRRH 165 (187) T ss_pred eEEecCCcccCcccccccccccchhHHHHhhhhhhhhccchhhhhhhcccCCccee--------ccchhHHHHHHHhhHH Confidence 4555443 37999999997532211100 00000001133332 3457799999987665 Q ss_pred HhHhhccC Q lcl|NC_011044. 130 ADPDIHMT 137 (137) Q Consensus 130 ~~~~i~~~ 137 (137) .-..|--. T Consensus 166 wt~~~L~r 173 (187) T protein:vir:48 166 WTQELLSR 173 (187) T ss_pred HHHHHHHH Confidence 44333222 No 159 >protein:vir:101563 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958111;genbank:gi:41057657;genbank:GeneID:2716820 Probab=87.05 E-value=0.00086 Score=37.38 Aligned_cols=85 Identities=20% Similarity=0.189 Sum_probs=33.9 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccc-----eeeeecccCcEEEEEE-ecCccc Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTV-----GELPQRYRPFHVDGGV-EATADY 74 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI-----~~~~~~~~~~~~~~~v-~~~~~Y 74 (137) |||..+ .|+.+.++++. . -|.-|-+..+= ...+............ .+.+.+ T Consensus 1 m~v~r~---~L~~~~~~l~~-------------------~-~V~VGi~~~a~y~d~~g~~~~~g~~~~~~~~~G~pva~i 57 (155) T protein:vir:10 1 MSVTRR---GLTLPKDRYKS-------------------M-SVKAGVLAGATYPDESGKKLADGTILKKDPRAGLPVAMI 57 (155) T ss_pred CcchHH---HHHHHHHHhhC-------------------C-eeEEeecCCCCCCccccchhhhhhhhccccccCcchhhh Confidence 777432 12222211111 0 02222222210 0000000000000011 223567 Q ss_pred hhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHhH-hhccC Q lcl|NC_011044. 75 AAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAADP-DIHMT 137 (137) Q Consensus 75 A~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~~-~i~~~ 137 (137) |.|+|||| + .+||||||+++++++..... .+.+. T Consensus 58 a~~~e~G~--------------------------~---~IP~RPFlr~t~~~~~~~~~~~l~~~ 92 (155) T protein:vir:10 58 AMALNYGT--------------------------S---KLPARPFMEKTIADRSAEWIKGLTVM 92 (155) T ss_pred hhhhhcCC--------------------------C---CCCCcchhHHHHHHHHHHHHHHHHHH Confidence 88888884 2 25699999999998766322 11111 No 160 >protein:vir:78335 Length: 133 # NCBI annotation: gp9 # Family: family:all:589 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468648;genbank:gi:157325225;genbank:GeneID:5601681 Probab=86.85 E-value=0.013 Score=31.01 Aligned_cols=112 Identities=11% Similarity=0.036 Sum_probs=55.8 Q ss_pred Ccee----hhhhhhHHH-HHH-HHHHHHHHHHHHHHHHHHHHHHHhC--CccchhhhccceeeeecccCcEEEEEEec-- Q lcl|NC_011044. 1 MPVT----ARIHINEPE-LER-QSGAIFRGKHRSLTRRIATQARADV--PVRTGNLGRTVGELPQRYRPFHVDGGVEA-- 70 (137) Q Consensus 1 msv~----~~l~~~~~~-l~~-~~~~~~~~~~~~~a~~i~~~ak~~a--Pv~TG~Lr~SI~~~~~~~~~~~~~~~v~~-- 70 (137) |||. .+|.++|+. |.+ .++.+.+++|..+++.++...|.+. ..|||..-+++........+..-++.|+= T Consensus 1 msvevkGv~eilk~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~G~r~V~i~W~g 80 (133) T protein:vir:78 1 MSVEVTGVEELERQLVSLFGRENLPQLVDPALIAGATLVAKTLKSEFVQFKDTGASIDEINIEKPSYDKGVRSIKIDWKG 80 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHHhhcchhcccceeeeEEecCeeeeCCceEEEEEEec Confidence 8885 455566543 443 5577888999999999999999964 44999877777544332222222333321 Q ss_pred -Cccch--hhhhcCCC--CCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHH-HhHhhccC Q lcl|NC_011044. 71 -TADYA--AAVHEGSR--PHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAA-ADPDIHMT 137 (137) Q Consensus 71 -~~~YA--~~vE~GT~--ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~-~~~~i~~~ 137 (137) .-.|. +..|||.- +..|+|+ |+=. +..|++...+ ....|++- T Consensus 81 p~~R~~iVHLNE~GYtr~Gk~i~Pr----------------------G~G~---i~~a~~~se~~y~~~vk~e 128 (133) T protein:vir:78 81 PKDRYKIIHLNEYGYTRNGKKITPA----------------------GTGS---VARSLRISERAYRAIVQKK 128 (133) T ss_pred CCCceeEEEeeccceecCCCeEccc----------------------hhhH---HHHHHHhhhHHHHHHHHHH Confidence 11232 23355520 1112222 2211 3333332211 11111111 No 161 >protein:vir:106728 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944315;genbank:gi:38638614;genbank:GeneID:2657357 Probab=86.67 E-value=0.0017 Score=35.76 Aligned_cols=85 Identities=19% Similarity=0.159 Sum_probs=33.7 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeee----ccc-CcEEEEEE-ecCccc Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQ----RYR-PFHVDGGV-EATADY 74 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~----~~~-~~~~~~~v-~~~~~Y 74 (137) |+|..+ .|+.+.++++ . .-|.-|-+..+=.-... ... .......- .+.+.+ T Consensus 1 m~v~~k---~L~~~~~~l~---------------~-----~~v~VGi~~~a~y~d~~~~~~~~~~~~~~~~~~g~~va~i 57 (155) T protein:vir:10 1 MSVTRR---GLTLPKDRYR---------------S-----MSVKAGVLAGATYPDESGKKLADGTILTKDPRAGLPVAMI 57 (155) T ss_pred CcchHH---HHHHHHHHHh---------------C-----CeeEEeecCCCCCccccchhhhhhhhcccccccCCcHHHH Confidence 666332 1222211111 0 01222222221000000 000 00000001 224568 Q ss_pred hhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHhH-hhccC Q lcl|NC_011044. 75 AAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAADP-DIHMT 137 (137) Q Consensus 75 A~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~~-~i~~~ 137 (137) |.|.|||+ + .+||||||+++++++..... .|... T Consensus 58 a~~~E~G~--------------------------~---~IP~RPFlr~t~~~~~~~~~~~l~~~ 92 (155) T protein:vir:10 58 AMALNYGT--------------------------S---KLPARPFMEKTIADRSAEWIKGLTVM 92 (155) T ss_pred HHHHhcCC--------------------------C---CCCCcchhHHHHHHHHHHHHHHHHHH Confidence 88899984 2 36699999999998766321 11111 No 162 >protein:vir:77650 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:YP_022741;genbank:gi:47835022;genbank:GeneID:2821447 Probab=86.35 E-value=0.0011 Score=36.78 Aligned_cols=85 Identities=19% Similarity=0.139 Sum_probs=34.6 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeec----ccC-cEEEEE-EecCccc Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSLTRRIATQARADVPVRTGNLGRTVGELPQR----YRP-FHVDGG-VEATADY 74 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~----~~~-~~~~~~-v~~~~~Y 74 (137) |++..+. |+.+.++++. --|.-|-+..+-.-.... ... ...... -.+.+.+ T Consensus 1 m~~~r~~---l~~~~~~l~~--------------------~~v~VGi~~~a~y~d~~~~~~~~~~~~~~~~~~G~pva~i 57 (155) T protein:vir:77 1 MSVTRRG---LTLPKDRYRS--------------------MSVKAGVLAGATYPDESGKKLADGSILKKDPRAGLPVAMI 57 (155) T ss_pred CcchHHH---HHHHHHHHhc--------------------CceEEeecCCCCCccccchhhhhhhhccccccccccHhhh Confidence 7764331 2222111110 012223222211000000 000 000001 1234578 Q ss_pred hhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHhH-hhccC Q lcl|NC_011044. 75 AAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAADP-DIHMT 137 (137) Q Consensus 75 A~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~~-~i~~~ 137 (137) |.|+|||| + .+||||||+++++++..... .+.+. T Consensus 58 a~~~e~G~--------------------------~---~IP~RPFlr~t~~~~~~~~~~~l~~~ 92 (155) T protein:vir:77 58 AMALNYGT--------------------------S---KLPARPFMEKTIADRSAEWIKGLTVM 92 (155) T ss_pred hhhhhcCC--------------------------C---CCCCCchhhHHHHHHHHHHHHHHHHH Confidence 88899985 2 36699999999998765321 11111 No 163 >protein:vir:95260 Length: 160 # NCBI annotation: Phage conserved protein # Family: family:all:31735 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944893;genbank:gi:38707833;genbank:GeneID:2744046 Probab=85.73 E-value=0.0028 Score=34.62 Aligned_cols=76 Identities=5% Similarity=-0.088 Sum_probs=32.3 Q ss_pred HHHHHHHHHHHHHHHHHHHh--CCccchhhhcc-ceeeeecccCcEEEEEEecCccchhhhhcCCCCCccccccCCccee Q lcl|NC_011044. 21 IFRGKHRSLTRRIATQARAD--VPVRTGNLGRT-VGELPQRYRPFHVDGGVEATADYAAAVHEGSRPHRIVARHAQALHF 97 (137) Q Consensus 21 ~~~~~~~~~a~~i~~~ak~~--aPv~TG~Lr~S-I~~~~~~~~~~~~~~~v~~~~~YA~~vE~GT~ph~i~pk~~k~l~~ 97 (137) -+++......+.++..++.+ .-|.-|-..+. +. . + -.+-+..|.|.||||. T Consensus 1 ~~~~~~~~G~~~L~~~~k~l~~~~V~VGi~~d~g~~------~----d--G~sv~~vA~~~EfG~~-------------- 54 (160) T protein:vir:95 1 MVKRVIHPARAKLVGAMKNLQTANAQVGYFQEQGQH------S----S--GFSYPALMYLQEVIGV-------------- 54 (160) T ss_pred CceeechHhHHHHHHHHHHHhCCeeEEeeccccccC------C----C--CccHHHHHhhhhcCcc-------------- Confidence 22222222233333333333 11333432222 11 0 0 1133578999999972 Q ss_pred ecCCeeEEeeeEecCCCCCCchhhhhHHHHHH--HhHhhccC Q lcl|NC_011044. 98 FWHGREIFRKSVWHPGVRSRPFLRNAAQRIAA--ADPDIHMT 137 (137) Q Consensus 98 ~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~--~~~~i~~~ 137 (137) + +|++||||++++.... ....++-+ T Consensus 55 ------------~---iPaRPf~R~tfe~~~~~~~~~~~~~~ 81 (160) T protein:vir:95 55 ------------P---SASGKVYRRLFEITMMLNKQTLLEQT 81 (160) T ss_pred ------------c---CCCcchhHHHHHHHHHHHHHHHHHHH Confidence 2 3488999988863111 11111111 No 164 >protein:vir:1028 Length: 168 # NCBI annotation: Orf48 # Family: family:all:1029 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076682;genbank:gi:13095791;genbank:GeneID:920342 Probab=81.82 E-value=0.037 Score=28.45 Aligned_cols=123 Identities=15% Similarity=0.101 Sum_probs=58.3 Q ss_pred CceehhhhhhHHHHHHHHHHH-------H-HHHHHHHHHHHHHHHHHhCCc------cch---hhhccceeeeecc---- Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAI-------F-RGKHRSLTRRIATQARADVPV------RTG---NLGRTVGELPQRY---- 59 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~-------~-~~~~~~~a~~i~~~ak~~aPv------~TG---~Lr~SI~~~~~~~---- 59 (137) |- -|+..|..+.+++... - .+.....|...+......+|- +|| +|.+||....... T Consensus 1 M~---~~~d~l~~~~~~vekl~~~ls~eqkakITkAGAkv~~~~L~~~tk~kHy~~k~t~~~~HLaDsI~~~~~niDg~~ 77 (168) T protein:vir:10 1 MV---SFYDAMQLIVDRAEELSTKMSVEDKAEVTKAGAKVFEQALAYEVRNRHYRHRDTGEDPHLADSIVMKNKNIDGVK 77 (168) T ss_pred CC---cHHHHHHHHHHHHHHhhcCCCHHHHHHHhHhhhHHHHHHHHHHhhHhhhccCCCCccchhhhhheeccccccccc Confidence 43 3444444444444331 1 123334444444444444433 455 8999997433221 Q ss_pred cCcEEEEEE-------ecCccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHH---- Q lcl|NC_011044. 60 RPFHVDGGV-------EATADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIA---- 128 (137) Q Consensus 60 ~~~~~~~~v-------~~~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~---- 128 (137) .+.+.+|-- +..+.-|.|++.||+-|.-.-+..+.+. .++. |. +++.+|+..+-++.+ T Consensus 78 dG~s~VGf~~k~~~~~~~ka~iAr~lNDGTk~~~~~~~~~~~~~---~~g~-----v~---i~gDHFvd~~r~d~a~k~~ 146 (168) T protein:vir:10 78 DGQSVVGWERSTEKGTHTKGYIANIINNGSRFPQFTTRSGRKYK---KPGE-----VA---VHADHFIEETRKNPIVQQG 146 (168) T ss_pred CCceeecccCccccccccchheeeeccccccccccccccccccc---cccc-----cc---cccchhHHHhhhchhhhHH Confidence 111111111 2356779999999986644334443322 1111 22 558899988766532 Q ss_pred ---HHh---HhhccC Q lcl|NC_011044. 129 ---AAD---PDIHMT 137 (137) Q Consensus 129 ---~~~---~~i~~~ 137 (137) ++. ++|-+- T Consensus 147 V~~Ae~~~y~eIl~~ 161 (168) T protein:vir:10 147 ILKAEAEAMRKIINR 161 (168) T ss_pred HHHHHHHHHHHHHHh Confidence 121 222222 No 165 >protein:vir:3994 Length: 168 # NCBI annotation: unknown # Family: family:all:1029 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116502;genbank:gi:14251135;genbank:GeneID:921309 Probab=80.43 E-value=0.038 Score=28.36 Aligned_cols=123 Identities=15% Similarity=0.089 Sum_probs=57.9 Q ss_pred CceehhhhhhHHHHHHHHHHHH-------H-HHHHHHHHHHHHHHHHhCCc------cc---hhhhccceeeeecc---- Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIF-------R-GKHRSLTRRIATQARADVPV------RT---GNLGRTVGELPQRY---- 59 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~-------~-~~~~~~a~~i~~~ak~~aPv------~T---G~Lr~SI~~~~~~~---- 59 (137) |- .|+..|..+.+++...+ + +.....|...+......+|- +| ++|++||....... T Consensus 1 M~---~~~d~l~~~~~~v~kl~~~lt~e~kakIT~AGAkv~a~~L~~~T~~kHy~~rktg~~~HLADsI~~~~~niDg~~ 77 (168) T protein:vir:39 1 MV---SFYDAMQLIINQAESLSTKMTVEDKAEVTKAGAKVFEQALAYEVRNRHYRHRDTGEDPHLADSIVMKNKNIDGVK 77 (168) T ss_pred Cc---cHHHHHHHHHHHHHhccCCCCHHHHHHHHHHhHHHHHHHHHHHhHHhcccCCCCCCCccchhheeecccccCccc Confidence 43 34444444444444321 1 22333444433333333332 34 68999997544321 Q ss_pred cCcEEEEEE-------ecCccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHH---- Q lcl|NC_011044. 60 RPFHVDGGV-------EATADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIA---- 128 (137) Q Consensus 60 ~~~~~~~~v-------~~~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~---- 128 (137) .+.+.+|-- +..+.-|.|++.||+.+.-..+.++.+ ..++. |. +++.+|+..+-++.+ T Consensus 78 dG~StVGw~~k~~~~~~~~a~iAr~lNDGTrf~~~~~~~~~~y---~~~g~-----v~---i~gDHFvd~~r~~~a~k~a 146 (168) T protein:vir:39 78 DGQSVVGWERSTEKGTHTKGYIANIINNGSRFPQFTTRSGRKY---KNPGE-----VA---VHADHFIEETRKNPIVQQG 146 (168) T ss_pred CCceeccccCccccccccchhheehhccccccchhhhhccccc---ccccc-----ee---ecccchhHHHhhhhhhhHH Confidence 111221111 235677999999998644333332221 12222 33 458899988777542 Q ss_pred ---HHh---HhhccC Q lcl|NC_011044. 129 ---AAD---PDIHMT 137 (137) Q Consensus 129 ---~~~---~~i~~~ 137 (137) ++. ++|-+- T Consensus 147 V~~Ae~e~~~eil~~ 161 (168) T protein:vir:39 147 ILKAEAEAMRKIINR 161 (168) T ss_pred HHHHHHHHHHHHHHh Confidence 121 233222 No 166 >protein:vir:96012 Length: 133 # NCBI annotation: ORF023 # Family: family:all:589 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239805;genbank:gi:66395471;genbank:GeneID:5132929 Probab=78.38 E-value=0.041 Score=28.16 Aligned_cols=110 Identities=14% Similarity=0.049 Sum_probs=55.2 Q ss_pred Cc---eehhhhhhHH-HHHH-HHHHHHHHHHHHHHHHHHHHHHHhCCc--cchhhhccceeeeecccCcEEEEEEec--- Q lcl|NC_011044. 1 MP---VTARIHINEP-ELER-QSGAIFRGKHRSLTRRIATQARADVPV--RTGNLGRTVGELPQRYRPFHVDGGVEA--- 70 (137) Q Consensus 1 ms---v~~~l~~~~~-~l~~-~~~~~~~~~~~~~a~~i~~~ak~~aPv--~TG~Lr~SI~~~~~~~~~~~~~~~v~~--- 70 (137) || +..+|.++|+ +|.+ .++.+.+++|..+++.++...|.+.-+ |||..-+++........+..-++.|+= T Consensus 1 m~evkGv~eilk~lE~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGatidev~~s~p~~~~g~rtV~i~W~gp 80 (133) T protein:vir:96 1 MRLIYDTKKLERELEKRLSKRALMRITDRALTEAGEVVLEAIRTNLKYFRDTGAEYGEVKLSKPTWENGKRTIRVYWEGE 80 (133) T ss_pred CccccCHHHHHHHHHHhcCHHHHHHHhhHHHHHHHHHHHHHHHHhhHHHhhccceeeeEEecCceecCCceEEEEEeecC Confidence 54 5677777663 3333 667888899999999999999988655 899766665433222222122233322 Q ss_pred Cccch--hhhhcCCC---CCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHH--------HHHHhHhhc Q lcl|NC_011044. 71 TADYA--AAVHEGSR---PHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQR--------IAAADPDIH 135 (137) Q Consensus 71 ~~~YA--~~vE~GT~---ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~--------~~~~~~~i~ 135 (137) .-.|. +..|||+- ...|+|+ |+=. +..|++. ++...++.- T Consensus 81 ~~R~~iVHLNE~G~ytr~Gk~i~Pr----------------------G~G~---I~~al~~se~~y~~~vk~el~kll 133 (133) T protein:vir:96 81 KHRYSIVHLNEKGFYAKDGKFIRPK----------------------GMGA---IDKALRASRDKFFKVYAEEVSKLL 133 (133) T ss_pred CCceeeEeeecccceecCCceeccc----------------------hhhH---HHHHHHhhhHHHHHHHHHHHHHhC Confidence 11222 23455531 0112222 2111 3333332 222222222 No 167 >protein:vir:7412 Length: 168 # NCBI annotation: hypothetical protein # Family: family:all:1029 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839929;genbank:gi:30089899;genbank:GeneID:1260686 Probab=78.14 E-value=0.069 Score=26.96 Aligned_cols=123 Identities=14% Similarity=0.127 Sum_probs=57.8 Q ss_pred CceehhhhhhHHHHHHHHHHHH-------H-HHHHHHHHHHHHHHHHhCCc------cch---hhhccceeeeecccC-c Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIF-------R-GKHRSLTRRIATQARADVPV------RTG---NLGRTVGELPQRYRP-F 62 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~-------~-~~~~~~a~~i~~~ak~~aPv------~TG---~Lr~SI~~~~~~~~~-~ 62 (137) |-- |+..|..+.+++...+ + +.....|...+......+|- +|| +|.+||........+ . T Consensus 1 M~~---~~~~l~~~~~~vekl~~~lt~eqkakITkAGAkv~~~~L~~~t~~kHy~~k~t~~~~HLaDsI~~~~~niDg~~ 77 (168) T protein:vir:74 1 MAT---FEEAMQLIINQAESLSTKMTVEDKAEVTKAGAKVFEQALAYEVRNRHYRHRDTGEDPHLADSIVMKNKNIDGVK 77 (168) T ss_pred Ccc---HHHHHHHHHHHHHhhccCCCHHHHHHHHHhhhHHHHHHHHHHhHHhhcccCCCcccchhhhheeecccccCccc Confidence 543 4444444444443311 1 22333344444334444332 455 899999743322110 0 Q ss_pred EEEEEEe----------cCccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHH--- Q lcl|NC_011044. 63 HVDGGVE----------ATADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAA--- 129 (137) Q Consensus 63 ~~~~~v~----------~~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~--- 129 (137) .-...|| ..+.-|.|++.||+-|.-.-+..+.+. .++. |. +++.+|+..+-++++. T Consensus 78 dG~s~VGf~~k~~~~~~~kA~iAr~lNDGTk~~~~~~~~~~~~~---~~g~-----v~---i~gDHFvd~~r~~~~~k~~ 146 (168) T protein:vir:74 78 DGQSVVGWERSTEKGTHTKGYIANIINNGSRFPQFTTRSGRKYK---KPGE-----VA---VHADHFIEETRMNLIVQQG 146 (168) T ss_pred CCceeecccccccccccchhhhhhhhcccccccccccccccccc---cccc-----cc---cccchhHHHHHhhhhhHHH Confidence 0011232 245689999999986644334444322 1111 22 5588999887665331 Q ss_pred ----Hh---HhhccC Q lcl|NC_011044. 130 ----AD---PDIHMT 137 (137) Q Consensus 130 ----~~---~~i~~~ 137 (137) +. ++|-+- T Consensus 147 V~~Ae~~~y~eIl~~ 161 (168) T protein:vir:74 147 ILKAEAEAMRKIINR 161 (168) T ss_pred HHHHHHHHHHHHHHh Confidence 11 222222 No 168 >protein:vir:8432 Length: 149 # NCBI annotation: gp27 # Family: family:all:30885 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818328;genbank:gi:29566764;genbank:GeneID:1260117 Probab=77.51 E-value=0.071 Score=26.88 Aligned_cols=114 Identities=11% Similarity=0.060 Sum_probs=49.9 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhCCccchhhhccceeeeec----------ccCcEEEEEEe Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSL-TRRIATQARADVPVRTGNLGRTVGELPQR----------YRPFHVDGGVE 69 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~-a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~----------~~~~~~~~~v~ 69 (137) .|-+..|.+-...+...+.+...+.-.-. ...+..+.---.|..||+.+..|+..... -.+.--.|.|+ T Consensus 20 vsssrdlrrivqrfindveqtwhdvwdvsmlgvlaqqtgvphpyqtgdykahikkkkltamqkirikkflkggmpiglvy 99 (149) T protein:vir:84 20 VSSSRDLRRIVQRFINDVEQTWHDVWDVSMLGVLAQQTGVPHPYQTGDYKAHIKKKKLTAMQKIRIKKFLKGGMPIGLVY 99 (149) T ss_pred cccchHHHHHHHHHHHHHHHHHHhHhhHHHHHHHHhhcCCCCCccccchhhhhhhhhHHHHHHHHHHHHhhcCCceeEEe Confidence 22222222222222222222222111100 11111112222466899999888642111 01223567899 Q ss_pred cCccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHhHhhccC Q lcl|NC_011044. 70 ATADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAADPDIHMT 137 (137) Q Consensus 70 ~~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~~~i~~~ 137 (137) +|-+-|.|+||||+-. +|... -+|+ |-++ .||++-| ...-+|-|. T Consensus 100 nndekahwieygtkrd--rpgsr----spwg-----------pntp-----tpafeim-qrvarimne 144 (149) T protein:vir:84 100 NNDEKAHWIEYGTKRD--RPGSR----SPWG-----------PNTP-----TPAFEIM-QRVARIMNE 144 (149) T ss_pred cCCcchhhhhhccccC--CCCCC----CCCC-----------CCCC-----ChhHHHH-HHHHHHhhh Confidence 9999999999999633 22211 1222 2222 2455422 333344444 No 169 >protein:vir:4514 Length: 168 # NCBI annotation: unknown # Family: family:all:2152 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599040;genbank:gi:19548998;genbank:GeneID:935228 Probab=72.96 E-value=0.1 Score=26.07 Aligned_cols=126 Identities=18% Similarity=0.182 Sum_probs=66.2 Q ss_pred Cc-----e----ehhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---CC-ccchhhhccceeeeecc--cCcEEE Q lcl|NC_011044. 1 MP-----V----TARIHINEPELERQSGAIFRGKHRSLTRRIATQARAD---VP-VRTGNLGRTVGELPQRY--RPFHVD 65 (137) Q Consensus 1 ms-----v----~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~~~ak~~---aP-v~TG~Lr~SI~~~~~~~--~~~~~~ 65 (137) |. | -.+++.+-..+-+.+-.+-+.-++.+.+.|...++.. -| +.||.|.+||...+.+. .-.++- T Consensus 1 m~~~~lHvdF~qp~~~~Fnr~riRraFv~igq~hmr~ArrlV~rrgrs~pGe~P~~qTGrLa~SIgy~Vpras~~rpG~m 80 (168) T protein:vir:45 1 MTTSFLHVDFQQPAEMRFNRARVRRAFVTIGQRHMRDARRLVMRHARSAPGENPGYQTGRLARSIGYMVPRASKHRPGFM 80 (168) T ss_pred CCccceeeeeecCCceeecHHHHHHHHHHHhHHHHHHHHHHHhhcccccCCCCCcchhhhhhhhhhhccccccCCCCceE Confidence 22 2 1233344445555554444555666666666555332 24 37999999998655433 112455 Q ss_pred EEEec------------CccchhhhhcCCCCCccccccCCccee--ecCCeeEEeeeEecCCCCCCchhhhhHHHHHHHh Q lcl|NC_011044. 66 GGVEA------------TADYAAAVHEGSRPHRIVARHAQALHF--FWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAAD 131 (137) Q Consensus 66 ~~v~~------------~~~YA~~vE~GT~ph~i~pk~~k~l~~--~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~~ 131 (137) +.|-. .-.|-.|++||-+.-. +..+.-.- ..+.+|.. .|-+-||..+|++.+..- T Consensus 81 vkIaPNqk~G~g~r~i~gdfYPafL~YGVr~ga---kr~r~h~rga~ggsgwri--------aPR~Nym~~~l~~~~~wt 149 (168) T protein:vir:45 81 ARIAPNQRNGEGNRRITGDFYPAFLFYGVRGGA---KRRRSHHRGASGGSGWRL--------APRNNFMVETLEKNRSWT 149 (168) T ss_pred EEecCCCCCCCCCCccccccchhhhhhhhhcch---hhhhhhhccccCCCccee--------ccchhhHHHHHHhhHHHH Confidence 55543 3478899999975432 11111000 11123333 345779999998776544 Q ss_pred HhhccC Q lcl|NC_011044. 132 PDIHMT 137 (137) Q Consensus 132 ~~i~~~ 137 (137) ..+--. T Consensus 150 ~~~L~r 155 (168) T protein:vir:45 150 RYFLAR 155 (168) T ss_pred HHHHHH Confidence 333222 No 170 >protein:vir:107757 Length: 189 # NCBI annotation: gp20 # Family: family:all:503 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024868;genbank:gi:48697510;genbank:GeneID:2948378 Probab=69.23 E-value=0.062 Score=27.20 Aligned_cols=61 Identities=11% Similarity=-0.001 Sum_probs=25.8 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHHH------HHHHHHHH----------------------------------Hh Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSLT------RRIATQAR----------------------------------AD 40 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~a------~~i~~~ak----------------------------------~~ 40 (137) +++-...+..|+.+.......++..+.+.- .-|..... .. T Consensus 88 l~G~~~~~~~L~~~G~~a~~~Ik~~I~~~~~ppna~sTi~~Kg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~ 167 (189) T protein:vir:10 88 VVGQMNVEQALEGLAIVARGDVDATLARLKDPPLSPLTIYIRKFIKDGGVIHGYKDIMRLRSEMQQEQAKGTLNLSGVST 167 (189) T ss_pred HhCCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCCcHHHHHHhcccCcccchhhhhhhhhhhhhhhhhhhhccccccccCC Confidence 333222333333333333333333322110 00000000 01 Q ss_pred CC-ccchhhhccceeeeecccC Q lcl|NC_011044. 41 VP-VRTGNLGRTVGELPQRYRP 61 (137) Q Consensus 41 aP-v~TG~Lr~SI~~~~~~~~~ 61 (137) -| +|||+|++||...+.+... T Consensus 168 kPLidTG~l~~SIty~V~~k~~ 189 (189) T protein:vir:10 168 DPLDFTGYMRATLSYTVTKEKS 189 (189) T ss_pred CchhhHHHHHhhcceeeeecCC Confidence 24 3899999999887765544 No 171 >protein:vir:80037 Length: 199 # NCBI annotation: gp11 # Family: family:all:503 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468715;genbank:gi:157325295;genbank:GeneID:5601728 Probab=68.78 E-value=0.035 Score=28.57 Aligned_cols=59 Identities=14% Similarity=0.070 Sum_probs=23.4 Q ss_pred CceehhhhhhHHHHHHHHHHHHHHHHHHH-----H-HHHHHHHHHhCC-ccchhhhccceeeeecc Q lcl|NC_011044. 1 MPVTARIHINEPELERQSGAIFRGKHRSL-----T-RRIATQARADVP-VRTGNLGRTVGELPQRY 59 (137) Q Consensus 1 msv~~~l~~~~~~l~~~~~~~~~~~~~~~-----a-~~i~~~ak~~aP-v~TG~Lr~SI~~~~~~~ 59 (137) |++-...+..|+.+.......++..+.+. + .-|.+.-...-| +|||.|++||...+.+. T Consensus 134 l~g~~~a~~~L~~~G~~~~~~Ik~~I~~~~~ppna~~Tia~rKg~~kPLidTG~l~~SIty~V~~~ 199 (199) T protein:vir:80 134 IHGKLSAEQVYNRLGAKIVDDIQMKIVEIQTPAKSAATLARNPRKNNPLIVTGKMKNSVTWKVMKS 199 (199) T ss_pred HhCCCcHHHHHHHHHHHHHHHHHHHHhccCCCCCCHHHHHHhcCCCCchHHHHHHHhhcceeeeeC Confidence 22211222222333222222222222210 0 001100001133 48999999999877655 No 172 >protein:vir:4096 Length: 140 # NCBI annotation: Gp9 protein # Family: family:all:28682 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510990;swissprot:trembl:q8w600;genbank:gi:17488512;uniprot:Q8W600;genbank:GeneID:1260318 Probab=64.49 E-value=0.1 Score=26.03 Aligned_cols=110 Identities=13% Similarity=0.096 Sum_probs=56.5 Q ss_pred Cceeh--------hhhhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhCCcc---chhhhccceeeeec-----ccCcE Q lcl|NC_011044. 1 MPVTA--------RIHINEPELERQSGAIFRGKHR-SLTRRIATQARADVPVR---TGNLGRTVGELPQR-----YRPFH 63 (137) Q Consensus 1 msv~~--------~l~~~~~~l~~~~~~~~~~~~~-~~a~~i~~~ak~~aPv~---TG~Lr~SI~~~~~~-----~~~~~ 63 (137) |+-+. +|...+.+++....+++++.|. +++..+........||- .|.+|+-.+....+ ..+.+ T Consensus 1 m~~~~sld~s~~e~L~~~i~r~P~ksE~~IN~~L~tkg~~~~~~~I~~~iPvS~~~k~~~RnK~HAK~s~pl~~~~~NLg 80 (140) T protein:vir:40 1 MCAKWSLEFSDVERLSNLISQIPNKSEAIINKTLETKAVPLVKLNIEKRINLSKNWKGQLLNKNHAQSSGPFNVKMGNLG 80 (140) T ss_pred CCcceecchhhHHHHHHHHHhccchHHHHHHHHHHhhhhHHHHhhhhhccCcCccchhhhccccchhhhhhhhhhhhhcc Confidence 65544 4445555555566677776655 55666777778889995 34556555532211 12223 Q ss_pred EEEEEecCccchhhhhcCCCCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHH---------HHHHhHhh Q lcl|NC_011044. 64 VDGGVEATADYAAAVHEGSRPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQR---------IAAADPDI 134 (137) Q Consensus 64 ~~~~v~~~~~YA~~vE~GT~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~---------~~~~~~~i 134 (137) +++.-...-.|-.|-..|-+-|. --+|.||+..+++ +.+-.+.| T Consensus 81 f~i~~k~kf~YLvfPD~G~G~sn---------------------------~~~q~FmerGl~~~t~~i~E~L~~~l~k~i 133 (140) T protein:vir:40 81 FELLTKPKFNYLIFPDQGIGKHN---------------------------KTKQDFMQLGVEESSQEIVEMLEQAVFKEI 133 (140) T ss_pred eeEeecCcccccccccccCCCCC---------------------------cchHHHHHhccccchhHHHHHHHHHHHHHH Confidence 33332222334444444433221 1245577766643 34455566 Q ss_pred ccC Q lcl|NC_011044. 135 HMT 137 (137) Q Consensus 135 ~~~ 137 (137) ..| T Consensus 134 n~~ 136 (140) T protein:vir:40 134 NDT 136 (140) T ss_pred HHh Confidence 666 No 173 >protein:vir:96105 Length: 193 # NCBI annotation: hypothetical protein ORF028 # Family: family:all:503 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294445;genbank:gi:149408342;genbank:GeneID:5237224 Probab=63.14 E-value=0.041 Score=28.18 Aligned_cols=57 Identities=9% Similarity=-0.044 Sum_probs=25.5 Q ss_pred Ccee--hhhhhhHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHh--------------------CC-ccchhhhcc Q lcl|NC_011044. 1 MPVT--ARIHINEPELERQSGA------IFRGKHRSLTRRIATQARAD--------------------VP-VRTGNLGRT 51 (137) Q Consensus 1 msv~--~~l~~~~~~l~~~~~~------~~~~~~~~~a~~i~~~ak~~--------------------aP-v~TG~Lr~S 51 (137) |.-+ ..-+.-.+.+.+.+.. ..+.+|+.++..++...|.. -| +|||+|++| T Consensus 108 lr~t~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~~G~~~~~~ik~~I~~~~~ppna~~Ti~~KG~~~PLidTG~l~~S 187 (193) T protein:vir:96 108 MRYAWNLFSADRAAIQNRIAMRLARGQITPDQALAQIGLALEGYIARSIRTGPWVANSASTVRRKGFNRPLVDTAHMLQS 187 (193) T ss_pred hhhhHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCCcHHHHHHhCCCCchhHHHHHHhh Confidence 2111 0001111111111111 13344555555555444443 12 489999999 Q ss_pred ceeeee Q lcl|NC_011044. 52 VGELPQ 57 (137) Q Consensus 52 I~~~~~ 57 (137) |...++ T Consensus 188 Ity~Vv 193 (193) T protein:vir:96 188 ISSRVT 193 (193) T ss_pred hcceeC Confidence 987765 No 174 >protein:vir:2688 Length: 123 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075507;genbank:gi:12719436;genbank:GeneID:920156 Probab=62.50 E-value=0.17 Score=24.81 Aligned_cols=102 Identities=7% Similarity=-0.074 Sum_probs=50.5 Q ss_pred hhhhHH-HHH-HHHHHHHHHHHHHHHHHHHHHHHHhCCc--cchhhhccceeeeecccCcE--EEEEEe---cCccch-- Q lcl|NC_011044. 7 IHINEP-ELE-RQSGAIFRGKHRSLTRRIATQARADVPV--RTGNLGRTVGELPQRYRPFH--VDGGVE---ATADYA-- 75 (137) Q Consensus 7 l~~~~~-~l~-~~~~~~~~~~~~~~a~~i~~~ak~~aPv--~TG~Lr~SI~~~~~~~~~~~--~~~~v~---~~~~YA-- 75 (137) |.++|+ +|. ..++.+.+++|..+++.++...|.+.-+ |||..-+++........... -++.|+ +.-.|. T Consensus 1 ilk~lE~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGatidev~~s~p~~~~g~~~rtV~i~W~gp~~R~~iV 80 (123) T protein:vir:26 1 MLKKLESVYGKQSMQAKSDRALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEWVGPMNRKNII 80 (123) T ss_pred ChhhHHHhcCHHHHHHhhhHHHHHHHHHHHHHHHHhhHHhhhccceeeeEEecCeeeccCCccceEEEEeecCCCceeeE Confidence 555552 333 3667888899999999999999988655 89976666543322111111 223332 222232 Q ss_pred hhhhcCC--CCCccccccCCcceeecCCeeEEeeeEecCCCCCCchhhhhHHH--------HHHHhHh Q lcl|NC_011044. 76 AAVHEGS--RPHRIVARHAQALHFFWHGREIFRKSVWHPGVRSRPFLRNAAQR--------IAAADPD 133 (137) Q Consensus 76 ~~vE~GT--~ph~i~pk~~k~l~~~~~g~~~~~k~V~~pG~~a~pfl~~A~~~--------~~~~~~~ 133 (137) +..|||. .++.|+|+.... +..|++. ++....+ T Consensus 81 HLNE~GYtr~Gk~i~PRG~G~-------------------------i~~a~~~se~~y~~~vk~eL~k 123 (123) T protein:vir:26 81 HLNEHGYTRDGKKYTPRGFGV-------------------------IAKTLAANERKYREIIKKELAR 123 (123) T ss_pred eeeccceecCCCeEccchhhH-------------------------HHHHHHhhhHHHHHHHHHHhcC Confidence 3346663 122233322221 2333322 2222222 No 175 >protein:vir:4460 Length: 170 # NCBI annotation: hypothetical protein # Family: family:all:2152 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700383;genbank:gi:23505455;genbank:GeneID:955662 Probab=54.85 E-value=0.093 Score=26.25 Aligned_cols=126 Identities=19% Similarity=0.160 Sum_probs=69.7 Q ss_pred CceehhhhhhHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHh-----------CCc-cchhhhccceeeeecc--cCcEE Q lcl|NC_011044. 1 MPVTARIHINEPELERQ--SGAIFRGKHRSLTRRIATQARAD-----------VPV-RTGNLGRTVGELPQRY--RPFHV 64 (137) Q Consensus 1 msv~~~l~~~~~~l~~~--~~~~~~~~~~~~a~~i~~~ak~~-----------aPv-~TG~Lr~SI~~~~~~~--~~~~~ 64 (137) |.=++-|+.+..+..+. ....++++..+++......|+.+ -|. +||.|.+||...+.+. .-.++ T Consensus 1 M~~~~~lHvdF~qp~~~~Fnr~r~RraF~~iGq~h~r~Arrlvm~RGrs~pGe~P~~~TGrLa~SIgy~Vpras~~rpG~ 80 (170) T protein:vir:44 1 MPQKAYLHVDFVQPEELVFNRARMRRAFVKIGQVHMRDARRLVMKRGRSKPGENPSYRTGQLARSIGYYVPRASKKRPGL 80 (170) T ss_pred CCCCceeEEeeecCCceeecHHHHHHHHHHHhHHHHHHHHHHHHHhcCCCCCCCCcchhhhhhhhhhhccccccCCCCce Confidence 66666666655555442 23556666666666666666533 344 7999999998655433 11234 Q ss_pred EEEEec------------CccchhhhhcCCCCCccccccCCccee--ecCCeeEEeeeEecCCCCCCchhhhhHHHHHHH Q lcl|NC_011044. 65 DGGVEA------------TADYAAAVHEGSRPHRIVARHAQALHF--FWHGREIFRKSVWHPGVRSRPFLRNAAQRIAAA 130 (137) Q Consensus 65 ~~~v~~------------~~~YA~~vE~GT~ph~i~pk~~k~l~~--~~~g~~~~~k~V~~pG~~a~pfl~~A~~~~~~~ 130 (137) -+.|-. +..|-.|++||-+.- .|..+.-.- ..+.+|-. .|-+-||..+|++.+.. T Consensus 81 mVkIaPNqk~G~g~r~i~g~fYPafL~YGVr~g---akr~k~hhr~a~ggsgwri--------aPR~Nym~~~l~~~~~w 149 (170) T protein:vir:44 81 MVKIAPNQKNGEGNRHINGAFYPAFLFYGVRRG---AKRKKGHHRGASGGSGWRV--------EPRNNYMTEVLDKRRSW 149 (170) T ss_pred eEEecCCCCCCCCccccccccchhhhhhhhhcc---cccchhhcccccCCCccee--------ccchhHHHHHHHhhHHH Confidence 455533 237899999997532 122111100 01122322 34577999999876654 Q ss_pred hHhhccC Q lcl|NC_011044. 131 DPDIHMT 137 (137) Q Consensus 131 ~~~i~~~ 137 (137) -..|--. T Consensus 150 t~~~L~r 156 (170) T protein:vir:44 150 TRYVLSR 156 (170) T ss_pred HHHHHHH Confidence 4433222 No 176 >protein:vir:3787 Length: 231 # NCBI annotation: orf22 # Family: family:all:743 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536827;genbank:gi:17981836;genbank:GeneID:929215 Probab=53.58 E-value=0.52 Score=22.12 Aligned_cols=136 Identities=14% Similarity=0.055 Sum_probs=66.7 Q ss_pred Cceehhhh-hhHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHh-----C----Cc---c--chh---------hhc Q lcl|NC_011044. 1 MPVTARIH-INEPELERQSG------AIFRGKHRSLTRRIATQARAD-----V----PV---R--TGN---------LGR 50 (137) Q Consensus 1 msv~~~l~-~~~~~l~~~~~------~~~~~~~~~~a~~i~~~ak~~-----a----Pv---~--TG~---------Lr~ 50 (137) |++.-+|+ .++..|.+++. ..-++.+.+++..+...++.+ . |+ + .|. |.+ T Consensus 1 m~~~~~~n~~dl~~l~~~L~ll~L~p~kRrrLl~~iak~lr~~~~~rI~~Q~~PDGs~w~pRK~~~~k~k~~rm~~kL~~ 80 (231) T protein:vir:37 1 MQIRLGLKQEDLDAFVRDLRTLNLTGKQKKKILTWTLGAIKRKSQKNIREQHSPDGTAWEKRKPVDGEIKNKRLLKKVLR 80 (231) T ss_pred CCccCCcCHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCcCchhcccccchhhHHHHHHhHH Confidence 99977776 55666766664 233456788888777766665 2 22 2 222 222 Q ss_pred cceeeeecccCcEEEEEEecCccchhhhhcCCCCCcc--------ccc--------cCCc---ceeecC----------- Q lcl|NC_011044. 51 TVGELPQRYRPFHVDGGVEATADYAAAVHEGSRPHRI--------VAR--------HAQA---LHFFWH----------- 100 (137) Q Consensus 51 SI~~~~~~~~~~~~~~~v~~~~~YA~~vE~GT~ph~i--------~pk--------~~k~---l~~~~~----------- 100 (137) .........++..+....+.....|..++||-....- .|. -++. |-|.+. T Consensus 81 ~~~~~~~~~~~~~~~~~~g~~~~IA~vHQ~G~~~rv~~~~~~~~~~~~~~~pATr~QAk~Lr~lGy~v~~~k~k~~k~~~ 160 (231) T protein:vir:37 81 YASILAEERGKGRIYYKNPLTGEIAQKQQDGFTEHFRVFATDKNKNGSGNDRATIRQAQKLRSLGYRKRNGKNRQGKTKY 160 (231) T ss_pred hhccccccCCceEEeeecchHHHHHHHhhcCcccccchhhhhhccCCCCCCCCCHHHHHHHHHhcccccCCCCCCCCCCc Confidence 2222222222221111223345688889999532110 000 0111 122221 Q ss_pred -------------------------CeeE--Eeee-EecCCCCCCchhhhhHHHHHH----HhHhhccC Q lcl|NC_011044. 101 -------------------------GREI--FRKS-VWHPGVRSRPFLRNAAQRIAA----ADPDIHMT 137 (137) Q Consensus 101 -------------------------g~~~--~~k~-V~~pG~~a~pfl~~A~~~~~~----~~~~i~~~ 137 (137) |... -.|. +.++ .|++|||.-.-++... ....|.++ T Consensus 161 rkps~kwI~~~ls~~qAgliIR~L~~k~~~~~~k~~W~I~-~paR~FLG~~~~e~~~~l~~~l~~i~~~ 228 (231) T protein:vir:37 161 RLYTIKEIRERLTRTWASMEIRRLENKVNAGNGKTNWEIH-VPARPFLDTREKENVDILREITLKFLSG 228 (231) T ss_pred CcCCHHHHHHhhhhHHHHHHHHHHhcccccccCcceeeee-cCcccccCCCHHHHHHHHHHHHHHHhcc Confidence 1110 1122 3343 7799999887776544 45555666 No 177 >protein:vir:6375 Length: 205 # NCBI annotation: hypothetical protein # Family: family:all:10491 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918988;genbank:gi:34610163;genbank:gi:91214209;genbank:GeneID:2559587 Probab=45.42 E-value=0.77 Score=21.20 Aligned_cols=137 Identities=9% Similarity=0.036 Sum_probs=66.3 Q ss_pred Cce------ehhhhhhHHHHHHHHHHHHHHHHHHHHHHHH----HH-HHHhCCccchhhhccceeeeec-ccCcEEEEEE Q lcl|NC_011044. 1 MPV------TARIHINEPELERQSGAIFRGKHRSLTRRIA----TQ-ARADVPVRTGNLGRTVGELPQR-YRPFHVDGGV 68 (137) Q Consensus 1 msv------~~~l~~~~~~l~~~~~~~~~~~~~~~a~~i~----~~-ak~~aPv~TG~Lr~SI~~~~~~-~~~~~~~~~v 68 (137) |++ ..+++..|+.|++........++++.|..-. .+ .....-+..++|+++++..++. -....+++.| T Consensus 1 m~i~v~~~G~~~~~~~l~~l~~~~~~a~~~AIN~ta~~~~~~~A~~~i~~~vn~k~~yv~~~~Rlti~k~As~~~L~A~I 80 (205) T protein:vir:63 1 MSIEIVAEGLGEFRDYVDRLPDISQQAAMIAINQTAQRTALPLARTEIGEQVNFPDNYLKDDSRLGVTKKATRNDLEAVI 80 (205) T ss_pred CeeeeehhhHHHHHHHHHhcchhhhHHHHHHHHHHHHHhhHHHHHHhhhhccccchhhhccceeeEEEeecCCCCeeEEE Confidence 776 4566777888888888777777777665442 22 3344556789999876555444 3344577788 Q ss_pred ecCccchhhhhcCCCCCccccccCCcceee--------cCCeeEEeee---------------Ee-cCC--------CC- Q lcl|NC_011044. 69 EATADYAAAVHEGSRPHRIVARHAQALHFF--------WHGREIFRKS---------------VW-HPG--------VR- 115 (137) Q Consensus 69 ~~~~~YA~~vE~GT~ph~i~pk~~k~l~~~--------~~g~~~~~k~---------------V~-~pG--------~~- 115 (137) .+...=-..--|++.+..-+++++...... ..|.+...-+ ++ .+| .. T Consensus 81 ~ar~rpt~LsRF~~p~~~~~~~r~~GVsV~Vk~G~ak~l~gaF~~~lk~g~~l~e~~~~vgva~R~~~g~~~~~~~g~~k 160 (205) T protein:vir:63 81 GARQRPTSLARFAEPGQTTKSTRKGGVSVVVKPGRTKQFKRGFLVRLRAGKTLTEDKYNLGLAVRLSPGETLHATDGATK 160 (205) T ss_pred ecCCCcceeeeccCCCccccccccCCeEEEEEcCCCeeccCceEEEeeccccccccccceEEEeeecCccccccccCcee Confidence 765443333334444333332222222211 1122111100 01 112 11 Q ss_pred -CCc---hhhhhHHHHHHH-h----HhhccC Q lcl|NC_011044. 116 -SRP---FLRNAAQRIAAA-D----PDIHMT 137 (137) Q Consensus 116 -a~p---fl~~A~~~~~~~-~----~~i~~~ 137 (137) +.+ +..|+++++-.. . ++|..- T Consensus 161 ~~~~~k~LYGPSV~Qvf~~~~e~I~~~i~~~ 191 (205) T protein:vir:63 161 LSNNVYLLYGPSVDQVFRTVADDITTEVLDA 191 (205) T ss_pred cCCceEEEEcCcHHHHHhhhhhhhhHHHHHH Confidence 112 456777665331 1 111111 No 178 >protein:vir:3427 Length: 192 # NCBI annotation: tail component # Family: family:all:869 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040590;genbank:gi:9626254;genbank:GeneID:2703485 Probab=34.36 E-value=0.99 Score=20.60 Aligned_cols=132 Identities=11% Similarity=0.092 Sum_probs=55.1 Q ss_pred Cce--ehhhhhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhCCc----cchhhhccceeeeecccCcEEEEEEecCcc Q lcl|NC_011044. 1 MPV--TARIHINEPELERQ-SGAIFRGKHRSLTRRIATQARADVPV----RTGNLGRTVGELPQRYRPFHVDGGVEATAD 73 (137) Q Consensus 1 msv--~~~l~~~~~~l~~~-~~~~~~~~~~~~a~~i~~~ak~~aPv----~TG~Lr~SI~~~~~~~~~~~~~~~v~~~~~ 73 (137) |++ .+++..+|..+++. +..++.++++.++..+..++...+.- +...+++.++...... ....+.|..+-. T Consensus 1 ~~ik~l~~~~~~L~~i~~~~vp~A~~rAiNrta~~a~t~~~r~v~~e~~I~~k~Ir~r~r~~kAs~--~~l~a~I~~~~~ 78 (192) T protein:vir:34 1 MAIKGLEQAVENLSRISKTAVPGAAAMAINRVASSAISQSASQVARETKVRRKLVKERARLKRATV--KNPQARIKVNRG 78 (192) T ss_pred CcchhHHHHHHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCHHHHHhhheeccccC--CCceEEEEEecc Confidence 555 67777888887664 77888888888888777766655443 4557788877654333 334455544322 Q ss_pred chhhhhcCCCCCccccc-------c------CCcce---------e---ecCCee-EEeeeEecCCCCCCchhhhhHHHH Q lcl|NC_011044. 74 YAAAVHEGSRPHRIVAR-------H------AQALH---------F---FWHGRE-IFRKSVWHPGVRSRPFLRNAAQRI 127 (137) Q Consensus 74 YA~~vE~GT~ph~i~pk-------~------~k~l~---------~---~~~g~~-~~~k~V~~pG~~a~pfl~~A~~~~ 127 (137) --+-.-+|+.+-....+ . ++.+. | .-.|.| +|.+ + +|-.-.|.=.--+.-- T Consensus 79 ~l~~~~l~~~~~~~~rr~~~~~~~~~~~~~~g~~~k~Gk~~f~gaFia~m~ng~~~Vf~R-~--~gk~R~PIe~vkIpis 155 (192) T protein:vir:34 79 DLPVIKLGNARVVLSRRRRRKKGQRSSLKGGGSVLVVGNRRIPGAFIQQLKNGRWHVMQR-V--AGKNRYPIDVVKIPMA 155 (192) T ss_pred ceeeeeecccccccccccccccccccccccccceeeecceecCCcccccCCCCCceeEEE-c--cCCCccceeEEEechh Confidence 22222333321111000 0 00010 1 001111 2222 1 1111111100000000 Q ss_pred HHHhHhhccC Q lcl|NC_011044. 128 AAADPDIHMT 137 (137) Q Consensus 128 ~~~~~~i~~~ 137 (137) .+-.+...++ T Consensus 156 ~~l~~af~~~ 165 (192) T protein:vir:34 156 VPLTTAFKQN 165 (192) T ss_pred HHHHHHHHHH Confidence 0011111222 Done!