Query lcl|NC_021326.1_cdsid_YP_008058997.1 [gene=M175_gp24] [protein=putative major tail protein] [protein_id=YP_008058997.1] [location=complement(21799..22149)] Match_columns 116 No_of_seqs 119 out of 299 Neff 7.9 Searched_HMMs 1612 Date Thu Nov 7 18:59:46 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_24 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_24_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:95062 Length: 116 100.0 1.2E-38 7.6E-42 228.4 11.7 116 1-116 1-116 (116) 2 protein:vir:1243 Length: 116 # 100.0 3E-38 1.9E-41 226.3 11.8 116 1-116 1-116 (116) 3 protein:vir:97327 Length: 116 100.0 3E-38 1.9E-41 226.3 11.8 116 1-116 1-116 (116) 4 protein:vir:107099 Length: 137 100.0 1E-36 6.5E-40 217.8 11.3 116 1-116 22-137 (137) 5 protein:vir:96121 Length: 137 100.0 1.1E-36 6.9E-40 217.7 11.4 116 1-116 22-137 (137) 6 protein:vir:105330 Length: 137 100.0 9.9E-37 6.2E-40 218.0 11.1 116 1-116 22-137 (137) 7 protein:vir:94796 Length: 137 100.0 1.1E-36 6.9E-40 217.7 11.3 116 1-116 22-137 (137) 8 protein:vir:94108 Length: 149 100.0 2.2E-36 1.4E-39 216.0 11.3 116 1-116 34-149 (149) 9 protein:vir:94490 Length: 137 100.0 2.4E-36 1.5E-39 215.8 11.3 116 1-116 22-137 (137) 10 protein:vir:97427 Length: 137 100.0 2.4E-36 1.5E-39 215.8 11.3 116 1-116 22-137 (137) 11 protein:vir:93738 Length: 137 100.0 2.4E-36 1.5E-39 215.8 11.3 116 1-116 22-137 (137) 12 protein:vir:95894 Length: 137 100.0 4.3E-36 2.6E-39 214.5 11.3 116 1-116 22-137 (137) 13 protein:vir:105916 Length: 149 100.0 5.3E-36 3.3E-39 214.0 11.2 116 1-116 34-149 (149) 14 protein:vir:96829 Length: 135 100.0 6.1E-34 3.8E-37 202.7 11.2 114 1-116 22-135 (135) 15 protein:vir:78077 Length: 141 100.0 1.3E-33 7.8E-37 201.0 9.7 113 1-116 17-134 (141) 16 protein:vir:5978 Length: 144 # 100.0 2.5E-33 1.6E-36 199.3 10.6 113 1-116 27-140 (144) 17 protein:vir:8669 Length: 142 # 100.0 2.3E-32 1.5E-35 194.0 9.2 116 1-116 22-141 (142) 18 protein:vir:99101 Length: 142 100.0 2.3E-32 1.5E-35 194.0 9.2 116 1-116 22-141 (142) 19 protein:vir:106570 Length: 182 99.9 8.1E-32 5E-35 191.0 10.3 116 1-116 27-169 (182) 20 protein:vir:94654 Length: 142 99.9 1.1E-31 7E-35 190.2 9.8 114 1-116 24-139 (142) 21 protein:vir:107545 Length: 140 99.9 6.3E-32 3.9E-35 191.6 8.3 114 1-114 21-140 (140) 22 protein:vir:97982 Length: 140 99.9 6.3E-32 3.9E-35 191.6 8.3 114 1-114 21-140 (140) 23 protein:vir:106041 Length: 137 99.9 9.7E-32 6E-35 190.6 7.6 114 1-114 18-137 (137) 24 protein:vir:102441 Length: 137 99.9 1.3E-30 7.8E-34 184.5 8.0 115 1-115 18-137 (137) 25 protein:vir:106506 Length: 137 99.9 3.7E-30 2.3E-33 181.9 7.8 116 1-116 17-135 (137) 26 protein:vir:101594 Length: 173 99.9 8.1E-30 5E-33 180.1 9.4 116 1-116 20-163 (173) 27 protein:vir:743 Length: 108 # 99.9 2E-26 1.2E-29 161.5 9.0 87 1-116 18-104 (108) 28 protein:vir:98409 Length: 108 99.9 3.6E-26 2.2E-29 160.1 8.7 87 1-116 18-104 (108) 29 protein:vir:3617 Length: 112 # 99.9 5E-26 3.1E-29 159.3 9.0 87 1-116 22-108 (112) 30 protein:vir:9930 Length: 108 # 99.9 6.6E-26 4.1E-29 158.6 8.3 86 1-116 18-103 (108) 31 protein:vir:96486 Length: 112 99.9 2.3E-25 1.4E-28 155.6 8.0 85 1-116 25-109 (112) 32 protein:vir:95789 Length: 114 99.9 3.4E-25 2.1E-28 154.7 8.1 85 1-116 22-106 (114) 33 protein:vir:99744 Length: 115 99.8 5.1E-25 3.2E-28 153.8 8.0 86 1-116 20-111 (115) 34 protein:vir:103917 Length: 115 99.8 9.7E-25 6E-28 152.2 8.3 86 1-116 20-111 (115) 35 protein:vir:78858 Length: 115 99.8 9.7E-25 6E-28 152.2 8.3 86 1-116 20-111 (115) 36 protein:vir:97144 Length: 115 99.8 9.7E-25 6E-28 152.2 8.3 86 1-116 20-111 (115) 37 protein:vir:96225 Length: 115 99.8 9.7E-25 6E-28 152.2 8.3 86 1-116 20-111 (115) 38 protein:vir:96358 Length: 115 99.8 9.7E-25 6E-28 152.2 8.3 86 1-116 20-111 (115) 39 protein:vir:9312 Length: 115 # 99.8 9.7E-25 6E-28 152.2 8.3 86 1-116 20-111 (115) 40 protein:vir:106623 Length: 115 99.8 9.7E-25 6E-28 152.2 8.2 86 1-116 20-111 (115) 41 protein:vir:94538 Length: 125 99.8 8.1E-25 5E-28 152.7 7.7 87 1-116 26-115 (125) 42 protein:vir:4906 Length: 114 # 99.8 3.5E-24 2.2E-27 149.2 6.7 85 1-116 25-109 (114) 43 protein:vir:2740 Length: 114 # 99.8 3.5E-24 2.2E-27 149.2 6.7 85 1-116 25-109 (114) 44 protein:vir:97088 Length: 157 99.8 1.6E-21 1E-24 134.6 9.4 111 1-116 24-142 (157) 45 protein:vir:105467 Length: 144 99.7 4.6E-21 2.8E-24 132.1 9.3 104 1-116 27-133 (144) 46 protein:vir:100075 Length: 140 99.7 1.8E-20 1.1E-23 128.9 7.3 87 1-116 24-125 (140) 47 protein:vir:100243 Length: 140 99.7 3E-20 1.9E-23 127.6 7.5 87 1-116 24-125 (140) 48 protein:vir:1437 Length: 140 # 99.7 7E-20 4.4E-23 125.6 7.8 87 1-116 24-125 (140) 49 protein:vir:80362 Length: 140 99.7 8.8E-20 5.5E-23 125.0 7.7 87 1-116 24-125 (140) 50 protein:vir:194 Length: 149 # 99.7 9.4E-20 5.8E-23 124.9 6.6 87 1-116 26-136 (149) 51 protein:vir:93617 Length: 148 99.6 2.4E-19 1.5E-22 122.7 7.7 87 1-116 26-135 (148) 52 protein:vir:1273 Length: 127 # 99.6 6E-19 3.7E-22 120.5 7.2 87 1-116 23-119 (127) 53 protein:vir:9879 Length: 127 # 99.6 8.2E-19 5.1E-22 119.7 6.5 96 1-116 15-122 (127) 54 protein:vir:79034 Length: 141 99.6 3.9E-18 2.4E-21 116.0 8.5 93 1-116 27-128 (141) 55 protein:vir:107568 Length: 146 99.6 4E-18 2.5E-21 116.0 7.8 87 1-116 26-135 (146) 56 protein:vir:102085 Length: 146 99.6 4E-18 2.5E-21 116.0 7.8 87 1-116 26-135 (146) 57 protein:vir:105007 Length: 146 99.6 4E-18 2.5E-21 116.0 7.8 87 1-116 26-135 (146) 58 protein:vir:102875 Length: 146 99.6 4E-18 2.5E-21 116.0 7.8 87 1-116 26-135 (146) 59 protein:vir:1891 Length: 179 # 99.6 3.8E-18 2.3E-21 116.1 7.3 109 1-116 27-158 (179) 60 protein:vir:4347 Length: 164 # 99.6 4.6E-18 2.8E-21 115.6 7.1 87 1-116 27-143 (164) 61 protein:vir:105089 Length: 133 99.6 7.4E-18 4.6E-21 114.5 7.8 87 1-116 23-123 (133) 62 protein:vir:5745 Length: 135 # 99.5 1.2E-17 7.4E-21 113.3 7.4 87 1-116 25-123 (135) 63 protein:vir:3873 Length: 128 # 99.5 3.4E-17 2.1E-20 110.9 7.8 87 1-116 22-120 (128) 64 protein:vir:1386 Length: 149 # 99.5 5.8E-17 3.6E-20 109.6 7.8 87 1-116 28-136 (149) 65 protein:vir:99528 Length: 92 # 99.5 3E-17 1.9E-20 111.2 6.1 67 1-96 18-92 (92) 66 protein:vir:9708 Length: 125 # 99.5 7E-17 4.3E-20 109.2 7.5 87 1-116 19-116 (125) 67 protein:vir:81147 Length: 126 99.5 1.1E-16 7.1E-20 108.0 7.9 92 1-116 24-119 (126) 68 protein:vir:81106 Length: 125 99.4 3.7E-16 2.3E-19 105.2 7.8 87 1-116 21-117 (125) 69 protein:vir:4704 Length: 125 # 99.4 3.7E-16 2.3E-19 105.2 7.8 87 1-116 21-117 (125) 70 protein:vir:79988 Length: 125 99.4 3.7E-16 2.3E-19 105.2 7.8 87 1-116 21-117 (125) 71 protein:vir:9414 Length: 125 # 99.4 3.7E-16 2.3E-19 105.2 7.8 87 1-116 21-117 (125) 72 protein:vir:98342 Length: 125 99.4 3.7E-16 2.3E-19 105.2 7.8 87 1-116 21-117 (125) 73 protein:vir:102338 Length: 116 99.3 2.7E-15 1.7E-18 100.4 8.1 104 1-116 1-108 (116) 74 protein:vir:102963 Length: 163 99.3 3.7E-15 2.3E-18 99.7 8.4 92 1-116 26-147 (163) 75 protein:vir:102154 Length: 119 99.3 1.8E-15 1.1E-18 101.4 4.6 85 1-116 23-111 (119) 76 protein:vir:966 Length: 123 # 99.3 1.7E-14 1.1E-17 96.0 8.3 92 1-116 25-118 (123) 77 protein:vir:10367 Length: 119 99.1 6.3E-14 3.9E-17 93.0 3.8 96 16-116 1-104 (119) 78 protein:vir:81067 Length: 119 99.1 1.1E-13 7.1E-17 91.5 3.7 96 16-116 1-104 (119) 79 protein:vir:94994 Length: 131 98.9 6.7E-12 4.2E-15 81.8 7.1 87 1-116 14-128 (131) 80 protein:vir:103280 Length: 142 98.9 8.6E-12 5.3E-15 81.3 7.6 87 1-116 19-136 (142) 81 protein:vir:107703 Length: 147 98.9 1E-11 6.4E-15 80.8 7.7 87 1-116 20-138 (147) 82 protein:vir:95372 Length: 124 98.8 1.5E-11 9.2E-15 79.9 7.7 90 1-116 28-119 (124) 83 protein:vir:104347 Length: 145 98.8 1.3E-11 8.2E-15 80.2 7.4 87 1-116 22-139 (145) 84 protein:vir:78380 Length: 131 98.8 1.6E-11 1E-14 79.7 7.5 87 1-116 14-128 (131) 85 protein:vir:79638 Length: 146 98.8 2.4E-11 1.5E-14 78.8 7.4 87 1-116 20-138 (146) 86 protein:vir:80116 Length: 127 98.8 3E-11 1.9E-14 78.2 7.4 90 1-116 28-119 (127) 87 protein:vir:96288 Length: 100 98.8 9.8E-12 6.1E-15 80.9 4.1 67 1-69 34-100 (100) 88 protein:vir:3163 Length: 145 # 98.7 1.7E-11 1.1E-14 79.6 3.4 89 1-116 15-132 (145) 89 protein:vir:95157 Length: 144 98.7 1.8E-10 1.1E-13 74.0 7.9 87 1-116 19-142 (144) 90 protein:vir:80425 Length: 134 98.6 1.8E-10 1.1E-13 74.0 7.9 86 1-116 14-130 (134) 91 protein:vir:97190 Length: 148 98.6 2.1E-10 1.3E-13 73.7 7.5 87 1-116 18-141 (148) 92 protein:vir:100887 Length: 139 98.5 5.4E-10 3.4E-13 71.4 7.9 87 1-116 21-123 (139) 93 protein:vir:94944 Length: 121 98.5 2.4E-10 1.5E-13 73.4 5.2 79 1-108 17-121 (121) 94 protein:vir:105773 Length: 131 98.5 4.9E-10 3E-13 71.6 6.6 103 1-116 20-126 (131) 95 protein:vir:4956 Length: 153 # 98.4 1.3E-09 8.3E-13 69.2 7.5 87 1-116 22-127 (153) 96 protein:vir:6246 Length: 143 # 98.4 6.7E-10 4.1E-13 70.9 5.4 89 1-116 29-138 (143) 97 protein:vir:1332 Length: 143 # 98.3 1.9E-09 1.2E-12 68.4 5.5 89 1-116 29-138 (143) 98 protein:vir:96774 Length: 152 98.3 5.8E-09 3.6E-12 65.8 7.7 87 1-116 24-150 (152) 99 protein:vir:5000 Length: 141 # 98.2 8.3E-09 5.1E-12 64.9 7.6 87 1-116 22-127 (141) 100 protein:vir:100223 Length: 139 98.2 1.2E-08 7.4E-12 64.0 7.4 87 1-116 21-123 (139) 101 protein:vir:99833 Length: 190 98.1 1.5E-08 9.3E-12 63.5 6.9 114 1-116 13-183 (190) 102 protein:vir:4859 Length: 140 # 98.1 2.9E-08 1.8E-11 61.9 7.6 87 1-116 22-127 (140) 103 protein:vir:79225 Length: 155 98.0 1.6E-08 1E-11 63.3 5.6 92 1-116 14-152 (155) 104 protein:vir:4833 Length: 140 # 98.0 5E-08 3.1E-11 60.6 7.1 87 1-116 22-127 (140) 105 protein:vir:103841 Length: 155 98.0 7E-08 4.4E-11 59.8 7.5 92 1-116 14-152 (155) 106 protein:vir:99196 Length: 155 97.9 5.4E-08 3.3E-11 60.4 5.7 92 1-116 14-152 (155) 107 protein:vir:7449 Length: 123 # 97.9 6.2E-08 3.8E-11 60.1 5.7 85 1-116 24-113 (123) 108 protein:vir:1988 Length: 156 # 97.8 6.6E-08 4.1E-11 60.0 5.4 92 1-116 14-147 (156) 109 protein:vir:6216 Length: 125 # 97.8 1.1E-07 7E-11 58.7 5.9 92 1-116 21-118 (125) 110 protein:vir:102190 Length: 93 97.8 9.2E-08 5.7E-11 59.2 5.2 81 5-116 1-86 (93) 111 protein:vir:79091 Length: 175 97.7 8.4E-08 5.2E-11 59.4 4.6 92 1-116 14-169 (175) 112 protein:vir:8106 Length: 150 # 97.7 4.6E-08 2.8E-11 60.8 3.1 113 1-116 25-147 (150) 113 protein:vir:80970 Length: 112 97.6 3.7E-07 2.3E-10 55.9 6.6 90 1-116 16-105 (112) 114 protein:vir:100652 Length: 134 97.6 4.8E-07 3E-10 55.2 6.9 95 1-116 24-128 (134) 115 protein:vir:101508 Length: 120 97.6 3.1E-07 1.9E-10 56.3 5.7 83 1-116 24-113 (120) 116 protein:vir:45 Length: 112 # N 97.6 4.3E-07 2.7E-10 55.5 6.4 90 1-116 16-105 (112) 117 protein:vir:107851 Length: 175 97.6 4.3E-07 2.7E-10 55.5 6.3 92 1-116 14-169 (175) 118 protein:vir:4162 Length: 133 # 97.5 3.4E-07 2.1E-10 56.0 5.1 108 1-116 19-133 (133) 119 protein:vir:4200 Length: 133 # 97.5 3.4E-07 2.1E-10 56.0 5.0 108 1-116 19-128 (133) 120 protein:vir:7993 Length: 108 # 97.5 5.8E-08 3.6E-11 60.2 0.6 84 1-84 23-108 (108) 121 protein:vir:101302 Length: 134 97.4 1.5E-06 9.6E-10 52.4 7.2 95 1-116 24-128 (134) 122 protein:vir:9513 Length: 134 # 97.4 1.5E-06 9.6E-10 52.4 7.2 95 1-116 24-128 (134) 123 protein:vir:4790 Length: 114 # 97.2 2.8E-06 1.7E-09 51.0 6.6 93 1-116 12-109 (114) 124 protein:vir:3848 Length: 159 # 97.0 9.5E-06 5.9E-09 48.1 7.9 87 1-116 9-146 (159) 125 protein:vir:98892 Length: 108 96.8 8.2E-06 5.1E-09 48.5 6.0 88 1-116 16-103 (108) 126 protein:vir:94069 Length: 168 96.6 1.3E-06 8.3E-10 52.8 0.9 83 1-116 1-93 (168) 127 protein:vir:98557 Length: 149 96.6 1.6E-05 1E-08 46.9 6.7 97 1-116 7-148 (149) 128 protein:vir:9647 Length: 132 # 96.5 2.6E-05 1.6E-08 45.8 7.0 90 1-116 26-131 (132) 129 protein:vir:79687 Length: 113 96.4 2.1E-05 1.3E-08 46.2 5.9 91 1-116 7-102 (113) 130 protein:vir:1581 Length: 116 # 96.3 4E-05 2.5E-08 44.7 7.1 96 1-116 17-116 (116) 131 protein:vir:77650 Length: 155 95.8 1.7E-05 1E-08 46.8 2.9 78 1-116 1-90 (155) 132 protein:vir:78607 Length: 155 95.6 1.8E-05 1.1E-08 46.6 2.2 81 2-116 1-90 (155) 133 protein:vir:98636 Length: 138 95.6 0.00015 9.1E-08 41.6 7.1 90 1-116 32-129 (138) 134 protein:vir:106728 Length: 155 95.5 1.9E-05 1.2E-08 46.5 2.1 81 2-116 1-90 (155) 135 protein:vir:93898 Length: 133 95.4 0.00017 1.1E-07 41.2 6.9 93 1-116 24-132 (133) 136 protein:vir:101563 Length: 155 95.3 3.3E-05 2E-08 45.2 2.7 77 1-116 3-90 (155) 137 protein:vir:107757 Length: 189 95.3 2.8E-05 1.7E-08 45.6 2.3 79 1-116 1-81 (189) 138 protein:vir:96105 Length: 193 95.3 5E-05 3.1E-08 44.2 3.7 107 1-116 3-124 (193) 139 protein:vir:5257 Length: 148 # 95.3 1.1E-05 7E-09 47.7 0.1 79 1-116 1-83 (148) 140 protein:vir:79179 Length: 155 95.3 0.00041 2.5E-07 39.2 8.6 95 1-116 8-154 (155) 141 protein:vir:78335 Length: 133 95.2 0.00027 1.7E-07 40.1 7.6 93 1-116 24-126 (133) 142 protein:vir:3036 Length: 118 # 95.2 0.00023 1.4E-07 40.5 7.0 95 1-116 16-116 (118) 143 protein:vir:9823 Length: 118 # 95.2 0.00023 1.4E-07 40.5 7.0 95 1-116 16-116 (118) 144 protein:vir:96012 Length: 133 95.2 0.00024 1.5E-07 40.4 7.1 94 1-116 23-126 (133) 145 protein:vir:99546 Length: 200 95.1 0.00012 7.2E-08 42.1 5.1 112 1-116 1-131 (200) 146 protein:vir:96973 Length: 133 95.0 0.0003 1.8E-07 39.9 7.2 93 1-116 24-132 (133) 147 protein:vir:94419 Length: 133 95.0 0.0003 1.8E-07 39.9 7.2 93 1-116 24-132 (133) 148 protein:vir:78644 Length: 133 95.0 0.0003 1.8E-07 39.9 7.2 93 1-116 24-132 (133) 149 protein:vir:9363 Length: 133 # 95.0 0.0003 1.8E-07 39.9 7.2 93 1-116 24-132 (133) 150 protein:vir:105825 Length: 108 95.0 4.9E-05 3.1E-08 44.2 2.7 84 1-84 18-108 (108) 151 protein:vir:102608 Length: 108 95.0 4.9E-05 3.1E-08 44.2 2.7 84 1-84 18-108 (108) 152 protein:vir:1838 Length: 149 # 94.8 0.00037 2.3E-07 39.4 7.0 97 1-116 7-148 (149) 153 protein:vir:2688 Length: 123 # 94.2 0.00055 3.4E-07 38.5 6.6 93 1-116 14-122 (123) 154 protein:vir:5703 Length: 150 # 94.2 0.00064 4E-07 38.1 7.0 97 1-116 7-149 (150) 155 protein:vir:2026 Length: 150 # 94.0 0.00066 4.1E-07 38.0 6.7 97 1-116 7-149 (150) 156 protein:vir:4460 Length: 170 # 93.9 0.00062 3.8E-07 38.2 6.4 104 1-116 20-155 (170) 157 protein:vir:100312 Length: 152 93.9 0.0013 7.8E-07 36.5 8.0 98 1-116 8-150 (152) 158 protein:vir:78894 Length: 105 93.6 0.00026 1.6E-07 40.3 3.8 85 1-116 10-100 (105) 159 protein:vir:6071 Length: 150 # 92.9 0.00076 4.7E-07 37.7 5.3 97 1-116 7-149 (150) 160 protein:vir:487 Length: 187 # 92.9 0.001 6.3E-07 37.0 5.9 104 1-116 33-172 (187) 161 protein:vir:95260 Length: 160 92.6 0.00033 2.1E-07 39.7 2.8 76 1-116 2-86 (160) 162 protein:vir:79115 Length: 148 91.4 0.0042 2.6E-06 33.6 7.5 97 1-116 7-147 (148) 163 protein:vir:4514 Length: 168 # 88.0 0.007 4.4E-06 32.4 6.0 103 1-116 19-154 (168) 164 protein:vir:80037 Length: 199 87.7 0.0016 1E-06 35.9 2.3 104 1-116 1-127 (199) 165 protein:vir:78163 Length: 92 # 83.8 0.0034 2.1E-06 34.1 2.0 73 1-76 16-92 (92) 166 protein:vir:1087 Length: 161 # 83.2 0.017 1E-05 30.4 5.6 104 1-116 12-148 (161) 167 protein:vir:1028 Length: 168 # 81.6 0.03 1.8E-05 29.0 6.3 101 1-116 8-152 (168) 168 protein:vir:7412 Length: 168 # 80.3 0.04 2.5E-05 28.3 6.5 101 1-116 8-152 (168) 169 protein:vir:96105 Length: 193 76.3 0.01 6.4E-06 31.5 2.1 40 1-40 131-193 (193) 170 protein:vir:78607 Length: 155 75.6 0.011 6.9E-06 31.3 2.1 41 1-41 99-155 (155) 171 protein:vir:80037 Length: 199 75.5 0.017 1E-05 30.3 3.0 42 1-42 136-199 (199) 172 protein:vir:94069 Length: 168 74.7 0.013 7.9E-06 31.0 2.2 51 1-66 102-168 (168) 173 protein:vir:96763 Length: 177 74.3 0.063 3.9E-05 27.2 5.9 116 1-116 29-162 (177) 174 protein:vir:1164 Length: 156 # 74.2 0.1 6.5E-05 26.0 7.0 97 1-116 8-151 (156) 175 protein:vir:99546 Length: 200 73.5 0.011 7.1E-06 31.2 1.6 40 1-40 130-200 (200) 176 protein:vir:106728 Length: 155 73.0 0.015 9.3E-06 30.6 2.1 41 1-41 99-155 (155) 177 protein:vir:6154 Length: 119 # 64.6 0.0051 3.1E-06 33.2 -2.3 92 1-116 22-116 (119) 178 protein:vir:8432 Length: 149 # 56.9 0.31 0.0002 23.3 6.1 93 1-116 26-143 (149) 179 protein:vir:3994 Length: 168 # 56.1 0.28 0.00017 23.6 5.6 102 1-116 8-152 (168) 180 protein:vir:99454 Length: 150 38.1 0.8 0.0005 21.1 5.2 111 1-112 11-150 (150) 181 protein:vir:4096 Length: 140 # 35.8 0.54 0.00033 22.1 3.8 89 1-116 26-126 (140) 182 protein:vir:3427 Length: 192 # 20.8 2.7 0.0017 18.2 7.8 115 1-116 21-179 (192) No 1 >protein:vir:95062 Length: 116 # NCBI annotation: ORF044 # Family: family:all:180 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240827;genbank:gi:66394711;genbank:GeneID:5133856 Probab=100.00 E-value=1.2e-38 Score=228.42 Aligned_cols=116 Identities=98% Similarity=1.566 Sum_probs=114.8 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCccccccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWS 80 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~ 80 (116) |+++++++|++++..|+++||+++|+|||+|++||+++++.++++++|+++++||+||||||++|++++...++++++|. T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~apv~TG~Lr~SI~~~~~~~~~~~~V~~~~~Ya~yvE~GTg~~~~~~~~~~~~~~~~~ 80 (116) T protein:vir:95 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDGGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKNIPWS 80 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHhhCCccccccccceeEEeecCcEEEEEecCCCccceeecCccccccCCCccccccccce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 81 YKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 81 ~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) +....|++++|+||+|||||+||+++++++|+++|| T Consensus 81 ~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~is 116 (116) T protein:vir:95 81 YKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred eecCccceeeCCCCCCCcchHHHHHHHHHHHHHhhC Confidence 999999999999999999999999999999999999 No 2 >protein:vir:1243 Length: 116 # NCBI annotation: similar to phage Spp1 gp16.1 # Family: family:all:180 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510942;genbank:gi:17426276;genbank:GeneID:927389 Probab=100.00 E-value=3e-38 Score=226.30 Aligned_cols=116 Identities=99% Similarity=1.573 Sum_probs=114.7 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCccccccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWS 80 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~ 80 (116) |+++++++|++++..|+++||+++|+|||+|++||+++++.++++++|+++++||+||||||++|++++.+...++++|. T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~V~~~~~YA~yvE~GTg~~~~~~~~~~~~~~~~~ 80 (116) T protein:vir:12 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDGGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWS 80 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcCcccccccceEEeecCcEEEEEecCCCcccccccCCcccccCCCccccccccee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 81 YKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 81 ~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) +....|++++|+||+|||||+||+++++++|+++|| T Consensus 81 ~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 116 (116) T protein:vir:12 81 YKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred eecCCceeeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 999999999999999999999999999999999999 No 3 >protein:vir:97327 Length: 116 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240615;genbank:gi:66396305;genbank:GeneID:5133683 Probab=100.00 E-value=3e-38 Score=226.30 Aligned_cols=116 Identities=99% Similarity=1.573 Sum_probs=114.7 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCccccccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWS 80 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~ 80 (116) |+++++++|++++..|+++||+++|+|||+|++||+++++.++++++|+++++||+||||||++|++++.+...++++|. T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~V~~~~~YA~yvE~GTg~~~~~~~~~~~~~~~~~ 80 (116) T protein:vir:97 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDGGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWS 80 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcCcccccccceEEeecCcEEEEEecCCCcccccccCCcccccCCCccccccccee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 81 YKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 81 ~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) +....|++++|+||+|||||+||+++++++|+++|| T Consensus 81 ~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 116 (116) T protein:vir:97 81 YKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred eecCCceeeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 999999999999999999999999999999999999 No 4 >protein:vir:107099 Length: 137 # NCBI annotation: conserved phage protein # Family: family:all:180 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950610;genbank:gi:119953690;genbank:GeneID:4643108 Probab=100.00 E-value=1e-36 Score=217.83 Aligned_cols=116 Identities=82% Similarity=1.403 Sum_probs=113.5 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCccccccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWS 80 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~ 80 (116) +++.++++|++++..|+++|+++||+|||+|++||.++++.++++++|+++++||+||||||++|..++..+..+.+.|. T Consensus 22 ~~~~~~~al~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~Ya~~vE~GT~~~~~~~~~~~~~~~~~~ 101 (137) T protein:vir:10 22 TIRWAKKGIAKTTTIIHNSIVSNMPVDTGYLRESVSMDFKKGGLTGVINIGSEYAVYVNYGTGIYAVGPGGSRAKNIPWC 101 (137) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCeeEEeeCCcEEEEEecCCCcccccccCccccccCCCccccccccce Confidence 88899999999999999999999999999999999998888999999999999999999999999999999999999999 Q ss_pred ccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 81 YKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 81 ~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) ++...+++++|+||||||||+||+++++++|+++|| T Consensus 102 ~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~i~k~i~ 137 (137) T protein:vir:10 102 YKDADGHWHTTKGQHAQPFWEPAIDEGRAFFNKYFS 137 (137) T ss_pred eeccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Confidence 999999999999999999999999999999999999 No 5 >protein:vir:96121 Length: 137 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240082;genbank:gi:66395767;genbank:GeneID:5133101 Probab=100.00 E-value=1.1e-36 Score=217.68 Aligned_cols=116 Identities=64% Similarity=1.218 Sum_probs=113.8 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCccccccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWS 80 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~ 80 (116) +++.++++|++++..++++|++++|+|||+|++||.+++..++++++|+++++||+||||||++|..++.+...+++.|. T Consensus 22 ~~~~~~~~l~~~a~~~~~~ak~~~pvdTG~L~~Si~~~~~~~g~~~~V~~~~~YA~yvE~GT~~~~~~~~~~~~~~~~~~ 101 (137) T protein:vir:96 22 MEEWVKKGILKTTLAIYNTAVALAPVDLGFLKESIDFKVTDGGFSSVISVGAEYAIYVEFGTGIYATGPGGSRARKLPWT 101 (137) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCcCccchhcCceeEeecCceEEEEecCCCcccccccCccccccCCCccccccccce Confidence 88899999999999999999999999999999999999899999999999999999999999999999999999999999 Q ss_pred ccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 81 YKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 81 ~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) +....+++++|+||||||||+||+++++++|+++|| T Consensus 102 ~~~~~~~~~~t~g~~a~pFl~pA~~~~~~~i~k~i~ 137 (137) T protein:vir:96 102 YKGDDGEWHTTYGQQAQPFWNPAIDEGRKVFNRYFS 137 (137) T ss_pred eeccCcceeecCCCCCCcchhHHHHHHHHHHHHhhC Confidence 999999999999999999999999999999999999 No 6 >protein:vir:105330 Length: 137 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950673;genbank:gi:119967843;genbank:GeneID:4643209 Probab=100.00 E-value=9.9e-37 Score=217.97 Aligned_cols=116 Identities=82% Similarity=1.402 Sum_probs=113.5 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCccccccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWS 80 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~ 80 (116) +++.++++|++++..|+++|++++|+|||+|++||+++++.++++++|+++++||+||||||++|.+++....++.++|. T Consensus 22 ~~~~~~~al~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~ 101 (137) T protein:vir:10 22 TIRWAKKGIAKTTTIIHNSIVSNMPVDTGYLRESVSMDFKKGGLTGVINIGSEYAVYVNYGTGIYAVGPGGSRAKNIPWR 101 (137) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCeeeEecCCcEEEEEecCCccccccccCccccccCCCccccccccee Confidence 88899999999999999999999999999999999999998999999999999999999999999999999999999999 Q ss_pred ccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 81 YKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 81 ~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) +....+++++|+||||||||+||+++++++|.++|| T Consensus 102 ~~~~~~~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 137 (137) T protein:vir:10 102 YKDADGHWHTTKGQHAQPFWEPAIDEGRAFFNKYFS 137 (137) T ss_pred eeccccccccCCCCCCCcchhHHHHHHHHHHHHhhC Confidence 999999999999999999999999999999999999 No 7 >protein:vir:94796 Length: 137 # NCBI annotation: ORF050 # Family: family:all:180 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240540;genbank:gi:66396237;genbank:GeneID:5133576 Probab=100.00 E-value=1.1e-36 Score=217.68 Aligned_cols=116 Identities=97% Similarity=1.554 Sum_probs=113.8 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCccccccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWS 80 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~ 80 (116) +++.++++|++++..|+++|+.++|+|||+|++||+++++.++++++|+++++||+||||||++|.+.+.+...+.++|. T Consensus 22 ~~~~~~~al~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~ 101 (137) T protein:vir:94 22 IERWVKRGIAKTTVKIHNTIISLMPVDTGYLRESVTMDFKDGGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWS 101 (137) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCceeEeecCcEEEEEecCCCcccccccCccccccCCCcccccccccc Confidence 88999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 81 YKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 81 ~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) +....+.+++|+||||||||+||+++++++|.++|| T Consensus 102 ~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:94 102 YKDANGKWHTTKGQHAQPFWEPAIDAGRVFFNKYFS 137 (137) T ss_pred eeccCCceeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 999999999999999999999999999999999999 No 8 >protein:vir:94108 Length: 149 # NCBI annotation: ORF029 # Family: family:all:180 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240238;genbank:gi:66395914;genbank:GeneID:5133277 Probab=100.00 E-value=2.2e-36 Score=216.04 Aligned_cols=116 Identities=63% Similarity=1.171 Sum_probs=113.6 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCccccccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWS 80 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~ 80 (116) ++++++++|++++..|+++|+.++|+|||+|++||.+++..++++++|+++++||+||||||++|.+++.+..+.+++|. T Consensus 34 ~~~~~~~al~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~g~~~~V~~~~~YA~~VE~GT~~~~~~~~~~~~~~~~~~ 113 (149) T protein:vir:94 34 IEEWVKKGIAKTTTKIYNTAVALAPVDLGFLEESIDFKYFDGGLSSVISVGADYAIYVEYGTGIYATGPGGSRATKIPWS 113 (149) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCcccchhhcCeeEEeeCCcEEEEEecCCCcccccccCccccccCCCccccccccce Confidence 88899999999999999999999999999999999999888899999999999999999999999999999999999999 Q ss_pred ccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 81 YKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 81 ~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) +....+++++|+||||||||+||+++++++|.++|+ T Consensus 114 ~~~~~~~~~~~~g~~a~PFl~pA~~~~~~~i~~~i~ 149 (149) T protein:vir:94 114 FKGDDGEWYTTYGQAPQPFWNPAIDAGRKTFEQYFS 149 (149) T ss_pred eecCccceecCCCCCCCcchHHHHHHHHHHHHHhhC Confidence 999999999999999999999999999999999999 No 9 >protein:vir:94490 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240680;genbank:gi:66396374;genbank:GeneID:5133754 Probab=100.00 E-value=2.4e-36 Score=215.82 Aligned_cols=116 Identities=100% Similarity=1.578 Sum_probs=113.8 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCccccccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWS 80 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~ 80 (116) +++.++++|++++..|+++|+.++|+|||+|++||+++++.++++++|+++++||+||||||++|.+.+.+...+.+.|. T Consensus 22 ~~~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~ 101 (137) T protein:vir:94 22 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWS 101 (137) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEeecCceEEEEecCCCcccccccCccccccCCCcccccccccc Confidence 88999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 81 YKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 81 ~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) +....+.+++|+||||||||+||+++++++|+++|| T Consensus 102 ~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:94 102 YKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred eeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 999999999999999999999999999999999999 No 10 >protein:vir:97427 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240753;genbank:gi:66396447;genbank:GeneID:5133783 Probab=100.00 E-value=2.4e-36 Score=215.82 Aligned_cols=116 Identities=100% Similarity=1.578 Sum_probs=113.8 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCccccccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWS 80 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~ 80 (116) +++.++++|++++..|+++|+.++|+|||+|++||+++++.++++++|+++++||+||||||++|.+.+.+...+.+.|. T Consensus 22 ~~~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~ 101 (137) T protein:vir:97 22 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWS 101 (137) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEeecCceEEEEecCCCcccccccCccccccCCCcccccccccc Confidence 88999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 81 YKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 81 ~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) +....+.+++|+||||||||+||+++++++|+++|| T Consensus 102 ~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:97 102 YKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred eeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 999999999999999999999999999999999999 No 11 >protein:vir:93738 Length: 137 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240463;genbank:gi:66396153;genbank:GeneID:5133507 Probab=100.00 E-value=2.4e-36 Score=215.82 Aligned_cols=116 Identities=100% Similarity=1.578 Sum_probs=113.8 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCccccccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWS 80 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~ 80 (116) +++.++++|++++..|+++|+.++|+|||+|++||+++++.++++++|+++++||+||||||++|.+.+.+...+.+.|. T Consensus 22 ~~~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~ 101 (137) T protein:vir:93 22 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWS 101 (137) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEeecCceEEEEecCCCcccccccCccccccCCCcccccccccc Confidence 88999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 81 YKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 81 ~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) +....+.+++|+||||||||+||+++++++|+++|| T Consensus 102 ~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:93 102 YKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred eeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 999999999999999999999999999999999999 No 12 >protein:vir:95894 Length: 137 # NCBI annotation: ORF046 # Family: family:all:180 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240389;genbank:gi:66396083;genbank:GeneID:5133405 Probab=100.00 E-value=4.3e-36 Score=214.50 Aligned_cols=116 Identities=99% Similarity=1.573 Sum_probs=113.7 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCccccccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWS 80 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~ 80 (116) +++.++++|++++..++++|+.++|+|||+|++||++++..++++++|+++++||+||||||++|.+.+...+.+++.|. T Consensus 22 ~~~~~~~~~~~~a~~v~~~ak~~aPv~TG~L~~Si~~~~~~~~~~~~V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~ 101 (137) T protein:vir:95 22 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDGGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWS 101 (137) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcCeeeEeeCCceEEEEecCCCcccccccCccccccCCCcccccccccc Confidence 88999999999999999999999999999999999999988999999999999999999999999999999999999999 Q ss_pred ccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 81 YKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 81 ~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) +....+.+++|+||||||||+||+++++++|.++|| T Consensus 102 ~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~i~k~l~ 137 (137) T protein:vir:95 102 YKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred eeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 999999999999999999999999999999999999 No 13 >protein:vir:105916 Length: 149 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004379;genbank:gi:122891834;genbank:GeneID:4712387 Probab=100.00 E-value=5.3e-36 Score=213.99 Aligned_cols=116 Identities=63% Similarity=1.171 Sum_probs=113.6 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCccccccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWS 80 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~ 80 (116) ++++++++|++++..|+++|+.++|+|||+|++||.+++..++++++|+++++||+||||||++|..++....+..++|. T Consensus 34 ~~~~~~~~l~~~a~~v~~~ak~~aPvdTG~L~~SI~~~~~~~g~~~~V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~ 113 (149) T protein:vir:10 34 IEEWVKKGIAKTTTKIYNTAVALAPVDLGFLEESIDFKYFDGGLSSVISVGADYAIYVEYGTGIYATGPGGSRATKIPWS 113 (149) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCcccchhhccceEEecCCcEEEEEecCCCcccccccCccccccCCcccccccccce Confidence 89999999999999999999999999999999999999888899999999999999999999999999999999999999 Q ss_pred ccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 81 YKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 81 ~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) +....+++++|+||||||||+||+++++++|+++|+ T Consensus 114 ~~~~~~~~~~t~g~~a~PFl~pA~~~~k~~i~~~i~ 149 (149) T protein:vir:10 114 FKGDDGEWYTTYGQAPQPFWNPAIDAGRKTFEQYFS 149 (149) T ss_pred eeccccceecCCCCCCCcchhHHHHHHHHHHHHhhC Confidence 999999999999999999999999999999999999 No 14 >protein:vir:96829 Length: 135 # NCBI annotation: ORF033 # Family: family:all:180 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240161;genbank:gi:66395838;genbank:GeneID:5133170 Probab=99.96 E-value=6.1e-34 Score=202.67 Aligned_cols=114 Identities=73% Similarity=1.284 Sum_probs=108.0 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCccccccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWS 80 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~ 80 (116) ++++++++|.++++.|+++|+.++|+|||+|++||.++++.++++++|+++++||+|||+||++|.+.+. ..+++.|+ T Consensus 22 ~~~~~~~al~~~a~~v~~~ak~~apvdTG~Lr~SI~~~~~~~g~~~~V~~~~~YA~~ve~GT~~~~~~~~--~~~~~~~~ 99 (135) T protein:vir:96 22 MEKWVKKGITKTTLKIYNTAIHLMPVDTGFLRQSTTVDFENGGFTGVVKIGSNYAVYVNYGTGIYATKGS--RAHKIPWT 99 (135) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeEEeecCcEEEEEecCCCccchhhcccccccCCCc--cccccccc Confidence 8889999999999999999999999999999999999999999999999999999999999999987664 44678888 Q ss_pred ccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 81 YKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 81 ~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) +....+.+++|+||||||||+||+++++++|.+.|+ T Consensus 100 ~~~~~g~~~~~~~~~a~pfl~~A~~~~~~~~~~~i~ 135 (135) T protein:vir:96 100 YKDPNGKWHTTYGQMPQPFWEPAIDAGRQTFEQYFS 135 (135) T ss_pred cccCCcceeecCCcCCCcchhHHHHHHHHHHHHhcC Confidence 888999999999999999999999999999999999 No 15 >protein:vir:78077 Length: 141 # NCBI annotation: gp9 # Family: family:all:180 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468793;genbank:gi:157325374;genbank:GeneID:5601839 Probab=99.96 E-value=1.3e-33 Score=200.96 Aligned_cols=113 Identities=25% Similarity=0.360 Sum_probs=100.9 Q ss_pred ChHHHHHHHH-----HHHHHHHHHHHHhCCcccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCcc Q lcl|NC_021326. 1 MERWVKRGIA-----KTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAK 75 (116) Q Consensus 1 i~~~~~~~~~-----~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~ 75 (116) +++.+.++++ .+++.++..|+.++|||||+|++||.+++..++++++|+++++||+|||+|||+|..++.+ + T Consensus 17 ~~k~~~~~~~~~a~~~~~~~ie~~ak~~~pvdtG~L~~SI~~~v~~~g~~~~V~~~~~YA~yVE~GTG~~~~~~~g---r 93 (141) T protein:vir:78 17 IEKKVLQALEDIGEHMTTELAEGGHGVTSNNDTGEYAQKSGYKVRKSSKEVIVGNSSDYAIYYEFGTGEKSERGGG---K 93 (141) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhccccccchhhcceeeeeecCCcEEEEecCCCccceeecCCcccccCCCC---C Confidence 5555555444 4455678999999999999999999999988889999999999999999999999988865 7 Q ss_pred cccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 76 KIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 76 ~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) +++|+|.+..|++++|+||+|||||+||+++++++|.+.|. T Consensus 94 k~~w~y~~~~g~~~~t~G~~aqpFl~~A~~~~~~~i~~~i~ 134 (141) T protein:vir:78 94 AGGWFYMDKKGHWHFTRGSQASKRMRYTFRDEQDKVRVFTE 134 (141) T ss_pred cCcceeecCCCeeEeccCCCCchhhhhhHHhhHHHHHHHHH Confidence 89999999999999999999999999999999999998888 No 16 >protein:vir:5978 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690678;genbank:geneid:6329146;genbank:gi:22855072;interpro:IPR011693;uniprot:O48447;genbank:GeneID:955318 Probab=99.96 E-value=2.5e-33 Score=199.32 Aligned_cols=113 Identities=43% Similarity=0.816 Sum_probs=103.8 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCccccccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWS 80 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~ 80 (116) ++++++++|.++++.++++|+.++|+|||+|++||+++++.++++++|+++++||+||||||++|.+.+.. +..+|. T Consensus 27 ~~~~v~~~l~~~a~~i~~~ak~~apv~TG~Lr~SI~~~~~~~g~~~~V~~~~~YA~~vE~GT~~~~~~~~~---~~~~~~ 103 (144) T protein:vir:59 27 VLTQVEQVIIKTAEKIAGLAASLAPVDEGNLKNSIQIDYKNNGLTAEITVGAEYAIYVEYGTGIYAVDGNG---RKTPWT 103 (144) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcCeeEEeecCcEEEEEecCCCccchhhcCccccccCCCc---cccccc Confidence 88899999999999999999999999999999999999999999999999999999999999999988765 444565 Q ss_pred ccc-cccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 81 YKD-ANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 81 ~~~-~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) +.. ..+.+++|+||||||||+||++++++.|.++|- T Consensus 104 ~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~~~~~i~ 140 (144) T protein:vir:59 104 YYSPKLGRYVRTQGAPAQPFFWPAVEEGGEYFEREMR 140 (144) T ss_pred cccccccceecCCCCCCCcchhHHHHHHHHHHHHHHH Confidence 544 568899999999999999999999999999999 No 17 >protein:vir:8669 Length: 142 # NCBI annotation: gp27 # Family: family:all:1084 # MgeID: mge:156 # MgeName: Rosebush # Cross-refs: genbank:acc:NP_817788;genbank:gi:29566220;genbank:GeneID:1259476 Probab=99.95 E-value=2.3e-32 Score=193.99 Aligned_cols=116 Identities=22% Similarity=0.188 Sum_probs=105.8 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecC----cEEEEEecCCccccccccCCcccccCCCCcCccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDS----GFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKK 76 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~----~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~ 76 (116) +++.++++|++++..|+++||+++|||||+|++||.+++..+ ++++.|+++++||+||||||++|.++|+..+++. T Consensus 22 ~~~~~~~~i~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~~v~~~a~YA~~ve~GT~ph~i~pk~~~al~ 101 (142) T protein:vir:86 22 VGPILRRTHSSLTRQIANETRARVPVLTGHLGRSVREDPQVMVTPFHVSGGVTAHAKYAAAVHEGTRPHVIRAKHAQALH 101 (142) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeeeeccccccceEEEEeccCccccceeccCCccceeccccCceee Confidence 888999999999999999999999999999999998876533 4678899999999999999999999999999999 Q ss_pred ccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 77 IPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 77 ~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) +.|.+...+.+.++|||++|||||+||+++++.+...... T Consensus 102 f~~~g~~~~~k~v~hpG~~a~Pfl~~A~~~~~~~~~~~~~ 141 (142) T protein:vir:86 102 FWWRGREVFVRQVNHPGTRARPYLRNAGEAVVRRDRRIRV 141 (142) T ss_pred EecCCceeeeeeeecCCCCCCchhHHHHHHHHhhhhhhcc Confidence 9999999999999999999999999999998877555555 No 18 >protein:vir:99101 Length: 142 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1608 # MgeName: Qyrzula # Cross-refs: genbank:acc:YP_655705;genbank:gi:109521783;genbank:GeneID:4157823 Probab=99.95 E-value=2.3e-32 Score=193.99 Aligned_cols=116 Identities=22% Similarity=0.188 Sum_probs=105.8 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecC----cEEEEEecCCccccccccCCcccccCCCCcCccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDS----GFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKK 76 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~----~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~ 76 (116) +++.++++|++++..|+++||+++|||||+|++||.+++..+ ++++.|+++++||+||||||++|.++|+..+++. T Consensus 22 ~~~~~~~~i~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~~v~~~a~YA~~ve~GT~ph~i~pk~~~al~ 101 (142) T protein:vir:99 22 VGPILRRTHSSLTRQIANETRARVPVLTGHLGRSVREDPQVMVTPFHVSGGVTAHAKYAAAVHEGTRPHVIRAKHAQALH 101 (142) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeeeeccccccceEEEEeccCccccceeccCCccceeccccCceee Confidence 888999999999999999999999999999999998876533 4678899999999999999999999999999999 Q ss_pred ccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 77 IPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 77 ~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) +.|.+...+.+.++|||++|||||+||+++++.+...... T Consensus 102 f~~~g~~~~~k~v~hpG~~a~Pfl~~A~~~~~~~~~~~~~ 141 (142) T protein:vir:99 102 FWWRGREVFVRQVNHPGTRARPYLRNAGEAVVRRDRRIRV 141 (142) T ss_pred EecCCceeeeeeeecCCCCCCchhHHHHHHHHhhhhhhcc Confidence 9999999999999999999999999999998877555555 No 19 >protein:vir:106570 Length: 182 # NCBI annotation: putative protein # Family: family:all:6475 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958588;genbank:gi:41179258;genbank:GeneID:2717106 Probab=99.95 E-value=8.1e-32 Score=191.05 Aligned_cols=116 Identities=20% Similarity=0.245 Sum_probs=92.6 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEee--cCcEEEEEecCCccccccccCCcccccCCCC------- Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFK--DSGFTGVINIGSEYAIYVNYGTGIYATGAGG------- 71 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~--~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~------- 71 (116) |++++++++++++..|+++|+.++|||||+|++||.+++. .+++++.|+++++||+|||||||++...... T Consensus 27 v~~a~~~~~~~~a~~v~~~ak~~~PvdtG~Lr~SI~~~~~~~~~~~~g~V~~~~~ya~yvE~GTG~~~~~~~~~~~p~~~ 106 (182) T protein:vir:10 27 TANAQENAIEQAEAYAVDELQSSIKYSTGELTRSFKHEVKVDGDEVIGRWWNSSMVAVFREFGTGLVGERSHKQLPKNVA 106 (182) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhCCCCchhhhhceeeeeeecCCeEEEEeecCCCccceeecCcccccccCccccCccce Confidence 5555666667777888899999999999999999987654 4568999999999999999999998643221 Q ss_pred cCcccccccccc------------------cccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 72 SRAKKIPWSYKD------------------ANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 72 ~~~~~~~~~~~~------------------~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) ...++.+|+++. ..+.+++|+||||||||+||++++++++.++|. T Consensus 107 ~~~~~~~w~~~~~~v~~~~a~~~~~~~~~~~~~~~~~t~G~~aqPFl~pA~~~~~~~i~~~i~ 169 (182) T protein:vir:10 107 IIYRQTPWFFPVDSVDLDLTKIYGIPKIKINGKYFYRTTGQPARQFMTPAANKMAKEAPEIIK 169 (182) T ss_pred eeeecCCceeeccccccccccccccceeeecCceEeecCCCCCCcchHHHHHHhHHHHHHHHH Confidence 112455665432 236778999999999999999999999999888 No 20 >protein:vir:94654 Length: 142 # NCBI annotation: tail component protein # Family: family:all:1084 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579211;genbank:gi:93007447;genbank:GeneID:5076773 Probab=99.95 E-value=1.1e-31 Score=190.24 Aligned_cols=114 Identities=26% Similarity=0.328 Sum_probs=103.1 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecCc--EEEEEecCCccccccccCCcccccCCCCcCccccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSG--FTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIP 78 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~--~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~ 78 (116) ++++++++|++++..++++|++++|+|||+|++||++++..++ +++.|+++++||+||||||++|.+.|+.++.+. T Consensus 24 ~~~~~~~~l~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~g~~~~~~v~~~~~YA~~vE~Gt~~~~i~pk~~k~l~-- 101 (142) T protein:vir:94 24 LTGAAREATEAAANDMVNMAKGLCPVDTGRLRSSIQAVPSGGRFSFSVTIGTNVTYAADVEYGTAPHVIVPKDKKALY-- 101 (142) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeccCCceEEEEEecCcccchhhhccCCCceeccCCCccce-- Confidence 7889999999999999999999999999999999998877654 678999999999999999999999988877774 Q ss_pred ccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 79 WSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 79 ~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) |.....+.+.+.|||++|||||+||+++++++|.+.|. T Consensus 102 ~~~~~~~~~~v~~pG~~~~pfl~~A~~~~~~~i~~~~~ 139 (142) T protein:vir:94 102 WPGAAHPVAKVNHPGTRAQPFMRPAIAAASTFLRNHAK 139 (142) T ss_pred ecccceeeeeeeecCCCCCcchhHHHHHHHHHHHHHHH Confidence 44455677889999999999999999999999999998 No 21 >protein:vir:107545 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1481 # MgeName: PG1 # Cross-refs: genbank:acc:NP_943803;genbank:gi:38638428;genbank:GeneID:2657225 Probab=99.95 E-value=6.3e-32 Score=191.63 Aligned_cols=114 Identities=18% Similarity=0.139 Sum_probs=103.6 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecC---cEEEEEecCCccccccccCCcccccCCCCcCcccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDS---GFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKI 77 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~---~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~ 77 (116) +...+++.+++++..++.+||.++|||||+||+||..+...+ .+++.|+++++||+||||||++|.++|+.++.+.+ T Consensus 21 ~~~~~~~~~~~~~~~v~~~ak~~aPvdtG~Lr~SI~~~~~~~~~~~~~~~v~~~a~YA~~Ve~GT~ph~I~pk~~k~L~~ 100 (140) T protein:vir:10 21 SGEHLRAFHRSLTRRIANQSRVAVPVRTGNLGRTIGELPQVYTPFRVRGGVEATADYAAPVHEGSRPHAIRARNAQYLHF 100 (140) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCccchhhhccceeeeeeCCCceEEEEecCCccchhhhccCCCCceeecCCCcccee Confidence 788889999999999999999999999999999999865543 37788999999999999999999999999999999 Q ss_pred cccccccccceeccCCCCCCcchhHHHHHH---HHHHHHh Q lcl|NC_021326. 78 PWSYKDANGKWHTTKGQHAQPFWEPAIDAG---RAFFNKY 114 (116) Q Consensus 78 ~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~---k~~i~~~ 114 (116) +|.+...+.+.++|||++|||||+||+++. +++|+.- T Consensus 101 ~~~G~~~~~k~V~hpG~~a~Pfl~~A~~~~~~~~~~i~~~ 140 (140) T protein:vir:10 101 WWHGREMFRKSVWHPGTRARPFMRNSAQRVVTNDPRVRMT 140 (140) T ss_pred ecCCCEEEeeeeecCCCCCChhHHHHHHHHhhhhhhccCC Confidence 998888899999999999999999999984 5666655 No 22 >protein:vir:97982 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1482 # MgeName: Orion # Cross-refs: genbank:acc:YP_655121;genbank:gi:109391871;genbank:GeneID:4157345 Probab=99.95 E-value=6.3e-32 Score=191.63 Aligned_cols=114 Identities=18% Similarity=0.139 Sum_probs=103.6 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecC---cEEEEEecCCccccccccCCcccccCCCCcCcccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDS---GFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKI 77 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~---~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~ 77 (116) +...+++.+++++..++.+||.++|||||+||+||..+...+ .+++.|+++++||+||||||++|.++|+.++.+.+ T Consensus 21 ~~~~~~~~~~~~~~~v~~~ak~~aPvdtG~Lr~SI~~~~~~~~~~~~~~~v~~~a~YA~~Ve~GT~ph~I~pk~~k~L~~ 100 (140) T protein:vir:97 21 SGEHLRAFHRSLTRRIANQSRVAVPVRTGNLGRTIGELPQVYTPFRVRGGVEATADYAAPVHEGSRPHAIRARNAQYLHF 100 (140) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCccchhhhccceeeeeeCCCceEEEEecCCccchhhhccCCCCceeecCCCcccee Confidence 788889999999999999999999999999999999865543 37788999999999999999999999999999999 Q ss_pred cccccccccceeccCCCCCCcchhHHHHHH---HHHHHHh Q lcl|NC_021326. 78 PWSYKDANGKWHTTKGQHAQPFWEPAIDAG---RAFFNKY 114 (116) Q Consensus 78 ~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~---k~~i~~~ 114 (116) +|.+...+.+.++|||++|||||+||+++. +++|+.- T Consensus 101 ~~~G~~~~~k~V~hpG~~a~Pfl~~A~~~~~~~~~~i~~~ 140 (140) T protein:vir:97 101 WWHGREMFRKSVWHPGTRARPFMRNSAQRVVTNDPRVRMT 140 (140) T ss_pred ecCCCEEEeeeeecCCCCCChhHHHHHHHHhhhhhhccCC Confidence 998888899999999999999999999984 5666655 No 23 >protein:vir:106041 Length: 137 # NCBI annotation: gp23 # Family: family:all:1084 # MgeID: mge:1505 # MgeName: Cooper # Cross-refs: genbank:acc:YP_654920;genbank:gi:109392376;genbank:GeneID:4157069 Probab=99.94 E-value=9.7e-32 Score=190.60 Aligned_cols=114 Identities=18% Similarity=0.125 Sum_probs=102.9 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecC---cEEEEEecCCccccccccCCcccccCCCCcCcccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDS---GFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKI 77 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~---~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~ 77 (116) ++..++++|++++..++.+||.++|||||+||+||+++...+ ++++.|+++++||+||||||++|.+.|+.++.++| T Consensus 18 v~~~~k~~l~~~a~~i~~~ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~ve~GT~ph~I~pk~~k~l~f 97 (137) T protein:vir:10 18 TGAIFRGKHRSITRRIATQARADVPVRTGNLGRGIQEMPQTYRPFHVGGGVEDNVDYAAPVHEGSRPHRITARHANALHF 97 (137) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCcccchhhcCceeeeeccccceEEEEEecCCCceeeeeecCCCceeecccCceeee Confidence 788889999999999999999999999999999999876543 47889999999999999999999999999999999 Q ss_pred cccccccccceeccCCCCCCcchhHHHHHH---HHHHHHh Q lcl|NC_021326. 78 PWSYKDANGKWHTTKGQHAQPFWEPAIDAG---RAFFNKY 114 (116) Q Consensus 78 ~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~---k~~i~~~ 114 (116) +|.+...+.+.++|||++|||||+||+++. +++|.-- T Consensus 98 ~~~G~~v~~k~v~hpG~~a~Pfl~~A~~~~~~~~~ri~~~ 137 (137) T protein:vir:10 98 FWHGREVFRKSVWHPGVRPRPFLRNAARRVVAADPDIHMT 137 (137) T ss_pred eeCCceEEeeeeecCCCCCCchHHHHHHHHhhccccccCC Confidence 999888888999999999999999999985 4555433 No 24 >protein:vir:102441 Length: 137 # NCBI annotation: gp26 # Family: family:all:1084 # MgeID: mge:1618 # MgeName: Pipefish # Cross-refs: genbank:acc:YP_655303;genbank:gi:109521866;genbank:GeneID:4157756 Probab=99.93 E-value=1.3e-30 Score=184.52 Aligned_cols=115 Identities=19% Similarity=0.176 Sum_probs=101.7 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecC----cEEEEEecCCccccccccCCcccccCCCCcC-cc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDS----GFTGVINIGSEYAIYVNYGTGIYATGAGGSR-AK 75 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~----~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~-~~ 75 (116) +...++++|++++..|+++||+++|||||+|++||.+++..+ ++++.|+++++||+||||||++|.++|+.++ .+ T Consensus 18 ~~~v~r~~l~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~~V~~~~~YA~~ve~GT~ph~I~Pk~~k~~l 97 (137) T protein:vir:10 18 FQVIARRRLSRITRGTANQARADVPVKTGNLGRSIREDPIVVAGPLRLDSGVTAHADYARYVHDGTRAHVIRPRRPGGVL 97 (137) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCccchhhhcCceeeeeeccccceEEEEecCCCccceeeecCCCCceeeccccceee Confidence 888999999999999999999999999999999998865432 3678899999999999999999999998755 77 Q ss_pred cccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhc Q lcl|NC_021326. 76 KIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYF 115 (116) Q Consensus 76 ~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i 115 (116) .|.+.+...+++.+.|||++|||||+||+++++++.-.-- T Consensus 98 ~~~~~g~~vf~k~V~hPG~~a~PfL~~A~~~~~~~~~~~~ 137 (137) T protein:vir:10 98 RFTVGGRVVYARRVNHPGTRARPFLRNAAERVVARETATS 137 (137) T ss_pred eEeeCCeeEecceeecCCCCCCchHHHHHHHhhhhhcccC Confidence 8777777788999999999999999999999998765433 No 25 >protein:vir:106506 Length: 137 # NCBI annotation: Pas21 # Family: family:all:1084 # MgeID: mge:1680 # MgeName: phiAsp2 # Cross-refs: genbank:acc:YP_024807;genbank:gi:48697422;genbank:GeneID:2846163 Probab=99.93 E-value=3.7e-30 Score=181.91 Aligned_cols=116 Identities=17% Similarity=0.200 Sum_probs=104.3 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeec-C--cEEEEEecCCccccccccCCcccccCCCCcCcccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKD-S--GFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKI 77 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~-~--~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~ 77 (116) +.+.+++++++.+..++.+||.++|+|||+|++||+.+... + .+++.|+++++||.|||+||++|.++|+.++++.| T Consensus 17 ~~~~~~~~~~~~a~~ve~~ak~~aPv~TG~Lr~SI~~~~~~~~g~~v~~~V~~~~~YA~~ve~GT~ph~I~pk~~kaL~f 96 (137) T protein:vir:10 17 GMDEARKAVNRVVRRTFTRSQILAPVDTGYLRASGRLVLGRERGAVVIGSVEYTARYAAAVHNGRRALTIRAKGNGRLKF 96 (137) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCcCchhhhccceeeeeeccccEEEEEecCCcccceeeecCCCCceeecCCCcccee Confidence 77888899999999999999999999999999999987653 2 36788999999999999999999999999999999 Q ss_pred cccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 78 PWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 78 ~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) .|.+...+.+.++|||++|+|||+||+++.+++.==++. T Consensus 97 ~~~G~~vf~k~V~hPG~k~~PfL~~Al~~~~~~~~~~~~ 135 (137) T protein:vir:10 97 TVEGRTVYARSVHQPARAGRPYLSQALREVAPQEGFRVT 135 (137) T ss_pred ecCCeeEeccceecCCCCCChhhHHHHHHhhcccceeEe Confidence 999888899999999999999999999998876444444 No 26 >protein:vir:101594 Length: 173 # NCBI annotation: hypothetical protein # Family: family:all:26502 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112510;genbank:gi:53793610;interpro:IPR010064;uniprot:Q5ZGE3;genbank:GeneID:3101702 Probab=99.93 E-value=8.1e-30 Score=180.07 Aligned_cols=116 Identities=18% Similarity=0.214 Sum_probs=95.5 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEee--cCcEEEEEecCCccccccccCCcccccCCCC------- Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFK--DSGFTGVINIGSEYAIYVNYGTGIYATGAGG------- 71 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~--~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~------- 71 (116) ++++++++|.+++..|+++|+.+||+|||+|++||.++.. .+++++.|++++.||.||||||+.+...|.. T Consensus 20 ~~~~~~~a~~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~~~~~~~~~~~v~~~~~Ya~fvEfGT~~m~a~P~~~~~~~~~ 99 (173) T protein:vir:10 20 IDKNINATTEEAANFIEDRAKTLAPKNFGKLAQSISTSDLKAKDLISKKITVNELYGAYMEFGTGAKVSVPKEFADMAAS 99 (173) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCcCchhhhhcceeeeeccCceeEEeeCCCcccchhhhcccccccCCCchhhhhhcc Confidence 8889999999999999999999999999999999987643 3458889999999999999999988776652 Q ss_pred cCccccccccc-------------------ccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 72 SRAKKIPWSYK-------------------DANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 72 ~~~~~~~~~~~-------------------~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) .....++|+.. .....++.||||+|||||+||++++++++.++|. T Consensus 100 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~G~~aqPFl~PA~~~~~~~~~~~i~ 163 (173) T protein:vir:10 100 FKGQKTGSFKDGLESIKAWCRAKGIDEKAAYPIFAKILGAGINPQPFLYPAWIEGKKQYLKDLE 163 (173) T ss_pred cccccccccccccccccccccccccchhcccceeeEeecCCCCCCccchhHHHHhHHHHHHHHH Confidence 22233333221 1223457799999999999999999999999998 No 27 >protein:vir:743 Length: 108 # NCBI annotation: unknown # Family: family:all:180 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108720;genbank:gi:13487842;genbank:GeneID:920877 Probab=99.88 E-value=2e-26 Score=161.46 Aligned_cols=87 Identities=36% Similarity=0.580 Sum_probs=82.8 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCccccccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWS 80 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~ 80 (116) .++.++++|.+++..|+++|+.+||+|||+|++||.++++.++++++|+++++||+||||||+ T Consensus 18 ~~~~~~~al~~~a~~i~~~ak~~aPv~TG~Lr~si~~~~~~~~~~~~V~~~~~Ya~~vE~GT~----------------- 80 (108) T protein:vir:74 18 TLDDVKHVVKSNTASMNKNMQNLAPVDTGNMKRSITSEFTDGGLSGTTGPHTDYAGYVEYGTR----------------- 80 (108) T ss_pred hHHHHHHHHHHHHHHHHHHHHHhCCCCchhhhccceeeeecCceEEEeecCCCcccceecccc----------------- Confidence 667788999999999999999999999999999999999889999999999999999999994 Q ss_pred ccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 81 YKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 81 ~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) .|+|||||+||++.+++++.++|. T Consensus 81 ------------km~aqpf~~pa~~~~~~~~~~~i~ 104 (108) T protein:vir:74 81 ------------FQSAQPFVKPAFNIQKKVFTNDLE 104 (108) T ss_pred ------------ccCCCcchhhHHHHHHHHHHHHHH Confidence 499999999999999999999999 No 28 >protein:vir:98409 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:YP_001210363;genbank:gi:146334932;genbank:GeneID:5114801 Probab=99.87 E-value=3.6e-26 Score=160.08 Aligned_cols=87 Identities=37% Similarity=0.589 Sum_probs=82.0 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCccccccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWS 80 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~ 80 (116) -+..++++|++++..++++|+.+||||||+|++||.++++.++++++|+++++||+||||||+ T Consensus 18 ~~~~~~~al~~~a~~i~~~ak~~apvdTG~Lr~si~~~~~~~~~~~~V~~~~~Ya~~vE~GT~----------------- 80 (108) T protein:vir:98 18 TLNDVKHVVKRNTVSMNKNMQNLAPVDTGNMKRSITSEFTDGGLTGTTIPHTDYAGYVEYGTR----------------- 80 (108) T ss_pred hHHHHHHHHHHHHHHHHHHHHHhCCCCchhhHhhceeeeecCceEEEeecCCCccceeecccc----------------- Confidence 556678899999999999999999999999999999998889999999999999999999994 Q ss_pred ccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 81 YKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 81 ~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) .|+|||||+||++.+++++.++|. T Consensus 81 ------------~m~aqPFl~pa~~~~~~~~~~~i~ 104 (108) T protein:vir:98 81 ------------FQAAQPFVKPAFDVQKKIFTNDLE 104 (108) T ss_pred ------------ccCCCcchhhHHHHHHHHHHHHHH Confidence 499999999999999999999999 No 29 >protein:vir:3617 Length: 112 # NCBI annotation: ORF40 # Family: family:all:180 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112703;genbank:gi:13786571;genbank:GeneID:921069 Probab=99.87 E-value=5e-26 Score=159.31 Aligned_cols=87 Identities=33% Similarity=0.623 Sum_probs=82.3 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCccccccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWS 80 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~ 80 (116) .++.++++|++++..|+++|+.++|+|||+|++||.++.+.++++++|+++++||+||||||. T Consensus 22 ~~~~~~~al~~~~~~i~~~ak~~aPvdTG~Lr~si~~~~~~~~~~~~V~~~~~Ya~~vE~GT~----------------- 84 (112) T protein:vir:36 22 SLKGVQQVVKSNTSNMTANMQKLVPVDTGYMKRSIKMELTEGGFSGQAGPHTDYSAYVEYGTR----------------- 84 (112) T ss_pred hHHHHHHHHHHHHHHHHHHHHHhCCCCchhhhhceeeeecCCceEEEeecCCCccceeecccc----------------- Confidence 557789999999999999999999999999999999988889999999999999999999993 Q ss_pred ccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 81 YKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 81 ~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) .|||||||+||++.+++++.++|. T Consensus 85 ------------k~~a~Pfl~pa~~~~~~~~~~~i~ 108 (112) T protein:vir:36 85 ------------FQSAQPFVKPAYNEQKGVFIKDLE 108 (112) T ss_pred ------------ccCCCcchhhhHHHHHHHHHHHHH Confidence 489999999999999999999999 No 30 >protein:vir:9930 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795692;genbank:gi:28876456;genbank:GeneID:1257995 Probab=99.87 E-value=6.6e-26 Score=158.62 Aligned_cols=86 Identities=26% Similarity=0.399 Sum_probs=80.5 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCccccccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWS 80 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~ 80 (116) +++.++++|.+++..|+++|+.++|+|||+|++||.++.. +++.+.|+++++||+|||||| T Consensus 18 ~~~~v~~al~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~~-~~~~~~v~~~~~Ya~~vE~GT------------------ 78 (108) T protein:vir:99 18 VRIAVDKELSKSAARIERQAKILAPVDTGWLRAQIYSEQQ-RLLHYRVVSPALYSIYLELGT------------------ 78 (108) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCcCchhhhcceeeeec-CcEEEEeecCcccchhcccCc------------------ Confidence 8889999999999999999999999999999999988754 567899999999999999998 Q ss_pred ccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 81 YKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 81 ~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) ++|+|||||+||++.+++++.++|. T Consensus 79 -----------~~m~a~Pf~~pa~~~~~~~~~~~i~ 103 (108) T protein:vir:99 79 -----------RKMEAQSFLDPALRKEWPVLMANIK 103 (108) T ss_pred -----------cccCCCcchhhhHHHHHHHHHHHHH Confidence 4599999999999999999999999 No 31 >protein:vir:96486 Length: 112 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238496;genbank:gi:66391772;genbank:GeneID:5176908 Probab=99.86 E-value=2.3e-25 Score=155.65 Aligned_cols=85 Identities=22% Similarity=0.251 Sum_probs=79.3 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCccccccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWS 80 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~ 80 (116) +++++++++.+.+..+++.|+.++|||||+||+||.+ +.+++++.|+++++||+|||||| T Consensus 25 v~~~v~~~~~~~~~~~~~~a~~~apvdTG~Lr~sI~~--~~~~~~~~v~~~~~Ya~~vE~GT------------------ 84 (112) T protein:vir:96 25 RSKVLRKYGAKLKEAAVSKAQFKKGYSTGATRRSITL--EAGSDRAVVEALTNYSGYLEVGT------------------ 84 (112) T ss_pred HHHHHHHHHHHHHHHHHHHhhhcCCCCchhhhhceee--ecCceEEEecCCCCccceeccCc------------------ Confidence 7888888888899999999999999999999999965 67789999999999999999999 Q ss_pred ccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 81 YKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 81 ~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) ..|+|||||+||+++++++|.++|. T Consensus 85 -----------r~m~AqPF~~PA~~~~~~~~~~~l~ 109 (112) T protein:vir:96 85 -----------RKMEAQPFMRPALDQVVPEMVEEMA 109 (112) T ss_pred -----------cccCCCCchhhhHHHHHHHHHHHHH Confidence 4599999999999999999999999 No 32 >protein:vir:95789 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950593;genbank:gi:119953788;genbank:GeneID:5076859 Probab=99.85 E-value=3.4e-25 Score=154.73 Aligned_cols=85 Identities=22% Similarity=0.333 Sum_probs=79.2 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCccccccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWS 80 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~ 80 (116) +.+.++++|.+++..++++|+.+||+|||+||+||.+ +.++++++|+++++||+|||||| T Consensus 22 ~~~~v~~al~~~a~~i~~~ak~~aPv~TG~Lr~sI~~--~~~g~~~~V~~~~~Ya~yvE~GT------------------ 81 (114) T protein:vir:95 22 AVEQSLQVLKNNGEKGKRIAKQLAPKDTEFLKDHITT--SYPGMEAHIHGEAGYDGYQEYGT------------------ 81 (114) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCcCchhhhhceee--ecCceEEEeecCCCccceeecCc------------------ Confidence 6778899999999999999999999999999999976 56788999999999999999998 Q ss_pred ccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 81 YKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 81 ~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) ++|+|||||+||++.+++++.++|. T Consensus 82 -----------~~~~aqPfl~pa~~~~~~~~~~~l~ 106 (114) T protein:vir:95 82 -----------RFQPGTPHFRPMMEQIQPQFQKDMT 106 (114) T ss_pred -----------cccCCCccchhhHHHHHHHHHHHHH Confidence 4599999999999999999999998 No 33 >protein:vir:99744 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004311;genbank:gi:122891765;genbank:GeneID:4712299 Probab=99.85 E-value=5.1e-25 Score=153.77 Aligned_cols=86 Identities=16% Similarity=0.170 Sum_probs=80.1 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhC------CcccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLM------PVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRA 74 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~a------PvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~ 74 (116) +.++++++|++++..++++|+.++ |+|||+|++||.++ ..+++++.|+++++||+|||||| T Consensus 20 ~~~~v~~av~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~SI~~~-~~g~~~~~V~~~~~Ya~~vE~GT------------ 86 (115) T protein:vir:99 20 IDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYK-KTVDLQYTITSHAAYSGFLEFGT------------ 86 (115) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcchhhhhceeee-ecCcEEEEecCCccccccccccc------------ Confidence 778889999999999999999998 99999999999886 45778999999999999999999 Q ss_pred ccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 75 KKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 75 ~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) .+|+|||||+||++.+++.+.++|. T Consensus 87 -----------------~~m~a~PFl~PA~~~~k~~~~~~l~ 111 (115) T protein:vir:99 87 -----------------RYMEAEPFMWPVYEVIRKSTVEELK 111 (115) T ss_pred -----------------cccCCCCcchhhHHHHHHHHHHHHH Confidence 4599999999999999999999999 No 34 >protein:vir:103917 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873996;genbank:gi:118430771;genbank:GeneID:4525409 Probab=99.84 E-value=9.7e-25 Score=152.24 Aligned_cols=86 Identities=16% Similarity=0.175 Sum_probs=79.8 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhC------CcccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLM------PVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRA 74 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~a------PvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~ 74 (116) +++.++++|.+++..++++|+++| |+|||+|++||.++ ..++++++|+++++||+|||||| T Consensus 20 ~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~-~~g~~~~~v~~~~~Ya~~vE~GT------------ 86 (115) T protein:vir:10 20 IDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYK-KTGDLQYTITSHAAYSGFLEFGT------------ 86 (115) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceee-ecCceEEEeecCccchhhhcccc------------ Confidence 778888999999999999999998 99999999999887 45778999999999999999999 Q ss_pred ccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 75 KKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 75 ~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) ..|+|||||+||++.+++.+.++|- T Consensus 87 -----------------~km~a~Pfl~PA~~~~~~~~~~~i~ 111 (115) T protein:vir:10 87 -----------------RYMEAEPFMWPVYEVIRKSTVEELK 111 (115) T ss_pred -----------------cccCCCCchhhhHHHHHHHHHHHHH Confidence 4599999999999999999999998 No 35 >protein:vir:78858 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285365;genbank:gi:148717893;genbank:GeneID:5246989 Probab=99.84 E-value=9.7e-25 Score=152.24 Aligned_cols=86 Identities=16% Similarity=0.175 Sum_probs=79.8 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhC------CcccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLM------PVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRA 74 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~a------PvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~ 74 (116) +++.++++|.+++..++++|+++| |+|||+|++||.++ ..++++++|+++++||+|||||| T Consensus 20 ~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~-~~g~~~~~v~~~~~Ya~~vE~GT------------ 86 (115) T protein:vir:78 20 IDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYK-KTGDLQYTITSHAAYSGFLEFGT------------ 86 (115) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceee-ecCceEEEeecCccchhhhcccc------------ Confidence 778888999999999999999998 99999999999887 45778999999999999999999 Q ss_pred ccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 75 KKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 75 ~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) ..|+|||||+||++.+++.+.++|- T Consensus 87 -----------------~km~a~Pfl~PA~~~~~~~~~~~i~ 111 (115) T protein:vir:78 87 -----------------RYMEAEPFMWPVYEVIRKSTVEELK 111 (115) T ss_pred -----------------cccCCCCchhhhHHHHHHHHHHHHH Confidence 4599999999999999999999998 No 36 >protein:vir:97144 Length: 115 # NCBI annotation: ORF047 # Family: family:all:180 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239729;genbank:gi:66394911;genbank:GeneID:5130877 Probab=99.84 E-value=9.7e-25 Score=152.24 Aligned_cols=86 Identities=16% Similarity=0.175 Sum_probs=79.8 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhC------CcccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLM------PVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRA 74 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~a------PvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~ 74 (116) +++.++++|.+++..++++|+++| |+|||+|++||.++ ..++++++|+++++||+|||||| T Consensus 20 ~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~-~~g~~~~~v~~~~~Ya~~vE~GT------------ 86 (115) T protein:vir:97 20 IDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYK-KTGDLQYTITSHAAYSGFLEFGT------------ 86 (115) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceee-ecCceEEEeecCccchhhhcccc------------ Confidence 778888999999999999999998 99999999999887 45778999999999999999999 Q ss_pred ccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 75 KKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 75 ~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) ..|+|||||+||++.+++.+.++|- T Consensus 87 -----------------~km~a~Pfl~PA~~~~~~~~~~~i~ 111 (115) T protein:vir:97 87 -----------------RYMEAEPFMWPVYEVIRKSTVEELK 111 (115) T ss_pred -----------------cccCCCCchhhhHHHHHHHHHHHHH Confidence 4599999999999999999999998 No 37 >protein:vir:96225 Length: 115 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239574;genbank:gi:66395330;genbank:GeneID:5132773 Probab=99.84 E-value=9.7e-25 Score=152.24 Aligned_cols=86 Identities=16% Similarity=0.175 Sum_probs=79.8 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhC------CcccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLM------PVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRA 74 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~a------PvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~ 74 (116) +++.++++|.+++..++++|+++| |+|||+|++||.++ ..++++++|+++++||+|||||| T Consensus 20 ~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~-~~g~~~~~v~~~~~Ya~~vE~GT------------ 86 (115) T protein:vir:96 20 IDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYK-KTGDLQYTITSHAAYSGFLEFGT------------ 86 (115) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceee-ecCceEEEeecCccchhhhcccc------------ Confidence 778888999999999999999998 99999999999887 45778999999999999999999 Q ss_pred ccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 75 KKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 75 ~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) ..|+|||||+||++.+++.+.++|- T Consensus 87 -----------------~km~a~Pfl~PA~~~~~~~~~~~i~ 111 (115) T protein:vir:96 87 -----------------RYMEAEPFMWPVYEVIRKSTVEELK 111 (115) T ss_pred -----------------cccCCCCchhhhHHHHHHHHHHHHH Confidence 4599999999999999999999998 No 38 >protein:vir:96358 Length: 115 # NCBI annotation: ORF045 # Family: family:all:180 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239651;genbank:gi:66395408;genbank:GeneID:5132834 Probab=99.84 E-value=9.7e-25 Score=152.24 Aligned_cols=86 Identities=16% Similarity=0.175 Sum_probs=79.8 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhC------CcccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLM------PVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRA 74 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~a------PvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~ 74 (116) +++.++++|.+++..++++|+++| |+|||+|++||.++ ..++++++|+++++||+|||||| T Consensus 20 ~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~-~~g~~~~~v~~~~~Ya~~vE~GT------------ 86 (115) T protein:vir:96 20 IDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYK-KTGDLQYTITSHAAYSGFLEFGT------------ 86 (115) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceee-ecCceEEEeecCccchhhhcccc------------ Confidence 778888999999999999999998 99999999999887 45778999999999999999999 Q ss_pred ccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 75 KKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 75 ~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) ..|+|||||+||++.+++.+.++|- T Consensus 87 -----------------~km~a~Pfl~PA~~~~~~~~~~~i~ 111 (115) T protein:vir:96 87 -----------------RYMEAEPFMWPVYEVIRKSTVEELK 111 (115) T ss_pred -----------------cccCCCCchhhhHHHHHHHHHHHHH Confidence 4599999999999999999999998 No 39 >protein:vir:9312 Length: 115 # NCBI annotation: phi Mu50B-like protein # Family: family:all:180 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803290;genbank:gi:29028600;genbank:GeneID:1258048 Probab=99.84 E-value=9.7e-25 Score=152.24 Aligned_cols=86 Identities=16% Similarity=0.175 Sum_probs=79.8 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhC------CcccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLM------PVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRA 74 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~a------PvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~ 74 (116) +++.++++|.+++..++++|+++| |+|||+|++||.++ ..++++++|+++++||+|||||| T Consensus 20 ~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~-~~g~~~~~v~~~~~Ya~~vE~GT------------ 86 (115) T protein:vir:93 20 IDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYK-KTGDLQYTITSHAAYSGFLEFGT------------ 86 (115) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceee-ecCceEEEeecCccchhhhcccc------------ Confidence 778888999999999999999998 99999999999887 45778999999999999999999 Q ss_pred ccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 75 KKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 75 ~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) ..|+|||||+||++.+++.+.++|- T Consensus 87 -----------------~km~a~Pfl~PA~~~~~~~~~~~i~ 111 (115) T protein:vir:93 87 -----------------RYMEAEPFMWPVYEVIRKSTVEELK 111 (115) T ss_pred -----------------cccCCCCchhhhHHHHHHHHHHHHH Confidence 4599999999999999999999998 No 40 >protein:vir:106623 Length: 115 # NCBI annotation: ORF049 # Family: family:all:180 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239497;genbank:gi:66395260;genbank:GeneID:4555777 Probab=99.84 E-value=9.7e-25 Score=152.24 Aligned_cols=86 Identities=15% Similarity=0.131 Sum_probs=79.4 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhC------CcccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLM------PVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRA 74 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~a------PvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~ 74 (116) +++.++++|.+++..++++|+++| |+|||+|++||.++ ..+++++.|+++++||+|||||| T Consensus 20 ~~~~~~~al~~~~~~i~~~a~~~a~~~~~~pv~TG~Lr~sI~~~-~~g~~~~~v~~~~~Ya~~vEfGT------------ 86 (115) T protein:vir:10 20 IEDDVDDILKNNAKEGVGIAVSNAKEVMNKGYWTGNLASLIEVK-KIGDLHYRVISTAHYSGFLEFGT------------ 86 (115) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcchhhhhceeee-ecCcEEEEeeCCCccchheeccc------------ Confidence 677889999999999999999998 88999999999876 56778999999999999999998 Q ss_pred ccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 75 KKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 75 ~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) .+|+|||||+||++.+++++.++|- T Consensus 87 -----------------~km~a~PFl~PA~~~~k~~~~~~i~ 111 (115) T protein:vir:10 87 -----------------RYMEPAPFMFPTYQTLKKSTINDLK 111 (115) T ss_pred -----------------ccCCCCCchhhhHHHHHHHHHHHHH Confidence 4599999999999999999999888 No 41 >protein:vir:94538 Length: 125 # NCBI annotation: putative head to tail joining # Family: family:all:180 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223893;genbank:gi:62327105;genbank:GeneID:5075554 Probab=99.84 E-value=8.1e-25 Score=152.66 Aligned_cols=87 Identities=28% Similarity=0.448 Sum_probs=80.1 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeE---eecCcEEEEEecCCccccccccCCcccccCCCCcCcccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMD---FKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKI 77 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~---~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~ 77 (116) +.+.++++|.+++..|+++|+.+||+|||+|++||... .+.++++++|+++++||+|||||| T Consensus 26 ~~~~v~~al~~~a~~i~~~ak~~ap~~tG~L~~sI~~~~~~~~~~~~~~~v~~~~~Ya~~vEfGT--------------- 90 (125) T protein:vir:94 26 LVPYSVEAMKTSLSRAVEKSKGLARVDTGYMRNNIQQDEVKEEHGVVTGRYVARADYSSYNEYGT--------------- 90 (125) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhCCCCChhhhhhceecceeccCCcEEEEeeCCCCccceeeccc--------------- Confidence 67788889999999999999999999999999999753 456789999999999999999998 Q ss_pred cccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 78 PWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 78 ~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) .+|+|||||+||++++++.+.+.|. T Consensus 91 --------------~~~~a~Pfl~pa~~~~~~~~~~~l~ 115 (125) T protein:vir:94 91 --------------YRMSAQPFMAPSVAAMTPFFYKAVR 115 (125) T ss_pred --------------ccCCCCcccchhHHHHHHHHHHHHH Confidence 3589999999999999999999998 No 42 >protein:vir:4906 Length: 114 # NCBI annotation: gp114 # Family: family:all:180 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056684;genbank:gi:9635019;genbank:GeneID:1262668 Probab=99.82 E-value=3.5e-24 Score=149.16 Aligned_cols=85 Identities=21% Similarity=0.270 Sum_probs=74.6 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCccccccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWS 80 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~ 80 (116) +++++++++...+..+++.|+.++|+|||+|++||.+++.+++ ++|+++++||+|||||| T Consensus 25 v~~~~~~~~~~~~~~~~~~a~~~~p~~TG~Lr~sI~~~~~~~~--~~V~~~~~Ya~~vEfGT------------------ 84 (114) T protein:vir:49 25 RSKVLRKYGSKLKEAAVNRAQFNKGYSTGATRRSITLQVESDK--ATVEALTSYSGYLEVGT------------------ 84 (114) T ss_pred HHHHHHHHHHHHHHHHHHhcccCCCCCchhhhhceeeeecCCe--eEecCCCCccceecccc------------------ Confidence 6667777777777777777777889999999999998877665 67999999999999998 Q ss_pred ccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 81 YKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 81 ~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) ..|+|||||+||++.+++.+.++|. T Consensus 85 -----------~km~a~Pfl~PA~~~~~~~~~~~l~ 109 (114) T protein:vir:49 85 -----------RKMEAQPFMKPALDEVAPKMVEELA 109 (114) T ss_pred -----------cccCCCCchhhhHHHHHHHHHHHHH Confidence 3599999999999999999999999 No 43 >protein:vir:2740 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695113;genbank:gi:23455882;genbank:GeneID:955595 Probab=99.82 E-value=3.5e-24 Score=149.16 Aligned_cols=85 Identities=21% Similarity=0.270 Sum_probs=74.6 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCccccccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWS 80 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~ 80 (116) +++++++++...+..+++.|+.++|+|||+|++||.+++.+++ ++|+++++||+|||||| T Consensus 25 v~~~~~~~~~~~~~~~~~~a~~~~p~~TG~Lr~sI~~~~~~~~--~~V~~~~~Ya~~vEfGT------------------ 84 (114) T protein:vir:27 25 RSKVLRKYGSKLKEAAVNRAQFNKGYSTGATRRSITLQVESDK--ATVEALTSYSGYLEVGT------------------ 84 (114) T ss_pred HHHHHHHHHHHHHHHHHHhcccCCCCCchhhhhceeeeecCCe--eEecCCCCccceecccc------------------ Confidence 6667777777777777777777889999999999998877665 67999999999999998 Q ss_pred ccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 81 YKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 81 ~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) ..|+|||||+||++.+++.+.++|. T Consensus 85 -----------~km~a~Pfl~PA~~~~~~~~~~~l~ 109 (114) T protein:vir:27 85 -----------RKMEAQPFMKPALDEVAPKMVEELA 109 (114) T ss_pred -----------cccCCCCchhhhHHHHHHHHHHHHH Confidence 3599999999999999999999999 No 44 >protein:vir:97088 Length: 157 # NCBI annotation: hypothetical protein # Family: family:all:2714 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453568;genbank:gi:84662603;genbank:GeneID:5142503 Probab=99.75 E-value=1.6e-21 Score=134.56 Aligned_cols=111 Identities=14% Similarity=0.126 Sum_probs=81.5 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEee----cCc-EEEEEec---CCccccccccCCcccccCCCCc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFK----DSG-FTGVINI---GSEYAIYVNYGTGIYATGAGGS 72 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~----~~~-~~~~V~~---~~~YA~~ve~GT~~~~~~~~~~ 72 (116) .++++++++.++|+.|.++|+.+||++||+|++||..... .++ .++.|+. +.+|++|+|||+..+....... T Consensus 24 ~~~v~R~A~~~ga~vv~dear~~aP~~tG~LkksI~~~~~~~~s~~g~~~~~Vg~~~~~a~~g~~vEfG~~~~~~~~~~~ 103 (157) T protein:vir:97 24 SSDVVRTMTYESAVAVRESAKAFVNDETGKLRNNLYVAYSPEESVEGIQTYAVSWRKKAAPHGHLLEFGHWQTHAAYRDK 103 (157) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhheeeeeccccCCCceEEEEEeecCCccceeeeeecCcccccccccCC Confidence 7778899999999999999999999999999999977542 123 3445665 4579999999975543322221 Q ss_pred CcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 73 RAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 73 ~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) + ..+.|.+. ..-.+.+|||||||+|||+..++++.+.|. T Consensus 104 ~-~~~~~~~~----~~~t~~~~Pa~PFlRPA~d~~k~~a~~~~~ 142 (157) T protein:vir:97 104 D-GQWYSSKV----KLVNPKWIPAKPFLRPGYDSVAMQIPDIAR 142 (157) T ss_pred c-cccccccc----ccCCCCcCCCCcccchHHHHhHHHHHHHHH Confidence 1 12233321 122357899999999999999999888876 No 45 >protein:vir:105467 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529877;genbank:gi:90592617;genbank:GeneID:3974531 Probab=99.73 E-value=4.6e-21 Score=132.09 Aligned_cols=104 Identities=18% Similarity=0.182 Sum_probs=82.5 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCccccccccccee---EeecCcEEEEEecCCccccccccCCcccccCCCCcCcccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTM---DFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKI 77 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~---~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~ 77 (116) +++.+++++++++..++++++.++|||||+||+||.. ....++++++|+++++||+||||||+....+ .. T Consensus 27 ~~~~~~~~l~~~~~~~~~~vk~~tPVdTG~Lr~S~~~~~~~~~~~~~~~~V~n~~~YA~~VE~Ghr~~~G~-------~v 99 (144) T protein:vir:10 27 VKQELGKSSRRIGTQSLRILEANTPVKQGNLRRSWTAEGPTYGCGGWTIKLINNAEYASYVESGHRQTPGR-------YV 99 (144) T ss_pred hHHHHHHHHHHHHHHHHHHHHHhCCCCcchhccceeecceeeecCeeEEEEecCCCcccccccceeecCCc-------cc Confidence 7888999999999999999999999999999999975 3456789999999999999999999643211 11 Q ss_pred cccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 78 PWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 78 ~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) ++. +........++++||.+|++..++.|.++|- T Consensus 100 ~~~-----~~~~~~g~V~G~~~~~~a~~~~~~~~~~~l~ 133 (144) T protein:vir:10 100 PVL-----KKRLVRDWVPGQFYMKKSIPQIQRQLPQLVT 133 (144) T ss_pred ccC-----CCccccceecCccchHHHHHHHHHHHHHHHH Confidence 111 1112223347789999999999998888877 No 46 >protein:vir:100075 Length: 140 # NCBI annotation: gp9 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945039;genbank:gi:38707899;genbank:GeneID:2744122 Probab=99.69 E-value=1.8e-20 Score=128.90 Aligned_cols=87 Identities=22% Similarity=0.248 Sum_probs=71.0 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEee---cCcEEEEEe------------cCCccccccccCCccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFK---DSGFTGVIN------------IGSEYAIYVNYGTGIY 65 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~---~~~~~~~V~------------~~~~YA~~ve~GT~~~ 65 (116) -+++++++|.+++..|+++|+.+||++||+|++||.+... ..+....++ ++..|+.|+|||| T Consensus 24 ~~k~~~~al~~~a~~v~~~ak~~aP~~tG~l~~sI~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~y~~f~E~GT--- 100 (140) T protein:vir:10 24 STKALRRATVAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGLATAGVRVRTKGKADSPNNAFYWRFDEFGT--- 100 (140) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHHhccccccccccccceEEeeeeeccccccCCCCccceeeeeccCC--- Confidence 2467889999999999999999999999999999976432 112222332 3457999999998 Q ss_pred ccCCCCcCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 66 ATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 66 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) .+|||||||+||++.+++.+.+.|. T Consensus 101 --------------------------~~~~a~PFl~pA~~~~~~~~~~~~~ 125 (140) T protein:vir:10 101 --------------------------QHMKAQPFMRPAFDASIGEAEGAIR 125 (140) T ss_pred --------------------------CCCCCCcchhhhHHHHHHHHHHHHH Confidence 5699999999999999999888888 No 47 >protein:vir:100243 Length: 140 # NCBI annotation: gp72 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355408;genbank:gi:77864698;genbank:GeneID:3725965 Probab=99.68 E-value=3e-20 Score=127.59 Aligned_cols=87 Identities=22% Similarity=0.243 Sum_probs=71.3 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeec-----CcEEEEEe----------cCCccccccccCCccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKD-----SGFTGVIN----------IGSEYAIYVNYGTGIY 65 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~-----~~~~~~V~----------~~~~YA~~ve~GT~~~ 65 (116) .+++++++|.+++..|+++|+.+||++||+|++||.+.... +.....++ ++..|+.|+|||| T Consensus 24 ~~k~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT--- 100 (140) T protein:vir:10 24 STKALRRATVAGANVIRDEARARAPKKTGKLKRNIVTAALKQKDSPGIATAGVRVRTKGKADSPNNAFYWRFVELGT--- 100 (140) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHHhceecccccccccceeEEeeccccccccCCCCcccccceeccCc--- Confidence 35678899999999999999999999999999999764321 12233332 3456999999998 Q ss_pred ccCCCCcCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 66 ATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 66 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) .+|||||||+||++++++++.+.|. T Consensus 101 --------------------------~~~~a~PFl~pA~~~~~~~~~~~~~ 125 (140) T protein:vir:10 101 --------------------------QFMKAEPFMRPAFDASIAQAEGAIR 125 (140) T ss_pred --------------------------CCCCCCcchhhhHHHHHHHHHHHHH Confidence 4589999999999999999999998 No 48 >protein:vir:1437 Length: 140 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536366;genbank:gi:17975171;genbank:GeneID:929147 Probab=99.67 E-value=7e-20 Score=125.59 Aligned_cols=87 Identities=22% Similarity=0.243 Sum_probs=70.5 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEee---cCcEEEEEe------------cCCccccccccCCccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFK---DSGFTGVIN------------IGSEYAIYVNYGTGIY 65 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~---~~~~~~~V~------------~~~~YA~~ve~GT~~~ 65 (116) -+++++++|.+++..++++|+.+||++||+|++||.+... .+.....|+ .+..|++|+|||| T Consensus 24 ~~~~~~~al~~~a~~v~~~ak~~aP~~tG~l~~sI~~~~~~~~~~~~~~~vg~~~~~~~~~~~~~~~~y~~f~E~GT--- 100 (140) T protein:vir:14 24 SAKALRRATLAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGLATAGVRVRTKGKADSPNNAFYWRFDEFGT--- 100 (140) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHhhcccccccccccceeEEeeeeeccccccCCCCccceeeeecccc--- Confidence 2456788999999999999999999999999999977432 122222332 3457999999998 Q ss_pred ccCCCCcCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 66 ATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 66 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) .+|||||||+||++++++++.+.|. T Consensus 101 --------------------------~~~~a~pFl~pa~~~~~~~~~~~~~ 125 (140) T protein:vir:14 101 --------------------------QHMKAQPFMRPAFDASIGEAEGAIR 125 (140) T ss_pred --------------------------CCCCCCcchhHHHHHHHHHHHHHHH Confidence 5699999999999999999888888 No 49 >protein:vir:80362 Length: 140 # NCBI annotation: gp10, phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111089;genbank:gi:134288660;genbank:GeneID:4960609 Probab=99.66 E-value=8.8e-20 Score=125.04 Aligned_cols=87 Identities=21% Similarity=0.232 Sum_probs=70.3 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeec---CcEEEEEe------------cCCccccccccCCccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKD---SGFTGVIN------------IGSEYAIYVNYGTGIY 65 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~---~~~~~~V~------------~~~~YA~~ve~GT~~~ 65 (116) -++++++++.+++..|+++|+.+||++||+|++||.+.... .+....++ ++..|+.|+|||| T Consensus 24 ~~k~~~~a~~~~a~~v~~~ak~~aP~~tG~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT--- 100 (140) T protein:vir:80 24 STKALRRATVAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGLATAGVRVRTKGKADSPSNAFYWRFDEFGT--- 100 (140) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhceeeeccccccccceeeeeeecccccccCCCCCcceeeeeccCC--- Confidence 34567889999999999999999999999999999764321 11122222 3466999999998 Q ss_pred ccCCCCcCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 66 ATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 66 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) .+|||||||+||++.+++++.+.|. T Consensus 101 --------------------------~~~~a~PFl~pA~~~~~~~~~~~~~ 125 (140) T protein:vir:80 101 --------------------------QHMKAQPFMRPAFDASIGEAEGAIR 125 (140) T ss_pred --------------------------CCCCCCcchhhhHHHHHHHHHHHHH Confidence 5599999999999999999988888 No 50 >protein:vir:194 Length: 149 # NCBI annotation: Gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037704;genbank:gi:9634169;genbank:GeneID:1262536 Probab=99.65 E-value=9.4e-20 Score=124.89 Aligned_cols=87 Identities=21% Similarity=0.285 Sum_probs=66.5 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecCc----EEEEEe--------------------cCCcccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSG----FTGVIN--------------------IGSEYAI 56 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~----~~~~V~--------------------~~~~YA~ 56 (116) .+++++++|..+|+.|+++|+.+||++||+|++||.+...... +...|+ .+..|+. T Consensus 26 ~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~si~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~y~~ 105 (149) T protein:vir:19 26 NNKVLRDATRAGAEVLKEEVIDRAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWR 105 (149) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhCCCCchhhhhhccccccccccccceeecccccccccccccccceeecCCCCccceee Confidence 3467789999999999999999999999999999976432111 111111 1234666 Q ss_pred ccccCCcccccCCCCcCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 57 YVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 57 ~ve~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) |+|||| .+|||||||+||++++++++.+.|. T Consensus 106 f~E~GT-----------------------------~~~~a~PF~~pA~~~~k~~~~~~~~ 136 (149) T protein:vir:19 106 FVELGT-----------------------------ANMPAHPFVRPAYDTREEEAASVAI 136 (149) T ss_pred eeccCC-----------------------------CCCCCCcchhHHHHHHHHHHHHHHH Confidence 666666 6799999999999999999888888 No 51 >protein:vir:93617 Length: 148 # NCBI annotation: putative structural component # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449299;genbank:gi:157166047;interpro:IPR010064;interpro:IPR011693;uniprot:Q6H9U2;genbank:GeneID:5580439 Probab=99.64 E-value=2.4e-19 Score=122.68 Aligned_cols=87 Identities=22% Similarity=0.323 Sum_probs=68.6 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEee---cCcEEEEEe--------------------cCCccccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFK---DSGFTGVIN--------------------IGSEYAIY 57 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~---~~~~~~~V~--------------------~~~~YA~~ 57 (116) .+++++++|.+++..|+++|+.+||++||.|++||.+... .+.+...|+ .+..|++| T Consensus 26 ~~~~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~g~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~y~~f 105 (148) T protein:vir:93 26 NNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRRSRDGGMESGVHIRGVNPDTGNSDNTMKADNPRNAFYWRF 105 (148) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhCCCCcchhhhhceeccccccCCceeeeeeecccccccccccceeecCCCCCcceeee Confidence 3567788999999999999999999999999999976421 111211111 23458888 Q ss_pred cccCCcccccCCCCcCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 58 VNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 58 ve~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) +|||| .+|||||||+||++++++++.+.|. T Consensus 106 ~E~GT-----------------------------~~~pa~PFl~pA~~~~k~~~~~~~~ 135 (148) T protein:vir:93 106 VEMGT-----------------------------VNMPPHPFVRPAFDVRSEQAAQVAI 135 (148) T ss_pred eccCC-----------------------------CCCCCCcchhHHHHHhHHHHHHHHH Confidence 88887 5699999999999999999888888 No 52 >protein:vir:1273 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690765;genbank:gi:22855005;genbank:GeneID:955232 Probab=99.61 E-value=6e-19 Score=120.48 Aligned_cols=87 Identities=15% Similarity=0.150 Sum_probs=74.9 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcc---cccccccceeE-e---ecCcEEEEEecC---CccccccccCCcccccCCC Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVD---TGYLRESVTMD-F---KDSGFTGVINIG---SEYAIYVNYGTGIYATGAG 70 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvd---TG~Lr~SI~~~-~---~~~~~~~~V~~~---~~YA~~ve~GT~~~~~~~~ 70 (116) +++.++++|.++|..|+++++.++|++ ||+|++||... + .++..+..|+.+ ..|++|+|||| T Consensus 23 ~~~~~~~al~~~a~~v~~~~k~~ap~~~~~tg~l~~~I~~~~~k~~~~g~~~v~Vg~~~~~~~y~~f~E~GT-------- 94 (127) T protein:vir:12 23 IEKVEPVALKAGGEIIAERQRSHVNRSDKKQPHMQDNITVSNVRESKDGVRFVAVGPNKKVAYRGRFLEWGT-------- 94 (127) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCCCChhHHHHhhhccccccccCceeEEEEeeCCCCcceeeeeccCc-------- Confidence 788899999999999999999999985 89999999652 2 234467778864 45899999999 Q ss_pred CcCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 71 GSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) ..|||||||+||++++++.+.+.|. T Consensus 95 ---------------------~~~~a~Pf~~pa~~~~~~~~~~~~~ 119 (127) T protein:vir:12 95 ---------------------SKMPPQPFIEKGGKEGEGPAVELME 119 (127) T ss_pred ---------------------cCCCCCccchHhHHHHHHHHHHHHH Confidence 4489999999999999999999999 No 53 >protein:vir:9879 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:2718 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795641;genbank:gi:28876400;genbank:GeneID:1257931 Probab=99.60 E-value=8.2e-19 Score=119.75 Aligned_cols=96 Identities=19% Similarity=0.208 Sum_probs=79.1 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHh--CCc-------ccccccccceeEeecCcEEEEEecC---CccccccccCCcccccC Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISL--MPV-------DTGYLRESVTMDFKDSGFTGVINIG---SEYAIYVNYGTGIYATG 68 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~--aPv-------dTG~Lr~SI~~~~~~~~~~~~V~~~---~~YA~~ve~GT~~~~~~ 68 (116) -.+.+++.+++-..++...|..+ +|+ |||+||+||+.++.++|+++.++.. .+||+||||||+--..+ T Consensus 15 s~~dvk~VVkkN~ael~~r~q~~~~~pv~~~~k~~dTG~lkRSi~l~~~~~g~~~~vgp~g~t~dYapyvEyGTR~m~~~ 94 (127) T protein:vir:98 15 SEKRWDRVANKNLTEMFNRAARPPGTPIGKNTKRHKSGELLRSRRLKKVNSSKDVITGNFGYIKDYAPHVEYGHRIVRNG 94 (127) T ss_pred hHHHHHHHHhhhhHHHHHHHHhccCCceeccccccCcccceeeeEEEEecCCceEEeccCcccccccceeecceeeeecc Confidence 33446777888888888888876 899 9999999999999999999999984 99999999999642100 Q ss_pred CCCcCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 69 AGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 69 ~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) +. .-.+++||||.|||+..++.|.++|- T Consensus 95 ------------------~~--~gf~~aqp~l~paf~~Qk~iF~~DL~ 122 (127) T protein:vir:98 95 ------------------KQ--VGYANGTKYLFNNVKKQREIYRQDML 122 (127) T ss_pred ------------------cc--cccccCccccccchHHHhHHHHHHHH Confidence 00 11378999999999999999999999 No 54 >protein:vir:79034 Length: 141 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110729;genbank:gi:134287346;genbank:GeneID:4955208 Probab=99.58 E-value=3.9e-18 Score=116.02 Aligned_cols=93 Identities=22% Similarity=0.295 Sum_probs=75.8 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCccccccccccee---------EeecCcEEEEEecCCccccccccCCcccccCCCC Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTM---------DFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGG 71 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~---------~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~ 71 (116) +++.+++++++.+..+++.++.++|||||+||+||.. ...+++++++|+++++||+|||+||+....+ T Consensus 27 ~~~~~~~~~~~~a~~l~~~vk~~tPVdTG~Lr~sw~~~~~~~~~~~~~~g~~~~v~v~n~~~YA~~VE~Ghr~~~~~--- 103 (141) T protein:vir:79 27 LDKFCKDAARELAARLLGKVIRRTPVDTGFLRQGWNGVAYARSLPVYKQGNNYIIEVVNPTEYASYVNFGHRTKDGK--- 103 (141) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCcchhhcccccccccccccceeecCCeeEEEEecCCcchhhhhcceeecCCc--- Confidence 8888899999999999999999999999999999853 2345668899999999999999998542111 Q ss_pred cCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 72 SRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 72 ~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) +..|++.||..|.++.++.+.+.|- T Consensus 104 --------------------gfV~G~fml~~s~~~~~~~~~~~~~ 128 (141) T protein:vir:79 104 --------------------GWVKGQHFLTISEMELQSQVDKIIE 128 (141) T ss_pred --------------------ceeCCchhHHHHHHHHHHHHHHHHH Confidence 2347777888888888877766666 No 55 >protein:vir:107568 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338191;genbank:gi:77020147;genbank:GeneID:3703699 Probab=99.57 E-value=4e-18 Score=115.97 Aligned_cols=87 Identities=21% Similarity=0.237 Sum_probs=70.7 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEe-----------------ecCcEEEEEec------CCccccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDF-----------------KDSGFTGVINI------GSEYAIY 57 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~-----------------~~~~~~~~V~~------~~~YA~~ 57 (116) ++++++++|.++|..|+++|+.++|+++|.|++++.... ..++..+.|+. +..|+.| T Consensus 26 ~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~g~~~~~vg~~~~~~~~~~y~~f 105 (146) T protein:vir:10 26 GEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAKLEGGIKTVKIGLNKADRSPWFYLKF 105 (146) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceeccccccccceeEEeeeccCCCCCcceeee Confidence 778899999999999999999999999999888764311 12233445553 3458999 Q ss_pred cccCCcccccCCCCcCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 58 VNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 58 ve~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) +|||| .+|||||||+||++++++.+.+.|. T Consensus 106 ~E~GT-----------------------------~~~~a~PFl~pa~~~~k~~~~~~~~ 135 (146) T protein:vir:10 106 HEWGT-----------------------------SKMPAHPFIEPGFNASKAEAVRAMT 135 (146) T ss_pred eccCC-----------------------------CCCCCCcchhHHHHHhHHHHHHHHH Confidence 99998 4599999999999999999998888 No 56 >protein:vir:102085 Length: 146 # NCBI annotation: head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512318;genbank:gi:89152487;genbank:GeneID:3953078 Probab=99.57 E-value=4e-18 Score=115.97 Aligned_cols=87 Identities=21% Similarity=0.237 Sum_probs=70.7 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEe-----------------ecCcEEEEEec------CCccccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDF-----------------KDSGFTGVINI------GSEYAIY 57 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~-----------------~~~~~~~~V~~------~~~YA~~ 57 (116) ++++++++|.++|..|+++|+.++|+++|.|++++.... ..++..+.|+. +..|+.| T Consensus 26 ~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~g~~~~~vg~~~~~~~~~~y~~f 105 (146) T protein:vir:10 26 GEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAKLEGGIKTVKIGLNKADRSPWFYLKF 105 (146) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceeccccccccceeEEeeeccCCCCCcceeee Confidence 778899999999999999999999999999888764311 12233445553 3458999 Q ss_pred cccCCcccccCCCCcCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 58 VNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 58 ve~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) +|||| .+|||||||+||++++++.+.+.|. T Consensus 106 ~E~GT-----------------------------~~~~a~PFl~pa~~~~k~~~~~~~~ 135 (146) T protein:vir:10 106 HEWGT-----------------------------SKMPAHPFIEPGFNASKAEAVRAMT 135 (146) T ss_pred eccCC-----------------------------CCCCCCcchhHHHHHhHHHHHHHHH Confidence 99998 4599999999999999999998888 No 57 >protein:vir:105007 Length: 146 # NCBI annotation: conserved phage protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459972;genbank:gi:85701387;genbank:GeneID:3882148 Probab=99.57 E-value=4e-18 Score=115.97 Aligned_cols=87 Identities=21% Similarity=0.237 Sum_probs=70.7 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEe-----------------ecCcEEEEEec------CCccccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDF-----------------KDSGFTGVINI------GSEYAIY 57 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~-----------------~~~~~~~~V~~------~~~YA~~ 57 (116) ++++++++|.++|..|+++|+.++|+++|.|++++.... ..++..+.|+. +..|+.| T Consensus 26 ~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~g~~~~~vg~~~~~~~~~~y~~f 105 (146) T protein:vir:10 26 GEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAKLEGGIKTVKIGLNKADRSPWFYLKF 105 (146) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceeccccccccceeEEeeeccCCCCCcceeee Confidence 778899999999999999999999999999888764311 12233445553 3458999 Q ss_pred cccCCcccccCCCCcCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 58 VNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 58 ve~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) +|||| .+|||||||+||++++++.+.+.|. T Consensus 106 ~E~GT-----------------------------~~~~a~PFl~pa~~~~k~~~~~~~~ 135 (146) T protein:vir:10 106 HEWGT-----------------------------SKMPAHPFIEPGFNASKAEAVRAMT 135 (146) T ss_pred eccCC-----------------------------CCCCCCcchhHHHHHhHHHHHHHHH Confidence 99998 4599999999999999999998888 No 58 >protein:vir:102875 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338140;genbank:gi:77020200;genbank:GeneID:3703784 Probab=99.57 E-value=4e-18 Score=115.97 Aligned_cols=87 Identities=21% Similarity=0.237 Sum_probs=70.7 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEe-----------------ecCcEEEEEec------CCccccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDF-----------------KDSGFTGVINI------GSEYAIY 57 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~-----------------~~~~~~~~V~~------~~~YA~~ 57 (116) ++++++++|.++|..|+++|+.++|+++|.|++++.... ..++..+.|+. +..|+.| T Consensus 26 ~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~g~~~~~vg~~~~~~~~~~y~~f 105 (146) T protein:vir:10 26 GEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAKLEGGIKTVKIGLNKADRSPWFYLKF 105 (146) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceeccccccccceeEEeeeccCCCCCcceeee Confidence 778899999999999999999999999999888764311 12233445553 3458999 Q ss_pred cccCCcccccCCCCcCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 58 VNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 58 ve~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) +|||| .+|||||||+||++++++.+.+.|. T Consensus 106 ~E~GT-----------------------------~~~~a~PFl~pa~~~~k~~~~~~~~ 135 (146) T protein:vir:10 106 HEWGT-----------------------------SKMPAHPFIEPGFNASKAEAVRAMT 135 (146) T ss_pred eccCC-----------------------------CCCCCCcchhHHHHHhHHHHHHHHH Confidence 99998 4599999999999999999998888 No 59 >protein:vir:1891 Length: 179 # NCBI annotation: gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037671;genbank:gi:9634129;genbank:GeneID:1262520 Probab=99.57 E-value=3.8e-18 Score=116.11 Aligned_cols=109 Identities=16% Similarity=0.140 Sum_probs=67.9 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCc-----ccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcC-- Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPV-----DTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSR-- 73 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPv-----dTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~-- 73 (116) -++++++||.++|+.|+++|+++||+ ++|.|++||.+..... ...-..+..|...++.||.++........ T Consensus 27 ~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~~~~--~~~~~g~~~~~vgv~~~~~~~~~~~~~~~~~ 104 (179) T protein:vir:18 27 RNKAGRFALRKAANIIRDRARSNASRVDDPLTKEAIHKNIVASFSSK--QFRRTGDLAFRVGVMGGARQYANTKANVRKG 104 (179) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCccccccchhhhhhheeeccccc--ccccccceeEeeecccccccccccccccccC Confidence 25678999999999999999999976 5789999996642210 00111122234445555554332111000 Q ss_pred ----------------cccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 74 ----------------AKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 74 ----------------~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) .-.++|. +.. +.|.+|||||||+||++++++++.+.|. T Consensus 105 ~~~~~~~~~g~~~~~~~~~~y~~-fvE----fGT~kmpa~PFlrPA~~~~~~~a~~~i~ 158 (179) T protein:vir:18 105 RAGKTYKTSGDKGNPGGDTWYWR-FLE----FGTEHTSARPILRPAMNGVDNDVINVFS 158 (179) T ss_pred cccccccccccccCCCCccceeE-Eec----cCCCCCCCCccchhhHHhhHHHHHHHHH Confidence 0011111 111 2368899999999999999999888888 No 60 >protein:vir:4347 Length: 164 # NCBI annotation: Orf14 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061510;genbank:gi:9635606;genbank:GeneID:1262873 Probab=99.56 E-value=4.6e-18 Score=115.64 Aligned_cols=87 Identities=14% Similarity=0.156 Sum_probs=66.9 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCc-----ccccccccceeEee------cCcEEEEEe-------------------c Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPV-----DTGYLRESVTMDFK------DSGFTGVIN-------------------I 50 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPv-----dTG~Lr~SI~~~~~------~~~~~~~V~-------------------~ 50 (116) -+++++++|.++++.|+++|+.++|+ ++|+|++||.+... .+.+...|+ . T Consensus 27 ~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~~~~~~~~~~~~~~~vg~~~~~~~~~~~~~~~~~~~~ 106 (164) T protein:vir:43 27 KRRGGRAALRKAAMIVVQAAKQGAEKVDDPGTGRSISDNIALRWNGRLFKRTGDLGFRIGVLHGAVLPKKGERSDKTANA 106 (164) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCcccCCCccchhhhhhhhhcccCccccccceeEEecccccccccccccccccCCCC Confidence 24678899999999999999999997 67999999966421 112222222 1 Q ss_pred CCccccccccCCcccccCCCCcCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 51 GSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 51 ~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) +..|++|+|||| .+|||||||+||++++++++.+.|. T Consensus 107 ~~~y~~f~EfGT-----------------------------~km~a~PFlrPA~~~~k~~~~~~~~ 143 (164) T protein:vir:43 107 PTPHWRLLEFGT-----------------------------EDMRAQPFMRSALADNIAEVTSTFV 143 (164) T ss_pred CcceEEEeecCC-----------------------------CCCCCCcchhhhHHHhHHHHHHHHH Confidence 235777777777 5799999999999999999887777 No 61 >protein:vir:105089 Length: 133 # NCBI annotation: Gp11 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006591;genbank:gi:46402097;genbank:GeneID:2777955 Probab=99.55 E-value=7.4e-18 Score=114.49 Aligned_cols=87 Identities=15% Similarity=0.208 Sum_probs=69.1 Q ss_pred Ch-HHHHHHHHHHHHHHHHHHHHhCCccccc----ccccceeEe--ecC----cEEEEEecCC---ccccccccCCcccc Q lcl|NC_021326. 1 ME-RWVKRGIAKTTAKIHNTIISLMPVDTGY----LRESVTMDF--KDS----GFTGVINIGS---EYAIYVNYGTGIYA 66 (116) Q Consensus 1 i~-~~~~~~~~~~a~~v~~~ak~~aPvdTG~----Lr~SI~~~~--~~~----~~~~~V~~~~---~YA~~ve~GT~~~~ 66 (116) ++ ++++++|.++|..|+++|+.+||+++|. |++||.+.. ..+ .+...|+.+. .|+.|+|||| T Consensus 23 ~~~k~~~~Al~~~a~~i~~~ak~~ap~~~~~~~~~~~~~I~v~~~~~~~~~~~~~~v~vg~~~~~~~y~~f~E~GT---- 98 (133) T protein:vir:10 23 VATKVLRDAGREALKVVEEDMKQHAGFDETSTGQHMRDSIKIRSSTRKAQGNAVVTLRVGPSKQHHMKVLAQEFGT---- 98 (133) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhCCCCCCcchhhhhhcccccccccccCccceEEEEecCCCCccceEeeeccCC---- Confidence 54 4668999999999999999999999987 788886532 211 2345565442 4889999998 Q ss_pred cCCCCcCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 67 TGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 67 ~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) ..|||||||+||++++++++.+.|. T Consensus 99 -------------------------~k~~a~PF~~pA~~~~~~~~~~~~~ 123 (133) T protein:vir:10 99 -------------------------VKQVADPFIRPALDYNVQTVLRVLT 123 (133) T ss_pred -------------------------CCCCCCccchHHHHHhHHHHHHHHH Confidence 4589999999999999998888888 No 62 >protein:vir:5745 Length: 135 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892056;genbank:gi:33770519;interpro:IPR010064;interpro:IPR011693;uniprot:Q7Y404;genbank:GeneID:2637451 Probab=99.53 E-value=1.2e-17 Score=113.35 Aligned_cols=87 Identities=16% Similarity=0.209 Sum_probs=69.3 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCccc----ccccccceeEe-e--c--CcEEEEEecCCc---cccccccCCcccccC Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDT----GYLRESVTMDF-K--D--SGFTGVINIGSE---YAIYVNYGTGIYATG 68 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdT----G~Lr~SI~~~~-~--~--~~~~~~V~~~~~---YA~~ve~GT~~~~~~ 68 (116) -+++++++|.+++..|+++|+.++|+++ |+|++||.+.. + . ..+...|+.+.. |++|+|||| T Consensus 25 ~~k~~~~Al~~~a~~v~~~~k~~ap~~~~~~~g~l~~~I~i~~~k~~~~~~~v~v~vg~~~~~~~~~~f~E~GT------ 98 (135) T protein:vir:57 25 GTKILRDAGRAAMAVVEADMKQNAGYDNSSTNAHMRDSIKIRSSRGKAGSTVVVLRVGPTRSHYMKALAQEFGT------ 98 (135) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhHHhhcccccccccccceeEEEEecCCCCcceeEeecccCC------ Confidence 2456688999999999999999999975 99999997642 1 1 123455665443 488889998 Q ss_pred CCCcCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 69 AGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 69 ~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) ..|||||||+||++++++.+.+.|. T Consensus 99 -----------------------~~~~a~PF~~pa~~~~~~~~~~~~~ 123 (135) T protein:vir:57 99 -----------------------IKQVAKPFIRPALDYNKMQVLRILT 123 (135) T ss_pred -----------------------CCCCCCcchhHhHHHhHHHHHHHHH Confidence 4589999999999999999888888 No 63 >protein:vir:3873 Length: 128 # NCBI annotation: putative head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680490;swissprot:trembl:p94214;genbank:gi:22296530;interpro:IPR010064;uniprot:P94214;genbank:GeneID:951688 Probab=99.51 E-value=3.4e-17 Score=110.90 Aligned_cols=87 Identities=20% Similarity=0.188 Sum_probs=72.5 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccc------cccccceeEe---ecCcEEEEEecC---CccccccccCCcccccC Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTG------YLRESVTMDF---KDSGFTGVINIG---SEYAIYVNYGTGIYATG 68 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG------~Lr~SI~~~~---~~~~~~~~V~~~---~~YA~~ve~GT~~~~~~ 68 (116) ++++++++|.++|..+++.++.++|+++| +|+++|.+.- .++..+..|+.+ .-|++|+|||| T Consensus 22 ~~k~~~~al~~ga~~~~~~~k~~ap~~~~~~~~~~h~~d~I~~~~~k~~~g~~~~~VG~~k~~~~y~~f~E~GT------ 95 (128) T protein:vir:38 22 VAKEARAAVRDGAQKFADKLKSNTPEWDGETDMSGHLRDDIKLSSVRETSGLTEVDVGYGKDTGWRAHFPNSGT------ 95 (128) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCcCCCCcccchhhhhhccccccccCceeEEEeeecCCCceEEeeeccCc------ Confidence 78889999999999999999999999765 5777775531 233456778764 45999999998 Q ss_pred CCCcCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 69 AGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 69 ~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) ..|||||||+||++++++.+.+.|. T Consensus 96 -----------------------~k~~a~pF~~pa~~~~~~~~~~~~~ 120 (128) T protein:vir:38 96 -----------------------SMQDPQHFIEETQEIMRPVVIAAFL 120 (128) T ss_pred -----------------------cCCCCCcchhHHHHHhHHHHHHHHH Confidence 4489999999999999999999998 No 64 >protein:vir:1386 Length: 149 # NCBI annotation: Gp9 protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612838;genbank:gi:20065972;genbank:GeneID:935787 Probab=99.49 E-value=5.8e-17 Score=109.62 Aligned_cols=87 Identities=11% Similarity=0.138 Sum_probs=71.4 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcc-------------cccccccceeE-e--ecCcEEEEEec------CCcccccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVD-------------TGYLRESVTMD-F--KDSGFTGVINI------GSEYAIYV 58 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvd-------------TG~Lr~SI~~~-~--~~~~~~~~V~~------~~~YA~~v 58 (116) ++++++++|.+++..|+++++.++|+. +|+++++|.+. + ..+.....|+. +..|++|+ T Consensus 28 ~~k~~~~Al~~ga~~v~~~~k~~aP~~~~~~~~~~~~~~~~~~~~d~i~~~~~~~~~g~~~~~VG~~~~~~~~~~y~~f~ 107 (149) T protein:vir:13 28 NEDVEKSILKECGDLAKKTVAPLIHISDDNSKSGRKGSRPPGHAANNIPEPKIRKKKGNLQCVVGWEKSDNTPFYYMKME 107 (149) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCccCCccccccccccccchhhhcceecccccccceeEEEeeccCCCCCccceeeee Confidence 677888999999999999999999974 56899998662 2 23334567764 34699999 Q ss_pred ccCCcccccCCCCcCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 59 NYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 59 e~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) |||| ..|||||||+||++++++++.+.|. T Consensus 108 E~GT-----------------------------~k~~a~pF~~pa~~~~~~~~~~~~~ 136 (149) T protein:vir:13 108 EWGT-----------------------------SERPPHHAFGKTNKILKRVYDNIAQ 136 (149) T ss_pred ccCc-----------------------------cCCCCCccchHHHHHHHHHHHHHHH Confidence 9998 4599999999999999998888777 No 65 >protein:vir:99528 Length: 92 # NCBI annotation: putative major tail protein # Family: family:all:180 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958541;genbank:gi:41179323;genbank:GeneID:2717166 Probab=99.49 E-value=3e-17 Score=111.17 Aligned_cols=67 Identities=33% Similarity=0.508 Sum_probs=58.8 Q ss_pred Ch-----HHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecCcEEEEEe---cCCccccccccCCcccccCCCCc Q lcl|NC_021326. 1 ME-----RWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVIN---IGSEYAIYVNYGTGIYATGAGGS 72 (116) Q Consensus 1 i~-----~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~---~~~~YA~~ve~GT~~~~~~~~~~ 72 (116) |+ ..+++.|.+.+..++.+|+++||+|||+||+||.+++.++++++.|. +.++||+||||||+ T Consensus 18 L~~~~~~~~v~~vv~~~~~~l~~~ak~~ap~dTG~lrrSI~~~~~~~g~~~~v~~~gp~a~Ya~YvE~GTR--------- 88 (92) T protein:vir:99 18 LANQQNMNTVKKVVKKHTANLMTATQQAVPVDTGHLKQSAQIQISRDGFTGSVTYGGGLVNYAAYVEFGTR--------- 88 (92) T ss_pred HHhhccHHHHHHHHHHHHHHHHHHHHHhCCCCccccceeeeEEeecCCeeEEEEeccCcccccccccccee--------- Confidence 22 34788999999999999999999999999999999999999999884 67999999999994 Q ss_pred CcccccccccccccceeccCCCCC Q lcl|NC_021326. 73 RAKKIPWSYKDANGKWHTTKGQHA 96 (116) Q Consensus 73 ~~~~~~~~~~~~~~~~~~~~g~~a 96 (116) .|+| T Consensus 89 --------------------~M~A 92 (92) T protein:vir:99 89 --------------------FMDS 92 (92) T ss_pred --------------------ecCC Confidence 3565 No 66 >protein:vir:9708 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795470;genbank:gi:28876221;genbank:GeneID:1257765 Probab=99.48 E-value=7e-17 Score=109.15 Aligned_cols=87 Identities=20% Similarity=0.183 Sum_probs=73.2 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCccccc----ccccceeEe---ec-CcEEEEEecC---CccccccccCCcccccCC Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGY----LRESVTMDF---KD-SGFTGVINIG---SEYAIYVNYGTGIYATGA 69 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~----Lr~SI~~~~---~~-~~~~~~V~~~---~~YA~~ve~GT~~~~~~~ 69 (116) ++++.+++|.++|..+++.++.++|+++|. |++||.+.. .. +....+|+.+ ..|+.|+|||| T Consensus 19 ~~~~~~~al~~ga~~~~~~~k~~ap~~~~~~~~hl~d~I~~~~~k~~~~g~~~~~VG~~k~~~~y~~f~E~GT------- 91 (125) T protein:vir:97 19 APKTAKAAVTEVAKEFEKALKANTPVYEVETDERLQEDTVISGFKGANVGIVSKEIGYGKATGWRAHYPNDGT------- 91 (125) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCcCCCCchhhHHhhhhcccccccccCceEEEEeecCCCceeEeeeccCc------- Confidence 778899999999999999999999999877 999997632 22 2245677754 46999999998 Q ss_pred CCcCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 70 GGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) ..|||||||+||++++++.+.+.|. T Consensus 92 ----------------------~k~~~~pF~~pa~~~~k~~~~~~~~ 116 (125) T protein:vir:97 92 ----------------------IYQRGQDFKERTINQMTPKAKQLYA 116 (125) T ss_pred ----------------------cCCCcCccchHhHHHhHHHHHHHHH Confidence 4599999999999999999888888 No 67 >protein:vir:81147 Length: 126 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285816;genbank:gi:148747737;genbank:GeneID:5247190 Probab=99.47 E-value=1.1e-16 Score=108.00 Aligned_cols=92 Identities=18% Similarity=0.241 Sum_probs=75.0 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEee--cCcEEEEEecCCcc--ccccccCCcccccCCCCcCccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFK--DSGFTGVINIGSEY--AIYVNYGTGIYATGAGGSRAKK 76 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~--~~~~~~~V~~~~~Y--A~~ve~GT~~~~~~~~~~~~~~ 76 (116) +.+.+++++++++..+.+++|+++|++||.|++||++... .++...+|+++..| ++++||||... . T Consensus 24 v~~~v~~~v~~~a~~~~~~ik~~aP~rTG~y~ksw~vk~~~~~g~~~~vv~~~~~~~l~HLLEfGha~r---~------- 93 (126) T protein:vir:81 24 VAEGVRKKVDETARKVLKEAQALAPKRTGEYARTFTITKEDGYGTTKRIIWNKKHYRRVHLLEFGHAKV---N------- 93 (126) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhCCcccchhhccccccccccCCcceEEEeccCCCCceeeeecceecC---C------- Confidence 8888999999999999999999999999999999976532 23345566666666 88999998521 0 Q ss_pred ccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 77 IPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 77 ~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) | ..++|+|||+||++...+++.++|. T Consensus 94 ---------g-----GrV~a~Phi~Pa~e~~~~~~~~~i~ 119 (126) T protein:vir:81 94 ---------G-----GRVKEYPHLRPAYDKHGARLPDELK 119 (126) T ss_pred ---------C-----CccCCCcchHHHHHHHHHHHHHHHH Confidence 1 1179999999999999999999888 No 68 >protein:vir:81106 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429878;genbank:gi:156603931;genbank:GeneID:5525326 Probab=99.42 E-value=3.7e-16 Score=105.18 Aligned_cols=87 Identities=16% Similarity=0.075 Sum_probs=72.5 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCccccc--ccccceeEe-ec----CcEEEEEecCCc---cccccccCCcccccCCC Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGY--LRESVTMDF-KD----SGFTGVINIGSE---YAIYVNYGTGIYATGAG 70 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~--Lr~SI~~~~-~~----~~~~~~V~~~~~---YA~~ve~GT~~~~~~~~ 70 (116) .++..++++.+++..+++.++.++|+++|. |++||.++- +. +.....|+.+.+ |++|+|||| T Consensus 21 ~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~~k~~~~~a~F~E~GT-------- 92 (125) T protein:vir:81 21 MNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGYAKGVSHRIHATEFGT-------- 92 (125) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEeccCCCCceEEEeccCCc-------- Confidence 667778899999999999999999997766 999997642 21 234566777654 899999999 Q ss_pred CcCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 71 GSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) ..|||||||+||++++++++.+.|. T Consensus 93 ---------------------~k~~a~pF~~~a~~~~~~ev~~~~~ 117 (125) T protein:vir:81 93 ---------------------MYQKPQLFITKTEKQGKNKVLKTML 117 (125) T ss_pred ---------------------cCCCCCchhhHHHHHhHHHHHHHHH Confidence 4589999999999999999999988 No 69 >protein:vir:4704 Length: 125 # NCBI annotation: phi PVL ORF 11 homologue # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061636;genbank:gi:9635723;genbank:GeneID:1262995 Probab=99.42 E-value=3.7e-16 Score=105.18 Aligned_cols=87 Identities=16% Similarity=0.075 Sum_probs=72.5 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCccccc--ccccceeEe-ec----CcEEEEEecCCc---cccccccCCcccccCCC Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGY--LRESVTMDF-KD----SGFTGVINIGSE---YAIYVNYGTGIYATGAG 70 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~--Lr~SI~~~~-~~----~~~~~~V~~~~~---YA~~ve~GT~~~~~~~~ 70 (116) .++..++++.+++..+++.++.++|+++|. |++||.++- +. +.....|+.+.+ |++|+|||| T Consensus 21 ~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~~k~~~~~a~F~E~GT-------- 92 (125) T protein:vir:47 21 MNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGYAKGVSHRIHATEFGT-------- 92 (125) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEeccCCCCceEEEeccCCc-------- Confidence 667778899999999999999999997766 999997642 21 234566777654 899999999 Q ss_pred CcCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 71 GSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) ..|||||||+||++++++++.+.|. T Consensus 93 ---------------------~k~~a~pF~~~a~~~~~~ev~~~~~ 117 (125) T protein:vir:47 93 ---------------------MYQKPQLFITKTEKQGKNKVLKTML 117 (125) T ss_pred ---------------------cCCCCCchhhHHHHHhHHHHHHHHH Confidence 4589999999999999999999988 No 70 >protein:vir:79988 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430006;genbank:gi:156604061;genbank:GeneID:5525448 Probab=99.42 E-value=3.7e-16 Score=105.18 Aligned_cols=87 Identities=16% Similarity=0.075 Sum_probs=72.5 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCccccc--ccccceeEe-ec----CcEEEEEecCCc---cccccccCCcccccCCC Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGY--LRESVTMDF-KD----SGFTGVINIGSE---YAIYVNYGTGIYATGAG 70 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~--Lr~SI~~~~-~~----~~~~~~V~~~~~---YA~~ve~GT~~~~~~~~ 70 (116) .++..++++.+++..+++.++.++|+++|. |++||.++- +. +.....|+.+.+ |++|+|||| T Consensus 21 ~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~~k~~~~~a~F~E~GT-------- 92 (125) T protein:vir:79 21 MNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGYAKGVSHRIHATEFGT-------- 92 (125) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEeccCCCCceEEEeccCCc-------- Confidence 667778899999999999999999997766 999997642 21 234566777654 899999999 Q ss_pred CcCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 71 GSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) ..|||||||+||++++++++.+.|. T Consensus 93 ---------------------~k~~a~pF~~~a~~~~~~ev~~~~~ 117 (125) T protein:vir:79 93 ---------------------MYQKPQLFITKTEKQGKNKVLKTML 117 (125) T ss_pred ---------------------cCCCCCchhhHHHHHhHHHHHHHHH Confidence 4589999999999999999999988 No 71 >protein:vir:9414 Length: 125 # NCBI annotation: phi PVL orf 11-like protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803392;genbank:gi:29028704;genbank:GeneID:1258141 Probab=99.42 E-value=3.7e-16 Score=105.18 Aligned_cols=87 Identities=16% Similarity=0.075 Sum_probs=72.5 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCccccc--ccccceeEe-ec----CcEEEEEecCCc---cccccccCCcccccCCC Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGY--LRESVTMDF-KD----SGFTGVINIGSE---YAIYVNYGTGIYATGAG 70 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~--Lr~SI~~~~-~~----~~~~~~V~~~~~---YA~~ve~GT~~~~~~~~ 70 (116) .++..++++.+++..+++.++.++|+++|. |++||.++- +. +.....|+.+.+ |++|+|||| T Consensus 21 ~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~~k~~~~~a~F~E~GT-------- 92 (125) T protein:vir:94 21 MNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGYAKGVSHRIHATEFGT-------- 92 (125) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEeccCCCCceEEEeccCCc-------- Confidence 667778899999999999999999997766 999997642 21 234566777654 899999999 Q ss_pred CcCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 71 GSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) ..|||||||+||++++++++.+.|. T Consensus 93 ---------------------~k~~a~pF~~~a~~~~~~ev~~~~~ 117 (125) T protein:vir:94 93 ---------------------MYQKPQLFITKTEKQGKNKVLKTML 117 (125) T ss_pred ---------------------cCCCCCchhhHHHHHhHHHHHHHHH Confidence 4589999999999999999999988 No 72 >protein:vir:98342 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918934;genbank:gi:119443696;genbank:GeneID:4594504 Probab=99.42 E-value=3.7e-16 Score=105.18 Aligned_cols=87 Identities=16% Similarity=0.075 Sum_probs=72.5 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCccccc--ccccceeEe-ec----CcEEEEEecCCc---cccccccCCcccccCCC Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGY--LRESVTMDF-KD----SGFTGVINIGSE---YAIYVNYGTGIYATGAG 70 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~--Lr~SI~~~~-~~----~~~~~~V~~~~~---YA~~ve~GT~~~~~~~~ 70 (116) .++..++++.+++..+++.++.++|+++|. |++||.++- +. +.....|+.+.+ |++|+|||| T Consensus 21 ~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~~k~~~~~a~F~E~GT-------- 92 (125) T protein:vir:98 21 MNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGYAKGVSHRIHATEFGT-------- 92 (125) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEeccCCCCceEEEeccCCc-------- Confidence 667778899999999999999999997766 999997642 21 234566777654 899999999 Q ss_pred CcCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 71 GSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) ..|||||||+||++++++++.+.|. T Consensus 93 ---------------------~k~~a~pF~~~a~~~~~~ev~~~~~ 117 (125) T protein:vir:98 93 ---------------------MYQKPQLFITKTEKQGKNKVLKTML 117 (125) T ss_pred ---------------------cCCCCCchhhHHHHHhHHHHHHHHH Confidence 4589999999999999999999988 No 73 >protein:vir:102338 Length: 116 # NCBI annotation: hypothetical protein # Family: family:all:26573 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529563;genbank:gi:90592648;genbank:GeneID:3974470 Probab=99.35 E-value=2.7e-15 Score=100.42 Aligned_cols=104 Identities=17% Similarity=0.215 Sum_probs=80.6 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCc---cccccccccee-EeecCcEEEEEecCCccccccccCCcccccCCCCcCccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPV---DTGYLRESVTM-DFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKK 76 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPv---dTG~Lr~SI~~-~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~ 76 (116) |++.+++++++.|.++.+.++.++|| |||+||+||.. ++.. ...+|+++++||+|||||++....++. T Consensus 1 l~~~~~~~~~~~a~~l~~~vk~rTPv~~~d~G~LR~sW~~g~v~k--~~~~v~N~~eYA~~VE~GHRq~~g~g~------ 72 (116) T protein:vir:10 1 MSKNLRRAKNNIGNKLLRKVKPKTPVAKIDGGTARKSWKYKELNL--FDGVVSNNVEYIHHLEYGHRTRQGTGT------ 72 (116) T ss_pred CchHHHHHHHHHHHHHHHHHHhhCCCCcCCCcccccCceeeeeec--cCceeecCCcccccccCCceeeCCcce------ Confidence 99999999999999999999999998 67999999977 3333 235699999999999999876443321 Q ss_pred ccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 77 IPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 77 ~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) +....+++...+.-+.+-||..++.+-+.++-++|- T Consensus 73 ----~~~~~gkrlk~~~V~G~fml~~s~~e~~~~~~~~~~ 108 (116) T protein:vir:10 73 ----SENYRPKPNGISFVPGVFMLARSVDEMSSIIDDELN 108 (116) T ss_pred ----ecccccccccCCccCceehHHHHHHHHHHHHHHHHH Confidence 112233444445568888999999998877666665 No 74 >protein:vir:102963 Length: 163 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945289;genbank:gi:39653724;uniprot:Q708M3;genbank:GeneID:2672877 Probab=99.34 E-value=3.7e-15 Score=99.71 Aligned_cols=92 Identities=15% Similarity=0.261 Sum_probs=78.3 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCc---------------------------cccccccccee---EeecCcEEEEEec Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPV---------------------------DTGYLRESVTM---DFKDSGFTGVINI 50 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPv---------------------------dTG~Lr~SI~~---~~~~~~~~~~V~~ 50 (116) |++.+++.+.+.|..+.+.++.++|| +||+||+||.. ....++.+.+|++ T Consensus 26 ~~~~~~~~~~e~a~~ll~~vk~rtPv~~~~~~~~~~~~~~~k~~k~~~~~~~k~tG~lr~swk~~~~~k~~~~~~v~v~N 105 (163) T protein:vir:10 26 VDRFMRQTLNYEGTELKSKVKERTPVGVYTDHWVEFTTKDGKHVKFWASAHGKQGGTLQKGWSKSRIEVSGRTYKQKVYN 105 (163) T ss_pred hHHHHHHHHHHHHHHHHHHHHHhCCcccchhhhhhhhhcccchhhhhccccccccchhhccceecceeecCCceEEEEEe Confidence 77889999999999999999999997 89999999976 3356678899999 Q ss_pred CCccccccccCCcccccCCCCcCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 51 GSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 51 ~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) +++||+|||+|++... |. ..|++++|..|.++.++++.++|- T Consensus 106 ~~~YA~~VE~GHR~~~-------------------gG-----fV~G~fml~~s~~~~~~~~~~~~e 147 (163) T protein:vir:10 106 KVYYAPHVEYGHKTVN-------------------GG-----FVPGQFFLHKTVEDTKSDMEKRVR 147 (163) T ss_pred cCCccchhhcceeecC-------------------Cc-----eeccchhhHHHHHHHHHHHHHHHH Confidence 9999999999986532 11 137888999999999998888887 No 75 >protein:vir:102154 Length: 119 # NCBI annotation: phage protein, HK97 gp10 family # Family: family:all:10671 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699937;genbank:gi:110804042;genbank:GeneID:4206698 Probab=99.30 E-value=1.8e-15 Score=101.38 Aligned_cols=85 Identities=26% Similarity=0.388 Sum_probs=73.3 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecCcEEEEEecCC---ccccccccCCcccccCCCCcCcccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGS---EYAIYVNYGTGIYATGAGGSRAKKI 77 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~---~YA~~ve~GT~~~~~~~~~~~~~~~ 77 (116) .++.-++||.++++.|.+++..++|++||+|+. |...++.+| .+.|+.+. -|+.|.|||| T Consensus 23 ~~~ie~kAlk~g~e~I~~~~~~n~P~~tg~lkk-ik~~~kk~g-~~~VG~~ks~~fy~kF~EFGT--------------- 85 (119) T protein:vir:10 23 DESTKRKGIKAGITKIGKAIEKNSPIKSGRLSK-VKIRVKNTG-LATEGTASSSEFYDIFQNFGT--------------- 85 (119) T ss_pred hHHHHHHHHHHHhHHHHHHHhhcCCcccCCcce-eeeeeecCc-eeEeccCCcchhhhhhccccc--------------- Confidence 666677899999999999999999999999998 656677777 35666544 5999999999 Q ss_pred cccccccccceeccCCCCCC-cchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 78 PWSYKDANGKWHTTKGQHAQ-PFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 78 ~~~~~~~~~~~~~~~g~~a~-PFl~pA~~~~k~~i~~~i~ 116 (116) ..|||| |||.||++.++..+++.|+ T Consensus 86 --------------Skm~a~~pF~~~a~~~~~~eA~~~~~ 111 (119) T protein:vir:10 86 --------------SEQKAHVGYFDRAVDETTNEAVEEVA 111 (119) T ss_pred --------------cccCCCCCccccccccChHHHHHHHH Confidence 448999 9999999999999999998 No 76 >protein:vir:966 Length: 123 # NCBI annotation: Orf48 # Family: family:all:970 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076620;genbank:gi:13095728;genbank:GeneID:920248 Probab=99.27 E-value=1.7e-14 Score=96.04 Aligned_cols=92 Identities=11% Similarity=0.020 Sum_probs=76.1 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecCcEEEEEecCCcc--ccccccCCcccccCCCCcCccccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEY--AIYVNYGTGIYATGAGGSRAKKIP 78 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~Y--A~~ve~GT~~~~~~~~~~~~~~~~ 78 (116) +.+.+++++++++..+..+.++.+|++||.+++||.+....++...+++++..| ++.+|||+... . T Consensus 25 v~~~v~~~v~~~a~~~~~~lk~~sP~~TG~yaksW~~k~~~~~~~~v~~~~~~y~l~HLLE~GHa~r---~--------- 92 (123) T protein:vir:96 25 VVDDIDDIKKDITKNGVKQLRESSPKRTGDYAKNWTSQKLKNGDQVIYQKAPTYRLTHLLENGHAKR---N--------- 92 (123) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhCCccccccccceeeeecCCeeEEEEEecCCcceEEeeecceeec---C--------- Confidence 778888899999999999999999999999999998887767777778877776 79999997532 1 Q ss_pred ccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 79 WSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 79 ~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) |.+ .+|+|||.||.+...+.|.++|- T Consensus 93 -------GGr-----V~a~phI~paee~~~~~l~~~i~ 118 (123) T protein:vir:96 93 -------GGR-----VSPKVHIAPVEEELVSNYISRVE 118 (123) T ss_pred -------Cce-----eCcchhhhHHHHHHHHHHHHHHH Confidence 111 38999999999998887766666 No 77 >protein:vir:10367 Length: 119 # NCBI annotation: conserved phage protein # Family: family:all:2714 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858959;genbank:gi:32128424;genbank:GeneID:2648366 Probab=99.10 E-value=6.3e-14 Score=92.96 Aligned_cols=96 Identities=16% Similarity=0.201 Sum_probs=63.9 Q ss_pred HHHHHHHhCCcccccccccceeEee----cC-cEEEEEecC---CccccccccCCcccccCCCCcCcccccccccccccc Q lcl|NC_021326. 16 IHNTIISLMPVDTGYLRESVTMDFK----DS-GFTGVINIG---SEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGK 87 (116) Q Consensus 16 v~~~ak~~aPvdTG~Lr~SI~~~~~----~~-~~~~~V~~~---~~YA~~ve~GT~~~~~~~~~~~~~~~~~~~~~~~~~ 87 (116) |.++|+..+|++||+|++||..-+. .+ ..++.|+.+ ++|++++|||. .-........-+.|.... .+ T Consensus 1 ~rDeakarv~~~~G~Lr~sIY~ay~~~~S~dG~~~Y~Vswn~rkAPhghlvE~Gh---w~~~~~~~~~dG~w~~~~--~~ 75 (119) T protein:vir:10 1 MRESAKAFVNDETGKLRSNLYVAYSTEESTNGVQTYAVSWRKKAAPHGHLLEFGH---WQTHAAYKGKDGEWYSSS--VK 75 (119) T ss_pred CCcccccccCCCccchhhhheeeeccccCCCCEEEEEeecCCCcCCcccccccce---eeeeeeeeccCceeeecC--cc Confidence 9999999999999999999965322 22 356777654 67999999994 111111111112222111 12 Q ss_pred eeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 88 WHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 88 ~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) ...+..+||+|||+|||+....++...|- T Consensus 76 l~~~~~vPa~pFlRpA~da~~~~a~~~~~ 104 (119) T protein:vir:10 76 LVNPKWIPARPFLRPGYDSVAMQIPDIAK 104 (119) T ss_pred ccCceecCCCCccchhHHHHHHHHHHHHH Confidence 23467899999999999988877666665 No 78 >protein:vir:81067 Length: 119 # NCBI annotation: p12 # Family: family:all:2714 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285682;genbank:gi:156535145;genbank:GeneID:5247112 Probab=99.06 E-value=1.1e-13 Score=91.54 Aligned_cols=96 Identities=16% Similarity=0.201 Sum_probs=64.2 Q ss_pred HHHHHHHhCCcccccccccceeEee----cC-cEEEEEecC---CccccccccCCcccccCCCCcCcccccccccccccc Q lcl|NC_021326. 16 IHNTIISLMPVDTGYLRESVTMDFK----DS-GFTGVINIG---SEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGK 87 (116) Q Consensus 16 v~~~ak~~aPvdTG~Lr~SI~~~~~----~~-~~~~~V~~~---~~YA~~ve~GT~~~~~~~~~~~~~~~~~~~~~~~~~ 87 (116) |.++|+..+|++||+|++||..-+. .+ ..++.|+.+ ++|++++|||. .-.........+.|.... .+ T Consensus 1 ~rDeakarv~~~~G~Lr~sIY~ay~~~~S~dG~~~Y~Vswn~rkAPhghlvE~Gh---w~~~~~~~~~dG~w~~~~--~~ 75 (119) T protein:vir:81 1 MRESAKAFVNDETGKLRSNLYVAYSPEESTNGVQTYAVSWRKKAAPHGHLLEFGH---WQTHAAYKGKDGEWYSSS--VK 75 (119) T ss_pred CCcccccccCCCccchhhhheeeeccccCCCCeEEEEeeccCCcCCcccccccce---eeeeeeeeccCceeeecC--cc Confidence 9999999999999999999965332 22 356777654 67999999994 111111111222232111 12 Q ss_pred eeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 88 WHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 88 ~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) ...+..+||+|||+|||+....++...|- T Consensus 76 l~~~~~vPa~pFlRpA~da~~~~a~~~~~ 104 (119) T protein:vir:81 76 LVNPKWIPARPFLRPGYDSVAMQIPDIAK 104 (119) T ss_pred ccCceecCCCCccchhHHHHHHHHHHHHH Confidence 33467899999999999988877666665 No 79 >protein:vir:94994 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224022;genbank:gi:62327309;genbank:GeneID:5176822 Probab=98.89 E-value=6.7e-12 Score=81.84 Aligned_cols=87 Identities=23% Similarity=0.229 Sum_probs=74.1 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEee----------------------------cCcEEEEEecCC Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFK----------------------------DSGFTGVINIGS 52 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~----------------------------~~~~~~~V~~~~ 52 (116) +++.++..+++++.++...+...+|||||.||.||.+.+. ..+-+..|.+++ T Consensus 14 ve~~~~~~~r~~a~~~~~~iv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~g~~t~~~~~~~i~~~~~g~~iyi~Nn~ 93 (131) T protein:vir:94 14 AKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGSTPADGTTDATDKSGNTATGNATSFVLNAADWHTFTLTNNL 93 (131) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCchhhhhccchhccccccccccCCCCCCchhhHHHHHHHHhhccccceEEEeeCc Confidence 8888888999999999999999999999999999965431 112356688999 Q ss_pred ccccccccCCcccccCCCCcCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 53 EYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 53 ~YA~~ve~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) +||.++|||+ .+|.|..|.+-++.+-...+.+... T Consensus 94 pYA~~LEyG~-----------------------------S~QAP~g~v~~~~~~~~~~v~~~~~ 128 (131) T protein:vir:94 94 PYAQRLEYGW-----------------------------SQQAPQGFVRVNVSRFQQLLNEEAS 128 (131) T ss_pred hhhhhhhccc-----------------------------cCCCcchHHHHHHHHHHHHHHHHHH Confidence 9999999997 4589999999999998888888777 No 80 >protein:vir:103280 Length: 142 # NCBI annotation: phage-related hypothetical protein # Family: family:all:448 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277459;genbank:gi:71834102;genbank:GeneID:3562391 Probab=98.88 E-value=8.6e-12 Score=81.26 Aligned_cols=87 Identities=20% Similarity=0.190 Sum_probs=70.7 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeec-------------------------------CcEEEEEe Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKD-------------------------------SGFTGVIN 49 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~-------------------------------~~~~~~V~ 49 (116) ++..+...+++++.++..+....+|||||.||.||...+.. .+-+..|. T Consensus 19 ve~~~~~v~r~~a~~i~~~vv~~sPVdTGr~R~nw~vs~~~~~~~~~~~~d~~G~~t~~~~~~~~~~i~~~~~g~~iyi~ 98 (142) T protein:vir:10 19 VTEGVELIVEGTLTKATKDIVKLSPVDTGRFRGNWQATGNSPAAQSLNNYDPDGNETRNSLRRQIYALARDANTNVIYIS 98 (142) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCcccchhhcccceeeecCcccccccCcCCCCccchhhHHHHHHHhhhccccceEEEe Confidence 77777778999999999999999999999999999653211 12345677 Q ss_pred cCCccccccccCCcccccCCCCcCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 50 IGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 50 ~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) ++++||.++|||+ .++.|..|++-++.+-...+.+... T Consensus 99 Nn~pYA~~LEyG~-----------------------------S~QAP~G~v~~a~q~~~~~v~~a~~ 136 (142) T protein:vir:10 99 NRLDYAQGLEFGS-----------------------------SNQAPSGVLGVVQKRLGRYFAEAVQ 136 (142) T ss_pred eCcchhhhhhccc-----------------------------cCCCcchHHHHHHHHHHHHHHHHHH Confidence 8999999999997 5588999999998887776666665 No 81 >protein:vir:107703 Length: 147 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003902;genbank:gi:45686318;genbank:GeneID:2773043 Probab=98.88 E-value=1e-11 Score=80.82 Aligned_cols=87 Identities=18% Similarity=0.211 Sum_probs=71.3 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEee--------------------------------cCcEEEEE Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFK--------------------------------DSGFTGVI 48 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~--------------------------------~~~~~~~V 48 (116) ++..++..+++++.++...+...+|||||.||.||.+.+. ..+.+..| T Consensus 20 ve~~~~~~~r~~a~~i~~~vv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~dp~g~~t~a~~~~~~~~~~~~~~~~~~iyi 99 (147) T protein:vir:10 20 AESTLEHAIEIFVRDVHDALVSRSPVDTGRFKGNWQITFNEIPNHALNRYDKTGGVVRGEEQAKTYGMFSRGGAITSVHF 99 (147) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCcchhhccccceeecCccccccCCcCCCccchhhhhhHHHHHHhhhccCcceEEE Confidence 7888888999999999999999999999999999965321 11235678 Q ss_pred ecCCccccccccCCcccccCCCCcCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 49 NIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 49 ~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) .++++||.++|||+ .++.|..|.+-++.+-...+.+-.. T Consensus 100 ~Nn~pYA~~LEyG~-----------------------------S~QAP~G~V~~t~q~~~~~v~~~~~ 138 (147) T protein:vir:10 100 SNMLIYANALEYGH-----------------------------SQQAPSGVVGLVALRLRSYMADAIK 138 (147) T ss_pred eeCcchhhhhhccc-----------------------------cCCCCchHHHHHHHHHHHHHHHHHH Confidence 89999999999997 4588999999999887766666655 No 82 >protein:vir:95372 Length: 124 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764480;genbank:gi:115334634;genbank:GeneID:5179259 Probab=98.85 E-value=1.5e-11 Score=79.95 Aligned_cols=90 Identities=21% Similarity=0.228 Sum_probs=69.6 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecCcEEEEEecCCcc--ccccccCCcccccCCCCcCccccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEY--AIYVNYGTGIYATGAGGSRAKKIP 78 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~Y--A~~ve~GT~~~~~~~~~~~~~~~~ 78 (116) |+++++++.+.+++.+..++++.+|++||..++||......++ .+|++..+| ++.+|||+... +. T Consensus 28 v~~~v~~vak~a~~~lkk~i~~tspkrTG~YaK~W~~kk~~e~--~~V~nk~~yqLtHLLE~GHAkr---~G-------- 94 (124) T protein:vir:95 28 VEQIMDDVTKEAVGRLKSKIQEVGLVQTGDYMRGWTRKRVPNG--WVIHNKTEYRLAHLLEYGHATV---DG-------- 94 (124) T ss_pred HHHHHHHHHHHHHHHHHHhhHhcCcccccchhccceeeeecCc--eeEEEcCCCceeeeeecceecc---CC-------- Confidence 5555555556667777777778999999999999988766655 379998889 99999997541 11 Q ss_pred ccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 79 WSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 79 ~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) .+ .+++|+++||.+...+.+.++|- T Consensus 95 --------GR-----V~a~pHI~paee~~~~~l~~~i~ 119 (124) T protein:vir:95 95 --------GR-----VPGTPHIRPIEDWLEKEFEDRVE 119 (124) T ss_pred --------cc-----cCCccchhHHHHHHHHHHHHHHH Confidence 11 48999999999998887777666 No 83 >protein:vir:104347 Length: 145 # NCBI annotation: conserved phage-related protein # Family: family:all:448 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398975;genbank:gi:81343959;genbank:GeneID:3778879 Probab=98.85 E-value=1.3e-11 Score=80.23 Aligned_cols=87 Identities=21% Similarity=0.232 Sum_probs=70.6 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEee------------cC-------------------cEEEEEe Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFK------------DS-------------------GFTGVIN 49 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~------------~~-------------------~~~~~V~ 49 (116) ++..++..+++++.++.......+|||||.||.||.+.+. .+ +-+..|. T Consensus 22 ve~~~~~v~r~~a~~i~~~vv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~G~~t~~~~~~~~~~i~~~k~g~~iyi~ 101 (145) T protein:vir:10 22 AEDGFGIVVSNTVIKTANAIVDLSPVDTGRFKANWQISANSPAQQSLNEYDQTGGQTKTYLARQARAVANSKATSVIYIT 101 (145) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccceeecccccccccccCCCCccchhhHHHHHHHhhcccccceEEEe Confidence 8888888999999999999999999999999999976431 01 1124567 Q ss_pred cCCccccccccCCcccccCCCCcCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 50 IGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 50 ~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) ++++||.++|||+ .++.|..|.+-++.+-...+.+... T Consensus 102 Nn~pYA~~LEyG~-----------------------------S~QAP~G~v~~~~~~~~~~v~~~~~ 139 (145) T protein:vir:10 102 NRLDYAADLEYGA-----------------------------SNQAPAGVLGVVQARLGRYFQEAVE 139 (145) T ss_pred eCchhhhHhhccc-----------------------------cCCCcchHHHHHHHHHHHHHHHHHH Confidence 8999999999997 4588999999999888766665555 No 84 >protein:vir:78380 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110844;genbank:gi:134288605;genbank:GeneID:5179643 Probab=98.83 E-value=1.6e-11 Score=79.71 Aligned_cols=87 Identities=23% Similarity=0.224 Sum_probs=74.1 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEee----------------------------cCcEEEEEecCC Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFK----------------------------DSGFTGVINIGS 52 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~----------------------------~~~~~~~V~~~~ 52 (116) ++..++..+++++.++...+...+|||||.||.||.+.+. .-+-+..|.+++ T Consensus 14 ve~~~~~~~r~~a~~~~~~iv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~g~~t~~~~~~~i~~~~~g~~iyi~Nn~ 93 (131) T protein:vir:78 14 AKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNAANFVLNAADWHTFTLTNNL 93 (131) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCchhhhccccceecccccccccCCCCCCchhhHHHHHHHHhhccCCceEEEeeCc Confidence 8888888999999999999999999999999999976431 012345688999 Q ss_pred ccccccccCCcccccCCCCcCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 53 EYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 53 ~YA~~ve~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) +||.++|||+ .+|.|..|.+-++.+-...+.+... T Consensus 94 pYA~~LEyG~-----------------------------S~QAP~G~v~~~~~~~~~~v~~~~~ 128 (131) T protein:vir:78 94 PYAQRLEYGW-----------------------------SQQAPQGFVRVNVSRFQQLLNEEAS 128 (131) T ss_pred hhhhHhhccc-----------------------------cCCCcchHHHHHHHHHHHHHHHHHH Confidence 9999999997 4589999999999998888888877 No 85 >protein:vir:79638 Length: 146 # NCBI annotation: gp40 # Family: family:all:448 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285529;genbank:gi:148734512;genbank:GeneID:5219996 Probab=98.80 E-value=2.4e-11 Score=78.84 Aligned_cols=87 Identities=18% Similarity=0.230 Sum_probs=69.1 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEee------------cC--------------------cEEEEE Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFK------------DS--------------------GFTGVI 48 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~------------~~--------------------~~~~~V 48 (116) ++..+...+++++.++.......+|||||.||.||.+.+. .+ +-+..| T Consensus 20 ve~~~~~~~r~~a~~i~~~vv~~sPVDTGr~Ranw~vs~~~~~~~~~~~~dp~G~~t~~~~~~~i~~~~~g~~~~~~iyi 99 (146) T protein:vir:79 20 VESGLNDVIQIFGEKVHGALVDIAPVDTGRFKANMQITANKPPLYALNQYDPDGEKIKAEGRRTLYALLHGGGAIKSIYF 99 (146) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCcchhhccccceeecCcccccccCCCCCCcccHHHHHHHHHHHHhcccccceeEE Confidence 7777888899999999999999999999999999976431 01 124556 Q ss_pred ecCCccccccccCCcccccCCCCcCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 49 NIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 49 ~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) .++++||.++|||+ .++.|..|.+.++.+-...+.+... T Consensus 100 ~NnlpYA~~LEyG~-----------------------------S~QAP~G~v~~~~~~~~~~v~~a~~ 138 (146) T protein:vir:79 100 SNMLIYANALEYGH-----------------------------SKQAPAGVFGIVAIRLRSYMAEAIR 138 (146) T ss_pred eeCchhhhhhhccc-----------------------------cCCCcchHHHHHHHHHHHHHHHHHH Confidence 68999999999997 4589999999999877665555443 No 86 >protein:vir:80116 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425608;genbank:gi:155042941;genbank:GeneID:5469542 Probab=98.78 E-value=3e-11 Score=78.24 Aligned_cols=90 Identities=21% Similarity=0.224 Sum_probs=67.6 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecCcEEEEEecCCcc--ccccccCCcccccCCCCcCccccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEY--AIYVNYGTGIYATGAGGSRAKKIP 78 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~Y--A~~ve~GT~~~~~~~~~~~~~~~~ 78 (116) |++++++..++++..+..+++..+|++||..++||......++ .+|++..+| ++.+|||+... +. T Consensus 28 v~~~v~evak~a~~~lkk~i~~tsPkrTG~YaK~W~~k~~~~~--~~v~nk~~yqLtHLLE~GHAkr---~G-------- 94 (127) T protein:vir:80 28 LEQIMDDVSKEAVDRLKAKIEEEGLVQTGDYKRGWTRKRTPGG--WVIHNKTEYRLAHLLEYGHATV---DG-------- 94 (127) T ss_pred HHHHHHHHHHHHHHHHHHhhhhcCccccccccccceeeeccCc--eeEeecCCcceeehhhcceecc---CC-------- Confidence 5555555556666666666678999999999999987665554 579998899 99999997541 11 Q ss_pred ccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 79 WSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 79 ~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) .+ .+++|+++||.+...+.+.++|- T Consensus 95 --------GR-----V~a~pHI~paee~~~~~l~~~i~ 119 (127) T protein:vir:80 95 --------GR-----VPETPHIRPVEDWLEKEFEDRVE 119 (127) T ss_pred --------cc-----cCCccchhhHHHHHHHHHHHHHH Confidence 11 48899999999998887777776 No 87 >protein:vir:96288 Length: 100 # NCBI annotation: ORF049 # Family: family:all:180 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240315;genbank:gi:66396010;genbank:GeneID:5133365 Probab=98.76 E-value=9.8e-12 Score=80.93 Aligned_cols=67 Identities=46% Similarity=0.790 Sum_probs=55.9 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecCcEEEEEecCCccccccccCCcccccCC Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGA 69 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~ 69 (116) |.+.+++.+.++++.|...|+.+||||||+|++||.++++.+|+++.|..+++||+-. -++..+..- T Consensus 34 i~~~vk~~IakTa~~I~~~Avs~APVD~G~Lk~SI~~dyk~GGltavI~vGAeYAIkr--msqllvtvi 100 (100) T protein:vir:96 34 IEEWVKKGIAKTTTKIYNTAVALAPVDLGFLEESIDFKYFDGGLSSVISVGADYAIKR--MSQLLVTVI 100 (100) T ss_pred HHHHHHHHHHHHHHHHHhhHHhhccccccccceeeeeeeecCCeeEEEecchhHHHHH--HHHHHhhcC Confidence 9999999999999999999999999999999999999999999999999999999711 010000000 No 88 >protein:vir:3163 Length: 145 # NCBI annotation: unknown # Family: family:all:28417 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665934;genbank:gi:22091120;genbank:GeneID:951270 Probab=98.69 E-value=1.7e-11 Score=79.57 Aligned_cols=89 Identities=17% Similarity=0.228 Sum_probs=59.4 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHh-----C-------C---------------cccccccccceeEee--cCcEEEEEecC Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISL-----M-------P---------------VDTGYLRESVTMDFK--DSGFTGVINIG 51 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~-----a-------P---------------vdTG~Lr~SI~~~~~--~~~~~~~V~~~ 51 (116) +++.+...|.+.+..+.+.+..+ . | +|||.|++||+.++. .++..+.||++ T Consensus 15 l~~~~~~~l~~i~~~~~~~~~~rf~~~~~p~G~~W~pLs~st~a~k~~~~~L~~tG~L~~Si~~~~~~~~~~~~a~vGtn 94 (145) T protein:vir:31 15 IQDGLTDGLERLHTITLRELITNMSDGQDALGNPWEPLKESTIRAKGSDTPLIDNSRLLTDINAASMMDRANRMAVIGTN 94 (145) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccChHHHHHhcCCCCCccCHHHHHHHHHHhhhcccCceeEecCC Confidence 44444445555555555444321 1 2 279999999987654 34567899999 Q ss_pred CccccccccCCcccccCCCCcCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 52 SEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 52 ~~YA~~ve~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) .+||.+++||+.. -.+||||||-++....++.+.+.|. T Consensus 95 ~~YA~~hqfG~~~---------------------------~~IPaRPfLG~~~~~~~~~~~~ii~ 132 (145) T protein:vir:31 95 LDYAEHHEFGAPE---------------------------AGIPARPIFGPAGAYASQQAPDVIG 132 (145) T ss_pred chhhhhhccCCcc---------------------------cccCCCCccCCCccchHHHHHHHHH Confidence 9999999999831 2489999999987655444433333 No 89 >protein:vir:95157 Length: 144 # NCBI annotation: hypothetical protein ORF019 # Family: family:all:448 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293426;genbank:gi:148912847;genbank:GeneID:5228232 Probab=98.65 E-value=1.8e-10 Score=74.01 Aligned_cols=87 Identities=21% Similarity=0.210 Sum_probs=69.5 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeec-------------------------------------Cc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKD-------------------------------------SG 43 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~-------------------------------------~~ 43 (116) +++.+...+++++.++.......+|||||.+|.||.+.+.. -+ T Consensus 19 ve~~~~~~~r~~a~~v~~~vv~~sPVDTGrfRanw~vs~~~p~~~~~~~~~~~~~~~t~d~sg~~tl~~~~~vi~~~~~g 98 (144) T protein:vir:95 19 IDEAASQNAVDTALAIVGDLAYKTPVDTSQALSNWIVTLESPSGQQIKPHFPGSQGSTQRASAAETLNSAKLVLRNKKPG 98 (144) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccceeccccccccccccccccccccCCCchhHHHHHHHHHHhhcCcc Confidence 88888889999999999999999999999999999765331 01 Q ss_pred EEEEEecCCccccccccCCcccccCCCCcCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 44 FTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 44 ~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) -+..|.++++||.++|||+ .++.|.-|.+-++.+....+++-== T Consensus 99 ~~iyi~NnlpYA~~LEyG~-----------------------------S~QAP~G~vr~~~q~~~~~v~~~~~ 142 (144) T protein:vir:95 99 QAIFITNNLPYIRRLNDGY-----------------------------SAQAPAGFVERAVLIGRKMRKKFKI 142 (144) T ss_pred ceEEEeeCchhhhhhhccc-----------------------------cCCCcchHHHHHHHHHHHHHHhhcc Confidence 2445778999999999997 4589999999999887665544322 No 90 >protein:vir:80425 Length: 134 # NCBI annotation: BcepGomrgp15 # Family: family:all:448 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210235;genbank:gi:146329927;genbank:GeneID:5123534 Probab=98.65 E-value=1.8e-10 Score=73.96 Aligned_cols=86 Identities=22% Similarity=0.258 Sum_probs=69.6 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeec------------C-------------------cEEEEEe Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKD------------S-------------------GFTGVIN 49 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~------------~-------------------~~~~~V~ 49 (116) +++.+...+++++.++.......+|||||.||.||.+.+.. + +-+..|. T Consensus 14 ve~~~~~~~r~~a~~~~~~vv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~g~~~~~~~~~~~~vi~~~k~g~~iyi~ 93 (134) T protein:vir:80 14 IEDNVDNLVKNVALAIGSNVIADTPILTGQARRNWQTELNQMPESVLDIPESPSEGMDEALQVLQQTVGQYKAGDTVHIT 93 (134) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCcchhhhcccceeecCcccccccCcCCCCccchhhHHHHHHHHhhccCcceEEEe Confidence 88888899999999999999999999999999999654321 0 1234577 Q ss_pred cCCccccccccCCcccccCCCCcCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 50 IGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 50 ~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) ++++||.++|||+ .++.|..|.+-++.+-...+++ .. T Consensus 94 Nn~pYA~~LEyG~-----------------------------S~QAP~G~v~~t~~~~~~~v~~-~~ 130 (134) T protein:vir:80 94 NNAPYIKELNSGS-----------------------------SQQAPANFVETSIMRATRLIRN-VK 130 (134) T ss_pred eCchhhhhhhccc-----------------------------cCCCcchHHHHHHHHHHHHHHh-hc Confidence 9999999999997 5589999999998887666655 33 No 91 >protein:vir:97190 Length: 148 # NCBI annotation: hypothetical protein ORF030 # Family: family:all:448 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294538;genbank:gi:149408259;genbank:GeneID:5237055 Probab=98.62 E-value=2.1e-10 Score=73.67 Aligned_cols=87 Identities=20% Similarity=0.110 Sum_probs=72.3 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeec--------------C-----------------------c Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKD--------------S-----------------------G 43 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~--------------~-----------------------~ 43 (116) ++..+...+++++.++.......+|||||.||.||.+.+.. + + T Consensus 18 ve~~~~~~~r~~a~~i~~~vv~~sPVdTGrfRanw~vs~~~p~~~~~~~~dp~~~G~~~~~~~~~~i~~~~~vi~~~k~g 97 (148) T protein:vir:97 18 VAEGADALTRKVALAADQAVVSGTPVDTGRARSNWIAAIGSAPSSVIDAYSPGEAGSTEAANTQAAIDQAESVIRGYNYG 97 (148) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCcchhhhhhhheeecccccccccccCCCCCCcccccchhHHHHHHHHHhhccCCC Confidence 88888889999999999999999999999999999664210 0 1 Q ss_pred EEEEEecCCccccccccCCcccccCCCCcCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 44 FTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 44 ~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) .+..|.++++||..+|||+ .++.|..|.+-++..-...+++.-+ T Consensus 98 ~~iyi~NnlpYA~~LEyG~-----------------------------S~QAP~G~v~~t~~~~~~~v~~~~~ 141 (148) T protein:vir:97 98 EEIHITNNLPYIQRLNDGY-----------------------------SAQAPANFVEQAVLEAVQVVQFGRV 141 (148) T ss_pred ceEEEeecchhhhHhhccc-----------------------------cCCCcchHHHHHHHHHHHHHHhhhh Confidence 2456778999999999997 5589999999999998888877666 No 92 >protein:vir:100887 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358767;genbank:gi:77999993;genbank:GeneID:3726158 Probab=98.55 E-value=5.4e-10 Score=71.38 Aligned_cols=87 Identities=15% Similarity=0.056 Sum_probs=65.0 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcc----------cccccccceeEee-cCc---EEEEEecCC--ccccccccCCcc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVD----------TGYLRESVTMDFK-DSG---FTGVINIGS--EYAIYVNYGTGI 64 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvd----------TG~Lr~SI~~~~~-~~~---~~~~V~~~~--~YA~~ve~GT~~ 64 (116) -.+.-.+++.++|..+++..+.++|.. .++|++||.+.-. .++ ....||.+. .+|+|+|+|| T Consensus 21 ~~~~~~ki~kaGA~v~~e~L~~~tp~~~~~~~~~~~~~~HlaD~I~~s~~~~dg~~~g~~~VG~~k~~~~A~f~n~GT-- 98 (139) T protein:vir:10 21 SISDQEKITKAGADVYAKKLAETTKEKHPNTKGDGGKYGHLSEDIRSAAGDIDGDHNGSSTVGFHNKAHIARFLNDGT-- 98 (139) T ss_pred CHHHHHHHHHHHHHHHHHHHHHhcccccCcCCCCCCCCcchhhcceecCcccccccceeeeeCCCCCcceEeecccCc-- Confidence 122234467888899999999999972 3689999977532 122 234577654 3689999998 Q ss_pred cccCCCCcCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 65 YATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 65 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) ..|||+||+.++.++.++.+.+.++ T Consensus 99 ---------------------------~k~~~~hFie~t~~e~~~evl~a~~ 123 (139) T protein:vir:10 99 ---------------------------KYIRADHFVDNARDDAKDAVFAAEA 123 (139) T ss_pred ---------------------------cccCCCchHHHHHHHHHHHHHHHHH Confidence 4599999999999999998888888 No 93 >protein:vir:94944 Length: 121 # NCBI annotation: hypothetical protein phage protein # Family: family:all:448 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239282;genbank:gi:66392064;genbank:GeneID:5076589 Probab=98.52 E-value=2.4e-10 Score=73.35 Aligned_cols=79 Identities=20% Similarity=0.176 Sum_probs=65.6 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEee-------c-------------------CcEEEEEecCCcc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFK-------D-------------------SGFTGVINIGSEY 54 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~-------~-------------------~~~~~~V~~~~~Y 54 (116) +++.++..+++++.++...+...+|||||.+|.||.+.+. + .+-+..|.++++| T Consensus 17 ve~~~~~~~r~~~~~~~~~vv~~sPVdtGrfRanw~vs~~~p~~~~~~~~dp~g~~t~~~~~~~~~~~~~~iyi~NnlpY 96 (121) T protein:vir:94 17 LREEAKKKAIRIAQEIVNGVIARSPVLAGDYRSSWNVSEGSMEFKFNNGGNPANPTPAPAIVVSSNVALPHFYITNGAPY 96 (121) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCchhhhhccccccccCcccccCCCCCCCcchhHHHHHHHHhhccceEEEeeCcch Confidence 8888888899999999999999999999999999976431 0 0124468899999 Q ss_pred ccccccCCcccccCCCCcCcccccccccccccceeccCCCCCCcchhHHHHHHH Q lcl|NC_021326. 55 AIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGR 108 (116) Q Consensus 55 A~~ve~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k 108 (116) |.++|+|+ .+|.|..|.+-++.+-+ T Consensus 97 A~~LE~G~-----------------------------S~QAP~G~v~~t~~~~q 121 (121) T protein:vir:94 97 AQQLEKGS-----------------------------STQAPLGIVRVTLASLR 121 (121) T ss_pred hhhhhccc-----------------------------CCCCcchHHHHHHHhhC Confidence 99999997 55889999999998766 No 94 >protein:vir:105773 Length: 131 # NCBI annotation: gp14 # Family: family:all:10996 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224152;genbank:gi:62362227;genbank:GeneID:3342526 Probab=98.51 E-value=4.9e-10 Score=71.65 Aligned_cols=103 Identities=14% Similarity=0.109 Sum_probs=75.2 Q ss_pred Ch-HHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEe--ecCcEEEEEecCCccccccccCCcccccCCCCcCcccc Q lcl|NC_021326. 1 ME-RWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDF--KDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKI 77 (116) Q Consensus 1 i~-~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~--~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~ 77 (116) |. +.+.++|..+...+...|-..+|+||..|-+|---++ ...+++|+||.++.||.|||.-.|.....|... T Consensus 20 I~~~K~~Ral~~al~~~~~~AA~~TPIDTSTLiNSQfrei~~ngtritGRVGYSAnYA~yVHda~Gklkgqprp~----- 94 (131) T protein:vir:10 20 IAGIRTEKVLYLVMNAGANHAAVITPVKSSTLINSQYKKLEPIPSGMIGRVGYTANYAAAVNAAKGKLKGKPRPD----- 94 (131) T ss_pred hccchHHHHHHHHHHHHHhhhhhccccchhhhccccceeeeccCceeEEeeccceeeeeeeecCccccCCCcCCC----- Confidence 33 4555677788888888888999999999999975444 455699999999999999999777664444332 Q ss_pred cccccccccceeccCCCCCCcchhHHHHHHH-HHHHHhcC Q lcl|NC_021326. 78 PWSYKDANGKWHTTKGQHAQPFWEPAIDAGR-AFFNKYFS 116 (116) Q Consensus 78 ~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k-~~i~~~i~ 116 (116) ....+|-|+-+++ ||.-++++.+ ..|...|. T Consensus 95 -------gkgn~w~p~ae~e-FL~kgfe~~~~d~i~avik 126 (131) T protein:vir:10 95 -------GSGNYWDPNGEPD-FLRKGFERDGLNEIKAIIR 126 (131) T ss_pred -------CCcceecCCCChh-hhhhhhhccchHHHHHHHh Confidence 1223555766776 9999998764 44555555 No 95 >protein:vir:4956 Length: 153 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049932;genbank:gi:9632903;genbank:GeneID:1262079 Probab=98.44 E-value=1.3e-09 Score=69.23 Aligned_cols=87 Identities=10% Similarity=0.060 Sum_probs=64.2 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcc------c---ccccccceeEeec-Cc---EEEEEecCC----ccccccccCCc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVD------T---GYLRESVTMDFKD-SG---FTGVINIGS----EYAIYVNYGTG 63 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvd------T---G~Lr~SI~~~~~~-~~---~~~~V~~~~----~YA~~ve~GT~ 63 (116) ..+.-++++.++|..+++..+..+|.. | |+|++||..+-.+ +| -...||.+. -||.|+|+|| T Consensus 22 ~~~~~~katkAGA~v~~e~L~~~tp~~h~~~~kt~~~~HlaD~I~~s~~~idG~~dG~s~VG~~~~~~a~~a~f~n~GT- 100 (153) T protein:vir:49 22 TPAEQAKITTAGAKVFKEELAEVTREKHYSKKKDLKYGHMADGLAVQSTNADGRKNGVSTVGWKNNYHAQNARRLNDGT- 100 (153) T ss_pred CHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCCCCCCcccccceeccccccccccceeeecccCCccceeeeecccCc- Confidence 556667789999999999999999872 3 6999999875321 22 145677653 4689999998 Q ss_pred ccccCCCCcCcccccccccccccceeccCCCCCCcchhHHHHHH--HHHHHHhcC Q lcl|NC_021326. 64 IYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAG--RAFFNKYFS 116 (116) Q Consensus 64 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~--k~~i~~~i~ 116 (116) .+|||+||+.++.++. ++.+.+.++ T Consensus 101 ----------------------------~km~~~hFie~tr~e~~~k~~vl~A~~ 127 (153) T protein:vir:49 101 ----------------------------KKYRADHFITNVQNDSTVKNKVLLAEK 127 (153) T ss_pred ----------------------------ccCCCChhhHHHHHHhhHHHHHHHHHH Confidence 4599999999999875 444544333 No 96 >protein:vir:6246 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813700;swissprot:trembl:q859b7;genbank:gi:29366760;uniprot:Q859B7;genbank:GeneID:1258903 Probab=98.42 E-value=6.7e-10 Score=70.89 Aligned_cols=89 Identities=12% Similarity=0.094 Sum_probs=65.9 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCc-----------ccccccccceeEeecCcEEEEEec--CCccccccccCCccccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPV-----------DTGYLRESVTMDFKDSGFTGVINI--GSEYAIYVNYGTGIYAT 67 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPv-----------dTG~Lr~SI~~~~~~~~~~~~V~~--~~~YA~~ve~GT~~~~~ 67 (116) |.+.++.+++++|+.+...+++.+|+ +||.|..||++--+........|. .++||+++|||+..+. T Consensus 29 l~k~lk~a~~~aa~v~~~~ar~~tP~g~r~~~~s~~~r~G~L~~Sir~aaT~raa~VrAG~~krVPYA~~I~~G~r~r~- 107 (143) T protein:vir:62 29 LNKAVREANKASGEVLIPQAKHESPDGKRDAKSSKKYRPGKLDKSIKVTASAKGAVIKAGSASRVPYAAAIHFGYRARN- 107 (143) T ss_pred hhHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccCcchhhccccccccccceeeeeCCcCCCCcccccccCccccc- Confidence 78999999999999999999999999 799999999875444444555665 5789999999975532 Q ss_pred CCCCcCcccccccccccccceeccCCCCCCcchhHHHHHHH--------HHHHHhcC Q lcl|NC_021326. 68 GAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGR--------AFFNKYFS 116 (116) Q Consensus 68 ~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k--------~~i~~~i~ 116 (116) +.|+-||+.|+-... .+|.+.|. T Consensus 108 --------------------------Isp~rFl~~a~a~te~~~~r~Ye~~i~~vl~ 138 (143) T protein:vir:62 108 --------------------------ISPNRFLFRAMARKSDVVAATYERRIAAVVE 138 (143) T ss_pred --------------------------ccchhhhhhhhhccCHHHHHHHHHHHHHHHH Confidence 345556666554333 33444444 No 97 >protein:vir:1332 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047931;swissprot:trembl:q9zxa7;genbank:gi:9631149;uniprot:Q9ZXA7;genbank:GeneID:2715891 Probab=98.31 E-value=1.9e-09 Score=68.41 Aligned_cols=89 Identities=13% Similarity=0.109 Sum_probs=65.4 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcc-----------cccccccceeEeecCcEEEEEecC--CccccccccCCccccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVD-----------TGYLRESVTMDFKDSGFTGVINIG--SEYAIYVNYGTGIYAT 67 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvd-----------TG~Lr~SI~~~~~~~~~~~~V~~~--~~YA~~ve~GT~~~~~ 67 (116) |.+.++.+++++|+.+...+++++|+- ||.|..||++--+........|.. ++||+++|||+..+. T Consensus 29 l~k~lk~a~~~aa~v~~~~ar~~tP~g~~~p~~srr~r~G~L~~Sir~aaT~raa~VrAGr~arVPYA~~I~~G~r~r~- 107 (143) T protein:vir:13 29 LNKAVREANKASGEVLIPQAKHESPDGHRDPKSSKRYRPGKLDKSIKVTASAKGAVIKAGSAARVPYAAAIHFGYRKRN- 107 (143) T ss_pred chHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccchhhccccccccccceeeeecCcCCCCcccccccCCcccc- Confidence 789999999999999999999999985 899999998754444445556643 799999999975432 Q ss_pred CCCCcCcccccccccccccceeccCCCCCCcchhHHHHHHH--------HHHHHhcC Q lcl|NC_021326. 68 GAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGR--------AFFNKYFS 116 (116) Q Consensus 68 ~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k--------~~i~~~i~ 116 (116) +.++-||+.|+-... .+|.+.|. T Consensus 108 --------------------------Is~~rFl~~a~a~te~~~~r~Ye~~i~~vl~ 138 (143) T protein:vir:13 108 --------------------------ISANRFLYRAMARKSDVVAATYERRIAAVVE 138 (143) T ss_pred --------------------------cchhhhhhhhhhccCHHHHHHHHHHHHHHHH Confidence 345557666654433 33444444 No 98 >protein:vir:96774 Length: 152 # NCBI annotation: hypothetical phage protein # Family: family:all:448 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224253;genbank:gi:62362388;genbank:GeneID:3345713 Probab=98.29 E-value=5.8e-09 Score=65.76 Aligned_cols=87 Identities=22% Similarity=0.284 Sum_probs=72.6 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCc--------------ccccccccceeEeec------------------------- Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPV--------------DTGYLRESVTMDFKD------------------------- 41 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPv--------------dTG~Lr~SI~~~~~~------------------------- 41 (116) ++..++..+++++.++...+...+|| |||.+|.||.+.+.. T Consensus 24 ~e~~~~~~~R~~~~~i~~~vv~~sPVg~~~~~~~~a~~~ydtGrfRanw~vS~~~p~~~~~~~~~~~~t~~~~~~~i~~~ 103 (152) T protein:vir:96 24 NENLTEKQLRAGLFDAANTVILGSPVGAPELWQQPAPNYYRAGSYRSNHRVSISKITSFEKGISSQSSIMMDLQSDIAKF 103 (152) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccchhhhhhhheeeecCCCcccccCCCCCchHHHHHHHHhhc Confidence 77788888999999999999999999 999999999765321 Q ss_pred -CcEEEEEecCCccccccccCCcccccCCCCcCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 42 -SGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 42 -~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) -+-+..|.++++||..+|||+ .++.|.-|.+.++.+-...+.+.+- T Consensus 104 ~~g~~iyi~NnlPYA~~LEyG~-----------------------------S~QAP~G~vr~t~~~~~~~v~ea~~ 150 (152) T protein:vir:96 104 KIGETLFMTNPLPYATSIEYGH-----------------------------SSQAPNGVYRPAVRRLVKFLNTELK 150 (152) T ss_pred cccceEEEeeCchhhhHhhccc-----------------------------cCCCCchHHHHHHHHHHHHHHHHhc Confidence 012456778999999999996 5589999999999998888888777 No 99 >protein:vir:5000 Length: 141 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049974;genbank:gi:9632946;genbank:GeneID:1262109 Probab=98.24 E-value=8.3e-09 Score=64.89 Aligned_cols=87 Identities=10% Similarity=0.014 Sum_probs=64.7 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCc---------ccccccccceeEee-cCc---EEEEEecCCc----cccccccCCc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPV---------DTGYLRESVTMDFK-DSG---FTGVINIGSE----YAIYVNYGTG 63 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPv---------dTG~Lr~SI~~~~~-~~~---~~~~V~~~~~----YA~~ve~GT~ 63 (116) ..+.-.+++.++|..++...+..+|. ..++|++||.++-. .+| -...||.+.. +|.|+|+|| T Consensus 22 ~~~~~~katkAGA~v~~~~L~~~tp~~hy~~~~~~~~~HlaD~I~~~~~~~DG~~dg~s~VG~~~~~~~~~A~f~n~GT- 100 (141) T protein:vir:50 22 TPAEQVEITTAGAKVFKKELEEVTREKHYSRKKNPKFGHMADGLAIQSTNADGRKNGVSTVGWKNNYHAQNARRLNDGT- 100 (141) T ss_pred CHHHHHHHHHHHHHHHHHHHHHhcccCCCCCCCCCCCCccccceeeccCccccccCCeeeeccCCCccceeeeccccCc- Confidence 55666778999999999999999995 35799999977532 122 1346776433 689999998 Q ss_pred ccccCCCCcCcccccccccccccceeccCCCCCCcchhHHHHHH--HHHHHHhcC Q lcl|NC_021326. 64 IYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAG--RAFFNKYFS 116 (116) Q Consensus 64 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~--k~~i~~~i~ 116 (116) ..|||+||+.++.++. ++.|.+.++ T Consensus 101 ----------------------------~k~~~~hFve~~~~~a~~k~~Vl~A~~ 127 (141) T protein:vir:50 101 ----------------------------KKYRADHFVTNVQNDSTVQKKVLLEKK 127 (141) T ss_pred ----------------------------cccCCCchhHHHHHhhhhHHHHHHHHH Confidence 5599999999999865 555555555 No 100 >protein:vir:100223 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025034;genbank:gi:48697267;genbank:GeneID:2948321 Probab=98.19 E-value=1.2e-08 Score=64.02 Aligned_cols=87 Identities=16% Similarity=0.098 Sum_probs=64.8 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCc-------c---cccccccceeEee-cCc---EEEEEecCC--ccccccccCCcc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPV-------D---TGYLRESVTMDFK-DSG---FTGVINIGS--EYAIYVNYGTGI 64 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPv-------d---TG~Lr~SI~~~~~-~~~---~~~~V~~~~--~YA~~ve~GT~~ 64 (116) ..+.-.+++.++|+.++...+.++|. + .++|+++|..... .++ -...||.+. ..|+|+|+|| T Consensus 21 ~~~~~~k~tkaGA~v~~~~L~~~tp~~~~~~~~~~~~~~HlaD~I~~~~~~idg~~~g~~~VG~~~~~~~Ahf~n~GT-- 98 (139) T protein:vir:10 21 SVSDQEKITKAGADVYAKELAETTKEKHPNTKGDGGKYGHLSEDISSAAGDIDGDHNGSSTVGFHNKAHIARFLNDGT-- 98 (139) T ss_pred CHHHHHHHHHHHHHHHHHHHHHhcccccccCCCCCCCCCcccccceecCccccccccccceeCCCCCceeeeeeccCc-- Confidence 23333467788899999999999995 2 3589999977532 122 134566543 3479999998 Q ss_pred cccCCCCcCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 65 YATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 65 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) ..|||+||+..+.++.++.+.+.++ T Consensus 99 ---------------------------~~~~~~hFie~t~~e~~~ev~~a~~ 123 (139) T protein:vir:10 99 ---------------------------KNIRADHFVDNARDDAKDAVFAAEA 123 (139) T ss_pred ---------------------------cccCCCchHHHHHHHHHHHHHHHHH Confidence 4599999999999999998888888 No 101 >protein:vir:99833 Length: 190 # NCBI annotation: hypothetical protein # Family: family:all:274 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164071;genbank:gi:56692603;genbank:GeneID:3192561 Probab=98.13 E-value=1.5e-08 Score=63.48 Aligned_cols=114 Identities=13% Similarity=0.070 Sum_probs=58.0 Q ss_pred ChH----------HHHHHHHHHHHHHHHHHHHh-----CC------------------------cccccccccceeEeec Q lcl|NC_021326. 1 MER----------WVKRGIAKTTAKIHNTIISL-----MP------------------------VDTGYLRESVTMDFKD 41 (116) Q Consensus 1 i~~----------~~~~~~~~~a~~v~~~ak~~-----aP------------------------vdTG~Lr~SI~~~~~~ 41 (116) +.+ ..++.+...++.+.+..+.+ .| .+||.|++||+.++.. T Consensus 13 ~~~~L~~l~~~~~~~~~l~~~ig~~l~~~~~~rf~~~~~PdG~~W~p~~~~t~~rk~~~~~~~L~~tg~L~~Si~~~~~~ 92 (190) T protein:vir:99 13 ALDVLNAGSAALGDPSGLLQDIGELLLNIHRRRFQAQVSPDGTPWQPLSPAYLRRKRKNRDKILTLDGHLRNLLRYQLDG 92 (190) T ss_pred HHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCccccHHHHHHhhcCCCccceecHHHHHHHhheecC Confidence 111 12334555555555544332 22 2689999999987765 Q ss_pred CcEEEEEecCCccccccccCCcccccCCCC----cCccccccccccc-------ccc----eeccCCCCCCcchhHH--- Q lcl|NC_021326. 42 SGFTGVINIGSEYAIYVNYGTGIYATGAGG----SRAKKIPWSYKDA-------NGK----WHTTKGQHAQPFWEPA--- 103 (116) Q Consensus 42 ~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~----~~~~~~~~~~~~~-------~~~----~~~~~g~~a~PFl~pA--- 103 (116) + .+.|+++..||..++||.......... .......+..... ++. ...+-.+|++|||--. T Consensus 93 ~--~v~vGtn~~yA~iHq~Gg~i~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~v~IPaRpfLG~s~~d 170 (190) T protein:vir:99 93 S--ELLFGSDRPYAAIHHFGGTIQRQARSSTVYFRQNERTGEVGREFVPRRRSNFAQDVQIGPYTIQMPARPWLGTSSQD 170 (190) T ss_pred c--EEEEecCcchhhhhhcCCcccccccchhhhhhhhhhhhhhhcccccccccccchhcccccceeeecCcccCCCCHHH Confidence 5 578999999999999996543221110 0001111111000 010 1123458999999433 Q ss_pred HHHHHHHHHHhcC Q lcl|NC_021326. 104 IDAGRAFFNKYFS 116 (116) Q Consensus 104 ~~~~k~~i~~~i~ 116 (116) .++-++.|.+.|. T Consensus 171 ~~~I~~~i~~~l~ 183 (190) T protein:vir:99 171 DDTILQRVERYLQ 183 (190) T ss_pred HHHHHHHHHHHHH Confidence 2222233333333 No 102 >protein:vir:4859 Length: 140 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049399;genbank:gi:9632427;genbank:GeneID:1258496 Probab=98.08 E-value=2.9e-08 Score=61.88 Aligned_cols=87 Identities=10% Similarity=0.046 Sum_probs=63.2 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCc------c---cccccccceeEeec-Cc---EEEEEecC----CccccccccCCc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPV------D---TGYLRESVTMDFKD-SG---FTGVINIG----SEYAIYVNYGTG 63 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPv------d---TG~Lr~SI~~~~~~-~~---~~~~V~~~----~~YA~~ve~GT~ 63 (116) ..+.-.+++.++|..+++..+..+|. + .++|++||..+-.+ +| -...||.+ ..+|.|+|+|| T Consensus 22 ~~~~~~katkAGA~v~~~~L~~~tp~~h~~~~~t~~~~HlaD~I~~~~~~iDg~~~g~s~VG~~kk~~a~~A~f~n~GT- 100 (140) T protein:vir:48 22 TPAEQAKITTAGAKVFKEELAEVTRQKHYSNKKHLKYGHMADGLSVQSTNVDGRKNGVSTVGWVNRYHAQNARRLNDGT- 100 (140) T ss_pred CHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCCCCCCcchhceeecccccccccCceeeeccCCCcceeeeeccccCc- Confidence 45666678899999999999999995 3 35799999874221 22 13467764 34689999998 Q ss_pred ccccCCCCcCcccccccccccccceeccCCCCCCcchhHHHHHH--HHHHHHhcC Q lcl|NC_021326. 64 IYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAG--RAFFNKYFS 116 (116) Q Consensus 64 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~--k~~i~~~i~ 116 (116) ..|||+||+.++.++. ++.+.+.++ T Consensus 101 ----------------------------~k~~~~hFve~~~~e~~~k~~vl~A~~ 127 (140) T protein:vir:48 101 ----------------------------KKYRADHFVTNVQNDSAVQTKVLLAEK 127 (140) T ss_pred ----------------------------cccCCCchhHHHHHhhhhHHHHHHHHH Confidence 4599999999999876 444544444 No 103 >protein:vir:79225 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469157;genbank:gi:157835000;genbank:GeneID:5648806 Probab=98.05 E-value=1.6e-08 Score=63.29 Aligned_cols=92 Identities=14% Similarity=-0.021 Sum_probs=56.1 Q ss_pred ChHH----------HHHHHHHHHHHHHHHHHHh--------CC--------------------cccccccccceeEeecC Q lcl|NC_021326. 1 MERW----------VKRGIAKTTAKIHNTIISL--------MP--------------------VDTGYLRESVTMDFKDS 42 (116) Q Consensus 1 i~~~----------~~~~~~~~a~~v~~~ak~~--------aP--------------------vdTG~Lr~SI~~~~~~~ 42 (116) |.++ ....+...++.++.....+ .| .+||.|++||..+...+ T Consensus 14 ~~~~L~~l~~~~~d~~~l~~~ig~~l~~~~~~rF~~eG~~W~pls~~t~~~r~~~g~~~~~iL~~tG~L~~Si~~~~~~~ 93 (155) T protein:vir:79 14 VRQRLAVLMRSVTDTLPVMRGIAAELLAETEFAFMDEGPGWPQLSPATVAAREAKGRGPHPILQVTNALARSVTTWADRN 93 (155) T ss_pred HHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhhccCCCCCCCCHHHHHHHhccCCCCCCccccchhhhhhhhceecCC Confidence 1111 2333444455554444322 11 37999999998876544 Q ss_pred cEEEEEecCCccccccccCCcccccCCCCcCcccccccccccccceeccCCCCCCcchh---------HHHHHHHHHHHH Q lcl|NC_021326. 43 GFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWE---------PAIDAGRAFFNK 113 (116) Q Consensus 43 ~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~---------pA~~~~k~~i~~ 113 (116) .+.|+++..||.+++||+..... ....+|++|||- -+.++-...+.+ T Consensus 94 --~v~vGt~~~YA~iHqfGg~~~~~----------------------~~v~iPaRpfLG~s~~~~l~~~~~~~I~~~i~~ 149 (155) T protein:vir:79 94 --EAGIGSNLVYAAIHQFGGDAGRG----------------------HQVEIPARRYLPFDENGQLAAGARQSILEVVLT 149 (155) T ss_pred --EEEEecCchhhhhhhcccccCCC----------------------CccccCCccccCCCCccccchHHHHHHHHHHHH Confidence 67899999999999999743100 013589999993 223344455666 Q ss_pred hcC Q lcl|NC_021326. 114 YFS 116 (116) Q Consensus 114 ~i~ 116 (116) .|+ T Consensus 150 ~l~ 152 (155) T protein:vir:79 150 ALS 152 (155) T ss_pred HHH Confidence 666 No 104 >protein:vir:4833 Length: 140 # NCBI annotation: ORF29 # Family: family:all:1029 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038330;genbank:gi:9634656;genbank:GeneID:1262624 Probab=97.98 E-value=5e-08 Score=60.62 Aligned_cols=87 Identities=11% Similarity=0.072 Sum_probs=63.7 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcc------c---ccccccceeEeec-CcE---EEEEecCC----ccccccccCCc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVD------T---GYLRESVTMDFKD-SGF---TGVINIGS----EYAIYVNYGTG 63 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvd------T---G~Lr~SI~~~~~~-~~~---~~~V~~~~----~YA~~ve~GT~ 63 (116) +.+.-.+++.++|+.+++..+..+|.. | |+|++||..+-.+ +|. ...||.+. -+|.|+|.|| T Consensus 22 ~~~~~~katkAGAkv~~~~L~~~tp~~h~~~r~t~~~~HlaD~I~~~~~~idg~~dG~s~VG~~k~~~a~~a~f~NdGT- 100 (140) T protein:vir:48 22 TPAEQAKITTAGAKVFKKELAEVTREKHYSKKKDLKYGHMADGLAVQSTNVDGRKNGVATVGWKNNYHAQNARRLNDGT- 100 (140) T ss_pred CHHHHHHHHHHhHHHHHHHHHHhcccCCCCCCCCCCCCcccccceecccccccccccceeecccCCCceeEEeecccCc- Confidence 455666688999999999999999962 4 5899999875321 221 34577653 4689999998 Q ss_pred ccccCCCCcCcccccccccccccceeccCCCCCCcchhHHHHHH--HHHHHHhcC Q lcl|NC_021326. 64 IYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAG--RAFFNKYFS 116 (116) Q Consensus 64 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~--k~~i~~~i~ 116 (116) ..|||+||+..+.++. ++.+.+.++ T Consensus 101 ----------------------------~k~~~~hFve~t~~e~~~~~~vl~A~~ 127 (140) T protein:vir:48 101 ----------------------------KKYRADHFVTNVQNDSAVRDKVLLAEK 127 (140) T ss_pred ----------------------------cccCCCchHHHHHHhhhhHHHHHHHHH Confidence 4599999999999865 555555554 No 105 >protein:vir:103841 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938236;genbank:gi:38229141;genbank:GeneID:2648156 Probab=97.96 E-value=7e-08 Score=59.80 Aligned_cols=92 Identities=16% Similarity=0.060 Sum_probs=54.7 Q ss_pred ChHH----------HHHHHHHHHHHHHHHHHHh-----------CC-----------------cccccccccceeEeecC Q lcl|NC_021326. 1 MERW----------VKRGIAKTTAKIHNTIISL-----------MP-----------------VDTGYLRESVTMDFKDS 42 (116) Q Consensus 1 i~~~----------~~~~~~~~a~~v~~~ak~~-----------aP-----------------vdTG~Lr~SI~~~~~~~ 42 (116) |.++ ....+...++.+......+ .| .+||.|++||..+...+ T Consensus 14 ~~~~L~~l~~~~~~~~~l~~~ig~~l~~~~~~rF~p~G~~W~plsp~t~~~r~k~g~~~~~~L~~tG~L~~Si~~~~~~~ 93 (155) T protein:vir:10 14 VQERLAALYAAVTDTLPLMRGIAAELLAETEFAFMDEGPGWPQLSPVTVAARAAKGRGAHPILQVTNALARSITTRADRD 93 (155) T ss_pred HHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCCccchHHHHhccCCCCCccccchhhhhhhhceecCC Confidence 1111 2233444455544444221 11 26899999999876555 Q ss_pred cEEEEEecCCccccccccCCcccccCCCCcCcccccccccccccceeccCCCCCCcchh-HH--------HHHHHHHHHH Q lcl|NC_021326. 43 GFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWE-PA--------IDAGRAFFNK 113 (116) Q Consensus 43 ~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~-pA--------~~~~k~~i~~ 113 (116) .+.|+++..||.+++||+..-. .....+||+|||- +. .+.-.+.+.+ T Consensus 94 --~v~vGtn~~YA~iHqfGg~~~~----------------------~~~~~iPARPfLG~s~~~e~~~ei~~~I~~~i~~ 149 (155) T protein:vir:10 94 --QAQIGSNLSYAAIQQLGGQAGR----------------------GRKVTIPARPYLPVLRNGQLKPSARDAVLDVLLA 149 (155) T ss_pred --EEEEecCcchhhhhhcccccCC----------------------CCccccCCccccCCCccccchHHHHHHHHHHHHH Confidence 5789999999999999974310 0124699999994 11 2333344555 Q ss_pred hcC Q lcl|NC_021326. 114 YFS 116 (116) Q Consensus 114 ~i~ 116 (116) .|+ T Consensus 150 ~l~ 152 (155) T protein:vir:10 150 ALS 152 (155) T ss_pred HHh Confidence 555 No 106 >protein:vir:99196 Length: 155 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950453;genbank:gi:119953654;genbank:GeneID:4643056 Probab=97.88 E-value=5.4e-08 Score=60.43 Aligned_cols=92 Identities=14% Similarity=-0.011 Sum_probs=55.5 Q ss_pred ChHH----------HHHHHHHHHHHHHHHHHHh--------CC--------------------cccccccccceeEeecC Q lcl|NC_021326. 1 MERW----------VKRGIAKTTAKIHNTIISL--------MP--------------------VDTGYLRESVTMDFKDS 42 (116) Q Consensus 1 i~~~----------~~~~~~~~a~~v~~~ak~~--------aP--------------------vdTG~Lr~SI~~~~~~~ 42 (116) |.++ .+..+...++.+......+ .| .+||.|++||.+++..+ T Consensus 14 ~~~~L~~l~~~~~d~~~l~~~ig~~l~~~~~~rF~pdG~~W~pls~~t~~~r~~~g~~~~~iL~~tg~L~~Si~~~~~~~ 93 (155) T protein:vir:99 14 VRQRLALLMRSVTDTLPVMRGIAAELLAETEFAFMDEGPGWPQLSPVTVAAREAKGRGPHPILQVTNALARSVTTWADRN 93 (155) T ss_pred HHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhhccCCCCCCCChHHHHHHhccCCCCCCcchhchhhhhhhhceecCC Confidence 1111 2334444455444444322 11 37899999999876544 Q ss_pred cEEEEEecCCccccccccCCcccccCCCCcCcccccccccccccceeccCCCCCCcchh--------H-HHHHHHHHHHH Q lcl|NC_021326. 43 GFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWE--------P-AIDAGRAFFNK 113 (116) Q Consensus 43 ~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~--------p-A~~~~k~~i~~ 113 (116) .+.|+++..||..++||+.... .....+|++|||- | ..+.-.+.+.+ T Consensus 94 --~v~vGtn~~YA~iHqfGg~~~~----------------------~~~v~iPaRpfLG~s~~~~l~~e~~~~I~~~i~~ 149 (155) T protein:vir:99 94 --EAGIGSNLVYAAIHQFGGDAGR----------------------GHQVEIPARRYLPFDENGQLAAGARQSILEIVLT 149 (155) T ss_pred --EEEEecCccchhhhhcccccCC----------------------CCccccCCccccCCCCccccchHHHHHHHHHHHH Confidence 5789999999999999974210 0013689999994 1 22334445666 Q ss_pred hcC Q lcl|NC_021326. 114 YFS 116 (116) Q Consensus 114 ~i~ 116 (116) .|+ T Consensus 150 ~l~ 152 (155) T protein:vir:99 150 ALS 152 (155) T ss_pred HHh Confidence 666 No 107 >protein:vir:7449 Length: 123 # NCBI annotation: gp26 # Family: family:all:2713 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818564;genbank:gi:29567001;genbank:GeneID:1260238 Probab=97.86 E-value=6.2e-08 Score=60.10 Aligned_cols=85 Identities=13% Similarity=0.093 Sum_probs=63.5 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCc--ccccccccceeEee--c-CcEEEEEecCCccccccccCCcccccCCCCcCcc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPV--DTGYLRESVTMDFK--D-SGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAK 75 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPv--dTG~Lr~SI~~~~~--~-~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~ 75 (116) ++..+.-.+..+|..++.+||.+||+ +||+-|++|.-.++ + +..+..+..+++|.+|.|.+++. T Consensus 24 ~~~~~~~~~d~~a~~le~~aK~nApW~DRTg~ARqgl~~~~~~~g~~~~~Iylsh~veYG~~LEla~~~----------- 92 (123) T protein:vir:74 24 MESAVDALMDYEAAYATGQLKMRAPWTDRTGAARSGLLAVANKLGPGSHELIMSYSVHYGIWLEIANSG----------- 92 (123) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCCcccchhhhhhhccccccCCCceEEEEEecCeeecceeeecCCC----------- Confidence 33333334556777899999999999 89999999954433 3 34677788899999999988742 Q ss_pred cccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 76 KIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 76 ~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) +++ -+.|+++..-++|.+.++ T Consensus 93 -------------------kya-Ii~Ptv~~~~~~im~g~~ 113 (123) T protein:vir:74 93 -------------------QYA-VIGPFLPVMGRKLMHDLE 113 (123) T ss_pred -------------------Cce-eecchHHHHhHHHHHHHH Confidence 222 678888888888888888 No 108 >protein:vir:1988 Length: 156 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050635;genbank:gi:9633522;genbank:GeneID:2636282 Probab=97.83 E-value=6.6e-08 Score=59.96 Aligned_cols=92 Identities=16% Similarity=0.152 Sum_probs=54.6 Q ss_pred ChHHH---------HHHHHHHHHHHHHHHHHh-----CC----------------------------cccccccccceeE Q lcl|NC_021326. 1 MERWV---------KRGIAKTTAKIHNTIISL-----MP----------------------------VDTGYLRESVTMD 38 (116) Q Consensus 1 i~~~~---------~~~~~~~a~~v~~~ak~~-----aP----------------------------vdTG~Lr~SI~~~ 38 (116) +.+++ +..+...++.+.+....+ .| .+||.|++||..+ T Consensus 14 l~~~L~~l~~~~~~~~l~~~Ig~~l~~~~~~rf~~~~~Pd~G~~W~pls~~t~~~r~~~~~~~~~~L~~tg~L~~Si~~~ 93 (156) T protein:vir:19 14 IQLALDELGTVTRDRAIPRVMAAALLSSTEQAFERQADPDTGKGWEAWSDSWLAWRQDHGFVPGSILTLHGDLARSITTD 93 (156) T ss_pred HHHHHHHHHhhhccHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCCcccChHHHHHhhccCCCCCcchhhhHHHHHHhhhe Confidence 11111 123344444444433221 22 2689999999887 Q ss_pred eecCcEEEEEecCCccccccccCCcccccCCCCcCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 39 FKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 39 ~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) ...+ .+.||++..||.+++||+..... .....+|++|||- -=++.+..|.+.|. T Consensus 94 ~~~~--~v~vGt~~~yA~vHqfG~~~~~~---------------------~~~~~iPaRpfLG-~s~~d~~~I~~~i~ 147 (156) T protein:vir:19 94 YGQD--YALIGSPKIYAAIHQWGGTPDMA---------------------PRPAGVPARPYMG-LDKTGEQEIFDAIR 147 (156) T ss_pred ecCC--EEEEecchhhhHHhhcCcccccC---------------------CCccccCCccccC-CCHHHHHHHHHHHH Confidence 6555 57899999999999999754210 0124699999994 33444555444444 No 109 >protein:vir:6216 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:10886 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852596;genbank:gi:31415856;genbank:GeneID:1489214 Probab=97.78 E-value=1.1e-07 Score=58.66 Aligned_cols=92 Identities=16% Similarity=0.170 Sum_probs=76.0 Q ss_pred ChHHH-HHHHHHHHHHHHHHHHHhCCc----ccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCcc Q lcl|NC_021326. 1 MERWV-KRGIAKTTAKIHNTIISLMPV----DTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAK 75 (116) Q Consensus 1 i~~~~-~~~~~~~a~~v~~~ak~~aPv----dTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~ 75 (116) |++.| .+.|.++|+...+..+=+.|+ ..|+||++|.+.++++.+..+....+-|..|+|.||.+..- T Consensus 21 Vd~kvs~e~L~eAA~~f~~KL~P~Ip~Sl~kkk~HlrD~lkVvvk~d~V~V~Fed~a~yW~f~EnGt~~~~~-------- 92 (125) T protein:vir:62 21 VNKKVSLDALDEAAKYFASKLKPKINVSNKNKRTHLRDSLKVVVKDDRVSVEFKDEAWYWYLVEHGHKKAKG-------- 92 (125) T ss_pred hhhhhhHHHHHHHHHHHHHhhccccChhhhhhhhhcceeeeEEeeCCeEEEEEcchhhhhhhhhcccccccc-------- Confidence 44433 457888888888888777776 46899999999999999998888899999999999965311 Q ss_pred cccccccccccceeccCC-CCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 76 KIPWSYKDANGKWHTTKG-QHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 76 ~~~~~~~~~~~~~~~~~g-~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) .| .++|-|....|++++.+|.+.|+ T Consensus 93 ----------------~g~vkaqhf~~~Tf~~nk~kI~~iM~ 118 (125) T protein:vir:62 93 ----------------KGRVKGKHFVQNTFDAEGDKIADIMA 118 (125) T ss_pred ----------------ccccchhhhhhccHHhhHHHHHHHHH Confidence 22 58899999999999999999999 No 110 >protein:vir:102190 Length: 93 # NCBI annotation: gp21 # Family: family:all:2713 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655217;genbank:gi:109522797;genbank:GeneID:4157429 Probab=97.76 E-value=9.2e-08 Score=59.16 Aligned_cols=81 Identities=11% Similarity=0.132 Sum_probs=66.8 Q ss_pred HHHHHHHHHHHHHHHHHHhCCc--ccccccccceeEee--c-CcEEEEEecCCccccccccCCcccccCCCCcCcccccc Q lcl|NC_021326. 5 VKRGIAKTTAKIHNTIISLMPV--DTGYLRESVTMDFK--D-SGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPW 79 (116) Q Consensus 5 ~~~~~~~~a~~v~~~ak~~aPv--dTG~Lr~SI~~~~~--~-~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~ 79 (116) +.-.+..+|..++.+||.+||+ +||+-|++|...++ + +..+..+..+++|.+|.|.+++. T Consensus 1 ~~~~~d~aa~~le~~aK~nApW~DRTg~AR~~l~~~~~~~g~~~~~i~lsh~v~Yg~~LE~a~~~--------------- 65 (93) T protein:vir:10 1 MKATSNYHAVEGTAHMKEHAPWTDRTGAARAGLHAVASTPQPDRYEIVFAHTVHYGIWLEIANSG--------------- 65 (93) T ss_pred CchhhhHHHHHHHHHHhcCCCccccchhhhhhhcccccccCCceEEEEEecCeeccceEEeecCC--------------- Confidence 7777778899999999999999 89999999965443 3 34777888899999999998842 Q ss_pred cccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 80 SYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 80 ~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) +++ .+.|+++..-++|.+.|+ T Consensus 66 ---------------kya-Il~Ptv~~~~~~i~~g~~ 86 (93) T protein:vir:10 66 ---------------RYE-IIMPTVHHEGKLMAQRLR 86 (93) T ss_pred ---------------Ccc-chhhhHHHHHHHHHHHHH Confidence 222 788999999999988888 No 111 >protein:vir:79091 Length: 175 # NCBI annotation: gp5, phage virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111205;genbank:gi:134288802;genbank:GeneID:4960765 Probab=97.73 E-value=8.4e-08 Score=59.37 Aligned_cols=92 Identities=14% Similarity=0.115 Sum_probs=52.7 Q ss_pred ChHH----------HHHHHHHHHHHHHHHHHHh-----CC---------------------------------------- Q lcl|NC_021326. 1 MERW----------VKRGIAKTTAKIHNTIISL-----MP---------------------------------------- 25 (116) Q Consensus 1 i~~~----------~~~~~~~~a~~v~~~ak~~-----aP---------------------------------------- 25 (116) |.++ ....+...++.++.....+ .| T Consensus 14 ~~~~L~~l~~~~~d~~~lm~~Ig~~l~~~t~~rF~~~~~PdW~pls~~t~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~L 93 (175) T protein:vir:79 14 LRTRLLQLEQAGHQKADAMRKITQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKNGELTAAASRRKAGLMIL 93 (175) T ss_pred HHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCChHHHHhhccccccccccccchhhHhhhccCCCcc Confidence 1111 1234444555554443321 11 Q ss_pred cccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCcccccccccccccceeccCCCCCCcchhHHH- Q lcl|NC_021326. 26 VDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAI- 104 (116) Q Consensus 26 vdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~- 104 (116) ++||.|++||.++...+ .+.||+|..||.+++||+..- . .....+||+|||-=.- T Consensus 94 ~~tG~L~~Si~~~~~~~--~v~vGtn~~YAaiHqfGg~~~----~------------------~~~v~IPARPfLG~s~~ 149 (175) T protein:vir:79 94 QDSGQMAASTATDSGED--YSVIGSNKEYAAIQHFGGQAG----R------------------GLKVTIPGRAWLPVTAD 149 (175) T ss_pred eechhhhhhhhheecCC--EEEEecCcchhhHhhcccccC----C------------------CcccccCcccccCCCcc Confidence 26999999999886655 678999999999999997421 0 0123689999996321 Q ss_pred --------HHHHHHHHHhcC Q lcl|NC_021326. 105 --------DAGRAFFNKYFS 116 (116) Q Consensus 105 --------~~~k~~i~~~i~ 116 (116) +.-...+.+.|. T Consensus 150 de~~~~~~~~I~~~i~~~l~ 169 (175) T protein:vir:79 150 GELQPEAVEPVLNTILRHLM 169 (175) T ss_pred cchhHHHHHHHHHHHHHHHH Confidence 222222222222 No 112 >protein:vir:8106 Length: 150 # NCBI annotation: gp10 # Family: family:all:3937 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817687;genbank:gi:29566118;genbank:GeneID:1259312 Probab=97.73 E-value=4.6e-08 Score=60.83 Aligned_cols=113 Identities=19% Similarity=0.205 Sum_probs=61.8 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCccccccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWS 80 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~ 80 (116) |+.-|...|++.| ..-||++.||+.|+.+.||.+.-+...-.+.++...+||+||||||+.-..++.+++....... T Consensus 25 V~~GiNdFMe~~A---~~~aK~~SPV~~GeY~~S~~V~~ka~NGRG~~G~~~~~AH~VEFGtgadkkqgrgkkgkrgkdg 101 (150) T protein:vir:81 25 VDAGINDFMENEA---IPYAKSISPVDDGEYAASWAVMKKAKNGRGVFGPKAWYAHFVEFGTGADKKQGRGKKGKRGKDG 101 (150) T ss_pred hhhhHHHHHHhhh---hhhhhccCCcccchhHHHHHHHhhcccCccccCccchhhhhhhhccccccccccccccccCccc Confidence 4444444443332 3457999999999999999663332223689999999999999999987777776666544332 Q ss_pred ccc---cccceec-cCCCC--CCcchhHHHH----HHHHHHHHhcC Q lcl|NC_021326. 81 YKD---ANGKWHT-TKGQH--AQPFWEPAID----AGRAFFNKYFS 116 (116) Q Consensus 81 ~~~---~~~~~~~-~~g~~--a~PFl~pA~~----~~k~~i~~~i~ 116 (116) ... ..|.+.+ -|-+| +|-.-..... ..+--|.+.|| T Consensus 102 krtveiddgefrrvgpdtptkaqgiaqkvashfggslkggisksls 147 (150) T protein:vir:81 102 KRTVEIDDGEFRRVGPDTPTKAQGIAQKVASHFGGSLKGGISKSLS 147 (150) T ss_pred ceeeeecCccceecCCCCchhhhhHHHHHHHhcccccccccccccc Confidence 111 1222222 12222 2211000000 00112344444 No 113 >protein:vir:80970 Length: 112 # NCBI annotation: gp10 # Family: family:all:899 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468396;genbank:gi:157324970;genbank:GeneID:5601405 Probab=97.62 E-value=3.7e-07 Score=55.87 Aligned_cols=90 Identities=13% Similarity=0.022 Sum_probs=57.4 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCccccccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWS 80 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~ 80 (116) ++++.+++....+.+|.+++...+|.|||.|++|-. .+ ++ +.|..+.+||.++.||..-. T Consensus 16 l~~a~~~aq~~~~~ev~~~~~~yVP~~tG~L~~s~~-~~-~~---g~I~y~tPYAr~qYY~~~~~--------------- 75 (112) T protein:vir:80 16 VKKAKERGQFALINQAAADIALYVPFLSGDLSNQYV-IM-ND---KEIMWTSIYARRLYNGINFN--------------- 75 (112) T ss_pred HHHHHHHHHHHHHHHHHHHhhcCCCcccCcccccee-ec-cC---ceEEecCchhhHhhhcccCC--------------- Confidence 455555566667888888889999999999999942 12 22 46888999999999984211 Q ss_pred ccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 81 YKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 81 ~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) +....+|+.-++ ++..|.....+.+.+... T Consensus 76 -----~~~~~~p~ag~~-W~erak~~~~~~~~~~~~ 105 (112) T protein:vir:80 76 -----FTLTHHPLAGPK-WDQRAKVDKLESWIEVAQ 105 (112) T ss_pred -----CCcCCCCCcchh-hHHHHHhhhhHHHHHHHH Confidence 111224544444 445565555554444433 No 114 >protein:vir:100652 Length: 134 # NCBI annotation: 77ORF029 # Family: family:all:589 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958610;genbank:gi:41189542;genbank:GeneID:2743798 Probab=97.59 E-value=4.8e-07 Score=55.21 Aligned_cols=95 Identities=16% Similarity=0.141 Sum_probs=65.0 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCc--ccccccccceeE---eecCcEEEEEecCCc-----cccccccCCcccccCCC Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPV--DTGYLRESVTMD---FKDSGFTGVINIGSE-----YAIYVNYGTGIYATGAG 70 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPv--dTG~Lr~SI~~~---~~~~~~~~~V~~~~~-----YA~~ve~GT~~~~~~~~ 70 (116) |++..++||..+++.|+.++|.+..+ |||.+.+++..+ ..++--+..|+...+ +-++.|||+.. T Consensus 24 ~~ri~dkAL~~age~v~~~~K~~~~~fkDTGati~ev~~s~p~~~~G~r~V~vgW~G~~~R~~ivHLnE~Gyt~------ 97 (134) T protein:vir:10 24 MVKVQDKALIAGAKVIVEEIKKQLKPSEDSGALISEIGRTEPEWIKGKRTVTIRWRGPFERFRIVHLIENGHVE------ 97 (134) T ss_pred hhhhhhHHHHHHhHHHHHHHHhhcCccccccceeccEeecCeeecCCceEEEEEEEcCCceeeEEEeeecceee------ Confidence 99999999999999999999988777 999999998653 223334566665332 56788899721 Q ss_pred CcCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 71 GSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) -..|+++..+||- =+..|++..++.+.+.+- T Consensus 98 ------------~r~Gk~i~PrG~G---~i~~a~~~~e~~~~~~ik 128 (134) T protein:vir:10 98 ------------KKSGKFVKPKAMG---GINRAIRQGQNKYFETLK 128 (134) T ss_pred ------------cCCCCeeccchhh---HHHHHHHhhhHHHHHHHH Confidence 1233344444444 345588887776555444 No 115 >protein:vir:101508 Length: 120 # NCBI annotation: gp21 # Family: family:all:2713 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655400;genbank:gi:109522588;genbank:GeneID:4157580 Probab=97.58 E-value=3.1e-07 Score=56.29 Aligned_cols=83 Identities=12% Similarity=0.133 Sum_probs=61.1 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCc--ccccccccceeEee---cCcEEEEEecCCccccccc--cCCcccccCCCCcC Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPV--DTGYLRESVTMDFK---DSGFTGVINIGSEYAIYVN--YGTGIYATGAGGSR 73 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPv--dTG~Lr~SI~~~~~---~~~~~~~V~~~~~YA~~ve--~GT~~~~~~~~~~~ 73 (116) ++..+.--+..+|..++.+||.+||+ +||+-|++|...++ .+..+..+..+++|.+|.| .+...+ T Consensus 24 ~~~~~~~~~d~~a~~le~~aK~nApW~DRTg~ARq~i~~~~~~~~~~~~~Iylsh~veYG~~LEla~~~kya-------- 95 (120) T protein:vir:10 24 VDRAMKATSNYHAVEGTAHMKEHAPWTDRTGAARAGLHAVASTPQPDRYEIVFAHTVHYGIWLEIANSGRYE-------- 95 (120) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCCcccchhhhhhhccccccCCCceEEEEEecCeeecceEEeeCCCCcc-------- Confidence 33333444567788899999999999 89999999987553 2336777778999999999 444322 Q ss_pred cccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 74 AKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) -+.|++...-++|.+.|+ T Consensus 96 -------------------------Il~PTi~~~~~~il~g~~ 113 (120) T protein:vir:10 96 -------------------------IIMPTVHHEGKLMAQRLR 113 (120) T ss_pred -------------------------cccchHHHHhHHHHHHHH Confidence 466777777777777777 No 116 >protein:vir:45 Length: 112 # NCBI annotation: gp10 # Family: family:all:899 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463471;swissprot:trembl:q9t1b3;genbank:gi:16798793;uniprot:Q9T1B3;genbank:GeneID:922369 Probab=97.57 E-value=4.3e-07 Score=55.48 Aligned_cols=90 Identities=14% Similarity=0.084 Sum_probs=58.0 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCccccccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWS 80 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~ 80 (116) ++++++++....+.+|..++...+|.|||.|++|-.+ + ++ +.|..+++||.++-||..-+. T Consensus 16 l~~a~~r~q~~~~~ev~~~~~~yVP~~~G~L~~S~~~-~-~~---g~I~y~tPYAr~qYY~~~~~~-------------- 76 (112) T protein:vir:45 16 VKKAKERGQFALINQAAADIALYVPFLSGDLSNQYVI-M-ND---KEIMWTSIYARRLYKGINFNF-------------- 76 (112) T ss_pred HHHHHHHHHHHHHHHHHHHhhcCCccccCccccceee-c-cC---CeEEecChhhHHhhhccccCC-------------- Confidence 4555555566678888888889999999999999432 2 22 357888999999999863321 Q ss_pred ccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 81 YKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 81 ~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) ....+|+.-++ ++..|.......+.+... T Consensus 77 ------~~~~~p~ag~~-W~erak~~~~~~~~~~~~ 105 (112) T protein:vir:45 77 ------TLTHHPLAGPE-WDQRAKIDKMDVWEKVAQ 105 (112) T ss_pred ------CCCCCCCCchh-hHHHHHHhhHHHHHHHHH Confidence 11224554444 555566655554444333 No 117 >protein:vir:107851 Length: 175 # NCBI annotation: gp31 # Family: family:all:274 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024704;genbank:gi:48696941;genbank:GeneID:2845939 Probab=97.57 E-value=4.3e-07 Score=55.49 Aligned_cols=92 Identities=17% Similarity=0.214 Sum_probs=53.1 Q ss_pred ChHHH----------HHHHHHHHHHHHHHHHHh-----CC---------------------------------------- Q lcl|NC_021326. 1 MERWV----------KRGIAKTTAKIHNTIISL-----MP---------------------------------------- 25 (116) Q Consensus 1 i~~~~----------~~~~~~~a~~v~~~ak~~-----aP---------------------------------------- 25 (116) |.+++ +..+...++.++.....+ .| T Consensus 14 l~~~L~~l~~~~~d~~~l~~~Ig~~l~~~t~~rF~~e~~Pdw~p~~p~t~~~r~~~g~~~~k~~~~~~~~~~~~~~~~~L 93 (175) T protein:vir:10 14 LRTRLLQLEQAGHQKAGAMRKIAQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKNGELTAAASRRKAGLMIL 93 (175) T ss_pred HHHHHHHHHHHhccHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchhhhhhhhcccccchhhhhhhhhhhhhccCCCcc Confidence 11111 223334444443333211 11 Q ss_pred cccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCcccccccccccccceeccCCCCCCcchhHH-- Q lcl|NC_021326. 26 VDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPA-- 103 (116) Q Consensus 26 vdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA-- 103 (116) .+||.|++||.+++..+ .+.||+|..||.+++||+... + .+...+||+|||-=. T Consensus 94 ~~tG~L~~Si~~~~~~~--~v~vGtn~~YAaiHqfGg~~~--~--------------------~~~v~iPaRpfLG~s~~ 149 (175) T protein:vir:10 94 QDSGQMAASVSTDHDDN--SAVIGSNKEYAAIHQFGGQAG--R--------------------GLKVTIPARPWLPVTAD 149 (175) T ss_pred eechhhhhhhheeecCC--EEEEecChhhhhhhhcccccC--C--------------------CCccccCCccccCCCcc Confidence 26899999999877555 678999999999999997421 0 012469999999743 Q ss_pred -------HHHHHHHHHHhcC Q lcl|NC_021326. 104 -------IDAGRAFFNKYFS 116 (116) Q Consensus 104 -------~~~~k~~i~~~i~ 116 (116) .+.-...+.+.|+ T Consensus 150 d~~~~e~~~~Il~~~~~~l~ 169 (175) T protein:vir:10 150 GELQPEAVEPVLNTILRHLM 169 (175) T ss_pred cccchHHHHHHHHHHHHHHH Confidence 2233333444443 No 118 >protein:vir:4162 Length: 133 # NCBI annotation: unknown # Family: family:all:11764 # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046971;genbank:gi:9630541;genbank:GeneID:1261715 Probab=97.51 E-value=3.4e-07 Score=56.04 Aligned_cols=108 Identities=20% Similarity=0.243 Sum_probs=74.0 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCccccccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWS 80 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~ 80 (116) ++..+++.+.....+++.-+...||+.||+||.|-.++++ |.+|+..+.+.|-+||-+|-|-. -|..++++-++-. T Consensus 19 v~~~ie~t~~~~~~~l~~i~~ntapiktg~lr~sh~~sie--gstgelsn~~~yl~~vl~grgwv--fpv~~kal~wpel 94 (133) T protein:vir:41 19 VEDRVEQTVTLLMIELEEILMNTAPIKTGELRISHTWSVE--GSTGELTNTVPYLQWVLFGRGWV--FPVEKKALYWPEL 94 (133) T ss_pred hhhHHHHHHHHHHHHHHHHhhhccccccccceeeeeEEee--cCccchhhhhHHhhHhhhcccce--eeecccccccCCC Confidence 8889999999999999999999999999999999877776 44889999999999999997632 2333333322222 Q ss_pred ccccccceeccCCCCCCcchhHHHHHHHH-------HHHHhcC Q lcl|NC_021326. 81 YKDANGKWHTTKGQHAQPFWEPAIDAGRA-------FFNKYFS 116 (116) Q Consensus 81 ~~~~~~~~~~~~g~~a~PFl~pA~~~~k~-------~i~~~i~ 116 (116) .+.. .+ .+-.||+.||.-++-..-+ .|+=-|| T Consensus 95 phpv---ay-arpappndyfsa~vay~~~~give~s~iewlis 133 (133) T protein:vir:41 95 PHPV---AY-ARPAPPNDYFSAAVAYIDAKGIVEDSFIEWLIS 133 (133) T ss_pred CCcc---cc-cCCCCCchhhhhhhhhhcccchhHHHHHHHhcC Confidence 2211 11 1235677788776544322 2333334 No 119 >protein:vir:4200 Length: 133 # NCBI annotation: unknown # Family: family:all:11764 # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071825;genbank:gi:11863108;genbank:GeneID:1257610 Probab=97.50 E-value=3.4e-07 Score=56.05 Aligned_cols=108 Identities=16% Similarity=0.179 Sum_probs=73.0 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCccccccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWS 80 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~ 80 (116) ++..+++.+.....+++.-+...||+.||+||.|-.++++ |.+|+..+.+.|-+||-+|-|-. -|..++++-++-. T Consensus 19 v~~~ie~t~~~~~~~l~~i~~ntapiktg~lr~sh~~sie--gstgelsn~~~yl~~vl~grgwv--fpv~~kal~wpel 94 (133) T protein:vir:42 19 VQGKIEETLEKILNQLQGIAENTAPVKTGNLRDSHIISIE--GSTGELSNLAYYLPFVLHGRGWV--FPVRRKALWWPEL 94 (133) T ss_pred hhhHHHHHHHHHHHHHHHHhhhccccccccceeeeeEEee--cCccchhhhhHHhhHhhhcccce--eeccccccccCCC Confidence 8889999999999999999999999999999999877776 44889999999999999997632 2333333322222 Q ss_pred ccccccceeccCCCCCCcchhHHHHHHHH--HHHHhcC Q lcl|NC_021326. 81 YKDANGKWHTTKGQHAQPFWEPAIDAGRA--FFNKYFS 116 (116) Q Consensus 81 ~~~~~~~~~~~~g~~a~PFl~pA~~~~k~--~i~~~i~ 116 (116) .+.. .+ .+-.||+.||.-++-..-+ .++..+- T Consensus 95 phpv---ay-arpappndyfsa~vay~~~~give~s~i 128 (133) T protein:vir:42 95 PHPV---AY-ARPAPPNDYFSAVVAYSAPEGVVEETLI 128 (133) T ss_pred CCcc---cc-cCCCCCchhhhhhhhhhcccchhHHHHH Confidence 2211 11 1235677788776543322 2222222 No 120 >protein:vir:7993 Length: 108 # NCBI annotation: gp9 # Family: family:all:3937 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817347;genbank:gi:29565775;genbank:GeneID:1259013 Probab=97.48 E-value=5.8e-08 Score=60.24 Aligned_cols=84 Identities=19% Similarity=0.156 Sum_probs=53.7 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEe-ecCcEEEEEecCCccccccccCCcccccCCC-CcCccccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDF-KDSGFTGVINIGSEYAIYVNYGTGIYATGAG-GSRAKKIP 78 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~-~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~-~~~~~~~~ 78 (116) ---.+.+.+....+++...+|++.||++|..|+|+.+.- ..+.-.+.|+.+.+||++|||||--...... .+.+..+. T Consensus 23 K~~EVn~GvNeFMdE~~~~~K~~SPV~~G~Y~~S~~V~ers~NkGRG~~G~~~~~AH~VEFGs~hndeyapaqktakqfg 102 (108) T protein:vir:79 23 KLPEVNQGVNEFMDEVVDAWKNNSPVGTGAYRDSVQVTERSTNKGRGKVGATDPQAHLVEFGSAHNDEYAPAQKTAKQFG 102 (108) T ss_pred hchhhhhhHHHHHHHHHHHHhhcCCCCchhhHHHHHHHHhhhccCccccCCcchhhhhhhhhccccccccchhhHHHhhc Confidence 112455566666678889999999999999999996532 2222368999999999999999965443322 22222221 Q ss_pred cccccc Q lcl|NC_021326. 79 WSYKDA 84 (116) Q Consensus 79 ~~~~~~ 84 (116) ...... T Consensus 103 gtay~d 108 (108) T protein:vir:79 103 GTAYGD 108 (108) T ss_pred ccccCC Confidence 111111 No 121 >protein:vir:101302 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908835;genbank:gi:118725099;genbank:GeneID:4555873 Probab=97.37 E-value=1.5e-06 Score=52.45 Aligned_cols=95 Identities=17% Similarity=0.170 Sum_probs=64.6 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCc--ccccccccceeE-e--ecCcEEEEEecCCc-----cccccccCCcccccCCC Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPV--DTGYLRESVTMD-F--KDSGFTGVINIGSE-----YAIYVNYGTGIYATGAG 70 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPv--dTG~Lr~SI~~~-~--~~~~~~~~V~~~~~-----YA~~ve~GT~~~~~~~~ 70 (116) |++..++||..+++.++.++|.+.++ |||.+.+++..+ + .++.-+..|+...+ +-++.|||+... T Consensus 24 ~~ri~nkAL~~age~v~~~~K~~~~~fkDTG~t~~ev~~s~p~~~~G~r~V~vgW~G~~~R~~iiHLNE~Gytr~----- 98 (134) T protein:vir:10 24 MVKVQDKALIAGAKVIVEEVKKQLKPSKDTGALINEVSFSKPEWINGKRTITVHWRGSKDRYKIVHLIEYGHVQK----- 98 (134) T ss_pred hhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeccEEecCeeecCCceEEEEEEEcCCceeEEEEeecccceec----- Confidence 99999999999999999999999997 999999998653 2 23334466666332 567888995221 Q ss_pred CcCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 71 GSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) ..|+++..+||- -+..|++..++.+.+.+- T Consensus 99 -------------~~Gk~i~PrG~G---~i~~a~~~~e~~~~~~ik 128 (134) T protein:vir:10 99 -------------GTGKFIKPKAMG---GVNRAIRQGQNKYFETLK 128 (134) T ss_pred -------------ccCCccCcchhh---HHHHHHHhhhHHHHHHHH Confidence 123333334444 355577777765554444 No 122 >protein:vir:9513 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835560;genbank:gi:30043947;genbank:GeneID:1260542 Probab=97.37 E-value=1.5e-06 Score=52.45 Aligned_cols=95 Identities=17% Similarity=0.170 Sum_probs=64.6 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCc--ccccccccceeE-e--ecCcEEEEEecCCc-----cccccccCCcccccCCC Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPV--DTGYLRESVTMD-F--KDSGFTGVINIGSE-----YAIYVNYGTGIYATGAG 70 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPv--dTG~Lr~SI~~~-~--~~~~~~~~V~~~~~-----YA~~ve~GT~~~~~~~~ 70 (116) |++..++||..+++.++.++|.+.++ |||.+.+++..+ + .++.-+..|+...+ +-++.|||+... T Consensus 24 ~~ri~nkAL~~age~v~~~~K~~~~~fkDTG~t~~ev~~s~p~~~~G~r~V~vgW~G~~~R~~iiHLNE~Gytr~----- 98 (134) T protein:vir:95 24 MVKVQDKALIAGAKVIVEEVKKQLKPSKDTGALINEVSFSKPEWINGKRTITVHWRGSKDRYKIVHLIEYGHVQK----- 98 (134) T ss_pred hhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeccEEecCeeecCCceEEEEEEEcCCceeEEEEeecccceec----- Confidence 99999999999999999999999997 999999998653 2 23334466666332 567888995221 Q ss_pred CcCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 71 GSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) ..|+++..+||- -+..|++..++.+.+.+- T Consensus 99 -------------~~Gk~i~PrG~G---~i~~a~~~~e~~~~~~ik 128 (134) T protein:vir:95 99 -------------GTGKFIKPKAMG---GVNRAIRQGQNKYFETLK 128 (134) T ss_pred -------------ccCCccCcchhh---HHHHHHHhhhHHHHHHHH Confidence 123333334444 355577777765554444 No 123 >protein:vir:4790 Length: 114 # NCBI annotation: putative minor capsid protein 3 # Family: family:all:899 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150170;swissprot:trembl:q94m41;genbank:gi:15088781;uniprot:Q94M41;genbank:GeneID:955992 Probab=97.17 E-value=2.8e-06 Score=51.03 Aligned_cols=93 Identities=22% Similarity=0.185 Sum_probs=60.0 Q ss_pred Ch-----HHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCcc Q lcl|NC_021326. 1 ME-----RWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAK 75 (116) Q Consensus 1 i~-----~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~ 75 (116) ++ +.++++-...+.+|..++...+|.|||.|++|..+.. + .+.|..+.+||.++-||-..-... T Consensus 12 ~~~~l~~~~l~r~Q~~~~~ev~~~~~~YVP~~~G~L~~S~~~~~--~--~~~I~y~tPYAr~qyYg~~~~~~~------- 80 (114) T protein:vir:47 12 AKQKLSNESMTRGKIAVASKILLDNEQYIPLRGGELRASGRIVG--Q--GDAVVYGTVYARAQFYGSNGIVTF------- 80 (114) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhccCCcCccCccccceeeee--C--CcEEEecCchhhHhhhcccCCCCC------- Confidence 33 3445555566777888888899999999999976532 2 246889999999999984211000 Q ss_pred cccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 76 KIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 76 ~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) ..+.+|+.-++ ++..|.....+.+.+-.. T Consensus 81 -----------~~~~~p~~g~~-W~eraka~~~~~~~~~~~ 109 (114) T protein:vir:47 81 -----------RRYTTPGTGKR-WDQVATSKHAEEWARAFV 109 (114) T ss_pred -----------CccCCCCCcch-hHHHHHhhhhHHHHHHHH Confidence 11224554444 566677776665555444 No 124 >protein:vir:3848 Length: 159 # NCBI annotation: hypothetical protein # Family: family:all:1029 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050154;swissprot:trembl:q9t1f3;genbank:gi:9633046;uniprot:Q9T1F3;genbank:GeneID:1262148 Probab=96.97 E-value=9.5e-06 Score=48.11 Aligned_cols=87 Identities=13% Similarity=0.074 Sum_probs=59.9 Q ss_pred Ch--------------HHHHHHHHHHHHHHHHHHHHhCCcc-----------------------cccccccceeEe--ec Q lcl|NC_021326. 1 ME--------------RWVKRGIAKTTAKIHNTIISLMPVD-----------------------TGYLRESVTMDF--KD 41 (116) Q Consensus 1 i~--------------~~~~~~~~~~a~~v~~~ak~~aPvd-----------------------TG~Lr~SI~~~~--~~ 41 (116) |+ +.-..++.++|...++..+..+|.. .|+|++||.++- .- T Consensus 9 l~~~l~~v~k~~~~~~~~k~kiTkAGAkv~~e~L~~~Tp~~h~~~~k~~~~~~~~~k~~~~~~~~~HlaD~I~~~~~~~i 88 (159) T protein:vir:38 9 YNNWVNEVEKGMKLSVEDKAKITGEGAEAFSTVLHDHTPRSNEIYRRGRSAGHANAKHHNRNRKTKHLQDSITYKPGYTA 88 (159) T ss_pred HHHHHHHHHHhcCCCHHHHHHHHHHhHHHHHHHHHHhcccCCCccccccccccccccccCcCcCCCccccceeeecCccc Confidence 11 2223356778888888889999972 369999997742 22 Q ss_pred CcE---EEEEecC----CccccccccCCcccccCCCCcCcccccccccccccceeccCCCCCC-----cchhHHHHHHHH Q lcl|NC_021326. 42 SGF---TGVINIG----SEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQ-----PFWEPAIDAGRA 109 (116) Q Consensus 42 ~~~---~~~V~~~----~~YA~~ve~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~-----PFl~pA~~~~k~ 109 (116) +|. +..||.+ +-+|.|++.||. +|||+ +|+..+.++.++ T Consensus 89 Dg~~dG~s~VGw~~~~~a~~a~f~NdGT~-----------------------------~m~~k~~~gdHFvekt~~~~k~ 139 (159) T protein:vir:38 89 DKLHTGDTDVGFEGKYYDFLAKIVNNGQH-----------------------------HMSPKRYKNMHFLDKAQQEAKK 139 (159) T ss_pred cccccceeeecccCCccceEeeecccCcc-----------------------------ccCCCCccCChhHHHHHHHHHH Confidence 221 4567763 346899999993 35554 899999999998 Q ss_pred HHHHhcC Q lcl|NC_021326. 110 FFNKYFS 116 (116) Q Consensus 110 ~i~~~i~ 116 (116) .+.+.++ T Consensus 140 ~Vl~A~~ 146 (159) T protein:vir:38 140 SVAEAEL 146 (159) T ss_pred HHHHHHH Confidence 8776666 No 125 >protein:vir:98892 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:899 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164422;genbank:gi:56694912;genbank:GeneID:3197282 Probab=96.75 E-value=8.2e-06 Score=48.46 Aligned_cols=88 Identities=16% Similarity=0.163 Sum_probs=58.7 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCccccccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWS 80 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~ 80 (116) .++.++++-...+.+|..++...+|.|||.|++|-.+.. + .+.|..+++||.++-||..-+. T Consensus 16 ~~~~~~~aq~~~~~ev~~~~~~yVP~~~G~L~~s~~~~s--~--~g~I~y~tPYAr~qYYg~~~n~-------------- 77 (108) T protein:vir:98 16 SPQTQRRGQYAMANQMLQDMNQFVPMEEGILRLTGNISS--D--AEEIYYNTPYAKRRFYEPAYNY-------------- 77 (108) T ss_pred HHHHHHHHHHHHHHHHHHhhcccCcCcCCccccceeecc--C--CceEEecChhhHHhhhccccCC-------------- Confidence 344555666777888888889999999999999954432 2 2578899999999999853211 Q ss_pred ccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 81 YKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 81 ~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) .+|+.-++ ++..|.......+.+... T Consensus 78 ---------~~p~ag~~-W~eraka~~~~~~~~~~~ 103 (108) T protein:vir:98 78 ---------TTPGTGPR-WDMKAKRLFISDWERAYM 103 (108) T ss_pred ---------CCCCCcch-hHHHHHhhhhHHHHHHHH Confidence 12333333 455566666555555444 No 126 >protein:vir:94069 Length: 168 # NCBI annotation: putative RNA polymerase # Family: family:all:503 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453622;genbank:gi:84662658;genbank:GeneID:5142579 Probab=96.63 E-value=1.3e-06 Score=52.79 Aligned_cols=83 Identities=17% Similarity=0.222 Sum_probs=39.5 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCC--cccccccccceeE----eecCcEE---EEEe-cCCccccccccCCcccccCCC Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMP--VDTGYLRESVTMD----FKDSGFT---GVIN-IGSEYAIYVNYGTGIYATGAG 70 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aP--vdTG~Lr~SI~~~----~~~~~~~---~~V~-~~~~YA~~ve~GT~~~~~~~~ 70 (116) |+-.-++.++... ..++.+.. +.-|-|...=... ....+.. ..-+ +.+.+|.+.|||+ T Consensus 1 ~~~~~~~g~~~~~----~~~~~l~~~~v~vG~l~~a~yp~G~~~~~~~~~~~~~~~~g~~va~Ia~~~E~G~-------- 68 (168) T protein:vir:94 1 MTTIARKGVKMPP----HLEAQFQSGEVKAGVLSGSTYPQMTYTDQRTGKQIEDARGGMPVAVIAQALEYGH-------- 68 (168) T ss_pred CccccchhhhhhH----HHHHhhhccceeeeccccCcccccccchhhcccccccccccccHHHHHHHHhcCC-------- Confidence 3333333322222 22222211 1223222110000 0000000 0000 2345677888886 Q ss_pred CcCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 71 GSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) .++|+||||++++++++..+.+.|. T Consensus 69 ---------------------~~IP~RPFlr~t~~~~~~~~~~~~~ 93 (168) T protein:vir:94 69 ---------------------GQNHPRPFMQQTYAAQYRAWSRDLT 93 (168) T ss_pred ---------------------CCCCCchhhHHHHHHHHHHHHHHHH Confidence 3489999999999999998888887 No 127 >protein:vir:98557 Length: 149 # NCBI annotation: gp14 # Family: family:all:370 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958069;genbank:gi:41057366;genbank:GeneID:2744228 Probab=96.62 E-value=1.6e-05 Score=46.86 Aligned_cols=97 Identities=10% Similarity=0.001 Sum_probs=56.0 Q ss_pred ChHHHHH------------HHHHHHHHHHHHHHHh-----CC----c--------------------ccccccccceeEe Q lcl|NC_021326. 1 MERWVKR------------GIAKTTAKIHNTIISL-----MP----V--------------------DTGYLRESVTMDF 39 (116) Q Consensus 1 i~~~~~~------------~~~~~a~~v~~~ak~~-----aP----v--------------------dTG~Lr~SI~~~~ 39 (116) ++.++.. .+.+.++.+....+.+ .| + ++|.|.+||.+.. T Consensus 7 l~~~L~~ll~~L~~~~~~~ll~~Ig~~l~~~t~~rf~~q~~PdG~~W~p~~~~~~~~k~~~~~~~l~~~g~l~~sl~~~~ 86 (149) T protein:vir:98 7 LQERLTGLIASLSPAARRQMAADIAKKLRASQQQRIRRQQAPDGTPYAARKRQSVRSKKGRIRREMFARLRTNRFMKAKG 86 (149) T ss_pred HHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHHhccCCCCcccchhhhhhhhhhhee Confidence 2222222 2555555555555432 33 2 3478899998887 Q ss_pred ecCcEE-EEEecCCccccccccCCcccccCCCCcCcccccccccccccceeccCCCCCCcchh---HHHHHHHHHHHHhc Q lcl|NC_021326. 40 KDSGFT-GVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWE---PAIDAGRAFFNKYF 115 (116) Q Consensus 40 ~~~~~~-~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~---pA~~~~k~~i~~~i 115 (116) ..++.. +.+|++..||..++||......... ....+|++|||- -.-++-...+.+.| T Consensus 87 ~~~~~~V~~~Gs~~~yAa~HQfG~~~r~~~~~-------------------~~~~iPaRp~LG~s~~d~~~i~~~i~~~l 147 (149) T protein:vir:98 87 SDSAAVVEFTGRVQRMARVHQYGLKDRPNRHS-------------------RDVQYAARPLLGFTRDDEQMIEDIIIRHL 147 (149) T ss_pred cCCeeEEEecCcchHHhhHhhccccccccCCC-------------------cceeccccccCCCCHHHHHHHHHHHHHHh Confidence 766543 2348999999999999753221110 013589999994 33333344555555 Q ss_pred C Q lcl|NC_021326. 116 S 116 (116) Q Consensus 116 ~ 116 (116) + T Consensus 148 ~ 148 (149) T protein:vir:98 148 G 148 (149) T ss_pred h Confidence 5 No 128 >protein:vir:9647 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795409;genbank:gi:28876182;genbank:GeneID:1257731 Probab=96.47 E-value=2.6e-05 Score=45.76 Aligned_cols=90 Identities=16% Similarity=0.236 Sum_probs=61.1 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCc--ccccccccceeE-ee-cCc-EEEEEecCCc-cc--cccccCCcccccCCCCc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPV--DTGYLRESVTMD-FK-DSG-FTGVINIGSE-YA--IYVNYGTGIYATGAGGS 72 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPv--dTG~Lr~SI~~~-~~-~~~-~~~~V~~~~~-YA--~~ve~GT~~~~~~~~~~ 72 (116) |++..++||.++++.++...|.+.|+ |||.+-+++... +. .+| -+..|+...+ |. +.-|+|+ T Consensus 26 v~ri~nkAL~~~ge~v~~~lK~~~~~f~DTG~t~dev~~s~~~~~~G~r~V~VgW~GpR~~ivHLNE~Gy---------- 95 (132) T protein:vir:96 26 VNRVVNRSLKEIGKELEPSFKSAISIYKRTGETTESAVVSGVRREDGIPKVKLGFTTPRWNIVHLQELEY---------- 95 (132) T ss_pred HHHHhHHHHHHHHHHHHHHHHHhhhhhhhcchhhcceeecCeeecCCceEEEecccCCceeEEeeecccc---------- Confidence 99999999999999999999999997 999999998653 22 233 3456666433 21 2223554 Q ss_pred CcccccccccccccceeccCCCCCCcchhHHHHHHHHHHH--------HhcC Q lcl|NC_021326. 73 RAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFN--------KYFS 116 (116) Q Consensus 73 ~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~--------~~i~ 116 (116) |+++..+||- ++..|++..++.+. +.|- T Consensus 96 -------------Gk~~~PrG~G---~I~~a~~~se~~~~~~~~~elkk~l~ 131 (132) T protein:vir:96 96 -------------GWKHNRRGVG---VIRRYSDILETIYPRGIRDKLKRGFD 131 (132) T ss_pred -------------cCCcCCCcch---HHHHHHHhhhhHHHHHHHHHHHHHhc Confidence 4445556666 77788877774333 3333 No 129 >protein:vir:79687 Length: 113 # NCBI annotation: hypothetical protein # Family: family:all:899 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285886;genbank:gi:148750843;genbank:GeneID:5220386 Probab=96.38 E-value=2.1e-05 Score=46.24 Aligned_cols=91 Identities=19% Similarity=0.165 Sum_probs=58.5 Q ss_pred ChH-----HHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCcc Q lcl|NC_021326. 1 MER-----WVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAK 75 (116) Q Consensus 1 i~~-----~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~ 75 (116) +++ .++++-...+.+|..++...+|.|||.|++|..+ .++ .|..+++||.++-||.-.. T Consensus 7 ~~~~~~~~~~~raQ~~l~~ev~~~~~pYVP~~~G~Lk~S~~i--~s~----~I~y~tPYAr~qyYg~~~~---------- 70 (113) T protein:vir:79 7 FSRMAQSTGSRSVRLQVLNQMHQDMEQYVPKRAGFLRSQSFV--NDT----GIHYTAKYARAQFYGFVNG---------- 70 (113) T ss_pred HHHhhchhHHHHHHHHHHHHHHHhhcccCcccccchhccccc--cCC----eeEecChhhhHhhccccCC---------- Confidence 222 4444666678889999999999999999999643 333 3778899999999874211 Q ss_pred cccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 76 KIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 76 ~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) ...+.+.+|+.-++ ++..|....++.+.+... T Consensus 71 --------~~~~~~t~p~ag~~-W~eraKa~h~~~w~~~~~ 102 (113) T protein:vir:79 71 --------HRVRNYSTPGTGRR-WDLKAKAVYKADWQKVAV 102 (113) T ss_pred --------CCccccCCCCCCch-hhHHHHHHhHHHHHHHHH Confidence 00111235665555 555677766665554432 No 130 >protein:vir:1581 Length: 116 # NCBI annotation: minor capsid protein # Family: family:all:899 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695163;swissprot:trembl:o03933;genbank:gi:23455806;uniprot:O03933;genbank:GeneID:955512 Probab=96.30 E-value=4e-05 Score=44.68 Aligned_cols=96 Identities=18% Similarity=0.046 Sum_probs=59.9 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCccccccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWS 80 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~ 80 (116) -.++++++-...+.++..++...+|.|||+|..|-...+..+ .+.|..+++||.++-||.- +. T Consensus 17 ~~~~~~r~Q~~l~~qv~~~m~~YVP~~tg~~~ls~~~~~~~~--~~~I~y~tPYAr~qyYg~~-~~-------------- 79 (116) T protein:vir:15 17 SLDNVKRGQYALVNQAMYDMEQFVPKDRPEEPLRQSVHATSD--GSEITYSTPYAKAQFYGII-ND-------------- 79 (116) T ss_pred hHHHHHHHHHHHHHHHHHhhhccCCcccCCcccccceeeecC--CceEEecCchhHHHhcccc-cC-------------- Confidence 135566666777888888889999999988776655444444 3678899999999988741 00 Q ss_pred ccccccceeccCCCCCCcchhHHHHHHHHH----HHHhcC Q lcl|NC_021326. 81 YKDANGKWHTTKGQHAQPFWEPAIDAGRAF----FNKYFS 116 (116) Q Consensus 81 ~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~----i~~~i~ 116 (116) ......+.||+.-++ ++..|-...... +.+.|- T Consensus 80 --~~~~~~~t~p~ag~~-W~eraK~~h~~~w~~~~~k~~~ 116 (116) T protein:vir:15 80 --KYPVHNYTTPGTTKR-WDLKAKSMFMSSWIDTFTKGMK 116 (116) T ss_pred --CCCcccccCCCCCcc-hhHHHHhhhHHHHHHHHHHhcC Confidence 001122345666666 444565555433 333333 No 131 >protein:vir:77650 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:YP_022741;genbank:gi:47835022;genbank:GeneID:2821447 Probab=95.83 E-value=1.7e-05 Score=46.76 Aligned_cols=78 Identities=27% Similarity=0.335 Sum_probs=37.8 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeE-----------eecCcEEEEE-ecCCccccccccCCcccccC Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMD-----------FKDSGFTGVI-NIGSEYAIYVNYGTGIYATG 68 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~-----------~~~~~~~~~V-~~~~~YA~~ve~GT~~~~~~ 68 (116) |+. .++.|++...++.. --|.-|-+..+=.-+ .+.+ ... .+.+.+|.|.|||| T Consensus 1 m~~-~r~~l~~~~~~l~~-----~~v~VGi~~~a~y~d~~~~~~~~~~~~~~~---~~~G~pva~ia~~~e~G~------ 65 (155) T protein:vir:77 1 MSV-TRRGLTLPKDRYRS-----MSVKAGVLAGATYPDESGKKLADGSILKKD---PRAGLPVAMIAMALNYGT------ 65 (155) T ss_pred Ccc-hHHHHHHHHHHHhc-----CceEEeecCCCCCccccchhhhhhhhcccc---ccccccHhhhhhhhhcCC------ Confidence 111 11122222222211 012222222211000 0000 011 12334777888886 Q ss_pred CCCcCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 69 AGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 69 ~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) .++||||||+|++++++.++.+.|. T Consensus 66 -----------------------~~IP~RPFlr~t~~~~~~~~~~~l~ 90 (155) T protein:vir:77 66 -----------------------SKLPARPFMEKTIADRSAEWIKGLT 90 (155) T ss_pred -----------------------CCCCCCchhhHHHHHHHHHHHHHHH Confidence 3589999999999999999888888 No 132 >protein:vir:78607 Length: 155 # NCBI annotation: BcepNY3gp06 # Family: family:all:503 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294843;genbank:gi:149882906;genbank:GeneID:5291078 Probab=95.60 E-value=1.8e-05 Score=46.63 Aligned_cols=81 Identities=25% Similarity=0.310 Sum_probs=37.6 Q ss_pred hHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeE-----eecCcE---EEEEe-cCCccccccccCCcccccCCCCc Q lcl|NC_021326. 2 ERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMD-----FKDSGF---TGVIN-IGSEYAIYVNYGTGIYATGAGGS 72 (116) Q Consensus 2 ~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~-----~~~~~~---~~~V~-~~~~YA~~ve~GT~~~~~~~~~~ 72 (116) =+..++.|.+...++.. --|.-|-+..+=.-+ ...+.+ .+.-+ +.+.+|.+.|||| T Consensus 1 m~v~~k~L~~~~~~l~~-----~~v~VGi~~~a~y~d~~~~~~~~~~~~~~~~~~g~~va~ia~~~E~G~---------- 65 (155) T protein:vir:78 1 MSVTRRGLTLPKDRYRS-----MSVKAGVLAGATYPDESGKKLADGTILTKDPRAGLPVAMIAMALNYGT---------- 65 (155) T ss_pred CcchHHHHHHHHHHHhC-----CeeEEeecCCCCCCcccchhhhhhhhcccccccCCcHHHHHHhhhcCC---------- Confidence 11222223332222211 001222222210000 000000 00011 1234667788876 Q ss_pred CcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 73 RAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 73 ~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) .++||||||+|++++++.++.+.|. T Consensus 66 -------------------~~IP~RPFlr~t~~~~~~~~~~~l~ 90 (155) T protein:vir:78 66 -------------------SKLPARPFMEKTITDRSAEWIKGLT 90 (155) T ss_pred -------------------CCCCCcchhhHHHHHHHHHHHHHHH Confidence 3589999999999999999888888 No 133 >protein:vir:98636 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039927;genbank:gi:126011102;genbank:GeneID:4818472 Probab=95.56 E-value=0.00015 Score=41.60 Aligned_cols=90 Identities=14% Similarity=0.159 Sum_probs=61.6 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCc--ccccccccceeE---eecCcEEEEEecCCc-cc--cccccCCcccccCCCCc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPV--DTGYLRESVTMD---FKDSGFTGVINIGSE-YA--IYVNYGTGIYATGAGGS 72 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPv--dTG~Lr~SI~~~---~~~~~~~~~V~~~~~-YA--~~ve~GT~~~~~~~~~~ 72 (116) |++.+++||..+++.++...|.+.++ |||..-+++... ..++--+..|+...+ |. +.-|+|+ T Consensus 32 ~~ri~nkAL~~~ge~v~~~lK~~~~~fkDTGat~dev~~s~p~~~~G~r~V~igW~GpR~~ivHLNE~Gy---------- 101 (138) T protein:vir:98 32 VNRVVNRSLKEIGKELEPSFKSAISIYKRTGETTESAVVSGVRREDGIPKVKLGFTTPRWNIVHLQELEY---------- 101 (138) T ss_pred hhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeeeeeecCeeecCCceEEEEeeecCeeeEEeeecccc---------- Confidence 99999999999999999999999985 999998887543 222334556665433 21 2234564 Q ss_pred CcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 73 RAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 73 ~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) |+++..+||- ++..|++..++.+.+.+- T Consensus 102 -------------Gk~i~PrG~G---~I~ka~~~se~~y~~~vk 129 (138) T protein:vir:98 102 -------------GWKHNRRGVG---VIRRYSDILETIYPRGIR 129 (138) T ss_pred -------------cCCcCCCcch---HHHHHHHhhhHHHHHHHH Confidence 3344455665 788888887776555543 No 134 >protein:vir:106728 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944315;genbank:gi:38638614;genbank:GeneID:2657357 Probab=95.51 E-value=1.9e-05 Score=46.49 Aligned_cols=81 Identities=25% Similarity=0.309 Sum_probs=37.7 Q ss_pred hHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeE-----eecCcE---EEEEe-cCCccccccccCCcccccCCCCc Q lcl|NC_021326. 2 ERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMD-----FKDSGF---TGVIN-IGSEYAIYVNYGTGIYATGAGGS 72 (116) Q Consensus 2 ~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~-----~~~~~~---~~~V~-~~~~YA~~ve~GT~~~~~~~~~~ 72 (116) =+..++.|.+...++.. --|.-|-+..+=.-+ ...+.+ .+.-+ +.+.+|.+.|||| T Consensus 1 m~v~~k~L~~~~~~l~~-----~~v~VGi~~~a~y~d~~~~~~~~~~~~~~~~~~g~~va~ia~~~E~G~---------- 65 (155) T protein:vir:10 1 MSVTRRGLTLPKDRYRS-----MSVKAGVLAGATYPDESGKKLADGTILTKDPRAGLPVAMIAMALNYGT---------- 65 (155) T ss_pred CcchHHHHHHHHHHHhC-----CeeEEeecCCCCCccccchhhhhhhhcccccccCCcHHHHHHHHhcCC---------- Confidence 11222223333222211 001222222210000 000000 00011 1234677788886 Q ss_pred CcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 73 RAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 73 ~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) .++||||||+|++++++.++.+.|. T Consensus 66 -------------------~~IP~RPFlr~t~~~~~~~~~~~l~ 90 (155) T protein:vir:10 66 -------------------SKLPARPFMEKTIADRSAEWIKGLT 90 (155) T ss_pred -------------------CCCCCcchhHHHHHHHHHHHHHHHH Confidence 3589999999999999999888888 No 135 >protein:vir:93898 Length: 133 # NCBI annotation: ORF028 # Family: family:all:589 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239942;genbank:gi:66395616;genbank:GeneID:5130964 Probab=95.38 E-value=0.00017 Score=41.20 Aligned_cols=93 Identities=17% Similarity=0.113 Sum_probs=63.7 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCc--ccccccccceeE-e--ecCc--EEEEEecCC---ccc--cccccCCcccccC Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPV--DTGYLRESVTMD-F--KDSG--FTGVINIGS---EYA--IYVNYGTGIYATG 68 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPv--dTG~Lr~SI~~~-~--~~~~--~~~~V~~~~---~YA--~~ve~GT~~~~~~ 68 (116) |++.+++||..+++.++...|.+..+ |||.+-+++..+ . ..+. -+..|+... .|. +.-|+|. T Consensus 24 ~~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~i~W~gp~~R~~iVHLNE~Gy------ 97 (133) T protein:vir:93 24 MQAKSDRALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEWVGPMNRKNIIHLNEHGY------ 97 (133) T ss_pred hHhhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceEEEEEeecCCCceeEEEeeccce------ Confidence 99999999999999999999988884 999999998664 1 2222 445666643 342 3445663 Q ss_pred CCCcCcccccccccccccceeccCCCCCCcchhHHHHHHHHH----HHHhcC Q lcl|NC_021326. 69 AGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAF----FNKYFS 116 (116) Q Consensus 69 ~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~----i~~~i~ 116 (116) ..+|+++..+||- =+..|++..++. ++++|. T Consensus 98 --------------tr~Gk~i~PrG~G---~i~~a~~~se~~y~~~vk~eL~ 132 (133) T protein:vir:93 98 --------------TRDGKKYTPRGFG---VIAKTLAANERKYREIIKKELA 132 (133) T ss_pred --------------ecCCCeEccchhh---HHHHHHHhhhHHHHHHHHHHhc Confidence 3356666667776 466677766654 444444 No 136 >protein:vir:101563 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958111;genbank:gi:41057657;genbank:GeneID:2716820 Probab=95.31 E-value=3.3e-05 Score=45.17 Aligned_cols=77 Identities=27% Similarity=0.361 Sum_probs=37.8 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecCcE----------EEEEe-cCCccccccccCCcccccCC Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGF----------TGVIN-IGSEYAIYVNYGTGIYATGA 69 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~----------~~~V~-~~~~YA~~ve~GT~~~~~~~ 69 (116) |++ +.|.+....+.. . -|.-|-+..+=.-+ .++. ....+ +.+.+|.|.|||| T Consensus 3 v~r---~~L~~~~~~l~~----~-~V~VGi~~~a~y~d--~~g~~~~~g~~~~~~~~~G~pva~ia~~~e~G~------- 65 (155) T protein:vir:10 3 VTR---RGLTLPKDRYKS----M-SVKAGVLAGATYPD--ESGKKLADGTILKKDPRAGLPVAMIAMALNYGT------- 65 (155) T ss_pred chH---HHHHHHHHHhhC----C-eeEEeecCCCCCCc--cccchhhhhhhhccccccCcchhhhhhhhhcCC------- Confidence 322 122222222211 0 02222222210000 0000 00111 1234677888886 Q ss_pred CCcCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 70 GGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) .++||||||+|++++++.++.+.|. T Consensus 66 ----------------------~~IP~RPFlr~t~~~~~~~~~~~l~ 90 (155) T protein:vir:10 66 ----------------------SKLPARPFMEKTIADRSAEWIKGLT 90 (155) T ss_pred ----------------------CCCCCcchhHHHHHHHHHHHHHHHH Confidence 3489999999999999999888888 No 137 >protein:vir:107757 Length: 189 # NCBI annotation: gp20 # Family: family:all:503 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024868;genbank:gi:48697510;genbank:GeneID:2948378 Probab=95.29 E-value=2.8e-05 Score=45.56 Aligned_cols=79 Identities=20% Similarity=0.290 Sum_probs=41.3 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCC--cccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCccccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMP--VDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIP 78 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aP--vdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~ 78 (116) |-..|+..- .....+.+..+++.- |.-|-+..+ ...+|. +.+..|.+.|||+.. T Consensus 1 M~~~i~~~~-~~~~~L~~~lk~l~~k~V~VGi~~~~----~y~dG~-----~vA~Ia~~~E~G~p~-------------- 56 (189) T protein:vir:10 1 MGRVIRKQG-PARVKLNAFIKGMNDYSVRIGWFSTA----KYPDGT-----PTAYVASIHEFGAPS-------------- 56 (189) T ss_pred CcceeccCc-HHHHHHHHHHHHhhCCeEEEEecCCC----CCCCcc-----cHHHHHHHHHhcCcC-------------- Confidence 333333211 112223333333321 111222110 011221 235678889999731 Q ss_pred ccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 79 WSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 79 ~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) .++||||||+|++++++..+.+.|. T Consensus 57 -------------~~IP~RPFlr~t~~~~~~~~~~~l~ 81 (189) T protein:vir:10 57 -------------RGIPARSFIRPTIAAQQAAWSQQMR 81 (189) T ss_pred -------------CCCCCchhhhHHHHHHHHHHHHHHH Confidence 3589999999999999998888776 No 138 >protein:vir:96105 Length: 193 # NCBI annotation: hypothetical protein ORF028 # Family: family:all:503 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294445;genbank:gi:149408342;genbank:GeneID:5237224 Probab=95.29 E-value=5e-05 Score=44.16 Aligned_cols=107 Identities=17% Similarity=0.081 Sum_probs=50.4 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCC--cccccccccceeEeecCcEEEEEecCCcc-ccccccCCcccccCCCC------ Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMP--VDTGYLRESVTMDFKDSGFTGVINIGSEY-AIYVNYGTGIYATGAGG------ 71 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aP--vdTG~Lr~SI~~~~~~~~~~~~V~~~~~Y-A~~ve~GT~~~~~~~~~------ 71 (116) |+.-.+ ....+.++.+++.- |.-|-+..+... +.-.+.++++..| |.+.|||.......... T Consensus 3 ~~~~~~-----~~~~~~~~l~~l~~~~v~vGi~~~~~~~----~~~~~~~G~~va~iAai~EfG~~I~~~~~~~~~~~~~ 73 (193) T protein:vir:96 3 LRRDSE-----LIAAHLQMLRAMRGRSVSAGWYSTARYP----DKAGGSVGIQVARIARLNEYGGTIDHPGGTRYIRDAI 73 (193) T ss_pred eccchH-----HHHHHHHHHHHhcCCeEEEEEcCCCCCC----CcccccccchHHHHHhHHHcCCccccCccceeeeecc Confidence 332222 12223333333322 223444332211 1223567777666 99999997643221110 Q ss_pred cCcc-cccccccccc-cce----eccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 72 SRAK-KIPWSYKDAN-GKW----HTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 72 ~~~~-~~~~~~~~~~-~~~----~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) ..+. .+..+....+ +.. -.+-.+||||||++++++++..+.+.+. T Consensus 74 ~~g~~~~~~~~k~~~~~~~~~~~~~~v~IPaRPFlr~t~~~~~~~~~~~~~ 124 (193) T protein:vir:96 74 VRGRFVGVRFVRNDFPGETEVTKPHRITIPARPFMRYAWNLFSADRAAIQN 124 (193) T ss_pred ccccccccceeccCcceeeEeecceeccCCCcchhhhhHHHHHHHHHHHHH Confidence 0000 0000000000 000 1245799999999999999987776555 No 139 >protein:vir:5257 Length: 148 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852762;genbank:gi:31544037;uniprot:Q7Y5T8;genbank:GeneID:2753554 Probab=95.27 E-value=1.1e-05 Score=47.71 Aligned_cols=79 Identities=16% Similarity=0.262 Sum_probs=40.6 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeec---CcEEEEE-ecCCccccccccCCcccccCCCCcCccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKD---SGFTGVI-NIGSEYAIYVNYGTGIYATGAGGSRAKK 76 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~---~~~~~~V-~~~~~YA~~ve~GT~~~~~~~~~~~~~~ 76 (116) |--.++... .....+.+..+++.- ..+.+-+-. .+-...- .+.+..|.+.|||+ T Consensus 1 M~~~~k~~~-~~~~~l~~~l~~l~~-------~~v~VGi~~~~~~~~~~~~g~~vA~ia~~~E~G~-------------- 58 (148) T protein:vir:52 1 MAVTVTANF-SAAKQLIEQMKSLKE-------KAVYVGFPAEFDEKVKGSENFNLASLAAVLEFGN-------------- 58 (148) T ss_pred Ccccccccc-HHHHHHHHHHHHhhC-------CeEEEEeecCcCCCCCCCCCCCHHHHHHHHhcCC-------------- Confidence 221122111 112233333333321 111111110 0000000 13455788999996 Q ss_pred ccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 77 IPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 77 ~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) +++|+||||+|+++++++.+.+.|. T Consensus 59 ---------------~~IP~Rpflr~t~~~~~~~~~~~~~ 83 (148) T protein:vir:52 59 ---------------EHIPARPFLRQTLEENQEKYTALFI 83 (148) T ss_pred ---------------CCCCCcchhHHHHHHHHHHHHHHHH Confidence 4589999999999999999888888 No 140 >protein:vir:79179 Length: 155 # NCBI annotation: gp39, phage virion morphogenesis protein # Family: family:all:370 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111070;genbank:gi:134288746;genbank:GeneID:4960698 Probab=95.26 E-value=0.00041 Score=39.18 Aligned_cols=95 Identities=14% Similarity=0.113 Sum_probs=49.7 Q ss_pred ChHHHHH------------HHHHHHHHHHHHHHH-----hCC----c--------------cccccccc----------- Q lcl|NC_021326. 1 MERWVKR------------GIAKTTAKIHNTIIS-----LMP----V--------------DTGYLRES----------- 34 (116) Q Consensus 1 i~~~~~~------------~~~~~a~~v~~~ak~-----~aP----v--------------dTG~Lr~S----------- 34 (116) +++++.. .+...++.+....+. ..| + .+|.++++ T Consensus 8 l~~~l~~ll~~l~~~~~~~l~r~Ig~~l~~~t~~Rf~~q~~PDG~~W~prk~~~~~~~~~~~~g~~~~~~m~~~l~~a~~ 87 (155) T protein:vir:79 8 LERWAGGLLAKLSPAARRQLLRELGRDLRRAQQSRVAAQRNPDGSAYEPRKVKAGGKRLREKAGRVKREAMFRKLRTARY 87 (155) T ss_pred HHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchhhhhhhhhcccCcccchhhhhhhhhhhe Confidence 3333222 234444444443332 233 2 25655443 Q ss_pred ceeEeecCcEEEEE---ecCCccccccccCCcccccCCCCcCcccccccccccccceeccCCCCCCcchhHHH---HHHH Q lcl|NC_021326. 35 VTMDFKDSGFTGVI---NIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAI---DAGR 108 (116) Q Consensus 35 I~~~~~~~~~~~~V---~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~---~~~k 108 (116) |+++...++ +.| |++..||..++||........ .....+|++|||-=.- ++-. T Consensus 88 l~~~~~~d~--a~Vg~~Gs~~~yAaiHQfG~~~r~~~~-------------------~~~v~iPaRp~LGls~~d~~~I~ 146 (155) T protein:vir:79 88 LRIDVDSTG--LAIGFDERLSRIARVHQEGQKAPVEPG-------------------GPLAQYPVRVVLGFSDADRELVR 146 (155) T ss_pred eeeeecCcE--EEEEecCcchhhhhhhhcCCcccCCCC-------------------CcccccccccccCCCHHHHHHHH Confidence 555555454 445 999999999999964321110 1124689999994433 3333 Q ss_pred HHHHHhcC Q lcl|NC_021326. 109 AFFNKYFS 116 (116) Q Consensus 109 ~~i~~~i~ 116 (116) ..+..-|+ T Consensus 147 ~~i~~~l~ 154 (155) T protein:vir:79 147 DRLLRELT 154 (155) T ss_pred HHHHHHhh Confidence 44555566 No 141 >protein:vir:78335 Length: 133 # NCBI annotation: gp9 # Family: family:all:589 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468648;genbank:gi:157325225;genbank:GeneID:5601681 Probab=95.23 E-value=0.00027 Score=40.11 Aligned_cols=93 Identities=16% Similarity=0.200 Sum_probs=60.6 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhC--CcccccccccceeE---eecCcEEEEEecCCc---cc--cccccCCcccccCCC Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLM--PVDTGYLRESVTMD---FKDSGFTGVINIGSE---YA--IYVNYGTGIYATGAG 70 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~a--PvdTG~Lr~SI~~~---~~~~~~~~~V~~~~~---YA--~~ve~GT~~~~~~~~ 70 (116) |++.+++||..+++.++...|.+. ..|||.+-+++..+ ..++--+..|+...+ |. +.-|+|. T Consensus 24 m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~G~r~V~i~W~gp~~R~~iVHLNE~GY-------- 95 (133) T protein:vir:78 24 LPQLVDPALIAGATLVAKTLKSEFVQFKDTGASIDEINIEKPSYDKGVRSIKIDWKGPKDRYKIIHLNEYGY-------- 95 (133) T ss_pred HHHhhhHHHHHHHHHHHHHHHHhhcchhcccceeeeEEecCeeeeCCceEEEEEEecCCCceeEEEeeccce-------- Confidence 999999999999999999999854 45999999998653 233334566766433 42 3445663 Q ss_pred CcCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 71 GSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) ..+|+++..+||- =+..|++..++.+.+.+- T Consensus 96 ------------tr~Gk~i~PrG~G---~i~~a~~~se~~y~~~vk 126 (133) T protein:vir:78 96 ------------TRNGKKITPAGTG---SVARSLRISERAYRAIVQ 126 (133) T ss_pred ------------ecCCCeEccchhh---HHHHHHHhhhHHHHHHHH Confidence 3355666666666 455555555543333222 No 142 >protein:vir:3036 Length: 118 # NCBI annotation: minor capsid protein # Family: family:all:899 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438149;genbank:gi:16271812;genbank:GeneID:929237 Probab=95.19 E-value=0.00023 Score=40.49 Aligned_cols=95 Identities=16% Similarity=0.114 Sum_probs=52.0 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCccccccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWS 80 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~ 80 (116) -++.++++-...+.+|..++...+|.+||.|++|..+ .+++ |..+.+||..+-||..-....+ T Consensus 16 s~~~~~k~Q~~~~~ev~~~~~~YVP~~tG~Lk~S~~i--~~~~----I~Y~tPYAr~qYY~~~~~~~~g----------- 78 (118) T protein:vir:30 16 SPQALAKGKLIMNNQVMMSMNPYVPYRDGALRGSSRA--NSVG----VTWSGPHARAQFYGGAYNKYKS----------- 78 (118) T ss_pred hHHHHHHHHHHHHHHHHHHhhcCCCCccCccccceee--cCCe----eEECCchhhHhhhccccCCCCc----------- Confidence 2456667777778889999999999999999999643 3343 6688999988888742111110 Q ss_pred ccccccceeccCCCCCCcchhHHH------HHHHHHHHHhcC Q lcl|NC_021326. 81 YKDANGKWHTTKGQHAQPFWEPAI------DAGRAFFNKYFS 116 (116) Q Consensus 81 ~~~~~~~~~~~~g~~a~PFl~pA~------~~~k~~i~~~i~ 116 (116) .......||+.-++ +..++. ..-++.+.+-|- T Consensus 79 ---~~~~~~~~p~~g~~-Wd~R~ka~~~~~~~w~~~~~k~~g 116 (118) T protein:vir:30 79 ---FKFKKYTTPGTGKR-WDKRALANATIVKDWEKSLLRGMG 116 (118) T ss_pred ---cccccccCCCCCCc-ccchhhcchhhhHHHHHHHHHhcC Confidence 00011123333333 111111 111222333333 No 143 >protein:vir:9823 Length: 118 # NCBI annotation: putative minor capsid protein # Family: family:all:899 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795585;genbank:gi:28876336;genbank:GeneID:1257873 Probab=95.19 E-value=0.00023 Score=40.49 Aligned_cols=95 Identities=16% Similarity=0.114 Sum_probs=52.0 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCccccccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWS 80 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~ 80 (116) -++.++++-...+.+|..++...+|.+||.|++|..+ .+++ |..+.+||..+-||..-....+ T Consensus 16 s~~~~~k~Q~~~~~ev~~~~~~YVP~~tG~Lk~S~~i--~~~~----I~Y~tPYAr~qYY~~~~~~~~g----------- 78 (118) T protein:vir:98 16 SPQALAKGKLIMNNQVMMSMNPYVPYRDGALRGSSRA--NSVG----VTWSGPHARAQFYGGAYNKYKS----------- 78 (118) T ss_pred hHHHHHHHHHHHHHHHHHHhhcCCCCccCccccceee--cCCe----eEECCchhhHhhhccccCCCCc----------- Confidence 2456667777778889999999999999999999643 3343 6688999988888742111110 Q ss_pred ccccccceeccCCCCCCcchhHHH------HHHHHHHHHhcC Q lcl|NC_021326. 81 YKDANGKWHTTKGQHAQPFWEPAI------DAGRAFFNKYFS 116 (116) Q Consensus 81 ~~~~~~~~~~~~g~~a~PFl~pA~------~~~k~~i~~~i~ 116 (116) .......||+.-++ +..++. ..-++.+.+-|- T Consensus 79 ---~~~~~~~~p~~g~~-Wd~R~ka~~~~~~~w~~~~~k~~g 116 (118) T protein:vir:98 79 ---FKFKKYTTPGTGKR-WDKRALANATIVKDWEKSLLRGMG 116 (118) T ss_pred ---cccccccCCCCCCc-ccchhhcchhhhHHHHHHHHHhcC Confidence 00011123333333 111111 111222333333 No 144 >protein:vir:96012 Length: 133 # NCBI annotation: ORF023 # Family: family:all:589 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239805;genbank:gi:66395471;genbank:GeneID:5132929 Probab=95.18 E-value=0.00024 Score=40.41 Aligned_cols=94 Identities=21% Similarity=0.270 Sum_probs=62.7 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCc--ccccccccceeE---eecCcEEEEEecCCc---cc--cccccCCcccccCCC Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPV--DTGYLRESVTMD---FKDSGFTGVINIGSE---YA--IYVNYGTGIYATGAG 70 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPv--dTG~Lr~SI~~~---~~~~~~~~~V~~~~~---YA--~~ve~GT~~~~~~~~ 70 (116) |++.+++||..+++.++...|.+.-+ |||..-+++..+ ..++.-+..|+...+ |. +.-|+|+ T Consensus 23 m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGatidev~~s~p~~~~g~rtV~i~W~gp~~R~~iVHLNE~G~-------- 94 (133) T protein:vir:96 23 LMRITDRALTEAGEVVLEAIRTNLKYFRDTGAEYGEVKLSKPTWENGKRTIRVYWEGEKHRYSIVHLNEKGF-------- 94 (133) T ss_pred HHHHhhHHHHHHHHHHHHHHHHhhHHHhhccceeeeEEecCceecCCceEEEEEeecCCCceeeEeeecccc-------- Confidence 99999999999999999999977655 999999887653 223334567776443 42 4456664 Q ss_pred CcCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 71 GSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) +...|+++..+||- =+..|++..++.+.+.+- T Consensus 95 -----------ytr~Gk~i~PrG~G---~I~~al~~se~~y~~~vk 126 (133) T protein:vir:96 95 -----------YAKDGKFIRPKGMG---AIDKALRASRDKFFKVYA 126 (133) T ss_pred -----------eecCCceeccchhh---HHHHHHHhhhHHHHHHHH Confidence 23356666667776 466666666654333333 No 145 >protein:vir:99546 Length: 200 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039796;genbank:gi:126011046;genbank:GeneID:4818241 Probab=95.08 E-value=0.00012 Score=42.15 Aligned_cols=112 Identities=14% Similarity=0.062 Sum_probs=45.7 Q ss_pred ChHHH----HHHHHHHHHHHHHHHHHhCC--cccccccccceeEeecCcEEEEEecCC-ccccccccCCcccccCCC-C- Q lcl|NC_021326. 1 MERWV----KRGIAKTTAKIHNTIISLMP--VDTGYLRESVTMDFKDSGFTGVINIGS-EYAIYVNYGTGIYATGAG-G- 71 (116) Q Consensus 1 i~~~~----~~~~~~~a~~v~~~ak~~aP--vdTG~Lr~SI~~~~~~~~~~~~V~~~~-~YA~~ve~GT~~~~~~~~-~- 71 (116) |++-+ +-.-.+...++.++.+++.- |.-|-+..+-.. +.-....++.. ..|.+.|||+........ . T Consensus 1 ~~~~~~~~~k~~~~~~~~~~~~~l~~l~~~~v~vGi~~~~~y~----~~~~~~dG~~va~IA~~~EfG~~i~~p~~~~~~ 76 (200) T protein:vir:99 1 MKKGFSKSNSVAAPLKHFQMLKQFDALKGKTVQAGWFETDRYP----AKEGETIGPLVAKIARQLEFGGVINHPGGTKYI 76 (200) T ss_pred CCcCcceeeeeecchHHHHHHHHHHHhhCCeEEEEEcCCCCcC----CcccccccchHHHHHhHHHcCCeeccCCCcccc Confidence 21111 11111122222233333311 111222111000 00012334444 458999999764322111 0 Q ss_pred ----cCccccc--cccccccccee----ccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 72 ----SRAKKIP--WSYKDANGKWH----TTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 72 ----~~~~~~~--~~~~~~~~~~~----~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) ..+.... ......++..+ .+-.+||||||++++++++..+.+.+. T Consensus 77 ~~~~~~g~~~g~rfv~k~~~~~~~~~~~~~v~IP~RPFlr~t~~~~~~~~~~~~~ 131 (200) T protein:vir:99 77 KDAIVDGRYVGTRFVHKSFQGEHEVTKAHQIVIPARPFMRLAWATFNKDKVKIQA 131 (200) T ss_pred ccccccccccccccccccccceeeeeccccccCCCcchhhHHHHHHHHHHHHHHH Confidence 0000000 00000111111 256789999999999999998877665 No 146 >protein:vir:96973 Length: 133 # NCBI annotation: ORF034 # Family: family:all:589 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239864;genbank:gi:66395542;genbank:GeneID:5133006 Probab=95.03 E-value=0.0003 Score=39.92 Aligned_cols=93 Identities=16% Similarity=0.106 Sum_probs=63.2 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCc--ccccccccceeE-e--ecCc--EEEEEecCC---ccc--cccccCCcccccC Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPV--DTGYLRESVTMD-F--KDSG--FTGVINIGS---EYA--IYVNYGTGIYATG 68 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPv--dTG~Lr~SI~~~-~--~~~~--~~~~V~~~~---~YA--~~ve~GT~~~~~~ 68 (116) |++.+++||..+++.++...|.+..+ |||.+-+++..+ . .++. -+..|+... .|. +.-|+|. T Consensus 24 m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~i~W~gp~~R~~iVHLNE~Gy------ 97 (133) T protein:vir:96 24 MQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEWVGPMNRKNIIHLNEHGY------ 97 (133) T ss_pred HHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceeEEEEeecCCCceeEEEeeccce------ Confidence 99999999999999999999988884 999999998653 2 1222 445666633 342 3445663 Q ss_pred CCCcCcccccccccccccceeccCCCCCCcchhHHHHHHHHH----HHHhcC Q lcl|NC_021326. 69 AGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAF----FNKYFS 116 (116) Q Consensus 69 ~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~----i~~~i~ 116 (116) ..+|+++..+||- =+..|++..++. ++++|. T Consensus 98 --------------tr~Gk~i~PrG~G---~i~~a~~~se~~y~~~vk~eL~ 132 (133) T protein:vir:96 98 --------------TRDGKKYTPRGFG---VIAKTLAASERKYREIIKKELA 132 (133) T ss_pred --------------ecCCCeEccchhh---HHHHHHHhhhHHHHHHHHHHhc Confidence 3356666667776 466666666654 444444 No 147 >protein:vir:94419 Length: 133 # NCBI annotation: ORF028 # Family: family:all:589 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240010;genbank:gi:66395683;genbank:GeneID:5133079 Probab=95.03 E-value=0.0003 Score=39.92 Aligned_cols=93 Identities=16% Similarity=0.106 Sum_probs=63.2 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCc--ccccccccceeE-e--ecCc--EEEEEecCC---ccc--cccccCCcccccC Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPV--DTGYLRESVTMD-F--KDSG--FTGVINIGS---EYA--IYVNYGTGIYATG 68 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPv--dTG~Lr~SI~~~-~--~~~~--~~~~V~~~~---~YA--~~ve~GT~~~~~~ 68 (116) |++.+++||..+++.++...|.+..+ |||.+-+++..+ . .++. -+..|+... .|. +.-|+|. T Consensus 24 m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~i~W~gp~~R~~iVHLNE~Gy------ 97 (133) T protein:vir:94 24 MQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEWVGPMNRKNIIHLNEHGY------ 97 (133) T ss_pred HHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceeEEEEeecCCCceeEEEeeccce------ Confidence 99999999999999999999988884 999999998653 2 1222 445666633 342 3445663 Q ss_pred CCCcCcccccccccccccceeccCCCCCCcchhHHHHHHHHH----HHHhcC Q lcl|NC_021326. 69 AGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAF----FNKYFS 116 (116) Q Consensus 69 ~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~----i~~~i~ 116 (116) ..+|+++..+||- =+..|++..++. ++++|. T Consensus 98 --------------tr~Gk~i~PrG~G---~i~~a~~~se~~y~~~vk~eL~ 132 (133) T protein:vir:94 98 --------------TRDGKKYTPRGFG---VIAKTLAASERKYREIIKKELA 132 (133) T ss_pred --------------ecCCCeEccchhh---HHHHHHHhhhHHHHHHHHHHhc Confidence 3356666667776 466666666654 444444 No 148 >protein:vir:78644 Length: 133 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429946;genbank:gi:156604000;genbank:GeneID:5525390 Probab=95.03 E-value=0.0003 Score=39.92 Aligned_cols=93 Identities=16% Similarity=0.106 Sum_probs=63.2 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCc--ccccccccceeE-e--ecCc--EEEEEecCC---ccc--cccccCCcccccC Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPV--DTGYLRESVTMD-F--KDSG--FTGVINIGS---EYA--IYVNYGTGIYATG 68 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPv--dTG~Lr~SI~~~-~--~~~~--~~~~V~~~~---~YA--~~ve~GT~~~~~~ 68 (116) |++.+++||..+++.++...|.+..+ |||.+-+++..+ . .++. -+..|+... .|. +.-|+|. T Consensus 24 m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~i~W~gp~~R~~iVHLNE~Gy------ 97 (133) T protein:vir:78 24 MQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEWVGPMNRKNIIHLNEHGY------ 97 (133) T ss_pred HHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceeEEEEeecCCCceeEEEeeccce------ Confidence 99999999999999999999988884 999999998653 2 1222 445666633 342 3445663 Q ss_pred CCCcCcccccccccccccceeccCCCCCCcchhHHHHHHHHH----HHHhcC Q lcl|NC_021326. 69 AGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAF----FNKYFS 116 (116) Q Consensus 69 ~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~----i~~~i~ 116 (116) ..+|+++..+||- =+..|++..++. ++++|. T Consensus 98 --------------tr~Gk~i~PrG~G---~i~~a~~~se~~y~~~vk~eL~ 132 (133) T protein:vir:78 98 --------------TRDGKKYTPRGFG---VIAKTLAASERKYREIIKKELA 132 (133) T ss_pred --------------ecCCCeEccchhh---HHHHHHHhhhHHHHHHHHHHhc Confidence 3356666667776 466666666654 444444 No 149 >protein:vir:9363 Length: 133 # NCBI annotation: SLT orf 123-like protein # Family: family:all:589 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803341;genbank:gi:29028652;genbank:GeneID:1258087 Probab=95.03 E-value=0.0003 Score=39.92 Aligned_cols=93 Identities=16% Similarity=0.106 Sum_probs=63.2 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCc--ccccccccceeE-e--ecCc--EEEEEecCC---ccc--cccccCCcccccC Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPV--DTGYLRESVTMD-F--KDSG--FTGVINIGS---EYA--IYVNYGTGIYATG 68 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPv--dTG~Lr~SI~~~-~--~~~~--~~~~V~~~~---~YA--~~ve~GT~~~~~~ 68 (116) |++.+++||..+++.++...|.+..+ |||.+-+++..+ . .++. -+..|+... .|. +.-|+|. T Consensus 24 m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~i~W~gp~~R~~iVHLNE~Gy------ 97 (133) T protein:vir:93 24 MQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEWVGPMNRKNIIHLNEHGY------ 97 (133) T ss_pred HHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceeEEEEeecCCCceeEEEeeccce------ Confidence 99999999999999999999988884 999999998653 2 1222 445666633 342 3445663 Q ss_pred CCCcCcccccccccccccceeccCCCCCCcchhHHHHHHHHH----HHHhcC Q lcl|NC_021326. 69 AGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAF----FNKYFS 116 (116) Q Consensus 69 ~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~----i~~~i~ 116 (116) ..+|+++..+||- =+..|++..++. ++++|. T Consensus 98 --------------tr~Gk~i~PrG~G---~i~~a~~~se~~y~~~vk~eL~ 132 (133) T protein:vir:93 98 --------------TRDGKKYTPRGFG---VIAKTLAASERKYREIIKKELA 132 (133) T ss_pred --------------ecCCCeEccchhh---HHHHHHHhhhHHHHHHHHHHhc Confidence 3356666667776 466666666654 444444 No 150 >protein:vir:105825 Length: 108 # NCBI annotation: gp9 # Family: family:all:3937 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655770;genbank:gi:109522093;genbank:GeneID:4157633 Probab=94.96 E-value=4.9e-05 Score=44.19 Aligned_cols=84 Identities=19% Similarity=0.188 Sum_probs=53.4 Q ss_pred Ch-----HHHHHHHHHHHHHHHHHHHHhCCcccccccccceeE-eecCcEEEEEecCCccccccccCCcccccCCC-CcC Q lcl|NC_021326. 1 ME-----RWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMD-FKDSGFTGVINIGSEYAIYVNYGTGIYATGAG-GSR 73 (116) Q Consensus 1 i~-----~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~-~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~-~~~ 73 (116) ++ ..+.+.+.+...+|...=|++.||.||..|+|+.+. ...+.-.+.|+.+.+.|+.||||+--...... .+. T Consensus 18 lddfdklpevnqgvnef~dev~aawk~nspv~~g~yrdsvqvterstnkgrgkvgatdpqahlvefgs~hndeyapaqkt 97 (108) T protein:vir:10 18 LDDFDKLPEVNQGVNEFIDEVVAAWKNNSPVGTGAYRDSVQVTERSTNKGRGKVGATDPQAHLVEFGSAHNDEYAPAQKT 97 (108) T ss_pred hhhhhccchhhhhHHHHHHHHHHhhhcCCCccccccccceeecccccccccccccCcchhhhhhhhhccccccccchhhh Confidence 11 245566666677788888999999999999998653 22233468899999999999999854333221 222 Q ss_pred ccccccccccc Q lcl|NC_021326. 74 AKKIPWSYKDA 84 (116) Q Consensus 74 ~~~~~~~~~~~ 84 (116) +..+....... T Consensus 98 akqfggtay~d 108 (108) T protein:vir:10 98 AKQFGGTAYGD 108 (108) T ss_pred HHhhcccccCC Confidence 22221111111 No 151 >protein:vir:102608 Length: 108 # NCBI annotation: gp9 # Family: family:all:3937 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655005;genbank:gi:109392195;genbank:GeneID:4157230 Probab=94.96 E-value=4.9e-05 Score=44.19 Aligned_cols=84 Identities=19% Similarity=0.188 Sum_probs=53.4 Q ss_pred Ch-----HHHHHHHHHHHHHHHHHHHHhCCcccccccccceeE-eecCcEEEEEecCCccccccccCCcccccCCC-CcC Q lcl|NC_021326. 1 ME-----RWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMD-FKDSGFTGVINIGSEYAIYVNYGTGIYATGAG-GSR 73 (116) Q Consensus 1 i~-----~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~-~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~-~~~ 73 (116) ++ ..+.+.+.+...+|...=|++.||.||..|+|+.+. ...+.-.+.|+.+.+.|+.||||+--...... .+. T Consensus 18 lddfdklpevnqgvnef~dev~aawk~nspv~~g~yrdsvqvterstnkgrgkvgatdpqahlvefgs~hndeyapaqkt 97 (108) T protein:vir:10 18 LDDFDKLPEVNQGVNEFIDEVVAAWKNNSPVGTGAYRDSVQVTERSTNKGRGKVGATDPQAHLVEFGSAHNDEYAPAQKT 97 (108) T ss_pred hhhhhccchhhhhHHHHHHHHHHhhhcCCCccccccccceeecccccccccccccCcchhhhhhhhhccccccccchhhh Confidence 11 245566666677788888999999999999998653 22233468899999999999999854333221 222 Q ss_pred ccccccccccc Q lcl|NC_021326. 74 AKKIPWSYKDA 84 (116) Q Consensus 74 ~~~~~~~~~~~ 84 (116) +..+....... T Consensus 98 akqfggtay~d 108 (108) T protein:vir:10 98 AKQFGGTAYGD 108 (108) T ss_pred HHhhcccccCC Confidence 22221111111 No 152 >protein:vir:1838 Length: 149 # NCBI annotation: O protein # Family: family:all:370 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052262;genbank:gi:9634069;genbank:GeneID:1262457 Probab=94.77 E-value=0.00037 Score=39.37 Aligned_cols=97 Identities=11% Similarity=0.032 Sum_probs=52.5 Q ss_pred ChHHHH------------HHHHHHHHHHHHHHHHh-----CC----c--------------------ccccccccceeEe Q lcl|NC_021326. 1 MERWVK------------RGIAKTTAKIHNTIISL-----MP----V--------------------DTGYLRESVTMDF 39 (116) Q Consensus 1 i~~~~~------------~~~~~~a~~v~~~ak~~-----aP----v--------------------dTG~Lr~SI~~~~ 39 (116) +++.+. +.+.+.++.+....+.+ .| + ++|.|.+||++.. T Consensus 7 ~~~~l~~ll~~L~~~~~~~l~r~Ig~~l~~~t~~rf~~q~~PdG~~W~p~~~~~~~~~~g~~~~~~~~~l~~~~~l~~~~ 86 (149) T protein:vir:18 7 LQERLAGLIASLSPAARRKMAAEIAKKLRTSQQQRIKRQQAPDGTPYAARKRQPVRSKKGRIKREMFAKLRTSRFMKAKG 86 (149) T ss_pred HHHHHHHHHHhcCCchHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchhhhhhccCcccchhhhhhhhhhhhheee Confidence 222221 13555566555555432 44 2 1234566777766 Q ss_pred ecCcE-EEEEecCCccccccccCCcccccCCCCcCcccccccccccccceeccCCCCCCcchhHHHHHH---HHHHHHhc Q lcl|NC_021326. 40 KDSGF-TGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAG---RAFFNKYF 115 (116) Q Consensus 40 ~~~~~-~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~---k~~i~~~i 115 (116) ..++. .+.++++..||..++||......... ....+|++|||-=.-+.. ...+.+.| T Consensus 87 ~~~~~~v~~~Gtn~~yAaiHQfG~~~r~~~~~-------------------~~v~iPaRp~LG~s~~d~~~I~~~i~~~l 147 (149) T protein:vir:18 87 SDSAAVVEFTGKVQRMARVHQYGLKDRPNRNS-------------------RDVQYEARPLLGFTRDDEQMIEDVIISHL 147 (149) T ss_pred cCceeEEEecccchhhhhhhhccccccccCCC-------------------ccccccccccCCCCHHHHHHHHHHHHHHH Confidence 66653 34579999999999999653321110 113589999996443322 22344444 Q ss_pred C Q lcl|NC_021326. 116 S 116 (116) Q Consensus 116 ~ 116 (116) + T Consensus 148 ~ 148 (149) T protein:vir:18 148 G 148 (149) T ss_pred h Confidence 4 No 153 >protein:vir:2688 Length: 123 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075507;genbank:gi:12719436;genbank:GeneID:920156 Probab=94.16 E-value=0.00055 Score=38.47 Aligned_cols=93 Identities=17% Similarity=0.113 Sum_probs=62.7 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCc--ccccccccceeE-e--ecCc--EEEEEecCC---ccc--cccccCCcccccC Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPV--DTGYLRESVTMD-F--KDSG--FTGVINIGS---EYA--IYVNYGTGIYATG 68 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPv--dTG~Lr~SI~~~-~--~~~~--~~~~V~~~~---~YA--~~ve~GT~~~~~~ 68 (116) |++.+++||..+++.++...|.+.-+ |||..-+++..+ . ..+. -+..|+... .|. +.-|+|. T Consensus 14 m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGatidev~~s~p~~~~g~~~rtV~i~W~gp~~R~~iVHLNE~GY------ 87 (123) T protein:vir:26 14 MQAKSDRALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEWVGPMNRKNIIHLNEHGY------ 87 (123) T ss_pred HHHhhhHHHHHHHHHHHHHHHHhhHHhhhccceeeeEEecCeeeccCCccceEEEEeecCCCceeeEeeeccce------ Confidence 99999999999999999999977655 999999988654 1 1222 445666643 332 3445663 Q ss_pred CCCcCcccccccccccccceeccCCCCCCcchhHHHHHHHHH----HHHhcC Q lcl|NC_021326. 69 AGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAF----FNKYFS 116 (116) Q Consensus 69 ~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~----i~~~i~ 116 (116) ..+|+++..+||- =+..|++..++. ++++|. T Consensus 88 --------------tr~Gk~i~PRG~G---~i~~a~~~se~~y~~~vk~eL~ 122 (123) T protein:vir:26 88 --------------TRDGKKYTPRGFG---VIAKTLAANERKYREIIKKELA 122 (123) T ss_pred --------------ecCCCeEccchhh---HHHHHHHhhhHHHHHHHHHHhc Confidence 3456666667776 466677766654 444444 No 154 >protein:vir:5703 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839862;genbank:gi:30065717;genbank:GeneID:1260611 Probab=94.15 E-value=0.00064 Score=38.08 Aligned_cols=97 Identities=9% Similarity=0.013 Sum_probs=55.0 Q ss_pred ChHHHH------------HHHHHHHHHHHHHHHHh-----CC----c--------------------ccccccccceeEe Q lcl|NC_021326. 1 MERWVK------------RGIAKTTAKIHNTIISL-----MP----V--------------------DTGYLRESVTMDF 39 (116) Q Consensus 1 i~~~~~------------~~~~~~a~~v~~~ak~~-----aP----v--------------------dTG~Lr~SI~~~~ 39 (116) ++.++. +.+.+.++.+....+.+ .| + .+|.|.+||+++. T Consensus 7 l~~~L~~~l~~L~~~~~~~l~~~Ig~~l~~~~~~rf~~q~~PdG~~W~p~k~~~~~~k~~~~~~~l~~~~~l~~sl~~~~ 86 (150) T protein:vir:57 7 FEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSARKKTGRVKRKMFAKLITSRFLHIRA 86 (150) T ss_pred HHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccChHHHHHhccCCCcccchhhhhccceeeee Confidence 222222 23455565555555432 33 2 3567888898887 Q ss_pred ecCcEEE--EEecCCccccccccCCcccccCCCCcCcccccccccccccceeccCCCCCCcchhHHHHHHH---HHHHHh Q lcl|NC_021326. 40 KDSGFTG--VINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGR---AFFNKY 114 (116) Q Consensus 40 ~~~~~~~--~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k---~~i~~~ 114 (116) ..++.+. .++++..||..++||-........ ....+|++|||-=.-+..+ ..+.+. T Consensus 87 ~~~~a~vg~~~G~~~~yAaiHQfG~~~r~~~~~-------------------~~~~iPaRp~LG~s~~d~~~i~~~i~~~ 147 (150) T protein:vir:57 87 SPEQASMEFYGGKSPKIASVHQFGLSEETRKDG-------------------KKIDYPARPLLGFTGEDVQMIEEIILAH 147 (150) T ss_pred eCcEEEEEeecCCchhhhhhhhccccccccCCC-------------------ceeecCCcccCCCCHHHHHHHHHHHHHH Confidence 7776543 348999999999999543211110 0134899999965533322 234444 Q ss_pred cC Q lcl|NC_021326. 115 FS 116 (116) Q Consensus 115 i~ 116 (116) |+ T Consensus 148 l~ 149 (150) T protein:vir:57 148 LD 149 (150) T ss_pred Hh Confidence 44 No 155 >protein:vir:2026 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046769;genbank:gi:9630340;genbank:GeneID:1261511 Probab=93.97 E-value=0.00066 Score=38.02 Aligned_cols=97 Identities=9% Similarity=0.018 Sum_probs=54.3 Q ss_pred ChHHH------------HHHHHHHHHHHHHHHHHh-----CC----c--------------------ccccccccceeEe Q lcl|NC_021326. 1 MERWV------------KRGIAKTTAKIHNTIISL-----MP----V--------------------DTGYLRESVTMDF 39 (116) Q Consensus 1 i~~~~------------~~~~~~~a~~v~~~ak~~-----aP----v--------------------dTG~Lr~SI~~~~ 39 (116) ++.++ ++.+.+.++.+....+.+ .| + ++|.|.+||+++. T Consensus 7 l~~~L~~ll~~l~~~~~~~l~~~Ig~~l~~~~~~rf~~q~~PdG~~W~p~k~~~~~~k~g~~~~~l~~~~~l~~sl~~~~ 86 (150) T protein:vir:20 7 FEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSVRKKTGRVKRKMFAKLITSRFLHIRA 86 (150) T ss_pred HHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHHHhccCCCccccchhhhhhhhheee Confidence 22222 223455555555544432 23 2 4578899998887 Q ss_pred ecCcEEE--EEecCCccccccccCCcccccCCCCcCcccccccccccccceeccCCCCCCcchhHHHHHHH---HHHHHh Q lcl|NC_021326. 40 KDSGFTG--VINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGR---AFFNKY 114 (116) Q Consensus 40 ~~~~~~~--~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k---~~i~~~ 114 (116) ..++... .++++..||..++||-......+. ....+|++|||-=.-+... ..+.+. T Consensus 87 ~~~~~~vg~~~Gs~~~yAa~HQfG~~~~~~~~~-------------------~~~~iPaRp~LG~s~~d~~~i~~~i~~~ 147 (150) T protein:vir:20 87 SPEQASMEFYGGKSPKIASVHQFGLSEENRKDG-------------------KKIDYPARPLLGFTGEDVQMIEEIILAH 147 (150) T ss_pred cCcEEEEEeeCCcchhhhhhhhcccccccccCC-------------------CceeccccccCCCCHHHHHHHHHHHHHH Confidence 7776443 248899999999999543211100 1235899999965533322 233444 Q ss_pred cC Q lcl|NC_021326. 115 FS 116 (116) Q Consensus 115 i~ 116 (116) |+ T Consensus 148 l~ 149 (150) T protein:vir:20 148 LE 149 (150) T ss_pred Hh Confidence 44 No 156 >protein:vir:4460 Length: 170 # NCBI annotation: hypothetical protein # Family: family:all:2152 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700383;genbank:gi:23505455;genbank:GeneID:955662 Probab=93.91 E-value=0.00062 Score=38.19 Aligned_cols=104 Identities=18% Similarity=0.241 Sum_probs=69.2 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhC-----------C-cccccccccceeEeecC-----cEEEEEecCC----------- Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLM-----------P-VDTGYLRESVTMDFKDS-----GFTGVINIGS----------- 52 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~a-----------P-vdTG~Lr~SI~~~~~~~-----~~~~~V~~~~----------- 52 (116) =+..|++++.+++.....+|+.++ | ..||.|..||...+... |+-..|-.|. T Consensus 20 nr~r~RraF~~iGq~h~r~Arrlvm~RGrs~pGe~P~~~TGrLa~SIgy~Vpras~~rpG~mVkIaPNqk~G~g~r~i~g 99 (170) T protein:vir:44 20 NRARMRRAFVKIGQVHMRDARRLVMKRGRSKPGENPSYRTGQLARSIGYYVPRASKKRPGLMVKIAPNQKNGEGNRHING 99 (170) T ss_pred cHHHHHHHHHHHhHHHHHHHHHHHHHhcCCCCCCCCcchhhhhhhhhhhccccccCCCCceeEEecCCCCCCCCcccccc Confidence 566788899999999999988543 3 38999999997655433 6555665432 Q ss_pred -ccccccccCCcccccCCCCcCcc---cccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 53 -EYAIYVNYGTGIYATGAGGSRAK---KIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 53 -~YA~~ve~GT~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) -|-.|.+||-..-..+-+..... ...|.. .|-+=||..+++..+...+..|+ T Consensus 100 ~fYPafL~YGVr~gakr~k~hhr~a~ggsgwri------------aPR~Nym~~~l~~~~~wt~~~L~ 155 (170) T protein:vir:44 100 AFYPAFLFYGVRRGAKRKKGHHRGASGGSGWRV------------EPRNNYMTEVLDKRRSWTRYVLS 155 (170) T ss_pred ccchhhhhhhhhcccccchhhcccccCCCccee------------ccchhHHHHHHHhhHHHHHHHHH Confidence 47889999964322221111100 112221 34556999999999999988888 No 157 >protein:vir:100312 Length: 152 # NCBI annotation: tail synthesis protein S # Family: family:all:370 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655481;genbank:gi:109289949;genbank:GeneID:4157355 Probab=93.86 E-value=0.0013 Score=36.49 Aligned_cols=98 Identities=6% Similarity=0.011 Sum_probs=48.5 Q ss_pred ChHHHH------------HHHHHHHHHHHHHHHHh-----CC----c--------------cccc----cccc--ceeEe Q lcl|NC_021326. 1 MERWVK------------RGIAKTTAKIHNTIISL-----MP----V--------------DTGY----LRES--VTMDF 39 (116) Q Consensus 1 i~~~~~------------~~~~~~a~~v~~~ak~~-----aP----v--------------dTG~----Lr~S--I~~~~ 39 (116) +++++. +.+.+.++.+....+.+ +| + ++|. |+.| |.++. T Consensus 8 ~~~~L~~ll~~L~~~~r~~l~~~Ig~~l~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~k~~~~~~~m~~~L~~a~~l~~~a 87 (152) T protein:vir:10 8 VKTAFDSLLNNISKPRRRLMYQQIGRELARSQRRRIKAQQNPDGSAYEPRKKPKKGVKSKIKSGKMFDKITQPRFMRLRL 87 (152) T ss_pred HHHHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchhhhhhhhhcccccchhHHHhhhhcceeeeee Confidence 222221 13445555555544432 33 3 2233 3333 34444 Q ss_pred ecCcEE-EEEecCCccccccccCCcccccCCCCcCcccccccccccccceeccCCCCCCcchhHH---HHHHHHHHHHhc Q lcl|NC_021326. 40 KDSGFT-GVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPA---IDAGRAFFNKYF 115 (116) Q Consensus 40 ~~~~~~-~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA---~~~~k~~i~~~i 115 (116) ..++.+ +.++++..||..++||-......++.. ...+|++|||-=. .++-...+.+.| T Consensus 88 ~~~~~~Vg~~Gt~~~yAaiHQfG~~~r~~~~~~~------------------~v~iPaRp~LG~s~~d~~~I~~~i~~~l 149 (152) T protein:vir:10 88 ESEGVSLGYEGGDAVIARIHQQGLIGRVRKDWDL------------------KVKYASRELLGFTDDDLQMIEDYMINIL 149 (152) T ss_pred cCcEEEEEecCCchhhhhhhccCccccccCCCCc------------------ceeccccccCCCCHHHHHHHHHHHHHHH Confidence 444432 233899999999999954322221111 1238999999443 333334555555 Q ss_pred C Q lcl|NC_021326. 116 S 116 (116) Q Consensus 116 ~ 116 (116) + T Consensus 150 ~ 150 (152) T protein:vir:10 150 A 150 (152) T ss_pred h Confidence 5 No 158 >protein:vir:78894 Length: 105 # NCBI annotation: gp10 # Family: family:all:29989 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468850;genbank:gi:157325424;genbank:GeneID:5601891 Probab=93.62 E-value=0.00026 Score=40.27 Aligned_cols=85 Identities=14% Similarity=0.208 Sum_probs=49.2 Q ss_pred ChHHHHH-HHHH---HHHHHHHHHHHhCCcccccccccc--eeEeecCcEEEEEecCCccccccccCCcccccCCCCcCc Q lcl|NC_021326. 1 MERWVKR-GIAK---TTAKIHNTIISLMPVDTGYLRESV--TMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRA 74 (116) Q Consensus 1 i~~~~~~-~~~~---~a~~v~~~ak~~aPvdTG~Lr~SI--~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~ 74 (116) +.+.+.+ .|.. +-.++.+.+.=.+|.+||.|++|- .+.+..+.++..+..-++||.++-|... T Consensus 10 ~~k~l~kr~L~~~g~vq~EvlR~~~PyvP~~tG~Lk~S~~l~tvIgsg~I~y~~~~~aPYAr~qYYe~~----------- 78 (105) T protein:vir:78 10 VIDDIHNKALSTAAKAGGELVELAQPVTPILYGDLRRSSYFKIIIQKNSIVARVFSLTPYARRQYYENR----------- 78 (105) T ss_pred HHHHHHHhcCCCCchhhHHHHHHhCCCCcccccccccccccceeecCCeeEeeccccCchhhhhhhccc----------- Confidence 3333332 2221 222444444556799999999994 3445555566666667999998887542 Q ss_pred ccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 75 KKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 75 ~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) ..|+ |+..+...++..|++..- T Consensus 79 -------------------Rg~~-WfErm~a~hk~~I~~~ve 100 (105) T protein:vir:78 79 -------------------RNPR-WYEMAVSYGIQSINQIVE 100 (105) T ss_pred -------------------CCCc-hhHHhhhcchhHHHHHHh Confidence 0111 556666666665555444 No 159 >protein:vir:6071 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878212;genbank:gi:33438911;genbank:GeneID:1457746 Probab=92.93 E-value=0.00076 Score=37.70 Aligned_cols=97 Identities=9% Similarity=0.019 Sum_probs=54.5 Q ss_pred ChHHH------------HHHHHHHHHHHHHHHHHh-----CC----c--------------------ccccccccceeEe Q lcl|NC_021326. 1 MERWV------------KRGIAKTTAKIHNTIISL-----MP----V--------------------DTGYLRESVTMDF 39 (116) Q Consensus 1 i~~~~------------~~~~~~~a~~v~~~ak~~-----aP----v--------------------dTG~Lr~SI~~~~ 39 (116) ++.++ ++.+.+.++.+....+.+ .| + ++|.|.+||+++. T Consensus 7 l~~~L~~~l~~L~~~~~~~l~r~Ig~~l~~~~~~Rf~~q~~PdG~~W~p~~~~~~~~k~~~~~~~l~~~~~l~~sl~~~~ 86 (150) T protein:vir:60 7 FEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSARKKTGRVKRKMFAKLITSRFLHIRA 86 (150) T ss_pred HHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccChHHHHHhhcCCCccchhhhhhcceeeeee Confidence 22111 223445555555544432 33 2 3577888998888 Q ss_pred ecCcEEE--EEecCCccccccccCCcccccCCCCcCcccccccccccccceeccCCCCCCcchhHHHHHHH---HHHHHh Q lcl|NC_021326. 40 KDSGFTG--VINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGR---AFFNKY 114 (116) Q Consensus 40 ~~~~~~~--~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k---~~i~~~ 114 (116) ..++.+. .++++..||..++||-........ ....+|++|||-=.-+... ..+.+. T Consensus 87 ~~~~a~vg~~~Gt~~~yAaiHQfG~~~~~~~~~-------------------~~~~iPaRp~LG~s~~d~~~i~~~i~~~ 147 (150) T protein:vir:60 87 SPEQASMEFYGGKSPKIASVHQFGLSEENRKDG-------------------KKIDYPARPLLGFTGEDVQMIEEIILAH 147 (150) T ss_pred eCcEEEEEeeCCCchhhhhhhhccccccccCCC-------------------CceecCCcccCCCCHHHHHHHHHHHHHH Confidence 7776543 348999999999999543211100 1234899999965533322 234444 Q ss_pred cC Q lcl|NC_021326. 115 FS 116 (116) Q Consensus 115 i~ 116 (116) |+ T Consensus 148 l~ 149 (150) T protein:vir:60 148 LD 149 (150) T ss_pred Hh Confidence 44 No 160 >protein:vir:487 Length: 187 # NCBI annotation: hypothetical protein # Family: family:all:2152 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543094;swissprot:trembl:q8w625;genbank:gi:18249906;uniprot:Q8W625;genbank:GeneID:929690 Probab=92.85 E-value=0.001 Score=36.98 Aligned_cols=104 Identities=17% Similarity=0.257 Sum_probs=69.9 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhC-----------C-cccccccccceeEee-----cCcEEEEEecC------------ Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLM-----------P-VDTGYLRESVTMDFK-----DSGFTGVINIG------------ 51 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~a-----------P-vdTG~Lr~SI~~~~~-----~~~~~~~V~~~------------ 51 (116) =+..|++++.+++.....+|+.++ | ..||.|..||...+. ..|+-..|-.| T Consensus 33 nr~riRraF~~iGq~h~r~ArrLvm~RGrs~pge~P~~qTGrLa~SIgy~Vpkat~~RpG~mVkIaPNqk~G~g~r~~Pi 112 (187) T protein:vir:48 33 NRARLRRAFVQIGRVYMRDARRLVIKRGRSGPGENPGYQTGRLARSIGYYVPKKTTRRPGLMVKISPNQKNGQGNRRFPE 112 (187) T ss_pred cHHHHHHHHHHHhHHHHHHHHHHHHhcccCCCCCCCcchhhhhhhhhhhccccccCCCCcceEEecCCcccCcccccccc Confidence 466788889999999988888664 3 379999999976554 45655566554 Q ss_pred --CccccccccCCcccccCCCC----cCc-ccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 52 --SEYAIYVNYGTGIYATGAGG----SRA-KKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 52 --~~YA~~ve~GT~~~~~~~~~----~~~-~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) .-|-.|.+||-..-...-.. ..+ ....|.. .|-+=||..+++..+...+..|+ T Consensus 113 ~gdfYPafL~YGVr~ga~~~~~~~k~~~~~~~sgwri------------aPR~Nym~~~L~~~~~wt~~~L~ 172 (187) T protein:vir:48 113 GAPYYPAFLYYGVRHSAYGMDKKDKRQKKHHSSTFRL------------APRNNFMADVIERRRHWTQELLS 172 (187) T ss_pred cccchhHHHHhhhhhhhhccchhhhhhhcccCCccee------------ccchhHHHHHHHhhHHHHHHHHH Confidence 24888999996432221111 111 1122221 34456999999999999988888 No 161 >protein:vir:95260 Length: 160 # NCBI annotation: Phage conserved protein # Family: family:all:31735 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944893;genbank:gi:38707833;genbank:GeneID:2744046 Probab=92.59 E-value=0.00033 Score=39.66 Aligned_cols=76 Identities=14% Similarity=0.197 Sum_probs=38.8 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCccccccccc-ceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCcccccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRES-VTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPW 79 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~S-I~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~ 79 (116) |++.....+++-...+++.++.. |.-|-..+. +. ++|. +-+..|.|.||||. T Consensus 2 ~~~~~~~G~~~L~~~~k~l~~~~--V~VGi~~d~g~~----~dG~-----sv~~vA~~~EfG~~---------------- 54 (160) T protein:vir:95 2 VKRVIHPARAKLVGAMKNLQTAN--AQVGYFQEQGQH----SSGF-----SYPALMYLQEVIGV---------------- 54 (160) T ss_pred ceeechHhHHHHHHHHHHHhCCe--eEEeeccccccC----CCCc-----cHHHHHhhhhcCcc---------------- Confidence 66655555555555554433322 233333222 11 1121 22357889999973 Q ss_pred cccccccceeccCCCCCCcchhHHHHH----HH-HHHH---HhcC Q lcl|NC_021326. 80 SYKDANGKWHTTKGQHAQPFWEPAIDA----GR-AFFN---KYFS 116 (116) Q Consensus 80 ~~~~~~~~~~~~~g~~a~PFl~pA~~~----~k-~~i~---~~i~ 116 (116) .+|++|||+++++. .+ ..+. +++. T Consensus 55 -------------~iPaRPf~R~tfe~~~~~~~~~~~~~~~~~i~ 86 (160) T protein:vir:95 55 -------------PSASGKVYRRLFEITMMLNKQTLLEQTKKNLY 86 (160) T ss_pred -------------cCCCcchhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 37999999999973 22 2222 1121 No 162 >protein:vir:79115 Length: 148 # NCBI annotation: tail completion protein gpS # Family: family:all:370 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165266;genbank:gi:145708091;genbank:GeneID:5247126 Probab=91.43 E-value=0.0042 Score=33.59 Aligned_cols=97 Identities=10% Similarity=-0.005 Sum_probs=51.7 Q ss_pred ChHHHH------------HHHHHHHHHHHHHHHH-----hCC----c-------------------ccccccccceeEee Q lcl|NC_021326. 1 MERWVK------------RGIAKTTAKIHNTIIS-----LMP----V-------------------DTGYLRESVTMDFK 40 (116) Q Consensus 1 i~~~~~------------~~~~~~a~~v~~~ak~-----~aP----v-------------------dTG~Lr~SI~~~~~ 40 (116) +++++. +.+.+.++.+....+. ..| + +++.|.+|++.+.. T Consensus 7 l~~~L~~ll~~l~~~~~~~l~r~Ig~~l~~st~~Rf~~q~~PDG~~W~p~s~~~~~~~g~~~~~~~~~l~~~~~l~~~~~ 86 (148) T protein:vir:79 7 LEAWLAGMLTKLDAPARRMLARAVAAELRRRQAARIAEQRNPDGSPYVPRKPQLRHRAGRIRRAMFMRLRLARYMKTQAD 86 (148) T ss_pred HHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCcCcccchHHHhhcccccccccchhhhhhheeeeee Confidence 222222 2344555555444433 233 2 23556777877766 Q ss_pred cCcEE-EEEecCCccccccccCCcccccCCCCcCcccccccccccccceeccCCCCCCcchhHHHHHHH---HHHHHhcC Q lcl|NC_021326. 41 DSGFT-GVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGR---AFFNKYFS 116 (116) Q Consensus 41 ~~~~~-~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k---~~i~~~i~ 116 (116) .++.. +-+|++..||..++||-...+... .....+|++|||-=.-+..+ ..+.+.|+ T Consensus 87 ~~~~~v~~~Gt~~~yAaiHQfG~~~r~~~~-------------------~~~v~iPaRp~LG~s~~d~~~i~~~i~~~l~ 147 (148) T protein:vir:79 87 ANTAVVTFAGNAQRIATVHQFGLRDRVNKA-------------------GLTAQYPARELLGMDGVDMEHITNLLLLHLG 147 (148) T ss_pred CCeeeEEeeccchhhhhhhhcCccccccCC-------------------CCccccCcccccCCCHHHHHHHHHHHHHHhc Confidence 55432 335999999999999943221100 01235899999965533322 23444455 No 163 >protein:vir:4514 Length: 168 # NCBI annotation: unknown # Family: family:all:2152 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599040;genbank:gi:19548998;genbank:GeneID:935228 Probab=88.04 E-value=0.007 Score=32.39 Aligned_cols=103 Identities=17% Similarity=0.263 Sum_probs=61.7 Q ss_pred ChHHHHHHH--------HHHHHHHHHHHHHhCC-----cccccccccceeEeecC-----cEEEEEecCC---------- Q lcl|NC_021326. 1 MERWVKRGI--------AKTTAKIHNTIISLMP-----VDTGYLRESVTMDFKDS-----GFTGVINIGS---------- 52 (116) Q Consensus 1 i~~~~~~~~--------~~~a~~v~~~ak~~aP-----vdTG~Lr~SI~~~~~~~-----~~~~~V~~~~---------- 52 (116) =+..|+++. .++...|...++. +| ..||.|..||...+... |+-..|-.|. T Consensus 19 nr~riRraFv~igq~hmr~ArrlV~rrgrs-~pGe~P~~qTGrLa~SIgy~Vpras~~rpG~mvkIaPNqk~G~g~r~i~ 97 (168) T protein:vir:45 19 NRARVRRAFVTIGQRHMRDARRLVMRHARS-APGENPGYQTGRLARSIGYMVPRASKHRPGFMARIAPNQRNGEGNRRIT 97 (168) T ss_pred cHHHHHHHHHHHhHHHHHHHHHHHhhcccc-cCCCCCcchhhhhhhhhhhccccccCCCCceEEEecCCCCCCCCCCccc Confidence 344566664 4555555555533 33 48999999997655433 6666666543 Q ss_pred --ccccccccCCcccccCCCCcCc---ccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 53 --EYAIYVNYGTGIYATGAGGSRA---KKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 53 --~YA~~ve~GT~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) -|-.|.+||-..-..+.+.... ....|.. .|-+-||..+++..+...+..|+ T Consensus 98 gdfYPafL~YGVr~gakr~r~h~rga~ggsgwri------------aPR~Nym~~~l~~~~~wt~~~L~ 154 (168) T protein:vir:45 98 GDFYPAFLFYGVRGGAKRRRSHHRGASGGSGWRL------------APRNNFMVETLEKNRSWTRYFLA 154 (168) T ss_pred cccchhhhhhhhhcchhhhhhhhccccCCCccee------------ccchhhHHHHHHhhHHHHHHHHH Confidence 3778999986433222111110 0112221 34556999999999999888888 No 164 >protein:vir:80037 Length: 199 # NCBI annotation: gp11 # Family: family:all:503 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468715;genbank:gi:157325295;genbank:GeneID:5601728 Probab=87.68 E-value=0.0016 Score=35.87 Aligned_cols=104 Identities=14% Similarity=0.142 Sum_probs=34.8 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEe--ecCcEEEEEecCCccccc------------cccCCc-cc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDF--KDSGFTGVINIGSEYAIY------------VNYGTG-IY 65 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~--~~~~~~~~V~~~~~YA~~------------ve~GT~-~~ 65 (116) |+=--. ++....+.++.+.+.- .++.+-+ +++.....|.+-.+|+.- .+.... +- T Consensus 1 m~vt~~---~~~~~~~~~~l~~L~~-------k~v~vGi~~~d~~~~~~Ia~~~E~Ga~I~~~~~~l~Ip~~~a~~~k~~ 70 (199) T protein:vir:80 1 MKVTTD---KSTMNKAIRELDQLDR-------YSLQIGLFGEDDSFIQMIAGVHEFGLTIRPKGKYLTIPTPEAGDRRAR 70 (199) T ss_pred Cccccc---HHHHHHHHHHHHHhcC-------CEEEEEEecCCCcchhheeehhhcCCeeecCCceeeecchhhhccccc Confidence 110000 0112223333333321 1111111 111112222222222210 110000 00 Q ss_pred ----ccCCCCcCccccccccccccc--ceecc--CCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 66 ----ATGAGGSRAKKIPWSYKDANG--KWHTT--KGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 66 ----~~~~~~~~~~~~~~~~~~~~~--~~~~~--~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) ...+++...... .+....+ -.+.. -++|+||||+|++++++.++.+.|. T Consensus 71 ~~~~~~~p~g~~~~~~--~~~~~~~~~~e~g~~~~~IP~RPFlr~t~~~~~~~~~~~~~ 127 (199) T protein:vir:80 71 DIPGLFKPKGKNILAV--AGPDGKLTVMFYLKTEVNIPERSFLRSTFDEKSNKWGELFE 127 (199) T ss_pred ccCcccccCCcceeee--eccccceeeeeeccccccCCCCchhHHHHHHHHHHHHHHHH Confidence 000000000000 0000000 11222 3789999999999999999888776 No 165 >protein:vir:78163 Length: 92 # NCBI annotation: hypothetical protein # Family: family:all:29889 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294805;genbank:gi:149882826;genbank:GeneID:5309191 Probab=83.79 E-value=0.0034 Score=34.15 Aligned_cols=73 Identities=16% Similarity=0.216 Sum_probs=42.4 Q ss_pred ChHHHHHH-HHHHHHHHHHHHHHhCCcccccccccceeEe-ecC-c-EEEEEecCCccccccccCCcccccCCCCcCccc Q lcl|NC_021326. 1 MERWVKRG-IAKTTAKIHNTIISLMPVDTGYLRESVTMDF-KDS-G-FTGVINIGSEYAIYVNYGTGIYATGAGGSRAKK 76 (116) Q Consensus 1 i~~~~~~~-~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~-~~~-~-~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~ 76 (116) |+.+=-++ +.-+|++....||+.||||||..|+...++- +.. . -..+|+++. -...||--||.....-+.. +. T Consensus 16 l~s~~VrALt~gaAe~aLa~AKAsAPVDTGAYRDGL~iE~~q~~~RtT~MVVG~D~-KTlLvESrTGNLakalk~~--rs 92 (92) T protein:vir:78 16 MRTPKVRALVDGVAEETLADAKASAPVDTGAYRDGLHIEHRQGRSRETAMVVGSDE-KTLLIESRTGNLARSVKRR--RS 92 (92) T ss_pred hcccchhhhhhhhhhhhhhhhcccCcccccccccccchhhhhccccceeEEeecCc-ceeeeecccchHHHHHhhh--cC Confidence 33332233 4456778889999999999999999987753 221 1 224566544 3566777776432211110 00 No 166 >protein:vir:1087 Length: 161 # NCBI annotation: Orf46 # Family: family:all:1029 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076741;genbank:gi:13095851;genbank:GeneID:920400 Probab=83.24 E-value=0.017 Score=30.35 Aligned_cols=104 Identities=12% Similarity=0.081 Sum_probs=54.3 Q ss_pred ChHHHHH---------------HHHHHHHHHHHHHHHhCCc------ccc---cccccceeEeec-Cc---EEEEEecCC Q lcl|NC_021326. 1 MERWVKR---------------GIAKTTAKIHNTIISLMPV------DTG---YLRESVTMDFKD-SG---FTGVINIGS 52 (116) Q Consensus 1 i~~~~~~---------------~~~~~a~~v~~~ak~~aPv------dTG---~Lr~SI~~~~~~-~~---~~~~V~~~~ 52 (116) |+..+++ ...++|...+......+|. .|| +|++||..+-.+ +| -+.+||... T Consensus 12 L~~~~~~v~klv~~lt~e~kakIT~AGAkv~a~~L~~~T~~kHy~~~kt~k~~HLADsI~~~~~niDg~~dG~StVGw~~ 91 (161) T protein:vir:10 12 MNGIIFQAESVSTSLTVEDKAKITKAGANAFAIGLEKVTKDKHYRIRKTGENPHLADSILVQNTNIDGIKDGNSTVGWDY 91 (161) T ss_pred HHHHHHHHHhhcCCCCHHHHHHHHHHhHHHHHHHHHHHhhhhcCcCCCCCCcchhhhheeecccccCcccCCceeccccC Confidence 1111111 2334455555555555554 454 999999775321 11 145677643 Q ss_pred c---cccccccCCcccccCCCCcCcccccccccccccceeccCCCCCCcchhHHHH--HHHHHHHHhcC Q lcl|NC_021326. 53 E---YAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAID--AGRAFFNKYFS 116 (116) Q Consensus 53 ~---YA~~ve~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~--~~k~~i~~~i~ 116 (116) . -|.|++.||+--.....+...+ .-.+..|++-+|+..+-+ +.++.+.+..+ T Consensus 92 kka~ia~~indGtr~~~~~~~~~~~~------------n~Gt~~i~gDHFvd~~r~~~~~k~aV~~Ae~ 148 (161) T protein:vir:10 92 TKSRVGHLIENGTRFPMYSKKGTKYR------------KGGQVAITSDPFVSTYRDSMEAQVAMFSAEA 148 (161) T ss_pred chhhhhhhhcccchhhhhhccccccc------------CCcceeecCcchhHHHHhhhhhHHHHHHHHH Confidence 3 4799999995311111110000 112567899999999887 34454444443 No 167 >protein:vir:1028 Length: 168 # NCBI annotation: Orf48 # Family: family:all:1029 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076682;genbank:gi:13095791;genbank:GeneID:920342 Probab=81.61 E-value=0.03 Score=28.96 Aligned_cols=101 Identities=20% Similarity=0.157 Sum_probs=52.7 Q ss_pred ChHHHH--------------H-HHHHHHHHHHHHHHHhCCc------ccc---cccccceeEee-----cCcEEEEEecC Q lcl|NC_021326. 1 MERWVK--------------R-GIAKTTAKIHNTIISLMPV------DTG---YLRESVTMDFK-----DSGFTGVINIG 51 (116) Q Consensus 1 i~~~~~--------------~-~~~~~a~~v~~~ak~~aPv------dTG---~Lr~SI~~~~~-----~~~~~~~V~~~ 51 (116) ++..++ . ...++|...++.....+|. +|| +|++||..+-. .+| +.+||.+ T Consensus 8 l~~~~~~vekl~~~ls~eqkakITkAGAkv~~~~L~~~tk~kHy~~k~t~~~~HLaDsI~~~~~niDg~~dG-~s~VGf~ 86 (168) T protein:vir:10 8 MQLIVDRAEELSTKMSVEDKAEVTKAGAKVFEQALAYEVRNRHYRHRDTGEDPHLADSIVMKNKNIDGVKDG-QSVVGWE 86 (168) T ss_pred HHHHHHHHHHhhcCCCHHHHHHHhHhhhHHHHHHHHHHhhHhhhccCCCCccchhhhhheecccccccccCC-ceeeccc Confidence 111111 1 1233444444444444443 565 89999976432 122 4566664 Q ss_pred C----------ccccccccCCcccccCCCCcCcccccccccccccceec---cCCCCCCcchhHHHHHH--HHHHHHhcC Q lcl|NC_021326. 52 S----------EYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHT---TKGQHAQPFWEPAIDAG--RAFFNKYFS 116 (116) Q Consensus 52 ~----------~YA~~ve~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~g~~a~PFl~pA~~~~--k~~i~~~i~ 116 (116) . .-|.|++.||+-|.- +...++.|. .+.|++-+|+..+-+.. ++.|.+..+ T Consensus 87 ~k~~~~~~~ka~iAr~lNDGTk~~~~--------------~~~~~~~~~~~g~v~i~gDHFvd~~r~d~a~k~~V~~Ae~ 152 (168) T protein:vir:10 87 RSTEKGTHTKGYIANIINNGSRFPQF--------------TTRSGRKYKKPGEVAVHADHFIEETRKNPIVQQGILKAEA 152 (168) T ss_pred Cccccccccchheeeecccccccccc--------------ccccccccccccccccccchhHHHhhhchhhhHHHHHHHH Confidence 3 348899999964321 112222222 35678889999887753 344333333 No 168 >protein:vir:7412 Length: 168 # NCBI annotation: hypothetical protein # Family: family:all:1029 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839929;genbank:gi:30089899;genbank:GeneID:1260686 Probab=80.27 E-value=0.04 Score=28.28 Aligned_cols=101 Identities=20% Similarity=0.166 Sum_probs=52.2 Q ss_pred ChHHH--------------HHH-HHHHHHHHHHHHHHhCC------cccc---cccccceeEee-----cCcEEEEEecC Q lcl|NC_021326. 1 MERWV--------------KRG-IAKTTAKIHNTIISLMP------VDTG---YLRESVTMDFK-----DSGFTGVINIG 51 (116) Q Consensus 1 i~~~~--------------~~~-~~~~a~~v~~~ak~~aP------vdTG---~Lr~SI~~~~~-----~~~~~~~V~~~ 51 (116) |+..+ +.- ..++|...++.....+| ..|| +|++||..+-. .+| +.+||.+ T Consensus 8 l~~~~~~vekl~~~lt~eqkakITkAGAkv~~~~L~~~t~~kHy~~k~t~~~~HLaDsI~~~~~niDg~~dG-~s~VGf~ 86 (168) T protein:vir:74 8 MQLIINQAESLSTKMTVEDKAEVTKAGAKVFEQALAYEVRNRHYRHRDTGEDPHLADSIVMKNKNIDGVKDG-QSVVGWE 86 (168) T ss_pred HHHHHHHHHhhccCCCHHHHHHHHHhhhHHHHHHHHHHhHHhhcccCCCcccchhhhheeecccccCcccCC-ceeeccc Confidence 11111 111 22333333333444444 3455 89999976432 122 4567765 Q ss_pred Cc----------cccccccCCcccccCCCCcCcccccccccccccceec---cCCCCCCcchhHHHHH--HHHHHHHhcC Q lcl|NC_021326. 52 SE----------YAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHT---TKGQHAQPFWEPAIDA--GRAFFNKYFS 116 (116) Q Consensus 52 ~~----------YA~~ve~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~g~~a~PFl~pA~~~--~k~~i~~~i~ 116 (116) .. -|.|++.||+-|.- +...++.|. .+.|++-+|+..+-+. .++.|.+..+ T Consensus 87 ~k~~~~~~~kA~iAr~lNDGTk~~~~--------------~~~~~~~~~~~g~v~i~gDHFvd~~r~~~~~k~~V~~Ae~ 152 (168) T protein:vir:74 87 RSTEKGTHTKGYIANIINNGSRFPQF--------------TTRSGRKYKKPGEVAVHADHFIEETRMNLIVQQGILKAEA 152 (168) T ss_pred ccccccccchhhhhhhhccccccccc--------------ccccccccccccccccccchhHHHHHhhhhhHHHHHHHHH Confidence 43 58999999964321 112222222 3568889999998776 3444444333 No 169 >protein:vir:96105 Length: 193 # NCBI annotation: hypothetical protein ORF028 # Family: family:all:503 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294445;genbank:gi:149408342;genbank:GeneID:5237224 Probab=76.33 E-value=0.01 Score=31.46 Aligned_cols=40 Identities=15% Similarity=0.127 Sum_probs=24.5 Q ss_pred Ch--HHHHHHHHHHHHHHHHHHHH---------hCC------------cccccccccceeEee Q lcl|NC_021326. 1 ME--RWVKRGIAKTTAKIHNTIIS---------LMP------------VDTGYLRESVTMDFK 40 (116) Q Consensus 1 i~--~~~~~~~~~~a~~v~~~ak~---------~aP------------vdTG~Lr~SI~~~~~ 40 (116) ++ --++++|+..+..++++++. |+| +|||+|++||+.++. T Consensus 131 ~~g~~~~~~~l~~~G~~~~~~ik~~I~~~~~ppna~~Ti~~KG~~~PLidTG~l~~SIty~Vv 193 (193) T protein:vir:96 131 ARGQITPDQALAQIGLALEGYIARSIRTGPWVANSASTVRRKGFNRPLVDTAHMLQSISSRVT 193 (193) T ss_pred HhCCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCCcHHHHHHhCCCCchhHHHHHHhhhcceeC Confidence 11 12444555555555555554 233 499999999988765 No 170 >protein:vir:78607 Length: 155 # NCBI annotation: BcepNY3gp06 # Family: family:all:503 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294843;genbank:gi:149882906;genbank:GeneID:5291078 Probab=75.57 E-value=0.011 Score=31.30 Aligned_cols=41 Identities=20% Similarity=0.050 Sum_probs=23.8 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHh----CC------------cccccccccceeEeec Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISL----MP------------VDTGYLRESVTMDFKD 41 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~----aP------------vdTG~Lr~SI~~~~~~ 41 (116) .+.++++.-..++..|+..+... +| +|||.|++||+.++.. T Consensus 99 ~~~~L~~~G~~~~~~Ik~~I~~~~~pna~~Ti~~Kg~~kPLidTG~l~~SIty~V~~ 155 (155) T protein:vir:78 99 AEVAMGQIGQAMKDDIKTTISEWPADNSADWAGKKGFNHGLIWTSHLLNSVEQEIVK 155 (155) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCCCcHHHHHhcCCCCchhHHHHHHHhhhhhccC Confidence 34444444444444444444332 22 4899999999988765 No 171 >protein:vir:80037 Length: 199 # NCBI annotation: gp11 # Family: family:all:503 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468715;genbank:gi:157325295;genbank:GeneID:5601728 Probab=75.48 E-value=0.017 Score=30.32 Aligned_cols=42 Identities=24% Similarity=0.168 Sum_probs=23.8 Q ss_pred ChHHHHHHHHHHHHHHHHHHHH---------hCC-------------cccccccccceeEeecC Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIIS---------LMP-------------VDTGYLRESVTMDFKDS 42 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~---------~aP-------------vdTG~Lr~SI~~~~~~~ 42 (116) =+-.++++|+..+..++++++. |+| +|||.|++||+.++.+. T Consensus 136 g~~~a~~~L~~~G~~~~~~Ik~~I~~~~~ppna~~Tia~rKg~~kPLidTG~l~~SIty~V~~~ 199 (199) T protein:vir:80 136 GKLSAEQVYNRLGAKIVDDIQMKIVEIQTPAKSAATLARNPRKNNPLIVTGKMKNSVTWKVMKS 199 (199) T ss_pred CCCcHHHHHHHHHHHHHHHHHHHHhccCCCCCCHHHHHHhcCCCCchHHHHHHHhhcceeeeeC Confidence 0112334444444444444443 122 38999999999987655 No 172 >protein:vir:94069 Length: 168 # NCBI annotation: putative RNA polymerase # Family: family:all:503 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453622;genbank:gi:84662658;genbank:GeneID:5142579 Probab=74.74 E-value=0.013 Score=30.97 Aligned_cols=51 Identities=18% Similarity=0.181 Sum_probs=24.8 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHh----CC------------cccccccccceeEeecCcEEEEEecCCccccccccCCcc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISL----MP------------VDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGI 64 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~----aP------------vdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~ 64 (116) .+..+...-..++..|+..+... +| +|||.|++||+.++-.++-.+ -.| T Consensus 102 ~~~~L~~lG~~~~~~Ik~~I~~~~ppna~sTi~~KG~~~PLiDTG~l~~SIty~Vv~d~~~~---------------~~~ 166 (168) T protein:vir:94 102 ADTALRTVGQRMAEDIQDTIRNWPADNSPEWAAIKGFNAGLRQTGVLLNAIDSAVIIDGEHG---------------EAP 166 (168) T ss_pred HHHHHHHHHHHHHHHHHHHhhcCCCCccHHHHHhcCCCCchhHHHHHHhhcceeeeecCCCC---------------CCC Confidence 33333333333334444444332 22 499999999999776544211 111 Q ss_pred cc Q lcl|NC_021326. 65 YA 66 (116) Q Consensus 65 ~~ 66 (116) .. T Consensus 167 ~~ 168 (168) T protein:vir:94 167 RE 168 (168) T ss_pred CC Confidence 00 No 173 >protein:vir:96763 Length: 177 # NCBI annotation: putative phage-related protein # Family: family:all:1091 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039824;genbank:gi:126010915;genbank:GeneID:5076273 Probab=74.25 E-value=0.063 Score=27.16 Aligned_cols=116 Identities=7% Similarity=-0.020 Sum_probs=54.6 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCC----cccccccccceeEeecCcEEEEEecCCccccccccCCcccccCCCCcCccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMP----VDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKK 76 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aP----vdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~ 76 (116) +.+++.+||++++..+...+...+. +....+++.+++....++.++.|+++..--+...+|+......+....... T Consensus 29 ~~~A~~rAlNrta~~~rt~~~r~v~~~~~i~~k~ir~r~~~~~a~~~~~~~i~~~~~~i~l~~~~~~r~t~~Gv~~g~~~ 108 (177) T protein:vir:96 29 LELAAQRAMTKAGQWLRTHSVRELGQQLGIKQEPLKKRFRVYPQRQKGEVRFWVGLDPIGVYRLGTPKVTQKGVKVNRNE 108 (177) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHhhheeeccCCCcEEEEEEeccceehhhcccCCCCccceEEeeEE Confidence 7777777777777776666654443 345778888877654556778888765444444566533322221100000 Q ss_pred c--ccccccccc---ceec---------cCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 77 I--PWSYKDANG---KWHT---------TKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 77 ~--~~~~~~~~~---~~~~---------~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) + .+......| .+.+ .-..|--|=|..+++...+.+.+.|. T Consensus 109 ~~gaFia~~~~g~~~Vf~R~gk~R~PI~~~~~pi~~~~~~~~e~~~~~~~~~~~ 162 (177) T protein:vir:96 109 YDGAFISPMKSNYPLVFKRRGKERLPIDLVDEDIDEPAMEVVERWERRVFQRFK 162 (177) T ss_pred cCCceeccCCCCCceEEEEecCCccceEEEEcCchHHHHHHHHHHHHHHHHHHH Confidence 0 000000011 0110 11122222244555544444433333 No 174 >protein:vir:1164 Length: 156 # NCBI annotation: predicted tail completion # Family: family:all:370 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490613;genbank:gi:17313233;genbank:GeneID:927308 Probab=74.21 E-value=0.1 Score=25.97 Aligned_cols=97 Identities=11% Similarity=0.029 Sum_probs=46.7 Q ss_pred ChHHHHH------------HHHHHHHHHHHHHHHh-----CC----c---c-------cc----------ccccc--cee Q lcl|NC_021326. 1 MERWVKR------------GIAKTTAKIHNTIISL-----MP----V---D-------TG----------YLRES--VTM 37 (116) Q Consensus 1 i~~~~~~------------~~~~~a~~v~~~ak~~-----aP----v---d-------TG----------~Lr~S--I~~ 37 (116) |++.+.. .+.+.++.+....+.+ .| + . +| .|+.| |.. T Consensus 8 l~~~L~~ll~~L~~~~~~~l~r~Ig~~l~~~t~~Rf~~q~~PdG~~W~p~~~~~~~~~~~~~~~~~~m~~~l~~~~~l~~ 87 (156) T protein:vir:11 8 LEDWAGPILRALEPGPRAALARSLARDLRRSQQKRVMAQRNPDGSAYEPRKKRELRGKQGRIRRKIKMFQKLRTVRYLRA 87 (156) T ss_pred HHHHHHHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHhhhccccccchhhhhhhhhhheeee Confidence 2222222 3455555555554432 33 2 0 12 13333 555 Q ss_pred EeecCcEE-EEEecCCccccccccCCcccccCCCCcCcccccccccccccceeccCCCCCCcchhHHHHHHHH---HHHH Q lcl|NC_021326. 38 DFKDSGFT-GVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRA---FFNK 113 (116) Q Consensus 38 ~~~~~~~~-~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~---~i~~ 113 (116) +...++.+ +..+++..||..++||......... ....+|++|||-=.-+..+. .|.+ T Consensus 88 ~~~~~~a~vg~~Gs~~~yA~iHQfG~~~~~~~~~-------------------~~v~iPaRp~LG~s~~d~~~i~~~i~~ 148 (156) T protein:vir:11 88 KGDAQAITVSFAGRIARIARVHQYGLRDRAEPGA-------------------PEVSYAQRLLLGFDSSDMETIQNGILA 148 (156) T ss_pred eecCcEEEEEecCCchhhhhhhcccccccccCCC-------------------CcccccccccCCCCHHHHHHHHHHHHH Confidence 54445432 2238999999999999643211110 01358999999655333322 2333 Q ss_pred hcC Q lcl|NC_021326. 114 YFS 116 (116) Q Consensus 114 ~i~ 116 (116) -|+ T Consensus 149 ~l~ 151 (156) T protein:vir:11 149 HID 151 (156) T ss_pred HHh Confidence 333 No 175 >protein:vir:99546 Length: 200 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039796;genbank:gi:126011046;genbank:GeneID:4818241 Probab=73.54 E-value=0.011 Score=31.22 Aligned_cols=40 Identities=15% Similarity=0.269 Sum_probs=25.2 Q ss_pred ChH----------HHHHHHHHHHHHHHHHHHH---------hCC------------cccccccccceeEee Q lcl|NC_021326. 1 MER----------WVKRGIAKTTAKIHNTIIS---------LMP------------VDTGYLRESVTMDFK 40 (116) Q Consensus 1 i~~----------~~~~~~~~~a~~v~~~ak~---------~aP------------vdTG~Lr~SI~~~~~ 40 (116) +++ -++++|+..+..++.+++. |+| +|||.|++||+.+++ T Consensus 130 ~~~~~~~~l~g~~~~~~~L~~~G~~~~~~ik~~I~~~~~ppna~sTi~~Kg~~~PLidTG~l~~SIty~Ve 200 (200) T protein:vir:99 130 QAQIARQLLDGTINPEQALAQIGLALEGCIVRSIKSGPWAANSPATIRAKGFDKPLIDTAHMWQTVSSKVS 200 (200) T ss_pred HHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCChHHHHHHhCCCCchHHHHHHHhHhccccC Confidence 111 2345566666666655554 333 499999999998877 No 176 >protein:vir:106728 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944315;genbank:gi:38638614;genbank:GeneID:2657357 Probab=73.01 E-value=0.015 Score=30.60 Aligned_cols=41 Identities=20% Similarity=0.050 Sum_probs=23.6 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHh----CC------------cccccccccceeEeec Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISL----MP------------VDTGYLRESVTMDFKD 41 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~----aP------------vdTG~Lr~SI~~~~~~ 41 (116) .+++++..-..++..|+..+... +| +|||.|++||+.++.. T Consensus 99 ~~~~L~~lG~~~~~~Ik~~I~~~~~pna~~Ti~~KG~~kPLidTG~l~~SIty~Vv~ 155 (155) T protein:vir:10 99 AEVAMGQIGQAMKDDIKTTISEWPADNSADWAGKKGFNHGLIWTSHLLNSVEQEIVK 155 (155) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCCCcHHHHHhcCCCCchhHHHHHHHhhhhhccC Confidence 44444444444444444444322 22 4899999999887654 No 177 >protein:vir:6154 Length: 119 # NCBI annotation: hypothetical protein # Family: family:all:10918 # MgeID: mge:127 # MgeName: phBC6A51 # Cross-refs: genbank:acc:NP_852533;genbank:gi:31415793;genbank:GeneID:1489145 Probab=64.62 E-value=0.0051 Score=33.17 Aligned_cols=92 Identities=20% Similarity=0.175 Sum_probs=58.6 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcccccccccceeEee---cCcEEEEEecCCccccccccCCcccccCCCCcCcccc Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFK---DSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKI 77 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~---~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~ 77 (116) -+.-+++.+++....-...|-.+||+-.|-|-.||-.+++ +..+.++-++..-||...||-+... T Consensus 22 yktpieqtvekhtrlqanqasnrapilhgplsesipasvkmvvgariigtygspliyaavqefthktk------------ 89 (119) T protein:vir:61 22 YKTPIEQTVEKHTRLQANQASNRAPILHGPLSESIPASVKMVVGARIIGTYGSPLIYAAVQEFTHKTK------------ 89 (119) T ss_pred ccccHHHHHHHhhhhhcccccccCceeecccccccchhhhhhhhhhhcccccchHHHHHHHHHhhhhh------------ Confidence 4455667777777777778888999999999999976654 2346677777788999888865221 Q ss_pred cccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 78 PWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 78 ~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) .|-...|..--.|||...--. ..++.+ T Consensus 90 -------kgfmrktafegeqpfvedisk-----tvqrva 116 (119) T protein:vir:61 90 -------KGFMRKTAFEGEQPFVEDISK-----TVQRVA 116 (119) T ss_pred -------hhhhhhhcccCCcchHHHHHH-----HHHHhh Confidence 111112333456777654332 223333 No 178 >protein:vir:8432 Length: 149 # NCBI annotation: gp27 # Family: family:all:30885 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818328;genbank:gi:29566764;genbank:GeneID:1260117 Probab=56.91 E-value=0.31 Score=23.34 Aligned_cols=93 Identities=22% Similarity=0.376 Sum_probs=48.4 Q ss_pred ChHHHHHHHHHH---HHHHHHH------HHH-h--CCcccccccccceeE------------eecCc-EEEEEecCCccc Q lcl|NC_021326. 1 MERWVKRGIAKT---TAKIHNT------IIS-L--MPVDTGYLRESVTMD------------FKDSG-FTGVINIGSEYA 55 (116) Q Consensus 1 i~~~~~~~~~~~---a~~v~~~------ak~-~--aPvdTG~Lr~SI~~~------------~~~~~-~~~~V~~~~~YA 55 (116) +++.+++.+... =-.+-+. |++ - -|..||+.+..|.-. +..+| -.+-|++|.+-| T Consensus 26 lrrivqrfindveqtwhdvwdvsmlgvlaqqtgvphpyqtgdykahikkkkltamqkirikkflkggmpiglvynndeka 105 (149) T protein:vir:84 26 LRRIVQRFINDVEQTWHDVWDVSMLGVLAQQTGVPHPYQTGDYKAHIKKKKLTAMQKIRIKKFLKGGMPIGLVYNNDEKA 105 (149) T ss_pred HHHHHHHHHHHHHHHHHhHhhHHHHHHHHhhcCCCCCccccchhhhhhhhhHHHHHHHHHHHHhhcCCceeEEecCCcch Confidence 444444433222 1122211 111 1 356899999888531 22344 468899999999 Q ss_pred cccccCCcccccCCCCcCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 56 IYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 56 ~~ve~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) +|+||||..- +|. ...+|. |.+| .|||+..+. +.+.|. T Consensus 106 hwieygtkrd--rpg----srspwg-----------pntp-----tpafeimqr-varimn 143 (149) T protein:vir:84 106 HWIEYGTKRD--RPG----SRSPWG-----------PNTP-----TPAFEIMQR-VARIMN 143 (149) T ss_pred hhhhhccccC--CCC----CCCCCC-----------CCCC-----ChhHHHHHH-HHHHhh Confidence 9999999532 111 122332 4444 467665442 233333 No 179 >protein:vir:3994 Length: 168 # NCBI annotation: unknown # Family: family:all:1029 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116502;genbank:gi:14251135;genbank:GeneID:921309 Probab=56.06 E-value=0.28 Score=23.65 Aligned_cols=102 Identities=19% Similarity=0.149 Sum_probs=49.9 Q ss_pred ChHHH--------------HH-HHHHHHHHHHHHHHHhCCc------cc---ccccccceeEeec-Cc---EEEEEecCC Q lcl|NC_021326. 1 MERWV--------------KR-GIAKTTAKIHNTIISLMPV------DT---GYLRESVTMDFKD-SG---FTGVINIGS 52 (116) Q Consensus 1 i~~~~--------------~~-~~~~~a~~v~~~ak~~aPv------dT---G~Lr~SI~~~~~~-~~---~~~~V~~~~ 52 (116) |+..+ +. ...++|...+......+|. .| ++|++||..+-.+ ++ -+.+||.+. T Consensus 8 l~~~~~~v~kl~~~lt~e~kakIT~AGAkv~a~~L~~~T~~kHy~~rktg~~~HLADsI~~~~~niDg~~dG~StVGw~~ 87 (168) T protein:vir:39 8 MQLIINQAESLSTKMTVEDKAEVTKAGAKVFEQALAYEVRNRHYRHRDTGEDPHLADSIVMKNKNIDGVKDGQSVVGWER 87 (168) T ss_pred HHHHHHHHHhccCCCCHHHHHHHHHHhHHHHHHHHHHHhHHhcccCCCCCCCccchhheeecccccCcccCCceeccccC Confidence 11111 11 2233344444434444442 34 7899999765321 11 134666633 Q ss_pred ----------ccccccccCCcccccCCCCcCcccccccccccccceec---cCCCCCCcchhHHHHHH--HHHHHHhcC Q lcl|NC_021326. 53 ----------EYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHT---TKGQHAQPFWEPAIDAG--RAFFNKYFS 116 (116) Q Consensus 53 ----------~YA~~ve~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~g~~a~PFl~pA~~~~--k~~i~~~i~ 116 (116) .-|.|++.||+-+. |....|..+. +..|++-+|+..+-+.. ++.+.+..+ T Consensus 88 k~~~~~~~~a~iAr~lNDGTrf~~--------------~~~~~~~~y~~~g~v~i~gDHFvd~~r~~~a~k~aV~~Ae~ 152 (168) T protein:vir:39 88 STEKGTHTKGYIANIINNGSRFPQ--------------FTTRSGRKYKNPGEVAVHADHFIEETRKNPIVQQGILKAEA 152 (168) T ss_pred ccccccccchhheehhccccccch--------------hhhhcccccccccceeecccchhHHHhhhhhhhHHHHHHHH Confidence 34899999995311 0011111111 34578889999888753 343333333 No 180 >protein:vir:99454 Length: 150 # NCBI annotation: hypothetical protein # Family: family:all:32760 # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919085;genbank:gi:119757043;genbank:GeneID:4606107 Probab=38.08 E-value=0.8 Score=21.12 Aligned_cols=111 Identities=13% Similarity=0.104 Sum_probs=55.1 Q ss_pred ChHHHHHHHHHHH-HHHHHHHHHhCC-----------cccccccccceeEe--ecCcEEEEEecCCccccccccCCcccc Q lcl|NC_021326. 1 MERWVKRGIAKTT-AKIHNTIISLMP-----------VDTGYLRESVTMDF--KDSGFTGVINIGSEYAIYVNYGTGIYA 66 (116) Q Consensus 1 i~~~~~~~~~~~a-~~v~~~ak~~aP-----------vdTG~Lr~SI~~~~--~~~~~~~~V~~~~~YA~~ve~GT~~~~ 66 (116) .++++-+-++.-| ++|.-+.++.|. .|--.|-..-.+++ ..+.+..+.+. .+=|+|.|-||--|+ T Consensus 11 ~re~lld~le~~areeiap~vq~~ahdile~yg~~hdydv~~iiea~et~v~rr~~rvvvr~gw-pepaiyfergt~dhv 89 (150) T protein:vir:99 11 AREALLDELEDHAREEIAPAVQQHAHDILEAYGRENDYDVQSIIDAAETRVERRKGSVVVRWGW-PEPAIFFERGTVDHV 89 (150) T ss_pred HHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHhccccccchhhhhhhhhhheeecCCeEEEEecC-CCcceeeeccchhhh Confidence 2222222122111 122222222221 01111111112223 23334444444 345899999999999 Q ss_pred cCCCCcCcccccccccc----------cccce-----eccCCCCCCcchhHHHHHHHHHHH Q lcl|NC_021326. 67 TGAGGSRAKKIPWSYKD----------ANGKW-----HTTKGQHAQPFWEPAIDAGRAFFN 112 (116) Q Consensus 67 ~~~~~~~~~~~~~~~~~----------~~~~~-----~~~~g~~a~PFl~pA~~~~k~~i~ 112 (116) ...+....+.+-|--.. ..|-+ +...|.|-.-|++..+.--+.+|. T Consensus 90 vea~nad~lsfvwedpp~wvre~fe~e~~g~rvfl~e~~v~glpesrfirdtln~lr~~fa 150 (150) T protein:vir:99 90 VEATNADVLSFIWEDPPRWVRQGYEREGGGWRVFLPEVEVSGLPESRFIRDTLNWLRRRFA 150 (150) T ss_pred hhccccchhhhhhcCchhHhHhhcCcCCCceEEEeecccccCCcchhhHHHHHHHHHHhcC Confidence 98888888777664111 12222 236799999999998866555554 No 181 >protein:vir:4096 Length: 140 # NCBI annotation: Gp9 protein # Family: family:all:28682 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510990;swissprot:trembl:q8w600;genbank:gi:17488512;uniprot:Q8W600;genbank:GeneID:1260318 Probab=35.78 E-value=0.54 Score=22.07 Aligned_cols=89 Identities=12% Similarity=0.166 Sum_probs=53.1 Q ss_pred ChHHHHHHHH-HHHHHHHHHHHHhCCcc---cccccccceeEee------cCcEEEEEecCC--ccccccccCCcccccC Q lcl|NC_021326. 1 MERWVKRGIA-KTTAKIHNTIISLMPVD---TGYLRESVTMDFK------DSGFTGVINIGS--EYAIYVNYGTGIYATG 68 (116) Q Consensus 1 i~~~~~~~~~-~~a~~v~~~ak~~aPvd---TG~Lr~SI~~~~~------~~~~~~~V~~~~--~YA~~ve~GT~~~~~~ 68 (116) .+++|.++|. +++..+.+.+..+.||. .|.+|+-.+.... ...+.-.|.+.. .|-.|-..|-|.| T Consensus 26 sE~~IN~~L~tkg~~~~~~~I~~~iPvS~~~k~~~RnK~HAK~s~pl~~~~~NLgf~i~~k~kf~YLvfPD~G~G~s--- 102 (140) T protein:vir:40 26 SEAIINKTLETKAVPLVKLNIEKRINLSKNWKGQLLNKNHAQSSGPFNVKMGNLGFELLTKPKFNYLIFPDQGIGKH--- 102 (140) T ss_pred HHHHHHHHHHhhhhHHHHhhhhhccCcCccchhhhccccchhhhhhhhhhhhhcceeEeecCcccccccccccCCCC--- Confidence 6677777766 55667778888899995 3456665554322 122333343333 3555655665443 Q ss_pred CCCcCcccccccccccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Q lcl|NC_021326. 69 AGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) Q Consensus 69 ~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~k~~i~~~i~ 116 (116) .--+|-||...++..-+.|.+.|- T Consensus 103 ------------------------n~~~q~FmerGl~~~t~~i~E~L~ 126 (140) T protein:vir:40 103 ------------------------NKTKQDFMQLGVEESSQEIVEMLE 126 (140) T ss_pred ------------------------CcchHHHHHhccccchhHHHHHHH Confidence 123455888888877777666665 No 182 >protein:vir:3427 Length: 192 # NCBI annotation: tail component # Family: family:all:869 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040590;genbank:gi:9626254;genbank:GeneID:2703485 Probab=20.75 E-value=2.7 Score=18.20 Aligned_cols=115 Identities=11% Similarity=0.045 Sum_probs=52.7 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCccc----ccccccceeEe-ecCcEEEEEecCCccccccccCCcccccCCC----- Q lcl|NC_021326. 1 MERWVKRGIAKTTAKIHNTIISLMPVDT----GYLRESVTMDF-KDSGFTGVINIGSEYAIYVNYGTGIYATGAG----- 70 (116) Q Consensus 1 i~~~~~~~~~~~a~~v~~~ak~~aPvdT----G~Lr~SI~~~~-~~~~~~~~V~~~~~YA~~ve~GT~~~~~~~~----- 70 (116) |.+++.+||++++..+...+...+...+ ..+++.+++.- +.+++.+.|..+..--+..-+|+........ T Consensus 21 vp~A~~rAiNrta~~a~t~~~r~v~~e~~I~~k~Ir~r~r~~kAs~~~l~a~I~~~~~~l~~~~l~~~~~~~~rr~~~~~ 100 (192) T protein:vir:34 21 VPGAAAMAINRVASSAISQSASQVARETKVRRKLVKERARLKRATVKNPQARIKVNRGDLPVIKLGNARVVLSRRRRRKK 100 (192) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCHHHHHhhheeccccCCCceEEEEEeccceeeeeeccccccccccccccc Confidence 7778888888888888887776666544 46777776642 2345677777654333333333322110000 Q ss_pred ---------CcCccccccccc------ccc---cceeccCCCCC-----------CcchhHHHHHHHH-----HHHHhcC Q lcl|NC_021326. 71 ---------GSRAKKIPWSYK------DAN---GKWHTTKGQHA-----------QPFWEPAIDAGRA-----FFNKYFS 116 (116) Q Consensus 71 ---------~~~~~~~~~~~~------~~~---~~~~~~~g~~a-----------~PFl~pA~~~~k~-----~i~~~i~ 116 (116) +.....+...+. ... +.+.++.|-.- .| +..||+.+.+ .+.++|+ T Consensus 101 ~~~~~~~~~g~~~k~Gk~~f~gaFia~m~ng~~~Vf~R~~gk~R~PIe~vkIpis~~-l~~af~~~~~~~~~~~~~~El~ 179 (192) T protein:vir:34 101 GQRSSLKGGGSVLVVGNRRIPGAFIQQLKNGRWHVMQRVAGKNRYPIDVVKIPMAVP-LTTAFKQNIERIRRERLPKELG 179 (192) T ss_pred ccccccccccceeeecceecCCcccccCCCCCceeEEEccCCCccceeEEEechhHH-HHHHHHHHHHHHHHHHHHHHHH Confidence 000000000000 001 11111122111 22 4666665543 3334444 Done!