Query         lcl|NC_015266.1_cdsid_YP_004306441.1 [gene=29] [protein=gp29] [protein_id=YP_004306441.1] [location=complement(22060..22527)]
Match_columns 155
No_of_seqs    111 out of 338
Neff          7.0
Searched_HMMs 1612
Date          Thu Nov 7 12:58:53 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_29 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_29_vs_rec_db.hhr

 No Hit                             Prob E-value P-value Score   SS Cols Query HMM  Template HMM
  1 protein:vir:79179 Length: 155  100.0   2E-57 1.2E-60 331.4 14.8  155     1-155  1-155 (155)
  2 protein:vir:1164 Length: 156 # 100.0 9.4E-53 5.9E-56 305.8 14.6  152     1-155  1-152 (156)
  3 protein:vir:100312 Length: 152 100.0 1.2E-52 7.3E-56 305.3 14.9  150     1-155  1-151 (152)
  4 protein:vir:98557 Length: 149  100.0 3.1E-52 1.9E-55 303.0 13.4  149     1-155  1-149 (149)
  5 protein:vir:5703 Length: 150 # 100.0 8.1E-52   5E-55 300.7 14.6  149     1-155  1-150 (150)
  6 protein:vir:2026 Length: 150 # 100.0 6.1E-52 3.8E-55 301.3 13.9  149     1-155  1-150 (150)
  7 protein:vir:6071 Length: 150 # 100.0 1.4E-51 8.5E-55 299.4 14.7  149     1-155  1-150 (150)
  8 protein:vir:1838 Length: 149 # 100.0 1.2E-51 7.6E-55 299.7 13.6  149     1-155  1-149 (149)
  9 protein:vir:79115 Length: 148  100.0 5.2E-51 3.2E-54 296.3 13.7  148     1-155  1-148 (148)
 10 protein:vir:99833 Length: 190  100.0 5.3E-44 3.3E-47 257.8 11.0  145     1-155  1-184 (190)
 11 protein:vir:1988 Length: 156 # 100.0 1.4E-42 8.8E-46 250.0 10.9  145     1-155  1-152 (156)
 12 protein:vir:107851 Length: 175 100.0 3.9E-41 2.4E-44 242.1 10.0  147     1-155  1-170 (175)
 13 protein:vir:99196 Length: 155  100.0 1.2E-40 7.3E-44 239.5 11.7  141     1-155  1-153 (155)
 14 protein:vir:79091 Length: 175  100.0 6.6E-41 4.1E-44 240.9  8.9  147     1-155  1-170 (175)
 15 protein:vir:79225 Length: 155  100.0 1.1E-39 6.9E-43 234.1 11.6  141     1-155  1-153 (155)
 16 protein:vir:103841 Length: 155 100.0 1.3E-39 7.8E-43 233.8  9.5  141     1-155  1-153 (155)
 17 protein:vir:3163 Length: 145 # 100.0 2.3E-34 1.4E-37 205.0  8.0  132     1-155  1-137 (145)
 18 protein:vir:3787 Length: 231 #  99.9   1E-30 6.3E-34 185.0 11.7  144     1-155  8-228 (231)
 19 protein:vir:78755 Length: 228   99.9   4E-30 2.5E-33 181.7 10.4  137     1-155  4-216 (228)
 20 protein:vir:3750 Length: 227 #  99.9 3.4E-29 2.1E-32 176.7 11.4  139     1-155  5-224 (227)
 21 protein:vir:98860 Length: 230   99.9 8.7E-27 5.4E-30 163.5 10.5  139     1-155  7-227 (230)
 22 protein:vir:274 Length: 166 #   99.8 9.8E-23 6.1E-26 141.2  8.9  139     1-155  7-149 (166)
 23 protein:vir:96105 Length: 193   98.4 4.6E-10 2.8E-13  71.8  4.2   83    66-155  1-129 (193)
 24 protein:vir:94796 Length: 137   98.0 2.2E-07 1.4E-10  57.0 10.8  115     1-154  1-137 (137)
 25 protein:vir:94654 Length: 142   98.0 1.4E-07 8.9E-11  58.1  9.5  117     1-155  1-140 (142)
 26 protein:vir:99546 Length: 200   98.0 5.5E-09 3.4E-12  65.9  1.5   90    57-155  1-136 (200)
 27 protein:vir:93738 Length: 137   97.8 5.7E-07 3.5E-10  54.8 10.8  115     1-154  1-137 (137)
 28 protein:vir:94490 Length: 137   97.8 5.7E-07 3.5E-10  54.8 10.8  115     1-154  1-137 (137)
 29 protein:vir:97427 Length: 137   97.8 5.7E-07 3.5E-10  54.8 10.8  115     1-154  1-137 (137)
 30 protein:vir:95894 Length: 137   97.8 7.1E-07 4.4E-10  54.3 10.9  115     1-154  1-137 (137)
 31 protein:vir:106041 Length: 137  97.8 2.6E-07 1.6E-10  56.7  8.5  103     1-155  1-130 (137)
 32 protein:vir:105330 Length: 137  97.8 6.2E-07 3.9E-10  54.6 10.6  115     1-154  1-137 (137)
 33 protein:vir:107099 Length: 137  97.8 9.6E-07 5.9E-10  53.6 10.7  115     1-154  1-137 (137)
 34 protein:vir:2740 Length: 114 #  97.7 7.4E-07 4.6E-10  54.2  9.8  109     1-155  1-114 (114)
 35 protein:vir:4906 Length: 114 #  97.7 7.4E-07 4.6E-10  54.2  9.8  109     1-155  1-114 (114)
 36 protein:vir:96829 Length: 135   97.7 1.5E-06 9.6E-10  52.4 11.1  115     1-154  1-135 (135)
 37 protein:vir:5978 Length: 144 #  97.7   1E-06 6.3E-10  53.4  9.6  118     1-154  4-144 (144)
 38 protein:vir:96121 Length: 137   97.6 1.9E-06 1.2E-09  51.9 10.7  115     1-154  1-137 (137)
 39 protein:vir:95789 Length: 114   97.6 1.3E-06 7.8E-10  52.9  9.3  107     1-155  1-111 (114)
 40 protein:vir:94108 Length: 149   97.5 3.8E-06 2.4E-09  50.3 10.4  115     1-154  13-149 (149)
 41 protein:vir:9930 Length: 108 #  97.5 3.4E-06 2.1E-09  50.6 10.0  106     2-155  1-108 (108)
 42 protein:vir:96486 Length: 112   97.4 4.1E-06 2.6E-09  50.1  9.9  106     1-155  1-110 (112)
 43 protein:vir:99101 Length: 142   97.4 5.3E-06 3.3E-09  49.5  9.9  113     1-155  1-139 (142)
 44 protein:vir:8669 Length: 142 #  97.4 5.3E-06 3.3E-09  49.5  9.9  113     1-155  1-139 (142)
 45 protein:vir:105916 Length: 149  97.3 5.9E-06 3.7E-09  49.2  9.5  115     1-154  13-149 (149)
 46 protein:vir:97088 Length: 157   97.2 4.6E-06 2.9E-09  49.8  8.5  120     1-155  6-147 (157)
 47 protein:vir:4347 Length: 164 #  97.1 4.4E-06 2.7E-09  50.0  7.4  139     1-155  1-148 (164)
 48 protein:vir:106506 Length: 137  97.0 5.4E-06 3.3E-09  49.5  6.8  108     1-155  1-129 (137)
 49 protein:vir:106623 Length: 115  97.0 2.1E-05 1.3E-08  46.3 10.0  112     1-154  1-115 (115)
 50 protein:vir:106570 Length: 182  97.0 1.9E-05 1.2E-08  46.5  9.3  124     1-155  1-174 (182)
 51 protein:vir:3617 Length: 112 #  96.9 1.4E-05 8.9E-09  47.1  8.6  106     1-154  1-112 (112)
 52 protein:vir:97144 Length: 115   96.9 2.2E-05 1.3E-08  46.2  9.3  110     1-154  1-115 (115)
 53 protein:vir:96225 Length: 115   96.9 2.2E-05 1.3E-08  46.2  9.3  110     1-154  1-115 (115)
 54 protein:vir:103917 Length: 115  96.9 2.2E-05 1.3E-08  46.2  9.3  110     1-154  1-115 (115)
 55 protein:vir:96358 Length: 115   96.9 2.2E-05 1.3E-08  46.2  9.3  110     1-154  1-115 (115)
 56 protein:vir:78858 Length: 115   96.9 2.2E-05 1.3E-08  46.2  9.3  110     1-154  1-115 (115)
 57 protein:vir:9312 Length: 115 #  96.9 2.2E-05 1.3E-08  46.2  9.3  110     1-154  1-115 (115)
 58 protein:vir:78077 Length: 141   96.8 5.4E-05 3.4E-08  44.0 11.0  121     1-155  1-139 (141)
 59 protein:vir:107545 Length: 140  96.8   1E-05 6.2E-09  48.0  6.5  106     1-148  1-140 (140)
 60 protein:vir:97982 Length: 140   96.8   1E-05 6.2E-09  48.0  6.5  106     1-148  1-140 (140)
 61 protein:vir:94538 Length: 125   96.7 1.9E-05 1.2E-08  46.4  7.5  109     1-155  1-120 (125)
 62 protein:vir:99744 Length: 115   96.7 7.1E-05 4.4E-08  43.3 10.6  112     1-154  1-115 (115)
 63 protein:vir:80037 Length: 199   96.7 3.7E-06 2.3E-09  50.4  3.5   83    64-155  1-132 (199)
 64 protein:vir:1891 Length: 179 #  96.6 1.8E-05 1.1E-08  46.7  6.7  144     1-155  1-163 (179)
 65 protein:vir:100075 Length: 140  96.6 3.4E-05 2.1E-08  45.1  8.3  116     1-155  1-130 (140)
 66 protein:vir:98409 Length: 108   96.6 5.7E-05 3.6E-08  43.8  9.4  106     1-154  1-108 (108)
 67 protein:vir:9708 Length: 125 #  96.6 6.5E-05   4E-08  43.5  9.7  114     1-155  1-121 (125)
 68 protein:vir:743 Length: 108 #   96.5 6.9E-05 4.3E-08  43.4  9.7  106     1-154  1-108 (108)
 69 protein:vir:93617 Length: 148   96.5 3.6E-05 2.2E-08  44.9  7.8  132     1-155  1-140 (148)
 70 protein:vir:100243 Length: 140  96.3 8.2E-05 5.1E-08  43.0  8.7  125     1-155  1-130 (140)
 71 protein:vir:1273 Length: 127 #  96.2 0.00016   1E-07  41.3 10.0  113     1-155  1-124 (127)
 72 protein:vir:105089 Length: 133  96.2 9.4E-05 5.8E-08  42.7  8.4  121     1-155  1-128 (133)
 73 protein:vir:80362 Length: 140   96.0 0.00014 8.4E-08  41.8  8.8  124     1-155  1-130 (140)
 74 protein:vir:96105 Length: 193   96.0 5.3E-05 3.3E-08  44.0  6.4   80      1-93  112-193 (193)
 75 protein:vir:5745 Length: 135 #  95.9 0.00019 1.2E-07  40.9  9.1  119     1-155  1-128 (135)
 76 protein:vir:97327 Length: 116   95.8 9.5E-05 5.9E-08  42.6  7.1   97     8-154  1-116 (116)
 77 protein:vir:1243 Length: 116 #  95.8 9.5E-05 5.9E-08  42.6  7.1   97     8-154  1-116 (116)
 78 protein:vir:99546 Length: 200   95.8 6.2E-05 3.9E-08  43.6  6.1   80      1-93  119-200 (200)
 79 protein:vir:105467 Length: 144  95.6 0.00064   4E-07  38.1 10.8  120     1-155  1-138 (144)
 80 protein:vir:194 Length: 149 #   95.6  0.0002 1.2E-07  40.9  7.9  131     1-155  1-141 (149)
 81 protein:vir:102441 Length: 137  95.6 0.00019 1.2E-07  41.0  7.7  109     1-155  1-132 (137)
 82 protein:vir:3873 Length: 128 #  95.4 0.00029 1.8E-07  40.0  8.3  115     1-155  1-125 (128)
 83 protein:vir:95062 Length: 116   95.4  0.0002 1.3E-07  40.8  7.4   97     8-154  1-116 (116)
 84 protein:vir:79034 Length: 141   95.4 0.00046 2.8E-07  38.9  9.2  125     1-155  1-137 (141)
 85 protein:vir:101594 Length: 173  95.4 0.00052 3.2E-07  38.6  9.5  119     1-155  1-168 (173)
 86 protein:vir:1386 Length: 149 #  95.3 0.00044 2.7E-07  39.0  8.8  129     1-155  1-141 (149)
 87 protein:vir:1437 Length: 140 #  95.3 0.00044 2.7E-07  39.0  8.7  111     1-155  1-130 (140)
 88 protein:vir:107568 Length: 146  95.2 0.00015 9.4E-08  41.5  6.1  129     1-155  1-140 (146)
 89 protein:vir:105007 Length: 146  95.2 0.00015 9.4E-08  41.5  6.1  129     1-155  1-140 (146)
 90 protein:vir:102875 Length: 146  95.2 0.00015 9.4E-08  41.5  6.1  129     1-155  1-140 (146)
 91 protein:vir:102085 Length: 146  95.2 0.00015 9.4E-08  41.5  6.1  129     1-155  1-140 (146)
 92 protein:vir:107757 Length: 189  95.2 0.00017   1E-07  41.3  6.2   91      1-97  69-189 (189)
 93 protein:vir:80037 Length: 199   95.0 0.00025 1.5E-07  40.4  6.7   83      1-95  115-199 (199)
 94 protein:vir:101563 Length: 155  94.3 5.2E-05 3.2E-08  44.1  1.3   78    71-155  1-95 (155)
 95 protein:vir:5257 Length: 148 #  94.2 5.1E-05 3.1E-08  44.1  1.0   75    54-155  1-88 (148)
 96 protein:vir:5257 Length: 148 #  94.0 0.00044 2.7E-07  39.0  5.8   78      1-93  71-148 (148)
 97 protein:vir:77650 Length: 155   93.8 7.5E-05 4.6E-08  43.2  1.1   68    65-155  1-95 (155)
 98 protein:vir:102963 Length: 163  93.7  0.0018 1.1E-06  35.6  8.6  142     1-155  1-156 (163)
 99 protein:vir:106728 Length: 155  93.6 7.6E-05 4.7E-08  43.2  0.9   68    65-155  1-95 (155)
100 protein:vir:6246 Length: 143 #  93.5  0.0024 1.5E-06  34.9  8.9  123     1-155  1-143 (143)
101 protein:vir:966 Length: 123 #   93.3  0.0051 3.1E-06  33.2 10.4  118     1-155  1-123 (123)
102 protein:vir:78607 Length: 155   93.3  0.0001 6.2E-08  42.5  1.0   78    65-155  1-95 (155)
103 protein:vir:107757 Length: 189  93.0 0.00024 1.5E-07  40.4  2.6   77    64-155  1-86 (189)
104 protein:vir:94069 Length: 168   92.7 0.00029 1.8E-07  39.9  2.7   81    44-155  1-98 (168)
105 protein:vir:1332 Length: 143 #  92.7   0.004 2.5E-06  33.7  8.9  122     1-155  1-143 (143)
106 protein:vir:9414 Length: 125 #  92.1  0.0053 3.3E-06  33.1  8.8  112     1-155  1-122 (125)
107 protein:vir:4704 Length: 125 #  92.1  0.0053 3.3E-06  33.1  8.8  112     1-155  1-122 (125)
108 protein:vir:79988 Length: 125   92.1  0.0053 3.3E-06  33.1  8.8  112     1-155  1-122 (125)
109 protein:vir:81106 Length: 125   92.1  0.0053 3.3E-06  33.1  8.8  112     1-155  1-122 (125)
110 protein:vir:98342 Length: 125   92.1  0.0053 3.3E-06  33.1  8.8  112     1-155  1-122 (125)
111 protein:vir:100223 Length: 139  90.3  0.0045 2.8E-06  33.5  6.6  119     1-155  3-132 (139)
112 protein:vir:78607 Length: 155   89.7  0.0044 2.7E-06  33.5  6.0   78      1-94  78-155 (155)
113 protein:vir:100887 Length: 139  89.7  0.0079 4.9E-06  32.1  7.4  119     1-155  3-132 (139)
114 protein:vir:106728 Length: 155  89.7  0.0044 2.7E-06  33.5  6.0   78      1-94  78-155 (155)
115 protein:vir:4956 Length: 153 #  89.6  0.0077 4.8E-06  32.2  7.3  119     1-155  4-136 (153)
116 protein:vir:94069 Length: 168   88.9  0.0062 3.8E-06  32.7  6.3   88     1-108  81-168 (168)
117 protein:vir:81147 Length: 126   88.4   0.029 1.8E-05  29.1  9.6  115     1-155  1-124 (126)
118 protein:vir:5000 Length: 141 #  85.8   0.023 1.4E-05  29.6  7.5  119     1-155  4-132 (141)
119 protein:vir:4833 Length: 140 #  85.4   0.032   2E-05  28.8  8.1  119     1-155  4-136 (140)
120 protein:vir:102154 Length: 119  85.3   0.033   2E-05  28.7  8.1  110     1-155  1-116 (119)
121 protein:vir:4859 Length: 140 #  84.5   0.035 2.2E-05  28.5  7.9  119     1-155  4-132 (140)
122 protein:vir:101563 Length: 155  83.6   0.011   7E-06  31.3  4.8   78      1-94  78-155 (155)
123 protein:vir:77650 Length: 155   83.3   0.012 7.2E-06  31.2  4.7   78      1-94  74-155 (155)
124 protein:vir:3848 Length: 159 #  80.8   0.091 5.7E-05  26.3 10.0  131     1-155  1-151 (159)
125 protein:vir:95260 Length: 160   78.5    0.03 1.8E-05  29.0  5.3   90     1-104  62-160 (160)
126 protein:vir:7412 Length: 168 #  76.9   0.099 6.2E-05  26.1  7.7  126     1-155  1-161 (168)
127 protein:vir:107703 Length: 147  71.6    0.15 9.2E-05  25.1  7.2  133     1-155  1-142 (147)
128 protein:vir:99528 Length: 92 #  70.7    0.13 8.2E-05  25.4  6.7   84     1-131  1-92 (92)
129 protein:vir:103280 Length: 142  70.4    0.13 8.2E-05  25.4  6.7  132     1-155  1-140 (142)
130 protein:vir:3994 Length: 168 #  57.0    0.44 0.00028  22.5  7.3  126     1-155  4-161 (168)
131 protein:vir:1028 Length: 168 #  51.4    0.58 0.00036  21.9  7.9  126     1-155  4-157 (168)
132 protein:vir:10367 Length: 119   48.7    0.27 0.00017  23.7  4.3   87    57-155  1-109 (119)
133 protein:vir:102338 Length: 116  47.2    0.56 0.00035  22.0  5.8  101    19-154  1-116 (116)
134 protein:vir:81067 Length: 119   41.8    0.39 0.00024  22.8  4.1   87    57-155  1-109 (119)
135 protein:vir:9879 Length: 127 #  37.2     1.1  0.0007  20.3  8.4  122     1-155  1-127 (127)
136 protein:vir:78335 Length: 133   33.4    0.64  0.0004  21.6  3.9  119     1-153  1-133 (133)
137 protein:vir:98636 Length: 138   33.3    0.83 0.00051  21.0  4.4  119     1-155  7-138 (138)
138 protein:vir:103765 Length: 549  31.8    0.19 0.00012  24.5  0.7  107     1-155  1-124 (549)
139 protein:vir:96012 Length: 133   31.4     1.2 0.00072  20.2  4.9  118     1-146  1-133 (133)
140 protein:vir:101302 Length: 134  25.2     0.9 0.00056  20.8  3.2  124     1-155  1-133 (134)
141 protein:vir:9513 Length: 134 #  25.2     0.9 0.00056  20.8  3.2  124     1-155  1-133 (134)
142 protein:vir:95372 Length: 124   23.4     2.3  0.0014  18.6  9.6  117     1-155  1-124 (124)
143 protein:vir:95157 Length: 144   23.0     2.4  0.0015  18.5  6.2  132     1-151  1-144 (144)
144 protein:vir:79638 Length: 146   20.4     2.8  0.0017  18.2  6.7  133     1-155  1-142 (146)

No 1
>protein:vir:79179 Length: 155 # NCBI annotation: gp39, phage virion morphogenesis protein # Family: family:all:370 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111070;genbank:gi:134288746;genbank:GeneID:4960698
Probab=100.00 E-value=2e-57 Score=331.41 Aligned_cols=155 Identities=86% Similarity=1.313 Sum_probs=152.2

Q ss_pred             CchhHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhhhh
Q lcl|NC_015266.    1 MDDDLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAMFR   80 (155)
Q Consensus         1 m~~~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l~~   80 (155)
                      |+|||++|+++|+.++++|+|+++++||++||++|+.+|++||++|++|||+||+|+++.+.+.+.+.++|+.+.++||.
T Consensus 1 m~~~~~~l~~~l~~ll~~l~~~~~~~l~r~Ig~~l~~~t~~Rf~~q~~PDG~~W~prk~~~~~~~~~~~~g~~~~~~m~~ 80 (155) T protein:vir:79 1 MTDDLQALERWAGGLLAKLSPAARRQLLRELGRDLRRAQQSRVAAQRNPDGSAYEPRKVKAGGKRLREKAGRVKREAMFR 80 (155) T ss_pred CchHHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchhhhhhhhhcccCcccchhhhh Confidence 99999999999999999999999999999999999999999999999999999999999998888888899999999999 Q ss_pred hhhhcceeeEEEcCcEEEEEecccccccccccccCccccccCCCceeeecCccccCCCHHHHHHHHHHHHHHhcC Q lcl|NC_015266. 81 KLRTARYLRIDVDDTGLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSSADRELVRDRLLRYLNR 155 (155) Q Consensus 81 ~~~l~~sl~~~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~~d~~~I~~~i~~~l~r 155 (155) +++++++|++++++|+++|+|.|+|.+||+|||||++++|++++++|+||||||||||++|+++|+++|.+||+| T Consensus 81 ~l~~a~~l~~~~~~d~a~Vg~~Gs~~~yAaiHQfG~~~r~~~~~~~v~iPaRp~LGls~~d~~~I~~~i~~~l~r 155 (155) T protein:vir:79 81 KLRTARYLRIDVDSTGLAIGFDERLSRIARVHQEGQKAPVEPGGPLAQYPVRVVLGFSDADRELVRDRLLRELTR 155 (155) T ss_pred hhhhhheeeeeecCcEEEEEecCcchhhhhhhhcCCcccCCCCCcccccccccccCCCHHHHHHHHHHHHHHhhC Confidence 999999999999999999999999999999999999999999999999999999999999999999999999999 No 2 >protein:vir:1164 Length: 156 # NCBI annotation: predicted tail completion # Family: family:all:370 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490613;genbank:gi:17313233;genbank:GeneID:927308 Probab=100.00 E-value=9.4e-53 Score=305.80 Aligned_cols=152 Identities=45% Similarity=0.744 Sum_probs=142.3 Q ss_pred CchhHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhhhh Q lcl|NC_015266. 1 MDDDLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAMFR 80 (155) Q Consensus 1 m~~~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l~~ 80 (155) |+|||++|+++|+.++++|+|+.+++||++||++|+.+|++||++|++|||+||+|+++.+...+.. +..+...|+. 
T Consensus 1 m~~~~~~l~~~L~~ll~~L~~~~~~~l~r~Ig~~l~~~t~~Rf~~q~~PdG~~W~p~~~~~~~~~~~---~~~~~~~m~~ 77 (156) T protein:vir:11 1 MADSLEALEDWAGPILRALEPGPRAALARSLARDLRRSQQKRVMAQRNPDGSAYEPRKKRELRGKQG---RIRRKIKMFQ 77 (156) T ss_pred CchhHHHHHHHHHHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHhhhcc---ccccchhhhh Confidence 9999999999999999999999999999999999999999999999999999999999887655422 2223456999 Q ss_pred hhhhcceeeEEEcCcEEEEEecccccccccccccCccccccCCCceeeecCccccCCCHHHHHHHHHHHHHHhcC Q lcl|NC_015266. 81 KLRTARYLRIDVDDTGLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSSADRELVRDRLLRYLNR 155 (155) Q Consensus 81 ~~~l~~sl~~~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~~d~~~I~~~i~~~l~r 155 (155) .++++++|++++++|+++|||+|++.+||++||||+++++++++++|+||||||||||++|+++|+++|.+||++ T Consensus 78 ~l~~~~~l~~~~~~~~a~vg~~Gs~~~yA~iHQfG~~~~~~~~~~~v~iPaRp~LG~s~~d~~~i~~~i~~~l~~ 152 (156) T protein:vir:11 78 KLRTVRYLRAKGDAQAITVSFAGRIARIARVHQYGLRDRAEPGAPEVSYAQRLLLGFDSSDMETIQNGILAHIDA 152 (156) T ss_pred hhhhhheeeeeecCcEEEEEecCCchhhhhhhcccccccccCCCCcccccccccCCCCHHHHHHHHHHHHHHHhh Confidence 999999999999999999999999999999999999999999999999999999999999999999999999999 No 3 >protein:vir:100312 Length: 152 # NCBI annotation: tail synthesis protein S # Family: family:all:370 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655481;genbank:gi:109289949;genbank:GeneID:4157355 Probab=100.00 E-value=1.2e-52 Score=305.27 Aligned_cols=150 Identities=32% Similarity=0.626 Sum_probs=139.4 Q ss_pred CchhHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhhhh Q lcl|NC_015266. 1 MDDDLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAMFR 80 (155) Q Consensus 1 m~~~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l~~ 80 (155) |+|||++++++|+.++++|+|+++++||++||++|+.+|++||++|++|||+||+|+++.+..+ ++..+...||. 
T Consensus 1 M~~~~~~~~~~L~~ll~~L~~~~r~~l~~~Ig~~l~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~-----k~~~~~~~m~~ 75 (152) T protein:vir:10 1 MSEPIEQVKTAFDSLLNNISKPRRRLMYQQIGRELARSQRRRIKAQQNPDGSAYEPRKKPKKGV-----KSKIKSGKMFD 75 (152) T ss_pred CchHHHHHHHHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchhhhhhhhh-----cccccchhHHH Confidence 9999999999999999999999999999999999999999999999999999999999876433 33445567999 Q ss_pred hhhhcceeeEEEcCcEEEEEecccccccccccccCccccccCC-CceeeecCccccCCCHHHHHHHHHHHHHHhcC Q lcl|NC_015266. 81 KLRTARYLRIDVDDTGLAIGFDDRLSRIVRVHQEGQKAPVEPG-GPLAQYPVRVVLGFSSADRELVRDRLLRYLNR 155 (155) Q Consensus 81 ~~~l~~sl~~~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~-~~~v~iPaRp~LG~s~~d~~~I~~~i~~~l~r 155 (155) +++.+++|++++++|+++|||+|+|.+||+|||||++++++.+ ..+|+||||||||||++|+++|+++|.+||+. T Consensus 76 ~L~~a~~l~~~a~~~~~~Vg~~Gt~~~yAaiHQfG~~~r~~~~~~~~v~iPaRp~LG~s~~d~~~I~~~i~~~l~~ 151 (152) T protein:vir:10 76 KITQPRFMRLRLESEGVSLGYEGGDAVIARIHQQGLIGRVRKDWDLKVKYASRELLGFTDDDLQMIEDYMINILAG 151 (152) T ss_pred hhhhcceeeeeecCcEEEEEecCCchhhhhhhccCccccccCCCCcceeccccccCCCCHHHHHHHHHHHHHHHhc Confidence 9999999999999999999999999999999999999988654 55899999999999999999999999999999 No 4 >protein:vir:98557 Length: 149 # NCBI annotation: gp14 # Family: family:all:370 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958069;genbank:gi:41057366;genbank:GeneID:2744228 Probab=100.00 E-value=3.1e-52 Score=302.99 Aligned_cols=149 Identities=35% Similarity=0.601 Sum_probs=140.3 Q ss_pred CchhHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhhhh Q lcl|NC_015266. 1 MDDDLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAMFR 80 (155) Q Consensus 1 m~~~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l~~ 80 (155) |+ ||++|+++|+.++++|+|+++++||++||++|+.+|++||++|++|||+||+|+++.+..++.. ...++|+. 
T Consensus 1 m~-d~~~l~~~L~~ll~~L~~~~~~~ll~~Ig~~l~~~t~~rf~~q~~PdG~~W~p~~~~~~~~k~~-----~~~~~l~~ 74 (149) T protein:vir:98 1 MS-ELTALQERLTGLIASLSPAARRQMAADIAKKLRASQQQRIRRQQAPDGTPYAARKRQSVRSKKG-----RIRREMFA 74 (149) T ss_pred Cc-hHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHHhccC-----CCCcccch Confidence 99 6999999999999999999999999999999999999999999999999999999987654322 23457999 Q ss_pred hhhhcceeeEEEcCcEEEEEecccccccccccccCccccccCCCceeeecCccccCCCHHHHHHHHHHHHHHhcC Q lcl|NC_015266. 81 KLRTARYLRIDVDDTGLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSSADRELVRDRLLRYLNR 155 (155) Q Consensus 81 ~~~l~~sl~~~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~~d~~~I~~~i~~~l~r 155 (155) ++++++||++++++|+++|+|+|+|.+||++||||++++|++++++|+||||||||||++|+++|+++|.+||+| T Consensus 75 ~g~l~~sl~~~~~~~~~~V~~~Gs~~~yAa~HQfG~~~r~~~~~~~~~iPaRp~LG~s~~d~~~i~~~i~~~l~~ 149 (149) T protein:vir:98 75 RLRTNRFMKAKGSDSAAVVEFTGRVQRMARVHQYGLKDRPNRHSRDVQYAARPLLGFTRDDEQMIEDIIIRHLGK 149 (149) T ss_pred hhhhhhhhhheecCCeeEEEecCcchHHhhHhhccccccccCCCcceeccccccCCCCHHHHHHHHHHHHHHhhC Confidence 999999999999999999999999999999999999999999999999999999999999999999999999999 No 5 >protein:vir:5703 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839862;genbank:gi:30065717;genbank:GeneID:1260611 Probab=100.00 E-value=8.1e-52 Score=300.69 Aligned_cols=149 Identities=35% Similarity=0.629 Sum_probs=139.9 Q ss_pred CchhHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhhhh Q lcl|NC_015266. 1 MDDDLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAMFR 80 (155) Q Consensus 1 m~~~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l~~ 80 (155) |+ ||++|+++|..++.+|+|+++++||++||++|+.+|++||++|++|||+||+|+++.+...+. + ...+.|+. 
T Consensus 1 m~-~~~~l~~~L~~~l~~L~~~~~~~l~~~Ig~~l~~~~~~rf~~q~~PdG~~W~p~k~~~~~~k~----~-~~~~~l~~ 74 (150) T protein:vir:57 1 MN-EFKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSARKKT----G-RVKRKMFA 74 (150) T ss_pred Cc-hHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccChHHHHHhc----c-CCCcccch Confidence 76 999999999999999999999999999999999999999999999999999999998765542 2 22357999 Q ss_pred hhhhcceeeEEEcCcEEEEEe-cccccccccccccCccccccCCCceeeecCccccCCCHHHHHHHHHHHHHHhcC Q lcl|NC_015266. 81 KLRTARYLRIDVDDTGLAIGF-DDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSSADRELVRDRLLRYLNR 155 (155) Q Consensus 81 ~~~l~~sl~~~~~~~~~~v~~-~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~~d~~~I~~~i~~~l~r 155 (155) ++++++||++++++|+++|+| .|+|.+||++||||+++++++++++|+||||||||||++|+++|+++|.+||+| T Consensus 75 ~~~l~~sl~~~~~~~~a~vg~~~G~~~~yAaiHQfG~~~r~~~~~~~~~iPaRp~LG~s~~d~~~i~~~i~~~l~r 150 (150) T protein:vir:57 75 KLITSRFLHIRASPEQASMEFYGGKSPKIASVHQFGLSEETRKDGKKIDYPARPLLGFTGEDVQMIEEIILAHLDR 150 (150) T ss_pred hhhhccceeeeeeCcEEEEEeecCCchhhhhhhhccccccccCCCceeecCCcccCCCCHHHHHHHHHHHHHHHhC Confidence 999999999999999999998 699999999999999999999999999999999999999999999999999999 No 6 >protein:vir:2026 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046769;genbank:gi:9630340;genbank:GeneID:1261511 Probab=100.00 E-value=6.1e-52 Score=301.35 Aligned_cols=149 Identities=36% Similarity=0.628 Sum_probs=139.8 Q ss_pred CchhHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhhhh Q lcl|NC_015266. 1 MDDDLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAMFR 80 (155) Q Consensus 1 m~~~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l~~ 80 (155) |+ ||++|+++|+.++++|+|+++++||++||++|+.+|++||++|++|||+||+|+++.+...+. |+ ..++|+. 
T Consensus 1 ~~-~~~~l~~~L~~ll~~l~~~~~~~l~~~Ig~~l~~~~~~rf~~q~~PdG~~W~p~k~~~~~~k~----g~-~~~~l~~ 74 (150) T protein:vir:20 1 MN-EFKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSVRKKT----GR-VKRKMFA 74 (150) T ss_pred Cc-hHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHHHhc----cC-CCccccc Confidence 76 999999999999999999999999999999999999999999999999999999998765432 22 2357999 Q ss_pred hhhhcceeeEEEcCcEEEEEe-cccccccccccccCccccccCCCceeeecCccccCCCHHHHHHHHHHHHHHhcC Q lcl|NC_015266. 81 KLRTARYLRIDVDDTGLAIGF-DDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSSADRELVRDRLLRYLNR 155 (155) Q Consensus 81 ~~~l~~sl~~~~~~~~~~v~~-~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~~d~~~I~~~i~~~l~r 155 (155) ++++++||++++++|+++|+| .|+|.+||++||||+++++++++++|+||||||||||++|+++|+++|.+||+| T Consensus 75 ~~~l~~sl~~~~~~~~~~vg~~~Gs~~~yAa~HQfG~~~~~~~~~~~~~iPaRp~LG~s~~d~~~i~~~i~~~l~k 150 (150) T protein:vir:20 75 KLITSRFLHIRASPEQASMEFYGGKSPKIASVHQFGLSEENRKDGKKIDYPARPLLGFTGEDVQMIEEIILAHLER 150 (150) T ss_pred hhhhhhhhheeecCcEEEEEeeCCcchhhhhhhhcccccccccCCCceeccccccCCCCHHHHHHHHHHHHHHHhC Confidence 999999999999999999998 699999999999999999999999999999999999999999999999999999 No 7 >protein:vir:6071 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878212;genbank:gi:33438911;genbank:GeneID:1457746 Probab=100.00 E-value=1.4e-51 Score=299.45 Aligned_cols=149 Identities=34% Similarity=0.607 Sum_probs=139.9 Q ss_pred CchhHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhhhh Q lcl|NC_015266. 1 MDDDLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAMFR 80 (155) Q Consensus 1 m~~~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l~~ 80 (155) |+ ||++++++|..++.+|+|+++++||++||++|+.+|++||++|++|||+||+|+++.+..++.. ...++|+. 
T Consensus 1 ~~-~~~~l~~~L~~~l~~L~~~~~~~l~r~Ig~~l~~~~~~Rf~~q~~PdG~~W~p~~~~~~~~k~~-----~~~~~l~~ 74 (150) T protein:vir:60 1 MN-EFKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSARKKTG-----RVKRKMFA 74 (150) T ss_pred Cc-hHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccChHHHHHhhc-----CCCccchh Confidence 76 9999999999999999999999999999999999999999999999999999999987655421 22357999 Q ss_pred hhhhcceeeEEEcCcEEEEEe-cccccccccccccCccccccCCCceeeecCccccCCCHHHHHHHHHHHHHHhcC Q lcl|NC_015266. 81 KLRTARYLRIDVDDTGLAIGF-DDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSSADRELVRDRLLRYLNR 155 (155) Q Consensus 81 ~~~l~~sl~~~~~~~~~~v~~-~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~~d~~~I~~~i~~~l~r 155 (155) ++++++||++++++|+++|+| .|+|.+||++||||+++++.++.++|+||||||||||++|+++|+++|.+||+| T Consensus 75 ~~~l~~sl~~~~~~~~a~vg~~~Gt~~~yAaiHQfG~~~~~~~~~~~~~iPaRp~LG~s~~d~~~i~~~i~~~l~r 150 (150) T protein:vir:60 75 KLITSRFLHIRASPEQASMEFYGGKSPKIASVHQFGLSEENRKDGKKIDYPARPLLGFTGEDVQMIEEIILAHLDR 150 (150) T ss_pred hhhhcceeeeeeeCcEEEEEeeCCCchhhhhhhhccccccccCCCCceecCCcccCCCCHHHHHHHHHHHHHHHhC Confidence 999999999999999999998 699999999999999999999999999999999999999999999999999999 No 8 >protein:vir:1838 Length: 149 # NCBI annotation: O protein # Family: family:all:370 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052262;genbank:gi:9634069;genbank:GeneID:1262457 Probab=100.00 E-value=1.2e-51 Score=299.71 Aligned_cols=149 Identities=36% Similarity=0.630 Sum_probs=139.9 Q ss_pred CchhHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhhhh Q lcl|NC_015266. 1 MDDDLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAMFR 80 (155) Q Consensus 1 m~~~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l~~ 80 (155) |+ ||++++++|..++++|+|+++++||++||++|+.+|++||++|++|||+||+|+++.+...+ +|. ..+.|+. 
T Consensus 1 m~-~~~~~~~~l~~ll~~L~~~~~~~l~r~Ig~~l~~~t~~rf~~q~~PdG~~W~p~~~~~~~~~----~g~-~~~~~~~ 74 (149) T protein:vir:18 1 MS-ELTALQERLAGLIASLSPAARRKMAAEIAKKLRTSQQQRIKRQQAPDGTPYAARKRQPVRSK----KGR-IKREMFA 74 (149) T ss_pred Cc-hHHHHHHHHHHHHHhcCCchHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchhhhhhc----cCc-ccchhhh Confidence 98 79999999999999999999999999999999999999999999999999999999875433 222 2356999 Q ss_pred hhhhcceeeEEEcCcEEEEEecccccccccccccCccccccCCCceeeecCccccCCCHHHHHHHHHHHHHHhcC Q lcl|NC_015266. 81 KLRTARYLRIDVDDTGLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSSADRELVRDRLLRYLNR 155 (155) Q Consensus 81 ~~~l~~sl~~~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~~d~~~I~~~i~~~l~r 155 (155) +++++++|++.++++++.|+|.|+|.+||++||||+++++++++++|+||||||||||++|+++|+++|.+||+| T Consensus 75 ~l~~~~~l~~~~~~~~~~v~~~Gtn~~yAaiHQfG~~~r~~~~~~~v~iPaRp~LG~s~~d~~~I~~~i~~~l~~ 149 (149) T protein:vir:18 75 KLRTSRFMKAKGSDSAAVVEFTGKVQRMARVHQYGLKDRPNRNSRDVQYEARPLLGFTRDDEQMIEDVIISHLGK 149 (149) T ss_pred hhhhhhhhheeecCceeEEEecccchhhhhhhhccccccccCCCccccccccccCCCCHHHHHHHHHHHHHHHhC Confidence 999999999999999999999999999999999999999999999999999999999999999999999999999 No 9 >protein:vir:79115 Length: 148 # NCBI annotation: tail completion protein gpS # Family: family:all:370 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165266;genbank:gi:145708091;genbank:GeneID:5247126 Probab=100.00 E-value=5.2e-51 Score=296.26 Aligned_cols=148 Identities=43% Similarity=0.688 Sum_probs=138.8 Q ss_pred CchhHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhhhh Q lcl|NC_015266. 1 MDDDLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAMFR 80 (155) Q Consensus 1 m~~~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l~~ 80 (155) |+ ||++|+++|..++++|+|.++++||++||++|+.+|++||++|++|||+||+|+++.+..++ |+ ..+.|+. 
T Consensus 1 m~-~~~~l~~~L~~ll~~l~~~~~~~l~r~Ig~~l~~st~~Rf~~q~~PDG~~W~p~s~~~~~~~-----g~-~~~~~~~ 73 (148) T protein:vir:79 1 MS-ESRELEAWLAGMLTKLDAPARRMLARAVAAELRRRQAARIAEQRNPDGSPYVPRKPQLRHRA-----GR-IRRAMFM 73 (148) T ss_pred Cc-cHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCcCcccchHHHhhc-----cc-ccccccc Confidence 98 59999999999999999999999999999999999999999999999999999998764332 22 2356999 Q ss_pred hhhhcceeeEEEcCcEEEEEecccccccccccccCccccccCCCceeeecCccccCCCHHHHHHHHHHHHHHhcC Q lcl|NC_015266. 81 KLRTARYLRIDVDDTGLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSSADRELVRDRLLRYLNR 155 (155) Q Consensus 81 ~~~l~~sl~~~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~~d~~~I~~~i~~~l~r 155 (155) +++++++|++.++++++.|+|.|+|.+||++||||++++|++++++|+||||||||||++|+++|+++|.+||+= T Consensus 74 ~l~~~~~l~~~~~~~~~~v~~~Gt~~~yAaiHQfG~~~r~~~~~~~v~iPaRp~LG~s~~d~~~i~~~i~~~l~~ 148 (148) T protein:vir:79 74 RLRLARYMKTQADANTAVVTFAGNAQRIATVHQFGLRDRVNKAGLTAQYPARELLGMDGVDMEHITNLLLLHLGA 148 (148) T ss_pred hhhhhhheeeeeeCCeeeEEeeccchhhhhhhhcCccccccCCCCccccCcccccCCCHHHHHHHHHHHHHHhcC Confidence 999999999999999999999999999999999999999999999999999999999999999999999999999 No 10 >protein:vir:99833 Length: 190 # NCBI annotation: hypothetical protein # Family: family:all:274 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164071;genbank:gi:56692603;genbank:GeneID:3192561 Probab=100.00 E-value=5.3e-44 Score=257.83 Aligned_cols=145 Identities=23% Similarity=0.283 Sum_probs=128.9 Q ss_pred Cch-----hHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccc Q lcl|NC_015266. 1 MDD-----DLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKR 75 (155) Q Consensus 1 m~~-----~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~ 75 (155) |.. |+.++.++|..++..|+ ++++||++||+.|+.++++||++|++|||+||+|+++.+..++.. ... 
T Consensus 1 M~~i~i~~d~~~~~~~L~~l~~~~~--~~~~l~~~ig~~l~~~~~~rf~~~~~PdG~~W~p~~~~t~~rk~~-----~~~ 73 (190) T protein:vir:99 1 MAGITLEWDGRRALDVLNAGSAALG--DPSGLLQDIGELLLNIHRRRFQAQVSPDGTPWQPLSPAYLRRKRK-----NRD 73 (190) T ss_pred CceeEEEecHHHHHHHHHHHHHHhh--hHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCccccHHHHHHhhc-----CCC Confidence 443 78899999999999996 457899999999999999999999999999999999998765432 235 Q ss_pred hhhhhhhhhcceeeEEEcCcEEEEEecccccccccccccCccccccC--------------------------------- Q lcl|NC_015266. 76 QAMFRKLRTARYLRIDVDDTGLAIGFDDRLSRIVRVHQEGQKAPVEP--------------------------------- 122 (155) Q Consensus 76 ~~l~~~~~l~~sl~~~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~--------------------------------- 122 (155) ++|++++.|.+||++++++|+|.| |+|.+||+|||||+++.+.. T Consensus 74 ~~L~~tg~L~~Si~~~~~~~~v~v---Gtn~~yA~iHq~Gg~i~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~ 150 (190) T protein:vir:99 74 KILTLDGHLRNLLRYQLDGSELLF---GSDRPYAAIHHFGGTIQRQARSSTVYFRQNERTGEVGREFVPRRRSNFAQDVQ 150 (190) T ss_pred ccceecHHHHHHHhheecCcEEEE---ecCcchhhhhhcCCcccccccchhhhhhhhhhhhhhhcccccccccccchhcc Confidence 789999999999999999999999 99999999999999876643 Q ss_pred -CCceeeecCccccCCCHHHHHHHHHHHHHHhcC Q lcl|NC_015266. 123 -GGPLAQYPVRVVLGFSSADRELVRDRLLRYLNR 155 (155) Q Consensus 123 -~~~~v~iPaRp~LG~s~~d~~~I~~~i~~~l~r 155 (155) ++++|+||||||||||++|+++|+++|.+||.+ T Consensus 151 ~~~~~v~IPaRpfLG~s~~d~~~I~~~i~~~l~~ 184 (190) T protein:vir:99 151 IGPYTIQMPARPWLGTSSQDDDTILQRVERYLQR 184 (190) T ss_pred cccceeeecCcccCCCCHHHHHHHHHHHHHHHHH Confidence 234689999999999999999999999999999 No 11 >protein:vir:1988 Length: 156 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050635;genbank:gi:9633522;genbank:GeneID:2636282 Probab=100.00 E-value=1.4e-42 Score=250.00 Aligned_cols=145 Identities=16% Similarity=0.151 Sum_probs=124.5 Q ss_pred Cc------hhHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCC-CCcCcccchhhhhhccccccCcc Q lcl|NC_015266. 
1 MD------DDLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPD-GSAYVPRKIKKGGKGLRTKVGRI 73 (155) Q Consensus 1 m~------~~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PD-G~pW~p~k~~~~~~~~~~~~~~~ 73 (155) |+ .|+.++...|..|...+ +.+++|++||+.|+.++++||++|++|| |+||+|+++.+..++ .+.+.. T Consensus 1 ms~~i~~~~d~~~l~~~L~~l~~~~---~~~~l~~~Ig~~l~~~~~~rf~~~~~Pd~G~~W~pls~~t~~~r--~~~~~~ 75 (156) T protein:vir:19 1 MSLDMNVAVDVRRIQLALDELGTVT---RDRAIPRVMAAALLSSTEQAFERQADPDTGKGWEAWSDSWLAWR--QDHGFV 75 (156) T ss_pred CeEEEEEeecHHHHHHHHHHHHhhh---ccHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCCcccChHHHHHh--hccCCC Confidence 33 35566777776654432 3468999999999999999999999998 999999999987664 334445 Q ss_pred cchhhhhhhhhcceeeEEEcCcEEEEEecccccccccccccCccccccCCCceeeecCccccCCCHHHHHHHHHHHHHHh Q lcl|NC_015266. 74 KRQAMFRKLRTARYLRIDVDDTGLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSSADRELVRDRLLRYL 153 (155) Q Consensus 74 ~~~~l~~~~~l~~sl~~~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~~d~~~I~~~i~~~l 153 (155) ..++|.++++|.+||++++++|+|+| |+|.+||++||||++..+ +.+.|+||||||||||++|+++|+++|.+|| T Consensus 76 ~~~~L~~tg~L~~Si~~~~~~~~v~v---Gt~~~yA~vHqfG~~~~~--~~~~~~iPaRpfLG~s~~d~~~I~~~i~~~l 150 (156) T protein:vir:19 76 PGSILTLHGDLARSITTDYGQDYALI---GSPKIYAAIHQWGGTPDM--APRPAGVPARPYMGLDKTGEQEIFDAIRKRV 150 (156) T ss_pred CCcchhhhHHHHHHhhheecCCEEEE---ecchhhhHHhhcCccccc--CCCccccCCccccCCCHHHHHHHHHHHHHHH Confidence 56799999999999999999999999 999999999999999864 4457899999999999999999999999999 Q ss_pred cC Q lcl|NC_015266. 
154 NR 155 (155) Q Consensus 154 ~r 155 (155) .+ T Consensus 151 ~~ 152 (156) T protein:vir:19 151 SA 152 (156) T ss_pred HH Confidence 99 No 12 >protein:vir:107851 Length: 175 # NCBI annotation: gp31 # Family: family:all:274 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024704;genbank:gi:48696941;genbank:GeneID:2845939 Probab=100.00 E-value=3.9e-41 Score=242.09 Aligned_cols=147 Identities=22% Similarity=0.182 Sum_probs=122.6 Q ss_pred Cch------hHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhc--------- Q lcl|NC_015266. 1 MDD------DLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKG--------- 65 (155) Q Consensus 1 m~~------~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~--------- 65 (155) |+. |..++...|..|...+. ++++||++||+.|+.+|++||++|++|||+||.|+..+...+. T Consensus 1 Ms~~i~i~~~~~~l~~~L~~l~~~~~--d~~~l~~~Ig~~l~~~t~~rF~~e~~Pdw~p~~p~t~~~r~~~g~~~~k~~~ 78 (175) T protein:vir:10 1 MSDFVNFQIDDSALRTRLLQLEQAGH--QKAGAMRKIAQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKNGE 78 (175) T ss_pred CceeEEEEecHHHHHHHHHHHHHHhc--cHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchhhhhhhhcccccchhhhh Confidence 664 44678888888877774 4578999999999999999999999999999999875432111 Q ss_pred --cccccCcccchhhhhhhhhcceeeEEEcCcEEEEEecccccccccccccCccccccCCCceeeecCccccCCCHHHH- Q lcl|NC_015266. 66 --LRTKVGRIKRQAMFRKLRTARYLRIDVDDTGLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSSADR- 142 (155) Q Consensus 66 --~~~~~~~~~~~~l~~~~~l~~sl~~~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~~d~- 142 (155) ...+++....++|.+++.|.+||++++++++|.| |||.+||+|||||++. 
+++++|+||||||||||++|+ T Consensus 79 ~~~~~~~~~~~~~~L~~tG~L~~Si~~~~~~~~v~v---Gtn~~YAaiHqfGg~~---~~~~~v~iPaRpfLG~s~~d~~ 152 (175) T protein:vir:10 79 LTAAASRRKAGLMILQDSGQMAASVSTDHDDNSAVI---GSNKEYAAIHQFGGQA---GRGLKVTIPARPWLPVTADGEL 152 (175) T ss_pred hhhhhhhhccCCCcceechhhhhhhheeecCCEEEE---ecChhhhhhhhccccc---CCCCccccCCccccCCCccccc Confidence 1122334567789999999999999999999999 9999999999999986 456679999999999998775 Q ss_pred -----HHHHHHHHHHhcC Q lcl|NC_015266. 143 -----ELVRDRLLRYLNR 155 (155) Q Consensus 143 -----~~I~~~i~~~l~r 155 (155) ++|++++.+||.+ T Consensus 153 ~~e~~~~Il~~~~~~l~~ 170 (175) T protein:vir:10 153 QPEAVEPVLNTILRHLMD 170 (175) T ss_pred chHHHHHHHHHHHHHHHH Confidence 8899999999998 No 13 >protein:vir:99196 Length: 155 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950453;genbank:gi:119953654;genbank:GeneID:4643056 Probab=100.00 E-value=1.2e-40 Score=239.48 Aligned_cols=141 Identities=17% Similarity=0.177 Sum_probs=124.9 Q ss_pred Cch------hHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCccc Q lcl|NC_015266. 1 MDD------DLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIK 74 (155) Q Consensus 1 m~~------~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~ 74 (155) |+. |..++.++|..|...+. ++++||++||+.|+.++++||+ |||+||+|+++.+...+ .++|+.. T Consensus 1 Ms~~i~i~~d~~~~~~~L~~l~~~~~--d~~~l~~~ig~~l~~~~~~rF~----pdG~~W~pls~~t~~~r--~~~g~~~ 72 (155) T protein:vir:99 1 MTTRIDVELDDQEVRQRLALLMRSVT--DTLPVMRGIAAELLAETEFAFM----DEGPGWPQLSPVTVAAR--EAKGRGP 72 (155) T ss_pred CceEEEEEechHHHHHHHHHHHHHhh--hHHHHHHHHHHHHHHHHHHHhh----ccCCCCCCCChHHHHHH--hccCCCC Confidence 765 55788889999988885 5688999999999999999994 99999999999987664 3455566 Q ss_pred chhhhhhhhhcceeeEEEcCcEEEEEecccccccccccccCccccccCCCceeeecCccccCCCH------HHHHHHHHH Q lcl|NC_015266. 
75 RQAMFRKLRTARYLRIDVDDTGLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSS------ADRELVRDR 148 (155) Q Consensus 75 ~~~l~~~~~l~~sl~~~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~------~d~~~I~~~ 148 (155) .++|.+++.|.+||++++++++|.| |+|.+||+|||||+++ +..++|+||||||||+|+ +|+++|+++ T Consensus 73 ~~iL~~tg~L~~Si~~~~~~~~v~v---Gtn~~YA~iHqfGg~~---~~~~~v~iPaRpfLG~s~~~~l~~e~~~~I~~~ 146 (155) T protein:vir:99 73 HPILQVTNALARSVTTWADRNEAGI---GSNLVYAAIHQFGGDA---GRGHQVEIPARRYLPFDENGQLAAGARQSILEI 146 (155) T ss_pred CCcchhchhhhhhhhceecCCEEEE---ecCccchhhhhccccc---CCCCccccCCccccCCCCccccchHHHHHHHHH Confidence 7899999999999999999999999 9999999999999986 345679999999999995 678999999 Q ss_pred HHHHhcC Q lcl|NC_015266. 149 LLRYLNR 155 (155) Q Consensus 149 i~~~l~r 155 (155) |.+||+| T Consensus 147 i~~~l~~ 153 (155) T protein:vir:99 147 VLTALSR 153 (155) T ss_pred HHHHHhc Confidence 9999999 No 14 >protein:vir:79091 Length: 175 # NCBI annotation: gp5, phage virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111205;genbank:gi:134288802;genbank:GeneID:4960765 Probab=100.00 E-value=6.6e-41 Score=240.85 Aligned_cols=147 Identities=18% Similarity=0.115 Sum_probs=121.1 Q ss_pred Cch------hHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhh-----------h Q lcl|NC_015266. 1 MDD------DLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKG-----------G 63 (155) Q Consensus 1 m~~------~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~-----------~ 63 (155) |+. |..++..+|..|...+. +++++|++||+.|+.+|++||++|++|||.||.|+..... . 
T Consensus 1 Ms~~i~i~~d~~~~~~~L~~l~~~~~--d~~~lm~~Ig~~l~~~t~~rF~~~~~PdW~pls~~t~~~r~~~~~~~~~~~~ 78 (175) T protein:vir:79 1 MSDFVNFQIDDSALRTRLLQLEQAGH--QKADAMRKITQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKNGE 78 (175) T ss_pred CceEEEEEechHHHHHHHHHHHHHhc--CHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCChHHHHhhcccccccccccc Confidence 775 44678888999888884 4578999999999999999999999999666666442111 0 Q ss_pred hccccccCcccchhhhhhhhhcceeeEEEcCcEEEEEecccccccccccccCccccccCCCceeeecCccccCCCHHHH- Q lcl|NC_015266. 64 KGLRTKVGRIKRQAMFRKLRTARYLRIDVDDTGLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSSADR- 142 (155) Q Consensus 64 ~~~~~~~~~~~~~~l~~~~~l~~sl~~~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~~d~- 142 (155) .....++.....++|.+++.|.+||++++++|+|.| |||.+||+|||||+.. +++.+|+||||||||||++|+ T Consensus 79 ~~~~~~~~~~~~~~L~~tG~L~~Si~~~~~~~~v~v---Gtn~~YAaiHqfGg~~---~~~~~v~IPARPfLG~s~~de~ 152 (175) T protein:vir:79 79 LTAAASRRKAGLMILQDSGQMAASTATDSGEDYSVI---GSNKEYAAIQHFGGQA---GRGLKVTIPGRAWLPVTADGEL 152 (175) T ss_pred chhhHhhhccCCCcceechhhhhhhhheecCCEEEE---ecCcchhhHhhccccc---CCCcccccCcccccCCCcccch Confidence 111112334567789999999999999999999999 9999999999999975 456789999999999999995 Q ss_pred -----HHHHHHHHHHhcC Q lcl|NC_015266. 143 -----ELVRDRLLRYLNR 155 (155) Q Consensus 143 -----~~I~~~i~~~l~r 155 (155) ++|+++|.+||.+ T Consensus 153 ~~~~~~~I~~~i~~~l~~ 170 (175) T protein:vir:79 153 QPEAVEPVLNTILRHLMD 170 (175) T ss_pred hHHHHHHHHHHHHHHHHH Confidence 8899999999999 No 15 >protein:vir:79225 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469157;genbank:gi:157835000;genbank:GeneID:5648806 Probab=100.00 E-value=1.1e-39 Score=234.15 Aligned_cols=141 Identities=17% Similarity=0.177 Sum_probs=123.1 Q ss_pred Cch------hHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCccc Q lcl|NC_015266. 
1 MDD------DLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIK 74 (155) Q Consensus 1 m~~------~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~ 74 (155) |+. |..++..+|..|...+. +.++||++||+.|+.+|++||+ |||+||+|+++.+...+. ++|+.. T Consensus 1 M~~~i~i~~d~~~~~~~L~~l~~~~~--d~~~l~~~ig~~l~~~~~~rF~----~eG~~W~pls~~t~~~r~--~~g~~~ 72 (155) T protein:vir:79 1 MTTRIDVELDDQEVRQRLAVLMRSVT--DTLPVMRGIAAELLAETEFAFM----DEGPGWPQLSPATVAARE--AKGRGP 72 (155) T ss_pred CceEEEEEechHHHHHHHHHHHHHhh--hHHHHHHHHHHHHHHHHHHHhh----ccCCCCCCCCHHHHHHHh--ccCCCC Confidence 554 45788888888888874 5678999999999999999995 899999999999876543 445556 Q ss_pred chhhhhhhhhcceeeEEEcCcEEEEEecccccccccccccCccccccCCCceeeecCccccCCCH------HHHHHHHHH Q lcl|NC_015266. 75 RQAMFRKLRTARYLRIDVDDTGLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSS------ADRELVRDR 148 (155) Q Consensus 75 ~~~l~~~~~l~~sl~~~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~------~d~~~I~~~ 148 (155) .++|.++++|.+||+++++++.|.| |||.+||+|||||+++ +..++|+||||||||+|+ +|+++|+++ T Consensus 73 ~~iL~~tG~L~~Si~~~~~~~~v~v---Gt~~~YA~iHqfGg~~---~~~~~v~iPaRpfLG~s~~~~l~~~~~~~I~~~ 146 (155) T protein:vir:79 73 HPILQVTNALARSVTTWADRNEAGI---GSNLVYAAIHQFGGDA---GRGHQVEIPARRYLPFDENGQLAAGARQSILEV 146 (155) T ss_pred CCccccchhhhhhhhceecCCEEEE---ecCchhhhhhhccccc---CCCCccccCCccccCCCCccccchHHHHHHHHH Confidence 7899999999999999999999999 9999999999999986 345578999999999996 556999999 Q ss_pred HHHHhcC Q lcl|NC_015266. 
149 LLRYLNR 155 (155) Q Consensus 149 i~~~l~r 155 (155) |.+||.| T Consensus 147 i~~~l~r 153 (155) T protein:vir:79 147 VLTALSR 153 (155) T ss_pred HHHHHHh Confidence 9999999 No 16 >protein:vir:103841 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938236;genbank:gi:38229141;genbank:GeneID:2648156 Probab=100.00 E-value=1.3e-39 Score=233.84 Aligned_cols=141 Identities=18% Similarity=0.172 Sum_probs=120.5 Q ss_pred Cch------hHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCccc Q lcl|NC_015266. 1 MDD------DLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIK 74 (155) Q Consensus 1 m~~------~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~ 74 (155) |+. |..++..+|..|...+. +.++||++||+.|+.+|++||+ |||+||+|+++.+..+. .++|+.. T Consensus 1 Ms~~i~i~~~~~~~~~~L~~l~~~~~--~~~~l~~~ig~~l~~~~~~rF~----p~G~~W~plsp~t~~~r--~k~g~~~ 72 (155) T protein:vir:10 1 MANRIELELVDREVQERLAALYAAVT--DTLPLMRGIAAELLAETEFAFM----DEGPGWPQLSPVTVAAR--AAKGRGA 72 (155) T ss_pred CCceEEEEechHHHHHHHHHHHHHhh--hHHHHHHHHHHHHHHHHHHHHh----hcCCCCCCCCccchHHH--HhccCCC Confidence 664 55678888888888774 5678999999999999999995 99999999999887553 3455566 Q ss_pred chhhhhhhhhcceeeEEEcCcEEEEEecccccccccccccCccccccCCCceeeecCccccCCCHHHH------HHHHHH Q lcl|NC_015266. 75 RQAMFRKLRTARYLRIDVDDTGLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSSADR------ELVRDR 148 (155) Q Consensus 75 ~~~l~~~~~l~~sl~~~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~~d~------~~I~~~ 148 (155) .++|.+++.|.+||++++++|+|.| |+|.+||+|||||++. 
+..++++||||||||||++|+ ++|.++ T Consensus 73 ~~~L~~tG~L~~Si~~~~~~~~v~v---Gtn~~YA~iHqfGg~~---~~~~~~~iPARPfLG~s~~~e~~~ei~~~I~~~ 146 (155) T protein:vir:10 73 HPILQVTNALARSITTRADRDQAQI---GSNLSYAAIQQLGGQA---GRGRKVTIPARPYLPVLRNGQLKPSARDAVLDV 146 (155) T ss_pred CCccccchhhhhhhhceecCCEEEE---ecCcchhhhhhccccc---CCCCccccCCccccCCCccccchHHHHHHHHHH Confidence 7899999999999999999999999 9999999999999986 345678999999999997664 778888 Q ss_pred HHHHhcC Q lcl|NC_015266. 149 LLRYLNR 155 (155) Q Consensus 149 i~~~l~r 155 (155) |.+||.| T Consensus 147 i~~~l~~ 153 (155) T protein:vir:10 147 LLAALSQ 153 (155) T ss_pred HHHHHhh Confidence 8888887 No 17 >protein:vir:3163 Length: 145 # NCBI annotation: unknown # Family: family:all:28417 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665934;genbank:gi:22091120;genbank:GeneID:951270 Probab=99.96 E-value=2.3e-34 Score=205.02 Aligned_cols=132 Identities=14% Similarity=0.098 Sum_probs=109.0 Q ss_pred CchhHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhhhh Q lcl|NC_015266. 1 MDDDLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAMFR 80 (155) Q Consensus 1 m~~~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l~~ 80 (155) |=++...+.+.+..+...+.+ .|.+|++.+..++++||+++.+|||+||+|+++.+.+++. ..++|.+ T Consensus 1 ~i~~~~~i~~~l~~l~~~~~~-----~l~~i~~~~~~~~~~rf~~~~~p~G~~W~pLs~st~a~k~-------~~~~L~~ 68 (145) T protein:vir:31 1 MVEDENNIPEAREAIQDGLTD-----GLERLHTITLRELITNMSDGQDALGNPWEPLKESTIRAKG-------SDTPLID 68 (145) T ss_pred CcccHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHhcCCCCCCCCcccChHHHHHhc-------CCCCCcc Confidence 888888888888877766654 5899999999999999999999999999999998865542 2468999 Q ss_pred hhhhcceeeEEE----cCcEEEEEecccccccccccccCccccccCCCceeeecCccccCCCHHHH-HHHHHHHHHHhcC Q lcl|NC_015266. 
81 KLRTARYLRIDV----DDTGLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSSADR-ELVRDRLLRYLNR 155 (155) Q Consensus 81 ~~~l~~sl~~~~----~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~~d~-~~I~~~i~~~l~r 155 (155) +++|.+||++++ +++++.| |||.+||++||||+. +++||||||||++.+|. ++|.++|.+++.+ T Consensus 69 tG~L~~Si~~~~~~~~~~~~a~v---Gtn~~YA~~hqfG~~--------~~~IPaRPfLG~~~~~~~~~~~~ii~~~i~~ 137 (145) T protein:vir:31 69 NSRLLTDINAASMMDRANRMAVI---GTNLDYAEHHEFGAP--------EAGIPARPIFGPAGAYASQQAPDVIGDEIDT 137 (145) T ss_pred CHHHHHHHHHHhhhcccCceeEe---cCCchhhhhhccCCc--------ccccCCCCccCCCccchHHHHHHHHHHHHHH Confidence 999999998765 4566887 999999999999974 57899999999998764 4666666666555 No 18 >protein:vir:3787 Length: 231 # NCBI annotation: orf22 # Family: family:all:743 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536827;genbank:gi:17981836;genbank:GeneID:929215 Probab=99.94 E-value=1e-30 Score=185.00 Aligned_cols=144 Identities=19% Similarity=0.237 Sum_probs=115.7 Q ss_pred CchhHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhhhh Q lcl|NC_015266. 1 MDDDLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAMFR 80 (155) Q Consensus 1 m~~~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l~~ 80 (155) |.+|+.+|.++|..| .|+|+.|+.|++.||..|+.++++||++|++|||+||+|+++. +++.+...||. T Consensus 8 n~~dl~~l~~~L~ll--~L~p~kRrrLl~~iak~lr~~~~~rI~~Q~~PDGs~w~pRK~~---------~~k~k~~rm~~ 76 (231) T protein:vir:37 8 KQEDLDAFVRDLRTL--NLTGKQKKKILTWTLGAIKRKSQKNIREQHSPDGTAWEKRKPV---------DGEIKNKRLLK 76 (231) T ss_pred CHHHHHHHHHHHHHh--cCCHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCcCchhccc---------ccchhhHHHHH Confidence 889999999999944 8999999999999999999999999999999999999998742 22233446888 Q ss_pred hhhhcceeeEEEcCcEEEEEecccccccccccccCccccccC---------------------------------C---- Q lcl|NC_015266. 
81 KLRTARYLRIDVDDTGLAIGFDDRLSRIVRVHQEGQKAPVEP---------------------------------G---- 123 (155) Q Consensus 81 ~~~l~~sl~~~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~---------------------------------~---- 123 (155) ++....++....+++.+.+.|.|....+|++||||+++.++. . T Consensus 77 kL~~~~~~~~~~~~~~~~~~~~g~~~~IA~vHQ~G~~~rv~~~~~~~~~~~~~~~pATr~QAk~Lr~lGy~v~~~k~k~~ 156 (231) T protein:vir:37 77 KVLRYASILAEERGKGRIYYKNPLTGEIAQKQQDGFTEHFRVFATDKNKNGSGNDRATIRQAQKLRSLGYRKRNGKNRQG 156 (231) T ss_pred HhHHhhccccccCCceEEeeecchHHHHHHHhhcCcccccchhhhhhccCCCCCCCCCHHHHHHHHHhcccccCCCCCCC Confidence 887777777766677777767899999999999997654320 0 Q ss_pred ---------------------------------------CceeeecCccccCCCHHHHHHHHH-HHHHHhcC Q lcl|NC_015266. 124 ---------------------------------------GPLAQYPVRVVLGFSSADRELVRD-RLLRYLNR 155 (155) Q Consensus 124 ---------------------------------------~~~v~iPaRp~LG~s~~d~~~I~~-~i~~~l~r 155 (155) ...|++|+|||||+|++|...|++ +|..+|+. T Consensus 157 k~~~rkps~kwI~~~ls~~qAgliIR~L~~k~~~~~~k~~W~I~~paR~FLG~~~~e~~~~l~~~l~~i~~~ 228 (231) T protein:vir:37 157 KTKYRLYTIKEIRERLTRTWASMEIRRLENKVNAGNGKTNWEIHVPARPFLDTREKENVDILREITLKFLSG 228 (231) T ss_pred CCCcCcCCHHHHHHhhhhHHHHHHHHHHhcccccccCcceeeeecCcccccCCCHHHHHHHHHHHHHHHhcc Confidence 013789999999999998877665 45556666 No 19 >protein:vir:78755 Length: 228 # NCBI annotation: putative tail completion protein # Family: family:all:743 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285651;genbank:gi:148727157;genbank:GeneID:5220102 Probab=99.93 E-value=4e-30 Score=181.73 Aligned_cols=137 Identities=27% Similarity=0.383 Sum_probs=115.1 Q ss_pred CchhHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhhhh Q lcl|NC_015266. 1 MDDDLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAMFR 80 (155) Q Consensus 1 m~~~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l~~ 80 (155) |..|+.+|.++|..| +|+|+.|+.|+++||.+|+.++++||++|++|||+||+|+++. + ..||. 
T Consensus 4 ~~~dl~~l~~~L~ll--~L~p~~RrrLl~~iar~lr~~~~~rIr~Q~~PDGs~~~pRKr~-------------k-rKMl~ 67 (228) T protein:vir:78 4 ITLDTRRGKDQLNLL--ALPPKKRKRLVWRAANEMKKLATRNVRQQQDPNGNAWAPRKRG-------------K-RKMLR 67 (228) T ss_pred chhhHHHHHHHHHHh--cCCHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCChhhhhh-------------H-HHHHh Confidence 777999999999955 9999999999999999999999999999999999999998742 1 13777 Q ss_pred hhhhcceeeEEE-cCcEEEEEecc-----cccccccccccCccccccCC------------------------------- Q lcl|NC_015266. 81 KLRTARYLRIDV-DDTGLAIGFDD-----RLSRIVRVHQEGQKAPVEPG------------------------------- 123 (155) Q Consensus 81 ~~~l~~sl~~~~-~~~~~~v~~~G-----~~~~yAaiHqfG~~~~~~~~------------------------------- 123 (155) .+ .++|.+.. ++++++|+|.| ....+|++||||.++.|+.. T Consensus 68 ~L--~k~Lk~~~~~~~~a~v~f~~~~~~~~~~rIA~vHq~G~~~~v~~~~~~~~~~~r~~~~~paTr~QAk~Lr~lGy~~ 145 (228) T protein:vir:78 68 GL--PKLLQIREPRQDMAELGFTKGTMSAHAGVIANTHQKGHTYKVTAASRRRIAPSDVGKNKQASKAQARKLRELGFKR 145 (228) T ss_pred hh--HHhhhhhcccccceEEEeecCcccchHHHHHHHHhcCcccccccchhhhhhcccCCCCCCCCHHHHHHHHHhhccc Confidence 65 68888654 57899999977 46789999999987654321 Q ss_pred ---------------------------------------CceeeecCccccCCCHHHHHHHHHHHHHHhcC Q lcl|NC_015266. 
124 ---------------------------------------GPLAQYPVRVVLGFSSADRELVRDRLLRYLNR 155 (155) Q Consensus 124 ---------------------------------------~~~v~iPaRp~LG~s~~d~~~I~~~i~~~l~r 155 (155) ...|++|+|||||+|++|...+++.+++-+.= T Consensus 146 ~~~~~k~~rkps~kwI~~nls~gqAgliir~L~~k~~k~~W~I~~PaR~FLG~s~~e~~~~l~~~l~~i~~ 216 (228) T protein:vir:78 146 PGKRKRAYRSASLGWITANLNYAQAGLLIKKLKDEPVKESWEIQLPARPFLGANARQRQQAFALRPESIDY 216 (228) T ss_pred cCCcCCCcccCCHHHHHHHhhHHHHHHHHHHHhCCCCccceeeecCcccccCCCHHHHHHHHHHHHHhccc Confidence 01468999999999999999999999988765 No 20 >protein:vir:3750 Length: 227 # NCBI annotation: hypothetical protein # Family: family:all:743 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043491;genbank:gi:9628626;genbank:GeneID:1261131 Probab=99.93 E-value=3.4e-29 Score=176.68 Aligned_cols=139 Identities=19% Similarity=0.240 Sum_probs=113.4 Q ss_pred CchhHHHHHHHHHHH-HHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhhh Q lcl|NC_015266. 1 MDDDLRALEKWAGGL-LAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAMF 79 (155) Q Consensus 1 m~~~~~~l~~~l~~l-l~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l~ 79 (155) |+-|..++......| +.+|+|+.|+.|+..||.+|+.++++||++|++|||+||+|+++. ...|| T Consensus 5 ~~~n~~~~~~l~~~L~ll~L~p~~Rr~ll~~iak~lr~~~k~rIr~Q~~PDGs~~~pRKr~--------------k~KM~ 70 (227) T protein:vir:37 5 MGIDKEDLKKFLKDLEIISLPDKKKREILIRSLQMIKRQAVKSAANQRNPMGGSWKKRKNG--------------TAKML 70 (227) T ss_pred ccCCHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCchhcch--------------hHHHH Confidence 555555555544444 669999999999999999999999999999999999999998742 12488 Q ss_pred hhhhhcceeeEEEcCcEEEEEec-ccccccccccccCccccccC------------------------------------ Q lcl|NC_015266. 80 RKLRTARYLRIDVDDTGLAIGFD-DRLSRIVRVHQEGQKAPVEP------------------------------------ 122 (155) Q Consensus 80 ~~~~l~~sl~~~~~~~~~~v~~~-G~~~~yAaiHqfG~~~~~~~------------------------------------ 122 (155) .++ .+++...+++++++|+|. |....+|++||||+++.|+. 
T Consensus 71 ~kL--~k~l~~~~~~~~a~v~f~~g~~~~IA~vHq~G~~~~v~~~~~~~~~~~~~~~~paTr~QAk~Lr~lGy~v~~~k~ 148 (227) T protein:vir:37 71 RRI--AKLANSKAEKAQGTLFYKQKRTGEIAQEHQEGIPHLFKKTEFTGKNKGGIGADPCTLRQAKKLKDLGYTVANGKT 148 (227) T ss_pred hhh--HHHcceeecccceEEEecCcchHHHHHHhhcCcccccchhhhhhhhcCCccccCCCHHHHHHHHHhcccccCCCC Confidence 876 678999999999999995 88899999999997654310 Q ss_pred -------------------------------------------CCceeeecCccccCCCHHHHHHHHHHHHHHhcC Q lcl|NC_015266. 123 -------------------------------------------GGPLAQYPVRVVLGFSSADRELVRDRLLRYLNR 155 (155) Q Consensus 123 -------------------------------------------~~~~v~iPaRp~LG~s~~d~~~I~~~i~~~l~r 155 (155) ....|++|+|||||+|+++...|+..+.+-++. T Consensus 149 k~~k~~~rkps~kwI~~nls~~qAgliIR~L~~k~~~~~~~~k~~W~I~~PaR~FLG~~~~e~~~~l~r~l~~~~~ 224 (227) T protein:vir:37 149 KNGKAKRRKPTLSEIRSTLSRAKASLIIRKLEEKNGMNPSRHLTQWIIPTEKRSFLDTREEENAKIILAEIQKYTQ 224 (227) T ss_pred CCcCCccccCCHHHHHHhhhHHHHHHHHHHHhcccccccccCccceeeecCcccccCCCHHHHHHHHHHHHHHHhh Confidence 012468999999999999998888888877777 No 21 >protein:vir:98860 Length: 230 # NCBI annotation: hypothetical protein # Family: family:all:743 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654736;genbank:gi:109302921;genbank:GeneID:4156065 Probab=99.89 E-value=8.7e-27 Score=163.47 Aligned_cols=139 Identities=20% Similarity=0.284 Sum_probs=105.6 Q ss_pred Cc---hhHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchh Q lcl|NC_015266. 1 MD---DDLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQA 77 (155) Q Consensus 1 m~---~~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~ 77 (155) |+ +++.+|.+.|. +..|+|+.|+.|++.||.+|+.++++||++|++|||+||+|++.. 
++ - T Consensus 7 ~~ln~~~~~~l~~~L~--ll~L~p~kRrrll~~iak~lr~~~k~rIr~Q~~PDGs~w~pRKr~-------------k~-K 70 (230) T protein:vir:98 7 MGVNPDDLRDFLKDLE--LLKIPPKKKKEILIRTLQEMKKRSVKSASNQRTPTGSGWKPRKNG-------------NA-K 70 (230) T ss_pred ccCCHHHHHHHHHHHH--HhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCChhhhhh-------------hH-H Confidence 22 36788888888 348999999999999999999999999999999999999998742 11 2 Q ss_pred hhhhh-hhcceeeEEEcCcEEEEEecccccccccccccCccccccC---------------------------------- Q lcl|NC_015266. 78 MFRKL-RTARYLRIDVDDTGLAIGFDDRLSRIVRVHQEGQKAPVEP---------------------------------- 122 (155) Q Consensus 78 l~~~~-~l~~sl~~~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~---------------------------------- 122 (155) ||..+ .+........+.+++.++|.|....+|++||||.++.++. T Consensus 71 Ml~~L~k~l~~~~~~~~~~~v~~~~~~~~~rIA~vHq~G~~~~~~~~~~~~r~~~~~~~~paTr~QAk~Lr~lGy~v~~g 150 (230) T protein:vir:98 71 MLRRIAKTLKFTSADREIKRVCTISRNAQRRSQKEHQRGAKITNLKSVILRKSRAGTAKDPATMRQAKKLRDLGYTVPNG 150 (230) T ss_pred HHhhhHHHHHHhhcccccceeeeecccchhhhhhhhhccchhhhhhhhhhhhhcCCCCcccccHHHHHHHHHcCCccCCC Confidence 66655 2222223334556777777888899999999996542210 Q ss_pred --------------------------------------C------CceeeecCccccCCCHHHHHHHHHHHHHHhcC Q lcl|NC_015266. 
123 --------------------------------------G------GPLAQYPVRVVLGFSSADRELVRDRLLRYLNR 155 (155) Q Consensus 123 --------------------------------------~------~~~v~iPaRp~LG~s~~d~~~I~~~i~~~l~r 155 (155) + ...|++|+|||||+|++|...|++.+..-+.- T Consensus 151 ~~~~~~k~~kkps~kwI~~nls~~qAgliIR~L~~k~~k~~~~~t~W~I~~PaR~FLG~~~~e~~~~l~~~l~~i~~ 227 (230) T protein:vir:98 151 TTKSGKKRYRRPSAREIVATLSRAKASLLIRYFQEKEERQGKRLTKWIIPTEKRPFLDERDKENAEILKEFILKFSG 227 (230) T ss_pred CCCcCCCCCCCCCHHHHHHhhhHHHHHHHHHHHhccccccccCccceeeecCcccccCCChHHHHHHHHHHHHHhcc Confidence 0 13478999999999999999988777766665 No 22 >protein:vir:274 Length: 166 # NCBI annotation: putative tail completion protein # Family: family:all:743 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536654;genbank:gi:17975132;genbank:GeneID:929088 Probab=99.79 E-value=9.8e-23 Score=141.24 Aligned_cols=139 Identities=16% Similarity=0.202 Sum_probs=100.9 Q ss_pred CchhHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhhhh Q lcl|NC_015266. 1 MDDDLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAMFR 80 (155) Q Consensus 1 m~~~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l~~ 80 (155) =.+++.+|.+.|. +.+|+|+.|+.|++.||.+|+.++++||++|++|||+||+|+++. +. .||. T Consensus 7 ~~~q~~~l~~~L~--ll~L~p~~Rr~ll~~iak~lr~~~~~rIr~Q~~PDGs~~~pRKr~-------------k~-KMl~ 70 (166) T protein:vir:27 7 EDRSYLRVMEQLE--LLGLDRKTRDKMLRRIGAQIAKTTRKNIRAQRDPDGSAWAKRKRG-------------RG-KLLK 70 (166) T ss_pred ChHHHHHHHHHHH--HhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCcCchhhhh-------------hH-HHHH Confidence 1224666666665 558999999999999999999999999999999999999998742 11 3666 Q ss_pred hhhhcceeeEEEcCcEEEEEecccccccccccccCccccccCCCceeeecCccccC----CCHHHHHHHHHHHHHHhcC Q lcl|NC_015266. 
81 KLRTARYLRIDVDDTGLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLG----FSSADRELVRDRLLRYLNR 155 (155) Q Consensus 81 ~~~l~~sl~~~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG----~s~~d~~~I~~~i~~~l~r 155 (155) .+.....+....++++++|+|.|....+|++||||+++.++..++.+.+|+|---. .|......+.+.+-.-..+ T Consensus 71 ~l~k~~~~~~~~~~~~~~v~~~g~~~rIA~vHq~G~~~~~~~~~~~~~~~~~~~~~~~~pATr~QAk~Lr~~~~~~~~~ 149 (166) T protein:vir:27 71 GFTQKLKHFQRDNNRTLVVGWPSARGRVAYEHHHGIAQESGLSARKRQAKQQNEPRKTDPATREQAKRCAISITASSLR 149 (166) T ss_pred hhHHHhhhhccCCCCeEEEEecCchhhhhhhhhcCcccccccchhhHHHhhccCCCCCccCCHHHHHHHHHhcCccccc Confidence 66555556666677899999999999999999999999998888777777773222 2333333322222111111 No 23 >protein:vir:96105 Length: 193 # NCBI annotation: hypothetical protein ORF028 # Family: family:all:503 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294445;genbank:gi:149408342;genbank:GeneID:5237224 Probab=98.41 E-value=4.6e-10 Score=71.78 Aligned_cols=83 Identities=13% Similarity=0.077 Sum_probs=47.2 Q ss_pred cccccCcccchhhhhhhhhcceeeEEEcCcEEEEEec-------------cccccc-ccccccCccccccCC-------- Q lcl|NC_015266. 66 LRTKVGRIKRQAMFRKLRTARYLRIDVDDTGLAIGFD-------------DRLSRI-VRVHQEGQKAPVEPG-------- 123 (155) Q Consensus 66 ~~~~~~~~~~~~l~~~~~l~~sl~~~~~~~~~~v~~~-------------G~~~~y-AaiHqfG~~~~~~~~-------- 123 (155) -.-+.+...-..+.+.+. ..+...+.|||. |++..| |++|+||+++.++.+ T Consensus 1 m~~~~~~~~~~~~~~~l~-------~l~~~~v~vGi~~~~~~~~~~~~~~G~~va~iAai~EfG~~I~~~~~~~~~~~~~ 73 (193) T protein:vir:96 1 MSLRRDSELIAAHLQMLR-------AMRGRSVSAGWYSTARYPDKAGGSVGIQVARIARLNEYGGTIDHPGGTRYIRDAI 73 (193) T ss_pred CeeccchHHHHHHHHHHH-------HhcCCeEEEEEcCCCCCCCcccccccchHHHHHhHHHcCCccccCccceeeeecc Confidence 000000000011111111 122445555554 666555 999999998764432 Q ss_pred -----------------------CceeeecCccccCCCHHH-HHHHHHHHHHHhcC Q lcl|NC_015266. 
124 -----------------------GPLAQYPVRVVLGFSSAD-RELVRDRLLRYLNR 155 (155) Q Consensus 124 -----------------------~~~v~iPaRp~LG~s~~d-~~~I~~~i~~~l~r 155 (155) .+.|+|||||||..+-+| .+++.+.+...+.+ T Consensus 74 ~~g~~~~~~~~k~~~~~~~~~~~~~~v~IPaRPFlr~t~~~~~~~~~~~~~~~~~~ 129 (193) T protein:vir:96 74 VRGRFVGVRFVRNDFPGETEVTKPHRITIPARPFMRYAWNLFSADRAAIQNRIAMR 129 (193) T ss_pred ccccccccceeccCcceeeEeecceeccCCCcchhhhhHHHHHHHHHHHHHHHHHH Confidence 235689999999999666 56677777777766 No 24 >protein:vir:94796 Length: 137 # NCBI annotation: ORF050 # Family: family:all:180 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240540;genbank:gi:66396237;genbank:GeneID:5133576 Probab=97.99 E-value=2.2e-07 Score=57.03 Aligned_cols=115 Identities=12% Similarity=-0.017 Sum_probs=69.2 Q ss_pred Cch---hHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchh Q lcl|NC_015266. 1 MDD---DLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQA 77 (155) Q Consensus 1 m~~---~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~ 77 (155) |.+ .+.+|...|..+...+.. .-.+.|.+.+..+....+.. .| T Consensus 1 Ma~~~~G~~~l~~~L~~~~~~~~~-~~~~al~~~a~~v~~~ak~~-----aP---------------------------- 46 (137) T protein:vir:94 1 MAKVKYGNWDLVKELENYERDIER-WVKRGIAKTTVKIHNTIISL-----MP---------------------------- 46 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHh-----CC---------------------------- Confidence 887 667777777766665532 22344556666665544432 12 Q ss_pred hhhhhhhcceeeEEEcCcEEEEEecccccccccccccCccccccCC----------------Cc---eeeecCccccCCC Q lcl|NC_015266. 78 MFRKLRTARYLRIDVDDTGLAIGFDDRLSRIVRVHQEGQKAPVEPG----------------GP---LAQYPVRVVLGFS 138 (155) Q Consensus 78 l~~~~~l~~sl~~~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~----------------~~---~v~iPaRp~LG~s 138 (155) .++|.|.+||.+....++.++. .|++..||...+||.......+ +. 
...+||+|||-=+ T Consensus 47 -vdTG~Lr~SI~~~~~~~~~~~~-V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA 124 (137) T protein:vir:94 47 -VDTGYLRESVTMDFKDGGFTGV-INIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPA 124 (137) T ss_pred -cCcchhhcCceeEeecCcEEEE-EecCCCcccccccCccccccCCCcccccccccceeccCCceeecCCcCCCcchHHH Confidence 1457788899888877765432 2899999999999954322111 11 1248999999644 Q ss_pred HHHHHHHHHHHHHHhc Q lcl|NC_015266. 139 SADRELVRDRLLRYLN 154 (155) Q Consensus 139 ~~d~~~I~~~i~~~l~ 154 (155) .++....|.+-|+ T Consensus 125 ---~~~~~~~~~~~l~ 137 (137) T protein:vir:94 125 ---IDAGRVFFNKYFS 137 (137) T ss_pred ---HHHHHHHHHHhhC Confidence 3333334444444 No 25 >protein:vir:94654 Length: 142 # NCBI annotation: tail component protein # Family: family:all:1084 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579211;genbank:gi:93007447;genbank:GeneID:5076773 Probab=97.97 E-value=1.4e-07 Score=58.09 Aligned_cols=117 Identities=10% Similarity=-0.022 Sum_probs=72.9 Q ss_pred Cch-----hHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccc Q lcl|NC_015266. 1 MDD-----DLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKR 75 (155) Q Consensus 1 m~~-----~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~ 75 (155) |+. ++++|.+.|..+.+++.. .-...+.+++..+....+.+ .| T Consensus 1 Ma~~~~~~~~~~l~~~l~~~~~~~~~-~~~~~l~~~a~~i~~~ak~~-----aP-------------------------- 48 (142) T protein:vir:94 1 MAGLNYRVNSTEFQGALRAALDRLTG-AAREATEAAANDMVNMAKGL-----CP-------------------------- 48 (142) T ss_pred CceeEEEecHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHh-----CC-------------------------- Confidence 433 677888888888777643 33456677777776664332 22 Q ss_pred hhhhhhhhhcceeeEEEcCcEEEE-EecccccccccccccCcccc---ccC-----------CCceee---ecCccccCC Q lcl|NC_015266. 
76 QAMFRKLRTARYLRIDVDDTGLAI-GFDDRLSRIVRVHQEGQKAP---VEP-----------GGPLAQ---YPVRVVLGF 137 (155) Q Consensus 76 ~~l~~~~~l~~sl~~~~~~~~~~v-~~~G~~~~yAaiHqfG~~~~---~~~-----------~~~~v~---iPaRp~LG~ 137 (155) .++|.|.+||++....++..+ +..|++..||..|+||.... ++. ..++|. +|++|||.= T Consensus 49 ---v~TG~Lr~SI~~~~~~~g~~~~~~v~~~~~YA~~vE~Gt~~~~i~pk~~k~l~~~~~~~~~~~v~~pG~~~~pfl~~ 125 (142) T protein:vir:94 49 ---VDTGRLRSSIQAVPSGGRFSFSVTIGTNVTYAADVEYGTAPHVIVPKDKKALYWPGAAHPVAKVNHPGTRAQPFMRP 125 (142) T ss_pred ---ccchhhhccceeeeccCCceEEEEEecCcccchhhhccCCCceeccCCCccceecccceeeeeeeecCCCCCcchhH Confidence 135778888988777665321 12389999999999996432 111 112343 789999987 Q ss_pred CHHHHHHHHHHHHHHhcC Q lcl|NC_015266. 138 SSADRELVRDRLLRYLNR 155 (155) Q Consensus 138 s~~d~~~I~~~i~~~l~r 155 (155) +-++.. ..|.+++.+ T Consensus 126 A~~~~~---~~i~~~~~~ 140 (142) T protein:vir:94 126 AIAAAS---TFLRNHAKG 140 (142) T ss_pred HHHHHH---HHHHHHHHh Confidence 754432 444555555 No 26 >protein:vir:99546 Length: 200 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039796;genbank:gi:126011046;genbank:GeneID:4818241 Probab=97.96 E-value=5.5e-09 Score=65.88 Aligned_cols=90 Identities=13% Similarity=0.127 Sum_probs=50.6 Q ss_pred cchhhhhhccccccCcccchhhhhhhhhcceeeEEEcCcEEEEEec-------------cc-ccccccccccCccccccC Q lcl|NC_015266. 57 RKIKKGGKGLRTKVGRIKRQAMFRKLRTARYLRIDVDDTGLAIGFD-------------DR-LSRIVRVHQEGQKAPVEP 122 (155) Q Consensus 57 ~k~~~~~~~~~~~~~~~~~~~l~~~~~l~~sl~~~~~~~~~~v~~~-------------G~-~~~yAaiHqfG~~~~~~~ 122 (155) -+. .-....+.++..+-.-+.+.+. ..+...+.|||. |+ ++.+|++|.||+++++.. 
T Consensus 1 ~~~--~~~~~~k~~~~~~~~~~~~~l~-------~l~~~~v~vGi~~~~~y~~~~~~~dG~~va~IA~~~EfG~~i~~p~ 71 (200) T protein:vir:99 1 MKK--GFSKSNSVAAPLKHFQMLKQFD-------ALKGKTVQAGWFETDRYPAKEGETIGPLVAKIARQLEFGGVINHPG 71 (200) T ss_pred CCc--CcceeeeeecchHHHHHHHHHH-------HhhCCeEEEEEcCCCCcCCcccccccchHHHHHhHHHcCCeeccCC Confidence 000 0000000011111111112111 134567788875 33 346799999999876443 Q ss_pred C-------------------------------CceeeecCccccCCCHHH-HHHHHHHHHHHhcC Q lcl|NC_015266. 123 G-------------------------------GPLAQYPVRVVLGFSSAD-RELVRDRLLRYLNR 155 (155) Q Consensus 123 ~-------------------------------~~~v~iPaRp~LG~s~~d-~~~I~~~i~~~l~r 155 (155) + .+.|+||+||||--+-+| .+++.+.+...+.+ T Consensus 72 ~~~~~~~~~~~g~~~g~rfv~k~~~~~~~~~~~~~v~IP~RPFlr~t~~~~~~~~~~~~~~~~~~ 136 (200) T protein:vir:99 72 GTKYIKDAIVDGRYVGTRFVHKSFQGEHEVTKAHQIVIPARPFMRLAWATFNKDKVKIQAQIARQ 136 (200) T ss_pred CccccccccccccccccccccccccceeeeeccccccCCCcchhhHHHHHHHHHHHHHHHHHHHH Confidence 2 247899999999999666 67788877777776 No 27 >protein:vir:93738 Length: 137 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240463;genbank:gi:66396153;genbank:GeneID:5133507 Probab=97.84 E-value=5.7e-07 Score=54.82 Aligned_cols=115 Identities=12% Similarity=0.004 Sum_probs=67.6 Q ss_pred Cch---hHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchh Q lcl|NC_015266. 1 MDD---DLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQA 77 (155) Q Consensus 1 m~~---~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~ 77 (155) |.. .+.+|.+.|..+-..+. ..-++.+++.+..+....+.. 
.| T Consensus 1 Ma~~~~g~~~l~~~l~~~~~~~~-~~~~~~~~~~a~~i~~~ak~~-----aP---------------------------- 46 (137) T protein:vir:93 1 MAKVKYGNWDLVKELENYERDME-RWVKRGIAKTTAKIHNTIISL-----MP---------------------------- 46 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHh-----CC---------------------------- Confidence 888 45566666665555443 222344555566665554432 11 Q ss_pred hhhhhhhcceeeEEEcCcEEEEEecccccccccccccCccccccCC----------------Cce---eeecCccccCCC Q lcl|NC_015266. 78 MFRKLRTARYLRIDVDDTGLAIGFDDRLSRIVRVHQEGQKAPVEPG----------------GPL---AQYPVRVVLGFS 138 (155) Q Consensus 78 l~~~~~l~~sl~~~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~----------------~~~---v~iPaRp~LG~s 138 (155) .++|.|.+||++....++..+. .|++..||...+||.......+ +.+ ..+||+|||-=+ T Consensus 47 -vdTG~Lr~SI~~~~~~~~~~~~-V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA 124 (137) T protein:vir:93 47 -VDTGYLRESVTMDFKDSGFTGV-INIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPA 124 (137) T ss_pred -ccccchhccceeEeecCceEEE-EecCCCcccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHH Confidence 1467788889888777754432 2899999999999974422111 111 248999999533 Q ss_pred HHHHHHHHHHHHHHhc Q lcl|NC_015266. 139 SADRELVRDRLLRYLN 154 (155) Q Consensus 139 ~~d~~~I~~~i~~~l~ 154 (155) .++....|.+.|+ T Consensus 125 ---~~~~~~~~~~~l~ 137 (137) T protein:vir:93 125 ---IDAGRAFFNKYFS 137 (137) T ss_pred ---HHHHHHHHHHhhC Confidence 3344444455555 No 28 >protein:vir:94490 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240680;genbank:gi:66396374;genbank:GeneID:5133754 Probab=97.84 E-value=5.7e-07 Score=54.82 Aligned_cols=115 Identities=12% Similarity=0.004 Sum_probs=67.6 Q ss_pred Cch---hHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchh Q lcl|NC_015266. 
1 MDD---DLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQA 77 (155) Q Consensus 1 m~~---~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~ 77 (155) |.. .+.+|.+.|..+-..+. ..-++.+++.+..+....+.. .| T Consensus 1 Ma~~~~g~~~l~~~l~~~~~~~~-~~~~~~~~~~a~~i~~~ak~~-----aP---------------------------- 46 (137) T protein:vir:94 1 MAKVKYGNWDLVKELENYERDME-RWVKRGIAKTTAKIHNTIISL-----MP---------------------------- 46 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHh-----CC---------------------------- Confidence 888 45566666665555443 222344555566665554432 11 Q ss_pred hhhhhhhcceeeEEEcCcEEEEEecccccccccccccCccccccCC----------------Cce---eeecCccccCCC Q lcl|NC_015266. 78 MFRKLRTARYLRIDVDDTGLAIGFDDRLSRIVRVHQEGQKAPVEPG----------------GPL---AQYPVRVVLGFS 138 (155) Q Consensus 78 l~~~~~l~~sl~~~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~----------------~~~---v~iPaRp~LG~s 138 (155) .++|.|.+||++....++..+. .|++..||...+||.......+ +.+ ..+||+|||-=+ T Consensus 47 -vdTG~Lr~SI~~~~~~~~~~~~-V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA 124 (137) T protein:vir:94 47 -VDTGYLRESVTMDFKDSGFTGV-INIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPA 124 (137) T ss_pred -ccccchhccceeEeecCceEEE-EecCCCcccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHH Confidence 1467788889888777754432 2899999999999974422111 111 248999999533 Q ss_pred HHHHHHHHHHHHHHhc Q lcl|NC_015266. 
139 SADRELVRDRLLRYLN 154 (155) Q Consensus 139 ~~d~~~I~~~i~~~l~ 154 (155) .++....|.+.|+ T Consensus 125 ---~~~~~~~~~~~l~ 137 (137) T protein:vir:94 125 ---IDAGRAFFNKYFS 137 (137) T ss_pred ---HHHHHHHHHHhhC Confidence 3344444455555 No 29 >protein:vir:97427 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240753;genbank:gi:66396447;genbank:GeneID:5133783 Probab=97.84 E-value=5.7e-07 Score=54.82 Aligned_cols=115 Identities=12% Similarity=0.004 Sum_probs=67.6 Q ss_pred Cch---hHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchh Q lcl|NC_015266. 1 MDD---DLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQA 77 (155) Q Consensus 1 m~~---~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~ 77 (155) |.. .+.+|.+.|..+-..+. ..-++.+++.+..+....+.. .| T Consensus 1 Ma~~~~g~~~l~~~l~~~~~~~~-~~~~~~~~~~a~~i~~~ak~~-----aP---------------------------- 46 (137) T protein:vir:97 1 MAKVKYGNWDLVKELENYERDME-RWVKRGIAKTTAKIHNTIISL-----MP---------------------------- 46 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHh-----CC---------------------------- Confidence 888 45566666665555443 222344555566665554432 11 Q ss_pred hhhhhhhcceeeEEEcCcEEEEEecccccccccccccCccccccCC----------------Cce---eeecCccccCCC Q lcl|NC_015266. 78 MFRKLRTARYLRIDVDDTGLAIGFDDRLSRIVRVHQEGQKAPVEPG----------------GPL---AQYPVRVVLGFS 138 (155) Q Consensus 78 l~~~~~l~~sl~~~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~----------------~~~---v~iPaRp~LG~s 138 (155) .++|.|.+||++....++..+. 
.|++..||...+||.......+ +.+ ..+||+|||-=+ T Consensus 47 -vdTG~Lr~SI~~~~~~~~~~~~-V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA 124 (137) T protein:vir:97 47 -VDTGYLRESVTMDFKDSGFTGV-INIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPA 124 (137) T ss_pred -ccccchhccceeEeecCceEEE-EecCCCcccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHH Confidence 1467788889888777754432 2899999999999974422111 111 248999999533 Q ss_pred HHHHHHHHHHHHHHhc Q lcl|NC_015266. 139 SADRELVRDRLLRYLN 154 (155) Q Consensus 139 ~~d~~~I~~~i~~~l~ 154 (155) .++....|.+.|+ T Consensus 125 ---~~~~~~~~~~~l~ 137 (137) T protein:vir:97 125 ---IDAGRAFFNKYFS 137 (137) T ss_pred ---HHHHHHHHHHhhC Confidence 3344444455555 No 30 >protein:vir:95894 Length: 137 # NCBI annotation: ORF046 # Family: family:all:180 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240389;genbank:gi:66396083;genbank:GeneID:5133405 Probab=97.82 E-value=7.1e-07 Score=54.29 Aligned_cols=115 Identities=12% Similarity=-0.003 Sum_probs=66.4 Q ss_pred Cch---hHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchh Q lcl|NC_015266. 1 MDD---DLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQA 77 (155) Q Consensus 1 m~~---~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~ 77 (155) |.+ .+.+|.+.|..+-..+. ..-.+.+.+.+..+....+.. .| T Consensus 1 Ma~~~~G~~~l~~~l~~~~~~~~-~~~~~~~~~~a~~v~~~ak~~-----aP---------------------------- 46 (137) T protein:vir:95 1 MAKVKYGNWDLVKELENYERDME-RWVKRGIAKTTAKIHNTIISL-----MP---------------------------- 46 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHh-----CC---------------------------- Confidence 888 45555555555544442 222334555555555544322 11 Q ss_pred hhhhhhhcceeeEEEcCcEEEEEecccccccccccccCccccccCC----------------Cce---eeecCccccCCC Q lcl|NC_015266. 
78 MFRKLRTARYLRIDVDDTGLAIGFDDRLSRIVRVHQEGQKAPVEPG----------------GPL---AQYPVRVVLGFS 138 (155) Q Consensus 78 l~~~~~l~~sl~~~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~----------------~~~---v~iPaRp~LG~s 138 (155) .++|.|.+||.+....+++.+. .|++..||....||.......+ +.+ ..+||+|||-=+ T Consensus 47 -v~TG~L~~Si~~~~~~~~~~~~-V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA 124 (137) T protein:vir:95 47 -VDTGYLRESVTMDFKDGGFTGV-INIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPA 124 (137) T ss_pred -ccchhhhcCeeeEeeCCceEEE-EecCCCcccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHH Confidence 1457788899888877765432 2899999999999964422111 111 248999999533 Q ss_pred HHHHHHHHHHHHHHhc Q lcl|NC_015266. 139 SADRELVRDRLLRYLN 154 (155) Q Consensus 139 ~~d~~~I~~~i~~~l~ 154 (155) .++-...|.+.|+ T Consensus 125 ---~~~~~~~i~k~l~ 137 (137) T protein:vir:95 125 ---IDAGRAFFNKYFS 137 (137) T ss_pred ---HHHHHHHHHHhhC Confidence 3334444555555 No 31 >protein:vir:106041 Length: 137 # NCBI annotation: gp23 # Family: family:all:1084 # MgeID: mge:1505 # MgeName: Cooper # Cross-refs: genbank:acc:YP_654920;genbank:gi:109392376;genbank:GeneID:4157069 Probab=97.82 E-value=2.6e-07 Score=56.72 Aligned_cols=103 Identities=17% Similarity=0.163 Sum_probs=55.4 Q ss_pred Cch------hHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCccc Q lcl|NC_015266. 1 MDD------DLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIK 74 (155) Q Consensus 1 m~~------~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~ 74 (155) |+. 
|...+.+.+...+ +..++++++.+....+.+ .| T Consensus 1 m~~s~~i~i~~~~l~~~v~~~~--------k~~l~~~a~~i~~~ak~~-----aP------------------------- 42 (137) T protein:vir:10 1 MPVTARIHINEPELERQTGAIF--------RGKHRSITRRIATQARAD-----VP------------------------- 42 (137) T ss_pred CCeeEEEeeCHHHHHHHHHHHH--------HHHHHHHHHHHHHHHHHh-----CC------------------------- Confidence 543 2222333332222 223556666654443211 11 Q ss_pred chhhhhhhhhcceeeEEEcCcE-EE-EEecccccccccccccCcc---ccccCC-------------Cceeeec---Ccc Q lcl|NC_015266. 75 RQAMFRKLRTARYLRIDVDDTG-LA-IGFDDRLSRIVRVHQEGQK---APVEPG-------------GPLAQYP---VRV 133 (155) Q Consensus 75 ~~~l~~~~~l~~sl~~~~~~~~-~~-v~~~G~~~~yAaiHqfG~~---~~~~~~-------------~~~v~iP---aRp 133 (155) .++|.|.+||++....++ .. .+..|++..||.+|+||.. +.++.+ .++|++| ||| T Consensus 43 ----v~tG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~ve~GT~ph~I~pk~~k~l~f~~~G~~v~~k~v~hpG~~a~P 118 (137) T protein:vir:10 43 ----VRTGNLGRGIQEMPQTYRPFHVGGGVEDNVDYAAPVHEGSRPHRITARHANALHFFWHGREVFRKSVWHPGVRPRP 118 (137) T ss_pred ----cccchhhcCceeeeeccccceEEEEEecCCCceeeeeecCCCceeecccCceeeeeeCCceEEeeeeecCCCCCCc Confidence 245788899998765442 11 1224999999999999974 333221 1357777 999 Q ss_pred ccCCCHHHHHHHHHHHHHHhcC Q lcl|NC_015266. 134 VLGFSSADRELVRDRLLRYLNR 155 (155) Q Consensus 134 ~LG~s~~d~~~I~~~i~~~l~r 155 (155) || ...+.+++++ T Consensus 119 fl----------~~A~~~~~~~ 130 (137) T protein:vir:10 119 FL----------RNAARRVVAA 130 (137) T ss_pred hH----------HHHHHHHhhc Confidence 98 3333444444 No 32 >protein:vir:105330 Length: 137 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950673;genbank:gi:119967843;genbank:GeneID:4643209 Probab=97.82 E-value=6.2e-07 Score=54.60 Aligned_cols=115 Identities=11% Similarity=-0.028 Sum_probs=70.6 Q ss_pred Cch---hHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchh Q lcl|NC_015266. 
1 MDD---DLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQA 77 (155) Q Consensus 1 m~~---~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~ 77 (155) |.+ .+.+|...|..+-..+... -.+.+...|..+....+.+ .| T Consensus 1 Ma~~~~G~~~l~~~l~~~~~~~~~~-~~~al~~~a~~i~~~ak~~-----aP---------------------------- 46 (137) T protein:vir:10 1 MAKVKYGNWDLVKELEEFEKETIRW-AKKGIAKTTTIIHNSIVSN-----MP---------------------------- 46 (137) T ss_pred CccchhCHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHh-----CC---------------------------- Confidence 887 4667777777766655432 2345666676666655544 12 Q ss_pred hhhhhhhcceeeEEEcCcEEEEEecccccccccccccCccccc---cCC-------------C---ceeeecCccccCCC Q lcl|NC_015266. 78 MFRKLRTARYLRIDVDDTGLAIGFDDRLSRIVRVHQEGQKAPV---EPG-------------G---PLAQYPVRVVLGFS 138 (155) Q Consensus 78 l~~~~~l~~sl~~~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~---~~~-------------~---~~v~iPaRp~LG~s 138 (155) .++|.|.+||+++...+++... .|++..||...+||...-. ... + ....+||||||==+ T Consensus 47 -v~TG~Lr~SI~~~~~~~~~~~~-V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~Pfl~pA 124 (137) T protein:vir:10 47 -VDTGYLRESVSMDFKKGGLTGV-INIGSEYAVYVNYGTGIYAVGPGGSRAKNIPWRYKDADGHWHTTKGQHAQPFWEPA 124 (137) T ss_pred -cCcchhhcCeeeEecCCcEEEE-EecCCccccccccCccccccCCCcccccccceeeeccccccccCCCCCCCcchhHH Confidence 2467788899888887765432 3899999999999954311 100 1 11248999999533 Q ss_pred HHHHHHHHHHHHHHhc Q lcl|NC_015266. 
139 SADRELVRDRLLRYLN 154 (155) Q Consensus 139 ~~d~~~I~~~i~~~l~ 154 (155) .++-...|.+.|+ T Consensus 125 ---~~~~~~~i~k~i~ 137 (137) T protein:vir:10 125 ---IDEGRAFFNKYFS 137 (137) T ss_pred ---HHHHHHHHHHhhC Confidence 2333344555555 No 33 >protein:vir:107099 Length: 137 # NCBI annotation: conserved phage protein # Family: family:all:180 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950610;genbank:gi:119953690;genbank:GeneID:4643108 Probab=97.75 E-value=9.6e-07 Score=53.58 Aligned_cols=115 Identities=13% Similarity=0.015 Sum_probs=69.8 Q ss_pred Cch---hHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchh Q lcl|NC_015266. 1 MDD---DLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQA 77 (155) Q Consensus 1 m~~---~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~ 77 (155) |+. .+.+|.+.|..+-..+.. .-.+.+++.+..+....+.+ .| T Consensus 1 Ma~~~~Gl~~l~~~l~~~~~~~~~-~~~~al~~~a~~i~~~ak~~-----aP---------------------------- 46 (137) T protein:vir:10 1 MAKVKYGNWELVKELEDFEKETIR-WAKKGIAKTTTIIHNSIVSN-----MP---------------------------- 46 (137) T ss_pred CchhHhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHh-----CC---------------------------- Confidence 988 455666666665555432 23456777777777766654 22 Q ss_pred hhhhhhhcceeeEEEcCcEEEEEecccccccccccccCcccccc-CC---------------Cce---eeecCccccCCC Q lcl|NC_015266. 78 MFRKLRTARYLRIDVDDTGLAIGFDDRLSRIVRVHQEGQKAPVE-PG---------------GPL---AQYPVRVVLGFS 138 (155) Q Consensus 78 l~~~~~l~~sl~~~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~-~~---------------~~~---v~iPaRp~LG~s 138 (155) .++|.|.+||.+....+++.+. .|++..||....||...... +. 
+.+ ..+||||||==+ T Consensus 47 -vdTG~Lr~SI~~~~~~~~~~~~-V~~~~~Ya~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA 124 (137) T protein:vir:10 47 -VDTGYLRESVSMDFKKGGLTGV-INIGSEYAVYVNYGTGIYAVGPGGSRAKNIPWCYKDADGHWHTTKGQHAQPFWEPA 124 (137) T ss_pred -cCcchhhcCeeEEeeCCcEEEE-EecCCCcccccccCccccccCCCccccccccceeeccccceeccCCCCCCcchhHH Confidence 1457788899888877765532 38999999999999543211 00 011 237999999533 Q ss_pred HHHHHHHHHHHHHHhc Q lcl|NC_015266. 139 SADRELVRDRLLRYLN 154 (155) Q Consensus 139 ~~d~~~I~~~i~~~l~ 154 (155) .++-...|...|+ T Consensus 125 ---~~~~~~~i~k~i~ 137 (137) T protein:vir:10 125 ---IDEGRAFFNKYFS 137 (137) T ss_pred ---HHHHHHHHHHhcC Confidence 2223334444444 No 34 >protein:vir:2740 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695113;genbank:gi:23455882;genbank:GeneID:955595 Probab=97.74 E-value=7.4e-07 Score=54.19 Aligned_cols=109 Identities=13% Similarity=0.218 Sum_probs=65.8 Q ss_pred Cch-h---HHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccch Q lcl|NC_015266. 1 MDD-D---LRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQ 76 (155) Q Consensus 1 m~~-~---~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~ 76 (155) |.+ + |++|.+.|..+. ++.+-.+.+++.+..+....++. ++...| T Consensus 1 Ma~i~~~Gld~l~~~L~~~~---~~~~v~~~~~~~~~~~~~~~~~~-----a~~~~p----------------------- 49 (114) T protein:vir:27 1 MATIEFEGLDEMAQSLLKNA---SPEKRSKVLRKYGSKLKEAAVNR-----AQFNKG----------------------- 49 (114) T ss_pred CeeeeeehHHHHHHHHHHhc---CHHHHHHHHHHHHHHHHHHHHHh-----cccCCC----------------------- Confidence 776 3 444544444321 23333456666666655544432 221111 Q ss_pred hhhhhhhhcceeeEEEcCcEEEEEecccccccccccccCccccccCCCceeeecCccccCCCHH-HHHHHHHHHHHHhcC Q lcl|NC_015266. 
77 AMFRKLRTARYLRIDVDDTGLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSSA-DRELVRDRLLRYLNR 155 (155) Q Consensus 77 ~l~~~~~l~~sl~~~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~~-d~~~I~~~i~~~l~r 155 (155) .++|.|.+||.+..+++++.| |++..||..+.||.. .+||||||.=.-+ ....+.+.|.+.|-= T Consensus 50 --~~TG~Lr~sI~~~~~~~~~~V---~~~~~Ya~~vEfGT~----------km~a~Pfl~PA~~~~~~~~~~~l~~l~k~ 114 (114) T protein:vir:27 50 --YSTGATRRSITLQVESDKATV---EALTSYSGYLEVGTR----------KMEAQPFMKPALDEVAPKMVEELAKWDET 114 (114) T ss_pred --CCchhhhhceeeeecCCeeEe---cCCCCccceeccccc----------ccCCCCchhhhHHHHHHHHHHHHHHHhcC Confidence 135677889999999999999 899999999999954 5899999974432 333344444333333 No 35 >protein:vir:4906 Length: 114 # NCBI annotation: gp114 # Family: family:all:180 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056684;genbank:gi:9635019;genbank:GeneID:1262668 Probab=97.74 E-value=7.4e-07 Score=54.19 Aligned_cols=109 Identities=13% Similarity=0.218 Sum_probs=65.8 Q ss_pred Cch-h---HHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccch Q lcl|NC_015266. 1 MDD-D---LRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQ 76 (155) Q Consensus 1 m~~-~---~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~ 76 (155) |.+ + |++|.+.|..+. ++.+-.+.+++.+..+....++. ++...| T Consensus 1 Ma~i~~~Gld~l~~~L~~~~---~~~~v~~~~~~~~~~~~~~~~~~-----a~~~~p----------------------- 49 (114) T protein:vir:49 1 MATIEFEGLDEMAQSLLKNA---SPEKRSKVLRKYGSKLKEAAVNR-----AQFNKG----------------------- 49 (114) T ss_pred CeeeeeehHHHHHHHHHHhc---CHHHHHHHHHHHHHHHHHHHHHh-----cccCCC----------------------- Confidence 776 3 444544444321 23333456666666655544432 221111 Q ss_pred hhhhhhhhcceeeEEEcCcEEEEEecccccccccccccCccccccCCCceeeecCccccCCCHH-HHHHHHHHHHHHhcC Q lcl|NC_015266. 
77 AMFRKLRTARYLRIDVDDTGLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSSA-DRELVRDRLLRYLNR 155 (155) Q Consensus 77 ~l~~~~~l~~sl~~~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~~-d~~~I~~~i~~~l~r 155 (155) .++|.|.+||.+..+++++.| |++..||..+.||.. .+||||||.=.-+ ....+.+.|.+.|-= T Consensus 50 --~~TG~Lr~sI~~~~~~~~~~V---~~~~~Ya~~vEfGT~----------km~a~Pfl~PA~~~~~~~~~~~l~~l~k~ 114 (114) T protein:vir:49 50 --YSTGATRRSITLQVESDKATV---EALTSYSGYLEVGTR----------KMEAQPFMKPALDEVAPKMVEELAKWDET 114 (114) T ss_pred --CCchhhhhceeeeecCCeeEe---cCCCCccceeccccc----------ccCCCCchhhhHHHHHHHHHHHHHHHhcC Confidence 135677889999999999999 899999999999954 5899999974432 333344444333333 No 36 >protein:vir:96829 Length: 135 # NCBI annotation: ORF033 # Family: family:all:180 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240161;genbank:gi:66395838;genbank:GeneID:5133170 Probab=97.70 E-value=1.5e-06 Score=52.45 Aligned_cols=115 Identities=8% Similarity=-0.037 Sum_probs=66.7 Q ss_pred Cch---hHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchh Q lcl|NC_015266. 1 MDD---DLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQA 77 (155) Q Consensus 1 m~~---~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~ 77 (155) |.. -|++|...|..+...+. ..-.+.|.+.++.+....+.+ .| T Consensus 1 Ma~~~~Gl~~l~~~l~~~~~~~~-~~~~~al~~~a~~v~~~ak~~-----ap---------------------------- 46 (135) T protein:vir:96 1 MAKVKYGADSIVVDLEKYSKDME-KWVKKGITKTTLKIYNTAIHL-----MP---------------------------- 46 (135) T ss_pred CchhhhhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHh-----CC---------------------------- Confidence 886 34455555655555442 122334555555554443221 12 Q ss_pred hhhhhhhcceeeEEEcCcEEEEEecccccccccccccCccccccC--------------CCc---eeeecCccccCCCHH Q lcl|NC_015266. 
78 MFRKLRTARYLRIDVDDTGLAIGFDDRLSRIVRVHQEGQKAPVEP--------------GGP---LAQYPVRVVLGFSSA 140 (155) Q Consensus 78 l~~~~~l~~sl~~~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~--------------~~~---~v~iPaRp~LG~s~~ 140 (155) .++|.|.+||.+....+++.+. .|++..||...+||....... .+. ...+|++|||= . T Consensus 47 -vdTG~Lr~SI~~~~~~~g~~~~-V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~a~pfl~---~ 121 (135) T protein:vir:96 47 -VDTGFLRQSTTVDFENGGFTGV-VKIGSNYAVYVNYGTGIYATKGSRAHKIPWTYKDPNGKWHTTYGQMPQPFWE---P 121 (135) T ss_pred -ccchhhhcceeEEeecCcEEEE-EecCCCccchhhcccccccCCCccccccccccccCCcceeecCCcCCCcchh---H Confidence 2567788899888877764432 289999999999996432110 011 13489999994 3 Q ss_pred HHHHHHHHHHHHhc Q lcl|NC_015266. 141 DRELVRDRLLRYLN 154 (155) Q Consensus 141 d~~~I~~~i~~~l~ 154 (155) -.++....+.+.|+ T Consensus 122 A~~~~~~~~~~~i~ 135 (135) T protein:vir:96 122 AIDAGRQTFEQYFS 135 (135) T ss_pred HHHHHHHHHHHhcC Confidence 34444555555555 No 37 >protein:vir:5978 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690678;genbank:geneid:6329146;genbank:gi:22855072;interpro:IPR011693;uniprot:O48447;genbank:GeneID:955318 Probab=97.66 E-value=1e-06 Score=53.43 Aligned_cols=118 Identities=11% Similarity=0.097 Sum_probs=65.8 Q ss_pred CchhH-----HHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccc Q lcl|NC_015266. 1 MDDDL-----RALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKR 75 (155) Q Consensus 1 m~~~~-----~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~ 75 (155) |++-+ .++...|..+...+... 
-.+.+.+.++.+....+.+ .| T Consensus 4 ms~~i~~~g~~~l~~~l~~~~~~~~~~-v~~~l~~~a~~i~~~ak~~-----ap-------------------------- 51 (144) T protein:vir:59 4 MSVRIDPSWRRIMSRNVRTFSGHVLTQ-VEQVIIKTAEKIAGLAASL-----AP-------------------------- 51 (144) T ss_pred ceeeehhHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHh-----CC-------------------------- Confidence 77733 34444444444443211 1234455555554443322 11 Q ss_pred hhhhhhhhhcceeeEEEcCcEEEEEecccccccccccccCcccccc-CCC-------------c---eeeecCccccCCC Q lcl|NC_015266. 76 QAMFRKLRTARYLRIDVDDTGLAIGFDDRLSRIVRVHQEGQKAPVE-PGG-------------P---LAQYPVRVVLGFS 138 (155) Q Consensus 76 ~~l~~~~~l~~sl~~~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~-~~~-------------~---~v~iPaRp~LG~s 138 (155) .++|.|.+||.+..+.++.++.+ |++..||..+.||...... ++. . ...+||+|||-=+ T Consensus 52 ---v~TG~Lr~SI~~~~~~~g~~~~V-~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA 127 (144) T protein:vir:59 52 ---VDEGNLKNSIQIDYKNNGLTAEI-TVGAEYAIYVEYGTGIYAVDGNGRKTPWTYYSPKLGRYVRTQGAPAQPFFWPA 127 (144) T ss_pred ---ccchhhhcCeeEEeecCcEEEEE-ecCCCccchhhcCccccccCCCccccccccccccccceecCCCCCCCcchhHH Confidence 14678889999888877654322 8999999999999643221 111 1 1248999999654 Q ss_pred HH-HHHHHHHHHHHHhc Q lcl|NC_015266. 139 SA-DRELVRDRLLRYLN 154 (155) Q Consensus 139 ~~-d~~~I~~~i~~~l~ 154 (155) -+ .+..|.+.|.+.+- T Consensus 128 ~~~~~~~~~~~i~~~~g 144 (144) T protein:vir:59 128 VEEGGEYFEREMRRLRG 144 (144) T ss_pred HHHHHHHHHHHHHHhcC Confidence 32 34455555555555 No 38 >protein:vir:96121 Length: 137 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240082;genbank:gi:66395767;genbank:GeneID:5133101 Probab=97.63 E-value=1.9e-06 Score=51.89 Aligned_cols=115 Identities=10% Similarity=-0.029 Sum_probs=68.1 Q ss_pred Cch---hHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchh Q lcl|NC_015266. 
1 MDD---DLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQA 77 (155) Q Consensus 1 m~~---~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~ 77 (155) |.. .+.+|.+.|..+-..+... -.+.+.+.|..+....+.. .| T Consensus 1 Ma~~~~G~~~l~~~l~~~~~~~~~~-~~~~l~~~a~~~~~~ak~~-----~p---------------------------- 46 (137) T protein:vir:96 1 MAKVKYGNWDLVAELEDYRDEMEEW-VKKGILKTTLAIYNTAVAL-----AP---------------------------- 46 (137) T ss_pred CchhHhhHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHh-----CC---------------------------- Confidence 887 5666666666655544322 2334556666665554422 12 Q ss_pred hhhhhhhcceeeEEEcCcEEEEEecccccccccccccCccccccCC----------------Cc---eeeecCccccCCC Q lcl|NC_015266. 78 MFRKLRTARYLRIDVDDTGLAIGFDDRLSRIVRVHQEGQKAPVEPG----------------GP---LAQYPVRVVLGFS 138 (155) Q Consensus 78 l~~~~~l~~sl~~~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~----------------~~---~v~iPaRp~LG~s 138 (155) .++|.|.+||.++...++..+ ..|++..||....||...-...+ +. ...+||+|||-=+ T Consensus 47 -vdTG~L~~Si~~~~~~~g~~~-~V~~~~~YA~yvE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~pFl~pA 124 (137) T protein:vir:96 47 -VDLGFLKESIDFKVTDGGFSS-VISVGAEYAIYVEFGTGIYATGPGGSRARKLPWTYKGDDGEWHTTYGQQAQPFWNPA 124 (137) T ss_pred -cCccchhcCceeEeecCceEE-EEecCCCcccccccCccccccCCCccccccccceeeccCcceeecCCCCCCcchhHH Confidence 145677888988877666543 23899999999999964321110 01 1348999999533 Q ss_pred HHHHHHHHHHHHHHhc Q lcl|NC_015266. 
139 SADRELVRDRLLRYLN 154 (155) Q Consensus 139 ~~d~~~I~~~i~~~l~ 154 (155) .++-...|...|+ T Consensus 125 ---~~~~~~~i~k~i~ 137 (137) T protein:vir:96 125 ---IDEGRKVFNRYFS 137 (137) T ss_pred ---HHHHHHHHHHhhC Confidence 3344445555555 No 39 >protein:vir:95789 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950593;genbank:gi:119953788;genbank:GeneID:5076859 Probab=97.60 E-value=1.3e-06 Score=52.92 Aligned_cols=107 Identities=8% Similarity=0.002 Sum_probs=74.5 Q ss_pred Cchh---HHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchh Q lcl|NC_015266. 1 MDDD---LRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQA 77 (155) Q Consensus 1 m~~~---~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~ 77 (155) |+=. |++|.+.|..+...... .-...+.+.|..+....+.+ .| T Consensus 1 msi~i~Gld~l~~~l~~~~~~~~~-~v~~al~~~a~~i~~~ak~~-----aP---------------------------- 46 (114) T protein:vir:95 1 MAIKWQGIEKLVATISNAQPKAVE-QSLQVLKNNGEKGKRIAKQL-----AP---------------------------- 46 (114) T ss_pred CeeeeehHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHh-----CC---------------------------- Confidence 7743 55555555555544422 22345666677666655544 22 Q ss_pred hhhhhhhcceeeEEEcCcEEEEEecccccccccccccCccccccCCCceeeecCccccCCCH-HHHHHHHHHHHHHhcC Q lcl|NC_015266. 78 MFRKLRTARYLRIDVDDTGLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSS-ADRELVRDRLLRYLNR 155 (155) Q Consensus 78 l~~~~~l~~sl~~~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~-~d~~~I~~~i~~~l~r 155 (155) .++|.|.+||+++.++....| |++..||..-.||.. 
.+||+|||.=+- .....+.+.|.+.|.+ T Consensus 47 -v~TG~Lr~sI~~~~~g~~~~V---~~~~~Ya~yvE~GT~----------~~~aqPfl~pa~~~~~~~~~~~l~~~l~~ 111 (114) T protein:vir:95 47 -KDTEFLKDHITTSYPGMEAHI---HGEAGYDGYQEYGTR----------FQPGTPHFRPMMEQIQPQFQKDMTDVMKG 111 (114) T ss_pred -cCchhhhhceeeecCceEEEe---ecCCCccceeecCcc----------ccCCCccchhhHHHHHHHHHHHHHHHHHh Confidence 134667788988888888888 889999999999954 489999998874 4567888889999988 No 40 >protein:vir:94108 Length: 149 # NCBI annotation: ORF029 # Family: family:all:180 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240238;genbank:gi:66395914;genbank:GeneID:5133277 Probab=97.47 E-value=3.8e-06 Score=50.31 Aligned_cols=115 Identities=8% Similarity=-0.072 Sum_probs=64.2 Q ss_pred Cchh---HHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchh Q lcl|NC_015266. 1 MDDD---LRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQA 77 (155) Q Consensus 1 m~~~---~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~ 77 (155) |.+- +++|.+.|..+...+.. .-.+.+.+.++.+....+.+ .| T Consensus 13 Ma~~~~Gld~l~~~L~~~~~~~~~-~~~~al~~~a~~v~~~ak~~-----aP---------------------------- 58 (149) T protein:vir:94 13 MAKVKYGADSMVVELDKFDKKIEE-WVKKGIAKTTTKIYNTAVAL-----AP---------------------------- 58 (149) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHh-----CC---------------------------- Confidence 6653 33455555544444321 22234455555555443321 11 Q ss_pred hhhhhhhcceeeEEEcCcEEEEEecccccccccccccCcccccc---CC----------------CceeeecCccccCCC Q lcl|NC_015266. 78 MFRKLRTARYLRIDVDDTGLAIGFDDRLSRIVRVHQEGQKAPVE---PG----------------GPLAQYPVRVVLGFS 138 (155) Q Consensus 78 l~~~~~l~~sl~~~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~---~~----------------~~~v~iPaRp~LG~s 138 (155) .++|.|.+||.+....+++... .|++..||....||...-.. 
++ .....+||||||-= T Consensus 59 -vdTG~Lr~SI~~~~~~~g~~~~-V~~~~~YA~~VE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~p- 135 (149) T protein:vir:94 59 -VDLGFLEESIDFKYFDGGLSSV-ISVGADYAIYVEYGTGIYATGPGGSRATKIPWSFKGDDGEWYTTYGQAPQPFWNP- 135 (149) T ss_pred -cccchhhcCeeEEeeCCcEEEE-EecCCCcccccccCccccccCCCccccccccceeecCccceecCCCCCCCcchHH- Confidence 2467788999988888866532 38999999999999643211 00 01134799999953 Q ss_pred HHHHHHHHHHHHHHhc Q lcl|NC_015266. 139 SADRELVRDRLLRYLN 154 (155) Q Consensus 139 ~~d~~~I~~~i~~~l~ 154 (155) -.++-...|.+.|+ T Consensus 136 --A~~~~~~~i~~~i~ 149 (149) T protein:vir:94 136 --AIDAGRKTFEQYFS 149 (149) T ss_pred --HHHHHHHHHHHhhC Confidence 23334445555555 No 41 >protein:vir:9930 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795692;genbank:gi:28876456;genbank:GeneID:1257995 Probab=97.45 E-value=3.4e-06 Score=50.56 Aligned_cols=106 Identities=7% Similarity=0.019 Sum_probs=68.5 Q ss_pred chhHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhhhhh Q lcl|NC_015266. 2 DDDLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAMFRK 81 (155) Q Consensus 2 ~~~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l~~~ 81 (155) =+-|++|.+.|..+...+...- ++.|.+.|..+....+. ..| .++ T Consensus 1 i~Gld~l~~~l~~~~~~~~~~v-~~al~~~a~~i~~~ak~-----~aP-----------------------------v~T 45 (108) T protein:vir:99 1 MRGLDRFLRSVERKQKSVRIAV-DKELSKSAARIERQAKI-----LAP-----------------------------VDT 45 (108) T ss_pred CchHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHh-----cCC-----------------------------cCc Confidence 2245555555555554443221 23455556555444322 123 135 Q ss_pred hhhcceeeEEEcCc-EEEEEecccccccccccccCccccccCCCceeeecCccccCCCHH-HHHHHHHHHHHHhcC Q lcl|NC_015266. 
82 LRTARYLRIDVDDT-GLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSSA-DRELVRDRLLRYLNR 155 (155) Q Consensus 82 ~~l~~sl~~~~~~~-~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~~-d~~~I~~~i~~~l~r 155 (155) |.|.+||.+..+++ .+.| +++..||...-||.. .+||||||.=+-+ ....+.+.|.+.|.| T Consensus 46 G~Lr~sI~~~~~~~~~~~v---~~~~~Ya~~vE~GT~----------~m~a~Pf~~pa~~~~~~~~~~~i~~~lrk 108 (108) T protein:vir:99 46 GWLRAQIYSEQQRLLHYRV---VSPALYSIYLELGTR----------KMEAQSFLDPALRKEWPVLMANIKKMFKR 108 (108) T ss_pred hhhhcceeeeecCcEEEEe---ecCcccchhcccCcc----------ccCCCcchhhhHHHHHHHHHHHHHHHhcC Confidence 66788888877654 3445 889999999999964 4899999987744 566778889999999 No 42 >protein:vir:96486 Length: 112 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238496;genbank:gi:66391772;genbank:GeneID:5176908 Probab=97.40 E-value=4.1e-06 Score=50.11 Aligned_cols=106 Identities=10% Similarity=0.184 Sum_probs=61.1 Q ss_pred Cch-h---HHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccch Q lcl|NC_015266. 1 MDD-D---LRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQ 76 (155) Q Consensus 1 m~~-~---~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~ 76 (155) |++ + +++|...|..+. .+.+-+..+++.+..+....++.-. ...| T Consensus 1 Ma~i~i~Gld~L~~~l~~~~---~~~~v~~~v~~~~~~~~~~~~~~a~-~~ap--------------------------- 49 (112) T protein:vir:96 1 MATIEFEGLDEMAQSLLKNA---SSERRSKVLRKYGAKLKEAAVSKAQ-FKKG--------------------------- 49 (112) T ss_pred CceeeehHHHHHHHHHHhhc---CHHHHHHHHHHHHHHHHHHHHHHhh-hcCC--------------------------- Confidence 886 3 344444443221 1222234445544444443333222 1122 Q ss_pred hhhhhhhhcceeeEEEcCcEEEEEecccccccccccccCccccccCCCceeeecCccccCCCHHHHHHHHHHHHHHhcC Q lcl|NC_015266. 
77 AMFRKLRTARYLRIDVDDTGLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSSADRELVRDRLLRYLNR 155 (155) Q Consensus 77 ~l~~~~~l~~sl~~~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~~d~~~I~~~i~~~l~r 155 (155) .++|.|.+||++..++..+.| |++..||....||.. .+||||||+=.-+..+.. +.+-|.| T Consensus 50 --vdTG~Lr~sI~~~~~~~~~~v---~~~~~Ya~~vE~GTr----------~m~AqPF~~PA~~~~~~~---~~~~l~~ 110 (112) T protein:vir:96 50 --YSTGATRRSITLEAGSDRAVV---EALTNYSGYLEVGTR----------KMEAQPFMRPALDQVVPE---MVEEMAK 110 (112) T ss_pred --CCchhhhhceeeecCceEEEe---cCCCCccceeccCcc----------ccCCCCchhhhHHHHHHH---HHHHHHh Confidence 245677888998888888888 899999999999964 489999998554333222 3333333 No 43 >protein:vir:99101 Length: 142 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1608 # MgeName: Qyrzula # Cross-refs: genbank:acc:YP_655705;genbank:gi:109521783;genbank:GeneID:4157823 Probab=97.35 E-value=5.3e-06 Score=49.49 Aligned_cols=113 Identities=17% Similarity=0.142 Sum_probs=52.6 Q ss_pred Cch---hHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchh Q lcl|NC_015266. 1 MDD---DLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQA 77 (155) Q Consensus 1 m~~---~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~ 77 (155) |.+ .++-|+..|..+-+.+... -++.+..++..+....+.. .|| T Consensus 1 m~~~~~~~~gl~~~l~~~~~~~~~~-~~~~i~~~a~~v~~~Ak~~---------aPv----------------------- 47 (142) T protein:vir:99 1 MVQVSVRYEGFDYNPVGAAAQVGPI-LRRTHSSLTRQIANETRAR---------VPV----------------------- 47 (142) T ss_pred CceeEEEeeecchhHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHh---------CCc----------------------- Confidence 322 1222222222222222111 1223445555444444322 122 Q ss_pred hhhhhhhcceeeEEEcCc----EEEEEecccccccccccccCcc---ccccCC-------------Cceeeec---Cccc Q lcl|NC_015266. 
78 MFRKLRTARYLRIDVDDT----GLAIGFDDRLSRIVRVHQEGQK---APVEPG-------------GPLAQYP---VRVV 134 (155) Q Consensus 78 l~~~~~l~~sl~~~~~~~----~~~v~~~G~~~~yAaiHqfG~~---~~~~~~-------------~~~v~iP---aRp~ 134 (155) ++|.|.+||.+....+ ++.++ .+++..||..|+||.. +.+..+ .++|+.| ++|| T Consensus 48 --~tG~Lr~SI~~~~~~~~~~~~~~~~-v~~~a~YA~~ve~GT~ph~i~pk~~~al~f~~~g~~~~~k~v~hpG~~a~Pf 124 (142) T protein:vir:99 48 --LTGHLGRSVREDPQVMVTPFHVSGG-VTAHAKYAAAVHEGTRPHVIRAKHAQALHFWWRGREVFVRQVNHPGTRARPY 124 (142) T ss_pred --cchhhhcceeeeeccccccceEEEE-eccCccccceeccCCccceeccccCceeeEecCCceeeeeeeecCCCCCCch Confidence 3466778887665433 23332 2789999999999974 222211 1346655 9999 Q ss_pred cCCCHHHHHHHHHHHHHHhcC Q lcl|NC_015266. 135 LGFSSADRELVRDRLLRYLNR 155 (155) Q Consensus 135 LG~s~~d~~~I~~~i~~~l~r 155 (155) |- ..+.+...+...+ T Consensus 125 l~------~A~~~~~~~~~~~ 139 (142) T protein:vir:99 125 LR------NAGEAVVRRDRRI 139 (142) T ss_pred hH------HHHHHHHhhhhhh Confidence 93 2222222222222 No 44 >protein:vir:8669 Length: 142 # NCBI annotation: gp27 # Family: family:all:1084 # MgeID: mge:156 # MgeName: Rosebush # Cross-refs: genbank:acc:NP_817788;genbank:gi:29566220;genbank:GeneID:1259476 Probab=97.35 E-value=5.3e-06 Score=49.49 Aligned_cols=113 Identities=17% Similarity=0.142 Sum_probs=52.6 Q ss_pred Cch---hHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchh Q lcl|NC_015266. 1 MDD---DLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQA 77 (155) Q Consensus 1 m~~---~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~ 77 (155) |.+ .++-|+..|..+-+.+... -++.+..++..+....+.. 
.|| T Consensus 1 m~~~~~~~~gl~~~l~~~~~~~~~~-~~~~i~~~a~~v~~~Ak~~---------aPv----------------------- 47 (142) T protein:vir:86 1 MVQVSVRYEGFDYNPVGAAAQVGPI-LRRTHSSLTRQIANETRAR---------VPV----------------------- 47 (142) T ss_pred CceeEEEeeecchhHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHh---------CCc----------------------- Confidence 322 1222222222222222111 1223445555444444322 122 Q ss_pred hhhhhhhcceeeEEEcCc----EEEEEecccccccccccccCcc---ccccCC-------------Cceeeec---Cccc Q lcl|NC_015266. 78 MFRKLRTARYLRIDVDDT----GLAIGFDDRLSRIVRVHQEGQK---APVEPG-------------GPLAQYP---VRVV 134 (155) Q Consensus 78 l~~~~~l~~sl~~~~~~~----~~~v~~~G~~~~yAaiHqfG~~---~~~~~~-------------~~~v~iP---aRp~ 134 (155) ++|.|.+||.+....+ ++.++ .+++..||..|+||.. +.+..+ .++|+.| ++|| T Consensus 48 --~tG~Lr~SI~~~~~~~~~~~~~~~~-v~~~a~YA~~ve~GT~ph~i~pk~~~al~f~~~g~~~~~k~v~hpG~~a~Pf 124 (142) T protein:vir:86 48 --LTGHLGRSVREDPQVMVTPFHVSGG-VTAHAKYAAAVHEGTRPHVIRAKHAQALHFWWRGREVFVRQVNHPGTRARPY 124 (142) T ss_pred --cchhhhcceeeeeccccccceEEEE-eccCccccceeccCCccceeccccCceeeEecCCceeeeeeeecCCCCCCch Confidence 3466778887665433 23332 2789999999999974 222211 1346655 9999 Q ss_pred cCCCHHHHHHHHHHHHHHhcC Q lcl|NC_015266. 135 LGFSSADRELVRDRLLRYLNR 155 (155) Q Consensus 135 LG~s~~d~~~I~~~i~~~l~r 155 (155) |- ..+.+...+...+ T Consensus 125 l~------~A~~~~~~~~~~~ 139 (142) T protein:vir:86 125 LR------NAGEAVVRRDRRI 139 (142) T ss_pred hH------HHHHHHHhhhhhh Confidence 93 2222222222222 No 45 >protein:vir:105916 Length: 149 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004379;genbank:gi:122891834;genbank:GeneID:4712387 Probab=97.28 E-value=5.9e-06 Score=49.24 Aligned_cols=115 Identities=9% Similarity=-0.037 Sum_probs=63.0 Q ss_pred Cchh---HHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchh Q lcl|NC_015266. 
1 MDDD---LRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQA 77 (155) Q Consensus 1 m~~~---~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~ 77 (155) |.+- +++|...|..+...+.. .-.+.+.+.++.+....+.. .| T Consensus 13 Ma~v~~Gld~l~~~l~~~~~~~~~-~~~~~l~~~a~~v~~~ak~~-----aP---------------------------- 58 (149) T protein:vir:10 13 MAKVKYGADSMVVELDKFDKKIEE-WVKKGIAKTTTKIYNTAVAL-----AP---------------------------- 58 (149) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHh-----CC---------------------------- Confidence 7653 33444444444443321 22234445555554443221 12 Q ss_pred hhhhhhhcceeeEEEcCcEEEEEecccccccccccccCcccccc---CC-------------C---ceeeecCccccCCC Q lcl|NC_015266. 78 MFRKLRTARYLRIDVDDTGLAIGFDDRLSRIVRVHQEGQKAPVE---PG-------------G---PLAQYPVRVVLGFS 138 (155) Q Consensus 78 l~~~~~l~~sl~~~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~---~~-------------~---~~v~iPaRp~LG~s 138 (155) .++|.|.+||.+....++++.. .|++..||....||...-.. ++ + ....+||||||-=+ T Consensus 59 -vdTG~L~~SI~~~~~~~g~~~~-V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA 136 (149) T protein:vir:10 59 -VDLGFLEESIDFKYFDGGLSSV-ISVGADYAIYVEYGTGIYATGPGGSRATKIPWSFKGDDGEWYTTYGQAPQPFWNPA 136 (149) T ss_pred -cccchhhccceEEecCCcEEEE-EecCCCcccccccCccccccCCcccccccccceeeccccceecCCCCCCCcchhHH Confidence 2467788999988888765432 38999999999999643111 00 0 11347999999533 Q ss_pred HHHHHHHHHHHHHHhc Q lcl|NC_015266. 
139 SADRELVRDRLLRYLN 154 (155) Q Consensus 139 ~~d~~~I~~~i~~~l~ 154 (155) .++-...|.+.|+ T Consensus 137 ---~~~~k~~i~~~i~ 149 (149) T protein:vir:10 137 ---IDAGRKTFEQYFS 149 (149) T ss_pred ---HHHHHHHHHHhhC Confidence 3333344445555 No 46 >protein:vir:97088 Length: 157 # NCBI annotation: hypothetical protein # Family: family:all:2714 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453568;genbank:gi:84662603;genbank:GeneID:5142503 Probab=97.24 E-value=4.6e-06 Score=49.84 Aligned_cols=120 Identities=11% Similarity=0.060 Sum_probs=69.7 Q ss_pred CchhHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhhhh Q lcl|NC_015266. 1 MDDDLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAMFR 80 (155) Q Consensus 1 m~~~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l~~ 80 (155) =+-||.+|.+.|+.|-+. ...--++.+.+=|+.++...+.+. |. + T Consensus 6 ~~~d~s~l~~~l~~l~~~-~~~v~R~A~~~ga~vv~dear~~a-----P~-----------------------------~ 50 (157) T protein:vir:97 6 RSVDITGILAGLETVVEH-SSDVVRTMTYESAVAVRESAKAFV-----ND-----------------------------E 50 (157) T ss_pred ecccHHHHHHHHHHhHHH-HHHHHHHHHHHHHHHHHHHHHHhC-----CC-----------------------------C Confidence 056888888888877432 222234566666777777776543 32 1 Q ss_pred hhhhcceeeEEEc----CcEE---EEEecccccccccccccCccccc----cCC----------CceeeecCccccCCC- Q lcl|NC_015266. 81 KLRTARYLRIDVD----DTGL---AIGFDDRLSRIVRVHQEGQKAPV----EPG----------GPLAQYPVRVVLGFS- 138 (155) Q Consensus 81 ~~~l~~sl~~~~~----~~~~---~v~~~G~~~~yAaiHqfG~~~~~----~~~----------~~~v~iPaRp~LG~s- 138 (155) +|.|.++|..... .++. .|++......|+....||-.... .+. +..+.|||+|||-=. 
T Consensus 51 tG~LkksI~~~~~~~~s~~g~~~~~Vg~~~~~a~~g~~vEfG~~~~~~~~~~~~~~~~~~~~~~~t~~~~Pa~PFlRPA~ 130 (157) T protein:vir:97 51 TGKLRNNLYVAYSPEESVEGIQTYAVSWRKKAAPHGHLLEFGHWQTHAAYRDKDGQWYSSKVKLVNPKWIPAKPFLRPGY 130 (157) T ss_pred cchhhhheeeeeccccCCCceEEEEEeecCCccceeeeeecCcccccccccCCcccccccccccCCCCcCCCCcccchHH Confidence 2334445543221 1222 26665667789999999943311 011 123569999999744 Q ss_pred HHHHHHHHHHHHHHhcC Q lcl|NC_015266. 139 SADRELVRDRLLRYLNR 155 (155) Q Consensus 139 ~~d~~~I~~~i~~~l~r 155 (155) +...+++.+.+.+.|.+ T Consensus 131 d~~k~~a~~~~~~~l~k 147 (157) T protein:vir:97 131 DSVAMQIPDIARAAGAK 147 (157) T ss_pred HHhHHHHHHHHHHHHHH Confidence 23456677777777766 No 47 >protein:vir:4347 Length: 164 # NCBI annotation: Orf14 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061510;genbank:gi:9635606;genbank:GeneID:1262873 Probab=97.14 E-value=4.4e-06 Score=49.95 Aligned_cols=139 Identities=11% Similarity=0.030 Sum_probs=69.7 Q ss_pred Cch-------hHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhh-hccccccCc Q lcl|NC_015266. 1 MDD-------DLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGG-KGLRTKVGR 72 (155) Q Consensus 1 m~~-------~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~-~~~~~~~~~ 72 (155) |.| -|++|...|..|-..+...--+.-|+.-|+.+....+.+.-.-.+| +.+.+-..+- .....+++. T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~----~~~~~l~~~i~~~~~~~~~~ 76 (164) T protein:vir:43 1 MADTVEFSITGLDSLLGKLDSVTDDVKRRGGRAALRKAAMIVVQAAKQGAEKVDDP----GTGRSISDNIALRWNGRLFK 76 (164) T ss_pred CCcceEEeeecHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccCC----CccchhhhhhhhhcccCccc Confidence 776 4667877777776554322234567777888888777776433222 2221111000 000001111 Q ss_pred ccchhhhhhhhhcceeeEEEcCcEEEEEecccccccccccccCccccccCCCceeeecCccccCCCH-HHHHHHHHHHHH Q lcl|NC_015266. 
73 IKRQAMFRKLRTARYLRIDVDDTGLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSS-ADRELVRDRLLR 151 (155) Q Consensus 73 ~~~~~l~~~~~l~~sl~~~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~-~d~~~I~~~i~~ 151 (155) .........+.......... ...+.++ .+++..|++.+-||.. .+||||||.=.- +..+++++++.+ T Consensus 77 ~~~~~~~~vg~~~~~~~~~~-~~~~~~~-~~~~~~y~~f~EfGT~----------km~a~PFlrPA~~~~k~~~~~~~~~ 144 (164) T protein:vir:43 77 RTGDLGFRIGVLHGAVLPKK-GERSDKT-ANAPTPHWRLLEFGTE----------DMRAQPFMRSALADNIAEVTSTFVS 144 (164) T ss_pred cccceeEEeccccccccccc-ccccccC-CCCCcceEEEeecCCC----------CCCCCcchhhhHHHhHHHHHHHHHH Confidence 00000011110000000000 1111121 2556789999999953 589999998874 467777777777 Q ss_pred HhcC Q lcl|NC_015266. 152 YLNR 155 (155) Q Consensus 152 ~l~r 155 (155) .|.+ T Consensus 145 ~l~~ 148 (164) T protein:vir:43 145 EYEK 148 (164) T ss_pred HHHH Confidence 6666 No 48 >protein:vir:106506 Length: 137 # NCBI annotation: Pas21 # Family: family:all:1084 # MgeID: mge:1680 # MgeName: phiAsp2 # Cross-refs: genbank:acc:YP_024807;genbank:gi:48697422;genbank:GeneID:2846163 Probab=97.01 E-value=5.4e-06 Score=49.48 Aligned_cols=108 Identities=17% Similarity=0.133 Sum_probs=55.4 Q ss_pred CchhHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhhhh Q lcl|NC_015266. 1 MDDDLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAMFR 80 (155) Q Consensus 1 m~~~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l~~ 80 (155) |=-...+|+.. .++++..+.-++.++.++..+....+. ..| .+ T Consensus 1 ~~~~~~~l~~~---~l~~~~~~~~~~~~~~~a~~ve~~ak~-----~aP-----------------------------v~ 43 (137) T protein:vir:10 1 MVAHTLRIERA---QLHGLGMDEARKAVNRVVRRTFTRSQI-----LAP-----------------------------VD 43 (137) T ss_pred CcccccccChh---hHhhHHHHHHHHHHHHHHHHHHHHHHh-----cCC-----------------------------cC Confidence 43333233221 111111111122344444444443211 112 35 Q ss_pred hhhhcceeeEEEc-CcEE-EEEecccccccccccccCcc---ccccCC-------------Cceeeec---CccccCCCH Q lcl|NC_015266. 
81 KLRTARYLRIDVD-DTGL-AIGFDDRLSRIVRVHQEGQK---APVEPG-------------GPLAQYP---VRVVLGFSS 139 (155) Q Consensus 81 ~~~l~~sl~~~~~-~~~~-~v~~~G~~~~yAaiHqfG~~---~~~~~~-------------~~~v~iP---aRp~LG~s~ 139 (155) +|.|.+||++... +++. .++..+++..||.+||||.. ++++.+ .++|+.| +|||| T Consensus 44 TG~Lr~SI~~~~~~~~g~~v~~~V~~~~~YA~~ve~GT~ph~I~pk~~kaL~f~~~G~~vf~k~V~hPG~k~~PfL---- 119 (137) T protein:vir:10 44 TGYLRASGRLVLGRERGAVVIGSVEYTARYAAAVHNGRRALTIRAKGNGRLKFTVEGRTVYARSVHQPARAGRPYL---- 119 (137) T ss_pred chhhhccceeeeeeccccEEEEEecCCcccceeeecCCCCceeecCCCccceeecCCeeEeccceecCCCCCChhh---- Confidence 6778889988764 3322 22234899999999999963 333221 2456766 99996 Q ss_pred HHHHHHHHHHHHHhcC Q lcl|NC_015266. 140 ADRELVRDRLLRYLNR 155 (155) Q Consensus 140 ~d~~~I~~~i~~~l~r 155 (155) .+.+.+...+ T Consensus 120 ------~~Al~~~~~~ 129 (137) T protein:vir:10 120 ------SQALREVAPQ 129 (137) T ss_pred ------HHHHHHhhcc Confidence 4455566666 No 49 >protein:vir:106623 Length: 115 # NCBI annotation: ORF049 # Family: family:all:180 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239497;genbank:gi:66395260;genbank:GeneID:4555777 Probab=97.00 E-value=2.1e-05 Score=46.26 Aligned_cols=112 Identities=9% Similarity=0.035 Sum_probs=63.8 Q ss_pred Cchh-HHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhhh Q lcl|NC_015266. 1 MDDD-LRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAMF 79 (155) Q Consensus 1 m~~~-~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l~ 79 (155) |+=+ |++|...|..+-...... -.+.+.+-|..+....+..-.. 
-.+.|+ T Consensus 1 i~i~Gld~L~~~l~~~~~~~~~~-~~~al~~~~~~i~~~a~~~a~~---~~~~pv------------------------- 51 (115) T protein:vir:10 1 MQSKGLKKLMNHLKVMHDDIEDD-VDDILKNNAKEGVGIAVSNAKE---VMNKGY------------------------- 51 (115) T ss_pred CeehhHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhhcc---ccCCCC------------------------- Confidence 4432 555555555443332111 1234555565555555432110 011222 Q ss_pred hhhhhcceeeEEEcCc-EEEEEecccccccccccccCccccccCCCceeeecCccccCCCH-HHHHHHHHHHHHHhc Q lcl|NC_015266. 80 RKLRTARYLRIDVDDT-GLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSS-ADRELVRDRLLRYLN 154 (155) Q Consensus 80 ~~~~l~~sl~~~~~~~-~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~-~d~~~I~~~i~~~l~ 154 (155) ++|.|.+||....+++ .+.| +++..||....||.. .+||||||.=.- .....+.+.|.+-++ T Consensus 52 ~TG~Lr~sI~~~~~g~~~~~v---~~~~~Ya~~vEfGT~----------km~a~PFl~PA~~~~k~~~~~~i~~~i~ 115 (115) T protein:vir:10 52 WTGNLASLIEVKKIGDLHYRV---ISTAHYSGFLEFGTR----------YMEPAPFMFPTYQTLKKSTINDLKRLLS 115 (115) T ss_pred cchhhhhceeeeecCcEEEEe---eCCCccchheecccc----------cCCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 3566778887766554 2445 788999999999964 489999997552 344555666666666 No 50 >protein:vir:106570 Length: 182 # NCBI annotation: putative protein # Family: family:all:6475 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958588;genbank:gi:41179258;genbank:GeneID:2717106 Probab=96.95 E-value=1.9e-05 Score=46.52 Aligned_cols=124 Identities=13% Similarity=0.069 Sum_probs=68.7 Q ss_pred Cch-h---HHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccch Q lcl|NC_015266. 1 MDD-D---LRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQ 76 (155) Q Consensus 1 m~~-~---~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~ 76 (155) |-. + +.+|...|..+-+.+..+- .+.+..+.+.+....++..+.. 
.| T Consensus 1 m~~v~i~Gld~L~~kl~~~~~~~~~~v-~~a~~~~~~~~a~~v~~~ak~~-~P--------------------------- 51 (182) T protein:vir:10 1 MIEVELKGVNELRAKLKKLPDIMAKAT-ANAQENAIEQAEAYAVDELQSS-IK--------------------------- 51 (182) T ss_pred CeEEEEecHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhh-CC--------------------------- Confidence 332 3 4555555555444332111 1234444444433334333321 12 Q ss_pred hhhhhhhhcceeeEEEcCcE-EEEEecccccccccccccCccccc--c------------CCCc---------------- Q lcl|NC_015266. 77 AMFRKLRTARYLRIDVDDTG-LAIGFDDRLSRIVRVHQEGQKAPV--E------------PGGP---------------- 125 (155) Q Consensus 77 ~l~~~~~l~~sl~~~~~~~~-~~v~~~G~~~~yAaiHqfG~~~~~--~------------~~~~---------------- 125 (155) .++|.|.+||.+++..++ ..++..+++..||..+.||...-. . ...+ T Consensus 52 --vdtG~Lr~SI~~~~~~~~~~~~g~V~~~~~ya~yvE~GTG~~~~~~~~~~~p~~~~~~~~~~w~~~~~~v~~~~a~~~ 129 (182) T protein:vir:10 52 --YSTGELTRSFKHEVKVDGDEVIGRWWNSSMVAVFREFGTGLVGERSHKQLPKNVAIIYRQTPWFFPVDSVDLDLTKIY 129 (182) T ss_pred --CCchhhhhceeeeeeecCCeEEEEeecCCCccceeecCcccccccCccccCccceeeeecCCceeecccccccccccc Confidence 256778888976554332 233455899999999999963210 0 0000 Q ss_pred --------------eeeecCccccCCCHH-HHHHHHHHHHHHhcC Q lcl|NC_015266. 126 --------------LAQYPVRVVLGFSSA-DRELVRDRLLRYLNR 155 (155) Q Consensus 126 --------------~v~iPaRp~LG~s~~-d~~~I~~~i~~~l~r 155 (155) ...+||||||==+-+ .+..|.++|.+++.+ T Consensus 130 ~~~~~~~~~~~~~~t~G~~aqPFl~pA~~~~~~~i~~~i~~~i~~ 174 (182) T protein:vir:10 130 GIPKIKINGKYFYRTTGQPARQFMTPAANKMAKEAPEIIKRSIDQ 174 (182) T ss_pred ccceeeecCceEeecCCCCCCcchHHHHHHhHHHHHHHHHHHHHH Confidence 024799999966544 478888888888888 No 51 >protein:vir:3617 Length: 112 # NCBI annotation: ORF40 # Family: family:all:180 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112703;genbank:gi:13786571;genbank:GeneID:921069 Probab=96.93 E-value=1.4e-05 Score=47.14 Aligned_cols=106 Identities=11% Similarity=0.127 Sum_probs=62.6 Q ss_pred Cch--h---HHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccc Q lcl|NC_015266. 
1 MDD--D---LRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKR 75 (155) Q Consensus 1 m~~--~---~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~ 75 (155) |+- + |++|...|..+. ....-.+.+++.+..+....+.+ .| T Consensus 1 M~~~i~i~Gld~l~~~L~~~~---~~~~~~~al~~~~~~i~~~ak~~-----aP-------------------------- 46 (112) T protein:vir:36 1 MKSSLSFKGIDQLVKHLDKAA---SLKGVQQVVKSNTSNMTANMQKL-----VP-------------------------- 46 (112) T ss_pred CceeeeehhHHHHHHHHHhhh---hHHHHHHHHHHHHHHHHHHHHHh-----CC-------------------------- Confidence 544 2 334433333321 11222345556665555544321 22 Q ss_pred hhhhhhhhhcceeeEEEcCcEEEEEecccccccccccccCccccccCCCceeeecCccccCCC-HHHHHHHHHHHHHHhc Q lcl|NC_015266. 76 QAMFRKLRTARYLRIDVDDTGLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFS-SADRELVRDRLLRYLN 154 (155) Q Consensus 76 ~~l~~~~~l~~sl~~~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s-~~d~~~I~~~i~~~l~ 154 (155) .++|.|.+||+.....++..+.. |++..||....||.. .+||+|||-=+ +.....+.+.|.+.|- T Consensus 47 ---vdTG~Lr~si~~~~~~~~~~~~V-~~~~~Ya~~vE~GT~----------k~~a~Pfl~pa~~~~~~~~~~~i~~~lr 112 (112) T protein:vir:36 47 ---VDTGYMKRSIKMELTEGGFSGQA-GPHTDYSAYVEYGTR----------FQSAQPFVKPAYNEQKGVFIKDLERLLK 112 (112) T ss_pred ---CCchhhhhceeeeecCCceEEEe-ecCCCccceeecccc----------ccCCCcchhhhHHHHHHHHHHHHHHHcC Confidence 13456778888777766554432 889999999999954 48999999655 3445666777766666 No 52 >protein:vir:97144 Length: 115 # NCBI annotation: ORF047 # Family: family:all:180 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239729;genbank:gi:66394911;genbank:GeneID:5130877 Probab=96.90 E-value=2.2e-05 Score=46.16 Aligned_cols=110 Identities=11% Similarity=0.091 Sum_probs=65.2 Q ss_pred Cchh-HHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCC--CCcCcccchhhhhhccccccCcccchh Q lcl|NC_015266. 1 MDDD-LRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPD--GSAYVPRKIKKGGKGLRTKVGRIKRQA 77 (155) Q Consensus 1 m~~~-~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PD--G~pW~p~k~~~~~~~~~~~~~~~~~~~ 77 (155) |+=+ |++|.+.|..+-..+... 
-.+.+.+-|..+....+.+ +|. ..| T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~-v~~a~~~~~~~i~~~a~~~-----a~~~~~~p------------------------ 50 (115) T protein:vir:97 1 MNIDGLDALLNQFHDMKTNIDDD-VDDILQENAKEYVVRAKLK-----AREVMNKG------------------------ 50 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHh-----ccccCCCC------------------------ Confidence 4432 555555555443333211 1334555555555544433 121 111 Q ss_pred hhhhhhhcceeeEEEcCc-EEEEEecccccccccccccCccccccCCCceeeecCccccCCCH-HHHHHHHHHHHHHhc Q lcl|NC_015266. 78 MFRKLRTARYLRIDVDDT-GLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSS-ADRELVRDRLLRYLN 154 (155) Q Consensus 78 l~~~~~l~~sl~~~~~~~-~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~-~d~~~I~~~i~~~l~ 154 (155) .++|.|.+||.+..+++ .+.| |++..||....||.. .+||||||.=.- .....+.+.|.+.+. T Consensus 51 -~~TG~Lr~sI~~~~~g~~~~~v---~~~~~Ya~~vE~GT~----------km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:97 51 -YWTGNLSRNIRYKKTGDLQYTI---TSHAAYSGFLEFGTR----------YMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred -CCchhhhhcceeeecCceEEEe---ecCccchhhhccccc----------ccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 13567788888776543 3456 888899999999964 489999998774 455666666666666 No 53 >protein:vir:96225 Length: 115 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239574;genbank:gi:66395330;genbank:GeneID:5132773 Probab=96.90 E-value=2.2e-05 Score=46.16 Aligned_cols=110 Identities=11% Similarity=0.091 Sum_probs=65.2 Q ss_pred Cchh-HHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCC--CCcCcccchhhhhhccccccCcccchh Q lcl|NC_015266. 1 MDDD-LRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPD--GSAYVPRKIKKGGKGLRTKVGRIKRQA 77 (155) Q Consensus 1 m~~~-~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PD--G~pW~p~k~~~~~~~~~~~~~~~~~~~ 77 (155) |+=+ |++|.+.|..+-..+... -.+.+.+-|..+....+.+ +|. 
..| T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~-v~~a~~~~~~~i~~~a~~~-----a~~~~~~p------------------------ 50 (115) T protein:vir:96 1 MNIDGLDALLNQFHDMKTNIDDD-VDDILQENAKEYVVRAKLK-----AREVMNKG------------------------ 50 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHh-----ccccCCCC------------------------ Confidence 4432 555555555443333211 1334555555555544433 121 111 Q ss_pred hhhhhhhcceeeEEEcCc-EEEEEecccccccccccccCccccccCCCceeeecCccccCCCH-HHHHHHHHHHHHHhc Q lcl|NC_015266. 78 MFRKLRTARYLRIDVDDT-GLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSS-ADRELVRDRLLRYLN 154 (155) Q Consensus 78 l~~~~~l~~sl~~~~~~~-~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~-~d~~~I~~~i~~~l~ 154 (155) .++|.|.+||.+..+++ .+.| |++..||....||.. .+||||||.=.- .....+.+.|.+.+. T Consensus 51 -~~TG~Lr~sI~~~~~g~~~~~v---~~~~~Ya~~vE~GT~----------km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:96 51 -YWTGNLSRNIRYKKTGDLQYTI---TSHAAYSGFLEFGTR----------YMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred -CCchhhhhcceeeecCceEEEe---ecCccchhhhccccc----------ccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 13567788888776543 3456 888899999999964 489999998774 455666666666666 No 54 >protein:vir:103917 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873996;genbank:gi:118430771;genbank:GeneID:4525409 Probab=96.90 E-value=2.2e-05 Score=46.16 Aligned_cols=110 Identities=11% Similarity=0.091 Sum_probs=65.2 Q ss_pred Cchh-HHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCC--CCcCcccchhhhhhccccccCcccchh Q lcl|NC_015266. 1 MDDD-LRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPD--GSAYVPRKIKKGGKGLRTKVGRIKRQA 77 (155) Q Consensus 1 m~~~-~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PD--G~pW~p~k~~~~~~~~~~~~~~~~~~~ 77 (155) |+=+ |++|.+.|..+-..+... -.+.+.+-|..+....+.+ +|. 
..| T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~-v~~a~~~~~~~i~~~a~~~-----a~~~~~~p------------------------ 50 (115) T protein:vir:10 1 MNIDGLDALLNQFHDMKTNIDDD-VDDILQENAKEYVVRAKLK-----AREVMNKG------------------------ 50 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHh-----ccccCCCC------------------------ Confidence 4432 555555555443333211 1334555555555544433 121 111 Q ss_pred hhhhhhhcceeeEEEcCc-EEEEEecccccccccccccCccccccCCCceeeecCccccCCCH-HHHHHHHHHHHHHhc Q lcl|NC_015266. 78 MFRKLRTARYLRIDVDDT-GLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSS-ADRELVRDRLLRYLN 154 (155) Q Consensus 78 l~~~~~l~~sl~~~~~~~-~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~-~d~~~I~~~i~~~l~ 154 (155) .++|.|.+||.+..+++ .+.| |++..||....||.. .+||||||.=.- .....+.+.|.+.+. T Consensus 51 -~~TG~Lr~sI~~~~~g~~~~~v---~~~~~Ya~~vE~GT~----------km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:10 51 -YWTGNLSRNIRYKKTGDLQYTI---TSHAAYSGFLEFGTR----------YMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred -CCchhhhhcceeeecCceEEEe---ecCccchhhhccccc----------ccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 13567788888776543 3456 888899999999964 489999998774 455666666666666 No 55 >protein:vir:96358 Length: 115 # NCBI annotation: ORF045 # Family: family:all:180 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239651;genbank:gi:66395408;genbank:GeneID:5132834 Probab=96.90 E-value=2.2e-05 Score=46.16 Aligned_cols=110 Identities=11% Similarity=0.091 Sum_probs=65.2 Q ss_pred Cchh-HHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCC--CCcCcccchhhhhhccccccCcccchh Q lcl|NC_015266. 1 MDDD-LRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPD--GSAYVPRKIKKGGKGLRTKVGRIKRQA 77 (155) Q Consensus 1 m~~~-~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PD--G~pW~p~k~~~~~~~~~~~~~~~~~~~ 77 (155) |+=+ |++|.+.|..+-..+... -.+.+.+-|..+....+.+ +|. 
..| T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~-v~~a~~~~~~~i~~~a~~~-----a~~~~~~p------------------------ 50 (115) T protein:vir:96 1 MNIDGLDALLNQFHDMKTNIDDD-VDDILQENAKEYVVRAKLK-----AREVMNKG------------------------ 50 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHh-----ccccCCCC------------------------ Confidence 4432 555555555443333211 1334555555555544433 121 111 Q ss_pred hhhhhhhcceeeEEEcCc-EEEEEecccccccccccccCccccccCCCceeeecCccccCCCH-HHHHHHHHHHHHHhc Q lcl|NC_015266. 78 MFRKLRTARYLRIDVDDT-GLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSS-ADRELVRDRLLRYLN 154 (155) Q Consensus 78 l~~~~~l~~sl~~~~~~~-~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~-~d~~~I~~~i~~~l~ 154 (155) .++|.|.+||.+..+++ .+.| |++..||....||.. .+||||||.=.- .....+.+.|.+.+. T Consensus 51 -~~TG~Lr~sI~~~~~g~~~~~v---~~~~~Ya~~vE~GT~----------km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:96 51 -YWTGNLSRNIRYKKTGDLQYTI---TSHAAYSGFLEFGTR----------YMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred -CCchhhhhcceeeecCceEEEe---ecCccchhhhccccc----------ccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 13567788888776543 3456 888899999999964 489999998774 455666666666666 No 56 >protein:vir:78858 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285365;genbank:gi:148717893;genbank:GeneID:5246989 Probab=96.90 E-value=2.2e-05 Score=46.16 Aligned_cols=110 Identities=11% Similarity=0.091 Sum_probs=65.2 Q ss_pred Cchh-HHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCC--CCcCcccchhhhhhccccccCcccchh Q lcl|NC_015266. 1 MDDD-LRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPD--GSAYVPRKIKKGGKGLRTKVGRIKRQA 77 (155) Q Consensus 1 m~~~-~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PD--G~pW~p~k~~~~~~~~~~~~~~~~~~~ 77 (155) |+=+ |++|.+.|..+-..+... -.+.+.+-|..+....+.+ +|. 
..| T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~-v~~a~~~~~~~i~~~a~~~-----a~~~~~~p------------------------ 50 (115) T protein:vir:78 1 MNIDGLDALLNQFHDMKTNIDDD-VDDILQENAKEYVVRAKLK-----AREVMNKG------------------------ 50 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHh-----ccccCCCC------------------------ Confidence 4432 555555555443333211 1334555555555544433 121 111 Q ss_pred hhhhhhhcceeeEEEcCc-EEEEEecccccccccccccCccccccCCCceeeecCccccCCCH-HHHHHHHHHHHHHhc Q lcl|NC_015266. 78 MFRKLRTARYLRIDVDDT-GLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSS-ADRELVRDRLLRYLN 154 (155) Q Consensus 78 l~~~~~l~~sl~~~~~~~-~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~-~d~~~I~~~i~~~l~ 154 (155) .++|.|.+||.+..+++ .+.| |++..||....||.. .+||||||.=.- .....+.+.|.+.+. T Consensus 51 -~~TG~Lr~sI~~~~~g~~~~~v---~~~~~Ya~~vE~GT~----------km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:78 51 -YWTGNLSRNIRYKKTGDLQYTI---TSHAAYSGFLEFGTR----------YMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred -CCchhhhhcceeeecCceEEEe---ecCccchhhhccccc----------ccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 13567788888776543 3456 888899999999964 489999998774 455666666666666 No 57 >protein:vir:9312 Length: 115 # NCBI annotation: phi Mu50B-like protein # Family: family:all:180 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803290;genbank:gi:29028600;genbank:GeneID:1258048 Probab=96.90 E-value=2.2e-05 Score=46.16 Aligned_cols=110 Identities=11% Similarity=0.091 Sum_probs=65.2 Q ss_pred Cchh-HHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCC--CCcCcccchhhhhhccccccCcccchh Q lcl|NC_015266. 1 MDDD-LRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPD--GSAYVPRKIKKGGKGLRTKVGRIKRQA 77 (155) Q Consensus 1 m~~~-~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PD--G~pW~p~k~~~~~~~~~~~~~~~~~~~ 77 (155) |+=+ |++|.+.|..+-..+... -.+.+.+-|..+....+.+ +|. 
..| T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~-v~~a~~~~~~~i~~~a~~~-----a~~~~~~p------------------------ 50 (115) T protein:vir:93 1 MNIDGLDALLNQFHDMKTNIDDD-VDDILQENAKEYVVRAKLK-----AREVMNKG------------------------ 50 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHh-----ccccCCCC------------------------ Confidence 4432 555555555443333211 1334555555555544433 121 111 Q ss_pred hhhhhhhcceeeEEEcCc-EEEEEecccccccccccccCccccccCCCceeeecCccccCCCH-HHHHHHHHHHHHHhc Q lcl|NC_015266. 78 MFRKLRTARYLRIDVDDT-GLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSS-ADRELVRDRLLRYLN 154 (155) Q Consensus 78 l~~~~~l~~sl~~~~~~~-~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~-~d~~~I~~~i~~~l~ 154 (155) .++|.|.+||.+..+++ .+.| |++..||....||.. .+||||||.=.- .....+.+.|.+.+. T Consensus 51 -~~TG~Lr~sI~~~~~g~~~~~v---~~~~~Ya~~vE~GT~----------km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:93 51 -YWTGNLSRNIRYKKTGDLQYTI---TSHAAYSGFLEFGTR----------YMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred -CCchhhhhcceeeecCceEEEe---ecCccchhhhccccc----------ccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 13567788888776543 3456 888899999999964 489999998774 455666666666666 No 58 >protein:vir:78077 Length: 141 # NCBI annotation: gp9 # Family: family:all:180 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468793;genbank:gi:157325374;genbank:GeneID:5601839 Probab=96.83 E-value=5.4e-05 Score=43.96 Aligned_cols=121 Identities=12% Similarity=0.041 Sum_probs=62.3 Q ss_pred Cch-hHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhhh Q lcl|NC_015266. 1 MDD-DLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAMF 79 (155) Q Consensus 1 m~~-~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l~ 79 (155) |++ +|+.-.+.+ ...+.... .+-+..++..+. ...++.+.- ..+ =. 
T Consensus 1 ~~~~~f~~~~~~~---~~~~~k~~-~~~~~~~a~~~~---~~~ie~~ak-~~~-------------------------pv 47 (141) T protein:vir:78 1 MNEFEFDSNIPKA---RKLIEKKV-LQALEDIGEHMT---TELAEGGHG-VTS-------------------------NN 47 (141) T ss_pred CcchhHHHHHHHH---HHHHHHHH-HHHHHHHHHHHH---HHHHHHhhh-hcc-------------------------cc Confidence 765 344333332 22222111 111233333222 222222210 000 13 Q ss_pred hhhhhcceeeEEEcCcEEEEEecccccccccccccCccccccC-------------CCce---eeecCccccCCCH-HHH Q lcl|NC_015266. 80 RKLRTARYLRIDVDDTGLAIGFDDRLSRIVRVHQEGQKAPVEP-------------GGPL---AQYPVRVVLGFSS-ADR 142 (155) Q Consensus 80 ~~~~l~~sl~~~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~-------------~~~~---v~iPaRp~LG~s~-~d~ 142 (155) ++|.|.+||.+.+..++..+- .|++..||...+||....... .+++ .-.||+|||==+- +.+ T Consensus 48 dtG~L~~SI~~~v~~~g~~~~-V~~~~~YA~yVE~GTG~~~~~~~grk~~w~y~~~~g~~~~t~G~~aqpFl~~A~~~~~ 126 (141) T protein:vir:78 48 DTGEYAQKSGYKVRKSSKEVI-VGNSSDYAIYYEFGTGEKSERGGGKAGGWFYMDKKGHWHFTRGSQASKRMRYTFRDEQ 126 (141) T ss_pred ccchhhcceeeeeecCCcEEE-EecCCCccceeecCCcccccCCCCCcCcceeecCCCeeEeccCCCCchhhhhhHHhhH Confidence 567888999887654443332 289999999999996432211 1111 1379999995442 345 Q ss_pred HHHHHHHHHHhcC Q lcl|NC_015266. 143 ELVRDRLLRYLNR 155 (155) Q Consensus 143 ~~I~~~i~~~l~r 155 (155) .+|..+|.+.|.. T Consensus 127 ~~i~~~i~~~~~~ 139 (141) T protein:vir:78 127 DKVRVFTERALRG 139 (141) T ss_pred HHHHHHHHHHhhc Confidence 6677777777777 No 59 >protein:vir:107545 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1481 # MgeName: PG1 # Cross-refs: genbank:acc:NP_943803;genbank:gi:38638428;genbank:GeneID:2657225 Probab=96.77 E-value=1e-05 Score=48.00 Aligned_cols=106 Identities=20% Similarity=0.124 Sum_probs=53.0 Q ss_pred Cc---------hhHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccC Q lcl|NC_015266. 1 MD---------DDLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVG 71 (155) Q Consensus 1 m~---------~~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~ 71 (155) |. 
-|...+...+...+ ++.++.++..+....+.+ .| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~v~~~ak~~---------aP------------------ 45 (140) T protein:vir:10 1 MATIRARARIEIDEAALERESGEHL--------RAFHRSLTRRIANQSRVA---------VP------------------ 45 (140) T ss_pred CeeeeeeeeeeeCHHHHHHHHHHHH--------HHHHHHHHHHHHHHHHhc---------CC------------------ Confidence 32 23333333222211 122334443333333222 12 Q ss_pred cccchhhhhhhhhcceeeEEEc--CcEEEEEecccccccccccccCccc---cccC----------C---Cceeeec--- Q lcl|NC_015266. 72 RIKRQAMFRKLRTARYLRIDVD--DTGLAIGFDDRLSRIVRVHQEGQKA---PVEP----------G---GPLAQYP--- 130 (155) Q Consensus 72 ~~~~~~l~~~~~l~~sl~~~~~--~~~~~v~~~G~~~~yAaiHqfG~~~---~~~~----------~---~~~v~iP--- 130 (155) .++|.|.+||+.... ++...++..+++..||.+++||... .++. + .++|+.| T Consensus 46 -------vdtG~Lr~SI~~~~~~~~~~~~~~~v~~~a~YA~~Ve~GT~ph~I~pk~~k~L~~~~~G~~~~~k~V~hpG~~ 118 (140) T protein:vir:10 46 -------VRTGNLGRTIGELPQVYTPFRVRGGVEATADYAAPVHEGSRPHAIRARNAQYLHFWWHGREMFRKSVWHPGTR 118 (140) T ss_pred -------ccchhhhccceeeeeeCCCceEEEEecCCccchhhhccCCCCceeecCCCccceeecCCCEEEeeeeecCCCC Confidence 245778889987543 3344455568999999999999743 2221 1 2456755 Q ss_pred CccccCCCHHH----HHHHHHH Q lcl|NC_015266. 131 VRVVLGFSSAD----RELVRDR 148 (155) Q Consensus 131 aRp~LG~s~~d----~~~I~~~ 148 (155) |+|||-=.-+. +..|..+ T Consensus 119 a~Pfl~~A~~~~~~~~~~i~~~ 140 (140) T protein:vir:10 119 ARPFMRNSAQRVVTNDPRVRMT 140 (140) T ss_pred CChhHHHHHHHHhhhhhhccCC Confidence 99998433221 2333333 No 60 >protein:vir:97982 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1482 # MgeName: Orion # Cross-refs: genbank:acc:YP_655121;genbank:gi:109391871;genbank:GeneID:4157345 Probab=96.77 E-value=1e-05 Score=48.00 Aligned_cols=106 Identities=20% Similarity=0.124 Sum_probs=53.0 Q ss_pred Cc---------hhHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccC Q lcl|NC_015266. 
1 MD---------DDLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVG 71 (155) Q Consensus 1 m~---------~~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~ 71 (155) |. -|...+...+...+ ++.++.++..+....+.+ .| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~v~~~ak~~---------aP------------------ 45 (140) T protein:vir:97 1 MATIRARARIEIDEAALERESGEHL--------RAFHRSLTRRIANQSRVA---------VP------------------ 45 (140) T ss_pred CeeeeeeeeeeeCHHHHHHHHHHHH--------HHHHHHHHHHHHHHHHhc---------CC------------------ Confidence 32 23333333222211 122334443333333222 12 Q ss_pred cccchhhhhhhhhcceeeEEEc--CcEEEEEecccccccccccccCccc---cccC----------C---Cceeeec--- Q lcl|NC_015266. 72 RIKRQAMFRKLRTARYLRIDVD--DTGLAIGFDDRLSRIVRVHQEGQKA---PVEP----------G---GPLAQYP--- 130 (155) Q Consensus 72 ~~~~~~l~~~~~l~~sl~~~~~--~~~~~v~~~G~~~~yAaiHqfG~~~---~~~~----------~---~~~v~iP--- 130 (155) .++|.|.+||+.... ++...++..+++..||.+++||... .++. + .++|+.| T Consensus 46 -------vdtG~Lr~SI~~~~~~~~~~~~~~~v~~~a~YA~~Ve~GT~ph~I~pk~~k~L~~~~~G~~~~~k~V~hpG~~ 118 (140) T protein:vir:97 46 -------VRTGNLGRTIGELPQVYTPFRVRGGVEATADYAAPVHEGSRPHAIRARNAQYLHFWWHGREMFRKSVWHPGTR 118 (140) T ss_pred -------ccchhhhccceeeeeeCCCceEEEEecCCccchhhhccCCCCceeecCCCccceeecCCCEEEeeeeecCCCC Confidence 245778889987543 3344455568999999999999743 2221 1 2456755 Q ss_pred CccccCCCHHH----HHHHHHH Q lcl|NC_015266. 131 VRVVLGFSSAD----RELVRDR 148 (155) Q Consensus 131 aRp~LG~s~~d----~~~I~~~ 148 (155) |+|||-=.-+. 
+..|..+ T Consensus 119 a~Pfl~~A~~~~~~~~~~i~~~ 140 (140) T protein:vir:97 119 ARPFMRNSAQRVVTNDPRVRMT 140 (140) T ss_pred CChhHHHHHHHHhhhhhhccCC Confidence 99998433221 2333333 No 61 >protein:vir:94538 Length: 125 # NCBI annotation: putative head to tail joining # Family: family:all:180 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223893;genbank:gi:62327105;genbank:GeneID:5075554 Probab=96.68 E-value=1.9e-05 Score=46.44 Aligned_cols=109 Identities=9% Similarity=-0.027 Sum_probs=56.1 Q ss_pred Cchh-------HHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcc Q lcl|NC_015266. 1 MDDD-------LRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRI 73 (155) Q Consensus 1 m~~~-------~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~ 73 (155) |.++ |.+|...|..+...+.... .+.+.+-++.+....+ +.+| T Consensus 1 Ma~~~~i~~~Gld~l~~~L~~~~~~~~~~v-~~al~~~a~~i~~~ak-----~~ap------------------------ 50 (125) T protein:vir:94 1 MANDFNIKFKGVDKLLDEFDISRKELVPYS-VEAMKTSLSRAVEKSK-----GLAR------------------------ 50 (125) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHH-HHHHHHHHHHHHHHHH-----hhCC------------------------ Confidence 6664 3444444443333221111 1112222222222211 1122 Q ss_pred cchhhhhhhhhcceeeE---EEcCcEEEEEecccccccccccccCccccccCCCceeeecCccccCCCH-HHHHHHHHHH Q lcl|NC_015266. 74 KRQAMFRKLRTARYLRI---DVDDTGLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSS-ADRELVRDRL 149 (155) Q Consensus 74 ~~~~l~~~~~l~~sl~~---~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~-~d~~~I~~~i 149 (155) .++|.|.+||.. ....+++.+. .|++..||....||.. 
.+|++|||.=+- ....++...| T Consensus 51 -----~~tG~L~~sI~~~~~~~~~~~~~~~-v~~~~~Ya~~vEfGT~----------~~~a~Pfl~pa~~~~~~~~~~~l 114 (125) T protein:vir:94 51 -----VDTGYMRNNIQQDEVKEEHGVVTGR-YVARADYSSYNEYGTY----------RMSAQPFMAPSVAAMTPFFYKAV 114 (125) T ss_pred -----CCChhhhhhceecceeccCCcEEEE-eeCCCCccceeecccc----------cCCCCcccchhHHHHHHHHHHHH Confidence 123445555542 2334444432 3889999999999954 489999998773 3456677777 Q ss_pred HHHhcC Q lcl|NC_015266. 150 LRYLNR 155 (155) Q Consensus 150 ~~~l~r 155 (155) .+.|.+ T Consensus 115 ~~~l~~ 120 (125) T protein:vir:94 115 RDALNK 120 (125) T ss_pred HHHHHH Confidence 777777 No 62 >protein:vir:99744 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004311;genbank:gi:122891765;genbank:GeneID:4712299 Probab=96.68 E-value=7.1e-05 Score=43.33 Aligned_cols=112 Identities=11% Similarity=0.063 Sum_probs=63.3 Q ss_pred Cchh-HHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhhh Q lcl|NC_015266. 1 MDDD-LRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAMF 79 (155) Q Consensus 1 m~~~-~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l~ 79 (155) |+=+ |++|...|..+-.... ..-.+.+++-|..+....+..-.. -.+.| . T Consensus 1 i~i~Gld~L~~~l~~~~~~~~-~~v~~av~~~~~~i~~~a~~~a~~---~~~~p-------------------------~ 51 (115) T protein:vir:99 1 MNIDGLDALLNQFHDMKTNID-DDVDDILQENAKEYVVRAKLKARE---VMNKG-------------------------Y 51 (115) T ss_pred CcchhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhcc---ccCCC-------------------------C Confidence 4432 5555555554433322 112345556666665554432110 01111 1 Q ss_pred hhhhhcceeeEEEcCc-EEEEEecccccccccccccCccccccCCCceeeecCccccCCCHH-HHHHHHHHHHHHhc Q lcl|NC_015266. 
80 RKLRTARYLRIDVDDT-GLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSSA-DRELVRDRLLRYLN 154 (155) Q Consensus 80 ~~~~l~~sl~~~~~~~-~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~~-d~~~I~~~i~~~l~ 154 (155) ++|.|.+||.+..+++ .+.| |++..||....||.. .+||||||.=.-+ ....+.+.|.+-+- T Consensus 52 ~TG~Lr~SI~~~~~g~~~~~V---~~~~~Ya~~vE~GT~----------~m~a~PFl~PA~~~~k~~~~~~l~~~~k 115 (115) T protein:vir:99 52 WTGNLSRNIRYKKTVDLQYTI---TSHAAYSGFLEFGTR----------YMEAEPFMWPVYEVIRKSTVEELKTLFE 115 (115) T ss_pred cchhhhhceeeeecCcEEEEe---cCCcccccccccccc----------ccCCCCcchhhHHHHHHHHHHHHHHHhC Confidence 3567788888887664 3455 888999999999964 4899999986633 33444444444444 No 63 >protein:vir:80037 Length: 199 # NCBI annotation: gp11 # Family: family:all:503 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468715;genbank:gi:157325295;genbank:GeneID:5601728 Probab=96.66 E-value=3.7e-06 Score=50.36 Aligned_cols=83 Identities=17% Similarity=0.143 Sum_probs=47.0 Q ss_pred hccccccCcccchhhhhhhhhcceeeEEEcCcEEEEEecccc----cccccccccCccccccCC---------------- Q lcl|NC_015266. 64 KGLRTKVGRIKRQAMFRKLRTARYLRIDVDDTGLAIGFDDRL----SRIVRVHQEGQKAPVEPG---------------- 123 (155) Q Consensus 64 ~~~~~~~~~~~~~~l~~~~~l~~sl~~~~~~~~~~v~~~G~~----~~yAaiHqfG~~~~~~~~---------------- 123 (155) =+....+ ..-.-+.+.+ +. .+.-.+.|||.+.+ ..+|++|-||.++.++++ T Consensus 1 m~vt~~~--~~~~~~~~~l---~~----L~~k~v~vGi~~~d~~~~~~Ia~~~E~Ga~I~~~~~~l~Ip~~~a~~~k~~~ 71 (199) T protein:vir:80 1 MKVTTDK--STMNKAIREL---DQ----LDRYSLQIGLFGEDDSFIQMIAGVHEFGLTIRPKGKYLTIPTPEAGDRRARD 71 (199) T ss_pred CcccccH--HHHHHHHHHH---HH----hcCCEEEEEEecCCCcchhheeehhhcCCeeecCCceeeecchhhhcccccc Confidence 0000000 0001111222 12 35678999998765 689999999988765432 Q ss_pred ----------------------------CceeeecCccccCCCH-HHHHHHHHHHHHHhcC Q lcl|NC_015266. 
124 ----------------------------GPLAQYPVRVVLGFSS-ADRELVRDRLLRYLNR 155 (155) Q Consensus 124 ----------------------------~~~v~iPaRp~LG~s~-~d~~~I~~~i~~~l~r 155 (155) .+.++||+||||--+- +..+++.+.+...+.+ T Consensus 72 ~~~~~~p~g~~~~~~~~~~~~~~~~e~g~~~~~IP~RPFlr~t~~~~~~~~~~~~~~~~~~ 132 (199) T protein:vir:80 72 IPGLFKPKGKNILAVAGPDGKLTVMFYLKTEVNIPERSFLRSTFDEKSNKWGELFEGWIDD 132 (199) T ss_pred cCcccccCCcceeeeeccccceeeeeeccccccCCCCchhHHHHHHHHHHHHHHHHHHHHH Confidence 1235899999995552 3455566655555554 No 64 >protein:vir:1891 Length: 179 # NCBI annotation: gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037671;genbank:gi:9634129;genbank:GeneID:1262520 Probab=96.59 E-value=1.8e-05 Score=46.65 Aligned_cols=144 Identities=15% Similarity=0.051 Sum_probs=68.2 Q ss_pred Cch-------hHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCC-------cCcccchhhhhhcc Q lcl|NC_015266. 1 MDD-------DLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGS-------AYVPRKIKKGGKGL 66 (155) Q Consensus 1 m~~-------~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~-------pW~p~k~~~~~~~~ 66 (155) |.| -|++|...|+.|-..+...--+.-|++-|+.+....+.+.-.-..|.-+ .|...+........ T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~~~~~~~~~g~ 80 (179) T protein:vir:18 1 MADSVEVSLTGLESLLGKMEAVSEVTRNKAGRFALRKAANIIRDRARSNASRVDDPLTKEAIHKNIVASFSSKQFRRTGD 80 (179) T ss_pred CCceEEEEeecHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccccchhhhhhheeecccccccccccc Confidence 776 3678888888776655333334566677788887777765433333211 11111110000000 Q ss_pred -ccccCcccch---hhhhhhhhcceeeEEEcCcEEEEEecccccccccccccCccccccCCCceeeecCccccCCCHH-H Q lcl|NC_015266. 67 -RTKVGRIKRQ---AMFRKLRTARYLRIDVDDTGLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSSA-D 141 (155) Q Consensus 67 -~~~~~~~~~~---~l~~~~~l~~sl~~~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~~-d 141 (155) ..+.+..... .............+.....+ ..+-.+.+..|++..-||.. .+||+|||.=.-+ . 
T Consensus 81 ~~~~vgv~~~~~~~~~~~~~~~~~~~~~~~~~~g-~~~~~~~~~~y~~fvEfGT~----------kmpa~PFlrPA~~~~ 149 (179) T protein:vir:18 81 LAFRVGVMGGARQYANTKANVRKGRAGKTYKTSG-DKGNPGGDTWYWRFLEFGTE----------HTSARPILRPAMNGV 149 (179) T ss_pred eeEeeecccccccccccccccccCcccccccccc-cccCCCCccceeEEeccCCC----------CCCCCccchhhHHhh Confidence 0000000000 00000000000000000001 11112456789999999943 5899999988744 6 Q ss_pred HHHHHHHHHHHhcC Q lcl|NC_015266. 142 RELVRDRLLRYLNR 155 (155) Q Consensus 142 ~~~I~~~i~~~l~r 155 (155) .+++++.|.+.|.+ T Consensus 150 ~~~a~~~i~~~l~~ 163 (179) T protein:vir:18 150 DNDVINVFSTEMGK 163 (179) T ss_pred HHHHHHHHHHHHHH Confidence 66777777766666 No 65 >protein:vir:100075 Length: 140 # NCBI annotation: gp9 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945039;genbank:gi:38707899;genbank:GeneID:2744122 Probab=96.58 E-value=3.4e-05 Score=45.07 Aligned_cols=116 Identities=16% Similarity=0.060 Sum_probs=63.5 Q ss_pred Cch----hHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccch Q lcl|NC_015266. 1 MDD----DLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQ 76 (155) Q Consensus 1 m~~----~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~ 76 (155) |.+ -|++|...|..|........-++.+...|+.+....+.+. |..+-|-. .... ....+.+ T Consensus 1 Ma~~~i~Gld~l~~~l~~L~~~~~~k~~~~al~~~a~~v~~~ak~~a-----P~~tG~l~--~sI~-----~~~~~~~-- 66 (140) T protein:vir:10 1 MSSIQIIGLADLRADFEKLAKSQSTKALRRATVAGAKVIRDEARKRA-----PKKTGKLR--RNIV-----SAALRQK-- 66 (140) T ss_pred CceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhC-----CCChhhHH--Hhcc-----ccccccc-- Confidence 764 3667777777665544322234567777777777766543 43221110 0000 0000000 Q ss_pred hhhhhhhhcceeeEEEcCcEEEEEe---------cccccccccccccCccccccCCCceeeecCccccCCCH-HHHHHHH Q lcl|NC_015266. 
77 AMFRKLRTARYLRIDVDDTGLAIGF---------DDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSS-ADRELVR 146 (155) Q Consensus 77 ~l~~~~~l~~sl~~~~~~~~~~v~~---------~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~-~d~~~I~ 146 (155) .....+.|++ .+++..|+....||.. .+||+|||.=+- ..+.++. T Consensus 67 ---------------~~~~~~~~g~~~~~~~~~~~~~~~~y~~f~E~GT~----------~~~a~PFl~pA~~~~~~~~~ 121 (140) T protein:vir:10 67 ---------------DAPGLATAGVRVRTKGKADSPNNAFYWRFDEFGTQ----------HMKAQPFMRPAFDASIGEAE 121 (140) T ss_pred ---------------cccceEEeeeeeccccccCCCCccceeeeeccCCC----------CCCCCcchhhhHHHHHHHHH Confidence 0011111111 2566789999999954 489999998874 4567777 Q ss_pred HHHHHHhcC Q lcl|NC_015266. 147 DRLLRYLNR 155 (155) Q Consensus 147 ~~i~~~l~r 155 (155) +++.+.+.+ T Consensus 122 ~~~~~~~~~ 130 (140) T protein:vir:10 122 GAIRTELAR 130 (140) T ss_pred HHHHHHHHH Confidence 777777766 No 66 >protein:vir:98409 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:YP_001210363;genbank:gi:146334932;genbank:GeneID:5114801 Probab=96.57 E-value=5.7e-05 Score=43.83 Aligned_cols=106 Identities=13% Similarity=0.100 Sum_probs=66.1 Q ss_pred Cch-hHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhhh Q lcl|NC_015266. 1 MDD-DLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAMF 79 (155) Q Consensus 1 m~~-~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l~ 79 (155) |+= -|++|.+.|..+.. .......+++.|..+....+.+ +| . T Consensus 1 i~i~Gld~l~~~l~~~~~---~~~~~~al~~~a~~i~~~ak~~-----ap-----------------------------v 43 (108) T protein:vir:98 1 MKITGIDALQKKLRKNAT---LNDVKHVVKRNTVSMNKNMQNL-----AP-----------------------------V 43 (108) T ss_pred CcchhHHHHHHHHHHhhh---HHHHHHHHHHHHHHHHHHHHHh-----CC-----------------------------C Confidence 432 26666666654321 2333456677777666655432 22 1 Q ss_pred hhhhhcceeeEEEcCcEEEEEecccccccccccccCccccccCCCceeeecCccccCCCHH-HHHHHHHHHHHHhc Q lcl|NC_015266. 
80 RKLRTARYLRIDVDDTGLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSSA-DRELVRDRLLRYLN 154 (155) Q Consensus 80 ~~~~l~~sl~~~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~~-d~~~I~~~i~~~l~ 154 (155) ++|.|.+||.+..+.+++.+. .|++..||..-.||.. .+||+|||.=.-+ ....+.+.|.+.|- T Consensus 44 dTG~Lr~si~~~~~~~~~~~~-V~~~~~Ya~~vE~GT~----------~m~aqPFl~pa~~~~~~~~~~~i~~~lr 108 (108) T protein:vir:98 44 DTGNMKRSITSEFTDGGLTGT-TIPHTDYAGYVEYGTR----------FQAAQPFVKPAFDVQKKIFTNDLERLTK 108 (108) T ss_pred CchhhHhhceeeeecCceEEE-eecCCCccceeecccc----------ccCCCcchhhHHHHHHHHHHHHHHHHcC Confidence 345677888888777765532 2889999999999965 4899999977644 34455555555555 No 67 >protein:vir:9708 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795470;genbank:gi:28876221;genbank:GeneID:1257765 Probab=96.56 E-value=6.5e-05 Score=43.54 Aligned_cols=114 Identities=11% Similarity=0.055 Sum_probs=74.4 Q ss_pred CchhHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhhhh Q lcl|NC_015266. 1 MDDDLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAMFR 80 (155) Q Consensus 1 m~~~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l~~ 80 (155) |.+-|++|...|+.|..... ...++.+++-|+.+....+++. |-...... +. T Consensus 1 mv~Gl~el~~~l~~l~~~~~-~~~~~al~~ga~~~~~~~k~~a-----p~~~~~~~-------------------~h--- 52 (125) T protein:vir:97 1 MTKGLDEILANLTKLEVKAP-KTAKAAVTEVAKEFEKALKANT-----PVYEVETD-------------------ER--- 52 (125) T ss_pred CchhHHHHHHHHHHhhHHHH-HHHHHHHHHHHHHHHHHHHHhC-----CcCCCCch-------------------hh--- Confidence 99999999999998876543 2234567777777777776653 32211000 01 Q ss_pred hhhhcceeeE------EEcCcEEEEEecccccccccccccCccccccCCCceeeecCccccCCCH-HHHHHHHHHHHHHh Q lcl|NC_015266. 
81 KLRTARYLRI------DVDDTGLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSS-ADRELVRDRLLRYL 153 (155) Q Consensus 81 ~~~l~~sl~~------~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~-~d~~~I~~~i~~~l 153 (155) +.++|.+ ....-.+.|||.-....|+....||.. .+|++||+.=+- +...++++++.+-| T Consensus 53 ---l~d~I~~~~~k~~~~g~~~~~VG~~k~~~~y~~f~E~GT~----------k~~~~pF~~pa~~~~k~~~~~~~~~~~ 119 (125) T protein:vir:97 53 ---LQEDTVISGFKGANVGIVSKEIGYGKATGWRAHYPNDGTI----------YQRGQDFKERTINQMTPKAKQLYAEKV 119 (125) T ss_pred ---HHhhhhcccccccccCceEEEEeecCCCceeEeeeccCcc----------CCCcCccchHhHHHhHHHHHHHHHHHH Confidence 1222221 112225678875557789999999953 589999987663 45678888888888 Q ss_pred cC Q lcl|NC_015266. 154 NR 155 (155) Q Consensus 154 ~r 155 (155) .+ T Consensus 120 ~~ 121 (125) T protein:vir:97 120 KE 121 (125) T ss_pred HH Confidence 88 No 68 >protein:vir:743 Length: 108 # NCBI annotation: unknown # Family: family:all:180 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108720;genbank:gi:13487842;genbank:GeneID:920877 Probab=96.53 E-value=6.9e-05 Score=43.39 Aligned_cols=106 Identities=13% Similarity=0.118 Sum_probs=65.8 Q ss_pred Cch-hHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhhh Q lcl|NC_015266. 1 MDD-DLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAMF 79 (155) Q Consensus 1 m~~-~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l~ 79 (155) |+= -|++|.+.|..... .....+.+.+.|..+....+.+ .| . T Consensus 1 i~i~Gld~l~~~l~~~~~---~~~~~~al~~~a~~i~~~ak~~-----aP-----------------------------v 43 (108) T protein:vir:74 1 MKITGIDALQKKLRKNAT---LDDVKHVVKSNTASMNKNMQNL-----AP-----------------------------V 43 (108) T ss_pred CcchhHHHHHHHHHHhhh---HHHHHHHHHHHHHHHHHHHHHh-----CC-----------------------------C Confidence 432 25555555554321 1223345666666655544321 12 1 Q ss_pred hhhhhcceeeEEEcCcEEEEEecccccccccccccCccccccCCCceeeecCccccCCC-HHHHHHHHHHHHHHhc Q lcl|NC_015266. 
80 RKLRTARYLRIDVDDTGLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFS-SADRELVRDRLLRYLN 154 (155) Q Consensus 80 ~~~~l~~sl~~~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s-~~d~~~I~~~i~~~l~ 154 (155) ++|.|.+||++..+.++..+.. |++..||..-.||.. .+||+|||.=. +.....+.+.|.+.|- T Consensus 44 ~TG~Lr~si~~~~~~~~~~~~V-~~~~~Ya~~vE~GT~----------km~aqpf~~pa~~~~~~~~~~~i~~~~k 108 (108) T protein:vir:74 44 DTGNMKRSITSEFTDGGLSGTT-GPHTDYAGYVEYGTR----------FQSAQPFVKPAFNIQKKVFTNDLERLTK 108 (108) T ss_pred CchhhhccceeeeecCceEEEe-ecCCCcccceecccc----------ccCCCcchhhHHHHHHHHHHHHHHHHcC Confidence 3466778888887777644322 888999999999964 48999998877 4556667777766666 No 69 >protein:vir:93617 Length: 148 # NCBI annotation: putative structural component # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449299;genbank:gi:157166047;interpro:IPR010064;interpro:IPR011693;uniprot:Q6H9U2;genbank:GeneID:5580439 Probab=96.47 E-value=3.6e-05 Score=44.94 Aligned_cols=132 Identities=14% Similarity=0.073 Sum_probs=60.2 Q ss_pred Cch------hHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCccc Q lcl|NC_015266. 1 MDD------DLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIK 74 (155) Q Consensus 1 m~~------~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~ 74 (155) |.+ -|++|...|+.|-..+...--+..++.-|+.+....+.+ +|.-+.. ++............|... T Consensus 1 mm~~~~~i~Gldel~~~l~~L~~~~~~~~~~~Al~~~a~~v~~~ak~~-----aP~~~g~--l~~~i~~~~~~~~~g~~~ 73 (148) T protein:vir:93 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSR-----APVRRGK--LRRNVVVLSRRSRDGGME 73 (148) T ss_pred CcceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhh-----CCCCcch--hhhhceeccccccCCcee Confidence 333 245666666655433321112345666666677666665 3421110 000000000000111100 Q ss_pred chhhhhhhhhcceeeEEEcCcEEEE-EecccccccccccccCccccccCCCceeeecCccccCCCH-HHHHHHHHHHHHH Q lcl|NC_015266. 
75 RQAMFRKLRTARYLRIDVDDTGLAI-GFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSS-ADRELVRDRLLRY 152 (155) Q Consensus 75 ~~~l~~~~~l~~sl~~~~~~~~~~v-~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~-~d~~~I~~~i~~~ 152 (155) ..+. ..............| ...+.+..|+...-||.. .+||+|||.=+- +...++++.+.+. T Consensus 74 ~~v~------~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~----------~~pa~PFl~pA~~~~k~~~~~~~~~~ 137 (148) T protein:vir:93 74 SGVH------IRGVNPDTGNSDNTMKADNPRNAFYWRFVEMGTV----------NMPPHPFVRPAFDVRSEQAAQVAIAR 137 (148) T ss_pred eeee------ecccccccccccceeecCCCCCcceeeeeccCCC----------CCCCCcchhHHHHHhHHHHHHHHHHH Confidence 0000 000111111112222 234566789999999943 589999998774 3456666666666 Q ss_pred hcC Q lcl|NC_015266. 153 LNR 155 (155) Q Consensus 153 l~r 155 (155) |.+ T Consensus 138 ~~~ 140 (148) T protein:vir:93 138 MNR 140 (148) T ss_pred HHH Confidence 666 No 70 >protein:vir:100243 Length: 140 # NCBI annotation: gp72 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355408;genbank:gi:77864698;genbank:GeneID:3725965 Probab=96.28 E-value=8.2e-05 Score=43.00 Aligned_cols=125 Identities=14% Similarity=0.030 Sum_probs=63.1 Q ss_pred Cch----hHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccch Q lcl|NC_015266. 1 MDD----DLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQ 76 (155) Q Consensus 1 m~~----~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~ 76 (155) |.+ -|++|...|+.|-.......-++.++.-|..+....+.+. |-.+ -..+..-....++.+. 
T Consensus 1 Ma~~~i~Gld~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~a-----p~~t-------G~l~~sI~~~~~~~~~- 67 (140) T protein:vir:10 1 MSSVQILGLADLQADFLKLAKAQSTKALRRATVAGANVIRDEARARA-----PKKT-------GKLKRNIVTAALKQKD- 67 (140) T ss_pred CceeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-----CCCh-------hhHHHhceeccccccc- Confidence 764 4566666666655443222224566777777777766653 3211 0000000000000000 Q ss_pred hhhhhhhhcceeeEEEcCcEEEEEecccccccccccccCccccccCCCceeeecCccccCCCH-HHHHHHHHHHHHHhcC Q lcl|NC_015266. 77 AMFRKLRTARYLRIDVDDTGLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSS-ADRELVRDRLLRYLNR 155 (155) Q Consensus 77 ~l~~~~~l~~sl~~~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~-~d~~~I~~~i~~~l~r 155 (155) ....+.+....+........++..|+....||.. .+||+|||.-.- +.+++|++.+.+-|.+ T Consensus 68 -------~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~----------~~~a~PFl~pA~~~~~~~~~~~~~~~~~~ 130 (140) T protein:vir:10 68 -------SPGIATAGVRVRTKGKADSPNNAFYWRFVELGTQ----------FMKAEPFMRPAFDASIAQAEGAIRTEIAR 130 (140) T ss_pred -------ccceeEEeeccccccccCCCCcccccceeccCcC----------CCCCCcchhhhHHHHHHHHHHHHHHHHHH Confidence 0111222111111111112356789999999954 489999999884 4557777777777766 No 71 >protein:vir:1273 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690765;genbank:gi:22855005;genbank:GeneID:955232 Probab=96.21 E-value=0.00016 Score=41.35 Aligned_cols=113 Identities=12% Similarity=0.126 Sum_probs=68.3 Q ss_pred Cch----hHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccch Q lcl|NC_015266. 1 MDD----DLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQ 76 (155) Q Consensus 1 m~~----~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~ 76 (155) |.+ -|++|.+.|..|-.... 
...+..+..-|..+....+.+ .|-+..+ T Consensus 1 M~~~~i~Gl~el~~~l~~l~~~~~-~~~~~al~~~a~~v~~~~k~~-----ap~~~~~---------------------- 52 (127) T protein:vir:12 1 MADMSFDGIDDLTQYFEKIGGDIE-KVEPVALKAGGEIIAERQRSH-----VNRSDKK---------------------- 52 (127) T ss_pred CeeeeehhHHHHHHHHHHhhHHHH-HHHHHHHHHHHHHHHHHHHHh-----CCCCCCC---------------------- Confidence 665 35666666666554432 233456666677766666554 2311100 Q ss_pred hhhhhhhhcceeeE------EEcCcEEEEEecccccccccccccCccccccCCCceeeecCccccCCC-HHHHHHHHHHH Q lcl|NC_015266. 77 AMFRKLRTARYLRI------DVDDTGLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFS-SADRELVRDRL 149 (155) Q Consensus 77 ~l~~~~~l~~sl~~------~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s-~~d~~~I~~~i 149 (155) ++.+.++|.. .-+.-.+.||+..+...|+....||.. .+||+|||.=+ ++...++++++ T Consensus 53 ----tg~l~~~I~~~~~k~~~~g~~~v~Vg~~~~~~~y~~f~E~GT~----------~~~a~Pf~~pa~~~~~~~~~~~~ 118 (127) T protein:vir:12 53 ----QPHMQDNITVSNVRESKDGVRFVAVGPNKKVAYRGRFLEWGTS----------KMPPQPFIEKGGKEGEGPAVELM 118 (127) T ss_pred ----hhHHHHhhhccccccccCceeEEEEeeCCCCcceeeeeccCcc----------CCCCCccchHhHHHHHHHHHHHH Confidence 1223333321 112235668776777899999999954 48999999877 34667788888 Q ss_pred HHHhcC Q lcl|NC_015266. 150 LRYLNR 155 (155) Q Consensus 150 ~~~l~r 155 (155) .+-|.+ T Consensus 119 ~~~~~~ 124 (127) T protein:vir:12 119 ERILTA 124 (127) T ss_pred HHHHHH Confidence 887777 No 72 >protein:vir:105089 Length: 133 # NCBI annotation: Gp11 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006591;genbank:gi:46402097;genbank:GeneID:2777955 Probab=96.16 E-value=9.4e-05 Score=42.67 Aligned_cols=121 Identities=14% Similarity=0.032 Sum_probs=57.4 Q ss_pred Cch----hHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccch Q lcl|NC_015266. 
1 MDD----DLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQ 76 (155) Q Consensus 1 m~~----~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~ 76 (155) |-+ -|++|...|+.|-....-.--+..++.-|+.+....+.+ .|-.+.=.+ . T Consensus 1 M~~~~i~Gl~el~~~l~~L~~~~~~k~~~~Al~~~a~~i~~~ak~~-----ap~~~~~~~-------------------~ 56 (133) T protein:vir:10 1 MIRMEVKGLDELERQLTALGEKVATKVLRDAGREALKVVEEDMKQH-----AGFDETSTG-------------------Q 56 (133) T ss_pred CeeEeeehHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CCCCCCcch-------------------h Confidence 332 355666666655443321111334556666666666555 232110000 0 Q ss_pred hhhhhhhhcceeeEEEcCcEEEEEeccccc--ccccccccCccccccCCCceeeecCccccCCC-HHHHHHHHHHHHHHh Q lcl|NC_015266. 77 AMFRKLRTARYLRIDVDDTGLAIGFDDRLS--RIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFS-SADRELVRDRLLRYL 153 (155) Q Consensus 77 ~l~~~~~l~~sl~~~~~~~~~~v~~~G~~~--~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s-~~d~~~I~~~i~~~l 153 (155) .+.+......+....-....+.|...++.. .|+...-||.. .+||+|||+=+ +...+++++++.+.| T Consensus 57 ~~~~~I~v~~~~~~~~~~~~~~v~vg~~~~~~~y~~f~E~GT~----------k~~a~PF~~pA~~~~~~~~~~~~~~~~ 126 (133) T protein:vir:10 57 HMRDSIKIRSSTRKAQGNAVVTLRVGPSKQHHMKVLAQEFGTV----------KQVADPFIRPALDYNVQTVLRVLTVEI 126 (133) T ss_pred hhhhcccccccccccCccceEEEEecCCCCccceEeeeccCCC----------CCCCCccchHHHHHhHHHHHHHHHHHH Confidence 011110000011111111222232222333 35666689953 58999999988 567777888888777 Q ss_pred cC Q lcl|NC_015266. 
154 NR 155 (155) Q Consensus 154 ~r 155 (155) .+ T Consensus 127 ~~ 128 (133) T protein:vir:10 127 RN 128 (133) T ss_pred HH Confidence 77 No 73 >protein:vir:80362 Length: 140 # NCBI annotation: gp10, phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111089;genbank:gi:134288660;genbank:GeneID:4960609 Probab=96.05 E-value=0.00014 Score=41.79 Aligned_cols=124 Identities=15% Similarity=0.029 Sum_probs=61.2 Q ss_pred Cch----hHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccch Q lcl|NC_015266. 1 MDD----DLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQ 76 (155) Q Consensus 1 m~~----~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~ 76 (155) |.+ -|.+|...|..|-......--++.++..|..+....+.+. |.-+.+-..+ .... ..+.+. T Consensus 1 Ma~~~i~Gld~l~~~l~~l~~~~~~k~~~~a~~~~a~~v~~~ak~~a-----P~~tG~l~~~--i~~~-----~~~~~~- 67 (140) T protein:vir:80 1 MSSIQIVGLADLLADFERLAKSQSTKALRRATVAGAKVIRDEARKRA-----PKKTGKLRRN--IVSA-----ALRQKD- 67 (140) T ss_pred CceeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-----CCCcchhhhc--eeee-----cccccc- Confidence 664 3556666666654443222224567777887777766653 4221111000 0000 000000 Q ss_pred hhhhhhhhcceeeEEEcCc-EEEEEecccccccccccccCccccccCCCceeeecCccccCCCHH-HHHHHHHHHHHHhc Q lcl|NC_015266. 77 AMFRKLRTARYLRIDVDDT-GLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSSA-DRELVRDRLLRYLN 154 (155) Q Consensus 77 ~l~~~~~l~~sl~~~~~~~-~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~~-d~~~I~~~i~~~l~ 154 (155) ...+........ ...+ ..+++..|+....||.. .+||+|||.=+-+ .+.++.+++.+.+. 
T Consensus 68 -------~~~~~~~~~~~~~~~~~-~~~~~~~y~~f~E~GT~----------~~~a~PFl~pA~~~~~~~~~~~~~~~~~ 129 (140) T protein:vir:80 68 -------APGLATAGVRVRTKGKA-DSPSNAFYWRFDEFGTQ----------HMKAQPFMRPAFDASIGEAEGAIRTELA 129 (140) T ss_pred -------ccceeeeeeeccccccc-CCCCCcceeeeeccCCC----------CCCCCcchhhhHHHHHHHHHHHHHHHHH Confidence 000000000000 0001 12556789999999954 4899999988844 45677777777766 Q ss_pred C Q lcl|NC_015266. 155 R 155 (155) Q Consensus 155 r 155 (155) + T Consensus 130 ~ 130 (140) T protein:vir:80 130 R 130 (140) T ss_pred H Confidence 6 No 74 >protein:vir:96105 Length: 193 # NCBI annotation: hypothetical protein ORF028 # Family: family:all:503 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294445;genbank:gi:149408342;genbank:GeneID:5237224 Probab=96.02 E-value=5.3e-05 Score=44.04 Aligned_cols=80 Identities=6% Similarity=0.028 Sum_probs=58.1 Q ss_pred CchhHHHHHHHHHHHHHhcC--chhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhh Q lcl|NC_015266. 1 MDDDLRALEKWAGGLLAKLA--PAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAM 78 (155) Q Consensus 1 m~~~~~~l~~~l~~ll~~L~--~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l 78 (155) +++.-+...+.+..++..+- ..+-.++|..||..+....++-|.+- +|+|+++.|.+++. ..++| T Consensus 112 ~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~~G~~~~~~ik~~I~~~------~~ppna~~Ti~~KG-------~~~PL 178 (193) T protein:vir:96 112 WNLFSADRAAIQNRIAMRLARGQITPDQALAQIGLALEGYIARSIRTG------PWVANSASTVRRKG-------FNRPL 178 (193) T ss_pred HHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcC------CCCCCcHHHHHHhC-------CCCch Confidence 33334445555555544321 12346799999999999999999873 58899998876542 46789 Q ss_pred hhhhhhcceeeEEEc Q lcl|NC_015266. 79 FRKLRTARYLRIDVD 93 (155) Q Consensus 79 ~~~~~l~~sl~~~~~ 93 (155) .+++.|.+||+|.+. 
T Consensus 179 idTG~l~~SIty~Vv 193 (193) T protein:vir:96 179 VDTAHMLQSISSRVT 193 (193) T ss_pred hHHHHHHhhhcceeC Confidence 999999999999888 No 75 >protein:vir:5745 Length: 135 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892056;genbank:gi:33770519;interpro:IPR010064;interpro:IPR011693;uniprot:Q7Y404;genbank:GeneID:2637451 Probab=95.91 E-value=0.00019 Score=40.93 Aligned_cols=119 Identities=11% Similarity=0.040 Sum_probs=56.8 Q ss_pred Cchh-----HHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccc Q lcl|NC_015266. 1 MDDD-----LRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKR 75 (155) Q Consensus 1 m~~~-----~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~ 75 (155) |.-+ |++|...|..|-......--+..++.-++.+....+.+ .|-...+.+ |..+. T Consensus 1 M~~~~~i~Gl~el~~~l~~L~~~~~~k~~~~Al~~~a~~v~~~~k~~-----ap~~~~~~~--------------g~l~~ 61 (135) T protein:vir:57 1 MIPEIEISGLQELERRLIAVGEEVGTKILRDAGRAAMAVVEADMKQN-----AGYDNSSTN--------------AHMRD 61 (135) T ss_pred CceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CCCCCCCch--------------hhHHh Confidence 4432 55666666665544322212344556666666555433 343333321 11011 Q ss_pred hhhhhhhhhcceeeEEEcCcEEEEEeccc-cccccccc--ccCccccccCCCceeeecCccccCCC-HHHHHHHHHHHHH Q lcl|NC_015266. 76 QAMFRKLRTARYLRIDVDDTGLAIGFDDR-LSRIVRVH--QEGQKAPVEPGGPLAQYPVRVVLGFS-SADRELVRDRLLR 151 (155) Q Consensus 76 ~~l~~~~~l~~sl~~~~~~~~~~v~~~G~-~~~yAaiH--qfG~~~~~~~~~~~v~iPaRp~LG~s-~~d~~~I~~~i~~ 151 (155) .+..... +.......+.|+. |. ...|-..| .||.. 
.+||+|||.=+ ++...++++++.+ T Consensus 62 ~I~i~~~------k~~~~~~~v~v~v-g~~~~~~~~~~f~E~GT~----------~~~a~PF~~pa~~~~~~~~~~~~~~ 124 (135) T protein:vir:57 62 SIKIRSS------RGKAGSTVVVLRV-GPTRSHYMKALAQEFGTI----------KQVAKPFIRPALDYNKMQVLRILTV 124 (135) T ss_pred hcccccc------cccccceeEEEEe-cCCCCcceeEeecccCCC----------CCCCCcchhHhHHHhHHHHHHHHHH Confidence 1111100 1111122333332 33 33443345 88843 48999999988 5566767777776 Q ss_pred HhcC Q lcl|NC_015266. 152 YLNR 155 (155) Q Consensus 152 ~l~r 155 (155) -|.+ T Consensus 125 ~~~~ 128 (135) T protein:vir:57 125 EIRD 128 (135) T ss_pred HHHH Confidence 6666 No 76 >protein:vir:97327 Length: 116 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240615;genbank:gi:66396305;genbank:GeneID:5133683 Probab=95.84 E-value=9.5e-05 Score=42.65 Aligned_cols=97 Identities=13% Similarity=0.087 Sum_probs=54.3 Q ss_pred HHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhhhhhhhhcce Q lcl|NC_015266. 8 LEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAMFRKLRTARY 87 (155) Q Consensus 8 l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l~~~~~l~~s 87 (155) +++.+. +.+.+.+..+....+.. .| .++|.|.+| T Consensus 1 v~~~v~------------~~~~~~~~~i~~~ak~~-----aP-----------------------------v~TG~Lr~S 34 (116) T protein:vir:97 1 MERWVK------------RGIAKTTAKIHNTIISL-----MP-----------------------------VDTGYLRES 34 (116) T ss_pred ChHHHH------------HHHHHHHHHHHHHHHHh-----CC-----------------------------cCccccccc Confidence 112221 23444455544433221 12 145788899 Q ss_pred eeEEEcCcEEEEEecccccccccccccCccccccCC----------------Cce---eeecCccccCCCHHHHHHHHHH Q lcl|NC_015266. 88 LRIDVDDTGLAIGFDDRLSRIVRVHQEGQKAPVEPG----------------GPL---AQYPVRVVLGFSSADRELVRDR 148 (155) Q Consensus 88 l~~~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~----------------~~~---v~iPaRp~LG~s~~d~~~I~~~ 148 (155) |.++..++++.+. .|++..||...+||...-...+ +.+ ..+||+|||-=+ .++-... 
T Consensus 35 I~~~~~~~~~~~~-V~~~~~YA~yvE~GTg~~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA---~~~~~~~ 110 (116) T protein:vir:97 35 VTMDFKDGGFTGV-INIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPA---IDAGRAF 110 (116) T ss_pred ceEEeecCcEEEE-EecCCCcccccccCCcccccCCCcccccccceeeecCCceeeecCCcCCCcchHHH---HHHHHHH Confidence 9998888875532 2899999999999954422111 112 248999999443 3333344 Q ss_pred HHHHhc Q lcl|NC_015266. 149 LLRYLN 154 (155) Q Consensus 149 i~~~l~ 154 (155) |..-|+ T Consensus 111 i~k~i~ 116 (116) T protein:vir:97 111 FNKYFS 116 (116) T ss_pred HHHhhC Confidence 455555 No 77 >protein:vir:1243 Length: 116 # NCBI annotation: similar to phage Spp1 gp16.1 # Family: family:all:180 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510942;genbank:gi:17426276;genbank:GeneID:927389 Probab=95.84 E-value=9.5e-05 Score=42.65 Aligned_cols=97 Identities=13% Similarity=0.087 Sum_probs=54.3 Q ss_pred HHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhhhhhhhhcce Q lcl|NC_015266. 8 LEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAMFRKLRTARY 87 (155) Q Consensus 8 l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l~~~~~l~~s 87 (155) +++.+. +.+.+.+..+....+.. .| .++|.|.+| T Consensus 1 v~~~v~------------~~~~~~~~~i~~~ak~~-----aP-----------------------------v~TG~Lr~S 34 (116) T protein:vir:12 1 MERWVK------------RGIAKTTAKIHNTIISL-----MP-----------------------------VDTGYLRES 34 (116) T ss_pred ChHHHH------------HHHHHHHHHHHHHHHHh-----CC-----------------------------cCccccccc Confidence 112221 23444455544433221 12 145788899 Q ss_pred eeEEEcCcEEEEEecccccccccccccCccccccCC----------------Cce---eeecCccccCCCHHHHHHHHHH Q lcl|NC_015266. 88 LRIDVDDTGLAIGFDDRLSRIVRVHQEGQKAPVEPG----------------GPL---AQYPVRVVLGFSSADRELVRDR 148 (155) Q Consensus 88 l~~~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~----------------~~~---v~iPaRp~LG~s~~d~~~I~~~ 148 (155) |.++..++++.+. .|++..||...+||...-...+ +.+ ..+||+|||-=+ .++-... 
T Consensus 35 I~~~~~~~~~~~~-V~~~~~YA~yvE~GTg~~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA---~~~~~~~ 110 (116) T protein:vir:12 35 VTMDFKDGGFTGV-INIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPA---IDAGRAF 110 (116) T ss_pred ceEEeecCcEEEE-EecCCCcccccccCCcccccCCCcccccccceeeecCCceeeecCCcCCCcchHHH---HHHHHHH Confidence 9998888875532 2899999999999954422111 112 248999999443 3333344 Q ss_pred HHHHhc Q lcl|NC_015266. 149 LLRYLN 154 (155) Q Consensus 149 i~~~l~ 154 (155) |..-|+ T Consensus 111 i~k~i~ 116 (116) T protein:vir:12 111 FNKYFS 116 (116) T ss_pred HHHhhC Confidence 455555 No 78 >protein:vir:99546 Length: 200 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039796;genbank:gi:126011046;genbank:GeneID:4818241 Probab=95.83 E-value=6.2e-05 Score=43.64 Aligned_cols=80 Identities=6% Similarity=0.039 Sum_probs=57.5 Q ss_pred CchhHHHHHHHHHHHHHhcC--chhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhh Q lcl|NC_015266. 1 MDDDLRALEKWAGGLLAKLA--PAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAM 78 (155) Q Consensus 1 m~~~~~~l~~~l~~ll~~L~--~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l 78 (155) +++.-.+..+.+...+..+- ..+-.++|..||..+....+..|... +|+|+++.|.+++ | ..+.| T Consensus 119 ~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~~G~~~~~~ik~~I~~~------~~ppna~sTi~~K-----g--~~~PL 185 (200) T protein:vir:99 119 WATFNKDKVKIQAQIARQLLDGTINPEQALAQIGLALEGCIVRSIKSG------PWAANSPATIRAK-----G--FDKPL 185 (200) T ss_pred HHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcC------CCCCChHHHHHHh-----C--CCCch Confidence 33334444444444444321 12335799999999999999999863 4899999987654 2 45789 Q ss_pred hhhhhhcceeeEEEc Q lcl|NC_015266. 
79 FRKLRTARYLRIDVD 93 (155) Q Consensus 79 ~~~~~l~~sl~~~~~ 93 (155) .+++.|.+||+|.++ T Consensus 186 idTG~l~~SIty~Ve 200 (200) T protein:vir:99 186 IDTAHMWQTVSSKVS 200 (200) T ss_pred HHHHHHHhHhccccC Confidence 999999999999998 No 79 >protein:vir:105467 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529877;genbank:gi:90592617;genbank:GeneID:3974531 Probab=95.61 E-value=0.00064 Score=38.09 Aligned_cols=120 Identities=13% Similarity=0.080 Sum_probs=61.8 Q ss_pred Cch---hHHHHHHHHHHHHHhcCchh----HHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcc Q lcl|NC_015266. 1 MDD---DLRALEKWAGGLLAKLAPAA----RRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRI 73 (155) Q Consensus 1 m~~---~~~~l~~~l~~ll~~L~~~~----r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~ 73 (155) |+. |+..|++.+..|-+...... -.+.++++|..+....+++ .|= T Consensus 1 Ms~~~id~~gl~~~~~~l~~~~~~~~~~~~~~~~l~~~~~~~~~~vk~~-----tPV----------------------- 52 (144) T protein:vir:10 1 MSLGHVDDAQFQQFASRVRQKIDSGYVKQELGKSSRRIGTQSLRILEAN-----TPV----------------------- 52 (144) T ss_pred CCCCCccHHHHHHHHHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHh-----CCC----------------------- Confidence 654 55555544443332221111 1234445555544443332 331 Q ss_pred cchhhhhhhhhcceeeE---EEcCcEEEEEecccccccccccccCccccccC------CC-ceeeecCccccCCCHHH-H Q lcl|NC_015266. 74 KRQAMFRKLRTARYLRI---DVDDTGLAIGFDDRLSRIVRVHQEGQKAPVEP------GG-PLAQYPVRVVLGFSSAD-R 142 (155) Q Consensus 74 ~~~~l~~~~~l~~sl~~---~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~------~~-~~v~iPaRp~LG~s~~d-~ 142 (155) ++|.|.+|++. ..++++..|.. |++.+||..-.||-+..+.+ .+ ...-+|-++||=.+-+. 
+ T Consensus 53 ------dTG~Lr~S~~~~~~~~~~~~~~~~V-~n~~~YA~~VE~Ghr~~~G~~v~~~~~~~~~g~V~G~~~~~~a~~~~~ 125 (144) T protein:vir:10 53 ------KQGNLRRSWTAEGPTYGCGGWTIKL-INNAEYASYVESGHRQTPGRYVPVLKKRLVRDWVPGQFYMKKSIPQIQ 125 (144) T ss_pred ------CcchhccceeecceeeecCeeEEEE-ecCCCcccccccceeecCCcccccCCCccccceecCccchHHHHHHHH Confidence 24455555542 24455544322 89999999999997643321 11 11236888888666443 4 Q ss_pred HHHHHHHHHHhcC Q lcl|NC_015266. 143 ELVRDRLLRYLNR 155 (155) Q Consensus 143 ~~I~~~i~~~l~r 155 (155) ..+..+|.++|.. T Consensus 126 ~~~~~~l~k~l~~ 138 (144) T protein:vir:10 126 RQLPQLVTEGLWG 138 (144) T ss_pred HHHHHHHHHHHHH Confidence 5556666666666 No 80 >protein:vir:194 Length: 149 # NCBI annotation: Gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037704;genbank:gi:9634169;genbank:GeneID:1262536 Probab=95.59 E-value=0.0002 Score=40.92 Aligned_cols=131 Identities=15% Similarity=0.092 Sum_probs=58.3 Q ss_pred Cch---h---HHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCC--CCcCcccchhhhhhccccccCc Q lcl|NC_015266. 1 MDD---D---LRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPD--GSAYVPRKIKKGGKGLRTKVGR 72 (155) Q Consensus 1 m~~---~---~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PD--G~pW~p~k~~~~~~~~~~~~~~ 72 (155) |.+ + |++|...|+.|-..+....-+..+..-|+.+....+.+. |- |.=+...+-. .......+. T Consensus 1 mm~~~~~i~Gl~~l~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~a-----P~~~g~l~~si~~~---~~~~~~~~~ 72 (149) T protein:vir:19 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIDRA-----PVRTGKLKKNVVVV---TQKSRRRGE 72 (149) T ss_pred CcceeeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhC-----CCCchhhhhhcccc---ccccccccc Confidence 332 2 446666666554443211123455566777766666542 32 1111000000 000000000 Q ss_pred ccchhhhhhhhhcceeeEEEcCcEEE-EEecccccccccccccCccccccCCCceeeecCccccCCCH-HHHHHHHHHHH Q lcl|NC_015266. 
73 IKRQAMFRKLRTARYLRIDVDDTGLA-IGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSS-ADRELVRDRLL 150 (155) Q Consensus 73 ~~~~~l~~~~~l~~sl~~~~~~~~~~-v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~-~d~~~I~~~i~ 150 (155) ....... +............ +...+++..|+...-||.. .+||+|||.=+- ..+.++++++. T Consensus 73 ~~~~v~~------~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~----------~~~a~PF~~pA~~~~k~~~~~~~~ 136 (149) T protein:vir:19 73 ISSGVHI------RGVNPRTGNSDNTMKANNPRNAFYWRFVELGTA----------NMPAHPFVRPAYDTREEEAASVAI 136 (149) T ss_pred eeecccc------cccccccccccceeecCCCCccceeeeeccCCC----------CCCCCcchhHHHHHHHHHHHHHHH Confidence 0000000 0011111111122 2234567789999999953 489999997663 34566666666 Q ss_pred HHhcC Q lcl|NC_015266. 151 RYLNR 155 (155) Q Consensus 151 ~~l~r 155 (155) +.|.+ T Consensus 137 ~~l~~ 141 (149) T protein:vir:19 137 ARMNQ 141 (149) T ss_pred HHHHH Confidence 65555 No 81 >protein:vir:102441 Length: 137 # NCBI annotation: gp26 # Family: family:all:1084 # MgeID: mge:1618 # MgeName: Pipefish # Cross-refs: genbank:acc:YP_655303;genbank:gi:109521866;genbank:GeneID:4157756 Probab=95.55 E-value=0.00019 Score=40.96 Aligned_cols=109 Identities=16% Similarity=0.119 Sum_probs=49.8 Q ss_pred CchhHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhhhh Q lcl|NC_015266. 1 MDDDLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAMFR 80 (155) Q Consensus 1 m~~~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l~~ 80 (155) |. ...+++ .+...+++.++..|++..+ +........ . + ...-.+ T Consensus 1 ~~------------~~~~~~-~~~~~~~~~~~~v~r~~l~-~~a~~v~~~---------A---k----------~~aPv~ 44 (137) T protein:vir:10 1 MT------------VTARYE-RNPVGEARQFQVIARRRLS-RITRGTANQ---------A---R----------ADVPVK 44 (137) T ss_pred Ce------------eEEEec-cCchhHHHHHHHHHHHHHH-HHHHHHHHH---------H---H----------hcCCcc Confidence 10 001111 1223344444444433221 111110000 0 0 001124 Q ss_pred hhhhcceeeEEEcCcEEE--E-EecccccccccccccCcc---ccccC---------C-----Cceee---ecCccccCC Q lcl|NC_015266. 
81 KLRTARYLRIDVDDTGLA--I-GFDDRLSRIVRVHQEGQK---APVEP---------G-----GPLAQ---YPVRVVLGF 137 (155) Q Consensus 81 ~~~l~~sl~~~~~~~~~~--v-~~~G~~~~yAaiHqfG~~---~~~~~---------~-----~~~v~---iPaRp~LG~ 137 (155) +|.|.+||......++.. | +..|++..||.+|+||.. ++++. + +++|+ +|+|||| T Consensus 45 tG~Lr~SI~~~~~~~~~~~~~~~~V~~~~~YA~~ve~GT~ph~I~Pk~~k~~l~~~~~g~~vf~k~V~hPG~~a~PfL-- 122 (137) T protein:vir:10 45 TGNLGRSIREDPIVVAGPLRLDSGVTAHADYARYVHDGTRAHVIRPRRPGGVLRFTVGGRVVYARRVNHPGTRARPFL-- 122 (137) T ss_pred chhhhcCceeeeeeccccceEEEEecCCCccceeeecCCCCceeeccccceeeeEeeCCeeEecceeecCCCCCCchH-- Confidence 566777787654333221 1 113899999999999965 33321 1 24465 4599995 Q ss_pred CHHHHHHHHHHHHHHhcC Q lcl|NC_015266. 138 SSADRELVRDRLLRYLNR 155 (155) Q Consensus 138 s~~d~~~I~~~i~~~l~r 155 (155) ...+.+..+| T Consensus 123 --------~~A~~~~~~~ 132 (137) T protein:vir:10 123 --------RNAAERVVAR 132 (137) T ss_pred --------HHHHHHhhhh Confidence 5556666666 No 82 >protein:vir:3873 Length: 128 # NCBI annotation: putative head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680490;swissprot:trembl:p94214;genbank:gi:22296530;interpro:IPR010064;uniprot:P94214;genbank:GeneID:951688 Probab=95.44 E-value=0.00029 Score=40.00 Aligned_cols=115 Identities=13% Similarity=0.104 Sum_probs=64.6 Q ss_pred CchhHH---HHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCC-cCcccchhhhhhccccccCcccch Q lcl|NC_015266. 1 MDDDLR---ALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGS-AYVPRKIKKGGKGLRTKVGRIKRQ 76 (155) Q Consensus 1 m~~~~~---~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~-pW~p~k~~~~~~~~~~~~~~~~~~ 76 (155) |+-+++ +|...|+.|....+ ...++.++.-|+.+....+++ .|-++ .+. . 
T Consensus 1 m~v~i~Gl~el~~~l~~l~~~~~-k~~~~al~~ga~~~~~~~k~~-----ap~~~~~~~-------------~------- 54 (128) T protein:vir:38 1 MGVKVTGDAELLANLNKLQFGVA-KEARAAVRDGAQKFADKLKSN-----TPEWDGETD-------------M------- 54 (128) T ss_pred CccchhhHHHHHHHHHHhHHHHH-HHHHHHHHHHHHHHHHHHHHh-----CCCcCCCCc-------------c------- Confidence 887654 45555554443322 222344555555555554432 34221 000 0 Q ss_pred hhhhhhhhcceeeE-----EEcCcEEEEEecccccccccccccCccccccCCCceeeecCccccCCC-HHHHHHHHHHHH Q lcl|NC_015266. 77 AMFRKLRTARYLRI-----DVDDTGLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFS-SADRELVRDRLL 150 (155) Q Consensus 77 ~l~~~~~l~~sl~~-----~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s-~~d~~~I~~~i~ 150 (155) ++.+.++|.+ .-..-.+.||+...+..|+....||. +++||+|||.=. ++...++++++. T Consensus 55 ----~~h~~d~I~~~~~k~~~g~~~~~VG~~k~~~~y~~f~E~GT----------~k~~a~pF~~pa~~~~~~~~~~~~~ 120 (128) T protein:vir:38 55 ----SGHLRDDIKLSSVRETSGLTEVDVGYGKDTGWRAHFPNSGT----------SMQDPQHFIEETQEIMRPVVIAAFL 120 (128) T ss_pred ----cchhhhhhccccccccCceeEEEeeecCCCceEEeeeccCc----------cCCCCCcchhHHHHHhHHHHHHHHH Confidence 0111222211 11122367887666778999999995 358999999766 346788889999 Q ss_pred HHhcC Q lcl|NC_015266. 151 RYLNR 155 (155) Q Consensus 151 ~~l~r 155 (155) +-|.| T Consensus 121 ~~l~k 125 (128) T protein:vir:38 121 SHLKE 125 (128) T ss_pred HHHHh Confidence 98888 No 83 >protein:vir:95062 Length: 116 # NCBI annotation: ORF044 # Family: family:all:180 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240827;genbank:gi:66394711;genbank:GeneID:5133856 Probab=95.42 E-value=0.0002 Score=40.81 Aligned_cols=97 Identities=13% Similarity=0.085 Sum_probs=53.9 Q ss_pred HHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhhhhhhhhcce Q lcl|NC_015266. 8 LEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAMFRKLRTARY 87 (155) Q Consensus 8 l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l~~~~~l~~s 87 (155) +++.+. +.+.+.+..+....+. 
..| + ++|.|.+| T Consensus 1 v~~~v~------------~~~~~~~~~i~~~ak~-----~ap----v-------------------------~TG~Lr~S 34 (116) T protein:vir:95 1 MERWVK------------RGIAKTTAKIHNTIIS-----LMP----V-------------------------DTGYLRES 34 (116) T ss_pred ChHHHH------------HHHHHHHHHHHHHHHh-----hCC----c-------------------------cccccccc Confidence 111111 1244444444333322 111 2 45788899 Q ss_pred eeEEEcCcEEEEEecccccccccccccCccccccC----------------CCce---eeecCccccCCCHHHHHHHHHH Q lcl|NC_015266. 88 LRIDVDDTGLAIGFDDRLSRIVRVHQEGQKAPVEP----------------GGPL---AQYPVRVVLGFSSADRELVRDR 148 (155) Q Consensus 88 l~~~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~----------------~~~~---v~iPaRp~LG~s~~d~~~I~~~ 148 (155) |.+...++++... .|++..||...+||...-... .+.+ .-+||||||-=+ .++.... T Consensus 35 I~~~~~~~~~~~~-V~~~~~Ya~yvE~GTg~~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA---~~~~~~~ 110 (116) T protein:vir:95 35 VTMDFKDGGFTGV-INIGSEYAIYVNYGTGIYATGAGGSRAKNIPWSYKDANGKWHTTKGQHAQPFWEPA---IDAGRAF 110 (116) T ss_pred eeEEeecCcEEEE-EecCCCccceeecCccccccCCCccccccccceeecCccceeeCCCCCCCcchHHH---HHHHHHH Confidence 9998888875533 288999999999995432210 1112 248999999544 3333344 Q ss_pred HHHHhc Q lcl|NC_015266. 149 LLRYLN 154 (155) Q Consensus 149 i~~~l~ 154 (155) |..-|+ T Consensus 111 i~k~is 116 (116) T protein:vir:95 111 FNKYFS 116 (116) T ss_pred HHHhhC Confidence 555555 No 84 >protein:vir:79034 Length: 141 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110729;genbank:gi:134287346;genbank:GeneID:4955208 Probab=95.38 E-value=0.00046 Score=38.89 Aligned_cols=125 Identities=16% Similarity=0.138 Sum_probs=58.5 Q ss_pred Cch----h---HHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcc Q lcl|NC_015266. 
1 MDD----D---LRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRI 73 (155) Q Consensus 1 m~~----~---~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~ 73 (155) |++ | |+++.+.|..+...=-+..-+..++++|..+....+++ .|=++ |.. T Consensus 1 M~~~~~~d~~gl~~~~~~l~~~~~~~~~~~~~~~~~~~a~~l~~~vk~~-----tPVdT------------------G~L 57 (141) T protein:vir:79 1 MARWGSVDFREFKRVCKKMEKLTKIDLDKFCKDAARELAARLLGKVIRR-----TPVDT------------------GFL 57 (141) T ss_pred CCCCccCcHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHh-----CCCcc------------------hhh Confidence 665 3 44444444433331112233456777777776655443 34211 110 Q ss_pred cchhhhhhhhhcceeeEEEcCcEEEEEecccccccccccccCccccccCCCceeeecCccccCCCHHH-HHH----HHHH Q lcl|NC_015266. 74 KRQAMFRKLRTARYLRIDVDDTGLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSSAD-REL----VRDR 148 (155) Q Consensus 74 ~~~~l~~~~~l~~sl~~~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~~d-~~~----I~~~ 148 (155) ++. +..+....++.+...+++.+|.. +++..||..-.||-+..+. + ++ +|-+.+|=.|.+. +.. |... T Consensus 58 r~s--w~~~~~~~~~~~~~~g~~~~v~v-~n~~~YA~~VE~Ghr~~~~-~-gf--V~G~fml~~s~~~~~~~~~~~~~~~ 130 (141) T protein:vir:79 58 RQG--WNGVAYARSLPVYKQGNNYIIEV-VNPTEYASYVNFGHRTKDG-K-GW--VKGQHFLTISEMELQSQVDKIIEKK 130 (141) T ss_pred ccc--ccccccccccceeecCCeeEEEE-ecCCcchhhhhcceeecCC-c-ce--eCCchhHHHHHHHHHHHHHHHHHHH Confidence 100 01111122234444566555433 8999999999999765322 1 12 3566666454332 222 3333 Q ss_pred HHHHhcC Q lcl|NC_015266. 
149 LLRYLNR 155 (155) Q Consensus 149 i~~~l~r 155 (155) |.++|.+ T Consensus 131 l~~~l~~ 137 (141) T protein:vir:79 131 LLILLKG 137 (141) T ss_pred HHHHHHH Confidence 4444444 No 85 >protein:vir:101594 Length: 173 # NCBI annotation: hypothetical protein # Family: family:all:26502 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112510;genbank:gi:53793610;interpro:IPR010064;uniprot:Q5ZGE3;genbank:GeneID:3101702 Probab=95.37 E-value=0.00052 Score=38.61 Aligned_cols=119 Identities=9% Similarity=-0.095 Sum_probs=72.1 Q ss_pred Cchh-HHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhhh Q lcl|NC_015266. 1 MDDD-LRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAMF 79 (155) Q Consensus 1 m~~~-~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l~ 79 (155) |+=+ +++|...|..+-..+. ...++.+.+.++.+....+.+ .| + T Consensus 1 i~i~Gld~L~~~L~~l~~~~~-~~~~~a~~~~a~~i~~~ak~~-----aP----v------------------------- 45 (173) T protein:vir:10 1 MAVKGVAEVIAELRKIGKDID-KNINATTEEAANFIEDRAKTL-----AP----K------------------------- 45 (173) T ss_pred CcchhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHh-----CC----c------------------------- Confidence 5433 6666666666555442 223445667777777666554 22 1 Q ss_pred hhhhhcceeeEEEc--CcEEEEEecccccccccccccCcccc---ccC------------------------------CC Q lcl|NC_015266. 80 RKLRTARYLRIDVD--DTGLAIGFDDRLSRIVRVHQEGQKAP---VEP------------------------------GG 124 (155) Q Consensus 80 ~~~~l~~sl~~~~~--~~~~~v~~~G~~~~yAaiHqfG~~~~---~~~------------------------------~~ 124 (155) ++|.|.+||.+... .+.+.++ .+++..||....||...- +.. +. 
T Consensus 46 ~TG~Lr~sI~~~~~~~~~~~~~~-v~~~~~Ya~fvEfGT~~m~a~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 124 (173) T protein:vir:10 46 NFGKLAQSISTSDLKAKDLISKK-ITVNELYGAYMEFGTGAKVSVPKEFADMAASFKGQKTGSFKDGLESIKAWCRAKGI 124 (173) T ss_pred CchhhhhcceeeeeccCceeEEe-eCCCcccchhhhcccccccCCCchhhhhhccccccccccccccccccccccccccc Confidence 24556777766542 3334332 378899999999996421 000 00 Q ss_pred c------------eeeecCccccCCC-HHHHHHHHHHHHHHhcC Q lcl|NC_015266. 125 P------------LAQYPVRVVLGFS-SADRELVRDRLLRYLNR 155 (155) Q Consensus 125 ~------------~v~iPaRp~LG~s-~~d~~~I~~~i~~~l~r 155 (155) + ..-.||+|||==+ .+.++.+.+.|.++|.+ T Consensus 125 ~~~~~~~~~~~~~~~G~~aqPFl~PA~~~~~~~~~~~i~~~i~~ 168 (173) T protein:vir:10 125 DEKAAYPIFAKILGAGINPQPFLYPAWIEGKKQYLKDLENLLKT 168 (173) T ss_pred chhcccceeeEeecCCCCCCccchhHHHHhHHHHHHHHHHHHHH Confidence 0 0137999999555 56778888888888888 No 86 >protein:vir:1386 Length: 149 # NCBI annotation: Gp9 protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612838;genbank:gi:20065972;genbank:GeneID:935787 Probab=95.27 E-value=0.00044 Score=39.00 Aligned_cols=129 Identities=12% Similarity=0.169 Sum_probs=60.3 Q ss_pred Cch----h---HHHHHHHHHHHHH-hcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCc Q lcl|NC_015266. 1 MDD----D---LRALEKWAGGLLA-KLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGR 72 (155) Q Consensus 1 m~~----~---~~~l~~~l~~ll~-~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~ 72 (155) |+| + |++|...|+.|-. .....--+..++.-|+.+....+.+.-. +.| ++.+.... 
....+ T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~~k~~~~Al~~ga~~v~~~~k~~aP~--~~~--~~~~~~~~------~~~~~- 69 (149) T protein:vir:13 1 MSDGWEIKFEGLDDLIKTFEQLGTEKENEDVEKSILKECGDLAKKTVAPLIHI--SDD--NSKSGRKG------SRPPG- 69 (149) T ss_pred CCceeEEEeecHHHHHHHHHhcccHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--cCC--cccccccc------ccccc- Confidence 988 3 4555555554421 0000111245566666676666655322 222 22211100 00001 Q ss_pred ccchhhhhhhhhcceeeEEEcCcEEEEEec---ccccccccccccCccccccCCCceeeecCccccCCCH-HHHHHHHHH Q lcl|NC_015266. 73 IKRQAMFRKLRTARYLRIDVDDTGLAIGFD---DRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSS-ADRELVRDR 148 (155) Q Consensus 73 ~~~~~l~~~~~l~~sl~~~~~~~~~~v~~~---G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~-~d~~~I~~~ 148 (155) .|.+..... .+.-.-....+.||+. +++..|++..-||. +++||+|||.=.- +.+.++.++ T Consensus 70 ----~~~d~i~~~-~~~~~~g~~~~~VG~~~~~~~~~~y~~f~E~GT----------~k~~a~pF~~pa~~~~~~~~~~~ 134 (149) T protein:vir:13 70 ----HAANNIPEP-KIRKKKGNLQCVVGWEKSDNTPFYYMKMEEWGT----------SERPPHHAFGKTNKILKRVYDNI 134 (149) T ss_pred ----hhhhcceec-ccccccceeEEEeeccCCCCCccceeeeeccCc----------cCCCCCccchHHHHHHHHHHHHH Confidence 111111000 0111112224567663 34668999999994 3589999997653 345666666 Q ss_pred HHHHhcC Q lcl|NC_015266. 149 LLRYLNR 155 (155) Q Consensus 149 i~~~l~r 155 (155) +.+-|.+ T Consensus 135 ~~~~l~k 141 (149) T protein:vir:13 135 AQKKYDN 141 (149) T ss_pred HHHHHHH Confidence 5555444 No 87 >protein:vir:1437 Length: 140 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536366;genbank:gi:17975171;genbank:GeneID:929147 Probab=95.25 E-value=0.00044 Score=39.01 Aligned_cols=111 Identities=14% Similarity=0.066 Sum_probs=61.7 Q ss_pred Cch----hHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccch Q lcl|NC_015266. 
1 MDD----DLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQ 76 (155) Q Consensus 1 m~~----~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~ 76 (155) |.+ -|.+|.+.|..|-.......-++.+...|..+....+.+. |..+ T Consensus 1 M~~~~i~Gld~l~~~l~~l~~~~~~~~~~~al~~~a~~v~~~ak~~a-----P~~t------------------------ 51 (140) T protein:vir:14 1 MSSIQIIGLADLRADFEKLAKSQSAKALRRATLAGAKVIRDEARKRA-----PKKT------------------------ 51 (140) T ss_pred CceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhC-----CCCh------------------------ Confidence 664 3566666666665444322224567777888877766542 3210 Q ss_pred hhhhhhhhcceeeE-----EEcCcEEEEEe---------cccccccccccccCccccccCCCceeeecCccccCCCHH-H Q lcl|NC_015266. 77 AMFRKLRTARYLRI-----DVDDTGLAIGF---------DDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSSA-D 141 (155) Q Consensus 77 ~l~~~~~l~~sl~~-----~~~~~~~~v~~---------~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~~-d 141 (155) +.+.+||.. ......+.|++ .+++..|+....||.. .+||+|||.=+-+ . T Consensus 52 -----G~l~~sI~~~~~~~~~~~~~~~vg~~~~~~~~~~~~~~~~y~~f~E~GT~----------~~~a~pFl~pa~~~~ 116 (140) T protein:vir:14 52 -----GKLRRNIVSAALRQKDAPGLATAGVRVRTKGKADSPNNAFYWRFDEFGTQ----------HMKAQPFMRPAFDAS 116 (140) T ss_pred -----hhHHhhcccccccccccceeEEeeeeeccccccCCCCccceeeeeccccC----------CCCCCcchhHHHHHH Confidence 001111111 00111111211 2456789999999954 5899999988754 4 Q ss_pred HHHHHHHHHHHhcC Q lcl|NC_015266. 
142 RELVRDRLLRYLNR 155 (155) Q Consensus 142 ~~~I~~~i~~~l~r 155 (155) +.++.+++.+.+.+ T Consensus 117 ~~~~~~~~~~~~~~ 130 (140) T protein:vir:14 117 IGEAEGAIRTELAR 130 (140) T ss_pred HHHHHHHHHHHHHH Confidence 56777777777666 No 88 >protein:vir:107568 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338191;genbank:gi:77020147;genbank:GeneID:3703699 Probab=95.22 E-value=0.00015 Score=41.52 Aligned_cols=129 Identities=15% Similarity=0.174 Sum_probs=61.5 Q ss_pred Cchh-------HHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcc Q lcl|NC_015266. 1 MDDD-------LRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRI 73 (155) Q Consensus 1 m~~~-------~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~ 73 (155) |.|. |++|...|..|-.... ..-++.++.-|+.+....+.+. |-++-.. +..... ..+.+ T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~~-~~~~~al~~ga~~i~~~ak~~a-----p~~~~~~--~~~~~~---~~~~~-- 67 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRGE-KIEDKALAAGGEPIRKAIAERA-----PRSPSPK--KRSKSE---PWRTG-- 67 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHH-HHHHHHHHHHHHHHHHHHHHhC-----CCccccc--cccccc---ccccc-- Confidence 7764 3445555544433211 1123344555555655555553 3211110 000000 00000 Q ss_pred cchhhhhhhhhcceeeEEEcCcEEEEEec---ccccccccccccCccccccCCCceeeecCccccCCC-HHHHHHHHHHH Q lcl|NC_015266. 74 KRQAMFRKLRTARYLRIDVDDTGLAIGFD---DRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFS-SADRELVRDRL 149 (155) Q Consensus 74 ~~~~l~~~~~l~~sl~~~~~~~~~~v~~~---G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s-~~d~~~I~~~i 149 (155) ..+.+... .......-+...+.|++. ++...|+....||.. 
.+||+|||.=+ ++..++|++.+ T Consensus 68 --~~~~~~i~-~~~~~~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT~----------~~~a~PFl~pa~~~~k~~~~~~~ 134 (146) T protein:vir:10 68 --QHGADQIK-VTKAKLEGGIKTVKIGLNKADRSPWFYLKFHEWGTS----------KMPAHPFIEPGFNASKAEAVRAM 134 (146) T ss_pred --ccccccce-eccccccccceeEEeeeccCCCCCcceeeeeccCCC----------CCCCCcchhHHHHHhHHHHHHHH Confidence 00100000 011122223344556653 345689999999954 58999999766 34567777777 Q ss_pred HHHhcC Q lcl|NC_015266. 150 LRYLNR 155 (155) Q Consensus 150 ~~~l~r 155 (155) .+.|.+ T Consensus 135 ~~~l~~ 140 (146) T protein:vir:10 135 TDILKN 140 (146) T ss_pred HHHHHH Confidence 777777 No 89 >protein:vir:105007 Length: 146 # NCBI annotation: conserved phage protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459972;genbank:gi:85701387;genbank:GeneID:3882148 Probab=95.22 E-value=0.00015 Score=41.52 Aligned_cols=129 Identities=15% Similarity=0.174 Sum_probs=61.5 Q ss_pred Cchh-------HHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcc Q lcl|NC_015266. 1 MDDD-------LRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRI 73 (155) Q Consensus 1 m~~~-------~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~ 73 (155) |.|. |++|...|..|-.... ..-++.++.-|+.+....+.+. |-++-.. +..... ..+.+ T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~~-~~~~~al~~ga~~i~~~ak~~a-----p~~~~~~--~~~~~~---~~~~~-- 67 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRGE-KIEDKALAAGGEPIRKAIAERA-----PRSPSPK--KRSKSE---PWRTG-- 67 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHH-HHHHHHHHHHHHHHHHHHHHhC-----CCccccc--cccccc---ccccc-- Confidence 7764 3445555544433211 1123344555555655555553 3211110 000000 00000 Q ss_pred cchhhhhhhhhcceeeEEEcCcEEEEEec---ccccccccccccCccccccCCCceeeecCccccCCC-HHHHHHHHHHH Q lcl|NC_015266. 
74 KRQAMFRKLRTARYLRIDVDDTGLAIGFD---DRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFS-SADRELVRDRL 149 (155) Q Consensus 74 ~~~~l~~~~~l~~sl~~~~~~~~~~v~~~---G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s-~~d~~~I~~~i 149 (155) ..+.+... .......-+...+.|++. ++...|+....||.. .+||+|||.=+ ++..++|++.+ T Consensus 68 --~~~~~~i~-~~~~~~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT~----------~~~a~PFl~pa~~~~k~~~~~~~ 134 (146) T protein:vir:10 68 --QHGADQIK-VTKAKLEGGIKTVKIGLNKADRSPWFYLKFHEWGTS----------KMPAHPFIEPGFNASKAEAVRAM 134 (146) T ss_pred --ccccccce-eccccccccceeEEeeeccCCCCCcceeeeeccCCC----------CCCCCcchhHHHHHhHHHHHHHH Confidence 00100000 011122223344556653 345689999999954 58999999766 34567777777 Q ss_pred HHHhcC Q lcl|NC_015266. 150 LRYLNR 155 (155) Q Consensus 150 ~~~l~r 155 (155) .+.|.+ T Consensus 135 ~~~l~~ 140 (146) T protein:vir:10 135 TDILKN 140 (146) T ss_pred HHHHHH Confidence 777777 No 90 >protein:vir:102875 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338140;genbank:gi:77020200;genbank:GeneID:3703784 Probab=95.22 E-value=0.00015 Score=41.52 Aligned_cols=129 Identities=15% Similarity=0.174 Sum_probs=61.5 Q ss_pred Cchh-------HHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcc Q lcl|NC_015266. 1 MDDD-------LRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRI 73 (155) Q Consensus 1 m~~~-------~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~ 73 (155) |.|. |++|...|..|-.... ..-++.++.-|+.+....+.+. |-++-.. +..... 
..+.+ T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~~-~~~~~al~~ga~~i~~~ak~~a-----p~~~~~~--~~~~~~---~~~~~-- 67 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRGE-KIEDKALAAGGEPIRKAIAERA-----PRSPSPK--KRSKSE---PWRTG-- 67 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHH-HHHHHHHHHHHHHHHHHHHHhC-----CCccccc--cccccc---ccccc-- Confidence 7764 3445555544433211 1123344555555655555553 3211110 000000 00000 Q ss_pred cchhhhhhhhhcceeeEEEcCcEEEEEec---ccccccccccccCccccccCCCceeeecCccccCCC-HHHHHHHHHHH Q lcl|NC_015266. 74 KRQAMFRKLRTARYLRIDVDDTGLAIGFD---DRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFS-SADRELVRDRL 149 (155) Q Consensus 74 ~~~~l~~~~~l~~sl~~~~~~~~~~v~~~---G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s-~~d~~~I~~~i 149 (155) ..+.+... .......-+...+.|++. ++...|+....||.. .+||+|||.=+ ++..++|++.+ T Consensus 68 --~~~~~~i~-~~~~~~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT~----------~~~a~PFl~pa~~~~k~~~~~~~ 134 (146) T protein:vir:10 68 --QHGADQIK-VTKAKLEGGIKTVKIGLNKADRSPWFYLKFHEWGTS----------KMPAHPFIEPGFNASKAEAVRAM 134 (146) T ss_pred --ccccccce-eccccccccceeEEeeeccCCCCCcceeeeeccCCC----------CCCCCcchhHHHHHhHHHHHHHH Confidence 00100000 011122223344556653 345689999999954 58999999766 34567777777 Q ss_pred HHHhcC Q lcl|NC_015266. 150 LRYLNR 155 (155) Q Consensus 150 ~~~l~r 155 (155) .+.|.+ T Consensus 135 ~~~l~~ 140 (146) T protein:vir:10 135 TDILKN 140 (146) T ss_pred HHHHHH Confidence 777777 No 91 >protein:vir:102085 Length: 146 # NCBI annotation: head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512318;genbank:gi:89152487;genbank:GeneID:3953078 Probab=95.22 E-value=0.00015 Score=41.52 Aligned_cols=129 Identities=15% Similarity=0.174 Sum_probs=61.5 Q ss_pred Cchh-------HHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcc Q lcl|NC_015266. 
1 MDDD-------LRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRI 73 (155) Q Consensus 1 m~~~-------~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~ 73 (155) |.|. |++|...|..|-.... ..-++.++.-|+.+....+.+. |-++-.. +..... ..+.+ T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~~-~~~~~al~~ga~~i~~~ak~~a-----p~~~~~~--~~~~~~---~~~~~-- 67 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRGE-KIEDKALAAGGEPIRKAIAERA-----PRSPSPK--KRSKSE---PWRTG-- 67 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHH-HHHHHHHHHHHHHHHHHHHHhC-----CCccccc--cccccc---ccccc-- Confidence 7764 3445555544433211 1123344555555655555553 3211110 000000 00000 Q ss_pred cchhhhhhhhhcceeeEEEcCcEEEEEec---ccccccccccccCccccccCCCceeeecCccccCCC-HHHHHHHHHHH Q lcl|NC_015266. 74 KRQAMFRKLRTARYLRIDVDDTGLAIGFD---DRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFS-SADRELVRDRL 149 (155) Q Consensus 74 ~~~~l~~~~~l~~sl~~~~~~~~~~v~~~---G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s-~~d~~~I~~~i 149 (155) ..+.+... .......-+...+.|++. ++...|+....||.. .+||+|||.=+ ++..++|++.+ T Consensus 68 --~~~~~~i~-~~~~~~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT~----------~~~a~PFl~pa~~~~k~~~~~~~ 134 (146) T protein:vir:10 68 --QHGADQIK-VTKAKLEGGIKTVKIGLNKADRSPWFYLKFHEWGTS----------KMPAHPFIEPGFNASKAEAVRAM 134 (146) T ss_pred --ccccccce-eccccccccceeEEeeeccCCCCCcceeeeeccCCC----------CCCCCcchhHHHHHhHHHHHHHH Confidence 00100000 011122223344556653 345689999999954 58999999766 34567777777 Q ss_pred HHHhcC Q lcl|NC_015266. 
150 LRYLNR 155 (155) Q Consensus 150 ~~~l~r 155 (155) .+.|.+ T Consensus 135 ~~~l~~ 140 (146) T protein:vir:10 135 TDILKN 140 (146) T ss_pred HHHHHH Confidence 777777 No 92 >protein:vir:107757 Length: 189 # NCBI annotation: gp20 # Family: family:all:503 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024868;genbank:gi:48697510;genbank:GeneID:2948378 Probab=95.17 E-value=0.00017 Score=41.29 Aligned_cols=91 Identities=9% Similarity=-0.100 Sum_probs=60.0 Q ss_pred CchhHHHHHHHHHHHHHhc-C-chhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhcccccc-------- Q lcl|NC_015266. 1 MDDDLRALEKWAGGLLAKL-A-PAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKV-------- 70 (155) Q Consensus 1 m~~~~~~l~~~l~~ll~~L-~-~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~-------- 70 (155) +++.-.+..+.|...+... . ..+-..+|..||..+....+..|..- +|+|+++.|.+.+..... T Consensus 69 ~~~~~~~~~~~l~~~~~~vl~G~~~~~~~L~~~G~~a~~~Ik~~I~~~------~~ppna~sTi~~Kg~~~~~~~~~~~~ 142 (189) T protein:vir:10 69 IAAQQAAWSQQMRFYAKQIVVGQMNVEQALEGLAIVARGDVDATLARL------KDPPLSPLTIYIRKFIKDGGVIHGYK 142 (189) T ss_pred HHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcC------CCCCCcHHHHHHhcccCcccchhhhh Confidence 3333334444444444321 0 12335689999999999999999874 377888777765532211 Q ss_pred --------------------CcccchhhhhhhhhcceeeEEEcCcEE Q lcl|NC_015266. 
71 --------------------GRIKRQAMFRKLRTARYLRIDVDDTGL 97 (155) Q Consensus 71 --------------------~~~~~~~l~~~~~l~~sl~~~~~~~~~ 97 (155) +....++|.+++.+.+||+|.+-...+ T Consensus 143 ~~~~~~~~~~~~~~~~~~~~~~~s~kPLidTG~l~~SIty~V~~k~~ 189 (189) T protein:vir:10 143 DIMRLRSEMQQEQAKGTLNLSGVSTDPLDFTGYMRATLSYTVTKEKS 189 (189) T ss_pred hhhhhhhhhhhhhhhccccccccCCCchhhHHHHHhhcceeeeecCC Confidence 112457899999999999998877766 No 93 >protein:vir:80037 Length: 199 # NCBI annotation: gp11 # Family: family:all:503 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468715;genbank:gi:157325295;genbank:GeneID:5601728 Probab=95.02 E-value=0.00025 Score=40.36 Aligned_cols=83 Identities=8% Similarity=0.135 Sum_probs=58.2 Q ss_pred CchhHHHHHHHHHHHHHhc--CchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhh Q lcl|NC_015266. 1 MDDDLRALEKWAGGLLAKL--APAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAM 78 (155) Q Consensus 1 m~~~~~~l~~~l~~ll~~L--~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l 78 (155) +++.-.+..+.+...+... ...+-.++|..||..+....+..|.+ | .|+|+++.|.+.+ || ..+.| T Consensus 115 ~~~~~~~~~~~~~~~~~~vl~g~~~a~~~L~~~G~~~~~~Ik~~I~~-----~-~~ppna~~Tia~r----Kg--~~kPL 182 (199) T protein:vir:80 115 FDEKSNKWGELFEGWIDDVIHGKLSAEQVYNRLGAKIVDDIQMKIVE-----I-QTPAKSAATLARN----PR--KNNPL 182 (199) T ss_pred HHHHHHHHHHHHHHHHHHHHhCCCcHHHHHHHHHHHHHHHHHHHHhc-----c-CCCCCCHHHHHHh----cC--CCCch Confidence 4444444444444444431 11234678999999999999999975 3 4899999986432 22 35689 Q ss_pred hhhhhhcceeeEEEcCc Q lcl|NC_015266. 79 FRKLRTARYLRIDVDDT 95 (155) Q Consensus 79 ~~~~~l~~sl~~~~~~~ 95 (155) .+++.|.+||+|.+-+. 
T Consensus 183 idTG~l~~SIty~V~~~ 199 (199) T protein:vir:80 183 IVTGKMKNSVTWKVMKS 199 (199) T ss_pred HHHHHHHhhcceeeeeC Confidence 99999999999988766 No 94 >protein:vir:101563 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958111;genbank:gi:41057657;genbank:GeneID:2716820 Probab=94.33 E-value=5.2e-05 Score=44.06 Aligned_cols=78 Identities=9% Similarity=0.022 Sum_probs=33.0 Q ss_pred CcccchhhhhhhhhcceeeEEEcCcEEEEEec---------ccccccccccccCccccccCC-------CceeeecCccc Q lcl|NC_015266. 71 GRIKRQAMFRKLRTARYLRIDVDDTGLAIGFD---------DRLSRIVRVHQEGQKAPVEPG-------GPLAQYPVRVV 134 (155) Q Consensus 71 ~~~~~~~l~~~~~l~~sl~~~~~~~~~~v~~~---------G~~~~yAaiHqfG~~~~~~~~-------~~~v~iPaRp~ 134 (155) -...++.|..- .+ +..+..+.|||. |....|+++|.++....++-. -.+++||+||| T Consensus 1 m~v~r~~L~~~---~~----~l~~~~V~VGi~~~a~y~d~~g~~~~~g~~~~~~~~~G~pva~ia~~~e~G~~~IP~RPF 73 (155) T protein:vir:10 1 MSVTRRGLTLP---KD----RYKSMSVKAGVLAGATYPDESGKKLADGTILKKDPRAGLPVAMIAMALNYGTSKLPARPF 73 (155) T ss_pred CcchHHHHHHH---HH----HhhCCeeEEeecCCCCCCccccchhhhhhhhccccccCcchhhhhhhhhcCCCCCCCcch Confidence 00111111110 01 112234666663 223334444444432111100 00468999999 Q ss_pred cCCCHH-HHHHHHHHHHHHhcC Q lcl|NC_015266. 135 LGFSSA-DRELVRDRLLRYLNR 155 (155) Q Consensus 135 LG~s~~-d~~~I~~~i~~~l~r 155 (155) |--+-+ ..+++.+.+..-+.. T Consensus 74 lr~t~~~~~~~~~~~l~~~~~~ 95 (155) T protein:vir:10 74 MEKTIADRSAEWIKGLTVMMTM 95 (155) T ss_pred hHHHHHHHHHHHHHHHHHHHHc Confidence 987643 345555555555544 No 95 >protein:vir:5257 Length: 148 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852762;genbank:gi:31544037;uniprot:Q7Y5T8;genbank:GeneID:2753554 Probab=94.22 E-value=5.1e-05 Score=44.13 Aligned_cols=75 Identities=12% Similarity=0.329 Sum_probs=36.9 Q ss_pred CcccchhhhhhccccccCcccchhhhhhhhhcceeeEEEcCcEEEEEecc------------cccccccccccCcccccc Q lcl|NC_015266. 
54 YVPRKIKKGGKGLRTKVGRIKRQAMFRKLRTARYLRIDVDDTGLAIGFDD------------RLSRIVRVHQEGQKAPVE 121 (155) Q Consensus 54 W~p~k~~~~~~~~~~~~~~~~~~~l~~~~~l~~sl~~~~~~~~~~v~~~G------------~~~~yAaiHqfG~~~~~~ 121 (155) .. ..-+........+...+.- .+...+.|||.+ ++..+|++|.||. T Consensus 1 M~----------~~~k~~~~~~~~l~~~l~~-------l~~~~v~VGi~~~~~~~~~~~~g~~vA~ia~~~E~G~----- 58 (148) T protein:vir:52 1 MA----------VTVTANFSAAKQLIEQMKS-------LKEKAVYVGFPAEFDEKVKGSENFNLASLAAVLEFGN----- 58 (148) T ss_pred Cc----------cccccccHHHHHHHHHHHH-------hhCCeEEEEeecCcCCCCCCCCCCCHHHHHHHHhcCC----- Confidence 10 0000000011122222211 135577787742 2567999999993 Q ss_pred CCCceeeecCccccCCCH-HHHHHHHHHHHHHhcC Q lcl|NC_015266. 122 PGGPLAQYPVRVVLGFSS-ADRELVRDRLLRYLNR 155 (155) Q Consensus 122 ~~~~~v~iPaRp~LG~s~-~d~~~I~~~i~~~l~r 155 (155) ++||+||||=-+- +..+++.+.+...+.. T Consensus 59 -----~~IP~Rpflr~t~~~~~~~~~~~~~~~~~~ 88 (148) T protein:vir:52 59 -----EHIPARPFLRQTLEENQEKYTALFIQWFDQ 88 (148) T ss_pred -----CCCCCcchhHHHHHHHHHHHHHHHHHHHHc Confidence 3699999994432 2344455444444433 No 96 >protein:vir:5257 Length: 148 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852762;genbank:gi:31544037;uniprot:Q7Y5T8;genbank:GeneID:2753554 Probab=94.05 E-value=0.00044 Score=38.99 Aligned_cols=78 Identities=6% Similarity=0.070 Sum_probs=56.2 Q ss_pred CchhHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhhhh Q lcl|NC_015266. 1 MDDDLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAMFR 80 (155) Q Consensus 1 m~~~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l~~ 80 (155) +++.-.++.+.+...+.. ..+-..+|..||..+....+..|.+. 
+|+|+++.|.+++ | ..+.|.+ T Consensus 71 ~~~~~~~~~~~~~~~~~~--~~~~~~~L~~~G~~~~~~ik~~I~~~------~~ppna~sTi~~K-----g--~~~PLid 135 (148) T protein:vir:52 71 LEENQEKYTALFIQWFDQ--GVPAAQIYERLSVMAQGDVQMNIVKG------EWVANAKSTIRRK-----K--SSKPLID 135 (148) T ss_pred HHHHHHHHHHHHHHHHHc--CCCHHHHHHHHHHHHHHHHHHHHhcC------CCCCCcHHHHHhc-----C--CCCchhH Confidence 333334444444444432 12235789999999999999999763 5899999987643 2 3578999 Q ss_pred hhhhcceeeEEEc Q lcl|NC_015266. 81 KLRTARYLRIDVD 93 (155) Q Consensus 81 ~~~l~~sl~~~~~ 93 (155) ++.|.+||+|.+- T Consensus 136 TG~l~~SIty~V~ 148 (148) T protein:vir:52 136 TGKMRQSVRGIVK 148 (148) T ss_pred HHHHHHHhhhhcC Confidence 9999999999887 No 97 >protein:vir:77650 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:YP_022741;genbank:gi:47835022;genbank:GeneID:2821447 Probab=93.76 E-value=7.5e-05 Score=43.21 Aligned_cols=68 Identities=15% Similarity=0.072 Sum_probs=31.3 Q ss_pred ccccccCcccchhhhhhhhhcceeeEEEcCcEEEEEec-------------------------cc-ccccccccccCccc Q lcl|NC_015266. 65 GLRTKVGRIKRQAMFRKLRTARYLRIDVDDTGLAIGFD-------------------------DR-LSRIVRVHQEGQKA 118 (155) Q Consensus 65 ~~~~~~~~~~~~~l~~~~~l~~sl~~~~~~~~~~v~~~-------------------------G~-~~~yAaiHqfG~~~ 118 (155) ....++| -+.+.+.+. .-.+.|||. |. +..+|+++.|| T Consensus 1 m~~~r~~---l~~~~~~l~----------~~~v~VGi~~~a~y~d~~~~~~~~~~~~~~~~~~G~pva~ia~~~e~G--- 64 (155) T protein:vir:77 1 MSVTRRG---LTLPKDRYR----------SMSVKAGVLAGATYPDESGKKLADGSILKKDPRAGLPVAMIAMALNYG--- 64 (155) T ss_pred CcchHHH---HHHHHHHHh----------cCceEEeecCCCCCccccchhhhhhhhccccccccccHhhhhhhhhcC--- Confidence 1111111 111222111 122333331 22 34566677776 Q ss_pred cccCCCceeeecCccccCCCHH-HHHHHHHHHHHHhcC Q lcl|NC_015266. 119 PVEPGGPLAQYPVRVVLGFSSA-DRELVRDRLLRYLNR 155 (155) Q Consensus 119 ~~~~~~~~v~iPaRp~LG~s~~-d~~~I~~~i~~~l~r 155 (155) +++||+||||=-+-+ ..+++.+.+..-+.. 
T Consensus 65 -------~~~IP~RPFlr~t~~~~~~~~~~~l~~~~~~ 95 (155) T protein:vir:77 65 -------TSKLPARPFMEKTIADRSAEWIKGLTVMMTM 95 (155) T ss_pred -------CCCCCCCchhhHHHHHHHHHHHHHHHHHHHc Confidence 367999999977643 344555555554444 No 98 >protein:vir:102963 Length: 163 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945289;genbank:gi:39653724;uniprot:Q708M3;genbank:GeneID:2672877 Probab=93.67 E-value=0.0018 Score=35.58 Aligned_cols=142 Identities=13% Similarity=0.065 Sum_probs=72.7 Q ss_pred Cch--hHHHH---HHHHHHHHHhcCch-hHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCccc Q lcl|NC_015266. 1 MDD--DLRAL---EKWAGGLLAKLAPA-ARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIK 74 (155) Q Consensus 1 m~~--~~~~l---~~~l~~ll~~L~~~-~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~ 74 (155) |++ |+.+| .+.|..++..-.+. --.+++.++|+.|.+...+|+=-+.-+++..-. . ...+...+ ... T Consensus 1 m~~~~d~~~l~~f~k~l~~~~~~~~~~~~~~~~~~e~a~~ll~~vk~rtPv~~~~~~~~~~--~---~~~~k~~k--~~~ 73 (163) T protein:vir:10 1 MSGGFDYRSFAKFANNFNRNANHAKVDRFMRQTLNYEGTELKSKVKERTPVGVYTDHWVEF--T---TKDGKHVK--FWA 73 (163) T ss_pred CCCccCHHHHHHHHHHHHHHhhhcchHHHHHHHHHHHHHHHHHHHHHhCCcccchhhhhhh--h---hcccchhh--hhc Confidence 887 34444 44444433321111 235789999999988888876433322221100 0 00000000 000 Q ss_pred chhhhhhhhhcceeeE---EEcCcEEEEEecccccccccccccCccccccCCCceeeecCccccCCCHHHHH-----HHH Q lcl|NC_015266. 75 RQAMFRKLRTARYLRI---DVDDTGLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSSADRE-----LVR 146 (155) Q Consensus 75 ~~~l~~~~~l~~sl~~---~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~~d~~-----~I~ 146 (155) ...--++|.|.+|++. .-+++...|.. .++.+||..--||=++... + =+|-+++|=.|.++.+ .|. 
T Consensus 74 ~~~~k~tG~lr~swk~~~~~k~~~~~~v~v-~N~~~YA~~VE~GHR~~~g--G---fV~G~fml~~s~~~~~~~~~~~~e 147 (163) T protein:vir:10 74 SAHGKQGGTLQKGWSKSRIEVSGRTYKQKV-YNKVYYAPHVEYGHKTVNG--G---FVPGQFFLHKTVEDTKSDMEKRVR 147 (163) T ss_pred cccccccchhhccceecceeecCCceEEEE-EecCCccchhhcceeecCC--c---eeccchhhHHHHHHHHHHHHHHHH Confidence 0011234455554432 33455544422 7889999999999877542 2 2689999988876542 333 Q ss_pred HHHHHHhcC Q lcl|NC_015266. 147 DRLLRYLNR 155 (155) Q Consensus 147 ~~i~~~l~r 155 (155) ..|.++|.+ T Consensus 148 ~~l~~~l~k 156 (163) T protein:vir:10 148 DKYDGFMRK 156 (163) T ss_pred HHHHHHHHH Confidence 444444444 No 99 >protein:vir:106728 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944315;genbank:gi:38638614;genbank:GeneID:2657357 Probab=93.61 E-value=7.6e-05 Score=43.17 Aligned_cols=68 Identities=10% Similarity=0.061 Sum_probs=31.4 Q ss_pred ccccccCcccchhhhhhhhhcceeeEEEcCcEEEEEecc---------c-----------------ccccccccccCccc Q lcl|NC_015266. 65 GLRTKVGRIKRQAMFRKLRTARYLRIDVDDTGLAIGFDD---------R-----------------LSRIVRVHQEGQKA 118 (155) Q Consensus 65 ~~~~~~~~~~~~~l~~~~~l~~sl~~~~~~~~~~v~~~G---------~-----------------~~~yAaiHqfG~~~ 118 (155) ..-.+ +.|..... +..+-.+.|||.. . +..+|.+|.|| T Consensus 1 m~v~~------k~L~~~~~-------~l~~~~v~VGi~~~a~y~d~~~~~~~~~~~~~~~~~~g~~va~ia~~~E~G--- 64 (155) T protein:vir:10 1 MSVTR------RGLTLPKD-------RYRSMSVKAGVLAGATYPDESGKKLADGTILTKDPRAGLPVAMIAMALNYG--- 64 (155) T ss_pred CcchH------HHHHHHHH-------HHhCCeeEEeecCCCCCccccchhhhhhhhcccccccCCcHHHHHHHHhcC--- Confidence 11111 11111111 1112334455422 1 23355555555 Q ss_pred cccCCCceeeecCccccCCC-HHHHHHHHHHHHHHhcC Q lcl|NC_015266. 119 PVEPGGPLAQYPVRVVLGFS-SADRELVRDRLLRYLNR 155 (155) Q Consensus 119 ~~~~~~~~v~iPaRp~LG~s-~~d~~~I~~~i~~~l~r 155 (155) +++||+||||=-+ ++..+++.+.+...+.. 
T Consensus 65 -------~~~IP~RPFlr~t~~~~~~~~~~~l~~~~~~ 95 (155) T protein:vir:10 65 -------TSKLPARPFMEKTIADRSAEWIKGLTVMMTM 95 (155) T ss_pred -------CCCCCCcchhHHHHHHHHHHHHHHHHHHHHc Confidence 3689999999554 23445566666666655 No 100 >protein:vir:6246 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813700;swissprot:trembl:q859b7;genbank:gi:29366760;uniprot:Q859B7;genbank:GeneID:1258903 Probab=93.47 E-value=0.0024 Score=34.94 Aligned_cols=123 Identities=16% Similarity=0.198 Sum_probs=76.5 Q ss_pred Cchh---------HHHHHHHHHHHH-HhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhcccccc Q lcl|NC_015266. 1 MDDD---------LRALEKWAGGLL-AKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKV 70 (155) Q Consensus 1 m~~~---------~~~l~~~l~~ll-~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~ 70 (155) |++- +.++...+..+. ..| +.+.++.++++|+.+... ..+..|+|..=+..+. T Consensus 1 ma~~~~~~vrV~Glr~f~~~mrK~~g~dl-~k~lk~a~~~aa~v~~~~-----ar~~tP~g~r~~~~s~----------- 63 (143) T protein:vir:62 1 MAQRSAYTIRVDGLREFQRNVRTLRDKEL-NKAVREANKASGEVLIPQ-----AKHESPDGKRDAKSSK----------- 63 (143) T ss_pred CCcccchheehHHHHHHHHHHHHhhCCch-hHHHHHHHHHHHHHHHHH-----HHhhcCCccccccccc----------- Confidence 8762 333333333331 112 234456677888776654 3567899965443221 Q ss_pred CcccchhhhhhhhhcceeeEEEcCcEEEEEecc-cccccccccccCccccccCCCceeeecCccccC--CCH-------H Q lcl|NC_015266. 
71 GRIKRQAMFRKLRTARYLRIDVDDTGLAIGFDD-RLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLG--FSS-------A 140 (155) Q Consensus 71 ~~~~~~~l~~~~~l~~sl~~~~~~~~~~v~~~G-~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG--~s~-------~ 140 (155) -.++|+|.+||+..++...+.|-..+ +..+||..-|||...+ +|-.+-||= ..+ - T Consensus 64 -------~~r~G~L~~Sir~aaT~raa~VrAG~~krVPYA~~I~~G~r~r--------~Isp~rFl~~a~a~te~~~~r~ 128 (143) T protein:vir:62 64 -------KYRPGKLDKSIKVTASAKGAVIKAGSASRVPYAAAIHFGYRAR--------NISPNRFLFRAMARKSDVVAAT 128 (143) T ss_pred -------ccCcchhhccccccccccceeeeeCCcCCCCcccccccCcccc--------cccchhhhhhhhhccCHHHHHH Confidence 23557888999999999999998755 4789999999995431 244555552 221 2 Q ss_pred HHHHHHHHHHHHhcC Q lcl|NC_015266. 141 DRELVRDRLLRYLNR 155 (155) Q Consensus 141 d~~~I~~~i~~~l~r 155 (155) =|.+|..+|..||.- T Consensus 129 Ye~~i~~vl~k~l~s 143 (143) T protein:vir:62 129 YERRIAAVVEKYLES 143 (143) T ss_pred HHHHHHHHHHHHhcC Confidence 356777777777777 No 101 >protein:vir:966 Length: 123 # NCBI annotation: Orf48 # Family: family:all:970 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076620;genbank:gi:13095728;genbank:GeneID:920248 Probab=93.31 E-value=0.0051 Score=33.17 Aligned_cols=118 Identities=12% Similarity=0.148 Sum_probs=62.2 Q ss_pred Cch--hHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhh Q lcl|NC_015266. 1 MDD--DLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAM 78 (155) Q Consensus 1 m~~--~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l 78 (155) |.+ .+..|.+.+..-|.+.+.-- .+-+.++.+..-......++. .+|. 
T Consensus 1 m~~~v~id~L~~~i~~~L~~y~~~v-~~~v~~~v~~~a~~~~~~lk~-~sP~---------------------------- 50 (123) T protein:vir:96 1 MANKISIDDLAKTIESEVRNWTKDV-VDDIDDIKKDITKNGVKQLRE-SSPK---------------------------- 50 (123) T ss_pred CCcccchhhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHh-hCCc---------------------------- Confidence 877 56677666666665553211 122333333333333344442 3442 Q ss_pred hhhhhhcceeeEEEcCcEEEEEeccccccc--ccccccCccccccCCCceeeecCccccCCCHH-HHHHHHHHHHHHhcC Q lcl|NC_015266. 79 FRKLRTARYLRIDVDDTGLAIGFDDRLSRI--VRVHQEGQKAPVEPGGPLAQYPVRVVLGFSSA-DRELVRDRLLRYLNR 155 (155) Q Consensus 79 ~~~~~l~~sl~~~~~~~~~~v~~~G~~~~y--AaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~~-d~~~I~~~i~~~l~r 155 (155) ++|.++++++...++++..|-+..++ .| +..-+||=.-+ .|+ ..|+|||+.-..+ -.+++.+.|..-|.| T Consensus 51 -~TG~yaksW~~k~~~~~~~~v~~~~~-~y~l~HLLE~GHa~r--~GG---rV~a~phI~paee~~~~~l~~~i~r~l~~ 123 (123) T protein:vir:96 51 -RTGDYAKNWTSQKLKNGDQVIYQKAP-TYRLTHLLENGHAKR--NGG---RVSPKVHIAPVEEELVSNYISRVEKRLSQ 123 (123) T ss_pred -cccccccceeeeecCCeeEEEEEecC-CcceEEeeecceeec--CCc---eeCcchhhhHHHHHHHHHHHHHHHHHhcC Confidence 12333444555555555444343333 44 45558994432 222 3599999976654 356777778888888 No 102 >protein:vir:78607 Length: 155 # NCBI annotation: BcepNY3gp06 # Family: family:all:503 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294843;genbank:gi:149882906;genbank:GeneID:5291078 Probab=93.27 E-value=0.0001 Score=42.50 Aligned_cols=78 Identities=9% Similarity=-0.028 Sum_probs=32.0 Q ss_pred ccccccCcccchhhhhhhhhcceeeEEEcCcEEEEEecc---------cccccccccccCccccccCC-------Cceee Q lcl|NC_015266. 65 GLRTKVGRIKRQAMFRKLRTARYLRIDVDDTGLAIGFDD---------RLSRIVRVHQEGQKAPVEPG-------GPLAQ 128 (155) Q Consensus 65 ~~~~~~~~~~~~~l~~~~~l~~sl~~~~~~~~~~v~~~G---------~~~~yAaiHqfG~~~~~~~~-------~~~v~ 128 (155) ..-. ++.|..... +..+-.+.|||.. 
....++++|.++....++.- --+++ T Consensus 1 m~v~------~k~L~~~~~-------~l~~~~v~VGi~~~a~y~d~~~~~~~~~~~~~~~~~~g~~va~ia~~~E~G~~~ 67 (155) T protein:vir:78 1 MSVT------RRGLTLPKD-------RYRSMSVKAGVLAGATYPDESGKKLADGTILTKDPRAGLPVAMIAMALNYGTSK 67 (155) T ss_pred Ccch------HHHHHHHHH-------HHhCCeeEEeecCCCCCCcccchhhhhhhhcccccccCCcHHHHHHhhhcCCCC Confidence 1111 111111111 1112334555422 12223334444322111000 00468 Q ss_pred ecCccccCCCH-HHHHHHHHHHHHHhcC Q lcl|NC_015266. 129 YPVRVVLGFSS-ADRELVRDRLLRYLNR 155 (155) Q Consensus 129 iPaRp~LG~s~-~d~~~I~~~i~~~l~r 155 (155) ||+||||=-+- +..+++.+.+...+.. T Consensus 68 IP~RPFlr~t~~~~~~~~~~~l~~~~~~ 95 (155) T protein:vir:78 68 LPARPFMEKTITDRSAEWIKGLTVMMTM 95 (155) T ss_pred CCCcchhhHHHHHHHHHHHHHHHHHHHc Confidence 99999998763 3445566666555555 No 103 >protein:vir:107757 Length: 189 # NCBI annotation: gp20 # Family: family:all:503 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024868;genbank:gi:48697510;genbank:GeneID:2948378 Probab=92.98 E-value=0.00024 Score=40.45 Aligned_cols=77 Identities=14% Similarity=0.218 Sum_probs=40.2 Q ss_pred hccccccCcccchhhhhhhhhcceeeEEEcCcEEEEEeccc--------ccccccccccCccccccCCCceeeecCcccc Q lcl|NC_015266. 64 KGLRTKVGRIKRQAMFRKLRTARYLRIDVDDTGLAIGFDDR--------LSRIVRVHQEGQKAPVEPGGPLAQYPVRVVL 135 (155) Q Consensus 64 ~~~~~~~~~~~~~~l~~~~~l~~sl~~~~~~~~~~v~~~G~--------~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~L 135 (155) =+..-+++......|...+ + ..+...+.|||... +..+|.+|.||-.. .+||+|||| T Consensus 1 M~~~i~~~~~~~~~L~~~l---k----~l~~k~V~VGi~~~~~y~dG~~vA~Ia~~~E~G~p~--------~~IP~RPFl 65 (189) T protein:vir:10 1 MGRVIRKQGPARVKLNAFI---K----GMNDYSVRIGWFSTAKYPDGTPTAYVASIHEFGAPS--------RGIPARSFI 65 (189) T ss_pred CcceeccCcHHHHHHHHHH---H----HhhCCeEEEEecCCCCCCCcccHHHHHHHHHhcCcC--------CCCCCchhh Confidence 0000000000011121111 1 12456777887532 56799999999542 369999999 Q ss_pred CCCHH-HHHHHHHHHHHHhcC Q lcl|NC_015266. 136 GFSSA-DRELVRDRLLRYLNR 155 (155) Q Consensus 136 G~s~~-d~~~I~~~i~~~l~r 155 (155) =-+-+ ..+++.+.+...+.. 
T Consensus 66 r~t~~~~~~~~~~~l~~~~~~ 86 (189) T protein:vir:10 66 RPTIAAQQAAWSQQMRFYAKQ 86 (189) T ss_pred hHHHHHHHHHHHHHHHHHHHH Confidence 77643 345555555555554 No 104 >protein:vir:94069 Length: 168 # NCBI annotation: putative RNA polymerase # Family: family:all:503 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453622;genbank:gi:84662658;genbank:GeneID:5142579 Probab=92.73 E-value=0.00029 Score=39.94 Aligned_cols=81 Identities=11% Similarity=0.049 Sum_probs=33.4 Q ss_pred HhcCCCCCCcCcccchhhhhhccccccCcccchhhhhhhhhcceeeEEEcCcE-EEEEe--------------cc-cccc Q lcl|NC_015266. 44 AAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAMFRKLRTARYLRIDVDDTG-LAIGF--------------DD-RLSR 107 (155) Q Consensus 44 ~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l~~~~~l~~sl~~~~~~~~-~~v~~--------------~G-~~~~ 107 (155) -.-+.+.|..- -...-+.++ ...+.+.+-++. ...|| .| ++.. T Consensus 1 ~~~~~~~g~~~--------------------~~~~~~~l~-~~~v~vG~l~~a~yp~G~~~~~~~~~~~~~~~~g~~va~ 59 (168) T protein:vir:94 1 MTTIARKGVKM--------------------PPHLEAQFQ-SGEVKAGVLSGSTYPQMTYTDQRTGKQIEDARGGMPVAV 59 (168) T ss_pred Cccccchhhhh--------------------hHHHHHhhh-ccceeeeccccCcccccccchhhcccccccccccccHHH Confidence 00011111110 000111111 011121111100 00100 01 4668 Q ss_pred cccccccCccccccCCCceeeecCccccCCC-HHHHHHHHHHHHHHhcC Q lcl|NC_015266. 108 IVRVHQEGQKAPVEPGGPLAQYPVRVVLGFS-SADRELVRDRLLRYLNR 155 (155) Q Consensus 108 yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s-~~d~~~I~~~i~~~l~r 155 (155) +|++|.||. 
++||+||||--+ ++..+++.+.+...|.- T Consensus 60 Ia~~~E~G~----------~~IP~RPFlr~t~~~~~~~~~~~~~~~~~~ 98 (168) T protein:vir:94 60 IAQALEYGH----------GQNHPRPFMQQTYAAQYRAWSRDLTLTLKA 98 (168) T ss_pred HHHHHhcCC----------CCCCCchhhHHHHHHHHHHHHHHHHHHHhc Confidence 999999993 579999999443 23445555555555443 No 105 >protein:vir:1332 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047931;swissprot:trembl:q9zxa7;genbank:gi:9631149;uniprot:Q9ZXA7;genbank:GeneID:2715891 Probab=92.70 E-value=0.004 Score=33.70 Aligned_cols=122 Identities=16% Similarity=0.208 Sum_probs=77.4 Q ss_pred Cchh---------HHHHHHHHHHHH-HhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhcccccc Q lcl|NC_015266. 1 MDDD---------LRALEKWAGGLL-AKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKV 70 (155) Q Consensus 1 m~~~---------~~~l~~~l~~ll-~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~ 70 (155) |++- +.++...+..+. ..| +.+.++.++++|+.+... ..+..|+|..=+|.+++ T Consensus 1 ma~~~~~~vkV~Glr~f~~~mrK~~g~dl-~k~lk~a~~~aa~v~~~~-----ar~~tP~g~~~p~~srr---------- 64 (143) T protein:vir:13 1 MAQRSAYTIQVDGLRQFQRNVRALRDKEL-NKAVREANKASGEVLIPQ-----AKHESPDGHRDPKSSKR---------- 64 (143) T ss_pred CCcccchheehHHHHHHHHHHHHhhCCcc-hHHHHHHHHHHHHHHHHH-----HHhhcCCcccccccccc---------- Confidence 8772 333333344331 112 234456777888776654 45779999766654432 Q ss_pred CcccchhhhhhhhhcceeeEEEcCcEEEEEeccc--ccccccccccCccccccCCCceeeecCccccC--CCH------- Q lcl|NC_015266. 71 GRIKRQAMFRKLRTARYLRIDVDDTGLAIGFDDR--LSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLG--FSS------- 139 (155) Q Consensus 71 ~~~~~~~l~~~~~l~~sl~~~~~~~~~~v~~~G~--~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG--~s~------- 139 (155) .+.|+|.+||+..++...+.|-. 
|+ -.+||..-|||..- -+|-++-||= ..+ T Consensus 65 --------~r~G~L~~Sir~aaT~raa~VrA-Gr~arVPYA~~I~~G~r~--------r~Is~~rFl~~a~a~te~~~~r 127 (143) T protein:vir:13 65 --------YRPGKLDKSIKVTASAKGAVIKA-GSAARVPYAAAIHFGYRK--------RNISANRFLYRAMARKSDVVAA 127 (143) T ss_pred --------cccchhhccccccccccceeeee-cCcCCCCcccccccCCcc--------cccchhhhhhhhhhccCHHHHH Confidence 24567889999999999999876 53 37999999999542 1355666763 221 Q ss_pred HHHHHHHHHHHHHhcC Q lcl|NC_015266. 140 ADRELVRDRLLRYLNR 155 (155) Q Consensus 140 ~d~~~I~~~i~~~l~r 155 (155) -=|.+|..+|..||.- T Consensus 128 ~Ye~~i~~vl~k~l~s 143 (143) T protein:vir:13 128 TYERRIAAVVEKYLES 143 (143) T ss_pred HHHHHHHHHHHHHhcC Confidence 2356777777777777 No 106 >protein:vir:9414 Length: 125 # NCBI annotation: phi PVL orf 11-like protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803392;genbank:gi:29028704;genbank:GeneID:1258141 Probab=92.07 E-value=0.0053 Score=33.08 Aligned_cols=112 Identities=11% Similarity=0.143 Sum_probs=63.1 Q ss_pred Cchh--HHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhh Q lcl|NC_015266. 1 MDDD--LRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAM 78 (155) Q Consensus 1 m~~~--~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l 78 (155) |+-+ +.+|++.|..|-..- ....+..+++-|+.+...-+.+ .|-.. .+ + T Consensus 1 M~v~v~~~~L~~~l~~l~~~~-~k~~~~Al~aga~~~~e~l~~~-----aP~~~----------------~~-----~-- 51 (125) T protein:vir:94 1 MGARIESNNIEQGLKNAVLKM-NLNSNVIVKAGAMSLVPLLKSN-----TPFAN----------------TK-----K-- 51 (125) T ss_pred CeeEeeHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHh-----CCCCC----------------CC-----c-- Confidence 6653 456666666554332 2223445666666665444433 23110 00 0 Q ss_pred hhhhhhcceeeEEE-------cCcEEEEEecccccccccccccCccccccCCCceeeecCccccCCC-HHHHHHHHHHHH Q lcl|NC_015266. 
79 FRKLRTARYLRIDV-------DDTGLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFS-SADRELVRDRLL 150 (155) Q Consensus 79 ~~~~~l~~sl~~~~-------~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s-~~d~~~I~~~i~ 150 (155) .+.++|.+.- ....+.||+.-...-||+...||.. ++||+||+.=+ ++..++++.++. T Consensus 52 ----hl~d~I~vs~~k~~~~~g~~~v~VG~~k~~~~~a~F~E~GT~----------k~~a~pF~~~a~~~~~~ev~~~~~ 117 (125) T protein:vir:94 52 ----HARDHIAVSNVKTDRHTSEKIVTIGYAKGVSHRIHATEFGTM----------YQKPQLFITKTEKQGKNKVLKTML 117 (125) T ss_pred ----hhhhheeecccccccccceEEEEeccCCCCceEEEeccCCcc----------CCCCCchhhHHHHHhHHHHHHHHH Confidence 1233332211 1224567663334578999999954 58999999777 456788899998 Q ss_pred HHhcC Q lcl|NC_015266. 151 RYLNR 155 (155) Q Consensus 151 ~~l~r 155 (155) +-|-| T Consensus 118 ~~lrk 122 (125) T protein:vir:94 118 DTAKR 122 (125) T ss_pred HHHHH Confidence 88887 No 107 >protein:vir:4704 Length: 125 # NCBI annotation: phi PVL ORF 11 homologue # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061636;genbank:gi:9635723;genbank:GeneID:1262995 Probab=92.07 E-value=0.0053 Score=33.08 Aligned_cols=112 Identities=11% Similarity=0.143 Sum_probs=63.1 Q ss_pred Cchh--HHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhh Q lcl|NC_015266. 1 MDDD--LRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAM 78 (155) Q Consensus 1 m~~~--~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l 78 (155) |+-+ +.+|++.|..|-..- ....+..+++-|+.+...-+.+ .|-.. 
.+ + T Consensus 1 M~v~v~~~~L~~~l~~l~~~~-~k~~~~Al~aga~~~~e~l~~~-----aP~~~----------------~~-----~-- 51 (125) T protein:vir:47 1 MGARIESNNIEQGLKNAVLKM-NLNSNVIVKAGAMSLVPLLKSN-----TPFAN----------------TK-----K-- 51 (125) T ss_pred CeeEeeHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHh-----CCCCC----------------CC-----c-- Confidence 6653 456666666554332 2223445666666665444433 23110 00 0 Q ss_pred hhhhhhcceeeEEE-------cCcEEEEEecccccccccccccCccccccCCCceeeecCccccCCC-HHHHHHHHHHHH Q lcl|NC_015266. 79 FRKLRTARYLRIDV-------DDTGLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFS-SADRELVRDRLL 150 (155) Q Consensus 79 ~~~~~l~~sl~~~~-------~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s-~~d~~~I~~~i~ 150 (155) .+.++|.+.- ....+.||+.-...-||+...||.. ++||+||+.=+ ++..++++.++. T Consensus 52 ----hl~d~I~vs~~k~~~~~g~~~v~VG~~k~~~~~a~F~E~GT~----------k~~a~pF~~~a~~~~~~ev~~~~~ 117 (125) T protein:vir:47 52 ----HARDHIAVSNVKTDRHTSEKIVTIGYAKGVSHRIHATEFGTM----------YQKPQLFITKTEKQGKNKVLKTML 117 (125) T ss_pred ----hhhhheeecccccccccceEEEEeccCCCCceEEEeccCCcc----------CCCCCchhhHHHHHhHHHHHHHHH Confidence 1233332211 1224567663334578999999954 58999999777 456788899998 Q ss_pred HHhcC Q lcl|NC_015266. 151 RYLNR 155 (155) Q Consensus 151 ~~l~r 155 (155) +-|-| T Consensus 118 ~~lrk 122 (125) T protein:vir:47 118 DTAKR 122 (125) T ss_pred HHHHH Confidence 88887 No 108 >protein:vir:79988 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430006;genbank:gi:156604061;genbank:GeneID:5525448 Probab=92.07 E-value=0.0053 Score=33.08 Aligned_cols=112 Identities=11% Similarity=0.143 Sum_probs=63.1 Q ss_pred Cchh--HHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhh Q lcl|NC_015266. 
1 MDDD--LRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAM 78 (155) Q Consensus 1 m~~~--~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l 78 (155) |+-+ +.+|++.|..|-..- ....+..+++-|+.+...-+.+ .|-.. .+ + T Consensus 1 M~v~v~~~~L~~~l~~l~~~~-~k~~~~Al~aga~~~~e~l~~~-----aP~~~----------------~~-----~-- 51 (125) T protein:vir:79 1 MGARIESNNIEQGLKNAVLKM-NLNSNVIVKAGAMSLVPLLKSN-----TPFAN----------------TK-----K-- 51 (125) T ss_pred CeeEeeHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHh-----CCCCC----------------CC-----c-- Confidence 6653 456666666554332 2223445666666665444433 23110 00 0 Q ss_pred hhhhhhcceeeEEE-------cCcEEEEEecccccccccccccCccccccCCCceeeecCccccCCC-HHHHHHHHHHHH Q lcl|NC_015266. 79 FRKLRTARYLRIDV-------DDTGLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFS-SADRELVRDRLL 150 (155) Q Consensus 79 ~~~~~l~~sl~~~~-------~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s-~~d~~~I~~~i~ 150 (155) .+.++|.+.- ....+.||+.-...-||+...||.. ++||+||+.=+ ++..++++.++. T Consensus 52 ----hl~d~I~vs~~k~~~~~g~~~v~VG~~k~~~~~a~F~E~GT~----------k~~a~pF~~~a~~~~~~ev~~~~~ 117 (125) T protein:vir:79 52 ----HARDHIAVSNVKTDRHTSEKIVTIGYAKGVSHRIHATEFGTM----------YQKPQLFITKTEKQGKNKVLKTML 117 (125) T ss_pred ----hhhhheeecccccccccceEEEEeccCCCCceEEEeccCCcc----------CCCCCchhhHHHHHhHHHHHHHHH Confidence 1233332211 1224567663334578999999954 58999999777 456788899998 Q ss_pred HHhcC Q lcl|NC_015266. 
151 RYLNR 155 (155) Q Consensus 151 ~~l~r 155 (155) +-|-| T Consensus 118 ~~lrk 122 (125) T protein:vir:79 118 DTAKR 122 (125) T ss_pred HHHHH Confidence 88887 No 109 >protein:vir:81106 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429878;genbank:gi:156603931;genbank:GeneID:5525326 Probab=92.07 E-value=0.0053 Score=33.08 Aligned_cols=112 Identities=11% Similarity=0.143 Sum_probs=63.1 Q ss_pred Cchh--HHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhh Q lcl|NC_015266. 1 MDDD--LRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAM 78 (155) Q Consensus 1 m~~~--~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l 78 (155) |+-+ +.+|++.|..|-..- ....+..+++-|+.+...-+.+ .|-.. .+ + T Consensus 1 M~v~v~~~~L~~~l~~l~~~~-~k~~~~Al~aga~~~~e~l~~~-----aP~~~----------------~~-----~-- 51 (125) T protein:vir:81 1 MGARIESNNIEQGLKNAVLKM-NLNSNVIVKAGAMSLVPLLKSN-----TPFAN----------------TK-----K-- 51 (125) T ss_pred CeeEeeHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHh-----CCCCC----------------CC-----c-- Confidence 6653 456666666554332 2223445666666665444433 23110 00 0 Q ss_pred hhhhhhcceeeEEE-------cCcEEEEEecccccccccccccCccccccCCCceeeecCccccCCC-HHHHHHHHHHHH Q lcl|NC_015266. 79 FRKLRTARYLRIDV-------DDTGLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFS-SADRELVRDRLL 150 (155) Q Consensus 79 ~~~~~l~~sl~~~~-------~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s-~~d~~~I~~~i~ 150 (155) .+.++|.+.- ....+.||+.-...-||+...||.. ++||+||+.=+ ++..++++.++. 
T Consensus 52 ----hl~d~I~vs~~k~~~~~g~~~v~VG~~k~~~~~a~F~E~GT~----------k~~a~pF~~~a~~~~~~ev~~~~~ 117 (125) T protein:vir:81 52 ----HARDHIAVSNVKTDRHTSEKIVTIGYAKGVSHRIHATEFGTM----------YQKPQLFITKTEKQGKNKVLKTML 117 (125) T ss_pred ----hhhhheeecccccccccceEEEEeccCCCCceEEEeccCCcc----------CCCCCchhhHHHHHhHHHHHHHHH Confidence 1233332211 1224567663334578999999954 58999999777 456788899998 Q ss_pred HHhcC Q lcl|NC_015266. 151 RYLNR 155 (155) Q Consensus 151 ~~l~r 155 (155) +-|-| T Consensus 118 ~~lrk 122 (125) T protein:vir:81 118 DTAKR 122 (125) T ss_pred HHHHH Confidence 88887 No 110 >protein:vir:98342 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918934;genbank:gi:119443696;genbank:GeneID:4594504 Probab=92.07 E-value=0.0053 Score=33.08 Aligned_cols=112 Identities=11% Similarity=0.143 Sum_probs=63.1 Q ss_pred Cchh--HHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhh Q lcl|NC_015266. 1 MDDD--LRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAM 78 (155) Q Consensus 1 m~~~--~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l 78 (155) |+-+ +.+|++.|..|-..- ....+..+++-|+.+...-+.+ .|-.. .+ + T Consensus 1 M~v~v~~~~L~~~l~~l~~~~-~k~~~~Al~aga~~~~e~l~~~-----aP~~~----------------~~-----~-- 51 (125) T protein:vir:98 1 MGARIESNNIEQGLKNAVLKM-NLNSNVIVKAGAMSLVPLLKSN-----TPFAN----------------TK-----K-- 51 (125) T ss_pred CeeEeeHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHh-----CCCCC----------------CC-----c-- Confidence 6653 456666666554332 2223445666666665444433 23110 00 0 Q ss_pred hhhhhhcceeeEEE-------cCcEEEEEecccccccccccccCccccccCCCceeeecCccccCCC-HHHHHHHHHHHH Q lcl|NC_015266. 
79 FRKLRTARYLRIDV-------DDTGLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFS-SADRELVRDRLL 150 (155) Q Consensus 79 ~~~~~l~~sl~~~~-------~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s-~~d~~~I~~~i~ 150 (155) .+.++|.+.- ....+.||+.-...-||+...||.. ++||+||+.=+ ++..++++.++. T Consensus 52 ----hl~d~I~vs~~k~~~~~g~~~v~VG~~k~~~~~a~F~E~GT~----------k~~a~pF~~~a~~~~~~ev~~~~~ 117 (125) T protein:vir:98 52 ----HARDHIAVSNVKTDRHTSEKIVTIGYAKGVSHRIHATEFGTM----------YQKPQLFITKTEKQGKNKVLKTML 117 (125) T ss_pred ----hhhhheeecccccccccceEEEEeccCCCCceEEEeccCCcc----------CCCCCchhhHHHHHhHHHHHHHHH Confidence 1233332211 1224567663334578999999954 58999999777 456788899998 Q ss_pred HHhcC Q lcl|NC_015266. 151 RYLNR 155 (155) Q Consensus 151 ~~l~r 155 (155) +-|-| T Consensus 118 ~~lrk 122 (125) T protein:vir:98 118 DTAKR 122 (125) T ss_pred HHHHH Confidence 88887 No 111 >protein:vir:100223 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025034;genbank:gi:48697267;genbank:GeneID:2948321 Probab=90.29 E-value=0.0045 Score=33.47 Aligned_cols=119 Identities=13% Similarity=0.152 Sum_probs=65.4 Q ss_pred CchhHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhhhh Q lcl|NC_015266. 1 MDDDLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAMFR 80 (155) Q Consensus 1 m~~~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l~~ 80 (155) |++.|.++...|+.|. .+++.+..++...-|+.+...-++. .|. +.. ..++. ... 
T Consensus 3 ~~~~l~e~l~~lekl~-~~~~~~~~k~tkaGA~v~~~~L~~~-----tp~-------~~~-~~~~~-----------~~~ 57 (139) T protein:vir:10 3 MDEALGQWLKQVSKAA-QLSVSDQEKITKAGADVYAKELAET-----TKE-------KHP-NTKGD-----------GGK 57 (139) T ss_pred HHHHHHHHHHHHHHhc-cCCHHHHHHHHHHHHHHHHHHHHHh-----ccc-------ccc-cCCCC-----------CCC Confidence 8888888888887753 5666777777777777665554432 221 100 00000 001 Q ss_pred hhhhcceeeEEE------cCcEEEEEecccccccccccccCccccccCCCceeeecCccccCCCHHH-HHHHHH----HH Q lcl|NC_015266. 81 KLRTARYLRIDV------DDTGLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSSAD-RELVRD----RL 149 (155) Q Consensus 81 ~~~l~~sl~~~~------~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~~d-~~~I~~----~i 149 (155) ...++++|.+.. .+..+.||| .....+|+.-.+|. +.+|+.+|+==+.++ ..+|+. .+ T Consensus 58 ~~HlaD~I~~~~~~idg~~~g~~~VG~-~~~~~~Ahf~n~GT----------~~~~~~hFie~t~~e~~~ev~~a~~~~~ 126 (139) T protein:vir:10 58 YGHLSEDISSAAGDIDGDHNGSSTVGF-HNKAHIARFLNDGT----------KNIRADHFVDNARDDAKDAVFAAEAEKY 126 (139) T ss_pred CCcccccceecCccccccccccceeCC-CCCceeeeeeccCc----------cccCCCchHHHHHHHHHHHHHHHHHHHH Confidence 123556665542 233366877 34455667777773 469999998554332 234444 44 Q ss_pred HHHhcC Q lcl|NC_015266. 150 LRYLNR 155 (155) Q Consensus 150 ~~~l~r 155 (155) .+.|.+ T Consensus 127 ke~l~~ 132 (139) T protein:vir:10 127 QAMIAK 132 (139) T ss_pred HHHHhh Confidence 444555 No 112 >protein:vir:78607 Length: 155 # NCBI annotation: BcepNY3gp06 # Family: family:all:503 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294843;genbank:gi:149882906;genbank:GeneID:5291078 Probab=89.70 E-value=0.0044 Score=33.51 Aligned_cols=78 Identities=9% Similarity=0.096 Sum_probs=54.4 Q ss_pred CchhHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhhhh Q lcl|NC_015266. 1 MDDDLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAMFR 80 (155) Q Consensus 1 m~~~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l~~ 80 (155) +++.-.+..+.|...+..- .+-..+|..||..+....+..|..- . 
+|+++.+..++ | ..+.|.+ T Consensus 78 ~~~~~~~~~~~l~~~~~~~--~~~~~~L~~~G~~~~~~Ik~~I~~~-----~--~pna~~Ti~~K-----g--~~kPLid 141 (155) T protein:vir:78 78 ITDRSAEWIKGLTVMMTMG--YDAEVAMGQIGQAMKDDIKTTISEW-----P--ADNSADWAGKK-----G--FNHGLIW 141 (155) T ss_pred HHHHHHHHHHHHHHHHHcC--CCHHHHHHHHHHHHHHHHHHHHhcC-----C--CCCcHHHHHhc-----C--CCCchhH Confidence 4444444555555555442 2335689999999999999999852 1 46666665432 2 4578999 Q ss_pred hhhhcceeeEEEcC Q lcl|NC_015266. 81 KLRTARYLRIDVDD 94 (155) Q Consensus 81 ~~~l~~sl~~~~~~ 94 (155) ++.|.+||+|.+-. T Consensus 142 TG~l~~SIty~V~~ 155 (155) T protein:vir:78 142 TSHLLNSVEQEIVK 155 (155) T ss_pred HHHHHHhhhhhccC Confidence 99999999998877 No 113 >protein:vir:100887 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358767;genbank:gi:77999993;genbank:GeneID:3726158 Probab=89.70 E-value=0.0079 Score=32.12 Aligned_cols=119 Identities=14% Similarity=0.170 Sum_probs=66.5 Q ss_pred CchhHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhhhh Q lcl|NC_015266. 1 MDDDLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAMFR 80 (155) Q Consensus 1 m~~~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l~~ 80 (155) |++-|+++...|+.+ ..++..+..++..+-|+.+...-++.- |. +-- ..++ .++ . T Consensus 3 ~~~~lee~l~~i~kl-~~~~~~~~~ki~kaGA~v~~e~L~~~t-----p~-------~~~-~~~~---~~~--------~ 57 (139) T protein:vir:10 3 MDEALGQWLKQVSKA-AELSISDQEKITKAGADVYAKKLAETT-----KE-------KHP-NTKG---DGG--------K 57 (139) T ss_pred HHHHHHHHHHHHHHh-hccCHHHHHHHHHHHHHHHHHHHHHhc-----cc-------ccC-cCCC---CCC--------C Confidence 888888888888776 456667677777777777665554332 21 100 0000 000 0 Q ss_pred hhhhcceeeEEE------cCcEEEEEecccccccccccccCccccccCCCceeeecCccccCCCHH-HHHHHHHHHHHHh Q lcl|NC_015266. 
81 KLRTARYLRIDV------DDTGLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSSA-DRELVRDRLLRYL 153 (155) Q Consensus 81 ~~~l~~sl~~~~------~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~~-d~~~I~~~i~~~l 153 (155) ...++++|.+.. .+..+.||| +....+|+.-.||. +.+|+.||+==+.+ -..+|+.++.+-| T Consensus 58 ~~HlaD~I~~s~~~~dg~~~g~~~VG~-~k~~~~A~f~n~GT----------~k~~~~hFie~t~~e~~~evl~a~~~~~ 126 (139) T protein:vir:10 58 YGHLSEDIRSAAGDIDGDHNGSSTVGF-HNKAHIARFLNDGT----------KYIRADHFVDNARDDAKDAVFAAEAEKY 126 (139) T ss_pred CcchhhcceecCcccccccceeeeeCC-CCCcceEeecccCc----------cccCCCchHHHHHHHHHHHHHHHHHHHH Confidence 113455555432 122345887 45567788888884 46999999854432 2345555544444 Q ss_pred ----cC Q lcl|NC_015266. 154 ----NR 155 (155) Q Consensus 154 ----~r 155 (155) .+ T Consensus 127 k~~l~~ 132 (139) T protein:vir:10 127 QAMIAK 132 (139) T ss_pred HHHHhh Confidence 44 No 114 >protein:vir:106728 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944315;genbank:gi:38638614;genbank:GeneID:2657357 Probab=89.67 E-value=0.0044 Score=33.51 Aligned_cols=78 Identities=9% Similarity=0.089 Sum_probs=54.6 Q ss_pred CchhHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhhhh Q lcl|NC_015266. 1 MDDDLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAMFR 80 (155) Q Consensus 1 m~~~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l~~ 80 (155) +++.-.+..+.|...+..- .+-...|..||..+....+..|.. +. +|+++.+..++ | ..+.|.+ T Consensus 78 ~~~~~~~~~~~l~~~~~~~--~~~~~~L~~lG~~~~~~Ik~~I~~-----~~--~pna~~Ti~~K-----G--~~kPLid 141 (155) T protein:vir:10 78 IADRSAEWIKGLTVMMTMG--YDAEVAMGQIGQAMKDDIKTTISE-----WP--ADNSADWAGKK-----G--FNHGLIW 141 (155) T ss_pred HHHHHHHHHHHHHHHHHcC--CCHHHHHHHHHHHHHHHHHHHHhc-----CC--CCCcHHHHHhc-----C--CCCchhH Confidence 4444445555555555442 233568999999999999999975 21 46677665432 2 4578999 Q ss_pred hhhhcceeeEEEcC Q lcl|NC_015266. 
81 KLRTARYLRIDVDD 94 (155) Q Consensus 81 ~~~l~~sl~~~~~~ 94 (155) ++.|.+||+|.+-. T Consensus 142 TG~l~~SIty~Vv~ 155 (155) T protein:vir:10 142 TSHLLNSVEQEIVK 155 (155) T ss_pred HHHHHHhhhhhccC Confidence 99999999998777 No 115 >protein:vir:4956 Length: 153 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049932;genbank:gi:9632903;genbank:GeneID:1262079 Probab=89.64 E-value=0.0077 Score=32.18 Aligned_cols=119 Identities=15% Similarity=0.206 Sum_probs=59.2 Q ss_pred CchhHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhhhh Q lcl|NC_015266. 1 MDDDLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAMFR 80 (155) Q Consensus 1 m~~~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l~~ 80 (155) |++-|+++...|..|. .+.+...+++.+.=|..+.. .++.. .|.. ...+++ .+. T Consensus 4 ~~~glee~~~~lekL~-~~~~~~~~katkAGA~v~~e----~L~~~-tp~~-h~~~~k-----------t~~-------- 57 (153) T protein:vir:49 4 LDEALEGWLKTVASIG-DLTPAEQAKITTAGAKVFKE----ELAEV-TREK-HYSKKK-----------DLK-------- 57 (153) T ss_pred HHHHHHHHHHHHHHhc-cCCHHHHHHHHHHHHHHHHH----HHHHh-cccc-CCCCCC-----------CCC-------- Confidence 4445666666666653 45545455555543433333 33322 1211 011110 000 Q ss_pred hhhhcceeeEEE------cCcEEEEEec-ccccccccccccCccccccCCCceeeecCccccCCCHHH---HHHHHHH-- Q lcl|NC_015266. 81 KLRTARYLRIDV------DDTGLAIGFD-DRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSSAD---RELVRDR-- 148 (155) Q Consensus 81 ~~~l~~sl~~~~------~~~~~~v~~~-G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~~d---~~~I~~~-- 148 (155) ...++++|.+.. .+..+.|||. .....||..-.+|. +.||+.||+==+.++ ..+|+.. 
T Consensus 58 ~~HlaD~I~~s~~~idG~~dG~s~VG~~~~~~a~~a~f~n~GT----------~km~~~hFie~tr~e~~~k~~vl~A~~ 127 (153) T protein:vir:49 58 YGHMADGLAVQSTNADGRKNGVSTVGWKNNYHAQNARRLNDGT----------KKYRADHFITNVQNDSTVKNKVLLAEK 127 (153) T ss_pred CCcccccceeccccccccccceeeecccCCccceeeeecccCc----------ccCCCChhhHHHHHHhhHHHHHHHHHH Confidence 124456665532 1225568883 44578888888884 458999999433333 3456643 Q ss_pred --HHHHhcC Q lcl|NC_015266. 149 --LLRYLNR 155 (155) Q Consensus 149 --i~~~l~r 155 (155) +.+-|.+ T Consensus 128 ~~~~~il~~ 136 (153) T protein:vir:49 128 EEYEKLIRR 136 (153) T ss_pred HHHHHHHHh Confidence 3344444 No 116 >protein:vir:94069 Length: 168 # NCBI annotation: putative RNA polymerase # Family: family:all:503 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453622;genbank:gi:84662658;genbank:GeneID:5142579 Probab=88.94 E-value=0.0062 Score=32.70 Aligned_cols=88 Identities=15% Similarity=0.022 Sum_probs=57.7 Q ss_pred CchhHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhhhh Q lcl|NC_015266. 1 MDDDLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAMFR 80 (155) Q Consensus 1 m~~~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l~~ 80 (155) +++.-.++.+.+..++..- .+-..+|..||..+....+..|.. ++ +|+++.+.+++ | ..+.|.+ T Consensus 81 ~~~~~~~~~~~~~~~~~~~--~~~~~~L~~lG~~~~~~Ik~~I~~-----~~--ppna~sTi~~K-----G--~~~PLiD 144 (168) T protein:vir:94 81 YAAQYRAWSRDLTLTLKAG--AAADTALRTVGQRMAEDIQDTIRN-----WP--ADNSPEWAAIK-----G--FNAGLRQ 144 (168) T ss_pred HHHHHHHHHHHHHHHHhcC--CCHHHHHHHHHHHHHHHHHHHhhc-----CC--CCccHHHHHhc-----C--CCCchhH Confidence 4444555555566655532 233568999999999999999975 22 57777776543 2 4568999 Q ss_pred hhhhcceeeEEEcCcEEEEEeccccccc Q lcl|NC_015266. 81 KLRTARYLRIDVDDTGLAIGFDDRLSRI 108 (155) Q Consensus 81 ~~~l~~sl~~~~~~~~~~v~~~G~~~~y 108 (155) ++.|.+||+|.+-.|+=. 
|-.-.- T Consensus 145 TG~l~~SIty~Vv~d~~~----~~~~~~ 168 (168) T protein:vir:94 145 TGVLLNAIDSAVIIDGEH----GEAPRE 168 (168) T ss_pred HHHHHhhcceeeeecCCC----CCCCCC Confidence 999999999966544322 211111 No 117 >protein:vir:81147 Length: 126 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285816;genbank:gi:148747737;genbank:GeneID:5247190 Probab=88.38 E-value=0.029 Score=29.05 Aligned_cols=115 Identities=20% Similarity=0.209 Sum_probs=52.6 Q ss_pred Cch-hHHHHHHH----HHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccc Q lcl|NC_015266. 1 MDD-DLRALEKW----AGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKR 75 (155) Q Consensus 1 m~~-~~~~l~~~----l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~ 75 (155) |++ ++..|.+. |....+...+ .-++...++|..++...+. .+|. T Consensus 1 Ma~i~id~la~~I~~~L~~y~~~v~~-~v~~~v~~~a~~~~~~ik~-----~aP~------------------------- 49 (126) T protein:vir:81 1 MANITIDRLADELLQAVKEYTDDVAE-GVRKKVDETARKVLKEAQA-----LAPK------------------------- 49 (126) T ss_pred CcccchhhHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHh-----hCCc------------------------- Confidence 665 34444444 3333333322 2234455666666555554 3342 Q ss_pred hhhhhhhhhcceeeEEEc---CcEEEEEecccccccccccccCccccccCCCceeeecCccccCCCHH-HHHHHHHHHHH Q lcl|NC_015266. 
76 QAMFRKLRTARYLRIDVD---DTGLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSSA-DRELVRDRLLR 151 (155) Q Consensus 76 ~~l~~~~~l~~sl~~~~~---~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~~-d~~~I~~~i~~ 151 (155) ++|.+.++++...+ +....|.+.-.....+..--||-.-+ .++ .+||+|||.-..+ -.+++.+.|.+ T Consensus 50 ----rTG~y~ksw~vk~~~~~g~~~~vv~~~~~~~l~HLLEfGha~r--~gG---rV~a~Phi~Pa~e~~~~~~~~~i~~ 120 (126) T protein:vir:81 50 ----RTGEYARTFTITKEDGYGTTKRIIWNKKHYRRVHLLEFGHAKV--NGG---RVKEYPHLRPAYDKHGARLPDELKR 120 (126) T ss_pred ----ccchhhccccccccccCCcceEEEeccCCCCceeeeecceecC--CCC---ccCCCcchHHHHHHHHHHHHHHHHH Confidence 01122222222111 11112222222123355667775432 222 2799999976543 34566666666 Q ss_pred HhcC Q lcl|NC_015266. 152 YLNR 155 (155) Q Consensus 152 ~l~r 155 (155) .|.- T Consensus 121 ~l~~ 124 (126) T protein:vir:81 121 VIEN 124 (126) T ss_pred Hhhc Confidence 6666 No 118 >protein:vir:5000 Length: 141 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049974;genbank:gi:9632946;genbank:GeneID:1262109 Probab=85.76 E-value=0.023 Score=29.59 Aligned_cols=119 Identities=14% Similarity=0.133 Sum_probs=62.6 Q ss_pred CchhHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhhhh Q lcl|NC_015266. 1 MDDDLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAMFR 80 (155) Q Consensus 1 m~~~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l~~ 80 (155) |++-|+++...|..|. ++.+.+..++...=|..+.. .++.. .|. +.- ..++. .. 
T Consensus 4 ~~~gl~e~~~~lekl~-~~~~~~~~katkAGA~v~~~----~L~~~-tp~-------~hy------~~~~~-------~~ 57 (141) T protein:vir:50 4 LAEALDEWLKTVASIG-NLTPAEQVEITTAGAKVFKK----ELEEV-TRE-------KHY------SRKKN-------PK 57 (141) T ss_pred HHHHHHHHHHHHHHhc-CCCHHHHHHHHHHHHHHHHH----HHHHh-ccc-------CCC------CCCCC-------CC Confidence 5555777777777664 45555555565543333333 22222 121 100 00000 01 Q ss_pred hhhhcceeeEEE------cCcEEEEEec-ccccccccccccCccccccCCCceeeecCccccCCCHHH---HHHHHHHHH Q lcl|NC_015266. 81 KLRTARYLRIDV------DDTGLAIGFD-DRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSSAD---RELVRDRLL 150 (155) Q Consensus 81 ~~~l~~sl~~~~------~~~~~~v~~~-G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~~d---~~~I~~~i~ 150 (155) ...++++|.+.. .+..+.|||. .+...+|..-.||. +.||+-||+==+.+| ..+|+.... T Consensus 58 ~~HlaD~I~~~~~~~DG~~dg~s~VG~~~~~~~~~A~f~n~GT----------~k~~~~hFve~~~~~a~~k~~Vl~A~~ 127 (141) T protein:vir:50 58 FGHMADGLAIQSTNADGRKNGVSTVGWKNNYHAQNARRLNDGT----------KKYRADHFVTNVQNDSTVQKKVLLEKK 127 (141) T ss_pred CCccccceeeccCccccccCCeeeeccCCCccceeeeccccCc----------cccCCCchhHHHHHhhhhHHHHHHHHH Confidence 124566676542 2335579883 33367888888884 358999998666554 345666555 Q ss_pred HHhcC Q lcl|NC_015266. 151 RYLNR 155 (155) Q Consensus 151 ~~l~r 155 (155) +-|.+ T Consensus 128 ~~~k~ 132 (141) T protein:vir:50 128 RNTKN 132 (141) T ss_pred HHHHH Confidence 55544 No 119 >protein:vir:4833 Length: 140 # NCBI annotation: ORF29 # Family: family:all:1029 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038330;genbank:gi:9634656;genbank:GeneID:1262624 Probab=85.43 E-value=0.032 Score=28.80 Aligned_cols=119 Identities=16% Similarity=0.194 Sum_probs=60.3 Q ss_pred CchhHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhhhh Q lcl|NC_015266. 1 MDDDLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAMFR 80 (155) Q Consensus 1 m~~~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l~~ 80 (155) |++-|+++...|..|. .+.+.+..++...=|+.+ +++++..+ |.+- +.. + +.+ . 
T Consensus 4 ~~d~l~e~~~~v~kl~-~~~~~~~~katkAGAkv~----~~~L~~~t-p~~h-~~~-------r----~t~--------~ 57 (140) T protein:vir:48 4 LDEALEGWLKTVASIG-DLTPAEQAKITTAGAKVF----KKELAEVT-REKH-YSK-------K----KDL--------K 57 (140) T ss_pred HHHHHHHHHHHHHHhc-cCCHHHHHHHHHHhHHHH----HHHHHHhc-ccCC-CCC-------C----CCC--------C Confidence 4444555655556544 455555544443333333 33333331 2110 000 0 000 0 Q ss_pred hhhhcceeeEEE------cCcEEEEEeccc-ccccccccccCccccccCCCceeeecCccccCCCHHH---HHHHHHHHH Q lcl|NC_015266. 81 KLRTARYLRIDV------DDTGLAIGFDDR-LSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSSAD---RELVRDRLL 150 (155) Q Consensus 81 ~~~l~~sl~~~~------~~~~~~v~~~G~-~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~~d---~~~I~~~i~ 150 (155) ...++++|.+.. .+....|||... ...||..-.+|. +.||+.||+==+.+| ..+|+.... T Consensus 58 ~~HlaD~I~~~~~~idg~~dG~s~VG~~k~~~a~~a~f~NdGT----------~k~~~~hFve~t~~e~~~~~~vl~A~~ 127 (140) T protein:vir:48 58 YGHMADGLAVQSTNVDGRKNGVATVGWKNNYHAQNARRLNDGT----------KKYRADHFVTNVQNDSAVRDKVLLAEK 127 (140) T ss_pred CCcccccceecccccccccccceeecccCCCceeEEeecccCc----------cccCCCchHHHHHHhhhhHHHHHHHHH Confidence 124566676542 133556888533 467888888884 359999999766655 245555554 Q ss_pred HH----hcC Q lcl|NC_015266. 151 RY----LNR 155 (155) Q Consensus 151 ~~----l~r 155 (155) +- |.| T Consensus 128 ~~y~~~l~k 136 (140) T protein:vir:48 128 EEYEKLIRK 136 (140) T ss_pred HHHHHHHHh Confidence 44 444 No 120 >protein:vir:102154 Length: 119 # NCBI annotation: phage protein, HK97 gp10 family # Family: family:all:10671 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699937;genbank:gi:110804042;genbank:GeneID:4206698 Probab=85.30 E-value=0.033 Score=28.73 Aligned_cols=110 Identities=10% Similarity=0.062 Sum_probs=56.5 Q ss_pred Cch----hHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccch Q lcl|NC_015266. 
1 MDD----DLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQ 76 (155) Q Consensus 1 m~~----~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~ 76 (155) |.+ -+++|.+.|+.+...-. ..-++.|+..|+.+..+.+.+ .| |- .|..+ T Consensus 1 Ma~iel~G~del~~~l~~~g~~~~-~ie~kAlk~g~e~I~~~~~~n-----~P----~~--------------tg~lk-- 54 (119) T protein:vir:10 1 MASLEIEGFEEFEKFISEDMVLDE-STKRKGIKAGITKIGKAIEKN-----SP----IK--------------SGRLS-- 54 (119) T ss_pred CceeehhhHHHHHHHHHhhhhhhH-HHHHHHHHHHhHHHHHHHhhc-----CC----cc--------------cCCcc-- Confidence 544 24455555544443221 222456777777777654322 12 21 01100 Q ss_pred hhhhhhhhcceeeEEEcCcEEEEEecccccccccccccCccccccCCCceeeecCc-cccCCCH-HHHHHHHHHHHHHhc Q lcl|NC_015266. 77 AMFRKLRTARYLRIDVDDTGLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVR-VVLGFSS-ADRELVRDRLLRYLN 154 (155) Q Consensus 77 ~l~~~~~l~~sl~~~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaR-p~LG~s~-~d~~~I~~~i~~~l~ 154 (155) ......-..+.+.||..-+..-|+-.+-||.. .+||+ ||+.=+- .-+.++..++.+=|- T Consensus 55 ---------kik~~~kk~g~~~VG~~ks~~fy~kF~EFGTS----------km~a~~pF~~~a~~~~~~eA~~~~~~el~ 115 (119) T protein:vir:10 55 ---------KVKIRVKNTGLATEGTASSSEFYDIFQNFGTS----------EQKAHVGYFDRAVDETTNEAVEEVAEIIF 115 (119) T ss_pred ---------eeeeeeecCceeEeccCCcchhhhhhcccccc----------ccCCCCCccccccccChHHHHHHHHHHHH Confidence 00011122346778654445589999999965 48999 9997663 234444444444444 Q ss_pred C Q lcl|NC_015266. 155 R 155 (155) Q Consensus 155 r 155 (155) + T Consensus 116 ~ 116 (119) T protein:vir:10 116 R 116 (119) T ss_pred H Confidence 4 No 121 >protein:vir:4859 Length: 140 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049399;genbank:gi:9632427;genbank:GeneID:1258496 Probab=84.50 E-value=0.035 Score=28.54 Aligned_cols=119 Identities=13% Similarity=0.130 Sum_probs=62.7 Q ss_pred CchhHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhhhh Q lcl|NC_015266. 
1 MDDDLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAMFR 80 (155) Q Consensus 1 m~~~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l~~ 80 (155) |++-|+++...|..|. .+++.+..++.+.=|..+...-.+.-. +..+.+..+ + . T Consensus 4 ~~d~l~e~~~~lekl~-~~~~~~~~katkAGA~v~~~~L~~~tp-~~h~~~~~t----------------~--------~ 57 (140) T protein:vir:48 4 LDEALEGWLKTVASIG-DLTPAEQAKITTAGAKVFKEELAEVTR-QKHYSNKKH----------------L--------K 57 (140) T ss_pred HHHHHHHHHHHHHHhc-cCCHHHHHHHHHHHHHHHHHHHHHhcc-ccCCCCCCC----------------C--------C Confidence 5555666666666653 555555555655433333332222111 111111111 0 0 Q ss_pred hhhhcceeeEEE------cCcEEEEEeccc-ccccccccccCccccccCCCceeeecCccccCCCHHH---HHHHHHHHH Q lcl|NC_015266. 81 KLRTARYLRIDV------DDTGLAIGFDDR-LSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSSAD---RELVRDRLL 150 (155) Q Consensus 81 ~~~l~~sl~~~~------~~~~~~v~~~G~-~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~~d---~~~I~~~i~ 150 (155) ...++++|.+.. .+....|||... ...+|..-.+|. +.||+-||+==+.++ ..+|+.... T Consensus 58 ~~HlaD~I~~~~~~iDg~~~g~s~VG~~kk~~a~~A~f~n~GT----------~k~~~~hFve~~~~e~~~k~~vl~A~~ 127 (140) T protein:vir:48 58 YGHMADGLSVQSTNVDGRKNGVSTVGWVNRYHAQNARRLNDGT----------KKYRADHFVTNVQNDSAVQTKVLLAEK 127 (140) T ss_pred CCcchhceeecccccccccCceeeeccCCCcceeeeeccccCc----------cccCCCchhHHHHHhhhhHHHHHHHHH Confidence 124566676542 133556888433 578888899994 459999998666554 345666444 Q ss_pred HHhcC Q lcl|NC_015266. 151 RYLNR 155 (155) Q Consensus 151 ~~l~r 155 (155) +-+.+ T Consensus 128 ~~~~~ 132 (140) T protein:vir:48 128 EEYEK 132 (140) T ss_pred HHHHH Confidence 44444 No 122 >protein:vir:101563 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958111;genbank:gi:41057657;genbank:GeneID:2716820 Probab=83.65 E-value=0.011 Score=31.27 Aligned_cols=78 Identities=9% Similarity=0.072 Sum_probs=51.5 Q ss_pred CchhHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhhhh Q lcl|NC_015266. 
1 MDDDLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAMFR 80 (155) Q Consensus 1 m~~~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l~~ 80 (155) +++.-.+..+.|...+..- .+-..+|..||..+....+..|..-. |+| ++.+..+ || ..+.|.+ T Consensus 78 ~~~~~~~~~~~l~~~~~~~--~~~~~~L~~~G~~~~~~Ik~~I~~~~------~p~-~~~Ti~~-----KG--~~~PLid 141 (155) T protein:vir:10 78 IADRSAEWIKGLTVMMTMG--YDAEVAMGQIGQAMKDDIKTTISEWP------ADN-NADWAGK-----KG--FNHGLIW 141 (155) T ss_pred HHHHHHHHHHHHHHHHHcC--CCHHHHHHHHHHHHHHHHHHHHhcCC------CCC-ChHHHHh-----cC--CCCchHH Confidence 3333344444444444332 22356899999999999999998643 544 3344332 22 4578999 Q ss_pred hhhhcceeeEEEcC Q lcl|NC_015266. 81 KLRTARYLRIDVDD 94 (155) Q Consensus 81 ~~~l~~sl~~~~~~ 94 (155) ++.|.+|++|.+-. T Consensus 142 TG~l~~Sity~Vv~ 155 (155) T protein:vir:10 142 TSHLLNSIEQEIVK 155 (155) T ss_pred HHHHHHhhhhhccC Confidence 99999999997766 No 123 >protein:vir:77650 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:YP_022741;genbank:gi:47835022;genbank:GeneID:2821447 Probab=83.28 E-value=0.012 Score=31.21 Aligned_cols=78 Identities=9% Similarity=0.097 Sum_probs=51.1 Q ss_pred CchhH----HHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccch Q lcl|NC_015266. 1 MDDDL----RALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQ 76 (155) Q Consensus 1 m~~~~----~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~ 76 (155) |-.-| .+..+.|...+..- .+-..+|..||..+....+..|..-. 
|+| ++.+..+ || ..+ T Consensus 74 lr~t~~~~~~~~~~~l~~~~~~~--~~~~~~L~~lG~~~~~~Iq~~I~~~~------~p~-~~~Ti~~-----KG--~d~ 137 (155) T protein:vir:77 74 MEKTIADRSAEWIKGLTVMMTMG--YDAEVAMGQIGQAMKDDIKTTISEWP------ADN-NADWAGK-----KG--FNH 137 (155) T ss_pred hhHHHHHHHHHHHHHHHHHHHcc--CcHHHHHHHHHHHHHHHHHHHHhcCC------CCC-ChHHHHh-----cC--CCC Confidence 44333 33444444444332 22356899999999999999998643 544 3344432 22 457 Q ss_pred hhhhhhhhcceeeEEEcC Q lcl|NC_015266. 77 AMFRKLRTARYLRIDVDD 94 (155) Q Consensus 77 ~l~~~~~l~~sl~~~~~~ 94 (155) .|.+++.|.+|++|.+-. T Consensus 138 PLidTG~l~~SIty~Vv~ 155 (155) T protein:vir:77 138 GLIWTSHLLNSIEQEIVK 155 (155) T ss_pred chhHHHHHHHhhhhhccC Confidence 899999999999998776 No 124 >protein:vir:3848 Length: 159 # NCBI annotation: hypothetical protein # Family: family:all:1029 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050154;swissprot:trembl:q9t1f3;genbank:gi:9633046;uniprot:Q9T1F3;genbank:GeneID:1262148 Probab=80.81 E-value=0.091 Score=26.28 Aligned_cols=131 Identities=11% Similarity=0.156 Sum_probs=61.1 Q ss_pred CchhHHH-HHHHHHHHHH--hcCchhHHHHHHHHHHHHHHHHHHHHHhcC---CCCCCcCcccchhhhhhccccccCccc Q lcl|NC_015266. 1 MDDDLRA-LEKWAGGLLA--KLAPAARRRLFRELGRDMRRAQQSRVAAQQ---NPDGSAYVPRKIKKGGKGLRTKVGRIK 74 (155) Q Consensus 1 m~~~~~~-l~~~l~~ll~--~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~---~PDG~pW~p~k~~~~~~~~~~~~~~~~ 74 (155) |..+|.+ |+++|..+-. .+++.+..++..+=| ...+++++..+ ..+-.+|.+-.... .+.-.+ T Consensus 1 mm~~~~~~l~~~l~~v~k~~~~~~~~k~kiTkAGA----kv~~e~L~~~Tp~~h~~~~k~~~~~~~~--~k~~~~----- 69 (159) T protein:vir:38 1 MANDMGEFYNNWVNEVEKGMKLSVEDKAKITGEGA----EAFSTVLHDHTPRSNEIYRRGRSAGHAN--AKHHNR----- 69 (159) T ss_pred CcchHHHHHHHHHHHHHHhcCCCHHHHHHHHHHhH----HHHHHHHHHhcccCCCcccccccccccc--ccccCc----- Confidence 5555554 4444444422 344455444433322 23334444432 11222332100000 000000 Q ss_pred chhhhhhhhhcceeeEEE--c-----CcEEEEEecc-cccccccccccCccccccCCCceeeecCccccC--CCH----H Q lcl|NC_015266. 
75 RQAMFRKLRTARYLRIDV--D-----DTGLAIGFDD-RLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLG--FSS----A 140 (155) Q Consensus 75 ~~~l~~~~~l~~sl~~~~--~-----~~~~~v~~~G-~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG--~s~----~ 140 (155) -.....|+++|.+.. + +..+.|||.+ ....||..-..|.. .+|..|+=| |=+ + T Consensus 70 ---~~~~~HlaD~I~~~~~~~iDg~~dG~s~VGw~~~~~a~~a~f~NdGT~----------~m~~k~~~gdHFvekt~~~ 136 (159) T protein:vir:38 70 ---NRKTKHLQDSITYKPGYTADKLHTGDTDVGFEGKYYDFLAKIVNNGQH----------HMSPKRYKNMHFLDKAQQE 136 (159) T ss_pred ---CcCCCccccceeeecCccccccccceeeecccCCccceEeeecccCcc----------ccCCCCccCChhHHHHHHH Confidence 122345677887753 2 3367799944 34678888888854 367776666 222 2 Q ss_pred HHHHHHHHHHHHhcC Q lcl|NC_015266. 141 DRELVRDRLLRYLNR 155 (155) Q Consensus 141 d~~~I~~~i~~~l~r 155 (155) -..+|+....+-|.+ T Consensus 137 ~k~~Vl~A~~~~~~~ 151 (159) T protein:vir:38 137 AKKSVAEAELKAYKE 151 (159) T ss_pred HHHHHHHHHHHHHHH Confidence 345555555555555 No 125 >protein:vir:95260 Length: 160 # NCBI annotation: Phage conserved protein # Family: family:all:31735 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944893;genbank:gi:38707833;genbank:GeneID:2744046 Probab=78.47 E-value=0.03 Score=28.95 Aligned_cols=90 Identities=11% Similarity=-0.008 Sum_probs=47.4 Q ss_pred CchhHH---------HHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccC Q lcl|NC_015266. 1 MDDDLR---------ALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVG 71 (155) Q Consensus 1 m~~~~~---------~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~ 71 (155) |-.-|+ .+......+...+.... ...+.-+|+.+....+.-++.= |+-..|+|.++.+.++|. 
T Consensus 62 ~R~tfe~~~~~~~~~~~~~~~~~i~~~~~~g~-~~~~~~LG~~~~~~ik~~I~~~--~~p~~w~pNap~Ti~~Kg----- 133 (160) T protein:vir:95 62 YRRLFEITMMLNKQTLLEQTKKNLYKQLSSLN-TDPSNTLEAFAKNAQKAIKRGF--GNSAILPPNAPSTVKKKG----- 133 (160) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc-hhHHHHHHHHHHHHHHHHHhhc--CCccCCCCCcHHHHHhcC----- Confidence 333221 11111111222221100 0112335555555555555442 222369999999887653 Q ss_pred cccchhhhhhhhhcceeeEEEcCcEEEEEeccc Q lcl|NC_015266. 72 RIKRQAMFRKLRTARYLRIDVDDTGLAIGFDDR 104 (155) Q Consensus 72 ~~~~~~l~~~~~l~~sl~~~~~~~~~~v~~~G~ 104 (155) ..++|++++.|..||+|++++.+--= . T Consensus 134 --s~~PLiDTg~l~~Si~y~v~~~~~~~----~ 160 (160) T protein:vir:95 134 --FNAPLVETGDLRDNLAYKISTKKGIK----K 160 (160) T ss_pred --CCCcchhhHHHhhhhhheeecccccC----C Confidence 45689999999999999887654321 1 No 126 >protein:vir:7412 Length: 168 # NCBI annotation: hypothetical protein # Family: family:all:1029 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839929;genbank:gi:30089899;genbank:GeneID:1260686 Probab=76.92 E-value=0.099 Score=26.08 Aligned_cols=126 Identities=11% Similarity=0.149 Sum_probs=68.0 Q ss_pred Cch---hHHHHHHHHHHHHHhcCchhHHHHHH----HHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcc Q lcl|NC_015266. 1 MDD---DLRALEKWAGGLLAKLAPAARRRLFR----ELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRI 73 (155) Q Consensus 1 m~~---~~~~l~~~l~~ll~~L~~~~r~~l~~----~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~ 73 (155) |.+ .|......+..+.-+|+..++.++-. -.++.|...|++++-..+. .+ T Consensus 1 M~~~~~~l~~~~~~vekl~~~lt~eqkakITkAGAkv~~~~L~~~t~~kHy~~k~---------------------t~-- 57 (168) T protein:vir:74 1 MATFEEAMQLIINQAESLSTKMTVEDKAEVTKAGAKVFEQALAYEVRNRHYRHRD---------------------TG-- 57 (168) T ss_pred CccHHHHHHHHHHHHHhhccCCCHHHHHHHHHhhhHHHHHHHHHHhHHhhcccCC---------------------Cc-- Confidence 553 34444444555555677777666543 3344455555544432211 00 Q ss_pred cchhhhhhhhhcceeeEEE------cCcEEEEEeccc-------ccccccccccCcccccc----C----CCceeeecCc Q lcl|NC_015266. 
74 KRQAMFRKLRTARYLRIDV------DDTGLAIGFDDR-------LSRIVRVHQEGQKAPVE----P----GGPLAQYPVR 132 (155) Q Consensus 74 ~~~~l~~~~~l~~sl~~~~------~~~~~~v~~~G~-------~~~yAaiHqfG~~~~~~----~----~~~~v~iPaR 132 (155) ....|++||.++. .+....|||.+. -+.+|+++.-|.+..+. + .+-.|.||+= T Consensus 58 ------~~~HLaDsI~~~~~niDg~~dG~s~VGf~~k~~~~~~~kA~iAr~lNDGTk~~~~~~~~~~~~~~~g~v~i~gD 131 (168) T protein:vir:74 58 ------EDPHLADSIVMKNKNIDGVKDGQSVVGWERSTEKGTHTKGYIANIINNGSRFPQFTTRSGRKYKKPGEVAVHAD 131 (168) T ss_pred ------ccchhhhheeecccccCcccCCceeecccccccccccchhhhhhhhcccccccccccccccccccccccccccc Confidence 1134567776554 345667998644 46899999999653221 1 1124789999 Q ss_pred cccCCCHHH---HHHHHHHHHH----HhcC Q lcl|NC_015266. 133 VVLGFSSAD---RELVRDRLLR----YLNR 155 (155) Q Consensus 133 p~LG~s~~d---~~~I~~~i~~----~l~r 155 (155) +|+=-..+| .++|+..-.+ -|.+ T Consensus 132 HFvd~~r~~~~~k~~V~~Ae~~~y~eIl~~ 161 (168) T protein:vir:74 132 HFIEETRMNLIVQQGILKAEAEAMRKIINR 161 (168) T ss_pred hhHHHHHhhhhhHHHHHHHHHHHHHHHHHh Confidence 999665554 2444444333 3333 No 127 >protein:vir:107703 Length: 147 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003902;genbank:gi:45686318;genbank:GeneID:2773043 Probab=71.61 E-value=0.15 Score=25.14 Aligned_cols=133 Identities=11% Similarity=0.055 Sum_probs=70.0 Q ss_pred Cch-hHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCC-CCC----cCcccchhhh--hhccccccCc Q lcl|NC_015266. 1 MDD-DLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNP-DGS----AYVPRKIKKG--GKGLRTKVGR 72 (155) Q Consensus 1 m~~-~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~P-DG~----pW~p~k~~~~--~~~~~~~~~~ 72 (155) |.| ++..+...+....+..+ ...-..+++++..+...... .+| |+- .|...-.... -.....+.|. 
T Consensus 1 ma~~~~~~F~~~i~~~~~~ve-~~~~~~~r~~a~~i~~~vv~-----~sPVdTGr~Ranw~vs~~~~~~~~~~~~dp~g~ 74 (147) T protein:vir:10 1 MANYQIRRFQGEIDAWINAAE-STLEHAIEIFVRDVHDALVS-----RSPVDTGRFKGNWQITFNEIPNHALNRYDKTGG 74 (147) T ss_pred CCCcchhhhhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHH-----hCCCcchhhccccceeecCccccccCCcCCCcc Confidence 988 67777777777777663 23345788888888777665 233 211 2322110000 0000000000 Q ss_pred ccchhhhhhhhhc-ceeeEEEcCcEEEEEecccccccccccccCccccccCCCceeeecCccccCCCHHHHHHHHHHHHH Q lcl|NC_015266. 73 IKRQAMFRKLRTA-RYLRIDVDDTGLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSSADRELVRDRLLR 151 (155) Q Consensus 73 ~~~~~l~~~~~l~-~sl~~~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~~d~~~I~~~i~~ 151 (155) . -...+... ..+.... ..+-+|. .+.+.+||.--.||... -+...|.+++.+....|.+-... T Consensus 75 ~----t~a~~~~~~~~~~~~~-~~~~~iy-i~Nn~pYA~~LEyG~S~----------QAP~G~V~~t~q~~~~~v~~~~~ 138 (147) T protein:vir:10 75 V----VRGEEQAKTYGMFSRG-GAITSVH-FSNMLIYANALEYGHSQ----------QAPSGVVGLVALRLRSYMADAIK 138 (147) T ss_pred c----hhhhhhHHHHHHhhhc-cCcceEE-EeeCcchhhhhhccccC----------CCCchHHHHHHHHHHHHHHHHHH Confidence 0 00000000 0011111 1121332 28899999999999764 46777888888888888777776 Q ss_pred HhcC Q lcl|NC_015266. 152 YLNR 155 (155) Q Consensus 152 ~l~r 155 (155) =+.| T Consensus 139 e~k~ 142 (147) T protein:vir:10 139 QARR 142 (147) T ss_pred HHHh Confidence 6766 No 128 >protein:vir:99528 Length: 92 # NCBI annotation: putative major tail protein # Family: family:all:180 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958541;genbank:gi:41179323;genbank:GeneID:2717166 Probab=70.74 E-value=0.13 Score=25.41 Aligned_cols=84 Identities=14% Similarity=0.224 Sum_probs=51.6 Q ss_pred Cch---h---HHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCccc Q lcl|NC_015266. 1 MDD---D---LRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIK 74 (155) Q Consensus 1 m~~---~---~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~ 74 (155) |++ . +++|.+.|.. 
.-...+-.+++.+.|..|....+.. .| T Consensus 1 Ma~~~i~~~Gld~L~~~L~~---~~~~~~v~~vv~~~~~~l~~~ak~~-----ap------------------------- 47 (92) T protein:vir:99 1 MADYSISWDGLDALDEALAN---QQNMNTVKKVVKKHTANLMTATQQA-----VP------------------------- 47 (92) T ss_pred CCceeeEeehHHHHHHHHHh---hccHHHHHHHHHHHHHHHHHHHHHh-----CC------------------------- Confidence 887 2 3344444332 1112333456777777776555553 12 Q ss_pred chhhhhhhhhcceeeEEEcCcEE--EEEecccccccccccccCccccccCCCceeeecC Q lcl|NC_015266. 75 RQAMFRKLRTARYLRIDVDDTGL--AIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPV 131 (155) Q Consensus 75 ~~~l~~~~~l~~sl~~~~~~~~~--~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPa 131 (155) .++|.|.+||......++. .|...|....||..--||.+- ++| T Consensus 48 ----~dTG~lrrSI~~~~~~~g~~~~v~~~gp~a~Ya~YvE~GTR~----------M~A 92 (92) T protein:vir:99 48 ----VDTGHLKQSAQIQISRDGFTGSVTYGGGLVNYAAYVEFGTRF----------MDS 92 (92) T ss_pred ----CCccccceeeeEEeecCCeeEEEEeccCccccccccccceee----------cCC Confidence 2457788899988777654 454446788999999999764 566 No 129 >protein:vir:103280 Length: 142 # NCBI annotation: phage-related hypothetical protein # Family: family:all:448 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277459;genbank:gi:71834102;genbank:GeneID:3562391 Probab=70.37 E-value=0.13 Score=25.41 Aligned_cols=132 Identities=11% Similarity=0.062 Sum_probs=71.3 Q ss_pred CchhHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCC--CCC---cCcccchh--hhhhccccccCcc Q lcl|NC_015266. 1 MDDDLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNP--DGS---AYVPRKIK--KGGKGLRTKVGRI 73 (155) Q Consensus 1 m~~~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~P--DG~---pW~p~k~~--~~~~~~~~~~~~~ 73 (155) |.++...+...+...+...+ ...-.++++++..+...... .+| .|. .|...-.. 
........+.| T Consensus 1 Ma~~~~sf~~~i~~~~~~ve-~~~~~v~r~~a~~i~~~vv~-----~sPVdTGr~R~nw~vs~~~~~~~~~~~~d~~G-- 72 (142) T protein:vir:10 1 MANDVVSFRNSINAWIDGVT-EGVELIVEGTLTKATKDIVK-----LSPVDTGRFRGNWQATGNSPAAQSLNNYDPDG-- 72 (142) T ss_pred CccchhhhhccHHHHHHHHH-HHHHHHHHHHHHHHHHHHHH-----hCcccchhhcccceeeecCcccccccCcCCCC-- Confidence 99887777777777777663 23345788888888777765 233 121 23321100 00000000001 Q ss_pred cchhhhhhhhhc-ceeeEEEcCcEEEEEecccccccccccccCccccccCCCceeeecCccccCCCHHHHHHHHHHHHHH Q lcl|NC_015266. 74 KRQAMFRKLRTA-RYLRIDVDDTGLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSSADRELVRDRLLRY 152 (155) Q Consensus 74 ~~~~l~~~~~l~-~sl~~~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~~d~~~I~~~i~~~ 152 (155) ......+... .-|.-..-.+.+-| +.|.+||.-..||...+ ....|.+++.+....|.+-...= T Consensus 73 --~~t~~~~~~~~~~i~~~~~g~~iyi---~Nn~pYA~~LEyG~S~Q----------AP~G~v~~a~q~~~~~v~~a~~e 137 (142) T protein:vir:10 73 --NETRNSLRRQIYALARDANTNVIYI---SNRLDYAQGLEFGSSNQ----------APSGVLGVVQKRLGRYFAEAVQE 137 (142) T ss_pred --ccchhhHHHHHHHhhhccccceEEE---eeCcchhhhhhccccCC----------CcchHHHHHHHHHHHHHHHHHHH Confidence 0111111100 01111112333333 89999999999997653 56778888888877777777666 Q ss_pred hcC Q lcl|NC_015266. 153 LNR 155 (155) Q Consensus 153 l~r 155 (155) +.+ T Consensus 138 ~~~ 140 (142) T protein:vir:10 138 AKR 140 (142) T ss_pred hhc Confidence 666 No 130 >protein:vir:3994 Length: 168 # NCBI annotation: unknown # Family: family:all:1029 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116502;genbank:gi:14251135;genbank:GeneID:921309 Probab=57.02 E-value=0.44 Score=22.52 Aligned_cols=126 Identities=11% Similarity=0.160 Sum_probs=68.9 Q ss_pred CchhHHHHHHHHHHHHHhcCchhHHHHHHH----HHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccch Q lcl|NC_015266. 1 MDDDLRALEKWAGGLLAKLAPAARRRLFRE----LGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQ 76 (155) Q Consensus 1 m~~~~~~l~~~l~~ll~~L~~~~r~~l~~~----Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~ 76 (155) |++.|..+.+.+..|.-+|++.++.++-.. 
.++.|...|+++.-..+ +.| T Consensus 4 ~~d~l~~~~~~v~kl~~~lt~e~kakIT~AGAkv~a~~L~~~T~~kHy~~r---------------------ktg----- 57 (168) T protein:vir:39 4 FYDAMQLIINQAESLSTKMTVEDKAEVTKAGAKVFEQALAYEVRNRHYRHR---------------------DTG----- 57 (168) T ss_pred HHHHHHHHHHHHHhccCCCCHHHHHHHHHHhHHHHHHHHHHHhHHhcccCC---------------------CCC----- Confidence 444566666666666657777776665433 23334444443221110 011 Q ss_pred hhhhhhhhcceeeEEEc------CcEEEEEeccc-------ccccccccccCccccc----cC----CCceeeecCcccc Q lcl|NC_015266. 77 AMFRKLRTARYLRIDVD------DTGLAIGFDDR-------LSRIVRVHQEGQKAPV----EP----GGPLAQYPVRVVL 135 (155) Q Consensus 77 ~l~~~~~l~~sl~~~~~------~~~~~v~~~G~-------~~~yAaiHqfG~~~~~----~~----~~~~v~iPaRp~L 135 (155) ....|++||.++.. +....|||.+. ...+|++-.-|.+..+ .+ +.-.|.||+=+|+ T Consensus 58 ---~~~HLADsI~~~~~niDg~~dG~StVGw~~k~~~~~~~~a~iAr~lNDGTrf~~~~~~~~~~y~~~g~v~i~gDHFv 134 (168) T protein:vir:39 58 ---EDPHLADSIVMKNKNIDGVKDGQSVVGWERSTEKGTHTKGYIANIINNGSRFPQFTTRSGRKYKNPGEVAVHADHFI 134 (168) T ss_pred ---CCccchhheeecccccCcccCCceeccccCccccccccchhheehhccccccchhhhhcccccccccceeecccchh Confidence 11345677766542 34556888664 6678888777764311 01 1224789999999 Q ss_pred CCCHHH---HHHHHHHHHHH----hcC Q lcl|NC_015266. 136 GFSSAD---RELVRDRLLRY----LNR 155 (155) Q Consensus 136 G~s~~d---~~~I~~~i~~~----l~r 155 (155) =-..+| ..+|+..-.+- |.+ T Consensus 135 d~~r~~~a~k~aV~~Ae~e~~~eil~~ 161 (168) T protein:vir:39 135 EETRKNPIVQQGILKAEAEAMRKIINR 161 (168) T ss_pred HHHhhhhhhhHHHHHHHHHHHHHHHHh Confidence 777666 25555554333 334 No 131 >protein:vir:1028 Length: 168 # NCBI annotation: Orf48 # Family: family:all:1029 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076682;genbank:gi:13095791;genbank:GeneID:920342 Probab=51.40 E-value=0.58 Score=21.87 Aligned_cols=126 Identities=10% Similarity=0.133 Sum_probs=71.2 Q ss_pred CchhHHHHHHHHHHHHHhcCchhHHHHHHH----HHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccch Q lcl|NC_015266. 
1 MDDDLRALEKWAGGLLAKLAPAARRRLFRE----LGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQ 76 (155) Q Consensus 1 m~~~~~~l~~~l~~ll~~L~~~~r~~l~~~----Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~ 76 (155) |++.+..+.+.+..|...|+..++.++-.. .++.|...|++.+-+.+ +.+ T Consensus 4 ~~d~l~~~~~~vekl~~~ls~eqkakITkAGAkv~~~~L~~~tk~kHy~~k---------------------~t~----- 57 (168) T protein:vir:10 4 FYDAMQLIVDRAEELSTKMSVEDKAEVTKAGAKVFEQALAYEVRNRHYRHR---------------------DTG----- 57 (168) T ss_pred HHHHHHHHHHHHHHhhcCCCHHHHHHHhHhhhHHHHHHHHHHhhHhhhccC---------------------CCC----- Confidence 455566666666666556777777666433 34444445544332211 000 Q ss_pred hhhhhhhhcceeeEEE------cCcEEEEEeccc-------ccccccccccCcccccc----C----CCceeeecCcccc Q lcl|NC_015266. 77 AMFRKLRTARYLRIDV------DDTGLAIGFDDR-------LSRIVRVHQEGQKAPVE----P----GGPLAQYPVRVVL 135 (155) Q Consensus 77 ~l~~~~~l~~sl~~~~------~~~~~~v~~~G~-------~~~yAaiHqfG~~~~~~----~----~~~~v~iPaRp~L 135 (155) ....|++||.++. .+....|||.+. -..+|+++.-|.+..+. + .+-.|.||+=+|+ T Consensus 58 ---~~~HLaDsI~~~~~niDg~~dG~s~VGf~~k~~~~~~~ka~iAr~lNDGTk~~~~~~~~~~~~~~~g~v~i~gDHFv 134 (168) T protein:vir:10 58 ---EDPHLADSIVMKNKNIDGVKDGQSVVGWERSTEKGTHTKGYIANIINNGSRFPQFTTRSGRKYKKPGEVAVHADHFI 134 (168) T ss_pred ---ccchhhhhheecccccccccCCceeecccCccccccccchheeeeccccccccccccccccccccccccccccchhH Confidence 1124566676543 345667999754 67899999999653221 1 1124789999999 Q ss_pred CCCHHH---HHHHHHHHHHHhcC Q lcl|NC_015266. 
136 GFSSAD---RELVRDRLLRYLNR 155 (155) Q Consensus 136 G~s~~d---~~~I~~~i~~~l~r 155 (155) =-..+| .++|+..-.+=+.+ T Consensus 135 d~~r~d~a~k~~V~~Ae~~~y~e 157 (168) T protein:vir:10 135 EETRKNPIVQQGILKAEAEAMRK 157 (168) T ss_pred HHhhhchhhhHHHHHHHHHHHHH Confidence 777666 35555544333333 No 132 >protein:vir:10367 Length: 119 # NCBI annotation: conserved phage protein # Family: family:all:2714 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858959;genbank:gi:32128424;genbank:GeneID:2648366 Probab=48.70 E-value=0.27 Score=23.71 Aligned_cols=87 Identities=10% Similarity=0.028 Sum_probs=40.9 Q ss_pred cchhhhhhccccccCcccchhhhhhhhhcceeeEEEc----CcEEE---EEecccccccccccccCccc--cc--cCCCc Q lcl|NC_015266. 57 RKIKKGGKGLRTKVGRIKRQAMFRKLRTARYLRIDVD----DTGLA---IGFDDRLSRIVRVHQEGQKA--PV--EPGGP 125 (155) Q Consensus 57 ~k~~~~~~~~~~~~~~~~~~~l~~~~~l~~sl~~~~~----~~~~~---v~~~G~~~~yAaiHqfG~~~--~~--~~~~~ 125 (155) .. ..- +...-.++|.|..+|....+ .|+-. |++.-...+|+..-.||--. .+ ...+. T Consensus 1 ~r--------Dea----karv~~~~G~Lr~sIY~ay~~~~S~dG~~~Y~Vswn~rkAPhghlvE~Ghw~~~~~~~~~dG~ 68 (119) T protein:vir:10 1 MR--------ESA----KAFVNDETGKLRSNLYVAYSTEESTNGVQTYAVSWRKKAAPHGHLLEFGHWQTHAAYKGKDGE 68 (119) T ss_pred CC--------ccc----ccccCCCccchhhhheeeeccccCCCCEEEEEeecCCCcCCcccccccceeeeeeeeeccCce Confidence 00 000 01122455667777754332 22322 44444445666666777211 00 11111 Q ss_pred ----------eeeecCccccCCCHH-HHHHHHHHHHHHhcC Q lcl|NC_015266. 
126 ----------LAQYPVRVVLGFSSA-DRELVRDRLLRYLNR 155 (155) Q Consensus 126 ----------~v~iPaRp~LG~s~~-d~~~I~~~i~~~l~r 155 (155) ...|||+|||==.=| -...+.+++.+-+.+ T Consensus 69 w~~~~~~l~~~~~vPa~pFlRpA~da~~~~a~~~~~~r~~~ 109 (119) T protein:vir:10 69 WYSSSVKLVNPKWIPARPFLRPGYDSVAMQIPDIAKAAGAK 109 (119) T ss_pred eeecCccccCceecCCCCccchhHHHHHHHHHHHHHHHHHH Confidence 236999999963222 345566666666555 No 133 >protein:vir:102338 Length: 116 # NCBI annotation: hypothetical protein # Family: family:all:26573 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529563;genbank:gi:90592648;genbank:GeneID:3974470 Probab=47.23 E-value=0.56 Score=21.96 Aligned_cols=101 Identities=11% Similarity=0.161 Sum_probs=54.8 Q ss_pred cCchhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchhhhhhhhhcceeeE-EEcCcEE Q lcl|NC_015266. 19 LAPAARRRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAMFRKLRTARYLRI-DVDDTGL 97 (155) Q Consensus 19 L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l~~~~~l~~sl~~-~~~~~~~ 97 (155) |+. .-.+.+++||..+.+.+.+| .|=|+- +++.|.+|.+. .+...+. T Consensus 1 l~~-~~~~~~~~~a~~l~~~vk~r-----TPv~~~--------------------------d~G~LR~sW~~g~v~k~~~ 48 (116) T protein:vir:10 1 MSK-NLRRAKNNIGNKLLRKVKPK-----TPVAKI--------------------------DGGTARKSWKYKELNLFDG 48 (116) T ss_pred Cch-HHHHHHHHHHHHHHHHHHhh-----CCCCcC--------------------------CCcccccCceeeeeeccCc Confidence 321 22345778888888777665 343321 12222333322 2223333 Q ss_pred EEEecccccccccccccCccccccCCCce---------eeecCccccCCCHHHHH-----HHHHHHHHHhc Q lcl|NC_015266. 98 AIGFDDRLSRIVRVHQEGQKAPVEPGGPL---------AQYPVRVVLGFSSADRE-----LVRDRLLRYLN 154 (155) Q Consensus 98 ~v~~~G~~~~yAaiHqfG~~~~~~~~~~~---------v~iPaRp~LG~s~~d~~-----~I~~~i~~~l~ 154 (155) +| .++..||..--||-++.+..+.+. 
--.|=+-+|=.|.++.+ .+...|.+||+ T Consensus 49 ~v---~N~~eYA~~VE~GHRq~~g~g~~~~~~gkrlk~~~V~G~fml~~s~~e~~~~~~~~~~~~~~~~l~ 116 (116) T protein:vir:10 49 VV---SNNVEYIHHLEYGHRTRQGTGTSENYRPKPNGISFVPGVFMLARSVDEMSSIIDDELNQIIIDFWN 116 (116) T ss_pred ee---ecCCcccccccCCceeeCCcceecccccccccCCccCceehHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 45 689999999999988865543210 12466777777765543 33344445555 No 134 >protein:vir:81067 Length: 119 # NCBI annotation: p12 # Family: family:all:2714 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285682;genbank:gi:156535145;genbank:GeneID:5247112 Probab=41.82 E-value=0.39 Score=22.80 Aligned_cols=87 Identities=10% Similarity=0.028 Sum_probs=40.9 Q ss_pred cchhhhhhccccccCcccchhhhhhhhhcceeeEEEc----CcEEE---EEecccccccccccccCccc--cc--cCCCc Q lcl|NC_015266. 57 RKIKKGGKGLRTKVGRIKRQAMFRKLRTARYLRIDVD----DTGLA---IGFDDRLSRIVRVHQEGQKA--PV--EPGGP 125 (155) Q Consensus 57 ~k~~~~~~~~~~~~~~~~~~~l~~~~~l~~sl~~~~~----~~~~~---v~~~G~~~~yAaiHqfG~~~--~~--~~~~~ 125 (155) .. ..- +...-.++|.|..+|....+ .|+-. |++.-...+|+..-.||--. .+ ...+. T Consensus 1 ~r--------Dea----karv~~~~G~Lr~sIY~ay~~~~S~dG~~~Y~Vswn~rkAPhghlvE~Ghw~~~~~~~~~dG~ 68 (119) T protein:vir:81 1 MR--------ESA----KAFVNDETGKLRSNLYVAYSPEESTNGVQTYAVSWRKKAAPHGHLLEFGHWQTHAAYKGKDGE 68 (119) T ss_pred CC--------ccc----ccccCCCccchhhhheeeeccccCCCCeEEEEeeccCCcCCcccccccceeeeeeeeeccCce Confidence 00 000 01122455667777754432 22222 44444445666666777111 00 11111 Q ss_pred ----------eeeecCccccCCCHH-HHHHHHHHHHHHhcC Q lcl|NC_015266. 
126 ----------LAQYPVRVVLGFSSA-DRELVRDRLLRYLNR 155 (155) Q Consensus 126 ----------~v~iPaRp~LG~s~~-d~~~I~~~i~~~l~r 155 (155) ...|||+|||==.=| -...+.+++.+-+.+ T Consensus 69 w~~~~~~l~~~~~vPa~pFlRpA~da~~~~a~~~~~~r~~~ 109 (119) T protein:vir:81 69 WYSSSVKLVNPKWIPARPFLRPGYDSVAMQIPDIAKAAGAK 109 (119) T ss_pred eeecCccccCceecCCCCccchhHHHHHHHHHHHHHHHHHH Confidence 246999999963322 345566666666555 No 135 >protein:vir:9879 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:2718 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795641;genbank:gi:28876400;genbank:GeneID:1257931 Probab=37.24 E-value=1.1 Score=20.29 Aligned_cols=122 Identities=16% Similarity=0.114 Sum_probs=61.6 Q ss_pred CchhHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHH-HHhcCCCCCCcCcccchhhhhhccccccCcccchhhh Q lcl|NC_015266. 1 MDDDLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSR-VAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQAMF 79 (155) Q Consensus 1 m~~~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~R-f~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~l~ 79 (155) |. -+++|...|.. .+ +.++=..+...+.+= -+.|+.+ |.||+-- .++ . T Consensus 1 i~-G~~~L~~~Lk~----~s-------~~dvk~VVkkN~ael~~r~q~~~-~~pv~~~-----------~k~-------~ 49 (127) T protein:vir:98 1 MT-GMPALEVKLRS----MS-------EKRWDRVANKNLTEMFNRAARPP-GTPIGKN-----------TKR-------H 49 (127) T ss_pred Cc-ChHHHHHHHHH----hh-------HHHHHHHHhhhhHHHHHHHHhcc-CCceecc-----------ccc-------c Confidence 32 23344443332 21 223322222222221 2233333 6666420 011 2 Q ss_pred hhhhhcceeeEEEcCcEEE--EEecccccccccccccCccccccCCCceee-ecCccccCCCHHHHHHH-HHHHHHHhcC Q lcl|NC_015266. 80 RKLRTARYLRIDVDDTGLA--IGFDDRLSRIVRVHQEGQKAPVEPGGPLAQ-YPVRVVLGFSSADRELV-RDRLLRYLNR 155 (155) Q Consensus 80 ~~~~l~~sl~~~~~~~~~~--v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~-iPaRp~LG~s~~d~~~I-~~~i~~~l~r 155 (155) +++.+.+|++.+...++.. |+-.|....||..--||.+. +. 
+++.|- .||=|||+=.=+....| ..-+.+-+-| T Consensus 50 dTG~lkRSi~l~~~~~g~~~~vgp~g~t~dYapyvEyGTR~-m~-~~~~~gf~~aqp~l~paf~~Qk~iF~~DL~~l~k~ 127 (127) T protein:vir:98 50 KSGELLRSRRLKKVNSSKDVITGNFGYIKDYAPHVEYGHRI-VR-NGKQVGYANGTKYLFNNVKKQREIYRQDMLNELRR 127 (127) T ss_pred CcccceeeeEEEEecCCceEEeccCcccccccceeecceee-ee-cccccccccCccccccchHHHhHHHHHHHHHHhcC Confidence 4677888999988888887 43344568899999999763 22 223333 78999998664443333 2223333333 No 136 >protein:vir:78335 Length: 133 # NCBI annotation: gp9 # Family: family:all:589 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468648;genbank:gi:157325225;genbank:GeneID:5601681 Probab=33.39 E-value=0.64 Score=21.64 Aligned_cols=119 Identities=13% Similarity=0.165 Sum_probs=62.1 Q ss_pred CchhHHHHHHHHHHHHHhcCchhHH----HHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccch Q lcl|NC_015266. 1 MDDDLRALEKWAGGLLAKLAPAARR----RLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQ 76 (155) Q Consensus 1 m~~~~~~l~~~l~~ll~~L~~~~r~----~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~ 76 (155) |+=++.=+++-|..|-.++.++.-+ .-+.+.|+.+....+.+|+.=++- |..-..-+ T Consensus 1 msvevkGv~eilk~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDT-Gati~ev~------------------ 61 (133) T protein:vir:78 1 MSVEVTGVEELERQLVSLFGRENLPQLVDPALIAGATLVAKTLKSEFVQFKDT-GASIDEIN------------------ 61 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHHhhcchhcc-cceeeeEE------------------ Confidence 9988777777677666677654421 246677777777777776653221 21111100 Q ss_pred hhhhhhhhcceeeEEEcCcEEEEEeccccccccccc--ccCcccc---ccCCC-----ceeeecCccccCCCHHHHHHHH Q lcl|NC_015266. 77 AMFRKLRTARYLRIDVDDTGLAIGFDDRLSRIVRVH--QEGQKAP---VEPGG-----PLAQYPVRVVLGFSSADRELVR 146 (155) Q Consensus 77 ~l~~~~~l~~sl~~~~~~~~~~v~~~G~~~~yAaiH--qfG~~~~---~~~~~-----~~v~iPaRp~LG~s~~d~~~I~ 146 (155) .+. -.+.-..-.+.|+|.|+-.+|.-|| -||.+-. +++++ +..+.-.++|. +.|. 
T Consensus 62 -------~s~-p~~~~G~r~V~i~W~gp~~R~~iVHLNE~GYtr~Gk~i~PrG~G~i~~a~~~se~~y~-------~~vk 126 (133) T protein:vir:78 62 -------IEK-PSYDKGVRSIKIDWKGPKDRYKIIHLNEYGYTRNGKKITPAGTGSVARSLRISERAYR-------AIVQ 126 (133) T ss_pred -------ecC-eeeeCCceEEEEEEecCCCceeEEEeeccceecCCCeEccchhhHHHHHHHhhhHHHH-------HHHH Confidence 000 1122234567899999988888888 4664321 12221 12234445554 2333 Q ss_pred HHHHHHh Q lcl|NC_015266. 147 DRLLRYL 153 (155) Q Consensus 147 ~~i~~~l 153 (155) +-|.+-| T Consensus 127 ~el~k~l 133 (133) T protein:vir:78 127 KKIGDKL 133 (133) T ss_pred HHHHhhC Confidence 3333333 No 137 >protein:vir:98636 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039927;genbank:gi:126011102;genbank:GeneID:4818472 Probab=33.31 E-value=0.83 Score=21.04 Aligned_cols=119 Identities=19% Similarity=0.259 Sum_probs=51.9 Q ss_pred Cch--hHHHHHHHHHHHHHhcCchhH----HHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCccc Q lcl|NC_015266. 1 MDD--DLRALEKWAGGLLAKLAPAAR----RRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIK 74 (155) Q Consensus 1 m~~--~~~~l~~~l~~ll~~L~~~~r----~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~ 74 (155) |+. |+.=+++.|..|-.+|.++.- ..-+.+.|+.++...+..|.-=++ .|..-..-. T Consensus 7 ~~~~aevkGv~Eilk~lE~klG~~~~~ri~nkAL~~~ge~v~~~lK~~~~~fkD-TGat~dev~---------------- 69 (138) T protein:vir:98 7 MSGFANLKGVEELLANMEKKLGPAKVNRVVNRSLKEIGKELEPSFKSAISIYKR-TGETTESAV---------------- 69 (138) T ss_pred ccccccccCHHHHHHHHHHhhCHHhhhhhhhHHHHHHHHHHHHHHHhhhhhhhh-ccceeeeee---------------- Confidence 332 333333333333333333211 125677888888888777764321 121111000 Q ss_pred chhhhhhhhhcceeeEEEcCcEEEEEecccccccccccc--cCccccccCCCc-----eeeecCccccCCCHHHHHHHHH Q lcl|NC_015266. 75 RQAMFRKLRTARYLRIDVDDTGLAIGFDDRLSRIVRVHQ--EGQKAPVEPGGP-----LAQYPVRVVLGFSSADRELVRD 147 (155) Q Consensus 75 ~~~l~~~~~l~~sl~~~~~~~~~~v~~~G~~~~yAaiHq--fG~~~~~~~~~~-----~v~iPaRp~LG~s~~d~~~I~~ 147 (155) .+ -..+.-.--.+.|+|.|+ +|.-||. 
||..-.++|++. .++.-.++|... |.+ T Consensus 70 ---------~s-~p~~~~G~r~V~igW~Gp--R~~ivHLNE~GyGk~i~PrG~G~I~ka~~~se~~y~~~-------vk~ 130 (138) T protein:vir:98 70 ---------VS-GVRREDGIPKVKLGFTTP--RWNIVHLQELEYGWKHNRRGVGVIRRYSDILETIYPRG-------IRD 130 (138) T ss_pred ---------ec-CeeecCCceEEEEeeecC--eeeEEeeecccccCCcCCCcchHHHHHHHhhhHHHHHH-------HHH Confidence 00 011222234678999887 8888887 565333344332 223333444322 222 Q ss_pred HHHHHhcC Q lcl|NC_015266. 148 RLLRYLNR 155 (155) Q Consensus 148 ~i~~~l~r 155 (155) .+..-|.= T Consensus 131 el~k~l~~ 138 (138) T protein:vir:98 131 KLKRGFDG 138 (138) T ss_pred HHHHHhcC Confidence 22222222 No 138 >protein:vir:103765 Length: 549 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024925;genbank:gi:48697195;genbank:GeneID:2846089 Probab=31.85 E-value=0.19 Score=24.52 Aligned_cols=107 Identities=14% Similarity=0.165 Sum_probs=49.4 Q ss_pred CchhHHHHHHHHHHHHHhcCchhHH---HHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccchh Q lcl|NC_015266. 1 MDDDLRALEKWAGGLLAKLAPAARR---RLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQA 77 (155) Q Consensus 1 m~~~~~~l~~~l~~ll~~L~~~~r~---~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~~ 77 (155) |+.|-+.+.+.|....+.|.. +|. ...++++++..- .+.+|..-..+|+.+ +...... T Consensus 1 m~~d~~~~~~~l~~r~~~l~~-~R~~~e~~w~e~~~~~lP-~~~~~~~~~~~~~~~-----------------~~~~~~~ 61 (549) T protein:vir:10 1 MTNDDAKILQALNADHGRMKE-KRQSYEAVWNDVIDYLMP-RLDKFGQLPRPDSEK-----------------GRERSQK 61 (549) T ss_pred CCcchHHHHHHHHHHHHHHHH-HhhhHHHHHHHHHHHhcc-ccccccccCCCCCCc-----------------ccccccc Confidence 999999999888888888752 222 244555554432 122232211112110 0000011 Q ss_pred hhhhhhhcceeeEEEcCcEEEEEecccccccccccccCccccccCCCceeeecCccccCCCHHHH------------HHH Q lcl|NC_015266. 
78 MFRKLRTARYLRIDVDDTGLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSSADR------------ELV 145 (155) Q Consensus 78 l~~~~~l~~sl~~~~~~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~~d~------------~~I 145 (155) ++++ .++ =.....|+-.+.|++. |.|||.+|+..|. +++ T Consensus 62 ~~ds-------------tg~-----~a~~~LAs~l~~~ltp-----------p~~~wF~l~~~~~~~~e~~~v~~~l~~v 112 (549) T protein:vir:10 62 MFDS-------------TAP-----LALRNFVAAMDSMITP-----------ATQLWHRLKTGNDALNEIASVKAYLQGV 112 (549) T ss_pred cccc-------------hHH-----HHHHHHHHHHHhhccC-----------CCCccccccCCccchhhhhHHHHHHHHH Confidence 1111 000 0122456666667664 7899987775432 222 Q ss_pred HHHHHHHhc--C Q lcl|NC_015266. 146 RDRLLRYLN--R 155 (155) Q Consensus 146 ~~~i~~~l~--r 155 (155) .+.+.+.|+ + T Consensus 113 e~~~~~~~~~~~ 124 (549) T protein:vir:10 113 VRTLFAARYRWQ 124 (549) T ss_pred HHHHHHHHhhhh Confidence 233333221 2 No 139 >protein:vir:96012 Length: 133 # NCBI annotation: ORF023 # Family: family:all:589 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239805;genbank:gi:66395471;genbank:GeneID:5132929 Probab=31.43 E-value=1.2 Score=20.22 Aligned_cols=118 Identities=16% Similarity=0.184 Sum_probs=56.1 Q ss_pred CchhHHHHHHHHHHHHHhcCchhHH----HHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccch Q lcl|NC_015266. 
1 MDDDLRALEKWAGGLLAKLAPAARR----RLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQ 76 (155) Q Consensus 1 m~~~~~~l~~~l~~ll~~L~~~~r~----~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~ 76 (155) |++ +.=+++-+..|-.++.++.-+ .-+...|+.+....+.+|+.=++- |......+ T Consensus 1 m~e-vkGv~eilk~lE~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDT-Gatidev~------------------ 60 (133) T protein:vir:96 1 MRL-IYDTKKLERELEKRLSKRALMRITDRALTEAGEVVLEAIRTNLKYFRDT-GAEYGEVK------------------ 60 (133) T ss_pred Ccc-ccCHHHHHHHHHHhcCHHHHHHHhhHHHHHHHHHHHHHHHHhhHHHhhc-cceeeeEE------------------ Confidence 984 333333344444455443321 246677888888777777653221 21111100 Q ss_pred hhhhhhhhcceeeEEEcCcEEEEEeccccccccccc--ccCcccc----ccCCC-----ceeeecCccccCCCHHHHHHH Q lcl|NC_015266. 77 AMFRKLRTARYLRIDVDDTGLAIGFDDRLSRIVRVH--QEGQKAP----VEPGG-----PLAQYPVRVVLGFSSADRELV 145 (155) Q Consensus 77 ~l~~~~~l~~sl~~~~~~~~~~v~~~G~~~~yAaiH--qfG~~~~----~~~~~-----~~v~iPaRp~LG~s~~d~~~I 145 (155) .+. -.+...--.+.|+|.|+-.+|.-|| -||.-.+ +++++ +..+.-.++|...-.++.+++ T Consensus 61 -------~s~-p~~~~g~rtV~i~W~gp~~R~~iVHLNE~G~ytr~Gk~i~PrG~G~I~~al~~se~~y~~~vk~el~kl 132 (133) T protein:vir:96 61 -------LSK-PTWENGKRTIRVYWEGEKHRYSIVHLNEKGFYAKDGKFIRPKGMGAIDKALRASRDKFFKVYAEEVSKL 132 (133) T ss_pred -------ecC-ceecCCceEEEEEeecCCCceeeEeeecccceecCCceeccchhhHHHHHHHhhhHHHHHHHHHHHHHh Confidence 000 0111123357899989988888888 4663321 12222 123344555554333333333 Q ss_pred H Q lcl|NC_015266. 146 R 146 (155) Q Consensus 146 ~ 146 (155) + T Consensus 133 l 133 (133) T protein:vir:96 133 L 133 (133) T ss_pred C Confidence 3 No 140 >protein:vir:101302 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908835;genbank:gi:118725099;genbank:GeneID:4555873 Probab=25.16 E-value=0.9 Score=20.82 Aligned_cols=124 Identities=10% Similarity=0.100 Sum_probs=62.0 Q ss_pred CchhHHHHHHHHHHHHHhcCchhH----HHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccch Q lcl|NC_015266. 
1 MDDDLRALEKWAGGLLAKLAPAAR----RRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQ 76 (155) Q Consensus 1 m~~~~~~l~~~l~~ll~~L~~~~r----~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~ 76 (155) ||=++.=+++-|..|-..+.++.- .+-+.+.|+.+....+.+|.-=++ .|.--...+- T Consensus 1 msvevkGv~eil~~le~k~g~~~~~ri~nkAL~~age~v~~~~K~~~~~fkD-TG~t~~ev~~----------------- 62 (134) T protein:vir:10 1 MSVKVIGDKALERELEKRFGIKEMVKVQDKALIAGAKVIVEEVKKQLKPSKD-TGALINEVSF----------------- 62 (134) T ss_pred CeEEEecHHHHHHHHHHhhchhhhhhhhhHHHHHHHHHHHHHHHhhhhhhhh-ccceeccEEe----------------- Confidence 887766555555555555543321 135677788888887777765333 2322221110 Q ss_pred hhhhhhhhcceeeEEEcCcEEEEEeccccccccccc--ccCccccccCCCceeeecCccccCCC---HHHHHHHHHHHHH Q lcl|NC_015266. 77 AMFRKLRTARYLRIDVDDTGLAIGFDDRLSRIVRVH--QEGQKAPVEPGGPLAQYPVRVVLGFS---SADRELVRDRLLR 151 (155) Q Consensus 77 ~l~~~~~l~~sl~~~~~~~~~~v~~~G~~~~yAaiH--qfG~~~~~~~~~~~v~iPaRp~LG~s---~~d~~~I~~~i~~ 151 (155) +. ..+.-.--.+.|+|.|...+|.-|| -||-+..- .++.+ -.|=|=.|. ++-+....+++.+ T Consensus 63 --------s~-p~~~~G~r~V~vgW~G~~~R~~iiHLNE~Gytr~~--~Gk~i--~PrG~G~i~~a~~~~e~~~~~~ik~ 129 (134) T protein:vir:10 63 --------SK-PEWINGKRTITVHWRGSKDRYKIVHLIEYGHVQKG--TGKFI--KPKAMGGVNRAIRQGQNKYFETLKR 129 (134) T ss_pred --------cC-eeecCCceEEEEEEEcCCceeEEEEeecccceecc--cCCcc--CcchhhHHHHHHHhhhHHHHHHHHH Confidence 00 1111223457899989988888887 57754321 11111 111111110 2334444555555 Q ss_pred HhcC Q lcl|NC_015266. 
152 YLNR 155 (155) Q Consensus 152 ~l~r 155 (155) =|.| T Consensus 130 eL~k 133 (134) T protein:vir:10 130 ELKK 133 (134) T ss_pred HHhc Confidence 5555 No 141 >protein:vir:9513 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835560;genbank:gi:30043947;genbank:GeneID:1260542 Probab=25.16 E-value=0.9 Score=20.82 Aligned_cols=124 Identities=10% Similarity=0.100 Sum_probs=62.0 Q ss_pred CchhHHHHHHHHHHHHHhcCchhH----HHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccchhhhhhccccccCcccch Q lcl|NC_015266. 1 MDDDLRALEKWAGGLLAKLAPAAR----RRLFRELGRDMRRAQQSRVAAQQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQ 76 (155) Q Consensus 1 m~~~~~~l~~~l~~ll~~L~~~~r----~~l~~~Ig~~L~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~ 76 (155) ||=++.=+++-|..|-..+.++.- .+-+.+.|+.+....+.+|.-=++ .|.--...+- T Consensus 1 msvevkGv~eil~~le~k~g~~~~~ri~nkAL~~age~v~~~~K~~~~~fkD-TG~t~~ev~~----------------- 62 (134) T protein:vir:95 1 MSVKVIGDKALERELEKRFGIKEMVKVQDKALIAGAKVIVEEVKKQLKPSKD-TGALINEVSF----------------- 62 (134) T ss_pred CeEEEecHHHHHHHHHHhhchhhhhhhhhHHHHHHHHHHHHHHHhhhhhhhh-ccceeccEEe----------------- Confidence 887766555555555555543321 135677788888887777765333 2322221110 Q ss_pred hhhhhhhhcceeeEEEcCcEEEEEeccccccccccc--ccCccccccCCCceeeecCccccCCC---HHHHHHHHHHHHH Q lcl|NC_015266. 77 AMFRKLRTARYLRIDVDDTGLAIGFDDRLSRIVRVH--QEGQKAPVEPGGPLAQYPVRVVLGFS---SADRELVRDRLLR 151 (155) Q Consensus 77 ~l~~~~~l~~sl~~~~~~~~~~v~~~G~~~~yAaiH--qfG~~~~~~~~~~~v~iPaRp~LG~s---~~d~~~I~~~i~~ 151 (155) +. ..+.-.--.+.|+|.|...+|.-|| -||-+..- .++.+ -.|=|=.|. 
++-+....+++.+ T Consensus 63 --------s~-p~~~~G~r~V~vgW~G~~~R~~iiHLNE~Gytr~~--~Gk~i--~PrG~G~i~~a~~~~e~~~~~~ik~ 129 (134) T protein:vir:95 63 --------SK-PEWINGKRTITVHWRGSKDRYKIVHLIEYGHVQKG--TGKFI--KPKAMGGVNRAIRQGQNKYFETLKR 129 (134) T ss_pred --------cC-eeecCCceEEEEEEEcCCceeEEEEeecccceecc--cCCcc--CcchhhHHHHHHHhhhHHHHHHHHH Confidence 00 1111223457899989988888887 57754321 11111 111111110 2334444555555 Q ss_pred HhcC Q lcl|NC_015266. 152 YLNR 155 (155) Q Consensus 152 ~l~r 155 (155) =|.| T Consensus 130 eL~k 133 (134) T protein:vir:95 130 ELKK 133 (134) T ss_pred HHhc Confidence 5555 No 142 >protein:vir:95372 Length: 124 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764480;genbank:gi:115334634;genbank:GeneID:5179259 Probab=23.41 E-value=2.3 Score=18.59 Aligned_cols=117 Identities=13% Similarity=0.097 Sum_probs=45.7 Q ss_pred Cch-hHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHh---cCCCCCCcCcccchhhhhhccccccCcccch Q lcl|NC_015266. 1 MDD-DLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAA---QQNPDGSAYVPRKIKKGGKGLRTKVGRIKRQ 76 (155) Q Consensus 1 m~~-~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~---q~~PDG~pW~p~k~~~~~~~~~~~~~~~~~~ 76 (155) |++ .+..|.+.+..-|..-+.. -.+-+.++-+..-..+-+.++. +.+|.-+ T Consensus 1 M~~i~id~La~~I~~~L~~Ys~~-v~~~v~~~v~~vak~a~~~lkk~i~~tspkrT------------------------ 55 (124) T protein:vir:95 1 MAKIKIGRLADEITSQLRKYSQV-IADDVEQIMDDVTKEAVGRLKSKIQEVGLVQT------------------------ 55 (124) T ss_pred CccccHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhHhcCcccc------------------------ Confidence 776 3555555555444332110 0011222222222223333333 2345211 Q ss_pred hhhhhhhhcceeeEEEcCcEEEEEeccccccc--ccccccCccccccCCCceeeecCccccCCCHH-HHHHHHHHHHHHh Q lcl|NC_015266. 77 AMFRKLRTARYLRIDVDDTGLAIGFDDRLSRI--VRVHQEGQKAPVEPGGPLAQYPVRVVLGFSSA-DRELVRDRLLRYL 153 (155) Q Consensus 77 ~l~~~~~l~~sl~~~~~~~~~~v~~~G~~~~y--AaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~~-d~~~I~~~i~~~l 153 (155) |..+++++...+.++.+|. 
...+| +..--||-.-+ +|+ ..+|+|++.=-.+ -.+++.+-|.+-| T Consensus 56 -----G~YaK~W~~kk~~e~~~V~---nk~~yqLtHLLE~GHAkr--~GG---RV~a~pHI~paee~~~~~l~~~i~~~l 122 (124) T protein:vir:95 56 -----GDYMRGWTRKRVPNGWVIH---NKTEYRLAHLLEYGHATV--DGG---RVPGTPHIRPIEDWLEKEFEDRVEKAI 122 (124) T ss_pred -----cchhccceeeeecCceeEE---EcCCCceeeeeecceecc--CCc---ccCCccchhHHHHHHHHHHHHHHHHHh Confidence 1122233333333333441 12244 44455664432 222 2589999853322 2333444444444 Q ss_pred cC Q lcl|NC_015266. 154 NR 155 (155) Q Consensus 154 ~r 155 (155) .. T Consensus 123 ~~ 124 (124) T protein:vir:95 123 KQ 124 (124) T ss_pred cC Confidence 44 No 143 >protein:vir:95157 Length: 144 # NCBI annotation: hypothetical protein ORF019 # Family: family:all:448 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293426;genbank:gi:148912847;genbank:GeneID:5228232 Probab=22.99 E-value=2.4 Score=18.53 Aligned_cols=132 Identities=12% Similarity=0.109 Sum_probs=57.2 Q ss_pred CchhHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCC-C-C---CcCcccchhhhhhccc----cccC Q lcl|NC_015266. 1 MDDDLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNP-D-G---SAYVPRKIKKGGKGLR----TKVG 71 (155) Q Consensus 1 m~~~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~P-D-G---~pW~p~k~~~~~~~~~----~~~~ 71 (155) |.+.+..+...++...+.++ ..-..++++++..+......+ +| | | ..|...-....-.... ...+ T Consensus 1 MA~~~~~f~~~i~~~~~~ve-~~~~~~~r~~a~~v~~~vv~~-----sPVDTGrfRanw~vs~~~p~~~~~~~~~~~~~~ 74 (144) T protein:vir:95 1 MAKSLLDLADRLEKKAKAID-EAASQNAVDTALAIVGDLAYK-----TPVDTSQALSNWIVTLESPSGQQIKPHFPGSQG 74 (144) T ss_pred CchhhhhhhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHh-----CCccchhhccccceecccccccccccccccccc Confidence 99977777777777777664 233457888887777766653 33 1 2 1233211100000000 0000 Q ss_pred cccchhhhhhhhhcceeeEEEc-CcEEEEEecccccccccccccCccccccCCCceeeecCccccCCCHHHHHHHHHHH- Q lcl|NC_015266. 
72 RIKRQAMFRKLRTARYLRIDVD-DTGLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSSADRELVRDRL- 149 (155) Q Consensus 72 ~~~~~~l~~~~~l~~sl~~~~~-~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~~d~~~I~~~i- 149 (155) .........++.-..+.-..+. ++.+- ...|.+||.-..||...+. ..-|.+++......+.+-+ T Consensus 75 ~t~d~sg~~tl~~~~~vi~~~~~g~~iy---i~NnlpYA~~LEyG~S~QA----------P~G~vr~~~q~~~~~v~~~~ 141 (144) T protein:vir:95 75 STQRASAAETLNSAKLVLRNKKPGQAIF---ITNNLPYIRRLNDGYSAQA----------PAGFVERAVLIGRKMRKKFK 141 (144) T ss_pred ccCCCchhHHHHHHHHHHhhcCccceEE---EeeCchhhhhhhccccCCC----------cchHHHHHHHHHHHHHHhhc Confidence 0000011111110011111111 23333 3889999999999977543 3334444433322222111 Q ss_pred -HH Q lcl|NC_015266. 150 -LR 151 (155) Q Consensus 150 -~~ 151 (155) .| T Consensus 142 ~~~ 144 (144) T protein:vir:95 142 IKD 144 (144) T ss_pred cCC Confidence 01 No 144 >protein:vir:79638 Length: 146 # NCBI annotation: gp40 # Family: family:all:448 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285529;genbank:gi:148734512;genbank:GeneID:5219996 Probab=20.42 E-value=2.8 Score=18.15 Aligned_cols=133 Identities=11% Similarity=0.058 Sum_probs=68.6 Q ss_pred Cch-hHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhcCCC--CC---CcCcccc--hhhhhhccccccCc Q lcl|NC_015266. 1 MDD-DLRALEKWAGGLLAKLAPAARRRLFRELGRDMRRAQQSRVAAQQNP--DG---SAYVPRK--IKKGGKGLRTKVGR 72 (155) Q Consensus 1 m~~-~~~~l~~~l~~ll~~L~~~~r~~l~~~Ig~~L~~~t~~Rf~~q~~P--DG---~pW~p~k--~~~~~~~~~~~~~~ 72 (155) |.| ++..+...+...++..+ ...-.++++++..+...... .+| .| ..|...- +.........+.|. 
T Consensus 1 ma~~~~~sFa~~i~~~~~~ve-~~~~~~~r~~a~~i~~~vv~-----~sPVDTGr~Ranw~vs~~~~~~~~~~~~dp~G~ 74 (146) T protein:vir:79 1 MADYSIREFHGNVDKWIEQVE-SGLNDVIQIFGEKVHGALVD-----IAPVDTGRFKANMQITANKPPLYALNQYDPDGE 74 (146) T ss_pred CCcchhHHHHHhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHH-----hCCCcchhhccccceeecCcccccccCCCCCCc Confidence 998 56666666777666653 23345788888888777765 233 12 1243211 00000000000111 Q ss_pred ccchhhhhhhhhcceeeEEEc-CcEEEEEecccccccccccccCccccccCCCceeeecCccccCCCHHHHHHHHHHHHH Q lcl|NC_015266. 73 IKRQAMFRKLRTARYLRIDVD-DTGLAIGFDDRLSRIVRVHQEGQKAPVEPGGPLAQYPVRVVLGFSSADRELVRDRLLR 151 (155) Q Consensus 73 ~~~~~l~~~~~l~~sl~~~~~-~~~~~v~~~G~~~~yAaiHqfG~~~~~~~~~~~v~iPaRp~LG~s~~d~~~I~~~i~~ 151 (155) .++..-......+...+. .+.+- .+.|.+||.-..||-..+ ....|.+++.+....|.+-... T Consensus 75 ---~t~~~~~~~i~~~~~g~~~~~~iy---i~NnlpYA~~LEyG~S~Q----------AP~G~v~~~~~~~~~~v~~a~~ 138 (146) T protein:vir:79 75 ---KIKAEGRRTLYALLHGGGAIKSIY---FSNMLIYANALEYGHSKQ----------APAGVFGIVAIRLRSYMAEAIR 138 (146) T ss_pred ---ccHHHHHHHHHHHHhcccccceeE---EeeCchhhhhhhccccCC----------CcchHHHHHHHHHHHHHHHHHH Confidence 111110000001111111 12232 389999999999996543 5667888888877777777666 Q ss_pred HhcC Q lcl|NC_015266. 152 YLNR 155 (155) Q Consensus 152 ~l~r 155 (155) =+.| T Consensus 139 e~k~ 142 (146) T protein:vir:79 139 EARK 142 (146) T ss_pred HHHh Confidence 6666 Done!