Query lcl|NC_020839.1_cdsid_YP_007673220.1 [gene=RHWG_00020] [protein=hypothetical protein] [protein_id=YP_007673220.1] [location=complement(18297..18806)] Match_columns 169 No_of_seqs 126 out of 386 Neff 7.8 Searched_HMMs 1612 Date Thu Nov 7 17:14:34 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_20 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_20_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:99833 Length: 190 100.0 7.7E-47 4.8E-50 273.4 14.3 162 1-169 3-184 (190) 2 protein:vir:79091 Length: 175 100.0 3E-44 1.8E-47 259.2 12.0 147 1-169 4-170 (175) 3 protein:vir:99196 Length: 155 100.0 2.1E-43 1.3E-46 254.6 14.0 144 1-169 4-153 (155) 4 protein:vir:1988 Length: 156 # 100.0 3.9E-43 2.4E-46 253.1 13.9 148 1-169 4-152 (156) 5 protein:vir:79225 Length: 155 100.0 6.5E-43 4E-46 251.9 13.8 144 1-169 4-153 (155) 6 protein:vir:107851 Length: 175 100.0 9.9E-43 6.1E-46 250.9 11.9 147 1-169 4-170 (175) 7 protein:vir:2026 Length: 150 # 100.0 1.2E-41 7.4E-45 244.9 13.4 143 6-169 1-150 (150) 8 protein:vir:98557 Length: 149 100.0 3.9E-41 2.4E-44 242.1 13.3 144 1-169 1-149 (149) 9 protein:vir:103841 Length: 155 100.0 3.1E-41 1.9E-44 242.6 12.3 144 1-169 4-153 (155) 10 protein:vir:6071 Length: 150 # 100.0 4.7E-41 2.9E-44 241.7 13.2 143 6-169 1-150 (150) 11 protein:vir:5703 Length: 150 # 100.0 6.7E-41 4.2E-44 240.8 13.2 143 6-169 1-150 (150) 12 protein:vir:79179 Length: 155 100.0 6.6E-41 4.1E-44 240.9 12.6 150 1-169 1-155 (155) 13 protein:vir:1838 Length: 149 # 100.0 2.5E-40 1.5E-43 237.7 12.9 144 1-169 1-149 (149) 14 protein:vir:1164 Length: 156 # 100.0 9.1E-40 5.7E-43 234.6 12.3 147 4-169 1-152 (156) 15 protein:vir:79115 Length: 148 100.0 1.1E-39 6.9E-43 234.1 12.4 143 1-169 1-148 (148) 16 protein:vir:100312 Length: 152 100.0 1.8E-39 1.1E-42 233.0 12.5 146 1-169 1-151 (152) 17 protein:vir:3163 Length: 145 # 100.0 3.3E-34 2E-37 204.2 10.5 132 1-169 1-141 (145) 18 protein:vir:78755 Length: 228 99.8 2.1E-21 1.3E-24 134.0 11.1 152 1-169 1-216 (228) 19 protein:vir:3787 Length: 231 # 99.7 3.5E-21 2.2E-24 132.7 10.7 158 1-169 2-228 (231) 20 protein:vir:3750 Length: 227 # 99.7 1.5E-19 9.2E-23 123.8 11.5 153 1-169 2-224 (227) 21 protein:vir:98860 Length: 230 99.6 2.4E-18 1.5E-21 117.2 10.3 153 1-169 4-227 (230) 22 protein:vir:274 Length: 166 # 99.4 8.5E-16 5.3E-19 103.2 8.9 142 1-169 1-148 (166) 23 protein:vir:94654 Length: 142 99.0 1.5E-11 9.4E-15 79.9 11.3 132 1-168 1-142 (142) 24 protein:vir:4906 Length: 114 # 98.7 1.9E-10 1.2E-13 73.8 10.4 112 1-169 1-114 (114) 25 protein:vir:2740 Length: 114 # 98.7 1.9E-10 1.2E-13 73.8 10.4 112 1-169 1-114 (114) 26 protein:vir:106041 Length: 137 98.6 2.6E-10 1.6E-13 73.1 8.6 119 1-169 4-132 (137) 27 protein:vir:8669 Length: 142 # 98.6 2.9E-10 1.8E-13 72.9 8.2 127 1-165 1-142 (142) 28 protein:vir:99101 Length: 142 98.6 2.9E-10 1.8E-13 72.9 8.2 127 1-165 1-142 (142) 29 protein:vir:96486 Length: 112 98.6 1.2E-09 7.2E-13 69.6 9.8 111 1-168 1-112 (112) 30 protein:vir:106570 Length: 182 98.4 5.4E-09 3.4E-12 65.9 9.6 137 1-169 1-174 (182) 31 protein:vir:105330 Length: 137 98.4 3.5E-09 2.2E-12 67.0 8.1 128 1-164 1-137 (137) 32 protein:vir:96829 Length: 135 98.4 3.4E-09 2.1E-12 67.0 8.1 126 1-164 1-135 (135) 33 protein:vir:96358 Length: 115 98.3 1E-08 6.5E-12 64.3 10.0 111 4-168 1-115 (115) 34 protein:vir:96225 Length: 115 98.3 1E-08 6.5E-12 64.3 10.0 111 4-168 1-115 (115) 35 protein:vir:78858 Length: 115 98.3 1E-08 6.5E-12 64.3 10.0 111 4-168 1-115 (115) 36 protein:vir:103917 Length: 115 98.3 1E-08 6.5E-12 64.3 10.0 111 4-168 1-115 (115) 37 protein:vir:9312 Length: 115 # 98.3 1E-08 6.5E-12 64.3 10.0 111 4-168 1-115 (115) 38 protein:vir:97144 Length: 115 98.3 1E-08 6.5E-12 64.3 10.0 111 4-168 1-115 (115) 39 protein:vir:94796 Length: 137 98.3 5.6E-09 3.5E-12 65.8 8.1 128 1-164 1-137 (137) 40 protein:vir:95789 Length: 114 98.3 1.8E-08 1.1E-11 63.0 10.2 107 2-168 1-114 (114) 41 protein:vir:3617 Length: 112 # 98.3 1.4E-08 8.5E-12 63.7 9.0 107 1-168 2-112 (112) 42 protein:vir:106506 Length: 137 98.2 1.2E-08 7.5E-12 64.0 7.5 124 1-168 1-137 (137) 43 protein:vir:96105 Length: 193 98.2 2.5E-09 1.6E-12 67.7 3.5 103 46-169 1-129 (193) 44 protein:vir:94490 Length: 137 98.2 1.5E-08 9.1E-12 63.5 7.6 128 1-164 1-137 (137) 45 protein:vir:97427 Length: 137 98.2 1.5E-08 9.1E-12 63.5 7.6 128 1-164 1-137 (137) 46 protein:vir:93738 Length: 137 98.2 1.5E-08 9.1E-12 63.5 7.6 128 1-164 1-137 (137) 47 protein:vir:95894 Length: 137 98.1 2.1E-08 1.3E-11 62.7 7.6 128 1-164 1-137 (137) 48 protein:vir:97982 Length: 140 98.1 1.6E-08 9.9E-12 63.3 6.4 120 1-162 7-140 (140) 49 protein:vir:107545 Length: 140 98.1 1.6E-08 9.9E-12 63.3 6.4 120 1-162 7-140 (140) 50 protein:vir:1273 Length: 127 # 98.1 1.1E-07 7.1E-11 58.7 10.8 111 1-168 1-127 (127) 51 protein:vir:106623 Length: 115 98.1 1.1E-07 6.9E-11 58.7 10.3 111 4-168 1-115 (115) 52 protein:vir:99744 Length: 115 98.0 1.3E-07 8.3E-11 58.3 10.5 111 4-168 1-115 (115) 53 protein:vir:107099 Length: 137 98.0 4.4E-08 2.7E-11 60.9 7.3 128 1-164 1-137 (137) 54 protein:vir:96121 Length: 137 98.0 6.5E-08 4E-11 60.0 8.1 128 1-164 1-137 (137) 55 protein:vir:105916 Length: 149 98.0 3E-08 1.8E-11 61.9 6.0 128 1-164 13-149 (149) 56 protein:vir:743 Length: 108 # 98.0 8.9E-08 5.5E-11 59.2 8.1 104 4-168 1-108 (108) 57 protein:vir:98409 Length: 108 97.9 1.4E-07 8.5E-11 58.2 8.9 104 4-168 1-108 (108) 58 protein:vir:5978 Length: 144 # 97.9 3.1E-07 2E-10 56.2 10.8 130 1-168 3-144 (144) 59 protein:vir:94108 Length: 149 97.9 4.7E-08 2.9E-11 60.8 6.0 128 1-164 13-149 (149) 60 protein:vir:100075 Length: 140 97.9 1.7E-07 1.1E-10 57.6 8.6 126 1-169 1-130 (140) 61 protein:vir:1437 Length: 140 # 97.9 2.2E-07 1.3E-10 57.1 8.9 123 1-169 1-130 (140) 62 protein:vir:4347 Length: 164 # 97.8 1.6E-07 9.7E-11 57.9 7.7 139 1-169 4-148 (164) 63 protein:vir:9930 Length: 108 # 97.8 5.7E-07 3.5E-10 54.8 10.0 104 6-169 1-108 (108) 64 protein:vir:102441 Length: 137 97.7 1.5E-07 9.1E-11 58.0 5.7 121 2-166 1-137 (137) 65 protein:vir:105089 Length: 133 97.7 6.2E-07 3.8E-10 54.6 9.2 115 1-169 1-128 (133) 66 protein:vir:80362 Length: 140 97.7 2.4E-07 1.5E-10 56.9 6.5 124 1-169 1-130 (140) 67 protein:vir:78077 Length: 141 97.6 6.5E-07 4.1E-10 54.5 8.0 132 1-169 1-139 (141) 68 protein:vir:100243 Length: 140 97.6 1.2E-06 7.5E-10 53.0 9.3 121 1-169 1-130 (140) 69 protein:vir:5745 Length: 135 # 97.6 2E-06 1.2E-09 51.8 9.8 115 1-169 2-128 (135) 70 protein:vir:94538 Length: 125 97.5 1.8E-06 1.1E-09 52.1 9.5 109 1-169 4-120 (125) 71 protein:vir:101594 Length: 173 97.5 2E-06 1.2E-09 51.8 9.5 130 4-169 1-168 (173) 72 protein:vir:99546 Length: 200 97.4 1.6E-07 1E-10 57.8 2.0 103 67-169 1-136 (200) 73 protein:vir:1891 Length: 179 # 97.3 2E-06 1.2E-09 51.9 6.7 139 1-169 4-163 (179) 74 protein:vir:1332 Length: 143 # 97.0 6E-06 3.7E-09 49.2 7.0 122 1-169 6-143 (143) 75 protein:vir:102154 Length: 119 97.0 1.4E-05 8.6E-09 47.2 8.8 108 1-168 1-119 (119) 76 protein:vir:96105 Length: 193 96.8 1.1E-05 6.5E-09 47.9 7.1 81 1-96 107-193 (193) 77 protein:vir:1386 Length: 149 # 96.8 1.7E-05 1E-08 46.8 8.0 124 1-169 1-141 (149) 78 protein:vir:6246 Length: 143 # 96.8 1.1E-05 6.9E-09 47.7 7.0 122 1-169 6-143 (143) 79 protein:vir:107757 Length: 189 96.7 1E-05 6.2E-09 48.0 6.1 91 1-100 64-189 (189) 80 protein:vir:105467 Length: 144 96.7 3.5E-05 2.2E-08 45.0 8.9 130 1-169 1-138 (144) 81 protein:vir:99546 Length: 200 96.7 1.5E-05 9.1E-09 47.1 6.8 81 1-96 114-200 (200) 82 protein:vir:105007 Length: 146 96.6 3.8E-05 2.4E-08 44.8 8.4 125 1-169 4-140 (146) 83 protein:vir:102875 Length: 146 96.6 3.8E-05 2.4E-08 44.8 8.4 125 1-169 4-140 (146) 84 protein:vir:107568 Length: 146 96.6 3.8E-05 2.4E-08 44.8 8.4 125 1-169 4-140 (146) 85 protein:vir:102085 Length: 146 96.6 3.8E-05 2.4E-08 44.8 8.4 125 1-169 4-140 (146) 86 protein:vir:93617 Length: 148 96.6 2.3E-05 1.4E-08 46.1 7.1 128 1-169 1-140 (148) 87 protein:vir:97088 Length: 157 96.5 6.3E-05 3.9E-08 43.6 9.2 132 1-169 2-155 (157) 88 protein:vir:5257 Length: 148 # 96.5 1.6E-05 1E-08 46.8 5.9 81 1-96 66-148 (148) 89 protein:vir:80037 Length: 199 96.4 2E-05 1.3E-08 46.3 6.3 84 1-98 110-199 (199) 90 protein:vir:79034 Length: 141 96.4 5.9E-05 3.6E-08 43.8 8.8 124 1-169 1-137 (141) 91 protein:vir:4704 Length: 125 # 96.4 5.2E-05 3.2E-08 44.1 8.2 112 2-169 1-122 (125) 92 protein:vir:79988 Length: 125 96.4 5.2E-05 3.2E-08 44.1 8.2 112 2-169 1-122 (125) 93 protein:vir:98342 Length: 125 96.4 5.2E-05 3.2E-08 44.1 8.2 112 2-169 1-122 (125) 94 protein:vir:9414 Length: 125 # 96.4 5.2E-05 3.2E-08 44.1 8.2 112 2-169 1-122 (125) 95 protein:vir:81106 Length: 125 96.4 5.2E-05 3.2E-08 44.1 8.2 112 2-169 1-122 (125) 96 protein:vir:3873 Length: 128 # 96.3 4.6E-05 2.9E-08 44.4 7.7 115 2-169 1-125 (128) 97 protein:vir:194 Length: 149 # 95.9 0.00019 1.2E-07 41.0 9.1 131 1-169 1-141 (149) 98 protein:vir:95062 Length: 116 95.7 6.3E-05 3.9E-08 43.6 5.7 108 11-164 1-116 (116) 99 protein:vir:102963 Length: 163 95.6 0.00024 1.5E-07 40.4 8.5 140 2-169 1-156 (163) 100 protein:vir:1243 Length: 116 # 95.5 0.0001 6.2E-08 42.5 5.9 108 11-164 1-116 (116) 101 protein:vir:97327 Length: 116 95.5 0.0001 6.2E-08 42.5 5.9 108 11-164 1-116 (116) 102 protein:vir:94069 Length: 168 93.8 0.0001 6.5E-08 42.4 2.0 78 58-169 1-98 (168) 103 protein:vir:966 Length: 123 # 93.5 0.0012 7.7E-07 36.5 7.4 114 1-169 1-123 (123) 104 protein:vir:78607 Length: 155 93.5 0.00061 3.8E-07 38.2 5.7 81 1-97 73-155 (155) 105 protein:vir:81147 Length: 126 93.4 0.0017 1E-06 35.8 7.9 114 1-169 1-124 (126) 106 protein:vir:106728 Length: 155 93.4 0.00065 4.1E-07 38.0 5.6 81 1-97 73-155 (155) 107 protein:vir:77650 Length: 155 93.0 0.00097 6E-07 37.1 5.9 81 1-97 73-155 (155) 108 protein:vir:101563 Length: 155 92.9 0.00093 5.7E-07 37.2 5.7 81 1-97 73-155 (155) 109 protein:vir:9708 Length: 125 # 92.6 0.0028 1.7E-06 34.6 7.9 112 1-169 1-125 (125) 110 protein:vir:94069 Length: 168 91.5 0.0015 9.6E-07 36.0 5.2 89 1-107 76-168 (168) 111 protein:vir:107757 Length: 189 90.6 0.0007 4.3E-07 37.9 2.5 74 67-169 1-86 (189) 112 protein:vir:5257 Length: 148 # 90.2 0.00099 6.1E-07 37.1 3.0 71 56-169 1-88 (148) 113 protein:vir:95372 Length: 124 89.4 0.0073 4.6E-06 32.3 7.1 118 1-169 1-124 (124) 114 protein:vir:95260 Length: 160 86.7 0.012 7.5E-06 31.1 6.5 92 1-119 65-160 (160) 115 protein:vir:80116 Length: 127 85.8 0.034 2.1E-05 28.6 8.5 118 1-169 1-124 (127) 116 protein:vir:99528 Length: 92 # 77.5 0.044 2.7E-05 28.0 5.9 85 1-145 1-92 (92) 117 protein:vir:9879 Length: 127 # 73.4 0.15 9.4E-05 25.1 7.7 122 6-169 1-127 (127) 118 protein:vir:103280 Length: 142 68.3 0.23 0.00014 24.1 7.5 131 1-169 1-140 (142) 119 protein:vir:107703 Length: 147 68.2 0.24 0.00015 24.0 8.3 134 1-169 1-142 (147) 120 protein:vir:78380 Length: 131 65.5 0.25 0.00016 23.9 7.1 124 2-168 1-131 (131) 121 protein:vir:4956 Length: 153 # 61.7 0.18 0.00011 24.7 5.6 120 1-169 1-132 (153) 122 protein:vir:94994 Length: 131 60.3 0.37 0.00023 22.9 7.6 124 2-168 1-131 (131) 123 protein:vir:102338 Length: 116 58.2 0.082 5.1E-05 26.5 3.1 110 18-168 1-116 (116) 124 protein:vir:4859 Length: 140 # 54.7 0.16 9.9E-05 25.0 4.1 117 1-169 1-132 (140) 125 protein:vir:100887 Length: 139 52.0 0.26 0.00016 23.8 4.7 115 4-169 1-128 (139) 126 protein:vir:104347 Length: 145 51.7 0.57 0.00036 21.9 7.9 129 1-169 8-143 (145) 127 protein:vir:5000 Length: 141 # 47.7 0.3 0.00019 23.4 4.4 119 1-169 1-132 (141) 128 protein:vir:79638 Length: 146 42.4 0.89 0.00055 20.9 8.3 133 1-169 1-142 (146) 129 protein:vir:95157 Length: 144 36.7 1.1 0.00069 20.3 5.7 131 1-142 1-144 (144) 130 protein:vir:100223 Length: 139 36.4 0.51 0.00031 22.2 3.8 115 4-169 1-128 (139) 131 protein:vir:94944 Length: 121 35.6 1.2 0.00076 20.1 6.1 114 1-157 1-121 (121) 132 protein:vir:3848 Length: 159 # 34.0 1.3 0.00082 19.9 6.5 131 1-169 2-159 (159) 133 protein:vir:4833 Length: 140 # 32.4 1.1 0.00067 20.4 4.9 121 1-169 1-132 (140) 134 protein:vir:97190 Length: 148 31.4 1.5 0.00093 19.6 6.6 135 1-169 1-146 (148) 135 protein:vir:10367 Length: 119 28.3 1.1 0.00069 20.3 4.2 93 72-169 1-117 (119) 136 protein:vir:81067 Length: 119 27.7 1.2 0.00071 20.2 4.2 93 72-169 1-117 (119) 137 protein:vir:80425 Length: 134 25.3 2.1 0.0013 18.8 6.8 125 2-169 1-133 (134) 138 protein:vir:105773 Length: 131 24.7 1.7 0.0011 19.3 4.6 125 4-169 1-131 (131) No 1 >protein:vir:99833 Length: 190 # NCBI annotation: hypothetical protein # Family: family:all:274 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164071;genbank:gi:56692603;genbank:GeneID:3192561 Probab=100.00 E-value=7.7e-47 Score=273.39 Aligned_cols=162 Identities=26% Similarity=0.368 Sum_probs=135.8 Q ss_pred CeEEEEchHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhhh Q lcl|NC_020839. 1 MFTVDVKDKELEAVFSGLEGRLSDPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLIG 80 (169) Q Consensus 1 mi~i~~~~~~l~~~L~~l~~~~~~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~~ 80 (169) -|+|++|++++.++|++|++.++|+++||++||+.|++++++||++|++|||+||+|++++|++++++++ ++ +|. T Consensus 3 ~i~i~~d~~~~~~~L~~l~~~~~~~~~l~~~ig~~l~~~~~~rf~~~~~PdG~~W~p~~~~t~~rk~~~~----~~-~L~ 77 (190) T protein:vir:99 3 GITLEWDGRRALDVLNAGSAALGDPSGLLQDIGELLLNIHRRRFQAQVSPDGTPWQPLSPAYLRRKRKNR----DK-ILT 77 (190) T ss_pred eeEEEecHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCccccHHHHHHhhcCC----Cc-cce Confidence 3467778899999999999999999999999999999999999999999999999999999987765433 22 333 Q ss_pred hhhhhhhhhhhheecCCcEEEecCcccchhhhhcccccccchhhhhhhhc--------------------cccccccCce Q lcl|NC_020839. 81 PTKTLSSPSNFAVSSGGDWARLSSRAIQSAVMQFGAKKGAFGSYQGKGFG--------------------GSSSTISIPW 140 (169) Q Consensus 81 ~~~~~~~~~si~~~~~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~~~~~--------------------~~~~~~~~~~ 140 (169) .++.+..||++.++++.|.||||++||+||||||++.+.....+..+. .....+..++ T Consensus 78 --~tg~L~~Si~~~~~~~~v~vGtn~~yA~iHq~Gg~i~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 155 (190) T protein:vir:99 78 --LDGHLRNLLRYQLDGSELLFGSDRPYAAIHHFGGTIQRQARSSTVYFRQNERTGEVGREFVPRRRSNFAQDVQIGPYT 155 (190) T ss_pred --ecHHHHHHHhheecCcEEEEecCcchhhhhhcCCcccccccchhhhhhhhhhhhhhhcccccccccccchhcccccce Confidence 455667889999999999999999999999999998876543322211 1122345668 Q ss_pred eeccCcccCCCCHHHHHHHHHHHHHhcCC Q lcl|NC_020839. 141 GDIPARPFMGISAGDQENIEAALMEWLEP 169 (169) Q Consensus 141 ~~iPaRpfLG~s~~d~~~I~~~i~~~l~p 169 (169) ++||||||||||++|+++|+++|.+||+= T Consensus 156 v~IPaRpfLG~s~~d~~~I~~~i~~~l~~ 184 (190) T protein:vir:99 156 IQMPARPWLGTSSQDDDTILQRVERYLQR 184 (190) T ss_pred eeecCcccCCCCHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999998 No 2 >protein:vir:79091 Length: 175 # NCBI annotation: gp5, phage virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111205;genbank:gi:134288802;genbank:GeneID:4960765 Probab=100.00 E-value=3e-44 Score=259.22 Aligned_cols=147 Identities=22% Similarity=0.295 Sum_probs=128.0 Q ss_pred CeEEEEchHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCcc-------- Q lcl|NC_020839. 1 MFTVDVKDKELEAVFSGLEGRLSDPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQT-------- 72 (169) Q Consensus 1 mi~i~~~~~~l~~~L~~l~~~~~~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~-------- 72 (169) ||+|++|++++.++|++|+..+.|++++|++||+.|++++++||++|++|| |+||+++|++.+.+.++. T Consensus 4 ~i~i~~d~~~~~~~L~~l~~~~~d~~~lm~~Ig~~l~~~t~~rF~~~~~Pd---W~pls~~t~~~r~~~~~~~~~~~~~~ 80 (175) T protein:vir:79 4 FVNFQIDDSALRTRLLQLEQAGHQKADAMRKITQALVLVTEDNFAAQGRPR---WQALSEATIHMRVGGKKAYKKNGELT 80 (175) T ss_pred EEEEEechHHHHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcCCCC---CCCCChHHHHhhccccccccccccch Confidence 999999999999999999999999999999999999999999999999997 999999998877655432 Q ss_pred ------ccchhhhhhhhhhhhhhhhheecCCcEEEecCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCc Q lcl|NC_020839. 73 ------VSFKPLIGPTKTLSSPSNFAVSSGGDWARLSSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPAR 146 (169) Q Consensus 73 ------~~~~~l~~~~~~~~~~~si~~~~~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaR 146 (169) ..+.++|. .++.++.||++.++++.|.||||++||+||||||++++ ...++|||| T Consensus 81 ~~~~~~~~~~~~L~--~tG~L~~Si~~~~~~~~v~vGtn~~YAaiHqfGg~~~~-----------------~~~v~IPAR 141 (175) T protein:vir:79 81 AAASRRKAGLMILQ--DSGQMAASTATDSGEDYSVIGSNKEYAAIQHFGGQAGR-----------------GLKVTIPGR 141 (175) T ss_pred hhHhhhccCCCcce--echhhhhhhhheecCCEEEEecCcchhhHhhcccccCC-----------------CcccccCcc Confidence 12223333 45566779999999999999999999999999998643 346799999 Q ss_pred ccCCCCHHHH------HHHHHHHHHhcCC Q lcl|NC_020839. 147 PFMGISAGDQ------ENIEAALMEWLEP 169 (169) Q Consensus 147 pfLG~s~~d~------~~I~~~i~~~l~p 169 (169) ||||||++|+ ++|+++|.+||+= T Consensus 142 PfLG~s~~de~~~~~~~~I~~~i~~~l~~ 170 (175) T protein:vir:79 142 AWLPVTADGELQPEAVEPVLNTILRHLMD 170 (175) T ss_pred cccCCCcccchhHHHHHHHHHHHHHHHHH Confidence 9999999995 8999999999988 No 3 >protein:vir:99196 Length: 155 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950453;genbank:gi:119953654;genbank:GeneID:4643056 Probab=100.00 E-value=2.1e-43 Score=254.56 Aligned_cols=144 Identities=23% Similarity=0.330 Sum_probs=127.8 Q ss_pred CeEEEEchHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhhh Q lcl|NC_020839. 1 MFTVDVKDKELEAVFSGLEGRLSDPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLIG 80 (169) Q Consensus 1 mi~i~~~~~~l~~~L~~l~~~~~~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~~ 80 (169) ||+|++|++++.++|.+|...+.|+++||+.||+.|++++++||+ |||+||+|+++.|++.+++.+.. ..++|. T Consensus 4 ~i~i~~d~~~~~~~L~~l~~~~~d~~~l~~~ig~~l~~~~~~rF~----pdG~~W~pls~~t~~~r~~~g~~--~~~iL~ 77 (155) T protein:vir:99 4 RIDVELDDQEVRQRLALLMRSVTDTLPVMRGIAAELLAETEFAFM----DEGPGWPQLSPVTVAAREAKGRG--PHPILQ 77 (155) T ss_pred EEEEEechHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhh----ccCCCCCCCChHHHHHHhccCCC--CCCcch Confidence 999999999999999999999999999999999999999999995 99999999999999888765432 234444 Q ss_pred hhhhhhhhhhhheecCCcEEEecCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCCH------H Q lcl|NC_020839. 81 PTKTLSSPSNFAVSSGGDWARLSSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGISA------G 154 (169) Q Consensus 81 ~~~~~~~~~si~~~~~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s~------~ 154 (169) . ++.+..||++.++++.|.||||++||+||||||++++ .+.++||||||||+|+ + T Consensus 78 ~--tg~L~~Si~~~~~~~~v~vGtn~~YA~iHqfGg~~~~-----------------~~~v~iPaRpfLG~s~~~~l~~e 138 (155) T protein:vir:99 78 V--TNALARSVTTWADRNEAGIGSNLVYAAIHQFGGDAGR-----------------GHQVEIPARRYLPFDENGQLAAG 138 (155) T ss_pred h--chhhhhhhhceecCCEEEEecCccchhhhhcccccCC-----------------CCccccCCccccCCCCccccchH Confidence 4 4556778999999999999999999999999998754 4568999999999985 7 Q ss_pred HHHHHHHHHHHhcCC Q lcl|NC_020839. 155 DQENIEAALMEWLEP 169 (169) Q Consensus 155 d~~~I~~~i~~~l~p 169 (169) |+++|.++|.+||+= T Consensus 139 ~~~~I~~~i~~~l~~ 153 (155) T protein:vir:99 139 ARQSILEIVLTALSR 153 (155) T ss_pred HHHHHHHHHHHHHhc Confidence 889999999999999 No 4 >protein:vir:1988 Length: 156 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050635;genbank:gi:9633522;genbank:GeneID:2636282 Probab=100.00 E-value=3.9e-43 Score=253.07 Aligned_cols=148 Identities=20% Similarity=0.294 Sum_probs=127.9 Q ss_pred CeEEEEchHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCCC-CCCCcccchhHHHHHhccCccccchhhh Q lcl|NC_020839. 1 MFTVDVKDKELEAVFSGLEGRLSDPSELMAQIGELLLDSTLARFQAGKAPD-GSPWAPKSSATIKAYERRKQTVSFKPLI 79 (169) Q Consensus 1 mi~i~~~~~~l~~~L~~l~~~~~~~~~l~~~Ig~~l~~~~~~rF~~q~~Pd-G~~W~pl~~~t~~~~~~~~~~~~~~~l~ 79 (169) -|+|++|++++.++|++|.... +.+++|++||+.|++++++||++|++|| |+||+|++++|++++.+.+.. .++ +| T Consensus 4 ~i~~~~d~~~l~~~L~~l~~~~-~~~~l~~~Ig~~l~~~~~~rf~~~~~Pd~G~~W~pls~~t~~~r~~~~~~-~~~-~L 80 (156) T protein:vir:19 4 DMNVAVDVRRIQLALDELGTVT-RDRAIPRVMAAALLSSTEQAFERQADPDTGKGWEAWSDSWLAWRQDHGFV-PGS-IL 80 (156) T ss_pred EEEEeecHHHHHHHHHHHHhhh-ccHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCCcccChHHHHHhhccCCC-CCc-ch Confidence 4589999999999999997655 4569999999999999999999999998 999999999999887665432 223 33 Q ss_pred hhhhhhhhhhhhheecCCcEEEecCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCCHHHHHHH Q lcl|NC_020839. 80 GPTKTLSSPSNFAVSSGGDWARLSSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGISAGDQENI 159 (169) Q Consensus 80 ~~~~~~~~~~si~~~~~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s~~d~~~I 159 (169) ..++.+..||++.++++.|+||||++||++||||+++++ ..+.++||||||||+|++|+++| T Consensus 81 --~~tg~L~~Si~~~~~~~~v~vGt~~~yA~vHqfG~~~~~----------------~~~~~~iPaRpfLG~s~~d~~~I 142 (156) T protein:vir:19 81 --TLHGDLARSITTDYGQDYALIGSPKIYAAIHQWGGTPDM----------------APRPAGVPARPYMGLDKTGEQEI 142 (156) T ss_pred --hhhHHHHHHhhheecCCEEEEecchhhhHHhhcCccccc----------------CCCccccCCccccCCCHHHHHHH Confidence 345666778999999999999999999999999998754 23457899999999999999999 Q ss_pred HHHHHHhcCC Q lcl|NC_020839. 160 EAALMEWLEP 169 (169) Q Consensus 160 ~~~i~~~l~p 169 (169) .++|.+||+= T Consensus 143 ~~~i~~~l~~ 152 (156) T protein:vir:19 143 FDAIRKRVSA 152 (156) T ss_pred HHHHHHHHHH Confidence 9999999999 No 5 >protein:vir:79225 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469157;genbank:gi:157835000;genbank:GeneID:5648806 Probab=100.00 E-value=6.5e-43 Score=251.87 Aligned_cols=144 Identities=24% Similarity=0.340 Sum_probs=127.1 Q ss_pred CeEEEEchHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhhh Q lcl|NC_020839. 1 MFTVDVKDKELEAVFSGLEGRLSDPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLIG 80 (169) Q Consensus 1 mi~i~~~~~~l~~~L~~l~~~~~~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~~ 80 (169) ||+|++|++++.++|.+|...+.|++++|+.||+.|++++++||+ |||+||+|+++.|++++++.+.. ..++|. T Consensus 4 ~i~i~~d~~~~~~~L~~l~~~~~d~~~l~~~ig~~l~~~~~~rF~----~eG~~W~pls~~t~~~r~~~g~~--~~~iL~ 77 (155) T protein:vir:79 4 RIDVELDDQEVRQRLAVLMRSVTDTLPVMRGIAAELLAETEFAFM----DEGPGWPQLSPATVAAREAKGRG--PHPILQ 77 (155) T ss_pred EEEEEechHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhh----ccCCCCCCCCHHHHHHHhccCCC--CCCccc Confidence 999999999999999999999999999999999999999999995 88999999999999888765542 234444 Q ss_pred hhhhhhhhhhhheecCCcEEEecCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCCH------H Q lcl|NC_020839. 81 PTKTLSSPSNFAVSSGGDWARLSSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGISA------G 154 (169) Q Consensus 81 ~~~~~~~~~si~~~~~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s~------~ 154 (169) . ++.+..||++.++++.|.||||++||+||||||++++ .+.++||||||||+|+ + T Consensus 78 ~--tG~L~~Si~~~~~~~~v~vGt~~~YA~iHqfGg~~~~-----------------~~~v~iPaRpfLG~s~~~~l~~~ 138 (155) T protein:vir:79 78 V--TNALARSVTTWADRNEAGIGSNLVYAAIHQFGGDAGR-----------------GHQVEIPARRYLPFDENGQLAAG 138 (155) T ss_pred c--chhhhhhhhceecCCEEEEecCchhhhhhhcccccCC-----------------CCccccCCccccCCCCccccchH Confidence 4 4566778999999999999999999999999998754 3467999999999985 5 Q ss_pred HHHHHHHHHHHhcCC Q lcl|NC_020839. 155 DQENIEAALMEWLEP 169 (169) Q Consensus 155 d~~~I~~~i~~~l~p 169 (169) |+++|.++|.+||+= T Consensus 139 ~~~~I~~~i~~~l~r 153 (155) T protein:vir:79 139 ARQSILEVVLTALSR 153 (155) T ss_pred HHHHHHHHHHHHHHh Confidence 679999999999987 No 6 >protein:vir:107851 Length: 175 # NCBI annotation: gp31 # Family: family:all:274 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024704;genbank:gi:48696941;genbank:GeneID:2845939 Probab=100.00 E-value=9.9e-43 Score=250.88 Aligned_cols=147 Identities=18% Similarity=0.211 Sum_probs=121.4 Q ss_pred CeEEEEchHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccC---------- Q lcl|NC_020839. 1 MFTVDVKDKELEAVFSGLEGRLSDPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRK---------- 70 (169) Q Consensus 1 mi~i~~~~~~l~~~L~~l~~~~~~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~---------- 70 (169) ||+|++|++++.++|++|+..+.|+++||++||+.|++++++||++|++|||+||.|+... .+.+.+ T Consensus 4 ~i~i~~~~~~l~~~L~~l~~~~~d~~~l~~~Ig~~l~~~t~~rF~~e~~Pdw~p~~p~t~~---~r~~~g~~~~k~~~~~ 80 (175) T protein:vir:10 4 FVNFQIDDSALRTRLLQLEQAGHQKAGAMRKIAQALVLVTEDNFAAQGRPRWQALSEATIH---MRVGGKKAYKKNGELT 80 (175) T ss_pred eEEEEecHHHHHHHHHHHHHHhccHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchhhhh---hhhcccccchhhhhhh Confidence 9999999999999999999999999999999999999999999999999997777776543 222111 Q ss_pred ----ccccchhhhhhhhhhhhhhhhheecCCcEEEecCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCc Q lcl|NC_020839. 71 ----QTVSFKPLIGPTKTLSSPSNFAVSSGGDWARLSSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPAR 146 (169) Q Consensus 71 ----~~~~~~~l~~~~~~~~~~~si~~~~~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaR 146 (169) ....+.++|. .++.+..||++.+++++|.||||++||+|||||+++++ .+.++|||| T Consensus 81 ~~~~~~~~~~~~L~--~tG~L~~Si~~~~~~~~v~vGtn~~YAaiHqfGg~~~~-----------------~~~v~iPaR 141 (175) T protein:vir:10 81 AAASRRKAGLMILQ--DSGQMAASVSTDHDDNSAVIGSNKEYAAIHQFGGQAGR-----------------GLKVTIPAR 141 (175) T ss_pred hhhhhhccCCCcce--echhhhhhhheeecCCEEEEecChhhhhhhhcccccCC-----------------CCccccCCc Confidence 1122333444 44566778999999999999999999999999998643 446799999 Q ss_pred ccCCCCHHHH------HHHHHHHHHhcCC Q lcl|NC_020839. 147 PFMGISAGDQ------ENIEAALMEWLEP 169 (169) Q Consensus 147 pfLG~s~~d~------~~I~~~i~~~l~p 169 (169) ||||||++|+ ++|++++.+||.= T Consensus 142 pfLG~s~~d~~~~e~~~~Il~~~~~~l~~ 170 (175) T protein:vir:10 142 PWLPVTADGELQPEAVEPVLNTILRHLMD 170 (175) T ss_pred cccCCCcccccchHHHHHHHHHHHHHHHH Confidence 9999998775 8899999999866 No 7 >protein:vir:2026 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046769;genbank:gi:9630340;genbank:GeneID:1261511 Probab=100.00 E-value=1.2e-41 Score=244.95 Aligned_cols=143 Identities=27% Similarity=0.416 Sum_probs=123.7 Q ss_pred Ech-HHHHHHHHHHHHHhh--hHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhhhhh Q lcl|NC_020839. 6 VKD-KELEAVFSGLEGRLS--DPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLIGPT 82 (169) Q Consensus 6 ~~~-~~l~~~L~~l~~~~~--~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~~~~ 82 (169) +|+ ++++.+|..|++.+. +.++||++||+.|+.++++||++|++|||+||+|+++.|++.+.+++. ++| . T Consensus 1 ~~~~~~l~~~L~~ll~~l~~~~~~~l~~~Ig~~l~~~~~~rf~~q~~PdG~~W~p~k~~~~~~k~g~~~----~~l---~ 73 (150) T protein:vir:20 1 MNEFKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSVRKKTGRVK----RKM---F 73 (150) T ss_pred CchHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHHHhccCCC----ccc---c Confidence 555 789999999998876 678999999999999999999999999999999999999877765432 233 3 Q ss_pred hhhhhhhhhheecCCcEEEe----cCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCCHHHHHH Q lcl|NC_020839. 83 KTLSSPSNFAVSSGGDWARL----SSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGISAGDQEN 158 (169) Q Consensus 83 ~~~~~~~si~~~~~~~~v~v----Gt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s~~d~~~ 158 (169) .++.++.||.+.++++.++| |+|.+||++||||+++.+ ....++++||||||||||++|+++ T Consensus 74 ~~~~l~~sl~~~~~~~~~~vg~~~Gs~~~yAa~HQfG~~~~~--------------~~~~~~~~iPaRp~LG~s~~d~~~ 139 (150) T protein:vir:20 74 AKLITSRFLHIRASPEQASMEFYGGKSPKIASVHQFGLSEEN--------------RKDGKKIDYPARPLLGFTGEDVQM 139 (150) T ss_pred chhhhhhhhheeecCcEEEEEeeCCcchhhhhhhhccccccc--------------ccCCCceeccccccCCCCHHHHHH Confidence 45667788999999999887 999999999999998643 234578999999999999999999 Q ss_pred HHHHHHHhcCC Q lcl|NC_020839. 159 IEAALMEWLEP 169 (169) Q Consensus 159 I~~~i~~~l~p 169 (169) |.++|.+||.= T Consensus 140 i~~~i~~~l~k 150 (150) T protein:vir:20 140 IEEIILAHLER 150 (150) T ss_pred HHHHHHHHHhC Confidence 99999999999 No 8 >protein:vir:98557 Length: 149 # NCBI annotation: gp14 # Family: family:all:370 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958069;genbank:gi:41057366;genbank:GeneID:2744228 Probab=100.00 E-value=3.9e-41 Score=242.15 Aligned_cols=144 Identities=22% Similarity=0.321 Sum_probs=122.4 Q ss_pred CeEEEEchHHHHHHHHHHHHHhh--hHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhh Q lcl|NC_020839. 1 MFTVDVKDKELEAVFSGLEGRLS--DPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPL 78 (169) Q Consensus 1 mi~i~~~~~~l~~~L~~l~~~~~--~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l 78 (169) |- |.++|+.+|+.|++.+. +.++||++||+.|+.++++||++|++|||+||+|+++.|++.+.+.. .++| T Consensus 1 m~----d~~~l~~~L~~ll~~L~~~~~~~ll~~Ig~~l~~~t~~rf~~q~~PdG~~W~p~~~~~~~~k~~~~----~~~l 72 (149) T protein:vir:98 1 MS----ELTALQERLTGLIASLSPAARRQMAADIAKKLRASQQQRIRRQQAPDGTPYAARKRQSVRSKKGRI----RREM 72 (149) T ss_pred Cc----hHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHHhccCCC----Cccc Confidence 33 34689999999988884 67899999999999999999999999999999999999987665432 2344 Q ss_pred hhhhhhhhhhhhhheecCCcEEEe---cCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCCHHH Q lcl|NC_020839. 79 IGPTKTLSSPSNFAVSSGGDWARL---SSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGISAGD 155 (169) Q Consensus 79 ~~~~~~~~~~~si~~~~~~~~v~v---Gt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s~~d 155 (169) +. .++++.|+++.++++.|.| |+|.+||++||||+++.+. ...++++||||||||||++| T Consensus 73 ~~---~g~l~~sl~~~~~~~~~~V~~~Gs~~~yAa~HQfG~~~r~~--------------~~~~~~~iPaRp~LG~s~~d 135 (149) T protein:vir:98 73 FA---RLRTNRFMKAKGSDSAAVVEFTGRVQRMARVHQYGLKDRPN--------------RHSRDVQYAARPLLGFTRDD 135 (149) T ss_pred ch---hhhhhhhhhheecCCeeEEEecCcchHHhhHhhcccccccc--------------CCCcceeccccccCCCCHHH Confidence 43 4566778888889999888 9999999999999986432 23568999999999999999 Q ss_pred HHHHHHHHHHhcCC Q lcl|NC_020839. 156 QENIEAALMEWLEP 169 (169) Q Consensus 156 ~~~I~~~i~~~l~p 169 (169) +++|+++|.+||.= T Consensus 136 ~~~i~~~i~~~l~~ 149 (149) T protein:vir:98 136 EQMIEDIIIRHLGK 149 (149) T ss_pred HHHHHHHHHHHhhC Confidence 99999999999999 No 9 >protein:vir:103841 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938236;genbank:gi:38229141;genbank:GeneID:2648156 Probab=100.00 E-value=3.1e-41 Score=242.64 Aligned_cols=144 Identities=23% Similarity=0.349 Sum_probs=122.4 Q ss_pred CeEEEEchHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhhh Q lcl|NC_020839. 1 MFTVDVKDKELEAVFSGLEGRLSDPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLIG 80 (169) Q Consensus 1 mi~i~~~~~~l~~~L~~l~~~~~~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~~ 80 (169) =|+|++|++++.++|++|...+.|++++|++||+.|++++++||+ |||+||+|++|.|++++.++++. ..++|. T Consensus 4 ~i~i~~~~~~~~~~L~~l~~~~~~~~~l~~~ig~~l~~~~~~rF~----p~G~~W~plsp~t~~~r~k~g~~--~~~~L~ 77 (155) T protein:vir:10 4 RIELELVDREVQERLAALYAAVTDTLPLMRGIAAELLAETEFAFM----DEGPGWPQLSPVTVAARAAKGRG--AHPILQ 77 (155) T ss_pred eEEEEechHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHh----hcCCCCCCCCccchHHHHhccCC--CCCccc Confidence 478999999999999999999999999999999999999999995 89999999999999887665432 334444 Q ss_pred hhhhhhhhhhhheecCCcEEEecCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCCHHHH---- Q lcl|NC_020839. 81 PTKTLSSPSNFAVSSGGDWARLSSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGISAGDQ---- 156 (169) Q Consensus 81 ~~~~~~~~~si~~~~~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s~~d~---- 156 (169) .+ +.+..||++.++++.|.||||++||+||||||++++ .+.++||||||||+|++|+ T Consensus 78 ~t--G~L~~Si~~~~~~~~v~vGtn~~YA~iHqfGg~~~~-----------------~~~~~iPARPfLG~s~~~e~~~e 138 (155) T protein:vir:10 78 VT--NALARSITTRADRDQAQIGSNLSYAAIQQLGGQAGR-----------------GRKVTIPARPYLPVLRNGQLKPS 138 (155) T ss_pred cc--hhhhhhhhceecCCEEEEecCcchhhhhhcccccCC-----------------CCccccCCccccCCCccccchHH Confidence 44 455678999999999999999999999999998754 3457999999999987664 Q ss_pred --HHHHHHHHHhcCC Q lcl|NC_020839. 157 --ENIEAALMEWLEP 169 (169) Q Consensus 157 --~~I~~~i~~~l~p 169 (169) +.|.++|.+||+= T Consensus 139 i~~~I~~~i~~~l~~ 153 (155) T protein:vir:10 139 ARDAVLDVLLAALSQ 153 (155) T ss_pred HHHHHHHHHHHHHhh Confidence 7777777777766 No 10 >protein:vir:6071 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878212;genbank:gi:33438911;genbank:GeneID:1457746 Probab=100.00 E-value=4.7e-41 Score=241.70 Aligned_cols=143 Identities=26% Similarity=0.413 Sum_probs=122.0 Q ss_pred Ech-HHHHHHHHHHHHHhh--hHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhhhhh Q lcl|NC_020839. 6 VKD-KELEAVFSGLEGRLS--DPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLIGPT 82 (169) Q Consensus 6 ~~~-~~l~~~L~~l~~~~~--~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~~~~ 82 (169) .|+ ++++.+|..+...+. +.++||++||+.|++++++||++|++|||+||+|+++.|++.+.+++. ++|+ T Consensus 1 ~~~~~~l~~~L~~~l~~L~~~~~~~l~r~Ig~~l~~~~~~Rf~~q~~PdG~~W~p~~~~~~~~k~~~~~----~~l~--- 73 (150) T protein:vir:60 1 MNEFKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSARKKTGRVK----RKMF--- 73 (150) T ss_pred CchHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccChHHHHHhhcCCC----ccch--- Confidence 555 789999999888875 568899999999999999999999999999999999999877765432 2333 Q ss_pred hhhhhhhhhheecCCcEEEe----cCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCCHHHHHH Q lcl|NC_020839. 83 KTLSSPSNFAVSSGGDWARL----SSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGISAGDQEN 158 (169) Q Consensus 83 ~~~~~~~si~~~~~~~~v~v----Gt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s~~d~~~ 158 (169) ..+.++.|+++.++++.++| |+|.+||++||||+++.. ....++++||||||||||++|+++ T Consensus 74 ~~~~l~~sl~~~~~~~~a~vg~~~Gt~~~yAaiHQfG~~~~~--------------~~~~~~~~iPaRp~LG~s~~d~~~ 139 (150) T protein:vir:60 74 AKLITSRFLHIRASPEQASMEFYGGKSPKIASVHQFGLSEEN--------------RKDGKKIDYPARPLLGFTGEDVQM 139 (150) T ss_pred hhhhhcceeeeeeeCcEEEEEeeCCCchhhhhhhhccccccc--------------cCCCCceecCCcccCCCCHHHHHH Confidence 34566778888888888887 999999999999998643 224568999999999999999999 Q ss_pred HHHHHHHhcCC Q lcl|NC_020839. 159 IEAALMEWLEP 169 (169) Q Consensus 159 I~~~i~~~l~p 169 (169) |+++|.+||.= T Consensus 140 i~~~i~~~l~r 150 (150) T protein:vir:60 140 IEEIILAHLDR 150 (150) T ss_pred HHHHHHHHHhC Confidence 99999999999 No 11 >protein:vir:5703 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839862;genbank:gi:30065717;genbank:GeneID:1260611 Probab=100.00 E-value=6.7e-41 Score=240.83 Aligned_cols=143 Identities=26% Similarity=0.408 Sum_probs=121.6 Q ss_pred Ech-HHHHHHHHHHHHHhh--hHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhhhhh Q lcl|NC_020839. 6 VKD-KELEAVFSGLEGRLS--DPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLIGPT 82 (169) Q Consensus 6 ~~~-~~l~~~L~~l~~~~~--~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~~~~ 82 (169) .|+ ++++++|..++..+. +.++||++||+.|+.++++||++|++|||+||+|+|+.|++.+.+++. ++|+ T Consensus 1 m~~~~~l~~~L~~~l~~L~~~~~~~l~~~Ig~~l~~~~~~rf~~q~~PdG~~W~p~k~~~~~~k~~~~~----~~l~--- 73 (150) T protein:vir:57 1 MNEFKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSARKKTGRVK----RKMF--- 73 (150) T ss_pred CchHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccChHHHHHhccCCC----cccc--- Confidence 444 789999999988875 568899999999999999999999999999999999999877765432 2333 Q ss_pred hhhhhhhhhheecCCcEEEe----cCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCCHHHHHH Q lcl|NC_020839. 83 KTLSSPSNFAVSSGGDWARL----SSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGISAGDQEN 158 (169) Q Consensus 83 ~~~~~~~si~~~~~~~~v~v----Gt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s~~d~~~ 158 (169) ..+.++.|+.+.++++.+.| |+|.+||++||||+++.+. ...++++||||||||||++|+++ T Consensus 74 ~~~~l~~sl~~~~~~~~a~vg~~~G~~~~yAaiHQfG~~~r~~--------------~~~~~~~iPaRp~LG~s~~d~~~ 139 (150) T protein:vir:57 74 AKLITSRFLHIRASPEQASMEFYGGKSPKIASVHQFGLSEETR--------------KDGKKIDYPARPLLGFTGEDVQM 139 (150) T ss_pred hhhhhccceeeeeeCcEEEEEeecCCchhhhhhhhcccccccc--------------CCCceeecCCcccCCCCHHHHHH Confidence 34566678888888888877 9999999999999986432 24568899999999999999999 Q ss_pred HHHHHHHhcCC Q lcl|NC_020839. 159 IEAALMEWLEP 169 (169) Q Consensus 159 I~~~i~~~l~p 169 (169) |+++|.+||.= T Consensus 140 i~~~i~~~l~r 150 (150) T protein:vir:57 140 IEEIILAHLDR 150 (150) T ss_pred HHHHHHHHHhC Confidence 99999999999 No 12 >protein:vir:79179 Length: 155 # NCBI annotation: gp39, phage virion morphogenesis protein # Family: family:all:370 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111070;genbank:gi:134288746;genbank:GeneID:4960698 Probab=100.00 E-value=6.6e-41 Score=240.85 Aligned_cols=150 Identities=22% Similarity=0.248 Sum_probs=124.0 Q ss_pred CeEEEEchHHHHHHHHHHHHHhh--hHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhh Q lcl|NC_020839. 1 MFTVDVKDKELEAVFSGLEGRLS--DPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPL 78 (169) Q Consensus 1 mi~i~~~~~~l~~~L~~l~~~~~--~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l 78 (169) |- -|.++|+.+|+.|++.+. ++++||+.||+.|+.++++||++|++|||+||+|+|+.+...+.+.+++.....+ T Consensus 1 m~---~~~~~l~~~l~~ll~~l~~~~~~~l~r~Ig~~l~~~t~~Rf~~q~~PDG~~W~prk~~~~~~~~~~~~g~~~~~~ 77 (155) T protein:vir:79 1 MT---DDLQALERWAGGLLAKLSPAARRQLLRELGRDLRRAQQSRVAAQRNPDGSAYEPRKVKAGGKRLREKAGRVKREA 77 (155) T ss_pred Cc---hHHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchhhhhhhhhcccCcccchh Confidence 43 256899999999999885 6788999999999999999999999999999999999886555443333222221 Q ss_pred hhhhhhhhhhhhhheecCCcEEEe---cCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCCHHH Q lcl|NC_020839. 79 IGPTKTLSSPSNFAVSSGGDWARL---SSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGISAGD 155 (169) Q Consensus 79 ~~~~~~~~~~~si~~~~~~~~v~v---Gt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s~~d 155 (169) +...++++.++.+.++++.|.| |||.+||+|||||+++.+. ...++++||||||||||++| T Consensus 78 --m~~~l~~a~~l~~~~~~d~a~Vg~~Gs~~~yAaiHQfG~~~r~~--------------~~~~~v~iPaRp~LGls~~d 141 (155) T protein:vir:79 78 --MFRKLRTARYLRIDVDSTGLAIGFDERLSRIARVHQEGQKAPVE--------------PGGPLAQYPVRVVLGFSDAD 141 (155) T ss_pred --hhhhhhhhheeeeeecCcEEEEEecCcchhhhhhhhcCCcccCC--------------CCCcccccccccccCCCHHH Confidence 2234666678888999999999 9999999999999986432 24678999999999999999 Q ss_pred HHHHHHHHHHhcCC Q lcl|NC_020839. 156 QENIEAALMEWLEP 169 (169) Q Consensus 156 ~~~I~~~i~~~l~p 169 (169) +++|+++|.+||.= T Consensus 142 ~~~I~~~i~~~l~r 155 (155) T protein:vir:79 142 RELVRDRLLRELTR 155 (155) T ss_pred HHHHHHHHHHHhhC Confidence 99999999999999 No 13 >protein:vir:1838 Length: 149 # NCBI annotation: O protein # Family: family:all:370 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052262;genbank:gi:9634069;genbank:GeneID:1262457 Probab=100.00 E-value=2.5e-40 Score=237.73 Aligned_cols=144 Identities=22% Similarity=0.336 Sum_probs=116.4 Q ss_pred CeEEEEchHHHHHHHHHHHHHhh--hHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhh Q lcl|NC_020839. 1 MFTVDVKDKELEAVFSGLEGRLS--DPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPL 78 (169) Q Consensus 1 mi~i~~~~~~l~~~L~~l~~~~~--~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l 78 (169) |-+| +++.++|+.|++.+. +.++||++||+.|+.++++||++|++|||+||+|+++.|++.+.+. ..++| T Consensus 1 m~~~----~~~~~~l~~ll~~L~~~~~~~l~r~Ig~~l~~~t~~rf~~q~~PdG~~W~p~~~~~~~~~~g~----~~~~~ 72 (149) T protein:vir:18 1 MSEL----TALQERLAGLIASLSPAARRKMAAEIAKKLRTSQQQRIKRQQAPDGTPYAARKRQPVRSKKGR----IKREM 72 (149) T ss_pred CchH----HHHHHHHHHHHHhcCCchHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchhhhhhccCc----ccchh Confidence 5543 677778888777774 4678999999999999999999999999999999999997654332 23344 Q ss_pred hhhhhhhhhhhhhheecCCcEEEe---cCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCCHHH Q lcl|NC_020839. 79 IGPTKTLSSPSNFAVSSGGDWARL---SSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGISAGD 155 (169) Q Consensus 79 ~~~~~~~~~~~si~~~~~~~~v~v---Gt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s~~d 155 (169) +. .++++.++.+.++++.+.| |||.+||++||||+++... ...++++||||||||||++| T Consensus 73 ~~---~l~~~~~l~~~~~~~~~~v~~~Gtn~~yAaiHQfG~~~r~~--------------~~~~~v~iPaRp~LG~s~~d 135 (149) T protein:vir:18 73 FA---KLRTSRFMKAKGSDSAAVVEFTGKVQRMARVHQYGLKDRPN--------------RNSRDVQYEARPLLGFTRDD 135 (149) T ss_pred hh---hhhhhhhhheeecCceeEEEecccchhhhhhhhcccccccc--------------CCCccccccccccCCCCHHH Confidence 43 3555566666666666665 9999999999999986532 23578999999999999999 Q ss_pred HHHHHHHHHHhcCC Q lcl|NC_020839. 156 QENIEAALMEWLEP 169 (169) Q Consensus 156 ~~~I~~~i~~~l~p 169 (169) +++|+++|.+||.= T Consensus 136 ~~~I~~~i~~~l~~ 149 (149) T protein:vir:18 136 EQMIEDVIISHLGK 149 (149) T ss_pred HHHHHHHHHHHHhC Confidence 99999999999999 No 14 >protein:vir:1164 Length: 156 # NCBI annotation: predicted tail completion # Family: family:all:370 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490613;genbank:gi:17313233;genbank:GeneID:927308 Probab=100.00 E-value=9.1e-40 Score=234.62 Aligned_cols=147 Identities=18% Similarity=0.253 Sum_probs=122.3 Q ss_pred EEEchHHHHHHHHHHHHHhh--hHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhhhh Q lcl|NC_020839. 4 VDVKDKELEAVFSGLEGRLS--DPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLIGP 81 (169) Q Consensus 4 i~~~~~~l~~~L~~l~~~~~--~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~~~ 81 (169) .+=+.++++++|..|++.+. +.+.||++||+.|+.++++||++|++|||+||+|+++.|++.+.++.. ....|+ T Consensus 1 m~~~~~~l~~~L~~ll~~L~~~~~~~l~r~Ig~~l~~~t~~Rf~~q~~PdG~~W~p~~~~~~~~~~~~~~--~~~~m~-- 76 (156) T protein:vir:11 1 MADSLEALEDWAGPILRALEPGPRAALARSLARDLRRSQQKRVMAQRNPDGSAYEPRKKRELRGKQGRIR--RKIKMF-- 76 (156) T ss_pred CchhHHHHHHHHHHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHhhhccccc--cchhhh-- Confidence 33356889999999998886 457899999999999999999999999999999999999877654332 223343 Q ss_pred hhhhhhhhhhheecCCcEEEe---cCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCCHHHHHH Q lcl|NC_020839. 82 TKTLSSPSNFAVSSGGDWARL---SSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGISAGDQEN 158 (169) Q Consensus 82 ~~~~~~~~si~~~~~~~~v~v---Gt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s~~d~~~ 158 (169) ..++++.++.+.++++.+.| |+|.+||++||||+++.+ ....++++||||||||||++|+++ T Consensus 77 -~~l~~~~~l~~~~~~~~a~vg~~Gs~~~yA~iHQfG~~~~~--------------~~~~~~v~iPaRp~LG~s~~d~~~ 141 (156) T protein:vir:11 77 -QKLRTVRYLRAKGDAQAITVSFAGRIARIARVHQYGLRDRA--------------EPGAPEVSYAQRLLLGFDSSDMET 141 (156) T ss_pred -hhhhhhheeeeeecCcEEEEEecCCchhhhhhhcccccccc--------------cCCCCcccccccccCCCCHHHHHH Confidence 33555667888888999988 999999999999998643 224568899999999999999999 Q ss_pred HHHHHHHhcCC Q lcl|NC_020839. 159 IEAALMEWLEP 169 (169) Q Consensus 159 I~~~i~~~l~p 169 (169) |+++|.+||+= T Consensus 142 i~~~i~~~l~~ 152 (156) T protein:vir:11 142 IQNGILAHIDA 152 (156) T ss_pred HHHHHHHHHhh Confidence 99999999986 No 15 >protein:vir:79115 Length: 148 # NCBI annotation: tail completion protein gpS # Family: family:all:370 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165266;genbank:gi:145708091;genbank:GeneID:5247126 Probab=100.00 E-value=1.1e-39 Score=234.13 Aligned_cols=143 Identities=23% Similarity=0.312 Sum_probs=116.7 Q ss_pred CeEEEEchHHHHHHHHHHHHHhhh--HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhh Q lcl|NC_020839. 1 MFTVDVKDKELEAVFSGLEGRLSD--PSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPL 78 (169) Q Consensus 1 mi~i~~~~~~l~~~L~~l~~~~~~--~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l 78 (169) |-+ .++|+++|..|++.+++ .++||++||+.|++++++||++|++|||+||+|+|+.+.+.+.+. .++| T Consensus 1 m~~----~~~l~~~L~~ll~~l~~~~~~~l~r~Ig~~l~~st~~Rf~~q~~PDG~~W~p~s~~~~~~~g~~-----~~~~ 71 (148) T protein:vir:79 1 MSE----SRELEAWLAGMLTKLDAPARRMLARAVAAELRRRQAARIAEQRNPDGSPYVPRKPQLRHRAGRI-----RRAM 71 (148) T ss_pred Ccc----HHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCcCcccchHHHhhcccc-----cccc Confidence 544 37899999999998864 579999999999999999999999999999999999875433111 1223 Q ss_pred hhhhhhhhhhhhhheecCCcEEEe---cCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCCHHH Q lcl|NC_020839. 79 IGPTKTLSSPSNFAVSSGGDWARL---SSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGISAGD 155 (169) Q Consensus 79 ~~~~~~~~~~~si~~~~~~~~v~v---Gt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s~~d 155 (169) . ..++++.++.+.++++.+.| |||++||++||||+++... ...++++||||||||||++| T Consensus 72 ~---~~l~~~~~l~~~~~~~~~~v~~~Gt~~~yAaiHQfG~~~r~~--------------~~~~~v~iPaRp~LG~s~~d 134 (148) T protein:vir:79 72 F---MRLRLARYMKTQADANTAVVTFAGNAQRIATVHQFGLRDRVN--------------KAGLTAQYPARELLGMDGVD 134 (148) T ss_pred c---chhhhhhheeeeeeCCeeeEEeeccchhhhhhhhcCcccccc--------------CCCCccccCcccccCCCHHH Confidence 2 34455567777777777777 9999999999999986432 23568899999999999999 Q ss_pred HHHHHHHHHHhcCC Q lcl|NC_020839. 156 QENIEAALMEWLEP 169 (169) Q Consensus 156 ~~~I~~~i~~~l~p 169 (169) +++|+++|.+||.= T Consensus 135 ~~~i~~~i~~~l~~ 148 (148) T protein:vir:79 135 MEHITNLLLLHLGA 148 (148) T ss_pred HHHHHHHHHHHhcC Confidence 99999999999999 No 16 >protein:vir:100312 Length: 152 # NCBI annotation: tail synthesis protein S # Family: family:all:370 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655481;genbank:gi:109289949;genbank:GeneID:4157355 Probab=100.00 E-value=1.8e-39 Score=232.96 Aligned_cols=146 Identities=18% Similarity=0.241 Sum_probs=118.3 Q ss_pred CeEEEEchHHHHHHHHHHHHHhh--hHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhh Q lcl|NC_020839. 1 MFTVDVKDKELEAVFSGLEGRLS--DPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPL 78 (169) Q Consensus 1 mi~i~~~~~~l~~~L~~l~~~~~--~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l 78 (169) |=+ +.+++..+|+.|++.+. +.+.||++||+.|+.++++||++|++|||+||+|+++.+...+.. ...+.| T Consensus 1 M~~---~~~~~~~~L~~ll~~L~~~~r~~l~~~Ig~~l~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~k~~----~~~~~m 73 (152) T protein:vir:10 1 MSE---PIEQVKTAFDSLLNNISKPRRRLMYQQIGRELARSQRRRIKAQQNPDGSAYEPRKKPKKGVKSK----IKSGKM 73 (152) T ss_pred Cch---HHHHHHHHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchhhhhhhhhccc----ccchhH Confidence 544 56888999999988886 567899999999999999999999999999999999876433322 223344 Q ss_pred hhhhhhhhhhhhhheecCCcEEEe---cCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCCHHH Q lcl|NC_020839. 79 IGPTKTLSSPSNFAVSSGGDWARL---SSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGISAGD 155 (169) Q Consensus 79 ~~~~~~~~~~~si~~~~~~~~v~v---Gt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s~~d 155 (169) +. .++.+.++.+.++++.++| |+|++||++||||+++.... .....++||||||||||++| T Consensus 74 ~~---~L~~a~~l~~~a~~~~~~Vg~~Gt~~~yAaiHQfG~~~r~~~-------------~~~~~v~iPaRp~LG~s~~d 137 (152) T protein:vir:10 74 FD---KITQPRFMRLRLESEGVSLGYEGGDAVIARIHQQGLIGRVRK-------------DWDLKVKYASRELLGFTDDD 137 (152) T ss_pred HH---hhhhcceeeeeecCcEEEEEecCCchhhhhhhccCccccccC-------------CCCcceeccccccCCCCHHH Confidence 43 2444557788888999988 99999999999999864321 12346799999999999999 Q ss_pred HHHHHHHHHHhcCC Q lcl|NC_020839. 156 QENIEAALMEWLEP 169 (169) Q Consensus 156 ~~~I~~~i~~~l~p 169 (169) +++|+++|.+||.- T Consensus 138 ~~~I~~~i~~~l~~ 151 (152) T protein:vir:10 138 LQMIEDYMINILAG 151 (152) T ss_pred HHHHHHHHHHHHhc Confidence 99999999999999 No 17 >protein:vir:3163 Length: 145 # NCBI annotation: unknown # Family: family:all:28417 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665934;genbank:gi:22091120;genbank:GeneID:951270 Probab=100.00 E-value=3.3e-34 Score=204.16 Aligned_cols=132 Identities=17% Similarity=0.262 Sum_probs=99.2 Q ss_pred CeEEEEchHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhhh Q lcl|NC_020839. 1 MFTVDVKDKELEAVFSGLEGRLSDPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLIG 80 (169) Q Consensus 1 mi~i~~~~~~l~~~L~~l~~~~~~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~~ 80 (169) ||+++.+ ++++|++|.. ++.+.|.+|++.|.+++++||+++.+|||.||+||+++|++++.+ ++ ++. T Consensus 1 ~i~~~~~---i~~~l~~l~~---~~~~~l~~i~~~~~~~~~~rf~~~~~p~G~~W~pLs~st~a~k~~------~~-~L~ 67 (145) T protein:vir:31 1 MVEDENN---IPEAREAIQD---GLTDGLERLHTITLRELITNMSDGQDALGNPWEPLKESTIRAKGS------DT-PLI 67 (145) T ss_pred CcccHHH---HHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccChHHHHHhcC------CC-CCc Confidence 8887644 5556666653 467789999999999999999999999999999999999876632 22 333 Q ss_pred hhhhhhhhhhhhee----cCCcEEEecCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCCHHHH Q lcl|NC_020839. 81 PTKTLSSPSNFAVS----SGGDWARLSSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGISAGDQ 156 (169) Q Consensus 81 ~~~~~~~~~si~~~----~~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s~~d~ 156 (169) .+ +.+..||++. ++++.|.||||++||++||||++ .++||||||||++++|. T Consensus 68 ~t--G~L~~Si~~~~~~~~~~~~a~vGtn~~YA~~hqfG~~----------------------~~~IPaRPfLG~~~~~~ 123 (145) T protein:vir:31 68 DN--SRLLTDINAASMMDRANRMAVIGTNLDYAEHHEFGAP----------------------EAGIPARPIFGPAGAYA 123 (145) T ss_pred cC--HHHHHHHHHHhhhcccCceeEecCCchhhhhhccCCc----------------------ccccCCCCccCCCccch Confidence 33 3444455543 45678999999999999999975 36799999999998763 Q ss_pred -HHHH----HHHHHhcCC Q lcl|NC_020839. 157 -ENIE----AALMEWLEP 169 (169) Q Consensus 157 -~~I~----~~i~~~l~p 169 (169) ++|. ++|.++|+= T Consensus 124 ~~~~~~ii~~~i~~~L~~ 141 (145) T protein:vir:31 124 SQQAPDVIGDEIDTNLEG 141 (145) T ss_pred HHHHHHHHHHHHHHHhhh Confidence 3444 455555655 No 18 >protein:vir:78755 Length: 228 # NCBI annotation: putative tail completion protein # Family: family:all:743 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285651;genbank:gi:148727157;genbank:GeneID:5220102 Probab=99.76 E-value=2.1e-21 Score=133.99 Aligned_cols=152 Identities=17% Similarity=0.224 Sum_probs=102.4 Q ss_pred CeEEEEchHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhhh Q lcl|NC_020839. 1 MFTVDVKDKELEAVFSGLEGRLSDPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLIG 80 (169) Q Consensus 1 mi~i~~~~~~l~~~L~~l~~~~~~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~~ 80 (169) ||+|+.|...|+++|+-|.-.....+.|+..||.+|+.+.++|+.+|++|||++|+|+++.. ++|+. T Consensus 1 m~t~~~dl~~l~~~L~ll~L~p~~RrrLl~~iar~lr~~~~~rIr~Q~~PDGs~~~pRKr~k-------------rKMl~ 67 (228) T protein:vir:78 1 MITITLDTRRGKDQLNLLALPPKKRKRLVWRAANEMKKLATRNVRQQQDPNGNAWAPRKRGK-------------RKMLR 67 (228) T ss_pred CccchhhHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCChhhhhhH-------------HHHHh Confidence 99999999999999998744444678999999999999999999999999999999987421 12332 Q ss_pred hhhhhhhhhhhheecCCcEEEecC--------cccchhhhhcccccccchhh---------h-----------h---hhh Q lcl|NC_020839. 81 PTKTLSSPSNFAVSSGGDWARLSS--------RAIQSAVMQFGAKKGAFGSY---------Q-----------G---KGF 129 (169) Q Consensus 81 ~~~~~~~~~si~~~~~~~~v~vGt--------~~~YA~iHqfGg~~~~~~~~---------~-----------~---~~~ 129 (169) . +.... .+.. ..++.++||- ....|++||||.+....... . . .++ T Consensus 68 ~--L~k~L-k~~~-~~~~~a~v~f~~~~~~~~~~rIA~vHq~G~~~~v~~~~~~~~~~~r~~~~~paTr~QAk~Lr~lGy 143 (228) T protein:vir:78 68 G--LPKLL-QIRE-PRQDMAELGFTKGTMSAHAGVIANTHQKGHTYKVTAASRRRIAPSDVGKNKQASKAQARKLRELGF 143 (228) T ss_pred h--hHHhh-hhhc-ccccceEEEeecCcccchHHHHHHHHhcCcccccccchhhhhhcccCCCCCCCCHHHHHHHHHhhc Confidence 2 11111 2221 1234455542 23589999999754332110 0 0 000 Q ss_pred cc---cc---ccccCc---------------------------eeeccCcccCCCCHHHHHHHHHHHHHhcCC Q lcl|NC_020839. 130 GG---SS---STISIP---------------------------WGDIPARPFMGISAGDQENIEAALMEWLEP 169 (169) Q Consensus 130 ~~---~~---~~~~~~---------------------------~~~iPaRpfLG~s~~d~~~I~~~i~~~l~p 169 (169) .. ++ +.+..+ .+++|+|+|||+|+++...+...+.+-+.= T Consensus 144 ~~~~~~~k~~rkps~kwI~~nls~gqAgliir~L~~k~~k~~W~I~~PaR~FLG~s~~e~~~~l~~~l~~i~~ 216 (228) T protein:vir:78 144 KRPGKRKRAYRSASLGWITANLNYAQAGLLIKKLKDEPVKESWEIQLPARPFLGANARQRQQAFALRPESIDY 216 (228) T ss_pred cccCCcCCCcccCCHHHHHHHhhHHHHHHHHHHHhCCCCccceeeecCcccccCCCHHHHHHHHHHHHHhccc Confidence 00 11 111111 278999999999999999999999988876 No 19 >protein:vir:3787 Length: 231 # NCBI annotation: orf22 # Family: family:all:743 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536827;genbank:gi:17981836;genbank:GeneID:929215 Probab=99.75 E-value=3.5e-21 Score=132.70 Aligned_cols=158 Identities=13% Similarity=0.146 Sum_probs=90.9 Q ss_pred CeEEEEchHH---HHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchh Q lcl|NC_020839. 1 MFTVDVKDKE---LEAVFSGLEGRLSDPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKP 77 (169) Q Consensus 1 mi~i~~~~~~---l~~~L~~l~~~~~~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~ 77 (169) -+.+++|-++ |+++|+-|.-.....+.|+..||..|+.+.++|+.+|++|||+||+|+++. +++... .. T Consensus 2 ~~~~~~n~~dl~~l~~~L~ll~L~p~kRrrLl~~iak~lr~~~~~rI~~Q~~PDGs~w~pRK~~-------~~k~k~-~r 73 (231) T protein:vir:37 2 QIRLGLKQEDLDAFVRDLRTLNLTGKQKKKILTWTLGAIKRKSQKNIREQHSPDGTAWEKRKPV-------DGEIKN-KR 73 (231) T ss_pred CccCCcCHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCcCchhccc-------ccchhh-HH Confidence 3367777554 455555442222245689999999999999999999999999999998642 111111 12 Q ss_pred hhhhhhhhhhhhhhheecCCcEEEecCc---ccchhhhhcccccccchh------------------hhh---hhhcc-- Q lcl|NC_020839. 78 LIGPTKTLSSPSNFAVSSGGDWARLSSR---AIQSAVMQFGAKKGAFGS------------------YQG---KGFGG-- 131 (169) Q Consensus 78 l~~~~~~~~~~~si~~~~~~~~v~vGt~---~~YA~iHqfGg~~~~~~~------------------~~~---~~~~~-- 131 (169) |+.. +..........+++.+.++.| ..+|++||||.+...... ++. .++.. T Consensus 74 m~~k---L~~~~~~~~~~~~~~~~~~~~g~~~~IA~vHQ~G~~~rv~~~~~~~~~~~~~~~pATr~QAk~Lr~lGy~v~~ 150 (231) T protein:vir:37 74 LLKK---VLRYASILAEERGKGRIYYKNPLTGEIAQKQQDGFTEHFRVFATDKNKNGSGNDRATIRQAQKLRSLGYRKRN 150 (231) T ss_pred HHHH---hHHhhccccccCCceEEeeecchHHHHHHHhhcCcccccchhhhhhccCCCCCCCCCHHHHHHHHHhcccccC Confidence 2221 222223333344555555434 468999999985433210 000 11110 Q ss_pred -c-------ccc-------------------------cc-----C-ceeeccCcccCCCCHHHHHHHHHHHHH-hcCC Q lcl|NC_020839. 132 -S-------SST-------------------------IS-----I-PWGDIPARPFMGISAGDQENIEAALME-WLEP 169 (169) Q Consensus 132 -~-------~~~-------------------------~~-----~-~~~~iPaRpfLG~s~~d~~~I~~~i~~-~l~p 169 (169) . .+. .. . =.+++|+|||||+|+++...|.+.+.+ +|-- T Consensus 151 ~k~k~~k~~~rkps~kwI~~~ls~~qAgliIR~L~~k~~~~~~k~~W~I~~paR~FLG~~~~e~~~~l~~~l~~i~~~ 228 (231) T protein:vir:37 151 GKNRQGKTKYRLYTIKEIRERLTRTWASMEIRRLENKVNAGNGKTNWEIHVPARPFLDTREKENVDILREITLKFLSG 228 (231) T ss_pred CCCCCCCCCcCcCCHHHHHHhhhhHHHHHHHHHHhcccccccCcceeeeecCcccccCCCHHHHHHHHHHHHHHHhcc Confidence 0 000 00 0 127899999999999887666655544 4443 No 20 >protein:vir:3750 Length: 227 # NCBI annotation: hypothetical protein # Family: family:all:743 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043491;genbank:gi:9628626;genbank:GeneID:1261131 Probab=99.69 E-value=1.5e-19 Score=123.81 Aligned_cols=153 Identities=10% Similarity=0.104 Sum_probs=95.4 Q ss_pred CeEEEEchHHHHHHHHHHH-HHhh--hHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchh Q lcl|NC_020839. 1 MFTVDVKDKELEAVFSGLE-GRLS--DPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKP 77 (169) Q Consensus 1 mi~i~~~~~~l~~~L~~l~-~~~~--~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~ 77 (169) -|++++|-.++....++|. ..+. ..+.|+..||.+|+.+.++|+.+|++|||+||+|+++. .++ T Consensus 2 ~i~~~~n~~~~~~l~~~L~ll~L~p~~Rr~ll~~iak~lr~~~k~rIr~Q~~PDGs~~~pRKr~-------------k~K 68 (227) T protein:vir:37 2 NIRMGIDKEDLKKFLKDLEIISLPDKKKREILIRSLQMIKRQAVKSAANQRNPMGGSWKKRKNG-------------TAK 68 (227) T ss_pred cccccCCHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCchhcch-------------hHH Confidence 3488999888877777774 2232 35689999999999999999999999999999998742 113 Q ss_pred hhhhhhhhhhhhhhheecCCcEEEe--cCcccchhhhhcccccccchhhh----------------------hhhhccc- Q lcl|NC_020839. 78 LIGPTKTLSSPSNFAVSSGGDWARL--SSRAIQSAVMQFGAKKGAFGSYQ----------------------GKGFGGS- 132 (169) Q Consensus 78 l~~~~~~~~~~~si~~~~~~~~v~v--Gt~~~YA~iHqfGg~~~~~~~~~----------------------~~~~~~~- 132 (169) |+... .....+.++.+...|.+ |.....|++||||.+........ ..++... T Consensus 69 M~~kL---~k~l~~~~~~~~a~v~f~~g~~~~IA~vHq~G~~~~v~~~~~~~~~~~~~~~~paTr~QAk~Lr~lGy~v~~ 145 (227) T protein:vir:37 69 MLRRI---AKLANSKAEKAQGTLFYKQKRTGEIAQEHQEGIPHLFKKTEFTGKNKGGIGADPCTLRQAKKLKDLGYTVAN 145 (227) T ss_pred HHhhh---HHHcceeecccceEEEecCcchHHHHHHhhcCcccccchhhhhhhhcCCccccCCCHHHHHHHHHhcccccC Confidence 44322 22223443333333333 22346899999998655322100 0011000 Q ss_pred ---------ccc--------------------------------ccCc-eeeccCcccCCCCHHHHHHHHHHHHHhcCC Q lcl|NC_020839. 133 ---------SST--------------------------------ISIP-WGDIPARPFMGISAGDQENIEAALMEWLEP 169 (169) Q Consensus 133 ---------~~~--------------------------------~~~~-~~~iPaRpfLG~s~~d~~~I~~~i~~~l~p 169 (169) .+. .... .+++|+|+|||+|+++...|...+.+-+.- T Consensus 146 ~k~k~~k~~~rkps~kwI~~nls~~qAgliIR~L~~k~~~~~~~~k~~W~I~~PaR~FLG~~~~e~~~~l~r~l~~~~~ 224 (227) T protein:vir:37 146 GKTKNGKAKRRKPTLSEIRSTLSRAKASLIIRKLEEKNGMNPSRHLTQWIIPTEKRSFLDTREEENAKIILAEIQKYTQ 224 (227) T ss_pred CCCCCcCCccccCCHHHHHHhhhHHHHHHHHHHHhcccccccccCccceeeecCcccccCCCHHHHHHHHHHHHHHHhh Confidence 000 0011 268999999999998877766666555544 No 21 >protein:vir:98860 Length: 230 # NCBI annotation: hypothetical protein # Family: family:all:743 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654736;genbank:gi:109302921;genbank:GeneID:4156065 Probab=99.61 E-value=2.4e-18 Score=117.15 Aligned_cols=153 Identities=16% Similarity=0.159 Sum_probs=90.2 Q ss_pred CeEEEEchHHHHHHHHHHH---HHhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchh Q lcl|NC_020839. 1 MFTVDVKDKELEAVFSGLE---GRLSDPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKP 77 (169) Q Consensus 1 mi~i~~~~~~l~~~L~~l~---~~~~~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~ 77 (169) =|..++|-+++....+.|. -.....+.|+..||.+|+.+.++|+.+|++|||++|+|++.. + ++ T Consensus 4 ~i~~~ln~~~~~~l~~~L~ll~L~p~kRrrll~~iak~lr~~~k~rIr~Q~~PDGs~w~pRKr~---------k----~K 70 (230) T protein:vir:98 4 AIKMGVNPDDLRDFLKDLELLKIPPKKKKEILIRTLQEMKKRSVKSASNQRTPTGSGWKPRKNG---------N----AK 70 (230) T ss_pred cCcccCCHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCChhhhhh---------h----HH Confidence 3455566555554444443 222245789999999999999999999999999999998632 0 12 Q ss_pred hhhhh-hhhhhhhhhheecCCcEEEecC---cccchhhhhcccccccchhhh----------------------hhhhcc Q lcl|NC_020839. 78 LIGPT-KTLSSPSNFAVSSGGDWARLSS---RAIQSAVMQFGAKKGAFGSYQ----------------------GKGFGG 131 (169) Q Consensus 78 l~~~~-~~~~~~~si~~~~~~~~v~vGt---~~~YA~iHqfGg~~~~~~~~~----------------------~~~~~~ 131 (169) |+... .++. -.....+...|.++. ....|++||||.+........ ..++.. T Consensus 71 Ml~~L~k~l~---~~~~~~~~~~v~~~~~~~~~rIA~vHq~G~~~~~~~~~~~~r~~~~~~~~paTr~QAk~Lr~lGy~v 147 (230) T protein:vir:98 71 MLRRIAKTLK---FTSADREIKRVCTISRNAQRRSQKEHQRGAKITNLKSVILRKSRAGTAKDPATMRQAKKLRDLGYTV 147 (230) T ss_pred HHhhhHHHHH---HhhcccccceeeeecccchhhhhhhhhccchhhhhhhhhhhhhcCCCCcccccHHHHHHHHHcCCcc Confidence 33322 2221 122222344444433 336899999997654322100 011110 Q ss_pred ----------ccc--------------------------ccc------CceeeccCcccCCCCHHHHHHHHHHHHHhcCC Q lcl|NC_020839. 132 ----------SSS--------------------------TIS------IPWGDIPARPFMGISAGDQENIEAALMEWLEP 169 (169) Q Consensus 132 ----------~~~--------------------------~~~------~~~~~iPaRpfLG~s~~d~~~I~~~i~~~l~p 169 (169) +.+ ... .-.+++|+|+|||+|+++...|.+.+..-+-- T Consensus 148 ~~g~~~~~~k~~kkps~kwI~~nls~~qAgliIR~L~~k~~k~~~~~t~W~I~~PaR~FLG~~~~e~~~~l~~~l~~i~~ 227 (230) T protein:vir:98 148 PNGTTKSGKKRYRRPSAREIVATLSRAKASLLIRYFQEKEERQGKRLTKWIIPTEKRPFLDERDKENAEILKEFILKFSG 227 (230) T ss_pred CCCCCCcCCCCCCCCCHHHHHHhhhHHHHHHHHHHHhccccccccCccceeeecCcccccCCChHHHHHHHHHHHHHhcc Confidence 000 000 11268999999999999888776666554433 No 22 >protein:vir:274 Length: 166 # NCBI annotation: putative tail completion protein # Family: family:all:743 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536654;genbank:gi:17975132;genbank:GeneID:929088 Probab=99.41 E-value=8.5e-16 Score=103.20 Aligned_cols=142 Identities=15% Similarity=0.169 Sum_probs=81.0 Q ss_pred CeEEEEchHHHHHHHHHHHH---HhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchh Q lcl|NC_020839. 1 MFTVDVKDKELEAVFSGLEG---RLSDPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKP 77 (169) Q Consensus 1 mi~i~~~~~~l~~~L~~l~~---~~~~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~ 77 (169) ||+|++|..++.+..++|.. .....+.|+..||.+|+.++++||.+|++|||++|+|+++. .++ T Consensus 1 m~~~~~~~~q~~~l~~~L~ll~L~p~~Rr~ll~~iak~lr~~~~~rIr~Q~~PDGs~~~pRKr~-------------k~K 67 (166) T protein:vir:27 1 MFEIKAEDRSYLRVMEQLELLGLDRKTRDKMLRRIGAQIAKTTRKNIRAQRDPDGSAWAKRKRG-------------RGK 67 (166) T ss_pred CeeeccChHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCcCchhhhh-------------hHH Confidence 99999999877755555542 22245789999999999999999999999999999997632 112 Q ss_pred hhhhhhhhhhhhhhheecCCcEEEecCcccc---hhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCCHH Q lcl|NC_020839. 78 LIGPTKTLSSPSNFAVSSGGDWARLSSRAIQ---SAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGISAG 154 (169) Q Consensus 78 l~~~~~~~~~~~si~~~~~~~~v~vGt~~~Y---A~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s~~ 154 (169) |+... .... .+....+++.|.||.+..+ |++||||.+............... ..+..+-|| |.. T Consensus 68 Ml~~l--~k~~-~~~~~~~~~~~~v~~~g~~~rIA~vHq~G~~~~~~~~~~~~~~~~~----~~~~~~~pA------Tr~ 134 (166) T protein:vir:27 68 LLKGF--TQKL-KHFQRDNNRTLVVGWPSARGRVAYEHHHGIAQESGLSARKRQAKQQ----NEPRKTDPA------TRE 134 (166) T ss_pred HHHhh--HHHh-hhhccCCCCeEEEEecCchhhhhhhhhcCcccccccchhhHHHhhc----cCCCCCccC------CHH Confidence 33322 2222 2333345677888877655 666999987654321111100000 011111221 222 Q ss_pred HHHHHHHHHHHhcCC Q lcl|NC_020839. 155 DQENIEAALMEWLEP 169 (169) Q Consensus 155 d~~~I~~~i~~~l~p 169 (169) .-..+. ...-|..| T Consensus 135 QAk~Lr-~~~~~~~~ 148 (166) T protein:vir:27 135 QAKRCA-ISITASSL 148 (166) T ss_pred HHHHHH-HhcCcccc Confidence 222211 12223333 No 23 >protein:vir:94654 Length: 142 # NCBI annotation: tail component protein # Family: family:all:1084 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579211;genbank:gi:93007447;genbank:GeneID:5076773 Probab=98.95 E-value=1.5e-11 Score=79.91 Aligned_cols=132 Identities=13% Similarity=0.140 Sum_probs=79.7 Q ss_pred Ce--EEEEchHHHHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchh Q lcl|NC_020839. 1 MF--TVDVKDKELEAVFSGLEGRLSD-PSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKP 77 (169) Q Consensus 1 mi--~i~~~~~~l~~~L~~l~~~~~~-~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~ 77 (169) |. +++++.++|.+.|+.+.+.+.+ ....+.+++..+.+..+.+ .| | T Consensus 1 Ma~~~~~~~~~~l~~~l~~~~~~~~~~~~~~l~~~a~~i~~~ak~~-----aP----v---------------------- 49 (142) T protein:vir:94 1 MAGLNYRVNSTEFQGALRAALDRLTGAAREATEAAANDMVNMAKGL-----CP----V---------------------- 49 (142) T ss_pred CceeEEEecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CC----c---------------------- Confidence 54 5567788899999988877653 5567777777777665332 22 2 Q ss_pred hhhhhhhhhhhhhhheecCCc----EEEecCcccchhhhhcccccccchhhhhhhhccccccccCceee---ccCcccCC Q lcl|NC_020839. 78 LIGPTKTLSSPSNFAVSSGGD----WARLSSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGD---IPARPFMG 150 (169) Q Consensus 78 l~~~~~~~~~~~si~~~~~~~----~v~vGt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~---iPaRpfLG 150 (169) +|+.|.+||.+....+ .+.||++++||..|+||...............+.+......++. +|++|||. T Consensus 50 -----~TG~Lr~SI~~~~~~~g~~~~~~v~~~~~YA~~vE~Gt~~~~i~pk~~k~l~~~~~~~~~~~v~~pG~~~~pfl~ 124 (142) T protein:vir:94 50 -----DTGRLRSSIQAVPSGGRFSFSVTIGTNVTYAADVEYGTAPHVIVPKDKKALYWPGAAHPVAKVNHPGTRAQPFMR 124 (142) T ss_pred -----cchhhhccceeeeccCCceEEEEEecCcccchhhhccCCCceeccCCCccceecccceeeeeeeecCCCCCcchh Confidence 1334444555443222 58899999999999999865433222222222222222333443 78999998 Q ss_pred CCHHHHHHHHHHHHHhcC Q lcl|NC_020839. 151 ISAGDQENIEAALMEWLE 168 (169) Q Consensus 151 ~s~~d~~~I~~~i~~~l~ 168 (169) -+-++......-+.+-|+ T Consensus 125 ~A~~~~~~~i~~~~~~~~ 142 (142) T protein:vir:94 125 PAIAAASTFLRNHAKGIR 142 (142) T ss_pred HHHHHHHHHHHHHHHhcC Confidence 775544444444445566 No 24 >protein:vir:4906 Length: 114 # NCBI annotation: gp114 # Family: family:all:180 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056684;genbank:gi:9635019;genbank:GeneID:1262668 Probab=98.73 E-value=1.9e-10 Score=73.84 Aligned_cols=112 Identities=17% Similarity=0.271 Sum_probs=79.7 Q ss_pred CeEEEEch-HHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhh Q lcl|NC_020839. 1 MFTVDVKD-KELEAVFSGLEGRLSDPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLI 79 (169) Q Consensus 1 mi~i~~~~-~~l~~~L~~l~~~~~~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~ 79 (169) |++|++++ ++|.+.|.++.. ..+...+++..+..+.+..+++- +...|. T Consensus 1 Ma~i~~~Gld~l~~~L~~~~~-~~~v~~~~~~~~~~~~~~~~~~a-----~~~~p~------------------------ 50 (114) T protein:vir:49 1 MATIEFEGLDEMAQSLLKNAS-PEKRSKVLRKYGSKLKEAAVNRA-----QFNKGY------------------------ 50 (114) T ss_pred CeeeeeehHHHHHHHHHHhcC-HHHHHHHHHHHHHHHHHHHHHhc-----ccCCCC------------------------ Confidence 99999997 778887777632 23445666666666665554332 111111 Q ss_pred hhhhhhhhhhhhheecCCcEEEecCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCC-HHHHHH Q lcl|NC_020839. 80 GPTKTLSSPSNFAVSSGGDWARLSSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGIS-AGDQEN 158 (169) Q Consensus 80 ~~~~~~~~~~si~~~~~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s-~~d~~~ 158 (169) .|+.+..||.+..+++.++||++..||..+.||.. .+||||||.-. ++.+.. T Consensus 51 ---~TG~Lr~sI~~~~~~~~~~V~~~~~Ya~~vEfGT~------------------------km~a~Pfl~PA~~~~~~~ 103 (114) T protein:vir:49 51 ---STGATRRSITLQVESDKATVEALTSYSGYLEVGTR------------------------KMEAQPFMKPALDEVAPK 103 (114) T ss_pred ---CchhhhhceeeeecCCeeEecCCCCccceeccccc------------------------ccCCCCchhhhHHHHHHH Confidence 23455567887778888999999999999999963 69999998744 455777 Q ss_pred HHHHHHHhcCC Q lcl|NC_020839. 159 IEAALMEWLEP 169 (169) Q Consensus 159 I~~~i~~~l~p 169 (169) +.+.|.+.|+- T Consensus 104 ~~~~l~~l~k~ 114 (114) T protein:vir:49 104 MVEELAKWDET 114 (114) T ss_pred HHHHHHHHhcC Confidence 88888888888 No 25 >protein:vir:2740 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695113;genbank:gi:23455882;genbank:GeneID:955595 Probab=98.73 E-value=1.9e-10 Score=73.84 Aligned_cols=112 Identities=17% Similarity=0.271 Sum_probs=79.7 Q ss_pred CeEEEEch-HHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhh Q lcl|NC_020839. 1 MFTVDVKD-KELEAVFSGLEGRLSDPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLI 79 (169) Q Consensus 1 mi~i~~~~-~~l~~~L~~l~~~~~~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~ 79 (169) |++|++++ ++|.+.|.++.. ..+...+++..+..+.+..+++- +...|. T Consensus 1 Ma~i~~~Gld~l~~~L~~~~~-~~~v~~~~~~~~~~~~~~~~~~a-----~~~~p~------------------------ 50 (114) T protein:vir:27 1 MATIEFEGLDEMAQSLLKNAS-PEKRSKVLRKYGSKLKEAAVNRA-----QFNKGY------------------------ 50 (114) T ss_pred CeeeeeehHHHHHHHHHHhcC-HHHHHHHHHHHHHHHHHHHHHhc-----ccCCCC------------------------ Confidence 99999997 778887777632 23445666666666665554332 111111 Q ss_pred hhhhhhhhhhhhheecCCcEEEecCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCC-HHHHHH Q lcl|NC_020839. 80 GPTKTLSSPSNFAVSSGGDWARLSSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGIS-AGDQEN 158 (169) Q Consensus 80 ~~~~~~~~~~si~~~~~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s-~~d~~~ 158 (169) .|+.+..||.+..+++.++||++..||..+.||.. .+||||||.-. ++.+.. T Consensus 51 ---~TG~Lr~sI~~~~~~~~~~V~~~~~Ya~~vEfGT~------------------------km~a~Pfl~PA~~~~~~~ 103 (114) T protein:vir:27 51 ---STGATRRSITLQVESDKATVEALTSYSGYLEVGTR------------------------KMEAQPFMKPALDEVAPK 103 (114) T ss_pred ---CchhhhhceeeeecCCeeEecCCCCccceeccccc------------------------ccCCCCchhhhHHHHHHH Confidence 23455567887778888999999999999999963 69999998744 455777 Q ss_pred HHHHHHHhcCC Q lcl|NC_020839. 159 IEAALMEWLEP 169 (169) Q Consensus 159 I~~~i~~~l~p 169 (169) +.+.|.+.|+- T Consensus 104 ~~~~l~~l~k~ 114 (114) T protein:vir:27 104 MVEELAKWDET 114 (114) T ss_pred HHHHHHHHhcC Confidence 88888888888 No 26 >protein:vir:106041 Length: 137 # NCBI annotation: gp23 # Family: family:all:1084 # MgeID: mge:1505 # MgeName: Cooper # Cross-refs: genbank:acc:YP_654920;genbank:gi:109392376;genbank:GeneID:4157069 Probab=98.64 E-value=2.6e-10 Score=73.10 Aligned_cols=119 Identities=13% Similarity=0.070 Sum_probs=71.9 Q ss_pred CeEEEEchHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhhh Q lcl|NC_020839. 1 MFTVDVKDKELEAVFSGLEGRLSDPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLIG 80 (169) Q Consensus 1 mi~i~~~~~~l~~~L~~l~~~~~~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~~ 80 (169) +.+|++|...+.+.+.... +..+..++..+....+.+ . || T Consensus 4 s~~i~i~~~~l~~~v~~~~------k~~l~~~a~~i~~~ak~~-----a----Pv------------------------- 43 (137) T protein:vir:10 4 TARIHINEPELERQTGAIF------RGKHRSITRRIATQARAD-----V----PV------------------------- 43 (137) T ss_pred eEEEeeCHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHh-----C----Cc------------------------- Confidence 6788888777766555443 344566666665543221 2 22 Q ss_pred hhhhhhhhhhhheecC---C--cEEEecCcccchhhhhcccccc--cchhhhhhhhccccccccCceeecc---CcccCC Q lcl|NC_020839. 81 PTKTLSSPSNFAVSSG---G--DWARLSSRAIQSAVMQFGAKKG--AFGSYQGKGFGGSSSTISIPWGDIP---ARPFMG 150 (169) Q Consensus 81 ~~~~~~~~~si~~~~~---~--~~v~vGt~~~YA~iHqfGg~~~--~~~~~~~~~~~~~~~~~~~~~~~iP---aRpfLG 150 (169) +++.|..||..... + ..+.||++.+||++|+||+... .....+.+.|.+.++.+..+.|+.| +||||- T Consensus 44 --~tG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~ve~GT~ph~I~pk~~k~l~f~~~G~~v~~k~v~hpG~~a~Pfl~ 121 (137) T protein:vir:10 44 --RTGNLGRGIQEMPQTYRPFHVGGGVEDNVDYAAPVHEGSRPHRITARHANALHFFWHGREVFRKSVWHPGVRPRPFLR 121 (137) T ss_pred --ccchhhcCceeeeeccccceEEEEEecCCCceeeeeecCCCceeecccCceeeeeeCCceEEeeeeecCCCCCCchHH Confidence 13444456654332 1 2567999999999999998532 2234556666778888888888888 999962 Q ss_pred CCHHHHHHHHHHHHHhcCC Q lcl|NC_020839. 151 ISAGDQENIEAALMEWLEP 169 (169) Q Consensus 151 ~s~~d~~~I~~~i~~~l~p 169 (169) +..++ .+. -+| T Consensus 122 --~A~~~----~~~--~~~ 132 (137) T protein:vir:10 122 --NAARR----VVA--ADP 132 (137) T ss_pred --HHHHH----Hhh--ccc Confidence 22221 110 133 No 27 >protein:vir:8669 Length: 142 # NCBI annotation: gp27 # Family: family:all:1084 # MgeID: mge:156 # MgeName: Rosebush # Cross-refs: genbank:acc:NP_817788;genbank:gi:29566220;genbank:GeneID:1259476 Probab=98.62 E-value=2.9e-10 Score=72.87 Aligned_cols=127 Identities=14% Similarity=0.110 Sum_probs=77.0 Q ss_pred CeEEEEchHHHHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhh Q lcl|NC_020839. 1 MFTVDVKDKELEAVFSGLEGRLSD-PSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLI 79 (169) Q Consensus 1 mi~i~~~~~~l~~~L~~l~~~~~~-~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~ 79 (169) |.+++++.+.|+..|..+...++. .+..+..++..+.+..+.. . ||. T Consensus 1 m~~~~~~~~gl~~~l~~~~~~~~~~~~~~i~~~a~~v~~~Ak~~-----a----Pv~----------------------- 48 (142) T protein:vir:86 1 MVQVSVRYEGFDYNPVGAAAQVGPILRRTHSSLTRQIANETRAR-----V----PVL----------------------- 48 (142) T ss_pred CceeEEEeeecchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-----C----Ccc----------------------- Confidence 999888888888877777766542 3445555555554444322 2 231 Q ss_pred hhhhhhhhhhhhheecCC------cEEEecCcccchhhhhccccccc--chhhhhhhhccccccccCceeecc---Cccc Q lcl|NC_020839. 80 GPTKTLSSPSNFAVSSGG------DWARLSSRAIQSAVMQFGAKKGA--FGSYQGKGFGGSSSTISIPWGDIP---ARPF 148 (169) Q Consensus 80 ~~~~~~~~~~si~~~~~~------~~v~vGt~~~YA~iHqfGg~~~~--~~~~~~~~~~~~~~~~~~~~~~iP---aRpf 148 (169) ++.|..||.+.... ..+.|+++..||++|+||..... ....+.+.|.+.+..+...+|+.| ++|| T Consensus 49 ----tG~Lr~SI~~~~~~~~~~~~~~~~v~~~a~YA~~ve~GT~ph~i~pk~~~al~f~~~g~~~~~k~v~hpG~~a~Pf 124 (142) T protein:vir:86 49 ----TGHLGRSVREDPQVMVTPFHVSGGVTAHAKYAAAVHEGTRPHVIRAKHAQALHFWWRGREVFVRQVNHPGTRARPY 124 (142) T ss_pred ----chhhhcceeeeeccccccceEEEEeccCccccceeccCCccceeccccCceeeEecCCceeeeeeeecCCCCCCch Confidence 23334455433221 24668899999999999986332 233444556677777888888876 9999 Q ss_pred CCCCHHHHH---HHHHHHHH Q lcl|NC_020839. 149 MGISAGDQE---NIEAALME 165 (169) Q Consensus 149 LG~s~~d~~---~I~~~i~~ 165 (169) |- ++.++ +..+++.+ T Consensus 125 l~--~A~~~~~~~~~~~~~r 142 (142) T protein:vir:86 125 LR--NAGEAVVRRDRRIRVR 142 (142) T ss_pred hH--HHHHHHHhhhhhhccC Confidence 74 43333 23333333 No 28 >protein:vir:99101 Length: 142 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1608 # MgeName: Qyrzula # Cross-refs: genbank:acc:YP_655705;genbank:gi:109521783;genbank:GeneID:4157823 Probab=98.62 E-value=2.9e-10 Score=72.87 Aligned_cols=127 Identities=14% Similarity=0.110 Sum_probs=77.0 Q ss_pred CeEEEEchHHHHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhh Q lcl|NC_020839. 1 MFTVDVKDKELEAVFSGLEGRLSD-PSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLI 79 (169) Q Consensus 1 mi~i~~~~~~l~~~L~~l~~~~~~-~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~ 79 (169) |.+++++.+.|+..|..+...++. .+..+..++..+.+..+.. . ||. T Consensus 1 m~~~~~~~~gl~~~l~~~~~~~~~~~~~~i~~~a~~v~~~Ak~~-----a----Pv~----------------------- 48 (142) T protein:vir:99 1 MVQVSVRYEGFDYNPVGAAAQVGPILRRTHSSLTRQIANETRAR-----V----PVL----------------------- 48 (142) T ss_pred CceeEEEeeecchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-----C----Ccc----------------------- Confidence 999888888888877777766542 3445555555554444322 2 231 Q ss_pred hhhhhhhhhhhhheecCC------cEEEecCcccchhhhhccccccc--chhhhhhhhccccccccCceeecc---Cccc Q lcl|NC_020839. 80 GPTKTLSSPSNFAVSSGG------DWARLSSRAIQSAVMQFGAKKGA--FGSYQGKGFGGSSSTISIPWGDIP---ARPF 148 (169) Q Consensus 80 ~~~~~~~~~~si~~~~~~------~~v~vGt~~~YA~iHqfGg~~~~--~~~~~~~~~~~~~~~~~~~~~~iP---aRpf 148 (169) ++.|..||.+.... ..+.|+++..||++|+||..... ....+.+.|.+.+..+...+|+.| ++|| T Consensus 49 ----tG~Lr~SI~~~~~~~~~~~~~~~~v~~~a~YA~~ve~GT~ph~i~pk~~~al~f~~~g~~~~~k~v~hpG~~a~Pf 124 (142) T protein:vir:99 49 ----TGHLGRSVREDPQVMVTPFHVSGGVTAHAKYAAAVHEGTRPHVIRAKHAQALHFWWRGREVFVRQVNHPGTRARPY 124 (142) T ss_pred ----chhhhcceeeeeccccccceEEEEeccCccccceeccCCccceeccccCceeeEecCCceeeeeeeecCCCCCCch Confidence 23334455433221 24668899999999999986332 233444556677777888888876 9999 Q ss_pred CCCCHHHHH---HHHHHHHH Q lcl|NC_020839. 149 MGISAGDQE---NIEAALME 165 (169) Q Consensus 149 LG~s~~d~~---~I~~~i~~ 165 (169) |- ++.++ +..+++.+ T Consensus 125 l~--~A~~~~~~~~~~~~~r 142 (142) T protein:vir:99 125 LR--NAGEAVVRRDRRIRVR 142 (142) T ss_pred hH--HHHHHHHhhhhhhccC Confidence 74 43333 23333333 No 29 >protein:vir:96486 Length: 112 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238496;genbank:gi:66391772;genbank:GeneID:5176908 Probab=98.55 E-value=1.2e-09 Score=69.59 Aligned_cols=111 Identities=18% Similarity=0.248 Sum_probs=73.3 Q ss_pred CeEEEEch-HHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhh Q lcl|NC_020839. 1 MFTVDVKD-KELEAVFSGLEGRLSDPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLI 79 (169) Q Consensus 1 mi~i~~~~-~~l~~~L~~l~~~~~~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~ 79 (169) |.+|++++ ++|.+.|+.+.. ..+.+..++..+..+...+++.-.. ..| + T Consensus 1 Ma~i~i~Gld~L~~~l~~~~~-~~~v~~~v~~~~~~~~~~~~~~a~~-~ap----v------------------------ 50 (112) T protein:vir:96 1 MATIEFEGLDEMAQSLLKNAS-SERRSKVLRKYGAKLKEAAVSKAQF-KKG----Y------------------------ 50 (112) T ss_pred CceeeehHHHHHHHHHHhhcC-HHHHHHHHHHHHHHHHHHHHHHhhh-cCC----C------------------------ Confidence 99999997 778887776642 2344455555555555544443322 122 1 Q ss_pred hhhhhhhhhhhhheecCCcEEEecCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCCHHHHHHH Q lcl|NC_020839. 80 GPTKTLSSPSNFAVSSGGDWARLSSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGISAGDQENI 159 (169) Q Consensus 80 ~~~~~~~~~~si~~~~~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s~~d~~~I 159 (169) .|+.+..||++..++..+.||++..||....||.. .|||||||.-.=+-.+.. T Consensus 51 ---dTG~Lr~sI~~~~~~~~~~v~~~~~Ya~~vE~GTr------------------------~m~AqPF~~PA~~~~~~~ 103 (112) T protein:vir:96 51 ---STGATRRSITLEAGSDRAVVEALTNYSGYLEVGTR------------------------KMEAQPFMRPALDQVVPE 103 (112) T ss_pred ---CchhhhhceeeecCceEEEecCCCCccceeccCcc------------------------ccCCCCchhhhHHHHHHH Confidence 24555668888889999999999999999999964 599999998554433333 Q ss_pred HHHHHHhcC Q lcl|NC_020839. 160 EAALMEWLE 168 (169) Q Consensus 160 ~~~i~~~l~ 168 (169) ..--.+-|+ T Consensus 104 ~~~~l~~L~ 112 (112) T protein:vir:96 104 MVEEMAKWE 112 (112) T ss_pred HHHHHHhcC Confidence 333334455 No 30 >protein:vir:106570 Length: 182 # NCBI annotation: putative protein # Family: family:all:6475 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958588;genbank:gi:41179258;genbank:GeneID:2717106 Probab=98.39 E-value=5.4e-09 Score=65.91 Aligned_cols=137 Identities=15% Similarity=0.108 Sum_probs=81.0 Q ss_pred CeEEEEch-HHHHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhh Q lcl|NC_020839. 1 MFTVDVKD-KELEAVFSGLEGRLSD-PSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPL 78 (169) Q Consensus 1 mi~i~~~~-~~l~~~L~~l~~~~~~-~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l 78 (169) |++|++++ ++|.+.|..+...+.+ ..+.+..+.+.+...++++.+.. .| T Consensus 1 m~~v~i~Gld~L~~kl~~~~~~~~~~v~~a~~~~~~~~a~~v~~~ak~~-~P---------------------------- 51 (182) T protein:vir:10 1 MIEVELKGVNELRAKLKKLPDIMAKATANAQENAIEQAEAYAVDELQSS-IK---------------------------- 51 (182) T ss_pred CeEEEEecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-CC---------------------------- Confidence 99999997 8899999999877754 45667677676666666666543 23 Q ss_pred hhhhhhhhhhhhhheec----CCcEEEecCcccchhhhhcccccccc---------hhhhhhhhccc--cccc------- Q lcl|NC_020839. 79 IGPTKTLSSPSNFAVSS----GGDWARLSSRAIQSAVMQFGAKKGAF---------GSYQGKGFGGS--SSTI------- 136 (169) Q Consensus 79 ~~~~~~~~~~~si~~~~----~~~~v~vGt~~~YA~iHqfGg~~~~~---------~~~~~~~~~~~--~~~~------- 136 (169) .+++.|.+||.+.. +.-...|+++.+||.+++||+-.... ....+....+. ...+ T Consensus 52 ---vdtG~Lr~SI~~~~~~~~~~~~g~V~~~~~ya~yvE~GTG~~~~~~~~~~~p~~~~~~~~~~w~~~~~~v~~~~a~~ 128 (182) T protein:vir:10 52 ---YSTGELTRSFKHEVKVDGDEVIGRWWNSSMVAVFREFGTGLVGERSHKQLPKNVAIIYRQTPWFFPVDSVDLDLTKI 128 (182) T ss_pred ---CCchhhhhceeeeeeecCCeEEEEeecCCCccceeecCcccccccCccccCccceeeeecCCceeeccccccccccc Confidence 12344445664322 23357899999999999999632110 00000000000 0000 Q ss_pred --------c----CceeeccCcccCCCCHHH-HHHHHHHHHHhcCC Q lcl|NC_020839. 137 --------S----IPWGDIPARPFMGISAGD-QENIEAALMEWLEP 169 (169) Q Consensus 137 --------~----~~~~~iPaRpfLG~s~~d-~~~I~~~i~~~l~p 169 (169) . .....+||||||=-+-++ +..|.++|.++++= T Consensus 129 ~~~~~~~~~~~~~~~t~G~~aqPFl~pA~~~~~~~i~~~i~~~i~~ 174 (182) T protein:vir:10 129 YGIPKIKINGKYFYRTTGQPARQFMTPAANKMAKEAPEIIKRSIDQ 174 (182) T ss_pred cccceeeecCceEeecCCCCCCcchHHHHHHhHHHHHHHHHHHHHH Confidence 0 011248999998755433 56666666666665 No 31 >protein:vir:105330 Length: 137 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950673;genbank:gi:119967843;genbank:GeneID:4643209 Probab=98.37 E-value=3.5e-09 Score=66.96 Aligned_cols=128 Identities=13% Similarity=0.094 Sum_probs=79.1 Q ss_pred CeEEEEchHHHHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhh Q lcl|NC_020839. 1 MFTVDVKDKELEAVFSGLEGRLSD-PSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLI 79 (169) Q Consensus 1 mi~i~~~~~~l~~~L~~l~~~~~~-~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~ 79 (169) |.++++..++|.+.|+.+...+.+ ....+...|..+.+..+.+ .| . T Consensus 1 Ma~~~~G~~~l~~~l~~~~~~~~~~~~~al~~~a~~i~~~ak~~-----aP----v------------------------ 47 (137) T protein:vir:10 1 MAKVKYGNWDLVKELEEFEKETIRWAKKGIAKTTTIIHNSIVSN-----MP----V------------------------ 47 (137) T ss_pred CccchhCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CC----c------------------------ Confidence 999998889999999999887764 4567777787777766654 22 1 Q ss_pred hhhhhhhhhhhhhee--cCCcEEEecCcccchhhhhcccccccchhh--hhhhhccccccccCc---eeeccCcccCCCC Q lcl|NC_020839. 80 GPTKTLSSPSNFAVS--SGGDWARLSSRAIQSAVMQFGAKKGAFGSY--QGKGFGGSSSTISIP---WGDIPARPFMGIS 152 (169) Q Consensus 80 ~~~~~~~~~~si~~~--~~~~~v~vGt~~~YA~iHqfGg~~~~~~~~--~~~~~~~~~~~~~~~---~~~iPaRpfLG~s 152 (169) +|+.+..||++. .++..+.||+++.||..++||......... ..+...+....+.+. ...+|+||||-=+ T Consensus 48 ---~TG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~Pfl~pA 124 (137) T protein:vir:10 48 ---DTGYLRESVSMDFKKGGLTGVINIGSEYAVYVNYGTGIYAVGPGGSRAKNIPWRYKDADGHWHTTKGQHAQPFWEPA 124 (137) T ss_pred ---CcchhhcCeeeEecCCcEEEEEecCCccccccccCccccccCCCcccccccceeeeccccccccCCCCCCCcchhHH Confidence 133444455544 344478999999999999999744321111 111111111111122 2359999998644 Q ss_pred H-HHHHHHHHHHH Q lcl|NC_020839. 153 A-GDQENIEAALM 164 (169) Q Consensus 153 ~-~d~~~I~~~i~ 164 (169) - +.+..|.+.|. T Consensus 125 ~~~~~~~i~k~i~ 137 (137) T protein:vir:10 125 IDEGRAFFNKYFS 137 (137) T ss_pred HHHHHHHHHHhhC Confidence 2 33455555555 No 32 >protein:vir:96829 Length: 135 # NCBI annotation: ORF033 # Family: family:all:180 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240161;genbank:gi:66395838;genbank:GeneID:5133170 Probab=98.37 E-value=3.4e-09 Score=66.99 Aligned_cols=126 Identities=10% Similarity=0.044 Sum_probs=75.7 Q ss_pred CeEEEEchHHHHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhh Q lcl|NC_020839. 1 MFTVDVKDKELEAVFSGLEGRLSD-PSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLI 79 (169) Q Consensus 1 mi~i~~~~~~l~~~L~~l~~~~~~-~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~ 79 (169) |..+++-.++|.+.|+++...+.+ ....|.+.++.+.+..+.+ .| . T Consensus 1 Ma~~~~Gl~~l~~~l~~~~~~~~~~~~~al~~~a~~v~~~ak~~-----ap----v------------------------ 47 (135) T protein:vir:96 1 MAKVKYGADSIVVDLEKYSKDMEKWVKKGITKTTLKIYNTAIHL-----MP----V------------------------ 47 (135) T ss_pred CchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CC----c------------------------ Confidence 999988778899999998877753 4566777777766654322 22 1 Q ss_pred hhhhhhhhhhhhhe--ecCCcEEEecCcccchhhhhcccccccchhhhhhhhcc-----ccccccCceeeccCcccCCCC Q lcl|NC_020839. 80 GPTKTLSSPSNFAV--SSGGDWARLSSRAIQSAVMQFGAKKGAFGSYQGKGFGG-----SSSTISIPWGDIPARPFMGIS 152 (169) Q Consensus 80 ~~~~~~~~~~si~~--~~~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~~~~~~-----~~~~~~~~~~~iPaRpfLG~s 152 (169) +|+.+..||++ ..++..++||+++.||...+||..+............+ .+..+.. ..+|++|||==+ T Consensus 48 ---dTG~Lr~SI~~~~~~~g~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~~~~g~~~~~--~~~~a~pfl~~A 122 (135) T protein:vir:96 48 ---DTGFLRQSTTVDFENGGFTGVVKIGSNYAVYVNYGTGIYATKGSRAHKIPWTYKDPNGKWHTT--YGQMPQPFWEPA 122 (135) T ss_pred ---cchhhhcceeEEeecCcEEEEEecCCCccchhhcccccccCCCccccccccccccCCcceeec--CCcCCCcchhHH Confidence 13444445554 34455789999999999999997543222211111111 1222222 358999998654 Q ss_pred HH-HHHHHHHHHH Q lcl|NC_020839. 153 AG-DQENIEAALM 164 (169) Q Consensus 153 ~~-d~~~I~~~i~ 164 (169) =+ .+..|.++|. T Consensus 123 ~~~~~~~~~~~i~ 135 (135) T protein:vir:96 123 IDAGRQTFEQYFS 135 (135) T ss_pred HHHHHHHHHHhcC Confidence 33 3444555555 No 33 >protein:vir:96358 Length: 115 # NCBI annotation: ORF045 # Family: family:all:180 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239651;genbank:gi:66395408;genbank:GeneID:5132834 Probab=98.34 E-value=1e-08 Score=64.34 Aligned_cols=111 Identities=15% Similarity=0.208 Sum_probs=77.3 Q ss_pred EEEch-HHHHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhhhh Q lcl|NC_020839. 4 VDVKD-KELEAVFSGLEGRLSD-PSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLIGP 81 (169) Q Consensus 4 i~~~~-~~l~~~L~~l~~~~~~-~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~~~ 81 (169) |++++ ++|.+.|+.|...+.+ ....+.+-|..+....+.+-.. . ...|+ T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~--~-~~~p~-------------------------- 51 (115) T protein:vir:96 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKARE--V-MNKGY-------------------------- 51 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc--c-CCCCC-------------------------- Confidence 77776 7888888888766654 4566777777776655543111 0 01111 Q ss_pred hhhhhhhhhhheec-CCcEEEecCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCC-HHHHHHH Q lcl|NC_020839. 82 TKTLSSPSNFAVSS-GGDWARLSSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGIS-AGDQENI 159 (169) Q Consensus 82 ~~~~~~~~si~~~~-~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s-~~d~~~I 159 (169) .|+.+..||+++. ++..+.||++..||....||+. .|||||||.-. +..+..+ T Consensus 52 -~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~~vE~GT~------------------------km~a~Pfl~PA~~~~~~~~ 106 (115) T protein:vir:96 52 -WTGNLSRNIRYKKTGDLQYTITSHAAYSGFLEFGTR------------------------YMEAEPFMWPVYEVIRKST 106 (115) T ss_pred -CchhhhhcceeeecCceEEEeecCccchhhhccccc------------------------ccCCCCchhhhHHHHHHHH Confidence 2344455666554 4456899999999999999964 59999999866 3567888 Q ss_pred HHHHHHhcC Q lcl|NC_020839. 160 EAALMEWLE 168 (169) Q Consensus 160 ~~~i~~~l~ 168 (169) .+.|.+.|+ T Consensus 107 ~~~i~~~~k 115 (115) T protein:vir:96 107 VEELKALFE 115 (115) T ss_pred HHHHHHHhC Confidence 889999999 No 34 >protein:vir:96225 Length: 115 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239574;genbank:gi:66395330;genbank:GeneID:5132773 Probab=98.34 E-value=1e-08 Score=64.34 Aligned_cols=111 Identities=15% Similarity=0.208 Sum_probs=77.3 Q ss_pred EEEch-HHHHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhhhh Q lcl|NC_020839. 4 VDVKD-KELEAVFSGLEGRLSD-PSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLIGP 81 (169) Q Consensus 4 i~~~~-~~l~~~L~~l~~~~~~-~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~~~ 81 (169) |++++ ++|.+.|+.|...+.+ ....+.+-|..+....+.+-.. . ...|+ T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~--~-~~~p~-------------------------- 51 (115) T protein:vir:96 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKARE--V-MNKGY-------------------------- 51 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc--c-CCCCC-------------------------- Confidence 77776 7888888888766654 4566777777776655543111 0 01111 Q ss_pred hhhhhhhhhhheec-CCcEEEecCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCC-HHHHHHH Q lcl|NC_020839. 82 TKTLSSPSNFAVSS-GGDWARLSSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGIS-AGDQENI 159 (169) Q Consensus 82 ~~~~~~~~si~~~~-~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s-~~d~~~I 159 (169) .|+.+..||+++. ++..+.||++..||....||+. .|||||||.-. +..+..+ T Consensus 52 -~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~~vE~GT~------------------------km~a~Pfl~PA~~~~~~~~ 106 (115) T protein:vir:96 52 -WTGNLSRNIRYKKTGDLQYTITSHAAYSGFLEFGTR------------------------YMEAEPFMWPVYEVIRKST 106 (115) T ss_pred -CchhhhhcceeeecCceEEEeecCccchhhhccccc------------------------ccCCCCchhhhHHHHHHHH Confidence 2344455666554 4456899999999999999964 59999999866 3567888 Q ss_pred HHHHHHhcC Q lcl|NC_020839. 160 EAALMEWLE 168 (169) Q Consensus 160 ~~~i~~~l~ 168 (169) .+.|.+.|+ T Consensus 107 ~~~i~~~~k 115 (115) T protein:vir:96 107 VEELKALFE 115 (115) T ss_pred HHHHHHHhC Confidence 889999999 No 35 >protein:vir:78858 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285365;genbank:gi:148717893;genbank:GeneID:5246989 Probab=98.34 E-value=1e-08 Score=64.34 Aligned_cols=111 Identities=15% Similarity=0.208 Sum_probs=77.3 Q ss_pred EEEch-HHHHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhhhh Q lcl|NC_020839. 4 VDVKD-KELEAVFSGLEGRLSD-PSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLIGP 81 (169) Q Consensus 4 i~~~~-~~l~~~L~~l~~~~~~-~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~~~ 81 (169) |++++ ++|.+.|+.|...+.+ ....+.+-|..+....+.+-.. . ...|+ T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~--~-~~~p~-------------------------- 51 (115) T protein:vir:78 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKARE--V-MNKGY-------------------------- 51 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc--c-CCCCC-------------------------- Confidence 77776 7888888888766654 4566777777776655543111 0 01111 Q ss_pred hhhhhhhhhhheec-CCcEEEecCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCC-HHHHHHH Q lcl|NC_020839. 82 TKTLSSPSNFAVSS-GGDWARLSSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGIS-AGDQENI 159 (169) Q Consensus 82 ~~~~~~~~si~~~~-~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s-~~d~~~I 159 (169) .|+.+..||+++. ++..+.||++..||....||+. .|||||||.-. +..+..+ T Consensus 52 -~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~~vE~GT~------------------------km~a~Pfl~PA~~~~~~~~ 106 (115) T protein:vir:78 52 -WTGNLSRNIRYKKTGDLQYTITSHAAYSGFLEFGTR------------------------YMEAEPFMWPVYEVIRKST 106 (115) T ss_pred -CchhhhhcceeeecCceEEEeecCccchhhhccccc------------------------ccCCCCchhhhHHHHHHHH Confidence 2344455666554 4456899999999999999964 59999999866 3567888 Q ss_pred HHHHHHhcC Q lcl|NC_020839. 160 EAALMEWLE 168 (169) Q Consensus 160 ~~~i~~~l~ 168 (169) .+.|.+.|+ T Consensus 107 ~~~i~~~~k 115 (115) T protein:vir:78 107 VEELKALFE 115 (115) T ss_pred HHHHHHHhC Confidence 889999999 No 36 >protein:vir:103917 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873996;genbank:gi:118430771;genbank:GeneID:4525409 Probab=98.34 E-value=1e-08 Score=64.34 Aligned_cols=111 Identities=15% Similarity=0.208 Sum_probs=77.3 Q ss_pred EEEch-HHHHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhhhh Q lcl|NC_020839. 4 VDVKD-KELEAVFSGLEGRLSD-PSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLIGP 81 (169) Q Consensus 4 i~~~~-~~l~~~L~~l~~~~~~-~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~~~ 81 (169) |++++ ++|.+.|+.|...+.+ ....+.+-|..+....+.+-.. . ...|+ T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~--~-~~~p~-------------------------- 51 (115) T protein:vir:10 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKARE--V-MNKGY-------------------------- 51 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc--c-CCCCC-------------------------- Confidence 77776 7888888888766654 4566777777776655543111 0 01111 Q ss_pred hhhhhhhhhhheec-CCcEEEecCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCC-HHHHHHH Q lcl|NC_020839. 82 TKTLSSPSNFAVSS-GGDWARLSSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGIS-AGDQENI 159 (169) Q Consensus 82 ~~~~~~~~si~~~~-~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s-~~d~~~I 159 (169) .|+.+..||+++. ++..+.||++..||....||+. .|||||||.-. +..+..+ T Consensus 52 -~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~~vE~GT~------------------------km~a~Pfl~PA~~~~~~~~ 106 (115) T protein:vir:10 52 -WTGNLSRNIRYKKTGDLQYTITSHAAYSGFLEFGTR------------------------YMEAEPFMWPVYEVIRKST 106 (115) T ss_pred -CchhhhhcceeeecCceEEEeecCccchhhhccccc------------------------ccCCCCchhhhHHHHHHHH Confidence 2344455666554 4456899999999999999964 59999999866 3567888 Q ss_pred HHHHHHhcC Q lcl|NC_020839. 160 EAALMEWLE 168 (169) Q Consensus 160 ~~~i~~~l~ 168 (169) .+.|.+.|+ T Consensus 107 ~~~i~~~~k 115 (115) T protein:vir:10 107 VEELKALFE 115 (115) T ss_pred HHHHHHHhC Confidence 889999999 No 37 >protein:vir:9312 Length: 115 # NCBI annotation: phi Mu50B-like protein # Family: family:all:180 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803290;genbank:gi:29028600;genbank:GeneID:1258048 Probab=98.34 E-value=1e-08 Score=64.34 Aligned_cols=111 Identities=15% Similarity=0.208 Sum_probs=77.3 Q ss_pred EEEch-HHHHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhhhh Q lcl|NC_020839. 4 VDVKD-KELEAVFSGLEGRLSD-PSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLIGP 81 (169) Q Consensus 4 i~~~~-~~l~~~L~~l~~~~~~-~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~~~ 81 (169) |++++ ++|.+.|+.|...+.+ ....+.+-|..+....+.+-.. . ...|+ T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~--~-~~~p~-------------------------- 51 (115) T protein:vir:93 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKARE--V-MNKGY-------------------------- 51 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc--c-CCCCC-------------------------- Confidence 77776 7888888888766654 4566777777776655543111 0 01111 Q ss_pred hhhhhhhhhhheec-CCcEEEecCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCC-HHHHHHH Q lcl|NC_020839. 82 TKTLSSPSNFAVSS-GGDWARLSSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGIS-AGDQENI 159 (169) Q Consensus 82 ~~~~~~~~si~~~~-~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s-~~d~~~I 159 (169) .|+.+..||+++. ++..+.||++..||....||+. .|||||||.-. +..+..+ T Consensus 52 -~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~~vE~GT~------------------------km~a~Pfl~PA~~~~~~~~ 106 (115) T protein:vir:93 52 -WTGNLSRNIRYKKTGDLQYTITSHAAYSGFLEFGTR------------------------YMEAEPFMWPVYEVIRKST 106 (115) T ss_pred -CchhhhhcceeeecCceEEEeecCccchhhhccccc------------------------ccCCCCchhhhHHHHHHHH Confidence 2344455666554 4456899999999999999964 59999999866 3567888 Q ss_pred HHHHHHhcC Q lcl|NC_020839. 160 EAALMEWLE 168 (169) Q Consensus 160 ~~~i~~~l~ 168 (169) .+.|.+.|+ T Consensus 107 ~~~i~~~~k 115 (115) T protein:vir:93 107 VEELKALFE 115 (115) T ss_pred HHHHHHHhC Confidence 889999999 No 38 >protein:vir:97144 Length: 115 # NCBI annotation: ORF047 # Family: family:all:180 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239729;genbank:gi:66394911;genbank:GeneID:5130877 Probab=98.34 E-value=1e-08 Score=64.34 Aligned_cols=111 Identities=15% Similarity=0.208 Sum_probs=77.3 Q ss_pred EEEch-HHHHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhhhh Q lcl|NC_020839. 4 VDVKD-KELEAVFSGLEGRLSD-PSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLIGP 81 (169) Q Consensus 4 i~~~~-~~l~~~L~~l~~~~~~-~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~~~ 81 (169) |++++ ++|.+.|+.|...+.+ ....+.+-|..+....+.+-.. . ...|+ T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~--~-~~~p~-------------------------- 51 (115) T protein:vir:97 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKARE--V-MNKGY-------------------------- 51 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc--c-CCCCC-------------------------- Confidence 77776 7888888888766654 4566777777776655543111 0 01111 Q ss_pred hhhhhhhhhhheec-CCcEEEecCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCC-HHHHHHH Q lcl|NC_020839. 82 TKTLSSPSNFAVSS-GGDWARLSSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGIS-AGDQENI 159 (169) Q Consensus 82 ~~~~~~~~si~~~~-~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s-~~d~~~I 159 (169) .|+.+..||+++. ++..+.||++..||....||+. .|||||||.-. +..+..+ T Consensus 52 -~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~~vE~GT~------------------------km~a~Pfl~PA~~~~~~~~ 106 (115) T protein:vir:97 52 -WTGNLSRNIRYKKTGDLQYTITSHAAYSGFLEFGTR------------------------YMEAEPFMWPVYEVIRKST 106 (115) T ss_pred -CchhhhhcceeeecCceEEEeecCccchhhhccccc------------------------ccCCCCchhhhHHHHHHHH Confidence 2344455666554 4456899999999999999964 59999999866 3567888 Q ss_pred HHHHHHhcC Q lcl|NC_020839. 160 EAALMEWLE 168 (169) Q Consensus 160 ~~~i~~~l~ 168 (169) .+.|.+.|+ T Consensus 107 ~~~i~~~~k 115 (115) T protein:vir:97 107 VEELKALFE 115 (115) T ss_pred HHHHHHHhC Confidence 889999999 No 39 >protein:vir:94796 Length: 137 # NCBI annotation: ORF050 # Family: family:all:180 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240540;genbank:gi:66396237;genbank:GeneID:5133576 Probab=98.31 E-value=5.6e-09 Score=65.81 Aligned_cols=128 Identities=13% Similarity=0.062 Sum_probs=76.4 Q ss_pred CeEEEEchHHHHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhh Q lcl|NC_020839. 1 MFTVDVKDKELEAVFSGLEGRLSD-PSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLI 79 (169) Q Consensus 1 mi~i~~~~~~l~~~L~~l~~~~~~-~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~ 79 (169) |.++....++|.+.|+.+...+.. ..+.|.+.+..+.+..+.. .| + T Consensus 1 Ma~~~~G~~~l~~~L~~~~~~~~~~~~~al~~~a~~v~~~ak~~-----aP----v------------------------ 47 (137) T protein:vir:94 1 MAKVKYGNWDLVKELENYERDIERWVKRGIAKTTVKIHNTIISL-----MP----V------------------------ 47 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CC----c------------------------ Confidence 999988888999999988877653 3566666677666655432 22 1 Q ss_pred hhhhhhhhhhhhhe--ecCCcEEEecCcccchhhhhcccccccchhhh--hhhhccccccccCc---eeeccCcccCCCC Q lcl|NC_020839. 80 GPTKTLSSPSNFAV--SSGGDWARLSSRAIQSAVMQFGAKKGAFGSYQ--GKGFGGSSSTISIP---WGDIPARPFMGIS 152 (169) Q Consensus 80 ~~~~~~~~~~si~~--~~~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~--~~~~~~~~~~~~~~---~~~iPaRpfLG~s 152 (169) +|+.|..||++ ..+.-.+.||++..||...+||..+....... .....+....+... ...+|++|||--+ T Consensus 48 ---dTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA 124 (137) T protein:vir:94 48 ---DTGYLRESVTMDFKDGGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPA 124 (137) T ss_pred ---CcchhhcCceeEeecCcEEEEEecCCCcccccccCccccccCCCcccccccccceeccCCceeecCCcCCCcchHHH Confidence 13334445544 44455789999999999999996543221111 11111111112222 2359999998755 Q ss_pred HH-HHHHHHHHHH Q lcl|NC_020839. 153 AG-DQENIEAALM 164 (169) Q Consensus 153 ~~-d~~~I~~~i~ 164 (169) -+ .+..|...|. T Consensus 125 ~~~~~~~~~~~l~ 137 (137) T protein:vir:94 125 IDAGRVFFNKYFS 137 (137) T ss_pred HHHHHHHHHHhhC Confidence 33 3445555555 No 40 >protein:vir:95789 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950593;genbank:gi:119953788;genbank:GeneID:5076859 Probab=98.28 E-value=1.8e-08 Score=63.02 Aligned_cols=107 Identities=16% Similarity=0.159 Sum_probs=77.2 Q ss_pred eEEEEch-HHHHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhh Q lcl|NC_020839. 2 FTVDVKD-KELEAVFSGLEGRLSD-PSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLI 79 (169) Q Consensus 2 i~i~~~~-~~l~~~L~~l~~~~~~-~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~ 79 (169) ++|++++ ++|.+.|+.+.....+ ....+...|..+.+..+.+ .| - T Consensus 1 msi~i~Gld~l~~~l~~~~~~~~~~v~~al~~~a~~i~~~ak~~-----aP-------v--------------------- 47 (114) T protein:vir:95 1 MAIKWQGIEKLVATISNAQPKAVEQSLQVLKNNGEKGKRIAKQL-----AP-------K--------------------- 47 (114) T ss_pred CeeeeehHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CC-------c--------------------- Confidence 8888886 7888888888766543 4567777777776666543 22 0 Q ss_pred hhhhhhhhhhhhheecCCcEEEecCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCCH-----H Q lcl|NC_020839. 80 GPTKTLSSPSNFAVSSGGDWARLSSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGISA-----G 154 (169) Q Consensus 80 ~~~~~~~~~~si~~~~~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s~-----~ 154 (169) .++.+..||+++.++..++||++..||..-.||.. .+|++|||.-+- . T Consensus 48 ---~TG~Lr~sI~~~~~g~~~~V~~~~~Ya~yvE~GT~------------------------~~~aqPfl~pa~~~~~~~ 100 (114) T protein:vir:95 48 ---DTEFLKDHITTSYPGMEAHIHGEAGYDGYQEYGTR------------------------FQPGTPHFRPMMEQIQPQ 100 (114) T ss_pred ---CchhhhhceeeecCceEEEeecCCCccceeecCcc------------------------ccCCCccchhhHHHHHHH Confidence 13444567888888889999999999999999963 589999998653 2 Q ss_pred HHHHHHHHHHHhcC Q lcl|NC_020839. 155 DQENIEAALMEWLE 168 (169) Q Consensus 155 d~~~I~~~i~~~l~ 168 (169) -.+.|.+.|..-|+ T Consensus 101 ~~~~l~~~l~~~~k 114 (114) T protein:vir:95 101 FQKDMTDVMKGAFK 114 (114) T ss_pred HHHHHHHHHHhhcC Confidence 34556666666666 No 41 >protein:vir:3617 Length: 112 # NCBI annotation: ORF40 # Family: family:all:180 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112703;genbank:gi:13786571;genbank:GeneID:921069 Probab=98.26 E-value=1.4e-08 Score=63.70 Aligned_cols=107 Identities=13% Similarity=0.125 Sum_probs=70.6 Q ss_pred CeEEEEch-HHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhh Q lcl|NC_020839. 1 MFTVDVKD-KELEAVFSGLEGRLSDPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLI 79 (169) Q Consensus 1 mi~i~~~~-~~l~~~L~~l~~~~~~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~ 79 (169) -++|++++ ++|.+.|+.+... ......+++.+..+.+..+. ..| . T Consensus 2 ~~~i~i~Gld~l~~~L~~~~~~-~~~~~al~~~~~~i~~~ak~-----~aP----v------------------------ 47 (112) T protein:vir:36 2 KSSLSFKGIDQLVKHLDKAASL-KGVQQVVKSNTSNMTANMQK-----LVP----V------------------------ 47 (112) T ss_pred ceeeeehhHHHHHHHHHhhhhH-HHHHHHHHHHHHHHHHHHHH-----hCC----C------------------------ Confidence 45777775 6666666655432 23455666666666655532 223 0 Q ss_pred hhhhhhhhhhhhhee--cCCcEEEecCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCC-HHHH Q lcl|NC_020839. 80 GPTKTLSSPSNFAVS--SGGDWARLSSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGIS-AGDQ 156 (169) Q Consensus 80 ~~~~~~~~~~si~~~--~~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s-~~d~ 156 (169) .++.+..||++. .++..+.||++..||....||.. .+||+|||--+ +..+ T Consensus 48 ---dTG~Lr~si~~~~~~~~~~~~V~~~~~Ya~~vE~GT~------------------------k~~a~Pfl~pa~~~~~ 100 (112) T protein:vir:36 48 ---DTGYMKRSIKMELTEGGFSGQAGPHTDYSAYVEYGTR------------------------FQSAQPFVKPAYNEQK 100 (112) T ss_pred ---CchhhhhceeeeecCCceEEEeecCCCccceeecccc------------------------ccCCCcchhhhHHHHH Confidence 122333455543 34558999999999999999964 59999998644 4567 Q ss_pred HHHHHHHHHhcC Q lcl|NC_020839. 157 ENIEAALMEWLE 168 (169) Q Consensus 157 ~~I~~~i~~~l~ 168 (169) ..+.+.|.+.|+ T Consensus 101 ~~~~~~i~~~lr 112 (112) T protein:vir:36 101 GVFIKDLERLLK 112 (112) T ss_pred HHHHHHHHHHcC Confidence 788899999999 No 42 >protein:vir:106506 Length: 137 # NCBI annotation: Pas21 # Family: family:all:1084 # MgeID: mge:1680 # MgeName: phiAsp2 # Cross-refs: genbank:acc:YP_024807;genbank:gi:48697422;genbank:GeneID:2846163 Probab=98.19 E-value=1.2e-08 Score=63.99 Aligned_cols=124 Identities=13% Similarity=0.066 Sum_probs=67.0 Q ss_pred Ce--EEEEchHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhh Q lcl|NC_020839. 1 MF--TVDVKDKELEAVFSGLEGRLSDPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPL 78 (169) Q Consensus 1 mi--~i~~~~~~l~~~L~~l~~~~~~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l 78 (169) || ++++|-..++..+..+ .+..+..++..+.+..+. ..|. T Consensus 1 ~~~~~~~l~~~~l~~~~~~~------~~~~~~~~a~~ve~~ak~-----~aPv--------------------------- 42 (137) T protein:vir:10 1 MVAHTLRIERAQLHGLGMDE------ARKAVNRVVRRTFTRSQI-----LAPV--------------------------- 42 (137) T ss_pred CcccccccChhhHhhHHHHH------HHHHHHHHHHHHHHHHHh-----cCCc--------------------------- Confidence 77 4555555444444333 445566666666665422 1221 Q ss_pred hhhhhhhhhhhhhheec---CCcE--EEecCcccchhhhhcccccc--cchhhhhhhhccccccccCceeecc---Cccc Q lcl|NC_020839. 79 IGPTKTLSSPSNFAVSS---GGDW--ARLSSRAIQSAVMQFGAKKG--AFGSYQGKGFGGSSSTISIPWGDIP---ARPF 148 (169) Q Consensus 79 ~~~~~~~~~~~si~~~~---~~~~--v~vGt~~~YA~iHqfGg~~~--~~~~~~~~~~~~~~~~~~~~~~~iP---aRpf 148 (169) +++.+..||.+.. ++.. ..|+++++||++|+||+... .....+.+.|.+.++++..+.|+.| +||| T Consensus 43 ----~TG~Lr~SI~~~~~~~~g~~v~~~V~~~~~YA~~ve~GT~ph~I~pk~~kaL~f~~~G~~vf~k~V~hPG~k~~Pf 118 (137) T protein:vir:10 43 ----DTGYLRASGRLVLGRERGAVVIGSVEYTARYAAAVHNGRRALTIRAKGNGRLKFTVEGRTVYARSVHQPARAGRPY 118 (137) T ss_pred ----CchhhhccceeeeeeccccEEEEEecCCcccceeeecCCCCceeecCCCccceeecCCeeEeccceecCCCCCChh Confidence 1233344565433 2233 45889999999999998533 2234456666777888888888877 9999 Q ss_pred CCCCHHHHHHHHHH-HHHhcC Q lcl|NC_020839. 149 MGISAGDQENIEAA-LMEWLE 168 (169) Q Consensus 149 LG~s~~d~~~I~~~-i~~~l~ 168 (169) |- +..++.+-.. +.-+|- T Consensus 119 L~--~Al~~~~~~~~~~~~~~ 137 (137) T protein:vir:10 119 LS--QALREVAPQEGFRVTIG 137 (137) T ss_pred hH--HHHHHhhcccceeEeeC Confidence 53 1111111000 000000 No 43 >protein:vir:96105 Length: 193 # NCBI annotation: hypothetical protein ORF028 # Family: family:all:503 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294445;genbank:gi:149408342;genbank:GeneID:5237224 Probab=98.18 E-value=2.5e-09 Score=67.70 Aligned_cols=103 Identities=13% Similarity=0.081 Sum_probs=50.3 Q ss_pred hcCCCCCCCCcccchhHHHHHhccCccccchhhhhhhhhhhhhhhhhee-------cCCcEEEecCcccc-hhhhhcccc Q lcl|NC_020839. 46 AGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLIGPTKTLSSPSNFAVS-------SGGDWARLSSRAIQ-SAVMQFGAK 117 (169) Q Consensus 46 ~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~~~~~~~~~~~si~~~-------~~~~~v~vGt~~~Y-A~iHqfGg~ 117 (169) -.-.-|...++. ++.....+. ...+.+. .+++..++|+++.| |++|+||+. T Consensus 1 m~~~~~~~~~~~--------------------~~~~l~~l~-~~~v~vGi~~~~~~~~~~~~~~G~~va~iAai~EfG~~ 59 (193) T protein:vir:96 1 MSLRRDSELIAA--------------------HLQMLRAMR-GRSVSAGWYSTARYPDKAGGSVGIQVARIARLNEYGGT 59 (193) T ss_pred CeeccchHHHHH--------------------HHHHHHHhc-CCeEEEEEcCCCCCCCcccccccchHHHHHhHHHcCCc Confidence 000000011111 111111111 0111111 12345678999988 999999998 Q ss_pred cccchhhhhhhhc-----------------cccccccCceeeccCcccCCCCHHH-HHHHHHHHHHhcCC Q lcl|NC_020839. 118 KGAFGSYQGKGFG-----------------GSSSTISIPWGDIPARPFMGISAGD-QENIEAALMEWLEP 169 (169) Q Consensus 118 ~~~~~~~~~~~~~-----------------~~~~~~~~~~~~iPaRpfLG~s~~d-~~~I~~~i~~~l~p 169 (169) +.......+.+.. ........+.++||+||||..+-++ .+.+.+.+...++= T Consensus 60 I~~~~~~~~~~~~~~~g~~~~~~~~k~~~~~~~~~~~~~~v~IPaRPFlr~t~~~~~~~~~~~~~~~~~~ 129 (193) T protein:vir:96 60 IDHPGGTRYIRDAIVRGRFVGVRFVRNDFPGETEVTKPHRITIPARPFMRYAWNLFSADRAAIQNRIAMR 129 (193) T ss_pred cccCccceeeeeccccccccccceeccCcceeeEeecceeccCCCcchhhhhHHHHHHHHHHHHHHHHHH Confidence 8754433221111 1111234457899999999998655 45566655555443 No 44 >protein:vir:94490 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240680;genbank:gi:66396374;genbank:GeneID:5133754 Probab=98.18 E-value=1.5e-08 Score=63.53 Aligned_cols=128 Identities=13% Similarity=0.103 Sum_probs=75.6 Q ss_pred CeEEEEchHHHHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhh Q lcl|NC_020839. 1 MFTVDVKDKELEAVFSGLEGRLSD-PSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLI 79 (169) Q Consensus 1 mi~i~~~~~~l~~~L~~l~~~~~~-~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~ 79 (169) |.++....++|.+.|+.+...+.+ ..+.+.+.+..+.+..+.. .| + T Consensus 1 Ma~~~~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~-----aP----v------------------------ 47 (137) T protein:vir:94 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISL-----MP----V------------------------ 47 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CC----c------------------------ Confidence 998876778899999888776643 4566666666666655432 22 1 Q ss_pred hhhhhhhhhhhhhee--cCCcEEEecCcccchhhhhcccccccchhhhh--hhhccccccccCce---eeccCcccCCCC Q lcl|NC_020839. 80 GPTKTLSSPSNFAVS--SGGDWARLSSRAIQSAVMQFGAKKGAFGSYQG--KGFGGSSSTISIPW---GDIPARPFMGIS 152 (169) Q Consensus 80 ~~~~~~~~~~si~~~--~~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~--~~~~~~~~~~~~~~---~~iPaRpfLG~s 152 (169) +|+.|.+||++. .+.-.++||++.+||...+||..+........ ....+....+...+ ..+|++|||--+ T Consensus 48 ---dTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA 124 (137) T protein:vir:94 48 ---DTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPA 124 (137) T ss_pred ---cccchhccceeEeecCceEEEEecCCCcccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHH Confidence 134444466544 34457999999999999999975432221111 11111111112222 358999998754 Q ss_pred HH-HHHHHHHHHH Q lcl|NC_020839. 153 AG-DQENIEAALM 164 (169) Q Consensus 153 ~~-d~~~I~~~i~ 164 (169) -+ .+..|.+.|. T Consensus 125 ~~~~~~~~~~~l~ 137 (137) T protein:vir:94 125 IDAGRAFFNKYFS 137 (137) T ss_pred HHHHHHHHHHhhC Confidence 32 3455555555 No 45 >protein:vir:97427 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240753;genbank:gi:66396447;genbank:GeneID:5133783 Probab=98.18 E-value=1.5e-08 Score=63.53 Aligned_cols=128 Identities=13% Similarity=0.103 Sum_probs=75.6 Q ss_pred CeEEEEchHHHHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhh Q lcl|NC_020839. 1 MFTVDVKDKELEAVFSGLEGRLSD-PSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLI 79 (169) Q Consensus 1 mi~i~~~~~~l~~~L~~l~~~~~~-~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~ 79 (169) |.++....++|.+.|+.+...+.+ ..+.+.+.+..+.+..+.. .| + T Consensus 1 Ma~~~~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~-----aP----v------------------------ 47 (137) T protein:vir:97 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISL-----MP----V------------------------ 47 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CC----c------------------------ Confidence 998876778899999888776643 4566666666666655432 22 1 Q ss_pred hhhhhhhhhhhhhee--cCCcEEEecCcccchhhhhcccccccchhhhh--hhhccccccccCce---eeccCcccCCCC Q lcl|NC_020839. 80 GPTKTLSSPSNFAVS--SGGDWARLSSRAIQSAVMQFGAKKGAFGSYQG--KGFGGSSSTISIPW---GDIPARPFMGIS 152 (169) Q Consensus 80 ~~~~~~~~~~si~~~--~~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~--~~~~~~~~~~~~~~---~~iPaRpfLG~s 152 (169) +|+.|.+||++. .+.-.++||++.+||...+||..+........ ....+....+...+ ..+|++|||--+ T Consensus 48 ---dTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA 124 (137) T protein:vir:97 48 ---DTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPA 124 (137) T ss_pred ---cccchhccceeEeecCceEEEEecCCCcccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHH Confidence 134444466544 34457999999999999999975432221111 11111111112222 358999998754 Q ss_pred HH-HHHHHHHHHH Q lcl|NC_020839. 153 AG-DQENIEAALM 164 (169) Q Consensus 153 ~~-d~~~I~~~i~ 164 (169) -+ .+..|.+.|. T Consensus 125 ~~~~~~~~~~~l~ 137 (137) T protein:vir:97 125 IDAGRAFFNKYFS 137 (137) T ss_pred HHHHHHHHHHhhC Confidence 32 3455555555 No 46 >protein:vir:93738 Length: 137 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240463;genbank:gi:66396153;genbank:GeneID:5133507 Probab=98.18 E-value=1.5e-08 Score=63.53 Aligned_cols=128 Identities=13% Similarity=0.103 Sum_probs=75.6 Q ss_pred CeEEEEchHHHHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhh Q lcl|NC_020839. 1 MFTVDVKDKELEAVFSGLEGRLSD-PSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLI 79 (169) Q Consensus 1 mi~i~~~~~~l~~~L~~l~~~~~~-~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~ 79 (169) |.++....++|.+.|+.+...+.+ ..+.+.+.+..+.+..+.. .| + T Consensus 1 Ma~~~~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~-----aP----v------------------------ 47 (137) T protein:vir:93 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISL-----MP----V------------------------ 47 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CC----c------------------------ Confidence 998876778899999888776643 4566666666666655432 22 1 Q ss_pred hhhhhhhhhhhhhee--cCCcEEEecCcccchhhhhcccccccchhhhh--hhhccccccccCce---eeccCcccCCCC Q lcl|NC_020839. 80 GPTKTLSSPSNFAVS--SGGDWARLSSRAIQSAVMQFGAKKGAFGSYQG--KGFGGSSSTISIPW---GDIPARPFMGIS 152 (169) Q Consensus 80 ~~~~~~~~~~si~~~--~~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~--~~~~~~~~~~~~~~---~~iPaRpfLG~s 152 (169) +|+.|.+||++. .+.-.++||++.+||...+||..+........ ....+....+...+ ..+|++|||--+ T Consensus 48 ---dTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA 124 (137) T protein:vir:93 48 ---DTGYLRESVTMDFKDSGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPA 124 (137) T ss_pred ---cccchhccceeEeecCceEEEEecCCCcccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHH Confidence 134444466544 34457999999999999999975432221111 11111111112222 358999998754 Q ss_pred HH-HHHHHHHHHH Q lcl|NC_020839. 153 AG-DQENIEAALM 164 (169) Q Consensus 153 ~~-d~~~I~~~i~ 164 (169) -+ .+..|.+.|. T Consensus 125 ~~~~~~~~~~~l~ 137 (137) T protein:vir:93 125 IDAGRAFFNKYFS 137 (137) T ss_pred HHHHHHHHHHhhC Confidence 32 3455555555 No 47 >protein:vir:95894 Length: 137 # NCBI annotation: ORF046 # Family: family:all:180 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240389;genbank:gi:66396083;genbank:GeneID:5133405 Probab=98.13 E-value=2.1e-08 Score=62.70 Aligned_cols=128 Identities=14% Similarity=0.105 Sum_probs=74.7 Q ss_pred CeEEEEchHHHHHHHHHHHHHhh-hHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhh Q lcl|NC_020839. 1 MFTVDVKDKELEAVFSGLEGRLS-DPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLI 79 (169) Q Consensus 1 mi~i~~~~~~l~~~L~~l~~~~~-~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~ 79 (169) |.++....++|.+.|+.+...+. ...+.+...+..+.+..+.. .| + T Consensus 1 Ma~~~~G~~~l~~~l~~~~~~~~~~~~~~~~~~a~~v~~~ak~~-----aP----v------------------------ 47 (137) T protein:vir:95 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISL-----MP----V------------------------ 47 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CC----c------------------------ Confidence 99887677888888888877664 34566666666666655332 22 1 Q ss_pred hhhhhhhhhhhhhe--ecCCcEEEecCcccchhhhhcccccccchhhhh--hhhccccccccCce---eeccCcccCCCC Q lcl|NC_020839. 80 GPTKTLSSPSNFAV--SSGGDWARLSSRAIQSAVMQFGAKKGAFGSYQG--KGFGGSSSTISIPW---GDIPARPFMGIS 152 (169) Q Consensus 80 ~~~~~~~~~~si~~--~~~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~--~~~~~~~~~~~~~~---~~iPaRpfLG~s 152 (169) +|+.+.+||++ ..+.-.++||++..||...+||..+........ +...+....+...+ ..+|++|||--+ T Consensus 48 ---~TG~L~~Si~~~~~~~~~~~~V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA 124 (137) T protein:vir:95 48 ---DTGYLRESVTMDFKDGGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPA 124 (137) T ss_pred ---cchhhhcCeeeEeeCCceEEEEecCCCcccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHH Confidence 13444445554 334457899999999999999975432222111 11111111122222 348999998754 Q ss_pred HH-HHHHHHHHHH Q lcl|NC_020839. 153 AG-DQENIEAALM 164 (169) Q Consensus 153 ~~-d~~~I~~~i~ 164 (169) -+ .+..|.+.|. T Consensus 125 ~~~~~~~i~k~l~ 137 (137) T protein:vir:95 125 IDAGRAFFNKYFS 137 (137) T ss_pred HHHHHHHHHHhhC Confidence 32 3455555555 No 48 >protein:vir:97982 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1482 # MgeName: Orion # Cross-refs: genbank:acc:YP_655121;genbank:gi:109391871;genbank:GeneID:4157345 Probab=98.10 E-value=1.6e-08 Score=63.33 Aligned_cols=120 Identities=15% Similarity=0.053 Sum_probs=66.1 Q ss_pred CeEEEEchHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhhh Q lcl|NC_020839. 1 MFTVDVKDKELEAVFSGLEGRLSDPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLIG 80 (169) Q Consensus 1 mi~i~~~~~~l~~~L~~l~~~~~~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~~ 80 (169) +-++++|...+++.+... .+..+..++..+.+..+.+ . ||. T Consensus 7 ~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~v~~~ak~~-----a----Pvd------------------------ 47 (140) T protein:vir:97 7 RARIEIDEAALERESGEH------LRAFHRSLTRRIANQSRVA-----V----PVR------------------------ 47 (140) T ss_pred eeeeeeCHHHHHHHHHHH------HHHHHHHHHHHHHHHHHhc-----C----Ccc------------------------ Confidence 455555655555544333 2333455555554444332 2 231 Q ss_pred hhhhhhhhhhhhe--ecCC-c--EEEecCcccchhhhhccccccc--chhhhhhhhccccccccCceeecc---CcccCC Q lcl|NC_020839. 81 PTKTLSSPSNFAV--SSGG-D--WARLSSRAIQSAVMQFGAKKGA--FGSYQGKGFGGSSSTISIPWGDIP---ARPFMG 150 (169) Q Consensus 81 ~~~~~~~~~si~~--~~~~-~--~v~vGt~~~YA~iHqfGg~~~~--~~~~~~~~~~~~~~~~~~~~~~iP---aRpfLG 150 (169) ++.+..||.. ..++ . .+.|++++.||.+++||..... ....+.+.|.+.++.+...+|+.| ++|||- T Consensus 48 ---tG~Lr~SI~~~~~~~~~~~~~~~v~~~a~YA~~Ve~GT~ph~I~pk~~k~L~~~~~G~~~~~k~V~hpG~~a~Pfl~ 124 (140) T protein:vir:97 48 ---TGNLGRTIGELPQVYTPFRVRGGVEATADYAAPVHEGSRPHAIRARNAQYLHFWWHGREMFRKSVWHPGTRARPFMR 124 (140) T ss_pred ---chhhhccceeeeeeCCCceEEEEecCCccchhhhccCCCCceeecCCCccceeecCCCEEEeeeeecCCCCCChhHH Confidence 2333445553 2222 2 3567899999999999986442 233445555677778888888866 999964 Q ss_pred CCHH----HHHHHHHH Q lcl|NC_020839. 151 ISAG----DQENIEAA 162 (169) Q Consensus 151 ~s~~----d~~~I~~~ 162 (169) -.-+ .+..|... T Consensus 125 ~A~~~~~~~~~~i~~~ 140 (140) T protein:vir:97 125 NSAQRVVTNDPRVRMT 140 (140) T ss_pred HHHHHHhhhhhhccCC Confidence 3221 12333333 No 49 >protein:vir:107545 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1481 # MgeName: PG1 # Cross-refs: genbank:acc:NP_943803;genbank:gi:38638428;genbank:GeneID:2657225 Probab=98.10 E-value=1.6e-08 Score=63.33 Aligned_cols=120 Identities=15% Similarity=0.053 Sum_probs=66.1 Q ss_pred CeEEEEchHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhhh Q lcl|NC_020839. 1 MFTVDVKDKELEAVFSGLEGRLSDPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLIG 80 (169) Q Consensus 1 mi~i~~~~~~l~~~L~~l~~~~~~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~~ 80 (169) +-++++|...+++.+... .+..+..++..+.+..+.+ . ||. T Consensus 7 ~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~v~~~ak~~-----a----Pvd------------------------ 47 (140) T protein:vir:10 7 RARIEIDEAALERESGEH------LRAFHRSLTRRIANQSRVA-----V----PVR------------------------ 47 (140) T ss_pred eeeeeeCHHHHHHHHHHH------HHHHHHHHHHHHHHHHHhc-----C----Ccc------------------------ Confidence 455555655555544333 2333455555554444332 2 231 Q ss_pred hhhhhhhhhhhhe--ecCC-c--EEEecCcccchhhhhccccccc--chhhhhhhhccccccccCceeecc---CcccCC Q lcl|NC_020839. 81 PTKTLSSPSNFAV--SSGG-D--WARLSSRAIQSAVMQFGAKKGA--FGSYQGKGFGGSSSTISIPWGDIP---ARPFMG 150 (169) Q Consensus 81 ~~~~~~~~~si~~--~~~~-~--~v~vGt~~~YA~iHqfGg~~~~--~~~~~~~~~~~~~~~~~~~~~~iP---aRpfLG 150 (169) ++.+..||.. ..++ . .+.|++++.||.+++||..... ....+.+.|.+.++.+...+|+.| ++|||- T Consensus 48 ---tG~Lr~SI~~~~~~~~~~~~~~~v~~~a~YA~~Ve~GT~ph~I~pk~~k~L~~~~~G~~~~~k~V~hpG~~a~Pfl~ 124 (140) T protein:vir:10 48 ---TGNLGRTIGELPQVYTPFRVRGGVEATADYAAPVHEGSRPHAIRARNAQYLHFWWHGREMFRKSVWHPGTRARPFMR 124 (140) T ss_pred ---chhhhccceeeeeeCCCceEEEEecCCccchhhhccCCCCceeecCCCccceeecCCCEEEeeeeecCCCCCChhHH Confidence 2333445553 2222 2 3567899999999999986442 233445555677778888888866 999964 Q ss_pred CCHH----HHHHHHHH Q lcl|NC_020839. 151 ISAG----DQENIEAA 162 (169) Q Consensus 151 ~s~~----d~~~I~~~ 162 (169) -.-+ .+..|... T Consensus 125 ~A~~~~~~~~~~i~~~ 140 (140) T protein:vir:10 125 NSAQRVVTNDPRVRMT 140 (140) T ss_pred HHHHHHhhhhhhccCC Confidence 3221 12333333 No 50 >protein:vir:1273 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690765;genbank:gi:22855005;genbank:GeneID:955232 Probab=98.08 E-value=1.1e-07 Score=58.65 Aligned_cols=111 Identities=15% Similarity=0.322 Sum_probs=73.6 Q ss_pred CeEEEEch-HHHHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhh Q lcl|NC_020839. 1 MFTVDVKD-KELEAVFSGLEGRLSD-PSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPL 78 (169) Q Consensus 1 mi~i~~~~-~~l~~~L~~l~~~~~~-~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l 78 (169) |++|++++ ++|.+.|+.|...... .+..+..-|..+.+..+.+ .|-+..| T Consensus 1 M~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~~k~~-----ap~~~~~----------------------- 52 (127) T protein:vir:12 1 MADMSFDGIDDLTQYFEKIGGDIEKVEPVALKAGGEIIAERQRSH-----VNRSDKK----------------------- 52 (127) T ss_pred CeeeeehhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHh-----CCCCCCC----------------------- Confidence 99999997 8899999998766543 4677777777777777655 3321111 Q ss_pred hhhhhhhhhhhhhhe------ecCCcEEEecCc---ccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccC Q lcl|NC_020839. 79 IGPTKTLSSPSNFAV------SSGGDWARLSSR---AIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFM 149 (169) Q Consensus 79 ~~~~~~~~~~~si~~------~~~~~~v~vGt~---~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfL 149 (169) ++.++. +|.+ ..+...+.||.+ ..|+.+..||.. .+|++||| T Consensus 53 ---tg~l~~--~I~~~~~k~~~~g~~~v~Vg~~~~~~~y~~f~E~GT~------------------------~~~a~Pf~ 103 (127) T protein:vir:12 53 ---QPHMQD--NITVSNVRESKDGVRFVAVGPNKKVAYRGRFLEWGTS------------------------KMPPQPFI 103 (127) T ss_pred ---hhHHHH--hhhccccccccCceeEEEEeeCCCCcceeeeeccCcc------------------------CCCCCccc Confidence 012222 3322 123446788854 568888999963 58999998 Q ss_pred CCC-----HHHHHHHHHHHHHhcC Q lcl|NC_020839. 150 GIS-----AGDQENIEAALMEWLE 168 (169) Q Consensus 150 G~s-----~~d~~~I~~~i~~~l~ 168 (169) .-+ ++-.+.+.+.+.+-|+ T Consensus 104 ~pa~~~~~~~~~~~~~~~~~~~lk 127 (127) T protein:vir:12 104 EKGGKEGEGPAVELMERILTAPIK 127 (127) T ss_pred hHhHHHHHHHHHHHHHHHHHHhcC Confidence 854 3345667777777788 No 51 >protein:vir:106623 Length: 115 # NCBI annotation: ORF049 # Family: family:all:180 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239497;genbank:gi:66395260;genbank:GeneID:4555777 Probab=98.05 E-value=1.1e-07 Score=58.71 Aligned_cols=111 Identities=17% Similarity=0.161 Sum_probs=72.5 Q ss_pred EEEch-HHHHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhhhh Q lcl|NC_020839. 4 VDVKD-KELEAVFSGLEGRLSD-PSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLIGP 81 (169) Q Consensus 4 i~~~~-~~l~~~L~~l~~~~~~-~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~~~ 81 (169) |++++ ++|.+.|+.+.....+ ....+..-|..+.+..+.+=.. -.+.|| T Consensus 1 i~i~Gld~L~~~l~~~~~~~~~~~~~al~~~~~~i~~~a~~~a~~---~~~~pv-------------------------- 51 (115) T protein:vir:10 1 MQSKGLKKLMNHLKVMHDDIEDDVDDILKNNAKEGVGIAVSNAKE---VMNKGY-------------------------- 51 (115) T ss_pred CeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc---ccCCCC-------------------------- Confidence 77776 7888888888766543 4566777777777666543211 011222 Q ss_pred hhhhhhhhhhheecCC-cEEEecCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCC-HHHHHHH Q lcl|NC_020839. 82 TKTLSSPSNFAVSSGG-DWARLSSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGIS-AGDQENI 159 (169) Q Consensus 82 ~~~~~~~~si~~~~~~-~~v~vGt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s-~~d~~~I 159 (169) .|+.+..||.+..++ -.+.|+++..||....||+. .|||||||.-. ++....+ T Consensus 52 -~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~~vEfGT~------------------------km~a~PFl~PA~~~~k~~~ 106 (115) T protein:vir:10 52 -WTGNLASLIEVKKIGDLHYRVISTAHYSGFLEFGTR------------------------YMEPAPFMFPTYQTLKKST 106 (115) T ss_pred -cchhhhhceeeeecCcEEEEeeCCCccchheecccc------------------------cCCCCCchhhhHHHHHHHH Confidence 133444566655543 46899999999999999964 59999999854 2345555 Q ss_pred HHHHHHhcC Q lcl|NC_020839. 160 EAALMEWLE 168 (169) Q Consensus 160 ~~~i~~~l~ 168 (169) .+.|.+-++ T Consensus 107 ~~~i~~~i~ 115 (115) T protein:vir:10 107 INDLKRLLS 115 (115) T ss_pred HHHHHHHhC Confidence 566666666 No 52 >protein:vir:99744 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004311;genbank:gi:122891765;genbank:GeneID:4712299 Probab=98.04 E-value=1.3e-07 Score=58.28 Aligned_cols=111 Identities=14% Similarity=0.169 Sum_probs=76.0 Q ss_pred EEEch-HHHHHHHHHHHHHhh-hHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhhhh Q lcl|NC_020839. 4 VDVKD-KELEAVFSGLEGRLS-DPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLIGP 81 (169) Q Consensus 4 i~~~~-~~l~~~L~~l~~~~~-~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~~~ 81 (169) |++++ ++|.+.|+.+..... .....+...|..+....+..=.. -++.|+ T Consensus 1 i~i~Gld~L~~~l~~~~~~~~~~v~~av~~~~~~i~~~a~~~a~~---~~~~p~-------------------------- 51 (115) T protein:vir:99 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKARE---VMNKGY-------------------------- 51 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc---ccCCCC-------------------------- Confidence 77776 788888888876654 35667777777777666543111 011111 Q ss_pred hhhhhhhhhhheecCC-cEEEecCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCC-HHHHHHH Q lcl|NC_020839. 82 TKTLSSPSNFAVSSGG-DWARLSSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGIS-AGDQENI 159 (169) Q Consensus 82 ~~~~~~~~si~~~~~~-~~v~vGt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s-~~d~~~I 159 (169) .|+.+..||.+..++ -.+.||++..||....||+. .|||||||.-. +.....+ T Consensus 52 -~TG~Lr~SI~~~~~g~~~~~V~~~~~Ya~~vE~GT~------------------------~m~a~PFl~PA~~~~k~~~ 106 (115) T protein:vir:99 52 -WTGNLSRNIRYKKTVDLQYTITSHAAYSGFLEFGTR------------------------YMEAEPFMWPVYEVIRKST 106 (115) T ss_pred -cchhhhhceeeeecCcEEEEecCCcccccccccccc------------------------ccCCCCcchhhHHHHHHHH Confidence 234455567666554 47899999999999999964 59999999855 3456777 Q ss_pred HHHHHHhcC Q lcl|NC_020839. 160 EAALMEWLE 168 (169) Q Consensus 160 ~~~i~~~l~ 168 (169) .+.|.+-++ T Consensus 107 ~~~l~~~~k 115 (115) T protein:vir:99 107 VEELKTLFE 115 (115) T ss_pred HHHHHHHhC Confidence 777888888 No 53 >protein:vir:107099 Length: 137 # NCBI annotation: conserved phage protein # Family: family:all:180 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950610;genbank:gi:119953690;genbank:GeneID:4643108 Probab=98.01 E-value=4.4e-08 Score=60.90 Aligned_cols=128 Identities=15% Similarity=0.139 Sum_probs=79.0 Q ss_pred CeEEEEchHHHHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhh Q lcl|NC_020839. 1 MFTVDVKDKELEAVFSGLEGRLSD-PSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLI 79 (169) Q Consensus 1 mi~i~~~~~~l~~~L~~l~~~~~~-~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~ 79 (169) |-.+....++|.+.|+.+...+.+ ....+.+++..+.+..+.+ .| + T Consensus 1 Ma~~~~Gl~~l~~~l~~~~~~~~~~~~~al~~~a~~i~~~ak~~-----aP----v------------------------ 47 (137) T protein:vir:10 1 MAKVKYGNWELVKELEDFEKETIRWAKKGIAKTTTIIHNSIVSN-----MP----V------------------------ 47 (137) T ss_pred CchhHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CC----c------------------------ Confidence 999977778999999998877753 5677888888888776654 23 1 Q ss_pred hhhhhhhhhhhhhee--cCCcEEEecCcccchhhhhcccccccchhhh--hhhhccccccccCce---eeccCcccCCCC Q lcl|NC_020839. 80 GPTKTLSSPSNFAVS--SGGDWARLSSRAIQSAVMQFGAKKGAFGSYQ--GKGFGGSSSTISIPW---GDIPARPFMGIS 152 (169) Q Consensus 80 ~~~~~~~~~~si~~~--~~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~--~~~~~~~~~~~~~~~---~~iPaRpfLG~s 152 (169) +|+.+..||++. .++-.+.||++++||...+||.......... ...+.+....+...+ .-+|+||||==+ T Consensus 48 ---dTG~Lr~SI~~~~~~~~~~~~V~~~~~Ya~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA 124 (137) T protein:vir:10 48 ---DTGYLRESVSMDFKKGGLTGVINIGSEYAVYVNYGTGIYAVGPGGSRAKNIPWCYKDADGHWHTTKGQHAQPFWEPA 124 (137) T ss_pred ---CcchhhcCeeEEeeCCcEEEEEecCCCcccccccCccccccCCCccccccccceeeccccceeccCCCCCCcchhHH Confidence 133444466543 3445689999999999999996443211111 111111111122222 248999998644 Q ss_pred H-HHHHHHHHHHH Q lcl|NC_020839. 153 A-GDQENIEAALM 164 (169) Q Consensus 153 ~-~d~~~I~~~i~ 164 (169) - +.+..|...|. T Consensus 125 ~~~~~~~i~k~i~ 137 (137) T protein:vir:10 125 IDEGRAFFNKYFS 137 (137) T ss_pred HHHHHHHHHHhcC Confidence 3 33555666666 No 54 >protein:vir:96121 Length: 137 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240082;genbank:gi:66395767;genbank:GeneID:5133101 Probab=98.00 E-value=6.5e-08 Score=59.98 Aligned_cols=128 Identities=15% Similarity=0.142 Sum_probs=74.9 Q ss_pred CeEEEEchHHHHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhh Q lcl|NC_020839. 1 MFTVDVKDKELEAVFSGLEGRLSD-PSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLI 79 (169) Q Consensus 1 mi~i~~~~~~l~~~L~~l~~~~~~-~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~ 79 (169) |-++....++|.+.|+.+...+.+ ..+.+.+.|..+.+..+.. .| . T Consensus 1 Ma~~~~G~~~l~~~l~~~~~~~~~~~~~~l~~~a~~~~~~ak~~-----~p----v------------------------ 47 (137) T protein:vir:96 1 MAKVKYGNWDLVAELEDYRDEMEEWVKKGILKTTLAIYNTAVAL-----AP----V------------------------ 47 (137) T ss_pred CchhHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CC----c------------------------ Confidence 999987778888888888776653 4556666676666654422 22 0 Q ss_pred hhhhhhhhhhhhhe--ecCCcEEEecCcccchhhhhcccccccchhh--hhhhhccccccccCce---eeccCcccCCCC Q lcl|NC_020839. 80 GPTKTLSSPSNFAV--SSGGDWARLSSRAIQSAVMQFGAKKGAFGSY--QGKGFGGSSSTISIPW---GDIPARPFMGIS 152 (169) Q Consensus 80 ~~~~~~~~~~si~~--~~~~~~v~vGt~~~YA~iHqfGg~~~~~~~~--~~~~~~~~~~~~~~~~---~~iPaRpfLG~s 152 (169) +|+.+.+||++ ..++..++||++..||...+||......... ....+.+....+.+.+ ..+|++|||-=+ T Consensus 48 ---dTG~L~~Si~~~~~~~g~~~~V~~~~~YA~yvE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~pFl~pA 124 (137) T protein:vir:96 48 ---DLGFLKESIDFKVTDGGFSSVISVGAEYAIYVEFGTGIYATGPGGSRARKLPWTYKGDDGEWHTTYGQQAQPFWNPA 124 (137) T ss_pred ---CccchhcCceeEeecCceEEEEecCCCcccccccCccccccCCCccccccccceeeccCcceeecCCCCCCcchhHH Confidence 12333345544 4456679999999999999999754322111 1111222222222222 348999998744 Q ss_pred HHH-HHHHHHHHH Q lcl|NC_020839. 153 AGD-QENIEAALM 164 (169) Q Consensus 153 ~~d-~~~I~~~i~ 164 (169) -++ +..|...|. T Consensus 125 ~~~~~~~i~k~i~ 137 (137) T protein:vir:96 125 IDEGRKVFNRYFS 137 (137) T ss_pred HHHHHHHHHHhhC Confidence 333 344444444 No 55 >protein:vir:105916 Length: 149 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004379;genbank:gi:122891834;genbank:GeneID:4712387 Probab=97.99 E-value=3e-08 Score=61.87 Aligned_cols=128 Identities=13% Similarity=0.104 Sum_probs=73.2 Q ss_pred CeEEEEchHHHHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhh Q lcl|NC_020839. 1 MFTVDVKDKELEAVFSGLEGRLSD-PSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLI 79 (169) Q Consensus 1 mi~i~~~~~~l~~~L~~l~~~~~~-~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~ 79 (169) |.++.+-.++|.+.|+.+...+.+ ....+.+.+..+.+..+.+ .| T Consensus 13 Ma~v~~Gld~l~~~l~~~~~~~~~~~~~~l~~~a~~v~~~ak~~-----aP----------------------------- 58 (149) T protein:vir:10 13 MAKVKYGADSMVVELDKFDKKIEEWVKKGIAKTTTKIYNTAVAL-----AP----------------------------- 58 (149) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CC----------------------------- Confidence 988876557788888777766642 4556666666666554321 22 Q ss_pred hhhhhhhhhhhhheec--CCcEEEecCcccchhhhhcccccccchhhhh--hhhccc---cccccCceeeccCcccCCCC Q lcl|NC_020839. 80 GPTKTLSSPSNFAVSS--GGDWARLSSRAIQSAVMQFGAKKGAFGSYQG--KGFGGS---SSTISIPWGDIPARPFMGIS 152 (169) Q Consensus 80 ~~~~~~~~~~si~~~~--~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~--~~~~~~---~~~~~~~~~~iPaRpfLG~s 152 (169) .+|+.+..||.+.. +.-.+.||++..||...+||........... ....+. ..........+|+||||-=+ T Consensus 59 --vdTG~L~~SI~~~~~~~g~~~~V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA 136 (149) T protein:vir:10 59 --VDLGFLEESIDFKYFDGGLSSVISVGADYAIYVEYGTGIYATGPGGSRATKIPWSFKGDDGEWYTTYGQAPQPFWNPA 136 (149) T ss_pred --cccchhhccceEEecCCcEEEEEecCCCcccccccCccccccCCcccccccccceeeccccceecCCCCCCCcchhHH Confidence 01344445665443 3446899999999999999974332111100 000111 01111122358999998755 Q ss_pred H-HHHHHHHHHHH Q lcl|NC_020839. 153 A-GDQENIEAALM 164 (169) Q Consensus 153 ~-~d~~~I~~~i~ 164 (169) - +.+..|.+.|+ T Consensus 137 ~~~~k~~i~~~i~ 149 (149) T protein:vir:10 137 IDAGRKTFEQYFS 149 (149) T ss_pred HHHHHHHHHHhhC Confidence 3 34566666666 No 56 >protein:vir:743 Length: 108 # NCBI annotation: unknown # Family: family:all:180 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108720;genbank:gi:13487842;genbank:GeneID:920877 Probab=97.96 E-value=8.9e-08 Score=59.23 Aligned_cols=104 Identities=13% Similarity=0.146 Sum_probs=70.2 Q ss_pred EEEch-HHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhhhhh Q lcl|NC_020839. 4 VDVKD-KELEAVFSGLEGRLSDPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLIGPT 82 (169) Q Consensus 4 i~~~~-~~l~~~L~~l~~~~~~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~~~~ 82 (169) |++++ ++|.+.|+++.. .......+...|..+.+..+. ..| . T Consensus 1 i~i~Gld~l~~~l~~~~~-~~~~~~al~~~a~~i~~~ak~-----~aP----v--------------------------- 43 (108) T protein:vir:74 1 MKITGIDALQKKLRKNAT-LDDVKHVVKSNTASMNKNMQN-----LAP----V--------------------------- 43 (108) T ss_pred CcchhHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHHHHHH-----hCC----C--------------------------- Confidence 66665 667777766532 234556666667666655432 122 0 Q ss_pred hhhhhhhhhhee--cCCcEEEecCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCC-HHHHHHH Q lcl|NC_020839. 83 KTLSSPSNFAVS--SGGDWARLSSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGIS-AGDQENI 159 (169) Q Consensus 83 ~~~~~~~si~~~--~~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s-~~d~~~I 159 (169) .|+.+..||.+. .+.-.+.||++..||..-.||.. .+||+|||.-. +..+..+ T Consensus 44 ~TG~Lr~si~~~~~~~~~~~~V~~~~~Ya~~vE~GT~------------------------km~aqpf~~pa~~~~~~~~ 99 (108) T protein:vir:74 44 DTGNMKRSITSEFTDGGLSGTTGPHTDYAGYVEYGTR------------------------FQSAQPFVKPAFNIQKKVF 99 (108) T ss_pred CchhhhccceeeeecCceEEEeecCCCcccceecccc------------------------ccCCCcchhhHHHHHHHHH Confidence 123333455544 35557999999999999999964 58999998866 5678888 Q ss_pred HHHHHHhcC Q lcl|NC_020839. 160 EAALMEWLE 168 (169) Q Consensus 160 ~~~i~~~l~ 168 (169) .+.|.+.|+ T Consensus 100 ~~~i~~~~k 108 (108) T protein:vir:74 100 TNDLERLTK 108 (108) T ss_pred HHHHHHHcC Confidence 899999999 No 57 >protein:vir:98409 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:YP_001210363;genbank:gi:146334932;genbank:GeneID:5114801 Probab=97.95 E-value=1.4e-07 Score=58.21 Aligned_cols=104 Identities=13% Similarity=0.133 Sum_probs=69.6 Q ss_pred EEEch-HHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhhhhh Q lcl|NC_020839. 4 VDVKD-KELEAVFSGLEGRLSDPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLIGPT 82 (169) Q Consensus 4 i~~~~-~~l~~~L~~l~~~~~~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~~~~ 82 (169) |++++ ++|.+.|+.+.. .......++..|..+.+..+.+ .| + T Consensus 1 i~i~Gld~l~~~l~~~~~-~~~~~~al~~~a~~i~~~ak~~-----ap----v--------------------------- 43 (108) T protein:vir:98 1 MKITGIDALQKKLRKNAT-LNDVKHVVKRNTVSMNKNMQNL-----AP----V--------------------------- 43 (108) T ss_pred CcchhHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHHHHHHh-----CC----C--------------------------- Confidence 66665 667777766532 2335567777777766665432 22 1 Q ss_pred hhhhhhhhhhee--cCCcEEEecCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCCH-HHHHHH Q lcl|NC_020839. 83 KTLSSPSNFAVS--SGGDWARLSSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGISA-GDQENI 159 (169) Q Consensus 83 ~~~~~~~si~~~--~~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s~-~d~~~I 159 (169) .|+.+..||.+. .+.-.+.||++..||..-.||.. .+|++|||.-+- .....+ T Consensus 44 dTG~Lr~si~~~~~~~~~~~~V~~~~~Ya~~vE~GT~------------------------~m~aqPFl~pa~~~~~~~~ 99 (108) T protein:vir:98 44 DTGNMKRSITSEFTDGGLTGTTIPHTDYAGYVEYGTR------------------------FQAAQPFVKPAFDVQKKIF 99 (108) T ss_pred CchhhHhhceeeeecCceEEEeecCCCccceeecccc------------------------ccCCCcchhhHHHHHHHHH Confidence 123333455544 34557899999999999999964 599999987653 456778 Q ss_pred HHHHHHhcC Q lcl|NC_020839. 160 EAALMEWLE 168 (169) Q Consensus 160 ~~~i~~~l~ 168 (169) .+.|.+.|+ T Consensus 100 ~~~i~~~lr 108 (108) T protein:vir:98 100 TNDLERLTK 108 (108) T ss_pred HHHHHHHcC Confidence 888889999 No 58 >protein:vir:5978 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690678;genbank:geneid:6329146;genbank:gi:22855072;interpro:IPR011693;uniprot:O48447;genbank:GeneID:955318 Probab=97.94 E-value=3.1e-07 Score=56.23 Aligned_cols=130 Identities=13% Similarity=0.152 Sum_probs=68.4 Q ss_pred CeEEEEch---HHHHHHHHHHHHHhh-hHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccch Q lcl|NC_020839. 1 MFTVDVKD---KELEAVFSGLEGRLS-DPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFK 76 (169) Q Consensus 1 mi~i~~~~---~~l~~~L~~l~~~~~-~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~ 76 (169) +-+|+++. +++.+.|+.+...+. .....+.+.++.+.+..+.+ .| + T Consensus 3 ~ms~~i~~~g~~~l~~~l~~~~~~~~~~v~~~l~~~a~~i~~~ak~~-----ap----v--------------------- 52 (144) T protein:vir:59 3 LMSVRIDPSWRRIMSRNVRTFSGHVLTQVEQVIIKTAEKIAGLAASL-----AP----V--------------------- 52 (144) T ss_pred cceeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CC----c--------------------- Confidence 33666664 355555666555443 23455555555555544322 22 2 Q ss_pred hhhhhhhhhhhhhhhhe--ecCCcEEEecCcccchhhhhcccccccchhhhhh-----hhccccccccCceeeccCcccC Q lcl|NC_020839. 77 PLIGPTKTLSSPSNFAV--SSGGDWARLSSRAIQSAVMQFGAKKGAFGSYQGK-----GFGGSSSTISIPWGDIPARPFM 149 (169) Q Consensus 77 ~l~~~~~~~~~~~si~~--~~~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~~-----~~~~~~~~~~~~~~~iPaRpfL 149 (169) +|+.+.+||.+ ..++-.++||++..||..++||............ .....+..+.. ..+|++||| T Consensus 53 ------~TG~Lr~SI~~~~~~~g~~~~V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~g~~~~t--~g~~a~Pfl 124 (144) T protein:vir:59 53 ------DEGNLKNSIQIDYKNNGLTAEITVGAEYAIYVEYGTGIYAVDGNGRKTPWTYYSPKLGRYVRT--QGAPAQPFF 124 (144) T ss_pred ------cchhhhcCeeEEeecCcEEEEEecCCCccchhhcCccccccCCCccccccccccccccceecC--CCCCCCcch Confidence 13344445554 4445579999999999999999744322111110 01111112211 258999998 Q ss_pred CCCHH-HHHHHHHHHHHhcC Q lcl|NC_020839. 150 GISAG-DQENIEAALMEWLE 168 (169) Q Consensus 150 G~s~~-d~~~I~~~i~~~l~ 168 (169) -=+-+ .+..|.+.|.+.+= T Consensus 125 ~pA~~~~~~~~~~~i~~~~g 144 (144) T protein:vir:59 125 WPAVEEGGEYFEREMRRLRG 144 (144) T ss_pred hHHHHHHHHHHHHHHHHhcC Confidence 75533 34555555555555 No 59 >protein:vir:94108 Length: 149 # NCBI annotation: ORF029 # Family: family:all:180 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240238;genbank:gi:66395914;genbank:GeneID:5133277 Probab=97.92 E-value=4.7e-08 Score=60.77 Aligned_cols=128 Identities=13% Similarity=0.064 Sum_probs=72.3 Q ss_pred CeEEEEchHHHHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhh Q lcl|NC_020839. 1 MFTVDVKDKELEAVFSGLEGRLSD-PSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLI 79 (169) Q Consensus 1 mi~i~~~~~~l~~~L~~l~~~~~~-~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~ 79 (169) |.++..-.++|.+.|+.+...+.+ ....+.+.++.+.+..+.+ .| T Consensus 13 Ma~~~~Gld~l~~~L~~~~~~~~~~~~~al~~~a~~v~~~ak~~-----aP----------------------------- 58 (149) T protein:vir:94 13 MAKVKYGADSMVVELDKFDKKIEEWVKKGIAKTTTKIYNTAVAL-----AP----------------------------- 58 (149) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CC----------------------------- Confidence 988876557788888777666642 4456666666665554322 22 Q ss_pred hhhhhhhhhhhhheecC--CcEEEecCcccchhhhhcccccccchhhhh--hhhcccccc---ccCceeeccCcccCCCC Q lcl|NC_020839. 80 GPTKTLSSPSNFAVSSG--GDWARLSSRAIQSAVMQFGAKKGAFGSYQG--KGFGGSSST---ISIPWGDIPARPFMGIS 152 (169) Q Consensus 80 ~~~~~~~~~~si~~~~~--~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~--~~~~~~~~~---~~~~~~~iPaRpfLG~s 152 (169) .+|+.+..||.+... .-.+.||++..||...+||........... ....+.... .......+|+||||-=+ T Consensus 59 --vdTG~Lr~SI~~~~~~~g~~~~V~~~~~YA~~VE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA 136 (149) T protein:vir:94 59 --VDLGFLEESIDFKYFDGGLSSVISVGADYAIYVEYGTGIYATGPGGSRATKIPWSFKGDDGEWYTTYGQAPQPFWNPA 136 (149) T ss_pred --cccchhhcCeeEEeeCCcEEEEEecCCCcccccccCccccccCCCccccccccceeecCccceecCCCCCCCcchHHH Confidence 023444456655443 446889999999999999975432111100 011111100 11112358999998744 Q ss_pred H-HHHHHHHHHHH Q lcl|NC_020839. 153 A-GDQENIEAALM 164 (169) Q Consensus 153 ~-~d~~~I~~~i~ 164 (169) - +.+..|.+.|+ T Consensus 137 ~~~~~~~i~~~i~ 149 (149) T protein:vir:94 137 IDAGRKTFEQYFS 149 (149) T ss_pred HHHHHHHHHHhhC Confidence 3 34556666666 No 60 >protein:vir:100075 Length: 140 # NCBI annotation: gp9 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945039;genbank:gi:38707899;genbank:GeneID:2744122 Probab=97.89 E-value=1.7e-07 Score=57.65 Aligned_cols=126 Identities=20% Similarity=0.214 Sum_probs=71.9 Q ss_pred CeEEEEch-HHHHHHHHHHHHHhhh--HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchh Q lcl|NC_020839. 1 MFTVDVKD-KELEAVFSGLEGRLSD--PSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKP 77 (169) Q Consensus 1 mi~i~~~~-~~l~~~L~~l~~~~~~--~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~ 77 (169) |++|++++ ++|.+.|+.|.....+ .+..+...|+.+....+.+- |..+-| ++.+......+.+.. . T Consensus 1 Ma~~~i~Gld~l~~~l~~L~~~~~~k~~~~al~~~a~~v~~~ak~~a-----P~~tG~--l~~sI~~~~~~~~~~----~ 69 (140) T protein:vir:10 1 MSSIQIIGLADLRADFEKLAKSQSTKALRRATVAGAKVIRDEARKRA-----PKKTGK--LRRNIVSAALRQKDA----P 69 (140) T ss_pred CceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhC-----CCChhh--HHHhccccccccccc----c Confidence 99999996 8999999999876643 57788888888888877653 421111 111100000000000 0 Q ss_pred hhhhhhhhhhhhhhheecCCcEEEecCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCCH-HHH Q lcl|NC_020839. 78 LIGPTKTLSSPSNFAVSSGGDWARLSSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGISA-GDQ 156 (169) Q Consensus 78 l~~~~~~~~~~~si~~~~~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s~-~d~ 156 (169) ... ...+.. .....+..+++..|+.+..||+. .+|++|||.-+- +.+ T Consensus 70 ~~~-------~~g~~~-~~~~~~~~~~~~~y~~f~E~GT~------------------------~~~a~PFl~pA~~~~~ 117 (140) T protein:vir:10 70 GLA-------TAGVRV-RTKGKADSPNNAFYWRFDEFGTQ------------------------HMKAQPFMRPAFDASI 117 (140) T ss_pred ceE-------Eeeeee-ccccccCCCCccceeeeeccCCC------------------------CCCCCcchhhhHHHHH Confidence 000 000000 00111223466789999999964 599999998653 345 Q ss_pred HHHHHHHHHhcCC Q lcl|NC_020839. 157 ENIEAALMEWLEP 169 (169) Q Consensus 157 ~~I~~~i~~~l~p 169 (169) +.+.+++.+.|+= T Consensus 118 ~~~~~~~~~~~~~ 130 (140) T protein:vir:10 118 GEAEGAIRTELAR 130 (140) T ss_pred HHHHHHHHHHHHH Confidence 5566666655555 No 61 >protein:vir:1437 Length: 140 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536366;genbank:gi:17975171;genbank:GeneID:929147 Probab=97.87 E-value=2.2e-07 Score=57.11 Aligned_cols=123 Identities=20% Similarity=0.214 Sum_probs=70.4 Q ss_pred CeEEEEch-HHHHHHHHHHHHHhhh--HHHHHHHHHHHHHHHHHHHHHhcCCC--CCCCCcccchhHHHHHhccCccccc Q lcl|NC_020839. 1 MFTVDVKD-KELEAVFSGLEGRLSD--PSELMAQIGELLLDSTLARFQAGKAP--DGSPWAPKSSATIKAYERRKQTVSF 75 (169) Q Consensus 1 mi~i~~~~-~~l~~~L~~l~~~~~~--~~~l~~~Ig~~l~~~~~~rF~~q~~P--dG~~W~pl~~~t~~~~~~~~~~~~~ 75 (169) |++|++++ ++|.+.|+.|.....+ .+..+...|+.+.+..+.+- | +|. ++.+......+... T Consensus 1 M~~~~i~Gld~l~~~l~~l~~~~~~~~~~~al~~~a~~v~~~ak~~a-----P~~tG~----l~~sI~~~~~~~~~---- 67 (140) T protein:vir:14 1 MSSIQIIGLADLRADFEKLAKSQSAKALRRATLAGAKVIRDEARKRA-----PKKTGK----LRRNIVSAALRQKD---- 67 (140) T ss_pred CceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhC-----CCChhh----HHhhcccccccccc---- Confidence 99999997 8899999999876653 47788888988888877652 3 111 11110000000000 Q ss_pred hhhhhhhhhhhhhhhhhee-cCCcEEEecCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCCH- Q lcl|NC_020839. 76 KPLIGPTKTLSSPSNFAVS-SGGDWARLSSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGISA- 153 (169) Q Consensus 76 ~~l~~~~~~~~~~~si~~~-~~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s~- 153 (169) .......... .....+..+++..|+.+..||.. .+|++|||.-+- T Consensus 68 ---------~~~~~~vg~~~~~~~~~~~~~~~~y~~f~E~GT~------------------------~~~a~pFl~pa~~ 114 (140) T protein:vir:14 68 ---------APGLATAGVRVRTKGKADSPNNAFYWRFDEFGTQ------------------------HMKAQPFMRPAFD 114 (140) T ss_pred ---------cceeEEeeeeeccccccCCCCccceeeeeccccC------------------------CCCCCcchhHHHH Confidence 0000000000 00111223466789999999964 599999998664 Q ss_pred HHHHHHHHHHHHhcCC Q lcl|NC_020839. 154 GDQENIEAALMEWLEP 169 (169) Q Consensus 154 ~d~~~I~~~i~~~l~p 169 (169) +.+..+.+++.+.|+= T Consensus 115 ~~~~~~~~~~~~~~~~ 130 (140) T protein:vir:14 115 ASIGEAEGAIRTELAR 130 (140) T ss_pred HHHHHHHHHHHHHHHH Confidence 3345555555555554 No 62 >protein:vir:4347 Length: 164 # NCBI annotation: Orf14 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061510;genbank:gi:9635606;genbank:GeneID:1262873 Probab=97.84 E-value=1.6e-07 Score=57.88 Aligned_cols=139 Identities=10% Similarity=0.099 Sum_probs=73.8 Q ss_pred CeEEEEch-HHHHHHHHHHHHHhhh--HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccc-hhHHHHHhccCccccch Q lcl|NC_020839. 1 MFTVDVKD-KELEAVFSGLEGRLSD--PSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKS-SATIKAYERRKQTVSFK 76 (169) Q Consensus 1 mi~i~~~~-~~l~~~L~~l~~~~~~--~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~-~~t~~~~~~~~~~~~~~ 76 (169) .++++|.+ ++|.+.|+.|.....+ .+..+..-|+.+.+..+.+--.-.+| +.+.+ ...+......+...... T Consensus 4 ~~~~~i~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~----~~~~~l~~~i~~~~~~~~~~~~~ 79 (164) T protein:vir:43 4 TVEFSITGLDSLLGKLDSVTDDVKRRGGRAALRKAAMIVVQAAKQGAEKVDDP----GTGRSISDNIALRWNGRLFKRTG 79 (164) T ss_pred ceEEeeecHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccCC----CccchhhhhhhhhcccCcccccc Confidence 56777776 8899999999877653 46888888998988888876432222 22110 00000000000000000 Q ss_pred hhhhhhhhhhh-hhhhheecCCcEEEecCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCCH-H Q lcl|NC_020839. 77 PLIGPTKTLSS-PSNFAVSSGGDWARLSSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGISA-G 154 (169) Q Consensus 77 ~l~~~~~~~~~-~~si~~~~~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s~-~ 154 (169) .+.. ..+.. ............+..+++..|+.++.||+. ++|+||||.-.- + T Consensus 80 ~~~~--~vg~~~~~~~~~~~~~~~~~~~~~~~y~~f~EfGT~------------------------km~a~PFlrPA~~~ 133 (164) T protein:vir:43 80 DLGF--RIGVLHGAVLPKKGERSDKTANAPTPHWRLLEFGTE------------------------DMRAQPFMRSALAD 133 (164) T ss_pred ceeE--EecccccccccccccccccCCCCCcceEEEeecCCC------------------------CCCCCcchhhhHHH Confidence 0000 00000 000000111112334566789999999963 699999998653 3 Q ss_pred HHHHHHHHHHHhcCC Q lcl|NC_020839. 155 DQENIEAALMEWLEP 169 (169) Q Consensus 155 d~~~I~~~i~~~l~p 169 (169) .++.+.++|.+.|+= T Consensus 134 ~k~~~~~~~~~~l~~ 148 (164) T protein:vir:43 134 NIAEVTSTFVSEYEK 148 (164) T ss_pred hHHHHHHHHHHHHHH Confidence 556666666655555 No 63 >protein:vir:9930 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795692;genbank:gi:28876456;genbank:GeneID:1257995 Probab=97.79 E-value=5.7e-07 Score=54.81 Aligned_cols=104 Identities=11% Similarity=0.091 Sum_probs=68.3 Q ss_pred Ech-HHHHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhhhhhh Q lcl|NC_020839. 6 VKD-KELEAVFSGLEGRLSD-PSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLIGPTK 83 (169) Q Consensus 6 ~~~-~~l~~~L~~l~~~~~~-~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~~~~~ 83 (169) |++ ++|.+.|+++...+.+ ....|...|..+.+..+. ..| . . T Consensus 1 i~Gld~l~~~l~~~~~~~~~~v~~al~~~a~~i~~~ak~-----~aP----v---------------------------~ 44 (108) T protein:vir:99 1 MRGLDRFLRSVERKQKSVRIAVDKELSKSAARIERQAKI-----LAP----V---------------------------D 44 (108) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-----cCC----c---------------------------C Confidence 665 6777777777665543 345566666665554322 223 0 1 Q ss_pred hhhhhhhhheecCC-cEEEecCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCCHH-HHHHHHH Q lcl|NC_020839. 84 TLSSPSNFAVSSGG-DWARLSSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGISAG-DQENIEA 161 (169) Q Consensus 84 ~~~~~~si~~~~~~-~~v~vGt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s~~-d~~~I~~ 161 (169) |+.+..||.+..++ ..+.|+++..||....||.. .+||||||.-+-+ .+..+.+ T Consensus 45 TG~Lr~sI~~~~~~~~~~~v~~~~~Ya~~vE~GT~------------------------~m~a~Pf~~pa~~~~~~~~~~ 100 (108) T protein:vir:99 45 TGWLRAQIYSEQQRLLHYRVVSPALYSIYLELGTR------------------------KMEAQSFLDPALRKEWPVLMA 100 (108) T ss_pred chhhhcceeeeecCcEEEEeecCcccchhcccCcc------------------------ccCCCcchhhhHHHHHHHHHH Confidence 34444566655544 47899999999999999964 5999999986643 4556666 Q ss_pred HHHHhcCC Q lcl|NC_020839. 162 ALMEWLEP 169 (169) Q Consensus 162 ~i~~~l~p 169 (169) .|.+.|+= T Consensus 101 ~i~~~lrk 108 (108) T protein:vir:99 101 NIKKMFKR 108 (108) T ss_pred HHHHHhcC Confidence 67777666 No 64 >protein:vir:102441 Length: 137 # NCBI annotation: gp26 # Family: family:all:1084 # MgeID: mge:1618 # MgeName: Pipefish # Cross-refs: genbank:acc:YP_655303;genbank:gi:109521866;genbank:GeneID:4157756 Probab=97.72 E-value=1.5e-07 Score=58.05 Aligned_cols=121 Identities=12% Similarity=0.044 Sum_probs=58.0 Q ss_pred eEEEEch----HHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchh Q lcl|NC_020839. 2 FTVDVKD----KELEAVFSGLEGRLSDPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKP 77 (169) Q Consensus 2 i~i~~~~----~~l~~~L~~l~~~~~~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~ 77 (169) +.+++.. ..+.+.+... ++..+..++..+....+ +.. || T Consensus 1 ~~~~~~~~~~~~~~~~~~~~v------~r~~l~~~a~~v~~~Ak-----~~a----Pv---------------------- 43 (137) T protein:vir:10 1 MTVTARYERNPVGEARQFQVI------ARRRLSRITRGTANQAR-----ADV----PV---------------------- 43 (137) T ss_pred CeeEEEeccCchhHHHHHHHH------HHHHHHHHHHHHHHHHH-----hcC----Cc---------------------- Confidence 4333333 2233322222 22334444444433221 111 22 Q ss_pred hhhhhhhhhhhhhhhee--c----CCcEEEecCcccchhhhhcccccc---cchhhhhhhhccccccccCceeecc---C Q lcl|NC_020839. 78 LIGPTKTLSSPSNFAVS--S----GGDWARLSSRAIQSAVMQFGAKKG---AFGSYQGKGFGGSSSTISIPWGDIP---A 145 (169) Q Consensus 78 l~~~~~~~~~~~si~~~--~----~~~~v~vGt~~~YA~iHqfGg~~~---~~~~~~~~~~~~~~~~~~~~~~~iP---a 145 (169) +++.+..||... . ....+.||+++.||.+|+||.... +...+....|.+.+.++..+.|+.| + T Consensus 44 -----~tG~Lr~SI~~~~~~~~~~~~~~~~V~~~~~YA~~ve~GT~ph~I~Pk~~k~~l~~~~~g~~vf~k~V~hPG~~a 118 (137) T protein:vir:10 44 -----KTGNLGRSIREDPIVVAGPLRLDSGVTAHADYARYVHDGTRAHVIRPRRPGGVLRFTVGGRVVYARRVNHPGTRA 118 (137) T ss_pred -----cchhhhcCceeeeeeccccceEEEEecCCCccceeeecCCCCceeeccccceeeeEeeCCeeEecceeecCCCCC Confidence 122233344322 1 123567999999999999998633 3222335667777777777777655 9 Q ss_pred cccCCCCHHHHHHHHHHHHHh Q lcl|NC_020839. 146 RPFMGISAGDQENIEAALMEW 166 (169) Q Consensus 146 RpfLG~s~~d~~~I~~~i~~~ 166 (169) ||||- +..++.+...-..- T Consensus 119 ~PfL~--~A~~~~~~~~~~~~ 137 (137) T protein:vir:10 119 RPFLR--NAAERVVARETATS 137 (137) T ss_pred CchHH--HHHHHhhhhhcccC Confidence 99954 22222111111111 No 65 >protein:vir:105089 Length: 133 # NCBI annotation: Gp11 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006591;genbank:gi:46402097;genbank:GeneID:2777955 Probab=97.72 E-value=6.2e-07 Score=54.63 Aligned_cols=115 Identities=15% Similarity=0.146 Sum_probs=70.3 Q ss_pred CeEEEEch-HHHHHHHHHHHHHhhh--HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchh Q lcl|NC_020839. 1 MFTVDVKD-KELEAVFSGLEGRLSD--PSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKP 77 (169) Q Consensus 1 mi~i~~~~-~~l~~~L~~l~~~~~~--~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~ 77 (169) |++|+|++ ++|.+.|+.|...... .+..+..-|+.+.+..+.+ .|-.. + .+. T Consensus 1 M~~~~i~Gl~el~~~l~~L~~~~~~k~~~~Al~~~a~~i~~~ak~~-----ap~~~-~-----~~~-------------- 55 (133) T protein:vir:10 1 MIRMEVKGLDELERQLTALGEKVATKVLRDAGREALKVVEEDMKQH-----AGFDE-T-----STG-------------- 55 (133) T ss_pred CeeEeeehHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CCCCC-C-----cch-------------- Confidence 99999998 8999999999876643 3577888888888777766 23110 0 000 Q ss_pred hhhhhhhhhhhhh------hheecCCcEEEecCcc---cchhhhhcccccccchhhhhhhhccccccccCceeeccCccc Q lcl|NC_020839. 78 LIGPTKTLSSPSN------FAVSSGGDWARLSSRA---IQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPF 148 (169) Q Consensus 78 l~~~~~~~~~~~s------i~~~~~~~~v~vGt~~---~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpf 148 (169) ..++.++. .....+...|.||.+. .|+.+..||.. ++|++|| T Consensus 56 -----~~~~~~I~v~~~~~~~~~~~~~~v~vg~~~~~~~y~~f~E~GT~------------------------k~~a~PF 106 (133) T protein:vir:10 56 -----QHMRDSIKIRSSTRKAQGNAVVTLRVGPSKQHHMKVLAQEFGTV------------------------KQVADPF 106 (133) T ss_pred -----hhhhhcccccccccccCccceEEEEecCCCCccceEeeeccCCC------------------------CCCCCcc Confidence 00111100 0011122356777554 36666799963 6899999 Q ss_pred CCCC-HHHHHHHHHHHHHhcCC Q lcl|NC_020839. 149 MGIS-AGDQENIEAALMEWLEP 169 (169) Q Consensus 149 LG~s-~~d~~~I~~~i~~~l~p 169 (169) |.-+ ++.++.+.+++.+.|+= T Consensus 107 ~~pA~~~~~~~~~~~~~~~~~~ 128 (133) T protein:vir:10 107 IRPALDYNVQTVLRVLTVEIRN 128 (133) T ss_pred chHHHHHhHHHHHHHHHHHHHH Confidence 9966 44556566666555555 No 66 >protein:vir:80362 Length: 140 # NCBI annotation: gp10, phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111089;genbank:gi:134288660;genbank:GeneID:4960609 Probab=97.69 E-value=2.4e-07 Score=56.90 Aligned_cols=124 Identities=22% Similarity=0.211 Sum_probs=72.3 Q ss_pred CeEEEEch-HHHHHHHHHHHHHhhh--HHHHHHHHHHHHHHHHHHHHHhcCCCC--CCCCcccchhHHHHHhccCccccc Q lcl|NC_020839. 1 MFTVDVKD-KELEAVFSGLEGRLSD--PSELMAQIGELLLDSTLARFQAGKAPD--GSPWAPKSSATIKAYERRKQTVSF 75 (169) Q Consensus 1 mi~i~~~~-~~l~~~L~~l~~~~~~--~~~l~~~Ig~~l~~~~~~rF~~q~~Pd--G~~W~pl~~~t~~~~~~~~~~~~~ 75 (169) |++|++++ ++|.+.|+.|.....+ .+..+...|..+.+..+.+- |. |.-+......+. +.+... T Consensus 1 Ma~~~i~Gld~l~~~l~~l~~~~~~k~~~~a~~~~a~~v~~~ak~~a-----P~~tG~l~~~i~~~~~----~~~~~~-- 69 (140) T protein:vir:80 1 MSSIQIVGLADLLADFERLAKSQSTKALRRATVAGAKVIRDEARKRA-----PKKTGKLRRNIVSAAL----RQKDAP-- 69 (140) T ss_pred CceeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-----CCCcchhhhceeeecc----cccccc-- Confidence 99999997 8899999999876653 36788888888888877663 42 221111111000 000000 Q ss_pred hhhhhhhhhhhhhhhhheecCCcEEEecCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCCH-H Q lcl|NC_020839. 76 KPLIGPTKTLSSPSNFAVSSGGDWARLSSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGISA-G 154 (169) Q Consensus 76 ~~l~~~~~~~~~~~si~~~~~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s~-~ 154 (169) .. .... +.+ .....+..+++..|+.+..||+. .+|++|||.-+- + T Consensus 70 --~~--~~~~-----~~~-~~~~~~~~~~~~~y~~f~E~GT~------------------------~~~a~PFl~pA~~~ 115 (140) T protein:vir:80 70 --GL--ATAG-----VRV-RTKGKADSPSNAFYWRFDEFGTQ------------------------HMKAQPFMRPAFDA 115 (140) T ss_pred --ce--eeee-----eec-ccccccCCCCCcceeeeeccCCC------------------------CCCCCcchhhhHHH Confidence 00 0000 000 00111233566789999999964 589999998664 3 Q ss_pred HHHHHHHHHHHhcCC Q lcl|NC_020839. 155 DQENIEAALMEWLEP 169 (169) Q Consensus 155 d~~~I~~~i~~~l~p 169 (169) .+..+.+++.+.|+= T Consensus 116 ~~~~~~~~~~~~~~~ 130 (140) T protein:vir:80 116 SIGEAEGAIRTELAR 130 (140) T ss_pred HHHHHHHHHHHHHHH Confidence 456666666666555 No 67 >protein:vir:78077 Length: 141 # NCBI annotation: gp9 # Family: family:all:180 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468793;genbank:gi:157325374;genbank:GeneID:5601839 Probab=97.62 E-value=6.5e-07 Score=54.49 Aligned_cols=132 Identities=9% Similarity=-0.014 Sum_probs=68.3 Q ss_pred CeEEEEchHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhhh Q lcl|NC_020839. 1 MFTVDVKDKELEAVFSGLEGRLSDPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLIG 80 (169) Q Consensus 1 mi~i~~~~~~l~~~L~~l~~~~~~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~~ 80 (169) |-+|+++.. ..+++..+.... ...+..+|..+....-++-....+| T Consensus 1 ~~~~~f~~~-~~~~~~~~~k~~---~~~~~~~a~~~~~~~ie~~ak~~~p------------------------------ 46 (141) T protein:vir:78 1 MNEFEFDSN-IPKARKLIEKKV---LQALEDIGEHMTTELAEGGHGVTSN------------------------------ 46 (141) T ss_pred CcchhHHHH-HHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHhhhhccc------------------------------ Confidence 888888763 344444444332 2223344433332221111111111 Q ss_pred hhhhhhhhhhhhe--ecCCcEEEecCcccchhhhhcccccccchhh----hhhhhccccccccCceeeccCcccCCCCHH Q lcl|NC_020839. 81 PTKTLSSPSNFAV--SSGGDWARLSSRAIQSAVMQFGAKKGAFGSY----QGKGFGGSSSTISIPWGDIPARPFMGISAG 154 (169) Q Consensus 81 ~~~~~~~~~si~~--~~~~~~v~vGt~~~YA~iHqfGg~~~~~~~~----~~~~~~~~~~~~~~~~~~iPaRpfLG~s~~ 154 (169) .+++.|.+||.+ ..++..+.||++..||...+||.-+...... ...++-..+++... .-.||+|||=-+-+ T Consensus 47 -vdtG~L~~SI~~~v~~~g~~~~V~~~~~YA~yVE~GTG~~~~~~~grk~~w~y~~~~g~~~~t--~G~~aqpFl~~A~~ 123 (141) T protein:vir:78 47 -NDTGEYAQKSGYKVRKSSKEVIVGNSSDYAIYYEFGTGEKSERGGGKAGGWFYMDKKGHWHFT--RGSQASKRMRYTFR 123 (141) T ss_pred -cccchhhcceeeeeecCCcEEEEecCCCccceeecCCcccccCCCCCcCcceeecCCCeeEec--cCCCCchhhhhhHH Confidence 123334445544 4466788999999999999999643221110 00111111222111 23899999965533 Q ss_pred -HHHHHHHHHHHhcCC Q lcl|NC_020839. 155 -DQENIEAALMEWLEP 169 (169) Q Consensus 155 -d~~~I~~~i~~~l~p 169 (169) .+..|.++|.+.|+= T Consensus 124 ~~~~~i~~~i~~~~~~ 139 (141) T protein:vir:78 124 DEQDKVRVFTERALRG 139 (141) T ss_pred hhHHHHHHHHHHHhhc Confidence 466778888888777 No 68 >protein:vir:100243 Length: 140 # NCBI annotation: gp72 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355408;genbank:gi:77864698;genbank:GeneID:3725965 Probab=97.61 E-value=1.2e-06 Score=53.04 Aligned_cols=121 Identities=18% Similarity=0.154 Sum_probs=70.2 Q ss_pred CeEEEEch-HHHHHHHHHHHHHhhh--HHHHHHHHHHHHHHHHHHHHHhcCCCC--CCCCcccchhHHHHHhccCccccc Q lcl|NC_020839. 1 MFTVDVKD-KELEAVFSGLEGRLSD--PSELMAQIGELLLDSTLARFQAGKAPD--GSPWAPKSSATIKAYERRKQTVSF 75 (169) Q Consensus 1 mi~i~~~~-~~l~~~L~~l~~~~~~--~~~l~~~Ig~~l~~~~~~rF~~q~~Pd--G~~W~pl~~~t~~~~~~~~~~~~~ 75 (169) |.++++++ ++|.+.|+.|.....+ .+..+...|..+.+....+- |- |.=+....-. ..+.. T Consensus 1 Ma~~~i~Gld~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~a-----p~~tG~l~~sI~~~-----~~~~~---- 66 (140) T protein:vir:10 1 MSSVQILGLADLQADFLKLAKAQSTKALRRATVAGANVIRDEARARA-----PKKTGKLKRNIVTA-----ALKQK---- 66 (140) T ss_pred CceeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-----CCChhhHHHhceec-----ccccc---- Confidence 99999998 8899999999877653 36788888888888877653 31 2111100000 00000 Q ss_pred hhhhhhhhhhhhhhhhheecC---CcEEEecCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCC Q lcl|NC_020839. 76 KPLIGPTKTLSSPSNFAVSSG---GDWARLSSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGIS 152 (169) Q Consensus 76 ~~l~~~~~~~~~~~si~~~~~---~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s 152 (169) .... .+.+... .......++..|+.+..||+. .+||+|||.-+ T Consensus 67 ------~~~~----~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~------------------------~~~a~PFl~pA 112 (140) T protein:vir:10 67 ------DSPG----IATAGVRVRTKGKADSPNNAFYWRFVELGTQ------------------------FMKAEPFMRPA 112 (140) T ss_pred ------cccc----eeEEeeccccccccCCCCcccccceeccCcC------------------------CCCCCcchhhh Confidence 0000 0111110 001122356789999999964 58999999866 Q ss_pred H-HHHHHHHHHHHHhcCC Q lcl|NC_020839. 153 A-GDQENIEAALMEWLEP 169 (169) Q Consensus 153 ~-~d~~~I~~~i~~~l~p 169 (169) - +.++.+.+.+.+.|+= T Consensus 113 ~~~~~~~~~~~~~~~~~~ 130 (140) T protein:vir:10 113 FDASIAQAEGAIRTEIAR 130 (140) T ss_pred HHHHHHHHHHHHHHHHHH Confidence 3 3445556655555554 No 69 >protein:vir:5745 Length: 135 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892056;genbank:gi:33770519;interpro:IPR010064;interpro:IPR011693;uniprot:Q7Y404;genbank:GeneID:2637451 Probab=97.55 E-value=2e-06 Score=51.84 Aligned_cols=115 Identities=10% Similarity=0.089 Sum_probs=66.9 Q ss_pred CeEEEEch-HHHHHHHHHHHHHhhh--HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchh Q lcl|NC_020839. 1 MFTVDVKD-KELEAVFSGLEGRLSD--PSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKP 77 (169) Q Consensus 1 mi~i~~~~-~~l~~~L~~l~~~~~~--~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~ 77 (169) |++|++.+ ++|.+.|+.|.....+ .+..++.-|+.+.+..+.+ .|-...+.+ T Consensus 2 ~~~~~i~Gl~el~~~l~~L~~~~~~k~~~~Al~~~a~~v~~~~k~~-----ap~~~~~~~-------------------- 56 (135) T protein:vir:57 2 IPEIEISGLQELERRLIAVGEEVGTKILRDAGRAAMAVVEADMKQN-----AGYDNSSTN-------------------- 56 (135) T ss_pred ceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CCCCCCCch-------------------- Confidence 99999996 8999999999877653 2567777788777776544 342222211 Q ss_pred hhhhhhhhhhhhhhh---e--ecCCcEEEecCcccch-hhh--hcccccccchhhhhhhhccccccccCceeeccCcccC Q lcl|NC_020839. 78 LIGPTKTLSSPSNFA---V--SSGGDWARLSSRAIQS-AVM--QFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFM 149 (169) Q Consensus 78 l~~~~~~~~~~~si~---~--~~~~~~v~vGt~~~YA-~iH--qfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfL 149 (169) +.++.++.+. . ..+...+.||.+..|. ..| .||.. ++|++||| T Consensus 57 -----g~l~~~I~i~~~k~~~~~~~v~v~vg~~~~~~~~~~f~E~GT~------------------------~~~a~PF~ 107 (135) T protein:vir:57 57 -----AHMRDSIKIRSSRGKAGSTVVVLRVGPTRSHYMKALAQEFGTI------------------------KQVAKPFI 107 (135) T ss_pred -----hhHHhhcccccccccccceeEEEEecCCCCcceeEeecccCCC------------------------CCCCCcch Confidence 1111111111 0 1122245677665542 244 88853 68999999 Q ss_pred CCC-HHHHHHHHHHHHHhcCC Q lcl|NC_020839. 150 GIS-AGDQENIEAALMEWLEP 169 (169) Q Consensus 150 G~s-~~d~~~I~~~i~~~l~p 169 (169) .-+ ++.++.+.+++.+-|+= T Consensus 108 ~pa~~~~~~~~~~~~~~~~~~ 128 (135) T protein:vir:57 108 RPALDYNKMQVLRILTVEIRD 128 (135) T ss_pred hHhHHHhHHHHHHHHHHHHHH Confidence 865 34444444444444443 No 70 >protein:vir:94538 Length: 125 # NCBI annotation: putative head to tail joining # Family: family:all:180 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223893;genbank:gi:62327105;genbank:GeneID:5075554 Probab=97.55 E-value=1.8e-06 Score=52.10 Aligned_cols=109 Identities=22% Similarity=0.175 Sum_probs=62.6 Q ss_pred CeEEEEch-HHHHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhh Q lcl|NC_020839. 1 MFTVDVKD-KELEAVFSGLEGRLSD-PSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPL 78 (169) Q Consensus 1 mi~i~~~~-~~l~~~L~~l~~~~~~-~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l 78 (169) =++|++++ ++|.+.|+.+.....+ ....+..-++.+..... +..| . T Consensus 4 ~~~i~~~Gld~l~~~L~~~~~~~~~~v~~al~~~a~~i~~~ak-----~~ap-------~-------------------- 51 (125) T protein:vir:94 4 DFNIKFKGVDKLLDEFDISRKELVPYSVEAMKTSLSRAVEKSK-----GLAR-------V-------------------- 51 (125) T ss_pred ceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHH-----hhCC-------C-------------------- Confidence 23678775 6777777777654432 23333333444443321 1122 1 Q ss_pred hhhhhhhhhhhhhh-----eecCCcEEEecCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCCH Q lcl|NC_020839. 79 IGPTKTLSSPSNFA-----VSSGGDWARLSSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGISA 153 (169) Q Consensus 79 ~~~~~~~~~~~si~-----~~~~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s~ 153 (169) .++.+..||. ...+.-.++||++..||....||.. .+|++|||.-+- T Consensus 52 ----~tG~L~~sI~~~~~~~~~~~~~~~v~~~~~Ya~~vEfGT~------------------------~~~a~Pfl~pa~ 103 (125) T protein:vir:94 52 ----DTGYMRNNIQQDEVKEEHGVVTGRYVARADYSSYNEYGTY------------------------RMSAQPFMAPSV 103 (125) T ss_pred ----CChhhhhhceecceeccCCcEEEEeeCCCCccceeecccc------------------------cCCCCcccchhH Confidence 1222333443 2235567999999999999999963 589999988653 Q ss_pred -HHHHHHHHHHHHhcCC Q lcl|NC_020839. 154 -GDQENIEAALMEWLEP 169 (169) Q Consensus 154 -~d~~~I~~~i~~~l~p 169 (169) +.+..+.+.|.+.|+= T Consensus 104 ~~~~~~~~~~l~~~l~~ 120 (125) T protein:vir:94 104 AAMTPFFYKAVRDALNK 120 (125) T ss_pred HHHHHHHHHHHHHHHHH Confidence 2344555555555544 No 71 >protein:vir:101594 Length: 173 # NCBI annotation: hypothetical protein # Family: family:all:26502 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112510;genbank:gi:53793610;interpro:IPR010064;uniprot:Q5ZGE3;genbank:GeneID:3101702 Probab=97.53 E-value=2e-06 Score=51.82 Aligned_cols=130 Identities=15% Similarity=0.126 Sum_probs=71.8 Q ss_pred EEEch-HHHHHHHHHHHHHhh-hHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhhhh Q lcl|NC_020839. 4 VDVKD-KELEAVFSGLEGRLS-DPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLIGP 81 (169) Q Consensus 4 i~~~~-~~l~~~L~~l~~~~~-~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~~~ 81 (169) |++.+ ++|.+.|+.|...+. .....|...++.+.+..+.+ .| + T Consensus 1 i~i~Gld~L~~~L~~l~~~~~~~~~~a~~~~a~~i~~~ak~~-----aP----v-------------------------- 45 (173) T protein:vir:10 1 MAVKGVAEVIAELRKIGKDIDKNINATTEEAANFIEDRAKTL-----AP----K-------------------------- 45 (173) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CC----c-------------------------- Confidence 77776 788888888876664 25567777777777766553 22 1 Q ss_pred hhhhhhhhhhheec----CCcEEEecCcccchhhhhcccccccchhhhhhh----hc---------------c------- Q lcl|NC_020839. 82 TKTLSSPSNFAVSS----GGDWARLSSRAIQSAVMQFGAKKGAFGSYQGKG----FG---------------G------- 131 (169) Q Consensus 82 ~~~~~~~~si~~~~----~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~~~----~~---------------~------- 131 (169) .++.+..||.++. +...+.|+++..||....||+............ .. . T Consensus 46 -~TG~Lr~sI~~~~~~~~~~~~~~v~~~~~Ya~fvEfGT~~m~a~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 124 (173) T protein:vir:10 46 -NFGKLAQSISTSDLKAKDLISKKITVNELYGAYMEFGTGAKVSVPKEFADMAASFKGQKTGSFKDGLESIKAWCRAKGI 124 (173) T ss_pred -CchhhhhcceeeeeccCceeEEeeCCCcccchhhhcccccccCCCchhhhhhccccccccccccccccccccccccccc Confidence 1223334554332 223467889999999999997543221110000 00 0 Q ss_pred -----ccccccCceeeccCcccCCCC-HHHHHHHHHHHHHhcCC Q lcl|NC_020839. 132 -----SSSTISIPWGDIPARPFMGIS-AGDQENIEAALMEWLEP 169 (169) Q Consensus 132 -----~~~~~~~~~~~iPaRpfLG~s-~~d~~~I~~~i~~~l~p 169 (169) ..........-+||+|||=-+ .+.+..+.+.|.++|+= T Consensus 125 ~~~~~~~~~~~~~~~G~~aqPFl~PA~~~~~~~~~~~i~~~i~~ 168 (173) T protein:vir:10 125 DEKAAYPIFAKILGAGINPQPFLYPAWIEGKKQYLKDLENLLKT 168 (173) T ss_pred chhcccceeeEeecCCCCCCccchhHHHHhHHHHHHHHHHHHHH Confidence 000000001238999998655 44555555555555555 No 72 >protein:vir:99546 Length: 200 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039796;genbank:gi:126011046;genbank:GeneID:4818241 Probab=97.39 E-value=1.6e-07 Score=57.78 Aligned_cols=103 Identities=13% Similarity=0.086 Sum_probs=48.8 Q ss_pred hccC-----ccccchhhhhhhhhhhhhhhhheecC---------CcEEEecCcccc-hhhhhcccccccchhhhhhhhcc Q lcl|NC_020839. 67 ERRK-----QTVSFKPLIGPTKTLSSPSNFAVSSG---------GDWARLSSRAIQ-SAVMQFGAKKGAFGSYQGKGFGG 131 (169) Q Consensus 67 ~~~~-----~~~~~~~l~~~~~~~~~~~si~~~~~---------~~~v~vGt~~~Y-A~iHqfGg~~~~~~~~~~~~~~~ 131 (169) .+.| +...+..+......+....+..+..+ .+..+.|+++.| |++|.||+.+.......+.++.. T Consensus 1 ~~~~~~~~~k~~~~~~~~~~~~~l~~l~~~~v~vGi~~~~~y~~~~~~~dG~~va~IA~~~EfG~~i~~p~~~~~~~~~~ 80 (200) T protein:vir:99 1 MKKGFSKSNSVAAPLKHFQMLKQFDALKGKTVQAGWFETDRYPAKEGETIGPLVAKIARQLEFGGVINHPGGTKYIKDAI 80 (200) T ss_pred CCcCcceeeeeecchHHHHHHHHHHHhhCCeEEEEEcCCCCcCCcccccccchHHHHHhHHHcCCeeccCCCcccccccc Confidence 1101 00111122222222222222111111 111345667766 99999999877544333222211 Q ss_pred -----------------ccccccCceeeccCcccCCCCHHH-HHHHHHHHHHhcCC Q lcl|NC_020839. 132 -----------------SSSTISIPWGDIPARPFMGISAGD-QENIEAALMEWLEP 169 (169) Q Consensus 132 -----------------~~~~~~~~~~~iPaRpfLG~s~~d-~~~I~~~i~~~l~p 169 (169) .......+.++||+||||--+-++ .+.+.+.+...++= T Consensus 81 ~~g~~~g~rfv~k~~~~~~~~~~~~~v~IP~RPFlr~t~~~~~~~~~~~~~~~~~~ 136 (200) T protein:vir:99 81 VDGRYVGTRFVHKSFQGEHEVTKAHQIVIPARPFMRLAWATFNKDKVKIQAQIARQ 136 (200) T ss_pred ccccccccccccccccceeeeeccccccCCCcchhhHHHHHHHHHHHHHHHHHHHH Confidence 112234567899999999977555 55565555555543 No 73 >protein:vir:1891 Length: 179 # NCBI annotation: gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037671;genbank:gi:9634129;genbank:GeneID:1262520 Probab=97.26 E-value=2e-06 Score=51.88 Aligned_cols=139 Identities=12% Similarity=0.109 Sum_probs=70.1 Q ss_pred CeEEEEch-HHHHHHHHHHHHHhhh--HHHHHHHHHHHHHHHHHHHHHhcCCCC--C-----CCCcccchhHHHHHhccC Q lcl|NC_020839. 1 MFTVDVKD-KELEAVFSGLEGRLSD--PSELMAQIGELLLDSTLARFQAGKAPD--G-----SPWAPKSSATIKAYERRK 70 (169) Q Consensus 1 mi~i~~~~-~~l~~~L~~l~~~~~~--~~~l~~~Ig~~l~~~~~~rF~~q~~Pd--G-----~~W~pl~~~t~~~~~~~~ 70 (169) .++|+|.+ ++|++.|+.|...+.+ ++..|..-|+.+.+..+.+--.-..|. | -.|...+... ++. T Consensus 4 ~~~~~i~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~~~~~~-----~~~ 78 (179) T protein:vir:18 4 SVEVSLTGLESLLGKMEAVSEVTRNKAGRFALRKAANIIRDRARSNASRVDDPLTKEAIHKNIVASFSSKQF-----RRT 78 (179) T ss_pred eEEEEeecHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccccchhhhhhheeeccccccc-----ccc Confidence 45666665 8899999999877653 578888889998888887653322221 1 1111111100 000 Q ss_pred ccccchhhhhhhhhhh----hhhhhheecC------CcEEEecCcccchhhhhcccccccchhhhhhhhccccccccCce Q lcl|NC_020839. 71 QTVSFKPLIGPTKTLS----SPSNFAVSSG------GDWARLSSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPW 140 (169) Q Consensus 71 ~~~~~~~l~~~~~~~~----~~~si~~~~~------~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~ 140 (169) ........ ...++.. ....+.-..+ +....-+.+..|+++..||. T Consensus 79 g~~~~~vg-v~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~y~~fvEfGT------------------------ 133 (179) T protein:vir:18 79 GDLAFRVG-VMGGARQYANTKANVRKGRAGKTYKTSGDKGNPGGDTWYWRFLEFGT------------------------ 133 (179) T ss_pred cceeEeee-cccccccccccccccccCcccccccccccccCCCCccceeEEeccCC------------------------ Confidence 00000000 0000000 0000000001 11112235678999999995 Q ss_pred eeccCcccCCCCH-HHHHHHHHHHHHhcCC Q lcl|NC_020839. 141 GDIPARPFMGISA-GDQENIEAALMEWLEP 169 (169) Q Consensus 141 ~~iPaRpfLG~s~-~d~~~I~~~i~~~l~p 169 (169) .++||+|||.-.- +.++.+.+.|.+.|+= T Consensus 134 ~kmpa~PFlrPA~~~~~~~a~~~i~~~l~~ 163 (179) T protein:vir:18 134 EHTSARPILRPAMNGVDNDVINVFSTEMGK 163 (179) T ss_pred CCCCCCccchhhHHhhHHHHHHHHHHHHHH Confidence 3699999998653 3455555555555544 No 74 >protein:vir:1332 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047931;swissprot:trembl:q9zxa7;genbank:gi:9631149;uniprot:Q9ZXA7;genbank:GeneID:2715891 Probab=97.00 E-value=6e-06 Score=49.23 Aligned_cols=122 Identities=21% Similarity=0.311 Sum_probs=83.6 Q ss_pred CeEEEEch-HHHHHHHHHHHHH--hhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchh Q lcl|NC_020839. 1 MFTVDVKD-KELEAVFSGLEGR--LSDPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKP 77 (169) Q Consensus 1 mi~i~~~~-~~l~~~L~~l~~~--~~~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~ 77 (169) ...|.|++ ..+...|.++... ..+++..+.++|+.+..... +..|+|..=++.|..+ T Consensus 6 ~~~vkV~Glr~f~~~mrK~~g~dl~k~lk~a~~~aa~v~~~~ar-----~~tP~g~~~p~~srr~--------------- 65 (143) T protein:vir:13 6 AYTIQVDGLRQFQRNVRALRDKELNKAVREANKASGEVLIPQAK-----HESPDGHRDPKSSKRY--------------- 65 (143) T ss_pred chheehHHHHHHHHHHHHhhCCcchHHHHHHHHHHHHHHHHHHH-----hhcCCccccccccccc--------------- Confidence 56777777 6788888887433 36788888888888776543 5689998766655332 Q ss_pred hhhhhhhhhhhhhhheecCCc--EEEecC--cccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCC-- Q lcl|NC_020839. 78 LIGPTKTLSSPSNFAVSSGGD--WARLSS--RAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGI-- 151 (169) Q Consensus 78 l~~~~~~~~~~~si~~~~~~~--~v~vGt--~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~-- 151 (169) +++++..||.+..+.. .|..|. .++||..-|||...+ +|-++.||=- T Consensus 66 -----r~G~L~~Sir~aaT~raa~VrAGr~arVPYA~~I~~G~r~r----------------------~Is~~rFl~~a~ 118 (143) T protein:vir:13 66 -----RPGKLDKSIKVTASAKGAVIKAGSAARVPYAAAIHFGYRKR----------------------NISANRFLYRAM 118 (143) T ss_pred -----ccchhhccccccccccceeeeecCcCCCCcccccccCCccc----------------------ccchhhhhhhhh Confidence 2455556777666544 466674 489999999996432 3446666641 Q ss_pred -------CHHHHHHHHHHHHHhcCC Q lcl|NC_020839. 152 -------SAGDQENIEAALMEWLEP 169 (169) Q Consensus 152 -------s~~d~~~I~~~i~~~l~p 169 (169) +.--|+.|..+++.||.- T Consensus 119 a~te~~~~r~Ye~~i~~vl~k~l~s 143 (143) T protein:vir:13 119 ARKSDVVAATYERRIAAVVEKYLES 143 (143) T ss_pred hccCHHHHHHHHHHHHHHHHHHhcC Confidence 222378999999999999 No 75 >protein:vir:102154 Length: 119 # NCBI annotation: phage protein, HK97 gp10 family # Family: family:all:10671 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699937;genbank:gi:110804042;genbank:GeneID:4206698 Probab=96.97 E-value=1.4e-05 Score=47.23 Aligned_cols=108 Identities=11% Similarity=0.116 Sum_probs=72.1 Q ss_pred CeEEEEch-HHHHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhh Q lcl|NC_020839. 1 MFTVDVKD-KELEAVFSGLEGRLSD-PSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPL 78 (169) Q Consensus 1 mi~i~~~~-~~l~~~L~~l~~~~~~-~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l 78 (169) |.+|++++ ++|...|++|...... ....++..|+.+..+.+.| . ||. | ++ + T Consensus 1 Ma~iel~G~del~~~l~~~g~~~~~ie~kAlk~g~e~I~~~~~~n-----~----P~~-----t-------g~------l 53 (119) T protein:vir:10 1 MASLEIEGFEEFEKFISEDMVLDESTKRKGIKAGITKIGKAIEKN-----S----PIK-----S-------GR------L 53 (119) T ss_pred CceeehhhHHHHHHHHHhhhhhhHHHHHHHHHHHhHHHHHHHhhc-----C----Ccc-----c-------CC------c Confidence 99999998 8899989888766653 4567888888877765332 2 331 0 00 0 Q ss_pred hhhhhhhhhhhhhheecCCcEEEecCcc---cchhhhhcccccccchhhhhhhhccccccccCceeeccCc-ccCCCC-- Q lcl|NC_020839. 79 IGPTKTLSSPSNFAVSSGGDWARLSSRA---IQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPAR-PFMGIS-- 152 (169) Q Consensus 79 ~~~~~~~~~~~si~~~~~~~~v~vGt~~---~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaR-pfLG~s-- 152 (169) . . .+.... ....++||.+. -|+-.+.||+. .+||+ ||+.=+ T Consensus 54 k------k--ik~~~k-k~g~~~VG~~ks~~fy~kF~EFGTS------------------------km~a~~pF~~~a~~ 100 (119) T protein:vir:10 54 S------K--VKIRVK-NTGLATEGTASSSEFYDIFQNFGTS------------------------EQKAHVGYFDRAVD 100 (119) T ss_pred c------e--eeeeee-cCceeEeccCCcchhhhhhcccccc------------------------ccCCCCCccccccc Confidence 0 0 011111 23378888776 69999999974 69999 999843 Q ss_pred ---HHHHHHHHHHHHHhcC Q lcl|NC_020839. 153 ---AGDQENIEAALMEWLE 168 (169) Q Consensus 153 ---~~d~~~I~~~i~~~l~ 168 (169) ++..+.|.+++.+-++ T Consensus 101 ~~~~eA~~~~~~el~~~~r 119 (119) T protein:vir:10 101 ETTNEAVEEVAEIIFRKMR 119 (119) T ss_pred cChHHHHHHHHHHHHHhcC Confidence 3446777777777778 No 76 >protein:vir:96105 Length: 193 # NCBI annotation: hypothetical protein ORF028 # Family: family:all:503 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294445;genbank:gi:149408342;genbank:GeneID:5237224 Probab=96.84 E-value=1.1e-05 Score=47.87 Aligned_cols=81 Identities=21% Similarity=0.313 Sum_probs=56.3 Q ss_pred CeEEEEch--HHHHHHHHHHHHHh----hhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCcccc Q lcl|NC_020839. 1 MFTVDVKD--KELEAVFSGLEGRL----SDPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVS 74 (169) Q Consensus 1 mi~i~~~~--~~l~~~L~~l~~~~----~~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~ 74 (169) .+..++++ +.+.+++..+...+ -+...+|..+|..+...+++.|++. +|+|++++|+++|. . T Consensus 107 Flr~t~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~~G~~~~~~ik~~I~~~------~~ppna~~Ti~~KG------~ 174 (193) T protein:vir:96 107 FMRYAWNLFSADRAAIQNRIAMRLARGQITPDQALAQIGLALEGYIARSIRTG------PWVANSASTVRRKG------F 174 (193) T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcC------CCCCCcHHHHHHhC------C Confidence 44455454 34556555555443 2678999999999999999999873 59999999997552 4 Q ss_pred chhhhhhhhhhhhhhhhheecC Q lcl|NC_020839. 75 FKPLIGPTKTLSSPSNFAVSSG 96 (169) Q Consensus 75 ~~~l~~~~~~~~~~~si~~~~~ 96 (169) ++||.. |+.+..||++... T Consensus 175 ~~PLid---TG~l~~SIty~Vv 193 (193) T protein:vir:96 175 NRPLVD---TAHMLQSISSRVT 193 (193) T ss_pred CCchhH---HHHHHhhhcceeC Confidence 667764 4455557765433 No 77 >protein:vir:1386 Length: 149 # NCBI annotation: Gp9 protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612838;genbank:gi:20065972;genbank:GeneID:935787 Probab=96.80 E-value=1.7e-05 Score=46.77 Aligned_cols=124 Identities=10% Similarity=0.122 Sum_probs=65.4 Q ss_pred Ce---EEEEch-HHHHHHHHHHH--HHhhh-HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccc Q lcl|NC_020839. 1 MF---TVDVKD-KELEAVFSGLE--GRLSD-PSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTV 73 (169) Q Consensus 1 mi---~i~~~~-~~l~~~L~~l~--~~~~~-~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~ 73 (169) |- +|++++ ++|.+.|+.|. ..... .+..+..-|+.+.+..+.+.-. +.+. +.+....+ . T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~~k~~~~Al~~ga~~v~~~~k~~aP~--~~~~--~~~~~~~~----------~ 66 (149) T protein:vir:13 1 MSDGWEIKFEGLDDLIKTFEQLGTEKENEDVEKSILKECGDLAKKTVAPLIHI--SDDN--SKSGRKGS----------R 66 (149) T ss_pred CCceeEEEeecHHHHHHHHHhcccHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--cCCc--cccccccc----------c Confidence 66 456655 77777787773 22222 4577888888888887777533 2221 21111000 0 Q ss_pred cchhhhhhhhhhhhhh---hhheecCCcEEEec------CcccchhhhhcccccccchhhhhhhhccccccccCceeecc Q lcl|NC_020839. 74 SFKPLIGPTKTLSSPS---NFAVSSGGDWARLS------SRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIP 144 (169) Q Consensus 74 ~~~~l~~~~~~~~~~~---si~~~~~~~~v~vG------t~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iP 144 (169) . .++++..+ .+....+...+.|| ++..|+.+..||. .++| T Consensus 67 ~-------~~~~~d~i~~~~~~~~~g~~~~~VG~~~~~~~~~~y~~f~E~GT------------------------~k~~ 115 (149) T protein:vir:13 67 P-------PGHAANNIPEPKIRKKKGNLQCVVGWEKSDNTPFYYMKMEEWGT------------------------SERP 115 (149) T ss_pred c-------cchhhhcceecccccccceeEEEeeccCCCCCccceeeeeccCc------------------------cCCC Confidence 0 01111111 11112234456775 4568999999996 3699 Q ss_pred CcccCCCCH-HHHHHHHHHHHHhcCC Q lcl|NC_020839. 145 ARPFMGISA-GDQENIEAALMEWLEP 169 (169) Q Consensus 145 aRpfLG~s~-~d~~~I~~~i~~~l~p 169 (169) ++|||.=+- +.+.++.+++.+.|+= T Consensus 116 a~pF~~pa~~~~~~~~~~~~~~~l~k 141 (149) T protein:vir:13 116 PHHAFGKTNKILKRVYDNIAQKKYDN 141 (149) T ss_pred CCccchHHHHHHHHHHHHHHHHHHHH Confidence 999988442 2334444444443332 No 78 >protein:vir:6246 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813700;swissprot:trembl:q859b7;genbank:gi:29366760;uniprot:Q859B7;genbank:GeneID:1258903 Probab=96.80 E-value=1.1e-05 Score=47.75 Aligned_cols=122 Identities=21% Similarity=0.297 Sum_probs=82.0 Q ss_pred CeEEEEch-HHHHHHHHHHHHH--hhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchh Q lcl|NC_020839. 1 MFTVDVKD-KELEAVFSGLEGR--LSDPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKP 77 (169) Q Consensus 1 mi~i~~~~-~~l~~~L~~l~~~--~~~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~ 77 (169) ...|.|++ ..+...|..+... ..+++..+.++|+.+..... +..|+|..=+..+.. T Consensus 6 ~~~vrV~Glr~f~~~mrK~~g~dl~k~lk~a~~~aa~v~~~~ar-----~~tP~g~r~~~~s~~---------------- 64 (143) T protein:vir:62 6 AYTIRVDGLREFQRNVRTLRDKELNKAVREANKASGEVLIPQAK-----HESPDGKRDAKSSKK---------------- 64 (143) T ss_pred chheehHHHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHH-----hhcCCcccccccccc---------------- Confidence 56788887 6788888888433 36788888888888776543 568998644332211 Q ss_pred hhhhhhhhhhhhhhheecCCc--EEEecC--cccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCC-- Q lcl|NC_020839. 78 LIGPTKTLSSPSNFAVSSGGD--WARLSS--RAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGI-- 151 (169) Q Consensus 78 l~~~~~~~~~~~si~~~~~~~--~v~vGt--~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~-- 151 (169) ..++++..||.+..+.. .|..|. .++||+.-|||...+ +|-.+.||-- T Consensus 65 ----~r~G~L~~Sir~aaT~raa~VrAG~~krVPYA~~I~~G~r~r----------------------~Isp~rFl~~a~ 118 (143) T protein:vir:62 65 ----YRPGKLDKSIKVTASAKGAVIKAGSASRVPYAAAIHFGYRAR----------------------NISPNRFLFRAM 118 (143) T ss_pred ----cCcchhhccccccccccceeeeeCCcCCCCcccccccCcccc----------------------cccchhhhhhhh Confidence 12455566777666544 466687 789999999996532 2334555531 Q ss_pred -------CHHHHHHHHHHHHHhcCC Q lcl|NC_020839. 152 -------SAGDQENIEAALMEWLEP 169 (169) Q Consensus 152 -------s~~d~~~I~~~i~~~l~p 169 (169) +.--|+.|..+++.||.- T Consensus 119 a~te~~~~r~Ye~~i~~vl~k~l~s 143 (143) T protein:vir:62 119 ARKSDVVAATYERRIAAVVEKYLES 143 (143) T ss_pred hccCHHHHHHHHHHHHHHHHHHhcC Confidence 222378999999999999 No 79 >protein:vir:107757 Length: 189 # NCBI annotation: gp20 # Family: family:all:503 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024868;genbank:gi:48697510;genbank:GeneID:2948378 Probab=96.71 E-value=1e-05 Score=47.99 Aligned_cols=91 Identities=12% Similarity=0.004 Sum_probs=61.2 Q ss_pred CeEEEEch--HHHHHHHHHHHHHh----hhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCcc-- Q lcl|NC_020839. 1 MFTVDVKD--KELEAVFSGLEGRL----SDPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQT-- 72 (169) Q Consensus 1 mi~i~~~~--~~l~~~L~~l~~~~----~~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~-- 72 (169) -+.-++++ ++..+.|....... .+...+|..||..+...++..|.+. .|+|++++|++.|..++.. T Consensus 64 Flr~t~~~~~~~~~~~l~~~~~~vl~G~~~~~~~L~~~G~~a~~~Ik~~I~~~------~~ppna~sTi~~Kg~~~~~~~ 137 (189) T protein:vir:10 64 FIRPTIAAQQAAWSQQMRFYAKQIVVGQMNVEQALEGLAIVARGDVDATLARL------KDPPLSPLTIYIRKFIKDGGV 137 (189) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcC------CCCCCcHHHHHHhcccCcccc Confidence 55555554 44556555555442 3678999999999999999999874 5999999999888754322 Q ss_pred ---------------------------ccchhhhhhhhhhhhhhhhheecCCcEE Q lcl|NC_020839. 73 ---------------------------VSFKPLIGPTKTLSSPSNFAVSSGGDWA 100 (169) Q Consensus 73 ---------------------------~~~~~l~~~~~~~~~~~si~~~~~~~~v 100 (169) .+++||.. |+.+..||++......+ T Consensus 138 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~kPLid---TG~l~~SIty~V~~k~~ 189 (189) T protein:vir:10 138 IHGYKDIMRLRSEMQQEQAKGTLNLSGVSTDPLDF---TGYMRATLSYTVTKEKS 189 (189) T ss_pred hhhhhhhhhhhhhhhhhhhhccccccccCCCchhh---HHHHHhhcceeeeecCC Confidence 24567764 44445567665433322 No 80 >protein:vir:105467 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529877;genbank:gi:90592617;genbank:GeneID:3974531 Probab=96.67 E-value=3.5e-05 Score=45.01 Aligned_cols=130 Identities=15% Similarity=0.100 Sum_probs=70.7 Q ss_pred CeEEEEchHHHHHHHHHHHHHhh--hHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhh Q lcl|NC_020839. 1 MFTVDVKDKELEAVFSGLEGRLS--DPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPL 78 (169) Q Consensus 1 mi~i~~~~~~l~~~L~~l~~~~~--~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l 78 (169) |..-.++.+.|++.+..|..... +..+.+....+.+...+.+.+.+. .| . T Consensus 1 Ms~~~id~~gl~~~~~~l~~~~~~~~~~~~~~~~l~~~~~~~~~~vk~~-tP----V----------------------- 52 (144) T protein:vir:10 1 MSLGHVDDAQFQQFASRVRQKIDSGYVKQELGKSSRRIGTQSLRILEAN-TP----V----------------------- 52 (144) T ss_pred CCCCCccHHHHHHHHHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHh-CC----C----------------------- Confidence 77666676677777666665442 344555555555555555555443 34 1 Q ss_pred hhhhhhhhhhhhhhe-----ecCCcEEEecCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCCH Q lcl|NC_020839. 79 IGPTKTLSSPSNFAV-----SSGGDWARLSSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGISA 153 (169) Q Consensus 79 ~~~~~~~~~~~si~~-----~~~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s~ 153 (169) +|+.+..||+. +.+.-.++||++++||..-+||-+..... +....+...... -+|.++||=.+. T Consensus 53 ----dTG~Lr~S~~~~~~~~~~~~~~~~V~n~~~YA~~VE~Ghr~~~G~-----~v~~~~~~~~~g--~V~G~~~~~~a~ 121 (144) T protein:vir:10 53 ----KQGNLRRSWTAEGPTYGCGGWTIKLINNAEYASYVESGHRQTPGR-----YVPVLKKRLVRD--WVPGQFYMKKSI 121 (144) T ss_pred ----CcchhccceeecceeeecCeeEEEEecCCCcccccccceeecCCc-----ccccCCCccccc--eecCccchHHHH Confidence 12333334432 23445689999999999999997543211 011111111112 367788866554 Q ss_pred H-HHHHHHHHHHHhcCC Q lcl|NC_020839. 154 G-DQENIEAALMEWLEP 169 (169) Q Consensus 154 ~-d~~~I~~~i~~~l~p 169 (169) + -+..+..+|.++|+= T Consensus 122 ~~~~~~~~~~l~k~l~~ 138 (144) T protein:vir:10 122 PQIQRQLPQLVTEGLWG 138 (144) T ss_pred HHHHHHHHHHHHHHHHH Confidence 3 345555666665555 No 81 >protein:vir:99546 Length: 200 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039796;genbank:gi:126011046;genbank:GeneID:4818241 Probab=96.67 E-value=1.5e-05 Score=47.08 Aligned_cols=81 Identities=27% Similarity=0.379 Sum_probs=54.8 Q ss_pred CeEEEEch--HHHHHHHHHHHHHh----hhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCcccc Q lcl|NC_020839. 1 MFTVDVKD--KELEAVFSGLEGRL----SDPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVS 74 (169) Q Consensus 1 mi~i~~~~--~~l~~~L~~l~~~~----~~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~ 74 (169) .+.-++++ +.+.+.+.+.+..+ -+...+|..||..+...+++.|++. +|+|++++|+++|. . T Consensus 114 Flr~t~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~~G~~~~~~ik~~I~~~------~~ppna~sTi~~Kg------~ 181 (200) T protein:vir:99 114 FMRLAWATFNKDKVKIQAQIARQLLDGTINPEQALAQIGLALEGCIVRSIKSG------PWAANSPATIRAKG------F 181 (200) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcC------CCCCChHHHHHHhC------C Confidence 34444443 34455554444433 2677999999999999999999863 59999999997542 4 Q ss_pred chhhhhhhhhhhhhhhhheecC Q lcl|NC_020839. 75 FKPLIGPTKTLSSPSNFAVSSG 96 (169) Q Consensus 75 ~~~l~~~~~~~~~~~si~~~~~ 96 (169) ++||.. |+.+..||++... T Consensus 182 ~~PLid---TG~l~~SIty~Ve 200 (200) T protein:vir:99 182 DKPLID---TAHMWQTVSSKVS 200 (200) T ss_pred CCchHH---HHHHHhHhccccC Confidence 567764 4445557766554 No 82 >protein:vir:105007 Length: 146 # NCBI annotation: conserved phage protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459972;genbank:gi:85701387;genbank:GeneID:3882148 Probab=96.56 E-value=3.8e-05 Score=44.82 Aligned_cols=125 Identities=14% Similarity=0.202 Sum_probs=65.3 Q ss_pred CeEEEEch-HHHHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhh Q lcl|NC_020839. 1 MFTVDVKD-KELEAVFSGLEGRLSD-PSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPL 78 (169) Q Consensus 1 mi~i~~~~-~~l~~~L~~l~~~~~~-~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l 78 (169) .++|+|++ ++|.+.|+.|...... .+..+..-|+.+.+..+.+.-.... +++........ T Consensus 4 ~~~~~i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~-------~~~~~~~~~~~----------- 65 (146) T protein:vir:10 4 GIDLDLLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPS-------PKKRSKSEPWR----------- 65 (146) T ss_pred ceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccc-------ccccccccccc----------- Confidence 45677776 7788888888765432 4566666677777766666422111 11111000000 Q ss_pred hhhhhhhhhhh---hhheecCCcEEEec------CcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccC Q lcl|NC_020839. 79 IGPTKTLSSPS---NFAVSSGGDWARLS------SRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFM 149 (169) Q Consensus 79 ~~~~~~~~~~~---si~~~~~~~~v~vG------t~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfL 149 (169) .++.++..+ ......+...+.|| ++..||.+..||.. .+|++||| T Consensus 66 --~~~~~~~~i~~~~~~~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT~------------------------~~~a~PFl 119 (146) T protein:vir:10 66 --TGQHGADQIKVTKAKLEGGIKTVKIGLNKADRSPWFYLKFHEWGTS------------------------KMPAHPFI 119 (146) T ss_pred --ccccccccceeccccccccceeEEeeeccCCCCCcceeeeeccCCC------------------------CCCCCcch Confidence 001111111 11122234455555 44579999999953 58999999 Q ss_pred CCCH-HHHHHHHHHHHHhcCC Q lcl|NC_020839. 150 GISA-GDQENIEAALMEWLEP 169 (169) Q Consensus 150 G~s~-~d~~~I~~~i~~~l~p 169 (169) .-+- +.++.+.+.+.+.|+= T Consensus 120 ~pa~~~~k~~~~~~~~~~l~~ 140 (146) T protein:vir:10 120 EPGFNASKAEAVRAMTDILKN 140 (146) T ss_pred hHHHHHhHHHHHHHHHHHHHH Confidence 8543 3344455554444444 No 83 >protein:vir:102875 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338140;genbank:gi:77020200;genbank:GeneID:3703784 Probab=96.56 E-value=3.8e-05 Score=44.82 Aligned_cols=125 Identities=14% Similarity=0.202 Sum_probs=65.3 Q ss_pred CeEEEEch-HHHHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhh Q lcl|NC_020839. 1 MFTVDVKD-KELEAVFSGLEGRLSD-PSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPL 78 (169) Q Consensus 1 mi~i~~~~-~~l~~~L~~l~~~~~~-~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l 78 (169) .++|+|++ ++|.+.|+.|...... .+..+..-|+.+.+..+.+.-.... +++........ T Consensus 4 ~~~~~i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~-------~~~~~~~~~~~----------- 65 (146) T protein:vir:10 4 GIDLDLLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPS-------PKKRSKSEPWR----------- 65 (146) T ss_pred ceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccc-------ccccccccccc----------- Confidence 45677776 7788888888765432 4566666677777766666422111 11111000000 Q ss_pred hhhhhhhhhhh---hhheecCCcEEEec------CcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccC Q lcl|NC_020839. 79 IGPTKTLSSPS---NFAVSSGGDWARLS------SRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFM 149 (169) Q Consensus 79 ~~~~~~~~~~~---si~~~~~~~~v~vG------t~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfL 149 (169) .++.++..+ ......+...+.|| ++..||.+..||.. .+|++||| T Consensus 66 --~~~~~~~~i~~~~~~~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT~------------------------~~~a~PFl 119 (146) T protein:vir:10 66 --TGQHGADQIKVTKAKLEGGIKTVKIGLNKADRSPWFYLKFHEWGTS------------------------KMPAHPFI 119 (146) T ss_pred --ccccccccceeccccccccceeEEeeeccCCCCCcceeeeeccCCC------------------------CCCCCcch Confidence 001111111 11122234455555 44579999999953 58999999 Q ss_pred CCCH-HHHHHHHHHHHHhcCC Q lcl|NC_020839. 150 GISA-GDQENIEAALMEWLEP 169 (169) Q Consensus 150 G~s~-~d~~~I~~~i~~~l~p 169 (169) .-+- +.++.+.+.+.+.|+= T Consensus 120 ~pa~~~~k~~~~~~~~~~l~~ 140 (146) T protein:vir:10 120 EPGFNASKAEAVRAMTDILKN 140 (146) T ss_pred hHHHHHhHHHHHHHHHHHHHH Confidence 8543 3344455554444444 No 84 >protein:vir:107568 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338191;genbank:gi:77020147;genbank:GeneID:3703699 Probab=96.56 E-value=3.8e-05 Score=44.82 Aligned_cols=125 Identities=14% Similarity=0.202 Sum_probs=65.3 Q ss_pred CeEEEEch-HHHHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhh Q lcl|NC_020839. 1 MFTVDVKD-KELEAVFSGLEGRLSD-PSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPL 78 (169) Q Consensus 1 mi~i~~~~-~~l~~~L~~l~~~~~~-~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l 78 (169) .++|+|++ ++|.+.|+.|...... .+..+..-|+.+.+..+.+.-.... +++........ T Consensus 4 ~~~~~i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~-------~~~~~~~~~~~----------- 65 (146) T protein:vir:10 4 GIDLDLLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPS-------PKKRSKSEPWR----------- 65 (146) T ss_pred ceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccc-------ccccccccccc----------- Confidence 45677776 7788888888765432 4566666677777766666422111 11111000000 Q ss_pred hhhhhhhhhhh---hhheecCCcEEEec------CcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccC Q lcl|NC_020839. 79 IGPTKTLSSPS---NFAVSSGGDWARLS------SRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFM 149 (169) Q Consensus 79 ~~~~~~~~~~~---si~~~~~~~~v~vG------t~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfL 149 (169) .++.++..+ ......+...+.|| ++..||.+..||.. .+|++||| T Consensus 66 --~~~~~~~~i~~~~~~~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT~------------------------~~~a~PFl 119 (146) T protein:vir:10 66 --TGQHGADQIKVTKAKLEGGIKTVKIGLNKADRSPWFYLKFHEWGTS------------------------KMPAHPFI 119 (146) T ss_pred --ccccccccceeccccccccceeEEeeeccCCCCCcceeeeeccCCC------------------------CCCCCcch Confidence 001111111 11122234455555 44579999999953 58999999 Q ss_pred CCCH-HHHHHHHHHHHHhcCC Q lcl|NC_020839. 150 GISA-GDQENIEAALMEWLEP 169 (169) Q Consensus 150 G~s~-~d~~~I~~~i~~~l~p 169 (169) .-+- +.++.+.+.+.+.|+= T Consensus 120 ~pa~~~~k~~~~~~~~~~l~~ 140 (146) T protein:vir:10 120 EPGFNASKAEAVRAMTDILKN 140 (146) T ss_pred hHHHHHhHHHHHHHHHHHHHH Confidence 8543 3344455554444444 No 85 >protein:vir:102085 Length: 146 # NCBI annotation: head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512318;genbank:gi:89152487;genbank:GeneID:3953078 Probab=96.56 E-value=3.8e-05 Score=44.82 Aligned_cols=125 Identities=14% Similarity=0.202 Sum_probs=65.3 Q ss_pred CeEEEEch-HHHHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhh Q lcl|NC_020839. 1 MFTVDVKD-KELEAVFSGLEGRLSD-PSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPL 78 (169) Q Consensus 1 mi~i~~~~-~~l~~~L~~l~~~~~~-~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l 78 (169) .++|+|++ ++|.+.|+.|...... .+..+..-|+.+.+..+.+.-.... +++........ T Consensus 4 ~~~~~i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~-------~~~~~~~~~~~----------- 65 (146) T protein:vir:10 4 GIDLDLLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPS-------PKKRSKSEPWR----------- 65 (146) T ss_pred ceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccc-------ccccccccccc----------- Confidence 45677776 7788888888765432 4566666677777766666422111 11111000000 Q ss_pred hhhhhhhhhhh---hhheecCCcEEEec------CcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccC Q lcl|NC_020839. 79 IGPTKTLSSPS---NFAVSSGGDWARLS------SRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFM 149 (169) Q Consensus 79 ~~~~~~~~~~~---si~~~~~~~~v~vG------t~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfL 149 (169) .++.++..+ ......+...+.|| ++..||.+..||.. .+|++||| T Consensus 66 --~~~~~~~~i~~~~~~~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT~------------------------~~~a~PFl 119 (146) T protein:vir:10 66 --TGQHGADQIKVTKAKLEGGIKTVKIGLNKADRSPWFYLKFHEWGTS------------------------KMPAHPFI 119 (146) T ss_pred --ccccccccceeccccccccceeEEeeeccCCCCCcceeeeeccCCC------------------------CCCCCcch Confidence 001111111 11122234455555 44579999999953 58999999 Q ss_pred CCCH-HHHHHHHHHHHHhcCC Q lcl|NC_020839. 150 GISA-GDQENIEAALMEWLEP 169 (169) Q Consensus 150 G~s~-~d~~~I~~~i~~~l~p 169 (169) .-+- +.++.+.+.+.+.|+= T Consensus 120 ~pa~~~~k~~~~~~~~~~l~~ 140 (146) T protein:vir:10 120 EPGFNASKAEAVRAMTDILKN 140 (146) T ss_pred hHHHHHhHHHHHHHHHHHHHH Confidence 8543 3344455554444444 No 86 >protein:vir:93617 Length: 148 # NCBI annotation: putative structural component # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449299;genbank:gi:157166047;interpro:IPR010064;interpro:IPR011693;uniprot:Q6H9U2;genbank:GeneID:5580439 Probab=96.56 E-value=2.3e-05 Score=46.06 Aligned_cols=128 Identities=13% Similarity=0.159 Sum_probs=61.1 Q ss_pred Ce--EEEEch-HHHHHHHHHHHHHhhh--HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhH-HHHH-hccCccc Q lcl|NC_020839. 1 MF--TVDVKD-KELEAVFSGLEGRLSD--PSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSAT-IKAY-ERRKQTV 73 (169) Q Consensus 1 mi--~i~~~~-~~l~~~L~~l~~~~~~--~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t-~~~~-~~~~~~~ 73 (169) |- +|++++ ++|.+.|+.|.....+ .+..+..-|+.+.+..+.+ .|.-+ + .++... .... ...+.. T Consensus 1 mm~~~~~i~Gldel~~~l~~L~~~~~~~~~~~Al~~~a~~v~~~ak~~-----aP~~~-g-~l~~~i~~~~~~~~~g~~- 72 (148) T protein:vir:93 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSR-----APVRR-G-KLRRNVVVLSRRSRDGGM- 72 (148) T ss_pred CcceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhh-----CCCCc-c-hhhhhceeccccccCCce- Confidence 55 445554 6788888888655443 3567777788888877766 34211 0 000000 0000 000000 Q ss_pred cchhhhhhhhhhhhhhhhheecCCcEEE----ecCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccC Q lcl|NC_020839. 74 SFKPLIGPTKTLSSPSNFAVSSGGDWAR----LSSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFM 149 (169) Q Consensus 74 ~~~~l~~~~~~~~~~~si~~~~~~~~v~----vGt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfL 149 (169) .. ..... .+....+..... -+++..|+....||.. .+|++||| T Consensus 73 ~~-------~v~~~--~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~------------------------~~pa~PFl 119 (148) T protein:vir:93 73 ES-------GVHIR--GVNPDTGNSDNTMKADNPRNAFYWRFVEMGTV------------------------NMPPHPFV 119 (148) T ss_pred ee-------eeeec--ccccccccccceeecCCCCCcceeeeeccCCC------------------------CCCCCcch Confidence 00 00000 000000111111 1355679999999953 59999999 Q ss_pred CCCH-HHHHHHHHHHHHhcCC Q lcl|NC_020839. 150 GISA-GDQENIEAALMEWLEP 169 (169) Q Consensus 150 G~s~-~d~~~I~~~i~~~l~p 169 (169) .-+- +.++.+.+++.+.|+= T Consensus 120 ~pA~~~~k~~~~~~~~~~~~~ 140 (148) T protein:vir:93 120 RPAFDVRSEQAAQVAIARMNR 140 (148) T ss_pred hHHHHHhHHHHHHHHHHHHHH Confidence 8553 2334444444444444 No 87 >protein:vir:97088 Length: 157 # NCBI annotation: hypothetical protein # Family: family:all:2714 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453568;genbank:gi:84662603;genbank:GeneID:5142503 Probab=96.50 E-value=6.3e-05 Score=43.61 Aligned_cols=132 Identities=17% Similarity=0.091 Sum_probs=63.2 Q ss_pred CeEE-EEchHHHHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhh Q lcl|NC_020839. 1 MFTV-DVKDKELEAVFSGLEGRLSD-PSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPL 78 (169) Q Consensus 1 mi~i-~~~~~~l~~~L~~l~~~~~~-~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l 78 (169) =++| ++|.++|.+.|+.|.+.... +++.+.+-|+.++++.+.+- |. + T Consensus 2 ~~~~~~~d~s~l~~~l~~l~~~~~~v~R~A~~~ga~vv~dear~~a-----P~-------~------------------- 50 (157) T protein:vir:97 2 KFSIRSVDITGILAGLETVVEHSSDVVRTMTYESAVAVRESAKAFV-----ND-------E------------------- 50 (157) T ss_pred eeEeecccHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhC-----CC-------C------------------- Confidence 1244 34557788888888654433 57788888888888777432 31 0 Q ss_pred hhhhhhhhhhhhhheec-------CCcEEEecC---cccchhhhhcccccccch-hhhhhhhccccccccCceeeccCcc Q lcl|NC_020839. 79 IGPTKTLSSPSNFAVSS-------GGDWARLSS---RAIQSAVMQFGAKKGAFG-SYQGKGFGGSSSTISIPWGDIPARP 147 (169) Q Consensus 79 ~~~~~~~~~~~si~~~~-------~~~~v~vGt---~~~YA~iHqfGg~~~~~~-~~~~~~~~~~~~~~~~~~~~iPaRp 147 (169) ++.+..||.+.. +.....||- ..+|+..+.||....... ......+.+.... ..-...|||+| T Consensus 51 -----tG~LkksI~~~~~~~~s~~g~~~~~Vg~~~~~a~~g~~vEfG~~~~~~~~~~~~~~~~~~~~~-~~t~~~~Pa~P 124 (157) T protein:vir:97 51 -----TGKLRNNLYVAYSPEESVEGIQTYAVSWRKKAAPHGHLLEFGHWQTHAAYRDKDGQWYSSKVK-LVNPKWIPAKP 124 (157) T ss_pred -----cchhhhheeeeeccccCCCceEEEEEeecCCccceeeeeecCcccccccccCCcccccccccc-cCCCCcCCCCc Confidence 111222332211 111223444 457888889994321111 1111111111111 12235699999 Q ss_pred cCCCC-----HHHHHH----HHHHHHHhcCC Q lcl|NC_020839. 148 FMGIS-----AGDQEN----IEAALMEWLEP 169 (169) Q Consensus 148 fLG~s-----~~d~~~----I~~~i~~~l~p 169 (169) ||--. ++-.+. |.+.|.+-|+= T Consensus 125 FlRPA~d~~k~~a~~~~~~~l~k~I~e~l~g 155 (157) T protein:vir:97 125 FLRPGYDSVAMQIPDIARAAGAKKYAELQRG 155 (157) T ss_pred ccchHHHHhHHHHHHHHHHHHHHHHHHHhcC Confidence 99832 122222 23333333333 No 88 >protein:vir:5257 Length: 148 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852762;genbank:gi:31544037;uniprot:Q7Y5T8;genbank:GeneID:2753554 Probab=96.47 E-value=1.6e-05 Score=46.85 Aligned_cols=81 Identities=15% Similarity=0.226 Sum_probs=57.0 Q ss_pred CeEEEEch--HHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhh Q lcl|NC_020839. 1 MFTVDVKD--KELEAVFSGLEGRLSDPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPL 78 (169) Q Consensus 1 mi~i~~~~--~~l~~~L~~l~~~~~~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l 78 (169) -+.-++++ +.+.+.|.++.....+...+|..+|..+...++..|.+. +|+|++++|+++|. .++|| T Consensus 66 flr~t~~~~~~~~~~~~~~~~~~~~~~~~~L~~~G~~~~~~ik~~I~~~------~~ppna~sTi~~Kg------~~~PL 133 (148) T protein:vir:52 66 FLRQTLEENQEKYTALFIQWFDQGVPAAQIYERLSVMAQGDVQMNIVKG------EWVANAKSTIRRKK------SSKPL 133 (148) T ss_pred hhHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhcC------CCCCCcHHHHHhcC------CCCch Confidence 45445554 456666666665555788999999999999999999863 59999999997542 45677 Q ss_pred hhhhhhhhhhhhhheecC Q lcl|NC_020839. 79 IGPTKTLSSPSNFAVSSG 96 (169) Q Consensus 79 ~~~~~~~~~~~si~~~~~ 96 (169) .. |+.+..||++... T Consensus 134 id---TG~l~~SIty~V~ 148 (148) T protein:vir:52 134 ID---TGKMRQSVRGIVK 148 (148) T ss_pred hH---HHHHHHHhhhhcC Confidence 64 4444456665433 No 89 >protein:vir:80037 Length: 199 # NCBI annotation: gp11 # Family: family:all:503 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468715;genbank:gi:157325295;genbank:GeneID:5601728 Probab=96.45 E-value=2e-05 Score=46.29 Aligned_cols=84 Identities=15% Similarity=0.198 Sum_probs=56.3 Q ss_pred CeEEEEch--HHHHHHHHHHHHHh----hhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCcccc Q lcl|NC_020839. 1 MFTVDVKD--KELEAVFSGLEGRL----SDPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVS 74 (169) Q Consensus 1 mi~i~~~~--~~l~~~L~~l~~~~----~~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~ 74 (169) -+.-++++ +++.+.+.+..... .+...+|..+|+.+...++..|.+. .|+|++++|++++++ . T Consensus 110 Flr~t~~~~~~~~~~~~~~~~~~vl~g~~~a~~~L~~~G~~~~~~Ik~~I~~~------~~ppna~~Tia~rKg-----~ 178 (199) T protein:vir:80 110 FLRSTFDEKSNKWGELFEGWIDDVIHGKLSAEQVYNRLGAKIVDDIQMKIVEI------QTPAKSAATLARNPR-----K 178 (199) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHhCCCcHHHHHHHHHHHHHHHHHHHHhcc------CCCCCCHHHHHHhcC-----C Confidence 34444443 45555555555442 3678999999999999999999763 499999999975432 3 Q ss_pred chhhhhhhhhhhhhhhhheecCCc Q lcl|NC_020839. 75 FKPLIGPTKTLSSPSNFAVSSGGD 98 (169) Q Consensus 75 ~~~l~~~~~~~~~~~si~~~~~~~ 98 (169) ++||.. |+.+.+||++..-+. T Consensus 179 ~kPLid---TG~l~~SIty~V~~~ 199 (199) T protein:vir:80 179 NNPLIV---TGKMKNSVTWKVMKS 199 (199) T ss_pred CCchHH---HHHHHhhcceeeeeC Confidence 567764 445555776654333 No 90 >protein:vir:79034 Length: 141 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110729;genbank:gi:134287346;genbank:GeneID:4955208 Probab=96.44 E-value=5.9e-05 Score=43.78 Aligned_cols=124 Identities=18% Similarity=0.183 Sum_probs=52.5 Q ss_pred CeEE-EEchHHHHHHHHHHHHHhh-h----HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCcccc Q lcl|NC_020839. 1 MFTV-DVKDKELEAVFSGLEGRLS-D----PSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVS 74 (169) Q Consensus 1 mi~i-~~~~~~l~~~L~~l~~~~~-~----~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~ 74 (169) |-+- .+|.+.|++....|..... + .+..++++|..+.+.+.++ .|=.+ T Consensus 1 M~~~~~~d~~gl~~~~~~l~~~~~~~~~~~~~~~~~~~a~~l~~~vk~~-----tPVdT--------------------- 54 (141) T protein:vir:79 1 MARWGSVDFREFKRVCKKMEKLTKIDLDKFCKDAARELAARLLGKVIRR-----TPVDT--------------------- 54 (141) T ss_pred CCCCccCcHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHh-----CCCcc--------------------- Confidence 5442 4444455555555533222 2 3455666666666555432 34110 Q ss_pred chhhhhhhhhhhhhhhhhe--ecCCcEEEecCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCC Q lcl|NC_020839. 75 FKPLIGPTKTLSSPSNFAV--SSGGDWARLSSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGIS 152 (169) Q Consensus 75 ~~~l~~~~~~~~~~~si~~--~~~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s 152 (169) +.|+..-..+....++.+ ..+...++|++|++||..-++|-++... ..-+|.+-+|=.+ T Consensus 55 -G~Lr~sw~~~~~~~~~~~~~~g~~~~v~v~n~~~YA~~VE~Ghr~~~~------------------~gfV~G~fml~~s 115 (141) T protein:vir:79 55 -GFLRQGWNGVAYARSLPVYKQGNNYIIEVVNPTEYASYVNFGHRTKDG------------------KGWVKGQHFLTIS 115 (141) T ss_pred -hhhcccccccccccccceeecCCeeEEEEecCCcchhhhhcceeecCC------------------cceeCCchhHHHH Confidence 000000000000011122 2234468999999999999999653210 0112333333233 Q ss_pred HH-HHHHH----HHHHHHhcCC Q lcl|NC_020839. 153 AG-DQENI----EAALMEWLEP 169 (169) Q Consensus 153 ~~-d~~~I----~~~i~~~l~p 169 (169) .+ -+..+ ...|.++|+= T Consensus 116 ~~~~~~~~~~~~~~~l~~~l~~ 137 (141) T protein:vir:79 116 EMELQSQVDKIIEKKLLILLKG 137 (141) T ss_pred HHHHHHHHHHHHHHHHHHHHHH Confidence 21 22222 3333333333 No 91 >protein:vir:4704 Length: 125 # NCBI annotation: phi PVL ORF 11 homologue # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061636;genbank:gi:9635723;genbank:GeneID:1262995 Probab=96.40 E-value=5.2e-05 Score=44.07 Aligned_cols=112 Identities=6% Similarity=0.060 Sum_probs=66.2 Q ss_pred eEEEEchHHHHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhhh Q lcl|NC_020839. 2 FTVDVKDKELEAVFSGLEGRLSD-PSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLIG 80 (169) Q Consensus 2 i~i~~~~~~l~~~L~~l~~~~~~-~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~~ 80 (169) -.|+++.++|+..|..|...... .+.+++.-|+.+.+..+.+ .|- |. ++ T Consensus 1 M~v~v~~~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~-----aP~---~~-------------~~--------- 50 (125) T protein:vir:47 1 MGARIESNNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSN-----TPF---AN-------------TK--------- 50 (125) T ss_pred CeeEeeHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CCC---CC-------------CC--------- Confidence 67777778899999998766543 3456666666665554444 341 11 00 Q ss_pred hhhhhhhhhhhhe-----ecCCcEEEecCcc---cchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCC Q lcl|NC_020839. 81 PTKTLSSPSNFAV-----SSGGDWARLSSRA---IQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGIS 152 (169) Q Consensus 81 ~~~~~~~~~si~~-----~~~~~~v~vGt~~---~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s 152 (169) ..++..+.++. ..+..++.||-+. -||...+||+ +++|++||+.=+ T Consensus 51 --~hl~d~I~vs~~k~~~~~g~~~v~VG~~k~~~~~a~F~E~GT------------------------~k~~a~pF~~~a 104 (125) T protein:vir:47 51 --KHARDHIAVSNVKTDRHTSEKIVTIGYAKGVSHRIHATEFGT------------------------MYQKPQLFITKT 104 (125) T ss_pred --chhhhheeecccccccccceEEEEeccCCCCceEEEeccCCc------------------------cCCCCCchhhHH Confidence 01222211110 1134468888765 4888999996 469999998855 Q ss_pred H-HHHHHHHHHHHHhcCC Q lcl|NC_020839. 153 A-GDQENIEAALMEWLEP 169 (169) Q Consensus 153 ~-~d~~~I~~~i~~~l~p 169 (169) - +..+++.+++.+.|+= T Consensus 105 ~~~~~~ev~~~~~~~lrk 122 (125) T protein:vir:47 105 EKQGKNKVLKTMLDTAKR 122 (125) T ss_pred HHHhHHHHHHHHHHHHHH Confidence 3 3455555555555444 No 92 >protein:vir:79988 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430006;genbank:gi:156604061;genbank:GeneID:5525448 Probab=96.40 E-value=5.2e-05 Score=44.07 Aligned_cols=112 Identities=6% Similarity=0.060 Sum_probs=66.2 Q ss_pred eEEEEchHHHHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhhh Q lcl|NC_020839. 2 FTVDVKDKELEAVFSGLEGRLSD-PSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLIG 80 (169) Q Consensus 2 i~i~~~~~~l~~~L~~l~~~~~~-~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~~ 80 (169) -.|+++.++|+..|..|...... .+.+++.-|+.+.+..+.+ .|- |. ++ T Consensus 1 M~v~v~~~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~-----aP~---~~-------------~~--------- 50 (125) T protein:vir:79 1 MGARIESNNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSN-----TPF---AN-------------TK--------- 50 (125) T ss_pred CeeEeeHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CCC---CC-------------CC--------- Confidence 67777778899999998766543 3456666666665554444 341 11 00 Q ss_pred hhhhhhhhhhhhe-----ecCCcEEEecCcc---cchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCC Q lcl|NC_020839. 81 PTKTLSSPSNFAV-----SSGGDWARLSSRA---IQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGIS 152 (169) Q Consensus 81 ~~~~~~~~~si~~-----~~~~~~v~vGt~~---~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s 152 (169) ..++..+.++. ..+..++.||-+. -||...+||+ +++|++||+.=+ T Consensus 51 --~hl~d~I~vs~~k~~~~~g~~~v~VG~~k~~~~~a~F~E~GT------------------------~k~~a~pF~~~a 104 (125) T protein:vir:79 51 --KHARDHIAVSNVKTDRHTSEKIVTIGYAKGVSHRIHATEFGT------------------------MYQKPQLFITKT 104 (125) T ss_pred --chhhhheeecccccccccceEEEEeccCCCCceEEEeccCCc------------------------cCCCCCchhhHH Confidence 01222211110 1134468888765 4888999996 469999998855 Q ss_pred H-HHHHHHHHHHHHhcCC Q lcl|NC_020839. 153 A-GDQENIEAALMEWLEP 169 (169) Q Consensus 153 ~-~d~~~I~~~i~~~l~p 169 (169) - +..+++.+++.+.|+= T Consensus 105 ~~~~~~ev~~~~~~~lrk 122 (125) T protein:vir:79 105 EKQGKNKVLKTMLDTAKR 122 (125) T ss_pred HHHhHHHHHHHHHHHHHH Confidence 3 3455555555555444 No 93 >protein:vir:98342 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918934;genbank:gi:119443696;genbank:GeneID:4594504 Probab=96.40 E-value=5.2e-05 Score=44.07 Aligned_cols=112 Identities=6% Similarity=0.060 Sum_probs=66.2 Q ss_pred eEEEEchHHHHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhhh Q lcl|NC_020839. 2 FTVDVKDKELEAVFSGLEGRLSD-PSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLIG 80 (169) Q Consensus 2 i~i~~~~~~l~~~L~~l~~~~~~-~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~~ 80 (169) -.|+++.++|+..|..|...... .+.+++.-|+.+.+..+.+ .|- |. ++ T Consensus 1 M~v~v~~~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~-----aP~---~~-------------~~--------- 50 (125) T protein:vir:98 1 MGARIESNNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSN-----TPF---AN-------------TK--------- 50 (125) T ss_pred CeeEeeHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CCC---CC-------------CC--------- Confidence 67777778899999998766543 3456666666665554444 341 11 00 Q ss_pred hhhhhhhhhhhhe-----ecCCcEEEecCcc---cchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCC Q lcl|NC_020839. 81 PTKTLSSPSNFAV-----SSGGDWARLSSRA---IQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGIS 152 (169) Q Consensus 81 ~~~~~~~~~si~~-----~~~~~~v~vGt~~---~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s 152 (169) ..++..+.++. ..+..++.||-+. -||...+||+ +++|++||+.=+ T Consensus 51 --~hl~d~I~vs~~k~~~~~g~~~v~VG~~k~~~~~a~F~E~GT------------------------~k~~a~pF~~~a 104 (125) T protein:vir:98 51 --KHARDHIAVSNVKTDRHTSEKIVTIGYAKGVSHRIHATEFGT------------------------MYQKPQLFITKT 104 (125) T ss_pred --chhhhheeecccccccccceEEEEeccCCCCceEEEeccCCc------------------------cCCCCCchhhHH Confidence 01222211110 1134468888765 4888999996 469999998855 Q ss_pred H-HHHHHHHHHHHHhcCC Q lcl|NC_020839. 153 A-GDQENIEAALMEWLEP 169 (169) Q Consensus 153 ~-~d~~~I~~~i~~~l~p 169 (169) - +..+++.+++.+.|+= T Consensus 105 ~~~~~~ev~~~~~~~lrk 122 (125) T protein:vir:98 105 EKQGKNKVLKTMLDTAKR 122 (125) T ss_pred HHHhHHHHHHHHHHHHHH Confidence 3 3455555555555444 No 94 >protein:vir:9414 Length: 125 # NCBI annotation: phi PVL orf 11-like protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803392;genbank:gi:29028704;genbank:GeneID:1258141 Probab=96.40 E-value=5.2e-05 Score=44.07 Aligned_cols=112 Identities=6% Similarity=0.060 Sum_probs=66.2 Q ss_pred eEEEEchHHHHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhhh Q lcl|NC_020839. 2 FTVDVKDKELEAVFSGLEGRLSD-PSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLIG 80 (169) Q Consensus 2 i~i~~~~~~l~~~L~~l~~~~~~-~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~~ 80 (169) -.|+++.++|+..|..|...... .+.+++.-|+.+.+..+.+ .|- |. ++ T Consensus 1 M~v~v~~~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~-----aP~---~~-------------~~--------- 50 (125) T protein:vir:94 1 MGARIESNNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSN-----TPF---AN-------------TK--------- 50 (125) T ss_pred CeeEeeHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CCC---CC-------------CC--------- Confidence 67777778899999998766543 3456666666665554444 341 11 00 Q ss_pred hhhhhhhhhhhhe-----ecCCcEEEecCcc---cchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCC Q lcl|NC_020839. 81 PTKTLSSPSNFAV-----SSGGDWARLSSRA---IQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGIS 152 (169) Q Consensus 81 ~~~~~~~~~si~~-----~~~~~~v~vGt~~---~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s 152 (169) ..++..+.++. ..+..++.||-+. -||...+||+ +++|++||+.=+ T Consensus 51 --~hl~d~I~vs~~k~~~~~g~~~v~VG~~k~~~~~a~F~E~GT------------------------~k~~a~pF~~~a 104 (125) T protein:vir:94 51 --KHARDHIAVSNVKTDRHTSEKIVTIGYAKGVSHRIHATEFGT------------------------MYQKPQLFITKT 104 (125) T ss_pred --chhhhheeecccccccccceEEEEeccCCCCceEEEeccCCc------------------------cCCCCCchhhHH Confidence 01222211110 1134468888765 4888999996 469999998855 Q ss_pred H-HHHHHHHHHHHHhcCC Q lcl|NC_020839. 153 A-GDQENIEAALMEWLEP 169 (169) Q Consensus 153 ~-~d~~~I~~~i~~~l~p 169 (169) - +..+++.+++.+.|+= T Consensus 105 ~~~~~~ev~~~~~~~lrk 122 (125) T protein:vir:94 105 EKQGKNKVLKTMLDTAKR 122 (125) T ss_pred HHHhHHHHHHHHHHHHHH Confidence 3 3455555555555444 No 95 >protein:vir:81106 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429878;genbank:gi:156603931;genbank:GeneID:5525326 Probab=96.40 E-value=5.2e-05 Score=44.07 Aligned_cols=112 Identities=6% Similarity=0.060 Sum_probs=66.2 Q ss_pred eEEEEchHHHHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhhh Q lcl|NC_020839. 2 FTVDVKDKELEAVFSGLEGRLSD-PSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLIG 80 (169) Q Consensus 2 i~i~~~~~~l~~~L~~l~~~~~~-~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~~ 80 (169) -.|+++.++|+..|..|...... .+.+++.-|+.+.+..+.+ .|- |. ++ T Consensus 1 M~v~v~~~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~-----aP~---~~-------------~~--------- 50 (125) T protein:vir:81 1 MGARIESNNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSN-----TPF---AN-------------TK--------- 50 (125) T ss_pred CeeEeeHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CCC---CC-------------CC--------- Confidence 67777778899999998766543 3456666666665554444 341 11 00 Q ss_pred hhhhhhhhhhhhe-----ecCCcEEEecCcc---cchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCC Q lcl|NC_020839. 81 PTKTLSSPSNFAV-----SSGGDWARLSSRA---IQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGIS 152 (169) Q Consensus 81 ~~~~~~~~~si~~-----~~~~~~v~vGt~~---~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s 152 (169) ..++..+.++. ..+..++.||-+. -||...+||+ +++|++||+.=+ T Consensus 51 --~hl~d~I~vs~~k~~~~~g~~~v~VG~~k~~~~~a~F~E~GT------------------------~k~~a~pF~~~a 104 (125) T protein:vir:81 51 --KHARDHIAVSNVKTDRHTSEKIVTIGYAKGVSHRIHATEFGT------------------------MYQKPQLFITKT 104 (125) T ss_pred --chhhhheeecccccccccceEEEEeccCCCCceEEEeccCCc------------------------cCCCCCchhhHH Confidence 01222211110 1134468888765 4888999996 469999998855 Q ss_pred H-HHHHHHHHHHHHhcCC Q lcl|NC_020839. 153 A-GDQENIEAALMEWLEP 169 (169) Q Consensus 153 ~-~d~~~I~~~i~~~l~p 169 (169) - +..+++.+++.+.|+= T Consensus 105 ~~~~~~ev~~~~~~~lrk 122 (125) T protein:vir:81 105 EKQGKNKVLKTMLDTAKR 122 (125) T ss_pred HHHhHHHHHHHHHHHHHH Confidence 3 3455555555555444 No 96 >protein:vir:3873 Length: 128 # NCBI annotation: putative head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680490;swissprot:trembl:p94214;genbank:gi:22296530;interpro:IPR010064;uniprot:P94214;genbank:GeneID:951688 Probab=96.35 E-value=4.6e-05 Score=44.36 Aligned_cols=115 Identities=14% Similarity=0.096 Sum_probs=69.2 Q ss_pred eEEEEch-HHHHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHHHHHHhcCCCCCC-CCcccchhHHHHHhccCccccchhh Q lcl|NC_020839. 2 FTVDVKD-KELEAVFSGLEGRLSD-PSELMAQIGELLLDSTLARFQAGKAPDGS-PWAPKSSATIKAYERRKQTVSFKPL 78 (169) Q Consensus 2 i~i~~~~-~~l~~~L~~l~~~~~~-~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~-~W~pl~~~t~~~~~~~~~~~~~~~l 78 (169) ++|+|++ ++|.+.|+.|...... .+..+..-|+.+....+++ .|-+. .+.+ T Consensus 1 m~v~i~Gl~el~~~l~~l~~~~~k~~~~al~~ga~~~~~~~k~~-----ap~~~~~~~~--------------------- 54 (128) T protein:vir:38 1 MGVKVTGDAELLANLNKLQFGVAKEARAAVRDGAQKFADKLKSN-----TPEWDGETDM--------------------- 54 (128) T ss_pred CccchhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHh-----CCCcCCCCcc--------------------- Confidence 8888887 8899999888766542 4566666676666666543 34211 0000 Q ss_pred hhhhhhhhhhhhh---heecCCcEEEecCc---ccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCC Q lcl|NC_020839. 79 IGPTKTLSSPSNF---AVSSGGDWARLSSR---AIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGIS 152 (169) Q Consensus 79 ~~~~~~~~~~~si---~~~~~~~~v~vGt~---~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s 152 (169) +++++..+.+ ....+...+.||.+ ..|+.+..||+ +++|++|||.-. T Consensus 55 ---~~h~~d~I~~~~~k~~~g~~~~~VG~~k~~~~y~~f~E~GT------------------------~k~~a~pF~~pa 107 (128) T protein:vir:38 55 ---SGHLRDDIKLSSVRETSGLTEVDVGYGKDTGWRAHFPNSGT------------------------SMQDPQHFIEET 107 (128) T ss_pred ---cchhhhhhccccccccCceeEEEeeecCCCceEEeeeccCc------------------------cCCCCCcchhHH Confidence 0112221111 11223345777754 46899999996 369999998854 Q ss_pred H-HHHHHHHHHHHHhcCC Q lcl|NC_020839. 153 A-GDQENIEAALMEWLEP 169 (169) Q Consensus 153 ~-~d~~~I~~~i~~~l~p 169 (169) - +.+.++.+++.+-|+= T Consensus 108 ~~~~~~~~~~~~~~~l~k 125 (128) T protein:vir:38 108 QEIMRPVVIAAFLSHLKE 125 (128) T ss_pred HHHhHHHHHHHHHHHHHh Confidence 3 3456666666666666 No 97 >protein:vir:194 Length: 149 # NCBI annotation: Gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037704;genbank:gi:9634169;genbank:GeneID:1262536 Probab=95.93 E-value=0.00019 Score=41.02 Aligned_cols=131 Identities=15% Similarity=0.172 Sum_probs=61.6 Q ss_pred CeEEEEch---HHHHHHHHHHHHHhhh--HHHHHHHHHHHHHHHHHHHHHhcCCCC--CCCCcccchhHHHHHhccCccc Q lcl|NC_020839. 1 MFTVDVKD---KELEAVFSGLEGRLSD--PSELMAQIGELLLDSTLARFQAGKAPD--GSPWAPKSSATIKAYERRKQTV 73 (169) Q Consensus 1 mi~i~~~~---~~l~~~L~~l~~~~~~--~~~l~~~Ig~~l~~~~~~rF~~q~~Pd--G~~W~pl~~~t~~~~~~~~~~~ 73 (169) |-+++++. ++|.+.|+.|...+.+ .+..+...|+.+.+..+.+ .|- |.=+...+-.+..... .+. . T Consensus 1 mm~~~~~i~Gl~~l~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~-----aP~~~g~l~~si~~~~~~~~~-~~~-~ 73 (149) T protein:vir:19 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIDR-----APVRTGKLKKNVVVVTQKSRR-RGE-I 73 (149) T ss_pred CcceeeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhh-----CCCCchhhhhhcccccccccc-ccc-e Confidence 76666664 5677777777665543 3567777777777777665 331 2111100000000000 000 0 Q ss_pred cchhhhhhhhhhhhhhhhheecCCcEE--EecCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCC Q lcl|NC_020839. 74 SFKPLIGPTKTLSSPSNFAVSSGGDWA--RLSSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGI 151 (169) Q Consensus 74 ~~~~l~~~~~~~~~~~si~~~~~~~~v--~vGt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~ 151 (169) ... ....... .........+ .-+++..|+.+..||+. .+|++|||.- T Consensus 74 ~~~-v~~~~~~------~~~~~~~~~~~~~~~~~~~y~~f~E~GT~------------------------~~~a~PF~~p 122 (149) T protein:vir:19 74 SSG-VHIRGVN------PRTGNSDNTMKANNPRNAFYWRFVELGTA------------------------NMPAHPFVRP 122 (149) T ss_pred eec-ccccccc------cccccccceeecCCCCccceeeeeccCCC------------------------CCCCCcchhH Confidence 000 0000000 0000011111 12356679999999963 5899999875 Q ss_pred CH-HHHHHHHHHHHHhcCC Q lcl|NC_020839. 152 SA-GDQENIEAALMEWLEP 169 (169) Q Consensus 152 s~-~d~~~I~~~i~~~l~p 169 (169) +- +.+..+.+++.+.|+= T Consensus 123 A~~~~k~~~~~~~~~~l~~ 141 (149) T protein:vir:19 123 AYDTREEEAASVAIARMNQ 141 (149) T ss_pred HHHHHHHHHHHHHHHHHHH Confidence 42 3344444444444444 No 98 >protein:vir:95062 Length: 116 # NCBI annotation: ORF044 # Family: family:all:180 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240827;genbank:gi:66394711;genbank:GeneID:5133856 Probab=95.72 E-value=6.3e-05 Score=43.61 Aligned_cols=108 Identities=13% Similarity=0.051 Sum_probs=52.2 Q ss_pred HHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhhhhhhhhhhhhh Q lcl|NC_020839. 11 LEAVFSGLEGRLSDPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLIGPTKTLSSPSN 90 (169) Q Consensus 11 l~~~L~~l~~~~~~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~~~~~~~~~~~s 90 (169) ++++ .+..+...+..+.+..+. ..| | +|+.|.+| T Consensus 1 v~~~----------v~~~~~~~~~~i~~~ak~-----~ap----v---------------------------~TG~Lr~S 34 (116) T protein:vir:95 1 MERW----------VKRGIAKTTAKIHNTIIS-----LMP----V---------------------------DTGYLRES 34 (116) T ss_pred ChHH----------HHHHHHHHHHHHHHHHHh-----hCC----c---------------------------cccccccc Confidence 1111 122334444444333322 122 2 13444456 Q ss_pred hheec--CCcEEEecCcccchhhhhcccccccchhhhh--hhhccccccccCc---eeeccCcccCCCCHHH-HHHHHHH Q lcl|NC_020839. 91 FAVSS--GGDWARLSSRAIQSAVMQFGAKKGAFGSYQG--KGFGGSSSTISIP---WGDIPARPFMGISAGD-QENIEAA 162 (169) Q Consensus 91 i~~~~--~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~--~~~~~~~~~~~~~---~~~iPaRpfLG~s~~d-~~~I~~~ 162 (169) |.+.. +.-.+.||++..||...+||..+........ +.+.+......+. ..-+||||||-=+-++ +..|... T Consensus 35 I~~~~~~~~~~~~V~~~~~Ya~yvE~GTg~~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~ 114 (116) T protein:vir:95 35 VTMDFKDGGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKNIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKY 114 (116) T ss_pred eeEEeecCcEEEEEecCCCccceeecCccccccCCCccccccccceeecCccceeeCCCCCCCcchHHHHHHHHHHHHHh Confidence 65443 4447899999999999999975433221111 1111111111122 2349999998755433 3444444 Q ss_pred HH Q lcl|NC_020839. 163 LM 164 (169) Q Consensus 163 i~ 164 (169) |. T Consensus 115 is 116 (116) T protein:vir:95 115 FS 116 (116) T ss_pred hC Confidence 44 No 99 >protein:vir:102963 Length: 163 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945289;genbank:gi:39653724;uniprot:Q708M3;genbank:GeneID:2672877 Probab=95.60 E-value=0.00024 Score=40.39 Aligned_cols=140 Identities=12% Similarity=0.137 Sum_probs=74.9 Q ss_pred eEEEEchHHHHHHHHHHHHHhh--h----HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccc Q lcl|NC_020839. 2 FTVDVKDKELEAVFSGLEGRLS--D----PSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSF 75 (169) Q Consensus 2 i~i~~~~~~l~~~L~~l~~~~~--~----~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~ 75 (169) -++.+|.++|++...+|..... + ....|.++|+.|.+.+.+|+=-+..+++ |.. .+....+..+...+ T Consensus 1 m~~~~d~~~l~~f~k~l~~~~~~~~~~~~~~~~~~e~a~~ll~~vk~rtPv~~~~~~--~~~---~~~~~~k~~k~~~~- 74 (163) T protein:vir:10 1 MSGGFDYRSFAKFANNFNRNANHAKVDRFMRQTLNYEGTELKSKVKERTPVGVYTDH--WVE---FTTKDGKHVKFWAS- 74 (163) T ss_pred CCCccCHHHHHHHHHHHHHHhhhcchHHHHHHHHHHHHHHHHHHHHHhCCcccchhh--hhh---hhhcccchhhhhcc- Confidence 6778888888888877754432 2 4567788888888877776643332221 111 00000000000000 Q ss_pred hhhhhhhhhhhhhhhhhe-----ecCCcEEEecCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCC Q lcl|NC_020839. 76 KPLIGPTKTLSSPSNFAV-----SSGGDWARLSSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMG 150 (169) Q Consensus 76 ~~l~~~~~~~~~~~si~~-----~~~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG 150 (169) .....++.+..|+.. ..+...|+|.++.+||..-.||=++.. + .-+|.+.+|= T Consensus 75 ---~~~k~tG~lr~swk~~~~~k~~~~~~v~v~N~~~YA~~VE~GHR~~~-----------------g--GfV~G~fml~ 132 (163) T protein:vir:10 75 ---AHGKQGGTLQKGWSKSRIEVSGRTYKQKVYNKVYYAPHVEYGHKTVN-----------------G--GFVPGQFFLH 132 (163) T ss_pred ---ccccccchhhccceecceeecCCceEEEEEecCCccchhhcceeecC-----------------C--ceeccchhhH Confidence 011122333334443 234456899999999999999954321 1 2377788877 Q ss_pred CCHHH-----HHHHHHHHHHhcCC Q lcl|NC_020839. 151 ISAGD-----QENIEAALMEWLEP 169 (169) Q Consensus 151 ~s~~d-----~~~I~~~i~~~l~p 169 (169) .|.+. ...|.+.|.++|+= T Consensus 133 ~s~~~~~~~~~~~~e~~l~~~l~k 156 (163) T protein:vir:10 133 KTVEDTKSDMEKRVRDKYDGFMRK 156 (163) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH Confidence 66543 23344444444444 No 100 >protein:vir:1243 Length: 116 # NCBI annotation: similar to phage Spp1 gp16.1 # Family: family:all:180 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510942;genbank:gi:17426276;genbank:GeneID:927389 Probab=95.48 E-value=0.0001 Score=42.52 Aligned_cols=108 Identities=13% Similarity=0.046 Sum_probs=52.7 Q ss_pred HHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhhhhhhhhhhhhh Q lcl|NC_020839. 11 LEAVFSGLEGRLSDPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLIGPTKTLSSPSN 90 (169) Q Consensus 11 l~~~L~~l~~~~~~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~~~~~~~~~~~s 90 (169) ++++ .+..+...+..+.+..+. ..| + +|+.|.+| T Consensus 1 v~~~----------v~~~~~~~~~~i~~~ak~-----~aP----v---------------------------~TG~Lr~S 34 (116) T protein:vir:12 1 MERW----------VKRGIAKTTAKIHNTIIS-----LMP----V---------------------------DTGYLRES 34 (116) T ss_pred ChHH----------HHHHHHHHHHHHHHHHHH-----hCC----c---------------------------Cccccccc Confidence 1111 223344444444443322 122 2 13444456 Q ss_pred hheec--CCcEEEecCcccchhhhhcccccccchhhh--hhhhccccccccCc---eeeccCcccCCCCHHH-HHHHHHH Q lcl|NC_020839. 91 FAVSS--GGDWARLSSRAIQSAVMQFGAKKGAFGSYQ--GKGFGGSSSTISIP---WGDIPARPFMGISAGD-QENIEAA 162 (169) Q Consensus 91 i~~~~--~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~--~~~~~~~~~~~~~~---~~~iPaRpfLG~s~~d-~~~I~~~ 162 (169) |.+.. +.-.+.||++..||...+||..+....... .+...+......+. ..-+|++|||-=+-++ +..|... T Consensus 35 I~~~~~~~~~~~~V~~~~~YA~yvE~GTg~~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~ 114 (116) T protein:vir:12 35 VTMDFKDGGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKY 114 (116) T ss_pred ceEEeecCcEEEEEecCCCcccccccCCcccccCCCcccccccceeeecCCceeeecCCcCCCcchHHHHHHHHHHHHHh Confidence 65443 444789999999999999996543222111 01111111111112 2349999998655333 4445555 Q ss_pred HH Q lcl|NC_020839. 163 LM 164 (169) Q Consensus 163 i~ 164 (169) |. T Consensus 115 i~ 116 (116) T protein:vir:12 115 FS 116 (116) T ss_pred hC Confidence 55 No 101 >protein:vir:97327 Length: 116 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240615;genbank:gi:66396305;genbank:GeneID:5133683 Probab=95.48 E-value=0.0001 Score=42.52 Aligned_cols=108 Identities=13% Similarity=0.046 Sum_probs=52.7 Q ss_pred HHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhhhhhhhhhhhhh Q lcl|NC_020839. 11 LEAVFSGLEGRLSDPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLIGPTKTLSSPSN 90 (169) Q Consensus 11 l~~~L~~l~~~~~~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~~~~~~~~~~~s 90 (169) ++++ .+..+...+..+.+..+. ..| + +|+.|.+| T Consensus 1 v~~~----------v~~~~~~~~~~i~~~ak~-----~aP----v---------------------------~TG~Lr~S 34 (116) T protein:vir:97 1 MERW----------VKRGIAKTTAKIHNTIIS-----LMP----V---------------------------DTGYLRES 34 (116) T ss_pred ChHH----------HHHHHHHHHHHHHHHHHH-----hCC----c---------------------------Cccccccc Confidence 1111 223344444444443322 122 2 13444456 Q ss_pred hheec--CCcEEEecCcccchhhhhcccccccchhhh--hhhhccccccccCc---eeeccCcccCCCCHHH-HHHHHHH Q lcl|NC_020839. 91 FAVSS--GGDWARLSSRAIQSAVMQFGAKKGAFGSYQ--GKGFGGSSSTISIP---WGDIPARPFMGISAGD-QENIEAA 162 (169) Q Consensus 91 i~~~~--~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~--~~~~~~~~~~~~~~---~~~iPaRpfLG~s~~d-~~~I~~~ 162 (169) |.+.. +.-.+.||++..||...+||..+....... .+...+......+. ..-+|++|||-=+-++ +..|... T Consensus 35 I~~~~~~~~~~~~V~~~~~YA~yvE~GTg~~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~ 114 (116) T protein:vir:97 35 VTMDFKDGGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKY 114 (116) T ss_pred ceEEeecCcEEEEEecCCCcccccccCCcccccCCCcccccccceeeecCCceeeecCCcCCCcchHHHHHHHHHHHHHh Confidence 65443 444789999999999999996543222111 01111111111112 2349999998655333 4445555 Q ss_pred HH Q lcl|NC_020839. 163 LM 164 (169) Q Consensus 163 i~ 164 (169) |. T Consensus 115 i~ 116 (116) T protein:vir:97 115 FS 116 (116) T ss_pred hC Confidence 55 No 102 >protein:vir:94069 Length: 168 # NCBI annotation: putative RNA polymerase # Family: family:all:503 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453622;genbank:gi:84662658;genbank:GeneID:5142579 Probab=93.81 E-value=0.0001 Score=42.40 Aligned_cols=78 Identities=17% Similarity=0.186 Sum_probs=31.0 Q ss_pred cchhHHHHHhccCccccchhhhhhhhhhhhhhhhheec-CCcEEEec------------------Ccccchhhhhccccc Q lcl|NC_020839. 58 KSSATIKAYERRKQTVSFKPLIGPTKTLSSPSNFAVSS-GGDWARLS------------------SRAIQSAVMQFGAKK 118 (169) Q Consensus 58 l~~~t~~~~~~~~~~~~~~~l~~~~~~~~~~~si~~~~-~~~~v~vG------------------t~~~YA~iHqfGg~~ 118 (169) -+. .++++ ....+...+-++. ....+.. .......| ++..+|++|.||. T Consensus 1 ~~~-------~~~~g--~~~~~~~~~~l~~-~~v~vG~l~~a~yp~G~~~~~~~~~~~~~~~~g~~va~Ia~~~E~G~-- 68 (168) T protein:vir:94 1 MTT-------IARKG--VKMPPHLEAQFQS-GEVKAGVLSGSTYPQMTYTDQRTGKQIEDARGGMPVAVIAQALEYGH-- 68 (168) T ss_pred Ccc-------ccchh--hhhhHHHHHhhhc-cceeeeccccCcccccccchhhcccccccccccccHHHHHHHHhcCC-- Confidence 110 00000 0011111111110 0111110 00001111 2346788999994 Q ss_pred ccchhhhhhhhccccccccCceeeccCcccCCCC-HHHHHHHHHHHHHhcCC Q lcl|NC_020839. 119 GAFGSYQGKGFGGSSSTISIPWGDIPARPFMGIS-AGDQENIEAALMEWLEP 169 (169) Q Consensus 119 ~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s-~~d~~~I~~~i~~~l~p 169 (169) ++||+||||=-+ ++..+++.+.+...|+= T Consensus 69 ----------------------~~IP~RPFlr~t~~~~~~~~~~~~~~~~~~ 98 (168) T protein:vir:94 69 ----------------------GQNHPRPFMQQTYAAQYRAWSRDLTLTLKA 98 (168) T ss_pred ----------------------CCCCCchhhHHHHHHHHHHHHHHHHHHHhc Confidence 479999999432 13344555555555555 No 103 >protein:vir:966 Length: 123 # NCBI annotation: Orf48 # Family: family:all:970 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076620;genbank:gi:13095728;genbank:GeneID:920248 Probab=93.51 E-value=0.0012 Score=36.50 Aligned_cols=114 Identities=14% Similarity=0.169 Sum_probs=60.9 Q ss_pred CeE-EEEch--HHHHHHHHHHHHHhh-hHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccch Q lcl|NC_020839. 1 MFT-VDVKD--KELEAVFSGLEGRLS-DPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFK 76 (169) Q Consensus 1 mi~-i~~~~--~~l~~~L~~l~~~~~-~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~ 76 (169) |-+ |++|+ +.+.+.|+....... +......++|..+.....+ .+|.- T Consensus 1 m~~~v~id~L~~~i~~~L~~y~~~v~~~v~~~v~~~a~~~~~~lk~-----~sP~~------------------------ 51 (123) T protein:vir:96 1 MANKISIDDLAKTIESEVRNWTKDVVDDIDDIKKDITKNGVKQLRE-----SSPKR------------------------ 51 (123) T ss_pred CCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-----hCCcc------------------------ Confidence 764 77775 566777777766554 3556666666666666553 45520 Q ss_pred hhhhhhhhhhhhhhhheec--CCcEEEecCcccc--hhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCC Q lcl|NC_020839. 77 PLIGPTKTLSSPSNFAVSS--GGDWARLSSRAIQ--SAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGIS 152 (169) Q Consensus 77 ~l~~~~~~~~~~~si~~~~--~~~~v~vGt~~~Y--A~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s 152 (169) ++.+..|+.+.. ++..+.+=++..| |.+.+||=..+ -...+|+||||.-. T Consensus 52 -------TG~yaksW~~k~~~~~~~~v~~~~~~y~l~HLLE~GHa~r-------------------~GGrV~a~phI~pa 105 (123) T protein:vir:96 52 -------TGDYAKNWTSQKLKNGDQVIYQKAPTYRLTHLLENGHAKR-------------------NGGRVSPKVHIAPV 105 (123) T ss_pred -------ccccccceeeeecCCeeEEEEEecCCcceEEeeecceeec-------------------CCceeCcchhhhHH Confidence 011111121111 2223445455555 44448882110 01357999998765 Q ss_pred HH-HHHHHHHHHHHhcCC Q lcl|NC_020839. 153 AG-DQENIEAALMEWLEP 169 (169) Q Consensus 153 ~~-d~~~I~~~i~~~l~p 169 (169) .+ ..+.+.+.|.+.|+- T Consensus 106 ee~~~~~l~~~i~r~l~~ 123 (123) T protein:vir:96 106 EEELVSNYISRVEKRLSQ 123 (123) T ss_pred HHHHHHHHHHHHHHHhcC Confidence 43 356666666666666 No 104 >protein:vir:78607 Length: 155 # NCBI annotation: BcepNY3gp06 # Family: family:all:503 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294843;genbank:gi:149882906;genbank:GeneID:5291078 Probab=93.51 E-value=0.00061 Score=38.19 Aligned_cols=81 Identities=17% Similarity=0.035 Sum_probs=54.6 Q ss_pred CeEEEEch--HHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhh Q lcl|NC_020839. 1 MFTVDVKD--KELEAVFSGLEGRLSDPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPL 78 (169) Q Consensus 1 mi~i~~~~--~~l~~~L~~l~~~~~~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l 78 (169) -+.-++++ ++..+.|.+......+...+|..||+.+...+++.|.+- + +|++++|++.| ..++|| T Consensus 73 Flr~t~~~~~~~~~~~l~~~~~~~~~~~~~L~~~G~~~~~~Ik~~I~~~------~-~pna~~Ti~~K------g~~kPL 139 (155) T protein:vir:78 73 FMEKTITDRSAEWIKGLTVMMTMGYDAEVAMGQIGQAMKDDIKTTISEW------P-ADNSADWAGKK------GFNHGL 139 (155) T ss_pred hhhHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhcC------C-CCCcHHHHHhc------CCCCch Confidence 55555554 445555666555556788999999999999999999852 2 58899998643 245677 Q ss_pred hhhhhhhhhhhhhheecCC Q lcl|NC_020839. 79 IGPTKTLSSPSNFAVSSGG 97 (169) Q Consensus 79 ~~~~~~~~~~~si~~~~~~ 97 (169) .. |+.+.+||++..-. T Consensus 140 id---TG~l~~SIty~V~~ 155 (155) T protein:vir:78 140 IW---TSHLLNSVEQEIVK 155 (155) T ss_pred hH---HHHHHHhhhhhccC Confidence 64 44445566654332 No 105 >protein:vir:81147 Length: 126 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285816;genbank:gi:148747737;genbank:GeneID:5247190 Probab=93.43 E-value=0.0017 Score=35.83 Aligned_cols=114 Identities=15% Similarity=0.195 Sum_probs=68.0 Q ss_pred CeEEEEch--HHHHHHHHHHHHHhh-hHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchh Q lcl|NC_020839. 1 MFTVDVKD--KELEAVFSGLEGRLS-DPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKP 77 (169) Q Consensus 1 mi~i~~~~--~~l~~~L~~l~~~~~-~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~ 77 (169) |-+|++|+ +++.+.|+.+..... ...+...++|..++...+.+ +|. . T Consensus 1 Ma~i~id~la~~I~~~L~~y~~~v~~~v~~~v~~~a~~~~~~ik~~-----aP~---------r---------------- 50 (126) T protein:vir:81 1 MANITIDRLADELLQAVKEYTDDVAEGVRKKVDETARKVLKEAQAL-----APK---------R---------------- 50 (126) T ss_pred CcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-----CCc---------c---------------- Confidence 99999987 567777877776554 46677777787777777664 341 0 Q ss_pred hhhhhhhhhhhhhhhee----cCCcEEEecCcccc--hhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCC Q lcl|NC_020839. 78 LIGPTKTLSSPSNFAVS----SGGDWARLSSRAIQ--SAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGI 151 (169) Q Consensus 78 l~~~~~~~~~~~si~~~----~~~~~v~vGt~~~Y--A~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~ 151 (169) |+.+..||.+. .+...+.+=.+..| +.+.-||-... -...+|++|||.- T Consensus 51 ------TG~y~ksw~vk~~~~~g~~~~vv~~~~~~~l~HLLEfGha~r-------------------~gGrV~a~Phi~P 105 (126) T protein:vir:81 51 ------TGEYARTFTITKEDGYGTTKRIIWNKKHYRRVHLLEFGHAKV-------------------NGGRVKEYPHLRP 105 (126) T ss_pred ------cchhhccccccccccCCcceEEEeccCCCCceeeeecceecC-------------------CCCccCCCcchHH Confidence 01111122111 11122222223334 56677784311 0134899999986 Q ss_pred CH-HHHHHHHHHHHHhcCC Q lcl|NC_020839. 152 SA-GDQENIEAALMEWLEP 169 (169) Q Consensus 152 s~-~d~~~I~~~i~~~l~p 169 (169) .. ...+.+.+.|.+.|+= T Consensus 106 a~e~~~~~~~~~i~~~l~~ 124 (126) T protein:vir:81 106 AYDKHGARLPDELKRVIEN 124 (126) T ss_pred HHHHHHHHHHHHHHHHhhc Confidence 65 4578888899999999 No 106 >protein:vir:106728 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944315;genbank:gi:38638614;genbank:GeneID:2657357 Probab=93.39 E-value=0.00065 Score=38.04 Aligned_cols=81 Identities=17% Similarity=0.033 Sum_probs=54.4 Q ss_pred CeEEEEch--HHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhh Q lcl|NC_020839. 1 MFTVDVKD--KELEAVFSGLEGRLSDPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPL 78 (169) Q Consensus 1 mi~i~~~~--~~l~~~L~~l~~~~~~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l 78 (169) -+.-++++ ++..+.|.+......+...+|..+|+.+...++..|.+- + +|++++|++.| ..++|| T Consensus 73 Flr~t~~~~~~~~~~~l~~~~~~~~~~~~~L~~lG~~~~~~Ik~~I~~~------~-~pna~~Ti~~K------G~~kPL 139 (155) T protein:vir:10 73 FMEKTIADRSAEWIKGLTVMMTMGYDAEVAMGQIGQAMKDDIKTTISEW------P-ADNSADWAGKK------GFNHGL 139 (155) T ss_pred hhHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhcC------C-CCCcHHHHHhc------CCCCch Confidence 55555554 445566666655556788999999999999999999752 2 68899998643 245677 Q ss_pred hhhhhhhhhhhhhheecCC Q lcl|NC_020839. 79 IGPTKTLSSPSNFAVSSGG 97 (169) Q Consensus 79 ~~~~~~~~~~~si~~~~~~ 97 (169) .. |+.+.+||++..-. T Consensus 140 id---TG~l~~SIty~Vv~ 155 (155) T protein:vir:10 140 IW---TSHLLNSVEQEIVK 155 (155) T ss_pred hH---HHHHHHhhhhhccC Confidence 64 44455566654322 No 107 >protein:vir:77650 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:YP_022741;genbank:gi:47835022;genbank:GeneID:2821447 Probab=92.98 E-value=0.00097 Score=37.09 Aligned_cols=81 Identities=15% Similarity=0.002 Sum_probs=52.9 Q ss_pred CeEEEEch--HHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhh Q lcl|NC_020839. 1 MFTVDVKD--KELEAVFSGLEGRLSDPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPL 78 (169) Q Consensus 1 mi~i~~~~--~~l~~~L~~l~~~~~~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l 78 (169) -+.-++++ ++..+.|.+.....-+...+|..||..+...++..|++. +|+| +++|+++| ..++|| T Consensus 73 Flr~t~~~~~~~~~~~l~~~~~~~~~~~~~L~~lG~~~~~~Iq~~I~~~------~~p~-~~~Ti~~K------G~d~PL 139 (155) T protein:vir:77 73 FMEKTIADRSAEWIKGLTVMMTMGYDAEVAMGQIGQAMKDDIKTTISEW------PADN-NADWAGKK------GFNHGL 139 (155) T ss_pred hhhHHHHHHHHHHHHHHHHHHHccCcHHHHHHHHHHHHHHHHHHHHhcC------CCCC-ChHHHHhc------CCCCch Confidence 45555554 445555655555555788999999999999999999864 4765 56787643 245677 Q ss_pred hhhhhhhhhhhhhheecCC Q lcl|NC_020839. 79 IGPTKTLSSPSNFAVSSGG 97 (169) Q Consensus 79 ~~~~~~~~~~~si~~~~~~ 97 (169) .. |+.+.+||++..-. T Consensus 140 id---TG~l~~SIty~Vv~ 155 (155) T protein:vir:77 140 IW---TSHLLNSIEQEIVK 155 (155) T ss_pred hH---HHHHHHhhhhhccC Confidence 64 44445566654322 No 108 >protein:vir:101563 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958111;genbank:gi:41057657;genbank:GeneID:2716820 Probab=92.86 E-value=0.00093 Score=37.21 Aligned_cols=81 Identities=15% Similarity=0.002 Sum_probs=52.7 Q ss_pred CeEEEEch--HHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhh Q lcl|NC_020839. 1 MFTVDVKD--KELEAVFSGLEGRLSDPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPL 78 (169) Q Consensus 1 mi~i~~~~--~~l~~~L~~l~~~~~~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l 78 (169) -+.-++++ ++..+.|.++....-+...+|..+|..+...+++.|.+. +|+| +++|++.| ..++|| T Consensus 73 Flr~t~~~~~~~~~~~l~~~~~~~~~~~~~L~~~G~~~~~~Ik~~I~~~------~~p~-~~~Ti~~K------G~~~PL 139 (155) T protein:vir:10 73 FMEKTIADRSAEWIKGLTVMMTMGYDAEVAMGQIGQAMKDDIKTTISEW------PADN-NADWAGKK------GFNHGL 139 (155) T ss_pred hhHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhcC------CCCC-ChHHHHhc------CCCCch Confidence 45555554 445555665555555788999999999999999999864 4755 56777543 245677 Q ss_pred hhhhhhhhhhhhhheecCC Q lcl|NC_020839. 79 IGPTKTLSSPSNFAVSSGG 97 (169) Q Consensus 79 ~~~~~~~~~~~si~~~~~~ 97 (169) .. |+.+.+||++..-. T Consensus 140 id---TG~l~~Sity~Vv~ 155 (155) T protein:vir:10 140 IW---TSHLLNSIEQEIVK 155 (155) T ss_pred HH---HHHHHHhhhhhccC Confidence 64 44445566654322 No 109 >protein:vir:9708 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795470;genbank:gi:28876221;genbank:GeneID:1257765 Probab=92.57 E-value=0.0028 Score=34.58 Aligned_cols=112 Identities=11% Similarity=-0.003 Sum_probs=60.3 Q ss_pred CeEEEEchHHHHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhh Q lcl|NC_020839. 1 MFTVDVKDKELEAVFSGLEGRLSD-PSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLI 79 (169) Q Consensus 1 mi~i~~~~~~l~~~L~~l~~~~~~-~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~ 79 (169) |++ -.++|.+.|+.|...... .+..+..-|+.+.+..+.+- |-.+... . T Consensus 1 mv~---Gl~el~~~l~~l~~~~~~~~~~al~~ga~~~~~~~k~~a-----p~~~~~~------------------~---- 50 (125) T protein:vir:97 1 MTK---GLDEILANLTKLEVKAPKTAKAAVTEVAKEFEKALKANT-----PVYEVET------------------D---- 50 (125) T ss_pred Cch---hHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHhC-----CcCCCCc------------------h---- Confidence 542 237788888877655432 45677777777777666542 3211000 0 Q ss_pred hhhhhhhhhhhhh----eecCCcEEEecCc---ccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCC Q lcl|NC_020839. 80 GPTKTLSSPSNFA----VSSGGDWARLSSR---AIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGIS 152 (169) Q Consensus 80 ~~~~~~~~~~si~----~~~~~~~v~vGt~---~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s 152 (169) ..++.++.++ ...+...+.||.+ ..|+.+.+||. +++|++|||.=+ T Consensus 51 ---~hl~d~I~~~~~k~~~~g~~~~~VG~~k~~~~y~~f~E~GT------------------------~k~~~~pF~~pa 103 (125) T protein:vir:97 51 ---ERLQEDTVISGFKGANVGIVSKEIGYGKATGWRAHYPNDGT------------------------IYQRGQDFKERT 103 (125) T ss_pred ---hhHHhhhhcccccccccCceEEEEeecCCCceeEeeeccCc------------------------cCCCcCccchHh Confidence 0122211111 1123345677754 46899999995 369999998854 Q ss_pred HH-HHHHHHHHHHHhcC----C Q lcl|NC_020839. 153 AG-DQENIEAALMEWLE----P 169 (169) Q Consensus 153 ~~-d~~~I~~~i~~~l~----p 169 (169) -+ .+.++.+++.+-|+ = T Consensus 104 ~~~~k~~~~~~~~~~~~~~L~l 125 (125) T protein:vir:97 104 INQMTPKAKQLYAEKVKEGLGL 125 (125) T ss_pred HHHhHHHHHHHHHHHHHHHhcC Confidence 32 33444444444444 3 No 110 >protein:vir:94069 Length: 168 # NCBI annotation: putative RNA polymerase # Family: family:all:503 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453622;genbank:gi:84662658;genbank:GeneID:5142579 Probab=91.51 E-value=0.0015 Score=36.00 Aligned_cols=89 Identities=7% Similarity=-0.049 Sum_probs=55.4 Q ss_pred CeEEEEch--HHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhh Q lcl|NC_020839. 1 MFTVDVKD--KELEAVFSGLEGRLSDPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPL 78 (169) Q Consensus 1 mi~i~~~~--~~l~~~L~~l~~~~~~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l 78 (169) -+..++++ +.+.+.|.++.....+...+|..||+.+...++..|.+- + +|++++|+++| ..++|| T Consensus 76 Flr~t~~~~~~~~~~~~~~~~~~~~~~~~~L~~lG~~~~~~Ik~~I~~~-----~--ppna~sTi~~K------G~~~PL 142 (168) T protein:vir:94 76 FMQQTYAAQYRAWSRDLTLTLKAGAAADTALRTVGQRMAEDIQDTIRNW-----P--ADNSPEWAAIK------GFNAGL 142 (168) T ss_pred hhHHHHHHHHHHHHHHHHHHHhcCCCHHHHHHHHHHHHHHHHHHHhhcC-----C--CCccHHHHHhc------CCCCch Confidence 44555554 456666666665555788999999999999999999752 2 68999998743 245677 Q ss_pred hhhhhhhhhhhhhheec--CCcEEEecCccc Q lcl|NC_020839. 79 IGPTKTLSSPSNFAVSS--GGDWARLSSRAI 107 (169) Q Consensus 79 ~~~~~~~~~~~si~~~~--~~~~v~vGt~~~ 107 (169) .. |+.+.+||++.. +...-+ +... T Consensus 143 iD---TG~l~~SIty~Vv~d~~~~~--~~~~ 168 (168) T protein:vir:94 143 RQ---TGVLLNAIDSAVIIDGEHGE--APRE 168 (168) T ss_pred hH---HHHHHhhcceeeeecCCCCC--CCCC Confidence 64 444445666532 211111 1111 No 111 >protein:vir:107757 Length: 189 # NCBI annotation: gp20 # Family: family:all:503 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024868;genbank:gi:48697510;genbank:GeneID:2948378 Probab=90.61 E-value=0.0007 Score=37.89 Aligned_cols=74 Identities=16% Similarity=0.158 Sum_probs=31.5 Q ss_pred hccCccccchhhhhhhhhhhhhhhhheecCCcEEEec----------C-cccchhhhhcccccccchhhhhhhhcccccc Q lcl|NC_020839. 67 ERRKQTVSFKPLIGPTKTLSSPSNFAVSSGGDWARLS----------S-RAIQSAVMQFGAKKGAFGSYQGKGFGGSSST 135 (169) Q Consensus 67 ~~~~~~~~~~~l~~~~~~~~~~~si~~~~~~~~v~vG----------t-~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~ 135 (169) -....+..+..+- .+...+. ..+...|.|| + +..+|++|.||.. T Consensus 1 M~~~i~~~~~~~~------~L~~~lk-~l~~k~V~VGi~~~~~y~dG~~vA~Ia~~~E~G~p------------------ 55 (189) T protein:vir:10 1 MGRVIRKQGPARV------KLNAFIK-GMNDYSVRIGWFSTAKYPDGTPTAYVASIHEFGAP------------------ 55 (189) T ss_pred CcceeccCcHHHH------HHHHHHH-HhhCCeEEEEecCCCCCCCcccHHHHHHHHHhcCc------------------ Confidence 0000000011000 1111111 1123345554 2 2347889999942 Q ss_pred ccCceeeccCcccCCCCHHH-HHHHHHHHHHhcCC Q lcl|NC_020839. 136 ISIPWGDIPARPFMGISAGD-QENIEAALMEWLEP 169 (169) Q Consensus 136 ~~~~~~~iPaRpfLG~s~~d-~~~I~~~i~~~l~p 169 (169) ..+||+||||=-+=++ .+++.+.+...++= T Consensus 56 ----~~~IP~RPFlr~t~~~~~~~~~~~l~~~~~~ 86 (189) T protein:vir:10 56 ----SRGIPARSFIRPTIAAQQAAWSQQMRFYAKQ 86 (189) T ss_pred ----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHH Confidence 3479999999754322 33444443333332 No 112 >protein:vir:5257 Length: 148 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852762;genbank:gi:31544037;uniprot:Q7Y5T8;genbank:GeneID:2753554 Probab=90.25 E-value=0.00099 Score=37.05 Aligned_cols=71 Identities=25% Similarity=0.406 Sum_probs=29.9 Q ss_pred cccchhHHHHHhccCccccchhhhhhhhhhhhhhhhheecCCcEEEecC---------------cccchhhhhccccccc Q lcl|NC_020839. 56 APKSSATIKAYERRKQTVSFKPLIGPTKTLSSPSNFAVSSGGDWARLSS---------------RAIQSAVMQFGAKKGA 120 (169) Q Consensus 56 ~pl~~~t~~~~~~~~~~~~~~~l~~~~~~~~~~~si~~~~~~~~v~vGt---------------~~~YA~iHqfGg~~~~ 120 (169) .. -+.+.....+- .+...+. ..+...|.||- +..+|++|.||. T Consensus 1 M~-----------~~~k~~~~~~~------~l~~~l~-~l~~~~v~VGi~~~~~~~~~~~~g~~vA~ia~~~E~G~---- 58 (148) T protein:vir:52 1 MA-----------VTVTANFSAAK------QLIEQMK-SLKEKAVYVGFPAEFDEKVKGSENFNLASLAAVLEFGN---- 58 (148) T ss_pred Cc-----------cccccccHHHH------HHHHHHH-HhhCCeEEEEeecCcCCCCCCCCCCCHHHHHHHHhcCC---- Confidence 10 00011111111 1111111 11233444443 235789999994 Q ss_pred chhhhhhhhccccccccCceeeccCcccCC--CCHHHHHHHHHHHHHhcCC Q lcl|NC_020839. 121 FGSYQGKGFGGSSSTISIPWGDIPARPFMG--ISAGDQENIEAALMEWLEP 169 (169) Q Consensus 121 ~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG--~s~~d~~~I~~~i~~~l~p 169 (169) .+||+||||= +.+. .+++.+.+..-++= T Consensus 59 --------------------~~IP~Rpflr~t~~~~-~~~~~~~~~~~~~~ 88 (148) T protein:vir:52 59 --------------------EHIPARPFLRQTLEEN-QEKYTALFIQWFDQ 88 (148) T ss_pred --------------------CCCCCcchhHHHHHHH-HHHHHHHHHHHHHc Confidence 3799999994 4332 23333333333333 No 113 >protein:vir:95372 Length: 124 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764480;genbank:gi:115334634;genbank:GeneID:5179259 Probab=89.41 E-value=0.0073 Score=32.28 Aligned_cols=118 Identities=9% Similarity=0.139 Sum_probs=60.8 Q ss_pred CeEEEEch--HHHHHHHHHHHHHhh-hHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchh Q lcl|NC_020839. 1 MFTVDVKD--KELEAVFSGLEGRLS-DPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKP 77 (169) Q Consensus 1 mi~i~~~~--~~l~~~L~~l~~~~~-~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~ 77 (169) |-.|++|+ +++.+.|....+... +....+.+++..+.......+.+ .+|.- T Consensus 1 M~~i~id~La~~I~~~L~~Ys~~v~~~v~~~v~~vak~a~~~lkk~i~~-tspkr------------------------- 54 (124) T protein:vir:95 1 MAKIKIGRLADEITSQLRKYSQVIADDVEQIMDDVTKEAVGRLKSKIQE-VGLVQ------------------------- 54 (124) T ss_pred CccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHh-cCccc------------------------- Confidence 99998886 667777766665554 45667777777777666666653 45521 Q ss_pred hhhhhhhhhhhhhhheecCCcEEEecCcccc--hhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCCHH- Q lcl|NC_020839. 78 LIGPTKTLSSPSNFAVSSGGDWARLSSRAIQ--SAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGISAG- 154 (169) Q Consensus 78 l~~~~~~~~~~~si~~~~~~~~v~vGt~~~Y--A~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s~~- 154 (169) ++....++....+.+.-+|=+..+| +.+.-||=... .....+++|++.-.++ T Consensus 55 ------TG~YaK~W~~kk~~e~~~V~nk~~yqLtHLLE~GHAkr-------------------~GGRV~a~pHI~paee~ 109 (124) T protein:vir:95 55 ------TGDYMRGWTRKRVPNGWVIHNKTEYRLAHLLEYGHATV-------------------DGGRVPGTPHIRPIEDW 109 (124) T ss_pred ------ccchhccceeeeecCceeEEEcCCCceeeeeecceecc-------------------CCcccCCccchhHHHHH Confidence 1111112222222221223333455 55555662210 1136889999874332 Q ss_pred HHHHHHHHHHHhcCC Q lcl|NC_020839. 155 DQENIEAALMEWLEP 169 (169) Q Consensus 155 d~~~I~~~i~~~l~p 169 (169) ..+.+.+.|.+-|+- T Consensus 110 ~~~~l~~~i~~~l~~ 124 (124) T protein:vir:95 110 LEKEFEDRVEKAIKQ 124 (124) T ss_pred HHHHHHHHHHHHhcC Confidence 233444444444444 No 114 >protein:vir:95260 Length: 160 # NCBI annotation: Phage conserved protein # Family: family:all:31735 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944893;genbank:gi:38707833;genbank:GeneID:2744046 Probab=86.72 E-value=0.012 Score=31.09 Aligned_cols=92 Identities=17% Similarity=0.152 Sum_probs=44.8 Q ss_pred CeEEE--Ech-HHHHHHHHHHHHHhhhH-HHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccch Q lcl|NC_020839. 1 MFTVD--VKD-KELEAVFSGLEGRLSDP-SELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFK 76 (169) Q Consensus 1 mi~i~--~~~-~~l~~~L~~l~~~~~~~-~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~ 76 (169) .++++ .+. ..+...+..+...+... ...++-+|+.+...++.-++.=.+| ..|+|.+|+|+++|. .++ T Consensus 65 tfe~~~~~~~~~~~~~~~~~i~~~~~~g~~~~~~~LG~~~~~~ik~~I~~~~~p--~~w~pNap~Ti~~Kg------s~~ 136 (160) T protein:vir:95 65 LFEITMMLNKQTLLEQTKKNLYKQLSSLNTDPSNTLEAFAKNAQKAIKRGFGNS--AILPPNAPSTVKKKG------FNA 136 (160) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHHHHHHHHhhcCCc--cCCCCCcHHHHHhcC------CCC Confidence 33321 111 12222233333333221 1122346777777777666653233 479999999997662 567 Q ss_pred hhhhhhhhhhhhhhhheecCCcEEEecCcccchhhhhcccccc Q lcl|NC_020839. 77 PLIGPTKTLSSPSNFAVSSGGDWARLSSRAIQSAVMQFGAKKG 119 (169) Q Consensus 77 ~l~~~~~~~~~~~si~~~~~~~~v~vGt~~~YA~iHqfGg~~~ 119 (169) ||.. |+.+..||++...+. ||--. T Consensus 137 PLiD---Tg~l~~Si~y~v~~~----------------~~~~~ 160 (160) T protein:vir:95 137 PLVE---TGDLRDNLAYKISTK----------------KGIKK 160 (160) T ss_pred cchh---hHHHhhhhhheeecc----------------cccCC Confidence 8764 444445666544322 21100 No 115 >protein:vir:80116 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425608;genbank:gi:155042941;genbank:GeneID:5469542 Probab=85.82 E-value=0.034 Score=28.63 Aligned_cols=118 Identities=10% Similarity=0.167 Sum_probs=68.2 Q ss_pred CeEEEEch--HHHHHHHHHHHHHhh-hHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchh Q lcl|NC_020839. 1 MFTVDVKD--KELEAVFSGLEGRLS-DPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKP 77 (169) Q Consensus 1 mi~i~~~~--~~l~~~L~~l~~~~~-~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~ 77 (169) |-.|.+|+ +++.+.|....+... +......+++..+...+...+.. .+|-- T Consensus 1 M~~i~id~La~~I~~~L~~y~~~v~~~v~~~v~evak~a~~~lkk~i~~-tsPkr------------------------- 54 (127) T protein:vir:80 1 MANIKIDRLGDEITRQLKRYSQVIAGDLEQIMDDVSKEAVDRLKAKIEE-EGLVQ------------------------- 54 (127) T ss_pred CccccHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh-cCccc------------------------- Confidence 99988886 667777777666554 46677777788888777777765 46611 Q ss_pred hhhhhhhhhhhhhhheecCCcEEEecCcccc--hhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCCHH- Q lcl|NC_020839. 78 LIGPTKTLSSPSNFAVSSGGDWARLSSRAIQ--SAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGISAG- 154 (169) Q Consensus 78 l~~~~~~~~~~~si~~~~~~~~v~vGt~~~Y--A~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s~~- 154 (169) ++....+++...+...-.|=+..+| +.+.-||=... -....+++|++.-.++ T Consensus 55 ------TG~YaK~W~~k~~~~~~~v~nk~~yqLtHLLE~GHAkr-------------------~GGRV~a~pHI~paee~ 109 (127) T protein:vir:80 55 ------TGDYKRGWTRKRTPGGWVIHNKTEYRLAHLLEYGHATV-------------------DGGRVPETPHIRPVEDW 109 (127) T ss_pred ------cccccccceeeeccCceeEeecCCcceeehhhcceecc-------------------CCcccCCccchhhHHHH Confidence 0111112222222221223333456 55566663210 1135889999875443 Q ss_pred HHHHHHHHHHHhcCC Q lcl|NC_020839. 155 DQENIEAALMEWLEP 169 (169) Q Consensus 155 d~~~I~~~i~~~l~p 169 (169) ..+++.+.|.+-|+- T Consensus 110 ~~~~l~~~i~~~l~~ 124 (127) T protein:vir:80 110 LEKEFEDRVERAIKN 124 (127) T ss_pred HHHHHHHHHHHHhcC Confidence 366777777777777 No 116 >protein:vir:99528 Length: 92 # NCBI annotation: putative major tail protein # Family: family:all:180 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958541;genbank:gi:41179323;genbank:GeneID:2717166 Probab=77.49 E-value=0.044 Score=28.03 Aligned_cols=85 Identities=14% Similarity=0.157 Sum_probs=48.6 Q ss_pred CeEEEEchHHHHHHHHHHHHH--hhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhh Q lcl|NC_020839. 1 MFTVDVKDKELEAVFSGLEGR--LSDPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPL 78 (169) Q Consensus 1 mi~i~~~~~~l~~~L~~l~~~--~~~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l 78 (169) |-.+.++.+.+++.++.|... ..+...++...|..|....+.+ .| + T Consensus 1 Ma~~~i~~~Gld~L~~~L~~~~~~~~v~~vv~~~~~~l~~~ak~~-----ap----~----------------------- 48 (92) T protein:vir:99 1 MADYSISWDGLDALDEALANQQNMNTVKKVVKKHTANLMTATQQA-----VP----V----------------------- 48 (92) T ss_pred CCceeeEeehHHHHHHHHHhhccHHHHHHHHHHHHHHHHHHHHHh-----CC----C----------------------- Confidence 876555544444444444322 2345566777777776555543 22 1 Q ss_pred hhhhhhhhhhhhhheec--CCcEEEe---cCcccchhhhhcccccccchhhhhhhhccccccccCceeeccC Q lcl|NC_020839. 79 IGPTKTLSSPSNFAVSS--GGDWARL---SSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPA 145 (169) Q Consensus 79 ~~~~~~~~~~~si~~~~--~~~~v~v---Gt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPa 145 (169) .|+.+.+||.++. ++-.+.| |....||..-.||.+- |+| T Consensus 49 ----dTG~lrrSI~~~~~~~g~~~~v~~~gp~a~Ya~YvE~GTR~------------------------M~A 92 (92) T protein:vir:99 49 ----DTGHLKQSAQIQISRDGFTGSVTYGGGLVNYAAYVEFGTRF------------------------MDS 92 (92) T ss_pred ----CccccceeeeEEeecCCeeEEEEeccCccccccccccceee------------------------cCC Confidence 2455556666554 3334555 5678899999999763 444 No 117 >protein:vir:9879 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:2718 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795641;genbank:gi:28876400;genbank:GeneID:1257931 Probab=73.45 E-value=0.15 Score=25.07 Aligned_cols=122 Identities=8% Similarity=0.095 Sum_probs=58.0 Q ss_pred Ech-HHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhhhhhhh Q lcl|NC_020839. 6 VKD-KELEAVFSGLEGRLSDPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLIGPTKT 84 (169) Q Consensus 6 ~~~-~~l~~~L~~l~~~~~~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~~~~~~ 84 (169) +.+ ++|+..|.+. .+.|-..+-+.=+..|....++ .+ |.||..-. .. -.++. T Consensus 1 i~G~~~L~~~Lk~~--s~~dvk~VVkkN~ael~~r~q~------~~-~~pv~~~~-------------k~-----~dTG~ 53 (127) T protein:vir:98 1 MTGMPALEVKLRSM--SEKRWDRVANKNLTEMFNRAAR------PP-GTPIGKNT-------------KR-----HKSGE 53 (127) T ss_pred CcChHHHHHHHHHh--hHHHHHHHHhhhhHHHHHHHHh------cc-CCceeccc-------------cc-----cCccc Confidence 333 5666666544 2334444444444444433332 22 44553210 00 01223 Q ss_pred hhhhhhhheecCCcEEEecCc---ccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCCHHH-HHHHH Q lcl|NC_020839. 85 LSSPSNFAVSSGGDWARLSSR---AIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGISAGD-QENIE 160 (169) Q Consensus 85 ~~~~~si~~~~~~~~v~vGt~---~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s~~d-~~~I~ 160 (169) ++.|+.+....++-.+.+|.- ..||+.--||.+.-. .++.+ .-+||=|||+-+=+- ..... T Consensus 54 lkRSi~l~~~~~g~~~~vgp~g~t~dYapyvEyGTR~m~-----------~~~~~----gf~~aqp~l~paf~~Qk~iF~ 118 (127) T protein:vir:98 54 LLRSRRLKKVNSSKDVITGNFGYIKDYAPHVEYGHRIVR-----------NGKQV----GYANGTKYLFNNVKKQREIYR 118 (127) T ss_pred ceeeeEEEEecCCceEEeccCcccccccceeecceeeee-----------ccccc----ccccCccccccchHHHhHHHH Confidence 344444445556666778764 889999999976321 11111 137888999743221 22222 Q ss_pred HHHHHhcCC Q lcl|NC_020839. 161 AALMEWLEP 169 (169) Q Consensus 161 ~~i~~~l~p 169 (169) +-+.+-|+- T Consensus 119 ~DL~~l~k~ 127 (127) T protein:vir:98 119 QDMLNELRR 127 (127) T ss_pred HHHHHHhcC Confidence 223333333 No 118 >protein:vir:103280 Length: 142 # NCBI annotation: phage-related hypothetical protein # Family: family:all:448 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277459;genbank:gi:71834102;genbank:GeneID:3562391 Probab=68.34 E-value=0.23 Score=24.10 Aligned_cols=131 Identities=10% Similarity=0.079 Sum_probs=65.3 Q ss_pred CeEEEEchHHHHHHHHHHHHHhh-hHHHHHHHHHHHHHHHHHHHHHhcCCC--CC---CCCcccc--hhHHHHHhccCcc Q lcl|NC_020839. 1 MFTVDVKDKELEAVFSGLEGRLS-DPSELMAQIGELLLDSTLARFQAGKAP--DG---SPWAPKS--SATIKAYERRKQT 72 (169) Q Consensus 1 mi~i~~~~~~l~~~L~~l~~~~~-~~~~l~~~Ig~~l~~~~~~rF~~q~~P--dG---~~W~pl~--~~t~~~~~~~~~~ 72 (169) |-. +...+...+...+...+ +...++++++..+.+.+.. .+| .| ..|...- +.+..... . T Consensus 1 Ma~---~~~sf~~~i~~~~~~ve~~~~~v~r~~a~~i~~~vv~-----~sPVdTGr~R~nw~vs~~~~~~~~~~~----~ 68 (142) T protein:vir:10 1 MAN---DVVSFRNSINAWIDGVTEGVELIVEGTLTKATKDIVK-----LSPVDTGRFRGNWQATGNSPAAQSLNN----Y 68 (142) T ss_pred Ccc---chhhhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----hCcccchhhcccceeeecCcccccccC----c Confidence 653 22334444444444443 3556678888888877765 344 23 3465431 11110000 0 Q ss_pred ccchhhhhhhhhhhhh-hhhheecCCcEEEecCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCC Q lcl|NC_020839. 73 VSFKPLIGPTKTLSSP-SNFAVSSGGDWARLSSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGI 151 (169) Q Consensus 73 ~~~~~l~~~~~~~~~~-~si~~~~~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~ 151 (169) ..+..... ..+... ..|......+.+.|+.|++||.-.+||-..+ ...-|.++ T Consensus 69 d~~G~~t~--~~~~~~~~~i~~~~~g~~iyi~Nn~pYA~~LEyG~S~Q------------------------AP~G~v~~ 122 (142) T protein:vir:10 69 DPDGNETR--NSLRRQIYALARDANTNVIYISNRLDYAQGLEFGSSNQ------------------------APSGVLGV 122 (142) T ss_pred CCCCccch--hhHHHHHHHhhhccccceEEEeeCcchhhhhhccccCC------------------------CcchHHHH Confidence 00000000 001100 1222233567899999999999999996543 23455556 Q ss_pred CHHHHHHHHHHHHHhcCC Q lcl|NC_020839. 152 SAGDQENIEAALMEWLEP 169 (169) Q Consensus 152 s~~d~~~I~~~i~~~l~p 169 (169) +.+....|.+-....++= T Consensus 123 a~q~~~~~v~~a~~e~~~ 140 (142) T protein:vir:10 123 VQKRLGRYFAEAVQEAKR 140 (142) T ss_pred HHHHHHHHHHHHHHHhhc Confidence 655555555555555544 No 119 >protein:vir:107703 Length: 147 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003902;genbank:gi:45686318;genbank:GeneID:2773043 Probab=68.21 E-value=0.24 Score=23.98 Aligned_cols=134 Identities=10% Similarity=0.117 Sum_probs=65.7 Q ss_pred CeEEEEchHHHHHHHHHHHHHhh-hHHHHHHHHHHHHHHHHHHHHHhcCCC-C-C---CCCcccc--hhHHHHHhccCcc Q lcl|NC_020839. 1 MFTVDVKDKELEAVFSGLEGRLS-DPSELMAQIGELLLDSTLARFQAGKAP-D-G---SPWAPKS--SATIKAYERRKQT 72 (169) Q Consensus 1 mi~i~~~~~~l~~~L~~l~~~~~-~~~~l~~~Ig~~l~~~~~~rF~~q~~P-d-G---~~W~pl~--~~t~~~~~~~~~~ 72 (169) |-+-.+- .+...+...+...+ +...++++++..+.+.... .+| | | ..|...- +.+..... . T Consensus 1 ma~~~~~--~F~~~i~~~~~~ve~~~~~~~r~~a~~i~~~vv~-----~sPVdTGr~Ranw~vs~~~~~~~~~~~----~ 69 (147) T protein:vir:10 1 MANYQIR--RFQGEIDAWINAAESTLEHAIEIFVRDVHDALVS-----RSPVDTGRFKGNWQITFNEIPNHALNR----Y 69 (147) T ss_pred CCCcchh--hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----hCCCcchhhccccceeecCccccccCC----c Confidence 6654432 33334444444333 3456778888888777765 344 2 2 3465431 11100000 0 Q ss_pred ccchhhhhhhhhhhhhhhhheecCCcEEEecCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCC Q lcl|NC_020839. 73 VSFKPLIGPTKTLSSPSNFAVSSGGDWARLSSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGIS 152 (169) Q Consensus 73 ~~~~~l~~~~~~~~~~~si~~~~~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s 152 (169) ..+.......+......-+.-....+.+.|+.|++||.-.+||-.. -+..-|.+++ T Consensus 70 dp~g~~t~a~~~~~~~~~~~~~~~~~~iyi~Nn~pYA~~LEyG~S~------------------------QAP~G~V~~t 125 (147) T protein:vir:10 70 DKTGGVVRGEEQAKTYGMFSRGGAITSVHFSNMLIYANALEYGHSQ------------------------QAPSGVVGLV 125 (147) T ss_pred CCCccchhhhhhHHHHHHhhhccCcceEEEeeCcchhhhhhccccC------------------------CCCchHHHHH Confidence 0011111111111111112222345679999999999999999653 3344556666 Q ss_pred HHHHHHHHHHHHHhcCC Q lcl|NC_020839. 153 AGDQENIEAALMEWLEP 169 (169) Q Consensus 153 ~~d~~~I~~~i~~~l~p 169 (169) .+..+.|.+-....++- T Consensus 126 ~q~~~~~v~~~~~e~k~ 142 (147) T protein:vir:10 126 ALRLRSYMADAIKQARR 142 (147) T ss_pred HHHHHHHHHHHHHHHHh Confidence 66666665555544544 No 120 >protein:vir:78380 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110844;genbank:gi:134288605;genbank:GeneID:5179643 Probab=65.47 E-value=0.25 Score=23.87 Aligned_cols=124 Identities=9% Similarity=0.160 Sum_probs=64.7 Q ss_pred eEEEEchHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCC-C-C---CCCcccc--hhHHHHHhccCcccc Q lcl|NC_020839. 2 FTVDVKDKELEAVFSGLEGRLSDPSELMAQIGELLLDSTLARFQAGKAP-D-G---SPWAPKS--SATIKAYERRKQTVS 74 (169) Q Consensus 2 i~i~~~~~~l~~~L~~l~~~~~~~~~l~~~Ig~~l~~~~~~rF~~q~~P-d-G---~~W~pl~--~~t~~~~~~~~~~~~ 74 (169) .++.. ++.++.++.... ...++++++..+.+.... .+| | | ..|...- +.+..... .... T Consensus 1 msf~~---~i~~~~~~ve~~---~~~~~r~~a~~~~~~iv~-----~sPVdTGr~Ranw~vs~~~~~~~~~~~---~d~~ 66 (131) T protein:vir:78 1 MSFAL---DVSKFVEKAKKN---PEKVIRQVSIKLFSAIIK-----ASPVDTGRFRMNWMASGGTPADGTTDA---TDKA 66 (131) T ss_pred CCcCc---CHHHHHHHHHHH---HHHHHHHHHHHHHHHHHH-----hCCCchhhhccccceecccccccccCC---CCCC Confidence 33322 344555555433 345556666666666554 233 2 1 2354331 11100000 0000 Q ss_pred chhhhhhhhhhhhhhhhheecCCcEEEecCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCCHH Q lcl|NC_020839. 75 FKPLIGPTKTLSSPSNFAVSSGGDWARLSSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGISAG 154 (169) Q Consensus 75 ~~~l~~~~~~~~~~~si~~~~~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s~~ 154 (169) +...+ ..+. .-|....-.+.+-|++|++||.-.++|-. +-+..-|.+++.+ T Consensus 67 g~~t~---~~~~--~~i~~~~~g~~iyi~Nn~pYA~~LEyG~S------------------------~QAP~G~v~~~~~ 117 (131) T protein:vir:78 67 GTTAT---SNAA--NFVLNAADWHTFTLTNNLPYAQRLEYGWS------------------------QQAPQGFVRVNVS 117 (131) T ss_pred chhhH---HHHH--HHHhhccCCceEEEeeCchhhhHhhcccc------------------------CCCcchHHHHHHH Confidence 00000 0111 11222233578999999999999999954 3445577778888 Q ss_pred HHHHHHHHHHHhcC Q lcl|NC_020839. 155 DQENIEAALMEWLE 168 (169) Q Consensus 155 d~~~I~~~i~~~l~ 168 (169) ....|.+-....+| T Consensus 118 ~~~~~v~~~~~e~k 131 (131) T protein:vir:78 118 RFQQLLNEEASKVK 131 (131) T ss_pred HHHHHHHHHHHhcC Confidence 88888888888888 No 121 >protein:vir:4956 Length: 153 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049932;genbank:gi:9632903;genbank:GeneID:1262079 Probab=61.72 E-value=0.18 Score=24.72 Aligned_cols=120 Identities=10% Similarity=0.084 Sum_probs=50.1 Q ss_pred CeEEEEchHHHHHHHHHHHHHh-hhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhh Q lcl|NC_020839. 1 MFTVDVKDKELEAVFSGLEGRL-SDPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLI 79 (169) Q Consensus 1 mi~i~~~~~~l~~~L~~l~~~~-~~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~ 79 (169) |-+++.-.+++...|+.|.... .+...+...=|.. +++.+... .|.. ...+. ++.. T Consensus 1 M~~~~~glee~~~~lekL~~~~~~~~~katkAGA~v----~~e~L~~~-tp~~-h~~~~------------kt~~----- 57 (153) T protein:vir:49 1 MTGLDEALEGWLKTVASIGDLTPAEQAKITTAGAKV----FKEELAEV-TREK-HYSKK------------KDLK----- 57 (153) T ss_pred CccHHHHHHHHHHHHHHhccCCHHHHHHHHHHHHHH----HHHHHHHh-cccc-CCCCC------------CCCC----- Confidence 8886522244444444443222 1233344433333 34444433 2321 11110 0101 Q ss_pred hhhhhhhhhhhhhee----cCCcEEEecCc----ccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCC Q lcl|NC_020839. 80 GPTKTLSSPSNFAVS----SGGDWARLSSR----AIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGI 151 (169) Q Consensus 80 ~~~~~~~~~~si~~~----~~~~~v~vGt~----~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~ 151 (169) ...++..+.++.. ..+..+.||-. .-||.+-++|+. .||+.||+=- T Consensus 58 --~~HlaD~I~~s~~~idG~~dG~s~VG~~~~~~a~~a~f~n~GT~------------------------km~~~hFie~ 111 (153) T protein:vir:49 58 --YGHMADGLAVQSTNADGRKNGVSTVGWKNNYHAQNARRLNDGTK------------------------KYRADHFITN 111 (153) T ss_pred --CCcccccceeccccccccccceeeecccCCccceeeeecccCcc------------------------cCCCChhhHH Confidence 1244443333210 01224566643 346788889963 6999999743 Q ss_pred CHHH---HHHHHHHHHHhcCC Q lcl|NC_020839. 152 SAGD---QENIEAALMEWLEP 169 (169) Q Consensus 152 s~~d---~~~I~~~i~~~l~p 169 (169) ..++ +.+|.....+-++- T Consensus 112 tr~e~~~k~~vl~A~~~~~~~ 132 (153) T protein:vir:49 112 VQNDSTVKNKVLLAEKEEYEK 132 (153) T ss_pred HHHHhhHHHHHHHHHHHHHHH Confidence 3222 23344322222222 No 122 >protein:vir:94994 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224022;genbank:gi:62327309;genbank:GeneID:5176822 Probab=60.32 E-value=0.37 Score=22.92 Aligned_cols=124 Identities=11% Similarity=0.157 Sum_probs=65.6 Q ss_pred eEEEEchHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCC-C-C---CCCcccc--hhHHHHHhccCcccc Q lcl|NC_020839. 2 FTVDVKDKELEAVFSGLEGRLSDPSELMAQIGELLLDSTLARFQAGKAP-D-G---SPWAPKS--SATIKAYERRKQTVS 74 (169) Q Consensus 2 i~i~~~~~~l~~~L~~l~~~~~~~~~l~~~Ig~~l~~~~~~rF~~q~~P-d-G---~~W~pl~--~~t~~~~~~~~~~~~ 74 (169) .++.. ++.++.++.... ...+.++++..+.+.... .+| | | ..|...- +.+..... ... T Consensus 1 msF~~---~i~~~~~~ve~~---~~~~~r~~a~~~~~~iv~-----~sPVdTGr~Ranw~vs~~~~~~~~~~~----~d~ 65 (131) T protein:vir:94 1 MSFAL---DVTRFVEKAKKN---PEKVIRQVSIKLFSAIIK-----ASPVDTGRFRMNWMASGSTPADGTTDA----TDK 65 (131) T ss_pred CCccc---CHHHHHHHHHHH---HHHHHHHHHHHHHHHHHH-----hCCCchhhhhccchhccccccccccCC----CCC Confidence 33322 345555555433 345567777776666654 233 2 2 2354331 11000000 000 Q ss_pred chhhhhhhhhhhhhhhhheecCCcEEEecCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCCHH Q lcl|NC_020839. 75 FKPLIGPTKTLSSPSNFAVSSGGDWARLSSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGISAG 154 (169) Q Consensus 75 ~~~l~~~~~~~~~~~si~~~~~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s~~ 154 (169) + .-.....+. .-|.-..-.+.+-|+.|++||.-.++|-. +-+..-|.+++.+ T Consensus 66 ~--g~~t~~~~~--~~i~~~~~g~~iyi~Nn~pYA~~LEyG~S------------------------~QAP~g~v~~~~~ 117 (131) T protein:vir:94 66 S--GNTATGNAT--SFVLNAADWHTFTLTNNLPYAQRLEYGWS------------------------QQAPQGFVRVNVS 117 (131) T ss_pred C--chhhHHHHH--HHHhhccccceEEEeeCchhhhhhhcccc------------------------CCCcchHHHHHHH Confidence 0 000001111 12222234578999999999999999954 3445577778888 Q ss_pred HHHHHHHHHHHhcC Q lcl|NC_020839. 155 DQENIEAALMEWLE 168 (169) Q Consensus 155 d~~~I~~~i~~~l~ 168 (169) ....|.+-....+| T Consensus 118 ~~~~~v~~~~~e~k 131 (131) T protein:vir:94 118 RFQQLLNEEASKVK 131 (131) T ss_pred HHHHHHHHHHHhcC Confidence 88888888888888 No 123 >protein:vir:102338 Length: 116 # NCBI annotation: hypothetical protein # Family: family:all:26573 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529563;genbank:gi:90592648;genbank:GeneID:3974470 Probab=58.19 E-value=0.082 Score=26.53 Aligned_cols=110 Identities=11% Similarity=0.124 Sum_probs=54.1 Q ss_pred HHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhhhhhhhhhhhhhhhe-ecC Q lcl|NC_020839. 18 LEGRLSDPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLIGPTKTLSSPSNFAV-SSG 96 (169) Q Consensus 18 l~~~~~~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~~~~~~~~~~~si~~-~~~ 96 (169) |. ......+.+||..+.+.+.+|- |=|.. .+++++. |+.. ... T Consensus 1 l~---~~~~~~~~~~a~~l~~~vk~rT-----Pv~~~--------------------------d~G~LR~--sW~~g~v~ 44 (116) T protein:vir:10 1 MS---KNLRRAKNNIGNKLLRKVKPKT-----PVAKI--------------------------DGGTARK--SWKYKELN 44 (116) T ss_pred Cc---hHHHHHHHHHHHHHHHHHHhhC-----CCCcC--------------------------CCccccc--Cceeeeee Confidence 11 1245667888888888877653 32211 1122332 2222 112 Q ss_pred CcEEEecCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCCHHH-----HHHHHHHHHHhcC Q lcl|NC_020839. 97 GDWARLSSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGISAGD-----QENIEAALMEWLE 168 (169) Q Consensus 97 ~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s~~d-----~~~I~~~i~~~l~ 168 (169) ....+|.++..||..-.||=++.... .......++..... -.|-+=+|=.|.+. ...+.+.|.++|. T Consensus 45 k~~~~v~N~~eYA~~VE~GHRq~~g~---g~~~~~~gkrlk~~--~V~G~fml~~s~~e~~~~~~~~~~~~~~~~l~ 116 (116) T protein:vir:10 45 LFDGVVSNNVEYIHHLEYGHRTRQGT---GTSENYRPKPNGIS--FVPGVFMLARSVDEMSSIIDDELNQIIIDFWN 116 (116) T ss_pred ccCceeecCCcccccccCCceeeCCc---ceecccccccccCC--ccCceehHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 22345889999999999996653211 00000001111111 13444455555433 4556667777777 No 124 >protein:vir:4859 Length: 140 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049399;genbank:gi:9632427;genbank:GeneID:1258496 Probab=54.74 E-value=0.16 Score=24.96 Aligned_cols=117 Identities=12% Similarity=0.070 Sum_probs=51.1 Q ss_pred CeEEEEchHHHHHHHHHHHHHhh----hHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccch Q lcl|NC_020839. 1 MFTVDVKDKELEAVFSGLEGRLS----DPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFK 76 (169) Q Consensus 1 mi~i~~~~~~l~~~L~~l~~~~~----~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~ 76 (169) |-+++ + .+++++..|..... +...+...=|. .+.+.+... .|.+. .... .+. T Consensus 1 M~~~~-d--~l~e~~~~lekl~~~~~~~~~katkAGA~----v~~~~L~~~-tp~~h---~~~~----------~t~--- 56 (140) T protein:vir:48 1 MTGLD-E--ALEGWLKTVASIGDLTPAEQAKITTAGAK----VFKEELAEV-TRQKH---YSNK----------KHL--- 56 (140) T ss_pred CccHH-H--HHHHHHHHHHHhccCCHHHHHHHHHHHHH----HHHHHHHHh-ccccC---CCCC----------CCC--- Confidence 88865 2 34444444443332 23334433233 333444433 23211 0000 000 Q ss_pred hhhhhhhhhhhhhhhheec----CCcEEEecCc----ccchhhhhcccccccchhhhhhhhccccccccCceeeccCccc Q lcl|NC_020839. 77 PLIGPTKTLSSPSNFAVSS----GGDWARLSSR----AIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPF 148 (169) Q Consensus 77 ~l~~~~~~~~~~~si~~~~----~~~~v~vGt~----~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpf 148 (169) ....++.++.++... .+..+.||-. .-+|.+.|+|++ .||+-+| T Consensus 57 ----~~~HlaD~I~~~~~~iDg~~~g~s~VG~~kk~~a~~A~f~n~GT~------------------------k~~~~hF 108 (140) T protein:vir:48 57 ----KYGHMADGLSVQSTNVDGRKNGVSTVGWVNRYHAQNARRLNDGTK------------------------KYRADHF 108 (140) T ss_pred ----CCCcchhceeecccccccccCceeeeccCCCcceeeeeccccCcc------------------------ccCCCch Confidence 012455544443110 1224566632 356788899963 5999999 Q ss_pred CCCCHHH---HHHHHHHHHHhcCC Q lcl|NC_020839. 149 MGISAGD---QENIEAALMEWLEP 169 (169) Q Consensus 149 LG~s~~d---~~~I~~~i~~~l~p 169 (169) +=-+.++ +.+|.....+-++- T Consensus 109 ve~~~~e~~~k~~vl~A~~~~~~~ 132 (140) T protein:vir:48 109 VTNVQNDSAVQTKVLLAEKEEYEK 132 (140) T ss_pred hHHHHHhhhhHHHHHHHHHHHHHH Confidence 7644432 23344433333332 No 125 >protein:vir:100887 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358767;genbank:gi:77999993;genbank:GeneID:3726158 Probab=52.03 E-value=0.26 Score=23.84 Aligned_cols=115 Identities=10% Similarity=0.096 Sum_probs=55.5 Q ss_pred EEEchHHHHHHHHHHHHHhh----hHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhh Q lcl|NC_020839. 4 VDVKDKELEAVFSGLEGRLS----DPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLI 79 (169) Q Consensus 4 i~~~~~~l~~~L~~l~~~~~----~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~ 79 (169) |+++ +.++++|..|..... +...+...-|..+....+++-.... .. .++.+... T Consensus 1 v~~~-~~lee~l~~i~kl~~~~~~~~~ki~kaGA~v~~e~L~~~tp~~~------~~-----------~~~~~~~~---- 58 (139) T protein:vir:10 1 MDMD-EALGQWLKQVSKAAELSISDQEKITKAGADVYAKKLAETTKEKH------PN-----------TKGDGGKY---- 58 (139) T ss_pred CCHH-HHHHHHHHHHHHhhccCHHHHHHHHHHHHHHHHHHHHHhccccc------Cc-----------CCCCCCCC---- Confidence 3333 456666666654432 2344555555655555543322110 00 01111111 Q ss_pred hhhhhhhhhhhhheec------CCcEEEecCcc--cchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCC Q lcl|NC_020839. 80 GPTKTLSSPSNFAVSS------GGDWARLSSRA--IQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGI 151 (169) Q Consensus 80 ~~~~~~~~~~si~~~~------~~~~v~vGt~~--~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~ 151 (169) .+++. +|.++. .+..+.||-+. -+|++-+||+ +.+|+.+|+== T Consensus 59 ---~HlaD--~I~~s~~~~dg~~~g~~~VG~~k~~~~A~f~n~GT------------------------~k~~~~hFie~ 109 (139) T protein:vir:10 59 ---GHLSE--DIRSAAGDIDGDHNGSSTVGFHNKAHIARFLNDGT------------------------KYIRADHFVDN 109 (139) T ss_pred ---cchhh--cceecCcccccccceeeeeCCCCCcceEeecccCc------------------------cccCCCchHHH Confidence 13443 333322 12234566544 3677788885 36999999654 Q ss_pred CH-HHHHHHHHHHHHhcCC Q lcl|NC_020839. 152 SA-GDQENIEAALMEWLEP 169 (169) Q Consensus 152 s~-~d~~~I~~~i~~~l~p 169 (169) +. +-+.+|..++.+-|+= T Consensus 110 t~~e~~~evl~a~~~~~k~ 128 (139) T protein:vir:10 110 ARDDAKDAVFAAEAEKYQA 128 (139) T ss_pred HHHHHHHHHHHHHHHHHHH Confidence 33 2355666666655544 No 126 >protein:vir:104347 Length: 145 # NCBI annotation: conserved phage-related protein # Family: family:all:448 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398975;genbank:gi:81343959;genbank:GeneID:3778879 Probab=51.67 E-value=0.57 Score=21.90 Aligned_cols=129 Identities=13% Similarity=0.103 Sum_probs=60.4 Q ss_pred CeEEEEchHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCC--CC---CCCccc--chhHHHHHhccCccc Q lcl|NC_020839. 1 MFTVDVKDKELEAVFSGLEGRLSDPSELMAQIGELLLDSTLARFQAGKAP--DG---SPWAPK--SSATIKAYERRKQTV 73 (169) Q Consensus 1 mi~i~~~~~~l~~~L~~l~~~~~~~~~l~~~Ig~~l~~~~~~rF~~q~~P--dG---~~W~pl--~~~t~~~~~~~~~~~ 73 (169) +.++..+ +.++.+++.. +...++++++..+.+.... .+| .| ..|... ++.+.....-.+.+ T Consensus 8 ~~sF~~~---i~~~~~~ve~---~~~~v~r~~a~~i~~~vv~-----~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~G- 75 (145) T protein:vir:10 8 VVTFEKS---IADWIDRAED---GFGIVVSNTVIKTANAIVD-----LSPVDTGRFKANWQISANSPAQQSLNEYDQTG- 75 (145) T ss_pred hhccccC---HHHHHHHHHH---HHHHHHHHHHHHHHHHHHH-----hCCccchhhccccceeecccccccccccCCCC- Confidence 3344333 3344444433 2445677777777777655 344 12 346553 12211111100000 Q ss_pred cchhhhhhhhhhhhhhhhheecCCcEEEecCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCCH Q lcl|NC_020839. 74 SFKPLIGPTKTLSSPSNFAVSSGGDWARLSSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGISA 153 (169) Q Consensus 74 ~~~~l~~~~~~~~~~~si~~~~~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s~ 153 (169) ........... .-|....-.+.+-|++|++||.-.+||-..+. ..-|.+++. T Consensus 76 --~~t~~~~~~~~--~~i~~~k~g~~iyi~Nn~pYA~~LEyG~S~QA------------------------P~G~v~~~~ 127 (145) T protein:vir:10 76 --GQTKTYLARQA--RAVANSKATSVIYITNRLDYAADLEYGASNQA------------------------PAGVLGVVQ 127 (145) T ss_pred --ccchhhHHHHH--HHhhcccccceEEEeeCchhhhHhhccccCCC------------------------cchHHHHHH Confidence 00000001111 12332234578999999999999999964432 334455555 Q ss_pred HHHHHHHHHHHHhcCC Q lcl|NC_020839. 154 GDQENIEAALMEWLEP 169 (169) Q Consensus 154 ~d~~~I~~~i~~~l~p 169 (169) +....|.+-+..-++- T Consensus 128 ~~~~~~v~~~~~e~k~ 143 (145) T protein:vir:10 128 ARLGRYFQEAVEEARR 143 (145) T ss_pred HHHHHHHHHHHHHhhc Confidence 5444444444444444 No 127 >protein:vir:5000 Length: 141 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049974;genbank:gi:9632946;genbank:GeneID:1262109 Probab=47.68 E-value=0.3 Score=23.44 Aligned_cols=119 Identities=8% Similarity=0.069 Sum_probs=53.8 Q ss_pred CeEEEEc-hHHHHHHHHHHHHHh-hhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhh Q lcl|NC_020839. 1 MFTVDVK-DKELEAVFSGLEGRL-SDPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPL 78 (169) Q Consensus 1 mi~i~~~-~~~l~~~L~~l~~~~-~~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l 78 (169) |-+++ + .+++...|++|.... .+...+...=|.. +++.+... .|... +. . +.+. T Consensus 1 M~~~~-~gl~e~~~~lekl~~~~~~~~~katkAGA~v----~~~~L~~~-tp~~h-y~--~----------~~~~----- 56 (141) T protein:vir:50 1 MVGLA-EALDEWLKTVASIGNLTPAEQVEITTAGAKV----FKKELEEV-TREKH-YS--R----------KKNP----- 56 (141) T ss_pred CccHH-HHHHHHHHHHHHhcCCCHHHHHHHHHHHHHH----HHHHHHHh-cccCC-CC--C----------CCCC----- Confidence 88876 3 244444444443111 1233444433333 33444432 22111 10 0 0000 Q ss_pred hhhhhhhhhhhhhhee----cCCcEEEecCcc----cchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCC Q lcl|NC_020839. 79 IGPTKTLSSPSNFAVS----SGGDWARLSSRA----IQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMG 150 (169) Q Consensus 79 ~~~~~~~~~~~si~~~----~~~~~v~vGt~~----~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG 150 (169) ....++.++.++.. ..+..+.||-.. -+|.+-|+|+. .||+-+|+= T Consensus 57 --~~~HlaD~I~~~~~~~DG~~dg~s~VG~~~~~~~~~A~f~n~GT~------------------------k~~~~hFve 110 (141) T protein:vir:50 57 --KFGHMADGLAIQSTNADGRKNGVSTVGWKNNYHAQNARRLNDGTK------------------------KYRADHFVT 110 (141) T ss_pred --CCCccccceeeccCccccccCCeeeeccCCCccceeeeccccCcc------------------------ccCCCchhH Confidence 01244544433321 112345677433 35778888964 589999976 Q ss_pred CCHHH---HHHHHHHHHHhcCC Q lcl|NC_020839. 151 ISAGD---QENIEAALMEWLEP 169 (169) Q Consensus 151 ~s~~d---~~~I~~~i~~~l~p 169 (169) -+.++ +.+|+....+-|+- T Consensus 111 ~~~~~a~~k~~Vl~A~~~~~k~ 132 (141) T protein:vir:50 111 NVQNDSTVQKKVLLEKKRNTKN 132 (141) T ss_pred HHHHhhhhHHHHHHHHHHHHHH Confidence 55433 34566555555554 No 128 >protein:vir:79638 Length: 146 # NCBI annotation: gp40 # Family: family:all:448 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285529;genbank:gi:148734512;genbank:GeneID:5219996 Probab=42.36 E-value=0.89 Score=20.87 Aligned_cols=133 Identities=12% Similarity=0.105 Sum_probs=62.9 Q ss_pred CeEEEEchHHHHHHHHHHHHHhh-hHHHHHHHHHHHHHHHHHHHHHhcCCC--CC---CCCcccc--hhHHHHHhccCcc Q lcl|NC_020839. 1 MFTVDVKDKELEAVFSGLEGRLS-DPSELMAQIGELLLDSTLARFQAGKAP--DG---SPWAPKS--SATIKAYERRKQT 72 (169) Q Consensus 1 mi~i~~~~~~l~~~L~~l~~~~~-~~~~l~~~Ig~~l~~~~~~rF~~q~~P--dG---~~W~pl~--~~t~~~~~~~~~~ 72 (169) |-+-.+- .+...+...+...+ +...++++++..+.+.+.. .+| .| ..|...- +.+.......+ T Consensus 1 ma~~~~~--sFa~~i~~~~~~ve~~~~~~~r~~a~~i~~~vv~-----~sPVDTGr~Ranw~vs~~~~~~~~~~~~dp-- 71 (146) T protein:vir:79 1 MADYSIR--EFHGNVDKWIEQVESGLNDVIQIFGEKVHGALVD-----IAPVDTGRFKANMQITANKPPLYALNQYDP-- 71 (146) T ss_pred CCcchhH--HHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----hCCCcchhhccccceeecCcccccccCCCC-- Confidence 6554322 33344444444333 3556778888888877765 344 23 3465531 11110000000 Q ss_pred ccchhhhhhhhhhhhhhhhhe-ecCCcEEEecCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCC Q lcl|NC_020839. 73 VSFKPLIGPTKTLSSPSNFAV-SSGGDWARLSSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGI 151 (169) Q Consensus 73 ~~~~~l~~~~~~~~~~~si~~-~~~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~ 151 (169) .+...+. ........... ..-.+.+-|++|++||.-.+||-..+. ..-|.++ T Consensus 72 -~G~~t~~--~~~~~i~~~~~g~~~~~~iyi~NnlpYA~~LEyG~S~QA------------------------P~G~v~~ 124 (146) T protein:vir:79 72 -DGEKIKA--EGRRTLYALLHGGGAIKSIYFSNMLIYANALEYGHSKQA------------------------PAGVFGI 124 (146) T ss_pred -CCcccHH--HHHHHHHHHHhcccccceeEEeeCchhhhhhhccccCCC------------------------cchHHHH Confidence 0000000 00000011111 123568999999999999999964432 3344555 Q ss_pred CHHHHHHHHHHHHHhcCC Q lcl|NC_020839. 152 SAGDQENIEAALMEWLEP 169 (169) Q Consensus 152 s~~d~~~I~~~i~~~l~p 169 (169) +.+....|.+-....+|- T Consensus 125 ~~~~~~~~v~~a~~e~k~ 142 (146) T protein:vir:79 125 VAIRLRSYMAEAIREARK 142 (146) T ss_pred HHHHHHHHHHHHHHHHHh Confidence 555555554444444444 No 129 >protein:vir:95157 Length: 144 # NCBI annotation: hypothetical protein ORF019 # Family: family:all:448 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293426;genbank:gi:148912847;genbank:GeneID:5228232 Probab=36.68 E-value=1.1 Score=20.33 Aligned_cols=131 Identities=10% Similarity=0.028 Sum_probs=53.0 Q ss_pred CeEEEEchHHHHHHHHHHHHHhh-hHHHHHHHHHHHHHHHHHHHHHhcCCC--CC---CCCcccch--hHHHHHh--ccC Q lcl|NC_020839. 1 MFTVDVKDKELEAVFSGLEGRLS-DPSELMAQIGELLLDSTLARFQAGKAP--DG---SPWAPKSS--ATIKAYE--RRK 70 (169) Q Consensus 1 mi~i~~~~~~l~~~L~~l~~~~~-~~~~l~~~Ig~~l~~~~~~rF~~q~~P--dG---~~W~pl~~--~t~~~~~--~~~ 70 (169) |-+ +.-.+...++......+ +...+.++++..+.+....+ +| .| ..|...-. .+.-... ..+ T Consensus 1 MA~---~~~~f~~~i~~~~~~ve~~~~~~~r~~a~~v~~~vv~~-----sPVDTGrfRanw~vs~~~p~~~~~~~~~~~~ 72 (144) T protein:vir:95 1 MAK---SLLDLADRLEKKAKAIDEAASQNAVDTALAIVGDLAYK-----TPVDTSQALSNWIVTLESPSGQQIKPHFPGS 72 (144) T ss_pred Cch---hhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CCccchhhccccceecccccccccccccccc Confidence 554 11123333334443333 35567777777777766653 44 12 34654311 1000000 000 Q ss_pred cc-ccchhhhhhhhhhh-hhhhhheecCCcEEEecCcccchhhhhcccccccchhhhhhhhccccccc-cCceee Q lcl|NC_020839. 71 QT-VSFKPLIGPTKTLS-SPSNFAVSSGGDWARLSSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTI-SIPWGD 142 (169) Q Consensus 71 ~~-~~~~~l~~~~~~~~-~~~si~~~~~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~-~~~~~~ 142 (169) .+ ..+... ..++. ...-|..-.-++.+-|.+|++||.-.++|-..+...-.-...+....+.+ ..+-++ T Consensus 73 ~~~t~d~sg---~~tl~~~~~vi~~~~~g~~iyi~NnlpYA~~LEyG~S~QAP~G~vr~~~q~~~~~v~~~~~~~ 144 (144) T protein:vir:95 73 QGSTQRASA---AETLNSAKLVLRNKKPGQAIFITNNLPYIRRLNDGYSAQAPAGFVERAVLIGRKMRKKFKIKD 144 (144) T ss_pred ccccCCCch---hHHHHHHHHHHhhcCccceEEEeeCchhhhhhhccccCCCcchHHHHHHHHHHHHHHhhccCC Confidence 00 000000 00111 11122222245789999999999999999765543322111111100000 000011 No 130 >protein:vir:100223 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025034;genbank:gi:48697267;genbank:GeneID:2948321 Probab=36.41 E-value=0.51 Score=22.20 Aligned_cols=115 Identities=10% Similarity=0.127 Sum_probs=52.9 Q ss_pred EEEchHHHHHHHHHHHHHhh----hHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhh Q lcl|NC_020839. 4 VDVKDKELEAVFSGLEGRLS----DPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLI 79 (169) Q Consensus 4 i~~~~~~l~~~L~~l~~~~~----~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~ 79 (169) |+++ +.|+++|..|..... +...+...-|..+....+++ .|... . ..++.+..+ T Consensus 1 ~~~~-~~l~e~l~~lekl~~~~~~~~~k~tkaGA~v~~~~L~~~-----tp~~~-~-----------~~~~~~~~~---- 58 (139) T protein:vir:10 1 MDMD-EALGQWLKQVSKAAQLSVSDQEKITKAGADVYAKELAET-----TKEKH-P-----------NTKGDGGKY---- 58 (139) T ss_pred CCHH-HHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHh-----ccccc-c-----------cCCCCCCCC---- Confidence 3333 456666666654432 23345555555555544433 22110 0 000111111 Q ss_pred hhhhhhhhhhhhheec------CCcEEEecCcc--cchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCC Q lcl|NC_020839. 80 GPTKTLSSPSNFAVSS------GGDWARLSSRA--IQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGI 151 (169) Q Consensus 80 ~~~~~~~~~~si~~~~------~~~~v~vGt~~--~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~ 151 (169) ..++. +|.++. .+..+.||-+. -.|++-++|+ +.+|+.+|+== T Consensus 59 ---~HlaD--~I~~~~~~idg~~~g~~~VG~~~~~~~Ahf~n~GT------------------------~~~~~~hFie~ 109 (139) T protein:vir:10 59 ---GHLSE--DISSAAGDIDGDHNGSSTVGFHNKAHIARFLNDGT------------------------KNIRADHFVDN 109 (139) T ss_pred ---Ccccc--cceecCccccccccccceeCCCCCceeeeeeccCc------------------------cccCCCchHHH Confidence 13333 333322 12236677443 2356777885 36999999764 Q ss_pred CH-HHHHHHHHHHHHhcCC Q lcl|NC_020839. 152 SA-GDQENIEAALMEWLEP 169 (169) Q Consensus 152 s~-~d~~~I~~~i~~~l~p 169 (169) +. +-+.+|...+.+-|+- T Consensus 110 t~~e~~~ev~~a~~~~~ke 128 (139) T protein:vir:10 110 ARDDAKDAVFAAEAEKYQA 128 (139) T ss_pred HHHHHHHHHHHHHHHHHHH Confidence 33 2344555555555444 No 131 >protein:vir:94944 Length: 121 # NCBI annotation: hypothetical protein phage protein # Family: family:all:448 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239282;genbank:gi:66392064;genbank:GeneID:5076589 Probab=35.63 E-value=1.2 Score=20.11 Aligned_cols=114 Identities=12% Similarity=0.135 Sum_probs=52.9 Q ss_pred CeEEEEchHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCC-CC----CCCcccc--hhHHHHHhccCccc Q lcl|NC_020839. 1 MFTVDVKDKELEAVFSGLEGRLSDPSELMAQIGELLLDSTLARFQAGKAP-DG----SPWAPKS--SATIKAYERRKQTV 73 (169) Q Consensus 1 mi~i~~~~~~l~~~L~~l~~~~~~~~~l~~~Ig~~l~~~~~~rF~~q~~P-dG----~~W~pl~--~~t~~~~~~~~~~~ 73 (169) |+..++..+ +.++.+++...+ ...+++++..+.+.... .+| |+ ..|...- +.+.-... . . T Consensus 1 ~~~~sf~~~-i~~~~~~ve~~~---~~~~r~~~~~~~~~vv~-----~sPVdtGrfRanw~vs~~~p~~~~~~~--~--d 67 (121) T protein:vir:94 1 MISMKFNVN-LSRLRSNLREEA---KKKAIRIAQEIVNGVIA-----RSPVLAGDYRSSWNVSEGSMEFKFNNG--G--N 67 (121) T ss_pred Cccchhhcc-HHHHHHHHHHHH---HHHHHHHHHHHHHHHHH-----hcCCchhhhhccccccccCcccccCCC--C--C Confidence 998887763 566666665443 34556666666665442 233 22 3354431 11000000 0 0 Q ss_pred cchhhhhhhhhhhhhhhhheecCCcEEEecCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCCH Q lcl|NC_020839. 74 SFKPLIGPTKTLSSPSNFAVSSGGDWARLSSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGISA 153 (169) Q Consensus 74 ~~~~l~~~~~~~~~~~si~~~~~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s~ 153 (169) ........ .+ .+......+.+-|.+|++||.-..+|-..++..... +++- T Consensus 68 p~g~~t~~--~~----~~~~~~~~~~iyi~NnlpYA~~LE~G~S~QAP~G~v------------------------~~t~ 117 (121) T protein:vir:94 68 PANPTPAP--AI----VVSSNVALPHFYITNGAPYAQQLEKGSSTQAPLGIV------------------------RVTL 117 (121) T ss_pred CCcchhHH--HH----HHHHhhccceEEEeeCcchhhhhhcccCCCCcchHH------------------------HHHH Confidence 00000000 01 112223356789999999999999996554432211 1111 Q ss_pred HHHH Q lcl|NC_020839. 154 GDQE 157 (169) Q Consensus 154 ~d~~ 157 (169) ...+ T Consensus 118 ~~~q 121 (121) T protein:vir:94 118 ASLR 121 (121) T ss_pred HhhC Confidence 1111 No 132 >protein:vir:3848 Length: 159 # NCBI annotation: hypothetical protein # Family: family:all:1029 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050154;swissprot:trembl:q9t1f3;genbank:gi:9633046;uniprot:Q9T1F3;genbank:GeneID:1262148 Probab=33.97 E-value=1.3 Score=19.92 Aligned_cols=131 Identities=8% Similarity=0.028 Sum_probs=54.3 Q ss_pred CeEEEEchHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCCC----CCCCcccchhHHHHHhccCccccch Q lcl|NC_020839. 1 MFTVDVKDKELEAVFSGLEGRLSDPSELMAQIGELLLDSTLARFQAGKAPD----GSPWAPKSSATIKAYERRKQTVSFK 76 (169) Q Consensus 1 mi~i~~~~~~l~~~L~~l~~~~~~~~~l~~~Ig~~l~~~~~~rF~~q~~Pd----G~~W~pl~~~t~~~~~~~~~~~~~~ 76 (169) |-+++ +.|.++|.++...+.....--..|-..=...+++++..+ .|. -.+|.+-... ..+.... T Consensus 2 m~~~~---~~l~~~l~~v~k~~~~~~~~k~kiTkAGAkv~~e~L~~~-Tp~~h~~~~k~~~~~~~------~~k~~~~-- 69 (159) T protein:vir:38 2 ANDMG---EFYNNWVNEVEKGMKLSVEDKAKITGEGAEAFSTVLHDH-TPRSNEIYRRGRSAGHA------NAKHHNR-- 69 (159) T ss_pred cchHH---HHHHHHHHHHHHhcCCCHHHHHHHHHHhHHHHHHHHHHh-cccCCCccccccccccc------cccccCc-- Confidence 55543 447788888766443111112333233334445555544 232 2233221000 0000000 Q ss_pred hhhhhhhhhhhhhhhheecCC-----cEEEecCcc----cchhhhhcccccccchhhhhhhhccccccccCceeeccCcc Q lcl|NC_020839. 77 PLIGPTKTLSSPSNFAVSSGG-----DWARLSSRA----IQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARP 147 (169) Q Consensus 77 ~l~~~~~~~~~~~si~~~~~~-----~~v~vGt~~----~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRp 147 (169) -.....++.++.+....+- ..+.||-.. -+|.+.|.|.. .+|..| T Consensus 70 --~~~~~HlaD~I~~~~~~~iDg~~dG~s~VGw~~~~~a~~a~f~NdGT~------------------------~m~~k~ 123 (159) T protein:vir:38 70 --NRKTKHLQDSITYKPGYTADKLHTGDTDVGFEGKYYDFLAKIVNNGQH------------------------HMSPKR 123 (159) T ss_pred --CcCCCccccceeeecCccccccccceeeecccCCccceEeeecccCcc------------------------ccCCCC Confidence 1122355554433221111 246677533 45677788864 467666 Q ss_pred cCC--CCH------------HHHHHHHHHHHHhcCC Q lcl|NC_020839. 148 FMG--ISA------------GDQENIEAALMEWLEP 169 (169) Q Consensus 148 fLG--~s~------------~d~~~I~~~i~~~l~p 169 (169) +=| |=+ +..+++.+||.+.-+- T Consensus 124 ~~gdHFvekt~~~~k~~Vl~A~~~~~~~il~~~~~~ 159 (159) T protein:vir:38 124 YKNMHFLDKAQQEAKKSVAEAELKAYKEVMNHDSDK 159 (159) T ss_pred ccCChhHHHHHHHHHHHHHHHHHHHHHHHhhcccCC Confidence 666 211 1123333444333333 No 133 >protein:vir:4833 Length: 140 # NCBI annotation: ORF29 # Family: family:all:1029 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038330;genbank:gi:9634656;genbank:GeneID:1262624 Probab=32.41 E-value=1.1 Score=20.40 Aligned_cols=121 Identities=14% Similarity=0.084 Sum_probs=52.1 Q ss_pred CeEEEEchHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhhh Q lcl|NC_020839. 1 MFTVDVKDKELEAVFSGLEGRLSDPSELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLIG 80 (169) Q Consensus 1 mi~i~~~~~~l~~~L~~l~~~~~~~~~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~~ 80 (169) |-+++ +.|+++|..+..........-.+|-..=...+.+++... .|.+. ... +++.. T Consensus 1 M~~~~---d~l~e~~~~v~kl~~~~~~~~~katkAGAkv~~~~L~~~-tp~~h-~~~------------r~t~~------ 57 (140) T protein:vir:48 1 MTGLD---EALEGWLKTVASIGDLTPAEQAKITTAGAKVFKKELAEV-TREKH-YSK------------KKDLK------ 57 (140) T ss_pred CccHH---HHHHHHHHHHHHhccCCHHHHHHHHHHhHHHHHHHHHHh-cccCC-CCC------------CCCCC------ Confidence 88765 245555555544332111112233223333445555543 33211 100 01100 Q ss_pred hhhhhhhhhhhhee----cCCcEEEecCc----ccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCC Q lcl|NC_020839. 81 PTKTLSSPSNFAVS----SGGDWARLSSR----AIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGIS 152 (169) Q Consensus 81 ~~~~~~~~~si~~~----~~~~~v~vGt~----~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s 152 (169) ...++..+.++.. ..+....||-. .-+|.+-|+|++ .||+.+|+==+ T Consensus 58 -~~HlaD~I~~~~~~idg~~dG~s~VG~~k~~~a~~a~f~NdGT~------------------------k~~~~hFve~t 112 (140) T protein:vir:48 58 -YGHMADGLAVQSTNVDGRKNGVATVGWKNNYHAQNARRLNDGTK------------------------KYRADHFVTNV 112 (140) T ss_pred -CCcccccceecccccccccccceeecccCCCceeEEeecccCcc------------------------ccCCCchHHHH Confidence 1244443333210 01223445544 346778888863 59999997655 Q ss_pred HHH---HHHHHHHHHHhcCC Q lcl|NC_020839. 153 AGD---QENIEAALMEWLEP 169 (169) Q Consensus 153 ~~d---~~~I~~~i~~~l~p 169 (169) .++ +.+|.....+.++- T Consensus 113 ~~e~~~~~~vl~A~~~~y~~ 132 (140) T protein:vir:48 113 QNDSAVRDKVLLAEKEEYEK 132 (140) T ss_pred HHhhhhHHHHHHHHHHHHHH Confidence 543 23333333333222 No 134 >protein:vir:97190 Length: 148 # NCBI annotation: hypothetical protein ORF030 # Family: family:all:448 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294538;genbank:gi:149408259;genbank:GeneID:5237055 Probab=31.44 E-value=1.5 Score=19.62 Aligned_cols=135 Identities=10% Similarity=0.078 Sum_probs=55.2 Q ss_pred CeEEE-EchHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCC-C-C---CCCcccc--hhHHHHHhccCcc Q lcl|NC_020839. 1 MFTVD-VKDKELEAVFSGLEGRLSDPSELMAQIGELLLDSTLARFQAGKAP-D-G---SPWAPKS--SATIKAYERRKQT 72 (169) Q Consensus 1 mi~i~-~~~~~l~~~L~~l~~~~~~~~~l~~~Ig~~l~~~~~~rF~~q~~P-d-G---~~W~pl~--~~t~~~~~~~~~~ 72 (169) |.++. +. .++.++.+++.+ +...++++++..+.+.... .+| | | ..|...- +.+.-...-.... T Consensus 1 m~~~~sFa-~~i~~~~~~ve~---~~~~~~r~~a~~i~~~vv~-----~sPVdTGrfRanw~vs~~~p~~~~~~~~dp~~ 71 (148) T protein:vir:97 1 MPSLSEFS-RRITLRGRKVAE---GADALTRKVALAADQAVVS-----GTPVDTGRARSNWIAAIGSAPSSVIDAYSPGE 71 (148) T ss_pred CCccchhc-ccHHHHHHHHHH---HHHHHHHHHHHHHHHHHHH-----hCCCcchhhhhhhheeecccccccccccCCCC Confidence 77763 22 235555555543 3455667777777766654 244 2 2 3454331 1110000000000 Q ss_pred ccchhhhhhhhhhhh-hhhhheecCCcEEEecCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCC Q lcl|NC_020839. 73 VSFKPLIGPTKTLSS-PSNFAVSSGGDWARLSSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGI 151 (169) Q Consensus 73 ~~~~~l~~~~~~~~~-~~si~~~~~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~ 151 (169) .++...-....++.. ..-|..-.-++.+-|++|++||.-.++|-..+... -|.++ T Consensus 72 ~G~~~~~~~~~~i~~~~~vi~~~k~g~~iyi~NnlpYA~~LEyG~S~QAP~------------------------G~v~~ 127 (148) T protein:vir:97 72 AGSTEAANTQAAIDQAESVIRGYNYGEEIHITNNLPYIQRLNDGYSAQAPA------------------------NFVEQ 127 (148) T ss_pred CCcccccchhHHHHHHHHHhhccCCCceEEEeecchhhhHhhccccCCCcc------------------------hHHHH Confidence 000000000011111 11122222356889999999999999997654322 22222 Q ss_pred CHHHHHHHHHHHHHhcC--C Q lcl|NC_020839. 152 SAGDQENIEAALMEWLE--P 169 (169) Q Consensus 152 s~~d~~~I~~~i~~~l~--p 169 (169) +.+....|.+- .+.++ | T Consensus 128 t~~~~~~~v~~-~~~~~~~~ 146 (148) T protein:vir:97 128 AVLEAVQVVQF-GRVVDGDP 146 (148) T ss_pred HHHHHHHHHHh-hhhhcCCC Confidence 22222222111 11111 1 No 135 >protein:vir:10367 Length: 119 # NCBI annotation: conserved phage protein # Family: family:all:2714 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858959;genbank:gi:32128424;genbank:GeneID:2648366 Probab=28.33 E-value=1.1 Score=20.35 Aligned_cols=93 Identities=19% Similarity=0.083 Sum_probs=32.8 Q ss_pred cccchhhhhhhhhhhhhhhhheec-------CCcEEEecCcc---cchhhhhcccccccchhhhhhhhcccccccc---- Q lcl|NC_020839. 72 TVSFKPLIGPTKTLSSPSNFAVSS-------GGDWARLSSRA---IQSAVMQFGAKKGAFGSYQGKGFGGSSSTIS---- 137 (169) Q Consensus 72 ~~~~~~l~~~~~~~~~~~si~~~~-------~~~~v~vGt~~---~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~---- 137 (169) .....+......++.|..||...+ +...-.||-|. ||+..-.||---.. .......+.++. T Consensus 1 ~rDeakarv~~~~G~Lr~sIY~ay~~~~S~dG~~~Y~Vswn~rkAPhghlvE~Ghw~~~-----~~~~~~dG~w~~~~~~ 75 (119) T protein:vir:10 1 MRESAKAFVNDETGKLRSNLYVAYSTEESTNGVQTYAVSWRKKAAPHGHLLEFGHWQTH-----AAYKGKDGEWYSSSVK 75 (119) T ss_pred CCcccccccCCCccchhhhheeeeccccCCCCEEEEEeecCCCcCCcccccccceeeee-----eeeeccCceeeecCcc Confidence 001111222223333344553221 11234566665 44455566611000 000011111111 Q ss_pred -CceeeccCcccCCCC-----HHHHH----HHHHHHHHhcCC Q lcl|NC_020839. 138 -IPWGDIPARPFMGIS-----AGDQE----NIEAALMEWLEP 169 (169) Q Consensus 138 -~~~~~iPaRpfLG~s-----~~d~~----~I~~~i~~~l~p 169 (169) .-...|||+|||==. ++..+ ...+.+.+-|+= T Consensus 76 l~~~~~vPa~pFlRpA~da~~~~a~~~~~~r~~~rv~Ev~rg 117 (119) T protein:vir:10 76 LVNPKWIPARPFLRPGYDSVAMQIPDIAKAAGAKKYAELQRG 117 (119) T ss_pred ccCceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 112469999998521 11122 222223333333 No 136 >protein:vir:81067 Length: 119 # NCBI annotation: p12 # Family: family:all:2714 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285682;genbank:gi:156535145;genbank:GeneID:5247112 Probab=27.71 E-value=1.2 Score=20.25 Aligned_cols=93 Identities=18% Similarity=0.068 Sum_probs=33.0 Q ss_pred cccchhhhhhhhhhhhhhhhheec-------CCcEEEecCcc---cchhhhhcccccccchhhhhhhhccccccc----- Q lcl|NC_020839. 72 TVSFKPLIGPTKTLSSPSNFAVSS-------GGDWARLSSRA---IQSAVMQFGAKKGAFGSYQGKGFGGSSSTI----- 136 (169) Q Consensus 72 ~~~~~~l~~~~~~~~~~~si~~~~-------~~~~v~vGt~~---~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~----- 136 (169) ...+.+......++.|..||...+ +...-.||-|. ||+..-.||---.. .......+.++ T Consensus 1 ~rDeakarv~~~~G~Lr~sIY~ay~~~~S~dG~~~Y~Vswn~rkAPhghlvE~Ghw~~~-----~~~~~~dG~w~~~~~~ 75 (119) T protein:vir:81 1 MRESAKAFVNDETGKLRSNLYVAYSPEESTNGVQTYAVSWRKKAAPHGHLLEFGHWQTH-----AAYKGKDGEWYSSSVK 75 (119) T ss_pred CCcccccccCCCccchhhhheeeeccccCCCCeEEEEeeccCCcCCcccccccceeeee-----eeeeccCceeeecCcc Confidence 001111222223333344553221 11234566665 44455566611100 00011111111 Q ss_pred cCceeeccCcccCCCC-----HHHHH----HHHHHHHHhcCC Q lcl|NC_020839. 137 SIPWGDIPARPFMGIS-----AGDQE----NIEAALMEWLEP 169 (169) Q Consensus 137 ~~~~~~iPaRpfLG~s-----~~d~~----~I~~~i~~~l~p 169 (169) -.-...|||+|||==. ++..+ ...+.+.+-|+= T Consensus 76 l~~~~~vPa~pFlRpA~da~~~~a~~~~~~r~~~rv~Ev~rg 117 (119) T protein:vir:81 76 LVNPKWIPARPFLRPGYDSVAMQIPDIAKAAGAKKYAELQRG 117 (119) T ss_pred ccCceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 1112469999998521 11122 222223333333 No 137 >protein:vir:80425 Length: 134 # NCBI annotation: BcepGomrgp15 # Family: family:all:448 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210235;genbank:gi:146329927;genbank:GeneID:5123534 Probab=25.27 E-value=2.1 Score=18.84 Aligned_cols=125 Identities=13% Similarity=0.187 Sum_probs=52.2 Q ss_pred eEEEEchHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCC-C-C---CCCcccc--hhHHHHHhccCcccc Q lcl|NC_020839. 2 FTVDVKDKELEAVFSGLEGRLSDPSELMAQIGELLLDSTLARFQAGKAP-D-G---SPWAPKS--SATIKAYERRKQTVS 74 (169) Q Consensus 2 i~i~~~~~~l~~~L~~l~~~~~~~~~l~~~Ig~~l~~~~~~rF~~q~~P-d-G---~~W~pl~--~~t~~~~~~~~~~~~ 74 (169) .++.. ++.++.+++.+ +...++++++..+.+....+ +| | | ..|...- +.+...-.-.+.... T Consensus 1 msF~~---~i~~~~~~ve~---~~~~~~r~~a~~~~~~vv~~-----sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~g~~ 69 (134) T protein:vir:80 1 MSYTD---RFNVIAKGIED---NVDNLVKNVALAIGSNVIAD-----TPILTGQARRNWQTELNQMPESVLDIPESPSEG 69 (134) T ss_pred CCccc---CHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHh-----CCCcchhhhcccceeecCcccccccCcCCCCcc Confidence 33332 34444444433 34556677777777766552 44 2 2 3465441 111100000000000 Q ss_pred chhhhhhhhhhhh-hhhhheecCCcEEEecCcccchhhhhcccccccchhhhhhhhccccccccCceeeccCcccCCCCH Q lcl|NC_020839. 75 FKPLIGPTKTLSS-PSNFAVSSGGDWARLSSRAIQSAVMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMGISA 153 (169) Q Consensus 75 ~~~l~~~~~~~~~-~~si~~~~~~~~v~vGt~~~YA~iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG~s~ 153 (169) . ..++.. ...|..-.-++.+-|++|++||.-.++|-..+.... |.+++. T Consensus 70 ~------~~~~~~~~~vi~~~k~g~~iyi~Nn~pYA~~LEyG~S~QAP~G------------------------~v~~t~ 119 (134) T protein:vir:80 70 M------DEALQVLQQTVGQYKAGDTVHITNNAPYIKELNSGSSQQAPAN------------------------FVETSI 119 (134) T ss_pred c------hhhHHHHHHHHhhccCcceEEEeeCchhhhhhhccccCCCcch------------------------HHHHHH Confidence 0 011111 112222223477899999999999999976543221 112222 Q ss_pred HHHHHHHHHHHHhcCC Q lcl|NC_020839. 154 GDQENIEAALMEWLEP 169 (169) Q Consensus 154 ~d~~~I~~~i~~~l~p 169 (169) +....|.+-+.. =| T Consensus 120 ~~~~~~v~~~~~--~~ 133 (134) T protein:vir:80 120 MRATRLIRNVKV--VP 133 (134) T ss_pred HHHHHHHHhhcc--CC Confidence 211111111110 01 No 138 >protein:vir:105773 Length: 131 # NCBI annotation: gp14 # Family: family:all:10996 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224152;genbank:gi:62362227;genbank:GeneID:3342526 Probab=24.70 E-value=1.7 Score=19.32 Aligned_cols=125 Identities=16% Similarity=0.104 Sum_probs=61.8 Q ss_pred EEEch-HHHHHHHHHHHHHhhhHH--HHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccchhHHHHHhccCccccchhhhh Q lcl|NC_020839. 4 VDVKD-KELEAVFSGLEGRLSDPS--ELMAQIGELLLDSTLARFQAGKAPDGSPWAPKSSATIKAYERRKQTVSFKPLIG 80 (169) Q Consensus 4 i~~~~-~~l~~~L~~l~~~~~~~~--~l~~~Ig~~l~~~~~~rF~~q~~PdG~~W~pl~~~t~~~~~~~~~~~~~~~l~~ 80 (169) |.+.+ .++...|+++++.....+ ..|..+.-.. +..-.-+.|..-+|+ +. T Consensus 1 ikV~Gi~~~~~nl~~~i~~I~~~K~~Ral~~al~~~------------~~~AA~~TPIDTSTL---------------iN 53 (131) T protein:vir:10 1 MPVKGIKRIQMNTRRVLSDIAGIRTEKVLYLVMNAG------------ANHAAVITPVKSSTL---------------IN 53 (131) T ss_pred CCcchHHHHHHHHHHHHHhhccchHHHHHHHHHHHH------------Hhhhhhccccchhhh---------------cc Confidence 66665 677777777777665432 1222211111 111123555433321 11 Q ss_pred hhhhhhhhhhhheecCCcEEEecCcccchh-hhhcccccccchhhhhhhhccccccccCceeeccCcccCC--CCHHHHH Q lcl|NC_020839. 81 PTKTLSSPSNFAVSSGGDWARLSSRAIQSA-VMQFGAKKGAFGSYQGKGFGGSSSTISIPWGDIPARPFMG--ISAGDQE 157 (169) Q Consensus 81 ~~~~~~~~~si~~~~~~~~v~vGt~~~YA~-iHqfGg~~~~~~~~~~~~~~~~~~~~~~~~~~iPaRpfLG--~s~~d~~ 157 (169) + -...+.+..+.-...||-+..||+ +|.--|+-.-..+++.....+ .| =-.-+||- |.++..+ T Consensus 54 S-----Qfrei~~ngtritGRVGYSAnYA~yVHda~Gklkgqprp~gkgn~w------~p---~ae~eFL~kgfe~~~~d 119 (131) T protein:vir:10 54 S-----QYKKLEPIPSGMIGRVGYTANYAAAVNAAKGKLKGKPRPDGSGNYW------DP---NGEPDFLRKGFERDGLN 119 (131) T ss_pred c-----cceeeeccCceeEEeeccceeeeeeeecCccccCCCcCCCCCccee------cC---CCChhhhhhhhhccchH Confidence 1 012344555556689999999996 566544432111111111000 00 11235774 4444467 Q ss_pred HHHHHHHHhcCC Q lcl|NC_020839. 158 NIEAALMEWLEP 169 (169) Q Consensus 158 ~I~~~i~~~l~p 169 (169) +|..+|.+.++- T Consensus 120 ~i~avik~e~k~ 131 (131) T protein:vir:10 120 EIKAIIRQGYKV 131 (131) T ss_pred HHHHHHhhhcCC Confidence 899999998888 Done!