Query lcl|NC_019918.1_cdsid_YP_007236875.1 [gene=BN405_2-10_Ab1_orf_54] [protein=hypothetical protein] [protein_id=YP_007236875.1] [location=30941..31483] Match_columns 180 No_of_seqs 116 out of 190 Neff 5.8 Searched_HMMs 1612 Date Thu Nov 7 16:47:50 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_54 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_54_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:96105 Length: 193 100.0 8.8E-54 5.5E-57 311.4 16.9 148 28-177 1-193 (193) 2 protein:vir:99546 Length: 200 100.0 1.3E-53 8.2E-57 310.5 16.2 155 18-177 1-200 (200) 3 protein:vir:107757 Length: 189 100.0 2E-53 1.3E-56 309.5 16.6 150 20-180 1-188 (189) 4 protein:vir:5257 Length: 148 # 100.0 3.1E-52 1.9E-55 303.0 16.7 146 20-177 1-148 (148) 5 protein:vir:78607 Length: 155 100.0 7.4E-50 4.6E-53 289.9 15.8 140 30-178 1-155 (155) 6 protein:vir:106728 Length: 155 100.0 8.1E-50 5E-53 289.7 15.9 140 30-178 1-155 (155) 7 protein:vir:94069 Length: 168 100.0 2.9E-49 1.8E-52 286.7 14.7 146 24-180 1-160 (168) 8 protein:vir:101563 Length: 155 100.0 7.2E-49 4.5E-52 284.5 15.4 140 30-178 1-155 (155) 9 protein:vir:80037 Length: 199 100.0 7.3E-49 4.5E-52 284.5 14.7 143 28-179 1-199 (199) 10 protein:vir:77650 Length: 155 100.0 2.5E-48 1.5E-51 281.6 15.4 140 30-178 1-155 (155) 11 protein:vir:95260 Length: 160 100.0 1.1E-45 6.7E-49 267.1 16.3 148 24-180 1-156 (160) 12 protein:vir:3163 Length: 145 # 98.6 3.2E-10 2E-13 72.7 6.8 77 94-180 1-84 (145) 13 protein:vir:4347 Length: 164 # 98.4 1.8E-10 1.1E-13 74.0 2.7 103 24-136 1-164 (164) 14 protein:vir:79225 Length: 155 98.3 1.5E-09 9E-13 69.0 5.6 86 91-180 1-94 (155) 15 protein:vir:79091 Length: 175 98.3 1.7E-09 1E-12 68.7 5.8 86 91-180 1-111 (175) 16 protein:vir:94538 Length: 125 98.3 1.1E-09 6.9E-13 69.7 4.3 92 24-119 1-125 (125) 17 protein:vir:103841 Length: 155 98.3 1.8E-09 1.1E-12 68.5 5.3 86 85-180 1-94 (155) 18 protein:vir:102875 Length: 146 98.3 2.3E-09 1.4E-12 68.0 5.8 109 1-120 1-146 (146) 19 protein:vir:107568 Length: 146 98.3 2.3E-09 1.4E-12 68.0 5.8 109 1-120 1-146 (146) 20 protein:vir:102085 Length: 146 98.3 2.3E-09 1.4E-12 68.0 5.8 109 1-120 1-146 (146) 21 protein:vir:105007 Length: 146 98.3 2.3E-09 1.4E-12 68.0 5.8 109 1-120 1-146 (146) 22 protein:vir:99833 Length: 190 98.3 2.9E-09 1.8E-12 67.4 6.2 85 90-180 1-94 (190) 23 protein:vir:1437 Length: 140 # 98.3 3.3E-09 2E-12 67.1 6.2 90 24-120 1-140 (140) 24 protein:vir:99196 Length: 155 98.3 3.3E-09 2.1E-12 67.0 5.6 86 91-180 1-94 (155) 25 protein:vir:1891 Length: 179 # 98.2 1E-09 6.2E-13 69.9 2.6 103 24-160 1-179 (179) 26 protein:vir:1386 Length: 149 # 98.2 4.1E-09 2.5E-12 66.6 5.6 112 1-143 1-149 (149) 27 protein:vir:1988 Length: 156 # 98.2 4.4E-09 2.7E-12 66.4 5.6 85 91-180 1-98 (156) 28 protein:vir:100075 Length: 140 98.2 6.4E-09 4E-12 65.5 6.0 90 24-120 1-140 (140) 29 protein:vir:100243 Length: 140 98.2 1.2E-08 7.2E-12 64.1 6.7 90 24-120 1-140 (140) 30 protein:vir:80362 Length: 140 98.1 1.1E-08 6.5E-12 64.3 5.9 90 24-120 1-140 (140) 31 protein:vir:194 Length: 149 # 98.1 8.1E-09 5E-12 65.0 5.3 94 24-128 1-149 (149) 32 protein:vir:93617 Length: 148 98.1 8E-09 5E-12 65.0 5.2 94 24-128 1-148 (148) 33 protein:vir:1273 Length: 127 # 98.1 1.1E-08 6.7E-12 64.3 5.0 100 1-117 1-127 (127) 34 protein:vir:95789 Length: 114 98.0 1.5E-08 9.4E-12 63.5 4.7 85 28-117 1-114 (114) 35 protein:vir:107851 Length: 175 98.0 2.7E-08 1.7E-11 62.1 5.3 86 91-180 1-111 (175) 36 protein:vir:97088 Length: 157 97.9 5.6E-08 3.5E-11 60.3 6.4 92 20-120 1-157 (157) 37 protein:vir:78858 Length: 115 97.9 1.8E-08 1.1E-11 63.1 3.0 80 28-113 1-115 (115) 38 protein:vir:96358 Length: 115 97.9 1.8E-08 1.1E-11 63.1 3.0 80 28-113 1-115 (115) 39 protein:vir:96225 Length: 115 97.9 1.8E-08 1.1E-11 63.1 3.0 80 28-113 1-115 (115) 40 protein:vir:97144 Length: 115 97.9 1.8E-08 1.1E-11 63.1 3.0 80 28-113 1-115 (115) 41 protein:vir:9312 Length: 115 # 97.9 1.8E-08 1.1E-11 63.1 3.0 80 28-113 1-115 (115) 42 protein:vir:103917 Length: 115 97.9 1.8E-08 1.1E-11 63.1 3.0 80 28-113 1-115 (115) 43 protein:vir:106623 Length: 115 97.9 1.5E-08 9.5E-12 63.4 2.4 80 28-113 1-115 (115) 44 protein:vir:9414 Length: 125 # 97.8 4.6E-08 2.8E-11 60.8 4.6 111 1-117 1-125 (125) 45 protein:vir:79988 Length: 125 97.8 4.6E-08 2.8E-11 60.8 4.6 111 1-117 1-125 (125) 46 protein:vir:4704 Length: 125 # 97.8 4.6E-08 2.8E-11 60.8 4.6 111 1-117 1-125 (125) 47 protein:vir:98342 Length: 125 97.8 4.6E-08 2.8E-11 60.8 4.6 111 1-117 1-125 (125) 48 protein:vir:81106 Length: 125 97.8 4.6E-08 2.8E-11 60.8 4.6 111 1-117 1-125 (125) 49 protein:vir:3873 Length: 128 # 97.8 5.8E-08 3.6E-11 60.3 4.6 85 20-117 1-128 (128) 50 protein:vir:3617 Length: 112 # 97.8 3.7E-08 2.3E-11 61.3 3.5 84 24-113 1-112 (112) 51 protein:vir:79091 Length: 175 97.8 1.5E-07 9E-11 58.1 6.6 93 24-119 1-175 (175) 52 protein:vir:2740 Length: 114 # 97.8 4.5E-08 2.8E-11 60.8 3.4 84 24-114 1-114 (114) 53 protein:vir:4906 Length: 114 # 97.8 4.5E-08 2.8E-11 60.8 3.4 84 24-114 1-114 (114) 54 protein:vir:5745 Length: 135 # 97.7 1.4E-07 8.4E-11 58.2 5.3 92 24-131 1-135 (135) 55 protein:vir:9930 Length: 108 # 97.7 6.5E-08 4E-11 60.0 3.5 99 2-114 1-108 (108) 56 protein:vir:105089 Length: 133 97.7 8E-08 5E-11 59.5 4.0 89 24-119 1-133 (133) 57 protein:vir:2026 Length: 150 # 97.7 1.1E-07 7.1E-11 58.6 4.5 91 1-114 47-150 (150) 58 protein:vir:99744 Length: 115 97.6 7.3E-08 4.5E-11 59.7 2.9 80 28-113 1-115 (115) 59 protein:vir:5978 Length: 144 # 97.6 1.4E-07 9E-11 58.1 4.3 87 25-113 1-144 (144) 60 protein:vir:107851 Length: 175 97.6 6.7E-07 4.2E-10 54.4 7.8 93 24-119 1-175 (175) 61 protein:vir:6071 Length: 150 # 97.5 2.6E-07 1.6E-10 56.7 4.7 91 1-114 47-150 (150) 62 protein:vir:96486 Length: 112 97.5 8.1E-08 5E-11 59.5 1.6 82 24-112 1-112 (112) 63 protein:vir:5703 Length: 150 # 97.5 3.3E-07 2.1E-10 56.1 4.4 91 1-114 47-150 (150) 64 protein:vir:98409 Length: 108 97.4 2E-07 1.3E-10 57.3 2.6 100 7-113 1-108 (108) 65 protein:vir:9708 Length: 125 # 97.2 3.4E-07 2.1E-10 56.1 2.0 98 1-118 1-125 (125) 66 protein:vir:743 Length: 108 # 97.2 5.4E-07 3.4E-10 54.9 3.0 100 7-113 1-108 (108) 67 protein:vir:103841 Length: 155 97.2 2.6E-06 1.6E-09 51.2 6.5 94 24-120 1-155 (155) 68 protein:vir:99833 Length: 190 97.2 1.6E-06 9.6E-10 52.4 5.2 95 1-120 51-190 (190) 69 protein:vir:98557 Length: 149 97.2 4.2E-06 2.6E-09 50.1 7.5 81 95-180 1-90 (149) 70 protein:vir:98557 Length: 149 97.1 9.6E-07 6E-10 53.6 3.5 90 1-114 47-149 (149) 71 protein:vir:101594 Length: 173 97.0 2.7E-06 1.7E-09 51.1 5.5 85 28-119 1-173 (173) 72 protein:vir:100312 Length: 152 97.0 3.2E-06 2E-09 50.7 5.3 91 1-115 48-152 (152) 73 protein:vir:79115 Length: 148 97.0 2.3E-06 1.4E-09 51.5 4.4 91 1-118 47-148 (148) 74 protein:vir:106570 Length: 182 96.9 1.5E-06 9.2E-10 52.5 3.1 96 24-122 1-182 (182) 75 protein:vir:1838 Length: 149 # 96.8 6.3E-06 3.9E-09 49.1 5.6 90 1-114 47-149 (149) 76 protein:vir:94654 Length: 142 96.5 9.1E-06 5.6E-09 48.2 4.9 86 25-116 1-142 (142) 77 protein:vir:2026 Length: 150 # 96.5 2.3E-05 1.4E-08 46.0 6.6 81 95-180 1-90 (150) 78 protein:vir:1988 Length: 156 # 96.4 3.5E-05 2.2E-08 45.0 7.1 95 1-118 51-156 (156) 79 protein:vir:79179 Length: 155 96.3 1.8E-05 1.1E-08 46.7 5.2 94 1-114 48-155 (155) 80 protein:vir:1164 Length: 156 # 96.2 2.8E-05 1.7E-08 45.6 6.0 96 1-123 48-156 (156) 81 protein:vir:6071 Length: 150 # 96.2 3.2E-05 2E-08 45.3 6.2 81 95-180 1-90 (150) 82 protein:vir:3163 Length: 145 # 96.2 3.6E-05 2.2E-08 44.9 6.1 96 1-123 43-145 (145) 83 protein:vir:5703 Length: 150 # 96.0 5.2E-05 3.3E-08 44.1 6.2 79 95-180 1-90 (150) 84 protein:vir:94490 Length: 137 95.8 1.2E-05 7.2E-09 47.6 2.0 82 24-109 1-137 (137) 85 protein:vir:93738 Length: 137 95.8 1.2E-05 7.2E-09 47.6 2.0 82 24-109 1-137 (137) 86 protein:vir:97427 Length: 137 95.8 1.2E-05 7.2E-09 47.6 2.0 82 24-109 1-137 (137) 87 protein:vir:107099 Length: 137 95.7 1.7E-05 1E-08 46.8 2.4 78 24-109 1-137 (137) 88 protein:vir:105330 Length: 137 95.6 2E-05 1.2E-08 46.4 2.3 82 24-109 1-137 (137) 89 protein:vir:78077 Length: 141 95.5 2.2E-05 1.4E-08 46.1 2.3 90 25-121 1-141 (141) 90 protein:vir:95894 Length: 137 95.5 2.4E-05 1.5E-08 45.9 2.5 78 24-109 1-137 (137) 91 protein:vir:79225 Length: 155 95.4 8.3E-05 5.1E-08 43.0 5.3 95 24-120 1-155 (155) 92 protein:vir:102154 Length: 119 95.4 7.2E-06 4.5E-09 48.8 -0.5 94 1-117 20-119 (119) 93 protein:vir:94796 Length: 137 95.3 3.1E-05 1.9E-08 45.3 2.4 82 24-109 1-137 (137) 94 protein:vir:99196 Length: 155 95.2 0.00015 9.5E-08 41.5 6.2 94 24-120 1-155 (155) 95 protein:vir:96829 Length: 135 95.2 2.4E-05 1.5E-08 45.9 1.5 82 24-109 1-135 (135) 96 protein:vir:105916 Length: 149 95.0 3.6E-05 2.2E-08 45.0 2.2 93 1-109 1-149 (149) 97 protein:vir:5000 Length: 141 # 94.9 0.00011 7.1E-08 42.2 4.6 110 1-123 1-141 (141) 98 protein:vir:96121 Length: 137 94.9 4.4E-05 2.7E-08 44.5 2.2 80 24-109 1-137 (137) 99 protein:vir:97327 Length: 116 94.7 3.4E-05 2.1E-08 45.1 1.1 77 24-109 1-116 (116) 100 protein:vir:1243 Length: 116 # 94.7 3.4E-05 2.1E-08 45.1 1.1 77 24-109 1-116 (116) 101 protein:vir:95062 Length: 116 94.5 3.5E-05 2.2E-08 45.0 0.8 77 24-109 1-116 (116) 102 protein:vir:94108 Length: 149 94.3 5.4E-05 3.3E-08 44.0 1.3 95 1-109 1-149 (149) 103 protein:vir:1838 Length: 149 # 94.2 0.0005 3.1E-07 38.7 6.4 81 95-180 1-90 (149) 104 protein:vir:4956 Length: 153 # 94.1 9E-05 5.6E-08 42.8 2.1 122 1-159 1-153 (153) 105 protein:vir:4833 Length: 140 # 94.0 0.00026 1.6E-07 40.3 4.5 117 1-120 1-140 (140) 106 protein:vir:79115 Length: 148 93.5 0.00098 6.1E-07 37.1 6.9 81 95-180 1-89 (148) 107 protein:vir:81067 Length: 119 93.3 0.00021 1.3E-07 40.8 2.7 83 1-120 1-119 (119) 108 protein:vir:10367 Length: 119 93.1 0.00024 1.5E-07 40.5 2.7 83 1-120 1-119 (119) 109 protein:vir:81147 Length: 126 93.0 0.00099 6.1E-07 37.1 6.1 86 24-120 1-126 (126) 110 protein:vir:4859 Length: 140 # 92.9 0.00055 3.4E-07 38.5 4.4 109 1-123 1-140 (140) 111 protein:vir:78755 Length: 228 92.3 0.0012 7.6E-07 36.5 5.6 97 1-156 49-228 (228) 112 protein:vir:3787 Length: 231 # 92.2 0.0027 1.7E-06 34.6 7.4 91 1-121 53-231 (231) 113 protein:vir:97427 Length: 137 91.6 0.00036 2.2E-07 39.5 1.8 64 91-180 1-64 (137) 114 protein:vir:94490 Length: 137 91.6 0.00036 2.2E-07 39.5 1.8 64 91-180 1-64 (137) 115 protein:vir:93738 Length: 137 91.6 0.00036 2.2E-07 39.5 1.8 64 91-180 1-64 (137) 116 protein:vir:100887 Length: 139 91.3 0.00084 5.2E-07 37.5 3.5 106 5-124 1-139 (139) 117 protein:vir:99101 Length: 142 91.1 0.00053 3.3E-07 38.5 2.3 82 24-113 1-142 (142) 118 protein:vir:8669 Length: 142 # 91.1 0.00053 3.3E-07 38.5 2.3 82 24-113 1-142 (142) 119 protein:vir:106041 Length: 137 91.1 0.00032 2E-07 39.8 1.1 85 24-121 1-137 (137) 120 protein:vir:96121 Length: 137 89.8 0.00067 4.2E-07 38.0 1.7 64 91-180 1-64 (137) 121 protein:vir:94796 Length: 137 89.2 0.00084 5.2E-07 37.4 1.7 64 91-180 1-64 (137) 122 protein:vir:79034 Length: 141 89.0 0.003 1.9E-06 34.4 4.6 116 1-122 1-141 (141) 123 protein:vir:96829 Length: 135 88.6 0.002 1.2E-06 35.4 3.4 64 91-180 1-64 (135) 124 protein:vir:1243 Length: 116 # 88.6 0.0014 8.8E-07 36.2 2.6 43 122-180 1-43 (116) 125 protein:vir:97327 Length: 116 88.6 0.0014 8.8E-07 36.2 2.6 43 122-180 1-43 (116) 126 protein:vir:79179 Length: 155 88.1 0.0058 3.6E-06 32.8 5.6 82 91-180 1-96 (155) 127 protein:vir:107545 Length: 140 86.2 0.0039 2.4E-06 33.8 3.5 88 21-121 1-140 (140) 128 protein:vir:97982 Length: 140 86.2 0.0039 2.4E-06 33.8 3.5 88 21-121 1-140 (140) 129 protein:vir:100223 Length: 139 85.7 0.0037 2.3E-06 33.9 3.2 119 5-125 1-139 (139) 130 protein:vir:101594 Length: 173 85.3 0.011 6.8E-06 31.3 5.5 62 96-180 1-62 (173) 131 protein:vir:106570 Length: 182 85.0 0.0041 2.5E-06 33.7 3.0 69 90-180 1-69 (182) 132 protein:vir:95062 Length: 116 84.9 0.0035 2.2E-06 34.0 2.6 43 122-180 1-43 (116) 133 protein:vir:3750 Length: 227 # 84.3 0.019 1.2E-05 30.0 6.3 85 1-120 53-227 (227) 134 protein:vir:95894 Length: 137 84.1 0.0028 1.7E-06 34.6 1.7 64 91-180 1-64 (137) 135 protein:vir:9930 Length: 108 # 82.2 0.0045 2.8E-06 33.4 2.1 60 92-180 1-60 (108) 136 protein:vir:966 Length: 123 # 81.5 0.0042 2.6E-06 33.6 1.7 87 24-114 1-123 (123) 137 protein:vir:100312 Length: 152 81.4 0.03 1.9E-05 28.9 6.3 80 91-180 1-91 (152) 138 protein:vir:1164 Length: 156 # 81.3 0.033 2E-05 28.7 6.5 82 91-180 1-93 (156) 139 protein:vir:98860 Length: 230 81.3 0.031 1.9E-05 28.8 6.3 85 1-120 55-230 (230) 140 protein:vir:5978 Length: 144 # 80.9 0.015 9.2E-06 30.6 4.4 69 86-180 1-69 (144) 141 protein:vir:105330 Length: 137 79.9 0.0045 2.8E-06 33.4 1.2 64 91-180 1-64 (137) 142 protein:vir:3848 Length: 159 # 79.6 0.018 1.1E-05 30.2 4.4 107 1-122 2-159 (159) 143 protein:vir:105916 Length: 149 79.3 0.017 1.1E-05 30.2 4.2 76 52-180 1-76 (149) 144 protein:vir:97144 Length: 115 79.0 0.0053 3.3E-06 33.1 1.3 68 96-180 1-68 (115) 145 protein:vir:9312 Length: 115 # 79.0 0.0053 3.3E-06 33.1 1.3 68 96-180 1-68 (115) 146 protein:vir:96225 Length: 115 79.0 0.0053 3.3E-06 33.1 1.3 68 96-180 1-68 (115) 147 protein:vir:103917 Length: 115 79.0 0.0053 3.3E-06 33.1 1.3 68 96-180 1-68 (115) 148 protein:vir:96358 Length: 115 79.0 0.0053 3.3E-06 33.1 1.3 68 96-180 1-68 (115) 149 protein:vir:78858 Length: 115 79.0 0.0053 3.3E-06 33.1 1.3 68 96-180 1-68 (115) 150 protein:vir:104347 Length: 145 78.6 0.064 4E-05 27.1 7.1 106 1-126 5-145 (145) 151 protein:vir:107099 Length: 137 77.6 0.0071 4.4E-06 32.4 1.6 64 91-180 1-64 (137) 152 protein:vir:100652 Length: 134 76.9 0.01 6.4E-06 31.5 2.3 81 20-115 1-134 (134) 153 protein:vir:94108 Length: 149 72.7 0.035 2.2E-05 28.6 4.1 76 52-180 1-76 (149) 154 protein:vir:105467 Length: 144 72.3 0.055 3.4E-05 27.5 5.0 66 91-180 1-69 (144) 155 protein:vir:96486 Length: 112 72.1 0.025 1.6E-05 29.4 3.1 67 91-180 1-67 (112) 156 protein:vir:98636 Length: 138 71.2 0.048 3E-05 27.8 4.5 88 18-118 1-138 (138) 157 protein:vir:78077 Length: 141 70.4 0.017 1.1E-05 30.3 1.8 63 95-180 1-64 (141) 158 protein:vir:3617 Length: 112 # 69.7 0.049 3E-05 27.8 4.2 64 91-180 1-64 (112) 159 protein:vir:94994 Length: 131 69.3 0.072 4.5E-05 26.8 5.0 93 1-116 39-131 (131) 160 protein:vir:9879 Length: 127 # 67.5 0.018 1.1E-05 30.1 1.3 101 8-114 1-127 (127) 161 protein:vir:4906 Length: 114 # 66.1 0.039 2.4E-05 28.3 2.8 67 91-180 1-67 (114) 162 protein:vir:2740 Length: 114 # 66.1 0.039 2.4E-05 28.3 2.8 67 91-180 1-67 (114) 163 protein:vir:102963 Length: 163 63.8 0.039 2.4E-05 28.3 2.4 112 1-121 1-163 (163) 164 protein:vir:78380 Length: 131 60.6 0.15 9.5E-05 25.0 5.0 93 1-116 39-131 (131) 165 protein:vir:99528 Length: 92 # 54.9 0.036 2.2E-05 28.5 0.6 64 91-180 1-65 (92) 166 protein:vir:102441 Length: 137 54.7 0.049 3E-05 27.8 1.3 60 85-180 1-60 (137) 167 protein:vir:103280 Length: 142 52.3 0.39 0.00024 22.8 5.8 96 1-120 44-142 (142) 168 protein:vir:7412 Length: 168 # 49.9 0.31 0.00019 23.4 4.9 117 1-126 1-168 (168) 169 protein:vir:106506 Length: 137 49.2 0.14 8.4E-05 25.3 2.8 59 84-180 1-59 (137) 170 protein:vir:80116 Length: 127 46.1 0.091 5.7E-05 26.3 1.3 91 24-121 1-127 (127) 171 protein:vir:79638 Length: 146 45.5 0.68 0.00042 21.5 6.0 102 1-126 45-146 (146) 172 protein:vir:97190 Length: 148 42.1 0.9 0.00056 20.8 6.4 107 1-122 1-148 (148) 173 protein:vir:6246 Length: 143 # 41.7 0.12 7.6E-05 25.6 1.3 109 1-128 1-143 (143) 174 protein:vir:1332 Length: 143 # 39.2 0.14 8.6E-05 25.3 1.2 109 1-128 1-143 (143) 175 protein:vir:1028 Length: 168 # 39.2 0.25 0.00015 23.9 2.6 100 1-126 50-168 (168) 176 protein:vir:107703 Length: 147 33.6 1.1 0.00069 20.3 5.2 101 1-126 45-147 (147) 177 protein:vir:4460 Length: 170 # 27.2 0.56 0.00035 22.0 2.4 73 85-180 1-74 (170) 178 protein:vir:9647 Length: 132 # 27.1 0.7 0.00044 21.4 2.9 70 92-180 1-70 (132) 179 protein:vir:94944 Length: 121 25.7 0.52 0.00032 22.1 1.9 83 17-99 1-121 (121) 180 protein:vir:80425 Length: 134 22.5 1.8 0.0011 19.1 4.3 96 1-117 39-134 (134) 181 protein:vir:96774 Length: 152 22.0 1.9 0.0012 19.1 4.3 91 1-119 57-152 (152) No 1 >protein:vir:96105 Length: 193 # NCBI annotation: hypothetical protein ORF028 # Family: family:all:503 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294445;genbank:gi:149408342;genbank:GeneID:5237224 Probab=100.00 E-value=8.8e-54 Score=311.44 Aligned_cols=148 Identities=26% Similarity=0.418 Sum_probs=139.8 Q ss_pred ceeeehHHHHHHHHHHHHHhhCCEEEEEecccccCCC---CCCCCCHHHHHHHHhcCCC--------------------- Q lcl|NC_019918. 28 FSFKTDRRRLTSLIKRVEALDGTTVEVGFFPEDRYGS---ENGNLPVAQVAAYNEFGTT--------------------- 83 (180) Q Consensus 28 v~~k~~~~~l~~l~~~l~~l~~~~V~VGi~~~~~~~~---~~~G~~vA~iA~i~EfGt~--------------------- 83 (180) |++|.+.++|++++++|++|++++|+|||++++.|++ +++|+++|+||+|||||.. T Consensus 1 m~~~~~~~~~~~~~~~l~~l~~~~v~vGi~~~~~~~~~~~~~~G~~va~iAai~EfG~~I~~~~~~~~~~~~~~~g~~~~ 80 (193) T protein:vir:96 1 MSLRRDSELIAAHLQMLRAMRGRSVSAGWYSTARYPDKAGGSVGIQVARIARLNEYGGTIDHPGGTRYIRDAIVRGRFVG 80 (193) T ss_pred CeeccchHHHHHHHHHHHHhcCCeEEEEEcCCCCCCCcccccccchHHHHHhHHHcCCccccCccceeeeeccccccccc Confidence 8889999999999999999999999999999999876 3458999999999999943 Q ss_pred --------------------CCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019918. 84 --------------------RNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDD 143 (180) Q Consensus 84 --------------------~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~ 143 (180) +||||||||++++++ +++|.+.+++++.+++.|+.+++++|+++|..++++||.+|++ T Consensus 81 ~~~~k~~~~~~~~~~~~~~v~IPaRPFlr~t~~~~--~~~~~~~~~~~~~~~~~g~~~~~~~l~~~G~~~~~~ik~~I~~ 158 (193) T protein:vir:96 81 VRFVRNDFPGETEVTKPHRITIPARPFMRYAWNLF--SADRAAIQNRIAMRLARGQITPDQALAQIGLALEGYIARSIRT 158 (193) T ss_pred cceeccCcceeeEeecceeccCCCcchhhhhHHHH--HHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhc Confidence 799999999999886 4579999999999999999999999999999999999999999 Q ss_pred C-CCCCcHHHHHhcCCCCCchhHHHHHhhhhhhee Q lcl|NC_019918. 144 Y-PGSNSPAWAAYKGFNDPLFHTGKMLESVKFQIH 177 (180) Q Consensus 144 ~-~pPnsp~Ti~~KG~~~PLIDTG~L~~SIty~V~ 177 (180) + +|||||+||++||||+||||||+|++||+|+|+ T Consensus 159 ~~~ppna~~Ti~~KG~~~PLidTG~l~~SIty~Vv 193 (193) T protein:vir:96 159 GPWVANSASTVRRKGFNRPLVDTAHMLQSISSRVT 193 (193) T ss_pred CCCCCCcHHHHHHhCCCCchhHHHHHHhhhcceeC Confidence 8 589999999999999999999999999999999 No 2 >protein:vir:99546 Length: 200 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039796;genbank:gi:126011046;genbank:GeneID:4818241 Probab=100.00 E-value=1.3e-53 Score=310.47 Aligned_cols=155 Identities=26% Similarity=0.392 Sum_probs=138.8 Q ss_pred HhhhheecccceeeehHHHHHHHHHHHHHhhCCEEEEEecccccCCC---CCCCCCHHHHHHHHhcCC------------ Q lcl|NC_019918. 18 LALGVIILASFSFKTDRRRLTSLIKRVEALDGTTVEVGFFPEDRYGS---ENGNLPVAQVAAYNEFGT------------ 82 (180) Q Consensus 18 ~~~~~~M~~~v~~k~~~~~l~~l~~~l~~l~~~~V~VGi~~~~~~~~---~~~G~~vA~iA~i~EfGt------------ 82 (180) |--|.+|...+ + ..+++++++++|++|++++|+|||+++++|++ ++||+++|+||+|||||+ T Consensus 1 ~~~~~~~~~k~--~-~~~~~~~~~~~l~~l~~~~v~vGi~~~~~y~~~~~~~dG~~va~IA~~~EfG~~i~~p~~~~~~~ 77 (200) T protein:vir:99 1 MKKGFSKSNSV--A-APLKHFQMLKQFDALKGKTVQAGWFETDRYPAKEGETIGPLVAKIARQLEFGGVINHPGGTKYIK 77 (200) T ss_pred CCcCcceeeee--e-cchHHHHHHHHHHHhhCCeEEEEEcCCCCcCCcccccccchHHHHHhHHHcCCeeccCCCccccc Confidence 33344444332 2 33689999999999999999999999999974 568999999999999994 Q ss_pred -----------------------------CCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHH Q lcl|NC_019918. 83 -----------------------------TRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMV 133 (180) Q Consensus 83 -----------------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~ 133 (180) .+||||||||++++++ +++|.+++++.+.+++.|+.+++++|+.+|..+ T Consensus 78 ~~~~~g~~~g~rfv~k~~~~~~~~~~~~~v~IP~RPFlr~t~~~~--~~~~~~~~~~~~~~~l~g~~~~~~~L~~~G~~~ 155 (200) T protein:vir:99 78 DAIVDGRYVGTRFVHKSFQGEHEVTKAHQIVIPARPFMRLAWATF--NKDKVKIQAQIARQLLDGTINPEQALAQIGLAL 155 (200) T ss_pred cccccccccccccccccccceeeeeccccccCCCcchhhHHHHHH--HHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHH Confidence 3799999999999886 557999999999999999999999999999999 Q ss_pred HHHHHHHHhcC-CCCCcHHHHHhcCCCCCchhHHHHHhhhhhhee Q lcl|NC_019918. 134 AEQMQVNIDDY-PGSNSPAWAAYKGFNDPLFHTGKMLESVKFQIH 177 (180) Q Consensus 134 ~~~Iq~~I~~~-~pPnsp~Ti~~KG~~~PLIDTG~L~~SIty~V~ 177 (180) +++||.+|+++ +|||||+||++||||+||||||+|++||+|+|+ T Consensus 156 ~~~ik~~I~~~~~ppna~sTi~~Kg~~~PLidTG~l~~SIty~Ve 200 (200) T protein:vir:99 156 EGCIVRSIKSGPWAANSPATIRAKGFDKPLIDTAHMWQTVSSKVS 200 (200) T ss_pred HHHHHHHHhcCCCCCChHHHHHHhCCCCchHHHHHHHhHhccccC Confidence 99999999998 579999999999999999999999999999999 No 3 >protein:vir:107757 Length: 189 # NCBI annotation: gp20 # Family: family:all:503 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024868;genbank:gi:48697510;genbank:GeneID:2948378 Probab=100.00 E-value=2e-53 Score=309.46 Aligned_cols=150 Identities=22% Similarity=0.402 Sum_probs=137.5 Q ss_pred hhheecccceeeehHHHHHHHHHHHHHhhCCEEEEEecccccCCCCCCCCCHHHHHHHHhcCCC--CCCCCcchhhHHHH Q lcl|NC_019918. 20 LGVIILASFSFKTDRRRLTSLIKRVEALDGTTVEVGFFPEDRYGSENGNLPVAQVAAYNEFGTT--RNPTRPFMAPTFEE 97 (180) Q Consensus 20 ~~~~M~~~v~~k~~~~~l~~l~~~l~~l~~~~V~VGi~~~~~~~~~~~G~~vA~iA~i~EfGt~--~IP~RpFlr~~~~~ 97 (180) |++.| +...+.+++|.+.|++|++++|+||||++++|+ ||+++|+||+|||||++ +||||||||+++++ T Consensus 1 M~~~i------~~~~~~~~~L~~~lk~l~~k~V~VGi~~~~~y~---dG~~vA~Ia~~~E~G~p~~~IP~RPFlr~t~~~ 71 (189) T protein:vir:10 1 MGRVI------RKQGPARVKLNAFIKGMNDYSVRIGWFSTAKYP---DGTPTAYVASIHEFGAPSRGIPARSFIRPTIAA 71 (189) T ss_pred Cccee------ccCcHHHHHHHHHHHHhhCCeEEEEecCCCCCC---CcccHHHHHHHHHhcCcCCCCCCchhhhHHHHH Confidence 44444 445677788889999999999999999998875 89999999999999996 79999999999988 Q ss_pred HHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcC-CCCCcHHHHHhcCC------------------ Q lcl|NC_019918. 98 FTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDDY-PGSNSPAWAAYKGF------------------ 158 (180) Q Consensus 98 ~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~~-~pPnsp~Ti~~KG~------------------ 158 (180) + +++|.+++++++.+++.|+.+++++|+.+|+.++++||.+|.++ +|||||+||++||+ T Consensus 72 ~--~~~~~~~l~~~~~~vl~G~~~~~~~L~~~G~~a~~~Ik~~I~~~~~ppna~sTi~~Kg~~~~~~~~~~~~~~~~~~~ 149 (189) T protein:vir:10 72 Q--QAAWSQQMRFYAKQIVVGQMNVEQALEGLAIVARGDVDATLARLKDPPLSPLTIYIRKFIKDGGVIHGYKDIMRLRS 149 (189) T ss_pred H--HHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCCcHHHHHHhcccCcccchhhhhhhhhhhh Confidence 6 56799999999999999999999999999999999999999998 57999999999994 Q ss_pred -----------------CCCchhHHHHHhhhhhheeccC Q lcl|NC_019918. 159 -----------------NDPLFHTGKMLESVKFQIHRRQ 180 (180) Q Consensus 159 -----------------~~PLIDTG~L~~SIty~V~~k~ 180 (180) ++||||||+|++||+|+|++|+ T Consensus 150 ~~~~~~~~~~~~~~~~s~kPLidTG~l~~SIty~V~~k~ 188 (189) T protein:vir:10 150 EMQQEQAKGTLNLSGVSTDPLDFTGYMRATLSYTVTKEK 188 (189) T ss_pred hhhhhhhhccccccccCCCchhhHHHHHhhcceeeeecC Confidence 7999999999999999999999 No 4 >protein:vir:5257 Length: 148 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852762;genbank:gi:31544037;uniprot:Q7Y5T8;genbank:GeneID:2753554 Probab=100.00 E-value=3.1e-52 Score=302.97 Aligned_cols=146 Identities=28% Similarity=0.361 Sum_probs=129.2 Q ss_pred hhheecccceeeehHHHHHHHHHHHHHhhCCEEEEEecccccC-CCCCCCCCHHHHHHHHhcCCCCCCCCcchhhHHHHH Q lcl|NC_019918. 20 LGVIILASFSFKTDRRRLTSLIKRVEALDGTTVEVGFFPEDRY-GSENGNLPVAQVAAYNEFGTTRNPTRPFMAPTFEEF 98 (180) Q Consensus 20 ~~~~M~~~v~~k~~~~~l~~l~~~l~~l~~~~V~VGi~~~~~~-~~~~~G~~vA~iA~i~EfGt~~IP~RpFlr~~~~~~ 98 (180) |+|. +|.+.+++++++++|++|++++|+||||++... ..++||+++|+||||||||+++||+|||||++++++ T Consensus 1 M~~~------~k~~~~~~~~l~~~l~~l~~~~v~VGi~~~~~~~~~~~~g~~vA~ia~~~E~G~~~IP~Rpflr~t~~~~ 74 (148) T protein:vir:52 1 MAVT------VTANFSAAKQLIEQMKSLKEKAVYVGFPAEFDEKVKGSENFNLASLAAVLEFGNEHIPARPFLRQTLEEN 74 (148) T ss_pred Cccc------cccccHHHHHHHHHHHHhhCCeEEEEeecCcCCCCCCCCCCCHHHHHHHHhcCCCCCCCcchhHHHHHHH Confidence 4433 456788899999999999999999999965332 246799999999999999999999999999999986 Q ss_pred HHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcCC-CCCcHHHHHhcCCCCCchhHHHHHhhhhhhee Q lcl|NC_019918. 99 TSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDDYP-GSNSPAWAAYKGFNDPLFHTGKMLESVKFQIH 177 (180) Q Consensus 99 ~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~~~-pPnsp~Ti~~KG~~~PLIDTG~L~~SIty~V~ 177 (180) + ++|.+++ .+++.|+.+++++|+.+|+.++++||.+|.++. |||||+||++||||+||||||+|++||+|+|+ T Consensus 75 ~--~~~~~~~----~~~~~~~~~~~~~L~~~G~~~~~~ik~~I~~~~~ppna~sTi~~Kg~~~PLidTG~l~~SIty~V~ 148 (148) T protein:vir:52 75 Q--EKYTALF----IQWFDQGVPAAQIYERLSVMAQGDVQMNIVKGEWVANAKSTIRRKKSSKPLIDTGKMRQSVRGIVK 148 (148) T ss_pred H--HHHHHHH----HHHHHcCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCCcHHHHHhcCCCCchhHHHHHHHHhhhhcC Confidence 4 4676655 466678899999999999999999999999984 79999999999999999999999999999999 No 5 >protein:vir:78607 Length: 155 # NCBI annotation: BcepNY3gp06 # Family: family:all:503 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294843;genbank:gi:149882906;genbank:GeneID:5291078 Probab=100.00 E-value=7.4e-50 Score=289.95 Aligned_cols=140 Identities=28% Similarity=0.530 Sum_probs=125.3 Q ss_pred eeehHHHHHHHHHHHHHhhCCEEEEEecccccCCC---------------CCCCCCHHHHHHHHhcCCCCCCCCcchhhH Q lcl|NC_019918. 30 FKTDRRRLTSLIKRVEALDGTTVEVGFFPEDRYGS---------------ENGNLPVAQVAAYNEFGTTRNPTRPFMAPT 94 (180) Q Consensus 30 ~k~~~~~l~~l~~~l~~l~~~~V~VGi~~~~~~~~---------------~~~G~~vA~iA~i~EfGt~~IP~RpFlr~~ 94 (180) .|.++++|++++++| ++++|+|||+++++|++ +.+|+|+|+||+|||||+.+||||||||++ T Consensus 1 m~v~~k~L~~~~~~l---~~~~v~VGi~~~a~y~d~~~~~~~~~~~~~~~~~~g~~va~ia~~~E~G~~~IP~RPFlr~t 77 (155) T protein:vir:78 1 MSVTRRGLTLPKDRY---RSMSVKAGVLAGATYPDESGKKLADGTILTKDPRAGLPVAMIAMALNYGTSKLPARPFMEKT 77 (155) T ss_pred CcchHHHHHHHHHHH---hCCeeEEeecCCCCCCcccchhhhhhhhcccccccCCcHHHHHHhhhcCCCCCCCcchhhHH Confidence 356688888887665 67899999999999986 456999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCcHHHHHhcCCCCCchhHHHHHhhhhh Q lcl|NC_019918. 95 FEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLESVKF 174 (180) Q Consensus 95 ~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~~~pPnsp~Ti~~KG~~~PLIDTG~L~~SIty 174 (180) +++++. +|.+.++ +++.++.+++++|+++|+.++++||.+|+++.|||||+||++||||+||||||+|++||+| T Consensus 78 ~~~~~~--~~~~~l~----~~~~~~~~~~~~L~~~G~~~~~~Ik~~I~~~~~pna~~Ti~~Kg~~kPLidTG~l~~SIty 151 (155) T protein:vir:78 78 ITDRSA--EWIKGLT----VMMTMGYDAEVAMGQIGQAMKDDIKTTISEWPADNSADWAGKKGFNHGLIWTSHLLNSVEQ 151 (155) T ss_pred HHHHHH--HHHHHHH----HHHHcCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCcHHHHHhcCCCCchhHHHHHHHhhhh Confidence 998754 6776654 4556778999999999999999999999999999999999999999999999999999999 Q ss_pred heec Q lcl|NC_019918. 175 QIHR 178 (180) Q Consensus 175 ~V~~ 178 (180) +|++ T Consensus 152 ~V~~ 155 (155) T protein:vir:78 152 EIVK 155 (155) T ss_pred hccC Confidence 9999 No 6 >protein:vir:106728 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944315;genbank:gi:38638614;genbank:GeneID:2657357 Probab=100.00 E-value=8.1e-50 Score=289.73 Aligned_cols=140 Identities=28% Similarity=0.530 Sum_probs=125.3 Q ss_pred eeehHHHHHHHHHHHHHhhCCEEEEEecccccCCC---------------CCCCCCHHHHHHHHhcCCCCCCCCcchhhH Q lcl|NC_019918. 30 FKTDRRRLTSLIKRVEALDGTTVEVGFFPEDRYGS---------------ENGNLPVAQVAAYNEFGTTRNPTRPFMAPT 94 (180) Q Consensus 30 ~k~~~~~l~~l~~~l~~l~~~~V~VGi~~~~~~~~---------------~~~G~~vA~iA~i~EfGt~~IP~RpFlr~~ 94 (180) .|.++++|++++++| ++++|+|||+++++|++ +.+|+|+|+||+|||||+.+||+|||||++ T Consensus 1 m~v~~k~L~~~~~~l---~~~~v~VGi~~~a~y~d~~~~~~~~~~~~~~~~~~g~~va~ia~~~E~G~~~IP~RPFlr~t 77 (155) T protein:vir:10 1 MSVTRRGLTLPKDRY---RSMSVKAGVLAGATYPDESGKKLADGTILTKDPRAGLPVAMIAMALNYGTSKLPARPFMEKT 77 (155) T ss_pred CcchHHHHHHHHHHH---hCCeeEEeecCCCCCccccchhhhhhhhcccccccCCcHHHHHHHHhcCCCCCCCcchhHHH Confidence 356688888887665 67899999999999986 456999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCcHHHHHhcCCCCCchhHHHHHhhhhh Q lcl|NC_019918. 95 FEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLESVKF 174 (180) Q Consensus 95 ~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~~~pPnsp~Ti~~KG~~~PLIDTG~L~~SIty 174 (180) +++++. +|.+.++ +++.++.+++++|+++|+.++++||.+|+++.|||||+||++||||+||||||+|++||+| T Consensus 78 ~~~~~~--~~~~~l~----~~~~~~~~~~~~L~~lG~~~~~~Ik~~I~~~~~pna~~Ti~~KG~~kPLidTG~l~~SIty 151 (155) T protein:vir:10 78 IADRSA--EWIKGLT----VMMTMGYDAEVAMGQIGQAMKDDIKTTISEWPADNSADWAGKKGFNHGLIWTSHLLNSVEQ 151 (155) T ss_pred HHHHHH--HHHHHHH----HHHHcCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCcHHHHHhcCCCCchhHHHHHHHhhhh Confidence 998654 6776654 4556788999999999999999999999999999999999999999999999999999999 Q ss_pred heec Q lcl|NC_019918. 175 QIHR 178 (180) Q Consensus 175 ~V~~ 178 (180) +|++ T Consensus 152 ~Vv~ 155 (155) T protein:vir:10 152 EIVK 155 (155) T ss_pred hccC Confidence 9999 No 7 >protein:vir:94069 Length: 168 # NCBI annotation: putative RNA polymerase # Family: family:all:503 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453622;genbank:gi:84662658;genbank:GeneID:5142579 Probab=100.00 E-value=2.9e-49 Score=286.68 Aligned_cols=146 Identities=28% Similarity=0.489 Sum_probs=129.2 Q ss_pred ecccceeeehHHHHHHHHHHHHHhhCCEEEEEecccccCCC--------------CCCCCCHHHHHHHHhcCCCCCCCCc Q lcl|NC_019918. 24 ILASFSFKTDRRRLTSLIKRVEALDGTTVEVGFFPEDRYGS--------------ENGNLPVAQVAAYNEFGTTRNPTRP 89 (180) Q Consensus 24 M~~~v~~k~~~~~l~~l~~~l~~l~~~~V~VGi~~~~~~~~--------------~~~G~~vA~iA~i~EfGt~~IP~Rp 89 (180) |+ ....++++...+.+..|++..|+|||+++++|++ +++|+++|+||+|||||+.+||+|| T Consensus 1 ~~-----~~~~~g~~~~~~~~~~l~~~~v~vG~l~~a~yp~G~~~~~~~~~~~~~~~~g~~va~Ia~~~E~G~~~IP~RP 75 (168) T protein:vir:94 1 MT-----TIARKGVKMPPHLEAQFQSGEVKAGVLSGSTYPQMTYTDQRTGKQIEDARGGMPVAVIAQALEYGHGQNHPRP 75 (168) T ss_pred Cc-----cccchhhhhhHHHHHhhhccceeeeccccCcccccccchhhcccccccccccccHHHHHHHHhcCCCCCCCch Confidence 33 1245678888888999999999999999999874 3678999999999999999999999 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCcHHHHHhcCCCCCchhHHHHH Q lcl|NC_019918. 90 FMAPTFEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKML 169 (180) Q Consensus 90 Flr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~~~pPnsp~Ti~~KG~~~PLIDTG~L~ 169 (180) |||+++++++ .+|.+. +.+++.|+.+++++|+.+|+.++++||.+|.+++|||||+||++||||+||||||+|+ T Consensus 76 Flr~t~~~~~--~~~~~~----~~~~~~~~~~~~~~L~~lG~~~~~~Ik~~I~~~~ppna~sTi~~KG~~~PLiDTG~l~ 149 (168) T protein:vir:94 76 FMQQTYAAQY--RAWSRD----LTLTLKAGAAADTALRTVGQRMAEDIQDTIRNWPADNSPEWAAIKGFNAGLRQTGVLL 149 (168) T ss_pred hhHHHHHHHH--HHHHHH----HHHHHhcCCCHHHHHHHHHHHHHHHHHHHhhcCCCCccHHHHHhcCCCCchhHHHHHH Confidence 9999999864 456654 4567788999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhheeccC Q lcl|NC_019918. 170 ESVKFQIHRRQ 180 (180) Q Consensus 170 ~SIty~V~~k~ 180 (180) +||+|+|++.+ T Consensus 150 ~SIty~Vv~d~ 160 (168) T protein:vir:94 150 NAIDSAVIIDG 160 (168) T ss_pred hhcceeeeecC Confidence 99999888555 No 8 >protein:vir:101563 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958111;genbank:gi:41057657;genbank:GeneID:2716820 Probab=100.00 E-value=7.2e-49 Score=284.52 Aligned_cols=140 Identities=27% Similarity=0.541 Sum_probs=124.3 Q ss_pred eeehHHHHHHHHHHHHHhhCCEEEEEecccccCCCCC---------------CCCCHHHHHHHHhcCCCCCCCCcchhhH Q lcl|NC_019918. 30 FKTDRRRLTSLIKRVEALDGTTVEVGFFPEDRYGSEN---------------GNLPVAQVAAYNEFGTTRNPTRPFMAPT 94 (180) Q Consensus 30 ~k~~~~~l~~l~~~l~~l~~~~V~VGi~~~~~~~~~~---------------~G~~vA~iA~i~EfGt~~IP~RpFlr~~ 94 (180) .+.++++|++++++|+ +++|+||||++++|++++ +|+++|+||+|||||+.+||||||||++ T Consensus 1 m~v~r~~L~~~~~~l~---~~~V~VGi~~~a~y~d~~g~~~~~g~~~~~~~~~G~pva~ia~~~e~G~~~IP~RPFlr~t 77 (155) T protein:vir:10 1 MSVTRRGLTLPKDRYK---SMSVKAGVLAGATYPDESGKKLADGTILKKDPRAGLPVAMIAMALNYGTSKLPARPFMEKT 77 (155) T ss_pred CcchHHHHHHHHHHhh---CCeeEEeecCCCCCCccccchhhhhhhhccccccCcchhhhhhhhhcCCCCCCCcchhHHH Confidence 2456788888887664 578999999999998644 3999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCcHHHHHhcCCCCCchhHHHHHhhhhh Q lcl|NC_019918. 95 FEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLESVKF 174 (180) Q Consensus 95 ~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~~~pPnsp~Ti~~KG~~~PLIDTG~L~~SIty 174 (180) +++++. +|.+.++ +++.++.+++++|+.+|+.++++||.+|+++.+||+|+||++||||+||||||+|++||+| T Consensus 78 ~~~~~~--~~~~~l~----~~~~~~~~~~~~L~~~G~~~~~~Ik~~I~~~~~p~~~~Ti~~KG~~~PLidTG~l~~Sity 151 (155) T protein:vir:10 78 IADRSA--EWIKGLT----VMMTMGYDAEVAMGQIGQAMKDDIKTTISEWPADNNADWAGKKGFNHGLIWTSHLLNSIEQ 151 (155) T ss_pred HHHHHH--HHHHHHH----HHHHcCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCChHHHHhcCCCCchHHHHHHHHhhhh Confidence 998754 5766554 5566788999999999999999999999999889999999999999999999999999999 Q ss_pred heec Q lcl|NC_019918. 175 QIHR 178 (180) Q Consensus 175 ~V~~ 178 (180) +|++ T Consensus 152 ~Vv~ 155 (155) T protein:vir:10 152 EIVK 155 (155) T ss_pred hccC Confidence 9999 No 9 >protein:vir:80037 Length: 199 # NCBI annotation: gp11 # Family: family:all:503 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468715;genbank:gi:157325295;genbank:GeneID:5601728 Probab=100.00 E-value=7.3e-49 Score=284.48 Aligned_cols=143 Identities=26% Similarity=0.476 Sum_probs=130.4 Q ss_pred ceeeehHHHHHHHHHHHHHhhCCEEEEEecccccCCCCCCCCCHHHHHHHHhcC-------------------------- Q lcl|NC_019918. 28 FSFKTDRRRLTSLIKRVEALDGTTVEVGFFPEDRYGSENGNLPVAQVAAYNEFG-------------------------- 81 (180) Q Consensus 28 v~~k~~~~~l~~l~~~l~~l~~~~V~VGi~~~~~~~~~~~G~~vA~iA~i~EfG-------------------------- 81 (180) |+++.+.+.+++++++|++|++++|+|||+.+ ||.++++||.+|||| T Consensus 1 m~vt~~~~~~~~~~~~l~~L~~k~v~vGi~~~-------d~~~~~~Ia~~~E~Ga~I~~~~~~l~Ip~~~a~~~k~~~~~ 73 (199) T protein:vir:80 1 MKVTTDKSTMNKAIRELDQLDRYSLQIGLFGE-------DDSFIQMIAGVHEFGLTIRPKGKYLTIPTPEAGDRRARDIP 73 (199) T ss_pred CcccccHHHHHHHHHHHHHhcCCEEEEEEecC-------CCcchhheeehhhcCCeeecCCceeeecchhhhcccccccC Confidence 66778889999999999999999999999943 566677777777777 Q ss_pred --------------------------C--CCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHH Q lcl|NC_019918. 82 --------------------------T--TRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMV 133 (180) Q Consensus 82 --------------------------t--~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~ 133 (180) + .+||+|||||++++++ +++|.+++++++.+++.|+.+++++|+++|+.+ T Consensus 74 ~~~~p~g~~~~~~~~~~~~~~~~e~g~~~~~IP~RPFlr~t~~~~--~~~~~~~~~~~~~~vl~g~~~a~~~L~~~G~~~ 151 (199) T protein:vir:80 74 GLFKPKGKNILAVAGPDGKLTVMFYLKTEVNIPERSFLRSTFDEK--SNKWGELFEGWIDDVIHGKLSAEQVYNRLGAKI 151 (199) T ss_pred cccccCCcceeeeeccccceeeeeeccccccCCCCchhHHHHHHH--HHHHHHHHHHHHHHHHhCCCcHHHHHHHHHHHH Confidence 2 2799999999999886 557999999999999999999999999999999 Q ss_pred HHHHHHHHhcC-CCCCcHHHHH-hcCCCCCchhHHHHHhhhhhheecc Q lcl|NC_019918. 134 AEQMQVNIDDY-PGSNSPAWAA-YKGFNDPLFHTGKMLESVKFQIHRR 179 (180) Q Consensus 134 ~~~Iq~~I~~~-~pPnsp~Ti~-~KG~~~PLIDTG~L~~SIty~V~~k 179 (180) +++||.+|.++ +|||||+||+ |||||+||||||+|++||+|+|++- T Consensus 152 ~~~Ik~~I~~~~~ppna~~Tia~rKg~~kPLidTG~l~~SIty~V~~~ 199 (199) T protein:vir:80 152 VDDIQMKIVEIQTPAKSAATLARNPRKNNPLIVTGKMKNSVTWKVMKS 199 (199) T ss_pred HHHHHHHHhccCCCCCCHHHHHHhcCCCCchHHHHHHHhhcceeeeeC Confidence 99999999998 5899999997 8999999999999999999999999 No 10 >protein:vir:77650 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:YP_022741;genbank:gi:47835022;genbank:GeneID:2821447 Probab=100.00 E-value=2.5e-48 Score=281.60 Aligned_cols=140 Identities=26% Similarity=0.531 Sum_probs=122.9 Q ss_pred eeehHHHHHHHHHHHHHhhCCEEEEEecccccCCC---------------CCCCCCHHHHHHHHhcCCCCCCCCcchhhH Q lcl|NC_019918. 30 FKTDRRRLTSLIKRVEALDGTTVEVGFFPEDRYGS---------------ENGNLPVAQVAAYNEFGTTRNPTRPFMAPT 94 (180) Q Consensus 30 ~k~~~~~l~~l~~~l~~l~~~~V~VGi~~~~~~~~---------------~~~G~~vA~iA~i~EfGt~~IP~RpFlr~~ 94 (180) .+.++.+|+.++++ |++++|+|||+++++|++ +++|+++|+||+|||||+.+||||||||++ T Consensus 1 m~~~r~~l~~~~~~---l~~~~v~VGi~~~a~y~d~~~~~~~~~~~~~~~~~~G~pva~ia~~~e~G~~~IP~RPFlr~t 77 (155) T protein:vir:77 1 MSVTRRGLTLPKDR---YRSMSVKAGVLAGATYPDESGKKLADGSILKKDPRAGLPVAMIAMALNYGTSKLPARPFMEKT 77 (155) T ss_pred CcchHHHHHHHHHH---HhcCceEEeecCCCCCccccchhhhhhhhccccccccccHhhhhhhhhcCCCCCCCCchhhHH Confidence 23456667777665 467889999999999986 456999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCcHHHHHhcCCCCCchhHHHHHhhhhh Q lcl|NC_019918. 95 FEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLESVKF 174 (180) Q Consensus 95 ~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~~~pPnsp~Ti~~KG~~~PLIDTG~L~~SIty 174 (180) +++++. +|.+.+++ ++.++.+++++|+.+|+.++++||.+|+++.+||+|+||++||||+||||||+|++||+| T Consensus 78 ~~~~~~--~~~~~l~~----~~~~~~~~~~~L~~lG~~~~~~Iq~~I~~~~~p~~~~Ti~~KG~d~PLidTG~l~~SIty 151 (155) T protein:vir:77 78 IADRSA--EWIKGLTV----MMTMGYDAEVAMGQIGQAMKDDIKTTISEWPADNNADWAGKKGFNHGLIWTSHLLNSIEQ 151 (155) T ss_pred HHHHHH--HHHHHHHH----HHHccCcHHHHHHHHHHHHHHHHHHHHhcCCCCCChHHHHhcCCCCchhHHHHHHHhhhh Confidence 998754 67766654 455678999999999999999999999999888999999999999999999999999999 Q ss_pred heec Q lcl|NC_019918. 175 QIHR 178 (180) Q Consensus 175 ~V~~ 178 (180) +|++ T Consensus 152 ~Vv~ 155 (155) T protein:vir:77 152 EIVK 155 (155) T ss_pred hccC Confidence 9999 No 11 >protein:vir:95260 Length: 160 # NCBI annotation: Phage conserved protein # Family: family:all:31735 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944893;genbank:gi:38707833;genbank:GeneID:2744046 Probab=100.00 E-value=1.1e-45 Score=267.12 Aligned_cols=148 Identities=16% Similarity=0.265 Sum_probs=122.4 Q ss_pred ecccceeeehHHHHHHHHHHHHHhhCCEEEEEecccccCCCCCCCCCHHHHHHHHhcCCCCCCCCcchhhHHHHHH---H Q lcl|NC_019918. 24 ILASFSFKTDRRRLTSLIKRVEALDGTTVEVGFFPEDRYGSENGNLPVAQVAAYNEFGTTRNPTRPFMAPTFEEFT---S 100 (180) Q Consensus 24 M~~~v~~k~~~~~l~~l~~~l~~l~~~~V~VGi~~~~~~~~~~~G~~vA~iA~i~EfGt~~IP~RpFlr~~~~~~~---~ 100 (180) |+. |.+.+++++|.+++++|+++.|+||||+++. .++||+|+++||+|||||+.+||+|||||++|+... + T Consensus 1 ~~~----~~~~~G~~~L~~~~k~l~~~~V~VGi~~d~g--~~~dG~sv~~vA~~~EfG~~~iPaRPf~R~tfe~~~~~~~ 74 (160) T protein:vir:95 1 MVK----RVIHPARAKLVGAMKNLQTANAQVGYFQEQG--QHSSGFSYPALMYLQEVIGVPSASGKVYRRLFEITMMLNK 74 (160) T ss_pred Cce----eechHhHHHHHHHHHHHhCCeeEEeeccccc--cCCCCccHHHHHhhhhcCcccCCCcchhHHHHHHHHHHHH Confidence 332 5678899999999999999999999999883 346899999999999999999999999999997422 2 Q ss_pred HHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcC-----CCCCcHHHHHhcCCCCCchhHHHHHhhhhhh Q lcl|NC_019918. 101 QFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDDY-----PGSNSPAWAAYKGFNDPLFHTGKMLESVKFQ 175 (180) Q Consensus 101 ~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~~-----~pPnsp~Ti~~KG~~~PLIDTG~L~~SIty~ 175 (180) +..+.+..+....++..|+.++ .+.+|+.++++||.+|.+. ||||||+||++||||+||||||+|++||+|+ T Consensus 75 ~~~~~~~~~~i~~~~~~g~~~~---~~~LG~~~~~~ik~~I~~~~~p~~w~pNap~Ti~~Kgs~~PLiDTg~l~~Si~y~ 151 (160) T protein:vir:95 75 QTLLEQTKKNLYKQLSSLNTDP---SNTLEAFAKNAQKAIKRGFGNSAILPPNAPSTVKKKGFNAPLVETGDLRDNLAYK 151 (160) T ss_pred HHHHHHHHHHHHHHHhhcchhH---HHHHHHHHHHHHHHHHhhcCCccCCCCCcHHHHHhcCCCCcchhhHHHhhhhhhe Confidence 2223344444555666665443 4559999999999999874 4599999999999999999999999999999 Q ss_pred eeccC Q lcl|NC_019918. 176 IHRRQ 180 (180) Q Consensus 176 V~~k~ 180 (180) |.++. T Consensus 152 v~~~~ 156 (160) T protein:vir:95 152 ISTKK 156 (160) T ss_pred eeccc Confidence 99999 No 12 >protein:vir:3163 Length: 145 # NCBI annotation: unknown # Family: family:all:28417 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665934;genbank:gi:22091120;genbank:GeneID:951270 Probab=98.56 E-value=3.2e-10 Score=72.67 Aligned_cols=77 Identities=13% Similarity=0.163 Sum_probs=55.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhc-----C--CCCCcHHHHHhcCCCCCchhHH Q lcl|NC_019918. 94 TFEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDD-----Y--PGSNSPAWAAYKGFNDPLFHTG 166 (180) Q Consensus 94 ~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~-----~--~pPnsp~Ti~~KG~~~PLIDTG 166 (180) -+. ....+.+.+++... +....|..+|......++..+.+ | |+|+||+|+++|+.++||+||| T Consensus 1 ~i~---~~~~i~~~l~~l~~-------~~~~~l~~i~~~~~~~~~~rf~~~~~p~G~~W~pLs~st~a~k~~~~~L~~tG 70 (145) T protein:vir:31 1 MVE---DENNIPEAREAIQD-------GLTDGLERLHTITLRELITNMSDGQDALGNPWEPLKESTIRAKGSDTPLIDNS 70 (145) T ss_pred Ccc---cHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccChHHHHHhcCCCCCccCH Confidence 221 12234443433322 23456888999999999998864 2 3499999999999999999999 Q ss_pred HHHhhhhhheeccC Q lcl|NC_019918. 167 KMLESVKFQIHRRQ 180 (180) Q Consensus 167 ~L~~SIty~V~~k~ 180 (180) .|++||+|.+.... T Consensus 71 ~L~~Si~~~~~~~~ 84 (145) T protein:vir:31 71 RLLTDINAASMMDR 84 (145) T ss_pred HHHHHHHHHhhhcc Confidence 99999999985322 No 13 >protein:vir:4347 Length: 164 # NCBI annotation: Orf14 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061510;genbank:gi:9635606;genbank:GeneID:1262873 Probab=98.44 E-value=1.8e-10 Score=73.99 Aligned_cols=103 Identities=17% Similarity=0.279 Sum_probs=50.9 Q ss_pred ecccceeeehHHHHHHHHHHHHHhhC------------------------------------------------------ Q lcl|NC_019918. 24 ILASFSFKTDRRRLTSLIKRVEALDG------------------------------------------------------ 49 (180) Q Consensus 24 M~~~v~~k~~~~~l~~l~~~l~~l~~------------------------------------------------------ 49 (180) |..+|+++.. +|++|.+.|++|.. T Consensus 1 Ma~~~~~~i~--Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~~~~~~~~~ 78 (164) T protein:vir:43 1 MADTVEFSIT--GLDSLLGKLDSVTDDVKRRGGRAALRKAAMIVVQAAKQGAEKVDDPGTGRSISDNIALRWNGRLFKRT 78 (164) T ss_pred CCcceEEeee--cHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccCCCccchhhhhhhhhcccCccccc Confidence 6666655532 23333333333311 Q ss_pred --CEEEEEecccccCC-----CCCCCCCHHHHHHHHhcCCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCH Q lcl|NC_019918. 50 --TTVEVGFFPEDRYG-----SENGNLPVAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRDGRQV 122 (180) Q Consensus 50 --~~V~VGi~~~~~~~-----~~~~G~~vA~iA~i~EfGt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~ 122 (180) ....||+..+.... ....+-+.+.++.++||||.++||||||||++++.. +.+.+.+...+.+.| T Consensus 79 ~~~~~~vg~~~~~~~~~~~~~~~~~~~~~~~y~~f~EfGT~km~a~PFlrPA~~~~k--~~~~~~~~~~l~~~i------ 150 (164) T protein:vir:43 79 GDLGFRIGVLHGAVLPKKGERSDKTANAPTPHWRLLEFGTEDMRAQPFMRSALADNI--AEVTSTFVSEYEKGI------ 150 (164) T ss_pred cceeEEecccccccccccccccccCCCCCcceEEEeecCCCCCCCCcchhhhHHHhH--HHHHHHHHHHHHHHH------ Confidence 11223332221100 000111235678899999999999999999998753 344444444444322 Q ss_pred HHHHHHHHHHHHHH Q lcl|NC_019918. 123 NTLLKKLGRMVAEQ 136 (180) Q Consensus 123 ~~~L~~iG~~~~~~ 136 (180) +.+|.+.+..++.- T Consensus 151 ~ka~~k~~~~~~~~ 164 (164) T protein:vir:43 151 DRAIKRAAKKAAQG 164 (164) T ss_pred HHHHHHHHhhhccC Confidence 34444444433333 No 14 >protein:vir:79225 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469157;genbank:gi:157835000;genbank:GeneID:5648806 Probab=98.35 E-value=1.5e-09 Score=69.04 Aligned_cols=86 Identities=15% Similarity=0.128 Sum_probs=60.7 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhc-C--CCCCcHHHHHhc-----CCCCCc Q lcl|NC_019918. 91 MAPTFEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDD-Y--PGSNSPAWAAYK-----GFNDPL 162 (180) Q Consensus 91 lr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~-~--~pPnsp~Ti~~K-----G~~~PL 162 (180) |--.++-.-+. +.+.+.+.++.....+...+|..||..+...++..|.. | |+|+||+|+++| +..++| T Consensus 1 M~~~i~i~~d~----~~~~~~L~~l~~~~~d~~~l~~~ig~~l~~~~~~rF~~eG~~W~pls~~t~~~r~~~g~~~~~iL 76 (155) T protein:vir:79 1 MTTRIDVELDD----QEVRQRLAVLMRSVTDTLPVMRGIAAELLAETEFAFMDEGPGWPQLSPATVAAREAKGRGPHPIL 76 (155) T ss_pred CceEEEEEech----HHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhhccCCCCCCCCHHHHHHHhccCCCCCCcc Confidence 32222110011 12334444444433477899999999999999999964 4 349999999865 356899 Q ss_pred hhHHHHHhhhhhheeccC Q lcl|NC_019918. 163 FHTGKMLESVKFQIHRRQ 180 (180) Q Consensus 163 IDTG~L~~SIty~V~~k~ 180 (180) +|||.|++||+|++.... T Consensus 77 ~~tG~L~~Si~~~~~~~~ 94 (155) T protein:vir:79 77 QVTNALARSVTTWADRNE 94 (155) T ss_pred ccchhhhhhhhceecCCE Confidence 999999999999998877 No 15 >protein:vir:79091 Length: 175 # NCBI annotation: gp5, phage virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111205;genbank:gi:134288802;genbank:GeneID:4960765 Probab=98.34 E-value=1.7e-09 Score=68.69 Aligned_cols=86 Identities=14% Similarity=0.086 Sum_probs=61.0 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcC----CCCCcHHHHHhc---------- Q lcl|NC_019918. 91 MAPTFEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDDY----PGSNSPAWAAYK---------- 156 (180) Q Consensus 91 lr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~~----~pPnsp~Ti~~K---------- 156 (180) |-..++ .+-. .+.+.+.+.++.....+...+|..||..+...++..|.+. |+|+||+|+++| T Consensus 1 Ms~~i~-i~~d---~~~~~~~L~~l~~~~~d~~~lm~~Ig~~l~~~t~~rF~~~~~PdW~pls~~t~~~r~~~~~~~~~~ 76 (175) T protein:vir:79 1 MSDFVN-FQID---DSALRTRLLQLEQAGHQKADAMRKITQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKN 76 (175) T ss_pred CceEEE-EEec---hHHHHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCChHHHHhhcccccccccc Confidence 322111 1100 0123444444444445788999999999999999999764 349999998643 Q ss_pred -----------CCCCCchhHHHHHhhhhhheeccC Q lcl|NC_019918. 157 -----------GFNDPLFHTGKMLESVKFQIHRRQ 180 (180) Q Consensus 157 -----------G~~~PLIDTG~L~~SIty~V~~k~ 180 (180) +..++|+|||.|++||+|.+.... T Consensus 77 ~~~~~~~~~~~~~~~~L~~tG~L~~Si~~~~~~~~ 111 (175) T protein:vir:79 77 GELTAAASRRKAGLMILQDSGQMAASTATDSGEDY 111 (175) T ss_pred ccchhhHhhhccCCCcceechhhhhhhhheecCCE Confidence 457899999999999999998777 No 16 >protein:vir:94538 Length: 125 # NCBI annotation: putative head to tail joining # Family: family:all:180 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223893;genbank:gi:62327105;genbank:GeneID:5075554 Probab=98.32 E-value=1.1e-09 Score=69.67 Aligned_cols=92 Identities=21% Similarity=0.440 Sum_probs=56.7 Q ss_pred ecccceeeehHHHHHHHHHHHHHhhCC------------------------EEEEEecccccC----CCCCCC-----CC Q lcl|NC_019918. 24 ILASFSFKTDRRRLTSLIKRVEALDGT------------------------TVEVGFFPEDRY----GSENGN-----LP 70 (180) Q Consensus 24 M~~~v~~k~~~~~l~~l~~~l~~l~~~------------------------~V~VGi~~~~~~----~~~~~G-----~~ 70 (180) |.+.++|+.+ +|++|.+.|+.+.+. -+.-|-+.+.=. ...++| -+ T Consensus 1 Ma~~~~i~~~--Gld~l~~~L~~~~~~~~~~v~~al~~~a~~i~~~ak~~ap~~tG~L~~sI~~~~~~~~~~~~~~~v~~ 78 (125) T protein:vir:94 1 MANDFNIKFK--GVDKLLDEFDISRKELVPYSVEAMKTSLSRAVEKSKGLARVDTGYMRNNIQQDEVKEEHGVVTGRYVA 78 (125) T ss_pred CCCceeeeeh--hHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCChhhhhhceecceeccCCcEEEEeeC Confidence 8888777753 455555555544221 011222211100 001111 23 Q ss_pred HHHHHHHHhcCCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhCC Q lcl|NC_019918. 71 VAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRDG 119 (180) Q Consensus 71 vA~iA~i~EfGt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g~ 119 (180) .+.+|.+.||||...|+||||+|++++. +..+.+.+++.+++++.-. T Consensus 79 ~~~Ya~~vEfGT~~~~a~Pfl~pa~~~~--~~~~~~~l~~~l~~a~k~~ 125 (125) T protein:vir:94 79 RADYSSYNEYGTYRMSAQPFMAPSVAAM--TPFFYKAVRDALNKAAKFS 125 (125) T ss_pred CCCccceeecccccCCCCcccchhHHHH--HHHHHHHHHHHHHHHhccC Confidence 4679999999999999999999999875 4467777888887776654 No 17 >protein:vir:103841 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938236;genbank:gi:38229141;genbank:GeneID:2648156 Probab=98.31 E-value=1.8e-09 Score=68.54 Aligned_cols=86 Identities=14% Similarity=0.106 Sum_probs=60.5 Q ss_pred CCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhc-C--CCCCcHHHHHh-----c Q lcl|NC_019918. 85 NPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDD-Y--PGSNSPAWAAY-----K 156 (180) Q Consensus 85 IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~-~--~pPnsp~Ti~~-----K 156 (180) .. .++.-.+++ +.+.+.+.++.....+...+|..||..+...++..|.. | |+|+||+|+++ + T Consensus 1 Ms--~~i~i~~~~--------~~~~~~L~~l~~~~~~~~~l~~~ig~~l~~~~~~rF~p~G~~W~plsp~t~~~r~k~g~ 70 (155) T protein:vir:10 1 MA--NRIELELVD--------REVQERLAALYAAVTDTLPLMRGIAAELLAETEFAFMDEGPGWPQLSPVTVAARAAKGR 70 (155) T ss_pred CC--ceEEEEech--------HHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCCccchHHHHhccC Confidence 11 122222222 12334444444433477899999999999999999963 4 45999999864 3 Q ss_pred CCCCCchhHHHHHhhhhhheeccC Q lcl|NC_019918. 157 GFNDPLFHTGKMLESVKFQIHRRQ 180 (180) Q Consensus 157 G~~~PLIDTG~L~~SIty~V~~k~ 180 (180) |..++|+|||.|++||+|.+.... T Consensus 71 ~~~~~L~~tG~L~~Si~~~~~~~~ 94 (155) T protein:vir:10 71 GAHPILQVTNALARSITTRADRDQ 94 (155) T ss_pred CCCCccccchhhhhhhhceecCCE Confidence 567899999999999999998777 No 18 >protein:vir:102875 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338140;genbank:gi:77020200;genbank:GeneID:3703784 Probab=98.31 E-value=2.3e-09 Score=67.98 Aligned_cols=109 Identities=20% Similarity=0.282 Sum_probs=52.5 Q ss_pred CCCCcccccch-hhHHHHHh-hhheecccceeeehHHHHHHHHHHHHHhhC----------------------------- Q lcl|NC_019918. 1 MQDGRSFTTSA-TPVLKTLA-LGVIILASFSFKTDRRRLTSLIKRVEALDG----------------------------- 49 (180) Q Consensus 1 ~~~~~~~~~~~-~~~~~~~~-~~~~M~~~v~~k~~~~~l~~l~~~l~~l~~----------------------------- 49 (180) |-||-+|.... --+++.|. |+-+.. .+.-+.....-+.+.+.++.... T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~~-~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~ 79 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRGE-KIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKA 79 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHH-HHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceeccc Confidence 66655544332 12222221 222211 01111111222223333333321 Q ss_pred ------CEEEEEecccccCCCCCCCCCHHHHHHHHhcCCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_019918. 50 ------TTVEVGFFPEDRYGSENGNLPVAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRDGR 120 (180) Q Consensus 50 ------~~V~VGi~~~~~~~~~~~G~~vA~iA~i~EfGt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g~~ 120 (180) ..+.||+-.. ++ +.+.++.+.||||.+.||+|||+|+++.. ++.+.+.+...+.+.+.-.+ T Consensus 80 ~~~~g~~~~~vg~~~~-------~~-~~~~y~~f~E~GT~~~~a~PFl~pa~~~~--k~~~~~~~~~~l~~~l~ka~ 146 (146) T protein:vir:10 80 KLEGGIKTVKIGLNKA-------DR-SPWFYLKFHEWGTSKMPAHPFIEPGFNAS--KAEAVRAMTDILKNEMRLDL 146 (146) T ss_pred cccccceeEEeeeccC-------CC-CCcceeeeeccCCCCCCCCcchhHHHHHh--HHHHHHHHHHHHHHHHhhcC Confidence 1223333211 11 23578889999999999999999999875 44566666666665543322 No 19 >protein:vir:107568 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338191;genbank:gi:77020147;genbank:GeneID:3703699 Probab=98.31 E-value=2.3e-09 Score=67.98 Aligned_cols=109 Identities=20% Similarity=0.282 Sum_probs=52.5 Q ss_pred CCCCcccccch-hhHHHHHh-hhheecccceeeehHHHHHHHHHHHHHhhC----------------------------- Q lcl|NC_019918. 1 MQDGRSFTTSA-TPVLKTLA-LGVIILASFSFKTDRRRLTSLIKRVEALDG----------------------------- 49 (180) Q Consensus 1 ~~~~~~~~~~~-~~~~~~~~-~~~~M~~~v~~k~~~~~l~~l~~~l~~l~~----------------------------- 49 (180) |-||-+|.... --+++.|. |+-+.. .+.-+.....-+.+.+.++.... T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~~-~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~ 79 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRGE-KIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKA 79 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHH-HHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceeccc Confidence 66655544332 12222221 222211 01111111222223333333321 Q ss_pred ------CEEEEEecccccCCCCCCCCCHHHHHHHHhcCCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_019918. 50 ------TTVEVGFFPEDRYGSENGNLPVAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRDGR 120 (180) Q Consensus 50 ------~~V~VGi~~~~~~~~~~~G~~vA~iA~i~EfGt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g~~ 120 (180) ..+.||+-.. ++ +.+.++.+.||||.+.||+|||+|+++.. ++.+.+.+...+.+.+.-.+ T Consensus 80 ~~~~g~~~~~vg~~~~-------~~-~~~~y~~f~E~GT~~~~a~PFl~pa~~~~--k~~~~~~~~~~l~~~l~ka~ 146 (146) T protein:vir:10 80 KLEGGIKTVKIGLNKA-------DR-SPWFYLKFHEWGTSKMPAHPFIEPGFNAS--KAEAVRAMTDILKNEMRLDL 146 (146) T ss_pred cccccceeEEeeeccC-------CC-CCcceeeeeccCCCCCCCCcchhHHHHHh--HHHHHHHHHHHHHHHHhhcC Confidence 1223333211 11 23578889999999999999999999875 44566666666665543322 No 20 >protein:vir:102085 Length: 146 # NCBI annotation: head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512318;genbank:gi:89152487;genbank:GeneID:3953078 Probab=98.31 E-value=2.3e-09 Score=67.98 Aligned_cols=109 Identities=20% Similarity=0.282 Sum_probs=52.5 Q ss_pred CCCCcccccch-hhHHHHHh-hhheecccceeeehHHHHHHHHHHHHHhhC----------------------------- Q lcl|NC_019918. 1 MQDGRSFTTSA-TPVLKTLA-LGVIILASFSFKTDRRRLTSLIKRVEALDG----------------------------- 49 (180) Q Consensus 1 ~~~~~~~~~~~-~~~~~~~~-~~~~M~~~v~~k~~~~~l~~l~~~l~~l~~----------------------------- 49 (180) |-||-+|.... --+++.|. |+-+.. .+.-+.....-+.+.+.++.... T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~~-~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~ 79 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRGE-KIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKA 79 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHH-HHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceeccc Confidence 66655544332 12222221 222211 01111111222223333333321 Q ss_pred ------CEEEEEecccccCCCCCCCCCHHHHHHHHhcCCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_019918. 50 ------TTVEVGFFPEDRYGSENGNLPVAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRDGR 120 (180) Q Consensus 50 ------~~V~VGi~~~~~~~~~~~G~~vA~iA~i~EfGt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g~~ 120 (180) ..+.||+-.. ++ +.+.++.+.||||.+.||+|||+|+++.. ++.+.+.+...+.+.+.-.+ T Consensus 80 ~~~~g~~~~~vg~~~~-------~~-~~~~y~~f~E~GT~~~~a~PFl~pa~~~~--k~~~~~~~~~~l~~~l~ka~ 146 (146) T protein:vir:10 80 KLEGGIKTVKIGLNKA-------DR-SPWFYLKFHEWGTSKMPAHPFIEPGFNAS--KAEAVRAMTDILKNEMRLDL 146 (146) T ss_pred cccccceeEEeeeccC-------CC-CCcceeeeeccCCCCCCCCcchhHHHHHh--HHHHHHHHHHHHHHHHhhcC Confidence 1223333211 11 23578889999999999999999999875 44566666666665543322 No 21 >protein:vir:105007 Length: 146 # NCBI annotation: conserved phage protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459972;genbank:gi:85701387;genbank:GeneID:3882148 Probab=98.31 E-value=2.3e-09 Score=67.98 Aligned_cols=109 Identities=20% Similarity=0.282 Sum_probs=52.5 Q ss_pred CCCCcccccch-hhHHHHHh-hhheecccceeeehHHHHHHHHHHHHHhhC----------------------------- Q lcl|NC_019918. 1 MQDGRSFTTSA-TPVLKTLA-LGVIILASFSFKTDRRRLTSLIKRVEALDG----------------------------- 49 (180) Q Consensus 1 ~~~~~~~~~~~-~~~~~~~~-~~~~M~~~v~~k~~~~~l~~l~~~l~~l~~----------------------------- 49 (180) |-||-+|.... --+++.|. |+-+.. .+.-+.....-+.+.+.++.... T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~~-~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~ 79 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRGE-KIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKA 79 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHH-HHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceeccc Confidence 66655544332 12222221 222211 01111111222223333333321 Q ss_pred ------CEEEEEecccccCCCCCCCCCHHHHHHHHhcCCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_019918. 50 ------TTVEVGFFPEDRYGSENGNLPVAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRDGR 120 (180) Q Consensus 50 ------~~V~VGi~~~~~~~~~~~G~~vA~iA~i~EfGt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g~~ 120 (180) ..+.||+-.. ++ +.+.++.+.||||.+.||+|||+|+++.. ++.+.+.+...+.+.+.-.+ T Consensus 80 ~~~~g~~~~~vg~~~~-------~~-~~~~y~~f~E~GT~~~~a~PFl~pa~~~~--k~~~~~~~~~~l~~~l~ka~ 146 (146) T protein:vir:10 80 KLEGGIKTVKIGLNKA-------DR-SPWFYLKFHEWGTSKMPAHPFIEPGFNAS--KAEAVRAMTDILKNEMRLDL 146 (146) T ss_pred cccccceeEEeeeccC-------CC-CCcceeeeeccCCCCCCCCcchhHHHHHh--HHHHHHHHHHHHHHHHhhcC Confidence 1223333211 11 23578889999999999999999999875 44566666666665543322 No 22 >protein:vir:99833 Length: 190 # NCBI annotation: hypothetical protein # Family: family:all:274 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164071;genbank:gi:56692603;genbank:GeneID:3192561 Probab=98.30 E-value=2.9e-09 Score=67.41 Aligned_cols=85 Identities=14% Similarity=0.187 Sum_probs=59.5 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcC-------CCCCcHHHHHhc--CCCC Q lcl|NC_019918. 90 FMAPTFEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDDY-------PGSNSPAWAAYK--GFND 160 (180) Q Consensus 90 Flr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~~-------~pPnsp~Ti~~K--G~~~ 160 (180) -+.-.+. . +-..+.+.+...+.. -.+...++..||..+...+++.|.+. |+|++|+|+++| +..+ T Consensus 1 M~~i~i~-~-d~~~~~~~L~~l~~~----~~~~~~l~~~ig~~l~~~~~~rf~~~~~PdG~~W~p~~~~t~~rk~~~~~~ 74 (190) T protein:vir:99 1 MAGITLE-W-DGRRALDVLNAGSAA----LGDPSGLLQDIGELLLNIHRRRFQAQVSPDGTPWQPLSPAYLRRKRKNRDK 74 (190) T ss_pred CceeEEE-e-cHHHHHHHHHHHHHH----hhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCccccHHHHHHhhcCCCc Confidence 1111111 0 111233334444433 33678999999999999999999743 248999999766 5679 Q ss_pred CchhHHHHHhhhhhheeccC Q lcl|NC_019918. 161 PLFHTGKMLESVKFQIHRRQ 180 (180) Q Consensus 161 PLIDTG~L~~SIty~V~~k~ 180 (180) +|+|||.|++||+|++.... T Consensus 75 ~L~~tg~L~~Si~~~~~~~~ 94 (190) T protein:vir:99 75 ILTLDGHLRNLLRYQLDGSE 94 (190) T ss_pred cceecHHHHHHHhheecCcE Confidence 99999999999999998777 No 23 >protein:vir:1437 Length: 140 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536366;genbank:gi:17975171;genbank:GeneID:929147 Probab=98.28 E-value=3.3e-09 Score=67.09 Aligned_cols=90 Identities=20% Similarity=0.261 Sum_probs=49.7 Q ss_pred ecccceeeehHHHHHHHHHHHHHhh------------------------------------------------CCEEEEE Q lcl|NC_019918. 24 ILASFSFKTDRRRLTSLIKRVEALD------------------------------------------------GTTVEVG 55 (180) Q Consensus 24 M~~~v~~k~~~~~l~~l~~~l~~l~------------------------------------------------~~~V~VG 55 (180) |. .++++ +|++|++.|+.|. ...+.|| T Consensus 1 M~-~~~i~----Gld~l~~~l~~l~~~~~~~~~~~al~~~a~~v~~~ak~~aP~~tG~l~~sI~~~~~~~~~~~~~~~vg 75 (140) T protein:vir:14 1 MS-SIQII----GLADLRADFEKLAKSQSAKALRRATLAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGLATAG 75 (140) T ss_pred Cc-eeeeh----hHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHhhcccccccccccceeEEee Confidence 33 23333 2222222222221 1123344 Q ss_pred ecccccCCCCCCCCCHHHHHHHHhcCCCCCCCCcchhhHHHHHHH--HHHHHHHHHHHHHHHHhCCC Q lcl|NC_019918. 56 FFPEDRYGSENGNLPVAQVAAYNEFGTTRNPTRPFMAPTFEEFTS--QFHYARLMKSTFENVIRDGR 120 (180) Q Consensus 56 i~~~~~~~~~~~G~~vA~iA~i~EfGt~~IP~RpFlr~~~~~~~~--~~~~~~~~~~~~~~~l~g~~ 120 (180) +..+... ..++-+.+.++.+.||||.++||||||+|++++.+. ...+.+.+++.+.+++.|.. T Consensus 76 ~~~~~~~--~~~~~~~~~y~~f~E~GT~~~~a~pFl~pa~~~~~~~~~~~~~~~~~~~l~k~~~~~~ 140 (140) T protein:vir:14 76 VRVRTKG--KADSPNNAFYWRFDEFGTQHMKAQPFMRPAFDASIGEAEGAIRTELARAIDRVLGGRR 140 (140) T ss_pred eeecccc--ccCCCCccceeeeeccccCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccC Confidence 3322211 112234577899999999999999999999987532 23345666666777777776 No 24 >protein:vir:99196 Length: 155 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950453;genbank:gi:119953654;genbank:GeneID:4643056 Probab=98.25 E-value=3.3e-09 Score=67.05 Aligned_cols=86 Identities=13% Similarity=0.108 Sum_probs=60.6 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHh-cC--CCCCcHHHHHhc-----CCCCCc Q lcl|NC_019918. 91 MAPTFEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNID-DY--PGSNSPAWAAYK-----GFNDPL 162 (180) Q Consensus 91 lr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~-~~--~pPnsp~Ti~~K-----G~~~PL 162 (180) |-.-++-.-+. +.+.+.+.++.....+...+|..||..+...++..|. +| |+|+||+|+++| +..++| T Consensus 1 Ms~~i~i~~d~----~~~~~~L~~l~~~~~d~~~l~~~ig~~l~~~~~~rF~pdG~~W~pls~~t~~~r~~~g~~~~~iL 76 (155) T protein:vir:99 1 MTTRIDVELDD----QEVRQRLALLMRSVTDTLPVMRGIAAELLAETEFAFMDEGPGWPQLSPVTVAAREAKGRGPHPIL 76 (155) T ss_pred CceEEEEEech----HHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhhccCCCCCCCChHHHHHHhccCCCCCCcc Confidence 32212110011 2234444444444446789999999999999999996 45 349999999866 246799 Q ss_pred hhHHHHHhhhhhheeccC Q lcl|NC_019918. 163 FHTGKMLESVKFQIHRRQ 180 (180) Q Consensus 163 IDTG~L~~SIty~V~~k~ 180 (180) +|||.|++||+|.+.+.. T Consensus 77 ~~tg~L~~Si~~~~~~~~ 94 (155) T protein:vir:99 77 QVTNALARSVTTWADRNE 94 (155) T ss_pred hhchhhhhhhhceecCCE Confidence 999999999999998777 No 25 >protein:vir:1891 Length: 179 # NCBI annotation: gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037671;genbank:gi:9634129;genbank:GeneID:1262520 Probab=98.25 E-value=1e-09 Score=69.92 Aligned_cols=103 Identities=19% Similarity=0.290 Sum_probs=45.5 Q ss_pred ecccceeeehHHHHHHHHHHHHHhhCC----------------------------------------------------- Q lcl|NC_019918. 24 ILASFSFKTDRRRLTSLIKRVEALDGT----------------------------------------------------- 50 (180) Q Consensus 24 M~~~v~~k~~~~~l~~l~~~l~~l~~~----------------------------------------------------- 50 (180) |..+++++.. +|++|.+.|++|.+. T Consensus 1 Ma~~~~~~i~--Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~~~~~~~~~ 78 (179) T protein:vir:18 1 MADSVEVSLT--GLESLLGKMEAVSEVTRNKAGRFALRKAANIIRDRARSNASRVDDPLTKEAIHKNIVASFSSKQFRRT 78 (179) T ss_pred CCceEEEEee--cHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccccchhhhhhheeecccccccccc Confidence 5555544422 222333222222110 Q ss_pred ---EEEEEecccccC--------------------CCCCCCCCHHHHHHHHhcCCCCCCCCcchhhHHHHHHHHHHHHHH Q lcl|NC_019918. 51 ---TVEVGFFPEDRY--------------------GSENGNLPVAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARL 107 (180) Q Consensus 51 ---~V~VGi~~~~~~--------------------~~~~~G~~vA~iA~i~EfGt~~IP~RpFlr~~~~~~~~~~~~~~~ 107 (180) .+.||+..+... +....+-..+.++.+.||||.++||||||||++++... T Consensus 79 g~~~~~vgv~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~y~~fvEfGT~kmpa~PFlrPA~~~~~~------- 151 (179) T protein:vir:18 79 GDLAFRVGVMGGARQYANTKANVRKGRAGKTYKTSGDKGNPGGDTWYWRFLEFGTEHTSARPILRPAMNGVDN------- 151 (179) T ss_pred cceeEeeecccccccccccccccccCcccccccccccccCCCCccceeEEeccCCCCCCCCccchhhHHhhHH------- Confidence 123333221110 00001112356777889999999999999999986432 Q ss_pred HHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCcHHHHHhcCCCC Q lcl|NC_019918. 108 MKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDDYPGSNSPAWAAYKGFND 160 (180) Q Consensus 108 ~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~~~pPnsp~Ti~~KG~~~ 160 (180) ++++.+...+...|++.+.... .||-.- T Consensus 152 ----------------~a~~~i~~~l~~~i~k~lk~~~---------~~~~~~ 179 (179) T protein:vir:18 152 ----------------DVINVFSTEMGKAIDRAIRLAM---------KKGTTA 179 (179) T ss_pred ----------------HHHHHHHHHHHHHHHHHHHhhc---------ccCCCC Confidence 2233333333444444443210 011000 No 26 >protein:vir:1386 Length: 149 # NCBI annotation: Gp9 protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612838;genbank:gi:20065972;genbank:GeneID:935787 Probab=98.23 E-value=4.1e-09 Score=66.57 Aligned_cols=112 Identities=19% Similarity=0.242 Sum_probs=47.4 Q ss_pred CCCCcccccch-hhHHHHHh-hhheec-ccceeeehHHHHHHHHHHHHHhh----------------------------- Q lcl|NC_019918. 1 MQDGRSFTTSA-TPVLKTLA-LGVIIL-ASFSFKTDRRRLTSLIKRVEALD----------------------------- 48 (180) Q Consensus 1 ~~~~~~~~~~~-~~~~~~~~-~~~~M~-~~v~~k~~~~~l~~l~~~l~~l~----------------------------- 48 (180) |-||-+|...- .-+++.|. ||.... ..+.-+.-...-.-+.+.++... T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~~k~~~~Al~~ga~~v~~~~k~~aP~~~~~~~~~~~~~~~~~~~~d~i~~~~~ 80 (149) T protein:vir:13 1 MSDGWEIKFEGLDDLIKTFEQLGTEKENEDVEKSILKECGDLAKKTVAPLIHISDDNSKSGRKGSRPPGHAANNIPEPKI 80 (149) T ss_pred CCceeEEEeecHHHHHHHHHhcccHHHHHHHHHHHHHHHHHHHHHHHHHhCCccCCccccccccccccchhhhcceeccc Confidence 66665544321 11222211 110000 00000000011111111111111 Q ss_pred -----CCEEEEEecccccCCCCCCCCCHHHHHHHHhcCCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHH Q lcl|NC_019918. 49 -----GTTVEVGFFPEDRYGSENGNLPVAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRDGRQVN 123 (180) Q Consensus 49 -----~~~V~VGi~~~~~~~~~~~G~~vA~iA~i~EfGt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~ 123 (180) ...|.||+..+ ++ +.+.++.+.||||.+.||+|||||++++.. .++.+.+...+.+.+.-. T Consensus 81 ~~~~g~~~~~VG~~~~-------~~-~~~~y~~f~E~GT~k~~a~pF~~pa~~~~~--~~~~~~~~~~l~k~i~~~---- 146 (149) T protein:vir:13 81 RKKKGNLQCVVGWEKS-------DN-TPFYYMKMEEWGTSERPPHHAFGKTNKILK--RVYDNIAQKKYDNFVKEK---- 146 (149) T ss_pred ccccceeEEEeeccCC-------CC-CccceeeeeccCccCCCCCccchHHHHHHH--HHHHHHHHHHHHHHHHHH---- Confidence 11245555322 12 235788899999999999999999998754 345544444443333221 Q ss_pred HHHHHHHHHHHHHHHHHHhc Q lcl|NC_019918. 124 TLLKKLGRMVAEQMQVNIDD 143 (180) Q Consensus 124 ~~L~~iG~~~~~~Iq~~I~~ 143 (180) +.+ T Consensus 147 -----------------lG~ 149 (149) T protein:vir:13 147 -----------------LGD 149 (149) T ss_pred -----------------hcC Confidence 111 No 27 >protein:vir:1988 Length: 156 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050635;genbank:gi:9633522;genbank:GeneID:2636282 Probab=98.22 E-value=4.4e-09 Score=66.42 Aligned_cols=85 Identities=8% Similarity=0.071 Sum_probs=58.9 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhc------C--CCCCcHHHHHhcC----- Q lcl|NC_019918. 91 MAPTFEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDD------Y--PGSNSPAWAAYKG----- 157 (180) Q Consensus 91 lr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~------~--~pPnsp~Ti~~KG----- 157 (180) |...++-..+...+. +.+.++.. ..+...++..||..+...++..|.+ | |+|++|+|+++|. T Consensus 1 ms~~i~~~~d~~~l~----~~L~~l~~-~~~~~~l~~~Ig~~l~~~~~~rf~~~~~Pd~G~~W~pls~~t~~~r~~~~~~ 75 (156) T protein:vir:19 1 MSLDMNVAVDVRRIQ----LALDELGT-VTRDRAIPRVMAAALLSSTEQAFERQADPDTGKGWEAWSDSWLAWRQDHGFV 75 (156) T ss_pred CeEEEEEeecHHHHH----HHHHHHHh-hhccHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCCcccChHHHHHhhccCCC Confidence 544432111112233 33333322 2234579999999999999999963 4 3499999999873 Q ss_pred CCCCchhHHHHHhhhhhheeccC Q lcl|NC_019918. 158 FNDPLFHTGKMLESVKFQIHRRQ 180 (180) Q Consensus 158 ~~~PLIDTG~L~~SIty~V~~k~ 180 (180) ..+||+|||.|++||+|.+.... T Consensus 76 ~~~~L~~tg~L~~Si~~~~~~~~ 98 (156) T protein:vir:19 76 PGSILTLHGDLARSITTDYGQDY 98 (156) T ss_pred CCcchhhhHHHHHHhhheecCCE Confidence 36799999999999999998777 No 28 >protein:vir:100075 Length: 140 # NCBI annotation: gp9 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945039;genbank:gi:38707899;genbank:GeneID:2744122 Probab=98.19 E-value=6.4e-09 Score=65.51 Aligned_cols=90 Identities=20% Similarity=0.261 Sum_probs=46.9 Q ss_pred ecccceeeehHHHHHHHHHHHHHhh------------------------------------------------CCEEEEE Q lcl|NC_019918. 24 ILASFSFKTDRRRLTSLIKRVEALD------------------------------------------------GTTVEVG 55 (180) Q Consensus 24 M~~~v~~k~~~~~l~~l~~~l~~l~------------------------------------------------~~~V~VG 55 (180) |. .+.++ +|++|.+.|+.|. ...+.|| T Consensus 1 Ma-~~~i~----Gld~l~~~l~~L~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~tG~l~~sI~~~~~~~~~~~~~~~~g 75 (140) T protein:vir:10 1 MS-SIQII----GLADLRADFEKLAKSQSTKALRRATVAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGLATAG 75 (140) T ss_pred Cc-eeeeh----hHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHHhccccccccccccceEEee Confidence 22 23333 2222222222221 1122333 Q ss_pred ecccccCCCCCCCCCHHHHHHHHhcCCCCCCCCcchhhHHHHHHHH--HHHHHHHHHHHHHHHhCCC Q lcl|NC_019918. 56 FFPEDRYGSENGNLPVAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQ--FHYARLMKSTFENVIRDGR 120 (180) Q Consensus 56 i~~~~~~~~~~~G~~vA~iA~i~EfGt~~IP~RpFlr~~~~~~~~~--~~~~~~~~~~~~~~l~g~~ 120 (180) +...... ..++.+.+.++.+.||||.++||+|||+|++++.+.+ +.+.+.+++.+.+++.|.. T Consensus 76 ~~~~~~~--~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~k~~~~~~ 140 (140) T protein:vir:10 76 VRVRTKG--KADSPNNAFYWRFDEFGTQHMKAQPFMRPAFDASIGEAEGAIRTELARAIDRVLGGRR 140 (140) T ss_pred eeecccc--ccCCCCccceeeeeccCCCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHHhhccC Confidence 3221110 1112345778999999999999999999999875431 2234445555667777766 No 29 >protein:vir:100243 Length: 140 # NCBI annotation: gp72 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355408;genbank:gi:77864698;genbank:GeneID:3725965 Probab=98.16 E-value=1.2e-08 Score=64.09 Aligned_cols=90 Identities=18% Similarity=0.183 Sum_probs=48.2 Q ss_pred ecccceeeehHHHHHHHHHHHHHhh------------------------------------------------CCEEEEE Q lcl|NC_019918. 24 ILASFSFKTDRRRLTSLIKRVEALD------------------------------------------------GTTVEVG 55 (180) Q Consensus 24 M~~~v~~k~~~~~l~~l~~~l~~l~------------------------------------------------~~~V~VG 55 (180) |. .++++. |++|.+.|+.|. ...+.|| T Consensus 1 Ma-~~~i~G----ld~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~~~ 75 (140) T protein:vir:10 1 MS-SVQILG----LADLQADFLKLAKAQSTKALRRATVAGANVIRDEARARAPKKTGKLKRNIVTAALKQKDSPGIATAG 75 (140) T ss_pred Cc-eeeehh----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHHhceecccccccccceeEEe Confidence 22 333332 111211111111 0123333 Q ss_pred ecccccCCCCCCCCCHHHHHHHHhcCCCCCCCCcchhhHHHHHHH--HHHHHHHHHHHHHHHHhCCC Q lcl|NC_019918. 56 FFPEDRYGSENGNLPVAQVAAYNEFGTTRNPTRPFMAPTFEEFTS--QFHYARLMKSTFENVIRDGR 120 (180) Q Consensus 56 i~~~~~~~~~~~G~~vA~iA~i~EfGt~~IP~RpFlr~~~~~~~~--~~~~~~~~~~~~~~~l~g~~ 120 (180) +..+.... ..+.+.+.++.+.||||.+.||+|||+|++++.+. .+.+.+.+++.+.+++.|+. T Consensus 76 ~~~~~~~~--~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~k~~~~~~ 140 (140) T protein:vir:10 76 VRVRTKGK--ADSPNNAFYWRFVELGTQFMKAEPFMRPAFDASIAQAEGAIRTEIARAIDQVVGGGL 140 (140) T ss_pred eccccccc--cCCCCcccccceeccCcCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHHhhcCC Confidence 33221110 12345677999999999999999999999987532 12334455556677777876 No 30 >protein:vir:80362 Length: 140 # NCBI annotation: gp10, phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111089;genbank:gi:134288660;genbank:GeneID:4960609 Probab=98.13 E-value=1.1e-08 Score=64.32 Aligned_cols=90 Identities=19% Similarity=0.276 Sum_probs=47.5 Q ss_pred ecccceeeehHHHHHHHHHHHHHhhC------------------------------------------------CEEEEE Q lcl|NC_019918. 24 ILASFSFKTDRRRLTSLIKRVEALDG------------------------------------------------TTVEVG 55 (180) Q Consensus 24 M~~~v~~k~~~~~l~~l~~~l~~l~~------------------------------------------------~~V~VG 55 (180) |. .+.|+ +|++|++.|+.|.. ..+.|| T Consensus 1 Ma-~~~i~----Gld~l~~~l~~l~~~~~~k~~~~a~~~~a~~v~~~ak~~aP~~tG~l~~~i~~~~~~~~~~~~~~~~~ 75 (140) T protein:vir:80 1 MS-SIQIV----GLADLLADFERLAKSQSTKALRRATVAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGLATAG 75 (140) T ss_pred Cc-eeeeh----hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhceeeeccccccccceeeee Confidence 33 34333 22222222222210 012233 Q ss_pred ecccccCCCCCCCCCHHHHHHHHhcCCCCCCCCcchhhHHHHHHHH--HHHHHHHHHHHHHHHhCCC Q lcl|NC_019918. 56 FFPEDRYGSENGNLPVAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQ--FHYARLMKSTFENVIRDGR 120 (180) Q Consensus 56 i~~~~~~~~~~~G~~vA~iA~i~EfGt~~IP~RpFlr~~~~~~~~~--~~~~~~~~~~~~~~l~g~~ 120 (180) +..+... ..++.+.+.++.+.||||.++||||||+|++++.+.+ +.+.+.+++.+.+++.|.. T Consensus 76 ~~~~~~~--~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~k~~~~~~ 140 (140) T protein:vir:80 76 VRVRTKG--KADSPSNAFYWRFDEFGTQHMKAQPFMRPAFDASIGEAEGAIRTELARAIDQALGGRR 140 (140) T ss_pred eeccccc--ccCCCCCcceeeeeccCCCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHHhhccC Confidence 3221111 0123445789999999999999999999999875331 2234455555666677765 No 31 >protein:vir:194 Length: 149 # NCBI annotation: Gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037704;genbank:gi:9634169;genbank:GeneID:1262536 Probab=98.12 E-value=8.1e-09 Score=64.95 Aligned_cols=94 Identities=23% Similarity=0.296 Sum_probs=44.4 Q ss_pred ecccceeeehHHHHHHHHHHHHHhhCC----------------------------------EEE---------------E Q lcl|NC_019918. 24 ILASFSFKTDRRRLTSLIKRVEALDGT----------------------------------TVE---------------V 54 (180) Q Consensus 24 M~~~v~~k~~~~~l~~l~~~l~~l~~~----------------------------------~V~---------------V 54 (180) |+ .+..+..+|++|++.|+.|.+. .+. | T Consensus 1 mm---~~~~~i~Gl~~l~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~si~~~~~~~~~~~~~~~~v 77 (149) T protein:vir:19 1 MI---ETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIDRAPVRTGKLKKNVVVVTQKSRRRGEISSGV 77 (149) T ss_pred Cc---ceeeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCchhhhhhccccccccccccceeecc Confidence 33 2233334455554444444211 000 0 Q ss_pred Eecccc--cCCC----CCCCCCHHHHHHHHhcCCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHH Q lcl|NC_019918. 55 GFFPED--RYGS----ENGNLPVAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKK 128 (180) Q Consensus 55 Gi~~~~--~~~~----~~~G~~vA~iA~i~EfGt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~ 128 (180) ++.... .... ...+-+.+.++.+.||||.++||+|||+|++++.+ .++.+.+...+.+.+. .+|.+ T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PF~~pA~~~~k--~~~~~~~~~~l~~~l~------k~~~k 149 (149) T protein:vir:19 78 HIRGVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTRE--EEAASVAIARMNQAID------EVLSK 149 (149) T ss_pred cccccccccccccceeecCCCCccceeeeeccCCCCCCCCcchhHHHHHHH--HHHHHHHHHHHHHHHH------HHhcC Confidence 000000 0000 00011245688899999999999999999998753 3444545444444321 22222 No 32 >protein:vir:93617 Length: 148 # NCBI annotation: putative structural component # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449299;genbank:gi:157166047;interpro:IPR010064;interpro:IPR011693;uniprot:Q6H9U2;genbank:GeneID:5580439 Probab=98.12 E-value=8e-09 Score=64.97 Aligned_cols=94 Identities=20% Similarity=0.297 Sum_probs=44.6 Q ss_pred ecccceeeehHHHHHHHHHHHHHhhCC----------------------------------EEE--------------EE Q lcl|NC_019918. 24 ILASFSFKTDRRRLTSLIKRVEALDGT----------------------------------TVE--------------VG 55 (180) Q Consensus 24 M~~~v~~k~~~~~l~~l~~~l~~l~~~----------------------------------~V~--------------VG 55 (180) |+ .++++ ..+|++|++.|+.|.+. .+. |+ T Consensus 1 mm-~~~~~--i~Gldel~~~l~~L~~~~~~~~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~g~~~~~v~ 77 (148) T protein:vir:93 1 MI-ETLLD--FSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRRSRDGGMESGVH 77 (148) T ss_pred Cc-ceeee--ehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcchhhhhceeccccccCCceeeeee Confidence 33 23333 34444444444444211 011 11 Q ss_pred eccccc--CCC----CCCCCCHHHHHHHHhcCCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHH Q lcl|NC_019918. 56 FFPEDR--YGS----ENGNLPVAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKK 128 (180) Q Consensus 56 i~~~~~--~~~----~~~G~~vA~iA~i~EfGt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~ 128 (180) +..... ... ...+...+.++.+.||||.+.||||||||++++.+ ..+.+.+.+.+.+.+. .+|.+ T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~pa~PFl~pA~~~~k--~~~~~~~~~~~~~~i~------k~~~k 148 (148) T protein:vir:93 78 IRGVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRS--EQAAQVAIARMNRAID------EVLRR 148 (148) T ss_pred ecccccccccccceeecCCCCCcceeeeeccCCCCCCCCcchhHHHHHhH--HHHHHHHHHHHHHHHH------HHhcC Confidence 100000 000 00122346788899999999999999999998753 3444444444443221 22222 No 33 >protein:vir:1273 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690765;genbank:gi:22855005;genbank:GeneID:955232 Probab=98.07 E-value=1.1e-08 Score=64.25 Aligned_cols=100 Identities=20% Similarity=0.178 Sum_probs=45.5 Q ss_pred CCCCcccccch-hhHHHHHh-hhheecccceeeehHHHHHHHHHHHHHhh-------------------------CCEEE Q lcl|NC_019918. 1 MQDGRSFTTSA-TPVLKTLA-LGVIILASFSFKTDRRRLTSLIKRVEALD-------------------------GTTVE 53 (180) Q Consensus 1 ~~~~~~~~~~~-~~~~~~~~-~~~~M~~~v~~k~~~~~l~~l~~~l~~l~-------------------------~~~V~ 53 (180) |-+ |.... --+++.|- |+-++.. ..-+......+.+.+.++... ...|. T Consensus 1 M~~---~~i~Gl~el~~~l~~l~~~~~~-~~~~al~~~a~~v~~~~k~~ap~~~~~tg~l~~~I~~~~~k~~~~g~~~v~ 76 (127) T protein:vir:12 1 MAD---MSFDGIDDLTQYFEKIGGDIEK-VEPVALKAGGEIIAERQRSHVNRSDKKQPHMQDNITVSNVRESKDGVRFVA 76 (127) T ss_pred Cee---eeehhHHHHHHHHHHhhHHHHH-HHHHHHHHHHHHHHHHHHHhCCCCCCChhHHHHhhhccccccccCceeEEE Confidence 111 00000 00011100 0100000 000000001111111111111 11233 Q ss_pred EEecccccCCCCCCCCCHHHHHHHHhcCCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019918. 54 VGFFPEDRYGSENGNLPVAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIR 117 (180) Q Consensus 54 VGi~~~~~~~~~~~G~~vA~iA~i~EfGt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~ 117 (180) ||+- -+.+.++.+.||||.+.||||||+|++++. +.++.+.+.+.+.+.++ T Consensus 77 Vg~~-----------~~~~~y~~f~E~GT~~~~a~Pf~~pa~~~~--~~~~~~~~~~~~~~~lk 127 (127) T protein:vir:12 77 VGPN-----------KKVAYRGRFLEWGTSKMPPQPFIEKGGKEG--EGPAVELMERILTAPIK 127 (127) T ss_pred EeeC-----------CCCcceeeeeccCccCCCCCccchHhHHHH--HHHHHHHHHHHHHHhcC Confidence 3332 124668889999999999999999999875 44677888888888777 No 34 >protein:vir:95789 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950593;genbank:gi:119953788;genbank:GeneID:5076859 Probab=98.01 E-value=1.5e-08 Score=63.45 Aligned_cols=85 Identities=15% Similarity=0.322 Sum_probs=46.9 Q ss_pred ceeeehHHHHHHHHHHHHHhhCC------------------------EEEEEecccccCCCCCCC-----CCHHHHHHHH Q lcl|NC_019918. 28 FSFKTDRRRLTSLIKRVEALDGT------------------------TVEVGFFPEDRYGSENGN-----LPVAQVAAYN 78 (180) Q Consensus 28 v~~k~~~~~l~~l~~~l~~l~~~------------------------~V~VGi~~~~~~~~~~~G-----~~vA~iA~i~ 78 (180) |+++.+ +|+++.+.|+.+.+. -|.-|.+.+.-. ...+| .+.+.+|.+. T Consensus 1 msi~i~--Gld~l~~~l~~~~~~~~~~v~~al~~~a~~i~~~ak~~aPv~TG~Lr~sI~-~~~~g~~~~V~~~~~Ya~yv 77 (114) T protein:vir:95 1 MAIKWQ--GIEKLVATISNAQPKAVEQSLQVLKNNGEKGKRIAKQLAPKDTEFLKDHIT-TSYPGMEAHIHGEAGYDGYQ 77 (114) T ss_pred Ceeeee--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCchhhhhcee-eecCceEEEeecCCCcccee Confidence 333321 333444333333211 011222211100 00111 2346789999 Q ss_pred hcCCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019918. 79 EFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIR 117 (180) Q Consensus 79 EfGt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~ 117 (180) ||||...|+||||+|++++.+ ..+.+.++..++..++ T Consensus 78 E~GT~~~~aqPfl~pa~~~~~--~~~~~~l~~~l~~~~k 114 (114) T protein:vir:95 78 EYGTRFQPGTPHFRPMMEQIQ--PQFQKDMTDVMKGAFK 114 (114) T ss_pred ecCccccCCCccchhhHHHHH--HHHHHHHHHHHHhhcC Confidence 999999999999999998754 4567777777776666 No 35 >protein:vir:107851 Length: 175 # NCBI annotation: gp31 # Family: family:all:274 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024704;genbank:gi:48696941;genbank:GeneID:2845939 Probab=97.96 E-value=2.7e-08 Score=62.08 Aligned_cols=86 Identities=16% Similarity=0.117 Sum_probs=61.7 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcC----CCCCcHHHHHh----------- Q lcl|NC_019918. 91 MAPTFEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDDY----PGSNSPAWAAY----------- 155 (180) Q Consensus 91 lr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~~----~pPnsp~Ti~~----------- 155 (180) |--.+ +.+-. .+.+.+++.++.....+...+|..||..++...+..|.+. +.|.+|+|+++ T Consensus 1 Ms~~i-~i~~~---~~~l~~~L~~l~~~~~d~~~l~~~Ig~~l~~~t~~rF~~e~~Pdw~p~~p~t~~~r~~~g~~~~k~ 76 (175) T protein:vir:10 1 MSDFV-NFQID---DSALRTRLLQLEQAGHQKAGAMRKIAQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKN 76 (175) T ss_pred CceeE-EEEec---HHHHHHHHHHHHHHhccHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchhhhhhhhcccccchhh Confidence 32211 11111 1234555555555555788999999999999999999764 23999999863 Q ss_pred ----------cCCCCCchhHHHHHhhhhhheeccC Q lcl|NC_019918. 156 ----------KGFNDPLFHTGKMLESVKFQIHRRQ 180 (180) Q Consensus 156 ----------KG~~~PLIDTG~L~~SIty~V~~k~ 180 (180) ++..++|+|||.|++||+|.+.+.. T Consensus 77 ~~~~~~~~~~~~~~~~L~~tG~L~~Si~~~~~~~~ 111 (175) T protein:vir:10 77 GELTAAASRRKAGLMILQDSGQMAASVSTDHDDNS 111 (175) T ss_pred hhhhhhhhhhccCCCcceechhhhhhhheeecCCE Confidence 2457899999999999999998777 No 36 >protein:vir:97088 Length: 157 # NCBI annotation: hypothetical protein # Family: family:all:2714 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453568;genbank:gi:84662603;genbank:GeneID:5142503 Probab=97.92 E-value=5.6e-08 Score=60.32 Aligned_cols=92 Identities=17% Similarity=0.274 Sum_probs=44.0 Q ss_pred hhheecccceeeehHHHHHHHHHHHHHhhCC------------------------------EEEEEecccccCCCCCCCC Q lcl|NC_019918. 20 LGVIILASFSFKTDRRRLTSLIKRVEALDGT------------------------------TVEVGFFPEDRYGSENGNL 69 (180) Q Consensus 20 ~~~~M~~~v~~k~~~~~l~~l~~~l~~l~~~------------------------------~V~VGi~~~~~~~~~~~G~ 69 (180) |+|-|. ..+.++|...++.|.+..++ .+.+-...+. ..+|. T Consensus 1 m~~~~~-----~~d~s~l~~~l~~l~~~~~~v~R~A~~~ga~vv~dear~~aP~~tG~LkksI~~~~~~~~----s~~g~ 71 (157) T protein:vir:97 1 MKFSIR-----SVDITGILAGLETVVEHSSDVVRTMTYESAVAVRESAKAFVNDETGKLRNNLYVAYSPEE----SVEGI 71 (157) T ss_pred CeeEee-----cccHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhheeeeecccc----CCCce Confidence 444443 12333333333332221111 1111111110 00111 Q ss_pred C---------HHHHHHHHhcC------------------------CCCCCCCcchhhHHHHHHHH--HHHHHHHHHHHHH Q lcl|NC_019918. 70 P---------VAQVAAYNEFG------------------------TTRNPTRPFMAPTFEEFTSQ--FHYARLMKSTFEN 114 (180) Q Consensus 70 ~---------vA~iA~i~EfG------------------------t~~IP~RpFlr~~~~~~~~~--~~~~~~~~~~~~~ 114 (180) . .+-++.+.||| +..+||||||||+|+....+ +.+.+.+.+.|.+ T Consensus 72 ~~~~Vg~~~~~a~~g~~vEfG~~~~~~~~~~~~~~~~~~~~~~~t~~~~Pa~PFlRPA~d~~k~~a~~~~~~~l~k~I~e 151 (157) T protein:vir:97 72 QTYAVSWRKKAAPHGHLLEFGHWQTHAAYRDKDGQWYSSKVKLVNPKWIPAKPFLRPGYDSVAMQIPDIARAAGAKKYAE 151 (157) T ss_pred EEEEEeecCCccceeeeeecCcccccccccCCcccccccccccCCCCcCCCCcccchHHHHhHHHHHHHHHHHHHHHHHH Confidence 1 13345567888 23599999999999876432 1223446667888 Q ss_pred HHhCCC Q lcl|NC_019918. 115 VIRDGR 120 (180) Q Consensus 115 ~l~g~~ 120 (180) ++.|+. T Consensus 152 ~l~g~~ 157 (157) T protein:vir:97 152 LQRGDT 157 (157) T ss_pred HhcCCC Confidence 888875 No 37 >protein:vir:78858 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285365;genbank:gi:148717893;genbank:GeneID:5246989 Probab=97.88 E-value=1.8e-08 Score=63.07 Aligned_cols=80 Identities=21% Similarity=0.340 Sum_probs=40.6 Q ss_pred ceeeehHHHHHHHHHHHHHh--------------------------hCC----EEEEEecccccCCCCCCC-----CCHH Q lcl|NC_019918. 28 FSFKTDRRRLTSLIKRVEAL--------------------------DGT----TVEVGFFPEDRYGSENGN-----LPVA 72 (180) Q Consensus 28 v~~k~~~~~l~~l~~~l~~l--------------------------~~~----~V~VGi~~~~~~~~~~~G-----~~vA 72 (180) |++ ++|++|++.|+.+ ... -|.-|.+.++=...-++| .+.+ T Consensus 1 i~~----~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~~~ 76 (115) T protein:vir:78 1 MNI----DGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITSHA 76 (115) T ss_pred Ccc----hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeecCc Confidence 222 2222222222221 100 111222221100000122 2347 Q ss_pred HHHHHHhcCCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019918. 73 QVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFE 113 (180) Q Consensus 73 ~iA~i~EfGt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~ 113 (180) ++|.+.||||...||||||+|+++.. +..+.+.++++++ T Consensus 77 ~Ya~~vE~GT~km~a~Pfl~PA~~~~--~~~~~~~i~~~~k 115 (115) T protein:vir:78 77 AYSGFLEFGTRYMEAEPFMWPVYEVI--RKSTVEELKALFE 115 (115) T ss_pred cchhhhcccccccCCCCchhhhHHHH--HHHHHHHHHHHhC Confidence 89999999999999999999999864 3355555555555 No 38 >protein:vir:96358 Length: 115 # NCBI annotation: ORF045 # Family: family:all:180 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239651;genbank:gi:66395408;genbank:GeneID:5132834 Probab=97.88 E-value=1.8e-08 Score=63.07 Aligned_cols=80 Identities=21% Similarity=0.340 Sum_probs=40.6 Q ss_pred ceeeehHHHHHHHHHHHHHh--------------------------hCC----EEEEEecccccCCCCCCC-----CCHH Q lcl|NC_019918. 28 FSFKTDRRRLTSLIKRVEAL--------------------------DGT----TVEVGFFPEDRYGSENGN-----LPVA 72 (180) Q Consensus 28 v~~k~~~~~l~~l~~~l~~l--------------------------~~~----~V~VGi~~~~~~~~~~~G-----~~vA 72 (180) |++ ++|++|++.|+.+ ... -|.-|.+.++=...-++| .+.+ T Consensus 1 i~~----~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~~~ 76 (115) T protein:vir:96 1 MNI----DGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITSHA 76 (115) T ss_pred Ccc----hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeecCc Confidence 222 2222222222221 100 111222221100000122 2347 Q ss_pred HHHHHHhcCCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019918. 73 QVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFE 113 (180) Q Consensus 73 ~iA~i~EfGt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~ 113 (180) ++|.+.||||...||||||+|+++.. +..+.+.++++++ T Consensus 77 ~Ya~~vE~GT~km~a~Pfl~PA~~~~--~~~~~~~i~~~~k 115 (115) T protein:vir:96 77 AYSGFLEFGTRYMEAEPFMWPVYEVI--RKSTVEELKALFE 115 (115) T ss_pred cchhhhcccccccCCCCchhhhHHHH--HHHHHHHHHHHhC Confidence 89999999999999999999999864 3355555555555 No 39 >protein:vir:96225 Length: 115 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239574;genbank:gi:66395330;genbank:GeneID:5132773 Probab=97.88 E-value=1.8e-08 Score=63.07 Aligned_cols=80 Identities=21% Similarity=0.340 Sum_probs=40.6 Q ss_pred ceeeehHHHHHHHHHHHHHh--------------------------hCC----EEEEEecccccCCCCCCC-----CCHH Q lcl|NC_019918. 28 FSFKTDRRRLTSLIKRVEAL--------------------------DGT----TVEVGFFPEDRYGSENGN-----LPVA 72 (180) Q Consensus 28 v~~k~~~~~l~~l~~~l~~l--------------------------~~~----~V~VGi~~~~~~~~~~~G-----~~vA 72 (180) |++ ++|++|++.|+.+ ... -|.-|.+.++=...-++| .+.+ T Consensus 1 i~~----~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~~~ 76 (115) T protein:vir:96 1 MNI----DGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITSHA 76 (115) T ss_pred Ccc----hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeecCc Confidence 222 2222222222221 100 111222221100000122 2347 Q ss_pred HHHHHHhcCCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019918. 73 QVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFE 113 (180) Q Consensus 73 ~iA~i~EfGt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~ 113 (180) ++|.+.||||...||||||+|+++.. +..+.+.++++++ T Consensus 77 ~Ya~~vE~GT~km~a~Pfl~PA~~~~--~~~~~~~i~~~~k 115 (115) T protein:vir:96 77 AYSGFLEFGTRYMEAEPFMWPVYEVI--RKSTVEELKALFE 115 (115) T ss_pred cchhhhcccccccCCCCchhhhHHHH--HHHHHHHHHHHhC Confidence 89999999999999999999999864 3355555555555 No 40 >protein:vir:97144 Length: 115 # NCBI annotation: ORF047 # Family: family:all:180 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239729;genbank:gi:66394911;genbank:GeneID:5130877 Probab=97.88 E-value=1.8e-08 Score=63.07 Aligned_cols=80 Identities=21% Similarity=0.340 Sum_probs=40.6 Q ss_pred ceeeehHHHHHHHHHHHHHh--------------------------hCC----EEEEEecccccCCCCCCC-----CCHH Q lcl|NC_019918. 28 FSFKTDRRRLTSLIKRVEAL--------------------------DGT----TVEVGFFPEDRYGSENGN-----LPVA 72 (180) Q Consensus 28 v~~k~~~~~l~~l~~~l~~l--------------------------~~~----~V~VGi~~~~~~~~~~~G-----~~vA 72 (180) |++ ++|++|++.|+.+ ... -|.-|.+.++=...-++| .+.+ T Consensus 1 i~~----~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~~~ 76 (115) T protein:vir:97 1 MNI----DGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITSHA 76 (115) T ss_pred Ccc----hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeecCc Confidence 222 2222222222221 100 111222221100000122 2347 Q ss_pred HHHHHHhcCCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019918. 73 QVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFE 113 (180) Q Consensus 73 ~iA~i~EfGt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~ 113 (180) ++|.+.||||...||||||+|+++.. +..+.+.++++++ T Consensus 77 ~Ya~~vE~GT~km~a~Pfl~PA~~~~--~~~~~~~i~~~~k 115 (115) T protein:vir:97 77 AYSGFLEFGTRYMEAEPFMWPVYEVI--RKSTVEELKALFE 115 (115) T ss_pred cchhhhcccccccCCCCchhhhHHHH--HHHHHHHHHHHhC Confidence 89999999999999999999999864 3355555555555 No 41 >protein:vir:9312 Length: 115 # NCBI annotation: phi Mu50B-like protein # Family: family:all:180 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803290;genbank:gi:29028600;genbank:GeneID:1258048 Probab=97.88 E-value=1.8e-08 Score=63.07 Aligned_cols=80 Identities=21% Similarity=0.340 Sum_probs=40.6 Q ss_pred ceeeehHHHHHHHHHHHHHh--------------------------hCC----EEEEEecccccCCCCCCC-----CCHH Q lcl|NC_019918. 28 FSFKTDRRRLTSLIKRVEAL--------------------------DGT----TVEVGFFPEDRYGSENGN-----LPVA 72 (180) Q Consensus 28 v~~k~~~~~l~~l~~~l~~l--------------------------~~~----~V~VGi~~~~~~~~~~~G-----~~vA 72 (180) |++ ++|++|++.|+.+ ... -|.-|.+.++=...-++| .+.+ T Consensus 1 i~~----~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~~~ 76 (115) T protein:vir:93 1 MNI----DGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITSHA 76 (115) T ss_pred Ccc----hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeecCc Confidence 222 2222222222221 100 111222221100000122 2347 Q ss_pred HHHHHHhcCCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019918. 73 QVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFE 113 (180) Q Consensus 73 ~iA~i~EfGt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~ 113 (180) ++|.+.||||...||||||+|+++.. +..+.+.++++++ T Consensus 77 ~Ya~~vE~GT~km~a~Pfl~PA~~~~--~~~~~~~i~~~~k 115 (115) T protein:vir:93 77 AYSGFLEFGTRYMEAEPFMWPVYEVI--RKSTVEELKALFE 115 (115) T ss_pred cchhhhcccccccCCCCchhhhHHHH--HHHHHHHHHHHhC Confidence 89999999999999999999999864 3355555555555 No 42 >protein:vir:103917 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873996;genbank:gi:118430771;genbank:GeneID:4525409 Probab=97.88 E-value=1.8e-08 Score=63.07 Aligned_cols=80 Identities=21% Similarity=0.340 Sum_probs=40.6 Q ss_pred ceeeehHHHHHHHHHHHHHh--------------------------hCC----EEEEEecccccCCCCCCC-----CCHH Q lcl|NC_019918. 28 FSFKTDRRRLTSLIKRVEAL--------------------------DGT----TVEVGFFPEDRYGSENGN-----LPVA 72 (180) Q Consensus 28 v~~k~~~~~l~~l~~~l~~l--------------------------~~~----~V~VGi~~~~~~~~~~~G-----~~vA 72 (180) |++ ++|++|++.|+.+ ... -|.-|.+.++=...-++| .+.+ T Consensus 1 i~~----~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~~~ 76 (115) T protein:vir:10 1 MNI----DGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITSHA 76 (115) T ss_pred Ccc----hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeecCc Confidence 222 2222222222221 100 111222221100000122 2347 Q ss_pred HHHHHHhcCCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019918. 73 QVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFE 113 (180) Q Consensus 73 ~iA~i~EfGt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~ 113 (180) ++|.+.||||...||||||+|+++.. +..+.+.++++++ T Consensus 77 ~Ya~~vE~GT~km~a~Pfl~PA~~~~--~~~~~~~i~~~~k 115 (115) T protein:vir:10 77 AYSGFLEFGTRYMEAEPFMWPVYEVI--RKSTVEELKALFE 115 (115) T ss_pred cchhhhcccccccCCCCchhhhHHHH--HHHHHHHHHHHhC Confidence 89999999999999999999999864 3355555555555 No 43 >protein:vir:106623 Length: 115 # NCBI annotation: ORF049 # Family: family:all:180 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239497;genbank:gi:66395260;genbank:GeneID:4555777 Probab=97.86 E-value=1.5e-08 Score=63.44 Aligned_cols=80 Identities=20% Similarity=0.294 Sum_probs=39.4 Q ss_pred ceeeehHHHHHHHHHHHHHhhC--------------------------C----EEEEEecccccCCCCCCC-----CCHH Q lcl|NC_019918. 28 FSFKTDRRRLTSLIKRVEALDG--------------------------T----TVEVGFFPEDRYGSENGN-----LPVA 72 (180) Q Consensus 28 v~~k~~~~~l~~l~~~l~~l~~--------------------------~----~V~VGi~~~~~~~~~~~G-----~~vA 72 (180) |+++ +|++|.+.|+.+.+ . -|.-|-+.+.-....++| .+.+ T Consensus 1 i~i~----Gld~L~~~l~~~~~~~~~~~~~al~~~~~~i~~~a~~~a~~~~~~pv~TG~Lr~sI~~~~~g~~~~~v~~~~ 76 (115) T protein:vir:10 1 MQSK----GLKKLMNHLKVMHDDIEDDVDDILKNNAKEGVGIAVSNAKEVMNKGYWTGNLASLIEVKKIGDLHYRVISTA 76 (115) T ss_pred Ceeh----hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcchhhhhceeeeecCcEEEEeeCCC Confidence 2222 22222222222211 0 011122221100011122 2347 Q ss_pred HHHHHHhcCCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019918. 73 QVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFE 113 (180) Q Consensus 73 ~iA~i~EfGt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~ 113 (180) .+|.+.||||...|+||||+|+++..+ ..+.+.+++++. T Consensus 77 ~Ya~~vEfGT~km~a~PFl~PA~~~~k--~~~~~~i~~~i~ 115 (115) T protein:vir:10 77 HYSGFLEFGTRYMEPAPFMFPTYQTLK--KSTINDLKRLLS 115 (115) T ss_pred ccchheecccccCCCCCchhhhHHHHH--HHHHHHHHHHhC Confidence 799999999999999999999998643 344444444444 No 44 >protein:vir:9414 Length: 125 # NCBI annotation: phi PVL orf 11-like protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803392;genbank:gi:29028704;genbank:GeneID:1258141 Probab=97.84 E-value=4.6e-08 Score=60.83 Aligned_cols=111 Identities=19% Similarity=0.098 Sum_probs=45.1 Q ss_pred CCCCcccccchhhHHHHHhhhheecccceeeehHHHHHHHHHHHHHhhC----C-----EEEEEecccccCCCC---CCC Q lcl|NC_019918. 1 MQDGRSFTTSATPVLKTLALGVIILASFSFKTDRRRLTSLIKRVEALDG----T-----TVEVGFFPEDRYGSE---NGN 68 (180) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~M~~~v~~k~~~~~l~~l~~~l~~l~~----~-----~V~VGi~~~~~~~~~---~~G 68 (180) |.---+|+ -.--.|+.|..-++ ...-+..+.+-+-+.+.|+.-.. . .|.|+=+.......+ .-| T Consensus 1 M~v~v~~~-~L~~~l~~l~~~~~---k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG 76 (125) T protein:vir:94 1 MGARIESN-NIEQGLKNAVLKMN---LNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIG 76 (125) T ss_pred CeeEeeHH-HHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEec Confidence 21111110 00111221111110 00000000111111122221110 0 222211000000000 000 Q ss_pred --CCHHHHHHHHhcCCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019918. 69 --LPVAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIR 117 (180) Q Consensus 69 --~~vA~iA~i~EfGt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~ 117 (180) -+.+.+|-+.||||.++||+||+|+++++. ++++.+.+...++++.. T Consensus 77 ~~k~~~~~a~F~E~GT~k~~a~pF~~~a~~~~--~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:94 77 YAKGVSHRIHATEFGTMYQKPQLFITKTEKQG--KNKVLKTMLDTAKRLQK 125 (125) T ss_pred cCCCCceEEEeccCCccCCCCCchhhHHHHHh--HHHHHHHHHHHHHHHhC Confidence 112356778999999999999999999875 45677888888888776 No 45 >protein:vir:79988 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430006;genbank:gi:156604061;genbank:GeneID:5525448 Probab=97.84 E-value=4.6e-08 Score=60.83 Aligned_cols=111 Identities=19% Similarity=0.098 Sum_probs=45.1 Q ss_pred CCCCcccccchhhHHHHHhhhheecccceeeehHHHHHHHHHHHHHhhC----C-----EEEEEecccccCCCC---CCC Q lcl|NC_019918. 1 MQDGRSFTTSATPVLKTLALGVIILASFSFKTDRRRLTSLIKRVEALDG----T-----TVEVGFFPEDRYGSE---NGN 68 (180) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~M~~~v~~k~~~~~l~~l~~~l~~l~~----~-----~V~VGi~~~~~~~~~---~~G 68 (180) |.---+|+ -.--.|+.|..-++ ...-+..+.+-+-+.+.|+.-.. . .|.|+=+.......+ .-| T Consensus 1 M~v~v~~~-~L~~~l~~l~~~~~---k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG 76 (125) T protein:vir:79 1 MGARIESN-NIEQGLKNAVLKMN---LNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIG 76 (125) T ss_pred CeeEeeHH-HHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEec Confidence 21111110 00111221111110 00000000111111122221110 0 222211000000000 000 Q ss_pred --CCHHHHHHHHhcCCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019918. 69 --LPVAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIR 117 (180) Q Consensus 69 --~~vA~iA~i~EfGt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~ 117 (180) -+.+.+|-+.||||.++||+||+|+++++. ++++.+.+...++++.. T Consensus 77 ~~k~~~~~a~F~E~GT~k~~a~pF~~~a~~~~--~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:79 77 YAKGVSHRIHATEFGTMYQKPQLFITKTEKQG--KNKVLKTMLDTAKRLQK 125 (125) T ss_pred cCCCCceEEEeccCCccCCCCCchhhHHHHHh--HHHHHHHHHHHHHHHhC Confidence 112356778999999999999999999875 45677888888888776 No 46 >protein:vir:4704 Length: 125 # NCBI annotation: phi PVL ORF 11 homologue # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061636;genbank:gi:9635723;genbank:GeneID:1262995 Probab=97.84 E-value=4.6e-08 Score=60.83 Aligned_cols=111 Identities=19% Similarity=0.098 Sum_probs=45.1 Q ss_pred CCCCcccccchhhHHHHHhhhheecccceeeehHHHHHHHHHHHHHhhC----C-----EEEEEecccccCCCC---CCC Q lcl|NC_019918. 1 MQDGRSFTTSATPVLKTLALGVIILASFSFKTDRRRLTSLIKRVEALDG----T-----TVEVGFFPEDRYGSE---NGN 68 (180) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~M~~~v~~k~~~~~l~~l~~~l~~l~~----~-----~V~VGi~~~~~~~~~---~~G 68 (180) |.---+|+ -.--.|+.|..-++ ...-+..+.+-+-+.+.|+.-.. . .|.|+=+.......+ .-| T Consensus 1 M~v~v~~~-~L~~~l~~l~~~~~---k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG 76 (125) T protein:vir:47 1 MGARIESN-NIEQGLKNAVLKMN---LNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIG 76 (125) T ss_pred CeeEeeHH-HHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEec Confidence 21111110 00111221111110 00000000111111122221110 0 222211000000000 000 Q ss_pred --CCHHHHHHHHhcCCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019918. 69 --LPVAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIR 117 (180) Q Consensus 69 --~~vA~iA~i~EfGt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~ 117 (180) -+.+.+|-+.||||.++||+||+|+++++. ++++.+.+...++++.. T Consensus 77 ~~k~~~~~a~F~E~GT~k~~a~pF~~~a~~~~--~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:47 77 YAKGVSHRIHATEFGTMYQKPQLFITKTEKQG--KNKVLKTMLDTAKRLQK 125 (125) T ss_pred cCCCCceEEEeccCCccCCCCCchhhHHHHHh--HHHHHHHHHHHHHHHhC Confidence 112356778999999999999999999875 45677888888888776 No 47 >protein:vir:98342 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918934;genbank:gi:119443696;genbank:GeneID:4594504 Probab=97.84 E-value=4.6e-08 Score=60.83 Aligned_cols=111 Identities=19% Similarity=0.098 Sum_probs=45.1 Q ss_pred CCCCcccccchhhHHHHHhhhheecccceeeehHHHHHHHHHHHHHhhC----C-----EEEEEecccccCCCC---CCC Q lcl|NC_019918. 1 MQDGRSFTTSATPVLKTLALGVIILASFSFKTDRRRLTSLIKRVEALDG----T-----TVEVGFFPEDRYGSE---NGN 68 (180) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~M~~~v~~k~~~~~l~~l~~~l~~l~~----~-----~V~VGi~~~~~~~~~---~~G 68 (180) |.---+|+ -.--.|+.|..-++ ...-+..+.+-+-+.+.|+.-.. . .|.|+=+.......+ .-| T Consensus 1 M~v~v~~~-~L~~~l~~l~~~~~---k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG 76 (125) T protein:vir:98 1 MGARIESN-NIEQGLKNAVLKMN---LNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIG 76 (125) T ss_pred CeeEeeHH-HHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEec Confidence 21111110 00111221111110 00000000111111122221110 0 222211000000000 000 Q ss_pred --CCHHHHHHHHhcCCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019918. 69 --LPVAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIR 117 (180) Q Consensus 69 --~~vA~iA~i~EfGt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~ 117 (180) -+.+.+|-+.||||.++||+||+|+++++. ++++.+.+...++++.. T Consensus 77 ~~k~~~~~a~F~E~GT~k~~a~pF~~~a~~~~--~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:98 77 YAKGVSHRIHATEFGTMYQKPQLFITKTEKQG--KNKVLKTMLDTAKRLQK 125 (125) T ss_pred cCCCCceEEEeccCCccCCCCCchhhHHHHHh--HHHHHHHHHHHHHHHhC Confidence 112356778999999999999999999875 45677888888888776 No 48 >protein:vir:81106 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429878;genbank:gi:156603931;genbank:GeneID:5525326 Probab=97.84 E-value=4.6e-08 Score=60.83 Aligned_cols=111 Identities=19% Similarity=0.098 Sum_probs=45.1 Q ss_pred CCCCcccccchhhHHHHHhhhheecccceeeehHHHHHHHHHHHHHhhC----C-----EEEEEecccccCCCC---CCC Q lcl|NC_019918. 1 MQDGRSFTTSATPVLKTLALGVIILASFSFKTDRRRLTSLIKRVEALDG----T-----TVEVGFFPEDRYGSE---NGN 68 (180) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~M~~~v~~k~~~~~l~~l~~~l~~l~~----~-----~V~VGi~~~~~~~~~---~~G 68 (180) |.---+|+ -.--.|+.|..-++ ...-+..+.+-+-+.+.|+.-.. . .|.|+=+.......+ .-| T Consensus 1 M~v~v~~~-~L~~~l~~l~~~~~---k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG 76 (125) T protein:vir:81 1 MGARIESN-NIEQGLKNAVLKMN---LNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIG 76 (125) T ss_pred CeeEeeHH-HHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEec Confidence 21111110 00111221111110 00000000111111122221110 0 222211000000000 000 Q ss_pred --CCHHHHHHHHhcCCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019918. 69 --LPVAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIR 117 (180) Q Consensus 69 --~~vA~iA~i~EfGt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~ 117 (180) -+.+.+|-+.||||.++||+||+|+++++. ++++.+.+...++++.. T Consensus 77 ~~k~~~~~a~F~E~GT~k~~a~pF~~~a~~~~--~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:81 77 YAKGVSHRIHATEFGTMYQKPQLFITKTEKQG--KNKVLKTMLDTAKRLQK 125 (125) T ss_pred cCCCCceEEEeccCCccCCCCCchhhHHHHHh--HHHHHHHHHHHHHHHhC Confidence 112356778999999999999999999875 45677888888888776 No 49 >protein:vir:3873 Length: 128 # NCBI annotation: putative head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680490;swissprot:trembl:p94214;genbank:gi:22296530;interpro:IPR010064;uniprot:P94214;genbank:GeneID:951688 Probab=97.80 E-value=5.8e-08 Score=60.27 Aligned_cols=85 Identities=16% Similarity=0.140 Sum_probs=42.0 Q ss_pred hhheeccc----------------ceeeehHHHHHHHHHHHHHhhC---------------------------CEEEEEe Q lcl|NC_019918. 20 LGVIILAS----------------FSFKTDRRRLTSLIKRVEALDG---------------------------TTVEVGF 56 (180) Q Consensus 20 ~~~~M~~~----------------v~~k~~~~~l~~l~~~l~~l~~---------------------------~~V~VGi 56 (180) |+|++..- +.-+.....-+.+.+.++.... ..+.||+ T Consensus 1 m~v~i~Gl~el~~~l~~l~~~~~k~~~~al~~ga~~~~~~~k~~ap~~~~~~~~~~h~~d~I~~~~~k~~~g~~~~~VG~ 80 (128) T protein:vir:38 1 MGVKVTGDAELLANLNKLQFGVAKEARAAVRDGAQKFADKLKSNTPEWDGETDMSGHLRDDIKLSSVRETSGLTEVDVGY 80 (128) T ss_pred CccchhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCCCcccchhhhhhccccccccCceeEEEeee Confidence 33332210 0000000011111111111110 1133333 Q ss_pred cccccCCCCCCCCCHHHHHHHHhcCCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019918. 57 FPEDRYGSENGNLPVAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIR 117 (180) Q Consensus 57 ~~~~~~~~~~~G~~vA~iA~i~EfGt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~ 117 (180) . -+.+.++.+.||||.+.||+|||++++++. +.++.+.+.+.+++.+- T Consensus 81 ~-----------k~~~~y~~f~E~GT~k~~a~pF~~pa~~~~--~~~~~~~~~~~l~k~i~ 128 (128) T protein:vir:38 81 G-----------KDTGWRAHFPNSGTSMQDPQHFIEETQEIM--RPVVIAAFLSHLKEGGM 128 (128) T ss_pred c-----------CCCceEEeeeccCccCCCCCcchhHHHHHh--HHHHHHHHHHHHHhhcC Confidence 1 122567889999999999999999999875 44566666666665544 No 50 >protein:vir:3617 Length: 112 # NCBI annotation: ORF40 # Family: family:all:180 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112703;genbank:gi:13786571;genbank:GeneID:921069 Probab=97.79 E-value=3.7e-08 Score=61.34 Aligned_cols=84 Identities=23% Similarity=0.442 Sum_probs=41.4 Q ss_pred ecccceeeehHHHHHHHHHHHHHhhCC----------------------EEEEEecccccCC-CCCCC-----CCHHHHH Q lcl|NC_019918. 24 ILASFSFKTDRRRLTSLIKRVEALDGT----------------------TVEVGFFPEDRYG-SENGN-----LPVAQVA 75 (180) Q Consensus 24 M~~~v~~k~~~~~l~~l~~~l~~l~~~----------------------~V~VGi~~~~~~~-~~~~G-----~~vA~iA 75 (180) |...+.++ +|+++++.|+.+... -|.=|-+.+.-.. ..++| .+.+.+| T Consensus 1 M~~~i~i~----Gld~l~~~L~~~~~~~~~~~al~~~~~~i~~~ak~~aPvdTG~Lr~si~~~~~~~~~~~~V~~~~~Ya 76 (112) T protein:vir:36 1 MKSSLSFK----GIDQLVKHLDKAASLKGVQQVVKSNTSNMTANMQKLVPVDTGYMKRSIKMELTEGGFSGQAGPHTDYS 76 (112) T ss_pred Cceeeeeh----hHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhCCCCchhhhhceeeeecCCceEEEeecCCCcc Confidence 33333222 222222222221110 1112222111000 01122 2357799 Q ss_pred HHHhcCCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019918. 76 AYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFE 113 (180) Q Consensus 76 ~i~EfGt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~ 113 (180) .+.||||...|+||||||+++..+ ..+.+.+++.++ T Consensus 77 ~~vE~GT~k~~a~Pfl~pa~~~~~--~~~~~~i~~~lr 112 (112) T protein:vir:36 77 AYVEYGTRFQSAQPFVKPAYNEQK--GVFIKDLERLLK 112 (112) T ss_pred ceeeccccccCCCcchhhhHHHHH--HHHHHHHHHHcC Confidence 999999999999999999998753 344555554444 No 51 >protein:vir:79091 Length: 175 # NCBI annotation: gp5, phage virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111205;genbank:gi:134288802;genbank:GeneID:4960765 Probab=97.78 E-value=1.5e-07 Score=58.06 Aligned_cols=93 Identities=13% Similarity=0.164 Sum_probs=46.6 Q ss_pred ecccceeeehHHHHHHHHHHHHHh-hCCE-E--EEE----------eccc------------------------------ Q lcl|NC_019918. 24 ILASFSFKTDRRRLTSLIKRVEAL-DGTT-V--EVG----------FFPE------------------------------ 59 (180) Q Consensus 24 M~~~v~~k~~~~~l~~l~~~l~~l-~~~~-V--~VG----------i~~~------------------------------ 59 (180) |...++|+.+...+.+.+++|... .+.. + .|| |-.+ T Consensus 1 Ms~~i~i~~d~~~~~~~L~~l~~~~~d~~~lm~~Ig~~l~~~t~~rF~~~~~PdW~pls~~t~~~r~~~~~~~~~~~~~~ 80 (175) T protein:vir:79 1 MSDFVNFQIDDSALRTRLLQLEQAGHQKADAMRKITQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKNGELT 80 (175) T ss_pred CceEEEEEechHHHHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCChHHHHhhccccccccccccch Confidence 555566666654433333333211 1100 0 000 0000 Q ss_pred ----------------------ccCCCCCC----CCCHHHHHHHHhcCCC-------CCCCCcchhhHHHHH-----HHH Q lcl|NC_019918. 60 ----------------------DRYGSENG----NLPVAQVAAYNEFGTT-------RNPTRPFMAPTFEEF-----TSQ 101 (180) Q Consensus 60 ----------------------~~~~~~~~----G~~vA~iA~i~EfGt~-------~IP~RpFlr~~~~~~-----~~~ 101 (180) =+|...++ |+ +..||++|+||.. +||+|||| ++++. +.. T Consensus 81 ~~~~~~~~~~~~L~~tG~L~~Si~~~~~~~~v~vGt-n~~YAaiHqfGg~~~~~~~v~IPARPfL--G~s~~de~~~~~~ 157 (175) T protein:vir:79 81 AAASRRKAGLMILQDSGQMAASTATDSGEDYSVIGS-NKEYAAIQHFGGQAGRGLKVTIPGRAWL--PVTADGELQPEAV 157 (175) T ss_pred hhHhhhccCCCcceechhhhhhhhheecCCEEEEec-CcchhhHhhcccccCCCcccccCccccc--CCCcccchhHHHH Confidence 00000112 23 2468999999964 79999999 44321 112 Q ss_pred HHHHHHHHHHHHHHHhCC Q lcl|NC_019918. 102 FHYARLMKSTFENVIRDG 119 (180) Q Consensus 102 ~~~~~~~~~~~~~~l~g~ 119 (180) +.+.+.+...++.++.+. T Consensus 158 ~~I~~~i~~~l~~a~~~~ 175 (175) T protein:vir:79 158 EPVLNTILRHLMDAANRR 175 (175) T ss_pred HHHHHHHHHHHHHHhccC Confidence 457777777777777776 No 52 >protein:vir:2740 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695113;genbank:gi:23455882;genbank:GeneID:955595 Probab=97.75 E-value=4.5e-08 Score=60.84 Aligned_cols=84 Identities=17% Similarity=0.286 Sum_probs=41.9 Q ss_pred ecccceeeehHHHHHHHHHHHHHhhCC----------------------EEEEEecccccC-----CCCCCC---CCHHH Q lcl|NC_019918. 24 ILASFSFKTDRRRLTSLIKRVEALDGT----------------------TVEVGFFPEDRY-----GSENGN---LPVAQ 73 (180) Q Consensus 24 M~~~v~~k~~~~~l~~l~~~l~~l~~~----------------------~V~VGi~~~~~~-----~~~~~G---~~vA~ 73 (180) |. +|++. +|++|++.|+.+.+. ...+++..+.-- ...++| .+.+. T Consensus 1 Ma-~i~~~----Gld~l~~~L~~~~~~~~v~~~~~~~~~~~~~~~~~~a~~~~p~~TG~Lr~sI~~~~~~~~~~V~~~~~ 75 (114) T protein:vir:27 1 MA-TIEFE----GLDEMAQSLLKNASPEKRSKVLRKYGSKLKEAAVNRAQFNKGYSTGATRRSITLQVESDKATVEALTS 75 (114) T ss_pred Ce-eeeee----hHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcccCCCCCchhhhhceeeeecCCeeEecCCCC Confidence 33 23333 233333333322110 011111111100 001222 23477 Q ss_pred HHHHHhcCCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019918. 74 VAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFEN 114 (180) Q Consensus 74 iA~i~EfGt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~ 114 (180) +|.++||||...||||||||+++..+ ..+.+.+++.++- T Consensus 76 Ya~~vEfGT~km~a~Pfl~PA~~~~~--~~~~~~l~~l~k~ 114 (114) T protein:vir:27 76 YSGYLEVGTRKMEAQPFMKPALDEVA--PKMVEELAKWDET 114 (114) T ss_pred ccceecccccccCCCCchhhhHHHHH--HHHHHHHHHHhcC Confidence 99999999999999999999998753 3444444444443 No 53 >protein:vir:4906 Length: 114 # NCBI annotation: gp114 # Family: family:all:180 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056684;genbank:gi:9635019;genbank:GeneID:1262668 Probab=97.75 E-value=4.5e-08 Score=60.84 Aligned_cols=84 Identities=17% Similarity=0.286 Sum_probs=41.9 Q ss_pred ecccceeeehHHHHHHHHHHHHHhhCC----------------------EEEEEecccccC-----CCCCCC---CCHHH Q lcl|NC_019918. 24 ILASFSFKTDRRRLTSLIKRVEALDGT----------------------TVEVGFFPEDRY-----GSENGN---LPVAQ 73 (180) Q Consensus 24 M~~~v~~k~~~~~l~~l~~~l~~l~~~----------------------~V~VGi~~~~~~-----~~~~~G---~~vA~ 73 (180) |. +|++. +|++|++.|+.+.+. ...+++..+.-- ...++| .+.+. T Consensus 1 Ma-~i~~~----Gld~l~~~L~~~~~~~~v~~~~~~~~~~~~~~~~~~a~~~~p~~TG~Lr~sI~~~~~~~~~~V~~~~~ 75 (114) T protein:vir:49 1 MA-TIEFE----GLDEMAQSLLKNASPEKRSKVLRKYGSKLKEAAVNRAQFNKGYSTGATRRSITLQVESDKATVEALTS 75 (114) T ss_pred Ce-eeeee----hHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcccCCCCCchhhhhceeeeecCCeeEecCCCC Confidence 33 23333 233333333322110 011111111100 001222 23477 Q ss_pred HHHHHhcCCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019918. 74 VAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFEN 114 (180) Q Consensus 74 iA~i~EfGt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~ 114 (180) +|.++||||...||||||||+++..+ ..+.+.+++.++- T Consensus 76 Ya~~vEfGT~km~a~Pfl~PA~~~~~--~~~~~~l~~l~k~ 114 (114) T protein:vir:49 76 YSGYLEVGTRKMEAQPFMKPALDEVA--PKMVEELAKWDET 114 (114) T ss_pred ccceecccccccCCCCchhhhHHHHH--HHHHHHHHHHhcC Confidence 99999999999999999999998753 3444444444443 No 54 >protein:vir:5745 Length: 135 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892056;genbank:gi:33770519;interpro:IPR010064;interpro:IPR011693;uniprot:Q7Y404;genbank:GeneID:2637451 Probab=97.70 E-value=1.4e-07 Score=58.24 Aligned_cols=92 Identities=17% Similarity=0.326 Sum_probs=38.8 Q ss_pred ecccceeeehHHHHHHHHHHHHHhhCC-----------------------EEEE------Eeccccc---CCCCCC---- Q lcl|NC_019918. 24 ILASFSFKTDRRRLTSLIKRVEALDGT-----------------------TVEV------GFFPEDR---YGSENG---- 67 (180) Q Consensus 24 M~~~v~~k~~~~~l~~l~~~l~~l~~~-----------------------~V~V------Gi~~~~~---~~~~~~---- 67 (180) |...++|+ +|++|++.|+.|... .+-| |-..+.- .....+ T Consensus 1 M~~~~~i~----Gl~el~~~l~~L~~~~~~k~~~~Al~~~a~~v~~~~k~~ap~~~~~~~g~l~~~I~i~~~k~~~~~~~ 76 (135) T protein:vir:57 1 MIPEIEIS----GLQELERRLIAVGEEVGTKILRDAGRAAMAVVEADMKQNAGYDNSSTNAHMRDSIKIRSSRGKAGSTV 76 (135) T ss_pred Cceeeeeh----hHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhHHhhccccccccccccee Confidence 44444444 222333322222110 0000 0000000 000001 Q ss_pred -----CCCHHH--HHHHHhcCCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHH Q lcl|NC_019918. 68 -----NLPVAQ--VAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGR 131 (180) Q Consensus 68 -----G~~vA~--iA~i~EfGt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~ 131 (180) |.+-.. ++.+.||||.+.||||||+|++++.+ .++.+.+.+.+.+. |++++. T Consensus 77 v~v~vg~~~~~~~~~~f~E~GT~~~~a~PF~~pa~~~~~--~~~~~~~~~~~~~~----------l~ka~r 135 (135) T protein:vir:57 77 VVLRVGPTRSHYMKALAQEFGTIKQVAKPFIRPALDYNK--MQVLRILTVEIRDG----------LSTLSR 135 (135) T ss_pred EEEEecCCCCcceeEeecccCCCCCCCCcchhHhHHHhH--HHHHHHHHHHHHHH----------HHHhcC Confidence 111122 23345999999999999999998753 34444444444332 333333 No 55 >protein:vir:9930 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795692;genbank:gi:28876456;genbank:GeneID:1257995 Probab=97.70 E-value=6.5e-08 Score=59.98 Aligned_cols=99 Identities=15% Similarity=0.196 Sum_probs=42.6 Q ss_pred CCCcccccchhhHHHHHhhhheecccceeeehHHHHHH----HHHHHHHhhCCEEEEEeccccc-CCCCCC----CCCHH Q lcl|NC_019918. 2 QDGRSFTTSATPVLKTLALGVIILASFSFKTDRRRLTS----LIKRVEALDGTTVEVGFFPEDR-YGSENG----NLPVA 72 (180) Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~M~~~v~~k~~~~~l~~----l~~~l~~l~~~~V~VGi~~~~~-~~~~~~----G~~vA 72 (180) -+|-. ...-.|+.+.= .+. +...+.+.+ +.+..+.+.. |.-|.+.++- ....++ =.+.+ T Consensus 1 i~Gld---~l~~~l~~~~~------~~~-~~v~~al~~~a~~i~~~ak~~aP--v~TG~Lr~sI~~~~~~~~~~~v~~~~ 68 (108) T protein:vir:99 1 MRGLD---RFLRSVERKQK------SVR-IAVDKELSKSAARIERQAKILAP--VDTGWLRAQIYSEQQRLLHYRVVSPA 68 (108) T ss_pred CchHH---HHHHHHHHHHH------HHH-HHHHHHHHHHHHHHHHHHHhcCC--cCchhhhcceeeeecCcEEEEeecCc Confidence 00000 00001111110 000 000111111 2222222221 1222221110 000001 12357 Q ss_pred HHHHHHhcCCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019918. 73 QVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFEN 114 (180) Q Consensus 73 ~iA~i~EfGt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~ 114 (180) .+|.+.||||...|+||||+|+++..+ ..+.+.+++.+++ T Consensus 69 ~Ya~~vE~GT~~m~a~Pf~~pa~~~~~--~~~~~~i~~~lrk 108 (108) T protein:vir:99 69 LYSIYLELGTRKMEAQSFLDPALRKEW--PVLMANIKKMFKR 108 (108) T ss_pred ccchhcccCccccCCCcchhhhHHHHH--HHHHHHHHHHhcC Confidence 899999999999999999999998754 3566666666665 No 56 >protein:vir:105089 Length: 133 # NCBI annotation: Gp11 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006591;genbank:gi:46402097;genbank:GeneID:2777955 Probab=97.70 E-value=8e-08 Score=59.48 Aligned_cols=89 Identities=19% Similarity=0.288 Sum_probs=43.4 Q ss_pred ecccceeeehHHHHHHHHHHHHHhhCC----------------------------------EEEEEec--ccccCCCCCC Q lcl|NC_019918. 24 ILASFSFKTDRRRLTSLIKRVEALDGT----------------------------------TVEVGFF--PEDRYGSENG 67 (180) Q Consensus 24 M~~~v~~k~~~~~l~~l~~~l~~l~~~----------------------------------~V~VGi~--~~~~~~~~~~ 67 (180) |. .++|+ +|++|.+.|+.|... .+...|. .......... T Consensus 1 M~-~~~i~----Gl~el~~~l~~L~~~~~~k~~~~Al~~~a~~i~~~ak~~ap~~~~~~~~~~~~~I~v~~~~~~~~~~~ 75 (133) T protein:vir:10 1 MI-RMEVK----GLDELERQLTALGEKVATKVLRDAGREALKVVEEDMKQHAGFDETSTGQHMRDSIKIRSSTRKAQGNA 75 (133) T ss_pred Ce-eEeee----hHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCcchhhhhhcccccccccccCccc Confidence 22 22222 222222222222110 0111110 0000000000 Q ss_pred ------CCC--HHHHHHHHhcCCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhCC Q lcl|NC_019918. 68 ------NLP--VAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRDG 119 (180) Q Consensus 68 ------G~~--vA~iA~i~EfGt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g~ 119 (180) |.+ ...++.+.||||.+.||||||+|++++. ++++.+.+.+.+.+.+... T Consensus 76 ~~~v~vg~~~~~~~y~~f~E~GT~k~~a~PF~~pA~~~~--~~~~~~~~~~~~~~~l~K~ 133 (133) T protein:vir:10 76 VVTLRVGPSKQHHMKVLAQEFGTVKQVADPFIRPALDYN--VQTVLRVLTVEIRNGIQNR 133 (133) T ss_pred eEEEEecCCCCccceEeeeccCCCCCCCCccchHHHHHh--HHHHHHHHHHHHHHHhhcC Confidence 111 1223445599999999999999999875 4567788888888887776 No 57 >protein:vir:2026 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046769;genbank:gi:9630340;genbank:GeneID:1261511 Probab=97.67 E-value=1.1e-07 Score=58.64 Aligned_cols=91 Identities=13% Similarity=0.264 Sum_probs=50.0 Q ss_pred CCCCcccccchhhHHHHHh--hhheecccceeeehHHHHHHHHHHH-HHhhCCEEEEEecccccCCCCCCCCCHHHHHHH Q lcl|NC_019918. 1 MQDGRSFTTSATPVLKTLA--LGVIILASFSFKTDRRRLTSLIKRV-EALDGTTVEVGFFPEDRYGSENGNLPVAQVAAY 77 (180) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~--~~~~M~~~v~~k~~~~~l~~l~~~l-~~l~~~~V~VGi~~~~~~~~~~~G~~vA~iA~i 77 (180) --||..|.-+....++.-. -+-.|.-. - .+...| -..+...+.|||..+. ++.||++ T Consensus 47 ~PdG~~W~p~k~~~~~~k~g~~~~~l~~~-------~---~l~~sl~~~~~~~~~~vg~~~Gs----------~~~yAa~ 106 (150) T protein:vir:20 47 APDGTPYAPRQQQSVRKKTGRVKRKMFAK-------L---ITSRFLHIRASPEQASMEFYGGK----------SPKIASV 106 (150) T ss_pred CCCCCCCcccchHHHHHhccCCCccccch-------h---hhhhhhheeecCcEEEEEeeCCc----------chhhhhh Confidence 5578877655443332211 00011100 0 122222 1235678999987442 4679999 Q ss_pred HhcCC----------CCCCCCcchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019918. 78 NEFGT----------TRNPTRPFMAPTFEEFTSQFHYARLMKSTFEN 114 (180) Q Consensus 78 ~EfGt----------~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~ 114 (180) |.||- .+||+|||| ++.+. .++++.+.+...+.+ T Consensus 107 HQfG~~~~~~~~~~~~~iPaRp~L--G~s~~-d~~~i~~~i~~~l~k 150 (150) T protein:vir:20 107 HQFGLSEENRKDGKKIDYPARPLL--GFTGE-DVQMIEEIILAHLER 150 (150) T ss_pred hhcccccccccCCCceeccccccC--CCCHH-HHHHHHHHHHHHHhC Confidence 99993 379999999 55442 344566655555555 No 58 >protein:vir:99744 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004311;genbank:gi:122891765;genbank:GeneID:4712299 Probab=97.63 E-value=7.3e-08 Score=59.70 Aligned_cols=80 Identities=20% Similarity=0.297 Sum_probs=40.1 Q ss_pred ceeeehHHHHHHHHHHHHHh--------------------------hCC----EEEEEecccccCCCCCCC-----CCHH Q lcl|NC_019918. 28 FSFKTDRRRLTSLIKRVEAL--------------------------DGT----TVEVGFFPEDRYGSENGN-----LPVA 72 (180) Q Consensus 28 v~~k~~~~~l~~l~~~l~~l--------------------------~~~----~V~VGi~~~~~~~~~~~G-----~~vA 72 (180) |+++ +|++|.+.|+.+ ... -+.-|.+..+-...-++| .+.+ T Consensus 1 i~i~----Gld~L~~~l~~~~~~~~~~v~~av~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~SI~~~~~g~~~~~V~~~~ 76 (115) T protein:vir:99 1 MNID----GLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTVDLQYTITSHA 76 (115) T ss_pred Ccch----hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcchhhhhceeeeecCcEEEEecCCc Confidence 2222 222222222221 100 011122211100001122 2347 Q ss_pred HHHHHHhcCCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019918. 73 QVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFE 113 (180) Q Consensus 73 ~iA~i~EfGt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~ 113 (180) .+|.+.||||...|+||||+|+++..+ ..+.+.++++++ T Consensus 77 ~Ya~~vE~GT~~m~a~PFl~PA~~~~k--~~~~~~l~~~~k 115 (115) T protein:vir:99 77 AYSGFLEFGTRYMEAEPFMWPVYEVIR--KSTVEELKTLFE 115 (115) T ss_pred cccccccccccccCCCCcchhhHHHHH--HHHHHHHHHHhC Confidence 899999999999999999999998753 345555555554 No 59 >protein:vir:5978 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690678;genbank:geneid:6329146;genbank:gi:22855072;interpro:IPR011693;uniprot:O48447;genbank:GeneID:955318 Probab=97.61 E-value=1.4e-07 Score=58.08 Aligned_cols=87 Identities=25% Similarity=0.346 Sum_probs=46.8 Q ss_pred cccceeeehHHHHHHHHHHHHHhhCC------------------------EEEEEeccccc-CCCCCCC-----CCHHHH Q lcl|NC_019918. 25 LASFSFKTDRRRLTSLIKRVEALDGT------------------------TVEVGFFPEDR-YGSENGN-----LPVAQV 74 (180) Q Consensus 25 ~~~v~~k~~~~~l~~l~~~l~~l~~~------------------------~V~VGi~~~~~-~~~~~~G-----~~vA~i 74 (180) ++.++++.+.+.++++.+.|+.+.+. -|.-|-+...= ..-..+| .+.+.+ T Consensus 1 m~~ms~~i~~~g~~~l~~~l~~~~~~~~~~v~~~l~~~a~~i~~~ak~~apv~TG~Lr~SI~~~~~~~g~~~~V~~~~~Y 80 (144) T protein:vir:59 1 MALMSVRIDPSWRRIMSRNVRTFSGHVLTQVEQVIIKTAEKIAGLAASLAPVDEGNLKNSIQIDYKNNGLTAEITVGAEY 80 (144) T ss_pred CCcceeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcCeeEEeecCcEEEEEecCCCc Confidence 34566666665555554444332221 01112111110 0001122 335789 Q ss_pred HHHHhcCC---------------------------CCCCCCcchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019918. 75 AAYNEFGT---------------------------TRNPTRPFMAPTFEEFTSQFHYARLMKSTFE 113 (180) Q Consensus 75 A~i~EfGt---------------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~ 113 (180) |.+.|||| +++||||||++++++. +..+.+.+++++- T Consensus 81 A~~vE~GT~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~--~~~~~~~i~~~~g 144 (144) T protein:vir:59 81 AIYVEYGTGIYAVDGNGRKTPWTYYSPKLGRYVRTQGAPAQPFFWPAVEEG--GEYFEREMRRLRG 144 (144) T ss_pred cchhhcCccccccCCCccccccccccccccceecCCCCCCCcchhHHHHHH--HHHHHHHHHHhcC Confidence 99999997 3589999999999874 3455555555444 No 60 >protein:vir:107851 Length: 175 # NCBI annotation: gp31 # Family: family:all:274 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024704;genbank:gi:48696941;genbank:GeneID:2845939 Probab=97.60 E-value=6.7e-07 Score=54.43 Aligned_cols=93 Identities=13% Similarity=0.207 Sum_probs=47.9 Q ss_pred ecccceeeehHHHHHHHHHHHHHhh-CC-EEE--EE----------ecccc----------------------------- Q lcl|NC_019918. 24 ILASFSFKTDRRRLTSLIKRVEALD-GT-TVE--VG----------FFPED----------------------------- 60 (180) Q Consensus 24 M~~~v~~k~~~~~l~~l~~~l~~l~-~~-~V~--VG----------i~~~~----------------------------- 60 (180) |...++|+.+...+.+.+.+|.... +. .+. || |-.+. T Consensus 1 Ms~~i~i~~~~~~l~~~L~~l~~~~~d~~~l~~~Ig~~l~~~t~~rF~~e~~Pdw~p~~p~t~~~r~~~g~~~~k~~~~~ 80 (175) T protein:vir:10 1 MSDFVNFQIDDSALRTRLLQLEQAGHQKAGAMRKIAQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKNGELT 80 (175) T ss_pred CceeEEEEecHHHHHHHHHHHHHHhccHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchhhhhhhhcccccchhhhhhh Confidence 5555677777665555554443221 10 000 00 00000 Q ss_pred -----------------------cCCCCCC----CCCHHHHHHHHhcCCC-------CCCCCcchhhHHHHH-----HHH Q lcl|NC_019918. 61 -----------------------RYGSENG----NLPVAQVAAYNEFGTT-------RNPTRPFMAPTFEEF-----TSQ 101 (180) Q Consensus 61 -----------------------~~~~~~~----G~~vA~iA~i~EfGt~-------~IP~RpFlr~~~~~~-----~~~ 101 (180) +|...++ |++ ..||++|+||.. +||+||||- +++. +.. T Consensus 81 ~~~~~~~~~~~~L~~tG~L~~Si~~~~~~~~v~vGtn-~~YAaiHqfGg~~~~~~~v~iPaRpfLG--~s~~d~~~~e~~ 157 (175) T protein:vir:10 81 AAASRRKAGLMILQDSGQMAASVSTDHDDNSAVIGSN-KEYAAIHQFGGQAGRGLKVTIPARPWLP--VTADGELQPEAV 157 (175) T ss_pred hhhhhhccCCCcceechhhhhhhheeecCCEEEEecC-hhhhhhhhcccccCCCCccccCCccccC--CCcccccchHHH Confidence 0000011 333 457999999965 899999994 4321 122 Q ss_pred HHHHHHHHHHHHHHHhCC Q lcl|NC_019918. 102 FHYARLMKSTFENVIRDG 119 (180) Q Consensus 102 ~~~~~~~~~~~~~~l~g~ 119 (180) +.|.+.+...+..++.+. T Consensus 158 ~~Il~~~~~~l~~~~~~~ 175 (175) T protein:vir:10 158 EPVLNTILRHLMDAANRR 175 (175) T ss_pred HHHHHHHHHHHHHHhccC Confidence 456666667777777666 No 61 >protein:vir:6071 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878212;genbank:gi:33438911;genbank:GeneID:1457746 Probab=97.53 E-value=2.6e-07 Score=56.68 Aligned_cols=91 Identities=13% Similarity=0.259 Sum_probs=48.1 Q ss_pred CCCCcccccchhhHHHHHhh--hheecccceeeehHHHHHHHHHHH-HHhhCCEEEEEecccccCCCCCCCCCHHHHHHH Q lcl|NC_019918. 1 MQDGRSFTTSATPVLKTLAL--GVIILASFSFKTDRRRLTSLIKRV-EALDGTTVEVGFFPEDRYGSENGNLPVAQVAAY 77 (180) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~--~~~M~~~v~~k~~~~~l~~l~~~l-~~l~~~~V~VGi~~~~~~~~~~~G~~vA~iA~i 77 (180) --||..+.-.....++.-.= +-.|.- .+ .+...| -..+...+.|||..+. ++.||++ T Consensus 47 ~PdG~~W~p~~~~~~~~k~~~~~~~l~~---------~~-~l~~sl~~~~~~~~a~vg~~~Gt----------~~~yAai 106 (150) T protein:vir:60 47 APDGTPYAPRQQQSARKKTGRVKRKMFA---------KL-ITSRFLHIRASPEQASMEFYGGK----------SPKIASV 106 (150) T ss_pred CCCCCCCcccChHHHHHhhcCCCccchh---------hh-hhcceeeeeeeCcEEEEEeeCCC----------chhhhhh Confidence 45777776554433322110 000100 00 011111 1234567888886332 4689999 Q ss_pred HhcCC----------CCCCCCcchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019918. 78 NEFGT----------TRNPTRPFMAPTFEEFTSQFHYARLMKSTFEN 114 (180) Q Consensus 78 ~EfGt----------~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~ 114 (180) |+||. .+||+|||| ++.+. .+.++.+.+...+.+ T Consensus 107 HQfG~~~~~~~~~~~~~iPaRp~L--G~s~~-d~~~i~~~i~~~l~r 150 (150) T protein:vir:60 107 HQFGLSEENRKDGKKIDYPARPLL--GFTGE-DVQMIEEIILAHLDR 150 (150) T ss_pred hhccccccccCCCCceecCCcccC--CCCHH-HHHHHHHHHHHHHhC Confidence 99993 379999999 45442 344555555555554 No 62 >protein:vir:96486 Length: 112 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238496;genbank:gi:66391772;genbank:GeneID:5176908 Probab=97.50 E-value=8.1e-08 Score=59.46 Aligned_cols=82 Identities=17% Similarity=0.312 Sum_probs=40.3 Q ss_pred ecccceeeehHHHHHHHHHHHHHhhCC----------------------EEEEEecccccCC---CCCCCC-----CHHH Q lcl|NC_019918. 24 ILASFSFKTDRRRLTSLIKRVEALDGT----------------------TVEVGFFPEDRYG---SENGNL-----PVAQ 73 (180) Q Consensus 24 M~~~v~~k~~~~~l~~l~~~l~~l~~~----------------------~V~VGi~~~~~~~---~~~~G~-----~vA~ 73 (180) |. +|+++ +|++|++.|+.+.+. .-..++..+...+ -.++|. +.+. T Consensus 1 Ma-~i~i~----Gld~L~~~l~~~~~~~~v~~~v~~~~~~~~~~~~~~a~~~apvdTG~Lr~sI~~~~~~~~~~v~~~~~ 75 (112) T protein:vir:96 1 MA-TIEFE----GLDEMAQSLLKNASSERRSKVLRKYGAKLKEAAVSKAQFKKGYSTGATRRSITLEAGSDRAVVEALTN 75 (112) T ss_pred Cc-eeeeh----HHHHHHHHHHhhcCHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCchhhhhceeeecCceEEEecCCCC Confidence 33 34443 333333333322110 0111221111100 012222 3467 Q ss_pred HHHHHhcCCCCCCCCcchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019918. 74 VAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTF 112 (180) Q Consensus 74 iA~i~EfGt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~ 112 (180) +|.+.||||...|+||||+|+++..+. .+.+.+++.- T Consensus 76 Ya~~vE~GTr~m~AqPF~~PA~~~~~~--~~~~~l~~L~ 112 (112) T protein:vir:96 76 YSGYLEVGTRKMEAQPFMRPALDQVVP--EMVEEMAKWE 112 (112) T ss_pred ccceeccCccccCCCCchhhhHHHHHH--HHHHHHHhcC Confidence 999999999999999999999987432 3333333332 No 63 >protein:vir:5703 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839862;genbank:gi:30065717;genbank:GeneID:1260611 Probab=97.45 E-value=3.3e-07 Score=56.11 Aligned_cols=91 Identities=13% Similarity=0.282 Sum_probs=47.8 Q ss_pred CCCCcccccchhhHHHHHh--hhheecccceeeehHHHHHHHHHHH-HHhhCCEEEEEecccccCCCCCCCCCHHHHHHH Q lcl|NC_019918. 1 MQDGRSFTTSATPVLKTLA--LGVIILASFSFKTDRRRLTSLIKRV-EALDGTTVEVGFFPEDRYGSENGNLPVAQVAAY 77 (180) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~--~~~~M~~~v~~k~~~~~l~~l~~~l-~~l~~~~V~VGi~~~~~~~~~~~G~~vA~iA~i 77 (180) --||..|.-.....++.-. -+-.|.. .+ .+.+.| -..+...+.|||..+. +..||++ T Consensus 47 ~PdG~~W~p~k~~~~~~k~~~~~~~l~~---------~~-~l~~sl~~~~~~~~a~vg~~~G~----------~~~yAai 106 (150) T protein:vir:57 47 APDGTPYAPRQQQSARKKTGRVKRKMFA---------KL-ITSRFLHIRASPEQASMEFYGGK----------SPKIASV 106 (150) T ss_pred CCCCCCCcccChHHHHHhccCCCcccch---------hh-hhccceeeeeeCcEEEEEeecCC----------chhhhhh Confidence 4577777654443332211 0000100 00 011111 1234567888886332 4689999 Q ss_pred HhcCC----------CCCCCCcchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019918. 78 NEFGT----------TRNPTRPFMAPTFEEFTSQFHYARLMKSTFEN 114 (180) Q Consensus 78 ~EfGt----------~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~ 114 (180) |+||. .+||+|||| ++.+. .+.++.+.+...+.+ T Consensus 107 HQfG~~~r~~~~~~~~~iPaRp~L--G~s~~-d~~~i~~~i~~~l~r 150 (150) T protein:vir:57 107 HQFGLSEETRKDGKKIDYPARPLL--GFTGE-DVQMIEEIILAHLDR 150 (150) T ss_pred hhccccccccCCCceeecCCcccC--CCCHH-HHHHHHHHHHHHHhC Confidence 99993 369999999 55442 344555555555555 No 64 >protein:vir:98409 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:YP_001210363;genbank:gi:146334932;genbank:GeneID:5114801 Probab=97.40 E-value=2e-07 Score=57.27 Aligned_cols=100 Identities=20% Similarity=0.310 Sum_probs=39.8 Q ss_pred cccch-hhHHHHHhhhheecccceee-ehHHHHHHHHHHHHHhhCCEEEEEecccccC-CCCCCC-----CCHHHHHHHH Q lcl|NC_019918. 7 FTTSA-TPVLKTLALGVIILASFSFK-TDRRRLTSLIKRVEALDGTTVEVGFFPEDRY-GSENGN-----LPVAQVAAYN 78 (180) Q Consensus 7 ~~~~~-~~~~~~~~~~~~M~~~v~~k-~~~~~l~~l~~~l~~l~~~~V~VGi~~~~~~-~~~~~G-----~~vA~iA~i~ 78 (180) +.-+. --+++.|. ++...-.++ .-.+....+.+..+.+.. |.=|-+..+-. ...++| .+.+.+|.+. T Consensus 1 i~i~Gld~l~~~l~---~~~~~~~~~~al~~~a~~i~~~ak~~ap--vdTG~Lr~si~~~~~~~~~~~~V~~~~~Ya~~v 75 (108) T protein:vir:98 1 MKITGIDALQKKLR---KNATLNDVKHVVKRNTVSMNKNMQNLAP--VDTGNMKRSITSEFTDGGLTGTTIPHTDYAGYV 75 (108) T ss_pred CcchhHHHHHHHHH---HhhhHHHHHHHHHHHHHHHHHHHHHhCC--CCchhhHhhceeeeecCceEEEeecCCCcccee Confidence 11000 00111111 000000000 000011111112222111 11121111100 001122 2346789999 Q ss_pred hcCCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019918. 79 EFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFE 113 (180) Q Consensus 79 EfGt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~ 113 (180) ||||...|+||||+|+++... ..+.+.+++.++ T Consensus 76 E~GT~~m~aqPFl~pa~~~~~--~~~~~~i~~~lr 108 (108) T protein:vir:98 76 EYGTRFQAAQPFVKPAFDVQK--KIFTNDLERLTK 108 (108) T ss_pred eccccccCCCcchhhHHHHHH--HHHHHHHHHHcC Confidence 999999999999999998643 345555555555 No 65 >protein:vir:9708 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795470;genbank:gi:28876221;genbank:GeneID:1257765 Probab=97.21 E-value=3.4e-07 Score=56.06 Aligned_cols=98 Identities=17% Similarity=0.128 Sum_probs=47.9 Q ss_pred CCCC------------c--------ccccchhhHHHHHhhhheecccceeeehHHHHHHHHHHHH-------HhhCCEEE Q lcl|NC_019918. 1 MQDG------------R--------SFTTSATPVLKTLALGVIILASFSFKTDRRRLTSLIKRVE-------ALDGTTVE 53 (180) Q Consensus 1 ~~~~------------~--------~~~~~~~~~~~~~~~~~~M~~~v~~k~~~~~l~~l~~~l~-------~l~~~~V~ 53 (180) |-.| + .....|.++.+.+- ..+-+... ..-..+.+.+. ..+...+. T Consensus 1 mv~Gl~el~~~l~~l~~~~~~~~~~al~~ga~~~~~~~k------~~ap~~~~-~~~~hl~d~I~~~~~k~~~~g~~~~~ 73 (125) T protein:vir:97 1 MTKGLDEILANLTKLEVKAPKTAKAAVTEVAKEFEKALK------ANTPVYEV-ETDERLQEDTVISGFKGANVGIVSKE 73 (125) T ss_pred CchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHH------HhCCcCCC-CchhhHHhhhhcccccccccCceEEE Confidence 1111 0 00111222222211 11111100 00001222221 12334678 Q ss_pred EEecccccCCCCCCCCCHHHHHHHHhcCCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_019918. 54 VGFFPEDRYGSENGNLPVAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRD 118 (180) Q Consensus 54 VGi~~~~~~~~~~~G~~vA~iA~i~EfGt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g 118 (180) |||..+ .+.++.+.||||.+.||+|||++++++. +.++.+.+.+.+.+.+.= T Consensus 74 VG~~k~-----------~~~y~~f~E~GT~k~~~~pF~~pa~~~~--k~~~~~~~~~~~~~~L~l 125 (125) T protein:vir:97 74 IGYGKA-----------TGWRAHYPNDGTIYQRGQDFKERTINQM--TPKAKQLYAEKVKEGLGL 125 (125) T ss_pred EeecCC-----------CceeEeeeccCccCCCcCccchHhHHHh--HHHHHHHHHHHHHHHhcC Confidence 888422 2568999999999999999999999874 345666666666654422 No 66 >protein:vir:743 Length: 108 # NCBI annotation: unknown # Family: family:all:180 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108720;genbank:gi:13487842;genbank:GeneID:920877 Probab=97.19 E-value=5.4e-07 Score=54.93 Aligned_cols=100 Identities=19% Similarity=0.273 Sum_probs=39.7 Q ss_pred cccch-hhHHHHHhhhheecccceee-ehHHHHHHHHHHHHHhhCCEEEEEecccccCCC-CCCC-----CCHHHHHHHH Q lcl|NC_019918. 7 FTTSA-TPVLKTLALGVIILASFSFK-TDRRRLTSLIKRVEALDGTTVEVGFFPEDRYGS-ENGN-----LPVAQVAAYN 78 (180) Q Consensus 7 ~~~~~-~~~~~~~~~~~~M~~~v~~k-~~~~~l~~l~~~l~~l~~~~V~VGi~~~~~~~~-~~~G-----~~vA~iA~i~ 78 (180) +.-+. .-+++.|- ++...-.++ .-.+....+.+..+.+.. |.=|.+.+.=... .++| .+.+.+|.+. T Consensus 1 i~i~Gld~l~~~l~---~~~~~~~~~~al~~~a~~i~~~ak~~aP--v~TG~Lr~si~~~~~~~~~~~~V~~~~~Ya~~v 75 (108) T protein:vir:74 1 MKITGIDALQKKLR---KNATLDDVKHVVKSNTASMNKNMQNLAP--VDTGNMKRSITSEFTDGGLSGTTGPHTDYAGYV 75 (108) T ss_pred CcchhHHHHHHHHH---HhhhHHHHHHHHHHHHHHHHHHHHHhCC--CCchhhhccceeeeecCceEEEeecCCCcccce Confidence 00000 00011110 000000000 000011111122222211 1112111110000 1122 2346799999 Q ss_pred hcCCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019918. 79 EFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFE 113 (180) Q Consensus 79 EfGt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~ 113 (180) ||||...|+||||+|+++..+ ..+.+.+++.++ T Consensus 76 E~GT~km~aqpf~~pa~~~~~--~~~~~~i~~~~k 108 (108) T protein:vir:74 76 EYGTRFQSAQPFVKPAFNIQK--KVFTNDLERLTK 108 (108) T ss_pred eccccccCCCcchhhHHHHHH--HHHHHHHHHHcC Confidence 999999999999999998643 345555554444 No 67 >protein:vir:103841 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938236;genbank:gi:38229141;genbank:GeneID:2648156 Probab=97.17 E-value=2.6e-06 Score=51.22 Aligned_cols=94 Identities=14% Similarity=0.158 Sum_probs=44.8 Q ss_pred ecccceeeehHHHHHHHHHHHHHhhC-C-------------EEEEEe-ccccc--------------------------- Q lcl|NC_019918. 24 ILASFSFKTDRRRLTSLIKRVEALDG-T-------------TVEVGF-FPEDR--------------------------- 61 (180) Q Consensus 24 M~~~v~~k~~~~~l~~l~~~l~~l~~-~-------------~V~VGi-~~~~~--------------------------- 61 (180) |...++|+.+.+.+.+.+++|....+ . .+.=-| +++.. T Consensus 1 Ms~~i~i~~~~~~~~~~L~~l~~~~~~~~~l~~~ig~~l~~~~~~rF~p~G~~W~plsp~t~~~r~k~g~~~~~~L~~tG 80 (155) T protein:vir:10 1 MANRIELELVDREVQERLAALYAAVTDTLPLMRGIAAELLAETEFAFMDEGPGWPQLSPVTVAARAAKGRGAHPILQVTN 80 (155) T ss_pred CCceEEEEechHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCCccchHHHHhccCCCCCccccch Confidence 44456666555443333333321110 0 000001 11110 Q ss_pred -------CCCCCC----CCCHHHHHHHHhcCC-------CCCCCCcchhhHHHHH-HHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_019918. 62 -------YGSENG----NLPVAQVAAYNEFGT-------TRNPTRPFMAPTFEEF-TSQFHYARLMKSTFENVIRDGR 120 (180) Q Consensus 62 -------~~~~~~----G~~vA~iA~i~EfGt-------~~IP~RpFlr~~~~~~-~~~~~~~~~~~~~~~~~l~g~~ 120 (180) |...++ |+ +..||++|+||. .+||+|||| ++++. +-+.++.+.+...+.+.+..+. T Consensus 81 ~L~~Si~~~~~~~~v~vGt-n~~YA~iHqfGg~~~~~~~~~iPARPfL--G~s~~~e~~~ei~~~I~~~i~~~l~~~r 155 (155) T protein:vir:10 81 ALARSITTRADRDQAQIGS-NLSYAAIQQLGGQAGRGRKVTIPARPYL--PVLRNGQLKPSARDAVLDVLLAALSQGR 155 (155) T ss_pred hhhhhhhceecCCEEEEec-CcchhhhhhcccccCCCCccccCCcccc--CCCccccchHHHHHHHHHHHHHHHhhcC Confidence 000011 33 345899999996 379999999 34321 1122456667777777775555 No 68 >protein:vir:99833 Length: 190 # NCBI annotation: hypothetical protein # Family: family:all:274 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164071;genbank:gi:56692603;genbank:GeneID:3192561 Probab=97.17 E-value=1.6e-06 Score=52.43 Aligned_cols=95 Identities=18% Similarity=0.203 Sum_probs=50.3 Q ss_pred CCCCcccccchhhHHHHHhhhheecccceeeehHHHHHHHHHHHH-HhhCCEEEEEecccccCCCCCCCCCHHHHHHHHh Q lcl|NC_019918. 1 MQDGRSFTTSATPVLKTLALGVIILASFSFKTDRRRLTSLIKRVE-ALDGTTVEVGFFPEDRYGSENGNLPVAQVAAYNE 79 (180) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~M~~~v~~k~~~~~l~~l~~~l~-~l~~~~V~VGi~~~~~~~~~~~G~~vA~iA~i~E 79 (180) -.||..|.-+ +-.+..-.=..+=.+=.+.- .|.+.+. ..+...|.||.. ..+|++|+ T Consensus 51 ~PdG~~W~p~-----~~~t~~rk~~~~~~~L~~tg---~L~~Si~~~~~~~~v~vGtn--------------~~yA~iHq 108 (190) T protein:vir:99 51 SPDGTPWQPL-----SPAYLRRKRKNRDKILTLDG---HLRNLLRYQLDGSELLFGSD--------------RPYAAIHH 108 (190) T ss_pred CCCCCCCccc-----cHHHHHHhhcCCCccceecH---HHHHHHhheecCcEEEEecC--------------cchhhhhh Confidence 3455445432 11111110000101111111 2333332 234567777742 45789999 Q ss_pred cCC--------------------------------------------CCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019918. 80 FGT--------------------------------------------TRNPTRPFMAPTFEEFTSQFHYARLMKSTFENV 115 (180) Q Consensus 80 fGt--------------------------------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~ 115 (180) ||. .+||+|||| ++.+ +.+++|.+.+...+..+ T Consensus 109 ~Gg~i~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~v~IPaRpfL--G~s~-~d~~~I~~~i~~~l~~~ 185 (190) T protein:vir:99 109 FGGTIQRQARSSTVYFRQNERTGEVGREFVPRRRSNFAQDVQIGPYTIQMPARPWL--GTSS-QDDDTILQRVERYLQRA 185 (190) T ss_pred cCCcccccccchhhhhhhhhhhhhhhcccccccccccchhcccccceeeecCcccC--CCCH-HHHHHHHHHHHHHHHHH Confidence 992 368999999 4443 34567888888888888 Q ss_pred HhCCC Q lcl|NC_019918. 116 IRDGR 120 (180) Q Consensus 116 l~g~~ 120 (180) +.... T Consensus 186 ~~~~~ 190 (190) T protein:vir:99 186 LRERA 190 (190) T ss_pred HhhcC Confidence 77765 No 69 >protein:vir:98557 Length: 149 # NCBI annotation: gp14 # Family: family:all:370 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958069;genbank:gi:41057366;genbank:GeneID:2744228 Probab=97.15 E-value=4.2e-06 Score=50.07 Aligned_cols=81 Identities=9% Similarity=0.046 Sum_probs=58.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhc-----C--CCCCcHHHHHhcC--CCCCchhH Q lcl|NC_019918. 95 FEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDD-----Y--PGSNSPAWAAYKG--FNDPLFHT 165 (180) Q Consensus 95 ~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~-----~--~pPnsp~Ti~~KG--~~~PLIDT 165 (180) +++.. ++.+.+...+.++ . ..+....|..||..+....+..|.+ | |+|+++.|+++|+ ..+||+++ T Consensus 1 m~d~~---~l~~~L~~ll~~L-~-~~~~~~ll~~Ig~~l~~~t~~rf~~q~~PdG~~W~p~~~~~~~~k~~~~~~~l~~~ 75 (149) T protein:vir:98 1 MSELT---ALQERLTGLIASL-S-PAARRQMAADIAKKLRASQQQRIRRQQAPDGTPYAARKRQSVRSKKGRIRREMFAR 75 (149) T ss_pred CchHH---HHHHHHHHHHHhc-C-chhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHHhccCCCCcccchh Confidence 33321 2333333333332 1 1245678999999999999999974 3 3489999999887 36899999 Q ss_pred HHHHhhhhhheeccC Q lcl|NC_019918. 166 GKMLESVKFQIHRRQ 180 (180) Q Consensus 166 G~L~~SIty~V~~k~ 180 (180) |.|.+||++.+.... T Consensus 76 g~l~~sl~~~~~~~~ 90 (149) T protein:vir:98 76 LRTNRFMKAKGSDSA 90 (149) T ss_pred hhhhhhhhheecCCe Confidence 999999999988887 No 70 >protein:vir:98557 Length: 149 # NCBI annotation: gp14 # Family: family:all:370 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958069;genbank:gi:41057366;genbank:GeneID:2744228 Probab=97.11 E-value=9.6e-07 Score=53.58 Aligned_cols=90 Identities=13% Similarity=0.242 Sum_probs=45.6 Q ss_pred CCCCcccccchhhHHHHHhhhheecccceeeehHHHHH--HHHHHHH-HhhCCEEEEEecccccCCCCCCCCCHHHHHHH Q lcl|NC_019918. 1 MQDGRSFTTSATPVLKTLALGVIILASFSFKTDRRRLT--SLIKRVE-ALDGTTVEVGFFPEDRYGSENGNLPVAQVAAY 77 (180) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~M~~~v~~k~~~~~l~--~l~~~l~-~l~~~~V~VGi~~~~~~~~~~~G~~vA~iA~i 77 (180) --||..+..+.-.-++.-. -+. ....+. .+.+.|. ..+...+.|||... +..||++ T Consensus 47 ~PdG~~W~p~~~~~~~~k~--~~~--------~~~l~~~g~l~~sl~~~~~~~~~~V~~~Gs-----------~~~yAa~ 105 (149) T protein:vir:98 47 APDGTPYAARKRQSVRSKK--GRI--------RREMFARLRTNRFMKAKGSDSAAVVEFTGR-----------VQRMARV 105 (149) T ss_pred CCCCCCCcccchHHHHhcc--CCC--------CcccchhhhhhhhhhheecCCeeEEEecCc-----------chHHhhH Confidence 3455555544332221100 000 000000 1111221 23456788888622 4689999 Q ss_pred HhcCC----------CCCCCCcchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019918. 78 NEFGT----------TRNPTRPFMAPTFEEFTSQFHYARLMKSTFEN 114 (180) Q Consensus 78 ~EfGt----------~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~ 114 (180) |+||. .+||+|||| ++.+. .++++.+.+...+.+ T Consensus 106 HQfG~~~r~~~~~~~~~iPaRp~L--G~s~~-d~~~i~~~i~~~l~~ 149 (149) T protein:vir:98 106 HQYGLKDRPNRHSRDVQYAARPLL--GFTRD-DEQMIEDIIIRHLGK 149 (149) T ss_pred hhccccccccCCCcceeccccccC--CCCHH-HHHHHHHHHHHHhhC Confidence 99994 279999999 45432 344566555555555 No 71 >protein:vir:101594 Length: 173 # NCBI annotation: hypothetical protein # Family: family:all:26502 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112510;genbank:gi:53793610;interpro:IPR010064;uniprot:Q5ZGE3;genbank:GeneID:3101702 Probab=97.05 E-value=2.7e-06 Score=51.15 Aligned_cols=85 Identities=14% Similarity=0.218 Sum_probs=40.3 Q ss_pred ceeeehHHHHHHHHHHHHHhhC---------------------------------CEEEEEecccccCCCCCCCCCHHHH Q lcl|NC_019918. 28 FSFKTDRRRLTSLIKRVEALDG---------------------------------TTVEVGFFPEDRYGSENGNLPVAQV 74 (180) Q Consensus 28 v~~k~~~~~l~~l~~~l~~l~~---------------------------------~~V~VGi~~~~~~~~~~~G~~vA~i 74 (180) |+| .+|++|.+.|+.|.+ ..+++-...+.. .-.....+.+.+ T Consensus 1 i~i----~Gld~L~~~L~~l~~~~~~~~~~a~~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~~~~~~-~~~~~v~~~~~Y 75 (173) T protein:vir:10 1 MAV----KGVAEVIAELRKIGKDIDKNINATTEEAANFIEDRAKTLAPKNFGKLAQSISTSDLKAKD-LISKKITVNELY 75 (173) T ss_pred Ccc----hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCchhhhhcceeeeeccCc-eeEEeeCCCccc Confidence 222 222222222222211 011111110000 000001245778 Q ss_pred HHHHhcCCC-------------------------------------------------------CCCCCcchhhHHHHHH Q lcl|NC_019918. 75 AAYNEFGTT-------------------------------------------------------RNPTRPFMAPTFEEFT 99 (180) Q Consensus 75 A~i~EfGt~-------------------------------------------------------~IP~RpFlr~~~~~~~ 99 (180) |.+.||||. +.||||||+|++++. T Consensus 76 a~fvEfGT~~m~a~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~G~~aqPFl~PA~~~~- 154 (173) T protein:vir:10 76 GAYMEFGTGAKVSVPKEFADMAASFKGQKTGSFKDGLESIKAWCRAKGIDEKAAYPIFAKILGAGINPQPFLYPAWIEG- 154 (173) T ss_pred chhhhcccccccCCCchhhhhhcccccccccccccccccccccccccccchhcccceeeEeecCCCCCCccchhHHHHh- Confidence 999999963 489999999999774 Q ss_pred HHHHHHHHHHHHHHHHHhCC Q lcl|NC_019918. 100 SQFHYARLMKSTFENVIRDG 119 (180) Q Consensus 100 ~~~~~~~~~~~~~~~~l~g~ 119 (180) +..+.+.+++.+...+.-= T Consensus 155 -~~~~~~~i~~~i~~~lrk~ 173 (173) T protein:vir:10 155 -KKQYLKDLENLLKTYNKKI 173 (173) T ss_pred -HHHHHHHHHHHHHHHhhcC Confidence 4455666666555544332 No 72 >protein:vir:100312 Length: 152 # NCBI annotation: tail synthesis protein S # Family: family:all:370 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655481;genbank:gi:109289949;genbank:GeneID:4157355 Probab=96.97 E-value=3.2e-06 Score=50.73 Aligned_cols=91 Identities=14% Similarity=0.236 Sum_probs=46.3 Q ss_pred CCCCcccccchhhHHHHHhhhheecccceeeehHHHHHHHHHH--H-HHhhCCEEEEEecccccCCCCCCCCCHHHHHHH Q lcl|NC_019918. 1 MQDGRSFTTSATPVLKTLALGVIILASFSFKTDRRRLTSLIKR--V-EALDGTTVEVGFFPEDRYGSENGNLPVAQVAAY 77 (180) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~M~~~v~~k~~~~~l~~l~~~--l-~~l~~~~V~VGi~~~~~~~~~~~G~~vA~iA~i 77 (180) --||..|..+...... - .-.++ +...+.++... | -..+...+.|||... +..||++ T Consensus 48 ~PDG~pW~p~k~~~~~---~------k~~~~-~~~m~~~L~~a~~l~~~a~~~~~~Vg~~Gt-----------~~~yAai 106 (152) T protein:vir:10 48 NPDGSAYEPRKKPKKG---V------KSKIK-SGKMFDKITQPRFMRLRLESEGVSLGYEGG-----------DAVIARI 106 (152) T ss_pred CCCCCCCchhhhhhhh---h------ccccc-chhHHHhhhhcceeeeeecCcEEEEEecCC-----------chhhhhh Confidence 4578777654322110 0 00000 11122222211 1 113456788988622 3679999 Q ss_pred HhcC-----------CCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019918. 78 NEFG-----------TTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENV 115 (180) Q Consensus 78 ~EfG-----------t~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~ 115 (180) |.|| ..+||+|||| ++.+. ...++.+.+...+..+ T Consensus 107 HQfG~~~r~~~~~~~~v~iPaRp~L--G~s~~-d~~~I~~~i~~~l~~a 152 (152) T protein:vir:10 107 HQQGLIGRVRKDWDLKVKYASRELL--GFTDD-DLQMIEDYMINILAGS 152 (152) T ss_pred hccCccccccCCCCcceeccccccC--CCCHH-HHHHHHHHHHHHHhcC Confidence 9999 3469999999 45432 2334444444444443 No 73 >protein:vir:79115 Length: 148 # NCBI annotation: tail completion protein gpS # Family: family:all:370 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165266;genbank:gi:145708091;genbank:GeneID:5247126 Probab=96.95 E-value=2.3e-06 Score=51.50 Aligned_cols=91 Identities=16% Similarity=0.298 Sum_probs=42.3 Q ss_pred CCCCcccccchhhHHHHH-hhhheecccceeeehHHHHHHHHHHHHHhhCCEEEEEecccccCCCCCCCCCHHHHHHHHh Q lcl|NC_019918. 1 MQDGRSFTTSATPVLKTL-ALGVIILASFSFKTDRRRLTSLIKRVEALDGTTVEVGFFPEDRYGSENGNLPVAQVAAYNE 79 (180) Q Consensus 1 ~~~~~~~~~~~~~~~~~~-~~~~~M~~~v~~k~~~~~l~~l~~~l~~l~~~~V~VGi~~~~~~~~~~~G~~vA~iA~i~E 79 (180) --||..|.-......+-- .-.-||...+. +...++ .......+.|||. |. +..||++|+ T Consensus 47 ~PDG~~W~p~s~~~~~~~g~~~~~~~~~l~-------~~~~l~--~~~~~~~~~v~~~----------Gt-~~~yAaiHQ 106 (148) T protein:vir:79 47 NPDGSPYVPRKPQLRHRAGRIRRAMFMRLR-------LARYMK--TQADANTAVVTFA----------GN-AQRIATVHQ 106 (148) T ss_pred CCCCCcCcccchHHHhhcccccccccchhh-------hhhhee--eeeeCCeeeEEee----------cc-chhhhhhhh Confidence 447777754221111100 00011211110 011111 1123446777764 22 367999999 Q ss_pred cC----------CCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_019918. 80 FG----------TTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRD 118 (180) Q Consensus 80 fG----------t~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g 118 (180) || +.+||+|||| ++.+. .+.+ +...+...+.| T Consensus 107 fG~~~r~~~~~~~v~iPaRp~L--G~s~~-d~~~----i~~~i~~~l~~ 148 (148) T protein:vir:79 107 FGLRDRVNKAGLTAQYPARELL--GMDGV-DMEH----ITNLLLLHLGA 148 (148) T ss_pred cCccccccCCCCccccCccccc--CCCHH-HHHH----HHHHHHHHhcC Confidence 99 3379999999 45432 2223 33444444445 No 74 >protein:vir:106570 Length: 182 # NCBI annotation: putative protein # Family: family:all:6475 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958588;genbank:gi:41179258;genbank:GeneID:2717106 Probab=96.92 E-value=1.5e-06 Score=52.53 Aligned_cols=96 Identities=20% Similarity=0.325 Sum_probs=42.0 Q ss_pred ecccceeeehH---HHHHHHHH--------HH----HHhh-------C--CEEEEEeccccc---CCCCCCC-----CCH Q lcl|NC_019918. 24 ILASFSFKTDR---RRLTSLIK--------RV----EALD-------G--TTVEVGFFPEDR---YGSENGN-----LPV 71 (180) Q Consensus 24 M~~~v~~k~~~---~~l~~l~~--------~l----~~l~-------~--~~V~VGi~~~~~---~~~~~~G-----~~v 71 (180) |. +|+|+.-+ ++|+++-+ .+ +... + --|.=|-+..+= ....+++ .+. T Consensus 1 m~-~v~i~Gld~L~~kl~~~~~~~~~~v~~a~~~~~~~~a~~v~~~ak~~~PvdtG~Lr~SI~~~~~~~~~~~~g~V~~~ 79 (182) T protein:vir:10 1 MI-EVELKGVNELRAKLKKLPDIMAKATANAQENAIEQAEAYAVDELQSSIKYSTGELTRSFKHEVKVDGDEVIGRWWNS 79 (182) T ss_pred Ce-EEEEecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCchhhhhceeeeeeecCCeEEEEeecC Confidence 43 55665433 22322111 11 0000 0 011222111110 0000011 122 Q ss_pred HHHHHHHhcCC------------------------------------------------------CCCCCCcchhhHHHH Q lcl|NC_019918. 72 AQVAAYNEFGT------------------------------------------------------TRNPTRPFMAPTFEE 97 (180) Q Consensus 72 A~iA~i~EfGt------------------------------------------------------~~IP~RpFlr~~~~~ 97 (180) +.+|.+.|||| .+.||||||+|++++ T Consensus 80 ~~ya~yvE~GTG~~~~~~~~~~~p~~~~~~~~~~w~~~~~~v~~~~a~~~~~~~~~~~~~~~~~t~G~~aqPFl~pA~~~ 159 (182) T protein:vir:10 80 SMVAVFREFGTGLVGERSHKQLPKNVAIIYRQTPWFFPVDSVDLDLTKIYGIPKIKINGKYFYRTTGQPARQFMTPAANK 159 (182) T ss_pred CCccceeecCcccccccCccccCccceeeeecCCceeeccccccccccccccceeeecCceEeecCCCCCCcchHHHHHH Confidence 45666666664 457999999999987 Q ss_pred HHHHHHHHHHHHHHHHHHHhCCCCH Q lcl|NC_019918. 98 FTSQFHYARLMKSTFENVIRDGRQV 122 (180) Q Consensus 98 ~~~~~~~~~~~~~~~~~~l~g~~~~ 122 (180) .+ ..+.+.+++.+++++.-.... T Consensus 160 ~~--~~i~~~i~~~i~~~l~~~~g~ 182 (182) T protein:vir:10 160 MA--KEAPEIIKRSIDQELHDKLGG 182 (182) T ss_pred hH--HHHHHHHHHHHHHHHHHhhcC Confidence 54 456777776666654432211 No 75 >protein:vir:1838 Length: 149 # NCBI annotation: O protein # Family: family:all:370 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052262;genbank:gi:9634069;genbank:GeneID:1262457 Probab=96.79 E-value=6.3e-06 Score=49.08 Aligned_cols=90 Identities=14% Similarity=0.260 Sum_probs=44.7 Q ss_pred CCCCcccccchhhHHHHHhhhheecccceeeehHHHHHHH--HHHHH-HhhCCEEEEEecccccCCCCCCCCCHHHHHHH Q lcl|NC_019918. 1 MQDGRSFTTSATPVLKTLALGVIILASFSFKTDRRRLTSL--IKRVE-ALDGTTVEVGFFPEDRYGSENGNLPVAQVAAY 77 (180) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~M~~~v~~k~~~~~l~~l--~~~l~-~l~~~~V~VGi~~~~~~~~~~~G~~vA~iA~i 77 (180) --||..+.-+.-.-+++ ..+. .....+..+ .+.|+ ......+.||+. |. +..||++ T Consensus 47 ~PdG~~W~p~~~~~~~~-------~~g~---~~~~~~~~l~~~~~l~~~~~~~~~~v~~~----------Gt-n~~yAai 105 (149) T protein:vir:18 47 APDGTPYAARKRQPVRS-------KKGR---IKREMFAKLRTSRFMKAKGSDSAAVVEFT----------GK-VQRMARV 105 (149) T ss_pred CCCCCCCcccchhhhhh-------ccCc---ccchhhhhhhhhhhhheeecCceeEEEec----------cc-chhhhhh Confidence 44666665443222211 1110 001111111 11111 123446777765 22 3679999 Q ss_pred HhcCC----------CCCCCCcchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019918. 78 NEFGT----------TRNPTRPFMAPTFEEFTSQFHYARLMKSTFEN 114 (180) Q Consensus 78 ~EfGt----------~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~ 114 (180) |+||. .+||+|||| ++.+. .+.++.+.+...+.+ T Consensus 106 HQfG~~~r~~~~~~~v~iPaRp~L--G~s~~-d~~~I~~~i~~~l~~ 149 (149) T protein:vir:18 106 HQYGLKDRPNRNSRDVQYEARPLL--GFTRD-DEQMIEDVIISHLGK 149 (149) T ss_pred hhccccccccCCCccccccccccC--CCCHH-HHHHHHHHHHHHHhC Confidence 99994 279999999 45432 334555555555554 No 76 >protein:vir:94654 Length: 142 # NCBI annotation: tail component protein # Family: family:all:1084 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579211;genbank:gi:93007447;genbank:GeneID:5076773 Probab=96.54 E-value=9.1e-06 Score=48.23 Aligned_cols=86 Identities=15% Similarity=0.243 Sum_probs=42.1 Q ss_pred cccceeeehHHHHHHHHHHHHH----h-----------------hCCEEEEEeccccc-CCCCCCC-------CCHHHHH Q lcl|NC_019918. 25 LASFSFKTDRRRLTSLIKRVEA----L-----------------DGTTVEVGFFPEDR-YGSENGN-------LPVAQVA 75 (180) Q Consensus 25 ~~~v~~k~~~~~l~~l~~~l~~----l-----------------~~~~V~VGi~~~~~-~~~~~~G-------~~vA~iA 75 (180) +++++++.+.++|.+.++.+.. . ...-|.=|-+...= ...+.+| .+.+.+| T Consensus 1 Ma~~~~~~~~~~l~~~l~~~~~~~~~~~~~~l~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~g~~~~~~v~~~~~YA 80 (142) T protein:vir:94 1 MAGLNYRVNSTEFQGALRAALDRLTGAAREATEAAANDMVNMAKGLCPVDTGRLRSSIQAVPSGGRFSFSVTIGTNVTYA 80 (142) T ss_pred CceeEEEecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeccCCceEEEEEecCcccc Confidence 4477777665544433322211 0 00012222221110 0001111 2457899 Q ss_pred HHHhcCCC---------------------------CCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019918. 76 AYNEFGTT---------------------------RNPTRPFMAPTFEEFTSQFHYARLMKSTFENVI 116 (180) Q Consensus 76 ~i~EfGt~---------------------------~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l 116 (180) .++|||+. ++||||||++++++.+ ..+.+.+ +++- T Consensus 81 ~~vE~Gt~~~~i~pk~~k~l~~~~~~~~~~~v~~pG~~~~pfl~~A~~~~~--~~i~~~~----~~~~ 142 (142) T protein:vir:94 81 ADVEYGTAPHVIVPKDKKALYWPGAAHPVAKVNHPGTRAQPFMRPAIAAAS--TFLRNHA----KGIR 142 (142) T ss_pred hhhhccCCCceeccCCCccceecccceeeeeeeecCCCCCcchhHHHHHHH--HHHHHHH----HhcC Confidence 99999962 4889999999997643 3343333 3322 No 77 >protein:vir:2026 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046769;genbank:gi:9630340;genbank:GeneID:1261511 Probab=96.46 E-value=2.3e-05 Score=46.02 Aligned_cols=81 Identities=11% Similarity=0.107 Sum_probs=56.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhc-----C--CCCCcHHHHHhcC--CCCCchhH Q lcl|NC_019918. 95 FEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDD-----Y--PGSNSPAWAAYKG--FNDPLFHT 165 (180) Q Consensus 95 ~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~-----~--~pPnsp~Ti~~KG--~~~PLIDT 165 (180) +++.. .+...+...+.++ . ..+....|..||..+....++.|.+ | |+|+++.|+++|. ..++|+++ T Consensus 1 ~~~~~---~l~~~L~~ll~~l-~-~~~~~~l~~~Ig~~l~~~~~~rf~~q~~PdG~~W~p~k~~~~~~k~g~~~~~l~~~ 75 (150) T protein:vir:20 1 MNEFK---RFEDRLTGLIESL-S-PSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSVRKKTGRVKRKMFAK 75 (150) T ss_pred CchHH---HHHHHHHHHHHhc-C-ChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHHHhccCCCccccch Confidence 33321 2233333333332 1 1234678999999999999999974 3 2489999998664 36799999 Q ss_pred HHHHhhhhhheeccC Q lcl|NC_019918. 166 GKMLESVKFQIHRRQ 180 (180) Q Consensus 166 G~L~~SIty~V~~k~ 180 (180) |.|.+||+|++.... T Consensus 76 ~~l~~sl~~~~~~~~ 90 (150) T protein:vir:20 76 LITSRFLHIRASPEQ 90 (150) T ss_pred hhhhhhhheeecCcE Confidence 999999999988777 No 78 >protein:vir:1988 Length: 156 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050635;genbank:gi:9633522;genbank:GeneID:2636282 Probab=96.37 E-value=3.5e-05 Score=45.02 Aligned_cols=95 Identities=18% Similarity=0.199 Sum_probs=46.5 Q ss_pred CCC-CcccccchhhHHHHHhhhheec-ccceeeehHHHHHHHHHHHH-HhhCCEEEEEecccccCCCCCCCCCHHHHHHH Q lcl|NC_019918. 1 MQD-GRSFTTSATPVLKTLALGVIIL-ASFSFKTDRRRLTSLIKRVE-ALDGTTVEVGFFPEDRYGSENGNLPVAQVAAY 77 (180) Q Consensus 1 ~~~-~~~~~~~~~~~~~~~~~~~~M~-~~v~~k~~~~~l~~l~~~l~-~l~~~~V~VGi~~~~~~~~~~~G~~vA~iA~i 77 (180) --| |..+.--. | .|++.--+-. ..-.+=.+.- .|...|. ..+...|.||.. ..||++ T Consensus 51 ~Pd~G~~W~pls-~--~t~~~r~~~~~~~~~~L~~tg---~L~~Si~~~~~~~~v~vGt~--------------~~yA~v 110 (156) T protein:vir:19 51 DPDTGKGWEAWS-D--SWLAWRQDHGFVPGSILTLHG---DLARSITTDYGQDYALIGSP--------------KIYAAI 110 (156) T ss_pred CCCCCCCCcccC-h--HHHHHhhccCCCCCcchhhhH---HHHHHhhheecCCEEEEecc--------------hhhhHH Confidence 112 43332211 1 1111100000 0000001111 2222222 124556777652 468999 Q ss_pred HhcCCC--------CCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_019918. 78 NEFGTT--------RNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRD 118 (180) Q Consensus 78 ~EfGt~--------~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g 118 (180) |+||.. +||+|||| ++.+. .+.++.+.+...+..++.- T Consensus 111 HqfG~~~~~~~~~~~iPaRpfL--G~s~~-d~~~I~~~i~~~l~~~~~~ 156 (156) T protein:vir:19 111 HQWGGTPDMAPRPAGVPARPYM--GLDKT-GEQEIFDAIRKRVSAALRQ 156 (156) T ss_pred hhcCcccccCCCccccCCcccc--CCCHH-HHHHHHHHHHHHHHHHhhC Confidence 999953 79999999 45543 3556777777777777766 No 79 >protein:vir:79179 Length: 155 # NCBI annotation: gp39, phage virion morphogenesis protein # Family: family:all:370 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111070;genbank:gi:134288746;genbank:GeneID:4960698 Probab=96.31 E-value=1.8e-05 Score=46.65 Aligned_cols=94 Identities=17% Similarity=0.257 Sum_probs=44.4 Q ss_pred CCCCcccccchhhHHHHHhhhheecccceeeehHH-HHHHHH--HHH-HHhhCCEEEEEecccccCCCCCCCCCHHHHHH Q lcl|NC_019918. 1 MQDGRSFTTSATPVLKTLALGVIILASFSFKTDRR-RLTSLI--KRV-EALDGTTVEVGFFPEDRYGSENGNLPVAQVAA 76 (180) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~M~~~v~~k~~~~-~l~~l~--~~l-~~l~~~~V~VGi~~~~~~~~~~~G~~vA~iA~ 76 (180) --||..|.. +|.-+-+-.-... +-+.... .+..+. +.| -..+...+.|||. | +++.||+ T Consensus 48 ~PDG~~W~p-----rk~~~~~~~~~~~-~g~~~~~~m~~~l~~a~~l~~~~~~d~a~Vg~~----------G-s~~~yAa 110 (155) T protein:vir:79 48 NPDGSAYEP-----RKVKAGGKRLREK-AGRVKREAMFRKLRTARYLRIDVDSTGLAIGFD----------E-RLSRIAR 110 (155) T ss_pred CCCCCCCcc-----cchhhhhhhhhcc-cCcccchhhhhhhhhhheeeeeecCcEEEEEec----------C-cchhhhh Confidence 347777753 2211111110000 0000000 111111 001 1124456778874 2 2477999 Q ss_pred HHhcCC----------CCCCCCcchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019918. 77 YNEFGT----------TRNPTRPFMAPTFEEFTSQFHYARLMKSTFEN 114 (180) Q Consensus 77 i~EfGt----------~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~ 114 (180) +|.||. .+||+|||| ++.+. .++++.+.+...+.+ T Consensus 111 iHQfG~~~r~~~~~~~v~iPaRp~L--Gls~~-d~~~I~~~i~~~l~r 155 (155) T protein:vir:79 111 VHQEGQKAPVEPGGPLAQYPVRVVL--GFSDA-DRELVRDRLLRELTR 155 (155) T ss_pred hhhcCCcccCCCCCccccccccccc--CCCHH-HHHHHHHHHHHHhhC Confidence 999993 379999999 45432 344555555555555 No 80 >protein:vir:1164 Length: 156 # NCBI annotation: predicted tail completion # Family: family:all:370 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490613;genbank:gi:17313233;genbank:GeneID:927308 Probab=96.25 E-value=2.8e-05 Score=45.57 Aligned_cols=96 Identities=14% Similarity=0.221 Sum_probs=45.9 Q ss_pred CCCCcccccchhhHHHHHhhhheecccceeeehHHHHHHHHHH--HH-HhhCCEEEEEecccccCCCCCCCCCHHHHHHH Q lcl|NC_019918. 1 MQDGRSFTTSATPVLKTLALGVIILASFSFKTDRRRLTSLIKR--VE-ALDGTTVEVGFFPEDRYGSENGNLPVAQVAAY 77 (180) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~M~~~v~~k~~~~~l~~l~~~--l~-~l~~~~V~VGi~~~~~~~~~~~G~~vA~iA~i 77 (180) --||..|.-+.-.-++. ..+ .++.....+..+... |+ ..+...+.|||.. ++..||++ T Consensus 48 ~PdG~~W~p~~~~~~~~-------~~~-~~~~~~~m~~~l~~~~~l~~~~~~~~a~vg~~G-----------s~~~yA~i 108 (156) T protein:vir:11 48 NPDGSAYEPRKKRELRG-------KQG-RIRRKIKMFQKLRTVRYLRAKGDAQAITVSFAG-----------RIARIARV 108 (156) T ss_pred CCCCCCCcccchHHHhh-------hcc-ccccchhhhhhhhhhheeeeeecCcEEEEEecC-----------Cchhhhhh Confidence 34888886543222211 001 001111111111111 11 1245678888752 23679999 Q ss_pred HhcCCC----------CCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHH Q lcl|NC_019918. 78 NEFGTT----------RNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRDGRQVN 123 (180) Q Consensus 78 ~EfGt~----------~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~ 123 (180) |.||.. +||+|||| ++.+ ..++++ ...+...+.+. ++- T Consensus 109 HQfG~~~~~~~~~~~v~iPaRp~L--G~s~-~d~~~i----~~~i~~~l~~~-~~~ 156 (156) T protein:vir:11 109 HQYGLRDRAEPGAPEVSYAQRLLL--GFDS-SDMETI----QNGILAHIDAN-SPI 156 (156) T ss_pred hcccccccccCCCCcccccccccC--CCCH-HHHHHH----HHHHHHHHhhc-CCC Confidence 999942 79999999 4543 122333 34444444444 322 No 81 >protein:vir:6071 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878212;genbank:gi:33438911;genbank:GeneID:1457746 Probab=96.23 E-value=3.2e-05 Score=45.26 Aligned_cols=81 Identities=12% Similarity=0.095 Sum_probs=57.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhc-----C--CCCCcHHHHHhcCC--CCCchhH Q lcl|NC_019918. 95 FEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDD-----Y--PGSNSPAWAAYKGF--NDPLFHT 165 (180) Q Consensus 95 ~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~-----~--~pPnsp~Ti~~KG~--~~PLIDT 165 (180) +++.. .+...+...+.++ . ..+....+..||..+....+..|.+ | |+|+++.|+++|+. .++|+++ T Consensus 1 ~~~~~---~l~~~L~~~l~~L-~-~~~~~~l~r~Ig~~l~~~~~~Rf~~q~~PdG~~W~p~~~~~~~~k~~~~~~~l~~~ 75 (150) T protein:vir:60 1 MNEFK---RFEDRLTGLIESL-S-PSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSARKKTGRVKRKMFAK 75 (150) T ss_pred CchHH---HHHHHHHHHHHhc-C-ChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccChHHHHHhhcCCCccchhh Confidence 33321 1223333333332 1 2244678999999999999999963 3 34899999998754 5899999 Q ss_pred HHHHhhhhhheeccC Q lcl|NC_019918. 166 GKMLESVKFQIHRRQ 180 (180) Q Consensus 166 G~L~~SIty~V~~k~ 180 (180) |.|..||++++.... T Consensus 76 ~~l~~sl~~~~~~~~ 90 (150) T protein:vir:60 76 LITSRFLHIRASPEQ 90 (150) T ss_pred hhhcceeeeeeeCcE Confidence 999999999998887 No 82 >protein:vir:3163 Length: 145 # NCBI annotation: unknown # Family: family:all:28417 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665934;genbank:gi:22091120;genbank:GeneID:951270 Probab=96.15 E-value=3.6e-05 Score=44.94 Aligned_cols=96 Identities=16% Similarity=0.220 Sum_probs=42.7 Q ss_pred CCCCcccccchhhHHHHHhhhheecccceeeehHHHHHHHHHHHHH-----hhCCEEEEEecccccCCCCCCCCCHHHHH Q lcl|NC_019918. 1 MQDGRSFTTSATPVLKTLALGVIILASFSFKTDRRRLTSLIKRVEA-----LDGTTVEVGFFPEDRYGSENGNLPVAQVA 75 (180) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~M~~~v~~k~~~~~l~~l~~~l~~-----l~~~~V~VGi~~~~~~~~~~~G~~vA~iA 75 (180) -.||+.+.--+- .|.+. +.+=.+-.+.- .|...+.. -+...+.||- +..|| T Consensus 43 ~p~G~~W~pLs~---st~a~----k~~~~~L~~tG---~L~~Si~~~~~~~~~~~~a~vGt--------------n~~YA 98 (145) T protein:vir:31 43 DALGNPWEPLKE---STIRA----KGSDTPLIDNS---RLLTDINAASMMDRANRMAVIGT--------------NLDYA 98 (145) T ss_pred CCCCCCCcccCh---HHHHH----hcCCCCCccCH---HHHHHHHHHhhhcccCceeEecC--------------Cchhh Confidence 123333221000 11111 11100101111 22223321 1223344442 24699 Q ss_pred HHHhcCCC--CCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHH Q lcl|NC_019918. 76 AYNEFGTT--RNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRDGRQVN 123 (180) Q Consensus 76 ~i~EfGt~--~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~ 123 (180) +||+||+. +||+||||-..... .++.+.+.+...+..-+.|- -++ T Consensus 99 ~~hqfG~~~~~IPaRPfLG~~~~~--~~~~~~~ii~~~i~~~L~~~-~~~ 145 (145) T protein:vir:31 99 EHHEFGAPEAGIPARPIFGPAGAY--ASQQAPDVIGDEIDTNLEGA-VID 145 (145) T ss_pred hhhccCCcccccCCCCccCCCccc--hHHHHHHHHHHHHHHHhhhh-ccC Confidence 99999985 69999999665432 23345555666666666553 111 No 83 >protein:vir:5703 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839862;genbank:gi:30065717;genbank:GeneID:1260611 Probab=95.98 E-value=5.2e-05 Score=44.05 Aligned_cols=79 Identities=13% Similarity=0.098 Sum_probs=57.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhC--CCCHHHHHHHHHHHHHHHHHHHHhc-----C--CCCCcHHHHHhcCC--CCCch Q lcl|NC_019918. 95 FEEFTSQFHYARLMKSTFENVIRD--GRQVNTLLKKLGRMVAEQMQVNIDD-----Y--PGSNSPAWAAYKGF--NDPLF 163 (180) Q Consensus 95 ~~~~~~~~~~~~~~~~~~~~~l~g--~~~~~~~L~~iG~~~~~~Iq~~I~~-----~--~pPnsp~Ti~~KG~--~~PLI 163 (180) +++. +.+...+..++.. ..+...+|..||..+....++.|.+ | |+|+++.|+++|+. .++|+ T Consensus 1 m~~~-------~~l~~~L~~~l~~L~~~~~~~l~~~Ig~~l~~~~~~rf~~q~~PdG~~W~p~k~~~~~~k~~~~~~~l~ 73 (150) T protein:vir:57 1 MNEF-------KRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSARKKTGRVKRKMF 73 (150) T ss_pred CchH-------HHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccChHHHHHhccCCCcccc Confidence 3332 1233333333332 1244678999999999999999963 3 34899999987753 58999 Q ss_pred hHHHHHhhhhhheeccC Q lcl|NC_019918. 164 HTGKMLESVKFQIHRRQ 180 (180) Q Consensus 164 DTG~L~~SIty~V~~k~ 180 (180) .+|.|..||+|.+.... T Consensus 74 ~~~~l~~sl~~~~~~~~ 90 (150) T protein:vir:57 74 AKLITSRFLHIRASPEQ 90 (150) T ss_pred hhhhhccceeeeeeCcE Confidence 99999999999988887 No 84 >protein:vir:94490 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240680;genbank:gi:66396374;genbank:GeneID:5133754 Probab=95.81 E-value=1.2e-05 Score=47.64 Aligned_cols=82 Identities=15% Similarity=0.250 Sum_probs=36.5 Q ss_pred ecccce-eeehHHHHHHHHH-------------------HHHHhhCCEEEEEecccc-cCCCCCCC-----CCHHHHHHH Q lcl|NC_019918. 24 ILASFS-FKTDRRRLTSLIK-------------------RVEALDGTTVEVGFFPED-RYGSENGN-----LPVAQVAAY 77 (180) Q Consensus 24 M~~~v~-~k~~~~~l~~l~~-------------------~l~~l~~~~V~VGi~~~~-~~~~~~~G-----~~vA~iA~i 77 (180) |..... ...-.+.|+++.+ ..+.+.. |.=|-+..+ ......+| .+.+.+|.+ T Consensus 1 Ma~~~~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aP--vdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ 78 (137) T protein:vir:94 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMP--VDTGYLRESVTMDFKDSGFTGVINIGSEYAIY 78 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--ccccchhccceeEeecCceEEEEecCCCcccc Confidence 332210 0000111111111 1111111 222322221 01011222 335789999 Q ss_pred HhcCC-----------------------------CCCCCCcchhhHHHHHHHHHHHHHHHH Q lcl|NC_019918. 78 NEFGT-----------------------------TRNPTRPFMAPTFEEFTSQFHYARLMK 109 (180) Q Consensus 78 ~EfGt-----------------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~ 109 (180) .|||+ .+.|+||||++++++.+. .+.+.+. T Consensus 79 vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~--~~~~~l~ 137 (137) T protein:vir:94 79 VNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRA--FFNKYFS 137 (137) T ss_pred cccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHH--HHHHhhC Confidence 99997 358999999999987543 3444433 No 85 >protein:vir:93738 Length: 137 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240463;genbank:gi:66396153;genbank:GeneID:5133507 Probab=95.81 E-value=1.2e-05 Score=47.64 Aligned_cols=82 Identities=15% Similarity=0.250 Sum_probs=36.5 Q ss_pred ecccce-eeehHHHHHHHHH-------------------HHHHhhCCEEEEEecccc-cCCCCCCC-----CCHHHHHHH Q lcl|NC_019918. 24 ILASFS-FKTDRRRLTSLIK-------------------RVEALDGTTVEVGFFPED-RYGSENGN-----LPVAQVAAY 77 (180) Q Consensus 24 M~~~v~-~k~~~~~l~~l~~-------------------~l~~l~~~~V~VGi~~~~-~~~~~~~G-----~~vA~iA~i 77 (180) |..... ...-.+.|+++.+ ..+.+.. |.=|-+..+ ......+| .+.+.+|.+ T Consensus 1 Ma~~~~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aP--vdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ 78 (137) T protein:vir:93 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMP--VDTGYLRESVTMDFKDSGFTGVINIGSEYAIY 78 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--ccccchhccceeEeecCceEEEEecCCCcccc Confidence 332210 0000111111111 1111111 222322221 01011222 335789999 Q ss_pred HhcCC-----------------------------CCCCCCcchhhHHHHHHHHHHHHHHHH Q lcl|NC_019918. 78 NEFGT-----------------------------TRNPTRPFMAPTFEEFTSQFHYARLMK 109 (180) Q Consensus 78 ~EfGt-----------------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~ 109 (180) .|||+ .+.|+||||++++++.+. .+.+.+. T Consensus 79 vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~--~~~~~l~ 137 (137) T protein:vir:93 79 VNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRA--FFNKYFS 137 (137) T ss_pred cccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHH--HHHHhhC Confidence 99997 358999999999987543 3444433 No 86 >protein:vir:97427 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240753;genbank:gi:66396447;genbank:GeneID:5133783 Probab=95.81 E-value=1.2e-05 Score=47.64 Aligned_cols=82 Identities=15% Similarity=0.250 Sum_probs=36.5 Q ss_pred ecccce-eeehHHHHHHHHH-------------------HHHHhhCCEEEEEecccc-cCCCCCCC-----CCHHHHHHH Q lcl|NC_019918. 24 ILASFS-FKTDRRRLTSLIK-------------------RVEALDGTTVEVGFFPED-RYGSENGN-----LPVAQVAAY 77 (180) Q Consensus 24 M~~~v~-~k~~~~~l~~l~~-------------------~l~~l~~~~V~VGi~~~~-~~~~~~~G-----~~vA~iA~i 77 (180) |..... ...-.+.|+++.+ ..+.+.. |.=|-+..+ ......+| .+.+.+|.+ T Consensus 1 Ma~~~~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aP--vdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ 78 (137) T protein:vir:97 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMP--VDTGYLRESVTMDFKDSGFTGVINIGSEYAIY 78 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--ccccchhccceeEeecCceEEEEecCCCcccc Confidence 332210 0000111111111 1111111 222322221 01011222 335789999 Q ss_pred HhcCC-----------------------------CCCCCCcchhhHHHHHHHHHHHHHHHH Q lcl|NC_019918. 78 NEFGT-----------------------------TRNPTRPFMAPTFEEFTSQFHYARLMK 109 (180) Q Consensus 78 ~EfGt-----------------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~ 109 (180) .|||+ .+.|+||||++++++.+. .+.+.+. T Consensus 79 vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~--~~~~~l~ 137 (137) T protein:vir:97 79 VNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRA--FFNKYFS 137 (137) T ss_pred cccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHH--HHHHhhC Confidence 99997 358999999999987543 3444433 No 87 >protein:vir:107099 Length: 137 # NCBI annotation: conserved phage protein # Family: family:all:180 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950610;genbank:gi:119953690;genbank:GeneID:4643108 Probab=95.68 E-value=1.7e-05 Score=46.76 Aligned_cols=78 Identities=19% Similarity=0.363 Sum_probs=35.6 Q ss_pred ecccceeeehHHHHHHHHHHHHHhhCC------------------------EEEEEeccccc-CCCCCCC-----CCHHH Q lcl|NC_019918. 24 ILASFSFKTDRRRLTSLIKRVEALDGT------------------------TVEVGFFPEDR-YGSENGN-----LPVAQ 73 (180) Q Consensus 24 M~~~v~~k~~~~~l~~l~~~l~~l~~~------------------------~V~VGi~~~~~-~~~~~~G-----~~vA~ 73 (180) |.+.. .++++|.+.|+.+.+. -|.=|-+...= .....+| .+.+. T Consensus 1 Ma~~~------~Gl~~l~~~l~~~~~~~~~~~~~al~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~ 74 (137) T protein:vir:10 1 MAKVK------YGNWELVKELEDFEKETIRWAKKGIAKTTTIIHNSIVSNMPVDTGYLRESVSMDFKKGGLTGVINIGSE 74 (137) T ss_pred CchhH------hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCeeEEeeCCcEEEEEecCCC Confidence 33221 1222333222222110 11112111110 0001111 23467 Q ss_pred HHHHHhcCC-----------------------------CCCCCCcchhhHHHHHHHHHHHHHHHH Q lcl|NC_019918. 74 VAAYNEFGT-----------------------------TRNPTRPFMAPTFEEFTSQFHYARLMK 109 (180) Q Consensus 74 iA~i~EfGt-----------------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~ 109 (180) +|.+.|||| +++|+||||++++++.+. .+.+.+. T Consensus 75 Ya~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~--~i~k~i~ 137 (137) T protein:vir:10 75 YAVYVNYGTGIYAVGPGGSRAKNIPWCYKDADGHWHTTKGQHAQPFWEPAIDEGRA--FFNKYFS 137 (137) T ss_pred cccccccCccccccCCCccccccccceeeccccceeccCCCCCCcchhHHHHHHHH--HHHHhcC Confidence 999999995 247999999999987433 3444433 No 88 >protein:vir:105330 Length: 137 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950673;genbank:gi:119967843;genbank:GeneID:4643209 Probab=95.56 E-value=2e-05 Score=46.36 Aligned_cols=82 Identities=16% Similarity=0.248 Sum_probs=35.7 Q ss_pred ecccc-eeeehHHHHHHHH-------------------HHHHHhhCCEEEEEeccccc-CCCCCCC-----CCHHHHHHH Q lcl|NC_019918. 24 ILASF-SFKTDRRRLTSLI-------------------KRVEALDGTTVEVGFFPEDR-YGSENGN-----LPVAQVAAY 77 (180) Q Consensus 24 M~~~v-~~k~~~~~l~~l~-------------------~~l~~l~~~~V~VGi~~~~~-~~~~~~G-----~~vA~iA~i 77 (180) |.... .+..-.+.|+++. +..+.+.. |.-|-+...= ..-..+| .+.+.+|.+ T Consensus 1 Ma~~~~G~~~l~~~l~~~~~~~~~~~~~al~~~a~~i~~~ak~~aP--v~TG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ 78 (137) T protein:vir:10 1 MAKVKYGNWDLVKELEEFEKETIRWAKKGIAKTTTIIHNSIVSNMP--VDTGYLRESVSMDFKKGGLTGVINIGSEYAVY 78 (137) T ss_pred CccchhCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--cCcchhhcCeeeEecCCcEEEEEecCCccccc Confidence 32210 0000011111111 11122211 2222222210 0001122 234679999 Q ss_pred HhcCC-----------------------------CCCCCCcchhhHHHHHHHHHHHHHHHH Q lcl|NC_019918. 78 NEFGT-----------------------------TRNPTRPFMAPTFEEFTSQFHYARLMK 109 (180) Q Consensus 78 ~EfGt-----------------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~ 109 (180) .|||| +++||||||++++++.+ ..+.+.+. T Consensus 79 vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~Pfl~pA~~~~~--~~i~k~i~ 137 (137) T protein:vir:10 79 VNYGTGIYAVGPGGSRAKNIPWRYKDADGHWHTTKGQHAQPFWEPAIDEGR--AFFNKYFS 137 (137) T ss_pred cccCccccccCCCcccccccceeeeccccccccCCCCCCCcchhHHHHHHH--HHHHHhhC Confidence 99996 25899999999998743 33444443 No 89 >protein:vir:78077 Length: 141 # NCBI annotation: gp9 # Family: family:all:180 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468793;genbank:gi:157325374;genbank:GeneID:5601839 Probab=95.48 E-value=2.2e-05 Score=46.14 Aligned_cols=90 Identities=17% Similarity=0.336 Sum_probs=42.4 Q ss_pred cccceeeehHHHH-----HHHHHHHHHh------------hC--CEEEEEeccccc-CCCCCCC-----CCHHHHHHHHh Q lcl|NC_019918. 25 LASFSFKTDRRRL-----TSLIKRVEAL------------DG--TTVEVGFFPEDR-YGSENGN-----LPVAQVAAYNE 79 (180) Q Consensus 25 ~~~v~~k~~~~~l-----~~l~~~l~~l------------~~--~~V~VGi~~~~~-~~~~~~G-----~~vA~iA~i~E 79 (180) +..+.|..+.++. +++++.+++. .+ .-|.=|-+..+- +.--.+| .+.+.+|.+.| T Consensus 1 ~~~~~f~~~~~~~~~~~~k~~~~~~~~~a~~~~~~~ie~~ak~~~pvdtG~L~~SI~~~v~~~g~~~~V~~~~~YA~yVE 80 (141) T protein:vir:78 1 MNEFEFDSNIPKARKLIEKKVLQALEDIGEHMTTELAEGGHGVTSNNDTGEYAQKSGYKVRKSSKEVIVGNSSDYAIYYE 80 (141) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccchhhcceeeeeecCCcEEEEecCCCccceee Confidence 2233333222221 1111211111 00 112233332211 1000111 24578999999 Q ss_pred cCC--------------------------CCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhCCCC Q lcl|NC_019918. 80 FGT--------------------------TRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRDGRQ 121 (180) Q Consensus 80 fGt--------------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~ 121 (180) ||| .+.||||||++++++.+ ..+.+.+++.+..+ + T Consensus 81 ~GTG~~~~~~~grk~~w~y~~~~g~~~~t~G~~aqpFl~~A~~~~~--~~i~~~i~~~~~~l-----~ 141 (141) T protein:vir:78 81 FGTGEKSERGGGKAGGWFYMDKKGHWHFTRGSQASKRMRYTFRDEQ--DKVRVFTERALRGI-----N 141 (141) T ss_pred cCCcccccCCCCCcCcceeecCCCeeEeccCCCCchhhhhhHHhhH--HHHHHHHHHHhhcc-----C Confidence 997 35899999999998753 45666666555543 3 No 90 >protein:vir:95894 Length: 137 # NCBI annotation: ORF046 # Family: family:all:180 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240389;genbank:gi:66396083;genbank:GeneID:5133405 Probab=95.48 E-value=2.4e-05 Score=45.92 Aligned_cols=78 Identities=18% Similarity=0.349 Sum_probs=36.6 Q ss_pred ecccceeeehHHHHHHHHHHHHHhhC------------------------CEEEEEecccc-cCCCCCCC-----CCHHH Q lcl|NC_019918. 24 ILASFSFKTDRRRLTSLIKRVEALDG------------------------TTVEVGFFPED-RYGSENGN-----LPVAQ 73 (180) Q Consensus 24 M~~~v~~k~~~~~l~~l~~~l~~l~~------------------------~~V~VGi~~~~-~~~~~~~G-----~~vA~ 73 (180) |.... .++++|.+.|+.+.+ .-|.-|-+..+ .+...++| .+.+. T Consensus 1 Ma~~~------~G~~~l~~~l~~~~~~~~~~~~~~~~~~a~~v~~~ak~~aPv~TG~L~~Si~~~~~~~~~~~~V~~~~~ 74 (137) T protein:vir:95 1 MAKVK------YGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDGGFTGVINIGSE 74 (137) T ss_pred CchhH------HhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcCeeeEeeCCceEEEEecCCC Confidence 33221 122222222221111 01222322221 01001122 34578 Q ss_pred HHHHHhcCC-----------------------------CCCCCCcchhhHHHHHHHHHHHHHHHH Q lcl|NC_019918. 74 VAAYNEFGT-----------------------------TRNPTRPFMAPTFEEFTSQFHYARLMK 109 (180) Q Consensus 74 iA~i~EfGt-----------------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~ 109 (180) +|.+.|||+ ++.|+||||++++++.+. .+.+.+. T Consensus 75 YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~--~i~k~l~ 137 (137) T protein:vir:95 75 YAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRA--FFNKYFS 137 (137) T ss_pred cccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHH--HHHHhhC Confidence 999999997 358999999999977433 3444443 No 91 >protein:vir:79225 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469157;genbank:gi:157835000;genbank:GeneID:5648806 Probab=95.43 E-value=8.3e-05 Score=42.96 Aligned_cols=95 Identities=12% Similarity=0.102 Sum_probs=36.0 Q ss_pred ecccceeeehHHHHHHHHHHHHHhhC--CEE------------EEEe-ccccc--------------------------- Q lcl|NC_019918. 24 ILASFSFKTDRRRLTSLIKRVEALDG--TTV------------EVGF-FPEDR--------------------------- 61 (180) Q Consensus 24 M~~~v~~k~~~~~l~~l~~~l~~l~~--~~V------------~VGi-~~~~~--------------------------- 61 (180) |...++|+.+...+.+.+++|..... ..+ .=-| +++.. T Consensus 1 M~~~i~i~~d~~~~~~~L~~l~~~~~d~~~l~~~ig~~l~~~~~~rF~~eG~~W~pls~~t~~~r~~~g~~~~~iL~~tG 80 (155) T protein:vir:79 1 MTTRIDVELDDQEVRQRLAVLMRSVTDTLPVMRGIAAELLAETEFAFMDEGPGWPQLSPATVAAREAKGRGPHPILQVTN 80 (155) T ss_pred CceEEEEEechHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhhccCCCCCCCCHHHHHHHhccCCCCCCccccch Confidence 44445555444333332222211100 000 0000 01000 Q ss_pred -------CCCCCC----CCCHHHHHHHHhcCCC-------CCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_019918. 62 -------YGSENG----NLPVAQVAAYNEFGTT-------RNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRDGR 120 (180) Q Consensus 62 -------~~~~~~----G~~vA~iA~i~EfGt~-------~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g~~ 120 (180) |...++ |++ ..||++|+||.. +||+||||--.-++.... +..+.+...+...+.-+. T Consensus 81 ~L~~Si~~~~~~~~v~vGt~-~~YA~iHqfGg~~~~~~~v~iPaRpfLG~s~~~~l~~-~~~~~I~~~i~~~l~r~r 155 (155) T protein:vir:79 81 ALARSVTTWADRNEAGIGSN-LVYAAIHQFGGDAGRGHQVEIPARRYLPFDENGQLAA-GARQSILEVVLTALSRNR 155 (155) T ss_pred hhhhhhhceecCCEEEEecC-chhhhhhhcccccCCCCccccCCccccCCCCccccch-HHHHHHHHHHHHHHHhcC Confidence 000011 333 468999999953 899999993221111001 112223333333332222 No 92 >protein:vir:102154 Length: 119 # NCBI annotation: phage protein, HK97 gp10 family # Family: family:all:10671 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699937;genbank:gi:110804042;genbank:GeneID:4206698 Probab=95.41 E-value=7.2e-06 Score=48.76 Aligned_cols=94 Identities=17% Similarity=0.185 Sum_probs=54.7 Q ss_pred CCCCcc-----cccchhhHHHHHhhhheecccceeeehHHHHHHHHHHHHHhhCCEEEEEecccccCCCCCCCCCHHHHH Q lcl|NC_019918. 1 MQDGRS-----FTTSATPVLKTLALGVIILASFSFKTDRRRLTSLIKRVEALDGTTVEVGFFPEDRYGSENGNLPVAQVA 75 (180) Q Consensus 1 ~~~~~~-----~~~~~~~~~~~~~~~~~M~~~v~~k~~~~~l~~l~~~l~~l~~~~V~VGi~~~~~~~~~~~G~~vA~iA 75 (180) |..|.. +.--+.||++.+ ..++-++ ..+|.++...++.-+ .+.||+.- +-+-++ T Consensus 20 g~~~~~ie~kAlk~g~e~I~~~~------~~n~P~~--tg~lkkik~~~kk~g--~~~VG~~k-----------s~~fy~ 78 (119) T protein:vir:10 20 MVLDESTKRKGIKAGITKIGKAI------EKNSPIK--SGRLSKVKIRVKNTG--LATEGTAS-----------SSEFYD 78 (119) T ss_pred hhhhHHHHHHHHHHHhHHHHHHH------hhcCCcc--cCCcceeeeeeecCc--eeEeccCC-----------cchhhh Confidence 222222 334456777644 2233222 223444444444322 68888742 336799 Q ss_pred HHHhcCCCCCCCC-cchhhHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019918. 76 AYNEFGTTRNPTR-PFMAPTFEEFTSQFHYARLMKSTFENVIR 117 (180) Q Consensus 76 ~i~EfGt~~IP~R-pFlr~~~~~~~~~~~~~~~~~~~~~~~l~ 117 (180) -.+||||...|+| |||.+++++.. ++..+.+...+..=++ T Consensus 79 kF~EFGTSkm~a~~pF~~~a~~~~~--~eA~~~~~~el~~~~r 119 (119) T protein:vir:10 79 IFQNFGTSEQKAHVGYFDRAVDETT--NEAVEEVAEIIFRKMR 119 (119) T ss_pred hhccccccccCCCCCccccccccCh--HHHHHHHHHHHHHhcC Confidence 9999999999999 99999998754 3444444444444333 No 93 >protein:vir:94796 Length: 137 # NCBI annotation: ORF050 # Family: family:all:180 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240540;genbank:gi:66396237;genbank:GeneID:5133576 Probab=95.26 E-value=3.1e-05 Score=45.34 Aligned_cols=82 Identities=16% Similarity=0.268 Sum_probs=35.3 Q ss_pred ecccc-eeeehHHHHHHHHHHH-------------------HHhhCCEEEEEeccccc-CCCCCCC-----CCHHHHHHH Q lcl|NC_019918. 24 ILASF-SFKTDRRRLTSLIKRV-------------------EALDGTTVEVGFFPEDR-YGSENGN-----LPVAQVAAY 77 (180) Q Consensus 24 M~~~v-~~k~~~~~l~~l~~~l-------------------~~l~~~~V~VGi~~~~~-~~~~~~G-----~~vA~iA~i 77 (180) |.+.. .+..-.+.|+++.+.+ +.+.. |.=|-+...= ....++| .+.+.+|.+ T Consensus 1 Ma~~~~G~~~l~~~L~~~~~~~~~~~~~al~~~a~~v~~~ak~~aP--vdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ 78 (137) T protein:vir:94 1 MAKVKYGNWDLVKELENYERDIERWVKRGIAKTTVKIHNTIISLMP--VDTGYLRESVTMDFKDGGFTGVINIGSEYAIY 78 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--cCcchhhcCceeEeecCcEEEEEecCCCcccc Confidence 33221 0011111222111111 11111 1112111110 0001122 234789999 Q ss_pred HhcCC-----------------------------CCCCCCcchhhHHHHHHHHHHHHHHHH Q lcl|NC_019918. 78 NEFGT-----------------------------TRNPTRPFMAPTFEEFTSQFHYARLMK 109 (180) Q Consensus 78 ~EfGt-----------------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~ 109 (180) .|||+ .++|+||||++++++.+. .+.+.+. T Consensus 79 vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~--~~~~~l~ 137 (137) T protein:vir:94 79 VNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRV--FFNKYFS 137 (137) T ss_pred cccCccccccCCCcccccccccceeccCCceeecCCcCCCcchHHHHHHHHH--HHHHhhC Confidence 99994 368999999999987533 3444443 No 94 >protein:vir:99196 Length: 155 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950453;genbank:gi:119953654;genbank:GeneID:4643056 Probab=95.23 E-value=0.00015 Score=41.50 Aligned_cols=94 Identities=13% Similarity=0.156 Sum_probs=38.1 Q ss_pred ecccceeeehHHHHHHHHHHHHHhhC--CEE--EEE----------e-ccccc--------------------------- Q lcl|NC_019918. 24 ILASFSFKTDRRRLTSLIKRVEALDG--TTV--EVG----------F-FPEDR--------------------------- 61 (180) Q Consensus 24 M~~~v~~k~~~~~l~~l~~~l~~l~~--~~V--~VG----------i-~~~~~--------------------------- 61 (180) |...++|+.+...+.+.+++|....+ ..+ .|| | +++.. T Consensus 1 Ms~~i~i~~d~~~~~~~L~~l~~~~~d~~~l~~~ig~~l~~~~~~rF~pdG~~W~pls~~t~~~r~~~g~~~~~iL~~tg 80 (155) T protein:vir:99 1 MTTRIDVELDDQEVRQRLALLMRSVTDTLPVMRGIAAELLAETEFAFMDEGPGWPQLSPVTVAAREAKGRGPHPILQVTN 80 (155) T ss_pred CceEEEEEechHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhhccCCCCCCCChHHHHHHhccCCCCCCcchhch Confidence 55556666655443333333211100 000 000 1 11111 Q ss_pred -------CCCCCC----CCCHHHHHHHHhcCCC-------CCCCCcchhhHHHHHHH-HHHHHHHHHHHHHHHHhCCC Q lcl|NC_019918. 62 -------YGSENG----NLPVAQVAAYNEFGTT-------RNPTRPFMAPTFEEFTS-QFHYARLMKSTFENVIRDGR 120 (180) Q Consensus 62 -------~~~~~~----G~~vA~iA~i~EfGt~-------~IP~RpFlr~~~~~~~~-~~~~~~~~~~~~~~~l~g~~ 120 (180) |...++ |++ ..||++|+||.. +||+|||| ++++.-. ..+..+.+...+...+.-+. T Consensus 81 ~L~~Si~~~~~~~~v~vGtn-~~YA~iHqfGg~~~~~~~v~iPaRpfL--G~s~~~~l~~e~~~~I~~~i~~~l~~~~ 155 (155) T protein:vir:99 81 ALARSVTTWADRNEAGIGSN-LVYAAIHQFGGDAGRGHQVEIPARRYL--PFDENGQLAAGARQSILEIVLTALSRNR 155 (155) T ss_pred hhhhhhhceecCCEEEEecC-ccchhhhhcccccCCCCccccCCcccc--CCCCccccchHHHHHHHHHHHHHHhccC Confidence 000111 333 458999999953 89999999 3332100 00111223333333333333 No 95 >protein:vir:96829 Length: 135 # NCBI annotation: ORF033 # Family: family:all:180 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240161;genbank:gi:66395838;genbank:GeneID:5133170 Probab=95.15 E-value=2.4e-05 Score=45.95 Aligned_cols=82 Identities=20% Similarity=0.270 Sum_probs=36.7 Q ss_pred ecccc-eeeehHHHHHHHHH-------------------HHHHhhCCEEEEEecccc-cCCCCCCC-----CCHHHHHHH Q lcl|NC_019918. 24 ILASF-SFKTDRRRLTSLIK-------------------RVEALDGTTVEVGFFPED-RYGSENGN-----LPVAQVAAY 77 (180) Q Consensus 24 M~~~v-~~k~~~~~l~~l~~-------------------~l~~l~~~~V~VGi~~~~-~~~~~~~G-----~~vA~iA~i 77 (180) |.+.. .+..-.+.|+++.+ ..+.+. -|.=|-+..+ .+.-.++| .+.+.+|.+ T Consensus 1 Ma~~~~Gl~~l~~~l~~~~~~~~~~~~~al~~~a~~v~~~ak~~a--pvdTG~Lr~SI~~~~~~~g~~~~V~~~~~YA~~ 78 (135) T protein:vir:96 1 MAKVKYGADSIVVDLEKYSKDMEKWVKKGITKTTLKIYNTAIHLM--PVDTGFLRQSTTVDFENGGFTGVVKIGSNYAVY 78 (135) T ss_pred CchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC--CccchhhhcceeEEeecCcEEEEEecCCCccch Confidence 33210 00111111111111 111111 1222322222 01011222 345789999 Q ss_pred HhcCC---------------------------CCCCCCcchhhHHHHHHHHHHHHHHHH Q lcl|NC_019918. 78 NEFGT---------------------------TRNPTRPFMAPTFEEFTSQFHYARLMK 109 (180) Q Consensus 78 ~EfGt---------------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~ 109 (180) .|||+ +++|+||||++++++.+. .+.+.+. T Consensus 79 ve~GT~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~a~pfl~~A~~~~~~--~~~~~i~ 135 (135) T protein:vir:96 79 VNYGTGIYATKGSRAHKIPWTYKDPNGKWHTTYGQMPQPFWEPAIDAGRQ--TFEQYFS 135 (135) T ss_pred hhcccccccCCCccccccccccccCCcceeecCCcCCCcchhHHHHHHHH--HHHHhcC Confidence 99997 458999999999987533 3444433 No 96 >protein:vir:105916 Length: 149 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004379;genbank:gi:122891834;genbank:GeneID:4712387 Probab=95.04 E-value=3.6e-05 Score=44.96 Aligned_cols=93 Identities=20% Similarity=0.272 Sum_probs=37.6 Q ss_pred CCCCcccccchhhHHHHHhhhheecccce--eeehHHHHHHHHH-------------------HHHHhhCCEEEEEeccc Q lcl|NC_019918. 1 MQDGRSFTTSATPVLKTLALGVIILASFS--FKTDRRRLTSLIK-------------------RVEALDGTTVEVGFFPE 59 (180) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~M~~~v~--~k~~~~~l~~l~~-------------------~l~~l~~~~V~VGi~~~ 59 (180) ||-.--|..- -+|.+ |. +..-.+.|+++.+ ..+.+. -|.=|.+.. T Consensus 1 ~~~~~~~~~~-----------~~Ma~-v~~Gld~l~~~l~~~~~~~~~~~~~~l~~~a~~v~~~ak~~a--PvdTG~L~~ 66 (149) T protein:vir:10 1 MKLNYYDLSR-----------CHMAK-VKYGADSMVVELDKFDKKIEEWVKKGIAKTTTKIYNTAVALA--PVDLGFLEE 66 (149) T ss_pred Ceeeeeccch-----------hhhHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC--Ccccchhhc Confidence 3322211110 01221 10 0001111111111 111111 112222222 Q ss_pred cc-CCCCCCC-----CCHHHHHHHHhcCC-----------------------------CCCCCCcchhhHHHHHHHHHHH Q lcl|NC_019918. 60 DR-YGSENGN-----LPVAQVAAYNEFGT-----------------------------TRNPTRPFMAPTFEEFTSQFHY 104 (180) Q Consensus 60 ~~-~~~~~~G-----~~vA~iA~i~EfGt-----------------------------~~IP~RpFlr~~~~~~~~~~~~ 104 (180) .= .....+| .+.+.+|.+.|||| +++|||||||+++++.+. .+ T Consensus 67 SI~~~~~~~g~~~~V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~k~--~i 144 (149) T protein:vir:10 67 SIDFKYFDGGLSSVISVGADYAIYVEYGTGIYATGPGGSRATKIPWSFKGDDGEWYTTYGQAPQPFWNPAIDAGRK--TF 144 (149) T ss_pred cceEEecCCcEEEEEecCCCcccccccCccccccCCcccccccccceeeccccceecCCCCCCCcchhHHHHHHHH--HH Confidence 10 0011122 23478999999996 358999999999987543 34 Q ss_pred HHHHH Q lcl|NC_019918. 105 ARLMK 109 (180) Q Consensus 105 ~~~~~ 109 (180) .+.+. T Consensus 145 ~~~i~ 149 (149) T protein:vir:10 145 EQYFS 149 (149) T ss_pred HHhhC Confidence 44433 No 97 >protein:vir:5000 Length: 141 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049974;genbank:gi:9632946;genbank:GeneID:1262109 Probab=94.94 E-value=0.00011 Score=42.20 Aligned_cols=110 Identities=17% Similarity=0.088 Sum_probs=54.1 Q ss_pred CCCCcccccchhhHHHHHhhhheecccceeeehHHHHHHHHHHHHHhhC------------------------------- Q lcl|NC_019918. 1 MQDGRSFTTSATPVLKTLALGVIILASFSFKTDRRRLTSLIKRVEALDG------------------------------- 49 (180) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~M~~~v~~k~~~~~l~~l~~~l~~l~~------------------------------- 49 (180) |-| |.-.-.-.++.+.=-++.....+-+.+..+-+-+.+.|+.-.. T Consensus 1 M~~---~~~gl~e~~~~lekl~~~~~~~~~katkAGA~v~~~~L~~~tp~~hy~~~~~~~~~HlaD~I~~~~~~~DG~~d 77 (141) T protein:vir:50 1 MVG---LAEALDEWLKTVASIGNLTPAEQVEITTAGAKVFKKELEEVTREKHYSRKKNPKFGHMADGLAIQSTNADGRKN 77 (141) T ss_pred Ccc---HHHHHHHHHHHHHHhcCCCHHHHHHHHHHHHHHHHHHHHHhcccCCCCCCCCCCCCccccceeeccCccccccC Confidence 221 1111111222221111111111112222222223333332211 Q ss_pred CEEEEEecccccCCCCCCCCCHHHHHHHHhcCCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHH Q lcl|NC_019918. 50 TTVEVGFFPEDRYGSENGNLPVAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRDGRQVN 123 (180) Q Consensus 50 ~~V~VGi~~~~~~~~~~~G~~vA~iA~i~EfGt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~ 123 (180) -.+.|||... ..+.+|-+.|+||.++|+-||+..+..+...+.++.+.+...++++|.-...-+ T Consensus 78 g~s~VG~~~~----------~~~~~A~f~n~GT~k~~~~hFve~~~~~a~~k~~Vl~A~~~~~k~~l~~~~~~~ 141 (141) T protein:vir:50 78 GVSTVGWKNN----------YHAQNARRLNDGTKKYRADHFVTNVQNDSTVQKKVLLEKKRNTKNSLEEKEGCD 141 (141) T ss_pred CeeeeccCCC----------ccceeeeccccCccccCCCchhHHHHHhhhhHHHHHHHHHHHHHHHHHhccCCC Confidence 1223444211 127899999999999999999999997654456778888888888876532222 No 98 >protein:vir:96121 Length: 137 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240082;genbank:gi:66395767;genbank:GeneID:5133101 Probab=94.87 E-value=4.4e-05 Score=44.48 Aligned_cols=80 Identities=25% Similarity=0.380 Sum_probs=35.5 Q ss_pred ecccceeeeh---HHHHHHHHHH-------------------HHHhhCCEEEEEeccccc-CCCCCCC-----CCHHHHH Q lcl|NC_019918. 24 ILASFSFKTD---RRRLTSLIKR-------------------VEALDGTTVEVGFFPEDR-YGSENGN-----LPVAQVA 75 (180) Q Consensus 24 M~~~v~~k~~---~~~l~~l~~~-------------------l~~l~~~~V~VGi~~~~~-~~~~~~G-----~~vA~iA 75 (180) |.+.. +.. .+.|+++.+. .+.+.. |.-|-+...- .....+| .+.+.+| T Consensus 1 Ma~~~--~G~~~l~~~l~~~~~~~~~~~~~~l~~~a~~~~~~ak~~~p--vdTG~L~~Si~~~~~~~g~~~~V~~~~~YA 76 (137) T protein:vir:96 1 MAKVK--YGNWDLVAELEDYRDEMEEWVKKGILKTTLAIYNTAVALAP--VDLGFLKESIDFKVTDGGFSSVISVGAEYA 76 (137) T ss_pred CchhH--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--cCccchhcCceeEeecCceEEEEecCCCcc Confidence 33221 111 1111111111 111111 1122221110 0001111 2347899 Q ss_pred HHHhcCC-----------------------------CCCCCCcchhhHHHHHHHHHHHHHHHH Q lcl|NC_019918. 76 AYNEFGT-----------------------------TRNPTRPFMAPTFEEFTSQFHYARLMK 109 (180) Q Consensus 76 ~i~EfGt-----------------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~ 109 (180) .+.|||| +++|+||||++++++.+. .+.+.+. T Consensus 77 ~yvE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~pFl~pA~~~~~~--~i~k~i~ 137 (137) T protein:vir:96 77 IYVEFGTGIYATGPGGSRARKLPWTYKGDDGEWHTTYGQQAQPFWNPAIDEGRK--VFNRYFS 137 (137) T ss_pred cccccCccccccCCCccccccccceeeccCcceeecCCCCCCcchhHHHHHHHH--HHHHhhC Confidence 9999997 458999999999987543 3444443 No 99 >protein:vir:97327 Length: 116 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240615;genbank:gi:66396305;genbank:GeneID:5133683 Probab=94.68 E-value=3.4e-05 Score=45.10 Aligned_cols=77 Identities=17% Similarity=0.367 Sum_probs=36.9 Q ss_pred ecccceeeehHHHHHH----HHHHHHHhhCCEEEEEeccccc-CCCCCCCC-----CHHHHHHHHhcC------------ Q lcl|NC_019918. 24 ILASFSFKTDRRRLTS----LIKRVEALDGTTVEVGFFPEDR-YGSENGNL-----PVAQVAAYNEFG------------ 81 (180) Q Consensus 24 M~~~v~~k~~~~~l~~----l~~~l~~l~~~~V~VGi~~~~~-~~~~~~G~-----~vA~iA~i~EfG------------ 81 (180) |.. ..++-+.+ +.+..+.+.. |.=|-+...= +...++|+ +.+++|.+.||| T Consensus 1 v~~-----~v~~~~~~~~~~i~~~ak~~aP--v~TG~Lr~SI~~~~~~~~~~~~V~~~~~YA~yvE~GTg~~~~~~~~~~ 73 (116) T protein:vir:97 1 MER-----WVKRGIAKTTAKIHNTIISLMP--VDTGYLRESVTMDFKDGGFTGVINIGSEYAIYVNYGTGIYATGAGGSR 73 (116) T ss_pred ChH-----HHHHHHHHHHHHHHHHHHHhCC--cCcccccccceEEeecCcEEEEEecCCCcccccccCCcccccCCCccc Confidence 111 11222222 2233333322 2223222110 11112222 357899999999 Q ss_pred -----------------CCCCCCCcchhhHHHHHHHHHHHHHHHH Q lcl|NC_019918. 82 -----------------TTRNPTRPFMAPTFEEFTSQFHYARLMK 109 (180) Q Consensus 82 -----------------t~~IP~RpFlr~~~~~~~~~~~~~~~~~ 109 (180) ++++||||||++++++.+. .+.+.+. T Consensus 74 ~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~--~i~k~i~ 116 (116) T protein:vir:97 74 AKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRA--FFNKYFS 116 (116) T ss_pred ccccceeeecCCceeeecCCcCCCcchHHHHHHHHH--HHHHhhC Confidence 4569999999999977543 3333332 No 100 >protein:vir:1243 Length: 116 # NCBI annotation: similar to phage Spp1 gp16.1 # Family: family:all:180 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510942;genbank:gi:17426276;genbank:GeneID:927389 Probab=94.68 E-value=3.4e-05 Score=45.10 Aligned_cols=77 Identities=17% Similarity=0.367 Sum_probs=36.9 Q ss_pred ecccceeeehHHHHHH----HHHHHHHhhCCEEEEEeccccc-CCCCCCCC-----CHHHHHHHHhcC------------ Q lcl|NC_019918. 24 ILASFSFKTDRRRLTS----LIKRVEALDGTTVEVGFFPEDR-YGSENGNL-----PVAQVAAYNEFG------------ 81 (180) Q Consensus 24 M~~~v~~k~~~~~l~~----l~~~l~~l~~~~V~VGi~~~~~-~~~~~~G~-----~vA~iA~i~EfG------------ 81 (180) |.. ..++-+.+ +.+..+.+.. |.=|-+...= +...++|+ +.+++|.+.||| T Consensus 1 v~~-----~v~~~~~~~~~~i~~~ak~~aP--v~TG~Lr~SI~~~~~~~~~~~~V~~~~~YA~yvE~GTg~~~~~~~~~~ 73 (116) T protein:vir:12 1 MER-----WVKRGIAKTTAKIHNTIISLMP--VDTGYLRESVTMDFKDGGFTGVINIGSEYAIYVNYGTGIYATGAGGSR 73 (116) T ss_pred ChH-----HHHHHHHHHHHHHHHHHHHhCC--cCcccccccceEEeecCcEEEEEecCCCcccccccCCcccccCCCccc Confidence 111 11222222 2233333322 2223222110 11112222 357899999999 Q ss_pred -----------------CCCCCCCcchhhHHHHHHHHHHHHHHHH Q lcl|NC_019918. 82 -----------------TTRNPTRPFMAPTFEEFTSQFHYARLMK 109 (180) Q Consensus 82 -----------------t~~IP~RpFlr~~~~~~~~~~~~~~~~~ 109 (180) ++++||||||++++++.+. .+.+.+. T Consensus 74 ~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~--~i~k~i~ 116 (116) T protein:vir:12 74 AKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRA--FFNKYFS 116 (116) T ss_pred ccccceeeecCCceeeecCCcCCCcchHHHHHHHHH--HHHHhhC Confidence 4569999999999977543 3333332 No 101 >protein:vir:95062 Length: 116 # NCBI annotation: ORF044 # Family: family:all:180 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240827;genbank:gi:66394711;genbank:GeneID:5133856 Probab=94.49 E-value=3.5e-05 Score=45.00 Aligned_cols=77 Identities=17% Similarity=0.370 Sum_probs=36.3 Q ss_pred ecccceeeehHHHHHHHH----HHHHHhhCCEEEEEeccccc-CCCCCCCC-----CHHHHHHHHhcC------------ Q lcl|NC_019918. 24 ILASFSFKTDRRRLTSLI----KRVEALDGTTVEVGFFPEDR-YGSENGNL-----PVAQVAAYNEFG------------ 81 (180) Q Consensus 24 M~~~v~~k~~~~~l~~l~----~~l~~l~~~~V~VGi~~~~~-~~~~~~G~-----~vA~iA~i~EfG------------ 81 (180) |. +..++.+.+.. ...+.+.. |.=|-+...- +...++|+ +.+++|.+.||| T Consensus 1 v~-----~~v~~~~~~~~~~i~~~ak~~ap--v~TG~Lr~SI~~~~~~~~~~~~V~~~~~Ya~yvE~GTg~~~~~~~~~~ 73 (116) T protein:vir:95 1 ME-----RWVKRGIAKTTAKIHNTIISLMP--VDTGYLRESVTMDFKDGGFTGVINIGSEYAIYVNYGTGIYATGAGGSR 73 (116) T ss_pred Ch-----HHHHHHHHHHHHHHHHHHHhhCC--ccccccccceeEEeecCcEEEEEecCCCccceeecCccccccCCCccc Confidence 11 11122222222 23333322 2223222110 11112222 357799999999 Q ss_pred -----------------CCCCCCCcchhhHHHHHHHHHHHHHHHH Q lcl|NC_019918. 82 -----------------TTRNPTRPFMAPTFEEFTSQFHYARLMK 109 (180) Q Consensus 82 -----------------t~~IP~RpFlr~~~~~~~~~~~~~~~~~ 109 (180) ++.+||||||++++++.+. .+.+.+. T Consensus 74 ~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~--~i~k~is 116 (116) T protein:vir:95 74 AKNIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRA--FFNKYFS 116 (116) T ss_pred cccccceeecCccceeeCCCCCCCcchHHHHHHHHH--HHHHhhC Confidence 4469999999999977543 3333332 No 102 >protein:vir:94108 Length: 149 # NCBI annotation: ORF029 # Family: family:all:180 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240238;genbank:gi:66395914;genbank:GeneID:5133277 Probab=94.29 E-value=5.4e-05 Score=44.00 Aligned_cols=95 Identities=19% Similarity=0.246 Sum_probs=37.3 Q ss_pred CCCCcccccchhhHHHHHhhhheecccceeeehHHHHHHHH-------------------HHHHHhhCCEEEEEeccccc Q lcl|NC_019918. 1 MQDGRSFTTSATPVLKTLALGVIILASFSFKTDRRRLTSLI-------------------KRVEALDGTTVEVGFFPEDR 61 (180) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~M~~~v~~k~~~~~l~~l~-------------------~~l~~l~~~~V~VGi~~~~~ 61 (180) ||----|.. ... | +.+..++ ..-.+.|+++. +..+.+.. |.-|-+...= T Consensus 1 ~~~~~~~~~------~~~-M-a~~~~Gl--d~l~~~L~~~~~~~~~~~~~al~~~a~~v~~~ak~~aP--vdTG~Lr~SI 68 (149) T protein:vir:94 1 MKLSYYDLS------RCH-M-AKVKYGA--DSMVVELDKFDKKIEEWVKKGIAKTTTKIYNTAVALAP--VDLGFLEESI 68 (149) T ss_pred Ceeeeeecc------hhh-H-HHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--cccchhhcCe Confidence 332211111 011 1 1111111 01111111111 11111111 1222222110 Q ss_pred -CCCCCCC-----CCHHHHHHHHhcCC-----------------------------CCCCCCcchhhHHHHHHHHHHHHH Q lcl|NC_019918. 62 -YGSENGN-----LPVAQVAAYNEFGT-----------------------------TRNPTRPFMAPTFEEFTSQFHYAR 106 (180) Q Consensus 62 -~~~~~~G-----~~vA~iA~i~EfGt-----------------------------~~IP~RpFlr~~~~~~~~~~~~~~ 106 (180) .....+| .+.+.+|.+.|||| +++||||||++++++.+ ..+.+ T Consensus 69 ~~~~~~~g~~~~V~~~~~YA~~VE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~~--~~i~~ 146 (149) T protein:vir:94 69 DFKYFDGGLSSVISVGADYAIYVEYGTGIYATGPGGSRATKIPWSFKGDDGEWYTTYGQAPQPFWNPAIDAGR--KTFEQ 146 (149) T ss_pred eEEeeCCcEEEEEecCCCcccccccCccccccCCCccccccccceeecCccceecCCCCCCCcchHHHHHHHH--HHHHH Confidence 0011122 23478999999996 45899999999998743 23444 Q ss_pred HHH Q lcl|NC_019918. 107 LMK 109 (180) Q Consensus 107 ~~~ 109 (180) .+. T Consensus 147 ~i~ 149 (149) T protein:vir:94 147 YFS 149 (149) T ss_pred hhC Confidence 333 No 103 >protein:vir:1838 Length: 149 # NCBI annotation: O protein # Family: family:all:370 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052262;genbank:gi:9634069;genbank:GeneID:1262457 Probab=94.19 E-value=0.0005 Score=38.69 Aligned_cols=81 Identities=9% Similarity=0.018 Sum_probs=55.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhc-----C--CCCCcHHHHHhcCC--CCCchhH Q lcl|NC_019918. 95 FEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDD-----Y--PGSNSPAWAAYKGF--NDPLFHT 165 (180) Q Consensus 95 ~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~-----~--~pPnsp~Ti~~KG~--~~PLIDT 165 (180) +++.. .+.+.+...+.++- ......+|..||..+....++.|.+ | |+|+++.|++.|.. .++|..+ T Consensus 1 m~~~~---~~~~~l~~ll~~L~--~~~~~~l~r~Ig~~l~~~t~~rf~~q~~PdG~~W~p~~~~~~~~~~g~~~~~~~~~ 75 (149) T protein:vir:18 1 MSELT---ALQERLAGLIASLS--PAARRKMAAEIAKKLRTSQQQRIKRQQAPDGTPYAARKRQPVRSKKGRIKREMFAK 75 (149) T ss_pred CchHH---HHHHHHHHHHHhcC--CchHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchhhhhhccCcccchhhhh Confidence 33321 12233333333321 1234678999999999999999974 3 34899999987643 5789999 Q ss_pred HHHHhhhhhheeccC Q lcl|NC_019918. 166 GKMLESVKFQIHRRQ 180 (180) Q Consensus 166 G~L~~SIty~V~~k~ 180 (180) +.+.+++.+.+.... T Consensus 76 l~~~~~l~~~~~~~~ 90 (149) T protein:vir:18 76 LRTSRFMKAKGSDSA 90 (149) T ss_pred hhhhhhhheeecCce Confidence 999999998887777 No 104 >protein:vir:4956 Length: 153 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049932;genbank:gi:9632903;genbank:GeneID:1262079 Probab=94.06 E-value=9e-05 Score=42.75 Aligned_cols=122 Identities=18% Similarity=0.058 Sum_probs=52.7 Q ss_pred CCCCcccccchhhHHHHHhhhheecccceeeehHHHHHHHHHHHHHhhC------------------------------- Q lcl|NC_019918. 1 MQDGRSFTTSATPVLKTLALGVIILASFSFKTDRRRLTSLIKRVEALDG------------------------------- 49 (180) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~M~~~v~~k~~~~~l~~l~~~l~~l~~------------------------------- 49 (180) |-| |.-...-.++.+.--........-+...++-+-+.+.|+.-.. T Consensus 1 M~~---~~~glee~~~~lekL~~~~~~~~~katkAGA~v~~e~L~~~tp~~h~~~~kt~~~~HlaD~I~~s~~~idG~~d 77 (153) T protein:vir:49 1 MTG---LDEALEGWLKTVASIGDLTPAEQAKITTAGAKVFKEELAEVTREKHYSKKKDLKYGHMADGLAVQSTNADGRKN 77 (153) T ss_pred Ccc---HHHHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCCCCCCcccccceecccccccccc Confidence 322 1111111222221111111111112222332233333332211 Q ss_pred CEEEEEecccccCCCCCCCCCHHHHHHHHhcCCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHH Q lcl|NC_019918. 50 TTVEVGFFPEDRYGSENGNLPVAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKL 129 (180) Q Consensus 50 ~~V~VGi~~~~~~~~~~~G~~vA~iA~i~EfGt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~i 129 (180) ..+.|||.. -..+.+|-+.|+||.++|+.||++.+.++...+.++.+.+...++++|.....+ T Consensus 78 G~s~VG~~~----------~~~a~~a~f~n~GT~km~~~hFie~tr~e~~~k~~vl~A~~~~~~~il~~~~~~------- 140 (153) T protein:vir:49 78 GVSTVGWKN----------NYHAQNARRLNDGTKKYRADHFITNVQNDSTVKNKVLLAEKEEYEKLIRRKGGV------- 140 (153) T ss_pred ceeeecccC----------CccceeeeecccCcccCCCChhhHHHHHHhhHHHHHHHHHHHHHHHHHHhcCCe------- Confidence 122233321 113678999999999999999999998775445566666666666666543211 Q ss_pred HHHHHHHHHHHHhcCCCCCcHHHHHhcCCC Q lcl|NC_019918. 130 GRMVAEQMQVNIDDYPGSNSPAWAAYKGFN 159 (180) Q Consensus 130 G~~~~~~Iq~~I~~~~pPnsp~Ti~~KG~~ 159 (180) -+|.+..+-|... T Consensus 141 -----------------~~~~~~~~~~~~~ 153 (153) T protein:vir:49 141 -----------------YLSASNFKTKRAT 153 (153) T ss_pred -----------------eeeccccccccCC Confidence 0001101101000 No 105 >protein:vir:4833 Length: 140 # NCBI annotation: ORF29 # Family: family:all:1029 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038330;genbank:gi:9634656;genbank:GeneID:1262624 Probab=94.01 E-value=0.00026 Score=40.27 Aligned_cols=117 Identities=16% Similarity=0.135 Sum_probs=54.7 Q ss_pred CCCCcccccchhhHHHHHhhhheecccceeeehHHHHHHHHHHHHHhhCC-------EEEEEecccc------cCCCCCC Q lcl|NC_019918. 1 MQDGRSFTTSATPVLKTLALGVIILASFSFKTDRRRLTSLIKRVEALDGT-------TVEVGFFPED------RYGSENG 67 (180) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~M~~~v~~k~~~~~l~~l~~~l~~l~~~-------~V~VGi~~~~------~~~~~~~ 67 (180) |-| |.-...-.++.+.=-........-+.+..+-+-+.++|+.-... .-..|=.+|. +-..+.+ T Consensus 1 M~~---~~d~l~e~~~~v~kl~~~~~~~~~katkAGAkv~~~~L~~~tp~~h~~~r~t~~~~HlaD~I~~~~~~idg~~d 77 (140) T protein:vir:48 1 MTG---LDEALEGWLKTVASIGDLTPAEQAKITTAGAKVFKKELAEVTREKHYSKKKDLKYGHMADGLAVQSTNVDGRKN 77 (140) T ss_pred Ccc---HHHHHHHHHHHHHHhccCCHHHHHHHHHHhHHHHHHHHHHhcccCCCCCCCCCCCCcccccceecccccccccc Confidence 222 22112222222221111111122222233333333444333210 0011100000 0000111 Q ss_pred CC--------CHHHHHHHHhcCCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHh--CCC Q lcl|NC_019918. 68 NL--------PVAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIR--DGR 120 (180) Q Consensus 68 G~--------~vA~iA~i~EfGt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~--g~~ 120 (180) |. ..|.+|-+.++||..+|+.||+..+.++.+.+.++.+.+...+++++. |+. T Consensus 78 G~s~VG~~k~~~a~~a~f~NdGT~k~~~~hFve~t~~e~~~~~~vl~A~~~~y~~~l~kk~~~ 140 (140) T protein:vir:48 78 GVATVGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSAVRDKVLLAEKEEYEKLIRKKGGE 140 (140) T ss_pred cceeecccCCCceeEEeecccCccccCCCchHHHHHHhhhhHHHHHHHHHHHHHHHHHhhcCC Confidence 11 137899999999999999999999998765566777878888888873 332 No 106 >protein:vir:79115 Length: 148 # NCBI annotation: tail completion protein gpS # Family: family:all:370 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165266;genbank:gi:145708091;genbank:GeneID:5247126 Probab=93.54 E-value=0.00098 Score=37.07 Aligned_cols=81 Identities=10% Similarity=0.101 Sum_probs=56.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhc-----C--CCCCcHHHHHhcCC-CCCchhHH Q lcl|NC_019918. 95 FEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDD-----Y--PGSNSPAWAAYKGF-NDPLFHTG 166 (180) Q Consensus 95 ~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~-----~--~pPnsp~Ti~~KG~-~~PLIDTG 166 (180) +++. +++.+.+...+.++ . ..+-..+|..||..+....++.|.+ | |+|+++.|.++||. .++|.+++ T Consensus 1 m~~~---~~l~~~L~~ll~~l-~-~~~~~~l~r~Ig~~l~~st~~Rf~~q~~PDG~~W~p~s~~~~~~~g~~~~~~~~~l 75 (148) T protein:vir:79 1 MSES---RELEAWLAGMLTKL-D-APARRMLARAVAAELRRRQAARIAEQRNPDGSPYVPRKPQLRHRAGRIRRAMFMRL 75 (148) T ss_pred CccH---HHHHHHHHHHHHhc-C-ChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCcCcccchHHHhhcccccccccchh Confidence 3322 12333333333332 1 1233578999999999999999973 3 23899999999986 47899999 Q ss_pred HHHhhhhhheeccC Q lcl|NC_019918. 167 KMLESVKFQIHRRQ 180 (180) Q Consensus 167 ~L~~SIty~V~~k~ 180 (180) .+..++++.+.... T Consensus 76 ~~~~~l~~~~~~~~ 89 (148) T protein:vir:79 76 RLARYMKTQADANT 89 (148) T ss_pred hhhhheeeeeeCCe Confidence 99999988887666 No 107 >protein:vir:81067 Length: 119 # NCBI annotation: p12 # Family: family:all:2714 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285682;genbank:gi:156535145;genbank:GeneID:5247112 Probab=93.31 E-value=0.00021 Score=40.79 Aligned_cols=83 Identities=19% Similarity=0.285 Sum_probs=39.2 Q ss_pred CCCCcccccchhhHHHHHhhhheecccceeeehHHHHHHHHHHH----H----HhhCCEEEEEecccccCCCCCCCCCHH Q lcl|NC_019918. 1 MQDGRSFTTSATPVLKTLALGVIILASFSFKTDRRRLTSLIKRV----E----ALDGTTVEVGFFPEDRYGSENGNLPVA 72 (180) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~M~~~v~~k~~~~~l~~l~~~l----~----~l~~~~V~VGi~~~~~~~~~~~G~~vA 72 (180) +.|. +-..++-.++ .|.+.| . .-+...-.|||-.-+. T Consensus 1 ~rDe-------------akarv~~~~G-----------~Lr~sIY~ay~~~~S~dG~~~Y~Vswn~rkA----------- 45 (119) T protein:vir:81 1 MRES-------------AKAFVNDETG-----------KLRSNLYVAYSPEESTNGVQTYAVSWRKKAA----------- 45 (119) T ss_pred CCcc-------------cccccCCCcc-----------chhhhheeeeccccCCCCeEEEEeeccCCcC----------- Confidence 1110 0000110000 122222 0 0011234455554332 Q ss_pred HHHHHHhcC------------------------CCCCCCCcchhhHHHHHHHHHHHHHHHHH----HHHHHHhCCC Q lcl|NC_019918. 73 QVAAYNEFG------------------------TTRNPTRPFMAPTFEEFTSQFHYARLMKS----TFENVIRDGR 120 (180) Q Consensus 73 ~iA~i~EfG------------------------t~~IP~RpFlr~~~~~~~~~~~~~~~~~~----~~~~~l~g~~ 120 (180) --+.+.||| ...+|+|||||+++|....+ ..+.+.+ .+.+++.|.. T Consensus 46 PhghlvE~Ghw~~~~~~~~~dG~w~~~~~~l~~~~~vPa~pFlRpA~da~~~~--a~~~~~~r~~~rv~Ev~rg~~ 119 (119) T protein:vir:81 46 PHGHLLEFGHWQTHAAYKGKDGEWYSSSVKLVNPKWIPARPFLRPGYDSVAMQ--IPDIAKAAGAKKYAELQRGEQ 119 (119) T ss_pred CcccccccceeeeeeeeeccCceeeecCccccCceecCCCCccchhHHHHHHH--HHHHHHHHHHHHHHHHhccCC Confidence 123445888 34699999999999865432 3344444 4778888875 No 108 >protein:vir:10367 Length: 119 # NCBI annotation: conserved phage protein # Family: family:all:2714 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858959;genbank:gi:32128424;genbank:GeneID:2648366 Probab=93.09 E-value=0.00024 Score=40.48 Aligned_cols=83 Identities=19% Similarity=0.285 Sum_probs=39.0 Q ss_pred CCCCcccccchhhHHHHHhhhheecccceeeehHHHHHHHHHHH----HH----hhCCEEEEEecccccCCCCCCCCCHH Q lcl|NC_019918. 1 MQDGRSFTTSATPVLKTLALGVIILASFSFKTDRRRLTSLIKRV----EA----LDGTTVEVGFFPEDRYGSENGNLPVA 72 (180) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~M~~~v~~k~~~~~l~~l~~~l----~~----l~~~~V~VGi~~~~~~~~~~~G~~vA 72 (180) +.|. +-..++-.++ .|.+.| .. -+...-.|||-.-+. T Consensus 1 ~rDe-------------akarv~~~~G-----------~Lr~sIY~ay~~~~S~dG~~~Y~Vswn~rkA----------- 45 (119) T protein:vir:10 1 MRES-------------AKAFVNDETG-----------KLRSNLYVAYSTEESTNGVQTYAVSWRKKAA----------- 45 (119) T ss_pred CCcc-------------cccccCCCcc-----------chhhhheeeeccccCCCCEEEEEeecCCCcC----------- Confidence 1110 0000111000 122222 00 011234455554332 Q ss_pred HHHHHHhcC------------------------CCCCCCCcchhhHHHHHHHHHHHHHHHHH----HHHHHHhCCC Q lcl|NC_019918. 73 QVAAYNEFG------------------------TTRNPTRPFMAPTFEEFTSQFHYARLMKS----TFENVIRDGR 120 (180) Q Consensus 73 ~iA~i~EfG------------------------t~~IP~RpFlr~~~~~~~~~~~~~~~~~~----~~~~~l~g~~ 120 (180) --+.+.||| ...+|+|||||+++|....+ ..+.+.+ .+.+++.|.. T Consensus 46 PhghlvE~Ghw~~~~~~~~~dG~w~~~~~~l~~~~~vPa~pFlRpA~da~~~~--a~~~~~~r~~~rv~Ev~rg~~ 119 (119) T protein:vir:10 46 PHGHLLEFGHWQTHAAYKGKDGEWYSSSVKLVNPKWIPARPFLRPGYDSVAMQ--IPDIAKAAGAKKYAELQRGEQ 119 (119) T ss_pred CcccccccceeeeeeeeeccCceeeecCccccCceecCCCCccchhHHHHHHH--HHHHHHHHHHHHHHHHhccCC Confidence 123445888 23699999999999865432 3344444 4778888875 No 109 >protein:vir:81147 Length: 126 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285816;genbank:gi:148747737;genbank:GeneID:5247190 Probab=93.04 E-value=0.00099 Score=37.05 Aligned_cols=86 Identities=16% Similarity=0.311 Sum_probs=41.5 Q ss_pred ecccceeeehH----HHHHH-------------------HHHHHHHhhCCEEEEE-----ecccccCCCCCCC------- Q lcl|NC_019918. 24 ILASFSFKTDR----RRLTS-------------------LIKRVEALDGTTVEVG-----FFPEDRYGSENGN------- 68 (180) Q Consensus 24 M~~~v~~k~~~----~~l~~-------------------l~~~l~~l~~~~V~VG-----i~~~~~~~~~~~G------- 68 (180) |. +|++-.-. +.|+. +.+.+++... ++-| |-..... +.++ T Consensus 1 Ma-~i~id~la~~I~~~L~~y~~~v~~~v~~~v~~~a~~~~~~ik~~aP--~rTG~y~ksw~vk~~~--~~g~~~~vv~~ 75 (126) T protein:vir:81 1 MA-NITIDRLADELLQAVKEYTDDVAEGVRKKVDETARKVLKEAQALAP--KRTGEYARTFTITKED--GYGTTKRIIWN 75 (126) T ss_pred Cc-ccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCC--cccchhhccccccccc--cCCcceEEEec Confidence 33 34333211 11221 1122222221 1112 2111110 0011 Q ss_pred CCHHHHHHHHhcCCCC-----CCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_019918. 69 LPVAQVAAYNEFGTTR-----NPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRDGR 120 (180) Q Consensus 69 ~~vA~iA~i~EfGt~~-----IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g~~ 120 (180) .+-..++-+.|||... +|+||||+|+++.. .+.+.+.+++++.|+. T Consensus 76 ~~~~~l~HLLEfGha~r~gGrV~a~Phi~Pa~e~~------~~~~~~~i~~~l~~gg 126 (126) T protein:vir:81 76 KKHYRRVHLLEFGHAKVNGGRVKEYPHLRPAYDKH------GARLPDELKRVIENGG 126 (126) T ss_pred cCCCCceeeeecceecCCCCccCCCcchHHHHHHH------HHHHHHHHHHHhhcCC Confidence 1113467789999753 89999999998653 3456778888888876 No 110 >protein:vir:4859 Length: 140 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049399;genbank:gi:9632427;genbank:GeneID:1258496 Probab=92.89 E-value=0.00055 Score=38.47 Aligned_cols=109 Identities=17% Similarity=0.091 Sum_probs=52.3 Q ss_pred CCCCcccccchhhHHHHHhhhheecccceeeehHHHHHHHHHHHHHhhC------------------------------- Q lcl|NC_019918. 1 MQDGRSFTTSATPVLKTLALGVIILASFSFKTDRRRLTSLIKRVEALDG------------------------------- 49 (180) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~M~~~v~~k~~~~~l~~l~~~l~~l~~------------------------------- 49 (180) |-| |.-...-.++.+.=-+.......-+.+..+-+-+.+.|+.-.. T Consensus 1 M~~---~~d~l~e~~~~lekl~~~~~~~~~katkAGA~v~~~~L~~~tp~~h~~~~~t~~~~HlaD~I~~~~~~iDg~~~ 77 (140) T protein:vir:48 1 MTG---LDEALEGWLKTVASIGDLTPAEQAKITTAGAKVFKEELAEVTRQKHYSNKKHLKYGHMADGLSVQSTNVDGRKN 77 (140) T ss_pred Ccc---HHHHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCCCCCCcchhceeecccccccccC Confidence 221 1111111111111111111111111222222222233332211 Q ss_pred CEEEEEecccccCCCCCCCCCHHHHHHHHhcCCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHH Q lcl|NC_019918. 50 TTVEVGFFPEDRYGSENGNLPVAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRDGRQVN 123 (180) Q Consensus 50 ~~V~VGi~~~~~~~~~~~G~~vA~iA~i~EfGt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~ 123 (180) ..+.|||.. -..+.+|-+.++||.++|+-||+..+.++...+.++.+.+...++++|.-. ..+ T Consensus 78 g~s~VG~~k----------k~~a~~A~f~n~GT~k~~~~hFve~~~~e~~~k~~vl~A~~~~~~~~l~~~-~~~ 140 (140) T protein:vir:48 78 GVSTVGWVN----------RYHAQNARRLNDGTKKYRADHFVTNVQNDSAVQTKVLLAEKEEYEKLIRKK-GGE 140 (140) T ss_pred ceeeeccCC----------CcceeeeeccccCccccCCCchhHHHHHhhhhHHHHHHHHHHHHHHHHHhh-cCC Confidence 122233321 124789999999999999999999999875555567777777777777653 112 No 111 >protein:vir:78755 Length: 228 # NCBI annotation: putative tail completion protein # Family: family:all:743 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285651;genbank:gi:148727157;genbank:GeneID:5220102 Probab=92.32 E-value=0.0012 Score=36.54 Aligned_cols=97 Identities=16% Similarity=0.190 Sum_probs=48.4 Q ss_pred CCCCcccccchhhHHHHHhhhheecccceeeehHHHHHHHHHHHH--HhhCCEEEEEecccccCCCCCCCCCHHHHHHHH Q lcl|NC_019918. 1 MQDGRSFTTSATPVLKTLALGVIILASFSFKTDRRRLTSLIKRVE--ALDGTTVEVGFFPEDRYGSENGNLPVAQVAAYN 78 (180) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~M~~~v~~k~~~~~l~~l~~~l~--~l~~~~V~VGi~~~~~~~~~~~G~~vA~iA~i~ 78 (180) -.||-+|.---- ...+.|.+|.+-|+ +.....+.|||..+.. ...++.||++| T Consensus 49 ~PDGs~~~pRKr-------------------~krKMl~~L~k~Lk~~~~~~~~a~v~f~~~~~------~~~~~rIA~vH 103 (228) T protein:vir:78 49 DPNGNAWAPRKR-------------------GKRKMLRGLPKLLQIREPRQDMAELGFTKGTM------SAHAGVIANTH 103 (228) T ss_pred CCCCCCChhhhh-------------------hHHHHHhhhHHhhhhhcccccceEEEeecCcc------cchHHHHHHHH Confidence 668888765431 12234444444443 2334578999964321 12478899999 Q ss_pred hcCC---------------------------------------------------------------------------- Q lcl|NC_019918. 79 EFGT---------------------------------------------------------------------------- 82 (180) Q Consensus 79 EfGt---------------------------------------------------------------------------- 82 (180) +||- T Consensus 104 q~G~~~~v~~~~~~~~~~~r~~~~~paTr~QAk~Lr~lGy~~~~~~~k~~rkps~kwI~~nls~gqAgliir~L~~k~~k 183 (228) T protein:vir:78 104 QKGHTYKVTAASRRRIAPSDVGKNKQASKAQARKLRELGFKRPGKRKRAYRSASLGWITANLNYAQAGLLIKKLKDEPVK 183 (228) T ss_pred hcCcccccccchhhhhhcccCCCCCCCCHHHHHHHHHhhccccCCcCCCcccCCHHHHHHHhhHHHHHHHHHHHhCCCCc Confidence 9991 Q ss_pred ----CCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhCC-CCHHHHHHHHHHHHHHHHHHHHhcCCCCCcHHHHHhc Q lcl|NC_019918. 83 ----TRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRDG-RQVNTLLKKLGRMVAEQMQVNIDDYPGSNSPAWAAYK 156 (180) Q Consensus 83 ----~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g~-~~~~~~L~~iG~~~~~~Iq~~I~~~~pPnsp~Ti~~K 156 (180) +.+|+||||- ++. .++.+.+...+..+--|. ..+.+ |+.| T Consensus 184 ~~W~I~~PaR~FLG--~s~----~e~~~~l~~~l~~i~~g~~~~~qd----------------------------~~~~ 228 (228) T protein:vir:78 184 ESWEIQLPARPFLG--ANA----RQRQQAFALRPESIDYGWDVNKQD----------------------------MKGK 228 (228) T ss_pred cceeeecCcccccC--CCH----HHHHHHHHHHHHhcccCCCcchhh----------------------------ccCC Confidence 2467777763 221 123344444444443331 12211 2222 No 112 >protein:vir:3787 Length: 231 # NCBI annotation: orf22 # Family: family:all:743 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536827;genbank:gi:17981836;genbank:GeneID:929215 Probab=92.20 E-value=0.0027 Score=34.63 Aligned_cols=91 Identities=13% Similarity=0.237 Sum_probs=52.0 Q ss_pred CCCCcccccchhhHHHHHhhhheecccceeeehHHHHHHHHHHHH--HhhCCEEEEEecccccCCCCCCCCCHHHHHHHH Q lcl|NC_019918. 1 MQDGRSFTTSATPVLKTLALGVIILASFSFKTDRRRLTSLIKRVE--ALDGTTVEVGFFPEDRYGSENGNLPVAQVAAYN 78 (180) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~M~~~v~~k~~~~~l~~l~~~l~--~l~~~~V~VGi~~~~~~~~~~~G~~vA~iA~i~ 78 (180) -.||-.|.--- - -. + ..| ..+.|.++.+.+. ..++....++++.+ .++.||++| T Consensus 53 ~PDGs~w~pRK-----~------~~-~-k~k-~~rm~~kL~~~~~~~~~~~~~~~~~~~~g----------~~~~IA~vH 108 (231) T protein:vir:37 53 SPDGTAWEKRK-----P------VD-G-EIK-NKRLLKKVLRYASILAEERGKGRIYYKNP----------LTGEIAQKQ 108 (231) T ss_pred CCCCCcCchhc-----c------cc-c-chh-hHHHHHHhHHhhccccccCCceEEeeecc----------hHHHHHHHh Confidence 66888776421 0 00 0 000 1134444444332 22333344554422 257899999 Q ss_pred hcC----------------------------------------------------------------------------- Q lcl|NC_019918. 79 EFG----------------------------------------------------------------------------- 81 (180) Q Consensus 79 EfG----------------------------------------------------------------------------- 81 (180) +|| T Consensus 109 Q~G~~~rv~~~~~~~~~~~~~~~pATr~QAk~Lr~lGy~v~~~k~k~~k~~~rkps~kwI~~~ls~~qAgliIR~L~~k~ 188 (231) T protein:vir:37 109 QDGFTEHFRVFATDKNKNGSGNDRATIRQAQKLRSLGYRKRNGKNRQGKTKYRLYTIKEIRERLTRTWASMEIRRLENKV 188 (231) T ss_pred hcCcccccchhhhhhccCCCCCCCCCHHHHHHHHHhcccccCCCCCCCCCCcCcCCHHHHHHhhhhHHHHHHHHHHhccc Confidence 999 Q ss_pred ---------CCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhCCCC Q lcl|NC_019918. 82 ---------TTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRDGRQ 121 (180) Q Consensus 82 ---------t~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~ 121 (180) .+.+|+||||-..- +++.+++...+.+++.|... T Consensus 189 ~~~~~k~~W~I~~paR~FLG~~~------~e~~~~l~~~l~~i~~~~~~ 231 (231) T protein:vir:37 189 NAGNGKTNWEIHVPARPFLDTRE------KENVDILREITLKFLSGEYK 231 (231) T ss_pred ccccCcceeeeecCcccccCCCH------HHHHHHHHHHHHHHhcccCC Confidence 02489999994332 34677888888899888766 No 113 >protein:vir:97427 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240753;genbank:gi:66396447;genbank:GeneID:5133783 Probab=91.56 E-value=0.00036 Score=39.49 Aligned_cols=64 Identities=17% Similarity=0.178 Sum_probs=32.4 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCcHHHHHhcCCCCCchhHHHHHh Q lcl|NC_019918. 91 MAPTFEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLE 170 (180) Q Consensus 91 lr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~~~pPnsp~Ti~~KG~~~PLIDTG~L~~ 170 (180) |-..+ .+.+++.+.++..-.++- ..+.++++..+..+++.+|.. .| +|||.|++ T Consensus 1 Ma~~~---~g~~~l~~~l~~~~~~~~---~~~~~~~~~~a~~i~~~ak~~-------------------aP-vdTG~Lr~ 54 (137) T protein:vir:97 1 MAKVK---YGNWDLVKELENYERDME---RWVKRGIAKTTAKIHNTIISL-------------------MP-VDTGYLRE 54 (137) T ss_pred CchhH---HhHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHh-------------------CC-ccccchhc Confidence 43322 133333333433332221 123444444444444444432 23 69999999 Q ss_pred hhhhheeccC Q lcl|NC_019918. 171 SVKFQIHRRQ 180 (180) Q Consensus 171 SIty~V~~k~ 180 (180) ||++++.... T Consensus 55 SI~~~~~~~~ 64 (137) T protein:vir:97 55 SVTMDFKDSG 64 (137) T ss_pred cceeEeecCc Confidence 9999887655 No 114 >protein:vir:94490 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240680;genbank:gi:66396374;genbank:GeneID:5133754 Probab=91.56 E-value=0.00036 Score=39.49 Aligned_cols=64 Identities=17% Similarity=0.178 Sum_probs=32.4 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCcHHHHHhcCCCCCchhHHHHHh Q lcl|NC_019918. 91 MAPTFEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLE 170 (180) Q Consensus 91 lr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~~~pPnsp~Ti~~KG~~~PLIDTG~L~~ 170 (180) |-..+ .+.+++.+.++..-.++- ..+.++++..+..+++.+|.. .| +|||.|++ T Consensus 1 Ma~~~---~g~~~l~~~l~~~~~~~~---~~~~~~~~~~a~~i~~~ak~~-------------------aP-vdTG~Lr~ 54 (137) T protein:vir:94 1 MAKVK---YGNWDLVKELENYERDME---RWVKRGIAKTTAKIHNTIISL-------------------MP-VDTGYLRE 54 (137) T ss_pred CchhH---HhHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHh-------------------CC-ccccchhc Confidence 43322 133333333433332221 123444444444444444432 23 69999999 Q ss_pred hhhhheeccC Q lcl|NC_019918. 171 SVKFQIHRRQ 180 (180) Q Consensus 171 SIty~V~~k~ 180 (180) ||++++.... T Consensus 55 SI~~~~~~~~ 64 (137) T protein:vir:94 55 SVTMDFKDSG 64 (137) T ss_pred cceeEeecCc Confidence 9999887655 No 115 >protein:vir:93738 Length: 137 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240463;genbank:gi:66396153;genbank:GeneID:5133507 Probab=91.56 E-value=0.00036 Score=39.49 Aligned_cols=64 Identities=17% Similarity=0.178 Sum_probs=32.4 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCcHHHHHhcCCCCCchhHHHHHh Q lcl|NC_019918. 91 MAPTFEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLE 170 (180) Q Consensus 91 lr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~~~pPnsp~Ti~~KG~~~PLIDTG~L~~ 170 (180) |-..+ .+.+++.+.++..-.++- ..+.++++..+..+++.+|.. .| +|||.|++ T Consensus 1 Ma~~~---~g~~~l~~~l~~~~~~~~---~~~~~~~~~~a~~i~~~ak~~-------------------aP-vdTG~Lr~ 54 (137) T protein:vir:93 1 MAKVK---YGNWDLVKELENYERDME---RWVKRGIAKTTAKIHNTIISL-------------------MP-VDTGYLRE 54 (137) T ss_pred CchhH---HhHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHh-------------------CC-ccccchhc Confidence 43322 133333333433332221 123444444444444444432 23 69999999 Q ss_pred hhhhheeccC Q lcl|NC_019918. 171 SVKFQIHRRQ 180 (180) Q Consensus 171 SIty~V~~k~ 180 (180) ||++++.... T Consensus 55 SI~~~~~~~~ 64 (137) T protein:vir:93 55 SVTMDFKDSG 64 (137) T ss_pred cceeEeecCc Confidence 9999887655 No 116 >protein:vir:100887 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358767;genbank:gi:77999993;genbank:GeneID:3726158 Probab=91.25 E-value=0.00084 Score=37.46 Aligned_cols=106 Identities=13% Similarity=0.117 Sum_probs=48.4 Q ss_pred cccccchhhHHHHHhhhheecccceeeehHHHHHHHHHHHHHhhC--------------------------------CEE Q lcl|NC_019918. 5 RSFTTSATPVLKTLALGVIILASFSFKTDRRRLTSLIKRVEALDG--------------------------------TTV 52 (180) Q Consensus 5 ~~~~~~~~~~~~~~~~~~~M~~~v~~k~~~~~l~~l~~~l~~l~~--------------------------------~~V 52 (180) -.|.-..--.|+-+.--++....-+-+....+-+-+.+.|+.-.. ..+ T Consensus 1 v~~~~~lee~l~~i~kl~~~~~~~~~ki~kaGA~v~~e~L~~~tp~~~~~~~~~~~~~~HlaD~I~~s~~~~dg~~~g~~ 80 (139) T protein:vir:10 1 MDMDEALGQWLKQVSKAAELSISDQEKITKAGADVYAKKLAETTKEKHPNTKGDGGKYGHLSEDIRSAAGDIDGDHNGSS 80 (139) T ss_pred CCHHHHHHHHHHHHHHhhccCHHHHHHHHHHHHHHHHHHHHHhcccccCcCCCCCCCCcchhhcceecCcccccccceee Confidence 111111111111110000000000000111111112222222111 112 Q ss_pred EEEecccccCCCCCCCCCHHHHHHHHhcCCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhCCC-CHHH Q lcl|NC_019918. 53 EVGFFPEDRYGSENGNLPVAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRDGR-QVNT 124 (180) Q Consensus 53 ~VGi~~~~~~~~~~~G~~vA~iA~i~EfGt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g~~-~~~~ 124 (180) .|||.. -+.+|-+.||||.++||.||+..+.++ .+.++.+.+...++++|.... +-+. T Consensus 81 ~VG~~k------------~~~~A~f~n~GT~k~~~~hFie~t~~e--~~~evl~a~~~~~k~~l~~~~~~~~~ 139 (139) T protein:vir:10 81 TVGFHN------------KAHIARFLNDGTKYIRADHFVDNARDD--AKDAVFAAEAEKYQAMIAKANGGGDK 139 (139) T ss_pred eeCCCC------------CcceEeecccCccccCCCchHHHHHHH--HHHHHHHHHHHHHHHHHhhcCCCCCC Confidence 344421 145788999999999999999999976 455788888888888877632 2122 No 117 >protein:vir:99101 Length: 142 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1608 # MgeName: Qyrzula # Cross-refs: genbank:acc:YP_655705;genbank:gi:109521783;genbank:GeneID:4157823 Probab=91.13 E-value=0.00053 Score=38.54 Aligned_cols=82 Identities=16% Similarity=0.128 Sum_probs=36.3 Q ss_pred ecccceeeeh--HHHHHHHHHH-------------------HHHhhCCEEEEEecccccC---CCC------CCC-CCHH Q lcl|NC_019918. 24 ILASFSFKTD--RRRLTSLIKR-------------------VEALDGTTVEVGFFPEDRY---GSE------NGN-LPVA 72 (180) Q Consensus 24 M~~~v~~k~~--~~~l~~l~~~-------------------l~~l~~~~V~VGi~~~~~~---~~~------~~G-~~vA 72 (180) |+ .++++.+ ++.|..+.++ .+.+. -|.=|-+..+=. ..+ ..| .+++ T Consensus 1 m~-~~~~~~~gl~~~l~~~~~~~~~~~~~~i~~~a~~v~~~Ak~~a--Pv~tG~Lr~SI~~~~~~~~~~~~~~~~v~~~a 77 (142) T protein:vir:99 1 MV-QVSVRYEGFDYNPVGAAAQVGPILRRTHSSLTRQIANETRARV--PVLTGHLGRSVREDPQVMVTPFHVSGGVTAHA 77 (142) T ss_pred Cc-eeEEEeeecchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC--CccchhhhcceeeeeccccccceEEEEeccCc Confidence 22 3333332 2222222211 12221 122232221100 000 011 2457 Q ss_pred HHHHHHhcCCC-----------------------------CCCCCcchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019918. 73 QVAAYNEFGTT-----------------------------RNPTRPFMAPTFEEFTSQFHYARLMKSTFE 113 (180) Q Consensus 73 ~iA~i~EfGt~-----------------------------~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~ 113 (180) .+|.++||||. +.||||||+++++++..++ .+..++ T Consensus 78 ~YA~~ve~GT~ph~i~pk~~~al~f~~~g~~~~~k~v~hpG~~a~Pfl~~A~~~~~~~~-----~~~~~r 142 (142) T protein:vir:99 78 KYAAAVHEGTRPHVIRAKHAQALHFWWRGREVFVRQVNHPGTRARPYLRNAGEAVVRRD-----RRIRVR 142 (142) T ss_pred cccceeccCCccceeccccCceeeEecCCceeeeeeeecCCCCCCchhHHHHHHHHhhh-----hhhccC Confidence 89999999962 4669999999998754321 122222 No 118 >protein:vir:8669 Length: 142 # NCBI annotation: gp27 # Family: family:all:1084 # MgeID: mge:156 # MgeName: Rosebush # Cross-refs: genbank:acc:NP_817788;genbank:gi:29566220;genbank:GeneID:1259476 Probab=91.13 E-value=0.00053 Score=38.54 Aligned_cols=82 Identities=16% Similarity=0.128 Sum_probs=36.3 Q ss_pred ecccceeeeh--HHHHHHHHHH-------------------HHHhhCCEEEEEecccccC---CCC------CCC-CCHH Q lcl|NC_019918. 24 ILASFSFKTD--RRRLTSLIKR-------------------VEALDGTTVEVGFFPEDRY---GSE------NGN-LPVA 72 (180) Q Consensus 24 M~~~v~~k~~--~~~l~~l~~~-------------------l~~l~~~~V~VGi~~~~~~---~~~------~~G-~~vA 72 (180) |+ .++++.+ ++.|..+.++ .+.+. -|.=|-+..+=. ..+ ..| .+++ T Consensus 1 m~-~~~~~~~gl~~~l~~~~~~~~~~~~~~i~~~a~~v~~~Ak~~a--Pv~tG~Lr~SI~~~~~~~~~~~~~~~~v~~~a 77 (142) T protein:vir:86 1 MV-QVSVRYEGFDYNPVGAAAQVGPILRRTHSSLTRQIANETRARV--PVLTGHLGRSVREDPQVMVTPFHVSGGVTAHA 77 (142) T ss_pred Cc-eeEEEeeecchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC--CccchhhhcceeeeeccccccceEEEEeccCc Confidence 22 3333332 2222222211 12221 122232221100 000 011 2457 Q ss_pred HHHHHHhcCCC-----------------------------CCCCCcchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019918. 73 QVAAYNEFGTT-----------------------------RNPTRPFMAPTFEEFTSQFHYARLMKSTFE 113 (180) Q Consensus 73 ~iA~i~EfGt~-----------------------------~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~ 113 (180) .+|.++||||. +.||||||+++++++..++ .+..++ T Consensus 78 ~YA~~ve~GT~ph~i~pk~~~al~f~~~g~~~~~k~v~hpG~~a~Pfl~~A~~~~~~~~-----~~~~~r 142 (142) T protein:vir:86 78 KYAAAVHEGTRPHVIRAKHAQALHFWWRGREVFVRQVNHPGTRARPYLRNAGEAVVRRD-----RRIRVR 142 (142) T ss_pred cccceeccCCccceeccccCceeeEecCCceeeeeeeecCCCCCCchhHHHHHHHHhhh-----hhhccC Confidence 89999999962 4669999999998754321 122222 No 119 >protein:vir:106041 Length: 137 # NCBI annotation: gp23 # Family: family:all:1084 # MgeID: mge:1505 # MgeName: Cooper # Cross-refs: genbank:acc:YP_654920;genbank:gi:109392376;genbank:GeneID:4157069 Probab=91.13 E-value=0.00032 Score=39.77 Aligned_cols=85 Identities=13% Similarity=0.103 Sum_probs=37.2 Q ss_pred ecccceeeehHHHH------------HHHHHHHHHhhCCE--EEEEeccccc-C-CCCCC-------CCCHHHHHHHHhc Q lcl|NC_019918. 24 ILASFSFKTDRRRL------------TSLIKRVEALDGTT--VEVGFFPEDR-Y-GSENG-------NLPVAQVAAYNEF 80 (180) Q Consensus 24 M~~~v~~k~~~~~l------------~~l~~~l~~l~~~~--V~VGi~~~~~-~-~~~~~-------G~~vA~iA~i~Ef 80 (180) |.-+..+..+...+ +++...++...+.. |.-|-+..+= + ...++ -.+++.+|.++|| T Consensus 1 m~~s~~i~i~~~~l~~~v~~~~k~~l~~~a~~i~~~ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~ve~ 80 (137) T protein:vir:10 1 MPVTARIHINEPELERQTGAIFRGKHRSITRRIATQARADVPVRTGNLGRGIQEMPQTYRPFHVGGGVEDNVDYAAPVHE 80 (137) T ss_pred CCeeEEEeeCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhhcCceeeeeccccceEEEEEecCCCceeeeee Confidence 43344444433222 22222222222211 2223332220 0 00111 1235789999999 Q ss_pred CC-----------------------------CCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhCCCC Q lcl|NC_019918. 81 GT-----------------------------TRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRDGRQ 121 (180) Q Consensus 81 Gt-----------------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~ 121 (180) |+ +++||||||++++++...... -|+ ++ T Consensus 81 GT~ph~I~pk~~k~l~f~~~G~~v~~k~v~hpG~~a~Pfl~~A~~~~~~~~~-------ri~------~~ 137 (137) T protein:vir:10 81 GSRPHRITARHANALHFFWHGREVFRKSVWHPGVRPRPFLRNAARRVVAADP-------DIH------MT 137 (137) T ss_pred cCCCceeecccCceeeeeeCCceEEeeeeecCCCCCCchHHHHHHHHhhccc-------ccc------CC Confidence 96 135599999999976422111 111 11 No 120 >protein:vir:96121 Length: 137 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240082;genbank:gi:66395767;genbank:GeneID:5133101 Probab=89.78 E-value=0.00067 Score=37.97 Aligned_cols=64 Identities=14% Similarity=0.140 Sum_probs=34.5 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCcHHHHHhcCCCCCchhHHHHHh Q lcl|NC_019918. 91 MAPTFEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLE 170 (180) Q Consensus 91 lr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~~~pPnsp~Ti~~KG~~~PLIDTG~L~~ 170 (180) |-..+ .+.+++.+.++..-..+. ..++++|...+..+++.+|.. .| +|||.|++ T Consensus 1 Ma~~~---~G~~~l~~~l~~~~~~~~---~~~~~~l~~~a~~~~~~ak~~-------------------~p-vdTG~L~~ 54 (137) T protein:vir:96 1 MAKVK---YGNWDLVAELEDYRDEME---EWVKKGILKTTLAIYNTAVAL-------------------AP-VDLGFLKE 54 (137) T ss_pred CchhH---hhHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHh-------------------CC-cCccchhc Confidence 32211 123334444443333321 134455666666555555533 23 69999999 Q ss_pred hhhhheeccC Q lcl|NC_019918. 171 SVKFQIHRRQ 180 (180) Q Consensus 171 SIty~V~~k~ 180 (180) ||+++|.... T Consensus 55 Si~~~~~~~g 64 (137) T protein:vir:96 55 SIDFKVTDGG 64 (137) T ss_pred CceeEeecCc Confidence 9999887665 No 121 >protein:vir:94796 Length: 137 # NCBI annotation: ORF050 # Family: family:all:180 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240540;genbank:gi:66396237;genbank:GeneID:5133576 Probab=89.21 E-value=0.00084 Score=37.45 Aligned_cols=64 Identities=17% Similarity=0.174 Sum_probs=33.6 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCcHHHHHhcCCCCCchhHHHHHh Q lcl|NC_019918. 91 MAPTFEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLE 170 (180) Q Consensus 91 lr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~~~pPnsp~Ti~~KG~~~PLIDTG~L~~ 170 (180) |-.. ..+-+++.+.++..-..+. ....++|+..+..+++.+|.. .| +|||.|++ T Consensus 1 Ma~~---~~G~~~l~~~L~~~~~~~~---~~~~~al~~~a~~v~~~ak~~-------------------aP-vdTG~Lr~ 54 (137) T protein:vir:94 1 MAKV---KYGNWDLVKELENYERDIE---RWVKRGIAKTTVKIHNTIISL-------------------MP-VDTGYLRE 54 (137) T ss_pred Cchh---HHhHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHh-------------------CC-cCcchhhc Confidence 3221 1133334444444433331 133455555555555555432 23 58999999 Q ss_pred hhhhheeccC Q lcl|NC_019918. 171 SVKFQIHRRQ 180 (180) Q Consensus 171 SIty~V~~k~ 180 (180) ||++++.... T Consensus 55 SI~~~~~~~~ 64 (137) T protein:vir:94 55 SVTMDFKDGG 64 (137) T ss_pred CceeEeecCc Confidence 9999887665 No 122 >protein:vir:79034 Length: 141 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110729;genbank:gi:134287346;genbank:GeneID:4955208 Probab=88.96 E-value=0.003 Score=34.38 Aligned_cols=116 Identities=13% Similarity=0.147 Sum_probs=48.7 Q ss_pred CCCCcccccch-hhHHHHHhhhheecccceee-ehHHHHHHHHH----HHHHhhCCEEEEEec-----ccccCC-----C Q lcl|NC_019918. 1 MQDGRSFTTSA-TPVLKTLALGVIILASFSFK-TDRRRLTSLIK----RVEALDGTTVEVGFF-----PEDRYG-----S 64 (180) Q Consensus 1 ~~~~~~~~~~~-~~~~~~~~~~~~M~~~v~~k-~~~~~l~~l~~----~l~~l~~~~V~VGi~-----~~~~~~-----~ 64 (180) |..-.+|..+. .-..+-|. .+... .+. ..++-+.++.. .++... -|.-|-+ .+..+. . T Consensus 1 M~~~~~~d~~gl~~~~~~l~---~~~~~-~~~~~~~~~~~~~a~~l~~~vk~~t--PVdTG~Lr~sw~~~~~~~~~~~~~ 74 (141) T protein:vir:79 1 MARWGSVDFREFKRVCKKME---KLTKI-DLDKFCKDAARELAARLLGKVIRRT--PVDTGFLRQGWNGVAYARSLPVYK 74 (141) T ss_pred CCCCccCcHHHHHHHHHHHH---HHhHH-HHHHHHHHHHHHHHHHHHHHHHHhC--CCcchhhcccccccccccccceee Confidence 65544444442 22222221 01110 011 11111222222 222211 1222222 111100 0 Q ss_pred CCCC-----CCHHHHHHHHhcCCCCCCCCcchhhHHHH----HHHHHHHHHHHHHHHHHHHhCCCCH Q lcl|NC_019918. 65 ENGN-----LPVAQVAAYNEFGTTRNPTRPFMAPTFEE----FTSQFHYARLMKSTFENVIRDGRQV 122 (180) Q Consensus 65 ~~~G-----~~vA~iA~i~EfGt~~IP~RpFlr~~~~~----~~~~~~~~~~~~~~~~~~l~g~~~~ 122 (180) .+++ .+++.+|-+-|||+...|+|||....+.- ...+..+.+.+++.+..++.+-.++ T Consensus 75 ~g~~~~v~v~n~~~YA~~VE~Ghr~~~~~gfV~G~fml~~s~~~~~~~~~~~~~~~l~~~l~~~~~~ 141 (141) T protein:vir:79 75 QGNNYIIEVVNPTEYASYVNFGHRTKDGKGWVKGQHFLTISEMELQSQVDKIIEKKLLILLKGVFDA 141 (141) T ss_pred cCCeeEEEEecCCcchhhhhcceeecCCcceeCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 1122 24578999999999988888887665411 1122345555666666666554443 No 123 >protein:vir:96829 Length: 135 # NCBI annotation: ORF033 # Family: family:all:180 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240161;genbank:gi:66395838;genbank:GeneID:5133170 Probab=88.62 E-value=0.002 Score=35.38 Aligned_cols=64 Identities=13% Similarity=0.072 Sum_probs=31.9 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCcHHHHHhcCCCCCchhHHHHHh Q lcl|NC_019918. 91 MAPTFEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLE 170 (180) Q Consensus 91 lr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~~~pPnsp~Ti~~KG~~~PLIDTG~L~~ 170 (180) |-.. .+ +-+++.+.+++.-.++- .-++++|...+..+++.++.. .| +|||.|++ T Consensus 1 Ma~~--~~-Gl~~l~~~l~~~~~~~~---~~~~~al~~~a~~v~~~ak~~-------------------ap-vdTG~Lr~ 54 (135) T protein:vir:96 1 MAKV--KY-GADSIVVDLEKYSKDME---KWVKKGITKTTLKIYNTAIHL-------------------MP-VDTGFLRQ 54 (135) T ss_pred Cchh--hh-hHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHh-------------------CC-ccchhhhc Confidence 2111 01 22223333333333221 123445555555555444332 13 79999999 Q ss_pred hhhhheeccC Q lcl|NC_019918. 171 SVKFQIHRRQ 180 (180) Q Consensus 171 SIty~V~~k~ 180 (180) ||+++|.... T Consensus 55 SI~~~~~~~g 64 (135) T protein:vir:96 55 STTVDFENGG 64 (135) T ss_pred ceeEEeecCc Confidence 9999887665 No 124 >protein:vir:1243 Length: 116 # NCBI annotation: similar to phage Spp1 gp16.1 # Family: family:all:180 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510942;genbank:gi:17426276;genbank:GeneID:927389 Probab=88.60 E-value=0.0014 Score=36.19 Aligned_cols=43 Identities=19% Similarity=0.216 Sum_probs=24.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCCCcHHHHHhcCCCCCchhHHHHHhhhhhheeccC Q lcl|NC_019918. 122 VNTLLKKLGRMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLESVKFQIHRRQ 180 (180) Q Consensus 122 ~~~~L~~iG~~~~~~Iq~~I~~~~pPnsp~Ti~~KG~~~PLIDTG~L~~SIty~V~~k~ 180 (180) +++++.+.-..+...|+..+.... | +|||.|++||++++.... T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~a---------------P-v~TG~Lr~SI~~~~~~~~ 43 (116) T protein:vir:12 1 MERWVKRGIAKTTAKIHNTIISLM---------------P-VDTGYLRESVTMDFKDGG 43 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHHhC---------------C-cCcccccccceEEeecCc Confidence 333333333333333444443321 2 699999999999987765 No 125 >protein:vir:97327 Length: 116 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240615;genbank:gi:66396305;genbank:GeneID:5133683 Probab=88.60 E-value=0.0014 Score=36.19 Aligned_cols=43 Identities=19% Similarity=0.216 Sum_probs=24.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCCCcHHHHHhcCCCCCchhHHHHHhhhhhheeccC Q lcl|NC_019918. 122 VNTLLKKLGRMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLESVKFQIHRRQ 180 (180) Q Consensus 122 ~~~~L~~iG~~~~~~Iq~~I~~~~pPnsp~Ti~~KG~~~PLIDTG~L~~SIty~V~~k~ 180 (180) +++++.+.-..+...|+..+.... | +|||.|++||++++.... T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~a---------------P-v~TG~Lr~SI~~~~~~~~ 43 (116) T protein:vir:97 1 MERWVKRGIAKTTAKIHNTIISLM---------------P-VDTGYLRESVTMDFKDGG 43 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHHhC---------------C-cCcccccccceEEeecCc Confidence 333333333333333444443321 2 699999999999987765 No 126 >protein:vir:79179 Length: 155 # NCBI annotation: gp39, phage virion morphogenesis protein # Family: family:all:370 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111070;genbank:gi:134288746;genbank:GeneID:4960698 Probab=88.07 E-value=0.0058 Score=32.84 Aligned_cols=82 Identities=12% Similarity=0.193 Sum_probs=54.2 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhc-----C--CCCCcHHHHHhc-----C- Q lcl|NC_019918. 91 MAPTFEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDD-----Y--PGSNSPAWAAYK-----G- 157 (180) Q Consensus 91 lr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~-----~--~pPnsp~Ti~~K-----G- 157 (180) |-..+. ++.+.+...+..+ . ..+....|..||..+....+..|.. | |+|+++.|..++ | T Consensus 1 m~~~~~------~l~~~l~~ll~~l-~-~~~~~~l~r~Ig~~l~~~t~~Rf~~q~~PDG~~W~prk~~~~~~~~~~~~g~ 72 (155) T protein:vir:79 1 MTDDLQ------ALERWAGGLLAKL-S-PAARRQLLRELGRDLRRAQQSRVAAQRNPDGSAYEPRKVKAGGKRLREKAGR 72 (155) T ss_pred CchHHH------HHHHHHHHHHHhc-C-ChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchhhhhhhhhcccCc Confidence 332221 2333333333332 1 1244678999999999999999964 3 348898886543 3 Q ss_pred -CCCCchhHHHHHhhhhhheeccC Q lcl|NC_019918. 158 -FNDPLFHTGKMLESVKFQIHRRQ 180 (180) Q Consensus 158 -~~~PLIDTG~L~~SIty~V~~k~ 180 (180) ...+|.+.+.+.++|+|++.... T Consensus 73 ~~~~~m~~~l~~a~~l~~~~~~d~ 96 (155) T protein:vir:79 73 VKREAMFRKLRTARYLRIDVDSTG 96 (155) T ss_pred ccchhhhhhhhhhheeeeeecCcE Confidence 24678999999999999998777 No 127 >protein:vir:107545 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1481 # MgeName: PG1 # Cross-refs: genbank:acc:NP_943803;genbank:gi:38638428;genbank:GeneID:2657225 Probab=86.18 E-value=0.0039 Score=33.79 Aligned_cols=88 Identities=17% Similarity=0.117 Sum_probs=40.9 Q ss_pred hheecccceeeehHHHHHHHH------------HHHHHhhCC--EEEEEecccccCC-CCCCC--------CCHHHHHHH Q lcl|NC_019918. 21 GVIILASFSFKTDRRRLTSLI------------KRVEALDGT--TVEVGFFPEDRYG-SENGN--------LPVAQVAAY 77 (180) Q Consensus 21 ~~~M~~~v~~k~~~~~l~~l~------------~~l~~l~~~--~V~VGi~~~~~~~-~~~~G--------~~vA~iA~i 77 (180) =+.+...+++..+.+.+++.. ..++...+. -|.-|-+...=.. ..++| .+++.+|.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~ak~~aPvdtG~Lr~SI~~~~~~~~~~~~~~~v~~~a~YA~~ 80 (140) T protein:vir:10 1 MATIRARARIEIDEAALERESGEHLRAFHRSLTRRIANQSRVAVPVRTGNLGRTIGELPQVYTPFRVRGGVEATADYAAP 80 (140) T ss_pred CeeeeeeeeeeeCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccchhhhccceeeeeeCCCceEEEEecCCccchhh Confidence 234445566666654433321 111111111 1333333222110 01111 245889999 Q ss_pred HhcCC-----------------------------CCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhCCCC Q lcl|NC_019918. 78 NEFGT-----------------------------TRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRDGRQ 121 (180) Q Consensus 78 ~EfGt-----------------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~ 121 (180) +|||| ++.+|||||++++++..... .-|+. + T Consensus 81 Ve~GT~ph~I~pk~~k~L~~~~~G~~~~~k~V~hpG~~a~Pfl~~A~~~~~~~~-------~~i~~------~ 140 (140) T protein:vir:10 81 VHEGSRPHAIRARNAQYLHFWWHGREMFRKSVWHPGTRARPFMRNSAQRVVTND-------PRVRM------T 140 (140) T ss_pred hccCCCCceeecCCCccceeecCCCEEEeeeeecCCCCCChhHHHHHHHHhhhh-------hhccC------C Confidence 99996 24669999999997642211 11111 1 No 128 >protein:vir:97982 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1482 # MgeName: Orion # Cross-refs: genbank:acc:YP_655121;genbank:gi:109391871;genbank:GeneID:4157345 Probab=86.18 E-value=0.0039 Score=33.79 Aligned_cols=88 Identities=17% Similarity=0.117 Sum_probs=40.9 Q ss_pred hheecccceeeehHHHHHHHH------------HHHHHhhCC--EEEEEecccccCC-CCCCC--------CCHHHHHHH Q lcl|NC_019918. 21 GVIILASFSFKTDRRRLTSLI------------KRVEALDGT--TVEVGFFPEDRYG-SENGN--------LPVAQVAAY 77 (180) Q Consensus 21 ~~~M~~~v~~k~~~~~l~~l~------------~~l~~l~~~--~V~VGi~~~~~~~-~~~~G--------~~vA~iA~i 77 (180) =+.+...+++..+.+.+++.. ..++...+. -|.-|-+...=.. ..++| .+++.+|.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~ak~~aPvdtG~Lr~SI~~~~~~~~~~~~~~~v~~~a~YA~~ 80 (140) T protein:vir:97 1 MATIRARARIEIDEAALERESGEHLRAFHRSLTRRIANQSRVAVPVRTGNLGRTIGELPQVYTPFRVRGGVEATADYAAP 80 (140) T ss_pred CeeeeeeeeeeeCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccchhhhccceeeeeeCCCceEEEEecCCccchhh Confidence 234445566666654433321 111111111 1333333222110 01111 245889999 Q ss_pred HhcCC-----------------------------CCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhCCCC Q lcl|NC_019918. 78 NEFGT-----------------------------TRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRDGRQ 121 (180) Q Consensus 78 ~EfGt-----------------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~ 121 (180) +|||| ++.+|||||++++++..... .-|+. + T Consensus 81 Ve~GT~ph~I~pk~~k~L~~~~~G~~~~~k~V~hpG~~a~Pfl~~A~~~~~~~~-------~~i~~------~ 140 (140) T protein:vir:97 81 VHEGSRPHAIRARNAQYLHFWWHGREMFRKSVWHPGTRARPFMRNSAQRVVTND-------PRVRM------T 140 (140) T ss_pred hccCCCCceeecCCCccceeecCCCEEEeeeeecCCCCCChhHHHHHHHHhhhh-------hhccC------C Confidence 99996 24669999999997642211 11111 1 No 129 >protein:vir:100223 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025034;genbank:gi:48697267;genbank:GeneID:2948321 Probab=85.71 E-value=0.0037 Score=33.90 Aligned_cols=119 Identities=12% Similarity=0.090 Sum_probs=50.5 Q ss_pred cccccchhhHHHHHhhhheecccceeeehHHHHHHHHHHHHHhhCCE-----------------EEEE--ecccccCCCC Q lcl|NC_019918. 5 RSFTTSATPVLKTLALGVIILASFSFKTDRRRLTSLIKRVEALDGTT-----------------VEVG--FFPEDRYGSE 65 (180) Q Consensus 5 ~~~~~~~~~~~~~~~~~~~M~~~v~~k~~~~~l~~l~~~l~~l~~~~-----------------V~VG--i~~~~~~~~~ 65 (180) -+|.-..--.|+-+.=-++.....+-+....+-+-+.+.|+.-.... |.++ -.++..++.. T Consensus 1 ~~~~~~l~e~l~~lekl~~~~~~~~~k~tkaGA~v~~~~L~~~tp~~~~~~~~~~~~~~HlaD~I~~~~~~idg~~~g~~ 80 (139) T protein:vir:10 1 MDMDEALGQWLKQVSKAAQLSVSDQEKITKAGADVYAKELAETTKEKHPNTKGDGGKYGHLSEDISSAAGDIDGDHNGSS 80 (139) T ss_pred CCHHHHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHhcccccccCCCCCCCCCcccccceecCccccccccccc Confidence 11111111111111111111111011111222222333333322210 0000 0000000000 Q ss_pred CCCCC-HHHHHHHHhcCCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHH Q lcl|NC_019918. 66 NGNLP-VAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRDGRQVNTL 125 (180) Q Consensus 66 ~~G~~-vA~iA~i~EfGt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~ 125 (180) .-|.. -+.+|-+-|+||.++||.+|+..+.++ .+.++.+.+...++++|.....-+.- T Consensus 81 ~VG~~~~~~~Ahf~n~GT~~~~~~hFie~t~~e--~~~ev~~a~~~~~ke~l~~~~~~~~~ 139 (139) T protein:vir:10 81 TVGFHNKAHIARFLNDGTKNIRADHFVDNARDD--AKDAVFAAEAEKYQAMIAKANGGDSK 139 (139) T ss_pred eeCCCCCceeeeeeccCccccCCCchHHHHHHH--HHHHHHHHHHHHHHHHHhhcCCCCCC Confidence 01111 145788999999999999999999976 45578888888888888763211111 No 130 >protein:vir:101594 Length: 173 # NCBI annotation: hypothetical protein # Family: family:all:26502 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112510;genbank:gi:53793610;interpro:IPR010064;uniprot:Q5ZGE3;genbank:GeneID:3101702 Probab=85.34 E-value=0.011 Score=31.32 Aligned_cols=62 Identities=8% Similarity=0.086 Sum_probs=34.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCcHHHHHhcCCCCCchhHHHHHhhhhhh Q lcl|NC_019918. 96 EEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLESVKFQ 175 (180) Q Consensus 96 ~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~~~pPnsp~Ti~~KG~~~PLIDTG~L~~SIty~ 175 (180) -+.++-+++.+.+++.-+.+ +.++...-..+...|+..+... +| +|||.|++||.+. T Consensus 1 i~i~Gld~L~~~L~~l~~~~-------~~~~~~a~~~~a~~i~~~ak~~----aP------------v~TG~Lr~sI~~~ 57 (173) T protein:vir:10 1 MAVKGVAEVIAELRKIGKDI-------DKNINATTEEAANFIEDRAKTL----AP------------KNFGKLAQSISTS 57 (173) T ss_pred CcchhHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHh----CC------------cCchhhhhcceee Confidence 23445555555555543322 2334444444444444444332 22 7999999999998 Q ss_pred eeccC Q lcl|NC_019918. 176 IHRRQ 180 (180) Q Consensus 176 V~~k~ 180 (180) +.+++ T Consensus 58 ~~~~~ 62 (173) T protein:vir:10 58 DLKAK 62 (173) T ss_pred eeccC Confidence 77766 No 131 >protein:vir:106570 Length: 182 # NCBI annotation: putative protein # Family: family:all:6475 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958588;genbank:gi:41179258;genbank:GeneID:2717106 Probab=85.01 E-value=0.0041 Score=33.68 Aligned_cols=69 Identities=7% Similarity=0.056 Sum_probs=33.7 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCcHHHHHhcCCCCCchhHHHHH Q lcl|NC_019918. 90 FMAPTFEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKML 169 (180) Q Consensus 90 Flr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~~~pPnsp~Ti~~KG~~~PLIDTG~L~ 169 (180) -++-. ..+-+.+.+.++..-..+. ..++.++..+...++..|+...... -| +|||.|+ T Consensus 1 m~~v~---i~Gld~L~~kl~~~~~~~~---~~v~~a~~~~~~~~a~~v~~~ak~~---------------~P-vdtG~Lr 58 (182) T protein:vir:10 1 MIEVE---LKGVNELRAKLKKLPDIMA---KATANAQENAIEQAEAYAVDELQSS---------------IK-YSTGELT 58 (182) T ss_pred CeEEE---EecHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHhh---------------CC-CCchhhh Confidence 22211 2222233333333222211 1123444445455555554444321 34 7999999 Q ss_pred hhhhhheeccC Q lcl|NC_019918. 170 ESVKFQIHRRQ 180 (180) Q Consensus 170 ~SIty~V~~k~ 180 (180) +||+++|..+. T Consensus 59 ~SI~~~~~~~~ 69 (182) T protein:vir:10 59 RSFKHEVKVDG 69 (182) T ss_pred hceeeeeeecC Confidence 99999988766 No 132 >protein:vir:95062 Length: 116 # NCBI annotation: ORF044 # Family: family:all:180 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240827;genbank:gi:66394711;genbank:GeneID:5133856 Probab=84.85 E-value=0.0035 Score=34.02 Aligned_cols=43 Identities=19% Similarity=0.211 Sum_probs=24.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCCCcHHHHHhcCCCCCchhHHHHHhhhhhheeccC Q lcl|NC_019918. 122 VNTLLKKLGRMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLESVKFQIHRRQ 180 (180) Q Consensus 122 ~~~~L~~iG~~~~~~Iq~~I~~~~pPnsp~Ti~~KG~~~PLIDTG~L~~SIty~V~~k~ 180 (180) +++++.+.-..+...|+..+... .| +|||.|++||++++.... T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~---------------ap-v~TG~Lr~SI~~~~~~~~ 43 (116) T protein:vir:95 1 MERWVKRGIAKTTAKIHNTIISL---------------MP-VDTGYLRESVTMDFKDGG 43 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHhh---------------CC-ccccccccceeEEeecCc Confidence 23333333333333344333321 34 699999999999987766 No 133 >protein:vir:3750 Length: 227 # NCBI annotation: hypothetical protein # Family: family:all:743 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043491;genbank:gi:9628626;genbank:GeneID:1261131 Probab=84.27 E-value=0.019 Score=30.01 Aligned_cols=85 Identities=11% Similarity=0.147 Sum_probs=38.2 Q ss_pred CCCCcccccchhhHHHHHhhhheecccceeeehHHHHHHHHHHHH-HhhCCEEEEEecccccCCCCCCCCCHHHHHHHHh Q lcl|NC_019918. 1 MQDGRSFTTSATPVLKTLALGVIILASFSFKTDRRRLTSLIKRVE-ALDGTTVEVGFFPEDRYGSENGNLPVAQVAAYNE 79 (180) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~M~~~v~~k~~~~~l~~l~~~l~-~l~~~~V~VGi~~~~~~~~~~~G~~vA~iA~i~E 79 (180) -.||.+|.---- ...+.|.+|.+-|+ ........|||..+. ++.||++|+ T Consensus 53 ~PDGs~~~pRKr-------------------~k~KM~~kL~k~l~~~~~~~~a~v~f~~g~----------~~~IA~vHq 103 (227) T protein:vir:37 53 NPMGGSWKKRKN-------------------GTAKMLRRIAKLANSKAEKAQGTLFYKQKR----------TGEIAQEHQ 103 (227) T ss_pred CCCCCCCchhcc-------------------hhHHHHhhhHHHcceeecccceEEEecCcc----------hHHHHHHhh Confidence 668888854321 11122223222111 123445667776332 468999999 Q ss_pred cCC----------------------------------------------------------------------------- Q lcl|NC_019918. 80 FGT----------------------------------------------------------------------------- 82 (180) Q Consensus 80 fGt----------------------------------------------------------------------------- 82 (180) ||- T Consensus 104 ~G~~~~v~~~~~~~~~~~~~~~~paTr~QAk~Lr~lGy~v~~~k~k~~k~~~rkps~kwI~~nls~~qAgliIR~L~~k~ 183 (227) T protein:vir:37 104 EGIPHLFKKTEFTGKNKGGIGADPCTLRQAKKLKDLGYTVANGKTKNGKAKRRKPTLSEIRSTLSRAKASLIIRKLEEKN 183 (227) T ss_pred cCcccccchhhhhhhhcCCccccCCCHHHHHHHHHhcccccCCCCCCcCCccccCCHHHHHHhhhHHHHHHHHHHHhccc Confidence 991 Q ss_pred ------------CCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_019918. 83 ------------TRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRDGR 120 (180) Q Consensus 83 ------------~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g~~ 120 (180) +.+|+||||-..- +++.+.+...+.++-.-.. T Consensus 184 ~~~~~~~k~~W~I~~PaR~FLG~~~------~e~~~~l~r~l~~~~~~~~ 227 (227) T protein:vir:37 184 GMNPSRHLTQWIIPTEKRSFLDTRE------EENAKIILAEIQKYTQKQQ 227 (227) T ss_pred ccccccCccceeeecCcccccCCCH------HHHHHHHHHHHHHHhhhcC Confidence 1245555553221 1233333333333322211 No 134 >protein:vir:95894 Length: 137 # NCBI annotation: ORF046 # Family: family:all:180 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240389;genbank:gi:66396083;genbank:GeneID:5133405 Probab=84.14 E-value=0.0028 Score=34.57 Aligned_cols=64 Identities=17% Similarity=0.184 Sum_probs=32.3 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCcHHHHHhcCCCCCchhHHHHHh Q lcl|NC_019918. 91 MAPTFEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLE 170 (180) Q Consensus 91 lr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~~~pPnsp~Ti~~KG~~~PLIDTG~L~~ 170 (180) |-..+ .+.+++.+.+++.-..+ ...+.++++..+..+++.+|.. .| +|||.|++ T Consensus 1 Ma~~~---~G~~~l~~~l~~~~~~~---~~~~~~~~~~~a~~v~~~ak~~-------------------aP-v~TG~L~~ 54 (137) T protein:vir:95 1 MAKVK---YGNWDLVKELENYERDM---ERWVKRGIAKTTAKIHNTIISL-------------------MP-VDTGYLRE 54 (137) T ss_pred CchhH---HhHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHh-------------------CC-ccchhhhc Confidence 43322 23333333333332222 1123444555455444444432 23 59999999 Q ss_pred hhhhheeccC Q lcl|NC_019918. 171 SVKFQIHRRQ 180 (180) Q Consensus 171 SIty~V~~k~ 180 (180) ||++++.... T Consensus 55 Si~~~~~~~~ 64 (137) T protein:vir:95 55 SVTMDFKDGG 64 (137) T ss_pred CeeeEeeCCc Confidence 9999887665 No 135 >protein:vir:9930 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795692;genbank:gi:28876456;genbank:GeneID:1257995 Probab=82.20 E-value=0.0045 Score=33.43 Aligned_cols=60 Identities=17% Similarity=0.216 Sum_probs=29.7 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCcHHHHHhcCCCCCchhHHHHHhh Q lcl|NC_019918. 92 APTFEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLES 171 (180) Q Consensus 92 r~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~~~pPnsp~Ti~~KG~~~PLIDTG~L~~S 171 (180) -.++++ +.+.+++....+ ...++.+|...+..++..+|. + .| +|||.|++| T Consensus 1 i~Gld~------l~~~l~~~~~~~---~~~v~~al~~~a~~i~~~ak~--------~-----------aP-v~TG~Lr~s 51 (108) T protein:vir:99 1 MRGLDR------FLRSVERKQKSV---RIAVDKELSKSAARIERQAKI--------L-----------AP-VDTGWLRAQ 51 (108) T ss_pred CchHHH------HHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHh--------c-----------CC-cCchhhhcc Confidence 234433 333333332222 112345555555554444433 1 22 799999999 Q ss_pred hhhheeccC Q lcl|NC_019918. 172 VKFQIHRRQ 180 (180) Q Consensus 172 Ity~V~~k~ 180 (180) |++.+.+.- T Consensus 52 I~~~~~~~~ 60 (108) T protein:vir:99 52 IYSEQQRLL 60 (108) T ss_pred eeeeecCcE Confidence 987765432 No 136 >protein:vir:966 Length: 123 # NCBI annotation: Orf48 # Family: family:all:970 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076620;genbank:gi:13095728;genbank:GeneID:920248 Probab=81.52 E-value=0.0042 Score=33.59 Aligned_cols=87 Identities=14% Similarity=0.220 Sum_probs=36.5 Q ss_pred ecccceeeehHH----HHHHH-------------------HHHHHHhhCCEEEEEecccc-cCCCCCCC-------CCHH Q lcl|NC_019918. 24 ILASFSFKTDRR----RLTSL-------------------IKRVEALDGTTVEVGFFPED-RYGSENGN-------LPVA 72 (180) Q Consensus 24 M~~~v~~k~~~~----~l~~l-------------------~~~l~~l~~~~V~VGi~~~~-~~~~~~~G-------~~vA 72 (180) |...+++-.-.+ .|.++ .+.|+.... +.-|=.+.. .--...+| .+-- T Consensus 1 m~~~v~id~L~~~i~~~L~~y~~~v~~~v~~~v~~~a~~~~~~lk~~sP--~~TG~yaksW~~k~~~~~~~~v~~~~~~y 78 (123) T protein:vir:96 1 MANKISIDDLAKTIESEVRNWTKDVVDDIDDIKKDITKNGVKQLRESSP--KRTGDYAKNWTSQKLKNGDQVIYQKAPTY 78 (123) T ss_pred CCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCC--ccccccccceeeeecCCeeEEEEEecCCc Confidence 555554432211 11111 112222110 111100000 00000011 1112 Q ss_pred HHHHHHhcCC-----CCCCCCcchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019918. 73 QVAAYNEFGT-----TRNPTRPFMAPTFEEFTSQFHYARLMKSTFEN 114 (180) Q Consensus 73 ~iA~i~EfGt-----~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~ 114 (180) .|+-+.|||. ...|+||||+|+.+.. .+.+.+.+++.+.+ T Consensus 79 ~l~HLLE~GHa~r~GGrV~a~phI~paee~~--~~~l~~~i~r~l~~ 123 (123) T protein:vir:96 79 RLTHLLENGHAKRNGGRVSPKVHIAPVEEEL--VSNYISRVEKRLSQ 123 (123) T ss_pred ceEEeeecceeecCCceeCcchhhhHHHHHH--HHHHHHHHHHHhcC Confidence 4778889994 2699999999998653 22444445544444 No 137 >protein:vir:100312 Length: 152 # NCBI annotation: tail synthesis protein S # Family: family:all:370 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655481;genbank:gi:109289949;genbank:GeneID:4157355 Probab=81.35 E-value=0.03 Score=28.91 Aligned_cols=80 Identities=13% Similarity=0.183 Sum_probs=45.8 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHhCC--CCHHHHHHHHHHHHHHHHHHHHhc-----C--CCCCcHHHHHhcCCCCC Q lcl|NC_019918. 91 MAPTFEEFTSQFHYARLMKSTFENVIRDG--RQVNTLLKKLGRMVAEQMQVNIDD-----Y--PGSNSPAWAAYKGFNDP 161 (180) Q Consensus 91 lr~~~~~~~~~~~~~~~~~~~~~~~l~g~--~~~~~~L~~iG~~~~~~Iq~~I~~-----~--~pPnsp~Ti~~KG~~~P 161 (180) |...+. .+.+.+..++..- .+...+|..||..+....++.|.+ | |+|+++.+..+|+..+- T Consensus 1 M~~~~~----------~~~~~L~~ll~~L~~~~r~~l~~~Ig~~l~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~k~~~~~ 70 (152) T protein:vir:10 1 MSEPIE----------QVKTAFDSLLNNISKPRRRLMYQQIGRELARSQRRRIKAQQNPDGSAYEPRKKPKKGVKSKIKS 70 (152) T ss_pred CchHHH----------HHHHHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchhhhhhhhhcccccc Confidence 433332 1233333333321 244678999999999999999974 3 23788877766654433 Q ss_pred chhHHHHHhh--hhhheeccC Q lcl|NC_019918. 162 LFHTGKMLES--VKFQIHRRQ 180 (180) Q Consensus 162 LIDTG~L~~S--Ity~V~~k~ 180 (180) ......|+.| ++|+..... T Consensus 71 ~~m~~~L~~a~~l~~~a~~~~ 91 (152) T protein:vir:10 71 GKMFDKITQPRFMRLRLESEG 91 (152) T ss_pred hhHHHhhhhcceeeeeecCcE Confidence 3333344443 455554444 No 138 >protein:vir:1164 Length: 156 # NCBI annotation: predicted tail completion # Family: family:all:370 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490613;genbank:gi:17313233;genbank:GeneID:927308 Probab=81.35 E-value=0.033 Score=28.72 Aligned_cols=82 Identities=11% Similarity=0.032 Sum_probs=48.1 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhc-----C--CCCCcHHHHHhcCC----C Q lcl|NC_019918. 91 MAPTFEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDD-----Y--PGSNSPAWAAYKGF----N 159 (180) Q Consensus 91 lr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~-----~--~pPnsp~Ti~~KG~----~ 159 (180) |..-+. ++.+.+...+.++ . ..+....|..||..+....+..|.. | |+|+++.|++.|.. . T Consensus 1 m~~~~~------~l~~~L~~ll~~L-~-~~~~~~l~r~Ig~~l~~~t~~Rf~~q~~PdG~~W~p~~~~~~~~~~~~~~~~ 72 (156) T protein:vir:11 1 MADSLE------ALEDWAGPILRAL-E-PGPRAALARSLARDLRRSQQKRVMAQRNPDGSAYEPRKKRELRGKQGRIRRK 72 (156) T ss_pred CchhHH------HHHHHHHHHHHhc-C-CcchHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHhhhccccccc Confidence 433221 2333344444332 1 1245679999999999999999964 4 24899999987632 2 Q ss_pred CCchhHHHHHhhhhhheeccC Q lcl|NC_019918. 160 DPLFHTGKMLESVKFQIHRRQ 180 (180) Q Consensus 160 ~PLIDTG~L~~SIty~V~~k~ 180 (180) .+|.....+..+|.+.+.... T Consensus 73 ~~m~~~l~~~~~l~~~~~~~~ 93 (156) T protein:vir:11 73 IKMFQKLRTVRYLRAKGDAQA 93 (156) T ss_pred hhhhhhhhhhheeeeeecCcE Confidence 334333333444666665544 No 139 >protein:vir:98860 Length: 230 # NCBI annotation: hypothetical protein # Family: family:all:743 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654736;genbank:gi:109302921;genbank:GeneID:4156065 Probab=81.33 E-value=0.031 Score=28.85 Aligned_cols=85 Identities=12% Similarity=0.094 Sum_probs=39.3 Q ss_pred CCCCcccccchhhHHHHHhhhheecccceeeehHHHHH---HHHHHHHHhhCCEEEEEecccccCCCCCCCCCHHHHHHH Q lcl|NC_019918. 1 MQDGRSFTTSATPVLKTLALGVIILASFSFKTDRRRLT---SLIKRVEALDGTTVEVGFFPEDRYGSENGNLPVAQVAAY 77 (180) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~M~~~v~~k~~~~~l~---~l~~~l~~l~~~~V~VGi~~~~~~~~~~~G~~vA~iA~i 77 (180) -.||.+|.---- ...+.|. +++.-.+.-++....++|+.+. .+.||++ T Consensus 55 ~PDGs~w~pRKr-------------------~k~KMl~~L~k~l~~~~~~~~~~~v~~~~~~~----------~~rIA~v 105 (230) T protein:vir:98 55 TPTGSGWKPRKN-------------------GNAKMLRRIAKTLKFTSADREIKRVCTISRNA----------QRRSQKE 105 (230) T ss_pred CCCCCCChhhhh-------------------hhHHHHhhhHHHHHHhhcccccceeeeecccc----------hhhhhhh Confidence 668888854321 1122333 3333333223344445565332 2569999 Q ss_pred HhcCC--------------------------------------------------------------------------- Q lcl|NC_019918. 78 NEFGT--------------------------------------------------------------------------- 82 (180) Q Consensus 78 ~EfGt--------------------------------------------------------------------------- 82 (180) |.||- T Consensus 106 Hq~G~~~~~~~~~~~~r~~~~~~~~paTr~QAk~Lr~lGy~v~~g~~~~~~k~~kkps~kwI~~nls~~qAgliIR~L~~ 185 (230) T protein:vir:98 106 HQRGAKITNLKSVILRKSRAGTAKDPATMRQAKKLRDLGYTVPNGTTKSGKKRYRRPSAREIVATLSRAKASLLIRYFQE 185 (230) T ss_pred hhccchhhhhhhhhhhhhcCCCCcccccHHHHHHHHHcCCccCCCCCCcCCCCCCCCCHHHHHHhhhHHHHHHHHHHHhc Confidence 99991 Q ss_pred -------------CCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_019918. 83 -------------TRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRDGR 120 (180) Q Consensus 83 -------------~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g~~ 120 (180) +.+|+||||-..- +++.+.+...+.++---.. T Consensus 186 k~~k~~~~~t~W~I~~PaR~FLG~~~------~e~~~~l~~~l~~i~~~~~ 230 (230) T protein:vir:98 186 KEERQGKRLTKWIIPTEKRPFLDERD------KENAEILKEFILKFSGIEK 230 (230) T ss_pred cccccccCccceeeecCcccccCCCh------HHHHHHHHHHHHHhccccC Confidence 1367777764332 1233334444333211111 No 140 >protein:vir:5978 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690678;genbank:geneid:6329146;genbank:gi:22855072;interpro:IPR011693;uniprot:O48447;genbank:GeneID:955318 Probab=80.92 E-value=0.015 Score=30.61 Aligned_cols=69 Identities=13% Similarity=0.169 Sum_probs=32.9 Q ss_pred CCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCcHHHHHhcCCCCCchhH Q lcl|NC_019918. 86 PTRPFMAPTFEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHT 165 (180) Q Consensus 86 P~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~~~pPnsp~Ti~~KG~~~PLIDT 165 (180) =+|..++ ++- +..++..+.+++.-..+. ..++++|...+..+++.+|.. + | +|| T Consensus 1 m~~ms~~--i~~-~g~~~l~~~l~~~~~~~~---~~v~~~l~~~a~~i~~~ak~~--------a-----------p-v~T 54 (144) T protein:vir:59 1 MALMSVR--IDP-SWRRIMSRNVRTFSGHVL---TQVEQVIIKTAEKIAGLAASL--------A-----------P-VDE 54 (144) T ss_pred CCcceee--ehh-HHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHh--------C-----------C-ccc Confidence 3333332 211 112223333333222221 123455555555555544432 1 2 689 Q ss_pred HHHHhhhhhheeccC Q lcl|NC_019918. 166 GKMLESVKFQIHRRQ 180 (180) Q Consensus 166 G~L~~SIty~V~~k~ 180 (180) |.|++||++++.... T Consensus 55 G~Lr~SI~~~~~~~g 69 (144) T protein:vir:59 55 GNLKNSIQIDYKNNG 69 (144) T ss_pred hhhhcCeeEEeecCc Confidence 999999999886554 No 141 >protein:vir:105330 Length: 137 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950673;genbank:gi:119967843;genbank:GeneID:4643209 Probab=79.93 E-value=0.0045 Score=33.45 Aligned_cols=64 Identities=19% Similarity=0.282 Sum_probs=35.6 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCcHHHHHhcCCCCCchhHHHHHh Q lcl|NC_019918. 91 MAPTFEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLE 170 (180) Q Consensus 91 lr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~~~pPnsp~Ti~~KG~~~PLIDTG~L~~ 170 (180) |-... .+-+++.+.++..-.++- ..++.+|+..+..+++.+|... | +|||.|++ T Consensus 1 Ma~~~---~G~~~l~~~l~~~~~~~~---~~~~~al~~~a~~i~~~ak~~a-------------------P-v~TG~Lr~ 54 (137) T protein:vir:10 1 MAKVK---YGNWDLVKELEEFEKETI---RWAKKGIAKTTTIIHNSIVSNM-------------------P-VDTGYLRE 54 (137) T ss_pred Cccch---hCHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHhC-------------------C-cCcchhhc Confidence 32211 122233344444333321 1345667777766666666532 2 59999999 Q ss_pred hhhhheeccC Q lcl|NC_019918. 171 SVKFQIHRRQ 180 (180) Q Consensus 171 SIty~V~~k~ 180 (180) ||++++.... T Consensus 55 SI~~~~~~~~ 64 (137) T protein:vir:10 55 SVSMDFKKGG 64 (137) T ss_pred CeeeEecCCc Confidence 9999887665 No 142 >protein:vir:3848 Length: 159 # NCBI annotation: hypothetical protein # Family: family:all:1029 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050154;swissprot:trembl:q9t1f3;genbank:gi:9633046;uniprot:Q9T1F3;genbank:GeneID:1262148 Probab=79.57 E-value=0.018 Score=30.19 Aligned_cols=107 Identities=11% Similarity=0.098 Sum_probs=59.5 Q ss_pred CCCCcccccchhhHHHHHhhhheecccceeeehHHHHHHHHHHHHHhhCC------------------------------ Q lcl|NC_019918. 1 MQDGRSFTTSATPVLKTLALGVIILASFSFKTDRRRLTSLIKRVEALDGT------------------------------ 50 (180) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~M~~~v~~k~~~~~l~~l~~~l~~l~~~------------------------------ 50 (180) |-+ |-....-.|+-+.=..+....-.-+.+..+-+-+.+.|+.-... T Consensus 2 m~~---~~~~l~~~l~~v~k~~~~~~~~k~kiTkAGAkv~~e~L~~~Tp~~h~~~~k~~~~~~~~~k~~~~~~~~~HlaD 78 (159) T protein:vir:38 2 AND---MGEFYNNWVNEVEKGMKLSVEDKAKITGEGAEAFSTVLHDHTPRSNEIYRRGRSAGHANAKHHNRNRKTKHLQD 78 (159) T ss_pred cch---HHHHHHHHHHHHHHhcCCCHHHHHHHHHHhHHHHHHHHHHhcccCCCccccccccccccccccCcCcCCCcccc Confidence 222 22223333333332222222222233344444444455443222 Q ss_pred ----------------EEEEEecccccCCCCCCCCCHHHHHHHHhcCCCCCCCC-----cchhhHHHHHHHHHHHHHHHH Q lcl|NC_019918. 51 ----------------TVEVGFFPEDRYGSENGNLPVAQVAAYNEFGTTRNPTR-----PFMAPTFEEFTSQFHYARLMK 109 (180) Q Consensus 51 ----------------~V~VGi~~~~~~~~~~~G~~vA~iA~i~EfGt~~IP~R-----pFlr~~~~~~~~~~~~~~~~~ 109 (180) .+.|||. +-..+.||-+.+.||...|+. +|+..+..+ .+.++.+.+. T Consensus 79 ~I~~~~~~~iDg~~dG~s~VGw~----------~~~~a~~a~f~NdGT~~m~~k~~~gdHFvekt~~~--~k~~Vl~A~~ 146 (159) T protein:vir:38 79 SITYKPGYTADKLHTGDTDVGFE----------GKYYDFLAKIVNNGQHHMSPKRYKNMHFLDKAQQE--AKKSVAEAEL 146 (159) T ss_pred ceeeecCccccccccceeeeccc----------CCccceEeeecccCccccCCCCccCChhHHHHHHH--HHHHHHHHHH Confidence 1223332 112378999999999999997 799988865 4557888888 Q ss_pred HHHHHHHhCCCCH Q lcl|NC_019918. 110 STFENVIRDGRQV 122 (180) Q Consensus 110 ~~~~~~l~g~~~~ 122 (180) ..+++++...-+- T Consensus 147 ~~~~~il~~~~~~ 159 (159) T protein:vir:38 147 KAYKEVMNHDSDK 159 (159) T ss_pred HHHHHHhhcccCC Confidence 8899998886554 No 143 >protein:vir:105916 Length: 149 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004379;genbank:gi:122891834;genbank:GeneID:4712387 Probab=79.29 E-value=0.017 Score=30.22 Aligned_cols=76 Identities=13% Similarity=0.204 Sum_probs=32.6 Q ss_pred EEEEecccccCCCCCCCCCHHHHHHHHhcCCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHH Q lcl|NC_019918. 52 VEVGFFPEDRYGSENGNLPVAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGR 131 (180) Q Consensus 52 V~VGi~~~~~~~~~~~G~~vA~iA~i~EfGt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~ 131 (180) ++|-...-.+ -.+|. + +|| ++ ++.+.+++.-.++- .-++++|+..+. T Consensus 1 ~~~~~~~~~~-------~~Ma~---v-~~G-------------ld------~l~~~l~~~~~~~~---~~~~~~l~~~a~ 47 (149) T protein:vir:10 1 MKLNYYDLSR-------CHMAK---V-KYG-------------AD------SMVVELDKFDKKIE---EWVKKGIAKTTT 47 (149) T ss_pred Ceeeeeccch-------hhhHH---H-HHH-------------HH------HHHHHHHHHHHHHH---HHHHHHHHHHHH Confidence 1111110000 00111 1 222 22 22222333222221 123444555555 Q ss_pred HHHHHHHHHHhcCCCCCcHHHHHhcCCCCCchhHHHHHhhhhhheeccC Q lcl|NC_019918. 132 MVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLESVKFQIHRRQ 180 (180) Q Consensus 132 ~~~~~Iq~~I~~~~pPnsp~Ti~~KG~~~PLIDTG~L~~SIty~V~~k~ 180 (180) .+++.+|... | +|||.|++||++++.... T Consensus 48 ~v~~~ak~~a-------------------P-vdTG~L~~SI~~~~~~~g 76 (149) T protein:vir:10 48 KIYNTAVALA-------------------P-VDLGFLEESIDFKYFDGG 76 (149) T ss_pred HHHHHHHHhC-------------------C-cccchhhccceEEecCCc Confidence 5555444321 2 699999999999887665 No 144 >protein:vir:97144 Length: 115 # NCBI annotation: ORF047 # Family: family:all:180 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239729;genbank:gi:66394911;genbank:GeneID:5130877 Probab=79.05 E-value=0.0053 Score=33.06 Aligned_cols=68 Identities=9% Similarity=0.183 Sum_probs=32.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCcHHHHHhcCCCCCchhHHHHHhhhhhh Q lcl|NC_019918. 96 EEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLESVKFQ 175 (180) Q Consensus 96 ~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~~~pPnsp~Ti~~KG~~~PLIDTG~L~~SIty~ 175 (180) -+..+-+++.+.+++.-..+ ...+..++..-|..++..+|..-. ..++.| +|||.|++||++. T Consensus 1 i~~~Gld~l~~~l~~~~~~~---~~~v~~a~~~~~~~i~~~a~~~a~-------------~~~~~p-~~TG~Lr~sI~~~ 63 (115) T protein:vir:97 1 MNIDGLDALLNQFHDMKTNI---DDDVDDILQENAKEYVVRAKLKAR-------------EVMNKG-YWTGNLSRNIRYK 63 (115) T ss_pred CcchhHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHhcc-------------ccCCCC-CCchhhhhcceee Confidence 12233333333333332222 112345566556555555544321 122233 7999999999887 Q ss_pred eeccC Q lcl|NC_019918. 176 IHRRQ 180 (180) Q Consensus 176 V~~k~ 180 (180) ....- T Consensus 64 ~~g~~ 68 (115) T protein:vir:97 64 KTGDL 68 (115) T ss_pred ecCce Confidence 44322 No 145 >protein:vir:9312 Length: 115 # NCBI annotation: phi Mu50B-like protein # Family: family:all:180 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803290;genbank:gi:29028600;genbank:GeneID:1258048 Probab=79.05 E-value=0.0053 Score=33.06 Aligned_cols=68 Identities=9% Similarity=0.183 Sum_probs=32.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCcHHHHHhcCCCCCchhHHHHHhhhhhh Q lcl|NC_019918. 96 EEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLESVKFQ 175 (180) Q Consensus 96 ~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~~~pPnsp~Ti~~KG~~~PLIDTG~L~~SIty~ 175 (180) -+..+-+++.+.+++.-..+ ...+..++..-|..++..+|..-. ..++.| +|||.|++||++. T Consensus 1 i~~~Gld~l~~~l~~~~~~~---~~~v~~a~~~~~~~i~~~a~~~a~-------------~~~~~p-~~TG~Lr~sI~~~ 63 (115) T protein:vir:93 1 MNIDGLDALLNQFHDMKTNI---DDDVDDILQENAKEYVVRAKLKAR-------------EVMNKG-YWTGNLSRNIRYK 63 (115) T ss_pred CcchhHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHhcc-------------ccCCCC-CCchhhhhcceee Confidence 12233333333333332222 112345566556555555544321 122233 7999999999887 Q ss_pred eeccC Q lcl|NC_019918. 176 IHRRQ 180 (180) Q Consensus 176 V~~k~ 180 (180) ....- T Consensus 64 ~~g~~ 68 (115) T protein:vir:93 64 KTGDL 68 (115) T ss_pred ecCce Confidence 44322 No 146 >protein:vir:96225 Length: 115 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239574;genbank:gi:66395330;genbank:GeneID:5132773 Probab=79.05 E-value=0.0053 Score=33.06 Aligned_cols=68 Identities=9% Similarity=0.183 Sum_probs=32.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCcHHHHHhcCCCCCchhHHHHHhhhhhh Q lcl|NC_019918. 96 EEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLESVKFQ 175 (180) Q Consensus 96 ~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~~~pPnsp~Ti~~KG~~~PLIDTG~L~~SIty~ 175 (180) -+..+-+++.+.+++.-..+ ...+..++..-|..++..+|..-. ..++.| +|||.|++||++. T Consensus 1 i~~~Gld~l~~~l~~~~~~~---~~~v~~a~~~~~~~i~~~a~~~a~-------------~~~~~p-~~TG~Lr~sI~~~ 63 (115) T protein:vir:96 1 MNIDGLDALLNQFHDMKTNI---DDDVDDILQENAKEYVVRAKLKAR-------------EVMNKG-YWTGNLSRNIRYK 63 (115) T ss_pred CcchhHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHhcc-------------ccCCCC-CCchhhhhcceee Confidence 12233333333333332222 112345566556555555544321 122233 7999999999887 Q ss_pred eeccC Q lcl|NC_019918. 176 IHRRQ 180 (180) Q Consensus 176 V~~k~ 180 (180) ....- T Consensus 64 ~~g~~ 68 (115) T protein:vir:96 64 KTGDL 68 (115) T ss_pred ecCce Confidence 44322 No 147 >protein:vir:103917 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873996;genbank:gi:118430771;genbank:GeneID:4525409 Probab=79.05 E-value=0.0053 Score=33.06 Aligned_cols=68 Identities=9% Similarity=0.183 Sum_probs=32.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCcHHHHHhcCCCCCchhHHHHHhhhhhh Q lcl|NC_019918. 96 EEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLESVKFQ 175 (180) Q Consensus 96 ~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~~~pPnsp~Ti~~KG~~~PLIDTG~L~~SIty~ 175 (180) -+..+-+++.+.+++.-..+ ...+..++..-|..++..+|..-. ..++.| +|||.|++||++. T Consensus 1 i~~~Gld~l~~~l~~~~~~~---~~~v~~a~~~~~~~i~~~a~~~a~-------------~~~~~p-~~TG~Lr~sI~~~ 63 (115) T protein:vir:10 1 MNIDGLDALLNQFHDMKTNI---DDDVDDILQENAKEYVVRAKLKAR-------------EVMNKG-YWTGNLSRNIRYK 63 (115) T ss_pred CcchhHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHhcc-------------ccCCCC-CCchhhhhcceee Confidence 12233333333333332222 112345566556555555544321 122233 7999999999887 Q ss_pred eeccC Q lcl|NC_019918. 176 IHRRQ 180 (180) Q Consensus 176 V~~k~ 180 (180) ....- T Consensus 64 ~~g~~ 68 (115) T protein:vir:10 64 KTGDL 68 (115) T ss_pred ecCce Confidence 44322 No 148 >protein:vir:96358 Length: 115 # NCBI annotation: ORF045 # Family: family:all:180 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239651;genbank:gi:66395408;genbank:GeneID:5132834 Probab=79.05 E-value=0.0053 Score=33.06 Aligned_cols=68 Identities=9% Similarity=0.183 Sum_probs=32.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCcHHHHHhcCCCCCchhHHHHHhhhhhh Q lcl|NC_019918. 96 EEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLESVKFQ 175 (180) Q Consensus 96 ~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~~~pPnsp~Ti~~KG~~~PLIDTG~L~~SIty~ 175 (180) -+..+-+++.+.+++.-..+ ...+..++..-|..++..+|..-. ..++.| +|||.|++||++. T Consensus 1 i~~~Gld~l~~~l~~~~~~~---~~~v~~a~~~~~~~i~~~a~~~a~-------------~~~~~p-~~TG~Lr~sI~~~ 63 (115) T protein:vir:96 1 MNIDGLDALLNQFHDMKTNI---DDDVDDILQENAKEYVVRAKLKAR-------------EVMNKG-YWTGNLSRNIRYK 63 (115) T ss_pred CcchhHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHhcc-------------ccCCCC-CCchhhhhcceee Confidence 12233333333333332222 112345566556555555544321 122233 7999999999887 Q ss_pred eeccC Q lcl|NC_019918. 176 IHRRQ 180 (180) Q Consensus 176 V~~k~ 180 (180) ....- T Consensus 64 ~~g~~ 68 (115) T protein:vir:96 64 KTGDL 68 (115) T ss_pred ecCce Confidence 44322 No 149 >protein:vir:78858 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285365;genbank:gi:148717893;genbank:GeneID:5246989 Probab=79.05 E-value=0.0053 Score=33.06 Aligned_cols=68 Identities=9% Similarity=0.183 Sum_probs=32.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCcHHHHHhcCCCCCchhHHHHHhhhhhh Q lcl|NC_019918. 96 EEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLESVKFQ 175 (180) Q Consensus 96 ~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~~~pPnsp~Ti~~KG~~~PLIDTG~L~~SIty~ 175 (180) -+..+-+++.+.+++.-..+ ...+..++..-|..++..+|..-. ..++.| +|||.|++||++. T Consensus 1 i~~~Gld~l~~~l~~~~~~~---~~~v~~a~~~~~~~i~~~a~~~a~-------------~~~~~p-~~TG~Lr~sI~~~ 63 (115) T protein:vir:78 1 MNIDGLDALLNQFHDMKTNI---DDDVDDILQENAKEYVVRAKLKAR-------------EVMNKG-YWTGNLSRNIRYK 63 (115) T ss_pred CcchhHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHhcc-------------ccCCCC-CCchhhhhcceee Confidence 12233333333333332222 112345566556555555544321 122233 7999999999887 Q ss_pred eeccC Q lcl|NC_019918. 176 IHRRQ 180 (180) Q Consensus 176 V~~k~ 180 (180) ....- T Consensus 64 ~~g~~ 68 (115) T protein:vir:78 64 KTGDL 68 (115) T ss_pred ecCce Confidence 44322 No 150 >protein:vir:104347 Length: 145 # NCBI annotation: conserved phage-related protein # Family: family:all:448 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398975;genbank:gi:81343959;genbank:GeneID:3778879 Probab=78.59 E-value=0.064 Score=27.12 Aligned_cols=106 Identities=10% Similarity=0.070 Sum_probs=48.8 Q ss_pred CCCCcccccchhhHHHHHhhhheecccceeeehHHHHHHHHHHHHHh---------hCCEEEEEecccc-cCCCCCCCC- Q lcl|NC_019918. 1 MQDGRSFTTSATPVLKTLALGVIILASFSFKTDRRRLTSLIKRVEAL---------DGTTVEVGFFPED-RYGSENGNL- 69 (180) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~M~~~v~~k~~~~~l~~l~~~l~~l---------~~~~V~VGi~~~~-~~~~~~~G~- 69 (180) |-+-+||..+-+-..+..-=.+ ... .+.-.-+++.+|... ++..|.++-+... ..+...+|. T Consensus 5 m~~~~sF~~~i~~~~~~ve~~~--~~v-----~r~~a~~i~~~vv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~G~~ 77 (145) T protein:vir:10 5 IGSVVTFEKSIADWIDRAEDGF--GIV-----VSNTVIKTANAIVDLSPVDTGRFKANWQISANSPAQQSLNEYDQTGGQ 77 (145) T ss_pred ccchhccccCHHHHHHHHHHHH--HHH-----HHHHHHHHHHHHHHhCCccchhhccccceeecccccccccccCCCCcc Confidence 3333666554443332211000 000 000011122222211 2345555554422 111112232 Q ss_pred ------------------------CHHHHHHHHhcCCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHH Q lcl|NC_019918. 70 ------------------------PVAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRDGRQVNTL 125 (180) Q Consensus 70 ------------------------~vA~iA~i~EfGt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~ 125 (180) +++-+|...|||+.+-+|+.|.|.++.+ |.+.+++++.++-.- T Consensus 78 t~~~~~~~~~~i~~~k~g~~iyi~Nn~pYA~~LEyG~S~QAP~G~v~~~~~~------~~~~v~~~~~e~k~~------- 144 (145) T protein:vir:10 78 TKTYLARQARAVANSKATSVIYITNRLDYAADLEYGASNQAPAGVLGVVQAR------LGRYFQEAVEEARRA------- 144 (145) T ss_pred chhhHHHHHHHhhcccccceEEEeeCchhhhHhhccccCCCcchHHHHHHHH------HHHHHHHHHHHhhcc------- Confidence 3456888899999999999999999843 555555555443111 Q ss_pred H Q lcl|NC_019918. 126 L 126 (180) Q Consensus 126 L 126 (180) + T Consensus 145 ~ 145 (145) T protein:vir:10 145 I 145 (145) T ss_pred C Confidence 1 No 151 >protein:vir:107099 Length: 137 # NCBI annotation: conserved phage protein # Family: family:all:180 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950610;genbank:gi:119953690;genbank:GeneID:4643108 Probab=77.58 E-value=0.0071 Score=32.37 Aligned_cols=64 Identities=19% Similarity=0.283 Sum_probs=34.4 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCcHHHHHhcCCCCCchhHHHHHh Q lcl|NC_019918. 91 MAPTFEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLE 170 (180) Q Consensus 91 lr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~~~pPnsp~Ti~~KG~~~PLIDTG~L~~ 170 (180) |-..+ .+-+++.+.++..-.++. ..+..+|+..+..+++.+|... | +|||.|++ T Consensus 1 Ma~~~---~Gl~~l~~~l~~~~~~~~---~~~~~al~~~a~~i~~~ak~~a-------------------P-vdTG~Lr~ 54 (137) T protein:vir:10 1 MAKVK---YGNWELVKELEDFEKETI---RWAKKGIAKTTTIIHNSIVSNM-------------------P-VDTGYLRE 54 (137) T ss_pred CchhH---hhHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHhC-------------------C-cCcchhhc Confidence 32211 122233333333222221 2345667777766666665532 2 59999999 Q ss_pred hhhhheeccC Q lcl|NC_019918. 171 SVKFQIHRRQ 180 (180) Q Consensus 171 SIty~V~~k~ 180 (180) ||++++.... T Consensus 55 SI~~~~~~~~ 64 (137) T protein:vir:10 55 SVSMDFKKGG 64 (137) T ss_pred CeeEEeeCCc Confidence 9998876554 No 152 >protein:vir:100652 Length: 134 # NCBI annotation: 77ORF029 # Family: family:all:589 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958610;genbank:gi:41189542;genbank:GeneID:2743798 Probab=76.91 E-value=0.01 Score=31.47 Aligned_cols=81 Identities=19% Similarity=0.317 Sum_probs=39.6 Q ss_pred hhheecccceeeehHHHHHH---------------------HHHHHH-Hhh----------------------CCEEEEE Q lcl|NC_019918. 20 LGVIILASFSFKTDRRRLTS---------------------LIKRVE-ALD----------------------GTTVEVG 55 (180) Q Consensus 20 ~~~~M~~~v~~k~~~~~l~~---------------------l~~~l~-~l~----------------------~~~V~VG 55 (180) |+|+++ ++ +...++|++ +++.++ ++. .+.|+|| T Consensus 1 Msvevk-Gv--~eil~~LE~k~g~~~~~ri~dkAL~~age~v~~~~K~~~~~fkDTGati~ev~~s~p~~~~G~r~V~vg 77 (134) T protein:vir:10 1 MSVKVT-GD--KALERELEKHFGIKEMVKVQDKALIAGAKVIVEEIKKQLKPSEDSGALISEIGRTEPEWIKGKRTVTIR 77 (134) T ss_pred CeEEee-cH--HHHHHHHHHhhchhhhhhhhhHHHHHHhHHHHHHHHhhcCccccccceeccEeecCeeecCCceEEEEE Confidence 444433 11 111111111 112222 111 1467777 Q ss_pred ecccc-cCCCCCCCCCHHHHHHHHhcCCCCCCCCcchhh--------HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019918. 56 FFPED-RYGSENGNLPVAQVAAYNEFGTTRNPTRPFMAP--------TFEEFTSQFHYARLMKSTFENV 115 (180) Q Consensus 56 i~~~~-~~~~~~~G~~vA~iA~i~EfGt~~IP~RpFlr~--------~~~~~~~~~~~~~~~~~~~~~~ 115 (180) |-.+. +| -|--+||||..+-...+|++| +++.. +..+.+.++..++++ T Consensus 78 W~G~~~R~----------~ivHLnE~Gyt~~r~Gk~i~PrG~G~i~~a~~~~--e~~~~~~ik~eL~kl 134 (134) T protein:vir:10 78 WRGPFERF----------RIVHLIENGHVEKKSGKFVKPKAMGGINRAIRQG--QNKYFETLKRELKKL 134 (134) T ss_pred EEcCCcee----------eEEEeeecceeecCCCCeeccchhhHHHHHHHhh--hHHHHHHHHHHHhcC Confidence 74332 33 266679999887888889888 55442 334455555555544 No 153 >protein:vir:94108 Length: 149 # NCBI annotation: ORF029 # Family: family:all:180 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240238;genbank:gi:66395914;genbank:GeneID:5133277 Probab=72.73 E-value=0.035 Score=28.55 Aligned_cols=76 Identities=13% Similarity=0.201 Sum_probs=33.3 Q ss_pred EEEEecccccCCCCCCCCCHHHHHHHHhcCCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHH Q lcl|NC_019918. 52 VEVGFFPEDRYGSENGNLPVAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGR 131 (180) Q Consensus 52 V~VGi~~~~~~~~~~~G~~vA~iA~i~EfGt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~ 131 (180) ++|-...-.+ --+|. .+|| ++ ++.+.+++.-..+- ..++++|...+. T Consensus 1 ~~~~~~~~~~-------~~Ma~----~~~G-------------ld------~l~~~L~~~~~~~~---~~~~~al~~~a~ 47 (149) T protein:vir:94 1 MKLSYYDLSR-------CHMAK----VKYG-------------AD------SMVVELDKFDKKIE---EWVKKGIAKTTT 47 (149) T ss_pred Ceeeeeecch-------hhHHH----HHHH-------------HH------HHHHHHHHHHHHHH---HHHHHHHHHHHH Confidence 1111111000 00111 1222 22 22223333222221 123455555555 Q ss_pred HHHHHHHHHHhcCCCCCcHHHHHhcCCCCCchhHHHHHhhhhhheeccC Q lcl|NC_019918. 132 MVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLESVKFQIHRRQ 180 (180) Q Consensus 132 ~~~~~Iq~~I~~~~pPnsp~Ti~~KG~~~PLIDTG~L~~SIty~V~~k~ 180 (180) .+++.+|.. .| +|||.|++||++++.... T Consensus 48 ~v~~~ak~~-------------------aP-vdTG~Lr~SI~~~~~~~g 76 (149) T protein:vir:94 48 KIYNTAVAL-------------------AP-VDLGFLEESIDFKYFDGG 76 (149) T ss_pred HHHHHHHHh-------------------CC-cccchhhcCeeEEeeCCc Confidence 555554432 23 699999999999887665 No 154 >protein:vir:105467 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529877;genbank:gi:90592617;genbank:GeneID:3974531 Probab=72.34 E-value=0.055 Score=27.49 Aligned_cols=66 Identities=11% Similarity=0.174 Sum_probs=31.4 Q ss_pred hhh-HHHHHHHHHHHHHHHHHHHHHHHhCC--CCHHHHHHHHHHHHHHHHHHHHhcCCCCCcHHHHHhcCCCCCchhHHH Q lcl|NC_019918. 91 MAP-TFEEFTSQFHYARLMKSTFENVIRDG--RQVNTLLKKLGRMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGK 167 (180) Q Consensus 91 lr~-~~~~~~~~~~~~~~~~~~~~~~l~g~--~~~~~~L~~iG~~~~~~Iq~~I~~~~pPnsp~Ti~~KG~~~PLIDTG~ 167 (180) |-. .++ .+.-+++.+.+++... .+. ..++.+|+.+|..+...||.. .| +|||. T Consensus 1 Ms~~~id-~~gl~~~~~~l~~~~~---~~~~~~~~~~~l~~~~~~~~~~vk~~-------------------tP-VdTG~ 56 (144) T protein:vir:10 1 MSLGHVD-DAQFQQFASRVRQKID---SGYVKQELGKSSRRIGTQSLRILEAN-------------------TP-VKQGN 56 (144) T ss_pred CCCCCcc-HHHHHHHHHHHHHHHh---hcchHHHHHHHHHHHHHHHHHHHHHh-------------------CC-CCcch Confidence 432 222 2222223333332221 111 123455555555555555432 12 79999 Q ss_pred HHhhhhhheeccC Q lcl|NC_019918. 168 MLESVKFQIHRRQ 180 (180) Q Consensus 168 L~~SIty~V~~k~ 180 (180) |++|++..-..++ T Consensus 57 Lr~S~~~~~~~~~ 69 (144) T protein:vir:10 57 LRRSWTAEGPTYG 69 (144) T ss_pred hccceeecceeee Confidence 9999987644443 No 155 >protein:vir:96486 Length: 112 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238496;genbank:gi:66391772;genbank:GeneID:5176908 Probab=72.13 E-value=0.025 Score=29.35 Aligned_cols=67 Identities=18% Similarity=0.217 Sum_probs=35.8 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCcHHHHHhcCCCCCchhHHHHHh Q lcl|NC_019918. 91 MAPTFEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLE 170 (180) Q Consensus 91 lr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~~~pPnsp~Ti~~KG~~~PLIDTG~L~~ 170 (180) |-. + +..+-+++.+.++ .... ...++.++...+......++....... | +|||.|++ T Consensus 1 Ma~-i-~i~Gld~L~~~l~----~~~~-~~~v~~~v~~~~~~~~~~~~~~a~~~a---------------p-vdTG~Lr~ 57 (112) T protein:vir:96 1 MAT-I-EFEGLDEMAQSLL----KNAS-SERRSKVLRKYGAKLKEAAVSKAQFKK---------------G-YSTGATRR 57 (112) T ss_pred Cce-e-eehHHHHHHHHHH----hhcC-HHHHHHHHHHHHHHHHHHHHHHhhhcC---------------C-CCchhhhh Confidence 321 1 2223222222222 2211 134567777777777777776554321 2 79999999 Q ss_pred hhhhheeccC Q lcl|NC_019918. 171 SVKFQIHRRQ 180 (180) Q Consensus 171 SIty~V~~k~ 180 (180) ||+++..... T Consensus 58 sI~~~~~~~~ 67 (112) T protein:vir:96 58 SITLEAGSDR 67 (112) T ss_pred ceeeecCceE Confidence 9987554444 No 156 >protein:vir:98636 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039927;genbank:gi:126011102;genbank:GeneID:4818472 Probab=71.23 E-value=0.048 Score=27.79 Aligned_cols=88 Identities=15% Similarity=0.227 Sum_probs=54.5 Q ss_pred HhhhheecccceeeehHHHHHHHHHHH-----HHhh------------------------------------------CC Q lcl|NC_019918. 18 LALGVIILASFSFKTDRRRLTSLIKRV-----EALD------------------------------------------GT 50 (180) Q Consensus 18 ~~~~~~M~~~v~~k~~~~~l~~l~~~l-----~~l~------------------------------------------~~ 50 (180) |.+-|.|.+.-.+|....-+..+.++| +.+. .+ T Consensus 1 ~~~~~~~~~~aevkGv~Eilk~lE~klG~~~~~ri~nkAL~~~ge~v~~~lK~~~~~fkDTGat~dev~~s~p~~~~G~r 80 (138) T protein:vir:98 1 MLLEVSMSGFANLKGVEELLANMEKKLGPAKVNRVVNRSLKEIGKELEPSFKSAISIYKRTGETTESAVVSGVRREDGIP 80 (138) T ss_pred CeeeecccccccccCHHHHHHHHHHhhCHHhhhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeeeeeecCeeecCCce Confidence 777788887777775543333222211 1110 23 Q ss_pred EEEEEecccccCCCCCCCCCHHHHHHHHhcCCC-CCCCCc--chhhHHHHHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_019918. 51 TVEVGFFPEDRYGSENGNLPVAQVAAYNEFGTT-RNPTRP--FMAPTFEEFTSQFHYARLMKSTFENVIRD 118 (180) Q Consensus 51 ~V~VGi~~~~~~~~~~~G~~vA~iA~i~EfGt~-~IP~Rp--Flr~~~~~~~~~~~~~~~~~~~~~~~l~g 118 (180) .|+|||... +|+ |=-+||||.. .|-||- +++.+++.- +..+.+.++..+++.++| T Consensus 81 ~V~igW~Gp-R~~----------ivHLNE~GyGk~i~PrG~G~I~ka~~~s--e~~y~~~vk~el~k~l~~ 138 (138) T protein:vir:98 81 KVKLGFTTP-RWN----------IVHLQELEYGWKHNRRGVGVIRRYSDIL--ETIYPRGIRDKLKRGFDG 138 (138) T ss_pred EEEEeeecC-eee----------EEeeecccccCCcCCCcchHHHHHHHhh--hHHHHHHHHHHHHHHhcC Confidence 566777532 442 5567899975 455554 577777653 456888899999999998 No 157 >protein:vir:78077 Length: 141 # NCBI annotation: gp9 # Family: family:all:180 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468793;genbank:gi:157325374;genbank:GeneID:5601839 Probab=70.40 E-value=0.017 Score=30.26 Aligned_cols=63 Identities=11% Similarity=0.246 Sum_probs=26.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHH-HHHHHhcCCCCCcHHHHHhcCCCCCchhHHHHHhhhh Q lcl|NC_019918. 95 FEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQ-MQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLESVK 173 (180) Q Consensus 95 ~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~-Iq~~I~~~~pPnsp~Ti~~KG~~~PLIDTG~L~~SIt 173 (180) +++.+ +...+.+...++. ....+++..++...... |+..... + .| +|||.|++||+ T Consensus 1 ~~~~~----f~~~~~~~~~~~~---k~~~~~~~~~a~~~~~~~ie~~ak~----~-----------~p-vdtG~L~~SI~ 57 (141) T protein:vir:78 1 MNEFE----FDSNIPKARKLIE---KKVLQALEDIGEHMTTELAEGGHGV----T-----------SN-NDTGEYAQKSG 57 (141) T ss_pred Ccchh----HHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHhhhh----c-----------cc-cccchhhccee Confidence 32222 2222222211111 11223333333322221 1111111 1 23 79999999999 Q ss_pred hheeccC Q lcl|NC_019918. 174 FQIHRRQ 180 (180) Q Consensus 174 y~V~~k~ 180 (180) |+|.... T Consensus 58 ~~v~~~g 64 (141) T protein:vir:78 58 YKVRKSS 64 (141) T ss_pred eeeecCC Confidence 9986555 No 158 >protein:vir:3617 Length: 112 # NCBI annotation: ORF40 # Family: family:all:180 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112703;genbank:gi:13786571;genbank:GeneID:921069 Probab=69.72 E-value=0.049 Score=27.76 Aligned_cols=64 Identities=17% Similarity=0.240 Sum_probs=33.2 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCcHHHHHhcCCCCCchhHHHHHh Q lcl|NC_019918. 91 MAPTFEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLE 170 (180) Q Consensus 91 lr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~~~pPnsp~Ti~~KG~~~PLIDTG~L~~ 170 (180) |+..++- .+-+++.+.+ .+.. ....++.+|...+..+++++|. . + | +|||.|++ T Consensus 1 M~~~i~i-~Gld~l~~~L----~~~~-~~~~~~~al~~~~~~i~~~ak~----~----a-----------P-vdTG~Lr~ 54 (112) T protein:vir:36 1 MKSSLSF-KGIDQLVKHL----DKAA-SLKGVQQVVKSNTSNMTANMQK----L----V-----------P-VDTGYMKR 54 (112) T ss_pred Cceeeee-hhHHHHHHHH----Hhhh-hHHHHHHHHHHHHHHHHHHHHH----h----C-----------C-CCchhhhh Confidence 7766642 2322332222 2221 1123455555555555555543 1 1 1 79999999 Q ss_pred hhhhheeccC Q lcl|NC_019918. 171 SVKFQIHRRQ 180 (180) Q Consensus 171 SIty~V~~k~ 180 (180) ||+..+.+.. T Consensus 55 si~~~~~~~~ 64 (112) T protein:vir:36 55 SIKMELTEGG 64 (112) T ss_pred ceeeeecCCc Confidence 9987765543 No 159 >protein:vir:94994 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224022;genbank:gi:62327309;genbank:GeneID:5176822 Probab=69.27 E-value=0.072 Score=26.84 Aligned_cols=93 Identities=18% Similarity=0.156 Sum_probs=43.6 Q ss_pred CCCCcccccchhhHHHHHhhhheecccceeeehHHHHHHHHHHHHHhhCCEEEEEecccccCCCCCCCCCHHHHHHHHhc Q lcl|NC_019918. 1 MQDGRSFTTSATPVLKTLALGVIILASFSFKTDRRRLTSLIKRVEALDGTTVEVGFFPEDRYGSENGNLPVAQVAAYNEF 80 (180) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~M~~~v~~k~~~~~l~~l~~~l~~l~~~~V~VGi~~~~~~~~~~~G~~vA~iA~i~Ef 80 (180) ...|| |..|-..-+-++..++.-...-+-..+.......+..++. ...+.++ +++-+|...|| T Consensus 39 VdTGr-~Ranw~vs~~~~~~~~~~~~d~~g~~t~~~~~~~i~~~~~--g~~iyi~--------------Nn~pYA~~LEy 101 (131) T protein:vir:94 39 VDTGR-FRMNWMASGSTPADGTTDATDKSGNTATGNATSFVLNAAD--WHTFTLT--------------NNLPYAQRLEY 101 (131) T ss_pred Cchhh-hhccchhccccccccccCCCCCCchhhHHHHHHHHhhccc--cceEEEe--------------eCchhhhhhhc Confidence 22232 2222211111122111111110000111112222222211 1112211 24668999999 Q ss_pred CCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019918. 81 GTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVI 116 (180) Q Consensus 81 Gt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l 116 (180) |+.+-+|+.|.|.++. .|.+.+++++.++- T Consensus 102 G~S~QAP~g~v~~~~~------~~~~~v~~~~~e~k 131 (131) T protein:vir:94 102 GWSQQAPQGFVRVNVS------RFQQLLNEEASKVK 131 (131) T ss_pred cccCCCcchHHHHHHH------HHHHHHHHHHHhcC Confidence 9999999999999884 37778888887764 No 160 >protein:vir:9879 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:2718 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795641;genbank:gi:28876400;genbank:GeneID:1257931 Probab=67.53 E-value=0.018 Score=30.11 Aligned_cols=101 Identities=13% Similarity=0.164 Sum_probs=49.1 Q ss_pred ccchhhHHHHHhhhheecccceee-ehHHHHHHHHHHHHHhhCCEEEEEe---cccccCC-----CCCCCCC-------- Q lcl|NC_019918. 8 TTSATPVLKTLALGVIILASFSFK-TDRRRLTSLIKRVEALDGTTVEVGF---FPEDRYG-----SENGNLP-------- 70 (180) Q Consensus 8 ~~~~~~~~~~~~~~~~M~~~v~~k-~~~~~l~~l~~~l~~l~~~~V~VGi---~~~~~~~-----~~~~G~~-------- 70 (180) -+----+.+.|- +| +.-.|+ +.++...++..+.+...+..|.+=+ ..+..-+ -.++|++ T Consensus 1 i~G~~~L~~~Lk---~~-s~~dvk~VVkkN~ael~~r~q~~~~~pv~~~~k~~dTG~lkRSi~l~~~~~g~~~~vgp~g~ 76 (127) T protein:vir:98 1 MTGMPALEVKLR---SM-SEKRWDRVANKNLTEMFNRAARPPGTPIGKNTKRHKSGELLRSRRLKKVNSSKDVITGNFGY 76 (127) T ss_pred CcChHHHHHHHH---Hh-hHHHHHHHHhhhhHHHHHHHHhccCCceeccccccCcccceeeeEEEEecCCceEEeccCcc Confidence 111111122221 11 111121 2344555666666554333231111 1111111 1233432 Q ss_pred HHHHHHHHhcCCC---------CCCCCcchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019918. 71 VAQVAAYNEFGTT---------RNPTRPFMAPTFEEFTSQFHYARLMKSTFEN 114 (180) Q Consensus 71 vA~iA~i~EfGt~---------~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~ 114 (180) .+++|-+.|||++ -.|+-|||.|+|+.++. .+-+-++..++. T Consensus 77 t~dYapyvEyGTR~m~~~~~~gf~~aqp~l~paf~~Qk~--iF~~DL~~l~k~ 127 (127) T protein:vir:98 77 IKDYAPHVEYGHRIVRNGKQVGYANGTKYLFNNVKKQRE--IYRQDMLNELRR 127 (127) T ss_pred cccccceeecceeeeecccccccccCccccccchHHHhH--HHHHHHHHHhcC Confidence 4789999999987 37899999999987543 455556666555 No 161 >protein:vir:4906 Length: 114 # NCBI annotation: gp114 # Family: family:all:180 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056684;genbank:gi:9635019;genbank:GeneID:1262668 Probab=66.06 E-value=0.039 Score=28.30 Aligned_cols=67 Identities=19% Similarity=0.272 Sum_probs=36.1 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCcHHHHHhcCCCCCchhHHHHHh Q lcl|NC_019918. 91 MAPTFEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLE 170 (180) Q Consensus 91 lr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~~~pPnsp~Ti~~KG~~~PLIDTG~L~~ 170 (180) |-. + +..+-+++ .+.+.+. .+...++.++...|..++..++.... .+ .| +|||.|++ T Consensus 1 Ma~-i-~~~Gld~l----~~~L~~~-~~~~~v~~~~~~~~~~~~~~~~~~a~----~~-----------~p-~~TG~Lr~ 57 (114) T protein:vir:49 1 MAT-I-EFEGLDEM----AQSLLKN-ASPEKRSKVLRKYGSKLKEAAVNRAQ----FN-----------KG-YSTGATRR 57 (114) T ss_pred Cee-e-eeehHHHH----HHHHHHh-cCHHHHHHHHHHHHHHHHHHHHHhcc----cC-----------CC-CCchhhhh Confidence 321 1 11222222 2233332 12234567777777777766665431 11 22 69999999 Q ss_pred hhhhheeccC Q lcl|NC_019918. 171 SVKFQIHRRQ 180 (180) Q Consensus 171 SIty~V~~k~ 180 (180) ||+..+.+.+ T Consensus 58 sI~~~~~~~~ 67 (114) T protein:vir:49 58 SITLQVESDK 67 (114) T ss_pred ceeeeecCCe Confidence 9998876666 No 162 >protein:vir:2740 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695113;genbank:gi:23455882;genbank:GeneID:955595 Probab=66.06 E-value=0.039 Score=28.30 Aligned_cols=67 Identities=19% Similarity=0.272 Sum_probs=36.1 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCcHHHHHhcCCCCCchhHHHHHh Q lcl|NC_019918. 91 MAPTFEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLE 170 (180) Q Consensus 91 lr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~~~pPnsp~Ti~~KG~~~PLIDTG~L~~ 170 (180) |-. + +..+-+++ .+.+.+. .+...++.++...|..++..++.... .+ .| +|||.|++ T Consensus 1 Ma~-i-~~~Gld~l----~~~L~~~-~~~~~v~~~~~~~~~~~~~~~~~~a~----~~-----------~p-~~TG~Lr~ 57 (114) T protein:vir:27 1 MAT-I-EFEGLDEM----AQSLLKN-ASPEKRSKVLRKYGSKLKEAAVNRAQ----FN-----------KG-YSTGATRR 57 (114) T ss_pred Cee-e-eeehHHHH----HHHHHHh-cCHHHHHHHHHHHHHHHHHHHHHhcc----cC-----------CC-CCchhhhh Confidence 321 1 11222222 2233332 12234567777777777766665431 11 22 69999999 Q ss_pred hhhhheeccC Q lcl|NC_019918. 171 SVKFQIHRRQ 180 (180) Q Consensus 171 SIty~V~~k~ 180 (180) ||+..+.+.+ T Consensus 58 sI~~~~~~~~ 67 (114) T protein:vir:27 58 SITLQVESDK 67 (114) T ss_pred ceeeeecCCe Confidence 9998876666 No 163 >protein:vir:102963 Length: 163 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945289;genbank:gi:39653724;uniprot:Q708M3;genbank:GeneID:2672877 Probab=63.82 E-value=0.039 Score=28.32 Aligned_cols=112 Identities=14% Similarity=0.143 Sum_probs=52.2 Q ss_pred CCCCcccccchhhHHHHH---hhhheecccceeeehHHHHHHHHHHHHHhhCC--------------------------- Q lcl|NC_019918. 1 MQDGRSFTTSATPVLKTL---ALGVIILASFSFKTDRRRLTSLIKRVEALDGT--------------------------- 50 (180) Q Consensus 1 ~~~~~~~~~~~~~~~~~~---~~~~~M~~~v~~k~~~~~l~~l~~~l~~l~~~--------------------------- 50 (180) |++|-.|. .-.-..|-| +..-+|.. ...+-++++..+|.+..++ T Consensus 1 m~~~~d~~-~l~~f~k~l~~~~~~~~~~~-----~~~~~~~e~a~~ll~~vk~rtPv~~~~~~~~~~~~~~~k~~k~~~~ 74 (163) T protein:vir:10 1 MSGGFDYR-SFAKFANNFNRNANHAKVDR-----FMRQTLNYEGTELKSKVKERTPVGVYTDHWVEFTTKDGKHVKFWAS 74 (163) T ss_pred CCCccCHH-HHHHHHHHHHHHhhhcchHH-----HHHHHHHHHHHHHHHHHHHhCCcccchhhhhhhhhcccchhhhhcc Confidence 66665543 222222322 11111110 1112222222222211111 Q ss_pred -------EEEEEecccccCCCCCCC-----CCHHHHHHHHhcCCC-----CCCCCcchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019918. 51 -------TVEVGFFPEDRYGSENGN-----LPVAQVAAYNEFGTT-----RNPTRPFMAPTFEEFTSQFHYARLMKSTFE 113 (180) Q Consensus 51 -------~V~VGi~~~~~~~~~~~G-----~~vA~iA~i~EfGt~-----~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~ 113 (180) .++=||-.+..+. ..++ .+.+.+|-+-|||.. -+|.+.+|+.+.++... .+.+.+++.+. T Consensus 75 ~~~k~tG~lr~swk~~~~~k-~~~~~~v~v~N~~~YA~~VE~GHR~~~gGfV~G~fml~~s~~~~~~--~~~~~~e~~l~ 151 (163) T protein:vir:10 75 AHGKQGGTLQKGWSKSRIEV-SGRTYKQKVYNKVYYAPHVEYGHKTVNGGFVPGQFFLHKTVEDTKS--DMEKRVRDKYD 151 (163) T ss_pred ccccccchhhccceecceee-cCCceEEEEEecCCccchhhcceeecCCceeccchhhHHHHHHHHH--HHHHHHHHHHH Confidence 2222232222211 1222 256789999999965 48999999999877643 46666665555 Q ss_pred HH----HhCCCC Q lcl|NC_019918. 114 NV----IRDGRQ 121 (180) Q Consensus 114 ~~----l~g~~~ 121 (180) .+ +.|+.. T Consensus 152 ~~l~k~~~~~~~ 163 (163) T protein:vir:10 152 GFMRKVVLGNGK 163 (163) T ss_pred HHHHHhhcCCCC Confidence 44 445443 No 164 >protein:vir:78380 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110844;genbank:gi:134288605;genbank:GeneID:5179643 Probab=60.57 E-value=0.15 Score=25.05 Aligned_cols=93 Identities=15% Similarity=0.136 Sum_probs=42.2 Q ss_pred CCCCcccccchhhHHHHHhhhheecccceeeehHHHHHHHHHHHHHhhCCEEEEEecccccCCCCCCCCCHHHHHHHHhc Q lcl|NC_019918. 1 MQDGRSFTTSATPVLKTLALGVIILASFSFKTDRRRLTSLIKRVEALDGTTVEVGFFPEDRYGSENGNLPVAQVAAYNEF 80 (180) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~M~~~v~~k~~~~~l~~l~~~l~~l~~~~V~VGi~~~~~~~~~~~G~~vA~iA~i~Ef 80 (180) ...|| |..|-..-+-++.-+..-..--+-..+.......+..++. ...+.+ .+++-+|...|| T Consensus 39 VdTGr-~Ranw~vs~~~~~~~~~~~~d~~g~~t~~~~~~~i~~~~~--g~~iyi--------------~Nn~pYA~~LEy 101 (131) T protein:vir:78 39 VDTGR-FRMNWMASGGTPADGTTDATDKAGTTATSNAANFVLNAAD--WHTFTL--------------TNNLPYAQRLEY 101 (131) T ss_pred Cchhh-hccccceecccccccccCCCCCCchhhHHHHHHHHhhccC--CceEEE--------------eeCchhhhHhhc Confidence 22222 2211111111111111111100000011111111111111 111111 134668999999 Q ss_pred CCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019918. 81 GTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVI 116 (180) Q Consensus 81 Gt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l 116 (180) |+.+-+|+.|.|.++. .|.+++++++.++- T Consensus 102 G~S~QAP~G~v~~~~~------~~~~~v~~~~~e~k 131 (131) T protein:vir:78 102 GWSQQAPQGFVRVNVS------RFQQLLNEEASKVK 131 (131) T ss_pred cccCCCcchHHHHHHH------HHHHHHHHHHHhcC Confidence 9999999999999884 37788888887764 No 165 >protein:vir:99528 Length: 92 # NCBI annotation: putative major tail protein # Family: family:all:180 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958541;genbank:gi:41179323;genbank:GeneID:2717166 Probab=54.88 E-value=0.036 Score=28.48 Aligned_cols=64 Identities=22% Similarity=0.242 Sum_probs=33.8 Q ss_pred hhhH-HHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCcHHHHHhcCCCCCchhHHHHH Q lcl|NC_019918. 91 MAPT-FEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKML 169 (180) Q Consensus 91 lr~~-~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~~~pPnsp~Ti~~KG~~~PLIDTG~L~ 169 (180) |-.. + +....+++.+ .+.+... ..++++++...|..++...|.. + | +|||.|+ T Consensus 1 Ma~~~i-~~~Gld~L~~----~L~~~~~-~~~v~~vv~~~~~~l~~~ak~~--------a-----------p-~dTG~lr 54 (92) T protein:vir:99 1 MADYSI-SWDGLDALDE----ALANQQN-MNTVKKVVKKHTANLMTATQQA--------V-----------P-VDTGHLK 54 (92) T ss_pred CCceee-EeehHHHHHH----HHHhhcc-HHHHHHHHHHHHHHHHHHHHHh--------C-----------C-CCccccc Confidence 2221 1 1112222222 2222211 1345666777776666665552 2 2 7999999 Q ss_pred hhhhhheeccC Q lcl|NC_019918. 170 ESVKFQIHRRQ 180 (180) Q Consensus 170 ~SIty~V~~k~ 180 (180) +||+..+.+.. T Consensus 55 rSI~~~~~~~g 65 (92) T protein:vir:99 55 QSAQIQISRDG 65 (92) T ss_pred eeeeEEeecCC Confidence 99997777665 No 166 >protein:vir:102441 Length: 137 # NCBI annotation: gp26 # Family: family:all:1084 # MgeID: mge:1618 # MgeName: Pipefish # Cross-refs: genbank:acc:YP_655303;genbank:gi:109521866;genbank:GeneID:4157756 Probab=54.67 E-value=0.049 Score=27.77 Aligned_cols=60 Identities=17% Similarity=0.076 Sum_probs=24.3 Q ss_pred CCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCcHHHHHhcCCCCCchh Q lcl|NC_019918. 85 NPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFH 164 (180) Q Consensus 85 IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~~~pPnsp~Ti~~KG~~~PLID 164 (180) ++..--+ .. +.....+.+...+ +.+|+.++..+++..|. ..-+| T Consensus 1 ~~~~~~~--~~----~~~~~~~~~~~v~----------r~~l~~~a~~v~~~Ak~--------------------~aPv~ 44 (137) T protein:vir:10 1 MTVTARY--ER----NPVGEARQFQVIA----------RRRLSRITRGTANQARA--------------------DVPVK 44 (137) T ss_pred CeeEEEe--cc----CchhHHHHHHHHH----------HHHHHHHHHHHHHHHHh--------------------cCCcc Confidence 1111111 11 1001111111111 12233333333333332 22369 Q ss_pred HHHHHhhhhhheeccC Q lcl|NC_019918. 165 TGKMLESVKFQIHRRQ 180 (180) Q Consensus 165 TG~L~~SIty~V~~k~ 180 (180) ||.|++||.+.+.... T Consensus 45 tG~Lr~SI~~~~~~~~ 60 (137) T protein:vir:10 45 TGNLGRSIREDPIVVA 60 (137) T ss_pred chhhhcCceeeeeecc Confidence 9999999998765444 No 167 >protein:vir:103280 Length: 142 # NCBI annotation: phage-related hypothetical protein # Family: family:all:448 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277459;genbank:gi:71834102;genbank:GeneID:3562391 Probab=52.35 E-value=0.39 Score=22.82 Aligned_cols=96 Identities=13% Similarity=0.135 Sum_probs=41.9 Q ss_pred CCCCcccccchhhHHHHHhhhheec--ccceeeehHHHHHHHHHHHHHh-hCCEEEEEecccccCCCCCCCCCHHHHHHH Q lcl|NC_019918. 1 MQDGRSFTTSATPVLKTLALGVIIL--ASFSFKTDRRRLTSLIKRVEAL-DGTTVEVGFFPEDRYGSENGNLPVAQVAAY 77 (180) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~M~--~~v~~k~~~~~l~~l~~~l~~l-~~~~V~VGi~~~~~~~~~~~G~~vA~iA~i 77 (180) ...|| |..|-..-+-++.-+..-. ++= ....+.....+..+... ....+.++ +++-+|.. T Consensus 44 VdTGr-~R~nw~vs~~~~~~~~~~~~d~~G--~~t~~~~~~~~~~i~~~~~g~~iyi~--------------Nn~pYA~~ 106 (142) T protein:vir:10 44 VDTGR-FRGNWQATGNSPAAQSLNNYDPDG--NETRNSLRRQIYALARDANTNVIYIS--------------NRLDYAQG 106 (142) T ss_pred ccchh-hcccceeeecCcccccccCcCCCC--ccchhhHHHHHHHhhhccccceEEEe--------------eCcchhhh Confidence 33332 1111111111111111000 000 00011111111122110 11112222 23568899 Q ss_pred HhcCCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_019918. 78 NEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRDGR 120 (180) Q Consensus 78 ~EfGt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g~~ 120 (180) .|||+..-.|+.|.|.++. .|.+++++++.++ .+.. T Consensus 107 LEyG~S~QAP~G~v~~a~q------~~~~~v~~a~~e~-~~~~ 142 (142) T protein:vir:10 107 LEFGSSNQAPSGVLGVVQK------RLGRYFAEAVQEA-KRAL 142 (142) T ss_pred hhccccCCCcchHHHHHHH------HHHHHHHHHHHHh-hccC Confidence 9999999999999999984 3777777777665 2333 No 168 >protein:vir:7412 Length: 168 # NCBI annotation: hypothetical protein # Family: family:all:1029 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839929;genbank:gi:30089899;genbank:GeneID:1260686 Probab=49.87 E-value=0.31 Score=23.36 Aligned_cols=117 Identities=12% Similarity=0.100 Sum_probs=58.5 Q ss_pred CCCCcccccchhhHHHHH-hhhheecccceeeehHHHHHHHHHHHHHhhC------------------------------ Q lcl|NC_019918. 1 MQDGRSFTTSATPVLKTL-ALGVIILASFSFKTDRRRLTSLIKRVEALDG------------------------------ 49 (180) Q Consensus 1 ~~~~~~~~~~~~~~~~~~-~~~~~M~~~v~~k~~~~~l~~l~~~l~~l~~------------------------------ 49 (180) |-| |.-.---.++.+ -|+.|+...-..+.+..+-+-+.++|....+ T Consensus 1 M~~---~~~~l~~~~~~vekl~~~lt~eqkakITkAGAkv~~~~L~~~t~~kHy~~k~t~~~~HLaDsI~~~~~niDg~~ 77 (168) T protein:vir:74 1 MAT---FEEAMQLIINQAESLSTKMTVEDKAEVTKAGAKVFEQALAYEVRNRHYRHRDTGEDPHLADSIVMKNKNIDGVK 77 (168) T ss_pred Ccc---HHHHHHHHHHHHHhhccCCCHHHHHHHHHhhhHHHHHHHHHHhHHhhcccCCCcccchhhhheeecccccCccc Confidence 322 222222233333 1233443322334445554555555543321 Q ss_pred -CEEEEEecccccCCCCCCCCC-HHHHHHHHhcCC------------------CCCCCCcchhhHHHHHHHHHHHHHHHH Q lcl|NC_019918. 50 -TTVEVGFFPEDRYGSENGNLP-VAQVAAYNEFGT------------------TRNPTRPFMAPTFEEFTSQFHYARLMK 109 (180) Q Consensus 50 -~~V~VGi~~~~~~~~~~~G~~-vA~iA~i~EfGt------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~ 109 (180) -...|||.. ..++|+. =|+||.|.+-|+ +.||.=.|+..+-++...+.++.+... T Consensus 78 dG~s~VGf~~-----k~~~~~~~kA~iAr~lNDGTk~~~~~~~~~~~~~~~g~v~i~gDHFvd~~r~~~~~k~~V~~Ae~ 152 (168) T protein:vir:74 78 DGQSVVGWER-----STEKGTHTKGYIANIINNGSRFPQFTTRSGRKYKKPGEVAVHADHFIEETRMNLIVQQGILKAEA 152 (168) T ss_pred CCceeecccc-----cccccccchhhhhhhhcccccccccccccccccccccccccccchhHHHHHhhhhhHHHHHHHHH Confidence 234566642 1234433 489999999998 368999999888766444555555555 Q ss_pred HHHHHHHhCCCCHHHHH Q lcl|NC_019918. 110 STFENVIRDGRQVNTLL 126 (180) Q Consensus 110 ~~~~~~l~g~~~~~~~L 126 (180) ..+.+++....--.. | T Consensus 153 ~~y~eIl~~k~~~~~-~ 168 (168) T protein:vir:74 153 EAMRKIINRKKKENN-L 168 (168) T ss_pred HHHHHHHHhhcCCCC-C Confidence 555665543211111 1 No 169 >protein:vir:106506 Length: 137 # NCBI annotation: Pas21 # Family: family:all:1084 # MgeID: mge:1680 # MgeName: phiAsp2 # Cross-refs: genbank:acc:YP_024807;genbank:gi:48697422;genbank:GeneID:2846163 Probab=49.21 E-value=0.14 Score=25.34 Aligned_cols=59 Identities=14% Similarity=0.138 Sum_probs=29.9 Q ss_pred CCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCcHHHHHhcCCCCCch Q lcl|NC_019918. 84 RNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLF 163 (180) Q Consensus 84 ~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~~~pPnsp~Ti~~KG~~~PLI 163 (180) -|-+|+-|... ++..++ . ..++++++.++...+...|. | -| + T Consensus 1 ~~~~~~~l~~~--------~l~~~~----~------~~~~~~~~~~a~~ve~~ak~--------~-----------aP-v 42 (137) T protein:vir:10 1 MVAHTLRIERA--------QLHGLG----M------DEARKAVNRVVRRTFTRSQI--------L-----------AP-V 42 (137) T ss_pred CcccccccChh--------hHhhHH----H------HHHHHHHHHHHHHHHHHHHh--------c-----------CC-c Confidence 22222222111 111111 1 13345566666665555543 1 12 8 Q ss_pred hHHHHHhhhhhheeccC Q lcl|NC_019918. 164 HTGKMLESVKFQIHRRQ 180 (180) Q Consensus 164 DTG~L~~SIty~V~~k~ 180 (180) |||+|++||.+.+.+.. T Consensus 43 ~TG~Lr~SI~~~~~~~~ 59 (137) T protein:vir:10 43 DTGYLRASGRLVLGRER 59 (137) T ss_pred Cchhhhccceeeeeecc Confidence 99999999999886543 No 170 >protein:vir:80116 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425608;genbank:gi:155042941;genbank:GeneID:5469542 Probab=46.10 E-value=0.091 Score=26.28 Aligned_cols=91 Identities=16% Similarity=0.291 Sum_probs=40.7 Q ss_pred ecccceeeeh----HHHHHH--------HHHHHHHhhC---CEEEEEecccc--cCCCC---------CCCC-----CHH Q lcl|NC_019918. 24 ILASFSFKTD----RRRLTS--------LIKRVEALDG---TTVEVGFFPED--RYGSE---------NGNL-----PVA 72 (180) Q Consensus 24 M~~~v~~k~~----~~~l~~--------l~~~l~~l~~---~~V~VGi~~~~--~~~~~---------~~G~-----~vA 72 (180) |. +|++-.- .+.|.+ +.+.++...+ ..++-.+.+.+ ..+.+ .++. +.- T Consensus 1 M~-~i~id~La~~I~~~L~~y~~~v~~~v~~~v~evak~a~~~lkk~i~~tsPkrTG~YaK~W~~k~~~~~~~v~nk~~y 79 (127) T protein:vir:80 1 MA-NIKIDRLGDEITRQLKRYSQVIAGDLEQIMDDVSKEAVDRLKAKIEEEGLVQTGDYKRGWTRKRTPGGWVIHNKTEY 79 (127) T ss_pred Cc-cccHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCccccccccccceeeeccCceeEeecCCc Confidence 22 2333211 111111 1111110000 01111111111 00000 0111 123 Q ss_pred HHHHHHhcCCC-----CCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhCCCC Q lcl|NC_019918. 73 QVAAYNEFGTT-----RNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRDGRQ 121 (180) Q Consensus 73 ~iA~i~EfGt~-----~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~ 121 (180) +|+-+.|||.- ..++|||++|..+. ..+.+.+.+.+.+.|+.- T Consensus 80 qLtHLLE~GHAkr~GGRV~a~pHI~paee~------~~~~l~~~i~~~l~~~~~ 127 (127) T protein:vir:80 80 RLAHLLEYGHATVDGGRVPETPHIRPVEDW------LEKEFEDRVERAIKNESR 127 (127) T ss_pred ceeehhhcceeccCCcccCCccchhhHHHH------HHHHHHHHHHHHhcCCCC Confidence 58899999953 69999999999754 345577778888877644 No 171 >protein:vir:79638 Length: 146 # NCBI annotation: gp40 # Family: family:all:448 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285529;genbank:gi:148734512;genbank:GeneID:5219996 Probab=45.48 E-value=0.68 Score=21.49 Aligned_cols=102 Identities=11% Similarity=0.088 Sum_probs=42.4 Q ss_pred CCCCcccccchhhHHHHHhhhheecccceeeehHHHHHHHHHHHHHhhCCEEEEEecccccCCCCCCCCCHHHHHHHHhc Q lcl|NC_019918. 1 MQDGRSFTTSATPVLKTLALGVIILASFSFKTDRRRLTSLIKRVEALDGTTVEVGFFPEDRYGSENGNLPVAQVAAYNEF 80 (180) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~M~~~v~~k~~~~~l~~l~~~l~~l~~~~V~VGi~~~~~~~~~~~G~~vA~iA~i~Ef 80 (180) ...|| |..|-..-+-++.-+..-..-=+=..+.......+..+..-.+ --.+-++ .+++-+|...|| T Consensus 45 VDTGr-~Ranw~vs~~~~~~~~~~~~dp~G~~t~~~~~~~i~~~~~g~~-~~~~iyi-----------~NnlpYA~~LEy 111 (146) T protein:vir:79 45 VDTGR-FKANMQITANKPPLYALNQYDPDGEKIKAEGRRTLYALLHGGG-AIKSIYF-----------SNMLIYANALEY 111 (146) T ss_pred Ccchh-hccccceeecCcccccccCCCCCCcccHHHHHHHHHHHHhccc-ccceeEE-----------eeCchhhhhhhc Confidence 33333 2111111111111111100000000001111111111111000 0011111 134668899999 Q ss_pred CCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHH Q lcl|NC_019918. 81 GTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRDGRQVNTLL 126 (180) Q Consensus 81 Gt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L 126 (180) |+.+-.|+.|.|.++. .|.+++++++.++-.- -+| T Consensus 112 G~S~QAP~G~v~~~~~------~~~~~v~~a~~e~k~~-----~~l 146 (146) T protein:vir:79 112 GHSKQAPAGVFGIVAI------RLRSYMAEAIREARKK-----NAL 146 (146) T ss_pred cccCCCcchHHHHHHH------HHHHHHHHHHHHHHhh-----ccC Confidence 9999999999999984 3777777777765332 112 No 172 >protein:vir:97190 Length: 148 # NCBI annotation: hypothetical protein ORF030 # Family: family:all:448 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294538;genbank:gi:149408259;genbank:GeneID:5237055 Probab=42.08 E-value=0.9 Score=20.84 Aligned_cols=107 Identities=12% Similarity=0.066 Sum_probs=52.1 Q ss_pred CCCCcccccchhhHHHHHhhhheecccceeeehHHHHHHHHHHHHH---------hhCCEEEEEeccccc---CCCCCCC Q lcl|NC_019918. 1 MQDGRSFTTSATPVLKTLALGVIILASFSFKTDRRRLTSLIKRVEA---------LDGTTVEVGFFPEDR---YGSENGN 68 (180) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~M~~~v~~k~~~~~l~~l~~~l~~---------l~~~~V~VGi~~~~~---~~~~~~G 68 (180) |-+=+||...-+-..+..-=. +...+ + .---+++.+|.. -++..|.+|-+.... +.+...| T Consensus 1 m~~~~sFa~~i~~~~~~ve~~--~~~~~--r---~~a~~i~~~vv~~sPVdTGrfRanw~vs~~~p~~~~~~~~dp~~~G 73 (148) T protein:vir:97 1 MPSLSEFSRRITLRGRKVAEG--ADALT--R---KVALAADQAVVSGTPVDTGRARSNWIAAIGSAPSSVIDAYSPGEAG 73 (148) T ss_pred CCccchhcccHHHHHHHHHHH--HHHHH--H---HHHHHHHHHHHHhCCCcchhhhhhhheeecccccccccccCCCCCC Confidence 888788876655442221100 00000 0 000011122211 134456666553221 1111112 Q ss_pred C-----------------------------CHHHHHHHHhcCCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhCC Q lcl|NC_019918. 69 L-----------------------------PVAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRDG 119 (180) Q Consensus 69 ~-----------------------------~vA~iA~i~EfGt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g~ 119 (180) . +++-+|.-.|||+..-.|+.|.|.++. .|.+.+++ .++.++. T Consensus 74 ~~~~~~~~~~i~~~~~vi~~~k~g~~iyi~NnlpYA~~LEyG~S~QAP~G~v~~t~~------~~~~~v~~--~~~~~~~ 145 (148) T protein:vir:97 74 STEAANTQAAIDQAESVIRGYNYGEEIHITNNLPYIQRLNDGYSAQAPANFVEQAVL------EAVQVVQF--GRVVDGD 145 (148) T ss_pred cccccchhHHHHHHHHHhhccCCCceEEEeecchhhhHhhccccCCCcchHHHHHHH------HHHHHHHh--hhhhcCC Confidence 1 345688889999999999999999884 36666655 2344443 Q ss_pred CCH Q lcl|NC_019918. 120 RQV 122 (180) Q Consensus 120 ~~~ 122 (180) -.. T Consensus 146 ~~~ 148 (148) T protein:vir:97 146 PGS 148 (148) T ss_pred CCC Confidence 211 No 173 >protein:vir:6246 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813700;swissprot:trembl:q859b7;genbank:gi:29366760;uniprot:Q859B7;genbank:GeneID:1258903 Probab=41.71 E-value=0.12 Score=25.59 Aligned_cols=109 Identities=17% Similarity=0.080 Sum_probs=49.3 Q ss_pred CCCCcccccchhhHHHHHhhhheecccceeeehHHHHHHHHHHHHHhhCCEEEEEecccc--------------cCCCC- Q lcl|NC_019918. 1 MQDGRSFTTSATPVLKTLALGVIILASFSFKTDRRRLTSLIKRVEALDGTTVEVGFFPED--------------RYGSE- 65 (180) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~M~~~v~~k~~~~~l~~l~~~l~~l~~~~V~VGi~~~~--------------~~~~~- 65 (180) |.| ++.++----=|+- ..-||...+..- +.+.|+.-++....|+++.-. .|+.. T Consensus 1 ma~-~~~~~vrV~Glr~--f~~~mrK~~g~d--------l~k~lk~a~~~aa~v~~~~ar~~tP~g~r~~~~s~~~r~G~ 69 (143) T protein:vir:62 1 MAQ-RSAYTIRVDGLRE--FQRNVRTLRDKE--------LNKAVREANKASGEVLIPQAKHESPDGKRDAKSSKKYRPGK 69 (143) T ss_pred CCc-ccchheehHHHHH--HHHHHHHhhCCc--------hhHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccCcch Confidence 433 2222210000110 111333221110 011111111112222222111 00000 Q ss_pred ----------------CCCC-CHHHHHHHHhcCCC--CCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHH Q lcl|NC_019918. 66 ----------------NGNL-PVAQVAAYNEFGTT--RNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRDGRQVNTLL 126 (180) Q Consensus 66 ----------------~~G~-~vA~iA~i~EfGt~--~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L 126 (180) .-|- .-.-+|.+-+||++ +|-++-|+..++... ++.|.+.+++.|.++++-.. T Consensus 70 L~~Sir~aaT~raa~VrAG~~krVPYA~~I~~G~r~r~Isp~rFl~~a~a~t--e~~~~r~Ye~~i~~vl~k~l------ 141 (143) T protein:vir:62 70 LDKSIKVTASAKGAVIKAGSASRVPYAAAIHFGYRARNISPNRFLFRAMARK--SDVVAATYERRIAAVVEKYL------ 141 (143) T ss_pred hhccccccccccceeeeeCCcCCCCcccccccCcccccccchhhhhhhhhcc--CHHHHHHHHHHHHHHHHHHh------ Confidence 0011 12236778899997 799999999999775 46799999999998876542 Q ss_pred HH Q lcl|NC_019918. 127 KK 128 (180) Q Consensus 127 ~~ 128 (180) +. T Consensus 142 ~s 143 (143) T protein:vir:62 142 ES 143 (143) T ss_pred cC Confidence 22 No 174 >protein:vir:1332 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047931;swissprot:trembl:q9zxa7;genbank:gi:9631149;uniprot:Q9ZXA7;genbank:GeneID:2715891 Probab=39.24 E-value=0.14 Score=25.28 Aligned_cols=109 Identities=18% Similarity=0.097 Sum_probs=48.8 Q ss_pred CCCCcccccchhhHHHHHhhhheecccceeeehHHHHHHHHHHHHHhhCCEEEEEecccc--------------cCCCC- Q lcl|NC_019918. 1 MQDGRSFTTSATPVLKTLALGVIILASFSFKTDRRRLTSLIKRVEALDGTTVEVGFFPED--------------RYGSE- 65 (180) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~M~~~v~~k~~~~~l~~l~~~l~~l~~~~V~VGi~~~~--------------~~~~~- 65 (180) |.| ++.++----=|+- ..-||...+..- +.+.|+.-++....|+++.-. .|+.. T Consensus 1 ma~-~~~~~vkV~Glr~--f~~~mrK~~g~d--------l~k~lk~a~~~aa~v~~~~ar~~tP~g~~~p~~srr~r~G~ 69 (143) T protein:vir:13 1 MAQ-RSAYTIQVDGLRQ--FQRNVRALRDKE--------LNKAVREANKASGEVLIPQAKHESPDGHRDPKSSKRYRPGK 69 (143) T ss_pred CCc-ccchheehHHHHH--HHHHHHHhhCCc--------chHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccch Confidence 433 2222211000111 112333221110 011111111111222222111 01000 Q ss_pred ----------------CCC-CCHHHHHHHHhcCCC--CCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHH Q lcl|NC_019918. 66 ----------------NGN-LPVAQVAAYNEFGTT--RNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRDGRQVNTLL 126 (180) Q Consensus 66 ----------------~~G-~~vA~iA~i~EfGt~--~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L 126 (180) .-| ..-.-+|.+-+||++ +|-++-|+..++... ++.|.+.+++.|.++++-.. T Consensus 70 L~~Sir~aaT~raa~VrAGr~arVPYA~~I~~G~r~r~Is~~rFl~~a~a~t--e~~~~r~Ye~~i~~vl~k~l------ 141 (143) T protein:vir:13 70 LDKSIKVTASAKGAVIKAGSAARVPYAAAIHFGYRKRNISANRFLYRAMARK--SDVVAATYERRIAAVVEKYL------ 141 (143) T ss_pred hhccccccccccceeeeecCcCCCCcccccccCCcccccchhhhhhhhhhcc--CHHHHHHHHHHHHHHHHHHh------ Confidence 001 000126778899997 799999999999775 46799999999998876542 Q ss_pred HH Q lcl|NC_019918. 127 KK 128 (180) Q Consensus 127 ~~ 128 (180) +. T Consensus 142 ~s 143 (143) T protein:vir:13 142 ES 143 (143) T ss_pred cC Confidence 22 No 175 >protein:vir:1028 Length: 168 # NCBI annotation: Orf48 # Family: family:all:1029 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076682;genbank:gi:13095791;genbank:GeneID:920342 Probab=39.18 E-value=0.25 Score=23.90 Aligned_cols=100 Identities=13% Similarity=0.107 Sum_probs=57.3 Q ss_pred CCCCcccccchhhHHHHHhhhheecccceeeehHHHHHHHHHHHHHhhCCEEEEEecccccCCCCCCCCC-HHHHHHHHh Q lcl|NC_019918. 1 MQDGRSFTTSATPVLKTLALGVIILASFSFKTDRRRLTSLIKRVEALDGTTVEVGFFPEDRYGSENGNLP-VAQVAAYNE 79 (180) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~M~~~v~~k~~~~~l~~l~~~l~~l~~~~V~VGi~~~~~~~~~~~G~~-vA~iA~i~E 79 (180) --++| -|-....|+=+|-+.+. +| .....-...|||... .+.|+. -|+||.|.+ T Consensus 50 Hy~~k-----~t~~~~HLaDsI~~~~~-ni--------------Dg~~dG~s~VGf~~k-----~~~~~~~ka~iAr~lN 104 (168) T protein:vir:10 50 HYRHR-----DTGEDPHLADSIVMKNK-NI--------------DGVKDGQSVVGWERS-----TEKGTHTKGYIANIIN 104 (168) T ss_pred hhccC-----CCCccchhhhhheeccc-cc--------------ccccCCceeecccCc-----cccccccchheeeecc Confidence 11222 12333456666655533 22 123456789999733 235555 699999999 Q ss_pred cCC------------------CCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHH Q lcl|NC_019918. 80 FGT------------------TRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRDGRQVNTLL 126 (180) Q Consensus 80 fGt------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L 126 (180) -|+ +.||.=.|+..+-.+...+.++.+.....+.+++....--.. | T Consensus 105 DGTk~~~~~~~~~~~~~~~g~v~i~gDHFvd~~r~d~a~k~~V~~Ae~~~y~eIl~~k~~~~~-~ 168 (168) T protein:vir:10 105 NGSRFPQFTTRSGRKYKKPGEVAVHADHFIEETRKNPIVQQGILKAEAEAMRKIINRKKKESN-L 168 (168) T ss_pred ccccccccccccccccccccccccccchhHHHhhhchhhhHHHHHHHHHHHHHHHHhhcCCCC-C Confidence 998 369999999888766444555555555556666544211111 1 No 176 >protein:vir:107703 Length: 147 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003902;genbank:gi:45686318;genbank:GeneID:2773043 Probab=33.63 E-value=1.1 Score=20.32 Aligned_cols=101 Identities=12% Similarity=0.059 Sum_probs=43.2 Q ss_pred CCCCcccccchhhHHHHHhhhheecccceeeehHHHHHHHHHHH-HHhh-CCEEEEEecccccCCCCCCCCCHHHHHHHH Q lcl|NC_019918. 1 MQDGRSFTTSATPVLKTLALGVIILASFSFKTDRRRLTSLIKRV-EALD-GTTVEVGFFPEDRYGSENGNLPVAQVAAYN 78 (180) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~M~~~v~~k~~~~~l~~l~~~l-~~l~-~~~V~VGi~~~~~~~~~~~G~~vA~iA~i~ 78 (180) ...|| |..|-..-+-++..++.-..-=+=..+.......+..+ .... ...+.++ +++.+|... T Consensus 45 VdTGr-~Ranw~vs~~~~~~~~~~~~dp~g~~t~a~~~~~~~~~~~~~~~~~~iyi~--------------Nn~pYA~~L 109 (147) T protein:vir:10 45 VDTGR-FKGNWQITFNEIPNHALNRYDKTGGVVRGEEQAKTYGMFSRGGAITSVHFS--------------NMLIYANAL 109 (147) T ss_pred Ccchh-hccccceeecCccccccCCcCCCccchhhhhhHHHHHHhhhccCcceEEEe--------------eCcchhhhh Confidence 22232 22221111111111111110000000000000001111 0000 1111221 246688999 Q ss_pred hcCCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHH Q lcl|NC_019918. 79 EFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRDGRQVNTLL 126 (180) Q Consensus 79 EfGt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L 126 (180) |||+.+-+|+.|.|.++. .|.+++++++.++-+-+ .+| T Consensus 110 EyG~S~QAP~G~V~~t~q------~~~~~v~~~~~e~k~~~----~~~ 147 (147) T protein:vir:10 110 EYGHSQQAPSGVVGLVAL------RLRSYMADAIKQARRQQ----NAL 147 (147) T ss_pred hccccCCCCchHHHHHHH------HHHHHHHHHHHHHHhhh----ccC Confidence 999999999999998874 37778888887765432 223 No 177 >protein:vir:4460 Length: 170 # NCBI annotation: hypothetical protein # Family: family:all:2152 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700383;genbank:gi:23505455;genbank:GeneID:955662 Probab=27.18 E-value=0.56 Score=21.97 Aligned_cols=73 Identities=14% Similarity=0.313 Sum_probs=43.9 Q ss_pred CCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHH-hcCCCCCcHHHHHhcCCCCCch Q lcl|NC_019918. 85 NPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNI-DDYPGSNSPAWAAYKGFNDPLF 163 (180) Q Consensus 85 IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I-~~~~pPnsp~Ti~~KG~~~PLI 163 (180) .+--+||.--|++.+ .+.-+..-...++..+|+.-..+.|.-+ ..|.- +-.++|-. T Consensus 1 M~~~~~lHvdF~qp~--------------~~~Fnr~r~RraF~~iGq~h~r~Arrlvm~RGrs---------~pGe~P~~ 57 (170) T protein:vir:44 1 MPQKAYLHVDFVQPE--------------ELVFNRARMRRAFVKIGQVHMRDARRLVMKRGRS---------KPGENPSY 57 (170) T ss_pred CCCCceeEEeeecCC--------------ceeecHHHHHHHHHHHhHHHHHHHHHHHHHhcCC---------CCCCCCcc Confidence 444445543343321 1111222345668888988888887544 22210 01247999 Q ss_pred hHHHHHhhhhhheeccC Q lcl|NC_019918. 164 HTGKMLESVKFQIHRRQ 180 (180) Q Consensus 164 DTG~L~~SIty~V~~k~ 180 (180) -||.|..||.|.|-+.- T Consensus 58 ~TGrLa~SIgy~Vpras 74 (170) T protein:vir:44 58 RTGQLARSIGYYVPRAS 74 (170) T ss_pred hhhhhhhhhhhcccccc Confidence 99999999999998874 No 178 >protein:vir:9647 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795409;genbank:gi:28876182;genbank:GeneID:1257731 Probab=27.07 E-value=0.7 Score=21.42 Aligned_cols=70 Identities=21% Similarity=0.231 Sum_probs=39.8 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCcHHHHHhcCCCCCchhHHHHHhh Q lcl|NC_019918. 92 APTFEEFTSQFHYARLMKSTFENVIRDGRQVNTLLKKLGRMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLES 171 (180) Q Consensus 92 r~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~iG~~~~~~Iq~~I~~~~pPnsp~Ti~~KG~~~PLIDTG~L~~S 171 (180) -.++.+..+-+++-+.+++-+-..-- +.-.+.||...|+..+..+|..+. ..-|||.+.++ T Consensus 1 ~~~~aevkGv~Eilk~lE~klG~~~v-~ri~nkAL~~~ge~v~~~lK~~~~------------------~f~DTG~t~de 61 (132) T protein:vir:96 1 MSGFANLKGVEELLANMEKKLGPAKV-NRVVNRSLKEIGKELEPSFKSAIS------------------IYKRTGETTES 61 (132) T ss_pred CCccccccCHHHHHHHHHHhhCHHHH-HHHhHHHHHHHHHHHHHHHHHhhh------------------hhhhcchhhcc Confidence 23444444544444445443332111 224578999999999999999884 34477776666 Q ss_pred hhhheeccC Q lcl|NC_019918. 172 VKFQIHRRQ 180 (180) Q Consensus 172 Ity~V~~k~ 180 (180) |...=...+ T Consensus 62 v~~s~~~~~ 70 (132) T protein:vir:96 62 AVVSGVRRE 70 (132) T ss_pred eeecCeeec Confidence 543222222 No 179 >protein:vir:94944 Length: 121 # NCBI annotation: hypothetical protein phage protein # Family: family:all:448 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239282;genbank:gi:66392064;genbank:GeneID:5076589 Probab=25.67 E-value=0.52 Score=22.14 Aligned_cols=83 Identities=14% Similarity=0.056 Sum_probs=36.3 Q ss_pred HHhhhheecccceeeehHHHHHHHHH-----HHHHh------------hCCEEEEEecccc-cCCCCCCC---------- Q lcl|NC_019918. 17 TLALGVIILASFSFKTDRRRLTSLIK-----RVEAL------------DGTTVEVGFFPED-RYGSENGN---------- 68 (180) Q Consensus 17 ~~~~~~~M~~~v~~k~~~~~l~~l~~-----~l~~l------------~~~~V~VGi~~~~-~~~~~~~G---------- 68 (180) -|+|+-+....--.+.....++...+ .+..+ ++..|.+|-+... ......+| T Consensus 1 ~~~~sf~~~i~~~~~~ve~~~~~~~r~~~~~~~~~vv~~sPVdtGrfRanw~vs~~~p~~~~~~~~dp~g~~t~~~~~~~ 80 (121) T protein:vir:94 1 MISMKFNVNLSRLRSNLREEAKKKAIRIAQEIVNGVIARSPVLAGDYRSSWNVSEGSMEFKFNNGGNPANPTPAPAIVVS 80 (121) T ss_pred CccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCchhhhhccccccccCcccccCCCCCCCcchhHHHHHHH Confidence 22222222111111111111222211 11111 2334555444311 11111112 Q ss_pred ----------CCHHHHHHHHhcCCCCCCCCcchhhHHHHHH Q lcl|NC_019918. 69 ----------LPVAQVAAYNEFGTTRNPTRPFMAPTFEEFT 99 (180) Q Consensus 69 ----------~~vA~iA~i~EfGt~~IP~RpFlr~~~~~~~ 99 (180) .+++-+|...|||+..-+|+.|.|.++.+.. T Consensus 81 ~~~~~~~iyi~NnlpYA~~LE~G~S~QAP~G~v~~t~~~~q 121 (121) T protein:vir:94 81 SNVALPHFYITNGAPYAQQLEKGSSTQAPLGIVRVTLASLR 121 (121) T ss_pred HhhccceEEEeeCcchhhhhhcccCCCCcchHHHHHHHhhC Confidence 1334578889999999999999999986543 No 180 >protein:vir:80425 Length: 134 # NCBI annotation: BcepGomrgp15 # Family: family:all:448 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210235;genbank:gi:146329927;genbank:GeneID:5123534 Probab=22.48 E-value=1.8 Score=19.15 Aligned_cols=96 Identities=13% Similarity=0.128 Sum_probs=39.1 Q ss_pred CCCCcccccchhhHHHHHhhhheecccceeeehHHHHHHHHHHHHHhhCCEEEEEecccccCCCCCCCCCHHHHHHHHhc Q lcl|NC_019918. 1 MQDGRSFTTSATPVLKTLALGVIILASFSFKTDRRRLTSLIKRVEALDGTTVEVGFFPEDRYGSENGNLPVAQVAAYNEF 80 (180) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~M~~~v~~k~~~~~l~~l~~~l~~l~~~~V~VGi~~~~~~~~~~~G~~vA~iA~i~Ef 80 (180) ...|| |..|-..-+-++..+..-..--+=+.....+..+...+..+.-. .+-++ .+++-+|...|| T Consensus 39 VdTGr-~Ranw~vs~~~~~~~~~~~~d~~g~~~~~~~~~~~~vi~~~k~g--~~iyi-----------~Nn~pYA~~LEy 104 (134) T protein:vir:80 39 ILTGQ-ARRNWQTELNQMPESVLDIPESPSEGMDEALQVLQQTVGQYKAG--DTVHI-----------TNNAPYIKELNS 104 (134) T ss_pred Ccchh-hhcccceeecCcccccccCcCCCCccchhhHHHHHHHHhhccCc--ceEEE-----------eeCchhhhhhhc Confidence 22232 22222111112222211111000000011222222222222110 11111 134668999999 Q ss_pred CCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019918. 81 GTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIR 117 (180) Q Consensus 81 Gt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~ 117 (180) |+..-+|+.|.|-++.+ |.+.+++ ++-+-. T Consensus 105 G~S~QAP~G~v~~t~~~------~~~~v~~-~~~~~~ 134 (134) T protein:vir:80 105 GSSQQAPANFVETSIMR------ATRLIRN-VKVVPQ 134 (134) T ss_pred cccCCCcchHHHHHHHH------HHHHHHh-hccCCC Confidence 99999999999988743 5555544 222221 No 181 >protein:vir:96774 Length: 152 # NCBI annotation: hypothetical phage protein # Family: family:all:448 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224253;genbank:gi:62362388;genbank:GeneID:3345713 Probab=22.02 E-value=1.9 Score=19.06 Aligned_cols=91 Identities=9% Similarity=0.182 Sum_probs=41.2 Q ss_pred CC-----CCcccccchhhHHHHHhhhheecccceeeehHHHHHHHHHHHHHhhCCEEEEEecccccCCCCCCCCCHHHHH Q lcl|NC_019918. 1 MQ-----DGRSFTTSATPVLKTLALGVIILASFSFKTDRRRLTSLIKRVEALDGTTVEVGFFPEDRYGSENGNLPVAQVA 75 (180) Q Consensus 1 ~~-----~~~~~~~~~~~~~~~~~~~~~M~~~v~~k~~~~~l~~l~~~l~~l~~~~V~VGi~~~~~~~~~~~G~~vA~iA 75 (180) +- |+-.|..|-.--+.+++.+. ...+.-...+..+...+..+. +| +.-|= .+++-+| T Consensus 57 ~~a~~~ydtGrfRanw~vS~~~p~~~~-----~~~~~~~~t~~~~~~~i~~~~-----~g---~~iyi-----~NnlPYA 118 (152) T protein:vir:96 57 QPAPNYYRAGSYRSNHRVSISKITSFE-----KGISSQSSIMMDLQSDIAKFK-----IG---ETLFM-----TNPLPYA 118 (152) T ss_pred cccccccchhhhhhhheeeecCCCccc-----ccCCCCCchHHHHHHHHhhcc-----cc---ceEEE-----eeCchhh Confidence 10 33334333322222222110 000000111222222222211 11 11110 2245688 Q ss_pred HHHhcCCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhCC Q lcl|NC_019918. 76 AYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVIRDG 119 (180) Q Consensus 76 ~i~EfGt~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~l~g~ 119 (180) .-.|||+.+-+|.-|.|.++. .|.+++++++++ + T Consensus 119 ~~LEyG~S~QAP~G~vr~t~~------~~~~~v~ea~~~----~ 152 (152) T protein:vir:96 119 TSIEYGHSSQAPNGVYRPAVR------RLVKFLNTELKA----K 152 (152) T ss_pred hHhhccccCCCCchHHHHHHH------HHHHHHHHHhcc----C Confidence 999999999999999999884 366666665554 3 Done!