Query lcl|NC_021331.1_cdsid_YP_008059794.1 [gene=M186_gp72] [protein=hypothetical protein] [protein_id=YP_008059794.1] [location=33587..34030]
Match_columns 147
No_of_seqs 104 out of 134
Neff 6.2
Searched_HMMs 1612
Date Thu Nov 7 17:31:33 2013
Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_72 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_72_vs_rec_db.hhr

No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 protein:vir:79638 Length: 146 100.0 7.3E-58 4.5E-61 333.8 16.0 146 1-147 1-146 (146)
2 protein:vir:107703 Length: 147 100.0 2.8E-55 1.7E-58 319.7 15.5 146 1-147 1-147 (147)
3 protein:vir:103280 Length: 142 100.0 1.3E-53 8.2E-57 310.5 14.0 142 1-145 1-142 (142)
4 protein:vir:104347 Length: 145 100.0 9.6E-53 5.9E-56 305.8 13.5 144 1-145 1-145 (145)
5 protein:vir:78380 Length: 131 100.0 8.3E-50 5.1E-53 289.7 11.6 131 8-142 1-131 (131)
6 protein:vir:94994 Length: 131 100.0 1.8E-49 1.1E-52 287.9 11.5 131 8-142 1-131 (131)
7 protein:vir:97190 Length: 148 100.0 3.2E-49 2E-52 286.5 11.0 141 1-144 1-148 (148)
8 protein:vir:95157 Length: 144 100.0 1.6E-47 9.9E-51 277.1 13.4 139 1-145 1-144 (144)
9 protein:vir:80425 Length: 134 100.0 2.3E-47 1.4E-50 276.3 10.8 134 8-143 1-134 (134)
10 protein:vir:96774 Length: 152 100.0 2.1E-45 1.3E-48 265.5 12.1 135 1-144 1-152 (152)
11 protein:vir:94944 Length: 121 100.0 1.7E-44 1.1E-47 260.5 10.4 121 1-131 1-121 (121)
12 protein:vir:79034 Length: 141 99.8 3.2E-23 2E-26 143.9 10.3 124 1-147 1-140 (141)
13 protein:vir:105467 Length: 144 99.7 2.4E-20 1.5E-23 128.2 11.3 118 1-147 1-142 (144)
14 protein:vir:9930 Length: 108 # 99.6 1.7E-18 1.1E-21 117.9 11.4 107 5-144 1-108 (108)
15 protein:vir:102963 Length: 163 99.6 3.6E-18 2.2E-21 116.2 9.7 117 1-147 1-159 (163)
16 protein:vir:95789 Length: 114 99.6 7.3E-18 4.5E-21 114.5 11.1 112 1-146 1-114 (114)
17 protein:vir:94654 Length: 142 99.6 9.3E-18 5.7E-21 114.0 11.5 112 1-142 1-142 (142)
18 protein:vir:96121 Length: 137 99.6 9.8E-18 6.1E-21 113.8 10.6 107 1-139 1-137 (137)
19 protein:vir:3617 Length: 112 # 99.6 9.5E-18 5.9E-21 113.9 10.2 108 1-142 1-112 (112)
20 protein:vir:95894 Length: 137 99.6 2E-17 1.2E-20 112.2 10.5 107 1-139 1-137 (137)
21 protein:vir:94538 Length: 125 99.5 3E-17 1.9E-20 111.1 10.8 117 1-147 1-122 (125)
22 protein:vir:94490 Length: 137 99.5 3.7E-17 2.3E-20 110.7 10.5 107 1-139 1-137 (137)
23 protein:vir:97427 Length: 137 99.5 3.7E-17 2.3E-20 110.7 10.5 107 1-139 1-137 (137)
24 protein:vir:93738 Length: 137 99.5 3.7E-17 2.3E-20 110.7 10.5 107 1-139 1-137 (137)
25 protein:vir:96829 Length: 135 99.5 6.5E-17 4E-20 109.3 10.6 107 1-139 1-135 (135)
26 protein:vir:94108 Length: 149 99.5 4.7E-17 2.9E-20 110.1 9.6 107 1-139 13-149 (149)
27 protein:vir:94796 Length: 137 99.5 7.9E-17 4.9E-20 108.9 10.6 107 1-139 1-137 (137)
28 protein:vir:5978 Length: 144 # 99.5 1.2E-16 7.2E-20 107.9 11.5 111 1-143 1-144 (144)
29 protein:vir:102338 Length: 116 99.5 3.9E-17 2.4E-20 110.5 8.0 93 21-147 1-115 (116)
30 protein:vir:107099 Length: 137 99.5 1.4E-16 8.8E-20 107.5 10.4 107 1-139 1-137 (137)
31 protein:vir:743 Length: 108 # 99.5 1.7E-16 1.1E-19 107.0 10.8 104 5-142 1-108 (108)
32 protein:vir:105916 Length: 149 99.5 1.1E-16 6.8E-20 108.1 9.6 107 1-139 13-149 (149)
33 protein:vir:105330 Length: 137 99.5 2.4E-16 1.5E-19 106.2 11.0 107 1-139 1-137 (137)
34 protein:vir:96486 Length: 112 99.5 3.6E-16 2.2E-19 105.3 9.9 108 1-142 1-112 (112)
35 protein:vir:98409 Length: 108 99.4 7.4E-16 4.6E-19 103.5 10.4 104 5-142 1-108 (108)
36 protein:vir:81147 Length: 126 99.4 4.9E-15 3E-18 99.1 10.9 115 1-146 1-126 (126)
37 protein:vir:103917 Length: 115 99.4 6.6E-15 4.1E-18 98.3 11.4 107 3-142 1-115 (115)
38 protein:vir:96225 Length: 115 99.4 6.6E-15 4.1E-18 98.3 11.4 107 3-142 1-115 (115)
39 protein:vir:9312 Length: 115 # 99.4 6.6E-15 4.1E-18 98.3 11.4 107 3-142 1-115 (115)
40 protein:vir:96358 Length: 115 99.4 6.6E-15 4.1E-18 98.3 11.4 107 3-142 1-115 (115)
41 protein:vir:78858 Length: 115 99.4 6.6E-15 4.1E-18 98.3 11.4 107 3-142 1-115 (115)
42 protein:vir:97144 Length: 115 99.4 6.6E-15 4.1E-18 98.3 11.4 107 3-142 1-115 (115)
43 protein:vir:4906 Length: 114 # 99.4 3E-15 1.9E-18 100.2 9.3 108 1-143 1-114 (114)
44 protein:vir:2740 Length: 114 # 99.4 3E-15 1.9E-18 100.2 9.3 108 1-143 1-114 (114)
45 protein:vir:99744 Length: 115 99.3 1.1E-14 6.7E-18 97.2 10.7 107 3-142 1-115 (115)
46 protein:vir:966 Length: 123 # 99.3 1.4E-14 8.5E-18 96.6 10.7 114 1-144 1-123 (123)
47 protein:vir:106623 Length: 115 99.3 3E-14 1.8E-17 94.7 11.2 108 3-143 1-115 (115)
48 protein:vir:78077 Length: 141 99.2 1.3E-13 7.9E-17 91.3 11.4 110 1-146 1-141 (141)
49 protein:vir:1243 Length: 116 # 99.2 8.2E-14 5.1E-17 92.3 7.6 87 21-139 1-116 (116)
50 protein:vir:97327 Length: 116 99.2 8.2E-14 5.1E-17 92.3 7.6 87 21-139 1-116 (116)
51 protein:vir:8669 Length: 142 # 99.2 9.4E-14 5.8E-17 92.0 7.4 111 1-139 1-142 (142)
52 protein:vir:99101 Length: 142 99.2 9.4E-14 5.8E-17 92.0 7.4 111 1-139 1-142 (142)
53 protein:vir:106570 Length: 182 99.1 7.5E-13 4.7E-16 87.0 11.7 117 1-147 1-182 (182)
54 protein:vir:101594 Length: 173 99.1 8.1E-13 5E-16 86.9 10.1 110 1-141 1-173 (173)
55 protein:vir:100075 Length: 140 99.1 1.2E-12 7.4E-16 86.0 10.9 130 1-147 1-136 (140)
56 protein:vir:95062 Length: 116 99.1 2.8E-13 1.7E-16 89.4 7.3 87 21-139 1-116 (116)
57 protein:vir:80362 Length: 140 99.1 1.1E-12 6.8E-16 86.1 8.9 129 1-147 1-136 (140)
58 protein:vir:107568 Length: 146 99.0 4.5E-12 2.8E-15 82.8 11.8 132 1-147 1-144 (146)
59 protein:vir:102875 Length: 146 99.0 4.5E-12 2.8E-15 82.8 11.8 132 1-147 1-144 (146)
60 protein:vir:105007 Length: 146 99.0 4.5E-12 2.8E-15 82.8 11.8 132 1-147 1-144 (146)
61 protein:vir:102085 Length: 146 99.0 4.5E-12 2.8E-15 82.8 11.8 132 1-147 1-144 (146)
62 protein:vir:100243 Length: 140 99.0 1.9E-12 1.2E-15 84.9 9.5 130 1-147 1-140 (140)
63 protein:vir:1437 Length: 140 # 99.0 1.5E-12 9.2E-16 85.4 8.4 130 1-147 1-136 (140)
64 protein:vir:194 Length: 149 # 99.0 4E-12 2.5E-15 83.0 8.5 134 1-144 1-149 (149)
65 protein:vir:80116 Length: 127 98.9 1.4E-11 8.6E-15 80.1 10.5 109 1-144 1-127 (127)
66 protein:vir:107545 Length: 140 98.9 5.8E-12 3.6E-15 82.2 7.2 111 1-146 1-140 (140)
67 protein:vir:97982 Length: 140 98.9 5.8E-12 3.6E-15 82.2 7.2 111 1-146 1-140 (140)
68 protein:vir:93617 Length: 148 98.9 8.4E-12 5.2E-15 81.3 7.9 134 1-144 1-148 (148)
69 protein:vir:99528 Length: 92 # 98.9 3.5E-12 2.2E-15 83.4 5.7 88 1-119 1-92 (92)
70 protein:vir:95372 Length: 124 98.9 2.9E-11 1.8E-14 78.3 10.5 109 1-144 1-124 (124)
71 protein:vir:1273 Length: 127 # 98.9 3.5E-11 2.2E-14 77.9 10.8 117 1-146 1-127 (127)
72 protein:vir:9708 Length: 125 # 98.9 2.7E-11 1.7E-14 78.5 9.9 120 1-147 1-125 (125)
73 protein:vir:106041 Length: 137 98.8 7.6E-12 4.7E-15 81.6 6.2 108 1-146 1-137 (137)
74 protein:vir:1891 Length: 179 # 98.8 2.2E-11 1.4E-14 79.0 7.8 140 1-146 1-179 (179)
75 protein:vir:98342 Length: 125 98.8 1.4E-10 9E-14 74.5 10.2 116 1-143 1-125 (125)
76 protein:vir:4704 Length: 125 # 98.8 1.4E-10 9E-14 74.5 10.2 116 1-143 1-125 (125)
77 protein:vir:9414 Length: 125 # 98.8 1.4E-10 9E-14 74.5 10.2 116 1-143 1-125 (125)
78 protein:vir:79988 Length: 125 98.8 1.4E-10 9E-14 74.5 10.2 116 1-143 1-125 (125)
79 protein:vir:81106 Length: 125 98.8 1.4E-10 9E-14 74.5 10.2 116 1-143 1-125 (125)
80 protein:vir:105089 Length: 133 98.7 2.1E-10 1.3E-13 73.6 10.7 118 1-144 1-133 (133)
81 protein:vir:1386 Length: 149 # 98.7 4.7E-10 2.9E-13 71.7 10.7 127 1-147 1-149 (149)
82 protein:vir:97088 Length: 157 98.7 5.1E-10 3.2E-13 71.5 10.5 122 1-147 1-153 (157)
83 protein:vir:4347 Length: 164 # 98.6 2.2E-10 1.4E-13 73.5 7.3 139 1-146 1-164 (164)
84 protein:vir:3873 Length: 128 # 98.6 1.2E-09 7.3E-13 69.5 11.1 122 1-147 1-127 (128)
85 protein:vir:5745 Length: 135 # 98.6 1.1E-09 6.9E-13 69.7 10.3 120 1-147 1-135 (135)
86 protein:vir:102441 Length: 137 98.5 4.6E-10 2.8E-13 71.8 5.2 116 1-147 1-135 (137)
87 protein:vir:106506 Length: 137 98.3 1E-09 6.2E-13 69.9 4.0 113 1-147 1-132 (137)
88 protein:vir:9879 Length: 127 # 98.1 1E-08 6.4E-12 64.4 5.3 109 1-144 1-127 (127)
89 protein:vir:102154 Length: 119 98.0 5.6E-08 3.5E-11 60.3 8.4 114 1-146 1-119 (119)
90 protein:vir:80970 Length: 112 97.5 1.6E-06 9.8E-10 52.4 8.8 100 1-147 1-112 (112)
91 protein:vir:6246 Length: 143 # 97.5 1.2E-06 7.7E-10 53.0 7.8 115 1-147 1-143 (143)
92 protein:vir:7449 Length: 123 # 97.4 3.3E-06 2.1E-09 50.6 9.4 116 1-147 1-121 (123)
93 protein:vir:96288 Length: 100 97.2 4.4E-06 2.7E-09 49.9 7.8 87 1-138 13-100 (100)
94 protein:vir:1332 Length: 143 # 97.2 4.1E-06 2.6E-09 50.1 7.7 115 1-147 1-143 (143)
95 protein:vir:45 Length: 112 # N 96.9 1.7E-05 1.1E-08 46.7 8.9 100 1-147 1-112 (112)
96 protein:vir:79687 Length: 113 96.9 8.5E-06 5.3E-09 48.4 7.1 97 11-145 1-113 (113)
97 protein:vir:4956 Length: 153 # 96.9 2.2E-05 1.4E-08 46.1 9.1 121 1-147 1-140 (153)
98 protein:vir:98892 Length: 108 96.8 2.1E-05 1.3E-08 46.2 8.4 101 1-143 1-108 (108)
99 protein:vir:101508 Length: 120 96.7 3.7E-05 2.3E-08 44.9 9.3 115 1-146 1-120 (120)
100 protein:vir:7993 Length: 108 # 96.4 3.6E-06 2.3E-09 50.4 1.8 101 1-140 1-108 (108)
101 protein:vir:100887 Length: 139 96.4 6E-05 3.7E-08 43.7 8.3 119 1-147 3-135 (139)
102 protein:vir:4200 Length: 133 # 96.3 2E-05 1.3E-08 46.3 5.7 96 1-133 1-133 (133)
103 protein:vir:8106 Length: 150 # 96.3 7.5E-06 4.6E-09 48.7 2.9 111 1-147 5-136 (150)
104 protein:vir:4790 Length: 114 # 95.9 0.00021 1.3E-07 40.8 9.3 102 1-147 1-114 (114)
105 protein:vir:4162 Length: 133 # 95.9 4.7E-05 2.9E-08 44.3 5.5 96 1-133 1-133 (133)
106 protein:vir:1581 Length: 116 # 95.9 0.00011 7E-08 42.2 7.6 100 1-139 1-116 (116)
107 protein:vir:5000 Length: 141 # 95.7 0.00029 1.8E-07 40.0 9.3 119 1-147 1-139 (141)
108 protein:vir:3036 Length: 118 # 95.7 9.8E-05 6.1E-08 42.6 6.5 96 1-147 1-117 (118)
109 protein:vir:9823 Length: 118 # 95.7 9.8E-05 6.1E-08 42.6 6.5 96 1-147 1-117 (118)
110 protein:vir:100223 Length: 139 95.5 0.00027 1.7E-07 40.1 8.5 112 5-147 1-135 (139)
111 protein:vir:79225 Length: 155 95.5 0.00024 1.5E-07 40.4 8.2 134 1-144 1-155 (155)
112 protein:vir:99196 Length: 155 95.5 0.00022 1.4E-07 40.6 7.8 134 1-142 1-155 (155)
113 protein:vir:3163 Length: 145 # 95.5 0.00014 8.4E-08 41.8 6.7 130 1-147 1-145 (145)
114 protein:vir:103841 Length: 155 95.3 0.00022 1.3E-07 40.7 7.3 135 1-144 1-155 (155)
115 protein:vir:107851 Length: 175 94.9 0.0006 3.7E-07 38.2 8.5 137 1-144 1-175 (175)
116 protein:vir:99833 Length: 190 94.8 0.0013 8.4E-07 36.3 10.2 135 1-146 1-190 (190)
117 protein:vir:79091 Length: 175 94.7 0.00066 4.1E-07 38.0 8.2 132 1-144 1-175 (175)
118 protein:vir:4833 Length: 140 # 94.7 0.00086 5.3E-07 37.4 8.8 115 1-147 1-139 (140)
119 protein:vir:4859 Length: 140 # 94.3 0.0013 7.9E-07 36.4 8.8 121 1-147 1-139 (140)
120 protein:vir:81067 Length: 119 91.4 0.00071 4.4E-07 37.9 3.2 84 36-147 1-113 (119)
121 protein:vir:10367 Length: 119 91.0 0.00081 5E-07 37.5 3.2 84 36-147 1-113 (119)
122 protein:vir:100652 Length: 134 90.3 0.012 7.3E-06 31.2 8.9 116 1-145 1-134 (134)
123 protein:vir:102190 Length: 93 89.0 0.0059 3.7E-06 32.8 6.2 91 25-146 1-93 (93)
124 protein:vir:78894 Length: 105 88.9 0.0017 1E-06 35.8 3.1 100 5-140 1-105 (105)
125 protein:vir:1988 Length: 156 # 87.7 0.034 2.1E-05 28.6 9.5 135 1-144 1-156 (156)
126 protein:vir:3848 Length: 159 # 87.5 0.036 2.2E-05 28.5 9.5 116 1-147 2-154 (159)
127 protein:vir:9513 Length: 134 # 86.8 0.029 1.8E-05 29.0 8.6 116 1-145 1-134 (134)
128 protein:vir:101302 Length: 134 86.8 0.029 1.8E-05 29.0 8.6 116 1-145 1-134 (134)
129 protein:vir:105773 Length: 131 83.5 0.029 1.8E-05 29.0 7.0 112 1-143 1-131 (131)
130 protein:vir:9647 Length: 132 # 81.9 0.081 5.1E-05 26.6 9.8 116 1-147 4-130 (132)
131 protein:vir:105825 Length: 108 74.8 0.013 8E-06 30.9 2.2 101 1-140 1-108 (108)
132 protein:vir:102608 Length: 108 74.8 0.013 8E-06 30.9 2.2 101 1-140 1-108 (108)
133 protein:vir:98636 Length: 138 67.3 0.25 0.00016 23.8 9.4 116 1-147 10-136 (138)
134 protein:vir:6216 Length: 125 # 65.6 0.28 0.00017 23.6 9.2 111 1-145 1-125 (125)
135 protein:vir:78163 Length: 92 # 64.9 0.027 1.7E-05 29.2 1.7 90 1-115 1-92 (92)
136 protein:vir:79179 Length: 155 55.2 0.49 0.0003 22.3 7.8 134 1-143 1-155 (155)
137 protein:vir:2026 Length: 150 # 53.5 0.53 0.00033 22.1 7.5 130 1-143 1-150 (150)
138 protein:vir:101563 Length: 155 50.6 0.48 0.0003 22.3 6.0 97 26-147 1-97 (155)
139 protein:vir:77650 Length: 155 50.3 0.61 0.00038 21.8 6.5 97 1-147 1-97 (155)
140 protein:vir:8432 Length: 149 # 49.7 0.63 0.00039 21.7 9.6 121 1-144 16-149 (149)
141 protein:vir:1164 Length: 156 # 48.5 0.53 0.00033 22.1 5.9 137 1-147 1-156 (156)
142 protein:vir:100312 Length: 152 45.0 0.64 0.0004 21.6 5.8 132 1-144 1-152 (152)
143 protein:vir:5257 Length: 148 # 44.7 0.8 0.00049 21.1 8.0 91 1-147 1-96 (148)
144 protein:vir:79115 Length: 148 43.7 0.83 0.00052 21.0 7.9 127 1-143 1-148 (148)
145 protein:vir:98557 Length: 149 42.7 0.88 0.00054 20.9 7.7 130 1-143 1-149 (149)
146 protein:vir:6071 Length: 150 # 40.0 0.99 0.00062 20.6 7.7 130 1-143 1-150 (150)
147 protein:vir:5703 Length: 150 # 36.3 1.2 0.00073 20.2 7.8 130 1-143 1-150 (150)
148 protein:vir:4230 Length: 111 # 34.0 0.6 0.00037 21.8 3.8 101 1-135 1-111 (111)
149 protein:vir:94069 Length: 168 33.8 0.88 0.00055 20.9 4.7 92 43-147 1-106 (168)
150 protein:vir:107757 Length: 189 33.2 1.4 0.00085 19.8 6.8 86 1-147 1-98 (189)
151 protein:vir:78335 Length: 133 30.6 1.6 0.00097 19.5 8.7 116 1-145 1-133 (133)
152 protein:vir:78607 Length: 155 29.9 1.6 0.001 19.4 6.6 96 26-147 1-97 (155)
153 protein:vir:106728 Length: 155 28.4 1.8 0.0011 19.2 6.6 97 26-147 1-97 (155)
154 protein:vir:1838 Length: 149 # 26.8 1.9 0.0012 19.0 7.8 129 1-143 1-149 (149)
155 protein:vir:96973 Length: 133 25.0 2.1 0.0013 18.8 8.2 116 1-143 1-133 (133)
156 protein:vir:78644 Length: 133 25.0 2.1 0.0013 18.8 8.2 116 1-143 1-133 (133)
157 protein:vir:9363 Length: 133 # 25.0 2.1 0.0013 18.8 8.2 116 1-143 1-133 (133)
158 protein:vir:94419 Length: 133 25.0 2.1 0.0013
18.8 8.2 116 1-143 1-133 (133) 159 protein:vir:93898 Length: 133 22.4 2.4 0.0015 18.5 8.7 116 1-143 1-133 (133) 160 protein:vir:2435 Length: 111 # 21.2 1.5 0.00096 19.5 3.6 101 1-135 1-111 (111) No 1 >protein:vir:79638 Length: 146 # NCBI annotation: gp40 # Family: family:all:448 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285529;genbank:gi:148734512;genbank:GeneID:5219996 Probab=100.00 E-value=7.3e-58 Score=333.82 Aligned_cols=146 Identities=71% Similarity=1.204 Sum_probs=144.4 Q ss_pred CCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhhh Q lcl|NC_021331. 1 MAKNYTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIAE 80 (147) Q Consensus 1 MAk~~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~~ 80 (147) || +.|+.+|+++|++|++++|++++.++|+++++++++|+.+|||||||||+||++|+++||++.++.+||+|+.|.+. T Consensus 1 ma-~~~~~sFa~~i~~~~~~ve~~~~~~~r~~a~~i~~~vv~~sPVDTGr~Ranw~vs~~~~~~~~~~~~dp~G~~t~~~ 79 (146) T protein:vir:79 1 MA-DYSIREFHGNVDKWIEQVESGLNDVIQIFGEKVHGALVDIAPVDTGRFKANMQITANKPPLYALNQYDPDGEKIKAE 79 (146) T ss_pred CC-cchhHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhhccccceeecCcccccccCCCCCCcccHHH Confidence 99 58899999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCchHHHHHHHHHHHHHHHHHHHHHhhhcC Q lcl|NC_021331. 
81 GKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGVLGIVAVKLRSYMAEAIKESRAKNAL 147 (147) Q Consensus 81 ~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~~~~v~~a~~e~k~~~~~ 147 (147) +..++.+++.|+++|++|||+||+|||.+||||||+|||.|||++++++|++||+++++|+|.|||| T Consensus 80 ~~~~i~~~~~g~~~~~~iyi~NnlpYA~~LEyG~S~QAP~G~v~~~~~~~~~~v~~a~~e~k~~~~l 146 (146) T protein:vir:79 80 GRRTLYALLHGGGAIKSIYFSNMLIYANALEYGHSKQAPAGVFGIVAIRLRSYMAEAIREARKKNAL 146 (146) T ss_pred HHHHHHHHHhcccccceeEEeeCchhhhhhhccccCCCcchHHHHHHHHHHHHHHHHHHHHHhhccC Confidence 9999999999999999999999999999999999999999999999999999999999999999999 No 2 >protein:vir:107703 Length: 147 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003902;genbank:gi:45686318;genbank:GeneID:2773043 Probab=100.00 E-value=2.8e-55 Score=319.69 Aligned_cols=146 Identities=60% Similarity=1.009 Sum_probs=143.1 Q ss_pred CCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhhh Q lcl|NC_021331. 1 MAKNYTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIAE 80 (147) Q Consensus 1 MAk~~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~~ 80 (147) || +.|+.+|+++|++|++++|++++.++|+++++++++|+.+|||||||||+||++|+++||.+..+.+||+|+.+.+. T Consensus 1 ma-~~~~~~F~~~i~~~~~~ve~~~~~~~r~~a~~i~~~vv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~dp~g~~t~a~ 79 (147) T protein:vir:10 1 MA-NYQIRRFQGEIDAWINAAESTLEHAIEIFVRDVHDALVSRSPVDTGRFKGNWQITFNEIPNHALNRYDKTGGVVRGE 79 (147) T ss_pred CC-CcchhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhhccccceeecCccccccCCcCCCccchhhh Confidence 99 68999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCchHHHHHHHHHHHHHHHHHHHHHh-hhcC Q lcl|NC_021331. 
81 GKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGVLGIVAVKLRSYMAEAIKESRA-KNAL 147 (147) Q Consensus 81 ~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~~~~v~~a~~e~k~-~~~~ 147 (147) +..++..++.+.++|++|||+||+|||.+||||||+|||.||||+++++|++||+++++|+|. |++| T Consensus 80 ~~~~~~~~~~~~~~~~~iyi~Nn~pYA~~LEyG~S~QAP~G~V~~t~q~~~~~v~~~~~e~k~~~~~~ 147 (147) T protein:vir:10 80 EQAKTYGMFSRGGAITSVHFSNMLIYANALEYGHSQQAPSGVVGLVALRLRSYMADAIKQARRQQNAL 147 (147) T ss_pred hhHHHHHHhhhccCcceEEEeeCcchhhhhhccccCCCCchHHHHHHHHHHHHHHHHHHHHHhhhccC Confidence 999999999999999999999999999999999999999999999999999999999999965 9999 No 3 >protein:vir:103280 Length: 142 # NCBI annotation: phage-related hypothetical protein # Family: family:all:448 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277459;genbank:gi:71834102;genbank:GeneID:3562391 Probab=100.00 E-value=1.3e-53 Score=310.47 Aligned_cols=142 Identities=46% Similarity=0.751 Sum_probs=136.1 Q ss_pred CCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhhh Q lcl|NC_021331. 1 MAKNYTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIAE 80 (147) Q Consensus 1 MAk~~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~~ 80 (147) ||+ |..+|+++|++|+++++++++.++|+++++++++|+.+|||||||||+||++|+++||.+..+++||+|+.+.+. T Consensus 1 Ma~--~~~sf~~~i~~~~~~ve~~~~~v~r~~a~~i~~~vv~~sPVdTGr~R~nw~vs~~~~~~~~~~~~d~~G~~t~~~ 78 (142) T protein:vir:10 1 MAN--DVVSFRNSINAWIDGVTEGVELIVEGTLTKATKDIVKLSPVDTGRFRGNWQATGNSPAAQSLNNYDPDGNETRNS 78 (142) T ss_pred Ccc--chhhhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCcccchhhcccceeeecCcccccccCcCCCCccchhh Confidence 998 778899999999999999999999999999999999999999999999999999999999999999999999888 Q ss_pred HHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCchHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_021331. 81 GKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGVLGIVAVKLRSYMAEAIKESRAKN 145 (147) Q Consensus 81 ~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~~~~v~~a~~e~k~~~ 145 (147) +...+. 
.+.+.++|++|||+||+|||.+||||||+|||.|||++++++|++||+++++|+|+|+ T Consensus 79 ~~~~~~-~i~~~~~g~~iyi~Nn~pYA~~LEyG~S~QAP~G~v~~a~q~~~~~v~~a~~e~~~~~ 142 (142) T protein:vir:10 79 LRRQIY-ALARDANTNVIYISNRLDYAQGLEFGSSNQAPSGVLGVVQKRLGRYFAEAVQEAKRAL 142 (142) T ss_pred HHHHHH-HhhhccccceEEEeeCcchhhhhhccccCCCcchHHHHHHHHHHHHHHHHHHHhhccC Confidence 876654 4567899999999999999999999999999999999999999999999999999999 No 4 >protein:vir:104347 Length: 145 # NCBI annotation: conserved phage-related protein # Family: family:all:448 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398975;genbank:gi:81343959;genbank:GeneID:3778879 Probab=100.00 E-value=9.6e-53 Score=305.78 Aligned_cols=144 Identities=40% Similarity=0.643 Sum_probs=135.0 Q ss_pred CCccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhh Q lcl|NC_021331. 1 MAKNY-TIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIA 79 (147) Q Consensus 1 MAk~~-s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~ 79 (147) |||+. ++.+|+++|++|+++++++++.++|+++++++++|+.+|||||||||+||++|+++||.++.+++||+|+.+.+ T Consensus 1 ~~~~m~~~~sF~~~i~~~~~~ve~~~~~v~r~~a~~i~~~vv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~G~~t~~ 80 (145) T protein:vir:10 1 MARNIGSVVTFEKSIADWIDRAEDGFGIVVSNTVIKTANAIVDLSPVDTGRFKANWQISANSPAQQSLNEYDQTGGQTKT 80 (145) T ss_pred CCCcccchhccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccceeecccccccccccCCCCccchh Confidence 99984 67889999999999999999999999999999999999999999999999999999999999999999998876 Q ss_pred hHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCchHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_021331. 80 EGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGVLGIVAVKLRSYMAEAIKESRAKN 145 (147) Q Consensus 80 ~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~~~~v~~a~~e~k~~~ 145 (147) ... 
....++.++++|++|||+||+|||.+||||||+|||.||||+++++|+++|+++++|+|.-+ T Consensus 81 ~~~-~~~~~i~~~k~g~~iyi~Nn~pYA~~LEyG~S~QAP~G~v~~~~~~~~~~v~~~~~e~k~~~ 145 (145) T protein:vir:10 81 YLA-RQARAVANSKATSVIYITNRLDYAADLEYGASNQAPAGVLGVVQARLGRYFQEAVEEARRAI 145 (145) T ss_pred hHH-HHHHHhhcccccceEEEeeCchhhhHhhccccCCCcchHHHHHHHHHHHHHHHHHHHhhccC Confidence 433 33446688999999999999999999999999999999999999999999999999998877 No 5 >protein:vir:78380 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110844;genbank:gi:134288605;genbank:GeneID:5179643 Probab=100.00 E-value=8.3e-50 Score=289.67 Aligned_cols=131 Identities=24% Similarity=0.446 Sum_probs=124.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhhhHHHHHHH Q lcl|NC_021331. 8 REFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIAEGKRAIYA 87 (147) Q Consensus 8 ~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~~~~~~i~~ 87 (147) -+|+.+|++|+++++++++.++|++++++++.|+.++||||||||+||++|+++||.+..+.+||+|+.+.+.+ .. T Consensus 1 msf~~~i~~~~~~ve~~~~~~~r~~a~~~~~~iv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~g~~t~~~~----~~ 76 (131) T protein:vir:78 1 MSFALDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNA----AN 76 (131) T ss_pred CCcCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCchhhhccccceecccccccccCCCCCCchhhHHHH----HH Confidence 23889999999999999999999999999999999999999999999999999999999999999999887655 45 Q ss_pred HHhcccccceEEEecCchhhhhhhcCCCCCCCchHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021331. 
88 ILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGVLGIVAVKLRSYMAEAIKESR 142 (147) Q Consensus 88 ~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~~~~v~~a~~e~k 142 (147) ++.++++|++|||+||+|||.+||||||+|||.||||+++++|+++|+++++|+| T Consensus 77 ~i~~~~~g~~iyi~Nn~pYA~~LEyG~S~QAP~G~v~~~~~~~~~~v~~~~~e~k 131 (131) T protein:vir:78 77 FVLNAADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQLLNEEASKVK 131 (131) T ss_pred HHhhccCCceEEEeeCchhhhHhhccccCCCcchHHHHHHHHHHHHHHHHHHhcC Confidence 5677899999999999999999999999999999999999999999999999999 No 6 >protein:vir:94994 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224022;genbank:gi:62327309;genbank:GeneID:5176822 Probab=100.00 E-value=1.8e-49 Score=287.85 Aligned_cols=131 Identities=24% Similarity=0.457 Sum_probs=124.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhhhHHHHHHH Q lcl|NC_021331. 8 REFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIAEGKRAIYA 87 (147) Q Consensus 8 ~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~~~~~~i~~ 87 (147) -+|+++|++|+++++++++.++|++++++++.|+.+|||||||||+||++|+++||.+..+++||+|+.+..++.. T Consensus 1 msF~~~i~~~~~~ve~~~~~~~r~~a~~~~~~iv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~g~~t~~~~~~---- 76 (131) T protein:vir:94 1 MSFALDVTRFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGSTPADGTTDATDKSGNTATGNATS---- 76 (131) T ss_pred CCcccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCchhhhhccchhccccccccccCCCCCCchhhHHHHHH---- Confidence 2388999999999999999999999999999999999999999999999999999999999999999999776544 Q ss_pred HHhcccccceEEEecCchhhhhhhcCCCCCCCchHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021331. 
88 ILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGVLGIVAVKLRSYMAEAIKESR 142 (147) Q Consensus 88 ~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~~~~v~~a~~e~k 142 (147) ++.++++|++|||+||+|||.+||||||+|||.||||+++++|+++|+++++|+| T Consensus 77 ~i~~~~~g~~iyi~Nn~pYA~~LEyG~S~QAP~g~v~~~~~~~~~~v~~~~~e~k 131 (131) T protein:vir:94 77 FVLNAADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQLLNEEASKVK 131 (131) T ss_pred HHhhccccceEEEeeCchhhhhhhccccCCCcchHHHHHHHHHHHHHHHHHHhcC Confidence 5567899999999999999999999999999999999999999999999999999 No 7 >protein:vir:97190 Length: 148 # NCBI annotation: hypothetical protein ORF030 # Family: family:all:448 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294538;genbank:gi:149408259;genbank:GeneID:5237055 Probab=100.00 E-value=3.2e-49 Score=286.47 Aligned_cols=141 Identities=25% Similarity=0.344 Sum_probs=126.3 Q ss_pred CCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCC--cchhh Q lcl|NC_021331. 1 MAKNYTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKH--GDKTI 78 (147) Q Consensus 1 MAk~~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~--G~~t~ 78 (147) || |+.+|+++|++|++++|++++.++|+++++++++|+.++||||||||+||++|+++||++..++.||+ |+.+. T Consensus 1 m~---~~~sFa~~i~~~~~~ve~~~~~~~r~~a~~i~~~vv~~sPVdTGrfRanw~vs~~~p~~~~~~~~dp~~~G~~~~ 77 (148) T protein:vir:97 1 MP---SLSEFSRRITLRGRKVAEGADALTRKVALAADQAVVSGTPVDTGRARSNWIAAIGSAPSSVIDAYSPGEAGSTEA 77 (148) T ss_pred CC---ccchhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhhhhhhheeecccccccccccCCCCCCcccc Confidence 99 68899999999999999999999999999999999999999999999999999999999999998875 55555 Q ss_pred hhHHHH---HHHHHhcccccceEEEecCchhhhhhhcCCCCCCCchHHHHHHHHHHHHHHH--HHHHHHhh Q lcl|NC_021331. 79 AEGKRA---IYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGVLGIVAVKLRSYMAE--AIKESRAK 144 (147) Q Consensus 79 ~~~~~~---i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~~~~v~~--a~~e~k~~ 144 (147) ..+... 
...++.++|+|++|||+||+|||.+||||||+|||.||||+++++|+++|++ +++|+-+- T Consensus 78 ~~~~~~i~~~~~vi~~~k~g~~iyi~NnlpYA~~LEyG~S~QAP~G~v~~t~~~~~~~v~~~~~~~~~~~~ 148 (148) T protein:vir:97 78 ANTQAAIDQAESVIRGYNYGEEIHITNNLPYIQRLNDGYSAQAPANFVEQAVLEAVQVVQFGRVVDGDPGS 148 (148) T ss_pred cchhHHHHHHHHHhhccCCCceEEEeecchhhhHhhccccCCCcchHHHHHHHHHHHHHHhhhhhcCCCCC Confidence 555543 4578889999999999999999999999999999999999999999999854 44444444 No 8 >protein:vir:95157 Length: 144 # NCBI annotation: hypothetical protein ORF019 # Family: family:all:448 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293426;genbank:gi:148912847;genbank:GeneID:5228232 Probab=100.00 E-value=1.6e-47 Score=277.13 Aligned_cols=139 Identities=26% Similarity=0.310 Sum_probs=122.7 Q ss_pred CCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcch---- Q lcl|NC_021331. 1 MAKNYTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDK---- 76 (147) Q Consensus 1 MAk~~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~---- 76 (147) ||| |+.+|++++++|++++|+.++.++|++|+++++.|+++|||||||||+||++|+++|++++.++.+|.|.. T Consensus 1 MA~--~~~~f~~~i~~~~~~ve~~~~~~~r~~a~~v~~~vv~~sPVDTGrfRanw~vs~~~p~~~~~~~~~~~~~~~t~d 78 (144) T protein:vir:95 1 MAK--SLLDLADRLEKKAKAIDEAASQNAVDTALAIVGDLAYKTPVDTSQALSNWIVTLESPSGQQIKPHFPGSQGSTQR 78 (144) T ss_pred Cch--hhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccceeccccccccccccccccccccCC Confidence 999 78899999999999999999999999999999999999999999999999999999999988877765432 Q ss_pred -hhhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCchHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_021331. 
77 -TIAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGVLGIVAVKLRSYMAEAIKESRAKN 145 (147) Q Consensus 77 -t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~~~~v~~a~~e~k~~~ 145 (147) +..........++.++++|++|||+||+|||.+||||||+|||.||||+++++|+++|++ +|-+- T Consensus 79 ~sg~~tl~~~~~vi~~~~~g~~iyi~NnlpYA~~LEyG~S~QAP~G~vr~~~q~~~~~v~~----~~~~~ 144 (144) T protein:vir:95 79 ASAAETLNSAKLVLRNKKPGQAIFITNNLPYIRRLNDGYSAQAPAGFVERAVLIGRKMRKK----FKIKD 144 (144) T ss_pred CchhHHHHHHHHHHhhcCccceEEEeeCchhhhhhhccccCCCcchHHHHHHHHHHHHHHh----hccCC Confidence 223333455677888999999999999999999999999999999999999999999875 33333 No 9 >protein:vir:80425 Length: 134 # NCBI annotation: BcepGomrgp15 # Family: family:all:448 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210235;genbank:gi:146329927;genbank:GeneID:5123534 Probab=100.00 E-value=2.3e-47 Score=276.30 Aligned_cols=134 Identities=18% Similarity=0.294 Sum_probs=123.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhhhHHHHHHH Q lcl|NC_021331. 8 REFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIAEGKRAIYA 87 (147) Q Consensus 8 ~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~~~~~~i~~ 87 (147) -+|+++|++|++++|++++.++|+++++++++|+.++||||||||+||++|+++||++..+.+||+|..+ ......+.. T Consensus 1 msF~~~i~~~~~~ve~~~~~~~r~~a~~~~~~vv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~g~~~-~~~~~~~~~ 79 (134) T protein:vir:80 1 MSYTDRFNVIAKGIEDNVDNLVKNVALAIGSNVIADTPILTGQARRNWQTELNQMPESVLDIPESPSEGM-DEALQVLQQ 79 (134) T ss_pred CCcccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhhhcccceeecCcccccccCcCCCCccc-hhhHHHHHH Confidence 3489999999999999999999999999999999999999999999999999999999999999998644 345566677 Q ss_pred HHhcccccceEEEecCchhhhhhhcCCCCCCCchHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_021331. 88 ILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGVLGIVAVKLRSYMAEAIKESRA 143 (147) Q Consensus 88 ~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~~~~v~~a~~e~k~ 143 (147) ++.++|+|++|||+||+|||.+||||||+|||.||||+++++|+++|++ ++.+-. 
T Consensus 80 vi~~~k~g~~iyi~Nn~pYA~~LEyG~S~QAP~G~v~~t~~~~~~~v~~-~~~~~~ 134 (134) T protein:vir:80 80 TVGQYKAGDTVHITNNAPYIKELNSGSSQQAPANFVETSIMRATRLIRN-VKVVPQ 134 (134) T ss_pred HHhhccCcceEEEeeCchhhhhhhccccCCCcchHHHHHHHHHHHHHHh-hccCCC Confidence 8889999999999999999999999999999999999999999999999 677744 No 10 >protein:vir:96774 Length: 152 # NCBI annotation: hypothetical phage protein # Family: family:all:448 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224253;genbank:gi:62362388;genbank:GeneID:3345713 Probab=100.00 E-value=2.1e-45 Score=265.53 Aligned_cols=135 Identities=21% Similarity=0.290 Sum_probs=117.5 Q ss_pred CCc---ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--------------ccchhcccceeccCCcc Q lcl|NC_021331. 1 MAK---NYTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPV--------------DTGRYRGNWQVTANKPP 63 (147) Q Consensus 1 MAk---~~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPV--------------dtG~~R~nw~vs~~~~~ 63 (147) |-. +-+--+|+++|++|++++|+++++++|++++++++.|+.+||| ||||||+||++|+++|+ T Consensus 1 ~~~~~~~~~~msFaa~i~~~~~~~e~~~~~~~R~~~~~i~~~vv~~sPVg~~~~~~~~a~~~ydtGrfRanw~vS~~~p~ 80 (152) T protein:vir:96 1 MLSCICGGNPMSWSKSLKNIIVKNENLTEKQLRAGLFDAANTVILGSPVGAPELWQQPAPNYYRAGSYRSNHRVSISKIT 80 (152) T ss_pred CcceeeCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccchhhhhhhheeeecCCC Confidence 432 1223459999999999999999999999999999999999999 99999999999999999 Q ss_pred ccccccCCCCcchhhhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCchHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_021331. 64 LYALNQYDKHGDKTIAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGVLGIVAVKLRSYMAEAIKESRA 143 (147) Q Consensus 64 ~~~~~~~d~~G~~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~~~~v~~a~~e~k~ 143 (147) .+.++..|++|+ +.. 
....+.++++|++|||+||+|||.+||||||+|||.||||+++++|++||++ ++|+ T Consensus 81 ~~~~~~~~~~~t--~~~----~~~~i~~~~~g~~iyi~NnlPYA~~LEyG~S~QAP~G~vr~t~~~~~~~v~e---a~~~ 151 (152) T protein:vir:96 81 SFEKGISSQSSI--MMD----LQSDIAKFKIGETLFMTNPLPYATSIEYGHSSQAPNGVYRPAVRRLVKFLNT---ELKA 151 (152) T ss_pred cccccCCCCCch--HHH----HHHHHhhccccceEEEeeCchhhhHhhccccCCCCchHHHHHHHHHHHHHHH---Hhcc Confidence 887776666664 433 3445778899999999999999999999999999999999999999999997 4566 Q ss_pred h Q lcl|NC_021331. 144 K 144 (147) Q Consensus 144 ~ 144 (147) | T Consensus 152 ~ 152 (152) T protein:vir:96 152 K 152 (152) T ss_pred C Confidence 7 No 11 >protein:vir:94944 Length: 121 # NCBI annotation: hypothetical protein phage protein # Family: family:all:448 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239282;genbank:gi:66392064;genbank:GeneID:5076589 Probab=100.00 E-value=1.7e-44 Score=260.50 Aligned_cols=121 Identities=25% Similarity=0.340 Sum_probs=112.1 Q ss_pred CCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhhh Q lcl|NC_021331. 1 MAKNYTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIAE 80 (147) Q Consensus 1 MAk~~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~~ 80 (147) |+. -+|.++|++|++++++.++.++|+++++++++|+.++||||||||+||+||+++|+.+..+..||+|+.+.+. T Consensus 1 ~~~----~sf~~~i~~~~~~ve~~~~~~~r~~~~~~~~~vv~~sPVdtGrfRanw~vs~~~p~~~~~~~~dp~g~~t~~~ 76 (121) T protein:vir:94 1 MIS----MKFNVNLSRLRSNLREEAKKKAIRIAQEIVNGVIARSPVLAGDYRSSWNVSEGSMEFKFNNGGNPANPTPAPA 76 (121) T ss_pred Ccc----chhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCchhhhhccccccccCcccccCCCCCCCcchhHHH Confidence 664 2588999999999999999999999999999999999999999999999999999999999999999999887 Q ss_pred HHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCchHHHHHHHHHH Q lcl|NC_021331. 81 GKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGVLGIVAVKLR 131 (147) Q Consensus 81 ~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~~ 131 (147) +.. 
...+.+++|||+||+|||.+||||||+|||.||||++++||+ T Consensus 77 ~~~------~~~~~~~~iyi~NnlpYA~~LE~G~S~QAP~G~v~~t~~~~q 121 (121) T protein:vir:94 77 IVV------SSNVALPHFYITNGAPYAQQLEKGSSTQAPLGIVRVTLASLR 121 (121) T ss_pred HHH------HHhhccceEEEeeCcchhhhhhcccCCCCcchHHHHHHHhhC Confidence 633 234568999999999999999999999999999999999999 No 12 >protein:vir:79034 Length: 141 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110729;genbank:gi:134287346;genbank:GeneID:4955208 Probab=99.81 E-value=3.2e-23 Score=143.88 Aligned_cols=124 Identities=20% Similarity=0.261 Sum_probs=87.2 Q ss_pred CCc--ccc---hHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCc Q lcl|NC_021331. 1 MAK--NYT---IREFHGNIDAWIN-AVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHG 74 (147) Q Consensus 1 MAk--~~s---~~~F~~~i~~f~~-~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G 74 (147) ||+ +++ +..|.++|.+.++ .++..+++++++++.++++.++.+||||||+||.||+++...-. ..+.++ T Consensus 1 M~~~~~~d~~gl~~~~~~l~~~~~~~~~~~~~~~~~~~a~~l~~~vk~~tPVdTG~Lr~sw~~~~~~~~----~~~~~~- 75 (141) T protein:vir:79 1 MARWGSVDFREFKRVCKKMEKLTKIDLDKFCKDAARELAARLLGKVIRRTPVDTGFLRQGWNGVAYARS----LPVYKQ- 75 (141) T ss_pred CCCCccCcHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhhcccccccccccc----cceeec- Confidence 998 455 4556777766655 57888899999999999999999999999999999987531110 011111 Q ss_pred chhhhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCchHHHH------HHHHH----HHHHHHHHHHHHhh Q lcl|NC_021331. 75 DKTIAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGVLGI------VAVKL----RSYMAEAIKESRAK 144 (147) Q Consensus 75 ~~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~V~~------a~~~~----~~~v~~a~~e~k~~ 144 (147) +.+-+|.|.||+|||++|||||+++.|.|||.. 
+.+++ +++|++.+.+.=.+ T Consensus 76 ------------------g~~~~v~v~n~~~YA~~VE~Ghr~~~~~gfV~G~fml~~s~~~~~~~~~~~~~~~l~~~l~~ 137 (141) T protein:vir:79 76 ------------------GNNYIIEVVNPTEYASYVNFGHRTKDGKGWVKGQHFLTISEMELQSQVDKIIEKKLLILLKG 137 (141) T ss_pred ------------------CCeeEEEEecCCcchhhhhcceeecCCcceeCCchhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 123468899999999999999999999998864 33444 44445555554222 Q ss_pred hcC Q lcl|NC_021331. 145 NAL 147 (147) Q Consensus 145 ~~~ 147 (147) +-= T Consensus 138 ~~~ 140 (141) T protein:vir:79 138 VFD 140 (141) T ss_pred hhc Confidence 111 No 13 >protein:vir:105467 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529877;genbank:gi:90592617;genbank:GeneID:3974531 Probab=99.72 E-value=2.4e-20 Score=128.17 Aligned_cols=118 Identities=17% Similarity=0.217 Sum_probs=82.9 Q ss_pred CCc-ccch---HHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCc Q lcl|NC_021331. 1 MAK-NYTI---REFHGNIDAWINA--VDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHG 74 (147) Q Consensus 1 MAk-~~s~---~~F~~~i~~f~~~--v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G 74 (147) |++ ++++ .+|.++|.+.+.. +++.+++.+++++.++++.++.+||||||+||+||+++-- ..+ T Consensus 1 Ms~~~id~~gl~~~~~~l~~~~~~~~~~~~~~~~l~~~~~~~~~~vk~~tPVdTG~Lr~S~~~~~~----------~~~- 69 (144) T protein:vir:10 1 MSLGHVDDAQFQQFASRVRQKIDSGYVKQELGKSSRRIGTQSLRILEANTPVKQGNLRRSWTAEGP----------TYG- 69 (144) T ss_pred CCCCCccHHHHHHHHHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHhCCCCcchhccceeecce----------eee- Confidence 887 4664 4456666665543 5678899999999999999999999999999999987510 011 Q ss_pred chhhhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCc-----------hHH------HHHHHHHHHHHHHH Q lcl|NC_021331. 75 DKTIAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPA-----------GVL------GIVAVKLRSYMAEA 137 (147) Q Consensus 75 ~~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~-----------G~V------~~a~~~~~~~v~~a 137 (147) +.+-++.|.|++|||++|||||+++.+. 
||| +.|++++.+.+.+- T Consensus 70 ------------------~~~~~~~V~n~~~YA~~VE~Ghr~~~G~~v~~~~~~~~~g~V~G~~~~~~a~~~~~~~~~~~ 131 (144) T protein:vir:10 70 ------------------CGGWTIKLINNAEYASYVESGHRQTPGRYVPVLKKRLVRDWVPGQFYMKKSIPQIQRQLPQL 131 (144) T ss_pred ------------------cCeeEEEEecCCCcccccccceeecCCcccccCCCccccceecCccchHHHHHHHHHHHHHH Confidence 1234678999999999999999988762 555 67777665555544 Q ss_pred HHHHHhh-hcC Q lcl|NC_021331. 138 IKESRAK-NAL 147 (147) Q Consensus 138 ~~e~k~~-~~~ 147 (147) +.+.=.+ +=| T Consensus 132 l~k~l~~l~d~ 142 (144) T protein:vir:10 132 VTEGLWGLKDL 142 (144) T ss_pred HHHHHHHHhhh Confidence 4332211 112 No 14 >protein:vir:9930 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795692;genbank:gi:28876456;genbank:GeneID:1257995 Probab=99.63 E-value=1.7e-18 Score=117.94 Aligned_cols=107 Identities=17% Similarity=0.186 Sum_probs=92.6 Q ss_pred cc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhhhHHH Q lcl|NC_021331. 5 YT-IREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIAEGKR 83 (147) Q Consensus 5 ~s-~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~~~~~ 83 (147) ++ +.+|.+.|.+..+.++..+...+++.+.++.++++..+|||||.||.||.++... T Consensus 1 i~Gld~l~~~l~~~~~~~~~~v~~al~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~~~---------------------- 58 (108) T protein:vir:99 1 MRGLDRFLRSVERKQKSVRIAVDKELSKSAARIERQAKILAPVDTGWLRAQIYSEQQR---------------------- 58 (108) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCchhhhcceeeeecC---------------------- Confidence 43 7789999999999999999999999999999999999999999999999875311 Q ss_pred HHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCchHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_021331. 
84 AIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGVLGIVAVKLRSYMAEAIKESRAK 144 (147) Q Consensus 84 ~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~~~~v~~a~~e~k~~ 144 (147) +..+.|.++.+||.+||||||.+.++.|++.++......|.+.++++=-| T Consensus 59 -----------~~~~~v~~~~~Ya~~vE~GT~~m~a~Pf~~pa~~~~~~~~~~~i~~~lrk 108 (108) T protein:vir:99 59 -----------LLHYRVVSPALYSIYLELGTRKMEAQSFLDPALRKEWPVLMANIKKMFKR 108 (108) T ss_pred -----------cEEEEeecCcccchhcccCccccCCCcchhhhHHHHHHHHHHHHHHHhcC Confidence 12466889999999999999999999999999999988777777766333 No 15 >protein:vir:102963 Length: 163 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945289;genbank:gi:39653724;uniprot:Q708M3;genbank:GeneID:2672877 Probab=99.59 E-value=3.6e-18 Score=116.24 Aligned_cols=117 Identities=20% Similarity=0.343 Sum_probs=83.8 Q ss_pred CCcccchHHH---HHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHhCCc---------------------------cc Q lcl|NC_021331. 1 MAKNYTIREF---HGNIDAWIN--AVDSGLKDCVELFAEKVHTDLVKRSPV---------------------------DT 48 (147) Q Consensus 1 MAk~~s~~~F---~~~i~~f~~--~v~~~~~~~~r~~a~~l~~~vv~~tPV---------------------------dt 48 (147) |.-++++.+| .++|.+.+. .++...+++++++|.+++++++.+||| +| T Consensus 1 m~~~~d~~~l~~f~k~l~~~~~~~~~~~~~~~~~~e~a~~ll~~vk~rtPv~~~~~~~~~~~~~~~k~~k~~~~~~~k~t 80 (163) T protein:vir:10 1 MSGGFDYRSFAKFANNFNRNANHAKVDRFMRQTLNYEGTELKSKVKERTPVGVYTDHWVEFTTKDGKHVKFWASAHGKQG 80 (163) T ss_pred CCCccCHHHHHHHHHHHHHHhhhcchHHHHHHHHHHHHHHHHHHHHHhCCcccchhhhhhhhhcccchhhhhcccccccc Confidence 9998876654 445544432 356678999999999999999999998 89 Q ss_pred chhcccceeccCCccccccccCCCCcchhhhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCchHHH---- Q lcl|NC_021331. 49 GRYRGNWQVTANKPPLYALNQYDKHGDKTIAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGVLG---- 124 (147) Q Consensus 49 G~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~V~---- 124 (147) |.||.||.++- +..+| .+.+|-|.|+.|||++|||||++. +.|||. 
T Consensus 81 G~lr~swk~~~----------~~k~~-------------------~~~~v~v~N~~~YA~~VE~GHR~~-~gGfV~G~fm 130 (163) T protein:vir:10 81 GTLQKGWSKSR----------IEVSG-------------------RTYKQKVYNKVYYAPHVEYGHKTV-NGGFVPGQFF 130 (163) T ss_pred chhhccceecc----------eeecC-------------------CceEEEEEecCCccchhhcceeec-CCceeccchh Confidence 99999999851 11111 133577889999999999999665 467774 Q ss_pred --HHHH----HHHHHHHHHHHHHHhhhcC Q lcl|NC_021331. 125 --IVAV----KLRSYMAEAIKESRAKNAL 147 (147) Q Consensus 125 --~a~~----~~~~~v~~a~~e~k~~~~~ 147 (147) .|.+ .++.++++.+.++=.++=+ T Consensus 131 l~~s~~~~~~~~~~~~e~~l~~~l~k~~~ 159 (163) T protein:vir:10 131 LHKTVEDTKSDMEKRVRDKYDGFMRKVVL 159 (163) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 4554 4455666666665444444 No 16 >protein:vir:95789 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950593;genbank:gi:119953788;genbank:GeneID:5076859 Probab=99.59 E-value=7.3e-18 Score=114.53 Aligned_cols=112 Identities=13% Similarity=0.066 Sum_probs=94.0 Q ss_pred CCccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhh Q lcl|NC_021331. 1 MAKNY-TIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIA 79 (147) Q Consensus 1 MAk~~-s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~ 79 (147) |+-.| .+.+|.+.|++..+.+.+.+...+++.+.++..++...+|||||.||.||.++. T Consensus 1 msi~i~Gld~l~~~l~~~~~~~~~~v~~al~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~-------------------- 60 (114) T protein:vir:95 1 MAIKWQGIEKLVATISNAQPKAVEQSLQVLKNNGEKGKRIAKQLAPKDTEFLKDHITTSY-------------------- 60 (114) T ss_pred CeeeeehHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCchhhhhceeeec-------------------- Confidence 88666 488999999999999999999999999999999999999999999999998642 Q ss_pred hHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCchHHHHHHHHHHHHHHHHHHHH-Hhhhc Q lcl|NC_021331. 
80 EGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGVLGIVAVKLRSYMAEAIKES-RAKNA 146 (147) Q Consensus 80 ~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~~~~v~~a~~e~-k~~~~ 146 (147) .|.+..|.++.+|+.+|||||+.|+|+.|++.++++....+.+.+.+. |..+= T Consensus 61 --------------~g~~~~V~~~~~Ya~yvE~GT~~~~aqPfl~pa~~~~~~~~~~~l~~~l~~~~k 114 (114) T protein:vir:95 61 --------------PGMEAHIHGEAGYDGYQEYGTRFQPGTPHFRPMMEQIQPQFQKDMTDVMKGAFK 114 (114) T ss_pred --------------CceEEEeecCCCccceeecCccccCCCccchhhHHHHHHHHHHHHHHHHHhhcC Confidence 123456778999999999999999999999999998877666655544 33322 No 17 >protein:vir:94654 Length: 142 # NCBI annotation: tail component protein # Family: family:all:1084 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579211;genbank:gi:93007447;genbank:GeneID:5076773 Probab=99.59 E-value=9.3e-18 Score=113.97 Aligned_cols=112 Identities=21% Similarity=0.306 Sum_probs=98.2 Q ss_pred CCcc---cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchh Q lcl|NC_021331. 1 MAKN---YTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKT 77 (147) Q Consensus 1 MAk~---~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t 77 (147) ||+- +++..|.+.|.++.+++...++..+.+.+.++.+.+...+|||||.||+||.+.+.. T Consensus 1 Ma~~~~~~~~~~l~~~l~~~~~~~~~~~~~~l~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~---------------- 64 (142) T protein:vir:94 1 MAGLNYRVNSTEFQGALRAALDRLTGAAREATEAAANDMVNMAKGLCPVDTGRLRSSIQAVPSG---------------- 64 (142) T ss_pred CceeEEEecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeecc---------------- Confidence 9983 578889999999999999999999999999999999999999999999999864321 Q ss_pred hhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCC---------------------------CCCCchHHHHHHHHH Q lcl|NC_021331. 
78 IAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHS---------------------------KQAPAGVLGIVAVKL 130 (147) Q Consensus 78 ~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s---------------------------~QAp~G~V~~a~~~~ 130 (147) .+...++.|.++++||.++||||+ ++.|+.|++.++.+- T Consensus 65 --------------~g~~~~~~v~~~~~YA~~vE~Gt~~~~i~pk~~k~l~~~~~~~~~~~v~~pG~~~~pfl~~A~~~~ 130 (142) T protein:vir:94 65 --------------GRFSFSVTIGTNVTYAADVEYGTAPHVIVPKDKKALYWPGAAHPVAKVNHPGTRAQPFMRPAIAAA 130 (142) T ss_pred --------------CCceEEEEEecCcccchhhhccCCCceeccCCCccceecccceeeeeeeecCCCCCcchhHHHHHH Confidence 011236788999999999999974 377999999999999 Q ss_pred HHHHHHHHHHHH Q lcl|NC_021331. 131 RSYMAEAIKESR 142 (147) Q Consensus 131 ~~~v~~a~~e~k 142 (147) ...|.+-++++| T Consensus 131 ~~~i~~~~~~~~ 142 (142) T protein:vir:94 131 STFLRNHAKGIR 142 (142) T ss_pred HHHHHHHHHhcC Confidence 999999999999 No 18 >protein:vir:96121 Length: 137 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240082;genbank:gi:66395767;genbank:GeneID:5133101 Probab=99.58 E-value=9.8e-18 Score=113.83 Aligned_cols=107 Identities=15% Similarity=0.109 Sum_probs=92.6 Q ss_pred CCccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhh Q lcl|NC_021331. 1 MAKNY-TIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIA 79 (147) Q Consensus 1 MAk~~-s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~ 79 (147) ||+-+ .+.+|.+.|+++.+.+++.+.+.+++.+.++.++++..+|||||+||+||.+.+.. 
T Consensus 1 Ma~~~~G~~~l~~~l~~~~~~~~~~~~~~l~~~a~~~~~~ak~~~pvdTG~L~~Si~~~~~~------------------ 62 (137) T protein:vir:96 1 MAKVKYGNWDLVAELEDYRDEMEEWVKKGILKTTLAIYNTAVALAPVDLGFLKESIDFKVTD------------------ 62 (137) T ss_pred CchhHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCccchhcCceeEeec------------------ Confidence 99965 68899999999999999999999999999999999999999999999999875311 Q ss_pred hHHHHHHHHHhcccccceEEEecCchhhhhhhcCC-----------------------------CCCCCchHHHHHHHHH Q lcl|NC_021331. 80 EGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGH-----------------------------SKQAPAGVLGIVAVKL 130 (147) Q Consensus 80 ~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~-----------------------------s~QAp~G~V~~a~~~~ 130 (147) .|.++.|.++++||.++|||| ++|.|+.|++.++.+- T Consensus 63 --------------~g~~~~V~~~~~YA~yvE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~pFl~pA~~~~ 128 (137) T protein:vir:96 63 --------------GGFSSVISVGAEYAIYVEFGTGIYATGPGGSRARKLPWTYKGDDGEWHTTYGQQAQPFWNPAIDEG 128 (137) T ss_pred --------------CceEEEEecCCCcccccccCccccccCCCccccccccceeeccCcceeecCCCCCCcchhHHHHHH Confidence 133577889999999999997 5577888999998888 Q ss_pred HHHHHHHHH Q lcl|NC_021331. 131 RSYMAEAIK 139 (147) Q Consensus 131 ~~~v~~a~~ 139 (147) ...|.+-+. T Consensus 129 ~~~i~k~i~ 137 (137) T protein:vir:96 129 RKVFNRYFS 137 (137) T ss_pred HHHHHHhhC Confidence 777777666 No 19 >protein:vir:3617 Length: 112 # NCBI annotation: ORF40 # Family: family:all:180 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112703;genbank:gi:13786571;genbank:GeneID:921069 Probab=99.57 E-value=9.5e-18 Score=113.91 Aligned_cols=108 Identities=15% Similarity=0.183 Sum_probs=85.8 Q ss_pred CCcccc---hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchh Q lcl|NC_021331. 1 MAKNYT---IREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKT 77 (147) Q Consensus 1 MAk~~s---~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t 77 (147) |+.+++ +.+|.+.|++.. -++.+++.+++.+..+..+++..+|||||.||+||.++.. 
T Consensus 1 M~~~i~i~Gld~l~~~L~~~~--~~~~~~~al~~~~~~i~~~ak~~aPvdTG~Lr~si~~~~~----------------- 61 (112) T protein:vir:36 1 MKSSLSFKGIDQLVKHLDKAA--SLKGVQQVVKSNTSNMTANMQKLVPVDTGYMKRSIKMELT----------------- 61 (112) T ss_pred CceeeeehhHHHHHHHHHhhh--hHHHHHHHHHHHHHHHHHHHHHhCCCCchhhhhceeeeec----------------- Confidence 998875 455555555543 2356888999999999999999999999999999987531 Q ss_pred hhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCchHHHHHHHHHHHHHHHHHHHH-H Q lcl|NC_021331. 78 IAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGVLGIVAVKLRSYMAEAIKES-R 142 (147) Q Consensus 78 ~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~~~~v~~a~~e~-k 142 (147) ..|.++.|.++.+||.+|||||+.+.|+.|++.+++.....+.+.++++ | T Consensus 62 ---------------~~~~~~~V~~~~~Ya~~vE~GT~k~~a~Pfl~pa~~~~~~~~~~~i~~~lr 112 (112) T protein:vir:36 62 ---------------EGGFSGQAGPHTDYSAYVEYGTRFQSAQPFVKPAYNEQKGVFIKDLERLLK 112 (112) T ss_pred ---------------CCceEEEeecCCCccceeeccccccCCCcchhhhHHHHHHHHHHHHHHHcC Confidence 1134678889999999999999999999999999988766665555444 5 No 20 >protein:vir:95894 Length: 137 # NCBI annotation: ORF046 # Family: family:all:180 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240389;genbank:gi:66396083;genbank:GeneID:5133405 Probab=99.56 E-value=2e-17 Score=112.17 Aligned_cols=107 Identities=17% Similarity=0.169 Sum_probs=92.5 Q ss_pred CCccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhh Q lcl|NC_021331. 1 MAKNY-TIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIA 79 (147) Q Consensus 1 MAk~~-s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~ 79 (147) ||+.+ .+.+|.+.|+.+.+++++.+...+++.+.++.++++..+|||||.||+||++.+.. 
T Consensus 1 Ma~~~~G~~~l~~~l~~~~~~~~~~~~~~~~~~a~~v~~~ak~~aPv~TG~L~~Si~~~~~~------------------ 62 (137) T protein:vir:95 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKD------------------ 62 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcCeeeEeeC------------------ Confidence 99964 68899999999999999999999999999999999999999999999999864311 Q ss_pred hHHHHHHHHHhcccccceEEEecCchhhhhhhcCC-----------------------------CCCCCchHHHHHHHHH Q lcl|NC_021331. 80 EGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGH-----------------------------SKQAPAGVLGIVAVKL 130 (147) Q Consensus 80 ~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~-----------------------------s~QAp~G~V~~a~~~~ 130 (147) .+-+..|.++++||.++|||| ++|.|+.|++.++++- T Consensus 63 --------------~~~~~~V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~ 128 (137) T protein:vir:95 63 --------------GGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAG 128 (137) T ss_pred --------------CceEEEEecCCCcccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHH Confidence 122456779999999999998 6788999999999888 Q ss_pred HHHHHHHHH Q lcl|NC_021331. 131 RSYMAEAIK 139 (147) Q Consensus 131 ~~~v~~a~~ 139 (147) ...|.+-+. T Consensus 129 ~~~i~k~l~ 137 (137) T protein:vir:95 129 RAFFNKYFS 137 (137) T ss_pred HHHHHHhhC Confidence 877777776 No 21 >protein:vir:94538 Length: 125 # NCBI annotation: putative head to tail joining # Family: family:all:180 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223893;genbank:gi:62327105;genbank:GeneID:5075554 Probab=99.55 E-value=3e-17 Score=111.14 Aligned_cols=117 Identities=15% Similarity=0.204 Sum_probs=91.1 Q ss_pred CCcccc-----hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcc Q lcl|NC_021331. 
1 MAKNYT-----IREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGD 75 (147) Q Consensus 1 MAk~~s-----~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~ 75 (147) ||..+| +.++.+.|+...+.+.+.+...+++.+..+..+....+|||||.||.||.++.-. .. T Consensus 1 Ma~~~~i~~~Gld~l~~~L~~~~~~~~~~v~~al~~~a~~i~~~ak~~ap~~tG~L~~sI~~~~~~---------~~--- 68 (125) T protein:vir:94 1 MANDFNIKFKGVDKLLDEFDISRKELVPYSVEAMKTSLSRAVEKSKGLARVDTGYMRNNIQQDEVK---------EE--- 68 (125) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCChhhhhhceeccee---------cc--- Confidence 998543 5678888998888888999999999999999999999999999999999875211 00 Q ss_pred hhhhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCchHHHHHHHHHHHHHHHHHHHHHhhhcC Q lcl|NC_021331. 76 KTIAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGVLGIVAVKLRSYMAEAIKESRAKNAL 147 (147) Q Consensus 76 ~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~~~~v~~a~~e~k~~~~~ 147 (147) ..+-++.+.++.+||.+||||||.|.|+.|++.++++....+.+.+.+. -+-++ T Consensus 69 -----------------~~~~~~~v~~~~~Ya~~vEfGT~~~~a~Pfl~pa~~~~~~~~~~~l~~~-l~~a~ 122 (125) T protein:vir:94 69 -----------------HGVVTGRYVARADYSSYNEYGTYRMSAQPFMAPSVAAMTPFFYKAVRDA-LNKAA 122 (125) T ss_pred -----------------CCcEEEEeeCCCCccceeecccccCCCCcccchhHHHHHHHHHHHHHHH-HHHHh Confidence 1123466788999999999999999999999999887655555444442 11222 No 22 >protein:vir:94490 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240680;genbank:gi:66396374;genbank:GeneID:5133754 Probab=99.54 E-value=3.7e-17 Score=110.65 Aligned_cols=107 Identities=17% Similarity=0.171 Sum_probs=92.4 Q ss_pred CCccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhh Q lcl|NC_021331. 
1 MAKNY-TIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIA 79 (147) Q Consensus 1 MAk~~-s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~ 79 (147) ||+-+ .+.+|.+.|+.+.+++.+.+.+.+++.+.++..+++..+|||||.||+||.+.+.. T Consensus 1 Ma~~~~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~------------------ 62 (137) T protein:vir:94 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKD------------------ 62 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEeec------------------ Confidence 99965 68899999999999999999999999999999999999999999999999865321 Q ss_pred hHHHHHHHHHhcccccceEEEecCchhhhhhhcCC-----------------------------CCCCCchHHHHHHHHH Q lcl|NC_021331. 80 EGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGH-----------------------------SKQAPAGVLGIVAVKL 130 (147) Q Consensus 80 ~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~-----------------------------s~QAp~G~V~~a~~~~ 130 (147) .|-+.-|.++++||.++|||| ++|.|+.|++.++++. T Consensus 63 --------------~~~~~~V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~ 128 (137) T protein:vir:94 63 --------------SGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAG 128 (137) T ss_pred --------------CceEEEEecCCCcccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHH Confidence 022345779999999999998 5688889999999888 Q ss_pred HHHHHHHHH Q lcl|NC_021331. 131 RSYMAEAIK 139 (147) Q Consensus 131 ~~~v~~a~~ 139 (147) ...|.+-+. T Consensus 129 ~~~~~~~l~ 137 (137) T protein:vir:94 129 RAFFNKYFS 137 (137) T ss_pred HHHHHHhhC Confidence 888777776 No 23 >protein:vir:97427 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240753;genbank:gi:66396447;genbank:GeneID:5133783 Probab=99.54 E-value=3.7e-17 Score=110.65 Aligned_cols=107 Identities=17% Similarity=0.171 Sum_probs=92.4 Q ss_pred CCccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhh Q lcl|NC_021331. 
1 MAKNY-TIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIA 79 (147) Q Consensus 1 MAk~~-s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~ 79 (147) ||+-+ .+.+|.+.|+.+.+++.+.+.+.+++.+.++..+++..+|||||.||+||.+.+.. T Consensus 1 Ma~~~~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~------------------ 62 (137) T protein:vir:97 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKD------------------ 62 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEeec------------------ Confidence 99965 68899999999999999999999999999999999999999999999999865321 Q ss_pred hHHHHHHHHHhcccccceEEEecCchhhhhhhcCC-----------------------------CCCCCchHHHHHHHHH Q lcl|NC_021331. 80 EGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGH-----------------------------SKQAPAGVLGIVAVKL 130 (147) Q Consensus 80 ~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~-----------------------------s~QAp~G~V~~a~~~~ 130 (147) .|-+.-|.++++||.++|||| ++|.|+.|++.++++. T Consensus 63 --------------~~~~~~V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~ 128 (137) T protein:vir:97 63 --------------SGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAG 128 (137) T ss_pred --------------CceEEEEecCCCcccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHH Confidence 022345779999999999998 5688889999999888 Q ss_pred HHHHHHHHH Q lcl|NC_021331. 131 RSYMAEAIK 139 (147) Q Consensus 131 ~~~v~~a~~ 139 (147) ...|.+-+. T Consensus 129 ~~~~~~~l~ 137 (137) T protein:vir:97 129 RAFFNKYFS 137 (137) T ss_pred HHHHHHhhC Confidence 888777776 No 24 >protein:vir:93738 Length: 137 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240463;genbank:gi:66396153;genbank:GeneID:5133507 Probab=99.54 E-value=3.7e-17 Score=110.65 Aligned_cols=107 Identities=17% Similarity=0.171 Sum_probs=92.4 Q ss_pred CCccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhh Q lcl|NC_021331. 
1 MAKNY-TIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIA 79 (147) Q Consensus 1 MAk~~-s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~ 79 (147) ||+-+ .+.+|.+.|+.+.+++.+.+.+.+++.+.++..+++..+|||||.||+||.+.+.. T Consensus 1 Ma~~~~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~------------------ 62 (137) T protein:vir:93 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKD------------------ 62 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEeec------------------ Confidence 99965 68899999999999999999999999999999999999999999999999865321 Q ss_pred hHHHHHHHHHhcccccceEEEecCchhhhhhhcCC-----------------------------CCCCCchHHHHHHHHH Q lcl|NC_021331. 80 EGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGH-----------------------------SKQAPAGVLGIVAVKL 130 (147) Q Consensus 80 ~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~-----------------------------s~QAp~G~V~~a~~~~ 130 (147) .|-+.-|.++++||.++|||| ++|.|+.|++.++++. T Consensus 63 --------------~~~~~~V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~ 128 (137) T protein:vir:93 63 --------------SGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAG 128 (137) T ss_pred --------------CceEEEEecCCCcccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHH Confidence 022345779999999999998 5688889999999888 Q ss_pred HHHHHHHHH Q lcl|NC_021331. 131 RSYMAEAIK 139 (147) Q Consensus 131 ~~~v~~a~~ 139 (147) ...|.+-+. T Consensus 129 ~~~~~~~l~ 137 (137) T protein:vir:93 129 RAFFNKYFS 137 (137) T ss_pred HHHHHHhhC Confidence 888777776 No 25 >protein:vir:96829 Length: 135 # NCBI annotation: ORF033 # Family: family:all:180 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240161;genbank:gi:66395838;genbank:GeneID:5133170 Probab=99.52 E-value=6.5e-17 Score=109.32 Aligned_cols=107 Identities=19% Similarity=0.148 Sum_probs=92.3 Q ss_pred CCcc-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhh Q lcl|NC_021331. 
1 MAKN-YTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIA 79 (147) Q Consensus 1 MAk~-~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~ 79 (147) ||+. .-+.+|.+.|+++.+++.+.+++.+++.+.++.+.++..+|||||.||+||.+.+.. T Consensus 1 Ma~~~~Gl~~l~~~l~~~~~~~~~~~~~al~~~a~~v~~~ak~~apvdTG~Lr~SI~~~~~~------------------ 62 (135) T protein:vir:96 1 MAKVKYGADSIVVDLEKYSKDMEKWVKKGITKTTLKIYNTAIHLMPVDTGFLRQSTTVDFEN------------------ 62 (135) T ss_pred CchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeEEeec------------------ Confidence 9986 478899999999999999999999999999999999999999999999999864311 Q ss_pred hHHHHHHHHHhcccccceEEEecCchhhhhhhcCC---------------------------CCCCCchHHHHHHHHHHH Q lcl|NC_021331. 80 EGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGH---------------------------SKQAPAGVLGIVAVKLRS 132 (147) Q Consensus 80 ~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~---------------------------s~QAp~G~V~~a~~~~~~ 132 (147) .|-+.-|.++++||.++|||| +.+.|+.|++.++++... T Consensus 63 --------------~g~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~a~pfl~~A~~~~~~ 128 (135) T protein:vir:96 63 --------------GGFTGVVKIGSNYAVYVNYGTGIYATKGSRAHKIPWTYKDPNGKWHTTYGQMPQPFWEPAIDAGRQ 128 (135) T ss_pred --------------CcEEEEEecCCCccchhhcccccccCCCccccccccccccCCcceeecCCcCCCcchhHHHHHHHH Confidence 122345779999999999998 558899999999988888 Q ss_pred HHHHHHH Q lcl|NC_021331. 133 YMAEAIK 139 (147) Q Consensus 133 ~v~~a~~ 139 (147) .|.+.+. T Consensus 129 ~~~~~i~ 135 (135) T protein:vir:96 129 TFEQYFS 135 (135) T ss_pred HHHHhcC Confidence 8777777 No 26 >protein:vir:94108 Length: 149 # NCBI annotation: ORF029 # Family: family:all:180 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240238;genbank:gi:66395914;genbank:GeneID:5133277 Probab=99.52 E-value=4.7e-17 Score=110.12 Aligned_cols=107 Identities=20% Similarity=0.137 Sum_probs=91.7 Q ss_pred CCccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhh Q lcl|NC_021331. 
1 MAKNY-TIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIA 79 (147) Q Consensus 1 MAk~~-s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~ 79 (147) ||+.. -+++|.+.|+++.+++.+.+.+.+++.+.++..+.+..+|||||+||+||++.+.. T Consensus 13 Ma~~~~Gld~l~~~L~~~~~~~~~~~~~al~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~------------------ 74 (149) T protein:vir:94 13 MAKVKYGADSMVVELDKFDKKIEEWVKKGIAKTTTKIYNTAVALAPVDLGFLEESIDFKYFD------------------ 74 (149) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhhcCeeEEeeC------------------ Confidence 99853 68899999999999999999999999999999999999999999999999875310 Q ss_pred hHHHHHHHHHhcccccceEEEecCchhhhhhhcCC-----------------------------CCCCCchHHHHHHHHH Q lcl|NC_021331. 80 EGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGH-----------------------------SKQAPAGVLGIVAVKL 130 (147) Q Consensus 80 ~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~-----------------------------s~QAp~G~V~~a~~~~ 130 (147) .+-+..|.++++||.++|||| ++|.|+.|++.|+.+- T Consensus 75 --------------~g~~~~V~~~~~YA~~VE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~ 140 (149) T protein:vir:94 75 --------------GGLSSVISVGADYAIYVEYGTGIYATGPGGSRATKIPWSFKGDDGEWYTTYGQAPQPFWNPAIDAG 140 (149) T ss_pred --------------CcEEEEEecCCCcccccccCccccccCCCccccccccceeecCccceecCCCCCCCcchHHHHHHH Confidence 123466889999999999997 4577889999999888 Q ss_pred HHHHHHHHH Q lcl|NC_021331. 131 RSYMAEAIK 139 (147) Q Consensus 131 ~~~v~~a~~ 139 (147) ...|.+.+. T Consensus 141 ~~~i~~~i~ 149 (149) T protein:vir:94 141 RKTFEQYFS 149 (149) T ss_pred HHHHHHhhC Confidence 777777776 No 27 >protein:vir:94796 Length: 137 # NCBI annotation: ORF050 # Family: family:all:180 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240540;genbank:gi:66396237;genbank:GeneID:5133576 Probab=99.52 E-value=7.9e-17 Score=108.86 Aligned_cols=107 Identities=18% Similarity=0.192 Sum_probs=92.1 Q ss_pred CCcc-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhh Q lcl|NC_021331. 
1 MAKN-YTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIA 79 (147) Q Consensus 1 MAk~-~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~ 79 (147) ||+- .-+.+|.+.|+++.+++++.+...+++.+.++.++++..+|||||+||+||++.+.. T Consensus 1 Ma~~~~G~~~l~~~L~~~~~~~~~~~~~al~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~------------------ 62 (137) T protein:vir:94 1 MAKVKYGNWDLVKELENYERDIERWVKRGIAKTTVKIHNTIISLMPVDTGYLRESVTMDFKD------------------ 62 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCceeEeec------------------ Confidence 9995 378899999999999999999999999999999999999999999999999865311 Q ss_pred hHHHHHHHHHhcccccceEEEecCchhhhhhhcCC-----------------------------CCCCCchHHHHHHHHH Q lcl|NC_021331. 80 EGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGH-----------------------------SKQAPAGVLGIVAVKL 130 (147) Q Consensus 80 ~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~-----------------------------s~QAp~G~V~~a~~~~ 130 (147) .+-++.|.++++||.++|||| ++|.|+.|++.++++. T Consensus 63 --------------~~~~~~V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~ 128 (137) T protein:vir:94 63 --------------GGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAG 128 (137) T ss_pred --------------CcEEEEEecCCCcccccccCccccccCCCcccccccccceeccCCceeecCCcCCCcchHHHHHHH Confidence 123466789999999999994 4688888999999888 Q ss_pred HHHHHHHHH Q lcl|NC_021331. 131 RSYMAEAIK 139 (147) Q Consensus 131 ~~~v~~a~~ 139 (147) ...|.+-+. 
T Consensus 129 ~~~~~~~l~ 137 (137) T protein:vir:94 129 RVFFNKYFS 137 (137) T ss_pred HHHHHHhhC Confidence 888777777 No 28 >protein:vir:5978 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690678;genbank:geneid:6329146;genbank:gi:22855072;interpro:IPR011693;uniprot:O48447;genbank:GeneID:955318 Probab=99.52 E-value=1.2e-16 Score=107.93 Aligned_cols=111 Identities=18% Similarity=0.187 Sum_probs=93.9 Q ss_pred CCc---ccc---hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCc Q lcl|NC_021331. 1 MAK---NYT---IREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHG 74 (147) Q Consensus 1 MAk---~~s---~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G 74 (147) ||+ .++ +..|.++|+++.+.+.+.+++.+++.|.++.+.++..+|||||+||+||.+.+.. T Consensus 1 m~~ms~~i~~~g~~~l~~~l~~~~~~~~~~v~~~l~~~a~~i~~~ak~~apv~TG~Lr~SI~~~~~~------------- 67 (144) T protein:vir:59 1 MALMSVRIDPSWRRIMSRNVRTFSGHVLTQVEQVIIKTAEKIAGLAASLAPVDEGNLKNSIQIDYKN------------- 67 (144) T ss_pred CCcceeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcCeeEEeec------------- Confidence 655 343 4578889999999999999999999999999999999999999999999875311 Q ss_pred chhhhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCC---------------------------CCCCCchHHHHHH Q lcl|NC_021331. 75 DKTIAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGH---------------------------SKQAPAGVLGIVA 127 (147) Q Consensus 75 ~~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~---------------------------s~QAp~G~V~~a~ 127 (147) .|.+.-|.++++||.++|||| +++.|+.|++.++ T Consensus 68 -------------------~g~~~~V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~ 128 (144) T protein:vir:59 68 -------------------NGLTAEITVGAEYAIYVEYGTGIYAVDGNGRKTPWTYYSPKLGRYVRTQGAPAQPFFWPAV 128 (144) T ss_pred -------------------CcEEEEEecCCCccchhhcCccccccCCCccccccccccccccceecCCCCCCCcchhHHH Confidence 123456789999999999997 3577888999999 Q ss_pred HHHHHHHHHHHHHHHh Q lcl|NC_021331. 
128 VKLRSYMAEAIKESRA 143 (147) Q Consensus 128 ~~~~~~v~~a~~e~k~ 143 (147) +.-...|.+.++++-| T Consensus 129 ~~~~~~~~~~i~~~~g 144 (144) T protein:vir:59 129 EEGGEYFEREMRRLRG 144 (144) T ss_pred HHHHHHHHHHHHHhcC Confidence 9999999988888888 No 29 >protein:vir:102338 Length: 116 # NCBI annotation: hypothetical protein # Family: family:all:26573 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529563;genbank:gi:90592648;genbank:GeneID:3974470 Probab=99.50 E-value=3.9e-17 Score=110.55 Aligned_cols=93 Identities=18% Similarity=0.329 Sum_probs=68.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCc---ccchhcccceeccCCccccccccCCCCcchhhhhHHHHHHHHHhcccccce Q lcl|NC_021331. 21 VDSGLKDCVELFAEKVHTDLVKRSPV---DTGRYRGNWQVTANKPPLYALNQYDKHGDKTIAEGKRAIYAILRGGGAVRA 97 (147) Q Consensus 21 v~~~~~~~~r~~a~~l~~~vv~~tPV---dtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~~~~~~i~~~~~~~~~g~~ 97 (147) ++..++++++++|.++++.++.+||| |||.||.||.++- + .+.+++ T Consensus 1 l~~~~~~~~~~~a~~l~~~vk~rTPv~~~d~G~LR~sW~~g~----------v---------------------~k~~~~ 49 (116) T protein:vir:10 1 MSKNLRRAKNNIGNKLLRKVKPKTPVAKIDGGTARKSWKYKE----------L---------------------NLFDGV 49 (116) T ss_pred CchHHHHHHHHHHHHHHHHHHhhCCCCcCCCcccccCceeee----------e---------------------eccCce Confidence 66677888999999999999999998 6799999998741 1 133445 Q ss_pred EEEecCchhhhhhhcCCCCCCCch-------------HH------HHHHHHHHHHHHHHHHHHHhhhcC Q lcl|NC_021331. 
98 IYFSNMLIYANALEYGHSKQAPAG-------------VL------GIVAVKLRSYMAEAIKESRAKNAL 147 (147) Q Consensus 98 iyi~Nn~pYA~~LEyG~s~QAp~G-------------~V------~~a~~~~~~~v~~a~~e~k~~~~~ 147 (147) || ||++||+++||||+++...| || +.|.+++.+++.+.+++.=.+ .| T Consensus 50 v~--N~~eYA~~VE~GHRq~~g~g~~~~~~gkrlk~~~V~G~fml~~s~~e~~~~~~~~~~~~~~~-~l 115 (116) T protein:vir:10 50 VS--NNVEYIHHLEYGHRTRQGTGTSENYRPKPNGISFVPGVFMLARSVDEMSSIIDDELNQIIID-FW 115 (116) T ss_pred ee--cCCcccccccCCceeeCCcceecccccccccCCccCceehHHHHHHHHHHHHHHHHHHHHHH-hc Confidence 54 99999999999999887764 44 677777766665444433111 22 No 30 >protein:vir:107099 Length: 137 # NCBI annotation: conserved phage protein # Family: family:all:180 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950610;genbank:gi:119953690;genbank:GeneID:4643108 Probab=99.49 E-value=1.4e-16 Score=107.48 Aligned_cols=107 Identities=19% Similarity=0.153 Sum_probs=89.5 Q ss_pred CCccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhh Q lcl|NC_021331. 1 MAKNY-TIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIA 79 (147) Q Consensus 1 MAk~~-s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~ 79 (147) ||+.+ -+.+|.+.|.++.+++.+.+...+++.+.++.++++..+|||||+||+||.+.+.. T Consensus 1 Ma~~~~Gl~~l~~~l~~~~~~~~~~~~~al~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~------------------ 62 (137) T protein:vir:10 1 MAKVKYGNWELVKELEDFEKETIRWAKKGIAKTTTIIHNSIVSNMPVDTGYLRESVSMDFKK------------------ 62 (137) T ss_pred CchhHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCeeEEeeC------------------ Confidence 99964 78899999999999999999999999999999999999999999999999864311 Q ss_pred hHHHHHHHHHhcccccceEEEecCchhhhhhhcCC-----------------------------CCCCCchHHHHHHHHH Q lcl|NC_021331. 
80 EGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGH-----------------------------SKQAPAGVLGIVAVKL 130 (147) Q Consensus 80 ~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~-----------------------------s~QAp~G~V~~a~~~~ 130 (147) .|-+++|.++++||.++|||+ ++|.|+.|++.++.+- T Consensus 63 --------------~~~~~~V~~~~~Ya~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~ 128 (137) T protein:vir:10 63 --------------GGLTGVINIGSEYAVYVNYGTGIYAVGPGGSRAKNIPWCYKDADGHWHTTKGQHAQPFWEPAIDEG 128 (137) T ss_pred --------------CcEEEEEecCCCcccccccCccccccCCCccccccccceeeccccceeccCCCCCCcchhHHHHHH Confidence 122467889999999999995 4577888888888776 Q ss_pred HHHHHHHHH Q lcl|NC_021331. 131 RSYMAEAIK 139 (147) Q Consensus 131 ~~~v~~a~~ 139 (147) ...|.+-+. T Consensus 129 ~~~i~k~i~ 137 (137) T protein:vir:10 129 RAFFNKYFS 137 (137) T ss_pred HHHHHHhcC Confidence 666665555 No 31 >protein:vir:743 Length: 108 # NCBI annotation: unknown # Family: family:all:180 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108720;genbank:gi:13487842;genbank:GeneID:920877 Probab=99.49 E-value=1.7e-16 Score=107.04 Aligned_cols=104 Identities=14% Similarity=0.149 Sum_probs=81.5 Q ss_pred cc---hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhhhH Q lcl|NC_021331. 5 YT---IREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIAEG 81 (147) Q Consensus 5 ~s---~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~~~ 81 (147) ++ +.+|.+.|.+- ..++.+.+.+++.+..+..+++.++|||||.||+||.+.+.. T Consensus 1 i~i~Gld~l~~~l~~~--~~~~~~~~al~~~a~~i~~~ak~~aPv~TG~Lr~si~~~~~~-------------------- 58 (108) T protein:vir:74 1 MKITGIDALQKKLRKN--ATLDDVKHVVKSNTASMNKNMQNLAPVDTGNMKRSITSEFTD-------------------- 58 (108) T ss_pred CcchhHHHHHHHHHHh--hhHHHHHHHHHHHHHHHHHHHHHhCCCCchhhhccceeeeec-------------------- Confidence 44 44555555542 245668899999999999999999999999999999875321 Q ss_pred HHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCchHHHHHHHHHHHHHHHHHHHH-H Q lcl|NC_021331. 
82 KRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGVLGIVAVKLRSYMAEAIKES-R 142 (147) Q Consensus 82 ~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~~~~v~~a~~e~-k 142 (147) .+.++.|.++.+||.+|||||+.|.|+.|++.++......+.+.+.++ | T Consensus 59 ------------~~~~~~V~~~~~Ya~~vE~GT~km~aqpf~~pa~~~~~~~~~~~i~~~~k 108 (108) T protein:vir:74 59 ------------GGLSGTTGPHTDYAGYVEYGTRFQSAQPFVKPAFNIQKKVFTNDLERLTK 108 (108) T ss_pred ------------CceEEEeecCCCcccceeccccccCCCcchhhHHHHHHHHHHHHHHHHcC Confidence 123566789999999999999999999999999988766666555544 5 No 32 >protein:vir:105916 Length: 149 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004379;genbank:gi:122891834;genbank:GeneID:4712387 Probab=99.49 E-value=1.1e-16 Score=108.08 Aligned_cols=107 Identities=21% Similarity=0.167 Sum_probs=91.3 Q ss_pred CCcc-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhh Q lcl|NC_021331. 1 MAKN-YTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIA 79 (147) Q Consensus 1 MAk~-~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~ 79 (147) ||+- +-+.+|.+.|+++.+++.+.+.+.+++.+.++.++.+..+|||||.||+||.+.+.. T Consensus 13 Ma~v~~Gld~l~~~l~~~~~~~~~~~~~~l~~~a~~v~~~ak~~aPvdTG~L~~SI~~~~~~------------------ 74 (149) T protein:vir:10 13 MAKVKYGADSMVVELDKFDKKIEEWVKKGIAKTTTKIYNTAVALAPVDLGFLEESIDFKYFD------------------ 74 (149) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhhccceEEecC------------------ Confidence 9985 368899999999999999999999999999999999999999999999999875311 Q ss_pred hHHHHHHHHHhcccccceEEEecCchhhhhhhcCC-----------------------------CCCCCchHHHHHHHHH Q lcl|NC_021331. 
80 EGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGH-----------------------------SKQAPAGVLGIVAVKL 130 (147) Q Consensus 80 ~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~-----------------------------s~QAp~G~V~~a~~~~ 130 (147) .+-+..|.++++||.++|||| ++|.|+.|++.++.+- T Consensus 75 --------------~g~~~~V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~ 140 (149) T protein:vir:10 75 --------------GGLSSVISVGADYAIYVEYGTGIYATGPGGSRATKIPWSFKGDDGEWYTTYGQAPQPFWNPAIDAG 140 (149) T ss_pred --------------CcEEEEEecCCCcccccccCccccccCCcccccccccceeeccccceecCCCCCCCcchhHHHHHH Confidence 122456789999999999997 4467888999999888 Q ss_pred HHHHHHHHH Q lcl|NC_021331. 131 RSYMAEAIK 139 (147) Q Consensus 131 ~~~v~~a~~ 139 (147) ...|.+.+. T Consensus 141 k~~i~~~i~ 149 (149) T protein:vir:10 141 RKTFEQYFS 149 (149) T ss_pred HHHHHHhhC Confidence 887777777 No 33 >protein:vir:105330 Length: 137 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950673;genbank:gi:119967843;genbank:GeneID:4643209 Probab=99.48 E-value=2.4e-16 Score=106.19 Aligned_cols=107 Identities=18% Similarity=0.151 Sum_probs=89.3 Q ss_pred CCccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhh Q lcl|NC_021331. 1 MAKNY-TIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIA 79 (147) Q Consensus 1 MAk~~-s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~ 79 (147) ||+.. .+.+|.+.|++..+.+...++..+++.+.++.++++..+|||||.||+||++.+.. T Consensus 1 Ma~~~~G~~~l~~~l~~~~~~~~~~~~~al~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~------------------ 62 (137) T protein:vir:10 1 MAKVKYGNWDLVKELEEFEKETIRWAKKGIAKTTTIIHNSIVSNMPVDTGYLRESVSMDFKK------------------ 62 (137) T ss_pred CccchhCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCeeeEecC------------------ Confidence 99964 78999999999999999999999999999999999999999999999999875311 Q ss_pred hHHHHHHHHHhcccccceEEEecCchhhhhhhcCC-----------------------------CCCCCchHHHHHHHHH Q lcl|NC_021331. 
80 EGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGH-----------------------------SKQAPAGVLGIVAVKL 130 (147) Q Consensus 80 ~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~-----------------------------s~QAp~G~V~~a~~~~ 130 (147) .|-+.+|.++++||.++|||+ ++|.|+.|++.|+.+- T Consensus 63 --------------~~~~~~V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~Pfl~pA~~~~ 128 (137) T protein:vir:10 63 --------------GGLTGVINIGSEYAVYVNYGTGIYAVGPGGSRAKNIPWRYKDADGHWHTTKGQHAQPFWEPAIDEG 128 (137) T ss_pred --------------CcEEEEEecCCccccccccCccccccCCCcccccccceeeeccccccccCCCCCCCcchhHHHHHH Confidence 122466889999999999995 3477888888888776 Q ss_pred HHHHHHHHH Q lcl|NC_021331. 131 RSYMAEAIK 139 (147) Q Consensus 131 ~~~v~~a~~ 139 (147) ...|.+-+. T Consensus 129 ~~~i~k~i~ 137 (137) T protein:vir:10 129 RAFFNKYFS 137 (137) T ss_pred HHHHHHhhC Confidence 666665555 No 34 >protein:vir:96486 Length: 112 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238496;genbank:gi:66391772;genbank:GeneID:5176908 Probab=99.46 E-value=3.6e-16 Score=105.27 Aligned_cols=108 Identities=15% Similarity=0.131 Sum_probs=88.3 Q ss_pred CCc-ccc-hHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcch Q lcl|NC_021331. 1 MAK-NYT-IREFHGNIDAWI--NAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDK 76 (147) Q Consensus 1 MAk-~~s-~~~F~~~i~~f~--~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~ 76 (147) ||+ .|+ +.++.+.|.+.. +.+++.+.+...+++.++.+..+...|||||.||.|.+++.+ T Consensus 1 Ma~i~i~Gld~L~~~l~~~~~~~~v~~~v~~~~~~~~~~~~~~a~~~apvdTG~Lr~sI~~~~~---------------- 64 (112) T protein:vir:96 1 MATIEFEGLDEMAQSLLKNASSERRSKVLRKYGAKLKEAAVSKAQFKKGYSTGATRRSITLEAG---------------- 64 (112) T ss_pred CceeeehHHHHHHHHHHhhcCHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCchhhhhceeeecC---------------- Confidence 996 342 677777777663 466777777777777788888888899999999999876411 Q ss_pred hhhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCchHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021331. 
77 TIAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGVLGIVAVKLRSYMAEAIKESR 142 (147) Q Consensus 77 t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~~~~v~~a~~e~k 142 (147) |.++.|..+.+||.+||||+|.++++.|++.+++.-...|.+.++.+. T Consensus 65 ------------------~~~~~v~~~~~Ya~~vE~GTr~m~AqPF~~PA~~~~~~~~~~~l~~L~ 112 (112) T protein:vir:96 65 ------------------SDRAVVEALTNYSGYLEVGTRKMEAQPFMRPALDQVVPEMVEEMAKWE 112 (112) T ss_pred ------------------ceEEEecCCCCccceeccCccccCCCCchhhhHHHHHHHHHHHHHhcC Confidence 345667788999999999999999999999999999999998888888 No 35 >protein:vir:98409 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:YP_001210363;genbank:gi:146334932;genbank:GeneID:5114801 Probab=99.44 E-value=7.4e-16 Score=103.52 Aligned_cols=104 Identities=14% Similarity=0.129 Sum_probs=80.3 Q ss_pred cc---hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhhhH Q lcl|NC_021331. 5 YT---IREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIAEG 81 (147) Q Consensus 5 ~s---~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~~~ 81 (147) +. +++|.+.|.+.. .+..+...+++.+..+..+++..+|||||.||+||.+.+.. T Consensus 1 i~i~Gld~l~~~l~~~~--~~~~~~~al~~~a~~i~~~ak~~apvdTG~Lr~si~~~~~~-------------------- 58 (108) T protein:vir:98 1 MKITGIDALQKKLRKNA--TLNDVKHVVKRNTVSMNKNMQNLAPVDTGNMKRSITSEFTD-------------------- 58 (108) T ss_pred CcchhHHHHHHHHHHhh--hHHHHHHHHHHHHHHHHHHHHHhCCCCchhhHhhceeeeec-------------------- Confidence 33 455666665532 45667889999999999999999999999999999865311 Q ss_pred HHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCchHHHHHHHHHHHHHHHHHHHH-H Q lcl|NC_021331. 
82 KRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGVLGIVAVKLRSYMAEAIKES-R 142 (147) Q Consensus 82 ~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~~~~v~~a~~e~-k 142 (147) .+-++.|.++.+||.+|||||+.|.|+.|++.+++.....+.+.++++ | T Consensus 59 ------------~~~~~~V~~~~~Ya~~vE~GT~~m~aqPFl~pa~~~~~~~~~~~i~~~lr 108 (108) T protein:vir:98 59 ------------GGLTGTTIPHTDYAGYVEYGTRFQAAQPFVKPAFDVQKKIFTNDLERLTK 108 (108) T ss_pred ------------CceEEEeecCCCccceeeccccccCCCcchhhHHHHHHHHHHHHHHHHcC Confidence 122466789999999999999999999999999987765555544443 4 No 36 >protein:vir:81147 Length: 126 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285816;genbank:gi:148747737;genbank:GeneID:5247190 Probab=99.37 E-value=4.9e-15 Score=99.06 Aligned_cols=115 Identities=23% Similarity=0.337 Sum_probs=88.8 Q ss_pred CCcccchHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcch Q lcl|NC_021331. 1 MAKNYTIREFHGN----IDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDK 76 (147) Q Consensus 1 MAk~~s~~~F~~~----i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~ 76 (147) ||+ ++|.+|++. |+.|.+.+.+.+++.+++++.++..++...+|++||.|+.||.++... ..|. T Consensus 1 Ma~-i~id~la~~I~~~L~~y~~~v~~~v~~~v~~~a~~~~~~ik~~aP~rTG~y~ksw~vk~~~----------~~g~- 68 (126) T protein:vir:81 1 MAN-ITIDRLADELLQAVKEYTDDVAEGVRKKVDETARKVLKEAQALAPKRTGEYARTFTITKED----------GYGT- 68 (126) T ss_pred Ccc-cchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcccchhhccccccccc----------cCCc- Confidence 996 888776554 888999999999999999999999999999999999999999886311 0111 Q ss_pred hhhhHHHHHHHHHhcccccceEEEecCchh--hhhhhcCCCC-----CCCchHHHHHHHHHHHHHHHHHHHHHhhhc Q lcl|NC_021331. 
77 TIAEGKRAIYAILRGGGAVRAIYFSNMLIY--ANALEYGHSK-----QAPAGVLGIVAVKLRSYMAEAIKESRAKNA 146 (147) Q Consensus 77 t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pY--A~~LEyG~s~-----QAp~G~V~~a~~~~~~~v~~a~~e~k~~~~ 146 (147) +.+| +.|+..| +.-|||||.+ .++..|++.+.+...+-+.+.+++.=.-=| T Consensus 69 ------------------~~~v-v~~~~~~~l~HLLEfGha~r~gGrV~a~Phi~Pa~e~~~~~~~~~i~~~l~~gg 126 (126) T protein:vir:81 69 ------------------TKRI-IWNKKHYRRVHLLEFGHAKVNGGRVKEYPHLRPAYDKHGARLPDELKRVIENGG 126 (126) T ss_pred ------------------ceEE-EeccCCCCceeeeecceecCCCCccCCCcchHHHHHHHHHHHHHHHHHHhhcCC Confidence 1112 2244445 6789999998 488999999999888888877777633333 No 37 >protein:vir:103917 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873996;genbank:gi:118430771;genbank:GeneID:4525409 Probab=99.37 E-value=6.6e-15 Score=98.32 Aligned_cols=107 Identities=14% Similarity=0.117 Sum_probs=88.1 Q ss_pred ccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CcccchhcccceeccCCccccccccCCCCcc Q lcl|NC_021331. 3 KNY-TIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRS------PVDTGRYRGNWQVTANKPPLYALNQYDKHGD 75 (147) Q Consensus 3 k~~-s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~t------PVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~ 75 (147) -.| -+++|.+.|....+.++..+...+++.+.++.......+ |||||.||.|+.++... T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g-------------- 66 (115) T protein:vir:10 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTG-------------- 66 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecC-------------- Confidence 223 267788888888888888899999999999999998876 99999999999875211 Q ss_pred hhhhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCchHHHHHHHHHHHHHHHHHHHH-H Q lcl|NC_021331. 
76 KTIAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGVLGIVAVKLRSYMAEAIKES-R 142 (147) Q Consensus 76 ~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~~~~v~~a~~e~-k 142 (147) +.++.|..+.+||.+||||+|.+.|..|++.+++.....|.+.++++ | T Consensus 67 -------------------~~~~~v~~~~~Ya~~vE~GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:10 67 -------------------DLQYTITSHAAYSGFLEFGTRYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred -------------------ceEEEeecCccchhhhcccccccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 22345677899999999999999999999999998887777777766 5 No 38 >protein:vir:96225 Length: 115 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239574;genbank:gi:66395330;genbank:GeneID:5132773 Probab=99.37 E-value=6.6e-15 Score=98.32 Aligned_cols=107 Identities=14% Similarity=0.117 Sum_probs=88.1 Q ss_pred ccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CcccchhcccceeccCCccccccccCCCCcc Q lcl|NC_021331. 3 KNY-TIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRS------PVDTGRYRGNWQVTANKPPLYALNQYDKHGD 75 (147) Q Consensus 3 k~~-s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~t------PVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~ 75 (147) -.| -+++|.+.|....+.++..+...+++.+.++.......+ |||||.||.|+.++... T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g-------------- 66 (115) T protein:vir:96 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTG-------------- 66 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecC-------------- Confidence 223 267788888888888888899999999999999998876 99999999999875211 Q ss_pred hhhhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCchHHHHHHHHHHHHHHHHHHHH-H Q lcl|NC_021331. 
76 KTIAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGVLGIVAVKLRSYMAEAIKES-R 142 (147) Q Consensus 76 ~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~~~~v~~a~~e~-k 142 (147) +.++.|..+.+||.+||||+|.+.|..|++.+++.....|.+.++++ | T Consensus 67 -------------------~~~~~v~~~~~Ya~~vE~GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:96 67 -------------------DLQYTITSHAAYSGFLEFGTRYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred -------------------ceEEEeecCccchhhhcccccccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 22345677899999999999999999999999998887777777766 5 No 39 >protein:vir:9312 Length: 115 # NCBI annotation: phi Mu50B-like protein # Family: family:all:180 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803290;genbank:gi:29028600;genbank:GeneID:1258048 Probab=99.37 E-value=6.6e-15 Score=98.32 Aligned_cols=107 Identities=14% Similarity=0.117 Sum_probs=88.1 Q ss_pred ccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CcccchhcccceeccCCccccccccCCCCcc Q lcl|NC_021331. 3 KNY-TIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRS------PVDTGRYRGNWQVTANKPPLYALNQYDKHGD 75 (147) Q Consensus 3 k~~-s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~t------PVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~ 75 (147) -.| -+++|.+.|....+.++..+...+++.+.++.......+ |||||.||.|+.++... T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g-------------- 66 (115) T protein:vir:93 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTG-------------- 66 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecC-------------- Confidence 223 267788888888888888899999999999999998876 99999999999875211 Q ss_pred hhhhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCchHHHHHHHHHHHHHHHHHHHH-H Q lcl|NC_021331. 
76 KTIAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGVLGIVAVKLRSYMAEAIKES-R 142 (147) Q Consensus 76 ~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~~~~v~~a~~e~-k 142 (147) +.++.|..+.+||.+||||+|.+.|..|++.+++.....|.+.++++ | T Consensus 67 -------------------~~~~~v~~~~~Ya~~vE~GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:93 67 -------------------DLQYTITSHAAYSGFLEFGTRYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred -------------------ceEEEeecCccchhhhcccccccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 22345677899999999999999999999999998887777777766 5 No 40 >protein:vir:96358 Length: 115 # NCBI annotation: ORF045 # Family: family:all:180 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239651;genbank:gi:66395408;genbank:GeneID:5132834 Probab=99.37 E-value=6.6e-15 Score=98.32 Aligned_cols=107 Identities=14% Similarity=0.117 Sum_probs=88.1 Q ss_pred ccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CcccchhcccceeccCCccccccccCCCCcc Q lcl|NC_021331. 3 KNY-TIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRS------PVDTGRYRGNWQVTANKPPLYALNQYDKHGD 75 (147) Q Consensus 3 k~~-s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~t------PVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~ 75 (147) -.| -+++|.+.|....+.++..+...+++.+.++.......+ |||||.||.|+.++... T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g-------------- 66 (115) T protein:vir:96 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTG-------------- 66 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecC-------------- Confidence 223 267788888888888888899999999999999998876 99999999999875211 Q ss_pred hhhhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCchHHHHHHHHHHHHHHHHHHHH-H Q lcl|NC_021331. 
76 KTIAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGVLGIVAVKLRSYMAEAIKES-R 142 (147) Q Consensus 76 ~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~~~~v~~a~~e~-k 142 (147) +.++.|..+.+||.+||||+|.+.|..|++.+++.....|.+.++++ | T Consensus 67 -------------------~~~~~v~~~~~Ya~~vE~GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:96 67 -------------------DLQYTITSHAAYSGFLEFGTRYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred -------------------ceEEEeecCccchhhhcccccccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 22345677899999999999999999999999998887777777766 5 No 41 >protein:vir:78858 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285365;genbank:gi:148717893;genbank:GeneID:5246989 Probab=99.37 E-value=6.6e-15 Score=98.32 Aligned_cols=107 Identities=14% Similarity=0.117 Sum_probs=88.1 Q ss_pred ccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CcccchhcccceeccCCccccccccCCCCcc Q lcl|NC_021331. 3 KNY-TIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRS------PVDTGRYRGNWQVTANKPPLYALNQYDKHGD 75 (147) Q Consensus 3 k~~-s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~t------PVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~ 75 (147) -.| -+++|.+.|....+.++..+...+++.+.++.......+ |||||.||.|+.++... T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g-------------- 66 (115) T protein:vir:78 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTG-------------- 66 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecC-------------- Confidence 223 267788888888888888899999999999999998876 99999999999875211 Q ss_pred hhhhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCchHHHHHHHHHHHHHHHHHHHH-H Q lcl|NC_021331. 
76 KTIAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGVLGIVAVKLRSYMAEAIKES-R 142 (147) Q Consensus 76 ~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~~~~v~~a~~e~-k 142 (147) +.++.|..+.+||.+||||+|.+.|..|++.+++.....|.+.++++ | T Consensus 67 -------------------~~~~~v~~~~~Ya~~vE~GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:78 67 -------------------DLQYTITSHAAYSGFLEFGTRYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred -------------------ceEEEeecCccchhhhcccccccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 22345677899999999999999999999999998887777777766 5 No 42 >protein:vir:97144 Length: 115 # NCBI annotation: ORF047 # Family: family:all:180 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239729;genbank:gi:66394911;genbank:GeneID:5130877 Probab=99.37 E-value=6.6e-15 Score=98.32 Aligned_cols=107 Identities=14% Similarity=0.117 Sum_probs=88.1 Q ss_pred ccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CcccchhcccceeccCCccccccccCCCCcc Q lcl|NC_021331. 3 KNY-TIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRS------PVDTGRYRGNWQVTANKPPLYALNQYDKHGD 75 (147) Q Consensus 3 k~~-s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~t------PVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~ 75 (147) -.| -+++|.+.|....+.++..+...+++.+.++.......+ |||||.||.|+.++... T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g-------------- 66 (115) T protein:vir:97 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTG-------------- 66 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecC-------------- Confidence 223 267788888888888888899999999999999998876 99999999999875211 Q ss_pred hhhhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCchHHHHHHHHHHHHHHHHHHHH-H Q lcl|NC_021331. 
76 KTIAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGVLGIVAVKLRSYMAEAIKES-R 142 (147) Q Consensus 76 ~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~~~~v~~a~~e~-k 142 (147) +.++.|..+.+||.+||||+|.+.|..|++.+++.....|.+.++++ | T Consensus 67 -------------------~~~~~v~~~~~Ya~~vE~GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:97 67 -------------------DLQYTITSHAAYSGFLEFGTRYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred -------------------ceEEEeecCccchhhhcccccccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 22345677899999999999999999999999998887777777766 5 No 43 >protein:vir:4906 Length: 114 # NCBI annotation: gp114 # Family: family:all:180 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056684;genbank:gi:9635019;genbank:GeneID:1262668 Probab=99.36 E-value=3e-15 Score=100.18 Aligned_cols=108 Identities=14% Similarity=0.106 Sum_probs=75.9 Q ss_pred CCcccc---hHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcc Q lcl|NC_021331. 1 MAKNYT---IREFHGNIDAWI--NAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGD 75 (147) Q Consensus 1 MAk~~s---~~~F~~~i~~f~--~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~ 75 (147) ||. ++ +.++.+.|.+.. +.+++.+.+...+++.++.+......|||||.||.||.+++.. T Consensus 1 Ma~-i~~~Gld~l~~~L~~~~~~~~v~~~~~~~~~~~~~~~~~~a~~~~p~~TG~Lr~sI~~~~~~-------------- 65 (114) T protein:vir:49 1 MAT-IEFEGLDEMAQSLLKNASPEKRSKVLRKYGSKLKEAAVNRAQFNKGYSTGATRRSITLQVES-------------- 65 (114) T ss_pred Cee-eeeehHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcccCCCCCchhhhhceeeeecC-------------- Confidence 995 44 455666665542 2334444444444444444444445799999999999875311 Q ss_pred hhhhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCchHHHHHHHHHHHHHHHHHHHH-Hh Q lcl|NC_021331. 76 KTIAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGVLGIVAVKLRSYMAEAIKES-RA 143 (147) Q Consensus 76 ~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~~~~v~~a~~e~-k~ 143 (147) +-..|..+.+||.+||||+|.++|..|++.++..-...+.+.++++ |. 
T Consensus 66 --------------------~~~~V~~~~~Ya~~vEfGT~km~a~Pfl~PA~~~~~~~~~~~l~~l~k~ 114 (114) T protein:vir:49 66 --------------------DKATVEALTSYSGYLEVGTRKMEAQPFMKPALDEVAPKMVEELAKWDET 114 (114) T ss_pred --------------------CeeEecCCCCccceecccccccCCCCchhhhHHHHHHHHHHHHHHHhcC Confidence 1134667899999999999999999999999999988888888777 55 No 44 >protein:vir:2740 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695113;genbank:gi:23455882;genbank:GeneID:955595 Probab=99.36 E-value=3e-15 Score=100.18 Aligned_cols=108 Identities=14% Similarity=0.106 Sum_probs=75.9 Q ss_pred CCcccc---hHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcc Q lcl|NC_021331. 1 MAKNYT---IREFHGNIDAWI--NAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGD 75 (147) Q Consensus 1 MAk~~s---~~~F~~~i~~f~--~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~ 75 (147) ||. ++ +.++.+.|.+.. +.+++.+.+...+++.++.+......|||||.||.||.+++.. T Consensus 1 Ma~-i~~~Gld~l~~~L~~~~~~~~v~~~~~~~~~~~~~~~~~~a~~~~p~~TG~Lr~sI~~~~~~-------------- 65 (114) T protein:vir:27 1 MAT-IEFEGLDEMAQSLLKNASPEKRSKVLRKYGSKLKEAAVNRAQFNKGYSTGATRRSITLQVES-------------- 65 (114) T ss_pred Cee-eeeehHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcccCCCCCchhhhhceeeeecC-------------- Confidence 995 44 455666665542 2334444444444444444444445799999999999875311 Q ss_pred hhhhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCchHHHHHHHHHHHHHHHHHHHH-Hh Q lcl|NC_021331. 76 KTIAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGVLGIVAVKLRSYMAEAIKES-RA 143 (147) Q Consensus 76 ~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~~~~v~~a~~e~-k~ 143 (147) +-..|..+.+||.+||||+|.++|..|++.++..-...+.+.++++ |. 
T Consensus 66 --------------------~~~~V~~~~~Ya~~vEfGT~km~a~Pfl~PA~~~~~~~~~~~l~~l~k~ 114 (114) T protein:vir:27 66 --------------------DKATVEALTSYSGYLEVGTRKMEAQPFMKPALDEVAPKMVEELAKWDET 114 (114) T ss_pred --------------------CeeEecCCCCccceecccccccCCCCchhhhHHHHHHHHHHHHHHHhcC Confidence 1134667899999999999999999999999999988888888777 55 No 45 >protein:vir:99744 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004311;genbank:gi:122891765;genbank:GeneID:4712299 Probab=99.33 E-value=1.1e-14 Score=97.17 Aligned_cols=107 Identities=14% Similarity=0.110 Sum_probs=88.1 Q ss_pred ccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CcccchhcccceeccCCccccccccCCCCcc Q lcl|NC_021331. 3 KNY-TIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRS------PVDTGRYRGNWQVTANKPPLYALNQYDKHGD 75 (147) Q Consensus 3 k~~-s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~t------PVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~ 75 (147) -.| -+++|.+.|++..+.+.+.+...+++.+.++..+++..+ |||||.||.|+.++.+ | T Consensus 1 i~i~Gld~L~~~l~~~~~~~~~~v~~av~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~SI~~~~~-------------g- 66 (115) T protein:vir:99 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKT-------------V- 66 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcchhhhhceeeeec-------------C- Confidence 223 267788889888888999999999999999999998876 9999999999976421 1 Q ss_pred hhhhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCchHHHHHHHHHHHHHHHHHHHH-H Q lcl|NC_021331. 
76 KTIAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGVLGIVAVKLRSYMAEAIKES-R 142 (147) Q Consensus 76 ~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~~~~v~~a~~e~-k 142 (147) +-+..+..+..||.+||||+|.++|+.|++.++......+.+.++++ | T Consensus 67 -------------------~~~~~V~~~~~Ya~~vE~GT~~m~a~PFl~PA~~~~k~~~~~~l~~~~k 115 (115) T protein:vir:99 67 -------------------DLQYTITSHAAYSGFLEFGTRYMEAEPFMWPVYEVIRKSTVEELKTLFE 115 (115) T ss_pred -------------------cEEEEecCCccccccccccccccCCCCcchhhHHHHHHHHHHHHHHHhC Confidence 12355677899999999999999999999999998877777777665 5 No 46 >protein:vir:966 Length: 123 # NCBI annotation: Orf48 # Family: family:all:970 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076620;genbank:gi:13095728;genbank:GeneID:920248 Probab=99.32 E-value=1.4e-14 Score=96.58 Aligned_cols=114 Identities=23% Similarity=0.347 Sum_probs=83.1 Q ss_pred CCcccchHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcch Q lcl|NC_021331. 1 MAKNYTIREF----HGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDK 76 (147) Q Consensus 1 MAk~~s~~~F----~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~ 76 (147) ||++++|.+| ++.|++|.+.+.+.+++.+++++.+++.+|...||++||.++.||.+.... .|. T Consensus 1 m~~~v~id~L~~~i~~~L~~y~~~v~~~v~~~v~~~a~~~~~~lk~~sP~~TG~yaksW~~k~~~-----------~~~- 68 (123) T protein:vir:96 1 MANKISIDDLAKTIESEVRNWTKDVVDDIDDIKKDITKNGVKQLRESSPKRTGDYAKNWTSQKLK-----------NGD- 68 (123) T ss_pred CCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCccccccccceeeeecC-----------Cee- Confidence 9999998888 566778888999999999999999999999999999999999999875311 111 Q ss_pred hhhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCC-----CchHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_021331. 77 TIAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQA-----PAGVLGIVAVKLRSYMAEAIKESRAK 144 (147) Q Consensus 77 t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QA-----p~G~V~~a~~~~~~~v~~a~~e~k~~ 144 (147) --++|-.|.-+.+.-|||||-++. 
+..+++.+.+.+.+.|.+.+++.=.| T Consensus 69 ------------------~~v~~~~~~y~l~HLLE~GHa~r~GGrV~a~phI~paee~~~~~l~~~i~r~l~~ 123 (123) T protein:vir:96 69 ------------------QVIYQKAPTYRLTHLLENGHAKRNGGRVSPKVHIAPVEEELVSNYISRVEKRLSQ 123 (123) T ss_pred ------------------EEEEEecCCcceEEeeecceeecCCceeCcchhhhHHHHHHHHHHHHHHHHHhcC Confidence 012333333445899999997754 33555677776666555555554333 No 47 >protein:vir:106623 Length: 115 # NCBI annotation: ORF049 # Family: family:all:180 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239497;genbank:gi:66395260;genbank:GeneID:4555777 Probab=99.30 E-value=3e-14 Score=94.74 Aligned_cols=108 Identities=14% Similarity=0.115 Sum_probs=87.9 Q ss_pred ccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CcccchhcccceeccCCccccccccCCCCcc Q lcl|NC_021331. 3 KNY-TIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRS------PVDTGRYRGNWQVTANKPPLYALNQYDKHGD 75 (147) Q Consensus 3 k~~-s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~t------PVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~ 75 (147) -.| -+.+|.+.|++..+.+.+.+...+++.+..+...++... |||||.||+|+.++.+ | T Consensus 1 i~i~Gld~L~~~l~~~~~~~~~~~~~al~~~~~~i~~~a~~~a~~~~~~pv~TG~Lr~sI~~~~~-------------g- 66 (115) T protein:vir:10 1 MQSKGLKKLMNHLKVMHDDIEDDVDDILKNNAKEGVGIAVSNAKEVMNKGYWTGNLASLIEVKKI-------------G- 66 (115) T ss_pred CeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcchhhhhceeeeec-------------C- Confidence 122 267788888888888989999999999999999998865 9999999999986421 1 Q ss_pred hhhhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCchHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_021331. 76 KTIAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGVLGIVAVKLRSYMAEAIKESRA 143 (147) Q Consensus 76 ~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~~~~v~~a~~e~k~ 143 (147) +-++.+..+.+||.+||||+|.+++..|++.++++....|.+.++++=. 
T Consensus 67 -------------------~~~~~v~~~~~Ya~~vEfGT~km~a~PFl~PA~~~~k~~~~~~i~~~i~ 115 (115) T protein:vir:10 67 -------------------DLHYRVISTAHYSGFLEFGTRYMEPAPFMFPTYQTLKKSTINDLKRLLS 115 (115) T ss_pred -------------------cEEEEeeCCCccchheecccccCCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 1245577889999999999999999999999999888777777666543 No 48 >protein:vir:78077 Length: 141 # NCBI annotation: gp9 # Family: family:all:180 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468793;genbank:gi:157325374;genbank:GeneID:5601839 Probab=99.23 E-value=1.3e-13 Score=91.28 Aligned_cols=110 Identities=17% Similarity=0.180 Sum_probs=76.9 Q ss_pred CCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHhCCcccchhcccceeccCCccccccccCCCCcc Q lcl|NC_021331. 1 MAKNYTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTD-----LVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGD 75 (147) Q Consensus 1 MAk~~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~-----vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~ 75 (147) |++ + .|...+++..+++++...+.++++++..... .+..+|||||+||+||...+.. T Consensus 1 ~~~-~---~f~~~~~~~~~~~~k~~~~~~~~~a~~~~~~~ie~~ak~~~pvdtG~L~~SI~~~v~~-------------- 62 (141) T protein:vir:78 1 MNE-F---EFDSNIPKARKLIEKKVLQALEDIGEHMTTELAEGGHGVTSNNDTGEYAQKSGYKVRK-------------- 62 (141) T ss_pred Ccc-h---hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccchhhcceeeeeec-------------- Confidence 775 2 4777888887888888888777777765443 3457999999999999754211 Q ss_pred hhhhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCC--------------------------CCCCCchHHHHHHHH Q lcl|NC_021331. 
76 KTIAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGH--------------------------SKQAPAGVLGIVAVK 129 (147) Q Consensus 76 ~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~--------------------------s~QAp~G~V~~a~~~ 129 (147) .|.++.|.|+++||.|+|||+ ++|.|+.|++.|+.+ T Consensus 63 ------------------~g~~~~V~~~~~YA~yVE~GTG~~~~~~~grk~~w~y~~~~g~~~~t~G~~aqpFl~~A~~~ 124 (141) T protein:vir:78 63 ------------------SSKEVIVGNSSDYAIYYEFGTGEKSERGGGKAGGWFYMDKKGHWHFTRGSQASKRMRYTFRD 124 (141) T ss_pred ------------------CCcEEEEecCCCccceeecCCcccccCCCCCcCcceeecCCCeeEeccCCCCchhhhhhHHh Confidence 123456889999999999996 569999999999876 Q ss_pred HHHHHHHHHHHHHhhhc Q lcl|NC_021331. 130 LRSYMAEAIKESRAKNA 146 (147) Q Consensus 130 ~~~~v~~a~~e~k~~~~ 146 (147) -..-|.+.+.+.=..+- T Consensus 125 ~~~~i~~~i~~~~~~l~ 141 (141) T protein:vir:78 125 EQDKVRVFTERALRGIN 141 (141) T ss_pred hHHHHHHHHHHHhhccC Confidence 54444443333322222 No 49 >protein:vir:1243 Length: 116 # NCBI annotation: similar to phage Spp1 gp16.1 # Family: family:all:180 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510942;genbank:gi:17426276;genbank:GeneID:927389 Probab=99.18 E-value=8.2e-14 Score=92.34 Aligned_cols=87 Identities=17% Similarity=0.172 Sum_probs=74.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhhhHHHHHHHHHhcccccceEEE Q lcl|NC_021331. 21 VDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIAEGKRAIYAILRGGGAVRAIYF 100 (147) Q Consensus 21 v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~~~~~~i~~~~~~~~~g~~iyi 100 (147) +++.+++.+.+.+.++.+.++..+|||||+||+||.+.+.. 
.|-+..| T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~aPv~TG~Lr~SI~~~~~~--------------------------------~~~~~~V 48 (116) T protein:vir:12 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKD--------------------------------GGFTGVI 48 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcCcccccccceEEeec--------------------------------CcEEEEE Confidence 77888888999999999999999999999999999864311 1224567 Q ss_pred ecCchhhhhhhcC-----------------------------CCCCCCchHHHHHHHHHHHHHHHHHH Q lcl|NC_021331. 101 SNMLIYANALEYG-----------------------------HSKQAPAGVLGIVAVKLRSYMAEAIK 139 (147) Q Consensus 101 ~Nn~pYA~~LEyG-----------------------------~s~QAp~G~V~~a~~~~~~~v~~a~~ 139 (147) .++++||.++||| |++|.|+.|++.|+.+-...|.+.+. T Consensus 49 ~~~~~YA~yvE~GTg~~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 116 (116) T protein:vir:12 49 NIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred ecCCCcccccccCCcccccCCCcccccccceeeecCCceeeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 7999999999999 77899999999999988888877777 No 50 >protein:vir:97327 Length: 116 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240615;genbank:gi:66396305;genbank:GeneID:5133683 Probab=99.18 E-value=8.2e-14 Score=92.34 Aligned_cols=87 Identities=17% Similarity=0.172 Sum_probs=74.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhhhHHHHHHHHHhcccccceEEE Q lcl|NC_021331. 21 VDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIAEGKRAIYAILRGGGAVRAIYF 100 (147) Q Consensus 21 v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~~~~~~i~~~~~~~~~g~~iyi 100 (147) +++.+++.+.+.+.++.+.++..+|||||+||+||.+.+.. 
.|-+..| T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~aPv~TG~Lr~SI~~~~~~--------------------------------~~~~~~V 48 (116) T protein:vir:97 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKD--------------------------------GGFTGVI 48 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcCcccccccceEEeec--------------------------------CcEEEEE Confidence 77888888999999999999999999999999999864311 1224567 Q ss_pred ecCchhhhhhhcC-----------------------------CCCCCCchHHHHHHHHHHHHHHHHHH Q lcl|NC_021331. 101 SNMLIYANALEYG-----------------------------HSKQAPAGVLGIVAVKLRSYMAEAIK 139 (147) Q Consensus 101 ~Nn~pYA~~LEyG-----------------------------~s~QAp~G~V~~a~~~~~~~v~~a~~ 139 (147) .++++||.++||| |++|.|+.|++.|+.+-...|.+.+. T Consensus 49 ~~~~~YA~yvE~GTg~~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 116 (116) T protein:vir:97 49 NIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred ecCCCcccccccCCcccccCCCcccccccceeeecCCceeeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 7999999999999 77899999999999988888877777 No 51 >protein:vir:8669 Length: 142 # NCBI annotation: gp27 # Family: family:all:1084 # MgeID: mge:156 # MgeName: Rosebush # Cross-refs: genbank:acc:NP_817788;genbank:gi:29566220;genbank:GeneID:1259476 Probab=99.16 E-value=9.4e-14 Score=92.01 Aligned_cols=111 Identities=14% Similarity=0.018 Sum_probs=85.0 Q ss_pred CC-cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhh Q lcl|NC_021331. 
1 MA-KNYTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIA 79 (147) Q Consensus 1 MA-k~~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~ 79 (147) |+ .+++++-|..++.++.++++..+++.+++++.++.+..+..+|||||.||.||......- T Consensus 1 m~~~~~~~~gl~~~l~~~~~~~~~~~~~~i~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~----------------- 63 (142) T protein:vir:86 1 MVQVSVRYEGFDYNPVGAAAQVGPILRRTHSSLTRQIANETRARVPVLTGHLGRSVREDPQVM----------------- 63 (142) T ss_pred CceeEEEeeecchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeeeeccc----------------- Confidence 55 478888899999999999999999999999999999999999999999999997543110 Q ss_pred hHHHHHHHHHhcccccceEEEecCchhhhhhhcCCC-----------------------------CCCCchHHHHHHHHH Q lcl|NC_021331. 80 EGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHS-----------------------------KQAPAGVLGIVAVKL 130 (147) Q Consensus 80 ~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s-----------------------------~QAp~G~V~~a~~~~ 130 (147) +...+-++-+..+++||.++|||+. +|.|+.|++.++++. T Consensus 64 -----------~~~~~~~~~v~~~a~YA~~ve~GT~ph~i~pk~~~al~f~~~g~~~~~k~v~hpG~~a~Pfl~~A~~~~ 132 (142) T protein:vir:86 64 -----------VTPFHVSGGVTAHAKYAAAVHEGTRPHVIRAKHAQALHFWWRGREVFVRQVNHPGTRARPYLRNAGEAV 132 (142) T ss_pred -----------cccceEEEEeccCccccceeccCCccceeccccCceeeEecCCceeeeeeeecCCCCCCchhHHHHHHH Confidence 0111223446689999999999974 678888888888654 Q ss_pred H-HHHHHHHH Q lcl|NC_021331. 131 R-SYMAEAIK 139 (147) Q Consensus 131 ~-~~v~~a~~ 139 (147) . +..+.+++ T Consensus 133 ~~~~~~~~~r 142 (142) T protein:vir:86 133 VRRDRRIRVR 142 (142) T ss_pred HhhhhhhccC Confidence 2 22232333 No 52 >protein:vir:99101 Length: 142 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1608 # MgeName: Qyrzula # Cross-refs: genbank:acc:YP_655705;genbank:gi:109521783;genbank:GeneID:4157823 Probab=99.16 E-value=9.4e-14 Score=92.01 Aligned_cols=111 Identities=14% Similarity=0.018 Sum_probs=85.0 Q ss_pred CC-cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhh Q lcl|NC_021331. 
1 MA-KNYTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIA 79 (147) Q Consensus 1 MA-k~~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~ 79 (147) |+ .+++++-|..++.++.++++..+++.+++++.++.+..+..+|||||.||.||......- T Consensus 1 m~~~~~~~~gl~~~l~~~~~~~~~~~~~~i~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~----------------- 63 (142) T protein:vir:99 1 MVQVSVRYEGFDYNPVGAAAQVGPILRRTHSSLTRQIANETRARVPVLTGHLGRSVREDPQVM----------------- 63 (142) T ss_pred CceeEEEeeecchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeeeeccc----------------- Confidence 55 478888899999999999999999999999999999999999999999999997543110 Q ss_pred hHHHHHHHHHhcccccceEEEecCchhhhhhhcCCC-----------------------------CCCCchHHHHHHHHH Q lcl|NC_021331. 80 EGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHS-----------------------------KQAPAGVLGIVAVKL 130 (147) Q Consensus 80 ~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s-----------------------------~QAp~G~V~~a~~~~ 130 (147) +...+-++-+..+++||.++|||+. +|.|+.|++.++++. T Consensus 64 -----------~~~~~~~~~v~~~a~YA~~ve~GT~ph~i~pk~~~al~f~~~g~~~~~k~v~hpG~~a~Pfl~~A~~~~ 132 (142) T protein:vir:99 64 -----------VTPFHVSGGVTAHAKYAAAVHEGTRPHVIRAKHAQALHFWWRGREVFVRQVNHPGTRARPYLRNAGEAV 132 (142) T ss_pred -----------cccceEEEEeccCccccceeccCCccceeccccCceeeEecCCceeeeeeeecCCCCCCchhHHHHHHH Confidence 0111223446689999999999974 678888888888654 Q ss_pred H-HHHHHHHH Q lcl|NC_021331. 131 R-SYMAEAIK 139 (147) Q Consensus 131 ~-~~v~~a~~ 139 (147) . 
+..+.+++ T Consensus 133 ~~~~~~~~~r 142 (142) T protein:vir:99 133 VRRDRRIRVR 142 (142) T ss_pred HhhhhhhccC Confidence 2 22232333 No 53 >protein:vir:106570 Length: 182 # NCBI annotation: putative protein # Family: family:all:6475 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958588;genbank:gi:41179258;genbank:GeneID:2717106 Probab=99.15 E-value=7.5e-13 Score=87.05 Aligned_cols=117 Identities=13% Similarity=0.095 Sum_probs=76.2 Q ss_pred CCc-cc-chHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCc Q lcl|NC_021331. 1 MAK-NY-TIREFHGNIDAWINAVDSGLKDCV----ELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHG 74 (147) Q Consensus 1 MAk-~~-s~~~F~~~i~~f~~~v~~~~~~~~----r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G 74 (147) |++ .+ -+..+.+.|.++.+.+++.+...+ .+++..+..+.+...|||||.||+|....+..-. T Consensus 1 m~~v~i~Gld~L~~kl~~~~~~~~~~v~~a~~~~~~~~a~~v~~~ak~~~PvdtG~Lr~SI~~~~~~~~----------- 69 (182) T protein:vir:10 1 MIEVELKGVNELRAKLKKLPDIMAKATANAQENAIEQAEAYAVDELQSSIKYSTGELTRSFKHEVKVDG----------- 69 (182) T ss_pred CeEEEEecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCchhhhhceeeeeeecC----------- Confidence 886 23 367788888888776666655555 4555555666677889999999999865432110 Q ss_pred chhhhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCC---------------------------------------- Q lcl|NC_021331. 75 DKTIAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGH---------------------------------------- 114 (147) Q Consensus 75 ~~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~---------------------------------------- 114 (147) .+-+..+.++.+||.++|||+ T Consensus 70 -------------------~~~~g~V~~~~~ya~yvE~GTG~~~~~~~~~~~p~~~~~~~~~~w~~~~~~v~~~~a~~~~ 130 (182) T protein:vir:10 70 -------------------DEVIGRWWNSSMVAVFREFGTGLVGERSHKQLPKNVAIIYRQTPWFFPVDSVDLDLTKIYG 130 (182) T ss_pred -------------------CeEEEEeecCCCccceeecCcccccccCccccCccceeeeecCCceeeccccccccccccc Confidence 011234557777777777663 Q ss_pred --------------CCCCCchHHHHHHHH----HHHHHHHHHHH-HHhhhcC Q lcl|NC_021331. 
115 --------------SKQAPAGVLGIVAVK----LRSYMAEAIKE-SRAKNAL 147 (147) Q Consensus 115 --------------s~QAp~G~V~~a~~~----~~~~v~~a~~e-~k~~~~~ 147 (147) ++|.|+.|++.|+++ +.++|.+++++ +|-.+|= T Consensus 131 ~~~~~~~~~~~~~t~G~~aqPFl~pA~~~~~~~i~~~i~~~i~~~l~~~~g~ 182 (182) T protein:vir:10 131 IPKIKINGKYFYRTTGQPARQFMTPAANKMAKEAPEIIKRSIDQELHDKLGG 182 (182) T ss_pred cceeeecCceEeecCCCCCCcchHHHHHHhHHHHHHHHHHHHHHHHHHhhcC Confidence 578889999988864 45555555555 3555555 No 54 >protein:vir:101594 Length: 173 # NCBI annotation: hypothetical protein # Family: family:all:26502 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112510;genbank:gi:53793610;interpro:IPR010064;uniprot:Q5ZGE3;genbank:GeneID:3101702 Probab=99.11 E-value=8.1e-13 Score=86.88 Aligned_cols=110 Identities=15% Similarity=0.156 Sum_probs=81.7 Q ss_pred CCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhhh Q lcl|NC_021331. 1 MAKNYTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIAE 80 (147) Q Consensus 1 MAk~~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~~ 80 (147) |- =--+.++.+.|.++.+.+++.+.+.+++.|..+...++.+.|||||.||.|+.++... ..| T Consensus 1 i~-i~Gld~L~~~L~~l~~~~~~~~~~a~~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~~~----------~~~------ 63 (173) T protein:vir:10 1 MA-VKGVAEVIAELRKIGKDIDKNINATTEEAANFIEDRAKTLAPKNFGKLAQSISTSDLK----------AKD------ 63 (173) T ss_pred Cc-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCchhhhhcceeeeec----------cCc------ Confidence 22 1126778899999999999999999999999999999999999999999999875311 011 Q ss_pred HHHHHHHHHhcccccceEEEecCchhhhhhhcCCC--------------------------------------------- Q lcl|NC_021331. 
81 GKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHS--------------------------------------------- 115 (147) Q Consensus 81 ~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s--------------------------------------------- 115 (147) +-++-+..+..|+.++|||+| T Consensus 64 --------------~~~~~v~~~~~Ya~fvEfGT~~m~a~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~ 129 (173) T protein:vir:10 64 --------------LISKKITVNELYGAYMEFGTGAKVSVPKEFADMAASFKGQKTGSFKDGLESIKAWCRAKGIDEKAA 129 (173) T ss_pred --------------eeEEeeCCCcccchhhhcccccccCCCchhhhhhcccccccccccccccccccccccccccchhcc Confidence 112335577888888888865 Q ss_pred ----------CCCCchHHHHHHHH--------HHHHHHHHHHHH Q lcl|NC_021331. 116 ----------KQAPAGVLGIVAVK--------LRSYMAEAIKES 141 (147) Q Consensus 116 ----------~QAp~G~V~~a~~~--------~~~~v~~a~~e~ 141 (147) +|.|+.|.+.|+++ +.+.|+++++++ T Consensus 130 ~~~~~~~~~~G~~aqPFl~PA~~~~~~~~~~~i~~~i~~~lrk~ 173 (173) T protein:vir:10 130 YPIFAKILGAGINPQPFLYPAWIEGKKQYLKDLENLLKTYNKKI 173 (173) T ss_pred cceeeEeecCCCCCCccchhHHHHhHHHHHHHHHHHHHHHhhcC Confidence 58888899888754 444555555555 No 55 >protein:vir:100075 Length: 140 # NCBI annotation: gp9 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945039;genbank:gi:38707899;genbank:GeneID:2744122 Probab=99.10 E-value=1.2e-12 Score=85.95 Aligned_cols=130 Identities=15% Similarity=0.117 Sum_probs=83.0 Q ss_pred CCc-ccc-hHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchh Q lcl|NC_021331. 1 MAK-NYT-IREFHGNIDAWINAVD-SGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKT 77 (147) Q Consensus 1 MAk-~~s-~~~F~~~i~~f~~~v~-~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t 77 (147) ||+ .++ +.+|.+.|+.+.+.+. +.+...+++.+..+..++..++|++||.++.|..++......+ ... 
T Consensus 1 Ma~~~i~Gld~l~~~l~~L~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~tG~l~~sI~~~~~~~~~~---------~~~ 71 (140) T protein:vir:10 1 MSSIQIIGLADLRADFEKLAKSQSTKALRRATVAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDA---------PGL 71 (140) T ss_pred CceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHHhccccccccccc---------cce Confidence 996 233 6678888888877764 5678899999999999999999999999999998764333211 100 Q ss_pred hhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCchHHHHHHHHHHHHHHHHHHH-HHh--hhcC Q lcl|NC_021331. 78 IAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGVLGIVAVKLRSYMAEAIKE-SRA--KNAL 147 (147) Q Consensus 78 ~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~~~~v~~a~~e-~k~--~~~~ 147 (147) ...+. ....+.......+..|+.+||||+|.|.|..|++.|+..-..-+.+++.+ ++. +..| T Consensus 72 ~~~g~--------~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~k~~ 136 (140) T protein:vir:10 72 ATAGV--------RVRTKGKADSPNNAFYWRFDEFGTQHMKAQPFMRPAFDASIGEAEGAIRTELARAIDRVL 136 (140) T ss_pred EEeee--------eeccccccCCCCccceeeeeccCCCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHHh Confidence 00000 00000011123567899999999999999999999987544332222221 111 1111 No 56 >protein:vir:95062 Length: 116 # NCBI annotation: ORF044 # Family: family:all:180 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240827;genbank:gi:66394711;genbank:GeneID:5133856 Probab=99.10 E-value=2.8e-13 Score=89.41 Aligned_cols=87 Identities=17% Similarity=0.172 Sum_probs=73.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhhhHHHHHHHHHhcccccceEEE Q lcl|NC_021331. 21 VDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIAEGKRAIYAILRGGGAVRAIYF 100 (147) Q Consensus 21 v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~~~~~~i~~~~~~~~~g~~iyi 100 (147) +++.+++.+.+++.++.+.++..+|||||.||+||.+.+.. 
.+-+..| T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~apv~TG~Lr~SI~~~~~~--------------------------------~~~~~~V 48 (116) T protein:vir:95 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKD--------------------------------GGFTGVI 48 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHhhCCccccccccceeEEeec--------------------------------CcEEEEE Confidence 77788888999999999999999999999999999764311 0123557 Q ss_pred ecCchhhhhhhcC-----------------------------CCCCCCchHHHHHHHHHHHHHHHHHH Q lcl|NC_021331. 101 SNMLIYANALEYG-----------------------------HSKQAPAGVLGIVAVKLRSYMAEAIK 139 (147) Q Consensus 101 ~Nn~pYA~~LEyG-----------------------------~s~QAp~G~V~~a~~~~~~~v~~a~~ 139 (147) .++++||.++||| |++|.|+.|++.|+.+-...|.+.+. T Consensus 49 ~~~~~Ya~yvE~GTg~~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~is 116 (116) T protein:vir:95 49 NIGSEYAIYVNYGTGIYATGAGGSRAKNIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred ecCCCccceeecCccccccCCCccccccccceeecCccceeeCCCCCCCcchHHHHHHHHHHHHHhhC Confidence 7999999999999 77899999999999998888887777 No 57 >protein:vir:80362 Length: 140 # NCBI annotation: gp10, phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111089;genbank:gi:134288660;genbank:GeneID:4960609 Probab=99.06 E-value=1.1e-12 Score=86.14 Aligned_cols=129 Identities=15% Similarity=0.113 Sum_probs=83.0 Q ss_pred CCcccc---hHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcch Q lcl|NC_021331. 1 MAKNYT---IREFHGNIDAWINAVDS-GLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDK 76 (147) Q Consensus 1 MAk~~s---~~~F~~~i~~f~~~v~~-~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~ 76 (147) ||. +. +.+|.+.|+.+.+.+.. .+...+++.+..+..++...+|++||.++.|..++....... .. 
T Consensus 1 Ma~-~~i~Gld~l~~~l~~l~~~~~~k~~~~a~~~~a~~v~~~ak~~aP~~tG~l~~~i~~~~~~~~~~---------~~ 70 (140) T protein:vir:80 1 MSS-IQIVGLADLLADFERLAKSQSTKALRRATVAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDA---------PG 70 (140) T ss_pred Cce-eeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhceeeeccccccc---------cc Confidence 995 43 56778888887776644 557899999999999999999999999999998753221110 00 Q ss_pred hhhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCchHHHHHHHHHHHHHHHHHHH-HHh--hhcC Q lcl|NC_021331. 77 TIAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGVLGIVAVKLRSYMAEAIKE-SRA--KNAL 147 (147) Q Consensus 77 t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~~~~v~~a~~e-~k~--~~~~ 147 (147) ....+ ...+.........+..|+.+||||+|.|.|+.|++.|+.+...-+.+++.+ ++. +..| T Consensus 71 ~~~~~--------~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~k~~ 136 (140) T protein:vir:80 71 LATAG--------VRVRTKGKADSPSNAFYWRFDEFGTQHMKAQPFMRPAFDASIGEAEGAIRTELARAIDQAL 136 (140) T ss_pred eeeee--------eecccccccCCCCCcceeeeeccCCCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHHh Confidence 00000 000000112234578899999999999999999999997664444333332 111 1122 No 58 >protein:vir:107568 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338191;genbank:gi:77020147;genbank:GeneID:3703699 Probab=99.05 E-value=4.5e-12 Score=82.80 Aligned_cols=132 Identities=11% Similarity=0.074 Sum_probs=86.0 Q ss_pred CCcccc-----hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcc Q lcl|NC_021331. 1 MAKNYT-----IREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGD 75 (147) Q Consensus 1 MAk~~s-----~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~ 75 (147) ||++++ +.+|.+.|....+.+++.....+++.|..+..++..++|+++|.++.+-........... +.+.-++. 
T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~-~~i~~~~~ 79 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGA-DQIKVTKA 79 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccccccccccccccccccc-ccceeccc Confidence 998754 577888888888888888889999999999999999999999999876332111110000 00000000 Q ss_pred hhhhhHHHHHHHHHhcccccceEEEe------cCchhhhhhhcCCCCCCCchHHHHHHHHHHHH-HHHHHHHHHhhhcC Q lcl|NC_021331. 76 KTIAEGKRAIYAILRGGGAVRAIYFS------NMLIYANALEYGHSKQAPAGVLGIVAVKLRSY-MAEAIKESRAKNAL 147 (147) Q Consensus 76 ~t~~~~~~~i~~~~~~~~~g~~iyi~------Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~~~~-v~~a~~e~k~~~~~ 147 (147) .....+..+.+. .+..|+.+||||+|.|.|..|++.|++.-..- ++....+++.++.+ T Consensus 80 --------------~~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pa~~~~k~~~~~~~~~~l~~~l~k 144 (146) T protein:vir:10 80 --------------KLEGGIKTVKIGLNKADRSPWFYLKFHEWGTSKMPAHPFIEPGFNASKAEAVRAMTDILKNEMRL 144 (146) T ss_pred --------------cccccceeEEeeeccCCCCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHHhh Confidence 000001122222 34579999999999999999999999876444 44444455555444 No 59 >protein:vir:102875 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338140;genbank:gi:77020200;genbank:GeneID:3703784 Probab=99.05 E-value=4.5e-12 Score=82.80 Aligned_cols=132 Identities=11% Similarity=0.074 Sum_probs=86.0 Q ss_pred CCcccc-----hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcc Q lcl|NC_021331. 1 MAKNYT-----IREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGD 75 (147) Q Consensus 1 MAk~~s-----~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~ 75 (147) ||++++ +.+|.+.|....+.+++.....+++.|..+..++..++|+++|.++.+-........... +.+.-++. 
T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~-~~i~~~~~ 79 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGA-DQIKVTKA 79 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccccccccccccccccccc-ccceeccc Confidence 998754 577888888888888888889999999999999999999999999876332111110000 00000000 Q ss_pred hhhhhHHHHHHHHHhcccccceEEEe------cCchhhhhhhcCCCCCCCchHHHHHHHHHHHH-HHHHHHHHHhhhcC Q lcl|NC_021331. 76 KTIAEGKRAIYAILRGGGAVRAIYFS------NMLIYANALEYGHSKQAPAGVLGIVAVKLRSY-MAEAIKESRAKNAL 147 (147) Q Consensus 76 ~t~~~~~~~i~~~~~~~~~g~~iyi~------Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~~~~-v~~a~~e~k~~~~~ 147 (147) .....+..+.+. .+..|+.+||||+|.|.|..|++.|++.-..- ++....+++.++.+ T Consensus 80 --------------~~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pa~~~~k~~~~~~~~~~l~~~l~k 144 (146) T protein:vir:10 80 --------------KLEGGIKTVKIGLNKADRSPWFYLKFHEWGTSKMPAHPFIEPGFNASKAEAVRAMTDILKNEMRL 144 (146) T ss_pred --------------cccccceeEEeeeccCCCCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHHhh Confidence 000001122222 34579999999999999999999999876444 44444455555444 No 60 >protein:vir:105007 Length: 146 # NCBI annotation: conserved phage protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459972;genbank:gi:85701387;genbank:GeneID:3882148 Probab=99.05 E-value=4.5e-12 Score=82.80 Aligned_cols=132 Identities=11% Similarity=0.074 Sum_probs=86.0 Q ss_pred CCcccc-----hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcc Q lcl|NC_021331. 1 MAKNYT-----IREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGD 75 (147) Q Consensus 1 MAk~~s-----~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~ 75 (147) ||++++ +.+|.+.|....+.+++.....+++.|..+..++..++|+++|.++.+-........... +.+.-++. 
T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~-~~i~~~~~ 79 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGA-DQIKVTKA 79 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccccccccccccccccccc-ccceeccc Confidence 998754 577888888888888888889999999999999999999999999876332111110000 00000000 Q ss_pred hhhhhHHHHHHHHHhcccccceEEEe------cCchhhhhhhcCCCCCCCchHHHHHHHHHHHH-HHHHHHHHHhhhcC Q lcl|NC_021331. 76 KTIAEGKRAIYAILRGGGAVRAIYFS------NMLIYANALEYGHSKQAPAGVLGIVAVKLRSY-MAEAIKESRAKNAL 147 (147) Q Consensus 76 ~t~~~~~~~i~~~~~~~~~g~~iyi~------Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~~~~-v~~a~~e~k~~~~~ 147 (147) .....+..+.+. .+..|+.+||||+|.|.|..|++.|++.-..- ++....+++.++.+ T Consensus 80 --------------~~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pa~~~~k~~~~~~~~~~l~~~l~k 144 (146) T protein:vir:10 80 --------------KLEGGIKTVKIGLNKADRSPWFYLKFHEWGTSKMPAHPFIEPGFNASKAEAVRAMTDILKNEMRL 144 (146) T ss_pred --------------cccccceeEEeeeccCCCCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHHhh Confidence 000001122222 34579999999999999999999999876444 44444455555444 No 61 >protein:vir:102085 Length: 146 # NCBI annotation: head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512318;genbank:gi:89152487;genbank:GeneID:3953078 Probab=99.05 E-value=4.5e-12 Score=82.80 Aligned_cols=132 Identities=11% Similarity=0.074 Sum_probs=86.0 Q ss_pred CCcccc-----hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcc Q lcl|NC_021331. 1 MAKNYT-----IREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGD 75 (147) Q Consensus 1 MAk~~s-----~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~ 75 (147) ||++++ +.+|.+.|....+.+++.....+++.|..+..++..++|+++|.++.+-........... +.+.-++. 
T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~-~~i~~~~~ 79 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGA-DQIKVTKA 79 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccccccccccccccccccc-ccceeccc Confidence 998754 577888888888888888889999999999999999999999999876332111110000 00000000 Q ss_pred hhhhhHHHHHHHHHhcccccceEEEe------cCchhhhhhhcCCCCCCCchHHHHHHHHHHHH-HHHHHHHHHhhhcC Q lcl|NC_021331. 76 KTIAEGKRAIYAILRGGGAVRAIYFS------NMLIYANALEYGHSKQAPAGVLGIVAVKLRSY-MAEAIKESRAKNAL 147 (147) Q Consensus 76 ~t~~~~~~~i~~~~~~~~~g~~iyi~------Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~~~~-v~~a~~e~k~~~~~ 147 (147) .....+..+.+. .+..|+.+||||+|.|.|..|++.|++.-..- ++....+++.++.+ T Consensus 80 --------------~~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pa~~~~k~~~~~~~~~~l~~~l~k 144 (146) T protein:vir:10 80 --------------KLEGGIKTVKIGLNKADRSPWFYLKFHEWGTSKMPAHPFIEPGFNASKAEAVRAMTDILKNEMRL 144 (146) T ss_pred --------------cccccceeEEeeeccCCCCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHHhh Confidence 000001122222 34579999999999999999999999876444 44444455555444 No 62 >protein:vir:100243 Length: 140 # NCBI annotation: gp72 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355408;genbank:gi:77864698;genbank:GeneID:3725965 Probab=99.04 E-value=1.9e-12 Score=84.90 Aligned_cols=130 Identities=15% Similarity=0.063 Sum_probs=85.1 Q ss_pred CCc-ccc-hHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchh Q lcl|NC_021331. 1 MAK-NYT-IREFHGNIDAWINAVD-SGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKT 77 (147) Q Consensus 1 MAk-~~s-~~~F~~~i~~f~~~v~-~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t 77 (147) ||+ .++ +.+|.+.|+.+.+.+. +.+...+++.+..+.+++..++|++||.++.|-.++....... .+... 
T Consensus 1 Ma~~~i~Gld~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~-------~~~~~ 73 (140) T protein:vir:10 1 MSSVQILGLADLQADFLKLAKAQSTKALRRATVAGANVIRDEARARAPKKTGKLKRNIVTAALKQKDS-------PGIAT 73 (140) T ss_pred CceeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHHhceecccccccc-------cceeE Confidence 996 232 5678888888887764 4668899999999999999999999999999988764332211 11111 Q ss_pred hhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCchHHHHHHHHHH-HHHHHHHHHHHhh------hcC Q lcl|NC_021331. 78 IAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGVLGIVAVKLR-SYMAEAIKESRAK------NAL 147 (147) Q Consensus 78 ~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~~-~~v~~a~~e~k~~------~~~ 147 (147) ... +.+.........+..|+.+||||+|.|+|+.|++.|+.+-. ++++....+++.+ -|| T Consensus 74 ~~~----------~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~k~~~~~~ 140 (140) T protein:vir:10 74 AGV----------RVRTKGKADSPNNAFYWRFVELGTQFMKAEPFMRPAFDASIAQAEGAIRTEIARAIDQVVGGGL 140 (140) T ss_pred Eee----------ccccccccCCCCcccccceeccCcCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHHhhcCC Confidence 000 00001111223467899999999999999999999996553 3333333333322 244 No 63 >protein:vir:1437 Length: 140 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536366;genbank:gi:17975171;genbank:GeneID:929147 Probab=99.02 E-value=1.5e-12 Score=85.43 Aligned_cols=130 Identities=15% Similarity=0.119 Sum_probs=82.3 Q ss_pred CCc-ccc-hHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchh Q lcl|NC_021331. 1 MAK-NYT-IREFHGNIDAWINAVDS-GLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKT 77 (147) Q Consensus 1 MAk-~~s-~~~F~~~i~~f~~~v~~-~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t 77 (147) ||+ .+. +.+|.+.|+.+.+.+.. .+.+.+++.+..+..++..++|++||.++.|..++......+ ... 
T Consensus 1 M~~~~i~Gld~l~~~l~~l~~~~~~~~~~~al~~~a~~v~~~ak~~aP~~tG~l~~sI~~~~~~~~~~---------~~~ 71 (140) T protein:vir:14 1 MSSIQIIGLADLRADFEKLAKSQSAKALRRATLAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDA---------PGL 71 (140) T ss_pred CceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHhhccccccccccc---------cee Confidence 996 233 56788888888777654 567899999999999999999999999999988753322211 110 Q ss_pred hhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCchHHHHHHHHHHHHHHHHH-HHHHh--hhcC Q lcl|NC_021331. 78 IAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGVLGIVAVKLRSYMAEAI-KESRA--KNAL 147 (147) Q Consensus 78 ~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~~~~v~~a~-~e~k~--~~~~ 147 (147) ...+ ...+.+.......+.+|+.+||||+|.|.|+.|++.++......+.+++ .+++. +..| T Consensus 72 ~~vg--------~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~pFl~pa~~~~~~~~~~~~~~~~~~~l~k~~ 136 (140) T protein:vir:14 72 ATAG--------VRVRTKGKADSPNNAFYWRFDEFGTQHMKAQPFMRPAFDASIGEAEGAIRTELARAIDRVL 136 (140) T ss_pred EEee--------eeeccccccCCCCccceeeeeccccCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 0000 0000011122346789999999999999999999999865432222222 11111 1111 No 64 >protein:vir:194 Length: 149 # NCBI annotation: Gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037704;genbank:gi:9634169;genbank:GeneID:1262536 Probab=98.96 E-value=4e-12 Score=83.05 Aligned_cols=134 Identities=18% Similarity=0.197 Sum_probs=82.8 Q ss_pred CCc-ccc---hHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccc-cc-cccCCCC Q lcl|NC_021331. 1 MAK-NYT---IREFHGNIDAWINAVDS-GLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPL-YA-LNQYDKH 73 (147) Q Consensus 1 MAk-~~s---~~~F~~~i~~f~~~v~~-~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~-~~-~~~~d~~ 73 (147) |++ +++ +.+|.+.|+.+.+.+.+ .....+++.|..+..++..++|++||.++.|..++...... +. ...+... 
T Consensus 1 mm~~~~~i~Gl~~l~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~si~~~~~~~~~~~~~~~~v~~~ 80 (149) T protein:vir:19 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIDRAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) T ss_pred CcceeeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCchhhhhhccccccccccccceeeccccc Confidence 665 444 45788888888777664 45788999999999999999999999999999876432211 10 0000000 Q ss_pred cchhhhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCchHHHHHHHHHH--------HHHHHHHHHHHhh Q lcl|NC_021331. 74 GDKTIAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGVLGIVAVKLR--------SYMAEAIKESRAK 144 (147) Q Consensus 74 G~~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~~--------~~v~~a~~e~k~~ 144 (147) +.... ..........+-..+..|+.++|||+|.|.|+.|++.|+.+-. ..+.++++++=.| T Consensus 81 ~~~~~----------~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PF~~pA~~~~k~~~~~~~~~~l~~~l~k~~~k 149 (149) T protein:vir:19 81 GVNPR----------TGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAIDEVLSK 149 (149) T ss_pred ccccc----------cccccceeecCCCCccceeeeeccCCCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 00000 0000111122333457799999999999999999999986432 2233333333222 No 65 >protein:vir:80116 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425608;genbank:gi:155042941;genbank:GeneID:5469542 Probab=98.93 E-value=1.4e-11 Score=80.11 Aligned_cols=109 Identities=20% Similarity=0.327 Sum_probs=78.5 Q ss_pred CCcccchHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHH----HhCCcccchhcccceeccCCccccccccCCC Q lcl|NC_021331. 1 MAKNYTIREFHGNID----AWINAVDSGLKDCVELFAEKVHTDLV----KRSPVDTGRYRGNWQVTANKPPLYALNQYDK 72 (147) Q Consensus 1 MAk~~s~~~F~~~i~----~f~~~v~~~~~~~~r~~a~~l~~~vv----~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~ 72 (147) ||+ ++|.+|++.|. .|++.+.+.+++.+.+++.++...+. ..+|++||.|+.+|.+... 
T Consensus 1 M~~-i~id~La~~I~~~L~~y~~~v~~~v~~~v~evak~a~~~lkk~i~~tsPkrTG~YaK~W~~k~~------------ 67 (127) T protein:vir:80 1 MAN-IKIDRLGDEITRQLKRYSQVIAGDLEQIMDDVSKEAVDRLKAKIEEEGLVQTGDYKRGWTRKRT------------ 67 (127) T ss_pred Ccc-ccHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCccccccccccceeeec------------ Confidence 997 99999865555 78888888888888888888777776 5899999999999975421 Q ss_pred CcchhhhhHHHHHHHHHhcccccceEEEecCchh--hhhhhcCCCCCC-----CchHHHHHHHHHHHHHHHHHHHH---H Q lcl|NC_021331. 73 HGDKTIAEGKRAIYAILRGGGAVRAIYFSNMLIY--ANALEYGHSKQA-----PAGVLGIVAVKLRSYMAEAIKES---R 142 (147) Q Consensus 73 ~G~~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pY--A~~LEyG~s~QA-----p~G~V~~a~~~~~~~v~~a~~e~---k 142 (147) +.+.+|| |..+| +.-||+||-.+. +...++.+.+...+-+.+-+++. - T Consensus 68 --------------------~~~~~v~--nk~~yqLtHLLE~GHAkr~GGRV~a~pHI~paee~~~~~l~~~i~~~l~~~ 125 (127) T protein:vir:80 68 --------------------PGGWVIH--NKTEYRLAHLLEYGHATVDGGRVPETPHIRPVEDWLEKEFEDRVERAIKNE 125 (127) T ss_pred --------------------cCceeEe--ecCCcceeehhhcceeccCCcccCCccchhhHHHHHHHHHHHHHHHHhcCC Confidence 0122444 88999 899999997643 34566777777655555444432 2 Q ss_pred hh Q lcl|NC_021331. 143 AK 144 (147) Q Consensus 143 ~~ 144 (147) +| T Consensus 126 ~~ 127 (127) T protein:vir:80 126 SR 127 (127) T ss_pred CC Confidence 33 No 66 >protein:vir:107545 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1481 # MgeName: PG1 # Cross-refs: genbank:acc:NP_943803;genbank:gi:38638428;genbank:GeneID:2657225 Probab=98.90 E-value=5.8e-12 Score=82.20 Aligned_cols=111 Identities=12% Similarity=0.022 Sum_probs=77.1 Q ss_pred CCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhhh Q lcl|NC_021331. 1 MAKNYTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIAE 80 (147) Q Consensus 1 MAk~~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~~ 80 (147) |++--....|.-+.+.+.+.+...+++.++.++.++.+.++..+|||||.||+||+..... 
.| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~ak~~aPvdtG~Lr~SI~~~~~~-----------~~------ 63 (140) T protein:vir:10 1 MATIRARARIEIDEAALERESGEHLRAFHRSLTRRIANQSRVAVPVRTGNLGRTIGELPQV-----------YT------ 63 (140) T ss_pred CeeeeeeeeeeeCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccchhhhccceeeeee-----------CC------ Confidence 8763334445566678888899999999999999999999999999999999999853211 00 Q ss_pred HHHHHHHHHhcccccceEEEecCchhhhhhhcCCC-----------------------------CCCCchHHHHHHHHHH Q lcl|NC_021331. 81 GKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHS-----------------------------KQAPAGVLGIVAVKLR 131 (147) Q Consensus 81 ~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s-----------------------------~QAp~G~V~~a~~~~~ 131 (147) ..+-++.+..+++||.++|||+. +|.|+.|++.++.+.. T Consensus 64 ------------~~~~~~~v~~~a~YA~~Ve~GT~ph~I~pk~~k~L~~~~~G~~~~~k~V~hpG~~a~Pfl~~A~~~~~ 131 (140) T protein:vir:10 64 ------------PFRVRGGVEATADYAAPVHEGSRPHAIRARNAQYLHFWWHGREMFRKSVWHPGTRARPFMRNSAQRVV 131 (140) T ss_pred ------------CceEEEEecCCccchhhhccCCCCceeecCCCccceeecCCCEEEeeeeecCCCCCChhHHHHHHHHh Confidence 01124557789999999999964 3556666665554321 Q ss_pred HHHHHHHHHHHhhhc Q lcl|NC_021331. 132 SYMAEAIKESRAKNA 146 (147) Q Consensus 132 ~~v~~a~~e~k~~~~ 146 (147) ..+-|.||- T Consensus 132 ------~~~~~i~~~ 140 (140) T protein:vir:10 132 ------TNDPRVRMT 140 (140) T ss_pred ------hhhhhccCC Confidence 122344444 No 67 >protein:vir:97982 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1482 # MgeName: Orion # Cross-refs: genbank:acc:YP_655121;genbank:gi:109391871;genbank:GeneID:4157345 Probab=98.90 E-value=5.8e-12 Score=82.20 Aligned_cols=111 Identities=12% Similarity=0.022 Sum_probs=77.1 Q ss_pred CCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhhh Q lcl|NC_021331. 
1 MAKNYTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIAE 80 (147) Q Consensus 1 MAk~~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~~ 80 (147) |++--....|.-+.+.+.+.+...+++.++.++.++.+.++..+|||||.||+||+..... .| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~ak~~aPvdtG~Lr~SI~~~~~~-----------~~------ 63 (140) T protein:vir:97 1 MATIRARARIEIDEAALERESGEHLRAFHRSLTRRIANQSRVAVPVRTGNLGRTIGELPQV-----------YT------ 63 (140) T ss_pred CeeeeeeeeeeeCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccchhhhccceeeeee-----------CC------ Confidence 8763334445566678888899999999999999999999999999999999999853211 00 Q ss_pred HHHHHHHHHhcccccceEEEecCchhhhhhhcCCC-----------------------------CCCCchHHHHHHHHHH Q lcl|NC_021331. 81 GKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHS-----------------------------KQAPAGVLGIVAVKLR 131 (147) Q Consensus 81 ~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s-----------------------------~QAp~G~V~~a~~~~~ 131 (147) ..+-++.+..+++||.++|||+. +|.|+.|++.++.+.. T Consensus 64 ------------~~~~~~~v~~~a~YA~~Ve~GT~ph~I~pk~~k~L~~~~~G~~~~~k~V~hpG~~a~Pfl~~A~~~~~ 131 (140) T protein:vir:97 64 ------------PFRVRGGVEATADYAAPVHEGSRPHAIRARNAQYLHFWWHGREMFRKSVWHPGTRARPFMRNSAQRVV 131 (140) T ss_pred ------------CceEEEEecCCccchhhhccCCCCceeecCCCccceeecCCCEEEeeeeecCCCCCChhHHHHHHHHh Confidence 01124557789999999999964 3556666665554321 Q ss_pred HHHHHHHHHHHhhhc Q lcl|NC_021331. 
132 SYMAEAIKESRAKNA 146 (147) Q Consensus 132 ~~v~~a~~e~k~~~~ 146 (147) ..+-|.||- T Consensus 132 ------~~~~~i~~~ 140 (140) T protein:vir:97 132 ------TNDPRVRMT 140 (140) T ss_pred ------hhhhhccCC Confidence 122344444 No 68 >protein:vir:93617 Length: 148 # NCBI annotation: putative structural component # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449299;genbank:gi:157166047;interpro:IPR010064;interpro:IPR011693;uniprot:Q6H9U2;genbank:GeneID:5580439 Probab=98.89 E-value=8.4e-12 Score=81.30 Aligned_cols=134 Identities=16% Similarity=0.140 Sum_probs=81.6 Q ss_pred CCc-ccc---hHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccc-cccCCCCc Q lcl|NC_021331. 1 MAK-NYT---IREFHGNIDAWINAVD-SGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYA-LNQYDKHG 74 (147) Q Consensus 1 MAk-~~s---~~~F~~~i~~f~~~v~-~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~-~~~~d~~G 74 (147) |.+ +++ +.+|.+.|+.+.+.+. +.....+++.+..+..++..++|++||.++.|-.++...-+.+. ...+...+ T Consensus 1 mm~~~~~i~Gldel~~~l~~L~~~~~~~~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~g~~~~~v~~~~ 80 (148) T protein:vir:93 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRRSRDGGMESGVHIRG 80 (148) T ss_pred CcceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcchhhhhceeccccccCCceeeeeeecc Confidence 554 454 5577788888776654 45577888999999999999999999999999877644322221 11110000 Q ss_pred chhhhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCchHHHHHHHHH--------HHHHHHHHHHHHhh Q lcl|NC_021331. 75 DKTIAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGVLGIVAVKL--------RSYMAEAIKESRAK 144 (147) Q Consensus 75 ~~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~--------~~~v~~a~~e~k~~ 144 (147) ..... 
......-..+-..+.+|+.++|||+|.|.|+.|++.|+.+- .+.+.+.++++=.| T Consensus 81 ~~~~~----------~~~~~~~~~~~~~~~~y~~f~E~GT~~~pa~PFl~pA~~~~k~~~~~~~~~~~~~~i~k~~~k 148 (148) T protein:vir:93 81 VNPDT----------GNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEVLRR 148 (148) T ss_pred ccccc----------ccccceeecCCCCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHHHHHhcC Confidence 00000 00000000112245789999999999999999999998643 33333333443333 No 69 >protein:vir:99528 Length: 92 # NCBI annotation: putative major tail protein # Family: family:all:180 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958541;genbank:gi:41179323;genbank:GeneID:2717166 Probab=98.89 E-value=3.5e-12 Score=83.41 Aligned_cols=88 Identities=20% Similarity=0.291 Sum_probs=65.1 Q ss_pred CCc-ccc---hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcch Q lcl|NC_021331. 1 MAK-NYT---IREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDK 76 (147) Q Consensus 1 MAk-~~s---~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~ 76 (147) ||| +++ +..|.+.|.+..+ .+.+++++++.+.+|-++.+...|||||.+|+|+.+++..- |- T Consensus 1 Ma~~~i~~~Gld~L~~~L~~~~~--~~~v~~vv~~~~~~l~~~ak~~ap~dTG~lrrSI~~~~~~~-----------g~- 66 (92) T protein:vir:99 1 MADYSISWDGLDALDEALANQQN--MNTVKKVVKKHTANLMTATQQAVPVDTGHLKQSAQIQISRD-----------GF- 66 (92) T ss_pred CCceeeEeehHHHHHHHHHhhcc--HHHHHHHHHHHHHHHHHHHHHhCCCCccccceeeeEEeecC-----------Ce- Confidence 999 443 6677777776543 36688999999999999999999999999999998764210 00 Q ss_pred hhhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCC Q lcl|NC_021331. 77 TIAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAP 119 (147) Q Consensus 77 t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp 119 (147) .+.++...=+..|+.|||||++-++. 
T Consensus 67 -----------------~~~v~~~gp~a~Ya~YvE~GTR~M~A 92 (92) T protein:vir:99 67 -----------------TGSVTYGGGLVNYAAYVEFGTRFMDS 92 (92) T ss_pred -----------------eEEEEeccCccccccccccceeecCC Confidence 01111112468899999999999887 No 70 >protein:vir:95372 Length: 124 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764480;genbank:gi:115334634;genbank:GeneID:5179259 Probab=98.88 E-value=2.9e-11 Score=78.34 Aligned_cols=109 Identities=18% Similarity=0.272 Sum_probs=76.2 Q ss_pred CCcccchHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHH----HhCCcccchhcccceeccCCccccccccCCC Q lcl|NC_021331. 1 MAKNYTIREFHGN----IDAWINAVDSGLKDCVELFAEKVHTDLV----KRSPVDTGRYRGNWQVTANKPPLYALNQYDK 72 (147) Q Consensus 1 MAk~~s~~~F~~~----i~~f~~~v~~~~~~~~r~~a~~l~~~vv----~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~ 72 (147) ||+ ++|.+|++. |+.|.+.+.+.+++.+.+++.+++..|. ..+|++||.|+.+|.+.... T Consensus 1 M~~-i~id~La~~I~~~L~~Ys~~v~~~v~~~v~~vak~a~~~lkk~i~~tspkrTG~YaK~W~~kk~~----------- 68 (124) T protein:vir:95 1 MAK-IKIGRLADEITSQLRKYSQVIADDVEQIMDDVTKEAVGRLKSKIQEVGLVQTGDYMRGWTRKRVP----------- 68 (124) T ss_pred Ccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHhcCcccccchhccceeeeec----------- Confidence 997 998888654 5567788888888888777777776665 48999999999999875311 Q ss_pred CcchhhhhHHHHHHHHHhcccccceEEEecCchh--hhhhhcCCCCCCC-----chHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_021331. 73 HGDKTIAEGKRAIYAILRGGGAVRAIYFSNMLIY--ANALEYGHSKQAP-----AGVLGIVAVKLRSYMAEAIKESRAK 144 (147) Q Consensus 73 ~G~~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pY--A~~LEyG~s~QAp-----~G~V~~a~~~~~~~v~~a~~e~k~~ 144 (147) -+.+|| |..+| +.-|||||-.+.+ ...++.+.+...+-|.+-+++.=.. 
T Consensus 69 ---------------------e~~~V~--nk~~yqLtHLLE~GHAkr~GGRV~a~pHI~paee~~~~~l~~~i~~~l~~ 124 (124) T protein:vir:95 69 ---------------------NGWVIH--NKTEYRLAHLLEYGHATVDGGRVPGTPHIRPIEDWLEKEFEDRVEKAIKQ 124 (124) T ss_pred ---------------------CceeEE--EcCCCceeeeeecceeccCCcccCCccchhHHHHHHHHHHHHHHHHHhcC Confidence 122455 88899 8999999976543 3455666666655555555443222 No 71 >protein:vir:1273 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690765;genbank:gi:22855005;genbank:GeneID:955232 Probab=98.88 E-value=3.5e-11 Score=77.88 Aligned_cols=117 Identities=14% Similarity=0.097 Sum_probs=85.2 Q ss_pred CCcccc---hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcc---cchhcccceeccCCccccccccCCCCc Q lcl|NC_021331. 1 MAKNYT---IREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVD---TGRYRGNWQVTANKPPLYALNQYDKHG 74 (147) Q Consensus 1 MAk~~s---~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVd---tG~~R~nw~vs~~~~~~~~~~~~d~~G 74 (147) ||+ ++ +++|.+.|.+..+.+++.....+++.|..+..++..++|++ ||.++.|-.++-. ..+..| T Consensus 1 M~~-~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~~k~~ap~~~~~tg~l~~~I~~~~~--------k~~~~g 71 (127) T protein:vir:12 1 MAD-MSFDGIDDLTQYFEKIGGDIEKVEPVALKAGGEIIAERQRSHVNRSDKKQPHMQDNITVSNV--------RESKDG 71 (127) T ss_pred Cee-eeehhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCChhHHHHhhhcccc--------ccccCc Confidence 997 43 67788888888888888889999999999999999999986 8999999876411 111112 Q ss_pred chhhhhHHHHHHHHHhcccccceEEEe---cCchhhhhhhcCCCCCCCchHHHHHHHHHHHHHHHHHHH-HHhhhc Q lcl|NC_021331. 75 DKTIAEGKRAIYAILRGGGAVRAIYFS---NMLIYANALEYGHSKQAPAGVLGIVAVKLRSYMAEAIKE-SRAKNA 146 (147) Q Consensus 75 ~~t~~~~~~~i~~~~~~~~~g~~iyi~---Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~~~~v~~a~~e-~k~~~~ 146 (147) .. +|.|. 
++.+|+.+||||+|.|.|++|++.|+++-...+-+++.+ ++.++= T Consensus 72 ~~--------------------~v~Vg~~~~~~~y~~f~E~GT~~~~a~Pf~~pa~~~~~~~~~~~~~~~~~~~lk 127 (127) T protein:vir:12 72 VR--------------------FVAVGPNKKVAYRGRFLEWGTSKMPPQPFIEKGGKEGEGPAVELMERILTAPIK 127 (127) T ss_pred ee--------------------EEEEeeCCCCcceeeeeccCccCCCCCccchHhHHHHHHHHHHHHHHHHHHhcC Confidence 11 22232 457899999999999999999999998765554444433 333333 No 72 >protein:vir:9708 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795470;genbank:gi:28876221;genbank:GeneID:1257765 Probab=98.87 E-value=2.7e-11 Score=78.51 Aligned_cols=120 Identities=13% Similarity=0.008 Sum_probs=90.0 Q ss_pred CCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccch----hcccceeccCCccccccccCCCCcch Q lcl|NC_021331. 1 MAKNYTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGR----YRGNWQVTANKPPLYALNQYDKHGDK 76 (147) Q Consensus 1 MAk~~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~----~R~nw~vs~~~~~~~~~~~~d~~G~~ 76 (147) |-++. .+|.++|.+..+.+++...+.+++.|..+...+..++|+++|. ++.|-.++- + ..+..|.. T Consensus 1 mv~Gl--~el~~~l~~l~~~~~~~~~~al~~ga~~~~~~~k~~ap~~~~~~~~hl~d~I~~~~--~------k~~~~g~~ 70 (125) T protein:vir:97 1 MTKGL--DEILANLTKLEVKAPKTAKAAVTEVAKEFEKALKANTPVYEVETDERLQEDTVISG--F------KGANVGIV 70 (125) T ss_pred CchhH--HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCCchhhHHhhhhccc--c------cccccCce Confidence 99854 7899999999888888899999999999999999999999887 666655431 0 11222221 Q ss_pred hhhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCchHHHHHHHHHH-HHHHHHHHHHHhhhcC Q lcl|NC_021331. 77 TIAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGVLGIVAVKLR-SYMAEAIKESRAKNAL 147 (147) Q Consensus 77 t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~~-~~v~~a~~e~k~~~~~ 147 (147) +.. +-|=-.+..|+.++|||+|.|.|.+|++.|+++-. 
++++....+++..++| T Consensus 71 ~~~-----------------VG~~k~~~~y~~f~E~GT~k~~~~pF~~pa~~~~k~~~~~~~~~~~~~~L~l 125 (125) T protein:vir:97 71 SKE-----------------IGYGKATGWRAHYPNDGTIYQRGQDFKERTINQMTPKAKQLYAEKVKEGLGL 125 (125) T ss_pred EEE-----------------EeecCCCceeEeeeccCccCCCcCccchHhHHHhHHHHHHHHHHHHHHHhcC Confidence 110 01111356899999999999999999999998774 4555555667999999 No 73 >protein:vir:106041 Length: 137 # NCBI annotation: gp23 # Family: family:all:1084 # MgeID: mge:1505 # MgeName: Cooper # Cross-refs: genbank:acc:YP_654920;genbank:gi:109392376;genbank:GeneID:4157069 Probab=98.85 E-value=7.6e-12 Score=81.56 Aligned_cols=108 Identities=14% Similarity=0.057 Sum_probs=73.7 Q ss_pred CCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhhh Q lcl|NC_021331. 1 MAKNYTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIAE 80 (147) Q Consensus 1 MAk~~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~~ 80 (147) |..+.- +.-+..+..+.+...++..+++++.++..+.+..+|||||+||+||...... T Consensus 1 m~~s~~---i~i~~~~l~~~v~~~~k~~l~~~a~~i~~~ak~~aPv~tG~Lr~SI~~~~~~------------------- 58 (137) T protein:vir:10 1 MPVTAR---IHINEPELERQTGAIFRGKHRSITRRIATQARADVPVRTGNLGRGIQEMPQT------------------- 58 (137) T ss_pred CCeeEE---EeeCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhhcCceeeeec------------------- Confidence 665432 2222244456677778888899999999999999999999999999864311 Q ss_pred HHHHHHHHHhcccccceEEEecCchhhhhhhcCCC-----------------------------CCCCchHHHHHHHHHH Q lcl|NC_021331. 81 GKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHS-----------------------------KQAPAGVLGIVAVKLR 131 (147) Q Consensus 81 ~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s-----------------------------~QAp~G~V~~a~~~~~ 131 (147) +...+-++.+.++++||.++|||+. +|.|..|++.++.+.. 
T Consensus 59 ----------~~~~~~~~~v~~~~~YA~~ve~GT~ph~I~pk~~k~l~f~~~G~~v~~k~v~hpG~~a~Pfl~~A~~~~~ 128 (137) T protein:vir:10 59 ----------YRPFHVGGGVEDNVDYAAPVHEGSRPHRITARHANALHFFWHGREVFRKSVWHPGVRPRPFLRNAARRVV 128 (137) T ss_pred ----------cccceEEEEEecCCCceeeeeecCCCceeecccCceeeeeeCCceEEeeeeecCCCCCCchHHHHHHHHh Confidence 0011235789999999999999953 5667777777766531 Q ss_pred HHHHHHHHHHHhhhc Q lcl|NC_021331. 132 SYMAEAIKESRAKNA 146 (147) Q Consensus 132 ~~v~~a~~e~k~~~~ 146 (147) .++=|.|+- T Consensus 129 ------~~~~ri~~~ 137 (137) T protein:vir:10 129 ------AADPDIHMT 137 (137) T ss_pred ------hccccccCC Confidence 123344444 No 74 >protein:vir:1891 Length: 179 # NCBI annotation: gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037671;genbank:gi:9634129;genbank:GeneID:1262520 Probab=98.82 E-value=2.2e-11 Score=78.99 Aligned_cols=140 Identities=14% Similarity=0.124 Sum_probs=81.2 Q ss_pred CCcccc-----hHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCc-----ccchhcccceeccCCcccccccc Q lcl|NC_021331. 1 MAKNYT-----IREFHGNIDAWINAVD-SGLKDCVELFAEKVHTDLVKRSPV-----DTGRYRGNWQVTANKPPLYALNQ 69 (147) Q Consensus 1 MAk~~s-----~~~F~~~i~~f~~~v~-~~~~~~~r~~a~~l~~~vv~~tPV-----dtG~~R~nw~vs~~~~~~~~~~~ 69 (147) ||.+++ +.+|.+.|+.+.+.+. +.+...+++.|.-+..++..+.|+ ++|.++.|-.+.-+...+... T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~~~~~~~~~-- 78 (179) T protein:vir:18 1 MADSVEVSLTGLESLLGKMEAVSEVTRNKAGRFALRKAANIIRDRARSNASRVDDPLTKEAIHKNIVASFSSKQFRRT-- 78 (179) T ss_pred CCceEEEEeecHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccccchhhhhhheeecccccccccc-- Confidence 998655 7789999999988875 456889999999999999999966 456666666554433322111 Q ss_pred CCCCcchhhhhHH------HHHHHHHhcccccceEEE--------ecCchhhhhhhcCCCCCCCchHHHHHHHHH----- Q lcl|NC_021331. 
70 YDKHGDKTIAEGK------RAIYAILRGGGAVRAIYF--------SNMLIYANALEYGHSKQAPAGVLGIVAVKL----- 130 (147) Q Consensus 70 ~d~~G~~t~~~~~------~~i~~~~~~~~~g~~iyi--------~Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~----- 130 (147) |......+. ..........+.+...|. .-+.+|+.+||||+|.|+|..|++.++.+- T Consensus 79 ----g~~~~~vgv~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~y~~fvEfGT~kmpa~PFlrPA~~~~~~~a~ 154 (179) T protein:vir:18 79 ----GDLAFRVGVMGGARQYANTKANVRKGRAGKTYKTSGDKGNPGGDTWYWRFLEFGTEHTSARPILRPAMNGVDNDVI 154 (179) T ss_pred ----cceeEeeecccccccccccccccccCcccccccccccccCCCCccceeEEeccCCCCCCCCccchhhHHhhHHHHH Confidence 111000000 000000000011111221 125789999999999999999999988632 Q ss_pred -------HHHHHHHHHHH--Hhhhc Q lcl|NC_021331. 131 -------RSYMAEAIKES--RAKNA 146 (147) Q Consensus 131 -------~~~v~~a~~e~--k~~~~ 146 (147) .+-|++++++. |++-| T Consensus 155 ~~i~~~l~~~i~k~lk~~~~~~~~~ 179 (179) T protein:vir:18 155 NVFSTEMGKAIDRAIRLAMKKGTTA 179 (179) T ss_pred HHHHHHHHHHHHHHHHhhcccCCCC Confidence 22223333333 22233 No 75 >protein:vir:98342 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918934;genbank:gi:119443696;genbank:GeneID:4594504 Probab=98.75 E-value=1.4e-10 Score=74.53 Aligned_cols=116 Identities=13% Similarity=0.096 Sum_probs=86.2 Q ss_pred CCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccch--hcccceeccCCccccccccCCCCcchhh Q lcl|NC_021331. 1 MAKNYTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGR--YRGNWQVTANKPPLYALNQYDKHGDKTI 78 (147) Q Consensus 1 MAk~~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~--~R~nw~vs~~~~~~~~~~~~d~~G~~t~ 78 (147) |+--+++.++.+.|+.+...+++.....+++.|.-+.+.+...+|+++|. +|.|..+|- +. ..+..|.. 
T Consensus 1 M~v~v~~~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~--~k-----~~~~~g~~-- 71 (125) T protein:vir:98 1 MGARIESNNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSN--VK-----TDRHTSEK-- 71 (125) T ss_pred CeeEeeHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecc--cc-----cccccceE-- Confidence 99989998999999999888888888889999988889999999999887 899998862 11 11111111 Q ss_pred hhHHHHHHHHHhcccccceEEEecCc---hhhhhhhcCCCCCCCchHHHHHHHHHHH----HHHHHHHHHHh Q lcl|NC_021331. 79 AEGKRAIYAILRGGGAVRAIYFSNML---IYANALEYGHSKQAPAGVLGIVAVKLRS----YMAEAIKESRA 143 (147) Q Consensus 79 ~~~~~~i~~~~~~~~~g~~iyi~Nn~---pYA~~LEyG~s~QAp~G~V~~a~~~~~~----~v~~a~~e~k~ 143 (147) .+.+..+- =|+.++|||+|.|.|++|++.|+++-.. ++.+.+++++- T Consensus 72 ------------------~v~VG~~k~~~~~a~F~E~GT~k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:98 72 ------------------IVTIGYAKGVSHRIHATEFGTMYQKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred ------------------EEEeccCCCCceEEEeccCCccCCCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 12233332 3789999999999999999999975544 55555555544 No 76 >protein:vir:4704 Length: 125 # NCBI annotation: phi PVL ORF 11 homologue # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061636;genbank:gi:9635723;genbank:GeneID:1262995 Probab=98.75 E-value=1.4e-10 Score=74.53 Aligned_cols=116 Identities=13% Similarity=0.096 Sum_probs=86.2 Q ss_pred CCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccch--hcccceeccCCccccccccCCCCcchhh Q lcl|NC_021331. 1 MAKNYTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGR--YRGNWQVTANKPPLYALNQYDKHGDKTI 78 (147) Q Consensus 1 MAk~~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~--~R~nw~vs~~~~~~~~~~~~d~~G~~t~ 78 (147) |+--+++.++.+.|+.+...+++.....+++.|.-+.+.+...+|+++|. +|.|..+|- +. ..+..|.. 
T Consensus 1 M~v~v~~~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~--~k-----~~~~~g~~-- 71 (125) T protein:vir:47 1 MGARIESNNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSN--VK-----TDRHTSEK-- 71 (125) T ss_pred CeeEeeHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecc--cc-----cccccceE-- Confidence 99989998999999999888888888889999988889999999999887 899998862 11 11111111 Q ss_pred hhHHHHHHHHHhcccccceEEEecCc---hhhhhhhcCCCCCCCchHHHHHHHHHHH----HHHHHHHHHHh Q lcl|NC_021331. 79 AEGKRAIYAILRGGGAVRAIYFSNML---IYANALEYGHSKQAPAGVLGIVAVKLRS----YMAEAIKESRA 143 (147) Q Consensus 79 ~~~~~~i~~~~~~~~~g~~iyi~Nn~---pYA~~LEyG~s~QAp~G~V~~a~~~~~~----~v~~a~~e~k~ 143 (147) .+.+..+- =|+.++|||+|.|.|++|++.|+++-.. ++.+.+++++- T Consensus 72 ------------------~v~VG~~k~~~~~a~F~E~GT~k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:47 72 ------------------IVTIGYAKGVSHRIHATEFGTMYQKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred ------------------EEEeccCCCCceEEEeccCCccCCCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 12233332 3789999999999999999999975544 55555555544 No 77 >protein:vir:9414 Length: 125 # NCBI annotation: phi PVL orf 11-like protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803392;genbank:gi:29028704;genbank:GeneID:1258141 Probab=98.75 E-value=1.4e-10 Score=74.53 Aligned_cols=116 Identities=13% Similarity=0.096 Sum_probs=86.2 Q ss_pred CCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccch--hcccceeccCCccccccccCCCCcchhh Q lcl|NC_021331. 1 MAKNYTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGR--YRGNWQVTANKPPLYALNQYDKHGDKTI 78 (147) Q Consensus 1 MAk~~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~--~R~nw~vs~~~~~~~~~~~~d~~G~~t~ 78 (147) |+--+++.++.+.|+.+...+++.....+++.|.-+.+.+...+|+++|. +|.|..+|- +. ..+..|.. 
T Consensus 1 M~v~v~~~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~--~k-----~~~~~g~~-- 71 (125) T protein:vir:94 1 MGARIESNNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSN--VK-----TDRHTSEK-- 71 (125) T ss_pred CeeEeeHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecc--cc-----cccccceE-- Confidence 99989998999999999888888888889999988889999999999887 899998862 11 11111111 Q ss_pred hhHHHHHHHHHhcccccceEEEecCc---hhhhhhhcCCCCCCCchHHHHHHHHHHH----HHHHHHHHHHh Q lcl|NC_021331. 79 AEGKRAIYAILRGGGAVRAIYFSNML---IYANALEYGHSKQAPAGVLGIVAVKLRS----YMAEAIKESRA 143 (147) Q Consensus 79 ~~~~~~i~~~~~~~~~g~~iyi~Nn~---pYA~~LEyG~s~QAp~G~V~~a~~~~~~----~v~~a~~e~k~ 143 (147) .+.+..+- =|+.++|||+|.|.|++|++.|+++-.. ++.+.+++++- T Consensus 72 ------------------~v~VG~~k~~~~~a~F~E~GT~k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:94 72 ------------------IVTIGYAKGVSHRIHATEFGTMYQKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred ------------------EEEeccCCCCceEEEeccCCccCCCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 12233332 3789999999999999999999975544 55555555544 No 78 >protein:vir:79988 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430006;genbank:gi:156604061;genbank:GeneID:5525448 Probab=98.75 E-value=1.4e-10 Score=74.53 Aligned_cols=116 Identities=13% Similarity=0.096 Sum_probs=86.2 Q ss_pred CCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccch--hcccceeccCCccccccccCCCCcchhh Q lcl|NC_021331. 1 MAKNYTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGR--YRGNWQVTANKPPLYALNQYDKHGDKTI 78 (147) Q Consensus 1 MAk~~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~--~R~nw~vs~~~~~~~~~~~~d~~G~~t~ 78 (147) |+--+++.++.+.|+.+...+++.....+++.|.-+.+.+...+|+++|. +|.|..+|- +. ..+..|.. 
T Consensus 1 M~v~v~~~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~--~k-----~~~~~g~~-- 71 (125) T protein:vir:79 1 MGARIESNNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSN--VK-----TDRHTSEK-- 71 (125) T ss_pred CeeEeeHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecc--cc-----cccccceE-- Confidence 99989998999999999888888888889999988889999999999887 899998862 11 11111111 Q ss_pred hhHHHHHHHHHhcccccceEEEecCc---hhhhhhhcCCCCCCCchHHHHHHHHHHH----HHHHHHHHHHh Q lcl|NC_021331. 79 AEGKRAIYAILRGGGAVRAIYFSNML---IYANALEYGHSKQAPAGVLGIVAVKLRS----YMAEAIKESRA 143 (147) Q Consensus 79 ~~~~~~i~~~~~~~~~g~~iyi~Nn~---pYA~~LEyG~s~QAp~G~V~~a~~~~~~----~v~~a~~e~k~ 143 (147) .+.+..+- =|+.++|||+|.|.|++|++.|+++-.. ++.+.+++++- T Consensus 72 ------------------~v~VG~~k~~~~~a~F~E~GT~k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:79 72 ------------------IVTIGYAKGVSHRIHATEFGTMYQKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred ------------------EEEeccCCCCceEEEeccCCccCCCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 12233332 3789999999999999999999975544 55555555544 No 79 >protein:vir:81106 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429878;genbank:gi:156603931;genbank:GeneID:5525326 Probab=98.75 E-value=1.4e-10 Score=74.53 Aligned_cols=116 Identities=13% Similarity=0.096 Sum_probs=86.2 Q ss_pred CCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccch--hcccceeccCCccccccccCCCCcchhh Q lcl|NC_021331. 1 MAKNYTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGR--YRGNWQVTANKPPLYALNQYDKHGDKTI 78 (147) Q Consensus 1 MAk~~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~--~R~nw~vs~~~~~~~~~~~~d~~G~~t~ 78 (147) |+--+++.++.+.|+.+...+++.....+++.|.-+.+.+...+|+++|. +|.|..+|- +. ..+..|.. 
T Consensus 1 M~v~v~~~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~--~k-----~~~~~g~~-- 71 (125) T protein:vir:81 1 MGARIESNNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSN--VK-----TDRHTSEK-- 71 (125) T ss_pred CeeEeeHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecc--cc-----cccccceE-- Confidence 99989998999999999888888888889999988889999999999887 899998862 11 11111111 Q ss_pred hhHHHHHHHHHhcccccceEEEecCc---hhhhhhhcCCCCCCCchHHHHHHHHHHH----HHHHHHHHHHh Q lcl|NC_021331. 79 AEGKRAIYAILRGGGAVRAIYFSNML---IYANALEYGHSKQAPAGVLGIVAVKLRS----YMAEAIKESRA 143 (147) Q Consensus 79 ~~~~~~i~~~~~~~~~g~~iyi~Nn~---pYA~~LEyG~s~QAp~G~V~~a~~~~~~----~v~~a~~e~k~ 143 (147) .+.+..+- =|+.++|||+|.|.|++|++.|+++-.. ++.+.+++++- T Consensus 72 ------------------~v~VG~~k~~~~~a~F~E~GT~k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:81 72 ------------------IVTIGYAKGVSHRIHATEFGTMYQKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred ------------------EEEeccCCCCceEEEeccCCccCCCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 12233332 3789999999999999999999975544 55555555544 No 80 >protein:vir:105089 Length: 133 # NCBI annotation: Gp11 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006591;genbank:gi:46402097;genbank:GeneID:2777955 Probab=98.74 E-value=2.1e-10 Score=73.60 Aligned_cols=118 Identities=14% Similarity=0.125 Sum_probs=76.0 Q ss_pred CCcc-c-chHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhCCcccchh----cccceeccCCccccccccCCCC Q lcl|NC_021331. 1 MAKN-Y-TIREFHGNIDAWINAVDS-GLKDCVELFAEKVHTDLVKRSPVDTGRY----RGNWQVTANKPPLYALNQYDKH 73 (147) Q Consensus 1 MAk~-~-s~~~F~~~i~~f~~~v~~-~~~~~~r~~a~~l~~~vv~~tPVdtG~~----R~nw~vs~~~~~~~~~~~~d~~ 73 (147) |++- + -+.+|.+.|+++.+.+.+ .....+++.|..+..++..++|+++|.. +.|..++.... 
....+ T Consensus 1 M~~~~i~Gl~el~~~l~~L~~~~~~k~~~~Al~~~a~~i~~~ak~~ap~~~~~~~~~~~~~I~v~~~~~------~~~~~ 74 (133) T protein:vir:10 1 MIRMEVKGLDELERQLTALGEKVATKVLRDAGREALKVVEEDMKQHAGFDETSTGQHMRDSIKIRSSTR------KAQGN 74 (133) T ss_pred CeeEeeehHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCcchhhhhhccccccccc------ccCcc Confidence 9962 3 266888999998888754 4568899999999999999999999984 44443321100 00000 Q ss_pred cchhhhhHHHHHHHHHhcccccceEEEecC---chhhhhhhcCCCCCCCchHHHHHHHHHHH----HHHHHHH-HHHhh Q lcl|NC_021331. 74 GDKTIAEGKRAIYAILRGGGAVRAIYFSNM---LIYANALEYGHSKQAPAGVLGIVAVKLRS----YMAEAIK-ESRAK 144 (147) Q Consensus 74 G~~t~~~~~~~i~~~~~~~~~g~~iyi~Nn---~pYA~~LEyG~s~QAp~G~V~~a~~~~~~----~v~~a~~-e~k~~ 144 (147) |. -.+++..+ -.|+.++|||+|.|.|+.|++.|+.+-.. ++.+.++ +++-| T Consensus 75 ~~--------------------~~v~vg~~~~~~~y~~f~E~GT~k~~a~PF~~pA~~~~~~~~~~~~~~~~~~~l~K~ 133 (133) T protein:vir:10 75 AV--------------------VTLRVGPSKQHHMKVLAQEFGTVKQVADPFIRPALDYNVQTVLRVLTVEIRNGIQNR 133 (133) T ss_pred ce--------------------EEEEecCCCCccceEeeeccCCCCCCCCccchHHHHHhHHHHHHHHHHHHHHHhhcC Confidence 00 01223222 24889999999999999999999984433 3333322 23333 No 81 >protein:vir:1386 Length: 149 # NCBI annotation: Gp9 protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612838;genbank:gi:20065972;genbank:GeneID:935787 Probab=98.67 E-value=4.7e-10 Score=71.72 Aligned_cols=127 Identities=9% Similarity=0.103 Sum_probs=76.9 Q ss_pred CCcccc-----hHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHhCCcccchh---cccceeccCCccccccccC Q lcl|NC_021331. 1 MAKNYT-----IREFHGNIDAWI--NAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRY---RGNWQVTANKPPLYALNQY 70 (147) Q Consensus 1 MAk~~s-----~~~F~~~i~~f~--~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~---R~nw~vs~~~~~~~~~~~~ 70 (147) ||.+|+ |.+|.+.|+... +.+++.....+++.|.-+..++..++|++.+-. 
+..|..+-...+.....++ T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~~k~~~~Al~~ga~~v~~~~k~~aP~~~~~~~~~~~~~~~~~~~~d~i~~~~~ 80 (149) T protein:vir:13 1 MSDGWEIKFEGLDDLIKTFEQLGTEKENEDVEKSILKECGDLAKKTVAPLIHISDDNSKSGRKGSRPPGHAANNIPEPKI 80 (149) T ss_pred CCceeEEEeecHHHHHHHHHhcccHHHHHHHHHHHHHHHHHHHHHHHHHhCCccCCccccccccccccchhhhcceeccc Confidence 999754 567888888774 456777788999999999999999999864311 1111110000000000000 Q ss_pred CCCcchhhhhHHHHHHHHHhcccccc-eEEEe------cCchhhhhhhcCCCCCCCchHHHHHHHHHHHHHHHHHHH--- Q lcl|NC_021331. 71 DKHGDKTIAEGKRAIYAILRGGGAVR-AIYFS------NMLIYANALEYGHSKQAPAGVLGIVAVKLRSYMAEAIKE--- 140 (147) Q Consensus 71 d~~G~~t~~~~~~~i~~~~~~~~~g~-~iyi~------Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~~~~v~~a~~e--- 140 (147) .. +.|. .+.+. .+..|+.++|||+|.|.|+.|++.|+.+...-+.+++.+ T Consensus 81 ~~--------------------~~g~~~~~VG~~~~~~~~~~y~~f~E~GT~k~~a~pF~~pa~~~~~~~~~~~~~~~l~ 140 (149) T protein:vir:13 81 RK--------------------KKGNLQCVVGWEKSDNTPFYYMKMEEWGTSERPPHHAFGKTNKILKRVYDNIAQKKYD 140 (149) T ss_pred cc--------------------ccceeEEEeeccCCCCCccceeeeeccCccCCCCCccchHHHHHHHHHHHHHHHHHHH Confidence 00 0111 12332 356899999999999999999999996554444333322 Q ss_pred --HHhhhcC Q lcl|NC_021331. 141 --SRAKNAL 147 (147) Q Consensus 141 --~k~~~~~ 147 (147) ++..||= T Consensus 141 k~i~~~lG~ 149 (149) T protein:vir:13 141 NFVKEKLGD 149 (149) T ss_pred HHHHHHhcC Confidence 3555555 No 82 >protein:vir:97088 Length: 157 # NCBI annotation: hypothetical protein # Family: family:all:2714 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453568;genbank:gi:84662603;genbank:GeneID:5142503 Probab=98.65 E-value=5.1e-10 Score=71.53 Aligned_cols=122 Identities=16% Similarity=0.105 Sum_probs=80.9 Q ss_pred CCc---ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchh Q lcl|NC_021331. 
1 MAK---NYTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKT 77 (147) Q Consensus 1 MAk---~~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t 77 (147) |.- +.++..+.+.|+++-+..++.+...+++-|.-+.++++.+.|++||.|+.|..+....-.+ ..|..+ T Consensus 1 m~~~~~~~d~s~l~~~l~~l~~~~~~v~R~A~~~ga~vv~dear~~aP~~tG~LkksI~~~~~~~~s-------~~g~~~ 73 (157) T protein:vir:97 1 MKFSIRSVDITGILAGLETVVEHSSDVVRTMTYESAVAVRESAKAFVNDETGKLRNNLYVAYSPEES-------VEGIQT 73 (157) T ss_pred CeeEeecccHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhheeeeeccccC-------CCceEE Confidence 443 4667788888999888888888888889999999999999999999999999775322110 111111 Q ss_pred hhhHHHHHHHHHhcccccceEEEe-cCchhhhhhhcCCCC------------------------CCCchHHHHHHHHHHH Q lcl|NC_021331. 78 IAEGKRAIYAILRGGGAVRAIYFS-NMLIYANALEYGHSK------------------------QAPAGVLGIVAVKLRS 132 (147) Q Consensus 78 ~~~~~~~i~~~~~~~~~g~~iyi~-Nn~pYA~~LEyG~s~------------------------QAp~G~V~~a~~~~~~ 132 (147) . -|++. =+.||+..+||||+. ..|..|+|.++..-.+ T Consensus 74 ~------------------~Vg~~~~~a~~g~~vEfG~~~~~~~~~~~~~~~~~~~~~~~t~~~~Pa~PFlRPA~d~~k~ 135 (157) T protein:vir:97 74 Y------------------AVSWRKKAAPHGHLLEFGHWQTHAAYRDKDGQWYSSKVKLVNPKWIPAKPFLRPGYDSVAM 135 (157) T ss_pred E------------------EEeecCCccceeeeeecCcccccccccCCcccccccccccCCCCcCCCCcccchHHHHhHH Confidence 0 01111 157999999999753 5679999999976544 Q ss_pred HHHHHH-HHHHhhhc--C Q lcl|NC_021331. 133 YMAEAI-KESRAKNA--L 147 (147) Q Consensus 133 ~v~~a~-~e~k~~~~--~ 147 (147) -+.+++ ++++.++. 
| T Consensus 136 ~a~~~~~~~l~k~I~e~l 153 (157) T protein:vir:97 136 QIPDIARAAGAKKYAELQ 153 (157) T ss_pred HHHHHHHHHHHHHHHHHh Confidence 433332 32322221 1 No 83 >protein:vir:4347 Length: 164 # NCBI annotation: Orf14 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061510;genbank:gi:9635606;genbank:GeneID:1262873 Probab=98.61 E-value=2.2e-10 Score=73.48 Aligned_cols=139 Identities=16% Similarity=0.093 Sum_probs=80.8 Q ss_pred CCcccc-----hHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCc-----ccchhcccceeccCCcccccccc Q lcl|NC_021331. 1 MAKNYT-----IREFHGNIDAWINAVD-SGLKDCVELFAEKVHTDLVKRSPV-----DTGRYRGNWQVTANKPPLYALNQ 69 (147) Q Consensus 1 MAk~~s-----~~~F~~~i~~f~~~v~-~~~~~~~r~~a~~l~~~vv~~tPV-----dtG~~R~nw~vs~~~~~~~~~~~ 69 (147) ||.+++ +.+|.+.|+.+.+.+. +.....+++.|.-+..++..++|+ ++|.++.|..++...--... T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~~~~~~~~--- 77 (164) T protein:vir:43 1 MADTVEFSITGLDSLLGKLDSVTDDVKRRGGRAALRKAAMIVVQAAKQGAEKVDDPGTGRSISDNIALRWNGRLFKR--- 77 (164) T ss_pred CCcceEEeeecHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccCCCccchhhhhhhhhcccCcccc--- Confidence 998643 6688888888887775 456788999999999999999997 56788887766432111100 Q ss_pred CCCCcchhhhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCchHHHHHHHHHHHH------------HHHH Q lcl|NC_021331. 70 YDKHGDKTIAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGVLGIVAVKLRSY------------MAEA 137 (147) Q Consensus 70 ~d~~G~~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~~~~------------v~~a 137 (147) .|......+...... 
.............++.+|+.+||||+|.|+|..|++.++.+-.+- |+++ T Consensus 78 ---~~~~~~~vg~~~~~~-~~~~~~~~~~~~~~~~~y~~f~EfGT~km~a~PFlrPA~~~~k~~~~~~~~~~l~~~i~ka 153 (164) T protein:vir:43 78 ---TGDLGFRIGVLHGAV-LPKKGERSDKTANAPTPHWRLLEFGTEDMRAQPFMRSALADNIAEVTSTFVSEYEKGIDRA 153 (164) T ss_pred ---ccceeEEeccccccc-ccccccccccCCCCCcceEEEeecCCCCCCCCcchhhhHHHhHHHHHHHHHHHHHHHHHHH Confidence 010000000000000 000000111223466899999999999999999999998633222 2222 Q ss_pred HHHH--Hhhhc Q lcl|NC_021331. 138 IKES--RAKNA 146 (147) Q Consensus 138 ~~e~--k~~~~ 146 (147) ++.. |++.| T Consensus 154 ~~k~~~~~~~~ 164 (164) T protein:vir:43 154 IKRAAKKAAQG 164 (164) T ss_pred HHHHHhhhccC Confidence 2222 22222 No 84 >protein:vir:3873 Length: 128 # NCBI annotation: putative head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680490;swissprot:trembl:p94214;genbank:gi:22296530;interpro:IPR010064;uniprot:P94214;genbank:GeneID:951688 Probab=98.60 E-value=1.2e-09 Score=69.53 Aligned_cols=122 Identities=16% Similarity=0.085 Sum_probs=83.4 Q ss_pred CCccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhh Q lcl|NC_021331. 1 MAKNY-TIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIA 79 (147) Q Consensus 1 MAk~~-s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~ 79 (147) |.-.+ .+.+|.+.|+...+.+++...+.+++.|..+...+...+|+++|..|.+=...- ....+ ...+.+|.. T Consensus 1 m~v~i~Gl~el~~~l~~l~~~~~k~~~~al~~ga~~~~~~~k~~ap~~~~~~~~~~h~~d-~I~~~--~~k~~~g~~--- 74 (128) T protein:vir:38 1 MGVKVTGDAELLANLNKLQFGVAKEARAAVRDGAQKFADKLKSNTPEWDGETDMSGHLRD-DIKLS--SVRETSGLT--- 74 (128) T ss_pred CccchhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCCCcccchhhh-hhccc--cccccCcee--- Confidence 88776 478999999999999988899999999999999999999999987654321110 00000 001111111 Q ss_pred hHHHHHHHHHhcccccceEEEe---cCchhhhhhhcCCCCCCCchHHHHHHHHHHHHHHHHH-HHHHhhhcC Q lcl|NC_021331. 
80 EGKRAIYAILRGGGAVRAIYFS---NMLIYANALEYGHSKQAPAGVLGIVAVKLRSYMAEAI-KESRAKNAL 147 (147) Q Consensus 80 ~~~~~i~~~~~~~~~g~~iyi~---Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~~~~v~~a~-~e~k~~~~~ 147 (147) ++.|. .+..|+.++|||+|.|.|.+|++.++++-..-+.+++ +++| .+| T Consensus 75 -----------------~~~VG~~k~~~~y~~f~E~GT~k~~a~pF~~pa~~~~~~~~~~~~~~~l~--k~i 127 (128) T protein:vir:38 75 -----------------EVDVGYGKDTGWRAHFPNSGTSMQDPQHFIEETQEIMRPVVIAAFLSHLK--EGG 127 (128) T ss_pred -----------------EEEeeecCCCceEEeeeccCccCCCCCcchhHHHHHhHHHHHHHHHHHHH--hhc Confidence 12221 2356999999999999999999999987755444444 4443 344 No 85 >protein:vir:5745 Length: 135 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892056;genbank:gi:33770519;interpro:IPR010064;interpro:IPR011693;uniprot:Q7Y404;genbank:GeneID:2637451 Probab=98.58 E-value=1.1e-09 Score=69.66 Aligned_cols=120 Identities=17% Similarity=0.141 Sum_probs=77.3 Q ss_pred CCcccc---hHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhCCccc----chhcccceeccCCccccccccCCC Q lcl|NC_021331. 1 MAKNYT---IREFHGNIDAWINAVDS-GLKDCVELFAEKVHTDLVKRSPVDT----GRYRGNWQVTANKPPLYALNQYDK 72 (147) Q Consensus 1 MAk~~s---~~~F~~~i~~f~~~v~~-~~~~~~r~~a~~l~~~vv~~tPVdt----G~~R~nw~vs~~~~~~~~~~~~d~ 72 (147) |.-+++ +++|.+.|+.+.+.+.+ .....+++.|..+..++..++||++ |.++.|-.++-.... T Consensus 1 M~~~~~i~Gl~el~~~l~~L~~~~~~k~~~~Al~~~a~~v~~~~k~~ap~~~~~~~g~l~~~I~i~~~k~~--------- 71 (135) T protein:vir:57 1 MIPEIEISGLQELERRLIAVGEEVGTKILRDAGRAAMAVVEADMKQNAGYDNSSTNAHMRDSIKIRSSRGK--------- 71 (135) T ss_pred CceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhHHhhccccccccc--------- Confidence 666665 66888888888888754 4568899999999999999999986 777777765422111 Q ss_pred CcchhhhhHHHHHHHHHhcccccceEEEecCc---hhhhhhhcCCCCCCCchHHHHHHHHHHH-HHHHHHHHHHhhhc-- Q lcl|NC_021331. 
73 HGDKTIAEGKRAIYAILRGGGAVRAIYFSNML---IYANALEYGHSKQAPAGVLGIVAVKLRS-YMAEAIKESRAKNA-- 146 (147) Q Consensus 73 ~G~~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~---pYA~~LEyG~s~QAp~G~V~~a~~~~~~-~v~~a~~e~k~~~~-- 146 (147) .|.. +-++.+..+- .|+.++|||+|.|+|+.|++.|+.+-.. +++....+++..+- T Consensus 72 ~~~~------------------~v~v~vg~~~~~~~~~~f~E~GT~~~~a~PF~~pa~~~~~~~~~~~~~~~~~~~l~ka 133 (135) T protein:vir:57 72 AGST------------------VVVLRVGPTRSHYMKALAQEFGTIKQVAKPFIRPALDYNKMQVLRILTVEIRDGLSTL 133 (135) T ss_pred ccce------------------eEEEEecCCCCcceeEeecccCCCCCCCCcchhHhHHHhHHHHHHHHHHHHHHHHHHh Confidence 1110 1123333333 3477889999999999999999875433 22222233322211 Q ss_pred -C Q lcl|NC_021331. 147 -L 147 (147) Q Consensus 147 -~ 147 (147) = T Consensus 134 ~r 135 (135) T protein:vir:57 134 SR 135 (135) T ss_pred cC Confidence 1 No 86 >protein:vir:102441 Length: 137 # NCBI annotation: gp26 # Family: family:all:1084 # MgeID: mge:1618 # MgeName: Pipefish # Cross-refs: genbank:acc:YP_655303;genbank:gi:109521866;genbank:GeneID:4157756 Probab=98.45 E-value=4.6e-10 Score=71.78 Aligned_cols=116 Identities=14% Similarity=0.099 Sum_probs=74.4 Q ss_pred CCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhhh Q lcl|NC_021331. 1 MAKNYTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIAE 80 (147) Q Consensus 1 MAk~~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~~ 80 (147) |-- |.+ |..+.....+++...+...+++++.++....+...|||||.||+|++.++.. T Consensus 1 ~~~--~~~-~~~~~~~~~~~~~~v~r~~l~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~------------------- 58 (137) T protein:vir:10 1 MTV--TAR-YERNPVGEARQFQVIARRRLSRITRGTANQARADVPVKTGNLGRSIREDPIV------------------- 58 (137) T ss_pred Cee--EEE-eccCchhHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccchhhhcCceeeeee------------------- Confidence 322 211 2333344455677777778899999999999999999999999999865321 Q ss_pred HHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCC--------------CchHH-HHHH----HHHHHHHHHHHHHH Q lcl|NC_021331. 
81 GKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQA--------------PAGVL-GIVA----VKLRSYMAEAIKES 141 (147) Q Consensus 81 ~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QA--------------p~G~V-~~a~----~~~~~~v~~a~~e~ 141 (147) .+...+.++++..+++||.++|||+.... ..++| +-.+ +.-+.++..|++++ T Consensus 59 ---------~~~~~~~~~~V~~~~~YA~~ve~GT~ph~I~Pk~~k~~l~~~~~g~~vf~k~V~hPG~~a~PfL~~A~~~~ 129 (137) T protein:vir:10 59 ---------VAGPLRLDSGVTAHADYARYVHDGTRAHVIRPRRPGGVLRFTVGGRVVYARRVNHPGTRARPFLRNAAERV 129 (137) T ss_pred ---------ccccceEEEEecCCCccceeeecCCCCceeeccccceeeeEeeCCeeEecceeecCCCCCCchHHHHHHHh Confidence 01122346788899999999999975321 11122 0000 22456677777777 Q ss_pred HhhhcC Q lcl|NC_021331. 142 RAKNAL 147 (147) Q Consensus 142 k~~~~~ 147 (147) +.|.-- T Consensus 130 ~~~~~~ 135 (137) T protein:vir:10 130 VARETA 135 (137) T ss_pred hhhhcc Confidence 776665 No 87 >protein:vir:106506 Length: 137 # NCBI annotation: Pas21 # Family: family:all:1084 # MgeID: mge:1680 # MgeName: phiAsp2 # Cross-refs: genbank:acc:YP_024807;genbank:gi:48697422;genbank:GeneID:2846163 Probab=98.31 E-value=1e-09 Score=69.91 Aligned_cols=113 Identities=19% Similarity=0.209 Sum_probs=76.9 Q ss_pred CCcc-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhh Q lcl|NC_021331. 1 MAKN-YTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIA 79 (147) Q Consensus 1 MAk~-~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~ 79 (147) |=-. .-|.. .++...+.+.+++.++.++.++.++.+...|||||.||.||+...... T Consensus 1 ~~~~~~~l~~-----~~l~~~~~~~~~~~~~~~a~~ve~~ak~~aPv~TG~Lr~SI~~~~~~~----------------- 58 (137) T protein:vir:10 1 MVAHTLRIER-----AQLHGLGMDEARKAVNRVVRRTFTRSQILAPVDTGYLRASGRLVLGRE----------------- 58 (137) T ss_pred CcccccccCh-----hhHhhHHHHHHHHHHHHHHHHHHHHHHhcCCcCchhhhccceeeeeec----------------- Confidence 4331 22222 345566777788889999999999999999999999999998643110 Q ss_pred hHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCC--------CCC-----chH-----HHHHHHHHHHHHHHHHHHH Q lcl|NC_021331. 
80 EGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSK--------QAP-----AGV-----LGIVAVKLRSYMAEAIKES 141 (147) Q Consensus 80 ~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~--------QAp-----~G~-----V~~a~~~~~~~v~~a~~e~ 141 (147) ...+-++++..+++||.++|+|+.. ++- .++ |..-=+.-+.++..|++++ T Consensus 59 ------------~g~~v~~~V~~~~~YA~~ve~GT~ph~I~pk~~kaL~f~~~G~~vf~k~V~hPG~k~~PfL~~Al~~~ 126 (137) T protein:vir:10 59 ------------RGAVVIGSVEYTARYAAAVHNGRRALTIRAKGNGRLKFTVEGRTVYARSVHQPARAGRPYLSQALREV 126 (137) T ss_pred ------------cccEEEEEecCCcccceeeecCCCCceeecCCCccceeecCCeeEeccceecCCCCCChhhHHHHHHh Confidence 0112346788999999999999642 110 111 1101133567899999999 Q ss_pred HhhhcC Q lcl|NC_021331. 142 RAKNAL 147 (147) Q Consensus 142 k~~~~~ 147 (147) +.+-|| T Consensus 127 ~~~~~~ 132 (137) T protein:vir:10 127 APQEGF 132 (137) T ss_pred hcccce Confidence 999999 No 88 >protein:vir:9879 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:2718 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795641;genbank:gi:28876400;genbank:GeneID:1257931 Probab=98.10 E-value=1e-08 Score=64.39 Aligned_cols=109 Identities=13% Similarity=0.148 Sum_probs=77.8 Q ss_pred CCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--CCc-------ccchhcccceeccCCccccccccCC Q lcl|NC_021331. 1 MAKNYTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKR--SPV-------DTGRYRGNWQVTANKPPLYALNQYD 71 (147) Q Consensus 1 MAk~~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~--tPV-------dtG~~R~nw~vs~~~~~~~~~~~~d 71 (147) |- -++.+...|. +..+.++.+++++=..++.++++.. +|| |||.+|.|-+..+...... 
T Consensus 1 i~---G~~~L~~~Lk---~~s~~dvk~VVkkN~ael~~r~q~~~~~pv~~~~k~~dTG~lkRSi~l~~~~~g~~------ 68 (127) T protein:vir:98 1 MT---GMPALEVKLR---SMSEKRWDRVANKNLTEMFNRAARPPGTPIGKNTKRHKSGELLRSRRLKKVNSSKD------ 68 (127) T ss_pred Cc---ChHHHHHHHH---HhhHHHHHHHHhhhhHHHHHHHHhccCCceeccccccCcccceeeeEEEEecCCce------ Confidence 22 2455555554 4466779999999999999999985 899 9999999977654332110 Q ss_pred CCcchhhhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCC---------CCCchHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021331. 72 KHGDKTIAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSK---------QAPAGVLGIVAVKLRSYMAEAIKESR 142 (147) Q Consensus 72 ~~G~~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~---------QAp~G~V~~a~~~~~~~v~~a~~e~k 142 (147) +.+=......+||+||||||+- +..+.++..++..-+.+|.+=++++= T Consensus 69 -----------------------~~vgp~g~t~dYapyvEyGTR~m~~~~~~gf~~aqp~l~paf~~Qk~iF~~DL~~l~ 125 (127) T protein:vir:98 69 -----------------------VITGNFGYIKDYAPHVEYGHRIVRNGKQVGYANGTKYLFNNVKKQREIYRQDMLNEL 125 (127) T ss_pred -----------------------EEeccCcccccccceeecceeeeecccccccccCccccccchHHHhHHHHHHHHHHh Confidence 0011122368999999999984 44789999999999988887777664 Q ss_pred hh Q lcl|NC_021331. 143 AK 144 (147) Q Consensus 143 ~~ 144 (147) -| T Consensus 126 k~ 127 (127) T protein:vir:98 126 RR 127 (127) T ss_pred cC Confidence 44 No 89 >protein:vir:102154 Length: 119 # NCBI annotation: phage protein, HK97 gp10 family # Family: family:all:10671 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699937;genbank:gi:110804042;genbank:GeneID:4206698 Probab=98.04 E-value=5.6e-08 Score=60.33 Aligned_cols=114 Identities=20% Similarity=0.217 Sum_probs=74.1 Q ss_pred CCcccc---hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchh Q lcl|NC_021331. 1 MAKNYT---IREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKT 77 (147) Q Consensus 1 MAk~~s---~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t 77 (147) || ++. |.+|...|+..-...+..-.+.+++.+.-+.+++...+||+||.+.. +... 
.-+.|-.+ T Consensus 1 Ma-~iel~G~del~~~l~~~g~~~~~ie~kAlk~g~e~I~~~~~~n~P~~tg~lkk---ik~~---------~kk~g~~~ 67 (119) T protein:vir:10 1 MA-SLEIEGFEEFEKFISEDMVLDESTKRKGIKAGITKIGKAIEKNSPIKSGRLSK---VKIR---------VKNTGLAT 67 (119) T ss_pred Cc-eeehhhHHHHHHHHHhhhhhhHHHHHHHHHHHhHHHHHHHhhcCCcccCCcce---eeee---------eecCceeE Confidence 99 565 45555556566666777778889999999999999999999999984 3211 11112100 Q ss_pred hhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCc-hHHHHHHHH-HHHHHHHHHHHHHhhhc Q lcl|NC_021331. 78 IAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPA-GVLGIVAVK-LRSYMAEAIKESRAKNA 146 (147) Q Consensus 78 ~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~-G~V~~a~~~-~~~~v~~a~~e~k~~~~ 146 (147) .++.- +..=|..++|||+|.|.+. ||+..++.+ ....+.....+++-++= T Consensus 68 --------------VG~~k-----s~~fy~kF~EFGTSkm~a~~pF~~~a~~~~~~eA~~~~~~el~~~~r 119 (119) T protein:vir:10 68 --------------EGTAS-----SSEFYDIFQNFGTSEQKAHVGYFDRAVDETTNEAVEEVAEIIFRKMR 119 (119) T ss_pred --------------eccCC-----cchhhhhhccccccccCCCCCccccccccChHHHHHHHHHHHHHhcC Confidence 01100 1235999999999999998 999988853 23333334444443333 No 90 >protein:vir:80970 Length: 112 # NCBI annotation: gp10 # Family: family:all:899 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468396;genbank:gi:157324970;genbank:GeneID:5601405 Probab=97.51 E-value=1.6e-06 Score=52.40 Aligned_cols=100 Identities=15% Similarity=0.091 Sum_probs=66.0 Q ss_pred CCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhhh Q lcl|NC_021331. 1 MAKNYTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIAE 80 (147) Q Consensus 1 MAk~~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~~ 80 (147) |+-.+.+. 
+.+|..++.+..+.....++.++++.+-.-.|.|||.|++|=.+ T Consensus 1 M~vkV~id-----~~~~~~~l~~a~~~aq~~~~~ev~~~~~~yVP~~tG~L~~s~~~----------------------- 52 (112) T protein:vir:80 1 MPIKVRVD-----LSKAKGSVKKAKERGQFALINQAAADIALYVPFLSGDLSNQYVI----------------------- 52 (112) T ss_pred CceeEEee-----hHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccCccccceee----------------------- Confidence 98665433 23343445555666677888899999988999999999998211 Q ss_pred HHHHHHHHHhcccccceEEEecCchhhhhhhcCCCC--------CCCchHHHHHHH----HHHHHHHHHHHHHHhhhcC Q lcl|NC_021331. 81 GKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSK--------QAPAGVLGIVAV----KLRSYMAEAIKESRAKNAL 147 (147) Q Consensus 81 ~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~--------QAp~G~V~~a~~----~~~~~v~~a~~e~k~~~~~ 147 (147) ...| .|..+.|||.++-||+.. .+..-|...+.. +|.+.+.+++++ || T Consensus 53 -----------~~~g---~I~y~tPYAr~qYY~~~~~~~~~~~p~ag~~W~erak~~~~~~~~~~~~k~~~~-----~l 112 (112) T protein:vir:80 53 -----------MNDK---EIMWTSIYARRLYNGINFNFTLTHHPLAGPKWDQRAKVDKLESWIEVAQKAVEE-----GL 112 (112) T ss_pred -----------ccCc---eEEecCchhhHhhhcccCCCCcCCCCCcchhhHHHHHhhhhHHHHHHHHHHHhh-----cC Confidence 0112 356799999999997532 566778876654 455555555543 34 No 91 >protein:vir:6246 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813700;swissprot:trembl:q859b7;genbank:gi:29366760;uniprot:Q859B7;genbank:GeneID:1258903 Probab=97.47 E-value=1.2e-06 Score=52.97 Aligned_cols=115 Identities=18% Similarity=0.228 Sum_probs=74.4 Q ss_pred CCcc----c---chHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhCCc-----------ccchhcccceeccCC Q lcl|NC_021331. 1 MAKN----Y---TIREFHGNIDAWI-NAVDSGLKDCVELFAEKVHTDLVKRSPV-----------DTGRYRGNWQVTANK 61 (147) Q Consensus 1 MAk~----~---s~~~F~~~i~~f~-~~v~~~~~~~~r~~a~~l~~~vv~~tPV-----------dtG~~R~nw~vs~~~ 61 (147) ||.- + -+..|...+.... ..+.+.+....+.+|.-++..+..-||+ .||+|.+|..++-.. 
T Consensus 1 ma~~~~~~vrV~Glr~f~~~mrK~~g~dl~k~lk~a~~~aa~v~~~~ar~~tP~g~r~~~~s~~~r~G~L~~Sir~aaT~ 80 (143) T protein:vir:62 1 MAQRSAYTIRVDGLREFQRNVRTLRDKELNKAVREANKASGEVLIPQAKHESPDGKRDAKSSKKYRPGKLDKSIKVTASA 80 (143) T ss_pred CCcccchheehHHHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccCcchhhccccccccc Confidence 9972 2 2567777777663 3466778888889999999999999999 699999998764111 Q ss_pred ccccccccCCCCcchhhhhHHHHHHHHHhcccccceEEEec--CchhhhhhhcCCCCCC--CchHHHHHHH----HHHHH Q lcl|NC_021331. 62 PPLYALNQYDKHGDKTIAEGKRAIYAILRGGGAVRAIYFSN--MLIYANALEYGHSKQA--PAGVLGIVAV----KLRSY 133 (147) Q Consensus 62 ~~~~~~~~~d~~G~~t~~~~~~~i~~~~~~~~~g~~iyi~N--n~pYA~~LEyG~s~QA--p~G~V~~a~~----~~~~~ 133 (147) -+-+|-+.- .+|||..++|||..+. |.-|+.-+.. .|..+ T Consensus 81 --------------------------------raa~VrAG~~krVPYA~~I~~G~r~r~Isp~rFl~~a~a~te~~~~r~ 128 (143) T protein:vir:62 81 --------------------------------KGAVIKAGSASRVPYAAAIHFGYRARNISPNRFLFRAMARKSDVVAAT 128 (143) T ss_pred --------------------------------cceeeeeCCcCCCCcccccccCcccccccchhhhhhhhhccCHHHHHH Confidence 011223333 6999999999999887 8888875442 23333 Q ss_pred HHHHHHHH-HhhhcC Q lcl|NC_021331. 134 MAEAIKES-RAKNAL 147 (147) Q Consensus 134 v~~a~~e~-k~~~~~ 147 (147) .+.-+..+ ...++- T Consensus 129 Ye~~i~~vl~k~l~s 143 (143) T protein:vir:62 129 YERRIAAVVEKYLES 143 (143) T ss_pred HHHHHHHHHHHHhcC Confidence 33222222 111222 No 92 >protein:vir:7449 Length: 123 # NCBI annotation: gp26 # Family: family:all:2713 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818564;genbank:gi:29567001;genbank:GeneID:1260238 Probab=97.41 E-value=3.3e-06 Score=50.63 Aligned_cols=116 Identities=23% Similarity=0.254 Sum_probs=89.2 Q ss_pred CCc---ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--ccchhcccceeccCCccccccccCCCCcc Q lcl|NC_021331. 
1 MAK---NYTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPV--DTGRYRGNWQVTANKPPLYALNQYDKHGD 75 (147) Q Consensus 1 MAk---~~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPV--dtG~~R~nw~vs~~~~~~~~~~~~d~~G~ 75 (147) ||| -+++.+|...|.+|..+.+..+.......|..+-..++...|= .||-.|....-++.. .| T Consensus 1 ~~~~~f~~d~~~l~~~i~~~~~k~~~~~~~~~d~~a~~le~~aK~nApW~DRTg~ARqgl~~~~~~-----------~g- 68 (123) T protein:vir:74 1 MAKVTFEYDAQELRTNIRNLDRRMESAVDALMDYEAAYATGQLKMRAPWTDRTGAARSGLLAVANK-----------LG- 68 (123) T ss_pred CceeEEEecHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccchhhhhhhcccccc-----------CC- Confidence 999 4788999999999999999999999999999999999999995 488887765432111 00 Q ss_pred hhhhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCchHHHHHHHHHHHHHHHHHHHHHhhhcC Q lcl|NC_021331. 76 KTIAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGVLGIVAVKLRSYMAEAIKESRAKNAL 147 (147) Q Consensus 76 ~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~~~~v~~a~~e~k~~~~~ 147 (147) +.-.+||++-+++|..+||.+++++ .-+++.|++.+..-|=+=.+++-+|+-- T Consensus 69 -----------------~~~~~Iylsh~veYG~~LEla~~~k--yaIi~Ptv~~~~~~im~g~~~ll~~l~~ 121 (123) T protein:vir:74 69 -----------------PGSHELIMSYSVHYGIWLEIANSGQ--YAVIGPFLPVMGRKLMHDLEHLIDRLER 121 (123) T ss_pred -----------------CceEEEEEecCeeecceeeecCCCC--ceeecchHHHHhHHHHHHHHHHHHHhhc Confidence 0124799999999999999998754 3578888887766666666666555444 No 93 >protein:vir:96288 Length: 100 # NCBI annotation: ORF049 # Family: family:all:180 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240315;genbank:gi:66396010;genbank:GeneID:5133365 Probab=97.18 E-value=4.4e-06 Score=49.94 Aligned_cols=87 Identities=20% Similarity=0.221 Sum_probs=57.7 Q ss_pred CCc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhh Q lcl|NC_021331. 
1 MAK-NYTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIA 79 (147) Q Consensus 1 MAk-~~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~ 79 (147) ||| .+--.++.++++.|-++++..+++.+-++|.+|.+..+...|||||.||.|-.+-.. +|+ T Consensus 13 makvkyG~~dmvk~~~~f~~~i~~~vk~~IakTa~~I~~~Avs~APVD~G~Lk~SI~~dyk------------~GG---- 76 (100) T protein:vir:96 13 MAKVKYGADSMVVELDKFDKKIEEWVKKGIAKTTTKIYNTAVALAPVDLGFLEESIDFKYF------------DGG---- 76 (100) T ss_pred hhhheechHHHHHHHhcchHHHHHHHHHHHHHHHHHHHhhHHhhccccccccceeeeeeee------------cCC---- Confidence 999 233334667899999999999999999999999999999999999999999875321 111 Q ss_pred hHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCchHHHHHHHHHHHHHHHHH Q lcl|NC_021331. 80 EGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGVLGIVAVKLRSYMAEAI 138 (147) Q Consensus 80 ~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~~~~v~~a~ 138 (147) -+--|+=..+||. .|+.|.+--.+ T Consensus 77 ----------------ltavI~vGAeYAI-------------------krmsqllvtvi 100 (100) T protein:vir:96 77 ----------------LSSVISVGADYAI-------------------KRMSQLLVTVI 100 (100) T ss_pred ----------------eeEEEecchhHHH-------------------HHHHHHHhhcC Confidence 1112333344443 12222222111 No 94 >protein:vir:1332 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047931;swissprot:trembl:q9zxa7;genbank:gi:9631149;uniprot:Q9ZXA7;genbank:GeneID:2715891 Probab=97.18 E-value=4.1e-06 Score=50.09 Aligned_cols=115 Identities=18% Similarity=0.218 Sum_probs=72.4 Q ss_pred CCcc----c---chHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhCCcc-----------cchhcccceeccCC Q lcl|NC_021331. 1 MAKN----Y---TIREFHGNIDAWI-NAVDSGLKDCVELFAEKVHTDLVKRSPVD-----------TGRYRGNWQVTANK 61 (147) Q Consensus 1 MAk~----~---s~~~F~~~i~~f~-~~v~~~~~~~~r~~a~~l~~~vv~~tPVd-----------tG~~R~nw~vs~~~ 61 (147) ||.- + -+..|...+.+.. ..+.+.+....+.+|.-++..+..-||+. +|+|.+|..++-.. 
T Consensus 1 ma~~~~~~vkV~Glr~f~~~mrK~~g~dl~k~lk~a~~~aa~v~~~~ar~~tP~g~~~p~~srr~r~G~L~~Sir~aaT~ 80 (143) T protein:vir:13 1 MAQRSAYTIQVDGLRQFQRNVRALRDKELNKAVREANKASGEVLIPQAKHESPDGHRDPKSSKRYRPGKLDKSIKVTASA 80 (143) T ss_pred CCcccchheehHHHHHHHHHHHHhhCCcchHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccchhhccccccccc Confidence 9972 2 2456777666652 34567778888889999999999999997 89999998764110 Q ss_pred ccccccccCCCCcchhhhhHHHHHHHHHhcccccceEEEe--cCchhhhhhhcCCCCCC--CchHHHHHHH----HHHHH Q lcl|NC_021331. 62 PPLYALNQYDKHGDKTIAEGKRAIYAILRGGGAVRAIYFS--NMLIYANALEYGHSKQA--PAGVLGIVAV----KLRSY 133 (147) Q Consensus 62 ~~~~~~~~~d~~G~~t~~~~~~~i~~~~~~~~~g~~iyi~--Nn~pYA~~LEyG~s~QA--p~G~V~~a~~----~~~~~ 133 (147) -+-.|-+. -.+|||..++|||..+. |.-|+.-+.. .|..+ T Consensus 81 --------------------------------raa~VrAGr~arVPYA~~I~~G~r~r~Is~~rFl~~a~a~te~~~~r~ 128 (143) T protein:vir:13 81 --------------------------------KGAVIKAGSAARVPYAAAIHFGYRKRNISANRFLYRAMARKSDVVAAT 128 (143) T ss_pred --------------------------------cceeeeecCcCCCCcccccccCCcccccchhhhhhhhhhccCHHHHHH Confidence 01122233 14999999999999887 8777764442 23333 Q ss_pred HHHHHHHH-HhhhcC Q lcl|NC_021331. 134 MAEAIKES-RAKNAL 147 (147) Q Consensus 134 v~~a~~e~-k~~~~~ 147 (147) .+.-+..+ ...++- T Consensus 129 Ye~~i~~vl~k~l~s 143 (143) T protein:vir:13 129 YERRIAAVVEKYLES 143 (143) T ss_pred HHHHHHHHHHHHhcC Confidence 33222222 111222 No 95 >protein:vir:45 Length: 112 # NCBI annotation: gp10 # Family: family:all:899 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463471;swissprot:trembl:q9t1b3;genbank:gi:16798793;uniprot:Q9T1B3;genbank:GeneID:922369 Probab=96.91 E-value=1.7e-05 Score=46.70 Aligned_cols=100 Identities=14% Similarity=0.082 Sum_probs=63.9 Q ss_pred CCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhhh Q lcl|NC_021331. 
1 MAKNYTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIAE 80 (147) Q Consensus 1 MAk~~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~~ 80 (147) |.-.+.+.- .++-.++.+.++.....++.++++.+-.-.|.|||.|++|=.+ T Consensus 1 M~vkv~vn~-----~~~~~~l~~a~~r~q~~~~~ev~~~~~~yVP~~~G~L~~S~~~----------------------- 52 (112) T protein:vir:45 1 MPIKVRVDL-----SKAKGSVKKAKERGQFALINQAAADIALYVPFLSGDLSNQYVI----------------------- 52 (112) T ss_pred CceeEEeeh-----HHHHHHHHHHHHHHHHHHHHHHHHHhhcCCccccCccccceee----------------------- Confidence 987554331 2232344455556677788899999988999999999997211 Q ss_pred HHHHHHHHHhcccccceEEEecCchhhhhhhcCCC--------CCCCchHHHHHHH----HHHHHHHHHHHHHHhhhcC Q lcl|NC_021331. 81 GKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHS--------KQAPAGVLGIVAV----KLRSYMAEAIKESRAKNAL 147 (147) Q Consensus 81 ~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s--------~QAp~G~V~~a~~----~~~~~v~~a~~e~k~~~~~ 147 (147) ...| .|..+.|||.++=||.. ..+..-|...+.. +|.+.+.+++++ || T Consensus 53 -----------~~~g---~I~y~tPYAr~qYY~~~~~~~~~~~p~ag~~W~erak~~~~~~~~~~~~k~~~~-----gl 112 (112) T protein:vir:45 53 -----------MNDK---EIMWTSIYARRLYKGINFNFTLTHHPLAGPEWDQRAKIDKMDVWEKVAQKAVEE-----GL 112 (112) T ss_pred -----------ccCC---eEEecChhhHHhhhccccCCCCCCCCCCchhhHHHHHHhhHHHHHHHHHHHHhh-----cC Confidence 0112 36679999999977532 2566678776554 455555544433 44 No 96 >protein:vir:79687 Length: 113 # NCBI annotation: hypothetical protein # Family: family:all:899 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285886;genbank:gi:148750843;genbank:GeneID:5220386 Probab=96.89 E-value=8.5e-06 Score=48.37 Aligned_cols=97 Identities=16% Similarity=0.170 Sum_probs=62.4 Q ss_pred HHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhhhHHHHHHHHH Q lcl|NC_021331. 11 HGNIDAWINAVDS-GLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIAEGKRAIYAIL 89 (147) Q Consensus 11 ~~~i~~f~~~v~~-~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~~~~~~i~~~~ 89 (147) +.+|+.+-+.+.. 
..+.....++.++++.+-.-.|.|||.||+|=.++ T Consensus 1 ~~dL~~~~~~~~~~~~~raQ~~l~~ev~~~~~pYVP~~~G~Lk~S~~i~------------------------------- 49 (113) T protein:vir:79 1 MSDLSVFSRMAQSTGSRSVRLQVLNQMHQDMEQYVPKRAGFLRSQSFVN------------------------------- 49 (113) T ss_pred CchHHHHHHhhchhHHHHHHHHHHHHHHHhhcccCcccccchhcccccc------------------------------- Confidence 3344444444433 34445667788899999999999999999983211 Q ss_pred hcccccceEEEecCchhhhhhhcCCCC----------CCCchHHHHHHH----HHHHHHHHH-HHHHHhhh Q lcl|NC_021331. 90 RGGGAVRAIYFSNMLIYANALEYGHSK----------QAPAGVLGIVAV----KLRSYMAEA-IKESRAKN 145 (147) Q Consensus 90 ~~~~~g~~iyi~Nn~pYA~~LEyG~s~----------QAp~G~V~~a~~----~~~~~v~~a-~~e~k~~~ 145 (147) .+ +|..+.|||.++=||... .+..-|...+.. +|.+.+.++ ....|++. T Consensus 50 ----s~---~I~y~tPYAr~qyYg~~~~~~~~~~t~p~ag~~W~eraKa~h~~~w~~~~~~a~~~G~~~~~ 113 (113) T protein:vir:79 50 ----DT---GIHYTAKYARAQFYGFVNGHRVRNYSTPGTGRRWDLKAKAVYKADWQKVAVAAFLKEAKGEY 113 (113) T ss_pred ----CC---eeEecChhhhHhhccccCCCCccccCCCCCCchhhHHHHHHhHHHHHHHHHHHhhccccccC Confidence 11 266799999999987543 445667766554 466665553 33445555 No 97 >protein:vir:4956 Length: 153 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049932;genbank:gi:9632903;genbank:GeneID:1262079 Probab=96.86 E-value=2.2e-05 Score=46.09 Aligned_cols=121 Identities=13% Similarity=0.059 Sum_probs=64.5 Q ss_pred CCccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcc------cc---hhcccceeccCCccccccccC Q lcl|NC_021331. 1 MAKNY-TIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVD------TG---RYRGNWQVTANKPPLYALNQY 70 (147) Q Consensus 1 MAk~~-s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVd------tG---~~R~nw~vs~~~~~~~~~~~~ 70 (147) ||.-- .|.+|..+|......+.+...+.++.-|.-+-..+...||.. || -++.|-.++- . 
.+ T Consensus 1 M~~~~~glee~~~~lekL~~~~~~~~~katkAGA~v~~e~L~~~tp~~h~~~~kt~~~~HlaD~I~~s~--~------~i 72 (153) T protein:vir:49 1 MTGLDEALEGWLKTVASIGDLTPAEQAKITTAGAKVFKEELAEVTREKHYSKKKDLKYGHMADGLAVQS--T------NA 72 (153) T ss_pred CccHHHHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCCCCCCcccccceecc--c------cc Confidence 88521 255555555555544555555666655555555566667652 22 3444444321 0 01 Q ss_pred CC--CcchhhhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCchHHHHHHHHH---HHHHH---HHHHHH- Q lcl|NC_021331. 71 DK--HGDKTIAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGVLGIVAVKL---RSYMA---EAIKES- 141 (147) Q Consensus 71 d~--~G~~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~---~~~v~---~a~~e~- 141 (147) |. +|..+ .|.... ...=||.++|+|++.|.|..||+-+..+- ..+++ ++.+++ T Consensus 73 dG~~dG~s~------------VG~~~~------~~a~~a~f~n~GT~km~~~hFie~tr~e~~~k~~vl~A~~~~~~~il 134 (153) T protein:vir:49 73 DGRKNGVST------------VGWKNN------YHAQNARRLNDGTKKYRADHFITNVQNDSTVKNKVLLAEKEEYEKLI 134 (153) T ss_pred cccccceee------------ecccCC------ccceeeeecccCcccCCCChhhHHHHHHhhHHHHHHHHHHHHHHHHH Confidence 11 11111 111100 01345899999999999999999888653 23444 333443 Q ss_pred HhhhcC Q lcl|NC_021331. 142 RAKNAL 147 (147) Q Consensus 142 k~~~~~ 147 (147) +-++|| T Consensus 135 ~~~~~~ 140 (153) T protein:vir:49 135 RRKGGV 140 (153) T ss_pred HhcCCe Confidence 667777 No 98 >protein:vir:98892 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:899 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164422;genbank:gi:56694912;genbank:GeneID:3197282 Probab=96.78 E-value=2.1e-05 Score=46.18 Aligned_cols=101 Identities=12% Similarity=0.059 Sum_probs=63.3 Q ss_pred CCc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhh Q lcl|NC_021331. 1 MAK-NYTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIA 79 (147) Q Consensus 1 MAk-~~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~ 79 (147) |+| -+++..+.+.+.. 
..++.....++.++++.+-.-.|.|||.|++|=.++. T Consensus 1 mmkvkv~~~~~~~~~~~------~~~~~aq~~~~~ev~~~~~~yVP~~~G~L~~s~~~~s-------------------- 54 (108) T protein:vir:98 1 MPKIRVELSGAKDKLSP------QTQRRGQYAMANQMLQDMNQFVPMEEGILRLTGNISS-------------------- 54 (108) T ss_pred CceeEeeehHHHHHHHH------HHHHHHHHHHHHHHHHhhcccCcCcCCccccceeecc-------------------- Confidence 877 3666654433332 3444556778888888888899999999999955431 Q ss_pred hHHHHHHHHHhcccccceEEEecCchhhhhhhcCC----C-CCCCchHHHHHHH-HHHHHHHHHHHHHHh Q lcl|NC_021331. 80 EGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGH----S-KQAPAGVLGIVAV-KLRSYMAEAIKESRA 143 (147) Q Consensus 80 ~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~----s-~QAp~G~V~~a~~-~~~~~v~~a~~e~k~ 143 (147) ..| .|..+.|||.++=||. + ..+..-|...+.. ....|++-+.+++|= T Consensus 55 -------------~~g---~I~y~tPYAr~qYYg~~~n~~~p~ag~~W~eraka~~~~~~~~~~~k~~k~ 108 (108) T protein:vir:98 55 -------------DAE---EIYYNTPYAKRRFYEPAYNYTTPGTGPRWDMKAKRLFISDWERAYMKGANW 108 (108) T ss_pred -------------CCc---eEEecChhhHHhhhccccCCCCCCCcchhHHHHHhhhhHHHHHHHHHhhcC Confidence 012 2567999999998872 2 3555667765543 233444444444433 No 99 >protein:vir:101508 Length: 120 # NCBI annotation: gp21 # Family: family:all:2713 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655400;genbank:gi:109522588;genbank:GeneID:4157580 Probab=96.72 E-value=3.7e-05 Score=44.90 Aligned_cols=115 Identities=19% Similarity=0.229 Sum_probs=88.5 Q ss_pred CCc---ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--ccchhcccceeccCCccccccccCCCCcc Q lcl|NC_021331. 1 MAK---NYTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPV--DTGRYRGNWQVTANKPPLYALNQYDKHGD 75 (147) Q Consensus 1 MAk---~~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPV--dtG~~R~nw~vs~~~~~~~~~~~~d~~G~ 75 (147) ||| -++..+....|.+|..+.+..+.......|..+...++..+|= .||-.|....-++... 
T Consensus 1 ~~~~~f~~~~~~l~~~i~~~~~k~~~~~~~~~d~~a~~le~~aK~nApW~DRTg~ARq~i~~~~~~~------------- 67 (120) T protein:vir:10 1 MAKIEFKFKDIELRRGVEDMEAKVDRAMKATSNYHAVEGTAHMKEHAPWTDRTGAARAGLHAVASTP------------- 67 (120) T ss_pred CceEEEEecHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccchhhhhhhccccccC------------- Confidence 999 4788889999999999999999999999999999999999995 4888777654321110 Q ss_pred hhhhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCchHHHHHHHHHHHHHHHHHHHHHhhhc Q lcl|NC_021331. 76 KTIAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGVLGIVAVKLRSYMAEAIKESRAKNA 146 (147) Q Consensus 76 ~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~~~~v~~a~~e~k~~~~ 146 (147) -+.-.+||++-+++|..+||.-++++ ..+++.|+..+.+-|-+=.+++=+|+- T Consensus 68 ----------------~~~~~~Iylsh~veYG~~LEla~~~k--yaIl~PTi~~~~~~il~g~~~ll~~l~ 120 (120) T protein:vir:10 68 ----------------QPDRYEIVFAHTVHYGIWLEIANSGR--YEIIMPTVHHEGKLMAQRLRGLLGRLR 120 (120) T ss_pred ----------------CCceEEEEEecCeeecceEEeeCCCC--cccccchHHHHhHHHHHHHHHHhhhcC Confidence 00113799999999999999666554 457888888877776666666666655 No 100 >protein:vir:7993 Length: 108 # NCBI annotation: gp9 # Family: family:all:3937 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817347;genbank:gi:29565775;genbank:GeneID:1259013 Probab=96.37 E-value=3.6e-06 Score=50.40 Aligned_cols=101 Identities=25% Similarity=0.320 Sum_probs=56.1 Q ss_pred CCccc----chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcch Q lcl|NC_021331. 1 MAKNY----TIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDK 76 (147) Q Consensus 1 MAk~~----s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~ 76 (147) ||.+- -+..|--.+++|.+.. .+++-+.+.-.+++..-+..|||++|.+|.||+|.-.+-..+. 
T Consensus 1 ma~gpt~kNP~~KFGvs~~d~~K~~--EVn~GvNeFMdE~~~~~K~~SPV~~G~Y~~S~~V~ers~NkGR---------- 68 (108) T protein:vir:79 1 MANGPTRKNPLAKFGVRLDDFDKLP--EVNQGVNEFMDEVVDAWKNNSPVGTGAYRDSVQVTERSTNKGR---------- 68 (108) T ss_pred CCCCcccccchhhhcCChhhhhhch--hhhhhHHHHHHHHHHHHhhcCCCCchhhHHHHHHHHhhhccCc---------- Confidence 77642 2445555666666632 2444455555677777888999999999999999744322110 Q ss_pred hhhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCC---CCCCchHHHHHHHHHHHHHHHHHHH Q lcl|NC_021331. 77 TIAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHS---KQAPAGVLGIVAVKLRSYMAEAIKE 140 (147) Q Consensus 77 t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s---~QAp~G~V~~a~~~~~~~v~~a~~e 140 (147) | + +.-..|||..+|||.. .-||. -....|+=.-+-.. T Consensus 69 ----G-----------~------~G~~~~~AH~VEFGs~hndeyapa------qktakqfggtay~d 108 (108) T protein:vir:79 69 ----G-----------K------VGATDPQAHLVEFGSAHNDEYAPA------QKTAKQFGGTAYGD 108 (108) T ss_pred ----c-----------c------cCCcchhhhhhhhhccccccccch------hhHHHhhcccccCC Confidence 0 1 2236899999999953 23332 11111110000001 No 101 >protein:vir:100887 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358767;genbank:gi:77999993;genbank:GeneID:3726158 Probab=96.35 E-value=6e-05 Score=43.71 Aligned_cols=119 Identities=13% Similarity=0.037 Sum_probs=60.6 Q ss_pred CCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcc-------c---chhcccceeccCCccccccccC Q lcl|NC_021331. 1 MAKNYTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVD-------T---GRYRGNWQVTANKPPLYALNQY 70 (147) Q Consensus 1 MAk~~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVd-------t---G~~R~nw~vs~~~~~~~~~~~~ 70 (147) |.+ .|.+|.++|+.......+...+.++.-|..+-..|...||.. | +-++.|-.+|-...+.. 
T Consensus 3 ~~~--~lee~l~~i~kl~~~~~~~~~ki~kaGA~v~~e~L~~~tp~~~~~~~~~~~~~~HlaD~I~~s~~~~dg~----- 75 (139) T protein:vir:10 3 MDE--ALGQWLKQVSKAAELSISDQEKITKAGADVYAKKLAETTKEKHPNTKGDGGKYGHLSEDIRSAAGDIDGD----- 75 (139) T ss_pred HHH--HHHHHHHHHHHhhccCHHHHHHHHHHHHHHHHHHHHHhcccccCcCCCCCCCCcchhhcceecCcccccc----- Confidence 221 123333333333332333334566666666677788889962 2 33555555542111100 Q ss_pred CCCcchhhhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCchHHHHHHHHHHHHHHHHHHH-HHhh---hc Q lcl|NC_021331. 71 DKHGDKTIAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGVLGIVAVKLRSYMAEAIKE-SRAK---NA 146 (147) Q Consensus 71 d~~G~~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~~~~v~~a~~e-~k~~---~~ 146 (147) ..|..+ .|. -+..-+|.++|+|++.|.|..|+.-|.++...-|-+|+.+ +|.- .+ T Consensus 76 -~~g~~~------------VG~--------~k~~~~A~f~n~GT~k~~~~hFie~t~~e~~~evl~a~~~~~k~~l~~~~ 134 (139) T protein:vir:10 76 -HNGSST------------VGF--------HNKAHIARFLNDGTKYIRADHFVDNARDDAKDAVFAAEAEKYQAMIAKAN 134 (139) T ss_pred -cceeee------------eCC--------CCCcceEeecccCccccCCCchHHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 011111 111 1223357999999999999999999988765444443333 3332 22 Q ss_pred C Q lcl|NC_021331. 147 L 147 (147) Q Consensus 147 ~ 147 (147) . T Consensus 135 ~ 135 (139) T protein:vir:10 135 G 135 (139) T ss_pred C Confidence 2 No 102 >protein:vir:4200 Length: 133 # NCBI annotation: unknown # Family: family:all:11764 # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071825;genbank:gi:11863108;genbank:GeneID:1257610 Probab=96.34 E-value=2e-05 Score=46.30 Aligned_cols=96 Identities=18% Similarity=0.228 Sum_probs=60.0 Q ss_pred CCc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhh Q lcl|NC_021331. 1 MAK-NYTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIA 79 (147) Q Consensus 1 MAk-~~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~ 79 (147) |-+ ++|-++ .|-.-..+++.++++.+.++..++-.-+.-..||.||.+|.|+.+|+.. 
T Consensus 1 mi~i~idkp~---almek~~ev~~~ie~t~~~~~~~l~~i~~ntapiktg~lr~sh~~sieg------------------ 59 (133) T protein:vir:42 1 MIEIRIDKPD---ALMEKPHEVQGKIEETLEKILNQLQGIAENTAPVKTGNLRDSHIISIEG------------------ 59 (133) T ss_pred CeeeecCCch---hhhcchhhhhhHHHHHHHHHHHHHHHHhhhccccccccceeeeeEEeec------------------ Confidence 665 455442 2222245677777777777766665555556799999999999998643 Q ss_pred hHHHHHHHHHhcccccceEEEecCchhhhhhhcCC--------------------------------CC----CCCchHH Q lcl|NC_021331. 80 EGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGH--------------------------------SK----QAPAGVL 123 (147) Q Consensus 80 ~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~--------------------------------s~----QAp~G~V 123 (147) .+=.++|.+||-+.+=+|. |. -+|.|+| T Consensus 60 ----------------stgelsn~~~yl~~vl~grgwvfpv~~kal~wpelphpvayarpappndyfsa~vay~~~~giv 123 (133) T protein:vir:42 60 ----------------STGELSNLAYYLPFVLHGRGWVFPVRRKALWWPELPHPVAYARPAPPNDYFSAVVAYSAPEGVV 123 (133) T ss_pred ----------------CccchhhhhHHhhHhhhcccceeeccccccccCCCCCcccccCCCCCchhhhhhhhhhcccchh Confidence 1222456666666655541 11 4677888 Q ss_pred HHHHHHHHHH Q lcl|NC_021331. 124 GIVAVKLRSY 133 (147) Q Consensus 124 ~~a~~~~~~~ 133 (147) +-++-+|-+- T Consensus 124 e~s~iewlre 133 (133) T protein:vir:42 124 EETLIEWLRE 133 (133) T ss_pred HHHHHHHHhC Confidence 8777777443 No 103 >protein:vir:8106 Length: 150 # NCBI annotation: gp10 # Family: family:all:3937 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817687;genbank:gi:29566118;genbank:GeneID:1259312 Probab=96.25 E-value=7.5e-06 Score=48.69 Aligned_cols=111 Identities=23% Similarity=0.316 Sum_probs=54.0 Q ss_pred CCc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhh Q lcl|NC_021331. 1 MAK-NYTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIA 79 (147) Q Consensus 1 MAk-~~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~ 79 (147) .+| ++|+.+|.+-|.. .-+|...+.+.+.+.|+-- .+..||||.|.+|+||+|.-.+-. 
|- T Consensus 5 ~~KFGvS~~e~~K~irn-s~EV~~GiNdFMe~~A~~~---aK~~SPV~~GeY~~S~~V~~ka~N----------GR---- 66 (150) T protein:vir:81 5 FEKFGVSDSELAKHIRN-SAEVDAGINDFMENEAIPY---AKSISPVDDGEYAASWAVMKKAKN----------GR---- 66 (150) T ss_pred hhhhcCCHHHHHHhhcc-chhhhhhHHHHHHhhhhhh---hhccCCcccchhHHHHHHHhhccc----------Cc---- Confidence 455 4777777665543 2345555555555544432 245799999999999998643311 10 Q ss_pred hHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCC-------------chHHHHHHHHHHHH-------HHHHHH Q lcl|NC_021331. 80 EGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAP-------------AGVLGIVAVKLRSY-------MAEAIK 139 (147) Q Consensus 80 ~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp-------------~G~V~~a~~~~~~~-------v~~a~~ 139 (147) | + +.-..|||..+|||.-..-- .--|++---++.+. -.-.+. T Consensus 67 -G-----------~------~G~~~~~AH~VEFGtgadkkqgrgkkgkrgkdgkrtveiddgefrrvgpdtptkaqgiaq 128 (150) T protein:vir:81 67 -G-----------V------FGPKAWYAHFVEFGTGADKKQGRGKKGKRGKDGKRTVEIDDGEFRRVGPDTPTKAQGIAQ 128 (150) T ss_pred -c-----------c------cCccchhhhhhhhccccccccccccccccCcccceeeeecCccceecCCCCchhhhhHHH Confidence 0 1 22368999999998642111 11121111111100 000011 Q ss_pred HHHhhhcC Q lcl|NC_021331. 140 ESRAKNAL 147 (147) Q Consensus 140 e~k~~~~~ 147 (147) ++-...|= T Consensus 129 kvashfgg 136 (150) T protein:vir:81 129 KVASHFGG 136 (150) T ss_pred HHHHhccc Confidence 11111112 No 104 >protein:vir:4790 Length: 114 # NCBI annotation: putative minor capsid protein 3 # Family: family:all:899 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150170;swissprot:trembl:q94m41;genbank:gi:15088781;uniprot:Q94M41;genbank:GeneID:955992 Probab=95.92 E-value=0.00021 Score=40.76 Aligned_cols=102 Identities=15% Similarity=0.104 Sum_probs=62.9 Q ss_pred CCcc--cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhh Q lcl|NC_021331. 
1 MAKN--YTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTI 78 (147) Q Consensus 1 MAk~--~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~ 78 (147) |.-+ +++..+.+.|.. +.++.....++.++++.+-.-.|.|||.|++|=.+.. T Consensus 1 M~~kVkv~l~~~~~~l~~------~~l~r~Q~~~~~ev~~~~~~YVP~~~G~L~~S~~~~~------------------- 55 (114) T protein:vir:47 1 MNIAIKVDLQKAKQKLSN------ESMTRGKIAVASKILLDNEQYIPLRGGELRASGRIVG------------------- 55 (114) T ss_pred CceeEEeehhHHHHHHHH------HHHHHHHHHHHHHHHHhhccCCcCccCccccceeeee------------------- Confidence 7665 455554443332 2334456777888888888899999999999843321 Q ss_pred hhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCC----------CCCCchHHHHHHHHHHHHHHHHHHHHHhhhcC Q lcl|NC_021331. 79 AEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHS----------KQAPAGVLGIVAVKLRSYMAEAIKESRAKNAL 147 (147) Q Consensus 79 ~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s----------~QAp~G~V~~a~~~~~~~v~~a~~e~k~~~~~ 147 (147) ..| .|.++.|||.++=||+- ..+..-|...+...-. ++-++-++.=+|| T Consensus 56 --------------~~~---~I~y~tPYAr~qyYg~~~~~~~~~~~~p~~g~~W~eraka~~~---~~~~~~~~k~~g~ 114 (114) T protein:vir:47 56 --------------QGD---AVVYGTVYARAQFYGSNGIVTFRRYTTPGTGKRWDQVATSKHA---EEWARAFVKGMGL 114 (114) T ss_pred --------------CCc---EEEecCchhhHhhhcccCCCCCCccCCCCCcchhHHHHHhhhh---HHHHHHHHHhhCC Confidence 012 25679999999999752 2566778776554322 2222233344566 No 105 >protein:vir:4162 Length: 133 # NCBI annotation: unknown # Family: family:all:11764 # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046971;genbank:gi:9630541;genbank:GeneID:1261715 Probab=95.86 E-value=4.7e-05 Score=44.30 Aligned_cols=96 Identities=17% Similarity=0.219 Sum_probs=59.4 Q ss_pred CCc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhh Q lcl|NC_021331. 
1 MAK-NYTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIA 79 (147) Q Consensus 1 MAk-~~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~ 79 (147) |-+ ++|-++ .|-.-..+++.++++.+.++..++-.-+.-..||.||.+|.|+.+|+.. T Consensus 1 mi~i~idkp~---almek~~ev~~~ie~t~~~~~~~l~~i~~ntapiktg~lr~sh~~sieg------------------ 59 (133) T protein:vir:41 1 MIRINIDKPE---ALMEKASEVEDRVEQTVTLLMIELEEILMNTAPIKTGELRISHTWSVEG------------------ 59 (133) T ss_pred CeeeecCCch---hhhcchhhhhhHHHHHHHHHHHHHHHHhhhccccccccceeeeeEEeec------------------ Confidence 665 455442 2222245677777777777766665555556799999999999998643 Q ss_pred hHHHHHHHHHhcccccceEEEecCchhhhhhhcCC--------------------------------CC----CCCchHH Q lcl|NC_021331. 80 EGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGH--------------------------------SK----QAPAGVL 123 (147) Q Consensus 80 ~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~--------------------------------s~----QAp~G~V 123 (147) .+=.++|.+||-+.+=+|. |. -+|.|+| T Consensus 60 ----------------stgelsn~~~yl~~vl~grgwvfpv~~kal~wpelphpvayarpappndyfsa~vay~~~~giv 123 (133) T protein:vir:41 60 ----------------STGELTNTVPYLQWVLFGRGWVFPVEKKALYWPELPHPVAYARPAPPNDYFSAAVAYIDAKGIV 123 (133) T ss_pred ----------------CccchhhhhHHhhHhhhcccceeeecccccccCCCCCcccccCCCCCchhhhhhhhhhcccchh Confidence 1222456666666665541 11 4677777 Q ss_pred HHHHHHHHHH Q lcl|NC_021331. 
124 GIVAVKLRSY 133 (147) Q Consensus 124 ~~a~~~~~~~ 133 (147) +-++-+|--- T Consensus 124 e~s~iewlis 133 (133) T protein:vir:41 124 EDSFIEWLIS 133 (133) T ss_pred HHHHHHHhcC Confidence 7777776322 No 106 >protein:vir:1581 Length: 116 # NCBI annotation: minor capsid protein # Family: family:all:899 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695163;swissprot:trembl:o03933;genbank:gi:23455806;uniprot:O03933;genbank:GeneID:955512 Probab=95.86 E-value=0.00011 Score=42.23 Aligned_cols=100 Identities=15% Similarity=0.085 Sum_probs=60.6 Q ss_pred CCcccchHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhh Q lcl|NC_021331. 1 MAKNYTIREFHGNIDAWINAV-DSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIA 79 (147) Q Consensus 1 MAk~~s~~~F~~~i~~f~~~v-~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~ 79 (147) |+-.+.+. ++.+-.++ .+.++.....++.++++.+-.-.|.|||.+..+=+.++.. T Consensus 1 M~ikVkv~-----l~~~~~~~~~~~~~r~Q~~l~~qv~~~m~~YVP~~tg~~~ls~~~~~~~------------------ 57 (116) T protein:vir:15 1 MAFRINVD-----LDGFMDQTSLDNVKRGQYALVNQAMYDMEQFVPKDRPEEPLRQSVHATS------------------ 57 (116) T ss_pred CCceEEee-----hhHhhhhhhHHHHHHHHHHHHHHHHHhhhccCCcccCCcccccceeeec------------------ Confidence 88765433 23333333 3455556777888899999999999998754442221110 Q ss_pred hHHHHHHHHHhcccccceEEEecCchhhhhhhcCCC-----------CCCCchHHHHHHH----HHHHHHHHHHH Q lcl|NC_021331. 80 EGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHS-----------KQAPAGVLGIVAV----KLRSYMAEAIK 139 (147) Q Consensus 80 ~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s-----------~QAp~G~V~~a~~----~~~~~v~~a~~ 139 (147) .. =+|.++.|||.++=||+- .++..-|-..+-. 
+|.+++.++++ T Consensus 58 -------------~~---~~I~y~tPYAr~qyYg~~~~~~~~~~~t~p~ag~~W~eraK~~h~~~w~~~~~k~~~ 116 (116) T protein:vir:15 58 -------------DG---SEITYSTPYAKAQFYGIINDKYPVHNYTTPGTTKRWDLKAKSMFMSSWIDTFTKGMK 116 (116) T ss_pred -------------CC---ceEEecCchhHHHhcccccCCCCcccccCCCCCcchhHHHHhhhHHHHHHHHHHhcC Confidence 00 135579999999988752 2456667765543 45555555544 No 107 >protein:vir:5000 Length: 141 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049974;genbank:gi:9632946;genbank:GeneID:1262109 Probab=95.72 E-value=0.00029 Score=39.99 Aligned_cols=119 Identities=14% Similarity=0.103 Sum_probs=60.0 Q ss_pred CCcccc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcc---------cchhcccceeccCCccccccccC Q lcl|NC_021331. 1 MAKNYT-IREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVD---------TGRYRGNWQVTANKPPLYALNQY 70 (147) Q Consensus 1 MAk~~s-~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVd---------tG~~R~nw~vs~~~~~~~~~~~~ 70 (147) ||.-.+ |.+|..+|......+.+...+.++.-|.-+...+...||.. .|-++.|-.++-. .. T Consensus 1 M~~~~~gl~e~~~~lekl~~~~~~~~~katkAGA~v~~~~L~~~tp~~hy~~~~~~~~~HlaD~I~~~~~--------~~ 72 (141) T protein:vir:50 1 MVGLAEALDEWLKTVASIGNLTPAEQVEITTAGAKVFKKELEEVTREKHYSRKKNPKFGHMADGLAIQST--------NA 72 (141) T ss_pred CccHHHHHHHHHHHHHHhcCCCHHHHHHHHHHHHHHHHHHHHHhcccCCCCCCCCCCCCccccceeeccC--------cc Confidence 774111 33344444443333333344555544444445555567642 2234444433210 01 Q ss_pred CC--CcchhhhhHHHHHHHHHhcccccceEEEecC--chhhhhhhcCCCCCCCchHHHHHHHHH---HHHHHHHHHHHH- Q lcl|NC_021331. 71 DK--HGDKTIAEGKRAIYAILRGGGAVRAIYFSNM--LIYANALEYGHSKQAPAGVLGIVAVKL---RSYMAEAIKESR- 142 (147) Q Consensus 71 d~--~G~~t~~~~~~~i~~~~~~~~~g~~iyi~Nn--~pYA~~LEyG~s~QAp~G~V~~a~~~~---~~~v~~a~~e~k- 142 (147) |. +|..+ .|. .|. .=+|.+||+|++.|.|..||.-+.++. 
+.+++..++++| T Consensus 73 DG~~dg~s~------------VG~--------~~~~~~~~A~f~n~GT~k~~~~hFve~~~~~a~~k~~Vl~A~~~~~k~ 132 (141) T protein:vir:50 73 DGRKNGVST------------VGW--------KNNYHAQNARRLNDGTKKYRADHFVTNVQNDSTVQKKVLLEKKRNTKN 132 (141) T ss_pred ccccCCeee------------ecc--------CCCccceeeeccccCccccCCCchhHHHHHhhhhHHHHHHHHHHHHHH Confidence 11 11111 111 122 345799999999999999999999753 345555455554 Q ss_pred --hhhcC Q lcl|NC_021331. 143 --AKNAL 147 (147) Q Consensus 143 --~~~~~ 147 (147) -|.++ T Consensus 133 ~l~~~~~ 139 (141) T protein:vir:50 133 SLEEKEG 139 (141) T ss_pred HHHhccC Confidence 45555 No 108 >protein:vir:3036 Length: 118 # NCBI annotation: minor capsid protein # Family: family:all:899 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438149;genbank:gi:16271812;genbank:GeneID:929237 Probab=95.66 E-value=9.8e-05 Score=42.56 Aligned_cols=96 Identities=15% Similarity=0.087 Sum_probs=59.4 Q ss_pred CCc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhh Q lcl|NC_021331. 1 MAK-NYTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIA 79 (147) Q Consensus 1 MAk-~~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~ 79 (147) |++ -+++..+...|.. +.+++....++.++++.+-.-.|.+||.||+|=.++ T Consensus 1 m~kV~vdl~~~~~~ls~------~~~~k~Q~~~~~ev~~~~~~YVP~~tG~Lk~S~~i~--------------------- 53 (118) T protein:vir:30 1 MAKVVVELGGIKRKVSP------QALAKGKLIMNNQVMMSMNPYVPYRDGALRGSSRAN--------------------- 53 (118) T ss_pred CceeeechhHHhhhhhH------HHHHHHHHHHHHHHHHHhhcCCCCccCccccceeec--------------------- Confidence 877 4677665544432 333445667777888888888999999999983321 Q ss_pred hHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCC--------------CCCchHHHHHH------HHHHHHHHHHHH Q lcl|NC_021331. 80 EGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSK--------------QAPAGVLGIVA------VKLRSYMAEAIK 139 (147) Q Consensus 80 ~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~--------------QAp~G~V~~a~------~~~~~~v~~a~~ 139 (147) .+ +|.++.|||.++=||+.. ++..-|-.... ..|.+++.+. 
T Consensus 54 --------------~~---~I~Y~tPYAr~qYY~~~~~~~~g~~~~~~~~p~~g~~Wd~R~ka~~~~~~~w~~~~~k~-- 114 (118) T protein:vir:30 54 --------------SV---GVTWSGPHARAQFYGGAYNKYKSFKFKKYTTPGTGKRWDKRALANATIVKDWEKSLLRG-- 114 (118) T ss_pred --------------CC---eeEECCchhhHhhhccccCCCCccccccccCCCCCCcccchhhcchhhhHHHHHHHHHh-- Confidence 01 266899999999997532 34455543222 3354444433 Q ss_pred HHHhhhcC Q lcl|NC_021331. 140 ESRAKNAL 147 (147) Q Consensus 140 e~k~~~~~ 147 (147) +|+ T Consensus 115 -----~g~ 117 (118) T protein:vir:30 115 -----MGF 117 (118) T ss_pred -----cCC Confidence 333 No 109 >protein:vir:9823 Length: 118 # NCBI annotation: putative minor capsid protein # Family: family:all:899 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795585;genbank:gi:28876336;genbank:GeneID:1257873 Probab=95.66 E-value=9.8e-05 Score=42.56 Aligned_cols=96 Identities=15% Similarity=0.087 Sum_probs=59.4 Q ss_pred CCc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhh Q lcl|NC_021331. 1 MAK-NYTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIA 79 (147) Q Consensus 1 MAk-~~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~ 79 (147) |++ -+++..+...|.. +.+++....++.++++.+-.-.|.+||.||+|=.++ T Consensus 1 m~kV~vdl~~~~~~ls~------~~~~k~Q~~~~~ev~~~~~~YVP~~tG~Lk~S~~i~--------------------- 53 (118) T protein:vir:98 1 MAKVVVELGGIKRKVSP------QALAKGKLIMNNQVMMSMNPYVPYRDGALRGSSRAN--------------------- 53 (118) T ss_pred CceeeechhHHhhhhhH------HHHHHHHHHHHHHHHHHhhcCCCCccCccccceeec--------------------- Confidence 877 4677665544432 333445667777888888888999999999983321 Q ss_pred hHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCC--------------CCCchHHHHHH------HHHHHHHHHHHH Q lcl|NC_021331. 80 EGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSK--------------QAPAGVLGIVA------VKLRSYMAEAIK 139 (147) Q Consensus 80 ~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~--------------QAp~G~V~~a~------~~~~~~v~~a~~ 139 (147) .+ +|.++.|||.++=||+.. ++..-|-.... ..|.+++.+. 
T Consensus 54 --------------~~---~I~Y~tPYAr~qYY~~~~~~~~g~~~~~~~~p~~g~~Wd~R~ka~~~~~~~w~~~~~k~-- 114 (118) T protein:vir:98 54 --------------SV---GVTWSGPHARAQFYGGAYNKYKSFKFKKYTTPGTGKRWDKRALANATIVKDWEKSLLRG-- 114 (118) T ss_pred --------------CC---eeEECCchhhHhhhccccCCCCccccccccCCCCCCcccchhhcchhhhHHHHHHHHHh-- Confidence 01 266899999999997532 34455543222 3354444433 Q ss_pred HHHhhhcC Q lcl|NC_021331. 140 ESRAKNAL 147 (147) Q Consensus 140 e~k~~~~~ 147 (147) +|+ T Consensus 115 -----~g~ 117 (118) T protein:vir:98 115 -----MGF 117 (118) T ss_pred -----cCC Confidence 333 No 110 >protein:vir:100223 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025034;genbank:gi:48697267;genbank:GeneID:2948321 Probab=95.53 E-value=0.00027 Score=40.14 Aligned_cols=112 Identities=15% Similarity=0.160 Sum_probs=59.3 Q ss_pred cchHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHhCCc-------ccch---hcccceeccCCcccccc Q lcl|NC_021331. 5 YTIREFHGNIDAWINAVDSGL-------KDCVELFAEKVHTDLVKRSPV-------DTGR---YRGNWQVTANKPPLYAL 67 (147) Q Consensus 5 ~s~~~F~~~i~~f~~~v~~~~-------~~~~r~~a~~l~~~vv~~tPV-------dtG~---~R~nw~vs~~~~~~~~~ 67 (147) +| |...|+.|.+++++-+ .+.++.-|.-+...|...||- ++|. ++.|-.++-. T Consensus 1 ~~---~~~~l~e~l~~lekl~~~~~~~~~k~tkaGA~v~~~~L~~~tp~~~~~~~~~~~~~~HlaD~I~~~~~------- 70 (139) T protein:vir:10 1 MD---MDEALGQWLKQVSKAAQLSVSDQEKITKAGADVYAKELAETTKEKHPNTKGDGGKYGHLSEDISSAAG------- 70 (139) T ss_pred CC---HHHHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHhcccccccCCCCCCCCCcccccceecCc------- Confidence 33 4455555655555443 456666666677777788883 1222 4444333210 Q ss_pred ccCCC--CcchhhhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCchHHHHHHHHHHHHHHHH----HHHH Q lcl|NC_021331. 68 NQYDK--HGDKTIAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGVLGIVAVKLRSYMAEA----IKES 141 (147) Q Consensus 68 ~~~d~--~G~~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~~~~v~~a----~~e~ 141 (147) .+|. .|..+ .|.. 
...| .|.++|+|++.|.|..||.-|.++...-|-+| .+|+ T Consensus 71 -~idg~~~g~~~------------VG~~--~~~~------~Ahf~n~GT~~~~~~hFie~t~~e~~~ev~~a~~~~~ke~ 129 (139) T protein:vir:10 71 -DIDGDHNGSST------------VGFH--NKAH------IARFLNDGTKNIRADHFVDNARDDAKDAVFAAEAEKYQAM 129 (139) T ss_pred -cccccccccce------------eCCC--CCce------eeeeeccCccccCCCchHHHHHHHHHHHHHHHHHHHHHHH Confidence 1111 11111 1111 1122 26899999999999999999988764433333 3333 Q ss_pred HhhhcC Q lcl|NC_021331. 142 RAKNAL 147 (147) Q Consensus 142 k~~~~~ 147 (147) =.+.++ T Consensus 130 l~~~~~ 135 (139) T protein:vir:10 130 IAKANG 135 (139) T ss_pred HhhcCC Confidence 333444 No 111 >protein:vir:79225 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469157;genbank:gi:157835000;genbank:GeneID:5648806 Probab=95.51 E-value=0.00024 Score=40.38 Aligned_cols=134 Identities=13% Similarity=0.141 Sum_probs=68.1 Q ss_pred CCccc----chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccC-CCCcc Q lcl|NC_021331. 1 MAKNY----TIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQY-DKHGD 75 (147) Q Consensus 1 MAk~~----s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~-d~~G~ 75 (147) |+.-+ +...+.+.|.++...++ +...+++.++..+.+.+..+== ..| ..|..- +|.+...... ...+. T Consensus 1 M~~~i~i~~d~~~~~~~L~~l~~~~~-d~~~l~~~ig~~l~~~~~~rF~-~eG---~~W~pl--s~~t~~~r~~~g~~~~ 73 (155) T protein:vir:79 1 MTTRIDVELDDQEVRQRLAVLMRSVT-DTLPVMRGIAAELLAETEFAFM-DEG---PGWPQL--SPATVAAREAKGRGPH 73 (155) T ss_pred CceEEEEEechHHHHHHHHHHHHHhh-hHHHHHHHHHHHHHHHHHHHhh-ccC---CCCCCC--CHHHHHHHhccCCCCC Confidence 87743 44677888888777775 4667788888888887776421 112 124320 1111000000 00111 Q ss_pred hhhhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCC-------CCCchHHHHHH---------HHHHHHHHHHHH Q lcl|NC_021331. 
76 KTIAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSK-------QAPAGVLGIVA---------VKLRSYMAEAIK 139 (147) Q Consensus 76 ~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~-------QAp~G~V~~a~---------~~~~~~v~~a~~ 139 (147) ..+.+. ..+..-+...-.++.+-|.+|++||..-+||-.. -....|+.++- +++..++.+-.+ T Consensus 74 ~iL~~t-G~L~~Si~~~~~~~~v~vGt~~~YA~iHqfGg~~~~~~~v~iPaRpfLG~s~~~~l~~~~~~~I~~~i~~~l~ 152 (155) T protein:vir:79 74 PILQVT-NALARSVTTWADRNEAGIGSNLVYAAIHQFGGDAGRGHQVEIPARRYLPFDENGQLAAGARQSILEVVLTALS 152 (155) T ss_pred Cccccc-hhhhhhhhceecCCEEEEecCchhhhhhhcccccCCCCccccCCccccCCCCccccchHHHHHHHHHHHHHHH Confidence 111111 1111111122235678889999999999999642 34446666543 233333333333 Q ss_pred HHHhh Q lcl|NC_021331. 140 ESRAK 144 (147) Q Consensus 140 e~k~~ 144 (147) |+| T Consensus 153 --r~r 155 (155) T protein:vir:79 153 --RNR 155 (155) T ss_pred --hcC Confidence 555 No 112 >protein:vir:99196 Length: 155 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950453;genbank:gi:119953654;genbank:GeneID:4643056 Probab=95.49 E-value=0.00022 Score=40.63 Aligned_cols=134 Identities=13% Similarity=0.128 Sum_probs=68.6 Q ss_pred CCcc----cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccc-cCCCCcc Q lcl|NC_021331. 1 MAKN----YTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALN-QYDKHGD 75 (147) Q Consensus 1 MAk~----~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~-~~d~~G~ 75 (147) |+.- ++...+.+.|.++...++ +...++++++..+.+.+..+== ..| .-|..- +|.+.... .....+. 
T Consensus 1 Ms~~i~i~~d~~~~~~~L~~l~~~~~-d~~~l~~~ig~~l~~~~~~rF~-pdG---~~W~pl--s~~t~~~r~~~g~~~~ 73 (155) T protein:vir:99 1 MTTRIDVELDDQEVRQRLALLMRSVT-DTLPVMRGIAAELLAETEFAFM-DEG---PGWPQL--SPVTVAAREAKGRGPH 73 (155) T ss_pred CceEEEEEechHHHHHHHHHHHHHhh-hHHHHHHHHHHHHHHHHHHHhh-ccC---CCCCCC--ChHHHHHHhccCCCCC Confidence 8763 345778888888877775 4678888888888887776421 112 124321 11111000 0001111 Q ss_pred hhhhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCC-------CCCchHHHHHH---------HHHHHHHHHHHH Q lcl|NC_021331. 76 KTIAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSK-------QAPAGVLGIVA---------VKLRSYMAEAIK 139 (147) Q Consensus 76 ~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~-------QAp~G~V~~a~---------~~~~~~v~~a~~ 139 (147) ..+.+. ..+..-+...-..+.+-|.+|++||..-+||-.. -....|+.++- +++..++.+-.+ T Consensus 74 ~iL~~t-g~L~~Si~~~~~~~~v~vGtn~~YA~iHqfGg~~~~~~~v~iPaRpfLG~s~~~~l~~e~~~~I~~~i~~~l~ 152 (155) T protein:vir:99 74 PILQVT-NALARSVTTWADRNEAGIGSNLVYAAIHQFGGDAGRGHQVEIPARRYLPFDENGQLAAGARQSILEIVLTALS 152 (155) T ss_pred Ccchhc-hhhhhhhhceecCCEEEEecCccchhhhhcccccCCCCccccCCccccCCCCccccchHHHHHHHHHHHHHHh Confidence 111111 1111111222235678899999999999999542 23345665543 334444443333 Q ss_pred HHH Q lcl|NC_021331. 140 ESR 142 (147) Q Consensus 140 e~k 142 (147) .-| T Consensus 153 ~~~ 155 (155) T protein:vir:99 153 RNR 155 (155) T ss_pred ccC Confidence 333 No 113 >protein:vir:3163 Length: 145 # NCBI annotation: unknown # Family: family:all:28417 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665934;genbank:gi:22091120;genbank:GeneID:951270 Probab=95.48 E-value=0.00014 Score=41.80 Aligned_cols=130 Identities=19% Similarity=0.249 Sum_probs=64.3 Q ss_pred CCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----hCCcccchhcccceeccCCccccccccCCCCcc Q lcl|NC_021331. 1 MAKNYTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVK-----RSPVDTGRYRGNWQVTANKPPLYALNQYDKHGD 75 (147) Q Consensus 1 MAk~~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~-----~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~ 75 (147) |=+ + ..+|.+-.+++...+...+.+++..+++.+.. .+|. 
|+ .|..= ++.+... +.++ T Consensus 1 ~i~--~----~~~i~~~l~~l~~~~~~~l~~i~~~~~~~~~~rf~~~~~p~--G~---~W~pL--s~st~a~----k~~~ 63 (145) T protein:vir:31 1 MVE--D----ENNIPEAREAIQDGLTDGLERLHTITLRELITNMSDGQDAL--GN---PWEPL--KESTIRA----KGSD 63 (145) T ss_pred Ccc--c----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC--CC---CCccc--ChHHHHH----hcCC Confidence 665 1 22333334444445555566666666665544 3443 42 46521 1111000 0111 Q ss_pred hhhhh---HHHHHHHHHhcccccceEEEecCchhhhhhhcCCCC--CCCchHHHHHH----HHHHHHHHHHHHHH-Hhhh Q lcl|NC_021331. 76 KTIAE---GKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSK--QAPAGVLGIVA----VKLRSYMAEAIKES-RAKN 145 (147) Q Consensus 76 ~t~~~---~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~--QAp~G~V~~a~----~~~~~~v~~a~~e~-k~~~ 145 (147) .+.-+ ....+..-+.....++.+.|..|++||...+||..+ ..|..|+.++. +++..++.+.+... ++.. T Consensus 64 ~~L~~tG~L~~Si~~~~~~~~~~~~a~vGtn~~YA~~hqfG~~~~~IPaRPfLG~~~~~~~~~~~~ii~~~i~~~L~~~~ 143 (145) T protein:vir:31 64 TPLIDNSRLLTDINAASMMDRANRMAVIGTNLDYAEHHEFGAPEAGIPARPIFGPAGAYASQQAPDVIGDEIDTNLEGAV 143 (145) T ss_pred CCCccCHHHHHHHHHHhhhcccCceeEecCCchhhhhhccCCcccccCCCCccCCCccchHHHHHHHHHHHHHHHhhhhc Confidence 11111 111122111122346678899999999999999875 88889998765 35555555554432 3222 Q ss_pred cC Q lcl|NC_021331. 146 AL 147 (147) Q Consensus 146 ~~ 147 (147) == T Consensus 144 ~~ 145 (145) T protein:vir:31 144 ID 145 (145) T ss_pred cC Confidence 11 No 114 >protein:vir:103841 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938236;genbank:gi:38229141;genbank:GeneID:2648156 Probab=95.32 E-value=0.00022 Score=40.67 Aligned_cols=135 Identities=14% Similarity=0.142 Sum_probs=70.5 Q ss_pred CCcccc----hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-CcccchhcccceeccCCccccccc-cCCCCc Q lcl|NC_021331. 
1 MAKNYT----IREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRS-PVDTGRYRGNWQVTANKPPLYALN-QYDKHG 74 (147) Q Consensus 1 MAk~~s----~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~t-PVdtG~~R~nw~vs~~~~~~~~~~-~~d~~G 74 (147) |+..++ ...+.+.|+++....+ +....++.++..+.+.+..+= | .|+ -|..- +|.+-... +...++ T Consensus 1 Ms~~i~i~~~~~~~~~~L~~l~~~~~-~~~~l~~~ig~~l~~~~~~rF~p--~G~---~W~pl--sp~t~~~r~k~g~~~ 72 (155) T protein:vir:10 1 MANRIELELVDREVQERLAALYAAVT-DTLPLMRGIAAELLAETEFAFMD--EGP---GWPQL--SPVTVAARAAKGRGA 72 (155) T ss_pred CCceEEEEechHHHHHHHHHHHHHhh-hHHHHHHHHHHHHHHHHHHHHhh--cCC---CCCCC--CccchHHHHhccCCC Confidence 997554 3456667777666664 466788888888877776642 2 121 24321 12111100 011111 Q ss_pred chhhhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCC-------CCCCchHHHHHH-----HHHHHHHHHHHHHH- Q lcl|NC_021331. 75 DKTIAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHS-------KQAPAGVLGIVA-----VKLRSYMAEAIKES- 141 (147) Q Consensus 75 ~~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s-------~QAp~G~V~~a~-----~~~~~~v~~a~~e~- 141 (147) ...+... ..+..-+...-..+.+-|.+|++||..-+||-. +-....|..++. .++.+.|.+.+.+. T Consensus 73 ~~~L~~t-G~L~~Si~~~~~~~~v~vGtn~~YA~iHqfGg~~~~~~~~~iPARPfLG~s~~~e~~~ei~~~I~~~i~~~l 151 (155) T protein:vir:10 73 HPILQVT-NALARSITTRADRDQAQIGSNLSYAAIQQLGGQAGRGRKVTIPARPYLPVLRNGQLKPSARDAVLDVLLAAL 151 (155) T ss_pred CCccccc-hhhhhhhhceecCCEEEEecCcchhhhhhcccccCCCCccccCCccccCCCccccchHHHHHHHHHHHHHHH Confidence 1222111 111111222234567889999999999999953 234446666543 24445555555554 Q ss_pred -Hhh Q lcl|NC_021331. 
142 -RAK 144 (147) Q Consensus 142 -k~~ 144 (147) |+| T Consensus 152 ~~~r 155 (155) T protein:vir:10 152 SQGR 155 (155) T ss_pred hhcC Confidence 666 No 115 >protein:vir:107851 Length: 175 # NCBI annotation: gp31 # Family: family:all:274 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024704;genbank:gi:48696941;genbank:GeneID:2845939 Probab=94.91 E-value=0.0006 Score=38.24 Aligned_cols=137 Identities=10% Similarity=0.037 Sum_probs=67.5 Q ss_pred CCcc----cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CCcccchhcccceeccCCc--------- Q lcl|NC_021331. 1 MAKN----YTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKR-----SPVDTGRYRGNWQVTANKP--------- 62 (147) Q Consensus 1 MAk~----~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~-----tPVdtG~~R~nw~vs~~~~--------- 62 (147) |+-- ++...+.+.|++++.... +...++++|+..+.+....+ .| | =..|..+.-.- T Consensus 1 Ms~~i~i~~~~~~l~~~L~~l~~~~~-d~~~l~~~Ig~~l~~~t~~rF~~e~~P-d----w~p~~p~t~~~r~~~g~~~~ 74 (175) T protein:vir:10 1 MSDFVNFQIDDSALRTRLLQLEQAGH-QKAGAMRKIAQALVLVTEDNFAAQGRP-R----WQALSEATIHMRVGGKKAYK 74 (175) T ss_pred CceeEEEEecHHHHHHHHHHHHHHhc-cHHHHHHHHHHHHHHHHHHHHHhccCC-C----CCCCchhhhhhhhcccccch Confidence 8853 355567777777776665 34567888888887777653 34 1 01111110000 Q ss_pred ---cccccccCCCCcchhhhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCC-------CCCCchHHHHHH----- Q lcl|NC_021331. 63 ---PLYALNQYDKHGDKTIAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHS-------KQAPAGVLGIVA----- 127 (147) Q Consensus 63 ---~~~~~~~~d~~G~~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s-------~QAp~G~V~~a~----- 127 (147) ...........++.++.+. ..+..-+...-..+.+-|..|++||..-.||-. 
+-....|+.++- T Consensus 75 k~~~~~~~~~~~~~~~~~L~~t-G~L~~Si~~~~~~~~v~vGtn~~YAaiHqfGg~~~~~~~v~iPaRpfLG~s~~d~~~ 153 (175) T protein:vir:10 75 KNGELTAAASRRKAGLMILQDS-GQMAASVSTDHDDNSAVIGSNKEYAAIHQFGGQAGRGLKVTIPARPWLPVTADGELQ 153 (175) T ss_pred hhhhhhhhhhhhccCCCcceec-hhhhhhhheeecCCEEEEecChhhhhhhhcccccCCCCccccCCccccCCCcccccc Confidence 0000000001111111111 111111222223567889999999999999965 345556776653 Q ss_pred ----HHHHHHHHHHHHH-HHhh Q lcl|NC_021331. 128 ----VKLRSYMAEAIKE-SRAK 144 (147) Q Consensus 128 ----~~~~~~v~~a~~e-~k~~ 144 (147) +++-..+.+.... +|.| T Consensus 154 ~e~~~~Il~~~~~~l~~~~~~~ 175 (175) T protein:vir:10 154 PEAVEPVLNTILRHLMDAANRR 175 (175) T ss_pred hHHHHHHHHHHHHHHHHHhccC Confidence 3344444433333 3555 No 116 >protein:vir:99833 Length: 190 # NCBI annotation: hypothetical protein # Family: family:all:274 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164071;genbank:gi:56692603;genbank:GeneID:3192561 Probab=94.80 E-value=0.0013 Score=36.32 Aligned_cols=135 Identities=21% Similarity=0.226 Sum_probs=70.4 Q ss_pred CCc---ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CCcccchhcccceeccCCccccccccCCC Q lcl|NC_021331. 1 MAK---NYTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKR-----SPVDTGRYRGNWQVTANKPPLYALNQYDK 72 (147) Q Consensus 1 MAk---~~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~-----tPVdtG~~R~nw~vs~~~~~~~~~~~~d~ 72 (147) |+- .+++..|.+.|+++...++ +.....++++..+.+.+..+ .| | | ..|...- +.+. .+.-. T Consensus 1 M~~i~i~~d~~~~~~~L~~l~~~~~-~~~~l~~~ig~~l~~~~~~rf~~~~~P-d-G---~~W~p~~--~~t~--~rk~~ 70 (190) T protein:vir:99 1 MAGITLEWDGRRALDVLNAGSAALG-DPSGLLQDIGELLLNIHRRRFQAQVSP-D-G---TPWQPLS--PAYL--RRKRK 70 (190) T ss_pred CceeEEEecHHHHHHHHHHHHHHhh-hHHHHHHHHHHHHHHHHHHHHHhcCCC-C-C---CCCcccc--HHHH--HHhhc Confidence 774 2456678888888888776 45678888888888777653 34 2 2 3464321 1110 01111 Q ss_pred CcchhhhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCchH------------------------------ Q lcl|NC_021331. 
73 HGDKTIAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGV------------------------------ 122 (147) Q Consensus 73 ~G~~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~------------------------------ 122 (147) +|+.++.... .+..-+...-..+.+-|.+|++||..-+||-..+.+... T Consensus 71 ~~~~~L~~tg-~L~~Si~~~~~~~~v~vGtn~~yA~iHq~Gg~i~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~ 149 (190) T protein:vir:99 71 NRDKILTLDG-HLRNLLRYQLDGSELLFGSDRPYAAIHHFGGTIQRQARSSTVYFRQNERTGEVGREFVPRRRSNFAQDV 149 (190) T ss_pred CCCccceecH-HHHHHHhheecCcEEEEecCcchhhhhhcCCcccccccchhhhhhhhhhhhhhhcccccccccccchhc Confidence 2222222211 112222222345678889999999999999554443322 Q ss_pred --------------HH---HHHHHHHHHHHHHHHHHHhhhc Q lcl|NC_021331. 123 --------------LG---IVAVKLRSYMAEAIKESRAKNA 146 (147) Q Consensus 123 --------------V~---~a~~~~~~~v~~a~~e~k~~~~ 146 (147) .. --.+++..++.+-..++=.+-+ T Consensus 150 ~~~~~~v~IPaRpfLG~s~~d~~~I~~~i~~~l~~~~~~~~ 190 (190) T protein:vir:99 150 QIGPYTIQMPARPWLGTSSQDDDTILQRVERYLQRALRERA 190 (190) T ss_pred ccccceeeecCcccCCCCHHHHHHHHHHHHHHHHHHHhhcC Confidence 21 1223444444444444422333 No 117 >protein:vir:79091 Length: 175 # NCBI annotation: gp5, phage virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111205;genbank:gi:134288802;genbank:GeneID:4960765 Probab=94.70 E-value=0.00066 Score=38.02 Aligned_cols=132 Identities=11% Similarity=0.083 Sum_probs=68.5 Q ss_pred CCccc----chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CCcccchhcccceeccCCcccc------ Q lcl|NC_021331. 1 MAKNY----TIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKR-----SPVDTGRYRGNWQVTANKPPLY------ 65 (147) Q Consensus 1 MAk~~----s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~-----tPVdtG~~R~nw~vs~~~~~~~------ 65 (147) |+.-+ +...+.+.|.+.+..++ +....+++|+..+......+ .| | |..= +|.+. 
T Consensus 1 Ms~~i~i~~d~~~~~~~L~~l~~~~~-d~~~lm~~Ig~~l~~~t~~rF~~~~~P-d-------W~pl--s~~t~~~r~~~ 69 (175) T protein:vir:79 1 MSDFVNFQIDDSALRTRLLQLEQAGH-QKADAMRKITQALVLVTEDNFAAQGRP-R-------WQAL--SEATIHMRVGG 69 (175) T ss_pred CceEEEEEechHHHHHHHHHHHHHhc-CHHHHHHHHHHHHHHHHHHHHHhcCCC-C-------CCCC--ChHHHHhhccc Confidence 88633 44668888888777765 45678888888888777662 34 1 3210 01000 Q ss_pred -----------ccccCCCCcchhhhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCC-------CCCCchHHHHHH Q lcl|NC_021331. 66 -----------ALNQYDKHGDKTIAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHS-------KQAPAGVLGIVA 127 (147) Q Consensus 66 -----------~~~~~d~~G~~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s-------~QAp~G~V~~a~ 127 (147) ...+....+...+.+. ..+..-+...-..+.+-|.+|++||..-+||-. +-....|+.++- T Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~~L~~t-G~L~~Si~~~~~~~~v~vGtn~~YAaiHqfGg~~~~~~~v~IPARPfLG~s~ 148 (175) T protein:vir:79 70 KKAYKKNGELTAAASRRKAGLMILQDS-GQMAASTATDSGEDYSVIGSNKEYAAIQHFGGQAGRGLKVTIPGRAWLPVTA 148 (175) T ss_pred cccccccccchhhHhhhccCCCcceec-hhhhhhhhheecCCEEEEecCcchhhHhhcccccCCCcccccCcccccCCCc Confidence 0000000111111111 111111222223557889999999999999953 233445555432 Q ss_pred ---------HHHHHHHHHHHHHH-Hhh Q lcl|NC_021331. 128 ---------VKLRSYMAEAIKES-RAK 144 (147) Q Consensus 128 ---------~~~~~~v~~a~~e~-k~~ 144 (147) +++..++.+-.+.+ ++| T Consensus 149 ~de~~~~~~~~I~~~i~~~l~~a~~~~ 175 (175) T protein:vir:79 149 DGELQPEAVEPVLNTILRHLMDAANRR 175 (175) T ss_pred ccchhHHHHHHHHHHHHHHHHHHhccC Confidence 44555555555444 445 No 118 >protein:vir:4833 Length: 140 # NCBI annotation: ORF29 # Family: family:all:1029 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038330;genbank:gi:9634656;genbank:GeneID:1262624 Probab=94.69 E-value=0.00086 Score=37.40 Aligned_cols=115 Identities=16% Similarity=0.112 Sum_probs=60.0 Q ss_pred CCcccchHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHhCCcc------cc---hhcccceeccCCccc Q lcl|NC_021331. 
1 MAKNYTIREFHGNIDAWINAVDS-------GLKDCVELFAEKVHTDLVKRSPVD------TG---RYRGNWQVTANKPPL 64 (147) Q Consensus 1 MAk~~s~~~F~~~i~~f~~~v~~-------~~~~~~r~~a~~l~~~vv~~tPVd------tG---~~R~nw~vs~~~~~~ 64 (147) ||. |...|+.|.+++++ ...+.++.=|.-+...+...||.. || -++.|-.++- + T Consensus 1 M~~------~~d~l~e~~~~v~kl~~~~~~~~~katkAGAkv~~~~L~~~tp~~h~~~r~t~~~~HlaD~I~~~~--~-- 70 (140) T protein:vir:48 1 MTG------LDEALEGWLKTVASIGDLTPAEQAKITTAGAKVFKKELAEVTREKHYSKKKDLKYGHMADGLAVQS--T-- 70 (140) T ss_pred Ccc------HHHHHHHHHHHHHHhccCCHHHHHHHHHHhHHHHHHHHHHhcccCCCCCCCCCCCCcccccceecc--c-- Confidence 774 55555666655544 333444444445555566667753 22 2444444331 0 Q ss_pred cccccCCC--CcchhhhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCchHHHHHHHHH---HHHHHHHHH Q lcl|NC_021331. 65 YALNQYDK--HGDKTIAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGVLGIVAVKL---RSYMAEAIK 139 (147) Q Consensus 65 ~~~~~~d~--~G~~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~---~~~v~~a~~ 139 (147) .+|. .|..+ .|..-. +..=+|.+||+|+|.|.|..||.-+.++- ..++..... T Consensus 71 ----~idg~~dG~s~------------VG~~k~------~~a~~a~f~NdGT~k~~~~hFve~t~~e~~~~~~vl~A~~~ 128 (140) T protein:vir:48 71 ----NVDGRKNGVAT------------VGWKNN------YHAQNARRLNDGTKKYRADHFVTNVQNDSAVRDKVLLAEKE 128 (140) T ss_pred ----cccccccccee------------ecccCC------CceeEEeecccCccccCCCchHHHHHHhhhhHHHHHHHHHH Confidence 1111 12211 111111 12345899999999999999999999753 344444433 Q ss_pred H---HHhhhcC Q lcl|NC_021331. 140 E---SRAKNAL 147 (147) Q Consensus 140 e---~k~~~~~ 147 (147) + +=-|+|. 
T Consensus 129 ~y~~~l~kk~~ 139 (140) T protein:vir:48 129 EYEKLIRKKGG 139 (140) T ss_pred HHHHHHHhhcC Confidence 3 3334455 No 119 >protein:vir:4859 Length: 140 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049399;genbank:gi:9632427;genbank:GeneID:1258496 Probab=94.26 E-value=0.0013 Score=36.45 Aligned_cols=121 Identities=12% Similarity=0.017 Sum_probs=62.2 Q ss_pred CCcccc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcc------cc---hhcccceeccCCccccccccC Q lcl|NC_021331. 1 MAKNYT-IREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVD------TG---RYRGNWQVTANKPPLYALNQY 70 (147) Q Consensus 1 MAk~~s-~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVd------tG---~~R~nw~vs~~~~~~~~~~~~ 70 (147) ||.--+ |.+|..+|......+.+...+.++.-|.-+...+...||.. || -++.|-.++- . .+ T Consensus 1 M~~~~d~l~e~~~~lekl~~~~~~~~~katkAGA~v~~~~L~~~tp~~h~~~~~t~~~~HlaD~I~~~~--~------~i 72 (140) T protein:vir:48 1 MTGLDEALEGWLKTVASIGDLTPAEQAKITTAGAKVFKEELAEVTRQKHYSNKKHLKYGHMADGLSVQS--T------NV 72 (140) T ss_pred CccHHHHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCCCCCCcchhceeecc--c------cc Confidence 775211 44444444444433444455556655555556666678852 22 2444443320 0 11 Q ss_pred CC--CcchhhhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCchHHHHHHHHH---HHHHHHHHHHHH--- Q lcl|NC_021331. 71 DK--HGDKTIAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGVLGIVAVKL---RSYMAEAIKESR--- 142 (147) Q Consensus 71 d~--~G~~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~---~~~v~~a~~e~k--- 142 (147) |. .|..+ .|.... +..=+|.+||+|++.|.|..||.-+.++. ..++.....+.| T Consensus 73 Dg~~~g~s~------------VG~~kk------~~a~~A~f~n~GT~k~~~~hFve~~~~e~~~k~~vl~A~~~~~~~~l 134 (140) T protein:vir:48 73 DGRKNGVST------------VGWVNR------YHAQNARRLNDGTKKYRADHFVTNVQNDSAVQTKVLLAEKEEYEKLI 134 (140) T ss_pred ccccCceee------------eccCCC------cceeeeeccccCccccCCCchhHHHHHhhhhHHHHHHHHHHHHHHHH Confidence 11 11111 111100 22456899999999999999999999753 345544433333 Q ss_pred hhhcC Q lcl|NC_021331. 
143 AKNAL 147 (147) Q Consensus 143 ~~~~~ 147 (147) -|+++ T Consensus 135 ~~~~~ 139 (140) T protein:vir:48 135 RKKGG 139 (140) T ss_pred HhhcC Confidence 34555 No 120 >protein:vir:81067 Length: 119 # NCBI annotation: p12 # Family: family:all:2714 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285682;genbank:gi:156535145;genbank:GeneID:5247112 Probab=91.36 E-value=0.00071 Score=37.86 Aligned_cols=84 Identities=18% Similarity=0.039 Sum_probs=51.1 Q ss_pred HHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhhhHHHHHHHHHhcccccceEE-Ee---cCchhhhhhh Q lcl|NC_021331. 36 VHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIAEGKRAIYAILRGGGAVRAIY-FS---NMLIYANALE 111 (147) Q Consensus 36 l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~~~~~~i~~~~~~~~~g~~iy-i~---Nn~pYA~~LE 111 (147) +-++...+-|++||.||.|..+..+.-.+ ..|-.+| ++ =..||..-+| T Consensus 1 ~rDeakarv~~~~G~Lr~sIY~ay~~~~S----------------------------~dG~~~Y~Vswn~rkAPhghlvE 52 (119) T protein:vir:81 1 MRESAKAFVNDETGKLRSNLYVAYSPEES----------------------------TNGVQTYAVSWRKKAAPHGHLLE 52 (119) T ss_pred CCcccccccCCCccchhhhheeeeccccC----------------------------CCCeEEEEeeccCCcCCcccccc Confidence 55566678999999999998765322111 1122344 32 3578999999 Q ss_pred cCC------------------------CCCCCchHHHHHHH-HHHHHHHHHHHHHHhhhcC Q lcl|NC_021331. 112 YGH------------------------SKQAPAGVLGIVAV-KLRSYMAEAIKESRAKNAL 147 (147) Q Consensus 112 yG~------------------------s~QAp~G~V~~a~~-~~~~~v~~a~~e~k~~~~~ 147 (147) ||| +.-.+..|+|.++. 
...++.+.+.+..+.|+.= T Consensus 53 ~Ghw~~~~~~~~~dG~w~~~~~~l~~~~~vPa~pFlRpA~da~~~~a~~~~~~r~~~rv~E 113 (119) T protein:vir:81 53 FGHWQTHAAYKGKDGEWYSSSVKLVNPKWIPARPFLRPGYDSVAMQIPDIAKAAGAKKYAE 113 (119) T ss_pred cceeeeeeeeeccCceeeecCccccCceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHH Confidence 993 45677789998886 3344444343333333332 No 121 >protein:vir:10367 Length: 119 # NCBI annotation: conserved phage protein # Family: family:all:2714 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858959;genbank:gi:32128424;genbank:GeneID:2648366 Probab=91.03 E-value=0.00081 Score=37.54 Aligned_cols=84 Identities=18% Similarity=0.039 Sum_probs=50.7 Q ss_pred HHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhhhHHHHHHHHHhcccccceEE-Ee---cCchhhhhhh Q lcl|NC_021331. 36 VHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIAEGKRAIYAILRGGGAVRAIY-FS---NMLIYANALE 111 (147) Q Consensus 36 l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~~~~~~i~~~~~~~~~g~~iy-i~---Nn~pYA~~LE 111 (147) +-++...+-|++||.||.|..+..+.-.++ .|-.+| ++ =..||..-+| T Consensus 1 ~rDeakarv~~~~G~Lr~sIY~ay~~~~S~----------------------------dG~~~Y~Vswn~rkAPhghlvE 52 (119) T protein:vir:10 1 MRESAKAFVNDETGKLRSNLYVAYSTEEST----------------------------NGVQTYAVSWRKKAAPHGHLLE 52 (119) T ss_pred CCcccccccCCCccchhhhheeeeccccCC----------------------------CCEEEEEeecCCCcCCcccccc Confidence 555666789999999999987654322111 122344 32 3578999999 Q ss_pred cCC------------------------CCCCCchHHHHHHH-HHHHHHHHHHHHHHhhhcC Q lcl|NC_021331. 112 YGH------------------------SKQAPAGVLGIVAV-KLRSYMAEAIKESRAKNAL 147 (147) Q Consensus 112 yG~------------------------s~QAp~G~V~~a~~-~~~~~v~~a~~e~k~~~~~ 147 (147) ||| +.-.+..|+|.++. 
...++.+.+.+..+.|+.= T Consensus 53 ~Ghw~~~~~~~~~dG~w~~~~~~l~~~~~vPa~pFlRpA~da~~~~a~~~~~~r~~~rv~E 113 (119) T protein:vir:10 53 FGHWQTHAAYKGKDGEWYSSSVKLVNPKWIPARPFLRPGYDSVAMQIPDIAKAAGAKKYAE 113 (119) T ss_pred cceeeeeeeeeccCceeeecCccccCceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHH Confidence 993 24667789998886 3344444333333333322 No 122 >protein:vir:100652 Length: 134 # NCBI annotation: 77ORF029 # Family: family:all:589 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958610;genbank:gi:41189542;genbank:GeneID:2743798 Probab=90.33 E-value=0.012 Score=31.17 Aligned_cols=116 Identities=15% Similarity=0.177 Sum_probs=75.7 Q ss_pred CCccc-chHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHhCCc--ccchhcccceeccCCccccccccCCCCcc Q lcl|NC_021331. 1 MAKNY-TIREFHGNIDAW--INAVDSGLKDCVELFAEKVHTDLVKRSPV--DTGRYRGNWQVTANKPPLYALNQYDKHGD 75 (147) Q Consensus 1 MAk~~-s~~~F~~~i~~f--~~~v~~~~~~~~r~~a~~l~~~vv~~tPV--dtG~~R~nw~vs~~~~~~~~~~~~d~~G~ 75 (147) |+-.+ -+.+..++|++- -.++....+..+++.+..++..++...+| |||.....-.+| .|-. .+|- T Consensus 1 MsvevkGv~eil~~LE~k~g~~~~~ri~dkAL~~age~v~~~~K~~~~~fkDTGati~ev~~s--~p~~-------~~G~ 71 (134) T protein:vir:10 1 MSVKVTGDKALERELEKHFGIKEMVKVQDKALIAGAKVIVEEIKKQLKPSEDSGALISEIGRT--EPEW-------IKGK 71 (134) T ss_pred CeEEeecHHHHHHHHHHhhchhhhhhhhhHHHHHHhHHHHHHHHhhcCccccccceeccEeec--Ceee-------cCCc Confidence 88654 467888888876 46788889999999999999999998888 999998887765 2211 1121 Q ss_pred hhhhhHHHHHHHHHhcccccceEEEecC-----chhhhhhhcCCCC-CC-----Cc--hHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021331. 76 KTIAEGKRAIYAILRGGGAVRAIYFSNM-----LIYANALEYGHSK-QA-----PA--GVLGIVAVKLRSYMAEAIKESR 142 (147) Q Consensus 76 ~t~~~~~~~i~~~~~~~~~g~~iyi~Nn-----~pYA~~LEyG~s~-QA-----p~--G~V~~a~~~~~~~v~~a~~e~k 142 (147) . +|-|... -.|..-.||||+. .+ |. 
|.++-++..-...+.+.+++-= T Consensus 72 r--------------------~V~vgW~G~~~R~~ivHLnE~Gyt~~r~Gk~i~PrG~G~i~~a~~~~e~~~~~~ik~eL 131 (134) T protein:vir:10 72 R--------------------TVTIRWRGPFERFRIVHLIENGHVEKKSGKFVKPKAMGGINRAIRQGQNKYFETLKREL 131 (134) T ss_pred e--------------------EEEEEEEcCCceeeEEEeeecceeecCCCCeeccchhhHHHHHHHhhhHHHHHHHHHHH Confidence 1 2223232 2355667999972 22 44 5666677666555554444433 Q ss_pred hhh Q lcl|NC_021331. 143 AKN 145 (147) Q Consensus 143 ~~~ 145 (147) -|| T Consensus 132 ~kl 134 (134) T protein:vir:10 132 KKL 134 (134) T ss_pred hcC Confidence 344 No 123 >protein:vir:102190 Length: 93 # NCBI annotation: gp21 # Family: family:all:2713 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655217;genbank:gi:109522797;genbank:GeneID:4157429 Probab=88.99 E-value=0.0059 Score=32.79 Aligned_cols=91 Identities=16% Similarity=0.169 Sum_probs=64.4 Q ss_pred HHHHHHHHHHHHHHHHHHhCCc--ccchhcccceeccCCccccccccCCCCcchhhhhHHHHHHHHHhcccccceEEEec Q lcl|NC_021331. 25 LKDCVELFAEKVHTDLVKRSPV--DTGRYRGNWQVTANKPPLYALNQYDKHGDKTIAEGKRAIYAILRGGGAVRAIYFSN 102 (147) Q Consensus 25 ~~~~~r~~a~~l~~~vv~~tPV--dtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~~~~~~i~~~~~~~~~g~~iyi~N 102 (147) +....+-.|.++-..++...|= .||-.|....-+++- .|. .--+||++- T Consensus 1 ~~~~~d~aa~~le~~aK~nApW~DRTg~AR~~l~~~~~~-----------~g~------------------~~~~i~lsh 51 (93) T protein:vir:10 1 MKATSNYHAVEGTAHMKEHAPWTDRTGAARAGLHAVAST-----------PQP------------------DRYEIVFAH 51 (93) T ss_pred CchhhhHHHHHHHHHHhcCCCccccchhhhhhhcccccc-----------cCC------------------ceEEEEEec Confidence 4444555566777788888884 477777765432211 010 113799999 Q ss_pred CchhhhhhhcCCCCCCCchHHHHHHHHHHHHHHHHHHHHHhhhc Q lcl|NC_021331. 
103 MLIYANALEYGHSKQAPAGVLGIVAVKLRSYMAEAIKESRAKNA 146 (147) Q Consensus 103 n~pYA~~LEyG~s~QAp~G~V~~a~~~~~~~v~~a~~e~k~~~~ 146 (147) +++|..+||.+++++ .-+++.|++.+.+-|-+=.+++-+|+- T Consensus 52 ~v~Yg~~LE~a~~~k--yaIl~Ptv~~~~~~i~~g~~~ll~~l~ 93 (93) T protein:vir:10 52 TVHYGIWLEIANSGR--YEIIMPTVHHEGKLMAQRLRGLLGRLR 93 (93) T ss_pred CeeccceEEeecCCC--ccchhhhHHHHHHHHHHHHHHHHHhcC Confidence 999999999999765 358889998888777777787777777 No 124 >protein:vir:78894 Length: 105 # NCBI annotation: gp10 # Family: family:all:29989 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468850;genbank:gi:157325424;genbank:GeneID:5601891 Probab=88.89 E-value=0.0017 Score=35.82 Aligned_cols=100 Identities=8% Similarity=-0.003 Sum_probs=54.3 Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhhhHHHH Q lcl|NC_021331. 5 YTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIAEGKRA 84 (147) Q Consensus 5 ~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~~~~~~ 84 (147) .|+.+|..++.+-..+-.-...-.+. .++++..---+|.+||.|+.|=..++ T Consensus 1 ~~f~~f~~~~~k~l~kr~L~~~g~vq---~EvlR~~~PyvP~~tG~Lk~S~~l~t------------------------- 52 (105) T protein:vir:78 1 MSFSSFKDAVIDDIHNKALSTAAKAG---GELVELAQPVTPILYGDLRRSSYFKI------------------------- 52 (105) T ss_pred CCcccccchHHHHHHHhcCCCCchhh---HHHHHHhCCCCcccccccccccccce------------------------- Confidence 34455654444332221111001111 15566666678999999999854321 Q ss_pred HHHHHhcccccceEEEec-CchhhhhhhcCCCCCCCchHHHHHHH----HHHHHHHHHHHH Q lcl|NC_021331. 85 IYAILRGGGAVRAIYFSN-MLIYANALEYGHSKQAPAGVLGIVAV----KLRSYMAEAIKE 140 (147) Q Consensus 85 i~~~~~~~~~g~~iyi~N-n~pYA~~LEyG~s~QAp~G~V~~a~~----~~~~~v~~a~~e 140 (147) ....|.++|=.| -+|||.+.=|.+ |-..-|.+.... 
+++++|+-.++- T Consensus 53 ------vIgsg~I~y~~~~~aPYAr~qYYe~--~Rg~~WfErm~a~hk~~I~~~vegg~~~ 105 (105) T protein:vir:78 53 ------IIQKNSIVARVFSLTPYARRQYYEN--RRNPRWYEMAVSYGIQSINQIVEGGMRL 105 (105) T ss_pred ------eecCCeeEeeccccCchhhhhhhcc--cCCCchhHHhhhcchhHHHHHHhcccCC Confidence 124466666322 489999999877 355558776664 455555533322 No 125 >protein:vir:1988 Length: 156 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050635;genbank:gi:9633522;genbank:GeneID:2636282 Probab=87.67 E-value=0.034 Score=28.61 Aligned_cols=135 Identities=12% Similarity=0.179 Sum_probs=68.0 Q ss_pred CCccc----chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CCcccchhcccceeccCCccccccccCC Q lcl|NC_021331. 1 MAKNY----TIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKR-----SPVDTGRYRGNWQVTANKPPLYALNQYD 71 (147) Q Consensus 1 MAk~~----s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~-----tPVdtG~~R~nw~vs~~~~~~~~~~~~d 71 (147) |+-.+ +...|.+.|.++....+. ..++++|+..+.+.+..+ +| |+|. .|..- +|.+-...... T Consensus 1 ms~~i~~~~d~~~l~~~L~~l~~~~~~--~~l~~~Ig~~l~~~~~~rf~~~~~P-d~G~---~W~pl--s~~t~~~r~~~ 72 (156) T protein:vir:19 1 MSLDMNVAVDVRRIQLALDELGTVTRD--RAIPRVMAAALLSSTEQAFERQADP-DTGK---GWEAW--SDSWLAWRQDH 72 (156) T ss_pred CeEEEEEeecHHHHHHHHHHHHhhhcc--HHHHHHHHHHHHHHHHHHHHhcCCC-CCCC---CCccc--ChHHHHHhhcc Confidence 77654 455667777776544432 257888888887777653 23 3342 34320 11111000001 Q ss_pred -CCcchhhhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCC--------chHHHHHHH---HHHHHHHHHHH Q lcl|NC_021331. 72 -KHGDKTIAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAP--------AGVLGIVAV---KLRSYMAEAIK 139 (147) Q Consensus 72 -~~G~~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp--------~G~V~~a~~---~~~~~v~~a~~ 139 (147) ..+..++-.. .....-+...-.++.+-|.+|++||..-+||-..+.. ..|+.++-. 
++..++.+.++ T Consensus 73 ~~~~~~~L~~t-g~L~~Si~~~~~~~~v~vGt~~~yA~vHqfG~~~~~~~~~~~iPaRpfLG~s~~d~~~I~~~i~~~l~ 151 (156) T protein:vir:19 73 GFVPGSILTLH-GDLARSITTDYGQDYALIGSPKIYAAIHQWGGTPDMAPRPAGVPARPYMGLDKTGEQEIFDAIRKRVS 151 (156) T ss_pred CCCCCcchhhh-HHHHHHhhheecCCEEEEecchhhhHHhhcCcccccCCCccccCCccccCCCHHHHHHHHHHHHHHHH Confidence 0112222221 1222222223346678889999999999999775433 344444443 44444444444 Q ss_pred HHHhh Q lcl|NC_021331. 140 ESRAK 144 (147) Q Consensus 140 e~k~~ 144 (147) .+=.| T Consensus 152 ~~~~~ 156 (156) T protein:vir:19 152 AALRQ 156 (156) T ss_pred HHhhC Confidence 44333 No 126 >protein:vir:3848 Length: 159 # NCBI annotation: hypothetical protein # Family: family:all:1029 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050154;swissprot:trembl:q9t1f3;genbank:gi:9633046;uniprot:Q9T1F3;genbank:GeneID:1262148 Probab=87.51 E-value=0.036 Score=28.50 Aligned_cols=116 Identities=18% Similarity=0.200 Sum_probs=61.3 Q ss_pred CCcccchHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHhCCcc---c--------------------ch Q lcl|NC_021331. 1 MAKNYTIREFHGNIDAWINAVDSGLK-------DCVELFAEKVHTDLVKRSPVD---T--------------------GR 50 (147) Q Consensus 1 MAk~~s~~~F~~~i~~f~~~v~~~~~-------~~~r~~a~~l~~~vv~~tPVd---t--------------------G~ 50 (147) |+ +|...|+.|.+++++.+. +..+.-|.-....|..-||.. . |- T Consensus 2 m~------~~~~~l~~~l~~v~k~~~~~~~~k~kiTkAGAkv~~e~L~~~Tp~~h~~~~k~~~~~~~~~k~~~~~~~~~H 75 (159) T protein:vir:38 2 AN------DMGEFYNNWVNEVEKGMKLSVEDKAKITGEGAEAFSTVLHDHTPRSNEIYRRGRSAGHANAKHHNRNRKTKH 75 (159) T ss_pred cc------hHHHHHHHHHHHHHHhcCCCHHHHHHHHHHhHHHHHHHHHHhcccCCCccccccccccccccccCcCcCCCc Confidence 54 366668888888866322 333433444444555567762 2 22 Q ss_pred hcccceeccCCccccccccCC--CCcchhhhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCc-----hHH Q lcl|NC_021331. 51 YRGNWQVTANKPPLYALNQYD--KHGDKTIAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPA-----GVL 123 (147) Q Consensus 51 ~R~nw~vs~~~~~~~~~~~~d--~~G~~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~-----G~V 123 (147) ++.|-.++-. 
..+| .+|+.+ .|..-. +..=||.+|+.|++.|.|+ .|| T Consensus 76 laD~I~~~~~-------~~iDg~~dG~s~------------VGw~~~------~~a~~a~f~NdGT~~m~~k~~~gdHFv 130 (159) T protein:vir:38 76 LQDSITYKPG-------YTADKLHTGDTD------------VGFEGK------YYDFLAKIVNNGQHHMSPKRYKNMHFL 130 (159) T ss_pred cccceeeecC-------ccccccccceee------------ecccCC------ccceEeeecccCccccCCCCccCChhH Confidence 3333333211 0111 112211 111111 2245689999999999886 599 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhcC Q lcl|NC_021331. 124 GIVAVKLRSYMAEAIKESRAKNAL 147 (147) Q Consensus 124 ~~a~~~~~~~v~~a~~e~k~~~~~ 147 (147) .-+.++...-|-+|..+.-.++== T Consensus 131 ekt~~~~k~~Vl~A~~~~~~~il~ 154 (159) T protein:vir:38 131 DKAQQEAKKSVAEAELKAYKEVMN 154 (159) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhh Confidence 999988876666555554333322 No 127 >protein:vir:9513 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835560;genbank:gi:30043947;genbank:GeneID:1260542 Probab=86.84 E-value=0.029 Score=29.02 Aligned_cols=116 Identities=17% Similarity=0.202 Sum_probs=75.8 Q ss_pred CCccc-chHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHhCCc--ccchhcccceeccCCccccccccCCCCcc Q lcl|NC_021331. 1 MAKNY-TIREFHGNIDAW--INAVDSGLKDCVELFAEKVHTDLVKRSPV--DTGRYRGNWQVTANKPPLYALNQYDKHGD 75 (147) Q Consensus 1 MAk~~-s~~~F~~~i~~f--~~~v~~~~~~~~r~~a~~l~~~vv~~tPV--dtG~~R~nw~vs~~~~~~~~~~~~d~~G~ 75 (147) |+--+ -+.+..++|++- -.++....+..+++.+..++..++...+| |||.....-.+| .|-. .+|. T Consensus 1 msvevkGv~eil~~le~k~g~~~~~ri~nkAL~~age~v~~~~K~~~~~fkDTG~t~~ev~~s--~p~~-------~~G~ 71 (134) T protein:vir:95 1 MSVKVIGDKALERELEKRFGIKEMVKVQDKALIAGAKVIVEEVKKQLKPSKDTGALINEVSFS--KPEW-------INGK 71 (134) T ss_pred CeEEEecHHHHHHHHHHhhchhhhhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeccEEec--Ceee-------cCCc Confidence 87644 367777888866 45788889999999999999999999998 999999987765 2211 1111 Q ss_pred hhhhhHHHHHHHHHhcccccceEEEecC-----chhhhhhhcCCCCC------CCc--hHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021331. 
76 KTIAEGKRAIYAILRGGGAVRAIYFSNM-----LIYANALEYGHSKQ------APA--GVLGIVAVKLRSYMAEAIKESR 142 (147) Q Consensus 76 ~t~~~~~~~i~~~~~~~~~g~~iyi~Nn-----~pYA~~LEyG~s~Q------Ap~--G~V~~a~~~~~~~v~~a~~e~k 142 (147) . +|-|... -.|..-.||||+.. .|. |.++-++..-...+.+.+++-= T Consensus 72 r--------------------~V~vgW~G~~~R~~iiHLNE~Gytr~~~Gk~i~PrG~G~i~~a~~~~e~~~~~~ik~eL 131 (134) T protein:vir:95 72 R--------------------TITVHWRGSKDRYKIVHLIEYGHVQKGTGKFIKPKAMGGVNRAIRQGQNKYFETLKREL 131 (134) T ss_pred e--------------------EEEEEEEcCCceeEEEEeecccceecccCCccCcchhhHHHHHHHhhhHHHHHHHHHHH Confidence 1 2222232 33556679998653 354 5566677666555554444433 Q ss_pred hhh Q lcl|NC_021331. 143 AKN 145 (147) Q Consensus 143 ~~~ 145 (147) -|| T Consensus 132 ~kl 134 (134) T protein:vir:95 132 KKL 134 (134) T ss_pred hcC Confidence 344 No 128 >protein:vir:101302 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908835;genbank:gi:118725099;genbank:GeneID:4555873 Probab=86.84 E-value=0.029 Score=29.02 Aligned_cols=116 Identities=17% Similarity=0.202 Sum_probs=75.8 Q ss_pred CCccc-chHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHhCCc--ccchhcccceeccCCccccccccCCCCcc Q lcl|NC_021331. 1 MAKNY-TIREFHGNIDAW--INAVDSGLKDCVELFAEKVHTDLVKRSPV--DTGRYRGNWQVTANKPPLYALNQYDKHGD 75 (147) Q Consensus 1 MAk~~-s~~~F~~~i~~f--~~~v~~~~~~~~r~~a~~l~~~vv~~tPV--dtG~~R~nw~vs~~~~~~~~~~~~d~~G~ 75 (147) |+--+ -+.+..++|++- -.++....+..+++.+..++..++...+| |||.....-.+| .|-. .+|. T Consensus 1 msvevkGv~eil~~le~k~g~~~~~ri~nkAL~~age~v~~~~K~~~~~fkDTG~t~~ev~~s--~p~~-------~~G~ 71 (134) T protein:vir:10 1 MSVKVIGDKALERELEKRFGIKEMVKVQDKALIAGAKVIVEEVKKQLKPSKDTGALINEVSFS--KPEW-------INGK 71 (134) T ss_pred CeEEEecHHHHHHHHHHhhchhhhhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeccEEec--Ceee-------cCCc Confidence 87644 367777888866 45788889999999999999999999998 999999987765 2211 1111 Q ss_pred hhhhhHHHHHHHHHhcccccceEEEecC-----chhhhhhhcCCCCC------CCc--hHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021331. 
76 KTIAEGKRAIYAILRGGGAVRAIYFSNM-----LIYANALEYGHSKQ------APA--GVLGIVAVKLRSYMAEAIKESR 142 (147) Q Consensus 76 ~t~~~~~~~i~~~~~~~~~g~~iyi~Nn-----~pYA~~LEyG~s~Q------Ap~--G~V~~a~~~~~~~v~~a~~e~k 142 (147) . +|-|... -.|..-.||||+.. .|. |.++-++..-...+.+.+++-= T Consensus 72 r--------------------~V~vgW~G~~~R~~iiHLNE~Gytr~~~Gk~i~PrG~G~i~~a~~~~e~~~~~~ik~eL 131 (134) T protein:vir:10 72 R--------------------TITVHWRGSKDRYKIVHLIEYGHVQKGTGKFIKPKAMGGVNRAIRQGQNKYFETLKREL 131 (134) T ss_pred e--------------------EEEEEEEcCCceeEEEEeecccceecccCCccCcchhhHHHHHHHhhhHHHHHHHHHHH Confidence 1 2222232 33556679998653 354 5566677666555554444433 Q ss_pred hhh Q lcl|NC_021331. 143 AKN 145 (147) Q Consensus 143 ~~~ 145 (147) -|| T Consensus 132 ~kl 134 (134) T protein:vir:10 132 KKL 134 (134) T ss_pred hcC Confidence 344 No 129 >protein:vir:105773 Length: 131 # NCBI annotation: gp14 # Family: family:all:10996 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224152;genbank:gi:62362227;genbank:GeneID:3342526 Probab=83.51 E-value=0.029 Score=28.99 Aligned_cols=112 Identities=11% Similarity=0.057 Sum_probs=67.2 Q ss_pred CCcccchHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhh Q lcl|NC_021331. 1 MAKNYTIREFHGNIDAWINAVDS-GLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIA 79 (147) Q Consensus 1 MAk~~s~~~F~~~i~~f~~~v~~-~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~ 79 (147) |= ---+..-...+.++++.+.. .....+....+.+...--.-|||||+.|-+|=-= .+..+|+. T Consensus 1 ik-V~Gi~~~~~nl~~~i~~I~~~K~~Ral~~al~~~~~~AA~~TPIDTSTLiNSQfr-----------ei~~ngtr--- 65 (131) T protein:vir:10 1 MP-VKGIKRIQMNTRRVLSDIAGIRTEKVLYLVMNAGANHAAVITPVKSSTLINSQYK-----------KLEPIPSG--- 65 (131) T ss_pred CC-cchHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHhhhhhccccchhhhccccce-----------eeeccCce--- Confidence 32 12345566677777777663 4555555555555555566799999999988432 22223221 Q ss_pred hHHHHHHHHHhcccccceEEEecCchhhhhhhc------------C----CCCCCCchHHHHHHHHH-HHHHHHHHHHH- Q lcl|NC_021331. 
80 EGKRAIYAILRGGGAVRAIYFSNMLIYANALEY------------G----HSKQAPAGVLGIVAVKL-RSYMAEAIKES- 141 (147) Q Consensus 80 ~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEy------------G----~s~QAp~G~V~~a~~~~-~~~v~~a~~e~- 141 (147) -+--+.+++.||.++.. | |+.-|-..|..-.+++- .+.++..++|- T Consensus 66 ----------------itGRVGYSAnYA~yVHda~Gklkgqprp~gkgn~w~p~ae~eFL~kgfe~~~~d~i~avik~e~ 129 (131) T protein:vir:10 66 ----------------MIGRVGYTANYAAAVNAAKGKLKGKPRPDGSGNYWDPNGEPDFLRKGFERDGLNEIKAIIRQGY 129 (131) T ss_pred ----------------eEEeeccceeeeeeeecCccccCCCcCCCCCcceecCCCChhhhhhhhhccchHHHHHHHhhhc Confidence 12336689999999965 3 44466677888788653 44555544432 Q ss_pred Hh Q lcl|NC_021331. 142 RA 143 (147) Q Consensus 142 k~ 143 (147) |- T Consensus 130 k~ 131 (131) T protein:vir:10 130 KV 131 (131) T ss_pred CC Confidence 44 No 130 >protein:vir:9647 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795409;genbank:gi:28876182;genbank:GeneID:1257731 Probab=81.92 E-value=0.081 Score=26.55 Aligned_cols=116 Identities=11% Similarity=0.082 Sum_probs=71.8 Q ss_pred CCcccchHHHHHHHHH-HHH-HHHHHHHHHHHHHHHHHHHHHHHhCCc--ccchhcccceeccCCccccccccCCCCcch Q lcl|NC_021331. 1 MAKNYTIREFHGNIDA-WIN-AVDSGLKDCVELFAEKVHTDLVKRSPV--DTGRYRGNWQVTANKPPLYALNQYDKHGDK 76 (147) Q Consensus 1 MAk~~s~~~F~~~i~~-f~~-~v~~~~~~~~r~~a~~l~~~vv~~tPV--dtG~~R~nw~vs~~~~~~~~~~~~d~~G~~ 76 (147) ||.=--+.+..++|++ |-+ ++....+..+++.+..++..++...|| |||..-..-++| .|-+ .+|.. T Consensus 4 ~aevkGv~Eilk~lE~klG~~~v~ri~nkAL~~~ge~v~~~lK~~~~~f~DTG~t~dev~~s--~~~~-------~~G~r 74 (132) T protein:vir:96 4 FANLKGVEELLANMEKKLGPAKVNRVVNRSLKEIGKELEPSFKSAISIYKRTGETTESAVVS--GVRR-------EDGIP 74 (132) T ss_pred cccccCHHHHHHHHHHhhCHHHHHHHhHHHHHHHHHHHHHHHHHhhhhhhhcchhhcceeec--Ceee-------cCCce Confidence 7752345777888887 666 589999999999999999999999998 999988877665 2221 11221 Q ss_pred hhhhHHHHHHHHHhcccccceEEEecCch--hhhhh-hcCCCC---CCCchHHHHHHHHHHHHHH-HHHHHHHhhhcC Q lcl|NC_021331. 
77 TIAEGKRAIYAILRGGGAVRAIYFSNMLI--YANAL-EYGHSK---QAPAGVLGIVAVKLRSYMA-EAIKESRAKNAL 147 (147) Q Consensus 77 t~~~~~~~i~~~~~~~~~g~~iyi~Nn~p--YA~~L-EyG~s~---QAp~G~V~~a~~~~~~~v~-~a~~e~k~~~~~ 147 (147) + |-|..+-| |...| ||||.. ..+-|.++-++..-..++- ..-.|++. .| T Consensus 75 ~--------------------V~VgW~GpR~~ivHLNE~GyGk~~~PrG~G~I~~a~~~se~~~~~~~~~elkk--~l 130 (132) T protein:vir:96 75 K--------------------VKLGFTTPRWNIVHLQELEYGWKHNRRGVGVIRRYSDILETIYPRGIRDKLKR--GF 130 (132) T ss_pred E--------------------EEecccCCceeEEeeecccccCCcCCCcchHHHHHHHhhhhHHHHHHHHHHHH--Hh Confidence 1 22222222 22334 588753 4445778877776653332 22233322 23 No 131 >protein:vir:105825 Length: 108 # NCBI annotation: gp9 # Family: family:all:3937 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655770;genbank:gi:109522093;genbank:GeneID:4157633 Probab=74.82 E-value=0.013 Score=30.93 Aligned_cols=101 Identities=26% Similarity=0.339 Sum_probs=49.0 Q ss_pred CCccc----chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcch Q lcl|NC_021331. 1 MAKNY----TIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDK 76 (147) Q Consensus 1 MAk~~----s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~ 76 (147) ||.+- -+..|--.|++|-+.-+ +.+-+.+...++...-+..+||.+|.+|.|-||+-.+...+. .+-| T Consensus 1 ma~gpt~knplakfgi~lddfdklpe--vnqgvnef~dev~aawk~nspv~~g~yrdsvqvterstnkgr----gkvg-- 72 (108) T protein:vir:10 1 MANGPTRKNPLAKFGVRLDDFDKLPE--VNQGVNEFIDEVVAAWKNNSPVGTGAYRDSVQVTERSTNKGR----GKVG-- 72 (108) T ss_pred CCCCCccccchhhhccchhhhhccch--hhhhHHHHHHHHHHhhhcCCCccccccccceeeccccccccc----cccc-- Confidence 87642 35667777777755322 233455555666677788999999999999999765533221 0001 Q ss_pred hhhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCC---CCCCchHHHHHHHHHHHHHHHHHHH Q lcl|NC_021331. 77 TIAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHS---KQAPAGVLGIVAVKLRSYMAEAIKE 140 (147) Q Consensus 77 t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s---~QAp~G~V~~a~~~~~~~v~~a~~e 140 (147) -+-|-|.-+|||.. .-||. 
-....|+=.-|-.+ T Consensus 73 -------------------------atdpqahlvefgs~hndeyapa------qktakqfggtay~d 108 (108) T protein:vir:10 73 -------------------------ATDPQAHLVEFGSAHNDEYAPA------QKTAKQFGGTAYGD 108 (108) T ss_pred -------------------------Ccchhhhhhhhhccccccccch------hhhHHhhcccccCC Confidence 12233344444421 01111 00000000000000 No 132 >protein:vir:102608 Length: 108 # NCBI annotation: gp9 # Family: family:all:3937 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655005;genbank:gi:109392195;genbank:GeneID:4157230 Probab=74.82 E-value=0.013 Score=30.93 Aligned_cols=101 Identities=26% Similarity=0.339 Sum_probs=49.0 Q ss_pred CCccc----chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcch Q lcl|NC_021331. 1 MAKNY----TIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDK 76 (147) Q Consensus 1 MAk~~----s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~ 76 (147) ||.+- -+..|--.|++|-+.-+ +.+-+.+...++...-+..+||.+|.+|.|-||+-.+...+. .+-| T Consensus 1 ma~gpt~knplakfgi~lddfdklpe--vnqgvnef~dev~aawk~nspv~~g~yrdsvqvterstnkgr----gkvg-- 72 (108) T protein:vir:10 1 MANGPTRKNPLAKFGVRLDDFDKLPE--VNQGVNEFIDEVVAAWKNNSPVGTGAYRDSVQVTERSTNKGR----GKVG-- 72 (108) T ss_pred CCCCCccccchhhhccchhhhhccch--hhhhHHHHHHHHHHhhhcCCCccccccccceeeccccccccc----cccc-- Confidence 87642 35667777777755322 233455555666677788999999999999999765533221 0001 Q ss_pred hhhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCC---CCCCchHHHHHHHHHHHHHHHHHHH Q lcl|NC_021331. 77 TIAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHS---KQAPAGVLGIVAVKLRSYMAEAIKE 140 (147) Q Consensus 77 t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s---~QAp~G~V~~a~~~~~~~v~~a~~e 140 (147) -+-|-|.-+|||.. .-||. 
-....|+=.-|-.+ T Consensus 73 -------------------------atdpqahlvefgs~hndeyapa------qktakqfggtay~d 108 (108) T protein:vir:10 73 -------------------------ATDPQAHLVEFGSAHNDEYAPA------QKTAKQFGGTAYGD 108 (108) T ss_pred -------------------------Ccchhhhhhhhhccccccccch------hhhHHhhcccccCC Confidence 12233344444421 01111 00000000000000 No 133 >protein:vir:98636 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039927;genbank:gi:126011102;genbank:GeneID:4818472 Probab=67.28 E-value=0.25 Score=23.84 Aligned_cols=116 Identities=12% Similarity=0.116 Sum_probs=68.8 Q ss_pred CCcccchHHHHHHHHH-HHH-HHHHHHHHHHHHHHHHHHHHHHHhCCc--ccchhcccceeccCCccccccccCCCCcch Q lcl|NC_021331. 1 MAKNYTIREFHGNIDA-WIN-AVDSGLKDCVELFAEKVHTDLVKRSPV--DTGRYRGNWQVTANKPPLYALNQYDKHGDK 76 (147) Q Consensus 1 MAk~~s~~~F~~~i~~-f~~-~v~~~~~~~~r~~a~~l~~~vv~~tPV--dtG~~R~nw~vs~~~~~~~~~~~~d~~G~~ 76 (147) ||.=--+.+..++|++ |-+ ++....+..+++.+..+...++...+| |||..-..-.+| .|-+ .+|.. T Consensus 10 ~aevkGv~Eilk~lE~klG~~~~~ri~nkAL~~~ge~v~~~lK~~~~~fkDTGat~dev~~s--~p~~-------~~G~r 80 (138) T protein:vir:98 10 FANLKGVEELLANMEKKLGPAKVNRVVNRSLKEIGKELEPSFKSAISIYKRTGETTESAVVS--GVRR-------EDGIP 80 (138) T ss_pred cccccCHHHHHHHHHHhhCHHhhhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeeeeeec--Ceee-------cCCce Confidence 6652345677777776 544 488889999999999999999999985 999977665544 3221 11221 Q ss_pred hhhhHHHHHHHHHhcccccceEEEecCch--hhhhh-hcCCCC---CCCchHHHHHHHHHHHHHHHHHH-HHHhhhcC Q lcl|NC_021331. 77 TIAEGKRAIYAILRGGGAVRAIYFSNMLI--YANAL-EYGHSK---QAPAGVLGIVAVKLRSYMAEAIK-ESRAKNAL 147 (147) Q Consensus 77 t~~~~~~~i~~~~~~~~~g~~iyi~Nn~p--YA~~L-EyG~s~---QAp~G~V~~a~~~~~~~v~~a~~-e~k~~~~~ 147 (147) + |-|...-| |...| ||||.. 
..+-|.++-++..-....-+.++ |++ .+| T Consensus 81 ~--------------------V~igW~GpR~~ivHLNE~GyGk~i~PrG~G~I~ka~~~se~~y~~~vk~el~--k~l 136 (138) T protein:vir:98 81 K--------------------VKLGFTTPRWNIVHLQELEYGWKHNRRGVGVIRRYSDILETIYPRGIRDKLK--RGF 136 (138) T ss_pred E--------------------EEEeeecCeeeEEeeecccccCCcCCCcchHHHHHHHhhhHHHHHHHHHHHH--HHh Confidence 1 22222222 22344 588854 34457788777766544443332 222 233 No 134 >protein:vir:6216 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:10886 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852596;genbank:gi:31415856;genbank:GeneID:1489214 Probab=65.57 E-value=0.28 Score=23.60 Aligned_cols=111 Identities=17% Similarity=0.158 Sum_probs=63.7 Q ss_pred CCcccchHHHHHHHHHHHHH--H-HHHHHHHHHHHHHHHHHHHHHhCCc----ccchhcccceeccCCccccccccCCCC Q lcl|NC_021331. 1 MAKNYTIREFHGNIDAWINA--V-DSGLKDCVELFAEKVHTDLVKRSPV----DTGRYRGNWQVTANKPPLYALNQYDKH 73 (147) Q Consensus 1 MAk~~s~~~F~~~i~~f~~~--v-~~~~~~~~r~~a~~l~~~vv~~tPV----dtG~~R~nw~vs~~~~~~~~~~~~d~~ 73 (147) ||-|. .-|+..++.+-.- | ++---+.+.+.|---+.++.-.-|+ ..|-+|.+.+|-+.. T Consensus 1 m~sNN--NGFae~~~~~~tl~kVd~kvs~e~L~eAA~~f~~KL~P~Ip~Sl~kkk~HlrD~lkVvvk~------------ 66 (125) T protein:vir:62 1 MASNN--NGFAEALEDINTLLRVNKKVSLDALDEAAKYFASKLKPKINVSNKNKRTHLRDSLKVVVKD------------ 66 (125) T ss_pred CCCCc--hhHHHHHHHhhhhhhhhhhhhHHHHHHHHHHHHHhhccccChhhhhhhhhcceeeeEEeeC------------ Confidence 99753 2355555544332 2 2222234455555556666655664 357888888874311 Q ss_pred cchhhhhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCch------HHHHHHHHHHHHHHH-HHHHHHhhh Q lcl|NC_021331. 
74 GDKTIAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAG------VLGIVAVKLRSYMAE-AIKESRAKN 145 (147) Q Consensus 74 G~~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G------~V~~a~~~~~~~v~~-a~~e~k~~~ 145 (147) ..-.+-+.+..=|=..+|.||+.|.++| ||.-|+..-..-+++ +++.+=-++ T Consensus 67 --------------------d~V~V~Fed~a~yW~f~EnGt~~~~~~g~vkaqhf~~~Tf~~nk~kI~~iM~kki~d~m 125 (125) T protein:vir:62 67 --------------------DRVSVEFKDEAWYWYLVEHGHKKAKGKGRVKGKHFVQNTFDAEGDKIADIMAQKIINRM 125 (125) T ss_pred --------------------CeEEEEEcchhhhhhhhhccccccccccccchhhhhhccHHhhHHHHHHHHHHHHHhhC Confidence 1224667888999999999999997777 666676543322222 233333344 No 135 >protein:vir:78163 Length: 92 # NCBI annotation: hypothetical protein # Family: family:all:29889 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294805;genbank:gi:149882826;genbank:GeneID:5309191 Probab=64.94 E-value=0.027 Score=29.16 Aligned_cols=90 Identities=21% Similarity=0.280 Sum_probs=43.4 Q ss_pred CCcccchHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhh Q lcl|NC_021331. 1 MAKNYTIREFHGNIDAWINAVD--SGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTI 78 (147) Q Consensus 1 MAk~~s~~~F~~~i~~f~~~v~--~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~ 78 (147) ||..++-.+ .|.+++- ..+....+-.|.+.+..-+...|||||.+|...++.--. ..+..... T Consensus 1 madaftpNp------~~FDqIl~s~~VrALt~gaAe~aLa~AKAsAPVDTGAYRDGL~iE~~q--~~~RtT~M------- 65 (92) T protein:vir:78 1 MADAFTPNP------TWFDQIMRTPKVRALVDGVAEETLADAKASAPVDTGAYRDGLHIEHRQ--GRSRETAM------- 65 (92) T ss_pred CCCccCCCh------hHHHHhhcccchhhhhhhhhhhhhhhhcccCcccccccccccchhhhh--ccccceeE------- Confidence 999876332 3443332 235567788888899999999999999999998764211 11100000 Q ss_pred hhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCC Q lcl|NC_021331. 
79 AEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHS 115 (147) Q Consensus 79 ~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s 115 (147) +.|..+-..+-=+-+=.-+..|.--.| T Consensus 66 ----------VVG~D~KTlLvESrTGNLakalk~~rs 92 (92) T protein:vir:78 66 ----------VVGSDEKTLLIESRTGNLARSVKRRRS 92 (92) T ss_pred ----------EeecCcceeeeecccchHHHHHhhhcC Confidence 000000000000111111222222222 No 136 >protein:vir:79179 Length: 155 # NCBI annotation: gp39, phage virion morphogenesis protein # Family: family:all:370 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111070;genbank:gi:134288746;genbank:GeneID:4960698 Probab=55.19 E-value=0.49 Score=22.30 Aligned_cols=134 Identities=13% Similarity=0.112 Sum_probs=67.1 Q ss_pred CCcccchHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHh-----CCcccchhcccceeccCCccccccccCCCCc Q lcl|NC_021331. 1 MAKNYTIREFHGNIDAWINAVD-SGLKDCVELFAEKVHTDLVKR-----SPVDTGRYRGNWQVTANKPPLYALNQYDKHG 74 (147) Q Consensus 1 MAk~~s~~~F~~~i~~f~~~v~-~~~~~~~r~~a~~l~~~vv~~-----tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G 74 (147) |.+ ++.++...|..+...+. .....++++|+..+......+ +| | |. .|..- .+.+......+++| T Consensus 1 m~~--~~~~l~~~l~~ll~~l~~~~~~~l~r~Ig~~l~~~t~~Rf~~q~~P-D-G~---~W~pr--k~~~~~~~~~~~~g 71 (155) T protein:vir:79 1 MTD--DLQALERWAGGLLAKLSPAARRQLLRELGRDLRRAQQSRVAAQRNP-D-GS---AYEPR--KVKAGGKRLREKAG 71 (155) T ss_pred Cch--HHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCC-C-CC---CCccc--chhhhhhhhhcccC Confidence 987 46788888888887764 344567888888887777652 45 2 11 23210 11110001111222 Q ss_pred chhhhhHHHHH--HHHHhcccccceEE---EecCchhhhhhhcCCCC----------CCCchHHHHHHHHHHHHHHHHHH Q lcl|NC_021331. 75 DKTIAEGKRAI--YAILRGGGAVRAIY---FSNMLIYANALEYGHSK----------QAPAGVLGIVAVKLRSYMAEAIK 139 (147) Q Consensus 75 ~~t~~~~~~~i--~~~~~~~~~g~~iy---i~Nn~pYA~~LEyG~s~----------QAp~G~V~~a~~~~~~~v~~a~~ 139 (147) ..........+ ...+...-..+.+- ...|++||..--||-.. -....|..++-+....|.+-+.. 
T Consensus 72 ~~~~~~m~~~l~~a~~l~~~~~~d~a~Vg~~Gs~~~yAaiHQfG~~~r~~~~~~~v~iPaRp~LGls~~d~~~I~~~i~~ 151 (155) T protein:vir:79 72 RVKREAMFRKLRTARYLRIDVDSTGLAIGFDERLSRIARVHQEGQKAPVEPGGPLAQYPVRVVLGFSDADRELVRDRLLR 151 (155) T ss_pred cccchhhhhhhhhhheeeeeecCcEEEEEecCcchhhhhhhhcCCcccCCCCCcccccccccccCCCHHHHHHHHHHHHH Confidence 11100000000 00011111123333 38999999999999653 23335666776666666555555 Q ss_pred HHHh Q lcl|NC_021331. 140 ESRA 143 (147) Q Consensus 140 e~k~ 143 (147) -+.- T Consensus 152 ~l~r 155 (155) T protein:vir:79 152 ELTR 155 (155) T ss_pred HhhC Confidence 4432 No 137 >protein:vir:2026 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046769;genbank:gi:9630340;genbank:GeneID:1261511 Probab=53.51 E-value=0.53 Score=22.11 Aligned_cols=130 Identities=12% Similarity=0.208 Sum_probs=61.7 Q ss_pred CCcccchHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHH-----hCCcccchhcccceeccCCccccccccCCCCc Q lcl|NC_021331. 1 MAKNYTIREFHGNIDAWINAVD-SGLKDCVELFAEKVHTDLVK-----RSPVDTGRYRGNWQVTANKPPLYALNQYDKHG 74 (147) Q Consensus 1 MAk~~s~~~F~~~i~~f~~~v~-~~~~~~~r~~a~~l~~~vv~-----~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G 74 (147) |-. +..+...|.+...++. ......+++|+..+...... .+| | |. .|...- +.+ ....-..+ T Consensus 1 ~~~---~~~l~~~L~~ll~~l~~~~~~~l~~~Ig~~l~~~~~~rf~~q~~P-d-G~---~W~p~k--~~~--~~~k~g~~ 68 (150) T protein:vir:20 1 MNE---FKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAP-D-GT---PYAPRQ--QQS--VRKKTGRV 68 (150) T ss_pred Cch---HHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCC-C-CC---CCcccc--hHH--HHHhccCC Confidence 432 3445555555555543 34556788888887777655 345 2 21 243211 111 00000011 Q ss_pred chhhhhHHHHHHHHHhcccccc--eEEE--ecCchhhhhhhcCCCC----------CCCchHHHHHHHHHHHHHHHHHHH Q lcl|NC_021331. 75 DKTIAEGKRAIYAILRGGGAVR--AIYF--SNMLIYANALEYGHSK----------QAPAGVLGIVAVKLRSYMAEAIKE 140 (147) Q Consensus 75 ~~t~~~~~~~i~~~~~~~~~g~--~iyi--~Nn~pYA~~LEyG~s~----------QAp~G~V~~a~~~~~~~v~~a~~e 140 (147) ...+..... ....+...-..+ +|++ ..|.+||..--||-+. 
-....|+.++-...+.|.+-+..- T Consensus 69 ~~~l~~~~~-l~~sl~~~~~~~~~~vg~~~Gs~~~yAa~HQfG~~~~~~~~~~~~~iPaRp~LG~s~~d~~~i~~~i~~~ 147 (150) T protein:vir:20 69 KRKMFAKLI-TSRFLHIRASPEQASMEFYGGKSPKIASVHQFGLSEENRKDGKKIDYPARPLLGFTGEDVQMIEEIILAH 147 (150) T ss_pred Cccccchhh-hhhhhheeecCcEEEEEeeCCcchhhhhhhhcccccccccCCCceeccccccCCCCHHHHHHHHHHHHHH Confidence 111111110 111111111122 3433 7899999999999542 233456667777766666655555 Q ss_pred HHh Q lcl|NC_021331. 141 SRA 143 (147) Q Consensus 141 ~k~ 143 (147) ++- T Consensus 148 l~k 150 (150) T protein:vir:20 148 LER 150 (150) T ss_pred HhC Confidence 543 No 138 >protein:vir:101563 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958111;genbank:gi:41057657;genbank:GeneID:2716820 Probab=50.61 E-value=0.48 Score=22.35 Aligned_cols=97 Identities=14% Similarity=0.057 Sum_probs=41.3 Q ss_pred HHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhhhHHHHHHHHHhcccccceEEEecCch Q lcl|NC_021331. 26 KDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIAEGKRAIYAILRGGGAVRAIYFSNMLI 105 (147) Q Consensus 26 ~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~p 105 (147) .++-|+=...++..+... .|.++-++... ..|..|. ....+.. .. .+-+.| ++..- T Consensus 1 m~v~r~~L~~~~~~l~~~------------~V~VGi~~~a~--y~d~~g~-~~~~g~~--~~--~~~~~G-----~pva~ 56 (155) T protein:vir:10 1 MSVTRRGLTLPKDRYKSM------------SVKAGVLAGAT--YPDESGK-KLADGTI--LK--KDPRAG-----LPVAM 56 (155) T ss_pred CcchHHHHHHHHHHhhCC------------eeEEeecCCCC--CCccccc-hhhhhhh--hc--cccccC-----cchhh Confidence 112222122222333221 13333332211 1122221 1111100 00 000001 13344 Q ss_pred hhhhhhcCCCCCCCchHHHHHHHHHHHHHHHHHHHHHhhhcC Q lcl|NC_021331. 
106 YANALEYGHSKQAPAGVLGIVAVKLRSYMAEAIKESRAKNAL 147 (147) Q Consensus 106 YA~~LEyG~s~QAp~G~V~~a~~~~~~~v~~a~~e~k~~~~~ 147 (147) +|.++|||+..-.|..|+|.++.+.+.-+.+.+.++ .+.++ T Consensus 57 ia~~~e~G~~~IP~RPFlr~t~~~~~~~~~~~l~~~-~~~~~ 97 (155) T protein:vir:10 57 IAMALNYGTSKLPARPFMEKTIADRSAEWIKGLTVM-MTMGY 97 (155) T ss_pred hhhhhhcCCCCCCCcchhHHHHHHHHHHHHHHHHHH-HHcCC Confidence 677999999999999999999987655444444332 12233 No 139 >protein:vir:77650 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:YP_022741;genbank:gi:47835022;genbank:GeneID:2821447 Probab=50.35 E-value=0.61 Score=21.75 Aligned_cols=97 Identities=14% Similarity=0.010 Sum_probs=39.8 Q ss_pred CCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhhh Q lcl|NC_021331. 1 MAKNYTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIAE 80 (147) Q Consensus 1 MAk~~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~~ 80 (147) |.. .|+-...++.++.. ...+ |++ +.... ..|..|.... T Consensus 1 m~~-------------------------~r~~l~~~~~~l~~------~~v~----VGi--~~~a~--y~d~~~~~~~-- 39 (155) T protein:vir:77 1 MSV-------------------------TRRGLTLPKDRYRS------MSVK----AGV--LAGAT--YPDESGKKLA-- 39 (155) T ss_pred Ccc-------------------------hHHHHHHHHHHHhc------CceE----Eee--cCCCC--Cccccchhhh-- Confidence 333 11111111222211 1122 222 22110 1111111100 Q ss_pred HHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCchHHHHHHHHHHHHHHHHHHHHHhhhcC Q lcl|NC_021331. 81 GKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGVLGIVAVKLRSYMAEAIKESRAKNAL 147 (147) Q Consensus 81 ~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~~~~v~~a~~e~k~~~~~ 147 (147) +.+... +.+. 
-=.+..-+|.++|||+..-.|..|+|.++.+.+.-+.+.+..+ .+.++ T Consensus 40 ----~~~~~~--~~~~--~G~pva~ia~~~e~G~~~IP~RPFlr~t~~~~~~~~~~~l~~~-~~~~~ 97 (155) T protein:vir:77 40 ----DGSILK--KDPR--AGLPVAMIAMALNYGTSKLPARPFMEKTIADRSAEWIKGLTVM-MTMGY 97 (155) T ss_pred ----hhhhcc--cccc--ccccHhhhhhhhhcCCCCCCCCchhhHHHHHHHHHHHHHHHHH-HHccC Confidence 000000 0000 0124455888999999999999999999987655444444332 11233 No 140 >protein:vir:8432 Length: 149 # NCBI annotation: gp27 # Family: family:all:30885 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818328;genbank:gi:29566764;genbank:GeneID:1260117 Probab=49.68 E-value=0.63 Score=21.68 Aligned_cols=121 Identities=22% Similarity=0.337 Sum_probs=64.7 Q ss_pred CCccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHhCCcccchhcccceeccCCccccccccCCCCcchhh Q lcl|NC_021331. 1 MAKNY-TIREFHGNIDAWINAVDSGLKDCVELFAEKVHT-DLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTI 78 (147) Q Consensus 1 MAk~~-s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~-~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~ 78 (147) .-.++ |-+++..-+.+|+..+|+.--.+-.-.-+-++. .--..-|+.||.+++-..-.. -. T Consensus 16 lldgvsssrdlrrivqrfindveqtwhdvwdvsmlgvlaqqtgvphpyqtgdykahikkkk-----------------lt 78 (149) T protein:vir:84 16 LLDGVSSSRDLRRIVQRFINDVEQTWHDVWDVSMLGVLAQQTGVPHPYQTGDYKAHIKKKK-----------------LT 78 (149) T ss_pred hhhccccchHHHHHHHHHHHHHHHHHHhHhhHHHHHHHHhhcCCCCCccccchhhhhhhhh-----------------HH Confidence 22333 334566667788888877766665544443332 222245788999987643110 00 Q ss_pred hhHHHHHHHHHhcccccceEEEecCchhhhhhhcCCCC-----C------CCchHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_021331. 79 AEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSK-----Q------APAGVLGIVAVKLRSYMAEAIKESRAK 144 (147) Q Consensus 79 ~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~-----Q------Ap~G~V~~a~~~~~~~v~~a~~e~k~~ 144 (147) +-.--.|..++.|.-+-..+| ||-+-|..+|||.-. . 
.|..-+++ .|++.+|+++ ++|-| T Consensus 79 amqkirikkflkggmpiglvy--nndekahwieygtkrdrpgsrspwgpntptpafei-mqrvarimne---dvryr 149 (149) T protein:vir:84 79 AMQKIRIKKFLKGGMPIGLVY--NNDEKAHWIEYGTKRDRPGSRSPWGPNTPTPAFEI-MQRVARIMNE---DVRYR 149 (149) T ss_pred HHHHHHHHHHhhcCCceeEEe--cCCcchhhhhhccccCCCCCCCCCCCCCCChhHHH-HHHHHHHhhh---hcccC Confidence 111123445555544444555 999999999999633 1 22222222 3455555553 44555 No 141 >protein:vir:1164 Length: 156 # NCBI annotation: predicted tail completion # Family: family:all:370 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490613;genbank:gi:17313233;genbank:GeneID:927308 Probab=48.50 E-value=0.53 Score=22.11 Aligned_cols=137 Identities=15% Similarity=0.219 Sum_probs=70.0 Q ss_pred CCcccchHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHH-----hCCcccchhcccceeccCCccccccccCCCCc Q lcl|NC_021331. 1 MAKNYTIREFHGNIDAWINAVD-SGLKDCVELFAEKVHTDLVK-----RSPVDTGRYRGNWQVTANKPPLYALNQYDKHG 74 (147) Q Consensus 1 MAk~~s~~~F~~~i~~f~~~v~-~~~~~~~r~~a~~l~~~vv~-----~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G 74 (147) |.+ ++..+.+.|...+.+++ .....++++|+..+...... ..| | |. .|...-...-..-....+..+ T Consensus 1 m~~--~~~~l~~~L~~ll~~L~~~~~~~l~r~Ig~~l~~~t~~Rf~~q~~P-d-G~---~W~p~~~~~~~~~~~~~~~~~ 73 (156) T protein:vir:11 1 MAD--SLEALEDWAGPILRALEPGPRAALARSLARDLRRSQQKRVMAQRNP-D-GS---AYEPRKKRELRGKQGRIRRKI 73 (156) T ss_pred Cch--hHHHHHHHHHHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHhhcCC-C-CC---CCcccchHHHhhhccccccch Confidence 998 56788888888877664 23445788888877776655 245 2 21 243211000000000000000 Q ss_pred chhhhhHHHHHHHHHhcccccc--eEE-EecCchhhhhhhcCCCCC----------CCchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021331. 75 DKTIAEGKRAIYAILRGGGAVR--AIY-FSNMLIYANALEYGHSKQ----------APAGVLGIVAVKLRSYMAEAIKES 141 (147) Q Consensus 75 ~~t~~~~~~~i~~~~~~~~~g~--~iy-i~Nn~pYA~~LEyG~s~Q----------Ap~G~V~~a~~~~~~~v~~a~~e~ 141 (147) .+-..... ...+...-..+ +|. 
...|..||..--||-..+ ....|+.++-+..+.|.+-+.+-+ T Consensus 74 --~m~~~l~~-~~~l~~~~~~~~a~vg~~Gs~~~yA~iHQfG~~~~~~~~~~~v~iPaRp~LG~s~~d~~~i~~~i~~~l 150 (156) T protein:vir:11 74 --KMFQKLRT-VRYLRAKGDAQAITVSFAGRIARIARVHQYGLRDRAEPGAPEVSYAQRLLLGFDSSDMETIQNGILAHI 150 (156) T ss_pred --hhhhhhhh-hheeeeeecCcEEEEEecCCchhhhhhhcccccccccCCCCcccccccccCCCCHHHHHHHHHHHHHHH Confidence 00000000 00011011122 232 379999999999997632 333566677777777777666666 Q ss_pred HhhhcC Q lcl|NC_021331. 142 RAKNAL 147 (147) Q Consensus 142 k~~~~~ 147 (147) +...=+ T Consensus 151 ~~~~~~ 156 (156) T protein:vir:11 151 DANSPI 156 (156) T ss_pred hhcCCC Confidence 666666 No 142 >protein:vir:100312 Length: 152 # NCBI annotation: tail synthesis protein S # Family: family:all:370 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655481;genbank:gi:109289949;genbank:GeneID:4157355 Probab=45.04 E-value=0.64 Score=21.65 Aligned_cols=132 Identities=13% Similarity=0.154 Sum_probs=66.2 Q ss_pred CCcccchHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHh-----CCcccchhcccceeccCCccccccccCCCCc Q lcl|NC_021331. 1 MAKNYTIREFHGNIDAWINAVD-SGLKDCVELFAEKVHTDLVKR-----SPVDTGRYRGNWQVTANKPPLYALNQYDKHG 74 (147) Q Consensus 1 MAk~~s~~~F~~~i~~f~~~v~-~~~~~~~r~~a~~l~~~vv~~-----tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G 74 (147) |+++ +..+...|+.....+. .....++++|+..+......+ +|= | .-|...-.. .-.......+| T Consensus 1 M~~~--~~~~~~~L~~ll~~L~~~~r~~l~~~Ig~~l~~~t~~Rf~~q~~PD--G---~pW~p~k~~--~~~~k~~~~~~ 71 (152) T protein:vir:10 1 MSEP--IEQVKTAFDSLLNNISKPRRRLMYQQIGRELARSQRRRIKAQQNPD--G---SAYEPRKKP--KKGVKSKIKSG 71 (152) T ss_pred CchH--HHHHHHHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHhccCCC--C---CCCchhhhh--hhhhcccccch Confidence 9984 5677777777777654 244567888888887777653 452 1 124332110 00000000000 Q ss_pred chhhhhHHHHHHHHHhc--ccccceE-EEecCchhhhhhhcCCCCCC-----------CchHHHHHHHHHHHHHHHHHHH Q lcl|NC_021331. 
75 DKTIAEGKRAIYAILRG--GGAVRAI-YFSNMLIYANALEYGHSKQA-----------PAGVLGIVAVKLRSYMAEAIKE 140 (147) Q Consensus 75 ~~t~~~~~~~i~~~~~~--~~~g~~i-yi~Nn~pYA~~LEyG~s~QA-----------p~G~V~~a~~~~~~~v~~a~~e 140 (147) . +-..... ...+.. ...+-+| |+..|++||.---||-..+. ...|+.++-.....|.+-+.+- T Consensus 72 ~--m~~~L~~-a~~l~~~a~~~~~~Vg~~Gt~~~yAaiHQfG~~~r~~~~~~~~v~iPaRp~LG~s~~d~~~I~~~i~~~ 148 (152) T protein:vir:10 72 K--MFDKITQ-PRFMRLRLESEGVSLGYEGGDAVIARIHQQGLIGRVRKDWDLKVKYASRELLGFTDDDLQMIEDYMINI 148 (152) T ss_pred h--HHHhhhh-cceeeeeecCcEEEEEecCCchhhhhhhccCccccccCCCCcceeccccccCCCCHHHHHHHHHHHHHH Confidence 0 0000000 000000 0112234 34899999999999965422 2356667776666666655555 Q ss_pred HHhh Q lcl|NC_021331. 141 SRAK 144 (147) Q Consensus 141 ~k~~ 144 (147) +.+- T Consensus 149 l~~a 152 (152) T protein:vir:10 149 LAGS 152 (152) T ss_pred HhcC Confidence 5443 No 143 >protein:vir:5257 Length: 148 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852762;genbank:gi:31544037;uniprot:Q7Y5T8;genbank:GeneID:2753554 Probab=44.67 E-value=0.8 Score=21.12 Aligned_cols=91 Identities=15% Similarity=0.124 Sum_probs=42.6 Q ss_pred CCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhhh Q lcl|NC_021331. 1 MAKNYTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIAE 80 (147) Q Consensus 1 MAk~~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~~ 80 (147) |+-.+.... ..++++.+. ++ ++ ++..|-++-|.....+...++|- T Consensus 1 M~~~~k~~~--~~~~~l~~~--------l~--------~l------------~~~~v~VGi~~~~~~~~~~~~g~----- 45 (148) T protein:vir:52 1 MAVTVTANF--SAAKQLIEQ--------MK--------SL------------KEKAVYVGFPAEFDEKVKGSENF----- 45 (148) T ss_pred Ccccccccc--HHHHHHHHH--------HH--------Hh------------hCCeEEEEeecCcCCCCCCCCCC----- Confidence 776443221 112221111 11 11 12344444443222222222221 Q ss_pred HHHHHHHHHhcccccceEEEecCchhhhhhhcCCCCCCCchHHHHHHHHH----HHHHHHHHHHH-HhhhcC Q lcl|NC_021331. 
81 GKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQAPAGVLGIVAVKL----RSYMAEAIKES-RAKNAL 147 (147) Q Consensus 81 ~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~QAp~G~V~~a~~~~----~~~v~~a~~e~-k~~~~~ 147 (147) +++-.|...|||+..-.|..|+|.++.+- .+.+.++++.. .++.+| T Consensus 46 ---------------------~vA~ia~~~E~G~~~IP~Rpflr~t~~~~~~~~~~~~~~~~~~~~~~~~~L 96 (148) T protein:vir:52 46 ---------------------NLASLAAVLEFGNEHIPARPFLRQTLEENQEKYTALFIQWFDQGVPAAQIY 96 (148) T ss_pred ---------------------CHHHHHHHHhcCCCCCCCcchhHHHHHHHHHHHHHHHHHHHHcCCCHHHHH Confidence 45667788899999999999999988654 33333332210 112222 No 144 >protein:vir:79115 Length: 148 # NCBI annotation: tail completion protein gpS # Family: family:all:370 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165266;genbank:gi:145708091;genbank:GeneID:5247126 Probab=43.71 E-value=0.83 Score=21.02 Aligned_cols=127 Identities=17% Similarity=0.248 Sum_probs=65.2 Q ss_pred CCcccchHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHh-----CCcccchhcccceeccCCccccccccCCCCc Q lcl|NC_021331. 1 MAKNYTIREFHGNIDAWINAVDS-GLKDCVELFAEKVHTDLVKR-----SPVDTGRYRGNWQVTANKPPLYALNQYDKHG 74 (147) Q Consensus 1 MAk~~s~~~F~~~i~~f~~~v~~-~~~~~~r~~a~~l~~~vv~~-----tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G 74 (147) |.. +..+...|...++++.. ....++++|+..+.+....+ .| | |. -|..- ++.. ....| T Consensus 1 m~~---~~~l~~~L~~ll~~l~~~~~~~l~r~Ig~~l~~st~~Rf~~q~~P-D-G~---~W~p~--s~~~-----~~~~g 65 (148) T protein:vir:79 1 MSE---SRELEAWLAGMLTKLDAPARRMLARAVAAELRRRQAARIAEQRNP-D-GS---PYVPR--KPQL-----RHRAG 65 (148) T ss_pred Ccc---HHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCC-C-CC---cCccc--chHH-----Hhhcc Confidence 774 56666777777766543 23457788888887777653 44 2 21 13210 0000 00001 Q ss_pred c--hhhhh---HHHHHHHHHhcccccceEEEecCchhhhhhhcCCCC----------CCCchHHHHHHHHHHHHHHHHHH Q lcl|NC_021331. 75 D--KTIAE---GKRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSK----------QAPAGVLGIVAVKLRSYMAEAIK 139 (147) Q Consensus 75 ~--~t~~~---~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~----------QAp~G~V~~a~~~~~~~v~~a~~ 139 (147) . ..+.. 
....+ ..........+-|+..|++||..-.||-.. -...-|+.++-.....|.+-+.. T Consensus 66 ~~~~~~~~~l~~~~~l-~~~~~~~~~~v~~~Gt~~~yAaiHQfG~~~r~~~~~~~v~iPaRp~LG~s~~d~~~i~~~i~~ 144 (148) T protein:vir:79 66 RIRRAMFMRLRLARYM-KTQADANTAVVTFAGNAQRIATVHQFGLRDRVNKAGLTAQYPARELLGMDGVDMEHITNLLLL 144 (148) T ss_pred cccccccchhhhhhhe-eeeeeCCeeeEEeeccchhhhhhhhcCccccccCCCCccccCcccccCCCHHHHHHHHHHHHH Confidence 0 00000 00000 000111112233468999999999999543 23345667777777777777777 Q ss_pred HHHh Q lcl|NC_021331. 140 ESRA 143 (147) Q Consensus 140 e~k~ 143 (147) -+-+ T Consensus 145 ~l~~ 148 (148) T protein:vir:79 145 HLGA 148 (148) T ss_pred HhcC Confidence 7777 No 145 >protein:vir:98557 Length: 149 # NCBI annotation: gp14 # Family: family:all:370 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958069;genbank:gi:41057366;genbank:GeneID:2744228 Probab=42.66 E-value=0.88 Score=20.90 Aligned_cols=130 Identities=13% Similarity=0.208 Sum_probs=60.3 Q ss_pred CCcccchHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHh-----CCcccchhcccceeccCCccccccccCCCCc Q lcl|NC_021331. 1 MAKNYTIREFHGNIDAWINAVD-SGLKDCVELFAEKVHTDLVKR-----SPVDTGRYRGNWQVTANKPPLYALNQYDKHG 74 (147) Q Consensus 1 MAk~~s~~~F~~~i~~f~~~v~-~~~~~~~r~~a~~l~~~vv~~-----tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G 74 (147) |.+ +..+...|.+...+++ ......+++|+..+......+ +|- |. .|... .+.+.... ...+ T Consensus 1 m~d---~~~l~~~L~~ll~~L~~~~~~~ll~~Ig~~l~~~t~~rf~~q~~Pd--G~---~W~p~--~~~~~~~k--~~~~ 68 (149) T protein:vir:98 1 MSE---LTALQERLTGLIASLSPAARRQMAADIAKKLRASQQQRIRRQQAPD--GT---PYAAR--KRQSVRSK--KGRI 68 (149) T ss_pred Cch---HHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhhcCCC--CC---CCccc--chHHHHhc--cCCC Confidence 653 5566666776666553 345567888888877766553 452 21 35433 11111000 0000 Q ss_pred chhhhhHHHHHHHHHhcccccc--eE-EEecCchhhhhhhcCCCCC----------CCchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021331. 
75 DKTIAEGKRAIYAILRGGGAVR--AI-YFSNMLIYANALEYGHSKQ----------APAGVLGIVAVKLRSYMAEAIKES 141 (147) Q Consensus 75 ~~t~~~~~~~i~~~~~~~~~g~--~i-yi~Nn~pYA~~LEyG~s~Q----------Ap~G~V~~a~~~~~~~v~~a~~e~ 141 (147) ...+-.... ....+...-..+ +| |+..|.+||..-.||-..+ ....|+.++-+....|.+-+.+-+ T Consensus 69 ~~~l~~~g~-l~~sl~~~~~~~~~~V~~~Gs~~~yAa~HQfG~~~r~~~~~~~~~iPaRp~LG~s~~d~~~i~~~i~~~l 147 (149) T protein:vir:98 69 RREMFARLR-TNRFMKAKGSDSAAVVEFTGRVQRMARVHQYGLKDRPNRHSRDVQYAARPLLGFTRDDEQMIEDIIIRHL 147 (149) T ss_pred Ccccchhhh-hhhhhhheecCCeeEEEecCcchHHhhHhhccccccccCCCcceeccccccCCCCHHHHHHHHHHHHHHh Confidence 011100000 001111111122 23 4589999999999996532 233455555555444444333333 Q ss_pred Hh Q lcl|NC_021331. 142 RA 143 (147) Q Consensus 142 k~ 143 (147) .- T Consensus 148 ~~ 149 (149) T protein:vir:98 148 GK 149 (149) T ss_pred hC Confidence 22 No 146 >protein:vir:6071 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878212;genbank:gi:33438911;genbank:GeneID:1457746 Probab=39.98 E-value=0.99 Score=20.60 Aligned_cols=130 Identities=12% Similarity=0.191 Sum_probs=61.1 Q ss_pred CCcccchHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHh-----CCcccchhcccceeccCCccccccccCCCCc Q lcl|NC_021331. 1 MAKNYTIREFHGNIDAWINAVD-SGLKDCVELFAEKVHTDLVKR-----SPVDTGRYRGNWQVTANKPPLYALNQYDKHG 74 (147) Q Consensus 1 MAk~~s~~~F~~~i~~f~~~v~-~~~~~~~r~~a~~l~~~vv~~-----tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G 74 (147) |-. +..+...|.+..+++. ......+++|+..+.+....+ .| | |. .|... .+.+.. .....+ T Consensus 1 ~~~---~~~l~~~L~~~l~~L~~~~~~~l~r~Ig~~l~~~~~~Rf~~q~~P-d-G~---~W~p~--~~~~~~--~k~~~~ 68 (150) T protein:vir:60 1 MNE---FKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAP-D-GT---PYAPR--QQQSAR--KKTGRV 68 (150) T ss_pred Cch---HHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCC-C-CC---CCccc--ChHHHH--HhhcCC Confidence 432 3444445555555442 334557888888877776552 44 2 21 24322 111000 000001 Q ss_pred chhhhhHHHHHHHHHh--cccccceEE--EecCchhhhhhhcCCCCC----------CCchHHHHHHHHHHHHHHHHHHH Q lcl|NC_021331. 
75 DKTIAEGKRAIYAILR--GGGAVRAIY--FSNMLIYANALEYGHSKQ----------APAGVLGIVAVKLRSYMAEAIKE 140 (147) Q Consensus 75 ~~t~~~~~~~i~~~~~--~~~~g~~iy--i~Nn~pYA~~LEyG~s~Q----------Ap~G~V~~a~~~~~~~v~~a~~e 140 (147) ...+-... .....+. ....+-+|. +..|.+||..-.||-+.+ ....|+.++-+..+.|.+.+..- T Consensus 69 ~~~l~~~~-~l~~sl~~~~~~~~a~vg~~~Gt~~~yAaiHQfG~~~~~~~~~~~~~iPaRp~LG~s~~d~~~i~~~i~~~ 147 (150) T protein:vir:60 69 KRKMFAKL-ITSRFLHIRASPEQASMEFYGGKSPKIASVHQFGLSEENRKDGKKIDYPARPLLGFTGEDVQMIEEIILAH 147 (150) T ss_pred Cccchhhh-hhcceeeeeeeCcEEEEEeeCCCchhhhhhhhccccccccCCCCceecCCcccCCCCHHHHHHHHHHHHHH Confidence 11110000 0000000 011122343 488999999999996532 34467777777777776666665 Q ss_pred HHh Q lcl|NC_021331. 141 SRA 143 (147) Q Consensus 141 ~k~ 143 (147) +.- T Consensus 148 l~r 150 (150) T protein:vir:60 148 LDR 150 (150) T ss_pred HhC Confidence 533 No 147 >protein:vir:5703 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839862;genbank:gi:30065717;genbank:GeneID:1260611 Probab=36.26 E-value=1.2 Score=20.18 Aligned_cols=130 Identities=12% Similarity=0.169 Sum_probs=59.0 Q ss_pred CCcccchHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHh-----CCcccchhcccceeccCCccccccccCCCCc Q lcl|NC_021331. 1 MAKNYTIREFHGNIDAWINAVD-SGLKDCVELFAEKVHTDLVKR-----SPVDTGRYRGNWQVTANKPPLYALNQYDKHG 74 (147) Q Consensus 1 MAk~~s~~~F~~~i~~f~~~v~-~~~~~~~r~~a~~l~~~vv~~-----tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G 74 (147) |-. +..+...|.+..+++. ......+++|+..+......+ .| | |. -|... .+.+... 
.-..+ T Consensus 1 m~~---~~~l~~~L~~~l~~L~~~~~~~l~~~Ig~~l~~~~~~rf~~q~~P-d-G~---~W~p~--k~~~~~~--k~~~~ 68 (150) T protein:vir:57 1 MNE---FKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAP-D-GT---PYAPR--QQQSARK--KTGRV 68 (150) T ss_pred Cch---HHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCC-C-CC---CCccc--ChHHHHH--hccCC Confidence 432 3334444444444432 334557888888877776552 44 2 21 24321 1110000 00001 Q ss_pred chhhhhHHHHHHHHHhc--ccccceEE--EecCchhhhhhhcCCCCC----------CCchHHHHHHHHHHHHHHHHHHH Q lcl|NC_021331. 75 DKTIAEGKRAIYAILRG--GGAVRAIY--FSNMLIYANALEYGHSKQ----------APAGVLGIVAVKLRSYMAEAIKE 140 (147) Q Consensus 75 ~~t~~~~~~~i~~~~~~--~~~g~~iy--i~Nn~pYA~~LEyG~s~Q----------Ap~G~V~~a~~~~~~~v~~a~~e 140 (147) ...+-... .....+.. ...+-+|. +..|.+||..-.||-+.+ ....|+.++-+....|.+-+..- T Consensus 69 ~~~l~~~~-~l~~sl~~~~~~~~a~vg~~~G~~~~yAaiHQfG~~~r~~~~~~~~~iPaRp~LG~s~~d~~~i~~~i~~~ 147 (150) T protein:vir:57 69 KRKMFAKL-ITSRFLHIRASPEQASMEFYGGKSPKIASVHQFGLSEETRKDGKKIDYPARPLLGFTGEDVQMIEEIILAH 147 (150) T ss_pred Ccccchhh-hhccceeeeeeCcEEEEEeecCCchhhhhhhhccccccccCCCceeecCCcccCCCCHHHHHHHHHHHHHH Confidence 11110000 00000000 11122343 378999999999996542 44466777777766665555555 Q ss_pred HHh Q lcl|NC_021331. 141 SRA 143 (147) Q Consensus 141 ~k~ 143 (147) +.- T Consensus 148 l~r 150 (150) T protein:vir:57 148 LDR 150 (150) T ss_pred HhC Confidence 533 No 148 >protein:vir:4230 Length: 111 # NCBI annotation: predicted 12.0Kd protein # Family: family:all:2819 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039685;swissprot:sw:q05227;genbank:gi:9625451;uniprot:Q05227;genbank:GeneID:2942925 Probab=34.01 E-value=0.6 Score=21.80 Aligned_cols=101 Identities=14% Similarity=0.104 Sum_probs=37.8 Q ss_pred CCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhhh Q lcl|NC_021331. 
1 MAKNYTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIAE 80 (147) Q Consensus 1 MAk~~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~~ 80 (147) |||=+--.+ +-+.+ .-.++..+..-.+++. ||.|+|.--.-.+---....-.+.++++ T Consensus 1 makvyanaN--~v~a~-~~~~k~avr~E~~~v~---------------~RAraNLA~a~astri~~~g~~p~~it~---- 58 (111) T protein:vir:42 1 MAKVYANAN--KVAAR-YVETRDAVRDERNKVT---------------RRAKANLARQNSTTRITDEGYFPATITE---- 58 (111) T ss_pred Ccceecchh--hhhhh-chhHHHHHHHHHhhhh---------------hhHHHhHHHhhhccccccccccCceeec---- Confidence 999553222 11111 1223333333333333 4444443110000000000000111111 Q ss_pred HHHHHHHHHhcccccce-EEEecCchhhhhhhcCCC---------CCCCchHHHHHHHHHHHHHH Q lcl|NC_021331. 81 GKRAIYAILRGGGAVRA-IYFSNMLIYANALEYGHS---------KQAPAGVLGIVAVKLRSYMA 135 (147) Q Consensus 81 ~~~~i~~~~~~~~~g~~-iyi~Nn~pYA~~LEyG~s---------~QAp~G~V~~a~~~~~~~v~ 135 (147) -.||+ .|+-=..|-+..|||||- +.+|.|-|=++-.-+.--+. T Consensus 59 ------------~~gdvD~~~~l~APnamAiEfGH~PSG~F~g~dTKaPe~~YILt~AAiggt~~ 111 (111) T protein:vir:42 59 ------------QDGDVDFHTILNAPNALALEFGHAPSGFFAGTDTKPPEATYILTRAAIGGTVS 111 (111) T ss_pred ------------ccCCcceEEEecCCChhhhhcccCCcceecccccCCCCceeeeeccccccccC Confidence 12343 555567999999999982 14444444333222211111 No 149 >protein:vir:94069 Length: 168 # NCBI annotation: putative RNA polymerase # Family: family:all:503 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453622;genbank:gi:84662658;genbank:GeneID:5142579 Probab=33.80 E-value=0.88 Score=20.88 Aligned_cols=92 Identities=16% Similarity=0.056 Sum_probs=36.1 Q ss_pred hCCcccchhc---------ccceeccCCccccccccCCCCcchhhhhHHHHHHHHHhcccccceEEEecCchhhhhhhcC Q lcl|NC_021331. 43 RSPVDTGRYR---------GNWQVTANKPPLYALNQYDKHGDKTIAEGKRAIYAILRGGGAVRAIYFSNMLIYANALEYG 113 (147) Q Consensus 43 ~tPVdtG~~R---------~nw~vs~~~~~~~~~~~~d~~G~~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG 113 (147) .+++.++.+- ++-.+-++.++.. .-|.|....... ... ...++. 
=.+.+-+|.++||| T Consensus 1 ~~~~~~~g~~~~~~~~~~l~~~~v~vG~l~~a----~yp~G~~~~~~~------~~~-~~~~~~--g~~va~Ia~~~E~G 67 (168) T protein:vir:94 1 MTTIARKGVKMPPHLEAQFQSGEVKAGVLSGS----TYPQMTYTDQRT------GKQ-IEDARG--GMPVAVIAQALEYG 67 (168) T ss_pred CccccchhhhhhHHHHHhhhccceeeeccccC----cccccccchhhc------ccc-cccccc--cccHHHHHHHHhcC Confidence 3444332221 1122222222211 001111000000 000 000000 01345667899999 Q ss_pred CCCCCCchHHHHHHHHHHHHHHHHHHH-HHhhh----cC Q lcl|NC_021331. 114 HSKQAPAGVLGIVAVKLRSYMAEAIKE-SRAKN----AL 147 (147) Q Consensus 114 ~s~QAp~G~V~~a~~~~~~~v~~a~~e-~k~~~----~~ 147 (147) +..-.|..|+|.++.+-++-+.+.+.. +++.. +| T Consensus 68 ~~~IP~RPFlr~t~~~~~~~~~~~~~~~~~~~~~~~~~L 106 (168) T protein:vir:94 68 HGQNHPRPFMQQTYAAQYRAWSRDLTLTLKAGAAADTAL 106 (168) T ss_pred CCCCCCchhhHHHHHHHHHHHHHHHHHHHhcCCCHHHHH Confidence 999999999999987443332222222 13211 11 No 150 >protein:vir:107757 Length: 189 # NCBI annotation: gp20 # Family: family:all:503 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024868;genbank:gi:48697510;genbank:GeneID:2948378 Probab=33.25 E-value=1.4 Score=19.84 Aligned_cols=86 Identities=13% Similarity=0.076 Sum_probs=41.9 Q ss_pred CCcccch-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhh Q lcl|NC_021331. 1 MAKNYTI-REFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIA 79 (147) Q Consensus 1 MAk~~s~-~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~ 79 (147) |...+.- ..+.+.|.+ .++ ++ ++..|.++-|+.. ..|+|. T Consensus 1 M~~~i~~~~~~~~~L~~-----------~lk--------~l------------~~k~V~VGi~~~~----~y~dG~---- 41 (189) T protein:vir:10 1 MGRVIRKQGPARVKLNA-----------FIK--------GM------------NDYSVRIGWFSTA----KYPDGT---- 41 (189) T ss_pred CcceeccCcHHHHHHHH-----------HHH--------Hh------------hCCeEEEEecCCC----CCCCcc---- Confidence 8875532 112222221 111 11 2344555544332 122221 Q ss_pred hHHHHHHHHHhcccccceEEEecCchhhhhhhcCC--CCCCCchHHHHHHHH----HHHHHHHHHHHH-Hhhh----cC Q lcl|NC_021331. 
80 EGKRAIYAILRGGGAVRAIYFSNMLIYANALEYGH--SKQAPAGVLGIVAVK----LRSYMAEAIKES-RAKN----AL 147 (147) Q Consensus 80 ~~~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~--s~QAp~G~V~~a~~~----~~~~v~~a~~e~-k~~~----~~ 147 (147) .++-.|...|||+ .+-.|..|+|.++.+ |.+.+.+.+..+ ++.. .| T Consensus 42 ----------------------~vA~Ia~~~E~G~p~~~IP~RPFlr~t~~~~~~~~~~~l~~~~~~vl~G~~~~~~~L 98 (189) T protein:vir:10 42 ----------------------PTAYVASIHEFGAPSRGIPARSFIRPTIAAQQAAWSQQMRFYAKQIVVGQMNVEQAL 98 (189) T ss_pred ----------------------cHHHHHHHHHhcCcCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHH Confidence 1355677889998 446789999999974 445455455432 2222 22 No 151 >protein:vir:78335 Length: 133 # NCBI annotation: gp9 # Family: family:all:589 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468648;genbank:gi:157325225;genbank:GeneID:5601681 Probab=30.64 E-value=1.6 Score=19.52 Aligned_cols=116 Identities=19% Similarity=0.127 Sum_probs=70.0 Q ss_pred CCccc-chHHHHHHHHH-HHH-HHHHHHHHHHHHHHHHHHHHHHH--hCCcccchhcccceeccCCccccccccCCCCcc Q lcl|NC_021331. 1 MAKNY-TIREFHGNIDA-WIN-AVDSGLKDCVELFAEKVHTDLVK--RSPVDTGRYRGNWQVTANKPPLYALNQYDKHGD 75 (147) Q Consensus 1 MAk~~-s~~~F~~~i~~-f~~-~v~~~~~~~~r~~a~~l~~~vv~--~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~ 75 (147) |.-.+ .+.+..++|++ |-+ ++....+..+++.+..+...++. .+.-|||..-..-.+| .|-. .+|. T Consensus 1 msvevkGv~eilk~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s--~p~~-------~~G~ 71 (133) T protein:vir:78 1 MSVEVTGVEELERQLVSLFGRENLPQLVDPALIAGATLVAKTLKSEFVQFKDTGASIDEINIE--KPSY-------DKGV 71 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHHhhcchhcccceeeeEEec--Ceee-------eCCc Confidence 87654 46777777775 543 47788888999999999999888 4667999988776654 2211 1111 Q ss_pred hhhhhHHHHHHHHHhcccccceEEEecCch----hhhhh-hcCCC------CCCCchHHHHHHHHHHH-HHHHHHHHHHh Q lcl|NC_021331. 76 KTIAEGKRAIYAILRGGGAVRAIYFSNMLI----YANAL-EYGHS------KQAPAGVLGIVAVKLRS-YMAEAIKESRA 143 (147) Q Consensus 76 ~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~p----YA~~L-EyG~s------~QAp~G~V~~a~~~~~~-~v~~a~~e~k~ 143 (147) . 
+|-|...-| +...| ||||- ...+-|.++-++..-.. +++..-.|++. T Consensus 72 r--------------------~V~i~W~gp~~R~~iVHLNE~GYtr~Gk~i~PrG~G~i~~a~~~se~~y~~~vk~el~k 131 (133) T protein:vir:78 72 R--------------------SIKIDWKGPKDRYKIIHLNEYGYTRNGKKITPAGTGSVARSLRISERAYRAIVQKKIGD 131 (133) T ss_pred e--------------------EEEEEEecCCCceeEEEeeccceecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHHHh Confidence 1 233333333 12233 77772 13334667777766643 34444456666 Q ss_pred hh Q lcl|NC_021331. 144 KN 145 (147) Q Consensus 144 ~~ 145 (147) .+ T Consensus 132 ~l 133 (133) T protein:vir:78 132 KL 133 (133) T ss_pred hC Confidence 66 No 152 >protein:vir:78607 Length: 155 # NCBI annotation: BcepNY3gp06 # Family: family:all:503 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294843;genbank:gi:149882906;genbank:GeneID:5291078 Probab=29.94 E-value=1.6 Score=19.44 Aligned_cols=96 Identities=13% Similarity=-0.011 Sum_probs=40.7 Q ss_pred HHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhhhHHHHHHHHHhcc-cccceEEEecCc Q lcl|NC_021331. 26 KDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIAEGKRAIYAILRGG-GAVRAIYFSNML 104 (147) Q Consensus 26 ~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~~~~~~i~~~~~~~-~~g~~iyi~Nn~ 104 (147) .++.|+-...++.++.. . .|.++-+.... ..|.+|..- +.+++... ..+. ++.. T Consensus 1 m~v~~k~L~~~~~~l~~------~------~v~VGi~~~a~--y~d~~~~~~-------~~~~~~~~~~~~g----~~va 55 (155) T protein:vir:78 1 MSVTRRGLTLPKDRYRS------M------SVKAGVLAGAT--YPDESGKKL-------ADGTILTKDPRAG----LPVA 55 (155) T ss_pred CcchHHHHHHHHHHHhC------C------eeEEeecCCCC--CCcccchhh-------hhhhhcccccccC----CcHH Confidence 22233322333333321 1 22233332211 112222111 00000000 0000 1334 Q ss_pred hhhhhhhcCCCCCCCchHHHHHHHHHHHHHHHHHHHHHhhhcC Q lcl|NC_021331. 
105 IYANALEYGHSKQAPAGVLGIVAVKLRSYMAEAIKESRAKNAL 147 (147) Q Consensus 105 pYA~~LEyG~s~QAp~G~V~~a~~~~~~~v~~a~~e~k~~~~~ 147 (147) -+|.+.|||+..-.|..|+|.++.+.+.-..+.+..+ .+.++ T Consensus 56 ~ia~~~E~G~~~IP~RPFlr~t~~~~~~~~~~~l~~~-~~~~~ 97 (155) T protein:vir:78 56 MIAMALNYGTSKLPARPFMEKTITDRSAEWIKGLTVM-MTMGY 97 (155) T ss_pred HHHHhhhcCCCCCCCcchhhHHHHHHHHHHHHHHHHH-HHcCC Confidence 4667899999999999999999987655444444332 11233 No 153 >protein:vir:106728 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944315;genbank:gi:38638614;genbank:GeneID:2657357 Probab=28.39 E-value=1.8 Score=19.25 Aligned_cols=97 Identities=13% Similarity=-0.017 Sum_probs=40.5 Q ss_pred HHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhhhHHHHHHHHHhcccccceEEEecCch Q lcl|NC_021331. 26 KDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIAEGKRAIYAILRGGGAVRAIYFSNMLI 105 (147) Q Consensus 26 ~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~p 105 (147) .++.|+-...++.++.. . .|.++-+.... ..|.+|..- +.++... +.+. -=++..- T Consensus 1 m~v~~k~L~~~~~~l~~------~------~v~VGi~~~a~--y~d~~~~~~-------~~~~~~~-~~~~--~g~~va~ 56 (155) T protein:vir:10 1 MSVTRRGLTLPKDRYRS------M------SVKAGVLAGAT--YPDESGKKL-------ADGTILT-KDPR--AGLPVAM 56 (155) T ss_pred CcchHHHHHHHHHHHhC------C------eeEEeecCCCC--Cccccchhh-------hhhhhcc-cccc--cCCcHHH Confidence 22233322333333321 1 12233332211 112222111 0000000 0000 0013444 Q ss_pred hhhhhhcCCCCCCCchHHHHHHHHHHHHHHHHHHHHHhhhcC Q lcl|NC_021331. 
106 YANALEYGHSKQAPAGVLGIVAVKLRSYMAEAIKESRAKNAL 147 (147) Q Consensus 106 YA~~LEyG~s~QAp~G~V~~a~~~~~~~v~~a~~e~k~~~~~ 147 (147) +|.+.|||+..-.|..|+|.++.+.++-+.+.+..+ .+.++ T Consensus 57 ia~~~E~G~~~IP~RPFlr~t~~~~~~~~~~~l~~~-~~~~~ 97 (155) T protein:vir:10 57 IAMALNYGTSKLPARPFMEKTIADRSAEWIKGLTVM-MTMGY 97 (155) T ss_pred HHHHHhcCCCCCCCcchhHHHHHHHHHHHHHHHHHH-HHcCC Confidence 667899999999999999999986655444443332 11233 No 154 >protein:vir:1838 Length: 149 # NCBI annotation: O protein # Family: family:all:370 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052262;genbank:gi:9634069;genbank:GeneID:1262457 Probab=26.78 E-value=1.9 Score=19.04 Aligned_cols=129 Identities=15% Similarity=0.204 Sum_probs=58.5 Q ss_pred CCcccchHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHh-----CCcccchhcccceeccCCccccccccCCCCc Q lcl|NC_021331. 1 MAKNYTIREFHGNIDAWINAVD-SGLKDCVELFAEKVHTDLVKR-----SPVDTGRYRGNWQVTANKPPLYALNQYDKHG 74 (147) Q Consensus 1 MAk~~s~~~F~~~i~~f~~~v~-~~~~~~~r~~a~~l~~~vv~~-----tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G 74 (147) |-. +..+...|.+....+. .....++++|+..+......+ +| | |. .|.... +.+.. .+.| T Consensus 1 m~~---~~~~~~~l~~ll~~L~~~~~~~l~r~Ig~~l~~~t~~rf~~q~~P-d-G~---~W~p~~--~~~~~----~~~g 66 (149) T protein:vir:18 1 MSE---LTALQERLAGLIASLSPAARRKMAAEIAKKLRTSQQQRIKRQQAP-D-GT---PYAARK--RQPVR----SKKG 66 (149) T ss_pred Cch---HHHHHHHHHHHHHhcCCchHHHHHHHHHHHHHHHHHHHHHhhcCC-C-CC---CCcccc--hhhhh----hccC Confidence 542 3444444444444432 223446788888777766553 55 2 22 354321 11110 0111 Q ss_pred ch--hh-hhH-HHHHHHHHhcccccceEEEecCchhhhhhhcCCCCC----------CCchHHHHHHHHHHHHHHHHHHH Q lcl|NC_021331. 75 DK--TI-AEG-KRAIYAILRGGGAVRAIYFSNMLIYANALEYGHSKQ----------APAGVLGIVAVKLRSYMAEAIKE 140 (147) Q Consensus 75 ~~--t~-~~~-~~~i~~~~~~~~~g~~iyi~Nn~pYA~~LEyG~s~Q----------Ap~G~V~~a~~~~~~~v~~a~~e 140 (147) .. 
.+ ..+ .+.............+.|+..|.+||.--.||-..+ ....|..++-+....|.+.+.+- T Consensus 67 ~~~~~~~~~l~~~~~l~~~~~~~~~~v~~~Gtn~~yAaiHQfG~~~r~~~~~~~v~iPaRp~LG~s~~d~~~I~~~i~~~ 146 (149) T protein:vir:18 67 RIKREMFAKLRTSRFMKAKGSDSAAVVEFTGKVQRMARVHQYGLKDRPNRNSRDVQYEARPLLGFTRDDEQMIEDVIISH 146 (149) T ss_pred cccchhhhhhhhhhhhheeecCceeEEEecccchhhhhhhhccccccccCCCccccccccccCCCCHHHHHHHHHHHHHH Confidence 10 00 000 000111111112222345789999999999997632 23345556666555555544444 Q ss_pred HHh Q lcl|NC_021331. 141 SRA 143 (147) Q Consensus 141 ~k~ 143 (147) +.- T Consensus 147 l~~ 149 (149) T protein:vir:18 147 LGK 149 (149) T ss_pred HhC Confidence 432 No 155 >protein:vir:96973 Length: 133 # NCBI annotation: ORF034 # Family: family:all:589 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239864;genbank:gi:66395542;genbank:GeneID:5133006 Probab=25.04 E-value=2.1 Score=18.81 Aligned_cols=116 Identities=20% Similarity=0.147 Sum_probs=66.8 Q ss_pred CCccc-chHHHHHHHHH-HHH-HHHHHHHHHHHHHHHHHHHHHHHhCCc--ccchhcccceeccCCccccccccCCCCcc Q lcl|NC_021331. 1 MAKNY-TIREFHGNIDA-WIN-AVDSGLKDCVELFAEKVHTDLVKRSPV--DTGRYRGNWQVTANKPPLYALNQYDKHGD 75 (147) Q Consensus 1 MAk~~-s~~~F~~~i~~-f~~-~v~~~~~~~~r~~a~~l~~~vv~~tPV--dtG~~R~nw~vs~~~~~~~~~~~~d~~G~ 75 (147) |.-.+ -+.+..++|++ |-+ ++....+..+++.+..+...++....+ |||..-..-.+| .|-+. .|. T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s--~p~~~-------~g~ 71 (133) T protein:vir:96 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKS--KPYTK-------VGS 71 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEec--Ceeec-------cCC Confidence 87654 35677777775 544 478888999999999999999998774 999988776554 23210 010 Q ss_pred hhhhhHHHHHHHHHhcccccceEEEecCch----hhhhh-hcCCCC------CCCchHHHHHHHHHHHHHHH-HHHHHHh Q lcl|NC_021331. 
76 KTIAEGKRAIYAILRGGGAVRAIYFSNMLI----YANAL-EYGHSK------QAPAGVLGIVAVKLRSYMAE-AIKESRA 143 (147) Q Consensus 76 ~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~p----YA~~L-EyG~s~------QAp~G~V~~a~~~~~~~v~~-a~~e~k~ 143 (147) . --+|-|...-| +...| ||||-. ..+-|.++-++..-...+-+ .-.|++- T Consensus 72 ~------------------~rtV~i~W~gp~~R~~iVHLNE~Gytr~Gk~i~PrG~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:96 72 Q------------------ERAVLIEWVGPMNRKNIIHLNEHGYTRDGKKYTPRGFGVIAKTLAASERKYREIIKKELAR 133 (133) T ss_pred c------------------ceeEEEEeecCCCceeEEEeeccceecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 0 01223333332 12233 777621 33346677777665444333 2334433 No 156 >protein:vir:78644 Length: 133 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429946;genbank:gi:156604000;genbank:GeneID:5525390 Probab=25.04 E-value=2.1 Score=18.81 Aligned_cols=116 Identities=20% Similarity=0.147 Sum_probs=66.8 Q ss_pred CCccc-chHHHHHHHHH-HHH-HHHHHHHHHHHHHHHHHHHHHHHhCCc--ccchhcccceeccCCccccccccCCCCcc Q lcl|NC_021331. 1 MAKNY-TIREFHGNIDA-WIN-AVDSGLKDCVELFAEKVHTDLVKRSPV--DTGRYRGNWQVTANKPPLYALNQYDKHGD 75 (147) Q Consensus 1 MAk~~-s~~~F~~~i~~-f~~-~v~~~~~~~~r~~a~~l~~~vv~~tPV--dtG~~R~nw~vs~~~~~~~~~~~~d~~G~ 75 (147) |.-.+ -+.+..++|++ |-+ ++....+..+++.+..+...++....+ |||..-..-.+| .|-+. .|. T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s--~p~~~-------~g~ 71 (133) T protein:vir:78 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKS--KPYTK-------VGS 71 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEec--Ceeec-------cCC Confidence 87654 35677777775 544 478888999999999999999998774 999988776554 23210 010 Q ss_pred hhhhhHHHHHHHHHhcccccceEEEecCch----hhhhh-hcCCCC------CCCchHHHHHHHHHHHHHHH-HHHHHHh Q lcl|NC_021331. 
76 KTIAEGKRAIYAILRGGGAVRAIYFSNMLI----YANAL-EYGHSK------QAPAGVLGIVAVKLRSYMAE-AIKESRA 143 (147) Q Consensus 76 ~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~p----YA~~L-EyG~s~------QAp~G~V~~a~~~~~~~v~~-a~~e~k~ 143 (147) . --+|-|...-| +...| ||||-. ..+-|.++-++..-...+-+ .-.|++- T Consensus 72 ~------------------~rtV~i~W~gp~~R~~iVHLNE~Gytr~Gk~i~PrG~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:78 72 Q------------------ERAVLIEWVGPMNRKNIIHLNEHGYTRDGKKYTPRGFGVIAKTLAASERKYREIIKKELAR 133 (133) T ss_pred c------------------ceeEEEEeecCCCceeEEEeeccceecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 0 01223333332 12233 777621 33346677777665444333 2334433 No 157 >protein:vir:9363 Length: 133 # NCBI annotation: SLT orf 123-like protein # Family: family:all:589 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803341;genbank:gi:29028652;genbank:GeneID:1258087 Probab=25.04 E-value=2.1 Score=18.81 Aligned_cols=116 Identities=20% Similarity=0.147 Sum_probs=66.8 Q ss_pred CCccc-chHHHHHHHHH-HHH-HHHHHHHHHHHHHHHHHHHHHHHhCCc--ccchhcccceeccCCccccccccCCCCcc Q lcl|NC_021331. 1 MAKNY-TIREFHGNIDA-WIN-AVDSGLKDCVELFAEKVHTDLVKRSPV--DTGRYRGNWQVTANKPPLYALNQYDKHGD 75 (147) Q Consensus 1 MAk~~-s~~~F~~~i~~-f~~-~v~~~~~~~~r~~a~~l~~~vv~~tPV--dtG~~R~nw~vs~~~~~~~~~~~~d~~G~ 75 (147) |.-.+ -+.+..++|++ |-+ ++....+..+++.+..+...++....+ |||..-..-.+| .|-+. .|. T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s--~p~~~-------~g~ 71 (133) T protein:vir:93 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKS--KPYTK-------VGS 71 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEec--Ceeec-------cCC Confidence 87654 35677777775 544 478888999999999999999998774 999988776554 23210 010 Q ss_pred hhhhhHHHHHHHHHhcccccceEEEecCch----hhhhh-hcCCCC------CCCchHHHHHHHHHHHHHHH-HHHHHHh Q lcl|NC_021331. 
76 KTIAEGKRAIYAILRGGGAVRAIYFSNMLI----YANAL-EYGHSK------QAPAGVLGIVAVKLRSYMAE-AIKESRA 143 (147) Q Consensus 76 ~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~p----YA~~L-EyG~s~------QAp~G~V~~a~~~~~~~v~~-a~~e~k~ 143 (147) . --+|-|...-| +...| ||||-. ..+-|.++-++..-...+-+ .-.|++- T Consensus 72 ~------------------~rtV~i~W~gp~~R~~iVHLNE~Gytr~Gk~i~PrG~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:93 72 Q------------------ERAVLIEWVGPMNRKNIIHLNEHGYTRDGKKYTPRGFGVIAKTLAASERKYREIIKKELAR 133 (133) T ss_pred c------------------ceeEEEEeecCCCceeEEEeeccceecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 0 01223333332 12233 777621 33346677777665444333 2334433 No 158 >protein:vir:94419 Length: 133 # NCBI annotation: ORF028 # Family: family:all:589 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240010;genbank:gi:66395683;genbank:GeneID:5133079 Probab=25.04 E-value=2.1 Score=18.81 Aligned_cols=116 Identities=20% Similarity=0.147 Sum_probs=66.8 Q ss_pred CCccc-chHHHHHHHHH-HHH-HHHHHHHHHHHHHHHHHHHHHHHhCCc--ccchhcccceeccCCccccccccCCCCcc Q lcl|NC_021331. 1 MAKNY-TIREFHGNIDA-WIN-AVDSGLKDCVELFAEKVHTDLVKRSPV--DTGRYRGNWQVTANKPPLYALNQYDKHGD 75 (147) Q Consensus 1 MAk~~-s~~~F~~~i~~-f~~-~v~~~~~~~~r~~a~~l~~~vv~~tPV--dtG~~R~nw~vs~~~~~~~~~~~~d~~G~ 75 (147) |.-.+ -+.+..++|++ |-+ ++....+..+++.+..+...++....+ |||..-..-.+| .|-+. .|. T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s--~p~~~-------~g~ 71 (133) T protein:vir:94 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKS--KPYTK-------VGS 71 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEec--Ceeec-------cCC Confidence 87654 35677777775 544 478888999999999999999998774 999988776554 23210 010 Q ss_pred hhhhhHHHHHHHHHhcccccceEEEecCch----hhhhh-hcCCCC------CCCchHHHHHHHHHHHHHHH-HHHHHHh Q lcl|NC_021331. 
76 KTIAEGKRAIYAILRGGGAVRAIYFSNMLI----YANAL-EYGHSK------QAPAGVLGIVAVKLRSYMAE-AIKESRA 143 (147) Q Consensus 76 ~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~p----YA~~L-EyG~s~------QAp~G~V~~a~~~~~~~v~~-a~~e~k~ 143 (147) . --+|-|...-| +...| ||||-. ..+-|.++-++..-...+-+ .-.|++- T Consensus 72 ~------------------~rtV~i~W~gp~~R~~iVHLNE~Gytr~Gk~i~PrG~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:94 72 Q------------------ERAVLIEWVGPMNRKNIIHLNEHGYTRDGKKYTPRGFGVIAKTLAASERKYREIIKKELAR 133 (133) T ss_pred c------------------ceeEEEEeecCCCceeEEEeeccceecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 0 01223333332 12233 777621 33346677777665444333 2334433 No 159 >protein:vir:93898 Length: 133 # NCBI annotation: ORF028 # Family: family:all:589 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239942;genbank:gi:66395616;genbank:GeneID:5130964 Probab=22.44 E-value=2.4 Score=18.45 Aligned_cols=116 Identities=19% Similarity=0.114 Sum_probs=66.8 Q ss_pred CCccc-chHHHHHHHHHH-H-HHHHHHHHHHHHHHHHHHHHHHHHhCCc--ccchhcccceeccCCccccccccCCCCcc Q lcl|NC_021331. 1 MAKNY-TIREFHGNIDAW-I-NAVDSGLKDCVELFAEKVHTDLVKRSPV--DTGRYRGNWQVTANKPPLYALNQYDKHGD 75 (147) Q Consensus 1 MAk~~-s~~~F~~~i~~f-~-~~v~~~~~~~~r~~a~~l~~~vv~~tPV--dtG~~R~nw~vs~~~~~~~~~~~~d~~G~ 75 (147) |.-.+ -+.+..++|++- - .++....+..+++.+..+...++....+ |||..-..-.+| .|-+. .|. T Consensus 1 msvevkGv~eilk~le~k~G~~~~~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s--~p~~~-------~g~ 71 (133) T protein:vir:93 1 MSVEIKGIPEVLKKLESVYGKQSMQAKSDRALNEASEFFIKALKKEFESFKDTGASIEEMTKS--KPYTK-------VGS 71 (133) T ss_pred CeEEEecHHHHHHHHHHhhCHhhhHhhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEec--Ceeec-------cCC Confidence 87654 356777777754 2 4688888999999999999999998774 999988776654 22110 000 Q ss_pred hhhhhHHHHHHHHHhcccccceEEEecCch----hhhhh-hcCCCC------CCCchHHHHHHHHHHHHHHHH-HHHHHh Q lcl|NC_021331. 
76 KTIAEGKRAIYAILRGGGAVRAIYFSNMLI----YANAL-EYGHSK------QAPAGVLGIVAVKLRSYMAEA-IKESRA 143 (147) Q Consensus 76 ~t~~~~~~~i~~~~~~~~~g~~iyi~Nn~p----YA~~L-EyG~s~------QAp~G~V~~a~~~~~~~v~~a-~~e~k~ 143 (147) . --+|-|...-| +...| ||||-. ..+-|.++-++..-...+-+. -.|++- T Consensus 72 ~------------------~rtV~i~W~gp~~R~~iVHLNE~Gytr~Gk~i~PrG~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:93 72 Q------------------ERAVLIEWVGPMNRKNIIHLNEHGYTRDGKKYTPRGFGVIAKTLAANERKYREIIKKELAR 133 (133) T ss_pred c------------------ceEEEEEeecCCCceeEEEeeccceecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 0 01222333332 12233 777621 333466777776664443332 334433 No 160 >protein:vir:2435 Length: 111 # NCBI annotation: gp21 # Family: family:all:2819 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046837;genbank:gi:9630405;genbank:GeneID:1261628 Probab=21.22 E-value=1.5 Score=19.54 Aligned_cols=101 Identities=19% Similarity=0.147 Sum_probs=39.0 Q ss_pred CCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhcccceeccCCccccccccCCCCcchhhhh Q lcl|NC_021331. 1 MAKNYTIREFHGNIDAWINAVDSGLKDCVELFAEKVHTDLVKRSPVDTGRYRGNWQVTANKPPLYALNQYDKHGDKTIAE 80 (147) Q Consensus 1 MAk~~s~~~F~~~i~~f~~~v~~~~~~~~r~~a~~l~~~vv~~tPVdtG~~R~nw~vs~~~~~~~~~~~~d~~G~~t~~~ 80 (147) |||=+.-.+ .+-.-.-.++..+..-.+++..+.-..+.. .|++=++.-. + -+...+.. T Consensus 1 makvyanaN---~v~ahl~~vk~avr~Ea~ev~~RAr~NLA~--------arastri~k~----g-------~~P~~I~~ 58 (111) T protein:vir:24 1 MAKVYANAN---KVAARHVDVRKRVKEERDGVTRRARTNLAR--------ANKTTRITKE----G-------YFPASIEE 58 (111) T ss_pred Ccccccchh---hHhhhchhHHHHHHHHHhhhhhhHHHhHHH--------hhhcceeccc----c-------cCcccccc Confidence 998553222 111111233333333344433333222221 1222222100 0 00001100 Q ss_pred HHHHHHHHHhcccccce-EEEecCchhhhhhhcCCC---------CCCCchHHHHHHHHHHHHHH Q lcl|NC_021331. 81 GKRAIYAILRGGGAVRA-IYFSNMLIYANALEYGHS---------KQAPAGVLGIVAVKLRSYMA 135 (147) Q Consensus 81 ~~~~i~~~~~~~~~g~~-iyi~Nn~pYA~~LEyG~s---------~QAp~G~V~~a~~~~~~~v~ 135 (147) -.||+ .|+-=..|-+..|||||- +.||.|-|-++-.-+.--+. 
T Consensus 59 ------------~~gdvD~~~~l~APnamAiEfGH~PSG~F~g~dTKaP~glYILt~AA~~g~~~ 111 (111) T protein:vir:24 59 ------------VDGDVDFHTVLHAPNAFALEFGHAPSGFFAGTDTKPPDPEYILTRAAIGGTVS 111 (111) T ss_pred ------------ccCCcceEEEecCCChhhhhccCCCcceecccccCCCCCceeeeccccccccC Confidence 11343 555567899999999982 24555544333222211111 Done!