Query         lcl|NC_016654.1_cdsid_YP_005087236.1 [gene=RoPhREQ3_gp44] [protein=hypothetical protein] [protein_id=YP_005087236.1] [location=24247..24573]
Match_columns 108
No_of_seqs    94 out of 97
Neff          6.2
Searched_HMMs 1612
Date          Thu Nov 7 13:15:03 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_44 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_44_vs_rec_db.hhr

 No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
  1 protein:vir:45 Length: 112 # N 99.9 2.8E-27 1.7E-30 166.1 10.0 104 1-107 1-112 (112)
  2 protein:vir:80970 Length: 112 99.9 8.4E-27 5.2E-30 163.5 9.8 104 1-107 1-112 (112)
  3 protein:vir:98892 Length: 108 99.9 1.3E-25 8E-29 157.0 9.2 103 1-105 2-108 (108)
  4 protein:vir:1581 Length: 116 # 99.8 1.1E-24 6.8E-28 152.0 8.5 104 1-104 1-116 (116)
  5 protein:vir:4790 Length: 114 # 99.8 5.9E-24 3.7E-27 147.9 9.7 102 1-108 3-113 (114)
  6 protein:vir:3036 Length: 118 # 99.8 1.8E-22 1.1E-25 139.9 8.4 100 1-108 2-116 (118)
  7 protein:vir:9823 Length: 118 # 99.8 1.8E-22 1.1E-25 139.9 8.4 100 1-108 2-116 (118)
  8 protein:vir:79687 Length: 113 99.8 9.2E-22 5.7E-25 135.9 8.9 102 1-108 1-110 (113)
  9 protein:vir:105773 Length: 131 99.7 1.2E-20 7.2E-24 129.9 5.4 105 1-105 1-131 (131)
 10 protein:vir:78894 Length: 105 99.7 1.5E-20 9.4E-24 129.3 3.8 100 1-105 1-105 (105)
 11 protein:vir:95789 Length: 114 99.6 4.3E-18 2.7E-21 115.8 10.2 105 1-108 1-114 (114)
 12 protein:vir:94538 Length: 125 99.5 3.7E-17 2.3E-20 110.7 9.6 105 1-108 5-123 (125)
 13 protein:vir:3617 Length: 112 # 99.5 2.2E-16 1.4E-19 106.4 9.3 101 1-104 1-112 (112)
 14 protein:vir:5978 Length: 144 # 99.4 4E-16 2.5E-19 105.0 9.8 104 1-108 4-144 (144)
 15 protein:vir:78077 Length: 141 99.4 3.5E-16 2.1E-19 105.3 8.4 107 1-107 2-141 (141)
 16 protein:vir:106570 Length: 182 99.4 9E-16 5.6E-19 103.1 9.9 108 1-108 2-181 (182)
 17 protein:vir:107099 Length: 137 99.4 6.1E-16 3.8E-19 104.0 8.5 100 1-100 1-137 (137)
 18 protein:vir:96121 Length: 137 99.4 1E-15 6.3E-19 102.8 8.8 100 1-100 1-137 (137)
 19 protein:vir:94654 Length: 142 99.4 1.7E-15 1.1E-18 101.5 9.6 103 1-104 4-142 (142)
 20 protein:vir:98409 Length: 108 99.4 2.7E-15 1.7E-18 100.5 9.1 99 3-104 1-108 (108)
 21 protein:vir:105330 Length: 137 99.4 1.6E-15 9.9E-19 101.7 7.7 100 1-100 1-137 (137)
 22 protein:vir:9930 Length: 108 # 99.4 4.6E-15 2.8E-18 99.2 9.8 101 1-105 7-108 (108)
 23 protein:vir:743 Length: 108 # 99.3 4.7E-15 2.9E-18 99.1 9.3 99 3-104 1-108 (108)
 24 protein:vir:94108 Length: 149 99.3 2.5E-15 1.6E-18 100.6 7.7 100 1-100 14-149 (149)
 25 protein:vir:105916 Length: 149 99.3 3E-15 1.9E-18 100.2 7.5 100 1-100 14-149 (149)
 26 protein:vir:95894 Length: 137 99.3 5.6E-15 3.5E-18 98.7 8.9 100 1-100 1-137 (137)
 27 protein:vir:94796 Length: 137 99.3 4.6E-15 2.9E-18 99.2 7.6 100 1-100 1-137 (137)
 28 protein:vir:94490 Length: 137 99.3 7.6E-15 4.7E-18 98.0 8.7 100 1-100 1-137 (137)
 29 protein:vir:93738 Length: 137 99.3 7.6E-15 4.7E-18 98.0 8.7 100 1-100 1-137 (137)
 30 protein:vir:97427 Length: 137 99.3 7.6E-15 4.7E-18 98.0 8.7 100 1-100 1-137 (137)
 31 protein:vir:1243 Length: 116 # 99.3 4.2E-15 2.6E-18 99.4 7.2 88 13-100 1-116 (116)
 32 protein:vir:97327 Length: 116 99.3 4.2E-15 2.6E-18 99.4 7.2 88 13-100 1-116 (116)
 33 protein:vir:96829 Length: 135 99.3 9.3E-15 5.8E-18 97.5 8.9 100 1-100 1-135 (135)
 34 protein:vir:95062 Length: 116 99.3 7.5E-15 4.7E-18 98.0 7.1 88 13-100 1-116 (116)
 35 protein:vir:8669 Length: 142 # 99.2 2.7E-14 1.7E-17 95.0 6.9 101 1-101 2-142 (142)
 36 protein:vir:99101 Length: 142 99.2 2.7E-14 1.7E-17 95.0 6.9 101 1-101 2-142 (142)
 37 protein:vir:99744 Length: 115 99.2 7.9E-14 4.9E-17 92.4 9.1 99 3-104 1-115 (115)
 38 protein:vir:96225 Length: 115 99.2 1.1E-13 6.7E-17 91.7 9.6 99 3-104 1-115 (115)
 39 protein:vir:97144 Length: 115 99.2 1.1E-13 6.7E-17 91.7 9.6 99 3-104 1-115 (115)
 40 protein:vir:78858 Length: 115 99.2 1.1E-13 6.7E-17 91.7 9.6 99 3-104 1-115 (115)
 41 protein:vir:103917 Length: 115 99.2 1.1E-13 6.7E-17 91.7 9.6 99 3-104 1-115 (115)
 42 protein:vir:9312 Length: 115 # 99.2 1.1E-13 6.7E-17 91.7 9.6 99 3-104 1-115 (115)
 43 protein:vir:96358 Length: 115 99.2 1.1E-13 6.7E-17 91.7 9.6 99 3-104 1-115 (115)
 44 protein:vir:2740 Length: 114 # 99.2 1E-13 6.2E-17 91.9 8.7 102 1-105 1-114 (114)
 45 protein:vir:4906 Length: 114 # 99.2 1E-13 6.2E-17 91.9 8.7 102 1-105 1-114 (114)
 46 protein:vir:96486 Length: 112 99.2 1.1E-13 6.9E-17 91.6 8.6 100 1-103 1-112 (112)
 47 protein:vir:106623 Length: 115 99.2 2.6E-13 1.6E-16 89.5 9.5 99 3-104 1-115 (115)
 48 protein:vir:101594 Length: 173 99.1 4.7E-13 2.9E-16 88.2 9.7 106 3-108 1-171 (173)
 49 protein:vir:79034 Length: 141 99.1 4.2E-13 2.6E-16 88.4 8.7 108 1-108 1-136 (141)
 50 protein:vir:106041 Length: 137 99.1 1.5E-13 9.4E-17 90.9 5.8 101 1-106 3-137 (137)
 51 protein:vir:105467 Length: 144 99.0 1E-12 6.3E-16 86.3 8.2 107 1-108 1-141 (144)
 52 protein:vir:100075 Length: 140 99.0 4E-12 2.5E-15 83.1 9.0 105 1-108 1-133 (140)
 53 protein:vir:102441 Length: 137 98.9 3.1E-12 1.9E-15 83.7 6.4 99 1-99 3-137 (137)
 54 protein:vir:80362 Length: 140 98.9 9.7E-12 6E-15 81.0 8.2 105 1-108 1-133 (140)
 55 protein:vir:102875 Length: 146 98.9 1.5E-11 9.2E-15 79.9 9.1 105 1-108 5-143 (146)
 56 protein:vir:105007 Length: 146 98.9 1.5E-11 9.2E-15 79.9 9.1 105 1-108 5-143 (146)
 57 protein:vir:102085 Length: 146 98.9 1.5E-11 9.2E-15 79.9 9.1 105 1-108 5-143 (146)
 58 protein:vir:107568 Length: 146 98.9 1.5E-11 9.2E-15 79.9 9.1 105 1-108 5-143 (146)
 59 protein:vir:100243 Length: 140 98.9 1.1E-11 6.6E-15 80.8 8.0 105 1-108 1-133 (140)
 60 protein:vir:97982 Length: 140 98.9 7.2E-12 4.5E-15 81.7 6.9 100 1-101 8-140 (140)
 61 protein:vir:107545 Length: 140 98.9 7.2E-12 4.5E-15 81.7 6.9 100 1-101 8-140 (140)
 62 protein:vir:106506 Length: 137 98.8 1.1E-11 6.9E-15 80.6 6.2 102 1-108 4-137 (137)
 63 protein:vir:194 Length: 149 # 98.8 4E-11 2.5E-14 77.6 8.5 105 1-108 2-144 (149)
 64 protein:vir:1437 Length: 140 # 98.8 3.5E-11 2.2E-14 77.9 8.0 105 1-108 1-133 (140)
 65 protein:vir:104347 Length: 145 98.8 9.4E-11 5.9E-14 75.5 9.9 103 1-107 5-145 (145)
 66 protein:vir:3873 Length: 128 # 98.8 8.2E-11 5.1E-14 75.9 9.2 105 1-108 1-128 (128)
 67 protein:vir:93617 Length: 148 98.7 7.9E-11 4.9E-14 76.0 8.2 105 1-108 4-143 (148)
 68 protein:vir:97088 Length: 157 98.7 1.3E-10 8.1E-14 74.8 8.8 108 1-108 1-150 (157)
 69 protein:vir:103280 Length: 142 98.7 4.6E-10 2.8E-13 71.8 10.1 103 1-107 1-142 (142)
 70 protein:vir:9708 Length: 125 # 98.6 3.6E-10 2.2E-13 72.4 9.1 105 1-108 1-124 (125)
 71 protein:vir:1273 Length: 127 # 98.6 5.8E-10 3.6E-13 71.2 9.5 105 1-108 2-127 (127)
 72 protein:vir:79638 Length: 146 98.6 5.9E-10 3.7E-13 71.2 9.4 104 1-108 1-145 (146)
 73 protein:vir:99528 Length: 92 # 98.6 1.8E-10 1.1E-13 74.0 6.5 77 1-77 4-92 (92)
 74 protein:vir:105089 Length: 133 98.6 4.9E-10 3E-13 71.6 8.8 105 1-108 2-131 (133)
 75 protein:vir:9414 Length: 125 # 98.6 9.2E-10 5.7E-13 70.1 9.5 105 1-108 1-125 (125)
 76 protein:vir:4704 Length: 125 # 98.6 9.2E-10 5.7E-13 70.1 9.5 105 1-108 1-125 (125)
 77 protein:vir:81106 Length: 125 98.6 9.2E-10 5.7E-13 70.1 9.5 105 1-108 1-125 (125)
 78 protein:vir:98342 Length: 125 98.6 9.2E-10 5.7E-13 70.1 9.5 105 1-108 1-125 (125)
 79 protein:vir:79988 Length: 125 98.6 9.2E-10 5.7E-13 70.1 9.5 105 1-108 1-125 (125)
 80 protein:vir:94994 Length: 131 98.6 1.1E-09 6.9E-13 69.7 9.9 100 1-104 1-131 (131)
 81 protein:vir:78380 Length: 131 98.6 1.3E-09 7.8E-13 69.4 10.1 100 1-104 1-131 (131)
 82 protein:vir:5745 Length: 135 # 98.5 1.1E-09 6.6E-13 69.8 8.9 105 1-108 1-131 (135)
 83 protein:vir:102963 Length: 163 98.5 2.8E-09 1.8E-12 67.4 10.3 107 1-108 1-155 (163)
 84 protein:vir:1891 Length: 179 # 98.4 2.4E-09 1.5E-12 67.9 7.7 105 1-108 5-166 (179)
 85 protein:vir:107703 Length: 147 98.4 7.3E-09 4.5E-12 65.2 10.0 103 1-107 1-147 (147)
 86 protein:vir:97190 Length: 148 98.3 7.8E-09 4.9E-12 65.0 9.2 103 1-108 1-147 (148)
 87 protein:vir:102338 Length: 116 98.3 4.3E-09 2.7E-12 66.5 7.5 96 13-108 1-116 (116)
 88 protein:vir:1386 Length: 149 # 98.3 9.9E-09 6.1E-12 64.5 8.9 105 1-108 1-148 (149)
 89 protein:vir:96774 Length: 152 98.3 1.4E-08 8.4E-12 63.7 9.4 99 1-106 11-152 (152)
 90 protein:vir:4347 Length: 164 # 98.2 5.5E-09 3.4E-12 65.9 6.5 105 1-108 5-151 (164)
 91 protein:vir:80425 Length: 134 98.2 1.4E-08 8.6E-12 63.7 8.5 100 1-108 1-134 (134)
 92 protein:vir:94944 Length: 121 98.2 1.5E-08 9.5E-12 63.4 7.7 89 1-92 2-121 (121)
 93 protein:vir:102154 Length: 119 98.1 3.4E-08 2.1E-11 61.5 8.1 106 1-108 2-119 (119)
 94 protein:vir:95157 Length: 144 98.0 5.6E-08 3.5E-11 60.4 7.9 99 1-105 1-144 (144)
 95 protein:vir:9879 Length: 127 # 97.9 4.4E-08 2.8E-11 60.9 5.6 100 5-105 1-127 (127)
 96 protein:vir:81147 Length: 126 97.9 1.7E-07 1E-10 57.7 8.2 107 1-107 1-126 (126)
 97 protein:vir:100652 Length: 134 97.6 1E-06 6.2E-10 53.5 8.2 105 1-106 1-134 (134)
 98 protein:vir:101302 Length: 134 97.5 1.5E-06 9E-10 52.6 8.1 105 1-106 1-134 (134)
 99 protein:vir:9513 Length: 134 # 97.5 1.5E-06 9E-10 52.6 8.1 105 1-106 1-134 (134)
100 protein:vir:3163 Length: 145 # 97.4 4.5E-07 2.8E-10 55.4 5.0 106 2-108 1-140 (145)
101 protein:vir:6246 Length: 143 # 97.4 4.5E-07 2.8E-10 55.4 4.5 107 1-108 1-142 (143)
102 protein:vir:1332 Length: 143 # 97.4 8.1E-07 5E-10 54.0 5.8 107 1-108 1-142 (143)
103 protein:vir:79091 Length: 175 97.3 1.3E-06 8.4E-10 52.8 6.3 107 1-108 1-173 (175)
104 protein:vir:100887 Length: 139 97.2 5.1E-06 3.2E-09 49.6 8.4 102 3-108 1-131 (139)
105 protein:vir:99196 Length: 155 97.2 5.5E-06 3.4E-09 49.4 8.5 106 1-107 1-155 (155)
106 protein:vir:99833 Length: 190 97.2 6.1E-06 3.8E-09 49.2 8.6 107 1-108 4-187 (190)
107 protein:vir:78335 Length: 133 97.1 8.3E-06 5.2E-09 48.4 8.6 106 1-107 1-133 (133)
108 protein:vir:1988 Length: 156 # 97.1 7.2E-06 4.5E-09 48.8 8.3 106 1-108 1-155 (156)
109 protein:vir:96288 Length: 100 97.1 2.2E-06 1.4E-09 51.6 5.3 76 1-99 23-100 (100)
110 protein:vir:103841 Length: 155 97.1 6.1E-06 3.8E-09 49.2 7.5 107 1-108 1-152 (155)
111 protein:vir:100223 Length: 139 97.0 1.3E-05 8.2E-09 47.3 8.8 102 3-108 1-131 (139)
112 protein:vir:966 Length: 123 # 96.8 3E-05 1.8E-08 45.4 9.4 105 1-105 1-123 (123)
113 protein:vir:107851 Length: 175 96.8 1.9E-05 1.2E-08 46.4 8.0 107 1-108 1-173 (175)
114 protein:vir:98636 Length: 138 96.7 2.1E-05 1.3E-08 46.2 8.1 105 1-108 7-137 (138)
115 protein:vir:9647 Length: 132 # 96.6 2.6E-05 1.6E-08 45.7 7.6 107 1-108 1-131 (132)
116 protein:vir:93898 Length: 133 96.5 3.3E-05 2E-08 45.2 7.7 104 1-105 1-133 (133)
117 protein:vir:79225 Length: 155 96.4 2.6E-05 1.6E-08 45.7 6.5 106 1-107 1-155 (155)
118 protein:vir:96973 Length: 133 96.3 4.9E-05 3E-08 44.2 7.7 104 1-105 1-133 (133)
119 protein:vir:9363 Length: 133 # 96.3 4.9E-05 3E-08 44.2 7.7 104 1-105 1-133 (133)
120 protein:vir:78644 Length: 133 96.3 4.9E-05 3E-08 44.2 7.7 104 1-105 1-133 (133)
121 protein:vir:94419 Length: 133 96.3 4.9E-05 3E-08 44.2 7.7 104 1-105 1-133 (133)
122 protein:vir:4956 Length: 153 # 95.8 0.00017 1E-07 41.3 8.3 104 1-108 1-139 (153)
123 protein:vir:81067 Length: 119 95.4 3.9E-05 2.4E-08 44.7 3.5 80 28-108 1-112 (119)
124 protein:vir:10367 Length: 119 95.3 4.4E-05 2.7E-08 44.5 3.5 80 28-108 1-112 (119)
125 protein:vir:96012 Length: 133 94.9 0.00038 2.3E-07 39.3 7.5 103 1-107 1-133 (133)
126 protein:vir:5000 Length: 141 # 94.8 0.00047 2.9E-07 38.8 7.8 104 2-108 1-135 (141)
127 protein:vir:3848 Length: 159 # 94.4 0.0016 9.8E-07 35.9 9.5 108 1-108 1-154 (159)
128 protein:vir:95372 Length: 124 94.0 0.00074 4.6E-07 37.7 7.0 104 1-105 1-124 (124)
129 protein:vir:4859 Length: 140 # 94.0 0.00087 5.4E-07 37.4 7.3 104 1-108 1-135 (140)
130 protein:vir:4200 Length: 133 # 93.1 0.0009 5.6E-07 37.3 5.9 104 2-105 1-133 (133)
131 protein:vir:2688 Length: 123 # 91.5 0.0031 1.9E-06 34.3 6.9 101 1-105 5-123 (123)
132 protein:vir:4833 Length: 140 # 91.3 0.0016 9.9E-07 35.9 5.1 104 1-108 4-139 (140)
133 protein:vir:6216 Length: 125 # 91.3 0.0049 3E-06 33.3 7.7 107 1-107 1-125 (125)
134 protein:vir:4162 Length: 133 # 89.8 0.0035 2.2E-06 34.0 5.6 105 2-108 1-132 (133)
135 protein:vir:80116 Length: 127 87.2 0.014 8.9E-06 30.7 7.1 106 1-108 1-127 (127)
136 protein:vir:5257 Length: 148 # 86.8 0.016 1E-05 30.4 7.2 88 1-108 1-91 (148)
137 protein:vir:96105 Length: 193 83.4 0.019 1.2E-05 30.1 5.9 93 1-108 1-132 (193)
138 protein:vir:106728 Length: 155 82.1 0.014 8.9E-06 30.7 4.8 90 1-108 1-98 (155)
139 protein:vir:78607 Length: 155 81.9 0.015 9.4E-06 30.6 4.8 90 1-108 1-98 (155)
140 protein:vir:98557 Length: 149 80.1 0.044 2.7E-05 28.0 6.7 102 1-105 1-149 (149)
141 protein:vir:2026 Length: 150 # 80.1 0.054 3.3E-05 27.5 7.2 99 5-105 1-150 (150)
142 protein:vir:101563 Length: 155 76.3 0.039 2.4E-05 28.3 5.2 90 1-108 1-98 (155)
143 protein:vir:6071 Length: 150 # 74.8 0.13 8.2E-05 25.4 7.7 99 5-105 1-150 (150)
144 protein:vir:7993 Length: 108 # 73.4 0.016 1E-05 30.4 2.4 87 1-90 4-108 (108)
145 protein:vir:1028 Length: 168 # 71.6 0.19 0.00012 24.5 8.0 107 1-108 18-160 (168)
146 protein:vir:7412 Length: 168 # 71.2 0.2 0.00012 24.4 9.1 107 1-108 18-160 (168)
147 protein:vir:79179 Length: 155 69.7 0.16 0.0001 24.9 7.0 103 1-105 1-155 (155)
148 protein:vir:94069 Length: 168 69.6 0.058 3.6E-05 27.4 4.5 100 1-108 1-101 (168)
149 protein:vir:99546 Length: 200 66.1 0.088 5.4E-05 26.4 4.8 94 1-108 7-139 (200)
150 protein:vir:5703 Length: 150 # 63.2 0.16 0.0001 24.9 5.7 99 5-105 1-150 (150)
151 protein:vir:79115 Length: 148 60.7 0.22 0.00014 24.2 5.9 105 1-108 1-147 (148)
152 protein:vir:77650 Length: 155 60.6 0.14 8.7E-05 25.3 4.8 90 1-108 1-98 (155)
153 protein:vir:8106 Length: 150 # 54.4 0.058 3.6E-05 27.4 1.6 108 1-108 1-135 (150)
154 protein:vir:100312 Length: 152 54.2 0.31 0.00019 23.4 5.6 105 1-106 1-152 (152)
155 protein:vir:107757 Length: 189 49.8 0.43 0.00027 22.6 5.6 87 1-108 1-89 (189)
156 protein:vir:487 Length: 187 # 44.9 0.57 0.00035 21.9 5.5 103 1-108 20-180 (187)
157 protein:vir:4460 Length: 170 # 41.9 0.8 0.0005 21.1 5.8 103 1-108 7-163 (170)
158 protein:vir:1838 Length: 149 # 41.6 0.68 0.00042 21.5 5.4 102 1-105 1-149
(149) 159 protein:vir:7449 Length: 123 # 36.6 1.2 0.00072 20.2 8.0 102 1-107 4-123 (123) 160 protein:vir:1087 Length: 161 # 30.7 1.6 0.00096 19.5 9.0 107 1-108 2-156 (161) 161 protein:vir:102608 Length: 108 29.8 0.42 0.00026 22.7 2.2 86 1-90 14-108 (108) 162 protein:vir:105825 Length: 108 29.8 0.42 0.00026 22.7 2.2 86 1-90 14-108 (108) 163 protein:vir:78163 Length: 92 # 26.2 1.2 0.00074 20.2 4.0 84 1-93 1-92 (92) 164 protein:vir:4096 Length: 140 # 23.5 1.6 0.001 19.4 4.2 107 1-108 1-138 (140) 165 protein:vir:6154 Length: 119 # 20.2 0.42 0.00026 22.7 0.3 101 1-106 1-119 (119) No 1 >protein:vir:45 Length: 112 # NCBI annotation: gp10 # Family: family:all:899 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463471;swissprot:trembl:q9t1b3;genbank:gi:16798793;uniprot:Q9T1B3;genbank:GeneID:922369 Probab=99.90 E-value=2.8e-27 Score=166.14 Aligned_cols=104 Identities=16% Similarity=0.220 Sum_probs=92.0 Q ss_pred CCccccHH---HHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecccCcEEEEEecCchhhhhcccc--CC- Q lcl|NC_016654. 1 MPVEFNYG---IAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASDGMEAVVYFDTPYAARQHEEV--GW- 74 (108) Q Consensus 1 m~vk~n~~---~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~~~~g~V~y~~pYA~~~h~~~--~~- 74 (108) |+|+++-+ +...+.+|.++|+..++.+|++++++|||+|||+|++|+.+ .++|.|.|+||||++|||+. +| T Consensus 1 M~vkv~vn~~~~~~~l~~a~~r~q~~~~~ev~~~~~~yVP~~~G~L~~S~~~---~~~g~I~y~tPYAr~qYY~~~~~~~ 77 (112) T protein:vir:45 1 MPIKVRVDLSKAKGSVKKAKERGQFALINQAAADIALYVPFLSGDLSNQYVI---MNDKEIMWTSIYARRLYKGINFNFT 77 (112) T ss_pred CceeEEeehHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCccccCccccceee---ccCCeEEecChhhHHhhhccccCCC Confidence 99887754 55568888999999999999999999999999999999753 34578999999999999954 43 Q ss_pred --CCCCCccchhhHHHHHhHHHHHHHHHHHHHHhc Q lcl|NC_016654. 
75 --HHVDGQAKYLENAVNATQATVAEVIGEAIRRSI 107 (108) Q Consensus 75 --~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~~L 107 (108) ++|.+.++|+|+++.+++++|.+.+++.++++| T Consensus 78 ~~~~p~ag~~W~erak~~~~~~~~~~~~k~~~~gl 112 (112) T protein:vir:45 78 LTHHPLAGPEWDQRAKIDKMDVWEKVAQKAVEEGL 112 (112) T ss_pred CCCCCCCchhhHHHHHHhhHHHHHHHHHHHHhhcC Confidence 578888999999999999999999999999999 No 2 >protein:vir:80970 Length: 112 # NCBI annotation: gp10 # Family: family:all:899 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468396;genbank:gi:157324970;genbank:GeneID:5601405 Probab=99.89 E-value=8.4e-27 Score=163.53 Aligned_cols=104 Identities=18% Similarity=0.223 Sum_probs=91.2 Q ss_pred CCccccH---HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecccCcEEEEEecCchhhhhccc--cCC- Q lcl|NC_016654. 1 MPVEFNY---GIAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASDGMEAVVYFDTPYAARQHEE--VGW- 74 (108) Q Consensus 1 m~vk~n~---~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~~~~g~V~y~~pYA~~~h~~--~~~- 74 (108) |+|+++- .+...+.++..+|+..++.+|++++++|||+|||+|++|+.+ .++|.|.|+||||++|||+ ++| T Consensus 1 M~vkV~id~~~~~~~l~~a~~~aq~~~~~ev~~~~~~yVP~~tG~L~~s~~~---~~~g~I~y~tPYAr~qYY~~~~~~~ 77 (112) T protein:vir:80 1 MPIKVRVDLSKAKGSVKKAKERGQFALINQAAADIALYVPFLSGDLSNQYVI---MNDKEIMWTSIYARRLYNGINFNFT 77 (112) T ss_pred CceeEEeehHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccCccccceee---ccCceEEecCchhhHhhhcccCCCC Confidence 8766663 356678888999999999999999999999999999999753 3457899999999999994 333 Q ss_pred --CCCCCccchhhHHHHHhHHHHHHHHHHHHHHhc Q lcl|NC_016654. 
75 --HHVDGQAKYLENAVNATQATVAEVIGEAIRRSI 107 (108) Q Consensus 75 --~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~~L 107 (108) ++|.+.++|+|+++.+++++|.+.+++.++++| T Consensus 78 ~~~~p~ag~~W~erak~~~~~~~~~~~~k~~~~~l 112 (112) T protein:vir:80 78 LTHHPLAGPKWDQRAKVDKLESWIEVAQKAVEEGL 112 (112) T ss_pred cCCCCCcchhhHHHHHhhhhHHHHHHHHHHHhhcC Confidence 577888999999999999999999999999999 No 3 >protein:vir:98892 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:899 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164422;genbank:gi:56694912;genbank:GeneID:3197282 Probab=99.87 E-value=1.3e-25 Score=157.04 Aligned_cols=103 Identities=18% Similarity=0.224 Sum_probs=91.9 Q ss_pred CCccccHHHHH-H-HHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecccCcEEEEEecCchhhhhccc--cCCCC Q lcl|NC_016654. 1 MPVEFNYGIAA-T-VRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASDGMEAVVYFDTPYAARQHEE--VGWHH 76 (108) Q Consensus 1 m~vk~n~~~~~-~-v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~~~~g~V~y~~pYA~~~h~~--~~~~~ 76 (108) |-|++|.+... . ..++..+|+...+.+|++++++|||+|||+|++|+++.+++ |.|.|+||||++|||+ .+|.+ T Consensus 2 mkvkv~~~~~~~~~~~~~~~~aq~~~~~ev~~~~~~yVP~~~G~L~~s~~~~s~~--g~I~y~tPYAr~qYYg~~~n~~~ 79 (108) T protein:vir:98 2 PKIRVELSGAKDKLSPQTQRRGQYAMANQMLQDMNQFVPMEEGILRLTGNISSDA--EEIYYNTPYAKRRFYEPAYNYTT 79 (108) T ss_pred ceeEeeehHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcCcCCccccceeeccCC--ceEEecChhhHHhhhccccCCCC Confidence 99999976443 3 34577899999999999999999999999999999998864 7899999999999996 46678 Q ss_pred CCCccchhhHHHHHhHHHHHHHHHHHHHH Q lcl|NC_016654. 
77 VDGQAKYLENAVNATQATVAEVIGEAIRR 105 (108) Q Consensus 77 ~~~~~k~le~a~~~~~~~i~~~i~~~ir~ 105 (108) |.+.++|+|+++.+|+++|.+.+.++++= T Consensus 80 p~ag~~W~eraka~~~~~~~~~~~k~~k~ 108 (108) T protein:vir:98 80 PGTGPRWDMKAKRLFISDWERAYMKGANW 108 (108) T ss_pred CCCcchhHHHHHhhhhHHHHHHHHHhhcC Confidence 88889999999999999999999999988 No 4 >protein:vir:1581 Length: 116 # NCBI annotation: minor capsid protein # Family: family:all:899 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695163;swissprot:trembl:o03933;genbank:gi:23455806;uniprot:O03933;genbank:GeneID:955512 Probab=99.84 E-value=1.1e-24 Score=151.95 Aligned_cols=104 Identities=11% Similarity=0.109 Sum_probs=89.3 Q ss_pred CCccc--cHH-HHHHH-HHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecccCcEEEEEecCchhhhhcccc---- Q lcl|NC_016654. 1 MPVEF--NYG-IAATV-RGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASDGMEAVVYFDTPYAARQHEEV---- 72 (108) Q Consensus 1 m~vk~--n~~-~~~~v-~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~~~~g~V~y~~pYA~~~h~~~---- 72 (108) |+|++ |-+ +...+ .++..+|+...+.++++++++|||+|||+|+.|+.+.+..+.|.|.|+||||++|||+. T Consensus 1 M~ikVkv~l~~~~~~~~~~~~~r~Q~~l~~qv~~~m~~YVP~~tg~~~ls~~~~~~~~~~~I~y~tPYAr~qyYg~~~~~ 80 (116) T protein:vir:15 1 MAFRINVDLDGFMDQTSLDNVKRGQYALVNQAMYDMEQFVPKDRPEEPLRQSVHATSDGSEITYSTPYAKAQFYGIINDK 80 (116) T ss_pred CCceEEeehhHhhhhhhHHHHHHHHHHHHHHHHHhhhccCCcccCCcccccceeeecCCceEEecCchhHHHhcccccCC Confidence 65554 433 44545 47889999999999999999999999999999988888888899999999999999943 Q ss_pred -CC---CCCCCccchhhHHHHHhHHHHHHHHHHHHH Q lcl|NC_016654. 
73 -GW---HHVDGQAKYLENAVNATQATVAEVIGEAIR 104 (108) Q Consensus 73 -~~---~~~~~~~k~le~a~~~~~~~i~~~i~~~ir 104 (108) +| .+|.++.+|+|+++.+|+++|.+++.++++ T Consensus 81 ~~~~~~t~p~ag~~W~eraK~~h~~~w~~~~~k~~~ 116 (116) T protein:vir:15 81 YPVHNYTTPGTTKRWDLKAKSMFMSSWIDTFTKGMK 116 (116) T ss_pred CCcccccCCCCCcchhHHHHhhhHHHHHHHHHHhcC Confidence 33 467888999999999999999999999999 No 5 >protein:vir:4790 Length: 114 # NCBI annotation: putative minor capsid protein 3 # Family: family:all:899 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150170;swissprot:trembl:q94m41;genbank:gi:15088781;uniprot:Q94M41;genbank:GeneID:955992 Probab=99.83 E-value=5.9e-24 Score=147.92 Aligned_cols=102 Identities=15% Similarity=0.200 Sum_probs=84.0 Q ss_pred CCccccHH-HHHHH-HHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecccCcEEEEEecCchhhhhcccc------ Q lcl|NC_016654. 1 MPVEFNYG-IAATV-RGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASDGMEAVVYFDTPYAARQHEEV------ 72 (108) Q Consensus 1 m~vk~n~~-~~~~v-~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~~~~g~V~y~~pYA~~~h~~~------ 72 (108) |.|++|.+ +...+ .++..+|+...+.++++++++|||+|||+|++|+.+.++. |.|.|+||||++|||+. T Consensus 3 ~kVkv~l~~~~~~l~~~~l~r~Q~~~~~ev~~~~~~YVP~~~G~L~~S~~~~~~~--~~I~y~tPYAr~qyYg~~~~~~~ 80 (114) T protein:vir:47 3 IAIKVDLQKAKQKLSNESMTRGKIAVASKILLDNEQYIPLRGGELRASGRIVGQG--DAVVYGTVYARAQFYGSNGIVTF 80 (114) T ss_pred eeEEeehhHHHHHHHHHHHHHHHHHHHHHHHHhhccCCcCccCccccceeeeeCC--cEEEecCchhhHhhhcccCCCCC Confidence 44555543 44555 4678999999999999999999999999999999886644 67999999999999943 Q ss_pred -CCCCCCCccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 73 -GWHHVDGQAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 73 -~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) ++++|.+..+|+|+++.+++++|.+.+. 
+.+| T Consensus 81 ~~~~~p~~g~~W~eraka~~~~~~~~~~~----k~~g 113 (114) T protein:vir:47 81 RRYTTPGTGKRWDQVATSKHAEEWARAFV----KGMG 113 (114) T ss_pred CccCCCCCcchhHHHHHhhhhHHHHHHHH----HhhC Confidence 2357888899999999999999998555 6777 No 6 >protein:vir:3036 Length: 118 # NCBI annotation: minor capsid protein # Family: family:all:899 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438149;genbank:gi:16271812;genbank:GeneID:929237 Probab=99.78 E-value=1.8e-22 Score=139.86 Aligned_cols=100 Identities=20% Similarity=0.171 Sum_probs=83.8 Q ss_pred CCccccHH-HHHHHH-HHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecccCcEEEEEecCchhhhhcccc------ Q lcl|NC_016654. 1 MPVEFNYG-IAATVR-GAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASDGMEAVVYFDTPYAARQHEEV------ 72 (108) Q Consensus 1 m~vk~n~~-~~~~v~-~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~~~~g~V~y~~pYA~~~h~~~------ 72 (108) |-|++|-+ +.+.+. ++..+|+...+.++++++++|||+|||+|++|+.+.++ .|.|+||||++|||+. T Consensus 2 ~kV~vdl~~~~~~ls~~~~~k~Q~~~~~ev~~~~~~YVP~~tG~Lk~S~~i~~~----~I~Y~tPYAr~qYY~~~~~~~~ 77 (118) T protein:vir:30 2 AKVVVELGGIKRKVSPQALAKGKLIMNNQVMMSMNPYVPYRDGALRGSSRANSV----GVTWSGPHARAQFYGGAYNKYK 77 (118) T ss_pred ceeeechhHHhhhhhHHHHHHHHHHHHHHHHHHhhcCCCCccCccccceeecCC----eeEECCchhhHhhhccccCCCC Confidence 99999975 556664 78899999999999999999999999999999987765 3999999999999953 Q ss_pred --C---CCCCCCccchhhHHHHHhH--HHHHHHHHHHHHHhcC Q lcl|NC_016654. 
73 --G---WHHVDGQAKYLENAVNATQ--ATVAEVIGEAIRRSIA 108 (108) Q Consensus 73 --~---~~~~~~~~k~le~a~~~~~--~~i~~~i~~~ir~~Lg 108 (108) + ..||.+..+|+++++.+++ ++|.+ ...+.+| T Consensus 78 g~~~~~~~~p~~g~~Wd~R~ka~~~~~~~w~~----~~~k~~g 116 (118) T protein:vir:30 78 SFKFKKYTTPGTGKRWDKRALANATIVKDWEK----SLLRGMG 116 (118) T ss_pred ccccccccCCCCCCcccchhhcchhhhHHHHH----HHHHhcC Confidence 2 3467788899999998765 78877 4566777 No 7 >protein:vir:9823 Length: 118 # NCBI annotation: putative minor capsid protein # Family: family:all:899 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795585;genbank:gi:28876336;genbank:GeneID:1257873 Probab=99.78 E-value=1.8e-22 Score=139.86 Aligned_cols=100 Identities=20% Similarity=0.171 Sum_probs=83.8 Q ss_pred CCccccHH-HHHHHH-HHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecccCcEEEEEecCchhhhhcccc------ Q lcl|NC_016654. 1 MPVEFNYG-IAATVR-GAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASDGMEAVVYFDTPYAARQHEEV------ 72 (108) Q Consensus 1 m~vk~n~~-~~~~v~-~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~~~~g~V~y~~pYA~~~h~~~------ 72 (108) |-|++|-+ +.+.+. ++..+|+...+.++++++++|||+|||+|++|+.+.++ .|.|+||||++|||+. T Consensus 2 ~kV~vdl~~~~~~ls~~~~~k~Q~~~~~ev~~~~~~YVP~~tG~Lk~S~~i~~~----~I~Y~tPYAr~qYY~~~~~~~~ 77 (118) T protein:vir:98 2 AKVVVELGGIKRKVSPQALAKGKLIMNNQVMMSMNPYVPYRDGALRGSSRANSV----GVTWSGPHARAQFYGGAYNKYK 77 (118) T ss_pred ceeeechhHHhhhhhHHHHHHHHHHHHHHHHHHhhcCCCCccCccccceeecCC----eeEECCchhhHhhhccccCCCC Confidence 99999975 556664 78899999999999999999999999999999987765 3999999999999953 Q ss_pred --C---CCCCCCccchhhHHHHHhH--HHHHHHHHHHHHHhcC Q lcl|NC_016654. 
73 --G---WHHVDGQAKYLENAVNATQ--ATVAEVIGEAIRRSIA 108 (108) Q Consensus 73 --~---~~~~~~~~k~le~a~~~~~--~~i~~~i~~~ir~~Lg 108 (108) + ..||.+..+|+++++.+++ ++|.+ ...+.+| T Consensus 78 g~~~~~~~~p~~g~~Wd~R~ka~~~~~~~w~~----~~~k~~g 116 (118) T protein:vir:98 78 SFKFKKYTTPGTGKRWDKRALANATIVKDWEK----SLLRGMG 116 (118) T ss_pred ccccccccCCCCCCcccchhhcchhhhHHHHH----HHHHhcC Confidence 2 3467788899999998765 78877 4566777 No 8 >protein:vir:79687 Length: 113 # NCBI annotation: hypothetical protein # Family: family:all:899 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285886;genbank:gi:148750843;genbank:GeneID:5220386 Probab=99.76 E-value=9.2e-22 Score=135.92 Aligned_cols=102 Identities=15% Similarity=0.153 Sum_probs=80.8 Q ss_pred CCccccHHHHHHHH-HHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecccCcEEEEEecCchhhhhccccC------ Q lcl|NC_016654. 1 MPVEFNYGIAATVR-GAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASDGMEAVVYFDTPYAARQHEEVG------ 73 (108) Q Consensus 1 m~vk~n~~~~~~v~-~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~~~~g~V~y~~pYA~~~h~~~~------ 73 (108) || .|+ .+...+. ++..+|+...+.+|++++++|||+|||+|++|+.+. +|.|.|+||||++|||+.. T Consensus 1 ~~-dL~-~~~~~~~~~~~~raQ~~l~~ev~~~~~pYVP~~~G~Lk~S~~i~----s~~I~y~tPYAr~qyYg~~~~~~~~ 74 (113) T protein:vir:79 1 MS-DLS-VFSRMAQSTGSRSVRLQVLNQMHQDMEQYVPKRAGFLRSQSFVN----DTGIHYTAKYARAQFYGFVNGHRVR 74 (113) T ss_pred Cc-hHH-HHHHhhchhHHHHHHHHHHHHHHHhhcccCcccccchhcccccc----CCeeEecChhhhHhhccccCCCCcc Confidence 43 222 3444444 477889999999999999999999999999998754 4569999999999999532 Q ss_pred -CCCCCCccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 
74 -WHHVDGQAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 74 -~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) +.+|.+..+|+|+++.+|+++|.+++.+++.++-= T Consensus 75 ~~t~p~ag~~W~eraKa~h~~~w~~~~~~a~~~G~~ 110 (113) T protein:vir:79 75 NYSTPGTGRRWDLKAKAVYKADWQKVAVAAFLKEAK 110 (113) T ss_pred ccCCCCCCchhhHHHHHHhHHHHHHHHHHHhhcccc Confidence 33577889999999999999999988876654333 No 9 >protein:vir:105773 Length: 131 # NCBI annotation: gp14 # Family: family:all:10996 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224152;genbank:gi:62362227;genbank:GeneID:3342526 Probab=99.69 E-value=1.2e-20 Score=129.89 Aligned_cols=105 Identities=17% Similarity=0.205 Sum_probs=87.1 Q ss_pred CCcc--------ccHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeeccc----CcEEEEEecCchhhhh Q lcl|NC_016654. 1 MPVE--------FNYGIAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASD----GMEAVVYFDTPYAARQ 68 (108) Q Consensus 1 m~vk--------~n~~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~----~~~g~V~y~~pYA~~~ 68 (108) |.|+ +|+-+.....+.+.|||..+.......|+.|+|+||+||.||+|.++. ..+|+|||+++||.|+ T Consensus 1 ikV~Gi~~~~~nl~~~i~~I~~~K~~Ral~~al~~~~~~AA~~TPIDTSTLiNSQfrei~~ngtritGRVGYSAnYA~yV 80 (131) T protein:vir:10 1 MPVKGIKRIQMNTRRVLSDIAGIRTEKVLYLVMNAGANHAAVITPVKSSTLINSQYKKLEPIPSGMIGRVGYTANYAAAV 80 (131) T ss_pred CCcchHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHhhhhhccccchhhhccccceeeeccCceeEEeeccceeeeeee Confidence 6664 222222222357999999999999999999999999999999997653 3589999999999999 Q ss_pred cc-------------ccCCCCCCCccchhhHHHHHh-HHHHHHHHHHHHHH Q lcl|NC_016654. 69 HE-------------EVGWHHVDGQAKYLENAVNAT-QATVAEVIGEAIRR 105 (108) Q Consensus 69 h~-------------~~~~~~~~~~~k~le~a~~~~-~~~i~~~i~~~ir~ 105 (108) |. 
..+|..|.|+++||..+++++ .+.+..+|.+++|- T Consensus 81 Hda~Gklkgqprp~gkgn~w~p~ae~eFL~kgfe~~~~d~i~avik~e~k~ 131 (131) T protein:vir:10 81 NAAKGKLKGKPRPDGSGNYWDPNGEPDFLRKGFERDGLNEIKAIIRQGYKV 131 (131) T ss_pred ecCccccCCCcCCCCCcceecCCCChhhhhhhhhccchHHHHHHHhhhcCC Confidence 98 346888999999999999865 78899999999998 No 10 >protein:vir:78894 Length: 105 # NCBI annotation: gp10 # Family: family:all:29989 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468850;genbank:gi:157325424;genbank:GeneID:5601891 Probab=99.67 E-value=1.5e-20 Score=129.25 Aligned_cols=100 Identities=22% Similarity=0.215 Sum_probs=77.8 Q ss_pred CCc-cccHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecccCcEEEEEec----CchhhhhccccCCC Q lcl|NC_016654. 1 MPV-EFNYGIAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASDGMEAVVYFD----TPYAARQHEEVGWH 75 (108) Q Consensus 1 m~v-k~n~~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~~~~g~V~y~----~pYA~~~h~~~~~~ 75 (108) ||. +|.-.+.+.+.+..-..--.+.-+|++.+++|||++||+|++|+...+..++|+|.|+ ||||++|||+.. + T Consensus 1 ~~f~~f~~~~~k~l~kr~L~~~g~vq~EvlR~~~PyvP~~tG~Lk~S~~l~tvIgsg~I~y~~~~~aPYAr~qYYe~~-R 79 (105) T protein:vir:78 1 MSFSSFKDAVIDDIHNKALSTAAKAGGELVELAQPVTPILYGDLRRSSYFKIIIQKNSIVARVFSLTPYARRQYYENR-R 79 (105) T ss_pred CCcccccchHHHHHHHhcCCCCchhhHHHHHHhCCCCcccccccccccccceeecCCeeEeeccccCchhhhhhhccc-C Confidence 774 3444455555443222111344589999999999999999999999999999999998 999999998543 3 Q ss_pred CCCCccchhhHHHHHhHHHHHHHHHHHHHH Q lcl|NC_016654. 
76 HVDGQAKYLENAVNATQATVAEVIGEAIRR 105 (108) Q Consensus 76 ~~~~~~k~le~a~~~~~~~i~~~i~~~ir~ 105 (108) + +.|||+++.+|+++|.++++..+|= T Consensus 80 g----~~WfErm~a~hk~~I~~~vegg~~~ 105 (105) T protein:vir:78 80 N----PRWYEMAVSYGIQSINQIVEGGMRL 105 (105) T ss_pred C----CchhHHhhhcchhHHHHHHhcccCC Confidence 3 3599999999999999999966655 No 11 >protein:vir:95789 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950593;genbank:gi:119953788;genbank:GeneID:5076859 Probab=99.60 E-value=4.3e-18 Score=115.80 Aligned_cols=105 Identities=16% Similarity=0.185 Sum_probs=92.6 Q ss_pred CCccccH--H-------HHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecccCcEEEEEecCchhhhhccc Q lcl|NC_016654. 1 MPVEFNY--G-------IAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASDGMEAVVYFDTPYAARQHEE 71 (108) Q Consensus 1 m~vk~n~--~-------~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~~~~g~V~y~~pYA~~~h~~ 71 (108) |||+|.. . +.+.+.+.+.++|..++..+.+++...+|+|||+|++|..+..++.+|.|+.+++||.|++|| T Consensus 1 msi~i~Gld~l~~~l~~~~~~~~~~v~~al~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~~g~~~~V~~~~~Ya~yvE~G 80 (114) T protein:vir:95 1 MAIKWQGIEKLVATISNAQPKAVEQSLQVLKNNGEKGKRIAKQLAPKDTEFLKDHITTSYPGMEAHIHGEAGYDGYQEYG 80 (114) T ss_pred CeeeeehHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCchhhhhceeeecCceEEEeecCCCccceeecC Confidence 9999974 2 233345567889999999999999999999999999999998888899999999999999999 Q ss_pred cCCCCCCCccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 
72 VGWHHVDGQAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 72 ~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) |.+..+ | +||.+++..++.++.+.+.+.|+++|= T Consensus 81 T~~~~a--q-Pfl~pa~~~~~~~~~~~l~~~l~~~~k 114 (114) T protein:vir:95 81 TRFQPG--T-PHFRPMMEQIQPQFQKDMTDVMKGAFK 114 (114) T ss_pred ccccCC--C-ccchhhHHHHHHHHHHHHHHHHHhhcC Confidence 976543 3 799999999999999999999999999 No 12 >protein:vir:94538 Length: 125 # NCBI annotation: putative head to tail joining # Family: family:all:180 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223893;genbank:gi:62327105;genbank:GeneID:5075554 Probab=99.53 E-value=3.7e-17 Score=110.65 Aligned_cols=105 Identities=9% Similarity=0.102 Sum_probs=88.7 Q ss_pred CCccccH--H-------HHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeec-----ccCcEEEEEecCchhh Q lcl|NC_016654. 1 MPVEFNY--G-------IAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTA-----SDGMEAVVYFDTPYAA 66 (108) Q Consensus 1 m~vk~n~--~-------~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~-----~~~~~g~V~y~~pYA~ 66 (108) |||+|.. . +...+.+++.+++..++..+..++...+|+|||+|++|..+. .++.++.|+++++||. T Consensus 5 ~~i~~~Gld~l~~~L~~~~~~~~~~v~~al~~~a~~i~~~ak~~ap~~tG~L~~sI~~~~~~~~~~~~~~~v~~~~~Ya~ 84 (125) T protein:vir:94 5 FNIKFKGVDKLLDEFDISRKELVPYSVEAMKTSLSRAVEKSKGLARVDTGYMRNNIQQDEVKEEHGVVTGRYVARADYSS 84 (125) T ss_pred eeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCChhhhhhceecceeccCCcEEEEeeCCCCccc Confidence 8899873 2 333455678889999999999999999999999999998654 2345799999999999 Q ss_pred hhccccCCCCCCCccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 67 RQHEEVGWHHVDGQAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 67 ~~h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) +++|||.+..|. 
+||.+++..+++.+.+.+.++|++.+= T Consensus 85 ~vEfGT~~~~a~---Pfl~pa~~~~~~~~~~~l~~~l~~a~k 123 (125) T protein:vir:94 85 YNEYGTYRMSAQ---PFMAPSVAAMTPFFYKAVRDALNKAAK 123 (125) T ss_pred eeecccccCCCC---cccchhHHHHHHHHHHHHHHHHHHHhc Confidence 999999865444 799999999999999999999988888 No 13 >protein:vir:3617 Length: 112 # NCBI annotation: ORF40 # Family: family:all:180 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112703;genbank:gi:13786571;genbank:GeneID:921069 Probab=99.46 E-value=2.2e-16 Score=106.43 Aligned_cols=101 Identities=12% Similarity=0.209 Sum_probs=84.1 Q ss_pred CCccccH----HHHHHHH-----HHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecc--cCcEEEEEecCchhhhhc Q lcl|NC_016654. 1 MPVEFNY----GIAATVR-----GAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTAS--DGMEAVVYFDTPYAARQH 69 (108) Q Consensus 1 m~vk~n~----~~~~~v~-----~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~--~~~~g~V~y~~pYA~~~h 69 (108) ||..|+. .+.+.+. +++++++..++..|..++...+|+|||+|++|..+.. ++.++.|+++++||.+++ T Consensus 1 M~~~i~i~Gld~l~~~L~~~~~~~~~~~al~~~~~~i~~~ak~~aPvdTG~Lr~si~~~~~~~~~~~~V~~~~~Ya~~vE 80 (112) T protein:vir:36 1 MKSSLSFKGIDQLVKHLDKAASLKGVQQVVKSNTSNMTANMQKLVPVDTGYMKRSIKMELTEGGFSGQAGPHTDYSAYVE 80 (112) T ss_pred CceeeeehhHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhCCCCchhhhhceeeeecCCceEEEeecCCCccceee Confidence 6666653 2333333 5678889999999999999999999999999998765 345899999999999999 Q ss_pred cccCCCCCCCccchhhHHHHHhHHHHHHHHHHHHH Q lcl|NC_016654. 70 EEVGWHHVDGQAKYLENAVNATQATVAEVIGEAIR 104 (108) Q Consensus 70 ~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir 104 (108) |||.+. 
++| +||.+++..++.++.+.|.+.|| T Consensus 81 ~GT~k~--~a~-Pfl~pa~~~~~~~~~~~i~~~lr 112 (112) T protein:vir:36 81 YGTRFQ--SAQ-PFVKPAYNEQKGVFIKDLERLLK 112 (112) T ss_pred cccccc--CCC-cchhhhHHHHHHHHHHHHHHHcC Confidence 999754 343 69999999999999999999999 No 14 >protein:vir:5978 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690678;genbank:geneid:6329146;genbank:gi:22855072;interpro:IPR011693;uniprot:O48447;genbank:GeneID:955318 Probab=99.45 E-value=4e-16 Score=104.98 Aligned_cols=104 Identities=21% Similarity=0.247 Sum_probs=81.8 Q ss_pred CCccccHH-----------HHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecc--cCcEEEEEecCchhhh Q lcl|NC_016654. 1 MPVEFNYG-----------IAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTAS--DGMEAVVYFDTPYAAR 67 (108) Q Consensus 1 m~vk~n~~-----------~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~--~~~~g~V~y~~pYA~~ 67 (108) ||++++.. +.+.+.+++.++|..+++.+..++...+|+|||+|++|..+.+ ++.+|.|+.+++||.| T Consensus 4 ms~~i~~~g~~~l~~~l~~~~~~~~~~v~~~l~~~a~~i~~~ak~~apv~TG~Lr~SI~~~~~~~g~~~~V~~~~~YA~~ 83 (144) T protein:vir:59 4 MSVRIDPSWRRIMSRNVRTFSGHVLTQVEQVIIKTAEKIAGLAASLAPVDEGNLKNSIQIDYKNNGLTAEITVGAEYAIY 83 (144) T ss_pred ceeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcCeeEEeecCcEEEEEecCCCccch Confidence 99998742 3345667788999999999999999999999999999987654 4568999999999999 Q ss_pred hccccCCCCC----------------------CCc--cchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 68 QHEEVGWHHV----------------------DGQ--AKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 68 ~h~~~~~~~~----------------------~~~--~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) +|+||..+.. .++ .+||.+++..+++.+.+. 
|++-.| T Consensus 84 vE~GT~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~~~~~----i~~~~g 144 (144) T protein:vir:59 84 VEYGTGIYAVDGNGRKTPWTYYSPKLGRYVRTQGAPAQPFFWPAVEEGGEYFERE----MRRLRG 144 (144) T ss_pred hhcCccccccCCCccccccccccccccceecCCCCCCCcchhHHHHHHHHHHHHH----HHHhcC Confidence 9999843211 111 259999999998888774 555555 No 15 >protein:vir:78077 Length: 141 # NCBI annotation: gp9 # Family: family:all:180 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468793;genbank:gi:157325374;genbank:GeneID:5601839 Probab=99.43 E-value=3.5e-16 Score=105.35 Aligned_cols=107 Identities=19% Similarity=0.180 Sum_probs=78.4 Q ss_pred CCccccHHH---HHHHHHHHHH-----HHHHHHHHHHHHhhhcCCcccchhhcceeecc--cCcEEEEEecCchhhhhcc Q lcl|NC_016654. 1 MPVEFNYGI---AATVRGAAKS-----GLHDAAEVVKQEAIERCPKETGALRNSAGTAS--DGMEAVVYFDTPYAARQHE 70 (108) Q Consensus 1 m~vk~n~~~---~~~v~~a~~~-----al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~--~~~~g~V~y~~pYA~~~h~ 70 (108) =+++|..+. ...+++++.+ |++.+++.+...+..++|+|||+|++|....+ ++.++.|+.+++||.|+|+ T Consensus 2 ~~~~f~~~~~~~~~~~~k~~~~~~~~~a~~~~~~~ie~~ak~~~pvdtG~L~~SI~~~v~~~g~~~~V~~~~~YA~yVE~ 81 (141) T protein:vir:78 2 NEFEFDSNIPKARKLIEKKVLQALEDIGEHMTTELAEGGHGVTSNNDTGEYAQKSGYKVRKSSKEVIVGNSSDYAIYYEF 81 (141) T ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccchhhcceeeeeecCCcEEEEecCCCccceeec Confidence 356666432 2233344344 45556666788899999999999999997654 5678999999999999999 Q ss_pred ccCCCC--------------C-------CCc--cchhhHHHHHhHHHHHHHHHHHHHHhc Q lcl|NC_016654. 71 EVGWHH--------------V-------DGQ--AKYLENAVNATQATVAEVIGEAIRRSI 107 (108) Q Consensus 71 ~~~~~~--------------~-------~~~--~k~le~a~~~~~~~i~~~i~~~ir~~L 107 (108) ||+-+. 
| .|+ .+||.+|+.++++++.++|.+.|++-= T Consensus 82 GTG~~~~~~~grk~~w~y~~~~g~~~~t~G~~aqpFl~~A~~~~~~~i~~~i~~~~~~l~ 141 (141) T protein:vir:78 82 GTGEKSERGGGKAGGWFYMDKKGHWHFTRGSQASKRMRYTFRDEQDKVRVFTERALRGIN 141 (141) T ss_pred CCcccccCCCCCcCcceeecCCCeeEeccCCCCchhhhhhHHhhHHHHHHHHHHHhhccC Confidence 984211 1 122 269999999999999998888887544 No 16 >protein:vir:106570 Length: 182 # NCBI annotation: putative protein # Family: family:all:6475 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958588;genbank:gi:41179258;genbank:GeneID:2717106 Probab=99.42 E-value=9e-16 Score=103.07 Aligned_cols=108 Identities=17% Similarity=0.198 Sum_probs=77.0 Q ss_pred CCccccH--HHH-----------HHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecc----cCcEEEEEecCc Q lcl|NC_016654. 1 MPVEFNY--GIA-----------ATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTAS----DGMEAVVYFDTP 63 (108) Q Consensus 1 m~vk~n~--~~~-----------~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~----~~~~g~V~y~~p 63 (108) |+|+|+. .+. +.++++..++++.++..+.+++..++|+|||+|++|....+ +..+|.|+-+++ T Consensus 2 ~~v~i~Gld~L~~kl~~~~~~~~~~v~~a~~~~~~~~a~~v~~~ak~~~PvdtG~Lr~SI~~~~~~~~~~~~g~V~~~~~ 81 (182) T protein:vir:10 2 IEVELKGVNELRAKLKKLPDIMAKATANAQENAIEQAEAYAVDELQSSIKYSTGELTRSFKHEVKVDGDEVIGRWWNSSM 81 (182) T ss_pred eEEEEecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCchhhhhceeeeeeecCCeEEEEeecCCC Confidence 8888873 122 23333444444555666777888999999999999986443 235789999999 Q ss_pred hhhhhccccCCCC-------------------------------------------------CCCc--cchhhHHHHHhH Q lcl|NC_016654. 64 YAARQHEEVGWHH-------------------------------------------------VDGQ--AKYLENAVNATQ 92 (108) Q Consensus 64 YA~~~h~~~~~~~-------------------------------------------------~~~~--~k~le~a~~~~~ 92 (108) ||.|+|+||+-.. 
.+|+ .+||.+++.+++ T Consensus 82 ya~yvE~GTG~~~~~~~~~~~p~~~~~~~~~~w~~~~~~v~~~~a~~~~~~~~~~~~~~~~~t~G~~aqPFl~pA~~~~~ 161 (182) T protein:vir:10 82 VAVFREFGTGLVGERSHKQLPKNVAIIYRQTPWFFPVDSVDLDLTKIYGIPKIKINGKYFYRTTGQPARQFMTPAANKMA 161 (182) T ss_pred ccceeecCcccccccCccccCccceeeeecCCceeeccccccccccccccceeeecCceEeecCCCCCCcchHHHHHHhH Confidence 9999999874110 0122 269999999999 Q ss_pred HHHHHHHH----HHHHHhcC Q lcl|NC_016654. 93 ATVAEVIG----EAIRRSIA 108 (108) Q Consensus 93 ~~i~~~i~----~~ir~~Lg 108 (108) +.+.+.|. ++||+.|| T Consensus 162 ~~i~~~i~~~i~~~l~~~~g 181 (182) T protein:vir:10 162 KEAPEIIKRSIDQELHDKLG 181 (182) T ss_pred HHHHHHHHHHHHHHHHHhhc Confidence 98887776 66677788 No 17 >protein:vir:107099 Length: 137 # NCBI annotation: conserved phage protein # Family: family:all:180 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950610;genbank:gi:119953690;genbank:GeneID:4643108 Probab=99.41 E-value=6.1e-16 Score=103.98 Aligned_cols=100 Identities=21% Similarity=0.307 Sum_probs=81.5 Q ss_pred CCccc-c-----H---HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecc--cCcEEEEEecCchhhhhc Q lcl|NC_016654. 1 MPVEF-N-----Y---GIAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTAS--DGMEAVVYFDTPYAARQH 69 (108) Q Consensus 1 m~vk~-n-----~---~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~--~~~~g~V~y~~pYA~~~h 69 (108) |+.-+ . . .+.+.+.++++++|+.++..+.+++...+|+|||+|++|..+.+ ++.++.|+.+++||.++| T Consensus 1 Ma~~~~Gl~~l~~~l~~~~~~~~~~~~~al~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~Ya~~vE 80 (137) T protein:vir:10 1 MAKVKYGNWELVKELEDFEKETIRWAKKGIAKTTTIIHNSIVSNMPVDTGYLRESVSMDFKKGGLTGVINIGSEYAVYVN 80 (137) T ss_pred CchhHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCeeEEeeCCcEEEEEecCCCcccccc Confidence 88743 1 1 24456777899999999999999999999999999999998664 456899999999999999 Q ss_pred cccCCCCCC------------------------Cc--cchhhHHHHHhHHHHHHHHH Q lcl|NC_016654. 
70 EEVGWHHVD------------------------GQ--AKYLENAVNATQATVAEVIG 100 (108) Q Consensus 70 ~~~~~~~~~------------------------~~--~k~le~a~~~~~~~i~~~i~ 100 (108) +||..+.+. ++ .+||++++.+++++|.+.|. T Consensus 81 ~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~i~k~i~ 137 (137) T protein:vir:10 81 YGTGIYAVGPGGSRAKNIPWCYKDADGHWHTTKGQHAQPFWEPAIDEGRAFFNKYFS 137 (137) T ss_pred cCccccccCCCccccccccceeeccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Confidence 998653211 11 25999999999999999998 No 18 >protein:vir:96121 Length: 137 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240082;genbank:gi:66395767;genbank:GeneID:5133101 Probab=99.40 E-value=1e-15 Score=102.77 Aligned_cols=100 Identities=15% Similarity=0.192 Sum_probs=81.0 Q ss_pred CCccc-c-----H---HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeec--ccCcEEEEEecCchhhhhc Q lcl|NC_016654. 1 MPVEF-N-----Y---GIAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTA--SDGMEAVVYFDTPYAARQH 69 (108) Q Consensus 1 m~vk~-n-----~---~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~--~~~~~g~V~y~~pYA~~~h 69 (108) |..-+ + . .+.+.+.+++.++|..++..+.+++...+|+|||+|++|..+. .++.++.|+.+++||.|+| T Consensus 1 Ma~~~~G~~~l~~~l~~~~~~~~~~~~~~l~~~a~~~~~~ak~~~pvdTG~L~~Si~~~~~~~g~~~~V~~~~~YA~yvE 80 (137) T protein:vir:96 1 MAKVKYGNWDLVAELEDYRDEMEEWVKKGILKTTLAIYNTAVALAPVDLGFLKESIDFKVTDGGFSSVISVGAEYAIYVE 80 (137) T ss_pred CchhHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCccchhcCceeEeecCceEEEEecCCCcccccc Confidence 76644 2 1 2345566788899999999999999999999999999999765 3566899999999999999 Q ss_pred cccCCCCCCC------------------------c--cchhhHHHHHhHHHHHHHHH Q lcl|NC_016654. 70 EEVGWHHVDG------------------------Q--AKYLENAVNATQATVAEVIG 100 (108) Q Consensus 70 ~~~~~~~~~~------------------------~--~k~le~a~~~~~~~i~~~i~ 100 (108) +||.++.+++ + .+||.+++.++++.|.+.|. 
T Consensus 81 ~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~pFl~pA~~~~~~~i~k~i~ 137 (137) T protein:vir:96 81 FGTGIYATGPGGSRARKLPWTYKGDDGEWHTTYGQQAQPFWNPAIDEGRKVFNRYFS 137 (137) T ss_pred cCccccccCCCccccccccceeeccCcceeecCCCCCCcchhHHHHHHHHHHHHhhC Confidence 9986542211 1 26999999999999999998 No 19 >protein:vir:94654 Length: 142 # NCBI annotation: tail component protein # Family: family:all:1084 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579211;genbank:gi:93007447;genbank:GeneID:5076773 Probab=99.39 E-value=1.7e-15 Score=101.52 Aligned_cols=103 Identities=22% Similarity=0.295 Sum_probs=80.8 Q ss_pred CCccccHH--------HHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeeccc--C--cEEEEEecCchhhhh Q lcl|NC_016654. 1 MPVEFNYG--------IAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASD--G--MEAVVYFDTPYAARQ 68 (108) Q Consensus 1 m~vk~n~~--------~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~--~--~~g~V~y~~pYA~~~ 68 (108) |+++++.. +.+.+.+++.++|..++..+..++...+|+|||+|++|..+.+. + .++.|+.+.+||.++ T Consensus 4 ~~~~~~~~~l~~~l~~~~~~~~~~~~~~l~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~g~~~~~~v~~~~~YA~~v 83 (142) T protein:vir:94 4 LNYRVNSTEFQGALRAALDRLTGAAREATEAAANDMVNMAKGLCPVDTGRLRSSIQAVPSGGRFSFSVTIGTNVTYAADV 83 (142) T ss_pred eEEEecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeccCCceEEEEEecCcccchhh Confidence 88888852 33456678999999999999999999999999999999976653 2 368899999999999 Q ss_pred ccccCCC-----------------------CCCCc-cchhhHHHHHhHHHHHHHHHHHHH Q lcl|NC_016654. 69 HEEVGWH-----------------------HVDGQ-AKYLENAVNATQATVAEVIGEAIR 104 (108) Q Consensus 69 h~~~~~~-----------------------~~~~~-~k~le~a~~~~~~~i~~~i~~~ir 104 (108) |+|+.-+ ||.-. 
.+||.+++.++++.|.+++ ++|| T Consensus 84 E~Gt~~~~i~pk~~k~l~~~~~~~~~~~v~~pG~~~~pfl~~A~~~~~~~i~~~~-~~~~ 142 (142) T protein:vir:94 84 EYGTAPHVIVPKDKKALYWPGAAHPVAKVNHPGTRAQPFMRPAIAAASTFLRNHA-KGIR 142 (142) T ss_pred hccCCCceeccCCCccceecccceeeeeeeecCCCCCcchhHHHHHHHHHHHHHH-HhcC Confidence 9997532 11111 2699999999998886655 4677 No 20 >protein:vir:98409 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:YP_001210363;genbank:gi:146334932;genbank:GeneID:5114801 Probab=99.37 E-value=2.7e-15 Score=100.48 Aligned_cols=99 Identities=10% Similarity=0.189 Sum_probs=82.3 Q ss_pred ccccH--HHHHHH-----HHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecc--cCcEEEEEecCchhhhhccccC Q lcl|NC_016654. 3 VEFNY--GIAATV-----RGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTAS--DGMEAVVYFDTPYAARQHEEVG 73 (108) Q Consensus 3 vk~n~--~~~~~v-----~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~--~~~~g~V~y~~pYA~~~h~~~~ 73 (108) |+|.. .+.+.+ ..++++++..++..+..++...+|+|||+|++|..+.. ++.++.|+.+++||.++++||. T Consensus 1 i~i~Gld~l~~~l~~~~~~~~~~~al~~~a~~i~~~ak~~apvdTG~Lr~si~~~~~~~~~~~~V~~~~~Ya~~vE~GT~ 80 (108) T protein:vir:98 1 MKITGIDALQKKLRKNATLNDVKHVVKRNTVSMNKNMQNLAPVDTGNMKRSITSEFTDGGLTGTTIPHTDYAGYVEYGTR 80 (108) T ss_pred CcchhHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhCCCCchhhHhhceeeeecCceEEEeecCCCccceeecccc Confidence 55553 233333 34577899999999999999999999999999998775 3568999999999999999998 Q ss_pred CCCCCCccchhhHHHHHhHHHHHHHHHHHHH Q lcl|NC_016654. 74 WHHVDGQAKYLENAVNATQATVAEVIGEAIR 104 (108) Q Consensus 74 ~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir 104 (108) +..|. 
+||.+++..+++.+.+.|.+.|| T Consensus 81 ~m~aq---PFl~pa~~~~~~~~~~~i~~~lr 108 (108) T protein:vir:98 81 FQAAQ---PFVKPAFDVQKKIFTNDLERLTK 108 (108) T ss_pred ccCCC---cchhhHHHHHHHHHHHHHHHHcC Confidence 54433 79999999999999999999999 No 21 >protein:vir:105330 Length: 137 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950673;genbank:gi:119967843;genbank:GeneID:4643209 Probab=99.36 E-value=1.6e-15 Score=101.70 Aligned_cols=100 Identities=20% Similarity=0.294 Sum_probs=79.8 Q ss_pred CCccc-c-----H---HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecc--cCcEEEEEecCchhhhhc Q lcl|NC_016654. 1 MPVEF-N-----Y---GIAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTAS--DGMEAVVYFDTPYAARQH 69 (108) Q Consensus 1 m~vk~-n-----~---~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~--~~~~g~V~y~~pYA~~~h 69 (108) |.--+ + . .+.+.+.+++.++|+.++..|.+++...+|+|||+|++|..+++ ++.++.|+.+++||.++| T Consensus 1 Ma~~~~G~~~l~~~l~~~~~~~~~~~~~al~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~vE 80 (137) T protein:vir:10 1 MAKVKYGNWDLVKELEEFEKETIRWAKKGIAKTTTIIHNSIVSNMPVDTGYLRESVSMDFKKGGLTGVINIGSEYAVYVN 80 (137) T ss_pred CccchhCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCeeeEecCCcEEEEEecCCccccccc Confidence 65432 2 1 24456677889999999999999999999999999999987664 355899999999999999 Q ss_pred cccCCCC------------------------CCCc--cchhhHHHHHhHHHHHHHHH Q lcl|NC_016654. 70 EEVGWHH------------------------VDGQ--AKYLENAVNATQATVAEVIG 100 (108) Q Consensus 70 ~~~~~~~------------------------~~~~--~k~le~a~~~~~~~i~~~i~ 100 (108) +||..+. 
.+++ .+||.+++.+++++|.+.|+ T Consensus 81 ~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 137 (137) T protein:vir:10 81 YGTGIYAVGPGGSRAKNIPWRYKDADGHWHTTKGQHAQPFWEPAIDEGRAFFNKYFS 137 (137) T ss_pred cCccccccCCCcccccccceeeeccccccccCCCCCCCcchhHHHHHHHHHHHHhhC Confidence 9985421 1122 26999999999999999999 No 22 >protein:vir:9930 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795692;genbank:gi:28876456;genbank:GeneID:1257995 Probab=99.36 E-value=4.6e-15 Score=99.20 Aligned_cols=101 Identities=17% Similarity=0.158 Sum_probs=85.0 Q ss_pred CCccccHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecccC-cEEEEEecCchhhhhccccCCCCCCC Q lcl|NC_016654. 1 MPVEFNYGIAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASDG-MEAVVYFDTPYAARQHEEVGWHHVDG 79 (108) Q Consensus 1 m~vk~n~~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~~-~~g~V~y~~pYA~~~h~~~~~~~~~~ 79 (108) .-=+|+ .+...+.++++++|..++..+.+++...+|+|||+|++|..+...+ ..+.|+-+++||.|+++||.+..+. T Consensus 7 l~~~l~-~~~~~~~~~v~~al~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~~~~~~~~v~~~~~Ya~~vE~GT~~m~a~- 84 (108) T protein:vir:99 7 FLRSVE-RKQKSVRIAVDKELSKSAARIERQAKILAPVDTGWLRAQIYSEQQRLLHYRVVSPALYSIYLELGTRKMEAQ- 84 (108) T ss_pred HHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCchhhhcceeeeecCcEEEEeecCcccchhcccCccccCCC- Confidence 111111 3445677788999999999999999999999999999999877654 5788999999999999999765443 Q ss_pred ccchhhHHHHHhHHHHHHHHHHHHHH Q lcl|NC_016654. 
80 QAKYLENAVNATQATVAEVIGEAIRR 105 (108) Q Consensus 80 ~~k~le~a~~~~~~~i~~~i~~~ir~ 105 (108) +||.+++..++..+.+.|.+.||| T Consensus 85 --Pf~~pa~~~~~~~~~~~i~~~lrk 108 (108) T protein:vir:99 85 --SFLDPALRKEWPVLMANIKKMFKR 108 (108) T ss_pred --cchhhhHHHHHHHHHHHHHHHhcC Confidence 799999999999999999999999 No 23 >protein:vir:743 Length: 108 # NCBI annotation: unknown # Family: family:all:180 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108720;genbank:gi:13487842;genbank:GeneID:920877 Probab=99.34 E-value=4.7e-15 Score=99.11 Aligned_cols=99 Identities=12% Similarity=0.203 Sum_probs=82.7 Q ss_pred ccccH--HHHHHH-----HHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecc--cCcEEEEEecCchhhhhccccC Q lcl|NC_016654. 3 VEFNY--GIAATV-----RGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTAS--DGMEAVVYFDTPYAARQHEEVG 73 (108) Q Consensus 3 vk~n~--~~~~~v-----~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~--~~~~g~V~y~~pYA~~~h~~~~ 73 (108) |+|.. .+.+.+ .+.++++|..++..|..++...+|+|||+|++|..+.. ++.++.|+.+++||.++++||. T Consensus 1 i~i~Gld~l~~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aPv~TG~Lr~si~~~~~~~~~~~~V~~~~~Ya~~vE~GT~ 80 (108) T protein:vir:74 1 MKITGIDALQKKLRKNATLDDVKHVVKSNTASMNKNMQNLAPVDTGNMKRSITSEFTDGGLSGTTGPHTDYAGYVEYGTR 80 (108) T ss_pred CcchhHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhCCCCchhhhccceeeeecCceEEEeecCCCcccceecccc Confidence 55553 233333 34578999999999999999999999999999998775 3458999999999999999997 Q ss_pred CCCCCCccchhhHHHHHhHHHHHHHHHHHHH Q lcl|NC_016654. 74 WHHVDGQAKYLENAVNATQATVAEVIGEAIR 104 (108) Q Consensus 74 ~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir 104 (108) +. 
++| +||.+++..++.++.+.|.+.|| T Consensus 81 km--~aq-pf~~pa~~~~~~~~~~~i~~~~k 108 (108) T protein:vir:74 81 FQ--SAQ-PFVKPAFNIQKKVFTNDLERLTK 108 (108) T ss_pred cc--CCC-cchhhHHHHHHHHHHHHHHHHcC Confidence 54 444 59999999999999999999999 No 24 >protein:vir:94108 Length: 149 # NCBI annotation: ORF029 # Family: family:all:180 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240238;genbank:gi:66395914;genbank:GeneID:5133277 Probab=99.34 E-value=2.5e-15 Score=100.59 Aligned_cols=100 Identities=18% Similarity=0.276 Sum_probs=78.4 Q ss_pred CCcccc-----H---HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecc--cCcEEEEEecCchhhhhcc Q lcl|NC_016654. 1 MPVEFN-----Y---GIAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTAS--DGMEAVVYFDTPYAARQHE 70 (108) Q Consensus 1 m~vk~n-----~---~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~--~~~~g~V~y~~pYA~~~h~ 70 (108) -+|++. . .+.+.+.+++.+++..++..|.+++...+|+|||+|++|..+.+ ++.+|.|+.+++||.++|+ T Consensus 14 a~~~~Gld~l~~~L~~~~~~~~~~~~~al~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~g~~~~V~~~~~YA~~VE~ 93 (149) T protein:vir:94 14 AKVKYGADSMVVELDKFDKKIEEWVKKGIAKTTTKIYNTAVALAPVDLGFLEESIDFKYFDGGLSSVISVGADYAIYVEY 93 (149) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhhcCeeEEeeCCcEEEEEecCCCccccccc Confidence 112211 1 23456677889999999999999999999999999999997654 4568999999999999999 Q ss_pred ccCCCCC------------------------CCc--cchhhHHHHHhHHHHHHHHH Q lcl|NC_016654. 71 EVGWHHV------------------------DGQ--AKYLENAVNATQATVAEVIG 100 (108) Q Consensus 71 ~~~~~~~------------------------~~~--~k~le~a~~~~~~~i~~~i~ 100 (108) ||..+.. +|+ .+||.+++..++++|.+.|. 
T Consensus 94 GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~~~~i~~~i~ 149 (149) T protein:vir:94 94 GTGIYATGPGGSRATKIPWSFKGDDGEWYTTYGQAPQPFWNPAIDAGRKTFEQYFS 149 (149) T ss_pred CccccccCCCccccccccceeecCccceecCCCCCCCcchHHHHHHHHHHHHHhhC Confidence 9854321 112 25999999999999999988 No 25 >protein:vir:105916 Length: 149 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004379;genbank:gi:122891834;genbank:GeneID:4712387 Probab=99.33 E-value=3e-15 Score=100.17 Aligned_cols=100 Identities=18% Similarity=0.276 Sum_probs=78.7 Q ss_pred CCcccc-----H---HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecc--cCcEEEEEecCchhhhhcc Q lcl|NC_016654. 1 MPVEFN-----Y---GIAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTAS--DGMEAVVYFDTPYAARQHE 70 (108) Q Consensus 1 m~vk~n-----~---~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~--~~~~g~V~y~~pYA~~~h~ 70 (108) -+|++. . .+...+.+++.++++.++..|.+++...+|+|||+|++|..+.+ ++.+|.|+.+++||.++|+ T Consensus 14 a~v~~Gld~l~~~l~~~~~~~~~~~~~~l~~~a~~v~~~ak~~aPvdTG~L~~SI~~~~~~~g~~~~V~~~~~YA~~vE~ 93 (149) T protein:vir:10 14 AKVKYGADSMVVELDKFDKKIEEWVKKGIAKTTTKIYNTAVALAPVDLGFLEESIDFKYFDGGLSSVISVGADYAIYVEY 93 (149) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhhccceEEecCCcEEEEEecCCCccccccc Confidence 122221 1 23456677889999999999999999999999999999987654 4568999999999999999 Q ss_pred ccCCCCC------------------------CCc--cchhhHHHHHhHHHHHHHHH Q lcl|NC_016654. 71 EVGWHHV------------------------DGQ--AKYLENAVNATQATVAEVIG 100 (108) Q Consensus 71 ~~~~~~~------------------------~~~--~k~le~a~~~~~~~i~~~i~ 100 (108) ||..+.. +|+ .+||.+++.++++++.+.|. 
T Consensus 94 GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~k~~i~~~i~ 149 (149) T protein:vir:10 94 GTGIYATGPGGSRATKIPWSFKGDDGEWYTTYGQAPQPFWNPAIDAGRKTFEQYFS 149 (149) T ss_pred CccccccCCcccccccccceeeccccceecCCCCCCCcchhHHHHHHHHHHHHhhC Confidence 9854321 112 26999999999999999998 No 26 >protein:vir:95894 Length: 137 # NCBI annotation: ORF046 # Family: family:all:180 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240389;genbank:gi:66396083;genbank:GeneID:5133405 Probab=99.33 E-value=5.6e-15 Score=98.71 Aligned_cols=100 Identities=21% Similarity=0.259 Sum_probs=80.1 Q ss_pred CCccccH---------HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecc--cCcEEEEEecCchhhhhc Q lcl|NC_016654. 1 MPVEFNY---------GIAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTAS--DGMEAVVYFDTPYAARQH 69 (108) Q Consensus 1 m~vk~n~---------~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~--~~~~g~V~y~~pYA~~~h 69 (108) |...+-. .+...+.+++.+++..++..+.+++...+|+|||+|++|..+.+ ++.+|.|+.+++||.++| T Consensus 1 Ma~~~~G~~~l~~~l~~~~~~~~~~~~~~~~~~a~~v~~~ak~~aPv~TG~L~~Si~~~~~~~~~~~~V~~~~~YA~~vE 80 (137) T protein:vir:95 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDGGFTGVINIGSEYAIYVN 80 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcCeeeEeeCCceEEEEecCCCcccccc Confidence 7765421 23456677888899999999999999999999999999987654 456899999999999999 Q ss_pred cccCCCCC------------------------CCc--cchhhHHHHHhHHHHHHHHH Q lcl|NC_016654. 70 EEVGWHHV------------------------DGQ--AKYLENAVNATQATVAEVIG 100 (108) Q Consensus 70 ~~~~~~~~------------------------~~~--~k~le~a~~~~~~~i~~~i~ 100 (108) +|+.-+.. .++ .+||.+++..+++++.+.|. 
T Consensus 81 ~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~i~k~l~ 137 (137) T protein:vir:95 81 YGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred cCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 99842211 111 26999999999999999999 No 27 >protein:vir:94796 Length: 137 # NCBI annotation: ORF050 # Family: family:all:180 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240540;genbank:gi:66396237;genbank:GeneID:5133576 Probab=99.32 E-value=4.6e-15 Score=99.17 Aligned_cols=100 Identities=21% Similarity=0.320 Sum_probs=78.8 Q ss_pred CC-ccccH--------HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecc--cCcEEEEEecCchhhhhc Q lcl|NC_016654. 1 MP-VEFNY--------GIAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTAS--DGMEAVVYFDTPYAARQH 69 (108) Q Consensus 1 m~-vk~n~--------~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~--~~~~g~V~y~~pYA~~~h 69 (108) |. |+++- .+...+.+++.++|+.++..+..++...+|+|||+|++|..+.+ ++.++.|+.+.+||.++| T Consensus 1 Ma~~~~G~~~l~~~L~~~~~~~~~~~~~al~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~vE 80 (137) T protein:vir:94 1 MAKVKYGNWDLVKELENYERDIERWVKRGIAKTTVKIHNTIISLMPVDTGYLRESVTMDFKDGGFTGVINIGSEYAIYVN 80 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCceeEeecCcEEEEEecCCCcccccc Confidence 54 33332 23445667788899999999999999999999999999997654 456899999999999999 Q ss_pred cccCCCCC------------------------CCc--cchhhHHHHHhHHHHHHHHH Q lcl|NC_016654. 70 EEVGWHHV------------------------DGQ--AKYLENAVNATQATVAEVIG 100 (108) Q Consensus 70 ~~~~~~~~------------------------~~~--~k~le~a~~~~~~~i~~~i~ 100 (108) +|+.-+.. .++ .+||.+++..+++++.+.|. 
T Consensus 81 ~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:94 81 YGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRVFFNKYFS 137 (137) T ss_pred cCccccccCCCcccccccccceeccCCceeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 99743211 111 26999999999999999998 No 28 >protein:vir:94490 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240680;genbank:gi:66396374;genbank:GeneID:5133754 Probab=99.31 E-value=7.6e-15 Score=97.98 Aligned_cols=100 Identities=21% Similarity=0.261 Sum_probs=79.8 Q ss_pred CCccccH---------HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecc--cCcEEEEEecCchhhhhc Q lcl|NC_016654. 1 MPVEFNY---------GIAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTAS--DGMEAVVYFDTPYAARQH 69 (108) Q Consensus 1 m~vk~n~---------~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~--~~~~g~V~y~~pYA~~~h 69 (108) |...+-. .+...+.+++.+++..++..+..++...+|+|||+|++|..+.+ ++.+|.|+.+.+||.++| T Consensus 1 Ma~~~~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~vE 80 (137) T protein:vir:94 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVN 80 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEeecCceEEEEecCCCcccccc Confidence 7765521 23456667788889999999999999999999999999987654 455899999999999999 Q ss_pred cccCCCCC------------------------CCc--cchhhHHHHHhHHHHHHHHH Q lcl|NC_016654. 70 EEVGWHHV------------------------DGQ--AKYLENAVNATQATVAEVIG 100 (108) Q Consensus 70 ~~~~~~~~------------------------~~~--~k~le~a~~~~~~~i~~~i~ 100 (108) +|+.-+.. .++ .+||.+++..+++.+.+.|. 
T Consensus 81 ~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:94 81 YGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred cCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 99842210 111 26999999999999999998 No 29 >protein:vir:93738 Length: 137 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240463;genbank:gi:66396153;genbank:GeneID:5133507 Probab=99.31 E-value=7.6e-15 Score=97.98 Aligned_cols=100 Identities=21% Similarity=0.261 Sum_probs=79.8 Q ss_pred CCccccH---------HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecc--cCcEEEEEecCchhhhhc Q lcl|NC_016654. 1 MPVEFNY---------GIAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTAS--DGMEAVVYFDTPYAARQH 69 (108) Q Consensus 1 m~vk~n~---------~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~--~~~~g~V~y~~pYA~~~h 69 (108) |...+-. .+...+.+++.+++..++..+..++...+|+|||+|++|..+.+ ++.+|.|+.+.+||.++| T Consensus 1 Ma~~~~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~vE 80 (137) T protein:vir:93 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVN 80 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEeecCceEEEEecCCCcccccc Confidence 7765521 23456667788889999999999999999999999999987654 455899999999999999 Q ss_pred cccCCCCC------------------------CCc--cchhhHHHHHhHHHHHHHHH Q lcl|NC_016654. 70 EEVGWHHV------------------------DGQ--AKYLENAVNATQATVAEVIG 100 (108) Q Consensus 70 ~~~~~~~~------------------------~~~--~k~le~a~~~~~~~i~~~i~ 100 (108) +|+.-+.. .++ .+||.+++..+++.+.+.|. 
T Consensus 81 ~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:93 81 YGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred cCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 99842210 111 26999999999999999998 No 30 >protein:vir:97427 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240753;genbank:gi:66396447;genbank:GeneID:5133783 Probab=99.31 E-value=7.6e-15 Score=97.98 Aligned_cols=100 Identities=21% Similarity=0.261 Sum_probs=79.8 Q ss_pred CCccccH---------HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecc--cCcEEEEEecCchhhhhc Q lcl|NC_016654. 1 MPVEFNY---------GIAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTAS--DGMEAVVYFDTPYAARQH 69 (108) Q Consensus 1 m~vk~n~---------~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~--~~~~g~V~y~~pYA~~~h 69 (108) |...+-. .+...+.+++.+++..++..+..++...+|+|||+|++|..+.+ ++.+|.|+.+.+||.++| T Consensus 1 Ma~~~~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~vE 80 (137) T protein:vir:97 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVN 80 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEeecCceEEEEecCCCcccccc Confidence 7765521 23456667788889999999999999999999999999987654 455899999999999999 Q ss_pred cccCCCCC------------------------CCc--cchhhHHHHHhHHHHHHHHH Q lcl|NC_016654. 70 EEVGWHHV------------------------DGQ--AKYLENAVNATQATVAEVIG 100 (108) Q Consensus 70 ~~~~~~~~------------------------~~~--~k~le~a~~~~~~~i~~~i~ 100 (108) +|+.-+.. .++ .+||.+++..+++.+.+.|. 
T Consensus 81 ~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:97 81 YGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred cCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 99842210 111 26999999999999999998 No 31 >protein:vir:1243 Length: 116 # NCBI annotation: similar to phage Spp1 gp16.1 # Family: family:all:180 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510942;genbank:gi:17426276;genbank:GeneID:927389 Probab=99.31 E-value=4.2e-15 Score=99.40 Aligned_cols=88 Identities=23% Similarity=0.325 Sum_probs=75.5 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecc--cCcEEEEEecCchhhhhccccCCCCCC------------ Q lcl|NC_016654. 13 VRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTAS--DGMEAVVYFDTPYAARQHEEVGWHHVD------------ 78 (108) Q Consensus 13 v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~--~~~~g~V~y~~pYA~~~h~~~~~~~~~------------ 78 (108) |+++++++++.++..+.+++...+|+|||+|++|..+.+ ++.+|.|+-+++||.|+|+||..+.++ T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~V~~~~~YA~yvE~GTg~~~~~~~~~~~~~~~~~ 80 (116) T protein:vir:12 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDGGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWS 80 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcCcccccccceEEeecCcEEEEEecCCCcccccccCCcccccCCCccccccccee Confidence 899999999999999999999999999999999997654 456899999999999999997543211 Q ss_pred ------------Cc--cchhhHHHHHhHHHHHHHHH Q lcl|NC_016654. 79 ------------GQ--AKYLENAVNATQATVAEVIG 100 (108) Q Consensus 79 ------------~~--~k~le~a~~~~~~~i~~~i~ 100 (108) ++ .+||.+|+.++++.|.+.|. 
T Consensus 81 ~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 116 (116) T protein:vir:12 81 YKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred eecCCceeeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 11 26999999999999999888 No 32 >protein:vir:97327 Length: 116 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240615;genbank:gi:66396305;genbank:GeneID:5133683 Probab=99.31 E-value=4.2e-15 Score=99.40 Aligned_cols=88 Identities=23% Similarity=0.325 Sum_probs=75.5 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecc--cCcEEEEEecCchhhhhccccCCCCCC------------ Q lcl|NC_016654. 13 VRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTAS--DGMEAVVYFDTPYAARQHEEVGWHHVD------------ 78 (108) Q Consensus 13 v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~--~~~~g~V~y~~pYA~~~h~~~~~~~~~------------ 78 (108) |+++++++++.++..+.+++...+|+|||+|++|..+.+ ++.+|.|+-+++||.|+|+||..+.++ T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~V~~~~~YA~yvE~GTg~~~~~~~~~~~~~~~~~ 80 (116) T protein:vir:97 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDGGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWS 80 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcCcccccccceEEeecCcEEEEEecCCCcccccccCCcccccCCCccccccccee Confidence 899999999999999999999999999999999997654 456899999999999999997543211 Q ss_pred ------------Cc--cchhhHHHHHhHHHHHHHHH Q lcl|NC_016654. 79 ------------GQ--AKYLENAVNATQATVAEVIG 100 (108) Q Consensus 79 ------------~~--~k~le~a~~~~~~~i~~~i~ 100 (108) ++ .+||.+|+.++++.|.+.|. 
T Consensus 81 ~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 116 (116) T protein:vir:97 81 YKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred eecCCceeeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 11 26999999999999999888 No 33 >protein:vir:96829 Length: 135 # NCBI annotation: ORF033 # Family: family:all:180 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240161;genbank:gi:66395838;genbank:GeneID:5133170 Probab=99.31 E-value=9.3e-15 Score=97.50 Aligned_cols=100 Identities=23% Similarity=0.310 Sum_probs=79.2 Q ss_pred CCcc-ccH--------HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecc--cCcEEEEEecCchhhhhc Q lcl|NC_016654. 1 MPVE-FNY--------GIAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTAS--DGMEAVVYFDTPYAARQH 69 (108) Q Consensus 1 m~vk-~n~--------~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~--~~~~g~V~y~~pYA~~~h 69 (108) |... +.- .+...+.+++.+++..+++.+..++...+|+|||+|++|..+.+ ++.+|.|+.+++||.++| T Consensus 1 Ma~~~~Gl~~l~~~l~~~~~~~~~~~~~al~~~a~~v~~~ak~~apvdTG~Lr~SI~~~~~~~g~~~~V~~~~~YA~~ve 80 (135) T protein:vir:96 1 MAKVKYGADSIVVDLEKYSKDMEKWVKKGITKTTLKIYNTAIHLMPVDTGFLRQSTTVDFENGGFTGVVKIGSNYAVYVN 80 (135) T ss_pred CchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeEEeecCcEEEEEecCCCccchhh Confidence 8843 331 23445667888899999999999999999999999999987654 455899999999999999 Q ss_pred cccCCC---------------CCC-------Cc--cchhhHHHHHhHHHHHHHHH Q lcl|NC_016654. 70 EEVGWH---------------HVD-------GQ--AKYLENAVNATQATVAEVIG 100 (108) Q Consensus 70 ~~~~~~---------------~~~-------~~--~k~le~a~~~~~~~i~~~i~ 100 (108) +|+.-+ ++. ++ .+||.+++...++.+.++|. 
T Consensus 81 ~GT~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~a~pfl~~A~~~~~~~~~~~i~ 135 (135) T protein:vir:96 81 YGTGIYATKGSRAHKIPWTYKDPNGKWHTTYGQMPQPFWEPAIDAGRQTFEQYFS 135 (135) T ss_pred cccccccCCCccccccccccccCCcceeecCCcCCCcchhHHHHHHHHHHHHhcC Confidence 988432 111 11 26999999999999999998 No 34 >protein:vir:95062 Length: 116 # NCBI annotation: ORF044 # Family: family:all:180 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240827;genbank:gi:66394711;genbank:GeneID:5133856 Probab=99.28 E-value=7.5e-15 Score=98.01 Aligned_cols=88 Identities=23% Similarity=0.327 Sum_probs=75.0 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecc--cCcEEEEEecCchhhhhccccCCCCC------------- Q lcl|NC_016654. 13 VRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTAS--DGMEAVVYFDTPYAARQHEEVGWHHV------------- 77 (108) Q Consensus 13 v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~--~~~~g~V~y~~pYA~~~h~~~~~~~~------------- 77 (108) |++++++++..++..+.+++...+|+|||+|++|..+.+ ++.+|.|+-+++||.|+|+|+..+.. T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~apv~TG~Lr~SI~~~~~~~~~~~~V~~~~~Ya~yvE~GTg~~~~~~~~~~~~~~~~~ 80 (116) T protein:vir:95 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDGGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKNIPWS 80 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHhhCCccccccccceeEEeecCcEEEEEecCCCccceeecCccccccCCCccccccccce Confidence 899999999999999999999999999999999997654 45689999999999999999643211 Q ss_pred -----------CCc--cchhhHHHHHhHHHHHHHHH Q lcl|NC_016654. 78 -----------DGQ--AKYLENAVNATQATVAEVIG 100 (108) Q Consensus 78 -----------~~~--~k~le~a~~~~~~~i~~~i~ 100 (108) +++ .+||.+|+.++++.+.+.|. 
T Consensus 81 ~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~is 116 (116) T protein:vir:95 81 YKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred eecCccceeeCCCCCCCcchHHHHHHHHHHHHHhhC Confidence 122 26999999999999999888 No 35 >protein:vir:8669 Length: 142 # NCBI annotation: gp27 # Family: family:all:1084 # MgeID: mge:156 # MgeName: Rosebush # Cross-refs: genbank:acc:NP_817788;genbank:gi:29566220;genbank:GeneID:1259476 Probab=99.22 E-value=2.7e-14 Score=94.97 Aligned_cols=101 Identities=26% Similarity=0.254 Sum_probs=78.6 Q ss_pred CCccccH--------HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecccC------cEEEEEecCchhh Q lcl|NC_016654. 1 MPVEFNY--------GIAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASDG------MEAVVYFDTPYAA 66 (108) Q Consensus 1 m~vk~n~--------~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~~------~~g~V~y~~pYA~ 66 (108) |++.+.. .+...++.++++++..++..|..++...+|+|||+|++|...++.. .++.|+.+++||. T Consensus 2 ~~~~~~~~gl~~~l~~~~~~~~~~~~~~i~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~~v~~~a~YA~ 81 (142) T protein:vir:86 2 VQVSVRYEGFDYNPVGAAAQVGPILRRTHSSLTRQIANETRARVPVLTGHLGRSVREDPQVMVTPFHVSGGVTAHAKYAA 81 (142) T ss_pred ceeEEEeeecchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeeeeccccccceEEEEeccCccccc Confidence 5555553 2556677889999999999999999999999999999998765421 3577888999999 Q ss_pred hhccccCCC-------------------------CCCCc-cchhhHHHHHhHHHHHHHHHH Q lcl|NC_016654. 
67 RQHEEVGWH-------------------------HVDGQ-AKYLENAVNATQATVAEVIGE 101 (108) Q Consensus 67 ~~h~~~~~~-------------------------~~~~~-~k~le~a~~~~~~~i~~~i~~ 101 (108) ++|+||..+ ||.-+ .+||++++..++++..+++-+ T Consensus 82 ~ve~GT~ph~i~pk~~~al~f~~~g~~~~~k~v~hpG~~a~Pfl~~A~~~~~~~~~~~~~r 142 (142) T protein:vir:86 82 AVHEGTRPHVIRAKHAQALHFWWRGREVFVRQVNHPGTRARPYLRNAGEAVVRRDRRIRVR 142 (142) T ss_pred eeccCCccceeccccCceeeEecCCceeeeeeeecCCCCCCchhHHHHHHHHhhhhhhccC Confidence 999998521 23211 369999999999888777666 No 36 >protein:vir:99101 Length: 142 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1608 # MgeName: Qyrzula # Cross-refs: genbank:acc:YP_655705;genbank:gi:109521783;genbank:GeneID:4157823 Probab=99.22 E-value=2.7e-14 Score=94.97 Aligned_cols=101 Identities=26% Similarity=0.254 Sum_probs=78.6 Q ss_pred CCccccH--------HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecccC------cEEEEEecCchhh Q lcl|NC_016654. 1 MPVEFNY--------GIAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASDG------MEAVVYFDTPYAA 66 (108) Q Consensus 1 m~vk~n~--------~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~~------~~g~V~y~~pYA~ 66 (108) |++.+.. .+...++.++++++..++..|..++...+|+|||+|++|...++.. .++.|+.+++||. T Consensus 2 ~~~~~~~~gl~~~l~~~~~~~~~~~~~~i~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~~v~~~a~YA~ 81 (142) T protein:vir:99 2 VQVSVRYEGFDYNPVGAAAQVGPILRRTHSSLTRQIANETRARVPVLTGHLGRSVREDPQVMVTPFHVSGGVTAHAKYAA 81 (142) T ss_pred ceeEEEeeecchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeeeeccccccceEEEEeccCccccc Confidence 5555553 2556677889999999999999999999999999999998765421 3577888999999 Q ss_pred hhccccCCC-------------------------CCCCc-cchhhHHHHHhHHHHHHHHHH Q lcl|NC_016654. 
67 RQHEEVGWH-------------------------HVDGQ-AKYLENAVNATQATVAEVIGE 101 (108) Q Consensus 67 ~~h~~~~~~-------------------------~~~~~-~k~le~a~~~~~~~i~~~i~~ 101 (108) ++|+||..+ ||.-+ .+||++++..++++..+++-+ T Consensus 82 ~ve~GT~ph~i~pk~~~al~f~~~g~~~~~k~v~hpG~~a~Pfl~~A~~~~~~~~~~~~~r 142 (142) T protein:vir:99 82 AVHEGTRPHVIRAKHAQALHFWWRGREVFVRQVNHPGTRARPYLRNAGEAVVRRDRRIRVR 142 (142) T ss_pred eeccCCccceeccccCceeeEecCCceeeeeeeecCCCCCCchhHHHHHHHHhhhhhhccC Confidence 999998521 23211 369999999999888777666 No 37 >protein:vir:99744 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004311;genbank:gi:122891765;genbank:GeneID:4712299 Probab=99.21 E-value=7.9e-14 Score=92.41 Aligned_cols=99 Identities=8% Similarity=0.084 Sum_probs=81.3 Q ss_pred ccccH--H-------HHHHHHHHHHHHHHHHHHHHHHHhhhcC------CcccchhhcceeecccC-cEEEEEecCchhh Q lcl|NC_016654. 3 VEFNY--G-------IAATVRGAAKSGLHDAAEVVKQEAIERC------PKETGALRNSAGTASDG-MEAVVYFDTPYAA 66 (108) Q Consensus 3 vk~n~--~-------~~~~v~~a~~~al~~~~~~v~~~s~~~v------P~dtG~L~~S~~v~~~~-~~g~V~y~~pYA~ 66 (108) |+|.. . +.+.+.+++.++|..++..+..++...+ |.|||+|++|+.+..++ .++.|+.++.||. T Consensus 1 i~i~Gld~L~~~l~~~~~~~~~~v~~av~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~SI~~~~~g~~~~~V~~~~~Ya~ 80 (115) T protein:vir:99 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTVDLQYTITSHAAYSG 80 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcchhhhhceeeeecCcEEEEecCCccccc Confidence 55553 1 2344556778888888888888887664 99999999999887665 5899999999999 Q ss_pred hhccccCCCCCCCccchhhHHHHHhHHHHHHHHHHHHH Q lcl|NC_016654. 67 RQHEEVGWHHVDGQAKYLENAVNATQATVAEVIGEAIR 104 (108) Q Consensus 67 ~~h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir 104 (108) |+++||.+-.+. 
+||.++++.++..+.+.|.+.+| T Consensus 81 ~vE~GT~~m~a~---PFl~PA~~~~k~~~~~~l~~~~k 115 (115) T protein:vir:99 81 FLEFGTRYMEAE---PFMWPVYEVIRKSTVEELKTLFE 115 (115) T ss_pred cccccccccCCC---CcchhhHHHHHHHHHHHHHHHhC Confidence 999999754443 69999999999999999999999 No 38 >protein:vir:96225 Length: 115 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239574;genbank:gi:66395330;genbank:GeneID:5132773 Probab=99.21 E-value=1.1e-13 Score=91.68 Aligned_cols=99 Identities=8% Similarity=0.096 Sum_probs=79.9 Q ss_pred ccccH--H-------HHHHHHHHHHHHHHHHHHHHHHHhhhcC------Ccccchhhcceeeccc-CcEEEEEecCchhh Q lcl|NC_016654. 3 VEFNY--G-------IAATVRGAAKSGLHDAAEVVKQEAIERC------PKETGALRNSAGTASD-GMEAVVYFDTPYAA 66 (108) Q Consensus 3 vk~n~--~-------~~~~v~~a~~~al~~~~~~v~~~s~~~v------P~dtG~L~~S~~v~~~-~~~g~V~y~~pYA~ 66 (108) |+|.. . +.+.+.+++.+++...+..+..++...+ |+|||+|++|..+..+ +.++.|+.+++||+ T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~ 80 (115) T protein:vir:96 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITSHAAYSG 80 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeecCccchh Confidence 55553 2 2334556677788888888888877665 9999999999987754 46889999999999 Q ss_pred hhccccCCCCCCCccchhhHHHHHhHHHHHHHHHHHHH Q lcl|NC_016654. 67 RQHEEVGWHHVDGQAKYLENAVNATQATVAEVIGEAIR 104 (108) Q Consensus 67 ~~h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir 104 (108) |+++||.+-.|. 
+||.+++..++..+.+.|.+.++ T Consensus 81 ~vE~GT~km~a~---Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:96 81 FLEFGTRYMEAE---PFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred hhcccccccCCC---CchhhhHHHHHHHHHHHHHHHhC Confidence 999999755443 69999999999999999999999 No 39 >protein:vir:97144 Length: 115 # NCBI annotation: ORF047 # Family: family:all:180 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239729;genbank:gi:66394911;genbank:GeneID:5130877 Probab=99.21 E-value=1.1e-13 Score=91.68 Aligned_cols=99 Identities=8% Similarity=0.096 Sum_probs=79.9 Q ss_pred ccccH--H-------HHHHHHHHHHHHHHHHHHHHHHHhhhcC------Ccccchhhcceeeccc-CcEEEEEecCchhh Q lcl|NC_016654. 3 VEFNY--G-------IAATVRGAAKSGLHDAAEVVKQEAIERC------PKETGALRNSAGTASD-GMEAVVYFDTPYAA 66 (108) Q Consensus 3 vk~n~--~-------~~~~v~~a~~~al~~~~~~v~~~s~~~v------P~dtG~L~~S~~v~~~-~~~g~V~y~~pYA~ 66 (108) |+|.. . +.+.+.+++.+++...+..+..++...+ |+|||+|++|..+..+ +.++.|+.+++||+ T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~ 80 (115) T protein:vir:97 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITSHAAYSG 80 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeecCccchh Confidence 55553 2 2334556677788888888888877665 9999999999987754 46889999999999 Q ss_pred hhccccCCCCCCCccchhhHHHHHhHHHHHHHHHHHHH Q lcl|NC_016654. 67 RQHEEVGWHHVDGQAKYLENAVNATQATVAEVIGEAIR 104 (108) Q Consensus 67 ~~h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir 104 (108) |+++||.+-.|. 
+||.+++..++..+.+.|.+.++ T Consensus 81 ~vE~GT~km~a~---Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:97 81 FLEFGTRYMEAE---PFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred hhcccccccCCC---CchhhhHHHHHHHHHHHHHHHhC Confidence 999999755443 69999999999999999999999 No 40 >protein:vir:78858 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285365;genbank:gi:148717893;genbank:GeneID:5246989 Probab=99.21 E-value=1.1e-13 Score=91.68 Aligned_cols=99 Identities=8% Similarity=0.096 Sum_probs=79.9 Q ss_pred ccccH--H-------HHHHHHHHHHHHHHHHHHHHHHHhhhcC------Ccccchhhcceeeccc-CcEEEEEecCchhh Q lcl|NC_016654. 3 VEFNY--G-------IAATVRGAAKSGLHDAAEVVKQEAIERC------PKETGALRNSAGTASD-GMEAVVYFDTPYAA 66 (108) Q Consensus 3 vk~n~--~-------~~~~v~~a~~~al~~~~~~v~~~s~~~v------P~dtG~L~~S~~v~~~-~~~g~V~y~~pYA~ 66 (108) |+|.. . +.+.+.+++.+++...+..+..++...+ |+|||+|++|..+..+ +.++.|+.+++||+ T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~ 80 (115) T protein:vir:78 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITSHAAYSG 80 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeecCccchh Confidence 55553 2 2334556677788888888888877665 9999999999987754 46889999999999 Q ss_pred hhccccCCCCCCCccchhhHHHHHhHHHHHHHHHHHHH Q lcl|NC_016654. 67 RQHEEVGWHHVDGQAKYLENAVNATQATVAEVIGEAIR 104 (108) Q Consensus 67 ~~h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir 104 (108) |+++||.+-.|. 
+||.+++..++..+.+.|.+.++ T Consensus 81 ~vE~GT~km~a~---Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:78 81 FLEFGTRYMEAE---PFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred hhcccccccCCC---CchhhhHHHHHHHHHHHHHHHhC Confidence 999999755443 69999999999999999999999 No 41 >protein:vir:103917 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873996;genbank:gi:118430771;genbank:GeneID:4525409 Probab=99.21 E-value=1.1e-13 Score=91.68 Aligned_cols=99 Identities=8% Similarity=0.096 Sum_probs=79.9 Q ss_pred ccccH--H-------HHHHHHHHHHHHHHHHHHHHHHHhhhcC------Ccccchhhcceeeccc-CcEEEEEecCchhh Q lcl|NC_016654. 3 VEFNY--G-------IAATVRGAAKSGLHDAAEVVKQEAIERC------PKETGALRNSAGTASD-GMEAVVYFDTPYAA 66 (108) Q Consensus 3 vk~n~--~-------~~~~v~~a~~~al~~~~~~v~~~s~~~v------P~dtG~L~~S~~v~~~-~~~g~V~y~~pYA~ 66 (108) |+|.. . +.+.+.+++.+++...+..+..++...+ |+|||+|++|..+..+ +.++.|+.+++||+ T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~ 80 (115) T protein:vir:10 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITSHAAYSG 80 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeecCccchh Confidence 55553 2 2334556677788888888888877665 9999999999987754 46889999999999 Q ss_pred hhccccCCCCCCCccchhhHHHHHhHHHHHHHHHHHHH Q lcl|NC_016654. 67 RQHEEVGWHHVDGQAKYLENAVNATQATVAEVIGEAIR 104 (108) Q Consensus 67 ~~h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir 104 (108) |+++||.+-.|. 
+||.+++..++..+.+.|.+.++ T Consensus 81 ~vE~GT~km~a~---Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:10 81 FLEFGTRYMEAE---PFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred hhcccccccCCC---CchhhhHHHHHHHHHHHHHHHhC Confidence 999999755443 69999999999999999999999 No 42 >protein:vir:9312 Length: 115 # NCBI annotation: phi Mu50B-like protein # Family: family:all:180 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803290;genbank:gi:29028600;genbank:GeneID:1258048 Probab=99.21 E-value=1.1e-13 Score=91.68 Aligned_cols=99 Identities=8% Similarity=0.096 Sum_probs=79.9 Q ss_pred ccccH--H-------HHHHHHHHHHHHHHHHHHHHHHHhhhcC------Ccccchhhcceeeccc-CcEEEEEecCchhh Q lcl|NC_016654. 3 VEFNY--G-------IAATVRGAAKSGLHDAAEVVKQEAIERC------PKETGALRNSAGTASD-GMEAVVYFDTPYAA 66 (108) Q Consensus 3 vk~n~--~-------~~~~v~~a~~~al~~~~~~v~~~s~~~v------P~dtG~L~~S~~v~~~-~~~g~V~y~~pYA~ 66 (108) |+|.. . +.+.+.+++.+++...+..+..++...+ |+|||+|++|..+..+ +.++.|+.+++||+ T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~ 80 (115) T protein:vir:93 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITSHAAYSG 80 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeecCccchh Confidence 55553 2 2334556677788888888888877665 9999999999987754 46889999999999 Q ss_pred hhccccCCCCCCCccchhhHHHHHhHHHHHHHHHHHHH Q lcl|NC_016654. 67 RQHEEVGWHHVDGQAKYLENAVNATQATVAEVIGEAIR 104 (108) Q Consensus 67 ~~h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir 104 (108) |+++||.+-.|. 
+||.+++..++..+.+.|.+.++ T Consensus 81 ~vE~GT~km~a~---Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:93 81 FLEFGTRYMEAE---PFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred hhcccccccCCC---CchhhhHHHHHHHHHHHHHHHhC Confidence 999999755443 69999999999999999999999 No 43 >protein:vir:96358 Length: 115 # NCBI annotation: ORF045 # Family: family:all:180 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239651;genbank:gi:66395408;genbank:GeneID:5132834 Probab=99.21 E-value=1.1e-13 Score=91.68 Aligned_cols=99 Identities=8% Similarity=0.096 Sum_probs=79.9 Q ss_pred ccccH--H-------HHHHHHHHHHHHHHHHHHHHHHHhhhcC------Ccccchhhcceeeccc-CcEEEEEecCchhh Q lcl|NC_016654. 3 VEFNY--G-------IAATVRGAAKSGLHDAAEVVKQEAIERC------PKETGALRNSAGTASD-GMEAVVYFDTPYAA 66 (108) Q Consensus 3 vk~n~--~-------~~~~v~~a~~~al~~~~~~v~~~s~~~v------P~dtG~L~~S~~v~~~-~~~g~V~y~~pYA~ 66 (108) |+|.. . +.+.+.+++.+++...+..+..++...+ |+|||+|++|..+..+ +.++.|+.+++||+ T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~ 80 (115) T protein:vir:96 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITSHAAYSG 80 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeecCccchh Confidence 55553 2 2334556677788888888888877665 9999999999987754 46889999999999 Q ss_pred hhccccCCCCCCCccchhhHHHHHhHHHHHHHHHHHHH Q lcl|NC_016654. 67 RQHEEVGWHHVDGQAKYLENAVNATQATVAEVIGEAIR 104 (108) Q Consensus 67 ~~h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir 104 (108) |+++||.+-.|. 
+||.+++..++..+.+.|.+.++ T Consensus 81 ~vE~GT~km~a~---Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:96 81 FLEFGTRYMEAE---PFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred hhcccccccCCC---CchhhhHHHHHHHHHHHHHHHhC Confidence 999999755443 69999999999999999999999 No 44 >protein:vir:2740 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695113;genbank:gi:23455882;genbank:GeneID:955595 Probab=99.19 E-value=1e-13 Score=91.86 Aligned_cols=102 Identities=17% Similarity=0.105 Sum_probs=81.1 Q ss_pred CC-ccccH--HHHHH---------HHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecccCcEEEEEecCchhhhh Q lcl|NC_016654. 1 MP-VEFNY--GIAAT---------VRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASDGMEAVVYFDTPYAARQ 68 (108) Q Consensus 1 m~-vk~n~--~~~~~---------v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~~~~g~V~y~~pYA~~~ 68 (108) |. |+|.. .+.+. ++++.++++...++.+...+...+|+|||+|++|..+..+.+.+.|+.+++||.++ T Consensus 1 Ma~i~~~Gld~l~~~L~~~~~~~~v~~~~~~~~~~~~~~~~~~a~~~~p~~TG~Lr~sI~~~~~~~~~~V~~~~~Ya~~v 80 (114) T protein:vir:27 1 MATIEFEGLDEMAQSLLKNASPEKRSKVLRKYGSKLKEAAVNRAQFNKGYSTGATRRSITLQVESDKATVEALTSYSGYL 80 (114) T ss_pred CeeeeeehHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcccCCCCCchhhhhceeeeecCCeeEecCCCCcccee Confidence 65 67763 22222 34455555556666666667778899999999999998888889999999999999 Q ss_pred ccccCCCCCCCccchhhHHHHHhHHHHHHHHHHHHHH Q lcl|NC_016654. 69 HEEVGWHHVDGQAKYLENAVNATQATVAEVIGEAIRR 105 (108) Q Consensus 69 h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~ 105 (108) ++||.+..|. 
+||.++++.++.++.+.+.+-++- T Consensus 81 EfGT~km~a~---Pfl~PA~~~~~~~~~~~l~~l~k~ 114 (114) T protein:vir:27 81 EVGTRKMEAQ---PFMKPALDEVAPKMVEELAKWDET 114 (114) T ss_pred cccccccCCC---CchhhhHHHHHHHHHHHHHHHhcC Confidence 9999866554 799999999999999999888888 No 45 >protein:vir:4906 Length: 114 # NCBI annotation: gp114 # Family: family:all:180 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056684;genbank:gi:9635019;genbank:GeneID:1262668 Probab=99.19 E-value=1e-13 Score=91.86 Aligned_cols=102 Identities=17% Similarity=0.105 Sum_probs=81.1 Q ss_pred CC-ccccH--HHHHH---------HHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecccCcEEEEEecCchhhhh Q lcl|NC_016654. 1 MP-VEFNY--GIAAT---------VRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASDGMEAVVYFDTPYAARQ 68 (108) Q Consensus 1 m~-vk~n~--~~~~~---------v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~~~~g~V~y~~pYA~~~ 68 (108) |. |+|.. .+.+. ++++.++++...++.+...+...+|+|||+|++|..+..+.+.+.|+.+++||.++ T Consensus 1 Ma~i~~~Gld~l~~~L~~~~~~~~v~~~~~~~~~~~~~~~~~~a~~~~p~~TG~Lr~sI~~~~~~~~~~V~~~~~Ya~~v 80 (114) T protein:vir:49 1 MATIEFEGLDEMAQSLLKNASPEKRSKVLRKYGSKLKEAAVNRAQFNKGYSTGATRRSITLQVESDKATVEALTSYSGYL 80 (114) T ss_pred CeeeeeehHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcccCCCCCchhhhhceeeeecCCeeEecCCCCcccee Confidence 65 67763 22222 34455555556666666667778899999999999998888889999999999999 Q ss_pred ccccCCCCCCCccchhhHHHHHhHHHHHHHHHHHHHH Q lcl|NC_016654. 69 HEEVGWHHVDGQAKYLENAVNATQATVAEVIGEAIRR 105 (108) Q Consensus 69 h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~ 105 (108) ++||.+..|. 
+||.++++.++.++.+.+.+-++- T Consensus 81 EfGT~km~a~---Pfl~PA~~~~~~~~~~~l~~l~k~ 114 (114) T protein:vir:49 81 EVGTRKMEAQ---PFMKPALDEVAPKMVEELAKWDET 114 (114) T ss_pred cccccccCCC---CchhhhHHHHHHHHHHHHHHHhcC Confidence 9999866554 799999999999999999888888 No 46 >protein:vir:96486 Length: 112 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238496;genbank:gi:66391772;genbank:GeneID:5176908 Probab=99.18 E-value=1.1e-13 Score=91.60 Aligned_cols=100 Identities=18% Similarity=0.106 Sum_probs=77.7 Q ss_pred CC-ccccH--HHH---------HHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecccCcEEEEEecCchhhhh Q lcl|NC_016654. 1 MP-VEFNY--GIA---------ATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASDGMEAVVYFDTPYAARQ 68 (108) Q Consensus 1 m~-vk~n~--~~~---------~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~~~~g~V~y~~pYA~~~ 68 (108) |+ |+|.. .+. ..+++++++++...+..+...+...+|+|||+|++|..+..++.++.|+.+++||.|+ T Consensus 1 Ma~i~i~Gld~L~~~l~~~~~~~~v~~~v~~~~~~~~~~~~~~a~~~apvdTG~Lr~sI~~~~~~~~~~v~~~~~Ya~~v 80 (112) T protein:vir:96 1 MATIEFEGLDEMAQSLLKNASSERRSKVLRKYGAKLKEAAVSKAQFKKGYSTGATRRSITLEAGSDRAVVEALTNYSGYL 80 (112) T ss_pred CceeeehHHHHHHHHHHhhcCHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCchhhhhceeeecCceEEEecCCCCcccee Confidence 66 77763 222 2345556666666677788888999999999999999998888999999999999999 Q ss_pred ccccCCCCCCCccchhhHHHHHhHHHHHHHHHHHH Q lcl|NC_016654. 69 HEEVGWHHVDGQAKYLENAVNATQATVAEVIGEAI 103 (108) Q Consensus 69 h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~i 103 (108) ++||.+-.+. 
+||.++++..+..+.+.+.+-- T Consensus 81 E~GTr~m~Aq---PF~~PA~~~~~~~~~~~l~~L~ 112 (112) T protein:vir:96 81 EVGTRKMEAQ---PFMRPALDQVVPEMVEEMAKWE 112 (112) T ss_pred ccCccccCCC---CchhhhHHHHHHHHHHHHHhcC Confidence 9999755443 6999999998888766554322 No 47 >protein:vir:106623 Length: 115 # NCBI annotation: ORF049 # Family: family:all:180 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239497;genbank:gi:66395260;genbank:GeneID:4555777 Probab=99.15 E-value=2.6e-13 Score=89.54 Aligned_cols=99 Identities=9% Similarity=0.041 Sum_probs=79.6 Q ss_pred ccccH--H-------HHHHHHHHHHHHHHHHHHHHHHHhhhcC------CcccchhhcceeecccC-cEEEEEecCchhh Q lcl|NC_016654. 3 VEFNY--G-------IAATVRGAAKSGLHDAAEVVKQEAIERC------PKETGALRNSAGTASDG-MEAVVYFDTPYAA 66 (108) Q Consensus 3 vk~n~--~-------~~~~v~~a~~~al~~~~~~v~~~s~~~v------P~dtG~L~~S~~v~~~~-~~g~V~y~~pYA~ 66 (108) |+|.. . +.+.+.+++.++|...+..+..++...+ |+|||+|++|..+..++ .++.|+.+++||. T Consensus 1 i~i~Gld~L~~~l~~~~~~~~~~~~~al~~~~~~i~~~a~~~a~~~~~~pv~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~ 80 (115) T protein:vir:10 1 MQSKGLKKLMNHLKVMHDDIEDDVDDILKNNAKEGVGIAVSNAKEVMNKGYWTGNLASLIEVKKIGDLHYRVISTAHYSG 80 (115) T ss_pred CeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcchhhhhceeeeecCcEEEEeeCCCccch Confidence 66663 2 2234455678888888888888887655 89999999999887655 5899999999999 Q ss_pred hhccccCCCCCCCccchhhHHHHHhHHHHHHHHHHHHH Q lcl|NC_016654. 67 RQHEEVGWHHVDGQAKYLENAVNATQATVAEVIGEAIR 104 (108) Q Consensus 67 ~~h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir 104 (108) |+++||.+-.+. +||.++++.++..+.+.|.+.|. 
T Consensus 81 ~vEfGT~km~a~---PFl~PA~~~~k~~~~~~i~~~i~ 115 (115) T protein:vir:10 81 FLEFGTRYMEPA---PFMFPTYQTLKKSTINDLKRLLS 115 (115) T ss_pred heecccccCCCC---CchhhhHHHHHHHHHHHHHHHhC Confidence 999999755443 69999999999999888888887 No 48 >protein:vir:101594 Length: 173 # NCBI annotation: hypothetical protein # Family: family:all:26502 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112510;genbank:gi:53793610;interpro:IPR010064;uniprot:Q5ZGE3;genbank:GeneID:3101702 Probab=99.13 E-value=4.7e-13 Score=88.19 Aligned_cols=106 Identities=15% Similarity=0.131 Sum_probs=83.9 Q ss_pred ccccH---------HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeeccc----CcEEEEEecCchhhhhc Q lcl|NC_016654. 3 VEFNY---------GIAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASD----GMEAVVYFDTPYAARQH 69 (108) Q Consensus 3 vk~n~---------~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~----~~~g~V~y~~pYA~~~h 69 (108) |+|.. .+...+.+++.+|+..+++.|..++...+|.|||+|++|..+... ..++.|.-++.||+|++ T Consensus 1 i~i~Gld~L~~~L~~l~~~~~~~~~~a~~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~~~~~~~~~~~v~~~~~Ya~fvE 80 (173) T protein:vir:10 1 MAVKGVAEVIAELRKIGKDIDKNINATTEEAANFIEDRAKTLAPKNFGKLAQSISTSDLKAKDLISKKITVNELYGAYME 80 (173) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCchhhhhcceeeeeccCceeEEeeCCCcccchhhh Confidence 54442 234556678899999999999999999999999999999976642 23556777899999999 Q ss_pred cccCCCC--C------------------------------------------------CCc--cchhhHHHHHhHHHHHH Q lcl|NC_016654. 70 EEVGWHH--V------------------------------------------------DGQ--AKYLENAVNATQATVAE 97 (108) Q Consensus 70 ~~~~~~~--~------------------------------------------------~~~--~k~le~a~~~~~~~i~~ 97 (108) +||.... 
| +|+ ..||-+|+..+++.+.+ T Consensus 81 fGT~~m~a~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~G~~aqPFl~PA~~~~~~~~~~ 160 (173) T protein:vir:10 81 FGTGAKVSVPKEFADMAASFKGQKTGSFKDGLESIKAWCRAKGIDEKAAYPIFAKILGAGINPQPFLYPAWIEGKKQYLK 160 (173) T ss_pred cccccccCCCchhhhhhcccccccccccccccccccccccccccchhcccceeeEeecCCCCCCccchhHHHHhHHHHHH Confidence 9986321 1 122 25999999999999999 Q ss_pred HHHHHHHHhcC Q lcl|NC_016654. 98 VIGEAIRRSIA 108 (108) Q Consensus 98 ~i~~~ir~~Lg 108 (108) .|.+.|+++|= T Consensus 161 ~i~~~i~~~lr 171 (173) T protein:vir:10 161 DLENLLKTYNK 171 (173) T ss_pred HHHHHHHHHhh Confidence 99999999998 No 49 >protein:vir:79034 Length: 141 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110729;genbank:gi:134287346;genbank:GeneID:4955208 Probab=99.11 E-value=4.2e-13 Score=88.42 Aligned_cols=108 Identities=18% Similarity=0.193 Sum_probs=80.3 Q ss_pred CCc--cccH----HHHHH--------HHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeec-----------ccCcE Q lcl|NC_016654. 1 MPV--EFNY----GIAAT--------VRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTA-----------SDGME 55 (108) Q Consensus 1 m~v--k~n~----~~~~~--------v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~-----------~~~~~ 55 (108) ||- .|+. .+.+. +.+..++++..++..+++++...+|+|||+|++|-.+. .+..+ T Consensus 1 M~~~~~~d~~gl~~~~~~l~~~~~~~~~~~~~~~~~~~a~~l~~~vk~~tPVdTG~Lr~sw~~~~~~~~~~~~~~g~~~~ 80 (141) T protein:vir:79 1 MARWGSVDFREFKRVCKKMEKLTKIDLDKFCKDAARELAARLLGKVIRRTPVDTGFLRQGWNGVAYARSLPVYKQGNNYI 80 (141) T ss_pred CCCCccCcHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhhcccccccccccccceeecCCeeE Confidence 543 3332 23333 34456677777889999999999999999999995432 12345 Q ss_pred EEEEecCchhhhhccccCCCCCCC---ccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 
56 AVVYFDTPYAARQHEEVGWHHVDG---QAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 56 g~V~y~~pYA~~~h~~~~~~~~~~---~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) .+|+-++|||.++.||.....+.+ ...+|+++....+..+.+++++.|++-|+ T Consensus 81 v~v~n~~~YA~~VE~Ghr~~~~~gfV~G~fml~~s~~~~~~~~~~~~~~~l~~~l~ 136 (141) T protein:vir:79 81 IEVVNPTEYASYVNFGHRTKDGKGWVKGQHFLTISEMELQSQVDKIIEKKLLILLK 136 (141) T ss_pred EEEecCCcchhhhhcceeecCCcceeCCchhHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 689999999999999876443321 23578999999999999999999999999 No 50 >protein:vir:106041 Length: 137 # NCBI annotation: gp23 # Family: family:all:1084 # MgeID: mge:1505 # MgeName: Cooper # Cross-refs: genbank:acc:YP_654920;genbank:gi:109392376;genbank:GeneID:4157069 Probab=99.10 E-value=1.5e-13 Score=90.86 Aligned_cols=101 Identities=17% Similarity=0.189 Sum_probs=72.5 Q ss_pred CCccccH---HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeeccc-----CcEEEEEecCchhhhhcccc Q lcl|NC_016654. 1 MPVEFNY---GIAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASD-----GMEAVVYFDTPYAARQHEEV 72 (108) Q Consensus 1 m~vk~n~---~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~-----~~~g~V~y~~pYA~~~h~~~ 72 (108) ||++|.. .+.+.+.+.++++++.++..+..++...+|+|||+|++|...... ..++.|+.+++||.++|+|+ T Consensus 3 ~s~~i~i~~~~l~~~v~~~~k~~l~~~a~~i~~~ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~ve~GT 82 (137) T protein:vir:10 3 VTARIHINEPELERQTGAIFRGKHRSITRRIATQARADVPVRTGNLGRGIQEMPQTYRPFHVGGGVEDNVDYAAPVHEGS 82 (137) T ss_pred eeEEEeeCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhhcCceeeeeccccceEEEEEecCCCceeeeeecC Confidence 4555553 366778888999999999999999999999999999999875542 13678999999999999998 Q ss_pred CCC-------------------------CCCCc-cchhhHHHHHhHHHHHHHHHHHHHHh Q lcl|NC_016654. 
73 GWH-------------------------HVDGQ-AKYLENAVNATQATVAEVIGEAIRRS 106 (108) Q Consensus 73 ~~~-------------------------~~~~~-~k~le~a~~~~~~~i~~~i~~~ir~~ 106 (108) ..+ ||.-+ .+||++++......-.+| +-- T Consensus 83 ~ph~I~pk~~k~l~f~~~G~~v~~k~v~hpG~~a~Pfl~~A~~~~~~~~~ri-----~~~ 137 (137) T protein:vir:10 83 RPHRITARHANALHFFWHGREVFRKSVWHPGVRPRPFLRNAARRVVAADPDI-----HMT 137 (137) T ss_pred CCceeecccCceeeeeeCCceEEeeeeecCCCCCCchHHHHHHHHhhccccc-----cCC Confidence 421 33111 269999987643222221 111 No 51 >protein:vir:105467 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529877;genbank:gi:90592617;genbank:GeneID:3974531 Probab=99.04 E-value=1e-12 Score=86.33 Aligned_cols=107 Identities=14% Similarity=0.141 Sum_probs=79.8 Q ss_pred CC-ccccH----HHHHHH---------HHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecc-----cCcEEEEEec Q lcl|NC_016654. 1 MP-VEFNY----GIAATV---------RGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTAS-----DGMEAVVYFD 61 (108) Q Consensus 1 m~-vk~n~----~~~~~v---------~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~-----~~~~g~V~y~ 61 (108) || ..|+. .+.+.+ .+.++++++.++..+++.+...+|+|||+|++|-.+.. +..+++|+.+ T Consensus 1 Ms~~~id~~gl~~~~~~l~~~~~~~~~~~~~~~~l~~~~~~~~~~vk~~tPVdTG~Lr~S~~~~~~~~~~~~~~~~V~n~ 80 (144) T protein:vir:10 1 MSLGHVDDAQFQQFASRVRQKIDSGYVKQELGKSSRRIGTQSLRILEANTPVKQGNLRRSWTAEGPTYGCGGWTIKLINN 80 (144) T ss_pred CCCCCccHHHHHHHHHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHhCCCCcchhccceeecceeeecCeeEEEEecC Confidence 66 24442 233333 34467778888999999999999999999999976542 3457889999 Q ss_pred CchhhhhccccCCCCC---------------CCccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 62 TPYAARQHEEVGWHHV---------------DGQAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 62 ~pYA~~~h~~~~~~~~---------------~~~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) +|||.+++||+.+... 
+| .+||+++.......+.+.+++.|.+-+= T Consensus 81 ~~YA~~VE~Ghr~~~G~~v~~~~~~~~~g~V~G-~~~~~~a~~~~~~~~~~~l~k~l~~l~d 141 (144) T protein:vir:10 81 AEYASYVESGHRQTPGRYVPVLKKRLVRDWVPG-QFYMKKSIPQIQRQLPQLVTEGLWGLKD 141 (144) T ss_pred CCcccccccceeecCCcccccCCCccccceecC-ccchHHHHHHHHHHHHHHHHHHHHHHhh Confidence 9999999998764422 12 2688999998888888888877776666 No 52 >protein:vir:100075 Length: 140 # NCBI annotation: gp9 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945039;genbank:gi:38707899;genbank:GeneID:2744122 Probab=98.98 E-value=4e-12 Score=83.07 Aligned_cols=105 Identities=21% Similarity=0.224 Sum_probs=78.3 Q ss_pred CC-ccccH--HH-------HHHH-HHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecccC-----cEEEEE----- Q lcl|NC_016654. 1 MP-VEFNY--GI-------AATV-RGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASDG-----MEAVVY----- 59 (108) Q Consensus 1 m~-vk~n~--~~-------~~~v-~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~~-----~~g~V~----- 59 (108) |+ ++|.. .+ ...+ .++..+||..++..|.+++...+|.+||.|++|..+.... ....|+ T Consensus 1 Ma~~~i~Gld~l~~~l~~L~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~tG~l~~sI~~~~~~~~~~~~~~~~g~~~~~ 80 (140) T protein:vir:10 1 MSSIQIIGLADLRADFEKLAKSQSTKALRRATVAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGLATAGVRVRT 80 (140) T ss_pred CceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHHhccccccccccccceEEeeeeecc Confidence 65 66662 12 2233 3467889999999999999999999999999998765321 122333 Q ss_pred -------ecCchhhhhccccCCCCCCCccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 60 -------FDTPYAARQHEEVGWHHVDGQAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 60 -------y~~pYA~~~h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) .+.+|+.++++||.+..|. 
+||.+++..+++++.+.+.+++++.|- T Consensus 81 ~~~~~~~~~~~y~~f~E~GT~~~~a~---PFl~pA~~~~~~~~~~~~~~~~~~~l~ 133 (140) T protein:vir:10 81 KGKADSPNNAFYWRFDEFGTQHMKAQ---PFMRPAFDASIGEAEGAIRTELARAID 133 (140) T ss_pred ccccCCCCccceeeeeccCCCCCCCC---cchhhhHHHHHHHHHHHHHHHHHHHHH Confidence 3478999999999765444 699999999999998888777766665 No 53 >protein:vir:102441 Length: 137 # NCBI annotation: gp26 # Family: family:all:1084 # MgeID: mge:1618 # MgeName: Pipefish # Cross-refs: genbank:acc:YP_655303;genbank:gi:109521866;genbank:GeneID:4157756 Probab=98.92 E-value=3.1e-12 Score=83.66 Aligned_cols=99 Identities=20% Similarity=0.232 Sum_probs=74.3 Q ss_pred CCccccH---HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecc--cC----cEEEEEecCchhhhhccc Q lcl|NC_016654. 1 MPVEFNY---GIAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTAS--DG----MEAVVYFDTPYAARQHEE 71 (108) Q Consensus 1 m~vk~n~---~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~--~~----~~g~V~y~~pYA~~~h~~ 71 (108) -|+++.. .+...+....+++|+.++..+..++...+|+|||+|++|+.... +. .++.|+.+++||.++|+| T Consensus 3 ~~~~~~~~~~~~~~~~~~v~r~~l~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~~V~~~~~YA~~ve~G 82 (137) T protein:vir:10 3 VTARYERNPVGEARQFQVIARRRLSRITRGTANQARADVPVKTGNLGRSIREDPIVVAGPLRLDSGVTAHADYARYVHDG 82 (137) T ss_pred eEEEeccCchhHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccchhhhcCceeeeeeccccceEEEEecCCCccceeeecC Confidence 3334433 35667788889999999999999999999999999999997543 22 256788999999999999 Q ss_pred cCCC--------------------------CCCCc-cchhhHHHHHhHHHHHHHH Q lcl|NC_016654. 
72 VGWH--------------------------HVDGQ-AKYLENAVNATQATVAEVI 99 (108) Q Consensus 72 ~~~~--------------------------~~~~~-~k~le~a~~~~~~~i~~~i 99 (108) |..| ||.-+ .+||+++++.+++.....- T Consensus 83 T~ph~I~Pk~~k~~l~~~~~g~~vf~k~V~hPG~~a~PfL~~A~~~~~~~~~~~~ 137 (137) T protein:vir:10 83 TRAHVIRPRRPGGVLRFTVGGRVVYARRVNHPGTRARPFLRNAAERVVARETATS 137 (137) T ss_pred CCCceeeccccceeeeEeeCCeeEecceeecCCCCCCchHHHHHHHhhhhhcccC Confidence 8533 22111 2589999888887776544 No 54 >protein:vir:80362 Length: 140 # NCBI annotation: gp10, phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111089;genbank:gi:134288660;genbank:GeneID:4960609 Probab=98.89 E-value=9.7e-12 Score=80.97 Aligned_cols=105 Identities=21% Similarity=0.223 Sum_probs=77.8 Q ss_pred CC-ccccH--HHHH-------HH-HHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeeccc-----CcEEEEE----- Q lcl|NC_016654. 1 MP-VEFNY--GIAA-------TV-RGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASD-----GMEAVVY----- 59 (108) Q Consensus 1 m~-vk~n~--~~~~-------~v-~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~-----~~~g~V~----- 59 (108) |+ ++|.. .+.+ .+ .++..+|+..++..|..++...+|.+||.|++|..+... ...+.++ T Consensus 1 Ma~~~i~Gld~l~~~l~~l~~~~~~k~~~~a~~~~a~~v~~~ak~~aP~~tG~l~~~i~~~~~~~~~~~~~~~~~~~~~~ 80 (140) T protein:vir:80 1 MSSIQIVGLADLLADFERLAKSQSTKALRRATVAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGLATAGVRVRT 80 (140) T ss_pred CceeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhceeeeccccccccceeeeeeeccc Confidence 65 66663 2222 22 235678999999999999999999999999999876531 1122333 Q ss_pred -------ecCchhhhhccccCCCCCCCccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 60 -------FDTPYAARQHEEVGWHHVDGQAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 60 -------y~~pYA~~~h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) .++.||.++++||.+..|. 
+||.+++..+++++.+.+.+++++.|- T Consensus 81 ~~~~~~~~~~~y~~f~E~GT~~~~a~---PFl~pA~~~~~~~~~~~~~~~~~~~l~ 133 (140) T protein:vir:80 81 KGKADSPSNAFYWRFDEFGTQHMKAQ---PFMRPAFDASIGEAEGAIRTELARAID 133 (140) T ss_pred ccccCCCCCcceeeeeccCCCCCCCC---cchhhhHHHHHHHHHHHHHHHHHHHHH Confidence 3478999999999765444 799999999999998887777766665 No 55 >protein:vir:102875 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338140;genbank:gi:77020200;genbank:GeneID:3703784 Probab=98.89 E-value=1.5e-11 Score=79.95 Aligned_cols=105 Identities=10% Similarity=0.160 Sum_probs=80.3 Q ss_pred CCccccH--HHHH-------HHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecc-------------------c Q lcl|NC_016654. 1 MPVEFNY--GIAA-------TVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTAS-------------------D 52 (108) Q Consensus 1 m~vk~n~--~~~~-------~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~-------------------~ 52 (108) |+|+|.. .+.+ .+.+...+||..++..+..++...+|.++|+|++|..... . T Consensus 5 ~~~~i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~g 84 (146) T protein:vir:10 5 IDLDLLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAKLEGG 84 (146) T ss_pred eeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceecccccccc Confidence 7788873 2333 3455688999999999999999999999999987643110 0 Q ss_pred CcEEEEEec------CchhhhhccccCCCCCCCccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 53 GMEAVVYFD------TPYAARQHEEVGWHHVDGQAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 53 ~~~g~V~y~------~pYA~~~h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) ...+.|+|+ ..||+++++||.... ++ +||++++..+++++.+.+.++|++.|. 
T Consensus 85 ~~~~~vg~~~~~~~~~~y~~f~E~GT~~~~--a~-PFl~pa~~~~k~~~~~~~~~~l~~~l~ 143 (146) T protein:vir:10 85 IKTVKIGLNKADRSPWFYLKFHEWGTSKMP--AH-PFIEPGFNASKAEAVRAMTDILKNEMR 143 (146) T ss_pred ceeEEeeeccCCCCCcceeeeeccCCCCCC--CC-cchhHHHHHhHHHHHHHHHHHHHHHHh Confidence 123567765 469999999887443 33 799999999999999999988888888 No 56 >protein:vir:105007 Length: 146 # NCBI annotation: conserved phage protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459972;genbank:gi:85701387;genbank:GeneID:3882148 Probab=98.89 E-value=1.5e-11 Score=79.95 Aligned_cols=105 Identities=10% Similarity=0.160 Sum_probs=80.3 Q ss_pred CCccccH--HHHH-------HHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecc-------------------c Q lcl|NC_016654. 1 MPVEFNY--GIAA-------TVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTAS-------------------D 52 (108) Q Consensus 1 m~vk~n~--~~~~-------~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~-------------------~ 52 (108) |+|+|.. .+.+ .+.+...+||..++..+..++...+|.++|+|++|..... . T Consensus 5 ~~~~i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~g 84 (146) T protein:vir:10 5 IDLDLLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAKLEGG 84 (146) T ss_pred eeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceecccccccc Confidence 7788873 2333 3455688999999999999999999999999987643110 0 Q ss_pred CcEEEEEec------CchhhhhccccCCCCCCCccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 53 GMEAVVYFD------TPYAARQHEEVGWHHVDGQAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 53 ~~~g~V~y~------~pYA~~~h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) ...+.|+|+ ..||+++++||.... ++ +||++++..+++++.+.+.++|++.|. 
T Consensus 85 ~~~~~vg~~~~~~~~~~y~~f~E~GT~~~~--a~-PFl~pa~~~~k~~~~~~~~~~l~~~l~ 143 (146) T protein:vir:10 85 IKTVKIGLNKADRSPWFYLKFHEWGTSKMP--AH-PFIEPGFNASKAEAVRAMTDILKNEMR 143 (146) T ss_pred ceeEEeeeccCCCCCcceeeeeccCCCCCC--CC-cchhHHHHHhHHHHHHHHHHHHHHHHh Confidence 123567765 469999999887443 33 799999999999999999988888888 No 57 >protein:vir:102085 Length: 146 # NCBI annotation: head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512318;genbank:gi:89152487;genbank:GeneID:3953078 Probab=98.89 E-value=1.5e-11 Score=79.95 Aligned_cols=105 Identities=10% Similarity=0.160 Sum_probs=80.3 Q ss_pred CCccccH--HHHH-------HHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecc-------------------c Q lcl|NC_016654. 1 MPVEFNY--GIAA-------TVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTAS-------------------D 52 (108) Q Consensus 1 m~vk~n~--~~~~-------~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~-------------------~ 52 (108) |+|+|.. .+.+ .+.+...+||..++..+..++...+|.++|+|++|..... . T Consensus 5 ~~~~i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~g 84 (146) T protein:vir:10 5 IDLDLLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAKLEGG 84 (146) T ss_pred eeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceecccccccc Confidence 7788873 2333 3455688999999999999999999999999987643110 0 Q ss_pred CcEEEEEec------CchhhhhccccCCCCCCCccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 53 GMEAVVYFD------TPYAARQHEEVGWHHVDGQAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 53 ~~~g~V~y~------~pYA~~~h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) ...+.|+|+ ..||+++++||.... ++ +||++++..+++++.+.+.++|++.|. 
T Consensus 85 ~~~~~vg~~~~~~~~~~y~~f~E~GT~~~~--a~-PFl~pa~~~~k~~~~~~~~~~l~~~l~ 143 (146) T protein:vir:10 85 IKTVKIGLNKADRSPWFYLKFHEWGTSKMP--AH-PFIEPGFNASKAEAVRAMTDILKNEMR 143 (146) T ss_pred ceeEEeeeccCCCCCcceeeeeccCCCCCC--CC-cchhHHHHHhHHHHHHHHHHHHHHHHh Confidence 123567765 469999999887443 33 799999999999999999988888888 No 58 >protein:vir:107568 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338191;genbank:gi:77020147;genbank:GeneID:3703699 Probab=98.89 E-value=1.5e-11 Score=79.95 Aligned_cols=105 Identities=10% Similarity=0.160 Sum_probs=80.3 Q ss_pred CCccccH--HHHH-------HHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecc-------------------c Q lcl|NC_016654. 1 MPVEFNY--GIAA-------TVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTAS-------------------D 52 (108) Q Consensus 1 m~vk~n~--~~~~-------~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~-------------------~ 52 (108) |+|+|.. .+.+ .+.+...+||..++..+..++...+|.++|+|++|..... . T Consensus 5 ~~~~i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~g 84 (146) T protein:vir:10 5 IDLDLLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAKLEGG 84 (146) T ss_pred eeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceecccccccc Confidence 7788873 2333 3455688999999999999999999999999987643110 0 Q ss_pred CcEEEEEec------CchhhhhccccCCCCCCCccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 53 GMEAVVYFD------TPYAARQHEEVGWHHVDGQAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 53 ~~~g~V~y~------~pYA~~~h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) ...+.|+|+ ..||+++++||.... ++ +||++++..+++++.+.+.++|++.|. 
T Consensus 85 ~~~~~vg~~~~~~~~~~y~~f~E~GT~~~~--a~-PFl~pa~~~~k~~~~~~~~~~l~~~l~ 143 (146) T protein:vir:10 85 IKTVKIGLNKADRSPWFYLKFHEWGTSKMP--AH-PFIEPGFNASKAEAVRAMTDILKNEMR 143 (146) T ss_pred ceeEEeeeccCCCCCcceeeeeccCCCCCC--CC-cchhHHHHHhHHHHHHHHHHHHHHHHh Confidence 123567765 469999999887443 33 799999999999999999988888888 No 59 >protein:vir:100243 Length: 140 # NCBI annotation: gp72 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355408;genbank:gi:77864698;genbank:GeneID:3725965 Probab=98.88 E-value=1.1e-11 Score=80.75 Aligned_cols=105 Identities=25% Similarity=0.271 Sum_probs=77.8 Q ss_pred CC-ccccH--H-------HHHHH-HHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecccC---cE--EE--EE--- Q lcl|NC_016654. 1 MP-VEFNY--G-------IAATV-RGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASDG---ME--AV--VY--- 59 (108) Q Consensus 1 m~-vk~n~--~-------~~~~v-~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~~---~~--g~--V~--- 59 (108) |+ ++|.. . +...+ .++..+|+...+..|.+++...+|.+||+|++|..+.... +. .. |. T Consensus 1 Ma~~~i~Gld~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~~~~~~~~ 80 (140) T protein:vir:10 1 MSSVQILGLADLQADFLKLAKAQSTKALRRATVAGANVIRDEARARAPKKTGKLKRNIVTAALKQKDSPGIATAGVRVRT 80 (140) T ss_pred CceeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHHhceecccccccccceeEEeecccc Confidence 65 55552 1 22233 3456789999999999999999999999999998764321 11 11 11 Q ss_pred -------ecCchhhhhccccCCCCCCCccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 60 -------FDTPYAARQHEEVGWHHVDGQAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 60 -------y~~pYA~~~h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) .+++|+.++++||.++.|. 
+||.+++..+++.+.+.+.+++++.|- T Consensus 81 ~~~~~~~~~~~y~~f~E~GT~~~~a~---PFl~pA~~~~~~~~~~~~~~~~~~~l~ 133 (140) T protein:vir:10 81 KGKADSPNNAFYWRFVELGTQFMKAE---PFMRPAFDASIAQAEGAIRTEIARAID 133 (140) T ss_pred ccccCCCCcccccceeccCcCCCCCC---cchhhhHHHHHHHHHHHHHHHHHHHHH Confidence 2478999999999866554 699999999999999988888877776 No 60 >protein:vir:97982 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1482 # MgeName: Orion # Cross-refs: genbank:acc:YP_655121;genbank:gi:109391871;genbank:GeneID:4157345 Probab=98.88 E-value=7.2e-12 Score=81.66 Aligned_cols=100 Identities=17% Similarity=0.184 Sum_probs=71.0 Q ss_pred CCccccH-HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecc--cC---cEEEEEecCchhhhhccccCC Q lcl|NC_016654. 1 MPVEFNY-GIAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTAS--DG---MEAVVYFDTPYAARQHEEVGW 74 (108) Q Consensus 1 m~vk~n~-~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~--~~---~~g~V~y~~pYA~~~h~~~~~ 74 (108) -.++++. .+...+...+++.+..++..+..++...+|+|||+|++|..... ++ ..+.|+-+++||.|+|+||.. T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~ak~~aPvdtG~Lr~SI~~~~~~~~~~~~~~~v~~~a~YA~~Ve~GT~p 87 (140) T protein:vir:97 8 ARIEIDEAALERESGEHLRAFHRSLTRRIANQSRVAVPVRTGNLGRTIGELPQVYTPFRVRGGVEATADYAAPVHEGSRP 87 (140) T ss_pred eeeeeCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccchhhhccceeeeeeCCCceEEEEecCCccchhhhccCCCC Confidence 2334443 35667788889999999999999999999999999999987543 22 246677789999999999853 Q ss_pred C-------------------------CCCCc--cchhhHHHHHhHHHHHHHHHH Q lcl|NC_016654. 
75 H-------------------------HVDGQ--AKYLENAVNATQATVAEVIGE 101 (108) Q Consensus 75 ~-------------------------~~~~~--~k~le~a~~~~~~~i~~~i~~ 101 (108) + || |+ .+||++++........+|--- T Consensus 88 h~I~pk~~k~L~~~~~G~~~~~k~V~hp-G~~a~Pfl~~A~~~~~~~~~~i~~~ 140 (140) T protein:vir:97 88 HAIRARNAQYLHFWWHGREMFRKSVWHP-GTRARPFMRNSAQRVVTNDPRVRMT 140 (140) T ss_pred ceeecCCCccceeecCCCEEEeeeeecC-CCCCChhHHHHHHHHhhhhhhccCC Confidence 2 22 22 269999998754433332222 No 61 >protein:vir:107545 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1481 # MgeName: PG1 # Cross-refs: genbank:acc:NP_943803;genbank:gi:38638428;genbank:GeneID:2657225 Probab=98.88 E-value=7.2e-12 Score=81.66 Aligned_cols=100 Identities=17% Similarity=0.184 Sum_probs=71.0 Q ss_pred CCccccH-HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecc--cC---cEEEEEecCchhhhhccccCC Q lcl|NC_016654. 1 MPVEFNY-GIAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTAS--DG---MEAVVYFDTPYAARQHEEVGW 74 (108) Q Consensus 1 m~vk~n~-~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~--~~---~~g~V~y~~pYA~~~h~~~~~ 74 (108) -.++++. .+...+...+++.+..++..+..++...+|+|||+|++|..... ++ ..+.|+-+++||.|+|+||.. T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~ak~~aPvdtG~Lr~SI~~~~~~~~~~~~~~~v~~~a~YA~~Ve~GT~p 87 (140) T protein:vir:10 8 ARIEIDEAALERESGEHLRAFHRSLTRRIANQSRVAVPVRTGNLGRTIGELPQVYTPFRVRGGVEATADYAAPVHEGSRP 87 (140) T ss_pred eeeeeCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccchhhhccceeeeeeCCCceEEEEecCCccchhhhccCCCC Confidence 2334443 35667788889999999999999999999999999999987543 22 246677789999999999853 Q ss_pred C-------------------------CCCCc--cchhhHHHHHhHHHHHHHHHH Q lcl|NC_016654. 
75 H-------------------------HVDGQ--AKYLENAVNATQATVAEVIGE 101 (108) Q Consensus 75 ~-------------------------~~~~~--~k~le~a~~~~~~~i~~~i~~ 101 (108) + || |+ .+||++++........+|--- T Consensus 88 h~I~pk~~k~L~~~~~G~~~~~k~V~hp-G~~a~Pfl~~A~~~~~~~~~~i~~~ 140 (140) T protein:vir:10 88 HAIRARNAQYLHFWWHGREMFRKSVWHP-GTRARPFMRNSAQRVVTNDPRVRMT 140 (140) T ss_pred ceeecCCCccceeecCCCEEEeeeeecC-CCCCChhHHHHHHHHhhhhhhccCC Confidence 2 22 22 269999998754433332222 No 62 >protein:vir:106506 Length: 137 # NCBI annotation: Pas21 # Family: family:all:1084 # MgeID: mge:1680 # MgeName: phiAsp2 # Cross-refs: genbank:acc:YP_024807;genbank:gi:48697422;genbank:GeneID:2846163 Probab=98.82 E-value=1.1e-11 Score=80.63 Aligned_cols=102 Identities=17% Similarity=0.191 Sum_probs=73.8 Q ss_pred CCccccHH-HHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeeccc--C---cEEEEEecCchhhhhccccCC Q lcl|NC_016654. 1 MPVEFNYG-IAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASD--G---MEAVVYFDTPYAARQHEEVGW 74 (108) Q Consensus 1 m~vk~n~~-~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~--~---~~g~V~y~~pYA~~~h~~~~~ 74 (108) =+.+||.. +...+.+..++++..++..+..++...+|+|||+|++|...... . .++.|+.+++||.++|+||.- T Consensus 4 ~~~~l~~~~l~~~~~~~~~~~~~~~a~~ve~~ak~~aPv~TG~Lr~SI~~~~~~~~g~~v~~~V~~~~~YA~~ve~GT~p 83 (137) T protein:vir:10 4 HTLRIERAQLHGLGMDEARKAVNRVVRRTFTRSQILAPVDTGYLRASGRLVLGRERGAVVIGSVEYTARYAAAVHNGRRA 83 (137) T ss_pred cccccChhhHhhHHHHHHHHHHHHHHHHHHHHHHhcCCcCchhhhccceeeeeeccccEEEEEecCCcccceeeecCCCC Confidence 58899975 45667788899999999999999999999999999999876542 2 356788899999999999842 Q ss_pred C--CCC----------------------Cc--cchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 75 H--HVD----------------------GQ--AKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 75 ~--~~~----------------------~~--~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) | .|+ |+ .+||+++++..+... 
-++--|| T Consensus 84 h~I~pk~~kaL~f~~~G~~vf~k~V~hPG~k~~PfL~~Al~~~~~~~------~~~~~~~ 137 (137) T protein:vir:10 84 LTIRAKGNGRLKFTVEGRTVYARSVHQPARAGRPYLSQALREVAPQE------GFRVTIG 137 (137) T ss_pred ceeecCCCccceeecCCeeEeccceecCCCCCChhhHHHHHHhhccc------ceeEeeC Confidence 2 111 11 257777766544332 3333444 No 63 >protein:vir:194 Length: 149 # NCBI annotation: Gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037704;genbank:gi:9634169;genbank:GeneID:1262536 Probab=98.80 E-value=4e-11 Score=77.62 Aligned_cols=105 Identities=17% Similarity=0.247 Sum_probs=72.7 Q ss_pred CCccccH----HHHH-------HHH-HHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecccC--c------------ Q lcl|NC_016654. 1 MPVEFNY----GIAA-------TVR-GAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASDG--M------------ 54 (108) Q Consensus 1 m~vk~n~----~~~~-------~v~-~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~~--~------------ 54 (108) |+++|.. ++.+ .+. ++..+|+..+++.|..++...+|.+||.|++|..+.... . T Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~si~~~~~~~~~~~~~~~~v~~~~ 81 (149) T protein:vir:19 2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIDRAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRG 81 (149) T ss_pred cceeeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCchhhhhhccccccccccccceeecccccc Confidence 4444442 2322 232 355789999999999999999999999999997543210 0 Q ss_pred ---------EEEEE---ecCchhhhhccccCCCCCCCccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 55 ---------EAVVY---FDTPYAARQHEEVGWHHVDGQAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 55 ---------~g~V~---y~~pYA~~~h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) ..... .+..|++++.+||... 
+++ +||++++..+++++.+.+.++|++.|= T Consensus 82 ~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~--~a~-PF~~pA~~~~k~~~~~~~~~~l~~~l~ 144 (149) T protein:vir:19 82 VNPRTGNSDNTMKANNPRNAFYWRFVELGTANM--PAH-PFVRPAYDTREEEAASVAIARMNQAID 144 (149) T ss_pred cccccccccceeecCCCCccceeeeeccCCCCC--CCC-cchhHHHHHHHHHHHHHHHHHHHHHHH Confidence 01111 3467999999888644 344 699999999999887777766666665 No 64 >protein:vir:1437 Length: 140 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536366;genbank:gi:17975171;genbank:GeneID:929147 Probab=98.79 E-value=3.5e-11 Score=77.92 Aligned_cols=105 Identities=21% Similarity=0.223 Sum_probs=76.7 Q ss_pred CC-ccccH--HHHH-------HH-HHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecccC-----cEEEEE----- Q lcl|NC_016654. 1 MP-VEFNY--GIAA-------TV-RGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASDG-----MEAVVY----- 59 (108) Q Consensus 1 m~-vk~n~--~~~~-------~v-~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~~-----~~g~V~----- 59 (108) |. ++|.. .+.+ .+ .++..+|+..++..|..++...+|.+||+|++|..+.... ....|+ T Consensus 1 M~~~~i~Gld~l~~~l~~l~~~~~~~~~~~al~~~a~~v~~~ak~~aP~~tG~l~~sI~~~~~~~~~~~~~~~vg~~~~~ 80 (140) T protein:vir:14 1 MSSIQIIGLADLRADFEKLAKSQSAKALRRATLAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGLATAGVRVRT 80 (140) T ss_pred CceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHhhcccccccccccceeEEeeeeecc Confidence 54 66663 2222 22 2356789999999999999999999999999998765321 122333 Q ss_pred -------ecCchhhhhccccCCCCCCCccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 60 -------FDTPYAARQHEEVGWHHVDGQAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 60 -------y~~pYA~~~h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) .+++|+.++++||.+..|. 
+||.+++..+++++.+.+.+++++.|- T Consensus 81 ~~~~~~~~~~~y~~f~E~GT~~~~a~---pFl~pa~~~~~~~~~~~~~~~~~~~l~ 133 (140) T protein:vir:14 81 KGKADSPNNAFYWRFDEFGTQHMKAQ---PFMRPAFDASIGEAEGAIRTELARAID 133 (140) T ss_pred ccccCCCCccceeeeeccccCCCCCC---cchhHHHHHHHHHHHHHHHHHHHHHHH Confidence 2578999999999765444 799999999999988877766665554 No 65 >protein:vir:104347 Length: 145 # NCBI annotation: conserved phage-related protein # Family: family:all:448 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398975;genbank:gi:81343959;genbank:GeneID:3778879 Probab=98.77 E-value=9.4e-11 Score=75.54 Aligned_cols=103 Identities=12% Similarity=0.073 Sum_probs=77.9 Q ss_pred CCccccH-----HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeeccc-------------C--------- Q lcl|NC_016654. 1 MPVEFNY-----GIAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASD-------------G--------- 53 (108) Q Consensus 1 m~vk~n~-----~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~-------------~--------- 53 (108) |+-.+.| .+...++...+..++.++..++.......|+|||.|++|-.+..+ + T Consensus 5 m~~~~sF~~~i~~~~~~ve~~~~~v~r~~a~~i~~~vv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~G~~t~~~~~~ 84 (145) T protein:vir:10 5 IGSVVTFEKSIADWIDRAEDGFGIVVSNTVIKTANAIVDLSPVDTGRFKANWQISANSPAQQSLNEYDQTGGQTKTYLAR 84 (145) T ss_pred ccchhccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccceeecccccccccccCCCCccchhhHHH Confidence 4443333 467888899999999999999999999999999999999765431 1 Q ss_pred -----------cEEEEEecCchhhhhccccCCCCCCCccchhhHHHHHhHHHHHHHHHHHHHHhc Q lcl|NC_016654. 54 -----------MEAVVYFDTPYAARQHEEVGWHHVDGQAKYLENAVNATQATVAEVIGEAIRRSI 107 (108) Q Consensus 54 -----------~~g~V~y~~pYA~~~h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~~L 107 (108) .+-.|.+|+|||.+++||.+-+-|. -|.+..+.... 
.+.+-+.+++|+.+ T Consensus 85 ~~~~i~~~k~g~~iyi~Nn~pYA~~LEyG~S~QAP~---G~v~~~~~~~~-~~v~~~~~e~k~~~ 145 (145) T protein:vir:10 85 QARAVANSKATSVIYITNRLDYAADLEYGASNQAPA---GVLGVVQARLG-RYFQEAVEEARRAI 145 (145) T ss_pred HHHHhhcccccceEEEeeCchhhhHhhccccCCCcc---hHHHHHHHHHH-HHHHHHHHHhhccC Confidence 0135778999999999987655555 38888777664 44455568899999 No 66 >protein:vir:3873 Length: 128 # NCBI annotation: putative head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680490;swissprot:trembl:p94214;genbank:gi:22296530;interpro:IPR010064;uniprot:P94214;genbank:GeneID:951688 Probab=98.76 E-value=8.2e-11 Score=75.87 Aligned_cols=105 Identities=13% Similarity=0.116 Sum_probs=82.0 Q ss_pred CCccccH--H-------HHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecc---------cC--cEEEEEe Q lcl|NC_016654. 1 MPVEFNY--G-------IAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTAS---------DG--MEAVVYF 60 (108) Q Consensus 1 m~vk~n~--~-------~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~---------~~--~~g~V~y 60 (108) |||++.. + +...+.++..+||..++..+...+...+|.++|.++.++.... +. ....||| T Consensus 1 m~v~i~Gl~el~~~l~~l~~~~~k~~~~al~~ga~~~~~~~k~~ap~~~~~~~~~~h~~d~I~~~~~k~~~g~~~~~VG~ 80 (128) T protein:vir:38 1 MGVKVTGDAELLANLNKLQFGVAKEARAAVRDGAQKFADKLKSNTPEWDGETDMSGHLRDDIKLSSVRETSGLTEVDVGY 80 (128) T ss_pred CccchhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCCCcccchhhhhhccccccccCceeEEEeee Confidence 9999984 2 2334556788899999999999999999999988666653221 11 2356888 Q ss_pred c---CchhhhhccccCCCCCCCccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 61 D---TPYAARQHEEVGWHHVDGQAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 61 ~---~pYA~~~h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) + ..||+++++||....|. 
+||++++...++++.+.+.++|+++|= T Consensus 81 ~k~~~~y~~f~E~GT~k~~a~---pF~~pa~~~~~~~~~~~~~~~l~k~i~ 128 (128) T protein:vir:38 81 GKDTGWRAHFPNSGTSMQDPQ---HFIEETQEIMRPVVIAAFLSHLKEGGM 128 (128) T ss_pred cCCCceEEeeeccCccCCCCC---cchhHHHHHhHHHHHHHHHHHHHhhcC Confidence 6 46788888888755444 799999999999999999999999999 No 67 >protein:vir:93617 Length: 148 # NCBI annotation: putative structural component # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449299;genbank:gi:157166047;interpro:IPR010064;interpro:IPR011693;uniprot:Q6H9U2;genbank:GeneID:5580439 Probab=98.73 E-value=7.9e-11 Score=75.96 Aligned_cols=105 Identities=17% Similarity=0.180 Sum_probs=75.0 Q ss_pred CCccccH--HHHH-------HH-HHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecccC---c------------- Q lcl|NC_016654. 1 MPVEFNY--GIAA-------TV-RGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASDG---M------------- 54 (108) Q Consensus 1 m~vk~n~--~~~~-------~v-~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~~---~------------- 54 (108) |+++|.. ++.+ .+ +++..+||..++..|..++...+|.+||+|++|..+.... + T Consensus 4 ~~~~i~Gldel~~~l~~L~~~~~~~~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~g~~~~~v~~~~~~~ 83 (148) T protein:vir:93 4 TLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRRSRDGGMESGVHIRGVNP 83 (148) T ss_pred eeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcchhhhhceeccccccCCceeeeeeeccccc Confidence 4455442 2222 22 2456779999999999999999999999999997654211 0 Q ss_pred ------EEEEEe---cCchhhhhccccCCCCCCCccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 55 ------EAVVYF---DTPYAARQHEEVGWHHVDGQAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 55 ------~g~V~y---~~pYA~~~h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) ...+.+ +.+|+.++++||... 
+++ +||++++..+++++.+.+.+++++.|- T Consensus 84 ~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~--pa~-PFl~pA~~~~k~~~~~~~~~~~~~~i~ 143 (148) T protein:vir:93 84 DTGNSDNTMKADNPRNAFYWRFVEMGTVNM--PPH-PFVRPAFDVRSEQAAQVAIARMNRAID 143 (148) T ss_pred ccccccceeecCCCCCcceeeeeccCCCCC--CCC-cchhHHHHHhHHHHHHHHHHHHHHHHH Confidence 011223 357888888888743 344 699999999999998888888887777 No 68 >protein:vir:97088 Length: 157 # NCBI annotation: hypothetical protein # Family: family:all:2714 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453568;genbank:gi:84662603;genbank:GeneID:5142503 Probab=98.71 E-value=1.3e-10 Score=74.77 Aligned_cols=108 Identities=19% Similarity=0.222 Sum_probs=74.5 Q ss_pred CCccccH----HHH-------HHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeeccc---CcEEE----EEec- Q lcl|NC_016654. 1 MPVEFNY----GIA-------ATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASD---GMEAV----VYFD- 61 (108) Q Consensus 1 m~vk~n~----~~~-------~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~---~~~g~----V~y~- 61 (108) |||.|.. ++. ....+....|+..++..|..++...+|.+||+|++|..+..+ .+.|. |+|+ T Consensus 1 m~~~~~~~d~s~l~~~l~~l~~~~~~v~R~A~~~ga~vv~dear~~aP~~tG~LkksI~~~~~~~~s~~g~~~~~Vg~~~ 80 (157) T protein:vir:97 1 MKFSIRSVDITGILAGLETVVEHSSDVVRTMTYESAVAVRESAKAFVNDETGKLRNNLYVAYSPEESVEGIQTYAVSWRK 80 (157) T ss_pred CeeEeecccHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhheeeeeccccCCCceEEEEEeecC Confidence 8888842 222 233345678888889999999999999999999999987542 22233 6775 Q ss_pred --CchhhhhccccCC-----CCCCC----------------ccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 
62 --TPYAARQHEEVGW-----HHVDG----------------QAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 62 --~pYA~~~h~~~~~-----~~~~~----------------~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) .||+.++++|..- +.|++ -..||.+|+...++++.+.+.+.|++.+- T Consensus 81 ~~a~~g~~vEfG~~~~~~~~~~~~~~~~~~~~~~~t~~~~Pa~PFlRPA~d~~k~~a~~~~~~~l~k~I~ 150 (157) T protein:vir:97 81 KAAPHGHLLEFGHWQTHAAYRDKDGQWYSSKVKLVNPKWIPAKPFLRPGYDSVAMQIPDIARAAGAKKYA 150 (157) T ss_pred CccceeeeeecCcccccccccCCcccccccccccCCCCcCCCCcccchHHHHhHHHHHHHHHHHHHHHHH Confidence 5777777766311 01111 03699999999999999887665555554 No 69 >protein:vir:103280 Length: 142 # NCBI annotation: phage-related hypothetical protein # Family: family:all:448 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277459;genbank:gi:71834102;genbank:GeneID:3562391 Probab=98.65 E-value=4.6e-10 Score=71.79 Aligned_cols=103 Identities=11% Similarity=0.059 Sum_probs=75.9 Q ss_pred CCc-----cccH-HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeeccc---------------------- Q lcl|NC_016654. 1 MPV-----EFNY-GIAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASD---------------------- 52 (108) Q Consensus 1 m~v-----k~n~-~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~---------------------- 52 (108) |.. .-.- .+...++++....++.++..++.......|+|||.|++|-.+..+ T Consensus 1 Ma~~~~sf~~~i~~~~~~ve~~~~~v~r~~a~~i~~~vv~~sPVdTGr~R~nw~vs~~~~~~~~~~~~d~~G~~t~~~~~ 80 (142) T protein:vir:10 1 MANDVVSFRNSINAWIDGVTEGVELIVEGTLTKATKDIVKLSPVDTGRFRGNWQATGNSPAAQSLNNYDPDGNETRNSLR 80 (142) T ss_pred CccchhhhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCcccchhhcccceeeecCcccccccCcCCCCccchhhHH Confidence 553 3332 467788888888899999999999999999999999999655421 Q ss_pred -----------CcEEEEEecCchhhhhccccCCCCCCCccchhhHHHHHhHHHHHHHHHHHHHHhc Q lcl|NC_016654. 53 -----------GMEAVVYFDTPYAARQHEEVGWHHVDGQAKYLENAVNATQATVAEVIGEAIRRSI 107 (108) Q Consensus 53 -----------~~~g~V~y~~pYA~~~h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~~L 107 (108) +.+-.|.+++|||.+++||.+-+-|.| |.+.++.. 
-..|.+-..+++|+.| T Consensus 81 ~~~~~i~~~~~g~~iyi~Nn~pYA~~LEyG~S~QAP~G---~v~~a~q~-~~~~v~~a~~e~~~~~ 142 (142) T protein:vir:10 81 RQIYALARDANTNVIYISNRLDYAQGLEFGSSNQAPSG---VLGVVQKR-LGRYFAEAVQEAKRAL 142 (142) T ss_pred HHHHHhhhccccceEEEeeCcchhhhhhccccCCCcch---HHHHHHHH-HHHHHHHHHHHhhccC Confidence 112457789999999999877655553 77776544 4455566667888888 No 70 >protein:vir:9708 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795470;genbank:gi:28876221;genbank:GeneID:1257765 Probab=98.63 E-value=3.6e-10 Score=72.39 Aligned_cols=105 Identities=10% Similarity=0.166 Sum_probs=82.0 Q ss_pred CCccccH------HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccch----hhcceeeccc-----C-cEEEEEec--- Q lcl|NC_016654. 1 MPVEFNY------GIAATVRGAAKSGLHDAAEVVKQEAIERCPKETGA----LRNSAGTASD-----G-MEAVVYFD--- 61 (108) Q Consensus 1 m~vk~n~------~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~----L~~S~~v~~~-----~-~~g~V~y~--- 61 (108) |.-=|+- .+...++++..+|+..++..+...+...+|.++|. |+.|..+... + ....|+|+ T Consensus 1 mv~Gl~el~~~l~~l~~~~~~~~~~al~~ga~~~~~~~k~~ap~~~~~~~~hl~d~I~~~~~k~~~~g~~~~~VG~~k~~ 80 (125) T protein:vir:97 1 MTKGLDEILANLTKLEVKAPKTAKAAVTEVAKEFEKALKANTPVYEVETDERLQEDTVISGFKGANVGIVSKEIGYGKAT 80 (125) T ss_pred CchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCCchhhHHhhhhcccccccccCceEEEEeecCCC Confidence 4433331 23345567788999999999999999999999988 8888866431 1 24568885 Q ss_pred CchhhhhccccCCCCCCCccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 62 TPYAARQHEEVGWHHVDGQAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 62 ~pYA~~~h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) ..||+++++||....|. 
+||++++...++++.+.+.+++++.|+ T Consensus 81 ~~y~~f~E~GT~k~~~~---pF~~pa~~~~k~~~~~~~~~~~~~~L~ 124 (125) T protein:vir:97 81 GWRAHYPNDGTIYQRGQ---DFKERTINQMTPKAKQLYAEKVKEGLG 124 (125) T ss_pred ceeEeeeccCccCCCcC---ccchHhHHHhHHHHHHHHHHHHHHHhc Confidence 56888888888744443 799999999999999999999999999 No 71 >protein:vir:1273 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690765;genbank:gi:22855005;genbank:GeneID:955232 Probab=98.60 E-value=5.8e-10 Score=71.21 Aligned_cols=105 Identities=9% Similarity=0.080 Sum_probs=79.6 Q ss_pred CCccccH---------HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcc---cchhhcceeeccc------CcEEEEEecC Q lcl|NC_016654. 1 MPVEFNY---------GIAATVRGAAKSGLHDAAEVVKQEAIERCPKE---TGALRNSAGTASD------GMEAVVYFDT 62 (108) Q Consensus 1 m~vk~n~---------~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~d---tG~L~~S~~v~~~------~~~g~V~y~~ 62 (108) =++++.. .+...+.++..+||..++..|..++...+|.+ ||.|++|..+... .....|+|+. T Consensus 2 ~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~~k~~ap~~~~~tg~l~~~I~~~~~k~~~~g~~~v~Vg~~~ 81 (127) T protein:vir:12 2 ADMSFDGIDDLTQYFEKIGGDIEKVEPVALKAGGEIIAERQRSHVNRSDKKQPHMQDNITVSNVRESKDGVRFVAVGPNK 81 (127) T ss_pred eeeeehhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCChhHHHHhhhccccccccCceeEEEEeeCC Confidence 2333331 12334567789999999999999999999975 9999999875421 2356688864 Q ss_pred ---chhhhhccccCCCCCCCccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 63 ---PYAARQHEEVGWHHVDGQAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 63 ---pYA~~~h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) +|+.++++||... 
+++ +||++++..+++++.+.+.+.|++.|= T Consensus 82 ~~~~y~~f~E~GT~~~--~a~-Pf~~pa~~~~~~~~~~~~~~~~~~~lk 127 (127) T protein:vir:12 82 KVAYRGRFLEWGTSKM--PPQ-PFIEKGGKEGEGPAVELMERILTAPIK 127 (127) T ss_pred CCcceeeeeccCccCC--CCC-ccchHhHHHHHHHHHHHHHHHHHHhcC Confidence 5666677788644 343 799999999999999999999999999 No 72 >protein:vir:79638 Length: 146 # NCBI annotation: gp40 # Family: family:all:448 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285529;genbank:gi:148734512;genbank:GeneID:5219996 Probab=98.60 E-value=5.9e-10 Score=71.16 Aligned_cols=104 Identities=14% Similarity=0.082 Sum_probs=75.6 Q ss_pred CCc----cccH---HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeeccc-------------C------- Q lcl|NC_016654. 1 MPV----EFNY---GIAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASD-------------G------- 53 (108) Q Consensus 1 m~v----k~n~---~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~-------------~------- 53 (108) |.= .|-. .+...++++....+..++..++......+|+|||.|+.|-.+.++ + T Consensus 1 ma~~~~~sFa~~i~~~~~~ve~~~~~~~r~~a~~i~~~vv~~sPVDTGr~Ranw~vs~~~~~~~~~~~~dp~G~~t~~~~ 80 (146) T protein:vir:79 1 MADYSIREFHGNVDKWIEQVESGLNDVIQIFGEKVHGALVDIAPVDTGRFKANMQITANKPPLYALNQYDPDGEKIKAEG 80 (146) T ss_pred CCcchhHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhhccccceeecCcccccccCCCCCCcccHHHH Confidence 532 2332 356778888888899999999999999999999999999765431 0 Q ss_pred --------------cEEEEEecCchhhhhccccCCCCCCCccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 54 --------------MEAVVYFDTPYAARQHEEVGWHHVDGQAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 54 --------------~~g~V~y~~pYA~~~h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) .+-.|.+|+|||.+++||.+-+-|.| |.+..+... 
..|.+-..+++|+.|| T Consensus 81 ~~~i~~~~~g~~~~~~iyi~NnlpYA~~LEyG~S~QAP~G---~v~~~~~~~-~~~v~~a~~e~k~~~~ 145 (146) T protein:vir:79 81 RRTLYALLHGGGAIKSIYFSNMLIYANALEYGHSKQAPAG---VFGIVAIRL-RSYMAEAIREARKKNA 145 (146) T ss_pred HHHHHHHHhcccccceeEEeeCchhhhhhhccccCCCcch---HHHHHHHHH-HHHHHHHHHHHHhhcc Confidence 12345679999999999876555553 777665443 4444556668999999 No 73 >protein:vir:99528 Length: 92 # NCBI annotation: putative major tail protein # Family: family:all:180 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958541;genbank:gi:41179323;genbank:GeneID:2717166 Probab=98.60 E-value=1.8e-10 Score=74.01 Aligned_cols=77 Identities=17% Similarity=0.200 Sum_probs=62.0 Q ss_pred CCccccH--HHHHHHHHH-----HHHHHHHHHHHHHHHhhhcCCcccchhhcceeecc--cCcEEEEE---ecCchhhhh Q lcl|NC_016654. 1 MPVEFNY--GIAATVRGA-----AKSGLHDAAEVVKQEAIERCPKETGALRNSAGTAS--DGMEAVVY---FDTPYAARQ 68 (108) Q Consensus 1 m~vk~n~--~~~~~v~~a-----~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~--~~~~g~V~---y~~pYA~~~ 68 (108) |+++|.. .+.+.+++. +++.+...+..+.+++...+|+|||.|++|..++. ++.++.|+ -.+.||.|+ T Consensus 4 ~~i~~~Gld~L~~~L~~~~~~~~v~~vv~~~~~~l~~~ak~~ap~dTG~lrrSI~~~~~~~g~~~~v~~~gp~a~Ya~Yv 83 (92) T protein:vir:99 4 YSISWDGLDALDEALANQQNMNTVKKVVKKHTANLMTATQQAVPVDTGHLKQSAQIQISRDGFTGSVTYGGGLVNYAAYV 83 (92) T ss_pred eeeEeehHHHHHHHHHhhccHHHHHHHHHHHHHHHHHHHHHhCCCCccccceeeeEEeecCCeeEEEEeccCcccccccc Confidence 7777774 455556543 78888899999999999999999999999998775 34578875 459999999 Q ss_pred ccccCCCCC Q lcl|NC_016654. 69 HEEVGWHHV 77 (108) Q Consensus 69 h~~~~~~~~ 77 (108) +|||.|-.. 
T Consensus 84 E~GTR~M~A 92 (92) T protein:vir:99 84 EFGTRFMDS 92 (92) T ss_pred ccceeecCC Confidence 999987554 No 74 >protein:vir:105089 Length: 133 # NCBI annotation: Gp11 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006591;genbank:gi:46402097;genbank:GeneID:2777955 Probab=98.59 E-value=4.9e-10 Score=71.64 Aligned_cols=105 Identities=17% Similarity=0.134 Sum_probs=77.7 Q ss_pred CCccccH--HHHH-------HHH-HHHHHHHHHHHHHHHHHhhhcCCcccch----hhcceeecc----cCcEE----EE Q lcl|NC_016654. 1 MPVEFNY--GIAA-------TVR-GAAKSGLHDAAEVVKQEAIERCPKETGA----LRNSAGTAS----DGMEA----VV 58 (108) Q Consensus 1 m~vk~n~--~~~~-------~v~-~a~~~al~~~~~~v~~~s~~~vP~dtG~----L~~S~~v~~----~~~~g----~V 58 (108) |+++|.. .+.+ .+. +...+||..++..|..++...+|+++|+ |++|..+.. ....| .| T Consensus 2 ~~~~i~Gl~el~~~l~~L~~~~~~k~~~~Al~~~a~~i~~~ak~~ap~~~~~~~~~~~~~I~v~~~~~~~~~~~~~~v~v 81 (133) T protein:vir:10 2 IRMEVKGLDELERQLTALGEKVATKVLRDAGREALKVVEEDMKQHAGFDETSTGQHMRDSIKIRSSTRKAQGNAVVTLRV 81 (133) T ss_pred eeEeeehHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCcchhhhhhcccccccccccCccceEEEEe Confidence 8888884 3322 232 3457889999999999999999999998 666665432 12233 34 Q ss_pred Eec---CchhhhhccccCCCCCCCccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 
59 YFD---TPYAARQHEEVGWHHVDGQAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 59 ~y~---~pYA~~~h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) +++ ..|++++++||...- ++ +||++++..+++++.+.+.+++++.|- T Consensus 82 g~~~~~~~y~~f~E~GT~k~~--a~-PF~~pA~~~~~~~~~~~~~~~~~~~l~ 131 (133) T protein:vir:10 82 GPSKQHHMKVLAQEFGTVKQV--AD-PFIRPALDYNVQTVLRVLTVEIRNGIQ 131 (133) T ss_pred cCCCCccceEeeeccCCCCCC--CC-ccchHHHHHhHHHHHHHHHHHHHHHhh Confidence 444 247778888886443 33 699999999999999999999999999 No 75 >protein:vir:9414 Length: 125 # NCBI annotation: phi PVL orf 11-like protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803392;genbank:gi:29028704;genbank:GeneID:1258141 Probab=98.56 E-value=9.2e-10 Score=70.12 Aligned_cols=105 Identities=8% Similarity=0.050 Sum_probs=80.8 Q ss_pred CCccccH-HH-------HHHHHHHHHHHHHHHHHHHHHHhhhcCCcccch--hhcceeecc---cCc----EEEEEecCc Q lcl|NC_016654. 1 MPVEFNY-GI-------AATVRGAAKSGLHDAAEVVKQEAIERCPKETGA--LRNSAGTAS---DGM----EAVVYFDTP 63 (108) Q Consensus 1 m~vk~n~-~~-------~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~--L~~S~~v~~---~~~----~g~V~y~~p 63 (108) |+|+++- ++ .....+..++|+...+..++.....-+|.+++. |+.|..+.. +.+ ...|||+-+ T Consensus 1 M~v~v~~~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~~k~ 80 (125) T protein:vir:94 1 MGARIESNNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGYAKG 80 (125) T ss_pred CeeEeeHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEeccCCC Confidence 9999984 33 223445567888888998988889999999888 999987753 112 345888754 Q ss_pred ---hhhhhccccCCCCCCCccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 64 ---YAARQHEEVGWHHVDGQAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 64 ---YA~~~h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) ||+++++||....|. 
+|++++++.+++++.+++.++||+-+= T Consensus 81 ~~~~a~F~E~GT~k~~a~---pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:94 81 VSHRIHATEFGTMYQKPQ---LFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred CceEEEeccCCccCCCCC---chhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 676777787644333 799999999999999999999988888 No 76 >protein:vir:4704 Length: 125 # NCBI annotation: phi PVL ORF 11 homologue # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061636;genbank:gi:9635723;genbank:GeneID:1262995 Probab=98.56 E-value=9.2e-10 Score=70.12 Aligned_cols=105 Identities=8% Similarity=0.050 Sum_probs=80.8 Q ss_pred CCccccH-HH-------HHHHHHHHHHHHHHHHHHHHHHhhhcCCcccch--hhcceeecc---cCc----EEEEEecCc Q lcl|NC_016654. 1 MPVEFNY-GI-------AATVRGAAKSGLHDAAEVVKQEAIERCPKETGA--LRNSAGTAS---DGM----EAVVYFDTP 63 (108) Q Consensus 1 m~vk~n~-~~-------~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~--L~~S~~v~~---~~~----~g~V~y~~p 63 (108) |+|+++- ++ .....+..++|+...+..++.....-+|.+++. |+.|..+.. +.+ ...|||+-+ T Consensus 1 M~v~v~~~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~~k~ 80 (125) T protein:vir:47 1 MGARIESNNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGYAKG 80 (125) T ss_pred CeeEeeHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEeccCCC Confidence 9999984 33 223445567888888998988889999999888 999987753 112 345888754 Q ss_pred ---hhhhhccccCCCCCCCccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 64 ---YAARQHEEVGWHHVDGQAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 64 ---YA~~~h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) ||+++++||....|. 
+|++++++.+++++.+++.++||+-+= T Consensus 81 ~~~~a~F~E~GT~k~~a~---pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:47 81 VSHRIHATEFGTMYQKPQ---LFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred CceEEEeccCCccCCCCC---chhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 676777787644333 799999999999999999999988888 No 77 >protein:vir:81106 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429878;genbank:gi:156603931;genbank:GeneID:5525326 Probab=98.56 E-value=9.2e-10 Score=70.12 Aligned_cols=105 Identities=8% Similarity=0.050 Sum_probs=80.8 Q ss_pred CCccccH-HH-------HHHHHHHHHHHHHHHHHHHHHHhhhcCCcccch--hhcceeecc---cCc----EEEEEecCc Q lcl|NC_016654. 1 MPVEFNY-GI-------AATVRGAAKSGLHDAAEVVKQEAIERCPKETGA--LRNSAGTAS---DGM----EAVVYFDTP 63 (108) Q Consensus 1 m~vk~n~-~~-------~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~--L~~S~~v~~---~~~----~g~V~y~~p 63 (108) |+|+++- ++ .....+..++|+...+..++.....-+|.+++. |+.|..+.. +.+ ...|||+-+ T Consensus 1 M~v~v~~~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~~k~ 80 (125) T protein:vir:81 1 MGARIESNNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGYAKG 80 (125) T ss_pred CeeEeeHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEeccCCC Confidence 9999984 33 223445567888888998988889999999888 999987753 112 345888754 Q ss_pred ---hhhhhccccCCCCCCCccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 64 ---YAARQHEEVGWHHVDGQAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 64 ---YA~~~h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) ||+++++||....|. 
+|++++++.+++++.+++.++||+-+= T Consensus 81 ~~~~a~F~E~GT~k~~a~---pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:81 81 VSHRIHATEFGTMYQKPQ---LFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred CceEEEeccCCccCCCCC---chhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 676777787644333 799999999999999999999988888 No 78 >protein:vir:98342 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918934;genbank:gi:119443696;genbank:GeneID:4594504 Probab=98.56 E-value=9.2e-10 Score=70.12 Aligned_cols=105 Identities=8% Similarity=0.050 Sum_probs=80.8 Q ss_pred CCccccH-HH-------HHHHHHHHHHHHHHHHHHHHHHhhhcCCcccch--hhcceeecc---cCc----EEEEEecCc Q lcl|NC_016654. 1 MPVEFNY-GI-------AATVRGAAKSGLHDAAEVVKQEAIERCPKETGA--LRNSAGTAS---DGM----EAVVYFDTP 63 (108) Q Consensus 1 m~vk~n~-~~-------~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~--L~~S~~v~~---~~~----~g~V~y~~p 63 (108) |+|+++- ++ .....+..++|+...+..++.....-+|.+++. |+.|..+.. +.+ ...|||+-+ T Consensus 1 M~v~v~~~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~~k~ 80 (125) T protein:vir:98 1 MGARIESNNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGYAKG 80 (125) T ss_pred CeeEeeHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEeccCCC Confidence 9999984 33 223445567888888998988889999999888 999987753 112 345888754 Q ss_pred ---hhhhhccccCCCCCCCccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 64 ---YAARQHEEVGWHHVDGQAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 64 ---YA~~~h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) ||+++++||....|. 
+|++++++.+++++.+++.++||+-+= T Consensus 81 ~~~~a~F~E~GT~k~~a~---pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:98 81 VSHRIHATEFGTMYQKPQ---LFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred CceEEEeccCCccCCCCC---chhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 676777787644333 799999999999999999999988888 No 79 >protein:vir:79988 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430006;genbank:gi:156604061;genbank:GeneID:5525448 Probab=98.56 E-value=9.2e-10 Score=70.12 Aligned_cols=105 Identities=8% Similarity=0.050 Sum_probs=80.8 Q ss_pred CCccccH-HH-------HHHHHHHHHHHHHHHHHHHHHHhhhcCCcccch--hhcceeecc---cCc----EEEEEecCc Q lcl|NC_016654. 1 MPVEFNY-GI-------AATVRGAAKSGLHDAAEVVKQEAIERCPKETGA--LRNSAGTAS---DGM----EAVVYFDTP 63 (108) Q Consensus 1 m~vk~n~-~~-------~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~--L~~S~~v~~---~~~----~g~V~y~~p 63 (108) |+|+++- ++ .....+..++|+...+..++.....-+|.+++. |+.|..+.. +.+ ...|||+-+ T Consensus 1 M~v~v~~~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~~k~ 80 (125) T protein:vir:79 1 MGARIESNNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGYAKG 80 (125) T ss_pred CeeEeeHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEeccCCC Confidence 9999984 33 223445567888888998988889999999888 999987753 112 345888754 Q ss_pred ---hhhhhccccCCCCCCCccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 64 ---YAARQHEEVGWHHVDGQAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 64 ---YA~~~h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) ||+++++||....|. 
+|++++++.+++++.+++.++||+-+= T Consensus 81 ~~~~a~F~E~GT~k~~a~---pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:79 81 VSHRIHATEFGTMYQKPQ---LFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred CceEEEeccCCccCCCCC---chhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 676777787644333 799999999999999999999988888 No 80 >protein:vir:94994 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224022;genbank:gi:62327309;genbank:GeneID:5176822 Probab=98.56 E-value=1.1e-09 Score=69.67 Aligned_cols=100 Identities=12% Similarity=0.160 Sum_probs=73.7 Q ss_pred CCccccH-HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeeccc--------------------------- Q lcl|NC_016654. 1 MPVEFNY-GIAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASD--------------------------- 52 (108) Q Consensus 1 m~vk~n~-~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~--------------------------- 52 (108) ||...+- .+...+++..+..++.++..++.......|+|||.|++|-.+..+ T Consensus 1 msF~~~i~~~~~~ve~~~~~~~r~~a~~~~~~iv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~g~~t~~~~~~~i~~ 80 (131) T protein:vir:94 1 MSFALDVTRFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGSTPADGTTDATDKSGNTATGNATSFVLN 80 (131) T ss_pred CCcccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCchhhhhccchhccccccccccCCCCCCchhhHHHHHHHHhh Confidence 8775553 467788889999999999999999999999999999999765532 Q ss_pred ---CcEEEEEecCchhhhhccccCCCCCCCccchhhHHHHHhHHHHHHHHHHHHH Q lcl|NC_016654. 53 ---GMEAVVYFDTPYAARQHEEVGWHHVDGQAKYLENAVNATQATVAEVIGEAIR 104 (108) Q Consensus 53 ---~~~g~V~y~~pYA~~~h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir 104 (108) +.+-.|.+++|||.+++||.+-+-|.| |.+..+... 
..|.+-+.+++| T Consensus 81 ~~~g~~iyi~Nn~pYA~~LEyG~S~QAP~g---~v~~~~~~~-~~~v~~~~~e~k 131 (131) T protein:vir:94 81 AADWHTFTLTNNLPYAQRLEYGWSQQAPQG---FVRVNVSRF-QQLLNEEASKVK 131 (131) T ss_pred ccccceEEEeeCchhhhhhhccccCCCcch---HHHHHHHHH-HHHHHHHHHhcC Confidence 112458889999999999877555553 666665443 334444555666 No 81 >protein:vir:78380 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110844;genbank:gi:134288605;genbank:GeneID:5179643 Probab=98.56 E-value=1.3e-09 Score=69.38 Aligned_cols=100 Identities=13% Similarity=0.168 Sum_probs=74.0 Q ss_pred CCccccH-HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeeccc--------------------------- Q lcl|NC_016654. 1 MPVEFNY-GIAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASD--------------------------- 52 (108) Q Consensus 1 m~vk~n~-~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~--------------------------- 52 (108) ||...+- .+...+++..+..++.++..++.......|+|||.|++|-.+..+ T Consensus 1 msf~~~i~~~~~~ve~~~~~~~r~~a~~~~~~iv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~g~~t~~~~~~~i~~ 80 (131) T protein:vir:78 1 MSFALDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNAANFVLN 80 (131) T ss_pred CCcCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCchhhhccccceecccccccccCCCCCCchhhHHHHHHHHhh Confidence 8875553 467888999999999999999999999999999999999765542 Q ss_pred ---CcEEEEEecCchhhhhccccCCCCCCCccchhhHHHHHhHHHHHHHHHHHHH Q lcl|NC_016654. 53 ---GMEAVVYFDTPYAARQHEEVGWHHVDGQAKYLENAVNATQATVAEVIGEAIR 104 (108) Q Consensus 53 ---~~~g~V~y~~pYA~~~h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir 104 (108) +.+-.|++++|||.+++||.+-+-|.| |.+..+.... 
.+.+-+.+++| T Consensus 81 ~~~g~~iyi~Nn~pYA~~LEyG~S~QAP~G---~v~~~~~~~~-~~v~~~~~e~k 131 (131) T protein:vir:78 81 AADWHTFTLTNNLPYAQRLEYGWSQQAPQG---FVRVNVSRFQ-QLLNEEASKVK 131 (131) T ss_pred ccCCceEEEeeCchhhhHhhccccCCCcch---HHHHHHHHHH-HHHHHHHHhcC Confidence 112458889999999999887555553 6666654433 33344455566 No 82 >protein:vir:5745 Length: 135 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892056;genbank:gi:33770519;interpro:IPR010064;interpro:IPR011693;uniprot:Q7Y404;genbank:GeneID:2637451 Probab=98.52 E-value=1.1e-09 Score=69.78 Aligned_cols=105 Identities=15% Similarity=0.115 Sum_probs=76.8 Q ss_pred CCccccH----HHHH-------HH-HHHHHHHHHHHHHHHHHHhhhcCCccc----chhhcceeeccc---Cc----EEE Q lcl|NC_016654. 1 MPVEFNY----GIAA-------TV-RGAAKSGLHDAAEVVKQEAIERCPKET----GALRNSAGTASD---GM----EAV 57 (108) Q Consensus 1 m~vk~n~----~~~~-------~v-~~a~~~al~~~~~~v~~~s~~~vP~dt----G~L~~S~~v~~~---~~----~g~ 57 (108) |+++++- .+.+ .+ +++..+||..++..|...+...+|+|+ |.|++|..+... .+ ... T Consensus 1 M~~~~~i~Gl~el~~~l~~L~~~~~~k~~~~Al~~~a~~v~~~~k~~ap~~~~~~~g~l~~~I~i~~~k~~~~~~~v~v~ 80 (135) T protein:vir:57 1 MIPEIEISGLQELERRLIAVGEEVGTKILRDAGRAAMAVVEADMKQNAGYDNSSTNAHMRDSIKIRSSRGKAGSTVVVLR 80 (135) T ss_pred CceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhHHhhcccccccccccceeEEEE Confidence 7766662 2222 33 234568899999999999999999986 999999876532 22 234 Q ss_pred EEecCch---hhhhccccCCCCCCCccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 58 VYFDTPY---AARQHEEVGWHHVDGQAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 58 V~y~~pY---A~~~h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) |+++-+| +.++++||... 
+++ +||++++..+++++.+.+.++|++.|= T Consensus 81 vg~~~~~~~~~~f~E~GT~~~--~a~-PF~~pa~~~~~~~~~~~~~~~~~~~l~ 131 (135) T protein:vir:57 81 VGPTRSHYMKALAQEFGTIKQ--VAK-PFIRPALDYNKMQVLRILTVEIRDGLS 131 (135) T ss_pred ecCCCCcceeEeecccCCCCC--CCC-cchhHhHHHhHHHHHHHHHHHHHHHHH Confidence 5665554 55556677644 333 799999999999999999999999988 No 83 >protein:vir:102963 Length: 163 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945289;genbank:gi:39653724;uniprot:Q708M3;genbank:GeneID:2672877 Probab=98.49 E-value=2.8e-09 Score=67.44 Aligned_cols=107 Identities=19% Similarity=0.253 Sum_probs=80.1 Q ss_pred CCccccH----HHHHHHHH---------HHHHHHHHHHHHHHHHhhhcCCc---------------------------cc Q lcl|NC_016654. 1 MPVEFNY----GIAATVRG---------AAKSGLHDAAEVVKQEAIERCPK---------------------------ET 40 (108) Q Consensus 1 m~vk~n~----~~~~~v~~---------a~~~al~~~~~~v~~~s~~~vP~---------------------------dt 40 (108) ||+.||- .+.+.+.+ ..++.+..++..+++.+...+|+ +| T Consensus 1 m~~~~d~~~l~~f~k~l~~~~~~~~~~~~~~~~~~e~a~~ll~~vk~rtPv~~~~~~~~~~~~~~~k~~k~~~~~~~k~t 80 (163) T protein:vir:10 1 MSGGFDYRSFAKFANNFNRNANHAKVDRFMRQTLNYEGTELKSKVKERTPVGVYTDHWVEFTTKDGKHVKFWASAHGKQG 80 (163) T ss_pred CCCccCHHHHHHHHHHHHHHhhhcchHHHHHHHHHHHHHHHHHHHHHhCCcccchhhhhhhhhcccchhhhhcccccccc Confidence 9999995 24444433 34666677788888888888886 89 Q ss_pred chhhcceeec-----ccCcEEEEEecCchhhhhccccCCCC---CCCccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 41 GALRNSAGTA-----SDGMEAVVYFDTPYAARQHEEVGWHH---VDGQAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 41 G~L~~S~~v~-----~~~~~g~V~y~~pYA~~~h~~~~~~~---~~~~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) |+|++|-.+. .+..+.+|..++|||.++.||-.-.. -+| .++|+.+...-.+++.+.+++.|.+-|. 
T Consensus 81 G~lr~swk~~~~~k~~~~~~v~v~N~~~YA~~VE~GHR~~~gGfV~G-~fml~~s~~~~~~~~~~~~e~~l~~~l~ 155 (163) T protein:vir:10 81 GTLQKGWSKSRIEVSGRTYKQKVYNKVYYAPHVEYGHKTVNGGFVPG-QFFLHKTVEDTKSDMEKRVRDKYDGFMR 155 (163) T ss_pred chhhccceecceeecCCceEEEEEecCCccchhhcceeecCCceecc-chhhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 9999996653 34456789999999999988532111 112 3689999999999999999999988887 No 84 >protein:vir:1891 Length: 179 # NCBI annotation: gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037671;genbank:gi:9634129;genbank:GeneID:1262520 Probab=98.39 E-value=2.4e-09 Score=67.87 Aligned_cols=105 Identities=15% Similarity=0.149 Sum_probs=72.7 Q ss_pred CCccccH--HHH-------HHH-HHHHHHHHHHHHHHHHHHhhhcCCc-----ccchhhcceeeccc----CcEEE---- Q lcl|NC_016654. 1 MPVEFNY--GIA-------ATV-RGAAKSGLHDAAEVVKQEAIERCPK-----ETGALRNSAGTASD----GMEAV---- 57 (108) Q Consensus 1 m~vk~n~--~~~-------~~v-~~a~~~al~~~~~~v~~~s~~~vP~-----dtG~L~~S~~v~~~----~~~g~---- 57 (108) |+|+|.. .+. ..+ .++..+||..+++.|..++...+|. ++|.|..|..+.-+ ..+|. T Consensus 5 ~~~~i~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~~~~~~~~~g~~~~~ 84 (179) T protein:vir:18 5 VEVSLTGLESLLGKMEAVSEVTRNKAGRFALRKAANIIRDRARSNASRVDDPLTKEAIHKNIVASFSSKQFRRTGDLAFR 84 (179) T ss_pred EEEEeecHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccccchhhhhhheeecccccccccccceeEe Confidence 6776663 222 233 3456889999999999999999965 56677766544311 11111 Q ss_pred ---------------------E-------------EecCchhhhhccccCCCCCCCccchhhHHHHHhHHHHHHHHHHHH Q lcl|NC_016654. 58 ---------------------V-------------YFDTPYAARQHEEVGWHHVDGQAKYLENAVNATQATVAEVIGEAI 103 (108) Q Consensus 58 ---------------------V-------------~y~~pYA~~~h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~i 103 (108) + .-+++|++++++||.. 
.+++ +||.+++..+++++.+.|.+.| T Consensus 85 vgv~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~y~~fvEfGT~k--mpa~-PFlrPA~~~~~~~a~~~i~~~l 161 (179) T protein:vir:18 85 VGVMGGARQYANTKANVRKGRAGKTYKTSGDKGNPGGDTWYWRFLEFGTEH--TSAR-PILRPAMNGVDNDVINVFSTEM 161 (179) T ss_pred eecccccccccccccccccCcccccccccccccCCCCccceeEEeccCCCC--CCCC-ccchhhHHhhHHHHHHHHHHHH Confidence 1 1257899999998863 3333 7999999999999999888887 Q ss_pred HHhcC Q lcl|NC_016654. 104 RRSIA 108 (108) Q Consensus 104 r~~Lg 108 (108) ++.|- T Consensus 162 ~~~i~ 166 (179) T protein:vir:18 162 GKAID 166 (179) T ss_pred HHHHH Confidence 77776 No 85 >protein:vir:107703 Length: 147 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003902;genbank:gi:45686318;genbank:GeneID:2773043 Probab=98.37 E-value=7.3e-09 Score=65.21 Aligned_cols=103 Identities=13% Similarity=0.087 Sum_probs=70.3 Q ss_pred CCc----cccH---HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeeccc--------------------- Q lcl|NC_016654. 1 MPV----EFNY---GIAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASD--------------------- 52 (108) Q Consensus 1 m~v----k~n~---~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~--------------------- 52 (108) |.= .|.. .+...++......++.++..++......+|+|||.||+|-.+..+ T Consensus 1 ma~~~~~~F~~~i~~~~~~ve~~~~~~~r~~a~~i~~~vv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~dp~g~~t~a~~ 80 (147) T protein:vir:10 1 MANYQIRRFQGEIDAWINAAESTLEHAIEIFVRDVHDALVSRSPVDTGRFKGNWQITFNEIPNHALNRYDKTGGVVRGEE 80 (147) T ss_pred CCCcchhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhhccccceeecCccccccCCcCCCccchhhhh Confidence 422 2332 356778888888889999999999999999999999999654311 Q ss_pred -------------CcEEEEEecCchhhhhccccCCCCCCCccchhhHHHHHhHHHHHHHHHHHHHH---hc Q lcl|NC_016654. 
53 -------------GMEAVVYFDTPYAARQHEEVGWHHVDGQAKYLENAVNATQATVAEVIGEAIRR---SI 107 (108) Q Consensus 53 -------------~~~g~V~y~~pYA~~~h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~---~L 107 (108) ..+-.|++++|||.+++||.+-+-|.| |.+-.+.....-+.+ ...++|+ .| T Consensus 81 ~~~~~~~~~~~~~~~~iyi~Nn~pYA~~LEyG~S~QAP~G---~V~~t~q~~~~~v~~-~~~e~k~~~~~~ 147 (147) T protein:vir:10 81 QAKTYGMFSRGGAITSVHFSNMLIYANALEYGHSQQAPSG---VVGLVALRLRSYMAD-AIKQARRQQNAL 147 (147) T ss_pred hHHHHHHhhhccCcceEEEeeCcchhhhhhccccCCCCch---HHHHHHHHHHHHHHH-HHHHHHhhhccC Confidence 113467889999999999877555553 777665544433333 3334444 34 No 86 >protein:vir:97190 Length: 148 # NCBI annotation: hypothetical protein ORF030 # Family: family:all:448 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294538;genbank:gi:149408259;genbank:GeneID:5237055 Probab=98.33 E-value=7.8e-09 Score=65.02 Aligned_cols=103 Identities=19% Similarity=0.189 Sum_probs=76.7 Q ss_pred CCccccH-----HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeeccc-----------C----------- Q lcl|NC_016654. 1 MPVEFNY-----GIAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASD-----------G----------- 53 (108) Q Consensus 1 m~vk~n~-----~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~-----------~----------- 53 (108) |.-.+.+ .+..+++++....+..++..++.......|+|||.+|.|-.+..+ + T Consensus 1 m~~~~sFa~~i~~~~~~ve~~~~~~~r~~a~~i~~~vv~~sPVdTGrfRanw~vs~~~p~~~~~~~~dp~~~G~~~~~~~ 80 (148) T protein:vir:97 1 MPSLSEFSRRITLRGRKVAEGADALTRKVALAADQAVVSGTPVDTGRARSNWIAAIGSAPSSVIDAYSPGEAGSTEAANT 80 (148) T ss_pred CCccchhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhhhhhhheeecccccccccccCCCCCCcccccch Confidence 7766554 356778888888889999999999999999999999999655421 0 Q ss_pred -----------------cEEEEEecCchhhhhccccCCCCCCCccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 
54 -----------------MEAVVYFDTPYAARQHEEVGWHHVDGQAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 54 -----------------~~g~V~y~~pYA~~~h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) .+-.|.+|.|||.+++||.+-+-|.| |.+..+.....-+.+ ++.+|+.-| T Consensus 81 ~~~i~~~~~vi~~~k~g~~iyi~NnlpYA~~LEyG~S~QAP~G---~v~~t~~~~~~~v~~--~~~~~~~~~ 147 (148) T protein:vir:97 81 QAAIDQAESVIRGYNYGEEIHITNNLPYIQRLNDGYSAQAPAN---FVEQAVLEAVQVVQF--GRVVDGDPG 147 (148) T ss_pred hHHHHHHHHHhhccCCCceEEEeecchhhhHhhccccCCCcch---HHHHHHHHHHHHHHh--hhhhcCCCC Confidence 01247789999999999877655553 777776655544444 677888888 No 87 >protein:vir:102338 Length: 116 # NCBI annotation: hypothetical protein # Family: family:all:26573 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529563;genbank:gi:90592648;genbank:GeneID:3974470 Probab=98.31 E-value=4.3e-09 Score=66.45 Aligned_cols=96 Identities=10% Similarity=0.093 Sum_probs=76.3 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhcCCc---ccchhhcceee-cccCcEEEEEecCchhhhhccccCCCCCC-------C-- Q lcl|NC_016654. 13 VRGAAKSGLHDAAEVVKQEAIERCPK---ETGALRNSAGT-ASDGMEAVVYFDTPYAARQHEEVGWHHVD-------G-- 79 (108) Q Consensus 13 v~~a~~~al~~~~~~v~~~s~~~vP~---dtG~L~~S~~v-~~~~~~g~V~y~~pYA~~~h~~~~~~~~~-------~-- 79 (108) |.+..+++++..+..+++.+...+|+ |||+|++|=.+ .+....+.|.-+++||.++.||-.-+.-. + T Consensus 1 l~~~~~~~~~~~a~~l~~~vk~rTPv~~~d~G~LR~sW~~g~v~k~~~~v~N~~eYA~~VE~GHRq~~g~g~~~~~~gkr 80 (116) T protein:vir:10 1 MSKNLRRAKNNIGNKLLRKVKPKTPVAKIDGGTARKSWKYKELNLFDGVVSNNVEYIHHLEYGHRTRQGTGTSENYRPKP 80 (116) T ss_pred CchHHHHHHHHHHHHHHHHHHhhCCCCcCCCcccccCceeeeeeccCceeecCCcccccccCCceeeCCcceeccccccc Confidence 78888899999999999999999997 78999999776 34555567888899999998853321111 1 Q ss_pred -------ccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 
80 -------QAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 80 -------~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) ...+|+....+-+..+.+++++.|.+-|- T Consensus 81 lk~~~V~G~fml~~s~~e~~~~~~~~~~~~~~~~l~ 116 (116) T protein:vir:10 81 NGISFVPGVFMLARSVDEMSSIIDDELNQIIIDFWN 116 (116) T ss_pred ccCCccCceehHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 12578999988888899998888888888 No 88 >protein:vir:1386 Length: 149 # NCBI annotation: Gp9 protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612838;genbank:gi:20065972;genbank:GeneID:935787 Probab=98.29 E-value=9.9e-09 Score=64.47 Aligned_cols=105 Identities=10% Similarity=0.107 Sum_probs=73.6 Q ss_pred CC----ccccH--HHHH---------HHHHHHHHHHHHHHHHHHHHhhhcCCcc-------------cchhhcceeecc- Q lcl|NC_016654. 1 MP----VEFNY--GIAA---------TVRGAAKSGLHDAAEVVKQEAIERCPKE-------------TGALRNSAGTAS- 51 (108) Q Consensus 1 m~----vk~n~--~~~~---------~v~~a~~~al~~~~~~v~~~s~~~vP~d-------------tG~L~~S~~v~~- 51 (108) |+ |+|.. ++.+ .++++..+||..++..|..++...+|.+ +|.++.+..+.- T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~~k~~~~Al~~ga~~v~~~~k~~aP~~~~~~~~~~~~~~~~~~~~d~i~~~~~ 80 (149) T protein:vir:13 1 MSDGWEIKFEGLDDLIKTFEQLGTEKENEDVEKSILKECGDLAKKTVAPLIHISDDNSKSGRKGSRPPGHAANNIPEPKI 80 (149) T ss_pred CCceeEEEeecHHHHHHHHHhcccHHHHHHHHHHHHHHHHHHHHHHHHHhCCccCCccccccccccccchhhhcceeccc Confidence 65 45542 2222 3455677899999999999999999975 334554444321 Q ss_pred --cCc--EEEEEe------cCchhhhhccccCCCCCCCccchhhHHHHHhHHHHHHHHHH----HHHHhcC Q lcl|NC_016654. 52 --DGM--EAVVYF------DTPYAARQHEEVGWHHVDGQAKYLENAVNATQATVAEVIGE----AIRRSIA 108 (108) Q Consensus 52 --~~~--~g~V~y------~~pYA~~~h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~----~ir~~Lg 108 (108) ..+ ...||| ++.|++++++||....|. 
+||++++...++++.+++.+ .|++.|| T Consensus 81 ~~~~g~~~~~VG~~~~~~~~~~y~~f~E~GT~k~~a~---pF~~pa~~~~~~~~~~~~~~~l~k~i~~~lG 148 (149) T protein:vir:13 81 RKKKGNLQCVVGWEKSDNTPFYYMKMEEWGTSERPPH---HAFGKTNKILKRVYDNIAQKKYDNFVKEKLG 148 (149) T ss_pred ccccceeEEEeeccCCCCCccceeeeeccCccCCCCC---ccchHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 111 346877 457999999998754333 79999999999999876655 6899999 No 89 >protein:vir:96774 Length: 152 # NCBI annotation: hypothetical phage protein # Family: family:all:448 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224253;genbank:gi:62362388;genbank:GeneID:3345713 Probab=98.28 E-value=1.4e-08 Score=63.73 Aligned_cols=99 Identities=13% Similarity=0.134 Sum_probs=76.4 Q ss_pred CCccccH-HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCc--------------ccchhhcceeecccC------------ Q lcl|NC_016654. 1 MPVEFNY-GIAATVRGAAKSGLHDAAEVVKQEAIERCPK--------------ETGALRNSAGTASDG------------ 53 (108) Q Consensus 1 m~vk~n~-~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~--------------dtG~L~~S~~v~~~~------------ 53 (108) ||...+- .+...++...+..++.++..++.......|+ |||.+|.|-.++.+. T Consensus 11 msFaa~i~~~~~~~e~~~~~~~R~~~~~i~~~vv~~sPVg~~~~~~~~a~~~ydtGrfRanw~vS~~~p~~~~~~~~~~~ 90 (152) T protein:vir:96 11 MSWSKSLKNIIVKNENLTEKQLRAGLFDAANTVILGSPVGAPELWQQPAPNYYRAGSYRSNHRVSISKITSFEKGISSQS 90 (152) T ss_pred ccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccchhhhhhhheeeecCCCcccccCCCCC Confidence 8877774 4677888889999999999999998889999 999999997665321 Q ss_pred ----------------cEEEEEecCchhhhhccccCCCCCCCccchhhHHHHHhHHHHHHHHHHHHHHh Q lcl|NC_016654. 
54 ----------------MEAVVYFDTPYAARQHEEVGWHHVDGQAKYLENAVNATQATVAEVIGEAIRRS 106 (108) Q Consensus 54 ----------------~~g~V~y~~pYA~~~h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~~ 106 (108) .+-.|.+|.|||.+++||.+-+-|.| |.+..+ .+|.+++++++|-+ T Consensus 91 ~t~~~~~~~i~~~~~g~~iyi~NnlPYA~~LEyG~S~QAP~G---~vr~t~----~~~~~~v~ea~~~~ 152 (152) T protein:vir:96 91 SIMMDLQSDIAKFKIGETLFMTNPLPYATSIEYGHSSQAPNG---VYRPAV----RRLVKFLNTELKAK 152 (152) T ss_pred chHHHHHHHHhhccccceEEEeeCchhhhHhhccccCCCCch---HHHHHH----HHHHHHHHHHhccC Confidence 12257889999999999876555554 555544 46777888888888 No 90 >protein:vir:4347 Length: 164 # NCBI annotation: Orf14 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061510;genbank:gi:9635606;genbank:GeneID:1262873 Probab=98.24 E-value=5.5e-09 Score=65.87 Aligned_cols=105 Identities=16% Similarity=0.147 Sum_probs=73.4 Q ss_pred CCccccH--HHHH-------HH-HHHHHHHHHHHHHHHHHHhhhcCCc-----ccchhhcceeecccC------------ Q lcl|NC_016654. 1 MPVEFNY--GIAA-------TV-RGAAKSGLHDAAEVVKQEAIERCPK-----ETGALRNSAGTASDG------------ 53 (108) Q Consensus 1 m~vk~n~--~~~~-------~v-~~a~~~al~~~~~~v~~~s~~~vP~-----dtG~L~~S~~v~~~~------------ 53 (108) |+|+|.. .+.+ .+ +++..+||..+++.|..++...+|. ++|.|++|..+..+. T Consensus 5 ~~~~i~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~~~~~~~~~~~~~~~ 84 (164) T protein:vir:43 5 VEFSITGLDSLLGKLDSVTDDVKRRGGRAALRKAAMIVVQAAKQGAEKVDDPGTGRSISDNIALRWNGRLFKRTGDLGFR 84 (164) T ss_pred eEEeeecHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccCCCccchhhhhhhhhcccCccccccceeEE Confidence 5566552 2222 22 2456789999999999999999986 667888776442100 Q ss_pred -----------cEEE----EEecCchhhhhccccCCCCCCCccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 54 -----------MEAV----VYFDTPYAARQHEEVGWHHVDGQAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 54 -----------~~g~----V~y~~pYA~~~h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) .++. ...+++|++++++||... 
+++ +||.+++..+++++.+.+.+.|++.|- T Consensus 85 vg~~~~~~~~~~~~~~~~~~~~~~~y~~f~EfGT~km--~a~-PFlrPA~~~~k~~~~~~~~~~l~~~i~ 151 (164) T protein:vir:43 85 IGVLHGAVLPKKGERSDKTANAPTPHWRLLEFGTEDM--RAQ-PFMRSALADNIAEVTSTFVSEYEKGID 151 (164) T ss_pred ecccccccccccccccccCCCCCcceEEEeecCCCCC--CCC-cchhhhHHHhHHHHHHHHHHHHHHHHH Confidence 0011 113468999999998743 333 699999999999999999888888886 No 91 >protein:vir:80425 Length: 134 # NCBI annotation: BcepGomrgp15 # Family: family:all:448 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210235;genbank:gi:146329927;genbank:GeneID:5123534 Probab=98.23 E-value=1.4e-08 Score=63.65 Aligned_cols=100 Identities=14% Similarity=0.186 Sum_probs=69.1 Q ss_pred CCccccH-HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecccC-------------------------- Q lcl|NC_016654. 1 MPVEFNY-GIAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASDG-------------------------- 53 (108) Q Consensus 1 m~vk~n~-~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~~-------------------------- 53 (108) ||..-+- .+...+++.....++.++..++.......|+|||.||.|-.+.++. T Consensus 1 msF~~~i~~~~~~ve~~~~~~~r~~a~~~~~~vv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~g~~~~~~~~~~~~v 80 (134) T protein:vir:80 1 MSYTDRFNVIAKGIEDNVDNLVKNVALAIGSNVIADTPILTGQARRNWQTELNQMPESVLDIPESPSEGMDEALQVLQQT 80 (134) T ss_pred CCcccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhhhcccceeecCcccccccCcCCCCccchhhHHHHHHH Confidence 7776664 4678899999999999999999999999999999999997655321 Q ss_pred -------cEEEEEecCchhhhhccccCCCCCCCccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 54 -------MEAVVYFDTPYAARQHEEVGWHHVDGQAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 54 -------~~g~V~y~~pYA~~~h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) .+-.|.+|+|||.+++||.+-+-|.| |.+....+. 
.+++++ +|-=-- T Consensus 81 i~~~k~g~~iyi~Nn~pYA~~LEyG~S~QAP~G---~v~~t~~~~----~~~v~~-~~~~~~ 134 (134) T protein:vir:80 81 VGQYKAGDTVHITNNAPYIKELNSGSSQQAPAN---FVETSIMRA----TRLIRN-VKVVPQ 134 (134) T ss_pred HhhccCcceEEEeeCchhhhhhhccccCCCcch---HHHHHHHHH----HHHHHh-hccCCC Confidence 11347789999999999877655553 655544333 233322 110000 No 92 >protein:vir:94944 Length: 121 # NCBI annotation: hypothetical protein phage protein # Family: family:all:448 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239282;genbank:gi:66392064;genbank:GeneID:5076589 Probab=98.18 E-value=1.5e-08 Score=63.44 Aligned_cols=89 Identities=18% Similarity=0.243 Sum_probs=68.0 Q ss_pred CCccccH---HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecccC------------------------ Q lcl|NC_016654. 1 MPVEFNY---GIAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASDG------------------------ 53 (108) Q Consensus 1 m~vk~n~---~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~~------------------------ 53 (108) |+..|+. .+...+++..+..++.++..+........|+|||.+|.|-.+..+. T Consensus 2 ~~~sf~~~i~~~~~~ve~~~~~~~r~~~~~~~~~vv~~sPVdtGrfRanw~vs~~~p~~~~~~~~dp~g~~t~~~~~~~~ 81 (121) T protein:vir:94 2 ISMKFNVNLSRLRSNLREEAKKKAIRIAQEIVNGVIARSPVLAGDYRSSWNVSEGSMEFKFNNGGNPANPTPAPAIVVSS 81 (121) T ss_pred ccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCchhhhhccccccccCcccccCCCCCCCcchhHHHHHHHH Confidence 6666664 4567788888888888999999998999999999999997665321 Q ss_pred ----cEEEEEecCchhhhhccccCCCCCCCccchhhHHHHHhH Q lcl|NC_016654. 
54 ----MEAVVYFDTPYAARQHEEVGWHHVDGQAKYLENAVNATQ 92 (108) Q Consensus 54 ----~~g~V~y~~pYA~~~h~~~~~~~~~~~~k~le~a~~~~~ 92 (108) .+-.|.+|.|||.+++||.+-+-|.| +.+-.+.+.+ T Consensus 82 ~~~~~~iyi~NnlpYA~~LE~G~S~QAP~G---~v~~t~~~~q 121 (121) T protein:vir:94 82 NVALPHFYITNGAPYAQQLEKGSSTQAPLG---IVRVTLASLR 121 (121) T ss_pred hhccceEEEeeCcchhhhhhcccCCCCcch---HHHHHHHhhC Confidence 11258889999999999887666664 6666666655 No 93 >protein:vir:102154 Length: 119 # NCBI annotation: phage protein, HK97 gp10 family # Family: family:all:10671 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699937;genbank:gi:110804042;genbank:GeneID:4206698 Probab=98.10 E-value=3.4e-08 Score=61.55 Aligned_cols=106 Identities=15% Similarity=0.184 Sum_probs=81.1 Q ss_pred CCccccH--HHHHHH-------HHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecccCcEEEEEecC---chhhhh Q lcl|NC_016654. 1 MPVEFNY--GIAATV-------RGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASDGMEAVVYFDT---PYAARQ 68 (108) Q Consensus 1 m~vk~n~--~~~~~v-------~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~~~~g~V~y~~---pYA~~~ 68 (108) -++.|+. .+.+.+ ++=..+||..+++.|+.++..-+|++||.|.....+--..+-+.|+.+- -|...+ T Consensus 2 a~iel~G~del~~~l~~~g~~~~~ie~kAlk~g~e~I~~~~~~n~P~~tg~lkkik~~~kk~g~~~VG~~ks~~fy~kF~ 81 (119) T protein:vir:10 2 ASLEIEGFEEFEKFISEDMVLDESTKRKGIKAGITKIGKAIEKNSPIKSGRLSKVKIRVKNTGLATEGTASSSEFYDIFQ 81 (119) T ss_pred ceeehhhHHHHHHHHHhhhhhhHHHHHHHHHHHhHHHHHHHhhcCCcccCCcceeeeeeecCceeEeccCCcchhhhhhc Confidence 4666663 333332 3336778888999999999999999999999844333334556777764 688888 Q ss_pred ccccCCCCCCCccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 
69 HEEVGWHHVDGQAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 69 h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) .+||+ ..+++..||+++.+..+++..+++.++|++.|= T Consensus 82 EFGTS--km~a~~pF~~~a~~~~~~eA~~~~~~el~~~~r 119 (119) T protein:vir:10 82 NFGTS--EQKAHVGYFDRAVDETTNEAVEEVAEIIFRKMR 119 (119) T ss_pred ccccc--ccCCCCCccccccccChHHHHHHHHHHHHHhcC Confidence 88886 444555699999999999999999999999999 No 94 >protein:vir:95157 Length: 144 # NCBI annotation: hypothetical protein ORF019 # Family: family:all:448 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293426;genbank:gi:148912847;genbank:GeneID:5228232 Probab=98.01 E-value=5.6e-08 Score=60.35 Aligned_cols=99 Identities=16% Similarity=0.169 Sum_probs=65.8 Q ss_pred CCcc---ccH---HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecccC--------------------- Q lcl|NC_016654. 1 MPVE---FNY---GIAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASDG--------------------- 53 (108) Q Consensus 1 m~vk---~n~---~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~~--------------------- 53 (108) |.-- |.. .+...++++....+..++..++.......|+|||.+|.|-.+..+. T Consensus 1 MA~~~~~f~~~i~~~~~~ve~~~~~~~r~~a~~v~~~vv~~sPVDTGrfRanw~vs~~~p~~~~~~~~~~~~~~~t~d~s 80 (144) T protein:vir:95 1 MAKSLLDLADRLEKKAKAIDEAASQNAVDTALAIVGDLAYKTPVDTSQALSNWIVTLESPSGQQIKPHFPGSQGSTQRAS 80 (144) T ss_pred CchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccceeccccccccccccccccccccCCCc Confidence 6642 221 3566788888888889999999999999999999999997655331 Q ss_pred ------------------cEEEEEecCchhhhhccccCCCCCCCccchhhHHHHHhHHHHHHHHHHHHHH Q lcl|NC_016654. 54 ------------------MEAVVYFDTPYAARQHEEVGWHHVDGQAKYLENAVNATQATVAEVIGEAIRR 105 (108) Q Consensus 54 ------------------~~g~V~y~~pYA~~~h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~ 105 (108) .+-.|.+|.|||.+++||.+-+-|.| |...++.+...-+.+. .|.. 
T Consensus 81 g~~tl~~~~~vi~~~~~g~~iyi~NnlpYA~~LEyG~S~QAP~G---~vr~~~q~~~~~v~~~---~~~~ 144 (144) T protein:vir:95 81 AAETLNSAKLVLRNKKPGQAIFITNNLPYIRRLNDGYSAQAPAG---FVERAVLIGRKMRKKF---KIKD 144 (144) T ss_pred hhHHHHHHHHHHhhcCccceEEEeeCchhhhhhhccccCCCcch---HHHHHHHHHHHHHHhh---ccCC Confidence 12247789999999999887666664 5554444333222221 1111 No 95 >protein:vir:9879 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:2718 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795641;genbank:gi:28876400;genbank:GeneID:1257931 Probab=97.90 E-value=4.4e-08 Score=60.89 Aligned_cols=100 Identities=16% Similarity=0.179 Sum_probs=68.1 Q ss_pred ccH--HHHHHHHHH----HHHHHHHHHHHHHHHhhhc--CCc-------ccchhhcceeecc--cCcEEEEE---ecCch Q lcl|NC_016654. 5 FNY--GIAATVRGA----AKSGLHDAAEVVKQEAIER--CPK-------ETGALRNSAGTAS--DGMEAVVY---FDTPY 64 (108) Q Consensus 5 ~n~--~~~~~v~~a----~~~al~~~~~~v~~~s~~~--vP~-------dtG~L~~S~~v~~--~~~~g~V~---y~~pY 64 (108) |.. .+++.+++. +++-+..-+-++-..+... +|+ |||+|++|..... ++.+|.++ |.+.| T Consensus 1 i~G~~~L~~~Lk~~s~~dvk~VVkkN~ael~~r~q~~~~~pv~~~~k~~dTG~lkRSi~l~~~~~g~~~~vgp~g~t~dY 80 (127) T protein:vir:98 1 MTGMPALEVKLRSMSEKRWDRVANKNLTEMFNRAARPPGTPIGKNTKRHKSGELLRSRRLKKVNSSKDVITGNFGYIKDY 80 (127) T ss_pred CcChHHHHHHHHHhhHHHHHHHHhhhhHHHHHHHHhccCCceeccccccCcccceeeeEEEEecCCceEEeccCcccccc Confidence 332 233333333 2333333344455555554 777 9999999987553 44566655 56999 Q ss_pred hhhhccccCCCC-------CCCccchhhHHHHHhHHHHHHHHHHHHHH Q lcl|NC_016654. 65 AARQHEEVGWHH-------VDGQAKYLENAVNATQATVAEVIGEAIRR 105 (108) Q Consensus 65 A~~~h~~~~~~~-------~~~~~k~le~a~~~~~~~i~~~i~~~ir~ 105 (108) |.|++||+.|-. 
.+|| +||.++++..+....+-+.+.+|+ T Consensus 81 apyvEyGTR~m~~~~~~gf~~aq-p~l~paf~~Qk~iF~~DL~~l~k~ 127 (127) T protein:vir:98 81 APHVEYGHRIVRNGKQVGYANGT-KYLFNNVKKQREIYRQDMLNELRR 127 (127) T ss_pred cceeecceeeeecccccccccCc-cccccchHHHhHHHHHHHHHHhcC Confidence 999999998632 2343 799999999999999999999999 No 96 >protein:vir:81147 Length: 126 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285816;genbank:gi:148747737;genbank:GeneID:5247190 Probab=97.87 E-value=1.7e-07 Score=57.73 Aligned_cols=107 Identities=20% Similarity=0.278 Sum_probs=77.5 Q ss_pred CC-ccccH----------HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeeccc---CcEEEEEecCchhh Q lcl|NC_016654. 1 MP-VEFNY----------GIAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASD---GMEAVVYFDTPYAA 66 (108) Q Consensus 1 m~-vk~n~----------~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~---~~~g~V~y~~pYA~ 66 (108) |+ |.++. .+...+.+..+.++..++..+..+....+|++||.|++|-.+... ++...|+|+.+..+ T Consensus 1 Ma~i~id~la~~I~~~L~~y~~~v~~~v~~~v~~~a~~~~~~ik~~aP~rTG~y~ksw~vk~~~~~g~~~~vv~~~~~~~ 80 (126) T protein:vir:81 1 MANITIDRLADELLQAVKEYTDDVAEGVRKKVDETARKVLKEAQALAPKRTGEYARTFTITKEDGYGTTKRIIWNKKHYR 80 (126) T ss_pred CcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcccchhhccccccccccCCcceEEEeccCCCC Confidence 43 44443 122456667888899999999999999999999999999766543 33457899998777 Q ss_pred hhcc-ccCCCCCCC-c---cchhhHHHHHhHHHHHHHHHHHHHHhc Q lcl|NC_016654. 67 RQHE-EVGWHHVDG-Q---AKYLENAVNATQATVAEVIGEAIRRSI 107 (108) Q Consensus 67 ~~h~-~~~~~~~~~-~---~k~le~a~~~~~~~i~~~i~~~ir~~L 107 (108) ..|. 
+++....+| + .+|+.++......++.+.|.+.|+.+= T Consensus 81 l~HLLEfGha~r~gGrV~a~Phi~Pa~e~~~~~~~~~i~~~l~~gg 126 (126) T protein:vir:81 81 RVHLLEFGHAKVNGGRVKEYPHLRPAYDKHGARLPDELKRVIENGG 126 (126) T ss_pred ceeeeecceecCCCCccCCCcchHHHHHHHHHHHHHHHHHHhhcCC Confidence 7565 333333322 1 369999999999999998888888443 No 97 >protein:vir:100652 Length: 134 # NCBI annotation: 77ORF029 # Family: family:all:589 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958610;genbank:gi:41189542;genbank:GeneID:2743798 Probab=97.55 E-value=1e-06 Score=53.48 Aligned_cols=105 Identities=15% Similarity=0.244 Sum_probs=77.0 Q ss_pred CCccccH--HHHHH---------HHHHHHHHHHHHHHHHHHHhhh--cCCcccchhhcceeeccc----C-cEEEEEecC Q lcl|NC_016654. 1 MPVEFNY--GIAAT---------VRGAAKSGLHDAAEVVKQEAIE--RCPKETGALRNSAGTASD----G-MEAVVYFDT 62 (108) Q Consensus 1 m~vk~n~--~~~~~---------v~~a~~~al~~~~~~v~~~s~~--~vP~dtG~L~~S~~v~~~----~-~~g~V~y~~ 62 (108) |||++.. ++++. +.+-..+||..+++.|+.+... -+..|||.+..+..++-- + .+..|+|.. T Consensus 1 MsvevkGv~eil~~LE~k~g~~~~~ri~dkAL~~age~v~~~~K~~~~~fkDTGati~ev~~s~p~~~~G~r~V~vgW~G 80 (134) T protein:vir:10 1 MSVKVTGDKALERELEKHFGIKEMVKVQDKALIAGAKVIVEEIKKQLKPSEDSGALISEIGRTEPEWIKGKRTVTIRWRG 80 (134) T ss_pred CeEEeecHHHHHHHHHHhhchhhhhhhhhHHHHHHhHHHHHHHHhhcCccccccceeccEeecCeeecCCceEEEEEEEc Confidence 9999984 34443 4445889999999999999764 344599999999876532 2 367899988 Q ss_pred chhhhh--cc---ccC------CCCCCCccchhhHHHHHhHHHHHHHHHHHHHHh Q lcl|NC_016654. 63 PYAARQ--HE---EVG------WHHVDGQAKYLENAVNATQATVAEVIGEAIRRS 106 (108) Q Consensus 63 pYA~~~--h~---~~~------~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~~ 106 (108) |--||- |. |+. |-+|.|.+.. 
++++...+..+.+++.++|++- T Consensus 81 ~~~R~~ivHLnE~Gyt~~r~Gk~i~PrG~G~i-~~a~~~~e~~~~~~ik~eL~kl 134 (134) T protein:vir:10 81 PFERFRIVHLIENGHVEKKSGKFVKPKAMGGI-NRAIRQGQNKYFETLKRELKKL 134 (134) T ss_pred CCceeeEEEeeecceeecCCCCeeccchhhHH-HHHHHhhhHHHHHHHHHHHhcC Confidence 855553 54 331 2356666543 7799999999999999999987 No 98 >protein:vir:101302 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908835;genbank:gi:118725099;genbank:GeneID:4555873 Probab=97.47 E-value=1.5e-06 Score=52.58 Aligned_cols=105 Identities=16% Similarity=0.179 Sum_probs=78.3 Q ss_pred CCccccH--HHHHH---------HHHHHHHHHHHHHHHHHHHhhhcCCc--ccchhhcceeecc----cC-cEEEEEecC Q lcl|NC_016654. 1 MPVEFNY--GIAAT---------VRGAAKSGLHDAAEVVKQEAIERCPK--ETGALRNSAGTAS----DG-MEAVVYFDT 62 (108) Q Consensus 1 m~vk~n~--~~~~~---------v~~a~~~al~~~~~~v~~~s~~~vP~--dtG~L~~S~~v~~----~~-~~g~V~y~~ 62 (108) |||++.. ++++. +.+-+.+||..+++.|+......++. |||.+..+..++- ++ -+..|+|.. T Consensus 1 msvevkGv~eil~~le~k~g~~~~~ri~nkAL~~age~v~~~~K~~~~~fkDTG~t~~ev~~s~p~~~~G~r~V~vgW~G 80 (134) T protein:vir:10 1 MSVKVIGDKALERELEKRFGIKEMVKVQDKALIAGAKVIVEEVKKQLKPSKDTGALINEVSFSKPEWINGKRTITVHWRG 80 (134) T ss_pred CeEEEecHHHHHHHHHHhhchhhhhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeccEEecCeeecCCceEEEEEEEc Confidence 9999985 34443 44458899999999999998866664 9999999887653 22 357899988 Q ss_pred chhhh--hcc---cc------CCCCCCCccchhhHHHHHhHHHHHHHHHHHHHHh Q lcl|NC_016654. 63 PYAAR--QHE---EV------GWHHVDGQAKYLENAVNATQATVAEVIGEAIRRS 106 (108) Q Consensus 63 pYA~~--~h~---~~------~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~~ 106 (108) |=-|| +|. |+ .|-+|.|.+.. 
++++...+..+.++++++|++- T Consensus 81 ~~~R~~iiHLNE~Gytr~~~Gk~i~PrG~G~i-~~a~~~~e~~~~~~ik~eL~kl 134 (134) T protein:vir:10 81 SKDRYKIVHLIEYGHVQKGTGKFIKPKAMGGV-NRAIRQGQNKYFETLKRELKKL 134 (134) T ss_pred CCceeEEEEeecccceecccCCccCcchhhHH-HHHHHhhhHHHHHHHHHHHhcC Confidence 84444 353 32 14467776644 7899999999999999999987 No 99 >protein:vir:9513 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835560;genbank:gi:30043947;genbank:GeneID:1260542 Probab=97.47 E-value=1.5e-06 Score=52.58 Aligned_cols=105 Identities=16% Similarity=0.179 Sum_probs=78.3 Q ss_pred CCccccH--HHHHH---------HHHHHHHHHHHHHHHHHHHhhhcCCc--ccchhhcceeecc----cC-cEEEEEecC Q lcl|NC_016654. 1 MPVEFNY--GIAAT---------VRGAAKSGLHDAAEVVKQEAIERCPK--ETGALRNSAGTAS----DG-MEAVVYFDT 62 (108) Q Consensus 1 m~vk~n~--~~~~~---------v~~a~~~al~~~~~~v~~~s~~~vP~--dtG~L~~S~~v~~----~~-~~g~V~y~~ 62 (108) |||++.. ++++. +.+-+.+||..+++.|+......++. |||.+..+..++- ++ -+..|+|.. T Consensus 1 msvevkGv~eil~~le~k~g~~~~~ri~nkAL~~age~v~~~~K~~~~~fkDTG~t~~ev~~s~p~~~~G~r~V~vgW~G 80 (134) T protein:vir:95 1 MSVKVIGDKALERELEKRFGIKEMVKVQDKALIAGAKVIVEEVKKQLKPSKDTGALINEVSFSKPEWINGKRTITVHWRG 80 (134) T ss_pred CeEEEecHHHHHHHHHHhhchhhhhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeccEEecCeeecCCceEEEEEEEc Confidence 9999985 34443 44458899999999999998866664 9999999887653 22 357899988 Q ss_pred chhhh--hcc---cc------CCCCCCCccchhhHHHHHhHHHHHHHHHHHHHHh Q lcl|NC_016654. 63 PYAAR--QHE---EV------GWHHVDGQAKYLENAVNATQATVAEVIGEAIRRS 106 (108) Q Consensus 63 pYA~~--~h~---~~------~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~~ 106 (108) |=-|| +|. |+ .|-+|.|.+.. 
++++...+..+.++++++|++- T Consensus 81 ~~~R~~iiHLNE~Gytr~~~Gk~i~PrG~G~i-~~a~~~~e~~~~~~ik~eL~kl 134 (134) T protein:vir:95 81 SKDRYKIVHLIEYGHVQKGTGKFIKPKAMGGV-NRAIRQGQNKYFETLKRELKKL 134 (134) T ss_pred CCceeEEEEeecccceecccCCccCcchhhHH-HHHHHhhhHHHHHHHHHHHhcC Confidence 84444 353 32 14467776644 7899999999999999999987 No 100 >protein:vir:3163 Length: 145 # NCBI annotation: unknown # Family: family:all:28417 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665934;genbank:gi:22091120;genbank:GeneID:951270 Probab=97.44 E-value=4.5e-07 Score=55.36 Aligned_cols=106 Identities=17% Similarity=0.173 Sum_probs=69.2 Q ss_pred CccccHHH---HHHHHHHHHHHHHHHHHHHHHHh------------hhcCC---------------cccchhhcceeec- Q lcl|NC_016654. 2 PVEFNYGI---AATVRGAAKSGLHDAAEVVKQEA------------IERCP---------------KETGALRNSAGTA- 50 (108) Q Consensus 2 ~vk~n~~~---~~~v~~a~~~al~~~~~~v~~~s------------~~~vP---------------~dtG~L~~S~~v~- 50 (108) =|+...++ ..++.++...+|..++..++... .+..| .|||.|++|.... T Consensus 1 ~i~~~~~i~~~l~~l~~~~~~~l~~i~~~~~~~~~~rf~~~~~p~G~~W~pLs~st~a~k~~~~~L~~tG~L~~Si~~~~ 80 (145) T protein:vir:31 1 MVEDENNIPEAREAIQDGLTDGLERLHTITLRELITNMSDGQDALGNPWEPLKESTIRAKGSDTPLIDNSRLLTDINAAS 80 (145) T ss_pred CcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccChHHHHHhcCCCCCccCHHHHHHHHHHh Confidence 24444332 33444444555555544433321 13444 4777888876533 Q ss_pred ---ccCcEEEEEecCchhhhhccccCCCCCCCccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 
51 ---SDGMEAVVYFDTPYAARQHEEVGWHHVDGQAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 51 ---~~~~~g~V~y~~pYA~~~h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) .+..++.||.|.+||+.+|+|+.--+=+++ +||.....+..+++.++|.+.+-+.|+ T Consensus 81 ~~~~~~~~a~vGtn~~YA~~hqfG~~~~~IPaR-PfLG~~~~~~~~~~~~ii~~~i~~~L~ 140 (145) T protein:vir:31 81 MMDRANRMAVIGTNLDYAEHHEFGAPEAGIPAR-PIFGPAGAYASQQAPDVIGDEIDTNLE 140 (145) T ss_pred hhcccCceeEecCCchhhhhhccCCcccccCCC-CccCCCccchHHHHHHHHHHHHHHHhh Confidence 345678999999999999998754334444 699887777778899999999999999 No 101 >protein:vir:6246 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813700;swissprot:trembl:q859b7;genbank:gi:29366760;uniprot:Q859B7;genbank:GeneID:1258903 Probab=97.40 E-value=4.5e-07 Score=55.39 Aligned_cols=107 Identities=19% Similarity=0.264 Sum_probs=78.2 Q ss_pred CC------ccccH--HHHHH--------HHHHHHHHHHHHHHHHHHHhhhcCCc-----------ccchhhcceeecccC Q lcl|NC_016654. 1 MP------VEFNY--GIAAT--------VRGAAKSGLHDAAEVVKQEAIERCPK-----------ETGALRNSAGTASDG 53 (108) Q Consensus 1 m~------vk~n~--~~~~~--------v~~a~~~al~~~~~~v~~~s~~~vP~-----------dtG~L~~S~~v~~~~ 53 (108) |+ ||++. .+... |.++.+.++.++++.++..+...+|. .||.|..|..+.... T Consensus 1 ma~~~~~~vrV~Glr~f~~~mrK~~g~dl~k~lk~a~~~aa~v~~~~ar~~tP~g~r~~~~s~~~r~G~L~~Sir~aaT~ 80 (143) T protein:vir:62 1 MAQRSAYTIRVDGLREFQRNVRTLRDKELNKAVREANKASGEVLIPQAKHESPDGKRDAKSSKKYRPGKLDKSIKVTASA 80 (143) T ss_pred CCcccchheehHHHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccCcchhhccccccccc Confidence 54 54442 22222 35667888899999999999999999 699999999988766 Q ss_pred cEEEE--Ee--cCchhhhhccccCCCCCCCccchhhHHHHHhHHHHHHHHHHH----HHHhcC Q lcl|NC_016654. 54 MEAVV--YF--DTPYAARQHEEVGWHHVDGQAKYLENAVNATQATVAEVIGEA----IRRSIA 108 (108) Q Consensus 54 ~~g~V--~y--~~pYA~~~h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~----ir~~Lg 108 (108) ..+.| |- ..|||.++|||+.-++-. ...||-+++...-+.|.++.++. 
|.+.|| T Consensus 81 raa~VrAG~~krVPYA~~I~~G~r~r~Is-p~rFl~~a~a~te~~~~r~Ye~~i~~vl~k~l~ 142 (143) T protein:vir:62 81 KGAVIKAGSASRVPYAAAIHFGYRARNIS-PNRFLFRAMARKSDVVAATYERRIAAVVEKYLE 142 (143) T ss_pred cceeeeeCCcCCCCcccccccCccccccc-chhhhhhhhhccCHHHHHHHHHHHHHHHHHHhc Confidence 55444 44 689999999986545422 24799999998888877665554 556777 No 102 >protein:vir:1332 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047931;swissprot:trembl:q9zxa7;genbank:gi:9631149;uniprot:Q9ZXA7;genbank:GeneID:2715891 Probab=97.39 E-value=8.1e-07 Score=53.98 Aligned_cols=107 Identities=19% Similarity=0.267 Sum_probs=78.1 Q ss_pred CC------ccccH--HHHHH--------HHHHHHHHHHHHHHHHHHHhhhcCCcc-----------cchhhcceeecccC Q lcl|NC_016654. 1 MP------VEFNY--GIAAT--------VRGAAKSGLHDAAEVVKQEAIERCPKE-----------TGALRNSAGTASDG 53 (108) Q Consensus 1 m~------vk~n~--~~~~~--------v~~a~~~al~~~~~~v~~~s~~~vP~d-----------tG~L~~S~~v~~~~ 53 (108) |+ |++.. .|... |.++.+.++.++++.++..+...+|.- +|.|..|..+.... T Consensus 1 ma~~~~~~vkV~Glr~f~~~mrK~~g~dl~k~lk~a~~~aa~v~~~~ar~~tP~g~~~p~~srr~r~G~L~~Sir~aaT~ 80 (143) T protein:vir:13 1 MAQRSAYTIQVDGLRQFQRNVRALRDKELNKAVREANKASGEVLIPQAKHESPDGHRDPKSSKRYRPGKLDKSIKVTASA 80 (143) T ss_pred CCcccchheehHHHHHHHHHHHHhhCCcchHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccchhhccccccccc Confidence 54 44442 22222 356678888999999999999999998 89999999988776 Q ss_pred cEEEE--Ee--cCchhhhhccccCCCCCCCccchhhHHHHHhHHHHHHHHHHH----HHHhcC Q lcl|NC_016654. 54 MEAVV--YF--DTPYAARQHEEVGWHHVDGQAKYLENAVNATQATVAEVIGEA----IRRSIA 108 (108) Q Consensus 54 ~~g~V--~y--~~pYA~~~h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~----ir~~Lg 108 (108) ..+.| |- -.|||.++|||+.-++-. ...||-+++...-+.|.++.++. 
|.+.|| T Consensus 81 raa~VrAGr~arVPYA~~I~~G~r~r~Is-~~rFl~~a~a~te~~~~r~Ye~~i~~vl~k~l~ 142 (143) T protein:vir:13 81 KGAVIKAGSAARVPYAAAIHFGYRKRNIS-ANRFLYRAMARKSDVVAATYERRIAAVVEKYLE 142 (143) T ss_pred cceeeeecCcCCCCcccccccCCcccccc-hhhhhhhhhhccCHHHHHHHHHHHHHHHHHHhc Confidence 55554 32 379999999986544433 34799999998888887665555 556677 No 103 >protein:vir:79091 Length: 175 # NCBI annotation: gp5, phage virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111205;genbank:gi:134288802;genbank:GeneID:4960765 Probab=97.31 E-value=1.3e-06 Score=52.77 Aligned_cols=107 Identities=14% Similarity=0.155 Sum_probs=68.2 Q ss_pred CC----ccccH-HHHHHHHHHHH------HHHHHHHHHHHHHhh-----hcC---------------------------- Q lcl|NC_016654. 1 MP----VEFNY-GIAATVRGAAK------SGLHDAAEVVKQEAI-----ERC---------------------------- 36 (108) Q Consensus 1 m~----vk~n~-~~~~~v~~a~~------~al~~~~~~v~~~s~-----~~v---------------------------- 36 (108) || |+|+. .+...+.+.+. ..+..+++.+..... .-- T Consensus 1 Ms~~i~i~~d~~~~~~~L~~l~~~~~d~~~lm~~Ig~~l~~~t~~rF~~~~~PdW~pls~~t~~~r~~~~~~~~~~~~~~ 80 (175) T protein:vir:79 1 MSDFVNFQIDDSALRTRLLQLEQAGHQKADAMRKITQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKNGELT 80 (175) T ss_pred CceEEEEEechHHHHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCChHHHHhhccccccccccccch Confidence 66 55553 34455444322 234445555443321 111 Q ss_pred ------------CcccchhhcceeecccCcEEEEEecCchhhhhccccCCC-----CCCCccchhhHHHHHh-----HHH Q lcl|NC_016654. 37 ------------PKETGALRNSAGTASDGMEAVVYFDTPYAARQHEEVGWH-----HVDGQAKYLENAVNAT-----QAT 94 (108) Q Consensus 37 ------------P~dtG~L~~S~~v~~~~~~g~V~y~~pYA~~~h~~~~~~-----~~~~~~k~le~a~~~~-----~~~ 94 (108) =.|||.|.+|....++.....||.|.+||+.+|+|..-. +-+++ .||----.++ .+. 
T Consensus 81 ~~~~~~~~~~~~L~~tG~L~~Si~~~~~~~~v~vGtn~~YAaiHqfGg~~~~~~~v~IPAR-PfLG~s~~de~~~~~~~~ 159 (175) T protein:vir:79 81 AAASRRKAGLMILQDSGQMAASTATDSGEDYSVIGSNKEYAAIQHFGGQAGRGLKVTIPGR-AWLPVTADGELQPEAVEP 159 (175) T ss_pred hhHhhhccCCCcceechhhhhhhhheecCCEEEEecCcchhhHhhcccccCCCcccccCcc-cccCCCcccchhHHHHHH Confidence 136999999999888888999999999999999976321 12233 4765433333 567 Q ss_pred HHHHHHHHHHHhcC Q lcl|NC_016654. 95 VAEVIGEAIRRSIA 108 (108) Q Consensus 95 i~~~i~~~ir~~Lg 108 (108) |.++|.+.|++.|. T Consensus 160 I~~~i~~~l~~a~~ 173 (175) T protein:vir:79 160 VLNTILRHLMDAAN 173 (175) T ss_pred HHHHHHHHHHHHhc Confidence 88888888888888 No 104 >protein:vir:100887 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358767;genbank:gi:77999993;genbank:GeneID:3726158 Probab=97.21 E-value=5.1e-06 Score=49.59 Aligned_cols=102 Identities=18% Similarity=0.206 Sum_probs=68.4 Q ss_pred ccccHH---HHHHHHHH-------HHHHHHHHHHHHHHHhhhcCCc-------cc---chhhcceeecc---c---CcEE Q lcl|NC_016654. 3 VEFNYG---IAATVRGA-------AKSGLHDAAEVVKQEAIERCPK-------ET---GALRNSAGTAS---D---GMEA 56 (108) Q Consensus 3 vk~n~~---~~~~v~~a-------~~~al~~~~~~v~~~s~~~vP~-------dt---G~L~~S~~v~~---~---~~~g 56 (108) |.|... +...|++. ..+++.+.+..+...-...+|. +| +.|+.|..+.. + .+.. T Consensus 1 v~~~~~lee~l~~i~kl~~~~~~~~~ki~kaGA~v~~e~L~~~tp~~~~~~~~~~~~~~HlaD~I~~s~~~~dg~~~g~~ 80 (139) T protein:vir:10 1 MDMDEALGQWLKQVSKAAELSISDQEKITKAGADVYAKKLAETTKEKHPNTKGDGGKYGHLSEDIRSAAGDIDGDHNGSS 80 (139) T ss_pred CCHHHHHHHHHHHHHHhhccCHHHHHHHHHHHHHHHHHHHHHhcccccCcCCCCCCCCcchhhcceecCcccccccceee Confidence 555543 33344332 3445566666666666677775 23 46888887653 2 2334 Q ss_pred EEEecCchhhhhcc---ccCCCCCCCccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 
57 VVYFDTPYAARQHE---EVGWHHVDGQAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 57 ~V~y~~pYA~~~h~---~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) .|||+..| .+.|+ ||.+..|+ +|+|+...+.++++.+.+.+++|+-|. T Consensus 81 ~VG~~k~~-~~A~f~n~GT~k~~~~---hFie~t~~e~~~evl~a~~~~~k~~l~ 131 (139) T protein:vir:10 81 TVGFHNKA-HIARFLNDGTKYIRAD---HFVDNARDDAKDAVFAAEAEKYQAMIA 131 (139) T ss_pred eeCCCCCc-ceEeecccCccccCCC---chHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 69998763 44454 66555554 799999999999999999999999988 No 105 >protein:vir:99196 Length: 155 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950453;genbank:gi:119953654;genbank:GeneID:4643056 Probab=97.19 E-value=5.5e-06 Score=49.40 Aligned_cols=106 Identities=18% Similarity=0.127 Sum_probs=64.0 Q ss_pred CCccc----cH-HHHHHHHHH------HHHHHHHHHHHHHHH--------hhhcCC--------------------cccc Q lcl|NC_016654. 1 MPVEF----NY-GIAATVRGA------AKSGLHDAAEVVKQE--------AIERCP--------------------KETG 41 (108) Q Consensus 1 m~vk~----n~-~~~~~v~~a------~~~al~~~~~~v~~~--------s~~~vP--------------------~dtG 41 (108) ||+.| |. .+.+.+.+- ....+..+++.+... -.+..| .+|| T Consensus 1 Ms~~i~i~~d~~~~~~~L~~l~~~~~d~~~l~~~ig~~l~~~~~~rF~pdG~~W~pls~~t~~~r~~~g~~~~~iL~~tg 80 (155) T protein:vir:99 1 MTTRIDVELDDQEVRQRLALLMRSVTDTLPVMRGIAAELLAETEFAFMDEGPGWPQLSPVTVAAREAKGRGPHPILQVTN 80 (155) T ss_pred CceEEEEEechHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhhccCCCCCCCChHHHHHHhccCCCCCCcchhch Confidence 76655 43 233333321 122233344433332 223333 4899 Q ss_pred hhhcceeecccCcEEEEEecCchhhhhccccCCC-----CCCCccchhhHH-----HHHhHHHHHHHHHHHHHHhc Q lcl|NC_016654. 42 ALRNSAGTASDGMEAVVYFDTPYAARQHEEVGWH-----HVDGQAKYLENA-----VNATQATVAEVIGEAIRRSI 107 (108) Q Consensus 42 ~L~~S~~v~~~~~~g~V~y~~pYA~~~h~~~~~~-----~~~~~~k~le~a-----~~~~~~~i~~~i~~~ir~~L 107 (108) .|.+|....++.....||.|.+||+.+|+|.... 
+.+++ .||--- ..+..++|.++|.+.|+++= T Consensus 81 ~L~~Si~~~~~~~~v~vGtn~~YA~iHqfGg~~~~~~~v~iPaR-pfLG~s~~~~l~~e~~~~I~~~i~~~l~~~~ 155 (155) T protein:vir:99 81 ALARSVTTWADRNEAGIGSNLVYAAIHQFGGDAGRGHQVEIPAR-RYLPFDENGQLAAGARQSILEIVLTALSRNR 155 (155) T ss_pred hhhhhhhceecCCEEEEecCccchhhhhcccccCCCCccccCCc-cccCCCCccccchHHHHHHHHHHHHHHhccC Confidence 9999999999999999999999999999976432 12333 477422 12445677778888887776 No 106 >protein:vir:99833 Length: 190 # NCBI annotation: hypothetical protein # Family: family:all:274 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164071;genbank:gi:56692603;genbank:GeneID:3192561 Probab=97.18 E-value=6.1e-06 Score=49.17 Aligned_cols=107 Identities=20% Similarity=0.248 Sum_probs=66.5 Q ss_pred CCccccH-HHHHHHHHHH------HHHHHHHHHHHHHHh------------hhcCC-----------------cccchhh Q lcl|NC_016654. 1 MPVEFNY-GIAATVRGAA------KSGLHDAAEVVKQEA------------IERCP-----------------KETGALR 44 (108) Q Consensus 1 m~vk~n~-~~~~~v~~a~------~~al~~~~~~v~~~s------------~~~vP-----------------~dtG~L~ 44 (108) ++|++|. .+...+.... ...+..++..+.+.. .++.| .|||.|. T Consensus 4 i~i~~d~~~~~~~L~~l~~~~~~~~~l~~~ig~~l~~~~~~rf~~~~~PdG~~W~p~~~~t~~rk~~~~~~~L~~tg~L~ 83 (190) T protein:vir:99 4 ITLEWDGRRALDVLNAGSAALGDPSGLLQDIGELLLNIHRRRFQAQVSPDGTPWQPLSPAYLRRKRKNRDKILTLDGHLR 83 (190) T ss_pred eEEEecHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCccccHHHHHHhhcCCCccceecHHHH Confidence 5566664 2333333322 223444444444321 12333 3778999 Q ss_pred cceeecccCcEEEEEecCchhhhhccccCCCCCCC-----------------------------------------ccch Q lcl|NC_016654. 45 NSAGTASDGMEAVVYFDTPYAARQHEEVGWHHVDG-----------------------------------------QAKY 83 (108) Q Consensus 45 ~S~~v~~~~~~g~V~y~~pYA~~~h~~~~~~~~~~-----------------------------------------~~k~ 83 (108) +|....++..+..||.|.+||+.+|+|..-..+.. 
.=.| T Consensus 84 ~Si~~~~~~~~v~vGtn~~yA~iHq~Gg~i~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~v~IPaRpf 163 (190) T protein:vir:99 84 NLLRYQLDGSELLFGSDRPYAAIHHFGGTIQRQARSSTVYFRQNERTGEVGREFVPRRRSNFAQDVQIGPYTIQMPARPW 163 (190) T ss_pred HHHhheecCcEEEEecCcchhhhhhcCCcccccccchhhhhhhhhhhhhhhcccccccccccchhcccccceeeecCccc Confidence 99988888889999999999999998743221110 0135 Q ss_pred hhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 84 LENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 84 le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) |--. .+...+|.++|.+.|++.|. T Consensus 164 LG~s-~~d~~~I~~~i~~~l~~~~~ 187 (190) T protein:vir:99 164 LGTS-SQDDDTILQRVERYLQRALR 187 (190) T ss_pred CCCC-HHHHHHHHHHHHHHHHHHHh Confidence 4221 45678899999999999998 No 107 >protein:vir:78335 Length: 133 # NCBI annotation: gp9 # Family: family:all:589 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468648;genbank:gi:157325225;genbank:GeneID:5601681 Probab=97.10 E-value=8.3e-06 Score=48.43 Aligned_cols=106 Identities=13% Similarity=0.124 Sum_probs=79.2 Q ss_pred CCccccH--HHHHH---------HHHHHHHHHHHHHHHHHHH--hhhcCCcccchhhcceeecc----c-CcEEEEEecC Q lcl|NC_016654. 1 MPVEFNY--GIAAT---------VRGAAKSGLHDAAEVVKQE--AIERCPKETGALRNSAGTAS----D-GMEAVVYFDT 62 (108) Q Consensus 1 m~vk~n~--~~~~~---------v~~a~~~al~~~~~~v~~~--s~~~vP~dtG~L~~S~~v~~----~-~~~g~V~y~~ 62 (108) |||++.. ++++. |.+.+.+||..+++.|... ++.-+..|||....+..++- + .-+..|+|.. T Consensus 1 msvevkGv~eilk~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~G~r~V~i~W~g 80 (133) T protein:vir:78 1 MSVEVTGVEELERQLVSLFGRENLPQLVDPALIAGATLVAKTLKSEFVQFKDTGASIDEINIEKPSYDKGVRSIKIDWKG 80 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHHhhcchhcccceeeeEEecCeeeeCCceEEEEEEec Confidence 9999984 33333 4455788999999988877 45577789999999887552 2 2367899999 Q ss_pred chhhhh--cc---cc----CCCCCCCccchhhHHHHHhHHHHHHHHHHHHHHhc Q lcl|NC_016654. 
63 PYAARQ--HE---EV----GWHHVDGQAKYLENAVNATQATVAEVIGEAIRRSI 107 (108) Q Consensus 63 pYA~~~--h~---~~----~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~~L 107 (108) |=-||- |- |+ .|-+|.|.+. +++++......+.++++++|++.| T Consensus 81 p~~R~~iVHLNE~GYtr~Gk~i~PrG~G~-i~~a~~~se~~y~~~vk~el~k~l 133 (133) T protein:vir:78 81 PKDRYKIIHLNEYGYTRNGKKITPAGTGS-VARSLRISERAYRAIVQKKIGDKL 133 (133) T ss_pred CCCceeEEEeeccceecCCCeEccchhhH-HHHHHHhhhHHHHHHHHHHHHhhC Confidence 855543 53 33 2457877663 588889999999999999999999 No 108 >protein:vir:1988 Length: 156 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050635;genbank:gi:9633522;genbank:GeneID:2636282 Probab=97.10 E-value=7.2e-06 Score=48.77 Aligned_cols=106 Identities=13% Similarity=0.062 Sum_probs=64.5 Q ss_pred CCcccc----H-HHHHHHHHHHH-----HHHHHHHHHHHHH-------------hhhcCC-------------------- Q lcl|NC_016654. 1 MPVEFN----Y-GIAATVRGAAK-----SGLHDAAEVVKQE-------------AIERCP-------------------- 37 (108) Q Consensus 1 m~vk~n----~-~~~~~v~~a~~-----~al~~~~~~v~~~-------------s~~~vP-------------------- 37 (108) ||+.++ . .+...+.+-.. ..+..++..+.+. ..+..| T Consensus 1 ms~~i~~~~d~~~l~~~L~~l~~~~~~~~l~~~Ig~~l~~~~~~rf~~~~~Pd~G~~W~pls~~t~~~r~~~~~~~~~~L 80 (156) T protein:vir:19 1 MSLDMNVAVDVRRIQLALDELGTVTRDRAIPRVMAAALLSSTEQAFERQADPDTGKGWEAWSDSWLAWRQDHGFVPGSIL 80 (156) T ss_pred CeEEEEEeecHHHHHHHHHHHHhhhccHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCCcccChHHHHHhhccCCCCCcch Confidence 776664 2 23333332111 1233333333322 112333 Q ss_pred cccchhhcceeecccCcEEEEEecCchhhhhccccCCC-CC-----CCccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 38 KETGALRNSAGTASDGMEAVVYFDTPYAARQHEEVGWH-HV-----DGQAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 38 ~dtG~L~~S~~v~~~~~~g~V~y~~pYA~~~h~~~~~~-~~-----~~~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) .|||.|.+|....++...+.||.|.+||+.+|||..-+ ++ +++ .||--- .+...+|.++|.+.|++.|. 
T Consensus 81 ~~tg~L~~Si~~~~~~~~v~vGt~~~yA~vHqfG~~~~~~~~~~~iPaR-pfLG~s-~~d~~~I~~~i~~~l~~~~~ 155 (156) T protein:vir:19 81 TLHGDLARSITTDYGQDYALIGSPKIYAAIHQWGGTPDMAPRPAGVPAR-PYMGLD-KTGEQEIFDAIRKRVSAALR 155 (156) T ss_pred hhhHHHHHHhhheecCCEEEEecchhhhHHhhcCcccccCCCccccCCc-cccCCC-HHHHHHHHHHHHHHHHHHhh Confidence 27799999988888888999999999999999976432 11 122 466322 35577888888888888888 No 109 >protein:vir:96288 Length: 100 # NCBI annotation: ORF049 # Family: family:all:180 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240315;genbank:gi:66396010;genbank:GeneID:5133365 Probab=97.08 E-value=2.2e-06 Score=51.62 Aligned_cols=76 Identities=21% Similarity=0.380 Sum_probs=59.0 Q ss_pred CCccccHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeec--ccCcEEEEEecCchhhhhccccCCCCCC Q lcl|NC_016654. 1 MPVEFNYGIAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTA--SDGMEAVVYFDTPYAARQHEEVGWHHVD 78 (108) Q Consensus 1 m~vk~n~~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~--~~~~~g~V~y~~pYA~~~h~~~~~~~~~ 78 (108) |...+| .+...+.+-+++++-.+++.+...+...+|+|||.|++|..++ ..+.+|.|.-.+.||++---.. T Consensus 23 mvk~~~-~f~~~i~~~vk~~IakTa~~I~~~Avs~APVD~G~Lk~SI~~dyk~GGltavI~vGAeYAIkrmsql------ 95 (100) T protein:vir:96 23 MVVELD-KFDKKIEEWVKKGIAKTTTKIYNTAVALAPVDLGFLEESIDFKYFDGGLSSVISVGADYAIKRMSQL------ 95 (100) T ss_pred HHHHHh-cchHHHHHHHHHHHHHHHHHHHhhHHhhccccccccceeeeeeeecCCeeEEEecchhHHHHHHHHH------ Confidence 666666 4677888899999999999999999999999999999999887 3456899999999999432100 Q ss_pred CccchhhHHHHHhHHHHHHHH Q lcl|NC_016654. 
79 GQAKYLENAVNATQATVAEVI 99 (108) Q Consensus 79 ~~~k~le~a~~~~~~~i~~~i 99 (108) +.-+| T Consensus 96 ----------------lvtvi 100 (100) T protein:vir:96 96 ----------------LVTVI 100 (100) T ss_pred ----------------HhhcC Confidence 00000 No 110 >protein:vir:103841 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938236;genbank:gi:38229141;genbank:GeneID:2648156 Probab=97.06 E-value=6.1e-06 Score=49.19 Aligned_cols=107 Identities=17% Similarity=0.099 Sum_probs=65.3 Q ss_pred CC----ccccHH-HHHHHHHHH------HHHHHHHHHHHHHHh--------hhcCC--------------------cccc Q lcl|NC_016654. 1 MP----VEFNYG-IAATVRGAA------KSGLHDAAEVVKQEA--------IERCP--------------------KETG 41 (108) Q Consensus 1 m~----vk~n~~-~~~~v~~a~------~~al~~~~~~v~~~s--------~~~vP--------------------~dtG 41 (108) || |++|.. +...+.+.. ...+..+++.+.... .+..| .||| T Consensus 1 Ms~~i~i~~~~~~~~~~L~~l~~~~~~~~~l~~~ig~~l~~~~~~rF~p~G~~W~plsp~t~~~r~k~g~~~~~~L~~tG 80 (155) T protein:vir:10 1 MANRIELELVDREVQERLAALYAAVTDTLPLMRGIAAELLAETEFAFMDEGPGWPQLSPVTVAARAAKGRGAHPILQVTN 80 (155) T ss_pred CCceEEEEechHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCCccchHHHHhccCCCCCccccch Confidence 66 555542 444443321 223444455444432 12211 4789 Q ss_pred hhhcceeecccCcEEEEEecCchhhhhccccCCC-CC----CCccchhhHHHH-HhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 42 ALRNSAGTASDGMEAVVYFDTPYAARQHEEVGWH-HV----DGQAKYLENAVN-ATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 42 ~L~~S~~v~~~~~~g~V~y~~pYA~~~h~~~~~~-~~----~~~~k~le~a~~-~~~~~i~~~i~~~ir~~Lg 108 (108) .|.+|....++.....||.|.+||+.+|+|..-. ++ +++ .||--.-. 
+-++++.+.|.+.|.+.|- T Consensus 81 ~L~~Si~~~~~~~~v~vGtn~~YA~iHqfGg~~~~~~~~~iPAR-PfLG~s~~~e~~~ei~~~I~~~i~~~l~ 152 (155) T protein:vir:10 81 ALARSITTRADRDQAQIGSNLSYAAIQQLGGQAGRGRKVTIPAR-PYLPVLRNGQLKPSARDAVLDVLLAALS 152 (155) T ss_pred hhhhhhhceecCCEEEEecCcchhhhhhcccccCCCCccccCCc-cccCCCccccchHHHHHHHHHHHHHHHh Confidence 9999988888888999999999999999976421 11 233 58763222 2356777777777777776 No 111 >protein:vir:100223 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025034;genbank:gi:48697267;genbank:GeneID:2948321 Probab=96.99 E-value=1.3e-05 Score=47.32 Aligned_cols=102 Identities=19% Similarity=0.188 Sum_probs=68.2 Q ss_pred ccccH---HHHHHHHHH-------HHHHHHHHHHHHHHHhhhcCCc-------cc---chhhcceeecc---c---CcEE Q lcl|NC_016654. 3 VEFNY---GIAATVRGA-------AKSGLHDAAEVVKQEAIERCPK-------ET---GALRNSAGTAS---D---GMEA 56 (108) Q Consensus 3 vk~n~---~~~~~v~~a-------~~~al~~~~~~v~~~s~~~vP~-------dt---G~L~~S~~v~~---~---~~~g 56 (108) |.|.. ++.+.+++. ..+++...++.+...-...+|. ++ +.|..|..+.. + .+.. T Consensus 1 ~~~~~~l~e~l~~lekl~~~~~~~~~k~tkaGA~v~~~~L~~~tp~~~~~~~~~~~~~~HlaD~I~~~~~~idg~~~g~~ 80 (139) T protein:vir:10 1 MDMDEALGQWLKQVSKAAQLSVSDQEKITKAGADVYAKELAETTKEKHPNTKGDGGKYGHLSEDISSAAGDIDGDHNGSS 80 (139) T ss_pred CCHHHHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHhcccccccCCCCCCCCCcccccceecCccccccccccc Confidence 44443 234444443 2455566666666666777773 33 35888877653 2 2346 Q ss_pred EEEecCchhhhhcc---ccCCCCCCCccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 57 VVYFDTPYAARQHE---EVGWHHVDGQAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 57 ~V~y~~pYA~~~h~---~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) .|||+..| .+.|+ ||.+..|+ +|+|++..+.++++.+.+++++++-|. 
T Consensus 81 ~VG~~~~~-~~Ahf~n~GT~~~~~~---hFie~t~~e~~~ev~~a~~~~~ke~l~ 131 (139) T protein:vir:10 81 TVGFHNKA-HIARFLNDGTKNIRAD---HFVDNARDDAKDAVFAAEAEKYQAMIA 131 (139) T ss_pred eeCCCCCc-eeeeeeccCccccCCC---chHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 79999874 34454 66555454 799999999999999999999988887 No 112 >protein:vir:966 Length: 123 # NCBI annotation: Orf48 # Family: family:all:970 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076620;genbank:gi:13095728;genbank:GeneID:920248 Probab=96.81 E-value=3e-05 Score=45.42 Aligned_cols=105 Identities=14% Similarity=0.104 Sum_probs=68.6 Q ss_pred CCcccc--H----------HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecccCcEEEE-EecCchhhh Q lcl|NC_016654. 1 MPVEFN--Y----------GIAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASDGMEAVV-YFDTPYAAR 67 (108) Q Consensus 1 m~vk~n--~----------~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~~~~g~V-~y~~pYA~~ 67 (108) |+-++. . .....+...++.++..++..+..+-...+|++||.+.+|=.+..++....| .|+.+--+- T Consensus 1 m~~~v~id~L~~~i~~~L~~y~~~v~~~v~~~v~~~a~~~~~~lk~~sP~~TG~yaksW~~k~~~~~~~~v~~~~~~y~l 80 (123) T protein:vir:96 1 MANKISIDDLAKTIESEVRNWTKDVVDDIDDIKKDITKNGVKQLRESSPKRTGDYAKNWTSQKLKNGDQVIYQKAPTYRL 80 (123) T ss_pred CCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCccccccccceeeeecCCeeEEEEEecCCcce Confidence 776544 2 122334456777788888888888889999999999999777665443344 444432222 Q ss_pred hcc-ccCCCCC-CCc---cchhhHHHHHhHHHHHHHHHHHHHH Q lcl|NC_016654. 68 QHE-EVGWHHV-DGQ---AKYLENAVNATQATVAEVIGEAIRR 105 (108) Q Consensus 68 ~h~-~~~~~~~-~~~---~k~le~a~~~~~~~i~~~i~~~ir~ 105 (108) .|. 
+.+.-.- .|+ -.++.++...-.+.+.+.|.++|++ T Consensus 81 ~HLLE~GHa~r~GGrV~a~phI~paee~~~~~l~~~i~r~l~~ 123 (123) T protein:vir:96 81 THLLENGHAKRNGGRVSPKVHIAPVEEELVSNYISRVEKRLSQ 123 (123) T ss_pred EEeeecceeecCCceeCcchhhhHHHHHHHHHHHHHHHHHhcC Confidence 353 2221111 111 2588999888899999999999999 No 113 >protein:vir:107851 Length: 175 # NCBI annotation: gp31 # Family: family:all:274 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024704;genbank:gi:48696941;genbank:GeneID:2845939 Probab=96.76 E-value=1.9e-05 Score=46.43 Aligned_cols=107 Identities=13% Similarity=0.135 Sum_probs=65.2 Q ss_pred CC----ccccH-HHHHHHHHHH------HHHHHHHHHHHHHHhh------------hcCC-------------------- Q lcl|NC_016654. 1 MP----VEFNY-GIAATVRGAA------KSGLHDAAEVVKQEAI------------ERCP-------------------- 37 (108) Q Consensus 1 m~----vk~n~-~~~~~v~~a~------~~al~~~~~~v~~~s~------------~~vP-------------------- 37 (108) || |+++. .+...+.+.+ ...+..+++.+..... ++.| T Consensus 1 Ms~~i~i~~~~~~l~~~L~~l~~~~~d~~~l~~~Ig~~l~~~t~~rF~~e~~Pdw~p~~p~t~~~r~~~g~~~~k~~~~~ 80 (175) T protein:vir:10 1 MSDFVNFQIDDSALRTRLLQLEQAGHQKAGAMRKIAQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKNGELT 80 (175) T ss_pred CceeEEEEecHHHHHHHHHHHHHHhccHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchhhhhhhhcccccchhhhhhh Confidence 76 55564 2444444432 2234444544444321 1222 Q ss_pred -------------cccchhhcceeecccCcEEEEEecCchhhhhccccCCC-CC----CCccchhhHHHH-----HhHHH Q lcl|NC_016654. 38 -------------KETGALRNSAGTASDGMEAVVYFDTPYAARQHEEVGWH-HV----DGQAKYLENAVN-----ATQAT 94 (108) Q Consensus 38 -------------~dtG~L~~S~~v~~~~~~g~V~y~~pYA~~~h~~~~~~-~~----~~~~k~le~a~~-----~~~~~ 94 (108) .+||.|.+|....++.....||.|.+||+.+|+|.... +. +++ .||----. 
+..++ T Consensus 81 ~~~~~~~~~~~~L~~tG~L~~Si~~~~~~~~v~vGtn~~YAaiHqfGg~~~~~~~v~iPaR-pfLG~s~~d~~~~e~~~~ 159 (175) T protein:vir:10 81 AAASRRKAGLMILQDSGQMAASVSTDHDDNSAVIGSNKEYAAIHQFGGQAGRGLKVTIPAR-PWLPVTADGELQPEAVEP 159 (175) T ss_pred hhhhhhccCCCcceechhhhhhhheeecCCEEEEecChhhhhhhhcccccCCCCccccCCc-cccCCCcccccchHHHHH Confidence 36889999999888889999999999999999976432 11 222 46553222 23466 Q ss_pred HHHHHHHHHHHhcC Q lcl|NC_016654. 95 VAEVIGEAIRRSIA 108 (108) Q Consensus 95 i~~~i~~~ir~~Lg 108 (108) |...+.+.|.+.|. T Consensus 160 Il~~~~~~l~~~~~ 173 (175) T protein:vir:10 160 VLNTILRHLMDAAN 173 (175) T ss_pred HHHHHHHHHHHHhc Confidence 77777777777777 No 114 >protein:vir:98636 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039927;genbank:gi:126011102;genbank:GeneID:4818472 Probab=96.73 E-value=2.1e-05 Score=46.18 Aligned_cols=105 Identities=20% Similarity=0.220 Sum_probs=74.8 Q ss_pred CCcccc--------HHHH-----HHHHHHHHHHHHHHHHHHHHHhhhcCC--cccchhhcceeecc----c-CcEEEEEe Q lcl|NC_016654. 1 MPVEFN--------YGIA-----ATVRGAAKSGLHDAAEVVKQEAIERCP--KETGALRNSAGTAS----D-GMEAVVYF 60 (108) Q Consensus 1 m~vk~n--------~~~~-----~~v~~a~~~al~~~~~~v~~~s~~~vP--~dtG~L~~S~~v~~----~-~~~g~V~y 60 (108) ||-==+ .++. ++|.+-+.+||..+++.|...-....| .|||....+..+.- + ..+..|+| T Consensus 7 ~~~~aevkGv~Eilk~lE~klG~~~~~ri~nkAL~~~ge~v~~~lK~~~~~fkDTGat~dev~~s~p~~~~G~r~V~igW 86 (138) T protein:vir:98 7 MSGFANLKGVEELLANMEKKLGPAKVNRVVNRSLKEIGKELEPSFKSAISIYKRTGETTESAVVSGVRREDGIPKVKLGF 86 (138) T ss_pred ccccccccCHHHHHHHHHHhhCHHhhhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeeeeeecCeeecCCceEEEEee Confidence 442111 1222 234555778888888888887665555 69999999876542 2 23678999 Q ss_pred cCchhhhh--cc---ccC-CCCCCCccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 
61 DTPYAARQ--HE---EVG-WHHVDGQAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 61 ~~pYA~~~--h~---~~~-~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) ..| ||- |- |++ |-||+|.+ .+++++......+.+.++++|++.|- T Consensus 87 ~Gp--R~~ivHLNE~GyGk~i~PrG~G-~I~ka~~~se~~y~~~vk~el~k~l~ 137 (138) T protein:vir:98 87 TTP--RWNIVHLQELEYGWKHNRRGVG-VIRRYSDILETIYPRGIRDKLKRGFD 137 (138) T ss_pred ecC--eeeEEeeecccccCCcCCCcch-HHHHHHHhhhHHHHHHHHHHHHHHhc Confidence 999 443 53 443 45788776 67999999999999999999999998 No 115 >protein:vir:9647 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795409;genbank:gi:28876182;genbank:GeneID:1257731 Probab=96.59 E-value=2.6e-05 Score=45.74 Aligned_cols=107 Identities=19% Similarity=0.200 Sum_probs=78.3 Q ss_pred CCcccc--------HHHHH-----HHHHHHHHHHHHHHHHHHHHhhhcCCc--ccchhhcceeecc----cC-cEEEEEe Q lcl|NC_016654. 1 MPVEFN--------YGIAA-----TVRGAAKSGLHDAAEVVKQEAIERCPK--ETGALRNSAGTAS----DG-MEAVVYF 60 (108) Q Consensus 1 m~vk~n--------~~~~~-----~v~~a~~~al~~~~~~v~~~s~~~vP~--dtG~L~~S~~v~~----~~-~~g~V~y 60 (108) ||-=-+ .++.. +|.+...+||..+++.|+.....-+|. |||.+..+..++- ++ .+..|+| T Consensus 1 ~~~~aevkGv~Eilk~lE~klG~~~v~ri~nkAL~~~ge~v~~~lK~~~~~f~DTG~t~dev~~s~~~~~~G~r~V~VgW 80 (132) T protein:vir:96 1 MSGFANLKGVEELLANMEKKLGPAKVNRVVNRSLKEIGKELEPSFKSAISIYKRTGETTESAVVSGVRREDGIPKVKLGF 80 (132) T ss_pred CCccccccCHHHHHHHHHHhhCHHHHHHHhHHHHHHHHHHHHHHHHHhhhhhhhcchhhcceeecCeeecCCceEEEecc Confidence 553111 12222 466678899999999999998888885 9999999887652 22 3678999 Q ss_pred cCchhhhhcc---ccC-CCCCCCccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 
61 DTPYAARQHE---EVG-WHHVDGQAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 61 ~~pYA~~~h~---~~~-~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) +.|=-+-+|- |++ |-||+|.+ .+++++......+.+.+.++|++.|- T Consensus 81 ~GpR~~ivHLNE~GyGk~~~PrG~G-~I~~a~~~se~~~~~~~~~elkk~l~ 131 (132) T protein:vir:96 81 TTPRWNIVHLQELEYGWKHNRRGVG-VIRRYSDILETIYPRGIRDKLKRGFD 131 (132) T ss_pred cCCceeEEeeecccccCCcCCCcch-HHHHHHHhhhhHHHHHHHHHHHHHhc Confidence 9992222353 443 45788766 67999999999999999999999998 No 116 >protein:vir:93898 Length: 133 # NCBI annotation: ORF028 # Family: family:all:589 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239942;genbank:gi:66395616;genbank:GeneID:5130964 Probab=96.51 E-value=3.3e-05 Score=45.18 Aligned_cols=104 Identities=17% Similarity=0.242 Sum_probs=75.8 Q ss_pred CCccccH--HHHHH---------HHHHHHHHHHHHHHHHHHHhhhcCC--cccchhhcceeecc----cC---cEEEEEe Q lcl|NC_016654. 1 MPVEFNY--GIAAT---------VRGAAKSGLHDAAEVVKQEAIERCP--KETGALRNSAGTAS----DG---MEAVVYF 60 (108) Q Consensus 1 m~vk~n~--~~~~~---------v~~a~~~al~~~~~~v~~~s~~~vP--~dtG~L~~S~~v~~----~~---~~g~V~y 60 (108) |||++.. ++++. |.+.+.+||..+++.|...-...+. .|||.+..+..++- ++ -+..|+| T Consensus 1 msvevkGv~eilk~le~k~G~~~~~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~i~W 80 (133) T protein:vir:93 1 MSVEIKGIPEVLKKLESVYGKQSMQAKSDRALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEW 80 (133) T ss_pred CeEEEecHHHHHHHHHHhhCHhhhHhhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceEEEEEe Confidence 9999984 33333 4455888999999988888654444 69999999987662 22 3468999 Q ss_pred cCchhhhh--cc---cc----CCCCCCCccchhhHHHHHhHHHHHHHHHHHHHH Q lcl|NC_016654. 61 DTPYAARQ--HE---EV----GWHHVDGQAKYLENAVNATQATVAEVIGEAIRR 105 (108) Q Consensus 61 ~~pYA~~~--h~---~~----~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~ 105 (108) ..|=-||- |- |+ .|-+|.|.+. 
+++++......+.++++++|++ T Consensus 81 ~gp~~R~~iVHLNE~Gytr~Gk~i~PrG~G~-i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:93 81 VGPMNRKNIIHLNEHGYTRDGKKYTPRGFGV-IAKTLAANERKYREIIKKELAR 133 (133) T ss_pred ecCCCceeEEEeeccceecCCCeEccchhhH-HHHHHHhhhHHHHHHHHHHhcC Confidence 99855553 53 33 1346777653 5888889999999999999999 No 117 >protein:vir:79225 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469157;genbank:gi:157835000;genbank:GeneID:5648806 Probab=96.39 E-value=2.6e-05 Score=45.72 Aligned_cols=106 Identities=19% Similarity=0.122 Sum_probs=64.3 Q ss_pred CCccc----cH-HHHHHHHHHH------HHHHHHHHHHHHHHh--------hhcCC--------------------cccc Q lcl|NC_016654. 1 MPVEF----NY-GIAATVRGAA------KSGLHDAAEVVKQEA--------IERCP--------------------KETG 41 (108) Q Consensus 1 m~vk~----n~-~~~~~v~~a~------~~al~~~~~~v~~~s--------~~~vP--------------------~dtG 41 (108) ||+.| |. .+.+.+.+.. ...+..++..+.... .+..| .||| T Consensus 1 M~~~i~i~~d~~~~~~~L~~l~~~~~d~~~l~~~ig~~l~~~~~~rF~~eG~~W~pls~~t~~~r~~~g~~~~~iL~~tG 80 (155) T protein:vir:79 1 MTTRIDVELDDQEVRQRLAVLMRSVTDTLPVMRGIAAELLAETEFAFMDEGPGWPQLSPATVAAREAKGRGPHPILQVTN 80 (155) T ss_pred CceEEEEEechHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhhccCCCCCCCCHHHHHHHhccCCCCCCccccch Confidence 65544 43 2333333321 222233444443322 23333 5899 Q ss_pred hhhcceeecccCcEEEEEecCchhhhhccccCCC-----CCCCccchhhHHH-----HHhHHHHHHHHHHHHHHhc Q lcl|NC_016654. 42 ALRNSAGTASDGMEAVVYFDTPYAARQHEEVGWH-----HVDGQAKYLENAV-----NATQATVAEVIGEAIRRSI 107 (108) Q Consensus 42 ~L~~S~~v~~~~~~g~V~y~~pYA~~~h~~~~~~-----~~~~~~k~le~a~-----~~~~~~i~~~i~~~ir~~L 107 (108) .|.+|....++.....||.|.+||+.+|+|..-. 
+-+++ .||---- .+-.++|.++|.+.|+++= T Consensus 81 ~L~~Si~~~~~~~~v~vGt~~~YA~iHqfGg~~~~~~~v~iPaR-pfLG~s~~~~l~~~~~~~I~~~i~~~l~r~r 155 (155) T protein:vir:79 81 ALARSVTTWADRNEAGIGSNLVYAAIHQFGGDAGRGHQVEIPAR-RYLPFDENGQLAAGARQSILEVVLTALSRNR 155 (155) T ss_pred hhhhhhhceecCCEEEEecCchhhhhhhcccccCCCCccccCCc-cccCCCCccccchHHHHHHHHHHHHHHHhcC Confidence 9999999999999999999999999999986432 12333 4774322 2334677888888887666 No 118 >protein:vir:96973 Length: 133 # NCBI annotation: ORF034 # Family: family:all:589 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239864;genbank:gi:66395542;genbank:GeneID:5133006 Probab=96.33 E-value=4.9e-05 Score=44.21 Aligned_cols=104 Identities=17% Similarity=0.247 Sum_probs=75.0 Q ss_pred CCccccH--HHHHH---------HHHHHHHHHHHHHHHHHHHhhhcCC--cccchhhcceeecc----cC---cEEEEEe Q lcl|NC_016654. 1 MPVEFNY--GIAAT---------VRGAAKSGLHDAAEVVKQEAIERCP--KETGALRNSAGTAS----DG---MEAVVYF 60 (108) Q Consensus 1 m~vk~n~--~~~~~---------v~~a~~~al~~~~~~v~~~s~~~vP--~dtG~L~~S~~v~~----~~---~~g~V~y 60 (108) |||++.. ++++. |.+.+.+||..+++.|...-...+. .|||.+..+..++- ++ -+..|+| T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~i~W 80 (133) T protein:vir:96 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEW 80 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceeEEEEe Confidence 9999984 33333 4445788999999988888654444 69999999887653 12 3468999 Q ss_pred cCchhhhh--cc---cc----CCCCCCCccchhhHHHHHhHHHHHHHHHHHHHH Q lcl|NC_016654. 61 DTPYAARQ--HE---EV----GWHHVDGQAKYLENAVNATQATVAEVIGEAIRR 105 (108) Q Consensus 61 ~~pYA~~~--h~---~~----~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~ 105 (108) ..|=-||- |- |+ .|-+|.|.+. 
+++++......+.++++++|++ T Consensus 81 ~gp~~R~~iVHLNE~Gytr~Gk~i~PrG~G~-i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:96 81 VGPMNRKNIIHLNEHGYTRDGKKYTPRGFGV-IAKTLAASERKYREIIKKELAR 133 (133) T ss_pred ecCCCceeEEEeeccceecCCCeEccchhhH-HHHHHHhhhHHHHHHHHHHhcC Confidence 99855553 53 33 1346777653 5888888899999999999999 No 119 >protein:vir:9363 Length: 133 # NCBI annotation: SLT orf 123-like protein # Family: family:all:589 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803341;genbank:gi:29028652;genbank:GeneID:1258087 Probab=96.33 E-value=4.9e-05 Score=44.21 Aligned_cols=104 Identities=17% Similarity=0.247 Sum_probs=75.0 Q ss_pred CCccccH--HHHHH---------HHHHHHHHHHHHHHHHHHHhhhcCC--cccchhhcceeecc----cC---cEEEEEe Q lcl|NC_016654. 1 MPVEFNY--GIAAT---------VRGAAKSGLHDAAEVVKQEAIERCP--KETGALRNSAGTAS----DG---MEAVVYF 60 (108) Q Consensus 1 m~vk~n~--~~~~~---------v~~a~~~al~~~~~~v~~~s~~~vP--~dtG~L~~S~~v~~----~~---~~g~V~y 60 (108) |||++.. ++++. |.+.+.+||..+++.|...-...+. .|||.+..+..++- ++ -+..|+| T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~i~W 80 (133) T protein:vir:93 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEW 80 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceeEEEEe Confidence 9999984 33333 4445788999999988888654444 69999999887653 12 3468999 Q ss_pred cCchhhhh--cc---cc----CCCCCCCccchhhHHHHHhHHHHHHHHHHHHHH Q lcl|NC_016654. 61 DTPYAARQ--HE---EV----GWHHVDGQAKYLENAVNATQATVAEVIGEAIRR 105 (108) Q Consensus 61 ~~pYA~~~--h~---~~----~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~ 105 (108) ..|=-||- |- |+ .|-+|.|.+. 
+++++......+.++++++|++ T Consensus 81 ~gp~~R~~iVHLNE~Gytr~Gk~i~PrG~G~-i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:93 81 VGPMNRKNIIHLNEHGYTRDGKKYTPRGFGV-IAKTLAASERKYREIIKKELAR 133 (133) T ss_pred ecCCCceeEEEeeccceecCCCeEccchhhH-HHHHHHhhhHHHHHHHHHHhcC Confidence 99855553 53 33 1346777653 5888888899999999999999 No 120 >protein:vir:78644 Length: 133 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429946;genbank:gi:156604000;genbank:GeneID:5525390 Probab=96.33 E-value=4.9e-05 Score=44.21 Aligned_cols=104 Identities=17% Similarity=0.247 Sum_probs=75.0 Q ss_pred CCccccH--HHHHH---------HHHHHHHHHHHHHHHHHHHhhhcCC--cccchhhcceeecc----cC---cEEEEEe Q lcl|NC_016654. 1 MPVEFNY--GIAAT---------VRGAAKSGLHDAAEVVKQEAIERCP--KETGALRNSAGTAS----DG---MEAVVYF 60 (108) Q Consensus 1 m~vk~n~--~~~~~---------v~~a~~~al~~~~~~v~~~s~~~vP--~dtG~L~~S~~v~~----~~---~~g~V~y 60 (108) |||++.. ++++. |.+.+.+||..+++.|...-...+. .|||.+..+..++- ++ -+..|+| T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~i~W 80 (133) T protein:vir:78 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEW 80 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceeEEEEe Confidence 9999984 33333 4445788999999988888654444 69999999887653 12 3468999 Q ss_pred cCchhhhh--cc---cc----CCCCCCCccchhhHHHHHhHHHHHHHHHHHHHH Q lcl|NC_016654. 61 DTPYAARQ--HE---EV----GWHHVDGQAKYLENAVNATQATVAEVIGEAIRR 105 (108) Q Consensus 61 ~~pYA~~~--h~---~~----~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~ 105 (108) ..|=-||- |- |+ .|-+|.|.+. 
+++++......+.++++++|++ T Consensus 81 ~gp~~R~~iVHLNE~Gytr~Gk~i~PrG~G~-i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:78 81 VGPMNRKNIIHLNEHGYTRDGKKYTPRGFGV-IAKTLAASERKYREIIKKELAR 133 (133) T ss_pred ecCCCceeEEEeeccceecCCCeEccchhhH-HHHHHHhhhHHHHHHHHHHhcC Confidence 99855553 53 33 1346777653 5888888899999999999999 No 121 >protein:vir:94419 Length: 133 # NCBI annotation: ORF028 # Family: family:all:589 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240010;genbank:gi:66395683;genbank:GeneID:5133079 Probab=96.33 E-value=4.9e-05 Score=44.21 Aligned_cols=104 Identities=17% Similarity=0.247 Sum_probs=75.0 Q ss_pred CCccccH--HHHHH---------HHHHHHHHHHHHHHHHHHHhhhcCC--cccchhhcceeecc----cC---cEEEEEe Q lcl|NC_016654. 1 MPVEFNY--GIAAT---------VRGAAKSGLHDAAEVVKQEAIERCP--KETGALRNSAGTAS----DG---MEAVVYF 60 (108) Q Consensus 1 m~vk~n~--~~~~~---------v~~a~~~al~~~~~~v~~~s~~~vP--~dtG~L~~S~~v~~----~~---~~g~V~y 60 (108) |||++.. ++++. |.+.+.+||..+++.|...-...+. .|||.+..+..++- ++ -+..|+| T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~i~W 80 (133) T protein:vir:94 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEW 80 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceeEEEEe Confidence 9999984 33333 4445788999999988888654444 69999999887653 12 3468999 Q ss_pred cCchhhhh--cc---cc----CCCCCCCccchhhHHHHHhHHHHHHHHHHHHHH Q lcl|NC_016654. 61 DTPYAARQ--HE---EV----GWHHVDGQAKYLENAVNATQATVAEVIGEAIRR 105 (108) Q Consensus 61 ~~pYA~~~--h~---~~----~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~ 105 (108) ..|=-||- |- |+ .|-+|.|.+. 
+++++......+.++++++|++ T Consensus 81 ~gp~~R~~iVHLNE~Gytr~Gk~i~PrG~G~-i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:94 81 VGPMNRKNIIHLNEHGYTRDGKKYTPRGFGV-IAKTLAASERKYREIIKKELAR 133 (133) T ss_pred ecCCCceeEEEeeccceecCCCeEccchhhH-HHHHHHhhhHHHHHHHHHHhcC Confidence 99855553 53 33 1346777653 5888888899999999999999 No 122 >protein:vir:4956 Length: 153 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049932;genbank:gi:9632903;genbank:GeneID:1262079 Probab=95.79 E-value=0.00017 Score=41.27 Aligned_cols=104 Identities=23% Similarity=0.268 Sum_probs=60.5 Q ss_pred CCccccH---HHHHHHH-------HHHHHHHHHHHHHHHHHhhhcCCc------cc---chhhcceeecc---c---CcE Q lcl|NC_016654. 1 MPVEFNY---GIAATVR-------GAAKSGLHDAAEVVKQEAIERCPK------ET---GALRNSAGTAS---D---GME 55 (108) Q Consensus 1 m~vk~n~---~~~~~v~-------~a~~~al~~~~~~v~~~s~~~vP~------dt---G~L~~S~~v~~---~---~~~ 55 (108) |. .|.. ++...++ +...+++...+..+...-...+|. .| +.|..|+.+.. + .+. T Consensus 1 M~-~~~~glee~~~~lekL~~~~~~~~~katkAGA~v~~e~L~~~tp~~h~~~~kt~~~~HlaD~I~~s~~~idG~~dG~ 79 (153) T protein:vir:49 1 MT-GLDEALEGWLKTVASIGDLTPAEQAKITTAGAKVFKEELAEVTREKHYSKKKDLKYGHMADGLAVQSTNADGRKNGV 79 (153) T ss_pred Cc-cHHHHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCCCCCCcccccceeccccccccccce Confidence 32 1332 2333332 334566665565555554545543 23 47888877642 2 235 Q ss_pred EEEEecCch----hhhhccccCCCCCCCccchhhHHHHHh--HHHHH----HHHHHHHHHhcC Q lcl|NC_016654. 56 AVVYFDTPY----AARQHEEVGWHHVDGQAKYLENAVNAT--QATVA----EVIGEAIRRSIA 108 (108) Q Consensus 56 g~V~y~~pY----A~~~h~~~~~~~~~~~~k~le~a~~~~--~~~i~----~~i~~~ir~~Lg 108 (108) ..|||+-+| |+++..||.+..|+ +|+|++..+. +.++. 
+.+.+.|++.+| T Consensus 80 s~VG~~~~~~a~~a~f~n~GT~km~~~---hFie~tr~e~~~k~~vl~A~~~~~~~il~~~~~ 139 (153) T protein:vir:49 80 STVGWKNNYHAQNARRLNDGTKKYRAD---HFITNVQNDSTVKNKVLLAEKEEYEKLIRRKGG 139 (153) T ss_pred eeecccCCccceeeeecccCcccCCCC---hhhHHHHHHhhHHHHHHHHHHHHHHHHHHhcCC Confidence 579998666 44445577655444 7999998765 56676 466777777888 No 123 >protein:vir:81067 Length: 119 # NCBI annotation: p12 # Family: family:all:2714 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285682;genbank:gi:156535145;genbank:GeneID:5247112 Probab=95.41 E-value=3.9e-05 Score=44.73 Aligned_cols=80 Identities=20% Similarity=0.301 Sum_probs=57.0 Q ss_pred HHHHhhhcCCcccchhhcceeec---ccCcEEE----EEec---CchhhhhccccCC------CCCCC------------ Q lcl|NC_016654. 28 VKQEAIERCPKETGALRNSAGTA---SDGMEAV----VYFD---TPYAARQHEEVGW------HHVDG------------ 79 (108) Q Consensus 28 v~~~s~~~vP~dtG~L~~S~~v~---~~~~~g~----V~y~---~pYA~~~h~~~~~------~~~~~------------ 79 (108) |..++...+|++||+|++|.++. .+...|. |+|| +||..-++||- | +.++| T Consensus 1 ~rDeakarv~~~~G~Lr~sIY~ay~~~~S~dG~~~Y~Vswn~rkAPhghlvE~Gh-w~~~~~~~~~dG~w~~~~~~l~~~ 79 (119) T protein:vir:81 1 MRESAKAFVNDETGKLRSNLYVAYSPEESTNGVQTYAVSWRKKAAPHGHLLEFGH-WQTHAAYKGKDGEWYSSSVKLVNP 79 (119) T ss_pred CCcccccccCCCccchhhhheeeeccccCCCCeEEEEeeccCCcCCcccccccce-eeeeeeeeccCceeeecCccccCc Confidence 66667889999999999999754 2333444 6666 57666666651 1 11111 Q ss_pred ----ccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 80 ----QAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 80 ----~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) -..||.++++.-+....+++...+++.+. 
T Consensus 80 ~~vPa~pFlRpA~da~~~~a~~~~~~r~~~rv~ 112 (119) T protein:vir:81 80 KWIPARPFLRPGYDSVAMQIPDIAKAAGAKKYA 112 (119) T ss_pred eecCCCCccchhHHHHHHHHHHHHHHHHHHHHH Confidence 02599999999999999999998888777 No 124 >protein:vir:10367 Length: 119 # NCBI annotation: conserved phage protein # Family: family:all:2714 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858959;genbank:gi:32128424;genbank:GeneID:2648366 Probab=95.33 E-value=4.4e-05 Score=44.49 Aligned_cols=80 Identities=18% Similarity=0.250 Sum_probs=57.0 Q ss_pred HHHHhhhcCCcccchhhcceeec---ccCcEEE----EEec---Cchhhhhcccc------------CCC-------CC- Q lcl|NC_016654. 28 VKQEAIERCPKETGALRNSAGTA---SDGMEAV----VYFD---TPYAARQHEEV------------GWH-------HV- 77 (108) Q Consensus 28 v~~~s~~~vP~dtG~L~~S~~v~---~~~~~g~----V~y~---~pYA~~~h~~~------------~~~-------~~- 77 (108) |..++...+|++||+|++|.++. .+...|. |+|| +||..-++||- .|- +| T Consensus 1 ~rDeakarv~~~~G~Lr~sIY~ay~~~~S~dG~~~Y~Vswn~rkAPhghlvE~Ghw~~~~~~~~~dG~w~~~~~~l~~~~ 80 (119) T protein:vir:10 1 MRESAKAFVNDETGKLRSNLYVAYSTEESTNGVQTYAVSWRKKAAPHGHLLEFGHWQTHAAYKGKDGEWYSSSVKLVNPK 80 (119) T ss_pred CCcccccccCCCccchhhhheeeeccccCCCCEEEEEeecCCCcCCcccccccceeeeeeeeeccCceeeecCccccCce Confidence 66667889999999999999754 2333444 6666 57666666651 011 11 Q ss_pred --CCccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 78 --DGQAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 78 --~~~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) ++ ..||.++++.-+....+++...+++.+. 
T Consensus 81 ~vPa-~pFlRpA~da~~~~a~~~~~~r~~~rv~ 112 (119) T protein:vir:10 81 WIPA-RPFLRPGYDSVAMQIPDIAKAAGAKKYA 112 (119) T ss_pred ecCC-CCccchhHHHHHHHHHHHHHHHHHHHHH Confidence 12 2599999999999999999998888777 No 125 >protein:vir:96012 Length: 133 # NCBI annotation: ORF023 # Family: family:all:589 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239805;genbank:gi:66395471;genbank:GeneID:5132929 Probab=94.93 E-value=0.00038 Score=39.35 Aligned_cols=103 Identities=17% Similarity=0.246 Sum_probs=72.4 Q ss_pred CC-------------ccccHHHHHHHHHHHHHHHHHHHHHHHHHh--hhcCCcccchhhcceeecc----cC-cEEEEEe Q lcl|NC_016654. 1 MP-------------VEFNYGIAATVRGAAKSGLHDAAEVVKQEA--IERCPKETGALRNSAGTAS----DG-MEAVVYF 60 (108) Q Consensus 1 m~-------------vk~n~~~~~~v~~a~~~al~~~~~~v~~~s--~~~vP~dtG~L~~S~~v~~----~~-~~g~V~y 60 (108) || -||.. ++|.+.+.+||..+++.|...- +.-+=.|||....+..++. ++ -+..|+| T Consensus 1 m~evkGv~eilk~lE~k~G~---~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGatidev~~s~p~~~~g~rtV~i~W 77 (133) T protein:vir:96 1 MRLIYDTKKLERELEKRLSK---RALMRITDRALTEAGEVVLEAIRTNLKYFRDTGAEYGEVKLSKPTWENGKRTIRVYW 77 (133) T ss_pred CccccCHHHHHHHHHHhcCH---HHHHHHhhHHHHHHHHHHHHHHHHhhHHHhhccceeeeEEecCceecCCceEEEEEe Confidence 44 23332 3455667888888888877774 3333459999998876543 22 3578999 Q ss_pred cCchhhhh--cc---c-c----CCCCCCCccchhhHHHHHhHHHHHHHHHHHHHHhc Q lcl|NC_016654. 61 DTPYAARQ--HE---E-V----GWHHVDGQAKYLENAVNATQATVAEVIGEAIRRSI 107 (108) Q Consensus 61 ~~pYA~~~--h~---~-~----~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~~L 107 (108) ..|=-||- |- | + .|-+|.|.+. 
+++++......+.++++++|++-| T Consensus 78 ~gp~~R~~iVHLNE~G~ytr~Gk~i~PrG~G~-I~~al~~se~~y~~~vk~el~kll 133 (133) T protein:vir:96 78 EGEKHRYSIVHLNEKGFYAKDGKFIRPKGMGA-IDKALRASRDKFFKVYAEEVSKLL 133 (133) T ss_pred ecCCCceeeEeeecccceecCCceeccchhhH-HHHHHHhhhHHHHHHHHHHHHHhC Confidence 99855543 43 2 1 2457887663 688899999999999999999999 No 126 >protein:vir:5000 Length: 141 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049974;genbank:gi:9632946;genbank:GeneID:1262109 Probab=94.84 E-value=0.00047 Score=38.82 Aligned_cols=104 Identities=19% Similarity=0.156 Sum_probs=65.3 Q ss_pred CccccH---HHHHHHHH-------HHHHHHHHHHHHHHHHhhhcCCc---------ccchhhcceeecc---c---CcEE Q lcl|NC_016654. 2 PVEFNY---GIAATVRG-------AAKSGLHDAAEVVKQEAIERCPK---------ETGALRNSAGTAS---D---GMEA 56 (108) Q Consensus 2 ~vk~n~---~~~~~v~~-------a~~~al~~~~~~v~~~s~~~vP~---------dtG~L~~S~~v~~---~---~~~g 56 (108) =+.|.. ++...|++ ...+++.+.+..+...-...+|. ..+.|..|..+.. + .+.. T Consensus 1 M~~~~~gl~e~~~~lekl~~~~~~~~~katkAGA~v~~~~L~~~tp~~hy~~~~~~~~~HlaD~I~~~~~~~DG~~dg~s 80 (141) T protein:vir:50 1 MVGLAEALDEWLKTVASIGNLTPAEQVEITTAGAKVFKKELEEVTREKHYSRKKNPKFGHMADGLAIQSTNADGRKNGVS 80 (141) T ss_pred CccHHHHHHHHHHHHHHhcCCCHHHHHHHHHHHHHHHHHHHHHhcccCCCCCCCCCCCCccccceeeccCccccccCCee Confidence 122332 23333333 45666676677666667777773 3557888887753 2 2345 Q ss_pred EEEecCch-hhhhcc---ccCCCCCCCccchhhHHHHHh--HHHHHHHHHHHHHHhcC Q lcl|NC_016654. 57 VVYFDTPY-AARQHE---EVGWHHVDGQAKYLENAVNAT--QATVAEVIGEAIRRSIA 108 (108) Q Consensus 57 ~V~y~~pY-A~~~h~---~~~~~~~~~~~k~le~a~~~~--~~~i~~~i~~~ir~~Lg 108 (108) .|||+-.| |.+.|+ ||.+..|+ +|+|++..+. 
+++|.+...+++++-|- T Consensus 81 ~VG~~~~~~~~~A~f~n~GT~k~~~~---hFve~~~~~a~~k~~Vl~A~~~~~k~~l~ 135 (141) T protein:vir:50 81 TVGWKNNYHAQNARRLNDGTKKYRAD---HFVTNVQNDSTVQKKVLLEKKRNTKNSLE 135 (141) T ss_pred eeccCCCccceeeeccccCccccCCC---chhHHHHHhhhhHHHHHHHHHHHHHHHHH Confidence 79997777 433354 66544443 7999999754 67888888888887665 No 127 >protein:vir:3848 Length: 159 # NCBI annotation: hypothetical protein # Family: family:all:1029 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050154;swissprot:trembl:q9t1f3;genbank:gi:9633046;uniprot:Q9T1F3;genbank:GeneID:1262148 Probab=94.35 E-value=0.0016 Score=35.93 Aligned_cols=108 Identities=17% Similarity=0.146 Sum_probs=67.5 Q ss_pred CCccccHHH---HHHHHHH-------HHHHHHHHHHHHHHHhhhcCCc-----------------------ccchhhcce Q lcl|NC_016654. 1 MPVEFNYGI---AATVRGA-------AKSGLHDAAEVVKQEAIERCPK-----------------------ETGALRNSA 47 (108) Q Consensus 1 m~vk~n~~~---~~~v~~a-------~~~al~~~~~~v~~~s~~~vP~-----------------------dtG~L~~S~ 47 (108) |.+.|+..+ +..|++. ..++..+-|..+..--...+|. ..|.|..|+ T Consensus 1 mm~~~~~~l~~~l~~v~k~~~~~~~~k~kiTkAGAkv~~e~L~~~Tp~~h~~~~k~~~~~~~~~k~~~~~~~~~HlaD~I 80 (159) T protein:vir:38 1 MANDMGEFYNNWVNEVEKGMKLSVEDKAKITGEGAEAFSTVLHDHTPRSNEIYRRGRSAGHANAKHHNRNRKTKHLQDSI 80 (159) T ss_pred CcchHHHHHHHHHHHHHHhcCCCHHHHHHHHHHhHHHHHHHHHHhcccCCCccccccccccccccccCcCcCCCccccce Confidence 999998533 3344321 2222233333333333455554 135888888 Q ss_pred eecc----c---CcEEEEEecCch-hhhhcc---ccCCCCCC--CccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 48 GTAS----D---GMEAVVYFDTPY-AARQHE---EVGWHHVD--GQAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 48 ~v~~----~---~~~g~V~y~~pY-A~~~h~---~~~~~~~~--~~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) .+.. 
+ .|+..|||+-.| |.+.|+ ||.+..|+ ..-+|+|++..+.+++|.+...+++++=|- T Consensus 81 ~~~~~~~iDg~~dG~s~VGw~~~~~a~~a~f~NdGT~~m~~k~~~gdHFvekt~~~~k~~Vl~A~~~~~~~il~ 154 (159) T protein:vir:38 81 TYKPGYTADKLHTGDTDVGFEGKYYDFLAKIVNNGQHHMSPKRYKNMHFLDKAQQEAKKSVAEAELKAYKEVMN 154 (159) T ss_pred eeecCccccccccceeeecccCCccceEeeecccCccccCCCCccCChhHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 7642 2 256779996555 333354 77665443 123899999999999999999999999888 No 128 >protein:vir:95372 Length: 124 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764480;genbank:gi:115334634;genbank:GeneID:5179259 Probab=93.98 E-value=0.00074 Score=37.73 Aligned_cols=104 Identities=17% Similarity=0.143 Sum_probs=61.1 Q ss_pred CC-ccccH--------------HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecccCcEEEEEecCchh Q lcl|NC_016654. 1 MP-VEFNY--------------GIAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASDGMEAVVYFDTPYA 65 (108) Q Consensus 1 m~-vk~n~--------------~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~~~~g~V~y~~pYA 65 (108) |+ |+++. .+...|+++++.+-..+++.|..+...-.|++||...+|=.+..+. +|.++||-.+- T Consensus 1 M~~i~id~La~~I~~~L~~Ys~~v~~~v~~~v~~vak~a~~~lkk~i~~tspkrTG~YaK~W~~kk~~-e~~~V~nk~~y 79 (124) T protein:vir:95 1 MAKIKIGRLADEITSQLRKYSQVIADDVEQIMDDVTKEAVGRLKSKIQEVGLVQTGDYMRGWTRKRVP-NGWVIHNKTEY 79 (124) T ss_pred CccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHhcCcccccchhccceeeeec-CceeEEEcCCC Confidence 54 33332 1222233344444455556666777789999999999987666543 45688885332 Q ss_pred hhhcc-ccCCCCCCC-c---cchhhHHHHHhHHHHHHHHHHHHHH Q lcl|NC_016654. 
66 ARQHE-EVGWHHVDG-Q---AKYLENAVNATQATVAEVIGEAIRR 105 (108) Q Consensus 66 ~~~h~-~~~~~~~~~-~---~k~le~a~~~~~~~i~~~i~~~ir~ 105 (108) +-.|- +.+.-..+| + -+.+.++-..-..++.+-|++.|++ T Consensus 80 qLtHLLE~GHAkr~GGRV~a~pHI~paee~~~~~l~~~i~~~l~~ 124 (124) T protein:vir:95 80 RLAHLLEYGHATVDGGRVPGTPHIRPIEDWLEKEFEDRVEKAIKQ 124 (124) T ss_pred ceeeeeecceeccCCcccCCccchhHHHHHHHHHHHHHHHHHhcC Confidence 33354 222212222 1 2567777777777777777777777 No 129 >protein:vir:4859 Length: 140 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049399;genbank:gi:9632427;genbank:GeneID:1258496 Probab=93.98 E-value=0.00087 Score=37.38 Aligned_cols=104 Identities=23% Similarity=0.221 Sum_probs=64.2 Q ss_pred CCccccH---HHHHHH-------HHHHHHHHHHHHHHHHHHhhhcCC------cccc---hhhcceeecc---cC---cE Q lcl|NC_016654. 1 MPVEFNY---GIAATV-------RGAAKSGLHDAAEVVKQEAIERCP------KETG---ALRNSAGTAS---DG---ME 55 (108) Q Consensus 1 m~vk~n~---~~~~~v-------~~a~~~al~~~~~~v~~~s~~~vP------~dtG---~L~~S~~v~~---~~---~~ 55 (108) |. .|+. ++.+.+ .+...+++.+-+..+...-...+| ..|| .|..|..+.. ++ +. T Consensus 1 M~-~~~d~l~e~~~~lekl~~~~~~~~~katkAGA~v~~~~L~~~tp~~h~~~~~t~~~~HlaD~I~~~~~~iDg~~~g~ 79 (140) T protein:vir:48 1 MT-GLDEALEGWLKTVASIGDLTPAEQAKITTAGAKVFKEELAEVTRQKHYSNKKHLKYGHMADGLSVQSTNVDGRKNGV 79 (140) T ss_pred Cc-cHHHHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCCCCCCcchhceeecccccccccCce Confidence 32 2332 233333 234566666666666666777888 4555 5999888652 22 34 Q ss_pred EEEEecCch----hhhhccccCCCCCCCccchhhHHHHHh--HHHHHHHHHHHHHHhcC Q lcl|NC_016654. 56 AVVYFDTPY----AARQHEEVGWHHVDGQAKYLENAVNAT--QATVAEVIGEAIRRSIA 108 (108) Q Consensus 56 g~V~y~~pY----A~~~h~~~~~~~~~~~~k~le~a~~~~--~~~i~~~i~~~ir~~Lg 108 (108) ..|||+-.| |+++-.||.+..++ +|+|++..+. 
+.++.+...+++++-|- T Consensus 80 s~VG~~kk~~a~~A~f~n~GT~k~~~~---hFve~~~~e~~~k~~vl~A~~~~~~~~l~ 135 (140) T protein:vir:48 80 STVGWVNRYHAQNARRLNDGTKKYRAD---HFVTNVQNDSAVQTKVLLAEKEEYEKLIR 135 (140) T ss_pred eeeccCCCcceeeeeccccCccccCCC---chhHHHHHhhhhHHHHHHHHHHHHHHHHH Confidence 579997554 33334466544443 7999999865 67788877777777665 No 130 >protein:vir:4200 Length: 133 # NCBI annotation: unknown # Family: family:all:11764 # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071825;genbank:gi:11863108;genbank:GeneID:1257610 Probab=93.09 E-value=0.0009 Score=37.29 Aligned_cols=104 Identities=22% Similarity=0.281 Sum_probs=68.7 Q ss_pred CccccHH-----H--HHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecccCcEEEEEecCchhhhhccccCC Q lcl|NC_016654. 2 PVEFNYG-----I--AATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASDGMEAVVYFDTPYAARQHEEVGW 74 (108) Q Consensus 2 ~vk~n~~-----~--~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~~~~g~V~y~~pYA~~~h~~~~~ 74 (108) -+++|-+ + ...++..+++.+.....++...+..-+|+.||+|+.|...++.+.+|++.-..||-.++-.|-+| T Consensus 1 mi~i~idkp~almek~~ev~~~ie~t~~~~~~~l~~i~~ntapiktg~lr~sh~~siegstgelsn~~~yl~~vl~grgw 80 (133) T protein:vir:42 1 MIEIRIDKPDALMEKPHEVQGKIEETLEKILNQLQGIAENTAPVKTGNLRDSHIISIEGSTGELSNLAYYLPFVLHGRGW 80 (133) T ss_pred CeeeecCCchhhhcchhhhhhHHHHHHHHHHHHHHHHhhhccccccccceeeeeEEeecCccchhhhhHHhhHhhhcccc Confidence 2444421 1 22456667778888888888889999999999999999999999999999999999998766554 Q ss_pred C--------------CCCCc------cchhhHHHH--HhHHHHHHHHHHHHHH Q lcl|NC_016654. 75 H--------------HVDGQ------AKYLENAVN--ATQATVAEVIGEAIRR 105 (108) Q Consensus 75 ~--------------~~~~~------~k~le~a~~--~~~~~i~~~i~~~ir~ 105 (108) - ||-+- .-||..+.. 
+-+.-+.+..-+-+|+ T Consensus 81 vfpv~~kal~wpelphpvayarpappndyfsa~vay~~~~give~s~iewlre 133 (133) T protein:vir:42 81 VFPVRRKALWWPELPHPVAYARPAPPNDYFSAVVAYSAPEGVVEETLIEWLRE 133 (133) T ss_pred eeeccccccccCCCCCcccccCCCCCchhhhhhhhhhcccchhHHHHHHHHhC Confidence 2 33221 247766543 2222222222223333 No 131 >protein:vir:2688 Length: 123 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075507;genbank:gi:12719436;genbank:GeneID:920156 Probab=91.48 E-value=0.0031 Score=34.32 Aligned_cols=101 Identities=15% Similarity=0.202 Sum_probs=70.3 Q ss_pred CCccccHHHHHHHHHHHHHHHHHHHHHHHHHh--hhcCCcccchhhcceeecc----cC---cEEEEEecCchhhhh--c Q lcl|NC_016654. 1 MPVEFNYGIAATVRGAAKSGLHDAAEVVKQEA--IERCPKETGALRNSAGTAS----DG---MEAVVYFDTPYAARQ--H 69 (108) Q Consensus 1 m~vk~n~~~~~~v~~a~~~al~~~~~~v~~~s--~~~vP~dtG~L~~S~~v~~----~~---~~g~V~y~~pYA~~~--h 69 (108) |--||.. ++|.+.+.+||..+++.|...- +.-+=.|||....+..++- ++ -+..|+|..|=-||- | T Consensus 5 lE~k~G~---~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGatidev~~s~p~~~~g~~~rtV~i~W~gp~~R~~iVH 81 (123) T protein:vir:26 5 LESVYGK---QSMQAKSDRALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEWVGPMNRKNIIH 81 (123) T ss_pred HHHhcCH---HHHHHhhhHHHHHHHHHHHHHHHHhhHHhhhccceeeeEEecCeeeccCCccceEEEEeecCCCceeeEe Confidence 3333332 4566678888888888877774 3334459999999887652 22 356899999855543 5 Q ss_pred c---cc----CCCCCCCccchhhHHHHHhHHHHHHHHHHHHHH Q lcl|NC_016654. 70 E---EV----GWHHVDGQAKYLENAVNATQATVAEVIGEAIRR 105 (108) Q Consensus 70 ~---~~----~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~ 105 (108) - |+ .|-+|.|.+. 
+++++......+.++++++|++ T Consensus 82 LNE~GYtr~Gk~i~PRG~G~-i~~a~~~se~~y~~~vk~eL~k 123 (123) T protein:vir:26 82 LNEHGYTRDGKKYTPRGFGV-IAKTLAANERKYREIIKKELAR 123 (123) T ss_pred eeccceecCCCeEccchhhH-HHHHHHhhhHHHHHHHHHHhcC Confidence 3 33 1346777653 5888889999999999999999 No 132 >protein:vir:4833 Length: 140 # NCBI annotation: ORF29 # Family: family:all:1029 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038330;genbank:gi:9634656;genbank:GeneID:1262624 Probab=91.29 E-value=0.0016 Score=35.92 Aligned_cols=104 Identities=19% Similarity=0.181 Sum_probs=60.5 Q ss_pred CCccccHHHHHHH-------HHHHHHHHHHHHHHHHHHhhhcCCc------ccc---hhhcceeecc---c---CcEEEE Q lcl|NC_016654. 1 MPVEFNYGIAATV-------RGAAKSGLHDAAEVVKQEAIERCPK------ETG---ALRNSAGTAS---D---GMEAVV 58 (108) Q Consensus 1 m~vk~n~~~~~~v-------~~a~~~al~~~~~~v~~~s~~~vP~------dtG---~L~~S~~v~~---~---~~~g~V 58 (108) |+--|+ ++.+.+ .+...+++.+-+..+.......+|. .|| .|..|..+.. + .+...| T Consensus 4 ~~d~l~-e~~~~v~kl~~~~~~~~~katkAGAkv~~~~L~~~tp~~h~~~r~t~~~~HlaD~I~~~~~~idg~~dG~s~V 82 (140) T protein:vir:48 4 LDEALE-GWLKTVASIGDLTPAEQAKITTAGAKVFKKELAEVTREKHYSKKKDLKYGHMADGLAVQSTNVDGRKNGVATV 82 (140) T ss_pred HHHHHH-HHHHHHHHhccCCHHHHHHHHHHhHHHHHHHHHHhcccCCCCCCCCCCCCcccccceecccccccccccceee Confidence 332122 223333 2345556666666666666777763 344 6888887652 2 234569 Q ss_pred EecCch-h---hhhccccCCCCCCCccchhhHHHHHh--HHHHHHHHHHHHHH----hcC Q lcl|NC_016654. 59 YFDTPY-A---ARQHEEVGWHHVDGQAKYLENAVNAT--QATVAEVIGEAIRR----SIA 108 (108) Q Consensus 59 ~y~~pY-A---~~~h~~~~~~~~~~~~k~le~a~~~~--~~~i~~~i~~~ir~----~Lg 108 (108) ||+-+| | +++-.||.+..|+ +|+|++..+. 
++++.+-..+++++ ..| T Consensus 83 G~~k~~~a~~a~f~NdGT~k~~~~---hFve~t~~e~~~~~~vl~A~~~~y~~~l~kk~~ 139 (140) T protein:vir:48 83 GWKNNYHAQNARRLNDGTKKYRAD---HFVTNVQNDSAVRDKVLLAEKEEYEKLIRKKGG 139 (140) T ss_pred cccCCCceeEEeecccCccccCCC---chHHHHHHhhhhHHHHHHHHHHHHHHHHHhhcC Confidence 999775 3 3334466654444 7999999754 67777766655444 445 No 133 >protein:vir:6216 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:10886 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852596;genbank:gi:31415856;genbank:GeneID:1489214 Probab=91.28 E-value=0.0049 Score=33.27 Aligned_cols=107 Identities=16% Similarity=0.172 Sum_probs=74.5 Q ss_pred CCccccH--HHHH------HH-HHHHHHHHHHHHHHHHHHhhhcCCc----ccchhhcceeecccCcEEEEEec--Cchh Q lcl|NC_016654. 1 MPVEFNY--GIAA------TV-RGAAKSGLHDAAEVVKQEAIERCPK----ETGALRNSAGTASDGMEAVVYFD--TPYA 65 (108) Q Consensus 1 m~vk~n~--~~~~------~v-~~a~~~al~~~~~~v~~~s~~~vP~----dtG~L~~S~~v~~~~~~g~V~y~--~pYA 65 (108) |+..=|. ++.. +| ++-.+.+|.+++...+..-.|-+|. -.|.|+.|..|.+......|.|- +-|= T Consensus 1 m~sNNNGFae~~~~~~tl~kVd~kvs~e~L~eAA~~f~~KL~P~Ip~Sl~kkk~HlrD~lkVvvk~d~V~V~Fed~a~yW 80 (125) T protein:vir:62 1 MASNNNGFAEALEDINTLLRVNKKVSLDALDEAAKYFASKLKPKINVSNKNKRTHLRDSLKVVVKDDRVSVEFKDEAWYW 80 (125) T ss_pred CCCCchhHHHHHHHhhhhhhhhhhhhHHHHHHHHHHHHHhhccccChhhhhhhhhcceeeeEEeeCCeEEEEEcchhhhh Confidence 7766552 1111 12 2346788888888888888787775 34689999888887777778774 3444 Q ss_pred hhhccccCCCCCCCc---cchhhHHHHHhHHHHHHHHHHHHHHhc Q lcl|NC_016654. 
66 ARQHEEVGWHHVDGQ---AKYLENAVNATQATVAEVIGEAIRRSI 107 (108) Q Consensus 66 ~~~h~~~~~~~~~~~---~k~le~a~~~~~~~i~~~i~~~ir~~L 107 (108) +..+-|+...+-.|+ .+|..--++.++++|.+|+.+.|-..| T Consensus 81 ~f~EnGt~~~~~~g~vkaqhf~~~Tf~~nk~kI~~iM~kki~d~m 125 (125) T protein:vir:62 81 YLVEHGHKKAKGKGRVKGKHFVQNTFDAEGDKIADIMAQKIINRM 125 (125) T ss_pred hhhhccccccccccccchhhhhhccHHhhHHHHHHHHHHHHHhhC Confidence 444445543322331 379899999999999999999999999 No 134 >protein:vir:4162 Length: 133 # NCBI annotation: unknown # Family: family:all:11764 # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046971;genbank:gi:9630541;genbank:GeneID:1261715 Probab=89.78 E-value=0.0035 Score=34.03 Aligned_cols=105 Identities=18% Similarity=0.240 Sum_probs=68.4 Q ss_pred CccccHH-----H--HHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecccCcEEEEEecCchhhhhccccCC Q lcl|NC_016654. 2 PVEFNYG-----I--AATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASDGMEAVVYFDTPYAARQHEEVGW 74 (108) Q Consensus 2 ~vk~n~~-----~--~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~~~~g~V~y~~pYA~~~h~~~~~ 74 (108) -+++|-+ + ...++..+++.+.....++...+..-+|+.||+|+.|...++.+.+|++.-..||-.++-.|-+| T Consensus 1 mi~i~idkp~almek~~ev~~~ie~t~~~~~~~l~~i~~ntapiktg~lr~sh~~siegstgelsn~~~yl~~vl~grgw 80 (133) T protein:vir:41 1 MIRINIDKPEALMEKASEVEDRVEQTVTLLMIELEEILMNTAPIKTGELRISHTWSVEGSTGELTNTVPYLQWVLFGRGW 80 (133) T ss_pred CeeeecCCchhhhcchhhhhhHHHHHHHHHHHHHHHHhhhccccccccceeeeeEEeecCccchhhhhHHhhHhhhcccc Confidence 2444421 1 22456667778888888888889999999999999999999999999999999999998766554 Q ss_pred C--------------CCCCc------cchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 75 H--------------HVDGQ------AKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 75 ~--------------~~~~~------~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) - ||-+- .-||..+..-... 
.-++++.+-+.|= T Consensus 81 vfpv~~kal~wpelphpvayarpappndyfsa~vay~~~--~give~s~iewli 132 (133) T protein:vir:41 81 VFPVEKKALYWPELPHPVAYARPAPPNDYFSAAVAYIDA--KGIVEDSFIEWLI 132 (133) T ss_pred eeeecccccccCCCCCcccccCCCCCchhhhhhhhhhcc--cchhHHHHHHHhc Confidence 2 33221 2476655432111 1123333333333 No 135 >protein:vir:80116 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425608;genbank:gi:155042941;genbank:GeneID:5469542 Probab=87.17 E-value=0.014 Score=30.70 Aligned_cols=106 Identities=17% Similarity=0.231 Sum_probs=56.9 Q ss_pred CC-ccccH-------H---HHHHHHHHHHHHHHHH----HHHHHHHhhhcCCcccchhhcceeecccCcEEEEEec-Cch Q lcl|NC_016654. 1 MP-VEFNY-------G---IAATVRGAAKSGLHDA----AEVVKQEAIERCPKETGALRNSAGTASDGMEAVVYFD-TPY 64 (108) Q Consensus 1 m~-vk~n~-------~---~~~~v~~a~~~al~~~----~~~v~~~s~~~vP~dtG~L~~S~~v~~~~~~g~V~y~-~pY 64 (108) |+ |+++. . ....+...++.++..+ ++.+..+...--|++||...+|=.+.... ++.++|| ++| T Consensus 1 M~~i~id~La~~I~~~L~~y~~~v~~~v~~~v~evak~a~~~lkk~i~~tsPkrTG~YaK~W~~k~~~-~~~~v~nk~~y 79 (127) T protein:vir:80 1 MANIKIDRLGDEITRQLKRYSQVIAGDLEQIMDDVSKEAVDRLKAKIEEEGLVQTGDYKRGWTRKRTP-GGWVIHNKTEY 79 (127) T ss_pred CccccHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCccccccccccceeeecc-CceeEeecCCc Confidence 54 33332 1 1223444555555444 44445555678999999999987655543 4678888 465 Q ss_pred hhhhcc-ccCCCCCCC-c---cchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 
65 AARQHE-EVGWHHVDG-Q---AKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 65 A~~~h~-~~~~~~~~~-~---~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) +-.|- +.+.-..+| + -+.+.++-..-..++.+-|++.|+.+-- T Consensus 80 -qLtHLLE~GHAkr~GGRV~a~pHI~paee~~~~~l~~~i~~~l~~~~~ 127 (127) T protein:vir:80 80 -RLAHLLEYGHATVDGGRVPETPHIRPVEDWLEKEFEDRVERAIKNESR 127 (127) T ss_pred -ceeehhhcceeccCCcccCCccchhhHHHHHHHHHHHHHHHHhcCCCC Confidence 23354 222111221 1 2466666555566666656555554322 No 136 >protein:vir:5257 Length: 148 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852762;genbank:gi:31544037;uniprot:Q7Y5T8;genbank:GeneID:2753554 Probab=86.78 E-value=0.016 Score=30.41 Aligned_cols=88 Identities=13% Similarity=0.101 Sum_probs=46.0 Q ss_pred CCccccHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecc--cCcEEEEEe-cCchhhhhccccCCCCC Q lcl|NC_016654. 1 MPVEFNYGIAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTAS--DGMEAVVYF-DTPYAARQHEEVGWHHV 77 (108) Q Consensus 1 m~vk~n~~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~--~~~~g~V~y-~~pYA~~~h~~~~~~~~ 77 (108) |||++..+... -++..+.+... +.. .++ =|+... ......-+. .+-.|.+++||.. +-| T Consensus 1 M~~~~k~~~~~--~~~l~~~l~~l-------~~~-------~v~-VGi~~~~~~~~~~~~g~~vA~ia~~~E~G~~-~IP 62 (148) T protein:vir:52 1 MAVTVTANFSA--AKQLIEQMKSL-------KEK-------AVY-VGFPAEFDEKVKGSENFNLASLAAVLEFGNE-HIP 62 (148) T ss_pred CccccccccHH--HHHHHHHHHHh-------hCC-------eEE-EEeecCcCCCCCCCCCCCHHHHHHHHhcCCC-CCC Confidence 99988765422 11112222211 110 000 011100 000111112 2345667777765 233 Q ss_pred CCccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 
78 DGQAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 78 ~~~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) ++ .||...+.+++++|.+.+++.++.++- T Consensus 63 -~R-pflr~t~~~~~~~~~~~~~~~~~~~~~ 91 (148) T protein:vir:52 63 -AR-PFLRQTLEENQEKYTALFIQWFDQGVP 91 (148) T ss_pred -Cc-chhHHHHHHHHHHHHHHHHHHHHcCCC Confidence 33 699999999999999999999887766 No 137 >protein:vir:96105 Length: 193 # NCBI annotation: hypothetical protein ORF028 # Family: family:all:503 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294445;genbank:gi:149408342;genbank:GeneID:5237224 Probab=83.39 E-value=0.019 Score=30.06 Aligned_cols=93 Identities=14% Similarity=0.066 Sum_probs=52.9 Q ss_pred CCccccHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecccCcEEEEEecCch-hhhhccccCCCCCCC Q lcl|NC_016654. 1 MPVEFNYGIAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASDGMEAVVYFDTPY-AARQHEEVGWHHVDG 79 (108) Q Consensus 1 m~vk~n~~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~~~~g~V~y~~pY-A~~~h~~~~~~~~~~ 79 (108) |++|.+-....++.+..++ +... ...| |-+..+.. .+...++++.+..| |...+||...++|.+ T Consensus 1 m~~~~~~~~~~~~~~~l~~--------l~~~-~v~v----Gi~~~~~~--~~~~~~~~G~~va~iAai~EfG~~I~~~~~ 65 (193) T protein:vir:96 1 MSLRRDSELIAAHLQMLRA--------MRGR-SVSA----GWYSTARY--PDKAGGSVGIQVARIARLNEYGGTIDHPGG 65 (193) T ss_pred CeeccchHHHHHHHHHHHH--------hcCC-eEEE----EEcCCCCC--CCcccccccchHHHHHhHHHcCCccccCcc Confidence 9999875433333222211 1111 1111 22221111 12334567777777 777777765444332 Q ss_pred c--------------------------------------cchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 
80 Q--------------------------------------AKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 80 ~--------------------------------------~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) + -.||...+.+++++|.+++++.+++-|- T Consensus 66 ~~~~~~~~~~g~~~~~~~~k~~~~~~~~~~~~~~v~IPaRPFlr~t~~~~~~~~~~~~~~~~~~~~~ 132 (193) T protein:vir:96 66 TRYIRDAIVRGRFVGVRFVRNDFPGETEVTKPHRITIPARPFMRYAWNLFSADRAAIQNRIAMRLAR 132 (193) T ss_pred ceeeeeccccccccccceeccCcceeeEeecceeccCCCcchhhhhHHHHHHHHHHHHHHHHHHHHh Confidence 1 1599999999999999999888887554 No 138 >protein:vir:106728 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944315;genbank:gi:38638614;genbank:GeneID:2657357 Probab=82.10 E-value=0.014 Score=30.68 Aligned_cols=90 Identities=11% Similarity=0.087 Sum_probs=44.8 Q ss_pred CCccccHHHHHHHHHHHHHHHHHHHHHHHHHh-------hhcCCcccchhhcceeecccCcEEEEEe-cCchhhhhcccc Q lcl|NC_016654. 1 MPVEFNYGIAATVRGAAKSGLHDAAEVVKQEA-------IERCPKETGALRNSAGTASDGMEAVVYF-DTPYAARQHEEV 72 (108) Q Consensus 1 m~vk~n~~~~~~v~~a~~~al~~~~~~v~~~s-------~~~vP~dtG~L~~S~~v~~~~~~g~V~y-~~pYA~~~h~~~ 72 (108) |.|+=. +|....+.+...+ ..-.|-.+|.--..+....... .-+. .+-+|.+++|++ T Consensus 1 m~v~~k-------------~L~~~~~~l~~~~v~VGi~~~a~y~d~~~~~~~~~~~~~~~~--~~g~~va~ia~~~E~G~ 65 (155) T protein:vir:10 1 MSVTRR-------------GLTLPKDRYRSMSVKAGVLAGATYPDESGKKLADGTILTKDP--RAGLPVAMIAMALNYGT 65 (155) T ss_pred CcchHH-------------HHHHHHHHHhCCeeEEeecCCCCCccccchhhhhhhhccccc--ccCCcHHHHHHHHhcCC Confidence 544322 2222222221111 1112223332222222211111 1121 233666778776 Q ss_pred CCCCCCCccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 73 GWHHVDGQAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 73 ~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) . +=+++ .||...+.+++++|.+.+.+.++..+. 
T Consensus 66 ~--~IP~R-PFlr~t~~~~~~~~~~~l~~~~~~~~~ 98 (155) T protein:vir:10 66 S--KLPAR-PFMEKTIADRSAEWIKGLTVMMTMGYD 98 (155) T ss_pred C--CCCCc-chhHHHHHHHHHHHHHHHHHHHHcCCC Confidence 4 22233 699999999999999999998887666 No 139 >protein:vir:78607 Length: 155 # NCBI annotation: BcepNY3gp06 # Family: family:all:503 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294843;genbank:gi:149882906;genbank:GeneID:5291078 Probab=81.86 E-value=0.015 Score=30.57 Aligned_cols=90 Identities=11% Similarity=0.090 Sum_probs=45.0 Q ss_pred CCccccHHHHHHHHHHHHHHHHHHHHHHHHHh-------hhcCCcccchhhcceeecccCcEEEEEe-cCchhhhhcccc Q lcl|NC_016654. 1 MPVEFNYGIAATVRGAAKSGLHDAAEVVKQEA-------IERCPKETGALRNSAGTASDGMEAVVYF-DTPYAARQHEEV 72 (108) Q Consensus 1 m~vk~n~~~~~~v~~a~~~al~~~~~~v~~~s-------~~~vP~dtG~L~~S~~v~~~~~~g~V~y-~~pYA~~~h~~~ 72 (108) |+|+=. +|....+.+...+ ..-.|..+|.--..+....... .-+. .+-+|.+++|++ T Consensus 1 m~v~~k-------------~L~~~~~~l~~~~v~VGi~~~a~y~d~~~~~~~~~~~~~~~~--~~g~~va~ia~~~E~G~ 65 (155) T protein:vir:78 1 MSVTRR-------------GLTLPKDRYRSMSVKAGVLAGATYPDESGKKLADGTILTKDP--RAGLPVAMIAMALNYGT 65 (155) T ss_pred CcchHH-------------HHHHHHHHHhCCeeEEeecCCCCCCcccchhhhhhhhccccc--ccCCcHHHHHHhhhcCC Confidence 544322 2222222221111 1112333333322222221111 1121 233566778776 Q ss_pred CCCCCCCccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 73 GWHHVDGQAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 73 ~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) . +=+++ .||...+.+++++|.+.+.+.++..+. 
T Consensus 66 ~--~IP~R-PFlr~t~~~~~~~~~~~l~~~~~~~~~ 98 (155) T protein:vir:78 66 S--KLPAR-PFMEKTITDRSAEWIKGLTVMMTMGYD 98 (155) T ss_pred C--CCCCc-chhhHHHHHHHHHHHHHHHHHHHcCCC Confidence 4 22223 699999999999999999998887666 No 140 >protein:vir:98557 Length: 149 # NCBI annotation: gp14 # Family: family:all:370 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958069;genbank:gi:41057366;genbank:GeneID:2744228 Probab=80.14 E-value=0.044 Score=28.01 Aligned_cols=102 Identities=13% Similarity=0.084 Sum_probs=54.4 Q ss_pred CCccccHHHHHHHHHHH--------HHHHHHHHHHHHHHh------------hhcCCc-----------------ccchh Q lcl|NC_016654. 1 MPVEFNYGIAATVRGAA--------KSGLHDAAEVVKQEA------------IERCPK-----------------ETGAL 43 (108) Q Consensus 1 m~vk~n~~~~~~v~~a~--------~~al~~~~~~v~~~s------------~~~vP~-----------------dtG~L 43 (108) || .|. .+...+.... ..-+..++..+.... ++..|. ++|.| T Consensus 1 m~-d~~-~l~~~L~~ll~~L~~~~~~~ll~~Ig~~l~~~t~~rf~~q~~PdG~~W~p~~~~~~~~k~~~~~~~l~~~g~l 78 (149) T protein:vir:98 1 MS-ELT-ALQERLTGLIASLSPAARRQMAADIAKKLRASQQQRIRRQQAPDGTPYAARKRQSVRSKKGRIRREMFARLRT 78 (149) T ss_pred Cc-hHH-HHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHHhccCCCCcccchhhhh Confidence 65 333 2222222221 112344444443332 123333 33566 Q ss_pred hcceeecccCcEEEE---EecCchhhhhccccCCC----CCCCc---cchhhHHHHHhHHHHHHHHHHHHHH Q lcl|NC_016654. 
44 RNSAGTASDGMEAVV---YFDTPYAARQHEEVGWH----HVDGQ---AKYLENAVNATQATVAEVIGEAIRR 105 (108) Q Consensus 44 ~~S~~v~~~~~~g~V---~y~~pYA~~~h~~~~~~----~~~~~---~k~le~a~~~~~~~i~~~i~~~ir~ 105 (108) .+|.....+..+..| |.|.+||+..|+|...+ .+..+ -.||-=- .+...+|.++|.+.|.+ T Consensus 79 ~~sl~~~~~~~~~~V~~~Gs~~~yAa~HQfG~~~r~~~~~~~~~iPaRp~LG~s-~~d~~~i~~~i~~~l~~ 149 (149) T protein:vir:98 79 NRFMKAKGSDSAAVVEFTGRVQRMARVHQYGLKDRPNRHSRDVQYAARPLLGFT-RDDEQMIEDIIIRHLGK 149 (149) T ss_pred hhhhhheecCCeeEEEecCcchHHhhHhhccccccccCCCcceeccccccCCCC-HHHHHHHHHHHHHHhhC Confidence 666666666666666 66899999999976532 11111 1355322 35567888888888888 No 141 >protein:vir:2026 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046769;genbank:gi:9630340;genbank:GeneID:1261511 Probab=80.07 E-value=0.054 Score=27.55 Aligned_cols=99 Identities=11% Similarity=0.053 Sum_probs=52.0 Q ss_pred ccH--HHHHHHHHHH--------HHHHHHHHHHHHHH--------h----hhcCCc-----------------ccchhhc Q lcl|NC_016654. 5 FNY--GIAATVRGAA--------KSGLHDAAEVVKQE--------A----IERCPK-----------------ETGALRN 45 (108) Q Consensus 5 ~n~--~~~~~v~~a~--------~~al~~~~~~v~~~--------s----~~~vP~-----------------dtG~L~~ 45 (108) ||- .+...+.... ..-+..+++.+... . +++.|. ++|.|.+ T Consensus 1 ~~~~~~l~~~L~~ll~~l~~~~~~~l~~~Ig~~l~~~~~~rf~~q~~PdG~~W~p~k~~~~~~k~g~~~~~l~~~~~l~~ 80 (150) T protein:vir:20 1 MNEFKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSVRKKTGRVKRKMFAKLITSR 80 (150) T ss_pred CchHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHHHhccCCCccccchhhhhh Confidence 331 1111111111 11123333332222 1 135443 4445556 Q ss_pred ceeecccCcEEEE----EecCchhhhhccccCCC----CCC----CccchhhHHHHHhHHHHHHHHHHHHHH Q lcl|NC_016654. 46 SAGTASDGMEAVV----YFDTPYAARQHEEVGWH----HVD----GQAKYLENAVNATQATVAEVIGEAIRR 105 (108) Q Consensus 46 S~~v~~~~~~g~V----~y~~pYA~~~h~~~~~~----~~~----~~~k~le~a~~~~~~~i~~~i~~~ir~ 105 (108) |....++...+.| |-|.+||+..|+|..-+ +|. 
++ .||-=- .+...+|.++|.+.|.| T Consensus 81 sl~~~~~~~~~~vg~~~Gs~~~yAa~HQfG~~~~~~~~~~~~~iPaR-p~LG~s-~~d~~~i~~~i~~~l~k 150 (150) T protein:vir:20 81 FLHIRASPEQASMEFYGGKSPKIASVHQFGLSEENRKDGKKIDYPAR-PLLGFT-GEDVQMIEEIILAHLER 150 (150) T ss_pred hhheeecCcEEEEEeeCCcchhhhhhhhcccccccccCCCceecccc-ccCCCC-HHHHHHHHHHHHHHHhC Confidence 6666666677777 44789999999976432 221 11 365433 35578889999999999 No 142 >protein:vir:101563 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958111;genbank:gi:41057657;genbank:GeneID:2716820 Probab=76.30 E-value=0.039 Score=28.34 Aligned_cols=90 Identities=11% Similarity=0.027 Sum_probs=44.5 Q ss_pred CCccccHHHHHHHHHHHHHHHHHHHHHHHHH-hhh------cCCcccchhhcceeecccCcEEEEEe-cCchhhhhcccc Q lcl|NC_016654. 1 MPVEFNYGIAATVRGAAKSGLHDAAEVVKQE-AIE------RCPKETGALRNSAGTASDGMEAVVYF-DTPYAARQHEEV 72 (108) Q Consensus 1 m~vk~n~~~~~~v~~a~~~al~~~~~~v~~~-s~~------~vP~dtG~L~~S~~v~~~~~~g~V~y-~~pYA~~~h~~~ 72 (108) |+|+-.. |......+... ... --|-..|+....+...-.. +.-+. .+-+|.+++|++ T Consensus 1 m~v~r~~-------------L~~~~~~l~~~~V~VGi~~~a~y~d~~g~~~~~g~~~~~~--~~~G~pva~ia~~~e~G~ 65 (155) T protein:vir:10 1 MSVTRRG-------------LTLPKDRYKSMSVKAGVLAGATYPDESGKKLADGTILKKD--PRAGLPVAMIAMALNYGT 65 (155) T ss_pred CcchHHH-------------HHHHHHHhhCCeeEEeecCCCCCCccccchhhhhhhhccc--cccCcchhhhhhhhhcCC Confidence 7665321 11111111111 000 0122222222221111000 11111 134777888887 Q ss_pred CCCCCCCccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 
73 GWHHVDGQAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 73 ~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) ..- | ++ .||...+.+++++|.+.+.+.++..+- T Consensus 66 ~~I-P-~R-PFlr~t~~~~~~~~~~~l~~~~~~~~~ 98 (155) T protein:vir:10 66 SKL-P-AR-PFMEKTIADRSAEWIKGLTVMMTMGYD 98 (155) T ss_pred CCC-C-Cc-chhHHHHHHHHHHHHHHHHHHHHcCCC Confidence 532 2 22 699999999999999999999888766 No 143 >protein:vir:6071 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878212;genbank:gi:33438911;genbank:GeneID:1457746 Probab=74.79 E-value=0.13 Score=25.41 Aligned_cols=99 Identities=11% Similarity=0.056 Sum_probs=51.1 Q ss_pred ccH--HHHHHHHHHH--------HHHHHHHHHHHHHHh------------hhcCCc-----------------ccchhhc Q lcl|NC_016654. 5 FNY--GIAATVRGAA--------KSGLHDAAEVVKQEA------------IERCPK-----------------ETGALRN 45 (108) Q Consensus 5 ~n~--~~~~~v~~a~--------~~al~~~~~~v~~~s------------~~~vP~-----------------dtG~L~~ 45 (108) ||- .+...+.... .+-+..+++.+.+.. +++.|. ++|.|.+ T Consensus 1 ~~~~~~l~~~L~~~l~~L~~~~~~~l~r~Ig~~l~~~~~~Rf~~q~~PdG~~W~p~~~~~~~~k~~~~~~~l~~~~~l~~ 80 (150) T protein:vir:60 1 MNEFKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSARKKTGRVKRKMFAKLITSR 80 (150) T ss_pred CchHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccChHHHHHhhcCCCccchhhhhhcc Confidence 331 1211111111 111233333333321 235554 3445555 Q ss_pred ceeecccCcEEEEE----ecCchhhhhccccCCC----CCC----CccchhhHHHHHhHHHHHHHHHHHHHH Q lcl|NC_016654. 46 SAGTASDGMEAVVY----FDTPYAARQHEEVGWH----HVD----GQAKYLENAVNATQATVAEVIGEAIRR 105 (108) Q Consensus 46 S~~v~~~~~~g~V~----y~~pYA~~~h~~~~~~----~~~----~~~k~le~a~~~~~~~i~~~i~~~ir~ 105 (108) |....++...+.|+ -|.+||+..|+|..-+ .+. 
++ .||-=- .+...+|.++|.++|.| T Consensus 81 sl~~~~~~~~a~vg~~~Gt~~~yAaiHQfG~~~~~~~~~~~~~iPaR-p~LG~s-~~d~~~i~~~i~~~l~r 150 (150) T protein:vir:60 81 FLHIRASPEQASMEFYGGKSPKIASVHQFGLSEENRKDGKKIDYPAR-PLLGFT-GEDVQMIEEIILAHLDR 150 (150) T ss_pred eeeeeeeCcEEEEEeeCCCchhhhhhhhccccccccCCCCceecCCc-ccCCCC-HHHHHHHHHHHHHHHhC Confidence 55556666667774 4899999999976532 111 11 365433 34567788888888888 No 144 >protein:vir:7993 Length: 108 # NCBI annotation: gp9 # Family: family:all:3937 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817347;genbank:gi:29565775;genbank:GeneID:1259013 Probab=73.36 E-value=0.016 Score=30.36 Aligned_cols=87 Identities=20% Similarity=0.156 Sum_probs=58.1 Q ss_pred CCccccH---------HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeec---ccCcEEEEEecCchhhhh Q lcl|NC_016654. 1 MPVEFNY---------GIAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTA---SDGMEAVVYFDTPYAARQ 68 (108) Q Consensus 1 m~vk~n~---------~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~---~~~~~g~V~y~~pYA~~~ 68 (108) =|-|-|+ ++.+ .--+.+++++..+.+.......-|+++|..+.|..|. .+.+.|.|+=..|||..+ T Consensus 4 gpt~kNP~~KFGvs~~d~~K--~~EVn~GvNeFMdE~~~~~K~~SPV~~G~Y~~S~~V~ers~NkGRG~~G~~~~~AH~V 81 (108) T protein:vir:79 4 GPTRKNPLAKFGVRLDDFDK--LPEVNQGVNEFMDEVVDAWKNNSPVGTGAYRDSVQVTERSTNKGRGKVGATDPQAHLV 81 (108) T ss_pred CcccccchhhhcCChhhhhh--chhhhhhHHHHHHHHHHHHhhcCCCCchhhHHHHHHHHhhhccCccccCCcchhhhhh Confidence 1223232 1211 2236788899999999999999999999999998765 467889999999999999 Q ss_pred ccccCCCCC---CCc--c-chhhHHHHH Q lcl|NC_016654. 
69 HEEVGWHHV---DGQ--A-KYLENAVNA 90 (108) Q Consensus 69 h~~~~~~~~---~~~--~-k~le~a~~~ 90 (108) ++++- |++ ++| + .|-..+..+ T Consensus 82 EFGs~-hndeyapaqktakqfggtay~d 108 (108) T protein:vir:79 82 EFGSA-HNDEYAPAQKTAKQFGGTAYGD 108 (108) T ss_pred hhhcc-ccccccchhhHHHhhcccccCC Confidence 88764 222 232 1 122222222 No 145 >protein:vir:1028 Length: 168 # NCBI annotation: Orf48 # Family: family:all:1029 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076682;genbank:gi:13095791;genbank:GeneID:920342 Probab=71.58 E-value=0.19 Score=24.49 Aligned_cols=107 Identities=18% Similarity=0.209 Sum_probs=64.8 Q ss_pred CCccccHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccc---hhhcceeecc---c---CcEEEEEecCc------h- Q lcl|NC_016654. 1 MPVEFNYGIAATVRGAAKSGLHDAAEVVKQEAIERCPKETG---ALRNSAGTAS---D---GMEAVVYFDTP------Y- 64 (108) Q Consensus 1 m~vk~n~~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG---~L~~S~~v~~---~---~~~g~V~y~~p------Y- 64 (108) |++.++..=...|.+|-.+.+......+.+. .-|.-..|| .|..|+.+.. + .|+..|||+-. | T Consensus 18 l~~~ls~eqkakITkAGAkv~~~~L~~~tk~-kHy~~k~t~~~~HLaDsI~~~~~niDg~~dG~s~VGf~~k~~~~~~~k 96 (168) T protein:vir:10 18 LSTKMSVEDKAEVTKAGAKVFEQALAYEVRN-RHYRHRDTGEDPHLADSIVMKNKNIDGVKDGQSVVGWERSTEKGTHTK 96 (168) T ss_pred hhcCCCHHHHHHHhHhhhHHHHHHHHHHhhH-hhhccCCCCccchhhhhheecccccccccCCceeecccCccccccccc Confidence 6666776555666666666666655555553 345566777 7888887553 2 35678999754 4 Q ss_pred ---hhhhccccCCC---CCC-------Cc-----cchhhHHHHHh--HHHHHHHHHHHHHHhcC Q lcl|NC_016654. 65 ---AARQHEEVGWH---HVD-------GQ-----AKYLENAVNAT--QATVAEVIGEAIRRSIA 108 (108) Q Consensus 65 ---A~~~h~~~~~~---~~~-------~~-----~k~le~a~~~~--~~~i~~~i~~~ir~~Lg 108 (108) |+++-.||.|+ +-. |+ -+|++.+-.+. 
++.|.+.-.+++++=|- T Consensus 97 a~iAr~lNDGTk~~~~~~~~~~~~~~~g~v~i~gDHFvd~~r~d~a~k~~V~~Ae~~~y~eIl~ 160 (168) T protein:vir:10 97 GYIANIINNGSRFPQFTTRSGRKYKKPGEVAVHADHFIEETRKNPIVQQGILKAEAEAMRKIIN 160 (168) T ss_pred hheeeeccccccccccccccccccccccccccccchhHHHhhhchhhhHHHHHHHHHHHHHHHH Confidence 56666677542 111 11 27999887653 56666655444444333 No 146 >protein:vir:7412 Length: 168 # NCBI annotation: hypothetical protein # Family: family:all:1029 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839929;genbank:gi:30089899;genbank:GeneID:1260686 Probab=71.17 E-value=0.2 Score=24.43 Aligned_cols=107 Identities=17% Similarity=0.182 Sum_probs=58.1 Q ss_pred CCccccHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccc---hhhcceeecc------cCcEEEEEecCch------- Q lcl|NC_016654. 1 MPVEFNYGIAATVRGAAKSGLHDAAEVVKQEAIERCPKETG---ALRNSAGTAS------DGMEAVVYFDTPY------- 64 (108) Q Consensus 1 m~vk~n~~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG---~L~~S~~v~~------~~~~g~V~y~~pY------- 64 (108) |+..+...=...|.+|-.+.+......+-+ ..-|--..|| .|..|+.+.. ..|+..|||+-.| T Consensus 18 l~~~lt~eqkakITkAGAkv~~~~L~~~t~-~kHy~~k~t~~~~HLaDsI~~~~~niDg~~dG~s~VGf~~k~~~~~~~k 96 (168) T protein:vir:74 18 LSTKMTVEDKAEVTKAGAKVFEQALAYEVR-NRHYRHRDTGEDPHLADSIVMKNKNIDGVKDGQSVVGWERSTEKGTHTK 96 (168) T ss_pred hccCCCHHHHHHHHHhhhHHHHHHHHHHhH-HhhcccCCCcccchhhhheeecccccCcccCCceeecccccccccccch Confidence 333333222334444444444443333333 2345556677 7888887653 2356789998663 Q ss_pred ---hhhhccccCCC---CCC-------Cc-----cchhhHHHHH--hHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 65 ---AARQHEEVGWH---HVD-------GQ-----AKYLENAVNA--TQATVAEVIGEAIRRSIA 108 (108) Q Consensus 65 ---A~~~h~~~~~~---~~~-------~~-----~k~le~a~~~--~~~~i~~~i~~~ir~~Lg 108 (108) |+++-.||.|+ +-. 
|+ -+|++.+-.+ .++.|.+.-.++.++=|- T Consensus 97 A~iAr~lNDGTk~~~~~~~~~~~~~~~g~v~i~gDHFvd~~r~~~~~k~~V~~Ae~~~y~eIl~ 160 (168) T protein:vir:74 97 GYIANIINNGSRFPQFTTRSGRKYKKPGEVAVHADHFIEETRMNLIVQQGILKAEAEAMRKIIN 160 (168) T ss_pred hhhhhhhcccccccccccccccccccccccccccchhHHHHHhhhhhHHHHHHHHHHHHHHHHH Confidence 56666677542 111 11 2799988666 456666655444444333 No 147 >protein:vir:79179 Length: 155 # NCBI annotation: gp39, phage virion morphogenesis protein # Family: family:all:370 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111070;genbank:gi:134288746;genbank:GeneID:4960698 Probab=69.69 E-value=0.16 Score=24.91 Aligned_cols=103 Identities=15% Similarity=0.135 Sum_probs=54.0 Q ss_pred CCccccH------HHHHHHHHHHHH-HHHHHHHHHHHHh------------hhcCCc-----------ccchhhcc---- Q lcl|NC_016654. 1 MPVEFNY------GIAATVRGAAKS-GLHDAAEVVKQEA------------IERCPK-----------ETGALRNS---- 46 (108) Q Consensus 1 m~vk~n~------~~~~~v~~a~~~-al~~~~~~v~~~s------------~~~vP~-----------dtG~L~~S---- 46 (108) |+=.|.. .+.+.+..+..+ -+..++..+.... .++.|. .+|.+.++ T Consensus 1 m~~~~~~l~~~l~~ll~~l~~~~~~~l~r~Ig~~l~~~t~~Rf~~q~~PDG~~W~prk~~~~~~~~~~~~g~~~~~~m~~ 80 (155) T protein:vir:79 1 MTDDLQALERWAGGLLAKLSPAARRQLLRELGRDLRRAQQSRVAAQRNPDGSAYEPRKVKAGGKRLREKAGRVKREAMFR 80 (155) T ss_pred CchHHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchhhhhhhhhcccCcccchhhhh Confidence 7765553 111111111111 1333444333321 245553 24544443 Q ss_pred -------eeecccCcEEEE---EecCchhhhhccccCCC----CC----CCccchhhHHHHHhHHHHHHHHHHHHHH Q lcl|NC_016654. 
47 -------AGTASDGMEAVV---YFDTPYAARQHEEVGWH----HV----DGQAKYLENAVNATQATVAEVIGEAIRR 105 (108) Q Consensus 47 -------~~v~~~~~~g~V---~y~~pYA~~~h~~~~~~----~~----~~~~k~le~a~~~~~~~i~~~i~~~ir~ 105 (108) ..+.++.....| |.|.+||+..|+|..-+ ++ +++ .||.=- .+...+|.++|.+.|.| T Consensus 81 ~l~~a~~l~~~~~~d~a~Vg~~Gs~~~yAaiHQfG~~~r~~~~~~~v~iPaR-p~LGls-~~d~~~I~~~i~~~l~r 155 (155) T protein:vir:79 81 KLRTARYLRIDVDSTGLAIGFDERLSRIARVHQEGQKAPVEPGGPLAQYPVR-VVLGFS-DADRELVRDRLLRELTR 155 (155) T ss_pred hhhhhheeeeeecCcEEEEEecCcchhhhhhhhcCCcccCCCCCcccccccc-cccCCC-HHHHHHHHHHHHHHhhC Confidence 234455566777 66799999999975422 22 122 466433 35678899999999999 No 148 >protein:vir:94069 Length: 168 # NCBI annotation: putative RNA polymerase # Family: family:all:503 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453622;genbank:gi:84662658;genbank:GeneID:5142579 Probab=69.56 E-value=0.058 Score=27.38 Aligned_cols=100 Identities=9% Similarity=0.023 Sum_probs=45.3 Q ss_pred CCccccHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecccCcEEEEEe-cCchhhhhccccCCCCCCC Q lcl|NC_016654. 1 MPVEFNYGIAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASDGMEAVVYF-DTPYAARQHEEVGWHHVDG 79 (108) Q Consensus 1 m~vk~n~~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~~~~g~V~y-~~pYA~~~h~~~~~~~~~~ 79 (108) |+.+.+..+..-.. ..+.+....-.|.-....-.|.-|..-...+.... .+.-+. .+-+|.+++|+.. +=++ T Consensus 1 ~~~~~~~g~~~~~~--~~~~l~~~~v~vG~l~~a~yp~G~~~~~~~~~~~~---~~~~g~~va~Ia~~~E~G~~--~IP~ 73 (168) T protein:vir:94 1 MTTIARKGVKMPPH--LEAQFQSGEVKAGVLSGSTYPQMTYTDQRTGKQIE---DARGGMPVAVIAQALEYGHG--QNHP 73 (168) T ss_pred CccccchhhhhhHH--HHHhhhccceeeeccccCcccccccchhhcccccc---cccccccHHHHHHHHhcCCC--CCCC Confidence 77766654322111 11111100000000001111111111111111111 111111 2456777787764 3233 Q ss_pred ccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 80 QAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 80 ~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) + .||+..+.+++++|.+.+++.++..+. 
T Consensus 74 R-PFlr~t~~~~~~~~~~~~~~~~~~~~~ 101 (168) T protein:vir:94 74 R-PFMQQTYAAQYRAWSRDLTLTLKAGAA 101 (168) T ss_pred c-hhhHHHHHHHHHHHHHHHHHHHhcCCC Confidence 3 699999999999999999998887655 No 149 >protein:vir:99546 Length: 200 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039796;genbank:gi:126011046;genbank:GeneID:4818241 Probab=66.13 E-value=0.088 Score=26.38 Aligned_cols=94 Identities=7% Similarity=-0.053 Sum_probs=45.6 Q ss_pred CCccccHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecccCcEEEEEecCch-hhhhccccCCCCCCC Q lcl|NC_016654. 1 MPVEFNYGIAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASDGMEAVVYFDTPY-AARQHEEVGWHHVDG 79 (108) Q Consensus 1 m~vk~n~~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~~~~g~V~y~~pY-A~~~h~~~~~~~~~~ 79 (108) ||+++..+ +++.+.. +.|... . .-.. .-|-+..+..- +...++.+-+.-| |.+++||...++|.+ T Consensus 7 ~~~k~~~~--~~~~~~~-~~l~~l----~-~~~v----~vGi~~~~~y~--~~~~~~dG~~va~IA~~~EfG~~i~~p~~ 72 (200) T protein:vir:99 7 KSNSVAAP--LKHFQML-KQFDAL----K-GKTV----QAGWFETDRYP--AKEGETIGPLVAKIARQLEFGGVINHPGG 72 (200) T ss_pred eeeeeecc--hHHHHHH-HHHHHh----h-CCeE----EEEEcCCCCcC--CcccccccchHHHHHhHHHcCCeeccCCC Confidence 89998863 1222111 111110 0 0000 00111100000 0111233333434 666666655554432 Q ss_pred c--------------------------------------cchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 
80 Q--------------------------------------AKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 80 ~--------------------------------------~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) + -.||...+.+++++|.+.++..+++-|- T Consensus 73 ~~~~~~~~~~g~~~g~rfv~k~~~~~~~~~~~~~v~IP~RPFlr~t~~~~~~~~~~~~~~~~~~~l~ 139 (200) T protein:vir:99 73 TKYIKDAIVDGRYVGTRFVHKSFQGEHEVTKAHQIVIPARPFMRLAWATFNKDKVKIQAQIARQLLD 139 (200) T ss_pred ccccccccccccccccccccccccceeeeeccccccCCCcchhhHHHHHHHHHHHHHHHHHHHHHHh Confidence 1 1599999999999999998888877653 No 150 >protein:vir:5703 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839862;genbank:gi:30065717;genbank:GeneID:1260611 Probab=63.19 E-value=0.16 Score=24.91 Aligned_cols=99 Identities=11% Similarity=0.056 Sum_probs=52.1 Q ss_pred ccH--HHHHHHHHHH--------HHHHHHHHHHHHHHh------------hhcCCc-----------------ccchhhc Q lcl|NC_016654. 5 FNY--GIAATVRGAA--------KSGLHDAAEVVKQEA------------IERCPK-----------------ETGALRN 45 (108) Q Consensus 5 ~n~--~~~~~v~~a~--------~~al~~~~~~v~~~s------------~~~vP~-----------------dtG~L~~ 45 (108) ||. .+...+.... ..-+..++..+.... +++.|. ++|.|.+ T Consensus 1 m~~~~~l~~~L~~~l~~L~~~~~~~l~~~Ig~~l~~~~~~rf~~q~~PdG~~W~p~k~~~~~~k~~~~~~~l~~~~~l~~ 80 (150) T protein:vir:57 1 MNEFKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSARKKTGRVKRKMFAKLITSR 80 (150) T ss_pred CchHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccChHHHHHhccCCCcccchhhhhcc Confidence 331 1222222111 112233333333321 235554 3344555 Q ss_pred ceeecccCcEEEEE----ecCchhhhhccccCCC----CCC----CccchhhHHHHHhHHHHHHHHHHHHHH Q lcl|NC_016654. 46 SAGTASDGMEAVVY----FDTPYAARQHEEVGWH----HVD----GQAKYLENAVNATQATVAEVIGEAIRR 105 (108) Q Consensus 46 S~~v~~~~~~g~V~----y~~pYA~~~h~~~~~~----~~~----~~~k~le~a~~~~~~~i~~~i~~~ir~ 105 (108) |.....+...+.|+ -+.+||+..|+|...+ ++. 
++ .||-=- .+...+|.++|.+.|.| T Consensus 81 sl~~~~~~~~a~vg~~~G~~~~yAaiHQfG~~~r~~~~~~~~~iPaR-p~LG~s-~~d~~~i~~~i~~~l~r 150 (150) T protein:vir:57 81 FLHIRASPEQASMEFYGGKSPKIASVHQFGLSEETRKDGKKIDYPAR-PLLGFT-GEDVQMIEEIILAHLDR 150 (150) T ss_pred ceeeeeeCcEEEEEeecCCchhhhhhhhccccccccCCCceeecCCc-ccCCCC-HHHHHHHHHHHHHHHhC Confidence 55566666677774 4889999999986543 111 11 465433 35567888888889888 No 151 >protein:vir:79115 Length: 148 # NCBI annotation: tail completion protein gpS # Family: family:all:370 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165266;genbank:gi:145708091;genbank:GeneID:5247126 Probab=60.73 E-value=0.22 Score=24.19 Aligned_cols=105 Identities=17% Similarity=0.108 Sum_probs=43.4 Q ss_pred CCccccHHHHHHHHH-------HHHH-HHHHHHHHHHHH------------hhhcCCc----------------ccchhh Q lcl|NC_016654. 1 MPVEFNYGIAATVRG-------AAKS-GLHDAAEVVKQE------------AIERCPK----------------ETGALR 44 (108) Q Consensus 1 m~vk~n~~~~~~v~~-------a~~~-al~~~~~~v~~~------------s~~~vP~----------------dtG~L~ 44 (108) |+= |. .+...+.. +..+ -+..++..+.+. -+++.|. +++.|. T Consensus 1 m~~-~~-~l~~~L~~ll~~l~~~~~~~l~r~Ig~~l~~st~~Rf~~q~~PDG~~W~p~s~~~~~~~g~~~~~~~~~l~~~ 78 (148) T protein:vir:79 1 MSE-SR-ELEAWLAGMLTKLDAPARRMLARAVAAELRRRQAARIAEQRNPDGSPYVPRKPQLRHRAGRIRRAMFMRLRLA 78 (148) T ss_pred Ccc-HH-HHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCcCcccchHHHhhcccccccccchhhhh Confidence 431 11 12222211 1111 123333333222 2346663 223333 Q ss_pred cceeecccCcEEEE---EecCchhhhhccccCCCCCCC---ccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 
45 NSAGTASDGMEAVV---YFDTPYAARQHEEVGWHHVDG---QAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 45 ~S~~v~~~~~~g~V---~y~~pYA~~~h~~~~~~~~~~---~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) +|-....+.....| |.|.+||+..|+|..-+ +.+ ...+=.+++-+-.++-.+.|.+.|.+.|+ T Consensus 79 ~~l~~~~~~~~~~v~~~Gt~~~yAaiHQfG~~~r-~~~~~~~v~iPaRp~LG~s~~d~~~i~~~i~~~l~ 147 (148) T protein:vir:79 79 RYMKTQADANTAVVTFAGNAQRIATVHQFGLRDR-VNKAGLTAQYPARELLGMDGVDMEHITNLLLLHLG 147 (148) T ss_pred hheeeeeeCCeeeEEeeccchhhhhhhhcCcccc-ccCCCCccccCcccccCCCHHHHHHHHHHHHHHhc Confidence 33333334444455 77899999999975422 111 11222233333333333444445555555 No 152 >protein:vir:77650 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:YP_022741;genbank:gi:47835022;genbank:GeneID:2821447 Probab=60.61 E-value=0.14 Score=25.26 Aligned_cols=90 Identities=11% Similarity=0.111 Sum_probs=44.0 Q ss_pred CCccccHHHHHHHHHHHHHHHHHHHHHHHHHhh-------hcCCcccchhhcceeecccCcEEEEEe-cCchhhhhcccc Q lcl|NC_016654. 1 MPVEFNYGIAATVRGAAKSGLHDAAEVVKQEAI-------ERCPKETGALRNSAGTASDGMEAVVYF-DTPYAARQHEEV 72 (108) Q Consensus 1 m~vk~n~~~~~~v~~a~~~al~~~~~~v~~~s~-------~~vP~dtG~L~~S~~v~~~~~~g~V~y-~~pYA~~~h~~~ 72 (108) |++.-. +|......+...+. .-.|-.+|.....+.....+ +..+. .+-+|.+++|++ T Consensus 1 m~~~r~-------------~l~~~~~~l~~~~v~VGi~~~a~y~d~~~~~~~~~~~~~~~--~~~G~pva~ia~~~e~G~ 65 (155) T protein:vir:77 1 MSVTRR-------------GLTLPKDRYRSMSVKAGVLAGATYPDESGKKLADGSILKKD--PRAGLPVAMIAMALNYGT 65 (155) T ss_pred CcchHH-------------HHHHHHHHHhcCceEEeecCCCCCccccchhhhhhhhcccc--ccccccHhhhhhhhhcCC Confidence 554322 12222222211110 01122222222211111100 11111 134788888876 Q ss_pred CCCCCCCccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 73 GWHHVDGQAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 73 ~~~~~~~~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) .. 
=+++ .||...+++++++|.+.+.+.++..+- T Consensus 66 ~~--IP~R-PFlr~t~~~~~~~~~~~l~~~~~~~~~ 98 (155) T protein:vir:77 66 SK--LPAR-PFMEKTIADRSAEWIKGLTVMMTMGYD 98 (155) T ss_pred CC--CCCC-chhhHHHHHHHHHHHHHHHHHHHccCc Confidence 42 2223 699999999999999999988887655 No 153 >protein:vir:8106 Length: 150 # NCBI annotation: gp10 # Family: family:all:3937 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817687;genbank:gi:29566118;genbank:GeneID:1259312 Probab=54.44 E-value=0.058 Score=27.37 Aligned_cols=108 Identities=19% Similarity=0.163 Sum_probs=59.6 Q ss_pred CCccccH------HHHHHHH--HHHHHHHHHHHHHHH-HHhhhcCCcccchhhcceeecc--cCcEEEEEecCchhhhhc Q lcl|NC_016654. 1 MPVEFNY------GIAATVR--GAAKSGLHDAAEVVK-QEAIERCPKETGALRNSAGTAS--DGMEAVVYFDTPYAARQH 69 (108) Q Consensus 1 m~vk~n~------~~~~~v~--~a~~~al~~~~~~v~-~~s~~~vP~dtG~L~~S~~v~~--~~~~g~V~y~~pYA~~~h 69 (108) |-=-|.- ++.+-+. --+..++++..+.++ ..+...-|+|.|+.+.|..|.- ..+.|.++=..|||..++ T Consensus 1 mgNP~~KFGvS~~e~~K~irns~EV~~GiNdFMe~~A~~~aK~~SPV~~GeY~~S~~V~~ka~NGRG~~G~~~~~AH~VE 80 (150) T protein:vir:81 1 MGNPFEKFGVSDSELAKHIRNSAEVDAGINDFMENEAIPYAKSISPVDDGEYAASWAVMKKAKNGRGVFGPKAWYAHFVE 80 (150) T ss_pred CCCchhhhcCCHHHHHHhhccchhhhhhHHHHHHhhhhhhhhccCCcccchhHHHHHHHhhcccCccccCccchhhhhhh Confidence 3221110 2222222 125667777776554 4478899999999999987653 337888999999999999 Q ss_pred cccCCCCCCCccchhhHHHH----------------HhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 70 EEVGWHHVDGQAKYLENAVN----------------ATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 70 ~~~~~~~~~~~~k~le~a~~----------------~~~~~i~~~i~~~ir~~Lg 108 (108) +++.-....+++|--.+.+. 
-.-+.-.+-|+..+-...| T Consensus 81 FGtgadkkqgrgkkgkrgkdgkrtveiddgefrrvgpdtptkaqgiaqkvashfg 135 (150) T protein:vir:81 81 FGTGADKKQGRGKKGKRGKDGKRTVEIDDGEFRRVGPDTPTKAQGIAQKVASHFG 135 (150) T ss_pred hccccccccccccccccCcccceeeeecCccceecCCCCchhhhhHHHHHHHhcc Confidence 97753222222111011000 0112223445555555555 No 154 >protein:vir:100312 Length: 152 # NCBI annotation: tail synthesis protein S # Family: family:all:370 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655481;genbank:gi:109289949;genbank:GeneID:4157355 Probab=54.20 E-value=0.31 Score=23.37 Aligned_cols=105 Identities=6% Similarity=-0.016 Sum_probs=53.4 Q ss_pred CCccccHH------HHHHHHHHHHH-HHHHHHHHHHHHh------------hhcCCcccch---------------hhcc Q lcl|NC_016654. 1 MPVEFNYG------IAATVRGAAKS-GLHDAAEVVKQEA------------IERCPKETGA---------------LRNS 46 (108) Q Consensus 1 m~vk~n~~------~~~~v~~a~~~-al~~~~~~v~~~s------------~~~vP~dtG~---------------L~~S 46 (108) |+=.|..- +.+.+..+..+ -+..+++.+.+.. .++.|...+. |+.| T Consensus 1 M~~~~~~~~~~L~~ll~~L~~~~r~~l~~~Ig~~l~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~k~~~~~~~m~~~L~~a 80 (152) T protein:vir:10 1 MSEPIEQVKTAFDSLLNNISKPRRRLMYQQIGRELARSQRRRIKAQQNPDGSAYEPRKKPKKGVKSKIKSGKMFDKITQP 80 (152) T ss_pred CchHHHHHHHHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchhhhhhhhhcccccchhHHHhhhhc Confidence 77665531 11222212111 2344555444332 2566643222 3333 Q ss_pred ee--ecccCcEEEE---EecCchhhhhccccCCCCCCCcc--------chhhHHHHHhHHHHHHHHHHHHHHh Q lcl|NC_016654. 47 AG--TASDGMEAVV---YFDTPYAARQHEEVGWHHVDGQA--------KYLENAVNATQATVAEVIGEAIRRS 106 (108) Q Consensus 47 ~~--v~~~~~~g~V---~y~~pYA~~~h~~~~~~~~~~~~--------k~le~a~~~~~~~i~~~i~~~ir~~ 106 (108) .+ ...+.....| |.|.+||+..|+|..-+...+++ .||.=- .+...+|.++|.+.|.+. 
T Consensus 81 ~~l~~~a~~~~~~Vg~~Gt~~~yAaiHQfG~~~r~~~~~~~~v~iPaRp~LG~s-~~d~~~I~~~i~~~l~~a 152 (152) T protein:vir:10 81 RFMRLRLESEGVSLGYEGGDAVIARIHQQGLIGRVRKDWDLKVKYASRELLGFT-DDDLQMIEDYMINILAGS 152 (152) T ss_pred ceeeeeecCcEEEEEecCCchhhhhhhccCccccccCCCCcceeccccccCCCC-HHHHHHHHHHHHHHHhcC Confidence 32 3344455667 55689999999976533222211 355332 345577777888888777 No 155 >protein:vir:107757 Length: 189 # NCBI annotation: gp20 # Family: family:all:503 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024868;genbank:gi:48697510;genbank:GeneID:2948378 Probab=49.77 E-value=0.43 Score=22.58 Aligned_cols=87 Identities=9% Similarity=0.021 Sum_probs=45.7 Q ss_pred CCccccHH--HHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecccCcEEEEEecCchhhhhccccCCCCCC Q lcl|NC_016654. 1 MPVEFNYG--IAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASDGMEAVVYFDTPYAARQHEEVGWHHVD 78 (108) Q Consensus 1 m~vk~n~~--~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~~~~g~V~y~~pYA~~~h~~~~~~~~~ 78 (108) |++.+... ...++.+..+ .+-.. ...| |-+..+ ....| .-.+-.|.+++||..-++=+ T Consensus 1 M~~~i~~~~~~~~~L~~~lk--------~l~~k-~V~V----Gi~~~~-----~y~dG--~~vA~Ia~~~E~G~p~~~IP 60 (189) T protein:vir:10 1 MGRVIRKQGPARVKLNAFIK--------GMNDY-SVRI----GWFSTA-----KYPDG--TPTAYVASIHEFGAPSRGIP 60 (189) T ss_pred CcceeccCcHHHHHHHHHHH--------HhhCC-eEEE----EecCCC-----CCCCc--ccHHHHHHHHHhcCcCCCCC Confidence 99988852 2222322111 11000 0011 111100 00111 11356677777776433323 Q ss_pred CccchhhHHHHHhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 
79 GQAKYLENAVNATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 79 ~~~k~le~a~~~~~~~i~~~i~~~ir~~Lg 108 (108) ++ .||...+.+++++|.+.+...++.-|- T Consensus 61 ~R-PFlr~t~~~~~~~~~~~l~~~~~~vl~ 89 (189) T protein:vir:10 61 AR-SFIRPTIAAQQAAWSQQMRFYAKQIVV 89 (189) T ss_pred Cc-hhhhHHHHHHHHHHHHHHHHHHHHHHh Confidence 33 699999999999999988888887552 No 156 >protein:vir:487 Length: 187 # NCBI annotation: hypothetical protein # Family: family:all:2152 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543094;swissprot:trembl:q8w625;genbank:gi:18249906;uniprot:Q8W625;genbank:GeneID:929690 Probab=44.87 E-value=0.57 Score=21.93 Aligned_cols=103 Identities=17% Similarity=0.224 Sum_probs=57.5 Q ss_pred CCccccHH----H-HHHHHHHHHHHHHHHHHHHHHHhhhc-----------CCc-ccchhhcceeecc------------ Q lcl|NC_016654. 1 MPVEFNYG----I-AATVRGAAKSGLHDAAEVVKQEAIER-----------CPK-ETGALRNSAGTAS------------ 51 (108) Q Consensus 1 m~vk~n~~----~-~~~v~~a~~~al~~~~~~v~~~s~~~-----------vP~-dtG~L~~S~~v~~------------ 51 (108) |-|.|++- | ...|. +|...++..+.++|..+ .|. .||.|-+|+-.-+ T Consensus 20 lHvdF~qp~~~~Fnr~riR----raF~~iGq~h~r~ArrLvm~RGrs~pge~P~~qTGrLa~SIgy~Vpkat~~RpG~mV 95 (187) T protein:vir:48 20 LHVDFKQPKELEFNRARLR----RAFVQIGRVYMRDARRLVIKRGRSGPGENPGYQTGRLARSIGYYVPKKTTRRPGLMV 95 (187) T ss_pred eeEeeecCCceeecHHHHH----HHHHHHhHHHHHHHHHHHHhcccCCCCCCCcchhhhhhhhhhhccccccCCCCcceE Confidence 44444431 1 12233 33344444444443321 344 8888888863221 Q ss_pred ----cC--cEEE----EEecCchhhhhccccCC-------CCC-------C-----CccchhhHHHHHhHHHHHHHHHHH Q lcl|NC_016654. 52 ----DG--MEAV----VYFDTPYAARQHEEVGW-------HHV-------D-----GQAKYLENAVNATQATVAEVIGEA 102 (108) Q Consensus 52 ----~~--~~g~----V~y~~pYA~~~h~~~~~-------~~~-------~-----~~~k~le~a~~~~~~~i~~~i~~~ 102 (108) +. +.|. |+- --|-.++||+..- +|- . 
.+..|++.++.+.++....++..+ T Consensus 96 kIaPNqk~G~g~r~~Pi~g-dfYPafL~YGVr~ga~~~~~~~k~~~~~~~sgwriaPR~Nym~~~L~~~~~wt~~~L~ra 174 (187) T protein:vir:48 96 KISPNQKNGQGNRRFPEGA-PYYPAFLYYGVRHSAYGMDKKDKRQKKHHSSTFRLAPRNNFMADVIERRRHWTQELLSRE 174 (187) T ss_pred EecCCcccCcccccccccc-cchhHHHHhhhhhhhhccchhhhhhhcccCCcceeccchhHHHHHHHhhHHHHHHHHHHH Confidence 11 2222 222 3588999996421 111 1 123599999999999888899999 Q ss_pred HHHhcC Q lcl|NC_016654. 103 IRRSIA 108 (108) Q Consensus 103 ir~~Lg 108 (108) |++.|= T Consensus 175 L~~sLr 180 (187) T protein:vir:48 175 LQRSLR 180 (187) T ss_pred HHHhcC Confidence 999998 No 157 >protein:vir:4460 Length: 170 # NCBI annotation: hypothetical protein # Family: family:all:2152 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700383;genbank:gi:23505455;genbank:GeneID:955662 Probab=41.85 E-value=0.8 Score=21.10 Aligned_cols=103 Identities=21% Similarity=0.295 Sum_probs=59.1 Q ss_pred CCccccHH----H-HHHHHHHHHHHHHHHHHHHHHHhhhc-----------CCc-ccchhhcceeecc------------ Q lcl|NC_016654. 1 MPVEFNYG----I-AATVRGAAKSGLHDAAEVVKQEAIER-----------CPK-ETGALRNSAGTAS------------ 51 (108) Q Consensus 1 m~vk~n~~----~-~~~v~~a~~~al~~~~~~v~~~s~~~-----------vP~-dtG~L~~S~~v~~------------ 51 (108) |-|.|.+- | ...|. +|...++..+.++|..+ -|. .||.|-+|+-.-+ T Consensus 7 lHvdF~qp~~~~Fnr~r~R----raF~~iGq~h~r~Arrlvm~RGrs~pGe~P~~~TGrLa~SIgy~Vpras~~rpG~mV 82 (170) T protein:vir:44 7 LHVDFVQPEELVFNRARMR----RAFVKIGQVHMRDARRLVMKRGRSKPGENPSYRTGQLARSIGYYVPRASKKRPGLMV 82 (170) T ss_pred eEEeeecCCceeecHHHHH----HHHHHHhHHHHHHHHHHHHHhcCCCCCCCCcchhhhhhhhhhhccccccCCCCceeE Confidence 55555541 1 12333 33334444444443311 454 8999999863221 Q ss_pred ----cC--cEEE--EEecCchhhhhccccCC-------CCCCC----------ccchhhHHHHHhHHHHHHHHHHHHHHh Q lcl|NC_016654. 52 ----DG--MEAV--VYFDTPYAARQHEEVGW-------HHVDG----------QAKYLENAVNATQATVAEVIGEAIRRS 106 (108) Q Consensus 52 ----~~--~~g~--V~y~~pYA~~~h~~~~~-------~~~~~----------~~k~le~a~~~~~~~i~~~i~~~ir~~ 106 (108) +. ++|. |. 
..-|-.++||+..- +|-.+ +..|++.++.+.++....++..+|++. T Consensus 83 kIaPNqk~G~g~r~i~-g~fYPafL~YGVr~gakr~k~hhr~a~ggsgwriaPR~Nym~~~l~~~~~wt~~~L~r~L~~s 161 (170) T protein:vir:44 83 KIAPNQKNGEGNRHIN-GAFYPAFLFYGVRRGAKRKKGHHRGASGGSGWRVEPRNNYMTEVLDKRRSWTRYVLSRELRKS 161 (170) T ss_pred EecCCCCCCCCccccc-cccchhhhhhhhhcccccchhhcccccCCCcceeccchhHHHHHHHhhHHHHHHHHHHHHHHh Confidence 11 1222 22 23488899996421 12111 236999999999998888999999999 Q ss_pred cC Q lcl|NC_016654. 107 IA 108 (108) Q Consensus 107 Lg 108 (108) |= T Consensus 162 Lr 163 (170) T protein:vir:44 162 LR 163 (170) T ss_pred cC Confidence 98 No 158 >protein:vir:1838 Length: 149 # NCBI annotation: O protein # Family: family:all:370 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052262;genbank:gi:9634069;genbank:GeneID:1262457 Probab=41.65 E-value=0.68 Score=21.50 Aligned_cols=102 Identities=10% Similarity=0.083 Sum_probs=47.8 Q ss_pred CCccccHHHHHHHHH-------HH-HHHHHHHHHHHHHHh------------hhcCCcccchh----------------- Q lcl|NC_016654. 1 MPVEFNYGIAATVRG-------AA-KSGLHDAAEVVKQEA------------IERCPKETGAL----------------- 43 (108) Q Consensus 1 m~vk~n~~~~~~v~~-------a~-~~al~~~~~~v~~~s------------~~~vP~dtG~L----------------- 43 (108) |+ .|.. +...+.. +. ..-+..++..+.... +++.|.-.+++ T Consensus 1 m~-~~~~-~~~~l~~ll~~L~~~~~~~l~r~Ig~~l~~~t~~rf~~q~~PdG~~W~p~~~~~~~~~~g~~~~~~~~~l~~ 78 (149) T protein:vir:18 1 MS-ELTA-LQERLAGLIASLSPAARRKMAAEIAKKLRTSQQQRIKRQQAPDGTPYAARKRQPVRSKKGRIKREMFAKLRT 78 (149) T ss_pred Cc-hHHH-HHHHHHHHHHhcCCchHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchhhhhhccCcccchhhhhhhh Confidence 55 3332 1121211 11 112334444333322 23555443332 Q ss_pred hcceeecccCcE---EEEEecCchhhhhccccCCCC-CCCc------cchhhHHHHHhHHHHHHHHHHHHHH Q lcl|NC_016654. 44 RNSAGTASDGME---AVVYFDTPYAARQHEEVGWHH-VDGQ------AKYLENAVNATQATVAEVIGEAIRR 105 (108) Q Consensus 44 ~~S~~v~~~~~~---g~V~y~~pYA~~~h~~~~~~~-~~~~------~k~le~a~~~~~~~i~~~i~~~ir~ 105 (108) .+|-...++... |.+|.|.+||+..|+|...+. +.+. 
=.||-=- .+...+|.++|.+.|.| T Consensus 79 ~~~l~~~~~~~~~~v~~~Gtn~~yAaiHQfG~~~r~~~~~~~v~iPaRp~LG~s-~~d~~~I~~~i~~~l~~ 149 (149) T protein:vir:18 79 SRFMKAKGSDSAAVVEFTGKVQRMARVHQYGLKDRPNRNSRDVQYEARPLLGFT-RDDEQMIEDVIISHLGK 149 (149) T ss_pred hhhhheeecCceeEEEecccchhhhhhhhccccccccCCCccccccccccCCCC-HHHHHHHHHHHHHHHhC Confidence 222222222223 345778999999999865431 1111 1355432 34567888888888888 No 159 >protein:vir:7449 Length: 123 # NCBI annotation: gp26 # Family: family:all:2713 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818564;genbank:gi:29567001;genbank:GeneID:1260238 Probab=36.56 E-value=1.2 Score=20.22 Aligned_cols=102 Identities=13% Similarity=0.190 Sum_probs=57.7 Q ss_pred CCccccH-HHHH---HHHHHHHHHHHHHHH----HHHHHhhhcCCc--ccchhhcceeeccc-----CcEEEEEecCchh Q lcl|NC_016654. 1 MPVEFNY-GIAA---TVRGAAKSGLHDAAE----VVKQEAIERCPK--ETGALRNSAGTASD-----GMEAVVYFDTPYA 65 (108) Q Consensus 1 m~vk~n~-~~~~---~v~~a~~~al~~~~~----~v~~~s~~~vP~--dtG~L~~S~~v~~~-----~~~g~V~y~~pYA 65 (108) |.++||- .+.. .+.+..+.++...++ .+..+++.-+|= .||.=|.+.+-.+. ..+-.++++++|- T Consensus 4 ~~f~~d~~~l~~~i~~~~~k~~~~~~~~~d~~a~~le~~aK~nApW~DRTg~ARqgl~~~~~~~g~~~~~Iylsh~veYG 83 (123) T protein:vir:74 4 VTFEYDAQELRTNIRNLDRRMESAVDALMDYEAAYATGQLKMRAPWTDRTGAARSGLLAVANKLGPGSHELIMSYSVHYG 83 (123) T ss_pred eEEEecHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccchhhhhhhccccccCCCceEEEEEecCeeec Confidence 8888874 2222 334445555544443 566778888886 67765554433322 2345678899999 Q ss_pred hhhccccCCCCCCCccchhhHHHHHhHHHHHHHHHH---HHHHhc Q lcl|NC_016654. 
66 ARQHEEVGWHHVDGQAKYLENAVNATQATVAEVIGE---AIRRSI 107 (108) Q Consensus 66 ~~~h~~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~---~ir~~L 107 (108) .|++...+ ++..=|.+.+...-.++.+-+.+ +|++.- T Consensus 84 ~~LEla~~-----~kyaIi~Ptv~~~~~~im~g~~~ll~~l~~~~ 123 (123) T protein:vir:74 84 IWLEIANS-----GQYAVIGPFLPVMGRKLMHDLEHLIDRLERAQ 123 (123) T ss_pred ceeeecCC-----CCceeecchHHHHhHHHHHHHHHHHHHhhccC Confidence 99986543 33334566655555555543332 333333 No 160 >protein:vir:1087 Length: 161 # NCBI annotation: Orf46 # Family: family:all:1029 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076741;genbank:gi:13095851;genbank:GeneID:920400 Probab=30.70 E-value=1.6 Score=19.53 Aligned_cols=107 Identities=14% Similarity=0.117 Sum_probs=54.2 Q ss_pred CC--------------------ccccHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccc---hhhcceeecc---c-- Q lcl|NC_016654. 1 MP--------------------VEFNYGIAATVRGAAKSGLHDAAEVVKQEAIERCPKETG---ALRNSAGTAS---D-- 52 (108) Q Consensus 1 m~--------------------vk~n~~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG---~L~~S~~v~~---~-- 52 (108) |. ..+...=...+.+|-.+.+......+-+ ..-|-+..|| .|..|..+.. + T Consensus 2 ~~~~~~fdd~L~~~~~~v~klv~~lt~e~kakIT~AGAkv~a~~L~~~T~-~kHy~~~kt~k~~HLADsI~~~~~niDg~ 80 (161) T protein:vir:10 2 MEEKQLFEDIMNGIIFQAESVSTSLTVEDKAKITKAGANAFAIGLEKVTK-DKHYRIRKTGENPHLADSILVQNTNIDGI 80 (161) T ss_pred cchhHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHhHHHHHHHHHHHhh-hhcCcCCCCCCcchhhhheeecccccCcc Confidence 11 1111111222333333333333333222 2456677777 8999987653 2 Q ss_pred -CcEEEEEecCchhhhhcc---ccCC---C-------CCCC----ccchhhHHHH--HhHHHHHHHHHHHHHHhcC Q lcl|NC_016654. 53 -GMEAVVYFDTPYAARQHE---EVGW---H-------HVDG----QAKYLENAVN--ATQATVAEVIGEAIRRSIA 108 (108) Q Consensus 53 -~~~g~V~y~~pYA~~~h~---~~~~---~-------~~~~----~~k~le~a~~--~~~~~i~~~i~~~ir~~Lg 108 (108) .|+..|||+-+||.--|. |+.| + ++.- .-+|++.+-. 
..++.+.+.-.+++++=|- T Consensus 81 ~dG~StVGw~~kka~ia~~indGtr~~~~~~~~~~~~n~Gt~~i~gDHFvd~~r~~~~~k~aV~~Ae~~~y~eil~ 156 (161) T protein:vir:10 81 KDGNSTVGWDYTKSRVGHLIENGTRFPMYSKKGTKYRKGGQVAITSDPFVSTYRDSMEAQVAMFSAEAEVFSEILK 156 (161) T ss_pred cCCceeccccCchhhhhhhhcccchhhhhhcccccccCCcceeecCcchhHHHHhhhhhHHHHHHHHHHHHHHHHH Confidence 356789998777544443 5432 1 2211 1379998876 3556666655555544443 No 161 >protein:vir:102608 Length: 108 # NCBI annotation: gp9 # Family: family:all:3937 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655005;genbank:gi:109392195;genbank:GeneID:4157230 Probab=29.83 E-value=0.42 Score=22.66 Aligned_cols=86 Identities=19% Similarity=0.177 Sum_probs=54.4 Q ss_pred CCccccHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeec---ccCcEEEEEecCchhhhhccccCCCCC Q lcl|NC_016654. 1 MPVEFNYGIAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTA---SDGMEAVVYFDTPYAARQHEEVGWHHV 77 (108) Q Consensus 1 m~vk~n~~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~---~~~~~g~V~y~~pYA~~~h~~~~~~~~ 77 (108) .-|+|+ .+.+ .--+.++++...+.|...=.+--|+.+|..+.|..|. ++.+.|.|+-.-|-|.-+.++.- +++ T Consensus 14 fgi~ld-dfdk--lpevnqgvnef~dev~aawk~nspv~~g~yrdsvqvterstnkgrgkvgatdpqahlvefgs~-hnd 89 (108) T protein:vir:10 14 FGVRLD-DFDK--LPEVNQGVNEFIDEVVAAWKNNSPVGTGAYRDSVQVTERSTNKGRGKVGATDPQAHLVEFGSA-HND 89 (108) T ss_pred hccchh-hhhc--cchhhhhHHHHHHHHHHhhhcCCCccccccccceeecccccccccccccCcchhhhhhhhhcc-ccc Confidence 223332 1111 1125678888888888888899999999999999875 46788999998888888876542 222 Q ss_pred ---CCc--c-chhhHHHHH Q lcl|NC_016654. 
78 ---DGQ--A-KYLENAVNA 90 (108) Q Consensus 78 ---~~~--~-k~le~a~~~ 90 (108) ++| + .|-..+..+ T Consensus 90 eyapaqktakqfggtay~d 108 (108) T protein:vir:10 90 EYAPAQKTAKQFGGTAYGD 108 (108) T ss_pred cccchhhhHHhhcccccCC Confidence 222 1 122222222 No 162 >protein:vir:105825 Length: 108 # NCBI annotation: gp9 # Family: family:all:3937 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655770;genbank:gi:109522093;genbank:GeneID:4157633 Probab=29.83 E-value=0.42 Score=22.66 Aligned_cols=86 Identities=19% Similarity=0.177 Sum_probs=54.4 Q ss_pred CCccccHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeec---ccCcEEEEEecCchhhhhccccCCCCC Q lcl|NC_016654. 1 MPVEFNYGIAATVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTA---SDGMEAVVYFDTPYAARQHEEVGWHHV 77 (108) Q Consensus 1 m~vk~n~~~~~~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~---~~~~~g~V~y~~pYA~~~h~~~~~~~~ 77 (108) .-|+|+ .+.+ .--+.++++...+.|...=.+--|+.+|..+.|..|. ++.+.|.|+-.-|-|.-+.++.- +++ T Consensus 14 fgi~ld-dfdk--lpevnqgvnef~dev~aawk~nspv~~g~yrdsvqvterstnkgrgkvgatdpqahlvefgs~-hnd 89 (108) T protein:vir:10 14 FGVRLD-DFDK--LPEVNQGVNEFIDEVVAAWKNNSPVGTGAYRDSVQVTERSTNKGRGKVGATDPQAHLVEFGSA-HND 89 (108) T ss_pred hccchh-hhhc--cchhhhhHHHHHHHHHHhhhcCCCccccccccceeecccccccccccccCcchhhhhhhhhcc-ccc Confidence 223332 1111 1125678888888888888899999999999999875 46788999998888888876542 222 Q ss_pred ---CCc--c-chhhHHHHH Q lcl|NC_016654. 
78 ---DGQ--A-KYLENAVNA 90 (108) Q Consensus 78 ---~~~--~-k~le~a~~~ 90 (108) ++| + .|-..+..+ T Consensus 90 eyapaqktakqfggtay~d 108 (108) T protein:vir:10 90 EYAPAQKTAKQFGGTAYGD 108 (108) T ss_pred cccchhhhHHhhcccccCC Confidence 222 1 122222222 No 163 >protein:vir:78163 Length: 92 # NCBI annotation: hypothetical protein # Family: family:all:29889 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294805;genbank:gi:149882826;genbank:GeneID:5309191 Probab=26.19 E-value=1.2 Score=20.15 Aligned_cols=84 Identities=20% Similarity=0.200 Sum_probs=47.3 Q ss_pred CCccccHH---HHHHHHHHHHHHHH-HHHHHHHHHhhhcCCcccchhhcceeecccC----cEEEEEecCchhhhhcccc Q lcl|NC_016654. 1 MPVEFNYG---IAATVRGAAKSGLH-DAAEVVKQEAIERCPKETGALRNSAGTASDG----MEAVVYFDTPYAARQHEEV 72 (108) Q Consensus 1 m~vk~n~~---~~~~v~~a~~~al~-~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~~----~~g~V~y~~pYA~~~h~~~ 72 (108) |.=-|.+| |.+.+..+..+||- -+++..+.++..-.|+|||..+..-.++-.. .+-.|+-+-|--.-++-.+ T Consensus 1 madaftpNp~~FDqIl~s~~VrALt~gaAe~aLa~AKAsAPVDTGAYRDGL~iE~~q~~~RtT~MVVG~D~KTlLvESrT 80 (92) T protein:vir:78 1 MADAFTPNPTWFDQIMRTPKVRALVDGVAEETLADAKASAPVDTGAYRDGLHIEHRQGRSRETAMVVGSDEKTLLIESRT 80 (92) T ss_pred CCCccCCChhHHHHhhcccchhhhhhhhhhhhhhhhcccCcccccccccccchhhhhccccceeEEeecCcceeeeeccc Confidence 77767654 67778888888884 5678899999999999999999876655432 2233332212111111111 Q ss_pred CCCCCCCccchhhHHHHHhHH Q lcl|NC_016654. 
73 GWHHVDGQAKYLENAVNATQA 93 (108) Q Consensus 73 ~~~~~~~~~k~le~a~~~~~~ 93 (108) + =|-+++...++ T Consensus 81 G---------NLakalk~~rs 92 (92) T protein:vir:78 81 G---------NLARSVKRRRS 92 (92) T ss_pred c---------hHHHHHhhhcC Confidence 1 01111111111 No 164 >protein:vir:4096 Length: 140 # NCBI annotation: Gp9 protein # Family: family:all:28682 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510990;swissprot:trembl:q8w600;genbank:gi:17488512;uniprot:Q8W600;genbank:GeneID:1260318 Probab=23.51 E-value=1.6 Score=19.41 Aligned_cols=107 Identities=12% Similarity=0.199 Sum_probs=52.7 Q ss_pred CCccc----cH--HHHHHHHH---HHHHHHHHH----HHHHHHH-hhhcCCccc---chhhcceeeccc-Cc---EEEEE Q lcl|NC_016654. 1 MPVEF----NY--GIAATVRG---AAKSGLHDA----AEVVKQE-AIERCPKET---GALRNSAGTASD-GM---EAVVY 59 (108) Q Consensus 1 m~vk~----n~--~~~~~v~~---a~~~al~~~----~~~v~~~-s~~~vP~dt---G~L~~S~~v~~~-~~---~g~V~ 59 (108) |+-+| +. .....+++ .++++++.+ +--+..+ ...++|+-. |-+++-.....+ .. ..-.+ T Consensus 1 m~~~~sld~s~~e~L~~~i~r~P~ksE~~IN~~L~tkg~~~~~~~I~~~iPvS~~~k~~~RnK~HAK~s~pl~~~~~NLg 80 (140) T protein:vir:40 1 MCAKWSLEFSDVERLSNLISQIPNKSEAIINKTLETKAVPLVKLNIEKRINLSKNWKGQLLNKNHAQSSGPFNVKMGNLG 80 (140) T ss_pred CCcceecchhhHHHHHHHHHhccchHHHHHHHHHHhhhhHHHHhhhhhccCcCccchhhhccccchhhhhhhhhhhhhcc Confidence 65444 32 12223322 234444333 2233333 357899964 344443332221 11 11233 Q ss_pred e----cCchhhhhcc--ccCCCCCCCccchhhHHHHHhHHHHHHHHHH----HHHHhcC Q lcl|NC_016654. 
60 F----DTPYAARQHE--EVGWHHVDGQAKYLENAVNATQATVAEVIGE----AIRRSIA 108 (108) Q Consensus 60 y----~~pYA~~~h~--~~~~~~~~~~~k~le~a~~~~~~~i~~~i~~----~ir~~Lg 108 (108) | -..|..-+++ |.+-+++.+ .+|||+.+...-+.|.+.+-+ +|.+-|| T Consensus 81 f~i~~k~kf~YLvfPD~G~G~sn~~~-q~FmerGl~~~t~~i~E~L~~~l~k~in~~Lg 138 (140) T protein:vir:40 81 FELLTKPKFNYLIFPDQGIGKHNKTK-QDFMQLGVEESSQEIVEMLEQAVFKEINDTLG 138 (140) T ss_pred eeEeecCcccccccccccCCCCCcch-HHHHHhccccchhHHHHHHHHHHHHHHHHhhc Confidence 3 2234433444 555566544 479999999887777665554 5566677 No 165 >protein:vir:6154 Length: 119 # NCBI annotation: hypothetical protein # Family: family:all:10918 # MgeID: mge:127 # MgeName: phBC6A51 # Cross-refs: genbank:acc:NP_852533;genbank:gi:31415793;genbank:GeneID:1489145 Probab=20.22 E-value=0.42 Score=22.67 Aligned_cols=101 Identities=14% Similarity=0.182 Sum_probs=46.4 Q ss_pred CCccccH----HHHH-----HHHHHHHHHHHHHHHHHHHHhhhcCCcccchhhcceeecccC-----cEEEEEecCchhh Q lcl|NC_016654. 1 MPVEFNY----GIAA-----TVRGAAKSGLHDAAEVVKQEAIERCPKETGALRNSAGTASDG-----MEAVVYFDTPYAA 66 (108) Q Consensus 1 m~vk~n~----~~~~-----~v~~a~~~al~~~~~~v~~~s~~~vP~dtG~L~~S~~v~~~~-----~~g~V~y~~pYA~ 66 (108) |-+++-- ++.+ +-+--.++.++.-...-+.+++..+|+.-|-|..|.-..+.- .-|.-+...-||. T Consensus 1 mrirvvvkgksnvlkahnpnryktpieqtvekhtrlqanqasnrapilhgplsesipasvkmvvgariigtygspliyaa 80 (119) T protein:vir:61 1 MRIRVVVKGKSNVLKAHNPNRYKTPIEQTVEKHTRLQANQASNRAPILHGPLSESIPASVKMVVGARIIGTYGSPLIYAA 80 (119) T ss_pred CeeEEEeecccceecccCCccccccHHHHHHHhhhhhcccccccCceeecccccccchhhhhhhhhhhcccccchHHHHH Confidence 5554321 1111 111112333333333444578899999999999997655532 2344455677999 Q ss_pred hhccccCCCCCCCccchh-hHHHHHhHHH---HHHHHHHHHHHh Q lcl|NC_016654. 67 RQHEEVGWHHVDGQAKYL-ENAVNATQAT---VAEVIGEAIRRS 106 (108) Q Consensus 67 ~~h~~~~~~~~~~~~k~l-e~a~~~~~~~---i~~~i~~~ir~~ 106 (108) -|.+ .|..-++ |+ ..+++++++- |-+.+++--+.. 
T Consensus 81 vqef----thktkkg-fmrktafegeqpfvedisktvqrvakgh 119 (119) T protein:vir:61 81 VQEF----THKTKKG-FMRKTAFEGEQPFVEDISKTVQRVAKGH 119 (119) T ss_pred HHHH----hhhhhhh-hhhhhcccCCcchHHHHHHHHHHhhcCC Confidence 8875 2221111 11 1122222222 222222222222 Done!