Query lcl|NC_012756.1_cdsid_YP_002925173.1 [gene=PH10_gp40] [protein=hypothetical protein] [protein_id=YP_002925173.1] [location=18999..19346] Match_columns 115 No_of_seqs 90 out of 95 Neff 5.4 Searched_HMMs 1612 Date Thu Nov 7 13:00:50 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_40 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_40_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:95372 Length: 124 100.0 1.6E-49 1E-52 288.1 12.7 115 1-115 1-123 (124) 2 protein:vir:80116 Length: 127 100.0 3.3E-49 2E-52 286.4 12.1 115 1-115 1-123 (127) 3 protein:vir:966 Length: 123 # 100.0 1.5E-46 9.1E-50 271.9 12.6 115 1-115 1-122 (123) 4 protein:vir:81147 Length: 126 100.0 3.4E-45 2.1E-48 264.4 12.4 115 1-115 1-123 (126) 5 protein:vir:102963 Length: 163 99.8 3.5E-24 2.2E-27 149.2 10.1 114 1-115 1-151 (163) 6 protein:vir:105467 Length: 144 99.8 1.4E-23 8.8E-27 145.8 11.1 114 1-115 1-137 (144) 7 protein:vir:79034 Length: 141 99.8 8.2E-23 5.1E-26 141.7 11.6 113 1-115 1-132 (141) 8 protein:vir:9930 Length: 108 # 99.8 9.7E-22 6E-25 135.8 10.8 107 3-115 1-107 (108) 9 protein:vir:3617 Length: 112 # 99.7 6.3E-21 3.9E-24 131.3 10.4 107 1-115 1-112 (112) 10 protein:vir:95789 Length: 114 99.7 6E-21 3.7E-24 131.5 9.8 108 1-115 1-110 (114) 11 protein:vir:94538 Length: 125 99.7 1.3E-20 8.2E-24 129.6 9.7 109 1-115 1-119 (125) 12 protein:vir:743 Length: 108 # 99.7 5.1E-20 3.2E-23 126.4 11.6 107 1-115 1-108 (108) 13 protein:vir:2740 Length: 114 # 99.7 6.9E-20 4.3E-23 125.6 8.9 108 1-115 1-113 (114) 14 protein:vir:4906 Length: 114 # 99.7 6.9E-20 4.3E-23 125.6 8.9 108 1-115 1-113 (114) 15 protein:vir:98409 Length: 108 99.7 5.3E-19 3.3E-22 120.8 11.4 107 1-115 1-108 (108) 16 protein:vir:97144 Length: 115 99.6 7.4E-19 4.6E-22 120.0 11.2 109 1-115 1-115 (115) 17 protein:vir:96225 Length: 115 99.6 7.4E-19 4.6E-22 120.0 11.2 109 1-115 1-115 (115) 18 protein:vir:103917 Length: 115 99.6 7.4E-19 4.6E-22 120.0 11.2 109 1-115 1-115 (115) 19 protein:vir:78858 Length: 115 99.6 7.4E-19 4.6E-22 120.0 11.2 109 1-115 1-115 (115) 20 protein:vir:9312 Length: 115 # 99.6 7.4E-19 4.6E-22 120.0 11.2 109 1-115 1-115 (115) 21 protein:vir:96358 Length: 115 99.6 7.4E-19 4.6E-22 120.0 11.2 109 1-115 1-115 (115) 22 protein:vir:106623 Length: 115 99.6 7.9E-19 4.9E-22 119.8 11.3 109 1-115 1-115 (115) 23 protein:vir:5978 Length: 144 # 99.6 8.9E-19 5.5E-22 119.6 11.3 114 1-115 4-144 (144) 24 protein:vir:102338 Length: 116 99.6 3.1E-19 1.9E-22 122.0 7.7 95 20-115 1-112 (116) 25 protein:vir:99744 Length: 115 99.6 1.5E-18 9.3E-22 118.3 11.1 109 1-115 1-115 (115) 26 protein:vir:95894 Length: 137 99.6 2.8E-18 1.7E-21 116.9 11.2 110 1-111 1-137 (137) 27 protein:vir:97088 Length: 157 99.6 2.1E-18 1.3E-21 117.5 10.2 115 1-115 1-146 (157) 28 protein:vir:93738 Length: 137 99.6 3.6E-18 2.2E-21 116.2 11.2 110 1-111 1-137 (137) 29 protein:vir:94490 Length: 137 99.6 3.6E-18 2.2E-21 116.2 11.2 110 1-111 1-137 (137) 30 protein:vir:97427 Length: 137 99.6 3.6E-18 2.2E-21 116.2 11.2 110 1-111 1-137 (137) 31 protein:vir:94796 Length: 137 99.6 4.6E-18 2.9E-21 115.6 11.3 110 1-111 1-137 (137) 32 protein:vir:107099 Length: 137 99.6 5E-18 3.1E-21 115.4 11.0 110 1-111 1-137 (137) 33 protein:vir:105330 Length: 137 99.6 6.3E-18 3.9E-21 114.9 10.9 110 1-111 1-137 (137) 34 protein:vir:94108 Length: 149 99.6 8.9E-18 5.5E-21 114.1 10.9 110 1-111 13-149 (149) 35 protein:vir:96121 Length: 137 99.6 2.4E-17 1.5E-20 111.7 11.2 110 1-111 1-137 (137) 36 protein:vir:105916 Length: 149 99.6 2.3E-17 1.4E-20 111.8 10.9 110 1-111 13-149 (149) 37 protein:vir:96486 Length: 112 99.5 6.1E-17 3.8E-20 109.5 9.7 107 1-114 1-112 (112) 38 protein:vir:94654 Length: 142 99.5 2.6E-16 1.6E-19 106.0 10.1 113 1-114 1-142 (142) 39 protein:vir:96829 Length: 135 99.5 4.5E-16 2.8E-19 104.7 10.9 110 1-111 1-135 (135) 40 protein:vir:194 Length: 149 # 99.4 1.3E-15 7.9E-19 102.2 9.8 110 1-115 4-140 (149) 41 protein:vir:80362 Length: 140 99.4 1.4E-15 8.6E-19 102.1 9.7 110 1-115 1-129 (140) 42 protein:vir:1437 Length: 140 # 99.4 1.6E-15 9.9E-19 101.7 9.7 110 1-115 1-129 (140) 43 protein:vir:105007 Length: 146 99.4 1.7E-15 1.1E-18 101.5 9.7 110 1-115 1-139 (146) 44 protein:vir:102875 Length: 146 99.4 1.7E-15 1.1E-18 101.5 9.7 110 1-115 1-139 (146) 45 protein:vir:107568 Length: 146 99.4 1.7E-15 1.1E-18 101.5 9.7 110 1-115 1-139 (146) 46 protein:vir:102085 Length: 146 99.4 1.7E-15 1.1E-18 101.5 9.7 110 1-115 1-139 (146) 47 protein:vir:100075 Length: 140 99.4 1.7E-15 1.1E-18 101.5 9.7 110 1-115 1-129 (140) 48 protein:vir:1273 Length: 127 # 99.4 2.1E-15 1.3E-18 101.1 9.9 110 1-115 1-123 (127) 49 protein:vir:101594 Length: 173 99.4 4E-15 2.5E-18 99.5 10.7 114 1-115 1-167 (173) 50 protein:vir:100243 Length: 140 99.4 4.1E-15 2.5E-18 99.5 10.3 110 1-115 1-129 (140) 51 protein:vir:93617 Length: 148 99.3 2.2E-14 1.4E-17 95.4 9.4 110 1-115 4-139 (148) 52 protein:vir:9708 Length: 125 # 99.3 4.4E-14 2.7E-17 93.8 10.9 109 1-115 1-120 (125) 53 protein:vir:1243 Length: 116 # 99.3 1.2E-14 7.7E-18 96.8 7.8 91 20-111 1-116 (116) 54 protein:vir:97327 Length: 116 99.3 1.2E-14 7.7E-18 96.8 7.8 91 20-111 1-116 (116) 55 protein:vir:95062 Length: 116 99.3 1.4E-14 8.5E-18 96.6 7.8 91 20-111 1-116 (116) 56 protein:vir:3873 Length: 128 # 99.3 4.5E-14 2.8E-17 93.8 10.4 110 1-115 1-124 (128) 57 protein:vir:78077 Length: 141 99.3 8.6E-14 5.3E-17 92.2 11.7 113 1-115 1-138 (141) 58 protein:vir:105089 Length: 133 99.2 5E-14 3.1E-17 93.5 9.7 110 1-115 2-131 (133) 59 protein:vir:98342 Length: 125 99.2 6.1E-14 3.8E-17 93.1 9.6 110 1-115 1-125 (125) 60 protein:vir:4704 Length: 125 # 99.2 6.1E-14 3.8E-17 93.1 9.6 110 1-115 1-125 (125) 61 protein:vir:9414 Length: 125 # 99.2 6.1E-14 3.8E-17 93.1 9.6 110 1-115 1-125 (125) 62 protein:vir:79988 Length: 125 99.2 6.1E-14 3.8E-17 93.1 9.6 110 1-115 1-125 (125) 63 protein:vir:81106 Length: 125 99.2 6.1E-14 3.8E-17 93.1 9.6 110 1-115 1-125 (125) 64 protein:vir:1891 Length: 179 # 99.2 8.7E-14 5.4E-17 92.2 9.0 110 1-115 1-162 (179) 65 protein:vir:106570 Length: 182 99.2 3E-13 1.9E-16 89.2 11.9 114 1-115 2-173 (182) 66 protein:vir:5745 Length: 135 # 99.2 1.7E-13 1E-16 90.6 9.7 110 1-115 1-127 (135) 67 protein:vir:4347 Length: 164 # 99.2 1.2E-13 7.7E-17 91.3 8.5 110 1-115 1-147 (164) 68 protein:vir:1386 Length: 149 # 99.1 3.4E-13 2.1E-16 88.9 9.1 110 1-115 1-144 (149) 69 protein:vir:107703 Length: 147 99.1 1.5E-12 9.6E-16 85.3 9.8 108 1-115 1-142 (147) 70 protein:vir:103280 Length: 142 99.0 1.6E-12 9.7E-16 85.3 9.2 108 1-115 1-140 (142) 71 protein:vir:10367 Length: 119 99.0 2.1E-13 1.3E-16 90.1 3.2 81 35-115 1-116 (119) 72 protein:vir:81067 Length: 119 99.0 2.4E-13 1.5E-16 89.8 3.2 81 35-115 1-116 (119) 73 protein:vir:79638 Length: 146 99.0 5.1E-12 3.2E-15 82.5 10.0 109 1-115 1-142 (146) 74 protein:vir:104347 Length: 145 99.0 2.4E-12 1.5E-15 84.2 7.9 108 1-114 1-145 (145) 75 protein:vir:8669 Length: 142 # 99.0 1.7E-12 1.1E-15 85.1 6.1 110 1-112 2-142 (142) 76 protein:vir:99101 Length: 142 99.0 1.7E-12 1.1E-15 85.1 6.1 110 1-112 2-142 (142) 77 protein:vir:9879 Length: 127 # 98.9 8.8E-12 5.5E-15 81.2 9.1 109 1-115 1-126 (127) 78 protein:vir:95157 Length: 144 98.8 2.7E-11 1.7E-14 78.5 9.0 106 1-115 1-144 (144) 79 protein:vir:94994 Length: 131 98.8 4.4E-11 2.7E-14 77.4 9.0 102 1-114 1-131 (131) 80 protein:vir:97190 Length: 148 98.7 7.4E-11 4.6E-14 76.1 8.1 103 1-115 1-143 (148) 81 protein:vir:78380 Length: 131 98.7 1.3E-10 7.9E-14 74.8 8.9 102 1-114 1-131 (131) 82 protein:vir:106041 Length: 137 98.7 2.9E-11 1.8E-14 78.3 4.9 106 1-113 1-137 (137) 83 protein:vir:80425 Length: 134 98.7 9.8E-11 6.1E-14 75.5 7.0 102 1-115 1-134 (134) 84 protein:vir:102154 Length: 119 98.6 1.8E-10 1.1E-13 74.0 7.5 109 1-115 1-115 (119) 85 protein:vir:94944 Length: 121 98.6 2.1E-10 1.3E-13 73.7 6.4 93 1-103 2-121 (121) 86 protein:vir:97982 Length: 140 98.5 3.8E-10 2.3E-13 72.2 5.6 111 1-115 1-139 (140) 87 protein:vir:107545 Length: 140 98.5 3.8E-10 2.3E-13 72.2 5.6 111 1-115 1-139 (140) 88 protein:vir:102441 Length: 137 98.4 5.8E-10 3.6E-13 71.2 3.6 104 1-110 3-137 (137) 89 protein:vir:96774 Length: 152 98.3 4.5E-09 2.8E-12 66.4 7.9 99 1-115 11-150 (152) 90 protein:vir:100652 Length: 134 98.3 1E-08 6.3E-12 64.4 8.7 115 1-115 1-132 (134) 91 protein:vir:4956 Length: 153 # 98.2 1.5E-08 9.3E-12 63.5 8.5 109 1-115 1-135 (153) 92 protein:vir:99528 Length: 92 # 98.2 1.2E-08 7.6E-12 64.0 7.1 82 1-91 1-92 (92) 93 protein:vir:100887 Length: 139 98.1 4.2E-08 2.6E-11 61.0 9.0 109 1-115 3-131 (139) 94 protein:vir:7993 Length: 108 # 98.1 3.7E-09 2.3E-12 66.8 3.0 87 1-92 1-108 (108) 95 protein:vir:100223 Length: 139 98.1 4.2E-08 2.6E-11 61.0 8.4 109 1-115 3-131 (139) 96 protein:vir:9513 Length: 134 # 98.1 4.9E-08 3E-11 60.7 8.4 115 1-115 1-132 (134) 97 protein:vir:101302 Length: 134 98.1 4.9E-08 3E-11 60.7 8.4 115 1-115 1-132 (134) 98 protein:vir:9647 Length: 132 # 97.8 1.9E-07 1.2E-10 57.4 8.2 111 1-115 1-131 (132) 99 protein:vir:78335 Length: 133 97.8 2E-07 1.3E-10 57.3 8.0 114 1-115 1-130 (133) 100 protein:vir:8106 Length: 150 # 97.7 4.5E-08 2.8E-11 60.8 3.1 110 1-115 1-143 (150) 101 protein:vir:106506 Length: 137 97.7 1.9E-07 1.2E-10 57.5 6.3 103 1-111 4-137 (137) 102 protein:vir:78644 Length: 133 97.7 5.8E-07 3.6E-10 54.8 8.2 114 1-115 1-132 (133) 103 protein:vir:9363 Length: 133 # 97.7 5.8E-07 3.6E-10 54.8 8.2 114 1-115 1-132 (133) 104 protein:vir:96973 Length: 133 97.7 5.8E-07 3.6E-10 54.8 8.2 114 1-115 1-132 (133) 105 protein:vir:94419 Length: 133 97.7 5.8E-07 3.6E-10 54.8 8.2 114 1-115 1-132 (133) 106 protein:vir:4859 Length: 140 # 97.7 8E-07 5E-10 54.0 8.9 109 1-115 1-135 (140) 107 protein:vir:93898 Length: 133 97.6 7.3E-07 4.5E-10 54.2 8.5 114 1-115 1-132 (133) 108 protein:vir:5000 Length: 141 # 97.6 8E-07 4.9E-10 54.0 8.5 109 1-115 1-135 (141) 109 protein:vir:98636 Length: 138 97.5 1.7E-06 1E-09 52.3 8.3 111 1-115 7-137 (138) 110 protein:vir:6216 Length: 125 # 97.5 7.3E-07 4.5E-10 54.2 6.3 111 1-115 1-123 (125) 111 protein:vir:4833 Length: 140 # 97.2 4.6E-06 2.9E-09 49.8 8.5 109 1-115 1-135 (140) 112 protein:vir:2688 Length: 123 # 97.0 1.2E-05 7.3E-09 47.6 8.7 106 9-115 1-122 (123) 113 protein:vir:96012 Length: 133 97.0 1.3E-05 8E-09 47.4 8.7 115 1-115 1-130 (133) 114 protein:vir:3848 Length: 159 # 96.9 3.6E-05 2.2E-08 44.9 10.3 115 1-115 1-154 (159) 115 protein:vir:3163 Length: 145 # 96.6 2.3E-05 1.4E-08 46.0 7.7 110 1-115 1-136 (145) 116 protein:vir:102608 Length: 108 96.1 7.4E-06 4.6E-09 48.7 2.2 88 1-92 1-108 (108) 117 protein:vir:105825 Length: 108 96.1 7.4E-06 4.6E-09 48.7 2.2 88 1-92 1-108 (108) 118 protein:vir:6246 Length: 143 # 96.1 6.1E-05 3.8E-08 43.7 7.1 111 1-115 1-138 (143) 119 protein:vir:99196 Length: 155 95.6 0.00012 7.4E-08 42.1 6.9 112 1-115 1-148 (155) 120 protein:vir:103841 Length: 155 95.6 0.00014 8.5E-08 41.8 7.0 112 1-115 1-148 (155) 121 protein:vir:1332 Length: 143 # 95.5 0.00015 9.1E-08 41.6 7.0 111 1-115 1-138 (143) 122 protein:vir:1988 Length: 156 # 94.9 0.00024 1.5E-07 40.4 6.3 110 1-115 1-155 (156) 123 protein:vir:7449 Length: 123 # 94.9 0.00091 5.6E-07 37.3 9.4 105 1-115 4-117 (123) 124 protein:vir:79225 Length: 155 94.2 0.00096 6E-07 37.1 8.1 112 1-115 1-148 (155) 125 protein:vir:79091 Length: 175 93.3 0.0012 7.6E-07 36.5 6.9 112 1-115 1-173 (175) 126 protein:vir:96288 Length: 100 93.2 0.0014 8.5E-07 36.3 7.2 84 1-114 13-100 (100) 127 protein:vir:107851 Length: 175 92.9 0.0032 2E-06 34.3 8.7 111 1-115 1-173 (175) 128 protein:vir:101508 Length: 120 92.8 0.0041 2.6E-06 33.6 9.2 107 1-115 1-117 (120) 129 protein:vir:94069 Length: 168 91.3 0.00036 2.3E-07 39.4 1.5 89 16-115 1-97 (168) 130 protein:vir:99833 Length: 190 90.7 0.019 1.2E-05 30.0 10.4 111 1-115 1-187 (190) 131 protein:vir:78163 Length: 92 # 90.1 0.00039 2.4E-07 39.3 0.7 83 1-104 1-92 (92) 132 protein:vir:80970 Length: 112 89.4 0.0046 2.9E-06 33.4 5.9 100 1-115 1-105 (112) 133 protein:vir:5257 Length: 148 # 88.2 0.0011 6.9E-07 36.8 1.8 85 1-115 1-87 (148) 134 protein:vir:78607 Length: 155 87.3 0.0035 2.2E-06 34.0 3.9 80 10-115 1-94 (155) 135 protein:vir:106728 Length: 155 87.0 0.0036 2.2E-06 34.0 3.7 80 10-115 1-94 (155) 136 protein:vir:107757 Length: 189 86.9 0.0016 1E-06 35.9 1.8 84 1-115 1-85 (189) 137 protein:vir:102190 Length: 93 86.9 0.0082 5.1E-06 32.0 5.7 82 24-115 1-90 (93) 138 protein:vir:1087 Length: 161 # 86.2 0.015 9.2E-06 30.6 6.7 114 1-115 1-156 (161) 139 protein:vir:3994 Length: 168 # 85.6 0.014 8.9E-06 30.7 6.3 114 1-115 1-160 (168) 140 protein:vir:101563 Length: 155 82.5 0.0079 4.9E-06 32.1 3.5 80 1-115 1-94 (155) 141 protein:vir:7412 Length: 168 # 81.9 0.033 2E-05 28.7 6.6 110 1-115 1-160 (168) 142 protein:vir:77650 Length: 155 81.3 0.0087 5.4E-06 31.9 3.3 80 1-115 1-94 (155) 143 protein:vir:45 Length: 112 # N 76.4 0.044 2.7E-05 28.0 5.6 98 1-115 1-105 (112) 144 protein:vir:1028 Length: 168 # 75.8 0.077 4.8E-05 26.7 6.7 110 1-115 1-160 (168) 145 protein:vir:79687 Length: 113 72.6 0.096 5.9E-05 26.2 6.4 98 1-115 1-102 (113) 146 protein:vir:80037 Length: 199 68.4 0.029 1.8E-05 29.0 2.6 98 10-115 1-131 (199) 147 protein:vir:79034 Length: 141 66.0 0.12 7.5E-05 25.6 5.5 103 5-115 1-128 (141) 148 protein:vir:98557 Length: 149 63.6 0.31 0.00019 23.4 7.2 112 1-115 1-148 (149) 149 protein:vir:78894 Length: 105 62.8 0.039 2.4E-05 28.3 2.2 96 1-115 1-100 (105) 150 protein:vir:99546 Length: 200 62.8 0.044 2.7E-05 28.0 2.5 88 20-115 1-135 (200) 151 protein:vir:98892 Length: 108 62.7 0.19 0.00011 24.6 5.9 99 1-115 1-103 (108) 152 protein:vir:4790 Length: 114 # 59.9 0.11 6.9E-05 25.8 4.1 103 1-115 1-109 (114) 153 protein:vir:1581 Length: 116 # 56.3 0.23 0.00014 24.1 5.2 100 1-115 1-112 (116) 154 protein:vir:8432 Length: 149 # 50.9 0.6 0.00037 21.8 7.7 106 1-115 16-143 (149) 155 protein:vir:6071 Length: 150 # 48.3 0.67 0.00042 21.5 8.3 114 1-115 1-149 (150) 156 protein:vir:95260 Length: 160 45.8 0.11 6.8E-05 25.9 1.7 74 23-115 1-90 (160) 157 protein:vir:96763 Length: 177 45.4 0.77 0.00048 21.2 9.4 115 1-115 7-174 (177) 158 protein:vir:96105 Length: 193 45.3 0.57 0.00036 21.9 5.6 59 1-59 111-193 (193) 159 protein:vir:4200 Length: 133 # 44.7 0.47 0.00029 22.4 5.0 106 1-112 1-133 (133) 160 protein:vir:79179 Length: 155 39.7 1 0.00062 20.6 7.3 110 1-115 1-154 (155) 161 protein:vir:1164 Length: 156 # 39.1 1 0.00064 20.5 8.3 113 1-115 1-151 (156) 162 protein:vir:966 Length: 123 # 37.8 1.1 0.00068 20.4 7.2 106 1-112 10-123 (123) 163 protein:vir:5703 Length: 150 # 35.8 1.2 0.00075 20.1 8.1 114 1-115 1-149 (150) 164 protein:vir:2026 Length: 150 # 31.9 1.5 0.00091 19.7 8.4 111 1-115 1-149 (150) 165 protein:vir:4162 Length: 133 # 26.7 1.5 0.00095 19.6 4.7 103 1-115 1-133 (133) 166 protein:vir:1838 Length: 149 # 24.4 2.2 0.0014 18.7 7.3 111 1-115 1-148 (149) 167 protein:vir:95372 Length: 124 23.0 2.4 0.0015 18.5 7.9 104 1-112 9-124 (124) No 1 >protein:vir:95372 Length: 124 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764480;genbank:gi:115334634;genbank:GeneID:5179259 Probab=100.00 E-value=1.6e-49 Score=288.06 Aligned_cols=115 Identities=37% Similarity=0.636 Sum_probs=108.6 Q ss_pred Cc----hHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHH----HhCCccchhhhcchhceeecCceEEEeeCCcc Q lcl|NC_012756. 1 MS----NDLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELK----ETSPKRYGKYRRSWKKKKLANGSFVVFNAVAS 72 (115) Q Consensus 1 ~~----d~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk----~~sP~~TG~y~k~W~~kk~~~~~~vv~n~~~~ 72 (115) || |+|+++|+++|++|+++++++|++++++++++++++|+ ++||++||+|+|||+.++++++++|++++.|| T Consensus 1 M~~i~id~La~~I~~~L~~Ys~~v~~~v~~~v~~vak~a~~~lkk~i~~tspkrTG~YaK~W~~kk~~e~~~V~nk~~yq 80 (124) T protein:vir:95 1 MAKIKIGRLADEITSQLRKYSQVIADDVEQIMDDVTKEAVGRLKSKIQEVGLVQTGDYMRGWTRKRVPNGWVIHNKTEYR 80 (124) T ss_pred CccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHhcCcccccchhccceeeeecCceeEEEcCCCc Confidence 65 89999999999999999999999999999999987777 58999999999999999999998655556899 Q ss_pred eeeheecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 73 LTHILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 73 ltHLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) ||||||||||+||||||+|+|||+|+||+++++|+++|+++|+ T Consensus 81 LtHLLE~GHAkr~GGRV~a~pHI~paee~~~~~l~~~i~~~l~ 123 (124) T protein:vir:95 81 LAHLLEYGHATVDGGRVPGTPHIRPIEDWLEKEFEDRVEKAIK 123 (124) T ss_pred eeeeeecceeccCCcccCCccchhHHHHHHHHHHHHHHHHHhc Confidence 9999999999999999999999999999999999999999999 No 2 >protein:vir:80116 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425608;genbank:gi:155042941;genbank:GeneID:5469542 Probab=100.00 E-value=3.3e-49 Score=286.40 Aligned_cols=115 Identities=34% Similarity=0.616 Sum_probs=108.1 Q ss_pred Cc----hHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHH----HhCCccchhhhcchhceeecCceEEEeeCCcc Q lcl|NC_012756. 1 MS----NDLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELK----ETSPKRYGKYRRSWKKKKLANGSFVVFNAVAS 72 (115) Q Consensus 1 ~~----d~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk----~~sP~~TG~y~k~W~~kk~~~~~~vv~n~~~~ 72 (115) || |+|+++|+++|++|+++++++|++++++++++++++|+ .+||++||+|+|||+.++++++++|++.+.|| T Consensus 1 M~~i~id~La~~I~~~L~~y~~~v~~~v~~~v~evak~a~~~lkk~i~~tsPkrTG~YaK~W~~k~~~~~~~v~nk~~yq 80 (127) T protein:vir:80 1 MANIKIDRLGDEITRQLKRYSQVIAGDLEQIMDDVSKEAVDRLKAKIEEEGLVQTGDYKRGWTRKRTPGGWVIHNKTEYR 80 (127) T ss_pred CccccHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCccccccccccceeeeccCceeEeecCCcc Confidence 65 89999999999999999999999999999998887777 59999999999999999999987666556899 Q ss_pred eeeheecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 73 LTHILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 73 ltHLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) ||||||||||+||||||+|+|||+||||+++++|+++|+++|+ T Consensus 81 LtHLLE~GHAkr~GGRV~a~pHI~paee~~~~~l~~~i~~~l~ 123 (127) T protein:vir:80 81 LAHLLEYGHATVDGGRVPETPHIRPVEDWLEKEFEDRVERAIK 123 (127) T ss_pred eeehhhcceeccCCcccCCccchhhHHHHHHHHHHHHHHHHhc Confidence 9999999999999999999999999999999999999999999 No 3 >protein:vir:966 Length: 123 # NCBI annotation: Orf48 # Family: family:all:970 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076620;genbank:gi:13095728;genbank:GeneID:920248 Probab=100.00 E-value=1.5e-46 Score=271.86 Aligned_cols=115 Identities=40% Similarity=0.727 Sum_probs=109.1 Q ss_pred Cc-----hHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeecCceE-EEeeC-Ccce Q lcl|NC_012756. 1 MS-----NDLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLANGSF-VVFNA-VASL 73 (115) Q Consensus 1 ~~-----d~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~~~~~-vv~n~-~~~l 73 (115) |+ |+|+++|+++|++|+++++++|+++++++|++++++|+++||++||+|+|||++++.++++. +|||+ .||| T Consensus 1 m~~~v~id~L~~~i~~~L~~y~~~v~~~v~~~v~~~a~~~~~~lk~~sP~~TG~yaksW~~k~~~~~~~~v~~~~~~y~l 80 (123) T protein:vir:96 1 MANKISIDDLAKTIESEVRNWTKDVVDDIDDIKKDITKNGVKQLRESSPKRTGDYAKNWTSQKLKNGDQVIYQKAPTYRL 80 (123) T ss_pred CCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCccccccccceeeeecCCeeEEEEEecCCcce Confidence 65 58999999999999999999999999999999999999999999999999999999988864 45554 7999 Q ss_pred eeheecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 74 THILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 74 tHLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) ||||||||++||||||+|+|||+||++++.++|+++|+++|+ T Consensus 81 ~HLLE~GHa~r~GGrV~a~phI~paee~~~~~l~~~i~r~l~ 122 (123) T protein:vir:96 81 THLLENGHAKRNGGRVSPKVHIAPVEEELVSNYISRVEKRLS 122 (123) T ss_pred EEeeecceeecCCceeCcchhhhHHHHHHHHHHHHHHHHHhc Confidence 999999999999999999999999999999999999999999 No 4 >protein:vir:81147 Length: 126 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285816;genbank:gi:148747737;genbank:GeneID:5247190 Probab=100.00 E-value=3.4e-45 Score=264.40 Aligned_cols=115 Identities=30% Similarity=0.503 Sum_probs=107.6 Q ss_pred Cc----hHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeecC---ceEEEee-CCcc Q lcl|NC_012756. 1 MS----NDLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLAN---GSFVVFN-AVAS 72 (115) Q Consensus 1 ~~----d~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~~---~~~vv~n-~~~~ 72 (115) || |+|+++|+++|++|++++++.|+++++++|++++++|+++||++||+|+|||++++..+ +.+|++| ++|| T Consensus 1 Ma~i~id~la~~I~~~L~~y~~~v~~~v~~~v~~~a~~~~~~ik~~aP~rTG~y~ksw~vk~~~~~g~~~~vv~~~~~~~ 80 (126) T protein:vir:81 1 MANITIDRLADELLQAVKEYTDDVAEGVRKKVDETARKVLKEAQALAPKRTGEYARTFTITKEDGYGTTKRIIWNKKHYR 80 (126) T ss_pred CcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcccchhhccccccccccCCcceEEEeccCCCC Confidence 65 79999999999999999999999999999999999999999999999999999887653 3455555 5899 Q ss_pred eeeheecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 73 LTHILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 73 ltHLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) |||||||||++||||||+|+|||+||++++.++|+++|+++|+ T Consensus 81 l~HLLEfGha~r~gGrV~a~Phi~Pa~e~~~~~~~~~i~~~l~ 123 (126) T protein:vir:81 81 RVHLLEFGHAKVNGGRVKEYPHLRPAYDKHGARLPDELKRVIE 123 (126) T ss_pred ceeeeecceecCCCCccCCCcchHHHHHHHHHHHHHHHHHHhh Confidence 9999999999999999999999999999999999999999999 No 5 >protein:vir:102963 Length: 163 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945289;genbank:gi:39653724;uniprot:Q708M3;genbank:GeneID:2672877 Probab=99.84 E-value=3.5e-24 Score=149.17 Aligned_cols=114 Identities=23% Similarity=0.338 Sum_probs=91.3 Q ss_pred CchHHHH----HHHHHHHhhH--HHHHHHHHHHHHHHHHHHHHHHHHhCCc---------------------------cc Q lcl|NC_012756. 1 MSNDLAD----LIAKELAAYS--DEVTEEVDKIAEQVADETVDELKETSPK---------------------------RY 47 (115) Q Consensus 1 ~~d~La~----~I~~~L~~y~--~~v~~~~~~~~~~~a~~~~~~lk~~sP~---------------------------~T 47 (115) ||+.+.- .+.+.|.+.+ ..+...++++++++|..++.+++..+|+ +| T Consensus 1 m~~~~d~~~l~~f~k~l~~~~~~~~~~~~~~~~~~e~a~~ll~~vk~rtPv~~~~~~~~~~~~~~~k~~k~~~~~~~k~t 80 (163) T protein:vir:10 1 MSGGFDYRSFAKFANNFNRNANHAKVDRFMRQTLNYEGTELKSKVKERTPVGVYTDHWVEFTTKDGKHVKFWASAHGKQG 80 (163) T ss_pred CCCccCHHHHHHHHHHHHHHhhhcchHHHHHHHHHHHHHHHHHHHHHhCCcccchhhhhhhhhcccchhhhhcccccccc Confidence 8876543 3333444332 3456678999999999999999998886 89 Q ss_pred hhhhcchhcee---ecCce-EEEeeCCcceeeheecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 48 GKYRRSWKKKK---LANGS-FVVFNAVASLTHILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 48 G~y~k~W~~kk---~~~~~-~vv~n~~~~ltHLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) |.|++||++.+ .++++ +.|+|+.++ |||||+||++++||||||++||..+++....+|++.|++.+. T Consensus 81 G~lr~swk~~~~~k~~~~~~v~v~N~~~Y-A~~VE~GHR~~~gGfV~G~fml~~s~~~~~~~~~~~~e~~l~ 151 (163) T protein:vir:10 81 GTLQKGWSKSRIEVSGRTYKQKVYNKVYY-APHVEYGHKTVNGGFVPGQFFLHKTVEDTKSDMEKRVRDKYD 151 (163) T ss_pred chhhccceecceeecCCceEEEEEecCCc-cchhhcceeecCCceeccchhhHHHHHHHHHHHHHHHHHHHH Confidence 99999998854 34444 456777555 999999999999999999999999999999999999999888 No 6 >protein:vir:105467 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529877;genbank:gi:90592617;genbank:GeneID:3974531 Probab=99.83 E-value=1.4e-23 Score=145.85 Aligned_cols=114 Identities=23% Similarity=0.333 Sum_probs=94.5 Q ss_pred Cch---HHH--HHHHHHHHhhHH--HHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceee---cCc-eEEEeeC Q lcl|NC_012756. 1 MSN---DLA--DLIAKELAAYSD--EVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKL---ANG-SFVVFNA 69 (115) Q Consensus 1 ~~d---~La--~~I~~~L~~y~~--~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~---~~~-~~vv~n~ 69 (115) ||. +++ +++++.|+++.. .+.+.+++.+++++..+..++++++|++||.|++||+..+. +++ .+.|.|+ T Consensus 1 Ms~~~id~~gl~~~~~~l~~~~~~~~~~~~~~~~l~~~~~~~~~~vk~~tPVdTG~Lr~S~~~~~~~~~~~~~~~~V~n~ 80 (144) T protein:vir:10 1 MSLGHVDDAQFQQFASRVRQKIDSGYVKQELGKSSRRIGTQSLRILEANTPVKQGNLRRSWTAEGPTYGCGGWTIKLINN 80 (144) T ss_pred CCCCCccHHHHHHHHHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHhCCCCcchhccceeecceeeecCeeEEEEecC Confidence 662 222 467777887754 57789999999999999999999999999999999987543 333 3456666 Q ss_pred CcceeeheecceeecCC------------cccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 70 VASLTHILENGHLSRNG------------GRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 70 ~~~ltHLLE~GHakr~G------------GrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) .++ ||+|||||++++| |||+|+||+.++.+.....|.+.|++.+. T Consensus 81 ~~Y-A~~VE~Ghr~~~G~~v~~~~~~~~~g~V~G~~~~~~a~~~~~~~~~~~l~k~l~ 137 (144) T protein:vir:10 81 AEY-ASYVESGHRQTPGRYVPVLKKRLVRDWVPGQFYMKKSIPQIQRQLPQLVTEGLW 137 (144) T ss_pred CCc-ccccccceeecCCcccccCCCccccceecCccchHHHHHHHHHHHHHHHHHHHH Confidence 444 9999999999988 89999999999999999999999999988 No 7 >protein:vir:79034 Length: 141 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110729;genbank:gi:134287346;genbank:GeneID:4955208 Probab=99.81 E-value=8.2e-23 Score=141.68 Aligned_cols=113 Identities=18% Similarity=0.306 Sum_probs=93.7 Q ss_pred Cch-------HHHHHHHHHHHhhHH-HHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhcee---------ecCce Q lcl|NC_012756. 1 MSN-------DLADLIAKELAAYSD-EVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKK---------LANGS 63 (115) Q Consensus 1 ~~d-------~La~~I~~~L~~y~~-~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk---------~~~~~ 63 (115) ||+ +| +++.+.|+.... ++...++++++++|..+.+++++.+|++||+|++||+... .+++. T Consensus 1 M~~~~~~d~~gl-~~~~~~l~~~~~~~~~~~~~~~~~~~a~~l~~~vk~~tPVdTG~Lr~sw~~~~~~~~~~~~~~g~~~ 79 (141) T protein:vir:79 1 MARWGSVDFREF-KRVCKKMEKLTKIDLDKFCKDAARELAARLLGKVIRRTPVDTGFLRQGWNGVAYARSLPVYKQGNNY 79 (141) T ss_pred CCCCccCcHHHH-HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhhcccccccccccccceeecCCee Confidence 666 34 566677877644 7899999999999999999999999999999999997642 22332 Q ss_pred -EEEeeCCcceeeheecceeecCC-cccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 64 -FVVFNAVASLTHILENGHLSRNG-GRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 64 -~vv~n~~~~ltHLLE~GHakr~G-GrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) +.|.|+.++ ||+||+||++++| |||+|++|+..+.+...++|.+.+++.|+ T Consensus 80 ~v~v~n~~~Y-A~~VE~Ghr~~~~~gfV~G~fml~~s~~~~~~~~~~~~~~~l~ 132 (141) T protein:vir:79 80 IIEVVNPTEY-ASYVNFGHRTKDGKGWVKGQHFLTISEMELQSQVDKIIEKKLL 132 (141) T ss_pred EEEEecCCcc-hhhhhcceeecCCcceeCCchhHHHHHHHHHHHHHHHHHHHHH Confidence 345565443 9999999999999 99999999999999999999888888888 No 8 >protein:vir:9930 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795692;genbank:gi:28876456;genbank:GeneID:1257995 Probab=99.77 E-value=9.7e-22 Score=135.77 Aligned_cols=107 Identities=19% Similarity=0.124 Sum_probs=94.2 Q ss_pred hHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeecCceEEEeeCCcceeeheeccee Q lcl|NC_012756. 3 NDLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLANGSFVVFNAVASLTHILENGHL 82 (115) Q Consensus 3 d~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~~~~~vv~n~~~~ltHLLE~GHa 82 (115) =+=.+++++.|+++++.+...+++++.+.|..+..+++.++|++||.|++||+..+.+.....|.+..++ +|+|||||+ T Consensus 1 i~Gld~l~~~l~~~~~~~~~~v~~al~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~~~~~~~~v~~~~~Y-a~~vE~GT~ 79 (108) T protein:vir:99 1 MRGLDRFLRSVERKQKSVRIAVDKELSKSAARIERQAKILAPVDTGWLRAQIYSEQQRLLHYRVVSPALY-SIYLELGTR 79 (108) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCchhhhcceeeeecCcEEEEeecCccc-chhcccCcc Confidence 2223567777889999999999999999999999999999999999999999998877666666655433 899999998 Q ss_pred ecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 83 SRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 83 kr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) +++|+|||+|+.+.....|+++|+++++ T Consensus 80 -----~m~a~Pf~~pa~~~~~~~~~~~i~~~lr 107 (108) T protein:vir:99 80 -----KMEAQSFLDPALRKEWPVLMANIKKMFK 107 (108) T ss_pred -----ccCCCcchhhhHHHHHHHHHHHHHHHhc Confidence 5999999999999999999999999999 No 9 >protein:vir:3617 Length: 112 # NCBI annotation: ORF40 # Family: family:all:180 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112703;genbank:gi:13786571;genbank:GeneID:921069 Probab=99.74 E-value=6.3e-21 Score=131.31 Aligned_cols=107 Identities=20% Similarity=0.211 Sum_probs=88.7 Q ss_pred CchHH----HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeecCc-eEEEeeCCcceee Q lcl|NC_012756. 1 MSNDL----ADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLANG-SFVVFNAVASLTH 75 (115) Q Consensus 1 ~~d~L----a~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~~~-~~vv~n~~~~ltH 75 (115) ||-.+ -+++.+.|.+... .+.+++++++.+..+.++++.++|++||.|++||+.+...++ ++.|.+..++ +| T Consensus 1 M~~~i~i~Gld~l~~~L~~~~~--~~~~~~al~~~~~~i~~~ak~~aPvdTG~Lr~si~~~~~~~~~~~~V~~~~~Y-a~ 77 (112) T protein:vir:36 1 MKSSLSFKGIDQLVKHLDKAAS--LKGVQQVVKSNTSNMTANMQKLVPVDTGYMKRSIKMELTEGGFSGQAGPHTDY-SA 77 (112) T ss_pred CceeeeehhHHHHHHHHHhhhh--HHHHHHHHHHHHHHHHHHHHHhCCCCchhhhhceeeeecCCceEEEeecCCCc-cc Confidence 55333 3567777877654 366899999999999999999999999999999998765544 4455554433 89 Q ss_pred heecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 76 ILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 76 LLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) +|||||+ ++||+|||+||.+.....|.++|+++++ T Consensus 78 ~vE~GT~-----k~~a~Pfl~pa~~~~~~~~~~~i~~~lr 112 (112) T protein:vir:36 78 YVEYGTR-----FQSAQPFVKPAYNEQKGVFIKDLERLLK 112 (112) T ss_pred eeecccc-----ccCCCcchhhhHHHHHHHHHHHHHHHcC Confidence 9999999 5999999999999999999999999999 No 10 >protein:vir:95789 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950593;genbank:gi:119953788;genbank:GeneID:5076859 Probab=99.73 E-value=6e-21 Score=131.46 Aligned_cols=108 Identities=14% Similarity=0.142 Sum_probs=94.4 Q ss_pred CchHH--HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeecCceEEEeeCCcceeehee Q lcl|NC_012756. 1 MSNDL--ADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLANGSFVVFNAVASLTHILE 78 (115) Q Consensus 1 ~~d~L--a~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~~~~~vv~n~~~~ltHLLE 78 (115) ||=.+ .+++.+.|+++++.+.+.+++++++.|..+.+++++.+|++||.|++||+.+..+....|..+..| +++|| T Consensus 1 msi~i~Gld~l~~~l~~~~~~~~~~v~~al~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~~g~~~~V~~~~~Y--a~yvE 78 (114) T protein:vir:95 1 MAIKWQGIEKLVATISNAQPKAVEQSLQVLKNNGEKGKRIAKQLAPKDTEFLKDHITTSYPGMEAHIHGEAGY--DGYQE 78 (114) T ss_pred CeeeeehHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCchhhhhceeeecCceEEEeecCCCc--cceee Confidence 66332 368888999999999999999999999999999999999999999999998877655455444454 78999 Q ss_pred cceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 79 NGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 79 ~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) |||++ ++|+|||+|+.+.....|.++|++.++ T Consensus 79 ~GT~~-----~~aqPfl~pa~~~~~~~~~~~l~~~l~ 110 (114) T protein:vir:95 79 YGTRF-----QPGTPHFRPMMEQIQPQFQKDMTDVMK 110 (114) T ss_pred cCccc-----cCCCccchhhHHHHHHHHHHHHHHHHH Confidence 99994 999999999999999999999999998 No 11 >protein:vir:94538 Length: 125 # NCBI annotation: putative head to tail joining # Family: family:all:180 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223893;genbank:gi:62327105;genbank:GeneID:5075554 Probab=99.72 E-value=1.3e-20 Score=129.57 Aligned_cols=109 Identities=15% Similarity=0.179 Sum_probs=89.5 Q ss_pred CchHH------HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceee--cCc--eEEEeeCC Q lcl|NC_012756. 1 MSNDL------ADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKL--ANG--SFVVFNAV 70 (115) Q Consensus 1 ~~d~L------a~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~--~~~--~~vv~n~~ 70 (115) |++++ -+++.+.|+++.+.+.+.+..++.+.+..+.++++.++|++||.|++||+.... .++ ...|.++. T Consensus 1 Ma~~~~i~~~Gld~l~~~L~~~~~~~~~~v~~al~~~a~~i~~~ak~~ap~~tG~L~~sI~~~~~~~~~~~~~~~v~~~~ 80 (125) T protein:vir:94 1 MANDFNIKFKGVDKLLDEFDISRKELVPYSVEAMKTSLSRAVEKSKGLARVDTGYMRNNIQQDEVKEEHGVVTGRYVARA 80 (125) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCChhhhhhceecceeccCCcEEEEeeCCC Confidence 77752 278999999999999999999999999999999999999999999999976532 222 23444443 Q ss_pred cceeeheecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 71 ASLTHILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 71 ~~ltHLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) ++ +|+|||||++ ++|+|||+|+.+.....|.+.|++.++ T Consensus 81 ~Y-a~~vEfGT~~-----~~a~Pfl~pa~~~~~~~~~~~l~~~l~ 119 (125) T protein:vir:94 81 DY-SSYNEYGTYR-----MSAQPFMAPSVAAMTPFFYKAVRDALN 119 (125) T ss_pred Cc-cceeeccccc-----CCCCcccchhHHHHHHHHHHHHHHHHH Confidence 33 8999999995 899999999999887777777777666 No 12 >protein:vir:743 Length: 108 # NCBI annotation: unknown # Family: family:all:180 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108720;genbank:gi:13487842;genbank:GeneID:920877 Probab=99.71 E-value=5.1e-20 Score=126.36 Aligned_cols=107 Identities=17% Similarity=0.188 Sum_probs=84.6 Q ss_pred CchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeecCc-eEEEeeCCcceeeheec Q lcl|NC_012756. 1 MSNDLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLANG-SFVVFNAVASLTHILEN 79 (115) Q Consensus 1 ~~d~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~~~-~~vv~n~~~~ltHLLE~ 79 (115) |.=+=-+++++.|++.. ....+++++++.|..+.++++.++|++||.|++||+.+...++ ..+|.+..++ +|+||| T Consensus 1 i~i~Gld~l~~~l~~~~--~~~~~~~al~~~a~~i~~~ak~~aPv~TG~Lr~si~~~~~~~~~~~~V~~~~~Y-a~~vE~ 77 (108) T protein:vir:74 1 MKITGIDALQKKLRKNA--TLDDVKHVVKSNTASMNKNMQNLAPVDTGNMKRSITSEFTDGGLSGTTGPHTDY-AGYVEY 77 (108) T ss_pred CcchhHHHHHHHHHHhh--hHHHHHHHHHHHHHHHHHHHHHhCCCCchhhhccceeeeecCceEEEeecCCCc-ccceec Confidence 21111133444444433 3467899999999999999999999999999999998765443 4556665444 899999 Q ss_pred ceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 80 GHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 80 GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) ||++ ++|+|||+||.+...+.|.++|+++++ T Consensus 78 GT~k-----m~aqpf~~pa~~~~~~~~~~~i~~~~k 108 (108) T protein:vir:74 78 GTRF-----QSAQPFVKPAFNIQKKVFTNDLERLTK 108 (108) T ss_pred cccc-----cCCCcchhhHHHHHHHHHHHHHHHHcC Confidence 9994 999999999999999999999999999 No 13 >protein:vir:2740 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695113;genbank:gi:23455882;genbank:GeneID:955595 Probab=99.68 E-value=6.9e-20 Score=125.64 Aligned_cols=108 Identities=16% Similarity=0.206 Sum_probs=87.5 Q ss_pred Cch-HHH--HHHHHHHHhh--HHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeecCceEEEeeCCcceee Q lcl|NC_012756. 1 MSN-DLA--DLIAKELAAY--SDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLANGSFVVFNAVASLTH 75 (115) Q Consensus 1 ~~d-~La--~~I~~~L~~y--~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~~~~~vv~n~~~~ltH 75 (115) |++ ++. +++.+.|+++ .+.+.+.+++++.++++++++..+..+|++||.+++||+.....++..|-.+..| +| T Consensus 1 Ma~i~~~Gld~l~~~L~~~~~~~~v~~~~~~~~~~~~~~~~~~a~~~~p~~TG~Lr~sI~~~~~~~~~~V~~~~~Y--a~ 78 (114) T protein:vir:27 1 MATIEFEGLDEMAQSLLKNASPEKRSKVLRKYGSKLKEAAVNRAQFNKGYSTGATRRSITLQVESDKATVEALTSY--SG 78 (114) T ss_pred CeeeeeehHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcccCCCCCchhhhhceeeeecCCeeEecCCCCc--cc Confidence 774 222 6777777776 4667777888888888888888877889999999999998765555444444444 79 Q ss_pred heecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 76 ILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 76 LLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) +|||||+ +++|+|||+||.+.....|.++|+++++ T Consensus 79 ~vEfGT~-----km~a~Pfl~PA~~~~~~~~~~~l~~l~k 113 (114) T protein:vir:27 79 YLEVGTR-----KMEAQPFMKPALDEVAPKMVEELAKWDE 113 (114) T ss_pred eeccccc-----ccCCCCchhhhHHHHHHHHHHHHHHHhc Confidence 9999998 5999999999999999999999999999 No 14 >protein:vir:4906 Length: 114 # NCBI annotation: gp114 # Family: family:all:180 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056684;genbank:gi:9635019;genbank:GeneID:1262668 Probab=99.68 E-value=6.9e-20 Score=125.64 Aligned_cols=108 Identities=16% Similarity=0.206 Sum_probs=87.5 Q ss_pred Cch-HHH--HHHHHHHHhh--HHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeecCceEEEeeCCcceee Q lcl|NC_012756. 1 MSN-DLA--DLIAKELAAY--SDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLANGSFVVFNAVASLTH 75 (115) Q Consensus 1 ~~d-~La--~~I~~~L~~y--~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~~~~~vv~n~~~~ltH 75 (115) |++ ++. +++.+.|+++ .+.+.+.+++++.++++++++..+..+|++||.+++||+.....++..|-.+..| +| T Consensus 1 Ma~i~~~Gld~l~~~L~~~~~~~~v~~~~~~~~~~~~~~~~~~a~~~~p~~TG~Lr~sI~~~~~~~~~~V~~~~~Y--a~ 78 (114) T protein:vir:49 1 MATIEFEGLDEMAQSLLKNASPEKRSKVLRKYGSKLKEAAVNRAQFNKGYSTGATRRSITLQVESDKATVEALTSY--SG 78 (114) T ss_pred CeeeeeehHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcccCCCCCchhhhhceeeeecCCeeEecCCCCc--cc Confidence 774 222 6777777776 4667777888888888888888877889999999999998765555444444444 79 Q ss_pred heecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 76 ILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 76 LLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) +|||||+ +++|+|||+||.+.....|.++|+++++ T Consensus 79 ~vEfGT~-----km~a~Pfl~PA~~~~~~~~~~~l~~l~k 113 (114) T protein:vir:49 79 YLEVGTR-----KMEAQPFMKPALDEVAPKMVEELAKWDE 113 (114) T ss_pred eeccccc-----ccCCCCchhhhHHHHHHHHHHHHHHHhc Confidence 9999998 5999999999999999999999999999 No 15 >protein:vir:98409 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:YP_001210363;genbank:gi:146334932;genbank:GeneID:5114801 Probab=99.66 E-value=5.3e-19 Score=120.78 Aligned_cols=107 Identities=17% Similarity=0.203 Sum_probs=85.3 Q ss_pred CchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeecCc-eEEEeeCCcceeeheec Q lcl|NC_012756. 1 MSNDLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLANG-SFVVFNAVASLTHILEN 79 (115) Q Consensus 1 ~~d~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~~~-~~vv~n~~~~ltHLLE~ 79 (115) |.=+=-+++.+.|++.. ....+++++++.+..+.+++++++|++||.|++||+.....++ ..+|.+..++ +|+||| T Consensus 1 i~i~Gld~l~~~l~~~~--~~~~~~~al~~~a~~i~~~ak~~apvdTG~Lr~si~~~~~~~~~~~~V~~~~~Y-a~~vE~ 77 (108) T protein:vir:98 1 MKITGIDALQKKLRKNA--TLNDVKHVVKRNTVSMNKNMQNLAPVDTGNMKRSITSEFTDGGLTGTTIPHTDY-AGYVEY 77 (108) T ss_pred CcchhHHHHHHHHHHhh--hHHHHHHHHHHHHHHHHHHHHHhCCCCchhhHhhceeeeecCceEEEeecCCCc-cceeec Confidence 32222244555555533 3466899999999999999999999999999999987755433 4556555443 899999 Q ss_pred ceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 80 GHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 80 GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) ||+ +++|+|||+||.+...+.|.++|+++++ T Consensus 78 GT~-----~m~aqPFl~pa~~~~~~~~~~~i~~~lr 108 (108) T protein:vir:98 78 GTR-----FQAAQPFVKPAFDVQKKIFTNDLERLTK 108 (108) T ss_pred ccc-----ccCCCcchhhHHHHHHHHHHHHHHHHcC Confidence 999 5999999999999999999999999999 No 16 >protein:vir:97144 Length: 115 # NCBI annotation: ORF047 # Family: family:all:180 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239729;genbank:gi:66394911;genbank:GeneID:5130877 Probab=99.65 E-value=7.4e-19 Score=119.98 Aligned_cols=109 Identities=19% Similarity=0.202 Sum_probs=98.0 Q ss_pred CchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhC------CccchhhhcchhceeecCceEEEeeCCccee Q lcl|NC_012756. 1 MSNDLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETS------PKRYGKYRRSWKKKKLANGSFVVFNAVASLT 74 (115) Q Consensus 1 ~~d~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~s------P~~TG~y~k~W~~kk~~~~~~vv~n~~~~lt 74 (115) |+=+=-+++.+.|+++++.+.+.+++++++.+..+.++.++++ |++||.+++||+..+.+....+|....++ + T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~~~~Y-a 79 (115) T protein:vir:97 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITSHAAY-S 79 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeecCccc-h Confidence 7666677888889999999999999999999999999999987 89999999999998887766767666443 7 Q ss_pred eheecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 75 HILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 75 HLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) |+||||.+ +++|+|||+||.+...+.|.++|+++++ T Consensus 80 ~~vE~GT~-----km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:97 80 GFLEFGTR-----YMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred hhhccccc-----ccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 99999998 4999999999999999999999999999 No 17 >protein:vir:96225 Length: 115 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239574;genbank:gi:66395330;genbank:GeneID:5132773 Probab=99.65 E-value=7.4e-19 Score=119.98 Aligned_cols=109 Identities=19% Similarity=0.202 Sum_probs=98.0 Q ss_pred CchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhC------CccchhhhcchhceeecCceEEEeeCCccee Q lcl|NC_012756. 1 MSNDLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETS------PKRYGKYRRSWKKKKLANGSFVVFNAVASLT 74 (115) Q Consensus 1 ~~d~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~s------P~~TG~y~k~W~~kk~~~~~~vv~n~~~~lt 74 (115) |+=+=-+++.+.|+++++.+.+.+++++++.+..+.++.++++ |++||.+++||+..+.+....+|....++ + T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~~~~Y-a 79 (115) T protein:vir:96 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITSHAAY-S 79 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeecCccc-h Confidence 7666677888889999999999999999999999999999987 89999999999998887766767666443 7 Q ss_pred eheecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 75 HILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 75 HLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) |+||||.+ +++|+|||+||.+...+.|.++|+++++ T Consensus 80 ~~vE~GT~-----km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:96 80 GFLEFGTR-----YMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred hhhccccc-----ccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 99999998 4999999999999999999999999999 No 18 >protein:vir:103917 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873996;genbank:gi:118430771;genbank:GeneID:4525409 Probab=99.65 E-value=7.4e-19 Score=119.98 Aligned_cols=109 Identities=19% Similarity=0.202 Sum_probs=98.0 Q ss_pred CchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhC------CccchhhhcchhceeecCceEEEeeCCccee Q lcl|NC_012756. 1 MSNDLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETS------PKRYGKYRRSWKKKKLANGSFVVFNAVASLT 74 (115) Q Consensus 1 ~~d~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~s------P~~TG~y~k~W~~kk~~~~~~vv~n~~~~lt 74 (115) |+=+=-+++.+.|+++++.+.+.+++++++.+..+.++.++++ |++||.+++||+..+.+....+|....++ + T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~~~~Y-a 79 (115) T protein:vir:10 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITSHAAY-S 79 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeecCccc-h Confidence 7666677888889999999999999999999999999999987 89999999999998887766767666443 7 Q ss_pred eheecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 75 HILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 75 HLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) |+||||.+ +++|+|||+||.+...+.|.++|+++++ T Consensus 80 ~~vE~GT~-----km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:10 80 GFLEFGTR-----YMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred hhhccccc-----ccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 99999998 4999999999999999999999999999 No 19 >protein:vir:78858 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285365;genbank:gi:148717893;genbank:GeneID:5246989 Probab=99.65 E-value=7.4e-19 Score=119.98 Aligned_cols=109 Identities=19% Similarity=0.202 Sum_probs=98.0 Q ss_pred CchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhC------CccchhhhcchhceeecCceEEEeeCCccee Q lcl|NC_012756. 1 MSNDLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETS------PKRYGKYRRSWKKKKLANGSFVVFNAVASLT 74 (115) Q Consensus 1 ~~d~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~s------P~~TG~y~k~W~~kk~~~~~~vv~n~~~~lt 74 (115) |+=+=-+++.+.|+++++.+.+.+++++++.+..+.++.++++ |++||.+++||+..+.+....+|....++ + T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~~~~Y-a 79 (115) T protein:vir:78 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITSHAAY-S 79 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeecCccc-h Confidence 7666677888889999999999999999999999999999987 89999999999998887766767666443 7 Q ss_pred eheecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 75 HILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 75 HLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) |+||||.+ +++|+|||+||.+...+.|.++|+++++ T Consensus 80 ~~vE~GT~-----km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:78 80 GFLEFGTR-----YMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred hhhccccc-----ccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 99999998 4999999999999999999999999999 No 20 >protein:vir:9312 Length: 115 # NCBI annotation: phi Mu50B-like protein # Family: family:all:180 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803290;genbank:gi:29028600;genbank:GeneID:1258048 Probab=99.65 E-value=7.4e-19 Score=119.98 Aligned_cols=109 Identities=19% Similarity=0.202 Sum_probs=98.0 Q ss_pred CchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhC------CccchhhhcchhceeecCceEEEeeCCccee Q lcl|NC_012756. 1 MSNDLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETS------PKRYGKYRRSWKKKKLANGSFVVFNAVASLT 74 (115) Q Consensus 1 ~~d~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~s------P~~TG~y~k~W~~kk~~~~~~vv~n~~~~lt 74 (115) |+=+=-+++.+.|+++++.+.+.+++++++.+..+.++.++++ |++||.+++||+..+.+....+|....++ + T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~~~~Y-a 79 (115) T protein:vir:93 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITSHAAY-S 79 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeecCccc-h Confidence 7666677888889999999999999999999999999999987 89999999999998887766767666443 7 Q ss_pred eheecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 75 HILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 75 HLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) |+||||.+ +++|+|||+||.+...+.|.++|+++++ T Consensus 80 ~~vE~GT~-----km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:93 80 GFLEFGTR-----YMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred hhhccccc-----ccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 99999998 4999999999999999999999999999 No 21 >protein:vir:96358 Length: 115 # NCBI annotation: ORF045 # Family: family:all:180 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239651;genbank:gi:66395408;genbank:GeneID:5132834 Probab=99.65 E-value=7.4e-19 Score=119.98 Aligned_cols=109 Identities=19% Similarity=0.202 Sum_probs=98.0 Q ss_pred CchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhC------CccchhhhcchhceeecCceEEEeeCCccee Q lcl|NC_012756. 1 MSNDLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETS------PKRYGKYRRSWKKKKLANGSFVVFNAVASLT 74 (115) Q Consensus 1 ~~d~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~s------P~~TG~y~k~W~~kk~~~~~~vv~n~~~~lt 74 (115) |+=+=-+++.+.|+++++.+.+.+++++++.+..+.++.++++ |++||.+++||+..+.+....+|....++ + T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~~~~Y-a 79 (115) T protein:vir:96 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITSHAAY-S 79 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeecCccc-h Confidence 7666677888889999999999999999999999999999987 89999999999998887766767666443 7 Q ss_pred eheecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 75 HILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 75 HLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) |+||||.+ +++|+|||+||.+...+.|.++|+++++ T Consensus 80 ~~vE~GT~-----km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:96 80 GFLEFGTR-----YMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred hhhccccc-----ccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 99999998 4999999999999999999999999999 No 22 >protein:vir:106623 Length: 115 # NCBI annotation: ORF049 # Family: family:all:180 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239497;genbank:gi:66395260;genbank:GeneID:4555777 Probab=99.65 E-value=7.9e-19 Score=119.83 Aligned_cols=109 Identities=17% Similarity=0.177 Sum_probs=96.9 Q ss_pred CchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhC------CccchhhhcchhceeecCceEEEeeCCccee Q lcl|NC_012756. 1 MSNDLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETS------PKRYGKYRRSWKKKKLANGSFVVFNAVASLT 74 (115) Q Consensus 1 ~~d~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~s------P~~TG~y~k~W~~kk~~~~~~vv~n~~~~lt 74 (115) |.=+=-+++.+.|+++++.+.+.+++++++.+..+.+++++++ |++||.+++||...+.+..+..|....++ + T Consensus 1 i~i~Gld~L~~~l~~~~~~~~~~~~~al~~~~~~i~~~a~~~a~~~~~~pv~TG~Lr~sI~~~~~g~~~~~v~~~~~Y-a 79 (115) T protein:vir:10 1 MQSKGLKKLMNHLKVMHDDIEDDVDDILKNNAKEGVGIAVSNAKEVMNKGYWTGNLASLIEVKKIGDLHYRVISTAHY-S 79 (115) T ss_pred CeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcchhhhhceeeeecCcEEEEeeCCCcc-c Confidence 6666668888889999999999999999999999999999987 88999999999998877666666655443 8 Q ss_pred eheecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 75 HILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 75 HLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) |+||||.+ +++++|||+||.+.....|.++|+++++ T Consensus 80 ~~vEfGT~-----km~a~PFl~PA~~~~k~~~~~~i~~~i~ 115 (115) T protein:vir:10 80 GFLEFGTR-----YMEPAPFMFPTYQTLKKSTINDLKRLLS 115 (115) T ss_pred hheecccc-----cCCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 99999998 5999999999999999999999999999 No 23 >protein:vir:5978 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690678;genbank:geneid:6329146;genbank:gi:22855072;interpro:IPR011693;uniprot:O48447;genbank:GeneID:955318 Probab=99.65 E-value=8.9e-19 Score=119.55 Aligned_cols=114 Identities=17% Similarity=0.245 Sum_probs=95.5 Q ss_pred CchHH----HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeecCc-eEEEeeCCcceee Q lcl|NC_012756. 1 MSNDL----ADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLANG-SFVVFNAVASLTH 75 (115) Q Consensus 1 ~~d~L----a~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~~~-~~vv~n~~~~ltH 75 (115) ||+.+ -+++.+.|+.+.+.+.+.+++++.++|.++..+++.++|++||.|++||+.....++ +.+|.+...+ +. T Consensus 4 ms~~i~~~g~~~l~~~l~~~~~~~~~~v~~~l~~~a~~i~~~ak~~apv~TG~Lr~SI~~~~~~~g~~~~V~~~~~Y-A~ 82 (144) T protein:vir:59 4 MSVRIDPSWRRIMSRNVRTFSGHVLTQVEQVIIKTAEKIAGLAASLAPVDEGNLKNSIQIDYKNNGLTAEITVGAEY-AI 82 (144) T ss_pred ceeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcCeeEEeecCcEEEEEecCCCc-cc Confidence 88855 368888999999999999999999999999999999999999999999987655444 4556555333 78 Q ss_pred heecceeecC---Cc-------------------ccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 76 ILENGHLSRN---GG-------------------RVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 76 LLE~GHakr~---GG-------------------rV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) ++||||.... +| .+||+||++||.+...+.|.++|++++= T Consensus 83 ~vE~GT~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~~~~~i~~~~g 144 (144) T protein:vir:59 83 YVEYGTGIYAVDGNGRKTPWTYYSPKLGRYVRTQGAPAQPFFWPAVEEGGEYFEREMRRLRG 144 (144) T ss_pred hhhcCccccccCCCccccccccccccccceecCCCCCCCcchhHHHHHHHHHHHHHHHHhcC Confidence 9999984321 11 3799999999999999999999999999 No 24 >protein:vir:102338 Length: 116 # NCBI annotation: hypothetical protein # Family: family:all:26573 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529563;genbank:gi:90592648;genbank:GeneID:3974470 Probab=99.63 E-value=3.1e-19 Score=122.03 Aligned_cols=95 Identities=24% Similarity=0.384 Sum_probs=83.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCc---cchhhhcchhceeecCceEEEeeCCcceeeheecceeecCC---------- Q lcl|NC_012756. 20 VTEEVDKIAEQVADETVDELKETSPK---RYGKYRRSWKKKKLANGSFVVFNAVASLTHILENGHLSRNG---------- 86 (115) Q Consensus 20 v~~~~~~~~~~~a~~~~~~lk~~sP~---~TG~y~k~W~~kk~~~~~~vv~n~~~~ltHLLE~GHakr~G---------- 86 (115) +.+.++++++++|.+++.++++.+|+ +||.|++||+.........+|+|+..+ ||++|+||+.++| T Consensus 1 l~~~~~~~~~~~a~~l~~~vk~rTPv~~~d~G~LR~sW~~g~v~k~~~~v~N~~eY-A~~VE~GHRq~~g~g~~~~~~gk 79 (116) T protein:vir:10 1 MSKNLRRAKNNIGNKLLRKVKPKTPVAKIDGGTARKSWKYKELNLFDGVVSNNVEY-IHHLEYGHRTRQGTGTSENYRPK 79 (116) T ss_pred CchHHHHHHHHHHHHHHHHHHhhCCCCcCCCcccccCceeeeeeccCceeecCCcc-cccccCCceeeCCcceecccccc Confidence 78889999999999999999999998 469999999986655444456666444 8999999999987 Q ss_pred ----cccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 87 ----GRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 87 ----GrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) |||+|++|++-+++....+|++.+++.+. T Consensus 80 rlk~~~V~G~fml~~s~~e~~~~~~~~~~~~~~ 112 (116) T protein:vir:10 80 PNGISFVPGVFMLARSVDEMSSIIDDELNQIII 112 (116) T ss_pred cccCCccCceehHHHHHHHHHHHHHHHHHHHHH Confidence 59999999999999999999999999997 No 25 >protein:vir:99744 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004311;genbank:gi:122891765;genbank:GeneID:4712299 Probab=99.63 E-value=1.5e-18 Score=118.31 Aligned_cols=109 Identities=19% Similarity=0.200 Sum_probs=98.0 Q ss_pred CchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhC------CccchhhhcchhceeecCceEEEeeCCccee Q lcl|NC_012756. 1 MSNDLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETS------PKRYGKYRRSWKKKKLANGSFVVFNAVASLT 74 (115) Q Consensus 1 ~~d~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~s------P~~TG~y~k~W~~kk~~~~~~vv~n~~~~lt 74 (115) |+=+=-|++.+.|++.++.+.+.+++++++.+.++.+++++++ |++||.+++|+...+.+.....|.+..++ + T Consensus 1 i~i~Gld~L~~~l~~~~~~~~~~v~~av~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~SI~~~~~g~~~~~V~~~~~Y-a 79 (115) T protein:vir:99 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTVDLQYTITSHAAY-S 79 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcchhhhhceeeeecCcEEEEecCCccc-c Confidence 7666678888899999999999999999999999999999987 99999999999998877666667665444 8 Q ss_pred eheecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 75 HILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 75 HLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) ++||||++ +++|+||++||.+.....|.++|+++++ T Consensus 80 ~~vE~GT~-----~m~a~PFl~PA~~~~k~~~~~~l~~~~k 115 (115) T protein:vir:99 80 GFLEFGTR-----YMEAEPFMWPVYEVIRKSTVEELKTLFE 115 (115) T ss_pred cccccccc-----ccCCCCcchhhHHHHHHHHHHHHHHHhC Confidence 89999998 5999999999999999999999999999 No 26 >protein:vir:95894 Length: 137 # NCBI annotation: ORF046 # Family: family:all:180 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240389;genbank:gi:66396083;genbank:GeneID:5133405 Probab=99.62 E-value=2.8e-18 Score=116.85 Aligned_cols=110 Identities=17% Similarity=0.128 Sum_probs=90.3 Q ss_pred CchHH--HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeecCc-eEEEeeCCcceeehe Q lcl|NC_012756. 1 MSNDL--ADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLANG-SFVVFNAVASLTHIL 77 (115) Q Consensus 1 ~~d~L--a~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~~~-~~vv~n~~~~ltHLL 77 (115) ||+-+ .+++++.|+++.+++.+.+++++.+++.++..+++..+|++||.|++||+.+....+ ..+|.+...+ ++++ T Consensus 1 Ma~~~~G~~~l~~~l~~~~~~~~~~~~~~~~~~a~~v~~~ak~~aPv~TG~L~~Si~~~~~~~~~~~~V~~~~~Y-A~~v 79 (137) T protein:vir:95 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDGGFTGVINIGSEY-AIYV 79 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcCeeeEeeCCceEEEEecCCCc-cccc Confidence 98855 457889999999999999999999999999999999999999999999987655443 3445544332 8999 Q ss_pred ecceee---cCCc---------------------ccCccchhhhHHHHHHHHHHHHHH Q lcl|NC_012756. 78 ENGHLS---RNGG---------------------RVAGIVHIKPAEEKAIQNFEKRIK 111 (115) Q Consensus 78 E~GHak---r~GG---------------------rV~~~phI~paee~~~~~~~~~i~ 111 (115) ||||.. ++++ .+|++|||+||.+.....|.++|- T Consensus 80 E~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~i~k~l~ 137 (137) T protein:vir:95 80 NYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred ccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 999743 2221 279999999999999999988888 No 27 >protein:vir:97088 Length: 157 # NCBI annotation: hypothetical protein # Family: family:all:2714 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453568;genbank:gi:84662603;genbank:GeneID:5142503 Probab=99.61 E-value=2.1e-18 Score=117.47 Aligned_cols=115 Identities=17% Similarity=0.209 Sum_probs=89.1 Q ss_pred CchHH----HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhce----eecCceE---EEe-e Q lcl|NC_012756. 1 MSNDL----ADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKK----KLANGSF---VVF-N 68 (115) Q Consensus 1 ~~d~L----a~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~k----k~~~~~~---vv~-n 68 (115) ||=++ -+.+...|+++.+.+.+.+..++.+.|..++++++..+|++||+|+++.... +.+++.. |-. . T Consensus 1 m~~~~~~~d~s~l~~~l~~l~~~~~~v~R~A~~~ga~vv~dear~~aP~~tG~LkksI~~~~~~~~s~~g~~~~~Vg~~~ 80 (157) T protein:vir:97 1 MKFSIRSVDITGILAGLETVVEHSSDVVRTMTYESAVAVRESAKAFVNDETGKLRNNLYVAYSPEESVEGIQTYAVSWRK 80 (157) T ss_pred CeeEeecccHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhheeeeeccccCCCceEEEEEeecC Confidence 44333 3357778999999999999999999999999999999999999999998654 2334433 223 3 Q ss_pred CCcceeeheecceeecCC-------------------cccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 69 AVASLTHILENGHLSRNG-------------------GRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 69 ~~~~ltHLLE~GHakr~G-------------------GrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) .+.++.||+||||..+.+ -++|++|||+||++...++..+.+.+-+. T Consensus 81 ~~a~~g~~vEfG~~~~~~~~~~~~~~~~~~~~~~~t~~~~Pa~PFlRPA~d~~k~~a~~~~~~~l~ 146 (157) T protein:vir:97 81 KAAPHGHLLEFGHWQTHAAYRDKDGQWYSSKVKLVNPKWIPAKPFLRPGYDSVAMQIPDIARAAGA 146 (157) T ss_pred CccceeeeeecCcccccccccCCcccccccccccCCCCcCCCCcccchHHHHhHHHHHHHHHHHHH Confidence 477889999999976542 13899999999999888777777644322 No 28 >protein:vir:93738 Length: 137 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240463;genbank:gi:66396153;genbank:GeneID:5133507 Probab=99.61 E-value=3.6e-18 Score=116.24 Aligned_cols=110 Identities=17% Similarity=0.130 Sum_probs=91.4 Q ss_pred CchHH--HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeecCc-eEEEeeCCcceeehe Q lcl|NC_012756. 1 MSNDL--ADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLANG-SFVVFNAVASLTHIL 77 (115) Q Consensus 1 ~~d~L--a~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~~~-~~vv~n~~~~ltHLL 77 (115) ||.-+ .+++.+.|+++.+++.+.+++++++++..+.++++..+|++||.|++||+......+ +.+|.+...+ +|++ T Consensus 1 Ma~~~~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~Y-A~~v 79 (137) T protein:vir:93 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEY-AIYV 79 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEeecCceEEEEecCCCc-cccc Confidence 88855 457889999999999999999999999999999999999999999999988755444 4555554332 8999 Q ss_pred ecceee---cCCc---------------------ccCccchhhhHHHHHHHHHHHHHH Q lcl|NC_012756. 78 ENGHLS---RNGG---------------------RVAGIVHIKPAEEKAIQNFEKRIK 111 (115) Q Consensus 78 E~GHak---r~GG---------------------rV~~~phI~paee~~~~~~~~~i~ 111 (115) ||||.. ++++ .+|++|||+||.+.....|.++|- T Consensus 80 E~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:93 80 NYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred ccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 999843 2211 269999999999999999999888 No 29 >protein:vir:94490 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240680;genbank:gi:66396374;genbank:GeneID:5133754 Probab=99.61 E-value=3.6e-18 Score=116.24 Aligned_cols=110 Identities=17% Similarity=0.130 Sum_probs=91.4 Q ss_pred CchHH--HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeecCc-eEEEeeCCcceeehe Q lcl|NC_012756. 1 MSNDL--ADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLANG-SFVVFNAVASLTHIL 77 (115) Q Consensus 1 ~~d~L--a~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~~~-~~vv~n~~~~ltHLL 77 (115) ||.-+ .+++.+.|+++.+++.+.+++++++++..+.++++..+|++||.|++||+......+ +.+|.+...+ +|++ T Consensus 1 Ma~~~~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~Y-A~~v 79 (137) T protein:vir:94 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEY-AIYV 79 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEeecCceEEEEecCCCc-cccc Confidence 88855 457889999999999999999999999999999999999999999999988755444 4555554332 8999 Q ss_pred ecceee---cCCc---------------------ccCccchhhhHHHHHHHHHHHHHH Q lcl|NC_012756. 78 ENGHLS---RNGG---------------------RVAGIVHIKPAEEKAIQNFEKRIK 111 (115) Q Consensus 78 E~GHak---r~GG---------------------rV~~~phI~paee~~~~~~~~~i~ 111 (115) ||||.. ++++ .+|++|||+||.+.....|.++|- T Consensus 80 E~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:94 80 NYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred ccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 999843 2211 269999999999999999999888 No 30 >protein:vir:97427 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240753;genbank:gi:66396447;genbank:GeneID:5133783 Probab=99.61 E-value=3.6e-18 Score=116.24 Aligned_cols=110 Identities=17% Similarity=0.130 Sum_probs=91.4 Q ss_pred CchHH--HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeecCc-eEEEeeCCcceeehe Q lcl|NC_012756. 1 MSNDL--ADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLANG-SFVVFNAVASLTHIL 77 (115) Q Consensus 1 ~~d~L--a~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~~~-~~vv~n~~~~ltHLL 77 (115) ||.-+ .+++.+.|+++.+++.+.+++++++++..+.++++..+|++||.|++||+......+ +.+|.+...+ +|++ T Consensus 1 Ma~~~~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~Y-A~~v 79 (137) T protein:vir:97 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEY-AIYV 79 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEeecCceEEEEecCCCc-cccc Confidence 88855 457889999999999999999999999999999999999999999999988755444 4555554332 8999 Q ss_pred ecceee---cCCc---------------------ccCccchhhhHHHHHHHHHHHHHH Q lcl|NC_012756. 78 ENGHLS---RNGG---------------------RVAGIVHIKPAEEKAIQNFEKRIK 111 (115) Q Consensus 78 E~GHak---r~GG---------------------rV~~~phI~paee~~~~~~~~~i~ 111 (115) ||||.. ++++ .+|++|||+||.+.....|.++|- T Consensus 80 E~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:97 80 NYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred ccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 999843 2211 269999999999999999999888 No 31 >protein:vir:94796 Length: 137 # NCBI annotation: ORF050 # Family: family:all:180 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240540;genbank:gi:66396237;genbank:GeneID:5133576 Probab=99.60 E-value=4.6e-18 Score=115.61 Aligned_cols=110 Identities=17% Similarity=0.130 Sum_probs=90.5 Q ss_pred CchH--HHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeecCc-eEEEeeCCcceeehe Q lcl|NC_012756. 1 MSND--LADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLANG-SFVVFNAVASLTHIL 77 (115) Q Consensus 1 ~~d~--La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~~~-~~vv~n~~~~ltHLL 77 (115) |+.= =.+++.+.|+++++++.+.++.++++++..+..++++.+|++||.|++||+.+....+ +.+|.+... .++++ T Consensus 1 Ma~~~~G~~~l~~~L~~~~~~~~~~~~~al~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~-YA~~v 79 (137) T protein:vir:94 1 MAKVKYGNWDLVKELENYERDIERWVKRGIAKTTVKIHNTIISLMPVDTGYLRESVTMDFKDGGFTGVINIGSE-YAIYV 79 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCceeEeecCcEEEEEecCCC-ccccc Confidence 8873 2457888899999999999999999999999999999999999999999988765544 445555433 38999 Q ss_pred eccee---ecCCc---------------------ccCccchhhhHHHHHHHHHHHHHH Q lcl|NC_012756. 78 ENGHL---SRNGG---------------------RVAGIVHIKPAEEKAIQNFEKRIK 111 (115) Q Consensus 78 E~GHa---kr~GG---------------------rV~~~phI~paee~~~~~~~~~i~ 111 (115) ||||. .++++ .++++|||+||.+...+.|.++|- T Consensus 80 E~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:94 80 NYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRVFFNKYFS 137 (137) T ss_pred ccCccccccCCCcccccccccceeccCCceeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 99954 33322 379999999999999999998888 No 32 >protein:vir:107099 Length: 137 # NCBI annotation: conserved phage protein # Family: family:all:180 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950610;genbank:gi:119953690;genbank:GeneID:4643108 Probab=99.60 E-value=5e-18 Score=115.42 Aligned_cols=110 Identities=17% Similarity=0.114 Sum_probs=90.3 Q ss_pred CchHH--HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeecCc-eEEEeeCCcceeehe Q lcl|NC_012756. 1 MSNDL--ADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLANG-SFVVFNAVASLTHIL 77 (115) Q Consensus 1 ~~d~L--a~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~~~-~~vv~n~~~~ltHLL 77 (115) ||.-. .+++.+.|+.+.+.+.+.++.++++++..+..++++.+|+|||.|++||+.+...++ ..+|.+...+ ++++ T Consensus 1 Ma~~~~Gl~~l~~~l~~~~~~~~~~~~~al~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~Y-a~~v 79 (137) T protein:vir:10 1 MAKVKYGNWELVKELEDFEKETIRWAKKGIAKTTTIIHNSIVSNMPVDTGYLRESVSMDFKKGGLTGVINIGSEY-AVYV 79 (137) T ss_pred CchhHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCeeEEeeCCcEEEEEecCCCc-cccc Confidence 98853 468889999999999999999999999999999999999999999999988755544 3455554332 8999 Q ss_pred ecceee---cCCc---------------------ccCccchhhhHHHHHHHHHHHHHH Q lcl|NC_012756. 78 ENGHLS---RNGG---------------------RVAGIVHIKPAEEKAIQNFEKRIK 111 (115) Q Consensus 78 E~GHak---r~GG---------------------rV~~~phI~paee~~~~~~~~~i~ 111 (115) |||+.. ++.+ .++++|||+||.+.....|.++|- T Consensus 80 E~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~i~k~i~ 137 (137) T protein:vir:10 80 NYGTGIYAVGPGGSRAKNIPWCYKDADGHWHTTKGQHAQPFWEPAIDEGRAFFNKYFS 137 (137) T ss_pred ccCccccccCCCccccccccceeeccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Confidence 999644 2221 268999999999999998888887 No 33 >protein:vir:105330 Length: 137 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950673;genbank:gi:119967843;genbank:GeneID:4643209 Probab=99.59 E-value=6.3e-18 Score=114.89 Aligned_cols=110 Identities=17% Similarity=0.114 Sum_probs=89.6 Q ss_pred CchHH--HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeecCc-eEEEeeCCcceeehe Q lcl|NC_012756. 1 MSNDL--ADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLANG-SFVVFNAVASLTHIL 77 (115) Q Consensus 1 ~~d~L--a~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~~~-~~vv~n~~~~ltHLL 77 (115) |+.-. .+++.+.|+.+++.+.+.+++++++++.++.+++++++|++||.|++||+.....++ +.+|.+...+ ++++ T Consensus 1 Ma~~~~G~~~l~~~l~~~~~~~~~~~~~al~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~V~~~~~Y-A~~v 79 (137) T protein:vir:10 1 MAKVKYGNWDLVKELEEFEKETIRWAKKGIAKTTTIIHNSIVSNMPVDTGYLRESVSMDFKKGGLTGVINIGSEY-AVYV 79 (137) T ss_pred CccchhCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCeeeEecCCcEEEEEecCCcc-cccc Confidence 88742 357888999999999999999999999999999999999999999999988755544 4455555333 8999 Q ss_pred ecceee---cCCcc---------------------cCccchhhhHHHHHHHHHHHHHH Q lcl|NC_012756. 78 ENGHLS---RNGGR---------------------VAGIVHIKPAEEKAIQNFEKRIK 111 (115) Q Consensus 78 E~GHak---r~GGr---------------------V~~~phI~paee~~~~~~~~~i~ 111 (115) |||+.. +++++ .+++|||+||.+...+.|+++|- T Consensus 80 E~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 137 (137) T protein:vir:10 80 NYGTGIYAVGPGGSRAKNIPWRYKDADGHWHTTKGQHAQPFWEPAIDEGRAFFNKYFS 137 (137) T ss_pred ccCccccccCCCcccccccceeeeccccccccCCCCCCCcchhHHHHHHHHHHHHhhC Confidence 999633 33221 78999999999999998888887 No 34 >protein:vir:94108 Length: 149 # NCBI annotation: ORF029 # Family: family:all:180 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240238;genbank:gi:66395914;genbank:GeneID:5133277 Probab=99.58 E-value=8.9e-18 Score=114.07 Aligned_cols=110 Identities=22% Similarity=0.214 Sum_probs=90.2 Q ss_pred CchHH--HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeecCc-eEEEeeCCcceeehe Q lcl|NC_012756. 1 MSNDL--ADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLANG-SFVVFNAVASLTHIL 77 (115) Q Consensus 1 ~~d~L--a~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~~~-~~vv~n~~~~ltHLL 77 (115) |++-. .+++.+.|+.+.+++.+.+++++++++..+..+++..+|++||.|++||+.+..+++ ..+|.+...+ ++++ T Consensus 13 Ma~~~~Gld~l~~~L~~~~~~~~~~~~~al~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~g~~~~V~~~~~Y-A~~V 91 (149) T protein:vir:94 13 MAKVKYGADSMVVELDKFDKKIEEWVKKGIAKTTTKIYNTAVALAPVDLGFLEESIDFKYFDGGLSSVISVGADY-AIYV 91 (149) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhhcCeeEEeeCCcEEEEEecCCCc-cccc Confidence 77743 357889999999999999999999999999999999999999999999998777665 3445554333 8999 Q ss_pred ecceeec---CCcc---------------------cCccchhhhHHHHHHHHHHHHHH Q lcl|NC_012756. 78 ENGHLSR---NGGR---------------------VAGIVHIKPAEEKAIQNFEKRIK 111 (115) Q Consensus 78 E~GHakr---~GGr---------------------V~~~phI~paee~~~~~~~~~i~ 111 (115) ||||... +.|+ .+++|||+||.+...+.|.++|- T Consensus 92 E~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~~~~i~~~i~ 149 (149) T protein:vir:94 92 EYGTGIYATGPGGSRATKIPWSFKGDDGEWYTTYGQAPQPFWNPAIDAGRKTFEQYFS 149 (149) T ss_pred ccCccccccCCCccccccccceeecCccceecCCCCCCCcchHHHHHHHHHHHHHhhC Confidence 9998542 2111 57999999999999988888887 No 35 >protein:vir:96121 Length: 137 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240082;genbank:gi:66395767;genbank:GeneID:5133101 Probab=99.56 E-value=2.4e-17 Score=111.67 Aligned_cols=110 Identities=21% Similarity=0.158 Sum_probs=90.2 Q ss_pred CchHH--HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeecCc-eEEEeeCCcceeehe Q lcl|NC_012756. 1 MSNDL--ADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLANG-SFVVFNAVASLTHIL 77 (115) Q Consensus 1 ~~d~L--a~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~~~-~~vv~n~~~~ltHLL 77 (115) ||.-. .+++.+.|+++.+.+.+.+++++++.+..+..++++.+|++||.|++||+.+...++ ..+|.+...+ ++++ T Consensus 1 Ma~~~~G~~~l~~~l~~~~~~~~~~~~~~l~~~a~~~~~~ak~~~pvdTG~L~~Si~~~~~~~g~~~~V~~~~~Y-A~yv 79 (137) T protein:vir:96 1 MAKVKYGNWDLVAELEDYRDEMEEWVKKGILKTTLAIYNTAVALAPVDLGFLKESIDFKVTDGGFSSVISVGAEY-AIYV 79 (137) T ss_pred CchhHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCccchhcCceeEeecCceEEEEecCCCc-cccc Confidence 88844 468888899999999999999999999999999999999999999999988755554 4455555333 8999 Q ss_pred ecceee---cCCc---------------------ccCccchhhhHHHHHHHHHHHHHH Q lcl|NC_012756. 78 ENGHLS---RNGG---------------------RVAGIVHIKPAEEKAIQNFEKRIK 111 (115) Q Consensus 78 E~GHak---r~GG---------------------rV~~~phI~paee~~~~~~~~~i~ 111 (115) ||||.. +++| -.+++|||+||.+.....|.++|- T Consensus 80 E~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~pFl~pA~~~~~~~i~k~i~ 137 (137) T protein:vir:96 80 EFGTGIYATGPGGSRARKLPWTYKGDDGEWHTTYGQQAQPFWNPAIDEGRKVFNRYFS 137 (137) T ss_pred ccCccccccCCCccccccccceeeccCcceeecCCCCCCcchhHHHHHHHHHHHHhhC Confidence 999643 2221 167999999999999998888887 No 36 >protein:vir:105916 Length: 149 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004379;genbank:gi:122891834;genbank:GeneID:4712387 Probab=99.56 E-value=2.3e-17 Score=111.78 Aligned_cols=110 Identities=22% Similarity=0.216 Sum_probs=90.2 Q ss_pred CchHH--HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeecCc-eEEEeeCCcceeehe Q lcl|NC_012756. 1 MSNDL--ADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLANG-SFVVFNAVASLTHIL 77 (115) Q Consensus 1 ~~d~L--a~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~~~-~~vv~n~~~~ltHLL 77 (115) |++-. .+++.+.|+.+++++.+.+++++++++..+..+++..+|++||.|++||+.+..+++ ..+|.+...+ ++++ T Consensus 13 Ma~v~~Gld~l~~~l~~~~~~~~~~~~~~l~~~a~~v~~~ak~~aPvdTG~L~~SI~~~~~~~g~~~~V~~~~~Y-A~~v 91 (149) T protein:vir:10 13 MAKVKYGADSMVVELDKFDKKIEEWVKKGIAKTTTKIYNTAVALAPVDLGFLEESIDFKYFDGGLSSVISVGADY-AIYV 91 (149) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhhccceEEecCCcEEEEEecCCCc-cccc Confidence 77742 347899999999999999999999999999999999999999999999998776655 3445554332 8999 Q ss_pred ecceeecC---Ccc---------------------cCccchhhhHHHHHHHHHHHHHH Q lcl|NC_012756. 78 ENGHLSRN---GGR---------------------VAGIVHIKPAEEKAIQNFEKRIK 111 (115) Q Consensus 78 E~GHakr~---GGr---------------------V~~~phI~paee~~~~~~~~~i~ 111 (115) ||||.... .|+ .+++|||+||.+...+.|.+.|- T Consensus 92 E~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~k~~i~~~i~ 149 (149) T protein:vir:10 92 EYGTGIYATGPGGSRATKIPWSFKGDDGEWYTTYGQAPQPFWNPAIDAGRKTFEQYFS 149 (149) T ss_pred ccCccccccCCcccccccccceeeccccceecCCCCCCCcchhHHHHHHHHHHHHhhC Confidence 99985422 111 57999999999999999988888 No 37 >protein:vir:96486 Length: 112 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238496;genbank:gi:66391772;genbank:GeneID:5176908 Probab=99.51 E-value=6.1e-17 Score=109.46 Aligned_cols=107 Identities=16% Similarity=0.209 Sum_probs=87.2 Q ss_pred Cch-HH--HHHHHHHHHhh--HHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeecCceEEEeeCCcceee Q lcl|NC_012756. 1 MSN-DL--ADLIAKELAAY--SDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLANGSFVVFNAVASLTH 75 (115) Q Consensus 1 ~~d-~L--a~~I~~~L~~y--~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~~~~~vv~n~~~~ltH 75 (115) ||+ ++ -+++.+.|++. .+.+...++++..+.+..+....+..+|++||.+++|++.+..+....|..+..| ++ T Consensus 1 Ma~i~i~Gld~L~~~l~~~~~~~~v~~~v~~~~~~~~~~~~~~a~~~apvdTG~Lr~sI~~~~~~~~~~v~~~~~Y--a~ 78 (112) T protein:vir:96 1 MATIEFEGLDEMAQSLLKNASSERRSKVLRKYGAKLKEAAVSKAQFKKGYSTGATRRSITLEAGSDRAVVEALTNY--SG 78 (112) T ss_pred CceeeehHHHHHHHHHHhhcCHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCchhhhhceeeecCceEEEecCCCCc--cc Confidence 764 22 34566667665 5678888999999999999999999999999999999987665554344333444 79 Q ss_pred heecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHh Q lcl|NC_012756. 76 ILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIG 114 (115) Q Consensus 76 LLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii 114 (115) +||||++ +++|+|||+||.+.....|.++|+++- T Consensus 79 ~vE~GTr-----~m~AqPF~~PA~~~~~~~~~~~l~~L~ 112 (112) T protein:vir:96 79 YLEVGTR-----KMEAQPFMRPALDQVVPEMVEEMAKWE 112 (112) T ss_pred eeccCcc-----ccCCCCchhhhHHHHHHHHHHHHHhcC Confidence 9999999 599999999999999999999999998 No 38 >protein:vir:94654 Length: 142 # NCBI annotation: tail component protein # Family: family:all:1084 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579211;genbank:gi:93007447;genbank:GeneID:5076773 Probab=99.47 E-value=2.6e-16 Score=106.03 Aligned_cols=113 Identities=19% Similarity=0.177 Sum_probs=87.7 Q ss_pred Cch---H-HHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeecCce---EEEeeCCcce Q lcl|NC_012756. 1 MSN---D-LADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLANGS---FVVFNAVASL 73 (115) Q Consensus 1 ~~d---~-La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~~~~---~vv~n~~~~l 73 (115) ||. . =.+++.+.|+.+.+.+...+++++++++..+..++++++|++||.|++||+.+...++. +.|.+... . T Consensus 1 Ma~~~~~~~~~~l~~~l~~~~~~~~~~~~~~l~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~g~~~~~~v~~~~~-Y 79 (142) T protein:vir:94 1 MAGLNYRVNSTEFQGALRAALDRLTGAAREATEAAANDMVNMAKGLCPVDTGRLRSSIQAVPSGGRFSFSVTIGTNVT-Y 79 (142) T ss_pred CceeEEEecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeccCCceEEEEEecCcc-c Confidence 332 1 13567778888899999999999999999999999999999999999999987766652 33444433 3 Q ss_pred eeheecceee---cC----------C----cc-----cCccchhhhHHHHHHHHHHHHHHHHh Q lcl|NC_012756. 74 THILENGHLS---RN----------G----GR-----VAGIVHIKPAEEKAIQNFEKRIKEIG 114 (115) Q Consensus 74 tHLLE~GHak---r~----------G----Gr-----V~~~phI~paee~~~~~~~~~i~~ii 114 (115) ++++||||.. +. + ++ .+++|||+||.+.....+.+.|++|= T Consensus 80 A~~vE~Gt~~~~i~pk~~k~l~~~~~~~~~~~v~~pG~~~~pfl~~A~~~~~~~i~~~~~~~~ 142 (142) T protein:vir:94 80 AADVEYGTAPHVIVPKDKKALYWPGAAHPVAKVNHPGTRAQPFMRPAIAAASTFLRNHAKGIR 142 (142) T ss_pred chhhhccCCCceeccCCCccceecccceeeeeeeecCCCCCcchhHHHHHHHHHHHHHHHhcC Confidence 8999999743 11 1 12 57999999999999888888887776 No 39 >protein:vir:96829 Length: 135 # NCBI annotation: ORF033 # Family: family:all:180 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240161;genbank:gi:66395838;genbank:GeneID:5133170 Probab=99.46 E-value=4.5e-16 Score=104.73 Aligned_cols=110 Identities=21% Similarity=0.126 Sum_probs=89.7 Q ss_pred CchHH--HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeecCc-eEEEeeCCcceeehe Q lcl|NC_012756. 1 MSNDL--ADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLANG-SFVVFNAVASLTHIL 77 (115) Q Consensus 1 ~~d~L--a~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~~~-~~vv~n~~~~ltHLL 77 (115) ||.-. .+++++.|+++.+.+.+.+++++.+++.++.++++.++|++||.|++||+.....++ ..+|.+...+ ++++ T Consensus 1 Ma~~~~Gl~~l~~~l~~~~~~~~~~~~~al~~~a~~v~~~ak~~apvdTG~Lr~SI~~~~~~~g~~~~V~~~~~Y-A~~v 79 (135) T protein:vir:96 1 MAKVKYGADSIVVDLEKYSKDMEKWVKKGITKTTLKIYNTAIHLMPVDTGFLRQSTTVDFENGGFTGVVKIGSNY-AVYV 79 (135) T ss_pred CchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeEEeecCcEEEEEecCCCc-cchh Confidence 88642 358889999999999999999999999999999999999999999999987655544 3445554333 7899 Q ss_pred ecceeecC-----------------C-----cccCccchhhhHHHHHHHHHHHHHH Q lcl|NC_012756. 78 ENGHLSRN-----------------G-----GRVAGIVHIKPAEEKAIQNFEKRIK 111 (115) Q Consensus 78 E~GHakr~-----------------G-----GrV~~~phI~paee~~~~~~~~~i~ 111 (115) ||||.... | -.+||+|||+||.+...+.|.+.|- T Consensus 80 e~GT~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~a~pfl~~A~~~~~~~~~~~i~ 135 (135) T protein:vir:96 80 NYGTGIYATKGSRAHKIPWTYKDPNGKWHTTYGQMPQPFWEPAIDAGRQTFEQYFS 135 (135) T ss_pred hcccccccCCCccccccccccccCCcceeecCCcCCCcchhHHHHHHHHHHHHhcC Confidence 99984421 1 1389999999999999888888877 No 40 >protein:vir:194 Length: 149 # NCBI annotation: Gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037704;genbank:gi:9634169;genbank:GeneID:1262536 Probab=99.41 E-value=1.3e-15 Score=102.24 Aligned_cols=110 Identities=18% Similarity=0.167 Sum_probs=83.1 Q ss_pred CchHHH--HHHHHHHHhhHHHHHH-HHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeec-----C----------- Q lcl|NC_012756. 1 MSNDLA--DLIAKELAAYSDEVTE-EVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLA-----N----------- 61 (115) Q Consensus 1 ~~d~La--~~I~~~L~~y~~~v~~-~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~-----~----------- 61 (115) |+=++. ++|++.|+..++.+.+ .++.++.+.|+.+.+++++++|++||.|+++.+..... . T Consensus 4 ~~~~i~Gl~~l~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~si~~~~~~~~~~~~~~~~v~~~~~~ 83 (149) T protein:vir:19 4 TSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIDRAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVN 83 (149) T ss_pred eeeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCchhhhhhccccccccccccceeecccccccc Confidence 333444 7889999999988874 56899999999999999999999999999997542211 0 Q ss_pred -------ceEEEee-CCcceeeheecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 62 -------GSFVVFN-AVASLTHILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 62 -------~~~vv~n-~~~~ltHLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) ..++..+ ...+.+|++|||+.+ .||+|||+|+.+...++..+.|.+.++ T Consensus 84 ~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~-----~~a~PF~~pA~~~~k~~~~~~~~~~l~ 140 (149) T protein:vir:19 84 PRTGNSDNTMKANNPRNAFYWRFVELGTAN-----MPAHPFVRPAYDTREEEAASVAIARMN 140 (149) T ss_pred cccccccceeecCCCCccceeeeeccCCCC-----CCCCcchhHHHHHHHHHHHHHHHHHHH Confidence 0111122 234568999999995 899999999999877766666665555 No 41 >protein:vir:80362 Length: 140 # NCBI annotation: gp10, phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111089;genbank:gi:134288660;genbank:GeneID:4960609 Probab=99.40 E-value=1.4e-15 Score=102.06 Aligned_cols=110 Identities=20% Similarity=0.167 Sum_probs=86.0 Q ss_pred Cch---HHHHHHHHHHHhhHHHHHH-HHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeec--C--ceE--EEe--- Q lcl|NC_012756. 1 MSN---DLADLIAKELAAYSDEVTE-EVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLA--N--GSF--VVF--- 67 (115) Q Consensus 1 ~~d---~La~~I~~~L~~y~~~v~~-~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~--~--~~~--vv~--- 67 (115) |++ +=-++|++.|+..++.+.+ .+++++.+.|+.+.+++++++|++||.|++++...... + ..+ .+. T Consensus 1 Ma~~~i~Gld~l~~~l~~l~~~~~~k~~~~a~~~~a~~v~~~ak~~aP~~tG~l~~~i~~~~~~~~~~~~~~~~~~~~~~ 80 (140) T protein:vir:80 1 MSSIQIVGLADLLADFERLAKSQSTKALRRATVAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGLATAGVRVRT 80 (140) T ss_pred CceeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhceeeeccccccccceeeeeeeccc Confidence 652 2236788889999887754 67999999999999999999999999999998654321 1 111 111 Q ss_pred ------eCCcceeeheecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 68 ------NAVASLTHILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 68 ------n~~~~ltHLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) ....+++|+||||+.. .||+|||+|+.+...+.+.+.|++.++ T Consensus 81 ~~~~~~~~~~~y~~f~E~GT~~-----~~a~PFl~pA~~~~~~~~~~~~~~~~~ 129 (140) T protein:vir:80 81 KGKADSPSNAFYWRFDEFGTQH-----MKAQPFMRPAFDASIGEAEGAIRTELA 129 (140) T ss_pred ccccCCCCCcceeeeeccCCCC-----CCCCcchhhhHHHHHHHHHHHHHHHHH Confidence 1235679999999994 999999999999998888888888776 No 42 >protein:vir:1437 Length: 140 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536366;genbank:gi:17975171;genbank:GeneID:929147 Probab=99.40 E-value=1.6e-15 Score=101.70 Aligned_cols=110 Identities=19% Similarity=0.154 Sum_probs=84.4 Q ss_pred Cch---HHHHHHHHHHHhhHHHHHH-HHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceee--cCce--E--EE---- Q lcl|NC_012756. 1 MSN---DLADLIAKELAAYSDEVTE-EVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKL--ANGS--F--VV---- 66 (115) Q Consensus 1 ~~d---~La~~I~~~L~~y~~~v~~-~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~--~~~~--~--vv---- 66 (115) |++ +=-++|.+.|+..++.+.. .+++++.+.|+.+.+++++++|++||.|++++..+.. .++. + .+ T Consensus 1 M~~~~i~Gld~l~~~l~~l~~~~~~~~~~~al~~~a~~v~~~ak~~aP~~tG~l~~sI~~~~~~~~~~~~~~~vg~~~~~ 80 (140) T protein:vir:14 1 MSSIQIIGLADLRADFEKLAKSQSAKALRRATLAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGLATAGVRVRT 80 (140) T ss_pred CceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHhhcccccccccccceeEEeeeeecc Confidence 652 2226788889999888765 5699999999999999999999999999999766432 2221 1 11 Q ss_pred -----eeCCcceeeheecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 67 -----FNAVASLTHILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 67 -----~n~~~~ltHLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) .....+.+|+||||+. +.||+|||+|+.+.+.+++.+.+.+.++ T Consensus 81 ~~~~~~~~~~~y~~f~E~GT~-----~~~a~pFl~pa~~~~~~~~~~~~~~~~~ 129 (140) T protein:vir:14 81 KGKADSPNNAFYWRFDEFGTQ-----HMKAQPFMRPAFDASIGEAEGAIRTELA 129 (140) T ss_pred ccccCCCCccceeeeeccccC-----CCCCCcchhHHHHHHHHHHHHHHHHHHH Confidence 1123567899999999 4999999999999887777777776665 No 43 >protein:vir:105007 Length: 146 # NCBI annotation: conserved phage protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459972;genbank:gi:85701387;genbank:GeneID:3882148 Probab=99.39 E-value=1.7e-15 Score=101.50 Aligned_cols=110 Identities=15% Similarity=0.137 Sum_probs=84.7 Q ss_pred CchHHH------HHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhc--eee------------- Q lcl|NC_012756. 1 MSNDLA------DLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKK--KKL------------- 59 (115) Q Consensus 1 ~~d~La------~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~--kk~------------- 59 (115) |||.+. +++.+.|++..+++.+.+++++.+.|+.+.++++.++|+++|.++++-.. +.. T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ 80 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAK 80 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceecccc Confidence 998754 88899999999999999999999999999999999999999998876321 110 Q ss_pred ---cCceEEE-e----eCCcceeeheecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 60 ---ANGSFVV-F----NAVASLTHILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 60 ---~~~~~vv-~----n~~~~ltHLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) +..++.| . +..++.+|+||||+.+ .|++|||+|+.+...+...+.|.+.+. T Consensus 81 ~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT~~-----~~a~PFl~pa~~~~k~~~~~~~~~~l~ 139 (146) T protein:vir:10 81 LEGGIKTVKIGLNKADRSPWFYLKFHEWGTSK-----MPAHPFIEPGFNASKAEAVRAMTDILK 139 (146) T ss_pred ccccceeEEeeeccCCCCCcceeeeeccCCCC-----CCCCcchhHHHHHhHHHHHHHHHHHHH Confidence 1111222 1 2346789999999994 899999999999877666655555555 No 44 >protein:vir:102875 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338140;genbank:gi:77020200;genbank:GeneID:3703784 Probab=99.39 E-value=1.7e-15 Score=101.50 Aligned_cols=110 Identities=15% Similarity=0.137 Sum_probs=84.7 Q ss_pred CchHHH------HHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhc--eee------------- Q lcl|NC_012756. 1 MSNDLA------DLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKK--KKL------------- 59 (115) Q Consensus 1 ~~d~La------~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~--kk~------------- 59 (115) |||.+. +++.+.|++..+++.+.+++++.+.|+.+.++++.++|+++|.++++-.. +.. T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ 80 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAK 80 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceecccc Confidence 998754 88899999999999999999999999999999999999999998876321 110 Q ss_pred ---cCceEEE-e----eCCcceeeheecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 60 ---ANGSFVV-F----NAVASLTHILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 60 ---~~~~~vv-~----n~~~~ltHLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) +..++.| . +..++.+|+||||+.+ .|++|||+|+.+...+...+.|.+.+. T Consensus 81 ~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT~~-----~~a~PFl~pa~~~~k~~~~~~~~~~l~ 139 (146) T protein:vir:10 81 LEGGIKTVKIGLNKADRSPWFYLKFHEWGTSK-----MPAHPFIEPGFNASKAEAVRAMTDILK 139 (146) T ss_pred ccccceeEEeeeccCCCCCcceeeeeccCCCC-----CCCCcchhHHHHHhHHHHHHHHHHHHH Confidence 1111222 1 2346789999999994 899999999999877666655555555 No 45 >protein:vir:107568 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338191;genbank:gi:77020147;genbank:GeneID:3703699 Probab=99.39 E-value=1.7e-15 Score=101.50 Aligned_cols=110 Identities=15% Similarity=0.137 Sum_probs=84.7 Q ss_pred CchHHH------HHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhc--eee------------- Q lcl|NC_012756. 1 MSNDLA------DLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKK--KKL------------- 59 (115) Q Consensus 1 ~~d~La------~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~--kk~------------- 59 (115) |||.+. +++.+.|++..+++.+.+++++.+.|+.+.++++.++|+++|.++++-.. +.. T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ 80 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAK 80 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceecccc Confidence 998754 88899999999999999999999999999999999999999998876321 110 Q ss_pred ---cCceEEE-e----eCCcceeeheecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 60 ---ANGSFVV-F----NAVASLTHILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 60 ---~~~~~vv-~----n~~~~ltHLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) +..++.| . +..++.+|+||||+.+ .|++|||+|+.+...+...+.|.+.+. T Consensus 81 ~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT~~-----~~a~PFl~pa~~~~k~~~~~~~~~~l~ 139 (146) T protein:vir:10 81 LEGGIKTVKIGLNKADRSPWFYLKFHEWGTSK-----MPAHPFIEPGFNASKAEAVRAMTDILK 139 (146) T ss_pred ccccceeEEeeeccCCCCCcceeeeeccCCCC-----CCCCcchhHHHHHhHHHHHHHHHHHHH Confidence 1111222 1 2346789999999994 899999999999877666655555555 No 46 >protein:vir:102085 Length: 146 # NCBI annotation: head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512318;genbank:gi:89152487;genbank:GeneID:3953078 Probab=99.39 E-value=1.7e-15 Score=101.50 Aligned_cols=110 Identities=15% Similarity=0.137 Sum_probs=84.7 Q ss_pred CchHHH------HHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhc--eee------------- Q lcl|NC_012756. 1 MSNDLA------DLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKK--KKL------------- 59 (115) Q Consensus 1 ~~d~La------~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~--kk~------------- 59 (115) |||.+. +++.+.|++..+++.+.+++++.+.|+.+.++++.++|+++|.++++-.. +.. T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ 80 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAK 80 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceecccc Confidence 998754 88899999999999999999999999999999999999999998876321 110 Q ss_pred ---cCceEEE-e----eCCcceeeheecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 60 ---ANGSFVV-F----NAVASLTHILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 60 ---~~~~~vv-~----n~~~~ltHLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) +..++.| . +..++.+|+||||+.+ .|++|||+|+.+...+...+.|.+.+. T Consensus 81 ~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT~~-----~~a~PFl~pa~~~~k~~~~~~~~~~l~ 139 (146) T protein:vir:10 81 LEGGIKTVKIGLNKADRSPWFYLKFHEWGTSK-----MPAHPFIEPGFNASKAEAVRAMTDILK 139 (146) T ss_pred ccccceeEEeeeccCCCCCcceeeeeccCCCC-----CCCCcchhHHHHHhHHHHHHHHHHHHH Confidence 1111222 1 2346789999999994 899999999999877666655555555 No 47 >protein:vir:100075 Length: 140 # NCBI annotation: gp9 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945039;genbank:gi:38707899;genbank:GeneID:2744122 Probab=99.39 E-value=1.7e-15 Score=101.50 Aligned_cols=110 Identities=20% Similarity=0.170 Sum_probs=84.8 Q ss_pred Cch---HHHHHHHHHHHhhHHHHH-HHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeec--C--ceE--EE---- Q lcl|NC_012756. 1 MSN---DLADLIAKELAAYSDEVT-EEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLA--N--GSF--VV---- 66 (115) Q Consensus 1 ~~d---~La~~I~~~L~~y~~~v~-~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~--~--~~~--vv---- 66 (115) |++ +=-+++.+.|+...+.+. +.+++++.+.++.+.+++++++|++||.|+++....... + ..+ .+ T Consensus 1 Ma~~~i~Gld~l~~~l~~L~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~tG~l~~sI~~~~~~~~~~~~~~~~g~~~~~ 80 (140) T protein:vir:10 1 MSSIQIIGLADLRADFEKLAKSQSTKALRRATVAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGLATAGVRVRT 80 (140) T ss_pred CceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHHhccccccccccccceEEeeeeecc Confidence 662 112678888888888875 568999999999999999999999999999998654322 1 111 11 Q ss_pred -----eeCCcceeeheecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 67 -----FNAVASLTHILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 67 -----~n~~~~ltHLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) .....+.+|+|||||. ++||+|||+||.+...+++.+.+++.++ T Consensus 81 ~~~~~~~~~~~y~~f~E~GT~-----~~~a~PFl~pA~~~~~~~~~~~~~~~~~ 129 (140) T protein:vir:10 81 KGKADSPNNAFYWRFDEFGTQ-----HMKAQPFMRPAFDASIGEAEGAIRTELA 129 (140) T ss_pred ccccCCCCccceeeeeccCCC-----CCCCCcchhhhHHHHHHHHHHHHHHHHH Confidence 1124567999999999 5999999999999888877777776665 No 48 >protein:vir:1273 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690765;genbank:gi:22855005;genbank:GeneID:955232 Probab=99.39 E-value=2.1e-15 Score=101.05 Aligned_cols=110 Identities=12% Similarity=0.087 Sum_probs=89.3 Q ss_pred Cch---HHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCcc---chhhhcchhceee---cCc--eEEE-ee Q lcl|NC_012756. 1 MSN---DLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKR---YGKYRRSWKKKKL---ANG--SFVV-FN 68 (115) Q Consensus 1 ~~d---~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~---TG~y~k~W~~kk~---~~~--~~vv-~n 68 (115) |++ +=-++|.+.|++....+.+.++++++..|+.+.+++++++|++ ||.++++-...+. +++ ++.| ++ T Consensus 1 M~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~~k~~ap~~~~~tg~l~~~I~~~~~k~~~~g~~~v~Vg~~ 80 (127) T protein:vir:12 1 MADMSFDGIDDLTQYFEKIGGDIEKVEPVALKAGGEIIAERQRSHVNRSDKKQPHMQDNITVSNVRESKDGVRFVAVGPN 80 (127) T ss_pred CeeeeehhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCChhHHHHhhhccccccccCceeEEEEeeC Confidence 554 2227888888889999999999999999999999999999964 8999999965432 333 3333 33 Q ss_pred C-CcceeeheecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 69 A-VASLTHILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 69 ~-~~~ltHLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) + ..+.+|+||||+.+ .||+|||+|+.+...+++.+.+++.++ T Consensus 81 ~~~~~y~~f~E~GT~~-----~~a~Pf~~pa~~~~~~~~~~~~~~~~~ 123 (127) T protein:vir:12 81 KKVAYRGRFLEWGTSK-----MPPQPFIEKGGKEGEGPAVELMERILT 123 (127) T ss_pred CCCcceeeeeccCccC-----CCCCccchHhHHHHHHHHHHHHHHHHH Confidence 3 56779999999995 899999999999988888888888877 No 49 >protein:vir:101594 Length: 173 # NCBI annotation: hypothetical protein # Family: family:all:26502 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112510;genbank:gi:53793610;interpro:IPR010064;uniprot:Q5ZGE3;genbank:GeneID:3101702 Probab=99.38 E-value=4e-15 Score=99.52 Aligned_cols=114 Identities=17% Similarity=0.197 Sum_probs=92.3 Q ss_pred CchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeecCce-E--EEeeCCcceeehe Q lcl|NC_012756. 1 MSNDLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLANGS-F--VVFNAVASLTHIL 77 (115) Q Consensus 1 ~~d~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~~~~-~--vv~n~~~~ltHLL 77 (115) |.=+=.++|++.|+.+++.+.+.+++++.+.++.+.+++++++|++||.|++|+.......+. + .+++.. ..+.++ T Consensus 1 i~i~Gld~L~~~L~~l~~~~~~~~~~a~~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~~~~~~~~~~~v~~~~-~Ya~fv 79 (173) T protein:vir:10 1 MAVKGVAEVIAELRKIGKDIDKNINATTEEAANFIEDRAKTLAPKNFGKLAQSISTSDLKAKDLISKKITVNE-LYGAYM 79 (173) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCchhhhhcceeeeeccCceeEEeeCCCc-ccchhh Confidence 666667888999999999999999999999999999999999999999999999776654432 2 233333 338899 Q ss_pred ecceeec------------CC------------------------------c--------ccCccchhhhHHHHHHHHHH Q lcl|NC_012756. 78 ENGHLSR------------NG------------------------------G--------RVAGIVHIKPAEEKAIQNFE 107 (115) Q Consensus 78 E~GHakr------------~G------------------------------G--------rV~~~phI~paee~~~~~~~ 107 (115) |||..+- ++ | =.++|||++||.+...+.+. T Consensus 80 EfGT~~m~a~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~G~~aqPFl~PA~~~~~~~~~ 159 (173) T protein:vir:10 80 EFGTGAKVSVPKEFADMAASFKGQKTGSFKDGLESIKAWCRAKGIDEKAAYPIFAKILGAGINPQPFLYPAWIEGKKQYL 159 (173) T ss_pred hcccccccCCCchhhhhhcccccccccccccccccccccccccccchhcccceeeEeecCCCCCCccchhHHHHhHHHHH Confidence 9997531 00 0 17899999999999999999 Q ss_pred HHHHHHhC Q lcl|NC_012756. 108 KRIKEIGK 115 (115) Q Consensus 108 ~~i~~ii~ 115 (115) ++|++.|+ T Consensus 160 ~~i~~~i~ 167 (173) T protein:vir:10 160 KDLENLLK 167 (173) T ss_pred HHHHHHHH Confidence 99998888 No 50 >protein:vir:100243 Length: 140 # NCBI annotation: gp72 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355408;genbank:gi:77864698;genbank:GeneID:3725965 Probab=99.37 E-value=4.1e-15 Score=99.48 Aligned_cols=110 Identities=19% Similarity=0.150 Sum_probs=83.5 Q ss_pred Cch---HHHHHHHHHHHhhHHHHH-HHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhce----eecCceE--EEe--- Q lcl|NC_012756. 1 MSN---DLADLIAKELAAYSDEVT-EEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKK----KLANGSF--VVF--- 67 (115) Q Consensus 1 ~~d---~La~~I~~~L~~y~~~v~-~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~k----k~~~~~~--vv~--- 67 (115) ||+ +=-++|.+.|+..++.+. +.+++++.+.|+.+.++++.++|++||.++++-..+ +.+.+.. .+. T Consensus 1 Ma~~~i~Gld~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~~~~~~~~ 80 (140) T protein:vir:10 1 MSSVQILGLADLQADFLKLAKAQSTKALRRATVAGANVIRDEARARAPKKTGKLKRNIVTAALKQKDSPGIATAGVRVRT 80 (140) T ss_pred CceeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHHhceecccccccccceeEEeecccc Confidence 652 223678888888888775 567999999999999999999999999999986442 2222222 221 Q ss_pred ------eCCcceeeheecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 68 ------NAVASLTHILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 68 ------n~~~~ltHLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) ....+.+|+||||+. +.||+|||+||.+...+++.+.|.+.++ T Consensus 81 ~~~~~~~~~~~y~~f~E~GT~-----~~~a~PFl~pA~~~~~~~~~~~~~~~~~ 129 (140) T protein:vir:10 81 KGKADSPNNAFYWRFVELGTQ-----FMKAEPFMRPAFDASIAQAEGAIRTEIA 129 (140) T ss_pred ccccCCCCcccccceeccCcC-----CCCCCcchhhhHHHHHHHHHHHHHHHHH Confidence 123456899999998 5899999999999888877777777665 No 51 >protein:vir:93617 Length: 148 # NCBI annotation: putative structural component # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449299;genbank:gi:157166047;interpro:IPR010064;interpro:IPR011693;uniprot:Q6H9U2;genbank:GeneID:5580439 Probab=99.28 E-value=2.2e-14 Score=95.44 Aligned_cols=110 Identities=16% Similarity=0.128 Sum_probs=79.7 Q ss_pred CchHHH--HHHHHHHHhhHHHHH-HHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhce----eecCce---------- Q lcl|NC_012756. 1 MSNDLA--DLIAKELAAYSDEVT-EEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKK----KLANGS---------- 63 (115) Q Consensus 1 ~~d~La--~~I~~~L~~y~~~v~-~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~k----k~~~~~---------- 63 (115) |+=++. ++|.+.|++.++.+. +.++.++.+.|+.+.++++.++|++||.++++-... ..+... T Consensus 4 ~~~~i~Gldel~~~l~~L~~~~~~~~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~g~~~~~v~~~~~~~ 83 (148) T protein:vir:93 4 TLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRRSRDGGMESGVHIRGVNP 83 (148) T ss_pred eeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcchhhhhceeccccccCCceeeeeeeccccc Confidence 333344 788888888887765 567899999999999999999999999999984322 122111 Q ss_pred --------EEEee-CCcceeeheecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 64 --------FVVFN-AVASLTHILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 64 --------~vv~n-~~~~ltHLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) +...+ ..++.+|+||||-.+ .|++|||+||.+...++..+.|.+.++ T Consensus 84 ~~~~~~~~~~~~~~~~~~y~~f~E~GT~~-----~pa~PFl~pA~~~~k~~~~~~~~~~~~ 139 (148) T protein:vir:93 84 DTGNSDNTMKADNPRNAFYWRFVEMGTVN-----MPPHPFVRPAFDVRSEQAAQVAIARMN 139 (148) T ss_pred ccccccceeecCCCCCcceeeeeccCCCC-----CCCCcchhHHHHHhHHHHHHHHHHHHH Confidence 11122 245679999999985 999999999998766655555544444 No 52 >protein:vir:9708 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795470;genbank:gi:28876221;genbank:GeneID:1257765 Probab=99.28 E-value=4.4e-14 Score=93.83 Aligned_cols=109 Identities=14% Similarity=0.111 Sum_probs=87.0 Q ss_pred CchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchh----hhcchhceee-----cCceEEE-eeC- Q lcl|NC_012756. 1 MSNDLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGK----YRRSWKKKKL-----ANGSFVV-FNA- 69 (115) Q Consensus 1 ~~d~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~----y~k~W~~kk~-----~~~~~vv-~n~- 69 (115) |.+-|.+- .+.|++....+.+..++++.+.|+.+.+++++++|+++|. ++++-..... +..++.| +.+ T Consensus 1 mv~Gl~el-~~~l~~l~~~~~~~~~~al~~ga~~~~~~~k~~ap~~~~~~~~hl~d~I~~~~~k~~~~g~~~~~VG~~k~ 79 (125) T protein:vir:97 1 MTKGLDEI-LANLTKLEVKAPKTAKAAVTEVAKEFEKALKANTPVYEVETDERLQEDTVISGFKGANVGIVSKEIGYGKA 79 (125) T ss_pred CchhHHHH-HHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCCchhhHHhhhhcccccccccCceEEEEeecCC Confidence 98888654 4558888999999999999999999999999999999887 6677655332 2223323 444 Q ss_pred CcceeeheecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 70 VASLTHILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 70 ~~~ltHLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) +++.+|+||||+.+ .||+|||+|+.+.+.++..+.+.+.++ T Consensus 80 ~~~y~~f~E~GT~k-----~~~~pF~~pa~~~~k~~~~~~~~~~~~ 120 (125) T protein:vir:97 80 TGWRAHYPNDGTIY-----QRGQDFKERTINQMTPKAKQLYAEKVK 120 (125) T ss_pred CceeEeeeccCccC-----CCcCccchHhHHHhHHHHHHHHHHHHH Confidence 66789999999995 899999999999888777777777666 No 53 >protein:vir:1243 Length: 116 # NCBI annotation: similar to phage Spp1 gp16.1 # Family: family:all:180 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510942;genbank:gi:17426276;genbank:GeneID:927389 Probab=99.27 E-value=1.2e-14 Score=96.83 Aligned_cols=91 Identities=15% Similarity=0.098 Sum_probs=73.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeecCc-eEEEeeCCcceeeheecceee---cCCc-------- Q lcl|NC_012756. 20 VTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLANG-SFVVFNAVASLTHILENGHLS---RNGG-------- 87 (115) Q Consensus 20 v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~~~-~~vv~n~~~~ltHLLE~GHak---r~GG-------- 87 (115) +++.+++++.+++.++.+++++++|++||.|++||+.+...++ +.+|++...+ +.++|||+.. ++.| T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~V~~~~~Y-A~yvE~GTg~~~~~~~~~~~~~~~~ 79 (116) T protein:vir:12 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDGGFTGVINIGSEY-AIYVNYGTGIYATGAGGSRAKKIPW 79 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcCcccccccceEEeecCcEEEEEecCCCc-ccccccCCcccccCCCcccccccce Confidence 8899999999999999999999999999999999988765544 4455554333 7889999433 3322 Q ss_pred -------------ccCccchhhhHHHHHHHHHHHHHH Q lcl|NC_012756. 88 -------------RVAGIVHIKPAEEKAIQNFEKRIK 111 (115) Q Consensus 88 -------------rV~~~phI~paee~~~~~~~~~i~ 111 (115) .++++|||+||.+.....|.++|- T Consensus 80 ~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 116 (116) T protein:vir:12 80 SYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred eeecCCceeeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 389999999999999998888777 No 54 >protein:vir:97327 Length: 116 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240615;genbank:gi:66396305;genbank:GeneID:5133683 Probab=99.27 E-value=1.2e-14 Score=96.83 Aligned_cols=91 Identities=15% Similarity=0.098 Sum_probs=73.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeecCc-eEEEeeCCcceeeheecceee---cCCc-------- Q lcl|NC_012756. 20 VTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLANG-SFVVFNAVASLTHILENGHLS---RNGG-------- 87 (115) Q Consensus 20 v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~~~-~~vv~n~~~~ltHLLE~GHak---r~GG-------- 87 (115) +++.+++++.+++.++.+++++++|++||.|++||+.+...++ +.+|++...+ +.++|||+.. ++.| T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~V~~~~~Y-A~yvE~GTg~~~~~~~~~~~~~~~~ 79 (116) T protein:vir:97 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDGGFTGVINIGSEY-AIYVNYGTGIYATGAGGSRAKKIPW 79 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcCcccccccceEEeecCcEEEEEecCCCc-ccccccCCcccccCCCcccccccce Confidence 8899999999999999999999999999999999988765544 4455554333 7889999433 3322 Q ss_pred -------------ccCccchhhhHHHHHHHHHHHHHH Q lcl|NC_012756. 88 -------------RVAGIVHIKPAEEKAIQNFEKRIK 111 (115) Q Consensus 88 -------------rV~~~phI~paee~~~~~~~~~i~ 111 (115) .++++|||+||.+.....|.++|- T Consensus 80 ~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 116 (116) T protein:vir:97 80 SYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred eeecCCceeeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 389999999999999998888777 No 55 >protein:vir:95062 Length: 116 # NCBI annotation: ORF044 # Family: family:all:180 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240827;genbank:gi:66394711;genbank:GeneID:5133856 Probab=99.27 E-value=1.4e-14 Score=96.58 Aligned_cols=91 Identities=15% Similarity=0.098 Sum_probs=74.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeecCc-eEEEeeCCcceeeheecceee---cCCc-------- Q lcl|NC_012756. 20 VTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLANG-SFVVFNAVASLTHILENGHLS---RNGG-------- 87 (115) Q Consensus 20 v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~~~-~~vv~n~~~~ltHLLE~GHak---r~GG-------- 87 (115) +++.+++++++++..+..++++.+|++||.|++||+.+...++ +.+|++...+ +.++|||+.. ++.| T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~apv~TG~Lr~SI~~~~~~~~~~~~V~~~~~Y-a~yvE~GTg~~~~~~~~~~~~~~~~ 79 (116) T protein:vir:95 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDGGFTGVINIGSEY-AIYVNYGTGIYATGAGGSRAKNIPW 79 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHhhCCccccccccceeEEeecCcEEEEEecCCCc-cceeecCccccccCCCccccccccc Confidence 8899999999999999999999999999999999987765544 4556555333 7889999543 3322 Q ss_pred -------------ccCccchhhhHHHHHHHHHHHHHH Q lcl|NC_012756. 88 -------------RVAGIVHIKPAEEKAIQNFEKRIK 111 (115) Q Consensus 88 -------------rV~~~phI~paee~~~~~~~~~i~ 111 (115) ..+++|||+||.+.....|.++|- T Consensus 80 ~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~is 116 (116) T protein:vir:95 80 SYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred eeecCccceeeCCCCCCCcchHHHHHHHHHHHHHhhC Confidence 279999999999999998888888 No 56 >protein:vir:3873 Length: 128 # NCBI annotation: putative head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680490;swissprot:trembl:p94214;genbank:gi:22296530;interpro:IPR010064;uniprot:P94214;genbank:GeneID:951688 Probab=99.27 E-value=4.5e-14 Score=93.78 Aligned_cols=110 Identities=16% Similarity=0.131 Sum_probs=87.4 Q ss_pred CchHHH--HHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhc------chhceee--cCc--eEEE-e Q lcl|NC_012756. 1 MSNDLA--DLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRR------SWKKKKL--ANG--SFVV-F 67 (115) Q Consensus 1 ~~d~La--~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k------~W~~kk~--~~~--~~vv-~ 67 (115) ||-++. +++.+.|++....+.+...+++.+.|+.+.++++.++|+++|.+++ +-...+. .++ ++.| + T Consensus 1 m~v~i~Gl~el~~~l~~l~~~~~k~~~~al~~ga~~~~~~~k~~ap~~~~~~~~~~h~~d~I~~~~~k~~~g~~~~~VG~ 80 (128) T protein:vir:38 1 MGVKVTGDAELLANLNKLQFGVAKEARAAVRDGAQKFADKLKSNTPEWDGETDMSGHLRDDIKLSSVRETSGLTEVDVGY 80 (128) T ss_pred CccchhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCCCcccchhhhhhccccccccCceeEEEeee Confidence 777665 5888889999999999999999999999999999999998887544 3333221 122 2222 3 Q ss_pred eC-CcceeeheecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 68 NA-VASLTHILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 68 n~-~~~ltHLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) ++ ++..+|++|||+++ .+|+|||+|+.+.+.+++.+.+.+.++ T Consensus 81 ~k~~~~y~~f~E~GT~k-----~~a~pF~~pa~~~~~~~~~~~~~~~l~ 124 (128) T protein:vir:38 81 GKDTGWRAHFPNSGTSM-----QDPQHFIEETQEIMRPVVIAAFLSHLK 124 (128) T ss_pred cCCCceEEeeeccCccC-----CCCCcchhHHHHHhHHHHHHHHHHHHH Confidence 33 56679999999995 899999999999999888888888887 No 57 >protein:vir:78077 Length: 141 # NCBI annotation: gp9 # Family: family:all:180 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468793;genbank:gi:157325374;genbank:GeneID:5601839 Probab=99.26 E-value=8.6e-14 Score=92.21 Aligned_cols=113 Identities=15% Similarity=0.137 Sum_probs=81.7 Q ss_pred Cch-HHHHHHHHHHHhhHHHHHHHHHHH-HHHHHHHHHHHHHHhCCccchhhhcchhceeec-CceEEEeeC-Ccceeeh Q lcl|NC_012756. 1 MSN-DLADLIAKELAAYSDEVTEEVDKI-AEQVADETVDELKETSPKRYGKYRRSWKKKKLA-NGSFVVFNA-VASLTHI 76 (115) Q Consensus 1 ~~d-~La~~I~~~L~~y~~~v~~~~~~~-~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~-~~~~vv~n~-~~~ltHL 76 (115) |.| ++.+-..+.+..+.+.+.+.+++. ...++..+...++.++|+|||.|++||..+... +..++|++. .| |-. T Consensus 1 ~~~~~f~~~~~~~~~~~~k~~~~~~~~~a~~~~~~~ie~~ak~~~pvdtG~L~~SI~~~v~~~g~~~~V~~~~~Y--A~y 78 (141) T protein:vir:78 1 MNEFEFDSNIPKARKLIEKKVLQALEDIGEHMTTELAEGGHGVTSNNDTGEYAQKSGYKVRKSSKEVIVGNSSDY--AIY 78 (141) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccchhhcceeeeeecCCcEEEEecCCCc--cce Confidence 544 455666666666666666666664 333444456678889999999999999887543 334555554 44 778 Q ss_pred eecc---eeecCCcc------------------cCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 77 LENG---HLSRNGGR------------------VAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 77 LE~G---Hakr~GGr------------------V~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) +|+| |+.+++|| .+|+|||+||.+....++.+.|++.++ T Consensus 79 VE~GTG~~~~~~~grk~~w~y~~~~g~~~~t~G~~aqpFl~~A~~~~~~~i~~~i~~~~~ 138 (141) T protein:vir:78 79 YEFGTGEKSERGGGKAGGWFYMDKKGHWHFTRGSQASKRMRYTFRDEQDKVRVFTERALR 138 (141) T ss_pred eecCCcccccCCCCCcCcceeecCCCeeEeccCCCCchhhhhhHHhhHHHHHHHHHHHhh Confidence 9999 44443331 589999999999999999999999999 No 58 >protein:vir:105089 Length: 133 # NCBI annotation: Gp11 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006591;genbank:gi:46402097;genbank:GeneID:2777955 Probab=99.25 E-value=5e-14 Score=93.51 Aligned_cols=110 Identities=15% Similarity=0.165 Sum_probs=79.1 Q ss_pred CchHHH--HHHHHHHHhhHHHHHH-HHHHHHHHHHHHHHHHHHHhCCccchh----hhcchhce----eec-CceEEEe- Q lcl|NC_012756. 1 MSNDLA--DLIAKELAAYSDEVTE-EVDKIAEQVADETVDELKETSPKRYGK----YRRSWKKK----KLA-NGSFVVF- 67 (115) Q Consensus 1 ~~d~La--~~I~~~L~~y~~~v~~-~~~~~~~~~a~~~~~~lk~~sP~~TG~----y~k~W~~k----k~~-~~~~vv~- 67 (115) |+=++. +++.+.|++.+..+.+ .++.++.+.|+.+.++++.++|++||. ++++-..+ ... .+.+.|. T Consensus 2 ~~~~i~Gl~el~~~l~~L~~~~~~k~~~~Al~~~a~~i~~~ak~~ap~~~~~~~~~~~~~I~v~~~~~~~~~~~~~~v~v 81 (133) T protein:vir:10 2 IRMEVKGLDELERQLTALGEKVATKVLRDAGREALKVVEEDMKQHAGFDETSTGQHMRDSIKIRSSTRKAQGNAVVTLRV 81 (133) T ss_pred eeEeeehHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCcchhhhhhcccccccccccCccceEEEEe Confidence 333333 6788888888888755 558999999999999999999999987 56565432 111 2222222 Q ss_pred --e-CCcceeeheecceeecCCcccCccchhhhHHHHHHH----HHHHHHHHHhC Q lcl|NC_012756. 68 --N-AVASLTHILENGHLSRNGGRVAGIVHIKPAEEKAIQ----NFEKRIKEIGK 115 (115) Q Consensus 68 --n-~~~~ltHLLE~GHakr~GGrV~~~phI~paee~~~~----~~~~~i~~ii~ 115 (115) + ..|+.+|+||||+++ .|++|||+||.+...+ .|.+.+++.|+ T Consensus 82 g~~~~~~~y~~f~E~GT~k-----~~a~PF~~pA~~~~~~~~~~~~~~~~~~~l~ 131 (133) T protein:vir:10 82 GPSKQHHMKVLAQEFGTVK-----QVADPFIRPALDYNVQTVLRVLTVEIRNGIQ 131 (133) T ss_pred cCCCCccceEeeeccCCCC-----CCCCccchHHHHHhHHHHHHHHHHHHHHHhh Confidence 2 356789999999995 8999999999996555 55555555555 No 59 >protein:vir:98342 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918934;genbank:gi:119443696;genbank:GeneID:4594504 Probab=99.23 E-value=6.1e-14 Score=93.05 Aligned_cols=110 Identities=18% Similarity=0.167 Sum_probs=87.2 Q ss_pred CchHH-HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchh--hhcchhceeec----Cc--eE-EEeeC- Q lcl|NC_012756. 1 MSNDL-ADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGK--YRRSWKKKKLA----NG--SF-VVFNA- 69 (115) Q Consensus 1 ~~d~L-a~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~--y~k~W~~kk~~----~~--~~-vv~n~- 69 (115) ||-++ .++|.+.|+.....+....+.++++.|+.+.+.|++++|+++|. +++++...... .+ ++ |=+++ T Consensus 1 M~v~v~~~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~~k~ 80 (125) T protein:vir:98 1 MGARIESNNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGYAKG 80 (125) T ss_pred CeeEeeHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEeccCCC Confidence 88887 47888999999999999999999999999999999999998876 99999764422 22 22 22444 Q ss_pred CcceeeheecceeecCCcccCccchhhhHHHHHHHHHHHH----HHHHhC Q lcl|NC_012756. 70 VASLTHILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKR----IKEIGK 115 (115) Q Consensus 70 ~~~ltHLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~----i~~ii~ 115 (115) +++.+|++|||-.+ .||+|||+|+.+.+.++..+. |+++-+ T Consensus 81 ~~~~a~F~E~GT~k-----~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:98 81 VSHRIHATEFGTMY-----QKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred CceEEEeccCCccC-----CCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 56889999999996 799999999999877755554 444444 No 60 >protein:vir:4704 Length: 125 # NCBI annotation: phi PVL ORF 11 homologue # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061636;genbank:gi:9635723;genbank:GeneID:1262995 Probab=99.23 E-value=6.1e-14 Score=93.05 Aligned_cols=110 Identities=18% Similarity=0.167 Sum_probs=87.2 Q ss_pred CchHH-HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchh--hhcchhceeec----Cc--eE-EEeeC- Q lcl|NC_012756. 1 MSNDL-ADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGK--YRRSWKKKKLA----NG--SF-VVFNA- 69 (115) Q Consensus 1 ~~d~L-a~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~--y~k~W~~kk~~----~~--~~-vv~n~- 69 (115) ||-++ .++|.+.|+.....+....+.++++.|+.+.+.|++++|+++|. +++++...... .+ ++ |=+++ T Consensus 1 M~v~v~~~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~~k~ 80 (125) T protein:vir:47 1 MGARIESNNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGYAKG 80 (125) T ss_pred CeeEeeHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEeccCCC Confidence 88887 47888999999999999999999999999999999999998876 99999764422 22 22 22444 Q ss_pred CcceeeheecceeecCCcccCccchhhhHHHHHHHHHHHH----HHHHhC Q lcl|NC_012756. 70 VASLTHILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKR----IKEIGK 115 (115) Q Consensus 70 ~~~ltHLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~----i~~ii~ 115 (115) +++.+|++|||-.+ .||+|||+|+.+.+.++..+. |+++-+ T Consensus 81 ~~~~a~F~E~GT~k-----~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:47 81 VSHRIHATEFGTMY-----QKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred CceEEEeccCCccC-----CCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 56889999999996 799999999999877755554 444444 No 61 >protein:vir:9414 Length: 125 # NCBI annotation: phi PVL orf 11-like protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803392;genbank:gi:29028704;genbank:GeneID:1258141 Probab=99.23 E-value=6.1e-14 Score=93.05 Aligned_cols=110 Identities=18% Similarity=0.167 Sum_probs=87.2 Q ss_pred CchHH-HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchh--hhcchhceeec----Cc--eE-EEeeC- Q lcl|NC_012756. 1 MSNDL-ADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGK--YRRSWKKKKLA----NG--SF-VVFNA- 69 (115) Q Consensus 1 ~~d~L-a~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~--y~k~W~~kk~~----~~--~~-vv~n~- 69 (115) ||-++ .++|.+.|+.....+....+.++++.|+.+.+.|++++|+++|. +++++...... .+ ++ |=+++ T Consensus 1 M~v~v~~~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~~k~ 80 (125) T protein:vir:94 1 MGARIESNNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGYAKG 80 (125) T ss_pred CeeEeeHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEeccCCC Confidence 88887 47888999999999999999999999999999999999998876 99999764422 22 22 22444 Q ss_pred CcceeeheecceeecCCcccCccchhhhHHHHHHHHHHHH----HHHHhC Q lcl|NC_012756. 70 VASLTHILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKR----IKEIGK 115 (115) Q Consensus 70 ~~~ltHLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~----i~~ii~ 115 (115) +++.+|++|||-.+ .||+|||+|+.+.+.++..+. |+++-+ T Consensus 81 ~~~~a~F~E~GT~k-----~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:94 81 VSHRIHATEFGTMY-----QKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred CceEEEeccCCccC-----CCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 56889999999996 799999999999877755554 444444 No 62 >protein:vir:79988 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430006;genbank:gi:156604061;genbank:GeneID:5525448 Probab=99.23 E-value=6.1e-14 Score=93.05 Aligned_cols=110 Identities=18% Similarity=0.167 Sum_probs=87.2 Q ss_pred CchHH-HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchh--hhcchhceeec----Cc--eE-EEeeC- Q lcl|NC_012756. 1 MSNDL-ADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGK--YRRSWKKKKLA----NG--SF-VVFNA- 69 (115) Q Consensus 1 ~~d~L-a~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~--y~k~W~~kk~~----~~--~~-vv~n~- 69 (115) ||-++ .++|.+.|+.....+....+.++++.|+.+.+.|++++|+++|. +++++...... .+ ++ |=+++ T Consensus 1 M~v~v~~~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~~k~ 80 (125) T protein:vir:79 1 MGARIESNNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGYAKG 80 (125) T ss_pred CeeEeeHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEeccCCC Confidence 88887 47888999999999999999999999999999999999998876 99999764422 22 22 22444 Q ss_pred CcceeeheecceeecCCcccCccchhhhHHHHHHHHHHHH----HHHHhC Q lcl|NC_012756. 70 VASLTHILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKR----IKEIGK 115 (115) Q Consensus 70 ~~~ltHLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~----i~~ii~ 115 (115) +++.+|++|||-.+ .||+|||+|+.+.+.++..+. |+++-+ T Consensus 81 ~~~~a~F~E~GT~k-----~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:79 81 VSHRIHATEFGTMY-----QKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred CceEEEeccCCccC-----CCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 56889999999996 799999999999877755554 444444 No 63 >protein:vir:81106 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429878;genbank:gi:156603931;genbank:GeneID:5525326 Probab=99.23 E-value=6.1e-14 Score=93.05 Aligned_cols=110 Identities=18% Similarity=0.167 Sum_probs=87.2 Q ss_pred CchHH-HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchh--hhcchhceeec----Cc--eE-EEeeC- Q lcl|NC_012756. 1 MSNDL-ADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGK--YRRSWKKKKLA----NG--SF-VVFNA- 69 (115) Q Consensus 1 ~~d~L-a~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~--y~k~W~~kk~~----~~--~~-vv~n~- 69 (115) ||-++ .++|.+.|+.....+....+.++++.|+.+.+.|++++|+++|. +++++...... .+ ++ |=+++ T Consensus 1 M~v~v~~~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~~k~ 80 (125) T protein:vir:81 1 MGARIESNNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGYAKG 80 (125) T ss_pred CeeEeeHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEeccCCC Confidence 88887 47888999999999999999999999999999999999998876 99999764422 22 22 22444 Q ss_pred CcceeeheecceeecCCcccCccchhhhHHHHHHHHHHHH----HHHHhC Q lcl|NC_012756. 70 VASLTHILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKR----IKEIGK 115 (115) Q Consensus 70 ~~~ltHLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~----i~~ii~ 115 (115) +++.+|++|||-.+ .||+|||+|+.+.+.++..+. |+++-+ T Consensus 81 ~~~~a~F~E~GT~k-----~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:81 81 VSHRIHATEFGTMY-----QKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred CceEEEeccCCccC-----CCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 56889999999996 799999999999877755554 444444 No 64 >protein:vir:1891 Length: 179 # NCBI annotation: gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037671;genbank:gi:9634129;genbank:GeneID:1262520 Probab=99.20 E-value=8.7e-14 Score=92.19 Aligned_cols=110 Identities=11% Similarity=0.071 Sum_probs=77.1 Q ss_pred CchHHH------HHHHHHHHhhHHHHH-HHHHHHHHHHHHHHHHHHHHhCCcc-----chhhhcchhce-------eecC Q lcl|NC_012756. 1 MSNDLA------DLIAKELAAYSDEVT-EEVDKIAEQVADETVDELKETSPKR-----YGKYRRSWKKK-------KLAN 61 (115) Q Consensus 1 ~~d~La------~~I~~~L~~y~~~v~-~~~~~~~~~~a~~~~~~lk~~sP~~-----TG~y~k~W~~k-------k~~~ 61 (115) |+|.+. ++|.+.|+..++++. +.+..++.+.|+.+.+++++++|.. +|..+++-... ..++ T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~~~~~~~~~g~ 80 (179) T protein:vir:18 1 MADSVEVSLTGLESLLGKMEAVSEVTRNKAGRFALRKAANIIRDRARSNASRVDDPLTKEAIHKNIVASFSSKQFRRTGD 80 (179) T ss_pred CCceEEEEeecHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccccchhhhhhheeecccccccccccc Confidence 998643 688888999998885 5679999999999999999999764 45555432111 1111 Q ss_pred ceEEE---------------------------------eeCCcceeeheecceeecCCcccCccchhhhHHHHHHHHHHH Q lcl|NC_012756. 62 GSFVV---------------------------------FNAVASLTHILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEK 108 (115) Q Consensus 62 ~~~vv---------------------------------~n~~~~ltHLLE~GHakr~GGrV~~~phI~paee~~~~~~~~ 108 (115) ..+.+ -+..++.+|+||||..+ +|++|||+||.+...++..+ T Consensus 81 ~~~~vgv~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~y~~fvEfGT~k-----mpa~PFlrPA~~~~~~~a~~ 155 (179) T protein:vir:18 81 LAFRVGVMGGARQYANTKANVRKGRAGKTYKTSGDKGNPGGDTWYWRFLEFGTEH-----TSARPILRPAMNGVDNDVIN 155 (179) T ss_pred eeEeeecccccccccccccccccCcccccccccccccCCCCccceeEEeccCCCC-----CCCCccchhhHHhhHHHHHH Confidence 00000 11245678999999995 99999999999977665555 Q ss_pred HHHHHhC Q lcl|NC_012756. 109 RIKEIGK 115 (115) Q Consensus 109 ~i~~ii~ 115 (115) .|.+.++ T Consensus 156 ~i~~~l~ 162 (179) T protein:vir:18 156 VFSTEMG 162 (179) T ss_pred HHHHHHH Confidence 5554444 No 65 >protein:vir:106570 Length: 182 # NCBI annotation: putative protein # Family: family:all:6475 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958588;genbank:gi:41179258;genbank:GeneID:2717106 Probab=99.20 E-value=3e-13 Score=89.21 Aligned_cols=114 Identities=20% Similarity=0.204 Sum_probs=77.0 Q ss_pred CchH--HHHHHHHHHHhhHHHHHHHH----HHHHHHHHHHHHHHHHHhCCccchhhhcchhceeecCc---eEEEeeCCc Q lcl|NC_012756. 1 MSND--LADLIAKELAAYSDEVTEEV----DKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLANG---SFVVFNAVA 71 (115) Q Consensus 1 ~~d~--La~~I~~~L~~y~~~v~~~~----~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~~~---~~vv~n~~~ 71 (115) |+=+ =.|+|.+.|+.+++.+.+.+ ++++++++.+++.++++.+|++||.|++|++......+ +..|++... T Consensus 2 ~~v~i~Gld~L~~kl~~~~~~~~~~v~~a~~~~~~~~a~~v~~~ak~~~PvdtG~Lr~SI~~~~~~~~~~~~g~V~~~~~ 81 (182) T protein:vir:10 2 IEVELKGVNELRAKLKKLPDIMAKATANAQENAIEQAEAYAVDELQSSIKYSTGELTRSFKHEVKVDGDEVIGRWWNSSM 81 (182) T ss_pred eEEEEecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCchhhhhceeeeeeecCCeEEEEeecCCC Confidence 2222 24566666777776665554 55556667777888999999999999999865433222 233444322 Q ss_pred ceeeheecceee-----------------cC--------------------------Cc------ccCccchhhhHHHHH Q lcl|NC_012756. 72 SLTHILENGHLS-----------------RN--------------------------GG------RVAGIVHIKPAEEKA 102 (115) Q Consensus 72 ~ltHLLE~GHak-----------------r~--------------------------GG------rV~~~phI~paee~~ 102 (115) + +.++|||... +. || -.|++||+.||.+.. T Consensus 82 y-a~yvE~GTG~~~~~~~~~~~p~~~~~~~~~~w~~~~~~v~~~~a~~~~~~~~~~~~~~~~~t~G~~aqPFl~pA~~~~ 160 (182) T protein:vir:10 82 V-AVFREFGTGLVGERSHKQLPKNVAIIYRQTPWFFPVDSVDLDLTKIYGIPKIKINGKYFYRTTGQPARQFMTPAANKM 160 (182) T ss_pred c-cceeecCcccccccCccccCccceeeeecCCceeeccccccccccccccceeeecCceEeecCCCCCCcchHHHHHHh Confidence 2 6788888411 10 11 158999999999999 Q ss_pred HHHHHHHHHHHhC Q lcl|NC_012756. 103 IQNFEKRIKEIGK 115 (115) Q Consensus 103 ~~~~~~~i~~ii~ 115 (115) .+.++++|++.|+ T Consensus 161 ~~~i~~~i~~~i~ 173 (182) T protein:vir:10 161 AKEAPEIIKRSID 173 (182) T ss_pred HHHHHHHHHHHHH Confidence 9999999998888 No 66 >protein:vir:5745 Length: 135 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892056;genbank:gi:33770519;interpro:IPR010064;interpro:IPR011693;uniprot:Q7Y404;genbank:GeneID:2637451 Probab=99.18 E-value=1.7e-13 Score=90.61 Aligned_cols=110 Identities=14% Similarity=0.100 Sum_probs=80.8 Q ss_pred CchHH----HHHHHHHHHhhHHHHHH-HHHHHHHHHHHHHHHHHHHhCCccc----hhhhcchhceee----cCceEEEe Q lcl|NC_012756. 1 MSNDL----ADLIAKELAAYSDEVTE-EVDKIAEQVADETVDELKETSPKRY----GKYRRSWKKKKL----ANGSFVVF 67 (115) Q Consensus 1 ~~d~L----a~~I~~~L~~y~~~v~~-~~~~~~~~~a~~~~~~lk~~sP~~T----G~y~k~W~~kk~----~~~~~vv~ 67 (115) |+-++ -+++.+.|++....+.+ .++.++.+.|+.+.++++.++|+++ |.++++-..+.. +.+.+.+. T Consensus 1 M~~~~~i~Gl~el~~~l~~L~~~~~~k~~~~Al~~~a~~v~~~~k~~ap~~~~~~~g~l~~~I~i~~~k~~~~~~~v~v~ 80 (135) T protein:vir:57 1 MIPEIEISGLQELERRLIAVGEEVGTKILRDAGRAAMAVVEADMKQNAGYDNSSTNAHMRDSIKIRSSRGKAGSTVVVLR 80 (135) T ss_pred CceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhHHhhcccccccccccceeEEEE Confidence 55444 36788888889888864 5579999999999999999999965 888888654322 22333332 Q ss_pred ---eC-CcceeeheecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 68 ---NA-VASLTHILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 68 ---n~-~~~ltHLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) ++ .|+.+|++|||..+ .|++|||+|+.+.+.+.+.+.+.+.++ T Consensus 81 vg~~~~~~~~~~f~E~GT~~-----~~a~PF~~pa~~~~~~~~~~~~~~~~~ 127 (135) T protein:vir:57 81 VGPTRSHYMKALAQEFGTIK-----QVAKPFIRPALDYNKMQVLRILTVEIR 127 (135) T ss_pred ecCCCCcceeEeecccCCCC-----CCCCcchhHhHHHhHHHHHHHHHHHHH Confidence 23 56779999999995 899999999999866655554444443 No 67 >protein:vir:4347 Length: 164 # NCBI annotation: Orf14 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061510;genbank:gi:9635606;genbank:GeneID:1262873 Probab=99.17 E-value=1.2e-13 Score=91.33 Aligned_cols=110 Identities=16% Similarity=0.151 Sum_probs=79.2 Q ss_pred CchHHH------HHHHHHHHhhHHHHH-HHHHHHHHHHHHHHHHHHHHhCCc-----cchhhhcchhc----eeecC-ce Q lcl|NC_012756. 1 MSNDLA------DLIAKELAAYSDEVT-EEVDKIAEQVADETVDELKETSPK-----RYGKYRRSWKK----KKLAN-GS 63 (115) Q Consensus 1 ~~d~La------~~I~~~L~~y~~~v~-~~~~~~~~~~a~~~~~~lk~~sP~-----~TG~y~k~W~~----kk~~~-~~ 63 (115) |||.+. ++|.+.|++.++++. +.+..++.+.|+.+.+++++++|+ .+|.++++-.. +.... +. T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~~~~~~~~~~~ 80 (164) T protein:vir:43 1 MADTVEFSITGLDSLLGKLDSVTDDVKRRGGRAALRKAAMIVVQAAKQGAEKVDDPGTGRSISDNIALRWNGRLFKRTGD 80 (164) T ss_pred CCcceEEeeecHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccCCCccchhhhhhhhhcccCccccccc Confidence 888543 578888999998876 577999999999999999999996 55777765322 11110 00 Q ss_pred --E------------------EEeeCCcceeeheecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 64 --F------------------VVFNAVASLTHILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 64 --~------------------vv~n~~~~ltHLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) . +..+..++++|+||||..+ .|++|||+||.+.+.++..+.|...++ T Consensus 81 ~~~~vg~~~~~~~~~~~~~~~~~~~~~~~y~~f~EfGT~k-----m~a~PFlrPA~~~~k~~~~~~~~~~l~ 147 (164) T protein:vir:43 81 LGFRIGVLHGAVLPKKGERSDKTANAPTPHWRLLEFGTED-----MRAQPFMRSALADNIAEVTSTFVSEYE 147 (164) T ss_pred eeEEecccccccccccccccccCCCCCcceEEEeecCCCC-----CCCCcchhhhHHHhHHHHHHHHHHHHH Confidence 0 0012334679999999995 999999999999877776655554444 No 68 >protein:vir:1386 Length: 149 # NCBI annotation: Gp9 protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612838;genbank:gi:20065972;genbank:GeneID:935787 Probab=99.13 E-value=3.4e-13 Score=88.93 Aligned_cols=110 Identities=15% Similarity=0.129 Sum_probs=76.8 Q ss_pred CchHHH------HHHHHHHHhhH--HHHHHHHHHHHHHHHHHHHHHHHHhCCccch-------------hhhcchhceee Q lcl|NC_012756. 1 MSNDLA------DLIAKELAAYS--DEVTEEVDKIAEQVADETVDELKETSPKRYG-------------KYRRSWKKKKL 59 (115) Q Consensus 1 ~~d~La------~~I~~~L~~y~--~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG-------------~y~k~W~~kk~ 59 (115) |||.+. ++|.+.|+... ..+.+..++++.+.|+.+.+++++++|+..+ ..+.+-...+. T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~~k~~~~Al~~ga~~v~~~~k~~aP~~~~~~~~~~~~~~~~~~~~d~i~~~~~ 80 (149) T protein:vir:13 1 MSDGWEIKFEGLDDLIKTFEQLGTEKENEDVEKSILKECGDLAKKTVAPLIHISDDNSKSGRKGSRPPGHAANNIPEPKI 80 (149) T ss_pred CCceeEEEeecHHHHHHHHHhcccHHHHHHHHHHHHHHHHHHHHHHHHHhCCccCCccccccccccccchhhhcceeccc Confidence 999543 88889999984 5778889999999999999999999997532 22222222111 Q ss_pred c--Cc--eEEE-e----eCCcceeeheecceeecCCcccCccchhhhHHHHHHHHHH----HHHHHHhC Q lcl|NC_012756. 60 A--NG--SFVV-F----NAVASLTHILENGHLSRNGGRVAGIVHIKPAEEKAIQNFE----KRIKEIGK 115 (115) Q Consensus 60 ~--~~--~~vv-~----n~~~~ltHLLE~GHakr~GGrV~~~phI~paee~~~~~~~----~~i~~ii~ 115 (115) . .+ .+.| + +..++++|+||||..+ .|++|||+|+.+.+.++.. +.++++|+ T Consensus 81 ~~~~g~~~~~VG~~~~~~~~~~y~~f~E~GT~k-----~~a~pF~~pa~~~~~~~~~~~~~~~l~k~i~ 144 (149) T protein:vir:13 81 RKKKGNLQCVVGWEKSDNTPFYYMKMEEWGTSE-----RPPHHAFGKTNKILKRVYDNIAQKKYDNFVK 144 (149) T ss_pred ccccceeEEEeeccCCCCCccceeeeeccCccC-----CCCCccchHHHHHHHHHHHHHHHHHHHHHHH Confidence 1 11 1222 1 2345789999999996 8999999999976555555 44555555 No 69 >protein:vir:107703 Length: 147 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003902;genbank:gi:45686318;genbank:GeneID:2773043 Probab=99.06 E-value=1.5e-12 Score=85.34 Aligned_cols=108 Identities=18% Similarity=0.249 Sum_probs=82.7 Q ss_pred Cch-HHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceee-------------------- Q lcl|NC_012756. 1 MSN-DLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKL-------------------- 59 (115) Q Consensus 1 ~~d-~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~-------------------- 59 (115) |+| +|.+. +..+..+.+.++..++..+++++.++...+...||+|||.||.+|...-. T Consensus 1 ma~~~~~~F-~~~i~~~~~~ve~~~~~~~r~~a~~i~~~vv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~dp~g~~t~a~ 79 (147) T protein:vir:10 1 MANYQIRRF-QGEIDAWINAAESTLEHAIEIFVRDVHDALVSRSPVDTGRFKGNWQITFNEIPNHALNRYDKTGGVVRGE 79 (147) T ss_pred CCCcchhhh-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhhccccceeecCccccccCCcCCCccchhhh Confidence 999 55443 44578899999999999999999999999999999999999999965310 Q ss_pred ------------cCce-EEEeeCCcceeeheecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 60 ------------ANGS-FVVFNAVASLTHILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 60 ------------~~~~-~vv~n~~~~ltHLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) +.+. +.+-|+.+| +--|||||.+ -++..+++.+.++..+-|.+.+.++=+ T Consensus 80 ~~~~~~~~~~~~~~~~~iyi~Nn~pY-A~~LEyG~S~-----QAP~G~V~~t~q~~~~~v~~~~~e~k~ 142 (147) T protein:vir:10 80 EQAKTYGMFSRGGAITSVHFSNMLIY-ANALEYGHSQ-----QAPSGVVGLVALRLRSYMADAIKQARR 142 (147) T ss_pred hhHHHHHHhhhccCcceEEEeeCcch-hhhhhccccC-----CCCchHHHHHHHHHHHHHHHHHHHHHh Confidence 1121 223355555 4456999995 788888889988877777777766555 No 70 >protein:vir:103280 Length: 142 # NCBI annotation: phage-related hypothetical protein # Family: family:all:448 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277459;genbank:gi:71834102;genbank:GeneID:3562391 Probab=99.04 E-value=1.6e-12 Score=85.32 Aligned_cols=108 Identities=22% Similarity=0.343 Sum_probs=81.7 Q ss_pred CchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeec-------------C------ Q lcl|NC_012756. 1 MSNDLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLA-------------N------ 61 (115) Q Consensus 1 ~~d~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~-------------~------ 61 (115) |++++.+.=.+ +..|.+.++..++..+++++..+...+...||+|||.|+.+|...-.. . T Consensus 1 Ma~~~~sf~~~-i~~~~~~ve~~~~~v~r~~a~~i~~~vv~~sPVdTGr~R~nw~vs~~~~~~~~~~~~d~~G~~t~~~~ 79 (142) T protein:vir:10 1 MANDVVSFRNS-INAWIDGVTEGVELIVEGTLTKATKDIVKLSPVDTGRFRGNWQATGNSPAAQSLNNYDPDGNETRNSL 79 (142) T ss_pred Cccchhhhhcc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCcccchhhcccceeeecCcccccccCcCCCCccchhhH Confidence 99755444333 777999999999999999999999999999999999999999663211 0 Q ss_pred ------------ce-EEEeeCCcceeeheecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 62 ------------GS-FVVFNAVASLTHILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 62 ------------~~-~vv~n~~~~ltHLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) +. +.+-|+.+| +--|||||.. -++..+++.+.++..+-|++.+.++=. T Consensus 80 ~~~~~~i~~~~~g~~iyi~Nn~pY-A~~LEyG~S~-----QAP~G~v~~a~q~~~~~v~~a~~e~~~ 140 (142) T protein:vir:10 80 RRQIYALARDANTNVIYISNRLDY-AQGLEFGSSN-----QAPSGVLGVVQKRLGRYFAEAVQEAKR 140 (142) T ss_pred HHHHHHhhhccccceEEEeeCcch-hhhhhccccC-----CCcchHHHHHHHHHHHHHHHHHHHhhc Confidence 11 222355555 4457999997 888899999998877777766665544 No 71 >protein:vir:10367 Length: 119 # NCBI annotation: conserved phage protein # Family: family:all:2714 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858959;genbank:gi:32128424;genbank:GeneID:2648366 Probab=99.01 E-value=2.1e-13 Score=90.07 Aligned_cols=81 Identities=19% Similarity=0.267 Sum_probs=58.6 Q ss_pred HHHHHHHhCCccchhhhcchhce----eecCceEEE---ee-CCcceeeheecceeecC-------C------------c Q lcl|NC_012756. 35 TVDELKETSPKRYGKYRRSWKKK----KLANGSFVV---FN-AVASLTHILENGHLSRN-------G------------G 87 (115) Q Consensus 35 ~~~~lk~~sP~~TG~y~k~W~~k----k~~~~~~vv---~n-~~~~ltHLLE~GHakr~-------G------------G 87 (115) ++++.+..-|++||+|+++.-.. ...+|..+| +| ++.++.||+||||..+. | - T Consensus 1 ~rDeakarv~~~~G~Lr~sIY~ay~~~~S~dG~~~Y~Vswn~rkAPhghlvE~Ghw~~~~~~~~~dG~w~~~~~~l~~~~ 80 (119) T protein:vir:10 1 MRESAKAFVNDETGKLRSNLYVAYSTEESTNGVQTYAVSWRKKAAPHGHLLEFGHWQTHAAYKGKDGEWYSSSVKLVNPK 80 (119) T ss_pred CCcccccccCCCccchhhhheeeeccccCCCCEEEEEeecCCCcCCcccccccceeeeeeeeeccCceeeecCccccCce Confidence 88888999999999999997332 334454333 34 47899999999998654 2 2 Q ss_pred ccCccchhhhHHHHHHHHHHHHHHH--------HhC Q lcl|NC_012756. 88 RVAGIVHIKPAEEKAIQNFEKRIKE--------IGK 115 (115) Q Consensus 88 rV~~~phI~paee~~~~~~~~~i~~--------ii~ 115 (115) |||++|||+||+|....+++..+.. +.. T Consensus 81 ~vPa~pFlRpA~da~~~~a~~~~~~r~~~rv~Ev~r 116 (119) T protein:vir:10 81 WIPARPFLRPGYDSVAMQIPDIAKAAGAKKYAELQR 116 (119) T ss_pred ecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 7899999999999655555544443 333 No 72 >protein:vir:81067 Length: 119 # NCBI annotation: p12 # Family: family:all:2714 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285682;genbank:gi:156535145;genbank:GeneID:5247112 Probab=99.00 E-value=2.4e-13 Score=89.79 Aligned_cols=81 Identities=19% Similarity=0.267 Sum_probs=58.6 Q ss_pred HHHHHHHhCCccchhhhcchhce----eecCceEEE---ee-CCcceeeheecceeecC-------C------------c Q lcl|NC_012756. 35 TVDELKETSPKRYGKYRRSWKKK----KLANGSFVV---FN-AVASLTHILENGHLSRN-------G------------G 87 (115) Q Consensus 35 ~~~~lk~~sP~~TG~y~k~W~~k----k~~~~~~vv---~n-~~~~ltHLLE~GHakr~-------G------------G 87 (115) ++++.+..-|++||+|+++.-.. ...+|..+| +| ++.++.||+||||..+. | - T Consensus 1 ~rDeakarv~~~~G~Lr~sIY~ay~~~~S~dG~~~Y~Vswn~rkAPhghlvE~Ghw~~~~~~~~~dG~w~~~~~~l~~~~ 80 (119) T protein:vir:81 1 MRESAKAFVNDETGKLRSNLYVAYSPEESTNGVQTYAVSWRKKAAPHGHLLEFGHWQTHAAYKGKDGEWYSSSVKLVNPK 80 (119) T ss_pred CCcccccccCCCccchhhhheeeeccccCCCCeEEEEeeccCCcCCcccccccceeeeeeeeeccCceeeecCccccCce Confidence 88888999999999999997432 334443333 34 47899999999998654 3 3 Q ss_pred ccCccchhhhHHHHHHHHHHHHHHH--------HhC Q lcl|NC_012756. 88 RVAGIVHIKPAEEKAIQNFEKRIKE--------IGK 115 (115) Q Consensus 88 rV~~~phI~paee~~~~~~~~~i~~--------ii~ 115 (115) |||++|||+||+|....+++..+.. +.. T Consensus 81 ~vPa~pFlRpA~da~~~~a~~~~~~r~~~rv~Ev~r 116 (119) T protein:vir:81 81 WIPARPFLRPGYDSVAMQIPDIAKAAGAKKYAELQR 116 (119) T ss_pred ecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 7899999999999655555544433 333 No 73 >protein:vir:79638 Length: 146 # NCBI annotation: gp40 # Family: family:all:448 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285529;genbank:gi:148734512;genbank:GeneID:5219996 Probab=98.99 E-value=5.1e-12 Score=82.48 Aligned_cols=109 Identities=13% Similarity=0.189 Sum_probs=81.8 Q ss_pred CchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhcee-------------ecCc----- Q lcl|NC_012756. 1 MSNDLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKK-------------LANG----- 62 (115) Q Consensus 1 ~~d~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk-------------~~~~----- 62 (115) |+|.=-..-+..+..|.+.++..++..+++++.++...+...||+|||.|+.+|...- .|.. T Consensus 1 ma~~~~~sFa~~i~~~~~~ve~~~~~~~r~~a~~i~~~vv~~sPVDTGr~Ranw~vs~~~~~~~~~~~~dp~G~~t~~~~ 80 (146) T protein:vir:79 1 MADYSIREFHGNVDKWIEQVESGLNDVIQIFGEKVHGALVDIAPVDTGRFKANMQITANKPPLYALNQYDPDGEKIKAEG 80 (146) T ss_pred CCcchhHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhhccccceeecCcccccccCCCCCCcccHHHH Confidence 9994323334456779999999999999999999999999999999999999996642 1110 Q ss_pred -------------eEEEe--eCCcceeeheecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 63 -------------SFVVF--NAVASLTHILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 63 -------------~~vv~--n~~~~ltHLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) .-++| |+.+| +--|||||.. -++..+++.+.+...+-|++.+.++=+ T Consensus 81 ~~~i~~~~~g~~~~~~iyi~NnlpY-A~~LEyG~S~-----QAP~G~v~~~~~~~~~~v~~a~~e~k~ 142 (146) T protein:vir:79 81 RRTLYALLHGGGAIKSIYFSNMLIY-ANALEYGHSK-----QAPAGVFGIVAIRLRSYMAEAIREARK 142 (146) T ss_pred HHHHHHHHhcccccceeEEeeCchh-hhhhhccccC-----CCcchHHHHHHHHHHHHHHHHHHHHHh Confidence 01233 55555 4457999995 788899999998887777775555444 No 74 >protein:vir:104347 Length: 145 # NCBI annotation: conserved phage-related protein # Family: family:all:448 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398975;genbank:gi:81343959;genbank:GeneID:3778879 Probab=98.98 E-value=2.4e-12 Score=84.25 Aligned_cols=108 Identities=14% Similarity=0.256 Sum_probs=80.1 Q ss_pred CchHHHH--HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhcee-------------ecCce-- Q lcl|NC_012756. 1 MSNDLAD--LIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKK-------------LANGS-- 63 (115) Q Consensus 1 ~~d~La~--~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk-------------~~~~~-- 63 (115) |+.++++ .-...+.+|.+.+++.++..+++++.++...+...||+|||.|+.+|...- .|..+ T Consensus 1 ~~~~m~~~~sF~~~i~~~~~~ve~~~~~v~r~~a~~i~~~vv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~G~~t~~ 80 (145) T protein:vir:10 1 MARNIGSVVTFEKSIADWIDRAEDGFGIVVSNTVIKTANAIVDLSPVDTGRFKANWQISANSPAQQSLNEYDQTGGQTKT 80 (145) T ss_pred CCCcccchhccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccceeecccccccccccCCCCccchh Confidence 6666554 335567789999999999999999999999999999999999999996632 11111 Q ss_pred -----------------EEEeeCCcceeeheecceeecCCcccCccchhhhHHHHHHHHHHH---HHHHHh Q lcl|NC_012756. 64 -----------------FVVFNAVASLTHILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEK---RIKEIG 114 (115) Q Consensus 64 -----------------~vv~n~~~~ltHLLE~GHakr~GGrV~~~phI~paee~~~~~~~~---~i~~ii 114 (115) +.+-|+.+| +--|||||.+ -++..+++.+.+...+-|.+ +++++| T Consensus 81 ~~~~~~~~i~~~k~g~~iyi~Nn~pY-A~~LEyG~S~-----QAP~G~v~~~~~~~~~~v~~~~~e~k~~~ 145 (145) T protein:vir:10 81 YLARQARAVANSKATSVIYITNRLDY-AADLEYGASN-----QAPAGVLGVVQARLGRYFQEAVEEARRAI 145 (145) T ss_pred hHHHHHHHhhcccccceEEEeeCchh-hhHhhccccC-----CCcchHHHHHHHHHHHHHHHHHHHhhccC Confidence 122255555 4456999996 78888999999887755444 444455 No 75 >protein:vir:8669 Length: 142 # NCBI annotation: gp27 # Family: family:all:1084 # MgeID: mge:156 # MgeName: Rosebush # Cross-refs: genbank:acc:NP_817788;genbank:gi:29566220;genbank:GeneID:1259476 Probab=98.95 E-value=1.7e-12 Score=85.07 Aligned_cols=110 Identities=10% Similarity=0.027 Sum_probs=76.4 Q ss_pred CchHHHHHH-HHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeecCc---e--EEEeeC-Ccce Q lcl|NC_012756. 1 MSNDLADLI-AKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLANG---S--FVVFNA-VASL 73 (115) Q Consensus 1 ~~d~La~~I-~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~~~---~--~vv~n~-~~~l 73 (115) |+-++.-+. ...|......+.+.+++.+++++..+..++++++|++||.+++||+.+...++ + ..|.++ .| T Consensus 2 ~~~~~~~~gl~~~l~~~~~~~~~~~~~~i~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~~v~~~a~Y-- 79 (142) T protein:vir:86 2 VQVSVRYEGFDYNPVGAAAQVGPILRRTHSSLTRQIANETRARVPVLTGHLGRSVREDPQVMVTPFHVSGGVTAHAKY-- 79 (142) T ss_pred ceeEEEeeecchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeeeeccccccceEEEEeccCccc-- Confidence 222222111 24566667778888999999999999999999999999999999987654432 2 223333 44 Q ss_pred eeheecceee--------------cCCcc----------cCccchhhhHHHHHHHHHHHHHHH Q lcl|NC_012756. 74 THILENGHLS--------------RNGGR----------VAGIVHIKPAEEKAIQNFEKRIKE 112 (115) Q Consensus 74 tHLLE~GHak--------------r~GGr----------V~~~phI~paee~~~~~~~~~i~~ 112 (115) ++++|||+.. .+|++ .+++|||.||.+.++.+....+.| T Consensus 80 A~~ve~GT~ph~i~pk~~~al~f~~~g~~~~~k~v~hpG~~a~Pfl~~A~~~~~~~~~~~~~r 142 (142) T protein:vir:86 80 AAAVHEGTRPHVIRAKHAQALHFWWRGREVFVRQVNHPGTRARPYLRNAGEAVVRRDRRIRVR 142 (142) T ss_pred cceeccCCccceeccccCceeeEecCCceeeeeeeecCCCCCCchhHHHHHHHHhhhhhhccC Confidence 8999999741 11321 239999999998887775554444 No 76 >protein:vir:99101 Length: 142 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1608 # MgeName: Qyrzula # Cross-refs: genbank:acc:YP_655705;genbank:gi:109521783;genbank:GeneID:4157823 Probab=98.95 E-value=1.7e-12 Score=85.07 Aligned_cols=110 Identities=10% Similarity=0.027 Sum_probs=76.4 Q ss_pred CchHHHHHH-HHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeecCc---e--EEEeeC-Ccce Q lcl|NC_012756. 1 MSNDLADLI-AKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLANG---S--FVVFNA-VASL 73 (115) Q Consensus 1 ~~d~La~~I-~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~~~---~--~vv~n~-~~~l 73 (115) |+-++.-+. ...|......+.+.+++.+++++..+..++++++|++||.+++||+.+...++ + ..|.++ .| T Consensus 2 ~~~~~~~~gl~~~l~~~~~~~~~~~~~~i~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~~v~~~a~Y-- 79 (142) T protein:vir:99 2 VQVSVRYEGFDYNPVGAAAQVGPILRRTHSSLTRQIANETRARVPVLTGHLGRSVREDPQVMVTPFHVSGGVTAHAKY-- 79 (142) T ss_pred ceeEEEeeecchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeeeeccccccceEEEEeccCccc-- Confidence 222222111 24566667778888999999999999999999999999999999987654432 2 223333 44 Q ss_pred eeheecceee--------------cCCcc----------cCccchhhhHHHHHHHHHHHHHHH Q lcl|NC_012756. 74 THILENGHLS--------------RNGGR----------VAGIVHIKPAEEKAIQNFEKRIKE 112 (115) Q Consensus 74 tHLLE~GHak--------------r~GGr----------V~~~phI~paee~~~~~~~~~i~~ 112 (115) ++++|||+.. .+|++ .+++|||.||.+.++.+....+.| T Consensus 80 A~~ve~GT~ph~i~pk~~~al~f~~~g~~~~~k~v~hpG~~a~Pfl~~A~~~~~~~~~~~~~r 142 (142) T protein:vir:99 80 AAAVHEGTRPHVIRAKHAQALHFWWRGREVFVRQVNHPGTRARPYLRNAGEAVVRRDRRIRVR 142 (142) T ss_pred cceeccCCccceeccccCceeeEecCCceeeeeeeecCCCCCCchhHHHHHHHHhhhhhhccC Confidence 8999999741 11321 239999999998887775554444 No 77 >protein:vir:9879 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:2718 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795641;genbank:gi:28876400;genbank:GeneID:1257931 Probab=98.93 E-value=8.8e-12 Score=81.18 Aligned_cols=109 Identities=17% Similarity=0.185 Sum_probs=82.2 Q ss_pred CchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHh--CCc-------cchhhhcchhceeecCceEEEee--- Q lcl|NC_012756. 1 MSNDLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKET--SPK-------RYGKYRRSWKKKKLANGSFVVFN--- 68 (115) Q Consensus 1 ~~d~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~--sP~-------~TG~y~k~W~~kk~~~~~~vv~n--- 68 (115) |. =-++|.+.|+.. ..+++++.|++-+.++....++. +|+ +||.+++|-+.....++..+... T Consensus 1 i~--G~~~L~~~Lk~~---s~~dvk~VVkkN~ael~~r~q~~~~~pv~~~~k~~dTG~lkRSi~l~~~~~g~~~~vgp~g 75 (127) T protein:vir:98 1 MT--GMPALEVKLRSM---SEKRWDRVANKNLTEMFNRAARPPGTPIGKNTKRHKSGELLRSRRLKKVNSSKDVITGNFG 75 (127) T ss_pred Cc--ChHHHHHHHHHh---hHHHHHHHHhhhhHHHHHHHHhccCCceeccccccCcccceeeeEEEEecCCceEEeccCc Confidence 11 124555666655 33568999999999998888885 788 99999999877766555433222 Q ss_pred -CCcceeeheecceeecCC----cccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 69 -AVASLTHILENGHLSRNG----GRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 69 -~~~~ltHLLE~GHakr~G----GrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) .+.+ +-.|||||+.-.| |+|+|+|++.|+++.-...|.++++++++ T Consensus 76 ~t~dY-apyvEyGTR~m~~~~~~gf~~aqp~l~paf~~Qk~iF~~DL~~l~k 126 (127) T protein:vir:98 76 YIKDY-APHVEYGHRIVRNGKQVGYANGTKYLFNNVKKQREIYRQDMLNELR 126 (127) T ss_pred ccccc-cceeecceeeeecccccccccCccccccchHHHhHHHHHHHHHHhc Confidence 1333 4456999997655 89999999999999999999999999999 No 78 >protein:vir:95157 Length: 144 # NCBI annotation: hypothetical protein ORF019 # Family: family:all:448 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293426;genbank:gi:148912847;genbank:GeneID:5228232 Probab=98.84 E-value=2.7e-11 Score=78.55 Aligned_cols=106 Identities=20% Similarity=0.234 Sum_probs=75.2 Q ss_pred CchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeec-------------------- Q lcl|NC_012756. 1 MSNDLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLA-------------------- 60 (115) Q Consensus 1 ~~d~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~-------------------- 60 (115) ||+++.+. ...+..+.+.+++.++..+++++.++...+...||+|||.|+.+|...-.. T Consensus 1 MA~~~~~f-~~~i~~~~~~ve~~~~~~~r~~a~~v~~~vv~~sPVDTGrfRanw~vs~~~p~~~~~~~~~~~~~~~t~d~ 79 (144) T protein:vir:95 1 MAKSLLDL-ADRLEKKAKAIDEAASQNAVDTALAIVGDLAYKTPVDTSQALSNWIVTLESPSGQQIKPHFPGSQGSTQRA 79 (144) T ss_pred Cchhhhhh-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccceeccccccccccccccccccccCCC Confidence 99976555 445677999999999999999999999999999999999999999765221 Q ss_pred C------------------ceEEEeeCCcceeeheecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 61 N------------------GSFVVFNAVASLTHILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 61 ~------------------~~~vv~n~~~~ltHLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) . ..+.+-|+-+|.. -|||||.. -++..+++.+.+...+ |.++++ |+. T Consensus 80 sg~~tl~~~~~vi~~~~~g~~iyi~NnlpYA~-~LEyG~S~-----QAP~G~vr~~~q~~~~-~v~~~~-~~~ 144 (144) T protein:vir:95 80 SAAETLNSAKLVLRNKKPGQAIFITNNLPYIR-RLNDGYSA-----QAPAGFVERAVLIGRK-MRKKFK-IKD 144 (144) T ss_pred chhHHHHHHHHHHhhcCccceEEEeeCchhhh-hhhccccC-----CCcchHHHHHHHHHHH-HHHhhc-cCC Confidence 0 0112235555544 46999995 7777888877754432 222221 222 No 79 >protein:vir:94994 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224022;genbank:gi:62327309;genbank:GeneID:5176822 Probab=98.81 E-value=4.4e-11 Score=77.39 Aligned_cols=102 Identities=18% Similarity=0.284 Sum_probs=74.6 Q ss_pred CchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceee-------------c------- Q lcl|NC_012756. 1 MSNDLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKL-------------A------- 60 (115) Q Consensus 1 ~~d~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~-------------~------- 60 (115) ||= + ..+.+|.+.+++.++..+++++.++...+...||+|||.|+.+|...-. + T Consensus 1 msF--~----~~i~~~~~~ve~~~~~~~r~~a~~~~~~iv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~g~~t~~~~ 74 (131) T protein:vir:94 1 MSF--A----LDVTRFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGSTPADGTTDATDKSGNTATGNA 74 (131) T ss_pred CCc--c----cCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCchhhhhccchhccccccccccCCCCCCchhhHHHH Confidence 432 2 2344577789999999999999999999999999999999999965421 1 Q ss_pred --------Cc-eEEEeeCCcceeeheecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHh Q lcl|NC_012756. 61 --------NG-SFVVFNAVASLTHILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIG 114 (115) Q Consensus 61 --------~~-~~vv~n~~~~ltHLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii 114 (115) -+ .+.+-|+.+| +--|||||.. .++..+++.+.++..+-|.+.++++= T Consensus 75 ~~~i~~~~~g~~iyi~Nn~pY-A~~LEyG~S~-----QAP~g~v~~~~~~~~~~v~~~~~e~k 131 (131) T protein:vir:94 75 TSFVLNAADWHTFTLTNNLPY-AQRLEYGWSQ-----QAPQGFVRVNVSRFQQLLNEEASKVK 131 (131) T ss_pred HHHHhhccccceEEEeeCchh-hhhhhccccC-----CCcchHHHHHHHHHHHHHHHHHHhcC Confidence 01 1233455555 4457999995 78888999998777776666666554 No 80 >protein:vir:97190 Length: 148 # NCBI annotation: hypothetical protein ORF030 # Family: family:all:448 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294538;genbank:gi:149408259;genbank:GeneID:5237055 Probab=98.74 E-value=7.4e-11 Score=76.14 Aligned_cols=103 Identities=18% Similarity=0.220 Sum_probs=77.3 Q ss_pred CchH--HHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceee---------------cC-- Q lcl|NC_012756. 1 MSND--LADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKL---------------AN-- 61 (115) Q Consensus 1 ~~d~--La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~---------------~~-- 61 (115) |+|- +++.|. .|.+.+++.++..+++++.++...+...||++||.|+.+|...-. |+ T Consensus 1 m~~~~sFa~~i~----~~~~~ve~~~~~~~r~~a~~i~~~vv~~sPVdTGrfRanw~vs~~~p~~~~~~~~dp~~~G~~~ 76 (148) T protein:vir:97 1 MPSLSEFSRRIT----LRGRKVAEGADALTRKVALAADQAVVSGTPVDTGRARSNWIAAIGSAPSSVIDAYSPGEAGSTE 76 (148) T ss_pred CCccchhcccHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhhhhhhheeecccccccccccCCCCCCccc Confidence 7774 455555 489999999999999999999999999999999999999966511 11 Q ss_pred --------------------ce-EEEeeCCcceeeheecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 62 --------------------GS-FVVFNAVASLTHILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 62 --------------------~~-~vv~n~~~~ltHLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) +. +.+-|+-+| +--|||||.. -++..+++.+.+...+-+++ -++++ T Consensus 77 ~~~~~~~i~~~~~vi~~~k~g~~iyi~NnlpY-A~~LEyG~S~-----QAP~G~v~~t~~~~~~~v~~--~~~~~ 143 (148) T protein:vir:97 77 AANTQAAIDQAESVIRGYNYGEEIHITNNLPY-IQRLNDGYSA-----QAPANFVEQAVLEAVQVVQF--GRVVD 143 (148) T ss_pred ccchhHHHHHHHHHhhccCCCceEEEeecchh-hhHhhccccC-----CCcchHHHHHHHHHHHHHHh--hhhhc Confidence 01 122345555 4457999997 88899999999877766654 45666 No 81 >protein:vir:78380 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110844;genbank:gi:134288605;genbank:GeneID:5179643 Probab=98.72 E-value=1.3e-10 Score=74.85 Aligned_cols=102 Identities=18% Similarity=0.261 Sum_probs=73.5 Q ss_pred CchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceee-----------cC-------- Q lcl|NC_012756. 1 MSNDLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKL-----------AN-------- 61 (115) Q Consensus 1 ~~d~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~-----------~~-------- 61 (115) ||= +. .+..|.+.+++.++..+++++.++...+...||+|||.|+.+|...-. +. T Consensus 1 msf--~~----~i~~~~~~ve~~~~~~~r~~a~~~~~~iv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~g~~t~~~~ 74 (131) T protein:vir:78 1 MSF--AL----DVSKFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNA 74 (131) T ss_pred CCc--Cc----CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCchhhhccccceecccccccccCCCCCCchhhHHHH Confidence 432 22 234467778999999999999999999999999999999999975421 01 Q ss_pred ---------c-eEEEeeCCcceeeheecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHh Q lcl|NC_012756. 62 ---------G-SFVVFNAVASLTHILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIG 114 (115) Q Consensus 62 ---------~-~~vv~n~~~~ltHLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii 114 (115) + .+.+-|+.+| +--|||||.. .++..+++.+.++..+-|.+.++++= T Consensus 75 ~~~i~~~~~g~~iyi~Nn~pY-A~~LEyG~S~-----QAP~G~v~~~~~~~~~~v~~~~~e~k 131 (131) T protein:vir:78 75 ANFVLNAADWHTFTLTNNLPY-AQRLEYGWSQ-----QAPQGFVRVNVSRFQQLLNEEASKVK 131 (131) T ss_pred HHHHhhccCCceEEEeeCchh-hhHhhccccC-----CCcchHHHHHHHHHHHHHHHHHHhcC Confidence 1 1223345555 4457999995 78888999988777666666665554 No 82 >protein:vir:106041 Length: 137 # NCBI annotation: gp23 # Family: family:all:1084 # MgeID: mge:1505 # MgeName: Cooper # Cross-refs: genbank:acc:YP_654920;genbank:gi:109392376;genbank:GeneID:4157069 Probab=98.70 E-value=2.9e-11 Score=78.33 Aligned_cols=106 Identities=9% Similarity=0.036 Sum_probs=69.3 Q ss_pred Cch--HHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeecCc--e--EEEeeC-Ccce Q lcl|NC_012756. 1 MSN--DLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLANG--S--FVVFNA-VASL 73 (115) Q Consensus 1 ~~d--~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~~~--~--~vv~n~-~~~l 73 (115) |+. .|...... -.+++...++..+++++..+..+.+.++|++||.|++||+.+....+ + ..|.+. .| T Consensus 1 m~~s~~i~i~~~~----l~~~v~~~~k~~l~~~a~~i~~~ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~v~~~~~Y-- 74 (137) T protein:vir:10 1 MPVTARIHINEPE----LERQTGAIFRGKHRSITRRIATQARADVPVRTGNLGRGIQEMPQTYRPFHVGGGVEDNVDY-- 74 (137) T ss_pred CCeeEEEeeCHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhhcCceeeeeccccceEEEEEecCCCc-- Confidence 443 33322222 23466777888888899999999999999999999999998765433 2 223333 44 Q ss_pred eeheecce---eec-----------CCcc-------cC---ccchhhhHHHHHHHHHHHHHHHH Q lcl|NC_012756. 74 THILENGH---LSR-----------NGGR-------VA---GIVHIKPAEEKAIQNFEKRIKEI 113 (115) Q Consensus 74 tHLLE~GH---akr-----------~GGr-------V~---~~phI~paee~~~~~~~~~i~~i 113 (115) ++++|||. ..+ +|.+ .| ++|||.||.++.+...+ +|+-- T Consensus 75 A~~ve~GT~ph~I~pk~~k~l~f~~~G~~v~~k~v~hpG~~a~Pfl~~A~~~~~~~~~-ri~~~ 137 (137) T protein:vir:10 75 AAPVHEGSRPHRITARHANALHFFWHGREVFRKSVWHPGVRPRPFLRNAARRVVAADP-DIHMT 137 (137) T ss_pred eeeeeecCCCceeecccCceeeeeeCCceEEeeeeecCCCCCCchHHHHHHHHhhccc-cccCC Confidence 88999995 221 1222 23 99999999988644332 23222 No 83 >protein:vir:80425 Length: 134 # NCBI annotation: BcepGomrgp15 # Family: family:all:448 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210235;genbank:gi:146329927;genbank:GeneID:5123534 Probab=98.67 E-value=9.8e-11 Score=75.47 Aligned_cols=102 Identities=19% Similarity=0.293 Sum_probs=71.7 Q ss_pred CchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceee-------------cCc----- Q lcl|NC_012756. 1 MSNDLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKL-------------ANG----- 62 (115) Q Consensus 1 ~~d~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~-------------~~~----- 62 (115) || ++.. +..|.+.+++.++..+++++.++...+...||+|||.||.+|...-. +.+ T Consensus 1 ms--F~~~----i~~~~~~ve~~~~~~~r~~a~~~~~~vv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~g~~~~~~~ 74 (134) T protein:vir:80 1 MS--YTDR----FNVIAKGIEDNVDNLVKNVALAIGSNVIADTPILTGQARRNWQTELNQMPESVLDIPESPSEGMDEAL 74 (134) T ss_pred CC--cccC----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhhhcccceeecCcccccccCcCCCCccchhhH Confidence 43 2333 33477789999999999999999999999999999999999966421 111 Q ss_pred -------------e-EEEeeCCcceeeheecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 63 -------------S-FVVFNAVASLTHILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 63 -------------~-~vv~n~~~~ltHLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) . +.+-|+.+| +--|||||.+ -++..+++.+.++..+-|++ ++.+=| T Consensus 75 ~~~~~vi~~~k~g~~iyi~Nn~pY-A~~LEyG~S~-----QAP~G~v~~t~~~~~~~v~~-~~~~~~ 134 (134) T protein:vir:80 75 QVLQQTVGQYKAGDTVHITNNAPY-IKELNSGSSQ-----QAPANFVETSIMRATRLIRN-VKVVPQ 134 (134) T ss_pred HHHHHHHhhccCcceEEEeeCchh-hhhhhccccC-----CCcchHHHHHHHHHHHHHHh-hccCCC Confidence 1 122255555 4457999997 77888888777666555544 554444 No 84 >protein:vir:102154 Length: 119 # NCBI annotation: phage protein, HK97 gp10 family # Family: family:all:10671 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699937;genbank:gi:110804042;genbank:GeneID:4206698 Probab=98.64 E-value=1.8e-10 Score=73.98 Aligned_cols=109 Identities=11% Similarity=0.005 Sum_probs=80.9 Q ss_pred Cch-H--HHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeecCceEEE--eeCCcceee Q lcl|NC_012756. 1 MSN-D--LADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLANGSFVV--FNAVASLTH 75 (115) Q Consensus 1 ~~d-~--La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~~~~~vv--~n~~~~ltH 75 (115) |++ + =-+++.+.|++.....+..-++++++.++-+.+++..++|++||++++ -+.....+|.+.| ....++..+ T Consensus 1 Ma~iel~G~del~~~l~~~g~~~~~ie~kAlk~g~e~I~~~~~~n~P~~tg~lkk-ik~~~kk~g~~~VG~~ks~~fy~k 79 (119) T protein:vir:10 1 MASLEIEGFEEFEKFISEDMVLDESTKRKGIKAGITKIGKAIEKNSPIKSGRLSK-VKIRVKNTGLATEGTASSSEFYDI 79 (119) T ss_pred CceeehhhHHHHHHHHHhhhhhhHHHHHHHHHHHhHHHHHHHhhcCCcccCCcce-eeeeeecCceeEeccCCcchhhhh Confidence 553 1 124666667777888888899999999999999999999999999997 2222233444433 234667799 Q ss_pred heecceeecCCcccCcc-chhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 76 ILENGHLSRNGGRVAGI-VHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 76 LLE~GHakr~GGrV~~~-phI~paee~~~~~~~~~i~~ii~ 115 (115) ++|||.++ .|++ ||+.|+.+....+-.+.|.+++. T Consensus 80 F~EFGTSk-----m~a~~pF~~~a~~~~~~eA~~~~~~el~ 115 (119) T protein:vir:10 80 FQNFGTSE-----QKAHVGYFDRAVDETTNEAVEEVAEIIF 115 (119) T ss_pred hccccccc-----cCCCCCccccccccChHHHHHHHHHHHH Confidence 99999996 8998 99999998666555555555555 No 85 >protein:vir:94944 Length: 121 # NCBI annotation: hypothetical protein phage protein # Family: family:all:448 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239282;genbank:gi:66392064;genbank:GeneID:5076589 Probab=98.58 E-value=2.1e-10 Score=73.66 Aligned_cols=93 Identities=23% Similarity=0.278 Sum_probs=70.6 Q ss_pred CchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhcee-------------ecC------ Q lcl|NC_012756. 1 MSNDLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKK-------------LAN------ 61 (115) Q Consensus 1 ~~d~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk-------------~~~------ 61 (115) |+=+++..|. .+.+.+++.++..+++++.++...+...||+|||.||.+|...- .+. T Consensus 2 ~~~sf~~~i~----~~~~~ve~~~~~~~r~~~~~~~~~vv~~sPVdtGrfRanw~vs~~~p~~~~~~~~dp~g~~t~~~~ 77 (121) T protein:vir:94 2 ISMKFNVNLS----RLRSNLREEAKKKAIRIAQEIVNGVIARSPVLAGDYRSSWNVSEGSMEFKFNNGGNPANPTPAPAI 77 (121) T ss_pred ccchhhccHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCchhhhhccccccccCcccccCCCCCCCcchhHHHH Confidence 6666666655 48888999999999999999999999999999999999996631 111 Q ss_pred -------c-eEEEeeCCcceeeheecceeecCCcccCccchhhhHHHHHH Q lcl|NC_012756. 62 -------G-SFVVFNAVASLTHILENGHLSRNGGRVAGIVHIKPAEEKAI 103 (115) Q Consensus 62 -------~-~~vv~n~~~~ltHLLE~GHakr~GGrV~~~phI~paee~~~ 103 (115) + .+.+-|+.++.. -|||||.+ -++..+++.+..... T Consensus 78 ~~~~~~~~~~iyi~NnlpYA~-~LE~G~S~-----QAP~G~v~~t~~~~q 121 (121) T protein:vir:94 78 VVSSNVALPHFYITNGAPYAQ-QLEKGSST-----QAPLGIVRVTLASLR 121 (121) T ss_pred HHHHhhccceEEEeeCcchhh-hhhcccCC-----CCcchHHHHHHHhhC Confidence 1 122345555544 47999996 788888888876555 No 86 >protein:vir:97982 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1482 # MgeName: Orion # Cross-refs: genbank:acc:YP_655121;genbank:gi:109391871;genbank:GeneID:4157345 Probab=98.49 E-value=3.8e-10 Score=72.24 Aligned_cols=111 Identities=14% Similarity=0.131 Sum_probs=69.3 Q ss_pred CchHHHH-HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeecCc--eEEEee-CCcceeeh Q lcl|NC_012756. 1 MSNDLAD-LIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLANG--SFVVFN-AVASLTHI 76 (115) Q Consensus 1 ~~d~La~-~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~~~--~~vv~n-~~~~ltHL 76 (115) |+.=.+. .+.-.+....+.+...++..++.++..+..+++.++|+|||.|++||+......+ ++++.. .....+.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~ak~~aPvdtG~Lr~SI~~~~~~~~~~~~~~~v~~~a~YA~~ 80 (140) T protein:vir:97 1 MATIRARARIEIDEAALERESGEHLRAFHRSLTRRIANQSRVAVPVRTGNLGRTIGELPQVYTPFRVRGGVEATADYAAP 80 (140) T ss_pred CeeeeeeeeeeeCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccchhhhccceeeeeeCCCceEEEEecCCccchhh Confidence 3321111 1111222344567778888889999999999999999999999999997654433 333221 22233789 Q ss_pred eecceee--------------cCCcc----------cCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 77 LENGHLS--------------RNGGR----------VAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 77 LE~GHak--------------r~GGr----------V~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) +|||+.- .+|++ .+++|||.||.++.+.+ .++|.- T Consensus 81 Ve~GT~ph~I~pk~~k~L~~~~~G~~~~~k~V~hpG~~a~Pfl~~A~~~~~~~----~~~i~~ 139 (140) T protein:vir:97 81 VHEGSRPHAIRARNAQYLHFWWHGREMFRKSVWHPGTRARPFMRNSAQRVVTN----DPRVRM 139 (140) T ss_pred hccCCCCceeecCCCccceeecCCCEEEeeeeecCCCCCChhHHHHHHHHhhh----hhhccC Confidence 9999741 22332 34899999999776332 223322 No 87 >protein:vir:107545 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1481 # MgeName: PG1 # Cross-refs: genbank:acc:NP_943803;genbank:gi:38638428;genbank:GeneID:2657225 Probab=98.49 E-value=3.8e-10 Score=72.24 Aligned_cols=111 Identities=14% Similarity=0.131 Sum_probs=69.3 Q ss_pred CchHHHH-HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeecCc--eEEEee-CCcceeeh Q lcl|NC_012756. 1 MSNDLAD-LIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLANG--SFVVFN-AVASLTHI 76 (115) Q Consensus 1 ~~d~La~-~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~~~--~~vv~n-~~~~ltHL 76 (115) |+.=.+. .+.-.+....+.+...++..++.++..+..+++.++|+|||.|++||+......+ ++++.. .....+.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~ak~~aPvdtG~Lr~SI~~~~~~~~~~~~~~~v~~~a~YA~~ 80 (140) T protein:vir:10 1 MATIRARARIEIDEAALERESGEHLRAFHRSLTRRIANQSRVAVPVRTGNLGRTIGELPQVYTPFRVRGGVEATADYAAP 80 (140) T ss_pred CeeeeeeeeeeeCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccchhhhccceeeeeeCCCceEEEEecCCccchhh Confidence 3321111 1111222344567778888889999999999999999999999999997654433 333221 22233789 Q ss_pred eecceee--------------cCCcc----------cCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 77 LENGHLS--------------RNGGR----------VAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 77 LE~GHak--------------r~GGr----------V~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) +|||+.- .+|++ .+++|||.||.++.+.+ .++|.- T Consensus 81 Ve~GT~ph~I~pk~~k~L~~~~~G~~~~~k~V~hpG~~a~Pfl~~A~~~~~~~----~~~i~~ 139 (140) T protein:vir:10 81 VHEGSRPHAIRARNAQYLHFWWHGREMFRKSVWHPGTRARPFMRNSAQRVVTN----DPRVRM 139 (140) T ss_pred hccCCCCceeecCCCccceeecCCCEEEeeeeecCCCCCChhHHHHHHHHhhh----hhhccC Confidence 9999741 22332 34899999999776332 223322 No 88 >protein:vir:102441 Length: 137 # NCBI annotation: gp26 # Family: family:all:1084 # MgeID: mge:1618 # MgeName: Pipefish # Cross-refs: genbank:acc:YP_655303;genbank:gi:109521866;genbank:GeneID:4157756 Probab=98.36 E-value=5.8e-10 Score=71.24 Aligned_cols=104 Identities=11% Similarity=0.126 Sum_probs=65.0 Q ss_pred CchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeecC---ceEE--E-eeCCccee Q lcl|NC_012756. 1 MSNDLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLAN---GSFV--V-FNAVASLT 74 (115) Q Consensus 1 ~~d~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~~---~~~v--v-~n~~~~lt 74 (115) .|-.+..- ...-.+.+...++.+++.++..+..+++.++|+|||.+++|++.....+ ++.. | .+..| + T Consensus 3 ~~~~~~~~----~~~~~~~~~~v~r~~l~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~~V~~~~~Y--A 76 (137) T protein:vir:10 3 VTARYERN----PVGEARQFQVIARRRLSRITRGTANQARADVPVKTGNLGRSIREDPIVVAGPLRLDSGVTAHADY--A 76 (137) T ss_pred eEEEeccC----chhHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccchhhhcCceeeeeeccccceEEEEecCCCcc--c Confidence 11111110 1111234555677777888889999999999999999999998654332 2322 2 23344 8 Q ss_pred eheecceee--------c-------CCcc----------cCccchhhhHHHHHHHHHHHHH Q lcl|NC_012756. 75 HILENGHLS--------R-------NGGR----------VAGIVHIKPAEEKAIQNFEKRI 110 (115) Q Consensus 75 HLLE~GHak--------r-------~GGr----------V~~~phI~paee~~~~~~~~~i 110 (115) +++|||..- + +|++ .+++|||+||.++++.+-..-- T Consensus 77 ~~ve~GT~ph~I~Pk~~k~~l~~~~~g~~vf~k~V~hPG~~a~PfL~~A~~~~~~~~~~~~ 137 (137) T protein:vir:10 77 RYVHDGTRAHVIRPRRPGGVLRFTVGGRVVYARRVNHPGTRARPFLRNAAERVVARETATS 137 (137) T ss_pred eeeecCCCCceeeccccceeeeEeeCCeeEecceeecCCCCCCchHHHHHHHhhhhhcccC Confidence 999999741 1 2332 2389999999988776544333 No 89 >protein:vir:96774 Length: 152 # NCBI annotation: hypothetical phage protein # Family: family:all:448 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224253;genbank:gi:62362388;genbank:GeneID:3345713 Probab=98.33 E-value=4.5e-09 Score=66.37 Aligned_cols=99 Identities=20% Similarity=0.180 Sum_probs=70.5 Q ss_pred CchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--------------cchhhhcchhceeec------ Q lcl|NC_012756. 1 MSNDLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPK--------------RYGKYRRSWKKKKLA------ 60 (115) Q Consensus 1 ~~d~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~--------------~TG~y~k~W~~kk~~------ 60 (115) || ++.. +..+.+.+++.++..+++++.++.+.+...||+ |||.|+.+|...-.. T Consensus 11 ms--Faa~----i~~~~~~~e~~~~~~~R~~~~~i~~~vv~~sPVg~~~~~~~~a~~~ydtGrfRanw~vS~~~p~~~~~ 84 (152) T protein:vir:96 11 MS--WSKS----LKNIIVKNENLTEKQLRAGLFDAANTVILGSPVGAPELWQQPAPNYYRAGSYRSNHRVSISKITSFEK 84 (152) T ss_pred cc--cccc----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccchhhhhhhheeeecCCCcccc Confidence 32 2222 334777788999999999999999999999999 999999999775211 Q ss_pred -----Cc----------------eEEEeeCCcceeeheecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 61 -----NG----------------SFVVFNAVASLTHILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 61 -----~~----------------~~vv~n~~~~ltHLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) .+ .+.+-|+-++... |||||.. -++..+++.+.+...+-|++ +|+ T Consensus 85 ~~~~~~~t~~~~~~~i~~~~~g~~iyi~NnlPYA~~-LEyG~S~-----QAP~G~vr~t~~~~~~~v~e----a~~ 150 (152) T protein:vir:96 85 GISSQSSIMMDLQSDIAKFKIGETLFMTNPLPYATS-IEYGHSS-----QAPNGVYRPAVRRLVKFLNT----ELK 150 (152) T ss_pred cCCCCCchHHHHHHHHhhccccceEEEeeCchhhhH-hhccccC-----CCCchHHHHHHHHHHHHHHH----Hhc Confidence 00 1123355555454 5999997 78888888888665555544 555 No 90 >protein:vir:100652 Length: 134 # NCBI annotation: 77ORF029 # Family: family:all:589 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958610;genbank:gi:41189542;genbank:GeneID:2743798 Probab=98.28 E-value=1e-08 Score=64.42 Aligned_cols=115 Identities=21% Similarity=0.282 Sum_probs=81.0 Q ss_pred CchHHH--HHHHHHHHhh--HHHHHHHHHHHHHHHHHHHHHHHHHhCC--ccchhhhcchhceeec--Cc--eEEE-e-e Q lcl|NC_012756. 1 MSNDLA--DLIAKELAAY--SDEVTEEVDKIAEQVADETVDELKETSP--KRYGKYRRSWKKKKLA--NG--SFVV-F-N 68 (115) Q Consensus 1 ~~d~La--~~I~~~L~~y--~~~v~~~~~~~~~~~a~~~~~~lk~~sP--~~TG~y~k~W~~kk~~--~~--~~vv-~-n 68 (115) ||=+|- ++|.+.|++- ...+....++++.+.|+.+++++|.+-+ +|||..-.+-...++. +| ++.| + . T Consensus 1 MsvevkGv~eil~~LE~k~g~~~~~ri~dkAL~~age~v~~~~K~~~~~fkDTGati~ev~~s~p~~~~G~r~V~vgW~G 80 (134) T protein:vir:10 1 MSVKVTGDKALERELEKHFGIKEMVKVQDKALIAGAKVIVEEIKKQLKPSEDSGALISEIGRTEPEWIKGKRTVTIRWRG 80 (134) T ss_pred CeEEeecHHHHHHHHHHhhchhhhhhhhhHHHHHHhHHHHHHHHhhcCccccccceeccEeecCeeecCCceEEEEEEEc Confidence 885553 4666666664 5677778889999999999999998654 6999999888776553 22 3333 4 2 Q ss_pred --CCcceeeheecceee-cCCcccCccc--hhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 69 --AVASLTHILENGHLS-RNGGRVAGIV--HIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 69 --~~~~ltHLLE~GHak-r~GGrV~~~p--hI~paee~~~~~~~~~i~~ii~ 115 (115) .+|.+-||.||||.+ |+|-++.++- -|+.+.+.....|.+.|++=++ T Consensus 81 ~~~R~~ivHLnE~Gyt~~r~Gk~i~PrG~G~i~~a~~~~e~~~~~~ik~eL~ 132 (134) T protein:vir:10 81 PFERFRIVHLIENGHVEKKSGKFVKPKAMGGINRAIRQGQNKYFETLKRELK 132 (134) T ss_pred CCceeeEEEeeecceeecCCCCeeccchhhHHHHHHHhhhHHHHHHHHHHHh Confidence 379999999999985 8887765543 3555666555555555555555 No 91 >protein:vir:4956 Length: 153 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049932;genbank:gi:9632903;genbank:GeneID:1262079 Probab=98.22 E-value=1.5e-08 Score=63.47 Aligned_cols=109 Identities=18% Similarity=0.164 Sum_probs=76.3 Q ss_pred CchHHH---HHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCcc------ch---hhhcchhcee-----ecCce Q lcl|NC_012756. 1 MSNDLA---DLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKR------YG---KYRRSWKKKK-----LANGS 63 (115) Q Consensus 1 ~~d~La---~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~------TG---~y~k~W~~kk-----~~~~~ 63 (115) |+| |+ .++.+.|+.-...+.+.-.++++.-|+...+.|+..+|.. || -++++-..+. ..+|+ T Consensus 1 M~~-~~~glee~~~~lekL~~~~~~~~~katkAGA~v~~e~L~~~tp~~h~~~~kt~~~~HlaD~I~~s~~~idG~~dG~ 79 (153) T protein:vir:49 1 MTG-LDEALEGWLKTVASIGDLTPAEQAKITTAGAKVFKEELAEVTREKHYSKKKDLKYGHMADGLAVQSTNADGRKNGV 79 (153) T ss_pred Ccc-HHHHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCCCCCCcccccceeccccccccccce Confidence 554 33 4555556666666667777889988999999999998853 44 4667765532 12334 Q ss_pred EEE-e-eC-CcceeeheecceeecCCcccCccchhhhHHHHH------HHHHHHHHHHHhC Q lcl|NC_012756. 64 FVV-F-NA-VASLTHILENGHLSRNGGRVAGIVHIKPAEEKA------IQNFEKRIKEIGK 115 (115) Q Consensus 64 ~vv-~-n~-~~~ltHLLE~GHakr~GGrV~~~phI~paee~~------~~~~~~~i~~ii~ 115 (115) .+| + ++ ..+++|+||+|..+ .||+|||.++.+.+ .+.+.+.+++||. T Consensus 80 s~VG~~~~~~a~~a~f~n~GT~k-----m~~~hFie~tr~e~~~k~~vl~A~~~~~~~il~ 135 (153) T protein:vir:49 80 STVGWKNNYHAQNARRLNDGTKK-----YRADHFITNVQNDSTVKNKVLLAEKEEYEKLIR 135 (153) T ss_pred eeecccCCccceeeeecccCccc-----CCCChhhHHHHHHhhHHHHHHHHHHHHHHHHHH Confidence 433 3 33 36899999999986 89999999988753 3445578888888 No 92 >protein:vir:99528 Length: 92 # NCBI annotation: putative major tail protein # Family: family:all:180 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958541;genbank:gi:41179323;genbank:GeneID:2717166 Probab=98.17 E-value=1.2e-08 Score=63.95 Aligned_cols=82 Identities=17% Similarity=0.193 Sum_probs=60.9 Q ss_pred Cch-----HHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeecCc-eEEE--e-e-CC Q lcl|NC_012756. 1 MSN-----DLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLANG-SFVV--F-N-AV 70 (115) Q Consensus 1 ~~d-----~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~~~-~~vv--~-n-~~ 70 (115) ||+ +=-+++.+.|++-+. .+.+++.|++.+..+.+++++++|++||.+++||+.....++ +.+| . . .. T Consensus 1 Ma~~~i~~~Gld~L~~~L~~~~~--~~~v~~vv~~~~~~l~~~ak~~ap~dTG~lrrSI~~~~~~~g~~~~v~~~gp~a~ 78 (92) T protein:vir:99 1 MADYSISWDGLDALDEALANQQN--MNTVKKVVKKHTANLMTATQQAVPVDTGHLKQSAQIQISRDGFTGSVTYGGGLVN 78 (92) T ss_pred CCceeeEeehHHHHHHHHHhhcc--HHHHHHHHHHHHHHHHHHHHHhCCCCccccceeeeEEeecCCeeEEEEeccCccc Confidence 776 124667777776443 356899999999999999999999999999999997754443 2222 2 2 34 Q ss_pred cceeeheecceeecCCcccCc Q lcl|NC_012756. 71 ASLTHILENGHLSRNGGRVAG 91 (115) Q Consensus 71 ~~ltHLLE~GHakr~GGrV~~ 91 (115) | +-+||||++ +++| T Consensus 79 Y--a~YvE~GTR-----~M~A 92 (92) T protein:vir:99 79 Y--AAYVEFGTR-----FMDS 92 (92) T ss_pred c--cccccccee-----ecCC Confidence 4 678999999 4777 No 93 >protein:vir:100887 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358767;genbank:gi:77999993;genbank:GeneID:3726158 Probab=98.12 E-value=4.2e-08 Score=61.04 Aligned_cols=109 Identities=19% Similarity=0.205 Sum_probs=73.2 Q ss_pred CchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCc-------cch---hhhcchhceeec-----CceE- Q lcl|NC_012756. 1 MSNDLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPK-------RYG---KYRRSWKKKKLA-----NGSF- 64 (115) Q Consensus 1 ~~d~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~-------~TG---~y~k~W~~kk~~-----~~~~- 64 (115) |.+.|. ++.+.|+.-.....+.-.++++..|+...+.|++++|. +|| -++++-..+... +++. T Consensus 3 ~~~~le-e~l~~i~kl~~~~~~~~~ki~kaGA~v~~e~L~~~tp~~~~~~~~~~~~~~HlaD~I~~s~~~~dg~~~g~~~ 81 (139) T protein:vir:10 3 MDEALG-QWLKQVSKAAELSISDQEKITKAGADVYAKKLAETTKEKHPNTKGDGGKYGHLSEDIRSAAGDIDGDHNGSST 81 (139) T ss_pred HHHHHH-HHHHHHHHhhccCHHHHHHHHHHHHHHHHHHHHHhcccccCcCCCCCCCCcchhhcceecCcccccccceeee Confidence 553333 33333444444444555678888899999999999996 243 477777664321 2222 Q ss_pred EEeeCCcceeeheecceeecCCcccCccchhhhHHHHHHHHHHHHH----HHHhC Q lcl|NC_012756. 65 VVFNAVASLTHILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRI----KEIGK 115 (115) Q Consensus 65 vv~n~~~~ltHLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i----~~ii~ 115 (115) |=++++++++|++|+|..+ .||+|||..+.+.+.++..+.+ ++++. T Consensus 82 VG~~k~~~~A~f~n~GT~k-----~~~~hFie~t~~e~~~evl~a~~~~~k~~l~ 131 (139) T protein:vir:10 82 VGFHNKAHIARFLNDGTKY-----IRADHFVDNARDDAKDAVFAAEAEKYQAMIA 131 (139) T ss_pred eCCCCCcceEeecccCccc-----cCCCchHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 3367789999999999986 9999999999887666555444 44444 No 94 >protein:vir:7993 Length: 108 # NCBI annotation: gp9 # Family: family:all:3937 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817347;genbank:gi:29565775;genbank:GeneID:1259013 Probab=98.10 E-value=3.7e-09 Score=66.83 Aligned_cols=87 Identities=30% Similarity=0.406 Sum_probs=49.5 Q ss_pred CchH------HHHHHHHHHHhh--HHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhcee-ecCceEEEeeCCc Q lcl|NC_012756. 1 MSND------LADLIAKELAAY--SDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKK-LANGSFVVFNAVA 71 (115) Q Consensus 1 ~~d~------La~~I~~~L~~y--~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk-~~~~~~vv~n~~~ 71 (115) |++. |+..=. .|.++ .-+|..++.++.+ +++..+|++||+++|.|+.||.+.+ ..+...-+...+. T Consensus 1 ma~gpt~kNP~~KFGv-s~~d~~K~~EVn~GvNeFMd----E~~~~~K~~SPV~~G~Y~~S~~V~ers~NkGRG~~G~~~ 75 (108) T protein:vir:79 1 MANGPTRKNPLAKFGV-RLDDFDKLPEVNQGVNEFMD----EVVDAWKNNSPVGTGAYRDSVQVTERSTNKGRGKVGATD 75 (108) T ss_pred CCCCcccccchhhhcC-ChhhhhhchhhhhhHHHHHH----HHHHHHhhcCCCCchhhHHHHHHHHhhhccCccccCCcc Confidence 2211 111000 01111 3345555555554 6778899999999999999998854 4443344445555 Q ss_pred ceeeheecceeecC------------CcccCcc Q lcl|NC_012756. 72 SLTHILENGHLSRN------------GGRVAGI 92 (115) Q Consensus 72 ~ltHLLE~GHakr~------------GGrV~~~ 92 (115) +.+||+|||-+..+ ||-.-+. T Consensus 76 ~~AH~VEFGs~hndeyapaqktakqfggtay~d 108 (108) T protein:vir:79 76 PQAHLVEFGSAHNDEYAPAQKTAKQFGGTAYGD 108 (108) T ss_pred hhhhhhhhhccccccccchhhHHHhhcccccCC Confidence 55999999977532 3322222 No 95 >protein:vir:100223 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025034;genbank:gi:48697267;genbank:GeneID:2948321 Probab=98.08 E-value=4.2e-08 Score=61.00 Aligned_cols=109 Identities=20% Similarity=0.248 Sum_probs=72.9 Q ss_pred CchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCc-------cch---hhhcchhcee-----ecCceEE Q lcl|NC_012756. 1 MSNDLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPK-------RYG---KYRRSWKKKK-----LANGSFV 65 (115) Q Consensus 1 ~~d~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~-------~TG---~y~k~W~~kk-----~~~~~~v 65 (115) |.+.|.. +.+.|+.-.....+.-.+++...|+...+.|+.++|. ++| -++++-..+. ..+++.. T Consensus 3 ~~~~l~e-~l~~lekl~~~~~~~~~k~tkaGA~v~~~~L~~~tp~~~~~~~~~~~~~~HlaD~I~~~~~~idg~~~g~~~ 81 (139) T protein:vir:10 3 MDEALGQ-WLKQVSKAAQLSVSDQEKITKAGADVYAKELAETTKEKHPNTKGDGGKYGHLSEDISSAAGDIDGDHNGSST 81 (139) T ss_pred HHHHHHH-HHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHhcccccccCCCCCCCCCcccccceecCccccccccccce Confidence 5544433 3333444444455556778888999999999999994 333 3777765543 1233333 Q ss_pred E-eeCCcceeeheecceeecCCcccCccchhhhHHHHHHHHHH----HHHHHHhC Q lcl|NC_012756. 66 V-FNAVASLTHILENGHLSRNGGRVAGIVHIKPAEEKAIQNFE----KRIKEIGK 115 (115) Q Consensus 66 v-~n~~~~ltHLLE~GHakr~GGrV~~~phI~paee~~~~~~~----~~i~~ii~ 115 (115) | +.++++++|+||+|-.+ .||+|||..+.+.+.++.. +.++++|. T Consensus 82 VG~~~~~~~Ahf~n~GT~~-----~~~~hFie~t~~e~~~ev~~a~~~~~ke~l~ 131 (139) T protein:vir:10 82 VGFHNKAHIARFLNDGTKN-----IRADHFVDNARDDAKDAVFAAEAEKYQAMIA 131 (139) T ss_pred eCCCCCceeeeeeccCccc-----cCCCchHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 3 56779999999999885 9999999998887665544 44555555 No 96 >protein:vir:9513 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835560;genbank:gi:30043947;genbank:GeneID:1260542 Probab=98.06 E-value=4.9e-08 Score=60.65 Aligned_cols=115 Identities=22% Similarity=0.260 Sum_probs=81.7 Q ss_pred CchHHH--HHHHHHHHhh--HHHHHHHHHHHHHHHHHHHHHHHHHhCC--ccchhhhcchhceeec--Cc--eE-EEe-e Q lcl|NC_012756. 1 MSNDLA--DLIAKELAAY--SDEVTEEVDKIAEQVADETVDELKETSP--KRYGKYRRSWKKKKLA--NG--SF-VVF-N 68 (115) Q Consensus 1 ~~d~La--~~I~~~L~~y--~~~v~~~~~~~~~~~a~~~~~~lk~~sP--~~TG~y~k~W~~kk~~--~~--~~-vv~-n 68 (115) ||=+|- ++|.+.|++- ...+....+.++.+.++.+++++|.+-+ +|||..-.+-...++. ++ ++ |.+ . T Consensus 1 msvevkGv~eil~~le~k~g~~~~~ri~nkAL~~age~v~~~~K~~~~~fkDTG~t~~ev~~s~p~~~~G~r~V~vgW~G 80 (134) T protein:vir:95 1 MSVKVIGDKALERELEKRFGIKEMVKVQDKALIAGAKVIVEEVKKQLKPSKDTGALINEVSFSKPEWINGKRTITVHWRG 80 (134) T ss_pred CeEEEecHHHHHHHHHHhhchhhhhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeccEEecCeeecCCceEEEEEEEc Confidence 886663 5777777775 6677788899999999999999999888 4999999998776554 22 23 334 2 Q ss_pred --CCcceeeheecceeec-CCcccCccc--hhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 69 --AVASLTHILENGHLSR-NGGRVAGIV--HIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 69 --~~~~ltHLLE~GHakr-~GGrV~~~p--hI~paee~~~~~~~~~i~~ii~ 115 (115) .+|.+-||.||||..+ +|-++.++- -|+.+.+.....|.+.|++=++ T Consensus 81 ~~~R~~iiHLNE~Gytr~~~Gk~i~PrG~G~i~~a~~~~e~~~~~~ik~eL~ 132 (134) T protein:vir:95 81 SKDRYKIVHLIEYGHVQKGTGKFIKPKAMGGVNRAIRQGQNKYFETLKRELK 132 (134) T ss_pred CCceeEEEEeecccceecccCCccCcchhhHHHHHHHhhhHHHHHHHHHHHh Confidence 3799999999999987 476554432 3444555555555555555444 No 97 >protein:vir:101302 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908835;genbank:gi:118725099;genbank:GeneID:4555873 Probab=98.06 E-value=4.9e-08 Score=60.65 Aligned_cols=115 Identities=22% Similarity=0.260 Sum_probs=81.7 Q ss_pred CchHHH--HHHHHHHHhh--HHHHHHHHHHHHHHHHHHHHHHHHHhCC--ccchhhhcchhceeec--Cc--eE-EEe-e Q lcl|NC_012756. 1 MSNDLA--DLIAKELAAY--SDEVTEEVDKIAEQVADETVDELKETSP--KRYGKYRRSWKKKKLA--NG--SF-VVF-N 68 (115) Q Consensus 1 ~~d~La--~~I~~~L~~y--~~~v~~~~~~~~~~~a~~~~~~lk~~sP--~~TG~y~k~W~~kk~~--~~--~~-vv~-n 68 (115) ||=+|- ++|.+.|++- ...+....+.++.+.++.+++++|.+-+ +|||..-.+-...++. ++ ++ |.+ . T Consensus 1 msvevkGv~eil~~le~k~g~~~~~ri~nkAL~~age~v~~~~K~~~~~fkDTG~t~~ev~~s~p~~~~G~r~V~vgW~G 80 (134) T protein:vir:10 1 MSVKVIGDKALERELEKRFGIKEMVKVQDKALIAGAKVIVEEVKKQLKPSKDTGALINEVSFSKPEWINGKRTITVHWRG 80 (134) T ss_pred CeEEEecHHHHHHHHHHhhchhhhhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeccEEecCeeecCCceEEEEEEEc Confidence 886663 5777777775 6677788899999999999999999888 4999999998776554 22 23 334 2 Q ss_pred --CCcceeeheecceeec-CCcccCccc--hhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 69 --AVASLTHILENGHLSR-NGGRVAGIV--HIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 69 --~~~~ltHLLE~GHakr-~GGrV~~~p--hI~paee~~~~~~~~~i~~ii~ 115 (115) .+|.+-||.||||..+ +|-++.++- -|+.+.+.....|.+.|++=++ T Consensus 81 ~~~R~~iiHLNE~Gytr~~~Gk~i~PrG~G~i~~a~~~~e~~~~~~ik~eL~ 132 (134) T protein:vir:10 81 SKDRYKIVHLIEYGHVQKGTGKFIKPKAMGGVNRAIRQGQNKYFETLKRELK 132 (134) T ss_pred CCceeEEEEeecccceecccCCccCcchhhHHHHHHHhhhHHHHHHHHHHHh Confidence 3799999999999987 476554432 3444555555555555555444 No 98 >protein:vir:9647 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795409;genbank:gi:28876182;genbank:GeneID:1257731 Probab=97.85 E-value=1.9e-07 Score=57.39 Aligned_cols=111 Identities=16% Similarity=0.173 Sum_probs=79.4 Q ss_pred Cc---hHH-HHHHHHHHHh-hHH-HHHHHHHHHHHHHHHHHHHHHHHhCCc--cchhhhcchhceeec--Cc--eEE-Ee Q lcl|NC_012756. 1 MS---NDL-ADLIAKELAA-YSD-EVTEEVDKIAEQVADETVDELKETSPK--RYGKYRRSWKKKKLA--NG--SFV-VF 67 (115) Q Consensus 1 ~~---d~L-a~~I~~~L~~-y~~-~v~~~~~~~~~~~a~~~~~~lk~~sP~--~TG~y~k~W~~kk~~--~~--~~v-v~ 67 (115) || +=. -++|.+.|+. |.. .+....+.++.+.|+.+++.||++.|+ |||..-..-...++. +| ++. .+ T Consensus 1 ~~~~aevkGv~Eilk~lE~klG~~~v~ri~nkAL~~~ge~v~~~lK~~~~~f~DTG~t~dev~~s~~~~~~G~r~V~VgW 80 (132) T protein:vir:96 1 MSGFANLKGVEELLANMEKKLGPAKVNRVVNRSLKEIGKELEPSFKSAISIYKRTGETTESAVVSGVRREDGIPKVKLGF 80 (132) T ss_pred CCccccccCHHHHHHHHHHhhCHHHHHHHhHHHHHHHHHHHHHHHHHhhhhhhhcchhhcceeecCeeecCCceEEEecc Confidence 43 322 2578888888 887 588889999999999999999999994 999988888776554 23 222 23 Q ss_pred -eCCcceeeheecceeecCCcc--cCccchhhhHHHHHH----HHHHHHHHHHhC Q lcl|NC_012756. 68 -NAVASLTHILENGHLSRNGGR--VAGIVHIKPAEEKAI----QNFEKRIKEIGK 115 (115) Q Consensus 68 -n~~~~ltHLLE~GHakr~GGr--V~~~phI~paee~~~----~~~~~~i~~ii~ 115 (115) ..+|+|-||-|||| |-+ -+|.-.|+.+.+... +.+.+.+++.++ T Consensus 81 ~GpR~~ivHLNE~Gy----Gk~~~PrG~G~I~~a~~~se~~~~~~~~~elkk~l~ 131 (132) T protein:vir:96 81 TTPRWNIVHLQELEY----GWKHNRRGVGVIRRYSDILETIYPRGIRDKLKRGFD 131 (132) T ss_pred cCCceeEEeeecccc----cCCcCCCcchHHHHHHHhhhhHHHHHHHHHHHHHhc Confidence 35899999999999 433 445556777776666 444455555555 No 99 >protein:vir:78335 Length: 133 # NCBI annotation: gp9 # Family: family:all:589 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468648;genbank:gi:157325225;genbank:GeneID:5601681 Probab=97.83 E-value=2e-07 Score=57.27 Aligned_cols=114 Identities=17% Similarity=0.220 Sum_probs=80.2 Q ss_pred CchHHH--HHHHHHHHh-hHHH-HHHHHHHHHHHHHHHHHHHHHH--hCCccchhhhcchhceeec--Cce--E-EEe-e Q lcl|NC_012756. 1 MSNDLA--DLIAKELAA-YSDE-VTEEVDKIAEQVADETVDELKE--TSPKRYGKYRRSWKKKKLA--NGS--F-VVF-N 68 (115) Q Consensus 1 ~~d~La--~~I~~~L~~-y~~~-v~~~~~~~~~~~a~~~~~~lk~--~sP~~TG~y~k~W~~kk~~--~~~--~-vv~-n 68 (115) ||=++- ++|.+.|+. |... +....+.++.+.++.+++.||+ .+.+|||.--..-...++. ++. + |.+ . T Consensus 1 msvevkGv~eilk~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~G~r~V~i~W~g 80 (133) T protein:vir:78 1 MSVEVTGVEELERQLVSLFGRENLPQLVDPALIAGATLVAKTLKSEFVQFKDTGASIDEINIEKPSYDKGVRSIKIDWKG 80 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHHhhcchhcccceeeeEEecCeeeeCCceEEEEEEec Confidence 876653 477777776 5544 4667788899999999999999 5789999988887766553 332 2 334 3 Q ss_pred C--CcceeeheecceeecCCcccCccch--hhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 69 A--VASLTHILENGHLSRNGGRVAGIVH--IKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 69 ~--~~~ltHLLE~GHakr~GGrV~~~ph--I~paee~~~~~~~~~i~~ii~ 115 (115) . +|.|-||-|||+ +|||-++.++-| |+.+.+.....|.+.|++=++ T Consensus 81 p~~R~~iVHLNE~GY-tr~Gk~i~PrG~G~i~~a~~~se~~y~~~vk~el~ 130 (133) T protein:vir:78 81 PKDRYKIIHLNEYGY-TRNGKKITPAGTGSVARSLRISERAYRAIVQKKIG 130 (133) T ss_pred CCCceeEEEeeccce-ecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHHH Confidence 3 799999999998 779987766554 666666655555555544444 No 100 >protein:vir:8106 Length: 150 # NCBI annotation: gp10 # Family: family:all:3937 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817687;genbank:gi:29566118;genbank:GeneID:1259312 Probab=97.73 E-value=4.5e-08 Score=60.84 Aligned_cols=110 Identities=28% Similarity=0.355 Sum_probs=59.7 Q ss_pred CchHHH------HHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhcee-ecCceEEEeeCCcce Q lcl|NC_012756. 1 MSNDLA------DLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKK-LANGSFVVFNAVASL 73 (115) Q Consensus 1 ~~d~La------~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk-~~~~~~vv~n~~~~l 73 (115) |.+-|+ +++.+.+.+ +.+|...+.++++++| .-..|++||++.|.|+.||.+.+ ..++.- ++..+.+. T Consensus 1 mgNP~~KFGvS~~e~~K~irn-s~EV~~GiNdFMe~~A---~~~aK~~SPV~~GeY~~S~~V~~ka~NGRG-~~G~~~~~ 75 (150) T protein:vir:81 1 MGNPFEKFGVSDSELAKHIRN-SAEVDAGINDFMENEA---IPYAKSISPVDDGEYAASWAVMKKAKNGRG-VFGPKAWY 75 (150) T ss_pred CCCchhhhcCCHHHHHHhhcc-chhhhhhHHHHHHhhh---hhhhhccCCcccchhHHHHHHHhhcccCcc-ccCccchh Confidence 544443 234444433 6778888999998655 45678999999999999998854 445444 44555566 Q ss_pred eeheecceee--cC-----------Cc-----------ccCc-cchhhhH-HHHHHHHHHHHHHHHhC Q lcl|NC_012756. 74 THILENGHLS--RN-----------GG-----------RVAG-IVHIKPA-EEKAIQNFEKRIKEIGK 115 (115) Q Consensus 74 tHLLE~GHak--r~-----------GG-----------rV~~-~phI~pa-ee~~~~~~~~~i~~ii~ 115 (115) +||+|||.-. +. |. ||-+ -|--+.. .++...-|---++--|- T Consensus 76 AH~VEFGtgadkkqgrgkkgkrgkdgkrtveiddgefrrvgpdtptkaqgiaqkvashfggslkggis 143 (150) T protein:vir:81 76 AHFVEFGTGADKKQGRGKKGKRGKDGKRTVEIDDGEFRRVGPDTPTKAQGIAQKVASHFGGSLKGGIS 143 (150) T ss_pred hhhhhhccccccccccccccccCcccceeeeecCccceecCCCCchhhhhHHHHHHHhcccccccccc Confidence 9999999541 11 11 2211 1111100 11111112212221111 No 101 >protein:vir:106506 Length: 137 # NCBI annotation: Pas21 # Family: family:all:1084 # MgeID: mge:1680 # MgeName: phiAsp2 # Cross-refs: genbank:acc:YP_024807;genbank:gi:48697422;genbank:GeneID:2846163 Probab=97.72 E-value=1.9e-07 Score=57.48 Aligned_cols=103 Identities=19% Similarity=0.148 Sum_probs=63.3 Q ss_pred CchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeecC-ceEE---Ee-eCCcceee Q lcl|NC_012756. 1 MSNDLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLAN-GSFV---VF-NAVASLTH 75 (115) Q Consensus 1 ~~d~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~~-~~~v---v~-n~~~~ltH 75 (115) -+-.|.. ......+.+.++++++.++..+..+.|.++|++||.+++||+...... ++.+ |. |..| +- T Consensus 4 ~~~~l~~------~~l~~~~~~~~~~~~~~~a~~ve~~ak~~aPv~TG~Lr~SI~~~~~~~~g~~v~~~V~~~~~Y--A~ 75 (137) T protein:vir:10 4 HTLRIER------AQLHGLGMDEARKAVNRVVRRTFTRSQILAPVDTGYLRASGRLVLGRERGAVVIGSVEYTARY--AA 75 (137) T ss_pred cccccCh------hhHhhHHHHHHHHHHHHHHHHHHHHHHhcCCcCchhhhccceeeeeeccccEEEEEecCCccc--ce Confidence 1111211 112334566788889999999999999999999999999999876533 3322 22 2344 67 Q ss_pred heeccee---e-----------cCCcc-------cC---ccchhhhHHHHHHHH--HHHHHH Q lcl|NC_012756. 76 ILENGHL---S-----------RNGGR-------VA---GIVHIKPAEEKAIQN--FEKRIK 111 (115) Q Consensus 76 LLE~GHa---k-----------r~GGr-------V~---~~phI~paee~~~~~--~~~~i~ 111 (115) ++|+|.. . .+|++ .| ++|||+||.++.+.. |.--|- T Consensus 76 ~ve~GT~ph~I~pk~~kaL~f~~~G~~vf~k~V~hPG~k~~PfL~~Al~~~~~~~~~~~~~~ 137 (137) T protein:vir:10 76 AVHNGRRALTIRAKGNGRLKFTVEGRTVYARSVHQPARAGRPYLSQALREVAPQEGFRVTIG 137 (137) T ss_pred eeecCCCCceeecCCCccceeecCCeeEeccceecCCCCCChhhHHHHHHhhcccceeEeeC Confidence 8999963 2 11322 23 889999998754432 000000 No 102 >protein:vir:78644 Length: 133 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429946;genbank:gi:156604000;genbank:GeneID:5525390 Probab=97.66 E-value=5.8e-07 Score=54.76 Aligned_cols=114 Identities=19% Similarity=0.258 Sum_probs=84.7 Q ss_pred CchHHH--HHHHHHHHh-hHHH-HHHHHHHHHHHHHHHHHHHHHHhCC--ccchhhhcchhceeec--Cce----E-EEe Q lcl|NC_012756. 1 MSNDLA--DLIAKELAA-YSDE-VTEEVDKIAEQVADETVDELKETSP--KRYGKYRRSWKKKKLA--NGS----F-VVF 67 (115) Q Consensus 1 ~~d~La--~~I~~~L~~-y~~~-v~~~~~~~~~~~a~~~~~~lk~~sP--~~TG~y~k~W~~kk~~--~~~----~-vv~ 67 (115) ||=++- ++|.+.|+. |... +....+.++.+.++.+++.||++-. +|||..-..-...++. ++. + |.+ T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~i~W 80 (133) T protein:vir:78 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEW 80 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceeEEEEe Confidence 876653 467777766 5554 3667788999999999999999876 7999988888776653 332 2 344 Q ss_pred -eC--CcceeeheecceeecCCcccCccc--hhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 68 -NA--VASLTHILENGHLSRNGGRVAGIV--HIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 68 -n~--~~~ltHLLE~GHakr~GGrV~~~p--hI~paee~~~~~~~~~i~~ii~ 115 (115) .. +|.|-||-|||+ +|||-++.++- -|+.+.+.....|.+.|++=++ T Consensus 81 ~gp~~R~~iVHLNE~Gy-tr~Gk~i~PrG~G~i~~a~~~se~~y~~~vk~eL~ 132 (133) T protein:vir:78 81 VGPMNRKNIIHLNEHGY-TRDGKKYTPRGFGVIAKTLAASERKYREIIKKELA 132 (133) T ss_pred ecCCCceeEEEeeccce-ecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHhc Confidence 33 799999999998 78998666554 4778888877777777776666 No 103 >protein:vir:9363 Length: 133 # NCBI annotation: SLT orf 123-like protein # Family: family:all:589 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803341;genbank:gi:29028652;genbank:GeneID:1258087 Probab=97.66 E-value=5.8e-07 Score=54.76 Aligned_cols=114 Identities=19% Similarity=0.258 Sum_probs=84.7 Q ss_pred CchHHH--HHHHHHHHh-hHHH-HHHHHHHHHHHHHHHHHHHHHHhCC--ccchhhhcchhceeec--Cce----E-EEe Q lcl|NC_012756. 1 MSNDLA--DLIAKELAA-YSDE-VTEEVDKIAEQVADETVDELKETSP--KRYGKYRRSWKKKKLA--NGS----F-VVF 67 (115) Q Consensus 1 ~~d~La--~~I~~~L~~-y~~~-v~~~~~~~~~~~a~~~~~~lk~~sP--~~TG~y~k~W~~kk~~--~~~----~-vv~ 67 (115) ||=++- ++|.+.|+. |... +....+.++.+.++.+++.||++-. +|||..-..-...++. ++. + |.+ T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~i~W 80 (133) T protein:vir:93 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEW 80 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceeEEEEe Confidence 876653 467777766 5554 3667788999999999999999876 7999988888776653 332 2 344 Q ss_pred -eC--CcceeeheecceeecCCcccCccc--hhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 68 -NA--VASLTHILENGHLSRNGGRVAGIV--HIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 68 -n~--~~~ltHLLE~GHakr~GGrV~~~p--hI~paee~~~~~~~~~i~~ii~ 115 (115) .. +|.|-||-|||+ +|||-++.++- -|+.+.+.....|.+.|++=++ T Consensus 81 ~gp~~R~~iVHLNE~Gy-tr~Gk~i~PrG~G~i~~a~~~se~~y~~~vk~eL~ 132 (133) T protein:vir:93 81 VGPMNRKNIIHLNEHGY-TRDGKKYTPRGFGVIAKTLAASERKYREIIKKELA 132 (133) T ss_pred ecCCCceeEEEeeccce-ecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHhc Confidence 33 799999999998 78998666554 4778888877777777776666 No 104 >protein:vir:96973 Length: 133 # NCBI annotation: ORF034 # Family: family:all:589 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239864;genbank:gi:66395542;genbank:GeneID:5133006 Probab=97.66 E-value=5.8e-07 Score=54.76 Aligned_cols=114 Identities=19% Similarity=0.258 Sum_probs=84.7 Q ss_pred CchHHH--HHHHHHHHh-hHHH-HHHHHHHHHHHHHHHHHHHHHHhCC--ccchhhhcchhceeec--Cce----E-EEe Q lcl|NC_012756. 1 MSNDLA--DLIAKELAA-YSDE-VTEEVDKIAEQVADETVDELKETSP--KRYGKYRRSWKKKKLA--NGS----F-VVF 67 (115) Q Consensus 1 ~~d~La--~~I~~~L~~-y~~~-v~~~~~~~~~~~a~~~~~~lk~~sP--~~TG~y~k~W~~kk~~--~~~----~-vv~ 67 (115) ||=++- ++|.+.|+. |... +....+.++.+.++.+++.||++-. +|||..-..-...++. ++. + |.+ T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~i~W 80 (133) T protein:vir:96 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEW 80 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceeEEEEe Confidence 876653 467777766 5554 3667788999999999999999876 7999988888776653 332 2 344 Q ss_pred -eC--CcceeeheecceeecCCcccCccc--hhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 68 -NA--VASLTHILENGHLSRNGGRVAGIV--HIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 68 -n~--~~~ltHLLE~GHakr~GGrV~~~p--hI~paee~~~~~~~~~i~~ii~ 115 (115) .. +|.|-||-|||+ +|||-++.++- -|+.+.+.....|.+.|++=++ T Consensus 81 ~gp~~R~~iVHLNE~Gy-tr~Gk~i~PrG~G~i~~a~~~se~~y~~~vk~eL~ 132 (133) T protein:vir:96 81 VGPMNRKNIIHLNEHGY-TRDGKKYTPRGFGVIAKTLAASERKYREIIKKELA 132 (133) T ss_pred ecCCCceeEEEeeccce-ecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHhc Confidence 33 799999999998 78998666554 4778888877777777776666 No 105 >protein:vir:94419 Length: 133 # NCBI annotation: ORF028 # Family: family:all:589 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240010;genbank:gi:66395683;genbank:GeneID:5133079 Probab=97.66 E-value=5.8e-07 Score=54.76 Aligned_cols=114 Identities=19% Similarity=0.258 Sum_probs=84.7 Q ss_pred CchHHH--HHHHHHHHh-hHHH-HHHHHHHHHHHHHHHHHHHHHHhCC--ccchhhhcchhceeec--Cce----E-EEe Q lcl|NC_012756. 1 MSNDLA--DLIAKELAA-YSDE-VTEEVDKIAEQVADETVDELKETSP--KRYGKYRRSWKKKKLA--NGS----F-VVF 67 (115) Q Consensus 1 ~~d~La--~~I~~~L~~-y~~~-v~~~~~~~~~~~a~~~~~~lk~~sP--~~TG~y~k~W~~kk~~--~~~----~-vv~ 67 (115) ||=++- ++|.+.|+. |... +....+.++.+.++.+++.||++-. +|||..-..-...++. ++. + |.+ T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~i~W 80 (133) T protein:vir:94 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEW 80 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceeEEEEe Confidence 876653 467777766 5554 3667788999999999999999876 7999988888776653 332 2 344 Q ss_pred -eC--CcceeeheecceeecCCcccCccc--hhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 68 -NA--VASLTHILENGHLSRNGGRVAGIV--HIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 68 -n~--~~~ltHLLE~GHakr~GGrV~~~p--hI~paee~~~~~~~~~i~~ii~ 115 (115) .. +|.|-||-|||+ +|||-++.++- -|+.+.+.....|.+.|++=++ T Consensus 81 ~gp~~R~~iVHLNE~Gy-tr~Gk~i~PrG~G~i~~a~~~se~~y~~~vk~eL~ 132 (133) T protein:vir:94 81 VGPMNRKNIIHLNEHGY-TRDGKKYTPRGFGVIAKTLAASERKYREIIKKELA 132 (133) T ss_pred ecCCCceeEEEeeccce-ecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHhc Confidence 33 799999999998 78998666554 4778888877777777776666 No 106 >protein:vir:4859 Length: 140 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049399;genbank:gi:9632427;genbank:GeneID:1258496 Probab=97.65 E-value=8e-07 Score=54.00 Aligned_cols=109 Identities=19% Similarity=0.181 Sum_probs=74.3 Q ss_pred CchHHHH---HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCC------ccchh---hhcchhceee-----cCce Q lcl|NC_012756. 1 MSNDLAD---LIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSP------KRYGK---YRRSWKKKKL-----ANGS 63 (115) Q Consensus 1 ~~d~La~---~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP------~~TG~---y~k~W~~kk~-----~~~~ 63 (115) |+| |.+ ++.+.|+.-...+.+.=.++++.-|+-..+.|+..+| ..||+ ++++-..+.. .+|+ T Consensus 1 M~~-~~d~l~e~~~~lekl~~~~~~~~~katkAGA~v~~~~L~~~tp~~h~~~~~t~~~~HlaD~I~~~~~~iDg~~~g~ 79 (140) T protein:vir:48 1 MTG-LDEALEGWLKTVASIGDLTPAEQAKITTAGAKVFKEELAEVTRQKHYSNKKHLKYGHMADGLSVQSTNVDGRKNGV 79 (140) T ss_pred Ccc-HHHHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCCCCCCcchhceeecccccccccCce Confidence 665 444 3334444444445566677788888888999999999 45554 7788765421 2333 Q ss_pred EEE-e-eC-CcceeeheecceeecCCcccCccchhhhHHHHH------HHHHHHHHHHHhC Q lcl|NC_012756. 64 FVV-F-NA-VASLTHILENGHLSRNGGRVAGIVHIKPAEEKA------IQNFEKRIKEIGK 115 (115) Q Consensus 64 ~vv-~-n~-~~~ltHLLE~GHakr~GGrV~~~phI~paee~~------~~~~~~~i~~ii~ 115 (115) .+| + ++ +.+++|+|++|..+ +|++|||.++.+.+ .+.+.+.+++||. T Consensus 80 s~VG~~kk~~a~~A~f~n~GT~k-----~~~~hFve~~~~e~~~k~~vl~A~~~~~~~~l~ 135 (140) T protein:vir:48 80 STVGWVNRYHAQNARRLNDGTKK-----YRADHFVTNVQNDSAVQTKVLLAEKEEYEKLIR 135 (140) T ss_pred eeeccCCCcceeeeeccccCccc-----cCCCchhHHHHHhhhhHHHHHHHHHHHHHHHHH Confidence 333 3 33 47899999999985 99999999999753 4455667778887 No 107 >protein:vir:93898 Length: 133 # NCBI annotation: ORF028 # Family: family:all:589 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239942;genbank:gi:66395616;genbank:GeneID:5130964 Probab=97.64 E-value=7.3e-07 Score=54.24 Aligned_cols=114 Identities=19% Similarity=0.255 Sum_probs=84.2 Q ss_pred CchHHH--HHHHHHHHh-h-HHHHHHHHHHHHHHHHHHHHHHHHHhCC--ccchhhhcchhceeec--Cce----E-EEe Q lcl|NC_012756. 1 MSNDLA--DLIAKELAA-Y-SDEVTEEVDKIAEQVADETVDELKETSP--KRYGKYRRSWKKKKLA--NGS----F-VVF 67 (115) Q Consensus 1 ~~d~La--~~I~~~L~~-y-~~~v~~~~~~~~~~~a~~~~~~lk~~sP--~~TG~y~k~W~~kk~~--~~~----~-vv~ 67 (115) ||=++- ++|.+.|+. | ...+....+.++.+.++.+++.||++-. +|||..-..-...++. ++. + |.+ T Consensus 1 msvevkGv~eilk~le~k~G~~~~~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~i~W 80 (133) T protein:vir:93 1 MSVEIKGIPEVLKKLESVYGKQSMQAKSDRALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEW 80 (133) T ss_pred CeEEEecHHHHHHHHHHhhCHhhhHhhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceEEEEEe Confidence 776553 466666666 2 4556677788999999999999999877 7999988888776552 332 2 344 Q ss_pred -eC--CcceeeheecceeecCCcccCccc--hhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 68 -NA--VASLTHILENGHLSRNGGRVAGIV--HIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 68 -n~--~~~ltHLLE~GHakr~GGrV~~~p--hI~paee~~~~~~~~~i~~ii~ 115 (115) .. +|.|-||-|||+ +|||-++.++- -|+.+.+.....|.+.|++=++ T Consensus 81 ~gp~~R~~iVHLNE~Gy-tr~Gk~i~PrG~G~i~~a~~~se~~y~~~vk~eL~ 132 (133) T protein:vir:93 81 VGPMNRKNIIHLNEHGY-TRDGKKYTPRGFGVIAKTLAANERKYREIIKKELA 132 (133) T ss_pred ecCCCceeEEEeeccce-ecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHhc Confidence 33 799999999998 78998666554 4778888877777777776666 No 108 >protein:vir:5000 Length: 141 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049974;genbank:gi:9632946;genbank:GeneID:1262109 Probab=97.62 E-value=8e-07 Score=54.02 Aligned_cols=109 Identities=18% Similarity=0.177 Sum_probs=74.8 Q ss_pred CchHHHH---HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCcc------c---hhhhcchhceee-----cCce Q lcl|NC_012756. 1 MSNDLAD---LIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKR------Y---GKYRRSWKKKKL-----ANGS 63 (115) Q Consensus 1 ~~d~La~---~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~------T---G~y~k~W~~kk~-----~~~~ 63 (115) |+| |++ ++.+.|+.-...+.+.=.+++..-|+-..+.|+..+|.. | |-++++-..+.. .+++ T Consensus 1 M~~-~~~gl~e~~~~lekl~~~~~~~~~katkAGA~v~~~~L~~~tp~~hy~~~~~~~~~HlaD~I~~~~~~~DG~~dg~ 79 (141) T protein:vir:50 1 MVG-LAEALDEWLKTVASIGNLTPAEQVEITTAGAKVFKKELEEVTREKHYSRKKNPKFGHMADGLAIQSTNADGRKNGV 79 (141) T ss_pred Ccc-HHHHHHHHHHHHHHhcCCCHHHHHHHHHHHHHHHHHHHHHhcccCCCCCCCCCCCCccccceeeccCccccccCCe Confidence 665 443 344444444445566667788888888899999999842 3 457788765431 2334 Q ss_pred EEE-e-eC-CcceeeheecceeecCCcccCccchhhhHHHH------HHHHHHHHHHHHhC Q lcl|NC_012756. 64 FVV-F-NA-VASLTHILENGHLSRNGGRVAGIVHIKPAEEK------AIQNFEKRIKEIGK 115 (115) Q Consensus 64 ~vv-~-n~-~~~ltHLLE~GHakr~GGrV~~~phI~paee~------~~~~~~~~i~~ii~ 115 (115) .+| + ++ ..+++|+|++|..+ +|++|||.++.+. ..+.+.+.+++||. T Consensus 80 s~VG~~~~~~~~~A~f~n~GT~k-----~~~~hFve~~~~~a~~k~~Vl~A~~~~~k~~l~ 135 (141) T protein:vir:50 80 STVGWKNNYHAQNARRLNDGTKK-----YRADHFVTNVQNDSTVQKKVLLEKKRNTKNSLE 135 (141) T ss_pred eeeccCCCccceeeeccccCccc-----cCCCchhHHHHHhhhhHHHHHHHHHHHHHHHHH Confidence 343 4 33 37899999999986 8999999999964 44566777777777 No 109 >protein:vir:98636 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039927;genbank:gi:126011102;genbank:GeneID:4818472 Probab=97.46 E-value=1.7e-06 Score=52.25 Aligned_cols=111 Identities=18% Similarity=0.183 Sum_probs=76.2 Q ss_pred Cc---hHH-HHHHHHHHHh-hHHH-HHHHHHHHHHHHHHHHHHHHHHhCC--ccchhhhcchhceeec--Cc--eE-EEe Q lcl|NC_012756. 1 MS---NDL-ADLIAKELAA-YSDE-VTEEVDKIAEQVADETVDELKETSP--KRYGKYRRSWKKKKLA--NG--SF-VVF 67 (115) Q Consensus 1 ~~---d~L-a~~I~~~L~~-y~~~-v~~~~~~~~~~~a~~~~~~lk~~sP--~~TG~y~k~W~~kk~~--~~--~~-vv~ 67 (115) || +=- -++|.+.|+. |... +....+.++.+.++.+++.||.+.| +|||.--..-...++. +| ++ |.+ T Consensus 7 ~~~~aevkGv~Eilk~lE~klG~~~~~ri~nkAL~~~ge~v~~~lK~~~~~fkDTGat~dev~~s~p~~~~G~r~V~igW 86 (138) T protein:vir:98 7 MSGFANLKGVEELLANMEKKLGPAKVNRVVNRSLKEIGKELEPSFKSAISIYKRTGETTESAVVSGVRREDGIPKVKLGF 86 (138) T ss_pred ccccccccCHHHHHHHHHHhhCHHhhhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeeeeeecCeeecCCceEEEEee Confidence 33 211 2466666776 6555 7778889999999999999999998 7999977776665543 23 22 223 Q ss_pred -eCCcceeeheecceeecCCc--ccCccchhhhHHHHHHHHHHHHH----HHHhC Q lcl|NC_012756. 68 -NAVASLTHILENGHLSRNGG--RVAGIVHIKPAEEKAIQNFEKRI----KEIGK 115 (115) Q Consensus 68 -n~~~~ltHLLE~GHakr~GG--rV~~~phI~paee~~~~~~~~~i----~~ii~ 115 (115) ..+|++-||-|||| |- +-+|.-.|+.+.+.....|.+.+ ++.++ T Consensus 87 ~GpR~~ivHLNE~Gy----Gk~i~PrG~G~I~ka~~~se~~y~~~vk~el~k~l~ 137 (138) T protein:vir:98 87 TTPRWNIVHLQELEY----GWKHNRRGVGVIRRYSDILETIYPRGIRDKLKRGFD 137 (138) T ss_pred ecCeeeEEeeecccc----cCCcCCCcchHHHHHHHhhhHHHHHHHHHHHHHHhc Confidence 45999999999999 43 33455567777766666666555 44444 No 110 >protein:vir:6216 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:10886 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852596;genbank:gi:31415856;genbank:GeneID:1489214 Probab=97.46 E-value=7.3e-07 Score=54.24 Aligned_cols=111 Identities=19% Similarity=0.168 Sum_probs=73.2 Q ss_pred CchH------HHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCC----ccchhhhcchhceeecCceEEEeeCC Q lcl|NC_012756. 1 MSND------LADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSP----KRYGKYRRSWKKKKLANGSFVVFNAV 70 (115) Q Consensus 1 ~~d~------La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP----~~TG~y~k~W~~kk~~~~~~vv~n~~ 70 (115) |+++ +++-|...|+= -++.-.+..+++|+.-.+.|+-+=| +..|.++.+-++-...+.-.|.+-.. T Consensus 1 m~sNNNGFae~~~~~~tl~kV----d~kvs~e~L~eAA~~f~~KL~P~Ip~Sl~kkk~HlrD~lkVvvk~d~V~V~Fed~ 76 (125) T protein:vir:62 1 MASNNNGFAEALEDINTLLRV----NKKVSLDALDEAAKYFASKLKPKINVSNKNKRTHLRDSLKVVVKDDRVSVEFKDE 76 (125) T ss_pred CCCCchhHHHHHHHhhhhhhh----hhhhhHHHHHHHHHHHHHhhccccChhhhhhhhhcceeeeEEeeCCeEEEEEcch Confidence 6642 23333333322 2222334444444444444444444 45678898888877777777777777 Q ss_pred cceeeheecceeecCC-cccCccchhhhHHHHHHHHHHHHHH-HHhC Q lcl|NC_012756. 71 ASLTHILENGHLSRNG-GRVAGIVHIKPAEEKAIQNFEKRIK-EIGK 115 (115) Q Consensus 71 ~~ltHLLE~GHakr~G-GrV~~~phI~paee~~~~~~~~~i~-~ii~ 115 (115) ...+-++|+||...|| |||.|+.|..-.++......++-+- .|+. T Consensus 77 a~yW~f~EnGt~~~~~~g~vkaqhf~~~Tf~~nk~kI~~iM~kki~d 123 (125) T protein:vir:62 77 AWYWYLVEHGHKKAKGKGRVKGKHFVQNTFDAEGDKIADIMAQKIIN 123 (125) T ss_pred hhhhhhhhccccccccccccchhhhhhccHHhhHHHHHHHHHHHHHh Confidence 6667899999999987 9999999999999988877776543 3555 No 111 >protein:vir:4833 Length: 140 # NCBI annotation: ORF29 # Family: family:all:1029 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038330;genbank:gi:9634656;genbank:GeneID:1262624 Probab=97.24 E-value=4.6e-06 Score=49.82 Aligned_cols=109 Identities=17% Similarity=0.167 Sum_probs=73.2 Q ss_pred CchHHHHHHHH---HHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCc------cchh---hhcchhcee-----ecCce Q lcl|NC_012756. 1 MSNDLADLIAK---ELAAYSDEVTEEVDKIAEQVADETVDELKETSPK------RYGK---YRRSWKKKK-----LANGS 63 (115) Q Consensus 1 ~~d~La~~I~~---~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~------~TG~---y~k~W~~kk-----~~~~~ 63 (115) |+| |.+.|.. .|+.-...+.+.=.++++.-|+-..+.|+..+|. .||+ ++.+-..+. ..+|+ T Consensus 1 M~~-~~d~l~e~~~~v~kl~~~~~~~~~katkAGAkv~~~~L~~~tp~~h~~~r~t~~~~HlaD~I~~~~~~idg~~dG~ 79 (140) T protein:vir:48 1 MTG-LDEALEGWLKTVASIGDLTPAEQAKITTAGAKVFKKELAEVTREKHYSKKKDLKYGHMADGLAVQSTNVDGRKNGV 79 (140) T ss_pred Ccc-HHHHHHHHHHHHHHhccCCHHHHHHHHHHhHHHHHHHHHHhcccCCCCCCCCCCCCcccccceecccccccccccc Confidence 665 4443333 3344444455666777788888889999999984 3543 777776542 12333 Q ss_pred EEE-eeC--CcceeeheecceeecCCcccCccchhhhHHHH------HHHHHHHHHHHHhC Q lcl|NC_012756. 64 FVV-FNA--VASLTHILENGHLSRNGGRVAGIVHIKPAEEK------AIQNFEKRIKEIGK 115 (115) Q Consensus 64 ~vv-~n~--~~~ltHLLE~GHakr~GGrV~~~phI~paee~------~~~~~~~~i~~ii~ 115 (115) .+| |.+ +.+++|+|++|..+ +|++|||..+.+. ..+.+.+..++||. T Consensus 80 s~VG~~k~~~a~~a~f~NdGT~k-----~~~~hFve~t~~e~~~~~~vl~A~~~~y~~~l~ 135 (140) T protein:vir:48 80 ATVGWKNNYHAQNARRLNDGTKK-----YRADHFVTNVQNDSAVRDKVLLAEKEEYEKLIR 135 (140) T ss_pred eeecccCCCceeEEeecccCccc-----cCCCchHHHHHHhhhhHHHHHHHHHHHHHHHHH Confidence 333 443 46999999999985 9999999999864 45566667788885 No 112 >protein:vir:2688 Length: 123 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075507;genbank:gi:12719436;genbank:GeneID:920156 Probab=97.00 E-value=1.2e-05 Score=47.61 Aligned_cols=106 Identities=19% Similarity=0.235 Sum_probs=73.0 Q ss_pred HHHHHHh-h-HHHHHHHHHHHHHHHHHHHHHHHHHhC--Cccchhhhcchhceeec--Cce----E-EEee-C--Cccee Q lcl|NC_012756. 9 IAKELAA-Y-SDEVTEEVDKIAEQVADETVDELKETS--PKRYGKYRRSWKKKKLA--NGS----F-VVFN-A--VASLT 74 (115) Q Consensus 9 I~~~L~~-y-~~~v~~~~~~~~~~~a~~~~~~lk~~s--P~~TG~y~k~W~~kk~~--~~~----~-vv~n-~--~~~lt 74 (115) |.+.|+. | ...+....+.++.+.++.+++.||++- =+|||.--..-...++. +++ + |.+. . +|.|- T Consensus 1 ilk~lE~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGatidev~~s~p~~~~g~~~rtV~i~W~gp~~R~~iV 80 (123) T protein:vir:26 1 MLKKLESVYGKQSMQAKSDRALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEWVGPMNRKNII 80 (123) T ss_pred ChhhHHHhcCHHHHHHhhhHHHHHHHHHHHHHHHHhhHHhhhccceeeeEEecCeeeccCCccceEEEEeecCCCceeeE Confidence 3333332 2 234555667788888888888888864 47999888877766552 332 2 3453 3 79999 Q ss_pred eheecceeecCCcccCccc--hhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 75 HILENGHLSRNGGRVAGIV--HIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 75 HLLE~GHakr~GGrV~~~p--hI~paee~~~~~~~~~i~~ii~ 115 (115) ||-|||+ +|||-++.++- -|+.+.+.....|.+.|++=++ T Consensus 81 HLNE~GY-tr~Gk~i~PRG~G~i~~a~~~se~~y~~~vk~eL~ 122 (123) T protein:vir:26 81 HLNEHGY-TRDGKKYTPRGFGVIAKTLAANERKYREIIKKELA 122 (123) T ss_pred eeeccce-ecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHhc Confidence 9999998 77998666554 4777888777777777776565 No 113 >protein:vir:96012 Length: 133 # NCBI annotation: ORF023 # Family: family:all:589 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239805;genbank:gi:66395471;genbank:GeneID:5132929 Probab=96.98 E-value=1.3e-05 Score=47.38 Aligned_cols=115 Identities=18% Similarity=0.198 Sum_probs=73.3 Q ss_pred CchHHH-HHHHHHHHh-hH-HHHHHHHHHHHHHHHHHHHHHHHHhC--Cccchhhhcchhceeec--Cc--eE-EEee-C Q lcl|NC_012756. 1 MSNDLA-DLIAKELAA-YS-DEVTEEVDKIAEQVADETVDELKETS--PKRYGKYRRSWKKKKLA--NG--SF-VVFN-A 69 (115) Q Consensus 1 ~~d~La-~~I~~~L~~-y~-~~v~~~~~~~~~~~a~~~~~~lk~~s--P~~TG~y~k~W~~kk~~--~~--~~-vv~n-~ 69 (115) |++=-. ++|.+.|+. |. ..+....+.++.+.++.+++.||++- =+|||.--..-...++. ++ ++ |.+. . T Consensus 1 m~evkGv~eilk~lE~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGatidev~~s~p~~~~g~rtV~i~W~gp 80 (133) T protein:vir:96 1 MRLIYDTKKLERELEKRLSKRALMRITDRALTEAGEVVLEAIRTNLKYFRDTGAEYGEVKLSKPTWENGKRTIRVYWEGE 80 (133) T ss_pred CccccCHHHHHHHHHHhcCHHHHHHHhhHHHHHHHHHHHHHHHHhhHHHhhccceeeeEEecCceecCCceEEEEEeecC Confidence 764221 244444443 32 34556677788888888888898864 47999877766554432 22 23 3443 3 Q ss_pred --CcceeeheecceeecCCcccCccch--hhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 70 --VASLTHILENGHLSRNGGRVAGIVH--IKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 70 --~~~ltHLLE~GHakr~GGrV~~~ph--I~paee~~~~~~~~~i~~ii~ 115 (115) +|.|-||-||||=+|+|-++.++-| |+.+.+.....|.+.|++=++ T Consensus 81 ~~R~~iVHLNE~G~ytr~Gk~i~PrG~G~I~~al~~se~~y~~~vk~el~ 130 (133) T protein:vir:96 81 KHRYSIVHLNEKGFYAKDGKFIRPKGMGAIDKALRASRDKFFKVYAEEVS 130 (133) T ss_pred CCceeeEeeecccceecCCceeccchhhHHHHHHHhhhHHHHHHHHHHHH Confidence 7999999999999999988776654 566666555555554444443 No 114 >protein:vir:3848 Length: 159 # NCBI annotation: hypothetical protein # Family: family:all:1029 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050154;swissprot:trembl:q9t1f3;genbank:gi:9633046;uniprot:Q9T1F3;genbank:GeneID:1262148 Probab=96.87 E-value=3.6e-05 Score=44.94 Aligned_cols=115 Identities=21% Similarity=0.244 Sum_probs=73.6 Q ss_pred CchHHHHHHHHHHHhhHH---HHHHHHHHHHHHHHHHHHHHHHHhCCcc--------------------ch---hhhcch Q lcl|NC_012756. 1 MSNDLADLIAKELAAYSD---EVTEEVDKIAEQVADETVDELKETSPKR--------------------YG---KYRRSW 54 (115) Q Consensus 1 ~~d~La~~I~~~L~~y~~---~v~~~~~~~~~~~a~~~~~~lk~~sP~~--------------------TG---~y~k~W 54 (115) |..+|.+.|..-|.+-.. ...+.=.+++..-|+-..+.|+..+|.. || -++++- T Consensus 1 mm~~~~~~l~~~l~~v~k~~~~~~~~k~kiTkAGAkv~~e~L~~~Tp~~h~~~~k~~~~~~~~~k~~~~~~~~~HlaD~I 80 (159) T protein:vir:38 1 MANDMGEFYNNWVNEVEKGMKLSVEDKAKITGEGAEAFSTVLHDHTPRSNEIYRRGRSAGHANAKHHNRNRKTKHLQDSI 80 (159) T ss_pred CcchHHHHHHHHHHHHHHhcCCCHHHHHHHHHHhHHHHHHHHHHhcccCCCccccccccccccccccCcCcCCCccccce Confidence 888888766665555433 2234455677777888888999999974 32 445555 Q ss_pred hceee------cCceEEE-e-eC-CcceeeheecceeecCCcccCccchhhhHHHHHHHH----HHHHHHHHhC Q lcl|NC_012756. 55 KKKKL------ANGSFVV-F-NA-VASLTHILENGHLSRNGGRVAGIVHIKPAEEKAIQN----FEKRIKEIGK 115 (115) Q Consensus 55 ~~kk~------~~~~~vv-~-n~-~~~ltHLLE~GHakr~GGrV~~~phI~paee~~~~~----~~~~i~~ii~ 115 (115) ..+.. .+|+.+| | ++ ..+++|+|+.|-.+...-...|-+||.-+.+.+... +.+..++||. T Consensus 81 ~~~~~~~iDg~~dG~s~VGw~~~~~a~~a~f~NdGT~~m~~k~~~gdHFvekt~~~~k~~Vl~A~~~~~~~il~ 154 (159) T protein:vir:38 81 TYKPGYTADKLHTGDTDVGFEGKYYDFLAKIVNNGQHHMSPKRYKNMHFLDKAQQEAKKSVAEAELKAYKEVMN 154 (159) T ss_pred eeecCccccccccceeeecccCCccceEeeecccCccccCCCCccCChhHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 44321 2333333 4 33 358999999999973333334458888887766555 4478888888 No 115 >protein:vir:3163 Length: 145 # NCBI annotation: unknown # Family: family:all:28417 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665934;genbank:gi:22091120;genbank:GeneID:951270 Probab=96.64 E-value=2.3e-05 Score=46.02 Aligned_cols=110 Identities=14% Similarity=0.125 Sum_probs=62.2 Q ss_pred CchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHH-hC----------------------Cccchhhhcchhce Q lcl|NC_012756. 1 MSNDLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKE-TS----------------------PKRYGKYRRSWKKK 57 (115) Q Consensus 1 ~~d~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~-~s----------------------P~~TG~y~k~W~~k 57 (115) |-.+..+ |.+.|+.-...+...|.++...+.+++.....+ .+ -.+||.+++|++.. T Consensus 1 ~i~~~~~-i~~~l~~l~~~~~~~l~~i~~~~~~~~~~rf~~~~~p~G~~W~pLs~st~a~k~~~~~L~~tG~L~~Si~~~ 79 (145) T protein:vir:31 1 MVEDENN-IPEAREAIQDGLTDGLERLHTITLRELITNMSDGQDALGNPWEPLKESTIRAKGSDTPLIDNSRLLTDINAA 79 (145) T ss_pred CcccHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccChHHHHHhcCCCCCccCHHHHHHHHHH Confidence 4433322 444444444444444544444444444333332 11 14678899998776 Q ss_pred eecC--c-eEEEeeCCcceeeheecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 58 KLAN--G-SFVVFNAVASLTHILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 58 k~~~--~-~~vv~n~~~~ltHLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) ...+ + +.+|-++..+ +...+||..+ +..|++|||.|+.+...+..++.|+++|. T Consensus 80 ~~~~~~~~~a~vGtn~~Y-A~~hqfG~~~---~~IPaRPfLG~~~~~~~~~~~~ii~~~i~ 136 (145) T protein:vir:31 80 SMMDRANRMAVIGTNLDY-AEHHEFGAPE---AGIPARPIFGPAGAYASQQAPDVIGDEID 136 (145) T ss_pred hhhcccCceeEecCCchh-hhhhccCCcc---cccCCCCccCCCccchHHHHHHHHHHHHH Confidence 5432 2 3344333333 4466899654 34999999999987666666666776666 No 116 >protein:vir:102608 Length: 108 # NCBI annotation: gp9 # Family: family:all:3937 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655005;genbank:gi:109392195;genbank:GeneID:4157230 Probab=96.11 E-value=7.4e-06 Score=48.70 Aligned_cols=88 Identities=28% Similarity=0.358 Sum_probs=50.7 Q ss_pred Cch------HHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceee--cCceEEEeeCCcc Q lcl|NC_012756. 1 MSN------DLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKL--ANGSFVVFNAVAS 72 (115) Q Consensus 1 ~~d------~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~--~~~~~vv~n~~~~ 72 (115) |++ -|+..=. .|.+|-+. .++.+-|.+.-++++..+|++||+.||.|+.|-.+... ..+.--|-.+.+| T Consensus 1 ma~gpt~knplakfgi-~lddfdkl--pevnqgvnef~dev~aawk~nspv~~g~yrdsvqvterstnkgrgkvgatdpq 77 (108) T protein:vir:10 1 MANGPTRKNPLAKFGV-RLDDFDKL--PEVNQGVNEFIDEVVAAWKNNSPVGTGAYRDSVQVTERSTNKGRGKVGATDPQ 77 (108) T ss_pred CCCCCccccchhhhcc-chhhhhcc--chhhhhHHHHHHHHHHhhhcCCCccccccccceeecccccccccccccCcchh Confidence 443 2332211 14444332 12444555555667778999999999999999876532 2333445455666 Q ss_pred eeeheecceeecC------------CcccCcc Q lcl|NC_012756. 73 LTHILENGHLSRN------------GGRVAGI 92 (115) Q Consensus 73 ltHLLE~GHakr~------------GGrV~~~ 92 (115) +||+|||-+..+ ||---+. T Consensus 78 -ahlvefgs~hndeyapaqktakqfggtay~d 108 (108) T protein:vir:10 78 -AHLVEFGSAHNDEYAPAQKTAKQFGGTAYGD 108 (108) T ss_pred -hhhhhhhccccccccchhhhHHhhcccccCC Confidence 999999976432 3322222 No 117 >protein:vir:105825 Length: 108 # NCBI annotation: gp9 # Family: family:all:3937 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655770;genbank:gi:109522093;genbank:GeneID:4157633 Probab=96.11 E-value=7.4e-06 Score=48.70 Aligned_cols=88 Identities=28% Similarity=0.358 Sum_probs=50.7 Q ss_pred Cch------HHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceee--cCceEEEeeCCcc Q lcl|NC_012756. 1 MSN------DLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKL--ANGSFVVFNAVAS 72 (115) Q Consensus 1 ~~d------~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~--~~~~~vv~n~~~~ 72 (115) |++ -|+..=. .|.+|-+. .++.+-|.+.-++++..+|++||+.||.|+.|-.+... ..+.--|-.+.+| T Consensus 1 ma~gpt~knplakfgi-~lddfdkl--pevnqgvnef~dev~aawk~nspv~~g~yrdsvqvterstnkgrgkvgatdpq 77 (108) T protein:vir:10 1 MANGPTRKNPLAKFGV-RLDDFDKL--PEVNQGVNEFIDEVVAAWKNNSPVGTGAYRDSVQVTERSTNKGRGKVGATDPQ 77 (108) T ss_pred CCCCCccccchhhhcc-chhhhhcc--chhhhhHHHHHHHHHHhhhcCCCccccccccceeecccccccccccccCcchh Confidence 443 2332211 14444332 12444555555667778999999999999999876532 2333445455666 Q ss_pred eeeheecceeecC------------CcccCcc Q lcl|NC_012756. 73 LTHILENGHLSRN------------GGRVAGI 92 (115) Q Consensus 73 ltHLLE~GHakr~------------GGrV~~~ 92 (115) +||+|||-+..+ ||---+. T Consensus 78 -ahlvefgs~hndeyapaqktakqfggtay~d 108 (108) T protein:vir:10 78 -AHLVEFGSAHNDEYAPAQKTAKQFGGTAYGD 108 (108) T ss_pred -hhhhhhhccccccccchhhhHHhhcccccCC Confidence 999999976432 3322222 No 118 >protein:vir:6246 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813700;swissprot:trembl:q859b7;genbank:gi:29366760;uniprot:Q859B7;genbank:GeneID:1258903 Probab=96.09 E-value=6.1e-05 Score=43.71 Aligned_cols=111 Identities=21% Similarity=0.293 Sum_probs=80.8 Q ss_pred CchHHHHHH--------HHHHHhh-HHHHHHHHHHHHHHHHHHHHHHHHHhCCc-----------cchhhhcchhceeec Q lcl|NC_012756. 1 MSNDLADLI--------AKELAAY-SDEVTEEVDKIAEQVADETVDELKETSPK-----------RYGKYRRSWKKKKLA 60 (115) Q Consensus 1 ~~d~La~~I--------~~~L~~y-~~~v~~~~~~~~~~~a~~~~~~lk~~sP~-----------~TG~y~k~W~~kk~~ 60 (115) |||.-.--| +.-|..- -.++-+.|+++.+++|.-+...+++.+|+ +||.+++|-+...+. T Consensus 1 ma~~~~~~vrV~Glr~f~~~mrK~~g~dl~k~lk~a~~~aa~v~~~~ar~~tP~g~r~~~~s~~~r~G~L~~Sir~aaT~ 80 (143) T protein:vir:62 1 MAQRSAYTIRVDGLREFQRNVRTLRDKELNKAVREANKASGEVLIPQAKHESPDGKRDAKSSKKYRPGKLDKSIKVTASA 80 (143) T ss_pred CCcccchheehHHHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccCcchhhccccccccc Confidence 887644333 3334443 45567889999999999999999999998 799999999888777 Q ss_pred CceEEEeeC---CcceeeheecceeecCCcccCccchhhhH----HHHHHHHHHHHHHHHhC Q lcl|NC_012756. 61 NGSFVVFNA---VASLTHILENGHLSRNGGRVAGIVHIKPA----EEKAIQNFEKRIKEIGK 115 (115) Q Consensus 61 ~~~~vv~n~---~~~ltHLLE~GHakr~GGrV~~~phI~pa----ee~~~~~~~~~i~~ii~ 115 (115) .+.+|=-.+ .++ +-.++||-..++ +.+..||..+ |+.+.+-+|.+|.++++ T Consensus 81 raa~VrAG~~krVPY-A~~I~~G~r~r~---Isp~rFl~~a~a~te~~~~r~Ye~~i~~vl~ 138 (143) T protein:vir:62 81 KGAVIKAGSASRVPY-AAAIHFGYRARN---ISPNRFLFRAMARKSDVVAATYERRIAAVVE 138 (143) T ss_pred cceeeeeCCcCCCCc-ccccccCccccc---ccchhhhhhhhhccCHHHHHHHHHHHHHHHH Confidence 666654433 344 668899965544 4566666544 55677888888888888 No 119 >protein:vir:99196 Length: 155 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950453;genbank:gi:119953654;genbank:GeneID:4643056 Probab=95.64 E-value=0.00012 Score=42.09 Aligned_cols=112 Identities=18% Similarity=0.092 Sum_probs=66.7 Q ss_pred CchHH-----HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhC--------C--------------------ccc Q lcl|NC_012756. 1 MSNDL-----ADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETS--------P--------------------KRY 47 (115) Q Consensus 1 ~~d~L-----a~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~s--------P--------------------~~T 47 (115) ||..+ .++|.+.|......+. ++..+...+|..++.....+. | .+| T Consensus 1 Ms~~i~i~~d~~~~~~~L~~l~~~~~-d~~~l~~~ig~~l~~~~~~rF~pdG~~W~pls~~t~~~r~~~g~~~~~iL~~t 79 (155) T protein:vir:99 1 MTTRIDVELDDQEVRQRLALLMRSVT-DTLPVMRGIAAELLAETEFAFMDEGPGWPQLSPVTVAAREAKGRGPHPILQVT 79 (155) T ss_pred CceEEEEEechHHHHHHHHHHHHHhh-hHHHHHHHHHHHHHHHHHHHhhccCCCCCCCChHHHHHHhccCCCCCCcchhc Confidence 66533 2467777887777665 366777777777766665542 1 367 Q ss_pred hhhhcchhceeecCceEEEeeCCcceeeheecceeecCCc--ccCccchhhhHHH-HHHHHHHHHHHHHhC Q lcl|NC_012756. 48 GKYRRSWKKKKLANGSFVVFNAVASLTHILENGHLSRNGG--RVAGIVHIKPAEE-KAIQNFEKRIKEIGK 115 (115) Q Consensus 48 G~y~k~W~~kk~~~~~~vv~n~~~~ltHLLE~GHakr~GG--rV~~~phI~paee-~~~~~~~~~i~~ii~ 115 (115) |.++.|++.....++..|=.|..| |.+-+||=..+-++ -.|++|||.-.++ ....+.++.|..+|. T Consensus 80 g~L~~Si~~~~~~~~v~vGtn~~Y--A~iHqfGg~~~~~~~v~iPaRpfLG~s~~~~l~~e~~~~I~~~i~ 148 (155) T protein:vir:99 80 NALARSVTTWADRNEAGIGSNLVY--AAIHQFGGDAGRGHQVEIPARRYLPFDENGQLAAGARQSILEIVL 148 (155) T ss_pred hhhhhhhhceecCCEEEEecCccc--hhhhhcccccCCCCccccCCccccCCCCccccchHHHHHHHHHHH Confidence 888888887765444333333344 56667874332223 3699999975443 223344455555554 No 120 >protein:vir:103841 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938236;genbank:gi:38229141;genbank:GeneID:2648156 Probab=95.57 E-value=0.00014 Score=41.78 Aligned_cols=112 Identities=14% Similarity=0.092 Sum_probs=62.9 Q ss_pred CchHH-----HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhC-----C-----------------------ccc Q lcl|NC_012756. 1 MSNDL-----ADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETS-----P-----------------------KRY 47 (115) Q Consensus 1 ~~d~L-----a~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~s-----P-----------------------~~T 47 (115) ||..+ .+.|.+.|....+... .+..+...+|..++.....+. | .+| T Consensus 1 Ms~~i~i~~~~~~~~~~L~~l~~~~~-~~~~l~~~ig~~l~~~~~~rF~p~G~~W~plsp~t~~~r~k~g~~~~~~L~~t 79 (155) T protein:vir:10 1 MANRIELELVDREVQERLAALYAAVT-DTLPLMRGIAAELLAETEFAFMDEGPGWPQLSPVTVAARAAKGRGAHPILQVT 79 (155) T ss_pred CCceEEEEechHHHHHHHHHHHHHhh-hHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCCccchHHHHhccCCCCCccccc Confidence 66533 2357777777666554 355666666666665553321 1 247 Q ss_pred hhhhcchhceeecCceEEEeeCCcceeeheecceeecCCc--ccCccchhhhH-HHHHHHHHHHHHHHHhC Q lcl|NC_012756. 48 GKYRRSWKKKKLANGSFVVFNAVASLTHILENGHLSRNGG--RVAGIVHIKPA-EEKAIQNFEKRIKEIGK 115 (115) Q Consensus 48 G~y~k~W~~kk~~~~~~vv~n~~~~ltHLLE~GHakr~GG--rV~~~phI~pa-ee~~~~~~~~~i~~ii~ 115 (115) |.++.|++.....+...|=.|..| |..-+||=...-++ -.|++|||.-. .++...+.++.|.+++. T Consensus 80 G~L~~Si~~~~~~~~v~vGtn~~Y--A~iHqfGg~~~~~~~~~iPARPfLG~s~~~e~~~ei~~~I~~~i~ 148 (155) T protein:vir:10 80 NALARSITTRADRDQAQIGSNLSY--AAIQQLGGQAGRGRKVTIPARPYLPVLRNGQLKPSARDAVLDVLL 148 (155) T ss_pred hhhhhhhhceecCCEEEEecCcch--hhhhhcccccCCCCccccCCccccCCCccccchHHHHHHHHHHHH Confidence 888888887765444333333444 55667873321122 38999999733 33334445555555554 No 121 >protein:vir:1332 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047931;swissprot:trembl:q9zxa7;genbank:gi:9631149;uniprot:Q9ZXA7;genbank:GeneID:2715891 Probab=95.53 E-value=0.00015 Score=41.60 Aligned_cols=111 Identities=21% Similarity=0.288 Sum_probs=80.6 Q ss_pred CchHHHHHH--------HHHHHhh-HHHHHHHHHHHHHHHHHHHHHHHHHhCCcc-----------chhhhcchhceeec Q lcl|NC_012756. 1 MSNDLADLI--------AKELAAY-SDEVTEEVDKIAEQVADETVDELKETSPKR-----------YGKYRRSWKKKKLA 60 (115) Q Consensus 1 ~~d~La~~I--------~~~L~~y-~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~-----------TG~y~k~W~~kk~~ 60 (115) |||.-.--| +.-|.-- -.++-+.|+++.+++|.-+...+++.+|+- ||.+++|-+...+. T Consensus 1 ma~~~~~~vkV~Glr~f~~~mrK~~g~dl~k~lk~a~~~aa~v~~~~ar~~tP~g~~~p~~srr~r~G~L~~Sir~aaT~ 80 (143) T protein:vir:13 1 MAQRSAYTIQVDGLRQFQRNVRALRDKELNKAVREANKASGEVLIPQAKHESPDGHRDPKSSKRYRPGKLDKSIKVTASA 80 (143) T ss_pred CCcccchheehHHHHHHHHHHHHhhCCcchHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccchhhccccccccc Confidence 887654433 2333333 445667899999999999999999999975 89999999888777 Q ss_pred CceEEEeeC---CcceeeheecceeecCCcccCccchhhhH----HHHHHHHHHHHHHHHhC Q lcl|NC_012756. 61 NGSFVVFNA---VASLTHILENGHLSRNGGRVAGIVHIKPA----EEKAIQNFEKRIKEIGK 115 (115) Q Consensus 61 ~~~~vv~n~---~~~ltHLLE~GHakr~GGrV~~~phI~pa----ee~~~~~~~~~i~~ii~ 115 (115) .+.+|=-.+ .++ +-.++||-.+++ +.+..||..+ |+.+.+-+|.+|.++++ T Consensus 81 raa~VrAGr~arVPY-A~~I~~G~r~r~---Is~~rFl~~a~a~te~~~~r~Ye~~i~~vl~ 138 (143) T protein:vir:13 81 KGAVIKAGSAARVPY-AAAIHFGYRKRN---ISANRFLYRAMARKSDVVAATYERRIAAVVE 138 (143) T ss_pred cceeeeecCcCCCCc-ccccccCCcccc---cchhhhhhhhhhccCHHHHHHHHHHHHHHHH Confidence 776665543 344 668899966654 4566666554 55677888888888888 No 122 >protein:vir:1988 Length: 156 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050635;genbank:gi:9633522;genbank:GeneID:2636282 Probab=94.87 E-value=0.00024 Score=40.41 Aligned_cols=110 Identities=14% Similarity=0.088 Sum_probs=56.1 Q ss_pred CchHHH-----HHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHh-----CC-------------------------- Q lcl|NC_012756. 1 MSNDLA-----DLIAKELAAYSDEVTEEVDKIAEQVADETVDELKET-----SP-------------------------- 44 (115) Q Consensus 1 ~~d~La-----~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~-----sP-------------------------- 44 (115) ||-.+. ++|.+.|......... ..+.+++|..++.....+ +| T Consensus 1 ms~~i~~~~d~~~l~~~L~~l~~~~~~--~~l~~~Ig~~l~~~~~~rf~~~~~Pd~G~~W~pls~~t~~~r~~~~~~~~~ 78 (156) T protein:vir:19 1 MSLDMNVAVDVRRIQLALDELGTVTRD--RAIPRVMAAALLSSTEQAFERQADPDTGKGWEAWSDSWLAWRQDHGFVPGS 78 (156) T ss_pred CeEEEEEeecHHHHHHHHHHHHhhhcc--HHHHHHHHHHHHHHHHHHHHhcCCCCCCCCCcccChHHHHHhhccCCCCCc Confidence 432221 3555666554432221 133444444444333321 23 Q ss_pred --ccchhhhcchhceeecCceEEEeeCCcceeeheecceeecCCc---ccCccchhhhHHHHHHHHHHHH----HHHHhC Q lcl|NC_012756. 45 --KRYGKYRRSWKKKKLANGSFVVFNAVASLTHILENGHLSRNGG---RVAGIVHIKPAEEKAIQNFEKR----IKEIGK 115 (115) Q Consensus 45 --~~TG~y~k~W~~kk~~~~~~vv~n~~~~ltHLLE~GHakr~GG---rV~~~phI~paee~~~~~~~~~----i~~ii~ 115 (115) .+||.++.|++.....++..|=.|..| |.+-+||-..+-++ ..|++|||.-. +.-.+++.+. ++++++ T Consensus 79 ~L~~tg~L~~Si~~~~~~~~v~vGt~~~y--A~vHqfG~~~~~~~~~~~iPaRpfLG~s-~~d~~~I~~~i~~~l~~~~~ 155 (156) T protein:vir:19 79 ILTLHGDLARSITTDYGQDYALIGSPKIY--AAIHQWGGTPDMAPRPAGVPARPYMGLD-KTGEQEIFDAIRKRVSAALR 155 (156) T ss_pred chhhhHHHHHHhhheecCCEEEEecchhh--hHHhhcCcccccCCCccccCCccccCCC-HHHHHHHHHHHHHHHHHHhh Confidence 145888888887765554444344444 56668997664443 59999999533 3333333333 333333 No 123 >protein:vir:7449 Length: 123 # NCBI annotation: gp26 # Family: family:all:2713 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818564;genbank:gi:29567001;genbank:GeneID:1260238 Probab=94.85 E-value=0.00091 Score=37.26 Aligned_cols=105 Identities=16% Similarity=0.181 Sum_probs=72.6 Q ss_pred CchHH-HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCC--ccchhhhcchhceee--cCceEEE---eeCCcc Q lcl|NC_012756. 1 MSNDL-ADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSP--KRYGKYRRSWKKKKL--ANGSFVV---FNAVAS 72 (115) Q Consensus 1 ~~d~L-a~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP--~~TG~y~k~W~~kk~--~~~~~vv---~n~~~~ 72 (115) |.=++ ++++.+.+.+|.......+.-..+..|..+..+.|.++| -|||.=|.+-..... +.+.+++ |+..|. T Consensus 4 ~~f~~d~~~l~~~i~~~~~k~~~~~~~~~d~~a~~le~~aK~nApW~DRTg~ARqgl~~~~~~~g~~~~~Iylsh~veYG 83 (123) T protein:vir:74 4 VTFEYDAQELRTNIRNLDRRMESAVDALMDYEAAYATGQLKMRAPWTDRTGAARSGLLAVANKLGPGSHELIMSYSVHYG 83 (123) T ss_pred eEEEecHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccchhhhhhhccccccCCCceEEEEEecCeeec Confidence 33333 346677788898888889999999999999999999999 589998887755433 3233443 344554 Q ss_pred eeeheecceeecCCcccCccch-hhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 73 LTHILENGHLSRNGGRVAGIVH-IKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 73 ltHLLE~GHakr~GGrV~~~ph-I~paee~~~~~~~~~i~~ii~ 115 (115) -.||.+|- ++|| |.|+..+..++|.+.++.++. T Consensus 84 --~~LEla~~--------~kyaIi~Ptv~~~~~~im~g~~~ll~ 117 (123) T protein:vir:74 84 --IWLEIANS--------GQYAVIGPFLPVMGRKLMHDLEHLID 117 (123) T ss_pred --ceeeecCC--------CCceeecchHHHHhHHHHHHHHHHHH Confidence 35676663 3444 467777777777776666665 No 124 >protein:vir:79225 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469157;genbank:gi:157835000;genbank:GeneID:5648806 Probab=94.24 E-value=0.00096 Score=37.12 Aligned_cols=112 Identities=17% Similarity=0.093 Sum_probs=63.4 Q ss_pred CchHH-----HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhC--------C--------------------ccc Q lcl|NC_012756. 1 MSNDL-----ADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETS--------P--------------------KRY 47 (115) Q Consensus 1 ~~d~L-----a~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~s--------P--------------------~~T 47 (115) ||..+ .++|.+.|......+. .+..+.+.+|..+......+. | .+| T Consensus 1 M~~~i~i~~d~~~~~~~L~~l~~~~~-d~~~l~~~ig~~l~~~~~~rF~~eG~~W~pls~~t~~~r~~~g~~~~~iL~~t 79 (155) T protein:vir:79 1 MTTRIDVELDDQEVRQRLAVLMRSVT-DTLPVMRGIAAELLAETEFAFMDEGPGWPQLSPATVAAREAKGRGPHPILQVT 79 (155) T ss_pred CceEEEEEechHHHHHHHHHHHHHhh-hHHHHHHHHHHHHHHHHHHHhhccCCCCCCCCHHHHHHHhccCCCCCCccccc Confidence 55432 1467777777666554 356666666666666555432 1 467 Q ss_pred hhhhcchhceeecCceEEEeeCCcceeeheecceeecCCc--ccCccchhhhHHH-HHHHHHHHHHHHHhC Q lcl|NC_012756. 48 GKYRRSWKKKKLANGSFVVFNAVASLTHILENGHLSRNGG--RVAGIVHIKPAEE-KAIQNFEKRIKEIGK 115 (115) Q Consensus 48 G~y~k~W~~kk~~~~~~vv~n~~~~ltHLLE~GHakr~GG--rV~~~phI~paee-~~~~~~~~~i~~ii~ 115 (115) |.++.|++.....+...|=.|..| |..-+||=...-++ ..|++|||.-.++ ....+.++.|..+|. T Consensus 80 G~L~~Si~~~~~~~~v~vGt~~~Y--A~iHqfGg~~~~~~~v~iPaRpfLG~s~~~~l~~~~~~~I~~~i~ 148 (155) T protein:vir:79 80 NALARSVTTWADRNEAGIGSNLVY--AAIHQFGGDAGRGHQVEIPARRYLPFDENGQLAAGARQSILEVVL 148 (155) T ss_pred hhhhhhhhceecCCEEEEecCchh--hhhhhcccccCCCCccccCCccccCCCCccccchHHHHHHHHHHH Confidence 889999887765444333333344 56667884432222 3799999975442 222333444444444 No 125 >protein:vir:79091 Length: 175 # NCBI annotation: gp5, phage virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111205;genbank:gi:134288802;genbank:GeneID:4960765 Probab=93.25 E-value=0.0012 Score=36.53 Aligned_cols=112 Identities=12% Similarity=0.130 Sum_probs=63.8 Q ss_pred CchHH----H-HHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHh-----CC-------------------------- Q lcl|NC_012756. 1 MSNDL----A-DLIAKELAAYSDEVTEEVDKIAEQVADETVDELKET-----SP-------------------------- 44 (115) Q Consensus 1 ~~d~L----a-~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~-----sP-------------------------- 44 (115) ||.-+ . ++|.+.|......+. .+..+.+++|..++.....+ +| T Consensus 1 Ms~~i~i~~d~~~~~~~L~~l~~~~~-d~~~lm~~Ig~~l~~~t~~rF~~~~~PdW~pls~~t~~~r~~~~~~~~~~~~~ 79 (175) T protein:vir:79 1 MSDFVNFQIDDSALRTRLLQLEQAGH-QKADAMRKITQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKNGEL 79 (175) T ss_pred CceEEEEEechHHHHHHHHHHHHHhc-CHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCChHHHHhhccccccccccccc Confidence 66421 1 467777877766654 35666677777766555442 23 Q ss_pred --------------ccchhhhcchhceeecCceEEEeeCCcceeeheecceeecCCc--ccCccchhhhHHH-------- Q lcl|NC_012756. 45 --------------KRYGKYRRSWKKKKLANGSFVVFNAVASLTHILENGHLSRNGG--RVAGIVHIKPAEE-------- 100 (115) Q Consensus 45 --------------~~TG~y~k~W~~kk~~~~~~vv~n~~~~ltHLLE~GHakr~GG--rV~~~phI~paee-------- 100 (115) .+||.++.||+.....+...|=.|..| |.+-+||=....|+ ..|++|||.-.++ T Consensus 80 ~~~~~~~~~~~~~L~~tG~L~~Si~~~~~~~~v~vGtn~~Y--AaiHqfGg~~~~~~~v~IPARPfLG~s~~de~~~~~~ 157 (175) T protein:vir:79 80 TAAASRRKAGLMILQDSGQMAASTATDSGEDYSVIGSNKEY--AAIQHFGGQAGRGLKVTIPGRAWLPVTADGELQPEAV 157 (175) T ss_pred hhhHhhhccCCCcceechhhhhhhhheecCCEEEEecCcch--hhHhhcccccCCCcccccCcccccCCCcccchhHHHH Confidence 237888888887755443333334455 44557874332222 4899999985432 Q ss_pred -HHHHHHHHHHHHHhC Q lcl|NC_012756. 101 -KAIQNFEKRIKEIGK 115 (115) Q Consensus 101 -~~~~~~~~~i~~ii~ 115 (115) .....+.+-++++++ T Consensus 158 ~~I~~~i~~~l~~a~~ 173 (175) T protein:vir:79 158 EPVLNTILRHLMDAAN 173 (175) T ss_pred HHHHHHHHHHHHHHhc Confidence 233344444444444 No 126 >protein:vir:96288 Length: 100 # NCBI annotation: ORF049 # Family: family:all:180 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240315;genbank:gi:66396010;genbank:GeneID:5133365 Probab=93.25 E-value=0.0014 Score=36.27 Aligned_cols=84 Identities=18% Similarity=0.113 Sum_probs=57.9 Q ss_pred CchH--HHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhce-eecCceEEEee-CCcceeeh Q lcl|NC_012756. 1 MSND--LADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKK-KLANGSFVVFN-AVASLTHI 76 (115) Q Consensus 1 ~~d~--La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~k-k~~~~~~vv~n-~~~~ltHL 76 (115) |+.- =++.+.++|++|++.+...+++.+-++|..+-..+.+++|+|||.++.|-..+ +.|.-+-++.. ..|-+ T Consensus 13 makvkyG~~dmvk~~~~f~~~i~~~vk~~IakTa~~I~~~Avs~APVD~G~Lk~SI~~dyk~GGltavI~vGAeYAI--- 89 (100) T protein:vir:96 13 MAKVKYGADSMVVELDKFDKKIEEWVKKGIAKTTTKIYNTAVALAPVDLGFLEESIDFKYFDGGLSSVISVGADYAI--- 89 (100) T ss_pred hhhheechHHHHHHHhcchHHHHHHHHHHHHHHHHHHHhhHHhhccccccccceeeeeeeecCCeeEEEecchhHHH--- Confidence 3321 13455677888999999999999999999999999999999999999999887 55544445443 23311 Q ss_pred eecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHh Q lcl|NC_012756. 77 LENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIG 114 (115) Q Consensus 77 LE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii 114 (115) +.|-+.+--+| T Consensus 90 ---------------------------krmsqllvtvi 100 (100) T protein:vir:96 90 ---------------------------KRMSQLLVTVI 100 (100) T ss_pred ---------------------------HHHHHHHhhcC Confidence 12222222222 No 127 >protein:vir:107851 Length: 175 # NCBI annotation: gp31 # Family: family:all:274 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024704;genbank:gi:48696941;genbank:GeneID:2845939 Probab=92.93 E-value=0.0032 Score=34.29 Aligned_cols=111 Identities=14% Similarity=0.149 Sum_probs=59.9 Q ss_pred CchHH-----HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHh-----CC-------------------------- Q lcl|NC_012756. 1 MSNDL-----ADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKET-----SP-------------------------- 44 (115) Q Consensus 1 ~~d~L-----a~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~-----sP-------------------------- 44 (115) ||.-+ .++|.+.|......+. ....+.+.+|..++.....+ +| T Consensus 1 Ms~~i~i~~~~~~l~~~L~~l~~~~~-d~~~l~~~Ig~~l~~~t~~rF~~e~~Pdw~p~~p~t~~~r~~~g~~~~k~~~~ 79 (175) T protein:vir:10 1 MSDFVNFQIDDSALRTRLLQLEQAGH-QKAGAMRKIAQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKNGEL 79 (175) T ss_pred CceeEEEEecHHHHHHHHHHHHHHhc-cHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchhhhhhhhcccccchhhhhh Confidence 66311 1457777777665554 24445555555554444332 22 Q ss_pred --------------ccchhhhcchhceeecCceEEE-eeCCcceeeheecceeecCCc--ccCccchhhhHH-------- Q lcl|NC_012756. 45 --------------KRYGKYRRSWKKKKLANGSFVV-FNAVASLTHILENGHLSRNGG--RVAGIVHIKPAE-------- 99 (115) Q Consensus 45 --------------~~TG~y~k~W~~kk~~~~~~vv-~n~~~~ltHLLE~GHakr~GG--rV~~~phI~pae-------- 99 (115) .+||.++.|++.....+ +++| .|..| |..-.||=....++ -.|++|||.-.+ T Consensus 80 ~~~~~~~~~~~~~L~~tG~L~~Si~~~~~~~-~v~vGtn~~Y--AaiHqfGg~~~~~~~v~iPaRpfLG~s~~d~~~~e~ 156 (175) T protein:vir:10 80 TAAASRRKAGLMILQDSGQMAASVSTDHDDN-SAVIGSNKEY--AAIHQFGGQAGRGLKVTIPARPWLPVTADGELQPEA 156 (175) T ss_pred hhhhhhhccCCCcceechhhhhhhheeecCC-EEEEecChhh--hhhhhcccccCCCCccccCCccccCCCcccccchHH Confidence 24677788887665433 3444 33445 44556774332232 479999998643 Q ss_pred -HHHHHHHHHHHHHHhC Q lcl|NC_012756. 100 -EKAIQNFEKRIKEIGK 115 (115) Q Consensus 100 -e~~~~~~~~~i~~ii~ 115 (115) +.......+.+..+++ T Consensus 157 ~~~Il~~~~~~l~~~~~ 173 (175) T protein:vir:10 157 VEPVLNTILRHLMDAAN 173 (175) T ss_pred HHHHHHHHHHHHHHHhc Confidence 3344445555555555 No 128 >protein:vir:101508 Length: 120 # NCBI annotation: gp21 # Family: family:all:2713 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655400;genbank:gi:109522588;genbank:GeneID:4157580 Probab=92.81 E-value=0.0041 Score=33.65 Aligned_cols=107 Identities=17% Similarity=0.232 Sum_probs=75.0 Q ss_pred Cc---hHH-HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCC--ccchhhhcchhcee--ecCceEEEeeCCcc Q lcl|NC_012756. 1 MS---NDL-ADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSP--KRYGKYRRSWKKKK--LANGSFVVFNAVAS 72 (115) Q Consensus 1 ~~---d~L-a~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP--~~TG~y~k~W~~kk--~~~~~~vv~n~~~~ 72 (115) |+ =++ ++++.+.+.+|.......+.-.++..|..+..+.|.++| -|||.=|.+-.... .+...+++ + T Consensus 1 ~~~~~f~~~~~~l~~~i~~~~~k~~~~~~~~~d~~a~~le~~aK~nApW~DRTg~ARq~i~~~~~~~~~~~~~I-----y 75 (120) T protein:vir:10 1 MAKIEFKFKDIELRRGVEDMEAKVDRAMKATSNYHAVEGTAHMKEHAPWTDRTGAARAGLHAVASTPQPDRYEI-----V 75 (120) T ss_pred CceEEEEecHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccchhhhhhhccccccCCCceEEE-----E Confidence 33 233 236667788899999999999999999999999999999 58999777765533 23333333 3 Q ss_pred eeeheecceeec--CCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 73 LTHILENGHLSR--NGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 73 ltHLLE~GHakr--~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) |+|=+|||=..- |+|. +--|+|...+.-++|.+.++.++- T Consensus 76 lsh~veYG~~LEla~~~k---yaIl~PTi~~~~~~il~g~~~ll~ 117 (120) T protein:vir:10 76 FAHTVHYGIWLEIANSGR---YEIIMPTVHHEGKLMAQRLRGLLG 117 (120) T ss_pred EecCeeecceEEeeCCCC---cccccchHHHHhHHHHHHHHHHhh Confidence 455555554433 4443 224688888888888888888877 No 129 >protein:vir:94069 Length: 168 # NCBI annotation: putative RNA polymerase # Family: family:all:503 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453622;genbank:gi:84662658;genbank:GeneID:5142579 Probab=91.28 E-value=0.00036 Score=39.44 Aligned_cols=89 Identities=12% Similarity=0.090 Sum_probs=46.1 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHhCCc----cchhhhcchhceeecCceEE-Eee---CCcceeeheecceeecCCc Q lcl|NC_012756. 16 YSDEVTEEVDKIAEQVADETVDELKETSPK----RYGKYRRSWKKKKLANGSFV-VFN---AVASLTHILENGHLSRNGG 87 (115) Q Consensus 16 y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~----~TG~y~k~W~~kk~~~~~~v-v~n---~~~~ltHLLE~GHakr~GG 87 (115) .+-...+.++.. ..+.+++.+...+ .+-+|-.|+..... .+... -++ ...+++-++|||+. T Consensus 1 ~~~~~~~g~~~~-----~~~~~~l~~~~v~vG~l~~a~yp~G~~~~~~-~~~~~~~~~~g~~va~Ia~~~E~G~~----- 69 (168) T protein:vir:94 1 MTTIARKGVKMP-----PHLEAQFQSGEVKAGVLSGSTYPQMTYTDQR-TGKQIEDARGGMPVAVIAQALEYGHG----- 69 (168) T ss_pred Cccccchhhhhh-----HHHHHhhhccceeeeccccCcccccccchhh-cccccccccccccHHHHHHHHhcCCC----- Confidence 111111111111 1112222222110 12244444432211 00000 001 13466778999974 Q ss_pred ccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 88 RVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 88 rV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) ..|++|||+|..+....++.+.++++++ T Consensus 70 ~IP~RPFlr~t~~~~~~~~~~~~~~~~~ 97 (168) T protein:vir:94 70 QNHPRPFMQQTYAAQYRAWSRDLTLTLK 97 (168) T ss_pred CCCCchhhHHHHHHHHHHHHHHHHHHHh Confidence 6999999999999999999999999999 No 130 >protein:vir:99833 Length: 190 # NCBI annotation: hypothetical protein # Family: family:all:274 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164071;genbank:gi:56692603;genbank:GeneID:3192561 Probab=90.68 E-value=0.019 Score=30.04 Aligned_cols=111 Identities=13% Similarity=0.113 Sum_probs=53.8 Q ss_pred Cc-----hHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHH-----hCC------------------------cc Q lcl|NC_012756. 1 MS-----NDLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKE-----TSP------------------------KR 46 (115) Q Consensus 1 ~~-----d~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~-----~sP------------------------~~ 46 (115) |+ -++ +++.+.|....+.+. .+..+.+++|..++...++ .+| .+ T Consensus 1 M~~i~i~~d~-~~~~~~L~~l~~~~~-~~~~l~~~ig~~l~~~~~~rf~~~~~PdG~~W~p~~~~t~~rk~~~~~~~L~~ 78 (190) T protein:vir:99 1 MAGITLEWDG-RRALDVLNAGSAALG-DPSGLLQDIGELLLNIHRRRFQAQVSPDGTPWQPLSPAYLRRKRKNRDKILTL 78 (190) T ss_pred CceeEEEecH-HHHHHHHHHHHHHhh-hHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCccccHHHHHHhhcCCCcccee Confidence 22 111 233444444444333 2344455555544443333 233 14 Q ss_pred chhhhcchhceeecCceEEEeeCCcceeeheecceeec---------------CC------------------------c Q lcl|NC_012756. 47 YGKYRRSWKKKKLANGSFVVFNAVASLTHILENGHLSR---------------NG------------------------G 87 (115) Q Consensus 47 TG~y~k~W~~kk~~~~~~vv~n~~~~ltHLLE~GHakr---------------~G------------------------G 87 (115) ||.+++|++.....+...|=.|..| +.+-+||=..+ .| . T Consensus 79 tg~L~~Si~~~~~~~~v~vGtn~~y--A~iHq~Gg~i~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~v 156 (190) T protein:vir:99 79 DGHLRNLLRYQLDGSELLFGSDRPY--AAIHHFGGTIQRQARSSTVYFRQNERTGEVGREFVPRRRSNFAQDVQIGPYTI 156 (190) T ss_pred cHHHHHHHhheecCcEEEEecCcch--hhhhhcCCcccccccchhhhhhhhhhhhhhhcccccccccccchhccccccee Confidence 6889999987765544333233344 44456771110 00 0 Q ss_pred ccCccchhhhH---HHHHHHHHHHHHHHHhC Q lcl|NC_012756. 88 RVAGIVHIKPA---EEKAIQNFEKRIKEIGK 115 (115) Q Consensus 88 rV~~~phI~pa---ee~~~~~~~~~i~~ii~ 115 (115) ..|++|||.-. +++..+..++-+.++++ T Consensus 157 ~IPaRpfLG~s~~d~~~I~~~i~~~l~~~~~ 187 (190) T protein:vir:99 157 QMPARPWLGTSSQDDDTILQRVERYLQRALR 187 (190) T ss_pred eecCcccCCCCHHHHHHHHHHHHHHHHHHHh Confidence 24999999543 33455555555666666 No 131 >protein:vir:78163 Length: 92 # NCBI annotation: hypothetical protein # Family: family:all:29889 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294805;genbank:gi:149882826;genbank:GeneID:5309191 Probab=90.14 E-value=0.00039 Score=39.26 Aligned_cols=83 Identities=23% Similarity=0.396 Sum_probs=49.0 Q ss_pred CchHH------HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceee---cCceEEEeeCCc Q lcl|NC_012756. 1 MSNDL------ADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKL---ANGSFVVFNAVA 71 (115) Q Consensus 1 ~~d~L------a~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~---~~~~~vv~n~~~ 71 (115) |+|.+ =|.|++ +.+++..++-+|.++....|+++|++||.|++|...... .-.++.|....+ T Consensus 1 madaftpNp~~FDqIl~---------s~~VrALt~gaAe~aLa~AKAsAPVDTGAYRDGL~iE~~q~~~RtT~MVVG~D~ 71 (92) T protein:vir:78 1 MADAFTPNPTWFDQIMR---------TPKVRALVDGVAEETLADAKASAPVDTGAYRDGLHIEHRQGRSRETAMVVGSDE 71 (92) T ss_pred CCCccCCChhHHHHhhc---------ccchhhhhhhhhhhhhhhhcccCcccccccccccchhhhhccccceeEEeecCc Confidence 87764 456664 345788888999999999999999999999999865322 223333433333 Q ss_pred ceeeheecceeecCCcccCccchhhhHHHHHHH Q lcl|NC_012756. 72 SLTHILENGHLSRNGGRVAGIVHIKPAEEKAIQ 104 (115) Q Consensus 72 ~ltHLLE~GHakr~GGrV~~~phI~paee~~~~ 104 (115) - |-|+|- |.|. ++.+..++.. T Consensus 72 K-TlLvES----rTGN-------Lakalk~~rs 92 (92) T protein:vir:78 72 K-TLLIES----RTGN-------LARSVKRRRS 92 (92) T ss_pred c-eeeeec----ccch-------HHHHHhhhcC Confidence 2 445552 2221 1111111100 No 132 >protein:vir:80970 Length: 112 # NCBI annotation: gp10 # Family: family:all:899 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468396;genbank:gi:157324970;genbank:GeneID:5601405 Probab=89.40 E-value=0.0046 Score=33.38 Aligned_cols=100 Identities=11% Similarity=0.122 Sum_probs=48.1 Q ss_pred CchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeecCceEEEeeCCcceeeheecc Q lcl|NC_012756. 1 MSNDLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLANGSFVVFNAVASLTHILENG 80 (115) Q Consensus 1 ~~d~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~~~~~vv~n~~~~ltHLLE~G 80 (115) |+=.+.= -+......+.+.++.+...++.++......--|.+||.|++|= ...+++ .|+|++-| |+-+-+| T Consensus 1 M~vkV~i----d~~~~~~~l~~a~~~aq~~~~~ev~~~~~~yVP~~tG~L~~s~--~~~~~g-~I~y~tPY--Ar~qYY~ 71 (112) T protein:vir:80 1 MPIKVRV----DLSKAKGSVKKAKERGQFALINQAAADIALYVPFLSGDLSNQY--VIMNDK-EIMWTSIY--ARRLYNG 71 (112) T ss_pred CceeEEe----ehHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccCccccce--eeccCc-eEEecCch--hhHhhhc Confidence 4411100 0111112233445566667788888888778899999999982 223334 46666544 3444444 Q ss_pred eee-----cCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 81 HLS-----RNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 81 Hak-----r~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) +.- .|+|+ +... .|+++...-+.|.++++ T Consensus 72 ~~~~~~~~~~p~a--g~~W----~erak~~~~~~~~~~~~ 105 (112) T protein:vir:80 72 INFNFTLTHHPLA--GPKW----DQRAKVDKLESWIEVAQ 105 (112) T ss_pred ccCCCCcCCCCCc--chhh----HHHHHhhhhHHHHHHHH Confidence 431 12221 1111 23344444444444333 No 133 >protein:vir:5257 Length: 148 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852762;genbank:gi:31544037;uniprot:Q7Y5T8;genbank:GeneID:2753554 Probab=88.25 E-value=0.0011 Score=36.76 Aligned_cols=85 Identities=12% Similarity=0.098 Sum_probs=45.8 Q ss_pred CchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeecCceEEEe--eCCcceeehee Q lcl|NC_012756. 1 MSNDLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLANGSFVVF--NAVASLTHILE 78 (115) Q Consensus 1 ~~d~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~~~~~vv~--n~~~~ltHLLE 78 (115) ||-..-..-. -.+.+.+.|++.+- -+..=||-....... .+. -...+++-+.| T Consensus 1 M~~~~k~~~~--------------------~~~~l~~~l~~l~~---~~v~VGi~~~~~~~~--~~~~g~~vA~ia~~~E 55 (148) T protein:vir:52 1 MAVTVTANFS--------------------AAKQLIEQMKSLKE---KAVYVGFPAEFDEKV--KGSENFNLASLAAVLE 55 (148) T ss_pred CccccccccH--------------------HHHHHHHHHHHhhC---CeEEEEeecCcCCCC--CCCCCCCHHHHHHHHh Confidence 4332211100 01223333333321 012222211100000 000 12356788999 Q ss_pred cceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 79 NGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 79 ~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) ||+. ..|++|||+|..+....++.+.+..+++ T Consensus 56 ~G~~-----~IP~Rpflr~t~~~~~~~~~~~~~~~~~ 87 (148) T protein:vir:52 56 FGNE-----HIPARPFLRQTLEENQEKYTALFIQWFD 87 (148) T ss_pred cCCC-----CCCCcchhHHHHHHHHHHHHHHHHHHHH Confidence 9965 6999999999999999999999999888 No 134 >protein:vir:78607 Length: 155 # NCBI annotation: BcepNY3gp06 # Family: family:all:503 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294843;genbank:gi:149882906;genbank:GeneID:5291078 Probab=87.28 E-value=0.0035 Score=34.04 Aligned_cols=80 Identities=9% Similarity=0.063 Sum_probs=42.9 Q ss_pred HHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeec-Cce--E----EEe------e-CCcceee Q lcl|NC_012756. 10 AKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLA-NGS--F----VVF------N-AVASLTH 75 (115) Q Consensus 10 ~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~-~~~--~----vv~------n-~~~~ltH 75 (115) |+ ...+.|+.. .++|... ..+=||-..... +++ . .+. + ...+++- T Consensus 1 m~-------v~~k~L~~~--------~~~l~~~------~v~VGi~~~a~y~d~~~~~~~~~~~~~~~~~~g~~va~ia~ 59 (155) T protein:vir:78 1 MS-------VTRRGLTLP--------KDRYRSM------SVKAGVLAGATYPDESGKKLADGTILTKDPRAGLPVAMIAM 59 (155) T ss_pred Cc-------chHHHHHHH--------HHHHhCC------eeEEeecCCCCCCcccchhhhhhhhcccccccCCcHHHHHH Confidence 11 111112222 2223221 122333221110 000 0 000 1 1345677 Q ss_pred heecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 76 ILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 76 LLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) .+|||+. ..|++|||+|..+....++.+.++.+++ T Consensus 60 ~~E~G~~-----~IP~RPFlr~t~~~~~~~~~~~l~~~~~ 94 (155) T protein:vir:78 60 ALNYGTS-----KLPARPFMEKTITDRSAEWIKGLTVMMT 94 (155) T ss_pred hhhcCCC-----CCCCcchhhHHHHHHHHHHHHHHHHHHH Confidence 8899974 6999999999999999999999998888 No 135 >protein:vir:106728 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944315;genbank:gi:38638614;genbank:GeneID:2657357 Probab=87.02 E-value=0.0036 Score=34.01 Aligned_cols=80 Identities=9% Similarity=0.071 Sum_probs=42.9 Q ss_pred HHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeec-Cce--EE----Ee------e-CCcceee Q lcl|NC_012756. 10 AKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLA-NGS--FV----VF------N-AVASLTH 75 (115) Q Consensus 10 ~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~-~~~--~v----v~------n-~~~~ltH 75 (115) |+ ...+.|+.. .++|... ..+=||-..... +++ .. +. + ...+++- T Consensus 1 m~-------v~~k~L~~~--------~~~l~~~------~v~VGi~~~a~y~d~~~~~~~~~~~~~~~~~~g~~va~ia~ 59 (155) T protein:vir:10 1 MS-------VTRRGLTLP--------KDRYRSM------SVKAGVLAGATYPDESGKKLADGTILTKDPRAGLPVAMIAM 59 (155) T ss_pred Cc-------chHHHHHHH--------HHHHhCC------eeEEeecCCCCCccccchhhhhhhhcccccccCCcHHHHHH Confidence 11 111112222 2223221 122333221110 000 00 00 1 1345677 Q ss_pred heecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 76 ILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 76 LLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) ..|||+. ..|++|||+|..+....++.+.+..+++ T Consensus 60 ~~E~G~~-----~IP~RPFlr~t~~~~~~~~~~~l~~~~~ 94 (155) T protein:vir:10 60 ALNYGTS-----KLPARPFMEKTIADRSAEWIKGLTVMMT 94 (155) T ss_pred HHhcCCC-----CCCCcchhHHHHHHHHHHHHHHHHHHHH Confidence 8899974 6999999999999999999999998888 No 136 >protein:vir:107757 Length: 189 # NCBI annotation: gp20 # Family: family:all:503 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024868;genbank:gi:48697510;genbank:GeneID:2948378 Probab=86.94 E-value=0.0016 Score=35.90 Aligned_cols=84 Identities=15% Similarity=0.222 Sum_probs=42.7 Q ss_pred CchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeec-CceEEEeeCCcceeeheec Q lcl|NC_012756. 1 MSNDLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLA-NGSFVVFNAVASLTHILEN 79 (115) Q Consensus 1 ~~d~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~-~~~~vv~n~~~~ltHLLE~ 79 (115) ||..+.. .+...+.|++..+ +.+ .-+..=||-..... +|. ...+++-+.|| T Consensus 1 M~~~i~~---------~~~~~~~L~~~lk-----------~l~---~k~V~VGi~~~~~y~dG~-----~vA~Ia~~~E~ 52 (189) T protein:vir:10 1 MGRVIRK---------QGPARVKLNAFIK-----------GMN---DYSVRIGWFSTAKYPDGT-----PTAYVASIHEF 52 (189) T ss_pred Ccceecc---------CcHHHHHHHHHHH-----------Hhh---CCeEEEEecCCCCCCCcc-----cHHHHHHHHHh Confidence 4333221 1111111222222 211 01111122111111 111 13567788999 Q ss_pred ceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 80 GHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 80 GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) |+-.+ .+|++|||+|..+....+..+.++..++ T Consensus 53 G~p~~---~IP~RPFlr~t~~~~~~~~~~~l~~~~~ 85 (189) T protein:vir:10 53 GAPSR---GIPARSFIRPTIAAQQAAWSQQMRFYAK 85 (189) T ss_pred cCcCC---CCCCchhhhHHHHHHHHHHHHHHHHHHH Confidence 97543 4899999999999888888887777776 No 137 >protein:vir:102190 Length: 93 # NCBI annotation: gp21 # Family: family:all:2713 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655217;genbank:gi:109522797;genbank:GeneID:4157429 Probab=86.92 E-value=0.0082 Score=32.02 Aligned_cols=82 Identities=17% Similarity=0.178 Sum_probs=60.2 Q ss_pred HHHHHHHHHHHHHHHHHHhCC--ccchhhhcchhceee--cCceEEE---eeCCcceeeheecceeecCCcccCccc-hh Q lcl|NC_012756. 24 VDKIAEQVADETVDELKETSP--KRYGKYRRSWKKKKL--ANGSFVV---FNAVASLTHILENGHLSRNGGRVAGIV-HI 95 (115) Q Consensus 24 ~~~~~~~~a~~~~~~lk~~sP--~~TG~y~k~W~~kk~--~~~~~vv---~n~~~~ltHLLE~GHakr~GGrV~~~p-hI 95 (115) |...++-.|+.+..+.|.++| .|||.=|.+-..... ++..+++ ++..|. -.||.+|- ++| -| T Consensus 1 ~~~~~d~aa~~le~~aK~nApW~DRTg~AR~~l~~~~~~~g~~~~~i~lsh~v~Yg--~~LE~a~~--------~kyaIl 70 (93) T protein:vir:10 1 MKATSNYHAVEGTAHMKEHAPWTDRTGAARAGLHAVASTPQPDRYEIVFAHTVHYG--IWLEIANS--------GRYEII 70 (93) T ss_pred CchhhhHHHHHHHHHHhcCCCccccchhhhhhhcccccccCCceEEEEEecCeecc--ceEEeecC--------CCccch Confidence 777788888999999999999 589998888765443 3333443 344554 34677665 244 46 Q ss_pred hhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 96 KPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 96 ~paee~~~~~~~~~i~~ii~ 115 (115) .|...+..++|.+.++.++- T Consensus 71 ~Ptv~~~~~~i~~g~~~ll~ 90 (93) T protein:vir:10 71 MPTVHHEGKLMAQRLRGLLG 90 (93) T ss_pred hhhHHHHHHHHHHHHHHHHH Confidence 89999999999999998887 No 138 >protein:vir:1087 Length: 161 # NCBI annotation: Orf46 # Family: family:all:1029 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076741;genbank:gi:13095851;genbank:GeneID:920400 Probab=86.22 E-value=0.015 Score=30.60 Aligned_cols=114 Identities=18% Similarity=0.241 Sum_probs=59.8 Q ss_pred CchHHHHHHHHHHHhhHHHHHH--------HHHHHHHHHHHHHHHHHHHhC------Cccch---hhhcchhceee---- Q lcl|NC_012756. 1 MSNDLADLIAKELAAYSDEVTE--------EVDKIAEQVADETVDELKETS------PKRYG---KYRRSWKKKKL---- 59 (115) Q Consensus 1 ~~d~La~~I~~~L~~y~~~v~~--------~~~~~~~~~a~~~~~~lk~~s------P~~TG---~y~k~W~~kk~---- 59 (115) |-.+ ..-+-..|++|.+.++. +=.++...-|+-..+.|..-+ +.+|| -+++|-..+.. T Consensus 1 ~~~~-~~~fdd~L~~~~~~v~klv~~lt~e~kakIT~AGAkv~a~~L~~~T~~kHy~~~kt~k~~HLADsI~~~~~niDg 79 (161) T protein:vir:10 1 MMEE-KQLFEDIMNGIIFQAESVSTSLTVEDKAKITKAGANAFAIGLEKVTKDKHYRIRKTGENPHLADSILVQNTNIDG 79 (161) T ss_pred Ccch-hHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHhHHHHHHHHHHHhhhhcCcCCCCCCcchhhhheeecccccCc Confidence 2111 11122223333333322 122333334444444454444 45666 67888876532 Q ss_pred -cCceEEE-e-eCCcceeeheecceee----------cCCcc--cCccchhhhHHHH--HHHH----HHHHHHHHhC Q lcl|NC_012756. 60 -ANGSFVV-F-NAVASLTHILENGHLS----------RNGGR--VAGIVHIKPAEEK--AIQN----FEKRIKEIGK 115 (115) Q Consensus 60 -~~~~~vv-~-n~~~~ltHLLE~GHak----------r~GGr--V~~~phI~paee~--~~~~----~~~~i~~ii~ 115 (115) .+|+-+| + +++.+++|+|+.|-+. .|+|. ++|-+|+--+.+. +... .-+..++|+. T Consensus 80 ~~dG~StVGw~~kka~ia~~indGtr~~~~~~~~~~~~n~Gt~~i~gDHFvd~~r~~~~~k~aV~~Ae~~~y~eil~ 156 (161) T protein:vir:10 80 IKDGNSTVGWDYTKSRVGHLIENGTRFPMYSKKGTKYRKGGQVAITSDPFVSTYRDSMEAQVAMFSAEAEVFSEILK 156 (161) T ss_pred ccCCceeccccCchhhhhhhhcccchhhhhhcccccccCCcceeecCcchhHHHHhhhhhHHHHHHHHHHHHHHHHH Confidence 2333333 3 5678999999999532 35664 7888999888773 3333 3344577777 No 139 >protein:vir:3994 Length: 168 # NCBI annotation: unknown # Family: family:all:1029 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116502;genbank:gi:14251135;genbank:GeneID:921309 Probab=85.62 E-value=0.014 Score=30.69 Aligned_cols=114 Identities=18% Similarity=0.213 Sum_probs=58.0 Q ss_pred CchHHHHHHHHHHHhhHHHH----HHHHHHHHHHHHHHHHHHHHHhCC------ccchh---hhcchhceee-----cCc Q lcl|NC_012756. 1 MSNDLADLIAKELAAYSDEV----TEEVDKIAEQVADETVDELKETSP------KRYGK---YRRSWKKKKL-----ANG 62 (115) Q Consensus 1 ~~d~La~~I~~~L~~y~~~v----~~~~~~~~~~~a~~~~~~lk~~sP------~~TG~---y~k~W~~kk~-----~~~ 62 (115) |. +|.+.++.-|++-...+ .++=.++...-|+-..+.|..-+| +.||+ +++|...+.. .+| T Consensus 1 M~-~~~d~l~~~~~~v~kl~~~lt~e~kakIT~AGAkv~a~~L~~~T~~kHy~~rktg~~~HLADsI~~~~~niDg~~dG 79 (168) T protein:vir:39 1 MV-SFYDAMQLIINQAESLSTKMTVEDKAEVTKAGAKVFEQALAYEVRNRHYRHRDTGEDPHLADSIVMKNKNIDGVKDG 79 (168) T ss_pred Cc-cHHHHHHHHHHHHHhccCCCCHHHHHHHHHHhHHHHHHHHHHHhHHhcccCCCCCCCccchhheeecccccCcccCC Confidence 43 23333222222211111 111223333333333444555444 45654 6777766532 233 Q ss_pred eEEE-e-eC-------Ccceeeheeccee-----------ecCCcc--cCccchhhhHHHHH--HHH-H---HHHHHHHh Q lcl|NC_012756. 63 SFVV-F-NA-------VASLTHILENGHL-----------SRNGGR--VAGIVHIKPAEEKA--IQN-F---EKRIKEIG 114 (115) Q Consensus 63 ~~vv-~-n~-------~~~ltHLLE~GHa-----------kr~GGr--V~~~phI~paee~~--~~~-~---~~~i~~ii 114 (115) +-+| + ++ +.++|++|+.|-. -+++|+ ++|-+||--+.+.+ ... | .+..++|| T Consensus 80 ~StVGw~~k~~~~~~~~a~iAr~lNDGTrf~~~~~~~~~~y~~~g~v~i~gDHFvd~~r~~~a~k~aV~~Ae~e~~~eil 159 (168) T protein:vir:39 80 QSVVGWERSTEKGTHTKGYIANIINNGSRFPQFTTRSGRKYKNPGEVAVHADHFIEETRKNPIVQQGILKAEAEAMRKII 159 (168) T ss_pred ceeccccCccccccccchhheehhccccccchhhhhcccccccccceeecccchhHHHhhhhhhhHHHHHHHHHHHHHHH Confidence 3333 3 32 5689999999964 256776 57889999888853 222 2 33446677 Q ss_pred C Q lcl|NC_012756. 115 K 115 (115) Q Consensus 115 ~ 115 (115) . T Consensus 160 ~ 160 (168) T protein:vir:39 160 N 160 (168) T ss_pred H Confidence 6 No 140 >protein:vir:101563 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958111;genbank:gi:41057657;genbank:GeneID:2716820 Probab=82.52 E-value=0.0079 Score=32.10 Aligned_cols=80 Identities=13% Similarity=0.090 Sum_probs=42.5 Q ss_pred CchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhce---eecCceEEE---------e- Q lcl|NC_012756. 1 MSNDLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKK---KLANGSFVV---------F- 67 (115) Q Consensus 1 ~~d~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~k---k~~~~~~vv---------~- 67 (115) |+ ...+.|+ .+.+.|... +-.=||-.. ..+.+..+. . T Consensus 1 m~----------------v~r~~L~--------~~~~~l~~~------~V~VGi~~~a~y~d~~g~~~~~g~~~~~~~~~ 50 (155) T protein:vir:10 1 MS----------------VTRRGLT--------LPKDRYKSM------SVKAGVLAGATYPDESGKKLADGTILKKDPRA 50 (155) T ss_pred Cc----------------chHHHHH--------HHHHHhhCC------eeEEeecCCCCCCccccchhhhhhhhcccccc Confidence 11 0001111 122223321 122223111 011111110 0 Q ss_pred e-CCcceeeheecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 68 N-AVASLTHILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 68 n-~~~~ltHLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) + ...++|-.+|||+. ..|++|||+|..+....++.+.++++++ T Consensus 51 G~pva~ia~~~e~G~~-----~IP~RPFlr~t~~~~~~~~~~~l~~~~~ 94 (155) T protein:vir:10 51 GLPVAMIAMALNYGTS-----KLPARPFMEKTIADRSAEWIKGLTVMMT 94 (155) T ss_pred CcchhhhhhhhhcCCC-----CCCCcchhHHHHHHHHHHHHHHHHHHHH Confidence 1 13456777899984 6999999999999999999999999888 No 141 >protein:vir:7412 Length: 168 # NCBI annotation: hypothetical protein # Family: family:all:1029 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839929;genbank:gi:30089899;genbank:GeneID:1260686 Probab=81.86 E-value=0.033 Score=28.73 Aligned_cols=110 Identities=15% Similarity=0.183 Sum_probs=57.5 Q ss_pred CchHHHHHHHHHHHhhHHHHHHH--------HHHHHHHHHHHHHHHHHHhCC------ccch---hhhcchhceee---- Q lcl|NC_012756. 1 MSNDLADLIAKELAAYSDEVTEE--------VDKIAEQVADETVDELKETSP------KRYG---KYRRSWKKKKL---- 59 (115) Q Consensus 1 ~~d~La~~I~~~L~~y~~~v~~~--------~~~~~~~~a~~~~~~lk~~sP------~~TG---~y~k~W~~kk~---- 59 (115) |+| |.+ .|.+|.+.+++- =.++...-|+-..+.|...+| +.|| -++.+-..+.. T Consensus 1 M~~-~~~----~l~~~~~~vekl~~~lt~eqkakITkAGAkv~~~~L~~~t~~kHy~~k~t~~~~HLaDsI~~~~~niDg 75 (168) T protein:vir:74 1 MAT-FEE----AMQLIINQAESLSTKMTVEDKAEVTKAGAKVFEQALAYEVRNRHYRHRDTGEDPHLADSIVMKNKNIDG 75 (168) T ss_pred Ccc-HHH----HHHHHHHHHHhhccCCCHHHHHHHHHhhhHHHHHHHHHHhHHhhcccCCCcccchhhhheeecccccCc Confidence 443 332 233333333331 122333333333444555554 4566 57888765432 Q ss_pred -cCceEEE-eeC--------Ccceeeheeccee-----------ecCCcc--cCccchhhhHHHHH--HHH----HHHHH Q lcl|NC_012756. 60 -ANGSFVV-FNA--------VASLTHILENGHL-----------SRNGGR--VAGIVHIKPAEEKA--IQN----FEKRI 110 (115) Q Consensus 60 -~~~~~vv-~n~--------~~~ltHLLE~GHa-----------kr~GGr--V~~~phI~paee~~--~~~----~~~~i 110 (115) .+|+.+| +.+ +.++|++|+.|-. .+++|+ ++|-+|+--+.+.+ .+. -.+.. T Consensus 76 ~~dG~s~VGf~~k~~~~~~~kA~iAr~lNDGTk~~~~~~~~~~~~~~~g~v~i~gDHFvd~~r~~~~~k~~V~~Ae~~~y 155 (168) T protein:vir:74 76 VKDGQSVVGWERSTEKGTHTKGYIANIINNGSRFPQFTTRSGRKYKKPGEVAVHADHFIEETRMNLIVQQGILKAEAEAM 155 (168) T ss_pred ccCCceeecccccccccccchhhhhhhhcccccccccccccccccccccccccccchhHHHHHhhhhhHHHHHHHHHHHH Confidence 2343333 332 5689999999953 123454 68999999888763 233 23334 Q ss_pred HHHhC Q lcl|NC_012756. 111 KEIGK 115 (115) Q Consensus 111 ~~ii~ 115 (115) ++||. T Consensus 156 ~eIl~ 160 (168) T protein:vir:74 156 RKIIN 160 (168) T ss_pred HHHHH Confidence 66666 No 142 >protein:vir:77650 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:YP_022741;genbank:gi:47835022;genbank:GeneID:2821447 Probab=81.33 E-value=0.0087 Score=31.88 Aligned_cols=80 Identities=11% Similarity=0.054 Sum_probs=41.8 Q ss_pred CchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeec-Cce--EE---------Ee- Q lcl|NC_012756. 1 MSNDLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLA-NGS--FV---------VF- 67 (115) Q Consensus 1 ~~d~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~-~~~--~v---------v~- 67 (115) |+- ....|+. ..+++... ..+=||-..... ++. .. .. T Consensus 1 m~~----------------~r~~l~~--------~~~~l~~~------~v~VGi~~~a~y~d~~~~~~~~~~~~~~~~~~ 50 (155) T protein:vir:77 1 MSV----------------TRRGLTL--------PKDRYRSM------SVKAGVLAGATYPDESGKKLADGSILKKDPRA 50 (155) T ss_pred Ccc----------------hHHHHHH--------HHHHHhcC------ceEEeecCCCCCccccchhhhhhhhccccccc Confidence 110 0011211 12222221 122333221100 000 00 00 Q ss_pred -eCCcceeeheecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 68 -NAVASLTHILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 68 -n~~~~ltHLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) -...++|-.+|||+. ..|++|||+|..+....++.+.+..+++ T Consensus 51 G~pva~ia~~~e~G~~-----~IP~RPFlr~t~~~~~~~~~~~l~~~~~ 94 (155) T protein:vir:77 51 GLPVAMIAMALNYGTS-----KLPARPFMEKTIADRSAEWIKGLTVMMT 94 (155) T ss_pred cccHhhhhhhhhcCCC-----CCCCCchhhHHHHHHHHHHHHHHHHHHH Confidence 012456778899984 6999999999999999999999988888 No 143 >protein:vir:45 Length: 112 # NCBI annotation: gp10 # Family: family:all:899 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463471;swissprot:trembl:q9t1b3;genbank:gi:16798793;uniprot:Q9T1B3;genbank:GeneID:922369 Probab=76.36 E-value=0.044 Score=28.01 Aligned_cols=98 Identities=11% Similarity=0.102 Sum_probs=45.0 Q ss_pred Cc----hHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeecCceEEEeeCCc---ce Q lcl|NC_012756. 1 MS----NDLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLANGSFVVFNAVA---SL 73 (115) Q Consensus 1 ~~----d~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~~~~~vv~n~~~---~l 73 (115) |+ =++. .|+.. +.+.++.+...++.++..+...--|.+||.+++| ....+++. |+|++-| |. T Consensus 1 M~vkv~vn~~-~~~~~-------l~~a~~r~q~~~~~ev~~~~~~yVP~~~G~L~~S--~~~~~~g~-I~y~tPYAr~qY 69 (112) T protein:vir:45 1 MPIKVRVDLS-KAKGS-------VKKAKERGQFALINQAAADIALYVPFLSGDLSNQ--YVIMNDKE-IMWTSIYARRLY 69 (112) T ss_pred CceeEEeehH-HHHHH-------HHHHHHHHHHHHHHHHHHHhhcCCccccCccccc--eeeccCCe-EEecChhhHHhh Confidence 43 1221 12221 2233444556677777777777789999999997 23345554 5555432 21 Q ss_pred eeheecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 74 THILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 74 tHLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) =|-.-+|....++|+ +... .|+++...-+.|.++++ T Consensus 70 Y~~~~~~~~~~~p~a--g~~W----~erak~~~~~~~~~~~~ 105 (112) T protein:vir:45 70 KGINFNFTLTHHPLA--GPEW----DQRAKIDKMDVWEKVAQ 105 (112) T ss_pred hccccCCCCCCCCCC--chhh----HHHHHHhhHHHHHHHHH Confidence 122112211222221 1111 22333344444444333 No 144 >protein:vir:1028 Length: 168 # NCBI annotation: Orf48 # Family: family:all:1029 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076682;genbank:gi:13095791;genbank:GeneID:920342 Probab=75.79 E-value=0.077 Score=26.69 Aligned_cols=110 Identities=18% Similarity=0.248 Sum_probs=58.0 Q ss_pred CchHHHHHHHHHHHhhHHHHHH--------HHHHHHHHHHHHHHHHHHHhCC------ccch---hhhcchhceee---- Q lcl|NC_012756. 1 MSNDLADLIAKELAAYSDEVTE--------EVDKIAEQVADETVDELKETSP------KRYG---KYRRSWKKKKL---- 59 (115) Q Consensus 1 ~~d~La~~I~~~L~~y~~~v~~--------~~~~~~~~~a~~~~~~lk~~sP------~~TG---~y~k~W~~kk~---- 59 (115) |.| |.+. |++|.+.+++ +=.++...-|+-..+.|...+| ++|| -++.+-..+.. T Consensus 1 M~~-~~d~----l~~~~~~vekl~~~ls~eqkakITkAGAkv~~~~L~~~tk~kHy~~k~t~~~~HLaDsI~~~~~niDg 75 (168) T protein:vir:10 1 MVS-FYDA----MQLIVDRAEELSTKMSVEDKAEVTKAGAKVFEQALAYEVRNRHYRHRDTGEDPHLADSIVMKNKNIDG 75 (168) T ss_pred CCc-HHHH----HHHHHHHHHHhhcCCCHHHHHHHhHhhhHHHHHHHHHHhhHhhhccCCCCccchhhhhheeccccccc Confidence 432 3333 3333333333 1223333344444445555444 4566 57888876532 Q ss_pred -cCceEEE-ee-C-------Ccceeeheeccee-----------ecCCcc--cCccchhhhHHHHH--HHH-HH---HHH Q lcl|NC_012756. 60 -ANGSFVV-FN-A-------VASLTHILENGHL-----------SRNGGR--VAGIVHIKPAEEKA--IQN-FE---KRI 110 (115) Q Consensus 60 -~~~~~vv-~n-~-------~~~ltHLLE~GHa-----------kr~GGr--V~~~phI~paee~~--~~~-~~---~~i 110 (115) .+|+.+| +. + +.++|++|+.|-. -+++|+ ++|-+|+--+.+.+ .+. |. +.. T Consensus 76 ~~dG~s~VGf~~k~~~~~~~ka~iAr~lNDGTk~~~~~~~~~~~~~~~g~v~i~gDHFvd~~r~d~a~k~~V~~Ae~~~y 155 (168) T protein:vir:10 76 VKDGQSVVGWERSTEKGTHTKGYIANIINNGSRFPQFTTRSGRKYKKPGEVAVHADHFIEETRKNPIVQQGILKAEAEAM 155 (168) T ss_pred ccCCceeecccCccccccccchheeeeccccccccccccccccccccccccccccchhHHHhhhchhhhHHHHHHHHHHH Confidence 2343333 33 2 5699999999953 123454 68999998888863 222 33 334 Q ss_pred HHHhC Q lcl|NC_012756. 111 KEIGK 115 (115) Q Consensus 111 ~~ii~ 115 (115) ++||. T Consensus 156 ~eIl~ 160 (168) T protein:vir:10 156 RKIIN 160 (168) T ss_pred HHHHH Confidence 56666 No 145 >protein:vir:79687 Length: 113 # NCBI annotation: hypothetical protein # Family: family:all:899 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285886;genbank:gi:148750843;genbank:GeneID:5220386 Probab=72.58 E-value=0.096 Score=26.17 Aligned_cols=98 Identities=19% Similarity=0.255 Sum_probs=49.7 Q ss_pred CchHHHHHHHHHHHhhHHHHH-HHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeecCceEEEeeCCcceeeheec Q lcl|NC_012756. 1 MSNDLADLIAKELAAYSDEVT-EEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLANGSFVVFNAVASLTHILEN 79 (115) Q Consensus 1 ~~d~La~~I~~~L~~y~~~v~-~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~~~~~vv~n~~~~ltHLLE~ 79 (115) |+ .|.-...-+. +.++.+-..++.++..+.-.=-|.+||.+++|=. .+.+. |+|+.-| |+-+=+ T Consensus 1 ~~---------dL~~~~~~~~~~~~~raQ~~l~~ev~~~~~pYVP~~~G~Lk~S~~---i~s~~-I~y~tPY--Ar~qyY 65 (113) T protein:vir:79 1 MS---------DLSVFSRMAQSTGSRSVRLQVLNQMHQDMEQYVPKRAGFLRSQSF---VNDTG-IHYTAKY--ARAQFY 65 (113) T ss_pred Cc---------hHHHHHHhhchhHHHHHHHHHHHHHHHhhcccCcccccchhcccc---ccCCe-eEecChh--hhHhhc Confidence 43 3333333322 2455566677788888887788999999999832 34443 5555543 222222 Q ss_pred ceeecCCcccCcc--chh-hhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 80 GHLSRNGGRVAGI--VHI-KPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 80 GHakr~GGrV~~~--phI-~paee~~~~~~~~~i~~ii~ 115 (115) |.. +|.++... |.= ..=.|+++...-+.+.++++ T Consensus 66 g~~--~~~~~~~~t~p~ag~~W~eraKa~h~~~w~~~~~ 102 (113) T protein:vir:79 66 GFV--NGHRVRNYSTPGTGRRWDLKAKAVYKADWQKVAV 102 (113) T ss_pred ccc--CCCCccccCCCCCCchhhHHHHHHhHHHHHHHHH Confidence 211 11111100 000 11234566666666666655 No 146 >protein:vir:80037 Length: 199 # NCBI annotation: gp11 # Family: family:all:503 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468715;genbank:gi:157325295;genbank:GeneID:5601728 Probab=68.39 E-value=0.029 Score=29.00 Aligned_cols=98 Identities=5% Similarity=-0.029 Sum_probs=40.3 Q ss_pred HHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhh--------hcchhceee---------------------- Q lcl|NC_012756. 10 AKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKY--------RRSWKKKKL---------------------- 59 (115) Q Consensus 10 ~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y--------~k~W~~kk~---------------------- 59 (115) |+.-+++.. +. .+.+.++.+. .+.++-.-|-.-|.. .=|+..+.. T Consensus 1 m~vt~~~~~-~~-~~~~~l~~L~---~k~v~vGi~~~d~~~~~~Ia~~~E~Ga~I~~~~~~l~Ip~~~a~~~k~~~~~~~ 75 (199) T protein:vir:80 1 MKVTTDKST-MN-KAIRELDQLD---RYSLQIGLFGEDDSFIQMIAGVHEFGLTIRPKGKYLTIPTPEAGDRRARDIPGL 75 (199) T ss_pred CcccccHHH-HH-HHHHHHHHhc---CCEEEEEEecCCCcchhheeehhhcCCeeecCCceeeecchhhhcccccccCcc Confidence 222111110 00 0111111110 001111111111100 001111100 Q ss_pred --cCceEE-EeeCCcceeeheecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 60 --ANGSFV-VFNAVASLTHILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 60 --~~~~~v-v~n~~~~ltHLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) .++..+ .......++..+|||.-. ...|++|||+|..+....++.+.++..++ T Consensus 76 ~~p~g~~~~~~~~~~~~~~~~e~g~~~---~~IP~RPFlr~t~~~~~~~~~~~~~~~~~ 131 (199) T protein:vir:80 76 FKPKGKNILAVAGPDGKLTVMFYLKTE---VNIPERSFLRSTFDEKSNKWGELFEGWID 131 (199) T ss_pred cccCCcceeeeeccccceeeeeecccc---ccCCCCchhHHHHHHHHHHHHHHHHHHHH Confidence 111111 111123345567888643 35899999999999998888888888777 No 147 >protein:vir:79034 Length: 141 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110729;genbank:gi:134287346;genbank:GeneID:4955208 Probab=65.96 E-value=0.12 Score=25.63 Aligned_cols=103 Identities=14% Similarity=0.106 Sum_probs=60.9 Q ss_pred HHHHH---HHHHHhhHHHH----HHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeecC--ceEE---------- Q lcl|NC_012756. 5 LADLI---AKELAAYSDEV----TEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLAN--GSFV---------- 65 (115) Q Consensus 5 La~~I---~~~L~~y~~~v----~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~~--~~~v---------- 65 (115) +|+.- .+.|+++.+.+ ...+.+.+++..+++..++.+..-.+|-. .+|. .++. T Consensus 1 M~~~~~~d~~gl~~~~~~l~~~~~~~~~~~~~~~~~~~a~~l~~~vk~~tPV--------dTG~Lr~sw~~~~~~~~~~~ 72 (141) T protein:vir:79 1 MARWGSVDFREFKRVCKKMEKLTKIDLDKFCKDAARELAARLLGKVIRRTPV--------DTGFLRQGWNGVAYARSLPV 72 (141) T ss_pred CCCCccCcHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--------cchhhcccccccccccccce Confidence 77621 12445554444 33455566666666666666655444431 1221 1111 Q ss_pred Eee-CCc--ceeeheecceeecCCcc-cCccchhhhHH--HHHHHHHHHHHHHHhC Q lcl|NC_012756. 66 VFN-AVA--SLTHILENGHLSRNGGR-VAGIVHIKPAE--EKAIQNFEKRIKEIGK 115 (115) Q Consensus 66 v~n-~~~--~ltHLLE~GHakr~GGr-V~~~phI~pae--e~~~~~~~~~i~~ii~ 115 (115) ... ..+ .+.--+||.|-.-.|=| ++|+|++.|+. ++++..|+..+.++++ T Consensus 73 ~~~g~~~~v~v~n~~~YA~~VE~Ghr~~~~~gfV~G~fml~~s~~~~~~~~~~~~~ 128 (141) T protein:vir:79 73 YKQGNNYIIEVVNPTEYASYVNFGHRTKDGKGWVKGQHFLTISEMELQSQVDKIIE 128 (141) T ss_pred eecCCeeEEEEecCCcchhhhhcceeecCCcceeCCchhHHHHHHHHHHHHHHHHH Confidence 111 111 22334688888888854 77889999987 8999999999999999 No 148 >protein:vir:98557 Length: 149 # NCBI annotation: gp14 # Family: family:all:370 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958069;genbank:gi:41057366;genbank:GeneID:2744228 Probab=63.57 E-value=0.31 Score=23.40 Aligned_cols=112 Identities=17% Similarity=0.172 Sum_probs=47.7 Q ss_pred CchHHHHHHHHHHHhhHHHHH-HHHHHHHHHHHHHHHHHHHH-----hCCc------------------------cchhh Q lcl|NC_012756. 1 MSNDLADLIAKELAAYSDEVT-EEVDKIAEQVADETVDELKE-----TSPK------------------------RYGKY 50 (115) Q Consensus 1 ~~d~La~~I~~~L~~y~~~v~-~~~~~~~~~~a~~~~~~lk~-----~sP~------------------------~TG~y 50 (115) |+| | +++...|......++ ........++|..++...+. .+|- ++|.+ T Consensus 1 m~d-~-~~l~~~L~~ll~~L~~~~~~~ll~~Ig~~l~~~t~~rf~~q~~PdG~~W~p~~~~~~~~k~~~~~~~l~~~g~l 78 (149) T protein:vir:98 1 MSE-L-TALQERLTGLIASLSPAARRQMAADIAKKLRASQQQRIRRQQAPDGTPYAARKRQSVRSKKGRIRREMFARLRT 78 (149) T ss_pred Cch-H-HHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHHhccCCCCcccchhhhh Confidence 775 3 233333333322221 12333445555544444333 2451 11344 Q ss_pred hcchhceeecCceEEEe-eCCcceeeheecceeec--C-C--cccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 51 RRSWKKKKLANGSFVVF-NAVASLTHILENGHLSR--N-G--GRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 51 ~k~W~~kk~~~~~~vv~-n~~~~ltHLLE~GHakr--~-G--GrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) +++.+.....++..|.+ .+.-..|..-.||=..+ + | ...|++|||.=.++. ..++.+.|.+-+. T Consensus 79 ~~sl~~~~~~~~~~V~~~Gs~~~yAa~HQfG~~~r~~~~~~~~~iPaRp~LG~s~~d-~~~i~~~i~~~l~ 148 (149) T protein:vir:98 79 NRFMKAKGSDSAAVVEFTGRVQRMARVHQYGLKDRPNRHSRDVQYAARPLLGFTRDD-EQMIEDIIIRHLG 148 (149) T ss_pred hhhhhheecCCeeEEEecCcchHHhhHhhccccccccCCCcceeccccccCCCCHHH-HHHHHHHHHHHhh Confidence 45555544444443422 33222244556774432 1 1 257999999754332 2333333333222 No 149 >protein:vir:78894 Length: 105 # NCBI annotation: gp10 # Family: family:all:29989 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468850;genbank:gi:157325424;genbank:GeneID:5601891 Probab=62.80 E-value=0.039 Score=28.28 Aligned_cols=96 Identities=20% Similarity=0.148 Sum_probs=46.5 Q ss_pred Cc-hHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhc-eeecCceEEEeeC--Ccceeeh Q lcl|NC_012756. 1 MS-NDLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKK-KKLANGSFVVFNA--VASLTHI 76 (115) Q Consensus 1 ~~-d~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~-kk~~~~~~vv~n~--~~~ltHL 76 (115) || |.+.+.+.+.|..+.=+....+ -+|++.-..--.|.+||.+.+|=.. ...+.+.++++.- .|+ |+- T Consensus 1 ~~f~~f~~~~~k~l~kr~L~~~g~v-------q~EvlR~~~PyvP~~tG~Lk~S~~l~tvIgsg~I~y~~~~~aPY-Ar~ 72 (105) T protein:vir:78 1 MSFSSFKDAVIDDIHNKALSTAAKA-------GGELVELAQPVTPILYGDLRRSSYFKIIIQKNSIVARVFSLTPY-ARR 72 (105) T ss_pred CCcccccchHHHHHHHhcCCCCchh-------hHHHHHHhCCCCcccccccccccccceeecCCeeEeeccccCch-hhh Confidence 54 3444444444444332222211 1233322222358899999998654 4556666665421 111 233 Q ss_pred eecceeecCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 77 LENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 77 LE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) +=|.|+ ||.- =+|.....--+.|.++++ T Consensus 73 qYYe~~-Rg~~----------WfErm~a~hk~~I~~~ve 100 (105) T protein:vir:78 73 QYYENR-RNPR----------WYEMAVSYGIQSINQIVE 100 (105) T ss_pred hhhccc-CCCc----------hhHHhhhcchhHHHHHHh Confidence 224443 2222 245555555666777777 No 150 >protein:vir:99546 Length: 200 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039796;genbank:gi:126011046;genbank:GeneID:4818241 Probab=62.76 E-value=0.044 Score=28.00 Aligned_cols=88 Identities=15% Similarity=0.164 Sum_probs=39.0 Q ss_pred HHHHHHHHHHHH----HHHHHHHHHHhCCccchhhhcchhcee-----e--cCceEEEeeCCcceeeheecceeec---- Q lcl|NC_012756. 20 VTEEVDKIAEQV----ADETVDELKETSPKRYGKYRRSWKKKK-----L--ANGSFVVFNAVASLTHILENGHLSR---- 84 (115) Q Consensus 20 v~~~~~~~~~~~----a~~~~~~lk~~sP~~TG~y~k~W~~kk-----~--~~~~~vv~n~~~~ltHLLE~GHakr---- 84 (115) ..++|...++-. -.++.++|++.+-+ ...=||-... . .+++ ....++-+.|||--.+ T Consensus 1 ~~~~~~~~~k~~~~~~~~~~~~~l~~l~~~---~v~vGi~~~~~y~~~~~~~dG~-----~va~IA~~~EfG~~i~~p~~ 72 (200) T protein:vir:99 1 MKKGFSKSNSVAAPLKHFQMLKQFDALKGK---TVQAGWFETDRYPAKEGETIGP-----LVAKIARQLEFGGVINHPGG 72 (200) T ss_pred CCcCcceeeeeecchHHHHHHHHHHHhhCC---eEEEEEcCCCCcCCcccccccc-----hHHHHHhHHHcCCeeccCCC Confidence 122222111111 12222233332100 0111111000 0 0010 1235566778884322 Q ss_pred C---------Cc-----------------------ccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 85 N---------GG-----------------------RVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 85 ~---------GG-----------------------rV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) . |+ -.|++|||+|+.+....+..+.++..++ T Consensus 73 ~~~~~~~~~~g~~~g~rfv~k~~~~~~~~~~~~~v~IP~RPFlr~t~~~~~~~~~~~~~~~~~ 135 (200) T protein:vir:99 73 TKYIKDAIVDGRYVGTRFVHKSFQGEHEVTKAHQIVIPARPFMRLAWATFNKDKVKIQAQIAR 135 (200) T ss_pred ccccccccccccccccccccccccceeeeeccccccCCCcchhhHHHHHHHHHHHHHHHHHHH Confidence 1 11 3699999999999888888877776665 No 151 >protein:vir:98892 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:899 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164422;genbank:gi:56694912;genbank:GeneID:3197282 Probab=62.66 E-value=0.19 Score=24.60 Aligned_cols=99 Identities=12% Similarity=0.078 Sum_probs=46.1 Q ss_pred Cch-HHH-HHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeecCceEEEeeCCcceeehee Q lcl|NC_012756. 1 MSN-DLA-DLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLANGSFVVFNAVASLTHILE 78 (115) Q Consensus 1 ~~d-~La-~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~~~~~vv~n~~~~ltHLLE 78 (115) |+. .+. +.|.+.|. ...+..+...++.++......--|.+||.|++|=.... ++..|+|++-| |+-+= T Consensus 1 mmkvkv~~~~~~~~~~------~~~~~~aq~~~~~ev~~~~~~yVP~~~G~L~~s~~~~s--~~g~I~y~tPY--Ar~qY 70 (108) T protein:vir:98 1 MPKIRVELSGAKDKLS------PQTQRRGQYAMANQMLQDMNQFVPMEEGILRLTGNISS--DAEEIYYNTPY--AKRRF 70 (108) T ss_pred CceeEeeehHHHHHHH------HHHHHHHHHHHHHHHHHhhcccCcCcCCccccceeecc--CCceEEecChh--hHHhh Confidence 221 111 12222221 12344556667777777777778999999999832222 33456666543 11111 Q ss_pred cceee--cCCcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 79 NGHLS--RNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 79 ~GHak--r~GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) +|... .++|+ |.. =.|+++...-+.|.++++ T Consensus 71 Yg~~~n~~~p~a--g~~----W~eraka~~~~~~~~~~~ 103 (108) T protein:vir:98 71 YEPAYNYTTPGT--GPR----WDMKAKRLFISDWERAYM 103 (108) T ss_pred hccccCCCCCCC--cch----hHHHHHhhhhHHHHHHHH Confidence 22111 12221 111 133444444555555444 No 152 >protein:vir:4790 Length: 114 # NCBI annotation: putative minor capsid protein 3 # Family: family:all:899 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150170;swissprot:trembl:q94m41;genbank:gi:15088781;uniprot:Q94M41;genbank:GeneID:955992 Probab=59.95 E-value=0.11 Score=25.83 Aligned_cols=103 Identities=17% Similarity=0.095 Sum_probs=52.0 Q ss_pred CchHHHH---HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeecCceEEEeeCCcceeehe Q lcl|NC_012756. 1 MSNDLAD---LIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLANGSFVVFNAVASLTHIL 77 (115) Q Consensus 1 ~~d~La~---~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~~~~~vv~n~~~~ltHLL 77 (115) |+=.+-= .|.+.|. .+.++..-..++.++......=-|.+||.+++|=.. ..++..|+|++-| |+-+ T Consensus 1 M~~kVkv~l~~~~~~l~------~~~l~r~Q~~~~~ev~~~~~~YVP~~~G~L~~S~~~--~~~~~~I~y~tPY--Ar~q 70 (114) T protein:vir:47 1 MNIAIKVDLQKAKQKLS------NESMTRGKIAVASKILLDNEQYIPLRGGELRASGRI--VGQGDAVVYGTVY--ARAQ 70 (114) T ss_pred CceeEEeehhHHHHHHH------HHHHHHHHHHHHHHHHHhhccCCcCccCccccceee--eeCCcEEEecCch--hhHh Confidence 3321111 1222111 123344556667777777777779999999997322 3345567777655 3333 Q ss_pred ecceeecCCcccCcc--chh-hhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 78 ENGHLSRNGGRVAGI--VHI-KPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 78 E~GHakr~GGrV~~~--phI-~paee~~~~~~~~~i~~ii~ 115 (115) =+|+. ++|++... |.= ..=.|+++...-+.+.++++ T Consensus 71 yYg~~--~~~~~~~~~~p~~g~~W~eraka~~~~~~~~~~~ 109 (114) T protein:vir:47 71 FYGSN--GIVTFRRYTTPGTGKRWDQVATSKHAEEWARAFV 109 (114) T ss_pred hhccc--CCCCCCccCCCCCcchhHHHHHhhhhHHHHHHHH Confidence 34432 11211110 100 11245666777777777666 No 153 >protein:vir:1581 Length: 116 # NCBI annotation: minor capsid protein # Family: family:all:899 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695163;swissprot:trembl:o03933;genbank:gi:23455806;uniprot:O03933;genbank:GeneID:955512 Probab=56.30 E-value=0.23 Score=24.08 Aligned_cols=100 Identities=13% Similarity=0.116 Sum_probs=50.2 Q ss_pred CchHHHHHHHHHHHhhHHHH-HHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeecCceEEEeeCCcceeeheec Q lcl|NC_012756. 1 MSNDLADLIAKELAAYSDEV-TEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLANGSFVVFNAVASLTHILEN 79 (115) Q Consensus 1 ~~d~La~~I~~~L~~y~~~v-~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~~~~~vv~n~~~~ltHLLE~ 79 (115) |+=.+-=.+.. ...-+ .+.++.+-..++.++..+...=-|.+||....+=......++..|+|++-| |+-+=| T Consensus 1 M~ikVkv~l~~----~~~~~~~~~~~r~Q~~l~~qv~~~m~~YVP~~tg~~~ls~~~~~~~~~~~I~y~tPY--Ar~qyY 74 (116) T protein:vir:15 1 MAFRINVDLDG----FMDQTSLDNVKRGQYALVNQAMYDMEQFVPKDRPEEPLRQSVHATSDGSEITYSTPY--AKAQFY 74 (116) T ss_pred CCceEEeehhH----hhhhhhHHHHHHHHHHHHHHHHHhhhccCCcccCCcccccceeeecCCceEEecCch--hHHHhc Confidence 44322212222 22212 234455566677777777777789999987655444444455667777654 222223 Q ss_pred cee-----ecC------CcccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 80 GHL-----SRN------GGRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 80 GHa-----kr~------GGrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) |+. .++ |+| =.|+++......+++++. T Consensus 75 g~~~~~~~~~~~t~p~ag~~---------W~eraK~~h~~~w~~~~~ 112 (116) T protein:vir:15 75 GIINDKYPVHNYTTPGTTKR---------WDLKAKSMFMSSWIDTFT 112 (116) T ss_pred ccccCCCCcccccCCCCCcc---------hhHHHHhhhHHHHHHHHH Confidence 332 221 222 124555555555555554 No 154 >protein:vir:8432 Length: 149 # NCBI annotation: gp27 # Family: family:all:30885 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818328;genbank:gi:29566764;genbank:GeneID:1260117 Probab=50.93 E-value=0.6 Score=21.82 Aligned_cols=106 Identities=24% Similarity=0.355 Sum_probs=57.1 Q ss_pred CchHH--HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHh----CCccchhhhcchhceeec-----------Cc- Q lcl|NC_012756. 1 MSNDL--ADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKET----SPKRYGKYRRSWKKKKLA-----------NG- 62 (115) Q Consensus 1 ~~d~L--a~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~----sP~~TG~y~k~W~~kk~~-----------~~- 62 (115) +-|.. +..+.+.+++|-.+++..-.+.-+ ++ ...-|.+. .|-.||.|+.-.+.|+.. .| T Consensus 16 lldgvsssrdlrrivqrfindveqtwhdvwd-vs--mlgvlaqqtgvphpyqtgdykahikkkkltamqkirikkflkgg 92 (149) T protein:vir:84 16 LLDGVSSSRDLRRIVQRFINDVEQTWHDVWD-VS--MLGVLAQQTGVPHPYQTGDYKAHIKKKKLTAMQKIRIKKFLKGG 92 (149) T ss_pred hhhccccchHHHHHHHHHHHHHHHHHHhHhh-HH--HHHHHHhhcCCCCCccccchhhhhhhhhHHHHHHHHHHHHhhcC Confidence 22222 234556666677776665544433 11 12222222 367999999887666532 11 Q ss_pred --eEEEeeCCcceeeheecceee-cCCcccCccch-hhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 63 --SFVVFNAVASLTHILENGHLS-RNGGRVAGIVH-IKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 63 --~~vv~n~~~~ltHLLE~GHak-r~GGrV~~~ph-I~paee~~~~~~~~~i~~ii~ 115 (115) --.|||+..- +|.+|||... |-|.|.|=-|- -.||+| ..+++.+|.. T Consensus 93 mpiglvynndek-ahwieygtkrdrpgsrspwgpntptpafe-----imqrvarimn 143 (149) T protein:vir:84 93 MPIGLVYNNDEK-AHWIEYGTKRDRPGSRSPWGPNTPTPAFE-----IMQRVARIMN 143 (149) T ss_pred CceeEEecCCcc-hhhhhhccccCCCCCCCCCCCCCCChhHH-----HHHHHHHHhh Confidence 2245665443 7999999754 55766654332 246664 3344555544 No 155 >protein:vir:6071 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878212;genbank:gi:33438911;genbank:GeneID:1457746 Probab=48.29 E-value=0.67 Score=21.52 Aligned_cols=114 Identities=11% Similarity=0.070 Sum_probs=50.0 Q ss_pred Cc--hHHHHHHHHHHHhhHH-HHHHHHHHHHHHHHHHHHHHHHH-hCCcc------------------------chhhhc Q lcl|NC_012756. 1 MS--NDLADLIAKELAAYSD-EVTEEVDKIAEQVADETVDELKE-TSPKR------------------------YGKYRR 52 (115) Q Consensus 1 ~~--d~La~~I~~~L~~y~~-~v~~~~~~~~~~~a~~~~~~lk~-~sP~~------------------------TG~y~k 52 (115) |. ++|.+.+...|.+.+. .-..-+.++.+.+-..+.+.+.+ .+|-- +|.+.. T Consensus 1 ~~~~~~l~~~L~~~l~~L~~~~~~~l~r~Ig~~l~~~~~~Rf~~q~~PdG~~W~p~~~~~~~~k~~~~~~~l~~~~~l~~ 80 (150) T protein:vir:60 1 MNEFKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSARKKTGRVKRKMFAKLITSR 80 (150) T ss_pred CchHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccChHHHHHhhcCCCccchhhhhhcc Confidence 44 3566666666666532 22222333333333333343433 34521 011111 Q ss_pred chhceeecCceEEEe--eCCcceeeheecceeecCC-----cccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 53 SWKKKKLANGSFVVF--NAVASLTHILENGHLSRNG-----GRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 53 ~W~~kk~~~~~~vv~--n~~~~ltHLLE~GHakr~G-----GrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) +-+.....++..|-+ .+...+|..-.||-..+.. ...|++|||.=.++. ..++.+.|.+-+. T Consensus 81 sl~~~~~~~~a~vg~~~Gt~~~yAaiHQfG~~~~~~~~~~~~~iPaRp~LG~s~~d-~~~i~~~i~~~l~ 149 (150) T protein:vir:60 81 FLHIRASPEQASMEFYGGKSPKIASVHQFGLSEENRKDGKKIDYPARPLLGFTGED-VQMIEEIILAHLD 149 (150) T ss_pred eeeeeeeCcEEEEEeeCCCchhhhhhhhccccccccCCCCceecCCcccCCCCHHH-HHHHHHHHHHHHh Confidence 112222222222322 3333335566788765432 247999999876544 2333333333333 No 156 >protein:vir:95260 Length: 160 # NCBI annotation: Phage conserved protein # Family: family:all:31735 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944893;genbank:gi:38707833;genbank:GeneID:2744046 Probab=45.79 E-value=0.11 Score=25.85 Aligned_cols=74 Identities=12% Similarity=0.069 Sum_probs=35.9 Q ss_pred HHHHHHHHHHHHHHHHHHHhC--------CccchhhhcchhceeecCceEEEeeCCcceeeheecceeecCCcccCccch Q lcl|NC_012756. 23 EVDKIAEQVADETVDELKETS--------PKRYGKYRRSWKKKKLANGSFVVFNAVASLTHILENGHLSRNGGRVAGIVH 94 (115) Q Consensus 23 ~~~~~~~~~a~~~~~~lk~~s--------P~~TG~y~k~W~~kk~~~~~~vv~n~~~~ltHLLE~GHakr~GGrV~~~ph 94 (115) -|+.......+.+.+.+++.. |-+.|.|..|. ...++|-+-|||+. ..|++|| T Consensus 1 ~~~~~~~~G~~~L~~~~k~l~~~~V~VGi~~d~g~~~dG~--------------sv~~vA~~~EfG~~-----~iPaRPf 61 (160) T protein:vir:95 1 MVKRVIHPARAKLVGAMKNLQTANAQVGYFQEQGQHSSGF--------------SYPALMYLQEVIGV-----PSASGKV 61 (160) T ss_pred CceeechHhHHHHHHHHHHHhCCeeEEeeccccccCCCCc--------------cHHHHHhhhhcCcc-----cCCCcch Confidence 122222222333333333322 22223332111 12356677899964 5999999 Q ss_pred hhhHHHH--------HHHHHHHHHHHHhC Q lcl|NC_012756. 95 IKPAEEK--------AIQNFEKRIKEIGK 115 (115) Q Consensus 95 I~paee~--------~~~~~~~~i~~ii~ 115 (115) ||+..|. ..+++..++.+.+. T Consensus 62 ~R~tfe~~~~~~~~~~~~~~~~~i~~~~~ 90 (160) T protein:vir:95 62 YRRLFEITMMLNKQTLLEQTKKNLYKQLS 90 (160) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 9998752 33444444544444 No 157 >protein:vir:96763 Length: 177 # NCBI annotation: putative phage-related protein # Family: family:all:1091 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039824;genbank:gi:126010915;genbank:GeneID:5076273 Probab=45.38 E-value=0.77 Score=21.20 Aligned_cols=115 Identities=12% Similarity=0.085 Sum_probs=57.4 Q ss_pred Cc---hHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCcc----chhhhcchhceeecC-ceEEEeeC--C Q lcl|NC_012756. 1 MS---NDLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKR----YGKYRRSWKKKKLAN-GSFVVFNA--V 70 (115) Q Consensus 1 ~~---d~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~----TG~y~k~W~~kk~~~-~~~vv~n~--~ 70 (115) |. |...+.|.+.|....+.+...+..++..++..++......-.+. ....++..+.++... ....++.. . T Consensus 7 l~idv~~~l~~i~~~l~~~~~~~~~A~~rAlNrta~~~rt~~~r~v~~~~~i~~k~ir~r~~~~~a~~~~~~~i~~~~~~ 86 (177) T protein:vir:96 7 MKIDVSREAEDIAAMVAATTKQLELAAQRAMTKAGQWLRTHSVRELGQQLGIKQEPLKKRFRVYPQRQKGEVRFWVGLDP 86 (177) T ss_pred eEEehhHHHHHHHHHHhhcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHhhheeeccCCCcEEEEEEeccc Confidence 22 44567888889999999999999999999888766655544332 345666555544332 12222221 1 Q ss_pred cceeeh----------------e--------eccee---ecCC-cc-------cCccchhhhHHHHH--------HHHHH Q lcl|NC_012756. 71 ASLTHI----------------L--------ENGHL---SRNG-GR-------VAGIVHIKPAEEKA--------IQNFE 107 (115) Q Consensus 71 ~~ltHL----------------L--------E~GHa---kr~G-Gr-------V~~~phI~paee~~--------~~~~~ 107 (115) -++-+| . -+||. .|-| +| +|-.+-+..+.+.. .+.|+ T Consensus 87 i~l~~~~~~r~t~~Gv~~g~~~~~gaFia~~~~g~~~Vf~R~gk~R~PI~~~~~pi~~~~~~~~e~~~~~~~~~~~~~l~ 166 (177) T protein:vir:96 87 IGVYRLGTPKVTQKGVKVNRNEYDGAFISPMKSNYPLVFKRRGKERLPIDLVDEDIDEPAMEVVERWERRVFQRFKELFE 166 (177) T ss_pred eehhhcccCCCCccceEEeeEEcCCceeccCCCCCceEEEEecCCccceEEEEcCchHHHHHHHHHHHHHHHHHHHHHHH Confidence 122222 0 11211 0111 11 22223233333322 23455 Q ss_pred HHHHHHhC Q lcl|NC_012756. 108 KRIKEIGK 115 (115) Q Consensus 108 ~~i~~ii~ 115 (115) .+|..+++ T Consensus 167 ~Ei~~~L~ 174 (177) T protein:vir:96 167 QEARAIIN 174 (177) T ss_pred HHHHHHhc Confidence 55555555 No 158 >protein:vir:96105 Length: 193 # NCBI annotation: hypothetical protein ORF028 # Family: family:all:503 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294445;genbank:gi:149408342;genbank:GeneID:5237224 Probab=45.26 E-value=0.57 Score=21.91 Aligned_cols=59 Identities=10% Similarity=-0.044 Sum_probs=31.6 Q ss_pred CchHHHHHHHHHHHhhHHHHHH---HHHHHHHHHHHHHHHHHHHh---------CC------------ccchhhhcchhc Q lcl|NC_012756. 1 MSNDLADLIAKELAAYSDEVTE---EVDKIAEQVADETVDELKET---------SP------------KRYGKYRRSWKK 56 (115) Q Consensus 1 ~~d~La~~I~~~L~~y~~~v~~---~~~~~~~~~a~~~~~~lk~~---------sP------------~~TG~y~k~W~~ 56 (115) ..++-.+.+.+.++.+...+-. ..+++.+.++..++.+++.. || .+||.|.+|-+. T Consensus 111 t~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~~G~~~~~~ik~~I~~~~~ppna~~Ti~~KG~~~PLidTG~l~~SIty 190 (193) T protein:vir:96 111 AWNLFSADRAAIQNRIAMRLARGQITPDQALAQIGLALEGYIARSIRTGPWVANSASTVRRKGFNRPLVDTAHMLQSISS 190 (193) T ss_pred hHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCCcHHHHHHhCCCCchhHHHHHHhhhcc Confidence 3344455566666555443322 25566666666666666552 11 356777776655 Q ss_pred eee Q lcl|NC_012756. 57 KKL 59 (115) Q Consensus 57 kk~ 59 (115) +.+ T Consensus 191 ~Vv 193 (193) T protein:vir:96 191 RVT 193 (193) T ss_pred eeC Confidence 554 No 159 >protein:vir:4200 Length: 133 # NCBI annotation: unknown # Family: family:all:11764 # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071825;genbank:gi:11863108;genbank:GeneID:1257610 Probab=44.69 E-value=0.47 Score=22.40 Aligned_cols=106 Identities=19% Similarity=0.172 Sum_probs=53.1 Q ss_pred Cc---hHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeecCceEEEeeCCc------ Q lcl|NC_012756. 1 MS---NDLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLANGSFVVFNAVA------ 71 (115) Q Consensus 1 ~~---d~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~~~~~vv~n~~~------ 71 (115) |- -|--++ |-+-+.++++.+++.+..+-.++..-+-.++|++||.++.|-.-...|.+.. ..|..| T Consensus 1 mi~i~idkp~a----lmek~~ev~~~ie~t~~~~~~~l~~i~~ntapiktg~lr~sh~~siegstge-lsn~~~yl~~vl 75 (133) T protein:vir:42 1 MIEIRIDKPDA----LMEKPHEVQGKIEETLEKILNQLQGIAENTAPVKTGNLRDSHIISIEGSTGE-LSNLAYYLPFVL 75 (133) T ss_pred CeeeecCCchh----hhcchhhhhhHHHHHHHHHHHHHHHHhhhccccccccceeeeeEEeecCccc-hhhhhHHhhHhh Confidence 21 111223 3345778888888888888888888888899999999999955544433211 112222 Q ss_pred -----------------ceeeheeccee-ecCCcccCccchhhhHHHHHHHHHHHHHHH Q lcl|NC_012756. 72 -----------------SLTHILENGHL-SRNGGRVAGIVHIKPAEEKAIQNFEKRIKE 112 (115) Q Consensus 72 -----------------~ltHLLE~GHa-kr~GGrV~~~phI~paee~~~~~~~~~i~~ 112 (115) -|.|-+-|.-- -.|-=+..+..+|.| +--..+.|.+-+++ T Consensus 76 ~grgwvfpv~~kal~wpelphpvayarpappndyfsa~vay~~~-~give~s~iewlre 133 (133) T protein:vir:42 76 HGRGWVFPVRRKALWWPELPHPVAYARPAPPNDYFSAVVAYSAP-EGVVEETLIEWLRE 133 (133) T ss_pred hcccceeeccccccccCCCCCcccccCCCCCchhhhhhhhhhcc-cchhHHHHHHHHhC Confidence 23444333211 111111112222222 01122334444444 No 160 >protein:vir:79179 Length: 155 # NCBI annotation: gp39, phage virion morphogenesis protein # Family: family:all:370 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111070;genbank:gi:134288746;genbank:GeneID:4960698 Probab=39.71 E-value=1 Score=20.57 Aligned_cols=110 Identities=12% Similarity=0.225 Sum_probs=44.4 Q ss_pred CchHHHHHHHHHHHhhHHHHH-HHHHHHHHHHHHHHHHHHHH-----hCCcc------------------chhhhcc--- Q lcl|NC_012756. 1 MSNDLADLIAKELAAYSDEVT-EEVDKIAEQVADETVDELKE-----TSPKR------------------YGKYRRS--- 53 (115) Q Consensus 1 ~~d~La~~I~~~L~~y~~~v~-~~~~~~~~~~a~~~~~~lk~-----~sP~~------------------TG~y~k~--- 53 (115) |+|+|. ++.+.|......++ ....+..+.+|..++....+ .+|-- +|...++ T Consensus 1 m~~~~~-~l~~~l~~ll~~l~~~~~~~l~r~Ig~~l~~~t~~Rf~~q~~PDG~~W~prk~~~~~~~~~~~~g~~~~~~m~ 79 (155) T protein:vir:79 1 MTDDLQ-ALERWAGGLLAKLSPAARRQLLRELGRDLRRAQQSRVAAQRNPDGSAYEPRKVKAGGKRLREKAGRVKREAMF 79 (155) T ss_pred CchHHH-HHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchhhhhhhhhcccCcccchhhh Confidence 999854 44444444443332 12334455555555444433 34521 1111000 Q ss_pred --------hhceeecCceEEE-e-eC--CcceeeheecceeecC--Cc---ccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 54 --------WKKKKLANGSFVV-F-NA--VASLTHILENGHLSRN--GG---RVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 54 --------W~~kk~~~~~~vv-~-n~--~~~ltHLLE~GHakr~--GG---rV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) -+.....++ .+| + .+ .|- ..-.||=..+- ++ ..|++|||.-..+. ..++.+.|..-+. T Consensus 80 ~~l~~a~~l~~~~~~d~-a~Vg~~Gs~~~yA--aiHQfG~~~r~~~~~~~v~iPaRp~LGls~~d-~~~I~~~i~~~l~ 154 (155) T protein:vir:79 80 RKLRTARYLRIDVDSTG-LAIGFDERLSRIA--RVHQEGQKAPVEPGGPLAQYPVRVVLGFSDAD-RELVRDRLLRELT 154 (155) T ss_pred hhhhhhheeeeeecCcE-EEEEecCcchhhh--hhhhcCCcccCCCCCcccccccccccCCCHHH-HHHHHHHHHHHhh Confidence 011111111 222 1 22 332 33356643321 22 47999999766543 2222222222222 No 161 >protein:vir:1164 Length: 156 # NCBI annotation: predicted tail completion # Family: family:all:370 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490613;genbank:gi:17313233;genbank:GeneID:927308 Probab=39.05 E-value=1 Score=20.50 Aligned_cols=113 Identities=15% Similarity=0.189 Sum_probs=49.7 Q ss_pred CchHHHHHHHHHHHhhHHHHHH-HHHHHHHHHHHHHHHHHHH-----hCCccc--------------hh----------h Q lcl|NC_012756. 1 MSNDLADLIAKELAAYSDEVTE-EVDKIAEQVADETVDELKE-----TSPKRY--------------GK----------Y 50 (115) Q Consensus 1 ~~d~La~~I~~~L~~y~~~v~~-~~~~~~~~~a~~~~~~lk~-----~sP~~T--------------G~----------y 50 (115) |+|+|. +|...|......++. ...+...++|..++...+. .+|.-+ |. + T Consensus 1 m~~~~~-~l~~~L~~ll~~L~~~~~~~l~r~Ig~~l~~~t~~Rf~~q~~PdG~~W~p~~~~~~~~~~~~~~~~~~m~~~l 79 (156) T protein:vir:11 1 MADSLE-ALEDWAGPILRALEPGPRAALARSLARDLRRSQQKRVMAQRNPDGSAYEPRKKRELRGKQGRIRRKIKMFQKL 79 (156) T ss_pred CchhHH-HHHHHHHHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHhhhccccccchhhhhhh Confidence 999854 455555554443321 2334455555555444443 344211 00 0 Q ss_pred --hcchhceeecCceEEEe-eCCcceeeheecceeecC--Cc---ccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 51 --RRSWKKKKLANGSFVVF-NAVASLTHILENGHLSRN--GG---RVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 51 --~k~W~~kk~~~~~~vv~-n~~~~ltHLLE~GHakr~--GG---rV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) ..+++.....++..|-+ .+.-.+|+.-.||-..+- +| ..|++|||.=..+. .+++.+.|.+-++ T Consensus 80 ~~~~~l~~~~~~~~a~vg~~Gs~~~yA~iHQfG~~~~~~~~~~~v~iPaRp~LG~s~~d-~~~i~~~i~~~l~ 151 (156) T protein:vir:11 80 RTVRYLRAKGDAQAITVSFAGRIARIARVHQYGLRDRAEPGAPEVSYAQRLLLGFDSSD-METIQNGILAHID 151 (156) T ss_pred hhhheeeeeecCcEEEEEecCCchhhhhhhcccccccccCCCCcccccccccCCCCHHH-HHHHHHHHHHHHh Confidence 01122222222222211 222222444457765322 22 37999999766533 2333333433333 No 162 >protein:vir:966 Length: 123 # NCBI annotation: Orf48 # Family: family:all:970 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076620;genbank:gi:13095728;genbank:GeneID:920248 Probab=37.76 E-value=1.1 Score=20.35 Aligned_cols=106 Identities=12% Similarity=0.101 Sum_probs=69.8 Q ss_pred CchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCC--------ccchhhhcchhceeecCceEEEeeCCcc Q lcl|NC_012756. 1 MSNDLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSP--------KRYGKYRRSWKKKKLANGSFVVFNAVAS 72 (115) Q Consensus 1 ~~d~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP--------~~TG~y~k~W~~kk~~~~~~vv~n~~~~ 72 (115) |+|.++++|....++-.+.+.+.++++.+++.+++...-...|- +.+|.=..-|-.. .+.+.+.|==.++ T Consensus 10 L~~~i~~~L~~y~~~v~~~v~~~v~~~a~~~~~~lk~~sP~~TG~yaksW~~k~~~~~~~~v~~~--~~~y~l~HLLE~G 87 (123) T protein:vir:96 10 LAKTIESEVRNWTKDVVDDIDDIKKDITKNGVKQLRESSPKRTGDYAKNWTSQKLKNGDQVIYQK--APTYRLTHLLENG 87 (123) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCccccccccceeeeecCCeeEEEEEe--cCCcceEEeeecc Confidence 99999999999999988899999999999998888877544332 2232111111111 1223333333343 Q ss_pred eeeheecceeecCCcccCccchhhhHHHHHHHHHHHHHHH Q lcl|NC_012756. 73 LTHILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKE 112 (115) Q Consensus 73 ltHLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ 112 (115) |..=+| .|==||.=-.|-..++.+...+..++.|++ T Consensus 88 --Ha~r~G--GrV~a~phI~paee~~~~~l~~~i~r~l~~ 123 (123) T protein:vir:96 88 --HAKRNG--GRVSPKVHIAPVEEELVSNYISRVEKRLSQ 123 (123) T ss_pred --eeecCC--ceeCcchhhhHHHHHHHHHHHHHHHHHhcC Confidence 223344 344567777888899999999999999999 No 163 >protein:vir:5703 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839862;genbank:gi:30065717;genbank:GeneID:1260611 Probab=35.84 E-value=1.2 Score=20.14 Aligned_cols=114 Identities=11% Similarity=0.087 Sum_probs=46.0 Q ss_pred Cc--hHHHHHHHHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHH-hCCcc------------------------chhhhc Q lcl|NC_012756. 1 MS--NDLADLIAKELAAYS-DEVTEEVDKIAEQVADETVDELKE-TSPKR------------------------YGKYRR 52 (115) Q Consensus 1 ~~--d~La~~I~~~L~~y~-~~v~~~~~~~~~~~a~~~~~~lk~-~sP~~------------------------TG~y~k 52 (115) |. ++|.+.+...|.+.+ ..-..-+..+.+.+-..+.+.+++ .+|-- +|.+.+ T Consensus 1 m~~~~~l~~~L~~~l~~L~~~~~~~l~~~Ig~~l~~~~~~rf~~q~~PdG~~W~p~k~~~~~~k~~~~~~~l~~~~~l~~ 80 (150) T protein:vir:57 1 MNEFKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSARKKTGRVKRKMFAKLITSR 80 (150) T ss_pred CchHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccChHHHHHhccCCCcccchhhhhcc Confidence 43 455566666555543 122222333333333444444433 34520 011111 Q ss_pred chhceeecCceEEEe--eCCcceeeheecceeecC---C--cccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 53 SWKKKKLANGSFVVF--NAVASLTHILENGHLSRN---G--GRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 53 ~W~~kk~~~~~~vv~--n~~~~ltHLLE~GHakr~---G--GrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) +-+.....+...|-+ .+...+|..-.||=..+- + ...|++|||.-.++. ..++.+.|.+-+. T Consensus 81 sl~~~~~~~~a~vg~~~G~~~~yAaiHQfG~~~r~~~~~~~~~iPaRp~LG~s~~d-~~~i~~~i~~~l~ 149 (150) T protein:vir:57 81 FLHIRASPEQASMEFYGGKSPKIASVHQFGLSEETRKDGKKIDYPARPLLGFTGED-VQMIEEIILAHLD 149 (150) T ss_pred ceeeeeeCcEEEEEeecCCchhhhhhhhccccccccCCCceeecCCcccCCCCHHH-HHHHHHHHHHHHh Confidence 112222222222211 222222444467744421 1 236999999876544 2333333333333 No 164 >protein:vir:2026 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046769;genbank:gi:9630340;genbank:GeneID:1261511 Probab=31.89 E-value=1.5 Score=19.68 Aligned_cols=111 Identities=12% Similarity=0.090 Sum_probs=47.4 Q ss_pred Cch--HHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHH-----hCCc------------------------cchh Q lcl|NC_012756. 1 MSN--DLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKE-----TSPK------------------------RYGK 49 (115) Q Consensus 1 ~~d--~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~-----~sP~------------------------~TG~ 49 (115) |.| +|.+.+...|...+. ........++|..++...+. .+|- .+|. T Consensus 1 ~~~~~~l~~~L~~ll~~l~~---~~~~~l~~~Ig~~l~~~~~~rf~~q~~PdG~~W~p~k~~~~~~k~g~~~~~l~~~~~ 77 (150) T protein:vir:20 1 MNEFKRFEDRLTGLIESLSP---SGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSVRKKTGRVKRKMFAKLI 77 (150) T ss_pred CchHHHHHHHHHHHHHhcCC---hhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHHHhccCCCccccchhh Confidence 543 444455554444321 12333445555555444443 2441 1123 Q ss_pred hhcchhceeecCceEEEe--eCCcceeeheecceeecC-C----cccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 50 YRRSWKKKKLANGSFVVF--NAVASLTHILENGHLSRN-G----GRVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 50 y~k~W~~kk~~~~~~vv~--n~~~~ltHLLE~GHakr~-G----GrV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) ++.+.+.....++..|-+ .+...+|..-.||=..+. + ...|++|||.=.++. ..++.+.|.+-++ T Consensus 78 l~~sl~~~~~~~~~~vg~~~Gs~~~yAa~HQfG~~~~~~~~~~~~~iPaRp~LG~s~~d-~~~i~~~i~~~l~ 149 (150) T protein:vir:20 78 TSRFLHIRASPEQASMEFYGGKSPKIASVHQFGLSEENRKDGKKIDYPARPLLGFTGED-VQMIEEIILAHLE 149 (150) T ss_pred hhhhhheeecCcEEEEEeeCCcchhhhhhhhcccccccccCCCceeccccccCCCCHHH-HHHHHHHHHHHHh Confidence 344444444333333322 222122333356643221 1 357999999866543 2333333333333 No 165 >protein:vir:4162 Length: 133 # NCBI annotation: unknown # Family: family:all:11764 # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046971;genbank:gi:9630541;genbank:GeneID:1261715 Probab=26.69 E-value=1.5 Score=19.56 Aligned_cols=103 Identities=22% Similarity=0.213 Sum_probs=50.1 Q ss_pred Cc---hHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcchhceeecCceEEEeeCCc------ Q lcl|NC_012756. 1 MS---NDLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKETSPKRYGKYRRSWKKKKLANGSFVVFNAVA------ 71 (115) Q Consensus 1 ~~---d~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~~sP~~TG~y~k~W~~kk~~~~~~vv~n~~~------ 71 (115) |- -|--++ |-+-+.++++.+++.+..+-.++..-+-.++|.+||.++.|-.-...|.+.. ..|..| T Consensus 1 mi~i~idkp~a----lmek~~ev~~~ie~t~~~~~~~l~~i~~ntapiktg~lr~sh~~siegstge-lsn~~~yl~~vl 75 (133) T protein:vir:41 1 MIRINIDKPEA----LMEKASEVEDRVEQTVTLLMIELEEILMNTAPIKTGELRISHTWSVEGSTGE-LTNTVPYLQWVL 75 (133) T ss_pred CeeeecCCchh----hhcchhhhhhHHHHHHHHHHHHHHHHhhhccccccccceeeeeEEeecCccc-hhhhhHHhhHhh Confidence 21 111223 3345778888888888888888888888899999999999955544433211 112222 Q ss_pred -----------------ceeeheeccee-ecCCcccCccchhhh---HHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 72 -----------------SLTHILENGHL-SRNGGRVAGIVHIKP---AEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 72 -----------------~ltHLLE~GHa-kr~GGrV~~~phI~p---aee~~~~~~~~~i~~ii~ 115 (115) -|.|-+-|.-- -.|-=+..+..++.| +||. | |+=+|- T Consensus 76 ~grgwvfpv~~kal~wpelphpvayarpappndyfsa~vay~~~~give~s----~---iewlis 133 (133) T protein:vir:41 76 FGRGWVFPVEKKALYWPELPHPVAYARPAPPNDYFSAAVAYIDAKGIVEDS----F---IEWLIS 133 (133) T ss_pred hcccceeeecccccccCCCCCcccccCCCCCchhhhhhhhhhcccchhHHH----H---HHHhcC Confidence 23333333211 011111111111211 1111 1 122222 No 166 >protein:vir:1838 Length: 149 # NCBI annotation: O protein # Family: family:all:370 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052262;genbank:gi:9634069;genbank:GeneID:1262457 Probab=24.40 E-value=2.2 Score=18.73 Aligned_cols=111 Identities=20% Similarity=0.212 Sum_probs=43.5 Q ss_pred Cch--HHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHH-----hCCccc--------------h----------h Q lcl|NC_012756. 1 MSN--DLADLIAKELAAYSDEVTEEVDKIAEQVADETVDELKE-----TSPKRY--------------G----------K 49 (115) Q Consensus 1 ~~d--~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~~lk~-----~sP~~T--------------G----------~ 49 (115) |.| ++.+.+...|...+. ..-..+..++|..++...+. .+|.-+ | . T Consensus 1 m~~~~~~~~~l~~ll~~L~~---~~~~~l~r~Ig~~l~~~t~~rf~~q~~PdG~~W~p~~~~~~~~~~g~~~~~~~~~l~ 77 (149) T protein:vir:18 1 MSELTALQERLAGLIASLSP---AARRKMAAEIAKKLRTSQQQRIKRQQAPDGTPYAARKRQPVRSKKGRIKREMFAKLR 77 (149) T ss_pred CchHHHHHHHHHHHHHhcCC---chHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchhhhhhccCcccchhhhhhh Confidence 664 222333333333211 11122344444444443333 355211 1 1 Q ss_pred hhcchhceeecCceEEEe-eCCcceeeheecceeecC--Cc---ccCccchhhhHHHHHHHHHHHHHHHHhC Q lcl|NC_012756. 50 YRRSWKKKKLANGSFVVF-NAVASLTHILENGHLSRN--GG---RVAGIVHIKPAEEKAIQNFEKRIKEIGK 115 (115) Q Consensus 50 y~k~W~~kk~~~~~~vv~-n~~~~ltHLLE~GHakr~--GG---rV~~~phI~paee~~~~~~~~~i~~ii~ 115 (115) .+++-+.....++..|.+ .+.-.+|..-.||-..+- ++ ..|++|||.=.++. ..++.+.|.+-+. T Consensus 78 ~~~~l~~~~~~~~~~v~~~Gtn~~yAaiHQfG~~~r~~~~~~~v~iPaRp~LG~s~~d-~~~I~~~i~~~l~ 148 (149) T protein:vir:18 78 TSRFMKAKGSDSAAVVEFTGKVQRMARVHQYGLKDRPNRNSRDVQYEARPLLGFTRDD-EQMIEDVIISHLG 148 (149) T ss_pred hhhhhheeecCceeEEEecccchhhhhhhhccccccccCCCccccccccccCCCCHHH-HHHHHHHHHHHHh Confidence 111222222223333322 222122444467765432 12 47999999876543 2333333333333 No 167 >protein:vir:95372 Length: 124 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764480;genbank:gi:115334634;genbank:GeneID:5179259 Probab=23.02 E-value=2.4 Score=18.53 Aligned_cols=104 Identities=15% Similarity=0.136 Sum_probs=55.7 Q ss_pred CchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHH----HHHHh--------CCccchhhhcchhceeecCceEEEee Q lcl|NC_012756. 1 MSNDLADLIAKELAAYSDEVTEEVDKIAEQVADETVD----ELKET--------SPKRYGKYRRSWKKKKLANGSFVVFN 68 (115) Q Consensus 1 ~~d~La~~I~~~L~~y~~~v~~~~~~~~~~~a~~~~~----~lk~~--------sP~~TG~y~k~W~~kk~~~~~~vv~n 68 (115) |+|+++.+|..--++-++++.+.++++.+++++++++ ..... +-+.|+ .+|.+-.- +-+...|= T Consensus 9 La~~I~~~L~~Ys~~v~~~v~~~v~~vak~a~~~lkk~i~~tspkrTG~YaK~W~~kk~~---e~~~V~nk-~~yqLtHL 84 (124) T protein:vir:95 9 LADEITSQLRKYSQVIADDVEQIMDDVTKEAVGRLKSKIQEVGLVQTGDYMRGWTRKRVP---NGWVIHNK-TEYRLAHL 84 (124) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHhcCcccccchhccceeeeec---CceeEEEc-CCCceeee Confidence 8888888888888888888888887777777766663 33222 223333 23322111 11222222 Q ss_pred CCcceeeheecceeecCCcccCccchhhhHHHHHHHHHHHHHHH Q lcl|NC_012756. 69 AVASLTHILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKE 112 (115) Q Consensus 69 ~~~~ltHLLE~GHakr~GGrV~~~phI~paee~~~~~~~~~i~~ 112 (115) -.++ |..-+| .|--|+.-=.|-=..+.+...+..++.|+. T Consensus 85 LE~G--HAkr~G--GRV~a~pHI~paee~~~~~l~~~i~~~l~~ 124 (124) T protein:vir:95 85 LEYG--HATVDG--GRVPGTPHIRPIEDWLEKEFEDRVEKAIKQ 124 (124) T ss_pred eecc--eeccCC--cccCCccchhHHHHHHHHHHHHHHHHHhcC Confidence 2222 223333 122333333444455555556666666666 Done!