Query lcl|NC_019539.1_cdsid_YP_007010470.1 [gene=SEP1_gp17] [protein=putative structural protein] [protein_id=YP_007010470.1] [location=10751..11146] Match_columns 131 No_of_seqs 104 out of 137 Neff 6.4 Searched_HMMs 1612 Date Thu Nov 7 17:34:15 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_17 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_17_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:78380 Length: 131 100.0 1.7E-53 1.1E-56 309.8 14.2 131 1-131 1-131 (131) 2 protein:vir:94994 Length: 131 100.0 2.8E-53 1.7E-56 308.7 14.1 131 1-131 1-131 (131) 3 protein:vir:104347 Length: 145 100.0 1.1E-52 6.7E-56 305.5 13.4 131 1-131 9-142 (145) 4 protein:vir:103280 Length: 142 100.0 6.5E-52 4.1E-55 301.2 13.0 131 1-131 6-139 (142) 5 protein:vir:79638 Length: 146 100.0 8.2E-52 5.1E-55 300.7 13.0 131 1-131 7-141 (146) 6 protein:vir:107703 Length: 147 100.0 3.2E-51 2E-54 297.4 13.0 131 1-131 7-141 (147) 7 protein:vir:80425 Length: 134 100.0 4.4E-50 2.7E-53 291.2 13.4 130 1-131 1-133 (134) 8 protein:vir:97190 Length: 148 100.0 1.9E-48 1.2E-51 282.3 12.9 131 1-131 5-146 (148) 9 protein:vir:96774 Length: 152 100.0 2.2E-48 1.3E-51 281.9 13.1 128 1-130 11-152 (152) 10 protein:vir:95157 Length: 144 100.0 4.5E-48 2.8E-51 280.2 12.9 129 1-131 6-143 (144) 11 protein:vir:94944 Length: 121 100.0 3.9E-46 2.4E-49 269.5 11.2 118 1-120 4-121 (121) 12 protein:vir:79034 Length: 141 99.8 2.5E-22 1.6E-25 139.0 8.3 112 1-131 1-135 (141) 13 protein:vir:105467 Length: 144 99.7 8E-20 5E-23 125.3 7.7 106 1-131 1-140 (144) 14 protein:vir:102963 Length: 163 99.6 2.1E-18 1.3E-21 117.5 8.5 105 1-131 1-150 (163) 15 protein:vir:3617 Length: 112 # 99.6 7.8E-18 4.8E-21 114.4 9.8 103 1-131 1-112 (112) 16 protein:vir:9930 Length: 108 # 99.6 1.2E-17 7.2E-21 113.4 9.6 102 1-131 4-107 (108) 17 protein:vir:95789 Length: 114 99.5 7E-17 4.3E-20 109.2 9.1 101 1-131 1-110 (114) 18 protein:vir:102338 Length: 116 99.5 2.5E-17 1.5E-20 111.6 6.6 89 14-131 1-111 (116) 19 protein:vir:94654 Length: 142 99.5 2E-16 1.2E-19 106.7 9.9 105 1-131 4-142 (142) 20 protein:vir:743 Length: 108 # 99.5 2.2E-16 1.4E-19 106.4 10.0 102 1-131 1-108 (108) 21 protein:vir:98409 Length: 108 99.5 2.8E-16 1.7E-19 105.9 9.9 102 1-131 1-108 (108) 22 protein:vir:5978 Length: 144 # 99.4 1.1E-15 7.1E-19 102.5 9.8 103 1-131 4-143 (144) 23 protein:vir:95894 Length: 137 99.4 8.3E-16 5.2E-19 103.3 8.8 100 1-128 1-137 (137) 24 protein:vir:96121 Length: 137 99.4 1.1E-15 6.6E-19 102.7 8.7 100 1-128 1-137 (137) 25 protein:vir:94490 Length: 137 99.4 1.4E-15 8.6E-19 102.0 8.8 100 1-128 1-137 (137) 26 protein:vir:97427 Length: 137 99.4 1.4E-15 8.6E-19 102.0 8.8 100 1-128 1-137 (137) 27 protein:vir:93738 Length: 137 99.4 1.4E-15 8.6E-19 102.0 8.8 100 1-128 1-137 (137) 28 protein:vir:94538 Length: 125 99.4 1.3E-15 8E-19 102.2 8.0 106 1-131 5-119 (125) 29 protein:vir:94108 Length: 149 99.4 3.2E-15 2E-18 100.0 8.8 100 1-128 13-149 (149) 30 protein:vir:94796 Length: 137 99.4 3.3E-15 2E-18 100.0 8.8 100 1-128 1-137 (137) 31 protein:vir:96829 Length: 135 99.3 4.5E-15 2.8E-18 99.2 8.8 100 1-128 1-135 (135) 32 protein:vir:105916 Length: 149 99.3 5.7E-15 3.6E-18 98.7 8.8 100 1-128 13-149 (149) 33 protein:vir:107099 Length: 137 99.3 1E-14 6.4E-18 97.3 8.6 100 1-128 1-137 (137) 34 protein:vir:96486 Length: 112 99.3 1.1E-14 6.7E-18 97.2 8.5 100 1-131 1-112 (112) 35 protein:vir:105330 Length: 137 99.3 1.6E-14 9.9E-18 96.2 8.8 100 1-128 1-137 (137) 36 protein:vir:4906 Length: 114 # 99.3 2.4E-14 1.5E-17 95.2 9.0 100 1-131 4-112 (114) 37 protein:vir:2740 Length: 114 # 99.3 2.4E-14 1.5E-17 95.2 9.0 100 1-131 4-112 (114) 38 protein:vir:97144 Length: 115 99.3 4.3E-14 2.6E-17 93.9 9.8 102 1-131 1-114 (115) 39 protein:vir:9312 Length: 115 # 99.3 4.3E-14 2.6E-17 93.9 9.8 102 1-131 1-114 (115) 40 protein:vir:78858 Length: 115 99.3 4.3E-14 2.6E-17 93.9 9.8 102 1-131 1-114 (115) 41 protein:vir:103917 Length: 115 99.3 4.3E-14 2.6E-17 93.9 9.8 102 1-131 1-114 (115) 42 protein:vir:96225 Length: 115 99.3 4.3E-14 2.6E-17 93.9 9.8 102 1-131 1-114 (115) 43 protein:vir:96358 Length: 115 99.3 4.3E-14 2.6E-17 93.9 9.8 102 1-131 1-114 (115) 44 protein:vir:78077 Length: 141 99.2 5.2E-14 3.2E-17 93.4 9.5 103 1-131 4-141 (141) 45 protein:vir:106623 Length: 115 99.2 6E-14 3.7E-17 93.1 9.5 102 1-131 1-114 (115) 46 protein:vir:99744 Length: 115 99.2 5.7E-14 3.6E-17 93.2 9.1 102 1-131 1-114 (115) 47 protein:vir:1243 Length: 116 # 99.2 4.3E-14 2.7E-17 93.9 7.6 87 14-128 1-116 (116) 48 protein:vir:97327 Length: 116 99.2 4.3E-14 2.7E-17 93.9 7.6 87 14-128 1-116 (116) 49 protein:vir:95062 Length: 116 99.1 1.3E-13 7.9E-17 91.3 7.4 87 14-128 1-116 (116) 50 protein:vir:81147 Length: 126 99.1 6.5E-13 4E-16 87.4 8.9 105 1-131 1-122 (126) 51 protein:vir:101594 Length: 173 99.0 3.6E-12 2.3E-15 83.3 9.0 104 1-130 1-173 (173) 52 protein:vir:106041 Length: 137 99.0 2.9E-12 1.8E-15 83.8 7.6 102 1-131 1-135 (137) 53 protein:vir:99101 Length: 142 99.0 4.4E-12 2.7E-15 82.9 8.4 104 1-128 4-142 (142) 54 protein:vir:8669 Length: 142 # 99.0 4.4E-12 2.7E-15 82.9 8.4 104 1-128 4-142 (142) 55 protein:vir:97982 Length: 140 98.9 8.7E-12 5.4E-15 81.2 8.3 102 1-131 8-138 (140) 56 protein:vir:107545 Length: 140 98.9 8.7E-12 5.4E-15 81.2 8.3 102 1-131 8-138 (140) 57 protein:vir:966 Length: 123 # 98.9 1.2E-11 7.2E-15 80.5 8.2 103 1-131 1-121 (123) 58 protein:vir:100075 Length: 140 98.8 2.7E-11 1.7E-14 78.5 8.5 118 1-131 1-129 (140) 59 protein:vir:1437 Length: 140 # 98.8 4.2E-11 2.6E-14 77.5 9.3 118 1-131 1-129 (140) 60 protein:vir:106570 Length: 182 98.8 3.5E-11 2.2E-14 77.9 8.8 105 1-131 2-177 (182) 61 protein:vir:80362 Length: 140 98.8 1.9E-11 1.2E-14 79.4 7.3 118 1-131 1-129 (140) 62 protein:vir:100243 Length: 140 98.8 7.4E-11 4.6E-14 76.1 9.6 118 1-131 1-129 (140) 63 protein:vir:99528 Length: 92 # 98.7 4.2E-11 2.6E-14 77.4 6.4 80 1-108 4-92 (92) 64 protein:vir:93617 Length: 148 98.7 1E-10 6.5E-14 75.3 7.1 122 1-131 4-139 (148) 65 protein:vir:102441 Length: 137 98.6 1.1E-10 6.8E-14 75.2 6.1 107 1-131 5-130 (137) 66 protein:vir:1273 Length: 127 # 98.6 3.6E-10 2.2E-13 72.4 7.9 107 1-131 4-123 (127) 67 protein:vir:194 Length: 149 # 98.6 2.4E-10 1.5E-13 73.3 6.5 125 1-131 2-140 (149) 68 protein:vir:97088 Length: 157 98.5 1.2E-09 7.6E-13 69.5 9.5 107 1-131 1-153 (157) 69 protein:vir:105089 Length: 133 98.5 1.7E-09 1E-12 68.7 8.2 108 1-131 2-127 (133) 70 protein:vir:79988 Length: 125 98.4 3.9E-09 2.4E-12 66.7 8.6 108 1-131 1-124 (125) 71 protein:vir:9414 Length: 125 # 98.4 3.9E-09 2.4E-12 66.7 8.6 108 1-131 1-124 (125) 72 protein:vir:98342 Length: 125 98.4 3.9E-09 2.4E-12 66.7 8.6 108 1-131 1-124 (125) 73 protein:vir:81106 Length: 125 98.4 3.9E-09 2.4E-12 66.7 8.6 108 1-131 1-124 (125) 74 protein:vir:4704 Length: 125 # 98.4 3.9E-09 2.4E-12 66.7 8.6 108 1-131 1-124 (125) 75 protein:vir:9708 Length: 125 # 98.4 1.8E-09 1.1E-12 68.6 6.7 107 1-131 1-120 (125) 76 protein:vir:107568 Length: 146 98.4 5.4E-09 3.3E-12 65.9 8.9 120 1-131 5-139 (146) 77 protein:vir:102875 Length: 146 98.4 5.4E-09 3.3E-12 65.9 8.9 120 1-131 5-139 (146) 78 protein:vir:105007 Length: 146 98.4 5.4E-09 3.3E-12 65.9 8.9 120 1-131 5-139 (146) 79 protein:vir:102085 Length: 146 98.4 5.4E-09 3.3E-12 65.9 8.9 120 1-131 5-139 (146) 80 protein:vir:80116 Length: 127 98.4 4.9E-09 3E-12 66.2 8.6 101 1-131 1-123 (127) 81 protein:vir:106506 Length: 137 98.3 9.1E-10 5.6E-13 70.2 4.5 106 1-131 4-127 (137) 82 protein:vir:95372 Length: 124 98.3 5.5E-09 3.4E-12 65.8 8.3 101 1-131 1-122 (124) 83 protein:vir:3873 Length: 128 # 98.3 8.3E-09 5.1E-12 64.9 9.1 112 1-131 1-124 (128) 84 protein:vir:5745 Length: 135 # 98.3 8.4E-09 5.2E-12 64.9 8.3 108 1-131 1-127 (135) 85 protein:vir:9879 Length: 127 # 98.2 7.8E-09 4.9E-12 65.0 6.9 106 1-131 2-126 (127) 86 protein:vir:1891 Length: 179 # 98.0 3.2E-08 2E-11 61.7 7.0 131 1-131 5-173 (179) 87 protein:vir:4347 Length: 164 # 98.0 3.6E-08 2.2E-11 61.4 6.1 127 1-131 5-158 (164) 88 protein:vir:1386 Length: 149 # 97.8 1.8E-07 1.1E-10 57.6 8.0 116 1-131 1-140 (149) 89 protein:vir:80970 Length: 112 97.6 1.4E-06 8.7E-10 52.7 9.1 98 1-131 1-112 (112) 90 protein:vir:102154 Length: 119 97.2 1.3E-06 8.1E-10 52.8 4.8 101 1-131 1-119 (119) 91 protein:vir:100223 Length: 139 97.2 5.3E-06 3.3E-09 49.5 8.0 107 1-131 1-127 (139) 92 protein:vir:100887 Length: 139 97.1 5E-06 3.1E-09 49.6 7.6 109 1-131 1-127 (139) 93 protein:vir:45 Length: 112 # N 97.1 1.1E-05 6.8E-09 47.8 9.2 98 1-131 1-109 (112) 94 protein:vir:4956 Length: 153 # 97.0 9.2E-06 5.7E-09 48.2 8.3 107 1-131 1-131 (153) 95 protein:vir:6246 Length: 143 # 96.9 5.1E-06 3.2E-09 49.6 6.2 101 1-129 1-143 (143) 96 protein:vir:98892 Length: 108 96.9 1.7E-05 1.1E-08 46.7 8.8 99 1-131 2-107 (108) 97 protein:vir:79687 Length: 113 96.9 1.9E-05 1.2E-08 46.4 8.7 94 4-131 1-110 (113) 98 protein:vir:1332 Length: 143 # 96.7 6.5E-06 4.1E-09 49.0 5.3 101 1-129 1-143 (143) 99 protein:vir:4790 Length: 114 # 96.6 4E-05 2.5E-08 44.7 8.8 97 1-129 1-114 (114) 100 protein:vir:3163 Length: 145 # 96.5 2.5E-05 1.6E-08 45.8 6.9 105 1-131 2-140 (145) 101 protein:vir:5000 Length: 141 # 96.4 5.3E-05 3.3E-08 44.0 8.6 107 1-131 2-131 (141) 102 protein:vir:7449 Length: 123 # 96.1 9.6E-05 6E-08 42.6 8.0 104 1-131 4-120 (123) 103 protein:vir:1581 Length: 116 # 96.1 0.00012 7.3E-08 42.1 8.5 99 1-131 1-116 (116) 104 protein:vir:4200 Length: 133 # 96.0 3.3E-05 2E-08 45.2 5.2 92 1-122 2-133 (133) 105 protein:vir:96288 Length: 100 95.8 7.2E-05 4.5E-08 43.3 6.2 80 1-127 13-100 (100) 106 protein:vir:4833 Length: 140 # 95.7 0.00022 1.4E-07 40.7 8.5 109 1-131 2-131 (140) 107 protein:vir:1988 Length: 156 # 95.5 0.00014 9E-08 41.6 7.0 125 1-131 5-151 (156) 108 protein:vir:9823 Length: 118 # 95.4 3.9E-05 2.4E-08 44.8 3.5 96 1-131 2-118 (118) 109 protein:vir:3036 Length: 118 # 95.4 3.9E-05 2.4E-08 44.8 3.5 96 1-131 2-118 (118) 110 protein:vir:4859 Length: 140 # 95.4 0.00043 2.7E-07 39.0 9.0 107 1-131 2-131 (140) 111 protein:vir:101508 Length: 120 95.3 0.0004 2.5E-07 39.2 8.5 104 1-131 4-120 (120) 112 protein:vir:4162 Length: 133 # 95.0 0.00013 8E-08 41.9 5.0 92 1-122 2-133 (133) 113 protein:vir:99196 Length: 155 94.9 0.00048 3E-07 38.8 7.9 101 1-131 1-155 (155) 114 protein:vir:79225 Length: 155 94.8 0.00056 3.5E-07 38.4 8.0 101 1-131 1-155 (155) 115 protein:vir:7993 Length: 108 # 94.6 6.6E-05 4.1E-08 43.5 2.4 94 1-129 12-108 (108) 116 protein:vir:107851 Length: 175 94.0 0.001 6.2E-07 37.0 7.7 126 1-131 1-169 (175) 117 protein:vir:79091 Length: 175 93.0 0.0012 7.6E-07 36.5 6.5 101 1-131 1-172 (175) 118 protein:vir:99833 Length: 190 92.8 0.0021 1.3E-06 35.3 7.5 122 1-131 4-183 (190) 119 protein:vir:103841 Length: 155 91.9 0.0034 2.1E-06 34.1 7.5 124 1-131 1-155 (155) 120 protein:vir:3848 Length: 159 # 91.8 0.0047 2.9E-06 33.4 8.2 110 1-131 3-150 (159) 121 protein:vir:78894 Length: 105 91.3 0.0012 7.3E-07 36.7 4.4 100 1-129 1-105 (105) 122 protein:vir:81067 Length: 119 89.3 0.0013 7.8E-07 36.5 2.7 79 29-131 1-115 (119) 123 protein:vir:9513 Length: 134 # 89.0 0.0099 6.2E-06 31.6 7.5 105 1-130 1-134 (134) 124 protein:vir:101302 Length: 134 89.0 0.0099 6.2E-06 31.6 7.5 105 1-130 1-134 (134) 125 protein:vir:10367 Length: 119 88.8 0.0014 8.9E-07 36.2 2.7 79 29-131 1-115 (119) 126 protein:vir:102190 Length: 93 86.4 0.012 7.8E-06 31.0 6.4 87 18-131 1-93 (93) 127 protein:vir:8106 Length: 150 # 85.8 0.00078 4.9E-07 37.6 -0.5 100 1-131 1-139 (150) 128 protein:vir:100652 Length: 134 82.5 0.047 2.9E-05 27.9 7.8 105 1-130 1-134 (134) 129 protein:vir:9647 Length: 132 # 70.9 0.2 0.00013 24.4 8.0 106 1-131 1-127 (132) 130 protein:vir:6216 Length: 125 # 67.6 0.091 5.6E-05 26.3 5.2 102 1-130 1-125 (125) 131 protein:vir:77650 Length: 155 63.0 0.2 0.00013 24.4 6.1 93 19-131 1-94 (155) 132 protein:vir:101563 Length: 155 61.0 0.18 0.00011 24.7 5.4 93 19-131 1-94 (155) 133 protein:vir:106728 Length: 155 57.2 0.32 0.0002 23.3 6.2 92 19-131 1-94 (155) 134 protein:vir:78607 Length: 155 57.0 0.32 0.0002 23.3 6.1 93 19-131 1-94 (155) 135 protein:vir:105773 Length: 131 54.5 0.38 0.00024 22.9 6.1 105 1-131 1-130 (131) 136 protein:vir:98557 Length: 149 47.2 0.71 0.00044 21.4 7.9 120 1-131 1-148 (149) 137 protein:vir:2026 Length: 150 # 45.0 0.78 0.00049 21.2 7.8 122 1-131 1-149 (150) 138 protein:vir:78163 Length: 92 # 44.9 0.18 0.00011 24.6 2.8 87 1-104 1-92 (92) 139 protein:vir:6071 Length: 150 # 44.5 0.8 0.0005 21.1 8.0 122 1-131 1-149 (150) 140 protein:vir:5703 Length: 150 # 42.7 0.88 0.00054 20.9 8.2 122 1-131 1-149 (150) 141 protein:vir:94069 Length: 168 41.9 0.29 0.00018 23.5 3.4 93 18-131 1-100 (168) 142 protein:vir:98636 Length: 138 40.7 0.96 0.00059 20.7 7.2 106 1-131 7-133 (138) 143 protein:vir:105825 Length: 108 39.3 0.46 0.00029 22.4 4.1 95 1-118 12-108 (108) 144 protein:vir:102608 Length: 108 39.3 0.46 0.00029 22.4 4.1 95 1-118 12-108 (108) 145 protein:vir:5257 Length: 148 # 37.3 1.1 0.00068 20.4 5.8 71 55-131 1-87 (148) 146 protein:vir:96105 Length: 193 22.0 2.5 0.0016 18.4 6.4 102 1-131 1-131 (193) 147 protein:vir:1838 Length: 149 # 20.6 2.7 0.0017 18.2 6.8 117 1-131 1-148 (149) No 1 >protein:vir:78380 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110844;genbank:gi:134288605;genbank:GeneID:5179643 Probab=100.00 E-value=1.7e-53 Score=309.82 Aligned_cols=131 Identities=100% Similarity=1.389 Sum_probs=130.7 Q ss_pred CccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHHHHHHHHhh Q lcl|NC_019539. 1 MSFALDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNAANFVLN 80 (131) Q Consensus 1 msf~~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~~~~i~~ 80 (131) |||+.+|++|++++++++++++|+++++++++|+.++||||||||+||++|+++|+.++.+.+||+|+.+++++..+|++ T Consensus 1 msf~~~i~~~~~~ve~~~~~~~r~~a~~~~~~iv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~g~~t~~~~~~~i~~ 80 (131) T protein:vir:78 1 MSFALDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNAANFVLN 80 (131) T ss_pred CCcCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCchhhhccccceecccccccccCCCCCCchhhHHHHHHHHhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccCCceEEEeeCchhhhhhhcCCCCCCCchhHHHHHHHHHHHHHHHHHhhC Q lcl|NC_019539. 81 AADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQLLNEEASKVK 131 (131) Q Consensus 81 ~~~g~~i~i~Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~~~v~~~~~e~k 131 (131) +++|++|||+||+|||.+||||||+|||+||||+++++|+++|+++++|+| T Consensus 81 ~~~g~~iyi~Nn~pYA~~LEyG~S~QAP~G~v~~~~~~~~~~v~~~~~e~k 131 (131) T protein:vir:78 81 AADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQLLNEEASKVK 131 (131) T ss_pred ccCCceEEEeeCchhhhHhhccccCCCcchHHHHHHHHHHHHHHHHHHhcC Confidence 999999999999999999999999999999999999999999999999999 No 2 >protein:vir:94994 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224022;genbank:gi:62327309;genbank:GeneID:5176822 Probab=100.00 E-value=2.8e-53 Score=308.73 Aligned_cols=131 Identities=94% Similarity=1.345 Sum_probs=130.7 Q ss_pred CccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHHHHHHHHhh Q lcl|NC_019539. 1 MSFALDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNAANFVLN 80 (131) Q Consensus 1 msf~~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~~~~i~~ 80 (131) |||+.+|++|++++++++++++|+++++++++|+.++||||||||+||++|+++|+.++.+++||+|+.+++++..+|++ T Consensus 1 msF~~~i~~~~~~ve~~~~~~~r~~a~~~~~~iv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~g~~t~~~~~~~i~~ 80 (131) T protein:vir:94 1 MSFALDVTRFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGSTPADGTTDATDKSGNTATGNATSFVLN 80 (131) T ss_pred CCcccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCchhhhhccchhccccccccccCCCCCCchhhHHHHHHHHhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccCCceEEEeeCchhhhhhhcCCCCCCCchhHHHHHHHHHHHHHHHHHhhC Q lcl|NC_019539. 81 AADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQLLNEEASKVK 131 (131) Q Consensus 81 ~~~g~~i~i~Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~~~v~~~~~e~k 131 (131) +++|++|||+||+|||.+||||||+|||+||||+++++|+++|+++++|+| T Consensus 81 ~~~g~~iyi~Nn~pYA~~LEyG~S~QAP~g~v~~~~~~~~~~v~~~~~e~k 131 (131) T protein:vir:94 81 AADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQLLNEEASKVK 131 (131) T ss_pred ccccceEEEeeCchhhhhhhccccCCCcchHHHHHHHHHHHHHHHHHHhcC Confidence 999999999999999999999999999999999999999999999999999 No 3 >protein:vir:104347 Length: 145 # NCBI annotation: conserved phage-related protein # Family: family:all:448 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398975;genbank:gi:81343959;genbank:GeneID:3778879 Probab=100.00 E-value=1.1e-52 Score=305.47 Aligned_cols=131 Identities=32% Similarity=0.543 Sum_probs=128.7 Q ss_pred CccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhH---HHHHHH Q lcl|NC_019539. 1 MSFALDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTAT---SNAANF 77 (131) Q Consensus 1 msf~~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~---~~~~~~ 77 (131) |||+++|++|+++++++++.++|+++++++++|+.++||||||||+||++|+++|+.++.+++||+|+.+. .++.++ T Consensus 9 ~sF~~~i~~~~~~ve~~~~~v~r~~a~~i~~~vv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~G~~t~~~~~~~~~~ 88 (145) T protein:vir:10 9 VTFEKSIADWIDRAEDGFGIVVSNTVIKTANAIVDLSPVDTGRFKANWQISANSPAQQSLNEYDQTGGQTKTYLARQARA 88 (145) T ss_pred hccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccceeecccccccccccCCCCccchhhHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999885 578999 Q ss_pred HhhccCCceEEEeeCchhhhhhhcCCCCCCCchhHHHHHHHHHHHHHHHHHhhC Q lcl|NC_019539. 78 VLNAADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQLLNEEASKVK 131 (131) Q Consensus 78 i~~~~~g~~i~i~Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~~~v~~~~~e~k 131 (131) |.++++|++|||+||+|||.+||||||+|||.||||+++++|+++|+++++|+| T Consensus 89 i~~~k~g~~iyi~Nn~pYA~~LEyG~S~QAP~G~v~~~~~~~~~~v~~~~~e~k 142 (145) T protein:vir:10 89 VANSKATSVIYITNRLDYAADLEYGASNQAPAGVLGVVQARLGRYFQEAVEEAR 142 (145) T ss_pred hhcccccceEEEeeCchhhhHhhccccCCCcchHHHHHHHHHHHHHHHHHHHhh Confidence 999999999999999999999999999999999999999999999999999999 No 4 >protein:vir:103280 Length: 142 # NCBI annotation: phage-related hypothetical protein # Family: family:all:448 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277459;genbank:gi:71834102;genbank:GeneID:3562391 Probab=100.00 E-value=6.5e-52 Score=301.20 Aligned_cols=131 Identities=31% Similarity=0.510 Sum_probs=127.1 Q ss_pred CccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHHHH---HHH Q lcl|NC_019539. 1 MSFALDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNA---ANF 77 (131) Q Consensus 1 msf~~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~---~~~ 77 (131) |||+++|++|+++++++++.++|+++++++++|+.++||||||||+||++|+++|+.++.+++||+|+.+...+ ..+ T Consensus 6 ~sf~~~i~~~~~~ve~~~~~v~r~~a~~i~~~vv~~sPVdTGr~R~nw~vs~~~~~~~~~~~~d~~G~~t~~~~~~~~~~ 85 (142) T protein:vir:10 6 VSFRNSINAWIDGVTEGVELIVEGTLTKATKDIVKLSPVDTGRFRGNWQATGNSPAAQSLNNYDPDGNETRNSLRRQIYA 85 (142) T ss_pred hhhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCcccchhhcccceeeecCcccccccCcCCCCccchhhHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999997765 467 Q ss_pred HhhccCCceEEEeeCchhhhhhhcCCCCCCCchhHHHHHHHHHHHHHHHHHhhC Q lcl|NC_019539. 78 VLNAADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQLLNEEASKVK 131 (131) Q Consensus 78 i~~~~~g~~i~i~Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~~~v~~~~~e~k 131 (131) |+++++|++|||+||+|||.+||||||+|||.|||++++++|+++|+++++|+| T Consensus 86 i~~~~~g~~iyi~Nn~pYA~~LEyG~S~QAP~G~v~~a~q~~~~~v~~a~~e~~ 139 (142) T protein:vir:10 86 LARDANTNVIYISNRLDYAQGLEFGSSNQAPSGVLGVVQKRLGRYFAEAVQEAK 139 (142) T ss_pred hhhccccceEEEeeCcchhhhhhccccCCCcchHHHHHHHHHHHHHHHHHHHhh Confidence 788899999999999999999999999999999999999999999999999999 No 5 >protein:vir:79638 Length: 146 # NCBI annotation: gp40 # Family: family:all:448 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285529;genbank:gi:148734512;genbank:GeneID:5219996 Probab=100.00 E-value=8.2e-52 Score=300.67 Aligned_cols=131 Identities=26% Similarity=0.443 Sum_probs=125.5 Q ss_pred CccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHHHHHH---- Q lcl|NC_019539. 1 MSFALDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNAAN---- 76 (131) Q Consensus 1 msf~~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~~~---- 76 (131) .||+++|++|++++|++++.++|+++++++++|+.+|||||||||+||++|+++||.+..+.+||+|+.+.+.+.. T Consensus 7 ~sFa~~i~~~~~~ve~~~~~~~r~~a~~i~~~vv~~sPVDTGr~Ranw~vs~~~~~~~~~~~~dp~G~~t~~~~~~~i~~ 86 (146) T protein:vir:79 7 REFHGNVDKWIEQVESGLNDVIQIFGEKVHGALVDIAPVDTGRFKANMQITANKPPLYALNQYDPDGEKIKAEGRRTLYA 86 (146) T ss_pred HHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhhccccceeecCcccccccCCCCCCcccHHHHHHHHHH Confidence 4899999999999999999999999999999999999999999999999999999999999999999999887744 Q ss_pred HHhhccCCceEEEeeCchhhhhhhcCCCCCCCchhHHHHHHHHHHHHHHHHHhhC Q lcl|NC_019539. 77 FVLNAADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQLLNEEASKVK 131 (131) Q Consensus 77 ~i~~~~~g~~i~i~Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~~~v~~~~~e~k 131 (131) ++.++++|++|||+||+|||.+||||||+|||.|||++++++|+++|+++++|+| T Consensus 87 ~~~g~~~~~~iyi~NnlpYA~~LEyG~S~QAP~G~v~~~~~~~~~~v~~a~~e~k 141 (146) T protein:vir:79 87 LLHGGGAIKSIYFSNMLIYANALEYGHSKQAPAGVFGIVAIRLRSYMAEAIREAR 141 (146) T ss_pred HHhcccccceeEEeeCchhhhhhhccccCCCcchHHHHHHHHHHHHHHHHHHHHH Confidence 4456788999999999999999999999999999999999999999999999999 No 6 >protein:vir:107703 Length: 147 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003902;genbank:gi:45686318;genbank:GeneID:2773043 Probab=100.00 E-value=3.2e-51 Score=297.42 Aligned_cols=131 Identities=26% Similarity=0.440 Sum_probs=126.0 Q ss_pred CccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHHHHH----H Q lcl|NC_019539. 1 MSFALDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNAA----N 76 (131) Q Consensus 1 msf~~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~~----~ 76 (131) -+|+.+|++|++++++++++++|+++++++++|+.++||||||||+||++|+++||.++.+.+||+|+.+.+.+. . T Consensus 7 ~~F~~~i~~~~~~ve~~~~~~~r~~a~~i~~~vv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~dp~g~~t~a~~~~~~~~ 86 (147) T protein:vir:10 7 RRFQGEIDAWINAAESTLEHAIEIFVRDVHDALVSRSPVDTGRFKGNWQITFNEIPNHALNRYDKTGGVVRGEEQAKTYG 86 (147) T ss_pred hhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhhccccceeecCccccccCCcCCCccchhhhhhHHHHH Confidence 389999999999999999999999999999999999999999999999999999999999999999999987664 4 Q ss_pred HHhhccCCceEEEeeCchhhhhhhcCCCCCCCchhHHHHHHHHHHHHHHHHHhhC Q lcl|NC_019539. 77 FVLNAADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQLLNEEASKVK 131 (131) Q Consensus 77 ~i~~~~~g~~i~i~Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~~~v~~~~~e~k 131 (131) .|.++++|++|||+||+|||.+||||||+|||.||||+++++|++||+++++|+| T Consensus 87 ~~~~~~~~~~iyi~Nn~pYA~~LEyG~S~QAP~G~V~~t~q~~~~~v~~~~~e~k 141 (147) T protein:vir:10 87 MFSRGGAITSVHFSNMLIYANALEYGHSQQAPSGVVGLVALRLRSYMADAIKQAR 141 (147) T ss_pred HhhhccCcceEEEeeCcchhhhhhccccCCCCchHHHHHHHHHHHHHHHHHHHHH Confidence 5678899999999999999999999999999999999999999999999999999 No 7 >protein:vir:80425 Length: 134 # NCBI annotation: BcepGomrgp15 # Family: family:all:448 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210235;genbank:gi:146329927;genbank:GeneID:5123534 Probab=100.00 E-value=4.4e-50 Score=291.17 Aligned_cols=130 Identities=25% Similarity=0.378 Sum_probs=124.8 Q ss_pred CccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCc---chhHHHHHHH Q lcl|NC_019539. 1 MSFALDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAG---TTATSNAANF 77 (131) Q Consensus 1 msf~~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g---~~~~~~~~~~ 77 (131) |||+++|++|++++|+++++++|+++++++++|+.++||||||||+||++|+++||.++.+.+|++| .++++.+.++ T Consensus 1 msF~~~i~~~~~~ve~~~~~~~r~~a~~~~~~vv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~g~~~~~~~~~~~~v 80 (134) T protein:vir:80 1 MSYTDRFNVIAKGIEDNVDNLVKNVALAIGSNVIADTPILTGQARRNWQTELNQMPESVLDIPESPSEGMDEALQVLQQT 80 (134) T ss_pred CCcccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhhhcccceeecCcccccccCcCCCCccchhhHHHHHHH Confidence 9999999999999999999999999999999999999999999999999999999999999999988 4578999999 Q ss_pred HhhccCCceEEEeeCchhhhhhhcCCCCCCCchhHHHHHHHHHHHHHHHHHhhC Q lcl|NC_019539. 78 VLNAADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQLLNEEASKVK 131 (131) Q Consensus 78 i~~~~~g~~i~i~Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~~~v~~~~~e~k 131 (131) |+++++|++|||+||+|||.+||||||+|||.||||+++++|+++|++ .+.+- T Consensus 81 i~~~k~g~~iyi~Nn~pYA~~LEyG~S~QAP~G~v~~t~~~~~~~v~~-~~~~~ 133 (134) T protein:vir:80 81 VGQYKAGDTVHITNNAPYIKELNSGSSQQAPANFVETSIMRATRLIRN-VKVVP 133 (134) T ss_pred HhhccCcceEEEeeCchhhhhhhccccCCCcchHHHHHHHHHHHHHHh-hccCC Confidence 999999999999999999999999999999999999999999999998 45554 No 8 >protein:vir:97190 Length: 148 # NCBI annotation: hypothetical protein ORF030 # Family: family:all:448 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294538;genbank:gi:149408259;genbank:GeneID:5237055 Probab=100.00 E-value=1.9e-48 Score=282.26 Aligned_cols=131 Identities=29% Similarity=0.412 Sum_probs=121.0 Q ss_pred CccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCC---------cchhH Q lcl|NC_019539. 1 MSFALDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKA---------GTTAT 71 (131) Q Consensus 1 msf~~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~---------g~~~~ 71 (131) ++|+.+|++|++++|+++++++|+++++++++|+.++||||||||+||++|+++|+.++.+++||. |+.++ T Consensus 5 ~sFa~~i~~~~~~ve~~~~~~~r~~a~~i~~~vv~~sPVdTGrfRanw~vs~~~p~~~~~~~~dp~~~G~~~~~~~~~~i 84 (148) T protein:vir:97 5 SEFSRRITLRGRKVAEGADALTRKVALAADQAVVSGTPVDTGRARSNWIAAIGSAPSSVIDAYSPGEAGSTEAANTQAAI 84 (148) T ss_pred chhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhhhhhhheeecccccccccccCCCCCCcccccchhHHH Confidence 589999999999999999999999999999999999999999999999999999999999988753 55678 Q ss_pred HHHHHHHhhccCCceEEEeeCchhhhhhhcCCCCCCCchhHHHHHHHHHHHHHH--HHHhhC Q lcl|NC_019539. 72 SNAANFVLNAADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQLLNE--EASKVK 131 (131) Q Consensus 72 ~~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~~~v~~--~~~e~k 131 (131) +.+..+|+++++|++|||+||+|||.+||||||+|||.|||++++++|+++|++ +++|.- T Consensus 85 ~~~~~vi~~~k~g~~iyi~NnlpYA~~LEyG~S~QAP~G~v~~t~~~~~~~v~~~~~~~~~~ 146 (148) T protein:vir:97 85 DQAESVIRGYNYGEEIHITNNLPYIQRLNDGYSAQAPANFVEQAVLEAVQVVQFGRVVDGDP 146 (148) T ss_pred HHHHHHhhccCCCceEEEeecchhhhHhhccccCCCcchHHHHHHHHHHHHHHhhhhhcCCC Confidence 889999999999999999999999999999999999999999999999999976 222222 No 9 >protein:vir:96774 Length: 152 # NCBI annotation: hypothetical phage protein # Family: family:all:448 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224253;genbank:gi:62362388;genbank:GeneID:3345713 Probab=100.00 E-value=2.2e-48 Score=281.91 Aligned_cols=128 Identities=27% Similarity=0.363 Sum_probs=120.0 Q ss_pred CccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--------------cchhhccccccccCcccccccCCCCCC Q lcl|NC_019539. 1 MSFALDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKASPV--------------DTGRFRMNWMASGGTPADGTTDATDKA 66 (131) Q Consensus 1 msf~~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~tPV--------------dtGr~R~nw~vs~~~~~~~~~~~~d~~ 66 (131) |||+.+|++|++++|+++++++|++++++++.|+++||| ||||||+||++|+++|+.+..+.+|+ T Consensus 11 msFaa~i~~~~~~~e~~~~~~~R~~~~~i~~~vv~~sPVg~~~~~~~~a~~~ydtGrfRanw~vS~~~p~~~~~~~~~~- 89 (152) T protein:vir:96 11 MSWSKSLKNIIVKNENLTEKQLRAGLFDAANTVILGSPVGAPELWQQPAPNYYRAGSYRSNHRVSISKITSFEKGISSQ- 89 (152) T ss_pred ccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccchhhhhhhheeeecCCCcccccCCCC- Confidence 999999999999999999999999999999999999999 99999999999999999876655555 Q ss_pred cchhHHHHHHHHhhccCCceEEEeeCchhhhhhhcCCCCCCCchhHHHHHHHHHHHHHHHHHhh Q lcl|NC_019539. 67 GTTATSNAANFVLNAADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQLLNEEASKV 130 (131) Q Consensus 67 g~~~~~~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~~~v~~~~~e~ 130 (131) +.++.++.++|.++++|++|||+||+|||.+||||||+|||.||||+++++|+++|+++++-- T Consensus 90 -~~t~~~~~~~i~~~~~g~~iyi~NnlPYA~~LEyG~S~QAP~G~vr~t~~~~~~~v~ea~~~~ 152 (152) T protein:vir:96 90 -SSIMMDLQSDIAKFKIGETLFMTNPLPYATSIEYGHSSQAPNGVYRPAVRRLVKFLNTELKAK 152 (152) T ss_pred -CchHHHHHHHHhhccccceEEEeeCchhhhHhhccccCCCCchHHHHHHHHHHHHHHHHhccC Confidence 556778999999999999999999999999999999999999999999999999999977644 No 10 >protein:vir:95157 Length: 144 # NCBI annotation: hypothetical protein ORF019 # Family: family:all:448 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293426;genbank:gi:148912847;genbank:GeneID:5228232 Probab=100.00 E-value=4.5e-48 Score=280.16 Aligned_cols=129 Identities=23% Similarity=0.357 Sum_probs=121.0 Q ss_pred CccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCC---------CCCcchhH Q lcl|NC_019539. 1 MSFALDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDAT---------DKAGTTAT 71 (131) Q Consensus 1 msf~~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~---------d~~g~~~~ 71 (131) |+|++++++|++++|+.+++++|++|+++++.|++++||||||||+||++|+++|+.++.+++ |++|..++ T Consensus 6 ~~f~~~i~~~~~~ve~~~~~~~r~~a~~v~~~vv~~sPVDTGrfRanw~vs~~~p~~~~~~~~~~~~~~~t~d~sg~~tl 85 (144) T protein:vir:95 6 LDLADRLEKKAKAIDEAASQNAVDTALAIVGDLAYKTPVDTSQALSNWIVTLESPSGQQIKPHFPGSQGSTQRASAAETL 85 (144) T ss_pred hhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccceeccccccccccccccccccccCCCchhHHH Confidence 899999999999999999999999999999999999999999999999999999998887754 56788899 Q ss_pred HHHHHHHhhccCCceEEEeeCchhhhhhhcCCCCCCCchhHHHHHHHHHHHHHHHHHhhC Q lcl|NC_019539. 72 SNAANFVLNAADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQLLNEEASKVK 131 (131) Q Consensus 72 ~~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~~~v~~~~~e~k 131 (131) +.+.++|+++++|++|||+||+|||.+||||||+|||.||||+++++|+++|++. +++ T Consensus 86 ~~~~~vi~~~~~g~~iyi~NnlpYA~~LEyG~S~QAP~G~vr~~~q~~~~~v~~~--~~~ 143 (144) T protein:vir:95 86 NSAKLVLRNKKPGQAIFITNNLPYIRRLNDGYSAQAPAGFVERAVLIGRKMRKKF--KIK 143 (144) T ss_pred HHHHHHHhhcCccceEEEeeCchhhhhhhccccCCCcchHHHHHHHHHHHHHHhh--ccC Confidence 9999999999999999999999999999999999999999999999999998753 222 No 11 >protein:vir:94944 Length: 121 # NCBI annotation: hypothetical protein phage protein # Family: family:all:448 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239282;genbank:gi:66392064;genbank:GeneID:5076589 Probab=100.00 E-value=3.9e-46 Score=269.55 Aligned_cols=118 Identities=28% Similarity=0.460 Sum_probs=113.3 Q ss_pred CccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHHHHHHHHhh Q lcl|NC_019539. 1 MSFALDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNAANFVLN 80 (131) Q Consensus 1 msf~~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~~~~i~~ 80 (131) |||+.+|++|++++++.+++++|+++++++++|+.++||||||||+||++|+++|+.++.+.+||+|+.++..+.... T Consensus 4 ~sf~~~i~~~~~~ve~~~~~~~r~~~~~~~~~vv~~sPVdtGrfRanw~vs~~~p~~~~~~~~dp~g~~t~~~~~~~~-- 81 (121) T protein:vir:94 4 MKFNVNLSRLRSNLREEAKKKAIRIAQEIVNGVIARSPVLAGDYRSSWNVSEGSMEFKFNNGGNPANPTPAPAIVVSS-- 81 (121) T ss_pred chhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCchhhhhccccccccCcccccCCCCCCCcchhHHHHHHHH-- Confidence 999999999999999999999999999999999999999999999999999999999999999999999998875544 Q ss_pred ccCCceEEEeeCchhhhhhhcCCCCCCCchhHHHHHHHHH Q lcl|NC_019539. 81 AADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRFQ 120 (131) Q Consensus 81 ~~~g~~i~i~Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~ 120 (131) .+.|++|||+||+|||.+||||||+|||+||||++++||+ T Consensus 82 ~~~~~~iyi~NnlpYA~~LE~G~S~QAP~G~v~~t~~~~q 121 (121) T protein:vir:94 82 NVALPHFYITNGAPYAQQLEKGSSTQAPLGIVRVTLASLR 121 (121) T ss_pred hhccceEEEeeCcchhhhhhcccCCCCcchHHHHHHHhhC Confidence 4568999999999999999999999999999999999999 No 12 >protein:vir:79034 Length: 141 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110729;genbank:gi:134287346;genbank:GeneID:4955208 Probab=99.77 E-value=2.5e-22 Score=138.99 Aligned_cols=112 Identities=20% Similarity=0.359 Sum_probs=74.8 Q ss_pred Cc----cc-hhHHHHHHH--------HHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCc Q lcl|NC_019539. 1 MS----FA-LDVSKFVEK--------AKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAG 67 (131) Q Consensus 1 ms----f~-~~i~~~~~~--------~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g 67 (131) || |. ..+++|.++ ++..+++++++++.++++.++.+||||||+||.||+++... .....+++| T Consensus 1 M~~~~~~d~~gl~~~~~~l~~~~~~~~~~~~~~~~~~~a~~l~~~vk~~tPVdTG~Lr~sw~~~~~~----~~~~~~~~g 76 (141) T protein:vir:79 1 MARWGSVDFREFKRVCKKMEKLTKIDLDKFCKDAARELAARLLGKVIRRTPVDTGFLRQGWNGVAYA----RSLPVYKQG 76 (141) T ss_pred CCCCccCcHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhhcccccccccc----cccceeecC Confidence 32 21 234444444 44555667888999999999999999999999999876321 111122222 Q ss_pred chhHHHHHHHHhhccCCceEEEeeCchhhhhhhcCCCCCCCchhHH------HHHHHHHHHH----HHHHHhhC Q lcl|NC_019539. 68 TTATSNAANFVLNAADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVR------VNVSRFQQLL----NEEASKVK 131 (131) Q Consensus 68 ~~~~~~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S~QAp~G~V~------~a~~~~~~~v----~~~~~e~k 131 (131) + +-+|.|.||+|||++|||||+++.|.|||. .+.+++...+ ++.+.++= T Consensus 77 ~---------------~~~v~v~n~~~YA~~VE~Ghr~~~~~gfV~G~fml~~s~~~~~~~~~~~~~~~l~~~l 135 (141) T protein:vir:79 77 N---------------NYIIEVVNPTEYASYVNFGHRTKDGKGWVKGQHFLTISEMELQSQVDKIIEKKLLILL 135 (141) T ss_pred C---------------eeEEEEecCCcchhhhhcceeecCCcceeCCchhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2 246899999999999999999999999886 4445444443 33333322 No 13 >protein:vir:105467 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529877;genbank:gi:90592617;genbank:GeneID:3974531 Probab=99.67 E-value=8e-20 Score=125.28 Aligned_cols=106 Identities=25% Similarity=0.415 Sum_probs=77.0 Q ss_pred Cc---cc-hhHHHHHHHHHH---------HHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCc Q lcl|NC_019539. 1 MS---FA-LDVSKFVEKAKK---------NPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAG 67 (131) Q Consensus 1 ms---f~-~~i~~~~~~~~~---------~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g 67 (131) || |. ..+++|.+++++ .+++.+++++.+++..++.+||||||+||+||.++- +..+ T Consensus 1 Ms~~~id~~gl~~~~~~l~~~~~~~~~~~~~~~~l~~~~~~~~~~vk~~tPVdTG~Lr~S~~~~~----------~~~~- 69 (144) T protein:vir:10 1 MSLGHVDDAQFQQFASRVRQKIDSGYVKQELGKSSRRIGTQSLRILEANTPVKQGNLRRSWTAEG----------PTYG- 69 (144) T ss_pred CCCCCccHHHHHHHHHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHhCCCCcchhccceeecc----------eeee- Confidence 54 33 567777776654 345677889999999999999999999999997752 1111 Q ss_pred chhHHHHHHHHhhccCCceEEEeeCchhhhhhhcCCCCCCCc-----------hhH------HHHHHHH----HHHHHHH Q lcl|NC_019539. 68 TTATSNAANFVLNAADWHTFTLTNNLPYAQRLEYGWSQQAPQ-----------GFV------RVNVSRF----QQLLNEE 126 (131) Q Consensus 68 ~~~~~~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S~QAp~-----------G~V------~~a~~~~----~~~v~~~ 126 (131) +.+-++.|.||+|||++|||||+++.+. ||| +.|.+++ ++++++. T Consensus 70 --------------~~~~~~~V~n~~~YA~~VE~Ghr~~~G~~v~~~~~~~~~g~V~G~~~~~~a~~~~~~~~~~~l~k~ 135 (144) T protein:vir:10 70 --------------CGGWTIKLINNAEYASYVESGHRQTPGRYVPVLKKRLVRDWVPGQFYMKKSIPQIQRQLPQLVTEG 135 (144) T ss_pred --------------cCeeEEEEecCCCcccccccceeecCCcccccCCCccccceecCccchHHHHHHHHHHHHHHHHHH Confidence 2234688999999999999999988763 555 5666555 4555666 Q ss_pred HHhhC Q lcl|NC_019539. 127 ASKVK 131 (131) Q Consensus 127 ~~e~k 131 (131) +.++. T Consensus 136 l~~l~ 140 (144) T protein:vir:10 136 LWGLK 140 (144) T ss_pred HHHHh Confidence 66666 No 14 >protein:vir:102963 Length: 163 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945289;genbank:gi:39653724;uniprot:Q708M3;genbank:GeneID:2672877 Probab=99.60 E-value=2.1e-18 Score=117.54 Aligned_cols=105 Identities=22% Similarity=0.345 Sum_probs=75.7 Q ss_pred Cc--cc-hhHHHHHHHHHH---------HHHHHHHHHHHHHHHHHHHhCCc---------------------------cc Q lcl|NC_019539. 1 MS--FA-LDVSKFVEKAKK---------NPEKVIRQVSIKLFSAIIKASPV---------------------------DT 41 (131) Q Consensus 1 ms--f~-~~i~~~~~~~~~---------~~~~~~r~~a~~~~~~vv~~tPV---------------------------dt 41 (131) || |. +++++|.++++. .+++++++++.+++++++.+||| || T Consensus 1 m~~~~d~~~l~~f~k~l~~~~~~~~~~~~~~~~~~e~a~~ll~~vk~rtPv~~~~~~~~~~~~~~~k~~k~~~~~~~k~t 80 (163) T protein:vir:10 1 MSGGFDYRSFAKFANNFNRNANHAKVDRFMRQTLNYEGTELKSKVKERTPVGVYTDHWVEFTTKDGKHVKFWASAHGKQG 80 (163) T ss_pred CCCccCHHHHHHHHHHHHHHhhhcchHHHHHHHHHHHHHHHHHHHHHhCCcccchhhhhhhhhcccchhhhhcccccccc Confidence 55 32 367777776642 36778899999999999999998 89 Q ss_pred hhhccccccccCcccccccCCCCCCcchhHHHHHHHHhhccCCceEEEeeCchhhhhhhcCCCCCCCchhH------HHH Q lcl|NC_019539. 42 GRFRMNWMASGGTPADGTTDATDKAGTTATSNAANFVLNAADWHTFTLTNNLPYAQRLEYGWSQQAPQGFV------RVN 115 (131) Q Consensus 42 Gr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S~QAp~G~V------~~a 115 (131) |.||.||.++- +.++|+ +.+|-|.|+.|||++|||||.+. +.||| +.| T Consensus 81 G~lr~swk~~~----------~~k~~~---------------~~~v~v~N~~~YA~~VE~GHR~~-~gGfV~G~fml~~s 134 (163) T protein:vir:10 81 GTLQKGWSKSR----------IEVSGR---------------TYKQKVYNKVYYAPHVEYGHKTV-NGGFVPGQFFLHKT 134 (163) T ss_pred chhhccceecc----------eeecCC---------------ceEEEEEecCCccchhhcceeec-CCceeccchhhHHH Confidence 99999998862 233333 23688999999999999999654 56777 566 Q ss_pred HHHHHHHHHHHHHhhC Q lcl|NC_019539. 116 VSRFQQLLNEEASKVK 131 (131) Q Consensus 116 ~~~~~~~v~~~~~e~k 131 (131) .+++...+.+.+++.= T Consensus 135 ~~~~~~~~~~~~e~~l 150 (163) T protein:vir:10 135 VEDTKSDMEKRVRDKY 150 (163) T ss_pred HHHHHHHHHHHHHHHH Confidence 6666555444443322 No 15 >protein:vir:3617 Length: 112 # NCBI annotation: ORF40 # Family: family:all:180 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112703;genbank:gi:13786571;genbank:GeneID:921069 Probab=99.57 E-value=7.8e-18 Score=114.38 Aligned_cols=103 Identities=19% Similarity=0.281 Sum_probs=81.7 Q ss_pred Cccchh---HHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHH Q lcl|NC_019539. 1 MSFALD---VSKFVEKAK-----KNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATS 72 (131) Q Consensus 1 msf~~~---i~~~~~~~~-----~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~ 72 (131) |+++-+ ++++.+++. +.+...+++.+..+..+++..+|||||.||+||.++.. T Consensus 1 M~~~i~i~Gld~l~~~L~~~~~~~~~~~al~~~~~~i~~~ak~~aPvdTG~Lr~si~~~~~------------------- 61 (112) T protein:vir:36 1 MKSSLSFKGIDQLVKHLDKAASLKGVQQVVKSNTSNMTANMQKLVPVDTGYMKRSIKMELT------------------- 61 (112) T ss_pred CceeeeehhHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhCCCCchhhhhceeeeec------------------- Confidence 776654 566666655 44566788899999999999999999999999987531 Q ss_pred HHHHHHhhccCCceEEEeeCchhhhhhhcCCCCCCCchhHHHHHHHHHHHHHHHHHhh-C Q lcl|NC_019539. 73 NAANFVLNAADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQLLNEEASKV-K 131 (131) Q Consensus 73 ~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~~~v~~~~~e~-k 131 (131) ..|.++.|.++.+||.+|||||+.+.|+.|++.+++.....+.+.++++ | T Consensus 62 ---------~~~~~~~V~~~~~Ya~~vE~GT~k~~a~Pfl~pa~~~~~~~~~~~i~~~lr 112 (112) T protein:vir:36 62 ---------EGGFSGQAGPHTDYSAYVEYGTRFQSAQPFVKPAYNEQKGVFIKDLERLLK 112 (112) T ss_pred ---------CCceEEEeecCCCccceeeccccccCCCcchhhhHHHHHHHHHHHHHHHcC Confidence 1234788999999999999999999999999999988866655555443 4 No 16 >protein:vir:9930 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795692;genbank:gi:28876456;genbank:GeneID:1257995 Probab=99.56 E-value=1.2e-17 Score=113.42 Aligned_cols=102 Identities=18% Similarity=0.246 Sum_probs=82.5 Q ss_pred Cc-cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHHHHHHHHh Q lcl|NC_019539. 1 MS-FALDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNAANFVL 79 (131) Q Consensus 1 ms-f~~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~~~~i~ 79 (131) +. +...|.+..+.+.+.+...+++.+.++..+++..+|||||+||.||.++... T Consensus 4 ld~l~~~l~~~~~~~~~~v~~al~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~~~------------------------- 58 (108) T protein:vir:99 4 LDRFLRSVERKQKSVRIAVDKELSKSAARIERQAKILAPVDTGWLRAQIYSEQQR------------------------- 58 (108) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCchhhhcceeeeecC------------------------- Confidence 22 5566666767777777888899999999999999999999999999776421 Q ss_pred hccCCceEEEeeCchhhhhhhcCCCCCCCchhHHHHHHHHHHHHHHHHHhh-C Q lcl|NC_019539. 80 NAADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQLLNEEASKV-K 131 (131) Q Consensus 80 ~~~~g~~i~i~Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~~~v~~~~~e~-k 131 (131) +..+.|.++.+||.+||||||.+.|+.|++.++......+.+.++++ | T Consensus 59 ----~~~~~v~~~~~Ya~~vE~GT~~m~a~Pf~~pa~~~~~~~~~~~i~~~lr 107 (108) T protein:vir:99 59 ----LLHYRVVSPALYSIYLELGTRKMEAQSFLDPALRKEWPVLMANIKKMFK 107 (108) T ss_pred ----cEEEEeecCcccchhcccCccccCCCcchhhhHHHHHHHHHHHHHHHhc Confidence 13577899999999999999999999999999998876655555443 3 No 17 >protein:vir:95789 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950593;genbank:gi:119953788;genbank:GeneID:5076859 Probab=99.50 E-value=7e-17 Score=109.16 Aligned_cols=101 Identities=17% Similarity=0.157 Sum_probs=77.5 Q ss_pred Cccch-hHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHH Q lcl|NC_019539. 1 MSFAL-DVSKFVEKA-------KKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATS 72 (131) Q Consensus 1 msf~~-~i~~~~~~~-------~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~ 72 (131) ||+.- -++++.+.+ .+.+...+++.+.++..+++..+|||||+||+||.++.+ T Consensus 1 msi~i~Gld~l~~~l~~~~~~~~~~v~~al~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~~------------------- 61 (114) T protein:vir:95 1 MAIKWQGIEKLVATISNAQPKAVEQSLQVLKNNGEKGKRIAKQLAPKDTEFLKDHITTSYP------------------- 61 (114) T ss_pred CeeeeehHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCchhhhhceeeecC------------------- Confidence 66432 355555444 445566778888889999999999999999999976521 Q ss_pred HHHHHHhhccCCceEEEeeCchhhhhhhcCCCCCCCchhHHHHHHHHHHHHHHHHHh-hC Q lcl|NC_019539. 73 NAANFVLNAADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQLLNEEASK-VK 131 (131) Q Consensus 73 ~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~~~v~~~~~e-~k 131 (131) |....|.++.+|+.+|||||+.|+|+.|++.++++....+.+.+.+ +| T Consensus 62 -----------g~~~~V~~~~~Ya~yvE~GT~~~~aqPfl~pa~~~~~~~~~~~l~~~l~ 110 (114) T protein:vir:95 62 -----------GMEAHIHGEAGYDGYQEYGTRFQPGTPHFRPMMEQIQPQFQKDMTDVMK 110 (114) T ss_pred -----------ceEEEeecCCCccceeecCccccCCCccchhhHHHHHHHHHHHHHHHHH Confidence 2356788999999999999999999999999998887665555543 33 No 18 >protein:vir:102338 Length: 116 # NCBI annotation: hypothetical protein # Family: family:all:26573 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529563;genbank:gi:90592648;genbank:GeneID:3974470 Probab=99.50 E-value=2.5e-17 Score=111.60 Aligned_cols=89 Identities=25% Similarity=0.443 Sum_probs=70.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCc---cchhhccccccccCcccccccCCCCCCcchhHHHHHHHHhhccCCceEEEe Q lcl|NC_019539. 14 AKKNPEKVIRQVSIKLFSAIIKASPV---DTGRFRMNWMASGGTPADGTTDATDKAGTTATSNAANFVLNAADWHTFTLT 90 (131) Q Consensus 14 ~~~~~~~~~r~~a~~~~~~vv~~tPV---dtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~~~~i~~~~~g~~i~i~ 90 (131) +...++++++++|.++++.++.+||| |||.||.||.++- .. +.+++ |. T Consensus 1 l~~~~~~~~~~~a~~l~~~vk~rTPv~~~d~G~LR~sW~~g~----------v~-----------------k~~~~--v~ 51 (116) T protein:vir:10 1 MSKNLRRAKNNIGNKLLRKVKPKTPVAKIDGGTARKSWKYKE----------LN-----------------LFDGV--VS 51 (116) T ss_pred CchHHHHHHHHHHHHHHHHHHhhCCCCcCCCcccccCceeee----------ee-----------------ccCce--ee Confidence 67777789999999999999999998 6799999998751 12 22233 56 Q ss_pred eCchhhhhhhcCCCCCCCch-------------hH------HHHHHHHHHHHHHHHHhhC Q lcl|NC_019539. 91 NNLPYAQRLEYGWSQQAPQG-------------FV------RVNVSRFQQLLNEEASKVK 131 (131) Q Consensus 91 Nn~pYa~~LEyG~S~QAp~G-------------~V------~~a~~~~~~~v~~~~~e~k 131 (131) ||++||++|||||+++...| || +.|.+++.+++.+.+++.= T Consensus 52 N~~eYA~~VE~GHRq~~g~g~~~~~~gkrlk~~~V~G~fml~~s~~e~~~~~~~~~~~~~ 111 (116) T protein:vir:10 52 NNVEYIHHLEYGHRTRQGTGTSENYRPKPNGISFVPGVFMLARSVDEMSSIIDDELNQII 111 (116) T ss_pred cCCcccccccCCceeeCCcceecccccccccCCccCceehHHHHHHHHHHHHHHHHHHHH Confidence 99999999999998877654 55 6888888777666666554 No 19 >protein:vir:94654 Length: 142 # NCBI annotation: tail component protein # Family: family:all:1084 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579211;genbank:gi:93007447;genbank:GeneID:5076773 Probab=99.48 E-value=2e-16 Score=106.70 Aligned_cols=105 Identities=19% Similarity=0.325 Sum_probs=84.1 Q ss_pred CccchhH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHHH Q lcl|NC_019539. 1 MSFALDV-------SKFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSN 73 (131) Q Consensus 1 msf~~~i-------~~~~~~~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~ 73 (131) |++..++ .++.+.+.+.+...+.+.+.++...++..+|||||+||+||.+.+.. T Consensus 4 ~~~~~~~~~l~~~l~~~~~~~~~~~~~~l~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~------------------- 64 (142) T protein:vir:94 4 LNYRVNSTEFQGALRAALDRLTGAAREATEAAANDMVNMAKGLCPVDTGRLRSSIQAVPSG------------------- 64 (142) T ss_pred eEEEecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeecc------------------- Confidence 4444444 44444555667778888999999999999999999999999765421 Q ss_pred HHHHHhhccCCceEEEeeCchhhhhhhcCCC---------------------------CCCCchhHHHHHHHHHHHHHHH Q lcl|NC_019539. 74 AANFVLNAADWHTFTLTNNLPYAQRLEYGWS---------------------------QQAPQGFVRVNVSRFQQLLNEE 126 (131) Q Consensus 74 ~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S---------------------------~QAp~G~V~~a~~~~~~~v~~~ 126 (131) .+...++.|.++++||.++||||+ .+.|+.|++.++.+-...+.+- T Consensus 65 -------~g~~~~~~v~~~~~YA~~vE~Gt~~~~i~pk~~k~l~~~~~~~~~~~v~~pG~~~~pfl~~A~~~~~~~i~~~ 137 (142) T protein:vir:94 65 -------GRFSFSVTIGTNVTYAADVEYGTAPHVIVPKDKKALYWPGAAHPVAKVNHPGTRAQPFMRPAIAAASTFLRNH 137 (142) T ss_pred -------CCceEEEEEecCcccchhhhccCCCceeccCCCccceecccceeeeeeeecCCCCCcchhHHHHHHHHHHHHH Confidence 112236889999999999999984 3679999999999999999999 Q ss_pred HHhhC Q lcl|NC_019539. 127 ASKVK 131 (131) Q Consensus 127 ~~e~k 131 (131) ++++| T Consensus 138 ~~~~~ 142 (142) T protein:vir:94 138 AKGIR 142 (142) T ss_pred HHhcC Confidence 99999 No 20 >protein:vir:743 Length: 108 # NCBI annotation: unknown # Family: family:all:180 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108720;genbank:gi:13487842;genbank:GeneID:920877 Probab=99.47 E-value=2.2e-16 Score=106.40 Aligned_cols=102 Identities=19% Similarity=0.258 Sum_probs=77.7 Q ss_pred CccchhHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHHHHH Q lcl|NC_019539. 1 MSFALDVSKFVEKA-----KKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNAA 75 (131) Q Consensus 1 msf~~~i~~~~~~~-----~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~~ 75 (131) |.|.. ++++.+++ ++.+...+++.+..+..+++..+|||||.||+||.+... T Consensus 1 i~i~G-ld~l~~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aPv~TG~Lr~si~~~~~---------------------- 57 (108) T protein:vir:74 1 MKITG-IDALQKKLRKNATLDDVKHVVKSNTASMNKNMQNLAPVDTGNMKRSITSEFT---------------------- 57 (108) T ss_pred Ccchh-HHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhCCCCchhhhccceeeee---------------------- Confidence 44432 23333332 344667889999999999999999999999999987542 Q ss_pred HHHhhccCCceEEEeeCchhhhhhhcCCCCCCCchhHHHHHHHHHHHHHHHHHhh-C Q lcl|NC_019539. 76 NFVLNAADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQLLNEEASKV-K 131 (131) Q Consensus 76 ~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~~~v~~~~~e~-k 131 (131) +.|.++.|.++.+||.+||||||.|+|+.|++.++......+.+.+.++ | T Consensus 58 ------~~~~~~~V~~~~~Ya~~vE~GT~km~aqpf~~pa~~~~~~~~~~~i~~~~k 108 (108) T protein:vir:74 58 ------DGGLSGTTGPHTDYAGYVEYGTRFQSAQPFVKPAFNIQKKVFTNDLERLTK 108 (108) T ss_pred ------cCceEEEeecCCCcccceeccccccCCCcchhhHHHHHHHHHHHHHHHHcC Confidence 1134677899999999999999999999999999988866665555443 4 No 21 >protein:vir:98409 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:YP_001210363;genbank:gi:146334932;genbank:GeneID:5114801 Probab=99.46 E-value=2.8e-16 Score=105.87 Aligned_cols=102 Identities=21% Similarity=0.304 Sum_probs=77.4 Q ss_pred CccchhHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHHHHH Q lcl|NC_019539. 1 MSFALDVSKFVEKAK-----KNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNAA 75 (131) Q Consensus 1 msf~~~i~~~~~~~~-----~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~~ 75 (131) |.|.. ++++.+.++ ..+...+++.+..+..+++..+|||||.||+||.+... T Consensus 1 i~i~G-ld~l~~~l~~~~~~~~~~~al~~~a~~i~~~ak~~apvdTG~Lr~si~~~~~---------------------- 57 (108) T protein:vir:98 1 MKITG-IDALQKKLRKNATLNDVKHVVKRNTVSMNKNMQNLAPVDTGNMKRSITSEFT---------------------- 57 (108) T ss_pred Ccchh-HHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhCCCCchhhHhhceeeee---------------------- Confidence 55442 344444333 34567888999999999999999999999999976532 Q ss_pred HHHhhccCCceEEEeeCchhhhhhhcCCCCCCCchhHHHHHHHHHHHHHHHHHhh-C Q lcl|NC_019539. 76 NFVLNAADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQLLNEEASKV-K 131 (131) Q Consensus 76 ~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~~~v~~~~~e~-k 131 (131) +.+-++.|.++.+||.+|||||+.|.|+.|++.+++.....+.+.++++ | T Consensus 58 ------~~~~~~~V~~~~~Ya~~vE~GT~~m~aqPFl~pa~~~~~~~~~~~i~~~lr 108 (108) T protein:vir:98 58 ------DGGLTGTTIPHTDYAGYVEYGTRFQAAQPFVKPAFDVQKKIFTNDLERLTK 108 (108) T ss_pred ------cCceEEEeecCCCccceeeccccccCCCcchhhHHHHHHHHHHHHHHHHcC Confidence 1123578899999999999999999999999999988866655555443 3 No 22 >protein:vir:5978 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690678;genbank:geneid:6329146;genbank:gi:22855072;interpro:IPR011693;uniprot:O48447;genbank:GeneID:955318 Probab=99.41 E-value=1.1e-15 Score=102.49 Aligned_cols=103 Identities=22% Similarity=0.262 Sum_probs=84.3 Q ss_pred Ccc----------chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchh Q lcl|NC_019539. 1 MSF----------ALDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTA 70 (131) Q Consensus 1 msf----------~~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~ 70 (131) ||. ..++.++.+.+.+.+++.+++.+.++...++..+|||||+||+||.+.+. T Consensus 4 ms~~i~~~g~~~l~~~l~~~~~~~~~~v~~~l~~~a~~i~~~ak~~apv~TG~Lr~SI~~~~~----------------- 66 (144) T protein:vir:59 4 MSVRIDPSWRRIMSRNVRTFSGHVLTQVEQVIIKTAEKIAGLAASLAPVDEGNLKNSIQIDYK----------------- 66 (144) T ss_pred ceeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcCeeEEee----------------- Confidence 443 33455566677778888899999999999999999999999999976542 Q ss_pred HHHHHHHHhhccCCceEEEeeCchhhhhhhcCC---------------------------CCCCCchhHHHHHHHHHHHH Q lcl|NC_019539. 71 TSNAANFVLNAADWHTFTLTNNLPYAQRLEYGW---------------------------SQQAPQGFVRVNVSRFQQLL 123 (131) Q Consensus 71 ~~~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~---------------------------S~QAp~G~V~~a~~~~~~~v 123 (131) ..|-+..|.++++||.++|||| +.+.|+.|++.+++.-...| T Consensus 67 -----------~~g~~~~V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~~ 135 (144) T protein:vir:59 67 -----------NNGLTAEITVGAEYAIYVEYGTGIYAVDGNGRKTPWTYYSPKLGRYVRTQGAPAQPFFWPAVEEGGEYF 135 (144) T ss_pred -----------cCcEEEEEecCCCccchhhcCccccccCCCccccccccccccccceecCCCCCCCcchhHHHHHHHHHH Confidence 1123577899999999999998 34778889999999999999 Q ss_pred HHHHHhhC Q lcl|NC_019539. 124 NEEASKVK 131 (131) Q Consensus 124 ~~~~~e~k 131 (131) .+.++++- T Consensus 136 ~~~i~~~~ 143 (144) T protein:vir:59 136 EREMRRLR 143 (144) T ss_pred HHHHHHhc Confidence 99999888 No 23 >protein:vir:95894 Length: 137 # NCBI annotation: ORF046 # Family: family:all:180 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240389;genbank:gi:66396083;genbank:GeneID:5133405 Probab=99.41 E-value=8.3e-16 Score=103.26 Aligned_cols=100 Identities=21% Similarity=0.245 Sum_probs=82.6 Q ss_pred Cc--------cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHH Q lcl|NC_019539. 1 MS--------FALDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATS 72 (131) Q Consensus 1 ms--------f~~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~ 72 (131) |+ |.+.|+.+.+++.+.+...+++.+.++..+++..+|||||.||+||.+.+.. T Consensus 1 Ma~~~~G~~~l~~~l~~~~~~~~~~~~~~~~~~a~~v~~~ak~~aPv~TG~L~~Si~~~~~~------------------ 62 (137) T protein:vir:95 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKD------------------ 62 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcCeeeEeeC------------------ Confidence 33 5566666667777777788888999999999999999999999999765421 Q ss_pred HHHHHHhhccCCceEEEeeCchhhhhhhcCC-----------------------------CCCCCchhHHHHHHHHHHHH Q lcl|NC_019539. 73 NAANFVLNAADWHTFTLTNNLPYAQRLEYGW-----------------------------SQQAPQGFVRVNVSRFQQLL 123 (131) Q Consensus 73 ~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~-----------------------------S~QAp~G~V~~a~~~~~~~v 123 (131) .|-+..|.++++||.++|||| +.|.|+.|++.+++.....| T Consensus 63 ----------~~~~~~V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~i 132 (137) T protein:vir:95 63 ----------GGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFF 132 (137) T ss_pred ----------CceEEEEecCCCcccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHH Confidence 123567889999999999998 56889999999999999999 Q ss_pred HHHHH Q lcl|NC_019539. 124 NEEAS 128 (131) Q Consensus 124 ~~~~~ 128 (131) .+.+. T Consensus 133 ~k~l~ 137 (137) T protein:vir:95 133 NKYFS 137 (137) T ss_pred HHhhC Confidence 88888 No 24 >protein:vir:96121 Length: 137 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240082;genbank:gi:66395767;genbank:GeneID:5133101 Probab=99.40 E-value=1.1e-15 Score=102.69 Aligned_cols=100 Identities=16% Similarity=0.236 Sum_probs=81.4 Q ss_pred Cc--------cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHH Q lcl|NC_019539. 1 MS--------FALDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATS 72 (131) Q Consensus 1 ms--------f~~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~ 72 (131) |+ |.+.|+++.+.+.+.+...+++.+.++..+++..+|||||+||+||.+.+. T Consensus 1 Ma~~~~G~~~l~~~l~~~~~~~~~~~~~~l~~~a~~~~~~ak~~~pvdTG~L~~Si~~~~~------------------- 61 (137) T protein:vir:96 1 MAKVKYGNWDLVAELEDYRDEMEEWVKKGILKTTLAIYNTAVALAPVDLGFLKESIDFKVT------------------- 61 (137) T ss_pred CchhHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCccchhcCceeEee------------------- Confidence 54 345566666666667777888899999999999999999999999977542 Q ss_pred HHHHHHhhccCCceEEEeeCchhhhhhhcCC-----------------------------CCCCCchhHHHHHHHHHHHH Q lcl|NC_019539. 73 NAANFVLNAADWHTFTLTNNLPYAQRLEYGW-----------------------------SQQAPQGFVRVNVSRFQQLL 123 (131) Q Consensus 73 ~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~-----------------------------S~QAp~G~V~~a~~~~~~~v 123 (131) ..|.++.|.++++||.++|||| +.|.|+.|++.++.+-...| T Consensus 62 ---------~~g~~~~V~~~~~YA~yvE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~pFl~pA~~~~~~~i 132 (137) T protein:vir:96 62 ---------DGGFSSVISVGAEYAIYVEFGTGIYATGPGGSRARKLPWTYKGDDGEWHTTYGQQAQPFWNPAIDEGRKVF 132 (137) T ss_pred ---------cCceEEEEecCCCcccccccCccccccCCCccccccccceeeccCcceeecCCCCCCcchhHHHHHHHHHH Confidence 1234688999999999999998 45778899999999988888 Q ss_pred HHHHH Q lcl|NC_019539. 124 NEEAS 128 (131) Q Consensus 124 ~~~~~ 128 (131) .+.+. T Consensus 133 ~k~i~ 137 (137) T protein:vir:96 133 NRYFS 137 (137) T ss_pred HHhhC Confidence 88888 No 25 >protein:vir:94490 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240680;genbank:gi:66396374;genbank:GeneID:5133754 Probab=99.39 E-value=1.4e-15 Score=102.03 Aligned_cols=100 Identities=21% Similarity=0.246 Sum_probs=82.9 Q ss_pred Cc--------cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHH Q lcl|NC_019539. 1 MS--------FALDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATS 72 (131) Q Consensus 1 ms--------f~~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~ 72 (131) |+ |.+.|+++.+++.+.+...+++.+.++..+++..+|||||.||+||.+.+.. T Consensus 1 Ma~~~~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~------------------ 62 (137) T protein:vir:94 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKD------------------ 62 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEeec------------------ Confidence 33 5566777777777777788899999999999999999999999999765421 Q ss_pred HHHHHHhhccCCceEEEeeCchhhhhhhcCC-----------------------------CCCCCchhHHHHHHHHHHHH Q lcl|NC_019539. 73 NAANFVLNAADWHTFTLTNNLPYAQRLEYGW-----------------------------SQQAPQGFVRVNVSRFQQLL 123 (131) Q Consensus 73 ~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~-----------------------------S~QAp~G~V~~a~~~~~~~v 123 (131) .|-+..|.++++||.++|||| +.|.|+.|++.+++.....| T Consensus 63 ----------~~~~~~V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~ 132 (137) T protein:vir:94 63 ----------SGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFF 132 (137) T ss_pred ----------CceEEEEecCCCcccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHH Confidence 123566889999999999999 56889999999999999999 Q ss_pred HHHHH Q lcl|NC_019539. 124 NEEAS 128 (131) Q Consensus 124 ~~~~~ 128 (131) .+.+. T Consensus 133 ~~~l~ 137 (137) T protein:vir:94 133 NKYFS 137 (137) T ss_pred HHhhC Confidence 88888 No 26 >protein:vir:97427 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240753;genbank:gi:66396447;genbank:GeneID:5133783 Probab=99.39 E-value=1.4e-15 Score=102.03 Aligned_cols=100 Identities=21% Similarity=0.246 Sum_probs=82.9 Q ss_pred Cc--------cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHH Q lcl|NC_019539. 1 MS--------FALDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATS 72 (131) Q Consensus 1 ms--------f~~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~ 72 (131) |+ |.+.|+++.+++.+.+...+++.+.++..+++..+|||||.||+||.+.+.. T Consensus 1 Ma~~~~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~------------------ 62 (137) T protein:vir:97 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKD------------------ 62 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEeec------------------ Confidence 33 5566777777777777788899999999999999999999999999765421 Q ss_pred HHHHHHhhccCCceEEEeeCchhhhhhhcCC-----------------------------CCCCCchhHHHHHHHHHHHH Q lcl|NC_019539. 73 NAANFVLNAADWHTFTLTNNLPYAQRLEYGW-----------------------------SQQAPQGFVRVNVSRFQQLL 123 (131) Q Consensus 73 ~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~-----------------------------S~QAp~G~V~~a~~~~~~~v 123 (131) .|-+..|.++++||.++|||| +.|.|+.|++.+++.....| T Consensus 63 ----------~~~~~~V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~ 132 (137) T protein:vir:97 63 ----------SGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFF 132 (137) T ss_pred ----------CceEEEEecCCCcccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHH Confidence 123566889999999999999 56889999999999999999 Q ss_pred HHHHH Q lcl|NC_019539. 124 NEEAS 128 (131) Q Consensus 124 ~~~~~ 128 (131) .+.+. T Consensus 133 ~~~l~ 137 (137) T protein:vir:97 133 NKYFS 137 (137) T ss_pred HHhhC Confidence 88888 No 27 >protein:vir:93738 Length: 137 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240463;genbank:gi:66396153;genbank:GeneID:5133507 Probab=99.39 E-value=1.4e-15 Score=102.03 Aligned_cols=100 Identities=21% Similarity=0.246 Sum_probs=82.9 Q ss_pred Cc--------cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHH Q lcl|NC_019539. 1 MS--------FALDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATS 72 (131) Q Consensus 1 ms--------f~~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~ 72 (131) |+ |.+.|+++.+++.+.+...+++.+.++..+++..+|||||.||+||.+.+.. T Consensus 1 Ma~~~~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~------------------ 62 (137) T protein:vir:93 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKD------------------ 62 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEeec------------------ Confidence 33 5566777777777777788899999999999999999999999999765421 Q ss_pred HHHHHHhhccCCceEEEeeCchhhhhhhcCC-----------------------------CCCCCchhHHHHHHHHHHHH Q lcl|NC_019539. 73 NAANFVLNAADWHTFTLTNNLPYAQRLEYGW-----------------------------SQQAPQGFVRVNVSRFQQLL 123 (131) Q Consensus 73 ~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~-----------------------------S~QAp~G~V~~a~~~~~~~v 123 (131) .|-+..|.++++||.++|||| +.|.|+.|++.+++.....| T Consensus 63 ----------~~~~~~V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~ 132 (137) T protein:vir:93 63 ----------SGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFF 132 (137) T ss_pred ----------CceEEEEecCCCcccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHH Confidence 123566889999999999999 56889999999999999999 Q ss_pred HHHHH Q lcl|NC_019539. 124 NEEAS 128 (131) Q Consensus 124 ~~~~~ 128 (131) .+.+. T Consensus 133 ~~~l~ 137 (137) T protein:vir:93 133 NKYFS 137 (137) T ss_pred HHhhC Confidence 88888 No 28 >protein:vir:94538 Length: 125 # NCBI annotation: putative head to tail joining # Family: family:all:180 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223893;genbank:gi:62327105;genbank:GeneID:5075554 Probab=99.38 E-value=1.3e-15 Score=102.22 Aligned_cols=106 Identities=15% Similarity=0.186 Sum_probs=73.7 Q ss_pred Cccch-hHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHH Q lcl|NC_019539. 1 MSFAL-DVSKFVE-------KAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATS 72 (131) Q Consensus 1 msf~~-~i~~~~~-------~~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~ 72 (131) ||+.- -++++.+ .+.+.+...+++.+..+..+.+..+|||||.||.||.++-.. .+. T Consensus 5 ~~i~~~Gld~l~~~L~~~~~~~~~~v~~al~~~a~~i~~~ak~~ap~~tG~L~~sI~~~~~~---------~~~------ 69 (125) T protein:vir:94 5 FNIKFKGVDKLLDEFDISRKELVPYSVEAMKTSLSRAVEKSKGLARVDTGYMRNNIQQDEVK---------EEH------ 69 (125) T ss_pred eeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCChhhhhhceeccee---------ccC------ Confidence 44331 1344433 334455556677788889999999999999999999775211 111 Q ss_pred HHHHHHhhccCCceEEEeeCchhhhhhhcCCCCCCCchhHHHHHHHHHHHHHHHHHhh-C Q lcl|NC_019539. 73 NAANFVLNAADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQLLNEEASKV-K 131 (131) Q Consensus 73 ~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~~~v~~~~~e~-k 131 (131) .|-++.+.++.+||.+||||||.|.|+.|++.+++.....+.+.+.+. | T Consensus 70 ----------~~~~~~v~~~~~Ya~~vEfGT~~~~a~Pfl~pa~~~~~~~~~~~l~~~l~ 119 (125) T protein:vir:94 70 ----------GVVTGRYVARADYSSYNEYGTYRMSAQPFMAPSVAAMTPFFYKAVRDALN 119 (125) T ss_pred ----------CcEEEEeeCCCCccceeecccccCCCCcccchhHHHHHHHHHHHHHHHHH Confidence 124577889999999999999999999999999877654444433332 1 No 29 >protein:vir:94108 Length: 149 # NCBI annotation: ORF029 # Family: family:all:180 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240238;genbank:gi:66395914;genbank:GeneID:5133277 Probab=99.35 E-value=3.2e-15 Score=100.02 Aligned_cols=100 Identities=22% Similarity=0.258 Sum_probs=80.3 Q ss_pred Cc--------cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHH Q lcl|NC_019539. 1 MS--------FALDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATS 72 (131) Q Consensus 1 ms--------f~~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~ 72 (131) |+ |.+.|+++.+++.+.+.+.+++.+.++..+++..+|||||+||+||.+.+. T Consensus 13 Ma~~~~Gld~l~~~L~~~~~~~~~~~~~al~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~------------------- 73 (149) T protein:vir:94 13 MAKVKYGADSMVVELDKFDKKIEEWVKKGIAKTTTKIYNTAVALAPVDLGFLEESIDFKYF------------------- 73 (149) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhhcCeeEEee------------------- Confidence 22 445566666666677777888899999999999999999999999987531 Q ss_pred HHHHHHhhccCCceEEEeeCchhhhhhhcCC-----------------------------CCCCCchhHHHHHHHHHHHH Q lcl|NC_019539. 73 NAANFVLNAADWHTFTLTNNLPYAQRLEYGW-----------------------------SQQAPQGFVRVNVSRFQQLL 123 (131) Q Consensus 73 ~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~-----------------------------S~QAp~G~V~~a~~~~~~~v 123 (131) ..|-+..|.++++||.++|||| +.|.|+.|++.++++-...| T Consensus 74 ---------~~g~~~~V~~~~~YA~~VE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~~~~i 144 (149) T protein:vir:94 74 ---------DGGLSSVISVGADYAIYVEYGTGIYATGPGGSRATKIPWSFKGDDGEWYTTYGQAPQPFWNPAIDAGRKTF 144 (149) T ss_pred ---------CCcEEEEEecCCCcccccccCccccccCCCccccccccceeecCccceecCCCCCCCcchHHHHHHHHHHH Confidence 1123567899999999999998 44678899999999988888 Q ss_pred HHHHH Q lcl|NC_019539. 124 NEEAS 128 (131) Q Consensus 124 ~~~~~ 128 (131) .+.+. T Consensus 145 ~~~i~ 149 (149) T protein:vir:94 145 EQYFS 149 (149) T ss_pred HHhhC Confidence 88888 No 30 >protein:vir:94796 Length: 137 # NCBI annotation: ORF050 # Family: family:all:180 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240540;genbank:gi:66396237;genbank:GeneID:5133576 Probab=99.35 E-value=3.3e-15 Score=100.01 Aligned_cols=100 Identities=21% Similarity=0.253 Sum_probs=80.5 Q ss_pred Cc--------cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHH Q lcl|NC_019539. 1 MS--------FALDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATS 72 (131) Q Consensus 1 ms--------f~~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~ 72 (131) |+ |.+.|.++.+++.+.+...+++.+.++..+++..+|||||+||+||.+.+.. T Consensus 1 Ma~~~~G~~~l~~~L~~~~~~~~~~~~~al~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~------------------ 62 (137) T protein:vir:94 1 MAKVKYGNWDLVKELENYERDIERWVKRGIAKTTVKIHNTIISLMPVDTGYLRESVTMDFKD------------------ 62 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCceeEeec------------------ Confidence 33 4455555666666666777888899999999999999999999999765421 Q ss_pred HHHHHHhhccCCceEEEeeCchhhhhhhcCC-----------------------------CCCCCchhHHHHHHHHHHHH Q lcl|NC_019539. 73 NAANFVLNAADWHTFTLTNNLPYAQRLEYGW-----------------------------SQQAPQGFVRVNVSRFQQLL 123 (131) Q Consensus 73 ~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~-----------------------------S~QAp~G~V~~a~~~~~~~v 123 (131) .|-++.|.++++||.++|||| +.|.|+.|++.+++.....| T Consensus 63 ----------~~~~~~V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~ 132 (137) T protein:vir:94 63 ----------GGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRVFF 132 (137) T ss_pred ----------CcEEEEEecCCCcccccccCccccccCCCcccccccccceeccCCceeecCCcCCCcchHHHHHHHHHHH Confidence 123577899999999999994 46888899999999999999 Q ss_pred HHHHH Q lcl|NC_019539. 124 NEEAS 128 (131) Q Consensus 124 ~~~~~ 128 (131) .+.+. T Consensus 133 ~~~l~ 137 (137) T protein:vir:94 133 NKYFS 137 (137) T ss_pred HHhhC Confidence 88888 No 31 >protein:vir:96829 Length: 135 # NCBI annotation: ORF033 # Family: family:all:180 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240161;genbank:gi:66395838;genbank:GeneID:5133170 Probab=99.34 E-value=4.5e-15 Score=99.23 Aligned_cols=100 Identities=25% Similarity=0.297 Sum_probs=80.4 Q ss_pred Cc--------cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHH Q lcl|NC_019539. 1 MS--------FALDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATS 72 (131) Q Consensus 1 ms--------f~~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~ 72 (131) |+ |.+.|+++.+++.+.++..+++.+.++...++..+|||||+||+||.+.+. T Consensus 1 Ma~~~~Gl~~l~~~l~~~~~~~~~~~~~al~~~a~~v~~~ak~~apvdTG~Lr~SI~~~~~------------------- 61 (135) T protein:vir:96 1 MAKVKYGADSIVVDLEKYSKDMEKWVKKGITKTTLKIYNTAIHLMPVDTGFLRQSTTVDFE------------------- 61 (135) T ss_pred CchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeEEee------------------- Confidence 55 344555555666667777888899999999999999999999999976532 Q ss_pred HHHHHHhhccCCceEEEeeCchhhhhhhcCC---------------------------CCCCCchhHHHHHHHHHHHHHH Q lcl|NC_019539. 73 NAANFVLNAADWHTFTLTNNLPYAQRLEYGW---------------------------SQQAPQGFVRVNVSRFQQLLNE 125 (131) Q Consensus 73 ~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~---------------------------S~QAp~G~V~~a~~~~~~~v~~ 125 (131) ..|-+..|.++++||.++|||| +.+.|+.|++.+++.....|.+ T Consensus 62 ---------~~g~~~~V~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~a~pfl~~A~~~~~~~~~~ 132 (135) T protein:vir:96 62 ---------NGGFTGVVKIGSNYAVYVNYGTGIYATKGSRAHKIPWTYKDPNGKWHTTYGQMPQPFWEPAIDAGRQTFEQ 132 (135) T ss_pred ---------cCcEEEEEecCCCccchhhcccccccCCCccccccccccccCCcceeecCCcCCCcchhHHHHHHHHHHHH Confidence 1123466889999999999998 5588999999999999988888 Q ss_pred HHH Q lcl|NC_019539. 126 EAS 128 (131) Q Consensus 126 ~~~ 128 (131) .+. T Consensus 133 ~i~ 135 (135) T protein:vir:96 133 YFS 135 (135) T ss_pred hcC Confidence 888 No 32 >protein:vir:105916 Length: 149 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004379;genbank:gi:122891834;genbank:GeneID:4712387 Probab=99.33 E-value=5.7e-15 Score=98.66 Aligned_cols=100 Identities=22% Similarity=0.265 Sum_probs=80.6 Q ss_pred Cc--------cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHH Q lcl|NC_019539. 1 MS--------FALDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATS 72 (131) Q Consensus 1 ms--------f~~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~ 72 (131) |+ |.+.|+++.+++.+.+.+.+++.+.++..+++..+|||||+||+||.+.+. T Consensus 13 Ma~v~~Gld~l~~~l~~~~~~~~~~~~~~l~~~a~~v~~~ak~~aPvdTG~L~~SI~~~~~------------------- 73 (149) T protein:vir:10 13 MAKVKYGADSMVVELDKFDKKIEEWVKKGIAKTTTKIYNTAVALAPVDLGFLEESIDFKYF------------------- 73 (149) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhhccceEEec------------------- Confidence 22 455666666667777777888899999999999999999999999987531 Q ss_pred HHHHHHhhccCCceEEEeeCchhhhhhhcCCC-----------------------------CCCCchhHHHHHHHHHHHH Q lcl|NC_019539. 73 NAANFVLNAADWHTFTLTNNLPYAQRLEYGWS-----------------------------QQAPQGFVRVNVSRFQQLL 123 (131) Q Consensus 73 ~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S-----------------------------~QAp~G~V~~a~~~~~~~v 123 (131) ..|-+..|.++++||.++||||. .|.|+.|++.++.+-...| T Consensus 74 ---------~~g~~~~V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~k~~i 144 (149) T protein:vir:10 74 ---------DGGLSSVISVGADYAIYVEYGTGIYATGPGGSRATKIPWSFKGDDGEWYTTYGQAPQPFWNPAIDAGRKTF 144 (149) T ss_pred ---------CCcEEEEEecCCCcccccccCccccccCCcccccccccceeeccccceecCCCCCCCcchhHHHHHHHHHH Confidence 11235678999999999999983 4668889999999999988 Q ss_pred HHHHH Q lcl|NC_019539. 124 NEEAS 128 (131) Q Consensus 124 ~~~~~ 128 (131) .+.+. T Consensus 145 ~~~i~ 149 (149) T protein:vir:10 145 EQYFS 149 (149) T ss_pred HHhhC Confidence 88888 No 33 >protein:vir:107099 Length: 137 # NCBI annotation: conserved phage protein # Family: family:all:180 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950610;genbank:gi:119953690;genbank:GeneID:4643108 Probab=99.30 E-value=1e-14 Score=97.26 Aligned_cols=100 Identities=21% Similarity=0.238 Sum_probs=79.9 Q ss_pred Cc--------cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHH Q lcl|NC_019539. 1 MS--------FALDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATS 72 (131) Q Consensus 1 ms--------f~~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~ 72 (131) |+ |.+.|.++.+++.+.+...+++.+.++..+++..+|||||+||+||.+.+.. T Consensus 1 Ma~~~~Gl~~l~~~l~~~~~~~~~~~~~al~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~------------------ 62 (137) T protein:vir:10 1 MAKVKYGNWELVKELEDFEKETIRWAKKGIAKTTTIIHNSIVSNMPVDTGYLRESVSMDFKK------------------ 62 (137) T ss_pred CchhHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCeeEEeeC------------------ Confidence 54 4455555666666777788899999999999999999999999999765321 Q ss_pred HHHHHHhhccCCceEEEeeCchhhhhhhcCC-----------------------------CCCCCchhHHHHHHHHHHHH Q lcl|NC_019539. 73 NAANFVLNAADWHTFTLTNNLPYAQRLEYGW-----------------------------SQQAPQGFVRVNVSRFQQLL 123 (131) Q Consensus 73 ~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~-----------------------------S~QAp~G~V~~a~~~~~~~v 123 (131) .|-+.+|.++++||.++|||+ +.|.|+.|++.++.+-...| T Consensus 63 ----------~~~~~~V~~~~~Ya~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~i 132 (137) T protein:vir:10 63 ----------GGLTGVINIGSEYAVYVNYGTGIYAVGPGGSRAKNIPWCYKDADGHWHTTKGQHAQPFWEPAIDEGRAFF 132 (137) T ss_pred ----------CcEEEEEecCCCcccccccCccccccCCCccccccccceeeccccceeccCCCCCCcchhHHHHHHHHHH Confidence 123578899999999999995 45788899999998888888 Q ss_pred HHHHH Q lcl|NC_019539. 124 NEEAS 128 (131) Q Consensus 124 ~~~~~ 128 (131) .+.+. T Consensus 133 ~k~i~ 137 (137) T protein:vir:10 133 NKYFS 137 (137) T ss_pred HHhcC Confidence 88877 No 34 >protein:vir:96486 Length: 112 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238496;genbank:gi:66391772;genbank:GeneID:5176908 Probab=99.29 E-value=1.1e-14 Score=97.16 Aligned_cols=100 Identities=20% Similarity=0.223 Sum_probs=74.7 Q ss_pred Cc---cchhHHHHHHHHH-----HHHHHHHHH----HHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcc Q lcl|NC_019539. 1 MS---FALDVSKFVEKAK-----KNPEKVIRQ----VSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGT 68 (131) Q Consensus 1 ms---f~~~i~~~~~~~~-----~~~~~~~r~----~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~ 68 (131) |+ |. =++++.++++ ++++.++++ ++.++....+...|||||.||+|..++.+ T Consensus 1 Ma~i~i~-Gld~L~~~l~~~~~~~~v~~~v~~~~~~~~~~~~~~a~~~apvdTG~Lr~sI~~~~~--------------- 64 (112) T protein:vir:96 1 MATIEFE-GLDEMAQSLLKNASSERRSKVLRKYGAKLKEAAVSKAQFKKGYSTGATRRSITLEAG--------------- 64 (112) T ss_pred Cceeeeh-HHHHHHHHHHhhcCHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCchhhhhceeeecC--------------- Confidence 33 32 2334433332 233344444 44556666677889999999999876432 Q ss_pred hhHHHHHHHHhhccCCceEEEeeCchhhhhhhcCCCCCCCchhHHHHHHHHHHHHHHHHHhhC Q lcl|NC_019539. 69 TATSNAANFVLNAADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQLLNEEASKVK 131 (131) Q Consensus 69 ~~~~~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~~~v~~~~~e~k 131 (131) |.++.|..+.+||.+||||+|.++|+.|++.+++.-...|.+.++++. T Consensus 65 ---------------~~~~~v~~~~~Ya~~vE~GTr~m~AqPF~~PA~~~~~~~~~~~l~~L~ 112 (112) T protein:vir:96 65 ---------------SDRAVVEALTNYSGYLEVGTRKMEAQPFMRPALDQVVPEMVEEMAKWE 112 (112) T ss_pred ---------------ceEEEecCCCCccceeccCccccCCCCchhhhHHHHHHHHHHHHHhcC Confidence 346778889999999999999999999999999999999999999999 No 35 >protein:vir:105330 Length: 137 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950673;genbank:gi:119967843;genbank:GeneID:4643209 Probab=99.28 E-value=1.6e-14 Score=96.23 Aligned_cols=100 Identities=21% Similarity=0.240 Sum_probs=79.5 Q ss_pred Cc--------cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHH Q lcl|NC_019539. 1 MS--------FALDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATS 72 (131) Q Consensus 1 ms--------f~~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~ 72 (131) |+ |.+.|.+..+.+.+.++..+++.+.++..+++..+|||||.||+||++.+.. T Consensus 1 Ma~~~~G~~~l~~~l~~~~~~~~~~~~~al~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~------------------ 62 (137) T protein:vir:10 1 MAKVKYGNWDLVKELEEFEKETIRWAKKGIAKTTTIIHNSIVSNMPVDTGYLRESVSMDFKK------------------ 62 (137) T ss_pred CccchhCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCeeeEecC------------------ Confidence 43 4455555555666677788888999999999999999999999999775421 Q ss_pred HHHHHHhhccCCceEEEeeCchhhhhhhcCC-----------------------------CCCCCchhHHHHHHHHHHHH Q lcl|NC_019539. 73 NAANFVLNAADWHTFTLTNNLPYAQRLEYGW-----------------------------SQQAPQGFVRVNVSRFQQLL 123 (131) Q Consensus 73 ~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~-----------------------------S~QAp~G~V~~a~~~~~~~v 123 (131) .|-+..|.++++||.++|||+ +.|.|..|++.++.+-...| T Consensus 63 ----------~~~~~~V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~Pfl~pA~~~~~~~i 132 (137) T protein:vir:10 63 ----------GGLTGVINIGSEYAVYVNYGTGIYAVGPGGSRAKNIPWRYKDADGHWHTTKGQHAQPFWEPAIDEGRAFF 132 (137) T ss_pred ----------CcEEEEEecCCccccccccCccccccCCCcccccccceeeeccccccccCCCCCCCcchhHHHHHHHHHH Confidence 123567899999999999996 35888899999999888888 Q ss_pred HHHHH Q lcl|NC_019539. 124 NEEAS 128 (131) Q Consensus 124 ~~~~~ 128 (131) .+.+. T Consensus 133 ~k~i~ 137 (137) T protein:vir:10 133 NKYFS 137 (137) T ss_pred HHhhC Confidence 88888 No 36 >protein:vir:4906 Length: 114 # NCBI annotation: gp114 # Family: family:all:180 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056684;genbank:gi:9635019;genbank:GeneID:1262668 Probab=99.27 E-value=2.4e-14 Score=95.21 Aligned_cols=100 Identities=20% Similarity=0.231 Sum_probs=74.5 Q ss_pred CccchhHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHH----hCCccchhhccccccccCcccccccCCCCCCcchhH Q lcl|NC_019539. 1 MSFALDVSKFVEKAK-----KNPEKVIRQVSIKLFSAIIK----ASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTAT 71 (131) Q Consensus 1 msf~~~i~~~~~~~~-----~~~~~~~r~~a~~~~~~vv~----~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~ 71 (131) |.|.. ++++.++++ ..+.+++++.+.++...++. ..|||||.||.||.++... T Consensus 4 i~~~G-ld~l~~~L~~~~~~~~v~~~~~~~~~~~~~~~~~~a~~~~p~~TG~Lr~sI~~~~~~----------------- 65 (114) T protein:vir:49 4 IEFEG-LDEMAQSLLKNASPEKRSKVLRKYGSKLKEAAVNRAQFNKGYSTGATRRSITLQVES----------------- 65 (114) T ss_pred eeeeh-HHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcccCCCCCchhhhhceeeeecC----------------- Confidence 33431 445555443 33556677666666666665 4699999999999876421 Q ss_pred HHHHHHHhhccCCceEEEeeCchhhhhhhcCCCCCCCchhHHHHHHHHHHHHHHHHHhhC Q lcl|NC_019539. 72 SNAANFVLNAADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQLLNEEASKVK 131 (131) Q Consensus 72 ~~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~~~v~~~~~e~k 131 (131) +-..|..+.+|+.+||||+|.++|+.|++.++......+.+.++++- T Consensus 66 -------------~~~~V~~~~~Ya~~vEfGT~km~a~Pfl~PA~~~~~~~~~~~l~~l~ 112 (114) T protein:vir:49 66 -------------DKATVEALTSYSGYLEVGTRKMEAQPFMKPALDEVAPKMVEELAKWD 112 (114) T ss_pred -------------CeeEecCCCCccceecccccccCCCCchhhhHHHHHHHHHHHHHHHh Confidence 12457788999999999999999999999999999888888888877 No 37 >protein:vir:2740 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695113;genbank:gi:23455882;genbank:GeneID:955595 Probab=99.27 E-value=2.4e-14 Score=95.21 Aligned_cols=100 Identities=20% Similarity=0.231 Sum_probs=74.5 Q ss_pred CccchhHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHH----hCCccchhhccccccccCcccccccCCCCCCcchhH Q lcl|NC_019539. 1 MSFALDVSKFVEKAK-----KNPEKVIRQVSIKLFSAIIK----ASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTAT 71 (131) Q Consensus 1 msf~~~i~~~~~~~~-----~~~~~~~r~~a~~~~~~vv~----~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~ 71 (131) |.|.. ++++.++++ ..+.+++++.+.++...++. ..|||||.||.||.++... T Consensus 4 i~~~G-ld~l~~~L~~~~~~~~v~~~~~~~~~~~~~~~~~~a~~~~p~~TG~Lr~sI~~~~~~----------------- 65 (114) T protein:vir:27 4 IEFEG-LDEMAQSLLKNASPEKRSKVLRKYGSKLKEAAVNRAQFNKGYSTGATRRSITLQVES----------------- 65 (114) T ss_pred eeeeh-HHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcccCCCCCchhhhhceeeeecC----------------- Confidence 33431 445555443 33556677666666666665 4699999999999876421 Q ss_pred HHHHHHHhhccCCceEEEeeCchhhhhhhcCCCCCCCchhHHHHHHHHHHHHHHHHHhhC Q lcl|NC_019539. 72 SNAANFVLNAADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQLLNEEASKVK 131 (131) Q Consensus 72 ~~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~~~v~~~~~e~k 131 (131) +-..|..+.+|+.+||||+|.++|+.|++.++......+.+.++++- T Consensus 66 -------------~~~~V~~~~~Ya~~vEfGT~km~a~Pfl~PA~~~~~~~~~~~l~~l~ 112 (114) T protein:vir:27 66 -------------DKATVEALTSYSGYLEVGTRKMEAQPFMKPALDEVAPKMVEELAKWD 112 (114) T ss_pred -------------CeeEecCCCCccceecccccccCCCCchhhhHHHHHHHHHHHHHHHh Confidence 12457788999999999999999999999999999888888888877 No 38 >protein:vir:97144 Length: 115 # NCBI annotation: ORF047 # Family: family:all:180 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239729;genbank:gi:66394911;genbank:GeneID:5130877 Probab=99.25 E-value=4.3e-14 Score=93.88 Aligned_cols=102 Identities=13% Similarity=0.118 Sum_probs=78.2 Q ss_pred Cccc------hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CccchhhccccccccCcccccccCCCCCCcc Q lcl|NC_019539. 1 MSFA------LDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKAS------PVDTGRFRMNWMASGGTPADGTTDATDKAGT 68 (131) Q Consensus 1 msf~------~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~t------PVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~ 68 (131) |+|. ..|.+.-+.+.+.+...+++.+.++..+.+..+ |||||.||+|+.++.. T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~--------------- 65 (115) T protein:vir:97 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKT--------------- 65 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeec--------------- Confidence 6654 233333345555566777888888888888876 9999999999977521 Q ss_pred hhHHHHHHHHhhccCCceEEEeeCchhhhhhhcCCCCCCCchhHHHHHHHHHHHHHHHHHhhC Q lcl|NC_019539. 69 TATSNAANFVLNAADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQLLNEEASKVK 131 (131) Q Consensus 69 ~~~~~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~~~v~~~~~e~k 131 (131) .|..+.|..+.+|+.+||||+|.++|+.|++.+++.....+.+.++++= T Consensus 66 --------------g~~~~~v~~~~~Ya~~vE~GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~ 114 (115) T protein:vir:97 66 --------------GDLQYTITSHAAYSGFLEFGTRYMEAEPFMWPVYEVIRKSTVEELKALF 114 (115) T ss_pred --------------CceEEEeecCccchhhhcccccccCCCCchhhhHHHHHHHHHHHHHHHh Confidence 1234567888999999999999999999999999998888777776655 No 39 >protein:vir:9312 Length: 115 # NCBI annotation: phi Mu50B-like protein # Family: family:all:180 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803290;genbank:gi:29028600;genbank:GeneID:1258048 Probab=99.25 E-value=4.3e-14 Score=93.88 Aligned_cols=102 Identities=13% Similarity=0.118 Sum_probs=78.2 Q ss_pred Cccc------hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CccchhhccccccccCcccccccCCCCCCcc Q lcl|NC_019539. 1 MSFA------LDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKAS------PVDTGRFRMNWMASGGTPADGTTDATDKAGT 68 (131) Q Consensus 1 msf~------~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~t------PVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~ 68 (131) |+|. ..|.+.-+.+.+.+...+++.+.++..+.+..+ |||||.||+|+.++.. T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~--------------- 65 (115) T protein:vir:93 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKT--------------- 65 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeec--------------- Confidence 6654 233333345555566777888888888888876 9999999999977521 Q ss_pred hhHHHHHHHHhhccCCceEEEeeCchhhhhhhcCCCCCCCchhHHHHHHHHHHHHHHHHHhhC Q lcl|NC_019539. 69 TATSNAANFVLNAADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQLLNEEASKVK 131 (131) Q Consensus 69 ~~~~~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~~~v~~~~~e~k 131 (131) .|..+.|..+.+|+.+||||+|.++|+.|++.+++.....+.+.++++= T Consensus 66 --------------g~~~~~v~~~~~Ya~~vE~GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~ 114 (115) T protein:vir:93 66 --------------GDLQYTITSHAAYSGFLEFGTRYMEAEPFMWPVYEVIRKSTVEELKALF 114 (115) T ss_pred --------------CceEEEeecCccchhhhcccccccCCCCchhhhHHHHHHHHHHHHHHHh Confidence 1234567888999999999999999999999999998888777776655 No 40 >protein:vir:78858 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285365;genbank:gi:148717893;genbank:GeneID:5246989 Probab=99.25 E-value=4.3e-14 Score=93.88 Aligned_cols=102 Identities=13% Similarity=0.118 Sum_probs=78.2 Q ss_pred Cccc------hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CccchhhccccccccCcccccccCCCCCCcc Q lcl|NC_019539. 1 MSFA------LDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKAS------PVDTGRFRMNWMASGGTPADGTTDATDKAGT 68 (131) Q Consensus 1 msf~------~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~t------PVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~ 68 (131) |+|. ..|.+.-+.+.+.+...+++.+.++..+.+..+ |||||.||+|+.++.. T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~--------------- 65 (115) T protein:vir:78 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKT--------------- 65 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeec--------------- Confidence 6654 233333345555566777888888888888876 9999999999977521 Q ss_pred hhHHHHHHHHhhccCCceEEEeeCchhhhhhhcCCCCCCCchhHHHHHHHHHHHHHHHHHhhC Q lcl|NC_019539. 69 TATSNAANFVLNAADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQLLNEEASKVK 131 (131) Q Consensus 69 ~~~~~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~~~v~~~~~e~k 131 (131) .|..+.|..+.+|+.+||||+|.++|+.|++.+++.....+.+.++++= T Consensus 66 --------------g~~~~~v~~~~~Ya~~vE~GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~ 114 (115) T protein:vir:78 66 --------------GDLQYTITSHAAYSGFLEFGTRYMEAEPFMWPVYEVIRKSTVEELKALF 114 (115) T ss_pred --------------CceEEEeecCccchhhhcccccccCCCCchhhhHHHHHHHHHHHHHHHh Confidence 1234567888999999999999999999999999998888777776655 No 41 >protein:vir:103917 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873996;genbank:gi:118430771;genbank:GeneID:4525409 Probab=99.25 E-value=4.3e-14 Score=93.88 Aligned_cols=102 Identities=13% Similarity=0.118 Sum_probs=78.2 Q ss_pred Cccc------hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CccchhhccccccccCcccccccCCCCCCcc Q lcl|NC_019539. 1 MSFA------LDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKAS------PVDTGRFRMNWMASGGTPADGTTDATDKAGT 68 (131) Q Consensus 1 msf~------~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~t------PVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~ 68 (131) |+|. ..|.+.-+.+.+.+...+++.+.++..+.+..+ |||||.||+|+.++.. T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~--------------- 65 (115) T protein:vir:10 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKT--------------- 65 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeec--------------- Confidence 6654 233333345555566777888888888888876 9999999999977521 Q ss_pred hhHHHHHHHHhhccCCceEEEeeCchhhhhhhcCCCCCCCchhHHHHHHHHHHHHHHHHHhhC Q lcl|NC_019539. 69 TATSNAANFVLNAADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQLLNEEASKVK 131 (131) Q Consensus 69 ~~~~~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~~~v~~~~~e~k 131 (131) .|..+.|..+.+|+.+||||+|.++|+.|++.+++.....+.+.++++= T Consensus 66 --------------g~~~~~v~~~~~Ya~~vE~GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~ 114 (115) T protein:vir:10 66 --------------GDLQYTITSHAAYSGFLEFGTRYMEAEPFMWPVYEVIRKSTVEELKALF 114 (115) T ss_pred --------------CceEEEeecCccchhhhcccccccCCCCchhhhHHHHHHHHHHHHHHHh Confidence 1234567888999999999999999999999999998888777776655 No 42 >protein:vir:96225 Length: 115 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239574;genbank:gi:66395330;genbank:GeneID:5132773 Probab=99.25 E-value=4.3e-14 Score=93.88 Aligned_cols=102 Identities=13% Similarity=0.118 Sum_probs=78.2 Q ss_pred Cccc------hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CccchhhccccccccCcccccccCCCCCCcc Q lcl|NC_019539. 1 MSFA------LDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKAS------PVDTGRFRMNWMASGGTPADGTTDATDKAGT 68 (131) Q Consensus 1 msf~------~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~t------PVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~ 68 (131) |+|. ..|.+.-+.+.+.+...+++.+.++..+.+..+ |||||.||+|+.++.. T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~--------------- 65 (115) T protein:vir:96 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKT--------------- 65 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeec--------------- Confidence 6654 233333345555566777888888888888876 9999999999977521 Q ss_pred hhHHHHHHHHhhccCCceEEEeeCchhhhhhhcCCCCCCCchhHHHHHHHHHHHHHHHHHhhC Q lcl|NC_019539. 69 TATSNAANFVLNAADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQLLNEEASKVK 131 (131) Q Consensus 69 ~~~~~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~~~v~~~~~e~k 131 (131) .|..+.|..+.+|+.+||||+|.++|+.|++.+++.....+.+.++++= T Consensus 66 --------------g~~~~~v~~~~~Ya~~vE~GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~ 114 (115) T protein:vir:96 66 --------------GDLQYTITSHAAYSGFLEFGTRYMEAEPFMWPVYEVIRKSTVEELKALF 114 (115) T ss_pred --------------CceEEEeecCccchhhhcccccccCCCCchhhhHHHHHHHHHHHHHHHh Confidence 1234567888999999999999999999999999998888777776655 No 43 >protein:vir:96358 Length: 115 # NCBI annotation: ORF045 # Family: family:all:180 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239651;genbank:gi:66395408;genbank:GeneID:5132834 Probab=99.25 E-value=4.3e-14 Score=93.88 Aligned_cols=102 Identities=13% Similarity=0.118 Sum_probs=78.2 Q ss_pred Cccc------hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CccchhhccccccccCcccccccCCCCCCcc Q lcl|NC_019539. 1 MSFA------LDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKAS------PVDTGRFRMNWMASGGTPADGTTDATDKAGT 68 (131) Q Consensus 1 msf~------~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~t------PVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~ 68 (131) |+|. ..|.+.-+.+.+.+...+++.+.++..+.+..+ |||||.||+|+.++.. T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~--------------- 65 (115) T protein:vir:96 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKT--------------- 65 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeec--------------- Confidence 6654 233333345555566777888888888888876 9999999999977521 Q ss_pred hhHHHHHHHHhhccCCceEEEeeCchhhhhhhcCCCCCCCchhHHHHHHHHHHHHHHHHHhhC Q lcl|NC_019539. 69 TATSNAANFVLNAADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQLLNEEASKVK 131 (131) Q Consensus 69 ~~~~~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~~~v~~~~~e~k 131 (131) .|..+.|..+.+|+.+||||+|.++|+.|++.+++.....+.+.++++= T Consensus 66 --------------g~~~~~v~~~~~Ya~~vE~GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~ 114 (115) T protein:vir:96 66 --------------GDLQYTITSHAAYSGFLEFGTRYMEAEPFMWPVYEVIRKSTVEELKALF 114 (115) T ss_pred --------------CceEEEeecCccchhhhcccccccCCCCchhhhHHHHHHHHHHHHHHHh Confidence 1234567888999999999999999999999999998888777776655 No 44 >protein:vir:78077 Length: 141 # NCBI annotation: gp9 # Family: family:all:180 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468793;genbank:gi:157325374;genbank:GeneID:5601839 Probab=99.24 E-value=5.2e-14 Score=93.41 Aligned_cols=103 Identities=13% Similarity=0.102 Sum_probs=77.7 Q ss_pred CccchhHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHHHHH Q lcl|NC_019539. 1 MSFALDVSKFVEKAKKNPEKVIRQVSIKL-----FSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNAA 75 (131) Q Consensus 1 msf~~~i~~~~~~~~~~~~~~~r~~a~~~-----~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~~ 75 (131) +-|....++..+++++.....++++++.. -...+..+|||||+||+||...+. T Consensus 4 ~~f~~~~~~~~~~~~k~~~~~~~~~a~~~~~~~ie~~ak~~~pvdtG~L~~SI~~~v~---------------------- 61 (141) T protein:vir:78 4 FEFDSNIPKARKLIEKKVLQALEDIGEHMTTELAEGGHGVTSNNDTGEYAQKSGYKVR---------------------- 61 (141) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccchhhcceeeeee---------------------- Confidence 55888888888888888888777776664 344567899999999999965431 Q ss_pred HHHhhccCCceEEEeeCchhhhhhhcCC--------------------------CCCCCchhHHHHHHHH----HHHHHH Q lcl|NC_019539. 76 NFVLNAADWHTFTLTNNLPYAQRLEYGW--------------------------SQQAPQGFVRVNVSRF----QQLLNE 125 (131) Q Consensus 76 ~~i~~~~~g~~i~i~Nn~pYa~~LEyG~--------------------------S~QAp~G~V~~a~~~~----~~~v~~ 125 (131) ..|.++.|.|+.+||.|+|||+ +.|.|+.|++.++.+- .+++++ T Consensus 62 ------~~g~~~~V~~~~~YA~yVE~GTG~~~~~~~grk~~w~y~~~~g~~~~t~G~~aqpFl~~A~~~~~~~i~~~i~~ 135 (141) T protein:vir:78 62 ------KSSKEVIVGNSSDYAIYYEFGTGEKSERGGGKAGGWFYMDKKGHWHFTRGSQASKRMRYTFRDEQDKVRVFTER 135 (141) T ss_pred ------cCCcEEEEecCCCccceeecCCcccccCCCCCcCcceeecCCCeeEeccCCCCchhhhhhHHhhHHHHHHHHHH Confidence 1233567999999999999996 4689999999998665 445555 Q ss_pred HHHhhC Q lcl|NC_019539. 126 EASKVK 131 (131) Q Consensus 126 ~~~e~k 131 (131) .++.+- T Consensus 136 ~~~~l~ 141 (141) T protein:vir:78 136 ALRGIN 141 (141) T ss_pred HhhccC Confidence 555555 No 45 >protein:vir:106623 Length: 115 # NCBI annotation: ORF049 # Family: family:all:180 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239497;genbank:gi:66395260;genbank:GeneID:4555777 Probab=99.23 E-value=6e-14 Score=93.05 Aligned_cols=102 Identities=10% Similarity=0.119 Sum_probs=80.6 Q ss_pred Cccc------hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CccchhhccccccccCcccccccCCCCCCcc Q lcl|NC_019539. 1 MSFA------LDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKAS------PVDTGRFRMNWMASGGTPADGTTDATDKAGT 68 (131) Q Consensus 1 msf~------~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~t------PVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~ 68 (131) |.|. ..|.+.-+.+.+.+...+++.+..+..+++..+ |||||.||+|+.++.+ | T Consensus 1 i~i~Gld~L~~~l~~~~~~~~~~~~~al~~~~~~i~~~a~~~a~~~~~~pv~TG~Lr~sI~~~~~-------------g- 66 (115) T protein:vir:10 1 MQSKGLKKLMNHLKVMHDDIEDDVDDILKNNAKEGVGIAVSNAKEVMNKGYWTGNLASLIEVKKI-------------G- 66 (115) T ss_pred CeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcchhhhhceeeeec-------------C- Confidence 5543 334444445556677888888888888888765 9999999999976521 1 Q ss_pred hhHHHHHHHHhhccCCceEEEeeCchhhhhhhcCCCCCCCchhHHHHHHHHHHHHHHHHHhhC Q lcl|NC_019539. 69 TATSNAANFVLNAADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQLLNEEASKVK 131 (131) Q Consensus 69 ~~~~~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~~~v~~~~~e~k 131 (131) +-+..+..+..|+.+||||+|.++|+.|++.++++....|.+.++++= T Consensus 67 ---------------~~~~~v~~~~~Ya~~vEfGT~km~a~PFl~PA~~~~k~~~~~~i~~~i 114 (115) T protein:vir:10 67 ---------------DLHYRVISTAHYSGFLEFGTRYMEPAPFMFPTYQTLKKSTINDLKRLL 114 (115) T ss_pred ---------------cEEEEeeCCCccchheecccccCCCCCchhhhHHHHHHHHHHHHHHHh Confidence 234667888999999999999999999999999999888888888777 No 46 >protein:vir:99744 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004311;genbank:gi:122891765;genbank:GeneID:4712299 Probab=99.23 E-value=5.7e-14 Score=93.18 Aligned_cols=102 Identities=13% Similarity=0.108 Sum_probs=79.8 Q ss_pred Cccc------hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CccchhhccccccccCcccccccCCCCCCcc Q lcl|NC_019539. 1 MSFA------LDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKAS------PVDTGRFRMNWMASGGTPADGTTDATDKAGT 68 (131) Q Consensus 1 msf~------~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~t------PVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~ 68 (131) |+|. ..|.+..+.+.+.+...+++.+.++..+++..+ |||||.||+|+.++.+ | T Consensus 1 i~i~Gld~L~~~l~~~~~~~~~~v~~av~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~SI~~~~~-------------g- 66 (115) T protein:vir:99 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKT-------------V- 66 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcchhhhhceeeeec-------------C- Confidence 6554 344444455666677888888889998888776 9999999999976531 1 Q ss_pred hhHHHHHHHHhhccCCceEEEeeCchhhhhhhcCCCCCCCchhHHHHHHHHHHHHHHHHHhhC Q lcl|NC_019539. 69 TATSNAANFVLNAADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQLLNEEASKVK 131 (131) Q Consensus 69 ~~~~~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~~~v~~~~~e~k 131 (131) |-+..+..+..|+.+||||+|.++|+.|++.++......+.+.++++= T Consensus 67 ---------------~~~~~V~~~~~Ya~~vE~GT~~m~a~PFl~PA~~~~k~~~~~~l~~~~ 114 (115) T protein:vir:99 67 ---------------DLQYTITSHAAYSGFLEFGTRYMEAEPFMWPVYEVIRKSTVEELKTLF 114 (115) T ss_pred ---------------cEEEEecCCccccccccccccccCCCCcchhhHHHHHHHHHHHHHHHh Confidence 224667888999999999999999999999999988887777776655 No 47 >protein:vir:1243 Length: 116 # NCBI annotation: similar to phage Spp1 gp16.1 # Family: family:all:180 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510942;genbank:gi:17426276;genbank:GeneID:927389 Probab=99.21 E-value=4.3e-14 Score=93.85 Aligned_cols=87 Identities=23% Similarity=0.222 Sum_probs=75.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHHHHHHHHhhccCCceEEEeeCc Q lcl|NC_019539. 14 AKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNAANFVLNAADWHTFTLTNNL 93 (131) Q Consensus 14 ~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~~~~i~~~~~g~~i~i~Nn~ 93 (131) +++.+.+.+.+.+.++...++..+|||||+||+||.+.+. ..|-+..|.+++ T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~aPv~TG~Lr~SI~~~~~----------------------------~~~~~~~V~~~~ 52 (116) T protein:vir:12 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFK----------------------------DGGFTGVINIGS 52 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcCcccccccceEEee----------------------------cCcEEEEEecCC Confidence 7777788899999999999999999999999999976531 112357788999 Q ss_pred hhhhhhhcC-----------------------------CCCCCCchhHHHHHHHHHHHHHHHHH Q lcl|NC_019539. 94 PYAQRLEYG-----------------------------WSQQAPQGFVRVNVSRFQQLLNEEAS 128 (131) Q Consensus 94 pYa~~LEyG-----------------------------~S~QAp~G~V~~a~~~~~~~v~~~~~ 128 (131) +||.++||| |+.|.|+.|++.++.+-...|.+.+. T Consensus 53 ~YA~yvE~GTg~~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 116 (116) T protein:vir:12 53 EYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred CcccccccCCcccccCCCcccccccceeeecCCceeeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 999999999 77899999999999999999988888 No 48 >protein:vir:97327 Length: 116 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240615;genbank:gi:66396305;genbank:GeneID:5133683 Probab=99.21 E-value=4.3e-14 Score=93.85 Aligned_cols=87 Identities=23% Similarity=0.222 Sum_probs=75.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHHHHHHHHhhccCCceEEEeeCc Q lcl|NC_019539. 14 AKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNAANFVLNAADWHTFTLTNNL 93 (131) Q Consensus 14 ~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~~~~i~~~~~g~~i~i~Nn~ 93 (131) +++.+.+.+.+.+.++...++..+|||||+||+||.+.+. ..|-+..|.+++ T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~aPv~TG~Lr~SI~~~~~----------------------------~~~~~~~V~~~~ 52 (116) T protein:vir:97 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFK----------------------------DGGFTGVINIGS 52 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcCcccccccceEEee----------------------------cCcEEEEEecCC Confidence 7777788899999999999999999999999999976531 112357788999 Q ss_pred hhhhhhhcC-----------------------------CCCCCCchhHHHHHHHHHHHHHHHHH Q lcl|NC_019539. 94 PYAQRLEYG-----------------------------WSQQAPQGFVRVNVSRFQQLLNEEAS 128 (131) Q Consensus 94 pYa~~LEyG-----------------------------~S~QAp~G~V~~a~~~~~~~v~~~~~ 128 (131) +||.++||| |+.|.|+.|++.++.+-...|.+.+. T Consensus 53 ~YA~yvE~GTg~~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 116 (116) T protein:vir:97 53 EYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred CcccccccCCcccccCCCcccccccceeeecCCceeeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 999999999 77899999999999999999988888 No 49 >protein:vir:95062 Length: 116 # NCBI annotation: ORF044 # Family: family:all:180 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240827;genbank:gi:66394711;genbank:GeneID:5133856 Probab=99.15 E-value=1.3e-13 Score=91.28 Aligned_cols=87 Identities=23% Similarity=0.224 Sum_probs=75.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHHHHHHHHhhccCCceEEEeeCc Q lcl|NC_019539. 14 AKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNAANFVLNAADWHTFTLTNNL 93 (131) Q Consensus 14 ~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~~~~i~~~~~g~~i~i~Nn~ 93 (131) +++.+.+.+.+.+.++...++..+|||||.||+||...+.. .|-+..|.++. T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~apv~TG~Lr~SI~~~~~~----------------------------~~~~~~V~~~~ 52 (116) T protein:vir:95 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKD----------------------------GGFTGVINIGS 52 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHhhCCccccccccceeEEeec----------------------------CcEEEEEecCC Confidence 77777888999999999999999999999999999764311 12246688999 Q ss_pred hhhhhhhcC-----------------------------CCCCCCchhHHHHHHHHHHHHHHHHH Q lcl|NC_019539. 94 PYAQRLEYG-----------------------------WSQQAPQGFVRVNVSRFQQLLNEEAS 128 (131) Q Consensus 94 pYa~~LEyG-----------------------------~S~QAp~G~V~~a~~~~~~~v~~~~~ 128 (131) +||.++||| |+.|.|+.|++.++.+-...|.+.+. T Consensus 53 ~Ya~yvE~GTg~~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~is 116 (116) T protein:vir:95 53 EYAIYVNYGTGIYATGAGGSRAKNIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred CccceeecCccccccCCCccccccccceeecCccceeeCCCCCCCcchHHHHHHHHHHHHHhhC Confidence 999999999 67899999999999999999999888 No 50 >protein:vir:81147 Length: 126 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285816;genbank:gi:148747737;genbank:GeneID:5247190 Probab=99.09 E-value=6.5e-13 Score=87.41 Aligned_cols=105 Identities=17% Similarity=0.284 Sum_probs=80.5 Q ss_pred Cc----------cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchh Q lcl|NC_019539. 1 MS----------FALDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTA 70 (131) Q Consensus 1 ms----------f~~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~ 70 (131) |+ ....|+++.+.+.+.+...+++++.++..+++..+|++||.||.||.++... + .|. T Consensus 1 Ma~i~id~la~~I~~~L~~y~~~v~~~v~~~v~~~a~~~~~~ik~~aP~rTG~y~ksw~vk~~~---------~-~g~-- 68 (126) T protein:vir:81 1 MANITIDRLADELLQAVKEYTDDVAEGVRKKVDETARKVLKEAQALAPKRTGEYARTFTITKED---------G-YGT-- 68 (126) T ss_pred CcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcccchhhccccccccc---------c-CCc-- Confidence 32 3344788888888889999999999999999999999999999999887311 1 111 Q ss_pred HHHHHHHHhhccCCceEEEeeCchh--hhhhhcCCCC-----CCCchhHHHHHHHHHHHHHHHHHhhC Q lcl|NC_019539. 71 TSNAANFVLNAADWHTFTLTNNLPY--AQRLEYGWSQ-----QAPQGFVRVNVSRFQQLLNEEASKVK 131 (131) Q Consensus 71 ~~~~~~~i~~~~~g~~i~i~Nn~pY--a~~LEyG~S~-----QAp~G~V~~a~~~~~~~v~~~~~e~k 131 (131) +.++ +.|+..| +.-|||||.+ .++..|++.+.+...+.+.+.++++= T Consensus 69 -------------~~~v-v~~~~~~~l~HLLEfGha~r~gGrV~a~Phi~Pa~e~~~~~~~~~i~~~l 122 (126) T protein:vir:81 69 -------------TKRI-IWNKKHYRRVHLLEFGHAKVNGGRVKEYPHLRPAYDKHGARLPDELKRVI 122 (126) T ss_pred -------------ceEE-EeccCCCCceeeeecceecCCCCccCCCcchHHHHHHHHHHHHHHHHHHh Confidence 1122 3344445 6679999987 48999999999999888888887766 No 51 >protein:vir:101594 Length: 173 # NCBI annotation: hypothetical protein # Family: family:all:26502 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112510;genbank:gi:53793610;interpro:IPR010064;uniprot:Q5ZGE3;genbank:GeneID:3101702 Probab=98.98 E-value=3.6e-12 Score=83.30 Aligned_cols=104 Identities=17% Similarity=0.195 Sum_probs=74.5 Q ss_pred Ccc------chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHHHH Q lcl|NC_019539. 1 MSF------ALDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNA 74 (131) Q Consensus 1 msf------~~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~ 74 (131) |.| ...|.++.+.+...+...+++.+..+...++...|||||.||+|+.++... ..| T Consensus 1 i~i~Gld~L~~~L~~l~~~~~~~~~~a~~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~~~----------~~~------- 63 (173) T protein:vir:10 1 MAVKGVAEVIAELRKIGKDIDKNINATTEEAANFIEDRAKTLAPKNFGKLAQSISTSDLK----------AKD------- 63 (173) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCchhhhhcceeeeec----------cCc------- Confidence 443 345566666666677778888999999999999999999999999876311 111 Q ss_pred HHHHhhccCCceEEEeeCchhhhhhhcCCC-------------------------------------------------- Q lcl|NC_019539. 75 ANFVLNAADWHTFTLTNNLPYAQRLEYGWS-------------------------------------------------- 104 (131) Q Consensus 75 ~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S-------------------------------------------------- 104 (131) +-++-+..+..|+.++|||+| T Consensus 64 ---------~~~~~v~~~~~Ya~fvEfGT~~m~a~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~ 134 (173) T protein:vir:10 64 ---------LISKKITVNELYGAYMEFGTGAKVSVPKEFADMAASFKGQKTGSFKDGLESIKAWCRAKGIDEKAAYPIFA 134 (173) T ss_pred ---------eeEEeeCCCcccchhhhcccccccCCCchhhhhhcccccccccccccccccccccccccccchhcccceee Confidence 113446678889999999875 Q ss_pred -----CCCCchhHHHHHHH--------HHHHHHHHHHhh Q lcl|NC_019539. 105 -----QQAPQGFVRVNVSR--------FQQLLNEEASKV 130 (131) Q Consensus 105 -----~QAp~G~V~~a~~~--------~~~~v~~~~~e~ 130 (131) .|+|+.|.+.++.+ +.+.|+++++++ T Consensus 135 ~~~~~G~~aqPFl~PA~~~~~~~~~~~i~~~i~~~lrk~ 173 (173) T protein:vir:10 135 KILGAGINPQPFLYPAWIEGKKQYLKDLENLLKTYNKKI 173 (173) T ss_pred EeecCCCCCCccchhHHHHhHHHHHHHHHHHHHHHhhcC Confidence 48888899877754 444555556666 No 52 >protein:vir:106041 Length: 137 # NCBI annotation: gp23 # Family: family:all:1084 # MgeID: mge:1505 # MgeName: Cooper # Cross-refs: genbank:acc:YP_654920;genbank:gi:109392376;genbank:GeneID:4157069 Probab=98.96 E-value=2.9e-12 Score=83.83 Aligned_cols=102 Identities=15% Similarity=0.146 Sum_probs=72.4 Q ss_pred Cc--cc--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHHHHHH Q lcl|NC_019539. 1 MS--FA--LDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNAAN 76 (131) Q Consensus 1 ms--f~--~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~~~ 76 (131) || +. .+..+..+.+...+...+++++.++..+.+..+|||||+||+||...... . T Consensus 1 m~~s~~i~i~~~~l~~~v~~~~k~~l~~~a~~i~~~ak~~aPv~tG~Lr~SI~~~~~~-----------~---------- 59 (137) T protein:vir:10 1 MPVTARIHINEPELERQTGAIFRGKHRSITRRIATQARADVPVRTGNLGRGIQEMPQT-----------Y---------- 59 (137) T ss_pred CCeeEEEeeCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhhcCceeeeec-----------c---------- Confidence 54 33 34456667788888888999999999999999999999999999765311 0 Q ss_pred HHhhccCCceEEEeeCchhhhhhhcCCC-----------------------------CCCCchhHHHHHHHHHHHHHHHH Q lcl|NC_019539. 77 FVLNAADWHTFTLTNNLPYAQRLEYGWS-----------------------------QQAPQGFVRVNVSRFQQLLNEEA 127 (131) Q Consensus 77 ~i~~~~~g~~i~i~Nn~pYa~~LEyG~S-----------------------------~QAp~G~V~~a~~~~~~~v~~~~ 127 (131) ...+-++.+.++++||.++|||+. .|.|..|++.++.+. -... T Consensus 60 ----~~~~~~~~v~~~~~YA~~ve~GT~ph~I~pk~~k~l~f~~~G~~v~~k~v~hpG~~a~Pfl~~A~~~~----~~~~ 131 (137) T protein:vir:10 60 ----RPFHVGGGVEDNVDYAAPVHEGSRPHRITARHANALHFFWHGREVFRKSVWHPGVRPRPFLRNAARRV----VAAD 131 (137) T ss_pred ----ccceEEEEEecCCCceeeeeecCCCceeecccCceeeeeeCCceEEeeeeecCCCCCCchHHHHHHHH----hhcc Confidence 011236789999999999999962 556777777666552 1122 Q ss_pred HhhC Q lcl|NC_019539. 128 SKVK 131 (131) Q Consensus 128 ~e~k 131 (131) .++| T Consensus 132 ~ri~ 135 (137) T protein:vir:10 132 PDIH 135 (137) T ss_pred cccc Confidence 3334 No 53 >protein:vir:99101 Length: 142 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1608 # MgeName: Qyrzula # Cross-refs: genbank:acc:YP_655705;genbank:gi:109521783;genbank:GeneID:4157823 Probab=98.96 E-value=4.4e-12 Score=82.87 Aligned_cols=104 Identities=10% Similarity=0.025 Sum_probs=75.7 Q ss_pred Ccc-----chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHHHHH Q lcl|NC_019539. 1 MSF-----ALDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNAA 75 (131) Q Consensus 1 msf-----~~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~~ 75 (131) ++| ..++....++++..+++.+++++.++...++..+|||||.||+||...... T Consensus 4 ~~~~~~gl~~~l~~~~~~~~~~~~~~i~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~--------------------- 62 (142) T protein:vir:99 4 VSVRYEGFDYNPVGAAAQVGPILRRTHSSLTRQIANETRARVPVLTGHLGRSVREDPQV--------------------- 62 (142) T ss_pred eEEEeeecchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeeeecc--------------------- Confidence 333 355777777888888999999999999999999999999999999754321 Q ss_pred HHHhhccCCceEEEeeCchhhhhhhcCCC-----------------------------CCCCchhHHHHHHHHHHH-HHH Q lcl|NC_019539. 76 NFVLNAADWHTFTLTNNLPYAQRLEYGWS-----------------------------QQAPQGFVRVNVSRFQQL-LNE 125 (131) Q Consensus 76 ~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S-----------------------------~QAp~G~V~~a~~~~~~~-v~~ 125 (131) .....+-++-+..+++||.++|||+. .|.|+.|++.++++.... .+. T Consensus 63 ---~~~~~~~~~~v~~~a~YA~~ve~GT~ph~i~pk~~~al~f~~~g~~~~~k~v~hpG~~a~Pfl~~A~~~~~~~~~~~ 139 (142) T protein:vir:99 63 ---MVTPFHVSGGVTAHAKYAAAVHEGTRPHVIRAKHAQALHFWWRGREVFVRQVNHPGTRARPYLRNAGEAVVRRDRRI 139 (142) T ss_pred ---ccccceEEEEeccCccccceeccCCccceeccccCceeeEecCCceeeeeeeecCCCCCCchhHHHHHHHHhhhhhh Confidence 00112224557799999999999984 677889998888765433 222 Q ss_pred HHH Q lcl|NC_019539. 126 EAS 128 (131) Q Consensus 126 ~~~ 128 (131) .+| T Consensus 140 ~~r 142 (142) T protein:vir:99 140 RVR 142 (142) T ss_pred ccC Confidence 222 No 54 >protein:vir:8669 Length: 142 # NCBI annotation: gp27 # Family: family:all:1084 # MgeID: mge:156 # MgeName: Rosebush # Cross-refs: genbank:acc:NP_817788;genbank:gi:29566220;genbank:GeneID:1259476 Probab=98.96 E-value=4.4e-12 Score=82.87 Aligned_cols=104 Identities=10% Similarity=0.025 Sum_probs=75.7 Q ss_pred Ccc-----chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHHHHH Q lcl|NC_019539. 1 MSF-----ALDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNAA 75 (131) Q Consensus 1 msf-----~~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~~ 75 (131) ++| ..++....++++..+++.+++++.++...++..+|||||.||+||...... T Consensus 4 ~~~~~~gl~~~l~~~~~~~~~~~~~~i~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~--------------------- 62 (142) T protein:vir:86 4 VSVRYEGFDYNPVGAAAQVGPILRRTHSSLTRQIANETRARVPVLTGHLGRSVREDPQV--------------------- 62 (142) T ss_pred eEEEeeecchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeeeecc--------------------- Confidence 333 355777777888888999999999999999999999999999999754321 Q ss_pred HHHhhccCCceEEEeeCchhhhhhhcCCC-----------------------------CCCCchhHHHHHHHHHHH-HHH Q lcl|NC_019539. 76 NFVLNAADWHTFTLTNNLPYAQRLEYGWS-----------------------------QQAPQGFVRVNVSRFQQL-LNE 125 (131) Q Consensus 76 ~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S-----------------------------~QAp~G~V~~a~~~~~~~-v~~ 125 (131) .....+-++-+..+++||.++|||+. .|.|+.|++.++++.... .+. T Consensus 63 ---~~~~~~~~~~v~~~a~YA~~ve~GT~ph~i~pk~~~al~f~~~g~~~~~k~v~hpG~~a~Pfl~~A~~~~~~~~~~~ 139 (142) T protein:vir:86 63 ---MVTPFHVSGGVTAHAKYAAAVHEGTRPHVIRAKHAQALHFWWRGREVFVRQVNHPGTRARPYLRNAGEAVVRRDRRI 139 (142) T ss_pred ---ccccceEEEEeccCccccceeccCCccceeccccCceeeEecCCceeeeeeeecCCCCCCchhHHHHHHHHhhhhhh Confidence 00112224557799999999999984 677889998888765433 222 Q ss_pred HHH Q lcl|NC_019539. 126 EAS 128 (131) Q Consensus 126 ~~~ 128 (131) .+| T Consensus 140 ~~r 142 (142) T protein:vir:86 140 RVR 142 (142) T ss_pred ccC Confidence 222 No 55 >protein:vir:97982 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1482 # MgeName: Orion # Cross-refs: genbank:acc:YP_655121;genbank:gi:109391871;genbank:GeneID:4157345 Probab=98.91 E-value=8.7e-12 Score=81.22 Aligned_cols=102 Identities=14% Similarity=0.130 Sum_probs=72.7 Q ss_pred CccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHHHHHHHHhh Q lcl|NC_019539. 1 MSFALDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNAANFVLN 80 (131) Q Consensus 1 msf~~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~~~~i~~ 80 (131) ..|..+...+.+.++..+++.+++.+.++..+++..+|||||.||+||+..... .| T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~ak~~aPvdtG~Lr~SI~~~~~~-----------~~------------- 63 (140) T protein:vir:97 8 ARIEIDEAALERESGEHLRAFHRSLTRRIANQSRVAVPVRTGNLGRTIGELPQV-----------YT------------- 63 (140) T ss_pred eeeeeCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccchhhhccceeeeee-----------CC------------- Confidence 447778889999999999999999999999999999999999999999754211 00 Q ss_pred ccCCceEEEeeCchhhhhhhcCCC-----------------------------CCCCchhHHHHHHHHHHHHHHHHHhhC Q lcl|NC_019539. 81 AADWHTFTLTNNLPYAQRLEYGWS-----------------------------QQAPQGFVRVNVSRFQQLLNEEASKVK 131 (131) Q Consensus 81 ~~~g~~i~i~Nn~pYa~~LEyG~S-----------------------------~QAp~G~V~~a~~~~~~~v~~~~~e~k 131 (131) ..+-++.+..+++||.++|||+. .|.|+.|++.++.+.. ....++| T Consensus 64 -~~~~~~~v~~~a~YA~~Ve~GT~ph~I~pk~~k~L~~~~~G~~~~~k~V~hpG~~a~Pfl~~A~~~~~----~~~~~i~ 138 (140) T protein:vir:97 64 -PFRVRGGVEATADYAAPVHEGSRPHAIRARNAQYLHFWWHGREMFRKSVWHPGTRARPFMRNSAQRVV----TNDPRVR 138 (140) T ss_pred -CceEEEEecCCccchhhhccCCCCceeecCCCccceeecCCCEEEeeeeecCCCCCChhHHHHHHHHh----hhhhhcc Confidence 11124668899999999999983 3445555555544321 1112223 No 56 >protein:vir:107545 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1481 # MgeName: PG1 # Cross-refs: genbank:acc:NP_943803;genbank:gi:38638428;genbank:GeneID:2657225 Probab=98.91 E-value=8.7e-12 Score=81.22 Aligned_cols=102 Identities=14% Similarity=0.130 Sum_probs=72.7 Q ss_pred CccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHHHHHHHHhh Q lcl|NC_019539. 1 MSFALDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNAANFVLN 80 (131) Q Consensus 1 msf~~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~~~~i~~ 80 (131) ..|..+...+.+.++..+++.+++.+.++..+++..+|||||.||+||+..... .| T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~ak~~aPvdtG~Lr~SI~~~~~~-----------~~------------- 63 (140) T protein:vir:10 8 ARIEIDEAALERESGEHLRAFHRSLTRRIANQSRVAVPVRTGNLGRTIGELPQV-----------YT------------- 63 (140) T ss_pred eeeeeCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccchhhhccceeeeee-----------CC------------- Confidence 447778889999999999999999999999999999999999999999754211 00 Q ss_pred ccCCceEEEeeCchhhhhhhcCCC-----------------------------CCCCchhHHHHHHHHHHHHHHHHHhhC Q lcl|NC_019539. 81 AADWHTFTLTNNLPYAQRLEYGWS-----------------------------QQAPQGFVRVNVSRFQQLLNEEASKVK 131 (131) Q Consensus 81 ~~~g~~i~i~Nn~pYa~~LEyG~S-----------------------------~QAp~G~V~~a~~~~~~~v~~~~~e~k 131 (131) ..+-++.+..+++||.++|||+. .|.|+.|++.++.+.. ....++| T Consensus 64 -~~~~~~~v~~~a~YA~~Ve~GT~ph~I~pk~~k~L~~~~~G~~~~~k~V~hpG~~a~Pfl~~A~~~~~----~~~~~i~ 138 (140) T protein:vir:10 64 -PFRVRGGVEATADYAAPVHEGSRPHAIRARNAQYLHFWWHGREMFRKSVWHPGTRARPFMRNSAQRVV----TNDPRVR 138 (140) T ss_pred -CceEEEEecCCccchhhhccCCCCceeecCCCccceeecCCCEEEeeeeecCCCCCChhHHHHHHHHh----hhhhhcc Confidence 11124668899999999999983 3445555555544321 1112223 No 57 >protein:vir:966 Length: 123 # NCBI annotation: Orf48 # Family: family:all:970 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076620;genbank:gi:13095728;genbank:GeneID:920248 Probab=98.88 E-value=1.2e-11 Score=80.52 Aligned_cols=103 Identities=15% Similarity=0.231 Sum_probs=68.2 Q ss_pred Cccc-----------hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcch Q lcl|NC_019539. 1 MSFA-----------LDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTT 69 (131) Q Consensus 1 msf~-----------~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~ 69 (131) |+=+ ..|.+|.+.+.+.++..+++++.+++.+|+..||++||.++.||.+... .. T Consensus 1 m~~~v~id~L~~~i~~~L~~y~~~v~~~v~~~v~~~a~~~~~~lk~~sP~~TG~yaksW~~k~~-----------~~--- 66 (123) T protein:vir:96 1 MANKISIDDLAKTIESEVRNWTKDVVDDIDDIKKDITKNGVKQLRESSPKRTGDYAKNWTSQKL-----------KN--- 66 (123) T ss_pred CCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCccccccccceeeeec-----------CC--- Confidence 5543 4455666667777888899999999999999999999999999976531 11 Q ss_pred hHHHHHHHHhhccCCceEEEeeCchh--hhhhhcCCCCCC-----CchhHHHHHHHHHHHHHHHHHhhC Q lcl|NC_019539. 70 ATSNAANFVLNAADWHTFTLTNNLPY--AQRLEYGWSQQA-----PQGFVRVNVSRFQQLLNEEASKVK 131 (131) Q Consensus 70 ~~~~~~~~i~~~~~g~~i~i~Nn~pY--a~~LEyG~S~QA-----p~G~V~~a~~~~~~~v~~~~~e~k 131 (131) |..+.+.|+-.| +.-|||||-++. +..+++.+.+...+.|++.+++.= T Consensus 67 --------------~~~~v~~~~~~y~l~HLLE~GHa~r~GGrV~a~phI~paee~~~~~l~~~i~r~l 121 (123) T protein:vir:96 67 --------------GDQVIYQKAPTYRLTHLLENGHAKRNGGRVSPKVHIAPVEEELVSNYISRVEKRL 121 (123) T ss_pred --------------eeEEEEEecCCcceEEeeecceeecCCceeCcchhhhHHHHHHHHHHHHHHHHHh Confidence 112334445455 788999997665 334445665555444444333322 No 58 >protein:vir:100075 Length: 140 # NCBI annotation: gp9 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945039;genbank:gi:38707899;genbank:GeneID:2744122 Probab=98.83 E-value=2.7e-11 Score=78.48 Aligned_cols=118 Identities=15% Similarity=0.117 Sum_probs=68.4 Q ss_pred Cc-cchh-HHHHHHHHH--------HHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchh Q lcl|NC_019539. 1 MS-FALD-VSKFVEKAK--------KNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTA 70 (131) Q Consensus 1 ms-f~~~-i~~~~~~~~--------~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~ 70 (131) |+ |.-+ ++++.+.++ +.+...+++.+..+..+++.++|++||.|+.|..++...... +... T Consensus 1 Ma~~~i~Gld~l~~~l~~L~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~tG~l~~sI~~~~~~~~~---------~~~~ 71 (140) T protein:vir:10 1 MSSIQIIGLADLRADFEKLAKSQSTKALRRATVAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKD---------APGL 71 (140) T ss_pred CceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHHhcccccccccc---------ccce Confidence 54 3322 344443332 223456677788888899999999999999999876433211 1100 Q ss_pred HHHHHHHHhhccCCceEEEeeCchhhhhhhcCCCCCCCchhHHHHHHHHHHHHHHHHH-hhC Q lcl|NC_019539. 71 TSNAANFVLNAADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQLLNEEAS-KVK 131 (131) Q Consensus 71 ~~~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~~~v~~~~~-e~k 131 (131) ... .-....+.......+..|+.+||||+|.|.|..|++.|+..-...+.+++. +++ T Consensus 72 ~~~----g~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~ 129 (140) T protein:vir:10 72 ATA----GVRVRTKGKADSPNNAFYWRFDEFGTQHMKAQPFMRPAFDASIGEAEGAIRTELA 129 (140) T ss_pred EEe----eeeeccccccCCCCccceeeeeccCCCCCCCCcchhhhHHHHHHHHHHHHHHHHH Confidence 000 000000011112356789999999999999999999988665443322222 222 No 59 >protein:vir:1437 Length: 140 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536366;genbank:gi:17975171;genbank:GeneID:929147 Probab=98.82 E-value=4.2e-11 Score=77.46 Aligned_cols=118 Identities=14% Similarity=0.106 Sum_probs=68.5 Q ss_pred Cc-cchh-HHHHHHHHH-------H-HHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchh Q lcl|NC_019539. 1 MS-FALD-VSKFVEKAK-------K-NPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTA 70 (131) Q Consensus 1 ms-f~~~-i~~~~~~~~-------~-~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~ 70 (131) |+ +.-+ ++++.+.++ . .....+++.+..+..+++..+|++||.|+.|..++......+ ..+ T Consensus 1 M~~~~i~Gld~l~~~l~~l~~~~~~~~~~~al~~~a~~v~~~ak~~aP~~tG~l~~sI~~~~~~~~~~---------~~~ 71 (140) T protein:vir:14 1 MSSIQIIGLADLRADFEKLAKSQSAKALRRATLAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDA---------PGL 71 (140) T ss_pred CceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHhhccccccccccc---------cee Confidence 43 3322 333333332 2 234567778888888999999999999999997754322111 000 Q ss_pred HHHHHHHHhhccCCceEEEeeCchhhhhhhcCCCCCCCchhHHHHHHHHHHHHHHH-HHhhC Q lcl|NC_019539. 71 TSNAANFVLNAADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQLLNEE-ASKVK 131 (131) Q Consensus 71 ~~~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~~~v~~~-~~e~k 131 (131) .. +.-..+.+.......+.+|+.+||||+|.|.|+.|++.++......+.++ .++++ T Consensus 72 ~~----vg~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~pFl~pa~~~~~~~~~~~~~~~~~ 129 (140) T protein:vir:14 72 AT----AGVRVRTKGKADSPNNAFYWRFDEFGTQHMKAQPFMRPAFDASIGEAEGAIRTELA 129 (140) T ss_pred EE----eeeeeccccccCCCCccceeeeeccccCCCCCCcchhHHHHHHHHHHHHHHHHHHH Confidence 00 00000111122234678999999999999999999999886553332222 22222 No 60 >protein:vir:106570 Length: 182 # NCBI annotation: putative protein # Family: family:all:6475 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958588;genbank:gi:41179258;genbank:GeneID:2717106 Probab=98.82 E-value=3.5e-11 Score=77.88 Aligned_cols=105 Identities=13% Similarity=0.153 Sum_probs=61.5 Q ss_pred Cccc--------hhHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcc Q lcl|NC_019539. 1 MSFA--------LDVSKFVEKAKKNP----EKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGT 68 (131) Q Consensus 1 msf~--------~~i~~~~~~~~~~~----~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~ 68 (131) |++. ..+.++-+.+++.+ .+.+.+++..+..+++...|||||.||+|....+..- T Consensus 2 ~~v~i~Gld~L~~kl~~~~~~~~~~v~~a~~~~~~~~a~~v~~~ak~~~PvdtG~Lr~SI~~~~~~~------------- 68 (182) T protein:vir:10 2 IEVELKGVNELRAKLKKLPDIMAKATANAQENAIEQAEAYAVDELQSSIKYSTGELTRSFKHEVKVD------------- 68 (182) T ss_pred eEEEEecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCchhhhhceeeeeeec------------- Confidence 3332 22333333333333 3344555555666677889999999999986543210 Q ss_pred hhHHHHHHHHhhccCCceEEEeeCchhhhhhhcCC--------------------------------------------- Q lcl|NC_019539. 69 TATSNAANFVLNAADWHTFTLTNNLPYAQRLEYGW--------------------------------------------- 103 (131) Q Consensus 69 ~~~~~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~--------------------------------------------- 103 (131) ..+-+..+.++.+||.++|||. T Consensus 69 -------------~~~~~g~V~~~~~ya~yvE~GTG~~~~~~~~~~~p~~~~~~~~~~w~~~~~~v~~~~a~~~~~~~~~ 135 (182) T protein:vir:10 69 -------------GDEVIGRWWNSSMVAVFREFGTGLVGERSHKQLPKNVAIIYRQTPWFFPVDSVDLDLTKIYGIPKIK 135 (182) T ss_pred -------------CCeEEEEeecCCCccceeecCcccccccCccccCccceeeeecCCceeeccccccccccccccceee Confidence 0112455778888888888774 Q ss_pred ---------CCCCCchhHHHHHHHHH----HHHHHHHHh-hC Q lcl|NC_019539. 104 ---------SQQAPQGFVRVNVSRFQ----QLLNEEASK-VK 131 (131) Q Consensus 104 ---------S~QAp~G~V~~a~~~~~----~~v~~~~~e-~k 131 (131) +.|.|+.|++.+++... +++.+++++ +| T Consensus 136 ~~~~~~~~t~G~~aqPFl~pA~~~~~~~i~~~i~~~i~~~l~ 177 (182) T protein:vir:10 136 INGKYFYRTTGQPARQFMTPAANKMAKEAPEIIKRSIDQELH 177 (182) T ss_pred ecCceEeecCCCCCCcchHHHHHHhHHHHHHHHHHHHHHHHH Confidence 56888899988876543 333333333 23 No 61 >protein:vir:80362 Length: 140 # NCBI annotation: gp10, phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111089;genbank:gi:134288660;genbank:GeneID:4960609 Probab=98.81 E-value=1.9e-11 Score=79.35 Aligned_cols=118 Identities=14% Similarity=0.116 Sum_probs=68.9 Q ss_pred Cc-cchh-HHHHHHHHH---H-----HHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchh Q lcl|NC_019539. 1 MS-FALD-VSKFVEKAK---K-----NPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTA 70 (131) Q Consensus 1 ms-f~~~-i~~~~~~~~---~-----~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~ 70 (131) |+ +.-+ ++++.+.++ . .....+++.+..+..+++..+|++||.+++|..++......+ ... T Consensus 1 Ma~~~i~Gld~l~~~l~~l~~~~~~k~~~~a~~~~a~~v~~~ak~~aP~~tG~l~~~i~~~~~~~~~~---------~~~ 71 (140) T protein:vir:80 1 MSSIQIVGLADLLADFERLAKSQSTKALRRATVAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDA---------PGL 71 (140) T ss_pred CceeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhceeeeccccccc---------cce Confidence 44 2211 233333222 2 224567788888999999999999999999997653221110 000 Q ss_pred HHHHHHHHhhccCCceEEEeeCchhhhhhhcCCCCCCCchhHHHHHHHHHHHHHHHHH-hhC Q lcl|NC_019539. 71 TSNAANFVLNAADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQLLNEEAS-KVK 131 (131) Q Consensus 71 ~~~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~~~v~~~~~-e~k 131 (131) .. +.-............+..|+.+||||+|.|+|+.|++.++.+....+.+++. +++ T Consensus 72 ~~----~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~ 129 (140) T protein:vir:80 72 AT----AGVRVRTKGKADSPSNAFYWRFDEFGTQHMKAQPFMRPAFDASIGEAEGAIRTELA 129 (140) T ss_pred ee----eeeecccccccCCCCCcceeeeeccCCCCCCCCcchhhhHHHHHHHHHHHHHHHHH Confidence 00 0000001111223467889999999999999999999998766444333332 222 No 62 >protein:vir:100243 Length: 140 # NCBI annotation: gp72 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355408;genbank:gi:77864698;genbank:GeneID:3725965 Probab=98.78 E-value=7.4e-11 Score=76.12 Aligned_cols=118 Identities=16% Similarity=0.152 Sum_probs=71.5 Q ss_pred Cc---------cchhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchh Q lcl|NC_019539. 1 MS---------FALDVSKFVEKAK-KNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTA 70 (131) Q Consensus 1 ms---------f~~~i~~~~~~~~-~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~ 70 (131) |+ +...|+.+.+.+. +.+...+++.+..+..+++..+|++||.|+.|-.++...... ..+... T Consensus 1 Ma~~~i~Gld~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~-------~~~~~~ 73 (140) T protein:vir:10 1 MSSVQILGLADLQADFLKLAKAQSTKALRRATVAGANVIRDEARARAPKKTGKLKRNIVTAALKQKD-------SPGIAT 73 (140) T ss_pred CceeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHHhceeccccccc-------ccceeE Confidence 44 2233444444443 234567788888899999999999999999998776432211 111111 Q ss_pred HHHHHHHHhhccCCceEEEeeCchhhhhhhcCCCCCCCchhHHHHHHHHH-HHHHHHHHhhC Q lcl|NC_019539. 71 TSNAANFVLNAADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRFQ-QLLNEEASKVK 131 (131) Q Consensus 71 ~~~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~-~~v~~~~~e~k 131 (131) .. -..+.........+..|+.+||||+|.|+|+.|++.++.+-. ++++...++++ T Consensus 74 ~~------~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~ 129 (140) T protein:vir:10 74 AG------VRVRTKGKADSPNNAFYWRFVELGTQFMKAEPFMRPAFDASIAQAEGAIRTEIA 129 (140) T ss_pred Ee------eccccccccCCCCcccccceeccCcCCCCCCcchhhhHHHHHHHHHHHHHHHHH Confidence 00 000011111223467899999999999999999999996663 33333333343 No 63 >protein:vir:99528 Length: 92 # NCBI annotation: putative major tail protein # Family: family:all:180 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958541;genbank:gi:41179323;genbank:GeneID:2717166 Probab=98.72 E-value=4.2e-11 Score=77.45 Aligned_cols=80 Identities=23% Similarity=0.268 Sum_probs=59.3 Q ss_pred Cccc-hhHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHHHH Q lcl|NC_019539. 1 MSFA-LDVSKFVEKAKKN-----PEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNA 74 (131) Q Consensus 1 msf~-~~i~~~~~~~~~~-----~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~ 74 (131) |++. .=++++.+.+++. +.+++++.+.++-.+.+...|||||.||+|+.+++.. T Consensus 4 ~~i~~~Gld~L~~~L~~~~~~~~v~~vv~~~~~~l~~~ak~~ap~dTG~lrrSI~~~~~~-------------------- 63 (92) T protein:vir:99 4 YSISWDGLDALDEALANQQNMNTVKKVVKKHTANLMTATQQAVPVDTGHLKQSAQIQISR-------------------- 63 (92) T ss_pred eeeEeehHHHHHHHHHhhccHHHHHHHHHHHHHHHHHHHHHhCCCCccccceeeeEEeec-------------------- Confidence 3343 2256666666543 7889999999999999999999999999999877421 Q ss_pred HHHHhhccCCceEEE---eeCchhhhhhhcCCCCCCC Q lcl|NC_019539. 75 ANFVLNAADWHTFTL---TNNLPYAQRLEYGWSQQAP 108 (131) Q Consensus 75 ~~~i~~~~~g~~i~i---~Nn~pYa~~LEyG~S~QAp 108 (131) .|-+..| .=+..|++|||||++.++. T Consensus 64 --------~g~~~~v~~~gp~a~Ya~YvE~GTR~M~A 92 (92) T protein:vir:99 64 --------DGFTGSVTYGGGLVNYAAYVEFGTRFMDS 92 (92) T ss_pred --------CCeeEEEEeccCccccccccccceeecCC Confidence 1112222 2467899999999999988 No 64 >protein:vir:93617 Length: 148 # NCBI annotation: putative structural component # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449299;genbank:gi:157166047;interpro:IPR010064;interpro:IPR011693;uniprot:Q6H9U2;genbank:GeneID:5580439 Probab=98.67 E-value=1e-10 Score=75.30 Aligned_cols=122 Identities=15% Similarity=0.154 Sum_probs=65.8 Q ss_pred Cccchh-HHHHHHHHHHH---H-----HHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhH Q lcl|NC_019539. 1 MSFALD-VSKFVEKAKKN---P-----EKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTAT 71 (131) Q Consensus 1 msf~~~-i~~~~~~~~~~---~-----~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~ 71 (131) |+|.-+ ++++.+.++.- + ...+++.+..+..+++.++|++||.++.|-.++...-+. |.... T Consensus 4 ~~~~i~Gldel~~~l~~L~~~~~~~~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~---------g~~~~ 74 (148) T protein:vir:93 4 TLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRRSRD---------GGMES 74 (148) T ss_pred eeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcchhhhhceeccccccC---------Cceee Confidence 334433 44555544422 1 224455566688888899999999999998766432221 11110 Q ss_pred HHHHHHHhhcc-CCc---eEEEeeCchhhhhhhcCCCCCCCchhHHHHHHHHHH-HHHHHHHhhC Q lcl|NC_019539. 72 SNAANFVLNAA-DWH---TFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQ-LLNEEASKVK 131 (131) Q Consensus 72 ~~~~~~i~~~~-~g~---~i~i~Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~~-~v~~~~~e~k 131 (131) .-....+...+ ... ..+-..+.+|+.++|||.|.|+|+.|++.|+.+-.. +++...++++ T Consensus 75 ~v~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~pa~PFl~pA~~~~k~~~~~~~~~~~~ 139 (148) T protein:vir:93 75 GVHIRGVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMN 139 (148) T ss_pred eeeecccccccccccceeecCCCCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHH Confidence 00000000000 000 111224568999999999999999999999865422 2222222222 No 65 >protein:vir:102441 Length: 137 # NCBI annotation: gp26 # Family: family:all:1084 # MgeID: mge:1618 # MgeName: Pipefish # Cross-refs: genbank:acc:YP_655303;genbank:gi:109521866;genbank:GeneID:4157756 Probab=98.63 E-value=1.1e-10 Score=75.18 Aligned_cols=107 Identities=13% Similarity=0.101 Sum_probs=78.1 Q ss_pred CccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHHHHHHHHhh Q lcl|NC_019539. 1 MSFALDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNAANFVLN 80 (131) Q Consensus 1 msf~~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~~~~i~~ 80 (131) |-|..+.....+++...+...+++++.++....+...|||||.||+|++.+... .+ T Consensus 5 ~~~~~~~~~~~~~~~~v~r~~l~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~------------------------~~ 60 (137) T protein:vir:10 5 ARYERNPVGEARQFQVIARRRLSRITRGTANQARADVPVKTGNLGRSIREDPIV------------------------VA 60 (137) T ss_pred EEeccCchhHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccchhhhcCceeeeee------------------------cc Confidence 668888888888899999999999999999999999999999999999865321 01 Q ss_pred ccCCceEEEeeCchhhhhhhcCCCCCC--------------CchhH-HHHH----HHHHHHHHHHHHhhC Q lcl|NC_019539. 81 AADWHTFTLTNNLPYAQRLEYGWSQQA--------------PQGFV-RVNV----SRFQQLLNEEASKVK 131 (131) Q Consensus 81 ~~~g~~i~i~Nn~pYa~~LEyG~S~QA--------------p~G~V-~~a~----~~~~~~v~~~~~e~k 131 (131) ...+..+++..+++||.++|+|+.... ..++| +-.+ +.-+.++..++++.+ T Consensus 61 ~~~~~~~~V~~~~~YA~~ve~GT~ph~I~Pk~~k~~l~~~~~g~~vf~k~V~hPG~~a~PfL~~A~~~~~ 130 (137) T protein:vir:10 61 GPLRLDSGVTAHADYARYVHDGTRAHVIRPRRPGGVLRFTVGGRVVYARRVNHPGTRARPFLRNAAERVV 130 (137) T ss_pred ccceEEEEecCCCccceeeecCCCCceeeccccceeeeEeeCCeeEecceeecCCCCCCchHHHHHHHhh Confidence 122346788899999999999985321 11222 0000 223566777777777 No 66 >protein:vir:1273 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690765;genbank:gi:22855005;genbank:GeneID:955232 Probab=98.59 E-value=3.6e-10 Score=72.38 Aligned_cols=107 Identities=16% Similarity=0.067 Sum_probs=70.0 Q ss_pred Cccc------hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcc---chhhccccccccCcccccccCCCCCCcchhH Q lcl|NC_019539. 1 MSFA------LDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKASPVD---TGRFRMNWMASGGTPADGTTDATDKAGTTAT 71 (131) Q Consensus 1 msf~------~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~tPVd---tGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~ 71 (131) |.+. ..|.+..+.+++.....+++.|..+..+++..+|++ ||.++.|-.++-- ..++.|. T Consensus 4 ~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~~k~~ap~~~~~tg~l~~~I~~~~~--------k~~~~g~--- 72 (127) T protein:vir:12 4 MSFDGIDDLTQYFEKIGGDIEKVEPVALKAGGEIIAERQRSHVNRSDKKQPHMQDNITVSNV--------RESKDGV--- 72 (127) T ss_pred eeehhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCChhHHHHhhhcccc--------ccccCce--- Confidence 2222 333333334455556677888888999999999986 8999999976521 1111121 Q ss_pred HHHHHHHhhccCCceEEEe---eCchhhhhhhcCCCCCCCchhHHHHHHHHHHHHHHHHH-hhC Q lcl|NC_019539. 72 SNAANFVLNAADWHTFTLT---NNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQLLNEEAS-KVK 131 (131) Q Consensus 72 ~~~~~~i~~~~~g~~i~i~---Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~~~v~~~~~-e~k 131 (131) .+|.|. ++.+|+.+||||+|.|.|++|++.|+++-...+-+.+. +++ T Consensus 73 -------------~~v~Vg~~~~~~~y~~f~E~GT~~~~a~Pf~~pa~~~~~~~~~~~~~~~~~ 123 (127) T protein:vir:12 73 -------------RFVAVGPNKKVAYRGRFLEWGTSKMPPQPFIEKGGKEGEGPAVELMERILT 123 (127) T ss_pred -------------eEEEEeeCCCCcceeeeeccCccCCCCCccchHhHHHHHHHHHHHHHHHHH Confidence 123343 45779999999999999999999999877555444433 233 No 67 >protein:vir:194 Length: 149 # NCBI annotation: Gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037704;genbank:gi:9634169;genbank:GeneID:1262536 Probab=98.57 E-value=2.4e-10 Score=73.33 Aligned_cols=125 Identities=16% Similarity=0.179 Sum_probs=68.7 Q ss_pred CccchhHH---HHHHHHHH---H-----HHHHHHHHHHHHHHHHHHhCCccchhhccccccccCccccc-c-cCCCCCCc Q lcl|NC_019539. 1 MSFALDVS---KFVEKAKK---N-----PEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADG-T-TDATDKAG 67 (131) Q Consensus 1 msf~~~i~---~~~~~~~~---~-----~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~-~-~~~~d~~g 67 (131) |+++.+|+ ++.+.++. . ....+++.|..+..+++..+|++||.++.|..++....... . .......+ T Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~si~~~~~~~~~~~~~~~~v~~~~ 81 (149) T protein:vir:19 2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIDRAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRG 81 (149) T ss_pred cceeeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCchhhhhhccccccccccccceeecccccc Confidence 66655544 44333332 2 23455667777888899999999999999997764321110 0 00000000 Q ss_pred chhHHHHHHHHhhccCCceEEEeeCchhhhhhhcCCCCCCCchhHHHHHHHHHHHHHH-HHHhhC Q lcl|NC_019539. 68 TTATSNAANFVLNAADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQLLNE-EASKVK 131 (131) Q Consensus 68 ~~~~~~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~~~v~~-~~~e~k 131 (131) ... ...........+-..+..|+.++|||+|.|.|+.|++.|+.+-...+.+ ..++++ T Consensus 82 ~~~------~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PF~~pA~~~~k~~~~~~~~~~l~ 140 (149) T protein:vir:19 82 VNP------RTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMN 140 (149) T ss_pred ccc------ccccccceeecCCCCccceeeeeccCCCCCCCCcchhHHHHHHHHHHHHHHHHHHH Confidence 000 0000011112233345679999999999999999999998655332222 222222 No 68 >protein:vir:97088 Length: 157 # NCBI annotation: hypothetical protein # Family: family:all:2714 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453568;genbank:gi:84662603;genbank:GeneID:5142503 Probab=98.54 E-value=1.2e-09 Score=69.45 Aligned_cols=107 Identities=16% Similarity=0.276 Sum_probs=66.0 Q ss_pred Cccch---hHHHH---HHHHHHHHHHHHH----HHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchh Q lcl|NC_019539. 1 MSFAL---DVSKF---VEKAKKNPEKVIR----QVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTA 70 (131) Q Consensus 1 msf~~---~i~~~---~~~~~~~~~~~~r----~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~ 70 (131) |||+. +++.+ .+.+.+...+++| +.+.-+..+++.+.|++||.|+.|-.+.... ++++ T Consensus 1 m~~~~~~~d~s~l~~~l~~l~~~~~~v~R~A~~~ga~vv~dear~~aP~~tG~LkksI~~~~~~---------~~s~--- 68 (157) T protein:vir:97 1 MKFSIRSVDITGILAGLETVVEHSSDVVRTMTYESAVAVRESAKAFVNDETGKLRNNLYVAYSP---------EESV--- 68 (157) T ss_pred CeeEeecccHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhheeeeecc---------ccCC--- Confidence 88764 23333 3344444444444 4555578888899999999999999775421 1111 Q ss_pred HHHHHHHHhhccCCceEE-Eee---CchhhhhhhcCCCC------------------------CCCchhHHHHHHHHHHH Q lcl|NC_019539. 71 TSNAANFVLNAADWHTFT-LTN---NLPYAQRLEYGWSQ------------------------QAPQGFVRVNVSRFQQL 122 (131) Q Consensus 71 ~~~~~~~i~~~~~g~~i~-i~N---n~pYa~~LEyG~S~------------------------QAp~G~V~~a~~~~~~~ 122 (131) .|..+| |+- +.||+..+||||+. ..|..|+|.++..-.+. T Consensus 69 ------------~g~~~~~Vg~~~~~a~~g~~vEfG~~~~~~~~~~~~~~~~~~~~~~~t~~~~Pa~PFlRPA~d~~k~~ 136 (157) T protein:vir:97 69 ------------EGIQTYAVSWRKKAAPHGHLLEFGHWQTHAAYRDKDGQWYSSKVKLVNPKWIPAKPFLRPGYDSVAMQ 136 (157) T ss_pred ------------CceEEEEEeecCCccceeeeeecCcccccccccCCcccccccccccCCCCcCCCCcccchHHHHhHHH Confidence 122222 322 56899999999742 56799999988665444 Q ss_pred HHH--------HHHhhC Q lcl|NC_019539. 123 LNE--------EASKVK 131 (131) Q Consensus 123 v~~--------~~~e~k 131 (131) +.+ .++|+. T Consensus 137 a~~~~~~~l~k~I~e~l 153 (157) T protein:vir:97 137 IPDIARAAGAKKYAELQ 153 (157) T ss_pred HHHHHHHHHHHHHHHHh Confidence 333 344444 No 69 >protein:vir:105089 Length: 133 # NCBI annotation: Gp11 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006591;genbank:gi:46402097;genbank:GeneID:2777955 Probab=98.45 E-value=1.7e-09 Score=68.70 Aligned_cols=108 Identities=13% Similarity=0.112 Sum_probs=61.7 Q ss_pred Cccc--------hhHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhCCccchh----hccccccccCcccccccCCCCCCc Q lcl|NC_019539. 1 MSFA--------LDVSKFVEKAKKNP-EKVIRQVSIKLFSAIIKASPVDTGR----FRMNWMASGGTPADGTTDATDKAG 67 (131) Q Consensus 1 msf~--------~~i~~~~~~~~~~~-~~~~r~~a~~~~~~vv~~tPVdtGr----~R~nw~vs~~~~~~~~~~~~d~~g 67 (131) |+|. +.|.++.+++...+ ...+++.+..+..+++..+|+|+|. +|.|-.++... ..+.. T Consensus 2 ~~~~i~Gl~el~~~l~~L~~~~~~k~~~~Al~~~a~~i~~~ak~~ap~~~~~~~~~~~~~I~v~~~~-------~~~~~- 73 (133) T protein:vir:10 2 IRMEVKGLDELERQLTALGEKVATKVLRDAGREALKVVEEDMKQHAGFDETSTGQHMRDSIKIRSST-------RKAQG- 73 (133) T ss_pred eeEeeehHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCcchhhhhhcccccccc-------cccCc- Confidence 4444 22333333332222 4566777777899999999999987 44444332110 00000 Q ss_pred chhHHHHHHHHhhccCCc-eEEEeeC---chhhhhhhcCCCCCCCchhHHHHHHHHHHH-HHHHHHhhC Q lcl|NC_019539. 68 TTATSNAANFVLNAADWH-TFTLTNN---LPYAQRLEYGWSQQAPQGFVRVNVSRFQQL-LNEEASKVK 131 (131) Q Consensus 68 ~~~~~~~~~~i~~~~~g~-~i~i~Nn---~pYa~~LEyG~S~QAp~G~V~~a~~~~~~~-v~~~~~e~k 131 (131) .|. .+.+..+ -.|+.++|||+|.|+|+.|++.++.+-... ++...++++ T Consensus 74 ---------------~~~~~v~vg~~~~~~~y~~f~E~GT~k~~a~PF~~pA~~~~~~~~~~~~~~~~~ 127 (133) T protein:vir:10 74 ---------------NAVVTLRVGPSKQHHMKVLAQEFGTVKQVADPFIRPALDYNVQTVLRVLTVEIR 127 (133) T ss_pred ---------------cceEEEEecCCCCccceEeeeccCCCCCCCCccchHHHHHhHHHHHHHHHHHHH Confidence 011 1333322 248889999999999999999999755433 223333333 No 70 >protein:vir:79988 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430006;genbank:gi:156604061;genbank:GeneID:5525448 Probab=98.38 E-value=3.9e-09 Score=66.69 Aligned_cols=108 Identities=13% Similarity=0.177 Sum_probs=67.4 Q ss_pred CccchhHHHHHH---HHHHHH----HHHHHHHHHHHHHHHHHhCCccchh--hccccccccCcccccccCCCCCCcchhH Q lcl|NC_019539. 1 MSFALDVSKFVE---KAKKNP----EKVIRQVSIKLFSAIIKASPVDTGR--FRMNWMASGGTPADGTTDATDKAGTTAT 71 (131) Q Consensus 1 msf~~~i~~~~~---~~~~~~----~~~~r~~a~~~~~~vv~~tPVdtGr--~R~nw~vs~~~~~~~~~~~~d~~g~~~~ 71 (131) ||..-+++++.+ ++.... ...+++.|.-+...++..+|+++|. +|.|..+|-.. ..+..|. T Consensus 1 M~v~v~~~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k-------~~~~~g~--- 70 (125) T protein:vir:79 1 MGARIESNNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVK-------TDRHTSE--- 70 (125) T ss_pred CeeEeeHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccc-------cccccce--- Confidence 776655544433 333322 2455666666777788899998877 99999886211 0011111 Q ss_pred HHHHHHHhhccCCceEEEeeCch---hhhhhhcCCCCCCCchhHHHHHHHHHH----HHHHHHHhhC Q lcl|NC_019539. 72 SNAANFVLNAADWHTFTLTNNLP---YAQRLEYGWSQQAPQGFVRVNVSRFQQ----LLNEEASKVK 131 (131) Q Consensus 72 ~~~~~~i~~~~~g~~i~i~Nn~p---Ya~~LEyG~S~QAp~G~V~~a~~~~~~----~v~~~~~e~k 131 (131) ..+.+..+-. |+.++|||.|.|.|++|++.|+++-.. ++.+.++++. T Consensus 71 -------------~~v~VG~~k~~~~~a~F~E~GT~k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~ 124 (125) T protein:vir:79 71 -------------KIVTIGYAKGVSHRIHATEFGTMYQKPQLFITKTEKQGKNKVLKTMLDTAKRLQ 124 (125) T ss_pred -------------EEEEeccCCCCceEEEeccCCccCCCCCchhhHHHHHhHHHHHHHHHHHHHHHh Confidence 1233444333 789999999999999999999977654 3444444444 No 71 >protein:vir:9414 Length: 125 # NCBI annotation: phi PVL orf 11-like protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803392;genbank:gi:29028704;genbank:GeneID:1258141 Probab=98.38 E-value=3.9e-09 Score=66.69 Aligned_cols=108 Identities=13% Similarity=0.177 Sum_probs=67.4 Q ss_pred CccchhHHHHHH---HHHHHH----HHHHHHHHHHHHHHHHHhCCccchh--hccccccccCcccccccCCCCCCcchhH Q lcl|NC_019539. 1 MSFALDVSKFVE---KAKKNP----EKVIRQVSIKLFSAIIKASPVDTGR--FRMNWMASGGTPADGTTDATDKAGTTAT 71 (131) Q Consensus 1 msf~~~i~~~~~---~~~~~~----~~~~r~~a~~~~~~vv~~tPVdtGr--~R~nw~vs~~~~~~~~~~~~d~~g~~~~ 71 (131) ||..-+++++.+ ++.... ...+++.|.-+...++..+|+++|. +|.|..+|-.. ..+..|. T Consensus 1 M~v~v~~~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k-------~~~~~g~--- 70 (125) T protein:vir:94 1 MGARIESNNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVK-------TDRHTSE--- 70 (125) T ss_pred CeeEeeHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccc-------cccccce--- Confidence 776655544433 333322 2455666666777788899998877 99999886211 0011111 Q ss_pred HHHHHHHhhccCCceEEEeeCch---hhhhhhcCCCCCCCchhHHHHHHHHHH----HHHHHHHhhC Q lcl|NC_019539. 72 SNAANFVLNAADWHTFTLTNNLP---YAQRLEYGWSQQAPQGFVRVNVSRFQQ----LLNEEASKVK 131 (131) Q Consensus 72 ~~~~~~i~~~~~g~~i~i~Nn~p---Ya~~LEyG~S~QAp~G~V~~a~~~~~~----~v~~~~~e~k 131 (131) ..+.+..+-. |+.++|||.|.|.|++|++.|+++-.. ++.+.++++. T Consensus 71 -------------~~v~VG~~k~~~~~a~F~E~GT~k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~ 124 (125) T protein:vir:94 71 -------------KIVTIGYAKGVSHRIHATEFGTMYQKPQLFITKTEKQGKNKVLKTMLDTAKRLQ 124 (125) T ss_pred -------------EEEEeccCCCCceEEEeccCCccCCCCCchhhHHHHHhHHHHHHHHHHHHHHHh Confidence 1233444333 789999999999999999999977654 3444444444 No 72 >protein:vir:98342 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918934;genbank:gi:119443696;genbank:GeneID:4594504 Probab=98.38 E-value=3.9e-09 Score=66.69 Aligned_cols=108 Identities=13% Similarity=0.177 Sum_probs=67.4 Q ss_pred CccchhHHHHHH---HHHHHH----HHHHHHHHHHHHHHHHHhCCccchh--hccccccccCcccccccCCCCCCcchhH Q lcl|NC_019539. 1 MSFALDVSKFVE---KAKKNP----EKVIRQVSIKLFSAIIKASPVDTGR--FRMNWMASGGTPADGTTDATDKAGTTAT 71 (131) Q Consensus 1 msf~~~i~~~~~---~~~~~~----~~~~r~~a~~~~~~vv~~tPVdtGr--~R~nw~vs~~~~~~~~~~~~d~~g~~~~ 71 (131) ||..-+++++.+ ++.... ...+++.|.-+...++..+|+++|. +|.|..+|-.. ..+..|. T Consensus 1 M~v~v~~~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k-------~~~~~g~--- 70 (125) T protein:vir:98 1 MGARIESNNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVK-------TDRHTSE--- 70 (125) T ss_pred CeeEeeHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccc-------cccccce--- Confidence 776655544433 333322 2455666666777788899998877 99999886211 0011111 Q ss_pred HHHHHHHhhccCCceEEEeeCch---hhhhhhcCCCCCCCchhHHHHHHHHHH----HHHHHHHhhC Q lcl|NC_019539. 72 SNAANFVLNAADWHTFTLTNNLP---YAQRLEYGWSQQAPQGFVRVNVSRFQQ----LLNEEASKVK 131 (131) Q Consensus 72 ~~~~~~i~~~~~g~~i~i~Nn~p---Ya~~LEyG~S~QAp~G~V~~a~~~~~~----~v~~~~~e~k 131 (131) ..+.+..+-. |+.++|||.|.|.|++|++.|+++-.. ++.+.++++. T Consensus 71 -------------~~v~VG~~k~~~~~a~F~E~GT~k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~ 124 (125) T protein:vir:98 71 -------------KIVTIGYAKGVSHRIHATEFGTMYQKPQLFITKTEKQGKNKVLKTMLDTAKRLQ 124 (125) T ss_pred -------------EEEEeccCCCCceEEEeccCCccCCCCCchhhHHHHHhHHHHHHHHHHHHHHHh Confidence 1233444333 789999999999999999999977654 3444444444 No 73 >protein:vir:81106 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429878;genbank:gi:156603931;genbank:GeneID:5525326 Probab=98.38 E-value=3.9e-09 Score=66.69 Aligned_cols=108 Identities=13% Similarity=0.177 Sum_probs=67.4 Q ss_pred CccchhHHHHHH---HHHHHH----HHHHHHHHHHHHHHHHHhCCccchh--hccccccccCcccccccCCCCCCcchhH Q lcl|NC_019539. 1 MSFALDVSKFVE---KAKKNP----EKVIRQVSIKLFSAIIKASPVDTGR--FRMNWMASGGTPADGTTDATDKAGTTAT 71 (131) Q Consensus 1 msf~~~i~~~~~---~~~~~~----~~~~r~~a~~~~~~vv~~tPVdtGr--~R~nw~vs~~~~~~~~~~~~d~~g~~~~ 71 (131) ||..-+++++.+ ++.... ...+++.|.-+...++..+|+++|. +|.|..+|-.. ..+..|. T Consensus 1 M~v~v~~~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k-------~~~~~g~--- 70 (125) T protein:vir:81 1 MGARIESNNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVK-------TDRHTSE--- 70 (125) T ss_pred CeeEeeHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccc-------cccccce--- Confidence 776655544433 333322 2455666666777788899998877 99999886211 0011111 Q ss_pred HHHHHHHhhccCCceEEEeeCch---hhhhhhcCCCCCCCchhHHHHHHHHHH----HHHHHHHhhC Q lcl|NC_019539. 72 SNAANFVLNAADWHTFTLTNNLP---YAQRLEYGWSQQAPQGFVRVNVSRFQQ----LLNEEASKVK 131 (131) Q Consensus 72 ~~~~~~i~~~~~g~~i~i~Nn~p---Ya~~LEyG~S~QAp~G~V~~a~~~~~~----~v~~~~~e~k 131 (131) ..+.+..+-. |+.++|||.|.|.|++|++.|+++-.. ++.+.++++. T Consensus 71 -------------~~v~VG~~k~~~~~a~F~E~GT~k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~ 124 (125) T protein:vir:81 71 -------------KIVTIGYAKGVSHRIHATEFGTMYQKPQLFITKTEKQGKNKVLKTMLDTAKRLQ 124 (125) T ss_pred -------------EEEEeccCCCCceEEEeccCCccCCCCCchhhHHHHHhHHHHHHHHHHHHHHHh Confidence 1233444333 789999999999999999999977654 3444444444 No 74 >protein:vir:4704 Length: 125 # NCBI annotation: phi PVL ORF 11 homologue # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061636;genbank:gi:9635723;genbank:GeneID:1262995 Probab=98.38 E-value=3.9e-09 Score=66.69 Aligned_cols=108 Identities=13% Similarity=0.177 Sum_probs=67.4 Q ss_pred CccchhHHHHHH---HHHHHH----HHHHHHHHHHHHHHHHHhCCccchh--hccccccccCcccccccCCCCCCcchhH Q lcl|NC_019539. 1 MSFALDVSKFVE---KAKKNP----EKVIRQVSIKLFSAIIKASPVDTGR--FRMNWMASGGTPADGTTDATDKAGTTAT 71 (131) Q Consensus 1 msf~~~i~~~~~---~~~~~~----~~~~r~~a~~~~~~vv~~tPVdtGr--~R~nw~vs~~~~~~~~~~~~d~~g~~~~ 71 (131) ||..-+++++.+ ++.... ...+++.|.-+...++..+|+++|. +|.|..+|-.. ..+..|. T Consensus 1 M~v~v~~~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k-------~~~~~g~--- 70 (125) T protein:vir:47 1 MGARIESNNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVK-------TDRHTSE--- 70 (125) T ss_pred CeeEeeHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccc-------cccccce--- Confidence 776655544433 333322 2455666666777788899998877 99999886211 0011111 Q ss_pred HHHHHHHhhccCCceEEEeeCch---hhhhhhcCCCCCCCchhHHHHHHHHHH----HHHHHHHhhC Q lcl|NC_019539. 72 SNAANFVLNAADWHTFTLTNNLP---YAQRLEYGWSQQAPQGFVRVNVSRFQQ----LLNEEASKVK 131 (131) Q Consensus 72 ~~~~~~i~~~~~g~~i~i~Nn~p---Ya~~LEyG~S~QAp~G~V~~a~~~~~~----~v~~~~~e~k 131 (131) ..+.+..+-. |+.++|||.|.|.|++|++.|+++-.. ++.+.++++. T Consensus 71 -------------~~v~VG~~k~~~~~a~F~E~GT~k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~ 124 (125) T protein:vir:47 71 -------------KIVTIGYAKGVSHRIHATEFGTMYQKPQLFITKTEKQGKNKVLKTMLDTAKRLQ 124 (125) T ss_pred -------------EEEEeccCCCCceEEEeccCCccCCCCCchhhHHHHHhHHHHHHHHHHHHHHHh Confidence 1233444333 789999999999999999999977654 3444444444 No 75 >protein:vir:9708 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795470;genbank:gi:28876221;genbank:GeneID:1257765 Probab=98.38 E-value=1.8e-09 Score=68.57 Aligned_cols=107 Identities=19% Similarity=0.161 Sum_probs=71.0 Q ss_pred Cc-----cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchh----hccccccccCcccccccCCCCCCcchhH Q lcl|NC_019539. 1 MS-----FALDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGR----FRMNWMASGGTPADGTTDATDKAGTTAT 71 (131) Q Consensus 1 ms-----f~~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr----~R~nw~vs~~~~~~~~~~~~d~~g~~~~ 71 (131) |- |.++|.+..+.+++.....+++.|..+...++..+|+++|. +|.|-.++-- ..++.|.. T Consensus 1 mv~Gl~el~~~l~~l~~~~~~~~~~al~~ga~~~~~~~k~~ap~~~~~~~~hl~d~I~~~~~--------k~~~~g~~-- 70 (125) T protein:vir:97 1 MTKGLDEILANLTKLEVKAPKTAKAAVTEVAKEFEKALKANTPVYEVETDERLQEDTVISGF--------KGANVGIV-- 70 (125) T ss_pred CchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCCCchhhHHhhhhcccc--------cccccCce-- Confidence 32 55555555555566667788888888999999999999887 7777655311 11222221 Q ss_pred HHHHHHHhhccCCceEEEe---eCchhhhhhhcCCCCCCCchhHHHHHHHHHHHHHHHH-HhhC Q lcl|NC_019539. 72 SNAANFVLNAADWHTFTLT---NNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQLLNEEA-SKVK 131 (131) Q Consensus 72 ~~~~~~i~~~~~g~~i~i~---Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~~~v~~~~-~e~k 131 (131) .+.|. .+..|+.++|||+|.|.|.+|++.|+++-...+.+++ ++++ T Consensus 71 --------------~~~VG~~k~~~~y~~f~E~GT~k~~~~pF~~pa~~~~k~~~~~~~~~~~~ 120 (125) T protein:vir:97 71 --------------SKEIGYGKATGWRAHYPNDGTIYQRGQDFKERTINQMTPKAKQLYAEKVK 120 (125) T ss_pred --------------EEEEeecCCCceeEeeeccCccCCCcCccchHhHHHhHHHHHHHHHHHHH Confidence 12222 3467999999999999999999999987644443333 3333 No 76 >protein:vir:107568 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338191;genbank:gi:77020147;genbank:GeneID:3703699 Probab=98.35 E-value=5.4e-09 Score=65.92 Aligned_cols=120 Identities=13% Similarity=0.126 Sum_probs=65.0 Q ss_pred Cccch-hHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHH Q lcl|NC_019539. 1 MSFAL-DVSKFVEK-------AKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATS 72 (131) Q Consensus 1 msf~~-~i~~~~~~-------~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~ 72 (131) |++.- -+++|.++ +++.....+++.|..+..+++..+|+++|.++.+-........ .+...+. T Consensus 5 ~~~~i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~---------~~~~~i~ 75 (146) T protein:vir:10 5 IDLDLLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQ---------HGADQIK 75 (146) T ss_pred eeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccc---------cccccce Confidence 33221 12333333 3333556677778889999999999999999876322111000 0000000 Q ss_pred HHHHHHhhccCCceEEEe------eCchhhhhhhcCCCCCCCchhHHHHHHHHHHHHHHH-HHhhC Q lcl|NC_019539. 73 NAANFVLNAADWHTFTLT------NNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQLLNEE-ASKVK 131 (131) Q Consensus 73 ~~~~~i~~~~~g~~i~i~------Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~~~v~~~-~~e~k 131 (131) ..-+.....+..+.++ .+..|+.+||||+|.|.|..|++.+++.-...+-++ .++++ T Consensus 76 --~~~~~~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pa~~~~k~~~~~~~~~~l~ 139 (146) T protein:vir:10 76 --VTKAKLEGGIKTVKIGLNKADRSPWFYLKFHEWGTSKMPAHPFIEPGFNASKAEAVRAMTDILK 139 (146) T ss_pred --eccccccccceeEEeeeccCCCCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHH Confidence 0000000111223332 345799999999999999999999997764443332 23333 No 77 >protein:vir:102875 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338140;genbank:gi:77020200;genbank:GeneID:3703784 Probab=98.35 E-value=5.4e-09 Score=65.92 Aligned_cols=120 Identities=13% Similarity=0.126 Sum_probs=65.0 Q ss_pred Cccch-hHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHH Q lcl|NC_019539. 1 MSFAL-DVSKFVEK-------AKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATS 72 (131) Q Consensus 1 msf~~-~i~~~~~~-------~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~ 72 (131) |++.- -+++|.++ +++.....+++.|..+..+++..+|+++|.++.+-........ .+...+. T Consensus 5 ~~~~i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~---------~~~~~i~ 75 (146) T protein:vir:10 5 IDLDLLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQ---------HGADQIK 75 (146) T ss_pred eeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccc---------cccccce Confidence 33221 12333333 3333556677778889999999999999999876322111000 0000000 Q ss_pred HHHHHHhhccCCceEEEe------eCchhhhhhhcCCCCCCCchhHHHHHHHHHHHHHHH-HHhhC Q lcl|NC_019539. 73 NAANFVLNAADWHTFTLT------NNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQLLNEE-ASKVK 131 (131) Q Consensus 73 ~~~~~i~~~~~g~~i~i~------Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~~~v~~~-~~e~k 131 (131) ..-+.....+..+.++ .+..|+.+||||+|.|.|..|++.+++.-...+-++ .++++ T Consensus 76 --~~~~~~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pa~~~~k~~~~~~~~~~l~ 139 (146) T protein:vir:10 76 --VTKAKLEGGIKTVKIGLNKADRSPWFYLKFHEWGTSKMPAHPFIEPGFNASKAEAVRAMTDILK 139 (146) T ss_pred --eccccccccceeEEeeeccCCCCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHH Confidence 0000000111223332 345799999999999999999999997764443332 23333 No 78 >protein:vir:105007 Length: 146 # NCBI annotation: conserved phage protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459972;genbank:gi:85701387;genbank:GeneID:3882148 Probab=98.35 E-value=5.4e-09 Score=65.92 Aligned_cols=120 Identities=13% Similarity=0.126 Sum_probs=65.0 Q ss_pred Cccch-hHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHH Q lcl|NC_019539. 1 MSFAL-DVSKFVEK-------AKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATS 72 (131) Q Consensus 1 msf~~-~i~~~~~~-------~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~ 72 (131) |++.- -+++|.++ +++.....+++.|..+..+++..+|+++|.++.+-........ .+...+. T Consensus 5 ~~~~i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~---------~~~~~i~ 75 (146) T protein:vir:10 5 IDLDLLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQ---------HGADQIK 75 (146) T ss_pred eeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccc---------cccccce Confidence 33221 12333333 3333556677778889999999999999999876322111000 0000000 Q ss_pred HHHHHHhhccCCceEEEe------eCchhhhhhhcCCCCCCCchhHHHHHHHHHHHHHHH-HHhhC Q lcl|NC_019539. 73 NAANFVLNAADWHTFTLT------NNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQLLNEE-ASKVK 131 (131) Q Consensus 73 ~~~~~i~~~~~g~~i~i~------Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~~~v~~~-~~e~k 131 (131) ..-+.....+..+.++ .+..|+.+||||+|.|.|..|++.+++.-...+-++ .++++ T Consensus 76 --~~~~~~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pa~~~~k~~~~~~~~~~l~ 139 (146) T protein:vir:10 76 --VTKAKLEGGIKTVKIGLNKADRSPWFYLKFHEWGTSKMPAHPFIEPGFNASKAEAVRAMTDILK 139 (146) T ss_pred --eccccccccceeEEeeeccCCCCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHH Confidence 0000000111223332 345799999999999999999999997764443332 23333 No 79 >protein:vir:102085 Length: 146 # NCBI annotation: head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512318;genbank:gi:89152487;genbank:GeneID:3953078 Probab=98.35 E-value=5.4e-09 Score=65.92 Aligned_cols=120 Identities=13% Similarity=0.126 Sum_probs=65.0 Q ss_pred Cccch-hHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHH Q lcl|NC_019539. 1 MSFAL-DVSKFVEK-------AKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATS 72 (131) Q Consensus 1 msf~~-~i~~~~~~-------~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~ 72 (131) |++.- -+++|.++ +++.....+++.|..+..+++..+|+++|.++.+-........ .+...+. T Consensus 5 ~~~~i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~---------~~~~~i~ 75 (146) T protein:vir:10 5 IDLDLLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQ---------HGADQIK 75 (146) T ss_pred eeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccc---------cccccce Confidence 33221 12333333 3333556677778889999999999999999876322111000 0000000 Q ss_pred HHHHHHhhccCCceEEEe------eCchhhhhhhcCCCCCCCchhHHHHHHHHHHHHHHH-HHhhC Q lcl|NC_019539. 73 NAANFVLNAADWHTFTLT------NNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQLLNEE-ASKVK 131 (131) Q Consensus 73 ~~~~~i~~~~~g~~i~i~------Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~~~v~~~-~~e~k 131 (131) ..-+.....+..+.++ .+..|+.+||||+|.|.|..|++.+++.-...+-++ .++++ T Consensus 76 --~~~~~~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pa~~~~k~~~~~~~~~~l~ 139 (146) T protein:vir:10 76 --VTKAKLEGGIKTVKIGLNKADRSPWFYLKFHEWGTSKMPAHPFIEPGFNASKAEAVRAMTDILK 139 (146) T ss_pred --eccccccccceeEEeeeccCCCCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHH Confidence 0000000111223332 345799999999999999999999997764443332 23333 No 80 >protein:vir:80116 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425608;genbank:gi:155042941;genbank:GeneID:5469542 Probab=98.35 E-value=4.9e-09 Score=66.16 Aligned_cols=101 Identities=17% Similarity=0.343 Sum_probs=71.3 Q ss_pred Cc------c----chhHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HhCCccchhhccccccccCcccccccCCCCCC Q lcl|NC_019539. 1 MS------F----ALDVSKFVEKAKKNPEKVIRQVSIKLFSAII----KASPVDTGRFRMNWMASGGTPADGTTDATDKA 66 (131) Q Consensus 1 ms------f----~~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv----~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~ 66 (131) |+ + ...|.++.+.+.+.+++.+.+++.++...++ ..+|++||.++.+|.+... T Consensus 1 M~~i~id~La~~I~~~L~~y~~~v~~~v~~~v~evak~a~~~lkk~i~~tsPkrTG~YaK~W~~k~~------------- 67 (127) T protein:vir:80 1 MANIKIDRLGDEITRQLKRYSQVIAGDLEQIMDDVSKEAVDRLKAKIEEEGLVQTGDYKRGWTRKRT------------- 67 (127) T ss_pred CccccHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCccccccccccceeeec------------- Confidence 43 3 2345577777777778888777777777766 5899999999999965421 Q ss_pred cchhHHHHHHHHhhccCCceEEEeeCchh--hhhhhcCCCCC-----CCchhHHHHHHHHHHHHHHHHHhh-C Q lcl|NC_019539. 67 GTTATSNAANFVLNAADWHTFTLTNNLPY--AQRLEYGWSQQ-----APQGFVRVNVSRFQQLLNEEASKV-K 131 (131) Q Consensus 67 g~~~~~~~~~~i~~~~~g~~i~i~Nn~pY--a~~LEyG~S~Q-----Ap~G~V~~a~~~~~~~v~~~~~e~-k 131 (131) ++...|-|..+| +.-||+||-.+ ++...++.+.+...+.+.+.+++. + T Consensus 68 -----------------~~~~~v~nk~~yqLtHLLE~GHAkr~GGRV~a~pHI~paee~~~~~l~~~i~~~l~ 123 (127) T protein:vir:80 68 -----------------PGGWVIHNKTEYRLAHLLEYGHATVDGGRVPETPHIRPVEDWLEKEFEDRVERAIK 123 (127) T ss_pred -----------------cCceeEeecCCcceeehhhcceeccCCcccCCccchhhHHHHHHHHHHHHHHHHhc Confidence 122457788999 77899999654 345667888888766666555544 4 No 81 >protein:vir:106506 Length: 137 # NCBI annotation: Pas21 # Family: family:all:1084 # MgeID: mge:1680 # MgeName: phiAsp2 # Cross-refs: genbank:acc:YP_024807;genbank:gi:48697422;genbank:GeneID:2846163 Probab=98.35 E-value=9.1e-10 Score=70.16 Aligned_cols=106 Identities=15% Similarity=0.142 Sum_probs=73.7 Q ss_pred CccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHHHHHHHHhh Q lcl|NC_019539. 1 MSFALDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNAANFVLN 80 (131) Q Consensus 1 msf~~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~~~~i~~ 80 (131) =+..-+...+...+.+.+.++++.++.++..+.+..+|||||+||+||+.....- T Consensus 4 ~~~~l~~~~l~~~~~~~~~~~~~~~a~~ve~~ak~~aPv~TG~Lr~SI~~~~~~~------------------------- 58 (137) T protein:vir:10 4 HTLRIERAQLHGLGMDEARKAVNRVVRRTFTRSQILAPVDTGYLRASGRLVLGRE------------------------- 58 (137) T ss_pred cccccChhhHhhHHHHHHHHHHHHHHHHHHHHHHhcCCcCchhhhccceeeeeec------------------------- Confidence 2345556677777888888999999999999999999999999999997753210 Q ss_pred ccCCceEEEeeCchhhhhhhcCCC--------CCCC-----chh-----HHHHHHHHHHHHHHHHHhhC Q lcl|NC_019539. 81 AADWHTFTLTNNLPYAQRLEYGWS--------QQAP-----QGF-----VRVNVSRFQQLLNEEASKVK 131 (131) Q Consensus 81 ~~~g~~i~i~Nn~pYa~~LEyG~S--------~QAp-----~G~-----V~~a~~~~~~~v~~~~~e~k 131 (131) -..+-++.+..|++||.++|+|+. .++- .++ |..-=+.-+.+++.++++++ T Consensus 59 ~g~~v~~~V~~~~~YA~~ve~GT~ph~I~pk~~kaL~f~~~G~~vf~k~V~hPG~k~~PfL~~Al~~~~ 127 (137) T protein:vir:10 59 RGAVVIGSVEYTARYAAAVHNGRRALTIRAKGNGRLKFTVEGRTVYARSVHQPARAGRPYLSQALREVA 127 (137) T ss_pred cccEEEEEecCCcccceeeecCCCCceeecCCCccceeecCCeeEeccceecCCCCCChhhHHHHHHhh Confidence 011225678899999999999983 1111 111 11111224677888888888 No 82 >protein:vir:95372 Length: 124 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764480;genbank:gi:115334634;genbank:GeneID:5179259 Probab=98.33 E-value=5.5e-09 Score=65.85 Aligned_cols=101 Identities=17% Similarity=0.312 Sum_probs=68.7 Q ss_pred Cc------c----chhHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HhCCccchhhccccccccCcccccccCCCCCC Q lcl|NC_019539. 1 MS------F----ALDVSKFVEKAKKNPEKVIRQVSIKLFSAII----KASPVDTGRFRMNWMASGGTPADGTTDATDKA 66 (131) Q Consensus 1 ms------f----~~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv----~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~ 66 (131) |+ + ...|.++.+.+.+.+++.+.+++-++...|+ ..+|++||.++.+|.+... T Consensus 1 M~~i~id~La~~I~~~L~~Ys~~v~~~v~~~v~~vak~a~~~lkk~i~~tspkrTG~YaK~W~~kk~------------- 67 (124) T protein:vir:95 1 MAKIKIGRLADEITSQLRKYSQVIADDVEQIMDDVTKEAVGRLKSKIQEVGLVQTGDYMRGWTRKRV------------- 67 (124) T ss_pred CccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHhcCcccccchhccceeeee------------- Confidence 43 3 3445666677777777777666676666665 4899999999999977632 Q ss_pred cchhHHHHHHHHhhccCCceEEEeeCchh--hhhhhcCCCCCC-----CchhHHHHHHHHHHHHHHHHHhhC Q lcl|NC_019539. 67 GTTATSNAANFVLNAADWHTFTLTNNLPY--AQRLEYGWSQQA-----PQGFVRVNVSRFQQLLNEEASKVK 131 (131) Q Consensus 67 g~~~~~~~~~~i~~~~~g~~i~i~Nn~pY--a~~LEyG~S~QA-----p~G~V~~a~~~~~~~v~~~~~e~k 131 (131) ++...|-|..+| +.-||+||-.+. +...++.+.+...+.|.+.+++.= T Consensus 68 -----------------~e~~~V~nk~~yqLtHLLE~GHAkr~GGRV~a~pHI~paee~~~~~l~~~i~~~l 122 (124) T protein:vir:95 68 -----------------PNGWVIHNKTEYRLAHLLEYGHATVDGGRVPGTPHIRPIEDWLEKEFEDRVEKAI 122 (124) T ss_pred -----------------cCceeEEEcCCCceeeeeecceeccCCcccCCccchhHHHHHHHHHHHHHHHHHh Confidence 112357788899 778999996554 345556777766666655555443 No 83 >protein:vir:3873 Length: 128 # NCBI annotation: putative head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680490;swissprot:trembl:p94214;genbank:gi:22296530;interpro:IPR010064;uniprot:P94214;genbank:GeneID:951688 Probab=98.32 E-value=8.3e-09 Score=64.89 Aligned_cols=112 Identities=16% Similarity=0.157 Sum_probs=66.0 Q ss_pred Cccch-hHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHH Q lcl|NC_019539. 1 MSFAL-DVSKFVEKAK-------KNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATS 72 (131) Q Consensus 1 msf~~-~i~~~~~~~~-------~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~ 72 (131) ||+.- -+++|.++++ +.....+++.|..+...++..+|+++|..|.+=.+-.+ .... ...+..|.. T Consensus 1 m~v~i~Gl~el~~~l~~l~~~~~k~~~~al~~ga~~~~~~~k~~ap~~~~~~~~~~h~~d~-I~~~--~~k~~~g~~--- 74 (128) T protein:vir:38 1 MGVKVTGDAELLANLNKLQFGVAKEARAAVRDGAQKFADKLKSNTPEWDGETDMSGHLRDD-IKLS--SVRETSGLT--- 74 (128) T ss_pred CccchhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCCCcccchhhhh-hccc--cccccCcee--- Confidence 77652 2444444433 33445666777778888899999998875543111100 0000 001111111 Q ss_pred HHHHHHhhccCCceEEEe---eCchhhhhhhcCCCCCCCchhHHHHHHHHH-HHHHHHHHhhC Q lcl|NC_019539. 73 NAANFVLNAADWHTFTLT---NNLPYAQRLEYGWSQQAPQGFVRVNVSRFQ-QLLNEEASKVK 131 (131) Q Consensus 73 ~~~~~i~~~~~g~~i~i~---Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~-~~v~~~~~e~k 131 (131) .+.|. .+.-|+.++|||+|.|.|.+|++.++++-. ++++...+++| T Consensus 75 -------------~~~VG~~k~~~~y~~f~E~GT~k~~a~pF~~pa~~~~~~~~~~~~~~~l~ 124 (128) T protein:vir:38 75 -------------EVDVGYGKDTGWRAHFPNSGTSMQDPQHFIEETQEIMRPVVIAAFLSHLK 124 (128) T ss_pred -------------EEEeeecCCCceEEeeeccCccCCCCCcchhHHHHHhHHHHHHHHHHHHH Confidence 12232 234689999999999999999999998774 44555556666 No 84 >protein:vir:5745 Length: 135 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892056;genbank:gi:33770519;interpro:IPR010064;interpro:IPR011693;uniprot:Q7Y404;genbank:GeneID:2637451 Probab=98.28 E-value=8.4e-09 Score=64.87 Aligned_cols=108 Identities=14% Similarity=0.140 Sum_probs=65.5 Q ss_pred Cccchh---HHHHHHHHHHH---H-----HHHHHHHHHHHHHHHHHhCCccc----hhhccccccccCcccccccCCCCC Q lcl|NC_019539. 1 MSFALD---VSKFVEKAKKN---P-----EKVIRQVSIKLFSAIIKASPVDT----GRFRMNWMASGGTPADGTTDATDK 65 (131) Q Consensus 1 msf~~~---i~~~~~~~~~~---~-----~~~~r~~a~~~~~~vv~~tPVdt----Gr~R~nw~vs~~~~~~~~~~~~d~ 65 (131) |+++-+ ++++.+.++.- + ...+++.+..+..+++.++|||+ |.+|.|-.++-... + T Consensus 1 M~~~~~i~Gl~el~~~l~~L~~~~~~k~~~~Al~~~a~~v~~~~k~~ap~~~~~~~g~l~~~I~i~~~k~---------~ 71 (135) T protein:vir:57 1 MIPEIEISGLQELERRLIAVGEEVGTKILRDAGRAAMAVVEADMKQNAGYDNSSTNAHMRDSIKIRSSRG---------K 71 (135) T ss_pred CceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhHHhhcccccccc---------c Confidence 776655 33555544422 2 23455666668888999999986 88888876653211 1 Q ss_pred CcchhHHHHHHHHhhccCCceEEEeeCch---hhhhhhcCCCCCCCchhHHHHHHHHHH-HHHHHHHhhC Q lcl|NC_019539. 66 AGTTATSNAANFVLNAADWHTFTLTNNLP---YAQRLEYGWSQQAPQGFVRVNVSRFQQ-LLNEEASKVK 131 (131) Q Consensus 66 ~g~~~~~~~~~~i~~~~~g~~i~i~Nn~p---Ya~~LEyG~S~QAp~G~V~~a~~~~~~-~v~~~~~e~k 131 (131) .|.. +-++.+..+-+ |+.++|||+|.|+|+.|++.++.+-.. +++....+++ T Consensus 72 ~~~~--------------~v~v~vg~~~~~~~~~~f~E~GT~~~~a~PF~~pa~~~~~~~~~~~~~~~~~ 127 (135) T protein:vir:57 72 AGST--------------VVVLRVGPTRSHYMKALAQEFGTIKQVAKPFIRPALDYNKMQVLRILTVEIR 127 (135) T ss_pred ccce--------------eEEEEecCCCCcceeEeecccCCCCCCCCcchhHhHHHhHHHHHHHHHHHHH Confidence 1111 11233444333 477789999999999999999876533 3333333343 No 85 >protein:vir:9879 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:2718 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795641;genbank:gi:28876400;genbank:GeneID:1257931 Probab=98.22 E-value=7.8e-09 Score=65.02 Aligned_cols=106 Identities=14% Similarity=0.209 Sum_probs=71.1 Q ss_pred CccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--CCc-------cchhhccccccccCcccccccCCCCCCcchhH Q lcl|NC_019539. 1 MSFALDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKA--SPV-------DTGRFRMNWMASGGTPADGTTDATDKAGTTAT 71 (131) Q Consensus 1 msf~~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~--tPV-------dtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~ 71 (131) ++-..=..+..+..+.++.+++++-..++..+++.. +|| |||.+|.|-+..+..... T Consensus 2 ~G~~~L~~~Lk~~s~~dvk~VVkkN~ael~~r~q~~~~~pv~~~~k~~dTG~lkRSi~l~~~~~g~-------------- 67 (127) T protein:vir:98 2 TGMPALEVKLRSMSEKRWDRVANKNLTEMFNRAARPPGTPIGKNTKRHKSGELLRSRRLKKVNSSK-------------- 67 (127) T ss_pred cChHHHHHHHHHhhHHHHHHHHhhhhHHHHHHHHhccCCceeccccccCcccceeeeEEEEecCCc-------------- Confidence 333332333444456677788888888899999885 899 999999997765432100 Q ss_pred HHHHHHHhhccCCceEEEeeCchhhhhhhcCCCC---------CCCchhHHHHHHHHHHHHHHHHHh-hC Q lcl|NC_019539. 72 SNAANFVLNAADWHTFTLTNNLPYAQRLEYGWSQ---------QAPQGFVRVNVSRFQQLLNEEASK-VK 131 (131) Q Consensus 72 ~~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S~---------QAp~G~V~~a~~~~~~~v~~~~~e-~k 131 (131) .+.+=.......||+||||||+- +..+.++..++..-..+|.+-+++ +| T Consensus 68 -----------~~~vgp~g~t~dYapyvEyGTR~m~~~~~~gf~~aqp~l~paf~~Qk~iF~~DL~~l~k 126 (127) T protein:vir:98 68 -----------DVITGNFGYIKDYAPHVEYGHRIVRNGKQVGYANGTKYLFNNVKKQREIYRQDMLNELR 126 (127) T ss_pred -----------eEEeccCcccccccceeecceeeeecccccccccCccccccchHHHhHHHHHHHHHHhc Confidence 00011122357899999999983 447899999999988887665554 44 No 86 >protein:vir:1891 Length: 179 # NCBI annotation: gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037671;genbank:gi:9634129;genbank:GeneID:1262520 Probab=98.04 E-value=3.2e-08 Score=61.68 Aligned_cols=131 Identities=10% Similarity=0.062 Sum_probs=61.6 Q ss_pred Cccchh-HHHHHHHHH---HH-----HHHHHHHHHHHHHHHHHHhCCc-----cchhhccccccccCcccccccCCC--- Q lcl|NC_019539. 1 MSFALD-VSKFVEKAK---KN-----PEKVIRQVSIKLFSAIIKASPV-----DTGRFRMNWMASGGTPADGTTDAT--- 63 (131) Q Consensus 1 msf~~~-i~~~~~~~~---~~-----~~~~~r~~a~~~~~~vv~~tPV-----dtGr~R~nw~vs~~~~~~~~~~~~--- 63 (131) |+|.-. |+++..+++ +. +...+++.+.-+..+++.++|+ ++|.++.|-.+.-+.......... T Consensus 5 ~~~~i~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~~~~~~~~~g~~~~~ 84 (179) T protein:vir:18 5 VEVSLTGLESLLGKMEAVSEVTRNKAGRFALRKAANIIRDRARSNASRVDDPLTKEAIHKNIVASFSSKQFRRTGDLAFR 84 (179) T ss_pred EEEEeecHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccccchhhhhhheeecccccccccccceeEe Confidence 333211 333333332 22 2345566666677788888865 566777666554333221111100 Q ss_pred -CCCcchhHHHHHHHHhhccCCceEEE--------eeCchhhhhhhcCCCCCCCchhHHHHHHHHH------------HH Q lcl|NC_019539. 64 -DKAGTTATSNAANFVLNAADWHTFTL--------TNNLPYAQRLEYGWSQQAPQGFVRVNVSRFQ------------QL 122 (131) Q Consensus 64 -d~~g~~~~~~~~~~i~~~~~g~~i~i--------~Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~------------~~ 122 (131) ...+.............-+.+...|. .-+..|+.+||||+|.|+|..|++.++.+-. +. T Consensus 85 vgv~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~y~~fvEfGT~kmpa~PFlrPA~~~~~~~a~~~i~~~l~~~ 164 (179) T protein:vir:18 85 VGVMGGARQYANTKANVRKGRAGKTYKTSGDKGNPGGDTWYWRFLEFGTEHTSARPILRPAMNGVDNDVINVFSTEMGKA 164 (179) T ss_pred eecccccccccccccccccCcccccccccccccCCCCccceeEEeccCCCCCCCCccchhhHHhhHHHHHHHHHHHHHHH Confidence 00000000000000000011112221 2257899999999999999999998885332 22 Q ss_pred HHHHHHhhC Q lcl|NC_019539. 123 LNEEASKVK 131 (131) Q Consensus 123 v~~~~~e~k 131 (131) +++++++.+ T Consensus 165 i~k~lk~~~ 173 (179) T protein:vir:18 165 IDRAIRLAM 173 (179) T ss_pred HHHHHHhhc Confidence 333333333 No 87 >protein:vir:4347 Length: 164 # NCBI annotation: Orf14 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061510;genbank:gi:9635606;genbank:GeneID:1262873 Probab=97.97 E-value=3.6e-08 Score=61.42 Aligned_cols=127 Identities=12% Similarity=0.138 Sum_probs=62.9 Q ss_pred Cccch-hHHHHHHHHH---HH-----HHHHHHHHHHHHHHHHHHhCCc-----cchhhccccccccCcccccccCCCC-C Q lcl|NC_019539. 1 MSFAL-DVSKFVEKAK---KN-----PEKVIRQVSIKLFSAIIKASPV-----DTGRFRMNWMASGGTPADGTTDATD-K 65 (131) Q Consensus 1 msf~~-~i~~~~~~~~---~~-----~~~~~r~~a~~~~~~vv~~tPV-----dtGr~R~nw~vs~~~~~~~~~~~~d-~ 65 (131) |.|.- -|+++.++++ .. ....+++.+.-+..+++.++|+ ++|.++.|-.++...--........ . T Consensus 5 ~~~~i~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~~~~~~~~~~~~~~~ 84 (164) T protein:vir:43 5 VEFSITGLDSLLGKLDSVTDDVKRRGGRAALRKAAMIVVQAAKQGAEKVDDPGTGRSISDNIALRWNGRLFKRTGDLGFR 84 (164) T ss_pred eEEeeecHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccCCCccchhhhhhhhhcccCccccccceeEE Confidence 33332 1223333322 22 2345666676678888888887 6788888876643211110000000 0 Q ss_pred CcchhHHHHHHHHhhccCCceEEEeeCchhhhhhhcCCCCCCCchhHHHHHHHHHHHH------------HHHHHhhC Q lcl|NC_019539. 66 AGTTATSNAANFVLNAADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQLL------------NEEASKVK 131 (131) Q Consensus 66 ~g~~~~~~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~~~v------------~~~~~e~k 131 (131) -|.... ...............++.+|+.+||||+|.|+|+.|++.++.+-.+.+ +++++++. T Consensus 85 vg~~~~----~~~~~~~~~~~~~~~~~~~y~~f~EfGT~km~a~PFlrPA~~~~k~~~~~~~~~~l~~~i~ka~~k~~ 158 (164) T protein:vir:43 85 IGVLHG----AVLPKKGERSDKTANAPTPHWRLLEFGTEDMRAQPFMRSALADNIAEVTSTFVSEYEKGIDRAIKRAA 158 (164) T ss_pred eccccc----ccccccccccccCCCCCcceEEEeecCCCCCCCCcchhhhHHHhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 000000 000000011112344667899999999999999999998886443322 12222211 No 88 >protein:vir:1386 Length: 149 # NCBI annotation: Gp9 protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612838;genbank:gi:20065972;genbank:GeneID:935787 Probab=97.85 E-value=1.8e-07 Score=57.61 Aligned_cols=116 Identities=10% Similarity=0.141 Sum_probs=59.0 Q ss_pred Cc--cchh---HHHHHHHHHH---------HHHHHHHHHHHHHHHHHHHhCCccchhh---ccccccccCcccccccCCC Q lcl|NC_019539. 1 MS--FALD---VSKFVEKAKK---------NPEKVIRQVSIKLFSAIIKASPVDTGRF---RMNWMASGGTPADGTTDAT 63 (131) Q Consensus 1 ms--f~~~---i~~~~~~~~~---------~~~~~~r~~a~~~~~~vv~~tPVdtGr~---R~nw~vs~~~~~~~~~~~~ 63 (131) |+ |.-+ |+++.+++++ .....+++.+.-+..+++..+|++.+-- +..|..+-..-.......+ T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~~k~~~~Al~~ga~~v~~~~k~~aP~~~~~~~~~~~~~~~~~~~~d~i~~~~~ 80 (149) T protein:vir:13 1 MSDGWEIKFEGLDDLIKTFEQLGTEKENEDVEKSILKECGDLAKKTVAPLIHISDDNSKSGRKGSRPPGHAANNIPEPKI 80 (149) T ss_pred CCceeEEEeecHHHHHHHHHhcccHHHHHHHHHHHHHHHHHHHHHHHHHhCCccCCccccccccccccchhhhcceeccc Confidence 54 2222 3444444322 2335667777778888999999863311 1111111000000000011 Q ss_pred CCCcchhHHHHHHHHhhccCCceEEEe------eCchhhhhhhcCCCCCCCchhHHHHHHHHHHHHHHHHHh-hC Q lcl|NC_019539. 64 DKAGTTATSNAANFVLNAADWHTFTLT------NNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQLLNEEASK-VK 131 (131) Q Consensus 64 d~~g~~~~~~~~~~i~~~~~g~~i~i~------Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~~~v~~~~~e-~k 131 (131) ...+ -...+.+. .+..|+.++|||+|+|.|++|++.++.+....+.+++.+ ++ T Consensus 81 ~~~~---------------g~~~~~VG~~~~~~~~~~y~~f~E~GT~k~~a~pF~~pa~~~~~~~~~~~~~~~l~ 140 (149) T protein:vir:13 81 RKKK---------------GNLQCVVGWEKSDNTPFYYMKMEEWGTSERPPHHAFGKTNKILKRVYDNIAQKKYD 140 (149) T ss_pred cccc---------------ceeEEEeeccCCCCCccceeeeeccCccCCCCCccchHHHHHHHHHHHHHHHHHHH Confidence 1100 00123332 356899999999999999999999996664444333222 22 No 89 >protein:vir:80970 Length: 112 # NCBI annotation: gp10 # Family: family:all:899 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468396;genbank:gi:157324970;genbank:GeneID:5601405 Probab=97.56 E-value=1.4e-06 Score=52.68 Aligned_cols=98 Identities=17% Similarity=0.129 Sum_probs=66.4 Q ss_pred Ccc--chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHHHHHHHH Q lcl|NC_019539. 1 MSF--ALDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNAANFV 78 (131) Q Consensus 1 msf--~~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~~~~i 78 (131) |++ .-++..+..++.+..+.....++.++++.+-.-.|.|||.|++|=. .+ T Consensus 1 M~vkV~id~~~~~~~l~~a~~~aq~~~~~ev~~~~~~yVP~~tG~L~~s~~-~~-------------------------- 53 (112) T protein:vir:80 1 MPIKVRVDLSKAKGSVKKAKERGQFALINQAAADIALYVPFLSGDLSNQYV-IM-------------------------- 53 (112) T ss_pred CceeEEeehHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccCcccccee-ec-------------------------- Confidence 764 4556677667777666677778899999998899999999999821 10 Q ss_pred hhccCCceEEEeeCchhhhhhhcCCC--------CCCCchhHHHHHH----HHHHHHHHHHHhhC Q lcl|NC_019539. 79 LNAADWHTFTLTNNLPYAQRLEYGWS--------QQAPQGFVRVNVS----RFQQLLNEEASKVK 131 (131) Q Consensus 79 ~~~~~g~~i~i~Nn~pYa~~LEyG~S--------~QAp~G~V~~a~~----~~~~~v~~~~~e~k 131 (131) ..| .|..+.|||.++-||+. ..+..-|.+.+.. +|.+.+.+++++-= T Consensus 54 ---~~g---~I~y~tPYAr~qYY~~~~~~~~~~~p~ag~~W~erak~~~~~~~~~~~~k~~~~~l 112 (112) T protein:vir:80 54 ---NDK---EIMWTSIYARRLYNGINFNFTLTHHPLAGPKWDQRAKVDKLESWIEVAQKAVEEGL 112 (112) T ss_pred ---cCc---eEEecCchhhHhhhcccCCCCcCCCCCcchhhHHHHHhhhhHHHHHHHHHHHhhcC Confidence 011 47889999999999773 2566777765442 34444444444333 No 90 >protein:vir:102154 Length: 119 # NCBI annotation: phage protein, HK97 gp10 family # Family: family:all:10671 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699937;genbank:gi:110804042;genbank:GeneID:4206698 Probab=97.16 E-value=1.3e-06 Score=52.84 Aligned_cols=101 Identities=21% Similarity=0.208 Sum_probs=57.9 Q ss_pred Cccc--hhHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhH Q lcl|NC_019539. 1 MSFA--LDVSKFVE-------KAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTAT 71 (131) Q Consensus 1 msf~--~~i~~~~~-------~~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~ 71 (131) |+-- .=++++.. ..+..-.+.+++.+.-+..++...+|++||.++. +... .-+.| T Consensus 1 Ma~iel~G~del~~~l~~~g~~~~~ie~kAlk~g~e~I~~~~~~n~P~~tg~lkk---ik~~---------~kk~g---- 64 (119) T protein:vir:10 1 MASLEIEGFEEFEKFISEDMVLDESTKRKGIKAGITKIGKAIEKNSPIKSGRLSK---VKIR---------VKNTG---- 64 (119) T ss_pred CceeehhhHHHHHHHHHhhhhhhHHHHHHHHHHHhHHHHHHHhhcCCcccCCcce---eeee---------eecCc---- Confidence 4311 01222222 2233334577777777888999999999999984 3211 11112 Q ss_pred HHHHHHHhhccCCceEEEe---eCchhhhhhhcCCCCCCCc-hhHHHHHHHH----HHHH-HHHHHhhC Q lcl|NC_019539. 72 SNAANFVLNAADWHTFTLT---NNLPYAQRLEYGWSQQAPQ-GFVRVNVSRF----QQLL-NEEASKVK 131 (131) Q Consensus 72 ~~~~~~i~~~~~g~~i~i~---Nn~pYa~~LEyG~S~QAp~-G~V~~a~~~~----~~~v-~~~~~e~k 131 (131) .+.+. +..=|..++|||.|.|.+. ||+..++.+- .+++ ++..+++| T Consensus 65 --------------~~~VG~~ks~~fy~kF~EFGTSkm~a~~pF~~~a~~~~~~eA~~~~~~el~~~~r 119 (119) T protein:vir:10 65 --------------LATEGTASSSEFYDIFQNFGTSEQKAHVGYFDRAVDETTNEAVEEVAEIIFRKMR 119 (119) T ss_pred --------------eeEeccCCcchhhhhhccccccccCCCCCccccccccChHHHHHHHHHHHHHhcC Confidence 11111 1124999999999999998 9998877432 2222 22334455 No 91 >protein:vir:100223 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025034;genbank:gi:48697267;genbank:GeneID:2948321 Probab=97.15 E-value=5.3e-06 Score=49.51 Aligned_cols=107 Identities=15% Similarity=0.234 Sum_probs=66.7 Q ss_pred CccchhHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHhCCc-------cc---hhhccccccccCcccccccCCC Q lcl|NC_019539. 1 MSFALDVSKFVEKAKKNPE-------KVIRQVSIKLFSAIIKASPV-------DT---GRFRMNWMASGGTPADGTTDAT 63 (131) Q Consensus 1 msf~~~i~~~~~~~~~~~~-------~~~r~~a~~~~~~vv~~tPV-------dt---Gr~R~nw~vs~~~~~~~~~~~~ 63 (131) |+|...|.+|..++++... ++++..|.-....|...||- ++ |.++.+-.++-. .. T Consensus 1 ~~~~~~l~e~l~~lekl~~~~~~~~~k~tkaGA~v~~~~L~~~tp~~~~~~~~~~~~~~HlaD~I~~~~~--------~i 72 (139) T protein:vir:10 1 MDMDEALGQWLKQVSKAAQLSVSDQEKITKAGADVYAKELAETTKEKHPNTKGDGGKYGHLSEDISSAAG--------DI 72 (139) T ss_pred CCHHHHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHhcccccccCCCCCCCCCcccccceecCc--------cc Confidence 9999999999999987653 45566666666777788884 22 235555433311 11 Q ss_pred CC--CcchhHHHHHHHHhhccCCceEEEeeCchhhhhhhcCCCCCCCchhHHHHHHHHHH-HHHHHHHhhC Q lcl|NC_019539. 64 DK--AGTTATSNAANFVLNAADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQ-LLNEEASKVK 131 (131) Q Consensus 64 d~--~g~~~~~~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~~-~v~~~~~e~k 131 (131) |. .|..+. +++ .-.|+ |.++|+|++.|.|..||.-|.++... ++..+++++| T Consensus 73 dg~~~g~~~V--------G~~--~~~~~------Ahf~n~GT~~~~~~hFie~t~~e~~~ev~~a~~~~~k 127 (139) T protein:vir:10 73 DGDHNGSSTV--------GFH--NKAHI------ARFLNDGTKNIRADHFVDNARDDAKDAVFAAEAEKYQ 127 (139) T ss_pred ccccccccee--------CCC--CCcee------eeeeccCccccCCCchHHHHHHHHHHHHHHHHHHHHH Confidence 11 111110 111 01122 68999999999999999999888744 4455555555 No 92 >protein:vir:100887 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358767;genbank:gi:77999993;genbank:GeneID:3726158 Probab=97.12 E-value=5e-06 Score=49.64 Aligned_cols=109 Identities=15% Similarity=0.197 Sum_probs=68.9 Q ss_pred CccchhHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHhCCcc-------c---hhhccccccccCcccccccCCC Q lcl|NC_019539. 1 MSFALDVSKFVEKAKKNPE-------KVIRQVSIKLFSAIIKASPVD-------T---GRFRMNWMASGGTPADGTTDAT 63 (131) Q Consensus 1 msf~~~i~~~~~~~~~~~~-------~~~r~~a~~~~~~vv~~tPVd-------t---Gr~R~nw~vs~~~~~~~~~~~~ 63 (131) |.|...+.+|..++++... ++++..|.-+...|...||.. | +.++.|-.+|-...... T Consensus 1 v~~~~~lee~l~~i~kl~~~~~~~~~ki~kaGA~v~~e~L~~~tp~~~~~~~~~~~~~~HlaD~I~~s~~~~dg~----- 75 (139) T protein:vir:10 1 MDMDEALGQWLKQVSKAAELSISDQEKITKAGADVYAKKLAETTKEKHPNTKGDGGKYGHLSEDIRSAAGDIDGD----- 75 (139) T ss_pred CCHHHHHHHHHHHHHHhhccCHHHHHHHHHHHHHHHHHHHHHhcccccCcCCCCCCCCcchhhcceecCcccccc----- Confidence 9999999999998876653 355666666666688889962 2 46777776653111100 Q ss_pred CCCcchhHHHHHHHHhhccCCceEEEeeCchhhhhhhcCCCCCCCchhHHHHHHHHHHH-HHHHHHhhC Q lcl|NC_019539. 64 DKAGTTATSNAANFVLNAADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQL-LNEEASKVK 131 (131) Q Consensus 64 d~~g~~~~~~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~~~-v~~~~~e~k 131 (131) ..|..+ -++ -+.-=+|.++|+|++.|.|+.|++-|.++...- +..+++++| T Consensus 76 -~~g~~~--------VG~--------~k~~~~A~f~n~GT~k~~~~hFie~t~~e~~~evl~a~~~~~k 127 (139) T protein:vir:10 76 -HNGSST--------VGF--------HNKAHIARFLNDGTKYIRADHFVDNARDDAKDAVFAAEAEKYQ 127 (139) T ss_pred -cceeee--------eCC--------CCCcceEeecccCccccCCCchHHHHHHHHHHHHHHHHHHHHH Confidence 011111 011 112224789999999999999999998887544 444445555 No 93 >protein:vir:45 Length: 112 # NCBI annotation: gp10 # Family: family:all:899 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463471;swissprot:trembl:q9t1b3;genbank:gi:16798793;uniprot:Q9T1B3;genbank:GeneID:922369 Probab=97.10 E-value=1.1e-05 Score=47.79 Aligned_cols=98 Identities=18% Similarity=0.134 Sum_probs=64.4 Q ss_pred Cccc--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHHHHHHHH Q lcl|NC_019539. 1 MSFA--LDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNAANFV 78 (131) Q Consensus 1 msf~--~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~~~~i 78 (131) |+.. -++..+..++.+..+.....++.++++.+-.-.|.|+|.||+|=.+ + T Consensus 1 M~vkv~vn~~~~~~~l~~a~~r~q~~~~~ev~~~~~~yVP~~~G~L~~S~~~-~-------------------------- 53 (112) T protein:vir:45 1 MPIKVRVDLSKAKGSVKKAKERGQFALINQAAADIALYVPFLSGDLSNQYVI-M-------------------------- 53 (112) T ss_pred CceeEEeehHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCccccCccccceee-c-------------------------- Confidence 7744 4556666667666666777788899999988999999999997211 1 Q ss_pred hhccCCceEEEeeCchhhhhhhcCCC--------CCCCchhHHHHHH-HHHHHHHHHHHhhC Q lcl|NC_019539. 79 LNAADWHTFTLTNNLPYAQRLEYGWS--------QQAPQGFVRVNVS-RFQQLLNEEASKVK 131 (131) Q Consensus 79 ~~~~~g~~i~i~Nn~pYa~~LEyG~S--------~QAp~G~V~~a~~-~~~~~v~~~~~e~k 131 (131) ..| .|.++.|||.++=||.. ..+..-|.+.+.. ....|++-+.+.++ T Consensus 54 ---~~g---~I~y~tPYAr~qYY~~~~~~~~~~~p~ag~~W~erak~~~~~~~~~~~~k~~~ 109 (112) T protein:vir:45 54 ---NDK---EIMWTSIYARRLYKGINFNFTLTHHPLAGPEWDQRAKIDKMDVWEKVAQKAVE 109 (112) T ss_pred ---cCC---eEEecChhhHHhhhccccCCCCCCCCCCchhhHHHHHHhhHHHHHHHHHHHHh Confidence 112 47899999999988542 3466677764442 22333333334444 No 94 >protein:vir:4956 Length: 153 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049932;genbank:gi:9632903;genbank:GeneID:1262079 Probab=97.03 E-value=9.2e-06 Score=48.18 Aligned_cols=107 Identities=12% Similarity=0.138 Sum_probs=63.3 Q ss_pred Cc-cchhHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHhCCcc------c---hhhccccccccCcccccccCCC Q lcl|NC_019539. 1 MS-FALDVSKFVEKAKKNPE-------KVIRQVSIKLFSAIIKASPVD------T---GRFRMNWMASGGTPADGTTDAT 63 (131) Q Consensus 1 ms-f~~~i~~~~~~~~~~~~-------~~~r~~a~~~~~~vv~~tPVd------t---Gr~R~nw~vs~~~~~~~~~~~~ 63 (131) |+ |...|.+|.+++++-.. +.++..|.-+-..+..-||.. | |.++.|-.++- . T Consensus 1 M~~~~~glee~~~~lekL~~~~~~~~~katkAGA~v~~e~L~~~tp~~h~~~~kt~~~~HlaD~I~~s~----------~ 70 (153) T protein:vir:49 1 MTGLDEALEGWLKTVASIGDLTPAEQAKITTAGAKVFKEELAEVTREKHYSKKKDLKYGHMADGLAVQS----------T 70 (153) T ss_pred CccHHHHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCCCCCCcccccceecc----------c Confidence 54 88788999888875433 344444444444455566652 3 35555555431 1 Q ss_pred CCCcchhHHHHHHHHhhccCCceEEEeeC----chhhhhhhcCCCCCCCchhHHHHHHHH---HHHHHHHHHhhC Q lcl|NC_019539. 64 DKAGTTATSNAANFVLNAADWHTFTLTNN----LPYAQRLEYGWSQQAPQGFVRVNVSRF---QQLLNEEASKVK 131 (131) Q Consensus 64 d~~g~~~~~~~~~~i~~~~~g~~i~i~Nn----~pYa~~LEyG~S~QAp~G~V~~a~~~~---~~~v~~~~~e~k 131 (131) +..|.. .| ...+... .=||.++|+|++.|.|..||+-+..+- ..+++..+++.| T Consensus 71 ~idG~~-------------dG-~s~VG~~~~~~a~~a~f~n~GT~km~~~hFie~tr~e~~~k~~vl~A~~~~~~ 131 (153) T protein:vir:49 71 NADGRK-------------NG-VSTVGWKNNYHAQNARRLNDGTKKYRADHFITNVQNDSTVKNKVLLAEKEEYE 131 (153) T ss_pred cccccc-------------cc-eeeecccCCccceeeeecccCcccCCCChhhHHHHHHhhHHHHHHHHHHHHHH Confidence 111111 11 1122222 224899999999999999999888764 356766667776 No 95 >protein:vir:6246 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813700;swissprot:trembl:q859b7;genbank:gi:29366760;uniprot:Q859B7;genbank:GeneID:1258903 Probab=96.95 E-value=5.1e-06 Score=49.60 Aligned_cols=101 Identities=14% Similarity=0.315 Sum_probs=66.7 Q ss_pred Cc----cch---hHHHHHHHHH--------HHHHHHHHHHHHHHHHHHHHhCCc-----------cchhhccccccccCc Q lcl|NC_019539. 1 MS----FAL---DVSKFVEKAK--------KNPEKVIRQVSIKLFSAIIKASPV-----------DTGRFRMNWMASGGT 54 (131) Q Consensus 1 ms----f~~---~i~~~~~~~~--------~~~~~~~r~~a~~~~~~vv~~tPV-----------dtGr~R~nw~vs~~~ 54 (131) |+ +.- -+.+|...++ +++....+.+|.-++..+..-||+ .||+|.+|..++-.. T Consensus 1 ma~~~~~~vrV~Glr~f~~~mrK~~g~dl~k~lk~a~~~aa~v~~~~ar~~tP~g~r~~~~s~~~r~G~L~~Sir~aaT~ 80 (143) T protein:vir:62 1 MAQRSAYTIRVDGLREFQRNVRTLRDKELNKAVREANKASGEVLIPQAKHESPDGKRDAKSSKKYRPGKLDKSIKVTASA 80 (143) T ss_pred CCcccchheehHHHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccCcchhhccccccccc Confidence 54 222 2345554443 344456677777788888899999 699999999876311 Q ss_pred ccccccCCCCCCcchhHHHHHHHHhhccCCceEEEee--CchhhhhhhcCCCCCC--CchhHH------------HHHHH Q lcl|NC_019539. 55 PADGTTDATDKAGTTATSNAANFVLNAADWHTFTLTN--NLPYAQRLEYGWSQQA--PQGFVR------------VNVSR 118 (131) Q Consensus 55 ~~~~~~~~~d~~g~~~~~~~~~~i~~~~~g~~i~i~N--n~pYa~~LEyG~S~QA--p~G~V~------------~a~~~ 118 (131) -+-+|-+.- .+|||..++|||..+- |.-|+. +..++ T Consensus 81 ----------------------------raa~VrAG~~krVPYA~~I~~G~r~r~Isp~rFl~~a~a~te~~~~r~Ye~~ 132 (143) T protein:vir:62 81 ----------------------------KGAVIKAGSASRVPYAAAIHFGYRARNISPNRFLFRAMARKSDVVAATYERR 132 (143) T ss_pred ----------------------------cceeeeeCCcCCCCcccccccCcccccccchhhhhhhhhccCHHHHHHHHHH Confidence 112344444 7999999999998887 887774 44455 Q ss_pred HHHHHHHHHHh Q lcl|NC_019539. 119 FQQLLNEEASK 129 (131) Q Consensus 119 ~~~~v~~~~~e 129 (131) +.+++++.+.. T Consensus 133 i~~vl~k~l~s 143 (143) T protein:vir:62 133 IAAVVEKYLES 143 (143) T ss_pred HHHHHHHHhcC Confidence 55566655555 No 96 >protein:vir:98892 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:899 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164422;genbank:gi:56694912;genbank:GeneID:3197282 Probab=96.91 E-value=1.7e-05 Score=46.74 Aligned_cols=99 Identities=12% Similarity=0.101 Sum_probs=71.2 Q ss_pred CccchhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHHHHHHHHh Q lcl|NC_019539. 1 MSFALDVSKFVEKA-KKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNAANFVL 79 (131) Q Consensus 1 msf~~~i~~~~~~~-~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~~~~i~ 79 (131) |.+.-+++.....+ +..++.....++.++++.+-.-.|.|||.||+|=.++.+ T Consensus 2 mkvkv~~~~~~~~~~~~~~~~aq~~~~~ev~~~~~~yVP~~~G~L~~s~~~~s~-------------------------- 55 (108) T protein:vir:98 2 PKIRVELSGAKDKLSPQTQRRGQYAMANQMLQDMNQFVPMEEGILRLTGNISSD-------------------------- 55 (108) T ss_pred ceeEeeehHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcCcCCccccceeeccC-------------------------- Confidence 67887777776655 445566667788889999988999999999999655421 Q ss_pred hccCCceEEEeeCchhhhhhhcCC---C--CCCCchhHH-HHHHHHHHHHHHHHHhhC Q lcl|NC_019539. 80 NAADWHTFTLTNNLPYAQRLEYGW---S--QQAPQGFVR-VNVSRFQQLLNEEASKVK 131 (131) Q Consensus 80 ~~~~g~~i~i~Nn~pYa~~LEyG~---S--~QAp~G~V~-~a~~~~~~~v~~~~~e~k 131 (131) . =.|..+.|||.++=||. . ..+..-|.+ .-......+++-+.+.+| T Consensus 56 ---~---g~I~y~tPYAr~qYYg~~~n~~~p~ag~~W~eraka~~~~~~~~~~~k~~k 107 (108) T protein:vir:98 56 ---A---EEIYYNTPYAKRRFYEPAYNYTTPGTGPRWDMKAKRLFISDWERAYMKGAN 107 (108) T ss_pred ---C---ceEEecChhhHHhhhccccCCCCCCCcchhHHHHHhhhhHHHHHHHHHhhc Confidence 1 14888999999999883 2 334455665 444555677777788888 No 97 >protein:vir:79687 Length: 113 # NCBI annotation: hypothetical protein # Family: family:all:899 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285886;genbank:gi:148750843;genbank:GeneID:5220386 Probab=96.86 E-value=1.9e-05 Score=46.43 Aligned_cols=94 Identities=17% Similarity=0.194 Sum_probs=65.3 Q ss_pred chhHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHHHHHHHHhhcc Q lcl|NC_019539. 4 ALDVSKFVEKAKK-NPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNAANFVLNAA 82 (131) Q Consensus 4 ~~~i~~~~~~~~~-~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~~~~i~~~~ 82 (131) ..+|+.+.+++.. ........++.++++.+-.-.|.|||.||+|=.++. T Consensus 1 ~~dL~~~~~~~~~~~~~raQ~~l~~ev~~~~~pYVP~~~G~Lk~S~~i~s------------------------------ 50 (113) T protein:vir:79 1 MSDLSVFSRMAQSTGSRSVRLQVLNQMHQDMEQYVPKRAGFLRSQSFVND------------------------------ 50 (113) T ss_pred CchHHHHHHhhchhHHHHHHHHHHHHHHHhhcccCcccccchhccccccC------------------------------ Confidence 6688888777654 444566678888999999999999999999843221 Q ss_pred CCceEEEeeCchhhhhhhcCCCC----------CCCchhHHHHH----HHHHHHHHHHH-HhhC Q lcl|NC_019539. 83 DWHTFTLTNNLPYAQRLEYGWSQ----------QAPQGFVRVNV----SRFQQLLNEEA-SKVK 131 (131) Q Consensus 83 ~g~~i~i~Nn~pYa~~LEyG~S~----------QAp~G~V~~a~----~~~~~~v~~~~-~e~k 131 (131) =+|..++|||.++=||... .+..-|.+.+- .+|.+.+.+++ +.+| T Consensus 51 ----~~I~y~tPYAr~qyYg~~~~~~~~~~t~p~ag~~W~eraKa~h~~~w~~~~~~a~~~G~~ 110 (113) T protein:vir:79 51 ----TGIHYTAKYARAQFYGFVNGHRVRNYSTPGTGRRWDLKAKAVYKADWQKVAVAAFLKEAK 110 (113) T ss_pred ----CeeEecChhhhHhhccccCCCCccccCCCCCCchhhHHHHHHhHHHHHHHHHHHhhcccc Confidence 1388999999999997653 34456666544 34666655543 3333 No 98 >protein:vir:1332 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047931;swissprot:trembl:q9zxa7;genbank:gi:9631149;uniprot:Q9ZXA7;genbank:GeneID:2715891 Probab=96.73 E-value=6.5e-06 Score=49.01 Aligned_cols=101 Identities=13% Similarity=0.303 Sum_probs=65.7 Q ss_pred Cc----cch---hHHHHHHHHH--------HHHHHHHHHHHHHHHHHHHHhCCcc-----------chhhccccccccCc Q lcl|NC_019539. 1 MS----FAL---DVSKFVEKAK--------KNPEKVIRQVSIKLFSAIIKASPVD-----------TGRFRMNWMASGGT 54 (131) Q Consensus 1 ms----f~~---~i~~~~~~~~--------~~~~~~~r~~a~~~~~~vv~~tPVd-----------tGr~R~nw~vs~~~ 54 (131) |+ +.- -+..|...++ +++....+++|.-++..+..-||+- +|+|.+|..++-.. T Consensus 1 ma~~~~~~vkV~Glr~f~~~mrK~~g~dl~k~lk~a~~~aa~v~~~~ar~~tP~g~~~p~~srr~r~G~L~~Sir~aaT~ 80 (143) T protein:vir:13 1 MAQRSAYTIQVDGLRQFQRNVRALRDKELNKAVREANKASGEVLIPQAKHESPDGHRDPKSSKRYRPGKLDKSIKVTASA 80 (143) T ss_pred CCcccchheehHHHHHHHHHHHHhhCCcchHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccchhhccccccccc Confidence 54 222 1334444433 4445566777877888888899997 89999999876311 Q ss_pred ccccccCCCCCCcchhHHHHHHHHhhccCCceEEEee--CchhhhhhhcCCCCCC--CchhHH------------HHHHH Q lcl|NC_019539. 55 PADGTTDATDKAGTTATSNAANFVLNAADWHTFTLTN--NLPYAQRLEYGWSQQA--PQGFVR------------VNVSR 118 (131) Q Consensus 55 ~~~~~~~~~d~~g~~~~~~~~~~i~~~~~g~~i~i~N--n~pYa~~LEyG~S~QA--p~G~V~------------~a~~~ 118 (131) -+-+|-+.- .+|||..++|||..+- |.-|+. +..++ T Consensus 81 ----------------------------raa~VrAGr~arVPYA~~I~~G~r~r~Is~~rFl~~a~a~te~~~~r~Ye~~ 132 (143) T protein:vir:13 81 ----------------------------KGAVIKAGSAARVPYAAAIHFGYRKRNISANRFLYRAMARKSDVVAATYERR 132 (143) T ss_pred ----------------------------cceeeeecCcCCCCcccccccCCcccccchhhhhhhhhhccCHHHHHHHHHH Confidence 112344442 4899999999998887 777774 44455 Q ss_pred HHHHHHHHHHh Q lcl|NC_019539. 119 FQQLLNEEASK 129 (131) Q Consensus 119 ~~~~v~~~~~e 129 (131) +.+++++.+.. T Consensus 133 i~~vl~k~l~s 143 (143) T protein:vir:13 133 IAAVVEKYLES 143 (143) T ss_pred HHHHHHHHhcC Confidence 56666655555 No 99 >protein:vir:4790 Length: 114 # NCBI annotation: putative minor capsid protein 3 # Family: family:all:899 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150170;swissprot:trembl:q94m41;genbank:gi:15088781;uniprot:Q94M41;genbank:GeneID:955992 Probab=96.61 E-value=4e-05 Score=44.70 Aligned_cols=97 Identities=18% Similarity=0.193 Sum_probs=63.4 Q ss_pred Cccc--hhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHHHHHHH Q lcl|NC_019539. 1 MSFA--LDVSKFVEKA-KKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNAANF 77 (131) Q Consensus 1 msf~--~~i~~~~~~~-~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~~~~ 77 (131) |+.. -+++.+...+ .+.+......++.++++.+-.-.|.|||.||+|=.+..+ T Consensus 1 M~~kVkv~l~~~~~~l~~~~l~r~Q~~~~~ev~~~~~~YVP~~~G~L~~S~~~~~~------------------------ 56 (114) T protein:vir:47 1 MNIAIKVDLQKAKQKLSNESMTRGKIAVASKILLDNEQYIPLRGGELRASGRIVGQ------------------------ 56 (114) T ss_pred CceeEEeehhHHHHHHHHHHHHHHHHHHHHHHHHhhccCCcCccCccccceeeeeC------------------------ Confidence 6655 5556666665 345566667788889999988999999999997544321 Q ss_pred HhhccCCceEEEeeCchhhhhhhcCCC----------CCCCchhHHHHHHH----HHHHHHHHHHh Q lcl|NC_019539. 78 VLNAADWHTFTLTNNLPYAQRLEYGWS----------QQAPQGFVRVNVSR----FQQLLNEEASK 129 (131) Q Consensus 78 i~~~~~g~~i~i~Nn~pYa~~LEyG~S----------~QAp~G~V~~a~~~----~~~~v~~~~~e 129 (131) .=.|.++.|||.++=||+- .++..-|.+.+... |.+.+.+.+.- T Consensus 57 --------~~~I~y~tPYAr~qyYg~~~~~~~~~~~~p~~g~~W~eraka~~~~~~~~~~~k~~g~ 114 (114) T protein:vir:47 57 --------GDAVVYGTVYARAQFYGSNGIVTFRRYTTPGTGKRWDQVATSKHAEEWARAFVKGMGL 114 (114) T ss_pred --------CcEEEecCchhhHhhhcccCCCCCCccCCCCCcchhHHHHHhhhhHHHHHHHHHhhCC Confidence 0148899999999999762 35666777655433 22223222222 No 100 >protein:vir:3163 Length: 145 # NCBI annotation: unknown # Family: family:all:28417 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665934;genbank:gi:22091120;genbank:GeneID:951270 Probab=96.47 E-value=2.5e-05 Score=45.78 Aligned_cols=105 Identities=11% Similarity=0.181 Sum_probs=62.0 Q ss_pred CccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----hCCc----------------------cchhhccccccccC Q lcl|NC_019539. 1 MSFALDVSKFVEKAKKNPEKVIRQVSIKLFSAIIK-----ASPV----------------------DTGRFRMNWMASGG 53 (131) Q Consensus 1 msf~~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~-----~tPV----------------------dtGr~R~nw~vs~~ 53 (131) +-=..+|.+-.+.+..++...+++++..++..+.. .+|. |||+||+|++-+ T Consensus 2 i~~~~~i~~~l~~l~~~~~~~l~~i~~~~~~~~~~rf~~~~~p~G~~W~pLs~st~a~k~~~~~L~~tG~L~~Si~~~-- 79 (145) T protein:vir:31 2 VEDENNIPEAREAIQDGLTDGLERLHTITLRELITNMSDGQDALGNPWEPLKESTIRAKGSDTPLIDNSRLLTDINAA-- 79 (145) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccChHHHHHhcCCCCCccCHHHHHHHHHH-- Confidence 33444555555555555555666666666665543 2332 333333333222 Q ss_pred cccccccCCCCCCcchhHHHHHHHHhhccCCceEEEeeCchhhhhhhcCCCC--CCCchhHHHHHH----HHHHHHHHHH Q lcl|NC_019539. 54 TPADGTTDATDKAGTTATSNAANFVLNAADWHTFTLTNNLPYAQRLEYGWSQ--QAPQGFVRVNVS----RFQQLLNEEA 127 (131) Q Consensus 54 ~~~~~~~~~~d~~g~~~~~~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S~--QAp~G~V~~a~~----~~~~~v~~~~ 127 (131) +.....++.+.|..|++||...++|..+ ..|..|+.++.. ++..++.+.+ T Consensus 80 ------------------------~~~~~~~~~a~vGtn~~YA~~hqfG~~~~~IPaRPfLG~~~~~~~~~~~~ii~~~i 135 (145) T protein:vir:31 80 ------------------------SMMDRANRMAVIGTNLDYAEHHEFGAPEAGIPARPIFGPAGAYASQQAPDVIGDEI 135 (145) T ss_pred ------------------------hhhcccCceeEecCCchhhhhhccCCcccccCCCCccCCCccchHHHHHHHHHHHH Confidence 2222345678899999999999999975 889999988664 3444444333 Q ss_pred H-hhC Q lcl|NC_019539. 128 S-KVK 131 (131) Q Consensus 128 ~-e~k 131 (131) . -++ T Consensus 136 ~~~L~ 140 (145) T protein:vir:31 136 DTNLE 140 (145) T ss_pred HHHhh Confidence 3 233 No 101 >protein:vir:5000 Length: 141 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049974;genbank:gi:9632946;genbank:GeneID:1262109 Probab=96.45 E-value=5.3e-05 Score=44.00 Aligned_cols=107 Identities=14% Similarity=0.154 Sum_probs=65.2 Q ss_pred CccchhHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHhCCc---------cchhhccccccccCcccccccCCCC Q lcl|NC_019539. 1 MSFALDVSKFVEKAKKNP-------EKVIRQVSIKLFSAIIKASPV---------DTGRFRMNWMASGGTPADGTTDATD 64 (131) Q Consensus 1 msf~~~i~~~~~~~~~~~-------~~~~r~~a~~~~~~vv~~tPV---------dtGr~R~nw~vs~~~~~~~~~~~~d 64 (131) -+|...+.+|.+++++-. .+.++.-|.-+...+...||. ..|.++.|-.++-. ..| T Consensus 2 ~~~~~gl~e~~~~lekl~~~~~~~~~katkAGA~v~~~~L~~~tp~~hy~~~~~~~~~HlaD~I~~~~~--------~~D 73 (141) T protein:vir:50 2 VGLAEALDEWLKTVASIGNLTPAEQVEITTAGAKVFKKELEEVTREKHYSRKKNPKFGHMADGLAIQST--------NAD 73 (141) T ss_pred ccHHHHHHHHHHHHHHhcCCCHHHHHHHHHHHHHHHHHHHHHhcccCCCCCCCCCCCCccccceeeccC--------ccc Confidence 448888999999888655 234444443344445556664 24456666655421 111 Q ss_pred C--CcchhHHHHHHHHhhccCCceEEEeeC--chhhhhhhcCCCCCCCchhHHHHHHHH---HHHHHHHHHhhC Q lcl|NC_019539. 65 K--AGTTATSNAANFVLNAADWHTFTLTNN--LPYAQRLEYGWSQQAPQGFVRVNVSRF---QQLLNEEASKVK 131 (131) Q Consensus 65 ~--~g~~~~~~~~~~i~~~~~g~~i~i~Nn--~pYa~~LEyG~S~QAp~G~V~~a~~~~---~~~v~~~~~e~k 131 (131) . .|..+ |=..|. .=+|.+||+|++.|.|..||+-+.++. ..+++..++++| T Consensus 74 G~~dg~s~----------------VG~~~~~~~~~A~f~n~GT~k~~~~hFve~~~~~a~~k~~Vl~A~~~~~k 131 (141) T protein:vir:50 74 GRKNGVST----------------VGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSTVQKKVLLEKKRNTK 131 (141) T ss_pred cccCCeee----------------eccCCCccceeeeccccCccccCCCchhHHHHHhhhhHHHHHHHHHHHHH Confidence 1 11111 111122 225789999999999999999999753 467777788888 No 102 >protein:vir:7449 Length: 123 # NCBI annotation: gp26 # Family: family:all:2713 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818564;genbank:gi:29567001;genbank:GeneID:1260238 Probab=96.06 E-value=9.6e-05 Score=42.60 Aligned_cols=104 Identities=14% Similarity=0.203 Sum_probs=70.5 Q ss_pred CccchhHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHhCCc--cchhhccccccccCcccccccCCCCCCcchhH Q lcl|NC_019539. 1 MSFALDVSKFVEKA-------KKNPEKVIRQVSIKLFSAIIKASPV--DTGRFRMNWMASGGTPADGTTDATDKAGTTAT 71 (131) Q Consensus 1 msf~~~i~~~~~~~-------~~~~~~~~r~~a~~~~~~vv~~tPV--dtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~ 71 (131) |.|.-++.++.+++ ...+.......|..+...++..+|= .||..|....-++.. .|. T Consensus 4 ~~f~~d~~~l~~~i~~~~~k~~~~~~~~~d~~a~~le~~aK~nApW~DRTg~ARqgl~~~~~~-----------~g~--- 69 (123) T protein:vir:74 4 VTFEYDAQELRTNIRNLDRRMESAVDALMDYEAAYATGQLKMRAPWTDRTGAARSGLLAVANK-----------LGP--- 69 (123) T ss_pred eEEEecHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccchhhhhhhcccccc-----------CCC--- Confidence 66766665555544 4445556666777888899999996 599999887544321 110 Q ss_pred HHHHHHHhhccCCceEEEeeCchhhhhhhcCCCCCCCchhHHHHHHHHHHH----HHHHHHhhC Q lcl|NC_019539. 72 SNAANFVLNAADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQL----LNEEASKVK 131 (131) Q Consensus 72 ~~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~~~----v~~~~~e~k 131 (131) .--+||++-+++|..+||.+++++ ...++.|++.+... +++.+.+++ T Consensus 70 -----------~~~~Iylsh~veYG~~LEla~~~k--yaIi~Ptv~~~~~~im~g~~~ll~~l~ 120 (123) T protein:vir:74 70 -----------GSHELIMSYSVHYGIWLEIANSGQ--YAVIGPFLPVMGRKLMHDLEHLIDRLE 120 (123) T ss_pred -----------ceEEEEEecCeeecceeeecCCCC--ceeecchHHHHhHHHHHHHHHHHHHhh Confidence 114899999999999999988754 34667777666443 455566666 No 103 >protein:vir:1581 Length: 116 # NCBI annotation: minor capsid protein # Family: family:all:899 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695163;swissprot:trembl:o03933;genbank:gi:23455806;uniprot:O03933;genbank:GeneID:955512 Probab=96.05 E-value=0.00012 Score=42.11 Aligned_cols=99 Identities=15% Similarity=0.193 Sum_probs=65.3 Q ss_pred Cc--cchhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhCCccchh--hccccccccCcccccccCCCCCCcchhHHHHH Q lcl|NC_019539. 1 MS--FALDVSKFVEKA-KKNPEKVIRQVSIKLFSAIIKASPVDTGR--FRMNWMASGGTPADGTTDATDKAGTTATSNAA 75 (131) Q Consensus 1 ms--f~~~i~~~~~~~-~~~~~~~~r~~a~~~~~~vv~~tPVdtGr--~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~~ 75 (131) |+ ..-+++.+..++ .+..+.....++.++++.+-.-.|.|||. +|.+-.+..+ T Consensus 1 M~ikVkv~l~~~~~~~~~~~~~r~Q~~l~~qv~~~m~~YVP~~tg~~~ls~~~~~~~~---------------------- 58 (116) T protein:vir:15 1 MAFRINVDLDGFMDQTSLDNVKRGQYALVNQAMYDMEQFVPKDRPEEPLRQSVHATSD---------------------- 58 (116) T ss_pred CCceEEeehhHhhhhhhHHHHHHHHHHHHHHHHHhhhccCCcccCCcccccceeeecC---------------------- Confidence 55 556677777776 45667777888999999999999999977 4443322211 Q ss_pred HHHhhccCCceEEEeeCchhhhhhhcCCC-----------CCCCchhHHHHH-HHHHHHHHHHHHhhC Q lcl|NC_019539. 76 NFVLNAADWHTFTLTNNLPYAQRLEYGWS-----------QQAPQGFVRVNV-SRFQQLLNEEASKVK 131 (131) Q Consensus 76 ~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S-----------~QAp~G~V~~a~-~~~~~~v~~~~~e~k 131 (131) .-+|.+++|||.++=||+- .+|..-|-+.+- .-...+.+-+.+++| T Consensus 59 ----------~~~I~y~tPYAr~qyYg~~~~~~~~~~~t~p~ag~~W~eraK~~h~~~w~~~~~k~~~ 116 (116) T protein:vir:15 59 ----------GSEITYSTPYAKAQFYGIINDKYPVHNYTTPGTTKRWDLKAKSMFMSSWIDTFTKGMK 116 (116) T ss_pred ----------CceEEecCchhHHHhcccccCCCCcccccCCCCCcchhHHHHhhhHHHHHHHHHHhcC Confidence 1358899999999988762 344556665443 333444445555555 No 104 >protein:vir:4200 Length: 133 # NCBI annotation: unknown # Family: family:all:11764 # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071825;genbank:gi:11863108;genbank:GeneID:1257610 Probab=95.99 E-value=3.3e-05 Score=45.17 Aligned_cols=92 Identities=20% Similarity=0.206 Sum_probs=56.2 Q ss_pred Cccc-hhHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHHHHHH Q lcl|NC_019539. 1 MSFA-LDVSKF---VEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNAAN 76 (131) Q Consensus 1 msf~-~~i~~~---~~~~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~~~ 76 (131) +--+ +.+++. ..++++++++.+.++-.++-.-+.-..||.||.||-|+..|+.+ T Consensus 2 i~i~idkp~almek~~ev~~~ie~t~~~~~~~l~~i~~ntapiktg~lr~sh~~sieg---------------------- 59 (133) T protein:vir:42 2 IEIRIDKPDALMEKPHEVQGKIEETLEKILNQLQGIAENTAPVKTGNLRDSHIISIEG---------------------- 59 (133) T ss_pred eeeecCCchhhhcchhhhhhHHHHHHHHHHHHHHHHhhhccccccccceeeeeEEeec---------------------- Confidence 1111 222222 23456666666655555554444556899999999999999753 Q ss_pred HHhhccCCceEEEeeCchhhhhhhcCC--------------------------------C----CCCCchhHHHHHHHHH Q lcl|NC_019539. 77 FVLNAADWHTFTLTNNLPYAQRLEYGW--------------------------------S----QQAPQGFVRVNVSRFQ 120 (131) Q Consensus 77 ~i~~~~~g~~i~i~Nn~pYa~~LEyG~--------------------------------S----~QAp~G~V~~a~~~~~ 120 (131) .+=.++|.+||-+.+=+|. | --+|.|+|+-++-+|- T Consensus 60 --------stgelsn~~~yl~~vl~grgwvfpv~~kal~wpelphpvayarpappndyfsa~vay~~~~give~s~iewl 131 (133) T protein:vir:42 60 --------STGELSNLAYYLPFVLHGRGWVFPVRRKALWWPELPHPVAYARPAPPNDYFSAVVAYSAPEGVVEETLIEWL 131 (133) T ss_pred --------CccchhhhhHHhhHhhhcccceeeccccccccCCCCCcccccCCCCCchhhhhhhhhhcccchhHHHHHHHH Confidence 2344677777777766643 1 2346778887777775 Q ss_pred HH Q lcl|NC_019539. 121 QL 122 (131) Q Consensus 121 ~~ 122 (131) +. T Consensus 132 re 133 (133) T protein:vir:42 132 RE 133 (133) T ss_pred hC Confidence 54 No 105 >protein:vir:96288 Length: 100 # NCBI annotation: ORF049 # Family: family:all:180 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240315;genbank:gi:66396010;genbank:GeneID:5133365 Probab=95.77 E-value=7.2e-05 Score=43.30 Aligned_cols=80 Identities=23% Similarity=0.286 Sum_probs=52.8 Q ss_pred Cc--------cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHH Q lcl|NC_019539. 1 MS--------FALDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATS 72 (131) Q Consensus 1 ms--------f~~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~ 72 (131) |+ -..+++.|-++++..+++.+-+.+.++....+...|||||.||.|-.+.. +.| T Consensus 13 makvkyG~~dmvk~~~~f~~~i~~~vk~~IakTa~~I~~~Avs~APVD~G~Lk~SI~~dy------------k~G----- 75 (100) T protein:vir:96 13 MAKVKYGADSMVVELDKFDKKIEEWVKKGIAKTTTKIYNTAVALAPVDLGFLEESIDFKY------------FDG----- 75 (100) T ss_pred hhhheechHHHHHHHhcchHHHHHHHHHHHHHHHHHHHhhHHhhccccccccceeeeeee------------ecC----- Confidence 21 23567778888888888999999999999999999999999999986642 111 Q ss_pred HHHHHHhhccCCceEEEeeCchhhhhhhcCCCCCCCchhHHHHHHHHHHHHHHHH Q lcl|NC_019539. 73 NAANFVLNAADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQLLNEEA 127 (131) Q Consensus 73 ~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~~~v~~~~ 127 (131) |-+--|+=...||- .|+.|.+-..+ T Consensus 76 -----------GltavI~vGAeYAI-------------------krmsqllvtvi 100 (100) T protein:vir:96 76 -----------GLSSVISVGADYAI-------------------KRMSQLLVTVI 100 (100) T ss_pred -----------CeeEEEecchhHHH-------------------HHHHHHHhhcC Confidence 11233444455554 23333333333 No 106 >protein:vir:4833 Length: 140 # NCBI annotation: ORF29 # Family: family:all:1029 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038330;genbank:gi:9634656;genbank:GeneID:1262624 Probab=95.69 E-value=0.00022 Score=40.66 Aligned_cols=109 Identities=11% Similarity=0.118 Sum_probs=64.1 Q ss_pred CccchhHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHhCCcc------c---hhhccccccccCcccccccCCCC Q lcl|NC_019539. 1 MSFALDVSKFVEKAKKNPE-------KVIRQVSIKLFSAIIKASPVD------T---GRFRMNWMASGGTPADGTTDATD 64 (131) Q Consensus 1 msf~~~i~~~~~~~~~~~~-------~~~r~~a~~~~~~vv~~tPVd------t---Gr~R~nw~vs~~~~~~~~~~~~d 64 (131) -+|...|.+|.+++++-.. ++++.-|.-+...+..-||.. | |.++.|-.++-. ..| T Consensus 2 ~~~~d~l~e~~~~v~kl~~~~~~~~~katkAGAkv~~~~L~~~tp~~h~~~r~t~~~~HlaD~I~~~~~--------~id 73 (140) T protein:vir:48 2 TGLDEALEGWLKTVASIGDLTPAEQAKITTAGAKVFKKELAEVTREKHYSKKKDLKYGHMADGLAVQST--------NVD 73 (140) T ss_pred ccHHHHHHHHHHHHHHhccCCHHHHHHHHHHhHHHHHHHHHHhcccCCCCCCCCCCCCcccccceeccc--------ccc Confidence 3488889999988865433 234444444455556667752 3 346666655411 111 Q ss_pred C--CcchhHHHHHHHHhhccCCceEEEeeCchhhhhhhcCCCCCCCchhHHHHHHHH---HHHHHHHHHhhC Q lcl|NC_019539. 65 K--AGTTATSNAANFVLNAADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRF---QQLLNEEASKVK 131 (131) Q Consensus 65 ~--~g~~~~~~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~---~~~v~~~~~e~k 131 (131) . .|+.+ -+++.. +..=+|.+||+|+|.|.|..||.-|.++- ..+++....+.| T Consensus 74 g~~dG~s~--------VG~~k~------~~a~~a~f~NdGT~k~~~~hFve~t~~e~~~~~~vl~A~~~~y~ 131 (140) T protein:vir:48 74 GRKNGVAT--------VGWKNN------YHAQNARRLNDGTKKYRADHFVTNVQNDSAVRDKVLLAEKEEYE 131 (140) T ss_pred ccccccee--------ecccCC------CceeEEeecccCccccCCCchHHHHHHhhhhHHHHHHHHHHHHH Confidence 1 11111 011111 11234789999999999999999999753 566777666766 No 107 >protein:vir:1988 Length: 156 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050635;genbank:gi:9633522;genbank:GeneID:2636282 Probab=95.54 E-value=0.00014 Score=41.64 Aligned_cols=125 Identities=16% Similarity=0.202 Sum_probs=59.9 Q ss_pred CccchhHHHHHHHHHH---H--HHHHHHHHHHHHHHHHHHh-----CCccchhhccccccccCccccccc-CCCCCCcch Q lcl|NC_019539. 1 MSFALDVSKFVEKAKK---N--PEKVIRQVSIKLFSAIIKA-----SPVDTGRFRMNWMASGGTPADGTT-DATDKAGTT 69 (131) Q Consensus 1 msf~~~i~~~~~~~~~---~--~~~~~r~~a~~~~~~vv~~-----tPVdtGr~R~nw~vs~~~~~~~~~-~~~d~~g~~ 69 (131) +++..+.+++.+.+.. . ...++++++..+...+..+ +| |+|. .|..= +|.+... ......+.. T Consensus 5 i~~~~d~~~l~~~L~~l~~~~~~~~l~~~Ig~~l~~~~~~rf~~~~~P-d~G~---~W~pl--s~~t~~~r~~~~~~~~~ 78 (156) T protein:vir:19 5 MNVAVDVRRIQLALDELGTVTRDRAIPRVMAAALLSSTEQAFERQADP-DTGK---GWEAW--SDSWLAWRQDHGFVPGS 78 (156) T ss_pred EEEeecHHHHHHHHHHHHhhhccHHHHHHHHHHHHHHHHHHHHhcCCC-CCCC---CCccc--ChHHHHHhhccCCCCCc Confidence 3355455555554432 1 1236677776666666542 22 2231 23110 0000000 000001111 Q ss_pred hH---HHHHHHHhhccCCceEEEeeCchhhhhhhcCCCCC--------CCchhHHHHHHHHHHHHHHHHHhhC Q lcl|NC_019539. 70 AT---SNAANFVLNAADWHTFTLTNNLPYAQRLEYGWSQQ--------APQGFVRVNVSRFQQLLNEEASKVK 131 (131) Q Consensus 70 ~~---~~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S~Q--------Ap~G~V~~a~~~~~~~v~~~~~e~k 131 (131) .+ +....-|.....++.+.|++|++||..-+||-..+ ....|+.++-.....|.+-...-++ T Consensus 79 ~L~~tg~L~~Si~~~~~~~~v~vGt~~~yA~vHqfG~~~~~~~~~~~iPaRpfLG~s~~d~~~I~~~i~~~l~ 151 (156) T protein:vir:19 79 ILTLHGDLARSITTDYGQDYALIGSPKIYAAIHQWGGTPDMAPRPAGVPARPYMGLDKTGEQEIFDAIRKRVS 151 (156) T ss_pred chhhhHHHHHHhhheecCCEEEEecchhhhHHhhcCcccccCCCccccCCccccCCCHHHHHHHHHHHHHHHH Confidence 11 11222222223457788999999999999998754 3456666776665555444444444 No 108 >protein:vir:9823 Length: 118 # NCBI annotation: putative minor capsid protein # Family: family:all:899 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795585;genbank:gi:28876336;genbank:GeneID:1257873 Probab=95.42 E-value=3.9e-05 Score=44.76 Aligned_cols=96 Identities=15% Similarity=0.126 Sum_probs=67.7 Q ss_pred CccchhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHHHHHHHHh Q lcl|NC_019539. 1 MSFALDVSKFVEKAK-KNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNAANFVL 79 (131) Q Consensus 1 msf~~~i~~~~~~~~-~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~~~~i~ 79 (131) |...-+++.+...+. +.++.....++.++++.+-.-.|.|||.||+|=.++. T Consensus 2 ~kV~vdl~~~~~~ls~~~~~k~Q~~~~~ev~~~~~~YVP~~tG~Lk~S~~i~~--------------------------- 54 (118) T protein:vir:98 2 AKVVVELGGIKRKVSPQALAKGKLIMNNQVMMSMNPYVPYRDGALRGSSRANS--------------------------- 54 (118) T ss_pred ceeeechhHHhhhhhHHHHHHHHHHHHHHHHHHhhcCCCCccCccccceeecC--------------------------- Confidence 778899998888874 5667777888889999998899999999999843221 Q ss_pred hccCCceEEEeeCchhhhhhhcCCC--------------CCCCchhHHHH------HHHHHHHHHHHHHhhC Q lcl|NC_019539. 80 NAADWHTFTLTNNLPYAQRLEYGWS--------------QQAPQGFVRVN------VSRFQQLLNEEASKVK 131 (131) Q Consensus 80 ~~~~g~~i~i~Nn~pYa~~LEyG~S--------------~QAp~G~V~~a------~~~~~~~v~~~~~e~k 131 (131) + .|.++.|||.++=||+- .++..-|-... ...|.+++.+.+. +| T Consensus 55 -----~--~I~Y~tPYAr~qYY~~~~~~~~g~~~~~~~~p~~g~~Wd~R~ka~~~~~~~w~~~~~k~~g-~k 118 (118) T protein:vir:98 55 -----V--GVTWSGPHARAQFYGGAYNKYKSFKFKKYTTPGTGKRWDKRALANATIVKDWEKSLLRGMG-FK 118 (118) T ss_pred -----C--eeEECCchhhHhhhccccCCCCccccccccCCCCCCcccchhhcchhhhHHHHHHHHHhcC-CC Confidence 1 37899999999999752 13455554321 2445555555543 33 No 109 >protein:vir:3036 Length: 118 # NCBI annotation: minor capsid protein # Family: family:all:899 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438149;genbank:gi:16271812;genbank:GeneID:929237 Probab=95.42 E-value=3.9e-05 Score=44.76 Aligned_cols=96 Identities=15% Similarity=0.126 Sum_probs=67.7 Q ss_pred CccchhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHHHHHHHHh Q lcl|NC_019539. 1 MSFALDVSKFVEKAK-KNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNAANFVL 79 (131) Q Consensus 1 msf~~~i~~~~~~~~-~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~~~~i~ 79 (131) |...-+++.+...+. +.++.....++.++++.+-.-.|.|||.||+|=.++. T Consensus 2 ~kV~vdl~~~~~~ls~~~~~k~Q~~~~~ev~~~~~~YVP~~tG~Lk~S~~i~~--------------------------- 54 (118) T protein:vir:30 2 AKVVVELGGIKRKVSPQALAKGKLIMNNQVMMSMNPYVPYRDGALRGSSRANS--------------------------- 54 (118) T ss_pred ceeeechhHHhhhhhHHHHHHHHHHHHHHHHHHhhcCCCCccCccccceeecC--------------------------- Confidence 778899998888874 5667777888889999998899999999999843221 Q ss_pred hccCCceEEEeeCchhhhhhhcCCC--------------CCCCchhHHHH------HHHHHHHHHHHHHhhC Q lcl|NC_019539. 80 NAADWHTFTLTNNLPYAQRLEYGWS--------------QQAPQGFVRVN------VSRFQQLLNEEASKVK 131 (131) Q Consensus 80 ~~~~g~~i~i~Nn~pYa~~LEyG~S--------------~QAp~G~V~~a------~~~~~~~v~~~~~e~k 131 (131) + .|.++.|||.++=||+- .++..-|-... ...|.+++.+.+. +| T Consensus 55 -----~--~I~Y~tPYAr~qYY~~~~~~~~g~~~~~~~~p~~g~~Wd~R~ka~~~~~~~w~~~~~k~~g-~k 118 (118) T protein:vir:30 55 -----V--GVTWSGPHARAQFYGGAYNKYKSFKFKKYTTPGTGKRWDKRALANATIVKDWEKSLLRGMG-FK 118 (118) T ss_pred -----C--eeEECCchhhHhhhccccCCCCccccccccCCCCCCcccchhhcchhhhHHHHHHHHHhcC-CC Confidence 1 37899999999999752 13455554321 2445555555543 33 No 110 >protein:vir:4859 Length: 140 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049399;genbank:gi:9632427;genbank:GeneID:1258496 Probab=95.36 E-value=0.00043 Score=39.05 Aligned_cols=107 Identities=12% Similarity=0.116 Sum_probs=64.7 Q ss_pred CccchhHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHhCCcc---------chhhccccccccCcccccccCCCC Q lcl|NC_019539. 1 MSFALDVSKFVEKAKKNPE-------KVIRQVSIKLFSAIIKASPVD---------TGRFRMNWMASGGTPADGTTDATD 64 (131) Q Consensus 1 msf~~~i~~~~~~~~~~~~-------~~~r~~a~~~~~~vv~~tPVd---------tGr~R~nw~vs~~~~~~~~~~~~d 64 (131) -+|...+.+|.+++++-.. +.++.-|.-+...+...||.. .|.++.|-.++-- .+| T Consensus 2 ~~~~d~l~e~~~~lekl~~~~~~~~~katkAGA~v~~~~L~~~tp~~h~~~~~t~~~~HlaD~I~~~~~--------~iD 73 (140) T protein:vir:48 2 TGLDEALEGWLKTVASIGDLTPAEQAKITTAGAKVFKEELAEVTRQKHYSNKKHLKYGHMADGLSVQST--------NVD 73 (140) T ss_pred ccHHHHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCCCCCCcchhceeeccc--------ccc Confidence 3488889999998865433 344444444555566678852 2245555544310 111 Q ss_pred CCcchhHHHHHHHHhhccCCceEEEee----CchhhhhhhcCCCCCCCchhHHHHHHHH---HHHHHHHHHhhC Q lcl|NC_019539. 65 KAGTTATSNAANFVLNAADWHTFTLTN----NLPYAQRLEYGWSQQAPQGFVRVNVSRF---QQLLNEEASKVK 131 (131) Q Consensus 65 ~~g~~~~~~~~~~i~~~~~g~~i~i~N----n~pYa~~LEyG~S~QAp~G~V~~a~~~~---~~~v~~~~~e~k 131 (131) |. ..|.+ .+.. ..=+|.+||+|++.|.|..||+-+.++- ..+++...++.| T Consensus 74 --g~-------------~~g~s-~VG~~kk~~a~~A~f~n~GT~k~~~~hFve~~~~e~~~k~~vl~A~~~~~~ 131 (140) T protein:vir:48 74 --GR-------------KNGVS-TVGWVNRYHAQNARRLNDGTKKYRADHFVTNVQNDSAVQTKVLLAEKEEYE 131 (140) T ss_pred --cc-------------cCcee-eeccCCCcceeeeeccccCccccCCCchhHHHHHhhhhHHHHHHHHHHHHH Confidence 11 11111 1221 2336789999999999999999999753 467777777777 No 111 >protein:vir:101508 Length: 120 # NCBI annotation: gp21 # Family: family:all:2713 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655400;genbank:gi:109522588;genbank:GeneID:4157580 Probab=95.26 E-value=0.0004 Score=39.19 Aligned_cols=104 Identities=15% Similarity=0.163 Sum_probs=70.4 Q ss_pred CccchhHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--cchhhccccccccCcccccccCCCCCCcchhH Q lcl|NC_019539. 1 MSFALDVS-------KFVEKAKKNPEKVIRQVSIKLFSAIIKASPV--DTGRFRMNWMASGGTPADGTTDATDKAGTTAT 71 (131) Q Consensus 1 msf~~~i~-------~~~~~~~~~~~~~~r~~a~~~~~~vv~~tPV--dtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~ 71 (131) |.|.-+.. +|..+....+.......|..+...++..+|= .||..|.....++... .++ T Consensus 4 ~~f~~~~~~l~~~i~~~~~k~~~~~~~~~d~~a~~le~~aK~nApW~DRTg~ARq~i~~~~~~~----------~~~--- 70 (120) T protein:vir:10 4 IEFKFKDIELRRGVEDMEAKVDRAMKATSNYHAVEGTAHMKEHAPWTDRTGAARAGLHAVASTP----------QPD--- 70 (120) T ss_pred EEEEecHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccchhhhhhhccccccC----------CCc--- Confidence 77776654 4555555556666777888899999999996 5999998875543210 000 Q ss_pred HHHHHHHhhccCCceEEEeeCchhhhhhhcCCCCCCCchhHHHHHHHHH----HHHHHHHHhhC Q lcl|NC_019539. 72 SNAANFVLNAADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRFQ----QLLNEEASKVK 131 (131) Q Consensus 72 ~~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~~----~~v~~~~~e~k 131 (131) --+||++-+++|..+||.-++++ ...++.|++.+. +=+++.+.++| T Consensus 71 ------------~~~Iylsh~veYG~~LEla~~~k--yaIl~PTi~~~~~~il~g~~~ll~~l~ 120 (120) T protein:vir:10 71 ------------RYEIVFAHTVHYGIWLEIANSGR--YEIIMPTVHHEGKLMAQRLRGLLGRLR 120 (120) T ss_pred ------------eEEEEEecCeeecceEEeeCCCC--cccccchHHHHhHHHHHHHHHHhhhcC Confidence 13899999999999999655544 345566665554 44566677777 No 112 >protein:vir:4162 Length: 133 # NCBI annotation: unknown # Family: family:all:11764 # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046971;genbank:gi:9630541;genbank:GeneID:1261715 Probab=94.98 E-value=0.00013 Score=41.92 Aligned_cols=92 Identities=20% Similarity=0.300 Sum_probs=54.8 Q ss_pred Cccc-hhHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHHHHHH Q lcl|NC_019539. 1 MSFA-LDVSKF---VEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNAAN 76 (131) Q Consensus 1 msf~-~~i~~~---~~~~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~~~ 76 (131) +--+ +.+++. ..++++++++.+.++-.++-.-+.-..||.||.||-|+..|+.+ T Consensus 2 i~i~idkp~almek~~ev~~~ie~t~~~~~~~l~~i~~ntapiktg~lr~sh~~sieg---------------------- 59 (133) T protein:vir:41 2 IRINIDKPEALMEKASEVEDRVEQTVTLLMIELEEILMNTAPIKTGELRISHTWSVEG---------------------- 59 (133) T ss_pred eeeecCCchhhhcchhhhhhHHHHHHHHHHHHHHHHhhhccccccccceeeeeEEeec---------------------- Confidence 1111 222222 23456666666665555554444557899999999999999753 Q ss_pred HHhhccCCceEEEeeCchhhhhhhcCC----------------------C--------------CCCCchhHHHHHHHHH Q lcl|NC_019539. 77 FVLNAADWHTFTLTNNLPYAQRLEYGW----------------------S--------------QQAPQGFVRVNVSRFQ 120 (131) Q Consensus 77 ~i~~~~~g~~i~i~Nn~pYa~~LEyG~----------------------S--------------~QAp~G~V~~a~~~~~ 120 (131) .+=.++|.+||-+.+=+|. - --+|.|+|+-++-+|- T Consensus 60 --------stgelsn~~~yl~~vl~grgwvfpv~~kal~wpelphpvayarpappndyfsa~vay~~~~give~s~iewl 131 (133) T protein:vir:41 60 --------STGELTNTVPYLQWVLFGRGWVFPVEKKALYWPELPHPVAYARPAPPNDYFSAAVAYIDAKGIVEDSFIEWL 131 (133) T ss_pred --------CccchhhhhHHhhHhhhcccceeeecccccccCCCCCcccccCCCCCchhhhhhhhhhcccchhHHHHHHHh Confidence 2344677777777776653 1 2246677777776663 Q ss_pred HH Q lcl|NC_019539. 121 QL 122 (131) Q Consensus 121 ~~ 122 (131) -- T Consensus 132 is 133 (133) T protein:vir:41 132 IS 133 (133) T ss_pred cC Confidence 22 No 113 >protein:vir:99196 Length: 155 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950453;genbank:gi:119953654;genbank:GeneID:4643056 Probab=94.86 E-value=0.00048 Score=38.78 Aligned_cols=101 Identities=16% Similarity=0.195 Sum_probs=57.6 Q ss_pred Cccchh-------HHHHHHHHHH---HHHHHHHHHHHHHHHHHHHhC-C---------------------------ccch Q lcl|NC_019539. 1 MSFALD-------VSKFVEKAKK---NPEKVIRQVSIKLFSAIIKAS-P---------------------------VDTG 42 (131) Q Consensus 1 msf~~~-------i~~~~~~~~~---~~~~~~r~~a~~~~~~vv~~t-P---------------------------VdtG 42 (131) ||...+ +.+..+.+.. +...++++++..+...+..+= | .||| T Consensus 1 Ms~~i~i~~d~~~~~~~L~~l~~~~~d~~~l~~~ig~~l~~~~~~rF~pdG~~W~pls~~t~~~r~~~g~~~~~iL~~tg 80 (155) T protein:vir:99 1 MTTRIDVELDDQEVRQRLALLMRSVTDTLPVMRGIAAELLAETEFAFMDEGPGWPQLSPVTVAAREAKGRGPHPILQVTN 80 (155) T ss_pred CceEEEEEechHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhhccCCCCCCCChHHHHHHhccCCCCCCcchhch Confidence 664322 2222233322 234566777766666655431 1 2344 Q ss_pred hhccccccccCcccccccCCCCCCcchhHHHHHHHHhhccCCceEEEeeCchhhhhhhcCCC-------CCCCchhHHHH Q lcl|NC_019539. 43 RFRMNWMASGGTPADGTTDATDKAGTTATSNAANFVLNAADWHTFTLTNNLPYAQRLEYGWS-------QQAPQGFVRVN 115 (131) Q Consensus 43 r~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S-------~QAp~G~V~~a 115 (131) +|++|+... ...+.+.|++|++||..-+||-. +-....|+.++ T Consensus 81 ~L~~Si~~~------------------------------~~~~~v~vGtn~~YA~iHqfGg~~~~~~~v~iPaRpfLG~s 130 (155) T protein:vir:99 81 ALARSVTTW------------------------------ADRNEAGIGSNLVYAAIHQFGGDAGRGHQVEIPARRYLPFD 130 (155) T ss_pred hhhhhhhce------------------------------ecCCEEEEecCccchhhhhcccccCCCCccccCCccccCCC Confidence 444443322 23467889999999999999964 23445677665 Q ss_pred H---------HHHHHHHHHHHHhhC Q lcl|NC_019539. 116 V---------SRFQQLLNEEASKVK 131 (131) Q Consensus 116 ~---------~~~~~~v~~~~~e~k 131 (131) . +++..++.+.+++=| T Consensus 131 ~~~~l~~e~~~~I~~~i~~~l~~~~ 155 (155) T protein:vir:99 131 ENGQLAAGARQSILEIVLTALSRNR 155 (155) T ss_pred CccccchHHHHHHHHHHHHHHhccC Confidence 3 456666777776666 No 114 >protein:vir:79225 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469157;genbank:gi:157835000;genbank:GeneID:5648806 Probab=94.75 E-value=0.00056 Score=38.41 Aligned_cols=101 Identities=15% Similarity=0.188 Sum_probs=57.7 Q ss_pred Cccch-------hHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHhC--------C--------------------ccch Q lcl|NC_019539. 1 MSFAL-------DVSKFVEKAKK---NPEKVIRQVSIKLFSAIIKAS--------P--------------------VDTG 42 (131) Q Consensus 1 msf~~-------~i~~~~~~~~~---~~~~~~r~~a~~~~~~vv~~t--------P--------------------VdtG 42 (131) ||... ++.+-.+.+.. +...+++.++..+...+..+= | +||| T Consensus 1 M~~~i~i~~d~~~~~~~L~~l~~~~~d~~~l~~~ig~~l~~~~~~rF~~eG~~W~pls~~t~~~r~~~g~~~~~iL~~tG 80 (155) T protein:vir:79 1 MTTRIDVELDDQEVRQRLAVLMRSVTDTLPVMRGIAAELLAETEFAFMDEGPGWPQLSPATVAAREAKGRGPHPILQVTN 80 (155) T ss_pred CceEEEEEechHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhhccCCCCCCCCHHHHHHHhccCCCCCCccccch Confidence 55322 22222222222 234566667766666665431 0 3455 Q ss_pred hhccccccccCcccccccCCCCCCcchhHHHHHHHHhhccCCceEEEeeCchhhhhhhcCCC-------CCCCchhHHHH Q lcl|NC_019539. 43 RFRMNWMASGGTPADGTTDATDKAGTTATSNAANFVLNAADWHTFTLTNNLPYAQRLEYGWS-------QQAPQGFVRVN 115 (131) Q Consensus 43 r~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S-------~QAp~G~V~~a 115 (131) +|++|++.. -.++.+-|.+|++||..-+||-. +-....|+.++ T Consensus 81 ~L~~Si~~~------------------------------~~~~~v~vGt~~~YA~iHqfGg~~~~~~~v~iPaRpfLG~s 130 (155) T protein:vir:79 81 ALARSVTTW------------------------------ADRNEAGIGSNLVYAAIHQFGGDAGRGHQVEIPARRYLPFD 130 (155) T ss_pred hhhhhhhce------------------------------ecCCEEEEecCchhhhhhhcccccCCCCccccCCccccCCC Confidence 555554322 23467889999999999999964 33455777665 Q ss_pred H---------HHHHHHHHHHHHhhC Q lcl|NC_019539. 116 V---------SRFQQLLNEEASKVK 131 (131) Q Consensus 116 ~---------~~~~~~v~~~~~e~k 131 (131) . +++..++.+.+++=| T Consensus 131 ~~~~l~~~~~~~I~~~i~~~l~r~r 155 (155) T protein:vir:79 131 ENGQLAAGARQSILEVVLTALSRNR 155 (155) T ss_pred CccccchHHHHHHHHHHHHHHHhcC Confidence 3 456666666666666 No 115 >protein:vir:7993 Length: 108 # NCBI annotation: gp9 # Family: family:all:3937 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817347;genbank:gi:29565775;genbank:GeneID:1259013 Probab=94.57 E-value=6.6e-05 Score=43.50 Aligned_cols=94 Identities=18% Similarity=0.253 Sum_probs=51.5 Q ss_pred CccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHHHHHHHHhh Q lcl|NC_019539. 1 MSFALDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNAANFVLN 80 (131) Q Consensus 1 msf~~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~~~~i~~ 80 (131) -.|--.+++|.+.-| +++-+.+.-.+++..-+..|||++|.+|.||+|.--+-.-+ . T Consensus 12 ~KFGvs~~d~~K~~E--Vn~GvNeFMdE~~~~~K~~SPV~~G~Y~~S~~V~ers~NkG----R----------------- 68 (108) T protein:vir:79 12 AKFGVRLDDFDKLPE--VNQGVNEFMDEVVDAWKNNSPVGTGAYRDSVQVTERSTNKG----R----------------- 68 (108) T ss_pred hhhcCChhhhhhchh--hhhhHHHHHHHHHHHHhhcCCCCchhhHHHHHHHHhhhccC----c----------------- Confidence 347777888877322 23333444446777788999999999999999974321100 0 Q ss_pred ccCCceEEEeeCchhhhhhhcCCC---CCCCchhHHHHHHHHHHHHHHHHHh Q lcl|NC_019539. 81 AADWHTFTLTNNLPYAQRLEYGWS---QQAPQGFVRVNVSRFQQLLNEEASK 129 (131) Q Consensus 81 ~~~g~~i~i~Nn~pYa~~LEyG~S---~QAp~G~V~~a~~~~~~~v~~~~~e 129 (131) =-+.-..|||..+|+|.- .-||+- ....|+=..+-.. T Consensus 69 ------G~~G~~~~~AH~VEFGs~hndeyapaq------ktakqfggtay~d 108 (108) T protein:vir:79 69 ------GKVGATDPQAHLVEFGSAHNDEYAPAQ------KTAKQFGGTAYGD 108 (108) T ss_pred ------cccCCcchhhhhhhhhccccccccchh------hHHHhhcccccCC Confidence 112335689999999973 223321 0001110000011 No 116 >protein:vir:107851 Length: 175 # NCBI annotation: gp31 # Family: family:all:274 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024704;genbank:gi:48696941;genbank:GeneID:2845939 Probab=94.00 E-value=0.001 Score=37.03 Aligned_cols=126 Identities=15% Similarity=0.138 Sum_probs=54.8 Q ss_pred CccchhH----HHHHH---HHHHH---HHHHHHHHHHHHHHHHHH-----hCCccchhhccccccccCc----------- Q lcl|NC_019539. 1 MSFALDV----SKFVE---KAKKN---PEKVIRQVSIKLFSAIIK-----ASPVDTGRFRMNWMASGGT----------- 54 (131) Q Consensus 1 msf~~~i----~~~~~---~~~~~---~~~~~r~~a~~~~~~vv~-----~tPVdtGr~R~nw~vs~~~----------- 54 (131) ||...+| +++.. .+... ...++++++..+...... ..| | =..|..+.-. T Consensus 1 Ms~~i~i~~~~~~l~~~L~~l~~~~~d~~~l~~~Ig~~l~~~t~~rF~~e~~P-d----w~p~~p~t~~~r~~~g~~~~k 75 (175) T protein:vir:10 1 MSDFVNFQIDDSALRTRLLQLEQAGHQKAGAMRKIAQALVLVTEDNFAAQGRP-R----WQALSEATIHMRVGGKKAYKK 75 (175) T ss_pred CceeEEEEecHHHHHHHHHHHHHHhccHHHHHHHHHHHHHHHHHHHHHhccCC-C----CCCCchhhhhhhhcccccchh Confidence 7643221 22222 22222 234566666666655543 223 1 0011000000 Q ss_pred -ccccccCCCCCCcchhH---HHHHHHHhhccCCceEEEeeCchhhhhhhcCCC-------CCCCchhHHHHH------H Q lcl|NC_019539. 55 -PADGTTDATDKAGTTAT---SNAANFVLNAADWHTFTLTNNLPYAQRLEYGWS-------QQAPQGFVRVNV------S 117 (131) Q Consensus 55 -~~~~~~~~~d~~g~~~~---~~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S-------~QAp~G~V~~a~------~ 117 (131) +.-.......+.++..+ +....-|...-..+.+-|++|++||..-.||-. +-....|+.++. . T Consensus 76 ~~~~~~~~~~~~~~~~~L~~tG~L~~Si~~~~~~~~v~vGtn~~YAaiHqfGg~~~~~~~v~iPaRpfLG~s~~d~~~~e 155 (175) T protein:vir:10 76 NGELTAAASRRKAGLMILQDSGQMAASVSTDHDDNSAVIGSNKEYAAIHQFGGQAGRGLKVTIPARPWLPVTADGELQPE 155 (175) T ss_pred hhhhhhhhhhhccCCCcceechhhhhhhheeecCCEEEEecChhhhhhhhcccccCCCCccccCCccccCCCcccccchH Confidence 00000000000111111 112222222223467899999999999999975 456667887764 2 Q ss_pred HHHHHHHHHHHhhC Q lcl|NC_019539. 118 RFQQLLNEEASKVK 131 (131) Q Consensus 118 ~~~~~v~~~~~e~k 131 (131) ....|++.+.+.++ T Consensus 156 ~~~~Il~~~~~~l~ 169 (175) T protein:vir:10 156 AVEPVLNTILRHLM 169 (175) T ss_pred HHHHHHHHHHHHHH Confidence 23455555555444 No 117 >protein:vir:79091 Length: 175 # NCBI annotation: gp5, phage virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111205;genbank:gi:134288802;genbank:GeneID:4960765 Probab=92.96 E-value=0.0012 Score=36.54 Aligned_cols=101 Identities=14% Similarity=0.176 Sum_probs=52.2 Q ss_pred Cccch----hHHHHHHHH---HH---HHHHHHHHHHHHHHHHHHH-----hCC--------------------------- Q lcl|NC_019539. 1 MSFAL----DVSKFVEKA---KK---NPEKVIRQVSIKLFSAIIK-----ASP--------------------------- 38 (131) Q Consensus 1 msf~~----~i~~~~~~~---~~---~~~~~~r~~a~~~~~~vv~-----~tP--------------------------- 38 (131) ||..- +-+++.+.+ .. +...++++++..+...+.. ..| T Consensus 1 Ms~~i~i~~d~~~~~~~L~~l~~~~~d~~~lm~~Ig~~l~~~t~~rF~~~~~PdW~pls~~t~~~r~~~~~~~~~~~~~~ 80 (175) T protein:vir:79 1 MSDFVNFQIDDSALRTRLLQLEQAGHQKADAMRKITQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKNGELT 80 (175) T ss_pred CceEEEEEechHHHHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCChHHHHhhccccccccccccch Confidence 66321 212233222 22 2344667777766666554 223 Q ss_pred -------------ccchhhccccccccCcccccccCCCCCCcchhHHHHHHHHhhccCCceEEEeeCchhhhhhhcCCC- Q lcl|NC_019539. 39 -------------VDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNAANFVLNAADWHTFTLTNNLPYAQRLEYGWS- 104 (131) Q Consensus 39 -------------VdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S- 104 (131) +|||+|++|++.. ...+.+-|++|++||..-+||-. T Consensus 81 ~~~~~~~~~~~~L~~tG~L~~Si~~~------------------------------~~~~~v~vGtn~~YAaiHqfGg~~ 130 (175) T protein:vir:79 81 AAASRRKAGLMILQDSGQMAASTATD------------------------------SGEDYSVIGSNKEYAAIQHFGGQA 130 (175) T ss_pred hhHhhhccCCCcceechhhhhhhhhe------------------------------ecCCEEEEecCcchhhHhhccccc Confidence 1344444444322 23467889999999999999963 Q ss_pred ------CCCCchhHHHHHH---------HHHHHHHHHHHhhC Q lcl|NC_019539. 105 ------QQAPQGFVRVNVS---------RFQQLLNEEASKVK 131 (131) Q Consensus 105 ------~QAp~G~V~~a~~---------~~~~~v~~~~~e~k 131 (131) +-....|+.++-+ ++..++.+.++++= T Consensus 131 ~~~~~v~IPARPfLG~s~~de~~~~~~~~I~~~i~~~l~~a~ 172 (175) T protein:vir:79 131 GRGLKVTIPGRAWLPVTADGELQPEAVEPVLNTILRHLMDAA 172 (175) T ss_pred CCCcccccCcccccCCCcccchhHHHHHHHHHHHHHHHHHHh Confidence 3455567766542 23333333332222 No 118 >protein:vir:99833 Length: 190 # NCBI annotation: hypothetical protein # Family: family:all:274 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164071;genbank:gi:56692603;genbank:GeneID:3192561 Probab=92.82 E-value=0.0021 Score=35.28 Aligned_cols=122 Identities=16% Similarity=0.265 Sum_probs=52.2 Q ss_pred CccchhHHHH---HHHHHH---HHHHHHHHHHHHHHHHHHHh-----CCccchhhccccccccCcccccccCCCCCCcch Q lcl|NC_019539. 1 MSFALDVSKF---VEKAKK---NPEKVIRQVSIKLFSAIIKA-----SPVDTGRFRMNWMASGGTPADGTTDATDKAGTT 69 (131) Q Consensus 1 msf~~~i~~~---~~~~~~---~~~~~~r~~a~~~~~~vv~~-----tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~ 69 (131) ++...+.+++ .+.+.. +...+.++++..+.+.+..+ .| | | ..|...- +.+. ...-..|.. T Consensus 4 i~i~~d~~~~~~~L~~l~~~~~~~~~l~~~ig~~l~~~~~~rf~~~~~P-d-G---~~W~p~~--~~t~--~rk~~~~~~ 74 (190) T protein:vir:99 4 ITLEWDGRRALDVLNAGSAALGDPSGLLQDIGELLLNIHRRRFQAQVSP-D-G---TPWQPLS--PAYL--RRKRKNRDK 74 (190) T ss_pred eEEEecHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCC-C-C---CCCcccc--HHHH--HHhhcCCCc Confidence 2233332222 222222 23456777777766666542 33 1 1 2342210 0000 000001111 Q ss_pred hH---HHHHHHHhhccCCceEEEeeCchhhhhhhcCCCCCCC-------------------------------------- Q lcl|NC_019539. 70 AT---SNAANFVLNAADWHTFTLTNNLPYAQRLEYGWSQQAP-------------------------------------- 108 (131) Q Consensus 70 ~~---~~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S~QAp-------------------------------------- 108 (131) .+ .....-|...-..+.+.|++|++||..-+||-..+.+ T Consensus 75 ~L~~tg~L~~Si~~~~~~~~v~vGtn~~yA~iHq~Gg~i~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~ 154 (190) T protein:vir:99 75 ILTLDGHLRNLLRYQLDGSELLFGSDRPYAAIHHFGGTIQRQARSSTVYFRQNERTGEVGREFVPRRRSNFAQDVQIGPY 154 (190) T ss_pred cceecHHHHHHHhheecCcEEEEecCcchhhhhhcCCcccccccchhhhhhhhhhhhhhhcccccccccccchhcccccc Confidence 11 1122222222334678899999999999999554433 Q ss_pred ------chhHHHHHHHHHHHHHHHHHhhC Q lcl|NC_019539. 109 ------QGFVRVNVSRFQQLLNEEASKVK 131 (131) Q Consensus 109 ------~G~V~~a~~~~~~~v~~~~~e~k 131 (131) ..|++++-+....|.+-+...++ T Consensus 155 ~v~IPaRpfLG~s~~d~~~I~~~i~~~l~ 183 (190) T protein:vir:99 155 TIQMPARPWLGTSSQDDDTILQRVERYLQ 183 (190) T ss_pred eeeecCcccCCCCHHHHHHHHHHHHHHHH Confidence 23444444333333333333333 No 119 >protein:vir:103841 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938236;genbank:gi:38229141;genbank:GeneID:2648156 Probab=91.93 E-value=0.0034 Score=34.15 Aligned_cols=124 Identities=17% Similarity=0.198 Sum_probs=55.9 Q ss_pred Ccc----chh---HHHHHHHHHH---HHHHHHHHHHHHHHHHHHHhC-CccchhhccccccccCccccccc-CCCCCCcc Q lcl|NC_019539. 1 MSF----ALD---VSKFVEKAKK---NPEKVIRQVSIKLFSAIIKAS-PVDTGRFRMNWMASGGTPADGTT-DATDKAGT 68 (131) Q Consensus 1 msf----~~~---i~~~~~~~~~---~~~~~~r~~a~~~~~~vv~~t-PVdtGr~R~nw~vs~~~~~~~~~-~~~d~~g~ 68 (131) ||- ..+ +.+..+.+.. +...+++.++..+...+..+= | .|+ -|.-- +|.+... ....+.+. T Consensus 1 Ms~~i~i~~~~~~~~~~L~~l~~~~~~~~~l~~~ig~~l~~~~~~rF~p--~G~---~W~pl--sp~t~~~r~k~g~~~~ 73 (155) T protein:vir:10 1 MANRIELELVDREVQERLAALYAAVTDTLPLMRGIAAELLAETEFAFMD--EGP---GWPQL--SPVTVAARAAKGRGAH 73 (155) T ss_pred CCceEEEEechHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHhh--cCC---CCCCC--CccchHHHHhccCCCC Confidence 552 222 3333333322 234566777766666665431 1 111 23210 1111000 00001111 Q ss_pred hhH---HHHHHHHhhccCCceEEEeeCchhhhhhhcCCC-------CCCCchhHHHHH---------HHHHHHHHHHHHh Q lcl|NC_019539. 69 TAT---SNAANFVLNAADWHTFTLTNNLPYAQRLEYGWS-------QQAPQGFVRVNV---------SRFQQLLNEEASK 129 (131) Q Consensus 69 ~~~---~~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S-------~QAp~G~V~~a~---------~~~~~~v~~~~~e 129 (131) ..+ +....-|......+.+.|.+|++||..-+||-. +-....|+.++. +.+..++.+.+++ T Consensus 74 ~~L~~tG~L~~Si~~~~~~~~v~vGtn~~YA~iHqfGg~~~~~~~~~iPARPfLG~s~~~e~~~ei~~~I~~~i~~~l~~ 153 (155) T protein:vir:10 74 PILQVTNALARSITTRADRDQAQIGSNLSYAAIQQLGGQAGRGRKVTIPARPYLPVLRNGQLKPSARDAVLDVLLAALSQ 153 (155) T ss_pred CccccchhhhhhhhceecCCEEEEecCcchhhhhhcccccCCCCccccCCccccCCCccccchHHHHHHHHHHHHHHHhh Confidence 111 112222222234477899999999999999963 345556776653 3344444455444 Q ss_pred hC Q lcl|NC_019539. 130 VK 131 (131) Q Consensus 130 ~k 131 (131) =| T Consensus 154 ~r 155 (155) T protein:vir:10 154 GR 155 (155) T ss_pred cC Confidence 44 No 120 >protein:vir:3848 Length: 159 # NCBI annotation: hypothetical protein # Family: family:all:1029 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050154;swissprot:trembl:q9t1f3;genbank:gi:9633046;uniprot:Q9T1F3;genbank:GeneID:1262148 Probab=91.83 E-value=0.0047 Score=33.37 Aligned_cols=110 Identities=14% Similarity=0.125 Sum_probs=63.4 Q ss_pred CccchhHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHhCCcc-----------------------chhhcccccc Q lcl|NC_019539. 1 MSFALDVSKFVEKAKKNP-------EKVIRQVSIKLFSAIIKASPVD-----------------------TGRFRMNWMA 50 (131) Q Consensus 1 msf~~~i~~~~~~~~~~~-------~~~~r~~a~~~~~~vv~~tPVd-----------------------tGr~R~nw~v 50 (131) -+|...|.+|.+++++.. .++++.-|.-....+..-||.. .|.++.|-.+ T Consensus 3 ~~~~~~l~~~l~~v~k~~~~~~~~k~kiTkAGAkv~~e~L~~~Tp~~h~~~~k~~~~~~~~~k~~~~~~~~~HlaD~I~~ 82 (159) T protein:vir:38 3 NDMGEFYNNWVNEVEKGMKLSVEDKAKITGEGAEAFSTVLHDHTPRSNEIYRRGRSAGHANAKHHNRNRKTKHLQDSITY 82 (159) T ss_pred chHHHHHHHHHHHHHHhcCCCHHHHHHHHHHhHHHHHHHHHHhcccCCCccccccccccccccccCcCcCCCccccceee Confidence 348888999999996632 2345555555555666667762 2355555544 Q ss_pred ccCcccccccCCCC--CCcchhHHHHHHHHhhccCCceEEEeeCchhhhhhhcCCCCCCCc-----hhHHHHHHHHHHH- Q lcl|NC_019539. 51 SGGTPADGTTDATD--KAGTTATSNAANFVLNAADWHTFTLTNNLPYAQRLEYGWSQQAPQ-----GFVRVNVSRFQQL- 122 (131) Q Consensus 51 s~~~~~~~~~~~~d--~~g~~~~~~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S~QAp~-----G~V~~a~~~~~~~- 122 (131) +-+. ..| ..|+.+. ++... +..=||.+|+.|.+.|.|+ .||+-+.++...- T Consensus 83 ~~~~-------~iDg~~dG~s~V--------Gw~~~------~~a~~a~f~NdGT~~m~~k~~~gdHFvekt~~~~k~~V 141 (159) T protein:vir:38 83 KPGY-------TADKLHTGDTDV--------GFEGK------YYDFLAKIVNNGQHHMSPKRYKNMHFLDKAQQEAKKSV 141 (159) T ss_pred ecCc-------cccccccceeee--------cccCC------ccceEeeecccCccccCCCCccCChhHHHHHHHHHHHH Confidence 3210 111 1222111 11100 1224678999999999886 6999998887554 Q ss_pred HHHHHHhhC Q lcl|NC_019539. 123 LNEEASKVK 131 (131) Q Consensus 123 v~~~~~e~k 131 (131) ++...+++| T Consensus 142 l~A~~~~~~ 150 (159) T protein:vir:38 142 AEAELKAYK 150 (159) T ss_pred HHHHHHHHH Confidence 455555555 No 121 >protein:vir:78894 Length: 105 # NCBI annotation: gp10 # Family: family:all:29989 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468850;genbank:gi:157325424;genbank:GeneID:5601891 Probab=91.32 E-value=0.0012 Score=36.66 Aligned_cols=100 Identities=15% Similarity=0.121 Sum_probs=53.7 Q ss_pred CccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHHHHHHHHhh Q lcl|NC_019539. 1 MSFALDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNAANFVLN 80 (131) Q Consensus 1 msf~~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~~~~i~~ 80 (131) |||++==++-++.+.++--..--++-.++++..---+|.+||.|+.|=..++. T Consensus 1 ~~f~~f~~~~~k~l~kr~L~~~g~vq~EvlR~~~PyvP~~tG~Lk~S~~l~tv--------------------------- 53 (105) T protein:vir:78 1 MSFSSFKDAVIDDIHNKALSTAAKAGGELVELAQPVTPILYGDLRRSSYFKII--------------------------- 53 (105) T ss_pred CCcccccchHHHHHHHhcCCCCchhhHHHHHHhCCCCccccccccccccccee--------------------------- Confidence 98864222222322222111000111255655556689999999998654321 Q ss_pred ccCCceEEEee-CchhhhhhhcCCCCCCCchhHHHHHH----HHHHHHHHHHHh Q lcl|NC_019539. 81 AADWHTFTLTN-NLPYAQRLEYGWSQQAPQGFVRVNVS----RFQQLLNEEASK 129 (131) Q Consensus 81 ~~~g~~i~i~N-n~pYa~~LEyG~S~QAp~G~V~~a~~----~~~~~v~~~~~e 129 (131) ...|.++|=.| -+|||.+.=|.+ |-..-|.+.... ++.++|+-.++- T Consensus 54 Igsg~I~y~~~~~aPYAr~qYYe~--~Rg~~WfErm~a~hk~~I~~~vegg~~~ 105 (105) T protein:vir:78 54 IQKNSIVARVFSLTPYARRQYYEN--RRNPRWYEMAVSYGIQSINQIVEGGMRL 105 (105) T ss_pred ecCCeeEeeccccCchhhhhhhcc--cCCCchhHHhhhcchhHHHHHHhcccCC Confidence 13344555332 489999999877 355557776653 444555533333 No 122 >protein:vir:81067 Length: 119 # NCBI annotation: p12 # Family: family:all:2714 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285682;genbank:gi:156535145;genbank:GeneID:5247112 Probab=89.26 E-value=0.0013 Score=36.48 Aligned_cols=79 Identities=14% Similarity=0.153 Sum_probs=49.6 Q ss_pred HHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHHHHHHHHhhccCCceEE-Eee---CchhhhhhhcCC- Q lcl|NC_019539. 29 LFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNAANFVLNAADWHTFT-LTN---NLPYAQRLEYGW- 103 (131) Q Consensus 29 ~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~~~~i~~~~~g~~i~-i~N---n~pYa~~LEyG~- 103 (131) +=.+.+.+-|++||.||.|..+..+.- .| ..|-.+| ++. ..||..-+|||| T Consensus 1 ~rDeakarv~~~~G~Lr~sIY~ay~~~---------~S---------------~dG~~~Y~Vswn~rkAPhghlvE~Ghw 56 (119) T protein:vir:81 1 MRESAKAFVNDETGKLRSNLYVAYSPE---------ES---------------TNGVQTYAVSWRKKAAPHGHLLEFGHW 56 (119) T ss_pred CCcccccccCCCccchhhhheeeeccc---------cC---------------CCCeEEEEeeccCCcCCccccccccee Confidence 444556788999999999997764311 11 1233455 333 468888899995 Q ss_pred -----------------------CCCCCchhHHHHHH-HHHHH-------HHHHHHhhC Q lcl|NC_019539. 104 -----------------------SQQAPQGFVRVNVS-RFQQL-------LNEEASKVK 131 (131) Q Consensus 104 -----------------------S~QAp~G~V~~a~~-~~~~~-------v~~~~~e~k 131 (131) +..+|..|+|.++. ...+. +.+.++|+. T Consensus 57 ~~~~~~~~~dG~w~~~~~~l~~~~~vPa~pFlRpA~da~~~~a~~~~~~r~~~rv~Ev~ 115 (119) T protein:vir:81 57 QTHAAYKGKDGEWYSSSVKLVNPKWIPARPFLRPGYDSVAMQIPDIAKAAGAKKYAELQ 115 (119) T ss_pred eeeeeeeccCceeeecCccccCceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHh Confidence 56677889988776 22333 333344444 No 123 >protein:vir:9513 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835560;genbank:gi:30043947;genbank:GeneID:1260542 Probab=88.99 E-value=0.0099 Score=31.57 Aligned_cols=105 Identities=21% Similarity=0.206 Sum_probs=64.3 Q ss_pred Cccc-hhHHHHHHHHHHH---------HHHHHHHHHHHHHHHHHHhCCc--cchhhccccccccCcccccccCCCCCCcc Q lcl|NC_019539. 1 MSFA-LDVSKFVEKAKKN---------PEKVIRQVSIKLFSAIIKASPV--DTGRFRMNWMASGGTPADGTTDATDKAGT 68 (131) Q Consensus 1 msf~-~~i~~~~~~~~~~---------~~~~~r~~a~~~~~~vv~~tPV--dtGr~R~nw~vs~~~~~~~~~~~~d~~g~ 68 (131) ||=. .=+++...++++. .+..+++.+..++..++...+| |||.....-.+|- |- -.+|. T Consensus 1 msvevkGv~eil~~le~k~g~~~~~ri~nkAL~~age~v~~~~K~~~~~fkDTG~t~~ev~~s~--p~-------~~~G~ 71 (134) T protein:vir:95 1 MSVKVIGDKALERELEKRFGIKEMVKVQDKALIAGAKVIVEEVKKQLKPSKDTGALINEVSFSK--PE-------WINGK 71 (134) T ss_pred CeEEEecHHHHHHHHHHhhchhhhhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeccEEecC--ee-------ecCCc Confidence 7643 3344555555443 5667888888899999999998 9999999887762 11 11221 Q ss_pred hhHHHHHHHHhhccCCceEEEeeCch-----hhhhhhcCCCCC------CCch--hHHHHHHHH----HHHHHHHHHhh Q lcl|NC_019539. 69 TATSNAANFVLNAADWHTFTLTNNLP-----YAQRLEYGWSQQ------APQG--FVRVNVSRF----QQLLNEEASKV 130 (131) Q Consensus 69 ~~~~~~~~~i~~~~~g~~i~i~Nn~p-----Ya~~LEyG~S~Q------Ap~G--~V~~a~~~~----~~~v~~~~~e~ 130 (131) -+|-|...-| |..-.||||+.. -|.| .++-++... ...+.+.++++ T Consensus 72 ----------------r~V~vgW~G~~~R~~iiHLNE~Gytr~~~Gk~i~PrG~G~i~~a~~~~e~~~~~~ik~eL~kl 134 (134) T protein:vir:95 72 ----------------RTITVHWRGSKDRYKIVHLIEYGHVQKGTGKFIKPKAMGGVNRAIRQGQNKYFETLKRELKKL 134 (134) T ss_pred ----------------eEEEEEEEcCCceeEEEEeecccceecccCCccCcchhhHHHHHHHhhhHHHHHHHHHHHhcC Confidence 1344444333 666789999764 3554 455455444 44455555555 No 124 >protein:vir:101302 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908835;genbank:gi:118725099;genbank:GeneID:4555873 Probab=88.99 E-value=0.0099 Score=31.57 Aligned_cols=105 Identities=21% Similarity=0.206 Sum_probs=64.3 Q ss_pred Cccc-hhHHHHHHHHHHH---------HHHHHHHHHHHHHHHHHHhCCc--cchhhccccccccCcccccccCCCCCCcc Q lcl|NC_019539. 1 MSFA-LDVSKFVEKAKKN---------PEKVIRQVSIKLFSAIIKASPV--DTGRFRMNWMASGGTPADGTTDATDKAGT 68 (131) Q Consensus 1 msf~-~~i~~~~~~~~~~---------~~~~~r~~a~~~~~~vv~~tPV--dtGr~R~nw~vs~~~~~~~~~~~~d~~g~ 68 (131) ||=. .=+++...++++. .+..+++.+..++..++...+| |||.....-.+|- |- -.+|. T Consensus 1 msvevkGv~eil~~le~k~g~~~~~ri~nkAL~~age~v~~~~K~~~~~fkDTG~t~~ev~~s~--p~-------~~~G~ 71 (134) T protein:vir:10 1 MSVKVIGDKALERELEKRFGIKEMVKVQDKALIAGAKVIVEEVKKQLKPSKDTGALINEVSFSK--PE-------WINGK 71 (134) T ss_pred CeEEEecHHHHHHHHHHhhchhhhhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeccEEecC--ee-------ecCCc Confidence 7643 3344555555443 5667888888899999999998 9999999887762 11 11221 Q ss_pred hhHHHHHHHHhhccCCceEEEeeCch-----hhhhhhcCCCCC------CCch--hHHHHHHHH----HHHHHHHHHhh Q lcl|NC_019539. 69 TATSNAANFVLNAADWHTFTLTNNLP-----YAQRLEYGWSQQ------APQG--FVRVNVSRF----QQLLNEEASKV 130 (131) Q Consensus 69 ~~~~~~~~~i~~~~~g~~i~i~Nn~p-----Ya~~LEyG~S~Q------Ap~G--~V~~a~~~~----~~~v~~~~~e~ 130 (131) -+|-|...-| |..-.||||+.. -|.| .++-++... ...+.+.++++ T Consensus 72 ----------------r~V~vgW~G~~~R~~iiHLNE~Gytr~~~Gk~i~PrG~G~i~~a~~~~e~~~~~~ik~eL~kl 134 (134) T protein:vir:10 72 ----------------RTITVHWRGSKDRYKIVHLIEYGHVQKGTGKFIKPKAMGGVNRAIRQGQNKYFETLKRELKKL 134 (134) T ss_pred ----------------eEEEEEEEcCCceeEEEEeecccceecccCCccCcchhhHHHHHHHhhhHHHHHHHHHHHhcC Confidence 1344444333 666789999764 3554 455455444 44455555555 No 125 >protein:vir:10367 Length: 119 # NCBI annotation: conserved phage protein # Family: family:all:2714 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858959;genbank:gi:32128424;genbank:GeneID:2648366 Probab=88.81 E-value=0.0014 Score=36.17 Aligned_cols=79 Identities=15% Similarity=0.158 Sum_probs=49.4 Q ss_pred HHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHHHHHHHHhhccCCceEE-Eee---CchhhhhhhcCC- Q lcl|NC_019539. 29 LFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNAANFVLNAADWHTFT-LTN---NLPYAQRLEYGW- 103 (131) Q Consensus 29 ~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~~~~i~~~~~g~~i~-i~N---n~pYa~~LEyG~- 103 (131) +=.+.+.+-|++||.||.|..+..+.-. | ..|-.+| ++. ..||..-+|||| T Consensus 1 ~rDeakarv~~~~G~Lr~sIY~ay~~~~---------S---------------~dG~~~Y~Vswn~rkAPhghlvE~Ghw 56 (119) T protein:vir:10 1 MRESAKAFVNDETGKLRSNLYVAYSTEE---------S---------------TNGVQTYAVSWRKKAAPHGHLLEFGHW 56 (119) T ss_pred CCcccccccCCCccchhhhheeeecccc---------C---------------CCCEEEEEeecCCCcCCccccccccee Confidence 4445567889999999999977643211 1 1233455 433 468888899995 Q ss_pred -----------------------CCCCCchhHHHHHH-HHHHH-------HHHHHHhhC Q lcl|NC_019539. 104 -----------------------SQQAPQGFVRVNVS-RFQQL-------LNEEASKVK 131 (131) Q Consensus 104 -----------------------S~QAp~G~V~~a~~-~~~~~-------v~~~~~e~k 131 (131) +..+|..|+|.++. ...+. +.+.++|+. T Consensus 57 ~~~~~~~~~dG~w~~~~~~l~~~~~vPa~pFlRpA~da~~~~a~~~~~~r~~~rv~Ev~ 115 (119) T protein:vir:10 57 QTHAAYKGKDGEWYSSSVKLVNPKWIPARPFLRPGYDSVAMQIPDIAKAAGAKKYAELQ 115 (119) T ss_pred eeeeeeeccCceeeecCccccCceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHh Confidence 35677889988776 22333 333344444 No 126 >protein:vir:102190 Length: 93 # NCBI annotation: gp21 # Family: family:all:2713 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655217;genbank:gi:109522797;genbank:GeneID:4157429 Probab=86.37 E-value=0.012 Score=31.02 Aligned_cols=87 Identities=15% Similarity=0.135 Sum_probs=61.9 Q ss_pred HHHHHHHHHHHHHHHHHHhCCc--cchhhccccccccCcccccccCCCCCCcchhHHHHHHHHhhccCCceEEEeeCchh Q lcl|NC_019539. 18 PEKVIRQVSIKLFSAIIKASPV--DTGRFRMNWMASGGTPADGTTDATDKAGTTATSNAANFVLNAADWHTFTLTNNLPY 95 (131) Q Consensus 18 ~~~~~r~~a~~~~~~vv~~tPV--dtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~~~~i~~~~~g~~i~i~Nn~pY 95 (131) +.....-.|.++-..++..+|= .||..|....-+++. .|.. --+||++-+++| T Consensus 1 ~~~~~d~aa~~le~~aK~nApW~DRTg~AR~~l~~~~~~-----------~g~~--------------~~~i~lsh~v~Y 55 (93) T protein:vir:10 1 MKATSNYHAVEGTAHMKEHAPWTDRTGAARAGLHAVAST-----------PQPD--------------RYEIVFAHTVHY 55 (93) T ss_pred CchhhhHHHHHHHHHHhcCCCccccchhhhhhhcccccc-----------cCCc--------------eEEEEEecCeec Confidence 5555566677888999999996 699999888655421 1111 137999999999 Q ss_pred hhhhhcCCCCCCCchhHHHHHHHHHHH----HHHHHHhhC Q lcl|NC_019539. 96 AQRLEYGWSQQAPQGFVRVNVSRFQQL----LNEEASKVK 131 (131) Q Consensus 96 a~~LEyG~S~QAp~G~V~~a~~~~~~~----v~~~~~e~k 131 (131) ..+||-+++++- .+++.|++.+.+. +++.+.++| T Consensus 56 g~~LE~a~~~ky--aIl~Ptv~~~~~~i~~g~~~ll~~l~ 93 (93) T protein:vir:10 56 GIWLEIANSGRY--EIIMPTVHHEGKLMAQRLRGLLGRLR 93 (93) T ss_pred cceEEeecCCCc--cchhhhHHHHHHHHHHHHHHHHHhcC Confidence 999999997653 4666666655443 566677777 No 127 >protein:vir:8106 Length: 150 # NCBI annotation: gp10 # Family: family:all:3937 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817687;genbank:gi:29566118;genbank:GeneID:1259312 Probab=85.83 E-value=0.00078 Score=37.62 Aligned_cols=100 Identities=21% Similarity=0.318 Sum_probs=50.4 Q ss_pred Cc-----cchhHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcch Q lcl|NC_019539. 1 MS-----FALDVSKFVEKA------KKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTT 69 (131) Q Consensus 1 ms-----f~~~i~~~~~~~------~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~ 69 (131) |+ |-..+++|.+-+ .+.+.+++.+.|. .-.+..+|||.|.+|+||+|.--+- +| T Consensus 1 mgNP~~KFGvS~~e~~K~irns~EV~~GiNdFMe~~A~---~~aK~~SPV~~GeY~~S~~V~~ka~----------NG-- 65 (150) T protein:vir:81 1 MGNPFEKFGVSDSELAKHIRNSAEVDAGINDFMENEAI---PYAKSISPVDDGEYAASWAVMKKAK----------NG-- 65 (150) T ss_pred CCCchhhhcCCHHHHHHhhccchhhhhhHHHHHHhhhh---hhhhccCCcccchhHHHHHHHhhcc----------cC-- Confidence 43 666777777653 3333444443332 2236789999999999999874221 11 Q ss_pred hHHHHHHHHhhccCCceEEEeeCchhhhhhhcCCCC---C----------CCchhHHHHH-----------HHHHHHHHH Q lcl|NC_019539. 70 ATSNAANFVLNAADWHTFTLTNNLPYAQRLEYGWSQ---Q----------APQGFVRVNV-----------SRFQQLLNE 125 (131) Q Consensus 70 ~~~~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S~---Q----------Ap~G~V~~a~-----------~~~~~~v~~ 125 (131) .=-+.-..|||..+|||.-. | ....-|++-- ...+-|..+ T Consensus 66 ----------------RG~~G~~~~~AH~VEFGtgadkkqgrgkkgkrgkdgkrtveiddgefrrvgpdtptkaqgiaqk 129 (150) T protein:vir:81 66 ----------------RGVFGPKAWYAHFVEFGTGADKKQGRGKKGKRGKDGKRTVEIDDGEFRRVGPDTPTKAQGIAQK 129 (150) T ss_pred ----------------ccccCccchhhhhhhhccccccccccccccccCcccceeeeecCccceecCCCCchhhhhHHHH Confidence 01133467899999998731 1 1112222111 111112222 Q ss_pred HHHhh----C Q lcl|NC_019539. 126 EASKV----K 131 (131) Q Consensus 126 ~~~e~----k 131 (131) .+..+ | T Consensus 130 vashfggslk 139 (150) T protein:vir:81 130 VASHFGGSLK 139 (150) T ss_pred HHHhcccccc Confidence 22222 2 No 128 >protein:vir:100652 Length: 134 # NCBI annotation: 77ORF029 # Family: family:all:589 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958610;genbank:gi:41189542;genbank:GeneID:2743798 Probab=82.52 E-value=0.047 Score=27.87 Aligned_cols=105 Identities=20% Similarity=0.228 Sum_probs=63.3 Q ss_pred Cccc-hhHHHHHHHHHHH---------HHHHHHHHHHHHHHHHHHhCCc--cchhhccccccccCcccccccCCCCCCcc Q lcl|NC_019539. 1 MSFA-LDVSKFVEKAKKN---------PEKVIRQVSIKLFSAIIKASPV--DTGRFRMNWMASGGTPADGTTDATDKAGT 68 (131) Q Consensus 1 msf~-~~i~~~~~~~~~~---------~~~~~r~~a~~~~~~vv~~tPV--dtGr~R~nw~vs~~~~~~~~~~~~d~~g~ 68 (131) ||=. .-+++...++++. .+..+++.+..++..++...+| |||.....-.+|- | .-..|. T Consensus 1 MsvevkGv~eil~~LE~k~g~~~~~ri~dkAL~~age~v~~~~K~~~~~fkDTGati~ev~~s~--p-------~~~~G~ 71 (134) T protein:vir:10 1 MSVKVTGDKALERELEKHFGIKEMVKVQDKALIAGAKVIVEEIKKQLKPSEDSGALISEIGRTE--P-------EWIKGK 71 (134) T ss_pred CeEEeecHHHHHHHHHHhhchhhhhhhhhHHHHHHhHHHHHHHHhhcCccccccceeccEeecC--e-------eecCCc Confidence 7733 4455555555544 5567788888899999987777 9999988887762 1 111221 Q ss_pred hhHHHHHHHHhhccCCceEEEeeCch-----hhhhhhcCCCC-CC-----Cc--hhHHHHHHHHH----HHHHHHHHhh Q lcl|NC_019539. 69 TATSNAANFVLNAADWHTFTLTNNLP-----YAQRLEYGWSQ-QA-----PQ--GFVRVNVSRFQ----QLLNEEASKV 130 (131) Q Consensus 69 ~~~~~~~~~i~~~~~g~~i~i~Nn~p-----Ya~~LEyG~S~-QA-----p~--G~V~~a~~~~~----~~v~~~~~e~ 130 (131) -+|-|...-| |..-.||||+. .+ |. |.++-++.... ..+.+.++++ T Consensus 72 ----------------r~V~vgW~G~~~R~~ivHLnE~Gyt~~r~Gk~i~PrG~G~i~~a~~~~e~~~~~~ik~eL~kl 134 (134) T protein:vir:10 72 ----------------RTVTIRWRGPFERFRIVHLIENGHVEKKSGKFVKPKAMGGINRAIRQGQNKYFETLKRELKKL 134 (134) T ss_pred ----------------eEEEEEEEcCCceeeEEEeeecceeecCCCCeeccchhhHHHHHHHhhhHHHHHHHHHHHhcC Confidence 2344444333 56668999972 33 54 44555555443 3444455555 No 129 >protein:vir:9647 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795409;genbank:gi:28876182;genbank:GeneID:1257731 Probab=70.90 E-value=0.2 Score=24.38 Aligned_cols=106 Identities=17% Similarity=0.209 Sum_probs=59.0 Q ss_pred Cccchh---HHHHHHHHHH---------HHHHHHHHHHHHHHHHHHHhCCc--cchhhccccccccCcccccccCCCCCC Q lcl|NC_019539. 1 MSFALD---VSKFVEKAKK---------NPEKVIRQVSIKLFSAIIKASPV--DTGRFRMNWMASGGTPADGTTDATDKA 66 (131) Q Consensus 1 msf~~~---i~~~~~~~~~---------~~~~~~r~~a~~~~~~vv~~tPV--dtGr~R~nw~vs~~~~~~~~~~~~d~~ 66 (131) ||=-++ +++..+++++ ..+..+++.+..++..++...|| |||..-..-.+|- |- -.+ T Consensus 1 ~~~~aevkGv~Eilk~lE~klG~~~v~ri~nkAL~~~ge~v~~~lK~~~~~f~DTG~t~dev~~s~--~~-------~~~ 71 (132) T protein:vir:96 1 MSGFANLKGVEELLANMEKKLGPAKVNRVVNRSLKEIGKELEPSFKSAISIYKRTGETTESAVVSG--VR-------RED 71 (132) T ss_pred CCccccccCHHHHHHHHHHhhCHHHHHHHhHHHHHHHHHHHHHHHHHhhhhhhhcchhhcceeecC--ee-------ecC Confidence 653333 3344444443 34556777888889999999998 9999888776662 11 112 Q ss_pred cchhHHHHHHHHhhccCCceEEEeeCch--hhhhhh-cCCCC---CCCchhHHHHHHHHHH-HHHHHHHhhC Q lcl|NC_019539. 67 GTTATSNAANFVLNAADWHTFTLTNNLP--YAQRLE-YGWSQ---QAPQGFVRVNVSRFQQ-LLNEEASKVK 131 (131) Q Consensus 67 g~~~~~~~~~~i~~~~~g~~i~i~Nn~p--Ya~~LE-yG~S~---QAp~G~V~~a~~~~~~-~v~~~~~e~k 131 (131) |.. +|-|..+-| |..+|+ |||.+ ..+-|+++-++..-.. ++...-.++| T Consensus 72 G~r----------------~V~VgW~GpR~~ivHLNE~GyGk~~~PrG~G~I~~a~~~se~~~~~~~~~elk 127 (132) T protein:vir:96 72 GIP----------------KVKLGFTTPRWNIVHLQELEYGWKHNRRGVGVIRRYSDILETIYPRGIRDKLK 127 (132) T ss_pred Cce----------------EEEecccCCceeEEeeecccccCCcCCCcchHHHHHHHhhhhHHHHHHHHHHH Confidence 221 233333333 223444 78753 3445778777766653 2333334444 No 130 >protein:vir:6216 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:10886 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852596;genbank:gi:31415856;genbank:GeneID:1489214 Probab=67.58 E-value=0.091 Score=26.30 Aligned_cols=102 Identities=14% Similarity=0.140 Sum_probs=61.0 Q ss_pred Cc-----cchhHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHhCCc----cchhhccccccccCcccccccCCCCCCcc Q lcl|NC_019539. 1 MS-----FALDVSKFVEKAKKN---PEKVIRQVSIKLFSAIIKASPV----DTGRFRMNWMASGGTPADGTTDATDKAGT 68 (131) Q Consensus 1 ms-----f~~~i~~~~~~~~~~---~~~~~r~~a~~~~~~vv~~tPV----dtGr~R~nw~vs~~~~~~~~~~~~d~~g~ 68 (131) |+ |+..++.+..-++-+ --+.+.+.|.--+.+++-.-|+ ..|.+|.+.+|-+-. T Consensus 1 m~sNNNGFae~~~~~~tl~kVd~kvs~e~L~eAA~~f~~KL~P~Ip~Sl~kkk~HlrD~lkVvvk~-------------- 66 (125) T protein:vir:62 1 MASNNNGFAEALEDINTLLRVNKKVSLDALDEAAKYFASKLKPKINVSNKNKRTHLRDSLKVVVKD-------------- 66 (125) T ss_pred CCCCchhHHHHHHHhhhhhhhhhhhhHHHHHHHHHHHHHhhccccChhhhhhhhhcceeeeEEeeC-------------- Confidence 43 666655554433222 1123344444455555544554 468999999886521 Q ss_pred hhHHHHHHHHhhccCCceEEEeeCchhhhhhhcCCCCCCCch------hHHHHHHHHHHHHHHHH-----Hhh Q lcl|NC_019539. 69 TATSNAANFVLNAADWHTFTLTNNLPYAQRLEYGWSQQAPQG------FVRVNVSRFQQLLNEEA-----SKV 130 (131) Q Consensus 69 ~~~~~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S~QAp~G------~V~~a~~~~~~~v~~~~-----~e~ 130 (131) ..-.+.+.+..=|=..+|+||++|-++| ||.-|+..-..-+++.+ .++ T Consensus 67 --------------d~V~V~Fed~a~yW~f~EnGt~~~~~~g~vkaqhf~~~Tf~~nk~kI~~iM~kki~d~m 125 (125) T protein:vir:62 67 --------------DRVSVEFKDEAWYWYLVEHGHKKAKGKGRVKGKHFVQNTFDAEGDKIADIMAQKIINRM 125 (125) T ss_pred --------------CeEEEEEcchhhhhhhhhccccccccccccchhhhhhccHHhhHHHHHHHHHHHHHhhC Confidence 1124778889999999999999996655 77777765544444333 222 No 131 >protein:vir:77650 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:YP_022741;genbank:gi:47835022;genbank:GeneID:2821447 Probab=62.98 E-value=0.2 Score=24.38 Aligned_cols=93 Identities=12% Similarity=0.039 Sum_probs=40.9 Q ss_pred HHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHHHHHHHHhhccCCceEEEeeCchhhhh Q lcl|NC_019539. 19 EKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNAANFVLNAADWHTFTLTNNLPYAQR 98 (131) Q Consensus 19 ~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~~~~i~~~~~g~~i~i~Nn~pYa~~ 98 (131) ..+.|+....++.++.. ...+-.|.-+.. .+|..|.... ....+.....+ =.+..-+|.+ T Consensus 1 m~~~r~~l~~~~~~l~~------~~v~VGi~~~a~--------y~d~~~~~~~--~~~~~~~~~~~----G~pva~ia~~ 60 (155) T protein:vir:77 1 MSVTRRGLTLPKDRYRS------MSVKAGVLAGAT--------YPDESGKKLA--DGSILKKDPRA----GLPVAMIAMA 60 (155) T ss_pred CcchHHHHHHHHHHHhc------CceEEeecCCCC--------Cccccchhhh--hhhhccccccc----cccHhhhhhh Confidence 22222222222222221 223333322211 1222221111 00111100000 1234457889 Q ss_pred hhcCCCCCCCchhHHHHHHHHHHHHHHHHHh-hC Q lcl|NC_019539. 99 LEYGWSQQAPQGFVRVNVSRFQQLLNEEASK-VK 131 (131) Q Consensus 99 LEyG~S~QAp~G~V~~a~~~~~~~v~~~~~e-~k 131 (131) +|||+.+-.|..|+|.++.+.+.-..+.+.+ ++ T Consensus 61 ~e~G~~~IP~RPFlr~t~~~~~~~~~~~l~~~~~ 94 (155) T protein:vir:77 61 LNYGTSKLPARPFMEKTIADRSAEWIKGLTVMMT 94 (155) T ss_pred hhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHH Confidence 9999999999999999998775544444333 33 No 132 >protein:vir:101563 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958111;genbank:gi:41057657;genbank:GeneID:2716820 Probab=61.01 E-value=0.18 Score=24.72 Aligned_cols=93 Identities=15% Similarity=0.098 Sum_probs=40.4 Q ss_pred HHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHHHHHHHHhhccCCceEEEeeCchhhhh Q lcl|NC_019539. 19 EKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNAANFVLNAADWHTFTLTNNLPYAQR 98 (131) Q Consensus 19 ~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~~~~i~~~~~g~~i~i~Nn~pYa~~ 98 (131) ..+.|+-...++.++... .|.++-++... .+|..|..- ..+. +...+. --=.+..-+|.+ T Consensus 1 m~v~r~~L~~~~~~l~~~------------~V~VGi~~~a~--y~d~~g~~~-~~g~--~~~~~~---~~G~pva~ia~~ 60 (155) T protein:vir:10 1 MSVTRRGLTLPKDRYKSM------------SVKAGVLAGAT--YPDESGKKL-ADGT--ILKKDP---RAGLPVAMIAMA 60 (155) T ss_pred CcchHHHHHHHHHHhhCC------------eeEEeecCCCC--CCccccchh-hhhh--hhcccc---ccCcchhhhhhh Confidence 222222222222333221 13333332211 223222111 1110 011100 001123346779 Q ss_pred hhcCCCCCCCchhHHHHHHHHHHHHHHHHHhh-C Q lcl|NC_019539. 99 LEYGWSQQAPQGFVRVNVSRFQQLLNEEASKV-K 131 (131) Q Consensus 99 LEyG~S~QAp~G~V~~a~~~~~~~v~~~~~e~-k 131 (131) +|||+.+-.|..|+|.++.+...-..+.+.++ + T Consensus 61 ~e~G~~~IP~RPFlr~t~~~~~~~~~~~l~~~~~ 94 (155) T protein:vir:10 61 LNYGTSKLPARPFMEKTIADRSAEWIKGLTVMMT 94 (155) T ss_pred hhcCCCCCCCcchhHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999987755444433332 2 No 133 >protein:vir:106728 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944315;genbank:gi:38638614;genbank:GeneID:2657357 Probab=57.20 E-value=0.32 Score=23.27 Aligned_cols=92 Identities=14% Similarity=0.057 Sum_probs=40.8 Q ss_pred HHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHHHHHHHHhhc-cCCceEEEeeCchhhh Q lcl|NC_019539. 19 EKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNAANFVLNA-ADWHTFTLTNNLPYAQ 97 (131) Q Consensus 19 ~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~~~~i~~~-~~g~~i~i~Nn~pYa~ 97 (131) ..+.|+-...++.++.. ...+-.| +.... .+|.+|..... ..+... ..+. .++.-+|. T Consensus 1 m~v~~k~L~~~~~~l~~------~~v~VGi------~~~a~--y~d~~~~~~~~---~~~~~~~~~~g----~~va~ia~ 59 (155) T protein:vir:10 1 MSVTRRGLTLPKDRYRS------MSVKAGV------LAGAT--YPDESGKKLAD---GTILTKDPRAG----LPVAMIAM 59 (155) T ss_pred CcchHHHHHHHHHHHhC------CeeEEee------cCCCC--Cccccchhhhh---hhhcccccccC----CcHHHHHH Confidence 33334333333333321 1223223 22211 22322222210 000000 0000 12344566 Q ss_pred hhhcCCCCCCCchhHHHHHHHHHHHHHHHHHhh-C Q lcl|NC_019539. 98 RLEYGWSQQAPQGFVRVNVSRFQQLLNEEASKV-K 131 (131) Q Consensus 98 ~LEyG~S~QAp~G~V~~a~~~~~~~v~~~~~e~-k 131 (131) +.|||+.+-.|..|+|.++.+...-..+.+.++ + T Consensus 60 ~~E~G~~~IP~RPFlr~t~~~~~~~~~~~l~~~~~ 94 (155) T protein:vir:10 60 ALNYGTSKLPARPFMEKTIADRSAEWIKGLTVMMT 94 (155) T ss_pred HHhcCCCCCCCcchhHHHHHHHHHHHHHHHHHHHH Confidence 899999999999999999977755444333322 2 No 134 >protein:vir:78607 Length: 155 # NCBI annotation: BcepNY3gp06 # Family: family:all:503 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294843;genbank:gi:149882906;genbank:GeneID:5291078 Probab=57.00 E-value=0.32 Score=23.32 Aligned_cols=93 Identities=12% Similarity=0.040 Sum_probs=41.1 Q ss_pred HHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHHHHHHHHhhccCCceEEEeeCchhhhh Q lcl|NC_019539. 19 EKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNAANFVLNAADWHTFTLTNNLPYAQR 98 (131) Q Consensus 19 ~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~~~~i~~~~~g~~i~i~Nn~pYa~~ 98 (131) ..+.|+-...++.++.. ...+-.| +... ..+|.+|....... .......+. .++.-+|.+ T Consensus 1 m~v~~k~L~~~~~~l~~------~~v~VGi------~~~a--~y~d~~~~~~~~~~--~~~~~~~~g----~~va~ia~~ 60 (155) T protein:vir:78 1 MSVTRRGLTLPKDRYRS------MSVKAGV------LAGA--TYPDESGKKLADGT--ILTKDPRAG----LPVAMIAMA 60 (155) T ss_pred CcchHHHHHHHHHHHhC------CeeEEee------cCCC--CCCcccchhhhhhh--hcccccccC----CcHHHHHHh Confidence 33334333333333321 1222223 2221 12233332221100 000000000 123335668 Q ss_pred hhcCCCCCCCchhHHHHHHHHHHHHHHHHHhh-C Q lcl|NC_019539. 99 LEYGWSQQAPQGFVRVNVSRFQQLLNEEASKV-K 131 (131) Q Consensus 99 LEyG~S~QAp~G~V~~a~~~~~~~v~~~~~e~-k 131 (131) +|||+.+-.|..|+|.++.+.+.-..+.+.++ + T Consensus 61 ~E~G~~~IP~RPFlr~t~~~~~~~~~~~l~~~~~ 94 (155) T protein:vir:78 61 LNYGTSKLPARPFMEKTITDRSAEWIKGLTVMMT 94 (155) T ss_pred hhcCCCCCCCcchhhHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999987755443333322 2 No 135 >protein:vir:105773 Length: 131 # NCBI annotation: gp14 # Family: family:all:10996 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224152;genbank:gi:62362227;genbank:GeneID:3342526 Probab=54.49 E-value=0.38 Score=22.89 Aligned_cols=105 Identities=14% Similarity=0.119 Sum_probs=51.2 Q ss_pred Ccc------chhHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHHH Q lcl|NC_019539. 1 MSF------ALDVSKFVEKAKK-NPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSN 73 (131) Q Consensus 1 msf------~~~i~~~~~~~~~-~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~ 73 (131) |-- ...+.++++.+.. +....+...-+.+...--..|||||+.|-+|=--.+....+. T Consensus 1 ikV~Gi~~~~~nl~~~i~~I~~~K~~Ral~~al~~~~~~AA~~TPIDTSTLiNSQfrei~~ngtr--------------- 65 (131) T protein:vir:10 1 MPVKGIKRIQMNTRRVLSDIAGIRTEKVLYLVMNAGANHAAVITPVKSSTLINSQYKKLEPIPSG--------------- 65 (131) T ss_pred CCcchHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHhhhhhccccchhhhccccceeeeccCce--------------- Confidence 211 1222333333321 222222322222333333579999999998865444321111 Q ss_pred HHHHHhhccCCceEEEeeCchhhhhhhc--CC--------------CCCCCchhHHHHHHHH-HHHHHHHHHh-hC Q lcl|NC_019539. 74 AANFVLNAADWHTFTLTNNLPYAQRLEY--GW--------------SQQAPQGFVRVNVSRF-QQLLNEEASK-VK 131 (131) Q Consensus 74 ~~~~i~~~~~g~~i~i~Nn~pYa~~LEy--G~--------------S~QAp~G~V~~a~~~~-~~~v~~~~~e-~k 131 (131) -+--+.++..||.++.. |. ..-|-..|..-.+++- .+.++..++| .| T Consensus 66 -----------itGRVGYSAnYA~yVHda~Gklkgqprp~gkgn~w~p~ae~eFL~kgfe~~~~d~i~avik~e~k 130 (131) T protein:vir:10 66 -----------MIGRVGYTANYAAAVNAAKGKLKGKPRPDGSGNYWDPNGEPDFLRKGFERDGLNEIKAIIRQGYK 130 (131) T ss_pred -----------eEEeeccceeeeeeeecCccccCCCcCCCCCcceecCCCChhhhhhhhhccchHHHHHHHhhhcC Confidence 13347788899999977 22 2334445666666543 3344444433 34 No 136 >protein:vir:98557 Length: 149 # NCBI annotation: gp14 # Family: family:all:370 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958069;genbank:gi:41057366;genbank:GeneID:2744228 Probab=47.24 E-value=0.71 Score=21.41 Aligned_cols=120 Identities=13% Similarity=0.143 Sum_probs=52.6 Q ss_pred CccchhHHHHHHHHHH--------HHHHHHHHHHHHHHHHHHHh-----CCccchhhccccccccCcccccccCCCCCCc Q lcl|NC_019539. 1 MSFALDVSKFVEKAKK--------NPEKVIRQVSIKLFSAIIKA-----SPVDTGRFRMNWMASGGTPADGTTDATDKAG 67 (131) Q Consensus 1 msf~~~i~~~~~~~~~--------~~~~~~r~~a~~~~~~vv~~-----tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g 67 (131) |+ +++++.+.+.. +...++++++..+......+ +| | |. .|...- +.+... ...... T Consensus 1 m~---d~~~l~~~L~~ll~~L~~~~~~~ll~~Ig~~l~~~t~~rf~~q~~P-d-G~---~W~p~~--~~~~~~-k~~~~~ 69 (149) T protein:vir:98 1 MS---ELTALQERLTGLIASLSPAARRQMAADIAKKLRASQQQRIRRQQAP-D-GT---PYAARK--RQSVRS-KKGRIR 69 (149) T ss_pred Cc---hHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhhcCC-C-CC---CCcccc--hHHHHh-ccCCCC Confidence 66 33333332222 22345777777766655542 44 2 21 353321 111000 000000 Q ss_pred chhH--HHHHHHHhhccCCce--E-EEeeCchhhhhhhcCCCCC----------CCchhHHHHHHHHHHHHHHHHHhhC Q lcl|NC_019539. 68 TTAT--SNAANFVLNAADWHT--F-TLTNNLPYAQRLEYGWSQQ----------APQGFVRVNVSRFQQLLNEEASKVK 131 (131) Q Consensus 68 ~~~~--~~~~~~i~~~~~g~~--i-~i~Nn~pYa~~LEyG~S~Q----------Ap~G~V~~a~~~~~~~v~~~~~e~k 131 (131) ..-+ .....-|...-..+. | |+..|.+||..-.||-..+ ....|+.++-+..+.+++-+.+-+. T Consensus 70 ~~l~~~g~l~~sl~~~~~~~~~~V~~~Gs~~~yAa~HQfG~~~r~~~~~~~~~iPaRp~LG~s~~d~~~i~~~i~~~l~ 148 (149) T protein:vir:98 70 REMFARLRTNRFMKAKGSDSAAVVEFTGRVQRMARVHQYGLKDRPNRHSRDVQYAARPLLGFTRDDEQMIEDIIIRHLG 148 (149) T ss_pred cccchhhhhhhhhhheecCCeeEEEecCcchHHhhHhhccccccccCCCcceeccccccCCCCHHHHHHHHHHHHHHhh Confidence 0000 011111111111222 2 3589999999999997532 3445666666555555555555554 No 137 >protein:vir:2026 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046769;genbank:gi:9630340;genbank:GeneID:1261511 Probab=45.03 E-value=0.78 Score=21.16 Aligned_cols=122 Identities=9% Similarity=0.074 Sum_probs=54.9 Q ss_pred CccchhHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHH-----hCCccchhhccccccccCcccccccCCCCCCcchh Q lcl|NC_019539. 1 MSFALDVSKFVEKAKK-----NPEKVIRQVSIKLFSAIIK-----ASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTA 70 (131) Q Consensus 1 msf~~~i~~~~~~~~~-----~~~~~~r~~a~~~~~~vv~-----~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~ 70 (131) |+=-.+++.....+-. +...++++++..+...... .+| | |. .|...- +.+. ......+... T Consensus 1 ~~~~~~l~~~L~~ll~~l~~~~~~~l~~~Ig~~l~~~~~~rf~~q~~P-d-G~---~W~p~k--~~~~--~~k~g~~~~~ 71 (150) T protein:vir:20 1 MNEFKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAP-D-GT---PYAPRQ--QQSV--RKKTGRVKRK 71 (150) T ss_pred CchHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCC-C-CC---CCcccc--hHHH--HHhccCCCcc Confidence 4422333322222221 2234677777776666654 244 2 11 232110 0000 0000000111 Q ss_pred HH---HHHHHHhhccCCc--eEEE--eeCchhhhhhhcCCC----------CCCCchhHHHHHHHHHHHHHHHHHhhC Q lcl|NC_019539. 71 TS---NAANFVLNAADWH--TFTL--TNNLPYAQRLEYGWS----------QQAPQGFVRVNVSRFQQLLNEEASKVK 131 (131) Q Consensus 71 ~~---~~~~~i~~~~~g~--~i~i--~Nn~pYa~~LEyG~S----------~QAp~G~V~~a~~~~~~~v~~~~~e~k 131 (131) +. ....-|..-...+ +|++ ..|.+||..-.||-+ +-....|++++....+.|.+-+.+-+. T Consensus 72 l~~~~~l~~sl~~~~~~~~~~vg~~~Gs~~~yAa~HQfG~~~~~~~~~~~~~iPaRp~LG~s~~d~~~i~~~i~~~l~ 149 (150) T protein:vir:20 72 MFAKLITSRFLHIRASPEQASMEFYGGKSPKIASVHQFGLSEENRKDGKKIDYPARPLLGFTGEDVQMIEEIILAHLE 149 (150) T ss_pred ccchhhhhhhhheeecCcEEEEEeeCCcchhhhhhhhcccccccccCCCceeccccccCCCCHHHHHHHHHHHHHHHh Confidence 10 1111111111122 3433 889999999999964 234456777777777777666666666 No 138 >protein:vir:78163 Length: 92 # NCBI annotation: hypothetical protein # Family: family:all:29889 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294805;genbank:gi:149882826;genbank:GeneID:5309191 Probab=44.95 E-value=0.18 Score=24.63 Aligned_cols=87 Identities=15% Similarity=0.254 Sum_probs=41.9 Q ss_pred Cc--cchhHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHHHHHH Q lcl|NC_019539. 1 MS--FALDVSKFVEKAK--KNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNAAN 76 (131) Q Consensus 1 ms--f~~~i~~~~~~~~--~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~~~ 76 (131) |. |.-... |-+++- ..+..+++-.|.+.+...+...|||||.+|...++..- ........---|+.. T Consensus 1 madaftpNp~-~FDqIl~s~~VrALt~gaAe~aLa~AKAsAPVDTGAYRDGL~iE~~--q~~~RtT~MVVG~D~------ 71 (92) T protein:vir:78 1 MADAFTPNPT-WFDQIMRTPKVRALVDGVAEETLADAKASAPVDTGAYRDGLHIEHR--QGRSRETAMVVGSDE------ 71 (92) T ss_pred CCCccCCChh-HHHHhhcccchhhhhhhhhhhhhhhhcccCcccccccccccchhhh--hccccceeEEeecCc------ Confidence 65 554444 444432 33567888899999999999999999999999976421 110000000000000 Q ss_pred HHhhccCCceEEE-eeCchhhhhhhcCCC Q lcl|NC_019539. 77 FVLNAADWHTFTL-TNNLPYAQRLEYGWS 104 (131) Q Consensus 77 ~i~~~~~g~~i~i-~Nn~pYa~~LEyG~S 104 (131) .++.| +-+=.-+..|.--.| T Consensus 72 --------KTlLvESrTGNLakalk~~rs 92 (92) T protein:vir:78 72 --------KTLLIESRTGNLARSVKRRRS 92 (92) T ss_pred --------ceeeeecccchHHHHHhhhcC Confidence 01111 000011111211111 No 139 >protein:vir:6071 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878212;genbank:gi:33438911;genbank:GeneID:1457746 Probab=44.52 E-value=0.8 Score=21.10 Aligned_cols=122 Identities=8% Similarity=0.090 Sum_probs=55.9 Q ss_pred CccchhHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHh-----CCccchhhccccccccCcccccccCCCCCCcchh Q lcl|NC_019539. 1 MSFALDVSKFVEKAKK-----NPEKVIRQVSIKLFSAIIKA-----SPVDTGRFRMNWMASGGTPADGTTDATDKAGTTA 70 (131) Q Consensus 1 msf~~~i~~~~~~~~~-----~~~~~~r~~a~~~~~~vv~~-----tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~ 70 (131) |+=-.++.+....+-. ....++++++..+......+ .| | |. .|...- +.+.. .....+... T Consensus 1 ~~~~~~l~~~L~~~l~~L~~~~~~~l~r~Ig~~l~~~~~~Rf~~q~~P-d-G~---~W~p~~--~~~~~--~k~~~~~~~ 71 (150) T protein:vir:60 1 MNEFKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAP-D-GT---PYAPRQ--QQSAR--KKTGRVKRK 71 (150) T ss_pred CchHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCC-C-CC---CCcccC--hHHHH--HhhcCCCcc Confidence 5422222222222211 22346777777766666542 34 2 11 142210 00000 000000000 Q ss_pred H-H--HHHHHHhhc--cCCceEE--EeeCchhhhhhhcCCCCC----------CCchhHHHHHHHHHHHHHHHHHhhC Q lcl|NC_019539. 71 T-S--NAANFVLNA--ADWHTFT--LTNNLPYAQRLEYGWSQQ----------APQGFVRVNVSRFQQLLNEEASKVK 131 (131) Q Consensus 71 ~-~--~~~~~i~~~--~~g~~i~--i~Nn~pYa~~LEyG~S~Q----------Ap~G~V~~a~~~~~~~v~~~~~e~k 131 (131) + . ....-|... ..+-+|. +..|.+||..-.||-+.+ ....|++++-+..+.+.+.+.+-+. T Consensus 72 l~~~~~l~~sl~~~~~~~~a~vg~~~Gt~~~yAaiHQfG~~~~~~~~~~~~~iPaRp~LG~s~~d~~~i~~~i~~~l~ 149 (150) T protein:vir:60 72 MFAKLITSRFLHIRASPEQASMEFYGGKSPKIASVHQFGLSEENRKDGKKIDYPARPLLGFTGEDVQMIEEIILAHLD 149 (150) T ss_pred chhhhhhcceeeeeeeCcEEEEEeeCCCchhhhhhhhccccccccCCCCceecCCcccCCCCHHHHHHHHHHHHHHHh Confidence 0 0 000001101 1122443 388999999999997533 4557778888777777777766666 No 140 >protein:vir:5703 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839862;genbank:gi:30065717;genbank:GeneID:1260611 Probab=42.66 E-value=0.88 Score=20.90 Aligned_cols=122 Identities=8% Similarity=0.093 Sum_probs=55.0 Q ss_pred CccchhHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHh-----CCccchhhccccccccCcccccccCCCCCCcchh Q lcl|NC_019539. 1 MSFALDVSKFVEKAKK-----NPEKVIRQVSIKLFSAIIKA-----SPVDTGRFRMNWMASGGTPADGTTDATDKAGTTA 70 (131) Q Consensus 1 msf~~~i~~~~~~~~~-----~~~~~~r~~a~~~~~~vv~~-----tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~ 70 (131) |.=-.++.+....+-. +...++++++..+......+ +| | |. -|... .+.+.. .....+... T Consensus 1 m~~~~~l~~~L~~~l~~L~~~~~~~l~~~Ig~~l~~~~~~rf~~q~~P-d-G~---~W~p~--k~~~~~--~k~~~~~~~ 71 (150) T protein:vir:57 1 MNEFKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAP-D-GT---PYAPR--QQQSAR--KKTGRVKRK 71 (150) T ss_pred CchHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCC-C-CC---CCccc--ChHHHH--HhccCCCcc Confidence 4422333322222211 22336777777766666542 34 2 11 13211 000000 000000000 Q ss_pred H-H--HHHHHHhhc--cCCceEE--EeeCchhhhhhhcCCCCC----------CCchhHHHHHHHHHHHHHHHHHhhC Q lcl|NC_019539. 71 T-S--NAANFVLNA--ADWHTFT--LTNNLPYAQRLEYGWSQQ----------APQGFVRVNVSRFQQLLNEEASKVK 131 (131) Q Consensus 71 ~-~--~~~~~i~~~--~~g~~i~--i~Nn~pYa~~LEyG~S~Q----------Ap~G~V~~a~~~~~~~v~~~~~e~k 131 (131) + . ....-|... ..+-+|. +..|.+||..-.||-+.+ ....|++++.+....|.+-+.+-+. T Consensus 72 l~~~~~l~~sl~~~~~~~~a~vg~~~G~~~~yAaiHQfG~~~r~~~~~~~~~iPaRp~LG~s~~d~~~i~~~i~~~l~ 149 (150) T protein:vir:57 72 MFAKLITSRFLHIRASPEQASMEFYGGKSPKIASVHQFGLSEETRKDGKKIDYPARPLLGFTGEDVQMIEEIILAHLD 149 (150) T ss_pred cchhhhhccceeeeeeCcEEEEEeecCCchhhhhhhhccccccccCCCceeecCCcccCCCCHHHHHHHHHHHHHHHh Confidence 0 0 001111111 1122443 388999999999997643 4557778887777666666666666 No 141 >protein:vir:94069 Length: 168 # NCBI annotation: putative RNA polymerase # Family: family:all:503 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453622;genbank:gi:84662658;genbank:GeneID:5142579 Probab=41.93 E-value=0.29 Score=23.51 Aligned_cols=93 Identities=19% Similarity=0.182 Sum_probs=38.0 Q ss_pred HHHHHHHHHH---HHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHHHHHHHHhhccCCceEEEeeCch Q lcl|NC_019539. 18 PEKVIRQVSI---KLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNAANFVLNAADWHTFTLTNNLP 94 (131) Q Consensus 18 ~~~~~r~~a~---~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~~~~i~~~~~g~~i~i~Nn~p 94 (131) +..+.++... +++.++. ....|..|.-+..-|.....+..+ |. .....+.| .+++- T Consensus 1 ~~~~~~~g~~~~~~~~~~l~------~~~v~vG~l~~a~yp~G~~~~~~~--~~--------~~~~~~~g-----~~va~ 59 (168) T protein:vir:94 1 MTTIARKGVKMPPHLEAQFQ------SGEVKAGVLSGSTYPQMTYTDQRT--GK--------QIEDARGG-----MPVAV 59 (168) T ss_pred CccccchhhhhhHHHHHhhh------ccceeeeccccCcccccccchhhc--cc--------cccccccc-----ccHHH Confidence 1112222111 1122221 223344443333222211100000 00 00000000 02345 Q ss_pred hhhhhhcCCCCCCCchhHHHHHHHHH----HHHHHHHHhhC Q lcl|NC_019539. 95 YAQRLEYGWSQQAPQGFVRVNVSRFQ----QLLNEEASKVK 131 (131) Q Consensus 95 Ya~~LEyG~S~QAp~G~V~~a~~~~~----~~v~~~~~e~k 131 (131) +|..+|||+.+-.|..|+|.++.+-+ +.+.++++--- T Consensus 60 Ia~~~E~G~~~IP~RPFlr~t~~~~~~~~~~~~~~~~~~~~ 100 (168) T protein:vir:94 60 IAQALEYGHGQNHPRPFMQQTYAAQYRAWSRDLTLTLKAGA 100 (168) T ss_pred HHHHHhcCCCCCCCchhhHHHHHHHHHHHHHHHHHHHhcCC Confidence 67899999999999999999986543 33333332111 No 142 >protein:vir:98636 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039927;genbank:gi:126011102;genbank:GeneID:4818472 Probab=40.71 E-value=0.96 Score=20.68 Aligned_cols=106 Identities=16% Similarity=0.201 Sum_probs=56.5 Q ss_pred Cccchh---HHHHHHHHHH---------HHHHHHHHHHHHHHHHHHHhCCc--cchhhccccccccCcccccccCCCCCC Q lcl|NC_019539. 1 MSFALD---VSKFVEKAKK---------NPEKVIRQVSIKLFSAIIKASPV--DTGRFRMNWMASGGTPADGTTDATDKA 66 (131) Q Consensus 1 msf~~~---i~~~~~~~~~---------~~~~~~r~~a~~~~~~vv~~tPV--dtGr~R~nw~vs~~~~~~~~~~~~d~~ 66 (131) ||=-++ +++..+++++ ..+..+++.+..+...++...+| |||..-..-.+| .|-+ .+ T Consensus 7 ~~~~aevkGv~Eilk~lE~klG~~~~~ri~nkAL~~~ge~v~~~lK~~~~~fkDTGat~dev~~s--~p~~-------~~ 77 (138) T protein:vir:98 7 MSGFANLKGVEELLANMEKKLGPAKVNRVVNRSLKEIGKELEPSFKSAISIYKRTGETTESAVVS--GVRR-------ED 77 (138) T ss_pred ccccccccCHHHHHHHHHHhhCHHhhhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeeeeeec--Ceee-------cC Confidence 553333 3344444443 34556777788888888888886 999866655444 2211 12 Q ss_pred cchhHHHHHHHHhhccCCceEEEeeCch--hhhhhh-cCCCC---CCCchhHHHHHHHHHHHH-HHHHHhhC Q lcl|NC_019539. 67 GTTATSNAANFVLNAADWHTFTLTNNLP--YAQRLE-YGWSQ---QAPQGFVRVNVSRFQQLL-NEEASKVK 131 (131) Q Consensus 67 g~~~~~~~~~~i~~~~~g~~i~i~Nn~p--Ya~~LE-yG~S~---QAp~G~V~~a~~~~~~~v-~~~~~e~k 131 (131) |.. +|-|...-| |..+|+ |||.+ ..+-|+++-++..-.... +..-.|++ T Consensus 78 G~r----------------~V~igW~GpR~~ivHLNE~GyGk~i~PrG~G~I~ka~~~se~~y~~~vk~el~ 133 (138) T protein:vir:98 78 GIP----------------KVKLGFTTPRWNIVHLQELEYGWKHNRRGVGVIRRYSDILETIYPRGIRDKLK 133 (138) T ss_pred Cce----------------EEEEeeecCeeeEEeeecccccCCcCCCcchHHHHHHHhhhHHHHHHHHHHHH Confidence 222 233333333 223444 88853 334577777776664443 33334444 No 143 >protein:vir:105825 Length: 108 # NCBI annotation: gp9 # Family: family:all:3937 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655770;genbank:gi:109522093;genbank:GeneID:4157633 Probab=39.31 E-value=0.46 Score=22.43 Aligned_cols=95 Identities=18% Similarity=0.238 Sum_probs=42.8 Q ss_pred CccchhHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHHHHHHHH Q lcl|NC_019539. 1 MSFALDVSKFVE--KAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNAANFV 78 (131) Q Consensus 1 msf~~~i~~~~~--~~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~~~~i 78 (131) -.|...+++|-+ ++.+.+ .+.--++...-+..+||.||.+|.|-||+.-+..- ...+-|...-+.- + T Consensus 12 akfgi~lddfdklpevnqgv----nef~dev~aawk~nspv~~g~yrdsvqvterstnk----grgkvgatdpqah---l 80 (108) T protein:vir:10 12 AKFGVRLDDFDKLPEVNQGV----NEFIDEVVAAWKNNSPVGTGAYRDSVQVTERSTNK----GRGKVGATDPQAH---L 80 (108) T ss_pred hhhccchhhhhccchhhhhH----HHHHHHHHHhhhcCCCccccccccceeeccccccc----ccccccCcchhhh---h Confidence 447788888855 233333 34444556677889999999999999998543321 1122221111000 0 Q ss_pred hhccCCceEEEeeCchhhhhhhcCCCCCCCchhHHHHHHH Q lcl|NC_019539. 79 LNAADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSR 118 (131) Q Consensus 79 ~~~~~g~~i~i~Nn~pYa~~LEyG~S~QAp~G~V~~a~~~ 118 (131) - ..|. .-|-+||+.-..- +| |=+.+... T Consensus 81 v--efgs----~hndeyapaqkta--kq----fggtay~d 108 (108) T protein:vir:10 81 V--EFGS----AHNDEYAPAQKTA--KQ----FGGTAYGD 108 (108) T ss_pred h--hhhc----cccccccchhhhH--Hh----hcccccCC Confidence 0 0000 1233444432110 00 00000000 No 144 >protein:vir:102608 Length: 108 # NCBI annotation: gp9 # Family: family:all:3937 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655005;genbank:gi:109392195;genbank:GeneID:4157230 Probab=39.31 E-value=0.46 Score=22.43 Aligned_cols=95 Identities=18% Similarity=0.238 Sum_probs=42.8 Q ss_pred CccchhHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhHHHHHHHH Q lcl|NC_019539. 1 MSFALDVSKFVE--KAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTATSNAANFV 78 (131) Q Consensus 1 msf~~~i~~~~~--~~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~~~~~~~i 78 (131) -.|...+++|-+ ++.+.+ .+.--++...-+..+||.||.+|.|-||+.-+..- ...+-|...-+.- + T Consensus 12 akfgi~lddfdklpevnqgv----nef~dev~aawk~nspv~~g~yrdsvqvterstnk----grgkvgatdpqah---l 80 (108) T protein:vir:10 12 AKFGVRLDDFDKLPEVNQGV----NEFIDEVVAAWKNNSPVGTGAYRDSVQVTERSTNK----GRGKVGATDPQAH---L 80 (108) T ss_pred hhhccchhhhhccchhhhhH----HHHHHHHHHhhhcCCCccccccccceeeccccccc----ccccccCcchhhh---h Confidence 447788888855 233333 34444556677889999999999999998543321 1122221111000 0 Q ss_pred hhccCCceEEEeeCchhhhhhhcCCCCCCCchhHHHHHHH Q lcl|NC_019539. 79 LNAADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSR 118 (131) Q Consensus 79 ~~~~~g~~i~i~Nn~pYa~~LEyG~S~QAp~G~V~~a~~~ 118 (131) - ..|. .-|-+||+.-..- +| |=+.+... T Consensus 81 v--efgs----~hndeyapaqkta--kq----fggtay~d 108 (108) T protein:vir:10 81 V--EFGS----AHNDEYAPAQKTA--KQ----FGGTAYGD 108 (108) T ss_pred h--hhhc----cccccccchhhhH--Hh----hcccccCC Confidence 0 0000 1233444432110 00 00000000 No 145 >protein:vir:5257 Length: 148 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852762;genbank:gi:31544037;uniprot:Q7Y5T8;genbank:GeneID:2753554 Probab=37.30 E-value=1.1 Score=20.37 Aligned_cols=71 Identities=13% Similarity=0.092 Sum_probs=33.8 Q ss_pred ccccccCCCCCCcchhHHHHHHHHhhccCCceEE--E-------------eeCchhhhhhhcCCCCCCCchhHHHHHHHH Q lcl|NC_019539. 55 PADGTTDATDKAGTTATSNAANFVLNAADWHTFT--L-------------TNNLPYAQRLEYGWSQQAPQGFVRVNVSRF 119 (131) Q Consensus 55 ~~~~~~~~~d~~g~~~~~~~~~~i~~~~~g~~i~--i-------------~Nn~pYa~~LEyG~S~QAp~G~V~~a~~~~ 119 (131) |+... ..+..| +.+....+..+.. ..+. | .+++-.|...|||+.+-.|..|+|.++.+- T Consensus 1 M~~~~--k~~~~~---~~~l~~~l~~l~~-~~v~VGi~~~~~~~~~~~~g~~vA~ia~~~E~G~~~IP~Rpflr~t~~~~ 74 (148) T protein:vir:52 1 MAVTV--TANFSA---AKQLIEQMKSLKE-KAVYVGFPAEFDEKVKGSENFNLASLAAVLEFGNEHIPARPFLRQTLEEN 74 (148) T ss_pred Ccccc--ccccHH---HHHHHHHHHHhhC-CeEEEEeecCcCCCCCCCCCCCHHHHHHHHhcCCCCCCCcchhHHHHHHH Confidence 22211 111111 2222222222211 1111 1 234556778999999999999999988665 Q ss_pred HHHHHHHHHhh-C Q lcl|NC_019539. 120 QQLLNEEASKV-K 131 (131) Q Consensus 120 ~~~v~~~~~e~-k 131 (131) .+-+.+.+..+ + T Consensus 75 ~~~~~~~~~~~~~ 87 (148) T protein:vir:52 75 QEKYTALFIQWFD 87 (148) T ss_pred HHHHHHHHHHHHH Confidence 44333333222 2 No 146 >protein:vir:96105 Length: 193 # NCBI annotation: hypothetical protein ORF028 # Family: family:all:503 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294445;genbank:gi:149408342;genbank:GeneID:5237224 Probab=22.00 E-value=2.5 Score=18.39 Aligned_cols=102 Identities=15% Similarity=0.099 Sum_probs=37.7 Q ss_pred CccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccccccCcccccccCCCCCCcchhH--HHHHHHH Q lcl|NC_019539. 1 MSFALDVSKFVEKAKKNPEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTTAT--SNAANFV 78 (131) Q Consensus 1 msf~~~i~~~~~~~~~~~~~~~r~~a~~~~~~vv~~tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g~~~~--~~~~~~i 78 (131) |++..+.+.+.+ +.+++ ... ++..|.++-+.... .+|+.|.+.- -...+.+ T Consensus 1 m~~~~~~~~~~~--------~~~~l--------~~l---------~~~~v~vGi~~~~~--~~~~~~~~~G~~va~iAai 53 (193) T protein:vir:96 1 MSLRRDSELIAA--------HLQML--------RAM---------RGRSVSAGWYSTAR--YPDKAGGSVGIQVARIARL 53 (193) T ss_pred CeeccchHHHHH--------HHHHH--------HHh---------cCCeEEEEEcCCCC--CCCcccccccchHHHHHhH Confidence 998877765432 22222 111 12333333332211 1222221110 0001112 Q ss_pred hhccCCceEEEeeCchhh------------hhhhcCCC-----------CCCCchhHHHHHHH----HHHHHHHHHHhhC Q lcl|NC_019539. 79 LNAADWHTFTLTNNLPYA------------QRLEYGWS-----------QQAPQGFVRVNVSR----FQQLLNEEASKVK 131 (131) Q Consensus 79 ~~~~~g~~i~i~Nn~pYa------------~~LEyG~S-----------~QAp~G~V~~a~~~----~~~~v~~~~~e~k 131 (131) ..+ |.+|-+-+...|. .++..+++ +-.|..|+|.++.+ |.+++++.++.+- T Consensus 54 ~Ef--G~~I~~~~~~~~~~~~~~~g~~~~~~~~k~~~~~~~~~~~~~~v~IPaRPFlr~t~~~~~~~~~~~~~~~~~~~~ 131 (193) T protein:vir:96 54 NEY--GGTIDHPGGTRYIRDAIVRGRFVGVRFVRNDFPGETEVTKPHRITIPARPFMRYAWNLFSADRAAIQNRIAMRLA 131 (193) T ss_pred HHc--CCccccCccceeeeeccccccccccceeccCcceeeEeecceeccCCCcchhhhhHHHHHHHHHHHHHHHHHHHH Confidence 211 1111111111110 01111111 34678999998655 5555555655554 No 147 >protein:vir:1838 Length: 149 # NCBI annotation: O protein # Family: family:all:370 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052262;genbank:gi:9634069;genbank:GeneID:1262457 Probab=20.59 E-value=2.7 Score=18.18 Aligned_cols=117 Identities=15% Similarity=0.114 Sum_probs=55.2 Q ss_pred CccchhHHHHHHHHHHHH--------HHHHHHHHHHHHHHHHHh-----CCccchhhccccccccCcccccccCCCCCCc Q lcl|NC_019539. 1 MSFALDVSKFVEKAKKNP--------EKVIRQVSIKLFSAIIKA-----SPVDTGRFRMNWMASGGTPADGTTDATDKAG 67 (131) Q Consensus 1 msf~~~i~~~~~~~~~~~--------~~~~r~~a~~~~~~vv~~-----tPVdtGr~R~nw~vs~~~~~~~~~~~~d~~g 67 (131) |+ ++....+.+...+ ..++++++..+......+ +| | |. .|.-.- +.+.. .+.| T Consensus 1 m~---~~~~~~~~l~~ll~~L~~~~~~~l~r~Ig~~l~~~t~~rf~~q~~P-d-G~---~W~p~~--~~~~~----~~~g 66 (149) T protein:vir:18 1 MS---ELTALQERLAGLIASLSPAARRKMAAEIAKKLRTSQQQRIKRQQAP-D-GT---PYAARK--RQPVR----SKKG 66 (149) T ss_pred Cc---hHHHHHHHHHHHHHhcCCchHHHHHHHHHHHHHHHHHHHHHhhcCC-C-CC---CCcccc--hhhhh----hccC Confidence 54 3333333332221 236777777766666543 45 2 21 343221 11100 0111 Q ss_pred ch---hH-----HHHHHHHhhccCCceEEEeeCchhhhhhhcCCCCC----------CCchhHHHHHHHHHHHHHHHHHh Q lcl|NC_019539. 68 TT---AT-----SNAANFVLNAADWHTFTLTNNLPYAQRLEYGWSQQ----------APQGFVRVNVSRFQQLLNEEASK 129 (131) Q Consensus 68 ~~---~~-----~~~~~~i~~~~~g~~i~i~Nn~pYa~~LEyG~S~Q----------Ap~G~V~~a~~~~~~~v~~~~~e 129 (131) .. -. +.-...........+.++..|.+||.--.||-..+ ....|+.++-+....|.+.+.+- T Consensus 67 ~~~~~~~~~l~~~~~l~~~~~~~~~~v~~~Gtn~~yAaiHQfG~~~r~~~~~~~v~iPaRp~LG~s~~d~~~I~~~i~~~ 146 (149) T protein:vir:18 67 RIKREMFAKLRTSRFMKAKGSDSAAVVEFTGKVQRMARVHQYGLKDRPNRNSRDVQYEARPLLGFTRDDEQMIEDVIISH 146 (149) T ss_pred cccchhhhhhhhhhhhheeecCceeEEEecccchhhhhhhhccccccccCCCccccccccccCCCCHHHHHHHHHHHHHH Confidence 00 00 00011111112223445799999999999997632 34466677776666666555555 Q ss_pred hC Q lcl|NC_019539. 130 VK 131 (131) Q Consensus 130 ~k 131 (131) +. T Consensus 147 l~ 148 (149) T protein:vir:18 147 LG 148 (149) T ss_pred Hh Confidence 55 Done!