Query lcl|NC_019511.1_cdsid_YP_007005625.1 [gene=F422_gp150] [protein=hypothetical protein] [protein_id=YP_007005625.1] [location=complement(113372..114364)] Match_columns 330 No_of_seqs 163 out of 356 Neff 7.1 Searched_HMMs 1612 Date Thu Nov 7 16:57:05 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_150 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_150_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:95599 Length: 563 100.0 5.4E-81 3.4E-84 460.6 30.3 330 1-330 1-330 (563) 2 protein:vir:99312 Length: 563 100.0 5.4E-81 3.4E-84 460.6 30.3 330 1-330 1-330 (563) 3 protein:vir:96579 Length: 576 100.0 6.2E-78 3.9E-81 443.8 31.6 325 1-330 5-329 (576) 4 protein:vir:63755 Length: 547 100.0 2.5E-75 1.5E-78 429.6 32.4 320 2-330 1-320 (547) 5 protein:vir:80644 Length: 551 100.0 9.5E-75 5.9E-78 426.4 31.0 321 1-330 4-324 (551) 6 protein:vir:80796 Length: 574 100.0 6.7E-67 4.1E-70 383.4 26.6 322 4-330 1-328 (574) 7 protein:vir:100691 Length: 535 100.0 1.6E-57 9.7E-61 332.0 28.7 307 1-330 1-317 (535) 8 protein:vir:100249 Length: 431 100.0 5.7E-44 3.5E-47 257.7 22.7 270 1-330 1-271 (431) 9 protein:vir:94666 Length: 723 100.0 1.3E-42 8E-46 250.2 25.1 237 62-330 1-254 (723) 10 protein:vir:100150 Length: 437 100.0 1.5E-42 9.2E-46 249.9 25.2 258 25-330 1-258 (437) 11 protein:vir:4698 Length: 251 # 100.0 2.6E-42 1.6E-45 248.6 24.6 246 29-330 1-250 (251) 12 protein:vir:107605 Length: 432 100.0 1.9E-42 1.2E-45 249.3 23.8 259 29-330 1-262 (432) 13 protein:vir:105002 Length: 432 100.0 1.9E-42 1.2E-45 249.3 23.8 259 29-330 1-262 (432) 14 protein:vir:102855 Length: 432 100.0 1.9E-42 1.2E-45 249.3 23.8 259 29-330 1-262 (432) 15 protein:vir:1380 Length: 422 # 100.0 3.1E-42 1.9E-45 248.2 24.8 259 1-330 1-261 (422) 16 protein:vir:102727 Length: 945 100.0 4.8E-42 3E-45 247.1 24.3 299 1-330 4-352 (945) 17 protein:vir:105064 Length: 421 100.0 3.6E-42 2.3E-45 247.8 23.3 249 32-330 1-252 (421) 18 protein:vir:4337 Length: 434 # 100.0 2.2E-42 1.3E-45 249.0 21.8 260 29-330 1-261 (434) 19 protein:vir:7853 Length: 518 # 100.0 8.5E-42 5.3E-45 245.8 24.9 248 41-330 1-257 (518) 20 protein:vir:102080 Length: 429 100.0 6.8E-42 4.2E-45 246.3 23.5 255 29-330 1-259 (429) 21 protein:vir:102118 Length: 409 100.0 1.5E-41 9.5E-45 244.3 25.1 246 32-330 1-251 (409) 22 protein:vir:101648 Length: 518 100.0 2E-41 1.3E-44 243.7 24.8 248 41-330 1-257 (518) 23 protein:vir:93610 Length: 454 100.0 5.6E-41 3.5E-44 241.2 24.8 253 28-330 1-260 (454) 24 protein:vir:5737 Length: 419 # 100.0 9.2E-41 5.7E-44 240.1 23.7 248 32-330 1-250 (419) 25 protein:vir:4454 Length: 414 # 100.0 9.2E-41 5.7E-44 240.1 23.6 249 29-330 1-249 (414) 26 protein:vir:9408 Length: 441 # 100.0 3.8E-41 2.3E-44 242.2 21.3 266 1-330 6-275 (441) 27 protein:vir:79984 Length: 441 100.0 3.8E-41 2.3E-44 242.2 21.3 266 1-330 6-275 (441) 28 protein:vir:483 Length: 413 # 100.0 1.4E-40 8.8E-44 239.0 24.2 248 31-330 1-248 (413) 29 protein:vir:10362 Length: 432 100.0 8.8E-41 5.5E-44 240.2 22.6 259 23-330 1-259 (432) 30 protein:vir:81152 Length: 411 100.0 1.3E-40 8E-44 239.3 23.4 250 29-330 1-253 (411) 31 protein:vir:3153 Length: 467 # 100.0 1E-40 6.3E-44 239.8 22.6 225 88-330 1-266 (467) 32 protein:vir:98396 Length: 441 100.0 1.8E-40 1.1E-43 238.5 23.8 256 33-330 1-275 (441) 33 protein:vir:8100 Length: 466 # 100.0 1.7E-40 1E-43 238.6 23.6 281 2-330 1-291 (466) 34 protein:vir:1884 Length: 424 # 100.0 2.1E-40 1.3E-43 238.2 23.5 253 43-330 1-264 (424) 35 protein:vir:6240 Length: 457 # 100.0 2.8E-40 1.8E-43 237.4 23.7 259 2-330 1-263 (457) 36 protein:vir:81095 Length: 416 100.0 5.1E-40 3.2E-43 236.0 24.4 245 29-330 1-250 (416) 37 protein:vir:4598 Length: 416 # 100.0 5.1E-40 3.2E-43 236.0 24.4 245 29-330 1-250 (416) 38 protein:vir:101647 Length: 460 100.0 7.3E-40 4.5E-43 235.1 24.4 282 1-330 1-297 (460) 39 protein:vir:189 Length: 424 # 100.0 5.4E-40 3.4E-43 235.9 23.4 264 12-330 1-264 (424) 40 protein:vir:960 Length: 413 # 100.0 4.5E-40 2.8E-43 236.3 22.7 260 23-330 1-262 (413) 41 protein:vir:95378 Length: 406 100.0 7.3E-40 4.5E-43 235.2 23.5 246 29-330 1-246 (406) 42 protein:vir:80333 Length: 419 100.0 5.8E-40 3.6E-43 235.7 22.9 247 32-330 1-249 (419) 43 protein:vir:1431 Length: 419 # 100.0 3.3E-40 2.1E-43 237.0 21.4 246 39-330 1-249 (419) 44 protein:vir:97060 Length: 432 100.0 9.5E-40 5.9E-43 234.5 22.7 259 23-330 1-259 (432) 45 protein:vir:81072 Length: 432 100.0 6.7E-40 4.1E-43 235.4 21.8 259 23-330 1-259 (432) 46 protein:vir:4509 Length: 424 # 100.0 8.6E-40 5.4E-43 234.7 22.1 247 1-330 15-263 (424) 47 protein:vir:1082 Length: 359 # 100.0 6E-40 3.7E-43 235.6 21.0 236 29-330 1-238 (359) 48 protein:vir:3868 Length: 417 # 100.0 2.4E-39 1.5E-42 232.3 24.0 239 37-330 1-243 (417) 49 protein:vir:8317 Length: 409 # 100.0 2.3E-39 1.4E-42 232.4 23.0 264 2-330 1-269 (409) 50 protein:vir:4156 Length: 542 # 100.0 4.1E-39 2.6E-42 231.0 23.2 252 24-330 1-280 (542) 51 protein:vir:9702 Length: 406 # 100.0 1.1E-38 6.6E-42 228.8 24.9 238 43-330 1-240 (406) 52 protein:vir:1266 Length: 416 # 100.0 3.1E-39 1.9E-42 231.7 20.6 246 3-330 1-247 (416) 53 protein:vir:1326 Length: 457 # 100.0 1.2E-38 7.3E-42 228.5 22.9 259 2-330 1-263 (457) 54 protein:vir:4194 Length: 540 # 100.0 4.1E-38 2.5E-41 225.6 24.0 249 1-330 1-278 (540) 55 protein:vir:99452 Length: 651 100.0 3.8E-39 2.4E-42 231.2 17.9 274 11-330 1-353 (651) 56 protein:vir:80134 Length: 403 100.0 2.9E-38 1.8E-41 226.3 21.6 242 29-330 1-243 (403) 57 protein:vir:7407 Length: 392 # 100.0 5.2E-38 3.2E-41 225.0 21.3 245 32-330 1-248 (392) 58 protein:vir:8418 Length: 409 # 100.0 1.2E-37 7.4E-41 223.0 23.1 245 29-330 1-246 (409) 59 protein:vir:81218 Length: 423 100.0 2.9E-37 1.8E-40 220.9 24.4 256 29-330 1-264 (423) 60 protein:vir:1023 Length: 392 # 100.0 2.9E-37 1.8E-40 220.9 21.2 244 1-330 2-248 (392) 61 protein:vir:3989 Length: 392 # 100.0 2.9E-37 1.8E-40 220.9 21.2 244 1-330 2-248 (392) 62 protein:vir:79772 Length: 648 100.0 3.3E-36 2.1E-39 215.1 26.3 285 1-330 7-314 (648) 63 protein:vir:3843 Length: 397 # 100.0 8.4E-37 5.2E-40 218.4 23.0 235 29-330 1-238 (397) 64 protein:vir:4854 Length: 386 # 100.0 1.5E-36 9.4E-40 217.0 21.2 238 2-330 1-241 (386) 65 protein:vir:104259 Length: 403 100.0 2.4E-36 1.5E-39 215.9 21.5 243 29-330 1-244 (403) 66 protein:vir:93943 Length: 409 100.0 1.4E-35 8.4E-39 211.7 24.6 238 45-330 1-244 (409) 67 protein:vir:4828 Length: 382 # 100.0 5.3E-36 3.3E-39 214.0 21.9 235 29-330 1-238 (382) 68 protein:vir:100187 Length: 385 100.0 2.3E-35 1.4E-38 210.5 22.0 236 29-330 1-237 (385) 69 protein:vir:4952 Length: 386 # 100.0 4.5E-35 2.8E-38 208.9 22.8 238 29-330 1-241 (386) 70 protein:vir:4995 Length: 384 # 100.0 4.2E-35 2.6E-38 209.0 22.5 238 29-330 1-241 (384) 71 protein:vir:96980 Length: 409 100.0 7.9E-35 4.9E-38 207.6 23.9 244 24-330 1-244 (409) 72 protein:vir:2683 Length: 412 # 100.0 7.4E-35 4.6E-38 207.7 23.3 247 29-330 1-247 (412) 73 protein:vir:100882 Length: 383 100.0 7.5E-35 4.7E-38 207.7 23.3 237 29-330 1-237 (383) 74 protein:vir:94426 Length: 409 100.0 1.1E-34 6.9E-38 206.7 24.0 238 45-330 1-244 (409) 75 protein:vir:103971 Length: 376 100.0 2.8E-34 1.8E-37 204.5 25.4 237 23-330 1-275 (376) 76 protein:vir:79207 Length: 351 100.0 1.2E-34 7.5E-38 206.5 20.8 248 8-330 1-250 (351) 77 protein:vir:5691 Length: 344 # 100.0 3.9E-35 2.4E-38 209.2 17.9 232 11-330 1-249 (344) 78 protein:vir:6058 Length: 344 # 100.0 8E-35 5E-38 207.5 19.3 236 11-330 1-249 (344) 79 protein:vir:78191 Length: 351 100.0 2.9E-34 1.8E-37 204.4 21.0 248 8-330 1-250 (351) 80 protein:vir:100328 Length: 346 100.0 1.8E-34 1.1E-37 205.6 19.5 245 8-330 1-247 (346) 81 protein:vir:267 Length: 348 # 100.0 4.1E-34 2.5E-37 203.7 21.3 228 36-330 1-245 (348) 82 protein:vir:1150 Length: 350 # 100.0 1.4E-34 8.7E-38 206.2 18.6 251 8-330 1-253 (350) 83 protein:vir:98567 Length: 340 100.0 3.1E-34 1.9E-37 204.3 20.2 241 8-330 1-242 (340) 84 protein:vir:2013 Length: 344 # 100.0 1.4E-34 8.4E-38 206.3 17.6 235 11-330 1-249 (344) 85 protein:vir:3780 Length: 345 # 100.0 1.4E-33 8.5E-37 200.8 18.3 235 11-330 1-249 (345) 86 protein:vir:6210 Length: 394 # 100.0 7E-33 4.3E-36 196.9 21.9 233 29-330 1-233 (394) 87 protein:vir:78749 Length: 337 100.0 4.3E-33 2.7E-36 198.0 20.5 235 8-330 1-240 (337) 88 protein:vir:79150 Length: 368 100.0 9.1E-33 5.6E-36 196.2 21.2 252 8-330 1-262 (368) 89 protein:vir:3743 Length: 345 # 100.0 2.6E-32 1.6E-35 193.8 21.1 245 8-330 1-245 (345) 90 protein:vir:9359 Length: 348 # 100.0 1.9E-30 1.2E-33 183.6 20.6 183 108-330 1-183 (348) 91 protein:vir:78641 Length: 278 100.0 2.5E-30 1.5E-33 182.9 20.6 183 108-330 1-183 (278) 92 protein:vir:95965 Length: 385 99.9 1.9E-27 1.2E-30 167.1 19.9 224 29-330 1-225 (385) 93 protein:vir:9507 Length: 395 # 99.9 1.6E-26 1E-29 161.9 19.8 221 29-330 1-223 (395) 94 protein:vir:101289 Length: 395 99.9 1.6E-26 1E-29 161.9 19.8 221 29-330 1-223 (395) 95 protein:vir:100650 Length: 395 99.9 1.6E-26 1E-29 161.9 19.8 221 29-330 1-223 (395) 96 protein:vir:78310 Length: 376 99.9 1E-24 6.2E-28 152.2 19.1 223 29-330 1-224 (376) 97 protein:vir:94002 Length: 378 99.9 3.6E-25 2.2E-28 154.6 15.8 207 29-330 1-212 (378) 98 protein:vir:93867 Length: 378 99.9 1.6E-24 1E-27 151.0 17.1 204 29-330 1-212 (378) 99 protein:vir:1661 Length: 378 # 99.9 6.3E-24 3.9E-27 147.8 17.2 207 29-330 1-212 (378) 100 protein:vir:4089 Length: 395 # 99.8 1.9E-22 1.2E-25 139.7 18.2 227 29-330 1-230 (395) 101 protein:vir:98853 Length: 219 99.8 4.4E-23 2.8E-26 143.1 12.4 111 206-330 1-121 (219) 102 protein:vir:9641 Length: 395 # 99.8 6E-22 3.7E-25 136.9 16.4 225 29-330 1-234 (395) 103 protein:vir:858 Length: 378 # 99.8 1.3E-21 8.1E-25 135.1 15.9 207 29-330 1-212 (378) 104 protein:vir:98643 Length: 395 99.8 6.5E-21 4E-24 131.3 19.3 230 29-330 1-234 (395) 105 protein:vir:94869 Length: 378 99.8 1.3E-20 8.1E-24 129.6 15.4 207 29-330 1-212 (378) 106 protein:vir:5249 Length: 437 # 99.0 8.9E-11 5.5E-14 75.7 18.7 246 34-330 1-260 (437) 107 protein:vir:99853 Length: 488 98.9 1.8E-09 1.1E-12 68.6 21.7 249 31-330 1-252 (488) 108 protein:vir:79063 Length: 491 98.8 2.6E-09 1.6E-12 67.6 19.0 258 13-330 1-260 (491) 109 protein:vir:108215 Length: 469 98.8 7.6E-09 4.7E-12 65.1 21.1 257 40-330 1-293 (469) 110 protein:vir:107742 Length: 537 98.8 7.8E-09 4.8E-12 65.0 20.9 285 4-330 1-337 (537) 111 protein:vir:107880 Length: 491 98.8 1.2E-08 7.2E-12 64.1 21.3 258 13-330 1-260 (491) 112 protein:vir:103860 Length: 528 98.7 5.3E-08 3.3E-11 60.5 23.6 262 19-330 1-271 (528) 113 protein:vir:99232 Length: 526 98.7 1.1E-07 7.1E-11 58.7 24.0 262 19-330 1-271 (526) 114 protein:vir:94049 Length: 532 98.6 1.2E-07 7.5E-11 58.5 22.3 273 1-330 1-322 (532) 115 protein:vir:79511 Length: 448 98.6 1.2E-07 7.7E-11 58.4 21.5 273 8-330 1-289 (448) 116 protein:vir:1986 Length: 512 # 98.5 4E-07 2.5E-10 55.7 24.5 250 19-330 1-276 (512) 117 protein:vir:77981 Length: 448 98.5 5.1E-08 3.2E-11 60.6 17.7 274 8-330 1-284 (448) 118 protein:vir:99563 Length: 862 98.4 3.3E-07 2.1E-10 56.1 20.4 296 1-330 32-371 (862) 119 protein:vir:95254 Length: 488 98.4 5.7E-07 3.5E-10 54.8 19.9 264 43-330 1-297 (488) 120 protein:vir:79233 Length: 526 98.3 1.2E-06 7.6E-10 53.0 23.9 261 19-330 1-271 (526) 121 protein:vir:104338 Length: 422 98.2 1E-06 6.4E-10 53.4 18.4 241 22-330 1-260 (422) 122 protein:vir:107662 Length: 427 98.2 5.2E-07 3.2E-10 55.0 16.2 242 33-330 1-261 (427) 123 protein:vir:79538 Length: 502 98.2 2.3E-06 1.5E-09 51.4 19.5 275 22-330 1-308 (502) 124 protein:vir:80040 Length: 461 98.1 1.2E-06 7.7E-10 53.0 17.2 249 29-330 1-284 (461) 125 protein:vir:96068 Length: 765 97.9 5.9E-06 3.7E-09 49.3 16.8 290 1-330 4-343 (765) 126 protein:vir:79647 Length: 435 97.8 1.2E-05 7.2E-09 47.7 17.0 249 11-330 1-272 (435) 127 protein:vir:98816 Length: 446 97.5 4.9E-05 3E-08 44.2 18.6 256 25-330 1-303 (446) 128 protein:vir:95542 Length: 548 97.5 5.6E-05 3.5E-08 43.9 19.7 279 22-330 1-312 (548) 129 protein:vir:5839 Length: 533 # 96.5 0.00057 3.5E-07 38.4 18.2 278 1-330 1-302 (533) 130 protein:vir:10321 Length: 495 96.4 0.0007 4.4E-07 37.9 15.6 266 22-330 1-302 (495) 131 protein:vir:389 Length: 530 # 96.1 0.001 6.4E-07 36.9 19.0 272 43-330 1-332 (530) 132 protein:vir:6382 Length: 553 # 95.7 0.0016 1E-06 35.9 19.6 279 32-330 1-346 (553) 133 protein:vir:96738 Length: 505 95.7 0.0017 1E-06 35.8 20.6 278 11-330 1-318 (505) 134 protein:vir:78161 Length: 355 95.4 0.0022 1.4E-06 35.1 13.8 142 182-330 1-161 (355) 135 protein:vir:106716 Length: 698 95.2 0.0026 1.6E-06 34.8 14.6 290 1-330 43-362 (698) 136 protein:vir:3420 Length: 533 # 95.1 0.0028 1.8E-06 34.6 19.8 280 21-330 1-335 (533) 137 protein:vir:5665 Length: 511 # 94.9 0.0033 2.1E-06 34.2 18.1 287 5-330 1-321 (511) 138 protein:vir:78589 Length: 695 94.3 0.0049 3.1E-06 33.2 15.5 290 1-330 43-362 (695) 139 protein:vir:3648 Length: 695 # 94.2 0.0052 3.2E-06 33.1 15.0 290 1-330 43-362 (695) 140 protein:vir:98265 Length: 524 90.6 0.02 1.2E-05 29.9 21.1 296 1-330 3-333 (524) 141 protein:vir:101541 Length: 694 89.4 0.027 1.7E-05 29.2 16.6 290 1-330 41-361 (694) 142 protein:vir:99781 Length: 511 89.4 0.027 1.7E-05 29.2 17.6 273 1-330 10-336 (511) 143 protein:vir:78805 Length: 511 88.6 0.031 1.9E-05 28.8 18.5 273 1-330 10-336 (511) 144 protein:vir:96366 Length: 511 88.6 0.031 1.9E-05 28.8 18.5 273 1-330 10-336 (511) 145 protein:vir:96240 Length: 511 87.9 0.036 2.2E-05 28.5 19.2 275 1-330 10-336 (511) 146 protein:vir:9306 Length: 511 # 87.4 0.039 2.4E-05 28.3 19.4 276 1-330 10-336 (511) 147 protein:vir:2341 Length: 488 # 84.9 0.056 3.5E-05 27.4 19.0 260 23-330 1-342 (488) 148 protein:vir:103951 Length: 511 83.5 0.068 4.2E-05 27.0 20.0 276 1-330 10-336 (511) 149 protein:vir:733 Length: 453 # 83.4 0.069 4.3E-05 27.0 17.8 259 6-330 1-323 (453) 150 protein:vir:95806 Length: 440 82.5 0.077 4.8E-05 26.7 19.1 249 24-330 1-281 (440) 151 protein:vir:97171 Length: 512 82.0 0.081 5E-05 26.6 17.7 277 1-330 4-337 (512) 152 protein:vir:1785 Length: 555 # 80.6 0.093 5.8E-05 26.2 13.5 261 32-330 1-377 (555) 153 protein:vir:96839 Length: 474 80.0 0.099 6.2E-05 26.1 16.5 266 1-330 1-303 (474) 154 protein:vir:94709 Length: 522 79.4 0.1 6.5E-05 26.0 18.3 272 23-330 1-367 (522) 155 protein:vir:106639 Length: 481 78.1 0.12 7.3E-05 25.7 19.6 273 1-330 8-316 (481) 156 protein:vir:9871 Length: 429 # 73.0 0.18 0.00011 24.7 18.6 250 24-330 1-272 (429) 157 protein:vir:99916 Length: 504 72.9 0.18 0.00011 24.7 19.7 263 13-330 1-346 (504) 158 protein:vir:108049 Length: 524 70.5 0.21 0.00013 24.3 19.8 295 1-330 1-332 (524) 159 protein:vir:99522 Length: 470 69.9 0.22 0.00013 24.2 20.0 270 1-330 1-307 (470) 160 protein:vir:97447 Length: 474 66.4 0.27 0.00017 23.7 18.6 271 4-330 1-311 (474) 161 protein:vir:94498 Length: 474 66.4 0.27 0.00017 23.7 18.6 271 4-330 1-311 (474) 162 protein:vir:80680 Length: 441 66.0 0.27 0.00017 23.7 18.6 240 21-330 1-277 (441) 163 protein:vir:105292 Length: 478 62.3 0.34 0.00021 23.2 20.8 266 1-330 1-316 (478) 164 protein:vir:94742 Length: 409 61.3 0.36 0.00022 23.0 17.3 226 57-330 1-271 (409) 165 protein:vir:6596 Length: 521 # 60.9 0.36 0.00023 23.0 19.7 292 4-330 1-329 (521) 166 protein:vir:96494 Length: 501 60.7 0.37 0.00023 23.0 20.5 273 1-330 15-355 (501) 167 protein:vir:7768 Length: 484 # 60.7 0.37 0.00023 23.0 16.7 248 19-330 1-299 (484) 168 protein:vir:81017 Length: 521 59.7 0.39 0.00024 22.8 17.8 294 4-330 1-329 (521) 169 protein:vir:8184 Length: 474 # 59.2 0.4 0.00025 22.8 20.9 262 13-330 1-305 (474) 170 protein:vir:5961 Length: 503 # 53.6 0.53 0.00033 22.1 19.6 273 1-330 1-323 (503) 171 protein:vir:102668 Length: 547 53.3 0.53 0.00033 22.1 20.2 275 31-330 1-382 (547) 172 protein:vir:95315 Length: 559 53.1 0.54 0.00033 22.1 15.8 274 24-330 1-373 (559) 173 protein:vir:2732 Length: 501 # 52.7 0.55 0.00034 22.0 19.7 277 1-330 11-355 (501) 174 protein:vir:97336 Length: 492 52.7 0.55 0.00034 22.0 19.6 277 1-330 7-328 (492) 175 protein:vir:103330 Length: 517 50.7 0.6 0.00037 21.8 17.9 259 43-330 1-359 (517) 176 protein:vir:94101 Length: 474 47.9 0.69 0.00043 21.5 19.1 254 1-330 3-298 (474) 177 protein:vir:105889 Length: 474 47.9 0.69 0.00043 21.5 19.1 254 1-330 3-298 (474) 178 protein:vir:2198 Length: 536 # 47.3 0.71 0.00044 21.4 18.2 278 1-330 1-372 (536) 179 protein:vir:78696 Length: 542 46.2 0.74 0.00046 21.3 14.7 257 33-330 1-373 (542) 180 protein:vir:98444 Length: 434 45.9 0.75 0.00047 21.3 16.9 210 71-330 1-260 (434) 181 protein:vir:107112 Length: 478 45.6 0.76 0.00047 21.2 20.1 261 1-330 1-345 (478) 182 protein:vir:94805 Length: 492 44.5 0.8 0.0005 21.1 19.6 278 1-330 7-328 (492) 183 protein:vir:93747 Length: 472 42.7 0.87 0.00054 20.9 20.6 265 13-330 1-308 (472) 184 protein:vir:104082 Length: 485 42.0 0.9 0.00056 20.8 20.1 261 13-330 1-347 (485) 185 protein:vir:95899 Length: 474 41.4 0.93 0.00058 20.8 17.5 263 1-330 1-311 (474) 186 protein:vir:96266 Length: 474 41.4 0.93 0.00058 20.8 17.5 263 1-330 1-311 (474) 187 protein:vir:1236 Length: 483 # 39.7 1 0.00062 20.6 19.4 272 1-330 1-319 (483) 188 protein:vir:10447 Length: 536 39.0 1 0.00064 20.5 18.2 278 1-330 1-372 (536) 189 protein:vir:3964 Length: 453 # 38.9 1 0.00065 20.5 19.4 258 12-330 1-290 (453) 190 protein:vir:106282 Length: 521 38.2 1.1 0.00067 20.4 20.5 291 1-330 5-328 (521) 191 protein:vir:2427 Length: 485 # 37.2 1.1 0.0007 20.3 19.7 255 21-330 1-347 (485) 192 protein:vir:98506 Length: 555 36.7 1.2 0.00072 20.2 17.5 270 24-330 1-374 (555) 193 protein:vir:107404 Length: 555 36.7 1.2 0.00072 20.2 17.5 270 24-330 1-374 (555) 194 protein:vir:107822 Length: 555 36.7 1.2 0.00072 20.2 17.5 270 24-330 1-374 (555) 195 protein:vir:4223 Length: 486 # 35.5 1.2 0.00076 20.1 16.8 246 19-330 1-300 (486) 196 protein:vir:3609 Length: 452 # 34.6 1.3 0.00079 20.0 19.9 259 12-330 1-289 (452) 197 protein:vir:4898 Length: 502 # 34.2 1.3 0.00081 19.9 19.3 276 1-330 9-356 (502) 198 protein:vir:2500 Length: 501 # 31.6 1.5 0.00092 19.6 16.3 272 1-330 1-352 (501) 199 protein:vir:95113 Length: 474 29.9 1.6 0.001 19.4 21.0 264 1-330 1-311 (474) 200 protein:vir:1538 Length: 535 # 28.7 1.7 0.0011 19.3 18.4 274 11-330 1-372 (535) 201 protein:vir:96988 Length: 516 27.3 1.9 0.0012 19.1 18.9 258 1-330 16-369 (516) 202 protein:vir:94572 Length: 535 26.5 1.9 0.0012 19.0 15.4 272 23-330 1-372 (535) 203 protein:vir:105782 Length: 449 26.3 2 0.0012 19.0 17.4 259 13-330 1-290 (449) 204 protein:vir:6896 Length: 523 # 25.7 2 0.0013 18.9 17.8 293 1-330 5-332 (523) 205 protein:vir:96179 Length: 468 22.5 2.4 0.0015 18.5 18.4 273 1-330 1-316 (468) 206 protein:vir:94546 Length: 506 22.2 2.5 0.0015 18.4 18.0 277 1-330 2-336 (506) 207 protein:vir:100039 Length: 522 20.7 2.7 0.0017 18.2 12.4 260 50-330 1-359 (522) No 1 >protein:vir:95599 Length: 563 # NCBI annotation: ORF014 # Family: family:all:2446 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240900;genbank:gi:66394963;genbank:GeneID:5132540 Probab=100.00 E-value=5.4e-81 Score=460.62 Aligned_cols=330 Identities=80% Similarity=1.251 Sum_probs=316.2 Q ss_pred CchhHHHHHhcCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccc Q lcl|NC_019511. 1 MPDLFKSLRLGSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRN 80 (330) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~ 80 (330) |+|+||+||+++.+.-++...++|+||+++.++++++++....+-.+++.++++++|++|+...|.+.++++.+++.+++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~ 80 (563) T protein:vir:95 1 MADLFKQFRLGKDYGNNSTIAQVPIDEGLQANIKKIEQDNKEYQDLTKSLYGQQQAYAEPFIEMMDTNPEFRDKRSYMKN 80 (563) T ss_pred ChhhhhhhhcccccccccccceeeccCChhhhHhhhhccchhHHHHHhhhccCCCcchhhhHhhhcccccccccccCCCC Confidence 99999999999999999999999999999999999999988777788889999999999999999999999999998888 Q ss_pred hHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCC Q lcl|NC_019511. 81 AHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDID 160 (330) Q Consensus 81 ~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~ 160 (330) ..+..++|++|++|+++++||++++++||+||+++++++.++||.+++++++.++++++.+++..+.++|.+++.+++++ T Consensus 81 ~~~l~~~l~~~~~n~i~~~~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~ 160 (563) T protein:vir:95 81 EHNLHDVLKKFGNNPILNAIILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDVD 160 (563) T ss_pred cccHHHHHHHhhcchHHHHHHHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCCCCCC Confidence 88888999999999999999999999999999999999999999999999999999999999999999999999988888 Q ss_pred cCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHe Q lcl|NC_019511. 161 RDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSREL 240 (330) Q Consensus 161 ~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dv 240 (330) ++||++|+++++.++|++||||+|+++.|++.|+|++||||+|.+|++..+++|..+....+|+|+.+|+....|.++|| T Consensus 161 ~~t~~~f~~~lv~~lll~Gn~~~~~~~~rd~~G~~~~L~pl~p~~V~v~~~~~g~~~~~~~~y~~~~~g~~~~~~~~~ev 240 (563) T protein:vir:95 161 RDSFQTFCKKIVRDTYIYDQVNFEKVFNKNNKTKLEKFIAVDPSTIFYATDKKGKIIKGGKRFVQVVDKRVVASFTSREL 240 (563) T ss_pred cchHHHHHHHHHHHHHhcCCeEEEEEEEecCCCceEEEEEeCCceeEEEECCCCceeccceeEEEEeCCceeEEecCcce Confidence 89999999999999999999999999999999999999999999999999999998888889999999999999999999 Q ss_pred eeecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcc Q lcl|NC_019511. 241 VMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGIN 320 (330) Q Consensus 241 ih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~ 320 (330) +|+++||..+...++||+|||++|+.+|++++++++|+++||+||++|+|||.++++..+|++++++++++|++.++|.+ T Consensus 241 I~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~ng~~p~giL~~~~~~~ls~e~~~~~~~~~~~~~~G~~ 320 (563) T protein:vir:95 241 AMGIRNPRTELSSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRSDQQQSQHALENFKREWKSSLSGIN 320 (563) T ss_pred EEEeccCCCCcccCcccchHHHHHHHHHHHHHHHHHHHHHHHHccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhcccc Confidence 99999999888888899999999999999999999999999999999999999999888999999999999999999999 Q ss_pred cccccceeeC Q lcl|NC_019511. 321 GSWQICLYIK 330 (330) Q Consensus 321 na~kvpvL~e 330 (330) |+||+|+|++ T Consensus 321 nagk~~~vl~ 330 (563) T protein:vir:95 321 GSWQIPVVMA 330 (563) T ss_pred ccccceEEcC Confidence 9999988877 No 2 >protein:vir:99312 Length: 563 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024471;genbank:gi:48696430;genbank:GeneID:2948040 Probab=100.00 E-value=5.4e-81 Score=460.62 Aligned_cols=330 Identities=80% Similarity=1.251 Sum_probs=316.2 Q ss_pred CchhHHHHHhcCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccc Q lcl|NC_019511. 1 MPDLFKSLRLGSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRN 80 (330) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~ 80 (330) |+|+||+||+++.+.-++...++|+||+++.++++++++....+-.+++.++++++|++|+...|.+.++++.+++.+++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~ 80 (563) T protein:vir:99 1 MADLFKQFRLGKDYGNNSTIAQVPIDEGLQANIKKIEQDNKEYQDLTKSLYGQQQAYAEPFIEMMDTNPEFRDKRSYMKN 80 (563) T ss_pred ChhhhhhhhcccccccccccceeeccCChhhhHhhhhccchhHHHHHhhhccCCCcchhhhHhhhcccccccccccCCCC Confidence 99999999999999999999999999999999999999988777788889999999999999999999999999998888 Q ss_pred hHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCC Q lcl|NC_019511. 81 AHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDID 160 (330) Q Consensus 81 ~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~ 160 (330) ..+..++|++|++|+++++||++++++||+||+++++++.++||.+++++++.++++++.+++..+.++|.+++.+++++ T Consensus 81 ~~~l~~~l~~~~~n~i~~~~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~ 160 (563) T protein:vir:99 81 EHNLHDVLKKFGNNPILNAIILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDVD 160 (563) T ss_pred cccHHHHHHHhhcchHHHHHHHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCCCCCC Confidence 88888999999999999999999999999999999999999999999999999999999999999999999999988888 Q ss_pred cCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHe Q lcl|NC_019511. 161 RDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSREL 240 (330) Q Consensus 161 ~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dv 240 (330) ++||++|+++++.++|++||||+|+++.|++.|+|++||||+|.+|++..+++|..+....+|+|+.+|+....|.++|| T Consensus 161 ~~t~~~f~~~lv~~lll~Gn~~~~~~~~rd~~G~~~~L~pl~p~~V~v~~~~~g~~~~~~~~y~~~~~g~~~~~~~~~ev 240 (563) T protein:vir:99 161 RDSFQTFCKKIVRDTYIYDQVNFEKVFNKNNKTKLEKFIAVDPSTIFYATDKKGKIIKGGKRFVQVVDKRVVASFTSREL 240 (563) T ss_pred cchHHHHHHHHHHHHHhcCCeEEEEEEEecCCCceEEEEEeCCceeEEEECCCCceeccceeEEEEeCCceeEEecCcce Confidence 89999999999999999999999999999999999999999999999999999998888889999999999999999999 Q ss_pred eeecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcc Q lcl|NC_019511. 241 VMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGIN 320 (330) Q Consensus 241 ih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~ 320 (330) +|+++||..+...++||+|||++|+.+|++++++++|+++||+||++|+|||.++++..+|++++++++++|++.++|.+ T Consensus 241 I~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~ng~~p~giL~~~~~~~ls~e~~~~~~~~~~~~~~G~~ 320 (563) T protein:vir:99 241 AMGIRNPRTELSSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRSDQQQSQHALENFKREWKSSLSGIN 320 (563) T ss_pred EEEeccCCCCcccCcccchHHHHHHHHHHHHHHHHHHHHHHHHccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhcccc Confidence 99999999888888899999999999999999999999999999999999999999888999999999999999999999 Q ss_pred cccccceeeC Q lcl|NC_019511. 321 GSWQICLYIK 330 (330) Q Consensus 321 na~kvpvL~e 330 (330) |+||+|+|++ T Consensus 321 nagk~~~vl~ 330 (563) T protein:vir:99 321 GSWQIPVVMA 330 (563) T ss_pred ccccceEEcC Confidence 9999988877 No 3 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=100.00 E-value=6.2e-78 Score=443.85 Aligned_cols=325 Identities=76% Similarity=1.207 Sum_probs=306.3 Q ss_pred CchhHHHHHhcCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccc Q lcl|NC_019511. 1 MPDLFKSLRLGSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRN 80 (330) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~ 80 (330) |+|+|+.+|+ -++++..+.+++.+|+++.+++++++. ....+++++|++++|.+|+...++..++++.+|+++.+ T Consensus 5 ~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~a~~~p~~~~~~~~~~~~~~p~~~~~ 79 (576) T protein:vir:96 5 LADIFKRLRL--GRDYEDIIDTVPIDDGLQANIRNIEEK---SKELNKSLYGKQQAYAEPFLEVMDTNPEFRTKRSYMKN 79 (576) T ss_pred HHHHHHHHhc--cCccccchhhhhcccChhHHHHHhhhh---hhhhccccCCccchhhcceeeeeecCCCccccCcchhh Confidence 9999999998 455777888999999999999999985 22347779999999999999999999999999999999 Q ss_pred hHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCC Q lcl|NC_019511. 81 AHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDID 160 (330) Q Consensus 81 ~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~ 160 (330) .+...++|+.|++||+|++||++|+++||+||++++++..++||.|++++++.++++++.++++.+.+||++++.+++++ T Consensus 80 ~~~~~~~l~~~~~npiv~~~I~~ia~~vA~~~~~~~~~~~~~~~~i~lk~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~ 159 (576) T protein:vir:96 80 SDNLHDVLKQFGNNPILNAIILTRSNQVAMYCQPSRYNERGLGFEVRMRDLDAEPGKKEKEEIKRIENFILNTGRDKDID 159 (576) T ss_pred hhhhHHHHHHhhcCHHHHHHHHHHHHHHHhhhhhhhhccccccceeEEecCcCccchhhhHhhhhHHhhHhhccCCCCCc Confidence 89999999999999999999999999999999999999999999999999999999999999999999999999888777 Q ss_pred cCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHe Q lcl|NC_019511. 161 RDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSREL 240 (330) Q Consensus 161 ~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dv 240 (330) ++|+.+|+++++.|+|++||+|+|+++++++.|++++||||+|.+|++..+++|+......+|++..+++....|.++|| T Consensus 160 ~~t~~~f~~~lv~dlll~Gna~~~i~~~rd~~g~~~~L~pl~p~~V~v~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~di 239 (576) T protein:vir:96 160 RDSFQSFCRKIVRDTYTYDQVNFEKVFNKKNATTMDKFIAVDPSTIFYATDKNGKIIKGGKRFVQVINKKVVASFTSREM 239 (576) T ss_pred cccHHHHHHHHHHHHHhcCCeEEEEEEecCCCCceEEEEEeCCceeEEEECCCCceeeeeeEEEEecCCceEEEecccce Confidence 89999999999999999999999999999999999999999999999999999988877788999999999999999999 Q ss_pred eeecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcc Q lcl|NC_019511. 241 VMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGIN 320 (330) Q Consensus 241 ih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~ 320 (330) +|+++++..|...++||+|||++|+.+|++++++++|+++||+||++|+|||+++++.++|++++++|+++|++.++|.+ T Consensus 240 i~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~~~~~ls~e~~~~lr~~~~~~~~G~~ 319 (576) T protein:vir:96 240 AMGIRNPRTELSSSGYGLSEVEIAMKQFIAYNNTETFNDRFFSHGGTTRGILQIKSEQQQSQRALENFKREWKSSFSGIN 319 (576) T ss_pred EEEeecCCCCcccCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhcccc Confidence 99999999988888999999999999999999999999999999999999999999888999999999999999999999 Q ss_pred cccccceeeC Q lcl|NC_019511. 321 GSWQICLYIK 330 (330) Q Consensus 321 na~kvpvL~e 330 (330) |+|++|+|++ T Consensus 320 nag~~p~vl~ 329 (576) T protein:vir:96 320 GSWQVPVVMA 329 (576) T ss_pred ccccceeecC Confidence 9999988877 No 4 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=100.00 E-value=2.5e-75 Score=429.60 Aligned_cols=320 Identities=60% Similarity=0.958 Sum_probs=295.9 Q ss_pred chhHHHHHhcCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccch Q lcl|NC_019511. 2 PDLFKSLRLGSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNA 81 (330) Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~ 81 (330) -++|++|++..+.++...+ ..++++..++.++.++...+ ++++++++++|.+|.++.+...++++.+|++ ++. T Consensus 1 ~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~-----~k~~~~~~~~~~~~~~~~~~~~~g~~~~~~~-~~~ 73 (547) T protein:vir:63 1 MGLFESIRLAGVNKSDAVK-HIEVDDNYSIAIQQREQEQI-----SKAMNNKEVAYSQPVIGSMSANPGFKTKPSI-RNN 73 (547) T ss_pred CchhhhhhhhcCCcccccc-ccccccccchhhhhhhHHHH-----HHhhcccchhhhchhhheeecccccccCCcc-CCh Confidence 7999999887776555444 66777888888888888833 4557789999999999999999999999887 688 Q ss_pred HHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCc Q lcl|NC_019511. 82 HNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDR 161 (330) Q Consensus 82 ~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~ 161 (330) .+..++|+.|++||+|++||++++++||+||.+++++..++||++++++++.+.++++.++++.+++||++|+++.++++ T Consensus 74 ~~l~~l~~~~~~npiv~~~I~~~a~~ia~~~~~~~~~~~~~~~~ir~k~~~~~~~~~~~~~~~~l~~~l~~pn~~~~p~~ 153 (547) T protein:vir:63 74 QDLHGVLKKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDINR 153 (547) T ss_pred hHHHHHHHHhhcCHHHHHHHHHHHHHHhhhhhhhhhhccCCCceeEecccccccChhhHHHHHHHHHHHHhhCCCCCCcc Confidence 88888999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHee Q lcl|NC_019511. 162 DSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELV 241 (330) Q Consensus 162 ~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvi 241 (330) +|+++|+++++.|+|++||+|+++++ +..|+|++||||+|.+|++..+++|+..+.+.+|+|+.+++....|+++||| T Consensus 154 ~s~~~f~~~lv~d~ll~Gn~~~~i~r--d~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~eii 231 (547) T protein:vir:63 154 DSFSSFVKKIVRDTYMYDQVNFEKVF--NRNQSMVRFVAKDPTTIFFATTADGKIPDNGNRFVQVIDQKIVATFNAREMA 231 (547) T ss_pred chHHHHHHHHHHHHHhhCCEEEEEEE--CCCCcEEEEEEecCceeEEEECCccccccCceEEEEEcCCcEEEEeccccEE Confidence 99999999999999999999999875 5568999999999999999999999888888899999999999999999999 Q ss_pred eecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCccc Q lcl|NC_019511. 242 MGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGING 321 (330) Q Consensus 242 h~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~n 321 (330) |+++||+++...++||+|||+.|+.+|++++++++|+.+||+||++|+|||+++++.++|++++++||++|++.++|.+| T Consensus 232 h~r~n~~~~~~~~~~G~Spi~~~~~~i~~~~~a~~~~~~~f~Ng~~p~giL~~~~~~~ls~e~~~~lk~~~~~~~~G~~n 311 (547) T protein:vir:63 232 FAVRNPRSDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQQSQHALEIFKREWKNSLSGING 311 (547) T ss_pred EecccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEecCCCCCCHHHHHHHHHHHHHHhcCccc Confidence 99999999988889999999999999999999999999999999999999999988889999999999999999999999 Q ss_pred ccccceeeC Q lcl|NC_019511. 322 SWQICLYIK 330 (330) Q Consensus 322 a~kvpvL~e 330 (330) +|++|||++ T Consensus 312 agk~~vl~~ 320 (547) T protein:vir:63 312 SWQIPVVSA 320 (547) T ss_pred ccccccccC Confidence 999999986 No 5 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=100.00 E-value=9.5e-75 Score=426.39 Aligned_cols=321 Identities=60% Similarity=0.950 Sum_probs=296.3 Q ss_pred CchhHHHHHhcCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccc Q lcl|NC_019511. 1 MPDLFKSLRLGSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRN 80 (330) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~ 80 (330) --.+||+||+-++.+++... .++.++...++++.++++.+ ++++.|++++|+.|....|...++++.+|++ +. T Consensus 4 ~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~-----~k~~~~~~~a~~~~~~~~~~~~~~~~~r~~~-~~ 76 (551) T protein:vir:80 4 KLGLFESIRLVGVNKSDAVK-HIEVDDNYSIAIQQREQEQI-----SKAMNNKEVAYSQPVIGSMSANPGFKTKPSI-RN 76 (551) T ss_pred hhhhHHHhhhccCChhhccc-ccccccceeeecccccHHHH-----HHhhccCcceeecccccceecCcccccCccc-cC Confidence 34689999988887777665 77788888899999999954 5557799999999999889889999988876 46 Q ss_pred hHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCC Q lcl|NC_019511. 81 AHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDID 160 (330) Q Consensus 81 ~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~ 160 (330) ..+..++|+.|++||+|++||++++++||+||++++....++||.|++++.+.++++++.++++.+++||++|+++.+++ T Consensus 77 ~~~l~~~~~~~~~npiv~~~I~~ia~~IA~~~~~~~~~~~g~~~~i~~kd~~~~~~~~~~~~~~~i~~~l~~pn~~~~p~ 156 (551) T protein:vir:80 77 NQDLHGVLKKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDIN 156 (551) T ss_pred hhHHHHHHHHhhcCHHHHHHHHHHHHHHhhhhhhhhhhcCCCCceEEecccCcccChhHHHHHHHHHHHHHhcCCCCCCc Confidence 67778899999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHe Q lcl|NC_019511. 161 RDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSREL 240 (330) Q Consensus 161 ~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dv 240 (330) ++|+++|+++++.|+|++||+|+++++ +..|+|++||||+|.+|++..+++|+..+.+.+|+|+.+|+....|+++|| T Consensus 157 ~~s~~~f~~~lv~dlll~Gnay~~i~r--d~~G~~~~L~~l~p~~V~v~~~~~g~~~~~~~~y~~~~~g~~~~~~~~~ei 234 (551) T protein:vir:80 157 RDSFSSFVKKIVRDTYMYDQVNFEKVF--NRNQSMVRFVAKDPTTIFFATTADGKIPDNGNRFVQVIDQKIVATFNAREM 234 (551) T ss_pred cchHHHHHHHHHHHHHhcCCEEEEEEE--CCCCcEEEEEEeCCceeEEEECCccccccCceEEEEEeCCcEEEEEcccce Confidence 999999999999999999999998875 556899999999999999999999988888889999999998889999999 Q ss_pred eeecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcc Q lcl|NC_019511. 241 VMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGIN 320 (330) Q Consensus 241 ih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~ 320 (330) ||+++||+++...++||+|||++|+.+|++++++++|+.+||+||++|+|||+++++.++|++++++||++|++.|+|.+ T Consensus 235 iH~~~n~~~~~~~~~~G~spi~~a~~~i~~~~a~~~~~~~~f~Ng~~p~giL~~~~~~~lt~e~~~~lk~~~~~~~~G~~ 314 (551) T protein:vir:80 235 AFAVRNPRSDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQQSQHALEIFKREWKNSLSGIN 314 (551) T ss_pred EEecccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEEcCCCCCCHHHHHHHHHHHHHHhcCcc Confidence 99999999988888899999999999999999999999999999999999999998888999999999999999999999 Q ss_pred cccccceeeC Q lcl|NC_019511. 321 GSWQICLYIK 330 (330) Q Consensus 321 na~kvpvL~e 330 (330) |+|++|||++ T Consensus 315 nag~~~vl~~ 324 (551) T protein:vir:80 315 GSWQIPVVSA 324 (551) T ss_pred ccCccccccC Confidence 9999999986 No 6 >protein:vir:80796 Length: 574 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504121;genbank:gi:158079308;genbank:GeneID:5666445 Probab=100.00 E-value=6.7e-67 Score=383.39 Aligned_cols=322 Identities=53% Similarity=0.871 Sum_probs=275.7 Q ss_pred hHHHH-HhcCCCCCCcccccCccC-----cchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCc Q lcl|NC_019511. 4 LFKSL-RLGSMYKEDTEDLMVPID-----DGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSY 77 (330) Q Consensus 4 ~~~~~-~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~ 77 (330) .-||| ++++..++...+-....+ ..++.++. ..+....++..++++++.+++.+++...+...++++..|++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (574) T protein:vir:80 1 MPKWLDKALGIEKSSIEETRNMENYKMHLREIDTNVV--NNEPYSMESIEKGMNGKTTAYMQPIIGEMSVNPGYKTKPSI 78 (574) T ss_pred CcchhhhhhccchhhHHHHHhhhhhccccchhhhhhh--hccCCCHHHHHHhHhhhcccccchhhhhccccccccCcCcc Confidence 33566 555554444333222222 12222222 22333334456678889999999999888888888888775 Q ss_pred ccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCC Q lcl|NC_019511. 78 MRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDK 157 (330) Q Consensus 78 ~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~ 157 (330) ++..+..++|+.|+.|+++++||++++++|+++|+++..+..+++|.|+.+|.+.+++.++.++...+..||.++.... T Consensus 79 -~~~~~~~~~l~~~~~~~iv~~~i~~~~~~V~~~~~~i~~~ia~lp~~i~~kd~~~~~~~~~~~~~~~l~~ll~~~~~~~ 157 (574) T protein:vir:80 79 -RNSQDLHKTLKKFGNNIILNAIINTRSNQVSMYCKPARNSETGVGYEIRLKDIEAEPTSHDIANIKRIESFLENTAQFR 157 (574) T ss_pred -CCcccHHHHHHhhccChhHHHHHHHHHHHHHHHHHHHHhhhccCceEEEEeccCCCccchhhhhhhHHHHHHhccCCCC Confidence 6777788999999999999999999999999999999999999999999999999999999999999999999887776 Q ss_pred CCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEech Q lcl|NC_019511. 158 DIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTS 237 (330) Q Consensus 158 pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~ 237 (330) .++++++.+|+++++.++|++||+|+++++ +..|+|++||||+|.+|++..+++|+....+.+|+|+.+|+....|++ T Consensus 158 nP~~~s~~ef~~~lv~~lll~Gnayi~i~r--~~~G~~~~L~pl~p~~V~v~~d~~~~~~~~~~~y~~~~~g~~~~~~~~ 235 (574) T protein:vir:80 158 DPNRDNFTTFCKKLVRATYMYDQVNFEKVF--DKDGNFIKFDTVDPTTIFLATNGEGKLIKNGERFVQVIDNRIVAKFNE 235 (574) T ss_pred CCccccHHHHHHHHHHHHHhcCCeEEEEEE--CCCCcEEEEEEEcCceeEEEEcCccccccCceEEEEEeCCceEEEEcc Confidence 556689999999999999999999999875 456899999999999999999999988888889999999999999999 Q ss_pred hHeeeecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhc Q lcl|NC_019511. 238 RELVMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFS 317 (330) Q Consensus 238 ~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~ 317 (330) +||||+++||.++...++||+|||++|+.+|++++++++|+.+||+||++|+|||.++++..+|++++++|+++|++.|+ T Consensus 236 ~eiih~~~~~~~~~~~~~~G~spi~~a~~~i~~~~~a~~~~~~~f~ng~~p~gil~~~~~~~ls~e~~~~lk~~~~~~~~ 315 (574) T protein:vir:80 236 RELAFAVRNPRADIEVGQYGYPELEIALKQFIAHENTEVFNDRFFSHGGTTRGILHVKTGQQQSQQALDIFRREWRSSLA 315 (574) T ss_pred ccEEEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhc Confidence 99999999999988888899999999999999999999999999999999999999998888999999999999999999 Q ss_pred CcccccccceeeC Q lcl|NC_019511. 318 GINGSWQICLYIK 330 (330) Q Consensus 318 G~~na~kvpvL~e 330 (330) |.+|+|++|||++ T Consensus 316 G~~n~g~~~vl~~ 328 (574) T protein:vir:80 316 GINGSWQIPVVSA 328 (574) T ss_pred cccccccceeecC Confidence 9999999999986 No 7 >protein:vir:100691 Length: 535 # NCBI annotation: hypothetical protein # Family: family:all:2446 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164747;genbank:gi:56693160;genbank:GeneID:3197324 Probab=100.00 E-value=1.6e-57 Score=332.00 Aligned_cols=307 Identities=32% Similarity=0.479 Sum_probs=238.5 Q ss_pred CchhHHHHH-hcCC-CCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCC-CCCcCCCc Q lcl|NC_019511. 1 MPDLFKSLR-LGSM-YKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNP-DYRDKKSY 77 (330) Q Consensus 1 ~~~~~~~~~-~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p-~~~~~~s~ 77 (330) |+ +.|.|| +-+. .+|+. --+.+.+.+++-++..--+...+ .+. ....+..+.++| +|+.++++ T Consensus 1 ~~-~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~~-~~~--~~~~~~~~~~~~~g~~~~~~~ 66 (535) T protein:vir:10 1 MA-ILKDLRNAFSLSNKKST----------SYIELGDYDKDIVNKAIRPGRAS-ARD--TVDGIDIADGNVAGQYSVASI 66 (535) T ss_pred Ch-hhHHHHHHHHhhhhhhh----------hhHHHhhhhHHHHHhhhhhhhhh-hhc--cccccccccCCcccccccCcc Confidence 22 223331 1111 11221 12456777777554432222111 111 113444566777 47777775 Q ss_pred ccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCC Q lcl|NC_019511. 78 MRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDK 157 (330) Q Consensus 78 ~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~ 157 (330) ++.....+.|+.|..||++++||++++++||.||++.+++....+|.+++++.+.+.++++.++...+.++|... T Consensus 67 -~~~~~~~~l~~~~~~~~~~~~~i~t~~~~va~~~~i~~~s~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~lL~~~---- 141 (535) T protein:vir:10 67 -SDVLSTKKLLKAYADNDIVQAIIRTRTNQVLTYSNPSRYNRNGVGFKVELKDATKVMSKAQIKRAHEIEDFIYNT---- 141 (535) T ss_pred -ccccCHHHHHHHhccChhHHHHHHHHHHHHHHHHHHHHHhcccCcceeEEEeccCCCcchhhhhhhHHHHHHHhC---- Confidence 555555677888889999999999999999999999999999999999999999999988888777776666544 Q ss_pred CCCcC----CHHHHHHHHHHHHHhcCCc-eeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceE Q lcl|NC_019511. 158 DIDRD----SFQEFCKKIVRDTYTYDQV-NFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVV 232 (330) Q Consensus 158 pn~~~----s~~~fl~~~v~d~L~~g~g-~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~ 232 (330) ||+++ +|.+|+++++.|+|++|++ |+++ .++..|+|++||||+|.+|++..+.+|+ .....|+++..++.. T Consensus 142 PN~~~~~~~~~~~~~~~lv~d~l~~~g~ay~~i--~r~~~G~~~~L~~l~p~~V~v~~d~~~~--~~~~~~~~~~~~~~~ 217 (535) T protein:vir:10 142 GSEYYEWRDTFPRLLTKIINDMYVQDQINIERI--FKNDSNELDHFNAVDASKVVISYSPRSK--DQPRKFEQFVSETKS 217 (535) T ss_pred CCCCCChhHHHHHHHHHHHHHHHhhCCceEEEE--EECCCCcEEEEEEeCCceeEEEEcCccc--cCceEEEEEecCcee Confidence 66555 4557899999999998853 4444 4677899999999999999999887765 334567777788888 Q ss_pred EEechhHeeeecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCC--CCCCHHHHHHHHH Q lcl|NC_019511. 233 ASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRAD--QQQSQHALENFKR 310 (330) Q Consensus 233 ~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~--~~ls~e~~e~lr~ 310 (330) .+|+++||||+++||.++...++||+|||++|+++|++++++++|+.+||+||++|+|||.++++ ..++++++++|++ T Consensus 218 ~~~~~~eiih~~~~~~~~~~~~~~G~Spi~~~~~~i~~~~aa~~~~~~~f~ng~~p~giL~~~~~~~~~ls~e~~e~lk~ 297 (535) T protein:vir:10 218 VKFSERNLTFINYWNLSDTDRRGYGYSPVEASIPLIRAIYDTEQFNARFFSQGGTTRGILVIDQDGDAQANQMMLAGIRR 297 (535) T ss_pred EEECcccEEEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEecCCCCcccCHHHHHHHHH Confidence 89999999999999999888888999999999999999999999999999999999999999863 5699999999999 Q ss_pred HHHHHhcCcccccccceeeC Q lcl|NC_019511. 311 EWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 311 ~w~~~~~G~~na~kvpvL~e 330 (330) +|++.|+|.+|+|++|||+. T Consensus 298 ~~~~~~~G~~nag~~~vl~~ 317 (535) T protein:vir:10 298 QWTSQGSGLGGAWKIPILAA 317 (535) T ss_pred HHHHHhcCcccccccccccC Confidence 99999999999999999984 No 8 >protein:vir:100249 Length: 431 # NCBI annotation: gp78 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355414;genbank:gi:77864704;genbank:GeneID:3725971 Probab=100.00 E-value=5.7e-44 Score=257.68 Aligned_cols=270 Identities=13% Similarity=0.076 Sum_probs=182.3 Q ss_pred CchhHHHHHhcCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccch-hccccccccccCCCCCcCCCccc Q lcl|NC_019511. 1 MPDLFKSLRLGSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQA-YAEPFLEMMDTNPDYRDKKSYMR 79 (330) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~-~~~~~~~~~~~~p~~~~~~s~~r 79 (330) |- +|.+||..+..++.... + ++ -..+...+..+..|+.+. +.+|.......+ +. +. T Consensus 1 Mg-l~d~~r~~~~~~~~~~~-----------~---~~-~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~-~~------~~ 57 (431) T protein:vir:10 1 MG-LFDFIRREKQPEAQARP-----------H---VE-PSFQASTPTTSIPGETFEGLDDPRLKEYIRR-GE------LN 57 (431) T ss_pred Cc-chhhhhcCccccccccc-----------c---cc-cccccccccccccccccccccchHHHHhhcc-Cc------cC Confidence 43 45555422222111111 0 00 001111222222333322 122211110000 00 01 Q ss_pred chHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCC Q lcl|NC_019511. 80 NAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDI 159 (330) Q Consensus 80 ~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn 159 (330) +..... ....+++.|++||++|+++||+.. +.+.-++... -....|.+.+++..+|| T Consensus 58 g~~v~~---~~al~~~~V~~ci~~Ia~~iA~lp-----------~~v~~~~~~~---------~~~~~~~~~~lL~~~PN 114 (431) T protein:vir:10 58 GGTGRE---TRALRNMAVLRCVTLISGTIGMLP-----------MNLISSDDSK---------QVLTDDPAHRLLKYKPN 114 (431) T ss_pred cceech---hhhhccHHHHHHHHHHHHhhccCc-----------eEEEEecCce---------eeeccchHHHHHhhccC Confidence 111111 112238999999999999999642 2221111111 01223556666777899 Q ss_pred CcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhH Q lcl|NC_019511. 160 DRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRE 239 (330) Q Consensus 160 ~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~d 239 (330) +++|.++|++.++.++|+.||+|+++++ ++ |++++|+|++|.+|++..+.+|. +.|++...+|....|+++| T Consensus 115 ~~~t~~~f~~~l~~~lll~Gna~~~i~r--~~-g~~~~L~pl~~~~v~~~~~~~~~-----~~y~~~~~~g~~~~~~~~d 186 (431) T protein:vir:10 115 DWQTPMEFKSLMQLRALLDGESMARIVW--SG-NRPIRLIPMDRGSAKGRLTSTWQ-----IVYDYTTPTGDKIELPARE 186 (431) T ss_pred CCCCHHHHHHHHHHHHhhcCCeEEEEEE--cC-CceEEEEEEcCceeEEEEcCCCe-----EEEEEEeCCceEEEEchhh Confidence 9999999999999999999999999875 43 78999999999999998877664 3577777777788899999 Q ss_pred eeeecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCc Q lcl|NC_019511. 240 LVMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGI 319 (330) Q Consensus 240 vih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~ 319 (330) |+|++.++.+ +.+|+|||++|+++|++++++++|+++||+||++|+|||.+++ .+++|+++++|+.|++.++|. T Consensus 187 ViHir~~~~d----g~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~--~ls~e~~~~~~~~~~~~~~g~ 260 (431) T protein:vir:10 187 VFHLRDLSID----GVSGVSRVKLSGNALELAEQAERAASRTFRTGVMAGGAIEVPK--ELSDNAYGRMKASVQENHTGS 260 (431) T ss_pred EEEecCcCCC----CcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEecCC--CCCHHHHHHHHHHHHHHhcCc Confidence 9999754432 2369999999999999999999999999999999999999875 599999999999999999999 Q ss_pred ccccccceeeC Q lcl|NC_019511. 320 NGSWQICLYIK 330 (330) Q Consensus 320 ~na~kvpvL~e 330 (330) +|+|+++||-+ T Consensus 261 ~n~g~~~vl~~ 271 (431) T protein:vir:10 261 ENAGSWMLLEE 271 (431) T ss_pred cccCCceecCC Confidence 99999887766 No 9 >protein:vir:94666 Length: 723 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579205;genbank:gi:93007441;genbank:GeneID:5076785 Probab=100.00 E-value=1.3e-42 Score=250.23 Aligned_cols=237 Identities=10% Similarity=0.009 Sum_probs=173.4 Q ss_pred ccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHH Q lcl|NC_019511. 62 LEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKE 141 (330) Q Consensus 62 ~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~ 141 (330) +.+...-+++...+++....... -+.+.+++.|++||++++++||+. .|.+.-++.. . T Consensus 1 ~~~~~~~~g~~~~~~~~~~~~~~---~~~~~~~~~V~acV~~Ia~~iA~l-----------pl~l~~~~~~--~------ 58 (723) T protein:vir:94 1 MTTFPSGAGGWNAWSADSVFGNG---AKGWSNSAVAYRCISMLANNAASV-----------DLVVRGPDGE--L------ 58 (723) T ss_pred CcccccCCCcccccccccccccc---HHHHhhhHHHHHHHHHHHHhhccc-----------eeEEEcCCCc--c------ Confidence 11111122222333332222111 234556899999999999999953 3333211111 1 Q ss_pred HHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEe-cCCCcceEEEEeeCCCceEEeeCCCCcccC-- Q lcl|NC_019511. 142 QMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFS-PKNKTKMEKFIAVDPSTIFYATDKNGKIIK-- 218 (330) Q Consensus 142 ~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~-rd~~G~~~~L~pldp~tV~~~~d~~G~~~~-- 218 (330) ...|.++.++..+||+++|.++|+++++.++++.||+|++++++ ++..|.|++|+|++|..+.+....++.... T Consensus 59 ---~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~r~~~g~p~~l~~l~~~~~~v~~~~~~~~~~~~ 135 (723) T protein:vir:94 59 ---DELHPLSQLWNVMPNRAMPAQVLKALSMTRLQLDGQCHLWLNYNGRTPAGVPDEIWYVYDRVTTIVATRAADAVPQA 135 (723) T ss_pred ---chhhHHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCccccceeEEEEecCcceEEeecCCCccceee Confidence 01245666667789999999999999999999999999998865 345689999999999877776655543222 Q ss_pred CceeEEEEeCCceEEEechhHeeeeccc-CcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCC Q lcl|NC_019511. 219 GGNRFVQVIDKQVVASFTSRELVMGIRN-PRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRAD 297 (330) Q Consensus 219 ~~~~Y~q~~~~~~~~~~~~~dvih~~~n-~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~ 297 (330) ....|.+...+|....|+++||||++.+ |.+ +.||+|||++++.+|++++++++|+.+||+||++|+|||+.+ T Consensus 136 ~~~~y~~~~~~G~~~~~~~~dIiHir~~~~~d----g~~G~Spi~~a~~~i~~~~aa~~~~~~~f~NG~~p~giL~~~-- 209 (723) T protein:vir:94 136 QIIGYVIERTDGVRVPVLADEMLWLRFSDPYD----PLAVMAPWKAARAAVDADFYAATWQRQSFKNGARPGGVVNLG-- 209 (723) T ss_pred eeeEEEEEecCceeEEecccceEEecCCCCCC----CcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEcC-- Confidence 2334655556777788999999999854 333 336999999999999999999999999999999999999864 Q ss_pred CCCCHHHHHHHHHHHHHHhcCcccccccceee-------------C Q lcl|NC_019511. 298 QQQSQHALENFKREWKSSFSGINGSWQICLYI-------------K 330 (330) Q Consensus 298 ~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~-------------e 330 (330) ++++++.+++++.|++.++|..|+||++||. + T Consensus 210 -~l~~e~~~~~~~~~~~~~~G~~Nagk~~vL~g~~~~~~vl~~G~~ 254 (723) T protein:vir:94 210 -DMDEQTFTKTVAAFRSQVEGVQNAGRHLLIAGQGSDGGAAGKGAT 254 (723) T ss_pred -CCCHHHHHHHHHHHHHHhhchhhcCcceeecccccccccccCCce Confidence 4899999999999999999999999966663 1 No 10 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=100.00 E-value=1.5e-42 Score=249.90 Aligned_cols=258 Identities=11% Similarity=0.073 Sum_probs=174.3 Q ss_pred cCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHH Q lcl|NC_019511. 25 IDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITR 104 (330) Q Consensus 25 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~ 104 (330) -++..+-.+..+++.+.+.-..+.+..+ ..+.+. |....+. .+.....+ ...++|.|++||++| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~g~~~s~~~--~~~~~~----------~~~~~~~-~g~~v~~~---~al~~~~v~~ci~~I 64 (437) T protein:vir:10 1 MKQGKQRALGRIKSSFLKWLGVPISLTD--GSFWSA----------WGGMGSS-SGETVTAD---SALQLSAVWSCVRLI 64 (437) T ss_pred CCcchhhhhhhhHHhhhhhcCCcccCCc--hhHHHh----------hcccccC-CCceechH---hhhccHHHHHHHHHH Confidence 0001111122222222211111111100 001000 1000110 01001111 222489999999999 Q ss_pred HHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeE Q lcl|NC_019511. 105 ANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFE 184 (330) Q Consensus 105 ~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~ 184 (330) +++||+...... ....-|-+. ....|.+..++...||+++|.++|++.++.++|+.||+|++ T Consensus 65 a~~ia~lp~~~~-~~~~~g~~~-----------------~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~ 126 (437) T protein:vir:10 65 AETIATLPLNLY-QTKPDGTRV-----------------LAKQHRLYTVIHSQPNAENTAAEFWEVIVASMLLWGNGYAR 126 (437) T ss_pred HHHHhhCceeEE-EEcCCCcee-----------------eccccHHHHHhhccCCcCCCHHHHHHHHHHHHhhcCCeEEE Confidence 999997432211 111111111 11234555666778999999999999999999999999988 Q ss_pred EEEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecccCcCCCCCCCccccHHHHH Q lcl|NC_019511. 185 KVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIA 264 (330) Q Consensus 185 ~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a 264 (330) ++ |+ .|++++|||++|..|++..+.+|. +.|++...++....|+++||+|++.++.+ +.||+|||+++ T Consensus 127 i~--r~-~g~~~~L~~l~p~~v~i~~~~~g~-----~~y~~~~~~g~~~~~~~~dIih~r~~~~d----~~~G~spi~~~ 194 (437) T protein:vir:10 127 KL--RS-AGVLIGLELMLPQRTTVKRLTSGA-----LQYTYRNVDGTVSTLAEDDVFHVRGFSLD----GLMGLTPIQYA 194 (437) T ss_pred EE--ec-CCcEEEEEEEcCcceEEEECCCCe-----EEEEEEecCceEEEEccccEEEecCcCCC----CcccccHHHHH Confidence 76 45 389999999999999998776664 34666666777888999999999754432 33699999999 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 265 MKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 265 ~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) +.+|++++++++|+.+||.||++|+|||.+++ .+++++.+++++.|++.++|..|+|+++||-+ T Consensus 195 ~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~--~l~~e~~~~~~~~~~~~~~g~~nag~~~vl~~ 258 (437) T protein:vir:10 195 REVLGNSTAANKTSASVFRNGLRPSGVLSTDQ--ILQKEKRAEIRTDLAEQFGGAMQAGKTMVLEA 258 (437) T ss_pred HHHHHHHHHHHHHHHHHHhccCCccEEEEcCC--CCCHHHHHHHHHHHHHHhcCccccCcceeccC Confidence 99999999999999999999999999999875 58999999999999999999999999776655 No 11 >protein:vir:4698 Length: 251 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061630;genbank:gi:9635717;genbank:GeneID:1262980 Probab=100.00 E-value=2.6e-42 Score=248.57 Aligned_cols=246 Identities=12% Similarity=0.123 Sum_probs=171.3 Q ss_pred hhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhH Q lcl|NC_019511. 29 IQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQV 108 (330) Q Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~I 108 (330) +.+ |...++ .+......... ++.. ..|.+ +......... ....++|.|++||++++++| T Consensus 1 Mgl--------F~~~~~--r~~~~~~~~~~-~~~~---~~~~~----~~~~~~~v~~---~~al~~~~v~~~i~~ia~~i 59 (251) T protein:vir:46 1 MGI--------FYKNEK--RDLQYNEDDLQ-MMVQ---TLPSF----QGTKLRQYKD---IEAIRHSDIFTAVMMIASDL 59 (251) T ss_pred CCc--------cccccc--cccCCCccchh-hhhh---hhccc----cCcCcceech---hhhhccHHHHHHHHHHHHhH Confidence 111 000001 00000000000 0000 00000 0000000111 11223789999999999999 Q ss_pred hhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEe Q lcl|NC_019511. 109 STYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFS 188 (330) Q Consensus 109 a~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~ 188 (330) |+. .+.+.-+ .++. .+|.+..++..+||+++|.++|+++++.++|++||+|++++ T Consensus 60 A~l-----------p~~~~~~--~~~~----------~~~~~~~ll~~~Pn~~~t~~~f~~~l~~~lll~Gnay~~i~-- 114 (251) T protein:vir:46 60 ARM-----------PIRVTVN--GQIN----------YSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEIT-- 114 (251) T ss_pred hhC-----------ceEEeeC--cccc----------ccchHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEE-- Confidence 963 3333211 1111 12445566667899999999999999999999999999986 Q ss_pred cCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEE-e---CCceEEEechhHeeeecccCcCCCCCCCccccHHHHH Q lcl|NC_019511. 189 PKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQV-I---DKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIA 264 (330) Q Consensus 189 rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~-~---~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a 264 (330) |+..|+|++|+||+|.+|++..+.+|.. .|++. . .++....|+++||||++.++.+ +.+|+|||+++ T Consensus 115 r~~~G~~~~L~~i~~~~v~v~~~~~g~~-----~~~~~~~~~~~~g~~~~~~~~diiH~r~~~~d----g~~G~spi~~~ 185 (251) T protein:vir:46 115 RDKTGEPMNLTFRKTSEIELKSDARGRL-----YYFHQRIDSNGNNIERNVKFEDMLDIKFYSLD----GINGLSLLDTL 185 (251) T ss_pred ECCCCcEEEEEEECCceEEEEECCCCcE-----EEEEEEeccCCcceeEEECCccEEEecCcCCC----CeeecCHHHHH Confidence 5566899999999999999998877754 24333 2 2456678999999999876543 23699999999 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 265 MKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 265 ~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) +++|++++++++|++++|+||++|+|+|++++. -.++++++++|+.|++.++|.+|+|+++|.+| T Consensus 186 ~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-l~~~e~~~~~~~~~~~~~~g~~n~g~~~~gm~ 250 (251) T protein:vir:46 186 SRTIESDNNGKDFLNNFLRNGTHAGGILKMKGV-LDNKKARDRAREEFPKVLVELNKLGKLSYSMN 250 (251) T ss_pred HHHHHHHHHHHHHHHHHHHccCCCcEEEEeCCC-CCCHHHHHHHHHHHHHHhcCcccccccccccC Confidence 999999999999999999999999999999864 24788899999999999999999999999999 No 12 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=100.00 E-value=1.9e-42 Score=249.30 Aligned_cols=259 Identities=14% Similarity=0.151 Sum_probs=180.0 Q ss_pred hhHHHHHHHHHHHHhhcccchhccccchhcc--c-cccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHH Q lcl|NC_019511. 29 IQANIRQIEQDTKEMQEITKSLYGKQQAYAE--P-FLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRA 105 (330) Q Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~--~-~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~ 105 (330) +.+ +..+ ..++...+ ++ ......... + +..-+...+. .+.++ -+...+++.|++||+.++ T Consensus 1 M~~-~~r~-~~~~~~~~-r~--~~~~~~~~~~~~~~~~~~g~~~~-~~~v~-----------~~~al~~~~v~~~i~~ia 63 (432) T protein:vir:10 1 MKI-VDSV-KKFFNFEK-RQ--TSQVIELNKDDEKLLEWLGISPS-TISVK-----------GKNALKVATVFACIKILS 63 (432) T ss_pred CCh-HHHH-HHhcCccc-cC--cccccccCCchHHHHHHhCCCcC-ccccc-----------hhhhhccHHHHHHHHHHH Confidence 322 2222 11111111 01 011111110 0 1111101110 00111 112334899999999999 Q ss_pred HhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEE Q lcl|NC_019511. 106 NQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEK 185 (330) Q Consensus 106 d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~ 185 (330) ++||+. .+.+.-++++.. .+..+|.+.+++..+||+++|.++|+++++.++|++||+|+++ T Consensus 64 ~~ia~l-----------p~~~~~~~~~~~--------~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i 124 (432) T protein:vir:10 64 ESVSKL-----------PLKIYQEDEYGI--------QRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANI 124 (432) T ss_pred HhhccC-----------ceEEEEecCCce--------eeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEE Confidence 999953 333221211110 1123455666666789999999999999999999999999998 Q ss_pred EEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecccCcCCCCCCCccccHHHHHH Q lcl|NC_019511. 186 VFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAM 265 (330) Q Consensus 186 v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~ 265 (330) + ++..|+|++||||+|.+|++..++.|........|+++..++....|+++||+|++.++..+ +.||+||+++|+ T Consensus 125 ~--r~~~G~~~~L~~i~~~~v~v~~d~~~~~~~~~~~~y~~~~~g~~~~~~~~eiih~r~~~~~~---~~~G~s~~~~~~ 199 (432) T protein:vir:10 125 E--FDRKGKVQALWPIDASKVTVYIDDVGLLNSKTKMWYVVNTGGQQRVLKPEEILHFKNGITLD---GLVGVPTMEYLK 199 (432) T ss_pred E--ECCCCcEEEEEEEcCceeEEEEcCcccccccceEEEEEecCCeEEEEccccEEEecCCCCCC---CcccccHHHHHH Confidence 7 45678999999999999999999887655544556666667778889999999998643221 336999999999 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 266 KEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 266 ~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) ++|+.++++++|+++||+||++|+|+|.+++ .+++++.+++++.|++.++|..|+|+++||-+ T Consensus 200 ~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~--~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~ 262 (432) T protein:vir:10 200 STLENSASADKFINNFYKQGLQVKGLVQYVG--DLNEDAKKVFRENFESMSSGLQNSHRIALMPV 262 (432) T ss_pred HHHHHHHHHHHHHHHHHhccCCccEEEEcCC--CCCHHHHHHHHHHHHHHhcccccCCcceecCC Confidence 9999999999999999999999999998765 58999999999999999999999999887655 No 13 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=100.00 E-value=1.9e-42 Score=249.30 Aligned_cols=259 Identities=14% Similarity=0.151 Sum_probs=180.0 Q ss_pred hhHHHHHHHHHHHHhhcccchhccccchhcc--c-cccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHH Q lcl|NC_019511. 29 IQANIRQIEQDTKEMQEITKSLYGKQQAYAE--P-FLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRA 105 (330) Q Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~--~-~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~ 105 (330) +.+ +..+ ..++...+ ++ ......... + +..-+...+. .+.++ -+...+++.|++||+.++ T Consensus 1 M~~-~~r~-~~~~~~~~-r~--~~~~~~~~~~~~~~~~~~g~~~~-~~~v~-----------~~~al~~~~v~~~i~~ia 63 (432) T protein:vir:10 1 MKI-VDSV-KKFFNFEK-RQ--TSQVIELNKDDEKLLEWLGISPS-TISVK-----------GKNALKVATVFACIKILS 63 (432) T ss_pred CCh-HHHH-HHhcCccc-cC--cccccccCCchHHHHHHhCCCcC-ccccc-----------hhhhhccHHHHHHHHHHH Confidence 322 2222 11111111 01 011111110 0 1111101110 00111 112334899999999999 Q ss_pred HhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEE Q lcl|NC_019511. 106 NQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEK 185 (330) Q Consensus 106 d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~ 185 (330) ++||+. .+.+.-++++.. .+..+|.+.+++..+||+++|.++|+++++.++|++||+|+++ T Consensus 64 ~~ia~l-----------p~~~~~~~~~~~--------~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i 124 (432) T protein:vir:10 64 ESVSKL-----------PLKIYQEDEYGI--------QRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANI 124 (432) T ss_pred HhhccC-----------ceEEEEecCCce--------eeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEE Confidence 999953 333221211110 1123455666666789999999999999999999999999998 Q ss_pred EEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecccCcCCCCCCCccccHHHHHH Q lcl|NC_019511. 186 VFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAM 265 (330) Q Consensus 186 v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~ 265 (330) + ++..|+|++||||+|.+|++..++.|........|+++..++....|+++||+|++.++..+ +.||+||+++|+ T Consensus 125 ~--r~~~G~~~~L~~i~~~~v~v~~d~~~~~~~~~~~~y~~~~~g~~~~~~~~eiih~r~~~~~~---~~~G~s~~~~~~ 199 (432) T protein:vir:10 125 E--FDRKGKVQALWPIDASKVTVYIDDVGLLNSKTKMWYVVNTGGQQRVLKPEEILHFKNGITLD---GLVGVPTMEYLK 199 (432) T ss_pred E--ECCCCcEEEEEEEcCceeEEEEcCcccccccceEEEEEecCCeEEEEccccEEEecCCCCCC---CcccccHHHHHH Confidence 7 45678999999999999999999887655544556666667778889999999998643221 336999999999 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 266 KEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 266 ~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) ++|+.++++++|+++||+||++|+|+|.+++ .+++++.+++++.|++.++|..|+|+++||-+ T Consensus 200 ~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~--~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~ 262 (432) T protein:vir:10 200 STLENSASADKFINNFYKQGLQVKGLVQYVG--DLNEDAKKVFRENFESMSSGLQNSHRIALMPV 262 (432) T ss_pred HHHHHHHHHHHHHHHHHhccCCccEEEEcCC--CCCHHHHHHHHHHHHHHhcccccCCcceecCC Confidence 9999999999999999999999999998765 58999999999999999999999999887655 No 14 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=100.00 E-value=1.9e-42 Score=249.30 Aligned_cols=259 Identities=14% Similarity=0.151 Sum_probs=180.0 Q ss_pred hhHHHHHHHHHHHHhhcccchhccccchhcc--c-cccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHH Q lcl|NC_019511. 29 IQANIRQIEQDTKEMQEITKSLYGKQQAYAE--P-FLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRA 105 (330) Q Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~--~-~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~ 105 (330) +.+ +..+ ..++...+ ++ ......... + +..-+...+. .+.++ -+...+++.|++||+.++ T Consensus 1 M~~-~~r~-~~~~~~~~-r~--~~~~~~~~~~~~~~~~~~g~~~~-~~~v~-----------~~~al~~~~v~~~i~~ia 63 (432) T protein:vir:10 1 MKI-VDSV-KKFFNFEK-RQ--TSQVIELNKDDEKLLEWLGISPS-TISVK-----------GKNALKVATVFACIKILS 63 (432) T ss_pred CCh-HHHH-HHhcCccc-cC--cccccccCCchHHHHHHhCCCcC-ccccc-----------hhhhhccHHHHHHHHHHH Confidence 322 2222 11111111 01 011111110 0 1111101110 00111 112334899999999999 Q ss_pred HhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEE Q lcl|NC_019511. 106 NQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEK 185 (330) Q Consensus 106 d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~ 185 (330) ++||+. .+.+.-++++.. .+..+|.+.+++..+||+++|.++|+++++.++|++||+|+++ T Consensus 64 ~~ia~l-----------p~~~~~~~~~~~--------~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i 124 (432) T protein:vir:10 64 ESVSKL-----------PLKIYQEDEYGI--------QRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANI 124 (432) T ss_pred HhhccC-----------ceEEEEecCCce--------eeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEE Confidence 999953 333221211110 1123455666666789999999999999999999999999998 Q ss_pred EEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecccCcCCCCCCCccccHHHHHH Q lcl|NC_019511. 186 VFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAM 265 (330) Q Consensus 186 v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~ 265 (330) + ++..|+|++||||+|.+|++..++.|........|+++..++....|+++||+|++.++..+ +.||+||+++|+ T Consensus 125 ~--r~~~G~~~~L~~i~~~~v~v~~d~~~~~~~~~~~~y~~~~~g~~~~~~~~eiih~r~~~~~~---~~~G~s~~~~~~ 199 (432) T protein:vir:10 125 E--FDRKGKVQALWPIDASKVTVYIDDVGLLNSKTKMWYVVNTGGQQRVLKPEEILHFKNGITLD---GLVGVPTMEYLK 199 (432) T ss_pred E--ECCCCcEEEEEEEcCceeEEEEcCcccccccceEEEEEecCCeEEEEccccEEEecCCCCCC---CcccccHHHHHH Confidence 7 45678999999999999999999887655544556666667778889999999998643221 336999999999 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 266 KEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 266 ~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) ++|+.++++++|+++||+||++|+|+|.+++ .+++++.+++++.|++.++|..|+|+++||-+ T Consensus 200 ~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~--~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~ 262 (432) T protein:vir:10 200 STLENSASADKFINNFYKQGLQVKGLVQYVG--DLNEDAKKVFRENFESMSSGLQNSHRIALMPV 262 (432) T ss_pred HHHHHHHHHHHHHHHHHhccCCccEEEEcCC--CCCHHHHHHHHHHHHHHhcccccCCcceecCC Confidence 9999999999999999999999999998765 58999999999999999999999999887655 No 15 >protein:vir:1380 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612832;genbank:gi:20065966;genbank:GeneID:935782 Probab=100.00 E-value=3.1e-42 Score=248.17 Aligned_cols=259 Identities=15% Similarity=0.142 Sum_probs=178.8 Q ss_pred CchhHHHHHhcCC-CCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCccc Q lcl|NC_019511. 1 MPDLFKSLRLGSM-YKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMR 79 (330) Q Consensus 1 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r 79 (330) |- ||.+|-..+. .++.... +. . .. ++.... ..+......++.+. ++ T Consensus 1 MG-~f~~lf~~~~~~~~~~~~---~~------------~------~~--~~~~~~----~~~~~~~g~~~~~~--v~--- 47 (422) T protein:vir:13 1 MG-FLRGLFNKKNNNDEKRSN---YD------------E------DI--GIDISD----SNFWEKFGIKLNFS--VR--- 47 (422) T ss_pred Cc-hhhhhhhccCCccchhhh---hh------------h------cc--ccccCc----chhhhhccccCCcc--cc--- Confidence 32 4444411111 1000000 00 0 00 000000 00111111111111 11 Q ss_pred chHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCC Q lcl|NC_019511. 80 NAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDI 159 (330) Q Consensus 80 ~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn 159 (330) ....| +++.|++||+.++++||+. .+.+. ++. .+.. +|.+.+++..+|| T Consensus 48 ----~~~al----~~~~v~~ci~~ia~~iA~l-----------p~~~~-~~~-~~~~----------~~~~~~lL~~~PN 96 (422) T protein:vir:13 48 ----GKRAL----KENTVYVCTKIRAESIGKL-----------SLKIY-KDK-EEYK----------EHELYYLLRYKPN 96 (422) T ss_pred ----hhhhh----ccHHHHHHHHHHHHhhhhC-----------ceEEE-ecC-cccc----------cchHHHHHhhhcc Confidence 11223 3788999999999999963 22221 111 1111 2334455556899 Q ss_pred CcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccC-CceeEEEEeCCceEEEechh Q lcl|NC_019511. 160 DRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIK-GGNRFVQVIDKQVVASFTSR 238 (330) Q Consensus 160 ~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~-~~~~Y~q~~~~~~~~~~~~~ 238 (330) +++|+++|+++++.++|++||+|+++++ +..|+|++|+||+|.+|++..+++|.... +..+|++...+|...+|.++ T Consensus 97 ~~~t~~~f~~~~~~~lll~Gna~~~i~r--~~~G~~~~L~~i~~~~v~~~~~~~~~~~~~~~~~y~~~~~~g~~~~~~~~ 174 (422) T protein:vir:13 97 PLMSSINFWKCLETQRTLKGNAYAYIER--DRKGKIIGLYPINSDNVTKIIDDDNFLSSLSKVWYVVTDKNGKEHKLLPD 174 (422) T ss_pred cCCCHHHHHHHHHHHHhhcCCeEEEEEE--CCCCcEEEEEEECCcceEEEEcCCcceeccceEEEEEEeCCCeEEEEccc Confidence 9999999999999999999999999874 55689999999999999999998885533 44567777777888899999 Q ss_pred HeeeecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcC Q lcl|NC_019511. 239 ELVMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSG 318 (330) Q Consensus 239 dvih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G 318 (330) ||+|++.++..+ +.||+||++.|.++|++++++++|+.+||+||++|+|+|.+++ .+++++.++++++|++.++| T Consensus 175 eiih~~~~~~~~---~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~--~l~~e~~~~~~~~~~~~~~g 249 (422) T protein:vir:13 175 EMLHFIGDITLD---GLIGIKPLDYLRCTIENGRATQEFINKFFKNGLSIKGIVQYVG--DLDEKAKKIFKKEFESMSNG 249 (422) T ss_pred ceEEEcCCCCCC---CcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCC--CCCHHHHHHHHHHHHHHhcC Confidence 999998754332 3469999999999999999999999999999999999999875 58999999999999999999 Q ss_pred cccccccceeeC Q lcl|NC_019511. 319 INGSWQICLYIK 330 (330) Q Consensus 319 ~~na~kvpvL~e 330 (330) .+|+|+++||-+ T Consensus 250 ~~n~~~~~vl~~ 261 (422) T protein:vir:13 250 LENAHSISLLPF 261 (422) T ss_pred ccccCCceecCC Confidence 999999877766 No 16 >protein:vir:102727 Length: 945 # NCBI annotation: portal protein # Family: family:all:2446 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874016;genbank:gi:118197623;genbank:GeneID:4495919 Probab=100.00 E-value=4.8e-42 Score=247.09 Aligned_cols=299 Identities=20% Similarity=0.223 Sum_probs=183.9 Q ss_pred CchhHH-----------------------HHHhcCCCCCCcccccCcc---C-----cchhHHHHHHHHHHHHhhcccch Q lcl|NC_019511. 1 MPDLFK-----------------------SLRLGSMYKEDTEDLMVPI---D-----DGIQANIRQIEQDTKEMQEITKS 49 (330) Q Consensus 1 ~~~~~~-----------------------~~~~~~~~~~~~~~~~~~~---~-----~~~~~~~~~~~~~~~~~~~~~~~ 49 (330) +.++.| +|..++.||-=+--++-.. + +.+--+++++ ++.++...+ T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~kk~~i~~p 79 (945) T protein:vir:10 4 LENIIKGFIVNANEQKRPSFSSNIKANVDSLSRGKDYPGFKPLLTYRALAWNSTVVYSIIIFRKNQV----LKKEKIIVP 79 (945) T ss_pred hhhHhhhheeccccccCccccccchhchhhhhcccCCCCcchhhhhhhhhccceeeeeeeeehhhhH----HHhhccccc Confidence 222222 1222222222111111000 1 1222233333 222333333 Q ss_pred hccccchhcccccc----ccccCCCCCcCCCcccchHHHHHHHHHHh-hcHHHHHHHHHHHHhHhhhhhhh-eecccccc Q lcl|NC_019511. 50 LYGKQQAYAEPFLE----MMDTNPDYRDKKSYMRNAHNLHEVLKKFG-NNSILNAIIITRANQVSTYCKPA-RYSEKGVG 123 (330) Q Consensus 50 ~~g~~~~~~~~~~~----~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a-~~~iv~a~I~~~~d~Ia~~~~~~-~~~~~~~g 123 (330) ...+...-...... ..+.-|.+..|-+.. .....++.| +++.|++||+.++++||+..... +..+ T Consensus 80 fkkk~~~~~~d~f~~s~es~s~vtsls~pdaf~-----~vnVs~~~AlknsaV~scI~~IA~sIAsLPlklYrr~e---- 150 (945) T protein:vir:10 80 YNHQEPPFKFNLFEYSPESLMYLPSISDPDAFF-----LINLFRKYRFNNDSKLIKVSEIPKKLTSKELEIYKHIE---- 150 (945) T ss_pred ccccccchhhhhhhccCccceecccccCcccee-----eehhhhhhhhccHHHHHHHHHHHhhhccCceEEEEecc---- Confidence 33222211111000 011111111111110 012334444 47999999999999999642211 1111 Q ss_pred eeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHH----HHHHHHHHHHhcCCceeEEEEecCCCcceEEEE Q lcl|NC_019511. 124 FEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQE----FCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFI 199 (330) Q Consensus 124 ~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~----fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~ 199 (330) +.......++. .-.|.++.++. +||+.+|.++ |+++++.++|++||+|+++++ +..|+|++|+ T Consensus 151 ------dG~~~~~~kk~----~~~hpL~~LL~-rPNp~mT~~eFwqsFl~~Lv~dLLL~GNAYieIiR--d~~G~ii~L~ 217 (945) T protein:vir:10 151 ------DKHVNYYLKRI----RDARNILEFLE-RPDPYFSEVNSWEYLLGMVLDDILTIDRGAIVKIR--DEQGNLVAIT 217 (945) T ss_pred ------cCccccccccc----ccchHHHHHHh-CCCcccChhHHHHHHHHHHHHHHhhcCCeEEEEEE--CCCCcEEEEE Confidence 11110000111 11233344443 6888877555 888999999999999999874 5568999999 Q ss_pred eeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019511. 200 AVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFND 279 (330) Q Consensus 200 pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~ 279 (330) |++|.+|++..+++|... .+|++..+|+....|.++|++|+.+++..+....+||+|||++|+++|+.++++++|++ T Consensus 218 pLdPs~Vti~~ddDG~~~---y~Yv~~idG~~~~~v~a~DvIlhirn~s~DG~~~GyGlSPIeaa~~aI~~alAaek~aa 294 (945) T protein:vir:10 218 PVDGTTIKPILSEDTGIV---VGYVQEVDGAIVAHFDKRDVVLFRQNLTPDVYMYGYSLPPIEILYKVILSDIFIDKGNL 294 (945) T ss_pred EECCcceEEEEcCCCcEE---EEEEEecCCceEEEecCCceEEEeccCCCCcccccCCchHHHHHHHHHHHHHHHHHHHH Confidence 999999999998887542 35677778888888999999888888877777788999999999999999999999999 Q ss_pred HHHh-cCCCcceEEEeCCC--------CCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 280 RFFS-HGGTTRGILQIRAD--------QQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 280 ~fF~-nGa~p~GiL~~~~~--------~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) +||. ||++|+|+|.++++ +.+++++++++|++|++.++| .|+|+ |++++ T Consensus 295 r~FskNGa~PsGILsvkg~~~~d~k~~~~LseEq~erlKe~wee~~sG-~NnG~-piVLd 352 (945) T protein:vir:10 295 DYYRKGGSIPEGILAIEPPSYKEGDIYPQLSREQLESIQRQLQAIMMG-DYTQV-PILSG 352 (945) T ss_pred HHHHhCCCccceEEEecCccccccccccccCHHHHHHHHHHHHHHhCC-ccccc-ceecC Confidence 9996 78999999998643 568999999999999999999 46666 55565 No 17 >protein:vir:105064 Length: 421 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006584;genbank:gi:46402090;genbank:GeneID:2777930 Probab=100.00 E-value=3.6e-42 Score=247.77 Aligned_cols=249 Identities=14% Similarity=0.132 Sum_probs=172.0 Q ss_pred HHHHHHHHHHHhhcccchhccc-cchhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhh Q lcl|NC_019511. 32 NIRQIEQDTKEMQEITKSLYGK-QQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQVST 110 (330) Q Consensus 32 ~~~~~~~~~~~~~~~~~~~~g~-~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~ 110 (330) ++ +..-+.+. +++.++. .+..... .+...+|. ++.....+. ..+++.|++||+.|+++||+ T Consensus 1 m~--~~~~~~~~---~~~~s~~~~w~~~~~---------~~~~~~~~-~g~~vt~~~---al~~~~v~~~i~~Ia~~iA~ 62 (421) T protein:vir:10 1 MF--IPQMFEGK---KRSVSGGGFWEAMLG---------GVRSSHSK-AGVMITPET---ALALSAVRACVTLLAESVAQ 62 (421) T ss_pred CC--Ccchhccc---ccccCcchhhHHHhh---------hhccCccc-CCceechHH---hhccHHHHHHHHHHHHhhcc Confidence 11 22221111 1111111 1111000 11111111 111111222 12378999999999999996 Q ss_pred hhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecC Q lcl|NC_019511. 111 YCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPK 190 (330) Q Consensus 111 ~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd 190 (330) . .|.+.-++++.+. .+..+|.+.+++..+||+++|.++|+++++.++|++|++|++++ |+ T Consensus 63 l-----------p~~~~~~~~~g~~-------~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~--r~ 122 (421) T protein:vir:10 63 L-----------PVELYRRDKNGGR-------QRATDHPIYDLIHSQPNKKDTSFEYFEQQQGLLGLEGNCYSIID--RD 122 (421) T ss_pred C-----------ceEEEEEcCCCce-------eecccchHHHHHhhcccCCCCHHHHHHHHHHHHhhcCCeEEEEE--Ec Confidence 3 3332211111100 01123456666777899999999999999999999999988886 56 Q ss_pred CCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecccCcCCCCCCCccccHHHHHHHHHHH Q lcl|NC_019511. 191 NKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKEFIA 270 (330) Q Consensus 191 ~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I~~ 270 (330) +.|+|++||||+|.+|++..+.+|.. |+++..++. +++++||+|++.++.+ +.||+|||+.++++|++ T Consensus 123 ~~G~~~~L~~l~~~~v~v~~~~~g~~------~y~~~~~g~--~~~~~eiih~~~~~~d----~~~G~spi~~~~~~i~~ 190 (421) T protein:vir:10 123 GKGYPKELIPINPKKVIVLKGPDGMP------YYEIPEIGE--TLPMRMMHHVKVFSLD----GYIGSSPIQTNADVLGL 190 (421) T ss_pred CCCcEEEEEEecCceEEEEECCCceE------EEEEcCCCc--EEchhhEEEecCcCCC----CcccccHHHHHHHHHHH Confidence 67899999999999999988877643 444444443 5789999999876543 23699999999999999 Q ss_pred HHHHHHHHHHHHhcCCCcceEEEeCCC--CCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 271 YNNTESFNDRFFSHGGTTRGILQIRAD--QQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 271 ~laae~~~~~fF~nGa~p~GiL~~~~~--~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) ++++++|++++|.||++|+|+|.++++ ..+++|+++++++.|++.|+|.+|+|+++||-+ T Consensus 191 ~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~ 252 (421) T protein:vir:10 191 NLAVEEHASAVFRRGATMSGVIERPKEAPAIKSQEKIDQLLAKWTDRYSGINNMFSVALLQE 252 (421) T ss_pred HHHHHHHHHHHHhcCCCccEEEEecCccCccCCHHHHHHHHHHHHHHhcCccccCcceecCC Confidence 999999999999999999999998764 245999999999999999999999999777766 No 18 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=100.00 E-value=2.2e-42 Score=249.01 Aligned_cols=260 Identities=13% Similarity=0.073 Sum_probs=174.2 Q ss_pred hhHHHHHHHHHHHHhhccc-chhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHh Q lcl|NC_019511. 29 IQANIRQIEQDTKEMQEIT-KSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQ 107 (330) Q Consensus 29 ~~~~~~~~~~~~~~~~~~~-~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~ 107 (330) +...+-++-..+....++. ....|+.....++..- +.++.-+.+ .+.....+ ...+++.|++||++|+++ T Consensus 1 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~g~~~~--~g~~v~~~---~al~~~~V~~~i~~ia~~ 71 (434) T protein:vir:43 1 MSKSLGKVLSSATSAPRSSLFGWGGKTIRLTDGAFW----SQFLGRESS--SGKKVTVD---KAMKLSAVWACVRLISTS 71 (434) T ss_pred CccchhhhhhhcccccchhhhcccccccccCchHHH----HHHhcCCcc--CCceechh---hhhccHHHHHHHHHHHHh Confidence 1111111111111100000 0001122111111000 000111110 11111112 122388999999999999 Q ss_pred HhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEE Q lcl|NC_019511. 108 VSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVF 187 (330) Q Consensus 108 Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~ 187 (330) ||+... .+--++.+.. +.+...|.+.+++..+||+++|.++|++.++.++|++||+|+++. T Consensus 72 ia~lp~-----------~~~~~~~~g~-------~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~- 132 (434) T protein:vir:43 72 VAGLPL-----------GVYERKADGS-------RVDARSFPLYDVVHNSPNDDMTAFQFWQAMVASMLLWGNAYAEIR- 132 (434) T ss_pred hhhCce-----------EEEEEcCCCc-------cccccccHHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEE- Confidence 996422 2211111100 112234567777778999999999999999999999999988864 Q ss_pred ecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecccCcCCCCCCCccccHHHHHHHH Q lcl|NC_019511. 188 SPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKE 267 (330) Q Consensus 188 ~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~ 267 (330) ++ .|+|++|+||+|.+|++..+.+|. +.|+++..++....|+++||+|++.++.. +.+|+|||++++++ T Consensus 133 -~~-~G~~~~L~~l~p~~v~~~~~~~g~-----~~y~~~~~~g~~~~~~~~eVih~~~~~~d----g~~G~spi~~~~~~ 201 (434) T protein:vir:43 133 -RA-AGRPAALDFLLPSRVDLECDENGR-----LKYFYTTKKGARREIERTNMLHIPAFTLD----GRIGLSAIRYGVDV 201 (434) T ss_pred -eC-CCcEEEEEEEcCcceEEEEcCCCe-----EEEEEEecCceEEEEccccEEEecCcCCC----CccccCHHHHHHHH Confidence 44 599999999999999999887765 35777777777889999999999865433 23699999999999 Q ss_pred HHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 268 FIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 268 I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) |++++++++|+++||+||++|+|+|.+++ .+++++.+++|++|++ +.|..|+|+++||-+ T Consensus 202 i~~~~~~~~~~~~~f~ng~~~~gil~~~~--~l~~e~~~~~r~~~~~-~~g~~nag~~~vl~~ 261 (434) T protein:vir:43 202 FGSVMSAEDAANGTFKNGLLPTVAFKVDR--ILQPAQREEFREYVKS-VSGAMNSGRSPVLEQ 261 (434) T ss_pred HHHHHHHHHHHHHHHhccCCcceEEecCC--CCCHHHHHHHHHHHHH-hcCccccCCccccCC Confidence 99999999999999999999999998875 5899999999999975 678899999888766 No 19 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=100.00 E-value=8.5e-42 Score=245.75 Aligned_cols=248 Identities=10% Similarity=0.092 Sum_probs=168.6 Q ss_pred HHhhcccchhccccchhcc------ccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhh Q lcl|NC_019511. 41 KEMQEITKSLYGKQQAYAE------PFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKP 114 (330) Q Consensus 41 ~~~~~~~~~~~g~~~~~~~------~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~ 114 (330) .=+ ++|...+-.. .+.......|....+.+.. ..-....+.++|.|++||++|+++||+. T Consensus 1 ~~~------~~~~~~~~p~~~~~~~~~~~~~~~~~~~g~~~~~~-----~~~~~~~~~~~~~V~acV~~IA~~iA~l--- 66 (518) T protein:vir:78 1 MLL------ANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQ-----FSLYGGIYKNQPWVRTVIAKRAQALARL--- 66 (518) T ss_pred Ccc------cCceeeccchhhhhhhhhhhcccccceeceecccc-----cchhhHHhhhhHHHHHHHHHHHHhhccC--- Confidence 001 1121111110 0011111111111111110 0011133446899999999999999963 Q ss_pred heecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcc Q lcl|NC_019511. 115 ARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTK 194 (330) Q Consensus 115 ~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~ 194 (330) .|.+..++.+.... ...+.+ ..+..+||+++|.++|+++++.++|+.||+|++++ |+..|+ T Consensus 67 --------p~~l~~~~~~~~~~--------~~~~~~-~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~--r~~~G~ 127 (518) T protein:vir:78 67 --------PVKCMFTSGDTETE--------EHDTGY-AKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQ--KNKSGT 127 (518) T ss_pred --------ceEEEEEcCCcccc--------ccchHH-HHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEE--EcCCCc Confidence 33332222211110 001122 22345799999999999999999999999999986 466789 Q ss_pred eEEEEeeCCCceEEeeCCCCcccCCceeEEEEeC---CceEEEechhHeeeecccCcCCCCCCCccccHHHHHHHHHHHH Q lcl|NC_019511. 195 MEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVID---KQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKEFIAY 271 (330) Q Consensus 195 ~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~---~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~ 271 (330) |++||||+|.+|++..+.++. ...|++... ++....|.++||||++.+...+ ..||+|||++++++|+++ T Consensus 128 ~~~L~~l~p~~Vtv~~~~~~~----~~~y~~~~~~~~~~~~~~~~~~eIiHir~~~~dg---~~~G~Spi~~~~~~i~~~ 200 (518) T protein:vir:78 128 PEKLMPMHPSRVAIKRNSRTG----RYEYYFQAGAGVGTQLVSFADDEVVPIRFFNPDG---LERGLSLMESLKSTIFSE 200 (518) T ss_pred EEEEEEECCCceEEEEcCCCC----EEEEEEEecCCccceeEEecCCcEEEecCCCCCc---ccccccHHHHHHHHHHHH Confidence 999999999999998775432 234555443 3456789999999998543322 126999999999999999 Q ss_pred HHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 272 NNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 272 laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) +++++|+++||+||++|+|||.+++ .+++++.+++|+.|++.++|..|+|+++||-+ T Consensus 201 ~aa~~~~~~~f~Ng~~p~gvl~~~~--~ls~e~~~~~k~~~~~~~~G~~nag~~~vL~~ 257 (518) T protein:vir:78 201 DSSRNATAAMWKNAGRPNLVLRHEK--RLSPEAQQRLREQFDRAHAGSSNTGKTMVVEE 257 (518) T ss_pred HHHHHHHHHHHhcCCCccEEEecCC--CCCHHHHHHHHHHHHHHhcCcccCCceeEcCC Confidence 9999999999999999999998775 58999999999999999999999999887766 No 20 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=100.00 E-value=6.8e-42 Score=246.27 Aligned_cols=255 Identities=15% Similarity=0.140 Sum_probs=177.9 Q ss_pred hhHHHHHHHHHHHHhhcc----cchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHH Q lcl|NC_019511. 29 IQANIRQIEQDTKEMQEI----TKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITR 104 (330) Q Consensus 29 ~~~~~~~~~~~~~~~~~~----~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~ 104 (330) +. +.+.++...+- +...+.....+.. +.. . ..+.. ..+ . +...+++.|++||+.+ T Consensus 1 M~-----~~~~~f~~~~r~~~~~~~~~~~~~~~~~-~~g-~-~~~~~--~v~--------~---~~al~~~~v~~~i~~i 59 (429) T protein:vir:10 1 MD-----SVKKFFNFEKRQTSQVIELNKDDEKLLE-WLG-I-SPSTI--SVK--------G---KNALKVATVFACIKIL 59 (429) T ss_pred Cc-----hhhhhhcccccCcccccccCCChHHHHH-Hhc-C-CCCcc--eec--------h---hhhhccHHHHHHHHHH Confidence 11 11222211110 0111111111100 000 0 01111 111 1 1122489999999999 Q ss_pred HHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeE Q lcl|NC_019511. 105 ANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFE 184 (330) Q Consensus 105 ~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~ 184 (330) +++||+. .|.+--++++. ..+..+|.+.+++..+||+.+|.++|+++++.++|++||+|++ T Consensus 60 a~~ia~l-----------~~~~~~~~~~~--------~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~ 120 (429) T protein:vir:10 60 SESVSKL-----------PLKIYQEDEYG--------IQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYAN 120 (429) T ss_pred HHhhccC-----------ceEEEEecCCc--------eeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEE Confidence 9999963 22221111110 0122345566666678999999999999999999999999999 Q ss_pred EEEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecccCcCCCCCCCccccHHHHH Q lcl|NC_019511. 185 KVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIA 264 (330) Q Consensus 185 ~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a 264 (330) ++ |+..|+|++|||++|.+|++..++.|........|+++..++....|+++||||++.+...+ +.+|+|||+.| T Consensus 121 i~--r~~~G~~~~L~~i~~~~v~v~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~evih~~~~~~~~---~~~G~s~i~~~ 195 (429) T protein:vir:10 121 IE--FDRKGKVQALWPIDASKVTVYIDDVGLLNSKTKMWYVVNTGGQQRVLKPEEILHFKNGITLD---GLVGVPTMEYL 195 (429) T ss_pred EE--ECCCCcEEEEEEEcCceeEEEEcCcccccccceEEEEEccCCeEEEEccccEEEecCCCCCC---CcccccHHHHH Confidence 87 45678999999999999999999887654444455555567777889999999998643222 33599999999 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 265 MKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 265 ~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) +.+|++++++++|+.++|.||++|+|+|.+++ .+++++.+++++.|++.++|..|+|+++||-+ T Consensus 196 ~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~--~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~ 259 (429) T protein:vir:10 196 KSTLENSASADKFINNFYKQGLQVKGLVQYVG--DLNEDAKKVFRENFESMSSGLQNSHRIALMPV 259 (429) T ss_pred HHHHHHHHHHHHHHHHHHhccCCccEEEEcCC--CCCHHHHHHHHHHHHHHhccccccCceeecCC Confidence 99999999999999999999999999998875 58999999999999999999999999887765 No 21 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=100.00 E-value=1.5e-41 Score=244.35 Aligned_cols=246 Identities=16% Similarity=0.130 Sum_probs=175.9 Q ss_pred HHHHHHHHHHHhhcccchhccccchhccc---cccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhH Q lcl|NC_019511. 32 NIRQIEQDTKEMQEITKSLYGKQQAYAEP---FLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQV 108 (330) Q Consensus 32 ~~~~~~~~~~~~~~~~~~~~g~~~~~~~~---~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~I 108 (330) ++ ..+...-+......+ +.. +....++.. ..+. ...-+++.|++||+.++++| T Consensus 1 m~------------f~~~~~~~~~~~~~~~~~~~~------~~g~~~~~~---~v~~---~~al~~~~v~~~i~~ia~~i 56 (409) T protein:vir:10 1 ML------------FRKGFKNQSQEISIDDKKILE------WLGINPSET---YVNG---KSCLKQATVFGCIRILSDNI 56 (409) T ss_pred Cc------------ccccccCcCCCCCCChHHHHH------HhcCCcCcc---eech---hhhhccHHHHHHHHHHHHhh Confidence 11 111000011111000 000 001011100 0001 12224899999999999999 Q ss_pred hhhhhhheecccccceeeec-cCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEE Q lcl|NC_019511. 109 STYCKPARYSEKGVGFEVKL-KDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVF 187 (330) Q Consensus 109 a~~~~~~~~~~~~~g~~v~~-kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~ 187 (330) |+. .|.+.- ++..+ +...|.++.++..+||+.+|.++|+++++.++|++||+|++++ T Consensus 57 a~l-----------p~~~~~~~~~~~----------~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~- 114 (409) T protein:vir:10 57 SKL-----------PIKIYQKKDGIK----------RVPDHYLEYLLKLRPNPYMSSSDFWKCIEVQRNIYGNAYVALD- 114 (409) T ss_pred hhC-----------ceEEEEecCCee----------eccCchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEE- Confidence 963 222211 11111 1123445556667899999999999999999999999999986 Q ss_pred ecCCCcceEEEEeeCCCceEEeeCCCCccc-CCceeEEEEeCCceEEEechhHeeeecccCcCCCCCCCccccHHHHHHH Q lcl|NC_019511. 188 SPKNKTKMEKFIAVDPSTIFYATDKNGKII-KGGNRFVQVIDKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMK 266 (330) Q Consensus 188 ~rd~~G~~~~L~pldp~tV~~~~d~~G~~~-~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~ 266 (330) ++..|++++||||+|.+|++..+++|... ...+.|++...++....|+++||+|++.+..+ +.||+|||+.+++ T Consensus 115 -r~~~G~~~~L~~i~~~~V~v~~~~~~~~~~~~~~~y~~~~~~g~~~~~~~~evih~r~~~~d----~~~G~s~i~~~~~ 189 (409) T protein:vir:10 115 -FKKNGEIKGLYPLKSDGMKIFVDDTGLLNSENNVWYLYTDDLGQRHKFMSDEILHFKGLTAD----GLAGLSVIELLNH 189 (409) T ss_pred -EcCCCcEEEEEEEcCCceEEEEcCCccccccceEEEEEEeCCceeEEeccccEEEecCcCCC----CcccccHHHHHHH Confidence 45668999999999999999998887543 34456777777788889999999999865433 3469999999999 Q ss_pred HHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 267 EFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 267 ~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) +|+++.++++|+.++|+||++|+|||++++ .+++++.+++++.|++.++|..|+|+++||-+ T Consensus 190 ~i~~~~~~~~~~~~~f~ng~~~~gil~~~~--~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~ 251 (409) T protein:vir:10 190 LIENGKSSETYLNNFFKNGLQVKGLVQYAG--DLNPEAEEVFKENFERMSSGLKNAHRIAMLPI 251 (409) T ss_pred HHHHHHHHHHHHHHHHhccCCCcEEEEcCC--CCCHHHHHHHHHHHHHHhccccccCCceecCC Confidence 999999999999999999999999999875 58999999999999999999999999888766 No 22 >protein:vir:101648 Length: 518 # NCBI annotation: gp11 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654766;genbank:gi:109302764;genbank:GeneID:4156082 Probab=100.00 E-value=2e-41 Score=243.67 Aligned_cols=248 Identities=11% Similarity=0.092 Sum_probs=167.6 Q ss_pred HHhhcccchhccccchhc-----ccc-ccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhh Q lcl|NC_019511. 41 KEMQEITKSLYGKQQAYA-----EPF-LEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKP 114 (330) Q Consensus 41 ~~~~~~~~~~~g~~~~~~-----~~~-~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~ 114 (330) .=+ ++|...+-. .|. .......|....+.+. ... -....+.+++.|++||++|+++||+... T Consensus 1 ~~~------~~~~~~~~p~~~e~~~~~~~~~~~~~~~~~~~~~--~~~---~~~~~a~~~~~V~acV~~IA~~iA~lpl- 68 (518) T protein:vir:10 1 MLL------ANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLER--QFS---LYGGIYKNQPWVRTVIAKRAQALARLPV- 68 (518) T ss_pred Ccc------cCceeecCchhhhhhhhhhcccccccccceeccc--ccc---hhhHHHhhhHHHHHHHHHHHHhhccCce- Confidence 001 112211110 000 0111001111111110 000 1112344689999999999999996422 Q ss_pred heecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcc Q lcl|NC_019511. 115 ARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTK 194 (330) Q Consensus 115 ~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~ 194 (330) .+.-++.+.... ...+.+ +.+...||+++|.++|+++++.++|++||+|++++ |+..|+ T Consensus 69 ----------~l~~~~~~~~~~--------~~~~~~-~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~--r~~~G~ 127 (518) T protein:vir:10 69 ----------KCMFTSGDTETE--------ESDTGY-AKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQ--KNKSGT 127 (518) T ss_pred ----------EEEEEcCCCcee--------ccchHH-HHHHcCCCCCCCHHHHHHHHHHHHhhcCCeEEEEE--ECCCCc Confidence 221111111000 011222 22335799999999999999999999999999986 456689 Q ss_pred eEEEEeeCCCceEEeeCCCCcccCCceeEEEEeC---CceEEEechhHeeeecccCcCCCCCCCccccHHHHHHHHHHHH Q lcl|NC_019511. 195 MEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVID---KQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKEFIAY 271 (330) Q Consensus 195 ~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~---~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~ 271 (330) |++|+||+|..|++..+.++. ...|++... ++...+|+++||||++.+...+ ..||+|||++|+++|+++ T Consensus 128 ~~~L~~l~p~~v~v~~~~~~~----~~~y~~~~~~~~~~~~~~~~~~eViHir~~s~dg---~~~G~spi~~a~~~i~~~ 200 (518) T protein:vir:10 128 PEKLMPMHPSRVAIKRNSRTG----RYEYYFQAGAGVGTQLVSFADDEVVPIRFFNPDG---LERGLSLMESLKSTIFSE 200 (518) T ss_pred EEEEEEECCCceEEEEcCCCC----EEEEEEEecCCccceEEEecCCcEEEecCCCCCc---ccccccHHHHHHHHHHHH Confidence 999999999999998775432 234555543 3456789999999997543322 126999999999999999 Q ss_pred HHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 272 NNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 272 laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) +++++|+++||+||++|+|||.+++ .+++++.+++|+.|++.++|..|+|+++||-+ T Consensus 201 ~a~~~~~~~~f~ng~~p~gil~~~~--~ls~e~~~~~k~~~~~~~~G~~nag~v~vL~~ 257 (518) T protein:vir:10 201 DSSRNATAAMWKNAGRPNLVLRHEK--RLSEAAQQRLREQFDRAHSGSSNTGKTMVVEE 257 (518) T ss_pred HHHHHHHHHHHhcCCCccEEEecCC--CCCHHHHHHHHHHHHHHhcCccccCcceEcCC Confidence 9999999999999999999998875 48999999999999999999999999888766 No 23 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=100.00 E-value=5.6e-41 Score=241.25 Aligned_cols=253 Identities=11% Similarity=0.146 Sum_probs=166.9 Q ss_pred chhHHHHHHHHHHHHhhcccchhccccc---hhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHH Q lcl|NC_019511. 28 GIQANIRQIEQDTKEMQEITKSLYGKQQ---AYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITR 104 (330) Q Consensus 28 ~~~~~~~~~~~~~~~~~~~~~~~~g~~~---~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~ 104 (330) -.+. ++..+ ++...++.- .+..-+ ... ..+++.. ...+.....+. ..+++.|++||++| T Consensus 1 ~~~~---------~~~~~-~~~~~~~~~~~~~~~~~~-~~~-~~~~~g~---~~~g~~v~~~~---al~~~~V~~~v~~I 62 (454) T protein:vir:93 1 MWNL---------LRRTR-KNQKSGRDVREAGWTSLF-QAV-AEPFAGA---WQQGVKADPEA---VLSFHAVFACISLI 62 (454) T ss_pred CCCc---------cccCc-ccccccccccchhhhhhh-hhh-hhhhcch---hhcCcccChHH---hhccHHHHHHHHHH Confidence 1111 11111 111111110 000000 000 0000000 00111111121 22378899999999 Q ss_pred HHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeE Q lcl|NC_019511. 105 ANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFE 184 (330) Q Consensus 105 ~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~ 184 (330) +++||+. .|.+--++++. .. .+..+...+.+...||+++|.++|+++++.++|++||+|++ T Consensus 63 a~~iA~l-----------p~~~~~~~~~g--~~------~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gna~~~ 123 (454) T protein:vir:93 63 SQDIAKM-----------RLRLMQTDAQG--IR------RETRRGDIARLCRRPNAQQNRIQFFELWLNAKLRHGNTVVL 123 (454) T ss_pred HHhhccC-----------ceEEEEeccCC--cc------chhhhHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEE Confidence 9999964 22221111110 00 01112222333468999999999999999999999999999 Q ss_pred EEEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCC----ceEEEechhHeeeecccCcCCCCCCCccccH Q lcl|NC_019511. 185 KVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDK----QVVASFTSRELVMGIRNPRSDLNSSGYGLSE 260 (330) Q Consensus 185 ~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~----~~~~~~~~~dvih~~~n~~~d~~~~~yGlSP 260 (330) +++ +..|++++||||+|.+|++..+.+|.. .|.+.... +...+|+++||+|++.++..+ +.||+|| T Consensus 124 i~r--~~~G~~~~L~~i~~~~v~v~~~~~g~~-----~y~~~~~~~~~~~~~~~~~~~eViH~k~~~~~~---~~~G~sp 193 (454) T protein:vir:93 124 KIR--NARGQIKELRILDWNRVEPLVADDGEV-----FYRITPDRNCGITEAVTVPAREVIHDRFNCFFH---PLIGLPP 193 (454) T ss_pred EEE--CCCCcEEEEEEEcCcceEEEEcCCCcE-----EEEEEeccccccceeEEecCcceEEeccCCCCC---CceeccH Confidence 874 556899999999999999998877753 35554332 335679999999998643322 3479999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 261 VEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 261 Ie~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) ++.|+++|++++++++|+++||+||++|+|+|.+++ .+++|+.++++++|++.++| .|+|+++||-+ T Consensus 194 ~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~--~l~~e~~~~~~~~~~~~~~g-~n~g~~~vl~~ 260 (454) T protein:vir:93 194 VYAAGLAATQGHHIQENSTSFFRNGGRPSGVIEIPG--SITEENAKKLKSNWDSGYTG-ENAGKTAILSN 260 (454) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCccEEEecCC--CCCHHHHHHHHHHHHHHhcc-cccCCceeccC Confidence 999999999999999999999999999999998875 58999999999999999998 79999887765 No 24 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=100.00 E-value=9.2e-41 Score=240.09 Aligned_cols=248 Identities=11% Similarity=0.064 Sum_probs=170.5 Q ss_pred HHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhh Q lcl|NC_019511. 32 NIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQVSTY 111 (330) Q Consensus 32 ~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~ 111 (330) ++ +..-+ +.....+.-.+. +... +|...++. .+.....+. .-+++.|++||++|+++||+. T Consensus 1 m~--~~~~~----~~~~~~~~~~~~---~~~~------~~~~~~~~-~g~~v~~~~---al~~~~v~~~i~~ia~~ia~l 61 (419) T protein:vir:57 1 MF--IPQFW----KGRPSENRVNWQ---VVPG------GMRSSSSQ-AGVIITPET---ALALSAVRACVTLLAESVAQL 61 (419) T ss_pred Cc--chhhh----ccCCcccccccc---cccc------cccccccc-CCceechHH---hhccHHHHHHHHHHHHhhccC Confidence 11 11000 000000001110 0001 11111111 111122222 113788999999999999963 Q ss_pred hhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCC Q lcl|NC_019511. 112 CKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKN 191 (330) Q Consensus 112 ~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~ 191 (330) .|.+--++++... -...+|.+.+++...||+++|.++|++.++.+++++|++|++++ |+. T Consensus 62 -----------p~~~~~~~~~g~~-------~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~--r~~ 121 (419) T protein:vir:57 62 -----------PCVLYRRTENGGR-------EIAFDHPLHDLIRYQPNRKDTAFEYHEQTQGVLGLEGNSYSLID--RNG 121 (419) T ss_pred -----------ceEEEEEcCCCce-------eccccchHHHHHhhccccCCCHHHHHHHHHHHHhhcCCeEEEEE--ECC Confidence 2222111111000 01124456666667899999999999999999999999999986 566 Q ss_pred CcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecccCcCCCCCCCccccHHHHHHHHHHHH Q lcl|NC_019511. 192 KTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKEFIAY 271 (330) Q Consensus 192 ~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~ 271 (330) .|+|++||||+|.+|++..+.+|.. |+++..++ ..++.+||+|++.++.+ +.||+|||++++.+|+++ T Consensus 122 ~G~~~~L~pl~~~~v~v~~~~~g~~------~y~~~~~~--~~~~~~~vih~r~~~~d----~~~G~s~i~~~~~~i~~~ 189 (419) T protein:vir:57 122 RGDITELIPINPHKVIVLKGPDGMP------YYDIPSIG--EILPMRMVHHIKSFSLD----GYIGTSPIQTNPDVLGLG 189 (419) T ss_pred CCcEEEEEEEcCcceEEEECCCceE------EEEEcCCc--eEEchhhEEEecCcCCC----CcccccHHHHHHHHHHHH Confidence 7899999999999999988877653 44443333 35789999999866543 346999999999999999 Q ss_pred HHHHHHHHHHHhcCCCcceEEEeCCC--CCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 272 NNTESFNDRFFSHGGTTRGILQIRAD--QQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 272 laae~~~~~fF~nGa~p~GiL~~~~~--~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) +++++|+.+||+||++|+|+|.++++ ..+++++++++++.|.+.++|..|+|+++||-+ T Consensus 190 ~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~e~~~~~~~~~~~~~~g~~nag~~~vl~~ 250 (419) T protein:vir:57 190 IAVEQHAAQVFARGTTMSGVIERPFEAKAIASQAAVDAILAKWTERYGGVRNAFSVGMLQE 250 (419) T ss_pred HHHHHHHHHHHHccCCccEEEEecCcCCcccCHHHHHHHHHHHHHHhccccccccceecCC Confidence 99999999999999999999998753 458999999999999999999999999887766 No 25 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=100.00 E-value=9.2e-41 Score=240.08 Aligned_cols=249 Identities=13% Similarity=0.104 Sum_probs=173.2 Q ss_pred hhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhH Q lcl|NC_019511. 29 IQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQV 108 (330) Q Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~I 108 (330) +.+ ++.+.+. +.++ .......+.+.+.... +.+ .+.....++ ..+++.|++||+.|+++| T Consensus 1 Mg~-f~~lf~r-----~~~~-~~~~~~~~~~~~~~~~---~~~-------~g~~v~~~~---al~~~~v~~~i~~Ia~~i 60 (414) T protein:vir:44 1 MVF-FSGLFQR-----KSDA-PVTTPAELADAIGLSY---DTY-------TGKQISSQR---AMRLTAVFSCVRVLAESV 60 (414) T ss_pred Cch-hhhhhcc-----CccC-cccchhhHhHhhccCc---ccc-------CCceechhh---hhccHHHHHHHHHHHHHh Confidence 222 1112111 1111 1111111111111110 010 111111121 223889999999999999 Q ss_pred hhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEe Q lcl|NC_019511. 109 STYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFS 188 (330) Q Consensus 109 a~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~ 188 (330) |+. .+.+.-++++.. -....|.+.+++..+||+++|+++|+++++.++|++|++|++++ T Consensus 61 a~~-----------p~~~~~~~~~~~--------~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gna~~~i~-- 119 (414) T protein:vir:44 61 GML-----------PCNLYHLNGSLK--------QRATGERLHKLISTHPNGYMTPQEFWELVVTCLCLRGNFYAYKV-- 119 (414) T ss_pred ccC-----------ceEEEEecCCce--------eecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEE-- Confidence 963 222222211110 01123445556667899999999999999999999999998875 Q ss_pred cCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecccCcCCCCCCCccccHHHHHHHHH Q lcl|NC_019511. 189 PKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKEF 268 (330) Q Consensus 189 rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I 268 (330) ++ .|+|++||||+|..|.+..+.+|. ..|.+...++....|.++||+|++.++.+ +.||+||++.|+++| T Consensus 120 ~~-~g~~~~L~~l~~~~v~~~~~~~~~-----~~y~~~~~~g~~~~~~~~evih~~~~~~d----~~~G~s~i~~~~~~i 189 (414) T protein:vir:44 120 KA-FGEVAELLPVDPGCVVPKLNSSWE-----PVYQVTFPDGSTDVLSQEDIWHVRTLTLD----GLVGLNPIAYAREAI 189 (414) T ss_pred eC-CCcEEEEEEEcCceEEEEECCCCc-----EEEEEEecCceEEEEccccEEEecCCCCC----CcccccHHHHHHHHH Confidence 34 489999999999999998877664 34766667778889999999999855433 246999999999999 Q ss_pred HHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 269 IAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 269 ~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) +++.++++|+++||+||++|+|+|.+++ .+++|+.+++++.|++.++|.+|+|+++||-+ T Consensus 190 ~~~~~~~~~~~~~f~ng~~p~gil~~~~--~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~ 249 (414) T protein:vir:44 190 SLAAATEEHGARLFSNGAVTSGVLRTEQ--TLSDQAYERLKKDFEERHTGLGNAHRPMILEM 249 (414) T ss_pred HHHHHHHHHHHHHHhccCCCceEEEeCC--CCCHHHHHHHHHHHHHHhcCccccCcceecCC Confidence 9999999999999999999999998775 58999999999999999999999999777655 No 26 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=100.00 E-value=3.8e-41 Score=242.21 Aligned_cols=266 Identities=14% Similarity=0.149 Sum_probs=169.4 Q ss_pred CchhHHHHHhcCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccc Q lcl|NC_019511. 1 MPDLFKSLRLGSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRN 80 (330) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~ 80 (330) -.-||=..+- +|+... .+- +..+... .++ .++.+.....+..+....+..+.....+++ T Consensus 6 ~~~~~~~~~~----~~~~~~-~~~--------~~~lf~~---~e~--R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--- 64 (441) T protein:vir:94 6 TDCYFVDFKS----RKQSRK-ELV--------VVGIFYK---NEK--RDLQYNEDDLQMMVQTLPGFQGTKLRQYKD--- 64 (441) T ss_pred Cccccccccc----cccchh-hhh--------ccccccc---ccc--ccccCCCcchHHHHHHhcccCcccccccch--- Confidence 1111111110 011000 000 0111100 000 001111111100000000000000001111 Q ss_pred hHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCC Q lcl|NC_019511. 81 AHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDID 160 (330) Q Consensus 81 ~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~ 160 (330) ...| +++.|++||+.++++||+. +.++.+++ + ....|.+..++...||+ T Consensus 65 ----~~al----~~~~V~~cv~~Ia~~iA~l--p~~~~~~~------------~---------~~~~~~~~~lL~~~PN~ 113 (441) T protein:vir:94 65 ----IEAI----RHSDIFTAVMMIASDLARM--PIRVTVNG------------Q---------INYSDRIVNLLNTRPNP 113 (441) T ss_pred ----hhhh----ccHHHHHHHHHHHHhhccC--ceeeecCc------------c---------ccccchHHHHHhcccCc Confidence 0122 3788999999999999963 22222111 0 01134455666678999 Q ss_pred cCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEE-eC---CceEEEec Q lcl|NC_019511. 161 RDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQV-ID---KQVVASFT 236 (330) Q Consensus 161 ~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~-~~---~~~~~~~~ 236 (330) ++|.++|+++++.++|++||+|++++ |++.|+|++|+||+|++|++..+.+|.. .|++. .+ ++....|. T Consensus 114 ~~t~~~f~~~~~~~lll~Gnay~~i~--r~~~G~~~~L~~i~~~~v~v~~d~~g~~-----~~~~~~~~~~~~~~~~~~~ 186 (441) T protein:vir:94 114 MYNGYIFKLVVFVSALLTSHGYIEIT--RDKTGEPMNLTFRKTSEIELKSDARGRL-----YYFHQRIDSNGNNIERNVK 186 (441) T ss_pred CCCHHHHHHHHHHHHhhcCCeEEEEE--ECCCCcEEEEEEEcCceeEEEECCCccE-----EEEEEEeccCCceeEEEEc Confidence 99999999999999999999998886 4567899999999999999998877754 23332 22 24456799 Q ss_pred hhHeeeecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHh Q lcl|NC_019511. 237 SRELVMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSF 316 (330) Q Consensus 237 ~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~ 316 (330) ++||||++.++.++ .||+|||+.++++|++++++++|+++||+||++|+|||.+++. ..++++++++|+.|++.+ T Consensus 187 ~~dvih~k~~~~dg----~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-~~~~e~~e~~r~~~~~~~ 261 (441) T protein:vir:94 187 FEDMLDIKFYSLDG----INGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGV-LDNKKARDRAREEFHKSF 261 (441) T ss_pred cccEEEeccCCCCC----ccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCC-CCCHHHHHHHHHHHHHHh Confidence 99999998655432 3699999999999999999999999999999999999998864 247899999999999999 Q ss_pred cCcccccccceeeC Q lcl|NC_019511. 317 SGINGSWQICLYIK 330 (330) Q Consensus 317 ~G~~na~kvpvL~e 330 (330) +|..|+|+++||-+ T Consensus 262 ~G~~nag~~~vl~~ 275 (441) T protein:vir:94 262 SGTKQAGKVVVLDE 275 (441) T ss_pred cCccccCcceecCC Confidence 99999999776655 No 27 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=100.00 E-value=3.8e-41 Score=242.21 Aligned_cols=266 Identities=14% Similarity=0.149 Sum_probs=169.4 Q ss_pred CchhHHHHHhcCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccc Q lcl|NC_019511. 1 MPDLFKSLRLGSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRN 80 (330) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~ 80 (330) -.-||=..+- +|+... .+- +..+... .++ .++.+.....+..+....+..+.....+++ T Consensus 6 ~~~~~~~~~~----~~~~~~-~~~--------~~~lf~~---~e~--R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--- 64 (441) T protein:vir:79 6 TDCYFVDFKS----RKQSRK-ELV--------VVGIFYK---NEK--RDLQYNEDDLQMMVQTLPGFQGTKLRQYKD--- 64 (441) T ss_pred Cccccccccc----cccchh-hhh--------ccccccc---ccc--ccccCCCcchHHHHHHhcccCcccccccch--- Confidence 1111111110 011000 000 0111100 000 001111111100000000000000001111 Q ss_pred hHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCC Q lcl|NC_019511. 81 AHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDID 160 (330) Q Consensus 81 ~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~ 160 (330) ...| +++.|++||+.++++||+. +.++.+++ + ....|.+..++...||+ T Consensus 65 ----~~al----~~~~V~~cv~~Ia~~iA~l--p~~~~~~~------------~---------~~~~~~~~~lL~~~PN~ 113 (441) T protein:vir:79 65 ----IEAI----RHSDIFTAVMMIASDLARM--PIRVTVNG------------Q---------INYSDRIVNLLNTRPNP 113 (441) T ss_pred ----hhhh----ccHHHHHHHHHHHHhhccC--ceeeecCc------------c---------ccccchHHHHHhcccCc Confidence 0122 3788999999999999963 22222111 0 01134455666678999 Q ss_pred cCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEE-eC---CceEEEec Q lcl|NC_019511. 161 RDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQV-ID---KQVVASFT 236 (330) Q Consensus 161 ~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~-~~---~~~~~~~~ 236 (330) ++|.++|+++++.++|++||+|++++ |++.|+|++|+||+|++|++..+.+|.. .|++. .+ ++....|. T Consensus 114 ~~t~~~f~~~~~~~lll~Gnay~~i~--r~~~G~~~~L~~i~~~~v~v~~d~~g~~-----~~~~~~~~~~~~~~~~~~~ 186 (441) T protein:vir:79 114 MYNGYIFKLVVFVSALLTSHGYIEIT--RDKTGEPMNLTFRKTSEIELKSDARGRL-----YYFHQRIDSNGNNIERNVK 186 (441) T ss_pred CCCHHHHHHHHHHHHhhcCCeEEEEE--ECCCCcEEEEEEEcCceeEEEECCCccE-----EEEEEEeccCCceeEEEEc Confidence 99999999999999999999998886 4567899999999999999998877754 23332 22 24456799 Q ss_pred hhHeeeecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHh Q lcl|NC_019511. 237 SRELVMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSF 316 (330) Q Consensus 237 ~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~ 316 (330) ++||||++.++.++ .||+|||+.++++|++++++++|+++||+||++|+|||.+++. ..++++++++|+.|++.+ T Consensus 187 ~~dvih~k~~~~dg----~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-~~~~e~~e~~r~~~~~~~ 261 (441) T protein:vir:79 187 FEDMLDIKFYSLDG----INGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGV-LDNKKARDRAREEFHKSF 261 (441) T ss_pred cccEEEeccCCCCC----ccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCC-CCCHHHHHHHHHHHHHHh Confidence 99999998655432 3699999999999999999999999999999999999998864 247899999999999999 Q ss_pred cCcccccccceeeC Q lcl|NC_019511. 317 SGINGSWQICLYIK 330 (330) Q Consensus 317 ~G~~na~kvpvL~e 330 (330) +|..|+|+++||-+ T Consensus 262 ~G~~nag~~~vl~~ 275 (441) T protein:vir:79 262 SGTKQAGKVVVLDE 275 (441) T ss_pred cCccccCcceecCC Confidence 99999999776655 No 28 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=100.00 E-value=1.4e-40 Score=239.03 Aligned_cols=248 Identities=13% Similarity=0.110 Sum_probs=175.3 Q ss_pred HHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhh Q lcl|NC_019511. 31 ANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQVST 110 (330) Q Consensus 31 ~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~ 110 (330) ..+.++++. +.+...... ....+.+.... +.| ++.....+ .+.++|.|++||+.++++||+ T Consensus 1 ~~f~~~f~r-----~~~~~~~~~-~~~~~~~~~~~---~~~-------~g~~v~~~---~~l~~~~v~~~i~~Ia~~iA~ 61 (413) T protein:vir:48 1 MFFSGLFQR-----KSDAPVTTP-AELAEAIGLSY---DTY-------TGKRISSQ---RAMRLTAVYSCVRVLAESVGM 61 (413) T ss_pred Cccchhhcc-----CccCCccch-HHHHHhhhcCc---ccc-------cCceechh---hhhccHHHHHHHHHHHHhhhh Confidence 334444333 111111111 01111111110 001 01000111 122489999999999999996 Q ss_pred hhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecC Q lcl|NC_019511. 111 YCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPK 190 (330) Q Consensus 111 ~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd 190 (330) . .+.+.-++.+.. .....|.+.+++...||+++|.++|+++++.++|+.|++|+++++ + T Consensus 62 ~-----------p~~~~~~~~~~~--------~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~--~ 120 (413) T protein:vir:48 62 L-----------PCSLYKISGTLK--------TRVVDERLHKLVSAKPNGYMTPQEFWELVIVCLCLRGNFYAYKVK--A 120 (413) T ss_pred C-----------ceEEEEecCCcc--------eeecccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCceEEEEEe--C Confidence 3 222221111100 011234455566678999999999999999999999999998764 3 Q ss_pred CCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecccCcCCCCCCCccccHHHHHHHHHHH Q lcl|NC_019511. 191 NKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKEFIA 270 (330) Q Consensus 191 ~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I~~ 270 (330) .|+|++||||+|.+|++..+.+|.. .|++...++...+|.++||+|++.++.++ .||+|||+.|..+|++ T Consensus 121 -~g~~~~L~~l~~~~v~~~~~~~~~~-----~y~~~~~~g~~~~~~~~evih~~~~~~d~----~~G~s~i~~~~~~i~~ 190 (413) T protein:vir:48 121 -LGEVVELLPIDPGCVEPKLNSQWQP-----VYQVTFPDGSVDVLTQDEIWHVRTLTLDG----LVGLNPIAYAREAISL 190 (413) T ss_pred -CCcEEEEEEEcCceEEEEEcCCceE-----EEEEEecCceEEEEccccEEEecCcCCCC----cccccHHHHHHHHHHH Confidence 4899999999999999988876643 47777777888889999999998765443 4799999999999999 Q ss_pred HHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 271 YNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 271 ~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) ++++++|+.++|+||++|+|+|.+++ .+++|+.+++++.|++.++|.+|+|+++||-+ T Consensus 191 ~~~~~~~~~~~~~ng~~p~gil~~~~--~~~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~ 248 (413) T protein:vir:48 191 AAATEEHGARLFGNGAVTSGVLRTEQ--KLTPDAYERLKKDFEERHTGLGNAHRPMILEM 248 (413) T ss_pred HHHHHHHHHHHHhccCCcceEEEeCC--CCCHHHHHHHHHHHHHHhcCccccCcceecCC Confidence 99999999999999999999999875 48999999999999999999999999766555 No 29 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=100.00 E-value=8.8e-41 Score=240.18 Aligned_cols=259 Identities=12% Similarity=0.087 Sum_probs=173.7 Q ss_pred CccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHH Q lcl|NC_019511. 23 VPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIII 102 (330) Q Consensus 23 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~ 102 (330) ++-+..+-. +..+..-+.. ..+.++.++.-.-..+.... .+++ ..+. .+.....+. +-++|.|++||+ T Consensus 1 ~~~~~~~~~-~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~---~~~~--~~s~-~g~~v~~~~---al~~~~V~~~i~ 68 (432) T protein:vir:10 1 MPDEKKLGL-LGQLKAMFVP--PDPVDIGGGQTFTPVNATAR---DLGI--IISD-TGAAVNADA---IMRLDAVAACVK 68 (432) T ss_pred CCCCcccch-hhhhHhhcCC--ccccccccccccccCcchhh---hhcc--cccc-cCcccchhh---hhcchHHHHHHH Confidence 333322222 2222222211 11122222221110010000 0000 0111 111111222 223799999999 Q ss_pred HHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCce Q lcl|NC_019511. 103 TRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVN 182 (330) Q Consensus 103 ~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~ 182 (330) +|+++||+. .|.+--++++. +.++..|.+.+++..+||+++|.++|++.++.++|++||+| T Consensus 69 ~Ia~~ia~l-----------p~~~y~~~~~g--------~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay 129 (432) T protein:vir:10 69 LVSQAIAAM-----------PLTMYMRTPDG--------RKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAY 129 (432) T ss_pred HHHHhhhhC-----------ceeEEEecCCC--------cccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeE Confidence 999999963 22221111110 11233466677777889999999999999999999999999 Q ss_pred eEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecccCcCCCCCCCccccHHH Q lcl|NC_019511. 183 FEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVE 262 (330) Q Consensus 183 ~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe 262 (330) +++++ + .|++.+||||+|.+|++..+.+|. ..|++...++....|+++||+|++.++.. +.||+|||+ T Consensus 130 ~~~~~--~-~g~~~~L~~l~~~~v~v~~~~~g~-----~~y~~~~~~g~~~~~~~~~iih~~~~~~d----g~~G~spi~ 197 (432) T protein:vir:10 130 VRKVV--T-DGRIESLQYLANDRLTITTDTKGN-----TAYRYRRTDGQMIDIPKQQIWKIMGYSLD----GENGLSAIR 197 (432) T ss_pred EEEEe--c-CCcEEEEEEEcCCceEEEEcCCCc-----EEEEEEecCceEEEEcCccEEEecCCCCC----CcccccHHH Confidence 88875 3 389999999999999999887775 35777666788889999999999765443 236999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 263 IAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 263 ~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) .|+++|++++++++|+++||+||++|+|||.+++ .+++|+++++++.| +|..|+|+++||-+ T Consensus 198 ~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~--~l~~e~~~~~~~~~----~~~~nag~~~vl~~ 259 (432) T protein:vir:10 198 YGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDR--FLTDDQYDSFAKKV----SGSVEAGRAPLLEG 259 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCC--CCCHHHHHHHHHHH----hhhhhCCCceecCC Confidence 9999999999999999999999999999998775 58999998887776 46789999777665 No 30 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=100.00 E-value=1.3e-40 Score=239.27 Aligned_cols=250 Identities=10% Similarity=0.062 Sum_probs=174.1 Q ss_pred hhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcC-CCcccchHHHHHHHHHHhhcHHHHHHHHHHHHh Q lcl|NC_019511. 29 IQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDK-KSYMRNAHNLHEVLKKFGNNSILNAIIITRANQ 107 (330) Q Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~-~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~ 107 (330) +.. ++.+...+. ... .......|..... +..+ .++ +...+++.|++||+.++++ T Consensus 1 MG~-~~~~~~~~~----~~~----~~~~~~~~~~~~~-----~g~~~~~~-----------~~al~~~~V~~~v~~Ia~~ 55 (411) T protein:vir:81 1 MGW-WSRLTRFFR----PRN----ETVDMTNPLLLQW-----LGVDPDTP-----------RNQLSEATYFACLKILSES 55 (411) T ss_pred Cch-HHHHHhhcc----Ccc----cccccchHHHHHH-----hcCcccCh-----------hhhhccHHHHHHHHHHHHh Confidence 222 111211111 000 1111112221111 1111 111 1122378899999999999 Q ss_pred HhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEE Q lcl|NC_019511. 108 VSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVF 187 (330) Q Consensus 108 Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~ 187 (330) ||+. .|.+--++++. ..+..+|.+.+++...||+++|+++|+++++.++|+.||+|+++++ T Consensus 56 iA~l-----------p~~~~~~~~~~--------~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r 116 (411) T protein:vir:81 56 LGKL-----------PLKMYQKTERG--------IVKSDREELYNLLKLRPNPYMTSSVFWSTVEMNRNHYGNAYVWCQY 116 (411) T ss_pred HhhC-----------ceeEEEecCCc--------eeeecccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEe Confidence 9963 22221111110 0111234455566678999999999999999999999999998864 Q ss_pred ecCCCcceEEEEeeCCCceEEeeCCCCcccCCce-eEEEEe-CCceEEEechhHeeeecccCcCCCCCCCccccHHHHHH Q lcl|NC_019511. 188 SPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGN-RFVQVI-DKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAM 265 (330) Q Consensus 188 ~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~-~Y~q~~-~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~ 265 (330) + .|++.+|||++|.+|++..++.|....... .|.+.. .++....|+++||+|++.++..+ +.||+||+.+|+ T Consensus 117 --~-~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~eiih~k~~~~~~---~~~G~s~~~~~~ 190 (411) T protein:vir:81 117 --S-GPQLQALWILPSQYVTIVVDDRGLLGEKNAIWYRYNDPYDGKMYVFRNDEILHFKTSVTFD---GITGLSVRDVLK 190 (411) T ss_pred --c-CCceEEEEEECCceEEEEEcCcccccccceEEEEEEecCCceEEEEccccEEEEcCCCCCC---CcccccHHHHHH Confidence 4 489999999999999999998875544333 343333 35667789999999998654332 346999999999 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 266 KEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 266 ~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) .+|++++++++|+.+||+||++|+|+|.+++ .+++++.+++++.|++.++|.+|+|+++||-+ T Consensus 191 ~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~--~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~ 253 (411) T protein:vir:81 191 HTVDGALESQKFMNNLYKTGLTGKAVLEYTG--DLNQEARDRLVKGFEQFANGSKNAGKIIPVPL 253 (411) T ss_pred HHHHHHHHHHHHHHHHHhccCCCceEEEeCC--CCCHHHHHHHHHHHHHHhcCccccCCceecCC Confidence 9999999999999999999999999998875 58999999999999999999999999877655 No 31 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=100.00 E-value=1e-40 Score=239.83 Aligned_cols=225 Identities=14% Similarity=0.158 Sum_probs=167.6 Q ss_pred HHHHhh-cHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCC-----CCCCc Q lcl|NC_019511. 88 LKKFGN-NSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTD-----KDIDR 161 (330) Q Consensus 88 Lr~~a~-~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~-----~pn~~ 161 (330) ||.+++ ||+|++||++++++||+ ++|.+..+...+.. ......++.+.+|+..+... .++.+ T Consensus 1 l~~l~~~n~~v~~ci~~ia~~ia~-----------~p~~i~~~~~~~~~-~~~~~~~~~~~~~l~~~~pn~~~~~~~~~~ 68 (467) T protein:vir:31 1 MAELLEHNETHAKCVHAKSRYVAG-----------FGINIIPHPEAEDP-DRDGEQYERVWDFWFGDDSNWQVGPMESER 68 (467) T ss_pred ChhhhhcCHHHHHHHHHHHHhhhc-----------CCeEEEEccCcccc-cchhhhhhhHHHHhhccCCCccccchhhHh Confidence 888886 79999999999999984 57776655433222 22334556666665544221 12235 Q ss_pred CCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccC--Cc-eeEEE------------- Q lcl|NC_019511. 162 DSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIK--GG-NRFVQ------------- 225 (330) Q Consensus 162 ~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~--~~-~~Y~q------------- 225 (330) +++.+||++++.|++++||+|+|+++ +..|+|++|+||+|.+|++..+..++... .. ..|.+ T Consensus 69 ~t~~~~~~~~~~~l~l~Gn~~i~~~r--~~~G~~~~l~~l~~~~v~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 146 (467) T protein:vir:31 69 ATATNVLQTAWTDYEAIGWLTIEILT--QTDGTPTGLAYVPGHTIRKRMDERGFVQLLEEKEKYFGVAGDRYQTNGNGDL 146 (467) T ss_pred hHHHHHHHHHHHHHHhcCCeEEEEEE--CCCCcEEEEEEeCCceeEeeeecceeEeecCCceeeEEeccccceeecccce Confidence 68889999999999999999999885 55689999999999999998776543211 11 11111 Q ss_pred --------EeCCceEEEechhHeeeecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCC Q lcl|NC_019511. 226 --------VIDKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRAD 297 (330) Q Consensus 226 --------~~~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~ 297 (330) ...++....|+++||||++.+.. ....||+||+.+|+.+|.++.++++|+++||+||++|+|+|.+++ T Consensus 147 ~~~~~~~~~~~~~~~~~~~~~diih~r~~~~---~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~- 222 (467) T protein:vir:31 147 DPVFVDADDGSTGTSVSNPANELIFKRNHSP---LYPHYGAPDIIPAVKTIRGDSAAQDYNIDFFENDGVPRIAIIVKG- 222 (467) T ss_pred eeeeeeeccccccceeEeccccEEEecCCCC---CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecC- Confidence 01234456789999999975422 224479999999999999999999999999999999999999865 Q ss_pred CCCCHHHHHHHHHHHHHHhc-----------CcccccccceeeC Q lcl|NC_019511. 298 QQQSQHALENFKREWKSSFS-----------GINGSWQICLYIK 330 (330) Q Consensus 298 ~~ls~e~~e~lr~~w~~~~~-----------G~~na~kvpvL~e 330 (330) ..+++++.++++++|++.++ |..|++++.+|.. T Consensus 223 ~~l~~e~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~~~~~~l~~ 266 (467) T protein:vir:31 223 AELTEKGREEMRNLIEDNNEDNHRTAFIETEKIVQNEDYLNLAD 266 (467) T ss_pred cCCCHHHHHHHHHHHHhhhcchhhhhhhhhcccccccccccccC Confidence 46899999999999998776 5668888665543 No 32 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=100.00 E-value=1.8e-40 Score=238.53 Aligned_cols=256 Identities=16% Similarity=0.168 Sum_probs=168.7 Q ss_pred HHHHHH-----HHHHhhcccchhc---------cccchhccccc-cccccCCCCCcCCCcccchHHHHHHHHHHhhcHHH Q lcl|NC_019511. 33 IRQIEQ-----DTKEMQEITKSLY---------GKQQAYAEPFL-EMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSIL 97 (330) Q Consensus 33 ~~~~~~-----~~~~~~~~~~~~~---------g~~~~~~~~~~-~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv 97 (330) .+|-.- ++++..+..+... .|...+..... ..+..-+.|.. ........+ ..-+++.| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~e~r~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~---~al~~~~V 73 (441) T protein:vir:98 1 MHWYNTDCYFVDFKSRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQG----TKLRQYKDI---EAIRHSDI 73 (441) T ss_pred CceecCccceeccccccchhhhhhccccccccccccccCCCcchHHHHHHhhcccc----cCccccchh---hhhccHHH Confidence 222222 1111111111000 00000000000 00000000000 000000000 11137889 Q ss_pred HHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHh Q lcl|NC_019511. 98 NAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYT 177 (330) Q Consensus 98 ~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~ 177 (330) ++||+.++++||+.. ..+.+++ . ....|.+.+++...||+++|.++|+++++.++|+ T Consensus 74 ~acv~~Ia~~iA~lp--l~~~~~~----------~-----------~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll 130 (441) T protein:vir:98 74 FTAVMMIASDLARMP--IRVTVNG----------Q-----------INYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALL 130 (441) T ss_pred HHHHHHHHHhhccCc--eEEecCC----------c-----------ccccchHHHHHhcccccCCCHHHHHHHHHHHHhh Confidence 999999999999632 2222111 0 0123445566667899999999999999999999 Q ss_pred cCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEe-C---CceEEEechhHeeeecccCcCCCCC Q lcl|NC_019511. 178 YDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVI-D---KQVVASFTSRELVMGIRNPRSDLNS 253 (330) Q Consensus 178 ~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~-~---~~~~~~~~~~dvih~~~n~~~d~~~ 253 (330) .||+|++++ |++.|+|++||||+|.+|++..+.+|.. .|++.. + .+....|+++||||++.++.++ T Consensus 131 ~Gnay~~i~--r~~~G~~~~L~~i~~~~v~v~~~~~g~~-----~~~~~~~~~~~~~~~~~~~~~dviHir~~~~dg--- 200 (441) T protein:vir:98 131 TSHGYIEIT--RDKTGEPMNLTFRKTSEIELKLDARGRL-----YYFHQRIDSNGNNIERNVKFEDMLDIKFYSLDG--- 200 (441) T ss_pred cCCeEEEEE--EcCCCcEEEEEEEcCceeEEEECCCCcE-----EEEEEEeccCcceeeEEEccccEEEeccCCCCC--- Confidence 999999887 4566899999999999999998887754 243322 2 2345679999999998655432 Q ss_pred CCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 254 SGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 254 ~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) .+|+|||+.++++|++++++++|+.+||.||++|+|||.+++. ..++++++++|+.|++.++|.+|+|+++||-+ T Consensus 201 -~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~~-~~~~e~~~~~~~~~~~~~~G~~nag~~~vl~~ 275 (441) T protein:vir:98 201 -INGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGV-LDNKKARDRAREEFHKSFSGTKQAGKVVVLDE 275 (441) T ss_pred -ccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCC-CCCHHHHHHHHHHHHHHhcCccccCcceecCC Confidence 3599999999999999999999999999999999999998864 23689999999999999999999999776655 No 33 >protein:vir:8100 Length: 466 # NCBI annotation: gp4 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817681;genbank:gi:29566112;genbank:GeneID:1259306 Probab=100.00 E-value=1.7e-40 Score=238.65 Aligned_cols=281 Identities=12% Similarity=0.066 Sum_probs=174.8 Q ss_pred chhHHHHHhcCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccch Q lcl|NC_019511. 2 PDLFKSLRLGSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNA 81 (330) Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~ 81 (330) --++.+|+ +..+..+.- . ++..+-.+..+.....|.......|-..... .+... -.++..+. T Consensus 1 M~~~~~l~--~~~~~~~~~---~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~g~~~-~~~~~~g~ 62 (466) T protein:vir:81 1 MRLIDRLL--STRGAAPRM---S-----------IDDYAQMLNEFAFNGIGYGFGGGVPRIQQTL-AGPST-ELAPDTFV 62 (466) T ss_pred CchhHHHh--hccCccccc---c-----------hhhhhhhhhhhhccccccccccccHHHHHhh-ccccc-cccCcccc Confidence 11222221 221111000 0 0000000001011111222222222111100 00000 01111111 Q ss_pred HHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCc Q lcl|NC_019511. 82 HNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDR 161 (330) Q Consensus 82 ~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~ 161 (330) .. +-+.+.++|.|++||+.|+++||+.. |.+.-++... +.+...|.++. +..+||++ T Consensus 63 ~v---~~~~a~~~~~v~~~i~~Ia~~ia~lp-----------~~~~~~~~~~--------~~~~~~~~~~~-L~~~PN~~ 119 (466) T protein:vir:81 63 GL---ATQAYQANGPVFACMLVRQLVFSSVR-----------FRWQRLRDGK--------PSDTFGSRDLQ-ILETPWKG 119 (466) T ss_pred cc---chhhhhccHHHHHHHHHHHHhhccCc-----------eEEEEecCCc--------eeeccccHHHH-HhhCCCCC Confidence 11 12345568999999999999999642 2221111000 00111222333 34579999 Q ss_pred CCHHHHHHHHHHHHHhcCCceeEEEEecCC------CcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCC----ce Q lcl|NC_019511. 162 DSFQEFCKKIVRDTYTYDQVNFEKVFSPKN------KTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDK----QV 231 (330) Q Consensus 162 ~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~------~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~----~~ 231 (330) +|.++|+++++.++|++||+|++++++..+ .|.+++|+||+|.+|.+..+.+|.. ...|.|...+ +. T Consensus 120 ~t~~~f~~~l~~~lll~Gnay~~i~r~~~g~l~~~~~g~~~~l~~l~~~~v~~~~~~~~~~---~~~y~~~~~~~~~~~~ 196 (466) T protein:vir:81 120 GTTQDMLSRMIQDADLAGNSYWTIVDGEFVRMRPDWVDVVVEERMVRGGRGELGGGQLGWR---KVGYLYTEGGRQSGNE 196 (466) T ss_pred CCHHHHHHHHHHHHHhcCCeEEEEEecCccccccccCcceeEEEEecCcceEEEEcCCCce---EEEEEEEecCcccccc Confidence 999999999999999999999999876533 4779999999999999998877643 2346665543 34 Q ss_pred EEEechhHeeeecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHH Q lcl|NC_019511. 232 VASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKRE 311 (330) Q Consensus 232 ~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~ 311 (330) ..+|+++||||++.++-+. .+.||+|||.+|+++|++++++++|+++||+||++|+|||.+++ .+++|+++++++. T Consensus 197 ~~~~~~~dviHir~~~~~~--d~~~G~s~i~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~--~l~~e~~~~~~~~ 272 (466) T protein:vir:81 197 SVGFLAEDVVHFAPIPDPL--ASYRGMSWLTPILREIRADQAMSKHQAKFFDNGATVNLVIKHNP--MADPAAVKKWADE 272 (466) T ss_pred eeeeccccEEEEcCCCCcc--cccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCC--CCCHHHHHHHHHH Confidence 5679999999998643111 13369999999999999999999999999999999999998765 5899999999999 Q ss_pred HHHHhcCcccccccceeeC Q lcl|NC_019511. 312 WKSSFSGINGSWQICLYIK 330 (330) Q Consensus 312 w~~~~~G~~na~kvpvL~e 330 (330) |++.++|.+|+|+++||.+ T Consensus 273 ~~~~~~g~~n~g~~~vl~~ 291 (466) T protein:vir:81 273 VNSKHAGVDNAWKNLNLYP 291 (466) T ss_pred HHHHhcCccccccceEcCC Confidence 9999999999999776665 No 34 >protein:vir:1884 Length: 424 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037664;genbank:gi:9634122;genbank:GeneID:1262519 Probab=100.00 E-value=2.1e-40 Score=238.16 Aligned_cols=253 Identities=11% Similarity=0.049 Sum_probs=169.4 Q ss_pred hhcccchhc--cccchh---ccccccccccCC---CCCcCCC---cccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhh Q lcl|NC_019511. 43 MQEITKSLY--GKQQAY---AEPFLEMMDTNP---DYRDKKS---YMRNAHNLHEVLKKFGNNSILNAIIITRANQVSTY 111 (330) Q Consensus 43 ~~~~~~~~~--g~~~~~---~~~~~~~~~~~p---~~~~~~s---~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~ 111 (330) +++++-.++ ++.--+ ..-|...--..| -...+.+ ..++..... +.+.+++.|++||+.++++||+. T Consensus 1 ~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~---~~al~~~~v~~cv~~Ia~~iA~l 77 (424) T protein:vir:18 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSIND---ERILQISTVWRCVSLISTLTACL 77 (424) T ss_pred CCCCcceEeecCCCchHHHHHhhhcccccccccccccccccccccccccccccH---HHhhccHHHHHHHHHHHHhhccC Confidence 322221111 110000 000000000000 0000111 011111111 23334889999999999999963 Q ss_pred hhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCC Q lcl|NC_019511. 112 CKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKN 191 (330) Q Consensus 112 ~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~ 191 (330) .|.+--.+++.. . +.....|.+.+++...||+.+|.++|++.++.++|+.||+|++++ |+. T Consensus 78 -----------p~~~~~~~~~~~--~----~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~--r~~ 138 (424) T protein:vir:18 78 -----------PLDVFETDQNDN--R----KKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVD--RNS 138 (424) T ss_pred -----------ceEEEEeecCCc--e----eeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEE--ECC Confidence 222211111100 0 001124566777778899999999999999999999999999986 556 Q ss_pred CcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecccCcCCCCCCCccccHHHHHHHHHHHH Q lcl|NC_019511. 192 KTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKEFIAY 271 (330) Q Consensus 192 ~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~ 271 (330) .|+|++|||++|.+|++..+. +...|.+..+ +....|.++||||++..+.+ +.+|+|||++|+++|+++ T Consensus 139 ~G~~~~L~pl~~~~V~v~~~~------~~~~y~~~~~-g~~~~~~~~eIih~r~~~~d----g~~G~spi~~~~~~i~~~ 207 (424) T protein:vir:18 139 AGDVISLLPLQSANMDVKLVG------KKVVYRYQRD-SEYADFSQKEIFHLKGFGFT----GLVGLSPIAFACKSAGVA 207 (424) T ss_pred CCcEEEEEEecCcceEEEEcC------CeEEEEEEeC-CeEEEeccccEEEecCcCCC----CcccccHHHHHHHHHHHH Confidence 689999999999999986542 2345666654 55678999999999754332 236999999999999999 Q ss_pred HHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 272 NNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 272 laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) +++++|+++||+||++|+|||.++.+ .+++++++++++.|++.++| .|+|+++||-+ T Consensus 208 ~a~~~~~~~~f~ng~~p~gil~~~~~-~l~~e~~~~~~~~~~~~~~g-~nag~~~vl~~ 264 (424) T protein:vir:18 208 VAMEDQQRDFFANGAKSPQILSTGEK-VLTEQQRSQVEENFKEIAGG-PVKKRLWILEA 264 (424) T ss_pred HHHHHHHHHHHHccCCcceEEEeCCc-CCCHHHHHHHHHHHHHHhCC-cccCCceeccC Confidence 99999999999999999999998753 58999999999999988766 68999888776 No 35 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=100.00 E-value=2.8e-40 Score=237.38 Aligned_cols=259 Identities=12% Similarity=0.114 Sum_probs=166.1 Q ss_pred chhHHHHHhcCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccch Q lcl|NC_019511. 2 PDLFKSLRLGSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNA 81 (330) Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~ 81 (330) --+|.+|.+....+. -.+..|+.+.-..|..........-..++++ T Consensus 1 Mg~~~~l~~~~~~~~------------------------------~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~---- 46 (457) T protein:vir:62 1 MGFWSALFGRGHSPA------------------------------LDAAEGRAWEPYDPSIYNLGATASSGERVTP---- 46 (457) T ss_pred Cchhhhhhccccccc------------------------------cccccccccccchhhhhhccccccCCceech---- Confidence 122222211100000 0001111111111110000000000112221 Q ss_pred HHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCc Q lcl|NC_019511. 82 HNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDR 161 (330) Q Consensus 82 ~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~ 161 (330) ...|+ ++.|++||++++++||+. .+.+.-++.+. .+.+.+.....+...||++ T Consensus 47 ---~~al~----~~~v~~~i~~ia~~iA~l-----------p~~~~~~~~~~---------~~~~~~~~~~~ll~~pn~~ 99 (457) T protein:vir:62 47 ---HDALQ----VSAVFASVRLLSETIATL-----------PLSTYSKRGGT---------RKEIDTPEWLDFPNAEPGG 99 (457) T ss_pred ---HHhhc----cHHHHHHHHHHHHhHhhC-----------ceEEEEecCCc---------cccccchHHHHhccccCCC Confidence 12233 789999999999999964 22221111100 0112233333445678889 Q ss_pred CCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEe--CCc--eEEEech Q lcl|NC_019511. 162 DSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVI--DKQ--VVASFTS 237 (330) Q Consensus 162 ~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~--~~~--~~~~~~~ 237 (330) +|+++|+++++.++++.||+|+++. ++ .|++.+||||+|.+|++..+..+... ....|.|.. ++. ....|++ T Consensus 100 ~t~~~f~~~~~~~l~l~Gna~~~i~--~~-~g~~~~l~~l~p~~v~v~~~~~~~~~-~~~~~~y~~~~~g~~~~~~~~~~ 175 (457) T protein:vir:62 100 MGRIDILSQTVLSLLLQGNAFLAVR--WA-GPNIAGLDVLDPTKIHVHMVMVDGLR-RKVFEAYDIDADGNEVLLGWFTP 175 (457) T ss_pred CCHHHHHHHHHHHHhhcCCeEEEEE--eC-CCcEEEEEEEcCcceEEEEeccCCcc-ceeEEEEEEccCCceeEEEeeCc Confidence 9999999999999999999998874 33 58999999999999998766543221 112233332 232 2356899 Q ss_pred hHeeeecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhc Q lcl|NC_019511. 238 RELVMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFS 317 (330) Q Consensus 238 ~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~ 317 (330) +||||++.+...+ ..||+||+++++++|++++++++|+++||+||++|+|||.+++ .+++|+++++++.|++.++ T Consensus 176 ~eiih~r~~~~~~---~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~--~ls~e~~~~~~~~~~~~~~ 250 (457) T protein:vir:62 176 RDVLHIPGMMLPG---DFVGCSPISYARESIGLALAAQKYGAHFFRNGAMPGAVVEVPG--TMSEEGLARAREAWRAANS 250 (457) T ss_pred cceEEecCCCCCC---ceecccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEcCC--CCCHHHHHHHHHHHHHHhc Confidence 9999998654332 2369999999999999999999999999999999999999876 5899999999999999999 Q ss_pred CcccccccceeeC Q lcl|NC_019511. 318 GINGSWQICLYIK 330 (330) Q Consensus 318 G~~na~kvpvL~e 330 (330) |.+|+|+++||-+ T Consensus 251 G~~nag~~~vl~~ 263 (457) T protein:vir:62 251 GVDNAHRVALLTE 263 (457) T ss_pred CccccCcceecCC Confidence 9999999888766 No 36 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=100.00 E-value=5.1e-40 Score=235.98 Aligned_cols=245 Identities=14% Similarity=0.163 Sum_probs=168.9 Q ss_pred hhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCc-CCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHh Q lcl|NC_019511. 29 IQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRD-KKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQ 107 (330) Q Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~-~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~ 107 (330) +-+ + .. .+ +....+.......+ +...|.|.- ...... ....|+ ++.|++||+.++++ T Consensus 1 Mg~-f----~~---~~--~r~~~~~~~~~~~~----~~~~~~~~~~~~~~~~----~~~al~----~~~v~~cv~~Ia~~ 58 (416) T protein:vir:81 1 MGI-F----YK---NE--KRDLQYNEDDLQMM----VQTLPGFQGTKLRQYK----DIEAIR----HSDIFTAVMMIASD 58 (416) T ss_pred CCc-c----cc---cc--cccccCCCcchhHH----HHHhccccccCccccc----hhhhhc----chHHHHHHHHHHHh Confidence 111 1 00 01 11111111111111 111222211 111000 011233 67899999999999 Q ss_pred HhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEE Q lcl|NC_019511. 108 VSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVF 187 (330) Q Consensus 108 Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~ 187 (330) ||+. .|.+. ++. . + ..+|.+.+++...||+.+|.++|+++++.++|+.||+|++++ T Consensus 59 iA~~-----------p~~~~-~~~-~-~---------~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~- 114 (416) T protein:vir:81 59 LARM-----------PIRVT-VNG-Q-I---------NYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEIT- 114 (416) T ss_pred hccC-----------ceEEe-cCc-c-c---------cccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEE- Confidence 9963 22221 111 0 0 113445556667899999999999999999999999999986 Q ss_pred ecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEE-eC---CceEEEechhHeeeecccCcCCCCCCCccccHHHH Q lcl|NC_019511. 188 SPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQV-ID---KQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEI 263 (330) Q Consensus 188 ~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~-~~---~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~ 263 (330) |++.|+|++||||+|++|++..+.+|.. .|++. ++ ++....|.++||||++.++..+ .||+|||+. T Consensus 115 -r~~~G~~~~L~~i~~~~v~v~~~~~g~~-----~~~~~~~~~~~~~~~~~~~~~evihir~~~~d~----~~G~s~i~~ 184 (416) T protein:vir:81 115 -RDKTGEPMNLTFRKTSEIELKSDARGRL-----YYFHQRIDSNGNNIERNVKFEDMLDIKFYSLDG----INGLSLLDT 184 (416) T ss_pred -ECCCCcEEEEEEEcCceeEEEECCCccE-----EEEEEEecCCCceeEEEEccccEEEeccCCCCC----ccccCHHHH Confidence 4566899999999999999998887754 23332 22 2344679999999998665432 369999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 264 AMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 264 a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) |+++|++++++++|+.+||+||++|+|||.+++. ..++++.+++++.|++.++|..|+|+++||-+ T Consensus 185 ~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~-~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~~ 250 (416) T protein:vir:81 185 LSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGV-LDNKKARDRAREEFHKSFSGTKQAGKVVVLDE 250 (416) T ss_pred HHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCC-CCCHHHHHHHHHHHHHHhcCccccCceeecCC Confidence 9999999999999999999999999999999864 34788999999999999999999999766655 No 37 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=100.00 E-value=5.1e-40 Score=235.98 Aligned_cols=245 Identities=14% Similarity=0.163 Sum_probs=168.9 Q ss_pred hhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCc-CCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHh Q lcl|NC_019511. 29 IQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRD-KKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQ 107 (330) Q Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~-~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~ 107 (330) +-+ + .. .+ +....+.......+ +...|.|.- ...... ....|+ ++.|++||+.++++ T Consensus 1 Mg~-f----~~---~~--~r~~~~~~~~~~~~----~~~~~~~~~~~~~~~~----~~~al~----~~~v~~cv~~Ia~~ 58 (416) T protein:vir:45 1 MGI-F----YK---NE--KRDLQYNEDDLQMM----VQTLPGFQGTKLRQYK----DIEAIR----HSDIFTAVMMIASD 58 (416) T ss_pred CCc-c----cc---cc--cccccCCCcchhHH----HHHhccccccCccccc----hhhhhc----chHHHHHHHHHHHh Confidence 111 1 00 01 11111111111111 111222211 111000 011233 67899999999999 Q ss_pred HhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEE Q lcl|NC_019511. 108 VSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVF 187 (330) Q Consensus 108 Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~ 187 (330) ||+. .|.+. ++. . + ..+|.+.+++...||+.+|.++|+++++.++|+.||+|++++ T Consensus 59 iA~~-----------p~~~~-~~~-~-~---------~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~- 114 (416) T protein:vir:45 59 LARM-----------PIRVT-VNG-Q-I---------NYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEIT- 114 (416) T ss_pred hccC-----------ceEEe-cCc-c-c---------cccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEE- Confidence 9963 22221 111 0 0 113445556667899999999999999999999999999986 Q ss_pred ecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEE-eC---CceEEEechhHeeeecccCcCCCCCCCccccHHHH Q lcl|NC_019511. 188 SPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQV-ID---KQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEI 263 (330) Q Consensus 188 ~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~-~~---~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~ 263 (330) |++.|+|++||||+|++|++..+.+|.. .|++. ++ ++....|.++||||++.++..+ .||+|||+. T Consensus 115 -r~~~G~~~~L~~i~~~~v~v~~~~~g~~-----~~~~~~~~~~~~~~~~~~~~~evihir~~~~d~----~~G~s~i~~ 184 (416) T protein:vir:45 115 -RDKTGEPMNLTFRKTSEIELKSDARGRL-----YYFHQRIDSNGNNIERNVKFEDMLDIKFYSLDG----INGLSLLDT 184 (416) T ss_pred -ECCCCcEEEEEEEcCceeEEEECCCccE-----EEEEEEecCCCceeEEEEccccEEEeccCCCCC----ccccCHHHH Confidence 4566899999999999999998887754 23332 22 2344679999999998665432 369999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 264 AMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 264 a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) |+++|++++++++|+.+||+||++|+|||.+++. ..++++.+++++.|++.++|..|+|+++||-+ T Consensus 185 ~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~-~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~~ 250 (416) T protein:vir:45 185 LSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGV-LDNKKARDRAREEFHKSFSGTKQAGKVVVLDE 250 (416) T ss_pred HHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCC-CCCHHHHHHHHHHHHHHhcCccccCceeecCC Confidence 9999999999999999999999999999999864 34788999999999999999999999766655 No 38 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=100.00 E-value=7.3e-40 Score=235.15 Aligned_cols=282 Identities=11% Similarity=0.056 Sum_probs=177.7 Q ss_pred CchhHHHHHhcCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccc Q lcl|NC_019511. 1 MPDLFKSLRLGSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRN 80 (330) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~ 80 (330) |++++.++-+.+.. .+. .+.+.+.+. ..|.+. +.+..+ T Consensus 1 ~~~~~~~~~~~~~~-----------------------------------~~~---~~~~~~~~~--~g~~~~--~~~~~~ 38 (460) T protein:vir:10 1 MANRIIRALRELTG-----------------------------------LDN---KFNDAFIKY--IGQTFT--KYDNNG 38 (460) T ss_pred CchhHHHHHhhhhc-----------------------------------cCC---CchHHHHHh--hccccC--CCccch Confidence 44444444211100 000 000011100 001110 111111 Q ss_pred hHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCC--Cc-------ccChhhHHHHHHHHHHHH Q lcl|NC_019511. 81 AHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDL--DA-------TPGIKEKEQMKRIEEFIL 151 (330) Q Consensus 81 ~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~--~~-------~~~~~~~~~~~~i~~~l~ 151 (330) ..... ..+.++|.|++||++++++||...........+-+.+...+.. .+ ...++... .....+-+. T Consensus 39 ~~~~~---~~a~~~~~v~~~v~~ia~~iA~lp~~v~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 114 (460) T protein:vir:10 39 KTYLE---QGYNINPDVYSCISQMAAKTVAVPYTIKVVKDTKAYQQLNNLNISTKGLYSFTQSLQKNRLD-TKAFSETEK 114 (460) T ss_pred hhhhH---HHHhcchHHHHHHHHHHHhhhhCceEEEeccCCccchhhhhhhhhhhhhHHHHHHhhcchhh-hcccchhHH Confidence 11111 1244589999999999999996433222111111111000000 00 00000000 001111112 Q ss_pred hccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecC--CCcceEEEEeeCCCceEEeeCCCCccc--CCceeEEEEe Q lcl|NC_019511. 152 NTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPK--NKTKMEKFIAVDPSTIFYATDKNGKII--KGGNRFVQVI 227 (330) Q Consensus 152 ~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd--~~G~~~~L~pldp~tV~~~~d~~G~~~--~~~~~Y~q~~ 227 (330) ..+...||+++|.++|+++++.++|++|++|++++++.+ ..|+|.+||||+|.+|++..+++|... ..+..++.+. T Consensus 115 ~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~~~~~G~~~~L~~l~~~~v~v~~~~~~~~~~~~~~~~~~~~~ 194 (460) T protein:vir:10 115 AFPLESPNPTQTWADIYSLYKTYMRLNGNCYFYLMSPDDGINAGVPSQMYVLPAHLIKIVLKDDINLLSTDSPIKSYMLI 194 (460) T ss_pred HHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCCccCceeEEEEEEcCceEEEEEcCCCceeeeeeeeeEEEEe Confidence 223457999999999999999999999999999987654 458999999999999999988877432 2223344455 Q ss_pred CCceEEEechhHeeeecccCcC-CCCCCC-ccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHH Q lcl|NC_019511. 228 DKQVVASFTSRELVMGIRNPRS-DLNSSG-YGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHAL 305 (330) Q Consensus 228 ~~~~~~~~~~~dvih~~~n~~~-d~~~~~-yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~ 305 (330) .++....|+++||||++.++.. +...++ ||+||+++++++|++++++++|+++||+||+.|+|++..+ ..++++++ T Consensus 195 ~~g~~~~~~~~evih~r~~~~~~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~~~~i~~~~--~~l~~e~~ 272 (460) T protein:vir:10 195 QGDQFIEFNEDEVIHTKYANPNFDLQGSHLYGMSPIRAILRNINSQNSTIDNNVKTMQNGGVFGFIHGGS--TGLTQPQA 272 (460) T ss_pred cCceeEEecccceEEEecCCCCcccccCccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceeeecC--CCCCHHHH Confidence 6788889999999999864322 222233 6999999999999999999999999999999999998754 46999999 Q ss_pred HHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 306 ENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 306 e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) +++++.|++.++|.+|+|+++||-+ T Consensus 273 ~~~~~~~~~~~~g~~n~g~~~vl~~ 297 (460) T protein:vir:10 273 DSLKQRLTEMDKSPDRLSQIAGASG 297 (460) T ss_pred HHHHHHHHHHhcCccccCCceecCC Confidence 9999999999999999999766655 No 39 >protein:vir:189 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037699;genbank:gi:9634156;genbank:GeneID:1262529 Probab=100.00 E-value=5.4e-40 Score=235.85 Aligned_cols=264 Identities=11% Similarity=0.019 Sum_probs=171.9 Q ss_pred CCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHHH Q lcl|NC_019511. 12 SMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKF 91 (330) Q Consensus 12 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~ 91 (330) -..|+-+.+ +.. .+.+-..+++.-+.........-....|+. ...++ ++..... +.. T Consensus 1 ~~~~~~~~~--------~~~-~~g~~~~~~~~f~~~~~~~~~~~~~~~~~~----~~~~~-------~~~~v~~---~~a 57 (424) T protein:vir:18 1 MEEPKYTID--------LRT-NNGWWARLKSWFVGGRLVTPNQGSQTGPVS----AHGYL-------GDSSIND---ERI 57 (424) T ss_pred CCCCccccc--------cCC-CCchHHHHHhhccccccccccchhhccccc----ccccc-------ccccccH---HHh Confidence 001111111 111 111222222221111111111111111211 11111 1111111 223 Q ss_pred hhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHH Q lcl|NC_019511. 92 GNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKI 171 (330) Q Consensus 92 a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~ 171 (330) .+++.|++||+.|+++||+. .+.+--.+++.. . +.....|.+..++...||+.+|.++|++++ T Consensus 58 l~~~~v~~cv~~Ia~~iA~l-----------p~~vy~~~~~~~--~----~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~ 120 (424) T protein:vir:18 58 LQISTVWRCVSLISTLTACL-----------PLDVFETDQNDN--R----KKVDLSNPLARLLRYSPNQYMTAQEFREAM 120 (424) T ss_pred hccHHHHHHHHHHHHhhccC-----------ceEEEEeccCCc--e----eeeccccHHHHHHhhccCCCCCHHHHHHHH Confidence 34889999999999999963 222211111100 0 001124556677778899999999999999 Q ss_pred HHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecccCcCCC Q lcl|NC_019511. 172 VRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSDL 251 (330) Q Consensus 172 v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~ 251 (330) +.++|+.|++|++++ |+..|++++|||++|.+|++..+. +.+.|.+..+ +....|.++||+|++....+ T Consensus 121 ~~~lll~Gnay~~i~--r~~~G~~~~L~~l~~~~v~v~~~~------~~~~y~~~~~-g~~~~~~~~eVihir~~~~d-- 189 (424) T protein:vir:18 121 TMQLCFYGNAYALVD--RNSAGDVISLLPLQSANMDVKLVG------KKVVYRYQRD-SEYADFSQKEIFHLKGFGFT-- 189 (424) T ss_pred HHHHhhcCCeEEEEE--ECCCCcEEEEEEecCcceEEEEcC------CeEEEEEEeC-CeEEEeccccEEEecCcCCC-- Confidence 999999999999986 456789999999999999986542 2345666554 55678999999999754322 Q ss_pred CCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 252 NSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 252 ~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) +.+|+|||++|+++|++++++++|+.+||+||++|+|+|.++.. .+++++++++++.|++.++| .|+|+++||.+ T Consensus 190 --g~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~-~l~~e~~~~~~~~~~~~~~~-~nag~~~vl~~ 264 (424) T protein:vir:18 190 --GLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEK-VLTEQQRSQVEENFKEIAGG-PVKKRLWILEA 264 (424) T ss_pred --CcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCc-CCCHHHHHHHHHHHHHHhCC-cccCCceeccC Confidence 23699999999999999999999999999999999999998753 48999999999999987765 68999888776 No 40 >protein:vir:960 Length: 413 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076614;genbank:gi:13095722;genbank:GeneID:920279 Probab=100.00 E-value=4.5e-40 Score=236.30 Aligned_cols=260 Identities=10% Similarity=0.063 Sum_probs=173.4 Q ss_pred CccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccc--cccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHH Q lcl|NC_019511. 23 VPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEM--MDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAI 100 (330) Q Consensus 23 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~--~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~ 100 (330) +|. ++.-.+...=.+++..+.+. -..+......... ....|.|.. .+. .....++.+++.|++| T Consensus 1 ~~~---~~~~~~~~~m~~F~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~---~~~-----~~~~~~~~~~~~v~~c 66 (413) T protein:vir:96 1 MPG---VSEIRKDKNLKFFNNKRSPT---EESKAKDEIPKAPQVVMTLPNFFK---ELI-----SDGYTKLSDSPEVRMA 66 (413) T ss_pred CCc---cchhhhhhcCCccccCCCcc---hhhhhhccccccccccccchhhHh---hhc-----cchhHHHhhchHHHHH Confidence 111 00000000001111111000 0011111111110 001111110 011 1123345568999999 Q ss_pred HHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCC Q lcl|NC_019511. 101 IITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQ 180 (330) Q Consensus 101 I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~ 180 (330) |+.++++||+ +.|.+--++.+.+ ++.+|.+..++...||+++|+++|+++++.++|+.|+ T Consensus 67 I~~ia~~ia~-----------~~~~~~~~~~~~~---------~~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn 126 (413) T protein:vir:96 67 VDCIADLVSN-----------MTIQLMQNGETGD---------KRIKNDLSRVVDIEPNKYLSRKTFIQWLVRSMLLEGN 126 (413) T ss_pred HHHHHHhhcc-----------CceEEEEecCCCc---------cccccHHHHHHHhccccCCCHHHHHHHHHHHHhhcCC Confidence 9999999995 3444322222111 1224556666777899999999999999999999999 Q ss_pred ceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecccCcCCCCCCCccccH Q lcl|NC_019511. 181 VNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSDLNSSGYGLSE 260 (330) Q Consensus 181 g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSP 260 (330) +|++++++.+ ++++.+|||++|.+|++..+.. .+.|.+..+++ +++++||+|++.++.++ ...+|+|| T Consensus 127 ~~~~i~r~~~-g~~~~~L~~l~~~~v~~~~~~~------~~~y~~~~~~~---~~~~~evih~k~~~~~~--~~~~G~s~ 194 (413) T protein:vir:96 127 GNAVVKPQVS-GDKIIGLTPISPYKVTFNVSDD------DLDYSITFDNK---EYDPSTLLHFVLNPSIE--RPFIGTGY 194 (413) T ss_pred eEEEEEEcCC-CCceEEEEEecCceeEEEEcCC------eEEEEEeecCc---EEchhhEEEEeccCCCC--CccccccH Confidence 9999876433 2578899999999999876532 34677766664 57899999998765332 12259999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 261 VEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 261 Ie~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) +++++.+|++++++++|+.+||.||++|+|+|.+++ .+++++.++++++|++.++|..|+|+++||.+ T Consensus 195 ~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~--~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~ 262 (413) T protein:vir:96 195 KVALKDIVGNLKQASVTKKGFMASEYMPNLIVSVDS--DSDELSDEEGRENFEEMYLKRKEAGKPWIIPE 262 (413) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCC--CCCHHHHHHHHHHHHHHhcCccccCceeeecC Confidence 999999999999999999999999999999999875 48999999999999999999999999888876 No 41 >protein:vir:95378 Length: 406 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764474;genbank:gi:115334628;genbank:GeneID:5179265 Probab=100.00 E-value=7.3e-40 Score=235.16 Aligned_cols=246 Identities=12% Similarity=0.073 Sum_probs=173.3 Q ss_pred hhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhH Q lcl|NC_019511. 29 IQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQV 108 (330) Q Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~I 108 (330) +.+ ...+....+ ++.......+..-+.. ..... .++ ... .++.++|.|++||+.++++| T Consensus 1 Mg~-----f~~~~~~~~--~~~~~~~~~~~~~~~~----~~~~~-~~~------~~~---~~~~~~~~v~~~i~~ia~~i 59 (406) T protein:vir:95 1 MGL-----FDRWRRTKR--KSKIRADTGYVGLFMS----GEDVS-FLV------PGY---VRLSDNPEVRMAVHKIADLI 59 (406) T ss_pred Ccc-----hhhhccccc--cccccccchhhhhhcc----CcccC-ccc------cCH---HHHhhcHHHHHHHHHHHHhh Confidence 222 112221111 1111111111111111 11111 111 111 12335899999999999999 Q ss_pred hhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEe Q lcl|NC_019511. 109 STYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFS 188 (330) Q Consensus 109 a~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~ 188 (330) |+. .|.+.-++++.. ....+.+.+.+...||+.+|+++|+++++.++|+.|+|++|++.. T Consensus 60 a~~-----------~~~~~~~~~~~~---------~~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~~ll~g~g~a~~~~~ 119 (406) T protein:vir:95 60 SSM-----------TIYLMQNTEDGD---------IRIRNELSRKIDITPYSLMTRKSWMYNIVYTMLLDGEGNSVVFPK 119 (406) T ss_pred ccC-----------ceEEEEecCCcc---------eeecchHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCceEEEEEE Confidence 953 333321111100 011234445566789999999999999999999999999999889 Q ss_pred cCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecccCcCCCCCCCccccHHHHHHHHH Q lcl|NC_019511. 189 PKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKEF 268 (330) Q Consensus 189 rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I 268 (330) ++..|+|++||||+|.+|++..+.+| |.+..++ ..|+++||+|+++++.+. ...||+||+++|..+| T Consensus 120 ~~~~g~~~~l~~i~~~~v~~~~~~~~--------~~~~~~~---~~~~~~evih~~~~~~~~--~~~~G~s~i~~~~~~i 186 (406) T protein:vir:95 120 YTADGLIDELVPLTPSKVNFLDTPDG--------YQVLYGG---QTFNYDEVLHFIYNPDPE--RPYIGRGYRVVLKDIA 186 (406) T ss_pred ECCCCcEEEEEEEcCceeEEEEcCCe--------EEEEecc---EEEchhHEEEeeccCCCC--CCccccCHHHHHHHHH Confidence 99999999999999999999888765 3333333 358999999998765442 1235999999999999 Q ss_pred HHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 269 IAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 269 ~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) .+++++++|++++|.||++|+|+|.+++ .+++++.++++++|.+.++|..|+|+++||.+ T Consensus 187 ~~~~~~~~~~~~~~~ng~~~~~il~~~~--~l~~e~~~~~~~~~~~~~~g~~n~~~~~v~~~ 246 (406) T protein:vir:95 187 DNLKQATATKKSFMSGKYMPSLIVKVDA--ATAELSSEEGRNAVFKKYLQATEAGQPWIIPA 246 (406) T ss_pred HHHHHHHHHHHHHHhccCCcceEEEeCC--CCCHHHHHHHHHHHHHHhccccccCCceeecC Confidence 9999999999999999999999999876 58999999999999999999999999888876 No 42 >protein:vir:80333 Length: 419 # NCBI annotation: gp4, phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111083;genbank:gi:134288632;genbank:GeneID:4960580 Probab=100.00 E-value=5.8e-40 Score=235.68 Aligned_cols=247 Identities=10% Similarity=0.082 Sum_probs=167.5 Q ss_pred HHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhh Q lcl|NC_019511. 32 NIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQVSTY 111 (330) Q Consensus 32 ~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~ 111 (330) ++ ....+.+..+.+.+..+ . +-..+.... .......+|+ ...| ++|.|++||++|+++||+. T Consensus 1 m~--~~~~~~~~~~~~~~~~~-~--~~~~~~g~~--~s~~~~~v~~-------~~al----~~~~v~~cv~~ia~~ia~l 62 (419) T protein:vir:80 1 MF--FSRQLLSNLGQTQPGSG-G--WVSALLGSA--RSEAGQVVTP-------ASAL----SLTVLQNCVTLLAESIAQL 62 (419) T ss_pred CC--cccccccccCcCCCCcc-h--hhHHhhccc--ccccCcccCh-------HHhh----ccHHHHHHHHHHHHhhccC Confidence 11 00000011111111100 0 000000000 0001111111 1123 3789999999999999963 Q ss_pred hhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCC Q lcl|NC_019511. 112 CKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKN 191 (330) Q Consensus 112 ~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~ 191 (330) .+.+.-++++.. .....|.+.+++...||+++|.++|++.++.++|+.||+|++++ |+. T Consensus 63 -----------p~~~~~~~~~~~--------~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~--r~~ 121 (419) T protein:vir:80 63 -----------PVELYERSGDDR--------KPATDHPLYSILKYEPNPWQTPFEYQEQSQVAVGLRGNSYSFID--RDQ 121 (419) T ss_pred -----------ceEEEEecCCCc--------ccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEE--ECC Confidence 333222211110 01123445566667899999999999999999999999999986 556 Q ss_pred CcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecccCcCCCCCCCccccHHHHHHHHHHHH Q lcl|NC_019511. 192 KTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKEFIAY 271 (330) Q Consensus 192 ~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~ 271 (330) .|+|++||||+|.+|++..+.+|. ++|...+.. .+++++|+|++.++.++ .||+|||++++.+|+++ T Consensus 122 ~G~~~~L~~i~~~~v~i~~~~~~~-------~~y~~~~~~--~~~~~~i~h~~~~~~d~----~~G~s~i~~~~~~i~~~ 188 (419) T protein:vir:80 122 DGVIQGLYPLDNEAVTVMKGPDLK-------PMYRVAGAD--PLPQRLVHHVRWMSING----YTGLSPVLLHANAIGHA 188 (419) T ss_pred CCcEEEEEEecCceEEEEECCCce-------EEEEEcCcc--ccchhheEEecCCCCCC----cccccHHHHHHHHHHHH Confidence 789999999999999998877653 233333432 47899999998765432 36999999999999999 Q ss_pred HHHHHHHHHHHhcCCCcceEEEeCCC--CCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 272 NNTESFNDRFFSHGGTTRGILQIRAD--QQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 272 laae~~~~~fF~nGa~p~GiL~~~~~--~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) +++++|+.+||.||++|+|+|.++++ ...+++++++|++.|++.++|.+|+|+++||-+ T Consensus 189 ~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~ 249 (419) T protein:vir:80 189 QAIQQYAGKSFMNGTALSGVIERPTDAPALKDQASVDRITDGWNAKFGGSGNAKKVALLQE 249 (419) T ss_pred HHHHHHHHHHHhcCCCccEEEEecCCCCcccCHHHHHHHHHHHHHHhcCccccCCceecCC Confidence 99999999999999999999998764 345899999999999999999999999888755 No 43 >protein:vir:1431 Length: 419 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536360;genbank:gi:17975165;genbank:GeneID:929165 Probab=100.00 E-value=3.3e-40 Score=237.00 Aligned_cols=246 Identities=9% Similarity=0.063 Sum_probs=167.3 Q ss_pred HHHHhhcccchhccccchhcc-ccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhhee Q lcl|NC_019511. 39 DTKEMQEITKSLYGKQQAYAE-PFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARY 117 (330) Q Consensus 39 ~~~~~~~~~~~~~g~~~~~~~-~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~ 117 (330) =|++....+ ..+.+ .... +....+ +.-.+| ..+.....+. .-+++.|++||++++++||+.. T Consensus 1 ~~~~r~~~~--~~~~~-~~~~~~~~~~~-----~g~~~s-~~~~~vt~~~---al~~~~v~~~v~~ia~~iA~lp----- 63 (419) T protein:vir:14 1 MFFSRQLLS--NLGQT-QMSAGGWVSAL-----LGSSRS-DSGQVVTPAS---ALALTVLQNCVTLLAESIAQLP----- 63 (419) T ss_pred Ccccccccc--ccccc-ccCcchhhHHh-----hcCCCc-cCCcccchHH---hhccHHHHHHHHHHHHhhccCc----- Confidence 111110000 00111 0000 000000 000111 1111112222 1237889999999999999632 Q ss_pred cccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEE Q lcl|NC_019511. 118 SEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEK 197 (330) Q Consensus 118 ~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~ 197 (330) |.+.-++.+. +.....|.+.+++..+||+++|.++|+++++.++|+.||++++++ |+..|+|++ T Consensus 64 ------~~~~~~~~~~--------~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~--r~~~G~~~~ 127 (419) T protein:vir:14 64 ------IELYERSGED--------RKPATDHPLYSILKYEPNSWQTPFEYQEQSQVAVGLRGNSYSFID--RDSDGVIQG 127 (419) T ss_pred ------eEEEEecCCc--------cccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEE--ECCCCcEEE Confidence 2221111110 011234556666667899999999999999999999999988876 556689999 Q ss_pred EEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHH Q lcl|NC_019511. 198 FIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESF 277 (330) Q Consensus 198 L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~ 277 (330) ||||+|.+|++..+.+|.. .|.+. +.. .+++++|+|++.++.++ .||+|||++++++|+.++++++| T Consensus 128 l~pl~~~~v~v~~~~~~~~-----~y~~~--~~~--~~~~~~i~h~~~~~~dg----~~G~s~i~~~~~~i~~~~~~~~~ 194 (419) T protein:vir:14 128 LYPLDNEAVTVMRGSDLKP-----VYRVR--GSD--PMPQRLVHHVRWMSING----YTGLSPVLLHANAIGHAQAIQQY 194 (419) T ss_pred EEEecCceEEEEECCCceE-----EEEEc--cCc--ccchhheeEecCcCCCC----cccccHHHHHHHHHHHHHHHHHH Confidence 9999999999988776642 23332 322 36889999998765443 36999999999999999999999 Q ss_pred HHHHHhcCCCcceEEEeCCCC--CCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 278 NDRFFSHGGTTRGILQIRADQ--QQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 278 ~~~fF~nGa~p~GiL~~~~~~--~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) +.++|+||++|+|+|.+++.. .++++++++|++.|++.++|.+|+|+++||-+ T Consensus 195 ~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~~ 249 (419) T protein:vir:14 195 AGKSFMNGTALSGVIERPKDAPALKDQASVDRITDGWNAKFGGSGNAKKVALLQE 249 (419) T ss_pred HHHHHhccCCccEEEEecCCCCcccCHHHHHHHHHHHHHHhcCccccCCceecCC Confidence 999999999999999987643 34799999999999999999999999888766 No 44 >protein:vir:97060 Length: 432 # NCBI annotation: putative head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453563;genbank:gi:84662598;genbank:GeneID:5142475 Probab=100.00 E-value=9.5e-40 Score=234.53 Aligned_cols=259 Identities=13% Similarity=0.082 Sum_probs=172.3 Q ss_pred CccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHH Q lcl|NC_019511. 23 VPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIII 102 (330) Q Consensus 23 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~ 102 (330) ++-+..+-. +..+..-+... .+.++.|+.-.-..+.... .....++. .+.....+. +-+++.|++||+ T Consensus 1 ~~~~~~~g~-~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~-----~~~~~~~~-~g~~v~~~~---a~~~~aV~~~v~ 68 (432) T protein:vir:97 1 MPDEKKLGL-LGQLKAMFVPP--DPVDIGGGQTFTPVNATAR-----DLGIIISD-TGAAVNADA---IMRLDAVAACVK 68 (432) T ss_pred CCCcccCch-hhhhHhhcCCc--cccccccccccccCchhhh-----hhcccccc-cCcccchHh---hhcchHHHHHHH Confidence 222222221 22222221111 1112222221110000000 00001111 111112222 223799999999 Q ss_pred HHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCce Q lcl|NC_019511. 103 TRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVN 182 (330) Q Consensus 103 ~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~ 182 (330) .|+++||+.. +.+--++++- ..++..|.+.+++..+||+++|.++|+++++.++|+.||+| T Consensus 69 ~Ia~~ia~lp-----------~~~y~~~~~g--------~~~~~~~pl~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay 129 (432) T protein:vir:97 69 LVSQAVAAMP-----------LMMYMRTPDG--------RKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAY 129 (432) T ss_pred HHHHhhccCc-----------eEEEEecCCC--------cccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeE Confidence 9999999632 2221111110 11234466677777889999999999999999999999999 Q ss_pred eEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecccCcCCCCCCCccccHHH Q lcl|NC_019511. 183 FEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVE 262 (330) Q Consensus 183 ~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe 262 (330) +++++ + .|++.+||||+|..|++..+.+|. ..|++...++....|+++||+|++.++.+ +.||+|||+ T Consensus 130 ~~~~~--~-~g~~~~L~~l~p~~v~v~~~~~g~-----~~y~~~~~~g~~~~~~~~~iih~r~~~~d----g~~G~spi~ 197 (432) T protein:vir:97 130 VRKVV--T-DGRIESLQYLANDRLTITTDTKGN-----TAYRYRRTDGQMIDIPRQQIWKIMGYSLD----GENGLSAIR 197 (432) T ss_pred EEEEe--c-CCcEEEEEEEcCcceEEEEcCCCc-----EEEEEEecCceEEEEccccEEEecCcCCC----CcccccHHH Confidence 88875 3 389999999999999999887765 35777666778889999999999865433 236999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 263 IAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 263 ~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) .|+++|++++++++|+++||+||++|+|||.+++ .+++|+++++++.| +|..|+|+++||-+ T Consensus 198 ~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~--~l~~e~~~~~~~~~----~~~~nag~~~vl~~ 259 (432) T protein:vir:97 198 YGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDR--FLTDDQYDSFSKKV----SGSVEAGRAPLLEG 259 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHhccCCcceeEecCC--CCCHHHHHHHHHHH----hhhhcCCCceecCC Confidence 9999999999999999999999999999998775 58999988876665 56789999887766 No 45 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=100.00 E-value=6.7e-40 Score=235.36 Aligned_cols=259 Identities=13% Similarity=0.100 Sum_probs=170.1 Q ss_pred CccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHH Q lcl|NC_019511. 23 VPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIII 102 (330) Q Consensus 23 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~ 102 (330) +|....+.. +..+ +.++.-.. +.++.+.. . ..|.... .+ .+...++. .+.....+. +-++|.|++||+ T Consensus 1 ~~~~~~mg~-f~r~-~~~~~~~~-~~~~~~~~-~-~~~~~~~--~~-~~~~~~~~-~g~~v~~~~---al~~~~V~~~i~ 68 (432) T protein:vir:81 1 MPDEKKLGL-FGQL-KAMFVPPD-PVDIGGGQ-T-FTPVNAT--AR-DLGIIISD-TGAAVNADA---IMRLDAVAACVK 68 (432) T ss_pred CCchhhcch-hhhh-hhhccccc-cccccccc-c-cccCccc--hh-hhcccccc-cCcccchHh---hhccHHHHHHHH Confidence 332222222 1111 11111100 00111111 0 0110000 00 00111111 111111121 223799999999 Q ss_pred HHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCce Q lcl|NC_019511. 103 TRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVN 182 (330) Q Consensus 103 ~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~ 182 (330) +|+++||+..... +.+. +|.. .++..|.+.+++..+||+++|.++|+++++.++|+.||+| T Consensus 69 ~Ia~~ia~lp~~~-y~~~--------~~g~----------~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay 129 (432) T protein:vir:81 69 LVSQAIAAMPLTM-YMRT--------PDGR----------KEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAY 129 (432) T ss_pred HHHHhhhhCceee-EEec--------CCcc----------eecccchHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeE Confidence 9999999642211 1111 1111 1123456667777889999999999999999999999999 Q ss_pred eEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecccCcCCCCCCCccccHHH Q lcl|NC_019511. 183 FEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVE 262 (330) Q Consensus 183 ~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe 262 (330) +++++ + .|+|++||||+|..|++..+.+|. ..|.++..++....|.++||+|++.++.++ .||+|||+ T Consensus 130 v~i~~--~-~g~~~~L~~l~~~~v~v~~~~~g~-----~~y~~~~~~g~~~~~~~~~iih~r~~~~dg----~~G~spi~ 197 (432) T protein:vir:81 130 VRKVV--T-DGRIESLQYLANDRLTITTDPKGN-----TAYRYRRTDGQMIDIPKQQIWKIMGYSLDG----ENGLSAIR 197 (432) T ss_pred EEEEe--c-CCcEEEEEEEcCCceEEEECCCCc-----EEEEEEecCceEEEEccccEEEecCCCCCC----cccccHHH Confidence 88764 3 389999999999999999887764 347776667888899999999998654432 26999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 263 IAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 263 ~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) +|+++|++++++++|+++||+||++|+|+|.+++ .+++++++++++.| +|..|+|+++||-+ T Consensus 198 ~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~--~l~~e~~~~~~~~~----~~~~nag~~~vl~~ 259 (432) T protein:vir:81 198 YGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDR--FLTDDQYDSFAKKV----SGSVEAGRAPLLEG 259 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCC--CCCHHHHHHHHHHH----hhhhcCCCceecCC Confidence 9999999999999999999999999999999875 58999999887776 46789999777765 No 46 >protein:vir:4509 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599035;genbank:gi:19548993;genbank:GeneID:935206 Probab=100.00 E-value=8.6e-40 Score=234.74 Aligned_cols=247 Identities=11% Similarity=0.091 Sum_probs=170.6 Q ss_pred CchhHHHH-HhcCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCccc Q lcl|NC_019511. 1 MPDLFKSL-RLGSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMR 79 (330) Q Consensus 1 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r 79 (330) .--+|++| +++.. +++ ..+...+. . + .. |- ...+ ...|+ T Consensus 15 ~~~~~~~lf~~~~~--~~~---~~~~~~~~-~-----~-----~~-------~~-------------~~~~--~~vs~-- 54 (424) T protein:vir:45 15 GRVLLDALFRSKSL--ENP---STPITGDA-V-----D-----TD-------GL-------------FRAD--VYVSP-- 54 (424) T ss_pred hhHHHHhhccccCC--CCC---ccccchhh-h-----h-----hh-------cc-------------ccCC--ceech-- Confidence 34466666 32211 111 11111110 0 0 00 00 0000 00111 Q ss_pred chHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCC Q lcl|NC_019511. 80 NAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDI 159 (330) Q Consensus 80 ~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn 159 (330) + ..-+++.|++||++|+++||+.. +.+.-++.+. ......|.+.+++..+|| T Consensus 55 ------~---~al~~~~v~~cv~~Ia~~iA~lp-----------~~v~~~~~~~--------~~~~~~~~l~~lL~~~PN 106 (424) T protein:vir:45 55 ------E---TAMKLAAVYSCIYVLSSSLAQMP-----------LHVMRRHKGK--------VEPARDHPAFYLVHDEPN 106 (424) T ss_pred ------H---HhhccHHHHHHHHHHHHHHhhCc-----------eEEEEecCCc--------eeecccchHHHHHHhhcc Confidence 1 11137889999999999999642 2221111100 001123456666677899 Q ss_pred CcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhH Q lcl|NC_019511. 160 DRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRE 239 (330) Q Consensus 160 ~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~d 239 (330) +++|.++|++.++.++|+.||+|++++ |+..|+|++|+|++|.+|++..+. | ...|.+...++ ...|.++| T Consensus 107 ~~~t~~~f~~~~v~~lll~Gna~~~i~--r~~~G~~~~L~~l~~~~v~i~~~~-~-----~~~y~~~~~~~-~~~~~~~e 177 (424) T protein:vir:45 107 TWQTSYKWRELKQRHILGWGNGYTWVK--RNRRGEVISLDCCMPWETTLMNTG-G-----RYTYGLYNEYG-AFAISPDD 177 (424) T ss_pred cCCCHHHHHHHHHHHHhhcCCeEEEEE--EcCCCcEEEEEEecCceEEEEEcC-C-----eEEEEEEecCc-eEEECccc Confidence 999999999999999999999999886 566799999999999999886542 2 23466555544 45799999 Q ss_pred eeeecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCc Q lcl|NC_019511. 240 LVMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGI 319 (330) Q Consensus 240 vih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~ 319 (330) |+|++....+ +.+|+||++.++++|++++++++|+++||+||++|+|||.+++ .+++|+.+++++.|++.++|. T Consensus 178 Vih~r~~~~d----~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~--~l~~e~~~~~~~~~~~~~~g~ 251 (424) T protein:vir:45 178 MIHIRALGNN----QKMGLSPIMQHAETIGMGMSGQKYTESFFSGNARPAGIVSVKS--GLNKESWGWLKDQWQKASQAL 251 (424) T ss_pred EEEecCcCCC----CcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCC--CCCHHHHHHHHHHHHHHhccc Confidence 9999854332 3469999999999999999999999999999999999999876 489999999999999999996 Q ss_pred -ccccccceeeC Q lcl|NC_019511. 320 -NGSWQICLYIK 330 (330) Q Consensus 320 -~na~kvpvL~e 330 (330) +|+|+++||.+ T Consensus 252 ~~n~g~~~vl~~ 263 (424) T protein:vir:45 252 RRQENKTMLLPA 263 (424) T ss_pred cccCCceeEcCC Confidence 58999777766 No 47 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=100.00 E-value=6e-40 Score=235.62 Aligned_cols=236 Identities=11% Similarity=0.091 Sum_probs=162.9 Q ss_pred hhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhH Q lcl|NC_019511. 29 IQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQV 108 (330) Q Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~I 108 (330) +.. ++ .|.+.. ......+. ++... ..+. ..+...+.+ ...+++.|++||+.++++| T Consensus 1 M~~-~~----~f~~r~------~~~~~~~~-~~~~~--~~~~-------~~~~~v~~~---~al~~~av~~cv~~ia~~i 56 (359) T protein:vir:10 1 MSI-LN----PFERRS------SITPNNYY-PFMVQ--NGSI-------VPNSLVDAT---EALKNSDLYAVTSLISSDI 56 (359) T ss_pred Ccc-cc----hhhccc------cCCCCcch-hhhhc--cccc-------cCCcccCHH---HhhcchHHHHHHHHHHHhh Confidence 111 00 011100 00010110 10000 0000 011111111 1223788999999999999 Q ss_pred hhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEe Q lcl|NC_019511. 109 STYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFS 188 (330) Q Consensus 109 a~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~ 188 (330) |+. + + + .+.+.+.+...||+.+|.++|+++++.++|++||+|++++ T Consensus 57 a~~--p-----------~--~-----------------~~~~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~-- 102 (359) T protein:vir:10 57 AGT--R-----------F--I-----------------GNQVFTSVLNNPSHLTNAFSFWQTAILNLLLNGNVFLAIL-- 102 (359) T ss_pred hcC--c-----------c--c-----------------cchHHHHHhhcccccCCHHHHHHHHHHhccccCceEEEEE-- Confidence 952 1 1 0 0112233456799999999999999999999999999886 Q ss_pred cCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEE-eCCceEEEechhHeeeecccCcCCCCCCC-ccccHHHHHHH Q lcl|NC_019511. 189 PKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQV-IDKQVVASFTSRELVMGIRNPRSDLNSSG-YGLSEVEIAMK 266 (330) Q Consensus 189 rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~-~~~~~~~~~~~~dvih~~~n~~~d~~~~~-yGlSPIe~a~~ 266 (330) |+..|+|.+|+||+|.+|++..+++ .++|.+. ..++....|.++||+|++.++......+| +|+|||+.++. T Consensus 103 r~~~g~~~~l~~l~~~~v~i~~~~~------~~~y~~~~~~~~~~~~~~~~evih~~~~~~~~~~~dg~~G~spi~~~~~ 176 (359) T protein:vir:10 103 KGDNSLMKELRLIPSNAITIDLTDD------TLTYEVNQFDDYPSAKYNASEMIHVKIMAYGVDTLHNLVGHSPLESLTS 176 (359) T ss_pred ECCCCeEEEEEEeCCceEEEEEcCC------eEEEEEEecCCceEEEEcccceEEeccCCCCCCccCccccccHHHHHHH Confidence 5667899999999999999866543 2445443 35677889999999999875443222233 69999999999 Q ss_pred HHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 267 EFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 267 ~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) +|+++.++++|++++|+||++|+|+|.++++ .+++++.+++++.|++.++| +|+|+++||-+ T Consensus 177 ~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~-~l~~e~~~~~~~~~~~~~~~-~n~g~~~vl~~ 238 (359) T protein:vir:10 177 EIGQQKEANRLSLSTLKGALNPTSVVKVPQG-TLSSEAKDSIRKEFEKANGG-NNSGRVMVLDQ 238 (359) T ss_pred HHHHHHHHHHHHHHHHhccCCcceEEEeCCC-CCCHHHHHHHHHHHHHHhCc-cccCCceecCC Confidence 9999999999999999999999999998754 58999999999999887655 89999777766 No 48 >protein:vir:3868 Length: 417 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680485;swissprot:trembl:q8ltc2;genbank:gi:22296525;interpro:IPR006427;interpro:IPR006944;uniprot:Q8LTC2;genbank:GeneID:951699 Probab=100.00 E-value=2.4e-39 Score=232.29 Aligned_cols=239 Identities=12% Similarity=-0.013 Sum_probs=164.4 Q ss_pred HHHHHHhhcccchhcccc---chhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhh Q lcl|NC_019511. 37 EQDTKEMQEITKSLYGKQ---QAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCK 113 (330) Q Consensus 37 ~~~~~~~~~~~~~~~g~~---~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~ 113 (330) ++= ++... ..+.. ..+..+ + ..|+. ++.-.....| +++.|++||++++++||+. T Consensus 1 m~~-~~~~~----~~~~~~~~~~~~~~---------~--~~~~~-~g~~~~~~Al----~~~~V~~cv~~ia~~iA~l-- 57 (417) T protein:vir:38 1 MKL-FRGLA----TEVDPHWADHLLDS---------G--VIPSF-RGGYLGISAL----RNSDVLTAVSIVSGDVSRF-- 57 (417) T ss_pred Ccc-ccccc----cCCCccchhhhccc---------c--ccccc-CCceechhhc----ccHHHHHHHHHHHHhhccC-- Confidence 110 11000 00000 011000 0 01111 1110000112 3788999999999999963 Q ss_pred hheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCc Q lcl|NC_019511. 114 PARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKT 193 (330) Q Consensus 114 ~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G 193 (330) .+.+.-++.+.... .+.+++++...||+++|+++|+++++.++|+.||+|++++++.. .| T Consensus 58 ---------p~~~~~~~~~~~~~----------~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~y~~i~r~~~-g~ 117 (417) T protein:vir:38 58 ---------PLVITDSSTDEVID----------LANIEYLMNTKVNKRLSAYQWKFPMMVNAILTGNAYSRIVRDPI-TN 117 (417) T ss_pred ---------eeEEEEcCCcceec----------cchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEEcCC-CC Confidence 22222111111111 12344455578999999999999999999999999999885433 47 Q ss_pred ceEEEEeeCCCceEEeeCCCCcccCCceeEEEEe-CCceEEEechhHeeeecccCcCCCCCCCccccHHHHHHHHHHHHH Q lcl|NC_019511. 194 KMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVI-DKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYN 272 (330) Q Consensus 194 ~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~-~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~l 272 (330) .|.+|+|++|.+|.+...+.|. +.|++.. +++....|+++||||++.++.. +.+|+||++.++++|+++. T Consensus 118 ~~~~l~~l~p~~v~v~~~~~~~-----~~y~~~~~~~~~~~~~~~~dviH~r~~~~d----~~~G~s~l~~~~~~i~~~~ 188 (417) T protein:vir:38 118 EPAMFEFYAPSQTQVDTSDPDN-----IIYRFTPYNSSMQKVCGFEDVIHWKFFSYD----TIMGRSPLLSLGDEIGLQE 188 (417) T ss_pred EEEEEEEeCCceEEEEEcCCCe-----EEEEEEEcCCcEEEEecCcceEEecCCCCC----CccccCHHHHHHHHHHHHH Confidence 8999999999999987766553 3465554 4555677999999999865432 2359999999999999999 Q ss_pred HHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 273 NTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 273 aae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) ++++|+.+||+||++|+|||..++ .+++++.+++|++|++.++|. |+|+++||-+ T Consensus 189 ~~~~~~~~~f~ng~~p~~il~~~~--~l~~e~~~~~~~~~~~~~~g~-n~g~~~vl~~ 243 (417) T protein:vir:38 189 SGVSTLQKFFKSGLKGSIIKAKES--RLSAEARQKIREDFERAQAGA-DAGSPIIVDA 243 (417) T ss_pred HHHHHHHHHHhccCCCcEEEEeCC--CCCHHHHHHHHHHHHHHhccc-ccCCceeccC Confidence 999999999999999999998765 589999999999999999885 8999666544 No 49 >protein:vir:8317 Length: 409 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817885;genbank:gi:29566318;genbank:GeneID:1259513 Probab=100.00 E-value=2.3e-39 Score=232.45 Aligned_cols=264 Identities=14% Similarity=0.053 Sum_probs=164.1 Q ss_pred chhHHHHHhc---CCCCCC--cccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCC Q lcl|NC_019511. 2 PDLFKSLRLG---SMYKED--TEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKS 76 (330) Q Consensus 2 ~~~~~~~~~~---~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s 76 (330) --|.-+|.-. -..+++ ..+ -++.+-++.. |. ..+++.+.+......+..... ....|... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~--------~~---~~~~~~~~~~~~~~~~~~~~g-~~~~~~~~-- 65 (409) T protein:vir:83 1 MGFWSNLFGIPSIPDLPNDNGPVD-YNPGDPDMVE--------FR---GPEEEPEARALPWIRPTAWSG-YPESWATP-- 65 (409) T ss_pred CchhhhhcccccCCCccccccccc-ccCCCCceee--------cc---CCCcchhhhhccccccccccc-cccccccc-- Confidence 1233333211 011111 111 2222222211 11 111111111111111110000 00112111 Q ss_pred cccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCC Q lcl|NC_019511. 77 YMRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTD 156 (330) Q Consensus 77 ~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~ 156 (330) .+.. -+-+.+.+++.|++||+.|++.||+. +....+++ . ..++ +.+++.. T Consensus 66 --~~~~---~t~~~~~~~~~v~acV~~Ia~~iA~l--pl~~~~~~----------~--~~~~-----------~~~ll~~ 115 (409) T protein:vir:83 66 --SWGS---AQDKLRTLIDVAWACIDLNASVLSSM--PIYRMRNG----------R--IIDS-----------VAWMSNP 115 (409) T ss_pred --Cccc---cchhhHhhhHHHHHHHHHHHHhhccC--ceEEeeCC----------c--cccc-----------hhhhccc Confidence 0100 11233445899999999999999963 22111111 0 0000 1223456 Q ss_pred CCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEec Q lcl|NC_019511. 157 KDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFT 236 (330) Q Consensus 157 ~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~ 236 (330) .||+++|+++|+++++.++|+ ||+|++++ .++..|+|++|+||+|.+|++..+.+|.. .| .+.+ .+. T Consensus 116 ~PN~~~t~~~f~~~l~~~lll-Gnay~~~i-~r~~~G~~~~L~pl~p~~v~v~~~~~g~~-----~y--~~~~----~~~ 182 (409) T protein:vir:83 116 DPEVYTSWQEFAKQLFWDFQL-GEAFVLPM-AHGSDGYPIRFRVVPPWLVNVELKKGARR-----EY--RIGG----LNV 182 (409) T ss_pred CCCCCCCHHHHHHHHHHHHhh-CCcEEEEE-EECCCCcEEEEEEECCcceEEEEcCCceE-----EE--EEcc----ccC Confidence 899999999999999999876 88887765 36677999999999999999988776532 23 2222 235 Q ss_pred hhHeeeecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHh Q lcl|NC_019511. 237 SRELVMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSF 316 (330) Q Consensus 237 ~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~ 316 (330) ++||||++.++..+ +.||+|||++++++|+++.++++|+.+||+||++|+|+|++++ .+++|++++++++|++.+ T Consensus 183 ~~eiiHir~~~~~~---~~~G~spi~~~~~~i~~~~a~~~~~~~~f~nga~p~gil~~~~--~ls~e~~~~~~~~~~~~~ 257 (409) T protein:vir:83 183 TDEILHIRYQGNTA---DAHGHGPLESAAPRQVVIGLLQKYVQNLAETGGVPLYWLGVER--RLSETEAVDLMDRWIESR 257 (409) T ss_pred ccceEEeCCCCCCC---CcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEeecCC--CCCHHHHHHHHHHHHHhh Confidence 68999998764433 4479999999999999999999999999999999999998775 599999999999999988 Q ss_pred cCcccccccceeeC Q lcl|NC_019511. 317 SGINGSWQICLYIK 330 (330) Q Consensus 317 ~G~~na~kvpvL~e 330 (330) +| |+|+++||++ T Consensus 258 ~~--nag~~~il~~ 269 (409) T protein:vir:83 258 SK--YAGHPALVTG 269 (409) T ss_pred CC--ccCccceecC Confidence 76 8999776665 No 50 >protein:vir:4156 Length: 542 # NCBI annotation: portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046965;genbank:gi:9630535;genbank:GeneID:1261709 Probab=100.00 E-value=4.1e-39 Score=231.01 Aligned_cols=252 Identities=15% Similarity=0.157 Sum_probs=168.2 Q ss_pred ccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHHHh-hcHHHHHHHH Q lcl|NC_019511. 24 PIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFG-NNSILNAIII 102 (330) Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a-~~~iv~a~I~ 102 (330) .++..++++-..-.+...+.+ . +.+......+ . .|..||- + .+.|.+.. .|++|++||+ T Consensus 1 ~~~~~~~i~s~~~~~~i~~~~-----~-~s~~~~~~~~------~-~~~~pp~---~----~~~la~l~~~n~~v~scI~ 60 (542) T protein:vir:41 1 MFNYHLSIRSLEKYKAIKREE-----V-ESQALGETRF------E-EYVEPKV---N----PLVLLSLLQVNPYHASACS 60 (542) T ss_pred Cccccccccccccchhhhhcc-----c-cccccccccC------C-ccccCCC---C----HHHHHHHHhhcHHHHHHHH Confidence 455555553311111111110 1 1111000000 1 2333331 1 23343333 4899999999 Q ss_pred HHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCce Q lcl|NC_019511. 103 TRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVN 182 (330) Q Consensus 103 ~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~ 182 (330) +++++||+ ++|.+..++ .+.+.+++ ||+.+|+++|+++++.++|++||+| T Consensus 61 ~ia~~IA~-----------l~~~~~~~~------------~~~l~~~l-------pN~~~s~~~f~~~~v~~lll~Gnay 110 (542) T protein:vir:41 61 IKANDIIR-----------TGYILEGDD------------EGVVDEFI-------RACKPSFEYVLLRALEDLQVFNYCT 110 (542) T ss_pred HHHHHHhh-----------Cceeeeccc------------chhhhhhc-------CCCCCCHHHHHHHHHHHHhhcCCeE Confidence 99999995 355443211 12233332 7888999999999999999999999 Q ss_pred eEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccC---CceeEEEE---------eCCceEEEechhHeeeecccCcCC Q lcl|NC_019511. 183 FEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIK---GGNRFVQV---------IDKQVVASFTSRELVMGIRNPRSD 250 (330) Q Consensus 183 ~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~---~~~~Y~q~---------~~~~~~~~~~~~dvih~~~n~~~d 250 (330) ++++ |+..|+|++|+||+|.+|++..+.++.... ....|.+. ..|.....++++||+|++.+... T Consensus 111 i~i~--rd~~G~~~~L~~l~~~~v~v~~d~~~~~~~~~~~~~~~~~~y~~~~~~~~~~g~~~~~~~~~eIiHir~~~~~- 187 (542) T protein:vir:41 111 LEVV--RDDRGDPIRFEYIPSHTIRVHKDGSRYRQTWDGVNITHFKDYRYEGEINPETGEDQDSVGANELVFIHIPSPV- 187 (542) T ss_pred EEEE--EcCCCcEEEEEEEcCcceEEEEcCCeeEeeecCCcceeEEeecccccccccccccccccCcccEEEecCCCCC- Confidence 9886 455689999999999999998876653211 11112111 11223345788999999765322 Q ss_pred CCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCC--------CCCCHHHHHHHHHHHHHHhcCc-cc Q lcl|NC_019511. 251 LNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRAD--------QQQSQHALENFKREWKSSFSGI-NG 321 (330) Q Consensus 251 ~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~--------~~ls~e~~e~lr~~w~~~~~G~-~n 321 (330) .+.||+|||..|+.+|.+++++++|+.+||.||++|+|||.+++. ..+++++.++++++|++.++|. +| T Consensus 188 --~~~~Glspi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~gIL~~~~~l~de~~~~~~~~~e~~~~lk~~~~~~~~g~~~n 265 (542) T protein:vir:41 188 --CSYYGVPRYVSAAPAILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDELEEDPDGNPTGRTVIQALIEDNFKHLKEA 265 (542) T ss_pred --CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCccccccccccccCHHHHHHHHHHHHHHHhhhhcc Confidence 345799999999999999999999999999999999999998753 4679999999999999999987 57 Q ss_pred ccccceee------C Q lcl|NC_019511. 322 SWQICLYI------K 330 (330) Q Consensus 322 a~kvpvL~------e 330 (330) +|++.||. + T Consensus 266 ~gk~~vL~~~~~~~~ 280 (542) T protein:vir:41 266 PHTPLVFSIPGGDTV 280 (542) T ss_pred cCceeEeeccCCccc Confidence 88855553 1 No 51 >protein:vir:9702 Length: 406 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795464;genbank:gi:28876227;genbank:GeneID:1257772 Probab=100.00 E-value=1.1e-38 Score=228.78 Aligned_cols=238 Identities=13% Similarity=0.017 Sum_probs=163.6 Q ss_pred hhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHHHh-hcHHHHHHHHHHHHhHhhhhhhheecccc Q lcl|NC_019511. 43 MQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFG-NNSILNAIIITRANQVSTYCKPARYSEKG 121 (330) Q Consensus 43 ~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a-~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~ 121 (330) |.=.++ ..-+...+..+...-+.. .++. . ....-| +++.|++||+.|+++||++ T Consensus 1 m~~f~~-~~~~~~~~~~~~~~~~~~------~~~~-~-------~~~~~Al~~~~V~~~i~~Ia~~iA~l---------- 55 (406) T protein:vir:97 1 MSFFQP-LGTSKVSYDDYISSVLAG------DVSQ-K-------YLGVSALKNSDILTATSIIAGDIARF---------- 55 (406) T ss_pred Cccccc-cCCCCCCcchHHHHHhcC------CCCc-c-------cccchhhccHHHHHHHHHHHHhhhhC---------- Confidence 100000 000000010000000000 0000 0 001112 3788999999999999963 Q ss_pred cceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEee Q lcl|NC_019511. 122 VGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAV 201 (330) Q Consensus 122 ~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pl 201 (330) .+.++-++.+. ...|.+.+++...||+++|+++|+++++.++++.||+|++++++. ..|++++|+|+ T Consensus 56 -p~~~~~~~g~~-----------~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gnay~~i~r~~-~~g~~~~L~~i 122 (406) T protein:vir:97 56 -PLVKKDVNGDI-----------IHDEDINYLLNVKSTSNASARTWKFAMAVNAILTGNSFSRILRDP-KTNQALQFQFY 122 (406) T ss_pred -eeEEEecCccc-----------cccchHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecC-CCCeEEEEEEE Confidence 22222222211 012445566667899999999999999999999999999987532 35899999999 Q ss_pred CCCceEEeeCCCCcccCCceeEEEE-eCCceEEEechhHeeeecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019511. 202 DPSTIFYATDKNGKIIKGGNRFVQV-IDKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDR 280 (330) Q Consensus 202 dp~tV~~~~d~~G~~~~~~~~Y~q~-~~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~ 280 (330) +|+.|++..+++|. +.|++. ..++....|.++||||++.++.++ .+|+|||++++++|++++++++|+++ T Consensus 123 ~p~~v~v~~~~~~~-----~~y~~~~~~~~~~~~~~~~evih~r~~~~dg----~~G~spi~~~~~~i~~~~a~~~~~~~ 193 (406) T protein:vir:97 123 RPSETTVEETDNHE-----IVYTFTDMLTAKQVKCFAHDVIHWKFFSHDT----ILGRSPLLSLGDEIDLQTGGINTLIK 193 (406) T ss_pred CCCeeEEEEcCCce-----EEEEEEecCCceEEEEccccEEEecCCCCCC----cccccHHHHHHHHHHHHHHHHHHHHH Confidence 99999998776654 345543 346777789999999998654322 24999999999999999999999999 Q ss_pred HHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 281 FFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 281 fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) ||+||++|++++..+ ..+++++++++++.|++.++| .|+|+++||-+ T Consensus 194 ~f~ng~~~~~i~~~~--~~l~~e~~~~~~~~~~~~~~g-~n~g~~~vl~~ 240 (406) T protein:vir:97 194 FFKDGFSSGILTMKG--AQLSGDARQRARQEFEKMREG-SVGGSPLVFDS 240 (406) T ss_pred HHhccCCCceEEecC--CCCCHHHHHHHHHHHHHHhcc-cccCceeecCC Confidence 999999998776643 469999999999999999988 68999776655 No 52 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=100.00 E-value=3.1e-39 Score=231.73 Aligned_cols=246 Identities=14% Similarity=0.106 Sum_probs=163.2 Q ss_pred hhHHHH-HhcCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccch Q lcl|NC_019511. 3 DLFKSL-RLGSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNA 81 (330) Q Consensus 3 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~ 81 (330) =||++| +++... ++... +.++ . ..+ .|.. ..+.++ ...+ T Consensus 1 m~~~~~f~~~~~~-~~~~~---~~~~---~-----~~~--------------------~~~~-~~~~~~--~~v~----- 40 (416) T protein:vir:12 1 MLLERMFEKRSGS-SDHED---GFNN---I-----LLN--------------------MFGG-RKTASG--ERVS----- 40 (416) T ss_pred CccchhcccccCc-cccCc---cchh---H-----HHH--------------------hhcC-cccccC--ceec----- Confidence 234333 221111 11000 0000 0 000 0000 000000 0011 Q ss_pred HHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCc Q lcl|NC_019511. 82 HNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDR 161 (330) Q Consensus 82 ~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~ 161 (330) .+ ...++|.|++||+.|+++||+..... +.+.+-| .. +..+|.+..++..+||+. T Consensus 41 ---~~---~al~~~~v~~~i~~Ia~~ia~l~~~~-~~~~~~~--------~~----------~~~~~~l~~~l~~~PN~~ 95 (416) T protein:vir:12 41 ---ES---NSLVQPDIFACVNVLSDDIAKLPIHT-YKRTDGG--------IE----------RKPEHKSAHAVYARPNPY 95 (416) T ss_pred ---hh---hhhccHHHHHHHHHHHHhhhhCceEE-EEecCCc--------cc----------cccccHHHHHHHhhcccC Confidence 01 12237889999999999999632211 1111111 01 111233444556679999 Q ss_pred CCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHee Q lcl|NC_019511. 162 DSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELV 241 (330) Q Consensus 162 ~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvi 241 (330) +|+++|+++++.++|+.||++++++ |+..|+|.+||||+|.+|++..+.++.. +.|++ ..+|....|.++||+ T Consensus 96 ~t~~~f~~~~v~~lll~Gna~~~i~--r~~~G~~~~L~~l~~~~v~v~~~~~~~~----~~~~~-~~~g~~~~~~~~eii 168 (416) T protein:vir:12 96 MTAFTWKKLMMTHVLTWGNAYSYIQ--FGSHGYPEALFPLRPDYTNAYVHPTTGM----LWYQT-VLNGKAIELYDYEVL 168 (416) T ss_pred CCHHHHHHHHHHHHhhcCCeEEEEE--ECCCCcEEEEEEECCcceEEEEeCCCcE----EEEEE-ecCCeEEEecCccEE Confidence 9999999999999999999998886 4556899999999999999987765432 23444 445666789999999 Q ss_pred eecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCccc Q lcl|NC_019511. 242 MGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGING 321 (330) Q Consensus 242 h~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~n 321 (330) |++.++.+ +.||+|||++++.+|++++++++|+++||+||++|+|||++++ .+++|++++++++|+.. .| T Consensus 169 h~~~~~~~----~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~--~~~~e~~~~~~~~~~~~----~~ 238 (416) T protein:vir:12 169 HFKGLSTD----GIHGKSPIGVVREHIGAQAAATKYNAKLYKNEATPRGILKVPA--FLDEKPKENVRKEWKRV----NK 238 (416) T ss_pred EecCcCCC----CcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCCceEEecCC--CCCHHHHHHHHHHHHHH----hc Confidence 99865443 3479999999999999999999999999999999999999875 58999999999999864 46 Q ss_pred ccccceeeC Q lcl|NC_019511. 322 SWQICLYIK 330 (330) Q Consensus 322 a~kvpvL~e 330 (330) +++++||-+ T Consensus 239 ~~~~~vl~~ 247 (416) T protein:vir:12 239 VENIAIIDY 247 (416) T ss_pred CCCeeecCC Confidence 788777655 No 53 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=100.00 E-value=1.2e-38 Score=228.54 Aligned_cols=259 Identities=12% Similarity=0.135 Sum_probs=161.8 Q ss_pred chhHHHHHhcCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccch Q lcl|NC_019511. 2 PDLFKSLRLGSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNA 81 (330) Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~ 81 (330) --|+++|......+ ..+..+++.+.-.+|..........-..+.++ T Consensus 1 Mg~~~~l~~r~~~~------------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~g~~V~~---- 46 (457) T protein:vir:13 1 MGFWSALFGRGHSP------------------------------ALDGIEARAWEPYDPSIYNLGAVAASGETVTP---- 46 (457) T ss_pred Cchhhhhhcccccc------------------------------cccccccccccccchHHHhhcccccCCceech---- Confidence 11122221110000 00011111111111110000000000112221 Q ss_pred HHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCc Q lcl|NC_019511. 82 HNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDR 161 (330) Q Consensus 82 ~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~ 161 (330) ...|+ ++.|++||+.++++||+..... +.+.+-+ .++ ...+.+...+ ..||++ T Consensus 47 ---~~al~----~~~V~~~v~~Ia~~iA~lp~~~-~~~~~~~--------~~~----------~~~~~l~~~l-n~~~n~ 99 (457) T protein:vir:13 47 ---HDALQ----VSAVFASVRLLSETIATLPLST-YSKRGGS--------RKE----------IVTPEWLDYP-NAEPGG 99 (457) T ss_pred ---HHhhc----cHHHHHHHHHHHHhhccCceEE-EEecCCc--------ccc----------cccchHHHhc-cccCCC Confidence 12233 7889999999999999632111 1111101 000 0112233333 344557 Q ss_pred CCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccCCcee--EEEEeCCc--eEEEech Q lcl|NC_019511. 162 DSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNR--FVQVIDKQ--VVASFTS 237 (330) Q Consensus 162 ~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~--Y~q~~~~~--~~~~~~~ 237 (330) +|+++|+++++.++|++||+|+++. ++ .|+|++||||+|.+|++..+..+... .... |.+..++. ....|++ T Consensus 100 ~t~~~f~~~~~~~lll~Gna~~~i~--~~-~g~~~~l~~l~p~~v~v~~~~~~~~~-~~~~~~y~~~~~~~~~~~~~~~~ 175 (457) T protein:vir:13 100 MGRIDILSQTVLSLLLQGNAFLAVR--WQ-GPNIVGLDVLDPTKIHVHMVMVDGLR-RKVFEAYDIDADGNEVLLGWFTP 175 (457) T ss_pred CCHHHHHHHHHHHHhhcCCeEEEEE--ec-CCcEEEEEEEccCceEEEEecCCCcc-ceeEEEEEEecCCceeeEEeeCc Confidence 8999999999999999999998874 34 48999999999999998776554321 1122 22222332 2346899 Q ss_pred hHeeeecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhc Q lcl|NC_019511. 238 RELVMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFS 317 (330) Q Consensus 238 ~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~ 317 (330) +||||++.+...+ ..||+|||++++.+|++++++++|+++||+||++|+|||.+++ .+++|++++++++|++.++ T Consensus 176 ~diih~~~~~~~~---~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~--~ls~e~~~~~~~~~~~~~~ 250 (457) T protein:vir:13 176 RDVLHIPGMMLPG---DFVGCSPISYARESIGLALAAQKYGSKFFANGAMPGAVVEVPG--TMSEEGLARAREAWRAANS 250 (457) T ss_pred cceEEecCCCCCC---ccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEcCC--CCCHHHHHHHHHHHHHHhc Confidence 9999998654332 2369999999999999999999999999999999999999875 5899999999999999999 Q ss_pred CcccccccceeeC Q lcl|NC_019511. 318 GINGSWQICLYIK 330 (330) Q Consensus 318 G~~na~kvpvL~e 330 (330) |..|+|+++||-+ T Consensus 251 g~~nag~~~vl~~ 263 (457) T protein:vir:13 251 GVDNAHRVALLTE 263 (457) T ss_pred CccccCcceecCC Confidence 9999999877766 No 54 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=100.00 E-value=4.1e-38 Score=225.56 Aligned_cols=249 Identities=14% Similarity=0.171 Sum_probs=161.6 Q ss_pred CchhHHHHHhcCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccchh-ccccccccccCCCCCcCCCccc Q lcl|NC_019511. 1 MPDLFKSLRLGSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAY-AEPFLEMMDTNPDYRDKKSYMR 79 (330) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~-~~~~~~~~~~~p~~~~~~s~~r 79 (330) |-+|+.+++.+.++ ...+. + +..++. .+.+ -.|..||- T Consensus 1 ~~~~~~~~~~~~~~--------------------~~~~~----~-------~~~~~~~~~~~-------~~~~~pp~--- 39 (540) T protein:vir:41 1 MFNYHLSIKSLEKY--------------------RAIKG----D-------TDSQALKEDRF-------EEYVEPKV--- 39 (540) T ss_pred CCCcccChhhccch--------------------hhhhc----c-------ccccccccCCC-------CccccCCC--- Confidence 22222222211111 01110 0 111111 1111 12333331 Q ss_pred chHHHHHHHHHHh-hcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCC Q lcl|NC_019511. 80 NAHNLHEVLKKFG-NNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKD 158 (330) Q Consensus 80 ~~~~~~~~Lr~~a-~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~p 158 (330) + .+.|++.+ .|+++++||++++++||. ++|.+..++.. +.++ .| T Consensus 40 ~----~~~La~~~~~n~~v~scI~~ia~~ia~-----------~~~~i~~~~~~-------------~~~~-------lp 84 (540) T protein:vir:41 40 H----PLVLLSLLQVNPYHASACSIKANDILR-----------TGYLIDGDDGG-------------VEEL-------LR 84 (540) T ss_pred C----HHHHHHHHHhcHHHHHHHHHHHHHHhc-----------CCceEecCccc-------------hhhh-------cc Confidence 1 24444444 489999999999999995 45555443321 1112 27 Q ss_pred CCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccC--Ccee-EEE---------E Q lcl|NC_019511. 159 IDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIK--GGNR-FVQ---------V 226 (330) Q Consensus 159 n~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~--~~~~-Y~q---------~ 226 (330) |+++|+.+|+++++.|+|++||+|+++++ +..|+|++|+||+|.+|++..+.+++... +... |+. . T Consensus 85 N~~~t~~~f~~~~v~dlll~Gnayv~i~r--~~~G~~~~L~~i~~~~V~v~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 162 (540) T protein:vir:41 85 ACRPSFEFILLQALEDLQVFNYCTLEVVR--DDQGEPVRLDYIPAHTVRVHRDGSRYMQTWDGIHVTYFKDYRYEGEVNP 162 (540) T ss_pred CCCCCHHHHHHHHHHHHHhcCCeEEEEEE--CCCCcEEEEEEeCCcceEEeEcCceeEeeecCceeeeeecccccceeec Confidence 89999999999999999999999999875 45689999999999999998776654321 1111 111 0 Q ss_pred eCCceEEEechhHeeeecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCC----CCC- Q lcl|NC_019511. 227 IDKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQ----QQS- 301 (330) Q Consensus 227 ~~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~----~ls- 301 (330) ..+.....|+++||||++.... ..+.||+||+.+|+.+|.+++++++|+.+||+||++|+|||.+++.. .++ T Consensus 163 ~~g~~~~~~~~~eViHir~~~~---~~~~~G~Spi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~g~l~~e~~~~~ 239 (540) T protein:vir:41 163 DNGEDQDGVGANEIIFIHLPSP---ICSYYGVPRYLSAAPSILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDEMELGS 239 (540) T ss_pred cccccceeecccceEEecCCCC---CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCcccCchhccch Confidence 1233345789999999975422 22457999999999999999999999999999999999999988641 111 Q ss_pred ---HHHHHHHHHHHHHHhcCc-ccccccceee------C Q lcl|NC_019511. 302 ---QHALENFKREWKSSFSGI-NGSWQICLYI------K 330 (330) Q Consensus 302 ---~e~~e~lr~~w~~~~~G~-~na~kvpvL~------e 330 (330) ++..+++++.|++.++|. +|+|+++||. + T Consensus 240 ~~~~~~~~~~~~~~~~~~~g~~~nag~~~vLe~~~~~~~ 278 (540) T protein:vir:41 240 DGEPTGRTVLQGLIEDNFKYLKEAPHTPLVFSIPGGDTV 278 (540) T ss_pred HHHHHHHHHHHHHHHHHhccccccccceEEEecCCCccc Confidence 233578999999999886 4788855543 1 No 55 >protein:vir:99452 Length: 651 # NCBI annotation: hypothetical protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919077;genbank:gi:119757035;genbank:GeneID:4606105 Probab=100.00 E-value=3.8e-39 Score=231.20 Aligned_cols=274 Identities=14% Similarity=0.108 Sum_probs=182.9 Q ss_pred cCCCCCCcccccCcc-CcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHH Q lcl|NC_019511. 11 GSMYKEDTEDLMVPI-DDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLK 89 (330) Q Consensus 11 ~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr 89 (330) +..++......-+-. +....... + ++.++++ . ++-.-...++...+|+.+ +.|+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~-------~------~~~~~~~--~--~~~~~~~~~~~~~p~~~~--------~~L~ 55 (651) T protein:vir:99 1 MTDTTGETQETKVHVEGLGGEADL-------A------KSPNSTQ--I--PDHRIQSHNVGVNPPYNP--------DRLA 55 (651) T ss_pred CCCccceeeeeEEEeecccccccc-------c------ccccccc--c--chhhhcccCCCCCCCCCH--------HHHH Confidence 222221111101100 00000000 0 0001111 0 111112335556655543 6799 Q ss_pred HHhh-cHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhcc------CCCCCCcC Q lcl|NC_019511. 90 KFGN-NSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTG------TDKDIDRD 162 (330) Q Consensus 90 ~~a~-~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~------~~~pn~~~ 162 (330) .|++ |+++++||++++++|| |+||.++++.. .+.+++..+++....+|++.+. +...|..+ T Consensus 56 ~~~e~~~~~~~~i~~~~~~ia-----------g~g~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~n~~~ 123 (651) T protein:vir:99 56 AFLELNETLATGIRKKSRYEV-----------GFGFDLVPAQG-VDGDDASDAQREVARNFWRGRSSRWQTGPNQAKTPA 123 (651) T ss_pred HHHhcChHHHHHHHHHhhhhh-----------ccCceeeeccc-CCCCccchHHHHHHHHHhhccchhhcccccccCCCC Confidence 9998 8999999999999998 67999988643 2334455556667788876643 35567789 Q ss_pred CHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCc--------------------------- Q lcl|NC_019511. 163 SFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGK--------------------------- 215 (330) Q Consensus 163 s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~--------------------------- 215 (330) ++.+|++.++.|++++|++|++++ +++.|+|++|+++++.++++..++..- T Consensus 124 t~~~i~~~~~~Dle~tGna~ieiI--rn~~g~pv~L~~lp~~~~Rv~~~~~~~~~~~~~ll~~~pn~~~~~~~~~~~~q~ 201 (651) T protein:vir:99 124 TPERVKELARQDYHGVGWLALEML--TDIEGRPVGLAYVPARTVRVRRPQNRFDQPRHPEEGRYVDGDVADIASRGYVQI 201 (651) T ss_pred CHHHHHHHHHHHHHHHhhHhhhhh--hcCccchhhhhhcChhheeeecccccccchhhhhhhcccccccchhHHHHHHHH Confidence 999999999999999999999987 556789999999999988765432110 Q ss_pred -------------cc--------------------C----------CceeEEE-EeCCceEEEechhHeeeecccCcCCC Q lcl|NC_019511. 216 -------------II--------------------K----------GGNRFVQ-VIDKQVVASFTSRELVMGIRNPRSDL 251 (330) Q Consensus 216 -------------~~--------------------~----------~~~~Y~q-~~~~~~~~~~~~~dvih~~~n~~~d~ 251 (330) .+ + ....|.+ ..+++...+|+++||||++.+... T Consensus 202 ~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~eViHir~~~~~-- 279 (651) T protein:vir:99 202 RNGNRRYFGEAGDRYRGQEVVIDESGDEPTIRYREDEESEREPIFVDRETGDVTTGDANGLENRPANELIFIPNPSIL-- 279 (651) T ss_pred HhcCcceEEEeeccccceeeeeccCCcceeEEeccCcceeeeeecccceeeeEEEcCCCceeEecccceEEecCCCCC-- Confidence 00 0 0000001 112233456889999999754322 Q ss_pred CCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 252 NSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 252 ~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) .+.||+|||+.|+.+|.+++++++|+++||+||++|+|||.+++ ..++++++++|+++|++.+ +|+||++||.. T Consensus 280 -~g~~G~spl~~a~~~i~~a~~a~~~~~~~f~NG~~p~gil~~~~-~~ls~e~~~~lr~~~~~~~---~nagk~~vL~~ 353 (651) T protein:vir:99 280 -EDDYGVPDWVSAIRTISADEAAKDYNRDFFDNDTIPRMVIKVTG-GELSEESKRDLRQMLNGLR---EESHRAVVLEV 353 (651) T ss_pred -CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecC-CCCCHHHHHHHHHHHHHHh---ccCCceEEeec Confidence 24579999999999999999999999999999999999999876 4689999999999999854 47899777754 No 56 >protein:vir:80134 Length: 403 # NCBI annotation: Phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425602;genbank:gi:155042935;genbank:GeneID:5469563 Probab=100.00 E-value=2.9e-38 Score=226.35 Aligned_cols=242 Identities=11% Similarity=0.056 Sum_probs=168.0 Q ss_pred hhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhH Q lcl|NC_019511. 29 IQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQV 108 (330) Q Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~I 108 (330) +.+ ++. ++. +..+.. ..... .+.. ..+....+ ......++++|.|++||+.++++| T Consensus 1 Mg~-~~~-----f~~-k~~~~~---~~~~~-~~~~-----~~~~~~~~--------~~~~~~~~~~~~V~~~I~~ia~~i 56 (403) T protein:vir:80 1 MGL-FNF-----FRR-KTRSEP---TNAIS-WFLT-----QEAYDTLA--------IPGYTRLSDNPEVRMAVHKIAELI 56 (403) T ss_pred Ccc-ccc-----ccc-cccccc---cchhh-hhcc-----cccccccc--------cchhhhhhhhHHHHHHHHHHHHhh Confidence 211 110 100 000000 00000 0000 00000000 011234566899999999999999 Q ss_pred hhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEe Q lcl|NC_019511. 109 STYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFS 188 (330) Q Consensus 109 a~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~ 188 (330) |+... .+.+.+-+-. .+..|.+.+++..+||+.+|.++|++++|.++|+.|+||+|+... T Consensus 57 A~~p~--~~~~~~~~g~------------------~~~~~~~~~lL~~~PN~~~t~~~f~~~~v~~~ll~~~Gna~i~~~ 116 (403) T protein:vir:80 57 SSMTI--HLMQNTDNGD------------------IRIKNELSRKIDINPYSLMTRKAWMYNIVYTMLLDGEGNSVVFPK 116 (403) T ss_pred hhCce--EEEEecCCce------------------eecCChHHHHHhccCCcCCCHHHHHHHHHHHHhhcCCccEEEEEE Confidence 96422 1111110000 112344556666789999999999999999999988888887777 Q ss_pred cCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecccCcCCCCCCC-ccccHHHHHHHH Q lcl|NC_019511. 189 PKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSDLNSSG-YGLSEVEIAMKE 267 (330) Q Consensus 189 rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~~~~~-yGlSPIe~a~~~ 267 (330) +++.|+|.+||||+|.+|++..+.+|+. +.| .+ ..|.++||+|++.++.+. ++ +|+||+++++.+ T Consensus 117 ~~~~g~~~~L~~l~p~~v~~~~~~~g~~------~~y--~~---~~~~~~eiih~~~~~~~~---~~~~G~s~~~~~~~~ 182 (403) T protein:vir:80 117 YTTSGLIDELIPLAPSKVSFVDTDTGYQ------IWY--QG---KAYNYDEVLHFIVNPDPE---KPYMGRGYRVVLKDI 182 (403) T ss_pred EcCCCcEEEEEEEcCCeeEEEEcCCceE------EEE--ee---cccchhhEEEEeccCCCc---CccccccHHHHHHHH Confidence 8889999999999999999988877643 222 12 357899999998765443 34 499999999999 Q ss_pred HHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 268 FIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 268 I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) |+.+.++++|+.+||+||++|+|||.+++ .+++++.++++++|.+.+.|..|+|+++||.. T Consensus 183 i~~~~~~~~~~~~~~~ng~~p~~il~~~~--~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~ 243 (403) T protein:vir:80 183 VNNLKQATTTKKSFMSGKYMPSLIVKVDA--ATAELSSEEGRNAVFKKYLEASEAGQPWIIPA 243 (403) T ss_pred HHHHHHHHHHHHHHHhccCCcceEEEeCC--CCChHHHHHHHHHHHHHHhhhhhcCCeeeecc Confidence 99999999999999999999999998875 48899999999999999999999999887754 No 57 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=100.00 E-value=5.2e-38 Score=224.97 Aligned_cols=245 Identities=11% Similarity=0.006 Sum_probs=159.3 Q ss_pred HHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhh Q lcl|NC_019511. 32 NIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQVSTY 111 (330) Q Consensus 32 ~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~ 111 (330) +.-.+...+++..+... .+ .+..++... ..|.+........+.... -+...+++.|++||+.++++||+. T Consensus 1 m~m~~~~~~~~~~~~~~---~~--~~~~~~~~~--~~~~~~~~~~~~~g~~v~---~~~al~~~~v~~~v~~ia~~ia~l 70 (392) T protein:vir:74 1 MILPILNFINQTNDPPE---AG--SVQSYFPDG--NDAQIMESLLGDNNEWVS---ARAALRNSDLFSIILQLSSDLAIV 70 (392) T ss_pred CcchhhhhhhcccCccc---cc--ccccccccC--chhhhhhhccCCCCcccc---hhhhhcchHHHHHHHHHHHhhccC Confidence 22222222221111100 00 000000000 000000000000010001 112223799999999999999952 Q ss_pred hhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCC Q lcl|NC_019511. 112 CKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKN 191 (330) Q Consensus 112 ~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~ 191 (330) .+.+ .+.+. ..+.++||+.+|.++|+++++.++|++|++|++++ |+. T Consensus 71 -----------p~~~--~~~~~------------------~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~--r~~ 117 (392) T protein:vir:74 71 -----------KINA--EKKKN------------------QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRW--RNA 117 (392) T ss_pred -----------ceee--ccchh------------------hhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEE--ECC Confidence 2222 11110 01234699999999999999999999999999987 455 Q ss_pred CcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCC---ceEEEechhHeeeecccCcCCCCCCCccccHHHHHHHHH Q lcl|NC_019511. 192 KTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDK---QVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKEF 268 (330) Q Consensus 192 ~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~---~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I 268 (330) .|++++|+||+|.+|++..+++|.. ..|.+..++ +....|+++||+|++.++..+ ..||+|||++++.+| T Consensus 118 ~G~~~~L~~i~~~~v~v~~~~~~~~----~~y~~~~~~~~~~~~~~~~~~evih~~~~~~~~---~~~G~s~i~~~~~~i 190 (392) T protein:vir:74 118 NGADMKWEYLRPSQVNTYYFEYENG----MYYNITFDDPKIEPILQAPQSDLIHMKLLSIDG---GKTGISPLYSLRRES 190 (392) T ss_pred CCcEEEEEEEcCceeEEEEcCCCce----EEEEEEecCCccceeEEEcCccEEEecCCCCCC---ccccccHHHHHHHHH Confidence 6899999999999999998876532 345555443 345679999999998655432 346999999999999 Q ss_pred HHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 269 IAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 269 ~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) ++++++++|++++|+||++|+|+|.++++..++++ .++.|.+.+.|..|+|+++||-+ T Consensus 191 ~~~~~~~~~~~~~f~ng~~p~~il~~~~~~~~~~~----~~~~~~~~~~~~~n~g~~~vl~~ 248 (392) T protein:vir:74 191 KIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDK----DKASRSRSFMKRSRSGGPVVLDD 248 (392) T ss_pred HHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHH----HHHHHHHHHhccccCCCeeecCC Confidence 99999999999999999999999999886555543 34667778889999999776655 No 58 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=100.00 E-value=1.2e-37 Score=223.00 Aligned_cols=245 Identities=11% Similarity=0.071 Sum_probs=162.5 Q ss_pred hhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhH Q lcl|NC_019511. 29 IQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQV 108 (330) Q Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~I 108 (330) +.+ ...+++..+.+.+. ++....+.. -..|... +.....+. ..+++.|++||+.++++| T Consensus 1 Mgl-----~~~~f~~~~~~~~~---~~~~~~~~~-----~~~~~~~-----g~~v~~~~---al~~~~v~~~v~~ia~~i 59 (409) T protein:vir:84 1 MSL-----FTRIFSGPSEERTL---TKISGIPSP-----AEDWAMH-----GDRPGANS---AMTLGAFYACVTLLADTV 59 (409) T ss_pred Cch-----hhhhhcCCCccccc---ccccccccc-----cchhhcc-----Ccccchhh---hhccHHHHHHHHHHHHhh Confidence 332 22222211111111 000000100 0111111 11111111 113789999999999999 Q ss_pred hhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEe Q lcl|NC_019511. 109 STYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFS 188 (330) Q Consensus 109 a~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~ 188 (330) |+. .|.+.-++.+.+. ..|.+.+++..+||+++|+++|++.++.++|+.||+|+++.+ T Consensus 60 A~l-----------p~~~~~~~~~~~~----------~~~~l~~lL~~~PN~~~t~~~f~~~l~~~l~l~Gn~~~~i~~- 117 (409) T protein:vir:84 60 ASL-----------SIDAYRKKDNVRI----------PVSPAPKLLESTPYPGLTWFDWLWMLMESLAVTGNAFGYISA- 117 (409) T ss_pred hhC-----------ceEEEEecCCccc----------ccchHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEE- Confidence 963 2222211111111 123455556678999999999999999999999999999864 Q ss_pred cCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEE-EeCCceEEEechhHeeeecccCcCCCCCCCccccHHHHHHHH Q lcl|NC_019511. 189 PKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQ-VIDKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKE 267 (330) Q Consensus 189 rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q-~~~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~ 267 (330) ++..|+|++||||+|.+|++....++. +..|++ ...++ ..|+++||+|++.++..+ ..||+||++.+..+ T Consensus 118 ~~~~g~~~~L~~l~p~~v~v~~~~~~~----~~~~~~~~~~~g--~~~~~~dvih~~~~~~~~---~~~G~s~i~~~~~~ 188 (409) T protein:vir:84 118 RDEANRPTAIMPIHPDCIHVTDAKDED----GDWIEPVYRIDG--KVVPNHRIMHIKRYPVAG---CALGMSPIEKAASA 188 (409) T ss_pred ECCCCceEEEEEEcCceeEEEEcCCCc----ceEEEEEecCCc--eEEchhhEEEecCCCCCc---ccccccHHHHHHHH Confidence 566799999999999999887554332 111222 22233 358899999998776554 23699999999999 Q ss_pred HHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 268 FIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 268 I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) |++++++++|+.+||+||++|+|+|.+++ .+++|+++++++.|.+.+ .|+|+++||-+ T Consensus 189 i~~~~~~~~~~~~~f~ng~~p~gil~~~~--~l~~e~~~~~~~~~~~~~---~n~g~~~vl~~ 246 (409) T protein:vir:84 189 IGLGLAAERYGLRWFRDSANPSGILSSDA--DLTPDQVKQTQKQWIQSH---HNRRLPAVMSA 246 (409) T ss_pred HHHHHHHHHHHHHHHhcCCCccEEEecCC--CCCHHHHHHHHHHHHHHh---ccCCCeeecCC Confidence 99999999999999999999999998875 589999999999998865 46888776555 No 59 >protein:vir:81218 Length: 423 # NCBI annotation: gp3, phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456733;genbank:gi:157168376;interpro:IPR006427;interpro:IPR006944;uniprot:Q9MBK2;genbank:GeneID:5580341 Probab=100.00 E-value=2.9e-37 Score=220.94 Aligned_cols=256 Identities=10% Similarity=0.065 Sum_probs=167.0 Q ss_pred hhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHH-HhhcHHHHHHHHHHHHh Q lcl|NC_019511. 29 IQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKK-FGNNSILNAIIITRANQ 107 (330) Q Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~-~a~~~iv~a~I~~~~d~ 107 (330) +-. ++++.. ..++...+.......+ +...++.... . -.+.+ +.++|.|++||+.++++ T Consensus 1 Mg~-~~~~~~------~~~~~~~~~~~~~~~~----------~~~~~~~~~~---~-~~~~~~~~~~~~v~~~i~~ia~~ 59 (423) T protein:vir:81 1 MGF-LQKLGL------APSVVATPEPIELVGP----------IFESLKLSTK---N-MTVEQIWEDQPHLRTVTTFIARN 59 (423) T ss_pred Cch-hHhhcc------ccccccCccccccccc----------cccccccccc---h-hhHHHHHHhhhHHHHHHHHHHHh Confidence 111 111110 0011011111000000 1111111010 0 11222 23579999999999999 Q ss_pred HhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEE Q lcl|NC_019511. 108 VSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVF 187 (330) Q Consensus 108 Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~ 187 (330) ||+..... +. +.+|.+.+.. ..|.+.+++. .||+++|.++|++.++.++++.||+|+++.+ T Consensus 60 ia~lp~~~-~~--------~~~dg~~~~~---------~~~~~~~ll~-~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r 120 (423) T protein:vir:81 60 VASLQLQA-FE--------RVEDGGRERV---------REGHLARVCK-LANSDMTMYDLLERTMFDLCLYDEFFWLLPG 120 (423) T ss_pred HhhCceEE-EE--------EecCCceeee---------ccchHHHHhh-cCCCCCCHHHHHHHHHHHHhhcCCeEEEEEe Confidence 99642211 11 1111111111 1233344443 6999999999999999999999999998876 Q ss_pred ecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEe---CCceEEEechhHeeeecccCcCCCCCCCccccHHHHH Q lcl|NC_019511. 188 SPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVI---DKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIA 264 (330) Q Consensus 188 ~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~---~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a 264 (330) +..+.+.++.|+|+.+..|++....+|. +...|++.. .+|...+|+++||||++.....+ ..||+||++.+ T Consensus 121 d~~~~~~~~~l~p~~~~~v~~~~~~~~~---~~~~Y~~~~~~~~~g~~~~~~~~evih~r~~~~~~---~~~G~spi~~~ 194 (423) T protein:vir:81 121 DLGVDTPTLDIRPIPVSWVQRRAYKDGW---GSLDYIIIESGDNDGRSVKVPGERVIHRHGYNPKT---MKRGKSPVQSL 194 (423) T ss_pred cCCcCcceEEEeecccceeeeeeccCCC---cceEEEEEEecCCCceEEEEcccceEEecCCCCCC---ccccccHHHHH Confidence 5555578888999888888876654432 234555443 35667789999999997543332 23699999999 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCC---CCCCHHHHHHHHHHHHHHhc-CcccccccceeeC Q lcl|NC_019511. 265 MKEFIAYNNTESFNDRFFSHGGTTRGILQIRAD---QQQSQHALENFKREWKSSFS-GINGSWQICLYIK 330 (330) Q Consensus 265 ~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~---~~ls~e~~e~lr~~w~~~~~-G~~na~kvpvL~e 330 (330) +++|+.++++++|+.+||+||++|+|+|.++.. .++++|+.+++++.|++.++ |.+|+|+++||-+ T Consensus 195 ~~~i~~~~~~~~~~~~~f~ng~~p~gvi~~~~~~~~~~l~~e~~~~~~~~~~~~~~~~~~n~g~~~vl~~ 264 (423) T protein:vir:81 195 RDILGEQIEAAIFRAQMWRNGPRPGMVIMRDPESKAGKWDAESRTRFMANLRASFSPKSSDVGGTLLLED 264 (423) T ss_pred HHHHHHHHHHHHHHHHHHhccCCCceEEEecCcccCccCCHHHHHHHHHHHHHHhccccccCCcceecCC Confidence 999999999999999999999999999987643 35899999999999999985 7789999777765 No 60 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=100.00 E-value=2.9e-37 Score=220.86 Aligned_cols=244 Identities=11% Similarity=0.022 Sum_probs=162.2 Q ss_pred CchhHHHHHhcCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccc Q lcl|NC_019511. 1 MPDLFKSLRLGSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRN 80 (330) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~ 80 (330) |--||+++++.+..++.... .... ..|..... .... .+......++ T Consensus 2 ~m~~f~~~~~~~~~~~~~~~----~~~~---------------------~~~~~~~~----~~~~--~~~~~~~v~~--- 47 (392) T protein:vir:10 2 ILPILNFINQTNDPPEVGSV----QSYF---------------------PDGNDAQI----MESL--LGDNNEWVSA--- 47 (392) T ss_pred cchhhhhhhccccccccccc----cccc---------------------ccCchhhh----hhhh--cCCCCceech--- Confidence 33444444322221111000 0000 00000000 0000 0011111111 Q ss_pred hHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCC Q lcl|NC_019511. 81 AHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDID 160 (330) Q Consensus 81 ~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~ 160 (330) ...| ++|.|++||+.++++||+. .+ ++++.+. ..+.++||+ T Consensus 48 ----~~al----~~~~v~~~i~~ia~~ia~l-----------p~--~~~~~~~------------------~~l~~~PN~ 88 (392) T protein:vir:10 48 ----RAAL----RNSDLFSIILQLSSDLAIV-----------KI--NAEKKKN------------------QGIIDNPST 88 (392) T ss_pred ----HHhh----ccHHHHHHHHHHHHhhccC-----------ce--eeccchh------------------hhHhhcCCC Confidence 1122 3799999999999999952 22 2222110 112357999 Q ss_pred cCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCC---ceEEEech Q lcl|NC_019511. 161 RDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDK---QVVASFTS 237 (330) Q Consensus 161 ~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~---~~~~~~~~ 237 (330) ++|.++|+++++.++|++|++|++++ |+..|++++|+||+|.+|++..+.+|.. ..|.+..++ +....|++ T Consensus 89 ~~t~~~f~~~~~~~lll~Gna~~~i~--r~~~g~~~~L~~l~~~~v~~~~~~~~~~----~~y~~~~~~~~~~~~~~~~~ 162 (392) T protein:vir:10 89 NANKHGFWQSMFAQLLLGGEAFAYRW--RNANGADMKWEYLRPSQVNTYYFEYENG----MYYNITFDDPKIEPILQAPQ 162 (392) T ss_pred CCCHHHHHHHHHHHhhhcCcEEEEEE--ECCCCcEEEEEEEcCceeEEEEcCCCce----EEEEEEecCcccceeEEEcc Confidence 99999999999999999999999987 4566899999999999999998776532 345555443 34567999 Q ss_pred hHeeeecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhc Q lcl|NC_019511. 238 RELVMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFS 317 (330) Q Consensus 238 ~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~ 317 (330) +||+|++.++..+ ..||+|||++|+.+|++++++++|+.++|.||++|+|+|.++++...++++ ++.|.+.+. T Consensus 163 ~eiih~~~~~~~~---~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~----~~~~~~~~~ 235 (392) T protein:vir:10 163 SDLIHMKLLSIDG---GKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKD----KASRSRSFM 235 (392) T ss_pred ccEEEecCCCCCC---ccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHH----HHHHHHHHh Confidence 9999998765432 236999999999999999999999999999999999999998875555444 456777888 Q ss_pred CcccccccceeeC Q lcl|NC_019511. 318 GINGSWQICLYIK 330 (330) Q Consensus 318 G~~na~kvpvL~e 330 (330) |..|+|+++||-+ T Consensus 236 ~~~~~g~~~vl~~ 248 (392) T protein:vir:10 236 KRSRSGGPVVLDD 248 (392) T ss_pred ccccCCCeeecCC Confidence 8999999776655 No 61 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=100.00 E-value=2.9e-37 Score=220.86 Aligned_cols=244 Identities=11% Similarity=0.022 Sum_probs=162.2 Q ss_pred CchhHHHHHhcCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccc Q lcl|NC_019511. 1 MPDLFKSLRLGSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRN 80 (330) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~ 80 (330) |--||+++++.+..++.... .... ..|..... .... .+......++ T Consensus 2 ~m~~f~~~~~~~~~~~~~~~----~~~~---------------------~~~~~~~~----~~~~--~~~~~~~v~~--- 47 (392) T protein:vir:39 2 ILPILNFINQTNDPPEVGSV----QSYF---------------------PDGNDAQI----MESL--LGDNNEWVSA--- 47 (392) T ss_pred cchhhhhhhccccccccccc----cccc---------------------ccCchhhh----hhhh--cCCCCceech--- Confidence 33444444322221111000 0000 00000000 0000 0011111111 Q ss_pred hHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCC Q lcl|NC_019511. 81 AHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDID 160 (330) Q Consensus 81 ~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~ 160 (330) ...| ++|.|++||+.++++||+. .+ ++++.+. ..+.++||+ T Consensus 48 ----~~al----~~~~v~~~i~~ia~~ia~l-----------p~--~~~~~~~------------------~~l~~~PN~ 88 (392) T protein:vir:39 48 ----RAAL----RNSDLFSIILQLSSDLAIV-----------KI--NAEKKKN------------------QGIIDNPST 88 (392) T ss_pred ----HHhh----ccHHHHHHHHHHHHhhccC-----------ce--eeccchh------------------hhHhhcCCC Confidence 1122 3799999999999999952 22 2222110 112357999 Q ss_pred cCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCC---ceEEEech Q lcl|NC_019511. 161 RDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDK---QVVASFTS 237 (330) Q Consensus 161 ~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~---~~~~~~~~ 237 (330) ++|.++|+++++.++|++|++|++++ |+..|++++|+||+|.+|++..+.+|.. ..|.+..++ +....|++ T Consensus 89 ~~t~~~f~~~~~~~lll~Gna~~~i~--r~~~g~~~~L~~l~~~~v~~~~~~~~~~----~~y~~~~~~~~~~~~~~~~~ 162 (392) T protein:vir:39 89 NANKHGFWQSMFAQLLLGGEAFAYRW--RNANGADMKWEYLRPSQVNTYYFEYENG----MYYNITFDDPKIEPILQAPQ 162 (392) T ss_pred CCCHHHHHHHHHHHhhhcCcEEEEEE--ECCCCcEEEEEEEcCceeEEEEcCCCce----EEEEEEecCcccceeEEEcc Confidence 99999999999999999999999987 4566899999999999999998776532 345555443 34567999 Q ss_pred hHeeeecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhc Q lcl|NC_019511. 238 RELVMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFS 317 (330) Q Consensus 238 ~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~ 317 (330) +||+|++.++..+ ..||+|||++|+.+|++++++++|+.++|.||++|+|+|.++++...++++ ++.|.+.+. T Consensus 163 ~eiih~~~~~~~~---~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~----~~~~~~~~~ 235 (392) T protein:vir:39 163 SDLIHMKLLSIDG---GKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKD----KASRSRSFM 235 (392) T ss_pred ccEEEecCCCCCC---ccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHH----HHHHHHHHh Confidence 9999998765432 236999999999999999999999999999999999999998875555444 456777888 Q ss_pred CcccccccceeeC Q lcl|NC_019511. 318 GINGSWQICLYIK 330 (330) Q Consensus 318 G~~na~kvpvL~e 330 (330) |..|+|+++||-+ T Consensus 236 ~~~~~g~~~vl~~ 248 (392) T protein:vir:39 236 KRSRSGGPVVLDD 248 (392) T ss_pred ccccCCCeeecCC Confidence 8999999776655 No 62 >protein:vir:79772 Length: 648 # NCBI annotation: portal protein # Family: family:all:3222 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429612;genbank:gi:156564103;genbank:GeneID:5525537 Probab=100.00 E-value=3.3e-36 Score=215.08 Aligned_cols=285 Identities=11% Similarity=0.049 Sum_probs=178.2 Q ss_pred CchhHHHHHhcCCCCCC-cccccCccCcchhHHHHHHHHHHHHhhcccc-hhccc-cchhccc--cccccc----cCCCC Q lcl|NC_019511. 1 MPDLFKSLRLGSMYKED-TEDLMVPIDDGIQANIRQIEQDTKEMQEITK-SLYGK-QQAYAEP--FLEMMD----TNPDY 71 (330) Q Consensus 1 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~g~-~~~~~~~--~~~~~~----~~p~~ 71 (330) -.-|.-+++..-|.+.| +.++....+..+.....- +-+..- ...++ +..+..- ..+.+. ...++ T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~-------~~~~~~~~~~~~~d~~~~~~~r~g~~~~~~~~g~~~~ 79 (648) T protein:vir:79 7 GRGFWSRISLMWRDEDDDKEPLVLEESMQLGEAPGA-------MPKGGGGGGSAKRDPKMSLVKRIGLAIMDGGGGGRDF 79 (648) T ss_pred cchhhhhhhhhccCccccccccccccccccCCCccc-------cCCCCcccccccccchhHHHHHhHHHHHhhcCCcccc Confidence 23455556666665555 334333333322211100 000000 00000 0000000 000000 01123 Q ss_pred CcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHH Q lcl|NC_019511. 72 RDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFIL 151 (330) Q Consensus 72 ~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~ 151 (330) ..||.. . ....+.|..||+|++||++++++|++ ++|.++.++... ...... T Consensus 80 ~epp~d---~---~~l~~l~~~np~V~~aI~iia~~ia~-----------l~~~i~~~~~~~--~~~~~~---------- 130 (648) T protein:vir:79 80 EEPEFD---F---NEITSAYNTEGYVRQAVDKYIEMMFK-----------ADWDFVSKNPNA--VEYIRM---------- 130 (648) T ss_pred ccCCcC---H---HHHHHHHhcChHHHHHHHHHHHHHhh-----------CcceEEecCCcc--chhhHH---------- Confidence 334322 1 12235666799999999999999985 467776654322 111111 Q ss_pred hccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCC-------------cceEEEEeeCCCceEEeeCCCCcccC Q lcl|NC_019511. 152 NTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNK-------------TKMEKFIAVDPSTIFYATDKNGKIIK 218 (330) Q Consensus 152 ~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~-------------G~~~~L~pldp~tV~~~~d~~G~~~~ 218 (330) +.+...||+++|.++|+++++.++|++||+|++++++.++. ..+.+||||+|.+|++..+++|.. T Consensus 131 ~~ll~rPn~~~t~~~f~~~l~~~lll~GNAYveiiRd~~G~~~~~l~~~~~~~~~~v~~l~pl~p~~v~v~~d~~g~~-- 208 (648) T protein:vir:79 131 RFTLMAEATQIPTNQLFIEIAEDLVKYCNVVIAKSRAKDALPFQGMNVMGVGDSMPVAGYFPLNLASMKVKRDKFGMI-- 208 (648) T ss_pred HHHhhccCCCCCHHHHHHHHHHHHHhcCCeEEEEEecCCCccchhhhhhhhccccceeeeEeecCceeEEEEcCCCce-- Confidence 11123689999999999999999999999999998765531 245789999999999998887753 Q ss_pred CceeEEEEeCC-ceEEEechhHeeeecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCC Q lcl|NC_019511. 219 GGNRFVQVIDK-QVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRAD 297 (330) Q Consensus 219 ~~~~Y~q~~~~-~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~ 297 (330) ..|+|...| +....|.++||||++.++..+ +.||+|||++|+++|++++++++|+++||.||++|+|+|.++.+ T Consensus 209 --~~Y~y~~~g~~~~~~~~~~dIIHik~~~~~d---~~~GlSpi~~a~~aI~l~~aa~~~~~~fF~NGa~P~gil~~~~~ 283 (648) T protein:vir:79 209 --KGWQQEQEGQDKPQKFKPEDIVHIYYKREKG---RAFGTPWLLPALDDIRALRQVEENVLRLVYRNLHPLWHVKVGLE 283 (648) T ss_pred --eeeEEEecCCceeEEecCccEEEEccCCCCC---CceeccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCC Confidence 347666554 455679999999998654332 45799999999999999999999999999999999999998755 Q ss_pred CCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 298 QQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 298 ~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) . ...++.+++++.|.+.+.|.. .+...+..+ T Consensus 284 ~-~~~e~~k~~~e~~~~~~~~~~-i~gg~v~~~ 314 (648) T protein:vir:79 284 Q-EGFGAEEGEVDLVRGEVENMD-VEGGMVTTE 314 (648) T ss_pred c-cchHHHHHHHHHHHHhccccc-ccccccccc Confidence 4 345666788888888877653 222233322 No 63 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=100.00 E-value=8.4e-37 Score=218.37 Aligned_cols=235 Identities=9% Similarity=0.006 Sum_probs=163.4 Q ss_pred hhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhH Q lcl|NC_019511. 29 IQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQV 108 (330) Q Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~I 108 (330) +.. + ....+.+........++...+... .... ..+. ...| +++.|++||+.++++| T Consensus 1 M~~-f-------~~~~~~~~~~~~~~~~~~~~~~~~---~~~~--~v~~-------~~al----~~~~V~~~v~~ia~~i 56 (397) T protein:vir:38 1 MPL-L-------KLNKSHSQGFSLNDPDWVNFLTGG---EAQK--YVSA-------DTAL----KNSDIFSLIMQLSGDL 56 (397) T ss_pred Ccc-h-------hhhhcccCcccCCchhhhhhhcCC---cCCc--eech-------HHhh----ccHHHHHHHHHHHHHH Confidence 111 1 111111111111221221111110 0000 0110 1123 3899999999999999 Q ss_pred hhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEe Q lcl|NC_019511. 109 STYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFS 188 (330) Q Consensus 109 a~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~ 188 (330) |++ .|. .++ ...+.+..+||+.+|+++|+++++.++|++|+++++++ T Consensus 57 a~~-----------p~~--~~~------------------~~~~~l~~~PN~~~s~~~f~~~~~~~lll~Gna~~~i~-- 103 (397) T protein:vir:38 57 AMV-----------RYT--SES------------------DRSQSIISNPSVTANGYSFWQGMFAQLLLDGNCYAYRH-- 103 (397) T ss_pred hhC-----------ccc--ccc------------------cHHHHHHhcCCCCCCHHHHHHHHHHHhhhcCCEEEEEE-- Confidence 953 111 111 01122345799999999999999999999999999887 Q ss_pred cCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeC---CceEEEechhHeeeecccCcCCCCCCCccccHHHHHH Q lcl|NC_019511. 189 PKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVID---KQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAM 265 (330) Q Consensus 189 rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~---~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~ 265 (330) |+..|++++|+||+|.+|++..+++|.. ..|.+... ++....|+++||+|++.+...+ ..||+|||+.|. T Consensus 104 r~~~g~~~~l~~l~~~~v~i~~~~~~~~----~~y~~~~~~~~~~~~~~~~~~eiih~~~~~~~~---~~~G~s~i~~~~ 176 (397) T protein:vir:38 104 KNTNGVDLSWEYLRPSQVQPMLLQDGSG----LIYNINFDEPAIGYMENVPAADVIHIRLLSKNG---GKTGISPLSALI 176 (397) T ss_pred ECCCCcEEEEEEEcCceeEEEEcCCCce----EEEEEEeccccccceeEecCccEEEecCCCCCC---ccccccHHHHHH Confidence 4566899999999999999998877642 34544432 4556789999999998764432 337999999999 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 266 KEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 266 ~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) .+|++++++++|+.++|+||++|+|+|.++++ +++++.+++++.|+..++| .|+|+++||-+ T Consensus 177 ~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~--~~~e~~~~~~~~~~~~~~~-~n~~~~~vl~~ 238 (397) T protein:vir:38 177 NEQQIKDASNELTLKALKQSVTASAVLTIQKG--GLLDAETRIARSKEISKQI-HNSDGPVVIDA 238 (397) T ss_pred HHHHHHHHHHHHHHHHHhccCCccEEEEeCCC--CCHHHHHHHHHHHHHHhcc-cccCCceecCC Confidence 99999999999999999999999999998764 7899999999999987766 78999666555 No 64 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=100.00 E-value=1.5e-36 Score=216.96 Aligned_cols=238 Identities=11% Similarity=0.048 Sum_probs=160.2 Q ss_pred chhHHHHHhcCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccch Q lcl|NC_019511. 2 PDLFKSLRLGSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNA 81 (330) Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~ 81 (330) --||+.+++.+. ..+ .....+ ... ..+.|.... .++. T Consensus 1 M~~f~~~~~~~~-------------------------------~~~--~~~~~~------~~~--~~~~~~~~~--~~~~ 37 (386) T protein:vir:48 1 MPIFNITNLATE-------------------------------SPP--ISQGGF------FDI--TDPDFLSTL--NGSE 37 (386) T ss_pred Cccccccccccc-------------------------------ccc--cccccc------ccc--ccchhcccc--cCCc Confidence 001111100000 000 000000 000 001111000 0110 Q ss_pred HHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCc Q lcl|NC_019511. 82 HNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDR 161 (330) Q Consensus 82 ~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~ 161 (330) . -..+.+.++|.|++||+.++++||+. .+. ++++. . ..+..+||+. T Consensus 38 ~---v~~~~~~~~~~v~~~i~~ia~~ia~~-----------p~~--~~~~~----------~--------~~l~~~pN~~ 83 (386) T protein:vir:48 38 W---VSAESALRNSDLFSIINQLSNDLATV-----------KLT--ASRKQ----------L--------QGIIDNPSNN 83 (386) T ss_pred e---echhhhhcchHHHHHHHHHHHhhccC-----------cee--eccch----------h--------HHHhhcCCCC Confidence 0 11233345899999999999999952 222 22111 0 1123478999 Q ss_pred CCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCC---ceEEEechh Q lcl|NC_019511. 162 DSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDK---QVVASFTSR 238 (330) Q Consensus 162 ~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~---~~~~~~~~~ 238 (330) +|+++|+++++.++|++|+++++++ |+..|++++|+||+|.+|++..+.+|.. +.|.+..++ +....|+++ T Consensus 84 ~t~~~f~~~~~~~lll~Gna~~~i~--r~~~g~~~~L~~l~~~~v~v~~~~~~~~----~~y~~~~~~~~~~~~~~~~~~ 157 (386) T protein:vir:48 84 ANRFNFYQSIFAQMLLGGEAFAYRW--RNENGRDMKWEYLRPSQVSFNRLDNKDG----IYYNITFDDPRIPPKQHVPQG 157 (386) T ss_pred CCHHHHHHHHHHHhhhcCcEEEEEE--ECCCCcEEEEEEecCceeEEEEcCCCce----EEEEEEecCccccceeEecCc Confidence 9999999999999999999999886 4567899999999999999988776632 346555544 345679999 Q ss_pred HeeeecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcC Q lcl|NC_019511. 239 ELVMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSG 318 (330) Q Consensus 239 dvih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G 318 (330) ||+|++.++..+ ..||+|||+.++.+|++++++++|+.++|+||++|+|+|+.++ .+++++.+++++.|.+ | T Consensus 158 evih~~~~~~~~---~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~ii~~~~--~~~~e~~~~~~~~~~~---~ 229 (386) T protein:vir:48 158 DVLHFKLLSVDG---GLTSVSPLMALSRELNIQKASDKLTLNSLKNALNANGILKIKG--GGLLDFKTKLSRSRQA---M 229 (386) T ss_pred cEEEecCCCCCC---ceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCC--CCCHHHHHHHHHHHHH---h Confidence 999998654332 3469999999999999999999999999999999999999876 4899999999999976 4 Q ss_pred cccccccceeeC Q lcl|NC_019511. 319 INGSWQICLYIK 330 (330) Q Consensus 319 ~~na~kvpvL~e 330 (330) ..|+|+++||-+ T Consensus 230 ~~n~g~~~vl~~ 241 (386) T protein:vir:48 230 KQMQGGPLVLDD 241 (386) T ss_pred hcCCCCceecCC Confidence 568999776665 No 65 >protein:vir:104259 Length: 403 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006980;genbank:gi:46401881;genbank:GeneID:2777676 Probab=100.00 E-value=2.4e-36 Score=215.85 Aligned_cols=243 Identities=12% Similarity=0.086 Sum_probs=160.3 Q ss_pred hhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhH Q lcl|NC_019511. 29 IQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQV 108 (330) Q Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~I 108 (330) +.+ ++|+..+. .. |+ .+... ...++.++.+..-+ +-+.+.+++.|++||+++++.| T Consensus 1 mg~-~~~~~~~~--------~~-~~-~~~~~--~~~~~~~~~~~~~~-----------t~~~~~~~~~v~~cv~~Ia~~i 56 (403) T protein:vir:10 1 MGF-KSWITEKL--------NP-GQ-RIIRD--MEPVSHRTNRKPFT-----------TGQAYSKIEILNRTANMVIDSA 56 (403) T ss_pred Ccc-hhhhhhcc--------ch-hh-hhhhc--ccccccccCCcccc-----------cHHHHHHHHHHHHHHHHHHHHH Confidence 111 22222110 00 10 00000 00111122211111 1244556899999999999999 Q ss_pred hhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEe Q lcl|NC_019511. 109 STYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFS 188 (330) Q Consensus 109 a~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~ 188 (330) |+.. |.+..+. ...++.+ ....|.+.+++..+||+++|.++|+++++.++|+.||+|+++. T Consensus 57 a~~p-----------~~v~~~~--~~~~~~~----~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gnayi~~~-- 117 (403) T protein:vir:10 57 AECS-----------YTVGDKY--NIVTYAN----GVKTKTLDTLLNVRPNPFMDISTFRRLVVTDLLFEGCAYIYWD-- 117 (403) T ss_pred hhCc-----------eeEeecc--ccccccc----ccccchHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEe-- Confidence 9632 2221110 0001110 1112344455566899999999999999999999999877652 Q ss_pred cCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecccCc-CCCCCCCccccHHHHHHHH Q lcl|NC_019511. 189 PKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPR-SDLNSSGYGLSEVEIAMKE 267 (330) Q Consensus 189 rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~-~d~~~~~yGlSPIe~a~~~ 267 (330) | .+|+|++|..|.+..+..+. .|.+...++ ..|..+||+|++.++. .....+.||+|||++|+++ T Consensus 118 ----~--~~l~~l~~~~~~v~~~~~~~------~~~~~~~~~--~~~~~~eiih~~~~~~~~~~~~~~~G~s~i~~~~~~ 183 (403) T protein:vir:10 118 ----G--TSLYHVPAALMQVEADANKF------IKKFIFNNQ--INYRVDEIIFIKDNSYVCGTNSQISGQSRVATVIDS 183 (403) T ss_pred ----C--ceeEeecCcceEEEEcCCce------EEEEEecCc--eeecccceEEecccccccCCCCCcccccHHHHHHHH Confidence 2 25889999999887665442 233433333 4578899999975432 2222345799999999999 Q ss_pred HHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 268 FIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 268 I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) |+++.++++|+++||+||++|+|||.+++ .++++++++++++|++.|+|..|+|+++||.+ T Consensus 184 i~~~~~~~~~~~~~f~ng~~~~gil~~~~--~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~ 244 (403) T protein:vir:10 184 LEKRSKMLNFKEKFLDNGTVIGLILETDE--ILNKKLRERKQEELQLDYNPSTGQSSVLILDG 244 (403) T ss_pred HHHHHHHHHHHHHHHhccCCcceEEEeCC--CCCHHHHHHHHHHHHHHhCCcccCcceeecCC Confidence 99999999999999999999999998764 59999999999999999999999999777655 No 66 >protein:vir:93943 Length: 409 # NCBI annotation: ORF010 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239936;genbank:gi:66395598;genbank:GeneID:5131009 Probab=100.00 E-value=1.4e-35 Score=211.74 Aligned_cols=238 Identities=9% Similarity=0.041 Sum_probs=162.5 Q ss_pred cccchhccccchh------ccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheec Q lcl|NC_019511. 45 EITKSLYGKQQAY------AEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYS 118 (330) Q Consensus 45 ~~~~~~~g~~~~~------~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~ 118 (330) +.++.+.+|-+.- .+++....+-.|+... +... ...+ .+.+++.|++||+.|+++||+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~---v~~~---~~~~~~~V~~ci~~Ia~~ia~l------- 65 (409) T protein:vir:93 1 MAKENIVTRIKKKLIDNWIDQSTSKLYDFSPWKNR--SFWG---VINN---TLETNETIFSAITKLSNSMASL------- 65 (409) T ss_pred CCccchhhhhhhhhhhhhhccccccccccccccCc--cccc---cchh---hhhccHHHHHHHHHHHHhhhhC------- Confidence 2222233332221 1122211111111111 1000 1111 2335899999999999999963 Q ss_pred ccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEE Q lcl|NC_019511. 119 EKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKF 198 (330) Q Consensus 119 ~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L 198 (330) .+.+.-+.+ . ..|.+.+++..+||+++|.++|+++++.++|++||+|++++ |+..|++++| T Consensus 66 ----p~~~~~~~~---~----------~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~--r~~~G~~~~L 126 (409) T protein:vir:93 66 ----PLKMYEDYK---V----------VNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIE--RDIYHQPSKL 126 (409) T ss_pred ----ceeEeeccc---c----------ccchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEE--ECCCCcEEEE Confidence 222221111 1 12334555667899999999999999999999999988876 5667899999 Q ss_pred EeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHHH Q lcl|NC_019511. 199 IAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFN 278 (330) Q Consensus 199 ~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~ 278 (330) |||+|.+|++..+.++. .+.|.+...++....|+++||+|++.++..+ +.||+|||++++.+|+++.++++|+ T Consensus 127 ~~l~~~~v~~~~~~~~~----~~~y~~~~~~g~~~~~~~~eVih~r~~~~~~---~~~G~s~i~~~~~~i~~~~~~~~~~ 199 (409) T protein:vir:93 127 FLLNPDVVEMLIENQSR----ELYYSIHAATGNKLIVHNMDMLHFKHIVASN---MVQGISPIDVLKNTTDFDNAVRTFN 199 (409) T ss_pred EEEcCceeEEEEeCCCc----EEEEEEEcCCceEEEEccccEEEeCCCCCCC---ccccccHHHHHHHHHHHHHHHHHHH Confidence 99999999998876653 3456666666777789999999998643222 3469999999999999999999984 Q ss_pred HHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 279 DRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 279 ~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) +|.++..+++++..+ ..+++++++++++.|++.++ |+|+++||.+ T Consensus 200 --~~~~~~~~~~i~~~~--~~l~~e~~~~~~~~~~~~~~---~~g~~~vl~~ 244 (409) T protein:vir:93 200 --LTEMQKPDSFMLKYG--SNVGKEKRQQVLEDFKQYYE---ENGGILFQEP 244 (409) T ss_pred --HHhcCCCCceEEecC--CCCCHHHHHHHHHHHHHHhh---cCCCeeecCC Confidence 677777777777644 46999999999999998764 6778777655 No 67 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=100.00 E-value=5.3e-36 Score=213.99 Aligned_cols=235 Identities=11% Similarity=0.022 Sum_probs=159.3 Q ss_pred hhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhH Q lcl|NC_019511. 29 IQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQV 108 (330) Q Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~I 108 (330) +.. ++.+ .+ +.+..... + .+.. .+.+-. +..++... ..+.+.++|.|++||+.++++| T Consensus 1 Mg~-f~~~-------~~-~~~~~~~~--~-~~~~-----~~~~~~--~~~~~~~v---~~~~~l~~~~v~~~i~~ia~~i 58 (382) T protein:vir:48 1 MPI-FNLA-------TE-SPPDNQGG--F-FDVV-----DSDFLA--SLKGNEWV---SAETALRNSDLFSIINQLSNDL 58 (382) T ss_pred Ccc-cccc-------cc-CCcccccc--c-ccch-----hhhccc--cccCCccc---chHhhhccHHHHHHHHHHHHhh Confidence 111 0000 00 00000000 0 0000 000000 00011111 1122334899999999999999 Q ss_pred hhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEe Q lcl|NC_019511. 109 STYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFS 188 (330) Q Consensus 109 a~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~ 188 (330) |+. .|.+. +... ..+...||+.+|+++|+++++.++|+.||+|++++ T Consensus 59 a~~-----------~~~~~--~~~~------------------~~L~~~PN~~~t~~~f~~~l~~~l~l~Gna~~~i~-- 105 (382) T protein:vir:48 59 ATV-----------KLITS--RKKL------------------QGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRW-- 105 (382) T ss_pred ccC-----------ceeee--cchh------------------hhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEE-- Confidence 953 22222 1110 11245699999999999999999999999999886 Q ss_pred cCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCC---ceEEEechhHeeeecccCcCCCCCCCccccHHHHHH Q lcl|NC_019511. 189 PKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDK---QVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAM 265 (330) Q Consensus 189 rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~---~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~ 265 (330) |+..|++++|+||+|.+|++..+.+|.. +.|.+..++ +....|+++||+|++.+...+ ..||+||+++++ T Consensus 106 rd~~G~~~~l~~i~~~~v~v~~~~~~~~----~~y~~~~~~~~~~~~~~~~~~evih~~~~~~~~---~~~G~s~l~~~~ 178 (382) T protein:vir:48 106 RNENGRDMKWEYLRPSQVSFNRLDNKDG----IYYNITFDDPRIPPKQHVPQNDVLHFRLLSVDG---GMTSVSPLMALS 178 (382) T ss_pred ECCCCcEEEEEEEcCceeEEEEcCCCCe----EEEEEEecCccccceeEEcCccEEEecCCCCCC---ccccccHHHHHH Confidence 5666899999999999999988776532 346555544 345679999999998654332 347999999999 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 266 KEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 266 ~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) .+|+++.++++|+.++|+||++|+|+|.+++ .+++++.+++++.|.+ |..|+|+++||-+ T Consensus 179 ~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~--~~~~e~~~~~~~~~~~---~~~n~g~~~vl~~ 238 (382) T protein:vir:48 179 RELDIQKASGNLTINSLKNALNANGILKIKG--GGLLDFKTKLSRSRQA---MKQMQGGPLVLDD 238 (382) T ss_pred HHHHHHHHHHHHHHHHHhccCCCceEEEeCC--CCChHHHHHHHHHHHh---hccCCCCeeEcCC Confidence 9999999999999999999999999999876 4789999999999976 4568899666655 No 68 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=100.00 E-value=2.3e-35 Score=210.46 Aligned_cols=236 Identities=12% Similarity=0.063 Sum_probs=159.7 Q ss_pred hhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhH Q lcl|NC_019511. 29 IQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQV 108 (330) Q Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~I 108 (330) +.+. +.+. ....+.+............++.. .+.... . .-+.+.+++.|++||+.++++| T Consensus 1 Mg~~-~~~~---~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~--v--------~~~~al~~~~v~~~i~~ia~~i 60 (385) T protein:vir:10 1 MGLL-TPRN---FNKRKAKNMVYPSNPAFFTTTVG------GMQLSY--V--------SALSALQNTNVYSVINRIASDV 60 (385) T ss_pred Cccc-cchh---cccccccccccccchhhhhhhcc------ccCccc--c--------CHHHhhccHHHHHHHHHHHHHH Confidence 2221 1000 00001111111111111111111 111110 0 0122334789999999999999 Q ss_pred hhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEe Q lcl|NC_019511. 109 STYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFS 188 (330) Q Consensus 109 a~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~ 188 (330) |+ +.|.+. +.. ... +..+||+.+|.++|+++++.+++++|++|++++. T Consensus 61 a~-----------~p~~v~--~~~-----------------~~~-ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~r- 108 (385) T protein:vir:10 61 AS-----------AHFKTE--NTA-----------------TLN-RLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVG- 108 (385) T ss_pred hh-----------Cceeee--ccc-----------------hhh-hhhcCCCCCCHHHHHHHHHHHhhhcCCeEEEEEc- Confidence 95 233331 110 001 1246999999999999999999999999999763 Q ss_pred cCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEE-eCCceEEEechhHeeeecccCcCCCCCCCccccHHHHHHHH Q lcl|NC_019511. 189 PKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQV-IDKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKE 267 (330) Q Consensus 189 rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~-~~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~ 267 (330) .+.+|+|+++.+|++..+..|. .|.+. ..++...+|+++||+|++.... +...+.||+|||+.|+.+ T Consensus 109 -----~~~~~~p~~~~~v~~~~~~~~~------~~~~~~~~~~~~~~~~~~eiihik~~~~-~~~~~~~G~s~i~~~~~~ 176 (385) T protein:vir:10 109 -----QNLEHIPNSDVQINYLPGNMGI------VYTVLESNDRPQMVLRQDQMLHFRLMPD-PQYRYLIGRSPLESLQNA 176 (385) T ss_pred -----CceeEeecCCceEEEEEcCCce------EEEEEEcCCceEEEEccccEEEeccCCC-CcccccccccHHHHHHHH Confidence 2678999999999988765543 24443 3466778899999999985332 222345799999999999 Q ss_pred HHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 268 FIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 268 I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) |++++++++|+.+||+||++|+|+|+++++ ..++++++++++.|++.++| .|+|+++||-+ T Consensus 177 i~~~~~~~~~~~~~~~ng~~~~gil~~~~~-~~~~e~~~~~~~~~~~~~~~-~n~~~~~vl~~ 237 (385) T protein:vir:10 177 LNLDDKASKSNMSAMENQINPAGKLTISNY-LSDGKDLESAREEFEKANTG-DNSGRLMVLPD 237 (385) T ss_pred HHHHHHHHHHHHHHHhccCCcceEEEeCCC-CCCHHHHHHHHHHHHHHhCc-cccCCccccCC Confidence 999999999999999999999999999864 35789999999999999988 79999777765 No 69 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=100.00 E-value=4.5e-35 Score=208.86 Aligned_cols=238 Identities=11% Similarity=0.045 Sum_probs=159.1 Q ss_pred hhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhH Q lcl|NC_019511. 29 IQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQV 108 (330) Q Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~I 108 (330) +.. ++. +.+.++ .+.... ..+... ..+.+..... .+.... .+...++|.|++||+.++++| T Consensus 1 M~~-f~~-------~~~~~~---~~~~~~-~~~~~~--~~~~~~~~~~--~~~~v~---~~~al~~~~v~~~i~~ia~~i 61 (386) T protein:vir:49 1 MPI-FNI-------TNLATE---SPPINQ-ESFFDI--ADSDFLASLN--SSEWVS---AENALKNSDLFSIISQLSNDL 61 (386) T ss_pred Cch-hhh-------hccCCC---Ccccch-hhhhhh--hhcccccccc--CCceec---hhhhhccHHHHHHHHHHHHHh Confidence 111 111 100010 000000 000000 0111111111 010011 112223789999999999999 Q ss_pred hhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEe Q lcl|NC_019511. 109 STYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFS 188 (330) Q Consensus 109 a~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~ 188 (330) |+. .+.+. ++.. ..+...||+.+|.++|+++++.++|++||+|+++++ T Consensus 62 a~~-----------p~~~~--~~~~------------------~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r- 109 (386) T protein:vir:49 62 ATA-----------KITTS--RKQL------------------QGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWR- 109 (386) T ss_pred hhC-----------ceeec--cchh------------------hhhhhccCCCCCHHHHHHHHHHHhhhcCCEEEEEEE- Confidence 952 22222 1110 012356999999999999999999999999999874 Q ss_pred cCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeC---CceEEEechhHeeeecccCcCCCCCCCccccHHHHHH Q lcl|NC_019511. 189 PKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVID---KQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAM 265 (330) Q Consensus 189 rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~---~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~ 265 (330) +..|++++|+||+|.+|++..+.+|.. ..|.+... ++....|+++||+|++.+...+ ..||+||++.|+ T Consensus 110 -~~~g~~~~l~~i~~~~v~v~~~~~~~~----~~y~~~~~~~~~~~~~~~~~~evih~~~~~~~~---~~~G~s~l~~~~ 181 (386) T protein:vir:49 110 -NDNGRDMKWEYLRPSQVSFNRLDNQNG----LYYNITFDDPHIAPKQHVPQNDILHFRLLSVDG---GLTSVSPLMALG 181 (386) T ss_pred -CCCCcEEEEEEecCceeEEEEcCCCce----EEEEEEEcCccccceeEEccccEEEecCCCCCC---ccccccHHHHHH Confidence 556899999999999999988776532 34555433 4566789999999998643322 236999999999 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 266 KEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 266 ~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) .+|+++.++++|+.++|+||++|+|+|.+++. +++++.+++++.|++ +..|+|+++||-+ T Consensus 182 ~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~--~~~~~~~~~~~~~~~---~~~n~g~~~vl~~ 241 (386) T protein:vir:49 182 REFNIQKASDKLTISALKNALNANGILKIKGG--GLLDFKTKVSRSRQA---MKQMQGGPLVLDD 241 (386) T ss_pred HHHHHHHHHHHHHHHHHHccCCccEEEEeCCC--CChHHHHHHHHHHHH---hccCCCCceecCC Confidence 99999999999999999999999999998764 788999999999976 3468999666655 No 70 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=100.00 E-value=4.2e-35 Score=209.04 Aligned_cols=238 Identities=11% Similarity=0.061 Sum_probs=156.2 Q ss_pred hhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhH Q lcl|NC_019511. 29 IQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQV 108 (330) Q Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~I 108 (330) +.+ ++.+. . +.+++..-++... . + ..|.+-. ...++.. ...+.+..++.|++||+.++++| T Consensus 1 Mgl-f~~~~-----~-~~~~~~~~~~~~~-----~-~-~~~~~~~--~~~~~~~---v~~~~al~~~~V~~~i~~Ia~~i 61 (384) T protein:vir:49 1 MPI-FNITN-----L-ATESPPSNQDSFF-----D-I-TDPEFLD--ALNGSEW---VSAETALKNSDLFSIISQLSNDL 61 (384) T ss_pred Ccc-ccccc-----c-Ccccccccchhhc-----c-c-cchhhcc--cccCCce---echhhhhccHHHHHHHHHHHHHH Confidence 111 10000 0 0011110000000 0 0 0000000 0011111 11233345899999999999999 Q ss_pred hhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEe Q lcl|NC_019511. 109 STYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFS 188 (330) Q Consensus 109 a~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~ 188 (330) |+. .|.+. ++.. ..+..+||+.+|.++|+++++.++|+.|++|+++++ T Consensus 62 a~l-----------~~~~~--~~~~------------------~~l~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r- 109 (384) T protein:vir:49 62 ATA-----------KITTS--RKQL------------------QGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWR- 109 (384) T ss_pred hhC-----------ceeee--cchh------------------hhhhhccCCCCCHHHHHHHHHHHhhhcCCeEEEEEE- Confidence 952 32222 1110 012356999999999999999999999999999874 Q ss_pred cCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCC---ceEEEechhHeeeecccCcCCCCCCCccccHHHHHH Q lcl|NC_019511. 189 PKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDK---QVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAM 265 (330) Q Consensus 189 rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~---~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~ 265 (330) +..|+|++|+||+|.+|++..++++. .+.|.+..++ +....|+++||+|++.+...+ ..+|+|||.+++ T Consensus 110 -~~~g~~~~L~~l~~~~v~v~~~~~~~----~~~y~~~~~~~~~~~~~~~~~~eVih~~~~~~~~---~~~G~s~i~~~~ 181 (384) T protein:vir:49 110 -NENGRDMKWEYLRPSQVSFNRLDNQN----GLYYNITFDDPRIPPKQHVPQGDILHFRLLSVDG---GLTSVSPLMALG 181 (384) T ss_pred -CCCCcEEEEEEEcCceeEEEEcCCCc----eEEEEEEecCccccceeEecCccEEEecCCCCCC---ceeeccHHHHHH Confidence 55689999999999999998776543 2345555443 445689999999998643332 236999999999 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 266 KEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 266 ~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) .+|++++++++|+.++|.||++|+|+|.+++. +++++. +++|.+.+.|..|+|+++||-+ T Consensus 182 ~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~--~~~~~~---~~~~~~~~~~~~n~~~~~vl~~ 241 (384) T protein:vir:49 182 RELNIQKASDKLTLNALKNALNANGILKIKGG--GLLDFK---TKQSRSRQAMKQMQGGPLVLDD 241 (384) T ss_pred HHHHHHHHHHHHHHHHHhccCCCceEEEeCCC--CChHHH---HHHHHHHHhcccCCccceecCC Confidence 99999999999999999999999999999874 445443 3456677888899999666655 No 71 >protein:vir:96980 Length: 409 # NCBI annotation: ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239857;genbank:gi:66395516;genbank:GeneID:5133013 Probab=100.00 E-value=7.9e-35 Score=207.56 Aligned_cols=244 Identities=10% Similarity=0.069 Sum_probs=157.8 Q ss_pred ccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHH Q lcl|NC_019511. 24 PIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIIT 103 (330) Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~ 103 (330) +..+.+ +..+...++.. ....++...++-. .|..+ +...- ..+ .+..+|.|++||+. T Consensus 1 ~~~~~~---~~~~k~~~~~~------------~~~~~~~~~~~~~-~~~~~-~~~~v---~~~---~a~~~~~V~~ci~~ 57 (409) T protein:vir:96 1 MAKENI---VTRIKKKLIDN------------WIDQSASKLYDFS-PWKNK-SFWGV---INN---TLETNETIFSAITK 57 (409) T ss_pred Cccccc---hhhhhhHHhhh------------hhccccccccccc-cccCc-ccccc---chh---hHhhhHHHHHHHHH Confidence 111111 11111111100 0001111111111 12111 11010 111 12237999999999 Q ss_pred HHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCcee Q lcl|NC_019511. 104 RANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNF 183 (330) Q Consensus 104 ~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~ 183 (330) ++++||+.. |.+.-+. +. ..|.+.+++..+||+++|.++|+++++.++|++||+|+ T Consensus 58 ia~~ia~lp-----------~~~~~~~---~~----------~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~ 113 (409) T protein:vir:96 58 LSNSMASLP-----------LKMYEDY---KV----------VNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYV 113 (409) T ss_pred HHHhhhhCc-----------eEEeecc---cc----------cchhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEE Confidence 999999632 2221110 11 12344555666899999999999999999999999988 Q ss_pred EEEEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecccCcCCCCCCCccccHHHH Q lcl|NC_019511. 184 EKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEI 263 (330) Q Consensus 184 ~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~ 263 (330) +++ |+..|+|++|||++|..|++..+.++. ...|.+...++....|.++||+|++.++.. .+.||+||+++ T Consensus 114 ~i~--r~~~G~~~~L~~l~~~~v~v~~~~~~~----~~~y~~~~~~g~~~~~~~~evih~r~~~~~---~~~~G~s~l~~ 184 (409) T protein:vir:96 114 LIE--RDIYHQPSKLFLLNPDVVEMLIENQSR----ELYYSIHAATGNKLIVHNMDMLHFKHIVAS---NMVQGISPIDV 184 (409) T ss_pred EEE--ECCCCcEEEEEEEcCceeEEEEeCCCc----EEEEEEEcCCceEEEEccccEEEeCCCCCC---CccccccHHHH Confidence 886 566789999999999999998876653 235666666677778999999999864322 23469999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 264 AMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 264 a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) ++.+|+++.++++++ |+.++..+++++.. +..+++++++++++.|++.++ |+|+++||-+ T Consensus 185 ~~~~i~~~~~~~~~~--~~~~~~~~~~i~~~--~~~l~~e~~~~~~~~~~~~~~---n~g~~~vl~~ 244 (409) T protein:vir:96 185 LKNTTDFDNAVRTFN--LTEMQKPDSFMLKY--GSNVSTEKRQQVLEDFKQYYE---ENGGILFQEP 244 (409) T ss_pred HHHHHHHHHHHHHHH--HHhcCCCceeEEec--CCCCCHHHHHHHHHHHHHHhh---cCCCeeecCC Confidence 999999999999884 55555555555543 357999999999999998874 6788777655 No 72 >protein:vir:2683 Length: 412 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075502;genbank:gi:12719431;genbank:GeneID:920150 Probab=100.00 E-value=7.4e-35 Score=207.71 Aligned_cols=247 Identities=9% Similarity=0.032 Sum_probs=160.7 Q ss_pred hhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhH Q lcl|NC_019511. 29 IQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQV 108 (330) Q Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~I 108 (330) ++..- .+.-+.+. ++++.++.. .++.. .+.....|... + ...... ..+.++|.|++||+.++++| T Consensus 1 m~~~~--~~~~~~~~---~~~~~~~~~--~~~~~-~~~~~~~~~~~-~---~~~v~~---~~a~~~~~v~~~i~~ia~~i 65 (412) T protein:vir:26 1 MNVIA--KENIVTRI---KKKLIDNWI--DQSTS-KLYDFSPWKNR-S---FWGVIN---NTLETNETIFSAITKLSNSM 65 (412) T ss_pred Cccch--hhhhhhhh---hhhHhhhhh--ccccc-ccccccccCCc-c---ccccch---hhhhccHHHHHHHHHHHHhH Confidence 11100 00000000 111111110 01111 11111111111 0 000111 12334899999999999999 Q ss_pred hhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEe Q lcl|NC_019511. 109 STYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFS 188 (330) Q Consensus 109 a~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~ 188 (330) |+. .|.+.-+. +. ..|.+..++..+||+.+|.++|+++++.++|++||+|++++ T Consensus 66 A~l-----------p~~~~~~~---~~----------~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~-- 119 (412) T protein:vir:26 66 ASL-----------PLKMYEDY---KV----------VNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIE-- 119 (412) T ss_pred hhC-----------ceeEeecc---cc----------ccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCceEEEEE-- Confidence 963 22221111 11 12334445556899999999999999999999999999886 Q ss_pred cCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecccCcCCCCCCCccccHHHHHHHHH Q lcl|NC_019511. 189 PKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKEF 268 (330) Q Consensus 189 rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I 268 (330) |+..|++++|+||+|.+|++..+.+++ .+.|.+...++....|+++||+|++.++..+ +.||+|||++|+.+| T Consensus 120 r~~~G~~~~L~~l~~~~v~v~~~~~~~----~~~y~~~~~~g~~~~~~~~evih~~~~~~~~---~~~G~s~i~~~~~~i 192 (412) T protein:vir:26 120 RDIYHQPSKLFLLNPDVVEMLIENQSR----ELYYSIHAATGNKLIVHNMDMLHFKHIVASN---MVQGISPIDVLKNTT 192 (412) T ss_pred ECCCCcEEEEEEEcCceeEEEEeCCCc----EEEEEEEcCCceEEEEccccEEEeCCCCCCC---CcccccHHHHHHHHH Confidence 556789999999999999998877653 3457666667777889999999998653332 346999999999999 Q ss_pred HHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 269 IAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 269 ~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) +++.++++|+ ++.++..+++++..+ ..+++++++++++.|++.++ |+|+++||-+ T Consensus 193 ~~~~a~~~~~--~~~~~~~~~~i~~~~--~~l~~e~~~~~~~~~~~~~~---~~g~~~vl~~ 247 (412) T protein:vir:26 193 DFDNAVRTFN--LTEMQKPDSFMLKYG--SNVGKEKRQQVLEDFKQYYE---ENGGILFQEP 247 (412) T ss_pred HHHHHHHHHH--HHhcCCCCceEEecC--CCCCHHHHHHHHHHHHHHhh---cCCCeeecCC Confidence 9999999994 666777777777654 46999999999999998764 6778777766 No 73 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=100.00 E-value=7.5e-35 Score=207.66 Aligned_cols=237 Identities=12% Similarity=0.028 Sum_probs=158.3 Q ss_pred hhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhH Q lcl|NC_019511. 29 IQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQV 108 (330) Q Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~I 108 (330) +.+.-+ ... .+.+. +. ...+... .++..-.+...+.... -+.+.+++.|++||+.++++| T Consensus 1 Mg~~~~----~~~--~k~~~----~~--~~~~~~~-----~~~~~~~~~~~~~~v~---~~~~l~~~~v~~~i~~ia~~i 60 (383) T protein:vir:10 1 MGLLTP----KNF--SKRNA----KN--MVYPSNP-----AFFTTTVGGMQLSYVS---ALSALQNTNVYSVINRIASDV 60 (383) T ss_pred CCcccc----ccc--ccccc----cc--cccccch-----hhhhhhccCccccccc---hhHhhcchHHHHHHHHHHHhh Confidence 111100 000 00000 00 0000000 0000000000000001 122334899999999999999 Q ss_pred hhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEe Q lcl|NC_019511. 109 STYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFS 188 (330) Q Consensus 109 a~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~ 188 (330) |+. .+++. +.. . .. +...||+.+|.++|+++++.++|+.|++|++++ T Consensus 61 a~~-----------~~~~~--~~~----------~---~~-----ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~-- 107 (383) T protein:vir:10 61 SSA-----------HFKTE--NTA----------T---LN-----RLESPSSLIGRFSFWQGALMQLCLSGNDYIPLV-- 107 (383) T ss_pred ccC-----------ceeec--ccc----------h---hh-----hhhCCCCCCCHHHHHHHHHHHhhhcCCeEEEEE-- Confidence 952 33322 100 0 01 123699999999999999999999999999875 Q ss_pred cCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecccCcCCCCCCCccccHHHHHHHHH Q lcl|NC_019511. 189 PKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKEF 268 (330) Q Consensus 189 rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I 268 (330) + .+.+++|+++.+|++..+.+|.. .+++...++...+|+++||+|++..+ .+.....||+|||++|..+| T Consensus 108 ~----~~~~~~p~~~~~v~~~~~~~~~~-----~~~~~~~~~~~~~~~~~evih~r~~~-~~~~~~~~G~s~l~~~~~~i 177 (383) T protein:vir:10 108 G----QNLEHIPNSDVQINYLPGNMGIV-----YTVLESNDRPKMVLRQDQMLHFRLMP-DPQYRYLIGRSPLESLQNAL 177 (383) T ss_pred c----CceeEeecCcceEEEEEcCCceE-----EEEEEcCCceEEEEcccceEEeccCC-CCcccccccccHHHHHHHHH Confidence 2 36789999999999877655432 23334457778899999999997543 23333457999999999999 Q ss_pred HHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 269 IAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 269 ~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) .+++++++|+++||+||++|+|+|.++++ ..++++.++++++|++.++| .|+|+++||-+ T Consensus 178 ~~~~~~~~~~~~~f~ng~~~~~il~~~~~-~~~~e~~~~~~~~~~~~~~~-~n~~~~~vl~~ 237 (383) T protein:vir:10 178 NLDDKASKSNMSAMENQINPAGKLTISNY-LSDGKDLESAREEFEKANTG-DNSGRLMVLPD 237 (383) T ss_pred HHHHHHHHHHHHHHhccCCcceEEEeCCC-CCCHHHHHHHHHHHHHHhCc-cccCCccccCC Confidence 99999999999999999999999999864 35799999999999999988 69999777655 No 74 >protein:vir:94426 Length: 409 # NCBI annotation: ORF009 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240003;genbank:gi:66395665;genbank:GeneID:5133086 Probab=100.00 E-value=1.1e-34 Score=206.72 Aligned_cols=238 Identities=9% Similarity=0.068 Sum_probs=158.9 Q ss_pred cccchhccccch------hccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheec Q lcl|NC_019511. 45 EITKSLYGKQQA------YAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYS 118 (330) Q Consensus 45 ~~~~~~~g~~~~------~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~ 118 (330) ++++.+-+|-|. ...++....+-.+ |..+ +...- ..+ .+..+|.|++||+.++++||+.. T Consensus 1 ~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~-~~~~-~~~~v---~~~---~a~~~~~v~~~i~~Ia~~ia~lp------ 66 (409) T protein:vir:94 1 MAKENIVTRIKKKLIDNWIDQSASKLYDFSP-WKNK-SFWGV---INN---TLETNETIFSAITKLSNSMASLP------ 66 (409) T ss_pred CcccccchhhhhHHhhhhhcCCccccccccc-ccCc-ccccc---chh---hhhccHHHHHHHHHHHHhhhhCc------ Confidence 111112222111 0111111111111 1111 11110 111 12248999999999999999632 Q ss_pred ccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEE Q lcl|NC_019511. 119 EKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKF 198 (330) Q Consensus 119 ~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L 198 (330) +.+.-+.+ . ..|.+.+++..+||+++|.++|+++++.++|++||+|++++ |+..|+|++| T Consensus 67 -----~~~~~~~~---~----------~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~--r~~~G~~~~L 126 (409) T protein:vir:94 67 -----LKMYEDYK---V----------VNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIE--RDIYHQPSKL 126 (409) T ss_pred -----eeEeeccc---c----------cchhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEE--ECCCCcEEEE Confidence 22211111 1 12334555666899999999999999999999999998886 5667899999 Q ss_pred EeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHHH Q lcl|NC_019511. 199 IAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFN 278 (330) Q Consensus 199 ~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~ 278 (330) |||+|.+|++..+.++. .+.|.+...++....|.++||+|++..+..+ +.||+|||..++.+|+++.++++|+ T Consensus 127 ~~l~~~~v~v~~~~~~~----~~~y~~~~~~g~~~~~~~~dvih~r~~~~~~---~~~G~s~l~~~~~~i~~~~~~~~~~ 199 (409) T protein:vir:94 127 FLLNPDVVEMLIENQSR----ELYYSIHAATGNKLIVHNMDMLHFKHIVASN---MVQGISPIDVLKNTTDFDNAVRTFN 199 (409) T ss_pred EEEcCceeEEEEeCCCc----EEEEEEEcCCceEEEEccccEEEecCCCCCC---ccccccHHHHHHHHHHHHHHHHHHH Confidence 99999999998876653 2346555556667789999999998543222 3469999999999999999999984 Q ss_pred HHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 279 DRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 279 ~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) +|.++..+++++..+ ..+++++.+++++.|++.++ |+|+++||.+ T Consensus 200 --~~~~~~~~~~i~~~~--~~l~~e~~~~~~~~~~~~~~---~~g~~~vl~~ 244 (409) T protein:vir:94 200 --LTEMQKPDSFMLKYG--SNVGKEKRQQVLEDFKQYYE---ENGGILFQEP 244 (409) T ss_pred --HHhcCCCCeeEEecC--CCCCHHHHHHHHHHHHHHhh---cCCCeeecCC Confidence 666666666776544 46999999999999998774 6778777655 No 75 >protein:vir:103971 Length: 376 # NCBI annotation: pbsx family phage portal protein # Family: family:all:196 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293752;genbank:gi:72537722;genbank:GeneID:3608098 Probab=100.00 E-value=2.8e-34 Score=204.50 Aligned_cols=237 Identities=17% Similarity=0.206 Sum_probs=161.3 Q ss_pred CccCcchhHHHHH--------------------------HHHHHHHhhcccchhcccc--chh--ccccccc-------- Q lcl|NC_019511. 23 VPIDDGIQANIRQ--------------------------IEQDTKEMQEITKSLYGKQ--QAY--AEPFLEM-------- 64 (330) Q Consensus 23 ~~~~~~~~~~~~~--------------------------~~~~~~~~~~~~~~~~g~~--~~~--~~~~~~~-------- 64 (330) +|.-+......++ ..++ +. + ++ ..++- ..+ .+|.... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~-~-~~-~~~~~~~f~fg~p~~v~~~~~~~~~~~ 75 (376) T protein:vir:10 1 MPARDRPRAARRRRHSFIFIHGVLRMSKRRSRAPRTFAAAPNP--SA-G-SA-APARAEVFTFDDPTPVMNRAEILDYVE 75 (376) T ss_pred CCCCccchhhhhhcccchhhcccccchhccCCCcccchhhhhH--hh-h-cc-CcceeEEEEcCCceeccCcchhhhhhh Confidence 1111111000000 0000 00 0 00 00111 111 1121110 Q ss_pred cccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHH Q lcl|NC_019511. 65 MDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMK 144 (330) Q Consensus 65 ~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~ 144 (330) .--+..|.+||-.+. .+.+.+-.|+.+.+||..+++.++. T Consensus 76 ~~~~~~~~~pp~~~~------~La~~~~~~~~h~s~l~~k~n~l~~---------------------------------- 115 (376) T protein:vir:10 76 CWSNGEWFEPPVSFA------GLAKSFRASTHHSSALFFKANVLAS---------------------------------- 115 (376) T ss_pred hhhcCceecCCCCHH------HHHHHHhhhHHhhhhHHHHhHHHHh---------------------------------- Confidence 011345666654322 2234344488899999888877752 Q ss_pred HHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEE Q lcl|NC_019511. 145 RIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFV 224 (330) Q Consensus 145 ~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~ 224 (330) .-.||+++|..+|.+ ++.|+|++||+|+++++ ++.|+|++|+||+|.+|++..+.+ +|+ T Consensus 116 ----------~~~Pnp~lT~~~f~~-~v~d~ll~Gnay~~~~r--n~~G~~~~L~pl~~~~vr~~~d~~--------~~~ 174 (376) T protein:vir:10 116 ----------TFRPHRWLSRHAFER-WALDFLTFGNGYLERRR--NMVGGTLRLEPALAKYVRRKADFN--------GFV 174 (376) T ss_pred ----------ccCCCCCCCHHHHHH-HHHHHHhcCCeEEEEEE--CCCCCEEEEEEeCCcceEEEeeCC--------eEE Confidence 013788899999975 55699999999999874 557899999999999999876644 366 Q ss_pred EEeCCceEEEechhHeeeecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHH Q lcl|NC_019511. 225 QVIDKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHA 304 (330) Q Consensus 225 q~~~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~ 304 (330) |+..++....|+++||+|++.. +.....||+||+.+|+++|.++.++++|+.+||.||++|+|||.+++ ..+++|+ T Consensus 175 ~~~~~~~~~~~~~~eViHir~~---~~~~~~yGls~~~~a~~si~l~~aa~~f~~~~f~NGa~pggIl~~~d-~~l~~e~ 250 (376) T protein:vir:10 175 YVNGWQERHEFEPDSVFQLVRP---DINQEVYGLPEYLSSLHSAWLNESSTLFRRKYYENGSHAGFILYMTD-AAQKQDD 250 (376) T ss_pred EEEcCCeEEEEccccEEEecCC---CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecC-CCCCHHH Confidence 7777788888999999999753 33335589999999999999999999999999999999999998754 5699999 Q ss_pred HHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 305 LENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 305 ~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) +++||++|++ +.|.+|+|++.|+.. T Consensus 251 ~~~lr~~~~~-~~G~~N~~~~~vl~~ 275 (376) T protein:vir:10 251 VDNMRDALKN-AKGPGNFRNVFMYAP 275 (376) T ss_pred HHHHHHHHHH-hcCccccCceeEecC Confidence 9999999987 689999999777642 No 76 >protein:vir:79207 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111036;genbank:gi:134288763;genbank:GeneID:4960726 Probab=100.00 E-value=1.2e-34 Score=206.54 Aligned_cols=248 Identities=16% Similarity=0.181 Sum_probs=162.9 Q ss_pred HHhcCCCCCCcccccCccCcchhHHHHHH-HHHHHHhhcccchhcccc-chhccccccccccCCCCCcCCCcccchHHHH Q lcl|NC_019511. 8 LRLGSMYKEDTEDLMVPIDDGIQANIRQI-EQDTKEMQEITKSLYGKQ-QAYAEPFLEMMDTNPDYRDKKSYMRNAHNLH 85 (330) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~g~~-~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~ 85 (330) |.|++ +.+.. +.+...........- .-.+++...+.....+++ -.|.+=.. ...|.+||-.+. T Consensus 1 ~~~~~-~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~~~~~~~-----~~~~~~pp~~~~------ 65 (351) T protein:vir:79 1 MSKRR-SRAPR---TFAAAPNPSAGSAAPARAEVFTFDDPTPVMNRAEILDYVECWS-----NGEWFEPPVSFA------ 65 (351) T ss_pred CCCCC-CCCCC---CCCCCCchhhhhcccceeEEEEcCCceeecCcchhhhhhhhhh-----cCceecCCCCHH------ Confidence 32221 11221 122211111100000 000000000011111111 12222111 335666654321 Q ss_pred HHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHH Q lcl|NC_019511. 86 EVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQ 165 (330) Q Consensus 86 ~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~ 165 (330) .+.+.+-.|+.+.+||...++.++. .-.||+.+|.. T Consensus 66 ~la~~~~~~~~h~~~l~~k~n~l~~--------------------------------------------~~~Pnp~~t~~ 101 (351) T protein:vir:79 66 GLAKSFRASTHHSSALFFKANVLAS--------------------------------------------TFRPHRWLSRH 101 (351) T ss_pred HHHHHHhhhHhhhhhhhhhhhHHhh--------------------------------------------cccCCCCCCHH Confidence 1223333388889998887777652 01378889999 Q ss_pred HHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecc Q lcl|NC_019511. 166 EFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIR 245 (330) Q Consensus 166 ~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~ 245 (330) +|. +++.|+|++||+|+++++ ++.|++++|+|++|.+|++..+.+| |+|+..++....|+++||+|++. T Consensus 102 ~f~-~~v~d~ll~Gnay~~~~r--~~~G~~~~L~~l~~~~v~~~~~~~~--------~~~~~~~g~~~~~~~~eIihir~ 170 (351) T protein:vir:79 102 AFE-RWALDFLTFGNGYLERRR--NMVGGTLRLEPALAKYVRRKADFSG--------FVYVNGWQERHEFEPDSVFQLVR 170 (351) T ss_pred HHH-HHHHHHHhcCCeEEEEEE--CCCCCEEEEEEeCCcceeeeecCCe--------EEEEecCceEEEEcCccEEEeCC Confidence 995 567899999999999875 5568999999999999998665443 66777778888999999999974 Q ss_pred cCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCccccccc Q lcl|NC_019511. 246 NPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQI 325 (330) Q Consensus 246 n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kv 325 (330) . +.....||+||+..|+++|.++++++.|+.+||+||++|+|||.+++ ..+|+|++++||++|++ +.|.+|+|++ T Consensus 171 ~---~~~~~~yGl~~~~~a~~si~l~~~a~~~~~~~f~NGa~pg~il~~~~-~~ls~e~~~~lk~~~~~-~~G~~N~~~~ 245 (351) T protein:vir:79 171 P---DINQEVYGLPEYLSSLHSAWLNESSTLFRRKYYENGSHAGFILYMTD-AAQKQDDVDNMRDALKN-AKGPGNFRNV 245 (351) T ss_pred C---CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecC-CCCCHHHHHHHHHHHHH-hcCccccCce Confidence 3 33335589999999999999999999999999999999999998764 56999999999999986 6899999996 Q ss_pred ceeeC Q lcl|NC_019511. 326 CLYIK 330 (330) Q Consensus 326 pvL~e 330 (330) .|+.. T Consensus 246 ~v~~~ 250 (351) T protein:vir:79 246 FMYAP 250 (351) T ss_pred eEecC Confidence 66642 No 77 >protein:vir:5691 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839850;genbank:gi:30065705;genbank:GeneID:1260599 Probab=100.00 E-value=3.9e-35 Score=209.24 Aligned_cols=232 Identities=16% Similarity=0.194 Sum_probs=157.9 Q ss_pred cCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccchh----ccccccc--------cccCCCCCcCCCcc Q lcl|NC_019511. 11 GSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAY----AEPFLEM--------MDTNPDYRDKKSYM 78 (330) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~----~~~~~~~--------~~~~p~~~~~~s~~ 78 (330) +++.++. .+ . +. ... +.+..++-.++ .+|.+.. .-.+..|.+||-++ T Consensus 1 ~~~~~~~-~~-~-~~----------~~~--------~~~~~~~~~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~ 59 (344) T protein:vir:56 1 MSKKKGK-TP-Q-PA----------AKT--------MTASAPKMEAFTFGEPVPVLDRRDILDYVECISNGRWYEPPVSF 59 (344) T ss_pred CCCCCCC-CC-c-hh----------hHH--------hhcCCCceEEEEcCCceeecCcchhhhHHHhhhcCccccCCCCH Confidence 2222111 00 0 00 000 00011111111 1222111 11245677776443 Q ss_pred cchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCC Q lcl|NC_019511. 79 RNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKD 158 (330) Q Consensus 79 r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~p 158 (330) .+ ..+.+| .|+.+.+||...++.|+. .-.| T Consensus 60 ~~---la~~~~---a~~~h~s~i~~k~n~l~~--------------------------------------------~~~P 89 (344) T protein:vir:56 60 TG---LAKSLR---AAVHHSSPIYVKRNILAS--------------------------------------------TFIP 89 (344) T ss_pred HH---HHHHHh---hhhhhCccceehhhhHHh--------------------------------------------hcCC Confidence 22 112222 377777777666655541 1148 Q ss_pred CCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechh Q lcl|NC_019511. 159 IDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSR 238 (330) Q Consensus 159 n~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~ 238 (330) |+++|..+| ++++.|+|++||+|+++++ ++.|++++|+|++|.+|++..+.+ .|+++..++....|.++ T Consensus 90 np~~t~~~f-~~~~~d~ll~Gnay~~~~r--n~~G~~~~L~pl~~~~v~~~~~~~--------~~~~~~~~g~~~~~~~~ 158 (344) T protein:vir:56 90 HPWLSQQDF-SRFVLDFLVFGNAFLEKRY--STTGKVIRLETSPAKYTRRGVEED--------VYWWVPSFNEPTAFAPG 158 (344) T ss_pred CCCCCHHHH-HHHHHHHHhcCCeEEEEEE--CCCCcEEEEEEeCCceeEEeecCC--------EEEEEecCCeEEEEcCc Confidence 999999998 7788999999999999874 567899999999999998865432 36677778888899999 Q ss_pred HeeeecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcC Q lcl|NC_019511. 239 ELVMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSG 318 (330) Q Consensus 239 dvih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G 318 (330) ||+|++.. +.....||+||+..|+++|.++.++++|+++||.||++|+|||.+++ ..+|+|++++||++|++.. | T Consensus 159 dIiHir~~---~~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~f~NGa~pg~Il~~~d-~~ls~e~~~~lk~~~~~~~-g 233 (344) T protein:vir:56 159 SVFHLLEP---DINQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAGYIMYVTD-AVQDRNDIEMLRENMVKSK-G 233 (344) T ss_pred cEEEECCC---CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecC-CCCCHHHHHHHHHHHHHhc-C Confidence 99999742 22234579999999999999999999999999999999999998764 4699999999999999865 4 Q ss_pred cccccccceee-----C Q lcl|NC_019511. 319 INGSWQICLYI-----K 330 (330) Q Consensus 319 ~~na~kvpvL~-----e 330 (330) . |+||+++|. | T Consensus 234 ~-~~~r~l~l~~p~g~~ 249 (344) T protein:vir:56 234 R-NNFKNLFLYAPQGKA 249 (344) T ss_pred C-CCccceEEecCCCCc Confidence 3 789988774 2 No 78 >protein:vir:6058 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878199;genbank:gi:33438898;genbank:GeneID:1457733 Probab=100.00 E-value=8e-35 Score=207.51 Aligned_cols=236 Identities=15% Similarity=0.164 Sum_probs=156.9 Q ss_pred cCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhcccccc--------ccccCCCCCcCCCcccchH Q lcl|NC_019511. 11 GSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLE--------MMDTNPDYRDKKSYMRNAH 82 (330) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~--------~~~~~p~~~~~~s~~r~~~ 82 (330) +++.++- ... .-.++ ..+....+..-...-.+|.+. ..-.+..|.+||-.+.+. T Consensus 1 m~~~~~~--~~~------------~~~~~---~~~~~~~~~~~~f~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~l- 62 (344) T protein:vir:60 1 MSKKKGK--TLQ------------PAAKK---MTASAPKMEAFTFGEPVPVLDRRDILDYVECISNGRWYEPPISFTGL- 62 (344) T ss_pred CCcccCC--CCC------------chHHh---hcCCcCcEEEEEcCCceeecCCcchhHHHHhhhcCccccCCCCHHHH- Confidence 1111111 000 00000 000000000000111111111 011245676666443221 Q ss_pred HHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcC Q lcl|NC_019511. 83 NLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRD 162 (330) Q Consensus 83 ~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~ 162 (330) .+.+| .|+.+..||...++.|+. .-.||+++ T Consensus 63 --a~~~~---a~~~h~~~i~~k~n~l~~--------------------------------------------~~~Pn~~~ 93 (344) T protein:vir:60 63 --AKSLR---AAVHHSSPIYVKRNILAS--------------------------------------------TFIPHPWL 93 (344) T ss_pred --HHHHH---hhhhhccchhhhhhHHHh--------------------------------------------hccCCCCC Confidence 12222 367777777666665541 01388899 Q ss_pred CHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeee Q lcl|NC_019511. 163 SFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVM 242 (330) Q Consensus 163 s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih 242 (330) |..+| ++++.|+|++||+|+++++ ++.|+|++|+||+|.+|++..+.+ +|+++..++....|.++||+| T Consensus 94 t~~~f-~~~~~d~ll~Gnay~~i~r--n~~G~~~~L~~l~~~~vr~~~~~~--------~~~~v~~~~~~~~~~~~eIiH 162 (344) T protein:vir:60 94 SQQDF-SRFVLDFLVFGNAFLEKRY--STTGKVIRLETSPAKYTRRGVEED--------VYWWVPSFNEPTAFAPGSVFH 162 (344) T ss_pred CHHHH-HHHHHHHHhcCCeEEEEEE--CCCCcEEEEEEcCcceEEEeecCC--------eEEEEccCCeEEEEcCccEEE Confidence 99988 7788999999999999874 567899999999999998865432 366777788888999999999 Q ss_pred ecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccc Q lcl|NC_019511. 243 GIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGS 322 (330) Q Consensus 243 ~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na 322 (330) ++.. +...+.||+||+..|+++|.++.++++|+.+||.||++|+|||.+++ ..+|+|++++||++|++.+ |. |+ T Consensus 163 ir~~---~~~~~~yGlsp~~~a~~si~l~~~a~~~~~~~f~NG~~pg~il~~~~-~~ls~e~~~~ik~~~~~~~-g~-~~ 236 (344) T protein:vir:60 163 LLEP---DINQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAGYIMYVTD-AVQDRNDIEMLRENMVKSK-GR-NN 236 (344) T ss_pred EcCC---CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecC-cCCCHHHHHHHHHHHHHhc-CC-CC Confidence 9742 22234579999999999999999999999999999999999998764 5699999999999999876 53 78 Q ss_pred cccceee-----C Q lcl|NC_019511. 323 WQICLYI-----K 330 (330) Q Consensus 323 ~kvpvL~-----e 330 (330) ||.++|. | T Consensus 237 ~r~~~l~~p~g~~ 249 (344) T protein:vir:60 237 FKNLFLYAPQGKA 249 (344) T ss_pred CcceEEecCCCCc Confidence 8877775 2 No 79 >protein:vir:78191 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111155;genbank:gi:134288732;genbank:GeneID:4960651 Probab=100.00 E-value=2.9e-34 Score=204.41 Aligned_cols=248 Identities=16% Similarity=0.183 Sum_probs=162.7 Q ss_pred HHhcCCCCCCcccccCccCcchhHHHHHH-HHHHHHhhcccchhcccc-chhccccccccccCCCCCcCCCcccchHHHH Q lcl|NC_019511. 8 LRLGSMYKEDTEDLMVPIDDGIQANIRQI-EQDTKEMQEITKSLYGKQ-QAYAEPFLEMMDTNPDYRDKKSYMRNAHNLH 85 (330) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~g~~-~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~ 85 (330) |.|++ +.+.. +.+...........- .-.+++...+.....+++ -.|.+=.. ...|.+||-.+.+ T Consensus 1 ~~~~~-~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~~~~~~~-----~~~~~~pp~~~~~----- 66 (351) T protein:vir:78 1 MSKRR-SRAPR---TFAAAPNPSAGSAAPARAEVFTFDDPTPVMNRAEILDYVECWS-----NGEWFEPPVSFAG----- 66 (351) T ss_pred CCCCC-CCCCC---CCCCCCchhhhhcccceeEEEEcCCceeecCcchhhhhhhhhc-----cCceecCCCCHHH----- Confidence 32221 11221 112211111100000 000000000011111111 12222111 3356666543221 Q ss_pred HHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHH Q lcl|NC_019511. 86 EVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQ 165 (330) Q Consensus 86 ~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~ 165 (330) +.+.+-.|+.+.+||...++.++. .-.||+.+|.. T Consensus 67 -la~~~~~~~~h~~~l~~k~n~l~~--------------------------------------------~~~Pn~~~t~~ 101 (351) T protein:vir:78 67 -LAKSFRASTHHSSALFFKANVLAS--------------------------------------------TFRPHRWLSRH 101 (351) T ss_pred -HHHHHhhhHhhhhhhhhhhhHHhh--------------------------------------------cccCCCCCCHH Confidence 222232378888998887777752 01378889999 Q ss_pred HHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecc Q lcl|NC_019511. 166 EFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIR 245 (330) Q Consensus 166 ~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~ 245 (330) +|. +++.|+|++||+|+++++ ++.|++++|+|++|.+|++..+.++ |+|+..++....|.++||+|++. T Consensus 102 ~f~-~~~~d~ll~Gnay~~~~r--n~~G~~~~L~pl~~~~v~~~~~~~~--------~~~~~~~~~~~~~~~~eVihir~ 170 (351) T protein:vir:78 102 AFE-RWALDFLTFGNGYLERRR--NMVGGTLRLEPALAKYVRRKADFSG--------FVYVNGWQERHEFAPDSVFQLVR 170 (351) T ss_pred HHH-HHHHHHHhcCCeEEEEEE--CCCCCEEEEEEecCcceEEeeeCCe--------EEEEecCCeEEEEccccEEEEcC Confidence 996 456799999999999875 5568999999999999998765443 66677778888999999999974 Q ss_pred cCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCccccccc Q lcl|NC_019511. 246 NPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQI 325 (330) Q Consensus 246 n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kv 325 (330) . +.....||+||+..|+++|.++.+++.|+++||+||++|+|||.+++ ..+|+|++++||++|++ +.|.+|+|++ T Consensus 171 ~---~~~~~~yGl~~~~~a~~si~l~~~a~~~~~~~f~NGa~pggIl~~~~-~~ls~e~~~~lr~~~~~-~~G~~N~~~~ 245 (351) T protein:vir:78 171 P---DINQEVYGLPEYLSSLHSAWLNESSTLFRRKYYENGSHAGFILYMTD-AAQKQDDVDNMRDALKN-AKGPGNFRNV 245 (351) T ss_pred C---CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecC-CCCCHHHHHHHHHHHHH-hcCcccccce Confidence 2 33345589999999999999999999999999999999999998754 56999999999999986 6899999997 Q ss_pred ceeeC Q lcl|NC_019511. 326 CLYIK 330 (330) Q Consensus 326 pvL~e 330 (330) .|+.. T Consensus 246 ~v~~~ 250 (351) T protein:vir:78 246 FMYAP 250 (351) T ss_pred eeecC Confidence 77642 No 80 >protein:vir:100328 Length: 346 # NCBI annotation: capsid portal protein Q # Family: family:all:196 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655469;genbank:gi:109289937;genbank:GeneID:4157371 Probab=100.00 E-value=1.8e-34 Score=205.63 Aligned_cols=245 Identities=15% Similarity=0.192 Sum_probs=159.0 Q ss_pred HHhcC-CCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhcccc-chhccccccccccCCCCCcCCCcccchHHHH Q lcl|NC_019511. 8 LRLGS-MYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQ-QAYAEPFLEMMDTNPDYRDKKSYMRNAHNLH 85 (330) Q Consensus 8 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~-~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~ 85 (330) |.|.+ +.....+. .-+. .. -+++..-.++....|++ .+|. ........|.+||.++.+ T Consensus 1 m~~~~~~~~~~~~~-~~~~-~~---------~~~~~~~~p~~~~~~~~~~~~~----~~~~~~~~~~~pp~~~~~----- 60 (346) T protein:vir:10 1 MKKQLRKNLTQNDR-LQPQ-AQ---------TEIFSFGDPIPVLDRADILNYL----ECSAMYEKWYNPPMSFDG----- 60 (346) T ss_pred CCcccCCCCCcccc-cccc-cC---------eEEEecCCcceecCchhHHHHH----HHhhcCCceEecCCCHHH----- Confidence 31111 11100000 0000 00 00011111111111110 1111 111224567777654321 Q ss_pred HHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHH Q lcl|NC_019511. 86 EVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQ 165 (330) Q Consensus 86 ~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~ 165 (330) +++.+..|+....||...++.++. + .+.||+++|.. T Consensus 61 -la~l~~~~~~h~~~i~~k~n~l~~------------------------------------------l-~~~Pn~~~t~~ 96 (346) T protein:vir:10 61 -LAKSLRSSTHHESAIITKANILLS------------------------------------------T-CEVDSRYLSRR 96 (346) T ss_pred -HHHHHHhhhhcchhhhhhhhhHHH------------------------------------------H-HhCCCCCCCHH Confidence 223333366677777665544431 1 13489999999 Q ss_pred HHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecc Q lcl|NC_019511. 166 EFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIR 245 (330) Q Consensus 166 ~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~ 245 (330) +|++ ++.|+|++||+|++++ |++.|++++|+|++|.+|++..+.+++ .|++...++....|+++||+|++. T Consensus 97 ~f~~-~~~d~ll~Gnay~~i~--r~~~G~~~~L~pl~~~~v~~~~~~~~~------~~~~~~~~g~~~~~~~~dIih~r~ 167 (346) T protein:vir:10 97 DLSS-FVKDYLVFGNAYFEVV--RNRLGQVQRIESPLAKYVRKGLEAGQF------YYVPQRFDHQEHEFAKGSIYHLLE 167 (346) T ss_pred HHHH-HHHHHHhcCCeEEEEE--EcCCCcEEEEEEecCCceEEEEcCCeE------EEEEEccCCeEEEEecccEEEecC Confidence 9976 5679999999999886 556789999999999999987665543 355555577788899999999974 Q ss_pred cCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCccccccc Q lcl|NC_019511. 246 NPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQI 325 (330) Q Consensus 246 n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kv 325 (330) . +.....||+||+..|+.+|.++.++++|+.+||.||++|+|||.+++ ..+++|++++||++|++. .|.+|+|++ T Consensus 168 ~---~~~~~~~G~~~~~~a~~si~l~~~a~~~~~~~~~NG~~~~~il~~~d-~~l~~e~~~~i~~~~~~~-~g~~n~~~~ 242 (346) T protein:vir:10 168 P---DINQDIYGLPQYLSALQSAWLNESATLFRRKYFLNGAHAGFVFYMSD-ASQKQEDVENIRQQLKQS-KGVGNFKNL 242 (346) T ss_pred C---CCCCCeeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCC-CCCCHHHHHHHHHHHHHh-cCccccCce Confidence 3 33335589999999999999999999999999999999999998754 568999999999999976 577999996 Q ss_pred ceeeC Q lcl|NC_019511. 326 CLYIK 330 (330) Q Consensus 326 pvL~e 330 (330) .|+.. T Consensus 243 ~vl~~ 247 (346) T protein:vir:10 243 FVHAP 247 (346) T ss_pred eEecC Confidence 66643 No 81 >protein:vir:267 Length: 348 # NCBI annotation: putative capsid portal protein # Family: family:all:196 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536647;genbank:gi:17975125;genbank:GeneID:929081 Probab=100.00 E-value=4.1e-34 Score=203.65 Aligned_cols=228 Identities=14% Similarity=0.183 Sum_probs=155.3 Q ss_pred HHHHHHHhhcccchhccccchh---ccccccc------cc----cCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHH Q lcl|NC_019511. 36 IEQDTKEMQEITKSLYGKQQAY---AEPFLEM------MD----TNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIII 102 (330) Q Consensus 36 ~~~~~~~~~~~~~~~~g~~~~~---~~~~~~~------~~----~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~ 102 (330) ..++..+.....+...+.-..+ .+|.+.. +. ....|.+||-++.+ +.+.+-.|+.+.+||. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~epp~~~~~------La~l~~~n~~h~~~i~ 74 (348) T protein:vir:26 1 MTEQLIHSHTTDGTESKSVYSFDPNPEPVDTNSWMTRYCELFYNDFDDYWEPPISLKG------LAEIANANGYHGSLLK 74 (348) T ss_pred CCccccchhhccccCCceEEEecCCCeeecCcchHHHHHHHHhcCCCccccCCCCHHH------HHHHHhhhhhhhhhHh Confidence 1111111111111111111111 1122110 11 12256666643211 2233334888999998 Q ss_pred HHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCce Q lcl|NC_019511. 103 TRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVN 182 (330) Q Consensus 103 ~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~ 182 (330) .+++.++. .-.||+++|..+|.+. +.|+|++||+| T Consensus 75 ~k~N~l~~--------------------------------------------~~~Pn~~~t~~~f~~~-~~d~ll~Gnay 109 (348) T protein:vir:26 75 ARANYVAG--------------------------------------------RFMNGGGLPMYKMNSA-CWDYFGLGMSA 109 (348) T ss_pred hhhhHHhh--------------------------------------------cccCCCCCCHHHHHHH-HHHHHhcCCeE Confidence 88877752 0137888999999655 57999999999 Q ss_pred eEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecccCcCCCCCCCccccHHH Q lcl|NC_019511. 183 FEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVE 262 (330) Q Consensus 183 ~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe 262 (330) ++++ |++.|+|++|+|++|.+|++..+ | .|+++..++....|.++||+|++.. +.....||+||+. T Consensus 110 ~~~~--rn~~G~~~~L~~l~~~~v~~~~d--~-------~~~~~~~~g~~~~f~~~dIiHir~~---~~~~~~~Gls~~~ 175 (348) T protein:vir:26 110 FVKI--RSYLKNVIALEPLPMVHMRKRKN--G-------DFVQLLRNNEQKVFKAKDVIFIPQY---DPQQQIYGLPDYL 175 (348) T ss_pred EEEE--EcCCCcEEEEEEecCceeEeeec--C-------cEEEEEecCeEEEEcCccEEEEcCC---CCCCCcccccHHH Confidence 9987 45678999999999999987543 2 1444455666778999999999742 2333457999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceee----C Q lcl|NC_019511. 263 IAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYI----K 330 (330) Q Consensus 263 ~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~----e 330 (330) .|+++|.++.+++.|+++||+||++|+|||.+++ ..+++|++++||++|++. .|.+|+|++.|+. | T Consensus 176 ~a~~si~l~~~a~~~~~~~f~NGa~pg~Il~~~~-~~ls~e~~~~lk~~~~~~-~G~~n~~~~~vl~~~g~~ 245 (348) T protein:vir:26 176 GSIQSSLLNRDATLFRRRYYLNGAHMGFIFYATD-PNLSEADEKALKEKIASS-KGIGNFRSMFVNIPNGKE 245 (348) T ss_pred HHHHHHHHHHHHHHHHHHHHhccCCCceEEEecC-CCCCHHHHHHHHHHHHHh-cCcccccceeEEcCCCCc Confidence 9999999999999999999999999999998754 569999999999999985 6889999865552 2 No 82 >protein:vir:1150 Length: 350 # NCBI annotation: predicted capsid packaging protein # Family: family:all:196 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490599;genbank:gi:17313219;genbank:GeneID:927315 Probab=100.00 E-value=1.4e-34 Score=206.17 Aligned_cols=251 Identities=16% Similarity=0.151 Sum_probs=157.1 Q ss_pred HHhcCCCCCCcccccCccCcchhHHHHHHH-HHHHHhhcccchhcccc-chhccccccccccCCCCCcCCCcccchHHHH Q lcl|NC_019511. 8 LRLGSMYKEDTEDLMVPIDDGIQANIRQIE-QDTKEMQEITKSLYGKQ-QAYAEPFLEMMDTNPDYRDKKSYMRNAHNLH 85 (330) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~g~~-~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~ 85 (330) |.|.+...+... ...+-....+....... -+.++...+.....++. -.|.+ .+. ...|..||-++.+ T Consensus 1 m~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~y~~----~~~-~~~~~~pp~~~~~----- 69 (350) T protein:vir:11 1 MSKRRSHRRQQP-VTVQSAQEGEFIPRQGGRAEAFTFGDPMPVLDGRGILDYLE----CWP-NGRWYEPPLSMEG----- 69 (350) T ss_pred CCccccCCCcCc-cccCCcchhhhccccccceEEEEeCCceeecCcchhhHHHH----Hhh-cCccccCCCCHHH----- Confidence 322221111111 01110000000000000 00000011111111211 11111 111 2345555432211 Q ss_pred HHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHH Q lcl|NC_019511. 86 EVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQ 165 (330) Q Consensus 86 ~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~ 165 (330) ..+.+..|+...+||...++.++. ...||+.+|.. T Consensus 70 -la~~~~~~~~h~~~l~~k~n~l~~--------------------------------------------~~~Pn~~~t~~ 104 (350) T protein:vir:11 70 -LAKSVGSSVYLQSGLKFKRNMLAK--------------------------------------------TFIPHRLLSRA 104 (350) T ss_pred -HHHHHhhhhhhccchhhhhhhhhh--------------------------------------------cccCCCCCCHH Confidence 123333467777777665554431 12488899999 Q ss_pred HHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecc Q lcl|NC_019511. 166 EFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIR 245 (330) Q Consensus 166 ~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~ 245 (330) +|.+ ++.|+|++||+|+++++ ++.|++++|+||+|.+|++..+.+ .|+++..++....|.++||+|++. T Consensus 105 ~f~~-~v~d~ll~Gnay~~~~r--n~~G~~~~L~~l~~~~vr~~~~~~--------~~~~~~~~~~~~~~~~~eVihir~ 173 (350) T protein:vir:11 105 TFEQ-FSLDWLTFGSAYLEQPR--SRLGTRMPLQAPLAKYMRRGTDLE--------TFYQVRSWKDEHEFEKGSVIQLRE 173 (350) T ss_pred HHHH-HHHHHHhcCCeEEEEEE--cCCCCEEEEEEeCCceeEeeecCC--------eEEEEeeCCeEEEECcccEEEeCC Confidence 9865 67799999999999874 556899999999999998865432 356666778888999999999975 Q ss_pred cCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCccccccc Q lcl|NC_019511. 246 NPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQI 325 (330) Q Consensus 246 n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kv 325 (330) . +.....||+||+..++.+|.++.++++|+.+||.||++|+|||.+++ ..+++|++++|+++|++ ..|.+|+|++ T Consensus 174 ~---~~~~~~yGls~~~~a~~si~l~~~a~~~~~~~f~NGa~~~gil~~~~-~~ls~e~~~~l~~~~~~-~~G~~N~~~~ 248 (350) T protein:vir:11 174 A---DINQEIYGVPEWFCALQSALLNESATLFRRKYYNNGSHAGFILYMTD-AAQNEEDIDALRTALKT-AKGPGNFRNL 248 (350) T ss_pred C---CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecC-CCCCHHHHHHHHHHHHH-hcCccccCce Confidence 3 22234579999999999999999999999999999999999998864 46999999999999987 5788999997 Q ss_pred ceeeC Q lcl|NC_019511. 326 CLYIK 330 (330) Q Consensus 326 pvL~e 330 (330) .|+.. T Consensus 249 ~v~~~ 253 (350) T protein:vir:11 249 FVYAP 253 (350) T ss_pred eeecC Confidence 66643 No 83 >protein:vir:98567 Length: 340 # NCBI annotation: gp1 # Family: family:all:196 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958056;genbank:gi:41057353;genbank:GeneID:2744238 Probab=100.00 E-value=3.1e-34 Score=204.29 Aligned_cols=241 Identities=15% Similarity=0.171 Sum_probs=159.7 Q ss_pred HHhcCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhcccc-chhccccccccccCCCCCcCCCcccchHHHHH Q lcl|NC_019511. 8 LRLGSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQ-QAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHE 86 (330) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~-~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~ 86 (330) |.|.+..+.-.+...-+.. -+.++...+.....+++ .+|.+ . ..+.+|.+||-++.+. T Consensus 1 m~~~~~~~~~~~~~~~~~~-----------~~~~~~~~p~~~~~~~~~~~~~~----~-~~~~~~~~pp~~~~~l----- 59 (340) T protein:vir:98 1 MSKRKPRKAVAMTASAPQK-----------MEAFTFGEPVPVLDKRDILDYVE----C-ISNGKWYEPPVSFSGL----- 59 (340) T ss_pred CCCCCCCccccccccCccc-----------eeEEEcCCceeecCcchhhhhhh----h-hhcCceecCCCCHHHH----- Confidence 2211111110000000000 00011111111112221 11211 1 1244566666543322 Q ss_pred HHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHH Q lcl|NC_019511. 87 VLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQE 166 (330) Q Consensus 87 ~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~ 166 (330) .+.+..|+.+.+||...++.++. .-.||+++|..+ T Consensus 60 -a~l~~a~~~h~s~i~~k~n~l~~--------------------------------------------~~~Pn~~lt~~~ 94 (340) T protein:vir:98 60 -AKSLRSAVHHSSPIYVKRNVLAS--------------------------------------------TYIPHPLLSRQD 94 (340) T ss_pred -HHHHHhccccchhhhhhhhHHhh--------------------------------------------ccCCCCCCCHHH Confidence 22233378888888887777752 013788889988 Q ss_pred HHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeeccc Q lcl|NC_019511. 167 FCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRN 246 (330) Q Consensus 167 fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n 246 (330) |. +++.|+|++||+|+++++ ++.|++++|+|++|.+|++..+ | .+|+++..++....|.++||+|++.. T Consensus 95 f~-~~~~d~ll~Gnay~~~~r--n~~G~~~~L~pl~~~~vr~~~~--~------~~~~~~~~~~~~~~~~~~eViHir~~ 163 (340) T protein:vir:98 95 FS-RFALDYLVFGNAFLEQRH--SVTGQLIKLLTSPAKYTRRGVD--D------SVFWFVENFTQPHEFAPDTVFHLLEP 163 (340) T ss_pred HH-HHHHHHHhcCCeEEEEEE--CCCCcEEEEEEeCCceEEEccc--C------cEEEEEecCCeEEEEccccEEEEcCC Confidence 85 566799999999999874 5678999999999999987533 2 23666666777788999999999742 Q ss_pred CcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccc Q lcl|NC_019511. 247 PRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQIC 326 (330) Q Consensus 247 ~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvp 326 (330) +.....||+||+..|+++|.++.++++|+++||.||++|+|||.+++ ..+++|++++||++|++ +.|.+|+|++. T Consensus 164 ---~~~~~~~Gls~~~~a~~si~l~~aa~~~~~~~f~NGa~pg~il~~~~-~~ls~e~~~~lk~~~~~-~~G~~n~~~~~ 238 (340) T protein:vir:98 164 ---DINQEIYGLPEYLSALNSAWLNESATLFRRKYYQNGAHAGYIMYVTD-PAQSATDVESLRDAMRN-SKGLGNFKNLF 238 (340) T ss_pred ---CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecC-CCCCHHHHHHHHHHHHH-hcCccccCcee Confidence 22234589999999999999999999999999999999999998864 56999999999999987 68999999977 Q ss_pred eeeC Q lcl|NC_019511. 327 LYIK 330 (330) Q Consensus 327 vL~e 330 (330) |+.+ T Consensus 239 vl~~ 242 (340) T protein:vir:98 239 FYSP 242 (340) T ss_pred EecC Confidence 7643 No 84 >protein:vir:2013 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046757;genbank:gi:9630328;genbank:GeneID:1261529 Probab=100.00 E-value=1.4e-34 Score=206.26 Aligned_cols=235 Identities=16% Similarity=0.190 Sum_probs=156.4 Q ss_pred cCCCCCCcccccCccCcchhHHHHHHHHH--------HHHhhcccchhcccc-chhccccccccccCCCCCcCCCcccch Q lcl|NC_019511. 11 GSMYKEDTEDLMVPIDDGIQANIRQIEQD--------TKEMQEITKSLYGKQ-QAYAEPFLEMMDTNPDYRDKKSYMRNA 81 (330) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~g~~-~~~~~~~~~~~~~~p~~~~~~s~~r~~ 81 (330) +++.++.. . . +. ..+. +++-..+.....+++ -+|. ..+ .+..|.+||-++.+ T Consensus 1 ~~~~~~~~-~-~-~~----------~~~~~~~~~~~~~~~f~~p~~v~~~~~~~~~~----~~~-~~~~~~~pp~~~~~- 61 (344) T protein:vir:20 1 MSKKKGKT-P-Q-PA----------AKTMTASGPKMEAFTFGEPVPVLDRRDILDYV----ECI-SNGRWYEPPVSFTG- 61 (344) T ss_pred CCcccCCC-C-c-ch----------hhhhhccCCceEEEEcCCceEecCcchhhhhh----hhh-hcCceecCCCCHHH- Confidence 22221110 0 0 00 0000 000000000011111 0111 111 24567766644322 Q ss_pred HHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCc Q lcl|NC_019511. 82 HNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDR 161 (330) Q Consensus 82 ~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~ 161 (330) ..+.++ .|+.+..||...++.++. .-.||++ T Consensus 62 --la~~~~---a~~~h~~~i~~k~n~l~~--------------------------------------------~~~Pn~~ 92 (344) T protein:vir:20 62 --LAKSLR---AAVHHSSPIYVKRNILAS--------------------------------------------TFIPHPW 92 (344) T ss_pred --HHHHHh---hhhhhCccceehhhhHHH--------------------------------------------hccCCCC Confidence 122222 367777777666555541 0137888 Q ss_pred CCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHee Q lcl|NC_019511. 162 DSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELV 241 (330) Q Consensus 162 ~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvi 241 (330) +|..+| ++++.|+|++||+|+++++ ++.|+|++|+|++|.+|++..+.+ +|+++..++....|.++||+ T Consensus 93 lt~~~f-~~~~~d~ll~Gnay~~i~r--n~~G~~~~L~pl~~~~vr~~~~~~--------~~~~~~~~~~~~~~~~~eIi 161 (344) T protein:vir:20 93 LSQQDF-SRFVLDFLVFGNAFLEKRY--STTGKVIRLETSPAKYTRRGVEED--------VYWWVPSFNEPTAFAPGSVF 161 (344) T ss_pred CCHHHH-HHHHHHHHhcCCeEEEEEE--CCCCcEEEEEEcCCceeEeeecCC--------EEEEEccCCeEEEEcCccEE Confidence 999988 6788999999999999875 556899999999999998864432 36777778888899999999 Q ss_pred eecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCccc Q lcl|NC_019511. 242 MGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGING 321 (330) Q Consensus 242 h~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~n 321 (330) |++.. +...+.||+||+..|+++|.++.++++|+++||+||++|+|||.+++ ..+|+|++++||++|++.+ |. | T Consensus 162 Hir~~---~~~~~~yGls~~~~a~~si~l~~~a~~~~~~~f~NGa~p~~Il~~~d-~~l~~e~~~~ik~~~~~~~-g~-~ 235 (344) T protein:vir:20 162 HLLEP---DINQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAGYIMYVTD-AVQDRNDIEMLRENMVKSK-GR-N 235 (344) T ss_pred EeCCC---CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecC-cCCCHHHHHHHHHHHHHhc-CC-C Confidence 99743 22235589999999999999999999999999999999999998764 4699999999999999865 43 7 Q ss_pred ccccceee-----C Q lcl|NC_019511. 322 SWQICLYI-----K 330 (330) Q Consensus 322 a~kvpvL~-----e 330 (330) +||.++|. | T Consensus 236 n~r~l~l~~p~g~~ 249 (344) T protein:vir:20 236 NFKNLFLYAPQGKA 249 (344) T ss_pred CccceEEecCCCCc Confidence 78977775 2 No 85 >protein:vir:3780 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536820;genbank:gi:17981829;genbank:GeneID:929208 Probab=100.00 E-value=1.4e-33 Score=200.76 Aligned_cols=235 Identities=14% Similarity=0.090 Sum_probs=154.4 Q ss_pred cCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccc--cchhcccc----cccc----ccCCCCCcCCCcccc Q lcl|NC_019511. 11 GSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGK--QQAYAEPF----LEMM----DTNPDYRDKKSYMRN 80 (330) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~--~~~~~~~~----~~~~----~~~p~~~~~~s~~r~ 80 (330) ++.+.+-++.. . .++..++ ...+.+|. +..+ ..+..|.+||-++.+ T Consensus 1 ~~~~~~~~~~~---------------------~---~~~~~~~~~~f~~~~~~~~~~~~y~~~~~~~~~~~~epp~~~~~ 56 (345) T protein:vir:37 1 MKTNVKTDNKK---------------------G---IVIAPINDRTFSLNEISASPALDYVGIGFDENYNCYLPPVNRHA 56 (345) T ss_pred CCCCccccchh---------------------h---cccCcceeEEeecCCcccccchhhhhhhhcCCccccCCCCCHHH Confidence 11111110000 0 0000001 11111111 1111 125567777643211 Q ss_pred hHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCC Q lcl|NC_019511. 81 AHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDID 160 (330) Q Consensus 81 ~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~ 160 (330) +.+.+-.|+...+||...++.++. .-.||+ T Consensus 57 ------la~l~~~~~~h~~~i~~k~n~l~~--------------------------------------------~~~Pn~ 86 (345) T protein:vir:37 57 ------LAKLPHQNAQHGGILHSRANMVSS--------------------------------------------LYEGGK 86 (345) T ss_pred ------HHHHhhcccccccceeeechHHHh--------------------------------------------hccCCC Confidence 122222367777777555554431 113788 Q ss_pred cCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHe Q lcl|NC_019511. 161 RDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSREL 240 (330) Q Consensus 161 ~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dv 240 (330) .+|..+|++ ++.|+|++||+|++++ |++.|+|++|+||+|..|++..+.+.+. -++|.....++....|+++|| T Consensus 87 ~lt~~~f~~-~~~d~ll~Gnay~~~~--rn~~G~~~~L~pl~~~~vr~~~d~~~~~---~~~~~~~~~~g~~~~~~~~dV 160 (345) T protein:vir:37 87 ALSRMDMRA-LCLNLIQFGDVGLLKV--RNGFGQVVRLVPLSSLYLRVRKDGGYSY---LMKKSLYDTAQEIYRYDAKDI 160 (345) T ss_pred CCCHHHHHH-HHHHHHhcCCeEEEEE--EcCCCcEEEEEEEcCceeEEEEeCCeeE---EEEEeEecCCceEEEEccccE Confidence 899999975 5679999999999987 4567899999999999998876643321 112333345677788999999 Q ss_pred eeecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcc Q lcl|NC_019511. 241 VMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGIN 320 (330) Q Consensus 241 ih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~ 320 (330) +|++.. +.....||+||+..|++++.++.++++|+++||.||++|+|||.+++ ..+++|++++||++|++ +.|.+ T Consensus 161 ihir~~---~~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~f~NG~~p~~Il~~~d-~~l~~e~~~~lk~~~~~-~~g~~ 235 (345) T protein:vir:37 161 IFIKLY---DPMQQVYGSPDYVGGIQSALLNSDATVFRRRYFSNGAHMGFILYSTD-PDLTEEMEEEIARKISE-SKGVG 235 (345) T ss_pred EEecCC---CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEecC-CCCCHHHHHHHHHHHHH-hcCcc Confidence 999742 22334579999999999999999999999999999999999998754 56899999999999987 57889 Q ss_pred cccccceee----C Q lcl|NC_019511. 321 GSWQICLYI----K 330 (330) Q Consensus 321 na~kvpvL~----e 330 (330) |++++.|+. + T Consensus 236 n~~~~~i~~p~g~~ 249 (345) T protein:vir:37 236 NFRSMFVNIANGHP 249 (345) T ss_pred cccceEEEcCCCcc Confidence 998855553 1 No 86 >protein:vir:6210 Length: 394 # NCBI annotation: Portal protein # Family: family:all:10882 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852590;genbank:gi:31415850;genbank:GeneID:1489208 Probab=100.00 E-value=7e-33 Score=196.87 Aligned_cols=233 Identities=9% Similarity=-0.031 Sum_probs=151.0 Q ss_pred hhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhH Q lcl|NC_019511. 29 IQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQV 108 (330) Q Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~I 108 (330) +-+ +.++ ..... ++ .+.+.+ ..+.+....+. ...+.+. +. ..+++.+++||+.|+++| T Consensus 1 MGl-~~~~----~~~~~-~~-~~~~~~-~~~~~~~~~~~---~~~~vt~--------~~---al~~~~v~~~i~~Ia~~i 58 (394) T protein:vir:62 1 MGL-RDRF----SNYLF-KK-AEKRGY-LDNVLGKSIRY---SGVYVTD--------SN---ILQSSDVYELLQDISNQM 58 (394) T ss_pred Cch-hhhh----hhhcc-CC-CCchhh-hhhhhhccccc---CccccCh--------hh---hhccHHHHHHHHHHHHhh Confidence 222 1111 11100 00 111111 11111111110 0111111 11 123789999999999999 Q ss_pred hhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEe Q lcl|NC_019511. 109 STYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFS 188 (330) Q Consensus 109 a~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~ 188 (330) |+ +.|.+.-+|.+. + -.|.++.++ .+||+.+|+++|++.++.++|+.|++++++. T Consensus 59 A~-----------lp~~v~~~~g~~---------~--~~~~~~~Ll-~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~-- 113 (394) T protein:vir:62 59 VL-----------ADIVVEDEFGNE---------I--KDDIALQIL-RNPNNYLTQSEFIKLMTNTYLLEGETFPILN-- 113 (394) T ss_pred cc-----------cceEEEcCCCcc---------c--chhhHHHHh-ccCCCCCCHHHHHHHHHHHHHhcCCeEEEEe-- Confidence 95 334443322210 0 123333333 4799999999999999999999999888863 Q ss_pred cCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecccCcCCCCCCCccccHHHHHHHHH Q lcl|NC_019511. 189 PKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKEF 268 (330) Q Consensus 189 rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I 268 (330) + .+..++ + .|.+..++++ .|++.. ++ .+|+++||+|++.++.++ .+|+||++.|.++| T Consensus 114 ~----~~~~~~--~--~~~~~~~~~~-------~~~~~~-~~--~~~~~~eiih~r~~~~d~----~~G~s~~~~~~~~i 171 (394) T protein:vir:62 114 G----AQIHLA--S--NVFTELDDNL-------VEHFNI-GG--HEIPPCMIRHVKNIGADH----LRGKGILDLGRDTL 171 (394) T ss_pred c----ceeecc--c--cceEEECCce-------EEEEee-CC--EEechhheEEecCcCCCC----ccccChHHHHHHHH Confidence 3 344442 2 4455555443 233332 33 468999999998655332 35999999999999 Q ss_pred HHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 269 IAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 269 ~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) +.++++++|+.++|+||++|+|+|.+++....++++.+++++.|++.++|..|+|+++||-. T Consensus 172 ~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~ 233 (394) T protein:vir:62 172 EGVMSAEKTLTDKYKKGGLLTFLLNLDAHINPQNGAQSKLINAILDQLESIDEARSVKMIPL 233 (394) T ss_pred HHHHHHHHHHHHHHHccCCcceEEEeCCCCCcCHHHHHHHHHHHHHHhccccccCceeEeeC Confidence 99999999999999999999999999887666788899999999999999999999888755 No 87 >protein:vir:78749 Length: 337 # NCBI annotation: putative portal protein # Family: family:all:196 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285643;genbank:gi:148727149;genbank:GeneID:5220095 Probab=100.00 E-value=4.3e-33 Score=198.02 Aligned_cols=235 Identities=12% Similarity=0.125 Sum_probs=146.6 Q ss_pred HHhcCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhcccc-chhccccccccccCCCCCcCCCcccchHHHHH Q lcl|NC_019511. 8 LRLGSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQ-QAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHE 86 (330) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~-~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~ 86 (330) |.|.+..+...... +- .+.+....+.....|+. .+|.+=..+ ....|.+||-++.+ T Consensus 1 m~~~~~~~~~~~~~-----~~---------~~~~~~~~p~~~~~~~~~~~~~~~~~~---~~~~~~~pP~~~~~------ 57 (337) T protein:vir:78 1 MTKRQQQPAQAAAS-----SP---------RPSVVFSMPEAIDPTAWMTDYTGVFYN---PYGEYYQPPIDRKG------ 57 (337) T ss_pred CCCcccCccccccc-----Cc---------eeEEEecCcccccCcchhHhhhhhhhc---cCcceecCCCCHHH------ Confidence 21111111110000 00 00011111111111110 011111111 12345555543221 Q ss_pred HHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHH Q lcl|NC_019511. 87 VLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQE 166 (330) Q Consensus 87 ~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~ 166 (330) ..+.+..|+.+++|+..+.+.++. ..+...+ T Consensus 58 La~l~~~~~~h~~~L~~k~N~~~~-------------------------------------------------~f~~~~~ 88 (337) T protein:vir:78 58 LAKVARANAHHGAILMARRNMVAG-------------------------------------------------RFTNQRA 88 (337) T ss_pred HHHHhhcchhhhhHHHhhhccccc-------------------------------------------------cCcCcHH Confidence 122222366666666665554431 0011235 Q ss_pred HHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeeccc Q lcl|NC_019511. 167 FCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRN 246 (330) Q Consensus 167 fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n 246 (330) ++++++.|+|++||+|+++++ ++.|+|++|+||+|.+|++..+ | .|+|+..++....|.++||+|++.. T Consensus 89 ~~~~~~~d~ll~GNay~~~~r--n~~G~~~~L~pl~~~~v~~~~d--~-------~~~~~~~~~~~~~~~~~eIiHik~~ 157 (337) T protein:vir:78 89 TITAFVHNYLQFGDGGLLKLR--NSFGQVVGLHPLSSVYLRRRED--G-------CFVYLQQGKPNLIYRPDDVIWLAQY 157 (337) T ss_pred HHHHHHHHHHhhCCeEEEEEE--CCCCcEEEEEEeCCceeEeeeC--C-------eEEEEEcCCceEEECCccEEEECCC Confidence 788899999999999999874 5578999999999999987643 2 2445556677778999999999743 Q ss_pred CcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccc Q lcl|NC_019511. 247 PRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQIC 326 (330) Q Consensus 247 ~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvp 326 (330) +...+.||+||+..|+++|.+++++++|+++||+||++|+|||.+++ ..++++++++||++|++ +.|.+|++++. T Consensus 158 ---~~~~~~~Gls~~~~a~~si~l~~aa~~~~~~~f~NGa~p~~il~~~~-~~l~~e~~~~lk~~~~~-~~G~~n~~~~~ 232 (337) T protein:vir:78 158 ---DPEQQVYGMPDYLGGLQSALLNQDATLFRRRYFLNGAHMGFIFYATD-PNMDDDTEEEMKEMIAN-SKGVGNFRSMF 232 (337) T ss_pred ---CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCC-CCCCHHHHHHHHHHHHH-hcCcccccceE Confidence 22234579999999999999999999999999999999999998754 46999999999999986 68889998855 Q ss_pred eee----C Q lcl|NC_019511. 327 LYI----K 330 (330) Q Consensus 327 vL~----e 330 (330) |+. | T Consensus 233 v~~~~g~~ 240 (337) T protein:vir:78 233 VNIPDGKP 240 (337) T ss_pred EEcCCCCc Confidence 442 2 No 88 >protein:vir:79150 Length: 368 # NCBI annotation: bacteriophage gpQ # Family: family:all:196 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165254;genbank:gi:145708079;genbank:GeneID:5247161 Probab=100.00 E-value=9.1e-33 Score=196.25 Aligned_cols=252 Identities=14% Similarity=0.112 Sum_probs=146.7 Q ss_pred H-HhcCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHH Q lcl|NC_019511. 8 L-RLGSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHE 86 (330) Q Consensus 8 ~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~ 86 (330) | |+.++........... ....+....+++- ...++..- .+..|.+.+.+ .+..+ T Consensus 1 m~~~~~~~~~~~~~~~~~--------------------~~~~~~~~~~~~~---~~~~~~~~-~fg~p~~~~~~-~~~~~ 55 (368) T protein:vir:79 1 MSRNKTRRAARAASAHVR--------------------TANTDAPTEHHTD---RAAQAEVF-SFGDPVEVLDR-RELLD 55 (368) T ss_pred CCccccccchhccCcccc--------------------cccccCcchhhcc---ccCceEEE-EcCCceeecch-hhHHH Confidence 2 1111110000000000 0000000000000 00001100 01111111111 11122 Q ss_pred HHHH-----HhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCc Q lcl|NC_019511. 87 VLKK-----FGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDR 161 (330) Q Consensus 87 ~Lr~-----~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~ 161 (330) .+.- |.+.|+-..++.-+.+. +..+ + .....++-+.. +...||+. T Consensus 56 ~~~~~~~~~~~~~pi~~~~la~~~~~------------~~~h------------~-----~~~~~~~n~l~-l~~~Pn~~ 105 (368) T protein:vir:79 56 YVECMRMGQWYEPPMPWDGLARSFRA------------AAHH------------S-----SAVYVKRNILV-STFIPHPL 105 (368) T ss_pred HHHHHhccchhccCcCHHHHHHHHhh------------cccc------------c-----hhhhhhcchhh-hhcCCCcC Confidence 2211 22233333322110000 0000 0 00111122222 23469999 Q ss_pred CCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHee Q lcl|NC_019511. 162 DSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELV 241 (330) Q Consensus 162 ~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvi 241 (330) +|..+|+ +++.|+|++||+|+++++ ++.|++++|+||+|.+|++..+.+ .|+|+..++....|.++||+ T Consensus 106 ~t~~~f~-~l~~d~ll~Gnay~~~~r--~~~G~~~~L~~l~~~~v~~~~~~~--------~~~~~~~~~~~~~~~~~dIi 174 (368) T protein:vir:79 106 LSRATFE-RLVLDWQVFGNAYLERRE--NVLGGTIRLDTPLAKYVRRGLDLN--------TYFFVQNWQQPYTFAAGSVF 174 (368) T ss_pred CCHHHHH-HHHHHHhhcCCeEEEEEE--cCCCCEEEEEEeCcccceeeccCC--------EEEEEecCCeEEEEccccEE Confidence 9999996 578899999999999874 557899999999999998754422 36666677778889999999 Q ss_pred eecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCccc Q lcl|NC_019511. 242 MGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGING 321 (330) Q Consensus 242 h~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~n 321 (330) |++.. +...+.||+||+++|+.+|.++.++++|+.+||.||++|+|||.+++ ..+++|++++||++|++ +.|.+| T Consensus 175 hir~~---~~~~~~yGlsp~~~a~~si~l~~aa~~~~~~~~~NGa~~~gil~~~~-~~l~~e~~~~lk~~~~~-~~G~~N 249 (368) T protein:vir:79 175 HLQEP---DINQEVYGLPEYLSALNATWLNESATLFRRRYYKNGSHAGFILYMTD-AAQKQEDVDTLREAMKS-AKGPGN 249 (368) T ss_pred EecCC---CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCC-CCCCHHHHHHHHHHHHH-hcCCcc Confidence 99742 22234589999999999999999999999999999999999998764 56999999999999987 789999 Q ss_pred ccccceee----C Q lcl|NC_019511. 322 SWQICLYI----K 330 (330) Q Consensus 322 a~kvpvL~----e 330 (330) +|+++||. | T Consensus 250 ~g~~~vl~~~g~~ 262 (368) T protein:vir:79 250 FRNLFMYAPNGKK 262 (368) T ss_pred cCceeEecCCCCc Confidence 99977763 2 No 89 >protein:vir:3743 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043484;genbank:gi:9628619;genbank:GeneID:1261113 Probab=100.00 E-value=2.6e-32 Score=193.78 Aligned_cols=245 Identities=15% Similarity=0.142 Sum_probs=153.5 Q ss_pred HHhcCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHH Q lcl|NC_019511. 8 LRLGSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEV 87 (330) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~ 87 (330) |+|.+..+........+. + ...+.. ..++...--.|.+=. ......|.+||-++.+ + T Consensus 1 ~~~~~~~~~~~~~~~~~~----~-------~~~~~~---~~~~~~~~~~y~~~~---~~~~~~~~epp~~~~~------l 57 (345) T protein:vir:37 1 MKTNVKTDNKKGIVIAPI----N-------DRTFSL---SEITASPALDYVGIG---FDENYNCYLPPVNRHA------L 57 (345) T ss_pred CCccccccchhhhcCCCc----e-------EEEeec---CCcccchhhccccee---eecCCccccCCCCHHH------H Confidence 222211111000000000 0 000000 000000000111100 0124567777644221 1 Q ss_pred HHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHH Q lcl|NC_019511. 88 LKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEF 167 (330) Q Consensus 88 Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~f 167 (330) .+.+-.|+.+.+||...++.++. .-.||+.+|..+| T Consensus 58 a~~~~~~~~h~~~i~~k~n~l~~--------------------------------------------~~~Pn~~~t~~~f 93 (345) T protein:vir:37 58 AKLPHQNAQHGGILHSRANMVSA--------------------------------------------TYEGGKALSKMEM 93 (345) T ss_pred HHHhhcchhhcchhhhhhhHHhh--------------------------------------------ccCCCCCCCHHHH Confidence 12222378888888877776642 0137889999999 Q ss_pred HHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecccC Q lcl|NC_019511. 168 CKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNP 247 (330) Q Consensus 168 l~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~ 247 (330) .+ ++.|+|++||+|+++++ ++.|++++|+|++|.+|++..+.+.+.. .++.....++....|.++||+|++.. T Consensus 94 ~~-~v~d~ll~Gnay~~i~r--n~~G~~~~L~pl~~~~vr~~~d~~~~~~---~~~~~~~~~g~~~~~~~~eViHir~~- 166 (345) T protein:vir:37 94 RA-LCLNLIQFGDVGLLKVR--NGFGQVVRLVPLSSLYLRVHKDGGYSYL---MKKSLYDTAQEIYRYDAKDIIFIKLY- 166 (345) T ss_pred HH-HHHHHHhcCCeEEEEEE--CCCCCEEEEEEecCceeEEeecCCeeEE---EeeeeeccCceEEEEccccEEEEcCC- Confidence 65 56799999999999874 5678999999999999988655432111 11222334567778999999999742 Q ss_pred cCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccce Q lcl|NC_019511. 248 RSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICL 327 (330) Q Consensus 248 ~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpv 327 (330) +.....||+||+..|+++|.++.++++|+++||.||++|+|||.+++ ..+++|+.++||++|++.+ |.+|.+.+.| T Consensus 167 --~~~~~~~Gl~~~~~a~~si~l~~~a~~~~~~~f~NGa~~~~Il~~t~-~~l~~e~~~~lk~~~~~~~-g~~n~~~~~i 242 (345) T protein:vir:37 167 --DPMQQVYGSPDYVGGIQSALLNSDATVFRRRYFSNGAHMGFILYSTD-PDLTEEMEEEIARKISESK-GVGNFRSMFV 242 (345) T ss_pred --CCCCCcccchHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCC-CCCCHHHHHHHHHHHHHhc-CccccCceeE Confidence 22234579999999999999999999999999999999999998754 5699999999999999986 4466655444 Q ss_pred eeC Q lcl|NC_019511. 328 YIK 330 (330) Q Consensus 328 L~e 330 (330) +.. T Consensus 243 ~~~ 245 (345) T protein:vir:37 243 NIA 245 (345) T ss_pred ecC Confidence 422 No 90 >protein:vir:9359 Length: 348 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803337;genbank:gi:29028648;genbank:GeneID:1258089 Probab=99.95 E-value=1.9e-30 Score=183.58 Aligned_cols=183 Identities=9% Similarity=0.016 Sum_probs=138.3 Q ss_pred HhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEE Q lcl|NC_019511. 108 VSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVF 187 (330) Q Consensus 108 Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~ 187 (330) ||+ +-+.+.-++ + ...|.+.+++..+||+++|.++|+++++.+++++||+|++++ T Consensus 1 ia~-----------lp~~~~~~~---~----------~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~- 55 (348) T protein:vir:93 1 MAS-----------LPLKMYEDY---K----------VVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIE- 55 (348) T ss_pred Ccc-----------cceEeEecC---c----------CcccHHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEE- Confidence 553 233332111 1 123445566667899999999999999999999999888876 Q ss_pred ecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecccCcCCCCCCCccccHHHHHHHH Q lcl|NC_019511. 188 SPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKE 267 (330) Q Consensus 188 ~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~ 267 (330) |+..|+|++||||+|.+|++..+.+|. .+.|.+...++....|+++||+|++.++..+ ..||+||++.++.+ T Consensus 56 -r~~~G~~~~L~~l~~~~v~~~~~~~~~----~~~y~~~~~~g~~~~~~~~eiih~r~~~~~~---~~~G~s~~~~~~~~ 127 (348) T protein:vir:93 56 -RDIYHQPSKLFLLNPDVVEMLIENQSR----ELYYSIHAATGNKLIVHNMDMLHFKHIVASN---MVQGISPIDVLKNT 127 (348) T ss_pred -ECCCCcEEEEEEEcCCceEEEEeCCCc----EEEEEEEcCCCeEEEEccccEEEecCCCCCC---ceeeccHHHHHHHH Confidence 566789999999999999998877654 2456666666777789999999998654332 34699999999999 Q ss_pred HHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 268 FIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 268 I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) |+++.++++|+ ++.++..+++++..+ ..+++|+++++++.|++.++ |+|+++||-+ T Consensus 128 i~~~~~~~~~~--~~~~~~~~~~i~~~~--~~l~~e~~~~~~~~~~~~~~---n~~~~~vl~~ 183 (348) T protein:vir:93 128 TDFDNAVRTFN--LTEMQKPDSFMLKYG--SNVSTEKRQQVLEDFKQYYE---ENGGILFQEP 183 (348) T ss_pred HHHHHHHHHHH--HHhcCCCceeEEecC--CCCCHHHHHHHHHHHHHHhh---cCCCeeecCC Confidence 99999999996 444444556666543 46999999999999999874 6788666655 No 91 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=99.95 E-value=2.5e-30 Score=182.90 Aligned_cols=183 Identities=8% Similarity=-0.013 Sum_probs=140.6 Q ss_pred HhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEE Q lcl|NC_019511. 108 VSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVF 187 (330) Q Consensus 108 Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~ 187 (330) ||+ +-+.+.-+++. . .|.+.+++..+||+.+|+++|++.++.++|+.|++++++++ T Consensus 1 ia~-----------l~~~~~~~~~~---~----------~~~l~~lL~~~PN~~~t~~~f~~~~~~~ll~~Gna~~~i~r 56 (278) T protein:vir:78 1 MAS-----------LPLKMYEDYKV---V----------NTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIER 56 (278) T ss_pred Ccc-----------ceeEEEecCcc---c----------ccHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCCEEEEEEE Confidence 553 33333222111 1 23344555568999999999999999999999999999875 Q ss_pred ecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecccCcCCCCCCCccccHHHHHHHH Q lcl|NC_019511. 188 SPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKE 267 (330) Q Consensus 188 ~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~ 267 (330) +..|++++|+||+|.+|++..+.+|. .+.|.+...++....|+++||+|++.+... .+.||+||+..+..+ T Consensus 57 --~~~G~~~~l~~l~~~~v~v~~~~~~~----~~~y~~~~~~g~~~~~~~~evih~~~~~~~---~~~~G~s~~~~~~~~ 127 (278) T protein:vir:78 57 --DIYHQPSKLFLLNPDVVEMLIENQSR----ELYYSIHAATGNKLIVHNMDMLHFKHIVAS---NMVQGISPIDVLKNT 127 (278) T ss_pred --CCCCcEEEEEEECCceeEEEEcCCCc----eEEEEEEcCCceEEEEccccEEEECCCCCC---CCeeeccHHHHHHHH Confidence 55689999999999999998887764 245666666777789999999999864322 234799999999999 Q ss_pred HHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 268 FIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 268 I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) |+.++++++++..+|.++ |+|++..++ .+++|+.+++++.|++.+ +|+|+++||-+ T Consensus 128 i~~~~~~~~~~~~~~~~~--~~~i~~~~~--~l~~e~~~~~~~~~~~~~---~~~g~~~vl~~ 183 (278) T protein:vir:78 128 TDFDNAVRTFNLTEMQKP--DSFMLKYGS--NVGKEKRQQVLEDFKQYY---EENGGILFQEP 183 (278) T ss_pred HHHHHHHHHHHHHHhcCC--CcEEEEeCC--CCCHHHHHHHHHHHHHHh---ccCCCceecCC Confidence 999999999977666554 788887654 589999999999999865 36889877755 No 92 >protein:vir:95965 Length: 385 # NCBI annotation: ORF011 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239800;genbank:gi:66395461;genbank:GeneID:5132882 Probab=99.92 E-value=1.9e-27 Score=167.11 Aligned_cols=224 Identities=13% Similarity=0.093 Sum_probs=141.3 Q ss_pred hhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHH-HHHHHHhhcHHHHHHHHHHHHh Q lcl|NC_019511. 29 IQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLH-EVLKKFGNNSILNAIIITRANQ 107 (330) Q Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~-~~Lr~~a~~~iv~a~I~~~~d~ 107 (330) +.. +..+.+ |.... +...+.+... ..-+.+.+++.|++||+.++++ T Consensus 1 Mg~-f~~~f~--------------~~~~~------------------~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ 47 (385) T protein:vir:95 1 MGL-FDSVFK--------------RHSEL------------------SWMYDLEFLQDKSKKAYLKQIALNTVVEMVART 47 (385) T ss_pred Cch-hhhhhc--------------cCccc------------------ccccchhhhhccchhhhhhhHHHHHHHHHHHHH Confidence 221 222211 11000 0000000000 0012233589999999999999 Q ss_pred HhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEE Q lcl|NC_019511. 108 VSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVF 187 (330) Q Consensus 108 Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~ 187 (330) ||+. .|.+.-++. . ..|.+.+++..+||+.+|+++|+++++.++++.|++|+++ T Consensus 48 ia~~-----------p~~~~~~~~---~----------~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~i~~-- 101 (385) T protein:vir:95 48 ISQS-----------EFRVMKNNT---K----------EKGTLYYLLNVRPNRNQNAVDFWQKFIFKLIMDNEVLVVK-- 101 (385) T ss_pred Hccc-----------ceeeeecCc---c----------ccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCceEEEE-- Confidence 9963 232221111 1 1133445555689999999999999999999999887765 Q ss_pred ecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecccCcCCCCCCCccccHHHHHHHH Q lcl|NC_019511. 188 SPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKE 267 (330) Q Consensus 188 ~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~ 267 (330) .+++ +.+..++++.+..+.+..+ . ...+.+.+.+....|+++||+|++.++..+ ..||+||++.|+.+ T Consensus 102 ~~~~-~~~~~~~~~~~~~~~~~~~----~----~~~~~~~~~~~~~~~~~~eiih~~~~~~~~---~~~G~s~~~~~~~~ 169 (385) T protein:vir:95 102 NDEG-HFFVADDFEKEDELGLYSH----R----FTNVLVNDFEFKRVFTMDDVIYLKYNNQKL---DAFSLGLFEDYGEI 169 (385) T ss_pred ecCC-Ceeeccccccccccccccc----c----ceeeeecccceeeeeccccEEEecCCCCCc---ccccchHHHHHHHH Confidence 3443 4455666655554433211 1 112222334555679999999998765433 34699999999999 Q ss_pred HHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 268 FIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 268 I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) |+.++++.. |. +.|+|+|.+++...+++++.++++++|++.++|..|+++.+++++ T Consensus 170 i~~~~~~~~-----~~--~~~~g~l~~~~~~~~~~e~~~~~~~~~~~~~~g~~~~~~~i~~l~ 225 (385) T protein:vir:95 170 FGRMIDLQM-----LN--NQIRGILKVDATKFYNKEKQKELQAYIDTLFDAFQNNTIAVVPLT 225 (385) T ss_pred HHHHHHHHH-----hc--CCCceEEEeCCccCCCHHHHHHHHHHHHHHhhhhhhcCCceEEcC Confidence 988776543 33 348899999887789999999999999999999977766556566 No 93 >protein:vir:9507 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835554;genbank:gi:30043953;genbank:GeneID:1260535 Probab=99.91 E-value=1.6e-26 Score=161.93 Aligned_cols=221 Identities=14% Similarity=0.074 Sum_probs=136.9 Q ss_pred hhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHH-HHHHHhhcHHHHHHHHHHHHh Q lcl|NC_019511. 29 IQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHE-VLKKFGNNSILNAIIITRANQ 107 (330) Q Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~-~Lr~~a~~~iv~a~I~~~~d~ 107 (330) +.. +.++.+ +.+.. +...+...+.+ .-+.+.+++.|++||+.++++ T Consensus 1 Mg~-f~~lf~--------------~~~~~------------------~~~~~~~~~~~v~~~~~~~~~~v~~~i~~Ia~~ 47 (395) T protein:vir:95 1 MSI-LEKIFK--------------TRKDI------------------TYMLDLDMIEDLSQQAYVKRLAIDSCIEFVARA 47 (395) T ss_pred Cch-hhhhhc--------------cCccc------------------cccccchhccccchhhhhhhHHHHHHHHHHHHh Confidence 211 111111 10000 00000000000 112334589999999999999 Q ss_pred HhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEE Q lcl|NC_019511. 108 VSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVF 187 (330) Q Consensus 108 Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~ 187 (330) ||+. .|.+.-+ . ++. ++.+..++...||+.+|.++|+++++.++|+.|++++++ T Consensus 48 iA~~-----------p~~~~~~--~-~~~----------~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~-- 101 (395) T protein:vir:95 48 VAQS-----------HFKVLEG--N-RIQ----------KNDVYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVV-- 101 (395) T ss_pred hccc-----------eeEeccC--C-ccc----------cchHHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEE-- Confidence 9963 2222111 1 111 223444455679999999999999999998877655443 Q ss_pred ecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecccCcCCCCCCCccccHHHHHHHH Q lcl|NC_019511. 188 SPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKE 267 (330) Q Consensus 188 ~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~ 267 (330) .+++ .++|+++..+.+...... ...++.....+...+|.++||+|++.++..+ ..||+|||+.+..+ T Consensus 102 ~~~~-----~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~evih~~~~~~~~---~~~G~spi~~~~~~ 168 (395) T protein:vir:95 102 SDSK-----ELLIADSFYREEYALYDD-----IFKDVTVKDYTYQRTFTMQEVIYLKYNNNKV---THFVESLFEDYGKI 168 (395) T ss_pred ecCC-----CeEecCCccceeEeecCc-----ceeEEEEcCceeeeeeccccEEEEccCCCCc---ccccchHHHHHHHH Confidence 2332 256777766655433221 1122333344555679999999998876544 34799999999999 Q ss_pred HHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCccccccccee-eC Q lcl|NC_019511. 268 FIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLY-IK 330 (330) Q Consensus 268 I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL-~e 330 (330) ++.+++ .|.+|+.|+|+|.++++ .+++++.+++++.|++.++|. |+++.+|| ++ T Consensus 169 ~~~~~~-------~~~~~~~~~gii~~~~~-~~~~e~~~~~~~~~~~~~~~~-~~~~~~v~~l~ 223 (395) T protein:vir:95 169 FGRMIG-------AQLKNYQIRGILKSASS-AYDEKNIEKLQAFTNKLFNTF-NKNQLAIAPLI 223 (395) T ss_pred HHHHHH-------HHHhcCCCceEEEeCCC-CCCHHHHHHHHHHHHHHhccc-cccCcceEEcC Confidence 887654 35677788999988764 589999999999999988886 66676665 55 No 94 >protein:vir:101289 Length: 395 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908829;genbank:gi:118725093;genbank:GeneID:4555860 Probab=99.91 E-value=1.6e-26 Score=161.93 Aligned_cols=221 Identities=14% Similarity=0.074 Sum_probs=136.9 Q ss_pred hhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHH-HHHHHhhcHHHHHHHHHHHHh Q lcl|NC_019511. 29 IQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHE-VLKKFGNNSILNAIIITRANQ 107 (330) Q Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~-~Lr~~a~~~iv~a~I~~~~d~ 107 (330) +.. +.++.+ +.+.. +...+...+.+ .-+.+.+++.|++||+.++++ T Consensus 1 Mg~-f~~lf~--------------~~~~~------------------~~~~~~~~~~~v~~~~~~~~~~v~~~i~~Ia~~ 47 (395) T protein:vir:10 1 MSI-LEKIFK--------------TRKDI------------------TYMLDLDMIEDLSQQAYVKRLAIDSCIEFVARA 47 (395) T ss_pred Cch-hhhhhc--------------cCccc------------------cccccchhccccchhhhhhhHHHHHHHHHHHHh Confidence 211 111111 10000 00000000000 112334589999999999999 Q ss_pred HhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEE Q lcl|NC_019511. 108 VSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVF 187 (330) Q Consensus 108 Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~ 187 (330) ||+. .|.+.-+ . ++. ++.+..++...||+.+|.++|+++++.++|+.|++++++ T Consensus 48 iA~~-----------p~~~~~~--~-~~~----------~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~-- 101 (395) T protein:vir:10 48 VAQS-----------HFKVLEG--N-RIQ----------KNDVYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVV-- 101 (395) T ss_pred hccc-----------eeEeccC--C-ccc----------cchHHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEE-- Confidence 9963 2222111 1 111 223444455679999999999999999998877655443 Q ss_pred ecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecccCcCCCCCCCccccHHHHHHHH Q lcl|NC_019511. 188 SPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKE 267 (330) Q Consensus 188 ~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~ 267 (330) .+++ .++|+++..+.+...... ...++.....+...+|.++||+|++.++..+ ..||+|||+.+..+ T Consensus 102 ~~~~-----~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~evih~~~~~~~~---~~~G~spi~~~~~~ 168 (395) T protein:vir:10 102 SDSK-----ELLIADSFYREEYALYDD-----IFKDVTVKDYTYQRTFTMQEVIYLKYNNNKV---THFVESLFEDYGKI 168 (395) T ss_pred ecCC-----CeEecCCccceeEeecCc-----ceeEEEEcCceeeeeeccccEEEEccCCCCc---ccccchHHHHHHHH Confidence 2332 256777766655433221 1122333344555679999999998876544 34799999999999 Q ss_pred HHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCccccccccee-eC Q lcl|NC_019511. 268 FIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLY-IK 330 (330) Q Consensus 268 I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL-~e 330 (330) ++.+++ .|.+|+.|+|+|.++++ .+++++.+++++.|++.++|. |+++.+|| ++ T Consensus 169 ~~~~~~-------~~~~~~~~~gii~~~~~-~~~~e~~~~~~~~~~~~~~~~-~~~~~~v~~l~ 223 (395) T protein:vir:10 169 FGRMIG-------AQLKNYQIRGILKSASS-AYDEKNIEKLQAFTNKLFNTF-NKNQLAIAPLI 223 (395) T ss_pred HHHHHH-------HHHhcCCCceEEEeCCC-CCCHHHHHHHHHHHHHHhccc-cccCcceEEcC Confidence 887654 35677788999988764 589999999999999988886 66676665 55 No 95 >protein:vir:100650 Length: 395 # NCBI annotation: 77ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958604;genbank:gi:41189523;genbank:GeneID:2743796 Probab=99.91 E-value=1.6e-26 Score=161.93 Aligned_cols=221 Identities=14% Similarity=0.074 Sum_probs=136.9 Q ss_pred hhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHH-HHHHHhhcHHHHHHHHHHHHh Q lcl|NC_019511. 29 IQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHE-VLKKFGNNSILNAIIITRANQ 107 (330) Q Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~-~Lr~~a~~~iv~a~I~~~~d~ 107 (330) +.. +.++.+ +.+.. +...+...+.+ .-+.+.+++.|++||+.++++ T Consensus 1 Mg~-f~~lf~--------------~~~~~------------------~~~~~~~~~~~v~~~~~~~~~~v~~~i~~Ia~~ 47 (395) T protein:vir:10 1 MSI-LEKIFK--------------TRKDI------------------TYMLDLDMIEDLSQQAYVKRLAIDSCIEFVARA 47 (395) T ss_pred Cch-hhhhhc--------------cCccc------------------cccccchhccccchhhhhhhHHHHHHHHHHHHh Confidence 211 111111 10000 00000000000 112334589999999999999 Q ss_pred HhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEE Q lcl|NC_019511. 108 VSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVF 187 (330) Q Consensus 108 Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~ 187 (330) ||+. .|.+.-+ . ++. ++.+..++...||+.+|.++|+++++.++|+.|++++++ T Consensus 48 iA~~-----------p~~~~~~--~-~~~----------~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~-- 101 (395) T protein:vir:10 48 VAQS-----------HFKVLEG--N-RIQ----------KNDVYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVV-- 101 (395) T ss_pred hccc-----------eeEeccC--C-ccc----------cchHHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEE-- Confidence 9963 2222111 1 111 223444455679999999999999999998877655443 Q ss_pred ecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecccCcCCCCCCCccccHHHHHHHH Q lcl|NC_019511. 188 SPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKE 267 (330) Q Consensus 188 ~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~ 267 (330) .+++ .++|+++..+.+...... ...++.....+...+|.++||+|++.++..+ ..||+|||+.+..+ T Consensus 102 ~~~~-----~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~evih~~~~~~~~---~~~G~spi~~~~~~ 168 (395) T protein:vir:10 102 SDSK-----ELLIADSFYREEYALYDD-----IFKDVTVKDYTYQRTFTMQEVIYLKYNNNKV---THFVESLFEDYGKI 168 (395) T ss_pred ecCC-----CeEecCCccceeEeecCc-----ceeEEEEcCceeeeeeccccEEEEccCCCCc---ccccchHHHHHHHH Confidence 2332 256777766655433221 1122333344555679999999998876544 34799999999999 Q ss_pred HHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCccccccccee-eC Q lcl|NC_019511. 268 FIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLY-IK 330 (330) Q Consensus 268 I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL-~e 330 (330) ++.+++ .|.+|+.|+|+|.++++ .+++++.+++++.|++.++|. |+++.+|| ++ T Consensus 169 ~~~~~~-------~~~~~~~~~gii~~~~~-~~~~e~~~~~~~~~~~~~~~~-~~~~~~v~~l~ 223 (395) T protein:vir:10 169 FGRMIG-------AQLKNYQIRGILKSASS-AYDEKNIEKLQAFTNKLFNTF-NKNQLAIAPLI 223 (395) T ss_pred HHHHHH-------HHHhcCCCceEEEeCCC-CCCHHHHHHHHHHHHHHhccc-cccCcceEEcC Confidence 887654 35677788999988764 589999999999999988886 66676665 55 No 96 >protein:vir:78310 Length: 376 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468642;genbank:gi:157325220;genbank:GeneID:5601655 Probab=99.88 E-value=1e-24 Score=152.16 Aligned_cols=223 Identities=8% Similarity=0.008 Sum_probs=131.6 Q ss_pred hhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhH Q lcl|NC_019511. 29 IQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQV 108 (330) Q Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~I 108 (330) +.. ++.+.. ..+.+ ...... ..-...+ -..+.+++.|++||+.++++| T Consensus 1 Mg~-f~~l~~----~~~~~--------~~~~~~--------~~~~~~~-----------~~~~l~~~~v~~~i~~Ia~~i 48 (376) T protein:vir:78 1 MGF-FSELFK----RNKEI--------EWMWDL--------DFLEDKT-----------TKVYLKKMALNTCVKHIARTI 48 (376) T ss_pred Cch-hhhhhc----cCCcc--------ccccch--------hhccccc-----------hhhhhhhHHHHHHHHHHHHhh Confidence 222 111111 00000 000000 0000011 112334788999999999999 Q ss_pred hhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEe Q lcl|NC_019511. 109 STYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFS 188 (330) Q Consensus 109 a~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~ 188 (330) |+. .|.+ +++.. ...|.+.+++..+||+.+|.++|+++++.++++.|++++++. T Consensus 49 a~~-----------p~~~--~~~~~-----------~~~~~l~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~~~-- 102 (376) T protein:vir:78 49 AKS-----------DFRL--KNGET-----------SVRDKLYYKLNIRPNTDMSSSSFWEKVIYKLIYDNECLIVLS-- 102 (376) T ss_pred ccc-----------ceee--ccccc-----------cccchHHHHHhhccccCCCHHHHHHHHHHHHhHcCcEEEEEE-- Confidence 952 2222 11111 123455666677899999999999999999999998877764 Q ss_pred cCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeC-CceEEEechhHeeeecccCcCCCCCCCccccHHHHHHHH Q lcl|NC_019511. 189 PKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVID-KQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKE 267 (330) Q Consensus 189 rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~-~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~ 267 (330) +++.|.+.+++|+++..+..... |..... .+....|+++||+|++.++.++ ..++.++++.+... T Consensus 103 r~~~~~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~evih~~~~~~~~---~~~~~~~~~~~~~~ 168 (376) T protein:vir:78 103 DTDDFLIADSYVRKEFAFFPDVF-----------EGVTVKDYRYNRNFSMDDVIFLEYGNERL---SAFTDGMFEDYGEL 168 (376) T ss_pred eCCCeeeccceeecccceeeeee-----------eeeeeecceeeeeeccccEEEeccCCCCc---hhhhhHHHHHHHHH Confidence 67778999999999987654221 122222 2334568999999998654332 22344444444444 Q ss_pred HHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 268 FIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 268 I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) +..++ .+.+|.+|.++.+++. ....+++++.+++++.|++.++|..+.++-.++++ T Consensus 169 ~~~~~-----~~~~~~~~~~~~~~~~--~~~~~~~e~~~~~~~~~~~~~~g~~~~~~~v~~l~ 224 (376) T protein:vir:78 169 FGKMI-----RAQMRNFQIRGAVNFK--MAGVADKDKQTKLQEYIDKVYASFNNNEIAIVPQL 224 (376) T ss_pred HHHHH-----HHHHhcCCCceeEEEc--cCCCCCHHHHHHHHHHHHHHhccccccCcceEEcC Confidence 33222 2334555555555553 44578999999999999999999865554344355 No 97 >protein:vir:94002 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764318;genbank:gi:115315632;genbank:GeneID:5176589 Probab=99.88 E-value=3.6e-25 Score=154.61 Aligned_cols=207 Identities=14% Similarity=0.160 Sum_probs=122.6 Q ss_pred hhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhH Q lcl|NC_019511. 29 IQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQV 108 (330) Q Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~I 108 (330) +.. . ++-+...-. ..+.....++..-+ -....++++|++||+.|+++| T Consensus 1 Mg~--------------f-----~~~~~~~~~------~~~~~~~~~~~~~~-------~~~~~~~~~v~~~v~~IA~~i 48 (378) T protein:vir:94 1 MNL--------------F-----GKVVSFSRG------KLNNDTQRVTAWQN-------EAVEYTSAFVTNIHNKIANEI 48 (378) T ss_pred CCc--------------c-----ccchhcccc------cccCCcceeeeecc-------chhHHHHHHHHHHHHHHHhhh Confidence 000 0 000000000 00001101110000 001124689999999999999 Q ss_pred hhhhh-hheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEE Q lcl|NC_019511. 109 STYCK-PARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVF 187 (330) Q Consensus 109 a~~~~-~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~ 187 (330) |+... .++..+++-+ .+ .......|.+.+++...||+++|.++|++.++.++|+.|++|+++++ T Consensus 49 A~lp~~~~~~~~~~~~------------~~---~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~ 113 (378) T protein:vir:94 49 TKVEFNHVKYKKSDVG------------SD---TLISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLSAPYVDLYAVF 113 (378) T ss_pred hhCceeeEEEcccCcc------------cc---cccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEe Confidence 96422 1222222211 00 01123446677777778999999999999999999999999999875 Q ss_pred ecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecccCcCCCCCCCccccHHHHHHHH Q lcl|NC_019511. 188 SPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKE 267 (330) Q Consensus 188 ~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~ 267 (330) +++.|+++.|+|.+. ..+|.++||+|++. |.. ...|+||++.|+++ T Consensus 114 -~~~~g~~~~l~p~~~----------------------------~~~~~~~diiH~~~-~~~----~~~g~s~l~~~~~~ 159 (378) T protein:vir:94 114 -DDNTGELLDLLFADD----------------------------KKEYKPEELVRLTS-PFY----INEDTSILDNALAS 159 (378) T ss_pred -eCCCceEEEEEecCC----------------------------eeEeeeeeeEEecC-cCC----ccchhHHHHHHHHH Confidence 455678777765321 12457789999973 322 22499999999998 Q ss_pred HHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHh----cCcccccccceeeC Q lcl|NC_019511. 268 FIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSF----SGINGSWQICLYIK 330 (330) Q Consensus 268 I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~----~G~~na~kvpvL~e 330 (330) |..+ +++| .|+|+|++++ .+++++.+++++.|++.+ +| .|+|+++||-+ T Consensus 160 i~~~----------~~~~-~~~gil~~~~--~l~~~~~~~~~~~~~~~~~~~~~~-~~~g~~~vl~~ 212 (378) T protein:vir:94 160 IQTK----------LEQG-KLRGLLKINA--FLDIDNTQEYREKALTTIKNMQEG-SSYNGLTPVDN 212 (378) T ss_pred HHHH----------Hhcc-cccceeeeCC--cCCHHHHHHHHHHHHHHHHHhhcc-cccccceecCC Confidence 7533 3444 5899999876 477776655555555544 44 57889766655 No 98 >protein:vir:93867 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764264;genbank:gi:115315577;genbank:GeneID:5141561 Probab=99.87 E-value=1.6e-24 Score=151.01 Aligned_cols=204 Identities=14% Similarity=0.132 Sum_probs=120.2 Q ss_pred hhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcc---cchHHHHHHHHHHhhcHHHHHHHHHHH Q lcl|NC_019511. 29 IQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYM---RNAHNLHEVLKKFGNNSILNAIIITRA 105 (330) Q Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~---r~~~~~~~~Lr~~a~~~iv~a~I~~~~ 105 (330) +.. |....+.+.. ..++.. .... .... .-++++|++||+.|+ T Consensus 1 Mg~-----------------------------f~~~~~f~~~--~~~~~~~~~~~~~-~~~~---~~~~~~v~~~i~~Ia 45 (378) T protein:vir:93 1 MNL-----------------------------FGKVVSFSRG--KLNNDTQRVTAWQ-NEAV---EYTSAFVTNIHNKIA 45 (378) T ss_pred Ccc-----------------------------chhhhhhhcc--ccCCCcceeeecc-cchh---HHHHHHHHHHHHHHH Confidence 000 0000000000 000000 0000 0011 113678999999999 Q ss_pred HhHhhhhh-hheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeE Q lcl|NC_019511. 106 NQVSTYCK-PARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFE 184 (330) Q Consensus 106 d~Ia~~~~-~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~ 184 (330) ++||+... .++..+++.+ . ++ ......|.+.+++...||+++|.++|++.++.++|+.|++|++ T Consensus 46 ~~iA~lp~~~~~~~~~~~~-~-----------~~---~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~i~ 110 (378) T protein:vir:93 46 NEITKVEFNHVKYKKSDVG-S-----------DT---LISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLY 110 (378) T ss_pred hhhhhCceeeEEEcccccc-c-----------cc---ccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEE Confidence 99996432 1222111111 0 00 0112345666777778999999999999999999999999998 Q ss_pred EEEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecccCcCCCCCCCccccHHHHH Q lcl|NC_019511. 185 KVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIA 264 (330) Q Consensus 185 ~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a 264 (330) ++++ ++.|+++.|+|.+. ..+|.++||+|++ +|.. ...|.||++.+ T Consensus 111 ~~~~-~~~g~~~~l~~~~~----------------------------~~~~~~~diih~r-~~~~----~~~~~s~l~~~ 156 (378) T protein:vir:93 111 AVFD-DNTGELLDLLFADD----------------------------KKEYKTEELVRLT-SPFY----INEDTSILDNA 156 (378) T ss_pred EEee-cCCceEEEEEecCC----------------------------eeEeccceeEEec-Cccc----cchhhHHHHHH Confidence 8753 45577776655321 2346789999996 3432 22399999988 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHH----HhcCcccccccceeeC Q lcl|NC_019511. 265 MKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKS----SFSGINGSWQICLYIK 330 (330) Q Consensus 265 ~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~----~~~G~~na~kvpvL~e 330 (330) +.++. .+|++| .|+|+|.+++. +++++.+++++.|++ .++| .|+|+++||-+ T Consensus 157 ~~~i~----------~~~~~~-~~~g~l~~~~~--l~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~l~~ 212 (378) T protein:vir:93 157 LASIQ----------TKLEQG-KLRGLLKINAF--LDIDNTQEYREKALTTIKNMQEG-SSYNGLTPVDN 212 (378) T ss_pred HHHHH----------HHHhcC-cccceeeeCCc--CCHHHHHHHHHHHHHHHHHhhcc-cccccceEcCC Confidence 87763 356666 48999998764 677765555555554 4444 58889766665 No 99 >protein:vir:1661 Length: 378 # NCBI annotation: unknown # Family: family:all:2379 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044950;genbank:gi:9629657;genbank:GeneID:1261302 Probab=99.86 E-value=6.3e-24 Score=147.77 Aligned_cols=207 Identities=13% Similarity=0.137 Sum_probs=119.4 Q ss_pred hhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHHHh-hcHHHHHHHHHHHHh Q lcl|NC_019511. 29 IQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFG-NNSILNAIIITRANQ 107 (330) Q Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a-~~~iv~a~I~~~~d~ 107 (330) +. +..+..+.. .+. .+......+ ... . ...+ ++++|++||+.|+++ T Consensus 1 Mg-----~f~~~~~~~------~~~--------------~~~~~~~~~---~~~-~----~~~~~~~~~v~~~i~~Ia~~ 47 (378) T protein:vir:16 1 MN-----LFGKVVSFS------RGK--------------LNNDTQRVT---AWQ-N----EAVEYTSAFVTNIHNKIANE 47 (378) T ss_pred Cc-----cchhhhhhh------ccc--------------ccCCcceee---ecc-c----chhhHHHHHHHHHHHHHHhh Confidence 00 000000000 000 000000000 000 0 1111 367899999999999 Q ss_pred Hhhhhh-hheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEE Q lcl|NC_019511. 108 VSTYCK-PARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKV 186 (330) Q Consensus 108 Ia~~~~-~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v 186 (330) ||+... .++..+++.+ ..+ ......|.+.+++...||+++|.++|+++++.++|+.|+++++++ T Consensus 48 iA~l~~~~~~~~~~~~~-~~~--------------~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~ 112 (378) T protein:vir:16 48 ITKVEFNHVKYKKSDVG-SDT--------------LISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAV 112 (378) T ss_pred hhhCceeEEEEcccccc-ccc--------------ccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEE Confidence 996432 2222222211 000 011234556667777899999999999999999999999999987 Q ss_pred EecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecccCcCCCCCCCccccHHHHHHH Q lcl|NC_019511. 187 FSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMK 266 (330) Q Consensus 187 ~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~ 266 (330) ++ ++.|+++.|+|.+. ...|.++||+|++ +|.. ...|.||++.+++ T Consensus 113 ~d-~~~g~~~~l~~~~~----------------------------~~~~~~~diih~r-~~~~----~~~~~s~l~~~~~ 158 (378) T protein:vir:16 113 FD-DNTGELLDLLFADD----------------------------KKEYKPEELVRLT-SPFY----INEDTSILDNALA 158 (378) T ss_pred ee-cCCceEEEEEecCC----------------------------eeEecccceEEec-CccC----ccchhHHHHHHHH Confidence 53 34467666655321 1245789999997 3332 2239999999888 Q ss_pred HHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhc---CcccccccceeeC Q lcl|NC_019511. 267 EFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFS---GINGSWQICLYIK 330 (330) Q Consensus 267 ~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~---G~~na~kvpvL~e 330 (330) +|.. +|++| .|+|+|..++. +++++.+++++.|++.++ |..|+|+++||.+ T Consensus 159 ~i~~----------~~~~~-~~~g~l~~~~~--l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~ 212 (378) T protein:vir:16 159 SIQT----------KLEQG-KLRGLLKINAF--LDIDNTQEYREKALTTIKNMQEGSSYNGLTPVDN 212 (378) T ss_pred HHHH----------HHhcC-ccceeeEeCCc--CCHHHHHHHHHHHHHHHHHhhcccccccceEcCC Confidence 7642 34444 58999988764 677665555555555442 3368899777766 No 100 >protein:vir:4089 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510984;swissprot:trembl:q8w606;genbank:gi:17488506;uniprot:Q8W606;genbank:GeneID:1260314 Probab=99.83 E-value=1.9e-22 Score=139.69 Aligned_cols=227 Identities=11% Similarity=-0.051 Sum_probs=125.3 Q ss_pred hhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhH Q lcl|NC_019511. 29 IQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQV 108 (330) Q Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~I 108 (330) +.+ +.++..-+-.. .+... .+ ... +|..-.. ... +.+-+++.|++||++|+++| T Consensus 1 Mg~-~~~~~~~~~~~---~~~~~----~~-----~~~----~~~~~~~------~~~---~~~l~~~~v~~~v~~Ia~~i 54 (395) T protein:vir:40 1 MGF-KSWVSGFFNEE---QRTLN----LT-----DTV----WCSIPSE------KLK---ELSIKKWAIDSCANKIANTL 54 (395) T ss_pred Cch-HHHHHhhhccc---ccccc----cc-----cch----hhccccc------cch---hhhhhhHHHHHHHHHHHHHH Confidence 333 22222221110 00000 00 000 1111000 001 11224789999999999999 Q ss_pred hhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEe Q lcl|NC_019511. 109 STYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFS 188 (330) Q Consensus 109 a~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~ 188 (330) |+. .|.+.-++ . +.++.+.+++...||+.+|.++|+++++.++|+.|++|+++. T Consensus 55 a~~-----------p~~~~~~~--~-----------~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~~~-- 108 (395) T protein:vir:40 55 SCA-----------EVLTYEKG--E-----------EVRKKNWYMFNVEANQNQNATEFWKKAIYKLVYDNEALIFMQ-- 108 (395) T ss_pred hhC-----------ceeeccCC--c-----------cccchHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEe-- Confidence 963 22222111 1 112334455566899999999999999999999999887764 Q ss_pred cCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEE-eCC-ceEEEechhHeeeecccCcCCCCCCCccccHHHHHHH Q lcl|NC_019511. 189 PKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQV-IDK-QVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMK 266 (330) Q Consensus 189 rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~-~~~-~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~ 266 (330) ++. +++.++..+... + ..+..|.++ ..+ +...+|.++||+|+++++..+ .+++.+.++.+.. T Consensus 109 ~~~------~~~~~~~~~~~~----~---~~~~~~~~v~~~~~~~~~~~~~~evih~r~~~~~~---~~~~~~l~~~~~~ 172 (395) T protein:vir:40 109 DEY------IYVADSFTKNDK----S---LYENTYTEVTLKDLTLKKEFKESEVLHLTLNNESI---KSIIDGFYLLYGD 172 (395) T ss_pred cCc------eeecCCcccccc----c---cccceeeeeeecCceeeeeeccccEEEeecCCCCc---cccchhHHHHHHH Confidence 332 333333322111 0 011123322 222 223468999999998765432 3455555666665 Q ss_pred HHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCc-ccccccceeeC Q lcl|NC_019511. 267 EFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGI-NGSWQICLYIK 330 (330) Q Consensus 267 ~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~-~na~kvpvL~e 330 (330) .+...++ +.+|.|+.++. +..++...+++++.+++|+.|++.++|. +|++++.||-+ T Consensus 173 ~~~~~~~-----~~~~~~~~~~~--l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vl~~ 230 (395) T protein:vir:40 173 LLTAAVN-----KYKKLNSRKII--VKLKAMFGQTPEAEEKLRLMLSERMKKFLAEGDSALPVED 230 (395) T ss_pred HHHHHHH-----HHHhcCCCCce--EEEecccCCCHHHHHHHHHHHHHHHHHhhccCCceeecCC Confidence 5544443 33445555554 4445556799999999999999999885 46777555544 No 101 >protein:vir:98853 Length: 219 # NCBI annotation: hypothetical protein # Family: family:all:196 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654729;genbank:gi:109302914;genbank:GeneID:4156058 Probab=99.82 E-value=4.4e-23 Score=143.13 Aligned_cols=111 Identities=12% Similarity=0.219 Sum_probs=85.2 Q ss_pred eEEeeCCCCcccCCceeEEEEeC----CceEEEechhHeeeeccc-CcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019511. 206 IFYATDKNGKIIKGGNRFVQVID----KQVVASFTSRELVMGIRN-PRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDR 280 (330) Q Consensus 206 V~~~~d~~G~~~~~~~~Y~q~~~----~~~~~~~~~~dvih~~~n-~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~ 280 (330) |++. .+| .++|++... ++...+|.++||+|++.. |.+ +.||+|||++|+.+|.++.++++|+++ T Consensus 1 ~r~~--~dg-----~~~y~~~~~~~~~~g~~~~~~~~eilH~r~~~~~~----~~~Glspi~~a~~~i~~~~aa~~~~~~ 69 (219) T protein:vir:98 1 MRVC--KDG-----NYKYLMKKSLYDTKSEIYEYNKNDVIFIKLYDPMQ----QVYGSPDYVGGITSALLNSDATIFRRR 69 (219) T ss_pred Ccee--ecC-----eEEEEEecceecCCceeEEeccccEEEecCCCCCC----CcceecHHHHHHHHHHHHHHHHHHHHH Confidence 3332 223 234544332 366778999999999742 222 336999999999999999999999999 Q ss_pred HHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceee-----C Q lcl|NC_019511. 281 FFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYI-----K 330 (330) Q Consensus 281 fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~-----e 330 (330) ||+||++|+|||.+++ ..+|++++++|+++|++ +.|.+|+++ ++|+ | T Consensus 70 ~f~Ng~~p~gil~~~~-~~l~~e~~~~~~~~~~~-~~g~~n~~~-~~l~~~gg~~ 121 (219) T protein:vir:98 70 YYSNGAHMGFILYSTD-PDMTEEMEDEIAERIRD-SKGVGNFRS-MFVNIAGGHP 121 (219) T ss_pred HHhcCCCCceEEEeCC-CCCCHHHHHHHHHHHHH-hcCcccccc-eeEecCCCCc Confidence 9999999999998865 46999999999999987 468788755 5665 2 No 102 >protein:vir:9641 Length: 395 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795403;genbank:gi:28876176;genbank:GeneID:1257709 Probab=99.81 E-value=6e-22 Score=136.92 Aligned_cols=225 Identities=13% Similarity=-0.012 Sum_probs=118.8 Q ss_pred hhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhH Q lcl|NC_019511. 29 IQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQV 108 (330) Q Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~I 108 (330) +.+ ...+. . + +.+ ..+.. .+....+. ..-..+-+++.|++||+.++++| T Consensus 1 Mgl-----~d~~~-~---~-----~~~--~~~~~-------~~~~~~~~--------~~~~~~l~~~~v~~~i~~Ia~~i 49 (395) T protein:vir:96 1 MGI-----LDFFS-F---K-----KSG--TLSDD-------DSGSTTSE--------KLTNVVLKEDALYKCVNYLARII 49 (395) T ss_pred Ccc-----hhhhc-C---C-----CCc--ccccc-------ccccchhh--------hcchhhhhhHHHHHHHHHHHHhh Confidence 111 10000 0 0 000 00000 01100000 00111224789999999999999 Q ss_pred hhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEe Q lcl|NC_019511. 109 STYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFS 188 (330) Q Consensus 109 a~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~ 188 (330) |+. .|.+.-+++ .+.. .|.+.+++...||+++|.++|+++++.++|+.|+++++++ T Consensus 50 a~l-----------p~~v~~~~~-~~~~----------~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~~~-- 105 (395) T protein:vir:96 50 SKS-----------TFRIKAPEK-LTEN----------QKDWLYWINTKANPNQSASQFWVEVVQKLLVDGETLIFVI-- 105 (395) T ss_pred ccc-----------eeEEEeCCc-cccc----------cchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEE-- Confidence 962 333322211 1111 2334455556899999999999999999999999888875 Q ss_pred cCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEE-eCC-ceEEEechhHeeeecccCcCCCCCCCccccHHHHHHH Q lcl|NC_019511. 189 PKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQV-IDK-QVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMK 266 (330) Q Consensus 189 rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~-~~~-~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~ 266 (330) ++..+ ++.+...+.... .+..|.++ ..+ .....|.++||+|++.++... ..+|.++++.+.. T Consensus 106 ~~~~~-----~~~~~~~~~~~~--------~~~~~~~v~~~~~~~~~~~~~~dvih~k~~~~~~---~~~~~~~~~~~~~ 169 (395) T protein:vir:96 106 PGKGI-----YVADAFTQDKKL--------SGNKFKVSRVQGQTYEKIFTFDQVIYLKNDNSDL---MLKVESLWEEYGE 169 (395) T ss_pred cCCce-----ecCCcccccccc--------ccceeeeeeeccceeeeEeccCceEEecccCCcc---ccccccccchHHH Confidence 44332 232222221110 01123222 222 234568999999998765432 2234444444444 Q ss_pred HHHHHHH------HHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCccccccc-ceeeC Q lcl|NC_019511. 267 EFIAYNN------TESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQI-CLYIK 330 (330) Q Consensus 267 ~I~~~la------ae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kv-pvL~e 330 (330) .++.+++ +.+|...+|.+|+.|+|++...++ ++.+.++++|++.+++.. ++.. ++++| T Consensus 170 ~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~-~~~~~v~~l~ 234 (395) T protein:vir:96 170 LLGHVINNQKIANQIRFTMTPPKDKVRERAQENSDGG-----RQPKSDKDFFKRTIEKIR-TESVVGIPVT 234 (395) T ss_pred HHHHHHHHHHHHHHHHHHhhhcccccccceeeccCch-----hhHHHHHHHHHHHHHHhh-cCCcceEEcc Confidence 4444333 447888999999999999976543 334566667776665543 3333 33355 No 103 >protein:vir:858 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047117;genbank:gi:9630570;genbank:GeneID:1261758 Probab=99.80 E-value=1.3e-21 Score=135.08 Aligned_cols=207 Identities=13% Similarity=0.113 Sum_probs=115.6 Q ss_pred hhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHHHh-hcHHHHHHHHHHHHh Q lcl|NC_019511. 29 IQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFG-NNSILNAIIITRANQ 107 (330) Q Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a-~~~iv~a~I~~~~d~ 107 (330) +++ +. +++.....+. ..+.. |.+ +.. + +..+ ++++|++||+.++++ T Consensus 1 M~~-f~----k~~~~~~~~~-~~~~~--------------~~~----~~~-~--------~~~~~~~~~v~~~v~~ia~~ 47 (378) T protein:vir:85 1 MNL-FG----KVVSFSRGKL-NNDTQ--------------RVT----AWQ-N--------EAVEYTSAFVTNIHNKIANE 47 (378) T ss_pred Cch-hh----hhhhhhhccc-ccCCc--------------cee----eee-c--------cchhhhhHHHHHHHHHHHHh Confidence 111 00 1000000000 00000 000 000 0 0111 367899999999999 Q ss_pred Hhhhhhh-heecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEE Q lcl|NC_019511. 108 VSTYCKP-ARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKV 186 (330) Q Consensus 108 Ia~~~~~-~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v 186 (330) ||..... ++....+.+ +++ ......|.+.+++..+||+++|.++|++.++.++|+.|++|+|++ T Consensus 48 iA~lp~~~~~~~~~~~~-----~~~----------~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i 112 (378) T protein:vir:85 48 ITKVEFNHVKYKKSDVG-----SDT----------LISMAGSDLDEVLNWSYKGEHNSMEFWQKVIKKLLCTRYVDLYPI 112 (378) T ss_pred HhhCceeEEEEeccccc-----ccc----------ccccccchHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEe Confidence 9964221 111111111 000 012234667777788999999999999999999999999998876 Q ss_pred EecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecccCcCCCCCCCccccHHHHHHH Q lcl|NC_019511. 187 FSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMK 266 (330) Q Consensus 187 ~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~ 266 (330) + ++..|+++ |.+..+++. .|.++||+|++. |.+.. -+.++++.+.+ T Consensus 113 ~-~~~~g~~~--------------------------~~~~~~~~~--~~~~~dvih~~~-~~~~~----~~~~~~~~a~~ 158 (378) T protein:vir:85 113 F-DSETGELL--------------------------DLLFANDKK--EYKPEELVRLVS-PFYIN----EDTSILDNALA 158 (378) T ss_pred e-cCCCceEE--------------------------EEEecCCCE--EEcccceEEEec-CcCcc----chhhHHHHHHH Confidence 4 33334433 222333332 467889999863 22211 14555665555 Q ss_pred HHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHh---cCcccccccceeeC Q lcl|NC_019511. 267 EFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSF---SGINGSWQICLYIK 330 (330) Q Consensus 267 ~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~---~G~~na~kvpvL~e 330 (330) ++. .+|++| .|+|+|..++ .+++++.+++++.|++.+ .|..|+|+++||.+ T Consensus 159 ~~~----------~~~~~~-~~~g~l~~~~--~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~ 212 (378) T protein:vir:85 159 SIQ----------TKLEQG-KLRGLLKINA--FLDIDNTQEYREKALATIKNMQEGSSYNGLTPVDN 212 (378) T ss_pred HHH----------HHHhcC-CcceEEEeCC--cCCHHHHHHHHHHHHHHHHHhhcccccccceecCC Confidence 442 334454 6899998876 478888777777766543 34468999777766 No 104 >protein:vir:98643 Length: 395 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039921;genbank:gi:126011096;genbank:GeneID:4818479 Probab=99.79 E-value=6.5e-21 Score=131.27 Aligned_cols=230 Identities=12% Similarity=-0.019 Sum_probs=126.9 Q ss_pred hhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhH Q lcl|NC_019511. 29 IQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQV 108 (330) Q Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~I 108 (330) +.+ +..+ ++.+....+.... ....+ ...-..+-.++.|++||+.++++| T Consensus 1 MGl-f~~~---------------~~~~~~~~~~~~~-------~~~~~--------~~~~~~~~~~~~v~~~I~~ia~~i 49 (395) T protein:vir:98 1 MGI-LDFF---------------SFKKSGTLSDDDS-------GSTTS--------EKLTNVVLKEDALYKCVNYLARII 49 (395) T ss_pred Ccc-hhhh---------------cCCCccccccccc-------chhhh--------hhcchhhhhhHHHHHHHHHHHHHH Confidence 111 0000 0000000000000 00000 011122234789999999999999 Q ss_pred hhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEe Q lcl|NC_019511. 109 STYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFS 188 (330) Q Consensus 109 a~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~ 188 (330) |+. .+.+.-++ +.+.. .+.+.+++...||+++|.++|++.++.++|+.|++|+++++ T Consensus 50 A~l-----------p~~~~~~~-~~~~~----------~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~~~- 106 (395) T protein:vir:98 50 SKS-----------TFRLKTPE-KLTEN----------QKDWLYWINTKANPNQSASQFWVEVIQKLLVDGETLIFVIP- 106 (395) T ss_pred hhC-----------ceeEEecC-Ccccc----------cchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEe- Confidence 963 33332111 11111 13344455568999999999999999999999999888764 Q ss_pred cCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCC-ceEEEechhHeeeecccCcCCCCCCCccccHHHHHHHH Q lcl|NC_019511. 189 PKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDK-QVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKE 267 (330) Q Consensus 189 rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~-~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~ 267 (330) ++.+ ++.++..+... . .+...+.....+ +...+|.++||+|++.+.... ..+|.++++.+..+ T Consensus 107 -~~~~-----~~~~~~~~~~~------~-~~~~~~~~~~~~~~~~~~~~~~evih~k~~~~~~---~~~~~~~~~~~~~~ 170 (395) T protein:vir:98 107 -GKGI-----YVADSFTQDKK------I-SGSQFKVSRVQGQTYEKTFTFDQVIYLKNDNSDL---MSKVESLWEEYGEL 170 (395) T ss_pred -CCce-----ecCCccccccc------c-cCcccceeeecCceeeeEecCccEEEecCCCCCc---cccccchhhhHHHH Confidence 3322 22222221110 1 011112222233 224578999999998654322 23455566666666 Q ss_pred HHHHHHH--HHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcc-cccccceeeC Q lcl|NC_019511. 268 FIAYNNT--ESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGIN-GSWQICLYIK 330 (330) Q Consensus 268 I~~~laa--e~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~-na~kvpvL~e 330 (330) +..++.. ..++.++|.+++.+.|++..+.. ..++++.++++++|++.+++.. |+++ ++++| T Consensus 171 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~-v~~l~ 234 (395) T protein:vir:98 171 LGHVINNQKIANQIRFTMIPPKDKVRERAQEN-SDGGRQSKSDKDFFKRTVEKIRTESVV-GIPVT 234 (395) T ss_pred HHHHHHHHHHHHHHHHhhcccccccccccccc-CCcHHHHHHHHHHHHHHHhhhhcCCcc-eeecC Confidence 6655554 44556789999988888765442 3577788899999999888754 3333 33345 No 105 >protein:vir:94869 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762515;genbank:gi:115304214;genbank:GeneID:5141182 Probab=99.76 E-value=1.3e-20 Score=129.59 Aligned_cols=207 Identities=14% Similarity=0.118 Sum_probs=116.1 Q ss_pred hhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhH Q lcl|NC_019511. 29 IQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQV 108 (330) Q Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~I 108 (330) +.+ ..++++.-..+. ..|.. |. .+.. + .+.. .+++.|++||+.|+++| T Consensus 1 M~i-----f~~~~~~~~~~~-~~~~~--------------~~----~~~~-~----~~~~---~~~~~v~~~v~~Ia~~i 48 (378) T protein:vir:94 1 MNL-----FGKVVSFSRGKL-NNDTQ--------------RV----TAWQ-N----EAVE---YTSAFVTNIHNKIANEI 48 (378) T ss_pred Cch-----hHHhHhhhhccc-ccCcc--------------ee----eeee-c----chhh---hhhHHHHHHHHHHHHhH Confidence 222 222221100000 01110 00 0111 0 0110 12578999999999999 Q ss_pred hhhhh-hheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEE Q lcl|NC_019511. 109 STYCK-PARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVF 187 (330) Q Consensus 109 a~~~~-~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~ 187 (330) |+... .++..+++.+.. + ......|.+.+++...||+++|.++|++.++.++|+.|++|++.++ T Consensus 49 A~lp~~~~~~~~~~~~~~-----~----------~~~~~~~~l~~lLn~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~ 113 (378) T protein:vir:94 49 TKVEFNHVKYKKSDVGSD-----T----------LISMAGSDLDEVLNWSSKGERNSMEFWQKVIKKLLTTRYIDLYPIF 113 (378) T ss_pred hhCceeeeeecccccccc-----c----------ccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEe Confidence 96432 222222221100 0 0112335566777778999999999999999999999999988764 Q ss_pred ecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecccCcCCCCCCCccccHHHHHHHH Q lcl|NC_019511. 188 SPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKE 267 (330) Q Consensus 188 ~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~ 267 (330) ++..|+++.+++ ..++ .+|.++||+|++. |... .-+.++++.+..+ T Consensus 114 -~~~~g~~~~~~~--------------------------~~~~--~~~~~~dvih~~~-~~~~----~~~~~~~~~~~~~ 159 (378) T protein:vir:94 114 -DSETGELLDLLF--------------------------ANDK--KEYKPEELVRLTS-PFYI----NEDTSILDNALAS 159 (378) T ss_pred -eCCCCcEEEEEE--------------------------ecCc--EEechhceeeecC-cCCc----ccchhHHHHHHHH Confidence 344455443322 2233 3478899999963 2211 1266777777665 Q ss_pred HHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHH----HHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 268 FIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHA----LENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 268 I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~----~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) +..+ +++| .++|+|..++ .+++++ .+++++.|++.++| .|+|+++||-+ T Consensus 160 ~~~~----------~~~~-~~~g~l~~~~--~l~~~~~~~~~e~~~~~~~~~~~~-~n~~~~~vl~~ 212 (378) T protein:vir:94 160 IQTK----------LEQG-KLRGLLKINA--FLDIDNTQEYREKALATIKNMQEG-SSYNGLTPVDN 212 (378) T ss_pred HHHH----------HhhC-CcccceeeCC--cCCHHHHHHHHHHHHHHHHHhhcc-cccccceeccC Confidence 5422 3444 5889998876 477665 45566666655555 57888777665 No 106 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=99.04 E-value=8.9e-11 Score=75.68 Aligned_cols=246 Identities=13% Similarity=0.134 Sum_probs=128.8 Q ss_pred HHHHHHHHHhhcccchh-ccccchhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhh Q lcl|NC_019511. 34 RQIEQDTKEMQEITKSL-YGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYC 112 (330) Q Consensus 34 ~~~~~~~~~~~~~~~~~-~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~ 112 (330) -.+...+.+... .. ..+.+. |..++.+. ....++.+..|+.++++++||++.+++..+ T Consensus 1 ~~~~D~~~~~~~---~~g~~~~~~--------------~~~~~~~~--~~~~~~l~a~Y~~~~l~~~~vd~~a~d~~r-- 59 (437) T protein:vir:52 1 MKFFDGIKSLAL---KLGSKQEQT--------------YYSPSLSL--TDDLVQLEALWRDNWIANKVCIKRPEDMVR-- 59 (437) T ss_pred CchhhhhHhHHh---cCCCccccc--------------eeecCccc--cccHHHHHHHHHhCchhhHHhhcchHHhhc-- Confidence 001111111100 00 011111 11111111 122345667788899999999999998763 Q ss_pred hhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCC- Q lcl|NC_019511. 113 KPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKN- 191 (330) Q Consensus 113 ~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~- 191 (330) -||.+.-.|. +.+.++.++..+.++ .+.+-+...++..-++|.+++.++. ++ T Consensus 60 ---------~~~~i~~~d~-------~~~~~~~~~~~~~~l---------~~~~~l~~a~~~~rl~G~a~i~i~~--d~~ 112 (437) T protein:vir:52 60 ---------NWREIYSNDL-------NSKQLDLFTKFERSL---------KLRETLTKALQWSSLYGSVGLLVVT--DSQ 112 (437) T ss_pred ---------CCceEecCCC-------CHHHHHHHHHHHHhh---------cHHHHHHHHHHhcccccceEEEEEe--cCC Confidence 2555543221 123445555555544 2233333334444467877777654 44 Q ss_pred --------CcceEEEEeeCCCceEEeeCCCCcc--cC-CceeEEEEeCCceEEEechhHeeeecccCcCCCCCCCccccH Q lcl|NC_019511. 192 --------KTKMEKFIAVDPSTIFYATDKNGKI--IK-GGNRFVQVIDKQVVASFTSRELVMGIRNPRSDLNSSGYGLSE 260 (330) Q Consensus 192 --------~G~~~~L~pldp~tV~~~~d~~G~~--~~-~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSP 260 (330) .|.+..|.++++..|.+..-++-.. +. +...++++..++....+-++.|+|+...+........+|.|+ T Consensus 113 ~~~~pl~~~~~~~~~~v~~~~~v~~~~~~~~dp~s~~fg~p~~y~v~~~~~~~~iH~SRii~~~~~~~~~~~~~~~G~s~ 192 (437) T protein:vir:52 113 NTSAPLKPTERLKRLIILPKWKISPTGTKDDDVLSPNFGRYSEYSILGGSQSITVHHSRLIILNANDAPLSDNDIWGVSD 192 (437) T ss_pred CcccccccCCceeEEEEechhhccccccccccccccccCcceEEEEecCCcceeEccceeEEecCccCCCccccccCCch Confidence 3778899999998887533222111 00 112344444454455678889999976554433344459999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCC-CCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 261 VEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRAD-QQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 261 Ie~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~-~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) ++.+...|.....+....+.++.+...+ ++.+++- ..++....+.+++.++....+ .+.+.+.||=. T Consensus 193 le~~~~~i~~~~~~~~~~~~l~~~~~~~--v~k~~~l~~~l~~~~~~~~~~~~~~~~~~-~~~~~~~~~d~ 260 (437) T protein:vir:52 193 LEKIIDVLKRFDSASVNVGDLIFESKID--IFKIAGLSDKIAAGMENEVASVISAVQEI-KSATNSLLLDA 260 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCC--ceecchHHHHhcCCcHHHHHHHHHHHHHh-cCCCceEEEcC Confidence 9999999999999999988877665444 2334321 112222223333333333233 23445444422 No 107 >protein:vir:99853 Length: 488 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164068;genbank:gi:56692600;genbank:GeneID:3192581 Probab=98.93 E-value=1.8e-09 Score=68.59 Aligned_cols=249 Identities=10% Similarity=0.084 Sum_probs=138.2 Q ss_pred HHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHH-HHHHHHHhhcHHHHHHHHHHHHhHh Q lcl|NC_019511. 31 ANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNL-HEVLKKFGNNSILNAIIITRANQVS 109 (330) Q Consensus 31 ~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~-~~~Lr~~a~~~iv~a~I~~~~d~Ia 109 (330) +...-+.++.. ....+++..- ++.. +...+.++.++....+ .+..+.+.+.+-|.+|+++|+..|. T Consensus 1 v~~~~l~~e~a------t~~~~~d~~~--~~~~-----~l~~~~~~il~~a~~g~~~~y~~l~~D~~i~s~l~~rk~av~ 67 (488) T protein:vir:99 1 MEKPALGREIA------TSGDGRDITR--PFIS-----GLQVPNDSILQRRGGNDLRVYEEILSDAQVKTVWGQRQLAVV 67 (488) T ss_pred CCccchhHHHH------HHHhhhhhhc--cccC-----CCCCCChHHHHhhccCCHHHHHHHhhChHHHHHHHHHHHHHh Confidence 11111111110 0111222211 1111 1111112333322211 2233333457889999999999997 Q ss_pred hhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEec Q lcl|NC_019511. 110 TYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSP 189 (330) Q Consensus 110 ~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~r 189 (330) ++.|.|.+.+. +.++.+....++.+|.. .+|.+++..++ |.+.+|-..+|++|.+ T Consensus 68 -----------~~~w~i~p~~~----~~~~~~~ae~v~~~l~~---------~~~~~~l~~~l-da~~~G~s~~Ei~w~~ 122 (488) T protein:vir:99 68 -----------SREWKVEAGGD----RPIDQAAAEHLEQQLQR---------VGWDRVTSKML-FGVFYGYAVSELIYGR 122 (488) T ss_pred -----------cCCceEEcCCC----ChHHHHHHHHHHHHHhC---------CCHHHHHHHHH-hhhhhcceeEEEEEee Confidence 46888876532 23344445555555532 25888888876 6788999999999976 Q ss_pred C-CCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechh-HeeeecccCcCCCCCCCccccHHHHHHHH Q lcl|NC_019511. 190 K-NKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSR-ELVMGIRNPRSDLNSSGYGLSEVEIAMKE 267 (330) Q Consensus 190 d-~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~-dvih~~~n~~~d~~~~~yGlSPIe~a~~~ 267 (330) + +...|..|.+++|..+.. +.+|. .++....+......++.. ..++.++.+.++ .+||.|.+..|... T Consensus 123 ~~g~~~~~~l~~r~~~~f~~--d~~~~-----l~~~~~~~~~~g~~lp~~~~~i~~~~~~~~g---~p~g~gLl~~~~w~ 192 (488) T protein:vir:99 123 DDRYITLEAIKVRNRRRFRY--DQDGG-----LRLLTPNNMFEGEPCPAPYFWHFSTGADNDD---EPYGLGLAHWLYWP 192 (488) T ss_pred cCCeeeEeeeeeecccceee--cCCCc-----eEEeccCCCCCccccccCceEEEEeecCCCC---CcccchHHHHHHHH Confidence 5 445677899999987764 33332 223211111112233322 333223334433 67899999999999 Q ss_pred HHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 268 FIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 268 I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) .-.-....++-+.|...-+.|--+-.++. ...|+++++.|.+...+..+. .++=+|-=.| T Consensus 193 ~~fK~~~~~~w~~f~E~yG~P~~igky~~-~~a~~~ek~~l~~av~~~~~~--~~~viP~~~~ 252 (488) T protein:vir:99 193 VFFKRNGIKFWLIFLDKFGMPTAVGRYDD-KTATPEDKAKLLAALHAIQTD--SAIIMPAGMQ 252 (488) T ss_pred HHHHHhhHHHHHHHHHHcCCceeeeecCC-CCCCHHHHHHHHHHHHHHhcC--cEEEecCCce Confidence 88888888888888888888854444432 246888888888887775332 2222232222 No 108 >protein:vir:79063 Length: 491 # NCBI annotation: gp3 # Family: family:all:313 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111203;genbank:gi:134288841;genbank:GeneID:4960737 Probab=98.81 E-value=2.6e-09 Score=67.62 Aligned_cols=258 Identities=9% Similarity=0.063 Sum_probs=143.2 Q ss_pred CCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCC-CcccchHHHHHHHHHH Q lcl|NC_019511. 13 MYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKK-SYMRNAHNLHEVLKKF 91 (330) Q Consensus 13 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~-s~~r~~~~~~~~Lr~~ 91 (330) +++.=-.+.-.|+.. .+..+.. ...++++...+....... ..+.+ +-|+......+..+.. T Consensus 1 ~~~~i~~~~g~~~~~------~~~~~~~------~~~ia~~~~~~~~~~~~~------~~p~~~~il~~~~~~~~~y~~m 62 (491) T protein:vir:79 1 MSKGLWVSPTEFVKF------GEPDKSL------SSQIATRARSIDFFALGM------YLPNPDPVLKALGKDIRVYREL 62 (491) T ss_pred CCCeeeCCCCCcccc------cccchhH------HHHHhhhccccccccccc------cCcchhHHHhhccCCHHHHHHH Confidence 221111111111111 0000110 111233333343322111 11111 1122111122333333 Q ss_pred hhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHH Q lcl|NC_019511. 92 GNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKI 171 (330) Q Consensus 92 a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~ 171 (330) -+.+-|.+|+++|+..|. ++.|.|.+.+.+. +..+.+++.|..+ .|.+++..+ T Consensus 63 ~~D~~i~s~l~~Rk~av~-----------~~~w~i~~~~~~~-------~~a~~i~e~l~~~---------~~~~~i~~~ 115 (491) T protein:vir:79 63 RADAHVGGCVRRRKAAVK-----------ALEWGLDRGKAKS-------RVAKSIADVFADL---------DLSRIATEM 115 (491) T ss_pred hhChHHHHHHHHHHHHHh-----------CCCcEEecCCCCH-------HHHHHHHHHHhcC---------CHHHHHHHH Confidence 447889999999999997 4678887643321 1234444544332 477787776 Q ss_pred HHHHHhcCCceeEEEEecCC-CcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecccCcCC Q lcl|NC_019511. 172 VRDTYTYDQVNFEKVFSPKN-KTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSD 250 (330) Q Consensus 172 v~d~L~~g~g~~~~v~~rd~-~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d 250 (330) + |.+.+|-...|++|..++ ...|..|.+++|..+... .+|. .++....+......+.....++.++.+.++ T Consensus 116 l-da~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~f~~d--~~~~-----l~l~~~~~~~~g~~lp~~k~i~~~~~~~~g 187 (491) T protein:vir:79 116 L-DAVLYGYQPMEITWGKVGNYIVPIDVVGKPADWFVYD--PENQ-----LRFRSKEHWVQGEELPARKFLVPRQEATYL 187 (491) T ss_pred H-HhhhhcceeEEEEEeecCCeeeEEeeeeecccceeec--cCCc-----eEEeecCCCCCceeecCCCeEEEEecCCCC Confidence 5 677799999999987753 356778999999877642 2332 334332222222345566566555555544 Q ss_pred CCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 251 LNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 251 ~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) .+||.|.+..|....-.-....++-+.|...-+.|--+-.++. ..++++++.|.+...+..+. .++=+|-=.| T Consensus 188 ---~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~~G~P~~igky~~--~a~~~ek~~l~~al~~~~~~--a~~viP~~~~ 260 (491) T protein:vir:79 188 ---NPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHPR--SASDAETNLLLDRLEDMVQD--AVAVIPDDSS 260 (491) T ss_pred ---CcccchhHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEecCC--CCCHHHHHHHHHHHHHHhcC--eEEEecCCce Confidence 5789999999999999988889999999998888876665543 46888889988888775332 2222222111 No 109 >protein:vir:108215 Length: 469 # NCBI annotation: gp6 # Family: family:all:2372 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552335;genbank:gi:160700655;genbank:GeneID:5758935 Probab=98.80 E-value=7.6e-09 Score=65.09 Aligned_cols=257 Identities=12% Similarity=0.153 Sum_probs=140.2 Q ss_pred HHHhhcccchhccccchhccccccccccCCCCCc---C--CCcccchHHHHHHHHHHh-hcHHHHHHHHHHHHhHhhhhh Q lcl|NC_019511. 40 TKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRD---K--KSYMRNAHNLHEVLKKFG-NNSILNAIIITRANQVSTYCK 113 (330) Q Consensus 40 ~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~---~--~s~~r~~~~~~~~Lr~~a-~~~iv~a~I~~~~d~Ia~~~~ 113 (330) .-.+.+++. +..+...+.+-...+.+++ + ...||.. ...++-+.+. +-+-|.+|+++|+.-|. T Consensus 1 ~~~~~~~~~------p~~~~g~~~~~~~~~~~~~~~~~e~~~~lr~~-~~~~ly~~m~e~D~~i~s~l~~rk~av~---- 69 (469) T protein:vir:10 1 MTERVKTAA------PVSEAGYVFGSGVVDGWTVWDPFEQTPELQWP-QSVAVYSRMDNEDSRVTSLLEAISLPIR---- 69 (469) T ss_pred CCCcccCCC------Cccchhhhhhcccccchhhccccccccccccc-cchHHHHHHHhhChHHHHHHHHHHHHHh---- Confidence 000001000 0000000000000111221 1 0123321 1122333332 36789999999999997 Q ss_pred hheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCC--------CCCCcCCHHHHHHHHHHHHHhcCCceeEE Q lcl|NC_019511. 114 PARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTD--------KDIDRDSFQEFCKKIVRDTYTYDQVNFEK 185 (330) Q Consensus 114 ~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~--------~pn~~~s~~~fl~~~v~d~L~~g~g~~~~ 185 (330) ++.|.|.+.+.+.+ ..+.+...|..+... ....+.+|++++..++.+.+.+|-...|+ T Consensus 70 -------~~~w~v~p~~~~~e-------~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~w~~~l~~~l~~a~~~G~s~~Ei 135 (469) T protein:vir:10 70 -------STPWRIRANGASDE-------VTEFVSRNLMVPIDGEDDVRNPGRSRGRFSWAEHLEEVTSPTLQFGHAVFEQ 135 (469) T ss_pred -------cCCceEecCCCCHH-------HHHHHHHHHHhhhhhhhhhhhhhhhhccccHHHHHHHHHHHhhhhCceeeee Confidence 46888876543321 122222233222211 11234578888888888888899999999 Q ss_pred EEecCC-----CcceEEEEeeCCCceE-EeeCCCCcccCCceeEEEE-----------eCCceEEEechhHeeeecccCc Q lcl|NC_019511. 186 VFSPKN-----KTKMEKFIAVDPSTIF-YATDKNGKIIKGGNRFVQV-----------IDKQVVASFTSRELVMGIRNPR 248 (330) Q Consensus 186 v~~rd~-----~G~~~~L~pldp~tV~-~~~d~~G~~~~~~~~Y~q~-----------~~~~~~~~~~~~dvih~~~n~~ 248 (330) +|.+.+ .-.+..|.+.++.++. ...+.++.+. .+.|. ..+.....++....++.++++. T Consensus 136 vw~~~~~~~dG~~~~~~l~~rp~~~i~~~~~~~~~~l~----~~~~~~~~~~~~~~~~~~~~~~~~lp~~k~i~~~~~~~ 211 (469) T protein:vir:10 136 VYRPRNQSPDGRFWLRKLAPRPQWTISKFNVAPDGGLE----SIEQIAPPARTRGSLYVANIAPPEIPVNRLVVYTRNKR 211 (469) T ss_pred eeecccccCCCceeeeeeeecCcccceeeeeccCCcee----eeeecCcccccccccccCCCCccccccCcEEEEEecCC Confidence 998653 2357778888887763 2333333211 12221 1111223455556565555554 Q ss_pred CCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccce- Q lcl|NC_019511. 249 SDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICL- 327 (330) Q Consensus 249 ~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpv- 327 (330) ++ ++||.|.+..|....-.-....++-+.|-..-+.|-=+..++. ..++++++.|.+...+...|...+.=+|- T Consensus 212 ~g---~p~g~gLlr~~~~~~~fK~~~~~~w~~f~EryG~P~~vgky~~--~a~~~ek~~l~~a~~~~~~g~~a~~iip~~ 286 (469) T protein:vir:10 212 PG---QWQGKSILRSAYKHWLLKDKLLRIEAATAERNGMGIPVGTASS--ATDEDEVRKMAALARSVRGGINAGVGLAQG 286 (469) T ss_pred CC---CcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCcceEEecCC--CCCHHHHHHHHHHHHHHhcCCceEEEccCC Confidence 43 5789999999999988888888888888888777765555543 46888999999888877656432222221 Q ss_pred ----eeC Q lcl|NC_019511. 328 ----YIK 330 (330) Q Consensus 328 ----L~e 330 (330) ++| T Consensus 287 ~~ie~~e 293 (469) T protein:vir:10 287 QILELLG 293 (469) T ss_pred ceEEEee Confidence 122 No 110 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=98.79 E-value=7.8e-09 Score=65.03 Aligned_cols=285 Identities=12% Similarity=0.074 Sum_probs=134.6 Q ss_pred hHHHHHhcCCCCCCc-----cccc---CccCcchhHHHHHHHHHHHHhhcccchh-------ccccchhccccccc---- Q lcl|NC_019511. 4 LFKSLRLGSMYKEDT-----EDLM---VPIDDGIQANIRQIEQDTKEMQEITKSL-------YGKQQAYAEPFLEM---- 64 (330) Q Consensus 4 ~~~~~~~~~~~~~~~-----~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~g~~~~~~~~~~~~---- 64 (330) .|+++||....+... +... ++..++-.+.. ...+-..+....+... .|-..+.+...... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~d~~~~~~~~ 79 (537) T protein:vir:10 1 MFKFWRKKTVEAVQSSIAERIEPRVGIFGAGDDEKPFT-RAQLVHQTMMAIRDHAIAMMPKVDGSHPDMAMDGLDVEGGT 79 (537) T ss_pred CCCccccccccccccccccccccccCCCcccchhhHHH-HHHhhhhccCCCCCccCcccccccccccchhccccccchhh Confidence 566665444222221 1111 11112221211 1111111111111111 11111111111100 Q ss_pred c-----ccCC----CCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCccc Q lcl|NC_019511. 65 M-----DTNP----DYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATP 135 (330) Q Consensus 65 ~-----~~~p----~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~ 135 (330) + ..++ .|.... . -.+++++..|+.++++++||++++++..+ -||.+.-.+.++ . T Consensus 80 ~~~~~~~~~~~~~~~~~~~~----~-~~~~~l~a~Y~~~~l~r~iVd~~A~d~~r-----------~~~~i~~~~~~~-~ 142 (537) T protein:vir:10 80 FSAYANPNLSEGLVLWYAQQ----A-FIGHQMCALIATHWLVNKACSQMPRDAMR-----------KGYKIISDDGNE-L 142 (537) T ss_pred hhhhccccccchhhhhcccc----C-CccHHHHHHHHhCchhhhhhhhhhHHhhc-----------CCceeecCCccc-c Confidence 0 0000 011111 1 12467888899999999999999999853 255655433221 1 Q ss_pred ChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEe-cC-------------CCcceEEEEee Q lcl|NC_019511. 136 GIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFS-PK-------------NKTKMEKFIAV 201 (330) Q Consensus 136 ~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~-rd-------------~~G~~~~L~pl 201 (330) .+ +.+++++..+.++. -+..|.++ ++..-++|.+++++... +| +.|....|..+ T Consensus 143 ~~---~~~~~l~~~~~~l~--------~~~~l~~a-~~~~rlyG~~~i~i~v~~~D~~~~~~Pl~~~~i~kg~~k~l~vi 210 (537) T protein:vir:10 143 DP---KDAKFIDRYDRAFN--------IKKHAIQF-VRKGRIFGIRIALFKVDSPDPYYYEKPFNIDGVMPGAYKGIVQI 210 (537) T ss_pred cH---HHHHHHHHHHHHhh--------HHHHHHHH-HHhcccccceEEEEeecCcCCcccccccccccccccceeEEEEe Confidence 22 23344444444431 12334444 34444578777776543 22 23345677788 Q ss_pred CCCceEEee------CCCCcccCCceeEEEEeCCceEEEechhHeeeecccCcCCCCCC---CccccHHHHHHHHHHHHH Q lcl|NC_019511. 202 DPSTIFYAT------DKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSDLNSS---GYGLSEVEIAMKEFIAYN 272 (330) Q Consensus 202 dp~tV~~~~------d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~~~~---~yGlSPIe~a~~~I~~~l 272 (330) ||.-+.+.. |.....+-.+ .+++ +.+. .+-++.|+|+..++..+.... .+|.|.++.+...|.... T Consensus 211 dp~~~~~~~~~~~~~dp~sp~fg~P-~~y~-v~g~---~iH~SRli~f~g~~~p~~~~~~~~~~G~Svlq~~~~~l~~~~ 285 (537) T protein:vir:10 211 DPYWCAPLLDAQASSNPVSMHFYEP-TYWL-INGK---KYHRSHLAIYINDEVVDFLKPSYIYGGVPLPQQIMERVYAAE 285 (537) T ss_pred chhhcccccchhhhccCCccccCCc-eeee-ecCe---EecceeEEEecCCCCchhhhcccCcccccHHHHHHHHHHHHH Confidence 876555432 1111111111 2333 3443 456788999987776665432 349999999999999998 Q ss_pred HHHHHHHHHHhcCCCcceEEEeCCCCCC-CHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 273 NTESFNDRFFSHGGTTRGILQIRADQQQ-SQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 273 aae~~~~~fF~nGa~p~GiL~~~~~~~l-s~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) .+....+.++...... ++.+++...+ +++++..-.+.|. .+.+|.+- +++. T Consensus 286 ~t~~~~~~l~~~~~~~--v~k~~~~~~l~~~~~~~~r~~~~~---~~r~n~g~--~~id 337 (537) T protein:vir:10 286 RTANEGPMLAMTKRQT--VLKVDAAQVLANKQQFDETMSWWT---ATRDNYQV--RVVD 337 (537) T ss_pred HHHHHHHHHHHhcCCc--eeeechHHhhcCHHHHHHHHHHHH---hhcCCcce--eEec Confidence 8888888777766544 2334433222 3444333333443 33344442 4443 No 111 >protein:vir:107880 Length: 491 # NCBI annotation: gp29 # Family: family:all:313 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024702;genbank:gi:48696939;genbank:GeneID:2845968 Probab=98.77 E-value=1.2e-08 Score=64.09 Aligned_cols=258 Identities=9% Similarity=0.054 Sum_probs=140.5 Q ss_pred CCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCC-cccchHHHHHHHHHH Q lcl|NC_019511. 13 MYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKS-YMRNAHNLHEVLKKF 91 (330) Q Consensus 13 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s-~~r~~~~~~~~Lr~~ 91 (330) +++.=-.+.-.|++-... -+.+.++ ++-+......|... .....+. -||......+..+.+ T Consensus 1 m~~~i~~~~g~p~~~~~~--~~~~~~~----------ia~~~~~~~~~~~~------~~~~~~~~iLr~~~~~~~~y~~m 62 (491) T protein:vir:10 1 MSKGLWVSPTEFVTFGEP--DKSLSSQ----------IATRARSIDFFALG------MYLPNPDPVLKALGKDIRVYREL 62 (491) T ss_pred CCCceeCCCCCccCcccC--ChHHHHH----------HHhhhccccccccc------CCccchHHHHHhcCCCHHHHHHH Confidence 322111111122211110 0111111 11111111111110 0111111 122111112223333 Q ss_pred hhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHH Q lcl|NC_019511. 92 GNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKI 171 (330) Q Consensus 92 a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~ 171 (330) -+.+-+.+|+++|+..|. ++.|.|.+.+.+ + +..+.+++.|.++ .|.+++..+ T Consensus 63 ~~D~~i~s~l~~Rk~av~-----------~~~w~i~~~~~~----~---~~~e~v~e~l~~~---------~~~~~l~~~ 115 (491) T protein:vir:10 63 RADAHVGGCVRRRKAAVK-----------ALEWGLDRGKAK----S---RVAKSIADVFADL---------DLSRIVTEM 115 (491) T ss_pred hhChHHHHHHHHHHHHHh-----------CCCcEEecCCCC----H---HHHHHHHHHHhcC---------CHHHHHHHH Confidence 346789999999999997 467888754322 1 2234455554432 578888887 Q ss_pred HHHHHhcCCceeEEEEecCC-CcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecccCcCC Q lcl|NC_019511. 172 VRDTYTYDQVNFEKVFSPKN-KTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSD 250 (330) Q Consensus 172 v~d~L~~g~g~~~~v~~rd~-~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d 250 (330) + |.+.+|-..+|++|..++ ...|..|.++++..+.. +.+|. .+|....++.....+.....++.++.+.++ T Consensus 116 l-da~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~f~~--d~~~~-----l~~~~~~~~~~g~~l~~~k~i~~~~~~~~~ 187 (491) T protein:vir:10 116 L-DAVLYGYQPMEITWGKVGNYIVPIDVVGKPADWFVY--DPENQ-----LRFRSKDHWMQGEELPARKFLVPRQEATYL 187 (491) T ss_pred H-HhhhhcceeEEEEEeecCCeeEEEEeeeecccceee--ccCCc-----eEEecCCCCCCcceecCCCEEEEEecCCCC Confidence 6 678899999999998754 34677899999987764 32332 334322222222345555555555444443 Q ss_pred CCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 251 LNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 251 ~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) .+||.|.+..|....-.-....++-+.|-..-+.|--+-.++ ...+++++++|.+...+..+. .++=+|-=.| T Consensus 188 ---~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~--~~a~~~ek~~l~~al~~~~~~--a~~viP~~~~ 260 (491) T protein:vir:10 188 ---NPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHP--RSASDGEKNLLLDCLEDMVQD--AVAVVPDDSS 260 (491) T ss_pred ---CcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEecC--CCCCHHHHHHHHHHHHHHhcC--cEEEecCCce Confidence 578999999999999888888888888888878786555554 446888999998888876332 2222232222 No 112 >protein:vir:103860 Length: 528 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938234;genbank:gi:38229139;genbank:GeneID:2648175 Probab=98.73 E-value=5.3e-08 Score=60.47 Aligned_cols=262 Identities=11% Similarity=0.051 Sum_probs=139.0 Q ss_pred ccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcC-C-Cccc-----chHHHHHHHHHH Q lcl|NC_019511. 19 EDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDK-K-SYMR-----NAHNLHEVLKKF 91 (330) Q Consensus 19 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~-~-s~~r-----~~~~~~~~Lr~~ 91 (330) ...+++..... +....+.+. ....+.|-...+..- ...+..|. . +-|+ +...-.++.+.+ T Consensus 1 ~~~~~d~~g~p-~~~~~~~~~------~~~~~~~~~~~~~~~------~~~gltp~~l~~il~~a~~gd~~~~~~L~~~m 67 (528) T protein:vir:10 1 MAAIVDIYGNP-LRTQQLRKQ------QTAHLAGLAKEFANH------PAKGLTPAKLAHILIEAEQGHLQAQAELFMDM 67 (528) T ss_pred CCeeECCCCCc-cccccccch------hhhhhhhhhhhhccc------CCCCCCHHHHHHHHHhhhCCCHHHHHHHHHHH Confidence 22222221110 001111111 001122211111110 00011110 0 1111 111122233333 Q ss_pred h-hcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHH Q lcl|NC_019511. 92 G-NNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKK 170 (330) Q Consensus 92 a-~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~ 170 (330) - +.+-|.+|+++|+..|. ++.|.|.+.+.+ +..+.+....++.+|... ..|.+++.. T Consensus 68 ~e~D~~i~s~l~~Rk~av~-----------~~~w~I~p~~~~---~~~~~~~a~~v~~~l~~~--------~~f~~~i~~ 125 (528) T protein:vir:10 68 EERDAHLFAEMSKRKRAVL-----------GLDWTIEPPRNA---SAAEKADAEYLHELLLDL--------EGIEDLMLD 125 (528) T ss_pred HhhChHHHHHHHHHHHHHh-----------cCCceEecCCCC---CHHHHHHHHHHHHHHhCC--------ccHHHHHHH Confidence 3 35779999999999997 468888765322 223334445555555332 246677766 Q ss_pred HHHHHHhcCCceeEEEEecC-CCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecccCcC Q lcl|NC_019511. 171 IVRDTYTYDQVNFEKVFSPK-NKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRS 249 (330) Q Consensus 171 ~v~d~L~~g~g~~~~v~~rd-~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~ 249 (330) ++ |.+.+|-...|++|..+ +...|..+.++++..+.+.. ++. .++....+......+.+...++.++.+.+ T Consensus 126 ~l-da~~~G~s~~Ei~w~~~~g~~~~~~~~~r~~~~f~~~~--~~~-----~~l~~~~~~~~g~~l~~~k~iv~~~~~~~ 197 (528) T protein:vir:10 126 CM-DGVGHGYSAIELDWSLQGREWLPQAFDHRPQSWFQLNP--DDQ-----DELRLRDNSIAGEVLQPFGWIMHKPRSRS 197 (528) T ss_pred HH-hhhhhcceeEEEEEeecCCceeEEEeeeecccceeecc--CCC-----cEEeccCCCCCceeecCCCeEEEeecCCC Confidence 54 56779999999998775 34577789888887765532 221 12222111111233455554433444443 Q ss_pred CCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceee Q lcl|NC_019511. 250 DLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYI 329 (330) Q Consensus 250 d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~ 329 (330) + .+||.+.+..|......-..+.++-+.|...-+.|--+-.++. ..++++++.|.+...+..++ .++=+|-=. T Consensus 198 g---~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~--~a~~~ek~~L~~al~~i~~~--~~~iiP~~~ 270 (528) T protein:vir:10 198 G---YVARSGLFRVLAWPYLFKHYSTADLAEMLEIYGLPIRLGKYPP--GTPDEEKVTLLRAVTGLGHA--AAGIIPESM 270 (528) T ss_pred C---CccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEecCC--CCCHHHHHHHHHHHHHHhhC--cEEEecCCc Confidence 3 5689999999999999888888898999998888876665543 46888999999888776433 222233222 Q ss_pred C Q lcl|NC_019511. 330 K 330 (330) Q Consensus 330 e 330 (330) | T Consensus 271 ~ 271 (528) T protein:vir:10 271 S 271 (528) T ss_pred e Confidence 2 No 113 >protein:vir:99232 Length: 526 # NCBI annotation: putative portal protein # Family: family:all:313 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950451;genbank:gi:119953652;genbank:GeneID:4643092 Probab=98.68 E-value=1.1e-07 Score=58.65 Aligned_cols=262 Identities=11% Similarity=0.064 Sum_probs=140.0 Q ss_pred ccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCC--Ccccch-----HHHHHHHHHH Q lcl|NC_019511. 19 EDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKK--SYMRNA-----HNLHEVLKKF 91 (330) Q Consensus 19 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~--s~~r~~-----~~~~~~Lr~~ 91 (330) ...+++.... .+....+.+. ....+.|-...+..- ...+..|.- +-||.. ..-.++.+.+ T Consensus 1 ~~~~~d~~g~-p~~~~~~~~~------~~~~~~~~~~~~~~~------~~~gltp~~l~~iLr~a~~gd~~~~~~L~e~m 67 (526) T protein:vir:99 1 MAQIVDVYGN-PIRTQQLREP------QTSRLAGLAKEFAQH------PAKGLTPAKLARILVEAEQGNLQAQAELFMDM 67 (526) T ss_pred CCeeECCCCC-ccccccccch------hhhhhhhhhhhhccc------CcCCCCHHHHHHHHHhhhCCCHHHHHHHHHHH Confidence 2222222211 1111111111 001122211111110 000111100 112211 1111222233 Q ss_pred h-hcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHH Q lcl|NC_019511. 92 G-NNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKK 170 (330) Q Consensus 92 a-~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~ 170 (330) - +.+-|.+|+.+|+..|+ ++.|.|.+-+.+ +..+.+....++.+|... .+|.+++.. T Consensus 68 ~e~D~~i~s~l~~Rk~av~-----------~~~w~I~p~~~~---~~~~~~~a~~v~~~l~~~--------~~~~~~i~~ 125 (526) T protein:vir:99 68 EERDAHLFAEMSKRKRAIL-----------GLDWAVEPPRNA---SAAEKADADYLHELLLDL--------EGLEDLLLD 125 (526) T ss_pred HhhChHHHHHHHHHHHHHh-----------CCCceEecCCCC---CHHHHHHHHHHHHHHhcc--------cCHHHHHHH Confidence 3 25789999999999997 467888764322 223344445555555322 247778877 Q ss_pred HHHHHHhcCCceeEEEEecCC-CcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecccCcC Q lcl|NC_019511. 171 IVRDTYTYDQVNFEKVFSPKN-KTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRS 249 (330) Q Consensus 171 ~v~d~L~~g~g~~~~v~~rd~-~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~ 249 (330) ++ |.+.+|-...|++|..++ ...|..|.+++|..+....+..+ +..+..++..-..+.+...+..++.+.+ T Consensus 126 ~l-da~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~f~~~~~~~~-------~l~~~~~~~~g~~l~~~k~i~~~~~~~~ 197 (526) T protein:vir:99 126 AL-DGIGHGYSCIELEWALQGREWMPLAFHHRPQSWFQLNPEDQN-------ELRLRDNSPAGEALQPFGWIIHRPRARS 197 (526) T ss_pred HH-HhhhhcceeEEEEEeecCCceeEEEeeeecccceeeccCCCc-------EEEecCCCCCceeecCCCeEEEeecCCc Confidence 76 677799999999988754 34777899999987765322211 1222222222233455544333444444 Q ss_pred CCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceee Q lcl|NC_019511. 250 DLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYI 329 (330) Q Consensus 250 d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~ 329 (330) + .+||.|.+..|......-..+.++-+.|...-+.|--+-.++. ..++++++.|.+...+..++ .++=+|-=. T Consensus 198 g---~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~--~a~~~ek~~L~~av~~i~~d--~~~iiP~~~ 270 (526) T protein:vir:99 198 G---YVARSGLFRVLAWPYLFRHYATSDLAEMLEIYGLPIRLGKYPP--GTADEEKATLLRAVTGLGHA--AAGIIPETM 270 (526) T ss_pred C---CccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCceEEEecCC--CCCHHHHHHHHHHHHHHhhC--cEEEecCCc Confidence 3 6789999999999998888888888899888888866655543 36888999999888776432 222223222 Q ss_pred C Q lcl|NC_019511. 330 K 330 (330) Q Consensus 330 e 330 (330) | T Consensus 271 ~ 271 (526) T protein:vir:99 271 A 271 (526) T ss_pred e Confidence 2 No 114 >protein:vir:94049 Length: 532 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453629;genbank:gi:84662665;genbank:GeneID:5142559 Probab=98.61 E-value=1.2e-07 Score=58.51 Aligned_cols=273 Identities=10% Similarity=0.046 Sum_probs=134.8 Q ss_pred CchhHHHHHhcCCCCCCcccccCcc---Cc-chhHHHHHHHHHHHHhhcccchhccccchhc----cccc-----ccccc Q lcl|NC_019511. 1 MPDLFKSLRLGSMYKEDTEDLMVPI---DD-GIQANIRQIEQDTKEMQEITKSLYGKQQAYA----EPFL-----EMMDT 67 (330) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~---~~-~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~----~~~~-----~~~~~ 67 (330) |+| .++-+ -|. .. ..+.+. ..++..+... ...|..+.. -|+. +.+.+ T Consensus 1 ~~~------------~~~~~--~~~~~~~~~~~~~~~--~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~a~ 60 (532) T protein:vir:94 1 MAD------------TDPTP--RPEITYATLQQAQRV--DAKRATHTSL----GLATAHEIDPTAYSPYERNAAQNAMAM 60 (532) T ss_pred CCC------------CCCCC--CcceehhhhhhHhhh--hhhhhhhhhh----hhhhhhhhccccccccccccccccccc Confidence 111 11111 111 00 011111 1111111000 011111111 0111 00100 Q ss_pred C---C---------CCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCccc Q lcl|NC_019511. 68 N---P---------DYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATP 135 (330) Q Consensus 68 ~---p---------~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~ 135 (330) . . .|..+.+ -.++++|..|+.++++++||++++++..+ -||++.-.+.++ . T Consensus 61 ~~g~~~~~~~~~~~~~~~~~~-----~~~~~l~a~Y~~~~l~r~~Vd~~aed~~r-----------~~~~i~~~~~~~-~ 123 (532) T protein:vir:94 61 DYGLQTGRNGRNALSFVEATS-----WPGFPTLALLAQLPEYRTMHETPADECVR-----------AWGKITCSSKDE-L 123 (532) T ss_pred ccccCcccccccccccccccc-----cchHHHHHHHHcCchhhhhhccchHHHhh-----------CCceEeeCCccc-c Confidence 0 0 1111111 13567888999999999999999998764 255555433221 1 Q ss_pred ChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEec-----------------CCCcceEEE Q lcl|NC_019511. 136 GIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSP-----------------KNKTKMEKF 198 (330) Q Consensus 136 ~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~r-----------------d~~G~~~~L 198 (330) .+ +.+++++..+.++ ...+-+...++...++|.+++++.... -+.|.+.+| T Consensus 124 ~~---~~~~~i~~~~~~l---------~v~~~l~~a~~~~rlyG~a~i~i~v~~~~~~~~~~~p~~l~~~~I~~g~~~~l 191 (532) T protein:vir:94 124 AA---DKATRITQKLEQY---------NVRTLVRTVVIHDQAYGGAHVFPHLKMDGDSVPADAPLLLSPSFVQRGCLIGF 191 (532) T ss_pred ch---HHHHHHHHHHHhh---------hHHHHHHHHHHhhhcccceEEEEEeccCCccccccccccccccccccceeeEE Confidence 12 2334444444333 223334444555567887777765431 223455789 Q ss_pred EeeCCCceEEeeCCC----CcccCCceeEEEEeCCceEEEechhHeeeecccCcCCCCCCC---ccccHHHHHHHHHHHH Q lcl|NC_019511. 199 IAVDPSTIFYATDKN----GKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSDLNSSG---YGLSEVEIAMKEFIAY 271 (330) Q Consensus 199 ~pldp~tV~~~~d~~----G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~~~~~---yGlSPIe~a~~~I~~~ 271 (330) .++||..|.+..... .-.+-.+ .|+++..+. .+-++.|+|+..++..+..... +|.|-++.+...|... T Consensus 192 ~vld~~~v~p~~~~~~dp~sp~fg~P-~~y~v~~g~---~iH~SRli~f~g~~~p~~~~~~~~~~G~Svlq~~~~~l~~~ 267 (532) T protein:vir:94 192 ATIEPMWLSPNAYNATDPTLPSFYKP-DSWIATSGK---KIHSSRIHTVVGRPVGDMLKAAYSFRGVSISQLAMPYVDNW 267 (532) T ss_pred EeechheecccccccccccccccCCc-eeEEEccCe---eeccceEEEecCCCchhhhccccccccccHHHHHHHHHHHH Confidence 999998877642211 1111111 234443332 4567889999888877764333 4999999999999999 Q ss_pred HHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 272 NNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 272 laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) ..+....+.+...-. ..+ +.+.....++.+..+.+.+.+.....+-+|.+ ++++. T Consensus 268 ~~t~~~~~~l~~~~~-~~v-~k~~~a~~ls~~~~~~~~~r~~~~~~~~~n~g--~~~id 322 (532) T protein:vir:94 268 LRTRQSVSDTVKQFS-MTN-LATDMAQLLAPGGAQSLDARLQLFNLYRDNRN--IGALD 322 (532) T ss_pred HHHHHHHHHHHHhcC-Cce-eeechHHhhcchhHHHHHHHHHHHHhhcCCcc--ceEEc Confidence 888887777544433 322 33332234566667777777765545544443 33332 No 115 >protein:vir:79511 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468055;genbank:gi:157265497;genbank:GeneID:5600628 Probab=98.58 E-value=1.2e-07 Score=58.44 Aligned_cols=273 Identities=12% Similarity=0.075 Sum_probs=138.7 Q ss_pred HHhcCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCC-CcccchHHHHH Q lcl|NC_019511. 8 LRLGSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKK-SYMRNAHNLHE 86 (330) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~-s~~r~~~~~~~ 86 (330) |-|.++.++. ..|+....+. +. .....|.......-....+ ..... +-||.. .+.+ T Consensus 1 m~k~~~k~~~----~~~~~~~~~~------~~-------~~~~~~~~~~~~~~~~~g~-----~~~~~~~iLr~~-~~~~ 57 (448) T protein:vir:79 1 MAKRGRKPKE----LVPGPGSIDP------SD-------VPKLEGASVPVMSTSYDVV-----VDREFDELLQGK-DGLL 57 (448) T ss_pred CCCCCCCCcc----ccCccccccc------cc-------chhhhhhhhhhcccccccc-----cccchhHhhccc-cchH Confidence 2222222111 2233221110 00 0001111111111000000 00010 113321 1123 Q ss_pred HHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHH Q lcl|NC_019511. 87 VLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQE 166 (330) Q Consensus 87 ~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~ 166 (330) +.+.+.+.+-|.+|+.+|+.-|. ++.|.|.+.+. +.++.+....+..+|... +.-..+.+|.+ T Consensus 58 ly~~m~~D~hi~s~l~~Rk~av~-----------~~~w~v~p~~~----~~~~~~~ae~v~~~l~~~--~~~~~~~~f~~ 120 (448) T protein:vir:79 58 VYHKMLSDGTVKNALNYIFGRIR-----------SAKWYVEPAST----DPEDIAIAAFIHAQLGID--DASVGKYPFGR 120 (448) T ss_pred HHHHHhhChHHHHHHHHHHHHHh-----------cCCceEecCCC----CHHHHHHHHHHHHHhhhh--hhhhccCCHHH Confidence 33333346789999999999997 46888875432 234444444454444321 11123456888 Q ss_pred HHHHHHHHHHhcCCceeEEEEecCCCc--ceEEEEeeCCCceEE-eeCCCCcccCCceeEEEEeC---C----ceEEEec Q lcl|NC_019511. 167 FCKKIVRDTYTYDQVNFEKVFSPKNKT--KMEKFIAVDPSTIFY-ATDKNGKIIKGGNRFVQVID---K----QVVASFT 236 (330) Q Consensus 167 fl~~~v~d~L~~g~g~~~~v~~rd~~G--~~~~L~pldp~tV~~-~~d~~G~~~~~~~~Y~q~~~---~----~~~~~~~ 236 (330) ++..++ |.+.+|-..+|+++.+...| .+..|.+.++.++.- ..+.+|. .++....+ + .....++ T Consensus 121 ~~~~~l-da~~~G~s~~Eivw~~~~~g~~~~~~l~~r~~~~~~~f~~~~d~~-----l~~~~~~~~~~~~~~~~~~~~lP 194 (448) T protein:vir:79 121 LFAIYE-NAYIYGMAAGEIVLTLGADGKLILDKIVPIHPFNIDEVLYDEEGG-----PKALKLSGEVKGGSQFVSGLEIP 194 (448) T ss_pred HHHHHH-HhhhhcceeEEEEeeecCCCceecccccccCCccccceeeecCCc-----eEEeecCCcccccccCCCccccc Confidence 887765 67789999999999754334 455677777765431 2222222 22222111 0 0111234 Q ss_pred hhHeeeecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHh Q lcl|NC_019511. 237 SRELVMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSF 316 (330) Q Consensus 237 ~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~ 316 (330) ..-++|.++ +.++ .+||.+.+..|....-.-....++-+.|-..-+.|-=+-.++.+...++++++.+.+...+.. T Consensus 195 ~~~~i~~~~-~~~g---~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vgky~~ga~~~~~~~~~l~~av~~i~ 270 (448) T protein:vir:79 195 IWKTVVFLH-NDDG---SFTGQSALRAAVPHWLAKRALILLINHGLERFMIGVPTLTIPKSVRQGTKQWEAAKEIVKNFV 270 (448) T ss_pred cceEEEEec-CccC---CcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCceEEEecCCCCCcCHHHHHHHHHHHHHHh Confidence 455676543 3333 578999999999998888888888888888877787666676554455677788877776654 Q ss_pred cCcccccccce-----eeC Q lcl|NC_019511. 317 SGINGSWQICL-----YIK 330 (330) Q Consensus 317 ~G~~na~kvpv-----L~e 330 (330) .|...++=+|- ++| T Consensus 271 ~g~~a~~iiP~~~~ie~~e 289 (448) T protein:vir:79 271 QKPRHGIILPDDWKFDTVD 289 (448) T ss_pred cCCceEEEecCCceEEEEe Confidence 45322211221 112 No 116 >protein:vir:1986 Length: 512 # NCBI annotation: Hypothetical protein # Family: family:all:313 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050633;genbank:gi:9633520;genbank:GeneID:2636304 Probab=98.51 E-value=4e-07 Score=55.68 Aligned_cols=250 Identities=10% Similarity=0.024 Sum_probs=136.1 Q ss_pred ccccCccCcchhHHHHHHHHHHHHhhcccchhccccchh---cccccccc-ccCCCCCcCCCcccchHHHHHHHHHHh-- Q lcl|NC_019511. 19 EDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAY---AEPFLEMM-DTNPDYRDKKSYMRNAHNLHEVLKKFG-- 92 (330) Q Consensus 19 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~---~~~~~~~~-~~~p~~~~~~s~~r~~~~~~~~Lr~~a-- 92 (330) +..+++. .|+.-.. .++....+ .++-.|..-|+.-=+.+.....||... T Consensus 1 m~~~~d~-------------------------~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~iL~~a~~g 55 (512) T protein:vir:19 1 MGRILDI-------------------------SGQPFDFDDEMQSRSDELAMVMKRTQEHPSSGVTPNRAAQMLRDAERG 55 (512) T ss_pred CcceeCC-------------------------CCCccccccccccccchhcccchhhccccccCCCHHHHHHHHHHhhCC Confidence 1111111 2221111 11110000 000111111110000011111222111 Q ss_pred --------------hcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCC Q lcl|NC_019511. 93 --------------NNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKD 158 (330) Q Consensus 93 --------------~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~p 158 (330) +-+-+.+|+++|+.-|. ++.|.|.+-+. .+..+.+....++.+|... T Consensus 56 d~~~~~~L~~dm~~~D~hi~s~l~~Rk~av~-----------~~~w~I~p~~~---~~~~~~~~a~~v~~~l~~~----- 116 (512) T protein:vir:19 56 DLTAQADLAFDMEEKDTHLFSELSKRRLAIQ-----------ALEWRIAPARD---ASAQEKKDADMLNEYLHDA----- 116 (512) T ss_pred CHHHHHHHHHHHHhhChHHHHHHHHHHHHHh-----------CCCceEecCCC---CCHHHHHHHHHHHHHHhcC----- Confidence 24568899999999987 46788876422 2334444555566655322 Q ss_pred CCcCCHHHHHHHHHHHHHhcCCceeEEEEecC-CCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEech Q lcl|NC_019511. 159 IDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPK-NKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTS 237 (330) Q Consensus 159 n~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd-~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~ 237 (330) .+|.+++..++ |.+.+|-..+|++|..+ +...|..+.+++|..+....+..++ +++.. ++..-..+.+ T Consensus 117 ---~~f~~~~~~ll-dA~~~G~s~~Ei~w~~~~g~~~~~~~~~r~~~~f~~~~~~~~~-----lr~~~--~~~~G~~l~~ 185 (512) T protein:vir:19 117 ---AWFEDALFDAG-DAILKGYSMQEIEWGWLGKMRVPVALHHRDPALFCANPDNLNE-----LRLRD--ASYHGLELQP 185 (512) T ss_pred ---CCHHHHHHHHH-hhhhhcceeeeeEeeeeCCceeeeeeeeeccccceeccCCCcE-----EEecC--CCCCceeecC Confidence 24778887765 67779999999998654 4467888999998877653322221 22322 2111223555 Q ss_pred hHeeeecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhc Q lcl|NC_019511. 238 RELVMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFS 317 (330) Q Consensus 238 ~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~ 317 (330) ...++.++.+.++ .+||.+.+..|......-..+.++-+.|-..-+.|--+-.++ ...+++++++|.+.+.+..+ T Consensus 186 ~k~i~~~~~~~~g---~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~--~~a~~~ek~~L~~al~~~~~ 260 (512) T protein:vir:19 186 FGWFMHRAKSRTG---YVGTNGLVRTLIWPFIFKNYSVRDFAEFLEIYGLPMRVGKYP--TGSTNREKATLMQAVMDIGR 260 (512) T ss_pred CceEEEeccCCCC---CcccccHHHHHHHHHHHHHHHHHHHHHHHHHcCCCeeEEecC--CCCCHHHHHHHHHHHHHHhh Confidence 5555544444443 678999999999999888888888888888878885444444 34688899999988887643 Q ss_pred Ccccccccce-----eeC Q lcl|NC_019511. 318 GINGSWQICL-----YIK 330 (330) Q Consensus 318 G~~na~kvpv-----L~e 330 (330) + .+.=+|- ++| T Consensus 261 ~--a~~iiP~~~~ie~~e 276 (512) T protein:vir:19 261 R--AGGIIPMGMTLDFQS 276 (512) T ss_pred C--cEEEecCCceEEEee Confidence 2 2222232 222 No 117 >protein:vir:77981 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467939;genbank:gi:157265380;genbank:GeneID:5600471 Probab=98.51 E-value=5.1e-08 Score=60.57 Aligned_cols=274 Identities=12% Similarity=0.077 Sum_probs=139.1 Q ss_pred HHhcCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHH Q lcl|NC_019511. 8 LRLGSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEV 87 (330) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~ 87 (330) |-|.++.+|. ++|....++. ++...... .+...-..|+......+ .-+-||... ..++ T Consensus 1 m~kk~~k~~~----~~~~~~~~~~------~~~~~~~~------~~~~~~~~~~~g~~~~~-----~~~iLr~~~-~~~l 58 (448) T protein:vir:77 1 MAKRGRKPKE----LVPGPGSIDP------SDVPKLEG------ASVPVMSTSYDVVVDRE-----FDELLQGKD-GLLV 58 (448) T ss_pred CCCCCCCCcc----cCCcccccch------hhhhhhcc------chhhhcccccccccccc-----hhHhhcccc-chHH Confidence 3222222222 2222111100 01001111 11111111111100000 001122211 1233 Q ss_pred HHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHH Q lcl|NC_019511. 88 LKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEF 167 (330) Q Consensus 88 Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~f 167 (330) .+...+.+-|.+|+++|+.-|. ++.|.|.+.+. +..+.+....+..++.. .+....+.+|.++ T Consensus 59 y~~m~~D~hi~s~l~~Rk~av~-----------~~~w~v~p~~~----~~~d~~~ae~v~~~l~~--~~~~~~~~~f~~~ 121 (448) T protein:vir:77 59 YHKMLSDGTVKNALNYIFGRIR-----------SAKWYVEPAST----DPEDIAIAAFIHAQLGI--DDASVGKYPFGRL 121 (448) T ss_pred HHHHhhChHHHHHHHHHHHHHh-----------cCCceEecCCC----CHHHHHHHHHHHHHhhc--hhhhhccCCHHHH Confidence 3333346789999999999997 46788875432 22333333344444321 1222345578888 Q ss_pred HHHHHHHHHhcCCceeEEEEecCCCc--ceEEEEeeCCCceEE-eeCCCCcccCCceeEEEEeCC---c----eEEEech Q lcl|NC_019511. 168 CKKIVRDTYTYDQVNFEKVFSPKNKT--KMEKFIAVDPSTIFY-ATDKNGKIIKGGNRFVQVIDK---Q----VVASFTS 237 (330) Q Consensus 168 l~~~v~d~L~~g~g~~~~v~~rd~~G--~~~~L~pldp~tV~~-~~d~~G~~~~~~~~Y~q~~~~---~----~~~~~~~ 237 (330) +..++ |.+.+|-..+|++|.+...| .+..|.+.++.++.- ..+.+|. .++....+. . ....++. T Consensus 122 i~~~l-da~~~G~s~~Eivw~~~~dg~~~~~~l~~r~~~~~~~f~~~~~~~-----l~~~~~~~~~~~~~~~~~~~~lP~ 195 (448) T protein:vir:77 122 FAIYE-NAYIYGMAAGEIVLTLGADGKLILDKIVPIHPFNIDEVLYDEEGG-----PKALKLSGEVKGGSQFVNGLEIPI 195 (448) T ss_pred HHHHH-HhhhhcceeEEEEEeecCCCceeeccccccCCCccceeeeecCCc-----eEEEecCCcccccccCCCcccccc Confidence 88875 78889999999999764334 455677777765432 2222222 233222111 0 1112345 Q ss_pred hHeeeecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhc Q lcl|NC_019511. 238 RELVMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFS 317 (330) Q Consensus 238 ~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~ 317 (330) .-++|.++ +.++ .+||.|.+..|....-.-....++-+.|-..-+.|-=+..++.+..-+++.++.+.+...+... T Consensus 196 ~~~i~~~~-~~~g---~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vgky~~ga~~~~~~~~~l~~av~~i~~ 271 (448) T protein:vir:77 196 WKTVVFLH-NDDG---SFTGQSALRAAVPHWLAKRALILLINHGLERFMIGVPTLTIPKSVRQGTKQWEAAKEIVKNFVQ 271 (448) T ss_pred ceEEEEec-CCcC---CcccchHHHHHHHHHHHHHhhHHHHHHHHHHcCCceeEEecCCCCCCCHHHHHHHHHHHHHHhc Confidence 56677643 3332 5789999999999888888888888888887777876666665444566777788777666444 Q ss_pred CcccccccceeeC Q lcl|NC_019511. 318 GINGSWQICLYIK 330 (330) Q Consensus 318 G~~na~kvpvL~e 330 (330) |...++=+|-=.| T Consensus 272 g~~a~~iiP~g~~ 284 (448) T protein:vir:77 272 KPRHGIILPDDWK 284 (448) T ss_pred CCceEEEecCCce Confidence 5322222221111 No 118 >protein:vir:99563 Length: 862 # NCBI annotation: minor head protein-like protein # Family: family:all:297 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039808;genbank:gi:126011058;genbank:GeneID:4818258 Probab=98.43 E-value=3.3e-07 Score=56.08 Aligned_cols=296 Identities=11% Similarity=0.078 Sum_probs=129.8 Q ss_pred CchhHHHH---HhcCCC-CCCcccccCccCc----chhHHHHHHHHHH--HHhhcccchhccccchhccccc----cccc Q lcl|NC_019511. 1 MPDLFKSL---RLGSMY-KEDTEDLMVPIDD----GIQANIRQIEQDT--KEMQEITKSLYGKQQAYAEPFL----EMMD 66 (330) Q Consensus 1 ~~~~~~~~---~~~~~~-~~~~~~~~~~~~~----~~~~~~~~~~~~~--~~~~~~~~~~~g~~~~~~~~~~----~~~~ 66 (330) -.|=...| +++-.. ++.+-+..-+..+ .++.....+++.. +.+..+..+..+-......+.+ ..+. T Consensus 32 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~a~~~~~~~~~~~~~Dgl~n~~~~lG 111 (862) T protein:vir:99 32 RHDPLDPLARTRQNWPVQKEKPNPIIRSVKDFPFVEISDSVNAKSVSGKNFAMDSAVRSAIKAITGFAMDDGGGAPVPIG 111 (862) T ss_pred ccCccchHHhhcccCCcccccCCCCCCcccccccccccccccchhhhhhhhcchhhcchhhhhhhhhhhhcchhhhhhcc Confidence 11222222 222212 1211111111111 1111222222211 1111111111111111111111 1111 Q ss_pred c----CCCCCcCCC---cccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhh Q lcl|NC_019511. 67 T----NPDYRDKKS---YMRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKE 139 (330) Q Consensus 67 ~----~p~~~~~~s---~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~ 139 (330) . ++....... +....-.++.++..|+.++++++||++++++..+ -||++.-.+.+.+..+ T Consensus 112 ~~~~~s~y~~~~~~~~~~~~~~f~gyql~alY~~~~larkiVd~pAeDatR-----------~g~~I~~~~d~~e~~~-- 178 (862) T protein:vir:99 112 AEGKQSSYAVPEALQDWYLSQGFIGHQACALIAQHWLVDKACSLAGEDAIR-----------NGWHLKSLGEGEEIDE-- 178 (862) T ss_pred ccccccccccchhccccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhh-----------CCceEeecCcccccCH-- Confidence 1 111000000 0001123467788899999999999999999864 2556654322222222 Q ss_pred HHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEE-ecC-------------CCcceEEEEeeCCCc Q lcl|NC_019511. 140 KEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVF-SPK-------------NKTKMEKFIAVDPST 205 (330) Q Consensus 140 ~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~-~rd-------------~~G~~~~L~pldp~t 205 (330) +.+++++..+.++.. +.-|.++ ++..-++|.+.+.++. ..| ..|.+.+|..+||.. T Consensus 179 -e~~~~ie~~~~rL~v--------~~~l~ea-ir~~RLyGga~ililv~~~D~~~LsqPLn~e~I~kG~lkgl~vlDp~w 248 (862) T protein:vir:99 179 -ESLEKFKAIDVEFKV--------KENLIEF-NRFKNVFGIRVAIFVVDSEDPDYYEKPFNPDGITPGSYRGISQIDPYW 248 (862) T ss_pred -HHHHHHHHHHHHhhH--------HHHHHHH-HHhcccccceEEEEEecCcCchhhhcCcCcccccccceeEEEEechhh Confidence 334555555554411 1223333 3333346655555443 233 234556777788765 Q ss_pred eEEee------CCCCcccCCceeEEEEeCCceEEEechhHeeeecccCcCCCCCCC---ccccHHHHHHHHHHHHHHHHH Q lcl|NC_019511. 206 IFYAT------DKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSDLNSSG---YGLSEVEIAMKEFIAYNNTES 276 (330) Q Consensus 206 V~~~~------d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~~~~~---yGlSPIe~a~~~I~~~laae~ 276 (330) +.+.. |...-.+-.+. +++ +.+. .+-++-|+++...+..++.... +|+|.++.+...|.....+.. T Consensus 249 ~~p~~v~~~~~Dp~sp~yGkP~-~y~-I~g~---~IH~SRliif~g~~vpd~lk~ay~f~G~SvLe~iyd~L~~~d~t~~ 323 (862) T protein:vir:99 249 MMPMLTAESTADPSSQFFYEPE-FWI-ISGQ---KYHRSHLIIARGPQPADILKPTYIFGGIPLVQRIYERVYAAERTAN 323 (862) T ss_pred hcccccccccccccccccCCce-eee-ecCe---eeccceeEEecCCCchhhhhccCCccCccHHHHHHHHHHHHHHHHH Confidence 55421 11111111122 222 3343 3456677777666665543333 499999999999999999999 Q ss_pred HHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 277 FNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 277 ~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) ..+.++.+.... + +.+++...+..+ +.+.+.......+-+|.+ + +++. T Consensus 324 saa~Ll~ka~l~-v-~ktd~l~~l~~e--d~l~~r~~~~~~~rdN~G-i-~liD 371 (862) T protein:vir:99 324 EAPLLAMNKRTT-A-IHTDTAKAIANE--DKFIQRLMFWVRYRDNHA-V-KVLG 371 (862) T ss_pred HHHHHHHHhccc-e-eechhHhhhccH--HHHHHHHHHHHhccCcce-e-EEec Confidence 998888875533 2 344443333322 233333333333434443 3 4444 No 119 >protein:vir:95254 Length: 488 # NCBI annotation: Phage conserved protein # Family: family:all:2372 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944885;genbank:gi:158267601;genbank:GeneID:2744039 Probab=98.35 E-value=5.7e-07 Score=54.82 Aligned_cols=264 Identities=12% Similarity=0.093 Sum_probs=120.6 Q ss_pred hhcccchhccccchhcccccc---ccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecc Q lcl|NC_019511. 43 MQEITKSLYGKQQAYAEPFLE---MMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSE 119 (330) Q Consensus 43 ~~~~~~~~~g~~~~~~~~~~~---~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~ 119 (330) |...+....|=++...-.+.. ....-+.+..+-.-||. ....++.+...+.+-|.+|+..|+.-|. T Consensus 1 ~~~~~~~~~gl~p~rl~~i~~~~~~~~~~~~~~~~~~~Lr~-~~~~~ly~~m~~D~hi~s~l~~Rk~av~---------- 69 (488) T protein:vir:95 1 MADITETQESLPPFRMGEVGSLGLKVKNGRIYEEPRQALRF-PESIKTFQLMMRDPAVAASVNIIKMFVR---------- 69 (488) T ss_pred CCCccccCCCCCHHHHHHHHHHhhccccchhhccchhhhcc-cchHHHHHHHhhChHHHHHHHHHHHHHh---------- Confidence 323333333333221100000 00001112222222332 1122333333346789999999999997 Q ss_pred cccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecC--------- Q lcl|NC_019511. 120 KGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPK--------- 190 (330) Q Consensus 120 ~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd--------- 190 (330) ++.|.|.+.+.+ ....++.+....+..++..+ ..+|.+++..++ |.+.+|-..+|++|.++ T Consensus 70 -~~~w~v~p~~~~-~~d~~~~~~a~~v~~~l~~~-------~~~~~~~i~~~l-da~~~G~s~~Eivw~~~~~~~~~~~~ 139 (488) T protein:vir:95 70 -KVNWRFVPPKGK-EQDPKMLERADFFNSLMDDM-------EHDWADFINSVM-SFCTYGFCVNEKVYKKRQGKKGKYQS 139 (488) T ss_pred -cCCceEecCCCC-chhHHHHHHHHHHHHHHhcc-------CccHHHHHHHHH-Hhhcccceeeeeeeeccccccccccc Confidence 468888764322 12223333344444444221 235777777776 67788999999999763 Q ss_pred ----CCcceEEEEeeCCCceE-EeeCCCCcccCC-ceeEEEE---------eCCceEEEechhHeeeecccCcCCCCCCC Q lcl|NC_019511. 191 ----NKTKMEKFIAVDPSTIF-YATDKNGKIIKG-GNRFVQV---------IDKQVVASFTSRELVMGIRNPRSDLNSSG 255 (330) Q Consensus 191 ----~~G~~~~L~pldp~tV~-~~~d~~G~~~~~-~~~Y~q~---------~~~~~~~~~~~~dvih~~~n~~~d~~~~~ 255 (330) +.-.|..|.+.++.++. ...+.+|..... .....+. ........++....++.++...++ .+ T Consensus 140 ~~~dg~~~~~~i~~Rpq~~~~~f~~d~d~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~lP~~kfi~~~~~~~~g---~p 216 (488) T protein:vir:95 140 KFDDGLIGWAKLPIRNQSTLDKWYFDEDFRRVTGVRQNLRNVSHIAGAINLGERPLTRKLPRAKFMLFKYDDEYG---NP 216 (488) T ss_pred cccCCeeeeeeeeecCcccccceeeccCCCceeecccccccccccccccccccccccccccccceEEEeecCCCC---cc Confidence 23345666666665432 222333322100 0000000 000111234444544444444433 67 Q ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCC--CCCCHHHHHHHHHHHHHHhc---Cccccc-ccceee Q lcl|NC_019511. 256 YGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRAD--QQQSQHALENFKREWKSSFS---GINGSW-QICLYI 329 (330) Q Consensus 256 yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~--~~ls~e~~e~lr~~w~~~~~---G~~na~-kvpvL~ 329 (330) ||.+.+..|....-.=....++-+.|-..-+.|-=+...|.. ...+++..+.+.+...+... +...++ =+|.-+ T Consensus 217 ~g~gLlr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~p~~~~~~~~~~e~~~l~~a~~~i~~~~~~~~~ag~iiP~g~ 296 (488) T protein:vir:95 217 EGRSPLLNAYVPWKYKVQIEEYEAVGVSRDLVGMPKIGLPPDYLDENAEPEKKAFVQYCKTVVNDMIANDRAGLIWPRYI 296 (488) T ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeEeeccCCCCCcccHHHHHHHHHHHHHHHHhhccchhheeecccc Confidence 899999999888877677777767766543333222333321 11334444444444433211 101111 112111 Q ss_pred C Q lcl|NC_019511. 330 K 330 (330) Q Consensus 330 e 330 (330) + T Consensus 297 ~ 297 (488) T protein:vir:95 297 D 297 (488) T ss_pred c Confidence 1 No 120 >protein:vir:79233 Length: 526 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469155;genbank:gi:157834998;genbank:GeneID:5648814 Probab=98.34 E-value=1.2e-06 Score=53.01 Aligned_cols=261 Identities=11% Similarity=0.064 Sum_probs=138.1 Q ss_pred ccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhcc-ccccccccCCCCCcC-C-Ccccch-----HHHHHHHHH Q lcl|NC_019511. 19 EDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAE-PFLEMMDTNPDYRDK-K-SYMRNA-----HNLHEVLKK 90 (330) Q Consensus 19 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~-~~~~~~~~~p~~~~~-~-s~~r~~-----~~~~~~Lr~ 90 (330) +..+++..... +....+.+. .+..+.|....+.. | ..+..|. . +-||.. ...+++.+. T Consensus 1 ~~~~~d~~g~p-~~~~~~~~~------~~~~~~~~~~~~~~~~-------~~gltp~~l~~il~~a~~gd~~~~~~L~ed 66 (526) T protein:vir:79 1 MAQIVDVYGNP-IRPQQLREP------QTSRLAGLAKEFAQHP-------AKGLTPAKLARILVEAEQGNLQAQAELFMD 66 (526) T ss_pred CCeeeCCCCCc-cCccccchh------hhhhhhhhhhhcccCC-------CCCcCHHHHHHHHHHhhCCCHHHHHHHHHH Confidence 22222222111 111111111 01112222221111 1 0011110 0 111211 111222222 Q ss_pred Hh-hcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHH Q lcl|NC_019511. 91 FG-NNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCK 169 (330) Q Consensus 91 ~a-~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~ 169 (330) .- +-+-|.+|+.+|+.-|. ++.|.|.+-+.+ +..+.+....++.+|... .+|.+++. T Consensus 67 m~e~D~~i~s~l~~Rk~av~-----------~~~w~I~p~~~~---~~~~~~~a~~v~~~l~~~--------~~~~~~i~ 124 (526) T protein:vir:79 67 MEERDAHLFAEMSKRKRAIL-----------GLDWAVEPPRNA---SAAEKADADYLHELLLDL--------EGLEDLLL 124 (526) T ss_pred HHhhChHHHHHHHHHHHHHh-----------CCCceEecCCCC---ChHHHHHHHHHHHHHhcc--------cCHHHHHH Confidence 22 24679999999999997 467888764322 223344455566655322 24777777 Q ss_pred HHHHHHHhcCCceeEEEEecCCC-cceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEEEechhHeeeecccCc Q lcl|NC_019511. 170 KIVRDTYTYDQVNFEKVFSPKNK-TKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPR 248 (330) Q Consensus 170 ~~v~d~L~~g~g~~~~v~~rd~~-G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~ 248 (330) .++ |.+.+|-...|++|..+++ -.|..|.+.+|.......+..+ +..+..++..-..+.+...+..++.+. T Consensus 125 ~~l-dA~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~F~~~~~~~~-------~l~~~~~~~~g~~l~~~k~iv~~~~~~ 196 (526) T protein:vir:79 125 DAL-DGIGHGYSCIELEWALQGREWMPLAFHHRPQSWFQLNPEDQN-------ELRLRDNSPAGEALQPFGWIIHRPRAR 196 (526) T ss_pred HHH-hhhhhcceeEEEEEeecCCceeEEEeeeecccceEeccCCCc-------EEEecCCCCCceeecCCceEEEeecCC Confidence 765 5667999999999988643 4777899998877664322211 122212222223455554444344444 Q ss_pred CCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCccccccccee Q lcl|NC_019511. 249 SDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLY 328 (330) Q Consensus 249 ~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL 328 (330) ++ .+||.+.+..|....-.-..+.++-+.|-..-+.|-=+-.++ ...+++++++|.+...+..++ .++=+|-= T Consensus 197 ~g---~p~g~gLlr~~~w~~~fK~~~~~~w~~F~E~yG~P~~igky~--~~a~~~ek~~L~~av~~i~~d--a~~iiP~~ 269 (526) T protein:vir:79 197 SG---YVARSGLFRVLAWPYLFRHYATSDLAEMLEIYGLPIRLGKYP--PGTADEEKATLLRAVTGLGHA--AAGIIPET 269 (526) T ss_pred cC---CccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCceEEEecC--CCCCHHHHHHHHHHHHHHhcC--cEEEecCC Confidence 43 578999999999988887778888888888777786555554 346888899998888776432 22222322 Q ss_pred eC Q lcl|NC_019511. 329 IK 330 (330) Q Consensus 329 ~e 330 (330) .| T Consensus 270 ~~ 271 (526) T protein:vir:79 270 MA 271 (526) T ss_pred ce Confidence 22 No 121 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=98.21 E-value=1e-06 Score=53.40 Aligned_cols=241 Identities=11% Similarity=0.129 Sum_probs=115.4 Q ss_pred cCccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHH Q lcl|NC_019511. 22 MVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAII 101 (330) Q Consensus 22 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I 101 (330) ++..|+ |..-+.++.+-...+ ..+.. . ..+..+..|+.++++++|| T Consensus 1 ~~~~D~-----------------------------~~n~~~gg~~~~~~~-~~~~~-~---~~~~l~a~Y~~~~l~~~~V 46 (422) T protein:vir:10 1 MVKTDS-----------------------------YANIFLGGSDGSEIY-GSLQN-Q---APTILASLYADNALVRRII 46 (422) T ss_pred Cccchh-----------------------------hHHHHcCCCCCcccc-Ccccc-c---CHHHHHHHHHhChhhHHHH Confidence 111111 111111111000001 11111 1 2345677788899999999 Q ss_pred HHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCc Q lcl|NC_019511. 102 ITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQV 181 (330) Q Consensus 102 ~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g 181 (330) ++.+++..+ -||+|.-.+ +..+ ++.-+.++ ...+-+...++-..++|.+ T Consensus 47 d~~aed~~r-----------~g~~i~~~~--------~~~~---~~~~~~~l---------~~~~~l~~a~~~~rl~G~a 95 (422) T protein:vir:10 47 DTIPETALA-----------AGFHIDGID--------DEPA---FWSRWDDL---------EMTQNINDAWSWARLFGGA 95 (422) T ss_pred hhhhHHHhc-----------CCccccCCC--------HHHH---HHHHHHHh---------hHHHHHHHHHHhhccccce Confidence 999999863 255553211 1111 11222222 2233344445556678877 Q ss_pred eeEEEEecC--------CCcceEEEEeeCCCceEEeeCC---CCcccCCceeEEEEeCC--ceEEEechhHeeeecccCc Q lcl|NC_019511. 182 NFEKVFSPK--------NKTKMEKFIAVDPSTIFYATDK---NGKIIKGGNRFVQVIDK--QVVASFTSRELVMGIRNPR 248 (330) Q Consensus 182 ~~~~v~~rd--------~~G~~~~L~pldp~tV~~~~d~---~G~~~~~~~~Y~q~~~~--~~~~~~~~~dvih~~~n~~ 248 (330) ++.+....+ ..|....|.++|+..|.+..-. ..-.+-.+..| ++..+ +....+-++.|+|+...+. T Consensus 96 ~i~i~v~d~~~~~~Pl~~~g~~~~l~v~d~~~i~~~~~~~dp~s~~fg~P~~y-~v~~~~~~~~~~iH~SRli~~~g~~~ 174 (422) T protein:vir:10 96 AIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQTREENPRNARFGEPLTY-RITTNESDMFYDVHYSRIHIIDGERI 174 (422) T ss_pred EEEEEecCCCCccccccccCceeeEEeeccccccchhcccCccccccCcceEE-EEecCCCCcceeeccceeEEeCCCCc Confidence 777765311 2456678888998877654211 11111122333 33322 2223455667888765554 Q ss_pred CCC---CCCCccccHHHH-HHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCC-C-CHHHHHHHHHHHHHHhcCcccc Q lcl|NC_019511. 249 SDL---NSSGYGLSEVEI-AMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQ-Q-SQHALENFKREWKSSFSGINGS 322 (330) Q Consensus 249 ~d~---~~~~yGlSPIe~-a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~-l-s~e~~e~lr~~w~~~~~G~~na 322 (330) .+. ...++|.||++. +.+.|.....+....+.+|...... ++.+++-.. + +.+....++..++....+-++- T Consensus 175 p~~~~~~~~~~G~S~l~~~~~~~i~~~~~~~~~~~~l~~~~~~~--v~~~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~ 252 (422) T protein:vir:10 175 PNVMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLKRKQQA--VWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVG 252 (422) T ss_pred hhhhcccCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhccc--cccchhHHHhcCCccchHHHHHHHHHHHHhcCCc Confidence 442 334469999985 7899999888888888766554322 233332111 1 1222333333333333222233 Q ss_pred cccceeeC Q lcl|NC_019511. 323 WQICLYIK 330 (330) Q Consensus 323 ~kvpvL~e 330 (330) +-+.|.-+ T Consensus 253 ~~~~l~~~ 260 (422) T protein:vir:10 253 QAIGIDAE 260 (422) T ss_pred cceeEecC Confidence 33333222 No 122 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=98.18 E-value=5.2e-07 Score=55.03 Aligned_cols=242 Identities=10% Similarity=0.097 Sum_probs=117.0 Q ss_pred HHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhh Q lcl|NC_019511. 33 IRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYC 112 (330) Q Consensus 33 ~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~ 112 (330) ++.+..+... +...|...... + +++ .....++.+..|+.++++++||.+.+++..+ T Consensus 1 ~~~~~~d~~~-----~~~~~~~~~~~---------~-----~~~---~~~~~~~l~a~Y~~~~l~~~~Vd~~aed~~r-- 56 (427) T protein:vir:10 1 MKIVKHDGYN-----DIFNGGADGSP---------K-----PFF---MSDASYHVGSFYNDNATAKRIVDVIPEEMVT-- 56 (427) T ss_pred CCccccchHH-----HHhhcCCCCcc---------c-----Ccc---ccCchHHHHHHHHcCchhhhhhccchHHhhc-- Confidence 2222222111 11111111110 1 111 1223567788899999999999999999864 Q ss_pred hhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEec--- Q lcl|NC_019511. 113 KPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSP--- 189 (330) Q Consensus 113 ~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~r--- 189 (330) -||+|.-. .+.++ ++.-+.++ ...+-+...++-..++|.+++.+...- T Consensus 57 ---------~g~~i~g~--------~~~~~---~~~~~~~l---------~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~ 107 (427) T protein:vir:10 57 ---------AGFKMSGV--------KDEKE---FKSLWDSY---------KLDSSLVDLLCWARLYGGAAMVAIIKDNRM 107 (427) T ss_pred ---------CCccccCc--------cHHHH---HHHHHHHh---------hHHHHHHHHHHhccccceeEEEEEecCCCc Confidence 25555321 11122 22222222 223334444555566777777664422 Q ss_pred -----CCCcceEEEEeeCCCceEEeeCC---CCcccCCceeEEEEeCCc--eEEEechhHeeeecccCcCCC---CCCCc Q lcl|NC_019511. 190 -----KNKTKMEKFIAVDPSTIFYATDK---NGKIIKGGNRFVQVIDKQ--VVASFTSRELVMGIRNPRSDL---NSSGY 256 (330) Q Consensus 190 -----d~~G~~~~L~pldp~tV~~~~d~---~G~~~~~~~~Y~q~~~~~--~~~~~~~~dvih~~~n~~~d~---~~~~y 256 (330) +..|.+..|.++|+..|.+..-. ..-.+-.+ .++++..++ ....+-++.|+|+...+..+. ..+.+ T Consensus 108 l~~p~~~~g~l~~l~v~d~~~~~~~~~~~dp~s~~fg~P-~~y~v~~~~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~ 186 (427) T protein:vir:10 108 LTSQAKPGAKLEGVRVYDRFAITVEKRVTNARSPRYGEP-EIYKVSPGDNMQPYLIHHSRVFIADGERVAQQARKQNQGW 186 (427) T ss_pred cccccCCCcceeEEEEechhcccccccccCccccccCcc-eEEEEecCCCCcceEEccccEEEecCCCchhhhcccCCcc Confidence 34678899999999877653211 11011112 233333322 224566777888865554433 23446 Q ss_pred cccHHH-HHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCC--CHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 257 GLSEVE-IAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQ--SQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 257 GlSPIe-~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~l--s~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) |.||+. .+...|.....+....+..|...... ++.+++-..+ +.+....++..+......-++-+.+.|.=| T Consensus 187 G~S~l~~~~~~~i~~~~~~~~~~~~l~~k~~~~--v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~l~~~ 261 (427) T protein:vir:10 187 GASVLNKSLIDAICDYDYCESLATQILRRKQQA--VWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAE 261 (427) T ss_pred cchhhhHHHHHHHHHHHHHHHHHHHHHHHhccc--cccchhHHHHhcCccchHHHHHHHHHHHHhcCcccceeeecC Confidence 999996 56788888888887777766554322 2333221100 111111222233332222223333333322 No 123 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=98.17 E-value=2.3e-06 Score=51.44 Aligned_cols=275 Identities=11% Similarity=0.017 Sum_probs=123.5 Q ss_pred cCccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHH----HHHh-hcHH Q lcl|NC_019511. 22 MVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVL----KKFG-NNSI 96 (330) Q Consensus 22 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~L----r~~a-~~~i 96 (330) ++-+|.-+.. +.= +...+...+-..+ .+|.-- ..-....+|+...++-.......+.| |.+. +||+ T Consensus 1 mn~~dr~i~~-~sP--~~~~~R~~ar~~~----~~y~aa--~~~r~~~~~~~~~s~~~~~~~~~~~lr~RaRdl~rNn~~ 71 (502) T protein:vir:79 1 MAILDDVIGV-FSP--GWKAARLRSRAVI----QAYEAV--KTTRTHKARRENRTADQLSQYGAVSLREQARYLDNNHDL 71 (502) T ss_pred CchHhhHHhh-cCh--HHHHHHHhhHHHH----hhcccc--CcccccCCCCCCCChHHHHHHHHHHHHHHHHHHHhcChH Confidence 4444443322 100 0000111110000 012111 11111223333333222222223444 4444 3689 Q ss_pred HHHHHHHHHHhHhhhhhhheeccccc-ceeeecc--CCCcccChhhHHHHHHHHHHHHhccCC-CCCCcCCHHHHHHHHH Q lcl|NC_019511. 97 LNAIIITRANQVSTYCKPARYSEKGV-GFEVKLK--DLDATPGIKEKEQMKRIEEFILNTGTD-KDIDRDSFQEFCKKIV 172 (330) Q Consensus 97 v~a~I~~~~d~Ia~~~~~~~~~~~~~-g~~v~~k--d~~~~~~~~~~~~~~~i~~~l~~~~~~-~pn~~~s~~~fl~~~v 172 (330) +..+|+++.+.|- |- |+.+.++ ..+.+..++ -.++|+..+...... ....+.+|..+...++ T Consensus 72 a~~av~~~~~nvV-----------G~ggi~~~~~~~~~~~~~~~~---~~~~ie~~w~~Wa~~~D~~g~~~f~~~q~l~~ 137 (502) T protein:vir:79 72 VIGVFDKLEERVV-----------GKNGIIVEPHPVLRNGAIARD---LAAEIRTRWSEWSVSPEVTGQFTRPMLERLML 137 (502) T ss_pred HHHHHHHHHHhhc-----------cCCceeeeeccCCCChhHHHH---HHHHHHHHHHHhhcCcCccccCCHHHHHHHHH Confidence 9999998887774 22 3333332 222222222 233444444333221 2345678999999999 Q ss_pred HHHHhcCCceeEEEEecC-----CCcceEEEEeeCCCceEEeeCCCCcccC---------CceeEEEEe-C-----CceE Q lcl|NC_019511. 173 RDTYTYDQVNFEKVFSPK-----NKTKMEKFIAVDPSTIFYATDKNGKIIK---------GGNRFVQVI-D-----KQVV 232 (330) Q Consensus 173 ~d~L~~g~g~~~~v~~rd-----~~G~~~~L~pldp~tV~~~~d~~G~~~~---------~~~~Y~q~~-~-----~~~~ 232 (330) +.++.-|..++.+++.+. +.+.++.|-.|+|+.|..- ..+|.... .++.|.... + .... T Consensus 138 r~~~~dGE~f~~~~~~~~~~~~~g~~~~l~lq~iepd~l~~~-~~~~~~i~~GVe~d~~Gr~~aY~i~~~hPgd~~~~~~ 216 (502) T protein:vir:79 138 RTWLRDGEVFAQMVSGRINSLTPSAGVHFWLEALEPDFIPMT-SDESNRLNQGVFVDDWGRPEKYLVYKSRPVSGRQMET 216 (502) T ss_pred HHHHhCCceEEEEeecccCccCCCcccceEEEEecchhcCCC-CCCCCeeEeeeEECCCCceEEEEEeecCCCCCcccce Confidence 999999988888776543 4456788889999887422 12222111 123343221 1 1223 Q ss_pred EEechhHeeeecccCcCCCCCCCccccHHHHHHHHHH---HHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHH Q lcl|NC_019511. 233 ASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKEFI---AYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFK 309 (330) Q Consensus 233 ~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I~---~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr 309 (330) ..+++++|+|+-..-+.+- .-|+|.+..++..+. -...++..+++. .|.-.+++..+.+.....+... T Consensus 217 ~rvpA~~vlH~f~~~r~gQ---~RGis~lapvl~~l~~l~~~~dael~~a~i---~A~~~~fi~~~~~~~~~~~~~~--- 287 (502) T protein:vir:79 217 KEVDAERMLHLKFVRRLHQ---MRGTSLLSGVLIRLSALKEYEDSELTAARI---AAALGMYIRKGDGQSYEPDGNG--- 287 (502) T ss_pred eEechhheEEeecccCCcc---ccCCchHHHHHHHHHHHhHHHHHHHHHHHH---hhhheeeeecCCCcccccccCC--- Confidence 4678999999865433332 238888776665543 344455444433 2334455554322111111000 Q ss_pred HHHHHHhcCcccccc-cceeeC Q lcl|NC_019511. 310 REWKSSFSGINGSWQ-ICLYIK 330 (330) Q Consensus 310 ~~w~~~~~G~~na~k-vpvL~e 330 (330) ..-.......+ .|. ++.|.. T Consensus 288 ~~~~~~~~~l~-pG~i~~~L~p 308 (502) T protein:vir:79 288 SKENERELTIQ-PGIIYDDLKP 308 (502) T ss_pred CCCcccccccc-CCccccccCC Confidence 00000111121 222 344433 No 124 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=98.12 E-value=1.2e-06 Score=52.98 Aligned_cols=249 Identities=11% Similarity=0.056 Sum_probs=119.4 Q ss_pred hhHHHHHHHHHHHHhhcccchhccccchhcccccc--ccc-cCCC-CCcCCCcccchHHHHHHH-HHHhhcHHHHHHHHH Q lcl|NC_019511. 29 IQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLE--MMD-TNPD-YRDKKSYMRNAHNLHEVL-KKFGNNSILNAIIIT 103 (330) Q Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~--~~~-~~p~-~~~~~s~~r~~~~~~~~L-r~~a~~~iv~a~I~~ 103 (330) +.. +++-.++. +.++...|...... .+. .++. |....++- ..++..| ..|+.+.++++||++ T Consensus 1 ~~~-~~~a~~~~---------~~~~a~~~~~~~~~~g~~~~~d~~~~~~~~~~~---~~~~~~l~~lY~~~~l~r~iVd~ 67 (461) T protein:vir:80 1 MYS-IDKAKQAK---------IDSKIVNRNDFMVGHGKANSRDKLTRQTPGNGQ---KLDLKACENLYASNSIAMNIVDI 67 (461) T ss_pred Ccc-chhhhhhh---------hhhhhhhhhHHHhhcCCcchhhhhhccccCccc---ccCHHHHHHHHHhCCccchhhcc Confidence 000 11111110 11111111111100 000 0110 11111110 1134555 455678999999999 Q ss_pred HHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCcee Q lcl|NC_019511. 104 RANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNF 183 (330) Q Consensus 104 ~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~ 183 (330) .+++..+ -||.+.-++ .+..+.++.++.++ ...+-+...++...++|.+++ T Consensus 68 ~a~d~~r-----------~g~~i~~~~---------~~~~~~~~~~~~~l---------~~~~~l~~~~~~~rl~G~a~i 118 (461) T protein:vir:80 68 ISEDMVR-----------AGWSLKTDN---------KEMKKNIESKWRKL---------KTKDRFQKLYADKRLYGDGFL 118 (461) T ss_pred chHHhhc-----------CCeeeecCC---------HHHHHHHHHHHHHh---------hHHHHHHHHHHhhcccccEEE Confidence 9998753 255554321 12334455555444 233344555667778988888 Q ss_pred EEEEecCCC-----------c---ceEEEEeeCCCceEE---eeCCCCcccCCceeEEEEeC-------------CceEE Q lcl|NC_019511. 184 EKVFSPKNK-----------T---KMEKFIAVDPSTIFY---ATDKNGKIIKGGNRFVQVID-------------KQVVA 233 (330) Q Consensus 184 ~~v~~rd~~-----------G---~~~~L~pldp~tV~~---~~d~~G~~~~~~~~Y~q~~~-------------~~~~~ 233 (330) ++.....+. + .+.-|.|+++..|.+ ..+..+-.+-.+ .|+++.. +.... T Consensus 119 ~i~v~d~~~~~~~~~~pl~~~~~~~~~~l~~~~~~~i~~~~~~~dp~sp~fg~P-~~y~i~~~~~~~~~~~~~~~~~~~~ 197 (461) T protein:vir:80 119 SIGVVSSNREQADLSTAIDPKTIKSIPYINTFNTQKVTQLYLNQDMFSEHFGEV-EFFEVNRVSQLGEEILSGTTASTSE 197 (461) T ss_pred EEEeecCCccccCccCCcccccccceeEEEeccccccchhhhcccCcCcccccc-eEEEEeccccccccccccccCccce Confidence 875532211 1 222333333332221 111111111112 2333321 22234 Q ss_pred EechhHeeeecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHH Q lcl|NC_019511. 234 SFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWK 313 (330) Q Consensus 234 ~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~ 313 (330) .+-++.|+|+...+..+. .+|.|.++.+...|.....+....+.+..+-..+ ++.+++-..+..+....+.+.++ T Consensus 198 ~iH~SRii~~~~~~~~~~---~~G~S~le~~~~~l~~~~~~~~~~~~l~~~~~~~--v~k~~~l~~~~~~~~~~~~~~~~ 272 (461) T protein:vir:80 198 QIHRSRIIHEQGLRFEGE---TKGRSIFESLYDIITVMDTSLWSVGQILYDFAFK--VYKTDDIDALNKDDKANLTAMLD 272 (461) T ss_pred EEccccEEEecCCCCCcc---ccCcchHHHHHHHHHHHHHHHHHHHHHHHHhCCC--ceecchHHhhhchHHHHHHHHHH Confidence 567888999876665543 4599999999999999999888888776664433 45555433444455556666676 Q ss_pred HHhcCcccccccceeeC Q lcl|NC_019511. 314 SSFSGINGSWQICLYIK 330 (330) Q Consensus 314 ~~~~G~~na~kvpvL~e 330 (330) ...++ - .+ +++. T Consensus 273 ~~~~~---~-g~-~~~d 284 (461) T protein:vir:80 273 FMFRT---E-AL-AIIK 284 (461) T ss_pred HhcCC---c-eE-EEEc Confidence 54332 1 22 3333 No 125 >protein:vir:96068 Length: 765 # NCBI annotation: conserved hypothetical protein ORF017 # Family: family:all:297 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294434;genbank:gi:149408331;genbank:GeneID:5237187 Probab=97.88 E-value=5.9e-06 Score=49.25 Aligned_cols=290 Identities=12% Similarity=0.086 Sum_probs=127.5 Q ss_pred CchhHHHHHhcCCCC--CC--cccccCccCc--chhHHHHHHHHHHHHhhcccchhccccchhcccccc-cccc-CCC-- Q lcl|NC_019511. 1 MPDLFKSLRLGSMYK--ED--TEDLMVPIDD--GIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLE-MMDT-NPD-- 70 (330) Q Consensus 1 ~~~~~~~~~~~~~~~--~~--~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~-~~~~-~p~-- 70 (330) |.=+|- ||....- ++ ..-..+|..+ +.-.++..+...+.+.++++ +.-+.+.+.+|... -|+. .+. T Consensus 4 ~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~a~ds~~~~~~ 79 (765) T protein:vir:96 4 LSWIFG--RKKDNAACSESAPEKVARIPQHDPLDPMIKLGKIRGWNVEPEKAP--VIRSVKDFLEPGLSVAMDSAYGDGP 79 (765) T ss_pred eeeecc--cccccccccccCchhhhhcCCCCCcccchhHHHHhhcccccccCC--CCCCCCcccCcccceeccccccccc Confidence 111111 1111100 00 0000111111 11112221212111112211 11112222222210 0000 000 Q ss_pred ---CCcC---CCcccc-----------hHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCc Q lcl|NC_019511. 71 ---YRDK---KSYMRN-----------AHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDA 133 (330) Q Consensus 71 ---~~~~---~s~~r~-----------~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~ 133 (330) +... .++.++ .-.+++++..|+.+.++++||.+.+++..+ -||.|.-.+ . T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~gyql~alY~~~~l~rkiVd~pAeDa~R-----------~g~~I~~~~--~ 146 (765) T protein:vir:96 80 TPAAKAAAGGQNPYVVPTMLQDWYNSQGFIGYQACAIISQHWLVDKACSMSGEDAAR-----------NGWELKSDG--R 146 (765) T ss_pred cchHHHhhhccCccchhhHHHhhhcccCCccHHHHHHHHhCchhhhhhhcchHHhhc-----------CCceeecCc--c Confidence 0000 000000 112467888899999999999999999753 255554322 1 Q ss_pred ccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEe-cC-------------CCcceEEEE Q lcl|NC_019511. 134 TPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFS-PK-------------NKTKMEKFI 199 (330) Q Consensus 134 ~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~-rd-------------~~G~~~~L~ 199 (330) +..+ +.+++++..+.++ ...+-+...++..-+||.+++.+... +| ..|....|. T Consensus 147 e~~~---~~~~~l~~~~~rl---------~v~~~l~ea~~~~RlyGga~i~i~i~~~D~~~l~~PL~~~~I~kg~~kgl~ 214 (765) T protein:vir:96 147 KLSD---EQSALIARRDMEF---------RVKDNLVELNRFKNVFGVRIALFVVESDDPDYYEKPFNPDGIAPGSYKGIS 214 (765) T ss_pred ccCH---HHHHHHHHHHHHh---------hHHHHHHHHHHHhhhceeeEEEEEecccCcchhhccccccccccceeeEEE Confidence 2222 2344555544444 23333444455556677777776543 22 223445666 Q ss_pred eeCCCceEEee------CCCCcccCCceeEEEEeCCceEEEechhHeeeecccCcCCCCCC---CccccHHHHHHHHHHH Q lcl|NC_019511. 200 AVDPSTIFYAT------DKNGKIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSDLNSS---GYGLSEVEIAMKEFIA 270 (330) Q Consensus 200 pldp~tV~~~~------d~~G~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~~~~---~yGlSPIe~a~~~I~~ 270 (330) .+||.-+.+.. |...-.+-.+. ++ .+.+. .+-++-|+|+...+..+.... .+|.|-++.+...|.. T Consensus 215 vldp~~~~~~~v~e~~~Dp~sp~fg~P~-~y-~i~g~---~IH~SRli~~~g~~lpd~lk~~~~~~G~Svlq~~yd~I~~ 289 (765) T protein:vir:96 215 QIDPYWAMPQLTAESTADPSAEHFYEPD-FW-IISGK---KYHRSHLVVVRGPQPPDILKPTYIFGGIPLTQRIYERVYA 289 (765) T ss_pred EechhhcccccchhccccccccccCcce-ee-eecCc---eeccceEEEecCCCchhhhccccCccCccHHHHHHHHHHH Confidence 66665443321 11110111111 22 23443 345667888876666555332 2499999999999999 Q ss_pred HHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 271 YNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 271 ~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) ...+....+.++...... ++.+++...+..+ +.+++..+....+.+|-+ + +++. T Consensus 290 ~~~t~~~~a~Ll~k~~~~--v~k~~~~~~l~~~--~~l~~r~~~~~~~r~n~g-~-~~id 343 (765) T protein:vir:96 290 AERTANEAPLLAMSKRTS--TIHVDVEKAIANE--DAFNARLAFWIANRDNHG-V-KVIG 343 (765) T ss_pred HHHHHHHHHHHHHHhccc--eeeechHhhhccH--HHHHHHHHHHHHhcCCce-e-EEec Confidence 999988888877765543 3444443333222 233333333333433433 3 4454 No 126 >protein:vir:79647 Length: 435 # NCBI annotation: PorT # Family: family:all:297 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285520;genbank:gi:148734503;genbank:GeneID:5220005 Probab=97.78 E-value=1.2e-05 Score=47.65 Aligned_cols=249 Identities=11% Similarity=0.095 Sum_probs=114.1 Q ss_pred cCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccc-cccCCCCCcCCCcccchHHHHHHHH Q lcl|NC_019511. 11 GSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEM-MDTNPDYRDKKSYMRNAHNLHEVLK 89 (330) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~-~~~~p~~~~~~s~~r~~~~~~~~Lr 89 (330) ++.+= .+-++ ...+...|...+... ...+|.+..... . ....... T Consensus 1 ~~~~m----------------------------~~~~~-~~~~~D~~~~~~~~~~g~~~~~~~~~~~----~-~~~~l~~ 46 (435) T protein:vir:79 1 MGVFM----------------------------SDKVK-AITKEDGYNEIFGSKDGTFRPNAFYMQR----A-AFKALSQ 46 (435) T ss_pred CCccc----------------------------ccccc-cchhhcchhhhhcccccccccCcccCCc----C-CHHHHHH Confidence 11110 00011 111222333322221 111222222111 1 1123345 Q ss_pred HHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHH Q lcl|NC_019511. 90 KFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCK 169 (330) Q Consensus 90 ~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~ 169 (330) .|+.++++++||.+.+++..+ -||++.-. + +.++ ++..+.++ ...+-+. T Consensus 47 ~Y~~~~l~~~~Vd~~aed~~r-----------~g~~i~g~-~-------~~~~---~~~~~~~l---------~~~~~l~ 95 (435) T protein:vir:79 47 FYEEDGMARRIVDVIPEEMVT-----------PGFKVDGV-K-------NEKS---FKSRWDEL---------RLNAKII 95 (435) T ss_pred HHhcCchhhhhhccchHHhhc-----------CCceecCC-C-------hHHH---HHHHHHHh---------hHHHHHH Confidence 577899999999999999864 24554321 1 1112 23333332 1223344 Q ss_pred HHHHHHHhcCCceeEEEEecCC---------CcceEEEEeeCCCceEEeeCC---CCcccCCceeEEEEeCC--ceEEEe Q lcl|NC_019511. 170 KIVRDTYTYDQVNFEKVFSPKN---------KTKMEKFIAVDPSTIFYATDK---NGKIIKGGNRFVQVIDK--QVVASF 235 (330) Q Consensus 170 ~~v~d~L~~g~g~~~~v~~rd~---------~G~~~~L~pldp~tV~~~~d~---~G~~~~~~~~Y~q~~~~--~~~~~~ 235 (330) ..++-..++|.+++.+... ++ .|.+..|.++||..|.+..-. ..-.+-.+ .++++..+ .....+ T Consensus 96 ~a~~~~rl~G~~~i~i~~~-d~~~~~~Pl~~~g~i~~i~v~d~~~i~~~~~~~dp~sp~fg~P-~~y~v~~~~~~~~~~i 173 (435) T protein:vir:79 96 DALSWSRLFGGSAILAVVA-DNKMLKSPVKPGAQLEDIRVYDRYQITIHERETNARSVRYGEP-KLYKISPGGDIPEFFV 173 (435) T ss_pred HHHHhhhccccEEEEEEec-CCCCcccccccCCceeeEEeechhhccchhhccCCcccccCcc-eEEEEecCCCCCceEE Confidence 4445566788777776642 22 244557778888766543211 11111112 23334322 223456 Q ss_pred chhHeeeecccCcCCCC---CCCccccHH-HHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCC-CCC-CHHHHHHHH Q lcl|NC_019511. 236 TSRELVMGIRNPRSDLN---SSGYGLSEV-EIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRAD-QQQ-SQHALENFK 309 (330) Q Consensus 236 ~~~dvih~~~n~~~d~~---~~~yGlSPI-e~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~-~~l-s~e~~e~lr 309 (330) -++.|+|+...+..+.. .+.+|.||+ +.+...|.....+....+.++...... ++.+++- ..+ +.+....++ T Consensus 174 H~SRli~~~g~~~p~~~~~~~~~~G~S~l~e~~~~~l~~~~~~~~~~~~l~~~~~~~--v~~~~~l~~~~~~~~~~~~~~ 251 (435) T protein:vir:79 174 HYSRICIIDGERVSNEKRRQNDGWGASILNKRLIEAIVDYNYCQELATQLLRRKQQA--VWKARDLALMCDDEEGRYAAR 251 (435) T ss_pred cceeEEEecCCcchhhhccccCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcCc--cccchhHHHhhcCccchHHHH Confidence 67788888765544332 344599998 789999999999988888765544322 1333221 011 111112222 Q ss_pred HHH--HHHhcCcccccccceeeC Q lcl|NC_019511. 310 REW--KSSFSGINGSWQICLYIK 330 (330) Q Consensus 310 ~~w--~~~~~G~~na~kvpvL~e 330 (330) ... .+...+. -+.+.|.=+ T Consensus 252 ~r~~~~~~~~~~--~~~~~i~~~ 272 (435) T protein:vir:79 252 LRLAQVDDESGV--GKAIGIDAT 272 (435) T ss_pred HHHHHHHHhcCC--CCceeEecC Confidence 222 2233332 233333222 No 127 >protein:vir:98816 Length: 446 # NCBI annotation: hypothetical protein # Family: family:all:32558 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851097;genbank:gi:117530254;genbank:GeneID:4484480 Probab=97.54 E-value=4.9e-05 Score=44.23 Aligned_cols=256 Identities=12% Similarity=0.159 Sum_probs=122.2 Q ss_pred cCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccc-hHHHHHHHHHH---hh-cHHHHH Q lcl|NC_019511. 25 IDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRN-AHNLHEVLKKF---GN-NSILNA 99 (330) Q Consensus 25 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~-~~~~~~~Lr~~---a~-~~iv~a 99 (330) .|.++ .+++.+.-+++-++.-..... +++ |--+-+-||- ..+.++.|+.| -+ -+-|.+ T Consensus 1 ~~~~~--------------~~~p~~~~~~~~~~~~~~~~~--~~g-~~~~D~~lr~~gg~~~~~~~l~~~m~e~D~~v~s 63 (446) T protein:vir:98 1 MNMEV--------------RNAPTPAIRRRTIYAMEHLGL--ATS-YLSEDGGYKRAGKPTYQQLSAWDEAAQTEPIIAQ 63 (446) T ss_pred Ccccc--------------cCCCchhhhhhhhhccccchh--hcc-cCCcchHhhhcCCChHHHHHHHHHHHhcchHHHH Confidence 11111 111222222222221111110 111 1111010110 01122333333 23 478999 Q ss_pred HHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcC Q lcl|NC_019511. 100 IIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYD 179 (330) Q Consensus 100 ~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g 179 (330) |+.+|+.-|. ++.|.|.+.+ .+..+.++..|..+ . ++++...+.|.+.+| T Consensus 64 ~l~~Rk~av~-----------~~~w~V~p~~---------~~~a~~v~~~l~~~---------~-~~~~~~~~ldai~~G 113 (446) T protein:vir:98 64 GLDSIALSVL-----------NKVGPYQHGD---------KRIKKFIDDQLRNR---------A-KTWISHCVKSIMTYG 113 (446) T ss_pred HHHHHHHHhh-----------cCCceecCcc---------HHHHHHHHHHHhhc---------C-chhHHHHHHHHHhhC Confidence 9999999997 4678887532 12334455555332 1 245555677999999 Q ss_pred CceeEEEEecCCCcc-eEE----EEeeCCCceEEeeCCCCcccCCce-e--------------E------EEEeCCceEE Q lcl|NC_019511. 180 QVNFEKVFSPKNKTK-MEK----FIAVDPSTIFYATDKNGKIIKGGN-R--------------F------VQVIDKQVVA 233 (330) Q Consensus 180 ~g~~~~v~~rd~~G~-~~~----L~pldp~tV~~~~d~~G~~~~~~~-~--------------Y------~q~~~~~~~~ 233 (330) -...|++|.+...+. |.. +....|..+.-..+.++....+.. . | ......+.-. T Consensus 114 ~s~~Eivw~~~~g~~~p~~~~d~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 193 (446) T protein:vir:98 114 FSLSEQIYAHGARDNMPATVLDDIVNYHPLQVMLIANDNGRIVDGDTVTASQYKSGYWVPLPPYRIGDPPKKVDVVGSHV 193 (446) T ss_pred ceeeeEEEeecccccccchhhccccccccccceeeeccCCccccccccchhhcccccccCcccchhhhhhhhcccCcccc Confidence 999999998654332 211 122233333322233322211100 0 0 0001112223 Q ss_pred EechhHeeeecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCC----CCHHHH---H Q lcl|NC_019511. 234 SFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQ----QSQHAL---E 306 (330) Q Consensus 234 ~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~----ls~e~~---e 306 (330) .++.+..++.++++.++ .+||.|.+..|....-.-....++-+.|-..-+.|-=+..+|.+.. -++++. + T Consensus 194 ~iP~~kfi~~~~~~~~~---~p~G~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vGkyp~ga~~~~~~~~~~~~~~~ 270 (446) T protein:vir:98 194 RLPSHKRLFINYNTKGN---NPWGTSCLTSVLDYSIFKRAFRDMMLIALDRYGTPLIYVIVPPGNTGVVEEAPDGTEITT 270 (446) T ss_pred cccccceEEEEecCCCC---CccccchHHHHHHHHHHHHhhHHHHHHHHhHcCCceeEEeecCCCCcccccchhHHHHHH Confidence 45666777777766554 5799999999999998888888888888888787877777764321 122211 1 Q ss_pred HHHHHHHHHhcCc-ccc-ccccee-------eC Q lcl|NC_019511. 307 NFKREWKSSFSGI-NGS-WQICLY-------IK 330 (330) Q Consensus 307 ~lr~~w~~~~~G~-~na-~kvpvL-------~e 330 (330) .+.+.+.+.+... .++ +-+|.+ +| T Consensus 271 ~~~~~L~~av~~~~~da~~ii~~~~~P~g~eie 303 (446) T protein:vir:98 271 TIAEQAEDALRRLSTDSGLVLTQLSKEQPVQVG 303 (446) T ss_pred HHHHHHHHHHHhccccceeeeecccCCCCceEE Confidence 2222333333221 122 222121 22 No 128 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=97.49 E-value=5.6e-05 Score=43.89 Aligned_cols=279 Identities=11% Similarity=0.048 Sum_probs=120.9 Q ss_pred cCccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHH----HHhh-cHH Q lcl|NC_019511. 22 MVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLK----KFGN-NSI 96 (330) Q Consensus 22 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr----~~a~-~~i 96 (330) ++.+|.-+...-- +...+...+.... .+| .....-....+|+.+.++........+.|+ .+.+ |++ T Consensus 1 Mn~iDr~i~~~sP---~~a~~R~~ar~~~----~~y--~aa~~~r~~~~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~ 71 (548) T protein:vir:95 1 MNLIDRLLEPLAP---ELVARRLAAREAI----QAY--EAARPGRTHKAKRQPLGADTSLQKSAVSMREQCRKLDEDHDL 71 (548) T ss_pred CchHHhHhhhcch---HHHHHHHHhHHHh----ccc--cccCccccccccCCCCChHHHHHHHHHHHHHHHHHHHhcChH Confidence 3334433322100 0000000000000 011 111111122345444333222222234443 3443 689 Q ss_pred HHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCC-CCCCcCCHHHHHHHHHHHH Q lcl|NC_019511. 97 LNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTD-KDIDRDSFQEFCKKIVRDT 175 (330) Q Consensus 97 v~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~-~pn~~~s~~~fl~~~v~d~ 175 (330) +..+|+.+.+.|-. ..+++..-+....+.+..++- .++++..+.....+ ....+.+|..+...+++.+ T Consensus 72 a~~av~~~~~nvVG--------~~G~~i~p~~l~~d~~~a~~l---~~~ie~~w~~Wa~~~D~~g~~~f~~lq~l~~R~~ 140 (548) T protein:vir:95 72 VTGLLDRLEERVVG--------GSGIGVEPLPLRLDGSVHAEL---AMEIRSAWAEWSLSPETSGELTRPQVERLMCRTW 140 (548) T ss_pred HHHHHHHHHHhccC--------ccccceeeeecCCCHHHHHHH---HHHHHHHHHHhhcCccccccCCHHHHHHHHHHHH Confidence 99999988877741 112333322222222222221 23344333333221 1234568999999999999 Q ss_pred HhcCCceeEEEEecCC-----CcceEEEEeeCCCceEEeeCCCCcccC---------CceeEEEEeC--C--------ce Q lcl|NC_019511. 176 YTYDQVNFEKVFSPKN-----KTKMEKFIAVDPSTIFYATDKNGKIIK---------GGNRFVQVID--K--------QV 231 (330) Q Consensus 176 L~~g~g~~~~v~~rd~-----~G~~~~L~pldp~tV~~~~d~~G~~~~---------~~~~Y~q~~~--~--------~~ 231 (330) +..|...+.+.+.+.. ...++.|-.|+|+.|..-.+..+.... .++.|..... | .. T Consensus 141 ~~dGE~f~~~~~~~~~~~~~g~~~~~~lqliepd~l~~~~~~~~~~i~~GIE~D~~Grp~aY~i~~~hPgd~~~~~~~~~ 220 (548) T protein:vir:95 141 LRDGEGLAQKLMGRVPNYTFATSVPFALELLEPDYLPFSYNNLSKGIVQGIERDTWRRKRAYHLLKDHPGNLQTLGGSLA 220 (548) T ss_pred HhCCceEEEeeecccccccCCcccceEEEEechhhcCCCCCCCCCceeeeeEECCCCceEEEEEeecCCCcccccccccc Confidence 9999888877765542 335678888999887432222221111 1233432211 1 12 Q ss_pred EEEechhHeeeecccCcCCCCCCCccccHHHHHHHHH---HHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHH Q lcl|NC_019511. 232 VASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKEF---IAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENF 308 (330) Q Consensus 232 ~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I---~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~l 308 (330) ...+++++|+|+-..-+.+- .-|+|.+..++..+ .-...++..+++. .|.-.+++..+.+.....+ .. T Consensus 221 ~~rvpA~~VlHif~~~r~gQ---~RGvs~lapvl~~l~~l~~y~dael~~aki---~A~~a~fi~~~~~~~~~~~---~~ 291 (548) T protein:vir:95 221 VKRVEAERIIHIAYRKRIGQ---NRGVPMLHAVLIRLADLKDYEESERVAARI---SAALAMYIKKGNPDSYTVE---PG 291 (548) T ss_pred eeeechhHheecccccCCcc---ccCcchHHHHHHHHHHHhHHHHHHHHHHHH---hhhheeeeecCCCccccCC---CC Confidence 34578999999854333321 23888776665554 4445555555443 2333455554322111110 00 Q ss_pred HHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 309 KREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 309 r~~w~~~~~G~~na~kvpvL~e 330 (330) ...-. .....+....++.|.. T Consensus 292 ~~~~~-~~~~~~pG~iv~~L~p 312 (548) T protein:vir:95 292 KDRKN-RTIPIAPGMVFDDLEP 312 (548) T ss_pred ccccc-ccccccCCccccccCC Confidence 00000 0011111112344433 No 129 >protein:vir:5839 Length: 533 # NCBI annotation: similar to portal vertex protein of head # Family: family:all:1036 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835625;genbank:gi:30044028 Probab=96.50 E-value=0.00057 Score=38.38 Aligned_cols=278 Identities=13% Similarity=0.116 Sum_probs=137.4 Q ss_pred CchhHHHHHhcCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccchh---ccccccccccCCCCCcCCCc Q lcl|NC_019511. 1 MPDLFKSLRLGSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAY---AEPFLEMMDTNPDYRDKKSY 77 (330) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~---~~~~~~~~~~~p~~~~~~s~ 77 (330) ||-|.+|-+ .+..++..++..++.+... +....|....- ..|+.........+. .. T Consensus 1 ~~~~~~w~~-----------------~de~~~~~~~~~~~~~~~~-p~~~dG~s~i~~~~~~~~~~~~~~~~~~g---g~ 59 (533) T protein:vir:58 1 MPSLEKYKK-----------------LNEAVNFTNFLSPMYGMGA-PHGAGGSSMIPINMYHPFATAGYASRFYG---GI 59 (533) T ss_pred CCCcchhhh-----------------hhHHHHHHHhhchhhcccC-ccCCCCCccccCCCCcchhhhhhhhhhhc---cc Confidence 555555543 2334555666666666532 33344432111 111111100011111 12 Q ss_pred ccchHHHHHHHHHHhh-cHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCC Q lcl|NC_019511. 78 MRNAHNLHEVLKKFGN-NSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTD 156 (330) Q Consensus 78 ~r~~~~~~~~Lr~~a~-~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~ 156 (330) .+|-....+.-|..|. +|-|..+|+.|++++. +.+..+---+|.+.+ .+.++...++|..+-+ T Consensus 60 ~~n~~eLI~~YR~ma~~~pEVd~AideIvneai------v~d~~~~pV~v~l~~--~e~s~~iK~kI~~lld-------- 123 (533) T protein:vir:58 60 EFNRFFLYDMYDRMDYTDPLISTVLDIIADECT------IPNENGNIVDVVTKD--IELAKAILSYLDYVIN-------- 123 (533) T ss_pred cccHHHHHHHHHHhhccCcchhhHHHhhhceee------EecCCCceeEeeccc--ccccHHHHHHHHHHhc-------- Confidence 3455556667777775 5889999999888874 222222223444543 3456555555543222 Q ss_pred CCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEE------eCCc Q lcl|NC_019511. 157 KDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQV------IDKQ 230 (330) Q Consensus 157 ~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~------~~~~ 230 (330) |..-.-.+++..++.|..|+.++.+. -.+-+.+|..|||..|+.+...... ...|+|. ..+. T Consensus 124 -------f~~~~~~~fR~WYVDGriy~Hkiik~-~k~GI~elr~lDPr~i~~vr~~~t~----~eyyvy~~~~~~~~s~~ 191 (533) T protein:vir:58 124 -------IEKNAYPIIRNMIKYGDMFLHILEKG-SDGTIEKFQVVSPYIFSKRYNPETD----TWYYVITDVYRNVVSGY 191 (533) T ss_pred -------chhhhhHHHHhhhhcceeEEEeccCC-cccchhhheecCCeeeEEEEeeccc----eEEEeecccccccccCc Confidence 22222334566788889999987643 3455789999999999877543221 1122222 1222 Q ss_pred eEEEechhHeeeecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHH Q lcl|NC_019511. 231 VVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKR 310 (330) Q Consensus 231 ~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~ 310 (330) .......+.|+|.+.--.+ ..++||+|=|..|+..+....-.+.-.--|==--|.=+=|+-+..+ +|.+...++.-+ T Consensus 192 ~~~kI~~daI~y~~SGl~d--~~~~~iisyLhkAiKp~NQLkmiEDAlVIYRisRAPeRRvFYIDVG-Nlpk~KAeqYl~ 268 (533) T protein:vir:58 192 FNEDIPEEDVIHFSHKIDT--NFFPYGRSYLESARAIWNQLRLMEDALMLYRVVRSVDRRVFYVDVG-NVPPDKINEYLT 268 (533) T ss_pred cccccchhheeeeeecccc--CCCCceehhhhHHHHHHHHHHHHHHHHHHHhhcCChhheEEEEeec-CCCccCHHHHHH Confidence 2345678999998753322 2367899999999777766555544322221122322334444332 333333333323 Q ss_pred HHHHHhcC----ccccccc----------ceeeC Q lcl|NC_019511. 311 EWKSSFSG----INGSWQI----------CLYIK 330 (330) Q Consensus 311 ~w~~~~~G----~~na~kv----------pvL~e 330 (330) .....|.- -.+-|+| .||++ T Consensus 269 ~im~k~kNklvYDa~TGev~ddrk~m~~~sMlED 302 (533) T protein:vir:58 269 NIAMQYKRDYWVRNNQNQFLGIDNYFSIESILKD 302 (533) T ss_pred HHHHhcccceEEeccCCeEeeccchhhhhhhHhh Confidence 33322210 0111222 23333 No 130 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=96.36 E-value=0.0007 Score=37.87 Aligned_cols=266 Identities=10% Similarity=0.039 Sum_probs=112.7 Q ss_pred cCccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcC--CCcccchHHHHHHH----HHHhh-c Q lcl|NC_019511. 22 MVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDK--KSYMRNAHNLHEVL----KKFGN-N 94 (330) Q Consensus 22 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~--~s~~r~~~~~~~~L----r~~a~-~ 94 (330) .|-++..+..-- ..+...... ..| ....... +|+.. .++........+.| |.+.+ | T Consensus 1 m~~~~~~~~a~~----~~~~~~~~~--------~~y--~aa~~~~---~~~~~~~~s~d~~~~~~~~~lr~RaRdl~rNn 63 (495) T protein:vir:10 1 MNMTPSGYQSLA----SGLLVPVGA--------SAY--EGASGGH---RWQDIGDYGPDTAVASGIQTLRARSHHNVRNN 63 (495) T ss_pred CCcccccccccc----hhhhhHHHh--------hhh--hccccCc---ccCCCCCCChhHHHHHHHHHHHHHHHHHHhcC Confidence 223332221100 000000000 011 1000000 11111 12211111223344 44443 6 Q ss_pred HHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCC-CCCcCCHHHHHHHHHH Q lcl|NC_019511. 95 SILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDK-DIDRDSFQEFCKKIVR 173 (330) Q Consensus 95 ~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~-pn~~~s~~~fl~~~v~ 173 (330) +++..+|+.+.+.|- |-|+....+-.++ + --++++..+......- ...+.+|..+...+++ T Consensus 64 ~~a~~av~~~~~~vV-----------G~Gi~p~~~~~~~----~---~~~~ie~~w~~wa~~~D~~g~~~f~~lq~l~~r 125 (495) T protein:vir:10 64 PWATNAVATWVAAAV-----------GNGLTPRWRMKEQ----E---LRQELQELWGDWVNEADFDEVQSFYGLQALVVR 125 (495) T ss_pred hHHHHHHHHHHHhhc-----------CCCcccccCCchH----H---HHHHHHHHHHHhhcCcccccccCHHHHHHHHHH Confidence 899999998888773 3365544432221 2 2233333333332211 2345689999999999 Q ss_pred HHHhcCCceeEEEEecCC--CcceEEEEeeCCCceEE-ee---CCCCccc---------CCceeEEEEe-CCc------- Q lcl|NC_019511. 174 DTYTYDQVNFEKVFSPKN--KTKMEKFIAVDPSTIFY-AT---DKNGKII---------KGGNRFVQVI-DKQ------- 230 (330) Q Consensus 174 d~L~~g~g~~~~v~~rd~--~G~~~~L~pldp~tV~~-~~---d~~G~~~---------~~~~~Y~q~~-~~~------- 230 (330) .++.-|..++-+.+.+.+ ..-++.|-.|+|+.|.. .. ..+|... -.++.|.... +.+ T Consensus 126 ~~~~dGE~f~~~~~~~~~~g~~~~~~lqliepd~l~~~~~~~~~~~g~~i~~GIe~d~~Gr~vaY~i~~~hpgd~~~~~~ 205 (495) T protein:vir:10 126 TVINSGEAFVIKKPRPLSEGLSVPLQLQIIEPDMLASDIPDETLPSGGYVKGGIRFSNGGKRKAYCFYRNHPAESSLIGD 205 (495) T ss_pred HHHhCCceEEEEeecccCCCCccceEEEEechhhcCCCCCCCCCCCCCEEEeceEECCCCceEEEEEeecCCCccccccc Confidence 999988887766655443 34577888999998742 11 1122111 1123444321 111 Q ss_pred --eEEEechhHeeeecccCcCCCCCCCccccHHHHH--HHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHH Q lcl|NC_019511. 231 --VVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIA--MKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALE 306 (330) Q Consensus 231 --~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a--~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e 306 (330) ....+++++|+|+ +..+.+- .-|+|-+... +..+.-...++..+++. .|.-.+++..+.+.....+... T Consensus 206 ~~~~~rvpA~~vlH~-f~~r~gQ---~RGis~la~i~~l~~l~~y~dael~~a~i---~A~~~~fi~~~~~~~~~~~~~~ 278 (495) T protein:vir:10 206 PVDTVWIKAEHVLHV-TVLTVRS---DAGAPWFQLLLRLNELDQYEDAELVRKKT---AALFAAFIQEATADSTGGPTIG 278 (495) T ss_pred ccceeeechhheEec-cccCCCc---ccCcchhHHHHHHHHhhHHHHHHHHHHHH---hhhheeeeecCCCccccccccC Confidence 2345789999998 3344332 2377654321 22333344444444433 2333455543222110000000 Q ss_pred H-HHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 307 N-FKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 307 ~-lr~~w~~~~~G~~na~kvpvL~e 330 (330) . -...-.....+. +.|.|+.|-. T Consensus 279 ~~~~~~~~~~~~~l-~pG~i~~L~p 302 (495) T protein:vir:10 279 QPKRSKGGKRITGL-NPGTLQYLQP 302 (495) T ss_pred ccccccCcccceec-CCceeeecCC Confidence 0 000000011111 2345555443 No 131 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=96.08 E-value=0.001 Score=36.94 Aligned_cols=272 Identities=8% Similarity=0.012 Sum_probs=115.5 Q ss_pred hhcccch-hccccc-----hhccccccccccCCCCCcCC-CcccchHHHHHHH----HHHh-hcHHHHHHHHHHHHhHhh Q lcl|NC_019511. 43 MQEITKS-LYGKQQ-----AYAEPFLEMMDTNPDYRDKK-SYMRNAHNLHEVL----KKFG-NNSILNAIIITRANQVST 110 (330) Q Consensus 43 ~~~~~~~-~~g~~~-----~~~~~~~~~~~~~p~~~~~~-s~~r~~~~~~~~L----r~~a-~~~iv~a~I~~~~d~Ia~ 110 (330) |++.... ..++.. .|.+-....-.....|.+.. ++-.......+.| |.+. +|+++..+|+.+.+.|- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~av~~~~~nvV- 79 (530) T protein:vir:38 1 MKIPSLVGPDGKTSLREYAGYHGGGGGFGGQLRGWNPPSESADAALLPNYSRGNARADDLVRNNGYAANAVQLHQDHIV- 79 (530) T ss_pred CccceeecCccccchHHHhhhhcccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHhh- Confidence 1111000 011110 11100000000112344332 1111111122344 4444 36999999999888874 Q ss_pred hhhhheecccccceeeeccCC--CcccChhhH-HHHHHHHHHHHhccCCC-----CCCcCCHHHHHHHHHHHHHhcCCce Q lcl|NC_019511. 111 YCKPARYSEKGVGFEVKLKDL--DATPGIKEK-EQMKRIEEFILNTGTDK-----DIDRDSFQEFCKKIVRDTYTYDQVN 182 (330) Q Consensus 111 ~~~~~~~~~~~~g~~v~~kd~--~~~~~~~~~-~~~~~i~~~l~~~~~~~-----pn~~~s~~~fl~~~v~d~L~~g~g~ 182 (330) |-|+.++.+-. -...+.++. +-.++++..+....... ...+.||.++...+++.++.-|.+. T Consensus 80 ----------G~Gi~~~~~p~~~~l~~~~~~~~~~~~~ie~~w~~W~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~ 149 (530) T protein:vir:38 80 ----------GSFFRLSYRPSWRYLGINEEDSRAFSRDVEAAWNEYAEDDFCGIDAERKRTFTMMIREGVAMHAFNGELC 149 (530) T ss_pred ----------CCCceeeeccchhhcCCCHhHHHHHHHHHHHHHHHhhcCCCcEEeeeccCCHHHHHHHHHHHHhhCCceE Confidence 34655554311 001111222 22345555554432221 2345689999999999999999888 Q ss_pred eEEEEecCCC-cceEEEEeeCCCceEEeeC-CCCcccC---------CceeEEEEeC--Cce----------EEEechhH Q lcl|NC_019511. 183 FEKVFSPKNK-TKMEKFIAVDPSTIFYATD-KNGKIIK---------GGNRFVQVID--KQV----------VASFTSRE 239 (330) Q Consensus 183 ~~~v~~rd~~-G~~~~L~pldp~tV~~~~d-~~G~~~~---------~~~~Y~q~~~--~~~----------~~~~~~~d 239 (330) +-+.+.+... .-++.|-.|+|+.|....+ .+|.... .++.|..... ++. .....+++ T Consensus 150 ~~~~~~~~~g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~~~~~~~~~~~~~~~~~~~v~a~~ 229 (530) T protein:vir:38 150 VQATWDSDSTRLFRTQFKMVSPKRVSNPNNIGDTRNCRAGVKINDSGAALGYYVSDDGYPGWMAQNWTYIPRELPGGRPS 229 (530) T ss_pred EEeeeccCCCCccceEEEEechhhcCCCCCCCCCCeeEeeeEECCCCceEEEEEeeccCCCccccccceeeeeeccChhH Confidence 8777654422 2356788899987643211 1221111 1233433221 111 12245679 Q ss_pred eeeecccCcCCCCCCCccccHHHHHHHHHHH---HHHHHHHHHH-------HHhcCCCcceEEEeCCCCCCCHHHHHHHH Q lcl|NC_019511. 240 LVMGIRNPRSDLNSSGYGLSEVEIAMKEFIA---YNNTESFNDR-------FFSHGGTTRGILQIRADQQQSQHALENFK 309 (330) Q Consensus 240 vih~~~n~~~d~~~~~yGlSPIe~a~~~I~~---~laae~~~~~-------fF~nGa~p~GiL~~~~~~~ls~e~~e~lr 309 (330) |+|+-..-+.+ ..-|+|.+..++..+.. ...++..+++ |.++...+.+.....++. ...++...+. T Consensus 230 vlH~f~~~r~g---Q~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~-~~~~~~~~~~ 305 (530) T protein:vir:38 230 FIHVFEPMEDG---QTRGANAFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDFILGA-DNKEQQSKLT 305 (530) T ss_pred eEeeccccCCC---cccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeeccCCccccccccccC-Cccccccccc Confidence 99985433332 22388887766655433 3334433333 222322222222211110 0111111222 Q ss_pred HHHHHHhcCcc-------cccccceeeC Q lcl|NC_019511. 310 REWKSSFSGIN-------GSWQICLYIK 330 (330) Q Consensus 310 ~~w~~~~~G~~-------na~kvpvL~e 330 (330) ....+. .+.. +.|.|+.|.. T Consensus 306 ~~~~~~-~~~~~~~~~~l~pG~i~~L~p 332 (530) T protein:vir:38 306 GWLGEM-AAYYSAAPVRLGGARVPHLLP 332 (530) T ss_pred ccchhh-hhcccccceeccCceeeecCC Confidence 222111 1100 2344444444 No 132 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=95.69 E-value=0.0016 Score=35.88 Aligned_cols=279 Identities=7% Similarity=0.044 Sum_probs=119.9 Q ss_pred HHHHHHHHHHHhhcccchhccccchhcccccccc-ccCCCCCcCCC-cccchHHHHHHH----HHHhh-cHHHHHHHHHH Q lcl|NC_019511. 32 NIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMM-DTNPDYRDKKS-YMRNAHNLHEVL----KKFGN-NSILNAIIITR 104 (330) Q Consensus 32 ~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~-~~~p~~~~~~s-~~r~~~~~~~~L----r~~a~-~~iv~a~I~~~ 104 (330) +++.+..-...........-.+...+........ ...-.|+++.. +-.......+.| |.+.+ |+++..+|+.+ T Consensus 1 m~~~~~r~~~~~a~~~~~~~~~~~~~~y~gA~~~~r~~~~w~~~~~s~~~~~~~~~~~lr~RaRdL~rNn~~a~~av~~~ 80 (553) T protein:vir:63 1 MTKVTVRKLSEVTSGRPEQSASLGGGGLEGASRLSRETVSWNPSLRSPDALINPLKRIADARGRDMADNDGFTNGAVGYQ 80 (553) T ss_pred CcchhhhhhcccccccchhhhhhhcccccccccCCCcccccccCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHH Confidence 1111111000000000000000000000100000 01124555432 111111123344 44443 68999999998 Q ss_pred HHhHhhhhhhheecccccceeeeccCCCc---ccChhhHHH-HHHHHHHHHhccCC-----CCCCcCCHHHHHHHHHHHH Q lcl|NC_019511. 105 ANQVSTYCKPARYSEKGVGFEVKLKDLDA---TPGIKEKEQ-MKRIEEFILNTGTD-----KDIDRDSFQEFCKKIVRDT 175 (330) Q Consensus 105 ~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~---~~~~~~~~~-~~~i~~~l~~~~~~-----~pn~~~s~~~fl~~~v~d~ 175 (330) .+.|- |-|+....+-... ..+.+..++ .+++++.+...... ....+.+|..+...+++.+ T Consensus 81 ~~nvV-----------G~Gi~~~~~~~~~~l~g~~~~~~~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f~~~q~l~~r~~ 149 (553) T protein:vir:63 81 RDSIV-----------GAQYRLNSMPDINVIPGATEEWAEEYQTIVEAKFELYAESLACYIDNAAISTFTGLIRLGVVGY 149 (553) T ss_pred HHhhc-----------cCCceeeeccchhhhcCCCHHHHHHHHHHHHHHHHHhcCCccceeeccccCCHHHHHHHHHHHH Confidence 88874 3466655442111 112222222 34455555443221 1234568999999999999 Q ss_pred HhcCCceeEEEEecCCCc-ceEEEEeeCCCceEEeeC-CCCcccC---------CceeEEEEe-C-Cc------------ Q lcl|NC_019511. 176 YTYDQVNFEKVFSPKNKT-KMEKFIAVDPSTIFYATD-KNGKIIK---------GGNRFVQVI-D-KQ------------ 230 (330) Q Consensus 176 L~~g~g~~~~v~~rd~~G-~~~~L~pldp~tV~~~~d-~~G~~~~---------~~~~Y~q~~-~-~~------------ 230 (330) +.-|...+-+.+.++..+ .++.|-.|+|+.|....+ .+|.... .++.|.... + |. T Consensus 150 ~~dGE~~~~~~~~~~~~~~~~~~lq~ie~drl~~~~~~~~~~~i~~GVE~d~~Gr~vaY~i~~~hPgd~~~~~~~~~~~~ 229 (553) T protein:vir:63 150 VKTGEVLATAEWDRAANRPYATCFQMVSTDRLSNPYQQLDTPTLRRGVQYDKRGRPQGYWIQVAHPGDLYQMAPDMYKWK 229 (553) T ss_pred HhCCceEEEeeeccCCCCcccceEEEechhhcCCCCCCCCCCeeEeeeEECCCCceEEEEeeccCCCcccccccccccee Confidence 999988887777654322 346788899987753322 1221111 123343221 1 10 Q ss_pred ---eEEEechhHeeeecccCcCCCCCCCccccHHHHHHHHHH---HHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHH Q lcl|NC_019511. 231 ---VVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKEFI---AYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHA 304 (330) Q Consensus 231 ---~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I~---~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~ 304 (330) ......+++|||+-..-+.+- .-|+|.+..++..+. -...++..+++. +|.-.++|..+.+ ++.. T Consensus 230 r~~~~~~v~a~~vlH~f~~~r~gQ---~RGis~lapvl~~l~~l~~y~daeL~~a~i---~A~~a~fi~~~~~---~~~~ 300 (553) T protein:vir:63 230 FVQQSKPWGRRQVIHILEPREPDQ---SRGIADIVSGLKDMRMAKRFKEMSLQNAVI---NASYAAAIESELP---PEFI 300 (553) T ss_pred eeccccccChhHheecccccCCCc---ccCCchHHHHHHHHHHHhHHHHHHHHHHHH---hhhheeeeecCCC---hhhh Confidence 112356899999854433322 238888766665543 344445444443 2333455543321 1222 Q ss_pred HHHHHHH----------------HHHHhcCcc----cccccceeeC Q lcl|NC_019511. 305 LENFKRE----------------WKSSFSGIN----GSWQICLYIK 330 (330) Q Consensus 305 ~e~lr~~----------------w~~~~~G~~----na~kvpvL~e 330 (330) .+.+... ....+.|.. +.|.|+.|.. T Consensus 301 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~p 346 (553) T protein:vir:63 301 HSQMSGGSPNADMVGIFGKYMDALKAYVGGANNIQIDGAKIPHLFP 346 (553) T ss_pred hhhcccccccccccccccccccccccccccccceeecCceeeecCC Confidence 2222111 111111110 1334444333 No 133 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=95.66 E-value=0.0017 Score=35.83 Aligned_cols=278 Identities=11% Similarity=0.045 Sum_probs=124.9 Q ss_pred cCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCc-ccchH--HHHHH Q lcl|NC_019511. 11 GSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSY-MRNAH--NLHEV 87 (330) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~-~r~~~--~~~~~ 87 (330) +++.++...- .|.-++.-......... +....|.-- ..-...-.|...|+. ..+.+ ...+. T Consensus 1 ~~r~~~~~~~----~dr~i~~~~~~~~~~~~----------~~~~~y~aa--~~~r~~~~w~~~~~~~s~~~~i~~~~~~ 64 (505) T protein:vir:96 1 MKRAEKKPSL----AQRMVNWAWYRYVEPQK----------NAARAFEAA--RRDRLGKAWLRRASRLSADEEIYADLAS 64 (505) T ss_pred CCCCccccch----hhcccchhhhhhHHHHH----------Hhhhhcccc--cCCCccccccCCCCCCChHHHHHHHHHH Confidence 5555554332 33322221100000000 000112110 000011134322221 11211 12233 Q ss_pred H----HHHh-hcHHHHHHHHHHHHhHhhhhhhheecccccceeee--ccCCCcccChhhHHHHHHHHHHHHhccCC---C Q lcl|NC_019511. 88 L----KKFG-NNSILNAIIITRANQVSTYCKPARYSEKGVGFEVK--LKDLDATPGIKEKEQMKRIEEFILNTGTD---K 157 (330) Q Consensus 88 L----r~~a-~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~--~kd~~~~~~~~~~~~~~~i~~~l~~~~~~---~ 157 (330) | |.+. +|+++..+|+.+.+.|-. ..|+... .+..+-+.+++- .++|+..+...... . T Consensus 65 lr~RaRdL~rNn~~a~~av~~~~~nvVG----------~~Gi~~~~~~~~~~~~~~~~~---~~~ie~~w~~Wa~~~~~D 131 (505) T protein:vir:96 65 LVQRAREQSINNPYAKRFYQLLKNNVIG----------PKGMTFQSRVKRRNGKPDDRA---NTLIEGNWQQWIKKGNCD 131 (505) T ss_pred HHHHHHHHHhcChHHHHHHHHHHHHhcC----------CCcceeeecCCcccccccHHH---HHHHHHHHHHhcCCcCcc Confidence 4 4444 368999999998888741 1244333 322333333333 33444444444322 1 Q ss_pred CCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeC---CCCcccC---------CceeEEE Q lcl|NC_019511. 158 DIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATD---KNGKIIK---------GGNRFVQ 225 (330) Q Consensus 158 pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d---~~G~~~~---------~~~~Y~q 225 (330) -..+.+|.++...+++.++.-|..++-+++ +.+...++.|-.|+|+.|..-.+ .+|.... .++.|.. T Consensus 132 ~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~-~~~~~~~~~lqliepd~l~~~~n~~~~~~~~i~~GIe~d~~Gr~~aY~i 210 (505) T protein:vir:96 132 VTGRYHFVTLLHLWMETLARDGEVLVREHR-GYPNKWGYALQILECDRLDLNYNADLQNGNRIRMSIELDAWERPVAYHL 210 (505) T ss_pred eeccCCHHHHHHHHHHHHhhCCceEEEEee-cCCCCcceEEEEechhhcCCCCCcccCCcCeEEeceEECCCCceEEEEE Confidence 235568999999999999998887776655 34445677888899998743211 1221110 1223332 Q ss_pred Ee-C-C----------ceEEEechhHeeeecccCcCCCCCCCccccHHHHHHHHH---HHHHHHHHHHHHHHhcCCCcce Q lcl|NC_019511. 226 VI-D-K----------QVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKEF---IAYNNTESFNDRFFSHGGTTRG 290 (330) Q Consensus 226 ~~-~-~----------~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I---~~~laae~~~~~fF~nGa~p~G 290 (330) .. + | .....+++++|+|+-..-+.+ ..-|+|.+..++..+ .-...++..+++.- |.-.+ T Consensus 211 ~~~hPgd~~~~~~~~~~~~~rvpa~~vlH~f~~~r~g---Q~RGis~lapvl~~l~~l~~y~dael~~a~i~---A~~a~ 284 (505) T protein:vir:96 211 LVNHPGDNSYCYHYAGQTYERVPADEIIHTFVPWRPH---QNRGIPWTHASMVELHHIGEYRKSEMIAAELG---AKKVG 284 (505) T ss_pred eecCCCccccccccccccccccCHhHhhhhhcccCCc---cccCcchHHHHHHHHHHHhHHHHHHHHHHHHh---hhhee Confidence 21 1 1 112346789999985433222 223888776665554 44555555555432 23335 Q ss_pred EEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 291 ILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 291 iL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) +++.+.+. ..+.. .+.+....... +.|.++.|.. T Consensus 285 fi~~~~~~-~~~~~----~~~~~~~~~~l-~pG~i~~L~p 318 (505) T protein:vir:96 285 FYEQDPEA-YDQPP----EDDQGEIVEEV-EAGTYQLLPY 318 (505) T ss_pred eeecCCcc-CCCcc----ccccCcccccc-CCceeeecCC Confidence 55533221 11110 01111222222 2445555543 No 134 >protein:vir:78161 Length: 355 # NCBI annotation: hypothetical protein # Family: family:all:2372 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294798;genbank:gi:149882819;genbank:GeneID:5309189 Probab=95.36 E-value=0.0022 Score=35.12 Aligned_cols=142 Identities=16% Similarity=0.075 Sum_probs=79.0 Q ss_pred eeEEEEecCCC-cceEEEEeeCCCceE-EeeCCCCcccCCceeEEEEe-CCceEEEechhHeeeecccCcCCCCCCCccc Q lcl|NC_019511. 182 NFEKVFSPKNK-TKMEKFIAVDPSTIF-YATDKNGKIIKGGNRFVQVI-DKQVVASFTSRELVMGIRNPRSDLNSSGYGL 258 (330) Q Consensus 182 ~~~~v~~rd~~-G~~~~L~pldp~tV~-~~~d~~G~~~~~~~~Y~q~~-~~~~~~~~~~~dvih~~~n~~~d~~~~~yGl 258 (330) ..|++|.++++ -.|..|.+++|.++. ...+++|.+ ..+.+.. .|.....+.....|+.++...++ ++||. T Consensus 1 v~Eivw~~~~g~~~~~~l~~r~~~~~~~f~~~~~~~l----~~~~~~~~~g~~~~~lp~~kfi~~~~~~~~g---~p~G~ 73 (355) T protein:vir:78 1 MFEQVYRIENGRARLGKLAWRPPRTISRFDVAPDGGL----VAIEQWGVFGKATVRIPVDRLVVFVNEREGA---NWLGQ 73 (355) T ss_pred CeEEEEEeeCCeEEEeeeeecCccceeeeeeccCCce----eEEEecCCCCCCcceeccCCEEEEEeCCCCC---Cccch Confidence 78999977542 357788888888765 333444332 2333432 23333455666666555544443 57899 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCC-----------CHHHHHHHHHHHHHHhcCcccccccce Q lcl|NC_019511. 259 SEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQ-----------SQHALENFKREWKSSFSGINGSWQICL 327 (330) Q Consensus 259 SPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~l-----------s~e~~e~lr~~w~~~~~G~~na~kvpv 327 (330) |.+..|....-.-....++-+.|-..-+.|-=+...|.+... +++..+.+..-.++...|...++=+|- T Consensus 74 gLlr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~~~~~~~~~~d~~~~~~~~~~~~~~l~~~~~~i~~g~~a~~iip~ 153 (355) T protein:vir:78 74 SLLRQAYKNWLLKDRFLRIQALVGERNGLGVPIYQGAPLPEAIARDTARAEQWLNDQKEEGLQLAKEFRAGEAAGGYIPH 153 (355) T ss_pred hhHHHHHHHHHHHHhhHHHHHHHHHHcCCCceEEEecCCCCcccchhhhHHHHHHHHHHHHHHHHHHhhCCcceeEeecC Confidence 999999999888888888888887764333333333432211 233344455555544445321111111 Q ss_pred -----eeC Q lcl|NC_019511. 328 -----YIK 330 (330) Q Consensus 328 -----L~e 330 (330) ++| T Consensus 154 g~~ie~~e 161 (355) T protein:vir:78 154 GANFTLTG 161 (355) T ss_pred CceEEEee Confidence 222 No 135 >protein:vir:106716 Length: 698 # NCBI annotation: gp18 # Family: family:all:297 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944326;genbank:gi:38638625;genbank:GeneID:2657345 Probab=95.19 E-value=0.0026 Score=34.77 Aligned_cols=290 Identities=13% Similarity=0.115 Sum_probs=131.8 Q ss_pred CchhHHHHHhcCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhcccccc-ccccCCCCCcCCCccc Q lcl|NC_019511. 1 MPDLFKSLRLGSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLE-MMDTNPDYRDKKSYMR 79 (330) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~-~~~~~p~~~~~~s~~r 79 (330) -+|.= |.+-.+-.+.-+..-|.. +.++-+..+ +.... ......+-..|..+|.. +|..-+ |..-..+ T Consensus 43 ~~~~~---~~~~~~~~~~~~~~~~~~---~~~~~~~~~--~~~~~-~~~~~~~~~~~~~~~~~~~~~~l~-~~~~~~F-- 110 (698) T protein:vir:10 43 PADMG---RRGALNALDAAPVAEPSP---SLRLARQFE--VDVSN-YTPRERRAASYALDFNGTSMDALS-FVTSSGF-- 110 (698) T ss_pred chhhc---ccccccccccccccCCCc---cccccccce--ecccc-CCccccchhhhhhcccccccccch-hhhccCc-- Confidence 22211 222222222222222222 333322222 11100 11111111223333322 222222 2221111 Q ss_pred chHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecc-----cccceeeeccCCCcccChhhHHHHHHHHHHHHhcc Q lcl|NC_019511. 80 NAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSE-----KGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTG 154 (330) Q Consensus 80 ~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~-----~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~ 154 (330) .+|-.|-.+|..|-.+.|+.++++...+- |.-.... ...|+.+.- ..-+..+.+++++|+..++++ T Consensus 111 ---~Gy~~la~laQ~~eyr~~~~~ia~e~~R~-w~~~~~~~~e~~~~~g~~~~~----~~~~~~d~dqi~~L~~e~erl- 181 (698) T protein:vir:10 111 ---PGFPTLVLLAQLPEYRAMHEVLADECIRT-WGEAIGGTKEKADTSGLAAGG----NAASTSDGDQLKQINDEIERL- 181 (698) T ss_pred ---chHHHHHHHhhccchhhHHHHHHHHhhcc-cceeccccchhhhhhcccccc----cccccccHHHHHHHHHHHHHH- Confidence 26788999999999999999999887652 3211110 011222111 111223446778887777666 Q ss_pred CCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEec---------------CCCcceEEEEeeCCCceEEeeC----CCCc Q lcl|NC_019511. 155 TDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSP---------------KNKTKMEKFIAVDPSTIFYATD----KNGK 215 (330) Q Consensus 155 ~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~r---------------d~~G~~~~L~pldp~tV~~~~d----~~G~ 215 (330) ...+-+...+.-.-+||-+.+++.+.- -++|....|..+||.-|.+..- +.+. T Consensus 182 --------~V~~~l~eai~~aRlfGGa~~~i~I~gdd~~l~~PL~~~~~~I~kGslKGL~ViDp~~vtP~~~n~~dP~sp 253 (698) T protein:vir:10 182 --------RIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSINPVAD 253 (698) T ss_pred --------HHHHHHHHHHHhcccccceEEEEEeecCccccccccccccccccCccceeeeeecccccccchhhhccchhh Confidence 333333333344445665656664422 2245566688888887766321 1111 Q ss_pred ccCCceeEEEEeCCceEEEechhHeeeecccCcCCCCCCCc---cccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEE Q lcl|NC_019511. 216 IIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSDLNSSGY---GLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGIL 292 (330) Q Consensus 216 ~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~~~~~y---GlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL 292 (330) .+-.+ .|+++. |+. +-++-++.+...|.+|.....| |+|-++.+.+.|..+..+......+... .+.+++. T Consensus 254 dfgkP-~~y~V~-G~~---IH~SRL~~~vg~pvpd~LKp~y~f~G~Sv~q~~~e~V~~~~rT~~~v~~Li~~-~~~~~l~ 327 (698) T protein:vir:10 254 DFYKP-STWWMI-GSE---VHATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQ-FSVSGIL 327 (698) T ss_pred ccCCC-ceEEEe-cce---ecceeEEEecCCCchhhhcchhccCCccHHHHHHHHHHHHHHHhhhHHHHHHH-hhHHHHH Confidence 11111 244443 442 3344555566667777755555 9999999999998887776666655433 2222221 Q ss_pred EeCCCCCCCHHHH--HHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 293 QIRADQQQSQHAL--ENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 293 ~~~~~~~ls~e~~--e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) . .-.+.++.... -..|-++-+++.+ |-| .+|++ T Consensus 328 ~-dla~aL~~g~~~~l~~R~eli~~~Rs--n~G--~~llD 362 (698) T protein:vir:10 328 M-DLAQALTPGANVDLSMRAELINRYRD--NRN--ILFLD 362 (698) T ss_pred H-HHHHhcCChhhHHHHHHHHHHHHhcC--ccc--eEEEe Confidence 0 00001111111 1223455566655 333 34445 No 136 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=95.07 E-value=0.0028 Score=34.56 Aligned_cols=280 Identities=10% Similarity=0.031 Sum_probs=115.8 Q ss_pred ccCccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCC-CcccchHHHHHHH----HHHh-hc Q lcl|NC_019511. 21 LMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKK-SYMRNAHNLHEVL----KKFG-NN 94 (330) Q Consensus 21 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~-s~~r~~~~~~~~L----r~~a-~~ 94 (330) ..+|.- ..+..++- ..+... -..|..-....-..-..|.+.. ++-.......+.| |.+. +| T Consensus 1 ~~~p~~----~~~~~~~~-----~~~~~~----~~~y~~~a~~~~~~~~~w~p~~~s~~~~~~~~~~~lr~RaRdl~rNn 67 (533) T protein:vir:34 1 MKTPTI----PTLLGPDG-----MTSLRE----YAGYHGGGSGFGGQLRSWNPPSESVDAALLPNFTRGNARADDLVRNN 67 (533) T ss_pred CCCchh----hhhhcccc-----cchHHH----HHhhhhccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcC Confidence 001100 00000000 000000 0111110000000011344332 2111111223344 4444 36 Q ss_pred HHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCC--cccChhhHHH-HHHHHHHHHhccCC-----CCCCcCCHHH Q lcl|NC_019511. 95 SILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLD--ATPGIKEKEQ-MKRIEEFILNTGTD-----KDIDRDSFQE 166 (330) Q Consensus 95 ~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~--~~~~~~~~~~-~~~i~~~l~~~~~~-----~pn~~~s~~~ 166 (330) +++..+|+.+.+.|- |-|+.+..+-.. -..+.++.++ -++++..+.....+ ....+.+|.+ T Consensus 68 ~~a~~av~~~~~nvV-----------G~Gi~~~~~p~~~~lg~~~~~~~~~~~~ie~~w~~w~~~~~~~~D~~g~~~f~~ 136 (533) T protein:vir:34 68 GYAANAIQLHQDHIV-----------GSFFRLSHRPSWRYLGIGEEEARAFSREVEAAWKEFAEDDCCCIDVERKRTFTM 136 (533) T ss_pred hHHHHHHHHHHHHhh-----------CCCceeeeccchhhcCCChhHHHHHHHHHHHHHHHhhcCccceeccccccCHHH Confidence 999999999888874 346665543110 0111122222 23444444433222 1234568999 Q ss_pred HHHHHHHHHHhcCCceeEEEEecCCC-cceEEEEeeCCCceEEeeC-CCCcccC---------CceeEEEEeC--Cce-- Q lcl|NC_019511. 167 FCKKIVRDTYTYDQVNFEKVFSPKNK-TKMEKFIAVDPSTIFYATD-KNGKIIK---------GGNRFVQVID--KQV-- 231 (330) Q Consensus 167 fl~~~v~d~L~~g~g~~~~v~~rd~~-G~~~~L~pldp~tV~~~~d-~~G~~~~---------~~~~Y~q~~~--~~~-- 231 (330) +...+++.++.-|..++-+.+.+... ..++.|-.|+|+.|..-.+ .+|.... .++.|..... ++. T Consensus 137 ~q~l~~r~~~~dGE~f~~~~~~~~~g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~~~~~~~~ 216 (533) T protein:vir:34 137 MIREGVAMHAFNGELFVQATWDTSSSRLFRTQFRMVSPKRISNPNNTGDSRNCRAGVQINDSGAALGYYVSEDGYPGWMP 216 (533) T ss_pred HHHHHHHHHHhCCceEEEeeeccCCCCccceEEEEechhhcCCCCCCCCCCceEeeeEECCCCCeEEEEEeecCCCCccc Confidence 99999999999998888777665432 2356788899987753211 1221111 1234443321 111 Q ss_pred --------EEEechhHeeeecccCcCCCCCCCccccHHHHHHHHHHH---HHHHHHHHHHHHhcCCCcceEEEeCCCC-- Q lcl|NC_019511. 232 --------VASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKEFIA---YNNTESFNDRFFSHGGTTRGILQIRADQ-- 298 (330) Q Consensus 232 --------~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I~~---~laae~~~~~fF~nGa~p~GiL~~~~~~-- 298 (330) .....+++|||+-..-+.+- .-|+|.+..++..+.. ...++..+++. .+.-.+++..+.+. T Consensus 217 ~~~~~~~~~~~v~a~~VlH~f~~~r~gQ---~RGis~lapvl~~l~~l~~y~dael~~a~i---~A~~a~fi~~~~~~~~ 290 (533) T protein:vir:34 217 QKWTWIPRELPGGRASFIHVFEPVEDGQ---TRGANVFYSVMEQMKMLDTLQNTQLQSAIV---KAMYAATIESELDTQS 290 (533) T ss_pred cccceeeeeeccChhHeeeeccccCCCc---ccCCchHHHHHHHHHHHHHHHHHHHHHHHH---hhhheeeeecCCCccc Confidence 12245789999864433332 2388887766655433 33334333321 12222333321110 Q ss_pred -------CCCHHHHHHHHHHHHHH---hcCc---ccccccceeeC Q lcl|NC_019511. 299 -------QQSQHALENFKREWKSS---FSGI---NGSWQICLYIK 330 (330) Q Consensus 299 -------~ls~e~~e~lr~~w~~~---~~G~---~na~kvpvL~e 330 (330) ....+.-+.+....... +.|. =+.|.++.|.. T Consensus 291 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~p 335 (533) T protein:vir:34 291 AMDFILGANSQEQRERLTGWIGEIAAYYAAAPVRLGGAKVPHLMP 335 (533) T ss_pred ccccccCCCcccccccccccchhhhhccCcceeeccCceeeecCC Confidence 01111112222211110 1110 02344444444 No 137 >protein:vir:5665 Length: 511 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899604;genbank:gi:34419591;genbank:GeneID:2546036 Probab=94.85 E-value=0.0033 Score=34.16 Aligned_cols=287 Identities=8% Similarity=0.089 Sum_probs=133.2 Q ss_pred HHHHHhcCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCC----Ccccc Q lcl|NC_019511. 5 FKSLRLGSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKK----SYMRN 80 (330) Q Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~----s~~r~ 80 (330) ||.- .+.+....-.++.....+- .+++.-.|-.-.....-... ..+.+.... ...++ T Consensus 1 ~~~w----------------~~~de~~~~~~~~~~~~S~-~~p~~~DGa~~i~~~~~~~~--~~g~~~~~~~~~~~~~~~ 61 (511) T protein:vir:56 1 MKFW----------------TKEEEQDIQKIEKNPVRSF-SAPDNVDGAKEIHTNLLAPQ--LGHAIIPSDAQSEGTIPV 61 (511) T ss_pred CCCc----------------cchhhhhhhhhccCCcccc-cCCCCCCCceEEecccccce--ecceeccccccccCccch Confidence 1100 0111111000111110000 11111122211110000000 000111111 11223 Q ss_pred hHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCC Q lcl|NC_019511. 81 AHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDID 160 (330) Q Consensus 81 ~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~ 160 (330) ++..+.-|..|.+|-|..+|+.+.+++. ..+..+---++.+.+ -+.++...++|..--+.+.+++.... T Consensus 62 -~eLI~~YR~ma~~pEvd~Av~eIvne~i------v~d~~~~pV~l~ld~--~~~s~~iK~kI~eeF~~Il~ll~F~~-- 130 (511) T protein:vir:56 62 -KELIKSYRALAEYHEVDDAIQEIVDEAI------VYENDKEVVWLNLDN--TDFSENIKAKINEEFDRVVSLLQMRK-- 130 (511) T ss_pred -HHHHHHHHHHhhccchhhHHHHhhccee------EecCCCceEEEEecc--cCcchHHHHHHHHHHHHHHHHhccch-- Confidence 3455666888889999999999998874 222222233444532 33566666666554344444433221 Q ss_pred cCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeC-----CCCcccCCce--eEEEEeCC---- Q lcl|NC_019511. 161 RDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATD-----KNGKIIKGGN--RFVQVIDK---- 229 (330) Q Consensus 161 ~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d-----~~G~~~~~~~--~Y~q~~~~---- 229 (330) +..++ ++.-++-|..++-++++... | +.+|..|||..|+.+.. .+|....++. .|.|...+ T Consensus 131 --~~~~~----fR~WYVDgRi~fHkiid~k~-G-I~eLr~lDPr~i~~vr~i~~~~~~~~~v~~~~~ey~~Y~~~~~~~~ 202 (511) T protein:vir:56 131 --HGYKW----FRKWYVDSRIYFHKILDKDN-N-IIELRPLNPMKMELVREIQKETIDGVEVVKGTLEYYVYKQSDYKMP 202 (511) T ss_pred --hhhHH----HhhhhhcceEEEEEEecccc-c-eeehhhcCcccchhhhhhhcccccccccccceeeeeEecCCCcccC Confidence 22333 45556778888888887654 4 78999999998876532 2232222222 12232111 Q ss_pred ---------ceEEEechhHeeeecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCC Q lcl|NC_019511. 230 ---------QVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQ 300 (330) Q Consensus 230 ---------~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~l 300 (330) .....++.+.|.|.+.-...+....|+.+|-+..|...+......|.-.-=|==--|.-+=|.-+..+ +| T Consensus 203 ~~~~~~~~~~~~vkI~~daI~y~hSGL~d~~~~~g~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVG-nL 281 (511) T protein:vir:56 203 SWMSATNRAQTSFRIPKDAIVFAHSGLMRGCADDPYIIGYLDRAIKPANQLKMLEDALVIYRLARAPERRVFYVDVG-NL 281 (511) T ss_pred cccccccccccceeechhheeeecccceeccCCCCeeeccchhhhHHHHhhHHHHhhHHHHhhhccccceEEEEecC-CC Confidence 12356788999888765555556678889999999988877665554322221122333333333332 34 Q ss_pred CHHHHHHHHHHHHHHh----------cCcccccccceeeC Q lcl|NC_019511. 301 SQHALENFKREWKSSF----------SGINGSWQICLYIK 330 (330) Q Consensus 301 s~e~~e~lr~~w~~~~----------~G~~na~kvpvL~e 330 (330) .+...++.-+...+.| +.+.|..+..-++| T Consensus 282 Pk~KAeqYl~~iM~k~kNklVYDa~TGev~ddrk~msMlE 321 (511) T protein:vir:56 282 PTQKAQQYVNGIMQNVKNRVVYDTQTGQVKNTTNAMSMLE 321 (511) T ss_pred CchhHHHHHHHHHHhcCceEEEeccCceeccchhhhhhHh Confidence 4433333333333222 12333333333444 No 138 >protein:vir:78589 Length: 695 # NCBI annotation: NUDIX hydrolase # Family: family:all:297 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294854;genbank:gi:149882917;genbank:GeneID:5291060 Probab=94.26 E-value=0.0049 Score=33.23 Aligned_cols=290 Identities=13% Similarity=0.112 Sum_probs=132.0 Q ss_pred CchhHHHHHhcCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhcccccc-ccccCCCCCcCCCccc Q lcl|NC_019511. 1 MPDLFKSLRLGSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLE-MMDTNPDYRDKKSYMR 79 (330) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~-~~~~~p~~~~~~s~~r 79 (330) -+|.= |.+-.+-.+.-+..-|.. +.++-+..+ +.... ..+...+-..|..+|.. +|..-+ |..-..+ T Consensus 43 ~~~~~---~~~~~~~~~~~~~~~~~~---~~~~~~~~~--~~~~~-~~~~~~~~~~~~~~~~~~~~~~l~-~~~~~~F-- 110 (695) T protein:vir:78 43 PADMG---RRGALNALDAAPVAEPSP---SLRLARQFE--VDVSN-YTPRERRAASYALDFNGTSMDALS-FVTSSGF-- 110 (695) T ss_pred chhhc---ccccccccccccccCCCc---ccccceece--ecccc-CCccccchhhhhhcccccccccch-hhhccCc-- Confidence 22211 222222222222222222 333322222 11100 11111111223333322 222222 2221111 Q ss_pred chHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecc-----cccceeeeccCCCcccChhhHHHHHHHHHHHHhcc Q lcl|NC_019511. 80 NAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSE-----KGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTG 154 (330) Q Consensus 80 ~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~-----~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~ 154 (330) .+|-.|-.+|..|-.+.|+.++++...+- |.-.... ...|+.+.- ..-+..+.+++++|+..++++ T Consensus 111 ---~Gy~~la~laQ~~eyr~~~~~ia~e~~R~-w~~~~~~~~e~~~~~g~~~~~----~~~~~~d~dqi~~L~~e~erL- 181 (695) T protein:vir:78 111 ---PGFPTLVLLAQLPEYRAMHEVLADECIRT-WGEAIGGTKEKADTSGLAAGG----NAASTSDGDQLKQINDEIERL- 181 (695) T ss_pred ---chHHHHHHHhhccchhhHHHHHHHHhhcc-cceeccccchhhhhhcccccc----cccccccHHHHHHHHHHHHHH- Confidence 26788999999999999999999887652 3211110 011222111 111223446778887777666 Q ss_pred CCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEec---------------CCCcceEEEEeeCCCceEEeeC----CCCc Q lcl|NC_019511. 155 TDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSP---------------KNKTKMEKFIAVDPSTIFYATD----KNGK 215 (330) Q Consensus 155 ~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~r---------------d~~G~~~~L~pldp~tV~~~~d----~~G~ 215 (330) ...+-+...+.-.-+||-+.+++.+.- -++|....|..+||.-|.+..- +.+- T Consensus 182 --------~V~~~l~eaik~aRlfGGa~~~i~i~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~~dP~sp 253 (695) T protein:vir:78 182 --------RIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSINPVAD 253 (695) T ss_pred --------HHHHHHHHHHHhhccccceEEEEEeccCccccccccccccccccCcceeeeEeecccccccchhhhccchhh Confidence 333333333344445665656664422 2245566688889887766421 1111 Q ss_pred ccCCceeEEEEeCCceEEEechhHeeeecccCcCCCCCCCc---cccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEE Q lcl|NC_019511. 216 IIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSDLNSSGY---GLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGIL 292 (330) Q Consensus 216 ~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~~~~~y---GlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL 292 (330) -+-.+ .|+++. |.. +-++-++.+...|.+++....| |+|-++.+.+.|..++.+..-...+-.. .+..++. T Consensus 254 dfgkP-~~y~V~-G~k---IH~SRL~~f~g~plPd~LKp~y~~~GiSv~q~~~e~V~~~~rT~~~v~~Li~~-~~v~~lk 327 (695) T protein:vir:78 254 DFYKP-STWWMI-GTE---VHATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQ-FSVSGIL 327 (695) T ss_pred ccCCC-ceEEEe-ceE---EeeeeEEEecCCCchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHHHHh-hhhHHHH Confidence 11111 244443 432 3344455566667777654444 9999999999998888777666655443 2333331 Q ss_pred EeCCCCCCCHHHH--HHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 293 QIRADQQQSQHAL--ENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 293 ~~~~~~~ls~e~~--e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) . .-.+.+....- -..|-++-+++.+ |-| .+|++ T Consensus 328 ~-dla~~L~~g~~~~l~~R~eli~~~Rs--n~G--~~llD 362 (695) T protein:vir:78 328 M-DLAQALMPGANVDLSMRAELINRYRD--NRN--ILFLD 362 (695) T ss_pred H-HHHHhhcChhHHHHHHHHHHHHHhcC--ccc--eEEEe Confidence 1 00011111111 1223455566655 333 34445 No 139 >protein:vir:3648 Length: 695 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705643;genbank:gi:23752328;genbank:GeneID:955749 Probab=94.16 E-value=0.0052 Score=33.10 Aligned_cols=290 Identities=13% Similarity=0.118 Sum_probs=133.5 Q ss_pred CchhHHHHHhcCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhcccccc-ccccCCCCCcCCCccc Q lcl|NC_019511. 1 MPDLFKSLRLGSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLE-MMDTNPDYRDKKSYMR 79 (330) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~-~~~~~p~~~~~~s~~r 79 (330) -+||-+. +-.+-.+.-+.--|.. +.++-+..+ +.... ..+..-+-..|..+|.. +|..-+ |..-..+ T Consensus 43 ~~~~~~~---~~~~~~~~~~~~~~~~---~~~~~~~~~--~~~~~-~~~~~~~~~~~~~~~~~~~~~~l~-~~~~~~F-- 110 (695) T protein:vir:36 43 PADFARR---GALNALDAAPVVEPSP---SLRLARQFE--VDVSN-YTPRERRAASYALDFNGTSMDALS-FVTSSGF-- 110 (695) T ss_pred chhhhhc---ccccccccccccCCCc---ccccceece--ecccc-cCccccchhhhhhcccccccccch-hhhccCc-- Confidence 4566432 2222223222222222 333322222 11100 11111111223333322 222222 2221111 Q ss_pred chHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecc-----cccceeeeccCCCcccChhhHHHHHHHHHHHHhcc Q lcl|NC_019511. 80 NAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSE-----KGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTG 154 (330) Q Consensus 80 ~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~-----~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~ 154 (330) .+|-.|-.+|..|-.+.|+.++++...+- |.-.... ...|+.+.-. .-+..+.+++++|+..++++ T Consensus 111 ---~Gy~~la~laQ~~eyr~~~~~ia~e~~R~-w~~~~~~~~e~~~~~g~~~~~~----~~~~~d~dqik~L~~e~erL- 181 (695) T protein:vir:36 111 ---PGFPTLVLLAQLPEYRAMHEVLADECIRT-WGEAIGGTKEKADTSGLAAGGN----AASTSDGDQLKQINDEIERL- 181 (695) T ss_pred ---chHHHHHHHhhccchhhHHHHHHHHhhcc-cceecccchhhhhhcccccccc----ccccCchHHHHHHHHHHHHH- Confidence 26788999999999999999999887652 3211110 0112222111 11223445778887777666 Q ss_pred CCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEec---------------CCCcceEEEEeeCCCceEEeeC----CCCc Q lcl|NC_019511. 155 TDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSP---------------KNKTKMEKFIAVDPSTIFYATD----KNGK 215 (330) Q Consensus 155 ~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~r---------------d~~G~~~~L~pldp~tV~~~~d----~~G~ 215 (330) ...+-+...+.-.-+||-+.+++.+.- -++|....|..+||.-|.+..- +.+- T Consensus 182 --------~V~~~l~eaik~aRlfGGa~~~i~i~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~~dP~sp 253 (695) T protein:vir:36 182 --------RIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSINPVAD 253 (695) T ss_pred --------HHHHHHHHHHHhhccccceEEEEEeccCccccccccccccccccCcceeeeEeecccccccchhhhccchhh Confidence 333333333344445665666664422 2245666688889887766421 1111 Q ss_pred ccCCceeEEEEeCCceEEEechhHeeeecccCcCCCCCCCc---cccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEE Q lcl|NC_019511. 216 IIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSDLNSSGY---GLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGIL 292 (330) Q Consensus 216 ~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~~~~~y---GlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL 292 (330) -+-.+ .|+++. |.. +-++-++.+...|.+++....| |+|-++.+.+.|..++.+..-...+-.. .+..++. T Consensus 254 dfgkP-~~y~V~-G~k---IH~SRL~~f~g~plPd~LKp~y~~~GiSv~q~~~e~V~~~~rT~~~v~~Li~~-~~v~~lk 327 (695) T protein:vir:36 254 DFYKP-STWWMI-GTE---VHATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQ-FSVSGIL 327 (695) T ss_pred ccCCC-ceEEEe-ceE---EeeeeEEEecCCCchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHHHHh-hhHHHHH Confidence 11111 244443 432 3344455566667777655444 9999999999988887776666655433 2233321 Q ss_pred EeCCCCCCCHHHH--HHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 293 QIRADQQQSQHAL--ENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 293 ~~~~~~~ls~e~~--e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) . .-.+.+....- -..|-++-+++.+ |-| .+|++ T Consensus 328 ~-dla~aL~~g~~~~l~~R~eli~~~Rs--n~G--~~llD 362 (695) T protein:vir:36 328 M-DLAQALMPGANVDLSMRAELINRYRD--NRN--ILFLD 362 (695) T ss_pred H-HHHHhhcChhHHHHHHHHHHHHHhcC--ccc--eEEEe Confidence 1 00001111111 1223455566655 333 34445 No 140 >protein:vir:98265 Length: 524 # NCBI annotation: gp20 portal vertex of the head # Family: family:all:1036 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239198;genbank:gi:66391673;genbank:GeneID:3416367 Probab=90.65 E-value=0.02 Score=29.93 Aligned_cols=296 Identities=11% Similarity=0.095 Sum_probs=141.2 Q ss_pred CchhHHHHHhcCCCCCCcccccCccCcchhHHHHH-HHHHHHHhhcccchhccccchhcc---ccccccccCCCCCcCCC Q lcl|NC_019511. 1 MPDLFKSLRLGSMYKEDTEDLMVPIDDGIQANIRQ-IEQDTKEMQEITKSLYGKQQAYAE---PFLEMMDTNPDYRDKKS 76 (330) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~g~~~~~~~---~~~~~~~~~p~~~~~~s 76 (330) |++||--|+..+...+.... +..+ +..+..+-. +++...|....-.. .....+... .+.-.-. T Consensus 3 ~~~~~~~l~~~~~~~~~d~~-----------~~~~~~~~~~~s~~-~p~~~dGa~~i~~~~~~~~~~g~~~~-~y~~~e~ 69 (524) T protein:vir:98 3 FLGFGNVLSFFKNFAREDEI-----------ELEQQLKNDTGSVA-PPKNNDGAYEIETDLNNQKYAGVFQQ-FYSGQDP 69 (524) T ss_pred CcchhhHHHHhhhhhhhhhh-----------hHhhhhcCCccccc-CCCCCCCceeecCCCCcceecceeee-ecccccc Confidence 88888877766665433111 0100 111111111 11112222111100 001111000 1111112 Q ss_pred cccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCC Q lcl|NC_019511. 77 YMRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTD 156 (330) Q Consensus 77 ~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~ 156 (330) ..+|-....+.-|..|.+|-|..+|+.+.+++. +.+..+---++.+. +-+.++...++|..--+.+.+++.. T Consensus 70 ~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneaI------v~~~~~~pV~l~L~--~~~~s~~iK~kI~eeF~~Il~ll~F 141 (524) T protein:vir:98 70 AIQNKEQLINTYRGIMSYPEVENAVSEIIDDAI------VNEQGKDIITMDLA--KTNFSKAIQDKIVEEFDNVLNIYDF 141 (524) T ss_pred ccchHHHHHHHHHHHhhccchhhHHHhhhccee------EecCCCceEEEEec--ccccchHHHHHHHHHHHHHHHHhcc Confidence 235666666777888889999999999998873 22332223344453 3346666666666544444444332 Q ss_pred CCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEee------CCCCcccCCce--eEEEEe- Q lcl|NC_019511. 157 KDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYAT------DKNGKIIKGGN--RFVQVI- 227 (330) Q Consensus 157 ~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~------d~~G~~~~~~~--~Y~q~~- 227 (330) .. ...++ ++.-++-|..|+-++++.+..--+.+|..|||..|+.+. .+.|....++. .|+|.. T Consensus 142 ~~----~~~~~----fR~WYVDgRi~fhkiid~~~~kGI~ELr~lDPr~i~~vr~~~~~~~~~~~~v~~~~~e~f~Y~~~ 213 (524) T protein:vir:98 142 DN----MGARL----FRDWYVDSRIYFHKIMHKDESKGIRELRQLDPRCMELIRESITETLDGGVKVFRGYREFFVYSAP 213 (524) T ss_pred ch----hhhHH----HhhhhhcceeEEEEEEcCCCCcceeeeeeeCCccceeeeeccccccccchhhccceeeeeeeccC Confidence 22 22233 455567788999999876655348999999999987653 12231111221 233321 Q ss_pred ------------CCceEEEechhHeeeecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeC Q lcl|NC_019511. 228 ------------DKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIR 295 (330) Q Consensus 228 ------------~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~ 295 (330) .++ ...++.+-|+|.+..-.+ .+ .++ +|=+..|...+......|.-.-=|==--|.-+=|.-+. T Consensus 214 ~~~~~~~g~~~~~~~-~ikI~~dAIvy~hSGL~d-~~-~~i-isyLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYID 289 (524) T protein:vir:98 214 KAGYTYNGQIYQANQ-KIKIPRSAIVYAHSGLED-CS-NNI-IGYLHRAVKPANQLRLLEDAMVIYRITRAPERRVFYID 289 (524) T ss_pred CCccccccceecCCC-ceeechhheeeeccCccc-CC-CCe-eeehhHhhHhHHhhHHHHhhHHHHhhhccccceEEEEe Confidence 112 245778888887654332 22 222 46677787777665555543222111223333344344 Q ss_pred CCCCCCHHHHHHHHHHHHHHhc---------C-cccccccceeeC Q lcl|NC_019511. 296 ADQQQSQHALENFKREWKSSFS---------G-INGSWQICLYIK 330 (330) Q Consensus 296 ~~~~ls~e~~e~lr~~w~~~~~---------G-~~na~kvpvL~e 330 (330) .+ +|.+...++.-+.....|. | +.|..+..-++| T Consensus 290 vG-nlPk~KAeqYl~~im~k~kNklvYDa~TGevrddrk~msMlE 333 (524) T protein:vir:98 290 VG-QMGGNKATQYVNNIAQGLKNRVVYDARTGTVKNQQNNLSMTE 333 (524) T ss_pred cC-CCCchhHHHHHHHHHHhcCceeEeeccCceeeccccccchhh Confidence 33 4544444444444444443 1 112222333333 No 141 >protein:vir:101541 Length: 694 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958122;genbank:gi:41057668;genbank:GeneID:2716798 Probab=89.38 E-value=0.027 Score=29.21 Aligned_cols=290 Identities=14% Similarity=0.120 Sum_probs=129.8 Q ss_pred CchhHHHHHhcCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccc-cchhcccccc-ccccCCCCCcCCCcc Q lcl|NC_019511. 1 MPDLFKSLRLGSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGK-QQAYAEPFLE-MMDTNPDYRDKKSYM 78 (330) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~-~~~~~~~~~~-~~~~~p~~~~~~s~~ 78 (330) |+--|. +.+..+..+.-+..-|. .+.++.+..+ +.-+.-..+++ -..|...|.. +|..-+ |..-..+ T Consensus 41 ~~~~~~--~~~~~~~~~~~~~~~~~---~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~l~-~~~~~~F- 109 (694) T protein:vir:10 41 VPADFA--RRGALNALDAAPVAEPS---PSLRLARQFE----VDVSNYTPRERRAASYALDFNGTSMDALS-FVTSSGF- 109 (694) T ss_pred ccCCcc--ccccchhhcccccCCCC---cchhhhhhcc----ccccCCCccccchhhhhhccCcccccchh-hhhccCc- Confidence 111111 01111111111101111 1333322222 11111111111 1123333321 222221 2211111 Q ss_pred cchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecc-----cccceeeeccCCCcccChhhHHHHHHHHHHHHhc Q lcl|NC_019511. 79 RNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSE-----KGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNT 153 (330) Q Consensus 79 r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~-----~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~ 153 (330) .+|-.|-.+|..|-.+.|+.++++...+- |.-.... ...|+.+.- ..-+..+.+++++|+..++++ T Consensus 110 ----~Gy~~la~laQ~~eyr~~~~~ia~e~~R~-w~~~~~~~~e~~~~~g~~~~~----~~~~~~d~dqi~~L~~e~erl 180 (694) T protein:vir:10 110 ----PGFPTLVLLAQLPEYRAMHEVLADECIRT-WGEAIGGTKEKADTSGLAAGG----NAASTSDGDQLKQINDEIERL 180 (694) T ss_pred ----chHHHHHHHhhccchhhHHHHHHHHhhcc-cceeccccchhhhhhcccccc----cccccccHHHHHHHHHHHHHH Confidence 26788999999999999999999887652 3211110 011222111 111223446778887777666 Q ss_pred cCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEec---------------CCCcceEEEEeeCCCceEEeeC----CCC Q lcl|NC_019511. 154 GTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSP---------------KNKTKMEKFIAVDPSTIFYATD----KNG 214 (330) Q Consensus 154 ~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~r---------------d~~G~~~~L~pldp~tV~~~~d----~~G 214 (330) ...+-+...+.-.-+||-+.+++.+.- -++|....|..+||.-|.+..- +.+ T Consensus 181 ---------~V~~~l~eaik~aRlfGGa~~~i~I~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~~dP~s 251 (694) T protein:vir:10 181 ---------RIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSINPVA 251 (694) T ss_pred ---------HHHHHHHHHHHhhccccceEEEEEeecCccccccccccccccccCcceeeeEeecccccccchhhhccchh Confidence 333333333344445666666664422 2245566688889887766421 111 Q ss_pred cccCCceeEEEEeCCceEEEechhHeeeecccCcCCCCCCCc---cccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceE Q lcl|NC_019511. 215 KIIKGGNRFVQVIDKQVVASFTSRELVMGIRNPRSDLNSSGY---GLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGI 291 (330) Q Consensus 215 ~~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~n~~~d~~~~~y---GlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~Gi 291 (330) --+-.+ .|+++. |.. +-++-++.+...|.+++....| |+|-++.+.+.|..++.+..-...+-.. .+..++ T Consensus 252 pdfgkP-~~y~V~-G~~---IH~SRL~~f~g~plPd~LKp~y~~~G~Sv~q~~~e~V~~~~rT~~~v~~Li~~-~~v~~l 325 (694) T protein:vir:10 252 DDFYKP-STWWMI-GTE---VHATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQ-FSVSGI 325 (694) T ss_pred hccCCC-ceEEEe-ceE---EeeeeEEEecCCCchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHHHHh-hhhHHH Confidence 111111 244443 432 3344455566667777654444 9999999999998888776666655443 233333 Q ss_pred EEeCCCCCCCHHHH--HHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 292 LQIRADQQQSQHAL--ENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 292 L~~~~~~~ls~e~~--e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) .. .-.+.+....- -..|-++-+++.+ |-| .+|++ T Consensus 326 k~-dla~~L~~g~~~~l~~R~eli~~~Rs--n~G--~~llD 361 (694) T protein:vir:10 326 LM-DLAQALMPGANVDLSMRAELINRYRD--NRN--ILFLD 361 (694) T ss_pred HH-HHHHhhcChhHHHHHHHHHHHHHhcC--ccc--eEEEe Confidence 11 00011111111 1223455566655 333 34445 No 142 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=89.37 E-value=0.027 Score=29.21 Aligned_cols=273 Identities=14% Similarity=0.149 Sum_probs=91.2 Q ss_pred CchhHHHHHh------cCC--CCCCcccccCccCcchhHHHH-HHHHHHHHhhcccchhccccchhccccccccccCCCC Q lcl|NC_019511. 1 MPDLFKSLRL------GSM--YKEDTEDLMVPIDDGIQANIR-QIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDY 71 (330) Q Consensus 1 ~~~~~~~~~~------~~~--~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~ 71 (330) .-|+--++.. ... ++....+.....+ ++...++ +.....-++.+-..=-.|+......+....-..+|.. T Consensus 10 ~~~~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~~-~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ 88 (511) T protein:vir:99 10 DTDLRGNINYLFNDEANVVYTYDGTESDLLQNVN-EVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADN 88 (511) T ss_pred hhhhhhhhhhhhhhhhCCccccchhhhhhhccHH-HHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcccccccCcc Confidence 2222222211 111 1111111000000 1111111 1111111111111111233222111100000001111 Q ss_pred CcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHH Q lcl|NC_019511. 72 RDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFIL 151 (330) Q Consensus 72 ~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~ 151 (330) ++. ......|+++.+.=+ +..+.++. .. ++ +..+.+.+++. T Consensus 89 ki~-------------------~n~~k~Iv~~~~~yl--~g~p~~~~---------~~--d~-------~~~~~l~~~~~ 129 (511) T protein:vir:99 89 RVA-------------------HDYASYISDFINGYF--LGNPIQYQ---------DD--DK-------DVLEAIEAFND 129 (511) T ss_pred eee-------------------cchHHHHHHHHHhhh--cccCceee---------cC--ch-------HHHHHHHHHHh Confidence 110 233344444433322 12222211 11 11 12334444442 Q ss_pred hccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCC-cccCCceeEEEEe--C Q lcl|NC_019511. 152 NTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNG-KIIKGGNRFVQVI--D 228 (330) Q Consensus 152 ~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G-~~~~~~~~Y~q~~--~ 228 (330) . .++......+..+++++|.+|.++.++.+ |+ +.+..++|..+.++.++.. ..+.-.++|++.. + T Consensus 130 ~---------n~~~~~~~~~~~~~~i~G~a~~~vy~ded--~~-~~i~~~~p~~~~~vyd~~~~~~~~~~vr~~~~~~~~ 197 (511) T protein:vir:99 130 L---------NDVESHNRSLGLDLSIYGKAYELMIRNQD--DE-TRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPID 197 (511) T ss_pred h---------cCHhHHHHHHHHHHHhcCeeEEEEEeCCC--Cc-eEEEEEccceeEEEEcCCCCCceEEEEEEEEeeecc Confidence 1 14666777788999999988887765544 44 4677899999888776542 1122233443321 1 Q ss_pred Cc------eEEEechhHeeeecccC----------------------cCCCCCCCccccHHHHH---HHHHHHHHHHHHH Q lcl|NC_019511. 229 KQ------VVASFTSRELVMGIRNP----------------------RSDLNSSGYGLSEVEIA---MKEFIAYNNTESF 277 (330) Q Consensus 229 ~~------~~~~~~~~dvih~~~n~----------------------~~d~~~~~yGlSPIe~a---~~~I~~~laae~~ 277 (330) ++ ....++++.+.+...+. .-.+.....|.|.++-. ..++...++--.- T Consensus 198 ~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~ 277 (511) T protein:vir:99 198 KTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTAN 277 (511) T ss_pred cCccceEEEEEEEeCCcEEEEEecCCccccccccccccccCCCCccceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHH Confidence 11 11234555554432211 10111122466655544 3444433333322 Q ss_pred HHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHH---HHHHh-------cCccccccccee-eC Q lcl|NC_019511. 278 NDRFFSHGGTTRGILQIRADQQQSQHALENFKRE---WKSSF-------SGINGSWQICLY-IK 330 (330) Q Consensus 278 ~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~---w~~~~-------~G~~na~kvpvL-~e 330 (330) ...+|++ |- +.+.|....+.+....++.. |.... .+....+.+--| -+ T Consensus 278 ~~~~~~~---~~--lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~ 336 (511) T protein:vir:99 278 YMSDLND---AM--LLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQ 336 (511) T ss_pred HHHHhhc---hh--hhhccCcccCchhhcccccccceecccccccccccccCCCCcceeEEeec Confidence 2344443 22 21222222233332222211 10000 000001111101 00 No 143 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=88.62 E-value=0.031 Score=28.84 Aligned_cols=273 Identities=14% Similarity=0.142 Sum_probs=94.2 Q ss_pred CchhHHHHHh------cCCCC--CCcccccCccCcchhHHH-HHHHHHHHHhhcccchhccccchhccccccccccCCCC Q lcl|NC_019511. 1 MPDLFKSLRL------GSMYK--EDTEDLMVPIDDGIQANI-RQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDY 71 (330) Q Consensus 1 ~~~~~~~~~~------~~~~~--~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~ 71 (330) -.|+-.++.. .+.+. ....+..... +.+...+ ++...+.-++++-..=-.|+......+-...-..+|.. T Consensus 10 ~~~~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~-~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~ 88 (511) T protein:vir:78 10 DTDLRGNINYLFNDEANVVYTYDGTESDLLQNV-NEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADN 88 (511) T ss_pred hhhhhhhhhhhhhhhhCCcccccchhhhhhcCH-HHHHHHHHHHHHhhhHHHHHHHHHhhccCccccccCcccccccCcc Confidence 3333333322 11111 1111100000 1111111 11111111122211112233222111100000001111 Q ss_pred CcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHH Q lcl|NC_019511. 72 RDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFIL 151 (330) Q Consensus 72 ~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~ 151 (330) ++. ......|+++.+.-+ +..+.++ ... ++ +....+.+++. T Consensus 89 ki~-------------------~n~~k~Iv~~~~~yl--~g~p~~~---------~~~--d~-------~~~~~l~~~~~ 129 (511) T protein:vir:78 89 RVA-------------------HDYASYISDFINGYF--LGNPIQY---------QDD--DK-------DVLEAIEAFND 129 (511) T ss_pred eee-------------------cchHHHHHHHHhhhh--cccCcee---------ecC--ch-------HHHHHHHHHHh Confidence 110 223334444433322 1222221 111 11 12233444432 Q ss_pred hccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCC-cccCCceeEEEEe--C Q lcl|NC_019511. 152 NTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNG-KIIKGGNRFVQVI--D 228 (330) Q Consensus 152 ~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G-~~~~~~~~Y~q~~--~ 228 (330) . ..+..+...+..+.+.+|.+|.++..+.+ |+ +.+..++|..+.++.++.. ..+.-.++|+++. + T Consensus 130 ~---------n~~~~~~~~~~~~~~~~G~a~~~vy~d~d--g~-~~i~~~~p~~~~~v~dd~~~~~~~~~vr~~~~~~~~ 197 (511) T protein:vir:78 130 L---------NDVESHNRSLGLDLSIYGKAYELMIRNQD--DE-TRLYKSDAMSTFIIYDNTVERNSIAGVRYLRTKPID 197 (511) T ss_pred h---------cChhHHHHHHHHHHHhcCeeEEEEEeCCC--Cc-eEEEEEcccceEEEEcCCCCCceEEEEEEEEeeecc Confidence 1 14666777788999999988777655444 54 4677899999888776543 1222334444332 1 Q ss_pred Cc------eEEEechhHeeeecccC----------------------cCCCCCCCccccHHHHHHHHHH---HHHHHHHH Q lcl|NC_019511. 229 KQ------VVASFTSRELVMGIRNP----------------------RSDLNSSGYGLSEVEIAMKEFI---AYNNTESF 277 (330) Q Consensus 229 ~~------~~~~~~~~dvih~~~n~----------------------~~d~~~~~yGlSPIe~a~~~I~---~~laae~~ 277 (330) ++ ....|+++.+.+...+. .-.+....+|.|-++-....|. ..++.-.- T Consensus 198 ~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~gd~e~v~~liDa~~~~~S~~~~ 277 (511) T protein:vir:78 198 KTDEDEVFTVDLFTSHGVYRYLTNRTNGLKLTPRENSFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTAN 277 (511) T ss_pred ccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccCcCcccceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHH Confidence 11 11234555554432211 1111112346665554444443 33333322 Q ss_pred HHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhc----------Ccccccccceee-C Q lcl|NC_019511. 278 NDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFS----------GINGSWQICLYI-K 330 (330) Q Consensus 278 ~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~----------G~~na~kvpvL~-e 330 (330) ...+|+ .|- +.+.|....+.+..+..+....-... +....+.+--|. + T Consensus 278 ~~~~~~---~~~--lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ 336 (511) T protein:vir:78 278 YMSDLN---DAM--LLIKGNLNLDPVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGGYIYKQ 336 (511) T ss_pred HHHHhh---cch--hheecCccCCchhhcccccccceeccccceeccccccCCCCcceeEEeec Confidence 334444 333 22234323344333332221100000 000011110110 0 No 144 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=88.62 E-value=0.031 Score=28.84 Aligned_cols=273 Identities=14% Similarity=0.142 Sum_probs=94.2 Q ss_pred CchhHHHHHh------cCCCC--CCcccccCccCcchhHHH-HHHHHHHHHhhcccchhccccchhccccccccccCCCC Q lcl|NC_019511. 1 MPDLFKSLRL------GSMYK--EDTEDLMVPIDDGIQANI-RQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDY 71 (330) Q Consensus 1 ~~~~~~~~~~------~~~~~--~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~ 71 (330) -.|+-.++.. .+.+. ....+..... +.+...+ ++...+.-++++-..=-.|+......+-...-..+|.. T Consensus 10 ~~~~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~-~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~ 88 (511) T protein:vir:96 10 DTDLRGNINYLFNDEANVVYTYDGTESDLLQNV-NEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADN 88 (511) T ss_pred hhhhhhhhhhhhhhhhCCcccccchhhhhhcCH-HHHHHHHHHHHHhhhHHHHHHHHHhhccCccccccCcccccccCcc Confidence 3333333322 11111 1111100000 1111111 11111111122211112233222111100000001111 Q ss_pred CcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHH Q lcl|NC_019511. 72 RDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFIL 151 (330) Q Consensus 72 ~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~ 151 (330) ++. ......|+++.+.-+ +..+.++ ... ++ +....+.+++. T Consensus 89 ki~-------------------~n~~k~Iv~~~~~yl--~g~p~~~---------~~~--d~-------~~~~~l~~~~~ 129 (511) T protein:vir:96 89 RVA-------------------HDYASYISDFINGYF--LGNPIQY---------QDD--DK-------DVLEAIEAFND 129 (511) T ss_pred eee-------------------cchHHHHHHHHhhhh--cccCcee---------ecC--ch-------HHHHHHHHHHh Confidence 110 223334444433322 1222221 111 11 12233444432 Q ss_pred hccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCC-cccCCceeEEEEe--C Q lcl|NC_019511. 152 NTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNG-KIIKGGNRFVQVI--D 228 (330) Q Consensus 152 ~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G-~~~~~~~~Y~q~~--~ 228 (330) . ..+..+...+..+.+.+|.+|.++..+.+ |+ +.+..++|..+.++.++.. ..+.-.++|+++. + T Consensus 130 ~---------n~~~~~~~~~~~~~~~~G~a~~~vy~d~d--g~-~~i~~~~p~~~~~v~dd~~~~~~~~~vr~~~~~~~~ 197 (511) T protein:vir:96 130 L---------NDVESHNRSLGLDLSIYGKAYELMIRNQD--DE-TRLYKSDAMSTFIIYDNTVERNSIAGVRYLRTKPID 197 (511) T ss_pred h---------cChhHHHHHHHHHHHhcCeeEEEEEeCCC--Cc-eEEEEEcccceEEEEcCCCCCceEEEEEEEEeeecc Confidence 1 14666777788999999988777655444 54 4677899999888776543 1222334444332 1 Q ss_pred Cc------eEEEechhHeeeecccC----------------------cCCCCCCCccccHHHHHHHHHH---HHHHHHHH Q lcl|NC_019511. 229 KQ------VVASFTSRELVMGIRNP----------------------RSDLNSSGYGLSEVEIAMKEFI---AYNNTESF 277 (330) Q Consensus 229 ~~------~~~~~~~~dvih~~~n~----------------------~~d~~~~~yGlSPIe~a~~~I~---~~laae~~ 277 (330) ++ ....|+++.+.+...+. .-.+....+|.|-++-....|. ..++.-.- T Consensus 198 ~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~gd~e~v~~liDa~~~~~S~~~~ 277 (511) T protein:vir:96 198 KTDEDEVFTVDLFTSHGVYRYLTNRTNGLKLTPRENSFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTAN 277 (511) T ss_pred ccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccCcCcccceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHH Confidence 11 11234555554432211 1111112346665554444443 33333322 Q ss_pred HHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhc----------Ccccccccceee-C Q lcl|NC_019511. 278 NDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFS----------GINGSWQICLYI-K 330 (330) Q Consensus 278 ~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~----------G~~na~kvpvL~-e 330 (330) ...+|+ .|- +.+.|....+.+..+..+....-... +....+.+--|. + T Consensus 278 ~~~~~~---~~~--lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ 336 (511) T protein:vir:96 278 YMSDLN---DAM--LLIKGNLNLDPVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGGYIYKQ 336 (511) T ss_pred HHHHhh---cch--hheecCccCCchhhcccccccceeccccceeccccccCCCCcceeEEeec Confidence 334444 333 22234323344333332221100000 000011110110 0 No 145 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=87.91 E-value=0.036 Score=28.52 Aligned_cols=275 Identities=14% Similarity=0.143 Sum_probs=93.0 Q ss_pred CchhHHHHHhcCCCCCCcccccCccC--------cchhHHH-HHHHHHHHHhhcccchhccccchhccccccccccCCCC Q lcl|NC_019511. 1 MPDLFKSLRLGSMYKEDTEDLMVPID--------DGIQANI-RQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDY 71 (330) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~-~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~ 71 (330) ..|+--++...=....+... ..+.. +.+...+ ++...+..++.+-..=-.|+......+-...-..+|.. T Consensus 10 ~~~~~~~~~~~~~~~~n~~~-~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ 88 (511) T protein:vir:96 10 DTDLRGNINYLFNDEANVVY-TYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADN 88 (511) T ss_pred hhhhhhhhhhhhhhhhCCcc-ccchhhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcCcccccCcc Confidence 33333333211111111100 11111 1111111 11111111222211111232222111100000001111 Q ss_pred CcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHH Q lcl|NC_019511. 72 RDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFIL 151 (330) Q Consensus 72 ~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~ 151 (330) ++ + ......|+++.+.-+ +..+.++ ...+ + +....+.+++. T Consensus 89 ki------------------~-~n~~k~Iv~~~~~yl--~g~p~~~---------~~~~--~-------~~~~~l~~~~~ 129 (511) T protein:vir:96 89 RV------------------A-HDYASYISDFINGYF--LGNPIQY---------QDDD--K-------DVLEAIEAFND 129 (511) T ss_pred ee------------------e-cchHHHHHHHHHhhh--ccCCcee---------ecCc--h-------HHHHHHHHHHh Confidence 11 0 223344444433322 1222222 1111 1 12233444432 Q ss_pred hccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCC-cccCCceeEEEEe--C Q lcl|NC_019511. 152 NTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNG-KIIKGGNRFVQVI--D 228 (330) Q Consensus 152 ~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G-~~~~~~~~Y~q~~--~ 228 (330) . .++......+..+++++|.+|.++.++.+ |+ +.+..++|..+.++.++.. ..+.-.++|+... + T Consensus 130 ~---------n~~~~~~~~~~~~~~i~G~a~~~vy~ded--~~-~~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~d 197 (511) T protein:vir:96 130 L---------NDVESHNRSLGLDLSIYGKAYELMIRNQD--DE-TRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPID 197 (511) T ss_pred h---------cCHHHHHHHHHHHHHhcCeeEEEEEeCCC--Cc-eEEEEEccceeEEEEcCCCCCceEEEEEEEEeeecc Confidence 1 24667778888999999988777665444 44 4677899999988766542 1122223443332 1 Q ss_pred Cc------eEEEechhHeeeeccc----------------------CcCCCCCCCccccHHHHHHHHHHHHHHH-HHHHH Q lcl|NC_019511. 229 KQ------VVASFTSRELVMGIRN----------------------PRSDLNSSGYGLSEVEIAMKEFIAYNNT-ESFND 279 (330) Q Consensus 229 ~~------~~~~~~~~dvih~~~n----------------------~~~d~~~~~yGlSPIe~a~~~I~~~laa-e~~~~ 279 (330) ++ ....++++.+.+.... |.-.+....+|+|-++-+...|...-.+ ..+ + T Consensus 198 ~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~g~gd~e~v~~liDa~d~~~S~~-~ 276 (511) T protein:vir:96 198 KTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDT-A 276 (511) T ss_pred ccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccCCceeeEEecCCCCCCCchhhhHHHHHHHHHHHHHH-H Confidence 11 0123444444433211 1111111234666665554444432222 222 2 Q ss_pred HHHhcCCCcceEEEeCCCCCCCHHHHHHHHHH---H-----HHHhcC--cccccccceee-C Q lcl|NC_019511. 280 RFFSHGGTTRGILQIRADQQQSQHALENFKRE---W-----KSSFSG--INGSWQICLYI-K 330 (330) Q Consensus 280 ~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~---w-----~~~~~G--~~na~kvpvL~-e 330 (330) +.+...+.|--+ +.|....+.++...++.. | .....| ....+++--|. + T Consensus 277 ~~~~~~~~~~lv--~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ 336 (511) T protein:vir:96 277 NYMSDLNDAMLL--IKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQ 336 (511) T ss_pred HHHHHhhCceee--eecCccCCchhhcccccccceecccccccccccccCCCCcceeEEeec Confidence 223333344322 233222333332222111 0 000000 01111111111 1 No 146 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=87.38 E-value=0.039 Score=28.30 Aligned_cols=276 Identities=15% Similarity=0.177 Sum_probs=94.7 Q ss_pred CchhHHHHHh------cCCC--CCCcccccCccCcchhHHHH-HHHHHHHHhhcccchhccccchhccccccccccCCCC Q lcl|NC_019511. 1 MPDLFKSLRL------GSMY--KEDTEDLMVPIDDGIQANIR-QIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDY 71 (330) Q Consensus 1 ~~~~~~~~~~------~~~~--~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~ 71 (330) ..|+--++.. ...+ +....+..... +++...++ +...+.-++.+-..=-.|+.+....+....-..++.. T Consensus 10 ~~~~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~-~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~ 88 (511) T protein:vir:93 10 DTDLRGNINYLFNDEANVVYTYDGTESDLLQNV-NEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADN 88 (511) T ss_pred hhhhhhhhhhhhhhhhCCcccccchhhhhhccH-HHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcCcccccCcc Confidence 3333333321 1111 11111100000 11111111 1111111222222212233222111100000000011 Q ss_pred CcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHH Q lcl|NC_019511. 72 RDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFIL 151 (330) Q Consensus 72 ~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~ 151 (330) + .. .+....|+++.+.-+ +..+.++ ... ++ +..+.+.+++. T Consensus 89 k------------------i~-~n~~k~Iv~~~~~yl--~g~p~~~---------~~~--d~-------~~~~~l~~~~~ 129 (511) T protein:vir:93 89 R------------------VA-HDYASYISDFINGYF--LGNPIQY---------QDD--DK-------DVLEVIEAFND 129 (511) T ss_pred e------------------ee-cchHHHHHHHHhhhh--cccCeee---------ccC--Ch-------HHHHHHHHHHh Confidence 1 00 233445554444322 1222221 111 11 12233444432 Q ss_pred hccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCC-cccCCceeEEEEe--C Q lcl|NC_019511. 152 NTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNG-KIIKGGNRFVQVI--D 228 (330) Q Consensus 152 ~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G-~~~~~~~~Y~q~~--~ 228 (330) . .++......+..+++.+|.+|..+..+.+ |+ +.+..++|..+.++.++.. ..+.-.++|++.. + T Consensus 130 ~---------n~~~~~~~~~~~~~~~~G~ay~~vy~de~--~~-~~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~~ 197 (511) T protein:vir:93 130 L---------NDVESHNRSLGLDLSIYGKAYELMIRNQD--DE-TRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPID 197 (511) T ss_pred h---------cCHhHHHHHHHHHHHhcCeeEEEEEeCCC--Cc-eEEEEEccceeEEEEcCCCCCceEEEEEEEEeeecc Confidence 1 24667778888999999988877665444 54 4577899999888766532 1122233444332 1 Q ss_pred Cc------eEEEechhHeeeeccc----------------------CcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019511. 229 KQ------VVASFTSRELVMGIRN----------------------PRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDR 280 (330) Q Consensus 229 ~~------~~~~~~~~dvih~~~n----------------------~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~ 280 (330) ++ ....|+.+.+.+.... |.-.+....+|.|-++-+...|...-.+..-.++ T Consensus 198 ~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~gd~e~v~~liDa~d~~~S~~~~ 277 (511) T protein:vir:93 198 KTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTAN 277 (511) T ss_pred ccccceEEEEEEEeCCcEEEEEecCCCccccccccccccccCCCccceEEecCCCCCCCchhhHHHHHHHHHHHHHHHHH Confidence 11 1123444444443211 1111111234666666555544433222211122 Q ss_pred HHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHh--------c--Ccccccccceee-C Q lcl|NC_019511. 281 FFSHGGTTRGILQIRADQQQSQHALENFKREWKSSF--------S--GINGSWQICLYI-K 330 (330) Q Consensus 281 fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~--------~--G~~na~kvpvL~-e 330 (330) .+...+.|--++ .|....+.+.....+....-.. . +....+.+-.|. + T Consensus 278 ~~~~~~~~~lv~--~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ 336 (511) T protein:vir:93 278 YMSDLNDAMLLI--KGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQ 336 (511) T ss_pred HHHHhhCcceee--ecCcccCchhhcccccccceecccccccccccccCCCCcceeEEeec Confidence 233333443222 3332233333333222111000 0 000011111111 0 No 147 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=84.95 E-value=0.056 Score=27.43 Aligned_cols=260 Identities=8% Similarity=0.044 Sum_probs=92.1 Q ss_pred CccCcc------hhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHH Q lcl|NC_019511. 23 VPIDDG------IQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSI 96 (330) Q Consensus 23 ~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~i 96 (330) ++.++. ++.-++++..+.-++.+-..=-.|++.. + -.| ...+ ..++ .++ ..+.. T Consensus 1 ~~~~~~~d~~~~i~~L~~~~~~~~~r~~~~~~Yy~g~~~i---~------~~~-~~~~-~~~~-------~~~--~~~n~ 60 (488) T protein:vir:23 1 MAETESIDPEKLRDQLLDAFENKQNELKSSKAYYDAERRP---D------AIG-LAVP-LDMR-------KYL--AHVGY 60 (488) T ss_pred CCcccCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcccch---h------hcC-cccc-hhhh-------hhh--hhcch Confidence 222221 2222333333333332222211222110 0 000 0111 1111 111 12455 Q ss_pred HHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHH Q lcl|NC_019511. 97 LNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTY 176 (330) Q Consensus 97 v~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L 176 (330) ...|+++.++.+-. .||.+-...........+.+....+.+++.. .++......+..+++ T Consensus 61 ~~~ivd~~a~~l~~-----------~Gf~~~~~~~~~~~~~~d~~~~~~l~~i~~~---------N~~~~~~~~~~~~a~ 120 (488) T protein:vir:23 61 PRTYVDAIAERQEL-----------EGFRIPSANGEEPESGGENDPASELWDWWQA---------NNLDIEATLGHTDAL 120 (488) T ss_pred HHHHHHHHHHhhhc-----------cceeccCCcccccccccchhHHHHHHHHHHh---------cChhHHHHHHHHHHh Confidence 66777777765431 2443322111111111111222334443321 146667777889999 Q ss_pred hcCCceeEEEEec------CCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceEE---EechhHee------ Q lcl|NC_019511. 177 TYDQVNFEKVFSP------KNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVVA---SFTSRELV------ 241 (330) Q Consensus 177 ~~g~g~~~~v~~r------d~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~~---~~~~~dvi------ 241 (330) ++|.+|..+..+. +..|.+ .+.+++|..+.+..++.-....-.++|++..+++.+. .|+.+.+. T Consensus 121 i~G~a~~~v~~~~~~~~~~~~~~~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~ 199 (488) T protein:vir:23 121 IYGTAYITISMPDPEVDFDVDPEVP-LIRVEPPTALYAEVDPRTRKVLYAIRAIYGADGNEIVSATLYLPDTTMTWLRAE 199 (488) T ss_pred hcCceEEEEecCCcccccCCCCCcc-eEEEeccceeEEEEecCCCceEEEEEEEEecCCCcEEEEEEEecCcEEEEEecC Confidence 9998876654322 222322 3667889888777664322222233444443333221 23333332 Q ss_pred -------------------eecccCcCCCCCCCccccHHH----HHHHHHHHHHHHHHHHHHHHhc------CCCcceEE Q lcl|NC_019511. 242 -------------------MGIRNPRSDLNSSGYGLSEVE----IAMKEFIAYNNTESFNDRFFSH------GGTTRGIL 292 (330) Q Consensus 242 -------------------h~~~n~~~d~~~~~yGlSPIe----~a~~~I~~~laae~~~~~fF~n------Ga~p~GiL 292 (330) +++.|+.. .+++|.|-|+ ....++...+.--.-...||+. |+.+...- T Consensus 200 ~~~~~~~~~~h~~g~vPvv~f~n~~~~---~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~ 276 (488) T protein:vir:23 200 GEWEAPTSTPHGLEMVPVIPISNRTRL---SDLYGTSEISPELRSVTDAAAQILMNMQGTANLMAIPQRLIFGAKPEELG 276 (488) T ss_pred CceEeccccccCCCCcceEEecccccc---CCcCCccchhhhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHhCCCccccc Confidence 33322221 1346877654 2233444333332222333332 22221110 Q ss_pred ------------------EeCCCC-----CC---C-HHHHHHHHHHHHHHhcCcccccccce-----eeC Q lcl|NC_019511. 293 ------------------QIRADQ-----QQ---S-QHALENFKREWKSSFSGINGSWQICL-----YIK 330 (330) Q Consensus 293 ------------------~~~~~~-----~l---s-~e~~e~lr~~w~~~~~G~~na~kvpv-----L~e 330 (330) ..+.+. ++ + +.-+++|+..... +++.. .+|. ..+ T Consensus 277 ~~~~~~~~~~~~~~~~v~~~~~g~~~~~~q~~~~~~~~~~~~l~~~i~~-~~~~~---~~p~~~~g~~~~ 342 (488) T protein:vir:23 277 INAETGQRMFDAYMARILAFEGGEGAHAEQFSAAELRNFVDALDALDRK-AASYS---GLPPQYLSSSSD 342 (488) T ss_pred ccccccchhhhhhhhhhccCCCCCCceeEecCCCChHHHHHHHHHHHHH-Hhccc---CCCHHHhccccC Confidence 000000 00 0 0111112211111 11110 1111 000 No 148 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=83.45 E-value=0.068 Score=26.97 Aligned_cols=276 Identities=14% Similarity=0.161 Sum_probs=94.8 Q ss_pred CchhHHHHHh------cCCCCCCcccccCccC-cchhHHH-HHHHHHHHHhhcccchhccccchhccccccccccCCCCC Q lcl|NC_019511. 1 MPDLFKSLRL------GSMYKEDTEDLMVPID-DGIQANI-RQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYR 72 (330) Q Consensus 1 ~~~~~~~~~~------~~~~~~~~~~~~~~~~-~~~~~~~-~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~ 72 (330) ..|+--++.. ...+.-...+.....+ +.+.-.+ .+...+.-++++-..=-.|+......+....-..+|..+ T Consensus 10 ~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~k 89 (511) T protein:vir:10 10 DTDLRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKCIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNR 89 (511) T ss_pred hhhhhhhhhhhhhhhhcCCccCchhhhhcccCHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcccccccCcce Confidence 3333333321 1211110000000000 1111111 111111122222211122333221111000000011111 Q ss_pred cCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHh Q lcl|NC_019511. 73 DKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILN 152 (330) Q Consensus 73 ~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~ 152 (330) + + ......|+++.+.-+ +..+.++ ... + . +....+.+++.. T Consensus 90 i------------------~-~n~~k~Iv~~~~~yl--~g~p~~~---------~~~--d-----~--~~~~~l~~~~~~ 130 (511) T protein:vir:10 90 V------------------A-HDYASYISDFINGYF--LGNPIQY---------QDD--D-----K--DVLEAIEAFNDL 130 (511) T ss_pred e------------------e-cchHHHHHHHHhhhh--cccCcee---------ecC--c-----h--HHHHHHHHHHhh Confidence 1 0 233444444433222 1222221 111 1 1 122334444321 Q ss_pred ccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCC-cccCCceeEEEEe--CC Q lcl|NC_019511. 153 TGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNG-KIIKGGNRFVQVI--DK 229 (330) Q Consensus 153 ~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G-~~~~~~~~Y~q~~--~~ 229 (330) .++......+..+++.+|.+|.++..+.+ |+ +.+..++|..+.++-++.. ..+.-.++|+... ++ T Consensus 131 ---------n~~~~~~~~~~~~~~i~G~ay~~vy~ded--g~-~~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~d~ 198 (511) T protein:vir:10 131 ---------NDVESHNRSLGLDLSIYGKAYEIMIRNQD--DE-TRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDK 198 (511) T ss_pred ---------cCHHHHHHHHHHHHHhcCeeEEEEEeCCC--Cc-eEEEEEccceeEEEEcCCCCCceEEEEEEEEeeeccc Confidence 14666777788999999988777655444 44 4677789999888776543 1122233443332 11 Q ss_pred c------eEEEechhHeeeeccc----------------------CcCCCCCCCccccHHHHHHHHHHHHHHH-HHHHHH Q lcl|NC_019511. 230 Q------VVASFTSRELVMGIRN----------------------PRSDLNSSGYGLSEVEIAMKEFIAYNNT-ESFNDR 280 (330) Q Consensus 230 ~------~~~~~~~~dvih~~~n----------------------~~~d~~~~~yGlSPIe~a~~~I~~~laa-e~~~~~ 280 (330) + ....|+++.+.+...+ |.-.+....+|.|-++-+...|.....+ ..+ ++ T Consensus 199 ~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~f~nn~~g~gd~e~v~~liDa~d~~~S~~-~~ 277 (511) T protein:vir:10 199 TDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDT-AN 277 (511) T ss_pred CccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccCcceeEEEecCCCCCCCchhhhHHHHHHHHHHHHHH-HH Confidence 1 0123455544443211 1101111224666665554444432222 222 22 Q ss_pred HHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHh------c----Ccccccccceee-C Q lcl|NC_019511. 281 FFSHGGTTRGILQIRADQQQSQHALENFKREWKSSF------S----GINGSWQICLYI-K 330 (330) Q Consensus 281 fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~------~----G~~na~kvpvL~-e 330 (330) .+...+.|--+ +.|....+.+....++..-.-.. . +....+.+--|. + T Consensus 278 ~~~~~~~~~lv--~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~ 336 (511) T protein:vir:10 278 YMSDLNDAMLL--IKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQ 336 (511) T ss_pred HHHHhhCceee--eeccccCCchhhccchhccceecccccccccccccCCCCcceeEEeec Confidence 23333444323 33433334443333222110000 0 001111111111 1 No 149 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=83.40 E-value=0.069 Score=26.96 Aligned_cols=259 Identities=11% Similarity=0.095 Sum_probs=98.5 Q ss_pred HHHHhcCC--CCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHH Q lcl|NC_019511. 6 KSLRLGSM--YKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHN 83 (330) Q Consensus 6 ~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~ 83 (330) -+|+..+. ++++ + .+ ..++++..+++...+..++.+-..=-.|++.....+ . ..+ ...+ T Consensus 1 ~~~~~~~~~~~~~~--~-~~-~~~~i~~~i~~~~~~~~r~~~~~~yy~g~~~i~~~~---------~--~~~-~~~~--- 61 (453) T protein:vir:73 1 MNLKPIKLMTYSRD--E-EI-TDKVVNDFMKKHQEEVERYEYLGNMYKGIMEISSQK---------A--KDS-WKPD--- 61 (453) T ss_pred Cccccceeeecccc--c-cC-CHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcCC---------C--CCc-cCcc--- Confidence 12222222 2221 1 11 123344444444444444443333233443221110 0 000 0011 Q ss_pred HHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCC Q lcl|NC_019511. 84 LHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDS 163 (330) Q Consensus 84 ~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s 163 (330) .| +. ++....|+++.+.-+ |..+. .+... + + +....+.+++.. .. T Consensus 62 ----~k-i~-~n~~~~ivd~~~~~l--~g~~~---------~~~~~--d----~---~~~~~l~~~~~~---------n~ 106 (453) T protein:vir:73 62 ----NR-LT-NNFAKYIVDTFVGYF--NGIPI---------KKTHD--D----K---SVLEAMQLFDNL---------ND 106 (453) T ss_pred ----ce-ee-cchHHHHHHHhhhhh--cccCc---------eeecC--C----h---HHHHHHHHHHHh---------cC Confidence 01 11 234445554443322 11221 12111 1 1 122334444321 14 Q ss_pred HHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCC-CcccCCceeEEEEeCCce-EEEechhHee Q lcl|NC_019511. 164 FQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKN-GKIIKGGNRFVQVIDKQV-VASFTSRELV 241 (330) Q Consensus 164 ~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~-G~~~~~~~~Y~q~~~~~~-~~~~~~~dvi 241 (330) +......+..+.+.+|.+|..+..+. .|.+ .+..++|..+.++.++. +..+.-.++|+...++.. ...|+.+.+. T Consensus 107 ~~~~~~~~~~~~~~~G~~~~~v~~d~--~~~~-~i~~~~p~~~~~v~dd~~~~~~~~~i~~~~~~~~~~~~~vyt~~~i~ 183 (453) T protein:vir:73 107 MEDEESELAKIACVYGRAYELMYQNE--STES-EVIYCSPLNVFMVYDDSIKQKPLFAVYYGFDEEGNLSGTVYTLLETI 183 (453) T ss_pred hhHHHHHHHHHHHhcCeEEEEEEeCC--CCce-EEEEEcccceEEEEeCCCCceeEEEEEEEEecCceEEEEEEeCCeEE Confidence 66677788899999998877765544 3554 56678999887776543 222222233333332222 1224444444 Q ss_pred eeccc-----------------CcCCCCCCCccccHHHHHHHHH---HHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCC Q lcl|NC_019511. 242 MGIRN-----------------PRSDLNSSGYGLSEVEIAMKEF---IAYNNTESFNDRFFSHGGTTRGILQIRADQQQS 301 (330) Q Consensus 242 h~~~n-----------------~~~d~~~~~yGlSPIe~a~~~I---~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls 301 (330) ++..+ |.-.+....+|.|-++-....+ ...++.-.-...+|++ |-=++ .| ..++ T Consensus 184 ~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~S~~~~~~~~~~~---~~l~~--~g-~~~~ 257 (453) T protein:vir:73 184 SITGKAGEVKFGESTYNVYSDLPIVEYNFNEERQSIFEPVHSLINSYNKVTSEKANDVEYFSD---QYLVF--LG-AEVD 257 (453) T ss_pred EEEecCCceEEccceeccCCceeEEEecCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHhcc---ceeee--ec-CCCC Confidence 43211 1111112334666665444444 3333333333344443 43222 23 2344 Q ss_pred HHHHHHHHHH----------------------------------------HHHHhcCcccccccceeeC Q lcl|NC_019511. 302 QHALENFKRE----------------------------------------WKSSFSGINGSWQICLYIK 330 (330) Q Consensus 302 ~e~~e~lr~~----------------------------------------w~~~~~G~~na~kvpvL~e 330 (330) ++....++.. ..+..... + .+|-+-. T Consensus 258 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~--s-~~p~~~~ 323 (453) T protein:vir:73 258 EEDAKNIKDNRLINFFDKNSNGQGTNAAKVDVKFLDKPDSDVQTENLLNRLERSIFQF--T-MAANISD 323 (453) T ss_pred chhhhcccccccccccccccccccccccCceeEEeeecCCHHHHHHHHHHHHHHHHHH--h-CCcccCc Confidence 4444444321 11111000 0 0010000 No 150 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=82.46 E-value=0.077 Score=26.70 Aligned_cols=249 Identities=10% Similarity=0.100 Sum_probs=94.6 Q ss_pred ccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHH Q lcl|NC_019511. 24 PIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIIT 103 (330) Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~ 103 (330) -+.+. ++++..++++-..=-.|++.....+-...-...|..++. +.....|+++ T Consensus 1 ~~~~~-------~~~~~~r~~~l~~yy~g~~~~~~~~~~~~~~~~~~~ki~-------------------~n~~~~ivd~ 54 (440) T protein:vir:95 1 MLAAF-------LGSQKQRLAILASYAQGDNFSILSGHRRLDDEKADYRVR-------------------HKWGGYISSF 54 (440) T ss_pred ChhhH-------HHHHHHHHHHHHHHhccCCcccccccccccccCCcceee-------------------cchHHHHHHh Confidence 11111 112222222222212344332211100000001111110 2334455544 Q ss_pred HHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCcee Q lcl|NC_019511. 104 RANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNF 183 (330) Q Consensus 104 ~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~ 183 (330) .+.=+ |..+.+ |. ..+. ...+....+.+++.. ..+......+..+.+++|.++. T Consensus 55 ~~~~l--~g~~~~-------~~--~~~~------~~~~~~~~l~~~~~~---------n~~~~~~~~~~~~~~~~G~a~~ 108 (440) T protein:vir:95 55 ATGYV--IGNPVS-------IG--VMEG------GSADQLSTIKDIEWQ---------NDINALNSDLAFDASVYGRAYE 108 (440) T ss_pred hhhhe--eccCce-------Ee--eCCC------ccHHHHHHHHHHHHh---------cCHhHHHHHHHHHHhhcCeEEE Confidence 33222 222222 11 1111 112233444444321 1466667778899999999887 Q ss_pred EEEEecCCCcceEEEEeeCCCceEEeeCCCCc-ccCCceeEEEEeCCceEEEechhHeeeecc---------------cC Q lcl|NC_019511. 184 EKVFSPKNKTKMEKFIAVDPSTIFYATDKNGK-IIKGGNRFVQVIDKQVVASFTSRELVMGIR---------------NP 247 (330) Q Consensus 184 ~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~-~~~~~~~Y~q~~~~~~~~~~~~~dvih~~~---------------n~ 247 (330) .+..+. .|++ .+..++|..+.++.++.+. ...-.++|+...+......|+.+.+.+... |+ T Consensus 109 ~~~~d~--~~~~-~i~~~~p~~~~~~~d~~~~~~~~~~i~~~~~~~~~~~~vyt~~~~~~~~~~~~~~~~~~~~~~~~~~ 185 (440) T protein:vir:95 109 YHFRDK--DKVD-RVVLISPLEMFVIRDLTVEQNIIAAVHLPIYADKVNMTVYTKDKVITYKPYSNNSVRLVVDDVKKHS 185 (440) T ss_pred EEEecC--CCce-EEEEEcccceEEEEcCCCCCceEEEEEEEEecCceEEEEEeCCeEEEEEEecCCccceeecceeecc Confidence 766544 4554 4667899999888776542 122223333222222223344444443211 11 Q ss_pred cC-----CCCCCCccccHHH---HHHHHHHHHHHHHHHHHHHHhcCCCcceEEE-eCCCCCCCHHHHHHHHHHHHHHhc- Q lcl|NC_019511. 248 RS-----DLNSSGYGLSEVE---IAMKEFIAYNNTESFNDRFFSHGGTTRGILQ-IRADQQQSQHALENFKREWKSSFS- 317 (330) Q Consensus 248 ~~-----d~~~~~yGlSPIe---~a~~~I~~~laae~~~~~fF~nGa~p~GiL~-~~~~~~ls~e~~e~lr~~w~~~~~- 317 (330) .. .+.....|.|-++ ....++...++--.-..++|++ |--++. ...+...++++...++..-.-... T Consensus 186 ~g~vPvv~~~n~~~g~sd~e~v~~lida~~~~~s~~~~~~~~~~~---~~~v~~g~~~~~~~~~e~~~~~~~~~~~~~~~ 262 (440) T protein:vir:95 186 YNDVPVVEWWNNRFRMGDYESEISLIDAYDAGQSDTANYMSDLND---AMLLVKGDLDGIKLSPEDAAKMKDANMLFLKT 262 (440) T ss_pred CceeeEEEeeCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhc---ceeeeecccccCCCCccchhhhhhccceeccc Confidence 11 1111123555444 4444444444444444455554 332221 011123456666665543221110 Q ss_pred -----Cccccccc-ceeeC Q lcl|NC_019511. 318 -----GINGSWQI-CLYIK 330 (330) Q Consensus 318 -----G~~na~kv-pvL~e 330 (330) |.+..+.+ .+--+ T Consensus 263 ~~~~~~~~~~~~~~~lt~~ 281 (440) T protein:vir:95 263 GISTTGQQTTADASYIYKQ 281 (440) T ss_pred ccccccCCCCcceeEEeec Confidence 00001111 01111 No 151 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=81.96 E-value=0.081 Score=26.57 Aligned_cols=277 Identities=13% Similarity=0.151 Sum_probs=94.5 Q ss_pred CchhHHHH--HhcC--CCCCCcccccC--ccC-------cchhHHHH-HHHHHHHHhhcccchhccccchhccccccccc Q lcl|NC_019511. 1 MPDLFKSL--RLGS--MYKEDTEDLMV--PID-------DGIQANIR-QIEQDTKEMQEITKSLYGKQQAYAEPFLEMMD 66 (330) Q Consensus 1 ~~~~~~~~--~~~~--~~~~~~~~~~~--~~~-------~~~~~~~~-~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~ 66 (330) ...|+-.- ..++ +++++....-. +.+ +++.-.++ +...+.-++.+-..=-.|+......+....-. T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~ 83 (512) T protein:vir:97 4 ANEFETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEE 83 (512) T ss_pred ceeccCceeeeeCceeeeccccccccccCchhhhhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCccccc Confidence 11110000 0000 11222111000 001 11111111 11111112222111122322211111000000 Q ss_pred cCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHH Q lcl|NC_019511. 67 TNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRI 146 (330) Q Consensus 67 ~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i 146 (330) .++..++ . ......|+++.+.-+ +..+.+ +...| + +..+.+ T Consensus 84 ~~~~~ki------------------~-~n~~k~Ivd~~~~yl--~g~p~~---------~~~~d--~-------~~~~~l 124 (512) T protein:vir:97 84 YMADNRV------------------A-HDYASYISDFINGYF--LGNPIQ---------CQDDD--K-------DVLEAI 124 (512) T ss_pred ccCccee------------------e-cchHHHHHHHHhhhh--cccCce---------eccCC--h-------HHHHHH Confidence 0011111 0 223344444433322 112221 11111 1 112334 Q ss_pred HHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCC-cccCCceeEEE Q lcl|NC_019511. 147 EEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNG-KIIKGGNRFVQ 225 (330) Q Consensus 147 ~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G-~~~~~~~~Y~q 225 (330) .+++.. .++......+..+++.+|.+|..+..+.+ |+ +.+..++|..+.++.++.. ..+.-.++|+. T Consensus 125 ~~~~~~---------n~~~~~~~~~~~~~~i~G~ay~~vy~ded--~~-~~i~~~~p~~~~~iyd~~~~~~~~~~vr~~~ 192 (512) T protein:vir:97 125 EAFNDL---------NDVESHNRSLGLDLSIYGKAYELMIRNQD--DE-TRLYKSDAMSTFVIYDNTIERNSIAGVRYLR 192 (512) T ss_pred HHHHhh---------cCHHHHHHHHHHHHHhcCeEEEEEEeCCC--Cc-eEEEEEcccceEEEEcCCCCCceEEEEEEEE Confidence 444321 24666777888999999988877665444 44 4677899998888776542 22223344443 Q ss_pred Ee--CCc------eEEEechhHeeeeccc----------------------CcCCCCCCCccccHHHHHHHHHHHHHHHH Q lcl|NC_019511. 226 VI--DKQ------VVASFTSRELVMGIRN----------------------PRSDLNSSGYGLSEVEIAMKEFIAYNNTE 275 (330) Q Consensus 226 ~~--~~~------~~~~~~~~dvih~~~n----------------------~~~d~~~~~yGlSPIe~a~~~I~~~laae 275 (330) +. +++ ....|+.+.+.+...+ |.-.+....+|.|-++-+...|...-.+. T Consensus 193 ~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~~~gd~e~v~~liDa~d~~~ 272 (512) T protein:vir:97 193 TKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAE 272 (512) T ss_pred eeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccCcccceEeecCCCCCCCchhhhHHHHHHHHHHH Confidence 32 111 1123455555443211 11111112346676665555554433332 Q ss_pred HHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCc-----------ccccccceee-C Q lcl|NC_019511. 276 SFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGI-----------NGSWQICLYI-K 330 (330) Q Consensus 276 ~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~-----------~na~kvpvL~-e 330 (330) .-.++.+...+.|--++ .|....+++.....+....-...+. +..+.+=.|. + T Consensus 273 S~~~~~~~~~~~~~lv~--~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~ 337 (512) T protein:vir:97 273 SDTANYMSDLNDAMLLI--KGNLNLDPVEVRKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQ 337 (512) T ss_pred HHHHHHHHHhcCceeee--ecCccCCchhhhhhhhcccccccccchhhcccccCCCCCcceEEEeec Confidence 22222233334444333 3433334444443332221111110 0001100000 0 No 152 >protein:vir:1785 Length: 555 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570351;genbank:gi:18640510;genbank:GeneID:932723 Probab=80.62 E-value=0.093 Score=26.23 Aligned_cols=261 Identities=11% Similarity=0.080 Sum_probs=94.0 Q ss_pred HHHHHHHHHHHhhcccchhccc---cchhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhH Q lcl|NC_019511. 32 NIRQIEQDTKEMQEITKSLYGK---QQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQV 108 (330) Q Consensus 32 ~~~~~~~~~~~~~~~~~~~~g~---~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~I 108 (330) +...+.+.+-++...-.+-..+ =..|..|........+.. .++. +-| .+....|+++++..+ T Consensus 1 m~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~------~~~~-------~~~--dst~~~a~~~Laa~l 65 (555) T protein:vir:17 1 MKHSAQAKYMMLRADREDYLDSGRQSARLTLPYILTDEGHVQG------GYLP-------TPW--QSVGSKGVNVLASKL 65 (555) T ss_pred ChhHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCccc------cccc-------ccc--cccHHHHHHHHHHHH Confidence 1111111111111100111111 113333422111111100 0000 001 233445677777777 Q ss_pred hhhhhhheecccccceeeec--cCCC---cccChhhHHHHH----HHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcC Q lcl|NC_019511. 109 STYCKPARYSEKGVGFEVKL--KDLD---ATPGIKEKEQMK----RIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYD 179 (330) Q Consensus 109 a~~~~~~~~~~~~~g~~v~~--kd~~---~~~~~~~~~~~~----~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g 179 (330) ..-..+. +. =|+++ .|.+ ...+...+..++ .+++.+..-+. +.+|+.=+-.+..|++++| T Consensus 66 ~~~ltpp-----~~-~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~l~-----~snf~~~~~~~~~~L~~~G 134 (555) T protein:vir:17 66 MLSLFPV-----NT-SFFKLQINDAEIDNLGMDEQARSEIDLSLSRIERIVTQDIA-----ESSDRVHLEMAMKHLIVTG 134 (555) T ss_pred HHhhcCC-----CC-cccccccCHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHH-----hcCcHHHHHHHHHHHHhHC Confidence 5422221 11 13333 2221 112223333332 24444433322 2356666667778888999 Q ss_pred CceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccCC---------------------------------------- Q lcl|NC_019511. 180 QVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKG---------------------------------------- 219 (330) Q Consensus 180 ~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~---------------------------------------- 219 (330) ++..|. ..+ +..+|||. +..+..|..|++.+- T Consensus 135 ~a~ly~--~~~----~~~~~pl~--~y~v~~d~~G~vd~v~rk~~~t~~ql~~~fg~~~l~~~~~~~~~~~~d~~~~~~~ 206 (555) T protein:vir:17 135 NALLYQ--GKK----NLKLYPLD--RFVVSRDGEGNVMEIVTEEQIDRSLLPEEFQKVGGLEGAPDSNAVGEDGPKMGVT 206 (555) T ss_pred eEEEEe--cCC----ceeEEEcC--eEEEeeCCCcCeeEEEeeeeecHHHHHHHhhhccccchhhhhhhccccchhhhhh Confidence 988765 222 34567763 344444555532210 Q ss_pred ------------------------ceeEEEEeCCceE-EEec---hh--HeeeecccCcCCCCCCCccccHHHHHHHHHH Q lcl|NC_019511. 220 ------------------------GNRFVQVIDKQVV-ASFT---SR--ELVMGIRNPRSDLNSSGYGLSEVEIAMKEFI 269 (330) Q Consensus 220 ------------------------~~~Y~q~~~~~~~-~~~~---~~--dvih~~~n~~~d~~~~~yGlSPIe~a~~~I~ 269 (330) ...|++-.+|..+ .++. -+ =.+..+.+..+ ...||.||.+-++-.+. T Consensus 207 ~~~~~~~~~~~~~~v~t~~~~~~~~~~~~~e~~~~~v~~~l~e~g~~e~P~i~~Rw~~~~---ge~YGrgp~~~~l~D~k 283 (555) T protein:vir:17 207 APGGRDKGKSNDALVYTYVCRKDGQVKWHQECDGKVIPGSNSSAPYTHNPWIPLRFNIVD---GEAYGRGRVEEFMGDLK 283 (555) T ss_pred hhcccccCCCcceeEeecccccCCeeEEEEecCceeccccccccCcccCCeeeeeeeecC---CCccccchHHHHHHHHH Confidence 0000011111110 0000 00 01222222222 24589999988887776 Q ss_pred HHHHHHHHHHHHHhcCCCcc------eEEE----------------------eC----CCCCCCHHHHHHHHHHHHHHhc Q lcl|NC_019511. 270 AYNNTESFNDRFFSHGGTTR------GILQ----------------------IR----ADQQQSQHALENFKREWKSSFS 317 (330) Q Consensus 270 ~~laae~~~~~fF~nGa~p~------GiL~----------------------~~----~~~~ls~e~~e~lr~~w~~~~~ 317 (330) .....++-....-.-...|- |++. +. ++-....+.++.++...+..|- T Consensus 284 ~L~~l~~~~l~~~~~~~~pp~lv~~~g~~~~~~l~~~~~g~v~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~aFm 363 (555) T protein:vir:17 284 SLEALSQAMVEGSAASAKVVFMVSPSATTKPQNLALAANGAIIQGRPDDVSVVQANKAADFRTVLEMIQKLEQRISDAFL 363 (555) T ss_pred HHHHHHHHHHHHHHHHhCCceeeccccccCcceeecCCCceeecCCcccceeeeccccchhhHHHHHHHHHHHHHHHHHh Confidence 65555543333222112221 1110 00 0001123444555555544432 Q ss_pred Cc--ccccccceeeC Q lcl|NC_019511. 318 GI--NGSWQICLYIK 330 (330) Q Consensus 318 G~--~na~kvpvL~e 330 (330) .. .++.++ --.| T Consensus 364 ~~~~~d~~r~-TAtE 377 (555) T protein:vir:17 364 MLQVRQSERT-TATE 377 (555) T ss_pred hcCCCCcccc-hHHH Confidence 21 122111 0011 No 153 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=79.96 E-value=0.099 Score=26.08 Aligned_cols=266 Identities=9% Similarity=0.058 Sum_probs=91.2 Q ss_pred CchhHHHHHhcCCCC--CCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcc Q lcl|NC_019511. 1 MPDLFKSLRLGSMYK--EDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYM 78 (330) Q Consensus 1 ~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~ 78 (330) |++++-=+.+..-.+ +...+..-...+.+...+++-..+..++.+...=-.|+......+.... .. ...-++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~--~~--~~~~~~-- 74 (474) T protein:vir:96 1 MIVIFWPNEKPYHERVVEQIKPKYETQEEMIIRLINDHKPKIDDITVGERYYNHDPDVLRLAPKLD--NK--GEIDPL-- 74 (474) T ss_pred CeeeccCCCchhhhhHHHHhhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhccCCcchhccchhc--cc--cccccc-- Confidence 666543221110000 0000000011112222222222222222222111123321111110000 00 000000 Q ss_pred cchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCC Q lcl|NC_019511. 79 RNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKD 158 (330) Q Consensus 79 r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~p 158 (330) ..-.+.+ +++...|+++.+.-+ |..+.+++ .. + .+....+..++. T Consensus 75 -------~~~~ki~-~n~~~~Ivd~~~~~l--~g~p~~~~---------~~--d-------~~~~~~l~~~~~------- 119 (474) T protein:vir:96 75 -------KPDWRMF-TNYHQNLVDQKVAYA--VANPVTFS---------SD--D-------DKSLKTIQEVLN------- 119 (474) T ss_pred -------ccchhcc-cchHHHHHHhhhhhh--cccCceee---------cC--c-------hHHHHHHHHHHh------- Confidence 0000111 344556655544433 22332221 11 1 122344555542 Q ss_pred CCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCC-CcccCCceeEEEEeCCceEEEech Q lcl|NC_019511. 159 IDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKN-GKIIKGGNRFVQVIDKQVVASFTS 237 (330) Q Consensus 159 n~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~-G~~~~~~~~Y~q~~~~~~~~~~~~ 237 (330) | ++..-...+..+++.+|.++..+.++.+ |++ .+..++|..+.++.++. .....-.++|+..........++. T Consensus 120 n---~~~~~~~~~~~~~~~~G~~~~~~y~d~~--~~~-~i~~~~p~~~~~v~d~~~~~~~~~~vr~~~~~~~~~~~~yt~ 193 (474) T protein:vir:96 120 H---KWDDKLVDILTAASNKGIEWLQPYIDEN--GEF-KTFRVPAEQAIPIWTNKERDTLKAFIRYYRLDGAERVEYWTD 193 (474) T ss_pred c---CHHHHHHHHHHHHHhcCeeEEEEEecCC--Cce-EEEEEcccceEEEEcCCCCCceEEEEEEEeecCceEEEEEeC Confidence 1 3444455567889999988776654443 554 57889999998886642 111222233333222222222333 Q ss_pred hHeeeecc--------------------------c-----CcCCCCCCCccccHHHHHH---HHHHHHHHHHHHHHHHHh Q lcl|NC_019511. 238 RELVMGIR--------------------------N-----PRSDLNSSGYGLSEVEIAM---KEFIAYNNTESFNDRFFS 283 (330) Q Consensus 238 ~dvih~~~--------------------------n-----~~~d~~~~~yGlSPIe~a~---~~I~~~laae~~~~~fF~ 283 (330) +.+.|... + |.--+....+|.|-++... .++...++-- ++.+. T Consensus 194 ~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~---~~~~~ 270 (474) T protein:vir:96 194 SDVTYYEYQDGILIPDYYHGEEHIQSHYYVGNKRVSWGRVPFIPFKNNPQEMSDLFMYKTIIDAMDKRLSDT---QNTFD 270 (474) T ss_pred CeEEEEEecCCceeeccccccccccccccccccccCCCceeEEEeccCCCCCCcHHHHHHHHHHHHHHHHHH---HHHHH Confidence 33322210 1 1111111234666555443 4444333333 33334 Q ss_pred cCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 284 HGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 284 nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) ..+.|- +.+.|.. . +..+.+...+.. .+ ++.++ T Consensus 271 ~~~~~~--lv~~g~~-~--~~~~~~~~~~~~--------~~-~i~~~ 303 (474) T protein:vir:96 271 ESTELI--YILKGYE-G--QDLDEFMRNLKY--------YK-AINVD 303 (474) T ss_pred Hhccce--eeeecCC-c--ccccchhhhhhc--------Cc-eEEec Confidence 444453 3333321 1 111122211211 12 12221 No 154 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=79.44 E-value=0.1 Score=25.96 Aligned_cols=272 Identities=9% Similarity=0.059 Sum_probs=101.6 Q ss_pred CccCcch-hHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHH Q lcl|NC_019511. 23 VPIDDGI-QANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAII 101 (330) Q Consensus 23 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I 101 (330) +.+-..+ ...++.+-+.+.+.-..=.+..-.=..|..|....-... +... +.-+-| .+....|+ T Consensus 1 ~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~-----------~~~~--~~~~~~--dst~~~a~ 65 (522) T protein:vir:94 1 MAEREGFAAEGAKAVYDRLKNGRQPYETRAQNCAAVTIPSLFPKESD-----------NSST--EYTTPW--QAVGARCL 65 (522) T ss_pred CcccchhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCC-----------cccc--cccccc--cccHHHHH Confidence 2221111 222222222222111000000001123333421110000 0000 000111 33444667 Q ss_pred HHHHHhHhhhhhhheecccccceeeeccCCC---------cccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHH Q lcl|NC_019511. 102 ITRANQVSTYCKPARYSEKGVGFEVKLKDLD---------ATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIV 172 (330) Q Consensus 102 ~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~---------~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v 172 (330) ++++..+..-..|++ - |+++.-.+ ........+....+++.+..-+ .+.+|+.=+..+. T Consensus 66 ~~Las~l~~~ltP~~------~-WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~-----~~snf~~~~~~~~ 133 (522) T protein:vir:94 66 NNLAAKLMLALFPQS------P-WMRLTVSEYEAKTLSQDSEAAARVDEGLAMVERVLMAYM-----ETNSFRVPLFEAL 133 (522) T ss_pred HHHHHHHHhhcCCCC------c-ccccccchhhhhccCcccchhHHHHHHHHHHHHHHHHHH-----HhcCcHHHHHHHH Confidence 777777754322421 2 44442111 1111122222344444443332 2346777777778 Q ss_pred HHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccC----------------------------CceeEE Q lcl|NC_019511. 173 RDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIK----------------------------GGNRFV 224 (330) Q Consensus 173 ~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~----------------------------~~~~Y~ 224 (330) .|++++|++..|+.-+..+.+.....||+. ++.+..|..|++.+ ....++ T Consensus 134 ~~L~~~G~a~l~~~~~~~~~~~~~~~~pl~--~y~v~~d~~G~vd~i~r~~~~~~~~l~~~~~~~~~~~~~~p~~~v~v~ 211 (522) T protein:vir:94 134 KQLIVSGNCLLYIPEPEQGTYSPMRMYRLV--SYVVQRDAFGNILQIVTIDKVAFSALPEDVKSQLNADDYEPDTELEVY 211 (522) T ss_pred HHHHhhCcEeEeeeccCCCceeeEEEEEcc--eEEEeeCCCcCeEEEeeeeeccHHhcchHHHHHHhcccCCccceEEEE Confidence 899999999988753333334446677764 45555666664321 000000 Q ss_pred EE---eCCceEEEechhH----------------eeeecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019511. 225 QV---IDKQVVASFTSRE----------------LVMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHG 285 (330) Q Consensus 225 q~---~~~~~~~~~~~~d----------------vih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nG 285 (330) .. .+++-......++ .+..+.+..+ ...||.||++-|+-.+......++-....-.-. T Consensus 212 ~~v~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~P~~~~Rw~~~~---ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~ 288 (522) T protein:vir:94 212 THIYRQDDEYLRYEEVEGIEVTGTDGSYPLTACPYIPVRMVRLD---GEDYGRSYCEEYLGDLNSLETITEAITKMAKVA 288 (522) T ss_pred EEEEeeCCceeEEeeccCceecccCCCCccccCCceeeeeeecC---CCccccchHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 00 0111111111111 1111122222 145899999988877776665555444433333 Q ss_pred CCcceEEEeCCCCCC----------------------------------CHHHHHHHHHHHHHHhcCcc----cccccce Q lcl|NC_019511. 286 GTTRGILQIRADQQQ----------------------------------SQHALENFKREWKSSFSGIN----GSWQICL 327 (330) Q Consensus 286 a~p~GiL~~~~~~~l----------------------------------s~e~~e~lr~~w~~~~~G~~----na~kvpv 327 (330) ..|-.++ +.+... ..+.++.++......|--.. ++.+ | T Consensus 289 ~~p~~~v--~~~g~~~~~~~~~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r--~ 364 (522) T protein:vir:94 289 SKVVGLV--NPNGITQPRRLNKAATGEFVAGRVEDINFLQLTKGQDFTIAKSVADAIEQRLGWAFLLNSAVQRNAER--V 364 (522) T ss_pred hCCceee--cccccccchheeccCCceeecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhhhccCCCcc--c Confidence 3333211 111111 22233333333333221100 0000 0 Q ss_pred eeC Q lcl|NC_019511. 328 YIK 330 (330) Q Consensus 328 L~e 330 (330) --+ T Consensus 365 TAt 367 (522) T protein:vir:94 365 TAE 367 (522) T ss_pred cHH Confidence 000 No 155 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=78.09 E-value=0.12 Score=25.67 Aligned_cols=273 Identities=10% Similarity=0.084 Sum_probs=97.4 Q ss_pred CchhHHHHHhcCCCCCCcccccCccCcchhHHHHHHH-HHHHHhhcccchhccccchhccc--cccccccCCCCCcCCCc Q lcl|NC_019511. 1 MPDLFKSLRLGSMYKEDTEDLMVPIDDGIQANIRQIE-QDTKEMQEITKSLYGKQQAYAEP--FLEMMDTNPDYRDKKSY 77 (330) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~g~~~~~~~~--~~~~~~~~p~~~~~~s~ 77 (330) +-+.+-+......+-.++....+ ..+++.-.++.-. +...++.+...=-.|+......+ -......+|..+ T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~i~~~i~~~~~~~~~~~~~~~~yY~g~~~~i~~~~~~~~~~~~~~~~k----- 81 (481) T protein:vir:10 8 NINTKFSPLANDDFVVSDLAELL-KEENLRNFISRHQTEQVPRLEMLESYYLNRNTDILAGERRLQKYGDKADHR----- 81 (481) T ss_pred hhchhcccccCceeeeecchhhc-CHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCccccccccccccce----- Confidence 22222233222222222111111 1112222122111 11111111111112322111000 000000000000 Q ss_pred ccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCC Q lcl|NC_019511. 78 MRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDK 157 (330) Q Consensus 78 ~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~ 157 (330) + .++....|+++.+.-+. ..+. .+...|. +....+.+++.. T Consensus 82 -------------i-~~n~~~~ivd~~~~~l~--g~~~---------~~~~~d~---------~~~~~l~~~~~~----- 122 (481) T protein:vir:10 82 -------------A-VHNYAKYVSRFIVGYLT--GNPI---------TITHQDN---------QTNDKIIELNDL----- 122 (481) T ss_pred -------------e-ecchHHHHHHHHHhhhc--cCCc---------eEecCCh---------hHHHHHHHHHHh----- Confidence 1 13445555555443331 1221 1222111 112334444432 Q ss_pred CCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCc-ccCCceeEEEEeCC--c---e Q lcl|NC_019511. 158 DIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGK-IIKGGNRFVQVIDK--Q---V 231 (330) Q Consensus 158 pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~-~~~~~~~Y~q~~~~--~---~ 231 (330) ..+..++..+..+.+++|.++..+..+.+ |++ .+..++|..+.++.++.+. ...-.++|+...++ + . T Consensus 123 ----n~~~~~~~~~~~~~~~~G~~~~~~~~d~d--g~~-~i~~~~p~~~~~v~d~~~~~~~~~~i~~~~~~~~~~~~~~~ 195 (481) T protein:vir:10 123 ----NDADEVNSDLALNLSIYGRAYEIVYRDFE--DRD-TFKVLDPKSTFVVYDQTLDKKVVAGVRYFEKQDKDKVPVQH 195 (481) T ss_pred ----cChhHHHHHHHHHHHhcCeEEEEEEeCCC--CeE-EEEEEcccceEEEEcCCCCCceEEEEEEEEEeeCCCceEEE Confidence 14667888889999999988777655444 554 5777899999888765431 11222333222211 1 1 Q ss_pred EEEechhHeeeeccc------------CcCC-----CCCCCccccHHHHHHHH---HHHHHHHHHHHHHHHhcCCCcceE Q lcl|NC_019511. 232 VASFTSRELVMGIRN------------PRSD-----LNSSGYGLSEVEIAMKE---FIAYNNTESFNDRFFSHGGTTRGI 291 (330) Q Consensus 232 ~~~~~~~dvih~~~n------------~~~d-----~~~~~yGlSPIe~a~~~---I~~~laae~~~~~fF~nGa~p~Gi 291 (330) ...++.+.+.+.... +... +..+.+|.|-++..... +...++--.....+|+ .|--+ T Consensus 196 ~~~y~~~~i~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~~~~~~v~~lida~~~~~s~~~~~~~~~~---~~~~~ 272 (481) T protein:vir:10 196 VEVYTTDKIYYIEIKGGTYHRVEEVEHYYNDVPIIEYLNDQFKQGDFENVIALIDLYDSAQSDTANYMTDLN---DAMLA 272 (481) T ss_pred EEEEecCeEEEEEecCCceeecccccccCCceeEEEeecCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhc---CceeE Confidence 122344444433211 1111 11133466665543333 3333333333334444 34333 Q ss_pred EEeCCCCCCCHHHHHHHHHH-H-H----HHhcCccccccccee-eC Q lcl|NC_019511. 292 LQIRADQQQSQHALENFKRE-W-K----SSFSGINGSWQICLY-IK 330 (330) Q Consensus 292 L~~~~~~~ls~e~~e~lr~~-w-~----~~~~G~~na~kvpvL-~e 330 (330) ++|....+++..+.++.. + . ....|.+..+.+.-| -+ T Consensus 273 --~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ 316 (481) T protein:vir:10 273 --IIGNVDLDSEDAKAFRDANMIHLEPGTNANGSEGKAEVKYVYKQ 316 (481) T ss_pred --eecCcCCCccchhhhhhccceeccccccccCCCCCcceeEEeec Confidence 334333444444444321 1 0 000111111222111 12 No 156 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=73.02 E-value=0.18 Score=24.73 Aligned_cols=250 Identities=8% Similarity=-0.027 Sum_probs=89.8 Q ss_pred ccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHH Q lcl|NC_019511. 24 PIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIIT 103 (330) Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~ 103 (330) ...+.+...+++.....-++.+-..=-.|++.... ++. +.....+. +.+ ++....|+++ T Consensus 1 l~~~~l~~~i~~~~~~~~r~~~l~~yy~g~~~il~---------~~~---~~~~~~~~--------ki~-~n~~~~ivd~ 59 (429) T protein:vir:98 1 MTKDLLSELIQKHRSFNLSYSAYKQLYEGDHAILQ---------QKQ---KEQYKPDN--------RLV-VNFAKYIVDT 59 (429) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc---------ccc---cccCCCcc--------eee-cchHHHHHHH Confidence 11122222233323222223222222233332111 100 00000110 011 3445566555 Q ss_pred HHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCcee Q lcl|NC_019511. 104 RANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNF 183 (330) Q Consensus 104 ~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~ 183 (330) .+.-+- .. +..+...+ +.. ...+..++.. .++...+..+..+++.+|.++. T Consensus 60 ~~~~l~--g~---------~~~~~~~~------~~~---~~~l~~~~~~---------n~~~~~~~~~~~~~~~~G~~~~ 110 (429) T protein:vir:98 60 FNGYFI--GV---------PVQTSHEN------KQV---SNYLELLDGY---------NDQDDNNAELSKICSIYGHGYE 110 (429) T ss_pred Hhhhhc--cc---------CceeecCC------hHH---HHHHHHHHhh---------cCHhHHHHHHHHHHhhcCeEEE Confidence 443331 11 22222111 111 2233443321 1456677788899999998877 Q ss_pred EEEEecCCCcceEEEEeeCCCceEEeeCCCC-cccCCceeEEEEeCCceEE-EechhHeeee------------cccCcC Q lcl|NC_019511. 184 EKVFSPKNKTKMEKFIAVDPSTIFYATDKNG-KIIKGGNRFVQVIDKQVVA-SFTSRELVMG------------IRNPRS 249 (330) Q Consensus 184 ~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G-~~~~~~~~Y~q~~~~~~~~-~~~~~dvih~------------~~n~~~ 249 (330) .+.... .|++ .+-.++|..+.++.++.. ..+...++|+...++.... .++.+++.+. ..++.. T Consensus 111 ~v~~d~--~g~~-~~~~~~p~~~~~v~dd~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 187 (429) T protein:vir:98 111 LVFNDE--NAEA-GITYLTPLEAFIVYDDSIRQKPLFAVRYFYNKGGVLEGSYSDASNITYFKDGEKGIEIGESEPHPFD 187 (429) T ss_pred EEEecC--CCcE-EEEEEcccceEEEEeCCCCCceEEEEEEEEecCceEEEEEEeCceEEEEEecCCceEecccccccCC Confidence 765544 4554 577789998887766432 1122223333221111111 1122211111 001111 Q ss_pred -----CCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHH--HHHHhcCcccc Q lcl|NC_019511. 250 -----DLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKRE--WKSSFSGINGS 322 (330) Q Consensus 250 -----d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~--w~~~~~G~~na 322 (330) .+..+.+|.|-++-....+...-.+-.-.++.....+.|--++ .| ..++++....++.. |.-.-.|..++ T Consensus 188 ~vPvv~~~n~~~g~sd~e~v~~liD~~d~~~s~~~~~~~~~~~p~~~i--~g-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 264 (429) T protein:vir:98 188 GVPMIEYVENEERQSLLASVVTLINAFNKAISEKANDVEYFADAYLKI--LG-AELDDETLKSLRDTRIINLKDTDAQQL 264 (429) T ss_pred ccceEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeee--ec-CCCCcchhhhHhhCceeeccCCCCCCc Confidence 1112345766666544444433222222222233344454333 23 23444433333211 11000000000 Q ss_pred ccc-ceeeC Q lcl|NC_019511. 323 WQI-CLYIK 330 (330) Q Consensus 323 ~kv-pvL~e 330 (330) .+ .+--+ T Consensus 265 -~~~~l~~~ 272 (429) T protein:vir:98 265 -TVEFLQKP 272 (429) T ss_pred -ceeEEeec Confidence 00 01111 No 157 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=72.89 E-value=0.18 Score=24.71 Aligned_cols=263 Identities=10% Similarity=-0.001 Sum_probs=95.6 Q ss_pred CCCCCcccccCcc-----CcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHH Q lcl|NC_019511. 13 MYKEDTEDLMVPI-----DDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEV 87 (330) Q Consensus 13 ~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~ 87 (330) .++.++.-+..+. +++....++.+-++..+. .+..-+-..|-+- ..+++.-+. +....+.. T Consensus 1 ~~~~~~~~~~~~~~~~~l~~~e~~~i~~L~~~~~~~----~~r~~~l~~YY~G---------~~~i~~~~~-~~p~~~~~ 66 (504) T protein:vir:99 1 MTEETTSASKFTFRIPELNDDVVDKVNGLYQQLVDR----TPRNLLRASFYDG---------KYAIRQIGN-LIPPEYLR 66 (504) T ss_pred CCccCCcccccccccCCCCHHHHHHHHHHHHHHHHH----hHHHHHHHHHHhc---------cccchhccc-cccHHHHH Confidence 5554444333333 232222233333322111 1111222223110 011111100 01111111 Q ss_pred HHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHH Q lcl|NC_019511. 88 LKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEF 167 (330) Q Consensus 88 Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~f 167 (330) ++ +.......|+++.++.+. ..||.+- + .+..+ +. +.+++. -| ++... T Consensus 67 ~~--~v~n~~~~iVd~~a~rl~-----------~~Gf~~~--d--~~~~~---~~---l~~i~~------~N---~ld~~ 114 (504) T protein:vir:99 67 TA--TVLGWSAKAVDTLARRCN-----------LESFVWP--D--GDYGS---IG---GPDVWD------EN---FFATK 114 (504) T ss_pred Hh--hccCcHHHHHHHHHhhhc-----------cceeeCC--C--CChhh---HH---HHHHHH------hc---ChhhH Confidence 22 223456677777777653 2365432 1 11111 12 222221 12 35556 Q ss_pred HHHHHHHHHhcCCceeEEEEecCCCcceE-EEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceE---EEechhHee-- Q lcl|NC_019511. 168 CKKIVRDTYTYDQVNFEKVFSPKNKTKME-KFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVV---ASFTSRELV-- 241 (330) Q Consensus 168 l~~~v~d~L~~g~g~~~~v~~rd~~G~~~-~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~---~~~~~~dvi-- 241 (330) ...+..+.|++|.+|..+.. +..|++. .+.+++|..+.++.|+.-....-.++|++...++.. ..|.++.++ T Consensus 115 ~~~~~~~a~iyG~af~~v~~--~~d~~~~~~I~~~sP~~~~~iyD~~~~~~~~a~~~~~~d~~g~~~~~~~y~~~~~~~~ 192 (504) T protein:vir:99 115 ANNAMVSSLIHGPAFLINTE--GGAGEPDSLIHVKSAMQATGEWNSRRNAMDSLLSITSRDAEGHPTGIALYEDGVTVTA 192 (504) T ss_pred HHHHHHHHHhhCceeEEEec--CCCCCceeEEEEeccceeEEEEeCCCCceeEEEEEEEecCCCeEEEEEEEcCCcEEEE Confidence 67788999999988766643 3345543 456789988777666432221122222222222211 123333333 Q ss_pred ----------------------eecccCcCCCCCCCccccHH----HHHHHHHHHHHHHHHHHHHHHhc------CCCcc Q lcl|NC_019511. 242 ----------------------MGIRNPRSDLNSSGYGLSEV----EIAMKEFIAYNNTESFNDRFFSH------GGTTR 289 (330) Q Consensus 242 ----------------------h~~~n~~~d~~~~~yGlSPI----e~a~~~I~~~laae~~~~~fF~n------Ga~p~ 289 (330) ++..+++. .++||.|.| .....++.-.+.--.-...||+. |+.+. T Consensus 193 ~~~~~~~~~~~~~~~~~gvPvV~~~n~~~~---~~~~G~sei~~~v~~l~Da~~~~~~~~~~~~e~~a~p~r~i~G~~~~ 269 (504) T protein:vir:99 193 DMDDDGDWHADVRTHKLGVPVEVLPYKPRE---DRPLGSSRITRPVMSLQQRALKGCIRMDGHADVYSFPQLILLGADAK 269 (504) T ss_pred EEcCCceeeeccccCCCCcceEEecccccC---ccccCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhccCCcc Confidence 33323222 245677743 34444444444433344455543 22221 Q ss_pred ------------------eEEEeCCCC-------------CCCHHH----HHHHHHHHHHHhcCcccccccce-----ee Q lcl|NC_019511. 290 ------------------GILQIRADQ-------------QQSQHA----LENFKREWKSSFSGINGSWQICL-----YI 329 (330) Q Consensus 290 ------------------GiL~~~~~~-------------~ls~e~----~e~lr~~w~~~~~G~~na~kvpv-----L~ 329 (330) .++.++.+. ++++.. +++|+..... +++.. .+|. += T Consensus 270 ~~~~~d~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~q~~~~~l~~~~~~l~~~i~~-~a~~t---~~P~~~lG~~~ 345 (504) T protein:vir:99 270 NFRNKDGSMKPAWQIALARVFALPDDEDEPDAARARADVKQFPASSPQPHIEMLEQIAMM-FSGET---SIPVESLGFSN 345 (504) T ss_pred ccccccccccchhhhhhhhhhcCCCccccccccCccceeeecCCCChHHHHHHHHHHHHH-HHhhh---CCCHHHhcccc Confidence 111111000 000000 1112211111 11100 0110 00 Q ss_pred C Q lcl|NC_019511. 330 K 330 (330) Q Consensus 330 e 330 (330) + T Consensus 346 ~ 346 (504) T protein:vir:99 346 R 346 (504) T ss_pred c Confidence 0 No 158 >protein:vir:108049 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595296;genbank:gi:161622602;genbank:GeneID:5783768 Probab=70.54 E-value=0.21 Score=24.33 Aligned_cols=295 Identities=14% Similarity=0.141 Sum_probs=128.2 Q ss_pred CchhHHHHHhcCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCC-------CCCc Q lcl|NC_019511. 1 MPDLFKSLRLGSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNP-------DYRD 73 (330) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p-------~~~~ 73 (330) |++|-.-|+..+.--+.+. ...-.++.....+- .++....|..-.-. +++.-| .+.- T Consensus 1 ~~~~~~~~~lf~f~~~~de----------~~~~~~~~~~~~S~-~~p~~~dGa~~I~~-----~~~~~~~~~~~q~~y~~ 64 (524) T protein:vir:10 1 MANFNTILSFLKPWANEDE----------KEYKQQINNNLESV-TAPKLDDGAREIET-----QEQNIPYNALMQQMFGS 64 (524) T ss_pred CCchhhHHHHhhhhhcchh----------hhhhhhhccCCCcc-ccCCCCCCceeecc-----Ccccccchhhhhhhhhc Confidence 5554333343333211100 00000000000000 00111111111000 000000 0110 Q ss_pred CCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhc Q lcl|NC_019511. 74 KKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNT 153 (330) Q Consensus 74 ~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~ 153 (330) --...++-....+.-|..|.+|-|..+|+.+.+++. ..+..+---++.+.+ -+.++...++|..--+.+.++ T Consensus 65 ~e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneai------v~d~~~~pV~l~Ld~--~~~s~siK~kI~eeF~~Il~l 136 (524) T protein:vir:10 65 NEPEVKNTRELIDTYRNLMNNYEVDNAVQEIVSDAI------VYEDDKEVVALNLDG--TDFSQSIKDKILAEFSEVLNL 136 (524) T ss_pred ccchhhhHHHHHHHHHHHhhccchhhHHHHhhccee------EecCCCceEEEEecc--cCcchHHHHHHHHHHHHHHHH Confidence 011234555566667888888999999999998874 223322234455533 336666666666544444444 Q ss_pred cCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecC-CCcceEEEEeeCCCceEEe-----eCCCCcccCCcee--EEE Q lcl|NC_019511. 154 GTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPK-NKTKMEKFIAVDPSTIFYA-----TDKNGKIIKGGNR--FVQ 225 (330) Q Consensus 154 ~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd-~~G~~~~L~pldp~tV~~~-----~d~~G~~~~~~~~--Y~q 225 (330) +.... +..++ ++.-++-|..|+-++++.+ .+.-+.+|..|||..|+.+ .+.+|....++.. |.| T Consensus 137 l~F~~----~~~~~----fR~WYVDgRi~fHkiid~~~pk~GI~Elr~lDPr~i~~vr~i~~~~~~~~~vi~~~~e~f~Y 208 (524) T protein:vir:10 137 LNFQR----KGTDH----FQRWYVDSRIFFHKIINPKKMKDGVQELRRLDPRQVQYIREIVTRMEDGVKIVDGYREFFVY 208 (524) T ss_pred hccch----hhhHH----HhhheeeceEEEEEEeeCCCccccceeeeeeCCccceeeeeecccCcccchhhcchhhheee Confidence 33222 22233 3555677888888888633 3345889999999998652 2333432223321 222 Q ss_pred Ee------------CCceEEEechhHeeeecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEE Q lcl|NC_019511. 226 VI------------DKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQ 293 (330) Q Consensus 226 ~~------------~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~ 293 (330) .. ..+....++.+.|+|.+..-.++- ++.=+|=+..|...+......|.-.-=|==--|.-+=|.- T Consensus 209 ~~~~~~~~~~~~~~~~~~~ikI~~dAIvy~~SGL~d~~--~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFY 286 (524) T protein:vir:10 209 DTGHESYCADGRIYSAGTKVKIPRAAVVYAHSGLLDCC--GKNIIGYLQRAIKPANQLKLMEDAMVIYRITRAPDRRVFY 286 (524) T ss_pred cCCCcccccCcceecCCcceecchhheeeeccCcccCC--CCceeccchHhhHHHHhhHHHHhhHHHHhhhccccceEEE Confidence 21 112234577888888754322211 1233455777777776655444432211112233333333 Q ss_pred eCCCCCCCHHHHHHHHHHHHHHhc---------C-cccccccceeeC Q lcl|NC_019511. 294 IRADQQQSQHALENFKREWKSSFS---------G-INGSWQICLYIK 330 (330) Q Consensus 294 ~~~~~~ls~e~~e~lr~~w~~~~~---------G-~~na~kvpvL~e 330 (330) +..+ +|.+...++.-+.....|. | +.|..+..-++| T Consensus 287 IDVG-nlPk~KAeqYl~~im~k~kNKlvYDa~TGev~ddrk~msMlE 332 (524) T protein:vir:10 287 IDTG-NMPSRKAAAQMQHIMNTMKNRVVYDASTGKIKNQQHNMSMTE 332 (524) T ss_pred EecC-CCCchhHHHHHHHHHHhcCceeEEeccCCeeccchhhhhhHh Confidence 3332 3444333333333322221 1 122222333333 No 159 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=69.86 E-value=0.22 Score=24.22 Aligned_cols=270 Identities=11% Similarity=0.076 Sum_probs=102.7 Q ss_pred CchhHHHHHhcCCCCCCcccccCccCcchhHH-HHH-HHHHH----HHhhcccchhccccchhccccccccccCCCCCcC Q lcl|NC_019511. 1 MPDLFKSLRLGSMYKEDTEDLMVPIDDGIQAN-IRQ-IEQDT----KEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDK 74 (330) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~-~~~~~----~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~ 74 (330) |+| |..++.+......-.+|.+.++... ++. +.++- -++.+-..=-.|+.... .++. + T Consensus 1 ~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~l~~Yy~g~~~i~---------~~~~---~ 64 (470) T protein:vir:99 1 MKD----INYGRDKVTGNSSFIFPKGEKLTSNELLGFIAYNETVLKPRYRENMKLYLGKHKIL---------TAPE---K 64 (470) T ss_pred Ccc----ccCCcccccCCceEEeCCCCCcCHHHHHHHHHHHHHhhHHHHHHHHHHhccccccc---------cCcc---c Confidence 544 4555656555555556665444322 211 11110 01111111112222111 0111 0 Q ss_pred CCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhcc Q lcl|NC_019511. 75 KSYMRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTG 154 (330) Q Consensus 75 ~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~ 154 (330) . ...+. | +. ++....|+++.++-+ +..+.+ |. ..+. .+....+.+++. T Consensus 65 ~-~~~~~-------k-i~-~n~~~~Ivd~~~~~l--~g~p~~-------~~--~~~d--------~~~~~~l~~~~~--- 112 (470) T protein:vir:99 65 E-TGADN-------R-IV-VNSAKYVVDVYNGYF--CGIEPK-------LA--LLND--------SSKIDEIARWNR--- 112 (470) T ss_pred c-cCCcc-------e-ee-cchHHHHHHHHhhhh--ccCCee-------Ee--eCCc--------hhHHHHHHHHHH--- Confidence 0 00110 0 00 234455555544432 112222 11 1111 112233444432 Q ss_pred CCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcc-cCCceeEEEEeCCceE- Q lcl|NC_019511. 155 TDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKI-IKGGNRFVQVIDKQVV- 232 (330) Q Consensus 155 ~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~-~~~~~~Y~q~~~~~~~- 232 (330) ..++...+..+..+.+.+|.++..+..+. .|++ .+..++|..+.+..++.+.. ..-.++|+....++.. T Consensus 113 ------~n~~~~~~~~~~~~~~~~G~~~~~v~~d~--dg~~-~i~~~~p~~~~~i~d~~~~~~~~~~vr~~~~~~~~~~~ 183 (470) T protein:vir:99 113 ------QENFFDTINEISKQCDIFGRSIASIYQGE--DARP-HLMYSSPNHAFIIYDDTVQRQPLAFVHYQIDNSNNWTD 183 (470) T ss_pred ------hcCHhHHHHHHHHHHHhcCeeEEEEEeCC--CCeE-EEEEEccceeEEEEcCCCCcceEEEEEEEEEecCCeeE Confidence 12577788888999999998877665444 4554 57779999998887765421 1122233222222111 Q ss_pred ---EEechhHeeeecc--------------c-----CcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcce Q lcl|NC_019511. 233 ---ASFTSRELVMGIR--------------N-----PRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRG 290 (330) Q Consensus 233 ---~~~~~~dvih~~~--------------n-----~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~G 290 (330) ..++.+.+.++.. + |.-.+..+.+|.|-++-....|.....+-...+..+...+.|-= T Consensus 184 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~~~~ 263 (470) T protein:vir:99 184 AYGVIQYADKFYKFKGYDIEEDTNAAGYAINPYGLVPAVEFFENEERQGIFDSIKTLINALDKVISQKANQVEYFDNAYM 263 (470) T ss_pred EEEEEEecCeEEEEEecccccccccccccccCCCccceEeecCCCCCCcchHhHHHHHHHHHHHHHHHHHHHHHhcCcee Confidence 1122222222110 1 11122224457666655444444333222222333334444543 Q ss_pred EEEeCCCCCCCHHHHHHHHHHHHHH----hcCc--ccccccceee-C Q lcl|NC_019511. 291 ILQIRADQQQSQHALENFKREWKSS----FSGI--NGSWQICLYI-K 330 (330) Q Consensus 291 iL~~~~~~~ls~e~~e~lr~~w~~~----~~G~--~na~kvpvL~-e 330 (330) + ++|.. ++++........|... ..+. +..+.+--|. + T Consensus 264 ~--i~g~~-~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ 307 (470) T protein:vir:99 264 Y--MIGFK-LPEDDEGNPKFDFKNNRVLYVSQLDPDTNPQIGFIAKP 307 (470) T ss_pred e--eecCC-cccccccchhhhhhhcceeeecCCCCCCCCcceEEeec Confidence 3 33422 2221111111222221 0010 1111111111 1 No 160 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=66.42 E-value=0.27 Score=23.72 Aligned_cols=271 Identities=10% Similarity=0.027 Sum_probs=90.2 Q ss_pred hHHHHHhcCCCCCC--cccccCccCcchhHHH----HHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCc Q lcl|NC_019511. 4 LFKSLRLGSMYKED--TEDLMVPIDDGIQANI----RQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSY 77 (330) Q Consensus 4 ~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~ 77 (330) .|+.|+.-...+-. ......+........+ ..-..+..++.+...=-.|+......+.. ..+....... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~----~~~~~~~~~~- 75 (474) T protein:vir:97 1 MFNIIRMPWDKPYGEEVVEQLKPQFETQEEMIVRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKK----VDVHGNIDYD- 75 (474) T ss_pred CcccccccCCCchhhHHHHhhhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccch----hccccccccc- Confidence 45555322221111 1111222221111111 11111212222211111222211110000 0000000000 Q ss_pred ccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCC Q lcl|NC_019511. 78 MRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDK 157 (330) Q Consensus 78 ~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~ 157 (330) .++. +.. ++....|++..+.-+ |..+.+ +...| .+..+.+..++. T Consensus 76 ~~~~--------ki~-~n~~k~Ivd~~~~~l--~g~p~~---------~~~~d---------~~~~~~l~~~~~------ 120 (474) T protein:vir:97 76 KPDW--------RIT-TNFHQNLVDQKVSYV--ASKPVT---------YSCED---------ENVLKVIHDVLD------ 120 (474) T ss_pred cCcc--------eee-cchHHHHHHHHHhhh--hcCCce---------eccCc---------HHHHHHHHHHHh------ Confidence 0010 000 233444444433332 122222 21111 112233444331 Q ss_pred CCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCC-cccCCceeEEEEeCCceEEEec Q lcl|NC_019511. 158 DIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNG-KIIKGGNRFVQVIDKQVVASFT 236 (330) Q Consensus 158 pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G-~~~~~~~~Y~q~~~~~~~~~~~ 236 (330) .++...+..+..+++.+|.++..+..+. .|+ +.+..++|..+.++.++.. ....-.++|+...+......++ T Consensus 121 ----n~~~~~~~e~~~~~~~~G~~~~~~~~d~--~~~-~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~yt 193 (474) T protein:vir:97 121 ----TRWDNKLIDILTATSNKGIDWLQVYINE--NGE-MKLFRVPAEQAIPIWVDKEREELKSFIRYYKFNNEEKVEFWT 193 (474) T ss_pred ----ccHHHHHHHHHHHHhhcCceEEEEEecC--CCe-eEEEEEcccceEEEEcCCCCCceEEEEEEEEecCeEEEEEEe Confidence 2355666777899999998777665433 454 4577799999988866431 1111223333322222222333 Q ss_pred hhHeeeecc----------------------c-----CcCCCCCCCccccHHHHHHHHHHH---HHHHHHHHHHHHhcCC Q lcl|NC_019511. 237 SRELVMGIR----------------------N-----PRSDLNSSGYGLSEVEIAMKEFIA---YNNTESFNDRFFSHGG 286 (330) Q Consensus 237 ~~dvih~~~----------------------n-----~~~d~~~~~yGlSPIe~a~~~I~~---~laae~~~~~fF~nGa 286 (330) .+.+.+.+. + |.--+....+|.|-++-....|.. .++--.-...+|+ T Consensus 194 ~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~--- 270 (474) T protein:vir:97 194 DTTVTYYVLENGGLIPDYYYGANHVQSHFSNGNWGRVPFIAFKNNPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESV--- 270 (474) T ss_pred CCeEEEEEEcCCccccccccCcCcccccccccCCCccceEEecCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhc--- Confidence 333332210 1 111111133576665554444443 3333322233443 Q ss_pred CcceEEEeCCCCCCCHHHHHHHHHHHHHH--hcCcccccccceee-C Q lcl|NC_019511. 287 TTRGILQIRADQQQSQHALENFKREWKSS--FSGINGSWQICLYI-K 330 (330) Q Consensus 287 ~p~GiL~~~~~~~ls~e~~e~lr~~w~~~--~~G~~na~kvpvL~-e 330 (330) .|--+ ++|. ..+ ..+.+....+.. ....+++ .+-.|. + T Consensus 271 ~~~lv--~~g~-~~~--~~~~~~~~~~~~~~i~~~~~~-~~~~l~~~ 311 (474) T protein:vir:97 271 ELIYI--LKGY-EGE--DLEEFMRGLKYYKAINVDGDG-GVETIQVE 311 (474) T ss_pred Cceee--eecC-Ccc--cchhhhhhhhccceeeccCCC-ceeEEeec Confidence 34323 3332 111 111222111110 0000111 110111 1 No 161 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=66.42 E-value=0.27 Score=23.72 Aligned_cols=271 Identities=10% Similarity=0.027 Sum_probs=90.2 Q ss_pred hHHHHHhcCCCCCC--cccccCccCcchhHHH----HHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCc Q lcl|NC_019511. 4 LFKSLRLGSMYKED--TEDLMVPIDDGIQANI----RQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSY 77 (330) Q Consensus 4 ~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~ 77 (330) .|+.|+.-...+-. ......+........+ ..-..+..++.+...=-.|+......+.. ..+....... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~----~~~~~~~~~~- 75 (474) T protein:vir:94 1 MFNIIRMPWDKPYGEEVVEQLKPQFETQEEMIVRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKK----VDVHGNIDYD- 75 (474) T ss_pred CcccccccCCCchhhHHHHhhhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccch----hccccccccc- Confidence 45555322221111 1111222221111111 11111212222211111222211110000 0000000000 Q ss_pred ccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCC Q lcl|NC_019511. 78 MRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDK 157 (330) Q Consensus 78 ~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~ 157 (330) .++. +.. ++....|++..+.-+ |..+.+ +...| .+..+.+..++. T Consensus 76 ~~~~--------ki~-~n~~k~Ivd~~~~~l--~g~p~~---------~~~~d---------~~~~~~l~~~~~------ 120 (474) T protein:vir:94 76 KPDW--------RIT-TNFHQNLVDQKVSYV--ASKPVT---------YSCED---------ENVLKVIHDVLD------ 120 (474) T ss_pred cCcc--------eee-cchHHHHHHHHHhhh--hcCCce---------eccCc---------HHHHHHHHHHHh------ Confidence 0010 000 233444444433332 122222 21111 112233444331 Q ss_pred CCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCC-cccCCceeEEEEeCCceEEEec Q lcl|NC_019511. 158 DIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNG-KIIKGGNRFVQVIDKQVVASFT 236 (330) Q Consensus 158 pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G-~~~~~~~~Y~q~~~~~~~~~~~ 236 (330) .++...+..+..+++.+|.++..+..+. .|+ +.+..++|..+.++.++.. ....-.++|+...+......++ T Consensus 121 ----n~~~~~~~e~~~~~~~~G~~~~~~~~d~--~~~-~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~yt 193 (474) T protein:vir:94 121 ----TRWDNKLIDILTATSNKGIDWLQVYINE--NGE-MKLFRVPAEQAIPIWVDKEREELKSFIRYYKFNNEEKVEFWT 193 (474) T ss_pred ----ccHHHHHHHHHHHHhhcCceEEEEEecC--CCe-eEEEEEcccceEEEEcCCCCCceEEEEEEEEecCeEEEEEEe Confidence 2355666777899999998777665433 454 4577799999988866431 1111223333322222222333 Q ss_pred hhHeeeecc----------------------c-----CcCCCCCCCccccHHHHHHHHHHH---HHHHHHHHHHHHhcCC Q lcl|NC_019511. 237 SRELVMGIR----------------------N-----PRSDLNSSGYGLSEVEIAMKEFIA---YNNTESFNDRFFSHGG 286 (330) Q Consensus 237 ~~dvih~~~----------------------n-----~~~d~~~~~yGlSPIe~a~~~I~~---~laae~~~~~fF~nGa 286 (330) .+.+.+.+. + |.--+....+|.|-++-....|.. .++--.-...+|+ T Consensus 194 ~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~--- 270 (474) T protein:vir:94 194 DTTVTYYVLENGGLIPDYYYGANHVQSHFSNGNWGRVPFIAFKNNPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESV--- 270 (474) T ss_pred CCeEEEEEEcCCccccccccCcCcccccccccCCCccceEEecCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhc--- Confidence 333332210 1 111111133576665554444443 3333322233443 Q ss_pred CcceEEEeCCCCCCCHHHHHHHHHHHHHH--hcCcccccccceee-C Q lcl|NC_019511. 287 TTRGILQIRADQQQSQHALENFKREWKSS--FSGINGSWQICLYI-K 330 (330) Q Consensus 287 ~p~GiL~~~~~~~ls~e~~e~lr~~w~~~--~~G~~na~kvpvL~-e 330 (330) .|--+ ++|. ..+ ..+.+....+.. ....+++ .+-.|. + T Consensus 271 ~~~lv--~~g~-~~~--~~~~~~~~~~~~~~i~~~~~~-~~~~l~~~ 311 (474) T protein:vir:94 271 ELIYI--LKGY-EGE--DLEEFMRGLKYYKAINVDGDG-GVETIQVE 311 (474) T ss_pred Cceee--eecC-Ccc--cchhhhhhhhccceeeccCCC-ceeEEeec Confidence 34323 3332 111 111222111110 0000111 110111 1 No 162 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=66.04 E-value=0.27 Score=23.67 Aligned_cols=240 Identities=12% Similarity=0.066 Sum_probs=92.1 Q ss_pred ccCccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCc-CCCcccchHHHHHHHHHHhhcHHHHH Q lcl|NC_019511. 21 LMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRD-KKSYMRNAHNLHEVLKKFGNNSILNA 99 (330) Q Consensus 21 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~-~~s~~r~~~~~~~~Lr~~a~~~iv~a 99 (330) +..+..+.+.--+++++.+..++.+-..=-.|++.. +.+.. .+..+ ..++. .+..... T Consensus 1 ~~~~~~~~i~~l~~~~~~~~~r~~~l~~Yy~G~~~i------------~~~~~~~~~~~-------~~~k~--~~n~~~~ 59 (441) T protein:vir:80 1 MNSDELALIEGMYDRIQRLSSWHCCIEGYYEGSNRV------------RDLGVAIPPEL-------QRVQT--VVSWPGI 59 (441) T ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcc------------hhcCcccchhh-------hhhhh--hcchHHH Confidence 111111223333333333322222111111222110 00100 11111 12221 1344556 Q ss_pred HHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcC Q lcl|NC_019511. 100 IIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYD 179 (330) Q Consensus 100 ~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g 179 (330) |+++.++.+- ..||... + + +.++++ +. ..++......+..+++++| T Consensus 60 ivd~~~~~l~-----------~~g~~~~----d----~---~~l~~i---~~---------~n~~~~~~~~~~~~~~~~G 105 (441) T protein:vir:80 60 AVDALEERLD-----------WLGWTNG----D----G---YGLDGV---YA---------ANRLATASCDVHLDALIFG 105 (441) T ss_pred HHHHHHhhhc-----------cccccCC----C----h---HHHHHH---HH---------hcCHHHHHHHHHHHHhhcC Confidence 6665555441 2354311 1 1 122222 21 1257777788889999999 Q ss_pred CceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCce--EEEechhH------------------ Q lcl|NC_019511. 180 QVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQV--VASFTSRE------------------ 239 (330) Q Consensus 180 ~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~--~~~~~~~d------------------ 239 (330) .+|.++. .+..|.+ .+..++|..+.++.|+........+++++...++. +..|..+. T Consensus 106 ~a~~~v~--~d~~g~~-~i~~~~p~~~~~i~d~~~~~~~~~~~~~~~~~~~~~~~~vy~~~~~~~~~~~~~~~~~~~~~~ 182 (441) T protein:vir:80 106 LSFVAII--PHGDGTV-SVRPQSPKNCTGKFSADGSRLDAGLVVQQTCDPEVVEAELLLPDVIVQVERRGSREWVEVDRI 182 (441) T ss_pred eeEEEEE--eCCCCce-EEEEEccceEEEEEeCCCCceeEEEEEEEEecCceEEEEEEecCeEEEEEEcCCcceeecccc Confidence 8877654 4555765 57889999988876654321111111111111111 11122222 Q ss_pred --------eeeecccCcCCCCCCCccccHH----HHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHH Q lcl|NC_019511. 240 --------LVMGIRNPRSDLNSSGYGLSEV----EIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALEN 307 (330) Q Consensus 240 --------vih~~~n~~~d~~~~~yGlSPI----e~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~ 307 (330) |+|+..++.. .+++|.|.| .....++...+.--.....||++ |-=++ .| ..++++..+. T Consensus 183 ~~~~g~vPvv~~~n~~~~---~~~~G~s~l~~~v~~liDa~~~~~s~~~~~~~~~~~---~~~~i--~G-~~~~~~~~~~ 253 (441) T protein:vir:80 183 PNVLGAVPLVPIVNRRRT---SRIDGRSEITRSIRAYTDEAVRTLLGQSVNRDFYAY---PQRWV--TG-VSADEFSQPG 253 (441) T ss_pred ccCCCceeEEEeeccccC---CccCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcC---ceeee--ec-CCccccccch Confidence 2333322221 234677744 34455555555554445566654 22122 12 1222221111 Q ss_pred H----HHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 308 F----KREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 308 l----r~~w~~~~~G~~na~kvpvL~e 330 (330) . -+-|. ..+.. .+..|-+.+ T Consensus 254 ~~~~~~~i~~--~~~~~-~~~~~~~~~ 277 (441) T protein:vir:80 254 WVLSMASVWA--VDKDD-DGDTPNVGS 277 (441) T ss_pred hhhccccccc--CCCCC-CCCcceeEe Confidence 1 11111 01101 111122222 No 163 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=62.31 E-value=0.34 Score=23.17 Aligned_cols=266 Identities=9% Similarity=0.064 Sum_probs=93.6 Q ss_pred Cchh------------HHHHHhcCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccC Q lcl|NC_019511. 1 MPDL------------FKSLRLGSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTN 68 (330) Q Consensus 1 ~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~ 68 (330) |+|+ +++|+......+ +.+...+++...+.-++.+...=-.|+......+. + T Consensus 1 ~~~~~~~~~~~~~~e~~~~~~~~~~~~~----------~~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~------~ 64 (478) T protein:vir:10 1 MISINWPWDKPYHEQVVEQIKPKYETQE----------EMILRLVREHKENIDNITMGERYYNHHPDILDAPP------K 64 (478) T ss_pred CccccCCCCchhHHHHHHHHhhccCCcH----------HHHHHHHHHHHHHHHHHHHHHHHhcCCCchhcccc------c Confidence 5544 555543322211 12222222222222222222222223321111100 0 Q ss_pred CCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHH Q lcl|NC_019511. 69 PDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEE 148 (330) Q Consensus 69 p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~ 148 (330) +.+...+- .+..+ .| .+ +++...|+++.+.-+ |..+.++ ... + + +....+.+ T Consensus 65 ~~~~~~~~--~~~~~----~k-i~-~n~~~~ivd~~~~~l--~g~~~~~---------~~~--~----d---~~~~~l~~ 116 (478) T protein:vir:10 65 RDVNGDYD--ETKPD----WR-MY-TNYHQNLVDQKVAYA--VANPVTF---------GVD--N----D---KALKQIQH 116 (478) T ss_pred cccccccc--ccccc----ce-ec-cchHHHHHHHHHhhh--ccCCeee---------ecC--C----h---HHHHHHHH Confidence 00000000 00000 00 00 223444444433322 1222221 111 1 1 12334444 Q ss_pred HHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCC-cccCCceeEEEEe Q lcl|NC_019511. 149 FILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNG-KIIKGGNRFVQVI 227 (330) Q Consensus 149 ~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G-~~~~~~~~Y~q~~ 227 (330) ++. | ++.+.+..+..+++.+|.++..+..+.+ |++ .+..++|..+.++.++.. ....-.++|+... T Consensus 117 ~~~-------n---~~~~~~~~~~~~~~~~G~~~~~~~~d~~--g~~-~~~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~ 183 (478) T protein:vir:10 117 TLN-------H---KWDDKLVDILTAASNKGIEWVQPYVDEE--GEF-KTFRVPAEQAVPIWTNKERDELQAFIRVYELD 183 (478) T ss_pred HHh-------c---CHHHHHHHHHHHHHhcCeEEEEEEecCC--Cee-EEEEEcccceEEEEcCCCCCceEEEEEEEEec Confidence 431 1 4566677778999999988877655444 553 577789999988766432 1112223333322 Q ss_pred CCceEEEechhHeeeecc--------------------------c-----CcCCCCCCCccccHHHH---HHHHHHHHHH Q lcl|NC_019511. 228 DKQVVASFTSRELVMGIR--------------------------N-----PRSDLNSSGYGLSEVEI---AMKEFIAYNN 273 (330) Q Consensus 228 ~~~~~~~~~~~dvih~~~--------------------------n-----~~~d~~~~~yGlSPIe~---a~~~I~~~la 273 (330) ....+..++.+++.|... + |.-.+..+.+|.|-++- ...++...++ T Consensus 184 ~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~sd~~~v~~liDa~~~~~S 263 (478) T protein:vir:10 184 GAERVEYWTKDDVTYYELKEGQLIPDFYRSDDHIQPHYYQGNKLMSWGRVPFIPFKNNPQEVSDLFMYKTIIDALDKRLS 263 (478) T ss_pred CceEEEEEeCCeEEEEEEcCCeeeccccccccccccceecccccccCCccceEEeccCCCCCCcHHHHHHHHHHHHHHHH Confidence 222222333333332211 0 11111123456665554 3334443333 Q ss_pred HHHHHHHHHhcCCCcceEEEeCCCCCCC--HHHHHHHHHHHHHHhcCccccccccee-eC Q lcl|NC_019511. 274 TESFNDRFFSHGGTTRGILQIRADQQQS--QHALENFKREWKSSFSGINGSWQICLY-IK 330 (330) Q Consensus 274 ae~~~~~fF~nGa~p~GiL~~~~~~~ls--~e~~e~lr~~w~~~~~G~~na~kvpvL-~e 330 (330) --.-...+|+ .|- +.+.|. ..+ .+....++....-...|..++ .+-.| .+ T Consensus 264 ~~~~~~~~~~---~p~--~~~~g~-~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~l~~~ 316 (478) T protein:vir:10 264 DTQNTFDESV---ELI--YILKGY-EGEDMKDFMHNLKYYKAISVAGESGS-GVDTIKVE 316 (478) T ss_pred HHHHHHHHhh---Cce--eeeecC-CccccchhhhhhhhcceEEecCCCCC-cceEEeec Confidence 3222234443 343 222231 111 111222222111112221111 12111 11 No 164 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=61.31 E-value=0.36 Score=23.04 Aligned_cols=226 Identities=11% Similarity=0.038 Sum_probs=82.1 Q ss_pred hccccccccccCCCCCcCCCcccchHHHHHHHHHHhh--cHHHHHHHHH------HHHhHhhhhhhheecccccceeeec Q lcl|NC_019511. 57 YAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGN--NSILNAIIIT------RANQVSTYCKPARYSEKGVGFEVKL 128 (330) Q Consensus 57 ~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~--~~iv~a~I~~------~~d~Ia~~~~~~~~~~~~~g~~v~~ 128 (330) +.+..+.... ..+......++.+..|.+ .++-+.-+++ ..+.+.-+|..+... T Consensus 1 ~~~~~i~~L~---------~~~~~~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~iVds---------- 61 (409) T protein:vir:94 1 MTEKGIGYLR---------FKLSVHKRRAEMRYDQYAMKYVDRFKGITIPQALSQQYRSILGWCAKGVDS---------- 61 (409) T ss_pred CCHHHHHHHH---------HHHHHHhHHHHHHHHHhcccCchhhcChhhhHHHHHHHhhhcchhHHHHHH---------- Confidence 0000000000 000000001112222221 1111100000 001111122222111 Q ss_pred cCCCcccChhhHHHHHHHHHHHHhccCCCCC-------CcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEee Q lcl|NC_019511. 129 KDLDATPGIKEKEQMKRIEEFILNTGTDKDI-------DRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAV 201 (330) Q Consensus 129 kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn-------~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pl 201 (330) +.+.+.-.+...++ ...++......+..+.|++|.+|+.+. .+..|+| .+.++ T Consensus 62 -----------------~a~rl~~~Gf~~~d~~l~~i~~~N~ld~~~~~~~~~aliyG~sf~~v~--~~~dg~~-~i~~~ 121 (409) T protein:vir:94 62 -----------------LADRLVFREFENDDFTVNEIFEENNPDIFFDSAVLSSLIASCSFTYIS--KGENDAV-RLQVI 121 (409) T ss_pred -----------------hHhhcccCcccCCchHHHHHHHhcChhHHHHHHHHHHHHhcceeEEEe--cCCCCce-EEEEe Confidence 11111111111111 112355566788899999998777664 4445655 57788 Q ss_pred CCCceEEeeCCCCcccCCceeEEEEeCCceE---EEechhH----------------------eeeecccCcCCCCCCCc Q lcl|NC_019511. 202 DPSTIFYATDKNGKIIKGGNRFVQVIDKQVV---ASFTSRE----------------------LVMGIRNPRSDLNSSGY 256 (330) Q Consensus 202 dp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~---~~~~~~d----------------------vih~~~n~~~d~~~~~y 256 (330) +|..+.+..|+.-+.+...++|.+...++.. ..+.+++ |+++..++.. .++| T Consensus 122 sp~~~~~i~D~~~~~~~~a~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~g~vPvV~f~n~~~~---~~~~ 198 (409) T protein:vir:94 122 EAVNATGIIDPITGLLTEGYAVLERDENNNVVLEAHFLPDRTDYYYRDSRNNISIANPTGHPLLVPIIHRPDA---VRPF 198 (409) T ss_pred ccceEEEEEecCCCceeeeEEEEEecCCCceEEEEEEecCcEEEEEecCceeEeeeCCCCCcceEEecccccc---cccc Confidence 9988877766532222222222221111110 1122222 2333333322 2567 Q ss_pred cccH----HHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccc-cccceeeC Q lcl|NC_019511. 257 GLSE----VEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGS-WQICLYIK 330 (330) Q Consensus 257 GlSP----Ie~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na-~kvpvL~e 330 (330) |.|. +.....++.-.+.--...+.||++-- .-++-... +.+..+.++..-.....-..++ +..|=+-| T Consensus 199 G~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pq--r~i~G~d~----d~~~~~~~~~~~~~i~~~~~d~dg~~~~v~q 271 (409) T protein:vir:94 199 GRSRITRSGMYWQSNAKRTLERADVTAEFYSFPQ--KYVTGLSD----DAEPMETWKATVSSMLQFTKDEDGDKPTLGQ 271 (409) T ss_pred CccccchhHHHHHHHHHHHHHHHHHHHHHhcChh--heeEecCC----CCcccchhhhhHHHhhcCCCCCCCCCceEEe Confidence 8774 45566666666666666677777631 12221111 1122223332222222111111 11233333 No 165 >protein:vir:6596 Length: 521 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891727;genbank:gi:33620636;genbank:GeneID:1725288 Probab=60.88 E-value=0.36 Score=22.99 Aligned_cols=292 Identities=14% Similarity=0.142 Sum_probs=127.5 Q ss_pred hHHHHHhcCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcc---cchhccccchhcc---ccc--cccccCCCCCcCC Q lcl|NC_019511. 4 LFKSLRLGSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEI---TKSLYGKQQAYAE---PFL--EMMDTNPDYRDKK 75 (330) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~g~~~~~~~---~~~--~~~~~~p~~~~~~ 75 (330) .|--|+.++.-.+.+ ++. .+++...-..+ +....|....... |.. +.+... ....- T Consensus 1 ~~~~l~~~~~~~~~d----------~~~----~~e~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~~~g~~~~-~~~~e- 64 (521) T protein:vir:65 1 MFSRLKMLARWADFD----------NDK----YEEQIKDKAESIAAPKNNDGATEVEINDNSPASSWNSLTQQ-FYSTD- 64 (521) T ss_pred CccchhhhhhccCch----------hhH----HHhhhccCCCcccCCCCCCCceeecccCCccccccccceee-ecccc- Confidence 222222222211110 000 11111111111 1222232222111 000 000000 00111 Q ss_pred CcccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccC Q lcl|NC_019511. 76 SYMRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGT 155 (330) Q Consensus 76 s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~ 155 (330) ...++-....+.-|..|.+|-|..+|+.+.+++. ..+..+---++.+. +-+.++...++|..--+.+.+++. T Consensus 65 ~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneai------v~d~~~~pV~l~L~--~~~~s~~iK~kI~eeF~~Il~ll~ 136 (521) T protein:vir:65 65 QKISTTKQLVNTYRGLMNNHEVENAVQNIVNDAI------VFEEGHEVVSLNLE--ATGFSESVKERIHEEFKDLLNTIQ 136 (521) T ss_pred chhhhHHHHHHHHHHHhhccchhhHHHHhhccee------EecCCCceEEEEec--ccccchHHHHHHHHHHHHHHHHhc Confidence 1234555566667888888999999999998874 22322222344453 344667766666654444444433 Q ss_pred CCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeC-----CCCcccCCce--eEEEEe- Q lcl|NC_019511. 156 DKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATD-----KNGKIIKGGN--RFVQVI- 227 (330) Q Consensus 156 ~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d-----~~G~~~~~~~--~Y~q~~- 227 (330) ... +..++ ++.-++-|..|+-++++.+.+.-+.+|..|||..|+.+.- ..|....++. .|+|.. T Consensus 137 F~~----~~~~~----fR~WYVDgRi~fhkiid~~pk~GI~ELr~lDPr~i~~vr~i~k~~~~~~~v~~~~~e~f~Y~~~ 208 (521) T protein:vir:65 137 FDR----RGQDM----FRRWYVDSRIFFHKIIGKNPKDGIVELRQLDPRNLEYVREIITEDTPEGKIYKATKEYFIYTVG 208 (521) T ss_pred cch----hhhHH----HhhhhhcceeEEEEEEcCCccccceeeeeeCCcceeeeeeecccccCCcceecceeeeeeeecC Confidence 222 22233 4555677889999998766666689999999998876522 1121111221 233322 Q ss_pred -----------CCceEEEechhHeeeecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCC Q lcl|NC_019511. 228 -----------DKQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRA 296 (330) Q Consensus 228 -----------~~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~ 296 (330) ..+....+..+-|.|.+.+-. |. .++.=+|=+..|...+......|.-.-=|==--|.-+=|.-+.. T Consensus 209 ~~~~~~~g~~~~~~~~vkI~~dAI~y~hSGl~-d~-~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDv 286 (521) T protein:vir:65 209 NSSYCAGGQVFSPNSRVKIPRSAITYAHSGLM-DC-DDKYIIGYLHRAVKPANQLKLLEDAMVVYRITRAPERRVFFIDT 286 (521) T ss_pred CcceeccceeecCCcceeechhheeeeeccce-eC-CCCeeeecchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEec Confidence 111123444444544432211 11 12233466788877777655555432211112233333443333 Q ss_pred CCCCCHHHHHHHHHHHHHHhcC----------cccccccceeeC Q lcl|NC_019511. 297 DQQQSQHALENFKREWKSSFSG----------INGSWQICLYIK 330 (330) Q Consensus 297 ~~~ls~e~~e~lr~~w~~~~~G----------~~na~kvpvL~e 330 (330) + +|.+...++.-+.....|.- +.|..+..-++| T Consensus 287 G-nlPk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~msMlE 329 (521) T protein:vir:65 287 G-NMNNRKAAQHMNSVAQSFKNRVVYDASTGKLKNQQANLSMTE 329 (521) T ss_pred C-CCCchhHHHHHHHHHHhcCceeEeecccccccccccccchhh Confidence 3 34444444433333333321 122223333333 No 166 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=60.67 E-value=0.37 Score=22.96 Aligned_cols=273 Identities=11% Similarity=0.069 Sum_probs=92.9 Q ss_pred CchhHHHHHhcCCCCCCcccccCccCcchhHHHHHHHHH----HHHhhcccchhccccchhccccccccccCCCCCcCCC Q lcl|NC_019511. 1 MPDLFKSLRLGSMYKEDTEDLMVPIDDGIQANIRQIEQD----TKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKS 76 (330) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s 76 (330) .-++..+-+.+..+-.+..+.. ..++ ...-.+-+.++ ..++.+-..=-.|++... . .++. + ++. T Consensus 15 ~~~~~~~~~~~~~~~~~~~~~~-~~~~-~~~i~~~i~~~~~~~~~r~~~~~~yY~g~~~~i----~----~~~~-~-~~~ 82 (501) T protein:vir:96 15 VLNLRFHRESRIRYRADNLEEL-MVNN-WELLKNFINHHKLRQAPRIQELLDYARGENHDV----L----KSGR-R-KDN 82 (501) T ss_pred ccccccchhHHhhhcccccccc-cCCh-HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcc----c----Cccc-c-Ccc Confidence 1122111222222222211111 1111 11111112211 111222111112322110 0 0000 0 000 Q ss_pred cccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCC Q lcl|NC_019511. 77 YMRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTD 156 (330) Q Consensus 77 ~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~ 156 (330) ...+. | . ..+....|+++.+.-+ |..+. .+...+. +.. .+....+.+++.. T Consensus 83 ~~~~~-------r-i-~~n~~k~Ivd~~~~yl--~g~p~---------~~~~~~~--~~~---~~~~~~l~~~~~~---- 133 (501) T protein:vir:96 83 EMADK-------R-A-VHNYGRMISKFKTGYL--AGNPI---------RVEYDDN--DDN---SQNDDAIKRIGRI---- 133 (501) T ss_pred ccccc-------e-e-ecchHHHHHHHHhhhh--cccCe---------eEeeCCc--cch---hHHHHHHHHHHHh---- Confidence 00010 0 0 1344455555544332 12222 2222211 111 1122334444321 Q ss_pred CCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCC-cccCCceeEEEEeC--Cc--e Q lcl|NC_019511. 157 KDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNG-KIIKGGNRFVQVID--KQ--V 231 (330) Q Consensus 157 ~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G-~~~~~~~~Y~q~~~--~~--~ 231 (330) .++...+..+..+++.+|.++..+..+.+ |. +.+..++|..+.++.++.. ....-.++|++... ++ . T Consensus 134 -----n~~~~~~~~~~~~~~~~G~a~~~v~~ded--g~-~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~ 205 (501) T protein:vir:96 134 -----NDLDSLNRTLIRDLSQTGRAYEVIYRSEY--DE-TRIKRLSPLETFVIYDNSLEDNSIAAVRYYNRGTLQSAKDV 205 (501) T ss_pred -----cCHHHHHHHHHHHHhhcCeEEEEEEEcCC--Cc-eEEEEEccceeEEEEcCCCCCceEEEEEEEEeecCCCcEEE Confidence 25667788889999999988777655444 54 3577799999888876531 22222344443321 11 1 Q ss_pred EEEechhHeeeecc-----------cCcC-----CCCCCCccccHHHHHHHHHH---HHHHHHHHHHHHHhcCCCcceEE Q lcl|NC_019511. 232 VASFTSRELVMGIR-----------NPRS-----DLNSSGYGLSEVEIAMKEFI---AYNNTESFNDRFFSHGGTTRGIL 292 (330) Q Consensus 232 ~~~~~~~dvih~~~-----------n~~~-----d~~~~~yGlSPIe~a~~~I~---~~laae~~~~~fF~nGa~p~GiL 292 (330) ...++.+.+.+... ++.. -+..+..|.|.++-....|. ..++--.-...+|+ .|-=++ T Consensus 206 ~~vyt~~~i~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~---~~~l~i 282 (501) T protein:vir:96 206 VEIYTDEHIYTLDASDDFNEISVTTHAFGTVPITEYLNNIDGIGDYETELYLIDLYDSAESDTANHMSDMA---DAILAI 282 (501) T ss_pred EEEEcCCcEEEEeeCCCceeccccccCCCccceEEecCCccCCCchhhhHHHHHHHHHHHHHHHHHHHHhc---Cceeee Confidence 12334444433221 1111 11113357776665444444 33333333333443 332222 Q ss_pred EeCCCCCC-CHHHHHHHH--------------------------------------HHHHHHhcCcccccccc-eeeC Q lcl|NC_019511. 293 QIRADQQQ-SQHALENFK--------------------------------------REWKSSFSGINGSWQIC-LYIK 330 (330) Q Consensus 293 ~~~~~~~l-s~e~~e~lr--------------------------------------~~w~~~~~G~~na~kvp-vL~e 330 (330) .|.... ..+....++ +...+..... + .+| +-.+ T Consensus 283 --~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~--s-~~p~~~~~ 355 (501) T protein:vir:96 283 --YGDLALPKGMQASDMKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIF--T-NTPDMSDT 355 (501) T ss_pred --ecccccCcccchhhhhhcCeeeecccccccccccCcceeeEeccCCHHHHHHHHHHHHHHHHHH--h-CCcccCcc Confidence 221100 111112221 1111111110 0 011 1111 No 167 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=60.66 E-value=0.37 Score=22.96 Aligned_cols=248 Identities=8% Similarity=0.039 Sum_probs=90.0 Q ss_pred ccccCccCc------chhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHHH- Q lcl|NC_019511. 19 EDLMVPIDD------GIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKF- 91 (330) Q Consensus 19 ~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~- 91 (330) ....+|.+. .+..-++.+.++..++.+-..=-.|++.. +.... ... +.++.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~rl~~l~~Yy~G~~~i------------~~~~~------~~~---~~~~~~~ 59 (484) T protein:vir:77 1 MTSPLQKQENVDPEKAREEMLNLFTERTQDLGDNTAYYESERRP------------DAVGV------TVP---QQMQKLL 59 (484) T ss_pred CCCcccccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccccc------------hhccc------ccc---hhHHhhh Confidence 222222322 22223344444433332221111122110 00000 000 111222 Q ss_pred hhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHH Q lcl|NC_019511. 92 GNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKI 171 (330) Q Consensus 92 a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~ 171 (330) +.+.....|+++.++.+- ..||.+- + +. . ..+.+.+++.. .++......+ T Consensus 60 ~~~n~~~~ivd~~~~~l~-----------~~g~~~~--~-~~-~------~~~~l~~i~~~---------N~~d~~~~~~ 109 (484) T protein:vir:77 60 AHVGYPRLYIDAIAARQE-----------LEGFRLG--G-AD-K------ADEQLWDWWQA---------NDLDIESTLG 109 (484) T ss_pred hhcCcHHHHHHHHHhhhc-----------cCceecC--C-cc-h------hHHHHHHHHHh---------cCHhHHHHHH Confidence 124556677777666542 2355431 1 11 1 11223333221 2466677788 Q ss_pred HHHHHhcCCceeEEEEecCCCcce-------EEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceE---EEechhH-- Q lcl|NC_019511. 172 VRDTYTYDQVNFEKVFSPKNKTKM-------EKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVV---ASFTSRE-- 239 (330) Q Consensus 172 v~d~L~~g~g~~~~v~~rd~~G~~-------~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~---~~~~~~d-- 239 (330) ..+++++|.+|..+..+.+ |.+ ..+.+++|..+.++.|+.-....-.++|++..+++.+ ..|+.+. T Consensus 110 ~~~a~~~G~a~~~v~~~~~--~~~~~~~~~~~~i~~~~p~~~~~~~D~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~ 187 (484) T protein:vir:77 110 HTDSLVHGRSYITISKPDP--NIDPGVDPEVPIIRVEPPTNLYAQIDPRTRQVMRAIRAIEDEEGNEVIGATLYLPNNTV 187 (484) T ss_pred HHHHhhcCceEEEEecCCC--CcccccccccceEEEeccceeEEEecCCCCceEEEEEEEEeecCCcEEEEEEEecCeEE Confidence 8999999988777654433 322 2467788988877766432111122223222222111 1122222 Q ss_pred -----------------------eeeecccCcCCCCCCCccccHHH----HHHHHHHHHHHHHHHHHHHHhcCCCcceEE Q lcl|NC_019511. 240 -----------------------LVMGIRNPRSDLNSSGYGLSEVE----IAMKEFIAYNNTESFNDRFFSHGGTTRGIL 292 (330) Q Consensus 240 -----------------------vih~~~n~~~d~~~~~yGlSPIe----~a~~~I~~~laae~~~~~fF~nGa~p~GiL 292 (330) |++++.|+.. .+++|.|.|+ ....++...+.--.-...||+. |.=++ T Consensus 188 ~~~~~~~~~~~~~~~~~~~g~vPvv~f~N~~~~---~~~~G~s~i~~~v~~L~Da~~~~~s~~~~~~~~~a~---p~~~i 261 (484) T protein:vir:77 188 IWNREDGQWVQVANVAHNLEMVPVIPIPNRTRL---SDLYGTTEITPELRSVTDAAARTLMLMQATAELMGV---PQRLL 261 (484) T ss_pred EEEecCCceEeeccccCCCCCcceEEecccccc---CccCCcccchHHHHHHHHHHHHHHHHHHHHHHhhhh---hHHHH Confidence 2333322221 1345777554 3334444444433344455543 21111 Q ss_pred EeCCCCCCCHHHH--HHHHHHHHHHhcC---cccccccceeeC Q lcl|NC_019511. 293 QIRADQQQSQHAL--ENFKREWKSSFSG---INGSWQICLYIK 330 (330) Q Consensus 293 ~~~~~~~ls~e~~--e~lr~~w~~~~~G---~~na~kvpvL~e 330 (330) .|. .+++... +.-..-|+...+- ..+. . |=+.+ T Consensus 262 --~G~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~-~~~~q 299 (484) T protein:vir:77 262 --FGV-KGEELGVDPETGQTLFDAYLARILAFEDH-E-SKAQQ 299 (484) T ss_pred --hCC-CcchhcccccccchhhhhhhhhhcccCCC-C-ceeEe Confidence 110 0000000 0000112211100 0000 0 00111 No 168 >protein:vir:81017 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469501;genbank:gi:157311458;genbank:GeneID:5602316 Probab=59.69 E-value=0.39 Score=22.84 Aligned_cols=294 Identities=13% Similarity=0.138 Sum_probs=124.7 Q ss_pred hHHHHHhcCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcc---cchhccccchhcc---ccccccccCCCCCcCCCc Q lcl|NC_019511. 4 LFKSLRLGSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEI---TKSLYGKQQAYAE---PFLEMMDTNPDYRDKKSY 77 (330) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~g~~~~~~~---~~~~~~~~~p~~~~~~s~ 77 (330) .|--|+.++.--.- + .+..+++......+ ++...|....-.. |.........-+..--.. T Consensus 1 ~~~~l~~~~~~~~~----------~----~~~~~~~~~~~~~s~~~P~~~dGa~~i~~~~~~~~~~~gg~~~~~~~~e~~ 66 (521) T protein:vir:81 1 MFSRLKMLARWADF----------D----NDKYEEQIKDKAESIAAPKNNDGATEVEINDNLPASAWNSLTQQFYSTDQK 66 (521) T ss_pred CcchhhhhHhhcCc----------h----hhhHHhhhccCccccccCCCCCCceEecccCCCcceeecceeeeecccccc Confidence 11111111110000 0 00011111100011 1111122111000 000000000000000112 Q ss_pred ccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCC Q lcl|NC_019511. 78 MRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDK 157 (330) Q Consensus 78 ~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~ 157 (330) .++-....+.-|..|.+|-|..+|+.+.+++. ..+..+---++.+. +-+.++...++|..--+.+.+++... T Consensus 67 ~~~~~eLI~~YR~ma~~pEvd~Av~eIVneai------v~d~~~~pV~l~L~--~~~~s~~iK~kI~eeF~~Il~ll~F~ 138 (521) T protein:vir:81 67 ISTTKQLVNTYRGLMNNHEVENAVQNIVNDAI------VFEEGHEVVSLNLE--ATGFSESVKERIHEEFKDLLNTIQFD 138 (521) T ss_pred hhhHHHHHHHHHHHhhccchhhHHHHhhccee------EecCCCceEEEEec--ccccchHHHHHHHHHHHHHHHHhccc Confidence 34555666677888889999999999998874 22332223344453 34467776666665444444443322 Q ss_pred CCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeC-----CCCcccCCce--eEEEEeC-- Q lcl|NC_019511. 158 DIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATD-----KNGKIIKGGN--RFVQVID-- 228 (330) Q Consensus 158 pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d-----~~G~~~~~~~--~Y~q~~~-- 228 (330) . +..++ ++.-++-|..|+-++++.+.+.-+.+|..|||..|+.+.- ..|....++. .|+|... T Consensus 139 ~----~~~~~----fR~WYVDgRi~fhkiid~~pk~GI~Elr~lDPr~i~~vr~i~k~~~~~~~v~~~~~e~f~Y~~~~~ 210 (521) T protein:vir:81 139 R----RGQDM----FRRWYVDSRIFFHKIIGKNPKDGIVELRQLDPRNLEYVREIITEDTPEGKIYKATKEYFIYTVGNS 210 (521) T ss_pred h----hhhHH----HhhhhhcceEEEEEEEcCCccccceeeeeeCCcceeeeeeecccccCccceecceeeeeeeecCCc Confidence 2 22233 4555677889999998766666689999999998876422 1221111121 2333221 Q ss_pred ----------CceEEEechhHeeeecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCC Q lcl|NC_019511. 229 ----------KQVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQ 298 (330) Q Consensus 229 ----------~~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~ 298 (330) .+....+..+-|.|.+.+-. |.+ ++.=+|=+..|...+......|.-.-=|==--|.-+=|.-+..+ T Consensus 211 ~~~~~g~~~~~~~~vkI~~dAI~y~hSGl~-d~~-~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvG- 287 (521) T protein:vir:81 211 SYCAGGQVFSPNSRVKIPRSAITYAHSGLM-DCD-DKYIIGYLHRAVKPANQLKLLEDAMVVYRITRAPERRVFFIDTG- 287 (521) T ss_pred cccccceeecCCcceeechhheeeeeccce-eCC-CCeeeecchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecC- Confidence 11123344444544432211 111 22334667888777776555554322111122333334433333 Q ss_pred CCCHHHHHHHHHHHHHHhcC----------cccccccceeeC Q lcl|NC_019511. 299 QQSQHALENFKREWKSSFSG----------INGSWQICLYIK 330 (330) Q Consensus 299 ~ls~e~~e~lr~~w~~~~~G----------~~na~kvpvL~e 330 (330) +|.+...++.-+.....|.- +.|..+..-++| T Consensus 288 nlpk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~msMlE 329 (521) T protein:vir:81 288 NMNNRKAAQHMNSVAQSFKNRVVYDASTGKLKNQQANLSMTE 329 (521) T ss_pred CCCchhHHHHHHHHHHhcCceeEeecccccccccccccchhh Confidence 34444444433333333321 122223333333 No 169 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=59.24 E-value=0.4 Score=22.79 Aligned_cols=262 Identities=9% Similarity=-0.038 Sum_probs=96.6 Q ss_pred CCCCCcccccCcc-CcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHHH Q lcl|NC_019511. 13 MYKEDTEDLMVPI-DDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKF 91 (330) Q Consensus 13 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~ 91 (330) .-+-+ -..+|. +++...-+..+-++.. ...+-.-+...|-+- ..+++.-+ .+....+..++. T Consensus 1 ~~~~~--~~~~~gl~~~~~~~~~~L~~~~~----~~~~~~~~~~~Yy~G---------~~~~~~~~-~~~p~~~r~~~~- 63 (474) T protein:vir:81 1 MIQQQ--TVRIPSLSNDENALINGLLAQIE----NLRWKNLLRTSYYEN---------KRTIQYVG-TLIPPQYFNLGL- 63 (474) T ss_pred CcCCC--cCcCCCCChhHHHHHHHHHHHHH----HHhhHHHHHHHHhcc---------CCChhhcc-ccccHHHHHHHh- Confidence 21111 112222 1222221222222211 111122223333110 01111100 011112233332 Q ss_pred hhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHH Q lcl|NC_019511. 92 GNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKI 171 (330) Q Consensus 92 a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~ 171 (330) .......|+.+.++.+.. .||.+- | .+..+. .+. +.+. ..++......+ T Consensus 64 -v~nw~~~~Vd~~a~rl~~-----------~Gf~~~--d--~~~~~~---~l~---~iw~---------~N~ld~~~~~~ 112 (474) T protein:vir:81 64 -VLGWTGKAVDALARRCNL-----------EGFVWP--D--GDLDSL---GGT---EVVD---------DNHLLSEIDSA 112 (474) T ss_pred -hcChHHHHHHHHHhhhcc-----------cceECC--C--CCccch---HHH---HHHH---------hcChhHHHHHH Confidence 345677888888876642 366532 1 111111 111 1111 12455567778 Q ss_pred HHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceE---EEechhH--------- Q lcl|NC_019511. 172 VRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVV---ASFTSRE--------- 239 (330) Q Consensus 172 v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~---~~~~~~d--------- 239 (330) ..+.|++|.+|+.+..+.++.+. +.+.+++|..+.+..|+.-......+.+.+...++.. ..|.++. T Consensus 113 ~~~al~~G~sf~~V~~~~d~~~~-~~i~~~sp~~~~~~~D~~~~~~~~al~~~~~~~~g~~~~~~ly~~~~~~~~~~~~~ 191 (474) T protein:vir:81 113 IVAAMQHGPAFLINTVGEDDEPE-ALIHVKDASEATGEWNRRRRGLNNLLSIIDKDKEGKVLSLALYLDNETVTAQRDKA 191 (474) T ss_pred HHHHHhhCceeEEEecCCCCCce-eEEEEeccceEEEEEeCCCCcceeeeEEEEEcCCCcEEEEEEEeCCcEEEEEEcCc Confidence 89999999988776544444333 4467888888777665432221111111111111110 1122222 Q ss_pred ----------------eeeecccCcCCCCCCCccccHH----HHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCC Q lcl|NC_019511. 240 ----------------LVMGIRNPRSDLNSSGYGLSEV----EIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQ 299 (330) Q Consensus 240 ----------------vih~~~n~~~d~~~~~yGlSPI----e~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ 299 (330) |+++..+|.. .++||.|.| .....++.-.+.--.-.+.||+.-- .-++-..-+ . T Consensus 192 ~~~w~~~~~~~~~gvPvV~~~n~~~~---~~~~G~s~i~e~v~~l~da~~r~~~~~~~~~e~~a~pq--r~i~G~~~~-~ 265 (474) T protein:vir:81 192 TLKWQVDRDEHVYGVPAQVLPYKPAP---KRPFGQSRITKPMMGLQDAGVRELARREGHMDVFSYPE--FWLLGADES-A 265 (474) T ss_pred cceeeeccCCCCCCcceEEecccccc---cCcCCccccchhHHHHHHHHHHHHHHHHHHHHHhcchh--heeecCChh-h Confidence 3344433332 245787743 4445555555555555566665421 111110000 0 Q ss_pred CCH---HHHHHHHHHHHHH--hcCccccccccee-----eC Q lcl|NC_019511. 300 QSQ---HALENFKREWKSS--FSGINGSWQICLY-----IK 330 (330) Q Consensus 300 ls~---e~~e~lr~~w~~~--~~G~~na~kvpvL-----~e 330 (330) .+. +..+.++...... +.+-++ +.+|=+ -| T Consensus 266 ~~d~d~~~~~~~~~~~~~i~~~~~d~d-~~~~~~~~~~~~q 305 (474) T protein:vir:81 266 LKNADGTIKSVWEARLGRIKGLPDDAD-ADIPQLARADVKQ 305 (474) T ss_pred cccccccccchhhhhHHHHhcCCCccc-ccccccccccccc Confidence 000 0011222111111 111111 111211 11 No 170 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=53.56 E-value=0.53 Score=22.12 Aligned_cols=273 Identities=11% Similarity=0.114 Sum_probs=93.6 Q ss_pred CchhHHHHH----h-cCCCCCCcccccCccCcchhHHHHHHHHH-HHHhhcccchhccccchhccccccccccCCCCCcC Q lcl|NC_019511. 1 MPDLFKSLR----L-GSMYKEDTEDLMVPIDDGIQANIRQIEQD-TKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDK 74 (330) Q Consensus 1 ~~~~~~~~~----~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~ 74 (330) |+|+|-|-+ . .+...++..++....-+.++ +.++++ ..++.+...=-.|+......+ ......... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~---~~i~~~~~~~~~~~~~YY~g~~~i~~~~-----~~~~~~~~~ 72 (503) T protein:vir:59 1 MADIYPLGKTHTEELNEIIVESAKEIAEPDTTMIQ---KLIDEHNPEPLLKGVRYYMCENDIEKKR-----RTYYDAAGQ 72 (503) T ss_pred CcccccCChhhHHhHHHhhhhhhhhccchhHHHHH---HHHHhhcHHHHHHHHHHhccccchhhcc-----chhcccccc Confidence 888776541 1 11122222111111011111 111211 001111111011222111000 000000000 Q ss_pred CCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhcc Q lcl|NC_019511. 75 KSYMRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTG 154 (330) Q Consensus 75 ~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~ 154 (330) ....+...+ .| .+ +++...|++..++-+. ..+. .+...| . +..+.++.++. T Consensus 73 ~~~~~~~~~----~r-i~-~n~~~~ivd~~~~yl~--g~~~---------~~~~~d-------~--~~~~~l~~~~~--- 123 (503) T protein:vir:59 73 QLVDDTKTN----NR-TS-HAWHKLFVDQKTQYLV--GEPV---------TFTSDN-------K--TLLEYVNELAD--- 123 (503) T ss_pred ccccccccc----ce-ee-cchHHHHHHHHHhhhh--cCCe---------eeccCc-------H--HHHHHHHHHHh--- Confidence 000000000 00 01 3445566555544432 2222 221111 1 11223333321 Q ss_pred CCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCC-CcccCCceeEEEEeCC-c-- Q lcl|NC_019511. 155 TDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKN-GKIIKGGNRFVQVIDK-Q-- 230 (330) Q Consensus 155 ~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~-G~~~~~~~~Y~q~~~~-~-- 230 (330) .++...+..+..+++.+|.++..+.++.+ |++ .+..++|..+.+..++. .....-.++|+....+ + T Consensus 124 -------n~~~~~~~~~~~~~~~~G~~~~~v~~d~d--g~~-~i~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~~~~~ 193 (503) T protein:vir:59 124 -------DDFDDILNETVKNMSNKGIEYWHPFVDEE--GEF-DYVIFPAEEMIVVYKDNTRRDILFALRYYSYKGIMGEE 193 (503) T ss_pred -------cCHHHHHHHHHHHHhhCCeEEEEEeecCC--Cce-EEEEEccceeEEEEeCCCCCceEEEEEEEEEecCCCce Confidence 24666777788999999998877665443 554 58889999888876643 1111222333332211 1 Q ss_pred --eEEEechhHeeeecc-------------------------------cCcCCCCCCCccccHHHHHHHHH---HHHHHH Q lcl|NC_019511. 231 --VVASFTSRELVMGIR-------------------------------NPRSDLNSSGYGLSEVEIAMKEF---IAYNNT 274 (330) Q Consensus 231 --~~~~~~~~dvih~~~-------------------------------n~~~d~~~~~yGlSPIe~a~~~I---~~~laa 274 (330) ....++.+.+.+... -|+--+..+.+|.|-++.+...| ...++. T Consensus 194 ~~~~evy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPiv~~~nn~~~~sd~~~~~~liDa~d~~~s~ 273 (503) T protein:vir:59 194 TQKAELYTDTHVYYYEKIDGVYQMDYSYGENNPRPHMTKGGQAIGWGRVPIIPFKNNEEMVSDLKFYKDLIDNYDSITSS 273 (503) T ss_pred EEEEEEEeCCcEEEEEEcCCcccccccccccccccceeecceeccCCccceEEecCCCCCCcchhhhHHHHHHHHHHHHH Confidence 112234443332211 01111111234666555444444 333333 Q ss_pred HHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHH----hcCcccccccceeeC Q lcl|NC_019511. 275 ESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSS----FSGINGSWQICLYIK 330 (330) Q Consensus 275 e~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~----~~G~~na~kvpvL~e 330 (330) -... +...+.|--++ +|. ...+ .+.+...+... ..| +++--.+.-+ T Consensus 274 ~~~~---~~~~~~~~~v~--~g~-~~~~--~~~~~~~~~~~~~~~~~~--~~~~~~l~~~ 323 (503) T protein:vir:59 274 TMDS---FSDFQQIVYVL--KNY-DGEN--PKEFTANLRYHSVIKVSG--DGGVDTLRAE 323 (503) T ss_pred HHHH---HHHhcCCeeEe--ecC-Cccc--cchhhhhhhcccceeccC--CCcceeEecc Confidence 2222 33444454333 331 1111 11121122111 011 1110011122 No 171 >protein:vir:102668 Length: 547 # NCBI annotation: Hypothetical protein # Family: family:all:481 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024419;genbank:gi:48696640;genbank:GeneID:2948135 Probab=53.31 E-value=0.53 Score=22.09 Aligned_cols=275 Identities=11% Similarity=0.116 Sum_probs=105.7 Q ss_pred HHHHHHHHHHHHhhcccchhcccc---chhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHh Q lcl|NC_019511. 31 ANIRQIEQDTKEMQEITKSLYGKQ---QAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQ 107 (330) Q Consensus 31 ~~~~~~~~~~~~~~~~~~~~~g~~---~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~ 107 (330) +..+++.+.+-++...-.+-..+- ..|..|..........-.--.++.++... | .+....|+++++.. T Consensus 1 ~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~~i-------~--dst~~~a~~~Las~ 71 (547) T protein:vir:10 1 MENSKIVKRLDFLKTDRKNVEQIWDCIRKYIMPMRSDFFSDLRSEGSINWNQNREV-------F--DSTAGDGLETLSSS 71 (547) T ss_pred CCHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccccccCCCCCccccccccccc-------c--cchHHHHHHHHHHH Confidence 222333333333322111111111 13333432211110000000000111000 0 34455667777777 Q ss_pred Hhhhhhhheecccccc-eeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEE Q lcl|NC_019511. 108 VSTYCKPARYSEKGVG-FEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKV 186 (330) Q Consensus 108 Ia~~~~~~~~~~~~~g-~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v 186 (330) +..-..|. +.- |.+.+.|.+........+-+..+++.+...+.. .+|+.-+-.+..|++++|++..|+. T Consensus 72 L~~~ltPp-----~~~WF~l~~~d~~~~~~~~v~~~L~~ve~~i~~~l~~-----snf~~~~~~~~~~L~~~G~a~l~~~ 141 (547) T protein:vir:10 72 LHGSLTSP-----ATKWFELAFRDKELNSDDECRKWLENATHDVYSALQD-----SNFNLEANETYIDLCGYGNAIMVEE 141 (547) T ss_pred HHHhhcCC-----CCcccccccCCccccchHHHHHHHHHHHHHHHHHHHh-----cCcHHHHHHHHHHHHhHCcEeEEec Confidence 75422221 111 233333332222233333344555555444332 3566666666789999999988875 Q ss_pred EecCCCcceEEEEeeCCCceEEeeCCCCcccC-------------------------------Cce----eE-E----EE Q lcl|NC_019511. 187 FSPKNKTKMEKFIAVDPSTIFYATDKNGKIIK-------------------------------GGN----RF-V----QV 226 (330) Q Consensus 187 ~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~-------------------------------~~~----~Y-~----q~ 226 (330) ...+..+ .+.+..+...++.+..|..|++.+ .+. .+ + +. T Consensus 142 ~d~~~~~-~~r~~~~pl~~~~v~~d~~G~v~~i~r~~~~t~~qi~~~fg~~~l~~~v~~~~~~~~~~~~~~~~v~~~v~~ 220 (547) T protein:vir:10 142 EDEDEEG-SVVFQSSPIQDSYFEEDSRGQVVNFYRVFRWTPAQIYDRFGDEGTPEAIIKKAKEASNQAALKQEVVMCVFT 220 (547) T ss_pred cCCCCCC-ceeEEEeecceEEEeeCCCcCeeeeeeeeeccHHHHHHhcCcccCCHHHHHHHhcCCCcccceEEEEEEEee Confidence 4433322 334444444566666666664310 000 00 0 00 Q ss_pred eCCce-----------------EEEechh---He-----------eeecccCcCCCCCCCccccHHHHHHHHHHHHHHHH Q lcl|NC_019511. 227 IDKQV-----------------VASFTSR---EL-----------VMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTE 275 (330) Q Consensus 227 ~~~~~-----------------~~~~~~~---dv-----------ih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae 275 (330) ..+.. ...+..+ .+ +..+.+..+ ...||.||++-|+-.+......+ T Consensus 221 ~~~~~~~~~~~~~~~~~~~p~~s~~~e~~~~~~~l~esg~~e~P~~~~Rw~~~~---ge~YGrgp~~~~l~D~k~L~~l~ 297 (547) T protein:vir:10 221 RYDKKQNRNAGTVLAPTERPFGKKWILKEGAVQLGEEGGYYEMPAYAIRWRKSA---GSQWGFGPSHLALPDVLTANRYV 297 (547) T ss_pred ccCCCCCccccceeeccccceeEEEEEecCceeeeecCCcccCCeeeeeeeecC---CcccccchHHHHHHHHHHHHHHH Confidence 00000 0011111 11 111222222 14589999998887776655554 Q ss_pred HHHHHHHh---------------c--CCCcceEEEeCCCC-----------CCCHHHHHHHHHHHHHHhc----Cccccc Q lcl|NC_019511. 276 SFNDRFFS---------------H--GGTTRGILQIRADQ-----------QQSQHALENFKREWKSSFS----GINGSW 323 (330) Q Consensus 276 ~~~~~fF~---------------n--Ga~p~GiL~~~~~~-----------~ls~e~~e~lr~~w~~~~~----G~~na~ 323 (330) +-...--. + ...|+|+....+.. ....+.++.++......|- +..++. T Consensus 298 ~~~l~~~~~~~~pp~~v~~~g~~~~~~~~pgg~~~~~~~~~v~pl~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~~~~~ 377 (547) T protein:vir:10 298 ELVLRSSEKVIDPAIMVTERGLISDIDLGASGLTVVRDMESMKPFESRARFDVSSIQLTDLRSAVRRIYYVDQLQMKDSP 377 (547) T ss_pred HHHHHHHHHHhcCceecccccccccceecCCeeeecCCcccceeeecccchHHHHHHHHHHHHHHHHHhhhhhhhcCCCc Confidence 43221111 1 12344443221111 1123445555555554443 122221 Q ss_pred ccceeeC Q lcl|NC_019511. 324 QICLYIK 330 (330) Q Consensus 324 kvpvL~e 330 (330) + |=-+ T Consensus 378 ~--~TAt 382 (547) T protein:vir:10 378 A--MTAT 382 (547) T ss_pred c--ccHH Confidence 1 1001 No 172 >protein:vir:95315 Length: 559 # NCBI annotation: putative head-to-tail-joining protein # Family: family:all:481 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512261;genbank:gi:89152428;genbank:GeneID:3952984 Probab=53.09 E-value=0.54 Score=22.06 Aligned_cols=274 Identities=10% Similarity=0.066 Sum_probs=99.4 Q ss_pred ccCcchhHHHHHHHHHHHHhhcccchhcccc---chhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHH Q lcl|NC_019511. 24 PIDDGIQANIRQIEQDTKEMQEITKSLYGKQ---QAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAI 100 (330) Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~---~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~ 100 (330) +.+.. .+.+.+.+-++...-..-..+- ..|..|....... ...-....++.. -| .+....| T Consensus 1 m~~~~----~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~---~~~~~~~~~~~~-------~~--dst~~~a 64 (559) T protein:vir:95 1 MAETT----KERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLT---SEVNRNDRRNTR-------II--DSTGTMA 64 (559) T ss_pred CChhh----HHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCcCC---CCCCcccccccc-------cc--cchHHHH Confidence 11110 1112222222211111111111 1344443222110 000001111100 01 3445566 Q ss_pred HHHHHHhHhhhhhhheecccccc-eeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcC Q lcl|NC_019511. 101 IITRANQVSTYCKPARYSEKGVG-FEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYD 179 (330) Q Consensus 101 I~~~~d~Ia~~~~~~~~~~~~~g-~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g 179 (330) +++++..+..-..|. +.- |.+.+.|++........+.+..+++.+...+.. .+|+.-+-.+..|++++| T Consensus 65 ~~~Las~l~~~ltpp-----~~~WF~l~~~d~~~~e~~~v~~~L~~ve~~~~~~l~~-----snf~~~~~~~~~~L~~~G 134 (559) T protein:vir:95 65 ARTLASGMMSGITSP-----ARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNK-----SNLYQSLPQLYGSLGTYS 134 (559) T ss_pred HHHHHHHHHHhhcCC-----CCcccccccCCccccchHHHHHHHHHHHHHHHHHHHh-----cCcHHHHHHHHHHHHhhC Confidence 777777775422221 111 233333332222222233344455544433322 356665666678999999 Q ss_pred CceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccCC-------------------------------c-ee-E--E Q lcl|NC_019511. 180 QVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKG-------------------------------G-NR-F--V 224 (330) Q Consensus 180 ~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~-------------------------------~-~~-Y--~ 224 (330) ++..|+.. +. +..+.+.+++..++.+..|..|++.+- + -. + + T Consensus 135 ta~l~~~~--d~-~~~~r~~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~ 211 (559) T protein:vir:95 135 TGAMAVLD--DD-EDIIRTMPFPIGSYYLANSPRGSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVM 211 (559) T ss_pred ceeeEeec--CC-CceeEEEEeecCeEEEeeCCCCCeEEEEEeEecCHHHHHHHcCcccCCHHHHHHHhcCCCCCeEEEE Confidence 99888753 32 345667677777777777777643210 0 00 0 0 Q ss_pred EEe---CCceE------------EEec--hh-H-eeee---cccC----cCC-CCCCCcccc-HHHHHHHHHHHHHHHHH Q lcl|NC_019511. 225 QVI---DKQVV------------ASFT--SR-E-LVMG---IRNP----RSD-LNSSGYGLS-EVEIAMKEFIAYNNTES 276 (330) Q Consensus 225 q~~---~~~~~------------~~~~--~~-d-vih~---~~n~----~~d-~~~~~yGlS-PIe~a~~~I~~~laae~ 276 (330) ..+ .+... ..+. .+ + ++.- ..+| |.. .....||.| |.+-|+-.+......++ T Consensus 212 ~~V~pr~~~~~~~~~~~~~pf~s~~~e~~~~~~~~l~esg~~e~P~~~~Rw~~~~ge~YGrg~P~~~al~d~k~L~~l~~ 291 (559) T protein:vir:95 212 HSVYPNIDRDTSKLDSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQK 291 (559) T ss_pred EEEeccccccccccccccceEEEEEEEecCCCceeeecCCcccCCccceeeeecCCccccccchHHHhhHHHHHHHHHHH Confidence 000 00000 0000 01 0 1100 0011 111 111458999 89877555554444433 Q ss_pred HHHHHHhcCC-----------------CcceEEEeCCC---CCCC------------HHHHHHHHHHHHHHhcCcccccc Q lcl|NC_019511. 277 FNDRFFSHGG-----------------TTRGILQIRAD---QQQS------------QHALENFKREWKSSFSGINGSWQ 324 (330) Q Consensus 277 ~~~~fF~nGa-----------------~p~GiL~~~~~---~~ls------------~e~~e~lr~~w~~~~~G~~na~k 324 (330) -....-.-.. .|+|+..+... ..+. .+.++.++...+..|-+. -. T Consensus 292 ~~l~~~~~~~~pp~~v~~~~~~~~~~l~pgg~~~~~~~~~~~~i~p~~~~~~~~~~~~~~i~~~~~rI~~af~~d--~~- 368 (559) T protein:vir:95 292 RKSQLIDKATNPPMVAPTSLKNQRASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVD--LF- 368 (559) T ss_pred HHHHHHHHHhcCceeccccccccceeeeccceeeeCCCCCcccceeecccccchHHHHHHHHHHHHHHHHHhhhh--hH- Confidence 3222212111 23333221100 0000 122334444444433321 00 Q ss_pred cceeeC Q lcl|NC_019511. 325 ICLYIK 330 (330) Q Consensus 325 vpvL~e 330 (330) -+|.. T Consensus 369 -~~l~~ 373 (559) T protein:vir:95 369 -MMLQN 373 (559) T ss_pred -HHhhc Confidence 11222 No 173 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=52.72 E-value=0.55 Score=22.02 Aligned_cols=277 Identities=12% Similarity=0.094 Sum_probs=92.5 Q ss_pred CchhHHHH----HhcCCCCCCcccccCccCcchhHHHHHHHHH----HHHhhcccchhccccchhccccccccccCCCCC Q lcl|NC_019511. 1 MPDLFKSL----RLGSMYKEDTEDLMVPIDDGIQANIRQIEQD----TKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYR 72 (330) Q Consensus 1 ~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~ 72 (330) +.+++-++ +.+..+-.+..+.... ++...-.+.++++ .-++.+-..=-.|+.... ..++... T Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~l~~~i~~~~~~~~~r~~~l~~yY~g~~~~i--------~~~~~~~ 80 (501) T protein:vir:27 11 GQDLVLNLRFHRESRIRYRADNLEELMV--NNWELLKNFINHHKLRQAPRIQELLDYARGENHDV--------LQFGRRK 80 (501) T ss_pred chhhhhhcccChhHHHhhcccccccccc--ccHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccc--------cccCccC Confidence 23322222 1122221111110111 1111111112222 111111111111221110 0000000 Q ss_pred cCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHh Q lcl|NC_019511. 73 DKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILN 152 (330) Q Consensus 73 ~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~ 152 (330) . .+..+. | . .++....|+++.+.-+- ..+. .+...|. +..+ .....+.+++.. T Consensus 81 ~-----~~~~~~----k-i-~~n~~k~Ivd~~~~yl~--g~p~---------~~~~~d~--~~~~---~~~~~l~~~~~~ 133 (501) T protein:vir:27 81 D-----REMADK----R-A-VHNYGRMISKFKTGYLA--GNPI---------RVEYDDN--DNNS---QNDDTIKRIGRI 133 (501) T ss_pred c-----cccccc----e-e-ccchHHHHHHHHhhhhc--ccCe---------eEecCCc--cchH---HHHHHHHHHHHh Confidence 0 000000 0 0 03344455544443331 1222 2222211 1111 112233333321 Q ss_pred ccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCC-CcccCCceeEEEEeC--C Q lcl|NC_019511. 153 TGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKN-GKIIKGGNRFVQVID--K 229 (330) Q Consensus 153 ~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~-G~~~~~~~~Y~q~~~--~ 229 (330) .++...+..+..+++.+|.++.++..+.+ |++ .+..++|..+.++.++. .....-.++|++... + T Consensus 134 ---------n~~~~~~~~~~~~~~~~G~a~~~vy~ded--~~~-~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~ 201 (501) T protein:vir:27 134 ---------NDIDSHNRTLIRDLSQTGRAYEVIYRNEY--DET-RIKRLNPLETFVIYDNSLEDNSIAAVRYYNRGTLQN 201 (501) T ss_pred ---------cChhHHHHHHHHHHhhCCeEEEEEEeCCC--Cce-EEEEEccceeEEEecCCCCCceEEEEEEEEeeecCC Confidence 25777888889999999988777655443 543 56778999888876653 222223344444321 1 Q ss_pred ce--EEEechhHeeeecc-----------c-----CcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceE Q lcl|NC_019511. 230 QV--VASFTSRELVMGIR-----------N-----PRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGI 291 (330) Q Consensus 230 ~~--~~~~~~~dvih~~~-----------n-----~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~Gi 291 (330) +. +..++.+.+.++.. | |+--+..+.+|.|.++-....|.....+..-.+..+...+.|--+ T Consensus 202 ~~~~~~vyt~~~v~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~~v 281 (501) T protein:vir:27 202 AKDVVEIYTNEHIYTLDASDDFNEISVTTHAFGTVPITEFLNNVDGIGDYETELYLIDLYDSAESDTANHMSDMADAILA 281 (501) T ss_pred cEEEEEEEeCCeEEEEEeCCceeeccccccCCCcccEEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceee Confidence 11 12234444332221 1 111111233577776655444443332222222222323333322 Q ss_pred EEeCCCCC-CCHHHHHHHHH--------------------------------------HHHHHhcCcccccccceeeC Q lcl|NC_019511. 292 LQIRADQQ-QSQHALENFKR--------------------------------------EWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 292 L~~~~~~~-ls~e~~e~lr~--------------------------------------~w~~~~~G~~na~kvpvL~e 330 (330) + .|... ...+....++. ...+..... ++-+.+-.+ T Consensus 282 ~--~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~--s~~p~~~~~ 355 (501) T protein:vir:27 282 I--YGDLALPKGMQASDMKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIF--TNIPDMSDT 355 (501) T ss_pred e--ecCccCCcccchhhhhhcCceeecccccccCCCCCcceeeeeccCCHHHHHHHHHHHHHHHHHH--hCCcccCcc Confidence 2 22110 11122222221 111111110 000001111 No 174 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=52.68 E-value=0.55 Score=22.02 Aligned_cols=277 Identities=13% Similarity=0.106 Sum_probs=93.4 Q ss_pred CchhHHHHHhcCC-------CCCCcccccCccCc-------chhHHHHHHHHHHHHhhcccchhccccchhccccccccc Q lcl|NC_019511. 1 MPDLFKSLRLGSM-------YKEDTEDLMVPIDD-------GIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMD 66 (330) Q Consensus 1 ~~~~~~~~~~~~~-------~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~ 66 (330) +...-.-|-+++. |-.+.....+..+. .+.-.+.+..++..++.+...=-.|+......+.. T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~i~~~~~~---- 82 (492) T protein:vir:97 7 ISQVAQALIKGGNILYPSQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKP---- 82 (492) T ss_pred HHHHHHHHhcCCceeeccchhhhhHhhhcccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccccc---- Confidence 1112222322322 11111111111111 11111111122222222211111232211111100 Q ss_pred cCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHH Q lcl|NC_019511. 67 TNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRI 146 (330) Q Consensus 67 ~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i 146 (330) ........+.. .+ .| .+ ++....|+++.+.-+ +..+.+ +... ++ +....+ T Consensus 83 ~~~~~~~~~~~-~~-------~r-i~-~n~~k~Ivd~~~~yl--~g~p~~---------~~~~--d~-------~~~~~l 132 (492) T protein:vir:97 83 VDATGAVDPLK-PD-------DR-MI-TNFHANLVDQKVSYI--VGKPIA---------FKHT--DD-------EVVKRI 132 (492) T ss_pred ccccccccccc-cc-------cc-cc-cchHHHHHHHHhhhh--cccCce---------eccC--ch-------HHHHHH Confidence 00000000000 00 00 00 234455555544332 122221 1111 11 122334 Q ss_pred HHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCC-CcccCCceeEEE Q lcl|NC_019511. 147 EEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKN-GKIIKGGNRFVQ 225 (330) Q Consensus 147 ~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~-G~~~~~~~~Y~q 225 (330) ..++. | ++...+..+..+.+.+|.++..+..+. .|+ +.+..++|..+.++.++. ...+.-.++|+. T Consensus 133 ~~~~~-------n---~~~~~~~~~~~~~~~~G~a~~~v~~d~--dg~-~~~~~~~p~~~~~i~d~~~~~~~~~~vr~~~ 199 (492) T protein:vir:97 133 DEVLG-------N---RFDDKLHSVLTGASNKGIEWLHPYLDE--EGE-FKLFRVPAEQGIPIWTDKEHEELEAFIRMYK 199 (492) T ss_pred HHHHh-------c---cHHHHHHHHHHHHhhcCeEEEEEEecC--CCc-eEEEEEcccceEEEEcCCCCCceEEEEEEEe Confidence 44331 1 344556667889999998877765544 354 457779999988876642 111222234433 Q ss_pred EeCCceEEEechhHeeeec----------------------ccCcCC-----CCCCCccccHHHHHHHHHHHHHHHHHHH Q lcl|NC_019511. 226 VIDKQVVASFTSRELVMGI----------------------RNPRSD-----LNSSGYGLSEVEIAMKEFIAYNNTESFN 278 (330) Q Consensus 226 ~~~~~~~~~~~~~dvih~~----------------------~n~~~d-----~~~~~yGlSPIe~a~~~I~~~laae~~~ 278 (330) ..+...+..++...+.+.. .|+... +..+.+|.|-++-....|.....+..-. T Consensus 200 ~~~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~ 279 (492) T protein:vir:97 200 LENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDL 279 (492) T ss_pred eccceeEEEEecCeEEEEEEecCeeeecccccccccccccccCCCCCcceEEecCCCCCCCchHhHHHHHHHHHHHHHHH Confidence 3233333333333333321 111111 1112346776665555444333222222 Q ss_pred HHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHH--hcCccccccccee-eC Q lcl|NC_019511. 279 DRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSS--FSGINGSWQICLY-IK 330 (330) Q Consensus 279 ~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~--~~G~~na~kvpvL-~e 330 (330) ++.+...+.|--++ .|. +.+....++...... .....++ .+-.| -+ T Consensus 280 ~~~~~~~~~~~l~~--~g~---~~~~~~~~~~~~~~~~~~~~~~~~-~~~~l~~~ 328 (492) T protein:vir:97 280 SNTFKDSNELTYVL--KNY---DDQELPEFKRLLRYYGAIKVSDNG-GVDTIQVE 328 (492) T ss_pred HHHHHHhccceeee--ecC---CcccchhHHHHHhhccceecCCCC-cceeEecc Confidence 33344445554333 332 122222222211110 0001111 11111 12 No 175 >protein:vir:103330 Length: 517 # NCBI annotation: head portal-like protein # Family: family:all:481 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039666;genbank:gi:125999995;genbank:GeneID:4818406 Probab=50.67 E-value=0.6 Score=21.79 Aligned_cols=259 Identities=10% Similarity=0.067 Sum_probs=84.9 Q ss_pred hhcccchhccccchhccccccccccC----CCCCcCC---CcccchHHHHHH-HHHHhhcHHHHHHHHHHHHhHhhhhhh Q lcl|NC_019511. 43 MQEITKSLYGKQQAYAEPFLEMMDTN----PDYRDKK---SYMRNAHNLHEV-LKKFGNNSILNAIIITRANQVSTYCKP 114 (330) Q Consensus 43 ~~~~~~~~~g~~~~~~~~~~~~~~~~----p~~~~~~---s~~r~~~~~~~~-Lr~~a~~~iv~a~I~~~~d~Ia~~~~~ 114 (330) |+.. -++.......-+-...+-| +.|+.=. -|.+......+. ..+.- .+..-.|.++++..+..-..+ T Consensus 1 ~~~~---~~~e~~~l~~r~~~Lk~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~-dstg~~a~~~LAa~l~~~ltp 76 (517) T protein:vir:10 1 MDMR---FAGNKSKIPKLYEQLVGKRSPFLSRAENYSRFTLPYLMADVNDDLSSQNAW-QDDGASATNFLSNKLSQVLFP 76 (517) T ss_pred Cccc---ccccHHHHHHHHHHHHHhhhHHHHHHHHHHHHhccccccCCCCCccccccc-cchHHHHHHHHHHHHHHhhcC Confidence 1111 0000000000000000000 0111100 000110000000 00000 123344455555555422111 Q ss_pred heecccccceeeeccCCC-----cccChhh----HHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEE Q lcl|NC_019511. 115 ARYSEKGVGFEVKLKDLD-----ATPGIKE----KEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEK 185 (330) Q Consensus 115 ~~~~~~~~g~~v~~kd~~-----~~~~~~~----~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~ 185 (330) . + -=|+++.-.+ ...+... .+....+++.+..-+ .+.+|+.=+..+..|+.++|++..|. T Consensus 77 p-----~-~~WF~l~~~~~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~l-----~~snf~~~~~~~~~~L~~~G~a~ly~ 145 (517) T protein:vir:10 77 A-----Q-RSFFRIDLTPEGIKQLDNEAMTQSTAQKLLSDVEKAAMLYG-----ESLQFRPAVVEAFKHLIVTGNVMMYH 145 (517) T ss_pred C-----C-CccccccCCHHHHHhhccCcchHHHHHHHHHHHHHHHHHHH-----HhcCcHHHHHHHHHHHHhHCeEEEEE Confidence 0 1 0134433111 0111111 222333333333221 12356666666778888999987664 Q ss_pred EEecCCCcceEEEEeeCCCceEEeeCCCCcccC--------------------------------Ccee-E--EEEeCCc Q lcl|NC_019511. 186 VFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIK--------------------------------GGNR-F--VQVIDKQ 230 (330) Q Consensus 186 v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~--------------------------------~~~~-Y--~q~~~~~ 230 (330) .++ +..+..||+. +..+..|..|++.+ .... | ++...++ T Consensus 146 ---~~~-~~~~~~~pl~--~y~v~~d~~G~v~~ivrr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~ 219 (517) T protein:vir:10 146 ---PDK-TSPIQAVPLH--HYCVRRDNNGTVLDIVFLQEKALETFEPSIRMAIQASRKGKQYKDKDNVKLYTHAKRTKDG 219 (517) T ss_pred ---eCC-CCcEEEEEcC--eEEEeeCCCcCeEEEEeeeeccHHHHHHHhhhhcchhhhhhccCCcCceEEEEEEEEeCCC Confidence 232 3456788873 44455555554321 0000 0 0011111 Q ss_pred e-EEEechh-------------He--eeecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCc------ Q lcl|NC_019511. 231 V-VASFTSR-------------EL--VMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTT------ 288 (330) Q Consensus 231 ~-~~~~~~~-------------dv--ih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p------ 288 (330) . ......+ +. +-.+.+..+ ...||.||.+-|+-.+.......+-....=.-.+.| T Consensus 220 ~~~~~~~~d~~~~~~~s~y~~~e~P~~~~Rw~~~~---ge~YGrgp~~~~L~D~k~L~~l~~~~~~~~~~a~~~~~lv~~ 296 (517) T protein:vir:10 220 KYLIRQSADDVPVGKESTVTEDKSPFLILTWKRSY---GEDYGRGMAEDHAGAFFVIQFLSEALARGMALMADVKYLVKP 296 (517) T ss_pred ceEEEEEeCceeeccccccccccCCeeeeeeeecC---CCCcccchHHHhHHHHHHHHHHHHHHHHHHHHhccCCcccCc Confidence 1 1111111 11 111111111 245999999988877776554433333221111111 Q ss_pred ceEEE----------------------eC--C--CCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 289 RGILQ----------------------IR--A--DQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 289 ~GiL~----------------------~~--~--~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) +|++. +. . +-....+.++.++...+..|--. . ..+.+ T Consensus 297 ~~~~~~~~l~~~~~g~~~~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~--~---l~~~~ 359 (517) T protein:vir:10 297 GSYTDINQFVEGGSGAVLHGVEGDIHIVQLGKYADYTPIQAVLNDYRQRIGRVFMME--A---MTRRD 359 (517) T ss_pred ccccchhhccCCCccccccCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhh--h---hhccC Confidence 11110 00 0 00111333444444444443210 0 01111 No 176 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=47.88 E-value=0.69 Score=21.48 Aligned_cols=254 Identities=13% Similarity=0.103 Sum_probs=89.5 Q ss_pred CchhHHHHHhcCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhcccc---chhcccc-------cc-----cc Q lcl|NC_019511. 1 MPDLFKSLRLGSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQ---QAYAEPF-------LE-----MM 65 (330) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~---~~~~~~~-------~~-----~~ 65 (330) |--|+.-+.....++ +.+...++.-.++..++.+...=-.|.. +.+..+. .. .. T Consensus 3 ~~~~~~~~~~~~~~~-----------e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 71 (474) T protein:vir:94 3 LYKLIDDIEAQGILP-----------KHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRL 71 (474) T ss_pred hHHHHhhccccCCCH-----------HHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhccccccc Confidence 211221111111111 1122222221111111111100000000 0000000 00 00 Q ss_pred ccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHH Q lcl|NC_019511. 66 DTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKR 145 (330) Q Consensus 66 ~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~ 145 (330) ..+|.+++ + ++....|+++.+.=+ |..|.+++ .. +.+ ..+.+.... T Consensus 72 ~~~~~~ki------------------~-~n~~~~ivd~~~~yl--~g~pv~~~---------~~--~~~--~~~e~~~~~ 117 (474) T protein:vir:94 72 DVSVNNKL------------------N-NSFDSEIVDTRVGYL--HGVPVTYD---------LD--ENA--EKNEKLKKF 117 (474) T ss_pred ccCccccc------------------c-cchHHHHHHhHhhhe--eccceeEe---------eC--CCC--cchHHHHHH Confidence 00111111 0 233444443333211 12333222 11 111 111222344 Q ss_pred HHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEE Q lcl|NC_019511. 146 IEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQ 225 (330) Q Consensus 146 i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q 225 (330) +.+++.. ..+......+..+++.+|.++..+..+. .|+ +.+..++|..+.++.|+.+.. .-.++|++ T Consensus 118 l~~~~~~---------n~~~~~~~~~~~~~~~~G~a~~~~~~d~--~~~-~~~~~i~p~~~~~v~d~~~~~-~~~i~~~~ 184 (474) T protein:vir:94 118 ITNFAIR---------NSVDDEDSEIGKMAAICGYGARLAYIDT--NGD-IRIKNIDPYNVIFVGDNILEP-TYSLRYFY 184 (474) T ss_pred HHHHHhh---------cCHhHHHHHHHHHHhhcCeEEEEEEeCC--CCe-eEEEEEcccceEEEEcCCCce-EEEEEEEE Confidence 5555432 2466777788899999998776655443 454 467789999988887776643 12233333 Q ss_pred EeC--Cce----EEEechhHeeeeccc-------------CcC-----CCCCCCccccHHH---HHHHHHHHHHHHHHHH Q lcl|NC_019511. 226 VID--KQV----VASFTSRELVMGIRN-------------PRS-----DLNSSGYGLSEVE---IAMKEFIAYNNTESFN 278 (330) Q Consensus 226 ~~~--~~~----~~~~~~~dvih~~~n-------------~~~-----d~~~~~yGlSPIe---~a~~~I~~~laae~~~ 278 (330) ... ++. +..++...+.+...+ +.. -+....+|.|-++ ....++...++.-... T Consensus 185 ~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~d~~~S~~~~~ 264 (474) T protein:vir:94 185 EKDDDNGTDYVYAEFYDNAYYYVFRGEGIDALQEVGRYEHLFDYNPLFGVPNNKEMIGDAEKVIHLIDAYDLTMSDASSE 264 (474) T ss_pred EeeCCCceEEEEEEEEcCceEEEEeecCCCcccccccccCCCCccceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHH Confidence 322 111 112233333222111 111 1111234655544 4444444444443333 Q ss_pred HHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 279 DRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 279 ~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) ..+|++ |- +.+.| ..++++....++. . +..++.+ T Consensus 265 ~~~~~~---~~--l~i~g-~~~~~~~~~~~~~---~---------~~i~~~~ 298 (474) T protein:vir:94 265 ISQTRL---AY--LVLRG-MGMSEEMIQETQK---S---------GAFELFD 298 (474) T ss_pred HHHhhc---ch--hhhcc-CCCCchhhhhhhh---c---------ceeEecC Confidence 344443 32 33333 2344443333221 1 1122222 No 177 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=47.88 E-value=0.69 Score=21.48 Aligned_cols=254 Identities=13% Similarity=0.103 Sum_probs=89.5 Q ss_pred CchhHHHHHhcCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhcccc---chhcccc-------cc-----cc Q lcl|NC_019511. 1 MPDLFKSLRLGSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQ---QAYAEPF-------LE-----MM 65 (330) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~---~~~~~~~-------~~-----~~ 65 (330) |--|+.-+.....++ +.+...++.-.++..++.+...=-.|.. +.+..+. .. .. T Consensus 3 ~~~~~~~~~~~~~~~-----------e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 71 (474) T protein:vir:10 3 LYKLIDDIEAQGILP-----------KHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRL 71 (474) T ss_pred hHHHHhhccccCCCH-----------HHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhccccccc Confidence 211221111111111 1122222221111111111100000000 0000000 00 00 Q ss_pred ccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHH Q lcl|NC_019511. 66 DTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKR 145 (330) Q Consensus 66 ~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~ 145 (330) ..+|.+++ + ++....|+++.+.=+ |..|.+++ .. +.+ ..+.+.... T Consensus 72 ~~~~~~ki------------------~-~n~~~~ivd~~~~yl--~g~pv~~~---------~~--~~~--~~~e~~~~~ 117 (474) T protein:vir:10 72 DVSVNNKL------------------N-NSFDSEIVDTRVGYL--HGVPVTYD---------LD--ENA--EKNEKLKKF 117 (474) T ss_pred ccCccccc------------------c-cchHHHHHHhHhhhe--eccceeEe---------eC--CCC--cchHHHHHH Confidence 00111111 0 233444443333211 12333222 11 111 111222344 Q ss_pred HHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccCCceeEEE Q lcl|NC_019511. 146 IEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQ 225 (330) Q Consensus 146 i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q 225 (330) +.+++.. ..+......+..+++.+|.++..+..+. .|+ +.+..++|..+.++.|+.+.. .-.++|++ T Consensus 118 l~~~~~~---------n~~~~~~~~~~~~~~~~G~a~~~~~~d~--~~~-~~~~~i~p~~~~~v~d~~~~~-~~~i~~~~ 184 (474) T protein:vir:10 118 ITNFAIR---------NSVDDEDSEIGKMAAICGYGARLAYIDT--NGD-IRIKNIDPYNVIFVGDNILEP-TYSLRYFY 184 (474) T ss_pred HHHHHhh---------cCHhHHHHHHHHHHhhcCeEEEEEEeCC--CCe-eEEEEEcccceEEEEcCCCce-EEEEEEEE Confidence 5555432 2466777788899999998776655443 454 467789999988887776643 12233333 Q ss_pred EeC--Cce----EEEechhHeeeeccc-------------CcC-----CCCCCCccccHHH---HHHHHHHHHHHHHHHH Q lcl|NC_019511. 226 VID--KQV----VASFTSRELVMGIRN-------------PRS-----DLNSSGYGLSEVE---IAMKEFIAYNNTESFN 278 (330) Q Consensus 226 ~~~--~~~----~~~~~~~dvih~~~n-------------~~~-----d~~~~~yGlSPIe---~a~~~I~~~laae~~~ 278 (330) ... ++. +..++...+.+...+ +.. -+....+|.|-++ ....++...++.-... T Consensus 185 ~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~d~~~S~~~~~ 264 (474) T protein:vir:10 185 EKDDDNGTDYVYAEFYDNAYYYVFRGEGIDALQEVGRYEHLFDYNPLFGVPNNKEMIGDAEKVIHLIDAYDLTMSDASSE 264 (474) T ss_pred EeeCCCceEEEEEEEEcCceEEEEeecCCCcccccccccCCCCccceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHH Confidence 322 111 112233333222111 111 1111234655544 4444444444443333 Q ss_pred HHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 279 DRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 279 ~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) ..+|++ |- +.+.| ..++++....++. . +..++.+ T Consensus 265 ~~~~~~---~~--l~i~g-~~~~~~~~~~~~~---~---------~~i~~~~ 298 (474) T protein:vir:10 265 ISQTRL---AY--LVLRG-MGMSEEMIQETQK---S---------GAFELFD 298 (474) T ss_pred HHHhhc---ch--hhhcc-CCCCchhhhhhhh---c---------ceeEecC Confidence 344443 32 33333 2344443333221 1 1122222 No 178 >protein:vir:2198 Length: 536 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041995;swissprot:sw:p03728;genbank:gi:9627467;goa:P03728;uniprot:P03728;genbank:GeneID:1261033 Probab=47.32 E-value=0.71 Score=21.41 Aligned_cols=278 Identities=11% Similarity=0.005 Sum_probs=96.5 Q ss_pred CchhHHHHHhcCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccc Q lcl|NC_019511. 1 MPDLFKSLRLGSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRN 80 (330) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~ 80 (330) |++ .| . ... ..-++.+-+.+.+.-..=.+..-.=-.|..|....-+..+. T Consensus 1 m~~-~~---------~------~~~----~~~~~~r~~~lk~~R~~~e~~w~e~~~~~lP~~~~~~~~~~---------- 50 (536) T protein:vir:21 1 MAE-KR---------T------GLA----EDGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNA---------- 50 (536) T ss_pred Ccc-hh---------h------chh----HHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcc---------- Confidence 111 00 0 000 00111111111111000000000011333342111111000 Q ss_pred hHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCccc-----ChhhH----HHHHHHHHHHH Q lcl|NC_019511. 81 AHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATP-----GIKEK----EQMKRIEEFIL 151 (330) Q Consensus 81 ~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~-----~~~~~----~~~~~i~~~l~ 151 (330) + +.+.+.= .+....|+++++..+..-..|+ +=|+++.-.+.+. .+.+. +-...+++.+. T Consensus 51 -~---~~~~~~~-dst~~~a~~~Laa~l~~~ltP~-------~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~ 118 (536) T protein:vir:21 51 -S---TDYQTPW-QAVGARGLNNLASKLMLALFPM-------QTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIM 118 (536) T ss_pred -c---ccccccc-cccHHHHHHHHHHHHHHhhcCC-------CcccccccChhhhhccccchhhHHHHHHHHHHHHHHHH Confidence 0 0000000 3345566777777775432343 1155543222111 11111 12333444443 Q ss_pred hccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCccc-------------- Q lcl|NC_019511. 152 NTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKII-------------- 217 (330) Q Consensus 152 ~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~-------------- 217 (330) .-+. +.+|+.=+.....|++++|++..|+.-+..+.+.....|||. ++.+..|..|++. T Consensus 119 ~~l~-----~snf~~~~~~~~~~L~~~G~a~ly~~e~~~~~~~~f~~~pl~--~~~v~~d~~G~vd~i~r~~~~t~~~l~ 191 (536) T protein:vir:21 119 NYIE-----SNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLS--SYVVQRDAFGNVLQMVTRDQIAFGALP 191 (536) T ss_pred HHHH-----hcCcHHHHHHHHHHHHhHCcEeEEEeeCCCCceeeEEEEEcC--eEEEeeCCCCCeeEEeeeeeccHHHHH Confidence 3322 235666666777899999999988764443334456677763 4445555555322 Q ss_pred ----------------CCceeEEEE---e-CCceEE-EechhH----------------eeeecccCcCCCCCCCccccH Q lcl|NC_019511. 218 ----------------KGGNRFVQV---I-DKQVVA-SFTSRE----------------LVMGIRNPRSDLNSSGYGLSE 260 (330) Q Consensus 218 ----------------~~~~~Y~q~---~-~~~~~~-~~~~~d----------------vih~~~n~~~d~~~~~yGlSP 260 (330) +....++.. . +++... +...++ .+..+.+..+ ...||.|| T Consensus 192 ~~fg~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~e~~g~~v~~~~g~~~f~~~P~i~~Rw~~~~---ge~YGrgp 268 (536) T protein:vir:21 192 EDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYLRYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLD---GESYGRSY 268 (536) T ss_pred HhhhhhhcccccccccccceeEEEEEEEecCCCcEEEEeccCCeeeccccCccccccCCeeeeeeeecC---CCccccch Confidence 011111111 1 111111 001111 1112222222 24589999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcC------CCcceEEEe------------C--------------CCCCCCHHHHHHH Q lcl|NC_019511. 261 VEIAMKEFIAYNNTESFNDRFFSHG------GTTRGILQI------------R--------------ADQQQSQHALENF 308 (330) Q Consensus 261 Ie~a~~~I~~~laae~~~~~fF~nG------a~p~GiL~~------------~--------------~~~~ls~e~~e~l 308 (330) ++-++-.+.......+-....-.-- ..|+|++.. + ++-....+.++.+ T Consensus 269 ~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~ 348 (536) T protein:vir:21 269 IEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAI 348 (536) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccCcccccchhhhccCCCcceecCCcccceeeeccccccchHHHHHHHHH Confidence 9887776665444333222210100 112222110 0 0001123444555 Q ss_pred HHHHHHHhcCccc--ccccceeeC Q lcl|NC_019511. 309 KREWKSSFSGING--SWQICLYIK 330 (330) Q Consensus 309 r~~w~~~~~G~~n--a~kvpvL~e 330 (330) +...+..|--... ...-.|--+ T Consensus 349 ~~rI~~af~~~~l~~~~~~r~TAt 372 (536) T protein:vir:21 349 EARLSFAFMLNSAVQRTGERVTAE 372 (536) T ss_pred HHHHHHHHhhhhcccCCCCCccHH Confidence 5555444421100 000000000 No 179 >protein:vir:78696 Length: 542 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285446;genbank:gi:148724480;genbank:GeneID:5220167 Probab=46.23 E-value=0.74 Score=21.29 Aligned_cols=257 Identities=13% Similarity=0.098 Sum_probs=89.5 Q ss_pred HHHHHHHHHHhhccc-chhccc---cchhccccccccccCCCCCcCCCcccchHHHHHHH-HHHhhcHHHHHHHHHHHHh Q lcl|NC_019511. 33 IRQIEQDTKEMQEIT-KSLYGK---QQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVL-KKFGNNSILNAIIITRANQ 107 (330) Q Consensus 33 ~~~~~~~~~~~~~~~-~~~~g~---~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~L-r~~a~~~iv~a~I~~~~d~ 107 (330) .+.+.++-.+.-+.. .+-..+ =..|..|.... +. .. +. .+.+ +-| .+....|.++++.. T Consensus 1 mk~~a~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~----~~-----~~--~~---~~~~~~~~--dstg~~a~~~Laa~ 64 (542) T protein:vir:78 1 MKGLAQARYSAMRADREDFLDMARRCAALTLPYLLT----ED-----GH--AS---GGRLQQPY--QSLGSKGVNALSSK 64 (542) T ss_pred ChhHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCC----CC-----CC--cc---cccccccc--cchHHHHHHHHHHH Confidence 222222211111100 001111 11233342110 00 00 00 0001 111 23344666777777 Q ss_pred HhhhhhhheecccccceeeeccCCCc------ccCh----hhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHh Q lcl|NC_019511. 108 VSTYCKPARYSEKGVGFEVKLKDLDA------TPGI----KEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYT 177 (330) Q Consensus 108 Ia~~~~~~~~~~~~~g~~v~~kd~~~------~~~~----~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~ 177 (330) +..-..+. +. =|+++.-.+. +.+. .....+..+++.+..-+. +.+|+.=+-.++.|+++ T Consensus 65 l~~~ltpp-----~~-~WF~l~~~d~~l~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-----~snf~~~~~~~~~~L~~ 133 (542) T protein:vir:78 65 LMLSLFPI-----QT-SFFKLQINDAEIASVPELTPEVRSEIDMNLSKMEKMVMQQIA-----ESSDRVQLTAAMKHLIV 133 (542) T ss_pred HHHhhcCC-----CC-ccccccCCHHHHHhhccCChhhHHHHHHHHHHHHHHHHHHHH-----hcCcHHHHHHHHHHHHh Confidence 75422221 11 1344332110 1111 112223334444433322 23566666677788999 Q ss_pred cCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccC--------------------------------------- Q lcl|NC_019511. 178 YDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIK--------------------------------------- 218 (330) Q Consensus 178 ~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~--------------------------------------- 218 (330) +|++..|.- .+ +...+|+. +..+..|..|++.+ T Consensus 134 ~G~a~l~~~--~~----~~~~~pl~--~y~v~~d~~G~vd~v~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~~v 205 (542) T protein:vir:78 134 TGNVLVFAG--KK----TLKVYPLD--RYVIERDGDGNVIEIITRELVDRSLLPAEFQKQSLLEGKDSNAVGEDGPKFGV 205 (542) T ss_pred hCeEEEEec--CC----CceEEecc--eeEEeeCCCCCeEEEeeeeecCHHHHHHhhccccCchHHHhhccccCCCeEEE Confidence 999877752 22 24455553 23334444443210 Q ss_pred --------------------CceeEEEEeCCceE-EEe---chhH--eeeecccCcCCCCCCCccccHHHHHHHHHHHHH Q lcl|NC_019511. 219 --------------------GGNRFVQVIDKQVV-ASF---TSRE--LVMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYN 272 (330) Q Consensus 219 --------------------~~~~Y~q~~~~~~~-~~~---~~~d--vih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~l 272 (330) +...|++-.+|..+ ..+ .-++ .+-.+.+..+ ...||.||++-++-.+.... T Consensus 206 ~~~v~pr~~~~~~~~~~~~~~~~s~~~e~~g~~v~~~~~e~g~~~~P~i~~Rw~~~~---ge~YGrgp~~~~l~D~k~L~ 282 (542) T protein:vir:78 206 AQGKGGRNDAEVFTCCKLVDGQHRWHQECDGKEIKGSRSSSPLKHSPWLPLRFNVVD---GESYGRGRVEEFFGDLSSLD 282 (542) T ss_pred EEEeecccCCccccccccCCCeEEEEEEeccccccccccccccccCCceeeeeeecC---CCccccchHHHHHHHHHHHH Confidence 00111111222211 000 0001 1111222222 24589999988887776665 Q ss_pred HHHHHHHHHHhcCCCcceEEEeCCCCCC----------------------------------CHHHHHHHHHHHHHHhcC Q lcl|NC_019511. 273 NTESFNDRFFSHGGTTRGILQIRADQQQ----------------------------------SQHALENFKREWKSSFSG 318 (330) Q Consensus 273 aae~~~~~fF~nGa~p~GiL~~~~~~~l----------------------------------s~e~~e~lr~~w~~~~~G 318 (330) ..++-....-.-...|-.+ ++.+..+ ..+.++.++...+..|-- T Consensus 283 ~l~~~~l~~~~~a~~pp~l--v~~~g~~~~~~~~~~~~g~iv~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~aFl~ 360 (542) T protein:vir:78 283 ALTRSLIEGSAAAAKVVFM--VSPSATTKPQSLARAGTGAIIQGRAEDVSVVQANKGADFRTVQEMIRDLSQRISDAFLI 360 (542) T ss_pred HHHHHHHHHHHHHhcCcee--eccccccchhhcccCCCceeecCCccceeeeecccccchhHHHHHHHHHHHHHHHHhcc Confidence 5544443332222333211 1111111 222333333333332211 Q ss_pred c--ccccccceeeC Q lcl|NC_019511. 319 I--NGSWQICLYIK 330 (330) Q Consensus 319 ~--~na~kvpvL~e 330 (330) . .++.++ -=.| T Consensus 361 ~~~~d~~rv-TAtE 373 (542) T protein:vir:78 361 LNVRQSERT-TATE 373 (542) T ss_pred cccCCcccc-cHHH Confidence 0 011110 0000 No 180 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=45.94 E-value=0.75 Score=21.26 Aligned_cols=210 Identities=11% Similarity=-0.015 Sum_probs=85.4 Q ss_pred CCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHH Q lcl|NC_019511. 71 YRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFI 150 (330) Q Consensus 71 ~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l 150 (330) |-++ +...-+..+++.+.+.....||++.++-+- ..||.. .|. + ..+.+.. ++ T Consensus 1 ~l~~-----~~~~~~~~~~~~~v~n~~~~ivd~~~~~l~-----------~~gf~~--~d~--~----~~~~~~~---i~ 53 (434) T protein:vir:98 1 MLPK-----NAEQAFLDFQRKARTNFCGLIANASVHRLL-----------ALGVTG--PDG--E----PDTRASR---WW 53 (434) T ss_pred CCCC-----CccHHHHHhhhhhhccchHHHHHHHHhhhc-----------cCceec--CCC--c----hHHHHHH---HH Confidence 2221 222233444444445567788877776542 235542 221 1 1122222 22 Q ss_pred HhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCC---Ccce-EEEEeeCCCceEEeeCCCCcccCCceeEEEE Q lcl|NC_019511. 151 LNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKN---KTKM-EKFIAVDPSTIFYATDKNGKIIKGGNRFVQV 226 (330) Q Consensus 151 ~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~---~G~~-~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~ 226 (330) . .| ++.+....+..+++++|.+|..+....++ .|.+ ..+..++|..+.++.|+......-.++|++. T Consensus 54 ~------~N---~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~I~~~~p~~~~~i~D~~~~~~~~ai~~~~~ 124 (434) T protein:vir:98 54 Q------AN---RLDSRQKLVWRMAMAQSAGYMLVGAHPTRTEDNGRPSPLITMEHPSECIVEYDPETGEPLVGLKVWHN 124 (434) T ss_pred H------hc---ChhHHHHHHHHHHhhcCceEEEEecCCCcccccCCceeEEEEeccceeEEEEeCCCCceEEEEEEEEe Confidence 1 12 45566777889999999888776544322 1222 2356789988887776543222222333322 Q ss_pred eCC-ceEE--Ee--------------------------------------chhHeeeecccCcCCCCCCCccccHHHHHH Q lcl|NC_019511. 227 IDK-QVVA--SF--------------------------------------TSRELVMGIRNPRSDLNSSGYGLSEVEIAM 265 (330) Q Consensus 227 ~~~-~~~~--~~--------------------------------------~~~dvih~~~n~~~d~~~~~yGlSPIe~a~ 265 (330) ..+ .... .+ ..=-|+++..|+.. ..+|.|-++... T Consensus 125 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~N~~~~----~~~g~sd~e~vi 200 (434) T protein:vir:98 125 DIDGFGYARVFFDDTSFPYRTRERTGARLPWGPDSWVYTGTADSGDVHDLGGMQLVEFARMPDL----GEDPEPEFAGVL 200 (434) T ss_pred ccCCceEEEEEEeCcEEEEEEeeccccccccccccceecccccccccCCCCccceEEeccCCCc----CcCCcchhhhHH Confidence 111 1100 00 00003334444322 125777666554 Q ss_pred HHHHHHHHH---HHHHHHHHhcCCCcceEEEeCCCCCCCHH--HHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 266 KEFIAYNNT---ESFNDRFFSHGGTTRGILQIRADQQQSQH--ALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 266 ~~I~~~laa---e~~~~~fF~nGa~p~GiL~~~~~~~ls~e--~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) ..|.....+ -.-...||+. |.=+| .|. .+.+. ........|+....+. +++ .+++ T Consensus 201 ~liDa~~~~~s~~~~~~~~~a~---p~~~i--~G~-~~~~~~~~~~~~~~~~~~~~~~~---~~i-~~~~ 260 (434) T protein:vir:98 201 DIQDRVNLGILNRMAASRFSGF---RQKWI--KGH-KFAKRTDPATGMTVVDQPFVPSP---SAV-WASE 260 (434) T ss_pred HHHHHHHHHHHHHHHHHHHhcc---hhhhh--cCC-Ccccccccccccchhhhhhhccc---ccc-ccCC Confidence 444443322 2233344433 32111 110 01110 0011222232211111 111 1111 No 181 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=45.62 E-value=0.76 Score=21.23 Aligned_cols=261 Identities=10% Similarity=0.087 Sum_probs=91.6 Q ss_pred Cchh------------HHHHHhcCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccc---- Q lcl|NC_019511. 1 MPDL------------FKSLRLGSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEM---- 64 (330) Q Consensus 1 ~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~---- 64 (330) |+|+ ++.|+....+. ++.+...+++..++..++.+...=-.|+......+.... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~i~~~i~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~ 70 (478) T protein:vir:10 1 MISINWPWDKPYHEQVVEQIKPKYETQ----------EEMILRLVREHKENIDNITMGERYYNHHPDILDAPFKRDVNGD 70 (478) T ss_pred CccccccCCchhhhHHHHHhhhccCCh----------HHHHHHHHHHHHHHHHHHHHHHHHhcccccccccchhhhcccc Confidence 4433 44443222211 122233333333333333222221223322111110000 Q ss_pred -cccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHH Q lcl|NC_019511. 65 -MDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQM 143 (330) Q Consensus 65 -~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~ 143 (330) -...|.+++. +++...|+++.+.-+ |..+.++. . .+ .+.. T Consensus 71 ~~~~~~~~ki~-------------------~n~~k~ivd~~~~yl--~g~p~~~~---------~--~~-------~~~~ 111 (478) T protein:vir:10 71 YDETKPDWRMY-------------------TNYHQNLVDQKVAYA--VANPVTFG---------V--DN-------DKAL 111 (478) T ss_pred cccccccceec-------------------cchHHHHHHHHhhhh--cccCceee---------c--CC-------hHHH Confidence 0000111110 233444444333222 12222221 1 11 1123 Q ss_pred HHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCC--CcccCCce Q lcl|NC_019511. 144 KRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKN--GKIIKGGN 221 (330) Q Consensus 144 ~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~--G~~~~~~~ 221 (330) +.+..++. .++...+..+..+.+.+|.++..+.++.+ |+ +.+..++|..+.++.++. |. ..-.+ T Consensus 112 ~~l~~~~~----------n~~~~~~~~~~~~~~~~G~~~~~v~~d~~--~~-~~~~~~~p~~~~~v~d~~~~~~-~~~~i 177 (478) T protein:vir:10 112 KQIQHTLN----------HKWDDKLVDILTAASNKGIEWVQPYVDEE--GE-FKTFRVPAEQAVPIWTNKERDE-LQAFI 177 (478) T ss_pred HHHHHHHh----------ccHHHHHHHHHHHHhhCCeEEEEEEecCC--Cc-eEEEEEcccceEEEEcCCCCCc-eEEEE Confidence 34444431 14666677778899999988776655444 44 467789999988776532 22 22223 Q ss_pred eEEEEeCCceEEEechhHeeeecc--------------------------c-----CcCCCCCCCccccHHHH---HHHH Q lcl|NC_019511. 222 RFVQVIDKQVVASFTSRELVMGIR--------------------------N-----PRSDLNSSGYGLSEVEI---AMKE 267 (330) Q Consensus 222 ~Y~q~~~~~~~~~~~~~dvih~~~--------------------------n-----~~~d~~~~~yGlSPIe~---a~~~ 267 (330) +|+...+...+..++.+.|.+... + |.-.+.....|.|-++- ...+ T Consensus 178 r~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa 257 (478) T protein:vir:10 178 RVYELDGAERVEYWTKDDVTFYELKEGQLIPDFYRSEDHIQPHYYQGNKLMSWGRVPFIPFKNNPQEVSDLFMYKTIIDA 257 (478) T ss_pred EEEeeeCceEEEEEeCCcEEEEEecCCeeeccccccccccccceecccccccCCcceEEEeccCCCCCCcHHHHHHHHHH Confidence 333332222233344444433211 0 10001112346665544 4444 Q ss_pred HHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCC--HHHHHHHHHHHHHHhcCccccc---------------------- Q lcl|NC_019511. 268 FIAYNNTESFNDRFFSHGGTTRGILQIRADQQQS--QHALENFKREWKSSFSGINGSW---------------------- 323 (330) Q Consensus 268 I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls--~e~~e~lr~~w~~~~~G~~na~---------------------- 323 (330) +...++--.-..++|++ |- +.+.|. ..+ .+....++..-.-...|..+++ T Consensus 258 ~~~~~S~~~~~~~~~~~---~~--~~~~g~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~ 331 (478) T protein:vir:10 258 LDKRLSDTQNTFDESVE---LI--YILKGY-EGEDMKDFMHNLKYYKAISVAGESGSGVDTIKVEVPIDSVKEYTKMLRD 331 (478) T ss_pred HHHHHHHHHHHHHHhhC---cc--eeeecC-CcccccchhhhhhhCceeEecCCCCCcceEEeecCCHHHHHHHHHHHHH Confidence 44333333333344443 21 222221 111 1111111111000011111111 Q ss_pred ------ccc-eeeC Q lcl|NC_019511. 324 ------QIC-LYIK 330 (330) Q Consensus 324 ------kvp-vL~e 330 (330) .+| +-.+ T Consensus 332 ~I~~~s~~p~~~~~ 345 (478) T protein:vir:10 332 YIIEFGQGVDFQQD 345 (478) T ss_pred HHHHHhCCcCcCcc Confidence 011 0111 No 182 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=44.49 E-value=0.8 Score=21.10 Aligned_cols=278 Identities=12% Similarity=0.098 Sum_probs=95.2 Q ss_pred CchhHHHHHhcCC--CCC-----CcccccCccCcchhH---HHHHH-HH---HHHHhhcccchhccccchhccccccccc Q lcl|NC_019511. 1 MPDLFKSLRLGSM--YKE-----DTEDLMVPIDDGIQA---NIRQI-EQ---DTKEMQEITKSLYGKQQAYAEPFLEMMD 66 (330) Q Consensus 1 ~~~~~~~~~~~~~--~~~-----~~~~~~~~~~~~~~~---~~~~~-~~---~~~~~~~~~~~~~g~~~~~~~~~~~~~~ 66 (330) +...-.-|-+++. +|. +.....+..+++... -++.+ ++ +..++.+-..=-.|+......+ T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~I~~~~------ 80 (492) T protein:vir:94 7 ISQVAQALIKGGNILYPSQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEP------ 80 (492) T ss_pred HHHHHHHHhcCCceeecCccchhhhhhcccccCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccc------ Confidence 1222223333332 222 211212222222211 11111 11 1111111111011221111000 Q ss_pred cCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHH Q lcl|NC_019511. 67 TNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRI 146 (330) Q Consensus 67 ~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i 146 (330) ++.+..... ..... ..| ++ ++....|+++.+.-+ +..+.++ ... + .+..+.+ T Consensus 81 -~~~~~~~~~-~~~~~----~~r-i~-~n~~k~Ivd~~~~yl--~G~p~~~---------~~~--d-------~~~~~~l 132 (492) T protein:vir:94 81 -KPVDATGAV-DPLKP----DDR-MI-TNFHANLVDQKVSYI--VGKPIAF---------KHT--D-------DEVVKRI 132 (492) T ss_pred -ccccccccc-ccccc----ccc-cc-cchHHHHHHHHHhhh--cccCcee---------ccC--c-------hHHHHHH Confidence 000000000 00000 000 00 344555555544332 1222221 111 1 1123344 Q ss_pred HHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCC-CcccCCceeEEE Q lcl|NC_019511. 147 EEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKN-GKIIKGGNRFVQ 225 (330) Q Consensus 147 ~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~-G~~~~~~~~Y~q 225 (330) ..++. | ++...+..+..+.+.+|.++..+..+.+ |++ .+..++|..+.++.++. ...+.-.++|+. T Consensus 133 ~~~~~-------n---~~~~~~~~~~~~a~~~G~a~~~v~~d~d--g~~-~~~~~~p~~~~~v~d~~~~~~~~a~ir~~~ 199 (492) T protein:vir:94 133 DEVLG-------N---RFDDKLHSVLTGASNKGIEWLHPYLDEE--GEF-KLFRVPAEQGIPIWTDKEHEELEAFIRMYK 199 (492) T ss_pred HHHHh-------c---cHHHHHHHHHHHHhhCCeEEEEEEecCC--Cce-EEEEEcccceEEEEcCCCCCceEEEEEEEe Confidence 44431 1 3556667778899999988777655444 553 57789999988876532 111222334433 Q ss_pred EeCCceEEEechhHeeeecc----------------------cCcCCC-----CCCCccccHHHHHHHHHHHHHHHHHHH Q lcl|NC_019511. 226 VIDKQVVASFTSRELVMGIR----------------------NPRSDL-----NSSGYGLSEVEIAMKEFIAYNNTESFN 278 (330) Q Consensus 226 ~~~~~~~~~~~~~dvih~~~----------------------n~~~d~-----~~~~yGlSPIe~a~~~I~~~laae~~~ 278 (330) ..+...+..++...+.+... |+...+ ..+.+|+|-++-....+...-.+..-. T Consensus 200 ~~~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~~~sd~e~v~~liDa~d~~~S~~ 279 (492) T protein:vir:94 200 LENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDL 279 (492) T ss_pred eccceeEEEEecCeEEEEEEecCeeeeccccccccccccccccCCCccceEEecCCCCCCCchHHHHHHHHHHHHHHHHH Confidence 33333333333333333211 111111 012246666665444444333222222 Q ss_pred HHHHhcCCCcceEEEeCCCC-CCCHHHHHHHHHHHHHHhcCccccccccee-eC Q lcl|NC_019511. 279 DRFFSHGGTTRGILQIRADQ-QQSQHALENFKREWKSSFSGINGSWQICLY-IK 330 (330) Q Consensus 279 ~~fF~nGa~p~GiL~~~~~~-~ls~e~~e~lr~~w~~~~~G~~na~kvpvL-~e 330 (330) ++.+...+.|--++ .|.. .-..+....++..+--.. + .+ +.+-.| -+ T Consensus 280 ~~~~~~~~~p~lv~--~g~~~~~~~~~~~~~~~~~~~~~-~-~~-~~~~~l~~~ 328 (492) T protein:vir:94 280 SNTFKDSNELTYVL--KNYDDQELPEFKRLLRYYGAIKV-S-DN-GGVDTIQVE 328 (492) T ss_pred HHHHHHhcCceeee--ecCCcccchhhHHHHhhccceec-C-CC-CcceeEecc Confidence 33334445554333 3321 111122222222211111 1 11 122222 12 No 183 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=42.74 E-value=0.87 Score=20.91 Aligned_cols=265 Identities=12% Similarity=0.078 Sum_probs=91.1 Q ss_pred CCCCCcccccC-------ccC-----cchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcC-CCccc Q lcl|NC_019511. 13 MYKEDTEDLMV-------PID-----DGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDK-KSYMR 79 (330) Q Consensus 13 ~~~~~~~~~~~-------~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~-~s~~r 79 (330) .||+-.....+ +.+ +.+...+.+-.++..++.+...=-.|+......+ ++.+... ....+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~-------~~~~~~~~~~~~~ 73 (472) T protein:vir:93 1 MYPSQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEP-------KPVDATGAVDPLK 73 (472) T ss_pred CCCCCCcchhhhhceeeecCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccc-------chhhccccccccc Confidence 44433222111 111 1111111111222112211111112221111000 0000000 00000 Q ss_pred chHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCC Q lcl|NC_019511. 80 NAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDI 159 (330) Q Consensus 80 ~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn 159 (330) . ..| +. .+....|+++.+.-+ |..+. .+...| .+....+..++. | T Consensus 74 --~----~~r-i~-~n~~~~ivd~~~~~l--~g~~~---------~~~~~d---------~~~~~~l~~~~~-------n 118 (472) T protein:vir:93 74 --P----DDR-MI-TNFHANLVDQKVSYI--VGKPI---------AFKHTD---------DEVVKRIDEVLG-------N 118 (472) T ss_pred --c----ccc-cc-cchHHHHHHHHhhhh--cccCe---------eeccCC---------hHHHHHHHHHHh-------c Confidence 0 000 00 244555555544433 12222 221111 112333444431 1 Q ss_pred CcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCC-CcccCCceeEEEEeCCceEEEechh Q lcl|NC_019511. 160 DRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKN-GKIIKGGNRFVQVIDKQVVASFTSR 238 (330) Q Consensus 160 ~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~-G~~~~~~~~Y~q~~~~~~~~~~~~~ 238 (330) ++...+..+..+.+.+|.++..+..+. .|+ +.+..++|..+.++.++. .....-.++|+...+...+..++.. T Consensus 119 ---~~~~~~~~~~~~~~~~G~~~~~v~~d~--d~~-~~i~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~ 192 (472) T protein:vir:93 119 ---RFDDKLHSVLTGASNKGIEWLHPYLDE--EGE-FKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKV 192 (472) T ss_pred ---cHHHHHHHHHHHHhhcCeEEEEEEECC--CCc-eEEEEEcccceEEEEcCCCCCceEEEEEEEEeecceeEEEEecC Confidence 355566667789999998877765544 455 357779999988876532 1112222333332222222222322 Q ss_pred Heeeec----------------------ccCcC-----CCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceE Q lcl|NC_019511. 239 ELVMGI----------------------RNPRS-----DLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGI 291 (330) Q Consensus 239 dvih~~----------------------~n~~~-----d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~Gi 291 (330) .+.+.. .|+.. -+..+.+|.|-++-....|.....+-.-.++-+...+.|--+ T Consensus 193 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~g~s~~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~ 272 (472) T protein:vir:93 193 TVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYV 272 (472) T ss_pred eEEEEEEecCeeeecccccccccccccccCCCCCcceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeE Confidence 222211 01111 111123577777655544443222222222333444556444 Q ss_pred EEeCCCCCC-CHHHHHHHHHHHHHHhcCccccccccee-eC Q lcl|NC_019511. 292 LQIRADQQQ-SQHALENFKREWKSSFSGINGSWQICLY-IK 330 (330) Q Consensus 292 L~~~~~~~l-s~e~~e~lr~~w~~~~~G~~na~kvpvL-~e 330 (330) + .|.... ..+....++... . ... ...+.+-.| .+ T Consensus 273 ~--~g~~~~~~~~~~~~~~~~~-~-~~~-~~~~~~~~l~~~ 308 (472) T protein:vir:93 273 L--TNYDDQELPEFKRLLRYYG-A-IKV-SDNGGVDTIQVE 308 (472) T ss_pred e--ecCCcccchhhHHHHhhcc-c-ccc-CCCCcceeEeec Confidence 3 332111 112222222111 1 111 111222222 12 No 184 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=42.03 E-value=0.9 Score=20.83 Aligned_cols=261 Identities=11% Similarity=0.047 Sum_probs=95.9 Q ss_pred CCCCCcccccCccCcc---hhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHH Q lcl|NC_019511. 13 MYKEDTEDLMVPIDDG---IQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLK 89 (330) Q Consensus 13 ~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr 89 (330) .+ -..++.-..++. ++..+++++.+.-++.+-..=-.|+... +..+ ++....+..++ T Consensus 1 ~~--~~i~~~~~~~~~~~~~~~l~~~~~~~~~r~~~~~~Yy~G~~~i-----------------~~~~-~~~~~~~~~~~ 60 (485) T protein:vir:10 1 MT--APLPGQEEIEDPAIARDEMVSAFEDSTQNLKTNTSYYEAERRP-----------------EAIG-VTVPIQMQSLL 60 (485) T ss_pred CC--CCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcc-----------------hhcC-CCCChhhhhhh Confidence 11 112212112221 1222333333333332222212222211 0000 00011111222 Q ss_pred HHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHH Q lcl|NC_019511. 90 KFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCK 169 (330) Q Consensus 90 ~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~ 169 (330) . .+.....|+++.++.+- ..||.+- + +. .. -..+.+++.. .++..... T Consensus 61 ~--~~n~~~~ivd~~~~~l~-----------~~g~~~~--~-~~----~~---~~~~~~i~~~---------N~~d~~~~ 108 (485) T protein:vir:10 61 A--HVGYPRLYVDSIAERQA-----------VEGFRFG--D-AD----EA---DEELWQWWQA---------NNLDIEAP 108 (485) T ss_pred h--hcCcHHHHHHHHHhhhc-----------ccceecC--C-Cc----hh---HHHHHHHHHh---------cCHhHHHH Confidence 1 13455677766665541 2355421 1 11 11 1223333321 24667778 Q ss_pred HHHHHHHhcCCceeEEEEecCC------CcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCce---EEEechhHe Q lcl|NC_019511. 170 KIVRDTYTYDQVNFEKVFSPKN------KTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQV---VASFTSREL 240 (330) Q Consensus 170 ~~v~d~L~~g~g~~~~v~~rd~------~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~---~~~~~~~dv 240 (330) .+..+++++|.+|..+..+..+ .|. ..+.+++|..+.+..|+........+++++...++. ...|+.+.+ T Consensus 109 ~~~~~a~i~G~ay~~v~~~e~~~~~~~~~~~-~~i~~~~p~~~~~~~D~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~ 187 (485) T protein:vir:10 109 LGYTDAYVHGRSYITISRPDPQIDLGWDPNT-PIIRVEPPTRMYAEIDPRIGRVSKAIRVAYDAEGNEIQAATLYTPNDI 187 (485) T ss_pred HHHHHHhhcCceEEEEeeCCcccccccCCCe-eEEEEEccceeEEEEcCCCCceeEEEEEEEeeCCCeEEEEEEEeCCeE Confidence 8899999999887776543321 122 357778998888777654322222233333333221 112333333 Q ss_pred e-------------------------eecccCcCCCCCCCccccHHH----HHHHHHHHHHHHHHHHHHHHhc------C Q lcl|NC_019511. 241 V-------------------------MGIRNPRSDLNSSGYGLSEVE----IAMKEFIAYNNTESFNDRFFSH------G 285 (330) Q Consensus 241 i-------------------------h~~~n~~~d~~~~~yGlSPIe----~a~~~I~~~laae~~~~~fF~n------G 285 (330) . ++..|+.. .+.||.|-|+ ....++...+.--.-...||+. | T Consensus 188 ~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~---~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G 264 (485) T protein:vir:10 188 FGWYRVENEWQEWFNNPHGLGVVPVVPIPNRTRL---SDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFG 264 (485) T ss_pred EEEEEcCCceEEeccccCCCCcccEEEecccccc---CCCCCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchHHHHhc Confidence 2 33322221 1346887553 3334444444333233344432 2 Q ss_pred CC------------------cceEEEeCCCC----CCCHH----HHHHHHHHHHHHhcCccc-------------ccccc Q lcl|NC_019511. 286 GT------------------TRGILQIRADQ----QQSQH----ALENFKREWKSSFSGING-------------SWQIC 326 (330) Q Consensus 286 a~------------------p~GiL~~~~~~----~ls~e----~~e~lr~~w~~~~~G~~n-------------a~kvp 326 (330) .. ++.++..+++. +++.. -+++|+...... +++.+ +--++ T Consensus 265 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~d~k~~q~~~~~~~~~~~~l~~~i~~~-~~~~~~p~~~fg~~~~n~~Sg~A 343 (485) T protein:vir:10 265 IKPEEIGVDPETGQTLFDAYLARILAFEDAEGKIQQFSAAELANFTNALDQIAKQV-AAYTGLPPQYLSTAADNPASAEA 343 (485) T ss_pred CCcccccccccccchhhhhcccceeccCCCCceEEeecccchHHHHHHHHHHHHHH-hcccCCCHHHhccccCchhHHHH Confidence 21 22223222111 11111 123333333322 11100 00011 Q ss_pred eeeC Q lcl|NC_019511. 327 LYIK 330 (330) Q Consensus 327 vL~e 330 (330) +-.. T Consensus 344 l~~~ 347 (485) T protein:vir:10 344 IRAA 347 (485) T ss_pred HHHH Confidence 1111 No 185 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=41.43 E-value=0.93 Score=20.76 Aligned_cols=263 Identities=11% Similarity=0.080 Sum_probs=87.3 Q ss_pred CchhHHHHHhcCC--CCCCcccc--cCccC-----cchhHHHHHHHHHHHHhhcccchhccccchhcccccc-----ccc Q lcl|NC_019511. 1 MPDLFKSLRLGSM--YKEDTEDL--MVPID-----DGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLE-----MMD 66 (330) Q Consensus 1 ~~~~~~~~~~~~~--~~~~~~~~--~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~-----~~~ 66 (330) |- ++.++ ++..+.+. .++.+ +.+...+++-.++..++.+...=-.|+......+... ... T Consensus 1 ~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ 74 (474) T protein:vir:95 1 MI------NIIRMPWDKPYGEEVVEQMKPKVETQEEMIIRLINNHKQKLKDINVGQKYYDKDNDINYQAYKQDLHGNIDY 74 (474) T ss_pred Cc------ccccCCCCCCCCcchhhhccccccchHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccchhhhcccccc Confidence 11 21111 11111110 00111 1112222222222222222111112222111100000 000 Q ss_pred cCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHH Q lcl|NC_019511. 67 TNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRI 146 (330) Q Consensus 67 ~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i 146 (330) .+|.+++. ++....|+++.+.-+ |..+.+++ .. + .+..+.+ T Consensus 75 ~~~~~ki~-------------------~n~~k~Iv~~~~~yl--~g~p~~~~---------~~--~-------~~~~~~l 115 (474) T protein:vir:95 75 TKPDWRIT-------------------TNFHQNLVDQKVSYV--AGKPVTYA---------HD--D-------DKVLDVI 115 (474) T ss_pred cccccccc-------------------cchHHHHHHhhhhhh--cccCceec---------cC--C-------hHHHHHH Confidence 01111110 233445544443332 12332221 11 1 1122344 Q ss_pred HHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCC--CcccCCceeEE Q lcl|NC_019511. 147 EEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKN--GKIIKGGNRFV 224 (330) Q Consensus 147 ~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~--G~~~~~~~~Y~ 224 (330) .+++. .++...+..+..+++.+|.++..+.++. .|+ +.+..++|..+.++.++. +. ..-.++|+ T Consensus 116 ~~~~~----------n~~~~~~~~l~~~~~~~G~~~~~~~~d~--~~~-~~i~~~~p~~~~~v~d~~~~~~-~~a~ir~~ 181 (474) T protein:vir:95 116 HQVLD----------TRWDNKLIDILTAASNKGIDWLQVYINE--DGE-LKLFRVPAEQAIPIWTDKEREQ-LNAFIRIF 181 (474) T ss_pred HHHHh----------ccHHHHHHHHHHHHhhCCeEEEEeeeCC--CCc-eEEEEEcccceEEEEcCCCCCc-eEEEEEEE Confidence 44431 1466667778899999998877765544 355 467779999988876542 22 11222332 Q ss_pred EEeCCceEEEechhHeeeecc----------------------cCcC-----CCCCCCccccHH---HHHHHHHHHHHHH Q lcl|NC_019511. 225 QVIDKQVVASFTSRELVMGIR----------------------NPRS-----DLNSSGYGLSEV---EIAMKEFIAYNNT 274 (330) Q Consensus 225 q~~~~~~~~~~~~~dvih~~~----------------------n~~~-----d~~~~~yGlSPI---e~a~~~I~~~laa 274 (330) .......+..++.+.+.+... ++.. -+..+.+|.|-+ .....++...++- T Consensus 182 ~~~~~~~~~vy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~~~~d~e~v~~liDa~d~~~S~ 261 (474) T protein:vir:95 182 TFNGETKVEYWTAETVTYYVYENGGLIPDFYYGDEHIQTHFSTGSWERVPFIAFKNNPEEVSDIWMYKSFVDAIDKRLSD 261 (474) T ss_pred eecCeeEEEEEeCCeEEEEEEcCCceeeccccccccccCcccccCCCccceEEecCCCCCCCchHHHHHHHHHHHHHHHH Confidence 221112223344444443211 0000 000122355544 4444444444443 Q ss_pred HHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHh-cCccccccc-ceeeC Q lcl|NC_019511. 275 ESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSF-SGINGSWQI-CLYIK 330 (330) Q Consensus 275 e~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~-~G~~na~kv-pvL~e 330 (330) -.-...+|++ |- |.+.|. +.+....+...++... -.+..-+.+ .+.-+ T Consensus 262 ~~~~~~~~~~---p~--lv~~g~---~~~~~~~~~~~~~~~~~i~~~~~~~~~~l~~~ 311 (474) T protein:vir:95 262 VQNMFDESVE---LI--YILRGY---EGEDLSEFMEGLKYYKAINVSSDGGVETIQVE 311 (474) T ss_pred HHHHHHHhhc---ch--hhhcCC---CcccccchhhhhhccceeeccCCCceeEEecc Confidence 3333444543 32 222231 1111122222222110 000000111 01111 No 186 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=41.43 E-value=0.93 Score=20.76 Aligned_cols=263 Identities=11% Similarity=0.080 Sum_probs=87.3 Q ss_pred CchhHHHHHhcCC--CCCCcccc--cCccC-----cchhHHHHHHHHHHHHhhcccchhccccchhcccccc-----ccc Q lcl|NC_019511. 1 MPDLFKSLRLGSM--YKEDTEDL--MVPID-----DGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLE-----MMD 66 (330) Q Consensus 1 ~~~~~~~~~~~~~--~~~~~~~~--~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~-----~~~ 66 (330) |- ++.++ ++..+.+. .++.+ +.+...+++-.++..++.+...=-.|+......+... ... T Consensus 1 ~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ 74 (474) T protein:vir:96 1 MI------NIIRMPWDKPYGEEVVEQMKPKVETQEEMIIRLINNHKQKLKDINVGQKYYDKDNDINYQAYKQDLHGNIDY 74 (474) T ss_pred Cc------ccccCCCCCCCCcchhhhccccccchHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccchhhhcccccc Confidence 11 21111 11111110 00111 1112222222222222222111112222111100000 000 Q ss_pred cCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHH Q lcl|NC_019511. 67 TNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRI 146 (330) Q Consensus 67 ~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i 146 (330) .+|.+++. ++....|+++.+.-+ |..+.+++ .. + .+..+.+ T Consensus 75 ~~~~~ki~-------------------~n~~k~Iv~~~~~yl--~g~p~~~~---------~~--~-------~~~~~~l 115 (474) T protein:vir:96 75 TKPDWRIT-------------------TNFHQNLVDQKVSYV--AGKPVTYA---------HD--D-------DKVLDVI 115 (474) T ss_pred cccccccc-------------------cchHHHHHHhhhhhh--cccCceec---------cC--C-------hHHHHHH Confidence 01111110 233445544443332 12332221 11 1 1122344 Q ss_pred HHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCC--CcccCCceeEE Q lcl|NC_019511. 147 EEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKN--GKIIKGGNRFV 224 (330) Q Consensus 147 ~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~--G~~~~~~~~Y~ 224 (330) .+++. .++...+..+..+++.+|.++..+.++. .|+ +.+..++|..+.++.++. +. ..-.++|+ T Consensus 116 ~~~~~----------n~~~~~~~~l~~~~~~~G~~~~~~~~d~--~~~-~~i~~~~p~~~~~v~d~~~~~~-~~a~ir~~ 181 (474) T protein:vir:96 116 HQVLD----------TRWDNKLIDILTAASNKGIDWLQVYINE--DGE-LKLFRVPAEQAIPIWTDKEREQ-LNAFIRIF 181 (474) T ss_pred HHHHh----------ccHHHHHHHHHHHHhhCCeEEEEeeeCC--CCc-eEEEEEcccceEEEEcCCCCCc-eEEEEEEE Confidence 44431 1466667778899999998877765544 355 467779999988876542 22 11222332 Q ss_pred EEeCCceEEEechhHeeeecc----------------------cCcC-----CCCCCCccccHH---HHHHHHHHHHHHH Q lcl|NC_019511. 225 QVIDKQVVASFTSRELVMGIR----------------------NPRS-----DLNSSGYGLSEV---EIAMKEFIAYNNT 274 (330) Q Consensus 225 q~~~~~~~~~~~~~dvih~~~----------------------n~~~-----d~~~~~yGlSPI---e~a~~~I~~~laa 274 (330) .......+..++.+.+.+... ++.. -+..+.+|.|-+ .....++...++- T Consensus 182 ~~~~~~~~~vy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~~~~d~e~v~~liDa~d~~~S~ 261 (474) T protein:vir:96 182 TFNGETKVEYWTAETVTYYVYENGGLIPDFYYGDEHIQTHFSTGSWERVPFIAFKNNPEEVSDIWMYKSFVDAIDKRLSD 261 (474) T ss_pred eecCeeEEEEEeCCeEEEEEEcCCceeeccccccccccCcccccCCCccceEEecCCCCCCCchHHHHHHHHHHHHHHHH Confidence 221112223344444443211 0000 000122355544 4444444444443 Q ss_pred HHHHHHHHhcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHh-cCccccccc-ceeeC Q lcl|NC_019511. 275 ESFNDRFFSHGGTTRGILQIRADQQQSQHALENFKREWKSSF-SGINGSWQI-CLYIK 330 (330) Q Consensus 275 e~~~~~fF~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~~-~G~~na~kv-pvL~e 330 (330) -.-...+|++ |- |.+.|. +.+....+...++... -.+..-+.+ .+.-+ T Consensus 262 ~~~~~~~~~~---p~--lv~~g~---~~~~~~~~~~~~~~~~~i~~~~~~~~~~l~~~ 311 (474) T protein:vir:96 262 VQNMFDESVE---LI--YILRGY---EGEDLSEFMEGLKYYKAINVSSDGGVETIQVE 311 (474) T ss_pred HHHHHHHhhc---ch--hhhcCC---CcccccchhhhhhccceeeccCCCceeEEecc Confidence 3333444543 32 222231 1111122222222110 000000111 01111 No 187 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=39.69 E-value=1 Score=20.57 Aligned_cols=272 Identities=10% Similarity=0.049 Sum_probs=90.9 Q ss_pred CchhHHHHHhcCC-------CCCCcccccCccCcch---hHHHHHH----HHHHHHhhcccchhccccchhccccccccc Q lcl|NC_019511. 1 MPDLFKSLRLGSM-------YKEDTEDLMVPIDDGI---QANIRQI----EQDTKEMQEITKSLYGKQQAYAEPFLEMMD 66 (330) Q Consensus 1 ~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~---~~~~~~~----~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~ 66 (330) |+ .-|-+++. +-++....++..+.+. ...++.+ .....++.+...=-.|+...+..+... T Consensus 1 ~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~--- 74 (483) T protein:vir:12 1 MA---QALIKGGNILYPSQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPV--- 74 (483) T ss_pred Cc---cchhcCCceeecCcchhhhhhhcccccCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccc--- Confidence 32 22222222 2222111112122111 1111111 111111111111112222111111000 Q ss_pred cCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHH Q lcl|NC_019511. 67 TNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRI 146 (330) Q Consensus 67 ~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i 146 (330) .+.....+.. .+ . +++ ++....|+++.+.-+ |..+.++ ... + .+..+.+ T Consensus 75 -~~~~~~~~~~-~~-------~-ki~-~n~~k~Ivd~~~~~l--~G~p~~~---------~~~--d-------~~~~~~l 123 (483) T protein:vir:12 75 -DATGAVDPLK-PD-------D-RMI-TNFHANLVDQKVSYI--VGKPIAF---------KHT--D-------DEVVKRI 123 (483) T ss_pred -cccccccccc-cc-------c-ccc-cchHHHHHHHHhhhh--cccCcee---------ccC--C-------hHHHHHH Confidence 0000000000 00 0 000 344555555544332 1222221 111 1 1123344 Q ss_pred HHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCC-CcccCCceeEEE Q lcl|NC_019511. 147 EEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKN-GKIIKGGNRFVQ 225 (330) Q Consensus 147 ~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~-G~~~~~~~~Y~q 225 (330) .+++. | ++...+..+..+++.+|.+|..+..+.+ |++ .+..++|..+.++.++. ...+.-.++|+. T Consensus 124 ~~~~~-------n---~~~~~~~~~~~~~~~~G~~y~~v~~d~d--~~~-~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~ 190 (483) T protein:vir:12 124 DEVLG-------N---RFDDKLHSVLTGASNKGIEWLHPYLDEE--GEF-KLFRVPAEQGIPIWTDKEHEELEAFIRMYK 190 (483) T ss_pred HHHHh-------c---cHHHHHHHHHHHHhhCCeEEEEEEEcCC--Cce-EEEEEcccceEEEEcCCCCCceEEEEEEEE Confidence 44431 1 3455566677899999988777665444 553 57789999988876542 111222233333 Q ss_pred EeCCceEEEechhHeeeec----------------------ccCcCC-----CCCCCccccHHHHHH---HHHHHHHHHH Q lcl|NC_019511. 226 VIDKQVVASFTSRELVMGI----------------------RNPRSD-----LNSSGYGLSEVEIAM---KEFIAYNNTE 275 (330) Q Consensus 226 ~~~~~~~~~~~~~dvih~~----------------------~n~~~d-----~~~~~yGlSPIe~a~---~~I~~~laae 275 (330) ..+...+..++...+.|.. .|+... +..+.+|.|-++-.. .++...++-- T Consensus 191 ~~~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~ 270 (483) T protein:vir:12 191 LENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDL 270 (483) T ss_pred eecceEEEEEecCeEEEEEEeCCeeeecccccccccccccccCCCCccceEEecCCCCCCCchhhHHHHHHHHHHHHHHH Confidence 3222222333333333221 011110 001224666555443 3444333333 Q ss_pred HHHHHHHhcCCCcceEEEeCCCCCCC-HHHHHHHHHHHHHHhcCcccccccceee-C Q lcl|NC_019511. 276 SFNDRFFSHGGTTRGILQIRADQQQS-QHALENFKREWKSSFSGINGSWQICLYI-K 330 (330) Q Consensus 276 ~~~~~fF~nGa~p~GiL~~~~~~~ls-~e~~e~lr~~w~~~~~G~~na~kvpvL~-e 330 (330) .-..++|+ .|--++ .|..... .+....++....-...+ + +.+-.|. + T Consensus 271 ~~~~~~~~---~~~lv~--~g~~~~~~~~~~~~~~~~~~~~~~~--~-~~~~~l~~~ 319 (483) T protein:vir:12 271 SNTFKDSN---ELTYVL--TNYDDQELPEFKRLLRYYGAIKVSD--N-GGVDTIQVE 319 (483) T ss_pred HHHHHHhc---Cceeee--ecCCcccchhHHHhhhhccccccCC--C-CcceEEeec Confidence 33334444 443333 3321111 12122222211111111 1 1222221 1 No 188 >protein:vir:10447 Length: 536 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848294;genbank:gi:30387485;genbank:GeneID:1733984 Probab=39.04 E-value=1 Score=20.50 Aligned_cols=278 Identities=11% Similarity=0.026 Sum_probs=95.7 Q ss_pred CchhHHHHHhcCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccc Q lcl|NC_019511. 1 MPDLFKSLRLGSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRN 80 (330) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~ 80 (330) |++ .| . ... ..-++.+-+.+.+.-..=.+..-.=-.|..|....-+..+. T Consensus 1 m~~-~~---------~------~~~----~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~---------- 50 (536) T protein:vir:10 1 MAE-KR---------T------GLA----EDGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNA---------- 50 (536) T ss_pred Ccc-hh---------h------chh----HHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcc---------- Confidence 111 00 0 000 00111111111110000000000011333342111111000 Q ss_pred hHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCccc-----ChhhH----HHHHHHHHHHH Q lcl|NC_019511. 81 AHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATP-----GIKEK----EQMKRIEEFIL 151 (330) Q Consensus 81 ~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~-----~~~~~----~~~~~i~~~l~ 151 (330) + +.+.+.- .+....|+++++..+..-..|+ +=|+++.-.+.+. .+.+. +-...+++.+. T Consensus 51 -~---~~~~~~~-dst~~~a~~~Laa~l~~~ltP~-------~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~ 118 (536) T protein:vir:10 51 -S---TDYQTPW-QAVGARGLNNLASKLMLALFPM-------QTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIM 118 (536) T ss_pred -c---ccccccc-cccHHHHHHHHHHHHHhhhcCC-------CcccccccChhhhhccccchhhHHHHHHHHHHHHHHHH Confidence 0 0000000 2344566677777775432343 1145543222111 11111 12333444443 Q ss_pred hccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccC------------- Q lcl|NC_019511. 152 NTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIK------------- 218 (330) Q Consensus 152 ~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~------------- 218 (330) .-+. +.+|+.=+.....|++++|++..|+.-+..+.+.....|||. ++.+..|..|++.+ T Consensus 119 ~~l~-----~snf~~~~~~~~~~L~~~G~a~ly~~e~~~~~~~~~~~~pl~--~~~v~~d~~G~vd~i~r~~~~t~~~l~ 191 (536) T protein:vir:10 119 NYIE-----SNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLS--SYVVQRDAFGNVLQMVTRDQIAFGALP 191 (536) T ss_pred HHHH-----hcCcHHHHHHHHHHHHhHCcEeEEEeeCCCCceeeEEEEEcC--eEEEeeCCCCCeeEEeeeeeccHHHHH Confidence 3322 235666666777899999999988754443334456677763 44455555553220 Q ss_pred -----------------CceeE---------------EEEeCCceEEEe----chhH--eeeecccCcCCCCCCCccccH Q lcl|NC_019511. 219 -----------------GGNRF---------------VQVIDKQVVASF----TSRE--LVMGIRNPRSDLNSSGYGLSE 260 (330) Q Consensus 219 -----------------~~~~Y---------------~q~~~~~~~~~~----~~~d--vih~~~n~~~d~~~~~yGlSP 260 (330) ....+ ++..+|..+... .-++ .+..+.+..+ ...||.|| T Consensus 192 ~~fg~~~~~~~~~~~~~~~v~v~~~V~~~~~~~~~~~~~e~~g~~v~~~~g~~~f~~~P~i~~Rw~~~~---ge~YGrgp 268 (536) T protein:vir:10 192 EDIRKAVEGQGGEKKADETIDVYTHIYLDEASGEYLRYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLD---GESYGRSY 268 (536) T ss_pred HhhhhhhcccccccCcccceEEEEEEEEecCCCcEEEEEeecCccccccccccccccCCceeeeeeecC---CCccccch Confidence 01111 011111111000 0001 1112222222 24589999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcC------CCcceEEEe------------C--------------CCCCCCHHHHHHH Q lcl|NC_019511. 261 VEIAMKEFIAYNNTESFNDRFFSHG------GTTRGILQI------------R--------------ADQQQSQHALENF 308 (330) Q Consensus 261 Ie~a~~~I~~~laae~~~~~fF~nG------a~p~GiL~~------------~--------------~~~~ls~e~~e~l 308 (330) ++-++-.+.......+-....-.-- ..|+|++.. + ++-....+.++.+ T Consensus 269 ~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~ 348 (536) T protein:vir:10 269 IEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAI 348 (536) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccCcccccchhhhccCCCcceecCCcccceeeeccccccchHHHHHHHHH Confidence 9887776655444333222210100 112222110 0 0001123344555 Q ss_pred HHHHHHHhcCccc--ccccceeeC Q lcl|NC_019511. 309 KREWKSSFSGING--SWQICLYIK 330 (330) Q Consensus 309 r~~w~~~~~G~~n--a~kvpvL~e 330 (330) +...+..|--... ...-.|--+ T Consensus 349 ~~rI~~af~~~~l~~~~~~r~TAt 372 (536) T protein:vir:10 349 EARLSFAFMLNSAVQRTGERVTAE 372 (536) T ss_pred HHHHHHHHhhhhcccCCCCCccHH Confidence 5555444421100 000000000 No 189 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=38.92 E-value=1 Score=20.48 Aligned_cols=258 Identities=12% Similarity=0.105 Sum_probs=93.4 Q ss_pred CCCCCCcccccCccCcc-----hhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHH Q lcl|NC_019511. 12 SMYKEDTEDLMVPIDDG-----IQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHE 86 (330) Q Consensus 12 ~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~ 86 (330) -.++++.. -.+|.|.. +.-.+++.+.+.-++.+-..=-.|+.... .+|. +.....+ T Consensus 1 ~~~~~~~~-~~~p~d~~~~~~~l~~~i~~~~~~~~r~~~~~~yy~g~~~i~---------~~~~---~~~~~~~------ 61 (453) T protein:vir:39 1 MKYKPPKL-MTFPKDEPITNEVVTKFMEKHRLEVARYEYLKNMYRGIMAID---------AEPT---KDLWKPD------ 61 (453) T ss_pred CeecCCcc-eEcCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHhhccCchh---------cCCC---ccccCcc------ Confidence 12223211 13344332 22223332333222222222122332111 1111 0000000 Q ss_pred HHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHH Q lcl|NC_019511. 87 VLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQE 166 (330) Q Consensus 87 ~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~ 166 (330) .| .. ++....|+++.++-+ |..+.+ +...++ +....+.+++.. ..+.. T Consensus 62 -~k-i~-~n~~~~ivd~~~~~l--~g~~~~---------~~~~d~---------~~~~~l~~i~~~---------N~~~~ 109 (453) T protein:vir:39 62 -NR-LT-VNFTKYIVDTFTGYF--NGIPVK---------KSHSDK---------ETLSKLQEFDNL---------NDMED 109 (453) T ss_pred -ce-ee-cchHHHHHHHHhhhh--cccCce---------eccCCh---------HHHHHHHHHHHh---------cChhH Confidence 01 11 234555555544433 122222 211111 122344444431 14666 Q ss_pred HHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCC-cccCCceeEEEEeCCce--EEEechhHeeee Q lcl|NC_019511. 167 FCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNG-KIIKGGNRFVQVIDKQV--VASFTSRELVMG 243 (330) Q Consensus 167 fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G-~~~~~~~~Y~q~~~~~~--~~~~~~~dvih~ 243 (330) .+..+..+.+.+|.++..+..+. .|++ .+-.++|..+.++.++.. ....-.++|+. ..+.. +..|+.+.+.+. T Consensus 110 ~~~~~~~~~~~~G~~~~~v~~d~--~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~-~~~~~~~~~~yt~~~i~~~ 185 (453) T protein:vir:39 110 EESELAKMACIYGRAFELLYQNE--ETQT-NVIYNTPENMFMVYDDTIKQEPLFAVRYGY-DDDYKLYGEVYTKETTYAL 185 (453) T ss_pred HHHHHHHHHhhcCeEEEEEEecC--CCce-EEEEEcccceEEEecCCCCCeEEEEEEEEE-eCCeEEEEEEEeCCeEEEE Confidence 77778899999998877765544 4554 466689998888766432 21111122221 11111 112333333322 Q ss_pred cc------------cCcC-----CCCCCCccccHHHHHHHHH---HHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHH Q lcl|NC_019511. 244 IR------------NPRS-----DLNSSGYGLSEVEIAMKEF---IAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQH 303 (330) Q Consensus 244 ~~------------n~~~-----d~~~~~yGlSPIe~a~~~I---~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e 303 (330) .. ++.. .+....+|.|-++.....+ ...++--.-...+|+ .|--++ .| ..++++ T Consensus 186 ~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~---~p~~~~--~g-~~~~~~ 259 (453) T protein:vir:39 186 NGTMGFYNMTEQAPNPFDDLPVVEFYFNEERMSIFESVISLVNAFNKAISEKANDVDYFS---DQYLTF--LG-AAVEEE 259 (453) T ss_pred EecCCceeeecccccCCCceeEEEecCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhh---Cceeee--ec-CCCCch Confidence 21 1111 1111345766665444433 333333322334443 343333 23 234444 Q ss_pred HHHHHHHHHHHHhcC-c--ccccccceee-C Q lcl|NC_019511. 304 ALENFKREWKSSFSG-I--NGSWQICLYI-K 330 (330) Q Consensus 304 ~~e~lr~~w~~~~~G-~--~na~kvpvL~-e 330 (330) ..+.++..-.-...| . ...+.+.-|. + T Consensus 260 ~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~ 290 (453) T protein:vir:39 260 DLKNIRSNRVINYYGESSEAKNVDVKFLEKP 290 (453) T ss_pred hhhhhhhcceeeecCCCCCCCCCceeEEeec Confidence 444433210000000 0 0001111111 1 No 190 >protein:vir:106282 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944108;genbank:gi:38640152;genbank:GeneID:2658030 Probab=38.24 E-value=1.1 Score=20.41 Aligned_cols=291 Identities=11% Similarity=0.089 Sum_probs=127.5 Q ss_pred CchhHHHHHhcCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhc---c--ccccccccCCCCCcCC Q lcl|NC_019511. 1 MPDLFKSLRLGSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYA---E--PFLEMMDTNPDYRDKK 75 (330) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~---~--~~~~~~~~~p~~~~~~ 75 (330) |..+|+--.+..- ..+...+.....+... +....|..-.-. . +....+.. .... - T Consensus 5 ~l~lf~f~~k~~e----------------~~~~~~~~~~~~s~~~-p~~~dGa~~I~~~~~~~~~~~~~~~~--~~~~-~ 64 (521) T protein:vir:10 5 FLKLLQPWMKDDE----------------KRVQSDLSDRIDSFAV-PDTADGAIEVDKQIDTTAPKTAIVQS--VLGY-A 64 (521) T ss_pred hhHHhhhhhhhhh----------------hHHhhhhccCcccccc-ccCCCCceeeccCCCccccccchhhh--hhcc-c Confidence 4444433211000 0000000000000000 111111100000 0 00000000 0011 1 Q ss_pred CcccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccC Q lcl|NC_019511. 76 SYMRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGT 155 (330) Q Consensus 76 s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~ 155 (330) +..+|..+..+.-|..|.+|-|..+|+.+.+++. ..+..+---++.+.+ -+.++...++|..--+.+.+++. T Consensus 65 ~~~~n~~eLI~~YR~ma~~pEvd~Av~eIvneai------v~d~~~~pV~i~Ld~--~~~s~~iK~kI~eeF~~Il~ll~ 136 (521) T protein:vir:10 65 PKIQNTKDLINQYRSLSKYHEVDNAIDEIINDAI------VQEDNRDTVYLDLDK--TDWNESVKEMVREEFRTILKLLK 136 (521) T ss_pred cccchHHHHHHHHHHHhhccchhhHHHhhhcceE------EecCCCceEEEEecC--cccchHHHHHHHHHHHHHHHHhc Confidence 1245666666777888889999999999998874 223222223445533 33466655665554333444433 Q ss_pred CCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecC-CCcceEEEEeeCCCceEEee-----CCCCcccCCce--eEEEEe Q lcl|NC_019511. 156 DKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPK-NKTKMEKFIAVDPSTIFYAT-----DKNGKIIKGGN--RFVQVI 227 (330) Q Consensus 156 ~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd-~~G~~~~L~pldp~tV~~~~-----d~~G~~~~~~~--~Y~q~~ 227 (330) ... +..++ ++.-++-|..|+-++++.+ .+.-+.+|..|||..|+.+. +..|....+++ .|+|.. T Consensus 137 F~~----~~~~~----fR~WYVDgRi~fHkiid~~~pk~GI~Elr~lDPr~i~~vr~i~k~~~~~~~v~~~~~e~f~Y~~ 208 (521) T protein:vir:10 137 FER----EGKRH----FRRWYVDSRIYFHKMIDPARPKDGIKELRLLDPRNVEYYRVNLKSNENGNDVYKGVKEFFTYGA 208 (521) T ss_pred cch----hhhHH----HhhheeeeeEEEEEEeeCCCccccceeeeeeCCcceeeeeeecCCCCCcchhhccceeeeeecc Confidence 221 22233 4555677888888888633 33458899999999885442 12222111221 233321 Q ss_pred --------CC--ceEEEechhHeeeecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCC Q lcl|NC_019511. 228 --------DK--QVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRAD 297 (330) Q Consensus 228 --------~~--~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~ 297 (330) +| +....++.+-|.|.+.--. |. ..++.+|=+..|...+......|.-.-=|==--|.-+=|.-+..+ T Consensus 209 ~~~~~~~~~g~~~~~vkI~~daI~y~hSGL~-d~-~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvG 286 (521) T protein:vir:10 209 TEDNRYNISGNSNNLVQIPIDAIVYSHSGKV-DI-DGKTIVGYLHNVIKPANQLKMLEDAMVIYRITRAPERRVFYIDVG 286 (521) T ss_pred CCCceecCCCCCCcceeechhheeeecccce-eC-CCCceeccchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecC Confidence 11 1123466666666542211 22 256788889999888877655554322211122333333333332 Q ss_pred CCCCHHHHHHHHHHHHHHh----------cCcccccccceeeC Q lcl|NC_019511. 298 QQQSQHALENFKREWKSSF----------SGINGSWQICLYIK 330 (330) Q Consensus 298 ~~ls~e~~e~lr~~w~~~~----------~G~~na~kvpvL~e 330 (330) +|.+...++.-+...+.| +.+.|..+..-++| T Consensus 287 -nlpk~KAeqYl~~iM~k~kNklVYDa~TGev~ddrk~msMlE 328 (521) T protein:vir:10 287 -TMPNKKATQHLNNVMQGLKNRVVYDSSTGKVKNSSNNLAMTE 328 (521) T ss_pred -CCCchhHHHHHHHHHHhcCceEEEeccCceeccchhhhhhHh Confidence 344433333333332222 12233333333344 No 191 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=37.25 E-value=1.1 Score=20.30 Aligned_cols=255 Identities=12% Similarity=0.058 Sum_probs=90.7 Q ss_pred ccCccCcc----h-h----HHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHHH Q lcl|NC_019511. 21 LMVPIDDG----I-Q----ANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKF 91 (330) Q Consensus 21 ~~~~~~~~----~-~----~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~ 91 (330) ++.|++.. . . .-+++...+.-++.+-..=-.|+...- . .| ..++.. + ..++.. T Consensus 1 ~~~~i~~~~~~~~~~~~~~~L~~~~~~~~~r~~~~~~YY~G~~~i~--------~-~~-~~~~~~-~-------~~~~~~ 62 (485) T protein:vir:24 1 MTAPLPGQEEIADPAIARDEMVSAFEDQNQNLRSNTSYYEAERRPE--------A-IG-VTVPVQ-M-------QSLLAH 62 (485) T ss_pred CCCCCCCCCcccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCchh--------h-cC-cccchh-h-------hhhhhc Confidence 33333211 1 1 123333333333322211112222110 0 00 011111 1 112211 Q ss_pred hhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHH Q lcl|NC_019511. 92 GNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKI 171 (330) Q Consensus 92 a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~ 171 (330) +.....|+++.++.+- ..||.+- + .+..+ ..+.+++.. .++......+ T Consensus 63 --~n~~~~ivd~~~~~l~-----------~~g~~~~--~--~~~~~------~~l~~i~~~---------N~~d~~~~~~ 110 (485) T protein:vir:24 63 --VGYPRLYVDSIAERQA-----------VEGFRLG--D--ADEAD------EELWQWWQA---------NNLDIEAPLG 110 (485) T ss_pred --cchHHHHHHHHhhhhc-----------cCceecC--C--CchhH------HHHHHHHHh---------cChhHHHHHH Confidence 2344555555444431 2355422 1 11111 122333321 1456677888 Q ss_pred HHHHHhcCCceeEEEEecCC------CcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCce---EEEechhH--- Q lcl|NC_019511. 172 VRDTYTYDQVNFEKVFSPKN------KTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQV---VASFTSRE--- 239 (330) Q Consensus 172 v~d~L~~g~g~~~~v~~rd~------~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~---~~~~~~~d--- 239 (330) ..+++++|.+|.++..+.++ .|. ..+.+++|..+.+..|+.-......+++++..+++. ...|+.+. T Consensus 111 ~~~a~i~G~ay~~v~~~~~~~~~~~~~~~-~~i~~~~p~~~~~i~D~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~ 189 (485) T protein:vir:24 111 YTDAYVHGRSYITISRPDPQIDLGWDPNV-PLIRVEPPTRMYAEIDPRIGRPAKAIRVAYDAEGNEIQAATLYTPNETFG 189 (485) T ss_pred HHHHhhcCceEEEEecCCcccccccCCCc-ceEEEeccceeEEEeeCCcCceeEEEEEEEeecCCeEEEEEEEcCCcEEE Confidence 89999999988776554432 122 257788998887776643221111122222221111 11122222 Q ss_pred ----------------------eeeecccCcCCCCCCCccccHHH----HHHHHHHHHHHHHHHHHHHHhc------CCC Q lcl|NC_019511. 240 ----------------------LVMGIRNPRSDLNSSGYGLSEVE----IAMKEFIAYNNTESFNDRFFSH------GGT 287 (330) Q Consensus 240 ----------------------vih~~~n~~~d~~~~~yGlSPIe----~a~~~I~~~laae~~~~~fF~n------Ga~ 287 (330) |++++.|+.. .++||.|.|+ ....++...+.--.-...||+. |+. T Consensus 190 ~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~---~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~ 266 (485) T protein:vir:24 190 WFRAEGEWVEWFSDPHGLGAVPVVPLPNRTRL---SDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIK 266 (485) T ss_pred EEecCCceEeecccccCCCcccEEEeccCccc---CCcCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhccCC Confidence 2333333322 2347887654 2334444443333333344432 211 Q ss_pred ------------------cceEEEeCCC-C---CCCH----HHHHHHHHHHHHHhcCccc-------------cccccee Q lcl|NC_019511. 288 ------------------TRGILQIRAD-Q---QQSQ----HALENFKREWKSSFSGING-------------SWQICLY 328 (330) Q Consensus 288 ------------------p~GiL~~~~~-~---~ls~----e~~e~lr~~w~~~~~G~~n-------------a~kvpvL 328 (330) ++.++..+++ . .++. .-+++|+..... +++..+ +-.+++- T Consensus 267 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~q~~~~~~e~~~~~l~~~i~~-~s~~~~~p~~~fg~~~~n~~Sg~Al~ 345 (485) T protein:vir:24 267 PEEIGVDPETGQTLFDAYLARILAFEDAEGKIQQFSAAELANFTNALDQIAKQ-VAAYTGLPPQYLSTAADNPASAEAIR 345 (485) T ss_pred ccccccccccccchhhhcccceeccCCCCceEEeecccchHHHHHHHHHHHHH-HhcccCCCHHHhccccCcchHHHHHH Confidence 1112211111 0 0111 112333333322 111100 0001111 Q ss_pred eC Q lcl|NC_019511. 329 IK 330 (330) Q Consensus 329 ~e 330 (330) .. T Consensus 346 ~~ 347 (485) T protein:vir:24 346 AA 347 (485) T ss_pred HH Confidence 11 No 192 >protein:vir:98506 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996583;genbank:gi:45569514;genbank:GeneID:2767834 Probab=36.67 E-value=1.2 Score=20.23 Aligned_cols=270 Identities=8% Similarity=0.041 Sum_probs=100.3 Q ss_pred ccCcchhHHHHHHHHHHHHhhcccchhccc---cchhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHH Q lcl|NC_019511. 24 PIDDGIQANIRQIEQDTKEMQEITKSLYGK---QQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAI 100 (330) Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~---~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~ 100 (330) ++.... .+.+.+.+-++...-.+-..+ =.+|..|........... ....++ .+.= .+....| T Consensus 1 M~~~~~---~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~---~~~~~~--------~~~~-dst~~~a 65 (555) T protein:vir:98 1 MAEQTE---RKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRN---RGEKRH--------NNIL-DNTGTRA 65 (555) T ss_pred CCCccc---HHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCC---cchhcc--------cccc-cccHHHH Confidence 111111 112222222221111111111 113444432211100000 000000 0000 3445567 Q ss_pred HHHHHHhHhhhhhhheecccccc-eeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcC Q lcl|NC_019511. 101 IITRANQVSTYCKPARYSEKGVG-FEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYD 179 (330) Q Consensus 101 I~~~~d~Ia~~~~~~~~~~~~~g-~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g 179 (330) +++++..+..-..|. +.- |.+.+.|++.....+..+....+++.+...+. ..+|+.-+-.++.|++++| T Consensus 66 ~~~LAa~L~~~ltpp-----~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~-----~snf~~~~~~~~~~Lv~~G 135 (555) T protein:vir:98 66 LRVLAAGMMAGMTSP-----ARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFA-----KSNTYRALHSMYEELGAFG 135 (555) T ss_pred HHHHHHHHHHhhcCC-----CCcccccccCcccccchHHHHHHHHHHHHHHHHHHH-----hcCcHHHHHHHHHHHHhhC Confidence 777777775422221 111 23333333322222333334445555543332 2466666666778999999 Q ss_pred CceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccC--------------------------------CceeEE--- Q lcl|NC_019511. 180 QVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIK--------------------------------GGNRFV--- 224 (330) Q Consensus 180 ~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~--------------------------------~~~~Y~--- 224 (330) ++..|.. .+. +..+.+.++...+..+..|..|++.+ +.-.++ T Consensus 136 ~a~l~~~--~d~-~~~~rf~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~ 212 (555) T protein:vir:98 136 TASSIVL--PDF-DAVVYHHSLTAGEYAIAADNQGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVI 212 (555) T ss_pred ceEEEEe--cCC-CceEEEEEeecceeEEeeCCCCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEE Confidence 9988864 333 23556666666666666666664311 000000 Q ss_pred EEe---CCce----------E--EEe--chh-H-ee-----------eecccCcCCCCCCCccccHHHHHHHHHHHHHHH Q lcl|NC_019511. 225 QVI---DKQV----------V--ASF--TSR-E-LV-----------MGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNT 274 (330) Q Consensus 225 q~~---~~~~----------~--~~~--~~~-d-vi-----------h~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laa 274 (330) ..+ .+.. . ..+ ..+ + |+ ..+.+..+ ...||.||++-|+-.+...... T Consensus 213 ~~V~pr~~~~~~~~~~~~~p~~s~~~~~~~d~~~vl~esgy~e~P~i~~Rw~~~~---ge~YGrgp~~~~lgD~k~L~~l 289 (555) T protein:vir:98 213 HAIEPRADRDPSKRDDRNMAWKSVYFEPGADETRTLRESGYRSFRALCPRWALVG---GDIYGNSPAMEALGDVRQLQHE 289 (555) T ss_pred EEEeeccCcCcCCCCccccceEEEEEEeccCCccccccCCcccCCceeeeeeecC---CCccccchHHHHHHHHHHHHHH Confidence 000 0000 0 011 001 0 11 11112212 2458999999888777766555 Q ss_pred HHHHHHHHhcCCCcceEEEeCCCCCC----------------------------------CHHHHHHHHHHHHHHhcCcc Q lcl|NC_019511. 275 ESFNDRFFSHGGTTRGILQIRADQQQ----------------------------------SQHALENFKREWKSSFSGIN 320 (330) Q Consensus 275 e~~~~~fF~nGa~p~GiL~~~~~~~l----------------------------------s~e~~e~lr~~w~~~~~G~~ 320 (330) .+-....-.-.+.|-.. ++.+... ..+.++.++...+..|-. T Consensus 290 ~~~~l~~~~~~~~pp~~--v~~~~~~~~~~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~-- 365 (555) T protein:vir:98 290 QLRKAQAIDYKSNPPLQ--LPVSAKNQDISTVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYA-- 365 (555) T ss_pred HHHHHHHHHHHhcCcee--eccccccccceeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhc-- Confidence 54433322222222211 1111100 111123333333332221 Q ss_pred cccccceee-C Q lcl|NC_019511. 321 GSWQICLYI-K 330 (330) Q Consensus 321 na~kvpvL~-e 330 (330) +-.. +|. . T Consensus 366 dlf~--~l~~~ 374 (555) T protein:vir:98 366 DLFL--MLANG 374 (555) T ss_pred chhh--hccCC Confidence 0000 010 0 No 193 >protein:vir:107404 Length: 555 # NCBI annotation: Bbp21 # Family: family:all:481 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958690;genbank:gi:41179382;genbank:GeneID:2717198 Probab=36.67 E-value=1.2 Score=20.23 Aligned_cols=270 Identities=8% Similarity=0.041 Sum_probs=100.3 Q ss_pred ccCcchhHHHHHHHHHHHHhhcccchhccc---cchhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHH Q lcl|NC_019511. 24 PIDDGIQANIRQIEQDTKEMQEITKSLYGK---QQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAI 100 (330) Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~---~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~ 100 (330) ++.... .+.+.+.+-++...-.+-..+ =.+|..|........... ....++ .+.= .+....| T Consensus 1 M~~~~~---~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~---~~~~~~--------~~~~-dst~~~a 65 (555) T protein:vir:10 1 MAEQTE---RKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRN---RGEKRH--------NNIL-DNTGTRA 65 (555) T ss_pred CCCccc---HHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCC---cchhcc--------cccc-cccHHHH Confidence 111111 112222222221111111111 113444432211100000 000000 0000 3445567 Q ss_pred HHHHHHhHhhhhhhheecccccc-eeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcC Q lcl|NC_019511. 101 IITRANQVSTYCKPARYSEKGVG-FEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYD 179 (330) Q Consensus 101 I~~~~d~Ia~~~~~~~~~~~~~g-~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g 179 (330) +++++..+..-..|. +.- |.+.+.|++.....+..+....+++.+...+. ..+|+.-+-.++.|++++| T Consensus 66 ~~~LAa~L~~~ltpp-----~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~-----~snf~~~~~~~~~~Lv~~G 135 (555) T protein:vir:10 66 LRVLAAGMMAGMTSP-----ARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFA-----KSNTYRALHSMYEELGAFG 135 (555) T ss_pred HHHHHHHHHHhhcCC-----CCcccccccCcccccchHHHHHHHHHHHHHHHHHHH-----hcCcHHHHHHHHHHHHhhC Confidence 777777775422221 111 23333333322222333334445555543332 2466666666778999999 Q ss_pred CceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccC--------------------------------CceeEE--- Q lcl|NC_019511. 180 QVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIK--------------------------------GGNRFV--- 224 (330) Q Consensus 180 ~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~--------------------------------~~~~Y~--- 224 (330) ++..|.. .+. +..+.+.++...+..+..|..|++.+ +.-.++ T Consensus 136 ~a~l~~~--~d~-~~~~rf~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~ 212 (555) T protein:vir:10 136 TASSIVL--PDF-DAVVYHHSLTAGEYAIAADNQGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVI 212 (555) T ss_pred ceEEEEe--cCC-CceEEEEEeecceeEEeeCCCCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEE Confidence 9988864 333 23556666666666666666664311 000000 Q ss_pred EEe---CCce----------E--EEe--chh-H-ee-----------eecccCcCCCCCCCccccHHHHHHHHHHHHHHH Q lcl|NC_019511. 225 QVI---DKQV----------V--ASF--TSR-E-LV-----------MGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNT 274 (330) Q Consensus 225 q~~---~~~~----------~--~~~--~~~-d-vi-----------h~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laa 274 (330) ..+ .+.. . ..+ ..+ + |+ ..+.+..+ ...||.||++-|+-.+...... T Consensus 213 ~~V~pr~~~~~~~~~~~~~p~~s~~~~~~~d~~~vl~esgy~e~P~i~~Rw~~~~---ge~YGrgp~~~~lgD~k~L~~l 289 (555) T protein:vir:10 213 HAIEPRADRDPSKRDDRNMAWKSVYFEPGADETRTLRESGYRSFRALCPRWALVG---GDIYGNSPAMEALGDVRQLQHE 289 (555) T ss_pred EEEeeccCcCcCCCCccccceEEEEEEeccCCccccccCCcccCCceeeeeeecC---CCccccchHHHHHHHHHHHHHH Confidence 000 0000 0 011 001 0 11 11112212 2458999999888777766555 Q ss_pred HHHHHHHHhcCCCcceEEEeCCCCCC----------------------------------CHHHHHHHHHHHHHHhcCcc Q lcl|NC_019511. 275 ESFNDRFFSHGGTTRGILQIRADQQQ----------------------------------SQHALENFKREWKSSFSGIN 320 (330) Q Consensus 275 e~~~~~fF~nGa~p~GiL~~~~~~~l----------------------------------s~e~~e~lr~~w~~~~~G~~ 320 (330) .+-....-.-.+.|-.. ++.+... ..+.++.++...+..|-. T Consensus 290 ~~~~l~~~~~~~~pp~~--v~~~~~~~~~~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~-- 365 (555) T protein:vir:10 290 QLRKAQAIDYKSNPPLQ--LPVSAKNQDISTVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYA-- 365 (555) T ss_pred HHHHHHHHHHHhcCcee--eccccccccceeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhc-- Confidence 54433322222222211 1111100 111123333333332221 Q ss_pred cccccceee-C Q lcl|NC_019511. 321 GSWQICLYI-K 330 (330) Q Consensus 321 na~kvpvL~-e 330 (330) +-.. +|. . T Consensus 366 dlf~--~l~~~ 374 (555) T protein:vir:10 366 DLFL--MLANG 374 (555) T ss_pred chhh--hccCC Confidence 0000 010 0 No 194 >protein:vir:107822 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996631;genbank:gi:45580765;genbank:GeneID:2767898 Probab=36.67 E-value=1.2 Score=20.23 Aligned_cols=270 Identities=8% Similarity=0.041 Sum_probs=100.3 Q ss_pred ccCcchhHHHHHHHHHHHHhhcccchhccc---cchhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcHHHHHH Q lcl|NC_019511. 24 PIDDGIQANIRQIEQDTKEMQEITKSLYGK---QQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAI 100 (330) Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~---~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~ 100 (330) ++.... .+.+.+.+-++...-.+-..+ =.+|..|........... ....++ .+.= .+....| T Consensus 1 M~~~~~---~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~---~~~~~~--------~~~~-dst~~~a 65 (555) T protein:vir:10 1 MAEQTE---RKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRN---RGEKRH--------NNIL-DNTGTRA 65 (555) T ss_pred CCCccc---HHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCC---cchhcc--------cccc-cccHHHH Confidence 111111 112222222221111111111 113444432211100000 000000 0000 3445567 Q ss_pred HHHHHHhHhhhhhhheecccccc-eeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcC Q lcl|NC_019511. 101 IITRANQVSTYCKPARYSEKGVG-FEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYD 179 (330) Q Consensus 101 I~~~~d~Ia~~~~~~~~~~~~~g-~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g 179 (330) +++++..+..-..|. +.- |.+.+.|++.....+..+....+++.+...+. ..+|+.-+-.++.|++++| T Consensus 66 ~~~LAa~L~~~ltpp-----~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~-----~snf~~~~~~~~~~Lv~~G 135 (555) T protein:vir:10 66 LRVLAAGMMAGMTSP-----ARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFA-----KSNTYRALHSMYEELGAFG 135 (555) T ss_pred HHHHHHHHHHhhcCC-----CCcccccccCcccccchHHHHHHHHHHHHHHHHHHH-----hcCcHHHHHHHHHHHHhhC Confidence 777777775422221 111 23333333322222333334445555543332 2466666666778999999 Q ss_pred CceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccC--------------------------------CceeEE--- Q lcl|NC_019511. 180 QVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIK--------------------------------GGNRFV--- 224 (330) Q Consensus 180 ~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~--------------------------------~~~~Y~--- 224 (330) ++..|.. .+. +..+.+.++...+..+..|..|++.+ +.-.++ T Consensus 136 ~a~l~~~--~d~-~~~~rf~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~ 212 (555) T protein:vir:10 136 TASSIVL--PDF-DAVVYHHSLTAGEYAIAADNQGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVI 212 (555) T ss_pred ceEEEEe--cCC-CceEEEEEeecceeEEeeCCCCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEE Confidence 9988864 333 23556666666666666666664311 000000 Q ss_pred EEe---CCce----------E--EEe--chh-H-ee-----------eecccCcCCCCCCCccccHHHHHHHHHHHHHHH Q lcl|NC_019511. 225 QVI---DKQV----------V--ASF--TSR-E-LV-----------MGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNT 274 (330) Q Consensus 225 q~~---~~~~----------~--~~~--~~~-d-vi-----------h~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laa 274 (330) ..+ .+.. . ..+ ..+ + |+ ..+.+..+ ...||.||++-|+-.+...... T Consensus 213 ~~V~pr~~~~~~~~~~~~~p~~s~~~~~~~d~~~vl~esgy~e~P~i~~Rw~~~~---ge~YGrgp~~~~lgD~k~L~~l 289 (555) T protein:vir:10 213 HAIEPRADRDPSKRDDRNMAWKSVYFEPGADETRTLRESGYRSFRALCPRWALVG---GDIYGNSPAMEALGDVRQLQHE 289 (555) T ss_pred EEEeeccCcCcCCCCccccceEEEEEEeccCCccccccCCcccCCceeeeeeecC---CCccccchHHHHHHHHHHHHHH Confidence 000 0000 0 011 001 0 11 11112212 2458999999888777766555 Q ss_pred HHHHHHHHhcCCCcceEEEeCCCCCC----------------------------------CHHHHHHHHHHHHHHhcCcc Q lcl|NC_019511. 275 ESFNDRFFSHGGTTRGILQIRADQQQ----------------------------------SQHALENFKREWKSSFSGIN 320 (330) Q Consensus 275 e~~~~~fF~nGa~p~GiL~~~~~~~l----------------------------------s~e~~e~lr~~w~~~~~G~~ 320 (330) .+-....-.-.+.|-.. ++.+... ..+.++.++...+..|-. T Consensus 290 ~~~~l~~~~~~~~pp~~--v~~~~~~~~~~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~-- 365 (555) T protein:vir:10 290 QLRKAQAIDYKSNPPLQ--LPVSAKNQDISTVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYA-- 365 (555) T ss_pred HHHHHHHHHHHhcCcee--eccccccccceeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhc-- Confidence 54433322222222211 1111100 111123333333332221 Q ss_pred cccccceee-C Q lcl|NC_019511. 321 GSWQICLYI-K 330 (330) Q Consensus 321 na~kvpvL~-e 330 (330) +-.. +|. . T Consensus 366 dlf~--~l~~~ 374 (555) T protein:vir:10 366 DLFL--MLANG 374 (555) T ss_pred chhh--hccCC Confidence 0000 010 0 No 195 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=35.48 E-value=1.2 Score=20.09 Aligned_cols=246 Identities=11% Similarity=0.031 Sum_probs=86.9 Q ss_pred ccccCccCcc-------hhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHHH Q lcl|NC_019511. 19 EDLMVPIDDG-------IQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKF 91 (330) Q Consensus 19 ~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~ 91 (330) ..+.+|.-++ +..-++++..+.-++.+-..=-.|+...- -.| ...++ .++ .++ T Consensus 1 ~~~~~~~~~e~~~~~~~~~~l~~~~~~~~~r~~~l~~YY~G~~~i~---------~~~-~~~~~-~~~-------~~~-- 60 (486) T protein:vir:42 1 MTAPLPGMEEIEDPAVVREEMISAFEDASKDLASNTSYYDAERRPE---------AIG-VTVPR-EMQ-------QLL-- 60 (486) T ss_pred CCCCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcch---------hcc-cccch-hHh-------hhh-- Confidence 2223333221 22233344443333322211112221100 000 01111 111 111 Q ss_pred hhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHHHH Q lcl|NC_019511. 92 GNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCKKI 171 (330) Q Consensus 92 a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~ 171 (330) +.+.....|+++.++.+- ..||.+- +.+..+ +. +.+++.. | ++......+ T Consensus 61 ~v~n~~~~iVd~~~~~l~-----------~~g~~~~----~~~~~~---~~---~~~i~~~------N---~~d~~~~~~ 110 (486) T protein:vir:42 61 AHVGYPRLYVDSVAERQA-----------VEGFRLG----DADEAD---EE---LWQWWQA------N---NLDIEAPLG 110 (486) T ss_pred hccchHHHHHHHHHhhhc-----------ccceecC----CCchhH---HH---HHHHHHh------c---ChhHHHHHH Confidence 113345566666555442 2355432 111111 11 2222221 2 355566778 Q ss_pred HHHHHhcCCceeEEEEecCC------CcceEEEEeeCCCceEEeeCCCCcccCCceeEEEEeCCceE---EEechhHee- Q lcl|NC_019511. 172 VRDTYTYDQVNFEKVFSPKN------KTKMEKFIAVDPSTIFYATDKNGKIIKGGNRFVQVIDKQVV---ASFTSRELV- 241 (330) Q Consensus 172 v~d~L~~g~g~~~~v~~rd~------~G~~~~L~pldp~tV~~~~d~~G~~~~~~~~Y~q~~~~~~~---~~~~~~dvi- 241 (330) ..+++++|.+|..+.....+ .+. ..+.+++|..+.+..|+.-....-.++|++..+++.+ ..|+.+.+. T Consensus 111 ~~~a~~~G~ay~~v~~~e~~~~~~~~~~~-~~i~~~~p~~~~~i~d~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~ 189 (486) T protein:vir:42 111 YTDAYVHGRSFITISKPDPQLDLGWDQNV-PIIRVEPPTRMHAEIDPRINRVSKAIRVAYDKEGNEIQAATLYTPMETIG 189 (486) T ss_pred HHHHhhcCceEEEEecCCcccccccCCCe-eEEEEecccceEEEEeCCCCCeEEEEEEEEecCCCeEEEEEEEcCCcEEE Confidence 89999999887766543321 222 3566788988887766432222222333332222211 112222222 Q ss_pred ------------------------eecccCcCCCCCCCccccHHH----HHHHHHHHHHHHHHHHHHHHhc------CCC Q lcl|NC_019511. 242 ------------------------MGIRNPRSDLNSSGYGLSEVE----IAMKEFIAYNNTESFNDRFFSH------GGT 287 (330) Q Consensus 242 ------------------------h~~~n~~~d~~~~~yGlSPIe----~a~~~I~~~laae~~~~~fF~n------Ga~ 287 (330) +++.|+.. .+.+|.|-|+ ....++...+.--.-...||+. |+. T Consensus 190 ~~~~~~~~~~~~~~~h~~g~vPvv~~~n~~~~---~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~e~~a~p~~~i~G~~ 266 (486) T protein:vir:42 190 WFRADGEWAEWFNVPHGLGVVPVVPLPNRTRL---SDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIK 266 (486) T ss_pred EEecCCcEEeecceecCCCCceEEEecccccc---CCCCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchHHHhhcCC Confidence 22222211 1346887554 3344555444333333344432 322 Q ss_pred cceEEEeCCCCCCCHHHHHHHHHHHHHHhcC---cccccccceeeC Q lcl|NC_019511. 288 TRGILQIRADQQQSQHALENFKREWKSSFSG---INGSWQICLYIK 330 (330) Q Consensus 288 p~GiL~~~~~~~ls~e~~e~lr~~w~~~~~G---~~na~kvpvL~e 330 (330) +..+-..++ +-..-|+..... ..+. . +=+.| T Consensus 267 ~~~~~~~~~----------~~~~~~~~~~~~~~~~~~~-~-~~~~q 300 (486) T protein:vir:42 267 PEEIGVDSE----------TGQTLFDAYLARILAFEDA-E-GKIQQ 300 (486) T ss_pred ccccccccc----------cccchhhhhhchhcccCCC-C-ceEEe Confidence 221110000 000112211100 0000 0 00111 No 196 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=34.62 E-value=1.3 Score=19.99 Aligned_cols=259 Identities=12% Similarity=0.102 Sum_probs=93.0 Q ss_pred CCCCCCcccccCccCcch-----hHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHH Q lcl|NC_019511. 12 SMYKEDTEDLMVPIDDGI-----QANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHE 86 (330) Q Consensus 12 ~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~ 86 (330) -.| +-+.-.+++.+.++ +-.+++-..+.-++.+..+=-.|+.... . ++. +.+. ..+ T Consensus 1 ~~~-~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~~~~Yy~g~~~i~-----~----~~~-~~~~--~~~------ 61 (452) T protein:vir:36 1 MKY-KPPKLMTFSKDEPITVEVVTKFMEKHKLEVARYEYLKNMYLGIMAID-----D----EPA-KDSW--KPD------ 61 (452) T ss_pred Ccc-cCceeEEcCCccCCCHHHHHHHHHHHHHHHHHHHHHHHHhccccccc-----c----Ccc-cccc--Ccc------ Confidence 111 11111223323222 2222222222222222222222322111 1 111 0000 000 Q ss_pred HHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHH Q lcl|NC_019511. 87 VLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQE 166 (330) Q Consensus 87 ~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~ 166 (330) .| .. ++....|+++.+.-+ |.. +..+...|. . ..+.+.+++.. ..+.. T Consensus 62 -~k-i~-~n~~~~ivd~~~~~l--~g~---------~~~~~~~d~------~---~~~~l~~~~~~---------n~~~~ 109 (452) T protein:vir:36 62 -NR-LA-VNFTKYIVDTFTGYF--NGI---------PVKKSHSDK------E---ILTKLQEFDNL---------NDMED 109 (452) T ss_pred -ce-ee-cchHHHHHHHHhhhh--ccc---------CceeecCCh------h---HHHHHHHHHhh---------cChhH Confidence 01 11 234455554444332 112 222222211 1 12334444321 14666 Q ss_pred HHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCC-cccCCceeEEEEeCCc-eEEEechhHeeeec Q lcl|NC_019511. 167 FCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNG-KIIKGGNRFVQVIDKQ-VVASFTSRELVMGI 244 (330) Q Consensus 167 fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G-~~~~~~~~Y~q~~~~~-~~~~~~~~dvih~~ 244 (330) .+..+..+.+.+|.++..+..+.+ |++ .+..++|..+.++.++.. ....-.++|+...++. .+..|+.+.+.+.. T Consensus 110 ~~~~~~~~~~~~G~~~~~v~~d~~--g~~-~i~~~~p~~~~~v~d~~~~~~~~~~i~~~~~~~~~~~~~vyt~~~i~~~~ 186 (452) T protein:vir:36 110 EESELAKMACIYGRAFEFLYQDED--TQT-NVVYNSPENMFMVYDDTVKQEPLFAVRYGVDEDKKLQGEVYTLLETIKIS 186 (452) T ss_pred HHHHHHHHHHhcCeEEEEEEecCC--Cee-EEEEEcccceEEEEcCCCCCceEEEEEEEEecCceEEEEEEecCeEEEEE Confidence 677788999999988776655443 554 577789999888776542 1111112332222221 11223443333221 Q ss_pred c------------c-----CcCCCCCCCccccHHH---HHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeCCCCCCCHHH Q lcl|NC_019511. 245 R------------N-----PRSDLNSSGYGLSEVE---IAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRADQQQSQHA 304 (330) Q Consensus 245 ~------------n-----~~~d~~~~~yGlSPIe---~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~~~~~ls~e~ 304 (330) . + |.-.+....+|.|-++ ....++...++.-.-...+|+ .|- +.+.| ..++++. T Consensus 187 ~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~---~p~--~~~~g-~~~~~~~ 260 (452) T protein:vir:36 187 GENDEISFGEGTYNPYPDLPVVEFYFNEERMSIFESVISLVNAFNKAISEKANDVDYFS---DQY--LTFLG-AAVEEED 260 (452) T ss_pred EcCCceEEecceeccCCcccEEEecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhc---Cce--eEeec-CCcCchh Confidence 1 1 1111111234655554 444444444444333334444 343 33334 2345544 Q ss_pred HHHHHHH--HHHHhcCcccccccceee-C Q lcl|NC_019511. 305 LENFKRE--WKSSFSGINGSWQICLYI-K 330 (330) Q Consensus 305 ~e~lr~~--w~~~~~G~~na~kvpvL~-e 330 (330) ...++.. |.-.-.|.+....+-.|. + T Consensus 261 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ 289 (452) T protein:vir:36 261 LKNIRSNRVINYYADGEGKNVDVKFLEKP 289 (452) T ss_pred hhhhhhcceEEecCCCCccCCcceeEeec Confidence 4443221 111000100000111111 1 No 197 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=34.21 E-value=1.3 Score=19.95 Aligned_cols=276 Identities=12% Similarity=0.087 Sum_probs=94.2 Q ss_pred CchhHHHH-------HhcCCCCCCcccccCccCcchhHHHHHHHHHH----HHhhcccchhccccchhccccccccccCC Q lcl|NC_019511. 1 MPDLFKSL-------RLGSMYKEDTEDLMVPIDDGIQANIRQIEQDT----KEMQEITKSLYGKQQAYAEPFLEMMDTNP 69 (330) Q Consensus 1 ~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~g~~~~~~~~~~~~~~~~p 69 (330) ..+....+ +.+..+..+..+ ....++...-.+.++++. -++++-..=-.|+.... . .++ T Consensus 9 ~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~i~~~i~~h~~~~~~rl~~l~~yY~g~~~~i----~----~~~ 78 (502) T protein:vir:48 9 DSTGQDLVLNLRFHRESRIRYRADNLE--ELMVNNWELLKNFINHHKLRQAPRIQELLDYARGENHDV----L----KSG 78 (502) T ss_pred ecchhHHHhhcccChhHHhhhcccchh--hhccccHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccc----c----ccc Confidence 11111111 111112111111 111112222122233221 11111111112221100 0 000 Q ss_pred CCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHH Q lcl|NC_019511. 70 DYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEF 149 (330) Q Consensus 70 ~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~ 149 (330) . .+.....+. | . .+.....|+++.+.-+. .. +..+...+. +..+ .....+.++ T Consensus 79 ~--~~~~~~~~~-------k-i-~~n~~k~Ivd~~~~yl~--g~---------p~~~~~~d~--~~~~---~~~~~l~~~ 131 (502) T protein:vir:48 79 R--RKDNEMADK-------R-A-VHNYGRMISKFKTGYLA--GN---------PIRVEYDDN--EDNS---QNDDAIKRI 131 (502) T ss_pred c--ccccccccc-------e-e-ecchHHHHHHHHhhhhc--cc---------CeeEecCCc--cchh---HHHHHHHHH Confidence 0 000000010 0 0 02344455544443332 12 222222211 1111 112233443 Q ss_pred HHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCC-cccCCceeEEEEeC Q lcl|NC_019511. 150 ILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNG-KIIKGGNRFVQVID 228 (330) Q Consensus 150 l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G-~~~~~~~~Y~q~~~ 228 (330) +.. .++...+..+..+++.+|.++..+..+. .|+ +.+..++|..+.++.++.. ....-.++|+.... T Consensus 132 ~~~---------N~~~~~~~~~~~~~~~~G~a~~~v~~de--dg~-~~i~~~~p~~~~~vydd~~~~~~~~~ir~~~~~~ 199 (502) T protein:vir:48 132 GRI---------NDIDTHNRNLIRDLSQTGRAYEVIYRSE--YDE-TRIKRLSPLETFVIYDNSLEDNSIAAVRYYNRGT 199 (502) T ss_pred Hhh---------cCHhHHHHHHHHHHhhcCeEEEEEEeCC--CCc-eEEEEEcccceEEEEcCCCCCceEEEEEEEEEee Confidence 321 2567778888999999998877765544 354 3567789998888766431 11222334433221 Q ss_pred --Cc--eEEEechhHeeeecc-----------c-----CcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCc Q lcl|NC_019511. 229 --KQ--VVASFTSRELVMGIR-----------N-----PRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTT 288 (330) Q Consensus 229 --~~--~~~~~~~~dvih~~~-----------n-----~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p 288 (330) ++ .+..++.+.+.++.. + |+--+..+++|+|.++-+...|...-.+..-.+..+...+.| T Consensus 200 ~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~ 279 (502) T protein:vir:48 200 LQNAKDVVEIYTNQHIYTLDASDSFNEISVTPHAFGTVPITEFLNNADGIGDYETELYLIDLYDSAESDTANHMSDMADA 279 (502) T ss_pred cCCcEEEEEEEeCCeEEEEEeCCceeeccceecCCCccceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCc Confidence 11 122344444433321 1 111111234577777655555544333322223333434444 Q ss_pred ceEEEeCCCCCC-CHHHHHHHHHHH--------------------------------------HHHhcCcccccccc-ee Q lcl|NC_019511. 289 RGILQIRADQQQ-SQHALENFKREW--------------------------------------KSSFSGINGSWQIC-LY 328 (330) Q Consensus 289 ~GiL~~~~~~~l-s~e~~e~lr~~w--------------------------------------~~~~~G~~na~kvp-vL 328 (330) --++ .|.... .++....++..+ .+..... + .+| +- T Consensus 280 ~lv~--~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~L~~~I~~~--s-~~p~~~ 354 (502) T protein:vir:48 280 ILAI--YGDLALPQGMQASDMKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNKDIHVF--T-NTPDMS 354 (502) T ss_pred eeee--ecCcccccccchhhhhhcceeeccccccccccccCcceeEeeecCCHHHHHHHHHHHHHHHHHH--h-CCCCcC Confidence 3222 222111 122222222221 1111110 0 011 11 Q ss_pred eC Q lcl|NC_019511. 329 IK 330 (330) Q Consensus 329 ~e 330 (330) .+ T Consensus 355 ~~ 356 (502) T protein:vir:48 355 DN 356 (502) T ss_pred cc Confidence 11 No 198 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=31.65 E-value=1.5 Score=19.65 Aligned_cols=272 Identities=12% Similarity=0.083 Sum_probs=89.5 Q ss_pred CchhHHHHHhcCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccc Q lcl|NC_019511. 1 MPDLFKSLRLGSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRN 80 (330) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~ 80 (330) |. . ...-.-.+.+-+..+|.++....-+..+.++++..-....+..-+-+.|- .. ....+.-+ .+ T Consensus 1 ~~---~--~~~~~~~~~~~~~~~p~~~~~~~~~~~l~~~l~~~~~~~~~rl~~l~~YY----~G---~~~~~~~~---~~ 65 (501) T protein:vir:25 1 MT---V--PVDVIADAPAADVEFPEDSMSREQLGALVADMWRLHISERQWLDRIYEYT----KG---LRGRPEVP---EG 65 (501) T ss_pred Cc---c--cchhhhccCcccccCCcccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHH----hc---CCCchhcc---cc Confidence 10 0 00011112233344555543333334444444433221222222222231 11 00000000 01 Q ss_pred hHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCC Q lcl|NC_019511. 81 AHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDID 160 (330) Q Consensus 81 ~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~ 160 (330) ....++.+..-+.+.....|+++.++.+- ..||.+. |.+ . .+.+. +++. T Consensus 66 ~~~~~~~~~~~~v~n~~~~ivd~~a~~l~-----------~~gf~~~--d~~--~----~~~l~---~i~~--------- 114 (501) T protein:vir:25 66 ASDEVKELAKLSVKNVLSLVRDSFAQNLS-----------VVGYRNA--LAK--E----NDPAW---EMWQ--------- 114 (501) T ss_pred CChhhhhhHhhhhcChHHHHHHHHHhhhc-----------ccceecC--Ccc--c----hHHHH---HHHH--------- Confidence 11111222111223345566666655431 2355432 211 1 11122 2221 Q ss_pred cCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEee-CCC-CcccCCceeEEEEeCC--c--eEEE Q lcl|NC_019511. 161 RDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYAT-DKN-GKIIKGGNRFVQVIDK--Q--VVAS 234 (330) Q Consensus 161 ~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~-d~~-G~~~~~~~~Y~q~~~~--~--~~~~ 234 (330) ..++.+....+..+++++|.+|..+.. +..|.. +..++|..+.++. |.. .....-.++|+....+ . .... T Consensus 115 ~N~~d~~~~~~~~~a~i~G~ay~~v~~--de~~~~--i~~~sp~~~~~iy~D~~~~~~~~~ai~~~~~~~~~~~~~~~~~ 190 (501) T protein:vir:25 115 RNRMDARQAEVHRPALTYGASYVTVTP--TDEGPV--FRTRSPRQILAVYADPSVDAWPQYALETWVAQKDAKPHRRGVL 190 (501) T ss_pred hcChhHHHHHHHHHHhhcCceEEEEec--CCCCCe--EEEeccccEEEEEecCCCCcceeEEEEEEeeccccCcceeEEE Confidence 123555666788999999998766543 344543 4457888777553 322 1111112222221110 0 0001 Q ss_pred echhH----------------------------------------------eeeecccCcCCCCCCCccccHHH---HHH Q lcl|NC_019511. 235 FTSRE----------------------------------------------LVMGIRNPRSDLNSSGYGLSEVE---IAM 265 (330) Q Consensus 235 ~~~~d----------------------------------------------vih~~~n~~~d~~~~~yGlSPIe---~a~ 265 (330) |+... |++++.++. ..++|.|-++ ... T Consensus 191 y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPiv~f~N~~~----~~~~g~sdie~v~~l~ 266 (501) T protein:vir:25 191 YDDTYMYELDLGEVVLGDAGGGQATQQPVNVREVTDVIEHGATFEGKPVCPVVRFVNGRD----ADDMIVGEVAPLILLQ 266 (501) T ss_pred ecCeeEEEEecCceeeeeccccccccccccccccccccccccccCCccceeeEeccCccc----cCccccchhhhhHHHH Confidence 11111 111111111 1346777554 344 Q ss_pred HHHHHHHHHHHHHHHHHhc------CC----------CcceEEEeCCCC----CCCHHH----HHHHHHHHHHHhcCccc Q lcl|NC_019511. 266 KEFIAYNNTESFNDRFFSH------GG----------TTRGILQIRADQ----QQSQHA----LENFKREWKSSFSGING 321 (330) Q Consensus 266 ~~I~~~laae~~~~~fF~n------Ga----------~p~GiL~~~~~~----~ls~e~----~e~lr~~w~~~~~G~~n 321 (330) .++...+.--.....||++ |. .++.++..+++. .+++.. .+.++..... +++ T Consensus 267 Da~~~~~s~~~~~~e~~a~p~~~i~G~~~~~~~~~~~~~~~i~~~~~~~~~~~q~~~~~~~~~~~~l~~~i~~-i~~--- 342 (501) T protein:vir:25 267 QAINSVNFDRLIVSRFGANPQRVISGWTGSKAEVLKASALRVWTFEDPEVKAQAFPPASVEPYNLILEEMLQH-VAM--- 342 (501) T ss_pred HHHHHHHHHHHHHHHhhccHHHHHhCCCCCccchhhhcccceeccCCCCceEEEecccChHHHHHHHHHHHHH-HHh--- Confidence 5555544444445555554 11 011111111100 000000 0111111110 000 Q ss_pred ccccce-eeC Q lcl|NC_019511. 322 SWQICL-YIK 330 (330) Q Consensus 322 a~kvpv-L~e 330 (330) .-.+|. -+. T Consensus 343 ~s~~P~~~~~ 352 (501) T protein:vir:25 343 VAQISPAQVT 352 (501) T ss_pred hcCCChhhhc Confidence 001110 010 No 199 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=29.92 E-value=1.6 Score=19.44 Aligned_cols=264 Identities=9% Similarity=0.013 Sum_probs=89.3 Q ss_pred Cch-------------hHHHHHhcCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhcccccccccc Q lcl|NC_019511. 1 MPD-------------LFKSLRLGSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDT 67 (330) Q Consensus 1 ~~~-------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~ 67 (330) |-+ +++.|+ ..... ..+.+...++.-.++..++.+...=-.|.......+... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~---~~~~~-------~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~r~~~~---- 66 (474) T protein:vir:95 1 MFNIIRMPWDKPYGEEVVEQLK---PQFET-------QEEMIIRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKV---- 66 (474) T ss_pred CcceeecCCCCchhhHHHHhhh---hccCC-------hHHHHHHHHHHHHHHHHHHHHHHHHhcccCchhcccccc---- Confidence 211 222221 10000 001222222222223222222221122322111111000 Q ss_pred CCCCCcCCCcccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHH Q lcl|NC_019511. 68 NPDYRDKKSYMRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIE 147 (330) Q Consensus 68 ~p~~~~~~s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~ 147 (330) ......... .++. | .. ++....|+++.+.-+ |..+.+ +...| .+..+.+. T Consensus 67 ~~~~~~~~~-~~~~-------k-i~-~n~~~~Ivd~~~~~l--~g~p~~---------~~~~d---------~~~~~~l~ 116 (474) T protein:vir:95 67 DVYGNIDYD-KPDW-------R-IT-TNFHQNLVDQKVSYV--ASKPVT---------YSCED---------ESVLKIIH 116 (474) T ss_pred ccccccccc-cccc-------e-ec-cchHHHHHHHHHhhh--ccCCce---------eccCc---------hHHHHHHH Confidence 000000000 0000 0 00 234445554444322 222222 21111 11223344 Q ss_pred HHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCC-cccCCceeEEEE Q lcl|NC_019511. 148 EFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNG-KIIKGGNRFVQV 226 (330) Q Consensus 148 ~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G-~~~~~~~~Y~q~ 226 (330) .++. | ++...+..+..+.+.+|.++..+.+ +..|++ .+..++|..+.++.++.- ....-.++|+.. T Consensus 117 ~~~~-------n---~~~~~~~e~~~~~~~~G~~~~~v~~--d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~i~~~~~ 183 (474) T protein:vir:95 117 DVLD-------T---RWDNKLIDILTATSNKGIDWLQVYI--NENGEM-KLFRVPAEQAIPIWVDKEREELKSFIRYYKF 183 (474) T ss_pred HHHh-------c---cHHHHHHHHHHHHhhcCcEEEEEEe--cCCCce-EEEEEcccceEEEEcCCCCCceEEEEEEEEE Confidence 4432 1 3555666778899999987766544 334554 577789998887765421 111222344333 Q ss_pred eCCceEEEechhHeeeecc----------------------cCcCCC-----CCCCccccHHHHHHHHHHH---HHHHHH Q lcl|NC_019511. 227 IDKQVVASFTSRELVMGIR----------------------NPRSDL-----NSSGYGLSEVEIAMKEFIA---YNNTES 276 (330) Q Consensus 227 ~~~~~~~~~~~~dvih~~~----------------------n~~~d~-----~~~~yGlSPIe~a~~~I~~---~laae~ 276 (330) .....+..++.+.+.+... ++.+.+ ....+|.|-++-....|.. .++--. T Consensus 184 ~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~ 263 (474) T protein:vir:95 184 NNEEKVEFWTDTTVTYYVLENGGLIPDYYYGANHIQSHFSNGNWGRVPFIAFKNNPEEVSDIWMYKSLIDAIDKRLSDAQ 263 (474) T ss_pred cCeeEEEEEeCCeEEEEEEcCCccccccccCcccccccccccCCCccceEeecCCCCCCCcHHHHHHHHHHHHHHHHHHH Confidence 3333333444444433211 111111 1123466665544444433 333222 Q ss_pred HHHHHHhcCCCcceEEEeCCCCCCC--HHHHHHHHHHHHHHhcCccccccc-ceeeC Q lcl|NC_019511. 277 FNDRFFSHGGTTRGILQIRADQQQS--QHALENFKREWKSSFSGINGSWQI-CLYIK 330 (330) Q Consensus 277 ~~~~fF~nGa~p~GiL~~~~~~~ls--~e~~e~lr~~w~~~~~G~~na~kv-pvL~e 330 (330) -...+| +.|- +.+.|. ..+ .+....++..+ .+...+++ .+ .+-.+ T Consensus 264 ~~~~~~---~~p~--lv~~g~-~~~~~~~~~~~~~~~~--~i~~~~~~-~~~~l~~~ 311 (474) T protein:vir:95 264 NMFDES---VELI--YILKGY-EGQDLEEFMRGLKYYK--AINVDGDG-GVETIQVE 311 (474) T ss_pred HHHHHh---cCce--eeeecC-Ccccchhhhhhhhccc--eeeccCCC-ceeEEeec Confidence 222333 4443 333332 111 11111111111 00000111 11 11111 No 200 >protein:vir:1538 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052106;swissprot:trembl:q9t110;genbank:gi:9634032;uniprot:Q9T110;genbank:GeneID:1262384 Probab=28.75 E-value=1.7 Score=19.29 Aligned_cols=274 Identities=11% Similarity=0.063 Sum_probs=98.2 Q ss_pred cCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccchHHHHHHHHH Q lcl|NC_019511. 11 GSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKK 90 (330) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~ 90 (330) +-.+++. ... .++..-+..++..+ -..=.+..-.=-.|..|..-. +.....-+. .-+- T Consensus 1 m~~~~~~----~~~-~~~~k~r~~~l~~~----R~~~e~~w~e~~~~~lP~~~~----~~~~~~~~~---------~~~~ 58 (535) T protein:vir:15 1 MADSKRT----GLG-EDGAKATYDRLTND----RRAYETRAENCAQYTIPSLFP----KESDNESTD---------YTTP 58 (535) T ss_pred CCccchh----ccc-hHHHHHHHHHHHHH----hhHHHHHHHHHHHHhcccccC----CCCCccccc---------cccc Confidence 1111110 000 11111122222222 000000000111333342111 011000000 0011 Q ss_pred HhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCc-----ccChh----hHHHHHHHHHHHHhccCCCCCCc Q lcl|NC_019511. 91 FGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDA-----TPGIK----EKEQMKRIEEFILNTGTDKDIDR 161 (330) Q Consensus 91 ~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~-----~~~~~----~~~~~~~i~~~l~~~~~~~pn~~ 161 (330) | .+....|+++++..+..-..|++ - |+++.-.+. ..++. ..+-...+++.+..-+. + T Consensus 59 ~--dst~~~a~~~Laa~l~~~ltP~~------~-WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-----~ 124 (535) T protein:vir:15 59 W--QAVGARGLNNLASKLMLALFPMQ------S-WMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIE-----S 124 (535) T ss_pred c--cccHHHHHHHHHHHHHHhhcCCC------c-ccccccChHHHhccCCCcchHHHHHHHHHHHHHHHHHHHH-----h Confidence 1 23344667777777754323431 1 444322111 11112 22223344444443332 2 Q ss_pred CCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccC----------------------- Q lcl|NC_019511. 162 DSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIK----------------------- 218 (330) Q Consensus 162 ~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~----------------------- 218 (330) .+|+.=+..+..|++++|++..|+.- ..+.+.....||+. +..+..|..|++.+ T Consensus 125 snf~~~~~~~~~~L~~~G~a~l~~~~-~~~~~~~f~~~pl~--~~~v~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~ 201 (535) T protein:vir:15 125 NSYRVTLFECLKQLIVAGNALLYLPE-PEGSYNPMKLYRLS--SYVVQRDAYGNVLQIVTRDQIAFGALPEDVRSAVEKA 201 (535) T ss_pred cCcHHHHHHHHHHHHhhCceeEEeec-CCCCceeeEEEEcC--eeEEeeCCCCCeeEEEEeEeecHHHHHHHHhHhhhcc Confidence 46777777778899999999888642 33334456666663 44455555553210 Q ss_pred -------CceeEEEE---e-CCceEEEe-ch-h-H--------------eeeecccCcCCCCCCCccccHHHHHHHHHHH Q lcl|NC_019511. 219 -------GGNRFVQV---I-DKQVVASF-TS-R-E--------------LVMGIRNPRSDLNSSGYGLSEVEIAMKEFIA 270 (330) Q Consensus 219 -------~~~~Y~q~---~-~~~~~~~~-~~-~-d--------------vih~~~n~~~d~~~~~yGlSPIe~a~~~I~~ 270 (330) ....++.. . +++....+ .. + + .+..+.+..+ ...||.||++-++-.+.. T Consensus 202 ~~~~~~~~~v~v~~~v~~~~~~~~~~~~~e~~g~~~~~~~~~~~~~~~P~i~~Rw~~~~---ge~YGrgp~~~~l~D~k~ 278 (535) T protein:vir:15 202 GGEKKMDEMVDVYTHVYLDEESGDYLKYEEVEDVEIDGSDATYPTDAMPYIPVRMVRID---GESYGRSYCEEYLGDLRS 278 (535) T ss_pred ccccCCCCceeEEEEEEEecCCCcEEEEEEeeCccccccccccccccCCceeeeeeecC---CCccccchHHHHHHHHHH Confidence 00111111 0 11111111 00 0 0 1111112222 145899999888877776 Q ss_pred HHHHHHHHHHHHhcCCCcceEEEeCCCCCCC----------------------------------HHHHHHHHHHHHHHh Q lcl|NC_019511. 271 YNNTESFNDRFFSHGGTTRGILQIRADQQQS----------------------------------QHALENFKREWKSSF 316 (330) Q Consensus 271 ~laae~~~~~fF~nGa~p~GiL~~~~~~~ls----------------------------------~e~~e~lr~~w~~~~ 316 (330) ....++-....-.-...|-.++ +.+.... .+.++.++......| T Consensus 279 L~~l~~~~l~~~~~~~~p~~lv--~~~g~~~~~~l~~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af 356 (535) T protein:vir:15 279 LENLQEAIVKMSMISAKVIGLV--NPAGITQPRRLTKAQTGDFVPGRREDIDFLQLEKQADFTVAKAVSDQIEARLSYAF 356 (535) T ss_pred HHHHHHHHHHHHHHHhcCceee--cccccccchhcccCCceeeecCCcccceeeecccccchhHHHHHHHHHHHHHHHHH Confidence 6655554444333333333221 1111111 222223333332222 Q ss_pred cCc----ccccccceeeC Q lcl|NC_019511. 317 SGI----NGSWQICLYIK 330 (330) Q Consensus 317 ~G~----~na~kvpvL~e 330 (330) --. .++.+ |--+ T Consensus 357 ~~~~~~~~~~~r--~TAt 372 (535) T protein:vir:15 357 MLNSAVQRTGER--VTAE 372 (535) T ss_pred hhhhcccCCCcc--ccHH Confidence 110 00000 0000 No 201 >protein:vir:96988 Length: 516 # NCBI annotation: 29 # Family: family:all:481 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654130;genbank:gi:108862014;genbank:GeneID:5075937 Probab=27.30 E-value=1.9 Score=19.11 Aligned_cols=258 Identities=9% Similarity=0.016 Sum_probs=84.0 Q ss_pred CchhHHHHHhcCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCcccc Q lcl|NC_019511. 1 MPDLFKSLRLGSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMRN 80 (330) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~ 80 (330) +...|+.|+..+.. +-..-++.....-+. .+..+....-..+++ . +. T Consensus 16 l~~r~~~L~~~R~~------------------~e~~w~e~a~~~lP~--------~~~~~~~~~~~~~~~-----d--st 62 (516) T protein:vir:96 16 IPKLWEKFSNKRSS------------------FLDRAKHYSKLTLPY--------LMNDKGDNETSQNGW-----Q--GV 62 (516) T ss_pred HHHHHHHHHHHhhH------------------HHHHHHHHHHhhccc--------ccCCCCCccccCCcc-----c--ch Confidence 34444444221110 111112222221110 000000000000111 0 11 Q ss_pred hHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCc-----ccChhh----HHHHHHHHHHHH Q lcl|NC_019511. 81 AHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDA-----TPGIKE----KEQMKRIEEFIL 151 (330) Q Consensus 81 ~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~-----~~~~~~----~~~~~~i~~~l~ 151 (330) ..+.++.+| +-=.+.+..+. .=|+++.-.+. ..+..+ .+-...+++.+. T Consensus 63 ---g~~a~~~LA------------a~l~~~ltpp~-------~~WF~L~~~~~~~~~~~~~~~~~~~v~~~L~~ve~~~~ 120 (516) T protein:vir:96 63 ---GAQATNHLA------------NKLAQVLFPAQ-------RSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAM 120 (516) T ss_pred ---HHHHHHHHH------------HHHHhhhcCCC-------CcccccccChhHHhhccccCchhHHHHHHHHHHHHHHH Confidence 112333333 12222111111 01223221110 001111 112233344333 Q ss_pred hccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccC------------- Q lcl|NC_019511. 152 NTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIK------------- 218 (330) Q Consensus 152 ~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~------------- 218 (330) .-+. +.+|+.=+-.++.|++++|++..|. ..+ + ....|||. +..+..|..|++.+ T Consensus 121 ~~l~-----~snf~~~~~~~~~~L~~~G~a~l~~--d~~--~-~~~~~pl~--~y~v~~d~~G~v~~i~rr~~~~~~~l~ 188 (516) T protein:vir:96 121 KELE-----QRQFRPAVVEAFKHLIVAGSCMLYK--PSK--G-AISAIPMH--HYVVNRDTNGDLLDIILLQEKALRTFD 188 (516) T ss_pred HHHH-----hcCcHHHHHHHHHHHHhHCeEeEEe--cCC--C-CEEEEEcC--eEEEeeCCCCCeeeehhhhHhhHHHHH Confidence 2211 2356666666677888899987775 232 2 35677773 34455555553210 Q ss_pred -------------------Ccee--------------EEEEeCCceEEEec---hhHe--eeecccCcCCCCCCCccccH Q lcl|NC_019511. 219 -------------------GGNR--------------FVQVIDKQVVASFT---SREL--VMGIRNPRSDLNSSGYGLSE 260 (330) Q Consensus 219 -------------------~~~~--------------Y~q~~~~~~~~~~~---~~dv--ih~~~n~~~d~~~~~yGlSP 260 (330) .... |++-.++......+ -++. +-.+.+..+ ...||.|| T Consensus 189 ~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~~d~~~~~~es~~~~~e~P~~~~Rw~~~~---ge~YGrgp 265 (516) T protein:vir:96 189 PATRAVVEVGLKGKKCKEDDSVKLYTHAKYLGDGFWELKQSADDIPVGKVSKIKSEKLPFIPLTWKRSY---GEDWGRPL 265 (516) T ss_pred HhhhhhhhhhhhhhhcCCCCceEEEEeeeeeCCceeEEEEEeCceeeccccccccccCCeeeeeeeecC---CCCcccch Confidence 0010 11111111111100 0111 111222222 24599999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCC------CcceEEEe------------CC------------CCCCC--HHHHHHH Q lcl|NC_019511. 261 VEIAMKEFIAYNNTESFNDRFFSHGG------TTRGILQI------------RA------------DQQQS--QHALENF 308 (330) Q Consensus 261 Ie~a~~~I~~~laae~~~~~fF~nGa------~p~GiL~~------------~~------------~~~ls--~e~~e~l 308 (330) .+-|+-.+.......+-....-.-.+ .|+|++.. +| ...++ .+.++.+ T Consensus 266 ~~~~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~l~~~~~g~i~~g~~~~v~~~q~~~~~d~~~~~~~i~~~ 345 (516) T protein:vir:96 266 AEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGAQTDVDHFVNSGTGEVVTGVEEDIHIVQLGKYADLTPISAVLEVY 345 (516) T ss_pred HHHhhHHHHHHHHHHHHHHHHHHHhcCCccccCcccccchhhhccCCCceeecCCcccceeeecCcccchhHHHHHHHHH Confidence 99888777766555443332211111 12222110 00 00111 3334444 Q ss_pred HHHHHHHhcCc----ccccccceeeC Q lcl|NC_019511. 309 KREWKSSFSGI----NGSWQICLYIK 330 (330) Q Consensus 309 r~~w~~~~~G~----~na~kvpvL~e 330 (330) +...+..|--. .++.+ |--+ T Consensus 346 ~~rI~~af~~~~l~~r~~~r--vTAt 369 (516) T protein:vir:96 346 TRRIGVVFMMETMTRRDAER--VTAV 369 (516) T ss_pred HHHHHHHHhhhhhccCCCcc--ccHH Confidence 44444333210 00000 0000 No 202 >protein:vir:94572 Length: 535 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919010;genbank:gi:119637774;genbank:GeneID:5179332 Probab=26.52 E-value=1.9 Score=19.01 Aligned_cols=272 Identities=12% Similarity=0.049 Sum_probs=94.5 Q ss_pred CccCc---chhHH-HHHHHHHHHHhhcccchhccc---cchhccccccccccCCCCCcCCCcccchHHHHHHHHHHhhcH Q lcl|NC_019511. 23 VPIDD---GIQAN-IRQIEQDTKEMQEITKSLYGK---QQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHEVLKKFGNNS 95 (330) Q Consensus 23 ~~~~~---~~~~~-~~~~~~~~~~~~~~~~~~~g~---~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~~Lr~~a~~~ 95 (330) +-... .++.. .+.+-+.+.+. -.+...+ =..|..|........... +... +-| .+ T Consensus 1 ~~~~~~~~~~~~~~~~~r~~~l~~~---R~~~e~~w~e~~~y~lP~~~~~~~~~~~----~~~~---------~~~--ds 62 (535) T protein:vir:94 1 MASSQKREGFAENGAKAVYDALKND---RNSYETRAENCAKYTIPSLFPKDSDNAS----TDYT---------TPW--QA 62 (535) T ss_pred CCchhhhhhHHHHHHHHHHHHHHHH---hhHHHHHHHHHHHHhccccCCCCCCccc----cccC---------Ccc--cc Confidence 11110 00000 11111111110 0001111 113344422111110000 0000 001 23 Q ss_pred HHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCc-----ccChhhHH----HHHHHHHHHHhccCCCCCCcCCHHH Q lcl|NC_019511. 96 ILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDA-----TPGIKEKE----QMKRIEEFILNTGTDKDIDRDSFQE 166 (330) Q Consensus 96 iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~-----~~~~~~~~----~~~~i~~~l~~~~~~~pn~~~s~~~ 166 (330) ....|+++++..+..-..|++ - |+++.-.+. ..++.+.+ .+..+++.+..-+ .+.+|+. T Consensus 63 t~~~a~~~Laa~l~~~ltP~~------~-WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~-----~~snf~~ 130 (535) T protein:vir:94 63 VGARGLNNLASKLMLALFPMQ------T-WMKLTISEFEAKQLVAQPAELAKVEEGLSMVERILMNYI-----ESNSYRV 130 (535) T ss_pred cHHHHHHHHHHHHHhhhcCCC------C-ccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHH-----HhcCcHH Confidence 445666777777754323431 2 344422211 11122222 2333333333222 2346666 Q ss_pred HHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCcccC---------------------------- Q lcl|NC_019511. 167 FCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGKIIK---------------------------- 218 (330) Q Consensus 167 fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~~~~---------------------------- 218 (330) =+-....|++++|++..|+.-. .+.+.....||+ .+..+..|..|++.+ T Consensus 131 ~~~~~~~~L~~~G~a~l~~~~~-~~~~~~f~~~pl--~~y~v~~d~~G~vd~i~r~~~~~~~~l~~~~~~~~~~~~~~~~ 207 (535) T protein:vir:94 131 TLFETLKQLVVAGNALLYIPEP-EGTYNPMKLYRL--SSYVVQRDAFGTVLQIVTLDKTAYAALPEDVRNSMDSSQEHKG 207 (535) T ss_pred HHHHHHHHHHhhCcEeEeeccC-cCcccceEEEEc--CeEEEeeCCCCCeEEEEeeeeccHHHhhHHHHHHHHhccccCC Confidence 6666778888999998886532 223345566666 345555566664320 Q ss_pred -CceeEEEE---e-CCceEEE-echhH----------------eeeecccCcCCCCCCCccccHHHHHHHHHHHHHHHHH Q lcl|NC_019511. 219 -GGNRFVQV---I-DKQVVAS-FTSRE----------------LVMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTES 276 (330) Q Consensus 219 -~~~~Y~q~---~-~~~~~~~-~~~~d----------------vih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~ 276 (330) ....++.. . +++.... +..++ .+..+.+..+ ...||.||++-++-.+......++ T Consensus 208 ~~~v~v~~~v~~~~~~~~~~~~~e~~g~~~~~~~~~~g~~~~P~~~~Rw~~~~---ge~YGrgp~~~~l~D~k~L~~l~~ 284 (535) T protein:vir:94 208 DEMIDVYTHIYLDEESGEYLKYEEIDGVEVEGTDASYPVDACPYIPVRMVRID---GESYGRSYCEEYLGDLRSLENLQE 284 (535) T ss_pred CceeEEEEEEEeeCCCCcEEEEEEecCeeeccccccCccccCCceeeeeeecC---CCccccchHHHHHHHHHHHHHHHH Confidence 00111110 1 1111111 11111 1111112221 145899999888777766554444 Q ss_pred HHHHHHhcCCC------cceEEEe------------C--------------CCCCCCHHHHHHHHHHHHHHhcCccc--c Q lcl|NC_019511. 277 FNDRFFSHGGT------TRGILQI------------R--------------ADQQQSQHALENFKREWKSSFSGING--S 322 (330) Q Consensus 277 ~~~~fF~nGa~------p~GiL~~------------~--------------~~~~ls~e~~e~lr~~w~~~~~G~~n--a 322 (330) -....=.-.+. |+|++.. + ++-....+.++.++......|--... . T Consensus 285 ~~l~~~~~a~~~~~lv~p~g~~~~~~~~~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~ 364 (535) T protein:vir:94 285 AIVKMSMISAKVIGLVNPAGITQVRRLTKAQTGDFVSGRPEDISFLQLEKAADFSVARAVSEQIEGRLSYAFMLNSAVQR 364 (535) T ss_pred HHHHHHHHhccCCcccccccccchhhcccCCCceeecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhHhhhccC Confidence 32221111111 1111110 0 00011223344444444433321100 0 Q ss_pred cccceeeC Q lcl|NC_019511. 323 WQICLYIK 330 (330) Q Consensus 323 ~kvpvL~e 330 (330) .+-.|--+ T Consensus 365 d~~rvTAt 372 (535) T protein:vir:94 365 TGERVTAE 372 (535) T ss_pred CCCCccHH Confidence 00000000 No 203 >protein:vir:105782 Length: 449 # NCBI annotation: gp5 # Family: family:all:6783 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224143;genbank:gi:62362218;genbank:GeneID:3342535 Probab=26.27 E-value=2 Score=18.98 Aligned_cols=259 Identities=11% Similarity=0.095 Sum_probs=89.4 Q ss_pred CCCCCcccccCccCcch-hHHHHHHHHHHHHhhcccchh-ccccchhccccccccccCCCCCcCCCcccchHHHHH-HHH Q lcl|NC_019511. 13 MYKEDTEDLMVPIDDGI-QANIRQIEQDTKEMQEITKSL-YGKQQAYAEPFLEMMDTNPDYRDKKSYMRNAHNLHE-VLK 89 (330) Q Consensus 13 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~-~g~~~~~~~~~~~~~~~~p~~~~~~s~~r~~~~~~~-~Lr 89 (330) +++| +..-.|.-. +.++......+.+.... . ..|.+.+.+- ++... ..++ ... T Consensus 1 ~~~~----~~~~~~~~~~~~~~~~~rd~l~~~~~g---lg~~r~~~~~~~---------g~~~~--------~~~~~l~~ 56 (449) T protein:vir:10 1 MTDK----LTLAVNHALNDARMARARMGLMVPTMG---LDNKRHSAWCEY---------GFPEL--------VTYENLYS 56 (449) T ss_pred Cchh----hHHHHhhhcchhHHHHHHHHHHHHHhc---CCcccchhhhhc---------CCccc--------CCHHHHHH Confidence 1111 111111101 11111111222211111 1 1223222111 11110 1123 345 Q ss_pred HHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCCCcCCHHHHHH Q lcl|NC_019511. 90 KFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDIDRDSFQEFCK 169 (330) Q Consensus 90 ~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn~~~s~~~fl~ 169 (330) .|..|.+.++||..++|+.-. .++ .+.-.+ +.+..+.+..--+.+++.+.+ .-|..+.+ T Consensus 57 ~Yr~~~ia~~iVd~~~d~~~~-~~~----------~i~~g~-~~~~~~~~~~~e~~~~~l~~~---------~~~~~l~e 115 (449) T protein:vir:10 57 LYRRGGIAHGAVEKLVGKCWQ-TNP----------EIIEGD-DADDSEDETSWEKKSKQVFTN---------RLWRSFAE 115 (449) T ss_pred HHhcCchhHHHHHhhhhhhhh-cCc----------ccccCc-cccchhhhHHHHHHHHHHHHH---------HHHHHHHH Confidence 566799999999999876421 111 111111 111111111111122221110 01222222 Q ss_pred HHHHHHHhcCCceeEEEEecCC---------CcceEEEEeeCCCceEEe---eCCCCcccCCceeEEEEe--CCceE--E Q lcl|NC_019511. 170 KIVRDTYTYDQVNFEKVFSPKN---------KTKMEKFIAVDPSTIFYA---TDKNGKIIKGGNRFVQVI--DKQVV--A 233 (330) Q Consensus 170 ~~v~d~L~~g~g~~~~v~~rd~---------~G~~~~L~pldp~tV~~~---~d~~G~~~~~~~~Y~q~~--~~~~~--~ 233 (330) +.- -.+++|-+.+.+.. +++ .+.+..|.|++...|.+. .|...-.+-.|..|.+.. .|+.. . T Consensus 116 a~~-~~rl~Gga~i~i~v-~d~~~l~~Pl~~~~~i~~i~v~~~~~i~~~~~~~dp~sp~yg~P~~y~v~~~~~g~~~~~~ 193 (449) T protein:vir:10 116 ADR-RRLVGRYAGILLHI-RDEKDWNLPATKGRGLQKVSVSWAGSLKVAEWDTGINSKTYGQPKLWKYTERLPNGSSRRV 193 (449) T ss_pred HHH-hhhccCcEEEEEEe-cCCCCCCcccccCcceeeEEeeccccCChhhhhcCCCCCCCCCceEEEEeeeccCCCccce Confidence 222 23346655555544 232 234556666665444332 111111111223333221 12211 1 Q ss_pred EechhHeeeecccCcCCCCCCCccccHHHHHHHHHHHHHHH-HHHHHHHHhcCC-----------CcceEEEeCCCCCCC Q lcl|NC_019511. 234 SFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNT-ESFNDRFFSHGG-----------TTRGILQIRADQQQS 301 (330) Q Consensus 234 ~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laa-e~~~~~fF~nGa-----------~p~GiL~~~~~~~ls 301 (330) .+-++-|+++...|. -|.|-++.+.+.+-....+ .-++..|+.|-. ...++....+ ... T Consensus 194 ~iH~SRl~~~~~~~~-------~g~~~L~~~yn~l~~~~~~~~~~a~~~l~~~~rq~~~~~~~~~~~~~l~~~~~--~~~ 264 (449) T protein:vir:10 194 DIHPDRVFILGDYSE-------DAIGFLEPAYNAFVSLEKVEGGSGESFLKNAARQLNVNFEKEIDFTNLASLYG--VSI 264 (449) T ss_pred eeccceeEeecCCCC-------CChhHHHHHHHHhhhHHHhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhHHhh--CCc Confidence 233444554422111 1778788887765332222 223333333211 1222211111 123 Q ss_pred HHHHHHHHHHHHHHhcCcccccccceeeC Q lcl|NC_019511. 302 QHALENFKREWKSSFSGINGSWQICLYIK 330 (330) Q Consensus 302 ~e~~e~lr~~w~~~~~G~~na~kvpvL~e 330 (330) ++..+++.+..+...+|.+ .+.|..| T Consensus 265 e~~~~~~~~~~~~~~~~~~---~~~i~~~ 290 (449) T protein:vir:10 265 DELQDKFNEVAGEINRGND---VLMTTQG 290 (449) T ss_pred hHHHHHHHHHHHHHhccch---heeecCC Confidence 3444556555554445543 2233333 No 204 >protein:vir:6896 Length: 523 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861872;genbank:gi:32453663;genbank:GeneID:1494298 Probab=25.67 E-value=2 Score=18.90 Aligned_cols=293 Identities=14% Similarity=0.127 Sum_probs=122.1 Q ss_pred CchhHHHHHhcCCCCCCcccccCccCcchhHHHHHHHHHHHHhhcccchhccccchh---cccc--ccccccCCCCCcCC Q lcl|NC_019511. 1 MPDLFKSLRLGSMYKEDTEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAY---AEPF--LEMMDTNPDYRDKK 75 (330) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~---~~~~--~~~~~~~p~~~~~~ 75 (330) |..+|+-.-+. +... ..+.++.......+++...|-+..- ..++ .+.+..+ .+.--- T Consensus 5 ~~~lf~f~~~~----------------de~~-~~~~~~~~~~S~~~p~~dDGa~~i~~~~~~~~~~~~~~~q~-~y~~~e 66 (523) T protein:vir:68 5 ILSLFAPWAKM----------------DERD-YKDQEKENLESITSPKLDDGAKEYEVSENEAQQTYNAMFQR-MFGSQE 66 (523) T ss_pred hhhhhhhhhhh----------------hhhh-hhhhhhccCCCccccCCCCcceeeeccccccccccchhhhh-hhhccc Confidence 44555433100 0000 0000000000001111111111000 0000 0000000 010011 Q ss_pred CcccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccC Q lcl|NC_019511. 76 SYMRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGT 155 (330) Q Consensus 76 s~~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~ 155 (330) +..++-....+.-|..|.+|-|..+|+.+.+++. ..+..+---++.+. +-+.++...++|..--+.+.+++. T Consensus 67 ~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneai------v~d~~~~pV~i~Ld--~~~~s~~iK~kI~eeF~~Il~ll~ 138 (523) T protein:vir:68 67 PGLKSTRELIDTYRNLMTNYEVDNAVSEIVSDAI------VYEDDTEVVSINLD--NTKFSPNIKSMMLDEFNEVLNHLS 138 (523) T ss_pred cccchHHHHHHHHHHHhhccchhhHHHHhhccee------eecCCCceEEEEec--ccccchHHHHHHHHHHHHHHHHhc Confidence 1235556666777888889999999999998874 22222222344443 334666666666554334444433 Q ss_pred CCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecC-CCcceEEEEeeCCCceEEe-----eCCCCcccCCce--eEEEEe Q lcl|NC_019511. 156 DKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPK-NKTKMEKFIAVDPSTIFYA-----TDKNGKIIKGGN--RFVQVI 227 (330) Q Consensus 156 ~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd-~~G~~~~L~pldp~tV~~~-----~d~~G~~~~~~~--~Y~q~~ 227 (330) ... ...++ ++.-++-|..++-++++.. .+.-+.+|..|||..|+.+ .+..|....++. .|+|.. T Consensus 139 F~~----~~~~~----fR~WYVDgRi~fhKiid~k~pk~GI~Elr~lDPr~i~~vr~i~~~~~~g~~vi~~~~e~f~Y~~ 210 (523) T protein:vir:68 139 FQR----KGSDH----FRRWYVDSRIFFHKIIDPKRPKEGIKELRRLDPRQVQYVREVITTTEAGVKIVKGYKEYFIYDT 210 (523) T ss_pred cch----hhhHH----HHhheeeeEEEEEEEeeCCCccccceeeeeeCCcceeEEEeecCCCCcchhhhhhhhhheeecc Confidence 222 22333 3455677788888887643 2334889999999998652 223333222221 233321 Q ss_pred -------CC-----ceEEEechhHeeeecccCcCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEeC Q lcl|NC_019511. 228 -------DK-----QVVASFTSRELVMGIRNPRSDLNSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIR 295 (330) Q Consensus 228 -------~~-----~~~~~~~~~dvih~~~n~~~d~~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~GiL~~~ 295 (330) .| +....++.+-|.|.+..-.++ + ++.=+|=+..|...+....-.|.-.-=|==--|.-+=|.-+. T Consensus 211 ~~~~~~~~g~~~~~~~~ikI~~dAI~y~hSGL~d~-~-~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYID 288 (523) T protein:vir:68 211 SHESYACDGRIYEAGTKIKIPKAAIVYAHSGLVDC-C-GKNIIGYLHRAIKPANQLKLLEDAVVIYRITRAPDRRVWYVD 288 (523) T ss_pred ccccccccccccCCCcceecchhheeeeeccceeC-C-CCceeccchhhhHHHHhhHHHHhhHHHHhhhccccceEEEEe Confidence 11 122345555555554221111 1 123345577777777665544433221111223333333333 Q ss_pred CCCCCCHHHHHHHHHHHHHHhc---------C-cccccccceeeC Q lcl|NC_019511. 296 ADQQQSQHALENFKREWKSSFS---------G-INGSWQICLYIK 330 (330) Q Consensus 296 ~~~~ls~e~~e~lr~~w~~~~~---------G-~~na~kvpvL~e 330 (330) .+ +|.+...++.-+...+.|. | +.|..+..-++| T Consensus 289 vG-nlPk~KAeqYl~~im~k~kNKlvYDa~TGev~ddrk~msMlE 332 (523) T protein:vir:68 289 TG-NMPSRKAAEHMQHVMNTMKNRIAYDATTGKIKNQQHIMSMTE 332 (523) T ss_pred cC-CCCchhHHHHHHHHHHhhcceeEEeccCCeeccchhhhhhHh Confidence 32 3444333333333322221 1 122223333333 No 205 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=22.47 E-value=2.4 Score=18.46 Aligned_cols=273 Identities=10% Similarity=0.064 Sum_probs=87.5 Q ss_pred CchhHHHHHhcCCCCCC---cccccCccCcchhHHHHHHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCc Q lcl|NC_019511. 1 MPDLFKSLRLGSMYKED---TEDLMVPIDDGIQANIRQIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSY 77 (330) Q Consensus 1 ~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~ 77 (330) |.|.|-...+. -+.+. ..+......+.+.-.++.......++.+-..=-.|+......+ +........ T Consensus 1 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~--------~~~~~~~~~ 71 (468) T protein:vir:96 1 MIDIFWPNEKP-YHERVVEQIKPQYETQEEMILRLITKHKENVEDITVGERYYNHQPDVLFNA--------PKRNVKGEI 71 (468) T ss_pred CccccCCcCce-eehheeecccccccCcHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccc--------ccccccccc Confidence 55554333211 01111 0000111111222222222222222221111112322111000 000000000 Q ss_pred ccchHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCC Q lcl|NC_019511. 78 MRNAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDK 157 (330) Q Consensus 78 ~r~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~ 157 (330) .....+ .+++ +++...|+++.+.-+ |..+.+++ .. + .+..+.+..++. T Consensus 72 ~~~~~~-----~ki~-~n~~~~Iv~~~~~~l--~g~p~~~~---------~~--d-------~~~~~~l~~~~~------ 119 (468) T protein:vir:96 72 DPFKPD-----WRMY-TNYHQNLVDQKVAYA--VANPVTYG---------TE--D-------EKSLKTIQEVLN------ 119 (468) T ss_pred cccccc-----cccc-cchHHHHHHHHHhhh--ccCCceec---------cC--C-------hHHHHHHHHHHh------ Confidence 000000 0011 234445544433332 12222211 11 1 112334444431 Q ss_pred CCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCC-CcccCCceeEEEEeCCceEEEec Q lcl|NC_019511. 158 DIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKN-GKIIKGGNRFVQVIDKQVVASFT 236 (330) Q Consensus 158 pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~-G~~~~~~~~Y~q~~~~~~~~~~~ 236 (330) .++...+..+..+++.+|.++..+.++.+ |+ +.+..++|..+.++.++. .....-.++|+..........++ T Consensus 120 ----n~~~~~~~~~~~~~~~~G~~~~~v~~d~~--~~-~~i~~~~p~~~~~v~~~~~~~~~~~~ir~~~~~~~~~~~~~~ 192 (468) T protein:vir:96 120 ----HKWDDKLVDILTAASNKGVEWIQPYVDEQ--GE-FKTFRVPAEQAIPIWTNKERDELKAFIRLYELDGGERVEYWT 192 (468) T ss_pred ----cCHHHHHHHHHHHHhhcCeEEEEEEEcCC--Cc-eEEEEEcccceEEEEcCCCCCceEEEEEEEEecCceEEEEEe Confidence 14556666778999999988776654443 54 467778999888775532 11111122332221122222223 Q ss_pred hhHeeeecc--------------------------c-----CcCCCCCCCccccHHH---HHHHHHHHHHHHHHHHHHHH Q lcl|NC_019511. 237 SRELVMGIR--------------------------N-----PRSDLNSSGYGLSEVE---IAMKEFIAYNNTESFNDRFF 282 (330) Q Consensus 237 ~~dvih~~~--------------------------n-----~~~d~~~~~yGlSPIe---~a~~~I~~~laae~~~~~fF 282 (330) .+.+.|.+. + |.--+....+|.|-++ ....++...++.-.-..++| T Consensus 193 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~ 272 (468) T protein:vir:96 193 ANDVTFYELKDGQLIPDYYQGEEHVQAHYYVGNKSMSWNRVPFIPFKNNPQEVSDLFMYKTIIDAMDKRLSDTQNTFDEA 272 (468) T ss_pred CCeEEEEEEcCCceeecccccccccccceeeccccccCCcccEEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHh Confidence 333222110 0 1111111234555544 44444444443322223333 Q ss_pred hcCCCcceEEEeCCCCCCCHHHHHHHHHHHHHH----hcCccccccc-ceeeC Q lcl|NC_019511. 283 SHGGTTRGILQIRADQQQSQHALENFKREWKSS----FSGINGSWQI-CLYIK 330 (330) Q Consensus 283 ~nGa~p~GiL~~~~~~~ls~e~~e~lr~~w~~~----~~G~~na~kv-pvL~e 330 (330) +.|- +.++|. .+++ .+.+....+.. ..|.+++ .+ .+.-+ T Consensus 273 ---~~p~--lv~~g~-~~~~--~~~~~~~~~~~~~i~~~~d~~~-~~~~l~~~ 316 (468) T protein:vir:96 273 ---TELI--YVLKGY-EGED--LEEFMYNLKYYKAINVDGDGSG-GVDTIQID 316 (468) T ss_pred ---cCce--eeeecC-Cccc--cchhhhhhhcCceEEecCCCCC-cceEEeec Confidence 3343 333331 1111 11111111100 0011010 10 01111 No 206 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=22.19 E-value=2.5 Score=18.42 Aligned_cols=277 Identities=14% Similarity=0.119 Sum_probs=89.6 Q ss_pred CchhHHHHHhcCCCCCCcccccCccCcchhHHHH-HHHHHHHHhhcccchhccccchhccccccccccCCCCCcCCCccc Q lcl|NC_019511. 1 MPDLFKSLRLGSMYKEDTEDLMVPIDDGIQANIR-QIEQDTKEMQEITKSLYGKQQAYAEPFLEMMDTNPDYRDKKSYMR 79 (330) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~p~~~~~~s~~r 79 (330) +-||++-.+..-.++.+...++. + ++...++ +..++..++.+-..=-.|+.... ..++..+.... .. T Consensus 2 ~~~~~~~~~~~~~~~~~~~~l~~--~-~i~~li~~~~~~~~~r~~~l~~YY~g~~~~i--------~~~~~~~~~~~-~~ 69 (506) T protein:vir:94 2 DYDLTEHKQANLIYQESLENLTP--N-KIMKFITHHFNYQRPRLEMLDDYYQGYNLKI--------LDKQSRRHEDG-KA 69 (506) T ss_pred CcchhhhhcceeecccchhcCCH--H-HHHHHHHHHHHHHHHHHHHHHHHhcCCCccc--------ccccccccccc-CC Confidence 56666666655666655333221 1 1111111 11222111211111112222110 00111000000 00 Q ss_pred chHHHHHHHHHHhhcHHHHHHHHHHHHhHhhhhhhheecccccceeeeccCCCcccChhhHHHHHHHHHHHHhccCCCCC Q lcl|NC_019511. 80 NAHNLHEVLKKFGNNSILNAIIITRANQVSTYCKPARYSEKGVGFEVKLKDLDATPGIKEKEQMKRIEEFILNTGTDKDI 159 (330) Q Consensus 80 ~~~~~~~~Lr~~a~~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~g~~v~~kd~~~~~~~~~~~~~~~i~~~l~~~~~~~pn 159 (330) +. | .+ .+....|+++.+.-+ |..+.+ +...++ . ....+.+++.. | T Consensus 70 ~~-------k-i~-~n~~~~Iv~~~~~~l--~G~p~~---------~~~~d~------~---~~~~l~~~~~~------N 114 (506) T protein:vir:94 70 DH-------R-AT-HSFAKYIADFQTSYS--VGNPIN---------VKLPDD------G---SNSGFDTFNKA------N 114 (506) T ss_pred cc-------e-ee-cchHHHHHHHhhhhh--cccCce---------eecCcc------h---HHHHHHHHHhc------c Confidence 00 0 00 344555555544433 122222 222111 1 12334444421 2 Q ss_pred CcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcceEEEEeeCCCceEEeeCCCCc-ccCCceeEEEEe--CCce----- Q lcl|NC_019511. 160 DRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTKMEKFIAVDPSTIFYATDKNGK-IIKGGNRFVQVI--DKQV----- 231 (330) Q Consensus 160 ~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~~~~L~pldp~tV~~~~d~~G~-~~~~~~~Y~q~~--~~~~----- 231 (330) ++...+..+..+.+.+|.++..+.++.+ |+ ..+..++|..+.++.++... .+.-.++|++.. ++.. T Consensus 115 ---~~~~~~~~~~~~~~~~G~a~~~v~~ded--~~-~~i~~~~p~~~~~v~dd~~~~~~~~~v~~~~~~~~~~~~~~~~~ 188 (506) T protein:vir:94 115 ---DVDAENYDLFLDMSRYGRAYEYVYRGED--NE-EHLAKLDPLDTFVIYSTDVDPKPIMAVRYHQIELVDDNQVSTIN 188 (506) T ss_pred ---CHhHHHHHHHHHHHhcCeEEEEEEecCC--Ce-eEEEEEcccceEEEecCCCCCceEEEEEEEeeeeccCCceeEEE Confidence 4555666778899999988877665544 54 45777999998887665321 111123332211 1110 Q ss_pred --EEEechhHeeeec------------ccCcCC-----CCCCCccccHHH---HHHHHHHHHHHHHHHHHHHHhc----- Q lcl|NC_019511. 232 --VASFTSRELVMGI------------RNPRSD-----LNSSGYGLSEVE---IAMKEFIAYNNTESFNDRFFSH----- 284 (330) Q Consensus 232 --~~~~~~~dvih~~------------~n~~~d-----~~~~~yGlSPIe---~a~~~I~~~laae~~~~~fF~n----- 284 (330) ...++...+.+.. .++... +.....|.|.++ ....++...++.-.-...+|++ T Consensus 189 ~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~~~~liDa~d~~~S~~~~~~~~~~~~~l~~ 268 (506) T protein:vir:94 189 YVPETWTADTYTLYNPTPIMGKMQVDTTKPITTFPVVEFKNSNFRLGDFENVLPLIDLYDAAQSDTANYMTDLNEAMLII 268 (506) T ss_pred EEEEEEeCceEEEeccccCccceeccccccCCccceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhHHHHH Confidence 0112222221110 011111 101122444433 3333333333222222233332 Q ss_pred -CCC--------cceEEE---eCCCCCCCHHHHHHHHHHHHHHhcCcccccccc----------eeeC Q lcl|NC_019511. 285 -GGT--------TRGILQ---IRADQQQSQHALENFKREWKSSFSGINGSWQIC----------LYIK 330 (330) Q Consensus 285 -Ga~--------p~GiL~---~~~~~~ls~e~~e~lr~~w~~~~~G~~na~kvp----------vL~e 330 (330) |.. +...+. ..+......+..+-++....+..-.....+.+. |.-+ T Consensus 269 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~ 336 (506) T protein:vir:94 269 QGDIDTLFEGSDMMNTIDPNDEDAMAKLAKDKLELIKEMKDANMLLLKSGMTVNGTQTSVDAKYINKT 336 (506) T ss_pred hcCccccccchhccccccccccccccccccchhHHHhhhhhcCeeeecccccccCccccccceeeeec Confidence 110 000000 001111111112222211111111111000000 0000 No 207 >protein:vir:100039 Length: 522 # NCBI annotation: T7-like head-to-tail connector # Family: family:all:481 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214201;genbank:gi:61806424;genbank:GeneID:3294719 Probab=20.68 E-value=2.7 Score=18.19 Aligned_cols=260 Identities=8% Similarity=0.041 Sum_probs=83.2 Q ss_pred hccccchhccccccccccCCCCCcCC--C-cccchHHHH--HHHHHHhh--cHHHHHHHHHHHHhHhhhhhhheeccccc Q lcl|NC_019511. 50 LYGKQQAYAEPFLEMMDTNPDYRDKK--S-YMRNAHNLH--EVLKKFGN--NSILNAIIITRANQVSTYCKPARYSEKGV 122 (330) Q Consensus 50 ~~g~~~~~~~~~~~~~~~~p~~~~~~--s-~~r~~~~~~--~~Lr~~a~--~~iv~a~I~~~~d~Ia~~~~~~~~~~~~~ 122 (330) +.-+ +.|+.-....-.--+.|+.=. + |.+....+. ..-+...+ .+..-.|.++++..+..-..+. +. T Consensus 1 m~~~-~r~~~L~~~R~~~e~~w~e~~~~tlP~~~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp-----~~ 74 (522) T protein:vir:10 1 MKAR-ERYNQLTTARQMFLDKAVECSELTLPYLIDDDISSRPNHKSLTVPWQSVGAKCCVTLAAKLMLAVLPP-----QT 74 (522) T ss_pred CchH-HHHHHHHHHhhHHHHHHHHHHHHhhhcccCCCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCC-----CC Confidence 0000 011000000000000111100 0 001111100 00011110 1234445555555554321111 11 Q ss_pred ceeeecc--CC--CcccChhhH----HHHHHHHHHHHhccCCCCCCcCCHHHHHHHHHHHHHhcCCceeEEEEecCCCcc Q lcl|NC_019511. 123 GFEVKLK--DL--DATPGIKEK----EQMKRIEEFILNTGTDKDIDRDSFQEFCKKIVRDTYTYDQVNFEKVFSPKNKTK 194 (330) Q Consensus 123 g~~v~~k--d~--~~~~~~~~~----~~~~~i~~~l~~~~~~~pn~~~s~~~fl~~~v~d~L~~g~g~~~~v~~rd~~G~ 194 (330) =|+++. |. .+..+.... +....+++.+..-+. +.+|+.=+-.+..|++++|++..|.- .+ T Consensus 75 -~WF~l~~~d~~l~~~~~~~~~~~v~~~l~~ve~~~~~~l~-----~snf~~~~~~~~~~L~~~G~a~ly~~--~~---- 142 (522) T protein:vir:10 75 -SFFKLQVRDDKLGEELDPQIRSELDLSFSKMERMIMDYIA-----ASNDRVAVHQALKHLIVGGNALIFMG--KD---- 142 (522) T ss_pred -ccccccCChHHHhhhcChhhHHHHHHHHHHHHHHHHHHHH-----hcCcHHHHHHHHHHHHhHCceeEEEc--CC---- Confidence 133332 21 111111111 122334444433322 23566666677788999999887652 22 Q ss_pred eEEEEeeCCCceEEeeCCCCccc--------------------------------CCceeEEEEe----C-CceEEEech Q lcl|NC_019511. 195 MEKFIAVDPSTIFYATDKNGKII--------------------------------KGGNRFVQVI----D-KQVVASFTS 237 (330) Q Consensus 195 ~~~L~pldp~tV~~~~d~~G~~~--------------------------------~~~~~Y~q~~----~-~~~~~~~~~ 237 (330) +...||+. +..+..|..|++. ......+..+ + +.....-.. T Consensus 143 ~~~~~pl~--~y~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~~~~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~ 220 (522) T protein:vir:10 143 GLKTFPLT--RYVINRDGDGNVLEIVTKELISRKVLDIELPEPKPNTGIDESSTTNDDVTIYTYVKLDKSSGRWVWHQEA 220 (522) T ss_pred CceEEEcc--eEEEeeCCCCCeeEEEeeeeccHHHHHHhcchhccchhhhcccCCCCceEEEEEEEeeccCCceEEEEcc Confidence 12344542 2333344444321 0001100000 0 111111011 Q ss_pred h-Heeeec-------ccC----cCCC-CCCCccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc------eEEEe---- Q lcl|NC_019511. 238 R-ELVMGI-------RNP----RSDL-NSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTR------GILQI---- 294 (330) Q Consensus 238 ~-dvih~~-------~n~----~~d~-~~~~yGlSPIe~a~~~I~~~laae~~~~~fF~nGa~p~------GiL~~---- 294 (330) + .++... .+| |... ....||.||++-++-.+......++-....-.-...|- |++.. T Consensus 221 ~~~~~~~~~s~~g~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~~~~~~~a~~p~~lv~~~~~~~~~~l~ 300 (522) T protein:vir:10 221 FDKIIPDSRSTAPKNASPWLPLRFNTVDGEDYGRGRVEEFLGDLKSLDGLSQSLIEGAAAASKVVFLVSPSSTTKPATIA 300 (522) T ss_pred CCccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeecccccccccccc Confidence 1 111100 001 1111 12458999999888887776655554444322222222 11110 Q ss_pred C--------------------CCCCC--CHHHHHHHHHHHHHHh--cCcccccccceeeC Q lcl|NC_019511. 295 R--------------------ADQQQ--SQHALENFKREWKSSF--SGINGSWQICLYIK 330 (330) Q Consensus 295 ~--------------------~~~~l--s~e~~e~lr~~w~~~~--~G~~na~kvpvL~e 330 (330) + ....+ ..+.++.++......| ....++.++ -=.| T Consensus 301 ~~~~~~~v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~ri~~aFl~~~~~d~~rv-TAtE 359 (522) T protein:vir:10 301 KAGNGAIVQGRPEDVAVIQVGKTADFSTAANMATAIEKRLLEAFLVMNVRNAERV-TAEE 359 (522) T ss_pred CCCCcceecCCCccceeecccccccchHHHHHHHHHHHHHHHHHhhccCCCCCCC-CHHH Confidence 0 00011 1333444444444332 111111110 0001 Done!