Query         lcl|NC_019408.1_cdsid_YP_006989075.1 [gene=D869_gp305] [protein=putative portal protein] [protein_id=YP_006989075.1] [location=20300..22138]
Match_columns 612
No_of_seqs    163 out of 203
Neff          7.3
Searched_HMMs 1612
Date          Thu Nov 7 17:11:38 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_46 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_46_vs_rec_db.hhr

 No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
  1 protein:vir:97265 Length: 513 100.0 1E-147 7E-151 826.2 48.1 466 1-525 10-513 (513)
  2 protein:vir:80453 Length: 535 100.0 1E-146 7E-150 820.6 48.1 461 1-509 36-535 (535)
  3 protein:vir:95149 Length: 501 100.0 7E-146 4E-149 816.3 48.7 473 1-514 5-501 (501)
  4 protein:vir:95014 Length: 491 100.0 1E-145 7E-149 815.2 49.2 464 1-516 12-491 (491)
  5 protein:vir:78393 Length: 489 100.0 1E-143 7E-147 804.3 46.7 463 1-516 12-489 (489)
  6 protein:vir:94956 Length: 452 100.0 6E-142 4E-145 794.6 48.4 445 1-506 4-452 (452)
  7 protein:vir:96783 Length: 488 100.0 1E-135 8E-139 760.2 45.6 439 1-497 17-488 (488)
  8 protein:vir:5961 Length: 503 # 99.9 3.6E-27 2.2E-30 165.5 30.4 458 1-540 37-503 (503)
  9 protein:vir:94805 Length: 492 99.9 3.8E-27 2.3E-30 165.4 28.0 436 1-533 52-492 (492)
 10 protein:vir:79043 Length: 479 99.9 2.3E-26 1.4E-29 161.1 30.9 445 1-535 27-479 (479)
 11 protein:vir:97336 Length: 492 99.9 2E-26 1.2E-29 161.5 27.5 438 1-534 52-492 (492)
 12 protein:vir:93747 Length: 472 99.9 2.3E-25 1.5E-28 155.6 31.5 438 1-536 32-472 (472)
 13 protein:vir:1236 Length: 483 # 99.9 1E-25 6.4E-29 157.6 29.1 437 1-536 43-483 (483)
 14 protein:vir:9871 Length: 429 # 99.9 1.4E-25 8.7E-29 156.8 29.3 414 1-528 9-429 (429)
 15 protein:vir:106571 Length: 499 99.9 4.3E-25 2.7E-28 154.2 31.2 452 1-547 24-499 (499)
 16 protein:vir:106639 Length: 481 99.9 1.7E-24 1E-27 150.9 33.2 424 1-515 38-481 (481)
 17 protein:vir:105889 Length: 474 99.9 6.3E-25 3.9E-28 153.3 30.8 437 1-526 9-474 (474)
 18 protein:vir:94101 Length: 474 99.9 6.3E-25 3.9E-28 153.3 30.8 437 1-526 9-474 (474)
 19 protein:vir:94498 Length: 474 99.9 1.1E-24 6.8E-28 151.9 30.2 435 1-524 35-474 (474)
 20 protein:vir:97447 Length: 474 99.9 1.1E-24 6.8E-28 151.9 30.2 435 1-524 35-474 (474)
 21 protein:vir:3964 Length: 453 # 99.9 1E-24 6.5E-28 152.1 29.2 421 1-524 25-453 (453)
 22 protein:vir:95806 Length: 440 99.9 4.3E-24 2.7E-27 148.7 32.2 417 1-517 2-440 (440)
 23 protein:vir:99781 Length: 511 99.9 6.6E-24 4.1E-27 147.7 32.8 440 1-534 48-511 (511)
 24 protein:vir:102330 Length: 451 99.9 1.4E-23 8.5E-27 145.9 34.3 432 1-526 9-451 (451)
 25 protein:vir:96240 Length: 511 99.9 1.7E-23 1.1E-26 145.4 34.4 442 1-536 48-511 (511)
 26 protein:vir:80680 Length: 441 99.9 2.6E-23 1.6E-26 144.4 35.0 415 1-541 12-441 (441)
 27 protein:vir:95113 Length: 474 99.9 7.1E-24 4.4E-27 147.5 31.7 434 1-526 35-474 (474)
 28 protein:vir:96179 Length: 468 99.9 9.4E-24 5.9E-27 146.8 32.3 430 1-519 34-468 (468)
 29 protein:vir:105292 Length: 478 99.9 7E-24 4.4E-27 147.5 31.4 437 1-531 34-478 (478)
 30 protein:vir:9306 Length: 511 # 99.9 3.2E-23 2E-26 143.9 34.7 441 1-536 48-511 (511)
 31 protein:vir:96266 Length: 474 99.9 7.9E-24 4.9E-27 147.3 31.2 436 1-531 35-474 (474)
 32 protein:vir:95899 Length: 474 99.9 7.9E-24 4.9E-27 147.3 31.2 436 1-531 35-474 (474)
 33 protein:vir:97171 Length: 512 99.9 2.8E-24 1.8E-27 149.7 28.7 444 1-547 48-512 (512)
 34 protein:vir:107112 Length: 478 99.9 1.5E-23 9.6E-27 145.6 32.3 437 1-531 34-478 (478)
 35 protein:vir:103951 Length: 511 99.9 1E-23 6.2E-27 146.7 31.0 443 1-547 48-511 (511)
 36 protein:vir:105819 Length: 456 99.9 1.9E-23 1.2E-26 145.1 32.1 426 1-512 13-456 (456)
 37 protein:vir:102602 Length: 456 99.9 1.9E-23 1.2E-26 145.1 32.1 426 1-512 13-456 (456)
 38 protein:vir:3609 Length: 452 # 99.9 1.4E-23 8.7E-27 145.9 31.2 421 1-536 25-452 (452)
 39 protein:vir:9922 Length: 489 # 99.9 3.7E-23 2.3E-26 143.6 32.8 425 1-510 23-489 (489)
 40 protein:vir:78805 Length: 511 99.9 4.6E-23 2.9E-26 143.0 32.7 441 1-526 48-511 (511)
 41 protein:vir:96366 Length: 511 99.9 4.6E-23 2.9E-26 143.0 32.7 441 1-526 48-511 (511)
 42 protein:vir:99072 Length: 479 99.9 3.3E-23 2.1E-26 143.8 31.9 443 1-540 17-479 (479)
 43 protein:vir:733 Length: 453 # 99.9 1.4E-22 8.9E-26 140.3 34.5 415 1-532 25-453 (453)
 44 protein:vir:99522 Length: 470 99.9 6.8E-23 4.2E-26 142.1 32.7 423 1-535 33-470 (470)
 45 protein:vir:2732 Length: 501 # 99.9 1.2E-22 7.4E-26 140.8 34.0 432 1-534 47-501 (501)
 46 protein:vir:102950 Length: 471 99.9 4.5E-23 2.8E-26 143.1 31.2 445 1-511 13-471 (471)
 47 protein:vir:96839 Length: 474 99.9 9.4E-23 5.8E-26 141.3 32.1 434 1-517 34-474 (474)
 48 protein:vir:98444 Length: 434 99.9 3E-22 1.9E-25 138.6 33.5 405 34-508 1-434 (434)
 49 protein:vir:7987 Length: 456 # 99.9 1.3E-22 8.4E-26 140.5 31.4 425 1-512 13-456 (456)
 50 protein:vir:96494 Length: 501 99.9 1.8E-22 1.1E-25 139.7 31.8 429 1-533 47-501 (501)
 51 protein:vir:78227 Length: 480 99.9 8E-23 5E-26 141.7 29.7 450 1-546 11-480 (480)
 52 protein:vir:78537 Length: 480 99.9 1.2E-22 7.1E-26 140.9 30.0 445 1-549 11-480 (480)
 53 protein:vir:4898 Length: 502 # 99.9 4.8E-22 3E-25 137.4 33.2 428 1-540 48-502 (502)
 54 protein:vir:104082 Length: 485 99.9 8E-22 5E-25 136.2 32.6 439 1-540 21-485 (485)
 55 protein:vir:94546 Length: 506 99.9 8.8E-22 5.5E-25 136.0 30.9 436 1-536 30-506 (506)
 56 protein:vir:2341 Length: 488 # 99.9 1E-21 6.4E-25 135.6 31.3 438 1-539 16-488 (488)
 57 protein:vir:9568 Length: 410 # 99.9 3.6E-21 2.2E-24 132.6 33.0 391 7-499 1-410 (410)
 58 protein:vir:105461 Length: 470 99.9 1.3E-21 7.9E-25 135.1 30.3 444 1-529 13-470 (470)
 59 protein:vir:2500 Length: 501 # 99.9 1.2E-21 7.3E-25 135.3 29.1 458 1-540 31-501 (501)
 60 protein:vir:2427 Length: 485 # 99.9 6.4E-21 4E-24 131.3 31.1 441 1-540 21-485 (485)
 61 protein:vir:4223 Length: 486 # 99.8 1.9E-21 1.2E-24 134.1 28.1 440 1-540 21-486 (486)
 62 protein:vir:7768 Length: 484 # 99.8 4.8E-21 3E-24 132.0 29.5 441 1-539 20-484 (484)
 63 protein:vir:9751 Length: 422 # 99.8 3.8E-20 2.4E-23 127.1 33.5 395 1-496 9-422 (422)
 64 protein:vir:94742 Length: 409 99.8 1.3E-20 8E-24 129.6 30.9 383 1-480 9-409 (409)
 65 protein:vir:78083 Length: 537 99.8 2E-18 1.3E-21 117.6 33.7 496 1-612 20-529 (537)
 66 protein:vir:1634 Length: 409 # 99.8 1.5E-18 9.4E-22 118.3 32.6 382 1-480 9-409 (409)
 67 protein:vir:8846 Length: 705 # 99.8 3.2E-16 2E-19 105.5 40.7 584 1-612 10-700 (705)
 68 protein:vir:99916 Length: 504 99.7 4.4E-17 2.7E-20 110.3 32.8 433 1-531 17-504 (504)
 69 protein:vir:8184 Length: 474 # 99.7 2E-17 1.3E-20 112.1 30.4 413 1-518 23-474 (474)
 70 protein:vir:101494 Length: 527 99.7 1.9E-17 1.2E-20 112.3 29.6 468 1-521 23-527 (527)
 71 protein:vir:102239 Length: 527 99.7 2E-17 1.2E-20 112.1 29.6 468 1-521 23-527 (527)
 72 protein:vir:38 Length: 496 # N 99.7 2.5E-14 1.5E-17 95.2 36.6 435 1-511 31-496 (496)
 73 protein:vir:80959 Length: 499 99.6 7.3E-14 4.5E-17 92.6 37.8 439 1-511 31-499 (499)
 74 protein:vir:7430 Length: 563 # 99.6 2.4E-14 1.5E-17 95.3 31.1 457 1-510 21-563 (563)
 75 protein:vir:93630 Length: 776 99.5 4.7E-13 2.9E-16 88.2 33.9 580 1-612 38-739 (776)
 76 protein:vir:108295 Length: 711 99.5 2.7E-12 1.7E-15 84.0 37.7 552 1-591 20-711 (711)
 77 protein:vir:3028 Length: 500 # 99.5 9.3E-12 5.8E-15 81.1 36.8 436 1-509 32-500 (500)
 78 protein:vir:9815 Length: 500 # 99.5 9.3E-12 5.8E-15 81.1 36.8 436 1-509 32-500 (500)
 79 protein:vir:80165 Length: 651 99.4 1.8E-11 1.1E-14 79.5 36.4 535 1-590 29-651 (651)
 80 protein:vir:98883 Length: 517 99.4 1.9E-11 1.2E-14 79.4 34.0 435 1-515 26-517 (517)
 81 protein:vir:1587 Length: 508 # 99.4 2.1E-11 1.3E-14 79.2 34.6 437 1-509 33-508 (508)
 82 protein:vir:79703 Length: 505 99.3 2E-10 1.2E-13 73.8 35.5 422 1-509 32-505 (505)
 83 protein:vir:78907 Length: 518 99.1 1.7E-09 1.1E-12 68.7 34.1 451 11-511 1-518 (518)
 84 protein:vir:4782 Length: 522 # 99.1 2.1E-09 1.3E-12 68.2 38.0 443 1-510 31-522 (522)
 85 protein:vir:100920 Length: 725 99.1 2.4E-09 1.5E-12 67.9 38.9 579 4-612 1-723 (725)
 86 protein:vir:95821 Length: 763 99.1 2.6E-09 1.6E-12 67.6 43.2 560 1-612 8-731 (763)
 87 protein:vir:77597 Length: 725 99.1 3E-09 1.9E-12 67.3 36.9 578 4-606 1-725 (725)
 88 protein:vir:172 Length: 708 # 99.1 3.4E-09 2.1E-12 67.0 41.8 579 1-607 1-708 (708)
 89 protein:vir:105429 Length: 708 99.0 5E-09 3.1E-12 66.1 41.8 568 1-607 1-708 (708)
 90 protein:vir:105520 Length: 706 98.9 1.3E-08 8.3E-12 63.7 45.5 574 1-612 1-706 (706)
 91 protein:vir:2764 Length: 714 # 98.7 1.4E-07 9E-11 58.1 39.2 561 1-598 7-714 (714)
 92 protein:vir:9950 Length: 714 # 98.7 1.4E-07 9E-11 58.1 39.2 561 1-598 7-714 (714)
 93 protein:vir:817 Length: 714 # 98.7 1.4E-07 9E-11 58.1 39.2 561 1-598 7-714 (714)
 94 protein:vir:10117 Length: 714 98.7 1.4E-07 9E-11 58.1 39.2 561 1-598 7-714 (714)
 95 protein:vir:3296 Length: 714 # 98.7 1.4E-07 9E-11 58.1 39.2 561 1-598 7-714 (714)
 96 protein:vir:9263 Length: 725 # 98.6 1.6E-07 1E-10 57.8 37.6 573 4-607 1-725 (725)
 97 protein:vir:105619 Length: 772 98.6 2.4E-07 1.5E-10 56.9 42.0 574 1-612 11-743 (772)
 98 protein:vir:345 Length: 663 # 98.6 2.6E-07 1.6E-10 56.7 33.8 538 1-605 9-663 (663)
 99 protein:vir:104437 Length: 714 98.6 2.9E-07 1.8E-10 56.4 40.1 566 1-598 8-714 (714)
100 protein:vir:95449 Length: 584 98.6 9.1E-08 5.7E-11 59.2 20.2 503 1-565 12-584 (584)
101 protein:vir:94599 Length: 641 98.4 9.5E-07 5.9E-10 53.6 33.1 526 1-602 20-641 (641)
102 protein:vir:3520 Length: 720 # 97.8 1.6E-05 1E-08 46.8 41.4 576 1-609 1-720 (720)
103 protein:vir:79538 Length: 502 97.5 5.5E-05 3.4E-08 43.9 31.7 422 1-516 9-502 (502)
104 protein:vir:80040 Length: 461 97.1 0.00016 9.7E-08 41.4 29.1 410 1-502 1-461 (461)
105 protein:vir:10447 Length: 536 97.0 0.00022 1.4E-07 40.6 29.0 471 1-596 13-536 (536)
106 protein:vir:100039 Length: 522 96.9 0.00025 1.6E-07 40.3 29.5 464 1-583 1-522 (522)
107 protein:vir:80211 Length: 514 96.8 0.00034 2.1E-07 39.6 28.9 455 1-558 8-514 (514)
108 protein:vir:2198 Length: 536 # 96.6 0.00052 3.2E-07 38.6 30.4 467 1-596 1-536 (536)
109 protein:vir:78942 Length: 510 96.3 0.00073 4.5E-07 37.8 29.8 459 1-565 12-510 (510)
110 protein:vir:78161 Length: 355 96.2 0.0009 5.6E-07 37.3 17.4 318 163-551 1-355 (355)
111 protein:vir:5249 Length: 437 # 96.2 0.00093 5.8E-07 37.2 23.5 407 2-516 1-437 (437)
112 protein:vir:1538 Length: 535 # 95.7 0.0015 9.5E-07 36.0 31.8 469 1-595 21-535 (535)
113 protein:vir:103765 Length: 549 95.6 0.0017 1.1E-06 35.7 29.4 478 1-592 1-549 (549)
114 protein:vir:6322 Length: 510 # 95.6 0.0019 1.2E-06 35.6 30.5 462 2-565 1-510 (510)
115 protein:vir:3361 Length: 535 # 95.5 0.0019 1.2E-06 35.5 29.3 467 1-595 21-535 (535)
116 protein:vir:96988 Length: 516 95.2 0.0025 1.6E-06 34.8 27.4 455 1-558 16-516 (516)
117 protein:vir:98816 Length: 446 95.2 0.0025 1.6E-06 34.8 27.8 389 1-485 7-446 (446)
118 protein:vir:94572 Length: 535 95.1 0.0028 1.7E-06 34.6 30.7 474 1-598 10-535 (535)
119 protein:vir:99672 Length: 532 94.8 0.0034 2.1E-06 34.1 33.9 466 1-591 8-532 (532)
120 protein:vir:103860 Length: 528 94.8 0.0034 2.1E-06 34.1 32.5 467 1-601 16-528 (528)
121 protein:vir:103330 Length: 517 94.8 0.0035 2.2E-06 34.0 28.7 474 1-593 3-517 (517)
122 protein:vir:7017 Length: 515 # 94.6 0.0038 2.4E-06 33.8 29.6 466 1-572 6-515 (515)
123 protein:vir:78696 Length: 542 94.6 0.0039 2.4E-06 33.8 32.0 482 1-606 12-542 (542)
124 protein:vir:107880 Length: 491 94.5 0.0044 2.7E-06 33.5 34.5 454 1-595 12-491 (491)
125 protein:vir:3420 Length: 533 # 94.4 0.0046 2.9E-06 33.4 28.6 430 2-522 1-533 (533)
126 protein:vir:1785 Length: 555 # 94.2 0.0051 3.2E-06 33.1 32.1 500 1-604 12-555 (555)
127 protein:vir:79063 Length: 491 93.9 0.0059 3.7E-06 32.8 31.9 449 1-595 12-491 (491)
128 protein:vir:108215 Length: 469 93.9 0.006 3.7E-06 32.8 30.4 421 1-543 4-469 (469)
129 protein:vir:7321 Length: 556 # 93.7 0.0068 4.2E-06 32.5 32.6 507 2-596 1-556 (556)
130 protein:vir:94709 Length: 522 93.4 0.0077 4.8E-06 32.2 35.6 456 1-577 19-522 (522)
131 protein:vir:79511 Length: 448 93.4 0.0078 4.8E-06 32.1 24.6 398 1-521 11-448 (448)
132 protein:vir:107662 Length: 427 93.1 0.0087 5.4E-06 31.9 23.6 387 1-503 3-427 (427)
133 protein:vir:105641 Length: 516 93.0 0.009 5.6E-06 31.8 28.3 455 1-551 16-516 (516)
134 protein:vir:99853 Length: 488 92.8 0.0099 6.1E-06 31.6 36.6 445 2-594 1-488 (488)
135 protein:vir:8883 Length: 543 # 92.5 0.011 6.8E-06 31.3 33.7 487 1-604 5-543 (543)
136 protein:vir:10321 Length: 495 92.4 0.011 7.1E-06 31.2 31.5 429 1-519 3-495 (495)
137 protein:vir:77981 Length: 448 92.4 0.011 7.1E-06 31.2 26.4 399 1-538 11-448 (448)
138 protein:vir:104338 Length: 422 91.6 0.015 9.3E-06 30.6 24.0 384 1-502 1-422 (422)
139 protein:vir:100882 Length: 383 90.8 0.019 1.2E-05 30.0 21.6 352 1-502 3-383 (383)
140 protein:vir:99232 Length: 526 90.5 0.021 1.3E-05 29.8 33.3 470 1-601 11-526 (526)
141 protein:vir:98506 Length: 555 88.7 0.031 1.9E-05 28.9 31.9 508 1-595 1-555 (555)
142 protein:vir:107404 Length: 555 88.7 0.031 1.9E-05 28.9 31.9 508 1-595 1-555 (555)
143 protein:vir:107822 Length: 555 88.7 0.031 1.9E-05 28.9 31.9 508 1-595 1-555 (555)
144 protein:vir:1084 Length: 437 # 88.2 0.034 2.1E-05 28.7 10.0 146 462-612 1-170 (437)
145 protein:vir:104256 Length: 458 87.7 0.037 2.3E-05 28.4 11.2 123 460-612 1-156 (458)
146 protein:vir:102668 Length: 547 87.4 0.039 2.4E-05 28.3 32.1 495 5-578 1-547 (547)
147 protein:vir:79647 Length: 435 86.3 0.047 2.9E-05 27.9 23.8 392 1-511 4-435 (435)
148 protein:vir:95315 Length: 559 84.1 0.063 3.9E-05 27.1 33.3 515 2-601 1-559 (559)
149 protein:vir:107742 Length: 537 83.9 0.064 4E-05 27.1 20.6 410 1-529 70-537 (537)
150 protein:vir:3843 Length: 397 # 83.8 0.066 4.1E-05 27.1 21.2 365 1-509 3-397 (397)
151 protein:vir:6382 Length: 553 # 81.2 0.088 5.4E-05 26.4 31.2 449 1-531 1-553 (553)
152 protein:vir:79233 Length: 526 79.4 0.1 6.5E-05 26.0 35.1 472 1-601 16-526 (526)
153 protein:vir:3989 Length: 392 # 77.9 0.12 7.4E-05 25.6 27.9 357 16-502 1-392 (392)
154 protein:vir:1023 Length: 392 # 77.9 0.12 7.4E-05 25.6 27.9 357 16-502 1-392 (392)
155 protein:vir:7407 Length: 392 # 74.7 0.16 9.6E-05 25.0 28.5 360 13-502 1-392 (392)
156 protein:vir:104256 Length: 458 72.3 0.18 0.00011 24.6 12.5 151 443-612 1-166 (458)
157 protein:vir:95254 Length: 488 70.3 0.21 0.00013 24.3 22.7 431 1-524 1-488 (488)
158 protein:vir:3139 Length: 599 # 68.8 0.23 0.00014 24.1 24.8 517 1-580 11-599 (599)
159 protein:vir:95542 Length: 548 68.0 0.24 0.00015 23.9 29.3 458 1-551 7-548 (548)
160 protein:vir:80128 Length: 466 66.3 0.27 0.00017 23.7 10.2 111 483-612 1-131 (466)
161 protein:vir:8420 Length: 477 # 63.9 0.31 0.00019 23.4 11.4 104 503-612 1-107 (477)
162 protein:vir:8846 Length: 705 # 59.9 0.38 0.00024 22.9 27.9 542 1-603 59-705 (705)
163 protein:vir:1986 Length: 512 # 56.0 0.47 0.00029 22.4 34.6 464 1-599 16-512 (512)
164 protein:vir:1084 Length: 437 # 55.6 0.48 0.0003 22.3 10.8 161 445-612 1-176 (437)
165 protein:vir:80128 Length: 466 53.9 0.52 0.00032 22.2 9.3 96 503-612 1-99 (466)
166 protein:vir:962 Length: 397 # 50.3 0.61 0.00038 21.7 9.4 122 471-612 1-136 (397)
167 protein:vir:8102 Length: 543 # 46.6 0.73 0.00045 21.3 10.9 150 436-612 1-167 (543)
168 protein:vir:389 Length: 530 # 43.1 0.86 0.00053 21.0 28.5 427 2-523 1-530 (530)
169 protein:vir:962 Length: 397 # 42.1 0.9 0.00056 20.8 8.1 107 503-612 1-115 (397)
170 protein:vir:100135 Length: 418 41.6 0.92 0.00057 20.8 9.6 99 474-612 1-99 (418)
171 protein:vir:8420 Length: 477 # 39.5 1 0.00063 20.5 10.1 126 462-612 1-143 (477)
172 protein:vir:4952 Length: 386 # 38.6 1.1 0.00066 20.4 28.3 363 11-501 1-386 (386)
173 protein:vir:101607 Length: 379 37.6 1.1 0.00069 20.3 6.7 107 503-612 1-121 (379)
174 protein:vir:4854 Length: 386 # 37.5 1.1 0.00069 20.3 26.3 361 11-504 1-386 (386)
175 protein:vir:100135 Length: 418 24.2 2.2 0.0014 18.7 8.0 94 505-612 1-94 (418)
176 protein:vir:1082 Length: 359 # 23.9 2.2 0.0014 18.7 20.0 333 10-477 1-359 (359)
177 protein:vir:3870 Length: 400 # 22.6 2.4 0.0015 18.5 9.3 121 481-612 1-133 (400)

No 1
>protein:vir:97265 Length: 513 # NCBI annotation: hypothetical protein ORF013 #
Family: family:all:584 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294521;genbank:gi:149408242;genbank:GeneID:5237130 Probab=100.00 E-value=1.1e-147 Score=826.19 Aligned_cols=466 Identities=20% Similarity=0.291 Sum_probs=399.6 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceee-c Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVK-N 79 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~-~ 79 (612) -.+||+|.+|+++|++|+|||+|+++||++|++||||+++|+++.|++||+||+|||+|++||++|+|+||+++|+++ + T Consensus 10 ~~~h~~y~a~~~~W~~ird~~~G~~~~r~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n~~~~tl~~l~G~vf~k~p~~~~~ 89 (513) T protein:vir:97 10 ATTSGAYDQMLPRWHVIETLLGGTEAMREAGETYLPRHQEETDKGYQERLASAVLLNMVEQTLDTLSGKPFSEPIKLNED 89 (513) T ss_pred CcCCHHHHHHHHHHHHHHHHhcChHHHHhhcccCCCCCCCCCHHHHHHHHhcccCCChHHHHHHHHhhhhhhcCcccCcC Confidence 788999999999999999999999999999999999999999999999999999999999999999999999999995 4 Q ss_pred CCHHHHH-HHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCc------------chhhhhccCceEEEechhhhhcc Q lcl|NC_019408. 80 LPPKFKD-AVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVD------------NPRKGAVATSFAVGYSAENILDW 146 (612) Q Consensus 80 ~p~~l~~-~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~------------a~~~~~~~rPy~~~~~ae~IinW 146 (612) +|+.+.. |++|||++|+||++||+++++++|.+||||||||||. +++++.+.||||++|+|++|||| T Consensus 90 ~p~~~~~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~~~~T~Ade~~~~~rPy~~~~~~e~IinW 169 (513) T protein:vir:97 90 VPKAIEETILPDVDLQGNNLDVFARQWFREGMAKALCHVLIDMPRPAPREDGQPRTLADDRREGLRPYWVMIKPECLLFA 169 (513) T ss_pred chHHHHHHHhhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCCCCccchhHHhHHHHHhhccCceEEEecHhhhcCc Confidence 7998975 8999999999999999999999999999999999996 36688899999999999999999 Q ss_pred hhhhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeecccc Q lcl|NC_019408. 
147 DEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEI 226 (612) Q Consensus 147 ~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~ 226 (612) + +.+|||+.+|++|||+|++.+ .|+|+.+.+.|||+|.... +++||.... T Consensus 170 ~-~~~v~G~~~L~~v~l~E~~~~------~Dgf~~~~~~q~rvL~~g~--------------------~~v~r~~~~--- 219 (513) T protein:vir:97 170 R-SEVINGVEVLQHVRIIEHYME------QDGFAEVCKRRIRVLEPGL--------------------VQLWEPVKK--- 219 (513) T ss_pred c-eeccCcceeeeeEEEEEEEee------cCCCcceEEEEEEEEeCce--------------------EEEEEeecC--- Confidence 5 789999999999999999875 3679999999999986431 245553211 Q ss_pred ccccccccceeEEEEEeeCCCceecc-eeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHH Q lcl|NC_019408. 227 EWPSGEVKLAYVQYLYEEDPESRPIA-RIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAELEYG 305 (612) Q Consensus 227 ~~~~g~~~~~~~~~~~~~~~~~~~~~-~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~ 305 (612) +.+... .+++..++++|++|||||+++.++++++++|||++||+|||+|||++|||++| T Consensus 220 --------------------~~~~~~e~~~~~~g~~~l~~IP~v~~~~~~~~~~~~~pPLl~LA~ln~~hy~~~Sd~~~i 279 (513) T protein:vir:97 220 --------------------SNAQKEEWALADEWATGLNYVPLVTFYADRQGFMMGKPPLLDLAHLNVAHWQSASDQRHI 279 (513) T ss_pred --------------------CCccccceEEecCCCCcCCceeEEEEecCCCCCCCCccchHHHHHHHHHHHhhhhhHHHH Confidence 111112 24556778999999999999999999999999999999999999999999999 Q ss_pred HHHhccceeeeecCCCCCCceEEEeccccccCCC-CCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHhhhccccchhH Q lcl|NC_019408. 306 RLFTALPVYYAPGTDSEGTGEYHIGPNMVWEVPQ-GSEPGILEYTGQGLKALETALNDKERQIAAIGGRMMPGASKSVSE 384 (612) Q Consensus 306 l~~~~~P~l~i~G~~~~~~~~l~iG~~~~~~lp~-~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~ll~~~~~~~~e 384 (612) ||+++||+||++|++.+|.++|+||++++|.||. 
|++++||||+|++|++++++|++|++||+++||+++..++ +++ T Consensus 280 l~~~~~P~l~~~G~~~~~~~~i~iG~~~~~~lpe~~~~~~yie~~g~~i~~~~~~l~~le~qm~~~Ga~ll~~~~--~~~ 357 (513) T protein:vir:97 280 LTVSRFPILACSGASGEDSDPVVVGPNKVLYNPDPAGRFYYVEHTGQAIAAGRTDLKDLEEQMAGYGAEFLKRKT--GGQ 357 (513) T ss_pred HHhcccceeeeecCCcCCCCceEeeccccccCCCCCCcceeeccCchhHHHHHHHHHHHHHHHHHHHHHhhccCC--ccc Confidence 9999999999999999988899999999999995 8999999999999999999999999999999999997644 579 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCcceEEEeeccccccCCCHHHHHHHHHHHHcCCCCH Q lcl|NC_019408. 385 SNNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLADTENLRYEVNTDFLSTPIGAREMRAIQLMANDGLLPD 464 (612) Q Consensus 385 sa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G~is~ 464 (612) ||+++++++++.+|+|++||.+|++|++++|+|||+|+|.. .++++|+||+||....++++++++|++|+++|.||+ T Consensus 358 Ta~a~~~~~~~~~S~L~~~a~~le~al~~~l~~~a~wlg~~---~~~~~v~in~dF~~~~~~~~~~~al~~a~~~G~is~ 434 (513) T protein:vir:97 358 TATARALDSAEATSDLSAMTGLFEDALAQALDITADWLRLG---PNGGTVELVKDYDLEEMDAPGLQALQVAREKRDISR 434 (513) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC---CCccEEEeccccCcccCCHHHHHHHHHHHhCCCCCH Confidence 99999999999999999999999999999999999999963 457899999999999999999999999999999999 Q ss_pred HHHHHHHHhcCccchhhhhH----HHHHHhhccccc--------cccc-hhHHhh---hhhhHHHHhHHHHHHH------ Q lcl|NC_019408. 465 PVFYEYMRKAEVISSDMTFE----EFQALRADENSF--------INNP-DAQARQ---RGYTNRGQELEQSRMA------ 522 (612) Q Consensus 465 et~~~~lqr~~vl~~~~~~e----ee~~ria~e~~~--------~~~~-~~~~~~---~~e~~r~~~~e~~r~~------ 522 (612) +||+++|+|+|||+++++.+ ++++++.+.... .+.+ +..... ..+...-.+ +=. 
T Consensus 435 ~t~~~~L~r~gvl~~d~d~~~~~e~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~ 510 (513) T protein:vir:97 435 KTYLNGLRLRGVLPEDFDEDEDWEELMEEISEAMGRAGLDLDPAQKNPPEGGEGEGEGEGEGGEGGE----GGEGGGNPG 510 (513) T ss_pred HHHHHHHHhccCCCccCCHHHHHHHHHHhhhhccCCCCccccccCCCCCCCCCCCCCCCCCCCCCCC----ccccCCCCC Confidence 99999999999999877744 444544432111 1111 000000 000000000 000 Q ss_pred HHH Q lcl|NC_019408. 523 REA 525 (612) Q Consensus 523 ~e~ 525 (612) -|. T Consensus 511 ~~~ 513 (513) T protein:vir:97 511 GES 513 (513) T ss_pred CCC Confidence 000 No 2 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=100.00 E-value=1.2e-146 Score=820.58 Aligned_cols=461 Identities=28% Similarity=0.419 Sum_probs=406.3 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCC-----CCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCc Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKG-----ADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDP 75 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~-----e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p 75 (612) =-+||+|.+|+++|++|+|||+|+++||++|++|||+++. |++++|++||+||+|||+|++||++|+|+||+++| T Consensus 36 ~~~hp~y~a~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~~~~~~E~~~~Y~~rl~rA~~~n~~~~tl~~l~G~vfrk~p 115 (535) T protein:vir:80 36 GYQRVEFGEMLPKWRKIMDCLSGQEAIKAKREEYLPMPSVDSRDEEQRRRYETYLQRAIFYNVTARTLDGMMGQVFSRDP 115 (535) T ss_pred CcCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCcccCCcCCHHHHHHHHhhccCCChhHHHHHHHhchhhcCCc Confidence 2499999999999999999999999999999999999874 56788999999999999999999999999999999 Q ss_pred eeecCCHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCc-------chhhhhccCceEEEechhhhhcchh Q lcl|NC_019408. 
76 IVKNLPPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVD-------NPRKGAVATSFAVGYSAENILDWDE 148 (612) Q Consensus 76 ~~~~~p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~-------a~~~~~~~rPy~~~~~ae~IinW~~ 148 (612) ++ ++|+.|++|++|||++|++|++||+++++++|.+|+||||||||. +++++.+.||||++|+|++||||+ T Consensus 116 ~~-~~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~iLVD~P~~~~~~t~ade~~~~~rPy~~~y~ae~IinW~- 193 (535) T protein:vir:80 116 IR-QLPPALEAIVEDIDGEGVSLDQQAKKALGYTMGFGRAAIFTDYPNVGRPVTVLEQKLGLYRPTITLVHPTSIINWR- 193 (535) T ss_pred ce-eccHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccHHHHHhcCCCcEEEEechhhccCcc- Confidence 99 599999999999999999999999999999999999999999996 467888999999999999999995 Q ss_pred hhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeecccccc Q lcl|NC_019408. 149 VVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEW 228 (612) Q Consensus 149 ~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~ 228 (612) +.+|||+.+|++|||+|++.+. .|+|+.+.++|||+|.++. .|.|.+++||....+ T Consensus 194 ~~~v~G~~~Lt~v~lrE~~~~~-----dd~f~~~~~~q~RvL~~~~---------------~G~y~v~~~~~~~~~---- 249 (535) T protein:vir:80 194 TKLVGGKSVISLVVIQENVLAQ-----DDGFETTYVQQWRVLQLNA---------------EGNYQVERWRRETQE---- 249 (535) T ss_pred ccccCCccceeEEEEEEEEEec-----CCCcccceeEEEEEEEecC---------------CceEEEEEEEeecCC---- Confidence 7899999999999999998752 4899999999999998851 234455556532111 Q ss_pred ccccccceeEEEEEeeCCCcee-cceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHH Q lcl|NC_019408. 229 PSGEVKLAYVQYLYEEDPESRP-IARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRL 307 (612) Q Consensus 229 ~~g~~~~~~~~~~~~~~~~~~~-~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~ 307 (612) +.... 
...+.+..+|++|++|||||+|+.++++++++|||++||+|||+|||++|||++||| T Consensus 250 -----------------~~~~~~~~~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd~~~il~ 312 (535) T protein:vir:80 250 -----------------EMYYSYSKHVPTDGNGNPFKEIPFQFIGPLDNNADIDHPPLLDLCEVNIGHYRNSADYEEMAF 312 (535) T ss_pred -----------------ccccccceeecccCCCcccCeeEEEEeecCCCCCCCCccchHHHHHHHHHHhhchhHHHHHHH Confidence 01111 123455678899999999999999999999999999999999999999999999999 Q ss_pred HhccceeeeecCCCCCC------ceEEEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHhhhccccc Q lcl|NC_019408. 308 FTALPVYYAPGTDSEGT------GEYHIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRMMPGASKS 381 (612) Q Consensus 308 ~~~~P~l~i~G~~~~~~------~~l~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~ll~~~~~~ 381 (612) ++++|+||++|++.+|. ..|+||++++|.||+|++++|+|++|+++. +++|++|++||+++|++++...+ T Consensus 313 ~~~~P~l~i~G~~~~~~~~~~~~~~i~iG~~~~~~lP~~~~~~~~e~~~~~~a--~~~l~~~e~qM~~lGa~ll~~~~-- 388 (535) T protein:vir:80 313 VAGQPTAFFTGLTKDWVEDVFKDFKVHLGSRAIIPLPQGATAGILQITPNSVP--FEAMTHKESQMIAMGANLLVKSG-- 388 (535) T ss_pred HhcCceeeeecCchhhhhcCCCCcceEecCcccccCCCCCCcceeeeccchhH--HHHHHHHHHHHHHHHHHhhccCc-- Confidence 99999999999987763 349999999999999999999999999997 68899999999999999997654 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCcceEEEeeccccccCCCHHHHHHHHHHHHcCC Q lcl|NC_019408. 382 VSESNNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLADTENLRYEVNTDFLSTPIGAREMRAIQLMANDGL 461 (612) Q Consensus 382 ~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G~ 461 (612) +++|++++++++++++|+|++||.+|++|+++||+|||+|+|+.. +++++.|+||+||....++++++++|++++++|. 
T Consensus 389 ~~~Ta~~a~~~~~~~~S~L~~~a~~le~al~~aL~~~A~w~G~~~-~~~~~~i~~n~dF~~~~ld~~~~~all~~~~~G~ 467 (535) T protein:vir:80 389 GNRTFGEAQQEEASEQSILSACTKNVSMAFRKALRWANQFQTGIV-NDETVEYNLNTDFPAARLTPNERAELILEWQQGA 467 (535) T ss_pred ccccHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHcCCcc-CCCceEEEeccccccccCCHHHHHHHHHHHhcCC Confidence 578999999999999999999999999999999999999999755 4678999999999999999999999999999999 Q ss_pred CCHHHHHHHHHhcCccchhhhhHHHHHHhhccccccccchh--------------------HHhhhhh Q lcl|NC_019408. 462 LPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFINNPDA--------------------QARQRGY 509 (612) Q Consensus 462 is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~--------------------~~~~~~e 509 (612) ||++||+++|+|+|||++++++++++.++..|....+...- .+.+.+- T Consensus 468 Is~et~~~~L~r~gvl~~~~~~eee~~ri~~E~~~~~~~~g~~~d~~~~g~~~~~~~~~~~~~~~~~~ 535 (535) T protein:vir:80 468 ITFKEMRAGLRRAGVASEDDAKAETEGKATVEFIAKTAAAGKVGDAASGGTNKAKLNNGNGGGNQAGN 535 (535) T ss_pred CCHHHHHHHHHhCCCCCcccchHHHHHHHHhhhhhccccCCCCCCCCCCCCCcCcccCCccccccCCC Confidence 99999999999999999999999999999777322111100 0011111 No 3 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=100.00 E-value=7.2e-146 Score=816.31 Aligned_cols=473 Identities=24% Similarity=0.356 Sum_probs=402.2 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCC-----CCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCc Q lcl|NC_019408. 
1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAM-----KGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDP 75 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~-----~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p 75 (612) =.+||+|.+++++|++|+|||+|+++||++|++|||++ ++|+++.|++||+||+|||+|++||++|+|+||+++| T Consensus 5 ~~~hp~y~~~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~e~~~~e~~~~Y~~rl~rA~~~n~~~~t~~~l~G~vf~k~p 84 (501) T protein:vir:95 5 SFIRPELGKLLPLYYLIRDAIAGEPTVKGARTTYLPMPNAEDQSKENKARYEAYLKRAVFYNVARRTLFGLVGQVFMRDP 84 (501) T ss_pred CCCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCcCCCCCCCcccchHHHHHHhhccccCchHHHHHHHHhhhhhcCCc Confidence 36899999999999999999999999999999999986 4556789999999999999999999999999999999 Q ss_pred eeecCCHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCc---------chhhhhccCceEEEechhhhhcc Q lcl|NC_019408. 76 IVKNLPPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVD---------NPRKGAVATSFAVGYSAENILDW 146 (612) Q Consensus 76 ~~~~~p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~---------a~~~~~~~rPy~~~~~ae~IinW 146 (612) +++ +|+.|+.|++|||++|+||++||+++++++|.+|+||||||||. +++++.+.||||++|+|++|||| T Consensus 85 ~~~-~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~t~a~~~~~~~rPy~~~~~~~~IinW 163 (501) T protein:vir:95 85 VVK-VPALLNPLVANATGSGINLTQLAKRAVSLNLAYSRAGLLVDYPTTEAEGGASIADLEAGRIRPTLYVYSPTEIINW 163 (501) T ss_pred cee-CcHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccccHHHHHhccCCcEEEEecHhhhcCc Confidence 995 99999999999999999999999999999999999999999995 35678899999999999999999 Q ss_pred hhhhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeecccc Q lcl|NC_019408. 147 DEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEI 226 (612) Q Consensus 147 ~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~ 226 (612) + +.+|||+.+|++|||+|++.+ +.++|+.+.++|||+|.++ .+|.+.+++||.....+. 
T Consensus 164 ~-~~~v~g~~~l~~v~l~E~~~~-----~d~~f~~~~~~q~RvL~~~---------------~~g~~~~~v~r~~~~~~~ 222 (501) T protein:vir:95 164 R-TTDRGAEEVLSLVVLFETWCA-----ADDGFEMKTSGQFRVLRLD---------------EEGYYVHEIWREPQPTKA 222 (501) T ss_pred c-eeccCCceeeeEEEEEEEEee-----cCCCcccceeEEEEEEeeC---------------CCceEEEEEEEecCCccc Confidence 5 789999999999999999875 3478999999999999986 245666777776433221 Q ss_pred ccccccccceeEEEEEeeCCCceecceee-eccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHH Q lcl|NC_019408. 227 EWPSGEVKLAYVQYLYEEDPESRPIARIV-PTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAELEYG 305 (612) Q Consensus 227 ~~~~g~~~~~~~~~~~~~~~~~~~~~~~~-p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~ 305 (612) ...+. ..+......++. +..+|++|++|||||+|+.+++++++.|||++||+|||+|||++|||++| T Consensus 223 ~~~~~------------~~~~~~~~~~~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~lA~lni~hy~~ssd~~~~ 290 (501) T protein:vir:95 223 DGSKI------------PKGNYQQYVVYKPTDAQGKRLTEIPFMFIGSENNDSNPDNPNFYDLASLNMAHYRNSADYEES 290 (501) T ss_pred Cccee------------cCCcccccceeeeeccCCCcCCeeeEEEEecCCCCCCCCccchHHHHHHHHHHHhhhhHHHHH Confidence 11100 001111122333 35678999999999999999999999999999999999999999999999 Q ss_pred HHHhccceeeeecCCCCCC-----ceEEEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|NC_019408. 306 RLFTALPVYYAPGTDSEGT-----GEYHIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRMMPGASK 380 (612) Q Consensus 306 l~~~~~P~l~i~G~~~~~~-----~~l~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~ll~~~~~ 380 (612) ||++++|+|||+|+++++. ..|.||++++|.||+||+++||||+|++|. 
+++|++|++||+++||+++..+ T Consensus 291 l~~~~~P~l~i~G~~~~~~~~~~~~~i~~G~~~~~~lP~~~~~~~ie~~~~~i~--~~~l~~l~~~m~~~Ga~ll~~~-- 366 (501) T protein:vir:95 291 CYIVGQPTPVLIGLTEEWVTNVLKGSVNFGSRGGIPLPVGADAKLLQASENTML--KEAMDTKERQMVALGAKLVEQK-- 366 (501) T ss_pred HHHcccceeeeeCCcccccccCCCCceeecccccccCCCCCceeEEecChhhHH--HHHHHHHHHHHHHHHHhhccCC-- Confidence 9999999999999987753 358999999999999999999999999986 8999999999999999999754 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCcceEEEeeccccccCCCHHHHHHHHHHHHcC Q lcl|NC_019408. 381 SVSESNNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLADTENLRYEVNTDFLSTPIGAREMRAIQLMANDG 460 (612) Q Consensus 381 ~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G 460 (612) .+++||+++++++++++|+|+++|.+|++|++++|+|||+|+|+. .++++|+||+||....++++++++|++|+++| T Consensus 367 ~~~~Ta~~~~~~~~~~~S~L~~~a~~le~al~~~l~~~a~w~g~~---~~~~~v~i~~df~~~~~~~~~~~al~~~~~~G 443 (501) T protein:vir:95 367 EVQRTATEAELEAASEGSTLSSATKNVSAAFEWALKWAARWVGQA---DSGVKFELNTDFDIARMTPDERRSLVEEWQKG 443 (501) T ss_pred ccchhHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHcCCC---CCceEEEEecccccccCCHHHHHHHHHHHhCC Confidence 357999999999999999999999999999999999999999973 35689999999999999999999999999999 Q ss_pred CCCHHHHHHHHHhcCccchhhhhHHHHHHhhc-cccccccchhHHh-h-hhhh-HHHH Q lcl|NC_019408. 461 LLPDPVFYEYMRKAEVISSDMTFEEFQALRAD-ENSFINNPDAQAR-Q-RGYT-NRGQ 514 (612) Q Consensus 461 ~is~et~~~~lqr~~vl~~~~~~eee~~ria~-e~~~~~~~~~~~~-~-~~e~-~r~~ 514 (612) .||++||+++|+++||++++.++++++-+.+. +......++..-. . -+.. 
..++ T Consensus 444 ~is~~t~~~~L~~~~v~~~~~~~e~e~i~~~~~~~~~~~~~~~~~~~~~gg~~~~~~~ 501 (501) T protein:vir:95 444 AITFEEMRTGLRKAGVATEDDSKAKEKIAKDTAEAMALATPANVPGDGSGGDNVGNSE 501 (501) T ss_pred CCcHHHHHHHHHhCCCCChhHHHHHHHHHhhhcCcccccccCCCCCCCcccccccCCC Confidence 99999999999999999998887765533322 2111111111100 0 0000 0000 No 4 >protein:vir:95014 Length: 491 # NCBI annotation: structural protein # Family: family:all:584 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224035;genbank:gi:62327322;genbank:GeneID:5176842 Probab=100.00 E-value=1.1e-145 Score=815.24 Aligned_cols=464 Identities=23% Similarity=0.364 Sum_probs=412.3 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCC-CCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeec Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKG-ADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKN 79 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~-e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~ 79 (612) =-+||+|.+|+++|++|+|||+|++. +..++.|||+++. +++++|++||+||+|||+|++||++|+|+||+++|++ + T Consensus 12 ~~~hp~y~a~~~~W~~ird~~~G~~~-~~~r~~yl~~~~~~~~e~~Y~~rl~rA~~~n~~~~tl~~l~G~vfrk~p~~-~ 89 (491) T protein:vir:95 12 KTKHREWLHYAPKWQKVRHALAGDLV-GYLRNVGLNEPDKAYGEARQAEYEAGGIVYNFTRRTLSGMVGSVMRKEPEI-N 89 (491) T ss_pred CccCHHHHHHHHHHHHHHHHhcCcch-hhcccCCCcCCCCCCCHHHHHHHHhcccCCChHHHHHHHHhchhhcCCcee-e Confidence 67899999999999999999999654 4456799999886 5677899999999999999999999999999999999 5 Q ss_pred CCHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCc------chhhhhccCceEEEechhhhhcchhhhccC Q lcl|NC_019408. 80 LPPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVD------NPRKGAVATSFAVGYSAENILDWDEVVDMG 153 (612) Q Consensus 80 ~p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~------a~~~~~~~rPy~~~~~ae~IinW~~~~~v~ 153 (612) +|+.|++|++|||++|+||++|++++++++|.+||||||||||. 
+++++.+.||||++|+|++||||+ +.+|| T Consensus 90 ~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~T~Ade~~~~~rPy~~~~~~~~IinW~-~~~v~ 168 (491) T protein:vir:95 90 IPKELEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETAAATAAEQNAGLLNPTIAFYTTENIVNWR-LTRVG 168 (491) T ss_pred ccHHHHHHHhccCCCCCCHHHHHHHHHHHHHHcCeEEEEEecCCCcccCHHHHHHhcCCcEEEEechhhhcCce-eeeeC Confidence 99999999999999999999999999999999999999999996 467888999999999999999995 78999 Q ss_pred CccceeEEEEEEEeeccccccCCCcccccceeeeeeEeee-cccccccceeecccccccccceeeeeeeecccccccccc Q lcl|NC_019408. 154 GFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALA-SGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGE 232 (612) Q Consensus 154 g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~-~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~ 232 (612) |+.+|++|||+|++.. .++.|+|+.+.+.|||+|.++ +| .+.+++||.... | T Consensus 169 g~~~L~~v~l~E~~~~---~d~~~~f~~~~~~qyRvL~l~~~g----------------~~~~~v~r~~~~-------g- 221 (491) T protein:vir:95 169 SVNRVTMVVLRETWEY---HEPGNEFETKYGEQYRVLDIDTDG----------------NYRQRLFRFDAE-------G- 221 (491) T ss_pred CceeeeEEEEEEeEEe---ecCCCCcccceEEEEEEEeecCCC----------------ceEEEEEEEcCC-------C- Confidence 9999999999998764 467899999999999999986 33 344455553111 0 Q ss_pred ccceeEEEEEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhccc Q lcl|NC_019408. 
233 VKLAYVQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALP 312 (612) Q Consensus 233 ~~~~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P 312 (612) +......+++|..+|++|++|||||+|+.++++++++|||++||+|||+|||++|||++|||++++| T Consensus 222 -------------~~~~~~~~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd~~~~l~~~~~P 288 (491) T protein:vir:95 222 -------------GAQEEVVEIYPDLGESLRGVIPFTFIGATNNDATIDDAPLLPLAELNIGHYRNSADNEESSFVVGQP 288 (491) T ss_pred -------------cceeeeeeeeecCCCcccCeeEEEEEecCCCCCCCCcCchHHHHHHHHHHhhhhhHHHHHHHHcccc Confidence 1111234567778889999999999999999999999999999999999999999999999999999 Q ss_pred eeeeecCCCCCCc--------eEEEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHhhhccccchhH Q lcl|NC_019408. 313 VYYAPGTDSEGTG--------EYHIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRMMPGASKSVSE 384 (612) Q Consensus 313 ~l~i~G~~~~~~~--------~l~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~ll~~~~~~~~e 384 (612) +||++|.+....+ .+++|++++|.||+|++++|+|++|+++ .+++|++|++||+++||+|+..+ .++ T Consensus 289 ~l~~~G~d~~~~~~~~~~~~~~i~~g~~~~~~lP~~~~~~~ie~~~~~~--~~~~l~~~e~qm~~~Ga~l~~~~---~~~ 363 (491) T protein:vir:95 289 TLFIYPGDNLTPQSFKEANPNGIKFGSRCGHNLGYGGSAQLIQAGENNL--ARQNMLDKEQQAIQIGAQLITPS---QQI 363 (491) T ss_pred eeeeecCcccCcchhhccCcceeEecCcCCcCCCCCCccceeecCcchH--HHHHHHHHHHHHHHHHHHhccCC---cch Confidence 9999997653322 3899999999999999999999999987 49999999999999999999753 369 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCcceEEEeeccccccCCCHHHHHHHHHHHHcCCCCH Q lcl|NC_019408. 
385 SNNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLADTENLRYEVNTDFLSTPIGAREMRAIQLMANDGLLPD 464 (612) Q Consensus 385 sa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G~is~ 464 (612) ||++++++++.++|+|+++|.+|++|+++||+|||+|+|++ ++.++.|+||+||...++++++++++++++++|.||+ T Consensus 364 Ta~~~~~~~~~~~S~L~~~a~~~e~al~~~l~~~a~w~G~~--~~~~v~i~~n~dF~~~~~~~~~~~all~~~~~G~is~ 441 (491) T protein:vir:95 364 TAESARIQRGADTSVMATIARNVSQAYTDALRWVAMMLGKP--EDSEVEFQLNMDFFLQPMTAQDRAAWMADINAGLLPA 441 (491) T ss_pred hHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCC--CCCceEEEeecccccccCCHHHHHHHHHHHhcCCCCH Confidence 99999999999999999999999999999999999999975 3568999999999999999999999999999999999 Q ss_pred HHHHHHHHhcCccchhhhhHHHHHHhhccccccccchhHHhhhhhhHHHHhH Q lcl|NC_019408. 465 PVFYEYMRKAEVISSDMTFEEFQALRADENSFINNPDAQARQRGYTNRGQEL 516 (612) Q Consensus 465 et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~ 516 (612) +||+++|+++||++ .++++++++|+++....+...+.+.+-+...++.++ T Consensus 442 ~t~~~~L~~~~vl~--~~~e~~~~~ie~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (491) T protein:vir:95 442 TAYYAALRKAGVTD--WTDEDILNAIEDAPLPSGAVTQVAGEIPQAAQQQQE 491 (491) T ss_pred HHHHHHHHhCCCCC--ccHHHHHHHHHhcCCCCCccccccccchhhhhhccC Confidence 99999999999984 578999999999988888888888776766555543 No 5 >protein:vir:78393 Length: 489 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110831;genbank:gi:134288592;genbank:GeneID:5179656 Probab=100.00 E-value=1.1e-143 Score=804.26 Aligned_cols=463 Identities=23% Similarity=0.365 Sum_probs=400.6 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCC-CHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeec Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGA-DGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKN 79 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e-~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~ 79 (612) =-+||+|.+|+++|++|+|||+|++.++. 
+..|||+++.+ +++.|++||+||+|||+|++||++|+|+||+++|++ + T Consensus 12 ~~~hp~y~a~~~~W~~ird~~~G~~~~~~-r~~yl~~~~~~~~e~~Y~~rl~rA~~~n~~~~tl~~l~G~vfrk~p~~-~ 89 (489) T protein:vir:78 12 KTKHREWLHYAPKWQKVRHALAGELVSYL-RNVGLNEPDKAYGEARQAEYEAGGIVYNFTRRTLSGMVGSVMRKEPEI-N 89 (489) T ss_pred CccCHHHHHHHHHHHHHHHHhcCcccccc-cCCCCCCCCCCCChHHHHHHHhccccCChHHHHHHHHhchhhcCCcce-e Confidence 66899999999999999999999866554 45799998875 466799999999999999999999999999999999 5 Q ss_pred CCHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCc------chhhhhccCceEEEechhhhhcchhhhccC Q lcl|NC_019408. 80 LPPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVD------NPRKGAVATSFAVGYSAENILDWDEVVDMG 153 (612) Q Consensus 80 ~p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~------a~~~~~~~rPy~~~~~ae~IinW~~~~~v~ 153 (612) +|+.|++|++|||++|+||++|++++++++|.+||||||||||. +++++.+.||||++|+|++||||+ +.+|| T Consensus 90 ~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~T~ade~~~~~rPy~~~~~~~~IinW~-~~~v~ 168 (489) T protein:vir:78 90 IPKELEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETGAATAAEQNAGLLNPTIAFYTTENIVNWR-LTRVG 168 (489) T ss_pred ccHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeeCCCCCcCHHHHHHhcCCcEEEEechhhhcCce-eeeeC Confidence 99999999999999999999999999999999999999999996 467889999999999999999995 88999 Q ss_pred CccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccc Q lcl|NC_019408. 154 GFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEV 233 (612) Q Consensus 154 g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~ 233 (612) |+.+|++|||+|++.. .++.|+|+.+.+.|||+|.++. .|.+.+++||.... 
T Consensus 169 G~~~Lt~v~lrE~~~~---~d~~~~f~~~~~~q~RvL~~~~---------------~g~~~~~~~r~~~~---------- 220 (489) T protein:vir:78 169 SVNRVTMVVLRETWEY---NEPGNEFETKYGEQYRVLDIDS---------------DGNYRQRLFRFDAE---------- 220 (489) T ss_pred CccceeEEEEEEeEEe---ecCCCCccceeEEEEEEEecCC---------------CcceEEEEEEeecC---------- Confidence 9999999999998765 3678999999999999999862 13334455553211 Q ss_pred cceeEEEEEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhccce Q lcl|NC_019408. 234 KLAYVQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALPV 313 (612) Q Consensus 234 ~~~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~ 313 (612) ++.......++|..+|++|++|||||+|+.++++++++|||++||+|||+|||++|||++|||++++|+ T Consensus 221 -----------g~~~~~~~~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd~~~~l~~~~~P~ 289 (489) T protein:vir:78 221 -----------GGAQEDVVEIYPDLGESLRGVIPFTFIGATNNDATIDDAPLLPLAELNIGHYRNSADNEESSFVVGQPT 289 (489) T ss_pred -----------CcccceeeEEeccCCCCccCeeeEEEEecCCCCCCCCcCchHHHHHHHHHHhhhhhHHHHHHHHcccce Confidence 111112234567788999999999999999999999999999999999999999999999999999999 Q ss_pred eeeecCCCCCCc--------eEEEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHhhhccccchhHH Q lcl|NC_019408. 314 YYAPGTDSEGTG--------EYHIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRMMPGASKSVSES 385 (612) Q Consensus 314 l~i~G~~~~~~~--------~l~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~ll~~~~~~~~es 385 (612) ||++|+++...+ .+++|++++|.||++++++|+|++|+++. 
+++|++|+++|+++||+|+..+ .++| T Consensus 290 l~i~G~d~~~~~~~~~~~~~~i~~g~~~~~~lp~~~~~~~ie~~~~~~~--r~~l~~le~qm~~lGa~l~~~~---~~~T 364 (489) T protein:vir:78 290 LFIYPGENLTPQAFKEANPNGIKFGSRRGHNLGYGGSAQLIQAGENNLA--RQNMLDKEQQAIQIGAQLITPT---QQIT 364 (489) T ss_pred eeeecCccCCcccccccCccceeeCCcccccCCCCCCcceeccCcchHH--HHHHHHHHHHHHHHhhhhccCC---cchh Confidence 999998654322 37899999999999999999999998874 9999999999999999999643 3799 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCcceEEEeeccccccCCCHHHHHHHHHHHHcCCCCHH Q lcl|NC_019408. 386 NNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLADTENLRYEVNTDFLSTPIGAREMRAIQLMANDGLLPDP 465 (612) Q Consensus 386 a~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G~is~e 465 (612) |++++++++.++|+|+++|.+|++|+++||+|||+|+|+. ++.++.|++|+||+..+++++++++|++++++|.||++ T Consensus 365 a~~~~~~~~~~~S~L~~~a~~~e~al~~~l~~~a~w~G~~--~~~~~~i~~n~dF~~~~~d~~~~~al~~~~~~G~is~~ 442 (489) T protein:vir:78 365 AQSARIQRGADTSVMATIARNVSQAYTDALRWVAVMLGKP--EDTEVEFRLNMDFFLEPMTAQDRAAWMADINAGLLPAT 442 (489) T ss_pred HHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCC--CCCceEEEeecccCcccCCHHHHHHHHHHHhcCCCCHH Confidence 9999999999999999999999999999999999999974 45689999999999999999999999999999999999 Q ss_pred HHHHHHHhcCccchhhhhHHHHHHhhccccccccchhHHhhhhhhHHHHhH Q lcl|NC_019408. 466 VFYEYMRKAEVISSDMTFEEFQALRADENSFINNPDAQARQRGYTNRGQEL 516 (612) Q Consensus 466 t~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~ 516 (612) ||+++|+++||++. ++++++++|+++....+.... ...++..++++. 
T Consensus 443 t~~~~L~~~gv~d~--~~e~~~~ei~~~~~~~~~~~~--g~~~~~~q~~~~ 489 (489) T protein:vir:78 443 AYYAALRKAGVTDW--TDADIKDAVADQPLPVATEVQ--GEIPQSAQQQEK 489 (489) T ss_pred HHHHHHHhCCCCCc--cHHHHHHHHhhcCCCcccCCc--ccCCCCcccccC Confidence 99999999999854 678889999988654433222 111221222211 No 6 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=100.00 E-value=6.5e-142 Score=794.62 Aligned_cols=445 Identities=28% Similarity=0.468 Sum_probs=392.5 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeecC Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKNL 80 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~~ 80 (612) --+||+|.+++++|++|+|||+|+++||++|++||||+++|++++|++||+||+|||+|++||++|+|+||+++|++ ++ T Consensus 4 ~~~hp~y~a~~~~W~~~rd~~~G~~~~r~~g~~YLpk~~~E~~~~Y~~rl~rA~~~n~~~~t~~~~~G~vf~k~p~~-~~ 82 (452) T protein:vir:94 4 ETKHPEYLAYENDWIDCRVASLGQREVKKKGVRFLPKLSGQTDDMYNAYKQRALFYSITSKTLSALSGMVLDQPPVI-TH 82 (452) T ss_pred CCcCHHHHHHHHHHHHHHHHhcChHHHHcCCcccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHHhchhhcCCcee-cc Confidence 67899999999999999999999999999999999999999999999999999999999999999999999999999 58 Q ss_pred CHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhcchhhhccCCccceeE Q lcl|NC_019408. 
81 PPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILDWDEVVDMGGFYVPSR 160 (612) Q Consensus 81 p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW~~~~~v~g~~~Lt~ 160 (612) |+.|.++ |+|++|+||++|++++++++|.+||||||||||.+ ++||||++|+|++||||+ +..+|| |++ T Consensus 83 p~~l~~~--~~D~~G~~L~~~~~~~~~~~l~~G~~~ilVD~p~~-----g~rPy~~~~~~~~Ii~W~-~~~~g~---l~~ 151 (452) T protein:vir:94 83 PDAMSKY--FEDQSGIQFYEVFTRAVEETLLMGRVGVFIDRPLT-----GGDPYISVYTTENILNWE-EDEDGR---LLM 151 (452) T ss_pred cHHHHHH--HhcccCCCHHHHHHHHHHHHHhcCeEEEEEeeccC-----CCceEEEEechhhhcCcc-ccccCC---eeE Confidence 9999998 77999999999999999999999999999999965 679999999999999996 445543 999 Q ss_pred EEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccceeEEE Q lcl|NC_019408. 161 VLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAYVQY 240 (612) Q Consensus 161 v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~~ 240 (612) |+|||+... .++.|+|+.+.+.+||+|.+++|.+. +++||. T Consensus 152 v~lre~~~~---~d~~d~f~~~~~~~yRvL~l~~g~~~----------------v~~~~~-------------------- 192 (452) T protein:vir:94 152 VVLREFYTV---RDTADRYVQNIRVRYRCLELVDGLLQ----------------ITVHET-------------------- 192 (452) T ss_pred EEEEEEEEE---ecCCCcccceeEEEEEEEEEeCCeEE----------------EEEEEc-------------------- Confidence 999998665 46789999999999999999876443 333321 Q ss_pred EEeeCCCcee-cceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhccceeeeecC Q lcl|NC_019408. 241 LYEEDPESRP-IARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALPVYYAPGT 319 (612) Q Consensus 241 ~~~~~~~~~~-~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~ 319 (612) .+++.|. 
+.+++|..+|++|++|||||+++.++++++++|||++||+||++|||++|||++|||++++|+||++|+ T Consensus 193 ---~~~~~~~~~~~~~~~~~~~~l~~IP~v~~~~~~~~~~~~~pPLl~LA~ln~~hy~~~sd~~~~l~~~~~P~l~~~g~ 269 (452) T protein:vir:94 193 ---QDGKVWELAKTSTIQNVGVTMDYIPFFCITPSGLSMTPAKPPMIDIVDINYSHYRTSADLEHGRHFTGLPTPWITGA 269 (452) T ss_pred ---cCCceeeeccceeecCCCcccceeEEEEEcCCCCCCCCCccchHHHHHHHHHHhcchhHHHHHHHHcccceeEeecC Confidence 1112222 234677889999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCCceEEEeccccccCCC-CCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHhhhccccchhHHHHHHHHHHHHHHH Q lcl|NC_019408. 320 DSEGTGEYHIGPNMVWEVPQ-GSEPGILEYTGQGLKALETALNDKERQIAAIGGRMMPGASKSVSESNNQTVLREANEQS 398 (612) Q Consensus 320 ~~~~~~~l~iG~~~~~~lp~-~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~ll~~~~~~~~esa~~~~~~~~~~~s 398 (612) ++. +++.||++++|.||+ |++++||||+|++|++++++|++|+++|+.+|++++..++ ...+|++++.+++++++| T Consensus 270 ~~~--~~i~iG~~~~~~lpe~~~~~~yie~~g~~i~~~~~~l~~le~~m~~~Ga~ll~~~~-~~~~s~ea~~~~~~~~~s 346 (452) T protein:vir:94 270 ESQ--STMHIGSTKAWVIPEVAAKVGFLEFTGQGLQSLEKALSEKQAQLASLSARLIDNST-RGSEATETVKLRYMSETA 346 (452) T ss_pred cCC--CceEecccccccCCCCCCcceEEccCchhHHHHHHHHHHHHHHHHHHHHHhhccCC-CcchHHHHHHHHHHHhhH Confidence 765 468999999999996 9999999999999999999999999999999999998765 356788888999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCcceEEEeeccccccCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCccc Q lcl|NC_019408. 
399 LLLNIIQACESGMTDVVRWWLMWRDVPLADTENLRYEVNTDFLSTPIGAREMRAIQLMANDGLLPDPVFYEYMRKAEVIS 478 (612) Q Consensus 399 ~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G~is~et~~~~lqr~~vl~ 478 (612) +|+++|.+|++|++++|+|||+|+|++ .++.|+||+||....++++++++|++|+++|.||++||+++|+|+||++ T Consensus 347 ~L~~~a~~~e~al~~~l~~~a~w~g~~----~~~~v~~n~dF~~~~~~~~~~~al~~~~~~G~is~~t~~~~L~~~gvl~ 422 (452) T protein:vir:94 347 SLKSVTRAVEALLNKAYSCIMDMESMG----GTLNIKLNSAFLDSKLTAAELKAWVEAYLSGGISKEIYIHALKVGKVLP 422 (452) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHcCCC----CceEEEeccccccccCCHHHHHHHHHHHhcCCCcHHHHHHHHHhCCCCC Confidence 999999999999999999999999973 4689999999999999999999999999999999999999999999998 Q ss_pred hhhhhHHHHHHhhcccccccc-c-hhHHhh Q lcl|NC_019408. 479 SDMTFEEFQALRADENSFINN-P-DAQARQ 506 (612) Q Consensus 479 ~~~~~eee~~ria~e~~~~~~-~-~~~~~~ 506 (612) .+.+.+...+.+..+.+...+ | +.-.+. T Consensus 423 ~~~e~~~i~~E~~~~~~~~~~~~~~~~~~~ 452 (452) T protein:vir:94 423 PPGESMGVIPDPPAPEPSPSNTPPNPSSKA 452 (452) T ss_pred CccCHHHHHHHhhccCcccCCCCCCCccCC Confidence 876666655555544332111 1 111111 No 7 >protein:vir:96783 Length: 488 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224240;genbank:gi:62362375;genbank:GeneID:3345722 Probab=100.00 E-value=1.2e-135 Score=760.24 Aligned_cols=439 Identities=18% Similarity=0.281 Sum_probs=375.1 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCC-----CHHHHHHHH------------hhccCCchHHHHH Q lcl|NC_019408. 
1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGA-----DGDDYAIYL------------QRATFFNMLAQTR 63 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e-----~~~~Y~~rl------------~rA~~~n~~~~tv 63 (612) -.+||+|.+|+|+|++++|| |+.+||.+|++||||++.+ ++..|+.|+ +||+|||+|++|+ T Consensus 17 ~~~hp~y~a~~~~W~~~~d~--g~~~~k~~g~~YLPk~~~~~~~~~~d~~y~~~~~~~~~~y~~~~~~rA~~~n~~~~tl 94 (488) T protein:vir:96 17 PIYHPDYLVNAPQWLRNLDC--VMDNIKRKKQTYLPNLGAIPPEAKTDPKVTALAAKIEKDWEDLTWRLANYVNIVNPTM 94 (488) T ss_pred cccCHHHHHHhhhhhHhhhh--hhHHHHHhhhhcCCCCCCccccccCcchhhhhhccchhhhHhhhhhccccCchhHHHH Confidence 56899999999999999985 6778999999999998754 344444444 3899999999999 Q ss_pred HHhhchhhcCCceeecCC--HHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCc-----chhhhhccCceEE Q lcl|NC_019408. 64 DGMTGMVFRRDPIVKNLP--PKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVD-----NPRKGAVATSFAV 136 (612) Q Consensus 64 ~~~~G~vf~k~p~~~~~p--~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~-----a~~~~~~~rPy~~ 136 (612) ++|+|+||+++|+++ .| +.|+.|++|||++|+||++|++++++++|.+||||||||||. +++++.+.|||++ T Consensus 95 ~~l~G~vfrk~p~~~-~~~~~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~T~ade~~~~~rPy~~ 173 (488) T protein:vir:96 95 NAITGAVMRREPEFD-TMDNPVLIGLRDNIDGKGNGIDQECKQALNALQWGSRCGWLVRSHPESATMADWNKGKKLPTAA 173 (488) T ss_pred HHhcchhhccCceec-cCCcHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCCcCCHHHHHHhcCCcEEE Confidence 999999999999995 44 679999999999999999999999999999999999999996 5788999999999 Q ss_pred EechhhhhcchhhhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeeccccccccccee Q lcl|NC_019408. 137 GYSAENILDWDEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYIT 216 (612) Q Consensus 137 ~~~ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~ 216 (612) +|+|++||||+ +.+|||+.+|++|||+|++.+ .|+|......+++++.+.+|. 
++ T Consensus 174 ~~~a~~IinW~-~~~v~G~~~L~~v~lrE~~~~------~D~~~~~~~~~~~~~~l~~g~------------------~~ 228 (488) T protein:vir:96 174 FYDALHIIDWE-VEYIDGEEKLTYLSLLEDYQE------RDGGTYVSKQRLINHRLVDGL------------------CE 228 (488) T ss_pred EechhhhcCcc-eeccCCceeeEEEEEEEEEEe------ccCCCcccceEEEEEEEECcE------------------EE Confidence 99999999995 889999999999999999865 244444556677777666542 13 Q ss_pred eeeeeeccccccccccccceeEEEEEeeCCCceecceeee-ccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHH Q lcl|NC_019408. 217 VYRELKLEEIEWPSGEVKLAYVQYLYEEDPESRPIARIVP-TVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSH 295 (612) Q Consensus 217 ~~R~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~p-~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~H 295 (612) +||+.. +.. +.+++| ..+|++|++|||||+|+.++++++++|||++||+|||+| T Consensus 229 v~~~~~----------------------~~~---~~e~~~~~~g~~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~H 283 (488) T protein:vir:96 229 FQEVTD----------------------DEY---SDEWTPVLINSKQSDTIPFFLASSQSNEWCIDSTPLTSLAEISLSI 283 (488) T ss_pred EEEEec----------------------CCc---ccceEeecCCCcccCeeEEEEEecCCCCCCCCCCchHHHHHHHHHH Confidence 333211 111 123444 467889999999999999999999999999999999999 Q ss_pred HhhhHHHHHHHHHhccceeeee--cCCCCCCceE-EEeccccccCC---CCCceeEEecCchhHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 296 YRTYAELEYGRLFTALPVYYAP--GTDSEGTGEY-HIGPNMVWEVP---QGSEPGILEYTGQGLKALETALNDKERQIAA 369 (612) Q Consensus 296 Y~~~sD~~~~l~~~~~P~l~i~--G~~~~~~~~l-~iG~~~~~~lp---~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~ 369 (612) ||++|||++|||++++|++++. 
|++.++.+++ .+|.+.++.+| +.|+++|++++++++ .+++|++|++||++ T Consensus 284 y~~ssd~~~il~~~~~p~lv~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~g~~~~~e~~~~~l--~~~~l~~l~~qm~~ 361 (488) T protein:vir:96 284 YVMNAYSNKAMILANEAKWMVDMGDMNKTMASEMNPLGFTLAGRMPYYVKNGDVKVIQAQFSPE--TENKVEKLFEQAVK 361 (488) T ss_pred HhhhhHHHHHHHhcCCceeeeccCCCCcccccccccceeeecccccccccCCceeecCCchhHH--HHHHHHHHHHHHHH Confidence 9999999999999999999985 3444433332 23333334333 356899999999988 48999999999999 Q ss_pred HHHHhhhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCC--CCcceEEEeeccccccCCCH Q lcl|NC_019408. 370 IGGRMMPGASKSVSESNNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLA--DTENLRYEVNTDFLSTPIGA 447 (612) Q Consensus 370 lGa~ll~~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~--~~~~~~v~ln~dF~~~~~d~ 447 (612) +||+++..+ +++||++++++++.++|+|+++|.+|++|+++||+|||+|+|+..+ ++++++|+||+||....+|+ T Consensus 362 ~Ga~l~~~~---~~~Ta~~~~~~~~~~~S~L~~~a~~le~al~~~l~~~A~w~g~~~~~~~~~~~~~~in~dF~~~~ld~ 438 (488) T protein:vir:96 362 VGASLFTQQ---SNETATGAAIRSGSSTASMATLGNNVEDTVRNMLRFIMRYFEGTNLYVNPDELVFKLNRDYFDVEVNP 438 (488) T ss_pred HhHhhccCC---CcchHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCcCccceEEEeccCCCCccCCH Confidence 999999743 4689999999999999999999999999999999999999998653 35689999999999999999 Q ss_pred HHHHHHHHHHHcCCCCHHHHHHHHHhcCccchhhhhHHHHHHhhcccccc Q lcl|NC_019408. 
448 REMRAIQLMANDGLLPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFI 497 (612) Q Consensus 448 ~~~~al~~~~~~G~is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~ 497 (612) +++++|++++++|.||++||+++|+|+|||++++++++++++|++++..+ T Consensus 439 ~~~~al~~~~~~G~Is~~t~~~~L~~~gvl~~d~~~e~~~~~ie~~g~~~ 488 (488) T protein:vir:96 439 QMLQVAYAAMMEGNLPQVSWFELLKRARVVRGDMSKEEFDEHIAELGFGM 488 (488) T ss_pred HHHHHHHHHHhcCCCCHHHHHHHHHhCCcCCccCCHHHHHHHHhhcCCCC Confidence 99999999999999999999999999999999999999999999887666 No 8 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=99.94 E-value=3.6e-27 Score=165.55 Aligned_cols=458 Identities=11% Similarity=-0.023 Sum_probs=253.5 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeecC Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKNL 80 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~~ 80 (612) ||.+- ..+++.++.+-|.|...+..+...+....+......++.- .|.+ .|+++.+|+..+|++|.++|++..- T Consensus 37 ~i~~~----~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~-~ri~-~n~~~~ivd~~~~yl~g~~~~~~~~ 110 (503) T protein:vir:59 37 LIDEH----NPEPLLKGVRYYMCENDIEKKRRTYYDAAGQQLVDDTKTN-NRTS-HAWHKLFVDQKTQYLVGEPVTFTSD 110 (503) T ss_pred HHHhh----cHHHHHHHHHHhccccchhhccchhccccccccccccccc-ceee-cchHHHHHHHHHhhhhcCCeeeccC Confidence 32221 2467788888888877665443333322222222222211 1223 7999999999999999999999533 Q ss_pred CHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhcchhhhccCCccceeE Q lcl|NC_019408. 81 PPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILDWDEVVDMGGFYVPSR 160 (612) Q Consensus 81 p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW~~~~~v~g~~~Lt~ 160 (612) .+....+++... .++++.....+.+.++.+|+++++|..+. ..+|-+..++|.+++-. 
+....++ ..+.- T Consensus 111 d~~~~~~l~~~~--~n~~~~~~~~~~~~~~~~G~~~~~v~~d~------dg~~~i~~~~p~~~~~i-~d~~~~~-~~~~~ 180 (503) T protein:vir:59 111 NKTLLEYVNELA--DDDFDDILNETVKNMSNKGIEYWHPFVDE------EGEFDYVIFPAEEMIVV-YKDNTRR-DILFA 180 (503) T ss_pred cHHHHHHHHHHH--hcCHHHHHHHHHHHHhhCCeEEEEEeecC------CCceEEEEEccceeEEE-EeCCCCC-ceEEE Confidence 456666666654 36899999999999999999999998654 24789999999998763 1111121 11222 Q ss_pred EEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccceeEEE Q lcl|NC_019408. 161 VLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAYVQY 240 (612) Q Consensus 161 v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~~ 240 (612) |++-.. . +.+.-. ..+..+... ..+++..........+... T Consensus 181 ir~~~~--~-----~~~~~~----~~~~evy~~---------------------~~i~~~~~~~~~~~~~~~~------- 221 (503) T protein:vir:59 181 LRYYSY--K-----GIMGEE----TQKAELYTD---------------------THVYYYEKIDGVYQMDYSY------- 221 (503) T ss_pred EEEEEE--e-----cCCCce----EEEEEEEeC---------------------CcEEEEEEcCCcccccccc------- Confidence 222111 0 001000 011111111 0111111000000000000 Q ss_pred EEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhccceeeeecCC Q lcl|NC_019408. 241 LYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALPVYYAPGTD 320 (612) Q Consensus 241 ~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~~ 320 (612) ..............-++++.||||.+.. +.+. 
.+-|.++..|-=..-+..|++.+.+-+.+.|+++++|.+ T Consensus 222 -----~~~~~~~~~~~~~~~~~~~~vPiv~~~n--n~~~--~sd~~~~~~liDa~d~~~s~~~~~~~~~~~~~~v~~g~~ 292 (503) T protein:vir:59 222 -----GENNPRPHMTKGGQAIGWGRVPIIPFKN--NEEM--VSDLKFYKDLIDNYDSITSSTMDSFSDFQQIVYVLKNYD 292 (503) T ss_pred -----cccccccceeecceeccCCccceEEecC--CCCC--CcchhhhHHHHHHHHHHHHHHHHHHHHhcCCeeEeecCC Confidence 0000000011112235799999998854 2222 222333333222333456888888999999999999986 Q ss_pred CCCCce--EEEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHh-hhccccchhHHHHHHHHHHHHHH Q lcl|NC_019408. 321 SEGTGE--YHIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRM-MPGASKSVSESNNQTVLREANEQ 397 (612) Q Consensus 321 ~~~~~~--l~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~l-l~~~~~~~~esa~~~~~~~~~~~ 397 (612) .+.... ..+....++.++.+++++|+..+... +.....++.+.+.|..++.-+ +......++-||++......... T Consensus 293 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~-~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~ 371 (503) T protein:vir:59 293 GENPKEFTANLRYHSVIKVSGDGGVDTLRAEIPV-DSAAKELERIQDELYKSAQAVDNSPETIGGGATGPALENLYALLD 371 (503) T ss_pred ccccchhhhhhhcccceeccCCCcceeEeccCCH-HHHHHHHHHHHHHHHHHhcccCCCcccccccccHHHHHHHHHHHH Confidence 554332 33666778889999999999987644 667888999999888775322 11122345668888888878877 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCcCCC----CcceEEEeeccccccCCCHHHHHHHHHHHHcCCCCHHHHHHHHHh Q lcl|NC_019408. 398 SLLLNIIQACESGMTDVVRWWLMWRDVPLAD----TENLRYEVNTDFLSTPIGAREMRAIQLMANDGLLPDPVFYEYMRK 473 (612) Q Consensus 398 s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~----~~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G~is~et~~~~lqr 473 (612) .........+..++.+++++++.+++..... ..++.|.+++ ..+.+. 
.+.++++.+++++|.||++|++..+ T Consensus 372 ~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~~i~i~f~~-~~p~d~-~~~~~~~~kl~~~GiiS~et~l~~l-- 447 (503) T protein:vir:59 372 LKANMAERKIRAGLRLFFWFFAEYLRNTGKGDFNPDKELTMTFTR-TRIQND-SEIVQSLVQGVTGGIMSKETAVARN-- 447 (503) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccccceeEEeCC-CCCCCH-HHHHHHHHHHHhCCCCchHHHHHhC-- Confidence 7888888889999999999999987753222 1235555543 233332 5678899999999999999998764 Q ss_pred cCccchhhhhHHHHHHhhccccccccchhHH--hhhhhhHHHHhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 474 AEVISSDMTFEEFQALRADENSFINNPDAQA--RQRGYTNRGQELEQSRMAREADFTQQKIDIQERSVA 540 (612) Q Consensus 474 ~~vl~~~~~~eee~~ria~e~~~~~~~~~~~--~~~~e~~r~~~~e~~r~~~e~e~~~q~~e~~~r~~~ 540 (612) +.++ +++++.++|.+|........... ........+++.. +....++ ....+++ T Consensus 448 -~~v~---d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~---~~~g~~~ 503 (503) T protein:vir:59 448 -PFVQ---DPEEELARIEEEMNQYAEMQGNLLDDEGGDDDLEEDDP------NAGAAES---GGAGQVS 503 (503) T ss_pred -CCCC---CHHHHHHHHHHHHHHHHhhhccccCccCCCCCCCcCCC------CCCcccC---CCCCCcC Confidence 2222 34566666655421100000000 0000000000000 0000000 0000000 No 9 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=99.94 E-value=3.8e-27 Score=165.45 Aligned_cols=436 Identities=10% Similarity=0.029 Sum_probs=251.4 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeecC Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKNL 80 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~~ 80 (612) ||.+ +....+++.++.+.|.|...+..+...+..... . 
+..+-..-+-.|+.+.+|+.++|++|.+||++..- T Consensus 52 ~i~~--~~~~~~r~~~l~~YY~g~~~I~~~~~~~~~~~~---~--~~~~~~~ri~~n~~k~Ivd~~~~yl~G~p~~~~~~ 124 (492) T protein:vir:94 52 YIKQ--HLEKLPEISIGQEYYEQRPDIVKEPKPVDATGA---V--DPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHT 124 (492) T ss_pred HHHH--HHHHHHHHHHHHHHhcccccccccccccccccc---c--cccccccccccchHHHHHHHHHhhhcccCceeccC Confidence 5543 456678899999999997554333222221111 1 11111111346999999999999999999999533 Q ss_pred CHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhc-chhhhccCCcccee Q lcl|NC_019408. 81 PPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILD-WDEVVDMGGFYVPS 159 (612) Q Consensus 81 p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~Iin-W~~~~~v~g~~~Lt 159 (612) .+.....++++- +++++.....+.+.++.+|+++++|.... ..+|-+..++|.+++- |+ ..+.+. .+. T Consensus 125 d~~~~~~l~~~~--~n~~~~~~~~~~~~a~~~G~a~~~v~~d~------dg~~~~~~~~p~~~~~v~d--~~~~~~-~~a 193 (492) T protein:vir:94 125 DDEVVKRIDEVL--GNRFDDKLHSVLTGASNKGIEWLHPYLDE------EGEFKLFRVPAEQGIPIWT--DKEHEE-LEA 193 (492) T ss_pred chHHHHHHHHHH--hccHHHHHHHHHHHHhhCCeEEEEEEecC------CCceEEEEEcccceEEEEc--CCCCCc-eEE Confidence 444555555553 36799999999999999999999997643 2478899999999866 42 112222 233 Q ss_pred EEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccceeEE Q lcl|NC_019408. 160 RVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAYVQ 239 (612) Q Consensus 160 ~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~ 239 (612) .|++.. .+. .+ .++.| ... .+..|+...... . 
T Consensus 194 ~ir~~~--~~~-----~~-----~~~~y----~~~-------------------~v~~~~~~~~~~-------------~ 225 (492) T protein:vir:94 194 FIRMYK--LEN-----ET-----KVEYW----DKV-------------------TVNYYVYENGSL-------------I 225 (492) T ss_pred EEEEEe--ecc-----ce-----eEEEE----ecC-------------------eEEEEEEecCee-------------e Confidence 333221 110 00 00000 000 011111100000 0 Q ss_pred EEEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhccceeeeecC Q lcl|NC_019408. 240 YLYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALPVYYAPGT 319 (612) Q Consensus 240 ~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~ 319 (612) .......+.+ .+...-++|+.||||.+.. +.+ +.+-|.++..|-=+.-...|++.+.+.+.++|+++++|. T Consensus 226 ~~~~~~~~~~-----~~~~~~~~~g~vPvv~~~n--n~~--~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~ 296 (492) T protein:vir:94 226 PDYSNNLENS-----KTHFSTGSWGKIPFIPFKN--NDL--EISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKNY 296 (492) T ss_pred eccccccccc-----cccccccCCCccceEEecC--CCC--CCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecC Confidence 0000001111 1112236799999998854 222 233344444443344456789999999999999999998 Q ss_pred CCCCCceE--EEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHh-hhccccchhHHHHHHHHHHHHH Q lcl|NC_019408. 320 DSEGTGEY--HIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRM-MPGASKSVSESNNQTVLREANE 396 (612) Q Consensus 320 ~~~~~~~l--~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~l-l~~~~~~~~esa~~~~~~~~~~ 396 (612) +.+..... .++...++.++.|++++|+..+.+. +.....++.+++.|...+.-. +....-.++.||++........ 
T Consensus 297 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~-~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l 375 (492) T protein:vir:94 297 DDQELPEFKRLLRYYGAIKVSDNGGVDTIQVEVPV-ENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNL 375 (492) T ss_pred CcccchhhHHHHhhccceecCCCCcceeEeccCCH-HHHHHHHHHHHHHHHHHhCCcCCCccccccCchHHHHHHHHHHH Confidence 76543332 2456678889999999999876544 567888899999888875432 1111222456888888888888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCcceEEEeeccccccCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCc Q lcl|NC_019408. 397 QSLLLNIIQACESGMTDVVRWWLMWRDVPLADTENLRYEVNTDFLSTPIGAREMRAIQLMANDGLLPDPVFYEYMRKAEV 476 (612) Q Consensus 397 ~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G~is~et~~~~lqr~~v 476 (612) ..........+..++.+++++++.++|... +..++.|.+++.. +.+ ..+.++++.++ .|.||++|++..+ +. T Consensus 376 ~~k~~~k~~~f~~~l~~~~~li~~~~~~~~-~~~~i~v~f~~~~-p~~-~~e~~~~~~kl--~giiS~et~~~~l---~~ 447 (492) T protein:vir:94 376 NLKADKLARKAKVAIQELLWFVFEHFDIKG-EHKDVDISFNYNK-VAN-TELQVQTAQQS--MGIVSHETVLENH---PF 447 (492) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCc-ccceeeEEecCCC-CCC-HHHHHHHHHHH--hccCchHHHHHhC---CC Confidence 888899999999999999999999999754 3345666554322 222 24556666666 5999999998766 33 Q ss_pred cchhhhhHHHHHHhhccccccccc-hhHHhhhhhhHHHHhHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 477 ISSDMTFEEFQALRADENSFINNP-DAQARQRGYTNRGQELEQSRMAREADFTQQKID 533 (612) Q Consensus 477 l~~~~~~eee~~ria~e~~~~~~~-~~~~~~~~e~~r~~~~e~~r~~~e~e~~~q~~e 533 (612) ++ ++++|.+++.+|....... 
..............+.+ ..++.| T Consensus 448 v~---d~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~e~e 492 (492) T protein:vir:94 448 VE---DLQAELERIEQEQMEYNKQLPNLDDGGADSAQQQERS----------NNKESE 492 (492) T ss_pred CC---CHHHHHHHHHHHHHHHHhhccccccccCCCCccccCC----------ccccCC Confidence 32 3556666665542100000 00000000000000000 000000 No 10 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=99.93 E-value=2.3e-26 Score=161.15 Aligned_cols=445 Identities=11% Similarity=0.034 Sum_probs=250.4 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeecC Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKNL 80 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~~ 80 (612) +|.+.-=....+.+.++.+.|.|...+..+...+. ...+.....++.. .=+-.|+++.+|+.++|++|.+||++..- T Consensus 27 ~i~~~~~~~~~~~~~~~~~yy~g~~~i~~~~~~~~-~~~~~~~~~~~~~--~ki~~~~~~~Ivd~~~~~l~g~p~~~~~~ 103 (479) T protein:vir:79 27 VIEHYILKHRPEKYKQGEEYYYGNTDVNNKRRYYL-LDGAKVDDFTKVN--NKAINNYHKLLVDQKVGYSVGNPIVFNAD 103 (479) T ss_pred HHHHHHhhhhHHHHHHHHHHhccCCcccccccccc-cccccccccccCc--ceeecchHHHHHHHHHhhhhcCCceeccC Confidence 22222112245778888899988765543221111 1111111111111 11338999999999999999999999544 Q ss_pred CHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhcchhhhccCCccceeE Q lcl|NC_019408. 81 PPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILDWDEVVDMGGFYVPSR 160 (612) Q Consensus 81 p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW~~~~~v~g~~~Lt~ 160 (612) .+....+++.... ++++..+..+.+.++.+|+++++|-.+.. .+|-+..++|.+++-. +.. ++...+.. 
T Consensus 104 ~~~~~~~~~~~~~--n~~~~~~~~~~~~~~~~G~~~~~v~~d~~------~~~~i~~~~p~~~~~v-~d~--~~~~~~~~ 172 (479) T protein:vir:79 104 DDNLTKLLNDLLG--EEFDDTITELYLNASNKGVEWLHPYINRK------GEFKYVIIPAEEAIPI-WDS--KRQRELVA 172 (479) T ss_pred CHHHHHHHHHHHh--cCHHHHHHHHHHHHHhcCeEEEEEEeCCC------CceEEEEEccceeEEE-EeC--CCCCceEE Confidence 4567777776654 68999999999999999999999976542 5788999999998653 111 11222332 Q ss_pred -EEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccceeEE Q lcl|NC_019408. 161 -VLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAYVQ 239 (612) Q Consensus 161 -v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~ 239 (612) |+.-.. .. .++-.. .+..+...++ +..|+.....+ .. . T Consensus 173 ~ir~y~~-~~------~~~~~~----~~~e~y~~~~-------------------i~~~~~~~~~~-------~~----~ 211 (479) T protein:vir:79 173 FIRFYYI-ED------IDGNKI----KRVEYYTEND-------------------ITYFIERGNSF-------IQ----E 211 (479) T ss_pred EEEEEEE-ee------cCCceE----EEEEEEeCCc-------------------EEEEEecCCcc-------cc----c Confidence 222111 10 111000 1111111110 01111100000 00 0 Q ss_pred EEEeeCCCceecc--eeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhccceeeee Q lcl|NC_019408. 240 YLYEEDPESRPIA--RIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALPVYYAP 317 (612) Q Consensus 240 ~~~~~~~~~~~~~--~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i~ 317 (612) ............. .......-++|+.||||.+... 
.+ +.+-|.++..|-=+.-...|++.+.+.+.+.|+++++ T Consensus 212 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn--~~--g~sd~~~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~ 287 (479) T protein:vir:79 212 FLYDEYGKMTDIQEGHFRINNKEQGWGKVPFIPFKNN--EK--CVSDLTFYKSLIDIYDNNISTLADNLDEIQEVIYVLK 287 (479) T ss_pred ccccccccccccccccccccccccCCCcccEEEecCC--CC--CCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeee Confidence 0000000000000 0111222367999999988543 22 2233444544433444577889999999999999999 Q ss_pred cCCCCCCceE--EEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHhhhccccchhHHHHHHHHHHHH Q lcl|NC_019408. 318 GTDSEGTGEY--HIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRMMPGASKSVSESNNQTVLREAN 395 (612) Q Consensus 318 G~~~~~~~~l--~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~ll~~~~~~~~esa~~~~~~~~~ 395 (612) |.+....+.. .+..+.++.+++|++++|++.+.+ .+..+..++.+++.|...+.-.-....+.++.||++....... T Consensus 288 g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~l~~~~~-~~~~~~~~~~l~~~i~~~s~~p~~~~~~~gn~Sg~Ai~~~~~~ 366 (479) T protein:vir:79 288 EYPGTSLQEFIDNIRYYKSIKVDGGGGVDKLEINIP-VEAKKELLDRLEKNIIIFGQGVNPESQNTGDKSGVALKFLYSL 366 (479) T ss_pred cCCccccccchhhhhhccceecCCCCcceEEeccCC-HHHHHHHHHHHHHHHHHHhCccccccccccchhHHHHHHHHHH Confidence 9754433221 234556788899999999998874 5778999999999998775322112223456688888888788 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCC---CcceEEEeeccccccCCCHHHHHHHHHHHHcCCCCHHHHHHHHH Q lcl|NC_019408. 396 EQSLLLNIIQACESGMTDVVRWWLMWRDVPLAD---TENLRYEVNTDFLSTPIGAREMRAIQLMANDGLLPDPVFYEYMR 472 (612) Q Consensus 396 ~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~---~~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G~is~et~~~~lq 472 (612) ........-..+..++.+++++++.|++...+. 
..++.|.+++ ..+.+ ..+.++++..+ +|.||.+|++..+ T Consensus 367 l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~i~i~f~~-~~p~~-~~~~a~~~~kl--~g~iS~et~l~~l- 441 (479) T protein:vir:79 367 LDLKCSKTEKKFKKAIRELLWFVCEYLKISGNKSYDYKTVQITFNH-SMIIN-EAEKIDMAAKS--TGIVSDETIVSNH- 441 (479) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccccccceEEeCC-CCCcC-HHHHHHHHHHH--hccCcHHHHHHhC- Confidence 888888888899999999999999998764321 2344555533 22333 24456666665 5999999998765 Q ss_pred hcCccchhhhhHHHHHHhhccccccccchhHHhhhhhhHHHHhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 473 KAEVISSDMTFEEFQALRADENSFINNPDAQARQRGYTNRGQELEQSRMAREADFTQQKIDIQ 535 (612) Q Consensus 473 r~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~e~~r~~~e~e~~~q~~e~~ 535 (612) +.++ ++++|.+++.+|............... +...+ +. T Consensus 442 --~~v~---d~~~E~~ri~~E~~~~~~~~~~~~~~~----~~~~~----------------e~ 479 (479) T protein:vir:79 442 --PWVE---DVNDELERLKKQEDTQKEYDDLIPNNQ----DGVID----------------ET 479 (479) T ss_pred --CCCC---CHHHHHHHHHHHHHHHHHHHhccCccc----CCCcC----------------cC Confidence 3332 345556666554211000000000000 00000 00 No 11 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=99.93 E-value=2e-26 Score=161.49 Aligned_cols=438 Identities=11% Similarity=0.034 Sum_probs=248.0 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeecC Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKNL 80 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~~ 80 (612) ||. .+....+++..+.+.|.|...+-.+...+.... 
....++ -..-+-.|+.+.+|+.++|++|.+||++..- T Consensus 52 ~i~--~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~---~~~~~~--~~~ri~~n~~k~Ivd~~~~yl~g~p~~~~~~ 124 (492) T protein:vir:97 52 YIK--QHLEKLPEISIGQEYYEQRPDIVKEPKPVDATG---AVDPLK--PDDRMITNFHANLVDQKVSYIVGKPIAFKHT 124 (492) T ss_pred HHH--HHHHHHHHHHHHHHHhcccCccccccccccccc---cccccc--cccccccchHHHHHHHHhhhhcccCceeccC Confidence 443 345667888888888888654332211111110 001111 1111236999999999999999999999533 Q ss_pred CHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhcchhhhccCCccceeE Q lcl|NC_019408. 81 PPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILDWDEVVDMGGFYVPSR 160 (612) Q Consensus 81 p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW~~~~~v~g~~~Lt~ 160 (612) .+....+++++- +++++.....+...++.+|+|+++|.... ..+|-+..++|++++-. +...+.+. .+.. T Consensus 125 d~~~~~~l~~~~--~n~~~~~~~~~~~~~~~~G~a~~~v~~d~------dg~~~~~~~~p~~~~~i-~d~~~~~~-~~~~ 194 (492) T protein:vir:97 125 DDEVVKRIDEVL--GNRFDDKLHSVLTGASNKGIEWLHPYLDE------EGEFKLFRVPAEQGIPI-WTDKEHEE-LEAF 194 (492) T ss_pred chHHHHHHHHHH--hccHHHHHHHHHHHHhhcCeEEEEEEecC------CCceEEEEEcccceEEE-EcCCCCCc-eEEE Confidence 455566666663 36899999999999999999999997543 24688999999998763 11112222 2222 Q ss_pred EEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccceeEEE Q lcl|NC_019408. 161 VLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAYVQY 240 (612) Q Consensus 161 v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~~ 240 (612) |++.. .+. . ..++.| .. + .+..|+... +. ... 
T Consensus 195 vr~~~--~~~------~----~~~~~y----~~-----------------~--~v~~~~~~~--------~~-----~~~ 226 (492) T protein:vir:97 195 IRMYK--LEN------E----TKVEYW----DK-----------------V--TVNYYVYEN--------GS-----LIP 226 (492) T ss_pred EEEEe--ecc------c----eeEEEE----ec-----------------C--eEEEEEEec--------Ce-----eee Confidence 22211 100 0 000000 00 0 011111100 00 000 Q ss_pred EEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhccceeeeecCC Q lcl|NC_019408. 241 LYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALPVYYAPGTD 320 (612) Q Consensus 241 ~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~~ 320 (612) .+.... ....+...-++|+.||||.+... .+ +.+-|.++..|-=+.-...|+..+.+.+.++|+++++|.+ T Consensus 227 ~~~~~~-----~~~~~~~~~~~~g~vPvv~~~nn--~~--g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~ 297 (492) T protein:vir:97 227 DYSNNL-----ENSKTHFSTGSWGKIPFIPFKNN--DL--EISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKNYD 297 (492) T ss_pred cccccc-----cccccccccCCCCCcceEEecCC--CC--CCCchHhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCC Confidence 000000 11112223367999999988542 22 2233333333322333467888899999999999999986 Q ss_pred CCCCceE--EEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHh-hhccccchhHHHHHHHHHHHHHH Q lcl|NC_019408. 321 SEGTGEY--HIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRM-MPGASKSVSESNNQTVLREANEQ 397 (612) Q Consensus 321 ~~~~~~l--~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~l-l~~~~~~~~esa~~~~~~~~~~~ 397 (612) .+...+. .++...++.++.|++++|+..+. ..+.....++.+++.|...+.-. +....-.++.||.+......... 
T Consensus 298 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~-~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~ 376 (492) T protein:vir:97 298 DQELPEFKRLLRYYGAIKVSDNGGVDTIQVEV-PVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLN 376 (492) T ss_pred cccchhHHHHHhhccceecCCCCcceeEeccC-CHHHHHHHHHHHHHHHHHHhCCCCCCccccccCcHHHHHHHHHHHHH Confidence 6543332 35667788899999999998765 44677888999999988875422 11112224568888888878888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCcceEEEeeccccccCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCcc Q lcl|NC_019408. 398 SLLLNIIQACESGMTDVVRWWLMWRDVPLADTENLRYEVNTDFLSTPIGAREMRAIQLMANDGLLPDPVFYEYMRKAEVI 477 (612) Q Consensus 398 s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G~is~et~~~~lqr~~vl 477 (612) ......-..+..++.+.+++++.++|... +..++.|.+++ ..+.+ ..+.++++.++ +|.||++|.+..+ +.+ T Consensus 377 ~ka~~~~~~f~~~l~~~~~li~~~~~~~~-~~~~i~v~f~~-~~p~~-~~e~a~~~~kl--~G~iS~et~l~~l---~~v 448 (492) T protein:vir:97 377 LKADKLARKAKVAIQELLWFVFEHFDIKG-EHKDVDISFNY-NKVAN-TELQVQTAQQS--MGIVSHETVLENH---PFV 448 (492) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCc-ccceeeEEecC-CCCCC-HHHHHHHHHHH--hccCchHHHHHhC---CCC Confidence 78888888999999999999999999754 33455565543 22222 24566667766 6999999987765 333 Q ss_pred chhhhhHHHHHHhhccccccccchhHHhhhhhhHHHHhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 478 SSDMTFEEFQALRADENSFINNPDAQARQRGYTNRGQELEQSRMAREADFTQQKIDI 534 (612) Q Consensus 478 ~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~e~~r~~~e~e~~~q~~e~ 534 (612) + ++++|.++|.+|....... .+...........+.+. 
+ ..+.+| T Consensus 449 ~---d~~~Eleri~~E~~~~~~~-~~~~~~~~~~~~~~~~~-----~----~~~~~e 492 (492) T protein:vir:97 449 E---DLQAELERIEQEQTEYNKQ-LPNLDDGGADSAQQQER-----S----NNKESE 492 (492) T ss_pred C---CHHHHHHHHHHHHHHHHHh-hhccccCCCCCCccccc-----c----cccccC Confidence 2 3456667765543100000 00000000000000000 0 000000 No 12 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=99.92 E-value=2.3e-25 Score=155.62 Aligned_cols=438 Identities=10% Similarity=0.042 Sum_probs=245.7 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeecC Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKNL 80 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~~ 80 (612) ||. .+....+++.++.+.|.|...+-.+...|...... ..++ ...-+-.|+.+.+|+.++|++|.+||++..- T Consensus 32 ~i~--~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~---~~~~--~~~ri~~n~~~~ivd~~~~~l~g~~~~~~~~ 104 (472) T protein:vir:93 32 YIK--QHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAV---DPLK--PDDRMITNFHANLVDQKVSYIVGKPIAFKHT 104 (472) T ss_pred HHH--HHHHHHHHHHHHHHHhccccccccccchhhccccc---cccc--cccccccchHHHHHHHHhhhhcccCeeeccC Confidence 343 34566788999999999975443222222211111 1111 1111236999999999999999999999544 Q ss_pred CHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhcchhhhccCCccceeE Q lcl|NC_019408. 81 PPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILDWDEVVDMGGFYVPSR 160 (612) Q Consensus 81 p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW~~~~~v~g~~~Lt~ 160 (612) .+....+++++- +++++.....+.+.++.+|+++++|..... .+|-+..++|.+++-. +.....+. .+.. 
T Consensus 105 d~~~~~~l~~~~--~n~~~~~~~~~~~~~~~~G~~~~~v~~d~d------~~~~i~~~~p~~~~~i-~d~~~~~~-~~~~ 174 (472) T protein:vir:93 105 DDEVVKRIDEVL--GNRFDDKLHSVLTGASNKGIEWLHPYLDEE------GEFKLFRVPAEQGIPI-WTDKEHEE-LEAF 174 (472) T ss_pred ChHHHHHHHHHH--hccHHHHHHHHHHHHhhcCeEEEEEEECCC------CceEEEEEcccceEEE-EcCCCCCc-eEEE Confidence 455566666664 368999999999999999999999976432 4688999999998873 11122222 2222 Q ss_pred EEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccceeEEE Q lcl|NC_019408. 161 VLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAYVQY 240 (612) Q Consensus 161 v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~~ 240 (612) |++... +. .+ .++.| .. .+++++.... + .... T Consensus 175 ir~~~~--~~-----~~-----~~~~~----~~---------------------~~~~~~~~~~------~-----~~~~ 206 (472) T protein:vir:93 175 IRMYKL--EN-----ET-----KVEYW----DK---------------------VTVNYYVYEN------G-----SLIP 206 (472) T ss_pred EEEEEe--ec-----ce-----eEEEE----ec---------------------CeEEEEEEec------C-----eeee Confidence 222211 10 00 00000 00 0111110000 0 0000 Q ss_pred EEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhccceeeeecCC Q lcl|NC_019408. 241 LYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALPVYYAPGTD 320 (612) Q Consensus 241 ~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~~ 320 (612) .+.. ......+....++|+.||||.+... .+.. +-|.++..|-=+.=...|++.+.+.+.++|+++++|.+ T Consensus 207 ~~~~-----~~~~~~~~~~~~~~~~vPvv~~~nn--~~g~--s~~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~~~g~~ 277 (472) T protein:vir:93 207 DYSN-----NLENSKTHFSTGSWGKIPFIPFKNN--DLEI--SDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYD 277 (472) T ss_pred cccc-----cccccccccccCCCCCcceEEecCC--CCCC--CchhhhHHHHHHHHHHHHHHHHHHHHhcCceeEeecCC Confidence 0000 0111122334578999999988542 2222 22222222211111356788889999999999999986 Q ss_pred CCCCceE--EEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHh-hhccccchhHHHHHHHHHHHHHH Q lcl|NC_019408. 
321 SEGTGEY--HIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRM-MPGASKSVSESNNQTVLREANEQ 397 (612) Q Consensus 321 ~~~~~~l--~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~l-l~~~~~~~~esa~~~~~~~~~~~ 397 (612) ....... .++...++.++.|++++|+..+.+ .+.....++.+.+.|..++.-. +......++-||.+......... T Consensus 278 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~ 356 (472) T protein:vir:93 278 DQELPEFKRLLRYYGAIKVSDNGGVDTIQVEVP-VENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLN 356 (472) T ss_pred cccchhhHHHHhhccccccCCCCcceeEeecCC-HHHHHHHHHHHHHHHHHHhCCCCCCccccccCchHHHHHHHHHHHH Confidence 6543322 245667888999999999987654 4667888889988888775432 11112224568888888878888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCcceEEEeeccccccCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCcc Q lcl|NC_019408. 398 SLLLNIIQACESGMTDVVRWWLMWRDVPLADTENLRYEVNTDFLSTPIGAREMRAIQLMANDGLLPDPVFYEYMRKAEVI 477 (612) Q Consensus 398 s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G~is~et~~~~lqr~~vl 477 (612) .........+..++.+++++++.++|... +..++.|.+++. .+.++ .+.++++.++ +|.||++|++..+ +.+ T Consensus 357 ~ka~~~~~~~~~~l~~~~~li~~~~~~~~-~~~~i~v~f~~~-~p~~~-~~~~~~~~k~--~giis~et~l~~l---~~~ 428 (472) T protein:vir:93 357 LKADKLARKAKVAIQELLWFVFEHFDIKG-EHKDVDISFNYN-KVANT-ELQVQTAQQS--MGIVSHETVLENH---PFV 428 (472) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhCCCc-ccceeeEEeCCC-CCCCH-HHHHHHHHHH--hccCchHHHHHhC---CCC Confidence 78888889999999999999999999754 234555555432 22222 4456666665 6999999987765 222 Q ss_pred chhhhhHHHHHHhhccccccccchhHHhhhhhhHHHHhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 478 SSDMTFEEFQALRADENSFINNPDAQARQRGYTNRGQELEQSRMAREADFTQQKIDIQE 536 (612) Q Consensus 478 ~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~e~~r~~~e~e~~~q~~e~~~ 536 (612) . +++++.++|.+|....... 
.............+.+ +...++++ T Consensus 429 ~---d~~~E~~ri~~E~~~~~~~-~~~~~~~~~d~~~~~~-----------~~~~~~~e 472 (472) T protein:vir:93 429 E---DLQAELERIEQEQMEYNKQ-LPNLDDGGADGAQQQE-----------RSNNKESE 472 (472) T ss_pred C---CHHHHHHHHHHHHHHHHHh-ccCcCcccCCCCCCCC-----------CCCcccCC Confidence 2 3455666665542100000 0000000000000000 00000000 No 13 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=99.92 E-value=1e-25 Score=157.59 Aligned_cols=437 Identities=10% Similarity=0.044 Sum_probs=245.2 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeecC Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKNL 80 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~~ 80 (612) ||.+ +....+++.++.+.|.|...+-.+...|.+.... +..+...-+-.|+.+.+|+..+|++|.+||++..- T Consensus 43 ~i~~--~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~-----~~~~~~~ki~~n~~k~Ivd~~~~~l~G~p~~~~~~ 115 (483) T protein:vir:12 43 YIKQ--HLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAV-----DPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHT 115 (483) T ss_pred HHHH--HHHHHHHHHHHHHHhccccccccccccccccccc-----cccccccccccchHHHHHHHHhhhhcccCceeccC Confidence 4443 3455678888889998865443332222221111 11111111236999999999999999999999533 Q ss_pred CHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhc-chhhhccCCcccee Q lcl|NC_019408. 81 PPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILD-WDEVVDMGGFYVPS 159 (612) Q Consensus 81 p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~Iin-W~~~~~v~g~~~Lt 159 (612) .+....+++++- +++++.....+...++.+|+++++|-... ..+|-+..++|.+++- |+ ..+.+. .+. 
T Consensus 116 d~~~~~~l~~~~--~n~~~~~~~~~~~~~~~~G~~y~~v~~d~------d~~~~i~~~~p~~~~~v~d--~~~~~~-~~~ 184 (483) T protein:vir:12 116 DDEVVKRIDEVL--GNRFDDKLHSVLTGASNKGIEWLHPYLDE------EGEFKLFRVPAEQGIPIWT--DKEHEE-LEA 184 (483) T ss_pred ChHHHHHHHHHH--hccHHHHHHHHHHHHhhCCeEEEEEEEcC------CCceEEEEEcccceEEEEc--CCCCCc-eEE Confidence 455555566553 25788899999999999999999997543 2578899999999865 32 122222 233 Q ss_pred EEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccceeEE Q lcl|NC_019408. 160 RVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAYVQ 239 (612) Q Consensus 160 ~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~ 239 (612) .|++... +. .+ .++.| .. .++++.... .+. .. T Consensus 185 ~ir~~~~--~~-----~~-----~~~~y----~~---------------------~~v~~~~~~------~~~-----~~ 216 (483) T protein:vir:12 185 FIRMYKL--EN-----ET-----KVEYW----DK---------------------VTVNYYVYE------NGS-----LI 216 (483) T ss_pred EEEEEEe--ec-----ce-----EEEEE----ec---------------------CeEEEEEEe------CCe-----ee Confidence 3333221 00 00 00100 00 011111000 000 00 Q ss_pred EEEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhccceeeeecC Q lcl|NC_019408. 240 YLYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALPVYYAPGT 319 (612) Q Consensus 240 ~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~ 319 (612) ..+... .....+...-++|+.||||.+... .+. .+-|.++..|-=+.=...|+..+.+.+.++|+++++|. T Consensus 217 ~~~~~~-----~~~~~~~~~~~~~g~vPvv~~~nn--~~g--~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~ 287 (483) T protein:vir:12 217 PDYSNN-----LENSKTHFSTGSWGKIPFIPFKNN--DLE--ISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNY 287 (483) T ss_pred eccccc-----ccccccccccCCCCccceEEecCC--CCC--CCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecC Confidence 000000 011112223467999999988542 221 22233333322122235688888999999999999998 Q ss_pred CCCCCceE--EEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHh-hhccccchhHHHHHHHHHHHHH Q lcl|NC_019408. 
320 DSEGTGEY--HIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRM-MPGASKSVSESNNQTVLREANE 396 (612) Q Consensus 320 ~~~~~~~l--~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~l-l~~~~~~~~esa~~~~~~~~~~ 396 (612) +.+..++. .++...++.++.|++++|+..+.+ .+.....++.+++.|...+.-. +....-.++-||++........ T Consensus 288 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l 366 (483) T protein:vir:12 288 DDQELPEFKRLLRYYGAIKVSDNGGVDTIQVEVP-VENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNL 366 (483) T ss_pred CcccchhHHHhhhhccccccCCCCcceEEeecCC-HHHHHHHHHHHHHHHHHHhCCCCCCccccccCcHHHHHHHHHHHH Confidence 66543322 245567888899999999997654 4667888899999888775322 1111122456888888777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCcceEEEeeccccccCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCc Q lcl|NC_019408. 397 QSLLLNIIQACESGMTDVVRWWLMWRDVPLADTENLRYEVNTDFLSTPIGAREMRAIQLMANDGLLPDPVFYEYMRKAEV 476 (612) Q Consensus 397 ~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G~is~et~~~~lqr~~v 476 (612) .......-..+..++.+.+++++.++|... +..++.|.+++ ..+.+ ..+.++++.++ +|.||++|.+..+ +. T Consensus 367 ~~k~~~~~~~f~~~l~~~~~li~~~~~~~~-~~~~i~v~f~~-~~p~~-~~~~a~~~~kl--~GiiS~et~~~~~---~~ 438 (483) T protein:vir:12 367 NLKADKLARKAKVAIQELLWFVFEHFDIKG-EHKDVDISFNY-NKVAN-TELQVQTAQQS--MGIVSHETVLENH---PF 438 (483) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCC-ccceeeEEeCC-CCCCC-HHHHHHHHHHH--hccCchHHHHHhC---CC Confidence 777888888899999999999999998754 33456665543 23333 24456666666 6999999988765 22 Q ss_pred cchhhhhHHHHHHhhccccccccchhHHhhhhhhHHHHhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 477 ISSDMTFEEFQALRADENSFINNPDAQARQRGYTNRGQELEQSRMAREADFTQQKIDIQE 536 (612) Q Consensus 477 l~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~e~~r~~~e~e~~~q~~e~~~ 536 (612) ++ ++++|.+++.+|........ 
+...........+.++ ...++++ T Consensus 439 v~---d~~~E~~ri~~E~~~~~~~~-~~~~~~~~d~~~~~~~-----------~~~~e~e 483 (483) T protein:vir:12 439 VE---DLQAELERIEQEQMEYNKQL-PNLDDGGADGAQQQER-----------SNNKESE 483 (483) T ss_pred CC---CHHHHHHHHHHHHHHHHhhc-ccccccccCCcccCCC-----------CCcccCC Confidence 22 34556666655421000000 0000000000000000 0000000 No 14 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=99.92 E-value=1.4e-25 Score=156.83 Aligned_cols=414 Identities=10% Similarity=0.026 Sum_probs=247.3 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeecC Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKNL 80 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~~ 80 (612) ||-+ +....+++.++.+.|.|...+. .+.+.... .-..++ -.|+.+.+|+..+|++|.+||++..- T Consensus 9 ~i~~--~~~~~~r~~~l~~yy~g~~~il-------~~~~~~~~-~~~~ki----~~n~~~~ivd~~~~~l~g~~~~~~~~ 74 (429) T protein:vir:98 9 LIQK--HRSFNLSYSAYKQLYEGDHAIL-------QQKQKEQY-KPDNRL----VVNFAKYIVDTFNGYFIGVPVQTSHE 74 (429) T ss_pred HHHH--HHHHHHHHHHHHHHhccccccc-------cccccccC-CCccee----ecchHHHHHHHHhhhhcccCceeecC Confidence 5543 4566789999999999975432 22221111 111122 36999999999999999999999533 Q ss_pred CHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhc-chhhhccCCcccee Q lcl|NC_019408. 81 PPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILD-WDEVVDMGGFYVPS 159 (612) Q Consensus 81 p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~Iin-W~~~~~v~g~~~Lt 159 (612) .+.....++++-.. ++++.++..+.+.++.+|+++++|.... .++|-+..++|.+++- |+ ..+ +...+. 
T Consensus 75 ~~~~~~~l~~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~------~g~~~~~~~~p~~~~~v~d--d~~-~~~~~~ 144 (429) T protein:vir:98 75 NKQVSNYLELLDGY-NDQDDNNAELSKICSIYGHGYELVFNDE------NAEAGITYLTPLEAFIVYD--DSI-RQKPLF 144 (429) T ss_pred ChHHHHHHHHHHhh-cCHhHHHHHHHHHHhhcCeEEEEEEecC------CCcEEEEEEcccceEEEEe--CCC-CCceEE Confidence 45566666666333 7899999999999999999999996543 2578899999998864 32 111 122222 Q ss_pred EEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccceeEE Q lcl|NC_019408. 160 RVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAYVQ 239 (612) Q Consensus 160 ~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~ 239 (612) .|++.. +. + . ..+. .+|. .... T Consensus 145 ~i~~~~---~~------~--~----~~~~---------------------------~~~~----------------~~~~ 166 (429) T protein:vir:98 145 AVRYFY---NK------G--G----VLEG---------------------------SYSD----------------ASNI 166 (429) T ss_pred EEEEEE---ec------C--c----eEEE---------------------------EEEe----------------CceE Confidence 222211 00 0 0 0000 0000 0001 Q ss_pred EEEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhccceeeeecC Q lcl|NC_019408. 240 YLYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALPVYYAPGT 319 (612) Q Consensus 240 ~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~ 319 (612) ..+..+++++...+. ..++|+.||||.+.. +.+ +.+-|.++..|-=+.-...|++.+.+.+.++|+++++|. T Consensus 167 ~~~~~~~~~~~~~~~----~~~~~g~vPvv~~~n--~~~--g~sd~e~v~~liD~~d~~~s~~~~~~~~~~~p~~~i~g~ 238 (429) T protein:vir:98 167 TYFKDGEKGIEIGES----EPHPFDGVPMIEYVE--NEE--RQSLLASVVTLINAFNKAISEKANDVEYFADAYLKILGA 238 (429) T ss_pred EEEEecCCceEeccc----ccccCCccceEEecC--CCC--CCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecC Confidence 111222222222222 236799999998753 223 344466666665567778899999999999999999997 Q ss_pred CCCCCceEEEeccccccCCC----CCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHhhhccccchhHHHHHHHHHHHH Q lcl|NC_019408. 
320 DSEGTGEYHIGPNMVWEVPQ----GSEPGILEYTGQGLKALETALNDKERQIAAIGGRMMPGASKSVSESNNQTVLREAN 395 (612) Q Consensus 320 ~~~~~~~l~iG~~~~~~lp~----~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~ll~~~~~~~~esa~~~~~~~~~ 395 (612) +......-.+....+|.+|. +++++|+..+. ..+.....++.+.+.|.....-+-....+.++-||++....... T Consensus 239 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~gn~Sg~Al~~~~~~ 317 (429) T protein:vir:98 239 ELDDETLKSLRDTRIINLKDTDAQQLTVEFLQKPD-ADATQEHLLDRLENLIFRTAMVANISDESFGTASGIALRYRLQA 317 (429) T ss_pred CCCcchhhhHhhCceeeccCCCCCCcceeEEeecC-CHHHHHHHHHHHHHHHHHHhCccccCccccccchHHHHHHHHHH Confidence 54432222233456676653 35789999876 45677888899998887765322111223346688877777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCC--cceEEEeeccccccCCCHHHHHHHHHHHHcCCCCHHHHHHHHHh Q lcl|NC_019408. 396 EQSLLLNIIQACESGMTDVVRWWLMWRDVPLADT--ENLRYEVNTDFLSTPIGAREMRAIQLMANDGLLPDPVFYEYMRK 473 (612) Q Consensus 396 ~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~--~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G~is~et~~~~lqr 473 (612) ........-..+..++.+++++++.+++...... .++.|.+++ ..+.+ ..+.++++.++ +|.||++|.++.+ T Consensus 318 l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~d~~~i~v~f~~-~~p~~-~~~~a~~~~kl--~g~is~et~~~~l-- 391 (429) T protein:vir:98 318 MDNLAKTKERKFMSGMNRRYKLIASYPTSKIGPKDWIGIKYKFTR-NLPAN-LLEESQIAGNL--AGIVSEETQVGVL-- 391 (429) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccccccceEEeCC-CCCcC-HHHHHHHHHHH--hccCchHHHHHhC-- Confidence 7777777788899999999999999987643322 245555543 23333 24566777766 7999999998766 Q ss_pred cCccchhhhhHHHHHHhhccccccccchhHHhhhhhhHHHHhHHHHHHHHHHHHH Q lcl|NC_019408. 474 AEVISSDMTFEEFQALRADENSFINNPDAQARQRGYTNRGQELEQSRMAREADFT 528 (612) Q Consensus 474 ~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~e~~r~~~e~e~~ 528 (612) +.++ ++++|.+++.+|....- +.+.. .... 
...+.+.+ T Consensus 392 -~~v~---d~~~E~~ri~~E~~~~~--~~~~~---------~~~~--~~~~~~~~ 429 (429) T protein:vir:98 392 -SIVE---NPQKEIERKNSDKSTLI--SRQAG---------GLNG--QNTTTILE 429 (429) T ss_pred -CCCC---CHHHHHHHHHHHHHHHH--HHHHh---------hhcC--CCCCCCCC Confidence 3322 34566666655421100 00000 0000 00000000 No 15 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=99.92 E-value=4.3e-25 Score=154.16 Aligned_cols=452 Identities=10% Similarity=0.001 Sum_probs=240.6 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeecC Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKNL 80 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~~ 80 (612) ||.| +....+++.++.+.|.|...+..+ +......-..|+ + .|+.+.+|+..+|++|.+||++..- T Consensus 24 ~i~~--~~~~~~~~~~l~~Yy~g~~~i~~~--------~~~~~~~~~~ki---~-~n~~~~Iv~~~~~~l~g~p~~~~~~ 89 (499) T protein:vir:10 24 AIRE--LQNRKKRLDKLSDYYNGKQEIEKH--------EFDNATVEAANV---M-VNHAKYITDMNVGFMTGNPVKYVAE 89 (499) T ss_pred HHHH--HHHHHHHHHHHHHHhccccchhcC--------CcCcCCCCccee---e-cchHHHHHHHHhhhhcccCceeecC Confidence 5543 566778899999999997654321 111111111222 2 5999999999999999999998432 Q ss_pred CHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchh-----------hhhccCceEEEechhhhhcchhh Q lcl|NC_019408. 81 PPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPR-----------KGAVATSFAVGYSAENILDWDEV 149 (612) Q Consensus 81 p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~-----------~~~~~rPy~~~~~ae~IinW~~~ 149 (612) ++....-+.++ ...++++.++..+.+.++.+|+++++|-....+. .....++.+..++|.+++-- +. 
T Consensus 90 ~~~~~~~l~~~-~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~g~~~~~~~~~~~~~~~~~~~~~~~v~p~~~~~v-~~ 167 (499) T protein:vir:10 90 KGKNIDDILEV-FNQIDIHKHDIELEKDLSVFGYGYELLYLKKTDPISVRDELGNEKLTPNTELKIEVIDPRATVVV-CD 167 (499) T ss_pred ChhHHHHHHHH-HhhcCHhHHHHHHHHHHHhcCceEEEEEecccccccccccccccccccccceEEEEEcccceEEE-ec Confidence 33333333344 3447899999999999999999999995543211 01112455666666665432 11 Q ss_pred hccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccc Q lcl|NC_019408. 150 VDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWP 229 (612) Q Consensus 150 ~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~ 229 (612) .-.++..+.-|++...... ++ .+.+..+ .+... -++|+ T Consensus 168 -d~~~~~~~~~i~~~~~~~~-------~~--~~~~~~~-~iyt~---------------------~~i~~---------- 205 (499) T protein:vir:10 168 -DTVEHDPLFAVFTQEKKDL-------EG--NTNGYSI-TVYMP---------------------QRIVE---------- 205 (499) T ss_pred -CCCCcceEEEEEEEEEeec-------CC--CceEEEE-EEEeC---------------------CeEEE---------- Confidence 1112222222222221100 00 0000000 00000 01111 Q ss_pred cccccceeEEEEEeeCCCcee-cceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHH Q lcl|NC_019408. 230 SGEVKLAYVQYLYEEDPESRP-IARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLF 308 (612) Q Consensus 230 ~g~~~~~~~~~~~~~~~~~~~-~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~ 308 (612) |...+.+.. ..........++|+.||||.+... .. +.+-|.++..|-=..-...|++.+.+.+ T Consensus 206 ------------~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~--~~--~~~d~e~v~~liD~~~~~~S~~~~~~~~ 269 (499) T protein:vir:10 206 ------------YRTKTTMEVSANDPIVYDGENLFGAVPIIEFRNN--EE--RQGDFEQLISLIDAYNLLQTDRISDKEA 269 (499) T ss_pred ------------EEecCCccccCcceecccccCCCCccceEEecCC--CC--CCCchHhHHHHHHHHHHHHHHHHHHHHH Confidence 111111111 011122333467999999987542 22 1222333333322333466888889999 Q ss_pred hccceeeeecCCCCCCce--EEEecccccc--CCCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHh-hhccccchh Q lcl|NC_019408. 
309 TALPVYYAPGTDSEGTGE--YHIGPNMVWE--VPQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRM-MPGASKSVS 383 (612) Q Consensus 309 ~~~P~l~i~G~~~~~~~~--l~iG~~~~~~--lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~l-l~~~~~~~~ 383 (612) .++|+++++|.+.+.... ..+..+..+. .+.|++++||..+.+ .+..+..++.+.+.|..+..-. +......++ T Consensus 270 ~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~-~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~gn 348 (499) T protein:vir:10 270 FVDALLVTFGFGLGDDKDDIQRLKRGAIEAPPREEGADIEWLTKSFD-ETQVNLLSQSIENDIHKISYVPNMNDEKFMGN 348 (499) T ss_pred hcCceeeeecCccccccchhhhhhhcceeccCCCCCCcceEEeccCC-HHHHHHHHHHHHHHHHHHhCcccCCchhhccc Confidence 999999999975432221 2233334444 457789999998775 3667888999999998875322 111112345 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCC--CcceEEEeeccccccCCCHHHHHHHHHHHHcCC Q lcl|NC_019408. 384 ESNNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLAD--TENLRYEVNTDFLSTPIGAREMRAIQLMANDGL 461 (612) Q Consensus 384 esa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~--~~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G~ 461 (612) -||++..................+..++.+++++++.|++..... ..++.|.+++.... + ..+.++.+.++ +|. T Consensus 349 ~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~d~~~i~i~f~~~~p~-n-~~e~~~~~~kl--~g~ 424 (499) T protein:vir:10 349 VSGEAMKFKLFGLENLLSIKQRYFFDGLRRRLKLIQTIVNIKGANDDASGCKISLVANIPS-N-LSDVVNNVKNA--DGI 424 (499) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccccceEEeCCCCCC-C-HHHHHHHHHHH--hcc Confidence 688888888888888889999999999999999999998754222 22456666554432 2 25566777776 699 Q ss_pred CCHHHHHHHHHhcCccchhhhhHHHHHHhhccccccccchhHH-----hhhhhhHHHHhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 462 LPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFINNPDAQA-----RQRGYTNRGQELEQSRMAREADFTQQKIDIQE 536 (612) Q Consensus 462 is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~-----~~~~e~~r~~~~e~~r~~~e~e~~~q~~e~~~ 536 (612) ||++|++..|- .++ +++++.++|.+|........... 
.........++..+ -...|...+ T Consensus 425 iS~et~~~~l~---~v~---d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~-------- 489 (499) T protein:vir:10 425 IPRKYTYSWLP---DVD---NPQDVIDEMNQQDAETIKKNQEALRGQDPDRLELEDKQDDSS-ENDKEAGSN-------- 489 (499) T ss_pred CChHHHHHhCC---CCC---CHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCCCCcccC-CCCCCCccc-------- Confidence 99999998762 221 34566666655421100000000 00000000000000 000000000 Q ss_pred HHHHHHHHHHH Q lcl|NC_019408. 537 RSVAVQEGHAE 547 (612) Q Consensus 537 r~~~~~~~r~~ 547 (612) .+..-+.|+- T Consensus 490 -~~~~~~~~~~ 499 (499) T protein:vir:10 490 -HNQSHRTRAV 499 (499) T ss_pred -cccCCCCCCC Confidence 0000000000 No 16 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=99.92 E-value=1.7e-24 Score=150.94 Aligned_cols=424 Identities=8% Similarity=-0.017 Sum_probs=234.8 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeecC Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKNL 80 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~~ 80 (612) ||.+= -....++|+++.+.|.|.... ++.+.. ....+..+..+-.-.|+++.+++.++|++|.+||++..- T Consensus 38 ~i~~~-~~~~~~~~~~~~~yY~g~~~~------i~~~~~--~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~ 108 (481) T protein:vir:10 38 FISRH-QTEQVPRLEMLESYYLNRNTD------ILAGER--RLQKYGDKADHRAVHNYAKYVSRFIVGYLTGNPITITHQ 108 (481) T ss_pred HHHHH-HHHHHHHHHHHHHHhcCCCcc------cccCcc--ccccccccccceeecchHHHHHHHHHhhhccCCceEecC Confidence 34321 134567788888888886321 111110 011122222223457999999999999999999998432 Q ss_pred CHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhcchhhhccCCccceeE Q lcl|NC_019408. 
81 PPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILDWDEVVDMGGFYVPSR 160 (612) Q Consensus 81 p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW~~~~~v~g~~~Lt~ 160 (612) .+.....+.++-.. ++++.++..+.+.++.+|+++++|-.+. ..+|-+..++|.+++-. +..... ...+.. T Consensus 109 d~~~~~~l~~~~~~-n~~~~~~~~~~~~~~~~G~~~~~~~~d~------dg~~~i~~~~p~~~~~v-~d~~~~-~~~~~~ 179 (481) T protein:vir:10 109 DNQTNDKIIELNDL-NDADEVNSDLALNLSIYGRAYEIVYRDF------EDRDTFKVLDPKSTFVV-YDQTLD-KKVVAG 179 (481) T ss_pred ChhHHHHHHHHHHh-cChhHHHHHHHHHHHhcCeEEEEEEeCC------CCeEEEEEEcccceEEE-EcCCCC-CceEEE Confidence 33333333333333 6799999999999999999999996543 25788999999998753 111111 112222 Q ss_pred EEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccceeEEE Q lcl|NC_019408. 161 VLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAYVQY 240 (612) Q Consensus 161 v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~~ 240 (612) |+.... . +.+.- .+ .+..+... -.+|+ T Consensus 180 i~~~~~--~-----~~~~~---~~-~~~~~y~~---------------------~~i~~--------------------- 206 (481) T protein:vir:10 180 VRYFEK--Q-----DKDKV---PV-QHVEVYTT---------------------DKIYY--------------------- 206 (481) T ss_pred EEEEEE--e-----eCCCc---eE-EEEEEEec---------------------CeEEE--------------------- Confidence 222110 0 00000 00 00000000 01111 Q ss_pred EEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhccceeeeecCC Q lcl|NC_019408. 241 LYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALPVYYAPGTD 320 (612) Q Consensus 241 ~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~~ 320 (612) +..+++.|...+.. -++|+.||||.+... .+ +.+-|.++..|-=+.-+..|++.+.+.+.+.|+++++|.. 
T Consensus 207 -~~~~~~~~~~~~~~----~~~~g~vPvv~~~n~--~~--g~~~~~~v~~lida~~~~~s~~~~~~~~~~~~~~~~~g~~ 277 (481) T protein:vir:10 207 -IEIKGGTYHRVEEV----EHYYNDVPIIEYLND--QF--KQGDFENVIALIDLYDSAQSDTANYMTDLNDAMLAIIGNV 277 (481) T ss_pred -EEecCCceeecccc----cccCCceeEEEeecC--CC--CCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCc Confidence 22222333222222 257999999987542 22 2233444444433444567889999999999999999863 Q ss_pred CCCCce---------EEEecccccc-CCCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHhh-hccccchhHHHHHH Q lcl|NC_019408. 321 SEGTGE---------YHIGPNMVWE-VPQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRMM-PGASKSVSESNNQT 389 (612) Q Consensus 321 ~~~~~~---------l~iG~~~~~~-lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~ll-~~~~~~~~esa~~~ 389 (612) ....+. +.+..+.... ...+++++|+..+.. .+.....++.+.+.|..++.-.- ......++-||.+. T Consensus 278 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al 356 (481) T protein:vir:10 278 DLDSEDAKAFRDANMIHLEPGTNANGSEGKAEVKYVYKQYD-VAGVEAYKKRLQNDIHKYTNTPDLNDEQFSGVQSGESM 356 (481) T ss_pred CCCccchhhhhhccceeccccccccCCCCCcceeEEeecCC-HHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHH Confidence 322221 1111111111 124578999998764 45677888999998887754221 11122245688877 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCC---cceEEEeeccccccCCCHHHHHHHHHHHHcCCCCHHH Q lcl|NC_019408. 390 VLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLADT---ENLRYEVNTDFLSTPIGAREMRAIQLMANDGLLPDPV 466 (612) Q Consensus 390 ~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~---~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G~is~et 466 (612) .................+..++.+++++++++++...... .++++.+++. 
.+.+ ..+.++++.++ .|.||.+| T Consensus 357 ~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~i~v~f~~~-~~~~-~~~~a~~~~kl--~g~is~et 432 (481) T protein:vir:10 357 KYKLFGLEQVRAIKERLFKKGLMKRYKLLLNNVNLTGLKQHNYAELTITFTPN-LPKS-MMESINAFNAL--SGGVSEST 432 (481) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccccceeeEEeCCC-CCcC-HHHHHHHHHHH--hccCChHH Confidence 7777777778888889999999999999999987654322 2445555432 2222 24456666666 58999999 Q ss_pred HHHHHHhcCccchhhhhHHHHHHhhccccccccchhHHh------hhhhhHHHHh Q lcl|NC_019408. 467 FYEYMRKAEVISSDMTFEEFQALRADENSFINNPDAQAR------QRGYTNRGQE 515 (612) Q Consensus 467 ~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~------~~~e~~r~~~ 515 (612) ++..| +.++ ++.++.++|.+|............ ........+- T Consensus 433 ~~~~l---~~i~---d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~dd~~g 481 (481) T protein:vir:10 433 RLSLL---DFID---NPKEELEKMQEEEAQREKQADKRGYGEAFENHLNVDDSNG 481 (481) T ss_pred HHHhC---CCCC---CHHHHHHHHHHHHHHHHhhhhhccCCccCCCCCCCCCCCC Confidence 98765 3332 345666666554311100000000 0000000000 No 17 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=99.92 E-value=6.3e-25 Score=153.25 Aligned_cols=437 Identities=9% Similarity=-0.050 Sum_probs=236.5 Q ss_pred CC----CcHHH--------HHHHHHHHHHHHHhcChHH-HHhcccccCCCCCCC-CHHHHHHHHhh---ccCCchHHHHH Q lcl|NC_019408. 1 MV----THPEY--------QYWRPEWTKLRDVMAGQRE-IKRKAEAYLPAMKGA-DGDDYAIYLQR---ATFFNMLAQTR 63 (612) Q Consensus 1 ~~----~hP~y--------~~~~~~W~~i~d~~~G~~~-vr~~g~~YLPk~~~e-~~~~Y~~rl~r---A~~~n~~~~tv 63 (612) .| .-|+. ....+++.....-|.|... ........+++.... 
....++....+ =+-.|+.+.+| T Consensus 9 ~~~~~~~~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~iv 88 (474) T protein:vir:10 9 DIEAQGILPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNKLNNSFDSEIV 88 (474) T ss_pred hccccCCCHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccccccchHHHHH Confidence 00 11111 1112223333333333211 111000111111100 00011111111 14589999999 Q ss_pred HHhhchhhcCCceeec-----CCHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEe Q lcl|NC_019408. 64 DGMTGMVFRRDPIVKN-----LPPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGY 138 (612) Q Consensus 64 ~~~~G~vf~k~p~~~~-----~p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~ 138 (612) +..+|++|.+||++.. ..+.+..++.++...+ +++.+...+.+.++.+|+|+++|-... ..+|.+..+ T Consensus 89 d~~~~yl~g~pv~~~~~~~~~~~e~~~~~l~~~~~~n-~~~~~~~~~~~~~~~~G~a~~~~~~d~------~~~~~~~~i 161 (474) T protein:vir:10 89 DTRVGYLHGVPVTYDLDENAEKNEKLKKFITNFAIRN-SVDDEDSEIGKMAAICGYGARLAYIDT------NGDIRIKNI 161 (474) T ss_pred HhHhhheeccceeEeeCCCCcchHHHHHHHHHHHhhc-CHhHHHHHHHHHHhhcCeEEEEEEeCC------CCeeEEEEE Confidence 9999999999999841 2356677788886664 899999999999999999999995432 247899999 Q ss_pred chhhhhcchhhhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeee Q lcl|NC_019408. 139 SAENILDWDEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVY 218 (612) Q Consensus 139 ~ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~ 218 (612) +|.+++-+ +. +....+.-|++...... .+. 
...+..-.+ +..++| T Consensus 162 ~p~~~~~v-~d---~~~~~~~~i~~~~~~~~------~~~-----~~~~~~~~y--------------------~~~~~~ 206 (474) T protein:vir:10 162 DPYNVIFV-GD---NILEPTYSLRYFYEKDD------DNG-----TDYVYAEFY--------------------DNAYYY 206 (474) T ss_pred cccceEEE-Ec---CCCceEEEEEEEEEeeC------CCc-----eEEEEEEEE--------------------cCceEE Confidence 99998764 21 12223433433321110 000 000000000 000111 Q ss_pred eeeeccccccccccccceeEEEEEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhh Q lcl|NC_019408. 219 RELKLEEIEWPSGEVKLAYVQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRT 298 (612) Q Consensus 219 R~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~ 298 (612) ++ ...+.+++.... ..-++|+.||||.+... .. +.+-|.++..|-=+.-.. T Consensus 207 ~~---------------------~~~~~~~~~~~~----~~~~~~g~vPvv~~~n~--~~--g~sd~e~v~~liDa~d~~ 257 (474) T protein:vir:10 207 VF---------------------RGEGIDALQEVG----RYEHLFDYNPLFGVPNN--KE--MIGDAEKVIHLIDAYDLT 257 (474) T ss_pred EE---------------------eecCCCcccccc----cccCCCCccceEEecCC--CC--CCCchHHHHHHHHHHHHH Confidence 10 111111111111 12367999999987532 22 333455555554455567 Q ss_pred hHHHHHHHHHhccceeeeecCCCCCCceEEEec-cccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHh-hh Q lcl|NC_019408. 299 YAELEYGRLFTALPVYYAPGTDSEGTGEYHIGP-NMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRM-MP 376 (612) Q Consensus 299 ~sD~~~~l~~~~~P~l~i~G~~~~~~~~l~iG~-~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~l-l~ 376 (612) .|++.+.+...+.|+++++|.+........+.. +..+..+.+++++|+..+.. .+.....++.+++.|...+.-. +. 
T Consensus 258 ~S~~~~~~~~~~~~~l~i~g~~~~~~~~~~~~~~~~i~~~~~~~~~~~l~~~~~-~~~~~~~~~~l~~~I~~~s~~p~~~ 336 (474) T protein:vir:10 258 MSDASSEISQTRLAYLVLRGMGMSEEMIQETQKSGAFELFDKDMDVKYLTKDVN-DTMIENHLDRIEKNIMRFAKSVNFN 336 (474) T ss_pred HHHHHHHHHHhhcchhhhccCCCCchhhhhhhhcceeEecCCCCceeEEeccCC-HHHHHHHHHHHHHHHHHHhCCcccc Confidence 889999999999999999997544322222222 34445578999999998764 4677889999999998875422 11 Q ss_pred ccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCC-----CcceEEEeeccccccCCCHHHHH Q lcl|NC_019408. 377 GASKSVSESNNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLAD-----TENLRYEVNTDFLSTPIGAREMR 451 (612) Q Consensus 377 ~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~-----~~~~~v~ln~dF~~~~~d~~~~~ 451 (612) .....++.||.+...............-..+..++.+.+++++.+++..... -.++.+.+++.. +.+ ..+.++ T Consensus 337 ~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~-p~d-~~e~a~ 414 (474) T protein:vir:10 337 SDEFNGNVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKGYNLDDDSYLNLIFKFTRNI-PVN-KLEESQ 414 (474) T ss_pred cccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceEEeCCCC-CCC-HHHHHH Confidence 1112346688887777777777777778889999999999999997753221 123445444322 222 244566 Q ss_pred HHHHHHHcCCCCHHHHHHHHHhcCccchhhhhHHHHHHhhccccccccchhHHhhhhhhHHHHhHHHHHHHHHHH Q lcl|NC_019408. 452 AIQLMANDGLLPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFINNPDAQARQRGYTNRGQELEQSRMAREAD 526 (612) Q Consensus 452 al~~~~~~G~is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~e~~r~~~e~e 526 (612) ++..+ .|.||++|++..+ +.++ ++++|.++|.+|......... ........ 
.+..+.+.| T Consensus 415 ~~~kl--~g~iS~et~~~~l---~~v~---d~~~E~eri~~E~~e~~~~~~-~~~~~~~~------~~~~~~~s~ 474 (474) T protein:vir:10 415 VLINL--KGQVSERTRLGQS---QLVD---DVDYELDEMEKESLEFNDKLP-DIDEGDAN------DKSQNNQSE 474 (474) T ss_pred HHHHH--hccCchHHHHHhC---CCCC---CHHHHHHHHHHHHHHHHhhcc-cccCCCcC------CCCccccCC Confidence 66665 5999999998876 2332 456666776554211000000 00000000 000001111 No 18 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=99.92 E-value=6.3e-25 Score=153.25 Aligned_cols=437 Identities=9% Similarity=-0.050 Sum_probs=236.5 Q ss_pred CC----CcHHH--------HHHHHHHHHHHHHhcChHH-HHhcccccCCCCCCC-CHHHHHHHHhh---ccCCchHHHHH Q lcl|NC_019408. 1 MV----THPEY--------QYWRPEWTKLRDVMAGQRE-IKRKAEAYLPAMKGA-DGDDYAIYLQR---ATFFNMLAQTR 63 (612) Q Consensus 1 ~~----~hP~y--------~~~~~~W~~i~d~~~G~~~-vr~~g~~YLPk~~~e-~~~~Y~~rl~r---A~~~n~~~~tv 63 (612) .| .-|+. ....+++.....-|.|... ........+++.... ....++....+ =+-.|+.+.+| T Consensus 9 ~~~~~~~~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~iv 88 (474) T protein:vir:94 9 DIEAQGILPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNKLNNSFDSEIV 88 (474) T ss_pred hccccCCCHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccccccchHHHHH Confidence 00 11111 1112223333333333211 111000111111100 00011111111 14589999999 Q ss_pred HHhhchhhcCCceeec-----CCHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEe Q lcl|NC_019408. 64 DGMTGMVFRRDPIVKN-----LPPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGY 138 (612) Q Consensus 64 ~~~~G~vf~k~p~~~~-----~p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~ 138 (612) +..+|++|.+||++.. ..+.+..++.++...+ +++.+...+.+.++.+|+|+++|-... 
..+|.+..+ T Consensus 89 d~~~~yl~g~pv~~~~~~~~~~~e~~~~~l~~~~~~n-~~~~~~~~~~~~~~~~G~a~~~~~~d~------~~~~~~~~i 161 (474) T protein:vir:94 89 DTRVGYLHGVPVTYDLDENAEKNEKLKKFITNFAIRN-SVDDEDSEIGKMAAICGYGARLAYIDT------NGDIRIKNI 161 (474) T ss_pred HhHhhheeccceeEeeCCCCcchHHHHHHHHHHHhhc-CHhHHHHHHHHHHhhcCeEEEEEEeCC------CCeeEEEEE Confidence 9999999999999841 2356677788886664 899999999999999999999995432 247899999 Q ss_pred chhhhhcchhhhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeee Q lcl|NC_019408. 139 SAENILDWDEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVY 218 (612) Q Consensus 139 ~ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~ 218 (612) +|.+++-+ +. +....+.-|++...... .+. ...+..-.+ +..++| T Consensus 162 ~p~~~~~v-~d---~~~~~~~~i~~~~~~~~------~~~-----~~~~~~~~y--------------------~~~~~~ 206 (474) T protein:vir:94 162 DPYNVIFV-GD---NILEPTYSLRYFYEKDD------DNG-----TDYVYAEFY--------------------DNAYYY 206 (474) T ss_pred cccceEEE-Ec---CCCceEEEEEEEEEeeC------CCc-----eEEEEEEEE--------------------cCceEE Confidence 99998764 21 12223433433321110 000 000000000 000111 Q ss_pred eeeeccccccccccccceeEEEEEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhh Q lcl|NC_019408. 219 RELKLEEIEWPSGEVKLAYVQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRT 298 (612) Q Consensus 219 R~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~ 298 (612) ++ ...+.+++.... ..-++|+.||||.+... .. +.+-|.++..|-=+.-.. T Consensus 207 ~~---------------------~~~~~~~~~~~~----~~~~~~g~vPvv~~~n~--~~--g~sd~e~v~~liDa~d~~ 257 (474) T protein:vir:94 207 VF---------------------RGEGIDALQEVG----RYEHLFDYNPLFGVPNN--KE--MIGDAEKVIHLIDAYDLT 257 (474) T ss_pred EE---------------------eecCCCcccccc----cccCCCCccceEEecCC--CC--CCCchHHHHHHHHHHHHH Confidence 10 111111111111 12367999999987532 22 333455555554455567 Q ss_pred hHHHHHHHHHhccceeeeecCCCCCCceEEEec-cccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHh-hh Q lcl|NC_019408. 
299 YAELEYGRLFTALPVYYAPGTDSEGTGEYHIGP-NMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRM-MP 376 (612) Q Consensus 299 ~sD~~~~l~~~~~P~l~i~G~~~~~~~~l~iG~-~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~l-l~ 376 (612) .|++.+.+...+.|+++++|.+........+.. +..+..+.+++++|+..+.. .+.....++.+++.|...+.-. +. T Consensus 258 ~S~~~~~~~~~~~~~l~i~g~~~~~~~~~~~~~~~~i~~~~~~~~~~~l~~~~~-~~~~~~~~~~l~~~I~~~s~~p~~~ 336 (474) T protein:vir:94 258 MSDASSEISQTRLAYLVLRGMGMSEEMIQETQKSGAFELFDKDMDVKYLTKDVN-DTMIENHLDRIEKNIMRFAKSVNFN 336 (474) T ss_pred HHHHHHHHHHhhcchhhhccCCCCchhhhhhhhcceeEecCCCCceeEEeccCC-HHHHHHHHHHHHHHHHHHhCCcccc Confidence 889999999999999999997544322222222 34445578999999998764 4677889999999998875422 11 Q ss_pred ccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCC-----CcceEEEeeccccccCCCHHHHH Q lcl|NC_019408. 377 GASKSVSESNNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLAD-----TENLRYEVNTDFLSTPIGAREMR 451 (612) Q Consensus 377 ~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~-----~~~~~v~ln~dF~~~~~d~~~~~ 451 (612) .....++.||.+...............-..+..++.+.+++++.+++..... -.++.+.+++.. +.+ ..+.++ T Consensus 337 ~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~-p~d-~~e~a~ 414 (474) T protein:vir:94 337 SDEFNGNVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKGYNLDDDSYLNLIFKFTRNI-PVN-KLEESQ 414 (474) T ss_pred cccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceEEeCCCC-CCC-HHHHHH Confidence 1112346688887777777777777778889999999999999997753221 123445444322 222 244566 Q ss_pred HHHHHHHcCCCCHHHHHHHHHhcCccchhhhhHHHHHHhhccccccccchhHHhhhhhhHHHHhHHHHHHHHHHH Q lcl|NC_019408. 452 AIQLMANDGLLPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFINNPDAQARQRGYTNRGQELEQSRMAREAD 526 (612) Q Consensus 452 al~~~~~~G~is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~e~~r~~~e~e 526 (612) ++..+ .|.||++|++..+ +.++ ++++|.++|.+|......... ........ 
.+..+.+.| T Consensus 415 ~~~kl--~g~iS~et~~~~l---~~v~---d~~~E~eri~~E~~e~~~~~~-~~~~~~~~------~~~~~~~s~ 474 (474) T protein:vir:94 415 VLINL--KGQVSERTRLGQS---QLVD---DVDYELDEMEKESLEFNDKLP-DIDEGDAN------DKSQNNQSE 474 (474) T ss_pred HHHHH--hccCchHHHHHhC---CCCC---CHHHHHHHHHHHHHHHHhhcc-cccCCCcC------CCCccccCC Confidence 66665 5999999998876 2332 456666776554211000000 00000000 000001111 No 19 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=99.91 E-value=1.1e-24 Score=151.93 Aligned_cols=435 Identities=9% Similarity=0.017 Sum_probs=239.5 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeecC Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKNL 80 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~~ 80 (612) ||.+. ....++..++.+.|.|...+..+...+.+..+.+ .. ....|. -.|+.+.+|+.++|++|.+||++..- T Consensus 35 ~i~~~--~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~--~~--~~~~ki-~~n~~k~Ivd~~~~~l~g~p~~~~~~ 107 (474) T protein:vir:94 35 LIDDH--RKQLDKITVGQRYYDKDNDIVKQMKKVDVHGNID--YD--KPDWRI-TTNFHQNLVDQKVSYVASKPVTYSCE 107 (474) T ss_pred HHHHH--HHHHHHHHHHHHHhccccchhcccchhccccccc--cc--cCccee-ecchHHHHHHHHHhhhhcCCceeccC Confidence 44332 3345677778888888755543321221111111 00 011122 26999999999999999999999533 Q ss_pred CHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhcchhhhccCCccceeE Q lcl|NC_019408. 81 PPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILDWDEVVDMGGFYVPSR 160 (612) Q Consensus 81 p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW~~~~~v~g~~~Lt~ 160 (612) .+....+++.+- .++++..+..+.+.++.+|+++++|..+. ..+|.+..++|.+++-. +...+. ..+.. 
T Consensus 108 d~~~~~~l~~~~--~n~~~~~~~e~~~~~~~~G~~~~~~~~d~------~~~~~i~~~~p~~~~~v-~d~~~~--~~~~~ 176 (474) T protein:vir:94 108 DENVLKVIHDVL--DTRWDNKLIDILTATSNKGIDWLQVYINE------NGEMKLFRVPAEQAIPI-WVDKER--EELKS 176 (474) T ss_pred cHHHHHHHHHHH--hccHHHHHHHHHHHHhhcCceEEEEEecC------CCeeEEEEEcccceEEE-EcCCCC--CceEE Confidence 355556666553 36788999999999999999999997643 24789999999998863 111111 22222 Q ss_pred EEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccceeEEE Q lcl|NC_019408. 161 VLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAYVQY 240 (612) Q Consensus 161 v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~~ 240 (612) + ++...... .. +..+...+ .+..|+.... +.... T Consensus 177 ~-ir~~~~~~-------------~~-~~~~yt~~-------------------~~~~y~~~~~-------~~~~~----- 210 (474) T protein:vir:94 177 F-IRYYKFNN-------------EE-KVEFWTDT-------------------TVTYYVLENG-------GLIPD----- 210 (474) T ss_pred E-EEEEEecC-------------eE-EEEEEeCC-------------------eEEEEEEcCC-------ccccc----- Confidence 2 22211110 00 01111110 0111211000 00000 Q ss_pred EEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhccceeeeecCC Q lcl|NC_019408. 241 LYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALPVYYAPGTD 320 (612) Q Consensus 241 ~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~~ 320 (612) ...+ .....+...-++++.||||.+.... . +.+-|.++..|-=..=...|++.+.+-+.+.|+++++|.+ T Consensus 211 -~~~~-----~~~~~~~~~~~~~g~vPvv~~~nn~--~--g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~ 280 (474) T protein:vir:94 211 -YYYG-----ANHVQSHFSNGNWGRVPFIAFKNNP--E--EVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYE 280 (474) T ss_pred -cccC-----cCcccccccccCCCccceEEecCCc--C--CCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCC Confidence 0000 0001111223579999999875432 2 1222222222211222355667777788899999999986 Q ss_pred CCCCceE--EEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHh-hhccccchhHHHHHHHHHHHHHH Q lcl|NC_019408. 
321 SEGTGEY--HIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRM-MPGASKSVSESNNQTVLREANEQ 397 (612) Q Consensus 321 ~~~~~~l--~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~l-l~~~~~~~~esa~~~~~~~~~~~ 397 (612) .+....+ .+....+|.++.|++++|+..+.+ .+..+..++.+.+.|...+.-. +...+..++-||++......... T Consensus 281 ~~~~~~~~~~~~~~~~i~~~~~~~~~~l~~~~~-~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~ 359 (474) T protein:vir:94 281 GEDLEEFMRGLKYYKAINVDGDGGVETIQVEVP-VSSTKEYIDLMRVYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLD 359 (474) T ss_pred cccchhhhhhhhccceeeccCCCceeEEeecCC-HHHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHHHHHHHH Confidence 5433321 234556788899999999997764 4677888999998888775432 11122234568888777777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCcceEEEeeccccccCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCcc Q lcl|NC_019408. 398 SLLLNIIQACESGMTDVVRWWLMWRDVPLADTENLRYEVNTDFLSTPIGAREMRAIQLMANDGLLPDPVFYEYMRKAEVI 477 (612) Q Consensus 398 s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G~is~et~~~~lqr~~vl 477 (612) ......-..+..++.+.+++++.+.|... +..++.|.+++.- +. .+......+.++|.||++|++..+ +.+ T Consensus 360 ~k~~~k~~~~~~~l~~~~~li~~~~~~~~-d~~~i~v~f~~~~-p~----~~~e~a~~~~~~g~iS~et~l~~l---~~v 430 (474) T protein:vir:94 360 LKANKLKNKATVAIQELISFIIDFNNLKT-DVKDIEISFNFNR-MM----NDAEQSQIIAQSQYLSRETLVKSS---PLV 430 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhCCCc-ccceeeEEeccCc-cc----CHHHHHHHHHHcCCCCHHHHHHhC---CCC Confidence 77788888899999999999999998754 2334555543321 11 123333345667999999998765 222 Q ss_pred chhhhhHHHHHHhhcccccccc--chhHHhhhhhhHHHHhHHHHHHHHH Q lcl|NC_019408. 478 SSDMTFEEFQALRADENSFINN--PDAQARQRGYTNRGQELEQSRMARE 524 (612) Q Consensus 478 ~~~~~~eee~~ria~e~~~~~~--~~~~~~~~~e~~r~~~~e~~r~~~e 524 (612) + +++++.++|.+|...... +..........+..++.++.+. 
| T Consensus 431 ~---D~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--e 474 (474) T protein:vir:94 431 D---DYKAELERIEQEQMEYNKQLPNLDDGGADGAQQQEGSNNKES--E 474 (474) T ss_pred C---CHHHHHHHHHHHHHHHHhhccccCCCCCCCcccCCCCccccc--C Confidence 2 345666666554311000 0000000000000011111110 0 No 20 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=99.91 E-value=1.1e-24 Score=151.93 Aligned_cols=435 Identities=9% Similarity=0.017 Sum_probs=239.5 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeecC Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKNL 80 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~~ 80 (612) ||.+. ....++..++.+.|.|...+..+...+.+..+.+ .. ....|. -.|+.+.+|+.++|++|.+||++..- T Consensus 35 ~i~~~--~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~--~~--~~~~ki-~~n~~k~Ivd~~~~~l~g~p~~~~~~ 107 (474) T protein:vir:97 35 LIDDH--RKQLDKITVGQRYYDKDNDIVKQMKKVDVHGNID--YD--KPDWRI-TTNFHQNLVDQKVSYVASKPVTYSCE 107 (474) T ss_pred HHHHH--HHHHHHHHHHHHHhccccchhcccchhccccccc--cc--cCccee-ecchHHHHHHHHHhhhhcCCceeccC Confidence 44332 3345677778888888755543321221111111 00 011122 26999999999999999999999533 Q ss_pred CHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhcchhhhccCCccceeE Q lcl|NC_019408. 81 PPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILDWDEVVDMGGFYVPSR 160 (612) Q Consensus 81 p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW~~~~~v~g~~~Lt~ 160 (612) .+....+++.+- .++++..+..+.+.++.+|+++++|..+. ..+|.+..++|.+++-. +...+. ..+.. 
T Consensus 108 d~~~~~~l~~~~--~n~~~~~~~e~~~~~~~~G~~~~~~~~d~------~~~~~i~~~~p~~~~~v-~d~~~~--~~~~~ 176 (474) T protein:vir:97 108 DENVLKVIHDVL--DTRWDNKLIDILTATSNKGIDWLQVYINE------NGEMKLFRVPAEQAIPI-WVDKER--EELKS 176 (474) T ss_pred cHHHHHHHHHHH--hccHHHHHHHHHHHHhhcCceEEEEEecC------CCeeEEEEEcccceEEE-EcCCCC--CceEE Confidence 355556666553 36788999999999999999999997643 24789999999998863 111111 22222 Q ss_pred EEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccceeEEE Q lcl|NC_019408. 161 VLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAYVQY 240 (612) Q Consensus 161 v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~~ 240 (612) + ++...... .. +..+...+ .+..|+.... +.... T Consensus 177 ~-ir~~~~~~-------------~~-~~~~yt~~-------------------~~~~y~~~~~-------~~~~~----- 210 (474) T protein:vir:97 177 F-IRYYKFNN-------------EE-KVEFWTDT-------------------TVTYYVLENG-------GLIPD----- 210 (474) T ss_pred E-EEEEEecC-------------eE-EEEEEeCC-------------------eEEEEEEcCC-------ccccc----- Confidence 2 22211110 00 01111110 0111211000 00000 Q ss_pred EEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhccceeeeecCC Q lcl|NC_019408. 241 LYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALPVYYAPGTD 320 (612) Q Consensus 241 ~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~~ 320 (612) ...+ .....+...-++++.||||.+.... . +.+-|.++..|-=..=...|++.+.+-+.+.|+++++|.+ T Consensus 211 -~~~~-----~~~~~~~~~~~~~g~vPvv~~~nn~--~--g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~ 280 (474) T protein:vir:97 211 -YYYG-----ANHVQSHFSNGNWGRVPFIAFKNNP--E--EVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYE 280 (474) T ss_pred -cccC-----cCcccccccccCCCccceEEecCCc--C--CCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCC Confidence 0000 0001111223579999999875432 2 1222222222211222355667777788899999999986 Q ss_pred CCCCceE--EEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHh-hhccccchhHHHHHHHHHHHHHH Q lcl|NC_019408. 
321 SEGTGEY--HIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRM-MPGASKSVSESNNQTVLREANEQ 397 (612) Q Consensus 321 ~~~~~~l--~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~l-l~~~~~~~~esa~~~~~~~~~~~ 397 (612) .+....+ .+....+|.++.|++++|+..+.+ .+..+..++.+.+.|...+.-. +...+..++-||++......... T Consensus 281 ~~~~~~~~~~~~~~~~i~~~~~~~~~~l~~~~~-~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~ 359 (474) T protein:vir:97 281 GEDLEEFMRGLKYYKAINVDGDGGVETIQVEVP-VSSTKEYIDLMRVYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLD 359 (474) T ss_pred cccchhhhhhhhccceeeccCCCceeEEeecCC-HHHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHHHHHHHH Confidence 5433321 234556788899999999997764 4677888999998888775432 11122234568888777777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCcceEEEeeccccccCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCcc Q lcl|NC_019408. 398 SLLLNIIQACESGMTDVVRWWLMWRDVPLADTENLRYEVNTDFLSTPIGAREMRAIQLMANDGLLPDPVFYEYMRKAEVI 477 (612) Q Consensus 398 s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G~is~et~~~~lqr~~vl 477 (612) ......-..+..++.+.+++++.+.|... +..++.|.+++.- +. .+......+.++|.||++|++..+ +.+ T Consensus 360 ~k~~~k~~~~~~~l~~~~~li~~~~~~~~-d~~~i~v~f~~~~-p~----~~~e~a~~~~~~g~iS~et~l~~l---~~v 430 (474) T protein:vir:97 360 LKANKLKNKATVAIQELISFIIDFNNLKT-DVKDIEISFNFNR-MM----NDAEQSQIIAQSQYLSRETLVKSS---PLV 430 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhCCCc-ccceeeEEeccCc-cc----CHHHHHHHHHHcCCCCHHHHHHhC---CCC Confidence 77788888899999999999999998754 2334555543321 11 123333345667999999998765 222 Q ss_pred chhhhhHHHHHHhhcccccccc--chhHHhhhhhhHHHHhHHHHHHHHH Q lcl|NC_019408. 478 SSDMTFEEFQALRADENSFINN--PDAQARQRGYTNRGQELEQSRMARE 524 (612) Q Consensus 478 ~~~~~~eee~~ria~e~~~~~~--~~~~~~~~~e~~r~~~~e~~r~~~e 524 (612) + +++++.++|.+|...... +..........+..++.++.+. 
| T Consensus 431 ~---D~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--e 474 (474) T protein:vir:97 431 D---DYKAELERIEQEQMEYNKQLPNLDDGGADGAQQQEGSNNKES--E 474 (474) T ss_pred C---CHHHHHHHHHHHHHHHHhhccccCCCCCCCcccCCCCccccc--C Confidence 2 345666666554311000 0000000000000011111110 0 No 21 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=99.91 E-value=1e-24 Score=152.06 Aligned_cols=421 Identities=11% Similarity=0.015 Sum_probs=239.4 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeecC Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKNL 80 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~~ 80 (612) ||.+ +....+++..+.+-|.|...+..+ |.. ...... . |. -.|+.+.+|+.++|++|.+||++..- T Consensus 25 ~i~~--~~~~~~r~~~~~~yy~g~~~i~~~-----~~~-~~~~~~--~---ki-~~n~~~~ivd~~~~~l~g~~~~~~~~ 90 (453) T protein:vir:39 25 FMEK--HRLEVARYEYLKNMYRGIMAIDAE-----PTK-DLWKPD--N---RL-TVNFTKYIVDTFTGYFNGIPVKKSHS 90 (453) T ss_pred HHHH--HHHHHHHHHHHHHHhhccCchhcC-----CCc-cccCcc--c---ee-ecchHHHHHHHHhhhhcccCceeccC Confidence 5554 345668889999999997655332 111 111111 1 22 35999999999999999999998533 Q ss_pred CHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhcchhhhccCCccceeE Q lcl|NC_019408. 81 PPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILDWDEVVDMGGFYVPSR 160 (612) Q Consensus 81 p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW~~~~~v~g~~~Lt~ 160 (612) ++.....+.++-.. ++++..+..+.+.++.+|+++++|-... ..+|-+..++|.+++-+ +.... 
++..+.- T Consensus 91 d~~~~~~l~~i~~~-N~~~~~~~~~~~~~~~~G~~~~~v~~d~------~g~~~i~~~~p~~~~~v-~d~~~-~~~~~~~ 161 (453) T protein:vir:39 91 DKETLSKLQEFDNL-NDMEDEESELAKMACIYGRAFELLYQNE------ETQTNVIYNTPENMFMV-YDDTI-KQEPLFA 161 (453) T ss_pred ChHHHHHHHHHHHh-cChhHHHHHHHHHHhhcCeEEEEEEecC------CCceEEEEEcccceEEE-ecCCC-CCeEEEE Confidence 34444455555444 6999999999999999999999996543 24688999999998775 22111 2222222 Q ss_pred EEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccceeEEE Q lcl|NC_019408. 161 VLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAYVQY 240 (612) Q Consensus 161 v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~~ 240 (612) |++. ... + ...+..+... .++|+ T Consensus 162 ir~~---~~~------~------~~~~~~~yt~---------------------~~i~~--------------------- 184 (453) T protein:vir:39 162 VRYG---YDD------D------YKLYGEVYTK---------------------ETTYA--------------------- 184 (453) T ss_pred EEEE---EeC------C------eEEEEEEEeC---------------------CeEEE--------------------- Confidence 2221 100 0 0011111100 01111 Q ss_pred EEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhccceeeeecCC Q lcl|NC_019408. 241 LYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALPVYYAPGTD 320 (612) Q Consensus 241 ~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~~ 320 (612) +..+++.+...+. ..++|+.||||.+... .+ +.+-|.++..|-=+.-...|++.+.+.+.++|+++++|.+ T Consensus 185 -~~~~~~~~~~~~~----~~~~~g~vPvv~~~n~--~~--g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~p~~~~~g~~ 255 (453) T protein:vir:39 185 -LNGTMGFYNMTEQ----APNPFDDLPVVEFYFN--EE--RMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLGAA 255 (453) T ss_pred -EEecCCceeeecc----cccCCCceeEEEecCC--CC--CCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCC Confidence 1122222222222 1367999999988542 22 2233444444433445677888999999999999999965 Q ss_pred CCCCceE------EEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHhhhccccchhHHHHHHHHHHH Q lcl|NC_019408. 
321 SEGTGEY------HIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRMMPGASKSVSESNNQTVLREA 394 (612) Q Consensus 321 ~~~~~~l------~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~ll~~~~~~~~esa~~~~~~~~ 394 (612) .++.... .+........+++++++|+..+.+ .+.....++.+.+.|..+..-+-....+.++-||.+...... T Consensus 256 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~~~-~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Al~~~~~ 334 (453) T protein:vir:39 256 VEEEDLKNIRSNRVINYYGESSEAKNVDVKFLEKPDS-DSQTENLLDRLTKLIFQTTMVANISDESFGSSSGVSLAYKLQ 334 (453) T ss_pred CCchhhhhhhhcceeeecCCCCCCCCCceeEEeecCC-HHHHHHHHHHHHHHHHHHhCCcccccccccCChHHHHHHHHH Confidence 4432211 112222223346788999997754 466788889999988776432211222234567777777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCC--CcceEEEeeccccccCCCHHHHHHHHHHHHcCCCCHHHHHHHHH Q lcl|NC_019408. 395 NEQSLLLNIIQACESGMTDVVRWWLMWRDVPLAD--TENLRYEVNTDFLSTPIGAREMRAIQLMANDGLLPDPVFYEYMR 472 (612) Q Consensus 395 ~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~--~~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G~is~et~~~~lq 472 (612) .........-..+..++.+++++++.+++..... ..++.|.+++.. +.++ .+.++++..+ +|.||++|++..+ T Consensus 335 ~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~i~v~f~~~~-p~~~-~~~a~~~~kl--~g~is~et~l~~l- 409 (453) T protein:vir:39 335 AMSNLALSFQRKFQSSLNSRYKLYCELSTNVSNKEAWKDIEYTFTRNE-PKDI-KEQAETANIL--MGITSQETALSVI- 409 (453) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccccceEEeCCCC-CcCH-HHHHHHHHHH--hccCChHHHHHhC- Confidence 7677777778888999999999999988753222 234556665432 2222 4456666655 7899999998766 Q ss_pred hcCccchhhhhHHHHHHhhccccccccchhHHhhhhhhHHHHhHHHHHHHHH Q lcl|NC_019408. 473 KAEVISSDMTFEEFQALRADENSFINNPDAQARQRGYTNRGQELEQSRMARE 524 (612) Q Consensus 473 r~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~e~~r~~~e 524 (612) +.++ +++++.+++.+|...............+.. 
+.+..+ ..+| T Consensus 410 --~~v~---D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~-~~~~~~--~~~e 453 (453) T protein:vir:39 410 --SVIP---DVQAEMEKIKKEEASTAIFDKDKQPSEKGT-DTVVPE--TNEE 453 (453) T ss_pred --CCCC---CHHHHHHHHHHHHHHHHHHHHhccCCCCCC-CCCCCC--cCCC Confidence 3332 345666676655321110000000000000 000000 0000 No 22 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=99.91 E-value=4.3e-24 Score=148.68 Aligned_cols=417 Identities=7% Similarity=-0.040 Sum_probs=235.1 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCC--CCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceee Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMK--GADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVK 78 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~--~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~ 78 (612) ++.|.. ...++|+++.+.|.|....- +.+.. ......+ + +-.|+.+.+|+..+|++|.+||++. T Consensus 2 ~~~~~~--~~~~r~~~l~~yy~g~~~~~------~~~~~~~~~~~~~~-----k-i~~n~~~~ivd~~~~~l~g~~~~~~ 67 (440) T protein:vir:95 2 LAAFLG--SQKQRLAILASYAQGDNFSI------LSGHRRLDDEKADY-----R-VRHKWGGYISSFATGYVIGNPVSIG 67 (440) T ss_pred hhhHHH--HHHHHHHHHHHHhccCCccc------ccccccccccCCcc-----e-eecchHHHHHHhhhhheeccCceEe Confidence 445544 57889999999999964321 11111 1111111 1 4579999999999999999999994 Q ss_pred cCC---HHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhcchhhhccCCc Q lcl|NC_019408. 79 NLP---PKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILDWDEVVDMGGF 155 (612) Q Consensus 79 ~~p---~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW~~~~~v~g~ 155 (612) ..+ +.....+.++ ...+.++.....+.+.++.+|+++++|.... 
..+|-+..++|.+++-- +...++ + T Consensus 68 ~~~~~~~~~~~~l~~~-~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~------~~~~~i~~~~p~~~~~~-~d~~~~-~ 138 (440) T protein:vir:95 68 VMEGGSADQLSTIKDI-EWQNDINALNSDLAFDASVYGRAYEYHFRDK------DKVDRVVLISPLEMFVI-RDLTVE-Q 138 (440) T ss_pred eCCCccHHHHHHHHHH-HHhcCHhHHHHHHHHHHhhcCeEEEEEEecC------CCceEEEEEcccceEEE-EcCCCC-C Confidence 211 1222233444 3346899999999999999999999995432 24788999999988752 111111 2 Q ss_pred cceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccc Q lcl|NC_019408. 156 YVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKL 235 (612) Q Consensus 156 ~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~ 235 (612) ..+..|+..+. . + . .+..+... ..+++ T Consensus 139 ~~~~~i~~~~~--~-------~------~-~~~~vyt~---------------------~~~~~---------------- 165 (440) T protein:vir:95 139 NIIAAVHLPIY--A-------D------K-VNMTVYTK---------------------DKVIT---------------- 165 (440) T ss_pred ceEEEEEEEEe--c-------C------c-eEEEEEeC---------------------CeEEE---------------- Confidence 22333332211 0 0 0 00001100 01111 Q ss_pred eeEEEEEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhccceee Q lcl|NC_019408. 236 AYVQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALPVYY 315 (612) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l~ 315 (612) +.....+...+...+.. .++|+.||||.+... 
.+ +.+-+.++..|-=+.-...|++.+.+.+.++|+++ T Consensus 166 ---~~~~~~~~~~~~~~~~~----~~~~g~vPvv~~~n~--~~--g~sd~e~v~~lida~~~~~s~~~~~~~~~~~~~~v 234 (440) T protein:vir:95 166 ---YKPYSNNSVRLVVDDVK----KHSYNDVPVVEWWNN--RF--RMGDYESEISLIDAYDAGQSDTANYMSDLNDAMLL 234 (440) T ss_pred ---EEEecCCccceeeccee----eccCceeeEEEeeCC--CC--CCCchhhhHHHHHHHHHHHHHHHHHHHHhhcceee Confidence 11111111122222222 367999999987542 22 33445555555445556778889999999999999 Q ss_pred eecCCCCC---Cce--------EEEecccc--ccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHh-hhccccc Q lcl|NC_019408. 316 APGTDSEG---TGE--------YHIGPNMV--WEVPQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRM-MPGASKS 381 (612) Q Consensus 316 i~G~~~~~---~~~--------l~iG~~~~--~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~l-l~~~~~~ 381 (612) ++|..... .+. +...+... .....+++++|+..+. ..+.....++.+.+.|..+..-. +...... T Consensus 235 ~~g~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~ 313 (440) T protein:vir:95 235 VKGDLDGIKLSPEDAAKMKDANMLFLKTGISTTGQQTTADASYIYKQY-DVNGTEAYKNRLANDIHRFSRIPNLDDDRFN 313 (440) T ss_pred eecccccCCCCccchhhhhhccceecccccccccCCCCcceeEEeecC-CHHHHHHHHHHHHHHHHHHhCCccccccccc Confidence 99953221 111 11111111 1224578899999875 45677888999999887764322 1111123 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCC---CCcceEEEeeccccccCCCHHHHHHHHHHHH Q lcl|NC_019408. 
382 VSESNNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLA---DTENLRYEVNTDFLSTPIGAREMRAIQLMAN 458 (612) Q Consensus 382 ~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~---~~~~~~v~ln~dF~~~~~d~~~~~al~~~~~ 458 (612) ++-||++...............-..+..++.+++++++.+++...+ +...+.|.++ +..+.+ ..+.++++..+ T Consensus 314 ~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~i~f~-~~~p~~-~~~~ad~~~kl-- 389 (440) T protein:vir:95 314 STSSGIALLYKMIGLEQVRKDKETYFTKALRRRYELISNIHKAINGPVIEANKLTFTFH-PNIPQD-VWTEIKAYIEA-- 389 (440) T ss_pred ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccceEEeC-CCCCCC-HHHHHHHHHHH-- Confidence 4568888777777777777777788899999999999999764322 1234455554 333433 35567777776 Q ss_pred cCCCCHHHHHHHHHhcCccchhhhhHHHHHHhhccccccccchhHHhhhhhhHHHHhHH Q lcl|NC_019408. 459 DGLLPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFINNPDAQARQRGYTNRGQELE 517 (612) Q Consensus 459 ~G~is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~e 517 (612) +|.||++|++..| +.+ +...+..+|.+|.... ..............+.+.| T Consensus 390 ~g~iS~et~~~~l---~~~----d~~~E~~ri~~E~~~~-~~~~~~~~~~~~~~~~~~e 440 (440) T protein:vir:95 390 GGEISQETLMENA---SFT----DYKTEHSRILKQGGSS-DLEIGQIVGDADVGQADTE 440 (440) T ss_pred hccCcHHHHHHhC---CCC----CcHHHHHHHHHHHHHh-hhhHHhhccCCCCCCcCCC Confidence 6899999998876 222 3334455554442210 0000000000000001111 No 23 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=99.91 E-value=6.6e-24 Score=147.68 Aligned_cols=440 Identities=10% Similarity=0.014 Sum_probs=241.2 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeecC Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKNL 80 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~~ 80 (612) ||.|= .....++++.+.+.|.|...+-. +. ......++.. 
+=+-.|+.+.+|+.++|++|.+||++..- T Consensus 48 ~i~~~-~~~~~~r~~~l~~Yy~g~~~i~~-------~~-~~~~~~~~~~--~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~ 116 (511) T protein:vir:99 48 YIEHH-MDYQRPRLKVLSDYYEGKTKNLV-------EL-TRRKEEYMAD--NRVAHDYASYISDFINGYFLGNPIQYQDD 116 (511) T ss_pred HHHHH-HHhhHHHHHHHHHHhcccCcccc-------cc-CcccccccCc--ceeecchHHHHHHHHHhhhcccCceeecC Confidence 33220 12245778888888888654311 11 1111111111 11447999999999999999999999644 Q ss_pred CHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhcchhhhccCCccceeE Q lcl|NC_019408. 81 PPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILDWDEVVDMGGFYVPSR 160 (612) Q Consensus 81 p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW~~~~~v~g~~~Lt~ 160 (612) .+.+...+.++-.. ++++.+...+++.++.+|+++++|-... ..+|-+..++|.+++=. +...+.+ ..+.- T Consensus 117 d~~~~~~l~~~~~~-n~~~~~~~~~~~~~~i~G~a~~~vy~de------d~~~~i~~~~p~~~~~v-yd~~~~~-~~~~~ 187 (511) T protein:vir:99 117 DKDVLEAIEAFNDL-NDVESHNRSLGLDLSIYGKAYELMIRNQ------DDETRLYKSDAMSTFVI-YDNTIER-NSIAG 187 (511) T ss_pred chHHHHHHHHHHhh-cCHhHHHHHHHHHHHhcCeeEEEEEeCC------CCceEEEEEccceeEEE-EcCCCCC-ceEEE Confidence 56677777777555 5899999999999999999999996532 24788999999988752 1111111 12333 Q ss_pred EEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccceeEEE Q lcl|NC_019408. 161 VLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAYVQY 240 (612) Q Consensus 161 v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~~ 240 (612) |++..... .++... ....+..+...+ .+|+. 
T Consensus 188 vr~~~~~~-------~~~~~~-~~~~~~~vyt~~---------------------~i~~~-------------------- 218 (511) T protein:vir:99 188 VRYLRTKP-------IDKTDE-DEVFTVDLFTSH---------------------GVYRY-------------------- 218 (511) T ss_pred EEEEEeee-------cccCcc-ceEEEEEEEeCC---------------------cEEEE-------------------- Confidence 33221100 000000 000010111100 11111 Q ss_pred EEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhccceeeeecCC Q lcl|NC_019408. 241 LYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALPVYYAPGTD 320 (612) Q Consensus 241 ~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~~ 320 (612) ...+++.............++++.||||.+.. +.+ +.+-|.++..|-=+.-...|++.+.+...+.|+++++|.. T Consensus 219 -~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n--n~~--g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~ 293 (511) T protein:vir:99 219 -LTSRTNGLKLTPRENGFESHSFERMPITEFSN--NER--RKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNL 293 (511) T ss_pred -EecCCccccccccccccccCCCCccceEEecC--CCC--CCCchhhhHHHHHHHHHHHHHHHHHHHHhhchhhhhccCc Confidence 01111111111112233457899999998753 222 2333444444433445677888999999999999999853 Q ss_pred CCCCceE---------E-----EeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHh-hhccccchhHH Q lcl|NC_019408. 321 SEGTGEY---------H-----IGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRM-MPGASKSVSES 385 (612) Q Consensus 321 ~~~~~~l---------~-----iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~l-l~~~~~~~~es 385 (612) ....+.+ . ...........|++++||..+.+ .+.....++.+.+.|..++.-. 
+......++-| T Consensus 294 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~-~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~gn~S 372 (511) T protein:vir:99 294 NLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYD-VQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQS 372 (511) T ss_pred ccCchhhcccccccceecccccccccccccCCCCcceeEEeecCC-HHHHHHHHHHHHHHHHHHhCCcccccccccccch Confidence 3222211 1 11122233456789999997654 5667888999999887765322 11112234668 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCC-----CCcceEEEeeccccccCCCHHHHHHHHHHHHcC Q lcl|NC_019408. 386 NNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLA-----DTENLRYEVNTDFLSTPIGAREMRAIQLMANDG 460 (612) Q Consensus 386 a~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~-----~~~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G 460 (612) |++...............-..+..++.+.+++++.+++.... +-.++.|.+++ ..+.+ .++.++++..+ .| T Consensus 373 g~Alk~~~~~l~~ka~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~i~f~~-~~p~n-~~e~~~~~~kl--~G 448 (511) T protein:vir:99 373 GEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDVSKDFNTVRYVYNR-NLPKS-LIEELKAYIDS--GG 448 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccccccccceEEeCC-CCCcC-HHHHHHHHHHH--hc Confidence 888888887777777888888899999999999998764221 11234555532 23333 24566767666 59 Q ss_pred CCCHHHHHHHHHhcCccchhhhhHHHHHHhhccccccccchhHH-hhhh---hhHHHHhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 461 LLPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFINNPDAQA-RQRG---YTNRGQELEQSRMAREADFTQQKIDI 534 (612) Q Consensus 461 ~is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~-~~~~---e~~r~~~~e~~r~~~e~e~~~q~~e~ 534 (612) .||++|++..+ ..++ ++++|.++|.+|........... ...+ ....+...++ . 
....+| T Consensus 449 iiS~et~l~~l---~~v~---D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~--------~~d~~e 511 (511) T protein:vir:99 449 KISQTTLMSLF---SFFQ---DPELEVKKIEEDEKESIKKAQKNMYQDPRNINDDEQDDSTK-D--------SIDKKE 511 (511) T ss_pred cCCHHHHHHhC---CCCC---CHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCCCCCCCCc-C--------cccccC Confidence 99999998876 2222 35667777766532100000000 0000 0000000000 0 000000 No 24 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=99.91 E-value=1.4e-23 Score=145.92 Aligned_cols=432 Identities=11% Similarity=0.048 Sum_probs=247.1 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeec- Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKN- 79 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~- 79 (612) ||.+ +....++.....+.|.|...+..+...++.+... ..++.- .|.+ .|+++.+|+..+|++|.+||++.. T Consensus 9 ~i~~--~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~---~~~~~~-~ki~-~n~~~~Ivd~~~~yl~G~p~~~~~~ 81 (451) T protein:vir:10 9 IISA--DAARRQEILQAKSYYYNKNDILKKGVVVQNRDEN---PLRNAD-NRIS-HNFHEILVDEKASYMFTYPVLFDID 81 (451) T ss_pred HHHH--HHHHHHHHHHHHHHhcccCccccccccccccccc---cccccc-cccc-cchHHHHHHhhhhheecccceeecC Confidence 5433 4456677788888898876554433333322211 111110 1222 599999999999999999999841 Q ss_pred CCHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchh--hhhccCceEEEechhhhhc-chhhhccCCcc Q lcl|NC_019408. 80 LPPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPR--KGAVATSFAVGYSAENILD-WDEVVDMGGFY 156 (612) Q Consensus 80 ~p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~--~~~~~rPy~~~~~ae~Iin-W~~~~~v~g~~ 156 (612) -......+++... .++++.....+.+.++.+|+++++|-...... 
.....++-+..++|++++- |+ ..+.+ T Consensus 82 ~~~~~~~~~~~~~--~n~~~~~~~~~~~~~~~~G~a~~~~y~de~~~~~~~~~~~~~~~~i~p~~~~~vyd--d~~~~-- 155 (451) T protein:vir:10 82 NNKELNEKVTDVL--GNEFTRKAKNLAIEASNCGSAWLHYWIDEEYSGEQVTNQTFKYGVVNTEEIIPIYR--NGIER-- 155 (451) T ss_pred CcHHHHHHHHHHh--ccCHHHHHHHHHHHHhhcCeEEEEEeecCCcccccccccceeEEEEcccceEEEEc--CCCCC-- Confidence 2244555666554 37899999999999999999999986543211 1222456678888988865 42 12222 Q ss_pred ceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccce Q lcl|NC_019408. 157 VPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLA 236 (612) Q Consensus 157 ~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~ 236 (612) .+.. .+|...... +.++........+..+...+ .+|++ T Consensus 156 ~~~~-~ir~~~~~~----~~~~~~~~~~~~~~e~yt~~---------------------~~~~~---------------- 193 (451) T protein:vir:10 156 ELEA-VIRYYIQLE----DVKGQIQKQAYTYVEFWTDK---------------------ILDKY---------------- 193 (451) T ss_pred ceEE-EEEEEEeee----cccccccceEEEEEEEEeCC---------------------eEEEE---------------- Confidence 2222 222222111 11111111111111111110 11111 Q ss_pred eEEEEEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhccceeee Q lcl|NC_019408. 237 YVQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALPVYYA 316 (612) Q Consensus 237 ~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i 316 (612) ...+.+............++|+.||||.+... .. 
+.+-|.++..|-=++-...|++.+++...+.|++++ T Consensus 194 ------~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn--~~--~~~d~e~v~~liDa~~~~~S~~~~~~~~~~~~~l~~ 263 (451) T protein:vir:10 194 ------KFFGVSCCGSQIEHITVQHRFNSVPFVEFSNN--IK--KQSDLSKYKKILDLYDRVMSGFANDLEDIQQIIYIL 263 (451) T ss_pred ------EecccCccccccccccccCCCCeeeEEEeccC--CC--CCCchhhHHHHHHHHHHHHHHHHHHHHHhccceeee Confidence 11111111111111122367999999988542 22 233445555554445567789999999999999999 Q ss_pred ecCCCCCCce--EEEeccccccCC-----CCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHhhhccccchhHHHHHH Q lcl|NC_019408. 317 PGTDSEGTGE--YHIGPNMVWEVP-----QGSEPGILEYTGQGLKALETALNDKERQIAAIGGRMMPGASKSVSESNNQT 389 (612) Q Consensus 317 ~G~~~~~~~~--l~iG~~~~~~lp-----~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~ll~~~~~~~~esa~~~ 389 (612) +|.+.+..+. -.+....++.++ .|++++||..+.. .+.....++.+++.|...+.-.-....+.++-||.+. T Consensus 264 ~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l~~~~~-~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Al 342 (451) T protein:vir:10 264 ENFGGEDTSEFLKELKRYKTIKTETDSEGDSGGLKTMQIEIP-TEARKIILEILKKQIYESGQGLQQDTENFGNASGVAL 342 (451) T ss_pred ecCCcccchhhHHHHhhCCeEEecCcCCccCCcceEEeecCC-HHHHHHHHHHHHHHHHHHhCcccccccccccccHHHH Confidence 9976543322 123444555554 4678999998875 4677889999999998875432111223346788888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCcceEEEeeccccccCCCHHHHHHHHHHHHcCCCCHHHHHH Q lcl|NC_019408. 390 VLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLADTENLRYEVNTDFLSTPIGAREMRAIQLMANDGLLPDPVFYE 469 (612) Q Consensus 390 ~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G~is~et~~~ 469 (612) ..............-..+..++.+.+++++.++|.. +..++.|.+++.. +.+ ..+.++++..+ .|.||++|++. 
T Consensus 343 k~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~--d~~~i~i~f~~~~-p~n-~~e~~~~~~kl--~g~iS~et~~~ 416 (451) T protein:vir:10 343 KFFYRKLELKSGLLETEFRTSFDKLIKAILYFLGVT--DYKKIQQTYTRNM-MSN-DLEDADIATKS--VGIIPTKIILR 416 (451) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--CccceeEEecCCC-CCC-HHHHHHHHHHH--hccCchHHHHH Confidence 888888888888888999999999999999999864 3445666665432 222 35567777776 48999999987 Q ss_pred HHHhcCccchhhhhHHHHHHhhccccccccchhHHhhhhhhHHHHhHHHHHHHHHHH Q lcl|NC_019408. 470 YMRKAEVISSDMTFEEFQALRADENSFINNPDAQARQRGYTNRGQELEQSRMAREAD 526 (612) Q Consensus 470 ~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~e~~r~~~e~e 526 (612) .+ +.++ +++++..++.++............-. .. + T Consensus 417 ~~---p~v~---d~~~e~~~~~ee~~~~~~~~~~~~~~-~~---------------~ 451 (451) T protein:vir:10 417 HH---PWVD---DVEEAEKLYLEEKKIQASKVSDDYNN-FT---------------E 451 (451) T ss_pred hC---CCCC---CHHHHHHHHHHHHHHHHHHHHhhcCC-CC---------------C Confidence 65 2222 23444444433211000000000000 00 0 No 25 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=99.91 E-value=1.7e-23 Score=145.36 Aligned_cols=442 Identities=12% Similarity=0.034 Sum_probs=240.2 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeecC Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKNL 80 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~~ 80 (612) ||.|= .....++++++.+.|.|...+-. .+......++.. 
.| +-.|+.+.+++.++|++|.+||++..- T Consensus 48 ~i~~~-~~~~~~r~~~l~~Yy~g~~~i~~--------~~~~~~~~~~~~-~k-i~~n~~k~Iv~~~~~yl~g~p~~~~~~ 116 (511) T protein:vir:96 48 YIEHH-MDYQRPRLKVLSDYYEGKTKNLV--------ELTRRKEEYMAD-NR-VAHDYASYISDFINGYFLGNPIQYQDD 116 (511) T ss_pred HHHHH-HHhhHHHHHHHHHHhcccCcccc--------ccCcCcccccCc-ce-eecchHHHHHHHHHhhhccCCceeecC Confidence 33321 12346788889999988654311 111111111111 12 337999999999999999999999544 Q ss_pred CHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhc-chhhhccCCcccee Q lcl|NC_019408. 81 PPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILD-WDEVVDMGGFYVPS 159 (612) Q Consensus 81 p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~Iin-W~~~~~v~g~~~Lt 159 (612) .+.....+.++-.. ++++.+...++..++.+|+++++|-... ..+|-+..++|.+++- |+ ..+. ...+. T Consensus 117 ~~~~~~~l~~~~~~-n~~~~~~~~~~~~~~i~G~a~~~vy~de------d~~~~i~~~~p~~~~~vyd--d~~~-~~~~~ 186 (511) T protein:vir:96 117 DKDVLEAIEAFNDL-NDVESHNRSLGLDLSIYGKAYELMIRNQ------DDETRLYKSDAMSTFVIYD--NTIE-RNSIA 186 (511) T ss_pred chHHHHHHHHHHhh-cCHHHHHHHHHHHHHhcCeeEEEEEeCC------CCceEEEEEccceeEEEEc--CCCC-CceEE Confidence 56677777777666 5899999999999999999999997542 2468888888888775 32 1111 22233 Q ss_pred EEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccceeEE Q lcl|NC_019408. 160 RVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAYVQ 239 (612) Q Consensus 160 ~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~ 239 (612) .|+...... .+......+ .+..+... 
-++|+ T Consensus 187 ~vr~~~~~~-------~d~~~~~~~-~~~~iyt~---------------------~~i~~-------------------- 217 (511) T protein:vir:96 187 GVRYLRTKP-------IDKTDEDEV-FTVDLFTS---------------------HGVYR-------------------- 217 (511) T ss_pred EEEEEEeee-------ccccccceE-EEEEEEeC---------------------CcEEE-------------------- Confidence 333322100 000000000 00000000 01111 Q ss_pred EEEeeCCCce-ecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhccceeeeec Q lcl|NC_019408. 240 YLYEEDPESR-PIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALPVYYAPG 318 (612) Q Consensus 240 ~~~~~~~~~~-~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i~G 318 (612) +...++++ ...........++|+.||||.+.. +.+ +.+-|.++..|-=+.-...|++.+.++..+.|+++++| T Consensus 218 --~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n--n~~--g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g 291 (511) T protein:vir:96 218 --YLTSRTNGLKLTPRENGFESHSFERMPITEFSN--NER--RKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKG 291 (511) T ss_pred --EEecCCCcccccccccccccccCCceeeEEecC--CCC--CCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeec Confidence 11111111 111112223457899999998753 222 22334444443334445678889999999999999999 Q ss_pred CCCCCCceE-EEeccc-------------cccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHh-hhccccchh Q lcl|NC_019408. 319 TDSEGTGEY-HIGPNM-------------VWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRM-MPGASKSVS 383 (612) Q Consensus 319 ~~~~~~~~l-~iG~~~-------------~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~l-l~~~~~~~~ 383 (612) ......+.+ ....+. ......+++++||..+.. .+.....++.+.+.|..++.-. 
+......++ T Consensus 292 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~~n 370 (511) T protein:vir:96 292 NLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYD-VQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGT 370 (511) T ss_pred CccCCchhhcccccccceecccccccccccccCCCCcceeEEeecCC-HHHHHHHHHHHHHHHHHHhCCccccccccccc Confidence 533322211 111111 222345788999997654 4566888889999887765322 111112245 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCC-----CcceEEEeeccccccCCCHHHHHHHHHHHH Q lcl|NC_019408. 384 ESNNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLAD-----TENLRYEVNTDFLSTPIGAREMRAIQLMAN 458 (612) Q Consensus 384 esa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~-----~~~~~v~ln~dF~~~~~d~~~~~al~~~~~ 458 (612) -||++...............-..+..++.+.+++++.+++..... -.++.|.+++.. +.+ ..+.+.++..+ T Consensus 371 ~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~-p~n-~~e~~~~~~kl-- 446 (511) T protein:vir:96 371 QSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTWSIDANKDFNTVRYVYNRNL-PKS-LIEELKAYIDS-- 446 (511) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccccccccceEEeCCCC-CCC-HHHHHHHHHHH-- Confidence 688888888888777788888888999999999999887643211 124455554322 222 24456666665 Q ss_pred cCCCCHHHHHHHHHhcCccchhhhhHHHHHHhhccccccccchhHHhhhhhhHHHHhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 459 DGLLPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFINNPDAQARQRGYTNRGQELEQSRMAREADFTQQKIDIQE 536 (612) Q Consensus 459 ~G~is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~e~~r~~~e~e~~~q~~e~~~ 536 (612) .|.||++|++..+ +.++ ++++|.++|.+|..... ...+.............+..-. 
.+...++.+ T Consensus 447 ~G~iS~et~l~~l---~~v~---D~~~E~~ri~~E~~~~~-~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~ 511 (511) T protein:vir:96 447 GGKISQTTLMSLF---SFFQ---DPELEVKKIEEDEKESI-KKAQKGIYKDPRDINDDEQDDD------TKDTVDKKE 511 (511) T ss_pred hccCChHHHHHhC---CCCC---CHHHHHHHHHHHHHHHH-HHHhhccccCCCCCCCCCCCCc------ccccccccC Confidence 6999999998765 2222 35667777766532100 0000000000000000000000 000000000 No 26 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=99.91 E-value=2.6e-23 Score=144.37 Aligned_cols=415 Identities=12% Similarity=0.017 Sum_probs=233.6 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeecC Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKNL 80 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~~ 80 (612) ||. .|....++++.+.+-|.|...++ |+|+ .-...|++.. ...|+.+-+|+.++++++-...+.. - T Consensus 12 l~~--~~~~~~~r~~~l~~Yy~G~~~i~-----~~~~---~~~~~~~~~k---~~~n~~~~ivd~~~~~l~~~g~~~~-d 77 (441) T protein:vir:80 12 MYD--RIQRLSSWHCCIEGYYEGSNRVR-----DLGV---AIPPELQRVQ---TVVSWPGIAVDALEERLDWLGWTNG-D 77 (441) T ss_pred HHH--HHHHHHHHHHHHHHHHhcCCcch-----hcCc---ccchhhhhhh---hhcchHHHHHHHHHhhhccccccCC-C Confidence 444 36666778888888888865443 2332 3333344332 4579999999999999964433332 3 Q ss_pred CHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhc-chhhhccCCcccee Q lcl|NC_019408. 81 PPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILD-WDEVVDMGGFYVPS 159 (612) Q Consensus 81 p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~Iin-W~~~~~v~g~~~Lt 159 (612) ++.|..+++. ++++.+...++..++.+|+++++|= +.. ...|.+..++|++++- |+ .. .++..+. 
T Consensus 78 ~~~l~~i~~~-----n~~~~~~~~~~~~~~~~G~a~~~v~-~d~-----~g~~~i~~~~p~~~~~i~d--~~-~~~~~~~ 143 (441) T protein:vir:80 78 GYGLDGVYAA-----NRLATASCDVHLDALIFGLSFVAII-PHG-----DGTVSVRPQSPKNCTGKFS--AD-GSRLDAG 143 (441) T ss_pred hHHHHHHHHh-----cCHHHHHHHHHHHHhhcCeeEEEEE-eCC-----CCceEEEEEccceEEEEEe--CC-CCceeEE Confidence 4678877764 6899999999999999999999984 332 2468899999999865 42 11 2232222 Q ss_pred EEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccceeEE Q lcl|NC_019408. 160 RVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAYVQ 239 (612) Q Consensus 160 ~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~ 239 (612) .++..+ +. + -..++.+.+.+ ... T Consensus 144 ~~~~~~---~~------~------~~~~~~vy~~~------------------------------------------~~~ 166 (441) T protein:vir:80 144 LVVQQT---CD------P------EVVEAELLLPD------------------------------------------VIV 166 (441) T ss_pred EEEEEE---ec------C------ceEEEEEEecC------------------------------------------eEE Confidence 222221 00 0 00111111110 001 Q ss_pred EEEeeCCCceecceeeeccCCccccceeEEEee-cCCCCCCcCcCchHH-HHHHHHHHHhhhHHHHHHHHHhccceeeee Q lcl|NC_019408. 240 YLYEEDPESRPIARIVPTVRGEPLDFIPFKFFG-ASGNTADVEKPPLLD-ICDLNLSHYRTYAELEYGRLFTALPVYYAP 317 (612) Q Consensus 240 ~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~-~~~~~~~~~~pPLld-LA~lnl~HY~~~sD~~~~l~~~~~P~l~i~ 317 (612) ..+..+++.+...+. ..++++.||+|.|. ....+..-+.+-|.+ +-.|-=+.=...|+...++.+.++|+++++ T Consensus 167 ~~~~~~~~~~~~~~~----~~~~~g~vPvv~~~n~~~~~~~~G~s~l~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~ 242 (441) T protein:vir:80 167 QVERRGSREWVEVDR----IPNVLGAVPLVPIVNRRRTSRIDGRSEITRSIRAYTDEAVRTLLGQSVNRDFYAYPQRWVT 242 (441) T ss_pred EEEEcCCcceeeccc----cccCCCceeEEEeeccccCCccCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcCceeeee Confidence 112222222222122 23679999999764 222222223443321 222211333556788889999999999999 Q ss_pred cCCCCC--CceEEEeccccccCCCCC---ceeEEecCchhHHHHHHHHHHHHHHHHHHHHHhhh--ccccchhHHHHHHH Q lcl|NC_019408. 
318 GTDSEG--TGEYHIGPNMVWEVPQGS---EPGILEYTGQGLKALETALNDKERQIAAIGGRMMP--GASKSVSESNNQTV 390 (612) Q Consensus 318 G~~~~~--~~~l~iG~~~~~~lp~~~---~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~ll~--~~~~~~~esa~~~~ 390 (612) |.+.+. .+...+..+.+|.+|.++ .+++.+.+.+.++...+.|+.+..++.....-... ..++..+.||.+.. T Consensus 243 G~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~~~~Sg~Al~ 322 (441) T protein:vir:80 243 GVSADEFSQPGWVLSMASVWAVDKDDDGDTPNVGSFPVNSPTPYSDQMRLLAQLTAGEAAVPERYFGFITSNPPSGEALA 322 (441) T ss_pred cCCccccccchhhhcccccccCCCCCCCCcceeEecCccchHHHHHHHHHHHHHHhcccCCCHHHhccCCCcchHHHHHH Confidence 975433 223456677888888654 36777888888888888888887777543211111 11112224787777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCc---ceEEEeeccccccCCCHHHHHHHHHHHHcCCC--CHH Q lcl|NC_019408. 391 LREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLADTE---NLRYEVNTDFLSTPIGAREMRAIQLMANDGLL--PDP 465 (612) Q Consensus 391 ~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~~---~~~v~ln~dF~~~~~d~~~~~al~~~~~~G~i--s~e 465 (612) .............-..+..++.+.+++++.++|....... .+.|.+++ ..+.++ .+.++++.+++++|.+ |++ T Consensus 323 ~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~~~~~~i~~~f~~-~~~~~~-~e~ad~~~kl~~~g~~~~s~~ 400 (441) T protein:vir:80 323 AEESRLVKRAERRQTSFGQGWLSVGFLAAKALDSRVDEADFFGDVGLRWRD-ASTPTR-AATADAVTKLVGAGILPADSR 400 (441) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccceeeeEEeCC-CCCcCH-HHHHHHHHHHHhcCcccccHH Confidence 7777777777777777888999999999999886543322 33444433 233332 5678889999999975 566 Q ss_pred HHHHHHHhcCccchhhhhHHHHHHhhccccccccchhHHhhhhhhHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 466 VFYEYMRKAEVISSDMTFEEFQALRADENSFINNPDAQARQRGYTNRGQELEQSRMAREADFTQQKIDIQERSVAV 541 (612) Q Consensus 466 t~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~e~~r~~~e~e~~~q~~e~~~r~~~~ 541 (612) +++..+ |..+ ++.+++.++. +++.++. .+......++-. +. 
T Consensus 401 ~~~~~l---~~~~------~e~~~~~~e~------------------~e~~~~~-~~~~~~~~~~~~-------~~ 441 (441) T protein:vir:80 401 TVLEML---GLDD------VQVEAVMRHR------------------AESSDPL-AVLAGAISRQTN-------EV 441 (441) T ss_pred HHHHhC---CCCH------HHHHHHHHHH------------------HHHHHHH-HHHhhhhhcccc-------cC Confidence 665543 3322 1222221110 0000000 000000000000 00 No 27 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=99.91 E-value=7.1e-24 Score=147.48 Aligned_cols=434 Identities=9% Similarity=0.017 Sum_probs=241.4 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeecC Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKNL 80 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~~ 80 (612) ||.+ +....+++.+..+.|.|...+-.+- .|. .............+ .-.|+.+.+++.++|++|.+||++..- T Consensus 35 ~i~~--~~~~~~~~~~~~~Yy~g~~~i~~r~-~~~---~~~~~~~~~~~~~k-i~~n~~~~Ivd~~~~~l~g~p~~~~~~ 107 (474) T protein:vir:95 35 LIDD--HRKQLDKITVGQRYYDKDNDIVKQM-KKV---DVYGNIDYDKPDWR-ITTNFHQNLVDQKVSYVASKPVTYSCE 107 (474) T ss_pred HHHH--HHHHHHHHHHHHHHhcccCchhccc-ccc---ccccccccccccce-eccchHHHHHHHHHhhhccCCceeccC Confidence 5543 4567778888888888876543221 111 01000001111112 236999999999999999999999533 Q ss_pred CHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhcchhhhccCCccceeE Q lcl|NC_019408. 81 PPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILDWDEVVDMGGFYVPSR 160 (612) Q Consensus 81 p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW~~~~~v~g~~~Lt~ 160 (612) .+....++.++- .++++.....+.+.++.+|+++++|..+. ..+|-+..++|.+++-. +..... ..+.. 
T Consensus 108 d~~~~~~l~~~~--~n~~~~~~~e~~~~~~~~G~~~~~v~~d~------~~~~~i~~~~p~~~~~v-~d~~~~--~~~~~ 176 (474) T protein:vir:95 108 DESVLKIIHDVL--DTRWDNKLIDILTATSNKGIDWLQVYINE------NGEMKLFRVPAEQAIPI-WVDKER--EELKS 176 (474) T ss_pred chHHHHHHHHHH--hccHHHHHHHHHHHHhhcCcEEEEEEecC------CCceEEEEEcccceEEE-EcCCCC--CceEE Confidence 355555666553 26789999999999999999999997653 24788999999998853 111112 22322 Q ss_pred EEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccceeEEE Q lcl|NC_019408. 161 VLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAYVQY 240 (612) Q Consensus 161 v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~~ 240 (612) + ++.+.... ...+ .+...+ .+..|+.....+..... T Consensus 177 ~-i~~~~~~~-------------~~~~-~~y~~~-------------------~~~~~~~~~~~~~~~~~---------- 212 (474) T protein:vir:95 177 F-IRYYKFNN-------------EEKV-EFWTDT-------------------TVTYYVLENGGLIPDYY---------- 212 (474) T ss_pred E-EEEEEEcC-------------eeEE-EEEeCC-------------------eEEEEEEcCCccccccc---------- Confidence 2 22211110 0000 011100 01122211100000000 Q ss_pred EEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhccceeeeecCC Q lcl|NC_019408. 241 LYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALPVYYAPGTD 320 (612) Q Consensus 241 ~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~~ 320 (612) .+ .....+...-++++.||||.+.... .. .+-|.++-.|-=..=...|++.+.+.+.++|+++++|.+ T Consensus 213 ---~~-----~~~~~~~~~~~~~g~iPvv~~~nn~--~g--~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~ 280 (474) T protein:vir:95 213 ---YG-----ANHIQSHFSNGNWGRVPFIAFKNNP--EE--VSDIWMYKSLIDAIDKRLSDAQNMFDESVELIYILKGYE 280 (474) T ss_pred ---cC-----cccccccccccCCCccceEeecCCC--CC--CCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCC Confidence 00 0001112233579999999875422 21 222222222211222355777778888999999999986 Q ss_pred CCCCce--EEEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHh-hhccccchhHHHHHHHHHHHHHH Q lcl|NC_019408. 
321 SEGTGE--YHIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRM-MPGASKSVSESNNQTVLREANEQ 397 (612) Q Consensus 321 ~~~~~~--l~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~l-l~~~~~~~~esa~~~~~~~~~~~ 397 (612) .+.... -.+....+|.++.|++++|+..+. ..+.....|+.+.++|...+.-. +...+..++-||++......... T Consensus 281 ~~~~~~~~~~~~~~~~i~~~~~~~~~~l~~~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~ 359 (474) T protein:vir:95 281 GQDLEEFMRGLKYYKAINVDGDGGVETIQVEV-PVSSTKEYIDLMRAYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLD 359 (474) T ss_pred cccchhhhhhhhccceeeccCCCceeEEeecC-CHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHH Confidence 553222 224455677889999999999775 45778899999999998775332 11222234568888887777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCcceEEEeeccccccCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCcc Q lcl|NC_019408. 398 SLLLNIIQACESGMTDVVRWWLMWRDVPLADTENLRYEVNTDFLSTPIGAREMRAIQLMANDGLLPDPVFYEYMRKAEVI 477 (612) Q Consensus 398 s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G~is~et~~~~lqr~~vl 477 (612) .........+..++.+++++++.+.|... +..++.|.+++. .+.+ ..+.++ .+.++|.||++|++..+ +.+ T Consensus 360 ~k~~~k~~~~~~~l~~~~~li~~~~g~~~-d~~~i~v~f~~~-~p~d-~~e~a~---~~~~~g~iS~et~i~~l---~~v 430 (474) T protein:vir:95 360 LKANKLKNKATVAIQELIGFIIDFNNLKM-DVKDIEISFNFN-RMMN-DAEQSQ---IIAQSQYLSRETLVKSS---PLV 430 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhCCCc-ccceeeEEeccC-CCcC-HHHHHH---HHHhcCCCchHHHHHhC---CCC Confidence 77788888999999999999999999754 334555555322 2221 233344 34567999999998754 222 Q ss_pred chhhhhHHHHHHhhccccccccch-hHHh--hhhhhHHHHhHHHHHHHHHHH Q lcl|NC_019408. 478 SSDMTFEEFQALRADENSFINNPD-AQAR--QRGYTNRGQELEQSRMAREAD 526 (612) Q Consensus 478 ~~~~~~eee~~ria~e~~~~~~~~-~~~~--~~~e~~r~~~~e~~r~~~e~e 526 (612) + +++++.++|.+|........ .... 
.....+.+++.++ |.| T Consensus 431 ~---d~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~~~~~~~~~-----~~~ 474 (474) T protein:vir:95 431 D---DYKAELERIEQEQMEYNKQLPNLDDGGADGAQQQERSNDK-----ESE 474 (474) T ss_pred C---CHHHHHHHHHHHHHHHHhcccccccccCCCCcCCCCCccC-----CCC Confidence 2 34556666655421100000 0000 0000000000000 000 No 28 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=99.91 E-value=9.4e-24 Score=146.82 Aligned_cols=430 Identities=10% Similarity=0.028 Sum_probs=240.6 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeecC Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKNL 80 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~~ 80 (612) ||.+ +....++..+..+.|.|...+-.+...+..+. ....++... + +-.|+.+.+++..+|++|.+||++..- T Consensus 34 ~i~~--~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~---~~~~~~~~~-k-i~~n~~~~Iv~~~~~~l~g~p~~~~~~ 106 (468) T protein:vir:96 34 LITK--HKENVEDITVGERYYNHQPDVLFNAPKRNVKG---EIDPFKPDW-R-MYTNYHQNLVDQKVAYAVANPVTYGTE 106 (468) T ss_pred HHHH--HHHHHHHHHHHHHHhcCCCccccccccccccc---ccccccccc-c-cccchHHHHHHHHHhhhccCCceeccC Confidence 3322 23455667777888888754432221111111 111222111 1 237999999999999999999999432 Q ss_pred CHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhcchhhhccCCccceeE Q lcl|NC_019408. 81 PPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILDWDEVVDMGGFYVPSR 160 (612) Q Consensus 81 p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW~~~~~v~g~~~Lt~ 160 (612) .+.....+.++- +++++.....+...++.+|+++++|.... ..+|.+..++|.+++-. +.....+ .+.. 
T Consensus 107 d~~~~~~l~~~~--~n~~~~~~~~~~~~~~~~G~~~~~v~~d~------~~~~~i~~~~p~~~~~v-~~~~~~~--~~~~ 175 (468) T protein:vir:96 107 DEKSLKTIQEVL--NHKWDDKLVDILTAASNKGVEWIQPYVDE------QGEFKTFRVPAEQAIPI-WTNKERD--ELKA 175 (468) T ss_pred ChHHHHHHHHHH--hcCHHHHHHHHHHHHhhcCeEEEEEEEcC------CCceEEEEEcccceEEE-EcCCCCC--ceEE Confidence 344455555553 26788889999999999999999997653 24788999999998642 1111112 2222 Q ss_pred EEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccceeEEE Q lcl|NC_019408. 161 VLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAYVQY 240 (612) Q Consensus 161 v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~~ 240 (612) + ++.+..+. . .. ..+... + .+..|+......... T Consensus 176 ~-ir~~~~~~-----~--------~~-~~~~~~-----------------~--~~~~~~~~~~~~~~~------------ 209 (468) T protein:vir:96 176 F-IRLYELDG-----G--------ER-VEYWTA-----------------N--DVTFYELKDGQLIPD------------ 209 (468) T ss_pred E-EEEEEecC-----c--------eE-EEEEeC-----------------C--eEEEEEEcCCceeec------------ Confidence 2 22221110 0 00 001000 0 011222110000000 Q ss_pred EEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhccceeeeecCC Q lcl|NC_019408. 241 LYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALPVYYAPGTD 320 (612) Q Consensus 241 ~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~~ 320 (612) .. ....+.. .........++|+.||||.+... .. 
+.+-|.++..|-=+.-...|++.+.+.+.++|+++++|.+ T Consensus 210 ~~-~~~~~~~-~~~~~~~~~~~~~~iPvv~~~n~--~~--g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~ 283 (468) T protein:vir:96 210 YY-QGEEHVQ-AHYYVGNKSMSWNRVPFIPFKNN--PQ--EVSDLFMYKTIIDAMDKRLSDTQNTFDEATELIYVLKGYE 283 (468) T ss_pred cc-ccccccc-cceeeccccccCCcccEEEecCC--CC--CCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCC Confidence 00 0000000 00111223467999999988542 22 2333555554443445667888888899999999999986 Q ss_pred CCCCceE--EEeccccccCC--CCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHh-hhccccchhHHHHHHHHHHHH Q lcl|NC_019408. 321 SEGTGEY--HIGPNMVWEVP--QGSEPGILEYTGQGLKALETALNDKERQIAAIGGRM-MPGASKSVSESNNQTVLREAN 395 (612) Q Consensus 321 ~~~~~~l--~iG~~~~~~lp--~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~l-l~~~~~~~~esa~~~~~~~~~ 395 (612) .+..... .+..+.++.++ .+++++|+..+.. .+.....++.+.+.+..++.-. +...+..++.||++....... T Consensus 284 ~~~~~~~~~~~~~~~~i~~~~d~~~~~~~l~~~~~-~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~ 362 (468) T protein:vir:96 284 GEDLEEFMYNLKYYKAINVDGDGSGGVDTIQIDVP-VQSAKEYLDMLRDYVIEFGQGVDFQQDKFGNSPSGIALKFMYSN 362 (468) T ss_pred ccccchhhhhhhcCceEEecCCCCCcceEEeecCC-hHHHHHHHHHHHHHHHHHhCcccccccccccchHHHHHHHHHHH Confidence 5432222 22223455554 4578999998875 4677888999999988875322 111122346788888888888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCcceEEEeeccccccCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcC Q lcl|NC_019408. 396 EQSLLLNIIQACESGMTDVVRWWLMWRDVPLADTENLRYEVNTDFLSTPIGAREMRAIQLMANDGLLPDPVFYEYMRKAE 475 (612) Q Consensus 396 ~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G~is~et~~~~lqr~~ 475 (612) ...........+..++.+++++++.+.|... +..++.|.+++. .+.+ ..+.++ .+.++|.||++|.+..+ . 
T Consensus 363 l~~k~~~k~~~~~~~l~~~~~li~~~~g~~~-d~~~i~i~f~~~-~p~d-~~e~a~---~~~~~g~iS~et~i~~l---~ 433 (468) T protein:vir:96 363 LDLKANKLKNKTLTALQELLQYIIDFYKLSI-KVQDVEITFNFN-VMVN-ELEQSQ---IGVNSQYLSKETVVTNH---P 433 (468) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCc-ccceeeEEecCC-CCcC-HHHHHH---HHHhcCCCchHHHHHhC---C Confidence 8888888999999999999999999999754 334556655422 2222 123333 45568999999998754 2 Q ss_pred ccchhhhhHHHHHHhhccccccccchhHHhhhhhhHHHHhHHHH Q lcl|NC_019408. 476 VISSDMTFEEFQALRADENSFINNPDAQARQRGYTNRGQELEQS 519 (612) Q Consensus 476 vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~e~~ 519 (612) .+. ++++|.++|.+|...... .+... --....+.. T Consensus 434 ~v~---D~~~E~~ri~~E~~~~~~-----~~~~~-~~~~~~~~~ 468 (468) T protein:vir:96 434 WVD---DPVAEMERIDQEELALPS-----IEEGL-NGKENNEPT 468 (468) T ss_pred CCC---CHHHHHHHHHHHHHHHHH-----Hhhcc-CCCCCCCCC Confidence 222 356667777654211000 00000 000000000 No 29 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=99.90 E-value=7e-24 Score=147.52 Aligned_cols=437 Identities=12% Similarity=0.047 Sum_probs=240.1 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeecC Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKNL 80 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~~ 80 (612) || ..+....+++..+...|.|...+..+. +++... 
...+..+-..-+-.|+.+.+|+..+|++|.+||++..- T Consensus 34 ~i--~~~~~~~~~~~~~~~yY~g~~~i~~~~----~~~~~~-~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~ 106 (478) T protein:vir:10 34 LV--REHKENIDNITMGERYYNHHPDILDAP----PKRDVN-GDYDETKPDWRMYTNYHQNLVDQKVAYAVANPVTFGVD 106 (478) T ss_pred HH--HHHHHHHHHHHHHHHHhcCCCchhccc----cccccc-cccccccccceeccchHHHHHHHHHhhhccCCeeeecC Confidence 33 344556677888888888865443221 111111 11111111112456999999999999999999999533 Q ss_pred CHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhc-chhhhccCCcccee Q lcl|NC_019408. 81 PPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILD-WDEVVDMGGFYVPS 159 (612) Q Consensus 81 p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~Iin-W~~~~~v~g~~~Lt 159 (612) .+.....+.++- +++++.....+.+.++.+|+++++|.... ..+|-+..++|.+++- |+. +....+. T Consensus 107 ~d~~~~~l~~~~--~n~~~~~~~~~~~~~~~~G~~~~~~~~d~------~g~~~~~~~~p~~~~~i~d~----~~~~~~~ 174 (478) T protein:vir:10 107 NDKALKQIQHTL--NHKWDDKLVDILTAASNKGIEWVQPYVDE------EGEFKTFRVPAEQAVPIWTN----KERDELQ 174 (478) T ss_pred ChHHHHHHHHHH--hcCHHHHHHHHHHHHHhcCeEEEEEEecC------CCeeEEEEEcccceEEEEcC----CCCCceE Confidence 344444444442 36899999999999999999999996543 2478888999998764 321 1112233 Q ss_pred EEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccceeEE Q lcl|NC_019408. 160 RVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAYVQ 239 (612) Q Consensus 160 ~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~ 239 (612) .++ +....+. ...+. +...+ .+..|+.... .... 
T Consensus 175 ~~v-~~~~~~~-------------~~~~~-~y~~~-------------------~i~~~~~~~~------------~~~~ 208 (478) T protein:vir:10 175 AFI-RVYELDG-------------AERVE-YWTKD-------------------DVTYYELKEG------------QLIP 208 (478) T ss_pred EEE-EEEEecC-------------ceEEE-EEeCC-------------------eEEEEEEcCC------------eeec Confidence 222 1111000 00010 00000 0011111000 0000 Q ss_pred EEEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhccceeeeecC Q lcl|NC_019408. 240 YLYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALPVYYAPGT 319 (612) Q Consensus 240 ~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~ 319 (612) .......+.. .........++++.||||.+.. +.+ +.+-|.++..|-=+.-...|++.+.+.+.+.|+++++|. T Consensus 209 ~~~~~~~~~~--~~~~~~~~~~~~~~vPvv~~~n--~~~--g~sd~~~v~~liDa~~~~~S~~~~~~~~~~~p~~~~~g~ 282 (478) T protein:vir:10 209 DFYRSDDHIQ--PHYYQGNKLMSWGRVPFIPFKN--NPQ--EVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILKGY 282 (478) T ss_pred cccccccccc--cceecccccccCCccceEEecc--CCC--CCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecC Confidence 0000000000 0011112236799999998853 233 233344444443333456688888999999999999998 Q ss_pred CCCCCce--EEEeccccccC--CCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHh-hhccccchhHHHHHHHHHHH Q lcl|NC_019408. 320 DSEGTGE--YHIGPNMVWEV--PQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRM-MPGASKSVSESNNQTVLREA 394 (612) Q Consensus 320 ~~~~~~~--l~iG~~~~~~l--p~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~l-l~~~~~~~~esa~~~~~~~~ 394 (612) +.+.... ..+....++.+ ..|++++|+..+. ..+.....++.+++.|..++.-. +......++-||++...... 
T Consensus 283 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~ 361 (478) T protein:vir:10 283 EGEDMKDFMHNLKYYKAISVAGESGSGVDTIKVEV-PIDSVKEYTKMLRDYIIEFGQGVDFQQDKFGNSPSGIALKFMYS 361 (478) T ss_pred CccccchhhhhhhhcceEEecCCCCCcceEEeecC-ChHHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHHHHH Confidence 6543222 12233334444 3678999998776 45677888999999888775322 11111124668888777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCcceEEEeeccccccCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhc Q lcl|NC_019408. 395 NEQSLLLNIIQACESGMTDVVRWWLMWRDVPLADTENLRYEVNTDFLSTPIGAREMRAIQLMANDGLLPDPVFYEYMRKA 474 (612) Q Consensus 395 ~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G~is~et~~~~lqr~ 474 (612) ............+..++.+++++++.+.|.... ..++.|.+++ ..+.+ ..+.++++..+ +|.||++|++..| T Consensus 362 ~l~~k~~~~~~~~~~~l~~~~~li~~~~g~~~~-~~~i~i~f~~-~~p~d-~~e~a~~~~kl--~g~iS~et~~~~l--- 433 (478) T protein:vir:10 362 NLDLKANKLKNKTLTALQELLQYIIDFYRLDVK-VQDIEITFNF-NVMVN-ELENSQIAMNS--TGLLSKETILSNH--- 433 (478) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcc-cccceEEecC-CCCCC-HHHHHHHHHHH--hCCCChHHHHHhC--- Confidence 777777788888899999999999999987543 3456666543 23333 24456666655 8999999998866 Q ss_pred CccchhhhhHHHHHHhhccccccccc--hhHHhhhhhhHHHHhHHHHHHHHHHHHHHHH Q lcl|NC_019408. 475 EVISSDMTFEEFQALRADENSFINNP--DAQARQRGYTNRGQELEQSRMAREADFTQQK 531 (612) Q Consensus 475 ~vl~~~~~~eee~~ria~e~~~~~~~--~~~~~~~~e~~r~~~~e~~r~~~e~e~~~q~ 531 (612) +.+. +++++.++|.+|.....+. 
+.....-.+.+.+.+..+ .+ T Consensus 434 ~~v~---D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~ 478 (478) T protein:vir:10 434 AWVE---DPVAEMERIEQENIELNQQLPDIEEGLNGEQQRQSENNQ-----------PE 478 (478) T ss_pred CCCC---CHHHHHHHHHHHHHHHHhhccccccccCCCCCCCCCCCC-----------CC Confidence 3332 3556677776542210000 000000000000000000 00 No 30 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=99.90 E-value=3.2e-23 Score=143.89 Aligned_cols=441 Identities=11% Similarity=0.028 Sum_probs=241.1 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeecC Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKNL 80 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~~ 80 (612) ||.|= .....++++++.+.|.|...+-. +.....+ .++.. +=+-.|+.+.+++.++|++|.+||++..- T Consensus 48 ~i~~~-~~~~~~r~~~l~~Yy~g~~~il~-------~~~~~~~-~~~~~--~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~ 116 (511) T protein:vir:93 48 YIEHH-MDYQRPRLKVLSDYYEGKTKNLV-------ELTRRKE-EYMAD--NRVAHDYASYISDFINGYFLGNPIQYQDD 116 (511) T ss_pred HHHHH-HHhhHHHHHHHHHHhcccCcccc-------ccCcCcc-cccCc--ceeecchHHHHHHHHhhhhcccCeeeccC Confidence 33321 12346778888888888654321 1111111 11111 11447999999999999999999999544 Q ss_pred CHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhc-chhhhccCCcccee Q lcl|NC_019408. 81 PPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILD-WDEVVDMGGFYVPS 159 (612) Q Consensus 81 p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~Iin-W~~~~~v~g~~~Lt 159 (612) .+.....+.++-.. ++++.+...+.+.++.+|+++++|.... ..+|-+..++|.+++- |+ ..+.+ ..+. 
T Consensus 117 d~~~~~~l~~~~~~-n~~~~~~~~~~~~~~~~G~ay~~vy~de------~~~~~i~~~~p~~~~~vyd--d~~~~-~~~~ 186 (511) T protein:vir:93 117 DKDVLEVIEAFNDL-NDVESHNRSLGLDLSIYGKAYELMIRNQ------DDETRLYKSDAMSTFVIYD--NTIER-NSIA 186 (511) T ss_pred ChHHHHHHHHHHhh-cCHhHHHHHHHHHHHhcCeeEEEEEeCC------CCceEEEEEccceeEEEEc--CCCCC-ceEE Confidence 55666667776544 5899999999999999999999997643 2468889999999875 32 11222 2334 Q ss_pred EEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccceeEE Q lcl|NC_019408. 160 RVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAYVQ 239 (612) Q Consensus 160 ~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~ 239 (612) .|++...-.. ++.....+. +..+... -.+|+. T Consensus 187 ~vr~~~~~~~-------~~~~~~~~~-~~~iyt~---------------------~~i~~~------------------- 218 (511) T protein:vir:93 187 GVRYLRTKPI-------DKTDEDEVF-TVDLFTS---------------------HGVYRY------------------- 218 (511) T ss_pred EEEEEEeeec-------cccccceEE-EEEEEeC---------------------CcEEEE------------------- Confidence 3433221000 000000000 0000000 011111 Q ss_pred EEEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhccceeeeecC Q lcl|NC_019408. 240 YLYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALPVYYAPGT 319 (612) Q Consensus 240 ~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~ 319 (612) ...++..............++|+.||||.+.. +.+ +.+-|.++..|-=+.-...|+..+.+...+.|+++++|. T Consensus 219 --~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n--n~~--g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~ 292 (511) T protein:vir:93 219 --LTSRTNGLKLTPRENGFESHSFERMPITEFSN--NER--RKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGN 292 (511) T ss_pred --EecCCCccccccccccccccCCCccceEEecC--CCC--CCCchhhHHHHHHHHHHHHHHHHHHHHHhhCcceeeecC Confidence 11111111111111222346799999997753 222 223344444443344567788999999999999999996 Q ss_pred CCCCCceE--------EEe------ccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHh-hhccccchhH Q lcl|NC_019408. 
320 DSEGTGEY--------HIG------PNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRM-MPGASKSVSE 384 (612) Q Consensus 320 ~~~~~~~l--------~iG------~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~l-l~~~~~~~~e 384 (612) .....+.+ .++ ....+....+++++||..+.+ .+..+..++.+.+.|..+..-. +......++- T Consensus 293 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~~~~~~~~~~L~~~I~~~s~~P~~~~~~~~~n~ 371 (511) T protein:vir:93 293 LNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYD-VQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQ 371 (511) T ss_pred cccCchhhcccccccceecccccccccccccCCCCcceeEEeecCC-HHHHHHHHHHHHHHHHHHhCCcccccccccccc Confidence 43322221 111 111223456789999997654 4667888899999887765322 1111122466 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCC-----CcceEEEeeccccccCCCHHHHHHHHHHHHc Q lcl|NC_019408. 385 SNNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLAD-----TENLRYEVNTDFLSTPIGAREMRAIQLMAND 459 (612) Q Consensus 385 sa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~-----~~~~~v~ln~dF~~~~~d~~~~~al~~~~~~ 459 (612) ||++..................+..++.+.+++++.+++..... -.++.|.+++ ..+.+ .++.++++..+ . T Consensus 372 Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~~~f~~-~~p~n-~~e~~~~~~kl--~ 447 (511) T protein:vir:93 372 SGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTWSIDANKDFNTVRYVYNR-NLPKS-LIEELKAYIDS--G 447 (511) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccccccceEEeCC-CCCCC-HHHHHHHHHHH--h Confidence 88888888887777778888889999999999998886643211 1134555432 22333 35567777776 6 Q ss_pred CCCCHHHHHHHHHhcCccchhhhhHHHHHHhhccccccccchhHHh--hhhhhHHHHhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 460 GLLPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFINNPDAQAR--QRGYTNRGQELEQSRMAREADFTQQKIDIQE 536 (612) Q Consensus 460 G~is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~--~~~e~~r~~~~e~~r~~~e~e~~~q~~e~~~ 536 (612) |.||.+|++..+ +.++ ++++|.++|.+|...... ..+.. 
..+......+.+ +..+..+.+++ T Consensus 448 g~iS~et~~~~l---~~v~---d~~~E~~ri~~E~~~~~~-~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~ 511 (511) T protein:vir:93 448 GKISQTTLMSLF---SFFQ---DPELEVKKIEEDEKESIK-KAQKGIYKDPRDINDDEQD--------DDTKDTVDKKE 511 (511) T ss_pred ccCchHHHHHhC---CCCC---CHHHHHHHHHHHHHHHHH-HHhhhcccCCCCCCCCCCC--------CcccccccccC Confidence 999999998865 2222 345667777665321000 00000 000000000000 00000000000 No 31 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=99.90 E-value=7.9e-24 Score=147.25 Aligned_cols=436 Identities=11% Similarity=0.028 Sum_probs=236.5 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeecC Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKNL 80 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~~ 80 (612) ||.+ +....++...+.+.|.|...+..+. .|. ...+. .+..+-..-+-.|+.+.+++..+|++|.+||++..- T Consensus 35 ~i~~--~~~~~~~~~~l~~Yy~g~~~i~~~~-~~~-~~~~~---~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~ 107 (474) T protein:vir:96 35 LINN--HKQKLKDINVGQKYYDKDNDINYQA-YKQ-DLHGN---IDYTKPDWRITTNFHQNLVDQKVSYVAGKPVTYAHD 107 (474) T ss_pred HHHH--HHHHHHHHHHHHHHhcccCcccccc-chh-hhccc---ccccccccccccchHHHHHHhhhhhhcccCceeccC Confidence 5554 3456677788888888876544321 110 00011 111111112346999999999999999999999533 Q ss_pred CHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhcchhhhccCCccceeE Q lcl|NC_019408. 81 PPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILDWDEVVDMGGFYVPSR 160 (612) Q Consensus 81 p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW~~~~~v~g~~~Lt~ 160 (612) .+.....+.++- +++++.....+.+.++.+|+++++|-.... .+|-+..++|++++=. +...+. ..+.. 
T Consensus 108 ~~~~~~~l~~~~--~n~~~~~~~~l~~~~~~~G~~~~~~~~d~~------~~~~i~~~~p~~~~~v-~d~~~~--~~~~a 176 (474) T protein:vir:96 108 DDKVLDVIHQVL--DTRWDNKLIDILTAASNKGIDWLQVYINED------GELKLFRVPAEQAIPI-WTDKER--EQLNA 176 (474) T ss_pred ChHHHHHHHHHH--hccHHHHHHHHHHHHhhCCeEEEEeeeCCC------CceEEEEEcccceEEE-EcCCCC--CceEE Confidence 344455555552 367999999999999999999999976432 4688888999987732 111111 12222 Q ss_pred EEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccceeEEE Q lcl|NC_019408. 161 VLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAYVQY 240 (612) Q Consensus 161 v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~~ 240 (612) + +|.. .. + ....+ .+... .++++.... .+.. .. T Consensus 177 ~-ir~~-~~-------~-----~~~~~-~vy~~---------------------~~i~~~~~~------~~~~-----~~ 209 (474) T protein:vir:96 177 F-IRIF-TF-------N-----GETKV-EYWTA---------------------ETVTYYVYE------NGGL-----IP 209 (474) T ss_pred E-EEEE-ee-------c-----CeeEE-EEEeC---------------------CeEEEEEEc------CCce-----ee Confidence 2 2221 11 0 00001 00000 011111000 0000 00 Q ss_pred EEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhccceeeeecCC Q lcl|NC_019408. 241 LYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALPVYYAPGTD 320 (612) Q Consensus 241 ~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~~ 320 (612) ........ .......++++.||||.+.... . +.+-|.++..|-=+.=...|+..+.+.+.++|+++++|.+ T Consensus 210 ~~~~~~~~-----~~~~~~~~~~~~vPvv~~~nn~--~--~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~ 280 (474) T protein:vir:96 210 DFYYGDEH-----IQTHFSTGSWERVPFIAFKNNP--E--EVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILRGYE 280 (474) T ss_pred cccccccc-----ccCcccccCCCccceEEecCCC--C--CCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCC Confidence 00000000 0111223579999999885422 1 1222222222211222355777888889999999999976 Q ss_pred CCCCce--EEEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHh-hhccccchhHHHHHHHHHHHHHH Q lcl|NC_019408. 
321 SEGTGE--YHIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRM-MPGASKSVSESNNQTVLREANEQ 397 (612) Q Consensus 321 ~~~~~~--l~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~l-l~~~~~~~~esa~~~~~~~~~~~ 397 (612) .+.... -.+....++.++.|++++|+..+.+. +..+..++.+.+.|..++.-+ +...+..++-||++......... T Consensus 281 ~~~~~~~~~~~~~~~~i~~~~~~~~~~l~~~~~~-~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~ 359 (474) T protein:vir:96 281 GEDLSEFMEGLKYYKAINVSSDGGVETIQVEVPV-ASTKEYLDMMRAYIVEFGQGVDFQTDKFGSATSGIALKFLYTNLN 359 (474) T ss_pred cccccchhhhhhccceeeccCCCceeEEeccCCH-HHHHHHHHHHHHHHHHHhCCcCccccccccccHHHHHHHHHHHHH Confidence 543222 22345567778999999999977654 667889999999998775322 11112224567777776666666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCcceEEEeeccccccCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCcc Q lcl|NC_019408. 398 SLLLNIIQACESGMTDVVRWWLMWRDVPLADTENLRYEVNTDFLSTPIGAREMRAIQLMANDGLLPDPVFYEYMRKAEVI 477 (612) Q Consensus 398 s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G~is~et~~~~lqr~~vl 477 (612) ......-..+..++.+.+++++.+.|... +..++.|.+++.. +.+ ..+.++ .+.++|.||++|++..+ +.+ T Consensus 360 ~k~~~~~~~~~~~l~~~~~~i~~~~g~~~-d~~~i~i~f~~~~-p~~-~~e~a~---~~~~~giiS~et~~~~l---p~v 430 (474) T protein:vir:96 360 LKANKLKNKANVALQELMQFILDFNKIKL-DAKEIEITFNFNV-MVN-DLEQSQ---IGAQSQYLSKETLVRHH---PWV 430 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhCCCc-ccceeeEEecCCC-ccC-HHHHHH---HHHHcCCCChHHHHHhC---CCC Confidence 66677777889999999999999998754 3445566554322 222 122233 34568999999998765 332 Q ss_pred chhhhhHHHHHHhhcccccc-ccchhHHhhhhhhHHHHhHHHHHHHHHHHHHHHH Q lcl|NC_019408. 478 SSDMTFEEFQALRADENSFI-NNPDAQARQRGYTNRGQELEQSRMAREADFTQQK 531 (612) Q Consensus 478 ~~~~~~eee~~ria~e~~~~-~~~~~~~~~~~e~~r~~~~e~~r~~~e~e~~~q~ 531 (612) . ++++|.++|.+|.... .+....... 
+....+++ .+.+..+.+ T Consensus 431 ~---D~~~E~eri~~E~~~~~~~~~~~~~~-~~~~~~~~-------~~~~~~e~~ 474 (474) T protein:vir:96 431 D---DPKAELERLDEEQLELNKQLPNLDDG-GADGAQQQ-------QQSENNQSK 474 (474) T ss_pred C---CHHHHHHHHHHHHHHHHhhccccccc-cCCCCCCc-------CCCCccccC Confidence 2 4566677776552110 000000000 00000000 000000000 No 32 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=99.90 E-value=7.9e-24 Score=147.25 Aligned_cols=436 Identities=11% Similarity=0.028 Sum_probs=236.5 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeecC Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKNL 80 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~~ 80 (612) ||.+ +....++...+.+.|.|...+..+. .|. ...+. .+..+-..-+-.|+.+.+++..+|++|.+||++..- T Consensus 35 ~i~~--~~~~~~~~~~l~~Yy~g~~~i~~~~-~~~-~~~~~---~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~ 107 (474) T protein:vir:95 35 LINN--HKQKLKDINVGQKYYDKDNDINYQA-YKQ-DLHGN---IDYTKPDWRITTNFHQNLVDQKVSYVAGKPVTYAHD 107 (474) T ss_pred HHHH--HHHHHHHHHHHHHHhcccCcccccc-chh-hhccc---ccccccccccccchHHHHHHhhhhhhcccCceeccC Confidence 5554 3456677788888888876544321 110 00011 111111112346999999999999999999999533 Q ss_pred CHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhcchhhhccCCccceeE Q lcl|NC_019408. 81 PPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILDWDEVVDMGGFYVPSR 160 (612) Q Consensus 81 p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW~~~~~v~g~~~Lt~ 160 (612) .+.....+.++- +++++.....+.+.++.+|+++++|-.... .+|-+..++|++++=. +...+. ..+.. 
T Consensus 108 ~~~~~~~l~~~~--~n~~~~~~~~l~~~~~~~G~~~~~~~~d~~------~~~~i~~~~p~~~~~v-~d~~~~--~~~~a 176 (474) T protein:vir:95 108 DDKVLDVIHQVL--DTRWDNKLIDILTAASNKGIDWLQVYINED------GELKLFRVPAEQAIPI-WTDKER--EQLNA 176 (474) T ss_pred ChHHHHHHHHHH--hccHHHHHHHHHHHHhhCCeEEEEeeeCCC------CceEEEEEcccceEEE-EcCCCC--CceEE Confidence 344455555552 367999999999999999999999976432 4688888999987732 111111 12222 Q ss_pred EEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccceeEEE Q lcl|NC_019408. 161 VLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAYVQY 240 (612) Q Consensus 161 v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~~ 240 (612) + +|.. .. + ....+ .+... .++++.... .+.. .. T Consensus 177 ~-ir~~-~~-------~-----~~~~~-~vy~~---------------------~~i~~~~~~------~~~~-----~~ 209 (474) T protein:vir:95 177 F-IRIF-TF-------N-----GETKV-EYWTA---------------------ETVTYYVYE------NGGL-----IP 209 (474) T ss_pred E-EEEE-ee-------c-----CeeEE-EEEeC---------------------CeEEEEEEc------CCce-----ee Confidence 2 2221 11 0 00001 00000 011111000 0000 00 Q ss_pred EEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhccceeeeecCC Q lcl|NC_019408. 241 LYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALPVYYAPGTD 320 (612) Q Consensus 241 ~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~~ 320 (612) ........ .......++++.||||.+.... . +.+-|.++..|-=+.=...|+..+.+.+.++|+++++|.+ T Consensus 210 ~~~~~~~~-----~~~~~~~~~~~~vPvv~~~nn~--~--~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~ 280 (474) T protein:vir:95 210 DFYYGDEH-----IQTHFSTGSWERVPFIAFKNNP--E--EVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILRGYE 280 (474) T ss_pred cccccccc-----ccCcccccCCCccceEEecCCC--C--CCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCC Confidence 00000000 0111223579999999885422 1 1222222222211222355777888889999999999976 Q ss_pred CCCCce--EEEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHh-hhccccchhHHHHHHHHHHHHHH Q lcl|NC_019408. 
321 SEGTGE--YHIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRM-MPGASKSVSESNNQTVLREANEQ 397 (612) Q Consensus 321 ~~~~~~--l~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~l-l~~~~~~~~esa~~~~~~~~~~~ 397 (612) .+.... -.+....++.++.|++++|+..+.+. +..+..++.+.+.|..++.-+ +...+..++-||++......... T Consensus 281 ~~~~~~~~~~~~~~~~i~~~~~~~~~~l~~~~~~-~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~ 359 (474) T protein:vir:95 281 GEDLSEFMEGLKYYKAINVSSDGGVETIQVEVPV-ASTKEYLDMMRAYIVEFGQGVDFQTDKFGSATSGIALKFLYTNLN 359 (474) T ss_pred cccccchhhhhhccceeeccCCCceeEEeccCCH-HHHHHHHHHHHHHHHHHhCCcCccccccccccHHHHHHHHHHHHH Confidence 543222 22345567778999999999977654 667889999999998775322 11112224567777776666666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCcceEEEeeccccccCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCcc Q lcl|NC_019408. 398 SLLLNIIQACESGMTDVVRWWLMWRDVPLADTENLRYEVNTDFLSTPIGAREMRAIQLMANDGLLPDPVFYEYMRKAEVI 477 (612) Q Consensus 398 s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G~is~et~~~~lqr~~vl 477 (612) ......-..+..++.+.+++++.+.|... +..++.|.+++.. +.+ ..+.++ .+.++|.||++|++..+ +.+ T Consensus 360 ~k~~~~~~~~~~~l~~~~~~i~~~~g~~~-d~~~i~i~f~~~~-p~~-~~e~a~---~~~~~giiS~et~~~~l---p~v 430 (474) T protein:vir:95 360 LKANKLKNKANVALQELMQFILDFNKIKL-DAKEIEITFNFNV-MVN-DLEQSQ---IGAQSQYLSKETLVRHH---PWV 430 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhCCCc-ccceeeEEecCCC-ccC-HHHHHH---HHHHcCCCChHHHHHhC---CCC Confidence 66677777889999999999999998754 3445566554322 222 122233 34568999999998765 332 Q ss_pred chhhhhHHHHHHhhcccccc-ccchhHHhhhhhhHHHHhHHHHHHHHHHHHHHHH Q lcl|NC_019408. 478 SSDMTFEEFQALRADENSFI-NNPDAQARQRGYTNRGQELEQSRMAREADFTQQK 531 (612) Q Consensus 478 ~~~~~~eee~~ria~e~~~~-~~~~~~~~~~~e~~r~~~~e~~r~~~e~e~~~q~ 531 (612) . ++++|.++|.+|.... .+....... 
+....+++ .+.+..+.+ T Consensus 431 ~---D~~~E~eri~~E~~~~~~~~~~~~~~-~~~~~~~~-------~~~~~~e~~ 474 (474) T protein:vir:95 431 D---DPKAELERLDEEQLELNKQLPNLDDG-GADGAQQQ-------QQSENNQSK 474 (474) T ss_pred C---CHHHHHHHHHHHHHHHHhhccccccc-cCCCCCCc-------CCCCccccC Confidence 2 4566677776552110 000000000 00000000 000000000 No 33 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=99.90 E-value=2.8e-24 Score=149.67 Aligned_cols=444 Identities=11% Similarity=0.016 Sum_probs=240.3 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeecC Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKNL 80 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~~ 80 (612) ||.| ......++++++.+.|.|...+-.. + ......|+.. .| +-.|+.+-+|+.++|++|.+||++..- T Consensus 48 ~i~~-~~~~~~~r~~~l~~YY~g~~~i~~~-----~---~~~~~~~~~~-~k-i~~n~~k~Ivd~~~~yl~g~p~~~~~~ 116 (512) T protein:vir:97 48 YIEH-HMDYQRPRLKVLSDYYEGKTKNLVE-----L---TRRKEEYMAD-NR-VAHDYASYISDFINGYFLGNPIQCQDD 116 (512) T ss_pred HHHH-HHHhhHHHHHHHHHHhcccCccccc-----c---CcccccccCc-ce-eecchHHHHHHHHhhhhcccCceeccC Confidence 3332 1123467788888888886543111 1 1111111111 12 347999999999999999999999544 Q ss_pred CHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhcchhhhccCCccceeE Q lcl|NC_019408. 81 PPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILDWDEVVDMGGFYVPSR 160 (612) Q Consensus 81 p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW~~~~~v~g~~~Lt~ 160 (612) .+.....+.++-.. ++++.....+.+.++.+|+++++|-... ..+|-+..++|.+++-. 
+...+.+ ..+.- T Consensus 117 d~~~~~~l~~~~~~-n~~~~~~~~~~~~~~i~G~ay~~vy~de------d~~~~i~~~~p~~~~~i-yd~~~~~-~~~~~ 187 (512) T protein:vir:97 117 DKDVLEAIEAFNDL-NDVESHNRSLGLDLSIYGKAYELMIRNQ------DDETRLYKSDAMSTFVI-YDNTIER-NSIAG 187 (512) T ss_pred ChHHHHHHHHHHhh-cCHHHHHHHHHHHHHhcCeEEEEEEeCC------CCceEEEEEcccceEEE-EcCCCCC-ceEEE Confidence 45566667777544 6999999999999999999999997543 24688899999998763 1112221 22333 Q ss_pred EEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccceeEEE Q lcl|NC_019408. 161 VLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAYVQY 240 (612) Q Consensus 161 v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~~ 240 (612) |++...-.. +..+ .. ..++..+... -.+|+. T Consensus 188 vr~~~~~~~----~~~~---~~-~~~~~~vyt~---------------------~~i~~~-------------------- 218 (512) T protein:vir:97 188 VRYLRTKPI----DKTD---ED-EVFTVDLFTS---------------------HGVYRY-------------------- 218 (512) T ss_pred EEEEEeeec----cccc---cc-eEEEEEEEeC---------------------CcEEEE-------------------- Confidence 333221000 0000 00 0000000000 011111 Q ss_pred EEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhccceeeeecCC Q lcl|NC_019408. 241 LYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALPVYYAPGTD 320 (612) Q Consensus 241 ~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~~ 320 (612) ...+++.............++|+.||||.+.. +.+ +.+-|.++..|-=+.-...|++.+.+.+.+.|+++++|.. 
T Consensus 219 -~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n--n~~--~~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~ 293 (512) T protein:vir:97 219 -LTSRTNGLKLTPRENGFESHSFERMPITEFSN--NER--RKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNL 293 (512) T ss_pred -EecCCCcccccccccccccccCcccceEeecC--CCC--CCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCc Confidence 11111111111112233457899999998753 222 2233444444443445667899999999999999999964 Q ss_pred CCCCceEE---------------EeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHh-hhccccchhH Q lcl|NC_019408. 321 SEGTGEYH---------------IGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRM-MPGASKSVSE 384 (612) Q Consensus 321 ~~~~~~l~---------------iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~l-l~~~~~~~~e 384 (612) ....+.+. .+....+..+.|++++|+..+- ..+..+..++.+.+.|...+.-. +......++- T Consensus 294 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~-~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~gn~ 372 (512) T protein:vir:97 294 NLDPVEVRKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQY-DVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQ 372 (512) T ss_pred cCCchhhhhhhhcccccccccchhhcccccCCCCCcceEEEeecC-CHHHHHHHHHHHHHHHHHHhCCcccCcccccccc Confidence 43322111 1112222345678899999764 34567788889998887765322 1111122456 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCC-----CcceEEEeeccccccCCCHHHHHHHHHHHHc Q lcl|NC_019408. 385 SNNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLAD-----TENLRYEVNTDFLSTPIGAREMRAIQLMAND 459 (612) Q Consensus 385 sa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~-----~~~~~v~ln~dF~~~~~d~~~~~al~~~~~~ 459 (612) ||++...............-..+..++.+.+++++.+++..... -.++.|.+++.. +.+ ..+.++++..+ . 
T Consensus 373 Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~-p~~-~~e~~~~~~kl--~ 448 (512) T protein:vir:97 373 SGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNL-PKS-LIEELKAYIDS--G 448 (512) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccccccccceEEeCCCC-CcC-HHHHHHHHHHH--h Confidence 88888877777777778888888999999999999986532211 123455554332 222 34566777766 5 Q ss_pred CCCCHHHHHHHHHhcCccchhhhhHHHHHHhhccccccccchhHHhhhhhhHHHHhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 460 GLLPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFINNPDAQARQRGYTNRGQELEQSRMAREADFTQQKIDIQERSV 539 (612) Q Consensus 460 G~is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~e~~r~~~e~e~~~q~~e~~~r~~ 539 (612) |.||++|++..| +.++ ++++|.++|.+|...... ..+.. .. . +.......++..+. T Consensus 449 giiS~et~~~~l---~~v~---d~~~E~eri~~E~~~~~~-~~~~~--~~---~------------~~~~~~~~~~~~~~ 504 (512) T protein:vir:97 449 GKISQTTLMSLF---SFFQ---DPELEVKKIEEDEKESIK-KAQKG--IY---K------------DPRDINDDEQDDDT 504 (512) T ss_pred ccCchHHHHHhC---CCCC---CHHHHHHHHHHHHHHHHH-HHhhc--cc---C------------CCCCCCCCCCCCCc Confidence 999999998876 3332 355666776655211000 00000 00 0 00000000000000 Q ss_pred HHHHHHHH Q lcl|NC_019408. 540 AVQEGHAE 547 (612) Q Consensus 540 ~~~~~r~~ 547 (612) ++.+.+++ T Consensus 505 ~~~~~~~~ 512 (512) T protein:vir:97 505 KDTVDKKE 512 (512) T ss_pred cccccccC Confidence 00000000 No 34 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=99.90 E-value=1.5e-23 Score=145.64 Aligned_cols=437 Identities=11% Similarity=0.032 Sum_probs=242.7 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeecC Q lcl|NC_019408. 
1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKNL 80 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~~ 80 (612) ||. .+....+++.++.+.|.|...+..+...+-... ..+...-..=+-.|+.+.+|+..+|++|.+||++..- T Consensus 34 ~i~--~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~-----~~~~~~~~~ki~~n~~k~ivd~~~~yl~g~p~~~~~~ 106 (478) T protein:vir:10 34 LVR--EHKENIDNITMGERYYNHHPDILDAPFKRDVNG-----DYDETKPDWRMYTNYHQNLVDQKVAYAVANPVTFGVD 106 (478) T ss_pred HHH--HHHHHHHHHHHHHHHhcccccccccchhhhccc-----ccccccccceeccchHHHHHHHHhhhhcccCceeecC Confidence 332 445567788888888888755433221111100 0000000001236999999999999999999999533 Q ss_pred CHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhc-chhhhccCCcccee Q lcl|NC_019408. 81 PPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILD-WDEVVDMGGFYVPS 159 (612) Q Consensus 81 p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~Iin-W~~~~~v~g~~~Lt 159 (612) .+.....+.++- +++++.....+.+.++.+|+++++|.+... .+|-+..++|.+++- |+ ..+. ..+. T Consensus 107 ~~~~~~~l~~~~--~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~------~~~~~~~~~p~~~~~v~d--~~~~--~~~~ 174 (478) T protein:vir:10 107 NDKALKQIQHTL--NHKWDDKLVDILTAASNKGIEWVQPYVDEE------GEFKTFRVPAEQAVPIWT--NKER--DELQ 174 (478) T ss_pred ChHHHHHHHHHH--hccHHHHHHHHHHHHhhCCeEEEEEEecCC------CceEEEEEcccceEEEEc--CCCC--CceE Confidence 344444444432 268999999999999999999999977542 478899999999865 32 1112 2243 Q ss_pred EEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccceeEE Q lcl|NC_019408. 160 RVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAYVQ 239 (612) Q Consensus 160 ~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~ 239 (612) .++ +....+ + ...+ .+...+ .+..|+.... . 
..+ T Consensus 175 ~~i-r~~~~~-----~--------~~~~-~~y~~~-------------------~i~~~~~~~~--------~----~~~ 208 (478) T protein:vir:10 175 AFI-RVYELD-----G--------AERV-EYWTKD-------------------DVTFYELKEG--------Q----LIP 208 (478) T ss_pred EEE-EEEeee-----C--------ceEE-EEEeCC-------------------cEEEEEecCC--------e----eec Confidence 332 221111 0 0011 000000 0111221000 0 000 Q ss_pred EEEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhccceeeeecC Q lcl|NC_019408. 240 YLYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALPVYYAPGT 319 (612) Q Consensus 240 ~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~ 319 (612) .......+ . ..........++|+.||||.+... .. +.+-|.++..|-=..-...|+..+.+.+.++|+++++|. T Consensus 209 ~~~~~~~~-~-~~~~~~~~~~~~~g~vPvv~~~n~--~~--g~sd~e~v~~liDa~~~~~S~~~~~~~~~~~~~~~~~g~ 282 (478) T protein:vir:10 209 DFYRSEDH-I-QPHYYQGNKLMSWGRVPFIPFKNN--PQ--EVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILKGY 282 (478) T ss_pred cccccccc-c-ccceecccccccCCcceEEEeccC--CC--CCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCcceeeecC Confidence 00000000 0 000111122367999999988542 22 333455555554455567788888999999999999998 Q ss_pred CCCCCceE--EEeccccccC--CCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHh-hhccccchhHHHHHHHHHHH Q lcl|NC_019408. 320 DSEGTGEY--HIGPNMVWEV--PQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRM-MPGASKSVSESNNQTVLREA 394 (612) Q Consensus 320 ~~~~~~~l--~iG~~~~~~l--p~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~l-l~~~~~~~~esa~~~~~~~~ 394 (612) +.+..... .+....++.+ +.|++++|+..+.. .+.....++.+.+.|..++.-. +...+..++-||.+...... 
T Consensus 283 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~ 361 (478) T protein:vir:10 283 EGEDMKDFMHNLKYYKAISVAGESGSGVDTIKVEVP-IDSVKEYTKMLRDYIIEFGQGVDFQQDKFGNSPSGIALKFMYS 361 (478) T ss_pred CcccccchhhhhhhCceeEecCCCCCcceEEeecCC-HHHHHHHHHHHHHHHHHHhCCcCcCccccccchHHHHHHHHHH Confidence 65432221 2233344444 46789999998764 4667888999999888775322 11112234678888888888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCcceEEEeeccccccCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhc Q lcl|NC_019408. 395 NEQSLLLNIIQACESGMTDVVRWWLMWRDVPLADTENLRYEVNTDFLSTPIGAREMRAIQLMANDGLLPDPVFYEYMRKA 474 (612) Q Consensus 395 ~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G~is~et~~~~lqr~ 474 (612) ............+..++.+++++++.+.|... +..++.|.+++ ..+.+ ..+.++.+..+ +|.||++|++..+ T Consensus 362 ~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~-d~~~i~i~f~~-~~p~~-~~e~~~~~~~~--~g~iS~et~i~~~--- 433 (478) T protein:vir:10 362 NLDLKANKLKNKTLTALQELLQYIIDFYRLDV-RVQDIEITFNF-NVMVN-ELENSQIAMNS--TGLLSKETILGNH--- 433 (478) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCc-ccccceEEeCC-CCCCC-HHHHHHHHHHH--hCCCChHHHHHhC--- Confidence 88888889999999999999999999999765 33456666643 22322 23445555544 7999999998754 Q ss_pred CccchhhhhHHHHHHhhccccccccchhHHhh--hhhhHHHHhHHHHHHHHHHHHHHHH Q lcl|NC_019408. 475 EVISSDMTFEEFQALRADENSFINNPDAQARQ--RGYTNRGQELEQSRMAREADFTQQK 531 (612) Q Consensus 475 ~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~--~~e~~r~~~~e~~r~~~e~e~~~q~ 531 (612) +.+. ++.++.++|.+|..... .+... ......+ .... 
+-.+.| T Consensus 434 ~~v~---d~~~E~~ri~~E~~~~~---~~~~~~~~~~~d~~------~~~~--~d~~~e 478 (478) T protein:vir:10 434 SWVQ---DPVAEMERIEQENIELN---QQLPDIEEGLNDEQ------QRQS--EDNQSE 478 (478) T ss_pred CCCC---CHHHHHHHHHHHHHHHH---HhccccCCCCcccc------cccC--cCCCCC Confidence 2222 34555666654421100 00000 0000000 0000 000000 No 35 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=99.90 E-value=1e-23 Score=146.67 Aligned_cols=443 Identities=10% Similarity=0.011 Sum_probs=237.2 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeecC Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKNL 80 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~~ 80 (612) ||.| -....+++++++.+.|.|...+- +.+......++... + +-.|+.+-+++.++|++|.+||++..- T Consensus 48 ~i~~-~~~~~~~r~~~l~~Yy~g~~~i~--------~~~~~~~~~~~~~~-k-i~~n~~k~Iv~~~~~yl~g~p~~~~~~ 116 (511) T protein:vir:10 48 CIEH-HMDYQRPRLKVLSDYYEGKTKNL--------VELTRRKEEYMADN-R-VAHDYASYISDFINGYFLGNPIQYQDD 116 (511) T ss_pred HHHH-HHHhhHHHHHHHHHHhcccCccc--------cccCcccccccCcc-e-eecchHHHHHHHHhhhhcccCceeecC Confidence 2221 01123577888888888865431 11111111122111 2 236999999999999999999999544 Q ss_pred CHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhcchhhhccCCccceeE Q lcl|NC_019408. 81 PPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILDWDEVVDMGGFYVPSR 160 (612) Q Consensus 81 p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW~~~~~v~g~~~Lt~ 160 (612) .+.+...+.++-.. ++++.....+...++.+|+++++|-... ..+|-+..++|.+++-. +...+. ...+.. 
T Consensus 117 d~~~~~~l~~~~~~-n~~~~~~~~~~~~~~i~G~ay~~vy~de------dg~~~i~~~~p~~~~~v-ydd~~~-~~~~~~ 187 (511) T protein:vir:10 117 DKDVLEAIEAFNDL-NDVESHNRSLGLDLSIYGKAYEIMIRNQ------DDETRLYKSDAMSTFVI-YDNTIE-RNSIAG 187 (511) T ss_pred chHHHHHHHHHHhh-cCHHHHHHHHHHHHHhcCeeEEEEEeCC------CCceEEEEEccceeEEE-EcCCCC-CceEEE Confidence 56666777777555 5899999999999999999999996533 24688888888887752 111111 223333 Q ss_pred EEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccceeEEE Q lcl|NC_019408. 161 VLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAYVQY 240 (612) Q Consensus 161 v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~~ 240 (612) |+....-.. +....+. ..+..+...+ .+|+ T Consensus 188 vr~~~~~~~-------d~~~~~~-~~~~~iyt~~---------------------~i~~--------------------- 217 (511) T protein:vir:10 188 VRYLRTKPI-------DKTDEDE-VFTVDLFTSH---------------------GVYR--------------------- 217 (511) T ss_pred EEEEEeeec-------ccCccce-EEEEEEEeCC---------------------cEEE--------------------- Confidence 333221000 0000000 0000010000 1111 Q ss_pred EEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhccceeeeecCC Q lcl|NC_019408. 241 LYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALPVYYAPGTD 320 (612) Q Consensus 241 ~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~~ 320 (612) ....+++.............++|+.||||.+... .+ +.+-|.++..|-=+.-...|++.+.+...+.|+++++|.. T Consensus 218 ~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~f~nn--~~--g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~ 293 (511) T protein:vir:10 218 YLTSRTNGLKLTPRENGFESHSFERMPITEFSNN--ER--RKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNL 293 (511) T ss_pred EEecCCCcccccccccccccccCcceeEEEecCC--CC--CCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeeccc Confidence 1111111111111122234578999999987532 22 2222333333322334567888889999999999999954 Q ss_pred CCCCceE-EEeccccc-------------cCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHh-hhccccchhHH Q lcl|NC_019408. 
321 SEGTGEY-HIGPNMVW-------------EVPQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRM-MPGASKSVSES 385 (612) Q Consensus 321 ~~~~~~l-~iG~~~~~-------------~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~l-l~~~~~~~~es 385 (612) ....+.+ ....++.+ ....|++++||..+.. .+..+..++.+.+.|..+..-. +......++-| T Consensus 294 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~-~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~S 372 (511) T protein:vir:10 294 NLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYD-VQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQS 372 (511) T ss_pred cCCchhhccchhccceecccccccccccccCCCCcceeEEeecCC-HHHHHHHHHHHHHHHHHHhCCcccccccccccch Confidence 3322221 11122222 2345788999987543 3556788888888887764322 11111224568 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCC-----CcceEEEeeccccccCCCHHHHHHHHHHHHcC Q lcl|NC_019408. 386 NNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLAD-----TENLRYEVNTDFLSTPIGAREMRAIQLMANDG 460 (612) Q Consensus 386 a~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~-----~~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G 460 (612) |++...............-..+..++.+.+++++.+++..... -.++.|.+++. .+.++ .+.+.++..+ .| T Consensus 373 g~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~i~f~~~-~p~d~-~~~~~~~~kl--~G 448 (511) T protein:vir:10 373 GEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRN-LPKSL-IEELKAYIDS--GG 448 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcccccccceeeEEeCCC-CCcCH-HHHHHHHHHH--hc Confidence 8888888777777778888888999999999999987643211 12455555433 23332 4567777777 48 Q ss_pred CCCHHHHHHHHHhcCccchhhhhHHHHHHhhccccccccchhHH-hhhhhhHHHHhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 461 LLPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFINNPDAQA-RQRGYTNRGQELEQSRMAREADFTQQKIDIQERSV 539 (612) Q Consensus 461 ~is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~-~~~~e~~r~~~~e~~r~~~e~e~~~q~~e~~~r~~ 539 (612) .||++|++..+ +.++ ++++|.++|.+|........... ...+... + ..++.-+. 
T Consensus 449 ~iS~et~~~~l---~~v~---d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~------------~-------~~~~~~~~ 503 (511) T protein:vir:10 449 KISQTTLMSLF---SFFQ---DPELEVKKIEEDEKESIKKAQKGIYKDPRDI------------N-------DDEQDDDT 503 (511) T ss_pred cCcHHHHHHhC---CCCC---CHHHHHHHHHHHHHHHHHHHhhhcccCCCCC------------C-------CCCCCCcc Confidence 99999998876 2222 34566777765521100000000 0000000 0 00000000 Q ss_pred HHHHHHHH Q lcl|NC_019408. 540 AVQEGHAE 547 (612) Q Consensus 540 ~~~~~r~~ 547 (612) +....+++ T Consensus 504 ~~~~~~~~ 511 (511) T protein:vir:10 504 KDTVDKKE 511 (511) T ss_pred cCcccccC Confidence 00000000 No 36 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=99.90 E-value=1.9e-23 Score=145.13 Aligned_cols=426 Identities=11% Similarity=0.041 Sum_probs=245.5 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeec- Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKN- 79 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~- 79 (612) ||.| +....++++++.+-|.|...+ .|+|+. -.+.|+....+++ .|+.+-+|+..+|++|.+++++.. T Consensus 13 l~~~--~~~~~~r~~~l~~Yy~g~~~i-----~~~~~~---~~~~~~~~~~k~~-~n~~~~ivd~~~~~l~~~~~~~~~~ 81 (456) T protein:vir:10 13 LTKR--IDDGMSRVRLLARYSNGDAPL-----PELTRN---TSAAWRSFQREAR-TNWGLMVRDSVADRIIPNGITVGGS 81 (456) T ss_pred HHHH--HHHHHHHHHHHHHHHhcCCCc-----hhcCcc---cChhhhhhhhhhh-cchHHHHHHHHHhhhccCCeecCCC Confidence 5543 556789999999999986543 244433 3344555444444 699999999999999999988731 Q ss_pred ----CCHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhcchhhhccCCc Q lcl|NC_019408. 80 ----LPPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILDWDEVVDMGGF 155 (612) Q Consensus 80 ----~p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW~~~~~v~g~ 155 (612) ....+..++.. 
|+++.+...++..++.+|+++++|-.... ..|-+..++|.+++-. ++. ..++ T Consensus 82 ~d~~~~~~~~~i~~~-----N~~d~~~~~~~~~a~i~G~ay~~v~~d~~------g~~~i~~~~p~~~~~i-~d~-~~~~ 148 (456) T protein:vir:10 82 ADSDLALRARRIWRD-----NRMDSVCKQWVKYGLDFGESYLTCWRRDD------GTATITADSPETMVVS-VDP-LQPW 148 (456) T ss_pred CCcchHHHHHHHHHh-----cChhhHHHHHHHHHhhcCeeEEEEeeCCC------CceEEEEEccceeEEE-EcC-CCCc Confidence 12235555543 68999999999999999999999854332 4688999999997553 221 2333 Q ss_pred cceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccc Q lcl|NC_019408. 156 YVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKL 235 (612) Q Consensus 156 ~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~ 235 (612) ..+.-|++.+.. |+. ..+.++...++ ..+.||.....+ . T Consensus 149 ~~~~~i~~~~~~---------d~~-----~~~~~~~~~~~------------------~~~~~~~~~~~~---------~ 187 (456) T protein:vir:10 149 RIRAAMRWWRDL---------DAE-----SDFAIVWSGDG------------------WQKFARPCFVQS---------S 187 (456) T ss_pred ceEEEEEEEEec---------CCc-----eeEEEEEeccc------------------eeEEEEEEEEee---------c Confidence 334444443211 100 01111111111 112222110000 0 Q ss_pred eeEEEEEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhccceee Q lcl|NC_019408. 236 AYVQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALPVYY 315 (612) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l~ 315 (612) .. .......++.+... ....+.++.+|++++.. 
.+..+ -+.++..|.=+.-+..||.-....+.++|.++ T Consensus 188 ~~-~~~~~~~~~~~~~~----~~~~~~~~~~pvv~~~N---~~g~g--d~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~ 257 (456) T protein:vir:10 188 SR-RRLVTRISDSWVPV----GDAVVTGSPPPVVVYQN---PDGMG--EVEPHIDIINRINRAELQLLSTMAIQAFRQRA 257 (456) T ss_pred cc-ceeeeecCCceeec----cccCCCCCceeEEEecC---CCCCc--hhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHh Confidence 00 00111111112111 11235688999998853 33332 23444444334445667777888999999999 Q ss_pred eecCCCCC------Cce------EEEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHhhhccc-cch Q lcl|NC_019408. 316 APGTDSEG------TGE------YHIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRMMPGAS-KSV 382 (612) Q Consensus 316 i~G~~~~~------~~~------l~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~ll~~~~-~~~ 382 (612) +.|.+... ... +..+.+..|.+|+|++++.+ +...++...+.++.+..++.....-....-+ ... T Consensus 258 i~G~~~~~~~~d~~g~~~~~~~~~~~~~~~~~~~~~~~~~~q~--~~~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~ 335 (456) T protein:vir:10 258 LKSTEHGLPNVDENGNAIDYASIFEAAPGALWELPPGVDIWES--QANDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSA 335 (456) T ss_pred hhccCcccccccccccccchhhhhhhhccccccCCCCcceEEe--cccChhHHHHHHHHHHHHHHhccCCChHHhccccc Confidence 99964321 111 23456677888888876654 4666777788888888888754321111111 123 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCcceEEEeeccccccCCCHHHHHHHHHHHHcCCC Q lcl|NC_019408. 383 SESNNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLADTENLRYEVNTDFLSTPIGAREMRAIQLMANDGLL 462 (612) Q Consensus 383 ~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G~i 462 (612) +-||++...............-..+..++.+.+++++...|... ...+.+.+. 
+-.+.+ .++.++++.++.++|.+ T Consensus 336 N~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~g~~~--~~~~~v~w~-~~~~~~-~~~~ada~~kl~~~gi~ 411 (456) T protein:vir:10 336 NQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESV--EDTVDVSFE-SPDRVT-LGEKYSAASLAKAAGES 411 (456) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc--ccceeEEec-CCCCcC-HHHHHHHHHHHHHcCCC Confidence 45777777777776666777777888899999999998887432 234455443 333333 36678899999999999 Q ss_pred CHHHHHHHHHhcCccchhhhhHHHHHHhhccccccccchhHHhhhhhhHH Q lcl|NC_019408. 463 PDPVFYEYMRKAEVISSDMTFEEFQALRADENSFINNPDAQARQRGYTNR 512 (612) Q Consensus 463 s~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r 512 (612) |++++++.+ |+.++++ ...+.+++.+|.........+..+ ++-.| T Consensus 412 ~~~~~~~~l---g~~~~~i-~~~e~er~~~e~~~~~~~~~~~~~-~~~~~ 456 (456) T protein:vir:10 412 WASIRRNIL---NYNADQI-KQDDLDRAREQITLFAGNPVQRPQ-EDGSR 456 (456) T ss_pred hHHHHHhhC---CCCHHHH-HHHHHHHHHHHHHHHhhhhhhcCC-CCCCC Confidence 999887654 6644433 233444544432211111111111 11111 No 37 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=99.90 E-value=1.9e-23 Score=145.13 Aligned_cols=426 Identities=11% Similarity=0.041 Sum_probs=245.5 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeec- Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKN- 79 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~- 79 (612) ||.| +....++++++.+-|.|...+ .|+|+. -.+.|+....+++ .|+.+-+|+..+|++|.+++++.. 
T Consensus 13 l~~~--~~~~~~r~~~l~~Yy~g~~~i-----~~~~~~---~~~~~~~~~~k~~-~n~~~~ivd~~~~~l~~~~~~~~~~ 81 (456) T protein:vir:10 13 LTKR--IDDGMSRVRLLARYSNGDAPL-----PELTRN---TSAAWRSFQREAR-TNWGLMVRDSVADRIIPNGITVGGS 81 (456) T ss_pred HHHH--HHHHHHHHHHHHHHHhcCCCc-----hhcCcc---cChhhhhhhhhhh-cchHHHHHHHHHhhhccCCeecCCC Confidence 5543 556789999999999986543 244433 3344555444444 699999999999999999988731 Q ss_pred ----CCHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhcchhhhccCCc Q lcl|NC_019408. 80 ----LPPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILDWDEVVDMGGF 155 (612) Q Consensus 80 ----~p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW~~~~~v~g~ 155 (612) ....+..++.. |+++.+...++..++.+|+++++|-.... ..|-+..++|.+++-. ++. ..++ T Consensus 82 ~d~~~~~~~~~i~~~-----N~~d~~~~~~~~~a~i~G~ay~~v~~d~~------g~~~i~~~~p~~~~~i-~d~-~~~~ 148 (456) T protein:vir:10 82 ADSDLALRARRIWRD-----NRMDSVCKQWVKYGLDFGESYLTCWRRDD------GTATITADSPETMVVS-VDP-LQPW 148 (456) T ss_pred CCcchHHHHHHHHHh-----cChhhHHHHHHHHHhhcCeeEEEEeeCCC------CceEEEEEccceeEEE-EcC-CCCc Confidence 12235555543 68999999999999999999999854332 4688999999997553 221 2333 Q ss_pred cceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccc Q lcl|NC_019408. 156 YVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKL 235 (612) Q Consensus 156 ~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~ 235 (612) ..+.-|++.+.. |+. ..+.++...++ ..+.||.....+ . T Consensus 149 ~~~~~i~~~~~~---------d~~-----~~~~~~~~~~~------------------~~~~~~~~~~~~---------~ 187 (456) T protein:vir:10 149 RIRAAMRWWRDL---------DAE-----SDFAIVWSGDG------------------WQKFARPCFVQS---------S 187 (456) T ss_pred ceEEEEEEEEec---------CCc-----eeEEEEEeccc------------------eeEEEEEEEEee---------c Confidence 334444443211 100 01111111111 112222110000 0 Q ss_pred eeEEEEEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhccceee Q lcl|NC_019408. 
236 AYVQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALPVYY 315 (612) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l~ 315 (612) .. .......++.+... ....+.++.+|++++.. .+..+ -+.++..|.=+.-+..||.-....+.++|.++ T Consensus 188 ~~-~~~~~~~~~~~~~~----~~~~~~~~~~pvv~~~N---~~g~g--d~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~ 257 (456) T protein:vir:10 188 SR-RRLVTRISDSWVPV----GDAVVTGSPPPVVVYQN---PDGMG--EVEPHIDIINRINRAELQLLSTMAIQAFRQRA 257 (456) T ss_pred cc-ceeeeecCCceeec----cccCCCCCceeEEEecC---CCCCc--hhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHh Confidence 00 00111111112111 11235688999998853 33332 23444444334445667777888999999999 Q ss_pred eecCCCCC------Cce------EEEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHhhhccc-cch Q lcl|NC_019408. 316 APGTDSEG------TGE------YHIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRMMPGAS-KSV 382 (612) Q Consensus 316 i~G~~~~~------~~~------l~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~ll~~~~-~~~ 382 (612) +.|.+... ... +..+.+..|.+|+|++++.+ +...++...+.++.+..++.....-....-+ ... T Consensus 258 i~G~~~~~~~~d~~g~~~~~~~~~~~~~~~~~~~~~~~~~~q~--~~~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~ 335 (456) T protein:vir:10 258 LKSTEHGLPNVDENGNAIDYASIFEAAPGALWELPPGVDIWES--QANDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSA 335 (456) T ss_pred hhccCcccccccccccccchhhhhhhhccccccCCCCcceEEe--cccChhHHHHHHHHHHHHHHhccCCChHHhccccc Confidence 99964321 111 23456677888888876654 4666777788888888888754321111111 123 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCcceEEEeeccccccCCCHHHHHHHHHHHHcCCC Q lcl|NC_019408. 383 SESNNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLADTENLRYEVNTDFLSTPIGAREMRAIQLMANDGLL 462 (612) Q Consensus 383 ~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G~i 462 (612) +-||++...............-..+..++.+.+++++...|... ...+.+.+. 
+-.+.+ .++.++++.++.++|.+ T Consensus 336 N~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~g~~~--~~~~~v~w~-~~~~~~-~~~~ada~~kl~~~gi~ 411 (456) T protein:vir:10 336 NQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESV--EDTVDVSFE-SPDRVT-LGEKYSAASLAKAAGES 411 (456) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc--ccceeEEec-CCCCcC-HHHHHHHHHHHHHcCCC Confidence 45777777777776666777777888899999999998887432 234455443 333333 36678899999999999 Q ss_pred CHHHHHHHHHhcCccchhhhhHHHHHHhhccccccccchhHHhhhhhhHH Q lcl|NC_019408. 463 PDPVFYEYMRKAEVISSDMTFEEFQALRADENSFINNPDAQARQRGYTNR 512 (612) Q Consensus 463 s~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r 512 (612) |++++++.+ |+.++++ ...+.+++.+|.........+..+ ++-.| T Consensus 412 ~~~~~~~~l---g~~~~~i-~~~e~er~~~e~~~~~~~~~~~~~-~~~~~ 456 (456) T protein:vir:10 412 WASIRRNIL---NYNADQI-KQDDLDRAREQITLFAGNPVQRPQ-EDGSR 456 (456) T ss_pred hHHHHHhhC---CCCHHHH-HHHHHHHHHHHHHHHhhhhhhcCC-CCCCC Confidence 999887654 6644433 233444544432211111111111 11111 No 38 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=99.90 E-value=1.4e-23 Score=145.88 Aligned_cols=421 Identities=11% Similarity=0.018 Sum_probs=238.4 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeecC Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKNL 80 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~~ 80 (612) ||.+ +....+++..+.+-|.|...+..+.. ...... .. 
| .-.|+.+.+|+..+|++|.+||++..- T Consensus 25 ~i~~--~~~~~~r~~~~~~Yy~g~~~i~~~~~------~~~~~~--~~---k-i~~n~~~~ivd~~~~~l~g~~~~~~~~ 90 (452) T protein:vir:36 25 FMEK--HKLEVARYEYLKNMYLGIMAIDDEPA------KDSWKP--DN---R-LAVNFTKYIVDTFTGYFNGIPVKKSHS 90 (452) T ss_pred HHHH--HHHHHHHHHHHHHHhccccccccCcc------ccccCc--cc---e-eecchHHHHHHHHhhhhcccCceeecC Confidence 5554 34556788888888988765533211 011111 11 2 236999999999999999999999533 Q ss_pred CHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhcchhhhccCCccceeE Q lcl|NC_019408. 81 PPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILDWDEVVDMGGFYVPSR 160 (612) Q Consensus 81 p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW~~~~~v~g~~~Lt~ 160 (612) .+.....+.++-.. ++++.....+.+.++.+|+++++|- +.. ..+|-+..++|.+++-. +...+++ ..+.. T Consensus 91 d~~~~~~l~~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~-~d~-----~g~~~i~~~~p~~~~~v-~d~~~~~-~~~~~ 161 (452) T protein:vir:36 91 DKEILTKLQEFDNL-NDMEDEESELAKMACIYGRAFEFLY-QDE-----DTQTNVVYNSPENMFMV-YDDTVKQ-EPLFA 161 (452) T ss_pred ChhHHHHHHHHHhh-cChhHHHHHHHHHHHhcCeEEEEEE-ecC-----CCeeEEEEEcccceEEE-EcCCCCC-ceEEE Confidence 44555666666444 6899999999999999999999884 322 24788999999998763 2212111 11222 Q ss_pred EEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccceeEEE Q lcl|NC_019408. 161 VLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAYVQY 240 (612) Q Consensus 161 v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~~ 240 (612) | +..... +. ..+..+.. ..++|+. T Consensus 162 i--~~~~~~-------~~------~~~~~vyt---------------------~~~i~~~-------------------- 185 (452) T protein:vir:36 162 V--RYGVDE-------DK------KLQGEVYT---------------------LLETIKI-------------------- 185 (452) T ss_pred E--EEEEec-------Cc------eEEEEEEe---------------------cCeEEEE-------------------- Confidence 2 111100 00 00000000 0111111 Q ss_pred EEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhccceeeeecCC Q lcl|NC_019408. 
241 LYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALPVYYAPGTD 320 (612) Q Consensus 241 ~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~~ 320 (612) ..+++++...+.. -++|+.||||.+... .. +.+-|.++..|-=+.=...|++.+.+.+.+.|+++++|.+ T Consensus 186 --~~~~~~~~~~~~~----~~~~g~iPvv~~~n~--~~--g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~p~~~~~g~~ 255 (452) T protein:vir:36 186 --SGENDEISFGEGT----YNPYPDLPVVEFYFN--EE--RMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLGAA 255 (452) T ss_pred --EEcCCceEEecce----eccCCcccEEEecCC--CC--CCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCC Confidence 1111222222222 257999999987542 22 2333444544433444567888889999999999999975 Q ss_pred CCCCceEEEeccccccCCCC-----CceeEEecCchhHHHHHHHHHHHHHHHHHHHHHhhhccccchhHHHHHHHHHHHH Q lcl|NC_019408. 321 SEGTGEYHIGPNMVWEVPQG-----SEPGILEYTGQGLKALETALNDKERQIAAIGGRMMPGASKSVSESNNQTVLREAN 395 (612) Q Consensus 321 ~~~~~~l~iG~~~~~~lp~~-----~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~ll~~~~~~~~esa~~~~~~~~~ 395 (612) ......-.+-.+.+|.++.+ ++++|+..+.. .+..+..++.+.+.|...+.-.-....+.++-||++....... T Consensus 256 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Al~~~~~~ 334 (452) T protein:vir:36 256 VEEEDLKNIRSNRVINYYADGEGKNVDVKFLEKPDS-DSQTENLLDRLTKLIFQTTMVANISDESFGSSSGVSLAYKLQA 334 (452) T ss_pred cCchhhhhhhhcceEEecCCCCccCCcceeEeecCC-HHHHHHHHHHHHHHHHHHhCccccCcccccCCcHHHHHHHHHH Confidence 54333222333456666543 46899997764 4667888899999888775322112222345677777776666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCC--CcceEEEeeccccccCCCHHHHHHHHHHHHcCCCCHHHHHHHHHh Q lcl|NC_019408. 396 EQSLLLNIIQACESGMTDVVRWWLMWRDVPLAD--TENLRYEVNTDFLSTPIGAREMRAIQLMANDGLLPDPVFYEYMRK 473 (612) Q Consensus 396 ~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~--~~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G~is~et~~~~lqr 473 (612) ........-..+..++.+++++++.+++..... ..++.|.+++. 
.+.+ ..+.++++.++ +|.||.+|++..+ T Consensus 335 l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~i~i~f~~~-~p~d-~~~~a~~~~k~--~g~iS~et~~~~~-- 408 (452) T protein:vir:36 335 MSNLALSFQRKFQSSLNSRYKLFCELSTNVSNKDSWKDIEYTFTRN-EPKD-IKEQAETANIL--MGITSQETALSVI-- 408 (452) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccccceEEeCCC-CCcC-HHHHHHHHHHH--hccCChHHHHHhC-- Confidence 666677777888999999999999987653222 22455555432 2222 24455666554 6899999998766 Q ss_pred cCccchhhhhHHHHHHhhccccccccchhHHhhhhhhHHHHhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 474 AEVISSDMTFEEFQALRADENSFINNPDAQARQRGYTNRGQELEQSRMAREADFTQQKIDIQE 536 (612) Q Consensus 474 ~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~e~~r~~~e~e~~~q~~e~~~ 536 (612) +.++ +++++.++|.+|...... ..+.........+. .....++| T Consensus 409 -~~~~---d~~~E~~ri~~E~~~~~~-~~~~~~~~~~~~~~--------------~~~~~~~e 452 (452) T protein:vir:36 409 -SVIP---DVQAEMEKIKKEEASTAI-FDKDKQPSEKGTDT--------------VVSETNEE 452 (452) T ss_pred -CCCC---CHHHHHHHHHHHHHHHHH-HHhhccCCCCcccc--------------cCccccCC Confidence 3332 345666676654210000 00000000000000 00000000 No 39 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=99.90 E-value=3.7e-23 Score=143.56 Aligned_cols=425 Identities=12% Similarity=0.012 Sum_probs=237.3 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeecC Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKNL 80 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~~ 80 (612) ||.|= -....++|+++++.|.|...+..+. ......... 
.+ .-.|+.+.+|+.++|++|.++|++..- T Consensus 23 ~i~~~-~~~~~~r~~~~~~yy~g~~~i~~~~-----~~~~~~~~~--~k----i~~n~~~~iv~~~~~~l~g~~~~~~~~ 90 (489) T protein:vir:99 23 YISRF-KAEQLERLKELKRYYLGDNNIKYRP-----AKTDKYAAD--NR----IASDFAKYITVFEQGYMLGVPVEYKNE 90 (489) T ss_pred HHHHH-HHHHHHHHHHHHHHhcccCcccccc-----ccccccCCc--ce----eecchHHHHHHHHhhhhccCCceeecC Confidence 44331 1345688999999999976544321 111111111 12 247999999999999999999999544 Q ss_pred CHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhcchhhhccCCccceeE Q lcl|NC_019408. 81 PPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILDWDEVVDMGGFYVPSR 160 (612) Q Consensus 81 p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW~~~~~v~g~~~Lt~ 160 (612) ++.+..++.++-.. ++++.++..+.+.++.+|+++++|-..... ....+|.+..++|.+++-- +...+. ...+.- T Consensus 91 d~~~~~~l~~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~~~~~--d~~~~~~i~~~~p~~~~~v-~dd~~~-~~~~~~ 165 (489) T protein:vir:99 91 NKDLQAAIDLMSVR-NNEDYHNVKIKTDLSIYGRAYELLTVEKID--DKKTEVKLYQLPAEQTFVI-YDDTYQ-RNSLMA 165 (489) T ss_pred ChhHHHHHHHHHhh-cChhHHHHHHHHHHhhCCeEEEEEeeccCc--CCCcceEEEEEcccceEEE-EcCCCC-CceEEE Confidence 46677777777665 689999999999999999999988542211 1236889999999997542 111111 112222 Q ss_pred EEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccceeEEE Q lcl|NC_019408. 161 VLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAYVQY 240 (612) Q Consensus 161 v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~~ 240 (612) |++... .. .++ . ...+-.+... -.+|+. . 
T Consensus 166 i~~~~~-~~------~~~---~-~~~~~~~y~~---------------------~~i~~~-------------------~ 194 (489) T protein:vir:99 166 VHFYDI-DY------GSG---K-RKQIIKAYTS---------------------DTIYTY-------------------E 194 (489) T ss_pred EEEEEE-ec------CCC---c-eEEEEEEEeC---------------------CcEEEE-------------------E Confidence 222211 00 000 0 0001011100 011111 0 Q ss_pred EEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhccceeeeecCC Q lcl|NC_019408. 241 LYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALPVYYAPGTD 320 (612) Q Consensus 241 ~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~~ 320 (612) ....+..... ......++|+.||||.+.... . +.+-|.++..|-=.+-...|++.+.+.+.++|+++++|.. T Consensus 195 ~~~~~~~~~~----~~~~~~~~~g~vPvv~~~n~~--~--~~s~~~~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~g~~ 266 (489) T protein:vir:99 195 DYNLETKGMR----LKDYEGHFFKGVPVNEYANNE--E--RTGAYESVLDNIDAYDLSQSELANFQQDSVNALLVIAGNA 266 (489) T ss_pred ecCCCcccce----ecccccccCCceeEEEeecCC--C--CCCchhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhhccCC Confidence 0000111111 111223679999999885422 2 2233444444433455666888889999999999999975 Q ss_pred CCCCce------EEEeccc------------cccC-------CCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHh- Q lcl|NC_019408. 321 SEGTGE------YHIGPNM------------VWEV-------PQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRM- 374 (612) Q Consensus 321 ~~~~~~------l~iG~~~------------~~~l-------p~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~l- 374 (612) ....+. ..++++. ++.+ +.+.+++||..+.. .+.....|+.+++.|..++.-. T Consensus 267 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~~~~~~~~~~l~~~i~~~s~~p~ 345 (489) T protein:vir:99 267 YTGADENDYLDDGRLNPNGRLAISIGFKKAQVLILDDNPNPNGVKPQAYFLKKEYD-TAGSEAYKNRLVADILRFTFTPD 345 (489) T ss_pred cccccchhhhhhcccccccccccccccccceeeeeccccCccccccceeeeeecCC-hHHHHHHHHHHHHHHHHHhCCcc Confidence 433221 1111111 1222 12346778876543 4556788889999988775422 Q ss_pred hhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCC------cceEEEeeccccccCCCHH Q lcl|NC_019408. 
375 MPGASKSVSESNNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLADT------ENLRYEVNTDFLSTPIGAR 448 (612) Q Consensus 375 l~~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~------~~~~v~ln~dF~~~~~d~~ 448 (612) +...+..++.||++...............-..+..++.+++++++.+++...... .++.|.+++ ..+.+ ..+ T Consensus 346 ~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~~i~v~f~~-~~p~d-~~~ 423 (489) T protein:vir:99 346 TQDMKFSGVQSGESMKYKLMASDNYREKQERLFKKGLMRRLRLAANIWAIKGNEATTYSLVNDTSIVFTP-NLPQN-DNE 423 (489) T ss_pred cccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCccccccccccceEEeCC-CCCcC-HHH Confidence 1112223466888877777777777777888899999999999999987532211 134444432 22222 244 Q ss_pred HHHHHHHHHHcCCCCHHHHHHHHHhcCccchhhhhHHHHHHhhccccccccc----------hhHHhhhhhh Q lcl|NC_019408. 449 EMRAIQLMANDGLLPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFINNP----------DAQARQRGYT 510 (612) Q Consensus 449 ~~~al~~~~~~G~is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~----------~~~~~~~~e~ 510 (612) .++++.++ .|.||++|.+..+. ++ . +.+++++.+++.+|....... +++...+.++ T Consensus 424 ~~~~~~kl--~giis~et~~~~l~--~v-~-~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~p 489 (489) T protein:vir:99 424 IVTAAQNL--YGIVSDQTIFEILN--TV-T-GVDAEAELKRLKEEADKKQSLPEPRLVGDASGQEEPTAEKP 489 (489) T ss_pred HHHHHHHH--hccCCHHHHHHhcC--CC-C-chhHHHHHHHHHHHHHHHhccccccccCCCCCCcCCCCCCC Confidence 56666666 49999999987652 22 2 224566666665542211110 1111111111 No 40 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=99.90 E-value=4.6e-23 Score=143.05 Aligned_cols=441 Identities=10% Similarity=0.023 Sum_probs=236.5 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeecC Q lcl|NC_019408. 
1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKNL 80 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~~ 80 (612) ||.| -.....++++++.+.|.|...+-. +. ......++.. .| +-.|+.+.+++.++|++|.+||++..- T Consensus 48 ~i~~-~~~~~~~r~~~l~~Yy~g~~~il~-------~~-~~~~~~~~~~-~k-i~~n~~k~Iv~~~~~yl~g~p~~~~~~ 116 (511) T protein:vir:78 48 YIEH-HMDYQRPRLKVLSDYYEGKTKNLV-------EL-TRRKEEYMAD-NR-VAHDYASYISDFINGYFLGNPIQYQDD 116 (511) T ss_pred HHHH-HHHhhhHHHHHHHHHhhccCcccc-------cc-CcccccccCc-ce-eecchHHHHHHHHhhhhcccCceeecC Confidence 3322 012345678888888888654311 11 1111111111 12 336999999999999999999999544 Q ss_pred CHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhcchhhhccCCccceeE Q lcl|NC_019408. 81 PPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILDWDEVVDMGGFYVPSR 160 (612) Q Consensus 81 p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW~~~~~v~g~~~Lt~ 160 (612) .+.....+.++-.. ++++.+...+...++.+|+++++|-... ..+|-+..++|.+++-. +...+. ...+.. T Consensus 117 d~~~~~~l~~~~~~-n~~~~~~~~~~~~~~~~G~a~~~vy~d~------dg~~~i~~~~p~~~~~v-~dd~~~-~~~~~~ 187 (511) T protein:vir:78 117 DKDVLEAIEAFNDL-NDVESHNRSLGLDLSIYGKAYELMIRNQ------DDETRLYKSDAMSTFII-YDNTVE-RNSIAG 187 (511) T ss_pred chHHHHHHHHHHhh-cChhHHHHHHHHHHHhcCeeEEEEEeCC------CCceEEEEEcccceEEE-EcCCCC-CceEEE Confidence 45666677777555 5899999999999999999999996543 24688888999888652 111111 122333 Q ss_pred EEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccceeEEE Q lcl|NC_019408. 161 VLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAYVQY 240 (612) Q Consensus 161 v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~~ 240 (612) |++..... .++...+ ...+..+...+ ++|+. 
T Consensus 188 vr~~~~~~-------~~~~~~~-~~~~~~vyt~~---------------------~i~~~-------------------- 218 (511) T protein:vir:78 188 VRYLRTKP-------IDKTDED-EVFTVDLFTSH---------------------GVYRY-------------------- 218 (511) T ss_pred EEEEEeee-------ccccccc-eEEEEEEEeCC---------------------cEEEE-------------------- Confidence 33221100 0000000 00000000000 11111 Q ss_pred EEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhccceeeeecCC Q lcl|NC_019408. 241 LYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALPVYYAPGTD 320 (612) Q Consensus 241 ~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~~ 320 (612) ...+++.............++++.||||.+... .+ +.+-|.++..|-=+.-...|++.+.+++.+.|+++++|.. T Consensus 219 -~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~--~~--g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~ 293 (511) T protein:vir:78 219 -LTNRTNGLKLTPRENSFESHSFERMPITEFSNN--ER--RKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNL 293 (511) T ss_pred -EecCCCcccccccccccccCcCcccceEEecCC--CC--CCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCc Confidence 111111111111223345678999999987432 22 2222333333322334567888899999999999999953 Q ss_pred CCCCceE---------EEec-----cccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHh-hhccccchhHH Q lcl|NC_019408. 321 SEGTGEY---------HIGP-----NMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRM-MPGASKSVSES 385 (612) Q Consensus 321 ~~~~~~l---------~iG~-----~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~l-l~~~~~~~~es 385 (612) ....+.+ .+.+ ........+++++||..+-. .+.....++.+.+.|.....-. 
+......++-| T Consensus 294 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~S 372 (511) T protein:vir:78 294 NLDPVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGGYIYKQYD-VQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQS 372 (511) T ss_pred cCCchhhcccccccceeccccceeccccccCCCCcceeEEeecCC-HHHHHHHHHHHHHHHHHHhCCccccccccccccH Confidence 3322211 1111 11122345788999997654 4556788888888887664322 11111124568 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCC-----CcceEEEeeccccccCCCHHHHHHHHHHHHcC Q lcl|NC_019408. 386 NNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLAD-----TENLRYEVNTDFLSTPIGAREMRAIQLMANDG 460 (612) Q Consensus 386 a~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~-----~~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G 460 (612) |++...............-..+..++.+.+++++.+++..... -.++.|.+++.. +.+ ..+.++++..+ .| T Consensus 373 g~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~~~f~~~~-p~n-~~e~~d~~~kl--~G 448 (511) T protein:vir:78 373 GEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNL-PKS-LIEELKAYIDS--GG 448 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccceEEeCCCC-CcC-HHHHHHHHHHH--hc Confidence 8888777777777777778888999999999999987643211 124555554432 222 34567777777 48 Q ss_pred CCCHHHHHHHHHhcCccchhhhhHHHHHHhhccccccccchhHH-h-hhhhh-HHHHhHHHHHHHHHHH Q lcl|NC_019408. 461 LLPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFINNPDAQA-R-QRGYT-NRGQELEQSRMAREAD 526 (612) Q Consensus 461 ~is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~-~-~~~e~-~r~~~~e~~r~~~e~e 526 (612) .||++|++..+ ..++ ++++|.++|.+|........... . ...-. 
..+.+.+.++.+.|.| T Consensus 449 ~iS~et~l~~l---~~v~---d~~~El~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 511 (511) T protein:vir:78 449 KISQTTLMSLF---SFFQ---DPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred cCChHHHHHhC---CCCC---CHHHHHHHHHHHHHHHHHHHhhccccCCCCCCCCCCCCCccCcccccC Confidence 99999998765 3332 35667777766532100000000 0 00000 0000000000000000 No 41 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=99.90 E-value=4.6e-23 Score=143.05 Aligned_cols=441 Identities=10% Similarity=0.023 Sum_probs=236.5 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeecC Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKNL 80 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~~ 80 (612) ||.| -.....++++++.+.|.|...+-. +. ......++.. .| +-.|+.+.+++.++|++|.+||++..- T Consensus 48 ~i~~-~~~~~~~r~~~l~~Yy~g~~~il~-------~~-~~~~~~~~~~-~k-i~~n~~k~Iv~~~~~yl~g~p~~~~~~ 116 (511) T protein:vir:96 48 YIEH-HMDYQRPRLKVLSDYYEGKTKNLV-------EL-TRRKEEYMAD-NR-VAHDYASYISDFINGYFLGNPIQYQDD 116 (511) T ss_pred HHHH-HHHhhhHHHHHHHHHhhccCcccc-------cc-CcccccccCc-ce-eecchHHHHHHHHhhhhcccCceeecC Confidence 3322 012345678888888888654311 11 1111111111 12 336999999999999999999999544 Q ss_pred CHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhcchhhhccCCccceeE Q lcl|NC_019408. 81 PPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILDWDEVVDMGGFYVPSR 160 (612) Q Consensus 81 p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW~~~~~v~g~~~Lt~ 160 (612) .+.....+.++-.. ++++.+...+...++.+|+++++|-... ..+|-+..++|.+++-. +...+. ...+.. 
T Consensus 117 d~~~~~~l~~~~~~-n~~~~~~~~~~~~~~~~G~a~~~vy~d~------dg~~~i~~~~p~~~~~v-~dd~~~-~~~~~~ 187 (511) T protein:vir:96 117 DKDVLEAIEAFNDL-NDVESHNRSLGLDLSIYGKAYELMIRNQ------DDETRLYKSDAMSTFII-YDNTVE-RNSIAG 187 (511) T ss_pred chHHHHHHHHHHhh-cChhHHHHHHHHHHHhcCeeEEEEEeCC------CCceEEEEEcccceEEE-EcCCCC-CceEEE Confidence 45666677777555 5899999999999999999999996543 24688888999888652 111111 122333 Q ss_pred EEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccceeEEE Q lcl|NC_019408. 161 VLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAYVQY 240 (612) Q Consensus 161 v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~~ 240 (612) |++..... .++...+ ...+..+...+ ++|+. T Consensus 188 vr~~~~~~-------~~~~~~~-~~~~~~vyt~~---------------------~i~~~-------------------- 218 (511) T protein:vir:96 188 VRYLRTKP-------IDKTDED-EVFTVDLFTSH---------------------GVYRY-------------------- 218 (511) T ss_pred EEEEEeee-------ccccccc-eEEEEEEEeCC---------------------cEEEE-------------------- Confidence 33221100 0000000 00000000000 11111 Q ss_pred EEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhccceeeeecCC Q lcl|NC_019408. 241 LYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALPVYYAPGTD 320 (612) Q Consensus 241 ~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~~ 320 (612) ...+++.............++++.||||.+... .+ +.+-|.++..|-=+.-...|++.+.+++.+.|+++++|.. T Consensus 219 -~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~--~~--g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~ 293 (511) T protein:vir:96 219 -LTNRTNGLKLTPRENSFESHSFERMPITEFSNN--ER--RKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNL 293 (511) T ss_pred -EecCCCcccccccccccccCcCcccceEEecCC--CC--CCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCc Confidence 111111111111223345678999999987432 22 2222333333322334567888899999999999999953 Q ss_pred CCCCceE---------EEec-----cccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHh-hhccccchhHH Q lcl|NC_019408. 
321 SEGTGEY---------HIGP-----NMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRM-MPGASKSVSES 385 (612) Q Consensus 321 ~~~~~~l---------~iG~-----~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~l-l~~~~~~~~es 385 (612) ....+.+ .+.+ ........+++++||..+-. .+.....++.+.+.|.....-. +......++-| T Consensus 294 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~S 372 (511) T protein:vir:96 294 NLDPVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGGYIYKQYD-VQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQS 372 (511) T ss_pred cCCchhhcccccccceeccccceeccccccCCCCcceeEEeecCC-HHHHHHHHHHHHHHHHHHhCCccccccccccccH Confidence 3322211 1111 11122345788999997654 4556788888888887664322 11111124568 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCC-----CcceEEEeeccccccCCCHHHHHHHHHHHHcC Q lcl|NC_019408. 386 NNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLAD-----TENLRYEVNTDFLSTPIGAREMRAIQLMANDG 460 (612) Q Consensus 386 a~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~-----~~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G 460 (612) |++...............-..+..++.+.+++++.+++..... -.++.|.+++.. +.+ ..+.++++..+ .| T Consensus 373 g~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~~~f~~~~-p~n-~~e~~d~~~kl--~G 448 (511) T protein:vir:96 373 GEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNL-PKS-LIEELKAYIDS--GG 448 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccceEEeCCCC-CcC-HHHHHHHHHHH--hc Confidence 8888777777777777778888999999999999987643211 124555554432 222 34567777777 48 Q ss_pred CCCHHHHHHHHHhcCccchhhhhHHHHHHhhccccccccchhHH-h-hhhhh-HHHHhHHHHHHHHHHH Q lcl|NC_019408. 461 LLPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFINNPDAQA-R-QRGYT-NRGQELEQSRMAREAD 526 (612) Q Consensus 461 ~is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~-~-~~~e~-~r~~~~e~~r~~~e~e 526 (612) .||++|++..+ ..++ ++++|.++|.+|........... . ...-. 
..+.+.+.++.+.|.| T Consensus 449 ~iS~et~l~~l---~~v~---d~~~El~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 511 (511) T protein:vir:96 449 KISQTTLMSLF---SFFQ---DPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred cCChHHHHHhC---CCCC---CHHHHHHHHHHHHHHHHHHHhhccccCCCCCCCCCCCCCccCcccccC Confidence 99999998765 3332 35667777766532100000000 0 00000 0000000000000000 No 42 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=99.90 E-value=3.3e-23 Score=143.81 Aligned_cols=443 Identities=11% Similarity=0.042 Sum_probs=228.7 Q ss_pred CCC---cHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCC-HHHHHHHHhhccCCchHHHHHHHhhchhhcCCce Q lcl|NC_019408. 1 MVT---HPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGAD-GDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPI 76 (612) Q Consensus 1 ~~~---hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~-~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~ 76 (612) +|. .+.|....+++.+..+-|.|...+. ..+... .+.++....+++ .|+.+-+|+.+++++|-...+ T Consensus 17 ~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~--------~~~~~~~~~~~~~~~~~~~-~n~~~~iVd~~~~~l~~~gf~ 87 (479) T protein:vir:99 17 YLETKVFPKMNTECERLDDFEAWTKNGQEVP--------DLATRHKNKEREVLQQLSR-KPWMGLMVNSFAQQLIVDGYR 87 (479) T ss_pred HHHHHHHHHHHHHhHHHHHHHHHHhcCCccc--------ccccccCChhHHHHHHHhh-cCcHHHHHHHHHhhccccccc Confidence 221 2566678888999999998865433 222222 222222222222 599999999999999866554 Q ss_pred ee--cCCHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhc-chhhhccC Q lcl|NC_019408. 77 VK--NLPPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILD-WDEVVDMG 153 (612) Q Consensus 77 ~~--~~p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~Iin-W~~~~~v~ 153 (612) .. +..+.+..++.+ ++++..+..++..++.+|++|++|- |..+..-....|-+..++|++++- |+ ..+. 
T Consensus 88 ~~d~~~~~~~~~i~~~-----N~~d~~~~~~~~~a~~~G~af~~v~-~~~~~~d~~g~~~i~~~~p~~~~~iyd--d~~~ 159 (479) T protein:vir:99 88 KTGTNENAKGWDTWRL-----NQMDKQQFWLNRAVLTFGYAFIKVT-SGISPLDGTTVARIKCIDPRDAFAIWE--DPYW 159 (479) T ss_pred CCCchhhHHHHHHHHh-----cChhHHHHHHHHHHhhcCceEEEEe-cCCCCcCCCCceEEEEechhheEEEec--CCcc Confidence 42 234456666653 6788999999999999999999984 321111112457888999999874 42 1111 Q ss_pred CccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccc Q lcl|NC_019408. 154 GFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEV 233 (612) Q Consensus 154 g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~ 233 (612) + ..-+...+. . . . +. ..+| T Consensus 160 ~---~~~~~~~~~-~-------~-----------------~----------------~~--~~~~--------------- 178 (479) T protein:vir:99 160 D---EWPKYLLER-Q-------P-----------------N----------------GQ--YWWW--------------- 178 (479) T ss_pred c---ceeeEEEee-c-------C-----------------c----------------ee--EEEE--------------- Confidence 1 100000000 0 0 0 00 0000 Q ss_pred cceeEEEEEeeCCCceecceeeeccCCccccceeEEEee-cCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhccc Q lcl|NC_019408. 234 KLAYVQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFFG-ASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALP 312 (612) Q Consensus 234 ~~~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~-~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P 312 (612) ....+.++..+++.|...+.. -++|+.||||.|- ....+. .+.+-|.++..|-=+.=...|+...++.+.++| T Consensus 179 -~~~~~~~~~~~~~~~~~~~~~----~h~~g~vPvv~f~n~~~~~~-~g~sd~e~v~~liDa~~~~~s~~~~~~~~~a~p 252 (479) T protein:vir:99 179 -TEEDYSIFEFKQGKFIYRETV----SHDYGHIPFVRYVNVMDLRG-VCYGDVEPLVTVAKAIDKTGLDILLVQHHQSFQ 252 (479) T ss_pred -ecceEEEEEecCCceeecccc----ccCCCCcceEEeecCCCcCc-CCcchhHHHHHHHHHHHHHHHHHHHHHHHhhch Confidence 001112233333344333332 2569999999664 322211 233434444444333345678888999999999 Q ss_pred eeeeecCCCCCCc-----eEEEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHhhhccccchhHHHH Q lcl|NC_019408. 
313 VYYAPGTDSEGTG-----EYHIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRMMPGASKSVSESNN 387 (612) Q Consensus 313 ~l~i~G~~~~~~~-----~l~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~ll~~~~~~~~esa~ 387 (612) ++++.|.+..... ...+..+.+|.+ .|++++|.+.+...++.+.+.++.+..++.....-....-+..++.||. T Consensus 253 ~~~i~G~~~~~~~~~~~~~~~~~~~~i~~~-~~~~~~~~q~~~~~~~~~~~~l~~~i~~i~~~t~~p~~~~g~~~n~Sg~ 331 (479) T protein:vir:99 253 IRWATGLMLPEGANADQEKMRFAQESMLIS-QNEKASFGAIPAAPLDGLLNAYKESLLEFLALAQLPPHIAGQIVNVAAD 331 (479) T ss_pred hhhhcCCCcccccccchhccccccccceee-cCCCceEEEecccchHHHHHHHHHHHHHHhccCCCCHHHcccccchHHH Confidence 9999997543222 123343445443 4667888999888888787877777777654321111111123446776 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCcceEEEee-ccccccCCCHHHHHHHHHHHHcCCCCHHH Q lcl|NC_019408. 388 QTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLADTENLRYEVN-TDFLSTPIGAREMRAIQLMANDGLLPDPV 466 (612) Q Consensus 388 ~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~~~~~v~ln-~dF~~~~~d~~~~~al~~~~~~G~is~et 466 (612) +...........-...-..+..++.+++++++.+.|.... ...+.+++. ++-.+.. .++.++++.+++++|.||++| T Consensus 332 Al~~~~~~l~~ka~~~~~~f~~al~~~~~l~~~~~~~~~~-~~~~~i~~~w~~~~~~s-~~~~ad~~~kl~~ag~is~et 409 (479) T protein:vir:99 332 ALAAGTRQTMQKLFEKQATWKASHNQTMRLVNKIEGRTEE-ATDLDFTITWQDVTIQS-LAQFADAWAKMVESLKIPAEG 409 (479) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcc-ccceeeeEEecCCCCCC-HHHHHHHHHHHHhcCCCCHHH Confidence 6666655555555555566677899999999998886432 222233332 2333333 367888999999999999999 Q ss_pred HHHHHHhcCccchhhhhHHHHHHhhccccccccchhHHh------hhhhhHHHHhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 467 FYEYMRKAEVISSDMTFEEFQALRADENSFINNPDAQAR------QRGYTNRGQELEQSRMAREADFTQQKIDIQERSVA 540 (612) Q Consensus 467 ~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~------~~~e~~r~~~~e~~r~~~e~e~~~q~~e~~~r~~~ 540 (612) .+..| -|+-++++ +...+....+.......+.... +........+. ++-...+.+..+ =-|-.. 
T Consensus 410 ~l~~l--~gv~~~~~--e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~-----~~~~~~ 479 (479) T protein:vir:99 410 VWDMI--PNLDQSTV--NGWKEIYDREGDFGKYMRKLQNGPDPAEQRGGPNGATNM-QQANNKTGEPAS-----LNKSGA 479 (479) T ss_pred HHHhc--CCCCHHHH--HHHHHHHHHHHHHHHHHHHHhcccCcccccCCCCCCCCC-CCCCCCCcchhc-----cCCCCC Confidence 98776 23322221 1111111111000000000000 00000000000 000000000000 000000 No 43 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=99.89 E-value=1.4e-22 Score=140.34 Aligned_cols=415 Identities=11% Similarity=0.016 Sum_probs=232.3 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeecC Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKNL 80 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~~ 80 (612) ||.+ +....++..++.+.|.|...+.... ++.+... . .|. -.|+.+.+|+..+|++|.++|++... T Consensus 25 ~i~~--~~~~~~r~~~~~~yy~g~~~i~~~~----~~~~~~~--~-----~ki-~~n~~~~ivd~~~~~l~g~~~~~~~~ 90 (453) T protein:vir:73 25 FMKK--HQEEVERYEYLGNMYKGIMEISSQK----AKDSWKP--D-----NRL-TNNFAKYIVDTFVGYFNGIPIKKTHD 90 (453) T ss_pred HHHH--HHHHHHHHHHHHHHhccccchhcCC----CCCccCc--c-----cee-ecchHHHHHHHhhhhhcccCceeecC Confidence 5543 4556778888899999987665432 1111111 1 122 35999999999999999999999533 Q ss_pred CHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhc-chhhhccCCcccee Q lcl|NC_019408. 81 PPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILD-WDEVVDMGGFYVPS 159 (612) Q Consensus 81 p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~Iin-W~~~~~v~g~~~Lt 159 (612) .+....++.++-.. ++++.....+.+.++.+|+++++|-... ...|-+..++|.+++- |+ ..+ ++..+. 
T Consensus 91 d~~~~~~l~~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~------~~~~~i~~~~p~~~~~v~d--d~~-~~~~~~ 160 (453) T protein:vir:73 91 DKSVLEAMQLFDNL-NDMEDEESELAKIACVYGRAYELMYQNE------STESEVIYCSPLNVFMVYD--DSI-KQKPLF 160 (453) T ss_pred ChHHHHHHHHHHHh-cChhHHHHHHHHHHHhcCeEEEEEEeCC------CCceEEEEEcccceEEEEe--CCC-CceeEE Confidence 45555556555333 6899999999999999999999995432 2468888899988764 32 111 111111 Q ss_pred EEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccceeEE Q lcl|NC_019408. 160 RVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAYVQ 239 (612) Q Consensus 160 ~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~ 239 (612) .++-. .+. ++ ..+..+... ..+|+. T Consensus 161 --~i~~~-~~~------~~------~~~~~vyt~---------------------~~i~~~------------------- 185 (453) T protein:vir:73 161 --AVYYG-FDE------EG------NLSGTVYTL---------------------LETISI------------------- 185 (453) T ss_pred --EEEEE-Eec------Cc------eEEEEEEeC---------------------CeEEEE------------------- Confidence 11111 100 00 001111100 011111 Q ss_pred EEEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhccceeeeecC Q lcl|NC_019408. 240 YLYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALPVYYAPGT 319 (612) Q Consensus 240 ~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~ 319 (612) ...++.+...+. ..++|+.||||.+... .+ +.+-+.++..|-=+.-+..|+..+.+.+.+.|+++++|. T Consensus 186 ---~~~~~~~~~~~~----~~~~~g~vPvv~~~n~--~~--g~s~~~~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~ 254 (453) T protein:vir:73 186 ---TGKAGEVKFGES----TYNVYSDLPIVEYNFN--EE--RQSIFEPVHSLINSYNKVTSEKANDVEYFSDQYLVFLGA 254 (453) T ss_pred ---EecCCceEEccc----eeccCCceeEEEecCC--CC--CCcchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecC Confidence 111122221111 2367999999987532 22 222233333332233455678888888999999999998 Q ss_pred CCCCCceEE-----------EeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHhhhccccchhHHHHH Q lcl|NC_019408. 
320 DSEGTGEYH-----------IGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRMMPGASKSVSESNNQ 388 (612) Q Consensus 320 ~~~~~~~l~-----------iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~ll~~~~~~~~esa~~ 388 (612) +.++...-. .++......+.+++++|+..+.+ .+..+..++.+.+.|..+..-.-......++-||++ T Consensus 255 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~-~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~A 333 (453) T protein:vir:73 255 EVDEEDAKNIKDNRLINFFDKNSNGQGTNAAKVDVKFLDKPDS-DVQTENLLNRLERSIFQFTMAANISDENFGNSSGVA 333 (453) T ss_pred CCCchhhhcccccccccccccccccccccccCceeEEeeecCC-HHHHHHHHHHHHHHHHHHhCCcccCcccccCccHHH Confidence 544322111 12222233456788999997764 355678888888888776432211112224557777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCC--CcceEEEeeccccccCCCHHHHHHHHHHHHcCCCCHHH Q lcl|NC_019408. 389 TVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLAD--TENLRYEVNTDFLSTPIGAREMRAIQLMANDGLLPDPV 466 (612) Q Consensus 389 ~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~--~~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G~is~et 466 (612) ...............-..+..++.+.+++++.+++..... ..++.|.+++.. +.+ ..+.++++.++. |.||.+| T Consensus 334 l~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~i~v~f~~~~-p~~-~~~~a~~~~k~~--giis~et 409 (453) T protein:vir:73 334 LAYKLQAMSNLALSFQRKFQSALNRRYSLWSSLSTNASNKDAWKDIEYTFTRNE-PKD-IKEQAETANILK--GITSEET 409 (453) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccccceEEeCCCC-CCC-HHHHHHHHHHHh--ccCcHHH Confidence 7666666666667777788899999999999887643221 234566554332 222 245666676664 8999999 Q ss_pred HHHHHHhcCccchhhhhHHHHHHhhccccccccchhHHhhhhhhHHHHhHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 467 FYEYMRKAEVISSDMTFEEFQALRADENSFINNPDAQARQRGYTNRGQELEQSRMAREADFTQQKI 532 (612) Q Consensus 467 ~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~e~~r~~~e~e~~~q~~ 532 (612) +++.+ +.++ ++++|.++|.+|... .... ++.. ...+. ++..... 
T Consensus 410 ~~~~~---~~~~---d~~~E~~ri~~E~~~-----~~~~-----~~~~----~~~~~--~~~~~~~ 453 (453) T protein:vir:73 410 ALSVI---SVIP---DVQAEMEKIKKKKLL-----QLSL-----TRTS----NLVRM--KQMRGNL 453 (453) T ss_pred HHHhC---CCCC---CHHHHHHHHHHHHHH-----HHHH-----HHhc----cCCcc--hhhhcCC Confidence 98765 2222 345666666554210 0000 0000 00000 0000000 No 44 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=99.89 E-value=6.8e-23 Score=142.10 Aligned_cols=423 Identities=11% Similarity=0.048 Sum_probs=232.5 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeecC Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKNL 80 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~~ 80 (612) ||.+= .....++++++.+.|.|...+... ++ ......+ -+-.|+.+.+|+.++|++|.+||++... T Consensus 33 ~i~~~-~~~~~~~~~~l~~Yy~g~~~i~~~-----~~--~~~~~~~------ki~~n~~~~Ivd~~~~~l~g~p~~~~~~ 98 (470) T protein:vir:99 33 FIAYN-ETVLKPRYRENMKLYLGKHKILTA-----PE--KETGADN------RIVVNSAKYVVDVYNGYFCGIEPKLALL 98 (470) T ss_pred HHHHH-HHhhHHHHHHHHHHhccccccccC-----cc--cccCCcc------eeecchHHHHHHHHhhhhccCCeeEeeC Confidence 43321 134568899999999997544321 11 1111111 1346999999999999999999998421 Q ss_pred CH-HHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhc-chhhhccCCccce Q lcl|NC_019408. 81 PP-KFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILD-WDEVVDMGGFYVP 158 (612) Q Consensus 81 p~-~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~Iin-W~~~~~v~g~~~L 158 (612) .+ .....+.++-. .++++.+++.++..++.+|+++++|-... ..+|.+..++|.+++- |+. . 
.++..+ T Consensus 99 ~d~~~~~~l~~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d~------dg~~~i~~~~p~~~~~i~d~--~-~~~~~~ 168 (470) T protein:vir:99 99 NDSSKIDEIARWNR-QENFFDTINEISKQCDIFGRSIASIYQGE------DARPHLMYSSPNHAFIIYDD--T-VQRQPL 168 (470) T ss_pred CchhHHHHHHHHHH-hcCHhHHHHHHHHHHHhcCeeEEEEEeCC------CCeEEEEEEccceeEEEEcC--C-CCcceE Confidence 11 12112222211 36999999999999999999999995432 3579999999999764 321 1 112222 Q ss_pred eEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccceeE Q lcl|NC_019408. 159 SRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAYV 238 (612) Q Consensus 159 t~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~ 238 (612) ..|+... .. .+. ....+..+...+ ++|+. T Consensus 169 ~~vr~~~--~~------~~~----~~~~~~~~~~~~---------------------~~~~~------------------ 197 (470) T protein:vir:99 169 AFVHYQI--DN------SNN----WTDAYGVIQYAD---------------------KFYKF------------------ 197 (470) T ss_pred EEEEEEE--Ee------cCC----eeEEEEEEEecC---------------------eEEEE------------------ Confidence 2222211 11 000 011111111110 11110 Q ss_pred EEEEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhccceeeeec Q lcl|NC_019408. 239 QYLYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALPVYYAPG 318 (612) Q Consensus 239 ~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i~G 318 (612) . ..+.++.. .......++|+.||||.+... .+ +.+-+.++..|-=+.=...|++.+++.+.++|+++++| T Consensus 198 ---~-~~~~~~~~--~~~~~~~~~~g~vPvv~~~n~--~~--g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g 267 (470) T protein:vir:99 198 ---K-GYDIEEDT--NAAGYAINPYGLVPAVEFFEN--EE--RQGIFDSIKTLINALDKVISQKANQVEYFDNAYMYMIG 267 (470) T ss_pred ---E-eccccccc--ccccccccCCCccceEeecCC--CC--CCcchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeec Confidence 0 01111100 011122467999999987532 22 22223333333223335668888899999999999999 Q ss_pred CCCCCCce----EEEeccccccCC-----CCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHh-hhccccchhHHHHH Q lcl|NC_019408. 
319 TDSEGTGE----YHIGPNMVWEVP-----QGSEPGILEYTGQGLKALETALNDKERQIAAIGGRM-MPGASKSVSESNNQ 388 (612) Q Consensus 319 ~~~~~~~~----l~iG~~~~~~lp-----~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~l-l~~~~~~~~esa~~ 388 (612) ......+. ..++.+.++.+| .+++++|+..+. ..+..+..++.+.+.|...+.-. +....-.++-||++ T Consensus 268 ~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~A 346 (470) T protein:vir:99 268 FKLPEDDEGNPKFDFKNNRVLYVSQLDPDTNPQIGFIAKPD-ADQMQENLIQHLTDFIFMMAMVPNIQDKNFAGNSSGVA 346 (470) T ss_pred CCcccccccchhhhhhhcceeeecCCCCCCCCcceEEeecC-ChHHHHHHHHHHHHHHHHHhCCccccccccccCchHHH Confidence 75443221 223445555554 467899998775 34566788889999988775432 11111124558888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCC---cceEEEeeccccccCCCHHHHHHHHHHHHcCCCCHH Q lcl|NC_019408. 389 TVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLADT---ENLRYEVNTDFLSTPIGAREMRAIQLMANDGLLPDP 465 (612) Q Consensus 389 ~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~---~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G~is~e 465 (612) ...............-..+..++.+.+++++.+++...... .++.|.+++.. +.+ ..+.++++..+ .|.||++ T Consensus 347 i~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~i~v~f~~~~-p~~-~~e~a~~~~kl--~giis~e 422 (470) T protein:vir:99 347 LQYKLFAMKNKADSKERKFDKSLMQLYRIVLATLFNNKQDQELWSELDFKFTRNL-PED-MASAIDNAKNA--EGIVSKK 422 (470) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccceEEeCCCC-CcC-HHHHHHHHHHH--hccCCHH Confidence 77777777777888888899999999999999977543222 24455554322 222 24456666666 4899999 Q ss_pred HHHHHHHhcCccchhhhhHHHHHHhhccccccccchhHHhhhhhhHHHHhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 466 VFYEYMRKAEVISSDMTFEEFQALRADENSFINNPDAQARQRGYTNRGQELEQSRMAREADFTQQKIDIQ 535 (612) Q Consensus 466 t~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~e~~r~~~e~e~~~q~~e~~ 535 (612) |++..+ +.+++++|.++|.+|...... 
...........- +.+ -..+++ T Consensus 423 t~l~~l-------~~vd~~~E~eri~~E~~~~~~-~~~~~~~~~d~~-----------~~d---~~~ee~ 470 (470) T protein:vir:99 423 TQLGMI-------PDIEPDAEMKQIAKEKADAIK-QTQQLSMPIDIL-----------KRD---NNAEEE 470 (470) T ss_pred HHHHhC-------CCCCHHHHHHHHHHHHHHHHH-HHHhhcCCCCcC-----------CCC---CCccCC Confidence 998865 223455666676554210000 000000000000 000 000000 No 45 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=99.89 E-value=1.2e-22 Score=140.78 Aligned_cols=432 Identities=11% Similarity=0.042 Sum_probs=233.6 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcCh-HHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeec Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQ-REIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKN 79 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~-~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~ 79 (612) ||.|= -....++|++..+.|.|. ..+.. +....+. ... ..-.-.|+.+.+|+.++|++|.+||++.. T Consensus 47 ~i~~~-~~~~~~r~~~l~~yY~g~~~~i~~-------~~~~~~~-~~~---~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~ 114 (501) T protein:vir:27 47 FINHH-KLRQAPRIQELLDYARGENHDVLQ-------FGRRKDR-EMA---DKRAVHNYGRMISKFKTGYLAGNPIRVEY 114 (501) T ss_pred HHHHH-HHHHHHHHHHHHHHhcCCCccccc-------cCccCcc-ccc---cceeccchHHHHHHHHhhhhcccCeeEec Confidence 22210 123466788888888874 22221 1111111 011 11134799999999999999999999842 Q ss_pred C----CHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhc-chhhhccCC Q lcl|NC_019408. 80 L----PPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILD-WDEVVDMGG 154 (612) Q Consensus 80 ~----p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~Iin-W~~~~~v~g 154 (612) . .+.+..++.++-.. ++++.++..+.+.++.+|+++++|-... 
..+|-+..++|.+++- |+ ..+.+ T Consensus 115 ~d~~~~~~~~~~l~~~~~~-n~~~~~~~~~~~~~~~~G~a~~~vy~de------d~~~~i~~~~p~~~~~v~d--~~~~~ 185 (501) T protein:vir:27 115 DDNDNNSQNDDTIKRIGRI-NDIDSHNRTLIRDLSQTGRAYEVIYRNE------YDETRIKRLNPLETFVIYD--NSLED 185 (501) T ss_pred CCccchHHHHHHHHHHHHh-cChhHHHHHHHHHHhhCCeEEEEEEeCC------CCceEEEEEccceeEEEec--CCCCC Confidence 1 24456667776555 6999999999999999999999995432 2478899999998864 32 12222 Q ss_pred ccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeecccccccccccc Q lcl|NC_019408. 155 FYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVK 234 (612) Q Consensus 155 ~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~ 234 (612) ..+..|++...-.. . ++...+++|- . . T Consensus 186 -~~~~~ir~~~~~~~------~---------------------------------~~~~~~~vyt---~----------~ 212 (501) T protein:vir:27 186 -NSIAAVRYYNRGTL------Q---------------------------------NAKDVVEIYT---N----------E 212 (501) T ss_pred -ceEEEEEEEEeeec------C---------------------------------CcEEEEEEEe---C----------C Confidence 22222322211000 0 0000011110 0 0 Q ss_pred ceeEEEEEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhcccee Q lcl|NC_019408. 235 LAYVQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALPVY 314 (612) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l 314 (612) .++ .+..+++.+.+ ...-++|+.||||.+... .+ +.+-|.++..|-=+.-...|++.+.+.+.+.|++ T Consensus 213 ~v~---~~~~~~~~~~~-----~~~~~~~g~vPvv~~~nn--~~--g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~~ 280 (501) T protein:vir:27 213 HIY---TLDASDDFNEI-----SVTTHAFGTVPITEFLNN--VD--GIGDYETELYLIDLYDSAESDTANHMSDMADAIL 280 (501) T ss_pred eEE---EEEeCCceeec-----cccccCCCcccEEEecCC--CC--CCCchhhhHHHHHHHHHHHHHHHHHHHHhcCcee Confidence 000 11111222211 112367999999987532 22 2333444444433445677888899999999999 Q ss_pred eeecCCCCCCce----------EEEec-cccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHh-hhccccch Q lcl|NC_019408. 
315 YAPGTDSEGTGE----------YHIGP-NMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRM-MPGASKSV 382 (612) Q Consensus 315 ~i~G~~~~~~~~----------l~iG~-~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~l-l~~~~~~~ 382 (612) +++|......+. +.+.+ ++....+.+++++|+..+-.. +.....++.+.+.|...+.-. +....-.+ T Consensus 281 v~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~-~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~ 359 (501) T protein:vir:27 281 AIYGDLALPKGMQASDMKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDV-SGAEAYKTRLNRDIHIFTNIPDMSDTNFSG 359 (501) T ss_pred eeecCccCCcccchhhhhhcCceeecccccccCCCCCcceeeeeccCCH-HHHHHHHHHHHHHHHHHhCCcccCcccccc Confidence 999964332211 22211 122223456789999877543 456777888888887765322 11112234 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCC----CcceEEEeeccccccCCCHHHHHHHHHHHH Q lcl|NC_019408. 383 SESNNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLAD----TENLRYEVNTDFLSTPIGAREMRAIQLMAN 458 (612) Q Consensus 383 ~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~----~~~~~v~ln~dF~~~~~d~~~~~al~~~~~ 458 (612) +-||++..................+..++.+.+++++.+++..... ..++.|.+++. .+.+ ..+.++++..+ T Consensus 360 n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~v~f~~~-~p~n-~~e~ad~~~kl-- 435 (501) T protein:vir:27 360 NTSGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKDFDESLLKITFTPN-LPKS-LNEQVSILTGL-- 435 (501) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccceEEeCCC-CCcC-HHHHHHHHHHH-- Confidence 5688888887777777788888999999999999999987653221 12355555432 2222 24456666665 Q ss_pred cCCCCHHHHHHHHHhcCccchhhhhHHHHHHhhccccccccchhHHhhhhhhHH-HHhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 459 DGLLPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFINNPDAQARQRGYTNR-GQELEQSRMAREADFTQQKIDI 534 (612) Q Consensus 459 ~G~is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r-~~~~e~~r~~~e~e~~~q~~e~ 534 (612) +|.||++|++..+ +.++ ++++|.++|.+|..... ....+...... 
....++.....+.+ ++.+.| T Consensus 436 ~g~iS~et~l~~l---~~v~---D~~~E~eri~~E~~e~~---~~~~~~~~~~~~~~~~d~~~~~~~d~--~e~~~~ 501 (501) T protein:vir:27 436 GGQVSQETALSLS---GLVE---SPNEELDKINKEVSEID---FKGYSNDFNEHVGKYTDEVKETHTDD--FERAYE 501 (501) T ss_pred hccCcHHHHHHhC---CCCC---CHHHHHHHHHHHHHhhh---HhhhcCccccccccccCCCCCCcccc--ccccCC Confidence 6999999998765 2222 35666777655421100 00000000000 00000000000000 000000 No 46 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=99.89 E-value=4.5e-23 Score=143.08 Aligned_cols=445 Identities=10% Similarity=0.030 Sum_probs=241.3 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhh---ccCCchHHHHHHHhhchhhcCCcee Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQR---ATFFNMLAQTRDGMTGMVFRRDPIV 77 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~r---A~~~n~~~~tv~~~~G~vf~k~p~~ 77 (612) ||. .+....+++....+.|.|.+.+..+...+.++............+.. =+-.|+.+.+|+..+|++|.+||++ T Consensus 13 ~~~--~~~~~~~~~~~~~~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~yl~G~p~~~ 90 (471) T protein:vir:10 13 QMV--KHGKFVSQAAEAEKYYRNENDIKRKRKPADKKGAENEAKAEDNAFRNADNRISHNWHQLLLDQKKAYALTYPPTF 90 (471) T ss_pred HHH--HHHHHHHHHHHHHHHhccccccccccchhhhhcccccccccccccccccceeccchhHHHHHhhhhhhcccCcee Confidence 443 33456778889999999976655432222222111111111111111 1457999999999999999999999 Q ss_pred ecCCHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhc-chhhhccCCcc Q lcl|NC_019408. 78 KNLPPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILD-WDEVVDMGGFY 156 (612) Q Consensus 78 ~~~p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~Iin-W~~~~~v~g~~ 156 (612) ..-.+....+++++. .++++...+.+.+.++.+|+++++|=.... ..+|-+..++|.+++- |+ ....+ . 
T Consensus 91 ~~~~~~~~~~l~~~~--~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~-----~g~~~~~~~~p~~~~~i~d--~~~~~-~ 160 (471) T protein:vir:10 91 DVDDKKVNDMIVDVL--GDDYERISKQLCVNAGNAGIAWLHVWKDAS-----DNSFRYACVDSKEVIPIYS--KSLDK-K 160 (471) T ss_pred ccCChHHHHHHHHHH--hcCHHHHHHHHHHHHhhCCeEEEEEEeeCC-----CCeeEEEEEcccceEEEEc--CCCCC-c Confidence 533455666777664 378999999999999999999999854322 2478899999999764 32 11111 1 Q ss_pred ceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccce Q lcl|NC_019408. 157 VPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLA 236 (612) Q Consensus 157 ~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~ 236 (612) .+.-|+...... ..+... ...+ .....++ +..|+...... ....... T Consensus 161 ~~~~ir~~~~~~------~~~~~~---~~~~-~vy~~~~-------------------~~~y~~~~~~~--~~~~~~~-- 207 (471) T protein:vir:10 161 SIGVLRVYSSID------ETDGKN---YTVY-EYWNDKE-------------------CSFYRHEKEKP--LEELETF-- 207 (471) T ss_pred eEEEEEEEEeec------cCCCce---eEEE-EEEeCCc-------------------EEEEEecCCcc--ccccccc-- Confidence 222222222111 011111 1110 1111111 01111100000 0000000 Q ss_pred eEEEEEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhccceeee Q lcl|NC_019408. 237 YVQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALPVYYA 316 (612) Q Consensus 237 ~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i 316 (612) .......-..+.. .....-.++|+.||||.+... .. 
+.+-|.++-.|-=+.=...|+..+.+...+.|++++ T Consensus 208 ~~~~~~~~~~~~~----~~~~~~~~~~g~iPvv~~~n~--~~--~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~ 279 (471) T protein:vir:10 208 QAISLIDTMNGDR----SSDNSFKHDFGLVPFIPFKNN--EI--ETNDLKPIKDLVDVYDKVFSGFVNDTDDVQEVIFVL 279 (471) T ss_pred ccccccccccccc----cccccccCCCCceeEEEeccC--CC--CCCchHHHHHHHHHHHHHHHHHHHHHHHhhCceeee Confidence 0000000000000 011112357999999988432 22 122233333332122235677888889999999999 Q ss_pred ecCCCCCCceE--EEeccccccCC-----CCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHhhhccccchhHHHHHH Q lcl|NC_019408. 317 PGTDSEGTGEY--HIGPNMVWEVP-----QGSEPGILEYTGQGLKALETALNDKERQIAAIGGRMMPGASKSVSESNNQT 389 (612) Q Consensus 317 ~G~~~~~~~~l--~iG~~~~~~lp-----~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~ll~~~~~~~~esa~~~ 389 (612) +|.+.+..... .+-...++.++ .+++++|+..+.+ ++..+..++.+++.|...+.-+-....+.++-|+++. T Consensus 280 ~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l~~~~~-~~~~~~~~~~l~~~I~~~s~tp~~~~~~~gn~Sg~Al 358 (471) T protein:vir:10 280 TNYGGQDKQEFLEDLKRYKMIKMDNDGMGDQSGVTTIAIDIP-TEARNLILERTKKQIFISGQGVNPETDKLGNSSGVAL 358 (471) T ss_pred ecCCccccchhHHHhhcCCeEEecCCCCccCccceEEeecCC-hHHHHHHHHHHHHHHHHHhCCcCCCcccccCccHHHH Confidence 99754432221 12223344443 4568999998865 4678999999999998775322111223345678777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCcceEEEeeccccccCCCHHHHHHHHHHHHcCCCCHHHHHH Q lcl|NC_019408. 390 VLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLADTENLRYEVNTDFLSTPIGAREMRAIQLMANDGLLPDPVFYE 469 (612) Q Consensus 390 ~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G~is~et~~~ 469 (612) ..............-..+..++.+.+++++.++|.. +..++.|.+++ ..+.+ ..+.++.+..+ +|.||.+|.+. 
T Consensus 359 k~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~--d~~~i~i~f~~-~~p~n-~~e~~~~~~kl--~g~iS~et~~~ 432 (471) T protein:vir:10 359 KFLYSLLELKAGNMETQFRSGYATLVKMILKHLGLS--DKLKIKQTWTR-NSINN-DTEMAQVVSTL--ATITSRENVAK 432 (471) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC--CCceeEEEeCC-CCCCC-HHHHHHHHHHH--hccCchHHHHH Confidence 777777777777778888999999999999998864 34456666543 33333 24456666665 68999999887 Q ss_pred HHHhcCccchhhhhHHHHHHhhcccccc-ccchh--HHhhhhhhH Q lcl|NC_019408. 470 YMRKAEVISSDMTFEEFQALRADENSFI-NNPDA--QARQRGYTN 511 (612) Q Consensus 470 ~lqr~~vl~~~~~~eee~~ria~e~~~~-~~~~~--~~~~~~e~~ 511 (612) .+ ..+. ++.+|.++|.+|.... ..... ......|.+ T Consensus 433 ~~---p~v~---D~~~E~eri~~E~~~~~~~~~~~~~~~~~~e~~ 471 (471) T protein:vir:10 433 SN---PIVE---DWQDELRLQKAEQEGRSEKLYDMEEVEHESEVE 471 (471) T ss_pred hC---CCCC---CHHHHHHHHHHHHHHHHhcccccCCCCCccccC Confidence 65 2222 3456666666542110 00000 000000111 No 47 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=99.89 E-value=9.4e-23 Score=141.34 Aligned_cols=434 Identities=10% Similarity=0.005 Sum_probs=235.6 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeecC Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKNL 80 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~~ 80 (612) ||-| +....++..++.+.|.|...+..+..++.-+. . 
.+..+-..=+-.|+.+.+++..+|++|.+||+++.- T Consensus 34 ~i~~--~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~---~--~~~~~~~~ki~~n~~~~Ivd~~~~~l~g~p~~~~~~ 106 (474) T protein:vir:96 34 LIND--HKPKIDDITVGERYYNHDPDVLRLAPKLDNKG---E--IDPLKPDWRMFTNYHQNLVDQKVAYAVANPVTFSSD 106 (474) T ss_pred HHHH--HHHHHHHHHHHHHHhccCCcchhccchhcccc---c--ccccccchhcccchHHHHHHhhhhhhcccCceeecC Confidence 3332 23445667777777888665543322221111 1 111111111336999999999999999999999422 Q ss_pred CHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhcchhhhccCCccceeE Q lcl|NC_019408. 81 PPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILDWDEVVDMGGFYVPSR 160 (612) Q Consensus 81 p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW~~~~~v~g~~~Lt~ 160 (612) .+.....+.++- +++++.....+.+.++.+|+++++|..+. ..+|.+..++|.+++-. +...+.+. .+.. T Consensus 107 d~~~~~~l~~~~--~n~~~~~~~~~~~~~~~~G~~~~~~y~d~------~~~~~i~~~~p~~~~~v-~d~~~~~~-~~~~ 176 (474) T protein:vir:96 107 DDKSLKTIQEVL--NHKWDDKLVDILTAASNKGIEWLQPYIDE------NGEFKTFRVPAEQAIPI-WTNKERDT-LKAF 176 (474) T ss_pred chHHHHHHHHHH--hcCHHHHHHHHHHHHHhcCeeEEEEEecC------CCceEEEEEcccceEEE-EcCCCCCc-eEEE Confidence 233344444442 24677788888999999999999997653 25789999999998864 22112222 1222 Q ss_pred EEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccceeEEE Q lcl|NC_019408. 161 VLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAYVQY 240 (612) Q Consensus 161 v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~~ 240 (612) |+... .+ + ...+ .+...+ .+..|+...... 
...+ T Consensus 177 vr~~~--~~-----~--------~~~~-~~yt~~-------------------~v~~~~~~~~~~--~~~~--------- 210 (474) T protein:vir:96 177 IRYYR--LD-----G--------AERV-EYWTDS-------------------DVTYYEYQDGIL--IPDY--------- 210 (474) T ss_pred EEEEe--ec-----C--------ceEE-EEEeCC-------------------eEEEEEecCCce--eecc--------- Confidence 22211 10 0 0000 111110 011111100000 0000 Q ss_pred EEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhccceeeeecCC Q lcl|NC_019408. 241 LYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALPVYYAPGTD 320 (612) Q Consensus 241 ~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~~ 320 (612) .+.. .... ........-++++.||||.+.... . +.+-|.++-.|-=+.=...|+..+.+...+.|+++++|.+ T Consensus 211 ~~~~--~~~~-~~~~~~~~~~~~g~iPvv~~~nn~--~--g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~ 283 (474) T protein:vir:96 211 YHGE--EHIQ-SHYYVGNKRVSWGRVPFIPFKNNP--Q--EMSDLFMYKTIIDAMDKRLSDTQNTFDESTELIYILKGYE 283 (474) T ss_pred cccc--cccc-ccccccccccCCCceeEEEeccCC--C--CCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCC Confidence 0000 0000 000001123579999999885432 2 2232333333321222355777888889999999999986 Q ss_pred CCCCce--EEEeccccccCC-CCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHh-hhccccchhHHHHHHHHHHHHH Q lcl|NC_019408. 321 SEGTGE--YHIGPNMVWEVP-QGSEPGILEYTGQGLKALETALNDKERQIAAIGGRM-MPGASKSVSESNNQTVLREANE 396 (612) Q Consensus 321 ~~~~~~--l~iG~~~~~~lp-~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~l-l~~~~~~~~esa~~~~~~~~~~ 396 (612) .+..+. ..++...+|.++ .|++++|+..+.. .+..+..++.+.+.|.....-. +...+..++-||++........ 
T Consensus 284 ~~~~~~~~~~~~~~~~i~~~~~~~~~~~l~~~~~-~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l 362 (474) T protein:vir:96 284 GQDLDEFMRNLKYYKAINVDGDGSGVDTIQIEVP-VQSSKEYLDMLRDYVIEFGQGVDFQQDKFGNSPSGIALKFMYSNL 362 (474) T ss_pred cccccchhhhhhcCceEEecCCCCceeEEeecCC-hHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHH Confidence 544332 235566778887 4789999997754 3667888899999888764322 1111222456888777776666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCcceEEEeeccccccCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCc Q lcl|NC_019408. 397 QSLLLNIIQACESGMTDVVRWWLMWRDVPLADTENLRYEVNTDFLSTPIGAREMRAIQLMANDGLLPDPVFYEYMRKAEV 476 (612) Q Consensus 397 ~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G~is~et~~~~lqr~~v 476 (612) .......-..+..++.+.+++++...|... +..++.|.+++. .+. +..++. ..+.++|.||++|++..+ +. T Consensus 363 ~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~-~~~~i~i~f~~~-~p~--~~~e~~--~~~~~ag~iS~et~~~~~---~~ 433 (474) T protein:vir:96 363 DLKANKLKNKTLTALQELLQYIIDFYKLNI-KVQDVEITFNFN-VMV--NELEQS--QIGVQSQYLSKETVVTNH---PW 433 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhCCCc-ccceeeEEeccC-CCc--CHHHHH--HHHHhcCCCchHHHHHhC---CC Confidence 667777778889999999999999998754 334566655432 222 222322 234678999999998765 22 Q ss_pred cchhhhhHHHHHHhhccccccccc-hhHHhhh--hhhHHHHhHH Q lcl|NC_019408. 477 ISSDMTFEEFQALRADENSFINNP-DAQARQR--GYTNRGQELE 517 (612) Q Consensus 477 l~~~~~~eee~~ria~e~~~~~~~-~~~~~~~--~e~~r~~~~e 517 (612) +. ++++|.+++.+|....... ....... .....+++.. 
T Consensus 434 v~---d~~~E~~ri~~E~~e~~~~~~~~~~~~~~~~~d~~~e~~ 474 (474) T protein:vir:96 434 VD---DPVAELERIEQDNIDFNKQLPPLEGDANGRAQDNESETN 474 (474) T ss_pred CC---CHHHHHHHHHHHHHHHHhcccccccccccccCCCcccCC Confidence 22 3566777776553210000 0000000 0001111111 No 48 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=99.89 E-value=3e-22 Score=138.56 Aligned_cols=405 Identities=12% Similarity=0.042 Sum_probs=226.2 Q ss_pred cCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeec--CCHHHHHHHhccCCCCCCHHHHHHHHHHHHHH Q lcl|NC_019408. 34 YLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKN--LPPKFKDAVRRFAKDGSSHATFAKAVLSEQAG 111 (612) Q Consensus 34 YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~--~p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~ 111 (612) |||+- ..+.|+...++++ .|+.+-+|++++++++-...+..+ ....+..++.+ |+++.++..+...++. T Consensus 1 ~l~~~---~~~~~~~~~~~~v-~n~~~~ivd~~~~~l~~~gf~~~d~~~~~~~~~i~~~-----N~~d~~~~~~~~~a~i 71 (434) T protein:vir:98 1 MLPKN---AEQAFLDFQRKAR-TNFCGLIANASVHRLLALGVTGPDGEPDTRASRWWQA-----NRLDSRQKLVWRMAMA 71 (434) T ss_pred CCCCC---ccHHHHHhhhhhh-ccchHHHHHHHHhhhccCceecCCCchHHHHHHHHHh-----cChhHHHHHHHHHHhh Confidence 88764 4477887766654 499999999999998766544422 33456666653 7899999999999999 Q ss_pred hCCeEEEEecCcchh-hhhccCceEEEechhhhhc-chhhhccCCccceeEEEEEEEeeccccccCCCcccccceeeeee Q lcl|NC_019408. 112 VGRFGVLVDVVDNPR-KGAVATSFAVGYSAENILD-WDEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARA 189 (612) Q Consensus 112 ~Gr~~vlVD~p~a~~-~~~~~rPy~~~~~ae~Iin-W~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~ 189 (612) +|+++++|....... .....+|.|..++|++++- |+ .. .++ .+.-|++... +.++. .++. 
T Consensus 72 ~G~ay~~v~~~~~~~~~~~~~~~~I~~~~p~~~~~i~D--~~-~~~-~~~ai~~~~~--------~~~~~------~~~~ 133 (434) T protein:vir:98 72 QSAGYMLVGAHPTRTEDNGRPSPLITMEHPSECIVEYD--PE-TGE-PLVGLKVWHN--------DIDGF------GYAR 133 (434) T ss_pred cCceEEEEecCCCcccccCCceeEEEEeccceeEEEEe--CC-CCc-eEEEEEEEEe--------ccCCc------eEEE Confidence 999999997643221 1223478889999999754 32 12 222 2222222111 11111 1111 Q ss_pred EeeecccccccceeecccccccccceeeeeeeeccccccccccccceeEEEEEeeCCCceecceeeeccCCccccceeEE Q lcl|NC_019408. 190 AALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAYVQYLYEEDPESRPIARIVPTVRGEPLDFIPFK 269 (612) Q Consensus 190 l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v 269 (612) ..+.+- ..+|...... . . .+......+......+...-++|+.||+| T Consensus 134 ~~~~~~-------------------~~~~~~~~~~-----~---~------~~~~~~~~~~~~~~~~~~~~h~~g~vPvv 180 (434) T protein:vir:98 134 VFFDDT-------------------SFPYRTRERT-----G---A------RLPWGPDSWVYTGTADSGDVHDLGGMQLV 180 (434) T ss_pred EEEeCc-------------------EEEEEEeecc-----c---c------ccccccccceecccccccccCCCCccceE Confidence 111100 0000000000 0 0 00001111111122233334689999999 Q ss_pred Eee-cCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhccceeeeecCCCCCCc-----e------EEEeccccccC Q lcl|NC_019408. 270 FFG-ASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALPVYYAPGTDSEGTG-----E------YHIGPNMVWEV 337 (612) Q Consensus 270 ~~~-~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~~~~~~~-----~------l~iG~~~~~~l 337 (612) .|- ....+. .+..-+.++-.|-=+.-+..|+...+..+.++|+++++|.+..... . +..+.+. 
+.+ T Consensus 181 ~f~N~~~~~~-~g~sd~e~vi~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~-i~~ 258 (434) T protein:vir:98 181 EFARMPDLGE-DPEPEFAGVLDIQDRVNLGILNRMAASRFSGFRQKWIKGHKFAKRTDPATGMTVVDQPFVPSPSA-VWA 258 (434) T ss_pred EeccCCCcCc-CCcchhhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcccccccccccchhhhhhhccccc-ccc Confidence 663 322211 1233344444444344466788889999999999999997654211 0 1223333 344 Q ss_pred CCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHhh-hcc-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 338 PQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRMM-PGA-SKSVSESNNQTVLREANEQSLLLNIIQACESGMTDVV 415 (612) Q Consensus 338 p~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~ll-~~~-~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l 415 (612) .+|.+++|.|++...++.+.+.|+.+..++.... .+- ..- ....+.||.+...............-..+..++.+++ T Consensus 259 ~~~~~~~~~q~~~~~~~~~~~~l~~~i~~~~~~~-~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~ 337 (434) T protein:vir:98 259 SEGENTQFGQLDATDLSGFLKEHASDVRDMLTIS-QTPTYLYATDLVNISADTIGALDILHVAKVREHIASFSEGLESVL 337 (434) T ss_pred CCCCCceEEEecCcchHHHHHHHHHHHHHHhccc-CCCHHHhccccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 4577899999999999888888888888775432 221 111 1123568888887777777777777788888999999 Q ss_pred HHHHHHcCCcCCCCcceEEEeeccccccCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCccchhhhh-HHHHHH---hh Q lcl|NC_019408. 416 RWWLMWRDVPLADTENLRYEVNTDFLSTPIGAREMRAIQLMANDGLLPDPVFYEYMRKAEVISSDMTF-EEFQAL---RA 491 (612) Q Consensus 416 ~~~a~w~g~~~~~~~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G~is~et~~~~lqr~~vl~~~~~~-eee~~r---ia 491 (612) ++++.+.|... +..++.+.. ++..+.++ ++.++++.++...| +|++++++.| |+.++++.- +++.+. 
++ T Consensus 338 rl~~~~~g~~~-~~~~~~v~w-~~~~~~s~-~~~ada~~kl~~~g-~~~e~~~~~l---g~~~~e~~r~~~e~~~~~~~~ 410 (434) T protein:vir:98 338 ALAAAQAGVPE-DYTEAEVRW-ANPAHVTM-AVKADAATKLKSIG-YPLDVIAEEL---DESPARVRRIVAGAASQALLA 410 (434) T ss_pred HHHHHhcCCCh-hheeeeEEe-cCCCCCCH-HHHHHHHHHHHhcC-CcHHHHHHhC---CCCHHHHHHHHHHHHHHHHHH Confidence 99999988643 222344444 34444443 66788899998877 6988887654 443322210 111000 00 Q ss_pred -------ccccccccchhHHhhhh Q lcl|NC_019408. 492 -------DENSFINNPDAQARQRG 508 (612) Q Consensus 492 -------~e~~~~~~~~~~~~~~~ 508 (612) .+.+.-+.++.+....+ T Consensus 411 ~~~~~~~~~~~~g~~~~~~~~~dg 434 (434) T protein:vir:98 411 ASLLPAPGAPSAGNVPDSGGAVDG 434 (434) T ss_pred HhhhccCCCCCCCCCCcccCCCCC Confidence 00001011111111222 No 49 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=99.88 E-value=1.3e-22 Score=140.49 Aligned_cols=425 Identities=12% Similarity=0.071 Sum_probs=242.4 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeec- Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKN- 79 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~- 79 (612) ||.| +....++++++.+-|.|...++ |+|+ .....|+..-.+++ .|+.+.+|+.++|++|-+++++.. T Consensus 13 l~~~--~~~~~~r~~~l~~Yy~g~~~i~-----~~~~---~~~~~~~~~~~~~~-~n~~~~ivd~~~~~l~~~g~~~~~~ 81 (456) T protein:vir:79 13 LTKR--IDDGMSRVRLLARYSNGDAPLP-----ELTR---NTSAAWRSFQREAR-TNWGLMVRDSVADRIIPNGITVGGS 81 (456) T ss_pred HHHH--HHHHHHHHHHHHHHHhccCChh-----hcCc---ccChhhchhhhhhh-cchHHHHHHHHHhhhccCCeecCCC Confidence 5555 6667888999999999965443 3433 22334554433333 789999999999999999987721 Q ss_pred ----CCHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhcchhhhccCCc Q lcl|NC_019408. 
80 ----LPPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILDWDEVVDMGGF 155 (612) Q Consensus 80 ----~p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW~~~~~v~g~ 155 (612) +...+..++++ ++++.+.+.++..++.+|+++++|= +..+ ..|.+..++|++++-. ++ ...++ T Consensus 82 ~d~~~~~~~~~~~~~-----n~~d~~~~~~~~~a~~~G~a~~~~~-~~ed-----g~~~i~~~~p~~~~~i-~d-~~~~~ 148 (456) T protein:vir:79 82 ADSDLALRARRIWRD-----NRMDSVCKQWVKYGLDFGESYLTCW-RRDD-----GTATITADSPETMVVS-VD-PLQPW 148 (456) T ss_pred CCccHHHHHHHHHHh-----cChhHHHHHHHHHHhhcCeeEEEEe-eCCC-----CceEEEEeccceeEEE-Ec-CCCCC Confidence 22346666654 5788999999999999999999874 3322 3678899999987653 11 22333 Q ss_pred cceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccc Q lcl|NC_019408. 156 YVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKL 235 (612) Q Consensus 156 ~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~ 235 (612) ..+..+++.... |+ ...+..+...++.. +.||.....+ T Consensus 149 ~~~~~~~~~~~~---------d~-----~~~~~~~~~~~~~~------------------~~~~~~~~~~---------- 186 (456) T protein:vir:79 149 RIRSAMRWWRDL---------DA-----ESDFAIVWSGDGWQ------------------KFARPCFVQS---------- 186 (456) T ss_pred ceEEEEEEEEec---------CC-----ceeEEEEEcCCceE------------------EEEEEEEeec---------- Confidence 344444433210 10 01111222222111 1111100000 Q ss_pred eeEEEEEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCc-CchHHHHHHHHHHHhhhHHHHHHHHHhcccee Q lcl|NC_019408. 236 AYVQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEK-PPLLDICDLNLSHYRTYAELEYGRLFTALPVY 314 (612) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~-pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l 314 (612) .........+.+.+..... -.+.++.|||+++. |.+..+. 
-|..+|.+ ..=+..|+....+.+.++|.+ T Consensus 187 ~~~~~~~~~~~~~~~~~~~----~~~~~~~~pvv~~~---N~~~~gd~e~v~~liD---~~~~~~s~~~~~~~~~a~~~~ 256 (456) T protein:vir:79 187 SSRRRLVTRISDSWVPVGD----AVVTGSPPPVVVYQ---NPDGMGEVEPHIDIIN---RINRAELQLLSTMAIQAFRQR 256 (456) T ss_pred cccceeeeccCCceeeccc----ccCCCCceeEEEec---CCCCCchhhhhHHHHH---HHHHHHHHHHHHHHHHhhHHH Confidence 0000111111111111111 13468899999874 3332222 13333322 122455677788899999999 Q ss_pred eeecCCCC------CCc------eEEEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHhhhccc-cc Q lcl|NC_019408. 315 YAPGTDSE------GTG------EYHIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRMMPGAS-KS 381 (612) Q Consensus 315 ~i~G~~~~------~~~------~l~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~ll~~~~-~~ 381 (612) +++|.+.. +.. .+..+.+..|.+|+|.+++ +++...++.+.+.|+.+..++.....-....-+ .. T Consensus 257 ~~~G~~~~~~~~d~~g~~i~~~~~~~~~~~~~~~~~~~~~~~--q~~~~~~~~~~~~l~~~i~~i~~~t~~p~~~~~~~~ 334 (456) T protein:vir:79 257 ALKSSEHRLPKVDENGNAIDYASIFEAAPGALWELPPGVDIW--ESQTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDS 334 (456) T ss_pred HHhcCCcccccccccccccchhhhhhhhccccccCCCCccee--eecccChHHHHHHHHHHHHHHHhhcCCChhHhcccc Confidence 99997432 111 1334666778888887765 445666777788888888888654321111111 12 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCcceEEEeeccccccCCCHHHHHHHHHHHHcCC Q lcl|NC_019408. 382 VSESNNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLADTENLRYEVNTDFLSTPIGAREMRAIQLMANDGL 461 (612) Q Consensus 382 ~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G~ 461 (612) ++-||++...............-..+..++.+.++++..+.|... ..++.+.+. +-.+.+ .++.++++.++.++|. 
T Consensus 335 ~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~g~~~--~~~i~v~w~-~~~~~s-~~~~ada~~kl~~~G~ 410 (456) T protein:vir:79 335 ANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESV--EDTVDVSFE-SPDRVT-LGEKYSAASLAKAAGE 410 (456) T ss_pred cCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc--cccceEEeC-CCCCcC-HHHHHHHHHHHHhcCC Confidence 345777777777777766777778888999999999999888532 234454443 223333 3667888999999999 Q ss_pred CCHHHHHHHHHhcCccchhhhhHHHHHHhhccccccccchhHHhhhhhhHH Q lcl|NC_019408. 462 LPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFINNPDAQARQRGYTNR 512 (612) Q Consensus 462 is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r 512 (612) +|+++++..| |+-++++. ..+.+++.+|...+.....+ .-++...| T Consensus 411 ~~~~~~~~~l---g~~~~~i~-~~e~~r~~~e~~~~~~~~~~-~~~~~~~~ 456 (456) T protein:vir:79 411 SWASIRRNIL---NYNADQIK-QDDLDRAREQITLFAGNPVQ-RPQEDGSR 456 (456) T ss_pred ChHHHHHhcC---CCCHHHHH-HHHHHHHHHHHHHHhhhHhh-cCCCCCCC Confidence 9999886654 66444332 33444444443222111111 11112122 No 50 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=99.88 E-value=1.8e-22 Score=139.74 Aligned_cols=429 Identities=11% Similarity=0.031 Sum_probs=232.0 Q ss_pred CCCcHHH-HHHHHHHHHHHHHhcCh-HHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceee Q lcl|NC_019408. 1 MVTHPEY-QYWRPEWTKLRDVMAGQ-REIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVK 78 (612) Q Consensus 1 ~~~hP~y-~~~~~~W~~i~d~~~G~-~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~ 78 (612) ||. .| ....++|+.+.+.|.|. ..+... + .... . + +...=+-.|+.+.+|+.++|++|.+||++. 
T Consensus 47 ~i~--~~~~~~~~r~~~~~~yY~g~~~~i~~~------~-~~~~-~-~--~~~~ri~~n~~k~Ivd~~~~yl~g~p~~~~ 113 (501) T protein:vir:96 47 FIN--HHKLRQAPRIQELLDYARGENHDVLKS------G-RRKD-N-E--MADKRAVHNYGRMISKFKTGYLAGNPIRVE 113 (501) T ss_pred HHH--HHHHHHHHHHHHHHHHhcCCCCcccCc------c-ccCc-c-c--cccceeecchHHHHHHHHhhhhcccCeeEe Confidence 332 11 23346778888888884 222111 1 1111 0 1 111124589999999999999999999994 Q ss_pred c----CCHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhcchhhhccCC Q lcl|NC_019408. 79 N----LPPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILDWDEVVDMGG 154 (612) Q Consensus 79 ~----~p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW~~~~~v~g 154 (612) . ..+.+..++.++-.. ++++..+..+++.++.+|+++++|- +.. ..+|.+..++|.+++-- +...+.+ T Consensus 114 ~~~~~~~~~~~~~l~~~~~~-n~~~~~~~~~~~~~~~~G~a~~~v~-~de-----dg~~~i~~~~p~~~~~v-~d~~~~~ 185 (501) T protein:vir:96 114 YDDNDDNSQNDDAIKRIGRI-NDLDSLNRTLIRDLSQTGRAYEVIY-RSE-----YDETRIKRLSPLETFVI-YDNSLED 185 (501) T ss_pred eCCccchhHHHHHHHHHHHh-cCHHHHHHHHHHHHhhcCeEEEEEE-EcC-----CCceEEEEEccceeEEE-EcCCCCC Confidence 1 123445555544333 6899999999999999999999983 322 24788999999998652 1222222 Q ss_pred ccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeecccccccccccc Q lcl|NC_019408. 155 FYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVK 234 (612) Q Consensus 155 ~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~ 234 (612) ..+.-|++..... .++ ...+..+... -++|+ T Consensus 186 -~~~~~v~~~~~~~-------~~~-----~~~~~~vyt~---------------------~~i~~--------------- 216 (501) T protein:vir:96 186 -NSIAAVRYYNRGT-------LQS-----AKDVVEIYTD---------------------EHIYT--------------- 216 (501) T ss_pred -ceEEEEEEEEeec-------CCC-----cEEEEEEEcC---------------------CcEEE--------------- Confidence 2223233221100 000 0000000000 01111 Q ss_pred ceeEEEEEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhcccee Q lcl|NC_019408. 
235 LAYVQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALPVY 314 (612) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l 314 (612) +..+++.+.+ ....++|+.||||.+... .+ +.+-|.++..|-=..=...|+..+.+.+.+.|++ T Consensus 217 -------~~~~~~~~~~-----~~~~~~~g~vPvv~~~nn--~~--g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~~~l 280 (501) T protein:vir:96 217 -------LDASDDFNEI-----SVTTHAFGTVPITEYLNN--ID--GIGDYETELYLIDLYDSAESDTANHMSDMADAIL 280 (501) T ss_pred -------EeeCCCceec-----cccccCCCccceEEecCC--cc--CCCchhhhHHHHHHHHHHHHHHHHHHHHhcCcee Confidence 1111111111 122367999999987532 22 2333444333322222466888889999999999 Q ss_pred eeecCCCCCCce----------EEE-eccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHh-hhccccch Q lcl|NC_019408. 315 YAPGTDSEGTGE----------YHI-GPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRM-MPGASKSV 382 (612) Q Consensus 315 ~i~G~~~~~~~~----------l~i-G~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~l-l~~~~~~~ 382 (612) +++|......+. +.+ ++.+....+.+++++|+..+... +.....++.+.+.|..++.-. +......+ T Consensus 281 ~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~-~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~ 359 (501) T protein:vir:96 281 AIYGDLALPKGMQASDMKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDV-SGAEAYKTRLNRDIHIFTNTPDMSDTNFSG 359 (501) T ss_pred eeecccccCcccchhhhhhcCeeeecccccccccccCcceeeEeccCCH-HHHHHHHHHHHHHHHHHhCCcccCcccccc Confidence 999975433221 222 22222334556789999876644 446777888888887765422 11111234 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCC----CCcceEEEeeccccccCCCHHHHHHHHHHHH Q lcl|NC_019408. 383 SESNNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLA----DTENLRYEVNTDFLSTPIGAREMRAIQLMAN 458 (612) Q Consensus 383 ~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~----~~~~~~v~ln~dF~~~~~d~~~~~al~~~~~ 458 (612) +-||++...............-..+..++.+.+++++.+++.... 
+..++.|.+++ ..+.+ ..+.++++.++ T Consensus 360 n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~-~~p~n-~~e~ad~~~kl-- 435 (501) T protein:vir:96 360 NTSGEALKYKLFGLDQDRVDTQSQFTKGLKRRYRLAARIGSLVNEFKDFDESLLKITFTP-NLPKS-LNEQVSILTGL-- 435 (501) T ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccceEEeCC-CCCcC-HHHHHHHHHHH-- Confidence 568888777777777777777888899999999999998765321 11235555543 33333 24566777776 Q ss_pred cCCCCHHHHHHHHHhcCccchhhhhHHHHHHhhcccccccc-chhHHh---hhhhhHHHHhHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 459 DGLLPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFINN-PDAQAR---QRGYTNRGQELEQSRMAREADFTQQKID 533 (612) Q Consensus 459 ~G~is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~-~~~~~~---~~~e~~r~~~~e~~r~~~e~e~~~q~~e 533 (612) +|.||++|++..+ +.+. ++++|.++|.+|...... ...... .......+.+.+ ++...++.| T Consensus 436 ~g~iS~et~~~~l---~~v~---D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~e~~-------~d~~e~~~~ 501 (501) T protein:vir:96 436 GGQVSQETALSLS---GLVE---SPNEELDKINKEMSEIDFKGYSNDFNEHVGKYTDEVKETH-------TDDFEREYE 501 (501) T ss_pred hccCchHHHHHhC---CCCC---CHHHHHHHHHHHHHHhhccccccchhhcccccCCcCCCCC-------CCccccccC Confidence 4899999998876 2332 345667776554321100 000000 000101111111 010011111 No 51 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=99.88 E-value=8e-23 Score=141.72 Aligned_cols=450 Identities=9% Similarity=0.012 Sum_probs=227.8 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceee-- Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVK-- 78 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~-- 78 (612) |+. .+....+++.++.+-|.|...++ |+| ..-+..++++ -.-.|+.+-+|+.++++++-...++. 
T Consensus 11 L~~--~~~~~~~r~~~l~~Yy~G~~~i~-----~~~---~~~~~~~~~~---~~~~n~~~~ivd~~~~~l~~~g~~~~~d 77 (480) T protein:vir:78 11 LQG--LLARDLPNLLEAEAYRNGTRRLK-----TIG---IGAPPELAYL---DVQPGWVATYLRTLSDRLDIEGFRISED 77 (480) T ss_pred HHH--HHHHHHHHHHHHHHHHhcccccc-----ccc---cccchhHhhh---hhhcchHHHHHHHHHhhhccCceecCCC Confidence 333 35567788888889999876543 232 2223334333 24568999999999999876655552 Q ss_pred -cCCHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhcchhhhccCCccc Q lcl|NC_019408. 79 -NLPPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILDWDEVVDMGGFYV 157 (612) Q Consensus 79 -~~p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW~~~~~v~g~~~ 157 (612) ...+.|..++. .++++..+..+++.++.+|+|+++|........-....|.+..++|++++-. ++..+.+ . T Consensus 78 ~~~~~~l~~i~~-----~N~~d~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~g~~~i~~~~p~~~~~~-~D~~~~~--~ 149 (480) T protein:vir:78 78 SEGLEELWNWWQ-----ANDLDEESVLGHDDSLTFGRSYITVSHPDVESGDPAGIPLIRVESPLYMYAE-LDPRNTR--R 149 (480) T ss_pred chhHHHHHHHHH-----hcCHHHHHHHHHHHHhhcCceEEEEecCccccCCCCCeeEEEEEcccceEEE-EcCCCcc--c Confidence 23455666654 3788999999999999999999999754322223346789999999998753 2212222 2 Q ss_pred eeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeecccccccccccccee Q lcl|NC_019408. 158 PSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAY 237 (612) Q Consensus 158 Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~ 237 (612) ++..+ +-.... ++.. ...+..+... +. +..|+. T Consensus 150 ~~~~i-~~~~~~-------~~~~---~~~~~~~y~~-----------------~~--~~~~~~----------------- 182 (480) T protein:vir:78 150 VTRAV-RLYTTR-------DDVA---VPDRATLYLP-----------------DE--TVPLRR----------------- 182 (480) T ss_pred eEEEE-EEEEee-------cCCC---ceEEEEEEeC-----------------Ce--EEEEEe----------------- Confidence 23222 111110 0000 0111111111 00 011110 Q ss_pred EEEEEeeCCCceecceeeeccCCccccceeEEEee-cCCCCCCcCcCchH-HHHHHHHHHHhhhHHHHHHHHHhccceee Q lcl|NC_019408. 
238 VQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFFG-ASGNTADVEKPPLL-DICDLNLSHYRTYAELEYGRLFTALPVYY 315 (612) Q Consensus 238 ~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~-~~~~~~~~~~pPLl-dLA~lnl~HY~~~sD~~~~l~~~~~P~l~ 315 (612) .+ +....+.......-++|+.||||.|. ....+...+.+-|. +|..|.=+.=+..|+...++.+.++|+++ T Consensus 183 ------~~-~~~~~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~ 255 (480) T protein:vir:78 183 ------NG-GLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRV 255 (480) T ss_pred ------cC-CCccccccccccccCCCCCcceEEeecccccCCccCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhh Confidence 00 00000000011113569999999663 22222222222222 12222112224556788899999999999 Q ss_pred eecCCCCCCc------eEEEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHhhhc--cccchhHHHH Q lcl|NC_019408. 316 APGTDSEGTG------EYHIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRMMPG--ASKSVSESNN 387 (612) Q Consensus 316 i~G~~~~~~~------~l~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~ll~~--~~~~~~esa~ 387 (612) |+|.+..... .+....+..|.+ .|++++|.+++...++.+.+.|+.+..++.....-.... .....+.||. T Consensus 256 i~G~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~ 334 (480) T protein:vir:78 256 ISGVTTDELTNDGENTTLDIYYGRILTL-ASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAE 334 (480) T ss_pred hhcCCccccccccccchhhhhhhhhccC-CCCCceEEecCccCHHHHHHHHHHHHHHHhcccCCChHHhccccCcchHHH Confidence 9998643211 122333344444 467899999999999988888888888876532111111 0111124666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCC-CcceEEEeeccccccCCCHHHHHHHHHHHHcC--CCCH Q lcl|NC_019408. 388 QTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLAD-TENLRYEVNTDFLSTPIGAREMRAIQLMANDG--LLPD 464 (612) Q Consensus 388 ~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~-~~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G--~is~ 464 (612) +..............+-..+..++.+++++++.+.|..... -..+.|++. 
+-...+ .++.+.++.+++++| .+|+ T Consensus 335 Alk~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~g~~~~~~~~~i~v~f~-~~~~~s-~~~~ad~~~kl~~~g~~~~s~ 412 (480) T protein:vir:78 335 AIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWR-DPSTPT-VAAKADAVSKLYANGQGPIPK 412 (480) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCccccceeeeEEec-CCCCCC-HHHHHHHHHHHHHhccccCCH Confidence 66655555444455555666778999999999998853321 123444443 222333 356788889988876 7999 Q ss_pred HHHHHHHHhcCccchhhhh-HHHHHHhhcccc--ccccchhHH-hhhhhhHHHHhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 465 PVFYEYMRKAEVISSDMTF-EEFQALRADENS--FINNPDAQA-RQRGYTNRGQELEQSRMAREADFTQQKIDIQERSVA 540 (612) Q Consensus 465 et~~~~lqr~~vl~~~~~~-eee~~ria~e~~--~~~~~~~~~-~~~~e~~r~~~~e~~r~~~e~e~~~q~~e~~~r~~~ 540 (612) +|++..| |+.++.+.- ++++... .+.. .+.....+. -.....+...... |.+. +. .+. T Consensus 413 et~~~~l---g~~~d~~~~~~~~~~e~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~------~~--~~~ 474 (480) T protein:vir:78 413 EQARIDL---GYTATQREQMRDWDKQE-TEDMIDTLYSTTKAQADATPKPTVTETKT------ETQT------SP--SGF 474 (480) T ss_pred HHHHhcC---CCCHhHHHHHHHHHHHH-HHHHHHHhhccccccCCCCCCCCCCCCCC------cccc------cc--CCC Confidence 9988875 554332211 1110000 0000 000000000 0000000000000 0000 00 000 Q ss_pred HHHHHH Q lcl|NC_019408. 541 VQEGHA 546 (612) Q Consensus 541 ~~~~r~ 546 (612) =..+.+ T Consensus 475 ~~~~~~ 480 (480) T protein:vir:78 475 NRTKTR 480 (480) T ss_pred CcccCC Confidence 000000 No 52 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=99.88 E-value=1.2e-22 Score=140.86 Aligned_cols=445 Identities=9% Similarity=0.017 Sum_probs=231.2 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceee-- Q lcl|NC_019408. 
1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVK-- 78 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~-- 78 (612) |+. .+....+++.++.+-|.|...++ |++.. -...+++.. .-.|+.+-+|+.++|+++-....+. T Consensus 11 L~~--~~~~~~~r~~~~~~Yy~G~~~i~-----~~~~~---~~~~~~~~~---~~~n~~~~ivd~~~~~l~~~g~~~~~d 77 (480) T protein:vir:78 11 LQG--LLARDLPNLLEAEAYRNGTRRLK-----TIGIG---APPELAYLD---VQPGWVATYLRTLSDRLDIEGFRISED 77 (480) T ss_pred HHH--HHHHHHHHHHHHHHHHhccccch-----hcccc---cchhhhhhh---hhcchHHHHHHHHHhhhccCceecCCC Confidence 443 46677888899999999976543 23222 222333221 3369999999999999987766552 Q ss_pred -cCCHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhcchhhhccCCccc Q lcl|NC_019408. 79 -NLPPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILDWDEVVDMGGFYV 157 (612) Q Consensus 79 -~~p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW~~~~~v~g~~~ 157 (612) ...+.|..++. -++++..+..++..++.+|+||++|........-....|.+..++|++++-- +...+.+ . T Consensus 78 ~~~~~~l~~i~~-----~N~~~~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~~~~~i~~~~p~~~~~i-~D~~~~~--~ 149 (480) T protein:vir:78 78 SEGLEELWNWWQ-----ANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAE-LDPRNTR--R 149 (480) T ss_pred chhHHHHHHHHH-----hcCHHHHHHHHHHHHhhcCceEEEeecCccccCCCCCeeEEEEEcccceEEE-EcCCCcc--c Confidence 23455666664 3789999999999999999999999743322222346789999999998741 1111121 2 Q ss_pred eeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeecccccccccccccee Q lcl|NC_019408. 158 PSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAY 237 (612) Q Consensus 158 Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~ 237 (612) ++..+ +-.... + + .. ...+..+... + .+..|+. 
T Consensus 150 ~~~~i-~~~~~~----d--~-~~---~~~~~~~y~~-----------------~--~~~~~~~----------------- 182 (480) T protein:vir:78 150 VTRAV-RLYTTR----D--D-VA---VPDRATLYLP-----------------D--ETVPLRR----------------- 182 (480) T ss_pred eEEEE-EEEEee----c--C-Cc---ceEEEEEEeC-----------------C--eEEEEEe----------------- Confidence 22221 111111 0 0 00 0111111111 0 0001110 Q ss_pred EEEEEeeCCCceecceeeeccCCccccceeEEEee-cCCCCCCcCcCchH-HHHHHHHHHHhhhHHHHHHHHHhccceee Q lcl|NC_019408. 238 VQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFFG-ASGNTADVEKPPLL-DICDLNLSHYRTYAELEYGRLFTALPVYY 315 (612) Q Consensus 238 ~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~-~~~~~~~~~~pPLl-dLA~lnl~HY~~~sD~~~~l~~~~~P~l~ 315 (612) .++. ...+.......-++|+.||||.|. ....+...+.+-|. +|..|.=+.=+..|+...++.+.++|+++ T Consensus 183 ------~~~~-~~~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~sdi~~~i~~l~Da~~~~~s~~~~~~~~~a~p~~~ 255 (480) T protein:vir:78 183 ------NGGL-NDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRV 255 (480) T ss_pred ------cCCC-cccccccccccccCCCCcceEEeecccccCCccCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchhhh Confidence 0100 000000001113579999999664 22222222333232 23332222234567788899999999999 Q ss_pred eecCCCCC------CceEEEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHhh-h--ccccchhHHH Q lcl|NC_019408. 316 APGTDSEG------TGEYHIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRMM-P--GASKSVSESN 386 (612) Q Consensus 316 i~G~~~~~------~~~l~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~ll-~--~~~~~~~esa 386 (612) ++|.+.+. ...+.+..+..|.++ |++++|.++++..++.+.+.++.+..++.... .+- . 
......+-|| T Consensus 256 i~G~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~-~~p~~~fg~~~~n~~Sg 333 (480) T protein:vir:78 256 ISGVTTDELTNDGENTTLDIYYGRILTLA-SEAAKISEFKAAELRNFAEEMEVFRKEAASIT-GLPPQYLSSSSENPASA 333 (480) T ss_pred hhCCCccccccccccchhhhhhhhhccCC-CCCceEEecCccCHHHHHHHHHHHHHHHhccc-CCCHHHhccccCchhHH Confidence 99976432 111333344455554 66799999999999988888888888886542 211 1 1111112467 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCC-CcceEEEeeccccccCCCHHHHHHHHHHHHcC--CCC Q lcl|NC_019408. 387 NQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLAD-TENLRYEVNTDFLSTPIGAREMRAIQLMANDG--LLP 463 (612) Q Consensus 387 ~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~-~~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G--~is 463 (612) .+...............-..+..++.+.+++++.+.|..... ...+.|++... .+.+ .++.+.++.+++++| .+| T Consensus 334 ~Al~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~~~~~~~~~~~i~v~w~~~-~~~s-~~~~ad~~~kl~~~g~~~~s 411 (480) T protein:vir:78 334 EAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDP-STPT-VAAKADAVSKLYANGQGPIP 411 (480) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCccccceeeeEEecCC-CCCC-HHHHHHHHHHHHHhcccCCC Confidence 666666655555556666667778999999999998854321 12344444322 2333 356788888888876 689 Q ss_pred HHHHHHHHHhcCccchhhhhHHHHHHhhcccc--cc-----ccchhHH-hhhhhhHHHHhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 464 DPVFYEYMRKAEVISSDMTFEEFQALRADENS--FI-----NNPDAQA-RQRGYTNRGQELEQSRMAREADFTQQKIDIQ 535 (612) Q Consensus 464 ~et~~~~lqr~~vl~~~~~~eee~~ria~e~~--~~-----~~~~~~~-~~~~e~~r~~~~e~~r~~~e~e~~~q~~e~~ 535 (612) ++|++..| |+.++...--++..+...+.. .. +.++++. ...++..- |.+ T Consensus 412 ~et~~~~l---g~~~d~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~--------- 468 (480) T protein:vir:78 412 KEQARIDL---GYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKT-----------ETQ--------- 468 (480) T ss_pred HHHHHhcC---CCCHhHHHHHHHHHHHHHHHHHHHhhccccCCCccccCCCCCCCCC-----------ccC--------- Confidence 99987765 555433211010000000000 00 0000000 00000000 000 Q ss_pred HHHHHHHHHHHHHH Q lcl|NC_019408. 
536 ERSVAVQEGHAEVA 549 (612) Q Consensus 536 ~r~~~~~~~r~~~e 549 (612) .+--...|.... T Consensus 469 --~~~~~~~~~~~~ 480 (480) T protein:vir:78 469 --TSPSGFNRTKTR 480 (480) T ss_pred --CCcccCCCcCCC Confidence 000000000000 No 53 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=99.88 E-value=4.8e-22 Score=137.45 Aligned_cols=428 Identities=11% Similarity=0.026 Sum_probs=234.1 Q ss_pred CCCcHHH-HHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeec Q lcl|NC_019408. 1 MVTHPEY-QYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKN 79 (612) Q Consensus 1 ~~~hP~y-~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~ 79 (612) ||.| + ....++++.+.+.|.|.. ...+.+....+. .+..+-+-.|+.+.+++.++|++|.+||++.. T Consensus 48 ~i~~--h~~~~~~rl~~l~~yY~g~~------~~i~~~~~~~~~----~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~ 115 (502) T protein:vir:48 48 FINH--HKLRQAPRIQELLDYARGEN------HDVLKSGRRKDN----EMADKRAVHNYGRMISKFKTGYLAGNPIRVEY 115 (502) T ss_pred HHHH--HHHHHHHHHHHHHHHhcCCC------cccccccccccc----ccccceeecchHHHHHHHHhhhhcccCeeEec Confidence 3332 1 233567777777787742 111222111111 11112244799999999999999999999842 Q ss_pred ----CCHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhc-chhhhccCC Q lcl|NC_019408. 80 ----LPPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILD-WDEVVDMGG 154 (612) Q Consensus 80 ----~p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~Iin-W~~~~~v~g 154 (612) ....+..++.++-.. ++++.++..+.+.++.+|+++++|-... 
...|-+..++|.+++- |+ ..+.+ T Consensus 116 ~d~~~~~~~~~~l~~~~~~-N~~~~~~~~~~~~~~~~G~a~~~v~~de------dg~~~i~~~~p~~~~~vyd--d~~~~ 186 (502) T protein:vir:48 116 DDNEDNSQNDDAIKRIGRI-NDIDTHNRNLIRDLSQTGRAYEVIYRSE------YDETRIKRLSPLETFVIYD--NSLED 186 (502) T ss_pred CCccchhHHHHHHHHHHhh-cCHhHHHHHHHHHHhhcCeEEEEEEeCC------CCceEEEEEcccceEEEEc--CCCCC Confidence 123466667777655 6999999999999999999999996533 2467788888888653 32 11111 Q ss_pred ccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeecccccccccccc Q lcl|NC_019408. 155 FYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVK 234 (612) Q Consensus 155 ~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~ 234 (612) . .+.-|++... ... ..+...+++|- .+ T Consensus 187 ~-~~~~ir~~~~--------------------------~~~-------------~~~~~~~~iyt----------~~--- 213 (502) T protein:vir:48 187 N-SIAAVRYYNR--------------------------GTL-------------QNAKDVVEIYT----------NQ--- 213 (502) T ss_pred c-eEEEEEEEEE--------------------------eec-------------CCcEEEEEEEe----------CC--- Confidence 1 1222222111 000 00001111110 00 Q ss_pred ceeEEEEEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhcccee Q lcl|NC_019408. 235 LAYVQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALPVY 314 (612) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l 314 (612) .++ . +..+++.+.+ ...-++++.||||.+.. +.+ +.+-|.++..|-=..-+..|++.+.+...+.|++ T Consensus 214 ~i~--~-~~~~~~~~~~-----~~~~~~~g~vPvv~~~n--n~~--g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l 281 (502) T protein:vir:48 214 HIY--T-LDASDSFNEI-----SVTPHAFGTVPITEFLN--NAD--GIGDYETELYLIDLYDSAESDTANHMSDMADAIL 281 (502) T ss_pred eEE--E-EEeCCceeec-----cceecCCCccceEEecC--CCC--CCCchhhhHHHHHHHHHHHHHHHHHHHHhcCcee Confidence 000 0 1111111111 11236799999998753 333 2233444444433444667888999999999999 Q ss_pred eeecCCCCCCce--EEEeccccc---------cCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHhh-hccccch Q lcl|NC_019408. 
315 YAPGTDSEGTGE--YHIGPNMVW---------EVPQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRMM-PGASKSV 382 (612) Q Consensus 315 ~i~G~~~~~~~~--l~iG~~~~~---------~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~ll-~~~~~~~ 382 (612) +++|......+. ..+.....+ ..+.+++++|+..+.. .+.....++.+.+.|..++.-.- ......+ T Consensus 282 v~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~-~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~~ 360 (502) T protein:vir:48 282 AIYGDLALPQGMQASDMKRTRLMQLKPPKSADGKEGTVKAEYLTKSYD-VSGAEAYKTRLNKDIHVFTNTPDMSDNHFSG 360 (502) T ss_pred eeecCcccccccchhhhhhcceeeccccccccccccCcceeEeeecCC-HHHHHHHHHHHHHHHHHHhCCCCcCcccccc Confidence 999964332221 111122222 2345678999998764 35677888999999887753221 1111124 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCC----CCcceEEEeeccccccCCCHHHHHHHHHHHH Q lcl|NC_019408. 383 SESNNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLA----DTENLRYEVNTDFLSTPIGAREMRAIQLMAN 458 (612) Q Consensus 383 ~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~----~~~~~~v~ln~dF~~~~~d~~~~~al~~~~~ 458 (612) +-||++..................+..++.+.+++++.+++.... +..++.|.+++ ..+.+ ..+.++++.++ T Consensus 361 n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~-~~p~d-~~e~a~~~~kl-- 436 (502) T protein:vir:48 361 NASGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKDFDESRLKITFTP-NLPKS-LYEQVSILNDL-- 436 (502) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccceEEeCC-CCCcC-HHHHHHHHHHH-- Confidence 568888888777777777888899999999999999999875322 12235555533 33333 24556777666 Q ss_pred cCCCCHHHHHHHHHhcCccchhhhhHHHHHHhhccccccccc-hhHHhh----hhhhHHHHhHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 459 DGLLPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFINNP-DAQARQ----RGYTNRGQELEQSRMAREADFTQQKID 533 (612) Q Consensus 459 ~G~is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~-~~~~~~----~~e~~r~~~~e~~r~~~e~e~~~q~~e 533 (612) +|.||++|++..+ +.+. ++++|.++|.+|....... ...... 
.+..+..+ .+.+ T Consensus 437 ~g~iS~et~l~~l---~~v~---D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~d~~~e--------~~~~------- 495 (502) T protein:vir:48 437 GGQVSQETALSLS---GLVE---NPTEELDKINEESSKIDFKGYPSYFYDNVGKYTDEVKE--------THTD------- 495 (502) T ss_pred hccCcHHHHHHhC---CCCC---CHHHHHHHHHHHHHhhhhhcccccccccccccCCCccC--------CCCc------- Confidence 6899999998876 3332 3456677776553211000 000000 00000000 0000 Q ss_pred HHHHHHH Q lcl|NC_019408. 534 IQERSVA 540 (612) Q Consensus 534 ~~~r~~~ 540 (612) +.++--+ T Consensus 496 ~~~~~~~ 502 (502) T protein:vir:48 496 DFERVYE 502 (502) T ss_pred CcCCCCC Confidence 0000000 No 54 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=99.87 E-value=8e-22 Score=136.24 Aligned_cols=439 Identities=10% Similarity=-0.034 Sum_probs=231.0 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeec- Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKN- 79 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~- 79 (612) |+ ..+....+++.++.+-|.|...++ |||+ .-...|+.. ++ ..|+.+-+|+.+++++|-...++.+ T Consensus 21 l~--~~~~~~~~r~~~~~~Yy~G~~~i~-----~~~~---~~~~~~~~~--~~-~~n~~~~ivd~~~~~l~~~g~~~~~~ 87 (485) T protein:vir:10 21 MV--SAFEDSTQNLKTNTSYYEAERRPE-----AIGV---TVPIQMQSL--LA-HVGYPRLYVDSIAERQAVEGFRFGDA 87 (485) T ss_pred HH--HHHHHHHHHHHHHHHHHhcCCcch-----hcCC---CCChhhhhh--hh-hcCcHHHHHHHHHhhhcccceecCCC Confidence 43 556677888999999999966443 3433 333444432 22 3599999999999999766555422 Q ss_pred --CCHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchh--hhhccCceEEEechhhhhcchhhhccCCc Q lcl|NC_019408. 
80 --LPPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPR--KGAVATSFAVGYSAENILDWDEVVDMGGF 155 (612) Q Consensus 80 --~p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~--~~~~~rPy~~~~~ae~IinW~~~~~v~g~ 155 (612) ....+..++.+ |+++.+...+.+.++.+|+|+++|-...... .....+|.|..++|++++-. ++.. .++ T Consensus 88 ~~~~~~~~~i~~~-----N~~d~~~~~~~~~a~i~G~ay~~v~~~e~~~~~~~~~~~~~i~~~~p~~~~~~-~D~~-~~~ 160 (485) T protein:vir:10 88 DEADEELWQWWQA-----NNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNTPIIRVEPPTRMYAE-IDPR-IGR 160 (485) T ss_pred chhHHHHHHHHHh-----cCHhHHHHHHHHHHhhcCceEEEEeeCCcccccccCCCeeEEEEEccceeEEE-EcCC-CCc Confidence 23446666643 8899999999999999999999987643211 12235788999999987642 1111 121 Q ss_pred cceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccc Q lcl|NC_019408. 156 YVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKL 235 (612) Q Consensus 156 ~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~ 235 (612) .+..+++.. .. .+ .... +..+... + .+| T Consensus 161 -~~~~~~~~~--~~------~~----~~~~-~~~~y~~-----------------~----~~~----------------- 188 (485) T protein:vir:10 161 -VSKAIRVAY--DA------EG----NEIQ-AATLYTP-----------------N----DIF----------------- 188 (485) T ss_pred -eeEEEEEEE--ee------CC----CeEE-EEEEEeC-----------------C----eEE----------------- Confidence 222222211 00 00 0000 0000000 0 011 Q ss_pred eeEEEEEeeCCCceecceeeeccCCccccceeEEEeecC-CCCCCcCcC----chHHHHHHHHHHHhhhHHHHHHHHHhc Q lcl|NC_019408. 236 AYVQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFFGAS-GNTADVEKP----PLLDICDLNLSHYRTYAELEYGRLFTA 310 (612) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~-~~~~~~~~p----PLldLA~lnl~HY~~~sD~~~~l~~~~ 310 (612) .+. ..++.|...... -++++.||+|.|... 
..+...+.+ ++.+|.+ +.=...|+...++++.+ T Consensus 189 ----~~~-~~~~~~~~~~~~----~~~~g~vPvv~~~n~~~~~~~~G~s~i~~~v~~liD---a~~~~~s~~~~~~~~~a 256 (485) T protein:vir:10 189 ----GWY-RVENEWQEWFNN----PHGLGVVPVVPIPNRTRLSDLYGTSEITPELRSMTD---AAARILMLMQATAELMG 256 (485) T ss_pred ----EEE-EcCCceEEeccc----cCCCCcccEEEeccccccCCCCCccchhHHHHHHHH---HHHHHHHHHHHHHHhhc Confidence 111 112223222222 267999999976432 222112232 3444432 22245678888999999 Q ss_pred cceeeeecCCCCCC-----c---eEEEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHhhh--cccc Q lcl|NC_019408. 311 LPVYYAPGTDSEGT-----G---EYHIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRMMP--GASK 380 (612) Q Consensus 311 ~P~l~i~G~~~~~~-----~---~l~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~ll~--~~~~ 380 (612) +|++++.|.+.... . .+..+.+..|.+| |++++|.|++..+++.+.+.|+.+..++....--... .... T Consensus 257 ~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~i~~~~-~~d~k~~q~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~ 335 (485) T protein:vir:10 257 VPQRLIFGIKPEEIGVDPETGQTLFDAYLARILAFE-DAEGKIQQFSAAELANFTNALDQIAKQVAAYTGLPPQYLSTAA 335 (485) T ss_pred chHHHHhcCCcccccccccccchhhhhcccceeccC-CCCceEEeecccchHHHHHHHHHHHHHHhcccCCCHHHhcccc Confidence 99999999754321 0 1334455556554 5678899999999888888887777777543210000 1111 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCC--cceEEEeeccccccCCCHHHHHHHHHHHH Q lcl|NC_019408. 381 SVSESNNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLADT--ENLRYEVNTDFLSTPIGAREMRAIQLMAN 458 (612) Q Consensus 381 ~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~--~~~~v~ln~dF~~~~~d~~~~~al~~~~~ 458 (612) ..+-||.+...............-..+..++.++++++..+.+...... ..+.|++.. 
-.+.+ .++.++++.++++ T Consensus 336 ~n~~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~l~~~~~~~~~~~~~~~~i~v~w~~-~~~~~-~~~~ada~~kl~~ 413 (485) T protein:vir:10 336 DNPASAEAIRAAESRLIKKVERKNSIFGGAWEEAMRLAYRMMKGGDVPPDMLRMETVWRD-PSTPT-YAAKADAASKLYN 413 (485) T ss_pred CchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCcccceeeeEEecC-CCCCC-HHHHHHHHHHHHh Confidence 1124666666666666655666667777889999999888876422111 234444432 22333 3567788999999 Q ss_pred cC--CCCHHHHHHHHHhcCccchhhhhHHHHHHhhccccccc--cchhHHhhhhhhHHHHhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 459 DG--LLPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFIN--NPDAQARQRGYTNRGQELEQSRMAREADFTQQKIDI 534 (612) Q Consensus 459 ~G--~is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~--~~~~~~~~~~e~~r~~~~e~~r~~~e~e~~~q~~e~ 534 (612) +| .+|++|+++.| |+.++++. +.+++.++..... ..+..........-+.+.++ ..+.. .+. T Consensus 414 ag~~~~s~et~~~~l---g~~~~~~~---~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~-------~~~ 479 (485) T protein:vir:10 414 GGTGVIPRERARKDM---GYSIAERE---EMRRWDEEEAAMGLGLIGTMVDPNPTVPGSPSPAP-APKPA-------ALE 479 (485) T ss_pred ccccCCCHHHHHHhC---CCCHhHHH---HHHHHHHHHHHHHHHHHHHhhccCCCCCCCCCccc-cccCc-------CCC Confidence 77 89999998765 66554322 2222222110000 00000000000000000000 00000 000 Q ss_pred HHHHHH Q lcl|NC_019408. 535 QERSVA 540 (612) Q Consensus 535 ~~r~~~ 540 (612) --.-++ T Consensus 480 ~~~~~~ 485 (485) T protein:vir:10 480 SGGDAA 485 (485) T ss_pred CCCCCC Confidence 000000 No 55 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=99.87 E-value=8.8e-22 Score=136.01 Aligned_cols=436 Identities=11% Similarity=0.047 Sum_probs=231.1 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeecC Q lcl|NC_019408. 
1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKNL 80 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~~ 80 (612) ||.| ......++|+.+.+.|.|....--....+++. ..... . | +-.|+.+.+++..+|++|.+||++... T Consensus 30 li~~-~~~~~~~r~~~l~~YY~g~~~~i~~~~~~~~~---~~~~~--~---k-i~~n~~~~Iv~~~~~~l~G~p~~~~~~ 99 (506) T protein:vir:94 30 FITH-HFNYQRPRLEMLDDYYQGYNLKILDKQSRRHE---DGKAD--H---R-ATHSFAKYIADFQTSYSVGNPINVKLP 99 (506) T ss_pred HHHH-HHHHHHHHHHHHHHHhcCCCcccccccccccc---ccCCc--c---e-eecchHHHHHHHhhhhhcccCceeecC Confidence 5544 23456788999999999975321110111111 11111 1 1 246999999999999999999998422 Q ss_pred CHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhc-chhhhccCCcccee Q lcl|NC_019408. 81 PPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILD-WDEVVDMGGFYVPS 159 (612) Q Consensus 81 p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~Iin-W~~~~~v~g~~~Lt 159 (612) .+.....+.++-.. ++++.....+.+.++.+|+++++|.... ..+|-+..++|.+++- |+ ..+.+ ..+. T Consensus 100 d~~~~~~l~~~~~~-N~~~~~~~~~~~~~~~~G~a~~~v~~de------d~~~~i~~~~p~~~~~v~d--d~~~~-~~~~ 169 (506) T protein:vir:94 100 DDGSNSGFDTFNKA-NDVDAENYDLFLDMSRYGRAYEYVYRGE------DNEEHLAKLDPLDTFVIYS--TDVDP-KPIM 169 (506) T ss_pred cchHHHHHHHHHhc-cCHhHHHHHHHHHHHhcCeEEEEEEecC------CCeeEEEEEcccceEEEec--CCCCC-ceEE Confidence 33444444444333 6899999999999999999999997643 2478888899998865 31 11121 1222 Q ss_pred EEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccceeEE Q lcl|NC_019408. 160 RVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAYVQ 239 (612) Q Consensus 160 ~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~ 239 (612) -|+.... .. .++-.......+. .+|- .... 
T Consensus 170 ~v~~~~~-~~------~~~~~~~~~~~~~---------------------------~~yt----------------~~~~ 199 (506) T protein:vir:94 170 AVRYHQI-EL------VDDNQVSTINYVP---------------------------ETWT----------------ADTY 199 (506) T ss_pred EEEEEee-ee------ccCCceeEEEEEE---------------------------EEEe----------------CceE Confidence 2222111 10 0000000000000 0000 0001 Q ss_pred EEEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhccceeeeecC Q lcl|NC_019408. 240 YLYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALPVYYAPGT 319 (612) Q Consensus 240 ~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~ 319 (612) ..+..++.++.. .....++|+.||||.+.... . +.+-|.++-.|-=+.-...|+.-+.+.+.+.|+++++|. T Consensus 200 ~~~~~~~~~~~~----~~~~~~~~g~vPvv~~~n~~--~--~~sd~e~~~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~ 271 (506) T protein:vir:94 200 TLYNPTPIMGKM----QVDTTKPITTFPVVEFKNSN--F--RLGDFENVLPLIDLYDAAQSDTANYMTDLNEAMLIIQGD 271 (506) T ss_pred EEeccccCccce----eccccccCCccceEEecCCC--C--CCCchhhhHHHHHHHHHHHHHHHHHHHHhhhHHHHHhcC Confidence 111111112211 11223679999999874322 1 223344433332233455678888888899999999886 Q ss_pred CCCCCce----------------------------------EEEeccccc-cCCCCCceeEEecCchhHHHHHHHHHHHH Q lcl|NC_019408. 320 DSEGTGE----------------------------------YHIGPNMVW-EVPQGSEPGILEYTGQGLKALETALNDKE 364 (612) Q Consensus 320 ~~~~~~~----------------------------------l~iG~~~~~-~lp~~~~~~~lE~~g~~l~~~~~~l~~~e 364 (612) ...+... +.+.++... ..+.+++++||..+... +.....++.+. T Consensus 272 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~-~~~~~~~~~l~ 350 (506) T protein:vir:94 272 IDTLFEGSDMMNTIDPNDEDAMAKLAKDKLELIKEMKDANMLLLKSGMTVNGTQTSVDAKYINKTYDV-VGSEAYKKRVA 350 (506) T ss_pred ccccccchhccccccccccccccccccchhHHHhhhhhcCeeeecccccccCccccccceeeeecCCH-HHHHHHHHHHH Confidence 4322111 112111111 12345689999987754 56788899999 Q ss_pred HHHHHHHHHh-hhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCC----CcceEEEeecc Q lcl|NC_019408. 
365 RQIAAIGGRM-MPGASKSVSESNNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLAD----TENLRYEVNTD 439 (612) Q Consensus 365 ~qm~~lGa~l-l~~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~----~~~~~v~ln~d 439 (612) +.|...+.-. +...+..++-||++...............-..+..++.+++++++.+++...+. ..++.|.+++. T Consensus 351 ~~I~~~s~~p~~~~~~~~~n~Sg~Aik~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~ 430 (506) T protein:vir:94 351 GDIHKFSHTPDLTDENFASNSSGVAMQYKVLGTVELASTKRRMFERGLYARYQIISDIENSIHGDWTFDPQELTFTFRDN 430 (506) T ss_pred HHHHHHhCccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceEEeCCC Confidence 9988765322 111122346688877777777777777778888999999999999987643221 12344544332 Q ss_pred ccccCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCccchhhhhHHHHHHhhccccccccchhHHhhhhhhHHHHhHHHH Q lcl|NC_019408. 440 FLSTPIGAREMRAIQLMANDGLLPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFINNPDAQARQRGYTNRGQELEQS 519 (612) Q Consensus 440 F~~~~~d~~~~~al~~~~~~G~is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~e~~ 519 (612) .+.+ ..+.++++.++ +|.||++|++..+ +.++ ++.+|.++|.+|.......-........ +++.+. T Consensus 431 -~p~d-~~e~a~~~~kl--~g~iS~et~~~~l---p~v~---d~~~E~~ri~~E~~~~~~~~~~~~~~~~---~~~~~~- 496 (506) T protein:vir:94 431 -LPAD-NISQIKALVQA--GATLPQKYLYQQL---PGVT---NPQDIVDMMKEQSANGDYSFDQNGVISN---DGQTNT- 496 (506) T ss_pred -CCcC-HHHHHHHHHHH--hccCChHHHHHhC---CCCC---CHHHHHHHHHHHHHHHhhcchhhcCCCc---ccCccc- Confidence 2222 24456666666 6999999998865 2222 3456666666553210000000000000 000000 Q ss_pred HHHHHHHHHHHHHHHHH Q lcl|NC_019408. 520 RMAREADFTQQKIDIQE 536 (612) Q Consensus 520 r~~~e~e~~~q~~e~~~ 536 (612) . 
+.+..++-+ T Consensus 497 -~------~~~~~~e~~ 506 (506) T protein:vir:94 497 -T------ATQTDEEVR 506 (506) T ss_pred -c------ccccccCCC Confidence 0 000000000 No 56 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=99.87 E-value=1e-21 Score=135.64 Aligned_cols=438 Identities=10% Similarity=0.001 Sum_probs=232.3 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcC------C Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRR------D 74 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k------~ 74 (612) |+ ..+....++++++.+-|.|...++. +| ..-...++.++ ...|+.+-+|+.++.+++-. + T Consensus 16 L~--~~~~~~~~r~~~~~~Yy~g~~~i~~-----~~---~~~~~~~~~~~---~~~n~~~~ivd~~a~~l~~~Gf~~~~~ 82 (488) T protein:vir:23 16 LL--DAFENKQNELKSSKAYYDAERRPDA-----IG---LAVPLDMRKYL---AHVGYPRTYVDAIAERQELEGFRIPSA 82 (488) T ss_pred HH--HHHHHHHHHHHHHHHHHhcccchhh-----cC---cccchhhhhhh---hhcchHHHHHHHHHHhhhccceeccCC Confidence 43 5667778899999999998654442 32 22233444432 23699999999988666322 2 Q ss_pred ceee---cCCHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchh--hhhccCceEEEechhhhhcchhh Q lcl|NC_019408. 75 PIVK---NLPPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPR--KGAVATSFAVGYSAENILDWDEV 149 (612) Q Consensus 75 p~~~---~~p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~--~~~~~rPy~~~~~ae~IinW~~~ 149 (612) .... .-.......+.++- +-++++...+.+.+.++.+|+++++|....... 
.-....|.|..++|.+++-+ ++ T Consensus 83 ~~~~~~~~~d~~~~~~l~~i~-~~N~~~~~~~~~~~~a~i~G~a~~~v~~~~~~~~~~~~~~~~~i~~~~p~~~~~~-~d 160 (488) T protein:vir:23 83 NGEEPESGGENDPASELWDWW-QANNLDIEATLGHTDALIYGTAYITISMPDPEVDFDVDPEVPLIRVEPPTALYAE-VD 160 (488) T ss_pred cccccccccchhHHHHHHHHH-HhcChhHHHHHHHHHHhhcCceEEEEecCCcccccCCCCCcceEEEeccceeEEE-Ee Confidence 2110 00111222222332 236899999999999999999999997643211 11234578889999998775 22 Q ss_pred hccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccc Q lcl|NC_019408. 150 VDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWP 229 (612) Q Consensus 150 ~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~ 229 (612) . ..+. .+.-|++. ... + ++ ..+...+| .+ T Consensus 161 ~-~~~~-~~~~~~~~--~~~-------~----------------~~---------------~~~~~~~y---------~~ 189 (488) T protein:vir:23 161 P-RTRK-VLYAIRAI--YGA-------D----------------GN---------------EIVSATLY---------LP 189 (488) T ss_pred c-CCCc-eEEEEEEE--Eec-------C----------------CC---------------cEEEEEEE---------ec Confidence 1 2222 22222221 000 0 00 00000111 00 Q ss_pred cccccceeEEEEEeeCCCceecceeeeccCCccccceeEEEeecC-CCCCCcCcC----chHHHHHHHHHHHhhhHHHHH Q lcl|NC_019408. 230 SGEVKLAYVQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFFGAS-GNTADVEKP----PLLDICDLNLSHYRTYAELEY 304 (612) Q Consensus 230 ~g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~-~~~~~~~~p----PLldLA~lnl~HY~~~sD~~~ 304 (612) + .++ .+..+.+.|.... ...++++.||||+|-.. ..+..-+.+ ++.+|.+ ++=+..|+... 
T Consensus 190 ~----~~~---~~~~~~~~~~~~~----~~~h~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~l~D---a~~~~~s~~~~ 255 (488) T protein:vir:23 190 D----TTM---TWLRAEGEWEAPT----STPHGLEMVPVIPISNRTRLSDLYGTSEISPELRSVTD---AAAQILMNMQG 255 (488) T ss_pred C----cEE---EEEecCCceEecc----ccccCCCCcceEEeccccccCCcCCccchhhhHHHHHH---HHHHHHHHHHH Confidence 0 001 1112222332222 22367999999976422 222112222 2334332 23356678888 Q ss_pred HHHHhccceeeeecCCCCCC--------ceEEEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_019408. 305 GRLFTALPVYYAPGTDSEGT--------GEYHIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRMMP 376 (612) Q Consensus 305 ~l~~~~~P~l~i~G~~~~~~--------~~l~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~ll~ 376 (612) ++.+.++|+++|.|.+.... ..+..+.+.+|.++.|.+++|.++++.+++...+.|+.+..++....--... T Consensus 256 ~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~v~~~~~g~~~~~~q~~~~~~~~~~~~l~~~i~~~~~~~~~p~~ 335 (488) T protein:vir:23 256 TANLMAIPQRLIFGAKPEELGINAETGQRMFDAYMARILAFEGGEGAHAEQFSAAELRNFVDALDALDRKAASYSGLPPQ 335 (488) T ss_pred HHHHhhhHHHHHhCCCcccccccccccchhhhhhhhhhccCCCCCCceeEecCCCChHHHHHHHHHHHHHHhcccCCCHH Confidence 99999999999999754321 1245677788899999999999999999988888888888887643211111 Q ss_pred c--cccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCC--cceEEEeeccccccCCCHHHHHH Q lcl|NC_019408. 377 G--ASKSVSESNNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLADT--ENLRYEVNTDFLSTPIGAREMRA 452 (612) Q Consensus 377 ~--~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~--~~~~v~ln~dF~~~~~d~~~~~a 452 (612) . .....+.||.+...............-..+..++.+.+++++.+.|...... .++.+++. 
+-.+.+ .++.+++ T Consensus 336 ~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~~~~~i~v~f~-~~~~~s-~~~~ada 413 (488) T protein:vir:23 336 YLSSSSDNPASAEAIKAAESRLVKKVERKNKIFGGAWEQAMRLAYKMVKGGDIPTEYYRMETVWR-DPSTPT-YAAKADA 413 (488) T ss_pred HhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcchhhccceEEec-CCCCCC-HHHHHHH Confidence 1 0111124676766666666666666777778899999999999987432111 23444443 222222 3567788 Q ss_pred HHHHHHcC--CCCHHHHHHHHHhcCccchhhhhHHHHHHhhccc--cccccchhH---HhhhhhhHHHHhHHHHHHHHHH Q lcl|NC_019408. 453 IQLMANDG--LLPDPVFYEYMRKAEVISSDMTFEEFQALRADEN--SFINNPDAQ---ARQRGYTNRGQELEQSRMAREA 525 (612) Q Consensus 453 l~~~~~~G--~is~et~~~~lqr~~vl~~~~~~eee~~ria~e~--~~~~~~~~~---~~~~~e~~r~~~~e~~r~~~e~ 525 (612) +.+++++| .+|++|+++.| ++.++... +.+++.++. +....-+.. ....+...-.. . T Consensus 414 ~~kl~~~g~~~~s~et~~~~l---~~~~d~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~-------- 477 (488) T protein:vir:23 414 AAKLFANGAGLIPRERGWVDM---GYTIVERE---QMRQWLEQDQKQGLGLIGSLYGASTPEGKPGEAP--V-------- 477 (488) T ss_pred HHHHHhcccccCCHHHHHHhC---CCCchHHH---HHHHHHHHHHHHHHHHHHHHhccCCCcccCCCCC--C-------- Confidence 99999976 79999998887 55443221 111111110 000000000 00000000000 0 Q ss_pred HHHHHHHHHHHHHH Q lcl|NC_019408. 526 DFTQQKIDIQERSV 539 (612) Q Consensus 526 e~~~q~~e~~~r~~ 539 (612) .....++. ..| T Consensus 478 --~~~~~~e~-~~a 488 (488) T protein:vir:23 478 --GEPPAPEP-DAA 488 (488) T ss_pred --CCCCCCCC-CCC Confidence 00000000 000 No 57 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=99.86 E-value=3.6e-21 Score=132.65 Aligned_cols=391 Identities=9% Similarity=-0.009 Sum_probs=228.6 Q ss_pred HHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeecCCHHHHH Q lcl|NC_019408. 
7 YQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKNLPPKFKD 86 (612) Q Consensus 7 y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~~p~~l~~ 86 (612) .....++..++.+-|.|...++ ||+ .+-.+.|+..+ ++ ..|+.+-+|++++++++-.--+. ....+.. T Consensus 1 l~~~~~r~~~~~~yY~g~~~~~-----~~~---~~~p~~~~~~~-~~-v~nw~~~~Vds~a~rl~~~Gf~~--~d~~l~~ 68 (410) T protein:vir:95 1 MNLYQSRVNLRYKHYAMQHYEA-----PTG---ITIPAHIRAKY-QA-VLGWAAKGVDSLADRLIFRAFAN--DDFNVTE 68 (410) T ss_pred CCcchhhHHHHHHHhcCCCCcc-----ccc---hhccHHHHhHH-Hh-hcchhHHHHHHhHhhhccccccC--CCchHHH Confidence 3344677788888898876543 332 33344566554 34 46999999999999887665443 2356777 Q ss_pred HHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhcchhhhccCCccceeEEEEEEE Q lcl|NC_019408. 87 AVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILDWDEVVDMGGFYVPSRVLLREF 166 (612) Q Consensus 87 ~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW~~~~~v~g~~~Lt~v~l~E~ 166 (612) ++.. |+++..+..+...+|.||+|+++| +|.. ..+|-|..++|.+++-. ++ .+.++ ...-+++... T Consensus 69 i~~~-----N~ld~~~~~~~~~al~~G~sf~~v-~~~~-----d~~~~i~~~sP~~~~~i-~D-p~~~~-~~~al~~~~~ 134 (410) T protein:vir:95 69 IFDR-----NNPDIFFDSAILSALIGSCSFVYI-SKGE-----DDEVRLQVIESSNATGV-ID-PITGL-LVEGYAVLAR 134 (410) T ss_pred HHhh-----cChHHHHHHHHHHHHHhCceeEEE-ecCC-----CCceEEEEEcccceEEE-Ee-CCCCc-eEEEEEEEEe Confidence 7643 999999999999999999999999 4432 23689999999998764 22 12221 1222222110 Q ss_pred eeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccceeEEEEEeeCC Q lcl|NC_019408. 167 VRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAYVQYLYEEDP 246 (612) Q Consensus 167 v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~~~~~~~~ 246 (612) . .+ |. ....++|- .+ .++.+. 
.++ T Consensus 135 ~--------~~-----------------~~---------------~~~~~~~~----------~~-----~~~~~~-~~~ 158 (410) T protein:vir:95 135 D--------DY-----------------NR---------------PTLEAYFE----------PN-----ATHFIP-KDG 158 (410) T ss_pred c--------CC-----------------Ce---------------EEEEEEEe----------CC-----cEEEEe-eCC Confidence 0 00 00 00001110 00 000111 122 Q ss_pred CceecceeeeccCCccccceeEEEee-cCCCCCCcCc----CchHHHHHHHHHHHhhhHHHHHHHHHhccceeeeecCCC Q lcl|NC_019408. 247 ESRPIARIVPTVRGEPLDFIPFKFFG-ASGNTADVEK----PPLLDICDLNLSHYRTYAELEYGRLFTALPVYYAPGTDS 321 (612) Q Consensus 247 ~~~~~~~~~p~~~g~~l~~IP~v~~~-~~~~~~~~~~----pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~~~ 321 (612) +.|. + .+++++||+|.|- ..+.+...+. .|+.+|.+ ..-+..++...+.++.++|..++.|++. T Consensus 159 ~~~~----~----~~~~g~vPvV~f~n~~~l~~~~G~s~I~~~v~~l~d---a~~r~~~~~~~~~e~~a~pqr~i~G~d~ 227 (410) T protein:vir:95 159 EPYS----V----TNETGIPLLVPVIHRPDAVRPFGRSRITRAGMYYQK---YAKRTLERADITAEFYSWPQKYILGLDP 227 (410) T ss_pred cccc----c----cCCCCCcceEEecccccCCccCCccccchhHHHHHH---HHHHHHHHHHHHHHHhcchhheeeccCC Confidence 2221 1 2579999999764 2222211222 35666643 2336667778899999999999999976 Q ss_pred CCC--ceEEEeccccccCCCCC---ceeEEecCchhHHHHHHHHHHHHHHHHHHHHHhhhcccc-ch-hHHHHHHHHHHH Q lcl|NC_019408. 322 EGT--GEYHIGPNMVWEVPQGS---EPGILEYTGQGLKALETALNDKERQIAAIGGRMMPGASK-SV-SESNNQTVLREA 394 (612) Q Consensus 322 ~~~--~~l~iG~~~~~~lp~~~---~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~ll~~~~~-~~-~esa~~~~~~~~ 394 (612) ... +.+.+..+..|.+|++. .+++-|+++..+.-+.+.|+.+..++.+...-....-+. .. +.||.+...... 
T Consensus 228 d~~~~~~~~~~~~~i~~~~~~~~~~~~~v~q~~~~~l~~~~~~l~~l~~~~a~~s~lP~~~lg~~~~NpsSa~Al~a~~~ 307 (410) T protein:vir:95 228 DAEPMEKWKATVSSLLTISSSDKGVKPSVGQFTTASMSPFTEQLRTAAAGFAGEMGLTLDDLGFVSDNPSSVEAIKASHE 307 (410) T ss_pred CCCcCchhhhhhhhheeccCCCCCCcceEEecCCCChHHHHHHHHHHHHHHhhhcCCCHHHhccccCchhHHHHHHHHHH Confidence 432 23455666789998654 488999999999988888888888887653222222111 11 245555554444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCC---CCcceEEEeec--cccccCCCHHHHHHHHHHHHc--CCCCHHHH Q lcl|NC_019408. 395 NEQSLLLNIIQACESGMTDVVRWWLMWRDVPLA---DTENLRYEVNT--DFLSTPIGAREMRAIQLMAND--GLLPDPVF 467 (612) Q Consensus 395 ~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~---~~~~~~v~ln~--dF~~~~~d~~~~~al~~~~~~--G~is~et~ 467 (612) .........-..+..++.++++++....+.... ....+.|..-+ |-....+ ++..+++.+++++ |.++++++ T Consensus 308 ~L~~ka~~k~~~fg~~l~~~~rla~~i~~~~~~~~~~~~~~~v~W~p~~d~~~~s~-a~~aDa~~Kl~~a~~g~~~~~~~ 386 (410) T protein:vir:95 308 NLRLAGRKAQRSLGAGLLNVAYVAACLRDEFRYTRSQFVRTAVKWEPLFEADANTM-TMIGDGVVKLNQALPGYINAETI 386 (410) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccccceeeEEeeecCCcchhhH-HHHHHHHHHHHHhccCCccHHHH Confidence 444444445555677788888887776553211 11233443422 2222333 6788999999998 78999998 Q ss_pred HHHHHhcCccchhhhhHHHHHHhhcccccccc Q lcl|NC_019408. 
468 YEYMRKAEVISSDMTFEEFQALRADENSFINN 499 (612) Q Consensus 468 ~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~ 499 (612) ++.| |+-+ +++..+..++....++ T Consensus 387 ~~~l---g~~~-----~~~~~~~~~e~~~~g~ 410 (410) T protein:vir:95 387 RDLT---GIAG-----DMSAKPVVSEGGSNGE 410 (410) T ss_pred HHhc---CCCh-----HHHHHHHHHHHHhCCC Confidence 8877 5532 2333333322211111 No 58 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=99.86 E-value=1.3e-21 Score=135.13 Aligned_cols=444 Identities=10% Similarity=-0.006 Sum_probs=234.4 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhccc-ccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeec Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAE-AYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKN 79 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~-~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~ 79 (612) ++.| +....+++..+.+.|.|...+..+-. .|-...++...... +...=.-.|+.+.+++..+|++|.+||++.. T Consensus 13 ~~~~--~~~~~~~~~~~~~Yy~g~~~I~~~~~~~~~~~~~~~~~~~~--~~~~ki~~n~~k~Iv~~~~~yl~G~p~~~~~ 88 (470) T protein:vir:10 13 TSTS--RNDLINNYKQAVNYYENKTDITTRNNGKAKLNKEGKKDPLR--SADNRIPSNFYQLLVDQEAGYVASVFPDIDV 88 (470) T ss_pred HHHH--HHHHHHHHHHHHHHhccccchhccccchhcccccccccccc--cCCcccccchHHHHHHhhhhheeccceeeec Confidence 3433 34567888999999999876654321 11111111111111 1111234899999999999999999999842 Q ss_pred CCH----HHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhcchhhhccCCc Q lcl|NC_019408. 80 LPP----KFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILDWDEVVDMGGF 155 (612) Q Consensus 80 ~p~----~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW~~~~~v~g~ 155 (612) -.+ .|..++. .+.......+...++.+|+++++|=+.. .+++-+..++|.+++=. +...+.+. 
T Consensus 89 ~d~~~~~~l~~~~~------~~~~~~~~~l~~~~~~~G~a~~~~y~d~------~~~~~~~~~~p~~~~~v-~d~~~~~~ 155 (470) T protein:vir:10 89 GKDADNKKIIDVLG------DDRALTLNGLLVDSSNAGRAWLHYWIDE------DGNFRYGIIQPDQITPI-YATTLDNK 155 (470) T ss_pred CchHHHHHHHHHHh------hhHHHHHHHHHHHHhhcCeeEEEEEecC------CCceEEEEEcccceEEE-EcCCCCCc Confidence 122 2333332 3456666788899999999999985432 23677888888887753 11122222 Q ss_pred cceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccc Q lcl|NC_019408. 156 YVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKL 235 (612) Q Consensus 156 ~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~ 235 (612) .+..|++-..... .+ ...+..+ .....++ +..|+........... T Consensus 156 -~~a~ir~y~~~~~------~~---~~~~~~~-e~yt~~~-------------------~~~~~~~~~~~~~~~~----- 200 (470) T protein:vir:10 156 -LLGILRSYKQLDP------DS---GKYFTVH-EYWTDKE-------------------AQFFRTNATDSTVIEP----- 200 (470) T ss_pred -eEEEEEEEEeeec------CC---ceEEEEE-EEEcCCc-------------------EEEEEeecCcceeccc----- Confidence 2222322221111 00 0001000 0111100 0111100000000000 Q ss_pred eeEEEEEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhccceee Q lcl|NC_019408. 236 AYVQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALPVYY 315 (612) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l~ 315 (612) ......-...............++|+.||||.+... .. 
+.+-|.++-.|-=+.=...|++.+.+.+.+.|+++ T Consensus 201 ---~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn--~~--g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv 273 (470) T protein:vir:10 201 ---YNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFSKN--KY--RLPELNKYKGLIDAYDDIYNGFINDLDDVQTVILV 273 (470) T ss_pred ---cccccccccccccccccccccccCCCeeeEEEeecC--CC--CCCchhHHHHHHHHHHHHHHHHHHHHHHhcCccee Confidence 000000000000001111223467999999988643 22 23334444333333334567888889999999999 Q ss_pred eecCCCCCCceE--EEeccccccCC-----CCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHhhhccccchhHHHHH Q lcl|NC_019408. 316 APGTDSEGTGEY--HIGPNMVWEVP-----QGSEPGILEYTGQGLKALETALNDKERQIAAIGGRMMPGASKSVSESNNQ 388 (612) Q Consensus 316 i~G~~~~~~~~l--~iG~~~~~~lp-----~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~ll~~~~~~~~esa~~ 388 (612) ++|...+..... .+....++.++ .+++++|+..+.+. +..+..|+.+++.|...+.-.-....+.++-||++ T Consensus 274 l~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~lt~~~~~-~~~~~~~~~L~~~I~~~s~~p~~~~~~~gn~Sg~A 352 (470) T protein:vir:10 274 LTNYGGADLHQFMNDLRKYKSIKINNTGNGDNSGVDKLQIDIPV-EARDDALKITRKNIFLFGQGIDPANFESSNASGVA 352 (470) T ss_pred eecCCccccchhhhhhhhcCeEeccCCCCCcCceeEEEeecCCh-HHHHHHHHHHHHHHHHHhCCCCCCccccccchHHH Confidence 999754432221 22333455554 35679999988764 67889999999999887643311222335677777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCcceEEEeeccccccCCCHHHHHHHHHHHHcCCCCHHHHH Q lcl|NC_019408. 389 TVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLADTENLRYEVNTDFLSTPIGAREMRAIQLMANDGLLPDPVFY 468 (612) Q Consensus 389 ~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G~is~et~~ 468 (612) ...........-...-..+..++.+.+++++.++|....+..++.|.+++..... 
..+.++.+..+ +|.||.+|++ T Consensus 353 lk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~l~~~~~d~~~i~i~f~~~~p~d--~~e~~~~~~~~--~g~iS~et~l 428 (470) T protein:vir:10 353 IKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFSDADKRHISQHWTRTKVED--SLTKAQIVSTV--ANYSSKEAVA 428 (470) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccceeeEEeccCCCCC--HHHHHHHHHHH--hccCcHHHHH Confidence 7777776666667777777889999999999999875444556666665443222 13344444443 7999999998 Q ss_pred HHHHhcCccchhhhhHHHHHHhhcccccccc--chhHHhhhhhhHHHHhHHHHHHHHHHHHHH Q lcl|NC_019408. 469 EYMRKAEVISSDMTFEEFQALRADENSFINN--PDAQARQRGYTNRGQELEQSRMAREADFTQ 529 (612) Q Consensus 469 ~~lqr~~vl~~~~~~eee~~ria~e~~~~~~--~~~~~~~~~e~~r~~~~e~~r~~~e~e~~~ 529 (612) ..+ +.++ ++.+|.+++.+|...... ............ . ++ T Consensus 429 ~~~---p~v~---D~~~E~eri~~E~~e~~~~~~~~~~~~~~~~d----d-----------e~ 470 (470) T protein:vir:10 429 KAN---PIVD---DWQQELKDLAKDKEENDPYSNQADELNGKGVN----D-----------EQ 470 (470) T ss_pred HhC---CCCC---CHHHHHHHHHHHHHHHHHhhccccccCCCCCC----C-----------CC Confidence 765 3332 355666776654211000 000000000000 0 00 No 59 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=99.86 E-value=1.2e-21 Score=135.33 Aligned_cols=458 Identities=13% Similarity=0.047 Sum_probs=234.6 Q ss_pred CCC--cHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceee Q lcl|NC_019408. 1 MVT--HPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVK 78 (612) Q Consensus 1 ~~~--hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~ 78 (612) +|. .+.|....++++++.+-|.|...+ .|||+. -...|+....++ -.|+.+-+|+.+++++|-...++. 
T Consensus 31 l~~~l~~~~~~~~~rl~~l~~YY~G~~~~-----~~~~~~---~~~~~~~~~~~~-v~n~~~~ivd~~a~~l~~~gf~~~ 101 (501) T protein:vir:25 31 LVADMWRLHISERQWLDRIYEYTKGLRGR-----PEVPEG---ASDEVKELAKLS-VKNVLSLVRDSFAQNLSVVGYRNA 101 (501) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCc-----hhcccc---CChhhhhhHhhh-hcChHHHHHHHHHhhhcccceecC Confidence 121 244666778888888888775433 234432 334455543333 348999999999999886655542 Q ss_pred --cCCHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhh-cchhhhccCCc Q lcl|NC_019408. 79 --NLPPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENIL-DWDEVVDMGGF 155 (612) Q Consensus 79 --~~p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~Ii-nW~~~~~v~g~ 155 (612) +..+.+..+++ -|+++.....+...++.||+++++| |+..+ -|.+..++|.+++ =|+ +. +..+ T Consensus 102 d~~~~~~l~~i~~-----~N~~d~~~~~~~~~a~i~G~ay~~v-~~de~------~~~i~~~sp~~~~~iy~-D~-~~~~ 167 (501) T protein:vir:25 102 LAKENDPAWEMWQ-----RNRMDARQAEVHRPALTYGASYVTV-TPTDE------GPVFRTRSPRQILAVYA-DP-SVDA 167 (501) T ss_pred CccchHHHHHHHH-----hcChhHHHHHHHHHHhhcCceEEEE-ecCCC------CCeEEEeccccEEEEEe-cC-CCCc Confidence 23344555543 3678999999999999999999998 44322 2678899999986 342 11 1111 Q ss_pred cceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeecccccc-cccccc Q lcl|NC_019408. 156 YVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEW-PSGEVK 234 (612) Q Consensus 156 ~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~-~~g~~~ 234 (612) . ....++-.... .+.+ .... -.+...+ .+|.......... 
..+..+ T Consensus 168 ~--~~~ai~~~~~~----~~~~-----~~~~-~~~y~~~---------------------~~~~~~~~~~~~~~~~~~~~ 214 (501) T protein:vir:25 168 W--PQYALETWVAQ----KDAK-----PHRR-GVLYDDT---------------------YMYELDLGEVVLGDAGGGQA 214 (501) T ss_pred c--eeEEEEEEeec----cccC-----ccee-EEEecCe---------------------eEEEEecCceeeeecccccc Confidence 1 12222211111 0000 0000 0011000 0111100000000 000000 Q ss_pred ceeEEEEEeeCCCceec-ceeeeccCCccccceeEEEeecC-C-CCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhcc Q lcl|NC_019408. 235 LAYVQYLYEEDPESRPI-ARIVPTVRGEPLDFIPFKFFGAS-G-NTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTAL 311 (612) Q Consensus 235 ~~~~~~~~~~~~~~~~~-~~~~p~~~g~~l~~IP~v~~~~~-~-~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~ 311 (612) . ......+.. ........-++++.||||.|-.. . +++.. +-+.++-.|.=+.=+..|+...+.++.++ T Consensus 215 ~-------~~~~~~~~~~~~~~~~~~~~~~~~vPiv~f~N~~~~~~~g~--sdie~v~~l~Da~~~~~s~~~~~~e~~a~ 285 (501) T protein:vir:25 215 T-------QQPVNVREVTDVIEHGATFEGKPVCPVVRFVNGRDADDMIV--GEVAPLILLQQAINSVNFDRLIVSRFGAN 285 (501) T ss_pred c-------cccccccccccccccccccCCccceeeEeccCccccCcccc--chhhhhHHHHHHHHHHHHHHHHHHHhhcc Confidence 0 000001111 11111223467999999976432 2 23333 32333333322333567788899999999 Q ss_pred ceeeeecCCCCCCceEEEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHhhh-ccccchhHHHHHHH Q lcl|NC_019408. 312 PVYYAPGTDSEGTGEYHIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRMMP-GASKSVSESNNQTV 390 (612) Q Consensus 312 P~l~i~G~~~~~~~~l~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~ll~-~~~~~~~esa~~~~ 390 (612) |++|+.|++.+..+.+.+..+..|.+| |++++|.+.+...++.+.+.|+.+..+|....--... -.....+-||.+.. 
T Consensus 286 p~~~i~G~~~~~~~~~~~~~~~i~~~~-~~~~~~~q~~~~~~~~~~~~l~~~i~~i~~~s~~P~~~~~~~~~N~Sg~Al~ 364 (501) T protein:vir:25 286 PQRVISGWTGSKAEVLKASALRVWTFE-DPEVKAQAFPPASVEPYNLILEEMLQHVAMVAQISPAQVTGKMINVSAEALA 364 (501) T ss_pred HHHHHhCCCCCccchhhhcccceeccC-CCCceEEEecccChHHHHHHHHHHHHHHHhhcCCChhhhccccCChHHHHHH Confidence 999999997665555666666777765 5678889999988888888899999888664321111 11122345777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCC-cceEEEeeccccccCCCHHHHHHHHHHHHcCCCCHHHHHH Q lcl|NC_019408. 391 LREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLADT-ENLRYEVNTDFLSTPIGAREMRAIQLMANDGLLPDPVFYE 469 (612) Q Consensus 391 ~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~-~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G~is~et~~~ 469 (612) .............-..+..++.+++++++...|...... .++.|.. ++..+..+ ++.++++.++.++| ||.+|++. T Consensus 365 ~~~~~l~~ka~~k~~~f~~~l~~~~rl~~~~~~~~~~~~~~~i~v~w-~~~~~~s~-~~~ada~~kl~~~g-is~et~~~ 441 (501) T protein:vir:25 365 AAEANQQRKLAAKRESFGESWEQLLRLAAEMDDDPDTAADSGAEVLW-RDTEARSF-GAVVDGITKLASAG-IPIEHLLS 441 (501) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccceeeeEEe-cCCCCCCH-HHHHHHHHHHHhcC-CCHHHHHH Confidence 777666666677777788899999999998888532211 1233332 34444443 66788899999887 79999877 Q ss_pred HHHhcCccchhhhhHHHHHHhhcccc--ccccchhHHhhhhhhHHHHhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 470 YMRKAEVISSDMTFEEFQALRADENS--FINNPDAQARQRGYTNRGQELEQSRMAREADFTQQKIDIQERSVA 540 (612) Q Consensus 470 ~lqr~~vl~~~~~~eee~~ria~e~~--~~~~~~~~~~~~~e~~r~~~~e~~r~~~e~e~~~q~~e~~~r~~~ 540 (612) .+ -|+-++++ +...+..+++.. 
..................+++.+...+.+ ..-.-.+ T Consensus 442 ~~--~g~~~~~i--e~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~g~ 501 (501) T protein:vir:25 442 MV--PGMTQQTI--QAIKDSLRGGEVKSLVDKLLSNEPAPVPPPPPQAAAQALNEGG---------VNGNGGA 501 (501) T ss_pred Hc--CCCCHHHH--HHHHHHHHHHhHHHHHHHhhccCcCCCCCCCCCCCcccccccc---------CCCCCCC Confidence 65 23322221 111111111100 00000000000000000000000000000 0000000 No 60 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=99.85 E-value=6.4e-21 Score=131.29 Aligned_cols=441 Identities=10% Similarity=-0.006 Sum_probs=232.6 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeec- Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKN- 79 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~- 79 (612) || -.|....+++.++.+-|.|...++ |+++.. +..+++. +. ..|+.+-+|+.++++++-...++.+ T Consensus 21 L~--~~~~~~~~r~~~~~~YY~G~~~i~-----~~~~~~---~~~~~~~--~~-~~n~~~~ivd~~~~~l~~~g~~~~~~ 87 (485) T protein:vir:24 21 MV--SAFEDQNQNLRSNTSYYEAERRPE-----AIGVTV---PVQMQSL--LA-HVGYPRLYVDSIAERQAVEGFRLGDA 87 (485) T ss_pred HH--HHHHHHHHHHHHHHHHHhccCchh-----hcCccc---chhhhhh--hh-ccchHHHHHHHHhhhhccCceecCCC Confidence 43 345666788888888898876554 344322 2223332 22 3599999999999999888777632 Q ss_pred --CCHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchh--hhhccCceEEEechhhhhcchhhhccCCc Q lcl|NC_019408. 80 --LPPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPR--KGAVATSFAVGYSAENILDWDEVVDMGGF 155 (612) Q Consensus 80 --~p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~--~~~~~rPy~~~~~ae~IinW~~~~~v~g~ 155 (612) ....+..++.+ ++++.++..++..++.||++|++|....... .-...+|-+..++|++++-. 
+...+ + T Consensus 88 ~~~~~~l~~i~~~-----N~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~~i~~~~p~~~~~i-~D~~~-~- 159 (485) T protein:vir:24 88 DEADEELWQWWQA-----NNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNVPLIRVEPPTRMYAE-IDPRI-G- 159 (485) T ss_pred chhHHHHHHHHHh-----cChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCcceEEEeccceeEEE-eeCCc-C- Confidence 22446666643 6899999999999999999999997643211 11235678899999988653 12122 1 Q ss_pred cceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccc Q lcl|NC_019408. 156 YVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKL 235 (612) Q Consensus 156 ~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~ 235 (612) .++..+.+.. .+ ++ +..+..++|- . + T Consensus 160 -~~~~~~~~~~-~~------------------------~~--------------~~~~~~~~y~---~-------~---- 185 (485) T protein:vir:24 160 -RPAKAIRVAY-DA------------------------EG--------------NEIQAATLYT---P-------N---- 185 (485) T ss_pred -ceeEEEEEEE-ee------------------------cC--------------CeEEEEEEEc---C-------C---- Confidence 2222221110 00 00 0000011110 0 0 Q ss_pred eeEEEEEeeCCCceecceeeeccCCccccceeEEEeec-CCCCCCcCcCchH-HHHHHHHHHHhhhHHHHHHHHHhccce Q lcl|NC_019408. 236 AYVQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFFGA-SGNTADVEKPPLL-DICDLNLSHYRTYAELEYGRLFTALPV 313 (612) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~-~~~~~~~~~pPLl-dLA~lnl~HY~~~sD~~~~l~~~~~P~ 313 (612) .++.+...+ +.|...... -++|+.||||.|-. ...+..-+.+-|. +|..|.=+.=+..|+...++.+.++|+ T Consensus 186 -~~~~~~~~~-~~~~~~~~~----~h~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~ 259 (485) T protein:vir:24 186 -ETFGWFRAE-GEWVEWFSD----PHGLGAVPVVPLPNRTRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQ 259 (485) T ss_pred -cEEEEEecC-CceEeeccc----ccCCCcccEEEeccCcccCCcCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchh Confidence 001111111 122211111 26799999997632 2222112333222 222221122355688899999999999 Q ss_pred eeeecCCCCCC--------ceEEEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHhhh--ccccchh Q lcl|NC_019408. 
314 YYAPGTDSEGT--------GEYHIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRMMP--GASKSVS 383 (612) Q Consensus 314 l~i~G~~~~~~--------~~l~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~ll~--~~~~~~~ 383 (612) +++.|.+.... ..+..+.+..|.+| |++++|.+++.++++.+.+.|+.+..++....--... ......+ T Consensus 260 ~~i~G~~~~~~~~~~~~~~~~~~~~~~~i~~~~-~~~~~~~q~~~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~ 338 (485) T protein:vir:24 260 RLIFGIKPEEIGVDPETGQTLFDAYLARILAFE-DAEGKIQQFSAAELANFTNALDQIAKQVAAYTGLPPQYLSTAADNP 338 (485) T ss_pred hhhccCCccccccccccccchhhhcccceeccC-CCCceEEeecccchHHHHHHHHHHHHHHhcccCCCHHHhccccCcc Confidence 99999754321 11344556666665 4678889999999888888777777777543211010 1111112 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCC--CCcceEEEeeccccccCCCHHHHHHHHHHHHcC- Q lcl|NC_019408. 384 ESNNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLA--DTENLRYEVNTDFLSTPIGAREMRAIQLMANDG- 460 (612) Q Consensus 384 esa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~--~~~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G- 460 (612) -||.+..............+-..+..++.+.+++++.+.+.... +..++.|++.... +.+ .++.++++.+++++| T Consensus 339 ~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~~~~~~~d~~~i~v~f~~~~-~~s-~~~~ad~~~kl~~~g~ 416 (485) T protein:vir:24 339 ASAEAIRAAESRLIKKVERKNAIFGGAWEEAMRLAYRLMKGGDVPPDMLRMETVWRDPS-TPT-YAAKADAATKLYGNGQ 416 (485) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCccccceeeEEecCCC-CCC-HHHHHHHHHHHHhccc Confidence 47777777766666666777777888999999999988663211 1224455543222 222 356778888888866 Q ss_pred -CCCHHHHHHHHHhcCccchhhhhHHHHHHhhccccccc--cchhHHhhhhhhHHH-HhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 461 -LLPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFIN--NPDAQARQRGYTNRG-QELEQSRMAREADFTQQKIDIQE 536 (612) Q Consensus 461 -~is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~--~~~~~~~~~~e~~r~-~~~e~~r~~~e~e~~~q~~e~~~ 536 (612) .+|++|+++.| ++.++.. ++..++.++..... 
..+.........+-+ ...+++ ..+.+.+.- T Consensus 417 ~~~s~et~~~~l---~~~~d~~---~e~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~e~~--------~~~~~~~~~ 482 (485) T protein:vir:24 417 GVIPRERARKDM---GYSIAER---EEMRRWDEEEAAMGLGLLGTMVDADPTVPGSPNPTPAP--------KPQPAIEGG 482 (485) T ss_pred ccCCHHHHHhhC---CCCHhHH---HHHHHHHHHHhhhhhhHHHhhcccCCCCCCCCCCCCCC--------CCccCCCCC Confidence 79999997654 5544332 22222222211000 000000000000000 000000 000000000 Q ss_pred HHHH Q lcl|NC_019408. 537 RSVA 540 (612) Q Consensus 537 r~~~ 540 (612) -.+ T Consensus 483 -~~a 485 (485) T protein:vir:24 483 -DSA 485 (485) T ss_pred -CCC Confidence 000 No 61 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=99.85 E-value=1.9e-21 Score=134.14 Aligned_cols=440 Identities=10% Similarity=-0.027 Sum_probs=232.0 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeec- Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKN- 79 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~- 79 (612) ||. .|....++.+++.+-|.|...++. +|.. -+..+++. + ...|+.+-+|+.++++++-...++.+ T Consensus 21 l~~--~~~~~~~r~~~l~~YY~G~~~i~~-----~~~~---~~~~~~~~--~-~v~n~~~~iVd~~~~~l~~~g~~~~~~ 87 (486) T protein:vir:42 21 MIS--AFEDASKDLASNTSYYDAERRPEA-----IGVT---VPREMQQL--L-AHVGYPRLYVDSVAERQAVEGFRLGDA 87 (486) T ss_pred HHH--HHHHHHHHHHHHHHHhcccCcchh-----cccc---cchhHhhh--h-hccchHHHHHHHHHhhhcccceecCCC Confidence 443 366778889999999999775543 3221 12223322 2 23599999999999998655544422 Q ss_pred --CCHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchh--hhhccCceEEEechhhhhcchhhhccCCc Q lcl|NC_019408. 
80 --LPPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPR--KGAVATSFAVGYSAENILDWDEVVDMGGF 155 (612) Q Consensus 80 --~p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~--~~~~~rPy~~~~~ae~IinW~~~~~v~g~ 155 (612) ....+..++.+ |+++.....++..++.+|++|++|....... .....+|-+..++|++++-+ +... .++ T Consensus 88 ~~~~~~~~~i~~~-----N~~d~~~~~~~~~a~~~G~ay~~v~~~e~~~~~~~~~~~~~i~~~~p~~~~~i-~d~~-~~~ 160 (486) T protein:vir:42 88 DEADEELWQWWQA-----NNLDIEAPLGYTDAYVHGRSFITISKPDPQLDLGWDQNVPIIRVEPPTRMHAE-IDPR-INR 160 (486) T ss_pred chhHHHHHHHHHh-----cChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCeeEEEEecccceEEE-EeCC-CCC Confidence 12335555543 7899999999999999999999997643221 22345788999999998875 2222 222 Q ss_pred cceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccc Q lcl|NC_019408. 156 YVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKL 235 (612) Q Consensus 156 ~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~ 235 (612) .+.-|++.. .. ++ ....+..+.+. + .+| T Consensus 161 -~~~~~~~~~--~~-------~~----~~~~~~~~y~~-----------------~----~~~----------------- 188 (486) T protein:vir:42 161 -VSKAIRVAY--DK-------EG----NEIQAATLYTP-----------------M----ETI----------------- 188 (486) T ss_pred -eEEEEEEEE--ec-------CC----CeEEEEEEEcC-----------------C----cEE----------------- Confidence 222222211 00 00 00011111111 0 001 Q ss_pred eeEEEEEeeCCCceecceeeeccCCccccceeEEEee-cCCCCCCcCcC----chHHHHHHHHHHHhhhHHHHHHHHHhc Q lcl|NC_019408. 236 AYVQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFFG-ASGNTADVEKP----PLLDICDLNLSHYRTYAELEYGRLFTA 310 (612) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~-~~~~~~~~~~p----PLldLA~lnl~HY~~~sD~~~~l~~~~ 310 (612) .... 
.++.|......| ++++.||||.|- ....+...+.+ ++.+|.+ +.=+..|+...+..+.+ T Consensus 189 ----~~~~-~~~~~~~~~~~~----h~~g~vPvv~~~n~~~~~~~~G~s~i~~~v~~liD---a~~~~~s~~~~~~e~~a 256 (486) T protein:vir:42 189 ----GWFR-ADGEWAEWFNVP----HGLGVVPVVPLPNRTRLSDLYGTSEITPELRSMTD---AAARILMLMQATAELMG 256 (486) T ss_pred ----EEEe-cCCcEEeeccee----cCCCCceEEEeccccccCCCCCcccchhhHHHHHH---HHHHHHHHHHHHHHhhc Confidence 0111 112222222222 679999999663 22222222222 2333321 22245578888999999 Q ss_pred cceeeeecCCCCCC-----ce---EEEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHhhh--cccc Q lcl|NC_019408. 311 LPVYYAPGTDSEGT-----GE---YHIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRMMP--GASK 380 (612) Q Consensus 311 ~P~l~i~G~~~~~~-----~~---l~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~ll~--~~~~ 380 (612) +|++++.|.+.+.. .. +..+.+..|.+| +++++|.++++.+++.+.+.|+.+..++....--... .... T Consensus 257 ~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~q~~~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~ 335 (486) T protein:vir:42 257 VPQRLIFGIKPEEIGVDSETGQTLFDAYLARILAFE-DAEGKIQQFSAAELANFTNALDQIAKQVAAYTGLPPQYLSTAA 335 (486) T ss_pred chHHHhhcCCccccccccccccchhhhhhchhcccC-CCCceEEeecccCHHHHHHHHHHHHHHHhcccCCCHHHhcccc Confidence 99999999754321 11 223444556554 4578899999999988888888777777643211111 1111 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCC--CCcceEEEeeccccccCCCHHHHHHHHHHHH Q lcl|NC_019408. 381 SVSESNNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLA--DTENLRYEVNTDFLSTPIGAREMRAIQLMAN 458 (612) Q Consensus 381 ~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~--~~~~~~v~ln~dF~~~~~d~~~~~al~~~~~ 458 (612) ..+-||.+...............-..+..++.+++++++.+.|.... +..++.|++... 
.+.+ .++.++++.++++ T Consensus 336 ~n~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~~~~~~~d~~~i~v~w~~~-~~~s-~~~~ad~~~kl~~ 413 (486) T protein:vir:42 336 DNPASAEAIRAAESRLIKKVERKNLMFGGAWEEAMRIAYRIMKGGDVPPDMLRMETVWRDP-STPT-YAAKADAATKLYG 413 (486) T ss_pred CchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeeEEecCC-CCCC-HHHHHHHHHHHHh Confidence 11246777666666666666677777889999999999998764221 112344444332 2222 3566788888888 Q ss_pred c--CCCCHHHHHHHHHhcCccchhhhhHHHHHHhhccccccccc--hhHHhhhhhhHHHHhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 459 D--GLLPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFINNP--DAQARQRGYTNRGQELEQSRMAREADFTQQKIDI 534 (612) Q Consensus 459 ~--G~is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~--~~~~~~~~e~~r~~~~e~~r~~~e~e~~~q~~e~ 534 (612) + |.+|++|+++.+ |+.++. .++.+++.+|....... +.........+.+..... +...+...+ T Consensus 414 ~~~g~~s~et~~~~l---g~~~d~---~~e~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~ 480 (486) T protein:vir:42 414 NGQGVIPRERARIDM---GYSVKE---REEMRRWDEEEAAMGLGLLGTMVDADPTVPGSPSPTA-------PPKPQPAIE 480 (486) T ss_pred cccCCCCHHHHHhcC---CCChhH---HHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCCCCCCC-------CCCCCcccC Confidence 6 779999998765 655433 23333333221110000 000000000000000000 000000000 Q ss_pred HHHHHH Q lcl|NC_019408. 535 QERSVA 540 (612) Q Consensus 535 ~~r~~~ 540 (612) +..-+. T Consensus 481 ~~~~~~ 486 (486) T protein:vir:42 481 SSGGDA 486 (486) T ss_pred CCCCCC Confidence 000000 No 62 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=99.85 E-value=4.8e-21 Score=131.98 Aligned_cols=441 Identities=11% Similarity=-0.008 Sum_probs=225.1 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeecC Q lcl|NC_019408. 
1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKNL 80 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~~ 80 (612) |+.+ +....+++.++.+-|.|...++. |+.. -...+++++ .-.|+.+-+|+.+++++|-...++.+. T Consensus 20 l~~~--~~~~~~rl~~l~~Yy~G~~~i~~-----~~~~---~~~~~~~~~---~~~n~~~~ivd~~~~~l~~~g~~~~~~ 86 (484) T protein:vir:77 20 MLNL--FTERTQDLGDNTAYYESERRPDA-----VGVT---VPQQMQKLL---AHVGYPRLYIDAIAARQELEGFRLGGA 86 (484) T ss_pred HHHH--HHHHHHHHHHHHHHHhccccchh-----cccc---cchhHHhhh---hhcCcHHHHHHHHHhhhccCceecCCc Confidence 2221 23446778888888999766543 3322 223333332 345999999999999988666555221 Q ss_pred ---CHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchh--hhhccCceEEEechhhhhcchhhhccCCc Q lcl|NC_019408. 81 ---PPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPR--KGAVATSFAVGYSAENILDWDEVVDMGGF 155 (612) Q Consensus 81 ---p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~--~~~~~rPy~~~~~ae~IinW~~~~~v~g~ 155 (612) ...+..++.+ |+++.....++..++.+|++|++|-...... .....+|-|+.++|++++-. +... .+ T Consensus 87 ~~~~~~l~~i~~~-----N~~d~~~~~~~~~a~~~G~a~~~v~~~~~~~~~~~~~~~~~i~~~~p~~~~~~-~D~~-~~- 158 (484) T protein:vir:77 87 DKADEQLWDWWQA-----NDLDIESTLGHTDSLVHGRSYITISKPDPNIDPGVDPEVPIIRVEPPTNLYAQ-IDPR-TR- 158 (484) T ss_pred chhHHHHHHHHHh-----cCHhHHHHHHHHHHhhcCceEEEEecCCCCcccccccccceEEEeccceeEEE-ecCC-CC- Confidence 2335555543 7889999999999999999999996543322 22345688999999998753 1111 11 Q ss_pred cceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccc Q lcl|NC_019408. 156 YVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKL 235 (612) Q Consensus 156 ~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~ 235 (612) ..+.-|++.. .+ .+ + +....++|- .+ . 
T Consensus 159 ~~~~a~~~~~--~~------~~-----------------~---------------~~~~~~~y~----------~~---~ 185 (484) T protein:vir:77 159 QVMRAIRAIE--DE------EG-----------------N---------------EVIGATLYL----------PN---N 185 (484) T ss_pred ceEEEEEEEE--ee------cC-----------------C---------------cEEEEEEEe----------cC---e Confidence 1222222211 00 00 0 000011110 00 0 Q ss_pred eeEEEEEeeCCCceecceeeeccCCccccceeEEEee-cCCCCCCcCcC----chHHHHHHHHHHHhhhHHHHHHHHHhc Q lcl|NC_019408. 236 AYVQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFFG-ASGNTADVEKP----PLLDICDLNLSHYRTYAELEYGRLFTA 310 (612) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~-~~~~~~~~~~p----PLldLA~lnl~HY~~~sD~~~~l~~~~ 310 (612) ++ . +..+.+.|...+..| ++++.||||.|- ....+...+.+ ++.+|.+ ..=...|+...++++.+ T Consensus 186 ~~--~-~~~~~~~~~~~~~~~----~~~g~vPvv~f~N~~~~~~~~G~s~i~~~v~~L~D---a~~~~~s~~~~~~~~~a 255 (484) T protein:vir:77 186 TV--I-WNREDGQWVQVANVA----HNLEMVPVIPIPNRTRLSDLYGTTEITPELRSVTD---AAARTLMLMQATAELMG 255 (484) T ss_pred EE--E-EEecCCceEeecccc----CCCCCcceEEeccccccCccCCcccchHHHHHHHH---HHHHHHHHHHHHHHhhh Confidence 00 0 111222232222222 679999999663 22222112222 3334422 11245578888999999 Q ss_pred cceeeeecCCCCCC--c------eEEEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHhhh--cccc Q lcl|NC_019408. 311 LPVYYAPGTDSEGT--G------EYHIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRMMP--GASK 380 (612) Q Consensus 311 ~P~l~i~G~~~~~~--~------~l~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~ll~--~~~~ 380 (612) +|++++.|.+...- + .+..+.+..|.+| +.+++|.+++..+++.+.+.|..+..++....--... .... 
T Consensus 256 ~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~q~~~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~ 334 (484) T protein:vir:77 256 VPQRLLFGVKGEELGVDPETGQTLFDAYLARILAFE-DHESKAQQFSAAELRNFVDALDALDRKAAAYTGLPPYYLSFSS 334 (484) T ss_pred hhHHHHhCCCcchhcccccccchhhhhhhhhhcccC-CCCceeEeecCCChHHHHHHHHHHHHHHhcccCCCHHHhcccc Confidence 99999999754321 1 1334455566665 4568899999999888888888777777543211011 1011 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCC--cceEEEeeccccccCCCHHHHHHHHHHHH Q lcl|NC_019408. 381 SVSESNNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLADT--ENLRYEVNTDFLSTPIGAREMRAIQLMAN 458 (612) Q Consensus 381 ~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~--~~~~v~ln~dF~~~~~d~~~~~al~~~~~ 458 (612) ..+-||.+...............-..+..++.+++++++...|...... ..+.|.+. +-.+.. .++.++++.++.+ T Consensus 335 ~n~~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~l~~~~~~~~~~~~~~~~i~v~w~-~~~~~s-~~~~ad~~~kl~~ 412 (484) T protein:vir:77 335 ENPASAEAIRSSESRLVKTVERKNKIFGGAWEQAMRVAYKVMNGGDIPPEYYRMESIWR-DPSTPT-YAAKADAATKLYN 412 (484) T ss_pred CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccccccceEEec-CCCCCC-HHHHHHHHHHHHh Confidence 1113666655555554444555566678888889999988876422111 23444442 222333 3667888999998 Q ss_pred cC--CCCHHHHHHHHHhcCccchhhhhHHHHHHhhccccccccchhHHhhhhhhHHHHhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 459 DG--LLPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFINNPDAQARQRGYTNRGQELEQSRMAREADFTQQKIDIQE 536 (612) Q Consensus 459 ~G--~is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~e~~r~~~e~e~~~q~~e~~~ 536 (612) +| .+|++|+++.| |+.++.+ ++.+++.++.................+...... ..+.+ +.+..-+.. T Consensus 413 ~g~gi~s~et~~~~l---~~~~~~~---~e~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~----~~~~~-~~~~~~~~~ 481 (484) T protein:vir:77 413 NGQGVIPKERARIDM---GYSITER---EEMRKWDEEEQAQGLGLMGTMFGTDPSGGGNPD----NPETP-EPQPNPAEE 481 (484) T ss_pred ccCCCCCHHHHHhcC---CCChhHH---HHHHHHHHHHHHHHHHHHhhhccccccCCCCCC----CCCcc-cccCCCccc Confidence 76 89999998876 6654432 222232222110000000000000000000000 00000 000000000 Q ss_pred HHH Q lcl|NC_019408. 
537 RSV 539 (612) Q Consensus 537 r~~ 539 (612) +.+ T Consensus 482 ~~~ 484 (484) T protein:vir:77 482 AAA 484 (484) T ss_pred cCC Confidence 000 No 63 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=99.84 E-value=3.8e-20 Score=127.05 Aligned_cols=395 Identities=10% Similarity=0.022 Sum_probs=234.9 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeecC Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKNL 80 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~~ 80 (612) |+ ..+....+++.++.+-|.|...++ |||. .-.+.|+.+. ++ ..|+++-+|++++++++-.-.+.. T Consensus 9 L~--~~~~~~~~r~~~~~~yy~g~~~~~-----~~~~---~~p~~~~~~~-~~-v~nw~~~~Vd~~a~rl~~~Gf~~~-- 74 (422) T protein:vir:97 9 LR--RKLALFKTGVDKRYRYYAMDDRDD-----TRSI---VMPNNVREMY-RS-VLEWTAKGVDSLADRIIFREFTND-- 74 (422) T ss_pred HH--HHHHHHHHHHHHHHHHHhcCCChh-----hcCc---cccHHHHHHH-Hh-hcchhHHHHHHHHhccccceeeCC-- Confidence 43 566777888888899999876543 3433 3345565543 33 459999999999998877654432 Q ss_pred CHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhcchhhhccCCccceeE Q lcl|NC_019408. 81 PPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILDWDEVVDMGGFYVPSR 160 (612) Q Consensus 81 p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW~~~~~v~g~~~Lt~ 160 (612) ...+..++.. |+++.....+...+|.||+|+++| ++..+ ..+|.+..++|++++.. ++.. 
.++-...+ T Consensus 75 d~~l~~~w~~-----N~ld~~~~~~~~~al~~G~sf~~v-~~~~~----~~~p~i~~~sp~~~~~i-~D~~-~~~~~~a~ 142 (422) T protein:vir:97 75 DFNAWEIFKA-----NNPDIFFDTAIQSALIASCCFVYI-MPGAE----DGLPKMQVIEASKATGI-LDPT-TFLLTEGY 142 (422) T ss_pred chhHHHHHHh-----cChHHHHHHHHHHHHHhcceeEEE-eeCCC----CCeeEEEEechhhEEEE-EeCC-CCcceeeE Confidence 2455555543 889999999999999999999999 33211 14688999999998875 2221 22222222 Q ss_pred EEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccceeEEE Q lcl|NC_019408. 161 VLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAYVQY 240 (612) Q Consensus 161 v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~~ 240 (612) .++ +. +.++- ....+.+.+. ++|. T Consensus 143 ~~~-~~--------~~~~~------~~~~~~~~~~--------------------~~~~--------------------- 166 (422) T protein:vir:97 143 AIL-ES--------DSNGN------PTLEAYFTDK--------------------DIWY--------------------- 166 (422) T ss_pred EEE-Ee--------cCCCc------EEEEEEEcCc--------------------eEEE--------------------- Confidence 221 10 00000 0000000000 1111 Q ss_pred EEeeCCCceecceeeeccCCccccceeEEEeecC-CCCCCcCcC----chHHHHHHHHHHHhhhHHHHHHHHHhccceee Q lcl|NC_019408. 241 LYEEDPESRPIARIVPTVRGEPLDFIPFKFFGAS-GNTADVEKP----PLLDICDLNLSHYRTYAELEYGRLFTALPVYY 315 (612) Q Consensus 241 ~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~-~~~~~~~~p----PLldLA~lnl~HY~~~sD~~~~l~~~~~P~l~ 315 (612) +..++..+ .-.+++++||+|+|... +.+-..|.+ |+.+|.+ +.-+..++...+.++.++|+.+ T Consensus 167 -~~~~~~~~--------~~~~~~g~vPvv~~~n~~~~~~~~G~s~I~e~v~~l~d---a~~r~~~~~~~~~e~~a~pqr~ 234 (422) T protein:vir:97 167 -YPKKGKPY--------NIKNPTGHPLLVPIIHRPDAVRPFGRSRITKAGMYHQK---AAKRTLERAEVTAEFYSFPQKY 234 (422) T ss_pred -EcCCCccc--------cccCCCCCcceEEecccCCCccccCccccchhHHHHHH---HHHHHHHHHHHHHHHhcchhhh Confidence 01111111 11367899999976432 222222333 4444433 3336677888899999999999 Q ss_pred eecCCCCCC--ceEEEeccccccCCCCC---ceeEEecCchhHHHHHHHHHHHHHHHHHHHHHhhhccccch--hHHHHH Q lcl|NC_019408. 
316 APGTDSEGT--GEYHIGPNMVWEVPQGS---EPGILEYTGQGLKALETALNDKERQIAAIGGRMMPGASKSV--SESNNQ 388 (612) Q Consensus 316 i~G~~~~~~--~~l~iG~~~~~~lp~~~---~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~ll~~~~~~~--~esa~~ 388 (612) +.|++.... +.+....+..|.+|++. .+++-++++.+++-+.+.|+.+..++.....-....-++.. +.||.+ T Consensus 235 i~G~d~d~~~~~~~~~~~~~i~~~~~de~~~~~~v~q~~~~~l~~~~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~A 314 (422) T protein:vir:97 235 VLGMDPDAKPMEKWRATVSTLLEISKDEDGDKPTVGQFTTASMAPFMEHLKMYASLFAGGSGLTLDDLGFPSDNPSSVES 314 (422) T ss_pred hcccCcccccCchhhhhhhhhhccCCCCCCCcceeeecCCCChhHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHH Confidence 999976432 22445556888888643 47888999999998888888888888765322222211111 246777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCC---cceEEEeeccccccCC-CHHHHHHHHHHHHc--CCC Q lcl|NC_019408. 389 TVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLADT---ENLRYEVNTDFLSTPI-GAREMRAIQLMAND--GLL 462 (612) Q Consensus 389 ~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~---~~~~v~ln~dF~~~~~-d~~~~~al~~~~~~--G~i 462 (612) ...............-..+..++.+++++++...|...... .++.+...+-+..... .++.++++.+++++ |.+ T Consensus 315 i~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~~~~~~~~~~~~~~~w~p~~~~~~~s~a~~aDa~~Kl~~a~~~~~ 394 (422) T protein:vir:97 315 IKAAHENLRAAGRKAQRSFSSGFLNVAYIAVCLRDEFPYLRNQFMDTVIKWEPLFEADANMLTLVGDGAIKLNQAIPGFM 394 (422) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccchhhccceEEEccCCCCChHHHHHHHHHHHHHHhhccccc Confidence 66666655555666667778888889999888876422111 1244444433322221 24557888899888 788 Q ss_pred CHHHHHHHHHhcCccchhhhhHHHHHHhhcc-ccc Q lcl|NC_019408. 463 PDPVFYEYMRKAEVISSDMTFEEFQALRADE-NSF 496 (612) Q Consensus 463 s~et~~~~lqr~~vl~~~~~~eee~~ria~e-~~~ 496 (612) +.++.++.| |+-++ ..+..++.++ ... 
T Consensus 395 ~~~~~~~~l---g~~~~----~~~~~~~~~~~~d~ 422 (422) T protein:vir:97 395 DADVIRDLT---GVKGA----DKPIPAITEVTTDG 422 (422) T ss_pred cHHHHHHHc---CCCch----hHHHHHHHhhhccC Confidence 999888877 66222 2223333222 111 No 64 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=99.84 E-value=1.3e-20 Score=129.63 Aligned_cols=383 Identities=9% Similarity=0.023 Sum_probs=223.8 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeecC Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKNL 80 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~~ 80 (612) |+ -.+....+++..+.+.|.|...++ ||+. .-.+.++.+.+ . ..|+.+-+|++++++++-.-.+ .. T Consensus 9 L~--~~~~~~~~r~~~~~~yY~g~~~~~-----~~~~---~~p~~~~~~~~-~-v~nw~~~iVds~a~rl~~~Gf~--~~ 74 (409) T protein:vir:94 9 LR--FKLSVHKRRAEMRYDQYAMKYVDR-----FKGI---TIPQALSQQYR-S-ILGWCAKGVDSLADRLVFREFE--ND 74 (409) T ss_pred HH--HHHHHHhHHHHHHHHHhcccCchh-----hcCh---hhhHHHHHHHh-h-hcchhHHHHHHhHhhcccCccc--CC Confidence 32 235566778888889999976554 3422 22223333332 2 4599999999999988765433 23 Q ss_pred CHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhcchhhhccCCccceeE Q lcl|NC_019408. 81 PPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILDWDEVVDMGGFYVPSR 160 (612) Q Consensus 81 p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW~~~~~v~g~~~Lt~ 160 (612) +..+..++.. |+++.+...+...+|.||+|+++|- +.. ..+|-|+.++|.+++-. ++. 
+.+ ..+.- T Consensus 75 d~~l~~i~~~-----N~ld~~~~~~~~~aliyG~sf~~v~-~~~-----dg~~~i~~~sp~~~~~i-~D~-~~~-~~~~a 140 (409) T protein:vir:94 75 DFTVNEIFEE-----NNPDIFFDSAVLSSLIASCSFTYIS-KGE-----NDAVRLQVIEAVNATGI-IDP-ITG-LLTEG 140 (409) T ss_pred chHHHHHHHh-----cChhHHHHHHHHHHHHhcceeEEEe-cCC-----CCceEEEEeccceEEEE-Eec-CCC-ceeee Confidence 4567777653 8999999999999999999999994 432 24789999999987764 221 111 12222 Q ss_pred EEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccceeEEE Q lcl|NC_019408. 161 VLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAYVQY 240 (612) Q Consensus 161 v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~~ 240 (612) +++... +.++ +....++|.. + . ++. T Consensus 141 ~~~~~~--------d~~~--------------------------------~~~~~~~~~~---------~-~-----~~~ 165 (409) T protein:vir:94 141 YAVLER--------DENN--------------------------------NVVLEAHFLP---------D-R-----TDY 165 (409) T ss_pred EEEEEe--------cCCC--------------------------------ceEEEEEEec---------C-c-----EEE Confidence 222110 0000 0000011100 0 0 001 Q ss_pred EEeeCCCceecceeeeccCCccccceeEEEeec-CCCCCCcCcC----chHHHHHHHHHHHhhhHHHHHHHHHhccceee Q lcl|NC_019408. 241 LYEEDPESRPIARIVPTVRGEPLDFIPFKFFGA-SGNTADVEKP----PLLDICDLNLSHYRTYAELEYGRLFTALPVYY 315 (612) Q Consensus 241 ~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~-~~~~~~~~~p----PLldLA~lnl~HY~~~sD~~~~l~~~~~P~l~ 315 (612) ++. +.+.|. .. .+++++||+|.|.. .+.+...|.+ |+.+|.+ +.-+..++...+.++.++|..| T Consensus 166 ~~~-~~~~~~---~~----~n~~g~vPvV~f~n~~~~~~~~G~s~I~e~v~~l~d---a~~r~~~~~~~~~e~~a~pqr~ 234 (409) T protein:vir:94 166 YYR-DSRNNI---SI----ANPTGHPLLVPIIHRPDAVRPFGRSRITRSGMYWQS---NAKRTLERADVTAEFYSFPQKY 234 (409) T ss_pred EEe-cCceeE---ee----eCCCCCcceEEeccccccccccCccccchhHHHHHH---HHHHHHHHHHHHHHHhcChhhe Confidence 111 112221 11 26799999997642 2222222222 4555433 2335667788999999999999 Q ss_pred eecCCCCC--CceEEEeccccccCCCC---CceeEEecCchhHHHHHHHHHHHHHHHHHHHHHhhhcccc-ch-hHHHHH Q lcl|NC_019408. 
316 APGTDSEG--TGEYHIGPNMVWEVPQG---SEPGILEYTGQGLKALETALNDKERQIAAIGGRMMPGASK-SV-SESNNQ 388 (612) Q Consensus 316 i~G~~~~~--~~~l~iG~~~~~~lp~~---~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~ll~~~~~-~~-~esa~~ 388 (612) +.|++... .+.+..+.+..|.+|++ ..+++-|+++..++-+.+.|+.+..++.+...-....-+. .. +.||.+ T Consensus 235 i~G~d~d~~~~~~~~~~~~~i~~~~~d~dg~~~~v~q~~~~~l~~~~~~l~~~~~~~a~~t~lP~~~lg~~~~NpsSa~A 314 (409) T protein:vir:94 235 VTGLSDDAEPMETWKATVSSMLQFTKDEDGDKPTLGQFTQPSMSPFTEQLRTAAAGFAGETGLTLDDLGFVSDNPSSVEA 314 (409) T ss_pred eEecCCCCcccchhhhhHHHhhcCCCCCCCCCceEEecCCCChhHHHHHHHHHHHHHhhhcCCCHHHhccccCchhHHHH Confidence 99997543 22355677789999864 4588889999999988888888888887653211211111 11 245555 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCC---cceEEEeeccccccCC-CHHHHHHHHHHHHcC--CC Q lcl|NC_019408. 389 TVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLADT---ENLRYEVNTDFLSTPI-GAREMRAIQLMANDG--LL 462 (612) Q Consensus 389 ~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~---~~~~v~ln~dF~~~~~-d~~~~~al~~~~~~G--~i 462 (612) ...............-..+..++.++++++....|-..... ..+.+...+-+.+... -++.++++.+++++| .. T Consensus 315 l~a~~~~L~~~a~~k~~~fg~~~~~~~rla~~i~~~~~~~~~~~~~~~v~W~p~~~~~~~~~a~~aDa~~Kl~~ag~~~~ 394 (409) T protein:vir:94 315 IKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDDAPYLREQFRKTKPKWEPLFEADASMLSLIGDGAIKLNQAIPEFI 394 (409) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccccccceEEeccCCCcchHHHHHHHHHHHHHHHhccccc Confidence 55443333333344444566778888888777765321111 2344444433322221 145678899999999 55 Q ss_pred CHHHHHHHHHhcCccchh Q lcl|NC_019408. 
463 PDPVFYEYMRKAEVISSD 480 (612) Q Consensus 463 s~et~~~~lqr~~vl~~~ 480 (612) +.+++++.| |+=+++ T Consensus 395 ~~~~~~~~l---G~~~~d 409 (409) T protein:vir:94 395 NKDTIRDLT---GIEGGE 409 (409) T ss_pred chhHHHHHc---CCCCCC Confidence 677776665 775544 No 65 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=99.80 E-value=2e-18 Score=117.59 Aligned_cols=496 Identities=9% Similarity=0.005 Sum_probs=242.2 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeecC Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKNL 80 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~~ 80 (612) .|..-......++...+.+-|.|...+..+...+.-.........++. ..=.-.|+.+.+|+..+|++|.+||++..- T Consensus 20 ~i~~~~~~~~~~~~~~~~~YY~g~h~Il~r~~~~~~~~~~~~~d~~~~--nnki~~nf~k~Ivd~~~~yl~G~Pv~~~~~ 97 (537) T protein:vir:78 20 EITTYMASNHIKWAHIGENYYNQENDIEKSRIFYMNDKGQLREDNYAS--NVKISHGFFTELVDQLAQYLLSNGVEVKVK 97 (537) T ss_pred HHHHHHHHHHHHHHHHHHHHhcccchhhhccccccccccccccccccc--ccccccchHHHHHHHHhhhhcccCceeecC Confidence 222222345566777788888887766543322211111111111110 011447999999999999999999999421 Q ss_pred ---CHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhc-chhhhccCCcc Q lcl|NC_019408. 81 ---PPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILD-WDEVVDMGGFY 156 (612) Q Consensus 81 ---p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~Iin-W~~~~~v~g~~ 156 (612) ...+..++.+.. +++++.....++..++.+|++++++-.... ..+.+..++|++++= |+. ... 
T Consensus 98 d~~~~e~~~~l~~~~--~~~~~~~~~el~~~~s~~G~ay~~~y~de~------~~~~~~~i~p~~~~pv~d~-----~~~ 164 (537) T protein:vir:78 98 DEDNTQLDEILQEYF--DEDFQATIDTLVTNASKKGFEGIFARTTSE------GKLKFQTVDGLTLIPVFDD-----YGV 164 (537) T ss_pred cchhHHHHHHHHHHh--hccHHHHHHHHHHHHhhcCeeEEEeeecCC------CceEEEEEccceeEEEEcC-----CCC Confidence 233444455442 367788888999999999999999866543 367788889988654 421 112 Q ss_pred ceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeecccccc--cccccc Q lcl|NC_019408. 157 VPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEW--PSGEVK 234 (612) Q Consensus 157 ~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~--~~g~~~ 234 (612) ....+++......... ..+.- ...+..+...++ +..|+......... ....+. T Consensus 165 ~~~~~~~y~~~~~~~~--~~~~~----~~~~~evyt~~~-------------------i~~y~~~~~~~~~~~~~~~~~~ 219 (537) T protein:vir:78 165 LKMIIRWYSEIRYSTK--QQSTE----TIWHADVWNEEA-------------------VCYYIQDDEGVSTTYKLDEAYN 219 (537) T ss_pred ceeEEEEEeeeecccc--ccCcc----eEEEEEEEcCCc-------------------EEEEEecCCccccccccccccc Confidence 2333343332211100 00000 001111111111 11111100000000 000000 Q ss_pred ceeEEEEEeeC--CCceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhccc Q lcl|NC_019408. 235 LAYVQYLYEED--PESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALP 312 (612) Q Consensus 235 ~~~~~~~~~~~--~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P 312 (612) ......++... ...............++|+.||||.|... ... 
.+-|.++-.|-=+.=...|++.+.+-..+.| T Consensus 220 ~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~f~nn--~~~--~sd~e~v~~LiDayd~~~S~~an~~~~~~~~ 295 (537) T protein:vir:78 220 PNPAPHVLAIEESTDADFEDTDGYQVLGRSYSKFPFQLLYNN--KDG--MSDVKRVKSIIDDYDVMNCFLSNNLQDFSEA 295 (537) T ss_pred ccccceeeeccccccccccccccccccccCCcceeEEEeccC--ccC--CCchhhhHHHHHHHHHHHHhhhhHHHHhcCc Confidence 00000000000 00000011112233468999999987543 222 2223333333222223558888888999999 Q ss_pred eeeeecCCCCCCceE--EEeccccccCC-CCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHhhhccccchhHHHHHH Q lcl|NC_019408. 313 VYYAPGTDSEGTGEY--HIGPNMVWEVP-QGSEPGILEYTGQGLKALETALNDKERQIAAIGGRMMPGASKSVSESNNQT 389 (612) Q Consensus 313 ~l~i~G~~~~~~~~l--~iG~~~~~~lp-~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~ll~~~~~~~~esa~~~ 389 (612) +++++|.+.+..+.+ .+....++.++ .|++++|+..+... ++....++.+++.|...+--........++-|+++. T Consensus 296 ilvi~g~~~~~~~~~~~~l~~~~~i~v~~d~~~v~~l~~~~~~-~~~e~~ld~L~~~I~~~s~~~~~~~~~~gn~SGvAl 374 (537) T protein:vir:78 296 IYVVKGFSGDSTDKLRQNIKAKKMIGVNGDNAGMEIQTVSIPY-EARKAKMDIDVENIYRSGMGFNSTAVGDGNVTNVVI 374 (537) T ss_pred eeeeecCCCccchhHHHHHhhcCceeecCCCCceeEEEecCCH-HHHHHHHHHHHHHHHHhcCCCCCccccccCCcHHHH Confidence 999999754432322 12233455565 57899999987644 667888999999998875322222334556677777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCC---CCcceEEEeeccccccCCCHHHHHHHHHHHHcCCCCHHH Q lcl|NC_019408. 390 VLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLA---DTENLRYEVNTDFLSTPIGAREMRAIQLMANDGLLPDPV 466 (612) Q Consensus 390 ~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~---~~~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G~is~et 466 (612) ...-......-...-..+..++.+.+++++.+++.... +..++.|.+++.. 
+.+ ..+.++.+..+++.|.||++| T Consensus 375 k~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~~~~~~~~~~d~~~i~i~f~~~~-P~n-~~e~a~~~~~l~~~giiS~eT 452 (537) T protein:vir:78 375 KSRYTLLAMKARKMETSLRKVLRWCADMVVSDIALRGLGEYDSNDICFEIEPHV-LAN-ELDIATTRKTEAETEALKIGN 452 (537) T ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccceeeEEeccCC-CCC-HHHHHHHHHHHHhcCcchHHH Confidence 77766666555666667788888888899888765321 2335566655432 222 244567777888999999999 Q ss_pred HHHHHHhcCccchhhhhHHHHHHhhccccccccchhHHhhhhhhHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 467 FYEYMRKAEVISSDMTFEEFQALRADENSFINNPDAQARQRGYTNRGQELEQSRMAREADFTQQKIDIQERSVAVQEGHA 546 (612) Q Consensus 467 ~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~e~~r~~~e~e~~~q~~e~~~r~~~~~~~r~ 546 (612) ++..+ ..++ .. ++.++ . .| +.+++..+..+. T Consensus 453 ~l~~~---p~vd----d~-e~ek~------------------------------~-~e-e~~~~~~~~~~~--------- 483 (537) T protein:vir:78 453 IMTVA---PRIG----DD-ETLKL------------------------------I-AE-ELDLDYNELKDA--------- 483 (537) T ss_pred HHHhC---CCCC----CH-HHHHH------------------------------H-HH-HHHhhhhhhhhh--------- Confidence 98664 2222 10 00010 0 00 000000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCchhhcCCCCCcccCCC Q lcl|NC_019408. 547 EVAHAAGSTSISGSRKLGDPEQAKPAVADQATIDNAKKQTANAAKVAAQPPAPAAPGAPPTNRRPT 612 (612) Q Consensus 547 ~~e~~~~~~~~~~~r~~~~e~q~k~~~~eq~~~~~~~k~~~~~a~~~~~~~~~~~~~~~~~~~~~~ 612 (612) .+++ ++++. .. .. 
+.+.--.-......++|--+..+..-|.+-.|| T Consensus 484 -~~~~------~~~~~----~~-----~~----~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~ 529 (537) T protein:vir:78 484 -LAEQ------DAQSL----DV-----SP----DVQAMLDGLPVNANQPPVDPNQPVADPNVVPPT 529 (537) T ss_pred -hhhh------ccccc----Cc-----Cc----chhhhcCCCCCCCCCCCCCccCCCCCCCCCCCC Confidence 0000 00000 00 00 000000000111112222222333335555555 No 66 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=99.80 E-value=1.5e-18 Score=118.27 Aligned_cols=382 Identities=9% Similarity=0.026 Sum_probs=222.5 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeecC Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKNL 80 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~~ 80 (612) | -..+....+++..+.+.|.|...++ ||+. .-...++.+++ + ..|+.+-+|++++++++-.--+ .. T Consensus 9 L--~~~~~~~~~r~~~~~~yY~g~~~~~-----~~~~---~~p~~~~~~~~-~-v~nw~~~iVds~a~rl~~~Gf~--~~ 74 (409) T protein:vir:16 9 L--RFKLSVHKRRAEMRYEQYAMKHVDR-----FKGI---TIPQALSQQYR-S-ILGWCAKGVDSLADRLVFREFE--ND 74 (409) T ss_pred H--HHHHHHHhHHHHHHHHHHhccCchh-----hcch---hhhHHHHHHHh-h-hcChhHHHHHHhHhhccccccc--Cc Confidence 3 2345667788888889999976554 3432 22334444333 3 4599999999999988665433 23 Q ss_pred CHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhcchhhhccCCccceeE Q lcl|NC_019408. 81 PPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILDWDEVVDMGGFYVPSR 160 (612) Q Consensus 81 p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW~~~~~v~g~~~Lt~ 160 (612) +..+..++.. |+++.....+...+|.||+|+++|- |.. ..+|-|..++|.+++-. ++. +.++ .... 
T Consensus 75 d~~l~~i~~~-----N~ld~~~~~~~~~al~yG~sf~~v~-~~~-----dg~~~i~~~sP~~~~~i-~D~-~~~~-~~~a 140 (409) T protein:vir:16 75 DFTVNEIFEE-----NNPDIFFDSTVLSALIASCSFTYIS-KGE-----NDAVRLQVIEATNATGI-IDP-ITGL-LTEG 140 (409) T ss_pred chHHHHHHHh-----cChhHHHHHHHHHHHHhCceeEEEe-cCC-----CCceEEEEEcccceEEE-eec-cccc-ceee Confidence 4567777653 9999999999999999999999995 322 24689999999887653 111 1111 1111 Q ss_pred EEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccceeEEE Q lcl|NC_019408. 161 VLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAYVQY 240 (612) Q Consensus 161 v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~~ 240 (612) +++.. ++.++ +....++|- .+ . ++. T Consensus 141 ~~~~~--------~d~~~--------------------------------~~~~~~~~~---~~-------~-----~~~ 165 (409) T protein:vir:16 141 YAVLE--------RDENN--------------------------------NVVLEAHFL---PD-------R-----TDY 165 (409) T ss_pred eEEEE--------ecCCC--------------------------------ceEEEEEEe---cC-------c-----EEE Confidence 11110 00000 000001110 00 0 001 Q ss_pred EEeeCCCceecceeeeccCCccccceeEEEeec-CCCCCCcCc----CchHHHHHHHHHHHhhhHHHHHHHHHhccceee Q lcl|NC_019408. 241 LYEEDPESRPIARIVPTVRGEPLDFIPFKFFGA-SGNTADVEK----PPLLDICDLNLSHYRTYAELEYGRLFTALPVYY 315 (612) Q Consensus 241 ~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~-~~~~~~~~~----pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l~ 315 (612) .+ .+++.|. .. -++++.||+|.|.. .+.+...|. .|+.+|.+ +.-+..++...+.++.++|..+ T Consensus 166 ~~-~~~~~~~---~~----~~~~g~vPvV~f~n~~~~~~~~G~seI~~~v~~l~d---a~~r~~~~~~~~~e~~a~pqr~ 234 (409) T protein:vir:16 166 YY-RDSRNNI---SI----ANPTGNPLLVPIIHRPDAVRPFGRSRITRSGMYWQS---NAKRTLERADVTAEFYSFPQKY 234 (409) T ss_pred EE-ecCcccc---ce----ecCCCCcceEEecccccccccCCccccchhHHHHHH---HHHHHHHHHHHHHHHhcChhhe Confidence 11 1112221 11 26799999997642 222222233 35555533 2236667778899999999999 Q ss_pred eecCCCCCC--ceEEEeccccccCCCC---CceeEEecCchhHHHHHHHHHHHHHHHHHHHHHhhhccccc-hh-HHHHH Q lcl|NC_019408. 
316 APGTDSEGT--GEYHIGPNMVWEVPQG---SEPGILEYTGQGLKALETALNDKERQIAAIGGRMMPGASKS-VS-ESNNQ 388 (612) Q Consensus 316 i~G~~~~~~--~~l~iG~~~~~~lp~~---~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~ll~~~~~~-~~-esa~~ 388 (612) +.|++.... +.+..+.+..|.+|++ ..+++-|+++..++-+.+.|+.+..++.+...-....-+.. .| .||.+ T Consensus 235 i~G~d~d~~~~~~~~~~~~~i~~~~~d~~g~~~~v~q~~~~~l~~~~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~A 314 (409) T protein:vir:16 235 VTGLSDDAEPMETWKATVSSMLQFTKDEDGDKPTLGQFTQPSMSPFTEQLRTAAAGFAGETGLTLDDLGFVSDNPSSVEA 314 (409) T ss_pred eEecCCCCCccchhhhhhhHhhccCCCCCCCCceEEecCCCChhHHHHHHHHHHHHHhhhcCCCHHHcccccCchhHHHH Confidence 999975421 2355667789999854 45788899999999888999998888876532122211111 11 45555 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCC---cceEEEeeccc--cccCCCHHHHHHHHHHHHcCC-C Q lcl|NC_019408. 389 TVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLADT---ENLRYEVNTDF--LSTPIGAREMRAIQLMANDGL-L 462 (612) Q Consensus 389 ~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~---~~~~v~ln~dF--~~~~~d~~~~~al~~~~~~G~-i 462 (612) ...............-..+..++.++++++....|...... ..+.+..-+-+ +...+ ++..+++.+++++|. + T Consensus 315 i~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~~~~~~~~~~~~~v~W~~~~~~~~~s~-a~~aDa~~Kl~~a~~~~ 393 (409) T protein:vir:16 315 IKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDDVPYLREQFSKTKPKWEPLFEADASML-SLIGDGAIKLNQAIPEF 393 (409) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccchhhccceEEecCCCCcchhhH-HHHHHHHHHHHhhcccc Confidence 55443333333344444467777788888877766421111 13344443222 22223 667899999999984 3 Q ss_pred -CHHHHHHHHHhcCccchh Q lcl|NC_019408. 
463 -PDPVFYEYMRKAEVISSD 480 (612) Q Consensus 463 -s~et~~~~lqr~~vl~~~ 480 (612) ..++.++.| |+-+++ T Consensus 394 ~~~~v~~~~~---g~~~~d 409 (409) T protein:vir:16 394 INKDTIRDLT---GIKGAE 409 (409) T ss_pred cchhHHHHhc---cCCCCC Confidence 345555554 764444 No 67 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=99.77 E-value=3.2e-16 Score=105.52 Aligned_cols=584 Identities=12% Similarity=0.004 Sum_probs=188.3 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChH--HHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhc----CC Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQR--EIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFR----RD 74 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~--~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~----k~ 74 (612) |-.|-...-....++....-+.|.. ..+.+=.-|+=...+.. ..=...++.|.+..+|+.+.+.+.+ .+ T Consensus 10 ~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~y~g~~~~~~-----~~~~s~~~~~~v~~~v~~~~~~l~~~~~~~~ 84 (705) T protein:vir:88 10 MDDEQVLRHLDQLVNDALDFNSSELSKQRSEALKYYFGEPFGNE-----RPGKSGIVSRDVQETVDWIMPSLMKVFTSGG 84 (705) T ss_pred CCHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHhCCCCCcc-----cCCCCccccHHHHHHHHHHHHHHHHhhcCCC Confidence 3333322222333333333333332 11222223442211111 1114677888899888888886543 33 Q ss_pred ceeecCC-----HH----HHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcc--------------------- Q lcl|NC_019408. 75 PIVKNLP-----PK----FKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDN--------------------- 124 (612) Q Consensus 75 p~~~~~p-----~~----l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a--------------------- 124 (612) +.+.-.| .. +..++.-+=...+....++..+|+.+|.+|.+.|=|-+-.. 
T Consensus 85 ~~~~~~p~~~~D~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~dal~~g~gi~kv~we~~~~~~~e~~~~~~~~~l~~~~~ 164 (705) T protein:vir:88 85 QVVKYEPDTAEDVEQAEQETEYVNYLFMRKNEGFKVMFDWFQDTLMMKTGVVKVYVEEVLKPTFERFSGLSEDMVADILS 164 (705) T ss_pred ceEEEeeCChhHHHHHHHHHHHHhHHHhhccchhHHHHHHHHHHhhcCCeEEEeccccccchhhhhhccCChhhhhhhhh Confidence 3332223 11 22233333234455567889999999999987665544110 Q ss_pred -hhh--------------------hhccCceEEEechhhhhcchhhhccCCccceeEEEEEEEeeccccccCCCcccccc Q lcl|NC_019408. 125 -PRK--------------------GAVATSFAVGYSAENILDWDEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQ 183 (612) Q Consensus 125 -~~~--------------------~~~~rPy~~~~~ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~ 183 (612) ++. ....++-+..++|++++ |+. +.-+-..-.++..+-.....+. ...+|.... T Consensus 165 d~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i~~V~p~d~~-~dp--~a~~~~d~~~~~~~~~~t~~dl--~~~g~~~~~ 239 (705) T protein:vir:88 165 DPDTSILAQSVDDDGTYTIKIRKDKKKREIKVLCVKPENFL-VDR--LATCIDDARFLCHREKYTVSDL--RLLGVPEDV 239 (705) T ss_pred hhhhhcccccccccceeeeEEeeeeecCceeeeeccHHHce-ecC--CCCCcccCcEEEEEEeccHHHH--HhhcCChhH Confidence 000 00124445555665554 111 0111112222222211110000 000000000 Q ss_pred eeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccceeEEEEEeeCCCceecceeeecc----- Q lcl|NC_019408. 184 ARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAYVQYLYEEDPESRPIARIVPTV----- 258 (612) Q Consensus 184 ~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~p~~----- 258 (612) +... .-.++...........+ +.+....-+.... .+.....-.++.+.++..+..++.+....-.+... 
T Consensus 240 ~~~~---~~~~~~~~~~~~e~~~~--~~~d~~~~~~~~~-~~~~~~~r~v~~~E~y~~~d~~~d~~~~~~~~~~~g~~il 313 (705) T protein:vir:88 240 IEEL---PYDEYEFSDSQPERLVR--DNFDMTGQLQYNS-GDDAEANREVWASECYTLLDVDGDGISELRRILYVGDYII 313 (705) T ss_pred hhhh---hcccccchhhhhhhccc--ccccccccccccc-ccccCCceeEEEEEeeeEecccCCcceeeEEEEEeCcccc Confidence 0000 00000000000000000 0000000000000 00000000011111111111111111100011111 Q ss_pred CCccccceeEEEeecC-CCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhccceeeee-cCCCCCCceEEEecccccc Q lcl|NC_019408. 259 RGEPLDFIPFKFFGAS-GNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALPVYYAP-GTDSEGTGEYHIGPNMVWE 336 (612) Q Consensus 259 ~g~~l~~IP~v~~~~~-~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i~-G~~~~~~~~l~iG~~~~~~ 336 (612) ...+++.+||++++.. ..+...+.+....++.+.-..=-.-+-+-++++.+..|...+. |.- ...+.+...++.++. T Consensus 314 ~~~~~~~~PF~~~~~~p~~~~~~G~g~~~~~~d~Q~~~n~~~~~~~d~~~~~~~~~~~~~~g~v-~~~d~~~~~pg~vv~ 392 (705) T protein:vir:88 314 SNEPWDCRPFADLNAYRIAHKFHGMSVYDKIRDIQEIRSVLMRNIMDNIYRTNQGRSVVLDGQV-NLEDLLTNEAAGIVR 392 (705) T ss_pred ccccCCCCCEEEecceeecCccccCChHHHHhHHHHHHHHHHHHHHHHHHhccCCceecccccc-CcccccccCCCeeEE Confidence 1236778899875433 2222335555555555533222222234567788888877663 321 223445556677776 Q ss_pred CCCCCceeEEecCchhHHHHHHHHHHHHHHHHHH-HHHhhhcc----ccchhHHHHHHHHHHHHHHHHHHHHHHHHHH-H Q lcl|NC_019408. 337 VPQGSEPGILEYTGQGLKALETALNDKERQIAAI-GGRMMPGA----SKSVSESNNQTVLREANEQSLLLNIIQACES-G 410 (612) Q Consensus 337 lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~l-Ga~ll~~~----~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~-a 410 (612) +..++.+.++.+..-+ .....-|+-++..|... |..-...+ .-..+.|+++..+-.......|..++.++.+ . 
T Consensus 393 ~~~~~~i~~~~~~~~~-~~~~~ll~~~~~~~~~~tGi~~~~~G~~~~~~~~~~Ta~~i~~~~~~~~~r~~~~~r~~a~~~ 471 (705) T protein:vir:88 393 VKSMNSITPLETPQLS-GEVYGMLDRLEADRGKRTGITDRTRGLDQNTLHSNQAAMSVNQLMTAAEQQIDLIARMFAETG 471 (705) T ss_pred ecCCCccccccCCcCc-HHHHHHHHHHHHHHHHhhCCchHHcCCCcccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 6556668887655433 22233344455555432 33222111 1123567777777777777778888887754 3 Q ss_pred H----HHHHHHHHHHcCCcCC----------------CCcceEEEeeccccccCCCHHHHHHHHHHHHc----CC----C Q lcl|NC_019408. 411 M----TDVVRWWLMWRDVPLA----------------DTENLRYEVNTDFLSTPIGAREMRAIQLMAND----GL----L 462 (612) Q Consensus 411 ~----~~~l~~~a~w~g~~~~----------------~~~~~~v~ln~dF~~~~~d~~~~~al~~~~~~----G~----i 462 (612) + ..++.++..|.....- ...++.+.+...+....-..+.+..++.+.+. +. + T Consensus 472 ~~~l~~~~~~li~~~~~~~~~~ri~g~~v~v~~~~~~~~~~v~v~v~~~~~~~eq~~a~l~~ll~~~q~l~~~~~~~~~~ 551 (705) T protein:vir:88 472 VKRLFQLLHDHAIKYQNQEEVFQLRGKWVAVNPANWRERSDLTVTVGIGNMNKDQQMLHLMRIWEMAQAVVGGGGLGVLV 551 (705) T ss_pred HHHHHHHHHHHHHHhCCCceEEeeccchhccchHhhccCCceEEeeccccchHHHHHHHHHHHHHHHHHhhcccchhhhc Confidence 3 4555666666432100 00111111111111110011123344433322 11 1 Q ss_pred CHHHHHHH----HHhcCccchhhhhHHHHHHhhccccccccchhHHhhhhhhHHHHhHHHHHHHHHHH-----HHHHHHH Q lcl|NC_019408. 463 PDPVFYEY----MRKAEVISSDMTFEEFQALRADENSFINNPDAQARQRGYTNRGQELEQSRMAREAD-----FTQQKID 533 (612) Q Consensus 463 s~et~~~~----lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~e~~r~~~e~e-----~~~q~~e 533 (612) +....... ++..++-....-..+. ..+.... 
......+...+....+.+.+.++++...|.+ .+++..| T Consensus 552 ~~~~~~~~~~el~e~~~~k~~~~~~~~~-~~~e~~~-~~~~~~q~e~~~~~~~~~~q~e~~k~q~e~~~~q~e~q~~q~E 629 (705) T protein:vir:88 552 SEQNLYNILKEVTENAGYKDPDRFWTNP-NSPEALQ-AKAIREQKEAQPKPEDIKAQADAQRAQSDALAKQAEAQMKQVE 629 (705) T ss_pred ChHHHHHHHHHHHHhhhhhhHHHHhhhh-hhHHHHH-HHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11111111 1111110000000000 0000000 0000000000000000011111111111111 1111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCchhhcCCCCCcccCCC Q lcl|NC_019408. 534 IQERSVAVQEGHAEVAHAAGSTSISGSRKLGDPEQAKPAVADQATIDNAKKQTANAAKVAAQPPAPAAPGAPPTNRRPT 612 (612) Q Consensus 534 ~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~e~q~k~~~~eq~~~~~~~k~~~~~a~~~~~~~~~~~~~~~~~~~~~~ 612 (612) .+.++++++.++++...++++.+... +. .+.++...++++++.+++++.++..++ +-..+. ..-|+.+||+ T Consensus 630 ~q~~q~e~e~~~~~~~~~~~e~~~~~--a~-~~~~~~~~e~e~~~~e~e~~~e~~q~~---~~~~~~--~~~~~~~k~~ 700 (705) T protein:vir:88 630 AQIRLAEIELKKQEAVLQQREMALKE--AE-LQLERDRFTWERARNEAEYHLEATQAR---AAYIGD--GKVPETKKPT 700 (705) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH--HH-HHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHH--HhHHHHHHHH Confidence 22222222222222111111111000 00 000111111111111111110000000 000000 0012233333 No 68 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=99.75 E-value=4.4e-17 Score=110.26 Aligned_cols=433 Identities=11% Similarity=0.057 Sum_probs=224.3 Q ss_pred CCCcH----------HHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchh Q lcl|NC_019408. 1 MVTHP----------EYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMV 70 (612) Q Consensus 1 ~~~hP----------~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~v 70 (612) =|+-+ .+....+++..+.+-|.|...++ |||. 
.-...|++.+ .-.|+.+-+|+.++.++ T Consensus 17 ~l~~~e~~~i~~L~~~~~~~~~r~~~l~~YY~G~~~i~-----~~~~---~~p~~~~~~~---~v~n~~~~iVd~~a~rl 85 (504) T protein:vir:99 17 ELNDDVVDKVNGLYQQLVDRTPRNLLRASFYDGKYAIR-----QIGN---LIPPEYLRTA---TVLGWSAKAVDTLARRC 85 (504) T ss_pred CCCHHHHHHHHHHHHHHHHHhHHHHHHHHHHhccccch-----hccc---cccHHHHHHh---hccCcHHHHHHHHHhhh Confidence 11122 25556678888888888876553 4443 2234455332 34699999999998877 Q ss_pred hcCCceeec---CCHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhc-c Q lcl|NC_019408. 71 FRRDPIVKN---LPPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILD-W 146 (612) Q Consensus 71 f~k~p~~~~---~p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~Iin-W 146 (612) +-.-..+.+ .+..|..++.. |+|+.....+...++.||+++++|- +..+ ...+|.|..++|++++- | T Consensus 86 ~~~Gf~~~d~~~~~~~l~~i~~~-----N~ld~~~~~~~~~a~iyG~af~~v~-~~~d---~~~~~~I~~~sP~~~~~iy 156 (504) T protein:vir:99 86 NLESFVWPDGDYGSIGGPDVWDE-----NFFATKANNAMVSSLIHGPAFLINT-EGGA---GEPDSLIHVKSAMQATGEW 156 (504) T ss_pred ccceeeCCCCChhhHHHHHHHHh-----cChhhHHHHHHHHHHhhCceeEEEe-cCCC---CCceeEEEEeccceeEEEE Confidence 666544421 22346566643 6789999999999999999999993 3322 12367788899998742 3 Q ss_pred hhhhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeecccc Q lcl|NC_019408. 147 DEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEI 226 (612) Q Consensus 147 ~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~ 226 (612) +- ..++... -+++.+. + .+ |. ....++|- T Consensus 157 D~---~~~~~~~-a~~~~~~--d------~~-----------------g~---------------~~~~~~y~------- 185 (504) T protein:vir:99 157 NS---RRNAMDS-LLSITSR--D------AE-----------------GH---------------PTGIALYE------- 185 (504) T ss_pred eC---CCCceeE-EEEEEEe--c------CC-----------------Ce---------------EEEEEEEc------- Confidence 21 1122111 1111111 0 00 00 00001110 Q ss_pred ccccccccceeEEEEEeeCCCceecceeeeccCCccccceeEEEee-cCCCCCCcCc----CchHHHHHHHHHHHhhhHH Q lcl|NC_019408. 
227 EWPSGEVKLAYVQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFFG-ASGNTADVEK----PPLLDICDLNLSHYRTYAE 301 (612) Q Consensus 227 ~~~~g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~-~~~~~~~~~~----pPLldLA~lnl~HY~~~sD 301 (612) .+ ..+.+...+++.|.. ....++++ ||+|.|- ..+.+...|. .|+.+|.+- .=+..++ T Consensus 186 ---~~-----~~~~~~~~~~~~~~~-----~~~~~~~g-vPvV~~~n~~~~~~~~G~sei~~~v~~l~Da---~~~~~~~ 248 (504) T protein:vir:99 186 ---DG-----VTVTADMDDDGDWHA-----DVRTHKLG-VPVEVLPYKPREDRPLGSSRITRPVMSLQQR---ALKGCIR 248 (504) T ss_pred ---CC-----cEEEEEEcCCceeee-----ccccCCCC-cceEEecccccCccccCcccchhhHHHHHHH---HHHHHHH Confidence 00 001111112222221 12235676 8888653 2222221222 245554332 2255677 Q ss_pred HHHHHHHhccceeeeecCCCCC-----Cc---eEEEeccccccCCCC--------CceeEEecCchhHHHHHHHHHHHHH Q lcl|NC_019408. 302 LEYGRLFTALPVYYAPGTDSEG-----TG---EYHIGPNMVWEVPQG--------SEPGILEYTGQGLKALETALNDKER 365 (612) Q Consensus 302 ~~~~l~~~~~P~l~i~G~~~~~-----~~---~l~iG~~~~~~lp~~--------~~~~~lE~~g~~l~~~~~~l~~~e~ 365 (612) ...+.++.++|.++|.|++.+. .. .+.+..+..|.+|.+ .+++|-++++..++.+.+.|+.+.. T Consensus 249 ~~~~~e~~a~p~r~i~G~~~~~~~~~d~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~q~~~~~l~~~~~~l~~~i~ 328 (504) T protein:vir:99 249 MDGHADVYSFPQLILLGADAKNFRNKDGSMKPAWQIALARVFALPDDEDEPDAARARADVKQFPASSPQPHIEMLEQIAM 328 (504) T ss_pred HHHHHHHhcchhhhhccCCccccccccccccchhhhhhhhhhcCCCccccccccCccceeeecCCCChHHHHHHHHHHHH Confidence 7889999999999999986532 11 133444567777654 4588889999999988888888888 Q ss_pred HHHHHHHHhhhc---cccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCC-CC--cceEEEeecc Q lcl|NC_019408. 366 QIAAIGGRMMPG---ASKSVSESNNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLA-DT--ENLRYEVNTD 439 (612) Q Consensus 366 qm~~lGa~ll~~---~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~-~~--~~~~v~ln~d 439 (612) ++.+...-.... .+...+.||.+...............-..+..++.++++++....+.... .. ..+.+.. 
+| T Consensus 329 ~~a~~t~~P~~~lG~~~~~n~sSa~Ai~~~~~~L~~ka~~k~~~f~~~l~~~~rla~~~~~~~~~~~~~~~~~~v~w-~d 407 (504) T protein:vir:99 329 MFSGETSIPVESLGFSNRANPTSADAYIASREDLIAEAEGATDDWSPAFRRSMIRALAIKNGLDRIPPEWKTIDSKF-RS 407 (504) T ss_pred HHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccceeEe-cC Confidence 887653211111 11123457777776666666666677777888888999988877653211 11 2233333 34 Q ss_pred ccccCCCHHHHHHHHHHHHcCCC--C-HHHHHHHHHhcCccchhhhh-HHHHHHh---------hccccccccch-hHHh Q lcl|NC_019408. 440 FLSTPIGAREMRAIQLMANDGLL--P-DPVFYEYMRKAEVISSDMTF-EEFQALR---------ADENSFINNPD-AQAR 505 (612) Q Consensus 440 F~~~~~d~~~~~al~~~~~~G~i--s-~et~~~~lqr~~vl~~~~~~-eee~~ri---------a~e~~~~~~~~-~~~~ 505 (612) -.+.++ ++.++++.+++++|.+ + .+++++.| |+-++++.- +++.++. ....+...... .... T Consensus 408 ~~~~s~-a~~aDa~~Kl~~ag~~l~~~~~~l~~~l---g~~~~ei~r~~~e~~~~~~~~~~~~l~~~~~~~~~~~~~~~~ 483 (504) T protein:vir:99 408 PLYLSK-AAQADAGAKMLGAGPEWLKETEVGLELL---GLTPQQAKRALAERRRASSVSIIEALNRRQQEAATAGEDQDQ 483 (504) T ss_pred CCccCH-HHHHHHHHHHHhhccccccchHHHHhhc---CCCHHHHHHHHHHHHHHhhHHHHHHHhcccCCCCCCCCCCCc Confidence 444443 6678889999999862 2 45666554 664443321 1111111 01000000000 0000 Q ss_pred hhhhhHHHHhHHHHHHHHHHHHHHHH Q lcl|NC_019408. 506 QRGYTNRGQELEQSRMAREADFTQQK 531 (612) Q Consensus 506 ~~~e~~r~~~~e~~r~~~e~e~~~q~ 531 (612) ...++.-. +.- .+. ..-.+.- T Consensus 484 ~~~e~a~~---~~~-~~~-~~p~~~~ 504 (504) T protein:vir:99 484 GAGEPPAN---EPP-AAL-GRPTLVG 504 (504) T ss_pred CCCCCCCC---CCC-ccC-CCcccCC Confidence 00000000 000 000 0000000 No 69 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=99.74 E-value=2e-17 Score=112.09 Aligned_cols=413 Identities=10% Similarity=0.022 Sum_probs=223.4 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeec- Q lcl|NC_019408. 
1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKN- 79 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~- 79 (612) |+. .+....+++..+.+-|.|...++ ||| ..-...|+.+. ...|+.+-+|++++.++.-.-..+.+ T Consensus 23 L~~--~~~~~~~~~~~~~~Yy~G~~~~~-----~~~---~~~p~~~r~~~---~v~nw~~~~Vd~~a~rl~~~Gf~~~d~ 89 (474) T protein:vir:81 23 LLA--QIENLRWKNLLRTSYYENKRTIQ-----YVG---TLIPPQYFNLG---LVLGWTGKAVDALARRCNLEGFVWPDG 89 (474) T ss_pred HHH--HHHHHhhHHHHHHHHhccCCChh-----hcc---ccccHHHHHHH---hhcChHHHHHHHHHhhhcccceECCCC Confidence 222 45556677777778888876543 343 33345566442 35799999999998777665544411 Q ss_pred --CCHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhc-chhhhccCCcc Q lcl|NC_019408. 80 --LPPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILD-WDEVVDMGGFY 156 (612) Q Consensus 80 --~p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~Iin-W~~~~~v~g~~ 156 (612) .+..+..++. -|+|+.+...+...+|.||+++++|=.... ...+|-|..++|.+++- |+ ...++- T Consensus 90 ~~~~~~l~~iw~-----~N~ld~~~~~~~~~al~~G~sf~~V~~~~d----~~~~~~i~~~sp~~~~~~~D---~~~~~~ 157 (474) T protein:vir:81 90 DLDSLGGTEVVD-----DNHLLSEIDSAIVAAMQHGPAFLINTVGED----DEPEALIHVKDASEATGEWN---RRRRGL 157 (474) T ss_pred CccchHHHHHHH-----hcChhHHHHHHHHHHHhhCceeEEEecCCC----CCceeEEEEeccceEEEEEe---CCCCcc Confidence 2234556654 478899999999999999999999965332 12368888999998764 32 122222 Q ss_pred ceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccce Q lcl|NC_019408. 157 VPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLA 236 (612) Q Consensus 157 ~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~ 236 (612) ...+.++ +. +.++.. . .-.+.+. 
+ .+| T Consensus 158 ~~al~~~-~~--------~~~g~~----~-~~~ly~~-----------------~----~~~------------------ 184 (474) T protein:vir:81 158 NNLLSII-DK--------DKEGKV----L-SLALYLD-----------------N----ETV------------------ 184 (474) T ss_pred eeeeEEE-EE--------cCCCcE----E-EEEEEeC-----------------C----cEE------------------ Confidence 2222221 10 001100 0 0001100 0 000 Q ss_pred eEEEEEeeC-CCceecceeeeccCCccccceeEEEee-cCCCCCCcCc----CchHHHHHHHHHHHhhhHHHHHHHHHhc Q lcl|NC_019408. 237 YVQYLYEED-PESRPIARIVPTVRGEPLDFIPFKFFG-ASGNTADVEK----PPLLDICDLNLSHYRTYAELEYGRLFTA 310 (612) Q Consensus 237 ~~~~~~~~~-~~~~~~~~~~p~~~g~~l~~IP~v~~~-~~~~~~~~~~----pPLldLA~lnl~HY~~~sD~~~~l~~~~ 310 (612) .+...+ +..|. . ....++++ ||+|.|- ..+.+...+. .|+.+|.+- .-+..++...+.+|.+ T Consensus 185 ---~~~~~~~~~~w~-~----~~~~~~~g-vPvV~~~n~~~~~~~~G~s~i~e~v~~l~da---~~r~~~~~~~~~e~~a 252 (474) T protein:vir:81 185 ---TAQRDKATLKWQ-V----DRDEHVYG-VPAQVLPYKPAPKRPFGQSRITKPMMGLQDA---GVRELARREGHMDVFS 252 (474) T ss_pred ---EEEEcCccceee-e----ccCCCCCC-cceEEecccccccCcCCccccchhHHHHHHH---HHHHHHHHHHHHHHhc Confidence 011111 11121 1 12345677 7888653 2222221233 356565432 2356677788999999 Q ss_pred cceeeeecCCCCCC-----c---eEEEeccccccCCCCC--------ceeEEecCchhHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019408. 311 LPVYYAPGTDSEGT-----G---EYHIGPNMVWEVPQGS--------EPGILEYTGQGLKALETALNDKERQIAAIGGRM 374 (612) Q Consensus 311 ~P~l~i~G~~~~~~-----~---~l~iG~~~~~~lp~~~--------~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~l 374 (612) +|+.|+.|++...- . .+....+..|.+|++. .++|-|++...++-+.+.|+.+..++.....-. 
T Consensus 253 ~pqr~i~G~~~~~~~d~d~~~~~~~~~~~~~i~~~~~d~d~~~~~~~~~~~~q~~~a~l~~~~~~l~~~~~~~a~~t~iP 332 (474) T protein:vir:81 253 YPEFWLLGADESALKNADGTIKSVWEARLGRIKGLPDDADADIPQLARADVKQFPAASPDAHWSDINGLAKLFAREASLP 332 (474) T ss_pred chhheeecCChhhcccccccccchhhhhHHHHhcCCCcccccccccccccccccCCCChhHHHHHHHHHHHHHHhhhCCC Confidence 99999999865321 1 1223344677777654 367889999999988888888888887653222 Q ss_pred hhc---cccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCC-----cceEEEeeccccccCCC Q lcl|NC_019408. 375 MPG---ASKSVSESNNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLADT-----ENLRYEVNTDFLSTPIG 446 (612) Q Consensus 375 l~~---~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~-----~~~~v~ln~dF~~~~~d 446 (612) ... .+-..+.||.+...............-..+..++.+++++++...|....+. ..+.+.. +|-....+ T Consensus 333 ~~~lG~~~~~np~SaeAi~a~~~~l~~kae~k~~~fg~~l~~~~rla~~i~~~~~~~~~~~~~~~~~v~W-~d~~~~s~- 410 (474) T protein:vir:81 333 DTAVAISGLSNPTSAESYDASQYELIAEAEGAVDDFTPALRKAFIRALAMKNKVAIDEIPDEWKSIDAKW-RDPRYLSK- 410 (474) T ss_pred HHHhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccchhhccceeEe-cCCCccCH- Confidence 211 0111235666666666555555666667788888999999988877422211 1233333 34444443 Q ss_pred HHHHHHHHHHHHcCC--CCHHHHHHHHHhcCccchhhh-hHHHHHHhhccccccccchhH-Hh-hhhhhHHHHhHHH Q lcl|NC_019408. 447 AREMRAIQLMANDGL--LPDPVFYEYMRKAEVISSDMT-FEEFQALRADENSFINNPDAQ-AR-QRGYTNRGQELEQ 518 (612) Q Consensus 447 ~~~~~al~~~~~~G~--is~et~~~~lqr~~vl~~~~~-~eee~~ria~e~~~~~~~~~~-~~-~~~e~~r~~~~e~ 518 (612) ++..+++.+++++|. .+++++++.+ |+-++++. ++.+.++..... .-+.. .+ ...-.. 
| T Consensus 411 a~~aDa~~Kl~~a~~~~~~~~~~~~~l---g~t~~~i~~~~~~~~~~~~~~----~~~~l~~~~~~~~~a------q 474 (474) T protein:vir:81 411 SAQADAGMKQLAAVPWLAETEVGLELI---GLTPQQARRAMADKRRVQGRG----TLQALIDRSNNGATA------Q 474 (474) T ss_pred HHHHHHHHHHHhcccCCCcHHHHHhhc---CCCHHHHHHHHHHHHHHhHHH----HHHHHHhcCCCCCCC------C Confidence 778999999999873 4455555544 54322221 011111110000 00000 00 000000 0 No 70 >protein:vir:101494 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655388;genbank:gi:109522576;genbank:GeneID:4157566 Probab=99.74 E-value=1.9e-17 Score=112.29 Aligned_cols=468 Identities=13% Similarity=0.054 Sum_probs=240.3 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChH-HHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCcee-- Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQR-EIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIV-- 77 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~-~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~-- 77 (612) =++.+.=.+.+..+++..|.|.|.. .++ ++...++ .+=+|-++.|.- +.|+.++-.| T Consensus 23 ~~v~~~d~~Rl~aY~l~~~~y~n~~~~~~------~~lrg~~------~~~~r~~~~ps~--------~~~~~~~~~~~~ 82 (527) T protein:vir:10 23 NAVTDFDKARLASYRLYEDMYLTNTSDYQ------VILRGGD------EGDQRPIYVPNG--------EKLIEAKMRFLG 82 (527) T ss_pred ccCCHHHHHHHHHHHHHHHHhcCchhhee------eecCCcc------ccccceeeehhh--------HHhhCCcceeec Confidence 1355666677888999999988742 111 1111111 111223333333 2222222222 Q ss_pred ---e----cCCHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhcchhhh Q lcl|NC_019408. 78 ---K----NLPPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILDWDEVV 150 (612) Q Consensus 78 ---~----~~p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW~~~~ 150 (612) + ...+.++.++.+ --+-++|+..+.+.-+.++.-|-..++|=.. +++-.+.||-++.++|..+.-|. 
T Consensus 83 ~g~~~~~~~~~e~v~~~lr~-~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD--~~k~~~~R~~v~~~DP~~~f~~e--- 156 (527) T protein:vir:10 83 QGLKWEFSKKDAKVDDAIKV-LFDRENWEQKFESLKRWTEIRGDYVLLLIGD--DEKDEGSRLSLHEVDPSTYFPYE--- 156 (527) T ss_pred cCccccccchhHHHHHHHHH-HHHHhhhHHHHHHHHHhhhhhcceeEEEeec--cCCCcCCCceEeecCcceeeeee--- Confidence 1 011233333322 1222889999999999988888555444332 22334679999999999999883 Q ss_pred ccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeecccccccc Q lcl|NC_019408. 151 DMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPS 230 (612) Q Consensus 151 ~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~ 230 (612) +-+|.....-|-+. ..|..+.|.......-..|.+....+..+. ...+|. +.| .++ .|.- T Consensus 157 d~d~~~~v~~v~~~-----~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~-------~~~~G~---~~y----t~~-~w~l 216 (527) T protein:vir:10 157 DPRYPGQVLGVYLV-----DEYPHPDSEKKNEKCARVQKYMKTLDDDGK-------PVPGGA---IKY----TEE-LYEP 216 (527) T ss_pred cCCCCCceeeEEEe-----eeccCCccccccceehhhhhhhhhcCcccc-------cccCcc---eee----eec-eeec Confidence 22333333333332 134455554432221111111111111110 011121 111 000 0000 Q ss_pred ccccceeEEEEEeeCCCceecceeeeccCCccccceeEEEee-cCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHh Q lcl|NC_019408. 231 GEVKLAYVQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFFG-ASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFT 309 (612) Q Consensus 231 g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~-~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~ 309 (612) |.+......-+...+=.. .....+....-+||++||+|.+. 
........|.+-|.++-.+--+.=++.||++-|+-++ T Consensus 217 g~w~d~~e~p~~~~~~~~-~~~~~~l~~lp~pi~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~s 295 (527) T protein:vir:10 217 GKWDDRPESPLEPDDIKK-LSTLTEEEPLPEQITTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFG 295 (527) T ss_pred cccccccccccchhhhhh-hcCceeeecccCCCCccceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHh Confidence 100000000000000000 01112233455789999999663 3445667789999999999889999999999999999 Q ss_pred ccceeeeecCCCCC----CceEEEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHhhhcccc----c Q lcl|NC_019408. 310 ALPVYYAPGTDSEG----TGEYHIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRMMPGASK----S 381 (612) Q Consensus 310 ~~P~l~i~G~~~~~----~~~l~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~ll~~~~~----~ 381 (612) +.|+.+++|+...+ ..++.|||+..|.||.++++..|.-. ..++..++.|+.+.++|... +++-...-+ . T Consensus 296 G~Pi~~~tg~~~vd~~G~~~~~~VgPG~iweL~e~ak~~~v~~~-~~la~~~~h~~~L~~~l~~v-A~~PavA~G~vD~s 373 (527) T protein:vir:10 296 GLGFYATDSAPPRDSRGNMVPWTISPLGMVEHGQNNKIYRVNGV-ASLEPSQTHMTKAEEAMQQT-KGIPDIAVGVVDAA 373 (527) T ss_pred CCceeeecccccccccCCcCccccCCceeEecCCCcceeeccch-hhhHHHHHHHHHHHHHHHHh-hcCCeeeeccccCC Confidence 99999999985443 24688999999999999999988832 35666788888888887654 233222111 2 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHH----cCCcCCCCc---ceEEEeeccccccCCCHHHHHHH Q lcl|NC_019408. 382 VSESNNQTVLREANEQSLLLNIIQACESGMTDVVR-WWLMW----RDVPLADTE---NLRYEVNTDFLSTPIGAREMRAI 453 (612) Q Consensus 382 ~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~-~~a~w----~g~~~~~~~---~~~v~ln~dF~~~~~d~~~~~al 453 (612) ...|+.+..+..+...+..+.........+.+.+. |+-+| .|....+.. .+++... 
+..+.+ ..+.+.++ T Consensus 374 ~~~SG~ALeL~L~PLlar~~rk~L~~~~vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf~-p~lP~D-~~avie~v 451 (527) T protein:vir:10 374 VAESGIALDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTITFR-DPKPVN-SEKRFNQL 451 (527) T ss_pred cCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEEec-ccCCCC-HHHHHHHH Confidence 34576666666555443333333223333333222 33344 343332211 2333332 222222 35578999 Q ss_pred HHHHHcCCCCHHHHHHHHHhc-CccchhhhhHHHHHHhhccc----cccccchhHHhh-----hhhhHHHHhHHHHHH Q lcl|NC_019408. 454 QLMANDGLLPDPVFYEYMRKA-EVISSDMTFEEFQALRADEN----SFINNPDAQARQ-----RGYTNRGQELEQSRM 521 (612) Q Consensus 454 ~~~~~~G~is~et~~~~lqr~-~vl~~~~~~eee~~ria~e~----~~~~~~~~~~~~-----~~e~~r~~~~e~~r~ 521 (612) ..++++|.||++|.++.|.+. ++-+++.+.++..++++.+. ...+.-.+++.. ..+...+=. .+=. T Consensus 452 ~tL~~aGi~S~~tAv~~L~~~~g~eD~E~E~~~I~~era~~a~a~a~A~~~~~a~~~~~~g~~~~~~d~~~~--~~~~ 527 (527) T protein:vir:10 452 LQLWEAGLIPAKKLTEELSKIMGFELTEEDFKQATEDKKTQGIAQAEAADPFGAQMAAEQGIPDEEDDQALN--GQPL 527 (527) T ss_pred HHHHHcCchhHHHHHHHHHhccCCCChHHHHHHHHHHHHHHhHHhhhhcCchhhhhccccCCCCCCcccccC--CCCC Confidence 999999999999999999875 54444544444443333221 111111122110 001110000 0000 No 71 >protein:vir:102239 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655205;genbank:gi:109522785;genbank:GeneID:4157478 Probab=99.74 E-value=2e-17 Score=112.12 Aligned_cols=468 Identities=13% Similarity=0.056 Sum_probs=240.5 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChH-HHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCcee-- Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQR-EIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIV-- 77 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~-~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~-- 77 (612) =++.+.=.+.+..+++..|.|.|.. 
.++ ++...++ .+=+|-++.|.- +.|+.++-.| T Consensus 23 ~~v~~~d~~Rl~aY~l~~~~y~n~~~~~~------~~lrg~~------~~~~r~~~~ps~--------~~~~~~~~~~~~ 82 (527) T protein:vir:10 23 NAVTDFDKARLASYRLYEDMYLTNTSDYQ------VILRGGD------EGDQRPIYVPNG--------EKLIEAKMRFLG 82 (527) T ss_pred ccCCHHHHHHHHHHHHHHHHhcCchhhee------eecCCcc------ccccceeeehhh--------HHhhCCcceeec Confidence 1355666677888999999988742 111 1111111 111223333333 2222222222 Q ss_pred ---e----cCCHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhcchhhh Q lcl|NC_019408. 78 ---K----NLPPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILDWDEVV 150 (612) Q Consensus 78 ---~----~~p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW~~~~ 150 (612) + ...+.++.++.+ --+-++|+..+.+.-+.++.-|-..++|=.. +++-.+.||-++.++|..+.-|. T Consensus 83 ~g~~~~~~~~~e~v~~~lr~-~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD--~~k~~~~R~~v~~~DP~~~f~~e--- 156 (527) T protein:vir:10 83 QGLKWEFSKKDAKVDDAIRV-LFDRENWEQKFESLKRWTEIRGDYVLLLIGD--DEKDEGSRLSLHEVDPSTYFPYE--- 156 (527) T ss_pred cCccccccchhHHHHHHHHH-HHHHhhhHHHHHHHHHhhhhhcceeEEEeec--cCCCcCCCceEeecCcceeeeee--- Confidence 1 011233333321 2222888999999999988888555444332 22334679999999999999883 Q ss_pred ccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeecccccccc Q lcl|NC_019408. 151 DMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPS 230 (612) Q Consensus 151 ~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~ 230 (612) +-+|.....-|-+. ..|..+.|.......-..|.+....+..+. ...+|. 
+.| .++ .|.- T Consensus 157 d~d~~~~v~~v~~~-----~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~-------~~~~G~---~~y----t~~-~w~l 216 (527) T protein:vir:10 157 DPRYPGQVLGVYLV-----DEYPHPDSEKKNEKCARVQKYMKTLDDDGK-------PVPGGA---IKY----TEE-LYEP 216 (527) T ss_pred cCCCCCceeeEEEe-----eeccCCccccccceehhhhhhhhhcCcccc-------cccCcc---eee----eec-eeec Confidence 22333333333332 134455554432221111111111111110 011121 111 000 0000 Q ss_pred ccccceeEEEEEeeCCCceecceeeeccCCccccceeEEEee-cCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHh Q lcl|NC_019408. 231 GEVKLAYVQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFFG-ASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFT 309 (612) Q Consensus 231 g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~-~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~ 309 (612) |.+......-+...+=.. .....+....-+||++||+|.+. ........|.+-|.++-.+--+.=++.||++-|+-++ T Consensus 217 g~w~d~~e~p~~~~~~~~-~~~~~~l~~lp~pi~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~s 295 (527) T protein:vir:10 217 GKWDDRPESPLEPDDIKK-LSTLTEEEPLPEQITTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFG 295 (527) T ss_pred cccccccccccchhhhhh-hcCceeeecccCCCCccceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHh Confidence 100000000000000000 01112233455789999999663 3445667789999999999889999999999999999 Q ss_pred ccceeeeecCCCCC----CceEEEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHhhhcccc----c Q lcl|NC_019408. 310 ALPVYYAPGTDSEG----TGEYHIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAIGGRMMPGASK----S 381 (612) Q Consensus 310 ~~P~l~i~G~~~~~----~~~l~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~ll~~~~~----~ 381 (612) +.|+.+++|+...+ ..++.|||+..|.||.++++..|.-. ..++..++.|+.+.++|... +++-...-+ . 
T Consensus 296 G~Pi~~~tg~~~vd~~G~~~~~~VgPG~iweL~e~ak~~~v~~~-~~la~~~~h~~~L~~~l~~v-A~~PavA~G~vD~s 373 (527) T protein:vir:10 296 GLGFYATDSAPPRDSRGNMVPWTISPLGMVEHGQNNKIYRVNGV-ASLEPSQTHMNKAEEAMQQT-KGIPDIAVGVVDAA 373 (527) T ss_pred CCceeeecccccccccCCcCccccCCceeEecCCCcceeeccch-hhhHHHHHHHHHHHHHHHHh-hcCCeeeeccccCC Confidence 99999999985443 24688999999999999999988832 35666788888888887654 233222111 2 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHH----cCCcCCCCc---ceEEEeeccccccCCCHHHHHHH Q lcl|NC_019408. 382 VSESNNQTVLREANEQSLLLNIIQACESGMTDVVR-WWLMW----RDVPLADTE---NLRYEVNTDFLSTPIGAREMRAI 453 (612) Q Consensus 382 ~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~-~~a~w----~g~~~~~~~---~~~v~ln~dF~~~~~d~~~~~al 453 (612) ...|+.+..+..+...+..+.........+.+.+. |+-+| .|....+.. .+++... +..+.+ ..+.+.++ T Consensus 374 ~~~SG~ALeL~L~PLlar~~rk~L~~~~Vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf~-p~lP~D-~~avie~v 451 (527) T protein:vir:10 374 VAESGIALDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTITFR-DPKPVN-NEKRFAQL 451 (527) T ss_pred cCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEEec-ccCCCC-HHHHHHHH Confidence 34576666666555443333333223333333222 33344 343332211 2333332 222222 35578999 Q ss_pred HHHHHcCCCCHHHHHHHHHhc-CccchhhhhHHHHHHhhccc----cccccchhHHhh-----hhhhHHHHhHHHHHH Q lcl|NC_019408. 454 QLMANDGLLPDPVFYEYMRKA-EVISSDMTFEEFQALRADEN----SFINNPDAQARQ-----RGYTNRGQELEQSRM 521 (612) Q Consensus 454 ~~~~~~G~is~et~~~~lqr~-~vl~~~~~~eee~~ria~e~----~~~~~~~~~~~~-----~~e~~r~~~~e~~r~ 521 (612) ..++++|.||++|.++.|.+. ++-+++.+..+..++++.+. ...+.-.+++.. ..+...+=. .+=. 
T Consensus 452 ~tL~~aGiiS~etAv~~L~~~~g~eD~E~E~~~I~~era~~a~a~a~a~~~~~a~~~~~~g~~~~~~d~~~~--~~~~ 527 (527) T protein:vir:10 452 LELWEAGLIPAKKLTEELSKIMGFELTEEDFRQATEDKKTQGIAQAEAADPFGAQMAAEQGIPDEEDDQALN--GQPL 527 (527) T ss_pred HHHHHcCchhHHHHHHHHHhccCCCchHHHHHHHHHHHHHHhHHhhhhcCchhhhhccccCCCCCCcccccC--CCCC Confidence 999999999999999999875 54445544444444333221 111111122110 001110000 0000 No 72 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=99.65 E-value=2.5e-14 Score=95.20 Aligned_cols=435 Identities=11% Similarity=0.088 Sum_probs=206.1 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChH-HHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeec Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQR-EIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKN 79 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~-~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~ 79 (612) ...||++......| +.+|.|.. -+......+-+++ +. .+-.-.|+.+.+++.+++++|.+||++.. T Consensus 31 ~~~~~~~~~~i~~~---~~yy~g~~~~~~~~~~~~~~~~-------~~---~~~~~~n~~k~i~~~~a~~l~~~p~~i~~ 97 (496) T protein:vir:38 31 VNANDEDYKYIDMW---KRLYQGHYAEWHNLNYEHNGNP-------VN---RRQLSMNLPKVTAKYMSKLLFNEKVKINI 97 (496) T ss_pred CcCCHHHHHHHHHH---HHHhcCCCchhhcchhccCCCc-------cc---cceeecchHHHHHHHHhhhhhCCcceEee Confidence 44466666666666 46688853 3332221111111 11 11122599999999999999999999842 Q ss_pred CCHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhc-chhhhccCCccce Q lcl|NC_019408. 80 LPPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILD-WDEVVDMGGFYVP 158 (612) Q Consensus 80 ~p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~Iin-W~~~~~v~g~~~L 158 (612) -.+....+++++=.. ++++..+..++..++.+|.+++.|=+ .. ..+|.+..++|++++- |. 
++ ..+ T Consensus 98 ~d~~~~e~l~~~~~~-n~f~~~~~~~~~~a~~~G~~~~~~~~-D~-----~~~~~i~~v~~~~~~P~~~-----~~-~~~ 164 (496) T protein:vir:38 98 DDKAAEEFVLNVLKT-NGFTKNMERYIEYGEAMGGFVIKVYH-DG-----NKNVKVSFATADCMYPLSN-----DS-ENV 164 (496) T ss_pred CChHHHHHHHHHHhc-cCHHHHHHHHHHHHhhhCcEEEEEEE-cC-----CCcEEEEEEcccceEEEEe-----cC-CcE Confidence 234566666655333 78999999999999999999888733 21 3578999999999875 31 21 346 Q ss_pred eEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccce--eeeeeeeccccccccccccce Q lcl|NC_019408. 159 SRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYI--TVYRELKLEEIEWPSGEVKLA 236 (612) Q Consensus 159 t~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~--~~~R~~~~~~~~~~~g~~~~~ 236 (612) +.+++.+.... + -..|+.+..-.+ .++.+.+ ..|+..... .. | ..+ T Consensus 165 ~~~~f~~~~~~-------~------~~~y~~le~h~~-------------~~~~~~I~~~~y~~~~~~--~~--g--~~v 212 (496) T protein:vir:38 165 DECVIANSFHK-------N------NKYYTLLEWNEW-------------QGDVYTVTTELYQSDDPN--EL--G--TKV 212 (496) T ss_pred EEEEEEEEEEe-------C------CeEEEEEEEEEE-------------eCceEEEEEEEEecCCcc--cc--C--ccc Confidence 66666543321 1 112222221111 0111111 112211000 00 0 000 Q ss_pred eEEEEEeeCCCceecceeeeccCCccccceeEEEeecC--CC---CCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHh-- Q lcl|NC_019408. 237 YVQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFFGAS--GN---TADVEKPPLLDICDLNLSHYRTYAELEYGRLFT-- 309 (612) Q Consensus 237 ~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~--~~---~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~-- 309 (612) -...++.. ..+...=..++..||+++... ++ +...|.+-|.++..+-=..=..-|++.+.++.. 
T Consensus 213 ~~~~~~~~---------~~~~~~~~~~~~~~f~~~~~~~~N~~~~~~p~G~Sd~~~~~~lid~ld~~~s~~~~~~~~~~~ 283 (496) T protein:vir:38 213 SLTLLFDD---------IEPVVPLPDFTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKLGKK 283 (496) T ss_pred cccccccc---------cccceeecCCCcceEEEecCCcccccccCCcCCCchHhhHHHHHHHHHHHHHHHHHHHhhccc Confidence 00001100 000000012456677665332 11 111233334333333111112223333344432 Q ss_pred --ccceeeeecCCCCCCceEE--Eeccc---cc-cCCCCCceeEEecCchh-HHHHHHHHHHHHHHHHHH-H--HHhhhc Q lcl|NC_019408. 310 --ALPVYYAPGTDSEGTGEYH--IGPNM---VW-EVPQGSEPGILEYTGQG-LKALETALNDKERQIAAI-G--GRMMPG 377 (612) Q Consensus 310 --~~P~l~i~G~~~~~~~~l~--iG~~~---~~-~lp~~~~~~~lE~~g~~-l~~~~~~l~~~e~qm~~l-G--a~ll~~ 377 (612) -+|.-++.......+.... .+... ++ ..+.++..++-.+++.- .+.....++.+.+++... | ...+.. T Consensus 284 ~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~i~~e~~~~~l~~~l~~i~~~~g~~~~~f~~ 363 (496) T protein:vir:38 284 KVLVPSSFVKTAVNLDGSTTQYFDSTDEAFFLYQGDQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTF 363 (496) T ss_pred ceecchHHhhccCCCCCccccCCCCccceEEEeecCCCcccccceeeccccCHHHHHHHHHHHHHHHHHhhCCChhhcCC Confidence 2232223211111111100 00000 00 11233333344444432 245566666666655432 2 112211 Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHc-------CCcCCCCcceEEEeeccccccCCCHHHH Q lcl|NC_019408. 378 ASKSVSESNNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWR-------DVPLADTENLRYEVNTDFLSTPIGAREM 450 (612) Q Consensus 378 ~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~-------g~~~~~~~~~~v~ln~dF~~~~~d~~~~ 450 (612) ...+..||++.........+........++.++.++++.+..+. |... +..+++|..+.. 
.+.+ ....+ T Consensus 364 -~~~g~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~g~~~-~~~~i~v~f~d~-i~~d-~~~~~ 439 (496) T protein:vir:38 364 -DENGLKTATEVVSEKSETYQTKNSHSQLIEQGIKEMIVSILEVGKFIEAYSGEVV-ELDTITVDFDDS-IAQD-EDTTI 439 (496) T ss_pred -CccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCC-CccceEEEeCCC-CCCC-HHHHH Confidence 12344566666655555666666777788888888877765432 2222 234455555432 2221 24568 Q ss_pred HHHHHHHHcCCCCHHHHHHHHHhcCccchhhhhHHHHHHhhcccccccc-chhHHhhhhhhH Q lcl|NC_019408. 451 RAIQLMANDGLLPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFINN-PDAQARQRGYTN 511 (612) Q Consensus 451 ~al~~~~~~G~is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~-~~~~~~~~~e~~ 511 (612) +.+.+++.+|.||++|++..+ -++ .+-+++++.+++++|.+.... ++.-.. .++.+ T Consensus 440 ~~~~~~~~~GiiS~et~l~~~--~~~--~d~ea~~el~ri~~E~~~~~~~~d~~~~-~~~~e 496 (496) T protein:vir:38 440 NRYTNAKNQGMIPLKIALQRA--WNI--TEAEADEWAEMLAKEKQAEMPNNDMNGI-FGEEE 496 (496) T ss_pred HHHHHHHhcCCCCHHHHHHhc--CCC--ChHHHHHHHHHHHHhhhccCccccccCC-CCCCC Confidence 889999999999999886532 244 233455677777666442211 111111 11111 No 73 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=99.64 E-value=7.3e-14 Score=92.62 Aligned_cols=439 Identities=11% Similarity=0.077 Sum_probs=203.5 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcCh-HHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeec Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQ-REIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKN 79 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~-~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~ 79 (612) .-.|+++......|+ .+|.|. ..+......+-+++ .. .+-.=.|+.+.+++.+++++|.+||++.. 
T Consensus 31 i~~~~~~~~~i~~~~---~~Y~g~~~~~~~~~~~~~~~~--~~--------~~~~s~n~~~~iv~~~a~~l~~ep~~i~~ 97 (499) T protein:vir:80 31 VNANDEDYKYIDMWK---RLYQGNYAEWHNLNYEHNGNP--VN--------RRQLSMNLPKVTAKYMSKLLFNEKVKINI 97 (499) T ss_pred CcCCHHHHHHHHHHH---HHhcCCcchhhccccccCCCc--cc--------cceeecchHHHHHHHHHHhhhCCcceEee Confidence 334777777777775 567775 33433211111111 11 11122599999999999999999999943 Q ss_pred CCHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhc-chhhhccCCccce Q lcl|NC_019408. 80 LPPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILD-WDEVVDMGGFYVP 158 (612) Q Consensus 80 ~p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~Iin-W~~~~~v~g~~~L 158 (612) -.+....+++++-.. +++...+..++..++.+|.+++.|=+.. ..+|-+..++|++++- |.. + ..+ T Consensus 98 ~d~~~~e~l~~~~~~-n~f~~~~~~~~~~a~~~G~~~~~~~~D~------~~~~~i~~v~a~~~~Pi~~d----~--~~~ 164 (499) T protein:vir:80 98 DDETAEEFVLNVLKT-NGFTKNMERYIEYGEAMGGFVIKVYHDG------NKNVKVSFATADCMYPLSND----S--ENV 164 (499) T ss_pred CCHHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCcEEEEEEECC------CCcEEEEEEcCCceEEEEec----C--CCe Confidence 235555565554433 7789999999999999999988764432 2578899999999875 421 2 236 Q ss_pred eEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeecccccccccccccee- Q lcl|NC_019408. 159 SRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAY- 237 (612) Q Consensus 159 t~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~- 237 (612) +.+++.+..... -..|+.+....+. ....+.|.++ ++..... ++..-|.. 
T Consensus 165 ~~~~f~~~~~~~-------------~~~y~~lE~h~~~----------~~~~~~y~I~-n~~~~~~-----~~~~lG~~v 215 (499) T protein:vir:80 165 DECLIANSFHKN-------------NKYYKLLEWNEWK----------GEKEEVYTVT-TELYQSD-----DPNELGGKV 215 (499) T ss_pred EEEEEEEEEeec-------------CeEEEEEEEEEec----------ccceeeEEEE-EEEEecc-----CccccCccc Confidence 666665543221 0122222211100 0000011111 1110000 00000000 Q ss_pred -EEEEEeeCCCceecceeeeccCCccccceeEEEeecC-CCC----CCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhcc Q lcl|NC_019408. 238 -VQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFFGAS-GNT----ADVEKPPLLDICDLNLSHYRTYAELEYGRLFTAL 311 (612) Q Consensus 238 -~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~-~~~----~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~ 311 (612) ...++.. ..|...=..++..||+++... .|. ...|.+-|.++-.+-=+.=..-|.+.+.+..... T Consensus 216 ~l~~~~~~---------~~~~~~~~~~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~~~~ 286 (499) T protein:vir:80 216 SLKLLFND---------IEPVVPLPSLTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKLGKK 286 (499) T ss_pred chhhhccC---------cCCceeecCCCccceEeecCCccccccCCCccCCchHhhHHHHHHHHHHHHHHHHHHHHhccc Confidence 0001100 001000012566777766332 111 1113333333332211111222333344444333 Q ss_pred ceee----ee-cCCCCCCce-EEEeccc---cc-cCCCCCceeEEecCchh-HHHHHHHHHHHHHHHHH---HHHHhhhc Q lcl|NC_019408. 312 PVYY----AP-GTDSEGTGE-YHIGPNM---VW-EVPQGSEPGILEYTGQG-LKALETALNDKERQIAA---IGGRMMPG 377 (612) Q Consensus 312 P~l~----i~-G~~~~~~~~-l~iG~~~---~~-~lp~~~~~~~lE~~g~~-l~~~~~~l~~~e~qm~~---lGa~ll~~ 377 (612) .+++ +. ..+..+... .--+... .+ ..+.++..++-.+++.- .+++.+.|+.+..++.. ++...+.. 
T Consensus 287 ~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~fg~ 366 (499) T protein:vir:80 287 KVLVPSSFVKTAVNLDGSTTQYFDSTDEAFFLYQGEQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTF 366 (499) T ss_pred ceecchhhhhccCCCCCCcccCCCcccceeeEeeccCCCCcCceeEecCcCChHHHHHHHHHHHHHHHHhcCCChhhcCC Confidence 3333 11 111111100 0000111 11 11223333343344432 23445555555554432 11112211 Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCc------CCCCcceEEEeeccccccCCCHHHHH Q lcl|NC_019408. 378 ASKSVSESNNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVP------LADTENLRYEVNTDFLSTPIGAREMR 451 (612) Q Consensus 378 ~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~------~~~~~~~~v~ln~dF~~~~~d~~~~~ 451 (612) ...+..||++.....+........+...+..+|.++++.+..|.... ..+..++.|..+..-.. -...+++ T Consensus 367 -~~~g~~TAtei~s~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~--d~~~~~~ 443 (499) T protein:vir:80 367 -DENGLKTATEVVSEKSETYQTKNSHSQLIEQGIKEMIVSILEVGKLIKAYDGDTVELDTITVDFDDSIAQ--DEDTTIN 443 (499) T ss_pred -CcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCccceEEEeCCCCCC--CHHHHHH Confidence 22345566666655555555667777788888888877777663211 11123455544332211 1245678 Q ss_pred HHHHHHHcCCCCHHHHHHHHHhcCccchhhhhHHHHHHhhccccc-cccchhHHhhhhhhH Q lcl|NC_019408. 452 AIQLMANDGLLPDPVFYEYMRKAEVISSDMTFEEFQALRADENSF-INNPDAQARQRGYTN 511 (612) Q Consensus 452 al~~~~~~G~is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~-~~~~~~~~~~~~e~~ 511 (612) .+.+++.+|.||++|++.. ..|+ ++-+.+++..++.+|... ...++.... 
.++.+ T Consensus 444 ~~~~~~~~Gi~S~et~l~~--~~~~--~d~ea~~el~~i~~E~~~~~~~~d~~g~-~ge~e 499 (499) T protein:vir:80 444 RYTTAKNQGMIPLKIALQR--AWNI--TEAEADEWAEMLAKEKQAEIPNNDMTGI-FGEEE 499 (499) T ss_pred HHHHHHHcCCCCHHHHHhh--cCCC--ChHHHHHHHHHHHHHhhcCCCCCCcccc-CCCCC Confidence 8899999999999988543 2344 233455666776655432 222222211 11111 No 74 >protein:vir:7430 Length: 563 # NCBI annotation: gp7 # Family: family:all:6920 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818545;genbank:gi:29566982;genbank:GeneID:1260216 Probab=99.60 E-value=2.4e-14 Score=95.30 Aligned_cols=457 Identities=11% Similarity=0.048 Sum_probs=230.2 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeecC Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKNL 80 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~~ 80 (612) .++-+.-.+.+..+++..|.|.|... .|=|-.++++ .+-++.|.-+..|++ +..++.++..+ .+ T Consensus 21 ~wV~~~D~~RlaaY~ly~d~y~n~~~------el~~il~G~d--------r~~~~~ps~r~~V~~-~~~~Lg~~~~~-~V 84 (563) T protein:vir:74 21 NIVDENDKNRVRAYDLYENIYLNSAE------TLKLVLRGDD--------SVPILMPSGRKIVEA-VHRFLGVGFDY-LV 84 (563) T ss_pred ccCCHHHHHHHHHHHHHHHhhcCchh------hhhhhcCCCc--------eeeeccchHHHHHHH-HHHhcCCCcEE-ec Confidence 66778888889999999999988432 1112122332 455555666667777 44666776666 23 Q ss_pred C---------HHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhcchhhhc Q lcl|NC_019408. 81 P---------PKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILDWDEVVD 151 (612) Q Consensus 81 p---------~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW~~~~~ 151 (612) | ..++.++.+.=. -.+|...+.+.-+.++.-|-..++|=. .+++..+.||-++.|+|..|.-|.--.. 
T Consensus 85 e~~~~de~~~~avq~~Lr~~~~-~e~l~~~~~~~~r~a~vlGDgvf~l~w--Dp~K~~g~R~rv~~vDP~~~fp~~dpd~ 161 (563) T protein:vir:74 85 EPDMGDEGIRQSLNAYFRTTFK-REAIKAKFTSNKRWGLIRGDAHFYIHA--DPNKKAGERISVDEVDPRQIFLIEDGST 161 (563) T ss_pred CccccCcchHHHHHHHHHHHHH-HhhhHHHHHHHHHhhhhhcceeEEEee--ccccccCCCceEeecCCceeeeccCCCC Confidence 2 334444433222 257888888888888888855555433 2245667899999999999999953222 Q ss_pred cCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccc Q lcl|NC_019408. 152 MGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSG 231 (612) Q Consensus 152 v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g 231 (612) +.| +...+.+. .|..+.|+.. .+...|.+....+..++..... .+..+.|.--+ |.+- T Consensus 162 v~g------~~~v~v~~--~~~~pdd~~~--~~~r~~~~~~~lndeg~~~~~~-------~~dae~w~lg~-----wd~r 219 (563) T protein:vir:74 162 VVG------FHMVDIVQ--DFRSPDDPSK--KLARRRTFRRVRNDEGMFTGRI-------SSELTHWTLGN-----WDDR 219 (563) T ss_pred ccc------ceeeeccc--CCCCCcchhc--cceeeeeeeeeeCCCCCcccee-------eeccchhcccc-----cccc Confidence 211 11222222 3444444332 1121111111111111100000 00111121100 0000 Q ss_pred cccceeEEEEEeeCCCceecce--eeeccCCccccceeEEEeecC-CCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHH Q lcl|NC_019408. 232 EVKLAYVQYLYEEDPESRPIAR--IVPTVRGEPLDFIPFKFFGAS-GNTADVEKPPLLDICDLNLSHYRTYAELEYGRLF 308 (612) Q Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~--~~p~~~g~~l~~IP~v~~~~~-~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~ 308 (612) . .....+.. .+.+..... .....--+|+++||+|.+.+. 
..+...+.+-|.+|-.+--+.-++.+|.+.++-+ T Consensus 220 ~-~~~~~~~~---~~~~~~~~~~d~e~~~LP~pi~~iPiv~~~tip~~~s~WG~S~La~ll~~~~eLn~~~Td~s~i~~~ 295 (563) T protein:vir:74 220 G-AISDEQAR---RKEQVRSAQHDEEEEELPEPISQLPLYRWRNKPPQNSSWGTSQLEGMETLAYALNQSLTDEDATIVF 295 (563) T ss_pred C-ccchhhhc---ccchhhhhhhhchhhhccccccCccEEEcCCCCCcccccchhhHHHHHHHHHHHhhhhhHHHHHHHh Confidence 0 00000000 000000000 000111358999999876443 3455568888998888888888999999999999 Q ss_pred hccceeeeecCCCCCC-----ceEEEeccccccCCCCCceeEEe-cCc-hhHHHHHHHHHHHHH-HHHHH------HHHh Q lcl|NC_019408. 309 TALPVYYAPGTDSEGT-----GEYHIGPNMVWEVPQGSEPGILE-YTG-QGLKALETALNDKER-QIAAI------GGRM 374 (612) Q Consensus 309 ~~~P~l~i~G~~~~~~-----~~l~iG~~~~~~lp~~~~~~~lE-~~g-~~l~~~~~~l~~~e~-qm~~l------Ga~l 374 (612) ++.|+.++.|....+. .++.||++..|-||.++..+++. .+| ..++.++.-|+++.. -|... +.-. T Consensus 296 tG~pi~vl~~~~p~d~~~g~~~~w~vgpG~i~El~~~~~~g~l~~v~g~~~l~~~q~Hm~~l~eral~~~s~tPavA~G~ 375 (563) T protein:vir:74 296 QGLGMYVTNASAPVDPNTGELTDWNIGPMQIVEIAGNRNDNYFERVSGVQDVSPFQDHMKWIDEKGIAEGSGTPEVAIGR 375 (563) T ss_pred cCCCeEEeccccccccccccccccccCCceeEeccCCccccceeeecchhhhHHHHHHHHHHHHHHHHhhccCcceeecc Confidence 9999999986543221 13569999999999776545444 556 123334444455554 33321 1111 Q ss_pred hhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHH---------HHHcCCcCCCCcc--eEEE Q lcl|NC_019408. 375 MPGASKSVSESNNQTVLREANEQSLLLNIIQACESGMTD--------VVRWW---------LMWRDVPLADTEN--LRYE 435 (612) Q Consensus 375 l~~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~--------~l~~~---------a~w~g~~~~~~~~--~~v~ 435 (612) ++. +..+|+.+-.+.-....|.+..-...+...+.+ .|..+ .+|.|.... ... ++|. 
T Consensus 376 vD~---~~~~SGiALeL~L~PL~a~~~ek~l~l~~~mr~~r~~~~~~lL~~~erl~~~g~~~~~~g~~~~-~~~~~v~iv 451 (563) T protein:vir:74 376 VDV---TSAESGISLELQLKPLLAANEEKELEMIVVMDQFLHDWMTMWLPAYESDFQEQDGSRPFASADL-LNECSVVCI 451 (563) T ss_pred ccc---ccccchhhhhhhhhHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhccccccccccc-CCceEEEEE Confidence 222 225676666665444443222111111222222 22222 234543221 112 2222 Q ss_pred eeccccccCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCccchhhhhHHHHHHhhccc---------c----------- Q lcl|NC_019408. 436 VNTDFLSTPIGAREMRAIQLMANDGLLPDPVFYEYMRKAEVISSDMTFEEFQALRADEN---------S----------- 495 (612) Q Consensus 436 ln~dF~~~~~d~~~~~al~~~~~~G~is~et~~~~lqr~~vl~~~~~~eee~~ria~e~---------~----------- 495 (612) . .+..+.+. .+-+..++.++++|.||++|.++.|.+.|..-++...+ ...+..+. . T Consensus 452 f-~p~~P~d~-~~vv~~~~tl~~aGiiSretAv~~L~~~g~~~pdae~e--~~~ie~~~i~~~~~a~a~ad~~~~~~a~~ 527 (563) T protein:vir:74 452 F-ADPMPVNK-TQVTQDTLLLQQAHLILRKMAVAKLRSIGWEYPEVDDQ--GNALTDDDIADMLLAEAEADASLGLSAMD 527 (563) T ss_pred e-CCCCCccH-HHHHHHHHHHHHcCchhHHHHHHHHHhCCCCCCcHHHH--HhhcCHHHHHHHHHHHhhccCcccceecc Confidence 2 23333322 44678899999999999999999999988765553221 11111110 0 Q ss_pred ----------ccccchhHH-----------hhhhhh Q lcl|NC_019408. 496 ----------FINNPDAQA-----------RQRGYT 510 (612) Q Consensus 496 ----------~~~~~~~~~-----------~~~~e~ 510 (612) -.++|..+= ....-+ T Consensus 528 ~~g~~~~~~dd~g~p~~~~~~~~~~~~~~~~~~~~~ 563 (563) T protein:vir:74 528 NGGAGEQQFDDQGNPIDQFGNPVEIPPDVTQVPLSP 563 (563) T ss_pred cCCCCcccccccCCchhHcCCcccCCccccccCCCC Confidence 001110000 000011 No 75 >protein:vir:93630 Length: 776 # NCBI annotation: Bcep22gp51 # Family: family:all:487 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944280;genbank:gi:38640357;genbank:GeneID:2658279 Probab=99.55 E-value=4.7e-13 Score=88.19 Aligned_cols=580 Identities=11% Similarity=0.036 Sum_probs=180.5 Q ss_pred CCCcH---HHHHHHHHHHHHHHHhcChHHHHh---cccccC--CCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhc Q lcl|NC_019408. 
1 MVTHP---EYQYWRPEWTKLRDVMAGQREIKR---KAEAYL--PAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFR 72 (612) Q Consensus 1 ~~~hP---~y~~~~~~W~~i~d~~~G~~~vr~---~g~~YL--Pk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~ 72 (612) +-.+= .+..++.. ++..+.....+|. .-..|. =+|+.+.....+.+-...+.+|.++.+|+.++|...+ T Consensus 38 ~~~~~~~~~~~~l~~~---~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~i~~v~g~~~~ 114 (776) T protein:vir:93 38 LDSEQAVELHSRLLSY---YRQELSRQQDNRAEMAVDEDYYDNIQWSQDEIDELKERGQAPTVYNVISQSVNWIIGSEKR 114 (776) T ss_pred CCCHHHHHHHHHHHHH---HHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHhcCCceEEecchHHHHHHHHHHHHh Confidence 11111 11111111 1112222222221 111222 1455555555666667778899999999999999998 Q ss_pred CCceeecCC---------HHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEE--EecCcchhhhhccCceEEE-ech Q lcl|NC_019408. 73 RDPIVKNLP---------PKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVL--VDVVDNPRKGAVATSFAVG-YSA 140 (612) Q Consensus 73 k~p~~~~~p---------~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vl--VD~p~a~~~~~~~rPy~~~-~~a 140 (612) ..+.+.-.| +.|..++..+ .+=++.+.-+..+|..++.+|.+|+= +||... .-|++.. +++ T Consensus 115 nr~~~~~~p~~~~d~~~Ae~l~~~~~~~-~~~~~~~~~~~~af~d~~~~G~G~~~v~~d~~~~------~~~~~~~~~~p 187 (776) T protein:vir:93 115 GRSDFKVLPRRKDGGKAAERKTALLKYL-SDVNHTPFERSMAFEETTKAGIGWLESQVQDEND------GEPIYAGAESW 187 (776) T ss_pred CCcceEEecCChhHHHHHHHHHHHHHHH-HHhhcHHHHHHHHHHHhhhcCcceEEEEeeccCC------CCceEeeccCh Confidence 877664233 2355555555 35577788899999999999987754 466432 2244433 344 Q ss_pred hhhhcchhhhc-cCCccceeEEEEEEEeeccc----cccC-----------CCcccccceeeeeeEeeecccccccceee Q lcl|NC_019408. 141 ENILDWDEVVD-MGGFYVPSRVLLREFVRDLR----WKSD-----------IEPLTTAQARKARAAALASGSASSPMVRQ 204 (612) Q Consensus 141 e~IinW~~~~~-v~g~~~Lt~v~l~E~v~~~~----~~~~-----------~d~f~~~~~~q~r~l~l~~g~~~~~~~~~ 204 (612) .+|+ |+.... .|.. ...++..+..+.... +.+. .+.+...+......+. ..+.. ..+.. 
T Consensus 188 ~~i~-~Dp~a~~~D~s-Dar~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~--~~~~~ 262 (776) T protein:vir:93 188 RNIL-WDSTYRRLDMD-DCRYIFRVKWVDLDVMLAIFPERAAQLRAAAVDNFETWGTDDIDGDDAMD-SPEYE--RSMNS 262 (776) T ss_pred hhee-eccccccCCHH-HHhhhhhhccCCHHHHHHhcCCchHHHHHhhhhcccccchhccccccccc-ccccc--ccccc Confidence 4433 322111 1111 122222221111100 0000 0000000000000000 00000 00000 Q ss_pred ccccccc--ccceeeeeeeeccccc----c-ccccccc-----------------------eeEEEEEeeCCCceeccee Q lcl|NC_019408. 205 TARTLGG--YSYITVYRELKLEEIE----W-PSGEVKL-----------------------AYVQYLYEEDPESRPIARI 254 (612) Q Consensus 205 ~~~~~~g--~~~~~~~R~~~~~~~~----~-~~g~~~~-----------------------~~~~~~~~~~~~~~~~~~~ 254 (612) ......+ ...++++-+....++. . ..+...+ ..+..++.. +..+.. T Consensus 263 ~~~~~~~~~~~~v~v~E~~~r~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~g~~~~~~~~~~~v~~~----~~~g~~ 338 (776) T protein:vir:93 263 VTAGAVAYARKRVRMIEAWFRMPVRVQRLKGRNSDFRGEVFDPNDERHVLEVESGRAVLAVSPMMRMHCA----IMTTRD 338 (776) T ss_pred ccccccccCCCeEEEEEEEEeeeeehhhcccccccccceeecccchHHHHHhhcCceeehheeeeeeEEE----EEecch Confidence 0000000 0011222111110000 0 0000000 000000000 000001 Q ss_pred eeccCCc--cccceeEEEeec-CCCCCCcCcCchHHHHHHHHH-HHhhhHHHHHHHHHhccceeeeecCCCCCCceEE-E Q lcl|NC_019408. 255 VPTVRGE--PLDFIPFKFFGA-SGNTADVEKPPLLDICDLNLS-HYRTYAELEYGRLFTALPVYYAPGTDSEGTGEYH-I 329 (612) Q Consensus 255 ~p~~~g~--~l~~IP~v~~~~-~~~~~~~~~pPLldLA~lnl~-HY~~~sD~~~~l~~~~~P~l~i~G~~~~~~~~l~-i 329 (612) +...+-. +.++||||++.. .......+..-...+.++.-. ++..+. +.++| ...++.+..|.-...++... 
+ T Consensus 339 ~l~~~~~p~~~~~~Pfv~~~~~~~~~~~~~~G~v~~~~d~Q~~~N~~~s~-~~~~l--~~~~~~~~~gav~~~d~~~~~~ 415 (776) T protein:vir:93 339 LMWAGPSPYRHNRYPFTPIWGFRRARDGMPYGVIRFMRGMQDDVNKRLSK-ALYIL--STNKVLMEEGAVDDIDEFRREA 415 (776) T ss_pred hhhccCCCCCCCccceEEecCceecccccccchHHhhhHHHHHHHHHHHH-HHHhh--cCCceeeccccccchHHHHHhc Confidence 1111112 346888885532 222222222223333333221 122222 33343 23455554553222111111 1 Q ss_pred -eccccccCCCCC--ceeEEecCchhHHHHHHHHHHHHHHHHHH-HHH-hhhccccchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 330 -GPNMVWEVPQGS--EPGILEYTGQGLKALETALNDKERQIAAI-GGR-MMPGASKSVSESNNQTVLREANEQSLLLNII 404 (612) Q Consensus 330 -G~~~~~~lp~~~--~~~~lE~~g~~l~~~~~~l~~~e~qm~~l-Ga~-ll~~~~~~~~esa~~~~~~~~~~~s~L~~~a 404 (612) =++.++.+-.|+ .+.+....+- ...+.+.|....+.|..+ |.. ....... ...|+.+......+..-.|..+. T Consensus 416 ~rp~~vi~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~i~~~tGi~~~~~G~~~-n~~Sg~ai~~~~~~~~~~~~~~~ 493 (776) T protein:vir:93 416 ARPDAVMTVKNGKLGAVKMDVDRDL-APAHLELASRSIQMIQQVGGVTDEMLGRTT-NAVSGVAIQARQEQGSVATNKLF 493 (776) T ss_pred ccCCceeeeCCccccccccccCcCc-cHHHHHHHHHHHHHHHHhhCcChHHhCCCc-chhhHHHHHHHHHHHHHHHHHHH Confidence 133444443333 3344333222 234555566666655443 321 1111121 23466666665555555677777 Q ss_pred HHHHHHHHH----HHHHHHHHcCCc------CCCCcceEEEeec----------ccccc--------CCCHHHHHHHHHH Q lcl|NC_019408. 405 QACESGMTD----VVRWWLMWRDVP------LADTENLRYEVNT----------DFLST--------PIGAREMRAIQLM 456 (612) Q Consensus 405 ~~~~~a~~~----~l~~~a~w~g~~------~~~~~~~~v~ln~----------dF~~~--------~~d~~~~~al~~~ 456 (612) .++..++.. +|.++..|++.. ..++..=.|.||. +|+.. 
....+...+|+++ T Consensus 494 dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~~~~~~~~~v~in~~~~~nd~~~~~~dv~v~~~~~~~s~r~~~~~~l~ql 573 (776) T protein:vir:93 494 DNLRLAFQQHGEKELSLIEQYMTEEKQFRITNSRGNPEYVTVNDGLPENDITRTKADFIIDEAEWRATMRQAAVAELMEV 573 (776) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCcceEEEEeecCCCcceEEecccchhhhhccceeeEEEeecccchhHHHHHHHHHHHH Confidence 777777665 445555554311 0000001122321 11111 0112233444444 Q ss_pred HHcCC--CCHHHHHHHHHhcCccchhhhhHHHHHHhhccccccccchhHHhhhhhhHHHHhHHHHHHHHHHHHHHHHHH- Q lcl|NC_019408. 457 ANDGL--LPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFINNPDAQARQRGYTNRGQELEQSRMAREADFTQQKID- 533 (612) Q Consensus 457 ~~~G~--is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~e~~r~~~e~e~~~q~~e- 533 (612) +.... +....+-..+.-.++.. .++...++....+ ...+.+.....++.++++ ..++..+ .+.+++... T Consensus 574 ~~~~~p~~~~~~~~~~~e~~d~p~----~~e~~~~l~~~~~-~~~p~q~~~~~e~~~~qq-~q~~~~q--~q~~~~~a~~ 645 (776) T protein:vir:93 574 IGKMPPEIALTMLDLLVENMDIPN----RDELVKRIRAVNG-QKDPDQDEPTPEEIAREQ-AQQQQQQ--YNDALAIATL 645 (776) T ss_pred HhhcChhhHHHHHHHHHHhcCccc----hHHHHHHHHHhhc-ccccchhhcchhHHHHHH-HhhHHHH--HHHHHhhhhh Confidence 43211 11111111111222211 1222233322111 111112221111111111 1111000 011111111 Q ss_pred ----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHH----HHHHHHHH---HHhhccccCCCc---h Q lcl|NC_019408. 534 ----IQERSVAVQEGHAEVAHAAGSTSISGSRKLGDPEQAKP-AVADQA----TIDNAKKQ---TANAAKVAAQPP---A 598 (612) Q Consensus 534 ----~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~e~q~k~-~~~eq~----~~~~~~k~---~~~~a~~~~~~~---~ 598 (612) .+.++++.++++.+++.++.+.+...++..+...+... .+..+. ..+.+..+ ........+++. . T Consensus 646 ~~~qa~a~~~~aea~~~~aqa~~~~~~a~~~~~~a~q~a~qa~~~~~~~~~~a~~a~~~~~~a~~~~p~~p~~~~~~~~~ 725 (776) T protein:vir:93 646 EEQQAKARKAAAEAQVAEAKAKHISRMAIREGVGAVKDATDAATAIAFMPELAGLSDGILRESGWDDPNTPQPASAASGM 725 (776) T ss_pred hHhhHHHHHHHHHHHHHhhhhhhhhhcchhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhccccccccccccccccccCC Confidence 11111111111111111111111111111000000000 000000 00000000 000000000000 0 Q ss_pred hhcCCCCCcccCCC Q lcl|NC_019408. 
599 PAAPGAPPTNRRPT 612 (612) Q Consensus 599 ~~~~~~~~~~~~~~ 612 (612) ..++..|.+...|. T Consensus 726 ~~~~~~p~~p~~p~ 739 (776) T protein:vir:93 726 PPAPAQPAQPANPA 739 (776) T ss_pred CCCCCCCCCCCCcC Confidence 00011111111111 No 76 >protein:vir:108295 Length: 711 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552284;genbank:gi:160700609;genbank:GeneID:5758811 Probab=99.53 E-value=2.7e-12 Score=83.99 Aligned_cols=552 Identities=11% Similarity=0.007 Sum_probs=188.9 Q ss_pred CCC------cHHHHHHHHHHHHHHHHhcChHHHHhc---ccccC--CCCCCCCHHHHHHHHhhccCCchHHHHHHHhhch Q lcl|NC_019408. 1 MVT------HPEYQYWRPEWTKLRDVMAGQREIKRK---AEAYL--PAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGM 69 (612) Q Consensus 1 ~~~------hP~y~~~~~~W~~i~d~~~G~~~vr~~---g~~YL--Pk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~ 69 (612) ..+ .-.+..+. ..+..+..+...++.. -..|. =+|+.+....-+.+-.-.+.+|.++++|+..+|. T Consensus 20 ~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g~ 96 (711) T protein:vir:10 20 VYAKNNDDDRALLATAR---ERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRPCLVNNVLPTFVDQVLGD 96 (711) T ss_pred hcccCcchHHHHHHHHH---HHHHHHHhhhHHHHHHHHHHHHHhCCCCCCHHHHHHHHhcCCCcEEEcchHHHHHHHhhh Confidence 000 00000000 0111111111111110 01121 1555555555566666688999999999999999 Q ss_pred hhcCCceeecCCH-------------------------------HHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEE- Q lcl|NC_019408. 70 VFRRDPIVKNLPP-------------------------------KFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGV- 117 (612) Q Consensus 70 vf~k~p~~~~~p~-------------------------------~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~v- 117 (612) -=+..|.+.-.|- .|..++.. 
..+-++.+.-+..+|..++..|.+|+ T Consensus 97 ~~~nr~~~~v~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~-~~~~~~~~~~~s~af~d~~~~G~G~~e 175 (711) T protein:vir:10 97 QRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKN-IEYNCDAETEYDIAFQGAVESGMGYLR 175 (711) T ss_pred HhhCCcceEEecccccchhhhhhhhccccccccCCChhHHHHHHHHHHHHHH-HHHhcChhHHHHHHHHHhhhcCcceEE Confidence 9998888754451 13333322 22334666678899999999998884 Q ss_pred -EEecCcchhhhhccCceEEEe-chhhhh-cchhhhccCCccceeEEEEEEEeeccc----cccC----------CCccc Q lcl|NC_019408. 118 -LVDVVDNPRKGAVATSFAVGY-SAENIL-DWDEVVDMGGFYVPSRVLLREFVRDLR----WKSD----------IEPLT 180 (612) Q Consensus 118 -lVD~p~a~~~~~~~rPy~~~~-~ae~Ii-nW~~~~~v~g~~~Lt~v~l~E~v~~~~----~~~~----------~d~f~ 180 (612) .+||-..+. ....+-+..| +|.+|+ ||.... .+.. ..-|+..+..+.... +.+. .+.+. T Consensus 176 v~~d~~~~d~--~~~e~~i~~v~~p~~v~~Dp~a~~-~D~s-Dar~~~~~~~~~~~~~~~~yp~~a~~~~~~~~~~~~~~ 251 (711) T protein:vir:10 176 VRSDYLADDS--FEQDLIIEAIQNQFSVTIDPDAKK-RDRS-DMNWCLIDDTMSKEKFKALYPDATAEPVYEDSVADYDT 251 (711) T ss_pred EEecccCCCC--CCCCeEEeeecChhheeeCccccc-cChh-hhcceeeeecCCHHHHHHhCCchhhhhhhcccccccCc Confidence 456622111 1123444455 466654 332211 1111 123333332211110 0000 00000 Q ss_pred ccceeeeeeEeeecccccccceeecccccccccceeeeeeeec--------ccccccccccccee-EEEEEeeCCCceec Q lcl|NC_019408. 181 TAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKL--------EEIEWPSGEVKLAY-VQYLYEEDPESRPI 251 (612) Q Consensus 181 ~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~--------~~~~~~~g~~~~~~-~~~~~~~~~~~~~~ 251 (612) ......+|+... -+...+.........|.. ..|..... ..+......+.... .+.++. |. 
T Consensus 252 ~~~~~~vrv~E~---~~r~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~--G~---- 320 (711) T protein:vir:10 252 WFTEKSVRVSEY---FTREPVIREIALLSDGRS--FWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKIT--GA---- 320 (711) T ss_pred ccCcceeeEEEE---EeeeeeeeEEEeecCCce--eccCcchhHHHHHHhcCchhhhhhhhceeeEEEEEEe--cc---- Confidence 000001111100 000000001111111100 00000000 00000000001111 111111 11 Q ss_pred ceeeeccCCcc--ccceeEEEeecCC-----CCCCcCcCchHHHHHHHH-HHHhhhHHHHHHHHHhccceeee-ecCCCC Q lcl|NC_019408. 252 ARIVPTVRGEP--LDFIPFKFFGASG-----NTADVEKPPLLDICDLNL-SHYRTYAELEYGRLFTALPVYYA-PGTDSE 322 (612) Q Consensus 252 ~~~~p~~~g~~--l~~IP~v~~~~~~-----~~~~~~~pPLldLA~lnl-~HY~~~sD~~~~l~~~~~P~l~i-~G~~~~ 322 (612) .+. .+..| -++||||+|.... .+...|. ..++-+..- ..+..| -+-+++..++.+.+++ .|.-.. T Consensus 321 --~~L-~~~~p~~~~~~P~vp~~g~r~~~d~~~~~~G~--vr~~~d~Qr~~N~~~s-~~~~~l~~~~~~~~~~~~gai~~ 394 (711) T protein:vir:10 321 --NVL-EGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSI--IRHSKDAQRMANYWDS-AATETVALAPKAPFIGSEGNVEG 394 (711) T ss_pred --eee-cCCCCCCCCcccEEEEeeeeeccccccccchh--hhhhhhhHHHHHHHHH-HHHHHHHhcCCCceeecCcccCC Confidence 111 22333 3568888653221 1111111 122222211 122332 3455555555555544 443221 Q ss_pred CCceEE---EeccccccCCCC----CceeEEecCchhHHHHHHHHHHHHHHHHH-HHHHhhhccccchhHHHHHHHHHHH Q lcl|NC_019408. 323 GTGEYH---IGPNMVWEVPQG----SEPGILEYTGQGLKALETALNDKERQIAA-IGGRMMPGASKSVSESNNQTVLREA 394 (612) Q Consensus 323 ~~~~l~---iG~~~~~~lp~~----~~~~~lE~~g~~l~~~~~~l~~~e~qm~~-lGa~ll~~~~~~~~esa~~~~~~~~ 394 (612) .++.+. .-++..+.+-+| +.+.+..+..-+ .....-|+.....|.. .|..-...+....+.|+.+...... 
T Consensus 395 ~~~~~~e~~~~~~~vi~~~~~~~~~~~~~~~~~~~~~-~~~~~ll~~~~~~i~~~tGi~~~~~G~~~n~~Sg~ai~~~q~ 473 (711) T protein:vir:10 395 REDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVP-AAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQR 473 (711) T ss_pred hHHHHHhccccCCCeeEecccccCcCCccccCCCCCC-HHHHHHHHHHHHHHHHHhCCChHHcCCCccchHHHHHHHHHH Confidence 111111 224444544323 246666544433 2345555555555533 3432211111122346666666666 Q ss_pred HHHHHHHHHHHHHHHHHHH----HHHHHHHHcC---------CcCCCCcceEEEeecc-------------------c-- Q lcl|NC_019408. 395 NEQSLLLNIIQACESGMTD----VVRWWLMWRD---------VPLADTENLRYEVNTD-------------------F-- 440 (612) Q Consensus 395 ~~~s~L~~~a~~~~~a~~~----~l~~~a~w~g---------~~~~~~~~~~v~ln~d-------------------F-- 440 (612) +..-.|..+..++..+... +|.++..|.. ++. +.+ .|.||.. | T Consensus 474 qg~~~l~~~~dn~~~~~~~~g~~ll~li~~~~~~er~~rI~ged~--~~~-~v~ln~~~~~~~~G~~~~~nDi~~g~~Dv 550 (711) T protein:vir:10 474 QGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDE--TED-FVKLNEQIFDEESGEWVTIHDLNVQKYDV 550 (711) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEEecCCC--Ccc-eEEecccccccccccceeeeccceeeeEE Confidence 6555667777777766554 5556666652 210 111 1223211 1 Q ss_pred cc------cCCCHHHHHHHHHHHHcCCCCHH---HHHHHHHhcCccchhhhhHHHHHHhhccccccccchhHHhhhhhhH Q lcl|NC_019408. 441 LS------TPIGAREMRAIQLMANDGLLPDP---VFYEYMRKAEVISSDMTFEEFQALRADENSFINNPDAQARQRGYTN 511 (612) Q Consensus 441 ~~------~~~d~~~~~al~~~~~~G~is~e---t~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~ 511 (612) .+ .....+.+.+|+++.. .++.- .+-..+.-.++. ..++...++....+.....+....+ . T Consensus 551 ~i~~~p~~~s~r~~~~~~l~ql~~--~~p~~~~~~~~~il~~~d~p----~~~el~e~lr~~~~~~~~~~~~~~~----~ 620 (711) T protein:vir:10 551 VVTTGPAFATQRIEAAEAMIQFAQ--AVPSAAAVMADLIAQNMDWP----GADVIAERLKKIVPPNVLSKDEREA----I 620 (711) T ss_pred EEeeccCchhHHHHHHHHHHHHHh--hcchhhhHHHHHHHHhcCCC----CHHHHHHHHHhhcCcccCcchhhhH----H Confidence 10 0111222334444432 22211 111112112221 1233344444332221111111100 0 Q ss_pred HHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHH--HHHHHHHHHH-------HHHHHHHHH Q lcl|NC_019408. 
512 RGQELEQSRMAREADFTQQKIDIQERSVAVQEGHAEVAHAAGSTSIS--GSR--KLGDPEQAKP-------AVADQATID 580 (612) Q Consensus 512 r~~~~e~~r~~~e~e~~~q~~e~~~r~~~~~~~r~~~e~~~~~~~~~--~~r--~~~~e~q~k~-------~~~eq~~~~ 580 (612) .+...+++.+..+.+.++++++....+++.+..+.++++.+.+.+.. .++ ......|... ++.++.+.+ T Consensus 621 qq~~~e~qq~~~~~q~~~~~~q~~~~qa~ae~~~Aqae~~qa~~e~~~~q~q~~~~~~~aq~~~~~~qq~~~~l~~~qae 700 (711) T protein:vir:10 621 EEDMPEQTEPTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQLETEEAQKQLAMIEDMAQGGDVVYQQVRELVAQALAE 700 (711) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 01111111111111212222211111222222222222211111110 000 0000111111 111222223 Q ss_pred HHHHHHHhhcc Q lcl|NC_019408. 581 NAKKQTANAAK 591 (612) Q Consensus 581 ~~~k~~~~~a~ 591 (612) .+..|+....+ T Consensus 701 lq~~q~~~~q~ 711 (711) T protein:vir:10 701 ITASQANVTEQ 711 (711) T ss_pred HHHHHHHhhcC Confidence 33333333333 No 77 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=99.47 E-value=9.3e-12 Score=81.06 Aligned_cols=436 Identities=11% Similarity=0.065 Sum_probs=200.6 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeecC Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKNL 80 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~~ 80 (612) +-..|++.+....| +..|.|..- . +..+...+ ..+.+...+ .|+.+.+++.+++++|..+|++. 
+ T Consensus 32 i~~~~~~~~~i~~~---~~~Y~g~~~----~---~~~~~~~~--~~~~~~~~s--lnl~~~i~~~~A~lv~~e~~~i~-~ 96 (500) T protein:vir:30 32 IAISKLEYDRITTN---LKYYKSDWD----S---VLYLNTDG--ETKKRDLNH--LPIARTAAKKIASLVFNEQAEIK-V 96 (500) T ss_pred ccCCHHHHHHHHHH---HHHhcCCCC----C---cccccCCC--CcccCceee--cchHHHHHHHHhhhhcCCcceEe-c Confidence 55566666666666 446666311 1 11111111 111111222 49999999999999999999984 4 Q ss_pred C-HHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhcchhhhccCCcccee Q lcl|NC_019408. 81 P-PKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILDWDEVVDMGGFYVPS 159 (612) Q Consensus 81 p-~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW~~~~~v~g~~~Lt 159 (612) + +.+..+++++=. .+++...+..++..++..|.+++-+=+- +.+|-|..++|++++-+.. ++..... T Consensus 97 ~d~~~~~~l~~il~-~n~f~~~~~~~~e~a~a~G~~~~k~~~d-------~~~~~I~~v~ad~~~P~~~----d~~~~~~ 164 (500) T protein:vir:30 97 DDDAANEFISETLK-NDRFNKNFERYLESCLALGGLAMRPYVD-------GDKVRVAFVQAPVFLPLQS----NTQDVSS 164 (500) T ss_pred CChHHHHHHHHHHh-hccHHHHHHHHHHHHhhcCCEEEEEEEe-------CCceEEEEEcCCeeEEEEE----cCCCeEE Confidence 4 445555554422 2678888999999999999777654321 2468899999999875311 3333445 Q ss_pred EEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccce--eeeeeeecccccccccccccee Q lcl|NC_019408. 160 RVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYI--TVYRELKLEEIEWPSGEVKLAY 237 (612) Q Consensus 160 ~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~--~~~R~~~~~~~~~~~g~~~~~~ 237 (612) .+++.+++...+. ....|+.+..-.+. .++.+.+ .+||.... ...+. .+. 
T Consensus 165 ~a~~~~~~~~~~~----------~~~~yt~lE~h~~~------------~~~~~~I~n~ly~~~~~--~~lG~-~v~--- 216 (500) T protein:vir:30 165 AAVVIKSVKTING----------KEVYYTLIEFHEWQ------------SSDDYVISNELYRSDDK--AKVGS-RVP--- 216 (500) T ss_pred EEEEEEEeeeecC----------CceEEEEEEEEEEe------------CCceeEEEEEEEecccc--cccCc-ccc--- Confidence 5555554432110 11234444332111 0111111 12221100 00000 000 Q ss_pred EEEEEeeCCCceecceeeeccCCccccceeEEEeec-CCCCCC----cCcC------chHHHHHHHHHHHhhhHHHHHHH Q lcl|NC_019408. 238 VQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFFGA-SGNTAD----VEKP------PLLDICDLNLSHYRTYAELEYGR 306 (612) Q Consensus 238 ~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~-~~~~~~----~~~p------PLldLA~lnl~HY~~~sD~~~~l 306 (612) ...+|.. .. .+.+. ..++..||+++-. ..|.-. .|.+ +|++-.+..+..|. -+++.+= T Consensus 217 l~~~~~~---l~--~~~~~----~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~--~e~~~g~ 285 (500) T protein:vir:30 217 LSEVYKD---LK--DEAKV----TDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFM--WEVKMGQ 285 (500) T ss_pred cccccCC---cC--cceEe----ccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHH--HHHHhCc Confidence 0011110 00 01111 1133444554421 111111 1222 33333333333332 2333222 Q ss_pred HHhccceeeeecC-CCCCCceE---EEecc-c-c--ccCCCCCceeEEecCchh-HHHHHHHHHHHHHHHHH-HH--HHh Q lcl|NC_019408. 307 LFTALPVYYAPGT-DSEGTGEY---HIGPN-M-V--WEVPQGSEPGILEYTGQG-LKALETALNDKERQIAA-IG--GRM 374 (612) Q Consensus 307 ~~~~~P~l~i~G~-~~~~~~~l---~iG~~-~-~--~~lp~~~~~~~lE~~g~~-l~~~~~~l~~~e~qm~~-lG--a~l 374 (612) +..-+|.-++... +....+.+ ..... . . +....++.-++-++++.- .+.+...|+.+-.++.. .| ... 
T Consensus 286 ~~i~v~~~~l~~~~~~~~g~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~ 365 (500) T protein:vir:30 286 RRVAVPESLTALTVRTTDGDVVPRPRFESDQNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGL 365 (500) T ss_pred ceeeechHHhcccCCCCCccccCCcccCCCcceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccc Confidence 2222232222211 00001100 00000 0 1 111222222333333332 23445555555554432 22 111 Q ss_pred hhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcC---C---cCCCCcceEEEeeccccccCCCHH Q lcl|NC_019408. 375 MPGASKSVSESNNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRD---V---PLADTENLRYEVNTDFLSTPIGAR 448 (612) Q Consensus 375 l~~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g---~---~~~~~~~~~v~ln~dF~~~~~d~~ 448 (612) +.. ...+.+||++.....+........+...++.++.++++.++.+.. . ......++.|..+ |....+ ... T Consensus 366 ~~~-~~~g~~TAtei~s~~~~~~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~-d~i~~d-~~~ 442 (500) T protein:vir:30 366 FSF-DGKSMKTATEIVSENSDTYQMRNSIVALVEQSLKELVISIFEIAKAYDLYQSEVPSMDNISISLD-DGVFTD-RDA 442 (500) T ss_pred ccc-CcCccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEeC-CCCCCC-HHH Confidence 111 123456788877777777777888888899999998887765421 1 1112223444443 212221 245 Q ss_pred HHHHHHHHHHcCCCCHHHHHHHHHhcCccchhhhhHHHHHHhhcccc-ccccchhHHhhhhh Q lcl|NC_019408. 449 EMRAIQLMANDGLLPDPVFYEYMRKAEVISSDMTFEEFQALRADENS-FINNPDAQARQRGY 509 (612) Q Consensus 449 ~~~al~~~~~~G~is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~-~~~~~~~~~~~~~e 509 (612) ++..+.+++.+|.||+++++.. ..|+ .+-+.+++..++.+|.. 
..+..+....--+| T Consensus 443 ~~~~~~~~v~aGi~s~~~~i~~--~~g~--~eeea~~~l~~i~~E~~~~~~~~~~~~~~~g~ 500 (500) T protein:vir:30 443 ELDYWIKVVNAGFGTREMAIQK--VLNV--TEEKAQEIAAEINTGIVDEINQQRTDTHLYGE 500 (500) T ss_pred HHHHHHHHHHcCCCCHHHHHHh--cCCC--CHHHHHHHHHHHHHhccccCCCCCccccccCC Confidence 6788999999999999987643 3455 22244555666655432 33333333222223 No 78 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=99.47 E-value=9.3e-12 Score=81.06 Aligned_cols=436 Identities=11% Similarity=0.065 Sum_probs=200.6 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeecC Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKNL 80 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~~ 80 (612) +-..|++.+....| +..|.|..- . +..+...+ ..+.+...+ .|+.+.+++.+++++|..+|++. + T Consensus 32 i~~~~~~~~~i~~~---~~~Y~g~~~----~---~~~~~~~~--~~~~~~~~s--lnl~~~i~~~~A~lv~~e~~~i~-~ 96 (500) T protein:vir:98 32 IAISKLEYDRITTN---LKYYKSDWD----S---VLYLNTDG--ETKKRDLNH--LPIARTAAKKIASLVFNEQAEIK-V 96 (500) T ss_pred ccCCHHHHHHHHHH---HHHhcCCCC----C---cccccCCC--CcccCceee--cchHHHHHHHHhhhhcCCcceEe-c Confidence 55566666666666 446666311 1 11111111 111111222 49999999999999999999984 4 Q ss_pred C-HHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhcchhhhccCCcccee Q lcl|NC_019408. 81 P-PKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILDWDEVVDMGGFYVPS 159 (612) Q Consensus 81 p-~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW~~~~~v~g~~~Lt 159 (612) + +.+..+++++=. .+++...+..++..++..|.+++-+=+- +.+|-|..++|++++-+.. ++..... 
T Consensus 97 ~d~~~~~~l~~il~-~n~f~~~~~~~~e~a~a~G~~~~k~~~d-------~~~~~I~~v~ad~~~P~~~----d~~~~~~ 164 (500) T protein:vir:98 97 DDDAANEFISETLK-NDRFNKNFERYLESCLALGGLAMRPYVD-------GDKVRVAFVQAPVFLPLQS----NTQDVSS 164 (500) T ss_pred CChHHHHHHHHHHh-hccHHHHHHHHHHHHhhcCCEEEEEEEe-------CCceEEEEEcCCeeEEEEE----cCCCeEE Confidence 4 445555554422 2678888999999999999777654321 2468899999999875311 3333445 Q ss_pred EEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccce--eeeeeeecccccccccccccee Q lcl|NC_019408. 160 RVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYI--TVYRELKLEEIEWPSGEVKLAY 237 (612) Q Consensus 160 ~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~--~~~R~~~~~~~~~~~g~~~~~~ 237 (612) .+++.+++...+. ....|+.+..-.+. .++.+.+ .+||.... ...+. .+. T Consensus 165 ~a~~~~~~~~~~~----------~~~~yt~lE~h~~~------------~~~~~~I~n~ly~~~~~--~~lG~-~v~--- 216 (500) T protein:vir:98 165 AAVVIKSVKTING----------KEVYYTLIEFHEWQ------------SSDDYVISNELYRSDDK--AKVGS-RVP--- 216 (500) T ss_pred EEEEEEEeeeecC----------CceEEEEEEEEEEe------------CCceeEEEEEEEecccc--cccCc-ccc--- Confidence 5555554432110 11234444332111 0111111 12221100 00000 000 Q ss_pred EEEEEeeCCCceecceeeeccCCccccceeEEEeec-CCCCCC----cCcC------chHHHHHHHHHHHhhhHHHHHHH Q lcl|NC_019408. 238 VQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFFGA-SGNTAD----VEKP------PLLDICDLNLSHYRTYAELEYGR 306 (612) Q Consensus 238 ~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~-~~~~~~----~~~p------PLldLA~lnl~HY~~~sD~~~~l 306 (612) ...+|.. .. .+.+. ..++..||+++-. ..|.-. .|.+ +|++-.+..+..|. 
-+++.+= T Consensus 217 l~~~~~~---l~--~~~~~----~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~--~e~~~g~ 285 (500) T protein:vir:98 217 LSEVYKD---LK--DEAKV----TDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFM--WEVKMGQ 285 (500) T ss_pred cccccCC---cC--cceEe----ccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHH--HHHHhCc Confidence 0011110 00 01111 1133444554421 111111 1222 33333333333332 2333222 Q ss_pred HHhccceeeeecC-CCCCCceE---EEecc-c-c--ccCCCCCceeEEecCchh-HHHHHHHHHHHHHHHHH-HH--HHh Q lcl|NC_019408. 307 LFTALPVYYAPGT-DSEGTGEY---HIGPN-M-V--WEVPQGSEPGILEYTGQG-LKALETALNDKERQIAA-IG--GRM 374 (612) Q Consensus 307 ~~~~~P~l~i~G~-~~~~~~~l---~iG~~-~-~--~~lp~~~~~~~lE~~g~~-l~~~~~~l~~~e~qm~~-lG--a~l 374 (612) +..-+|.-++... +....+.+ ..... . . +....++.-++-++++.- .+.+...|+.+-.++.. .| ... T Consensus 286 ~~i~v~~~~l~~~~~~~~g~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~ 365 (500) T protein:vir:98 286 RRVAVPESLTALTVRTTDGDVVPRPRFESDQNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGL 365 (500) T ss_pred ceeeechHHhcccCCCCCccccCCcccCCCcceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccc Confidence 2222232222211 00001100 00000 0 1 111222222333333332 23445555555554432 22 111 Q ss_pred hhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcC---C---cCCCCcceEEEeeccccccCCCHH Q lcl|NC_019408. 375 MPGASKSVSESNNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRD---V---PLADTENLRYEVNTDFLSTPIGAR 448 (612) Q Consensus 375 l~~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g---~---~~~~~~~~~v~ln~dF~~~~~d~~ 448 (612) +.. ...+.+||++.....+........+...++.++.++++.++.+.. . ......++.|..+ |....+ ... 
T Consensus 366 ~~~-~~~g~~TAtei~s~~~~~~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~-d~i~~d-~~~ 442 (500) T protein:vir:98 366 FSF-DGKSMKTATEIVSENSDTYQMRNSIVALVEQSLKELVISIFEIAKAYDLYQSEVPSMDNISISLD-DGVFTD-RDA 442 (500) T ss_pred ccc-CcCccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEeC-CCCCCC-HHH Confidence 111 123456788877777777777888888899999998887765421 1 1112223444443 212221 245 Q ss_pred HHHHHHHHHHcCCCCHHHHHHHHHhcCccchhhhhHHHHHHhhcccc-ccccchhHHhhhhh Q lcl|NC_019408. 449 EMRAIQLMANDGLLPDPVFYEYMRKAEVISSDMTFEEFQALRADENS-FINNPDAQARQRGY 509 (612) Q Consensus 449 ~~~al~~~~~~G~is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~-~~~~~~~~~~~~~e 509 (612) ++..+.+++.+|.||+++++.. ..|+ .+-+.+++..++.+|.. ..+..+....--+| T Consensus 443 ~~~~~~~~v~aGi~s~~~~i~~--~~g~--~eeea~~~l~~i~~E~~~~~~~~~~~~~~~g~ 500 (500) T protein:vir:98 443 ELDYWIKVVNAGFGTREMAIQK--VLNV--TEEKAQEIAAEINTGIVDEINQQRTDTHLYGE 500 (500) T ss_pred HHHHHHHHHHcCCCCHHHHHHh--cCCC--CHHHHHHHHHHHHHhccccCCCCCccccccCC Confidence 6788999999999999987643 3455 22244555666655432 33333333222223 No 79 >protein:vir:80165 Length: 651 # NCBI annotation: portal protein # Family: family:all:1548 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285799;genbank:gi:148747833;genbank:GeneID:5220441 Probab=99.43 E-value=1.8e-11 Score=79.45 Aligned_cols=535 Identities=8% Similarity=-0.026 Sum_probs=201.0 Q ss_pred CCCcHHHH-HHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcC----Cc Q lcl|NC_019408. 1 MVTHPEYQ-YWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRR----DP 75 (612) Q Consensus 1 ~~~hP~y~-~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k----~p 75 (612) +-.|.++. ...++|.-..+++.+.......--...++....+. ..-...++.|.+..+++.++..+++. +. 
T Consensus 29 ~~~~~~~r~~~~~~w~~~~~~~~~~~~~~~y~~~~~~~~~~~~~----~~~rs~~~~~~v~~~ve~~~~~l~~~~~~~~~ 104 (651) T protein:vir:80 29 YKRFCDARQVCEETWLEAWGMYLSTPEAQDYLRDQVLRSVGDVN----ADWRHKITTGKAFEAIETIHAYLMSATFPNKN 104 (651) T ss_pred HHHHHHHhhhhhhhHHHHHHhhcccHHHHHhhccccccccCCCC----CCCCccccChhHHHHHHHHHHHHHHhhcCCCc Confidence 23333332 22344555555554432111100001111111111 11123588999999998776665553 33 Q ss_pred eeecCC-----H------HHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCc----------chhhhh----- Q lcl|NC_019408. 76 IVKNLP-----P------KFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVD----------NPRKGA----- 129 (612) Q Consensus 76 ~~~~~p-----~------~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~----------a~~~~~----- 129 (612) .++-.| + .+..++.+- ..-.++..++..++..++.+|.+.+=|-+-. .+..-. T Consensus 105 ~~~~~p~~~~d~a~~~~~~~~~~~~~~-l~~~~~~~~~~~~~~d~l~~G~~i~kv~we~~~~~~~~~~~~~~~~~~~~~~ 183 (651) T protein:vir:80 105 WFDVVPAKPGQDNLLVSRLIKRYVQDK-LTEGKFRAAYANFLRQLLITGNSVLALPWRVETAEVKKKVQVRTPLFEDEPT 183 (651) T ss_pred eeEeccCCchhHHHHHHHHHHHHHHHH-hhccCcHHHHHHHHHhhcccCceEEEEeecceeeeeehheeccccccccccc Confidence 333122 1 144444421 1344588888899999999998766443211 000011 Q ss_pred ----------ccCceEEEechhhhhcchhhhccCCccceeEEEEEEEeeccccccCCCccccccee--eeeeEeeecccc Q lcl|NC_019408. 130 ----------VATSFAVGYSAENILDWDEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQAR--KARAAALASGSA 197 (612) Q Consensus 130 ----------~~rPy~~~~~ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~--q~r~l~l~~g~~ 197 (612) ...|.+..++|.+++ |+. +..+-....+|+-+..+...-...-.+++...... .......+.... 
T Consensus 184 ~~v~~~~~~~~~~~~i~~v~p~~~~-~dp--~a~~~~d~~~v~~~~~t~~~l~~l~~~g~~~~~~~~~~~~~~~~~~~~~ 260 (651) T protein:vir:80 184 FEVVSEEREVKSSPDFEVLDMFDCF-YDP--NVTDPNRGAFIRKLTKTKADILNLLSEGYYYGVDPLDVVEHKCKDTSDT 260 (651) T ss_pred eeeeccceeeeceeEEEEecHHHee-ecC--CCcCccccceeeeeeeeHHHHHHHHhcccccchhhHHHHhhhccccccC Confidence 124555566666655 432 22233333332211110000000000000000000 000000000000 Q ss_pred cccceee-cccccccc---cceeeeeeeeccccccccccccceeEEEEEeeCCCceecceeeeccCCc-cccceeEEEee Q lcl|NC_019408. 198 SSPMVRQ-TARTLGGY---SYITVYRELKLEEIEWPSGEVKLAYVQYLYEEDPESRPIARIVPTVRGE-PLDFIPFKFFG 272 (612) Q Consensus 198 ~~~~~~~-~~~~~~g~---~~~~~~R~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g~-~l~~IP~v~~~ 272 (612) +...... ......|+ ..+.+|-.. +. .+++.. . .+.++....+ ..+....++ +++..||+++. T Consensus 261 ~~~~~~~~~~~d~~~~~~~~~v~v~E~~----~~-~d~e~~-~-~~~~~v~~~g-----~~il~~~~~~~~~~~Pf~~~~ 328 (651) T protein:vir:80 261 KQDMLSTFQGVTTSLWSPHQNVELLEYW----GD-IHLENK-T-YHDVVVTIMG-----NEVLRFEQNPYWCGRPFVIGT 328 (651) T ss_pred CccccccccCCCccccccccceEEEEEE----EE-eeccCC-c-eEEEEEEEcC-----cEEecccccCCCCCCCeeeec Confidence 0000000 00000000 001111100 00 000000 0 1112211111 122222223 34455776543 Q ss_pred cC-CCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhccceeeee-c-CCCCCCceEEEeccccccCCCCCceeEEecC Q lcl|NC_019408. 273 AS-GNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALPVYYAP-G-TDSEGTGEYHIGPNMVWEVPQGSEPGILEYT 349 (612) Q Consensus 273 ~~-~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i~-G-~~~~~~~~l~iG~~~~~~lp~~~~~~~lE~~ 349 (612) .. ..+...|..|...+....-..=......-+.+..+..|...+. + +.. .+++..+++.++.+...+++..+.+. 
T Consensus 329 ~~~~~~~~yG~g~~~~~~~~q~~ln~l~~~~ld~~~~~~~~~~~v~~d~~~~--~~~l~~~pg~vi~~~~~~~~~~l~~~ 406 (651) T protein:vir:80 329 YIPTARQPYAMGALQPNLGMLHELNIITNQRLDNLELAIDQMYTLRSDGLLQ--PEDVYTEPGKVFLVSDHGDLQPLANQ 406 (651) T ss_pred ceecCccccCCChHHHHhHHHHHHHHHHHHHHHHHHHHhCCcEEecCCcccc--HHHhhcCCCceEEecCCCCceeeccC Confidence 22 3344557777766665554444455556677888888887664 2 222 23466788888877777778888876 Q ss_pred chhHHHHHHHHHHHHHHHHHH-HHHh-hhcccc--chhHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHH Q lcl|NC_019408. 350 GQGLKALETALNDKERQIAAI-GGRM-MPGASK--SVSESNNQTVLREANEQSLLLNIIQACESG-----MTDVVRWWLM 420 (612) Q Consensus 350 g~~l~~~~~~l~~~e~qm~~l-Ga~l-l~~~~~--~~~esa~~~~~~~~~~~s~L~~~a~~~~~a-----~~~~l~~~a~ 420 (612) ...+......|..++..|..+ |... ....+. ....||+..+.........|..+++++... +..+|.++.+ T Consensus 407 ~~~~~~~~~~l~~l~~~~~~~~gv~~~~~g~~~~~~~~~TAteI~~~~~~~~~~l~~v~~~l~~e~l~pl~~r~l~l~~~ 486 (651) T protein:vir:80 407 SSNFSITYQESSFLESTIDKNFGTGNYVGANAARSGERVTAAEVAAVREAGGNRLSGIHKHIEETSLLVLLEKVMHLVQQ 486 (651) T ss_pred cccchhHHHHHHHHHHHHHHHhcCChHHhCCCccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 666666677788888877543 4422 211111 124578888888888888899999999874 3456666665 Q ss_pred HcCCcC----CCC---cceEE-----EeeccccccCCCH-------HHHHHHHHHHHcCC----CCH-----HHHHHHHH Q lcl|NC_019408. 421 WRDVPL----ADT---ENLRY-----EVNTDFLSTPIGA-------REMRAIQLMANDGL----LPD-----PVFYEYMR 472 (612) Q Consensus 421 w~g~~~----~~~---~~~~v-----~ln~dF~~~~~d~-------~~~~al~~~~~~G~----is~-----et~~~~lq 472 (612) +.-.+. .+. ....+ .+..+|....+.+ +.++.++.+++.+. +.. ..+...+. 
T Consensus 487 ~~~~~~~~ri~~~~~~~~~~~~i~~~dl~~~~~iv~~g~~~~~~r~~~~~~l~~~~q~~~~~p~~~~~~~~~~~~~~l~~ 566 (651) T protein:vir:80 487 FTDQPGMVRVAGDEAGAYEYYELDVEDLQKEVRLVPIGSDHVIERKQYIEDRLTFIQAVAQVPEMGQLVDYKRILVDLLQ 566 (651) T ss_pred hcCcccceeecccccccccccccCccceeeeeeeeeccHHHHHHHHHHHHHHHHHHHhhccCCccchhhhHHHHHHHHHH Confidence 532110 000 00000 1222333322222 12333444443221 111 00111111 Q ss_pred hcCccchhhhhHHHHHHhhccccccccchhHHhhhhhhHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 473 KAEVISSDMTFEEFQALRADENSFINNPDAQARQRGYTNRGQELEQSRMAREADFTQQKIDIQERSVAVQEGHAEVAHAA 552 (612) Q Consensus 473 r~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~e~~r~~~e~e~~~q~~e~~~r~~~~~~~r~~~e~~~ 552 (612) ..|+-++ ...+..+++++.... ++....+.+....+.+.+.++++..+.+.....++.+..+ T Consensus 567 ~~g~~~~--------------~~~l~~~~q~~~~~~----~~~~~~q~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~ 628 (651) T protein:vir:80 567 HWGFEEP--------------EAYLKQQDQQAPANP----QEALLSQAKDVGGQAMSNMLQNQLQADGGTQMMSEMYGTP 628 (651) T ss_pred HcCCCCc--------------HHhcCCCccchhhhh----hHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1222111 001111111111100 1111111111111111111111111111111111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_019408. 553 GSTSISGSRKLGDPEQAKPAVADQATIDNAKKQTANAA 590 (612) Q Consensus 553 ~~~~~~~~r~~~~e~q~k~~~~eq~~~~~~~k~~~~~a 590 (612) .+.+.+.+ +.+.+... |++++.. T Consensus 629 ~~~~~~~~-~~~~~~~l--------------~~~~~~~ 651 (651) T protein:vir:80 629 NADQMQQE-LMATTPNV--------------SEQQLTQ 651 (651) T ss_pred HHHHHHHH-HHHHHHHH--------------HHhhccC Confidence 11111111 00111111 1222211 No 80 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=99.43 E-value=1.9e-11 Score=79.42 Aligned_cols=435 Identities=11% Similarity=0.022 Sum_probs=196.4 Q ss_pred CCC------cHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCC Q lcl|NC_019408. 
1 MVT------HPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRD 74 (612) Q Consensus 1 ~~~------hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~ 74 (612) ... -|++......|.. +|.|...+ |.....+ ...+.|.. .=.|+.+.+++.++++||..+ T Consensus 26 ~~~~~~i~~~~~~~~~I~~w~~---~Y~g~~~~-------~~~~~~~--~~~~~~~~--~sl~~~~~i~~~~A~Ll~~e~ 91 (517) T protein:vir:98 26 INDHEKINIDPNELARIERNLR---QYEGDYPQ-------VEYINSQ--GKIQERDY--MTLNLRKLSADVLSGLVFNEQ 91 (517) T ss_pred hhcCCceecCHHHHHHHHHHHH---HhcCCCcc-------ccccccc--ccccccce--eecCcHHHHHHHhhhhhcCCc Confidence 223 4556666666743 46664321 1100111 01111111 124899999999999999999 Q ss_pred ceeecCCH------------HHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhh Q lcl|NC_019408. 75 PIVKNLPP------------KFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAEN 142 (612) Q Consensus 75 p~~~~~p~------------~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~ 142 (612) ++|. +++ ....+++.+ .+-+++...+...+..++..|-+++-+=+. +.++-|..++|.+ T Consensus 92 ~~i~-v~d~~~~~~~~~~~~~~~e~l~~i-~~~n~f~~~~~~~~e~a~a~G~~a~k~~~d-------~~~~~I~~v~ad~ 162 (517) T protein:vir:98 92 CEVY-VSDAKDEEKKDNSFKTAHEFIQHV-FQHNKFIKNLSDYLEPTFALGGLTVRPYVD-------NGEIEFSWALANA 162 (517) T ss_pred ceEE-ecccccccccccchhHHHHHHHHH-HHhccHHHHHHHHHHHHhhhCCEEEEEEEe-------CCeeEEEEEcCCe Confidence 9983 442 122233222 123678888999999999998666643221 2356688889999 Q ss_pred hhcchhhhccCCccceeEEEEE-EEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeee Q lcl|NC_019408. 143 ILDWDEVVDMGGFYVPSRVLLR-EFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYREL 221 (612) Q Consensus 143 IinW~~~~~v~g~~~Lt~v~l~-E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~ 221 (612) ++-+.. +... .+..+|. ++....+ .....|+.|....+.- . 
....|.|.+ T Consensus 163 ~~Pl~~----~~~~-v~~~ai~~~~~~~~~----------~~~~~Yt~lE~H~~~~-------~-~~~~~~y~I------ 213 (517) T protein:vir:98 163 FYPLRS----NSNG-ISEGVMKSVTTKVIG----------NKTVYYTLLEFHEWEK-------T-EEGESLYVI------ 213 (517) T ss_pred eEEEEe----cCCC-eEEEEEEEEEEEeec----------CCceEEEEEEEEecCc-------e-eccCCcEEE------ Confidence 875421 2222 2333332 2211100 0112333333221100 0 000011111 Q ss_pred eccccccccccccceeEEEEEeeCCCceecc--------e-eeeccCCccccceeEEEeec-CCC----CCCcCcCchH- Q lcl|NC_019408. 222 KLEEIEWPSGEVKLAYVQYLYEEDPESRPIA--------R-IVPTVRGEPLDFIPFKFFGA-SGN----TADVEKPPLL- 286 (612) Q Consensus 222 ~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~--------~-~~p~~~g~~l~~IP~v~~~~-~~~----~~~~~~pPLl- 286 (612) ...+|..+.....+. + ..|..--+.++.-+|+++-. ..| ....|.+-|. T Consensus 214 ----------------~n~ly~s~~~~~lG~~v~L~~~~e~l~~~~~~~g~~~Plf~y~~~p~~N~~~~~splG~S~~~~ 277 (517) T protein:vir:98 214 ----------------TNELYKSDNEGEIGKRIPLEELYEGMQEKTYIQGLSRPLFNYLKPSGFNNINPHSPLGLGITDN 277 (517) T ss_pred ----------------EEEEEecCCCccccccccccccccCCCcceeECCCCcceEEEecCCcccccccCCCCCCchhhh Confidence 112222111100000 0 00000000122222433311 111 1112333332 Q ss_pred ---HHHHHHHHHHhhhHHHHHHHHHhccceeeeecCCCCCCceEEEecccc----------ccCCCCCceeEEecCchh- Q lcl|NC_019408. 287 ---DICDLNLSHYRTYAELEYGRLFTALPVYYAPGTDSEGTGEYHIGPNMV----------WEVPQGSEPGILEYTGQG- 352 (612) Q Consensus 287 ---dLA~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~~~~~~~~l~iG~~~~----------~~lp~~~~~~~lE~~g~~- 352 (612) -|-.||..+-+-.-+++-+=+..-+|.-++.-..+.. . .++... +..+. ++.+|-++++.- T Consensus 278 a~~~~d~lD~~~s~~~~e~~~g~~~i~vp~~~l~~~~~~~--g--~~~~~~~d~~~~~y~~~~~~~-~~~~i~~~~~~iR 352 (517) T protein:vir:98 278 SVSTLKKINDTYDQFWWEIKMGQRTVFVSDVMLRTVPDES--G--MPPPQVFDPDVNVYKSIRMGT-DEEFVKDVTHDIR 352 (517) T ss_pred hHHHHHHHHHHHHHHHHHHHhCCcceecChhhhccccCCC--C--cccCCCCCcccceeeeccCCC-CCCceeeeccccc Confidence 2234454444444444444443444444442111110 0 111111 22222 334555566543 Q ss_pred HHHHHHHHHHHHHHHHH-H--HHHhhhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcC------ Q lcl|NC_019408. 
353 LKALETALNDKERQIAA-I--GGRMMPGASKSVSESNNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRD------ 423 (612) Q Consensus 353 l~~~~~~l~~~e~qm~~-l--Ga~ll~~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g------ 423 (612) .+.+.+.++.+-+++.. . +...+... +.+.+|||+...+.+...+....+...++.+|.++++.++.|.. T Consensus 353 ~e~~~~~~~~~L~~i~~~~Gls~~t~~~~-~~~~kTATEi~s~~~~~~~t~~~~~~~~~~aL~~lv~~i~~l~~~~~~~~ 431 (517) T protein:vir:98 353 TEQYKEAINQALRTLEMELKLSVGTFSFD-GRSMKTATEIVSENDLTYRTRNDHVYEVEQFIKGLVISVLELAKTYKLFG 431 (517) T ss_pred hHHHHHHHHHHHHHHHHHhCCCccccccc-ccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 23345555555444422 1 22223222 33457888888888888888888999999999998888876632 Q ss_pred CcCCCCcceEEEeeccccccCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCccchhhhhHHHHHHhhccccccccchhH Q lcl|NC_019408. 424 VPLADTENLRYEVNTDFLSTPIGAREMRAIQLMANDGLLPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFINNPDAQ 503 (612) Q Consensus 424 ~~~~~~~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G~is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~ 503 (612) .......+++|..+ |....+ ...++..+.+++.+|.||.++++.. ..|+ .+-+.+++..++.+|....+.-.-. T Consensus 432 ~~~~~~~~v~v~f~-D~i~~D-~~~~~~~~~~~v~aG~ms~~~~i~~--~~g~--~eeeA~~e~~~i~~E~~~~~~~~~~ 505 (517) T protein:vir:98 432 GEIPSAEHIGVDFD-DGVFQD-RSALLRFYGQAKTFGFIPTVEAIQR--IFKV--PKKTAEQWLEEIRKDQIELDPVTIS 505 (517) T ss_pred CCCCCCcceEEEcC-CCCCCC-HHHHHHHHHHHHhcCCCCHHHHHHH--hCCC--ChHHHHHHHHHHHHhccccCCCCcc Confidence 11111223333332 222222 1346788899999999999988654 3465 2334556667776654322211111 Q ss_pred HhhhhhhHHHHh Q lcl|NC_019408. 
504 ARQRGYTNRGQE 515 (612) Q Consensus 504 ~~~~~e~~r~~~ 515 (612) ..+.....-.+| T Consensus 506 ~~~~~~~~gd~e 517 (517) T protein:vir:98 506 QRAQKRMFGDEE 517 (517) T ss_pred ccccCCCCCCCC Confidence 111100000000 No 81 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=99.42 E-value=2.1e-11 Score=79.18 Aligned_cols=437 Identities=9% Similarity=0.025 Sum_probs=200.2 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeecC Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKNL 80 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~~ 80 (612) .-.-|+|......|..+ |.|...+. .|-+ .. ...+.++. .=.|+.+.+++.+++++|..+|++. + T Consensus 33 i~~~~~~~~ri~~~~~~---y~g~~~~~----~~~~---~~--~~~~~~~~--~sln~~~~i~~~~A~lv~~e~~~i~-v 97 (508) T protein:vir:15 33 ISIDPDEYVRIQTDLDY---YSDKLQYI----HYQA---SD--GIKKKRLK--NTINMAKTAARRIASVVFNEKAEIH-V 97 (508) T ss_pred cccCHHHHHHHHHHHHH---hcCCCccc----cccc---CC--CCccccce--eecchHHHHHHHHHhhhhCCCceEE-e Confidence 33466666666666544 77743221 1111 11 11111221 1239999999999999999999984 4 Q ss_pred --CHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhc--chhhhccCCcc Q lcl|NC_019408. 81 --PPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILD--WDEVVDMGGFY 156 (612) Q Consensus 81 --p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~Iin--W~~~~~v~g~~ 156 (612) ++....+++++= +.+++...+..++..++..|.+++-+=+. 
+..+-|..++|++++- |+ +|+ T Consensus 98 ~~~~~~~e~l~~il-~~n~f~~~~~~~~e~a~a~G~~~~k~~~d-------~~~~~i~~v~ad~~~P~~~d-----~~~- 163 (508) T protein:vir:15 98 KDNNEADKFLNDVL-EDNDFKNKFEEALEKGVALGGFAMRPYID-------GNHIKIAWVRADQFYPLQSN-----TND- 163 (508) T ss_pred CCchHHHHHHHHHH-HhccHHHHHHHHHHHHhhcCceEEEEEEe-------CCeeEEEEEcCCeeEEEEEc-----CCC- Confidence 243444433321 12677888999999999999887654332 1356688889999764 42 122 Q ss_pred ceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeeccccccccccee--eeeeeecccccccccccc Q lcl|NC_019408. 157 VPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYIT--VYRELKLEEIEWPSGEVK 234 (612) Q Consensus 157 ~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~--~~R~~~~~~~~~~~g~~~ 234 (612) ....+...++....+ .....|+.+..-.+ ..+|.+.++ +|+..... .. | T Consensus 164 ~~~~af~~~~~~~~~----------~~~~~yt~lE~h~~------------~~~~~~~I~n~ly~~~~~~--~l--G--- 214 (508) T protein:vir:15 164 ISEAAIASRTQRTES----------NQTKYYTLLEFHQW------------QDNGSYQITNELYKSDSPD--IV--G--- 214 (508) T ss_pred eEEEEEEEEEEeecC----------CCceEEEEEEEEEE------------ecCcceEEEEEEEecCCch--hc--C--- Confidence 233334444433210 11223443332211 011111111 22210000 00 0 Q ss_pred ceeEEEEEeeCCCceecceeeeccCCccccceeEEEeecC-CCC----CCcCcCchHHHHHHHHHHHhhhHHHHHHHHHh Q lcl|NC_019408. 235 LAYVQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFFGAS-GNT----ADVEKPPLLDICDLNLSHYRTYAELEYGRLFT 309 (612) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~-~~~----~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~ 309 (612) .......+.+..... .++.. +.+...||+++-.. .|. ...|.+-|.++..+==+.=..-|.+.+.+... 
T Consensus 215 ~~v~l~~~~e~~~l~--~~~~~----~g~~~p~f~y~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~~ 288 (508) T protein:vir:15 215 NQVPLSTLPVYKELA--PQVTI----SGLQRPLFAYFKTPGANNINIESPLGLGVVDNAKHVLDDINDTHDQFIWEIRLG 288 (508) T ss_pred cccchhhcccccCCC--cceEe----cCCCcceeEEecCCccccccCCCCcCCchHhhhHHHHHHHHHHHHHHHHHHHhc Confidence 000000000000000 00110 11445566655321 111 11234444433333111112334455566544 Q ss_pred ccceee---eecCCCCCCceEEEecccc--ccCCC--CCceeEEecCchhHHHHHHHHHHHHHHHHH---HHHHhhhccc Q lcl|NC_019408. 310 ALPVYY---APGTDSEGTGEYHIGPNMV--WEVPQ--GSEPGILEYTGQGLKALETALNDKERQIAA---IGGRMMPGAS 379 (612) Q Consensus 310 ~~P~l~---i~G~~~~~~~~l~iG~~~~--~~lp~--~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~---lGa~ll~~~~ 379 (612) ...+.+ +...+......+-.+.+-. +.... |..+..+.|.=. .+.+.+.++.+...+.. ++...+.. . T Consensus 289 ~~~i~v~~~~l~~d~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir-~e~~~~~~~~~l~~~~~~~gls~~~f~~-~ 366 (508) T protein:vir:15 289 QKHIAVQPGMLRFDDEHKPTFDTEQNVYVGVLSDDNNGLGVKDMTTPIR-TVQYKDAIDHFIKEFEVQIGLSTGTFSY-S 366 (508) T ss_pred ccceeechHHhcCCCCCccccCCCCeeEEeccCCCCCCCceeEeecccC-hHHHHHHHHHHHHHHHHHhCCCchhccc-c Confidence 444444 1112222111111111111 11222 223444444322 23345555555554432 12122211 2 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCC---cC-----------CCCcceEEEeeccccccCC Q lcl|NC_019408. 380 KSVSESNNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDV---PL-----------ADTENLRYEVNTDFLSTPI 445 (612) Q Consensus 380 ~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~---~~-----------~~~~~~~v~ln~dF~~~~~ 445 (612) +.+..||++.....+........+...++.++.++++.++.+... .. ....+++|..+.. 
...+ T Consensus 367 ~~~~~TAtei~s~~~~~~~t~~~~~~~~~~al~~lv~~il~l~~~~~~~~~g~~~~~~~~~~~~~~v~v~f~D~-i~~d- 444 (508) T protein:vir:15 367 NDGVKTATEVVSNNSMTYQTRSSYLTMVEKAIDELCQSIFELANAGALFDDGKPLFTLDSASQPLDIECHFDDG-VFVN- 444 (508) T ss_pred cCccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccCCcceEEEeCCC-CCCC- Confidence 334567777776666666677788888999999987776665321 10 0112333433321 1111 Q ss_pred CHHHHHHHHHHHHcCCCCHHHHHHHHHhcCccchhhhhHHHHHHhhccccccccchh----HHhhhhh Q lcl|NC_019408. 446 GAREMRAIQLMANDGLLPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFINNPDA----QARQRGY 509 (612) Q Consensus 446 d~~~~~al~~~~~~G~is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~----~~~~~~e 509 (612) ...+++.+.+++.+|.+|+++++.. ..|+ .+-+.+++..++.+|.+.....+. ..-..+| T Consensus 445 ~~~~~~~~~~~v~aGi~s~e~~i~~--~~g~--~deea~~el~ri~~E~~~~~~~~~~~~~~~g~~ge 508 (508) T protein:vir:15 445 KDKQLEEDAKVLAIGALSKQTFLQR--NYGM--TDEQAAEELAKIQSEAPTDTFEGGRSAILNGGDGE 508 (508) T ss_pred HHHHHHHHHHHHhcCCCCHHHHHHh--cCCC--ChHHHHHHHHHHHHhccccCccccccccCCCCCCC Confidence 2456888999999999999988743 3454 223345667777776554322221 1122223 No 82 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=99.28 E-value=2e-10 Score=73.79 Aligned_cols=422 Identities=13% Similarity=0.051 Sum_probs=196.6 Q ss_pred CC-CcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeec Q lcl|NC_019408. 1 MV-THPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKN 79 (612) Q Consensus 1 ~~-~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~ 79 (612) =| .-|+|.+....|. .+|.|...+ |-...... ..+.+...+ .|+.+.+++.+++++|.++|+|. 
T Consensus 32 ~i~~~~~~~~~i~~~~---~~Y~g~~~~-------l~~~~~~~--~~~~~~~~s--lnl~~~i~~~~A~ll~~e~~~i~- 96 (505) T protein:vir:79 32 RINLPADEVERIARDK---RYYMDDFKQ-------VTHKNSYG--DTQKHELQS--VNVTKLASAKLASLIFNEQCQVT- 96 (505) T ss_pred CCCCCHHHHHHHHHHH---HHhcCCCcc-------ccccccCC--Cccccceee--cchHHHHHHHHHhhhcCCCceee- Confidence 12 2345666666664 567775322 11100000 111122222 48999999999999999999984 Q ss_pred CC-HHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhc--chhhhccCCcc Q lcl|NC_019408. 80 LP-PKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILD--WDEVVDMGGFY 156 (612) Q Consensus 80 ~p-~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~Iin--W~~~~~v~g~~ 156 (612) ++ +....+++.+= +.+++...+..++..++..|.+++.+=+. +.+|-+..++|++++= |+ +| T Consensus 97 ~~d~~~~e~l~~i~-~~n~f~~~~~~~~e~a~a~G~~~~k~~~D-------~~~~~i~~v~ad~~~P~~~d-----~~-- 161 (505) T protein:vir:79 97 VSDETANDFLDDVF-QQNDFYTTFEEKLEEWIALGSGCVRPYVD-------SGKIKLAWATADQVYPLQAD-----TN-- 161 (505) T ss_pred cCChHHHHHHHHHH-HhccHHHHHHHHHHHHhhcCCeEEEEEEe-------CCceEEEEEcCCeeEEEEEc-----CC-- Confidence 33 33444443321 22678888999999999999777654332 2356788889999865 32 12 Q ss_pred ceeEEEEE-EEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccc Q lcl|NC_019408. 157 VPSRVLLR-EFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKL 235 (612) Q Consensus 157 ~Lt~v~l~-E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~ 235 (612) .++.+++. ++.... ......|+.+..-.+. +|.+.+ T Consensus 162 ~~~~~a~~~~~~~~~----------~~~~~~yt~lE~h~~~-------------~~~~~I-------------------- 198 (505) T protein:vir:79 162 QVNELAIASRTTEVE----------NHRTIYYTLLEFHQWD-------------HGDYVI-------------------- 198 (505) T ss_pred CeEEEEEEEEEEEec----------CCcceEEEEEEEEEec-------------CceEEE-------------------- Confidence 23333332 322211 0111234434332111 111111 Q ss_pred eeEEEEEeeCCCceec-----ce------eeeccCCccccceeEEEeec--CCCCC---CcCcCchHHHHHHHHHHHhhh Q lcl|NC_019408. 
236 AYVQYLYEEDPESRPI-----AR------IVPTVRGEPLDFIPFKFFGA--SGNTA---DVEKPPLLDICDLNLSHYRTY 299 (612) Q Consensus 236 ~~~~~~~~~~~~~~~~-----~~------~~p~~~g~~l~~IP~v~~~~--~~~~~---~~~~pPLldLA~lnl~HY~~~ 299 (612) ...+|........+ .+ ..|.+.=+.++..||+.+-. .++.. ..|.+-|.++..+==+.=..- T Consensus 199 --~n~ly~~~~~~~lG~~v~l~~~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~ 276 (505) T protein:vir:79 199 --TNELYRSEAAETVGINVPLNSLEQYEGLEPQVKITGLKHPLFAFYRNKGANNKNFTSPMGMSLIDNSYTVIDAINRTH 276 (505) T ss_pred --EEEEEecCCCCccCcccchhhcccccccCcceeecCCCcceEEEecCCcccccccCCccCCchhhhhHHHHHHHHHHH Confidence 11122111100000 00 00000001134445554421 11211 123444433333211111222 Q ss_pred HHHHHHHHHhccceee----eecCCCCCCceEEEe------cccc---ccCCCCCceeEEecCchh-HHHHHHHHHHHHH Q lcl|NC_019408. 300 AELEYGRLFTALPVYY----APGTDSEGTGEYHIG------PNMV---WEVPQGSEPGILEYTGQG-LKALETALNDKER 365 (612) Q Consensus 300 sD~~~~l~~~~~P~l~----i~G~~~~~~~~l~iG------~~~~---~~lp~~~~~~~lE~~g~~-l~~~~~~l~~~e~ 365 (612) |.+.+.+......+.+ +.-...........+ -..+ +.. .++..+|-.+++.- .+++.+.|+.+.+ T Consensus 277 s~~~~e~~~g~~~i~v~~~~l~~~~~~~~~~~~~~~~~fd~~~~~y~~~~~-~~~~~~i~~~~~~ir~e~~~~~l~~~l~ 355 (505) T protein:vir:79 277 DQFVDEVKKGQRRLIVPAEWLKTGSSYGGQASETHPPMFDPDETVYQAMYG-DASEVGFHDATSPIRVADYQATMDFFLR 355 (505) T ss_pred HHHHHHHHhcccceeechHHhcccCCCCcccccccccCCCccceeeeeccC-CCCCCceEEecccCCHHHHHHHHHHHHH Confidence 3334444433333333 211000000000001 0111 111 23334455555542 2445566666665 Q ss_pred HHHHH-H--HHhhhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcC----------CC--Cc Q lcl|NC_019408. 366 QIAAI-G--GRMMPGASKSVSESNNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPL----------AD--TE 430 (612) Q Consensus 366 qm~~l-G--a~ll~~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~----------~~--~~ 430 (612) ++... | ...+.. .+.+..||++...+.+...+....+...++.+|.++++.++.|..... .. .. 
T Consensus 356 ~i~~~~g~s~~~~~~-~~~~~~TAtei~s~~~~l~~t~~~~~~~~~~al~~li~~i~~~~~~~~~~~~g~~~~~~~~~~~ 434 (505) T protein:vir:79 356 EFENQTGLSQGTFTT-SPSGIQTATEVVTNNSQTYQTRSSYITQVEKTIKALTYAILELASVPSFYADGQARWTGDVDSL 434 (505) T ss_pred HHHHHhCCChhhcCC-CccccchHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCCCce Confidence 54432 2 122211 233456888877777777788888888999999998888887643211 01 12 Q ss_pred ceEEEeeccccccCC-C-HHHHHHHHHHHHcCCCCHHHHHHHHHhcCccchhhhhHHHHHHhhccccccccchhHHhhhh Q lcl|NC_019408. 431 NLRYEVNTDFLSTPI-G-AREMRAIQLMANDGLLPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFINNPDAQARQRG 508 (612) Q Consensus 431 ~~~v~ln~dF~~~~~-d-~~~~~al~~~~~~G~is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~ 508 (612) +++|. |...-+ | ..+++...+++.+|.+|+++++.. ..|+ .+-+.+++..+|.+|... ..|+...- -+ T Consensus 435 ~i~v~----f~d~i~~d~~~~~~~~~~~v~~Gi~s~e~~l~~--~~~~--~eeea~~el~ri~~E~~~-~~p~~~~~-gg 504 (505) T protein:vir:79 435 DITIN----FNDGVFVDQESKRAADLQAVQAQVMPKKQFLMR--NYGL--DEEEADEWLAQIDAENST-AEPEFNQF-GG 504 (505) T ss_pred eEEEE----eCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHHh--cCCC--ChHHHHHHHHHHHHhccc-cCCCchhc-cC Confidence 33343 333211 2 346788999999999999987643 3454 233355667777666432 11222111 01 Q ss_pred h Q lcl|NC_019408. 509 Y 509 (612) Q Consensus 509 e 509 (612) + T Consensus 505 ~ 505 (505) T protein:vir:79 505 D 505 (505) T ss_pred C Confidence 1 No 83 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=99.12 E-value=1.7e-09 Score=68.65 Aligned_cols=451 Identities=8% Similarity=-0.040 Sum_probs=193.7 Q ss_pred HHHHH----HHHHHhcChHHH--HhcccccCCCCCCCCH----HHHHHHH----------hhccCCchHHHHHHHhhchh Q lcl|NC_019408. 11 RPEWT----KLRDVMAGQREI--KRKAEAYLPAMKGADG----DDYAIYL----------QRATFFNMLAQTRDGMTGMV 70 (612) Q Consensus 11 ~~~W~----~i~d~~~G~~~v--r~~g~~YLPk~~~e~~----~~Y~~rl----------~rA~~~n~~~~tv~~~~G~v 70 (612) +--|. .|..-+.|+..= -..-..|++.+...-. 
..|-.++ .+-.=+|..+.+++-++.+| T Consensus 1 ~~~~~~~~~~i~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~l~~~i~~~~A~ll 80 (518) T protein:vir:78 1 MGVWSVMTRFIKGWLNGKPNGSEPELIPKYLPLVPDNQKEWSKDSYLTSLWAQGYVPTVHDKLMNSGTGNEIVVVAAEYI 80 (518) T ss_pred CcchhhHHHHHHHhhcCCCCccchhccHHHhhhcccchhhhhhhhhhhhhcccCCCCccccccccCChHHHHHHHHHHhh Confidence 11221 122223332110 0011123332221100 1111111 12234577899999999999 Q ss_pred hcCCceeecC-------CHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEE--EEecCcchhhhhccCceEEEechh Q lcl|NC_019408. 71 FRRDPIVKNL-------PPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGV--LVDVVDNPRKGAVATSFAVGYSAE 141 (612) Q Consensus 71 f~k~p~~~~~-------p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~v--lVD~p~a~~~~~~~rPy~~~~~ae 141 (612) |..+|+|+ + ++.+..+++.+ ...+.++..+...+..++..|.+++ .+| ..+|-+..++|. T Consensus 81 ~~e~~~i~-v~~~~~~d~e~~~~~l~~i-l~~n~f~~~~~~~~e~a~a~G~~~~k~~~d---------~~~~~i~~v~ad 149 (518) T protein:vir:78 81 SGKPLSID-VTGVNGSKDENLTKQLKEA-LRIDNFDSKSVKIVELAGGSGVSAVKINIL---------NGRPSISVHSSS 149 (518) T ss_pred cCCCceEE-ecCccccCcHHHHHHHHHH-HHhccHHHHHHHHHHHhhccCceEEEEEEE---------CCeeEEEEEcCC Confidence 99999883 3 23455555443 2237788888899999999997664 333 236777888888 Q ss_pred hhhcchhhhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeee Q lcl|NC_019408. 142 NILDWDEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYREL 221 (612) Q Consensus 142 ~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~ 221 (612) +++=.. -+|+ ++-+++.+..... .....|+.|......-... .....|.+. ..|... 
T Consensus 150 ~~~P~~----~~g~--~~~~~f~~~~~~~-----------~k~~~y~~lE~he~~~~~~-----~~~~~~~~~-I~n~ly 206 (518) T protein:vir:78 150 QFWIDF----KNNE--PFRFNFFEEIPTS-----------NKADIYYLVESREIKQWDK-----EGKKLSGGF-VTYSVI 206 (518) T ss_pred eeEEEe----ecCc--EEEEEEEEEeecC-----------CcceeEEEEEeeccccccc-----eeeccccee-EEEEEe Confidence 876521 1443 5555554432211 0112343333221100000 000001100 111111 Q ss_pred eccccccccccccce--eEEEEEeeCCCceecceeeeccCCccccceeEEEe---ecCCCCCC---cCcCchHHHHHHHH Q lcl|NC_019408. 222 KLEEIEWPSGEVKLA--YVQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFF---GASGNTAD---VEKPPLLDICDLNL 293 (612) Q Consensus 222 ~~~~~~~~~g~~~~~--~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~---~~~~~~~~---~~~pPLldLA~lnl 293 (612) ..+. .++ +.+. ........ ...+. ...+..--.+....||+++ ...++... .|.+-|.++..+=- T Consensus 207 ~~~~---~~~-v~~~~~~~~~~l~~-~~~~~--~~~e~~~~~tg~~~~~~~~~~n~~~N~~~~~splG~S~~~~~~~~id 279 (518) T protein:vir:78 207 KIDG---DKT-TPISAERLPEQITS-YLHTN--DIQLNHSVSIGLKSMGAYLINNSPSNTRYPHLNLGESDLSQCTNYLF 279 (518) T ss_pred eecC---ccc-cccccccccccccc-ccccc--cCccceeeccCCccceEEeeccccccccccCCCcCcchHhhhhHHHH Confidence 1000 000 0000 00000000 00000 0000000001223444443 22222221 14444444333322 Q ss_pred HHHhhhHHHHHHHHHhccceeeee-cC-----CCCCCc---eEEEeccccccCC----CCCc----eeEEecCchhHHHH Q lcl|NC_019408. 294 SHYRTYAELEYGRLFTALPVYYAP-GT-----DSEGTG---EYHIGPNMVWEVP----QGSE----PGILEYTGQGLKAL 356 (612) Q Consensus 294 ~HY~~~sD~~~~l~~~~~P~l~i~-G~-----~~~~~~---~l~iG~~~~~~lp----~~~~----~~~lE~~g~~l~~~ 356 (612) ..=..-|.+.+.+.. +.+..+++ .+ +..... .+-.+.+....+. .|.+ ..-++|.=. 
.+.+ T Consensus 280 ~lD~~~s~~~~e~~~-g~~~i~v~~~~l~~~~~~~~~~~~~~fd~~~~~y~~i~~~~~~~~~~~~~i~~~~~~Ir-~e~~ 357 (518) T protein:vir:78 280 AVDYFFTVYMREGEK-TKTKIAASERMFRKKVNKSTDKEEWSMNVDEDYFMQFKGTLDAGAKLNDMIQFMQGDFR-DGSY 357 (518) T ss_pred HHHHHHHHHHHHHHh-CCceeeechhHhccCCCCCCCccccccCCCCceEEEecCcCCCCCccccceeeeecccC-hHHH Confidence 222233445666655 44444442 11 000000 0111112111111 1111 222333322 1334 Q ss_pred HHHHHHHHHHHHH-HH--HHhhhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCC------ Q lcl|NC_019408. 357 ETALNDKERQIAA-IG--GRMMPGASKSVSESNNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLA------ 427 (612) Q Consensus 357 ~~~l~~~e~qm~~-lG--a~ll~~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~------ 427 (612) ...++.+-.++.. +| ...+... .+..|||+...+.+...+.+......++.++.+++..++..++...+ T Consensus 358 ~~~~~~~l~~~~~~~G~s~~tfg~~--~~~~TATei~s~~~~~~~t~~~~~~~~e~al~~l~~~i~~l~~~~~~~~~~~~ 435 (518) T protein:vir:78 358 RETMEYFAQKAVSKSGYNPATFNLG--NREVKATEIWSLQDATVRKIEKKKRLIQNVYEQMLWDFLYLLTGGTNNKEKAI 435 (518) T ss_pred HHHHHHHHHHHHHhhCCChhhcCcc--cccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccccc Confidence 4555554444421 12 2222222 23578888888888888899999999999999988887765432211 Q ss_pred CCc--ceEEEeeccccccCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCccchhhhhHHHHHHhhccccccccc--hhH Q lcl|NC_019408. 428 DTE--NLRYEVNTDFLSTPIGAREMRAIQLMANDGLLPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFINNP--DAQ 503 (612) Q Consensus 428 ~~~--~~~v~ln~dF~~~~~d~~~~~al~~~~~~G~is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~--~~~ 503 (612) ... +++|..+ |-.+.+ ....++.+.+++.+|.||++++++.+ -.++ .+-+.+++.++|.+|......+ +++ T Consensus 436 ~~~~~~v~i~f~-D~i~~D-~~~~~~~~~~~v~aGimS~e~~i~~~-~~~~--~deea~~e~~ri~~E~~~~~~~~p~~~ 510 (518) T protein:vir:78 436 MRDEIRVIIEFP-DPMSVN-LNELSSTLNNMNSALAMSVEEKVKLI-HPKW--EDEEIQAEVKRIYLENAIGEVPDPEAI 510 (518) T ss_pred CCCceeEEEEeC-CCCCCC-HHHHHHHHHHHHhcCCCCHHHHHHHh-CCCC--CHHHHHHHHHHHHHHhcccCCCCCccc Confidence 112 3444333 222222 13356667789999999999987754 2233 3334566777777765543333 333 Q ss_pred HhhhhhhH Q lcl|NC_019408. 
504 ARQRGYTN 511 (612) Q Consensus 504 ~~~~~e~~ 511 (612) .....+.- T Consensus 511 ~g~~~~~g 518 (518) T protein:vir:78 511 GGMETKGG 518 (518) T ss_pred cCCCCCCC Confidence 32211111 No 84 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=99.10 E-value=2.1e-09 Score=68.18 Aligned_cols=443 Identities=9% Similarity=0.002 Sum_probs=196.8 Q ss_pred CC-CcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeec Q lcl|NC_019408. 1 MV-THPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKN 79 (612) Q Consensus 1 ~~-~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~ 79 (612) .| .||++......| +.+|.|... ...|. ... .... .+. -.-.|+.+.+++.++++||..+|++. T Consensus 31 ~i~~~~~~~~~i~~~---~~~y~g~~~----~~~~~---~~~-~~~~-~~~--~~slnl~~~i~~~~A~lv~~e~~~i~- 95 (522) T protein:vir:47 31 KIAVTQEEYDRIKRN---LVYYQSKWD----DVQYK---NTD-GDIK-SRP--MNHLPIARTASKKIASLVYNEQATIT- 95 (522) T ss_pred CCCCCHHHHHHHHHH---HHHhcCCcc----ccccc---ccC-cchh-ccc--ceecchHHHHHHHHhhhhcCCcceee- Confidence 11 266655555545 456766311 00111 111 1111 111 11249999999999999999999984 Q ss_pred CC-HHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhc--chhhhccCCcc Q lcl|NC_019408. 80 LP-PKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILD--WDEVVDMGGFY 156 (612) Q Consensus 80 ~p-~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~Iin--W~~~~~v~g~~ 156 (612) ++ +.+..+++.+=.+ +.++..+...+..++..|-+++.+=+- +.++-+..++|.+++= | ++.. 
T Consensus 96 v~d~~~~~~l~~~l~~-n~f~~~~~~~~e~a~a~G~~a~k~~~d-------~~~~~i~~v~ad~~~P~~~------~~~~ 161 (522) T protein:vir:47 96 TKNEILQKFLDDMLTN-DRFNKNFERYLESCLALGGLAMRPYID-------GDKVRVAFIQAPVFFPLES------NTQD 161 (522) T ss_pred cCChHHHHHHHHHHhh-cchHHHHHHHHHHhhccCCEEEEEEEc-------CCceEEEEEcCCceEEEEE------cCCc Confidence 33 4555555544333 678888999999999988665543221 2357788888888863 4 3344 Q ss_pred ceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccce--eeeeeeecccccccccccc Q lcl|NC_019408. 157 VPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYI--TVYRELKLEEIEWPSGEVK 234 (612) Q Consensus 157 ~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~--~~~R~~~~~~~~~~~g~~~ 234 (612) .+.-+++.+++...+ +....|+.|....+.-+......... .++.+.+ ..|+-.... ... T Consensus 162 ~~e~a~~~~~~~~~~----------~~~~~yt~lE~he~~~~~~~~~~~~~-~~~~~~I~n~ly~~~~~~-------~lG 223 (522) T protein:vir:47 162 VSSAAILTKTIKSEG----------RKNVYYTLVEFHEWVTADGQETGSTN-DKKYYRITNELYRSDVND-------VLG 223 (522) T ss_pred eEEEEEEEEEEeecc----------cceeEEEEEEEeeecccccccccccc-cCCceEEEEEEeecCCCc-------ccC Confidence 455555555543211 11223444443322111111100000 0111111 111110000 000 Q ss_pred ceeEEEEEeeCCCceecceeeeccCCccccceeEEEeecC-CCCCCc----CcCchHHH----HHHHHHHHhhhHHHHHH Q lcl|NC_019408. 235 LAYVQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFFGAS-GNTADV----EKPPLLDI----CDLNLSHYRTYAELEYG 305 (612) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~-~~~~~~----~~pPLldL----A~lnl~HY~~~sD~~~~ 305 (612) ......-+.+..... ..++. ..+...+|+++-.. .|.-.. |.+-+.+. 
-.+|..+-+ +-+- T Consensus 224 ~~v~l~~~~e~~~l~--~~~~~----~~~~~Plf~y~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~----~~~e 293 (522) T protein:vir:47 224 QRVNLSELDKYKNLE--PVTVF----ENLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRSYDE----FMWE 293 (522) T ss_pred ccccccccccccCCC--CceEe----CCCCcceEEEecCCcccccccCCCcCCchhhhhHHHHHHHHHHHHH----HHHH Confidence 000000000000000 00111 11233334444211 121111 22322222 233432222 2223 Q ss_pred HHHhccceee----eecCCCC------------CCceEEEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 306 RLFTALPVYY----APGTDSE------------GTGEYHIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAA 369 (612) Q Consensus 306 l~~~~~P~l~----i~G~~~~------------~~~~l~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~ 369 (612) +......+.+ +.-.... .+..+-++-+. ..+.++++..++|.=.. +.+...++.+...+.. T Consensus 294 ~~~g~~~i~v~~~~l~~~~~~~~g~~~~~~~fd~~~~~f~~~~~--~~~~~~~i~~~~~~ir~-e~~~~~~~~~l~~i~~ 370 (522) T protein:vir:47 294 VRMGQRRVIVPEHLTQRQYQRPDGTIDFRPRFDVEQNVYMQIGG--SSMDAGGITDLTSPIRA-NDYILAISEGLKLFEM 370 (522) T ss_pred HHhccceeecchHHhccCCCCCCcccccccccCcccceEeecCC--CCCCCCcceeeccccCh-HHHHHHHHHHHHHHHH Confidence 3322222222 2100000 01112222111 11223344444443322 3344455554444422 Q ss_pred ---HHHHhhhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCC------cCCCCcceEEEeeccc Q lcl|NC_019408. 370 ---IGGRMMPGASKSVSESNNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDV------PLADTENLRYEVNTDF 440 (612) Q Consensus 370 ---lGa~ll~~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~------~~~~~~~~~v~ln~dF 440 (612) ++...+... +.+.+|||+...+.+...+....+...++.||.+++..++.|... ......+++|..+ |. 
T Consensus 371 ~~gls~~tf~~~-~~~~kTAtEi~s~~~~~~~t~~~~~~~~~~al~~lv~~i~~l~~~~~~~~~~~~~~~~i~v~f~-D~ 448 (522) T protein:vir:47 371 QIGVSSGMFTFD-GQGMKTATEIVSENSDTYQMRSSIVALVEQSIKELCVSMCELGKAVGVYSGEIPELDDISVNLD-DG 448 (522) T ss_pred HhCCCccccCcc-ccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCCCCcceeEEEcC-CC Confidence 222222222 234578888877888888888899999999999998888877531 1112223444433 22 Q ss_pred cccCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCccchhhhhHHHHHHhhcccccccc---------chhHHhhhhhh Q lcl|NC_019408. 441 LSTPIGAREMRAIQLMANDGLLPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFINN---------PDAQARQRGYT 510 (612) Q Consensus 441 ~~~~~d~~~~~al~~~~~~G~is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~---------~~~~~~~~~e~ 510 (612) ...+ ...+++.+.+++.+|.||+++++.. ..|+- +-+.+++..++.+|...... .+.+....++- T Consensus 449 i~~D-~~~~~~~~~~~v~aG~~s~e~~i~~--~~g~~--eeea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~d~~~ 522 (522) T protein:vir:47 449 VFTD-RHAELDYWAKMVAAGFSTKKRAIGK--TLNIS--GVEAEKELNAINSELLPMNDAELAIYGMHDQNEEKADDKG 522 (522) T ss_pred CCCC-HHHHHHHHHHHHhcCCCCHHHHHHh--cCCCC--hHHHHHHHHHHHHhhccCCCCCCCCCCCCCcccccCCCCC Confidence 2222 1446788999999999999988753 34652 22345666777665322111 11111111111 No 85 >protein:vir:100920 Length: 725 # NCBI annotation: Gp1 # Family: family:all:487 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006406;genbank:gi:46358698;genbank:GeneID:2777070 Probab=99.09 E-value=2.4e-09 Score=67.87 Aligned_cols=579 Identities=9% Similarity=-0.019 Sum_probs=175.3 Q ss_pred cHHHHHHHHH-HHHHHHHhcChHHHHhcc---cccC--CCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCcee Q lcl|NC_019408. 4 HPEYQYWRPE-WTKLRDVMAGQREIKRKA---EAYL--PAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIV 77 (612) Q Consensus 4 hP~y~~~~~~-W~~i~d~~~G~~~vr~~g---~~YL--Pk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~ 77 (612) -|+=...+.. -..++.++.+...+|... ..|. =+|+.+.....+. 
..|=+ +|.++++|+.++|.-=+..+.+ T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~QW~~~~~~~l~~-q~rp~-~N~i~~~v~~v~g~e~~nr~d~ 78 (725) T protein:vir:10 1 MADNENRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTL-QYRGQ-FDVVRPVVRKLVSEMRQNPIDV 78 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHh-cCCCc-ccchHHHHHHHHhhHHhCCcce Confidence 1111111111 111222333333333211 1111 2555444443333 33434 5999999999999977777766 Q ss_pred ecCC---------HHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEE--EEecCcchhhhhc--cCceEEEechhhh- Q lcl|NC_019408. 78 KNLP---------PKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGV--LVDVVDNPRKGAV--ATSFAVGYSAENI- 143 (612) Q Consensus 78 ~~~p---------~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~v--lVD~p~a~~~~~~--~rPy~~~~~ae~I- 143 (612) .-.| +.|..++..+ .+-++.+.-+..+|..++.+|.+|+ ..||...+..... .+-+.++.++.+| T Consensus 79 ~v~p~~~~d~~~Ae~l~~~~~~~-~~~~~~~~~~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~i~~~~i~~~~~~v~ 157 (725) T protein:vir:10 79 LYRPKDGASPDAADVLMGMYRTD-MRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACSHVI 157 (725) T ss_pred EEecCCcchHHHHHHHHHHHHHH-HHhcCcchHHhHHHHHHhhcCcceeeeeccccCCCCCCCceeeeeeecccCHhHcc Confidence 3223 2234444333 2234555668899999999999995 4477433221111 1222223345555 Q ss_pred hcchhhhccCCccceeEEEEEEEeec-----cccccCCC---cccccceeeeeeEeeecccccccceeec---------- Q lcl|NC_019408. 144 LDWDEVVDMGGFYVPSRVLLREFVRD-----LRWKSDIE---PLTTAQARKARAAALASGSASSPMVRQT---------- 205 (612) Q Consensus 144 inW~~~~~v~g~~~Lt~v~l~E~v~~-----~~~~~~~d---~f~~~~~~q~r~l~l~~g~~~~~~~~~~---------- 205 (612) +||... ..|.. ...|+.+++.+.. .....+.+ .++......| .++-+....++.. 
T Consensus 158 ~Dp~a~-~~D~s-Dar~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~-----~~~~~~~~~vrv~E~~~r~~~~~ 230 (725) T protein:vir:10 158 WDSNSK-LMDKS-DARHCTVIHSMSQNGWDDFAEKYDLDADNIPSFQNPNDW-----VFPWLTQDTIQIAEFYEVVEKKE 230 (725) T ss_pred cCchhh-ccChh-hhhhhhhhccCCHHHHHHHHHhCCCcccccccccccccc-----cccccCCCeEEEEEEEEEEEEee Confidence 444322 22211 1233333332221 00011111 0110000000 0000000000000 Q ss_pred --ccccccccc-eeeeeeeeccccc---ccccc--ccce----eEEEEEeeCCCceecceeeeccCCccccceeEEEeec Q lcl|NC_019408. 206 --ARTLGGYSY-ITVYRELKLEEIE---WPSGE--VKLA----YVQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFFGA 273 (612) Q Consensus 206 --~~~~~g~~~-~~~~R~~~~~~~~---~~~g~--~~~~----~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~ 273 (612) .....+... +..|-...+..+. ...|. +..+ ....++...|..+ -+..+...| ++||||+|.. T Consensus 231 ~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~--l~~~~~~~~---~~fP~vP~~g 305 (725) T protein:vir:10 231 TAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAV--LKDKQLIAG---EHIPIVPVFG 305 (725) T ss_pred EEEEeccCCCCceeecchhhhHHHHHHhhcccchhhhhccceeeEEEEEeecchhh--hcCCCCCCC---CceeEEEEEe Confidence 000000000 0000000000000 00000 0000 0001111111111 011111122 3455554321 Q ss_pred C----CCCC---CcCcCchHHHHHHHHHHHhhhHHHHHHHHH-hccceeeeecCCCC----CC--ceEEEeccccccCCC Q lcl|NC_019408. 274 S----GNTA---DVEKPPLLDICDLNLSHYRTYAELEYGRLF-TALPVYYAPGTDSE----GT--GEYHIGPNMVWEVPQ 339 (612) Q Consensus 274 ~----~~~~---~~~~pPLldLA~lnl~HY~~~sD~~~~l~~-~~~P~l~i~G~~~~----~~--~~l~iG~~~~~~lp~ 339 (612) . ++.+ +.-. ++.|.=. .-.++.|.-+ +++.. ...+..+-.|.-.. |. +...+..-+.+.... 
T Consensus 306 ~r~~~~g~~~~~G~vr-~~kd~Q~--~~N~~~s~~~-~~~~~~~~~~~~~~~~~i~~~e~~~~~~~~~~~~~~~~~~~~~ 381 (725) T protein:vir:10 306 EWGFVEDKEVYEGVVR-LTKDGQR--LRNMIMSFNA-DIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENN 381 (725) T ss_pred eeeccCCcceeeeeec-cchhHHH--HHHHHHHHHH-HHHHhcCCccccccHhhhhHHHHHHhccCCceeeecccccccC Confidence 1 1111 1111 1112111 1112333323 33332 22222222211000 00 001111111111111 Q ss_pred C----CceeEEecCchhHHHHHHHHHHHHHHHHH-HHH--HhhhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 340 G----SEPGILEYTGQGLKALETALNDKERQIAA-IGG--RMMPGASKSVSESNNQTVLREANEQSLLLNIIQACESGMT 412 (612) Q Consensus 340 ~----~~~~~lE~~g~~l~~~~~~l~~~e~qm~~-lGa--~ll~~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~ 412 (612) | ..+.+.++..- ...+..-|+.....|.. .|. .++-..+ .+.|+.+...+..+..-.|..+..|+..+.. T Consensus 382 g~~~~~~i~~~~~~~~-p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~--n~~SG~ai~~rq~qg~~~l~~~~Dnl~~~~~ 458 (725) T protein:vir:10 382 GEMPTQPLAYYENPEV-PQANAYMLEAATAAVKEVATLGVDAEAVNG--GQVAYDTVNQLNMRADLETYVFQDNLATAMR 458 (725) T ss_pred cccccccCcccCCCCc-hHHHHHHHHHHHHHHHHHhCCCHHHhCcCc--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1 23444443322 22344555555555533 343 2332222 2456777777777766677778888877776 Q ss_pred H----HHHHHHHHcCCcC------CCCcceEEEeecc------------------ccc--c------CCCHHHHHHHHHH Q lcl|NC_019408. 413 D----VVRWWLMWRDVPL------ADTENLRYEVNTD------------------FLS--T------PIGAREMRAIQLM 456 (612) Q Consensus 413 ~----~l~~~a~w~g~~~------~~~~~~~v~ln~d------------------F~~--~------~~d~~~~~al~~~ 456 (612) . +|.++..+++..- .++..-.|.||.. |+. . 
....+.+.+|+++ T Consensus 459 ~~g~~lL~lI~~~~~~er~~RI~~edg~~~~v~in~~~~d~~~G~~v~~Ndi~g~~Dv~v~~~p~~~s~r~~~~~~l~ql 538 (725) T protein:vir:10 459 RDGEIYQSIVNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRSEILEL 538 (725) T ss_pred HHHHHHHHHHHHHcCCCcEEEEecCCCCcceeEeccccccccccchhhhhccccceeEEEeeccCcHHHHHHHHHHHHHH Confidence 5 5666667764211 1111112233321 111 0 0012334455555 Q ss_pred HHcCC-CC---HHHHHHHHHhcCccchhhhhHHHHHHhhccccccc--cchhHHh--------hhhhhHH---------- Q lcl|NC_019408. 457 ANDGL-LP---DPVFYEYMRKAEVISSDMTFEEFQALRADENSFIN--NPDAQAR--------QRGYTNR---------- 512 (612) Q Consensus 457 ~~~G~-is---~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~--~~~~~~~--------~~~e~~r---------- 512 (612) +.+-- +. ..++..++ .+++ --..++..+++..+.+... .+..... +....+. T Consensus 539 l~~~~~~~~~~~~~l~~~~---~~~d-~~~~~e~~erirkq~~~~~~~~~~~~e~~q~~~e~qq~~~~q~~~e~~q~~~~ 614 (725) T protein:vir:10 539 LGKTPQGTPEYQLLLLQYF---TLLD-GKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQGV 614 (725) T ss_pred HHhccccchhHHHHHHHHh---hcCC-chhHHHHHHHHHhhhhhhccCCccccchhHHHHHHHHHHHhhhHHHHHHHHHH Confidence 44311 00 01111111 1111 0011233344432211110 0000000 0000000 Q ss_pred --HHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HH--HHH-----HHHHHHHHHHH Q lcl|NC_019408. 513 --GQELEQSRMAREADFTQQKIDIQERSVAVQEGHAEVAHAAGSTSISGSRKL----GD--PEQ-----AKPAVADQATI 579 (612) Q Consensus 513 --~~~~e~~r~~~e~e~~~q~~e~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~----~~--e~q-----~k~~~~eq~~~ 579 (612) +.+.+.+++.+|.+..+..+...+.+++..+.+......++...+.+.++. .+ .++ ++.++...+. T Consensus 615 ~~~~qae~~ka~aE~~k~~~~a~~~~~~a~~~a~~~~~~~~q~~~~q~~~~~~~~~~~~~~q~~~~~~~~~~ae~~~~~- 693 (725) T protein:vir:10 615 LLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDARANAELLLKG- 693 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHH- Confidence 000000011111110011111111111111110000000000000000000 00 000 0001111111 Q ss_pred HHHHHHHHhhccccCCCchhhcCCCCCcccCCC Q lcl|NC_019408. 
580 DNAKKQTANAAKVAAQPPAPAAPGAPPTNRRPT 612 (612) Q Consensus 580 ~~~~k~~~~~a~~~~~~~~~~~~~~~~~~~~~~ 612 (612) ..++++++.+..+ +....+...||.+=.-| T Consensus 694 ~~~~~~~~~~~~~---~~~~q~~~~~~~~~~~~ 723 (725) T protein:vir:10 694 NEQTHKQRMDIAN---ILQSQRQNQPSGSVAET 723 (725) T ss_pred HHHHHHHHhhhhh---ccccccccCCCcccccC Confidence 1111222222222 22333334444444433 No 86 >protein:vir:95821 Length: 763 # NCBI annotation: 94 kDa protein # Family: family:all:1548 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950537;genbank:gi:119952228;genbank:GeneID:5075648 Probab=99.08 E-value=2.6e-09 Score=67.62 Aligned_cols=560 Identities=9% Similarity=0.012 Sum_probs=160.6 Q ss_pred CCCcHHHHHHH--HHHH-------HHHHH---hcChHHHHhcccccCCCCCCCCHHHHHHH-HhhccCCchHHHHHHHhh Q lcl|NC_019408. 1 MVTHPEYQYWR--PEWT-------KLRDV---MAGQREIKRKAEAYLPAMKGADGDDYAIY-LQRATFFNMLAQTRDGMT 67 (612) Q Consensus 1 ~~~hP~y~~~~--~~W~-------~i~d~---~~G~~~vr~~g~~YLPk~~~e~~~~Y~~r-l~rA~~~n~~~~tv~~~~ 67 (612) +++.|+-.+.. +.|. +..|| ......++.+...+|-.+..+.+..-..- =..++..+-++++++.|. T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~grs~vv~~~v~~~ve~~~ 87 (763) T protein:vir:95 8 MVPLPDPSQATKLTSWKNELSLQALKADLDAAKPSHTAMMIKVKEWNDLMRIEGKAKPPKVKGRSQVQPKLVRRQAEWRY 87 (763) T ss_pred cCCCccccchhcCCCCCChHHHHHHHHHHHhhhcchhHHHHHHHHHHHhhhccccCcccccCCCccccCHHHHHHHHHHH Confidence 44444333221 2332 22222 11122222111122221111111100000 056778888999999999 Q ss_pred chhhc---CCcee-ecCCH-------------HHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEE--EEe--c----- Q lcl|NC_019408. 
68 GMVFR---RDPIV-KNLPP-------------KFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGV--LVD--V----- 121 (612) Q Consensus 68 G~vf~---k~p~~-~~~p~-------------~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~v--lVD--~----- 121 (612) +.+.+ -.+.+ +-.|- .+.+ .-+...+| ..++..+|+.+|..|.+.| .+| + T Consensus 88 ~~l~~~f~~~~~~~~~~P~~~~D~~~A~q~t~~~n~-~~~~~~~~---~~~~~~~~~~~l~~~~gv~k~~W~~~~~~~~~ 163 (763) T protein:vir:95 88 SALTEPFLGSNKLFKVTPVTWEDVQGARQNELVLNY-QFRTKLNR---VSFIDNYVRSVVDDGTGIVRVGWNREIRKEKQ 163 (763) T ss_pred HHHHHhhcCCCcEEEEecCCcchHHHHHHHHHHHHH-HHhhcCch---hhHHHHHHHHHhhcCcceEEEeeeeeeeeeee Confidence 87765 33323 21221 1222 12222223 4456677888888775522 223 1 Q ss_pred --Cc--------ch-------h-h---------------------------------------------hhccCceEEEe Q lcl|NC_019408. 122 --VD--------NP-------R-K---------------------------------------------GAVATSFAVGY 138 (612) Q Consensus 122 --p~--------a~-------~-~---------------------------------------------~~~~rPy~~~~ 138 (612) +. ++ . . ....+|.|..+ T Consensus 164 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~p~ie~V 243 (763) T protein:vir:95 164 EVPVFSLFPIQTQEQADALQQALQLRTDNPRGYEENVDEAIKESVRFFDETGQATYAVQTGTTTTEVEVPLANHPTVEML 243 (763) T ss_pred eehhhhhccccchhHHHHHHHHHHhhhhhhccccccccchhhhhhhhccccCcceeeecccceeEEEEEEecCceEEEee Confidence 10 00 0 0 00124455555 Q ss_pred chhhhhcchhhhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccc---------eeeccccc Q lcl|NC_019408. 139 SAENILDWDEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPM---------VRQTARTL 209 (612) Q Consensus 139 ~ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~---------~~~~~~~~ 209 (612) +|++++ |+....-+ .....++..+-.........- ++. |..+.-.++...... +....... 
T Consensus 244 ~p~d~~-iDp~a~sD-~~Da~~~~~~~~~t~~dL~~~--~~~------y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 313 (763) T protein:vir:95 244 NPENII-IDPSCQGD-INKAMFAIVSFETCKADLLKE--KDR------YHNLNKIDWQSSAPVNEPDHATTTPQEFQISD 313 (763) T ss_pred cHHHhe-ecCCCCCc-hhhCceEeeEEeccHHHHHhc--cCC------ccccchhcchhccccccccccccchhhccCCC Confidence 555554 32211100 111111111100000000000 000 000000000000000 00000000 Q ss_pred ccccceeeeeeeeccccccccccccceeEEEEEeeCCCceecceeeeccCCc--cccceeEEEeecCCC-CCCcCcCchH Q lcl|NC_019408. 210 GGYSYITVYRELKLEEIEWPSGEVKLAYVQYLYEEDPESRPIARIVPTVRGE--PLDFIPFKFFGASGN-TADVEKPPLL 286 (612) Q Consensus 210 ~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g~--~l~~IP~v~~~~~~~-~~~~~~pPLl 286 (612) .+...+.+|..... ....++| +..++.....+ ..+.....+ +.+.+||+++..... ....+.+... T Consensus 314 ~~~~~V~v~E~y~~-~d~~gdg----~~~~~~v~~~g------~~iL~~~~~p~~~~~~PFv~~~~~p~~~~~~G~gi~~ 382 (763) T protein:vir:95 314 PMRKRVVAYEYWGF-WDIEGNG----VLEPIVATWIG------STLIRLEKNPYPDGKLPFVLIPYMPVKRDMYGEPDAE 382 (763) T ss_pred cccceEEEEEeeee-eccCCcc----eeEEEEEEEEc------CeeeecccccccCCCcCEEEecceeecCcccCCchHH Confidence 00011111111000 0001111 11111111111 111112222 235678875433211 1123444444 Q ss_pred HHHHHHHHHHhhhHHHHHHHHHhccceeee-ecCCCCCCceEEEeccccccCCCCCc----eeEEec--CchhHHHHHHH Q lcl|NC_019408. 287 DICDLNLSHYRTYAELEYGRLFTALPVYYA-PGTDSEGTGEYHIGPNMVWEVPQGSE----PGILEY--TGQGLKALETA 359 (612) Q Consensus 287 dLA~lnl~HY~~~sD~~~~l~~~~~P~l~i-~G~~~~~~~~l~iG~~~~~~lp~~~~----~~~lE~--~g~~l~~~~~~ 359 (612) .+.++.-.+=-..+-.-+++..+..|...+ .|.-. ..+.+...++..+.+-.|+. +.+..+ .+.++...... 
T Consensus 383 ~~~d~Qr~~N~~~~~~~d~l~~~~~~~~~v~~gav~-~~d~~~~~pg~v~~v~~g~~~~~~~~~~~~p~~~~~~~~~l~~ 461 (763) T protein:vir:95 383 LLGDNQAVLGAVMRGMIDLLGRSANGQRGMPKGMLD-ALNSRRYREGEDYEYNPTQNPAQMIIEHKFPELPQSALTMATL 461 (763) T ss_pred HhhHHHHHHHHHHHHHHHHHHhhcCCcEEeeccccc-chhhhcccCCceEEeeCCCChhhhcccccCCCCcchHHHHHHH Confidence 444443322222344556777778876555 33311 12223334444444322322 223332 22232222222 Q ss_pred HHHHHHHHHHHHHHhh-hccc-cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHcCCcCCCCcceE Q lcl|NC_019408. 360 LNDKERQIAAIGGRMM-PGAS-KSVSESNNQTVLREANEQSLLLNIIQACESGMTD----VVRWWLMWRDVPLADTENLR 433 (612) Q Consensus 360 l~~~e~qm~~lGa~ll-~~~~-~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~----~l~~~a~w~g~~~~~~~~~~ 433 (612) ++...+.+ .|..-. .... ...+.|++....-.......+..++.++.+++.. +|.++..|... +.-++ T Consensus 462 ~~~~~e~~--TGv~~~~~G~~~~~~~~tat~v~~l~qa~~~~~~~~~r~~~~~~k~l~~~~l~Li~q~~d~----~rviR 535 (763) T protein:vir:95 462 QNQEAESL--TGVKAFAGGVTGESYGDVAAGIRGVLDAASKREMAILRRLAKGMSEIGNKIIAMNAVFLAE----HEVVR 535 (763) T ss_pred HHHHHHHh--hCcchhhcCcCcccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCC----CcEEE Confidence 22222222 122111 0101 1113344333333333333456667777666554 45555555432 11111 Q ss_pred E------Eee-----ccccc------cCCCHHHHHHHHHHHHc--CCCCHHHHHHHHHh-cCccchhhhhHHHHHHhhcc Q lcl|NC_019408. 434 Y------EVN-----TDFLS------TPIGAREMRAIQLMAND--GLLPDPVFYEYMRK-AEVISSDMTFEEFQALRADE 493 (612) Q Consensus 434 v------~ln-----~dF~~------~~~d~~~~~al~~~~~~--G~is~et~~~~lqr-~~vl~~~~~~eee~~ria~e 493 (612) | .++ .+|+. ...+.+.+..+..+.+. ..+........|.+ .++.. ..+....+... 
T Consensus 536 I~g~e~v~v~~~~~~~~~DV~V~~~~as~~~q~~~~l~~ll~~l~~~~~~~~~~~il~~~~d~~~----~~~~~~~lr~~ 611 (763) T protein:vir:95 536 ITNEEFVTIKREDLKGNFDLEVDISTAEVDNQKSQDLGFMLQTIGPNVDQQITLNILAEIADLKR----MPKLAHDLRTW 611 (763) T ss_pred EeCCccccccHHHhcCCcceEEecccchHHHHHHHHHHHHHHHhccccChHHHHHHHHHHHhhhc----hhhhHHHHHhc Confidence 1 111 12321 11112223333333222 22332221111111 01111 11111222221 Q ss_pred ccccccchhHHhhhhhhHHHHhHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---H Q lcl|NC_019408. 494 NSFINNPDAQARQRGYTNRGQELEQSRMAREADFTQQK-----IDIQERSVAVQEGHAEVAHAAGSTSISGSRKLG---D 565 (612) Q Consensus 494 ~~~~~~~~~~~~~~~e~~r~~~~e~~r~~~e~e~~~q~-----~e~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~~---~ 565 (612) .+. ..+.+ ++++++++++.+.+.+..+.+ ++.+..+++.+..+.++++++.+.+++.+++.. . T Consensus 612 q~~-~d~~~--------q~qaqle~~~~q~e~~~~~akaq~~qaqa~~~~aq~e~~~~d~~~~e~~~Q~~~e~~~~~~~~ 682 (763) T protein:vir:95 612 QPQ-PDPVQ--------EQLKQLAVEKAQLENEELRSKIRLNDAQAQKAMAERDNKNLDYLEQESGTKHARDLEKMKAQS 682 (763) T ss_pred CCC-ccchh--------hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111 01111 111122222221111111111 111111111111122222222222222221110 1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccccCCC---------chhhcCCCCCcccCCC Q lcl|NC_019408. 566 PEQAKPAVADQATIDNAKKQTANAAKVAAQP---------PAPAAPGAPPTNRRPT 612 (612) Q Consensus 566 e~q~k~~~~eq~~~~~~~k~~~~~a~~~~~~---------~~~~~~~~~~~~~~~~ 612 (612) +.+++.+... .+-++..++++.... 
+.....+- ++..+++ T Consensus 683 eaq~~l~~~~------a~~~~~~ea~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 731 (763) T protein:vir:95 683 QGNQQLEITK------ALTKPRKEGELPPNLSAAIGYNALTNGEDTGI-QSVSERD 731 (763) T ss_pred HHHHHHHHHH------HHHHHHHHhccChhHHHhhhhcccccccCCCc-cchhhcc Confidence 1111111100 011111111111111 11111011 1111111 No 87 >protein:vir:77597 Length: 725 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:YP_063735;genbank:gi:51236726;genbank:GeneID:2944241 Probab=99.07 E-value=3e-09 Score=67.32 Aligned_cols=578 Identities=10% Similarity=0.002 Sum_probs=173.2 Q ss_pred cHHHHHHHHH-HHHHHHHhcChHHHHhcc---cccC--CCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCcee Q lcl|NC_019408. 4 HPEYQYWRPE-WTKLRDVMAGQREIKRKA---EAYL--PAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIV 77 (612) Q Consensus 4 hP~y~~~~~~-W~~i~d~~~G~~~vr~~g---~~YL--Pk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~ 77 (612) -|+=...+.. -..++.++.....+|... ..|. =+|+.+.....+. ..|-+ +|.++++|++++|.-=+..+.+ T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~-q~rp~-~N~i~~~i~~v~g~~~~nr~d~ 78 (725) T protein:vir:77 1 MADNENRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTL-QYRGQ-FDVVRPVVRKLVSEMRQNPIDV 78 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhCCCCCCHHHHHHHHh-cCCCc-cccHHHHHHHHHhhHHhCCcce Confidence 2221111111 112223333333333211 1121 2454444443333 34444 5999999999999887777766 Q ss_pred ecCC---------HHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEE--EEecCcchhhh--hccCceEEEechhh-h Q lcl|NC_019408. 78 KNLP---------PKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGV--LVDVVDNPRKG--AVATSFAVGYSAEN-I 143 (612) Q Consensus 78 ~~~p---------~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~v--lVD~p~a~~~~--~~~rPy~~~~~ae~-I 143 (612) .-+| +.|..++..+ .+-++.+.-+..+|..++..|.+|+ ..||...+... 
...+.+.+..++.+ + T Consensus 79 ~v~P~~~~d~~~Ae~l~~~~~~~-~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~i~~~~~~~~~~~v~ 157 (725) T protein:vir:77 79 LYRPKDGARPDAADVLMGMYRTD-MRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACSHVI 157 (725) T ss_pred EEecCCccHHHHHHHHHHHHHHH-HHhhCchhHHHHHHHHHhhcCcceeeeeecccCCCCCCCceeeEEeecccChhhce Confidence 3233 2234444333 2235555668899999999999984 45774332211 11122222223333 3 Q ss_pred hcchhhhccCCccceeEEEEEEEeecccc-----ccC---CCccccc----------ceeeeeeEeeecccccccceeec Q lcl|NC_019408. 144 LDWDEVVDMGGFYVPSRVLLREFVRDLRW-----KSD---IEPLTTA----------QARKARAAALASGSASSPMVRQT 205 (612) Q Consensus 144 inW~~~~~v~g~~~Lt~v~l~E~v~~~~~-----~~~---~d~f~~~----------~~~q~r~l~l~~g~~~~~~~~~~ 205 (612) +||..+. .|.. .--|+.++..+....+ ..+ .+.++.. ....+|+.. .-+...+.... T Consensus 158 ~Dp~a~~-~D~s-Dar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~vrv~E---~~~r~~~~~~~ 232 (725) T protein:vir:77 158 WDSNSKL-MDKS-DARHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQDTIQIAE---FYEVVEKKETA 232 (725) T ss_pred eCchhhc-cChh-hHHHHHHHhcCCHHHHHHHHhhCCcchhhcccccccccccccccCCCeeEEEE---EEEEEEEeeEE Confidence 4443221 1110 0011111111110000 000 0000000 000011100 00000000000 Q ss_pred ccccccccc-eeeeeeeeccccc---cccccc--cce----eEEEEEeeCCCceecceeeeccCCc--cccceeEEEeec Q lcl|NC_019408. 206 ARTLGGYSY-ITVYRELKLEEIE---WPSGEV--KLA----YVQYLYEEDPESRPIARIVPTVRGE--PLDFIPFKFFGA 273 (612) Q Consensus 206 ~~~~~g~~~-~~~~R~~~~~~~~---~~~g~~--~~~----~~~~~~~~~~~~~~~~~~~p~~~g~--~l~~IP~v~~~~ 273 (612) .....+... ...|-........ ...|.+ ... ....++...|.. .+.+.. |-++||||+|.. 
T Consensus 233 ~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~rv~~~~~~g~~-------~l~~~~~~~~~~~P~vP~~g 305 (725) T protein:vir:77 233 FIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTA-------VLKDKQLIAGEHIPIVPVFG 305 (725) T ss_pred EEecCCCCcceeecChhhHHHHHHHhhhcCchhhhhcccceeeeeEeeecCce-------eeccCCcCCCCccceEEEee Confidence 000000000 0000000000000 000100 000 000111111111 011111 224556664322 Q ss_pred CCCCCCcCcCchH----HHHHH-HHHHHhhhHHHHHHHHHh-ccceeeeecCCC----CCCc--eEEEeccccccCCCCC Q lcl|NC_019408. 274 SGNTADVEKPPLL----DICDL-NLSHYRTYAELEYGRLFT-ALPVYYAPGTDS----EGTG--EYHIGPNMVWEVPQGS 341 (612) Q Consensus 274 ~~~~~~~~~pPLl----dLA~l-nl~HY~~~sD~~~~l~~~-~~P~l~i~G~~~----~~~~--~l~iG~~~~~~lp~~~ 341 (612) ... +-.+.|-.. ++-+. -.-.++.|.-+ +++..+ ..+..+-.|..+ .|.. .......+.+....|+ T Consensus 306 ~r~-~~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~-~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~ 383 (725) T protein:vir:77 306 EWG-FVEDKEVYEGVVRLTKDGQRLRNMIMSFNA-DIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENSGD 383 (725) T ss_pred eee-ccCCcccccchhhhhhhHHHHHHHHHHHHH-HHHHhccccccccchhhhhHHHHHHHhccCCceecccccccCCCc Confidence 110 011111111 11111 11122333323 333332 222222222111 1110 1111222223333332 Q ss_pred ----ceeEEecCchhHHHHHHHHHHHHHHHHHH-HHH--hhhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 342 ----EPGILEYTGQGLKALETALNDKERQIAAI-GGR--MMPGASKSVSESNNQTVLREANEQSLLLNIIQACESGMTDV 414 (612) Q Consensus 342 ----~~~~lE~~g~~l~~~~~~l~~~e~qm~~l-Ga~--ll~~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~~ 414 (612) .+.++++..- -..+.+-|+.....|..+ |.. ++-.. 
....|+.+...+..+....|+.+..|+..+...+ T Consensus 384 ~~~~~i~~~~~~~l-p~~~~~ll~~~~~~i~~~tGi~~~~lG~~--~n~~SG~ai~~rq~qg~~~~~~~~Dnl~~~~~~~ 460 (725) T protein:vir:77 384 LPTQPLAYYENPEV-PQANAYMLEAATSAVKEVATLGVDTEAVN--GGQVAFDTVNQLNMRADLETYVFQDNLATAMRRD 460 (725) T ss_pred ccccCccccCCCCc-hHHHHHHHHHHHHHHHHHhCCCHHHhCCC--chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3444443322 234455666666666443 433 23222 2235777777777777777888888888887765 Q ss_pred ----HHHHHHHcCCc------CCCCcceEEEeec------------------cccc--cC------CCHHHHHHHHHHHH Q lcl|NC_019408. 415 ----VRWWLMWRDVP------LADTENLRYEVNT------------------DFLS--TP------IGAREMRAIQLMAN 458 (612) Q Consensus 415 ----l~~~a~w~g~~------~~~~~~~~v~ln~------------------dF~~--~~------~d~~~~~al~~~~~ 458 (612) |.++..+++.. ..+...-.+.||. .|+. .. ...+.+.+|++++. T Consensus 461 g~~lL~lI~~~~~~~rv~RI~~ed~~~~~v~in~~~~~~~~G~~~~~NDi~g~~Dv~v~~~p~~~s~r~~~~~~l~qll~ 540 (725) T protein:vir:77 461 GEIYQSIVNDIYDVPRNVTITLEDGSEKDVQLMAEVVDLATGEKQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLG 540 (725) T ss_pred HHHHHHHHHHHcCCCcEEEEecCCCCcceeeecccccccccchhHhhhhhccceeeEEeeccchHHHHHHHHHHHHHHHH Confidence 67777776311 0011111233331 0211 11 01233455555554 Q ss_pred cCCCCH----HHHHHHHHhcCccchhhhhHHHHHHhhccccccccc------hhHH----hhhhhhHHHHhHHH-HHH-- Q lcl|NC_019408. 459 DGLLPD----PVFYEYMRKAEVISSDMTFEEFQALRADENSFINNP------DAQA----RQRGYTNRGQELEQ-SRM-- 521 (612) Q Consensus 459 ~G~is~----et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~------~~~~----~~~~e~~r~~~~e~-~r~-- 521 (612) +.--.. .++..++ .+++- -..++..+++..+.+..... .++. .+.+..+.+.+..+ +.. 
T Consensus 541 ~~~~~~~~~~~~l~~~~---~l~d~-~~~~e~~erirkq~~~~~~~q~~~~~e~q~~~~~qq~~~~q~~~e~~q~q~~~~ 616 (725) T protein:vir:77 541 KTPQGTPEYQLLLLQYF---TLLDG-KGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQGVLL 616 (725) T ss_pred hccccchhHHHHHHHhh---ccccc-hHHHHHHHHHHhhhhhhhccCCCChhhHHHHHHHHHHHHHhHHHHHHHHHHHHH Confidence 322100 1111111 11110 01133334443322111000 0000 00000000000000 000 Q ss_pred HHHHHHHH---------HHHHHHHHHHHHHHH----------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 522 AREADFTQ---------QKIDIQERSVAVQEG----------------HAEVAHAAGSTSISGSRKLGDPEQAKPAVADQ 576 (612) Q Consensus 522 ~~e~e~~~---------q~~e~~~r~~~~~~~----------------r~~~e~~~~~~~~~~~r~~~~e~q~k~~~~eq 576 (612) ..+++.++ ..+..++.+++..+. ++++.....+.+++.+++.....+.+.+..++ T Consensus 617 ~~qa~~~kaq~e~~k~q~~a~~~~~~a~~~aa~~~~~~~q~~~~q~a~~~~~~~~~~~~q~~~~~~~~~~ae~~~~~~~~ 696 (725) T protein:vir:77 617 QGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDARANAELLLKGDEQ 696 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHhhhH Confidence 00001000 000000011000000 00000000000011111110000000000000 Q ss_pred HHHHHHHHHHHhhccccCCCchhhcCCCCC Q lcl|NC_019408. 577 ATIDNAKKQTANAAKVAAQPPAPAAPGAPP 606 (612) Q Consensus 577 ~~~~~~~k~~~~~a~~~~~~~~~~~~~~~~ 606 (612) .+....+-..++.......+|...+ ..|. T Consensus 697 ~~~q~~~~~~~~~~~~~~~~~~~~~-~~~~ 725 (725) T protein:vir:77 697 THKQRMDIANILQSQRQNQPSGSVA-ETPQ 725 (725) T ss_pred HHhhHHHHHHHHHHHHhcCCCcCcc-cCCC Confidence 0000000011122223333333333 1121 No 88 >protein:vir:172 Length: 708 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112077;genbank:gi:13559867;genbank:GeneID:920970 Probab=99.06 E-value=3.4e-09 Score=67.03 Aligned_cols=579 Identities=11% Similarity=0.034 Sum_probs=187.8 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhc-----ccccC--CCCCCCCHHHHHHHHh----hccCCchHHHHHHHhhch Q lcl|NC_019408. 
1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRK-----AEAYL--PAMKGADGDDYAIYLQ----RATFFNMLAQTRDGMTGM 69 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~-----g~~YL--Pk~~~e~~~~Y~~rl~----rA~~~n~~~~tv~~~~G~ 69 (612) |=-|=+ ......+.+++.++.....++.. .-.|. =+|+.+.....+.+-+ -.+-+|.++.+|+.++|. T Consensus 1 ma~~~~-~~~~~~~~r~~~~~~~~~~~r~~~~~d~~f~~y~G~Qw~~~~~~~l~~~~q~~~rP~~~~N~i~~~i~~v~g~ 79 (708) T protein:vir:17 1 MAETLE-KKHERIMLRFDRAYSPQQEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAE 79 (708) T ss_pred CchhHH-HHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhccCCCCCCHHHHHHHHhhhhhcCCCceEEcchHHHHHHHHhh Confidence 221111 11222233344454444333321 11233 2555555555554433 256689999999999999 Q ss_pred hhcCCceeecCCH----------HHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEE--EEecCcchhhh-hccC-ceE Q lcl|NC_019408. 70 VFRRDPIVKNLPP----------KFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGV--LVDVVDNPRKG-AVAT-SFA 135 (612) Q Consensus 70 vf~k~p~~~~~p~----------~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~v--lVD~p~a~~~~-~~~r-Py~ 135 (612) ==+..+.+.-.|. .|..++..+-- -++.+.-+..+|..++..|.+|+ ..||-..+... -..+ ++. T Consensus 80 e~~nr~d~~v~p~~~~~d~~~Ae~l~~l~~~~~~-~~~~~~~~s~Af~~~i~~G~G~~~~~~d~~~e~d~~~~~~~i~i~ 158 (708) T protein:vir:17 80 YRNNRITVKFRPGDREASEELANKLNGLFRADYE-ETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAIE 158 (708) T ss_pred HhhCCcceEEecCCCcchHHHHHHHHHHHHHHHH-hcCchhHHhHHHHHhhhcccceeeeeecccccCCCCCCccccceE Confidence 6666665532221 23333332222 23355568999999999999998 44552211100 0112 233 Q ss_pred EEe-chhhh-hcchhhhccCCccceeEEEEEEEeeccc----cc-cCCCcccccc----------eeeeeeEeeeccccc Q lcl|NC_019408. 136 VGY-SAENI-LDWDEVVDMGGFYVPSRVLLREFVRDLR----WK-SDIEPLTTAQ----------ARKARAAALASGSAS 198 (612) Q Consensus 136 ~~~-~ae~I-inW~~~~~v~g~~~Lt~v~l~E~v~~~~----~~-~~~d~f~~~~----------~~q~r~l~l~~g~~~ 198 (612) ..+ ++.+| +||.... .|.. ...|+..+..+.-.. +. .....|.... ...+|+.. +. 
T Consensus 159 ~~~~~~~~v~~Dp~a~~-~D~s-Dar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~~~~~~~~d~vrv~e-----~~ 231 (708) T protein:vir:17 159 PIYDPSRSVWFDPDAKK-YDKS-DALWAFCMYSLSPEKYEAEYGKKPPASLDVTSMTSWEYDWFDADVIYIAK-----YY 231 (708) T ss_pred eeccchhheecCccccc-cChh-hhhhhhhhccCCHHHHHHhCccccchhhhhhhhccccccccCCCeEEEEE-----EE Confidence 333 33566 5554322 1211 122222221111000 00 0000000000 00111100 00 Q ss_pred ccceeecc--ccccccc-ceeeeeeeeccccc--cc-ccc--ccc----eeEEEEEeeCCCceecceeeeccCCccccce Q lcl|NC_019408. 199 SPMVRQTA--RTLGGYS-YITVYRELKLEEIE--WP-SGE--VKL----AYVQYLYEEDPESRPIARIVPTVRGEPLDFI 266 (612) Q Consensus 199 ~~~~~~~~--~~~~g~~-~~~~~R~~~~~~~~--~~-~g~--~~~----~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~I 266 (612) ++.+.... ....+.. .+..|-....+... .. .|. +.. +..+.++-..+.... +. | .--|.+++ T Consensus 232 ~r~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~l--~~-~--~~~p~~~f 306 (708) T protein:vir:17 232 EVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFQEVARRSVKRRRVYVSVVDGDGFL--EK-P--RRIPGEHI 306 (708) T ss_pred EEeeeeeEEEEEecCccCceeeeCccchhhHHHHHHhcccccceeeeeeEEEEEEEeecccccc--cC-C--CCCCCCcc Confidence 00000000 0000000 00011000000000 00 000 000 000111111111100 00 0 01234566 Q ss_pred eEEEeecCCCCCCcCcCchH----HHHHH-HHHHHhhhHHHHHHHHHhccceee----eecCCCCCCce-------E--- Q lcl|NC_019408. 267 PFKFFGASGNTADVEKPPLL----DICDL-NLSHYRTYAELEYGRLFTALPVYY----APGTDSEGTGE-------Y--- 327 (612) Q Consensus 267 P~v~~~~~~~~~~~~~pPLl----dLA~l-nl~HY~~~sD~~~~l~~~~~P~l~----i~G~~~~~~~~-------l--- 327 (612) |+|+|...... ..+.|-.. ++-+. 
..-.|+.|.-++++......+..+ +.|+...|..- + T Consensus 307 P~vP~~g~r~~-~d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~a~~g~~~~~~~~~~~~~~~~~~~ 385 (708) T protein:vir:17 307 PLIPVYGKRWF-IDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLR 385 (708) T ss_pred ceEEEeccccc-ccCCCcccchhhhchhHHHHHHHHHHHHHHHHHhcCCcceeechhhhhhhHHhhhhcccchhhhhhhh Confidence 66655432221 11222111 11111 112234444444443333332222 22333222211 0 Q ss_pred EEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHH-HHHhhhccccchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 328 HIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAI-GGRMMPGASKSVSESNNQTVLREANEQSLLLNIIQA 406 (612) Q Consensus 328 ~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~l-Ga~ll~~~~~~~~esa~~~~~~~~~~~s~L~~~a~~ 406 (612) .++....+..+....++.+++..-+ ....+-|+.....|... |..-... +..++.|+.+...+..+..-.|+.+-.| T Consensus 386 ~~~~~~g~v~~~a~~~~~~~~~~~~-~~~~~llq~~~~~i~~~tGi~d~~~-G~~sn~SG~Ai~~rq~qg~~~~~~~~Dn 463 (708) T protein:vir:17 386 EVRDKYGNIIAGATPAGYTQPAVMN-QALAALLQQTSADIQEVTGGSQAMQ-QMPSNIAQETVNNLMNRADMASFIYLDN 463 (708) T ss_pred ccCCcccccccccCCcccCCCcccc-HHHHHHHHHHHHHHHHhcCCChHHc-cCccchHHHHHHHHHHHHHHHHHHHHHH Confidence 1111112122222234455533222 34455566666655444 3222111 1122347777776666666667777777 Q ss_pred HHHHHH----HHHHHHHHHcCCcC------CCCcceEEEeec---------------------ccccc------CCCHHH Q lcl|NC_019408. 407 CESGMT----DVVRWWLMWRDVPL------ADTENLRYEVNT---------------------DFLST------PIGARE 449 (612) Q Consensus 407 ~~~a~~----~~l~~~a~w~g~~~------~~~~~~~v~ln~---------------------dF~~~------~~d~~~ 449 (612) +..+.. .+|.++..+++..- .++..-.|.+|. |+... ....+. 
T Consensus 464 l~~~~~~~g~~lL~lI~~~y~~~R~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~~Dv~v~~~p~~~t~r~~~ 543 (708) T protein:vir:17 464 MAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDAT 543 (708) T ss_pred HHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceeeecceeccCCCccceeeccceeeeeeEEEecccCchhHHHHH Confidence 766655 46677777764210 001111112221 11111 112334 Q ss_pred HHHHHHHHHcCCCC---HHHHHH-HHHhcCccchhhhhHHHHHHhhccccccccchhHHhhhhhhHHHHhHHHHHHHHHH Q lcl|NC_019408. 450 MRAIQLMANDGLLP---DPVFYE-YMRKAEVISSDMTFEEFQALRADENSFINNPDAQARQRGYTNRGQELEQSRMAREA 525 (612) Q Consensus 450 ~~al~~~~~~G~is---~et~~~-~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~e~~r~~~e~ 525 (612) +.+|++++.+.... ...+.. .+.-..+.. .++..+++..+.+..+..++... +.+++....++.+.+.. T Consensus 544 ~~~l~qll~~~~~~~~~~~~~~~l~l~~~D~p~----~~ei~e~ir~~~~~~~~~~~~~~---e~~q~~~q~qq~~q~q~ 616 (708) T protein:vir:17 544 VSVLTNVLSSMLPADPMRPAIQGIILDNIDGEG----LDDFKEYNRNQLLISGIAKPRNE---KEQQIVQQAQMAAQSQP 616 (708) T ss_pred HHHHHHHHHhcCCccchhHHHHHHHHHhcCCCC----hHHHHHHHHHHhhccccccCcch---hhHHHHHHHHHHHHHHH Confidence 55666666654321 111111 122222211 13344444433222111111110 00000000000000001 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------------HHHHHHH-HHHHHHHHHhhcc Q lcl|NC_019408. 526 DFTQQKIDIQERSVAVQEGHAEVAHAAGSTSISGSRKLGDPEQAKP-------------AVADQAT-IDNAKKQTANAAK 591 (612) Q Consensus 526 e~~~q~~e~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~e~q~k~-------------~~~eq~~-~~~~~k~~~~~a~ 591 (612) +.+++++..+..+.+.+.++.++++.+.+.....++..+++.+.+. ++.++.+ ....++.++ . T Consensus 617 ~~~~~eaqa~~~~~qAe~~ka~aea~~~q~~a~q~~~~~~~a~~~a~q~~~q~~~~~~~~~~~~~~~l~~~q~~q~---q 693 (708) T protein:vir:17 617 NPEMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVMEAIRLLKDVAESQQ---Q 693 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhHH---H Confidence 1111122222222222222222222221111000000000000000 0001111 111112221 2 Q ss_pred ccCCCchhhcCCCCCc Q lcl|NC_019408. 
592 VAAQPPAPAAPGAPPT 607 (612) Q Consensus 592 ~~~~~~~~~~~~~~~~ 607 (612) .++++|.... -.||+ T Consensus 694 ~~~a~p~~~~-~~~~~ 708 (708) T protein:vir:17 694 QFQSPPQSPA-DLMPS 708 (708) T ss_pred HHhccccCch-hccCC Confidence 2111211111 22344 No 89 >protein:vir:105429 Length: 708 # NCBI annotation: gene 3 protein # Family: family:all:487 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958179;genbank:gi:41057281;genbank:GeneID:2716676 Probab=99.02 E-value=5e-09 Score=66.09 Aligned_cols=568 Identities=11% Similarity=0.040 Sum_probs=189.2 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcc---cccC----CCCCCCCHHHHHHHHhh----ccCCchHHHHHHHhhch Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKA---EAYL----PAMKGADGDDYAIYLQR----ATFFNMLAQTRDGMTGM 69 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g---~~YL----Pk~~~e~~~~Y~~rl~r----A~~~n~~~~tv~~~~G~ 69 (612) |=-+=+ .-....+..+..+......++... ..|- =+|+.+.....+.+.+- .+-+|.++.+|+..+|. T Consensus 1 m~~~~~-~~~~~~~~~~~~~~~~~~~~r~~~~~D~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~i~~~v~~v~g~ 79 (708) T protein:vir:10 1 MAETLE-KKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAE 79 (708) T ss_pred CchhHH-HHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCCCHHHHHHHHHhhhhcCCCceEEcchHHHHHHHHHH Confidence 111100 000111122233333333333211 1122 26666665556555542 56689999999999999 Q ss_pred hhcCCceeecCCH----------HHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEE--ecCcchhhhh--ccCceE Q lcl|NC_019408. 70 VFRRDPIVKNLPP----------KFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLV--DVVDNPRKGA--VATSFA 135 (612) Q Consensus 70 vf~k~p~~~~~p~----------~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlV--D~p~a~~~~~--~~rPy~ 135 (612) -=+..+.+.-.|. .|..++..+- +-++.+.-+..+|..++..|++|+=| ||-..++... ..-|+. 
T Consensus 80 ~~~nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~-~~~~~~~~~s~Af~d~i~~G~Gw~~~~~d~~~e~d~~~~~~~i~i~ 158 (708) T protein:vir:10 80 YRNNRITVKFRPGDREASEELANKLNGLFRADY-EETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAIE 158 (708) T ss_pred HHhCCcceEEEcCCCCchHHHHHHHHHHHHHHH-HhcCchHHHHHHHHhhhhcccceeeeeeccccccCCCCCccccceE Confidence 8888887742222 2333322221 23446667999999999999999744 5422111000 112444 Q ss_pred EEech-hhh-hcchhhhccCCccceeEEEEEEEeeccc----ccc------CCCcccc-------cceeeeeeEeeeccc Q lcl|NC_019408. 136 VGYSA-ENI-LDWDEVVDMGGFYVPSRVLLREFVRDLR----WKS------DIEPLTT-------AQARKARAAALASGS 196 (612) Q Consensus 136 ~~~~a-e~I-inW~~~~~v~g~~~Lt~v~l~E~v~~~~----~~~------~~d~f~~-------~~~~q~r~l~l~~g~ 196 (612) ..+.| .+| +||.... .|.. ..-|+..+..+.... |.+ +.+.++. .+...+.+ T Consensus 159 ~~~~p~~~v~~Dp~a~~-~D~s-Dar~~~~~~~~~~d~~~~~~p~~a~~~~d~~~~~~~~~~~~~~d~v~v~e------- 229 (708) T protein:vir:10 159 PIYDPSRSVWFDPDAKK-YDKS-DALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAK------- 229 (708) T ss_pred EeecchhhcccCccccc-cChh-hhhhhhhccCCCHHHHHHhCCCCcccccccccCCCccccccCCCceEEEE------- Confidence 45544 455 5654321 1222 122333222111100 000 0000000 00000000 Q ss_pred ccccceee--c---ccccccccceeeeeeeecc----------ccccccccccceeEEEEEeeCCCceecceeeeccCCc Q lcl|NC_019408. 197 ASSPMVRQ--T---ARTLGGYSYITVYRELKLE----------EIEWPSGEVKLAYVQYLYEEDPESRPIARIVPTVRGE 261 (612) Q Consensus 197 ~~~~~~~~--~---~~~~~g~~~~~~~R~~~~~----------~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g~ 261 (612) +..+.+.. . .....|. +..|-..... ........+... .+.++-..+... -+. 
| +-- T Consensus 230 y~~r~~~~~~~~~~~~~~tg~--~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~-~v~~~~~~g~~~--le~-~--~~~ 301 (708) T protein:vir:10 230 YYEVRKESVDVISYRHPITGE--IATYDSDQVEDIEDELAIAGFHEVARRSVKRR-RVYVSVVDGDGF--LEK-P--RRI 301 (708) T ss_pred eeeEEEEEEEEEEEecCCCCc--eeeecchhhhhHHHHHHhcccchhheeeeeeE-EEEEEeecchhh--hcc-C--CCC Confidence 00000000 0 0000000 0000000000 000000001110 011111111100 000 0 112 Q ss_pred cccceeEEEeecCCC-----C--CCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhccceeee-----ecCCCCCCce--- Q lcl|NC_019408. 262 PLDFIPFKFFGASGN-----T--ADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALPVYYA-----PGTDSEGTGE--- 326 (612) Q Consensus 262 ~l~~IP~v~~~~~~~-----~--~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i-----~G~~~~~~~~--- 326 (612) |.++||+|+|..... . ++.-. .+.|.=. .+. |+.|. +.+++-.++-...++ .|+...|... T Consensus 302 p~~~fP~vP~~g~r~~~d~~~~~yG~vr-~~kd~Q~-~~N-~~~S~-~~~~~a~~~~~~~i~~~~~i~~~~~~~~~~~~~ 377 (708) T protein:vir:10 302 PGEHIPLIPVYGKRWFIDDIERVEGHIA-KAMDPQR-LYN-LQVSM-LADTAAQDPGQIPIVGMEQIRGLEKHWEARNKK 377 (708) T ss_pred CCCceeeEEEeeeeeccCCCcccceeec-ccchhHH-HHH-HHHHH-HHHHHHhcCCcccccChhhhhhHHHHHhhcccc Confidence 455666665432211 1 11111 1111111 111 22222 344444444433333 2322221110 Q ss_pred ----EE---EeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHH-HHHhhhccccchhHHHHHHHHHHHHHHH Q lcl|NC_019408. 327 ----YH---IGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAI-GGRMMPGASKSVSESNNQTVLREANEQS 398 (612) Q Consensus 327 ----l~---iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~l-Ga~ll~~~~~~~~esa~~~~~~~~~~~s 398 (612) +. 
++.......+....++.+++..-+ ..+.+-|+.....|..+ |..--..+ ...+.|+.+...+..+..- T Consensus 378 ~~~~~~~~~~~~~~G~~~~~~~~~~~~q~~~~~-~~~~~l~q~~~~~i~~vsG~~~~~lG-~~sn~SG~aI~~rq~qg~~ 455 (708) T protein:vir:10 378 RPAFLPLREVRDKSGNIIAGATPAGYTQPAVMN-QALAALLQQTSADIQEVTGGSQAMQQ-MPSNIAQETVNNLMNRADM 455 (708) T ss_pred chhhhccccccccccccccccCCccccCCccch-HHHHHHHHHHHHHHHHHhCcChhHcc-CccchHHHHHHHHHHHHHH Confidence 00 222222222333346666653322 22344455555555443 43221111 1223467666666666666 Q ss_pred HHHHHHHHHHHHHHH----HHHHHHHHcCCcC------CCCcceEEEeec---------------------cc------c Q lcl|NC_019408. 399 LLLNIIQACESGMTD----VVRWWLMWRDVPL------ADTENLRYEVNT---------------------DF------L 441 (612) Q Consensus 399 ~L~~~a~~~~~a~~~----~l~~~a~w~g~~~------~~~~~~~v~ln~---------------------dF------~ 441 (612) .|+.+-.|+..+... +|.++..|++..- .++..-.+.+|. |+ . T Consensus 456 ~l~~~~Dnl~~~~~~~g~~lL~li~~~y~~er~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~yDv~i~~~p~ 535 (708) T protein:vir:10 456 ASFIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPS 535 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceEEecceeccCCCcceeeeeccceeeEEEEEecccC Confidence 778888888777765 5666667653110 000000111110 11 1 Q ss_pred ccCCCHHHHHHHHHHHHcCCCCHH---HHHH-HHHhcCccchhhhhHHHHHHhhccccccccchhH--Hhhhhhh-HHHH Q lcl|NC_019408. 442 STPIGAREMRAIQLMANDGLLPDP---VFYE-YMRKAEVISSDMTFEEFQALRADENSFINNPDAQ--ARQRGYT-NRGQ 514 (612) Q Consensus 442 ~~~~d~~~~~al~~~~~~G~is~e---t~~~-~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~--~~~~~e~-~r~~ 514 (612) ......+.+.+|++++.+...... .+.. .+.-..+. ..++..+++..+.+..+..++. ..++... +++. 
T Consensus 536 ~~s~r~~~~~~l~qll~~~~p~~~~~~~~~~~~l~~~D~p----~~~ei~erir~~~~~~~~~~~~~~ee~q~~~~~q~~ 611 (708) T protein:vir:10 536 YTARRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGE----GLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMA 611 (708) T ss_pred chhHHHHHHHHHHHHHHhcCCCchhhHHHHHHHHHhcCCc----ChHHHHHHHHHhhcccccccccchhhHHHHHHHHHH Confidence 111223456777777776543211 1111 12112221 1234445554433222221111 1110000 1111 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HH------------HHH-HH Q lcl|NC_019408. 515 ELEQSRMAREADFTQQKIDIQERSVAVQEGHAEVAHAAGSTSISGSRKLGDPEQAKPAV-AD------------QAT-ID 580 (612) Q Consensus 515 ~~e~~r~~~e~e~~~q~~e~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~e~q~k~~~-~e------------q~~-~~ 580 (612) +.++ .+.+.+++..+..+.+.+.++.++++.+.+.....++..+++.+.+.++ .+ ..+ .. T Consensus 612 ~q~q------~~~~~~e~qa~~~~~qAe~~ka~a~a~~~~~~a~q~~~~~~~a~~~a~q~~~~a~~~~~~~~~~~~q~l~ 685 (708) T protein:vir:10 612 AQSQ------PNPEMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVMEAIRLLK 685 (708) T ss_pred HHHH------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 1111 0111111111111222222222222221111111111111111111110 00 000 01 Q ss_pred HHHHHHHhhccccCCCchhhcCCCCCc Q lcl|NC_019408. 581 NAKKQTANAAKVAAQPPAPAAPGAPPT 607 (612) Q Consensus 581 ~~~k~~~~~a~~~~~~~~~~~~~~~~~ 607 (612) ..++.++..++ ++|.+.- -.||+ T Consensus 686 ~~q~~q~~~~~-~~p~~~~---~~~p~ 708 (708) T protein:vir:10 686 DVAESQQQQFQ-SPPQSPA---DLMPS 708 (708) T ss_pred hhhhhHHHHHh-ccccCch---hccCC Confidence 11111111111 1222111 22344 No 90 >protein:vir:105520 Length: 706 # NCBI annotation: phage portal protein # Family: family:all:487 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516189;genbank:gi:89885992;genbank:GeneID:3964380 Probab=98.93 E-value=1.3e-08 Score=63.74 Aligned_cols=574 Identities=12% Similarity=0.026 Sum_probs=195.2 Q ss_pred CCCcHHHHH-HHHHHHHHHHHhcChHHHHhcc---cccC--C--CCCCCCHHHHHHHHh----hccCCchHHHHHHHhhc Q lcl|NC_019408. 
1 MVTHPEYQY-WRPEWTKLRDVMAGQREIKRKA---EAYL--P--AMKGADGDDYAIYLQ----RATFFNMLAQTRDGMTG 68 (612) Q Consensus 1 ~~~hP~y~~-~~~~W~~i~d~~~G~~~vr~~g---~~YL--P--k~~~e~~~~Y~~rl~----rA~~~n~~~~tv~~~~G 68 (612) |-- .... ....+..++.+..+...++... ..|. + +|+.+.....+.+-+ -.+.+|.++.+|+..+| T Consensus 1 m~e--~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~i~~~v~~v~g 78 (706) T protein:vir:10 1 MAE--SRQKQHERVMLRFDRAWSPQQVVREKCIEATRFVRVPGGQWEGATVAGTKLDEQFEKYPKFEINKVATELNRIIS 78 (706) T ss_pred CCc--chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccCCHHHHHHHHhhhhhcCCCceEecchHHHHHHHhh Confidence 111 0111 1112344444555544444321 1232 1 666665555554432 36778999999999999 Q ss_pred hhhcCCceeecCC----------HHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeE--EEEecCcchhhh-hccCceE Q lcl|NC_019408. 69 MVFRRDPIVKNLP----------PKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFG--VLVDVVDNPRKG-AVATSFA 135 (612) Q Consensus 69 ~vf~k~p~~~~~p----------~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~--vlVD~p~a~~~~-~~~rPy~ 135 (612) .--+..+.+.-.| +.|..++..+ -+=++.+.-+..+|..++.+|++| +..||-...+.. ...++-+ T Consensus 79 ~~~~nr~~~~v~P~~~~~d~~~Ae~l~~l~~~~-~~~~~~~~a~s~Af~d~i~~G~G~~ev~~d~~~~~d~~~~~~~i~i 157 (706) T protein:vir:10 79 EYRNNRISVKFRPGDNAASEELANKLNGLFRAD-YEETDGGEACDNAFDDAATGGFGCFRLTTSFVNEYDPMDERQRIAV 157 (706) T ss_pred HHHhCCCceEEecCCCCchHHHHHHHHHHHHHH-HHhcCchHHHHHHHHHHhhcCcceEEeeeccccccCCCCCCcccee Confidence 9888887764233 1233343333 223456777999999999999998 455664322111 1122222 Q ss_pred -EEechh-hh-hcchhhhccCCccceeEEEEEEEeeccc----ccc---CCCc----------cccc---ceeeeeeEee Q lcl|NC_019408. 136 -VGYSAE-NI-LDWDEVVDMGGFYVPSRVLLREFVRDLR----WKS---DIEP----------LTTA---QARKARAAAL 192 (612) Q Consensus 136 -~~~~ae-~I-inW~~~~~v~g~~~Lt~v~l~E~v~~~~----~~~---~~d~----------f~~~---~~~q~r~l~l 192 (612) ..+.|- +| +||.. ...++.. -.|+..+..+.... +.+ ..+. +... 
..+.|+.- T Consensus 158 ~~v~~p~~~v~~Dp~a-~~~D~sD-ar~~~~~~~~~~d~~~~~fp~~~~~~~~~~~~~~~~d~~~~d~~~~~eyy~~~-- 233 (706) T protein:vir:10 158 EPIYDPARSVWFDPDA-KKYDKSD-ALWAFCMYSVSLEKYQSEYDKAPTSLDRVGSVSWQYDWFTPDVVYIAKYYEVR-- 233 (706) T ss_pred eeeccchhceecCchh-cccChhh-cceEeeeecCCHHHHHHhcCCChhhhhhhccccccccccCCCcceeccccccc-- Confidence 223343 44 56632 1222221 12222221111100 000 0000 0000 00000000 Q ss_pred ecccccccceeecccccccccceeeeeeeec-------cccccccccccceeEEEEEeeCCCceecceeeeccCCcc--c Q lcl|NC_019408. 193 ASGSASSPMVRQTARTLGGYSYITVYRELKL-------EEIEWPSGEVKLAYVQYLYEEDPESRPIARIVPTVRGEP--L 263 (612) Q Consensus 193 ~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~-------~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g~~--l 263 (612) -...+.++.+......+ .....++.... ....+....++... ..++.-.+ . .+ +.+.+| . T Consensus 234 --~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~-v~~~~~~g-~----~~--l~~~~p~~~ 302 (706) T protein:vir:10 234 --KESVDVISYRQPLTQEI-ATYDSEQIADIQDELEQAGFEEIGRRSVKRRR-IYVAVVDG-D----GF--LEKPRRIPG 302 (706) T ss_pred --ceeEEEEEeeccccCCc-eeeccchhhhhHHHHhhCCchhhhhcccceee-EEEEeecc-c----cc--cccCCCCCC Confidence 00000011111111100 00000000000 00000000011100 01111011 0 00 111222 3 Q ss_pred cceeEEEeecCCC-----C--CCcCcCchHHHH-HHHHHHHhhhHHHHHHHHHhccceeee-----ecCCCCCCce---- Q lcl|NC_019408. 264 DFIPFKFFGASGN-----T--ADVEKPPLLDIC-DLNLSHYRTYAELEYGRLFTALPVYYA-----PGTDSEGTGE---- 326 (612) Q Consensus 264 ~~IP~v~~~~~~~-----~--~~~~~pPLldLA-~lnl~HY~~~sD~~~~l~~~~~P~l~i-----~G~~~~~~~~---- 326 (612) +.||||+|..... + ++.-. .+.|.= .+| |+.|. +-+++...+.-..+. .|+...|... 
T Consensus 303 ~~~P~vP~~g~r~~~d~~~~~~G~vr-~~~d~Q~~~N---~~~s~-~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~ 377 (706) T protein:vir:10 303 EHIPLIPVYGKRWFIDDVERVEGHIA-KAMDPQRLYN---LQVSM-LADAAAQDPGQTPIVDMEQIRGLEQHWEGRNRKR 377 (706) T ss_pred CccceEEEeeccccccccCcccceec-cchhhHHHHH---HHHHH-HHHHHHhcCCcccccchhHHHHHHHHhhhccccc Confidence 6677775532211 1 11111 111211 122 22222 222222222111111 1111112111 Q ss_pred ---E---EEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHH-HHHhhhccccchhHHHHHHHHHHHHHHHH Q lcl|NC_019408. 327 ---Y---HIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAI-GGRMMPGASKSVSESNNQTVLREANEQSL 399 (612) Q Consensus 327 ---l---~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~l-Ga~ll~~~~~~~~esa~~~~~~~~~~~s~ 399 (612) + .+|.......+....+++++++--+ .+..+-|+.....|..+ |..--.- +..++.||.+...+..+..-. T Consensus 378 ~~~l~~~~~~~~~g~i~~~~~~~~~~~~~~~~-~~~~~l~~~~~~~i~~vsGi~~~~l-G~~sn~SG~Ai~~rq~qg~~~ 455 (706) T protein:vir:10 378 PAFLPLRTVTDKTGNVVAPANVAGYTQAPVLN-QALAALLQQTSADIQEVTGSSQAMQ-QMPSNVARETVNSLLNRSDMA 455 (706) T ss_pred ccchhcccccCCCCcccccccccccCCCcchH-HHHHHHHHHHHHHHHHHhCCCHHHc-CCccchHHHHHHHHHHHHHHH Confidence 1 2444433333344566676654322 22344455555555443 4322111 112234777777777776667 Q ss_pred HHHHHHHHHHHHHHH----HHHHHHHcCCcC------CCCcceEEEeec---------------------ccccc----- Q lcl|NC_019408. 400 LLNIIQACESGMTDV----VRWWLMWRDVPL------ADTENLRYEVNT---------------------DFLST----- 443 (612) Q Consensus 400 L~~~a~~~~~a~~~~----l~~~a~w~g~~~------~~~~~~~v~ln~---------------------dF~~~----- 443 (612) |+.+-.|+..+...+ |.++..|++..- .++..-.+.||. |+... 
T Consensus 456 ~~~~~Dnl~~~~~~~g~~lL~li~~~y~~~R~~RI~~ed~~~~~v~in~~~~d~~~G~~~~~nDi~~g~yDv~i~~~p~~ 535 (706) T protein:vir:10 456 SFIYLDNMAKSLKRAGEIWLSMAREIYGSDREVRIVHEDGTDDIALMNAAVLDNQTGRVVALNDLSTGRYDVSVDVGPSY 535 (706) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCccceeeccceeccccCceeeeecceeeeEEEEEecccCc Confidence 888888888888765 777777764210 001111122221 11111 Q ss_pred -CCCHHHHHHHHHHHHcCCCC-HHH--HHHH-HHhcCccchhhhhHHHHHHhhccccccccchhHHhhhhhhHHHHhHHH Q lcl|NC_019408. 444 -PIGAREMRAIQLMANDGLLP-DPV--FYEY-MRKAEVISSDMTFEEFQALRADENSFINNPDAQARQRGYTNRGQELEQ 518 (612) Q Consensus 444 -~~d~~~~~al~~~~~~G~is-~et--~~~~-lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~e~ 518 (612) ....+.+.+|+++..++.-- ..+ +... +.-..+ + ..++..+++..+.+..+..+....+..+ ..+ ..+ T Consensus 536 ~t~r~~~~~~m~el~~~~~p~~~~~~~l~~~~~~~~d~--p--~~~e~~e~irk~~~~q~~~~~~~~~eq~--~~~-q~q 608 (706) T protein:vir:10 536 SARRDATVNALTQLLQGMLPQDPMRPALMGIIIDNMEG--E--GLDDFKAFNRRQLLTQGIVKPRNQQEQA--IVQ-QAQ 608 (706) T ss_pred chHHHHHHHHHHHHHHhcCCcchhhHHHHHHHHhhcCc--c--chHHHHHHHHHhhcccCCccccchhHHH--HHH-HHH Confidence 11233456677777765321 111 1111 111111 1 1134444544332222222111110000 000 000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHH----HHHHHHHHHHHHhhc Q lcl|NC_019408. 519 SRMAREADFTQQKIDIQERSVAVQEGHAEVAHAAGSTSISGSRKLGDPEQAK----PAVAD----QATIDNAKKQTANAA 590 (612) Q Consensus 519 ~r~~~e~e~~~q~~e~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~e~q~k----~~~~e----q~~~~~~~k~~~~~a 590 (612) +.+..+.+.+..+.+.+.++.+.+.++.+++..+.+.+.......+.+.+.. .+++. ++..+..+..+...+ T Consensus 609 q~q~~q~~~~~~~~~aq~~~~qA~~~k~~a~~~q~~~~a~~a~~qa~~~~~~~~~~~~~a~~~~~~~~~q~~q~l~~~~a 688 (706) T protein:vir:10 609 QAQATQPDPNMLLAQAQMVVAQAEAQKSQNETVQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVMETLRLLKEVAA 688 (706) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1111111222222222222222222222222111111111000001010000 00000 111111111111111 Q ss_pred c--ccCCCchhhcCCCCCcccCCC Q lcl|NC_019408. 
591 K--VAAQPPAPAAPGAPPTNRRPT 612 (612) Q Consensus 591 ~--~~~~~~~~~~~~~~~~~~~~~ 612 (612) + ..+|++.+ |.-+=|| T Consensus 689 ~q~~~~~~~~~------~~~~~~~ 706 (706) T protein:vir:10 689 SQQQTIPSPPS------PADIVPS 706 (706) T ss_pred hccCCCCCCCC------CcccCCC Confidence 1 12222222 2334444 No 91 >protein:vir:2764 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612881;genbank:gi:20065798;genbank:GeneID:935623 Probab=98.65 E-value=1.4e-07 Score=58.08 Aligned_cols=561 Identities=13% Similarity=0.056 Sum_probs=182.5 Q ss_pred CCCcH--------HHHHHHHHHHHHHHHhcChHHHHhc---ccccC--CCCCCCCHHHHHHHHhhccCCchHHHHHHHhh Q lcl|NC_019408. 1 MVTHP--------EYQYWRPEWTKLRDVMAGQREIKRK---AEAYL--PAMKGADGDDYAIYLQRATFFNMLAQTRDGMT 67 (612) Q Consensus 1 ~~~hP--------~y~~~~~~W~~i~d~~~G~~~vr~~---g~~YL--Pk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~ 67 (612) ...|+ .+...+..|.. .+.....+|.. -..|. =+|+.+....-+.+-.-.+.+|.++++|+..+ T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~~v~ 83 (714) T protein:vir:27 7 TMATKNDNGATPRFSQRQLQALCS---DIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHNLIAPTVDGVL 83 (714) T ss_pred cccCCCCcchhHHHHHHHHHHHHH---HHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccHHHHHHHHH Confidence 33333 33333332222 22333333321 11122 25555555555666667788999999999999 Q ss_pred chhhcCCceeecCCH-----------HHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEE Q lcl|NC_019408. 68 GMVFRRDPIVKNLPP-----------KFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAV 136 (612) Q Consensus 68 G~vf~k~p~~~~~p~-----------~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~ 136 (612) |.-=+..+.+.-.|. .|..++.. ..+-++.+.-+..+|..++.+|.+|+=|-+-. -..+..+++. 
T Consensus 84 g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~-~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~---d~~~~~i~i~ 159 (714) T protein:vir:27 84 GMEAKTRTDLVVMSDEPDDETEKLAEAINAEFAD-ACRLGNMNKARSDAYAEQIKAGLSWVEVRRNS---DPFGPEFKVS 159 (714) T ss_pred hHHHhCCcceEEecCCCCchhHHHHHHHHHHHHH-HHHhhchhHHHHHHHHHhhhcCcceEEecccc---CCCCCCeEEE Confidence 999888887743341 12222222 22234566778999999999998884442211 1234567788 Q ss_pred EechhhhhcchhhhccCCccceeEEEEEEEeeccccc--cCC-----C-------ccc---------------------- Q lcl|NC_019408. 137 GYSAENILDWDEVVDMGGFYVPSRVLLREFVRDLRWK--SDI-----E-------PLT---------------------- 180 (612) Q Consensus 137 ~~~ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~--~~~-----d-------~f~---------------------- 180 (612) .++|.+|+ |+....-..-....|+..+..+.....+ .+. + +|. T Consensus 160 ~v~p~~v~-~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~ 238 (714) T protein:vir:27 160 TVSRNEVF-WDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSW 238 (714) T ss_pred ecchhhee-eccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhccc Confidence 88888865 5432211111122333333221111000 000 0 000 Q ss_pred ------ccceeeeeeEeeecccccccceeecccccccccceeeeeeeecc--------ccccccccccceeEEEEEeeCC Q lcl|NC_019408. 181 ------TAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLE--------EIEWPSGEVKLAYVQYLYEEDP 246 (612) Q Consensus 181 ------~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~--------~~~~~~g~~~~~~~~~~~~~~~ 246 (612) .-.....|++....+ .+.+.....+.++-..+..|...... .+......+..++ ..++.. T Consensus 239 ~~~~~~~~~~~~~rv~v~E~w---~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~-~~~~~g-- 312 (714) T protein:vir:27 239 DRQQNEWLQRERRRVLLQVVY---YRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIR-EAWFVG-- 312 (714) T ss_pred cccccccccccccEEEEEEEE---EEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEE-EEEEec-- Confidence 000000011100000 00000000000000000011100000 0000011111111 111110 Q ss_pred CceecceeeeccCCcc--ccceeEEEeecCC---CCCCcC-cCchHHHH-HHHHHHHhhhHHHHHHHHHhccceeeeecC Q lcl|NC_019408. 
247 ESRPIARIVPTVRGEP--LDFIPFKFFGASG---NTADVE-KPPLLDIC-DLNLSHYRTYAELEYGRLFTALPVYYAPGT 319 (612) Q Consensus 247 ~~~~~~~~~p~~~g~~--l~~IP~v~~~~~~---~~~~~~-~pPLldLA-~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~ 319 (612) ..+...+-.| -+.+|||+|.... .+-..| .-.+.|.- .+| .+++. +.++|. +.-+.+..|. T Consensus 313 ------~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N--~~~s~--~~~~l~--~~~~~~~~~a 380 (714) T protein:vir:27 313 ------PHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVN--FRRIK--LTWLLQ--AKRVIMDEDA 380 (714) T ss_pred ------CcccccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHH--HHHHH--HHHhhc--CCceeeecCc Confidence 0111111112 2345655442221 111111 01122221 123 23333 234543 2222333443 Q ss_pred CCCCCceEE--E-eccccccC-C--C-----CCceeEEecCchhHHHHHHHHHHHHHHHHHH-HH--HhhhccccchhHH Q lcl|NC_019408. 320 DSEGTGEYH--I-GPNMVWEV-P--Q-----GSEPGILEYTGQGLKALETALNDKERQIAAI-GG--RMMPGASKSVSES 385 (612) Q Consensus 320 ~~~~~~~l~--i-G~~~~~~l-p--~-----~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~l-Ga--~ll~~~~~~~~es 385 (612) ...+++.+. + =+++.+.+ | . +..+.+.. ...-...+.+.|+...+.|..+ |. .++-..+. +.| T Consensus 381 ~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~-~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~n--a~S 457 (714) T protein:vir:27 381 TQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQ-DFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSG--ATS 457 (714) T ss_pred ccccHHHHHHhccCCCCceeecccccccCCCCccccccC-CCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCcc--chh Confidence 222221110 1 12233333 2 1 11233332 2222334556666666655443 32 22212222 345 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHcCCc---------CCCCcceEEEeec-------------- Q lcl|NC_019408. 386 NNQTVLREANEQSLLLNIIQACESGMTD----VVRWWLMWRDVP---------LADTENLRYEVNT-------------- 438 (612) Q Consensus 386 a~~~~~~~~~~~s~L~~~a~~~~~a~~~----~l~~~a~w~g~~---------~~~~~~~~v~ln~-------------- 438 (612) +.+...+..+..-.|+.+..|+..+... +|.++..|++.. ......-.|.+|. 
T Consensus 458 GvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~ 537 (714) T protein:vir:27 458 GVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRL 537 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceee Confidence 5555555555555677777777777665 455666665421 0000001222321 Q ss_pred --cccccCCC------HHHHHHHHHHHHcCCCCHHHH---HH-HHHhcCccchhhhhHHHHHHhhccccccccchhHHh- Q lcl|NC_019408. 439 --DFLSTPIG------AREMRAIQLMANDGLLPDPVF---YE-YMRKAEVISSDMTFEEFQALRADENSFINNPDAQAR- 505 (612) Q Consensus 439 --dF~~~~~d------~~~~~al~~~~~~G~is~et~---~~-~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~- 505 (612) |+.+...+ .+...+|+++++. ++-... .. .|.-..+. ..++..++|.+..+....++.... T Consensus 538 ~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~--~~p~~~~~~~~~~l~~~d~p----~~~el~~~ir~~~~~~~~~~~~~~e 611 (714) T protein:vir:27 538 NTHIALAPVQQTPAFKAQLAQRMSEVIQG--LPPQVQAVVLDLWVNLLDVP----QKQEFVERIRAALGTPKSPDEMTPE 611 (714) T ss_pred eEEEEEeeccCchHHHHHHHHHHHHHHhh--cCchhhhhHHHHHHHhcCCC----CHHHHHHHHHHHcCCCCCccccchh Confidence 11111111 3345566666654 221111 11 11112221 123344444333221111111110 Q ss_pred hhhhhHHHHhHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----- Q lcl|NC_019408. 506 QRGYTNRGQELEQSRM-----AREADFTQQKIDIQERSVAVQEGHAEVAHAAGSTSISGSRKLGDPEQAKPAVAD----- 575 (612) Q Consensus 506 ~~~e~~r~~~~e~~r~-----~~e~e~~~q~~e~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~e~q~k~~~~e----- 575 (612) +....+.++.++++.. +.+++.++-+++.++-+++. ++...+.+......+.++......+...++.. T Consensus 612 ~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a--~~~~~~a~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~ 689 (714) T protein:vir:27 612 EQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAA--QRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQN 689 (714) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhh Confidence 0000111111111110 01111111111111111111 00000000000000000000000000000000 Q ss_pred -HHHHHHH-HHHHHhhccccCCCch Q lcl|NC_019408. 
576 -QATIDNA-KKQTANAAKVAAQPPA 598 (612) Q Consensus 576 -q~~~~~~-~k~~~~~a~~~~~~~~ 598 (612) +...... ++..+..+.+..+=+- T Consensus 690 ~~~~~~~~~~q~~q~~~~~~~~~~~ 714 (714) T protein:vir:27 690 MEQEQDVLQQQMLYTLQQRMNEMSL 714 (714) T ss_pred hhhhhHHHHHHHHHHHHHHHHhcCC Confidence 0000000 0000000000000000 No 92 >protein:vir:9950 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859080;genbank:gi:32170835;genbank:GeneID:2653184 Probab=98.65 E-value=1.4e-07 Score=58.08 Aligned_cols=561 Identities=13% Similarity=0.056 Sum_probs=182.5 Q ss_pred CCCcH--------HHHHHHHHHHHHHHHhcChHHHHhc---ccccC--CCCCCCCHHHHHHHHhhccCCchHHHHHHHhh Q lcl|NC_019408. 1 MVTHP--------EYQYWRPEWTKLRDVMAGQREIKRK---AEAYL--PAMKGADGDDYAIYLQRATFFNMLAQTRDGMT 67 (612) Q Consensus 1 ~~~hP--------~y~~~~~~W~~i~d~~~G~~~vr~~---g~~YL--Pk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~ 67 (612) ...|+ .+...+..|.. .+.....+|.. -..|. =+|+.+....-+.+-.-.+.+|.++++|+..+ T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~~v~ 83 (714) T protein:vir:99 7 TMATKNDNGATPRFSQRQLQALCS---DIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHNLIAPTVDGVL 83 (714) T ss_pred cccCCCCcchhHHHHHHHHHHHHH---HHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccHHHHHHHHH Confidence 33333 33333332222 22333333321 11122 25555555555666667788999999999999 Q ss_pred chhhcCCceeecCCH-----------HHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEE Q lcl|NC_019408. 68 GMVFRRDPIVKNLPP-----------KFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAV 136 (612) Q Consensus 68 G~vf~k~p~~~~~p~-----------~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~ 136 (612) |.-=+..+.+.-.|. .|..++.. ..+-++.+.-+..+|..++.+|.+|+=|-+-. -..+..+++. 
T Consensus 84 g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~-~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~---d~~~~~i~i~ 159 (714) T protein:vir:99 84 GMEAKTRTDLVVMSDEPDDETEKLAEAINAEFAD-ACRLGNMNKARSDAYAEQIKAGLSWVEVRRNS---DPFGPEFKVS 159 (714) T ss_pred hHHHhCCcceEEecCCCCchhHHHHHHHHHHHHH-HHHhhchhHHHHHHHHHhhhcCcceEEecccc---CCCCCCeEEE Confidence 999888887743341 12222222 22234566778999999999998884442211 1234567788 Q ss_pred EechhhhhcchhhhccCCccceeEEEEEEEeeccccc--cCC-----C-------ccc---------------------- Q lcl|NC_019408. 137 GYSAENILDWDEVVDMGGFYVPSRVLLREFVRDLRWK--SDI-----E-------PLT---------------------- 180 (612) Q Consensus 137 ~~~ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~--~~~-----d-------~f~---------------------- 180 (612) .++|.+|+ |+....-..-....|+..+..+.....+ .+. + +|. T Consensus 160 ~v~p~~v~-~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~ 238 (714) T protein:vir:99 160 TVSRNEVF-WDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSW 238 (714) T ss_pred ecchhhee-eccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhccc Confidence 88888865 5432211111122333333221111000 000 0 000 Q ss_pred ------ccceeeeeeEeeecccccccceeecccccccccceeeeeeeecc--------ccccccccccceeEEEEEeeCC Q lcl|NC_019408. 181 ------TAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLE--------EIEWPSGEVKLAYVQYLYEEDP 246 (612) Q Consensus 181 ------~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~--------~~~~~~g~~~~~~~~~~~~~~~ 246 (612) .-.....|++....+ .+.+.....+.++-..+..|...... .+......+..++ ..++.. T Consensus 239 ~~~~~~~~~~~~~rv~v~E~w---~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~-~~~~~g-- 312 (714) T protein:vir:99 239 DRQQNEWLQRERRRVLLQVVY---YRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIR-EAWFVG-- 312 (714) T ss_pred cccccccccccccEEEEEEEE---EEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEE-EEEEec-- Confidence 000000011100000 00000000000000000011100000 0000011111111 111110 Q ss_pred CceecceeeeccCCcc--ccceeEEEeecCC---CCCCcC-cCchHHHH-HHHHHHHhhhHHHHHHHHHhccceeeeecC Q lcl|NC_019408. 
247 ESRPIARIVPTVRGEP--LDFIPFKFFGASG---NTADVE-KPPLLDIC-DLNLSHYRTYAELEYGRLFTALPVYYAPGT 319 (612) Q Consensus 247 ~~~~~~~~~p~~~g~~--l~~IP~v~~~~~~---~~~~~~-~pPLldLA-~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~ 319 (612) ..+...+-.| -+.+|||+|.... .+-..| .-.+.|.- .+| .+++. +.++|. +.-+.+..|. T Consensus 313 ------~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N--~~~s~--~~~~l~--~~~~~~~~~a 380 (714) T protein:vir:99 313 ------PHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVN--FRRIK--LTWLLQ--AKRVIMDEDA 380 (714) T ss_pred ------CcccccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHH--HHHHH--HHHhhc--CCceeeecCc Confidence 0111111112 2345655442221 111111 01122221 123 23333 234543 2222333443 Q ss_pred CCCCCceEE--E-eccccccC-C--C-----CCceeEEecCchhHHHHHHHHHHHHHHHHHH-HH--HhhhccccchhHH Q lcl|NC_019408. 320 DSEGTGEYH--I-GPNMVWEV-P--Q-----GSEPGILEYTGQGLKALETALNDKERQIAAI-GG--RMMPGASKSVSES 385 (612) Q Consensus 320 ~~~~~~~l~--i-G~~~~~~l-p--~-----~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~l-Ga--~ll~~~~~~~~es 385 (612) ...+++.+. + =+++.+.+ | . +..+.+.. ...-...+.+.|+...+.|..+ |. .++-..+. +.| T Consensus 381 ~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~-~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~n--a~S 457 (714) T protein:vir:99 381 TQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQ-DFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSG--ATS 457 (714) T ss_pred ccccHHHHHHhccCCCCceeecccccccCCCCccccccC-CCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCcc--chh Confidence 222221110 1 12233333 2 1 11233332 2222334556666666655443 32 22212222 345 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHcCCc---------CCCCcceEEEeec-------------- Q lcl|NC_019408. 386 NNQTVLREANEQSLLLNIIQACESGMTD----VVRWWLMWRDVP---------LADTENLRYEVNT-------------- 438 (612) Q Consensus 386 a~~~~~~~~~~~s~L~~~a~~~~~a~~~----~l~~~a~w~g~~---------~~~~~~~~v~ln~-------------- 438 (612) +.+...+..+..-.|+.+..|+..+... +|.++..|++.. ......-.|.+|. 
T Consensus 458 GvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~ 537 (714) T protein:vir:99 458 GVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRL 537 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceee Confidence 5555555555555677777777777665 455666665421 0000001222321 Q ss_pred --cccccCCC------HHHHHHHHHHHHcCCCCHHHH---HH-HHHhcCccchhhhhHHHHHHhhccccccccchhHHh- Q lcl|NC_019408. 439 --DFLSTPIG------AREMRAIQLMANDGLLPDPVF---YE-YMRKAEVISSDMTFEEFQALRADENSFINNPDAQAR- 505 (612) Q Consensus 439 --dF~~~~~d------~~~~~al~~~~~~G~is~et~---~~-~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~- 505 (612) |+.+...+ .+...+|+++++. ++-... .. .|.-..+. ..++..++|.+..+....++.... T Consensus 538 ~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~--~~p~~~~~~~~~~l~~~d~p----~~~el~~~ir~~~~~~~~~~~~~~e 611 (714) T protein:vir:99 538 NTHIALAPVQQTPAFKAQLAQRMSEVIQG--LPPQVQAVVLDLWVNLLDVP----QKQEFVERIRAALGTPKSPDEMTPE 611 (714) T ss_pred eEEEEEeeccCchHHHHHHHHHHHHHHhh--cCchhhhhHHHHHHHhcCCC----CHHHHHHHHHHHcCCCCCccccchh Confidence 11111111 3345566666654 221111 11 11112221 123344444333221111111110 Q ss_pred hhhhhHHHHhHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----- Q lcl|NC_019408. 506 QRGYTNRGQELEQSRM-----AREADFTQQKIDIQERSVAVQEGHAEVAHAAGSTSISGSRKLGDPEQAKPAVAD----- 575 (612) Q Consensus 506 ~~~e~~r~~~~e~~r~-----~~e~e~~~q~~e~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~e~q~k~~~~e----- 575 (612) +....+.++.++++.. +.+++.++-+++.++-+++. ++...+.+......+.++......+...++.. T Consensus 612 ~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a--~~~~~~a~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~ 689 (714) T protein:vir:99 612 EQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAA--QRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQN 689 (714) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhh Confidence 0000111111111110 01111111111111111111 00000000000000000000000000000000 Q ss_pred -HHHHHHH-HHHHHhhccccCCCch Q lcl|NC_019408. 
576 -QATIDNA-KKQTANAAKVAAQPPA 598 (612) Q Consensus 576 -q~~~~~~-~k~~~~~a~~~~~~~~ 598 (612) +...... ++..+..+.+..+=+- T Consensus 690 ~~~~~~~~~~q~~q~~~~~~~~~~~ 714 (714) T protein:vir:99 690 MEQEQDVLQQQMLYTLQQRMNEMSL 714 (714) T ss_pred hhhhhHHHHHHHHHHHHHHHHhcCC Confidence 0000000 0000000000000000 No 93 >protein:vir:817 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050550;genbank:gi:9633447;genbank:GeneID:1262279 Probab=98.65 E-value=1.4e-07 Score=58.08 Aligned_cols=561 Identities=13% Similarity=0.056 Sum_probs=182.5 Q ss_pred CCCcH--------HHHHHHHHHHHHHHHhcChHHHHhc---ccccC--CCCCCCCHHHHHHHHhhccCCchHHHHHHHhh Q lcl|NC_019408. 1 MVTHP--------EYQYWRPEWTKLRDVMAGQREIKRK---AEAYL--PAMKGADGDDYAIYLQRATFFNMLAQTRDGMT 67 (612) Q Consensus 1 ~~~hP--------~y~~~~~~W~~i~d~~~G~~~vr~~---g~~YL--Pk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~ 67 (612) ...|+ .+...+..|.. .+.....+|.. -..|. =+|+.+....-+.+-.-.+.+|.++++|+..+ T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~~v~ 83 (714) T protein:vir:81 7 TMATKNDNGATPRFSQRQLQALCS---DIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHNLIAPTVDGVL 83 (714) T ss_pred cccCCCCcchhHHHHHHHHHHHHH---HHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccHHHHHHHHH Confidence 33333 33333332222 22333333321 11122 25555555555666667788999999999999 Q ss_pred chhhcCCceeecCCH-----------HHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEE Q lcl|NC_019408. 68 GMVFRRDPIVKNLPP-----------KFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAV 136 (612) Q Consensus 68 G~vf~k~p~~~~~p~-----------~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~ 136 (612) |.-=+..+.+.-.|. .|..++.. ..+-++.+.-+..+|..++.+|.+|+=|-+-. -..+..+++. 
T Consensus 84 g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~-~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~---d~~~~~i~i~ 159 (714) T protein:vir:81 84 GMEAKTRTDLVVMSDEPDDETEKLAEAINAEFAD-ACRLGNMNKARSDAYAEQIKAGLSWVEVRRNS---DPFGPEFKVS 159 (714) T ss_pred hHHHhCCcceEEecCCCCchhHHHHHHHHHHHHH-HHHhhchhHHHHHHHHHhhhcCcceEEecccc---CCCCCCeEEE Confidence 999888887743341 12222222 22234566778999999999998884442211 1234567788 Q ss_pred EechhhhhcchhhhccCCccceeEEEEEEEeeccccc--cCC-----C-------ccc---------------------- Q lcl|NC_019408. 137 GYSAENILDWDEVVDMGGFYVPSRVLLREFVRDLRWK--SDI-----E-------PLT---------------------- 180 (612) Q Consensus 137 ~~~ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~--~~~-----d-------~f~---------------------- 180 (612) .++|.+|+ |+....-..-....|+..+..+.....+ .+. + +|. T Consensus 160 ~v~p~~v~-~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~ 238 (714) T protein:vir:81 160 TVSRNEVF-WDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSW 238 (714) T ss_pred ecchhhee-eccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhccc Confidence 88888865 5432211111122333333221111000 000 0 000 Q ss_pred ------ccceeeeeeEeeecccccccceeecccccccccceeeeeeeecc--------ccccccccccceeEEEEEeeCC Q lcl|NC_019408. 181 ------TAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLE--------EIEWPSGEVKLAYVQYLYEEDP 246 (612) Q Consensus 181 ------~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~--------~~~~~~g~~~~~~~~~~~~~~~ 246 (612) .-.....|++....+ .+.+.....+.++-..+..|...... .+......+..++ ..++.. T Consensus 239 ~~~~~~~~~~~~~rv~v~E~w---~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~-~~~~~g-- 312 (714) T protein:vir:81 239 DRQQNEWLQRERRRVLLQVVY---YRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIR-EAWFVG-- 312 (714) T ss_pred cccccccccccccEEEEEEEE---EEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEE-EEEEec-- Confidence 000000011100000 00000000000000000011100000 0000011111111 111110 Q ss_pred CceecceeeeccCCcc--ccceeEEEeecCC---CCCCcC-cCchHHHH-HHHHHHHhhhHHHHHHHHHhccceeeeecC Q lcl|NC_019408. 
247 ESRPIARIVPTVRGEP--LDFIPFKFFGASG---NTADVE-KPPLLDIC-DLNLSHYRTYAELEYGRLFTALPVYYAPGT 319 (612) Q Consensus 247 ~~~~~~~~~p~~~g~~--l~~IP~v~~~~~~---~~~~~~-~pPLldLA-~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~ 319 (612) ..+...+-.| -+.+|||+|.... .+-..| .-.+.|.- .+| .+++. +.++|. +.-+.+..|. T Consensus 313 ------~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N--~~~s~--~~~~l~--~~~~~~~~~a 380 (714) T protein:vir:81 313 ------PHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVN--FRRIK--LTWLLQ--AKRVIMDEDA 380 (714) T ss_pred ------CcccccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHH--HHHHH--HHHhhc--CCceeeecCc Confidence 0111111112 2345655442221 111111 01122221 123 23333 234543 2222333443 Q ss_pred CCCCCceEE--E-eccccccC-C--C-----CCceeEEecCchhHHHHHHHHHHHHHHHHHH-HH--HhhhccccchhHH Q lcl|NC_019408. 320 DSEGTGEYH--I-GPNMVWEV-P--Q-----GSEPGILEYTGQGLKALETALNDKERQIAAI-GG--RMMPGASKSVSES 385 (612) Q Consensus 320 ~~~~~~~l~--i-G~~~~~~l-p--~-----~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~l-Ga--~ll~~~~~~~~es 385 (612) ...+++.+. + =+++.+.+ | . +..+.+.. ...-...+.+.|+...+.|..+ |. .++-..+. +.| T Consensus 381 ~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~-~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~n--a~S 457 (714) T protein:vir:81 381 TQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQ-DFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSG--ATS 457 (714) T ss_pred ccccHHHHHHhccCCCCceeecccccccCCCCccccccC-CCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCcc--chh Confidence 222221110 1 12233333 2 1 11233332 2222334556666666655443 32 22212222 345 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHcCCc---------CCCCcceEEEeec-------------- Q lcl|NC_019408. 386 NNQTVLREANEQSLLLNIIQACESGMTD----VVRWWLMWRDVP---------LADTENLRYEVNT-------------- 438 (612) Q Consensus 386 a~~~~~~~~~~~s~L~~~a~~~~~a~~~----~l~~~a~w~g~~---------~~~~~~~~v~ln~-------------- 438 (612) +.+...+..+..-.|+.+..|+..+... +|.++..|++.. ......-.|.+|. 
T Consensus 458 GvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~ 537 (714) T protein:vir:81 458 GVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRL 537 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceee Confidence 5555555555555677777777777665 455666665421 0000001222321 Q ss_pred --cccccCCC------HHHHHHHHHHHHcCCCCHHHH---HH-HHHhcCccchhhhhHHHHHHhhccccccccchhHHh- Q lcl|NC_019408. 439 --DFLSTPIG------AREMRAIQLMANDGLLPDPVF---YE-YMRKAEVISSDMTFEEFQALRADENSFINNPDAQAR- 505 (612) Q Consensus 439 --dF~~~~~d------~~~~~al~~~~~~G~is~et~---~~-~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~- 505 (612) |+.+...+ .+...+|+++++. ++-... .. .|.-..+. ..++..++|.+..+....++.... T Consensus 538 ~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~--~~p~~~~~~~~~~l~~~d~p----~~~el~~~ir~~~~~~~~~~~~~~e 611 (714) T protein:vir:81 538 NTHIALAPVQQTPAFKAQLAQRMSEVIQG--LPPQVQAVVLDLWVNLLDVP----QKQEFVERIRAALGTPKSPDEMTPE 611 (714) T ss_pred eEEEEEeeccCchHHHHHHHHHHHHHHhh--cCchhhhhHHHHHHHhcCCC----CHHHHHHHHHHHcCCCCCccccchh Confidence 11111111 3345566666654 221111 11 11112221 123344444333221111111110 Q ss_pred hhhhhHHHHhHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----- Q lcl|NC_019408. 506 QRGYTNRGQELEQSRM-----AREADFTQQKIDIQERSVAVQEGHAEVAHAAGSTSISGSRKLGDPEQAKPAVAD----- 575 (612) Q Consensus 506 ~~~e~~r~~~~e~~r~-----~~e~e~~~q~~e~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~e~q~k~~~~e----- 575 (612) +....+.++.++++.. +.+++.++-+++.++-+++. ++...+.+......+.++......+...++.. T Consensus 612 ~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a--~~~~~~a~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~ 689 (714) T protein:vir:81 612 EQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAA--QRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQN 689 (714) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhh Confidence 0000111111111110 01111111111111111111 00000000000000000000000000000000 Q ss_pred -HHHHHHH-HHHHHhhccccCCCch Q lcl|NC_019408. 
576 -QATIDNA-KKQTANAAKVAAQPPA 598 (612) Q Consensus 576 -q~~~~~~-~k~~~~~a~~~~~~~~ 598 (612) +...... ++..+..+.+..+=+- T Consensus 690 ~~~~~~~~~~q~~q~~~~~~~~~~~ 714 (714) T protein:vir:81 690 MEQEQDVLQQQMLYTLQQRMNEMSL 714 (714) T ss_pred hhhhhHHHHHHHHHHHHHHHHhcCC Confidence 0000000 0000000000000000 No 94 >protein:vir:10117 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859247;genbank:gi:32171003;genbank:GeneID:2653343 Probab=98.65 E-value=1.4e-07 Score=58.08 Aligned_cols=561 Identities=13% Similarity=0.056 Sum_probs=182.5 Q ss_pred CCCcH--------HHHHHHHHHHHHHHHhcChHHHHhc---ccccC--CCCCCCCHHHHHHHHhhccCCchHHHHHHHhh Q lcl|NC_019408. 1 MVTHP--------EYQYWRPEWTKLRDVMAGQREIKRK---AEAYL--PAMKGADGDDYAIYLQRATFFNMLAQTRDGMT 67 (612) Q Consensus 1 ~~~hP--------~y~~~~~~W~~i~d~~~G~~~vr~~---g~~YL--Pk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~ 67 (612) ...|+ .+...+..|.. .+.....+|.. -..|. =+|+.+....-+.+-.-.+.+|.++++|+..+ T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~~v~ 83 (714) T protein:vir:10 7 TMATKNDNGATPRFSQRQLQALCS---DIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHNLIAPTVDGVL 83 (714) T ss_pred cccCCCCcchhHHHHHHHHHHHHH---HHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccHHHHHHHHH Confidence 33333 33333332222 22333333321 11122 25555555555666667788999999999999 Q ss_pred chhhcCCceeecCCH-----------HHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEE Q lcl|NC_019408. 68 GMVFRRDPIVKNLPP-----------KFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAV 136 (612) Q Consensus 68 G~vf~k~p~~~~~p~-----------~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~ 136 (612) |.-=+..+.+.-.|. .|..++.. ..+-++.+.-+..+|..++.+|.+|+=|-+-. -..+..+++. 
T Consensus 84 g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~-~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~---d~~~~~i~i~ 159 (714) T protein:vir:10 84 GMEAKTRTDLVVMSDEPDDETEKLAEAINAEFAD-ACRLGNMNKARSDAYAEQIKAGLSWVEVRRNS---DPFGPEFKVS 159 (714) T ss_pred hHHHhCCcceEEecCCCCchhHHHHHHHHHHHHH-HHHhhchhHHHHHHHHHhhhcCcceEEecccc---CCCCCCeEEE Confidence 999888887743341 12222222 22234566778999999999998884442211 1234567788 Q ss_pred EechhhhhcchhhhccCCccceeEEEEEEEeeccccc--cCC-----C-------ccc---------------------- Q lcl|NC_019408. 137 GYSAENILDWDEVVDMGGFYVPSRVLLREFVRDLRWK--SDI-----E-------PLT---------------------- 180 (612) Q Consensus 137 ~~~ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~--~~~-----d-------~f~---------------------- 180 (612) .++|.+|+ |+....-..-....|+..+..+.....+ .+. + +|. T Consensus 160 ~v~p~~v~-~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~ 238 (714) T protein:vir:10 160 TVSRNEVF-WDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSW 238 (714) T ss_pred ecchhhee-eccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhccc Confidence 88888865 5432211111122333333221111000 000 0 000 Q ss_pred ------ccceeeeeeEeeecccccccceeecccccccccceeeeeeeecc--------ccccccccccceeEEEEEeeCC Q lcl|NC_019408. 181 ------TAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLE--------EIEWPSGEVKLAYVQYLYEEDP 246 (612) Q Consensus 181 ------~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~--------~~~~~~g~~~~~~~~~~~~~~~ 246 (612) .-.....|++....+ .+.+.....+.++-..+..|...... .+......+..++ ..++.. T Consensus 239 ~~~~~~~~~~~~~rv~v~E~w---~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~-~~~~~g-- 312 (714) T protein:vir:10 239 DRQQNEWLQRERRRVLLQVVY---YRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIR-EAWFVG-- 312 (714) T ss_pred cccccccccccccEEEEEEEE---EEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEE-EEEEec-- Confidence 000000011100000 00000000000000000011100000 0000011111111 111110 Q ss_pred CceecceeeeccCCcc--ccceeEEEeecCC---CCCCcC-cCchHHHH-HHHHHHHhhhHHHHHHHHHhccceeeeecC Q lcl|NC_019408. 
247 ESRPIARIVPTVRGEP--LDFIPFKFFGASG---NTADVE-KPPLLDIC-DLNLSHYRTYAELEYGRLFTALPVYYAPGT 319 (612) Q Consensus 247 ~~~~~~~~~p~~~g~~--l~~IP~v~~~~~~---~~~~~~-~pPLldLA-~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~ 319 (612) ..+...+-.| -+.+|||+|.... .+-..| .-.+.|.- .+| .+++. +.++|. +.-+.+..|. T Consensus 313 ------~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N--~~~s~--~~~~l~--~~~~~~~~~a 380 (714) T protein:vir:10 313 ------PHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVN--FRRIK--LTWLLQ--AKRVIMDEDA 380 (714) T ss_pred ------CcccccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHH--HHHHH--HHHhhc--CCceeeecCc Confidence 0111111112 2345655442221 111111 01122221 123 23333 234543 2222333443 Q ss_pred CCCCCceEE--E-eccccccC-C--C-----CCceeEEecCchhHHHHHHHHHHHHHHHHHH-HH--HhhhccccchhHH Q lcl|NC_019408. 320 DSEGTGEYH--I-GPNMVWEV-P--Q-----GSEPGILEYTGQGLKALETALNDKERQIAAI-GG--RMMPGASKSVSES 385 (612) Q Consensus 320 ~~~~~~~l~--i-G~~~~~~l-p--~-----~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~l-Ga--~ll~~~~~~~~es 385 (612) ...+++.+. + =+++.+.+ | . +..+.+.. ...-...+.+.|+...+.|..+ |. .++-..+. +.| T Consensus 381 ~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~-~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~n--a~S 457 (714) T protein:vir:10 381 TQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQ-DFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSG--ATS 457 (714) T ss_pred ccccHHHHHHhccCCCCceeecccccccCCCCccccccC-CCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCcc--chh Confidence 222221110 1 12233333 2 1 11233332 2222334556666666655443 32 22212222 345 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHcCCc---------CCCCcceEEEeec-------------- Q lcl|NC_019408. 386 NNQTVLREANEQSLLLNIIQACESGMTD----VVRWWLMWRDVP---------LADTENLRYEVNT-------------- 438 (612) Q Consensus 386 a~~~~~~~~~~~s~L~~~a~~~~~a~~~----~l~~~a~w~g~~---------~~~~~~~~v~ln~-------------- 438 (612) +.+...+..+..-.|+.+..|+..+... +|.++..|++.. ......-.|.+|. 
T Consensus 458 GvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~ 537 (714) T protein:vir:10 458 GVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRL 537 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceee Confidence 5555555555555677777777777665 455666665421 0000001222321 Q ss_pred --cccccCCC------HHHHHHHHHHHHcCCCCHHHH---HH-HHHhcCccchhhhhHHHHHHhhccccccccchhHHh- Q lcl|NC_019408. 439 --DFLSTPIG------AREMRAIQLMANDGLLPDPVF---YE-YMRKAEVISSDMTFEEFQALRADENSFINNPDAQAR- 505 (612) Q Consensus 439 --dF~~~~~d------~~~~~al~~~~~~G~is~et~---~~-~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~- 505 (612) |+.+...+ .+...+|+++++. ++-... .. .|.-..+. ..++..++|.+..+....++.... T Consensus 538 ~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~--~~p~~~~~~~~~~l~~~d~p----~~~el~~~ir~~~~~~~~~~~~~~e 611 (714) T protein:vir:10 538 NTHIALAPVQQTPAFKAQLAQRMSEVIQG--LPPQVQAVVLDLWVNLLDVP----QKQEFVERIRAALGTPKSPDEMTPE 611 (714) T ss_pred eEEEEEeeccCchHHHHHHHHHHHHHHhh--cCchhhhhHHHHHHHhcCCC----CHHHHHHHHHHHcCCCCCccccchh Confidence 11111111 3345566666654 221111 11 11112221 123344444333221111111110 Q ss_pred hhhhhHHHHhHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----- Q lcl|NC_019408. 506 QRGYTNRGQELEQSRM-----AREADFTQQKIDIQERSVAVQEGHAEVAHAAGSTSISGSRKLGDPEQAKPAVAD----- 575 (612) Q Consensus 506 ~~~e~~r~~~~e~~r~-----~~e~e~~~q~~e~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~e~q~k~~~~e----- 575 (612) +....+.++.++++.. +.+++.++-+++.++-+++. ++...+.+......+.++......+...++.. T Consensus 612 ~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a--~~~~~~a~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~ 689 (714) T protein:vir:10 612 EQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAA--QRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQN 689 (714) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhh Confidence 0000111111111110 01111111111111111111 00000000000000000000000000000000 Q ss_pred -HHHHHHH-HHHHHhhccccCCCch Q lcl|NC_019408. 
576 -QATIDNA-KKQTANAAKVAAQPPA 598 (612) Q Consensus 576 -q~~~~~~-~k~~~~~a~~~~~~~~ 598 (612) +...... ++..+..+.+..+=+- T Consensus 690 ~~~~~~~~~~q~~q~~~~~~~~~~~ 714 (714) T protein:vir:10 690 MEQEQDVLQQQMLYTLQQRMNEMSL 714 (714) T ss_pred hhhhhHHHHHHHHHHHHHHHHhcCC Confidence 0000000 0000000000000000 No 95 >protein:vir:3296 Length: 714 # NCBI annotation: putative portal protein # Family: family:all:487 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049512;genbank:gi:9632518;genbank:GeneID:1262005 Probab=98.65 E-value=1.4e-07 Score=58.08 Aligned_cols=561 Identities=13% Similarity=0.056 Sum_probs=182.5 Q ss_pred CCCcH--------HHHHHHHHHHHHHHHhcChHHHHhc---ccccC--CCCCCCCHHHHHHHHhhccCCchHHHHHHHhh Q lcl|NC_019408. 1 MVTHP--------EYQYWRPEWTKLRDVMAGQREIKRK---AEAYL--PAMKGADGDDYAIYLQRATFFNMLAQTRDGMT 67 (612) Q Consensus 1 ~~~hP--------~y~~~~~~W~~i~d~~~G~~~vr~~---g~~YL--Pk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~ 67 (612) ...|+ .+...+..|.. .+.....+|.. -..|. =+|+.+....-+.+-.-.+.+|.++++|+..+ T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~~v~ 83 (714) T protein:vir:32 7 TMATKNDNGATPRFSQRQLQALCS---DIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHNLIAPTVDGVL 83 (714) T ss_pred cccCCCCcchhHHHHHHHHHHHHH---HHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccHHHHHHHHH Confidence 33333 33333332222 22333333321 11122 25555555555666667788999999999999 Q ss_pred chhhcCCceeecCCH-----------HHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEE Q lcl|NC_019408. 68 GMVFRRDPIVKNLPP-----------KFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAV 136 (612) Q Consensus 68 G~vf~k~p~~~~~p~-----------~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~ 136 (612) |.-=+..+.+.-.|. .|..++.. ..+-++.+.-+..+|..++.+|.+|+=|-+-. -..+..+++. 
T Consensus 84 g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~-~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~---d~~~~~i~i~ 159 (714) T protein:vir:32 84 GMEAKTRTDLVVMSDEPDDETEKLAEAINAEFAD-ACRLGNMNKARSDAYAEQIKAGLSWVEVRRNS---DPFGPEFKVS 159 (714) T ss_pred hHHHhCCcceEEecCCCCchhHHHHHHHHHHHHH-HHHhhchhHHHHHHHHHhhhcCcceEEecccc---CCCCCCeEEE Confidence 999888887743341 12222222 22234566778999999999998884442211 1234567788 Q ss_pred EechhhhhcchhhhccCCccceeEEEEEEEeeccccc--cCC-----C-------ccc---------------------- Q lcl|NC_019408. 137 GYSAENILDWDEVVDMGGFYVPSRVLLREFVRDLRWK--SDI-----E-------PLT---------------------- 180 (612) Q Consensus 137 ~~~ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~--~~~-----d-------~f~---------------------- 180 (612) .++|.+|+ |+....-..-....|+..+..+.....+ .+. + +|. T Consensus 160 ~v~p~~v~-~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~ 238 (714) T protein:vir:32 160 TVSRNEVF-WDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSW 238 (714) T ss_pred ecchhhee-eccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhccc Confidence 88888865 5432211111122333333221111000 000 0 000 Q ss_pred ------ccceeeeeeEeeecccccccceeecccccccccceeeeeeeecc--------ccccccccccceeEEEEEeeCC Q lcl|NC_019408. 181 ------TAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLE--------EIEWPSGEVKLAYVQYLYEEDP 246 (612) Q Consensus 181 ------~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~--------~~~~~~g~~~~~~~~~~~~~~~ 246 (612) .-.....|++....+ .+.+.....+.++-..+..|...... .+......+..++ ..++.. T Consensus 239 ~~~~~~~~~~~~~rv~v~E~w---~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~-~~~~~g-- 312 (714) T protein:vir:32 239 DRQQNEWLQRERRRVLLQVVY---YRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIR-EAWFVG-- 312 (714) T ss_pred cccccccccccccEEEEEEEE---EEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEE-EEEEec-- Confidence 000000011100000 00000000000000000011100000 0000011111111 111110 Q ss_pred CceecceeeeccCCcc--ccceeEEEeecCC---CCCCcC-cCchHHHH-HHHHHHHhhhHHHHHHHHHhccceeeeecC Q lcl|NC_019408. 
247 ESRPIARIVPTVRGEP--LDFIPFKFFGASG---NTADVE-KPPLLDIC-DLNLSHYRTYAELEYGRLFTALPVYYAPGT 319 (612) Q Consensus 247 ~~~~~~~~~p~~~g~~--l~~IP~v~~~~~~---~~~~~~-~pPLldLA-~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~ 319 (612) ..+...+-.| -+.+|||+|.... .+-..| .-.+.|.- .+| .+++. +.++|. +.-+.+..|. T Consensus 313 ------~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N--~~~s~--~~~~l~--~~~~~~~~~a 380 (714) T protein:vir:32 313 ------PHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVN--FRRIK--LTWLLQ--AKRVIMDEDA 380 (714) T ss_pred ------CcccccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHH--HHHHH--HHHhhc--CCceeeecCc Confidence 0111111112 2345655442221 111111 01122221 123 23333 234543 2222333443 Q ss_pred CCCCCceEE--E-eccccccC-C--C-----CCceeEEecCchhHHHHHHHHHHHHHHHHHH-HH--HhhhccccchhHH Q lcl|NC_019408. 320 DSEGTGEYH--I-GPNMVWEV-P--Q-----GSEPGILEYTGQGLKALETALNDKERQIAAI-GG--RMMPGASKSVSES 385 (612) Q Consensus 320 ~~~~~~~l~--i-G~~~~~~l-p--~-----~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~l-Ga--~ll~~~~~~~~es 385 (612) ...+++.+. + =+++.+.+ | . +..+.+.. ...-...+.+.|+...+.|..+ |. .++-..+. +.| T Consensus 381 ~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~-~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~n--a~S 457 (714) T protein:vir:32 381 TQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQ-DFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSG--ATS 457 (714) T ss_pred ccccHHHHHHhccCCCCceeecccccccCCCCccccccC-CCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCcc--chh Confidence 222221110 1 12233333 2 1 11233332 2222334556666666655443 32 22212222 345 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHcCCc---------CCCCcceEEEeec-------------- Q lcl|NC_019408. 386 NNQTVLREANEQSLLLNIIQACESGMTD----VVRWWLMWRDVP---------LADTENLRYEVNT-------------- 438 (612) Q Consensus 386 a~~~~~~~~~~~s~L~~~a~~~~~a~~~----~l~~~a~w~g~~---------~~~~~~~~v~ln~-------------- 438 (612) +.+...+..+..-.|+.+..|+..+... +|.++..|++.. ......-.|.+|. 
T Consensus 458 GvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~ 537 (714) T protein:vir:32 458 GVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRL 537 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceee Confidence 5555555555555677777777777665 455666665421 0000001222321 Q ss_pred --cccccCCC------HHHHHHHHHHHHcCCCCHHHH---HH-HHHhcCccchhhhhHHHHHHhhccccccccchhHHh- Q lcl|NC_019408. 439 --DFLSTPIG------AREMRAIQLMANDGLLPDPVF---YE-YMRKAEVISSDMTFEEFQALRADENSFINNPDAQAR- 505 (612) Q Consensus 439 --dF~~~~~d------~~~~~al~~~~~~G~is~et~---~~-~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~- 505 (612) |+.+...+ .+...+|+++++. ++-... .. .|.-..+. ..++..++|.+..+....++.... T Consensus 538 ~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~--~~p~~~~~~~~~~l~~~d~p----~~~el~~~ir~~~~~~~~~~~~~~e 611 (714) T protein:vir:32 538 NTHIALAPVQQTPAFKAQLAQRMSEVIQG--LPPQVQAVVLDLWVNLLDVP----QKQEFVERIRAALGTPKSPDEMTPE 611 (714) T ss_pred eEEEEEeeccCchHHHHHHHHHHHHHHhh--cCchhhhhHHHHHHHhcCCC----CHHHHHHHHHHHcCCCCCccccchh Confidence 11111111 3345566666654 221111 11 11112221 123344444333221111111110 Q ss_pred hhhhhHHHHhHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----- Q lcl|NC_019408. 506 QRGYTNRGQELEQSRM-----AREADFTQQKIDIQERSVAVQEGHAEVAHAAGSTSISGSRKLGDPEQAKPAVAD----- 575 (612) Q Consensus 506 ~~~e~~r~~~~e~~r~-----~~e~e~~~q~~e~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~e~q~k~~~~e----- 575 (612) +....+.++.++++.. +.+++.++-+++.++-+++. ++...+.+......+.++......+...++.. T Consensus 612 ~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a--~~~~~~a~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~ 689 (714) T protein:vir:32 612 EQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAA--QRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQN 689 (714) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhh Confidence 0000111111111110 01111111111111111111 00000000000000000000000000000000 Q ss_pred -HHHHHHH-HHHHHhhccccCCCch Q lcl|NC_019408. 
576 -QATIDNA-KKQTANAAKVAAQPPA 598 (612) Q Consensus 576 -q~~~~~~-~k~~~~~a~~~~~~~~ 598 (612) +...... ++..+..+.+..+=+- T Consensus 690 ~~~~~~~~~~q~~q~~~~~~~~~~~ 714 (714) T protein:vir:32 690 MEQEQDVLQQQMLYTLQQRMNEMSL 714 (714) T ss_pred hhhhhHHHHHHHHHHHHHHHHhcCC Confidence 0000000 0000000000000000 No 96 >protein:vir:9263 Length: 725 # NCBI annotation: 1 # Family: family:all:487 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720327;genbank:gi:24371585;genbank:GeneID:955785 Probab=98.63 E-value=1.6e-07 Score=57.79 Aligned_cols=573 Identities=10% Similarity=0.012 Sum_probs=167.4 Q ss_pred cHHHHHHH-HHHHHHHHHhcChHHHHhcc---cccC--CCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCcee Q lcl|NC_019408. 4 HPEYQYWR-PEWTKLRDVMAGQREIKRKA---EAYL--PAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIV 77 (612) Q Consensus 4 hP~y~~~~-~~W~~i~d~~~G~~~vr~~g---~~YL--Pk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~ 77 (612) -|+=...+ ..-..++.++.....+|... ..|. =+|+.+.....+. ..|-+ +|.++++|++++|.-=+..+.+ T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~-q~rp~-~N~i~~~i~~v~g~e~~nr~d~ 78 (725) T protein:vir:92 1 MADNENRLESILSRFDADWTASDEARREAKNDLFFSRISQWDDWLSQYTTL-QYRGQ-FDVVRPVVRKLVSEMRQNPIDV 78 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHh-cCCCc-ccchHHHHHHHHhhHHhCCcce Confidence 11111111 11112223333333333211 1121 2454444443333 34444 5999999999999876666655 Q ss_pred ecCC---------HHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEE--EEecCcchhhhhccCceEE-Eec-hhhh- Q lcl|NC_019408. 78 KNLP---------PKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGV--LVDVVDNPRKGAVATSFAV-GYS-AENI- 143 (612) Q Consensus 78 ~~~p---------~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~v--lVD~p~a~~~~~~~rPy~~-~~~-ae~I- 143 (612) .-+| +.|..++..+ .+-++.+.-+..+|..++..|.+|+ ..||...+......+.-+. +|. 
..+| T Consensus 79 ~v~P~~~~d~~~Ae~l~~~~~~~-~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~i~~~~i~~~~~~V~ 157 (725) T protein:vir:92 79 LYRPKDGASPDAADVLMGMYRTD-MRHNTAKIAVNVAVREQIESGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACSHVI 157 (725) T ss_pred EEecCCccHHHHHHHHHHHHHHH-HHhhCchHHHHHHHHHHhhcCcceeeeeecccCCCCCCCceeeEEeeccCChhhcc Confidence 3223 2344444433 2245666778899999999999984 4577432221111111111 122 3333 Q ss_pred hcchhhhccCCccceeEEEEEEEeecc-------ccc-cCCCcccccce----------eeeeeEeeeccccccccee-- Q lcl|NC_019408. 144 LDWDEVVDMGGFYVPSRVLLREFVRDL-------RWK-SDIEPLTTAQA----------RKARAAALASGSASSPMVR-- 203 (612) Q Consensus 144 inW~~~~~v~g~~~Lt~v~l~E~v~~~-------~~~-~~~d~f~~~~~----------~q~r~l~l~~g~~~~~~~~-- 203 (612) +||..+. .|+. .--|+.++..+... .+. +..+.++.... ..+|+.. +..+... T Consensus 158 ~Dp~a~~-~D~s-Dar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~vrv~e-----~~~r~~~~~ 230 (725) T protein:vir:92 158 WDSNSKL-MDKS-DSRHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQDTIQIAE-----FYEVVEKKE 230 (725) T ss_pred cCchhhc-cChh-hHHHHHHHhcCCHHHHHHHHhhcCcchhhhhhcccCCcccccccCCCeEEEEE-----EEEEEEEee Confidence 5664322 1111 00111111111100 000 00000000000 0011110 0000000 Q ss_pred ec---ccccccccceeeeeeeeccccc---ccccc--ccce----eEEEEEeeCCCceecceeeeccCCccccceeEEEe Q lcl|NC_019408. 204 QT---ARTLGGYSYITVYRELKLEEIE---WPSGE--VKLA----YVQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFF 271 (612) Q Consensus 204 ~~---~~~~~g~~~~~~~R~~~~~~~~---~~~g~--~~~~----~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~ 271 (612) .. .....|.. ..|-...+..+. ...|. +..+ ....++...|.. +-+..+.. 
|-++||||+| T Consensus 231 ~~~~~~d~~~g~~--~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~--~l~~~~~~---~~~~~P~vP~ 303 (725) T protein:vir:92 231 TAFIYQDPVTGEP--VSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTA--VLKDKQLI---AGEHIPIVPV 303 (725) T ss_pred eEEeecCCCCCce--eecChhhHHHHHHHHhccCchhhhhccceeeeEeeeeecchh--hhcCCCCC---CCCceeeEEE Confidence 00 00000000 000000000000 00010 0000 000111111111 10111111 2244565543 Q ss_pred ecC----CCCC---CcCcCchHHHHHHHHHHHhhhHHHHHHHHHh-ccceeeeecCCCCCCceEE------EeccccccC Q lcl|NC_019408. 272 GAS----GNTA---DVEKPPLLDICDLNLSHYRTYAELEYGRLFT-ALPVYYAPGTDSEGTGEYH------IGPNMVWEV 337 (612) Q Consensus 272 ~~~----~~~~---~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~-~~P~l~i~G~~~~~~~~l~------iG~~~~~~l 337 (612) ... ++.+ +.-. ++.|.=. .-.++.|.-+ +++..+ ..+..+-.|..+....... +...+.+.. T Consensus 304 ~g~r~~~~g~~~~~G~vr-~~kd~Q~--~~N~~~S~~~-~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~ 379 (725) T protein:vir:92 304 FGEWGFVEDKEVYEGVVR-LTKDGQR--LRNMIMSFNA-DIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDE 379 (725) T ss_pred EeeeeccCCcccccceec-cchhHHH--HHHHHHHHHH-HHHHhccCcccccchhhhhHHHHHHhccCccceeecccccc Confidence 211 1111 1111 1112111 1123333333 333332 2222222221110000001 111111111 Q ss_pred CCC----CceeEEecCchhHHHHHHHHHHHHHHHHHH-HHH--hhhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 338 PQG----SEPGILEYTGQGLKALETALNDKERQIAAI-GGR--MMPGASKSVSESNNQTVLREANEQSLLLNIIQACESG 410 (612) Q Consensus 338 p~~----~~~~~lE~~g~~l~~~~~~l~~~e~qm~~l-Ga~--ll~~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a 410 (612) ..| ..+.+..+..-+ ..+..-|+.....|..+ |.. 
++-..+ .+.|+.+...+..+..-.|+.+..|+..+ T Consensus 380 ~~g~~~~~~i~~~~~~~~p-~~~~~ll~~~~~~i~~~tGi~~~~lG~~~--n~~SG~ai~~rq~qg~~~l~~~~Dnl~~~ 456 (725) T protein:vir:92 380 NNGEMPTQPLAYYENPEVP-QANAYMLEAATAAVKEVATLGVDAEAVNG--GQVAYDTVNQLNMRADLETYVFQDNLATA 456 (725) T ss_pred ccccccccCCcccCCCCch-HHHHHHHHHHHHHHHHHhCCCHHHhccCc--hhhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111 134444433222 23455556555555433 432 332222 23566666666666666677777777777 Q ss_pred HHH----HHHHHHHHcCCcC------CCCcceEEEeecc------------------cccc----CC----CHHHHHHHH Q lcl|NC_019408. 411 MTD----VVRWWLMWRDVPL------ADTENLRYEVNTD------------------FLST----PI----GAREMRAIQ 454 (612) Q Consensus 411 ~~~----~l~~~a~w~g~~~------~~~~~~~v~ln~d------------------F~~~----~~----d~~~~~al~ 454 (612) ... +|.++..+++..- .++..-.+.||.. |+.. +. ..+.+..|. T Consensus 457 ~~~~g~~lL~lI~~~~~~~r~~RI~~edg~~~~v~in~~~~~~~~G~~~~~Ndi~g~~Dv~v~~~p~~~s~r~~~~~~l~ 536 (725) T protein:vir:92 457 MRRDGEIYQSIVNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEIL 536 (725) T ss_pred HHHHHHHHHHHHHHhcCCCcEEEEecCCCCcceEEeccccccccccchhhhhccccceeeEEeeccChHHHHHHHHHHHH Confidence 665 5666667764211 1111122333321 1110 00 112334444 Q ss_pred HHHHcCC-CCH---HHHHHHHHhcCccchhhhhHHHHHHhhccccccc--cchhHHhhhhhhHHH-HhHHHHH------- Q lcl|NC_019408. 455 LMANDGL-LPD---PVFYEYMRKAEVISSDMTFEEFQALRADENSFIN--NPDAQARQRGYTNRG-QELEQSR------- 520 (612) Q Consensus 455 ~~~~~G~-is~---et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~--~~~~~~~~~~e~~r~-~~~e~~r------- 520 (612) ++..+-- +.- .++..++ .+++-. ..++..+++..+.+... .+.....+....+.+ .+..++. 
T Consensus 537 ql~~~~~~~~~~~~~~l~~~~---~~~d~~-~~~e~~erirkq~~~~~~~~~~~~e~~q~~~~~qqa~~~q~~~e~~~~q 612 (725) T protein:vir:92 537 ELLGKTPQGTPEYQLLLLQYF---TLLDGK-GVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQ 612 (725) T ss_pred HHHHhcccchhHHHHHHHHHh---hcccch-HHHHHHHHHHhhhchhccCCccchhhhHHHHHHHHHHHhhhHHHHHHHH Confidence 4443211 100 1111111 111100 11233343432221111 110011000000000 0000000 Q ss_pred ---HHHHHHHHHHH---------HHHHHHHHHHHHHHHHHHHHH------------HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 521 ---MAREADFTQQK---------IDIQERSVAVQEGHAEVAHAA------------GSTSISGSRKLGDPEQAKPAVADQ 576 (612) Q Consensus 521 ---~~~e~e~~~q~---------~e~~~r~~~~~~~r~~~e~~~------------~~~~~~~~r~~~~e~q~k~~~~eq 576 (612) ...+++.++.. +...+.+++..+.+......+ .+...+.+.+.++++ + +.+|+ T Consensus 613 a~~~~~qae~~kaqaE~~k~q~~a~~~~~~a~~~aa~~~~~~~q~~~~q~~~~~~~~~~~~~~q~~~~~~a-~--~~ae~ 689 (725) T protein:vir:92 613 GVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDA-R--ANAEL 689 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHH-H--HhchH Confidence 00011111111 111111111100000000000 000011110000000 0 01111 Q ss_pred -HHHHHHHHHHHhhccccCCCchhhcC----CCCCc Q lcl|NC_019408. 577 -ATIDNAKKQTANAAKVAAQPPAPAAP----GAPPT 607 (612) Q Consensus 577 -~~~~~~~k~~~~~a~~~~~~~~~~~~----~~~~~ 607 (612) .+.+.+.++++....++....-..++ +.-|+ T Consensus 690 ~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~ 725 (725) T protein:vir:92 690 LLKGNEQTHKQRMDIANILQSQRQNQPSGSVAETPQ 725 (725) T ss_pred HHHHHHHHHHHHHHHHHHhcchhccCCccccccCCC Confidence 11111122222222222222111111 11233 No 97 >protein:vir:105619 Length: 772 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164304;genbank:gi:56692922;genbank:GeneID:3197230 Probab=98.58 E-value=2.4e-07 Score=56.89 Aligned_cols=574 Identities=12% Similarity=0.079 Sum_probs=192.6 Q ss_pred CCCcH---HHHHHHHHHHHHHHHhcChHHHHhcccc---cC--CCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhc Q lcl|NC_019408. 
1 MVTHP---EYQYWRPEWTKLRDVMAGQREIKRKAEA---YL--PAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFR 72 (612) Q Consensus 1 ~~~hP---~y~~~~~~W~~i~d~~~G~~~vr~~g~~---YL--Pk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~ 72 (612) +.--| .-.-....|..+...+.+...+|....+ |. =+|+.+.....+.+-.-.+.+|.++.+|+..+|.--+ T Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~r~~a~~d~~fy~G~QW~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g~~~~ 90 (772) T protein:vir:10 11 LNGLPPAGDTPLTVDEYADINYEIEDQPAWRAVADKEMDYADGNQLDTELLRRQQALGIPPAVEDLIGPALLSLQGYEAV 90 (772) T ss_pred hccCCcccccccCHHHHHHHHHHHhccHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEEcchHHHHHHHHHHHHh Confidence 11000 0000001233333334444444421111 21 2666666677777778888999999999999999999 Q ss_pred CCceeecCCH----------HHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhh Q lcl|NC_019408. 73 RDPIVKNLPP----------KFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAEN 142 (612) Q Consensus 73 k~p~~~~~p~----------~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~ 142 (612) ..+.+.-.|. .|..++.. ..+-++.+.-+..+|..++.+|++|+=|++-... .+...++..+++.+ T Consensus 91 nr~d~~v~Pr~~~~d~~~Ae~l~~~~~~-~~~~~~~~~~~s~Af~~~i~~G~Gw~e~~~~~d~---~~~~i~i~~v~p~~ 166 (772) T protein:vir:10 91 TRTDWRVTPNGDVGGQEVADALNYRLNT-AERQSGADRACSEAFRPQIACGIGWVEVSRESDP---FKFPYRCRPIRRDE 166 (772) T ss_pred cCcceEEecCCCchHHHHHHHHHHHHHH-HHHhcChHHHHHHHHHHhhhcCceeEEeccccCC---CCCCeEEEeeCccc Confidence 8887743342 12222222 2223556677899999999999999887774332 22344566666666 Q ss_pred hhcchhhhccCCccceeEEEEEEEeeccc----cccC------------------------------------------- Q lcl|NC_019408. 143 ILDWDEVVDMGGFYVPSRVLLREFVRDLR----WKSD------------------------------------------- 175 (612) Q Consensus 143 IinW~~~~~v~g~~~Lt~v~l~E~v~~~~----~~~~------------------------------------------- 175 (612) |+ |+....-|. ...-|+.....+.... |.+. 
T Consensus 167 v~-~Dp~a~~D~-sDar~~~~~~~~~~d~~~~~fp~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 244 (772) T protein:vir:10 167 IH-WDMKCGDDW-EACRFLRRQRWLSPDRIALVFPEHAELIGMVGKYGSTWWGQPDLGMMEGGTSTGLHNAWNEARAWTV 244 (772) T ss_pred ce-ecCCCCCCH-HHhhhhhhhccCCHHHHHHhCCCchhHHHhhhhhcccccCcccccccccccccccccccchhhcccc Confidence 53 322111111 1111222111111000 0000 Q ss_pred -CCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeecc-ccccccccccc------eeEEEEEeeCCC Q lcl|NC_019408. 176 -IEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLE-EIEWPSGEVKL------AYVQYLYEEDPE 247 (612) Q Consensus 176 -~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~-~~~~~~g~~~~------~~~~~~~~~~~~ 247 (612) .+.+-......+|+... + |...+.........| .+..|...... ......|.+.. +..+.++..+ T Consensus 245 ~~~~~~~~~~~rVrv~E~--w-~r~~~~~~~~~~~~g--~~~~~~~~~~~~~~~l~~g~~~~~~~~~~rv~~~~~~g~-- 317 (772) T protein:vir:10 245 QEDHWYNPTSKEICLVEL--W-YRRWVQVHVLKSPDG--RVVEYDPNNLAHNIALASGRISPKKVTVSRVRRSYWLGP-- 317 (772) T ss_pred ccccccccCCceEEEEEE--e-eeeeeeeeeeccCCC--ceEeeCcccHHHHHHHhhcccchheeeeeEEEEEEEecc-- Confidence 00000000001111100 0 000000000000111 00001000000 00000000000 1111222111 Q ss_pred ceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHH--HHHHhhhHHHHHHHHHhccceeeeecCCCCCCc Q lcl|NC_019408. 248 SRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLN--LSHYRTYAELEYGRLFTALPVYYAPGTDSEGTG 325 (612) Q Consensus 248 ~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~ln--l~HY~~~sD~~~~l~~~~~P~l~i~G~~~~~~~ 325 (612) ........| ..+..+++|||+.+.....+...|. ..++-+.. 
+..|.+ -+-++|...+ +..-.|.-+..++ T Consensus 318 ~~L~~~~~p-~~~~~fP~vP~~g~r~~~~g~~~G~--vr~~kd~Qr~~N~~~S--~~~~~l~~~~--~~~~~gav~~~d~ 390 (772) T protein:vir:10 318 HCLHDGPTP-YTHRHFPYVPFFGFREDATGIPYGY--VRGMKYAQDSLNSGVS--KLRWGMSVAR--VERTKGAVAMTDA 390 (772) T ss_pred eeeccCCCC-CCCCccceEEEeeeEeccCCcccch--hhhhhhHHHHHHHHHH--HHHHHHhccc--ccccCCCccchhH Confidence 100000011 1122244444433321111111111 11222221 111222 2344554443 3333443222111 Q ss_pred eE--EE-eccccccCC------CCCceeEEecCchhHHHHHHHHHHHHHHHHHH-HHH--hhhccccchhHHHHHHHHHH Q lcl|NC_019408. 326 EY--HI-GPNMVWEVP------QGSEPGILEYTGQGLKALETALNDKERQIAAI-GGR--MMPGASKSVSESNNQTVLRE 393 (612) Q Consensus 326 ~l--~i-G~~~~~~lp------~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~l-Ga~--ll~~~~~~~~esa~~~~~~~ 393 (612) .+ .+ =+++.+.+- .|..+.+..+ ......+.+.|+...+.|..+ |.. ++-..+ ...|+.+...+. T Consensus 391 ~~~e~~arp~~vi~~~~~~~~~~~~~~~~~~~-~~~~~~~~~llq~~~~~i~~vsGv~~~~lG~~~--na~SGvAi~~rq 467 (772) T protein:vir:10 391 QFRRQIARPDADIVLDENHMAKPGARFDVKRD-YTLTDQHFQMLQDNRATIERVSNITAGFQGRKG--TATSGIQEQQQI 467 (772) T ss_pred HHHHhccCCCCeEEeCCccccCCCCCccccCC-ccccHHHHHHHHHHHHHHHHHhCCCHHHcCCCc--chhhHHHHHHHH Confidence 11 01 122233222 2345555443 333344566666666666554 332 222222 234666655555 Q ss_pred HHHHHHHHHHHHHHHHHHHH----HHHHHHHHcCCcC------CCC--cceEEEeec--------------cccccCCC- Q lcl|NC_019408. 394 ANEQSLLLNIIQACESGMTD----VVRWWLMWRDVPL------ADT--ENLRYEVNT--------------DFLSTPIG- 446 (612) Q Consensus 394 ~~~~s~L~~~a~~~~~a~~~----~l~~~a~w~g~~~------~~~--~~~~v~ln~--------------dF~~~~~d- 446 (612) .+..-.|+.+-.|+..+... +|.++..|++..- .+. ..-.+.||. 
|......| T Consensus 468 ~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~y~~er~~RI~~~d~~~~~~~v~in~~~~d~~tg~~~~~NDi~~g~yDv 547 (772) T protein:vir:10 468 EQSNQSIGRIMDNFRAGRTLVGELLLAMIVEDIGQERTEVVIEGDAVTADRVVVLNEPQRDPQTGAAYLSNDLLRTRIKV 547 (772) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCCCCceEEeccceecccccccceeccceeeeEEE Confidence 55555677777777777764 5677777764210 000 011122331 11111111 Q ss_pred ------------HHHHHHHHHHHHcCCCCHHHH---HH-HHHhcCccchhhhhHHHHHHhhccccccccchhHHhhhhhh Q lcl|NC_019408. 447 ------------AREMRAIQLMANDGLLPDPVF---YE-YMRKAEVISSDMTFEEFQALRADENSFINNPDAQARQRGYT 510 (612) Q Consensus 447 ------------~~~~~al~~~~~~G~is~et~---~~-~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~ 510 (612) .+.+.+++++... +.-+.. .. .+.-..+. ..++..+++.+..+.. .++++.. T Consensus 548 ~i~~~p~~~t~r~~~~~~m~ql~~~--~~P~~~~~~~~~~le~~D~p----~~~ei~~~ir~~~~~~-~peq~~~----- 615 (772) T protein:vir:10 548 ALEDVPSTNSYRGQQLNAMSEAVKS--MPPQYQAAVLPFLVSLMDVP----FKRDVVEAIRAVDQQQ-TPEQIQQ----- 615 (772) T ss_pred EeeccccchHHHHHHHHHHHHHHhc--cChhHHHHHHHHHHhhcCCC----ChHHHHHHHHHHhccC-ChHHHHH----- Confidence 2344555555543 222111 11 11111111 1234444444322111 1111111 Q ss_pred HHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHH Q lcl|NC_019408. 511 NRGQELEQSRMAREADFTQQKIDIQERSVAVQEGHAEVAHAAGSTS-ISGSRKLGDPEQAKPAVADQATIDNAK--KQTA 587 (612) Q Consensus 511 ~r~~~~e~~r~~~e~e~~~q~~e~~~r~~~~~~~r~~~e~~~~~~~-~~~~r~~~~e~q~k~~~~eq~~~~~~~--k~~~ 587 (612) +.++.+.++.+...++++.++.+.+.++.+.++++..+++.+...+ +..+-+.+ +...+.-..+....+. .+.. 
T Consensus 616 ~~~q~~qq~~~~~~~el~~~q~~a~~~~~~A~a~~~~aqa~~~~~~a~~~a~~aa---~~~~q~~q~a~~ad~~l~~~g~ 692 (772) T protein:vir:10 616 QIDQAVQDALAKAGNDIKLRELEIKERKADSEISGLNAKAVQIGVQAAFSAMQAG---AQIAQMPMIAPIADAVMQSAGY 692 (772) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh---hhHHhhhhhhHHHHHHHHhccc Confidence 1011111111111122222222222222222222111111111100 00000000 0000000001111111 0100 Q ss_pred hh-------------cc-------------ccCCCchhhcCCCCCcccCCC Q lcl|NC_019408. 588 NA-------------AK-------------VAAQPPAPAAPGAPPTNRRPT 612 (612) Q Consensus 588 ~~-------------a~-------------~~~~~~~~~~~~~~~~~~~~~ 612 (612) .. +. .+++++....+..+|+++-|+ T Consensus 693 ~~~~~~~~~~~~p~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~ 743 (772) T protein:vir:10 693 QRPNPAGDDPNYPIADQTAAMNIRSPYIQGQGPAAEAEAESVSVRRNTSPT 743 (772) T ss_pred ccccccccCCCCCCCCCccCCCCCccCCCCCCCCCccccCCCCCccCCCCC Confidence 00 00 001111111222235566665 No 98 >protein:vir:345 Length: 663 # NCBI annotation: virion structural protein # Family: family:all:3199 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203459;genbank:gi:15320615;genbank:GeneID:921720 Probab=98.57 E-value=2.6e-07 Score=56.68 Aligned_cols=538 Identities=13% Similarity=0.104 Sum_probs=216.1 Q ss_pred CCCcHHHHHHHHHHHH-HHHHhcChHHHHhc----ccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCc Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTK-LRDVMAGQREIKRK----AEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDP 75 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~-i~d~~~G~~~vr~~----g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p 75 (612) |--||+- ...+|.. |...-.+-..|+.+ -+.|+=-...-+... 
.-||.+-.|+.+|.=-|+.++| T Consensus 9 ~~~tpe~--la~~W~~~I~~a~~~~~~~h~r~~~~~k~y~~~~~~~~~~~--------~r~nl~~sni~~i~P~iYar~P 78 (663) T protein:vir:34 9 FADTPQG--WAQRWQEEMSAAREPLEKWHTQGKEIVKRYRDERDSAHDAE--------TRWNLFSTNIQTQMASLYGQTP 78 (663) T ss_pred chhcchh--HHHHHHHHHHHHHhccchHHHHHHHHHHHhhccccCCCccc--------cccchhhhhHHHHhhhhhcCCC Confidence 8888876 5677765 66665555555433 234541111111111 1369999999999999999999 Q ss_pred eee--------c--CCHHHHHHHhc-----cCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCc---------------ch Q lcl|NC_019408. 76 IVK--------N--LPPKFKDAVRR-----FAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVD---------------NP 125 (612) Q Consensus 76 ~~~--------~--~p~~l~~~~~d-----~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~---------------a~ 125 (612) ..+ + ++.....+++. +..+-..|+.-|+.+++..|.+||+-+=|=|-+ .+ T Consensus 79 ~p~V~~rf~d~d~~~~r~ase~leR~~~~~~~~D~~~l~~~~~~~v~d~ll~~rG~~~v~Ye~~~~~~~~~~~~~D~~~~ 158 (663) T protein:vir:34 79 KVSVSRRFADADDDVARVASELLERLLNTDIEKDSDTFQQALEYALQDRLLPGFGLCRIRYEVEWEEVAGVDAILDEATG 158 (663) T ss_pred cceeeecccCcccchhhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhhccccceEEEEeecccchhccccccCCCccc Confidence 773 1 23334444444 455556799999999999999998888887721 11 Q ss_pred h-hhhccCceEEEechhhhhcchhhhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceee Q lcl|NC_019408. 126 R-KGAVATSFAVGYSAENILDWDEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQ 204 (612) Q Consensus 126 ~-~~~~~rPy~~~~~ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~ 204 (612) + .+.+.-| -.++-| +.-. +-+|.- + .|.......|-...+.-. +++-. T Consensus 159 ~~~a~~~~~-------~e~~a~-E~v~------id~v~~-~------------dfl~~pAr~W~ev~wva~----r~~mt 207 (663) T protein:vir:34 159 AELAAAVPP-------TQRKAY-ECVE------TDYLHW-Q------------DVLWSPARVWHEVRWLAF----RNLLD 207 (663) T ss_pred cchhccccc-------chhhcc-ccee------eeeech-h------------hcccchhhccccccceee----eccCC Confidence 1 1111111 122222 1000 011100 0 011111111110000000 00000 Q ss_pred cccccccccceeeeeee-----ecc-ccccccccc----cceeEEEEEeeCCCce-ec-------ceeeeccCC-ccccc Q lcl|NC_019408. 
205 TARTLGGYSYITVYREL-----KLE-EIEWPSGEV----KLAYVQYLYEEDPESR-PI-------ARIVPTVRG-EPLDF 265 (612) Q Consensus 205 ~~~~~~g~~~~~~~R~~-----~~~-~~~~~~g~~----~~~~~~~~~~~~~~~~-~~-------~~~~p~~~g-~~l~~ 265 (612) .+. ..+.|....|+.. ... .....+|.. +..-+|+||+..+.-- ++ -++-|..-| .-+-- T Consensus 208 k~e-~~~rf~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~a~VwEIWdK~~~~V~w~~eg~~~~L~~~~p~lgl~~ffP 286 (663) T protein:vir:34 208 MRE-FNARFDADGSRNLWASVPKVGKPKDGKDGQSCHPWDRAEVWEIWDKGGRKVDWYVEGYSAVLDTQPDPLGLESFFP 286 (663) T ss_pred HHH-HHHhhcCChhhhhhhhccCcCCccccCCCCCcchhcCcceeEEEecCCcEEEEEEcCcceecccCCCCCCCCCCCC Confidence 000 0001111111100 000 011111111 2334566665543211 00 111122222 11234 Q ss_pred eeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHhccceeeeecCCCCCCce-EE------EeccccccC- Q lcl|NC_019408. 266 IPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFTALPVYYAPGTDSEGTGE-YH------IGPNMVWEV- 337 (612) Q Consensus 266 IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~~~~~~~~-l~------iG~~~~~~l- 337 (612) +||..++...++..+..|++.=--.+.-+| +..++--+.|.-+--|.-++++...++... |. +.|=..|.+ T Consensus 287 cPrpl~~~~~~ds~ipvpd~~~y~~~~~E~-n~~t~Rin~l~d~ikv~gvy~~~~g~~i~~~l~~a~~n~lvpV~~~~~~ 365 (663) T protein:vir:34 287 CPKPLLANWTTDKVVPRPDFVLAQDLYKEI-DLVSTRITLLERAIRVVGVYDKSSGLTIGRLLSEAAQNDLIPVENWLTF 365 (663) T ss_pred CcccccceecCCCeecCCcHHHHHHHHHHH-HHHHHHHHHHHhhhhhceeeccccchhHHHHHHHhhCCCceecchhhhh Confidence 688888888888888888876222233333 334444445554445555553111111000 10 111122222 Q ss_pred -CCCCc---eeEEecCc--hhHHHHHHHHHHHHHHHHHH-HH-HhhhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 338 -PQGSE---PGILEYTG--QGLKALETALNDKERQIAAI-GG-RMMPGASKSVSESNNQTVLREANEQSLLLNIIQACES 409 (612) Q Consensus 338 -p~~~~---~~~lE~~g--~~l~~~~~~l~~~e~qm~~l-Ga-~ll~~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~ 409 (612) ..||- ..|+-+++ .+|..+...=..+..-.+++ |. 
..+.+ .-..++|+++..+......--++-+...|++ T Consensus 366 ~~~gg~~k~I~~~pi~~~~~aI~~l~~~r~qir~d~~qITGiaDi~Rg-a~~a~ETatAQ~IKsq~gS~RIqe~qdevqR 444 (663) T protein:vir:34 366 ADKGGLRGVVDWFPLEPVVAALTSLRDYRRELVDALHQVTGMADIMRG-ASDPRETAMAQGVKAKFGSIRLQRLQDEVAR 444 (663) T ss_pred hhhcCccchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHhHHHHhhc-ccCcchhhHHHHHHHHHHhHHHHHHHHHHHH Confidence 22332 33444443 34444444444444444322 32 33433 3456899999998887766678888888888 Q ss_pred HHHHHHHHHHHHc-------------CCcCC---------------CCcceEEEeeccccccCCCH-H---H-------- Q lcl|NC_019408. 410 GMTDVVRWWLMWR-------------DVPLA---------------DTENLRYEVNTDFLSTPIGA-R---E-------- 449 (612) Q Consensus 410 a~~~~l~~~a~w~-------------g~~~~---------------~~~~~~v~ln~dF~~~~~d~-~---~-------- 449 (612) ...++.+++|..+ |..+. ....+.+.+-.|=-... |. . . T Consensus 445 ~arDi~ql~AEIl~~~~~~etl~~m~~~elp~~~ei~~~~~~L~n~~~r~~~ldIe~dsT~~~-D~~~eK~~~~E~l~~i 523 (663) T protein:vir:34 445 FASDIQRLKAEVIAEHYDVASILAQANAEFTFDKELAPKAAELIKSRFSMYRVEVKPEAVSLQ-DFAALRNEKMEVLSGI 523 (663) T ss_pred HHHHHHHHHHHHHHHhcCHHHHHHHhcCCCCcccchhHHHHHHhcCCCcceeeeeccCCCCcC-ChHHHHHHHHHHHHHH Confidence 8888888887763 22222 11223344433211111 11 1 1 Q ss_pred ---HHHHHHHHHcCCCCHHHHHHHHHh--cCccchhhhhHHHHHHhhccccccc-cchhHHhhhhhhHHHHhHHHHHHHH Q lcl|NC_019408. 450 ---MRAIQLMANDGLLPDPVFYEYMRK--AEVISSDMTFEEFQALRADENSFIN-NPDAQARQRGYTNRGQELEQSRMAR 523 (612) Q Consensus 450 ---~~al~~~~~~G~is~et~~~~lqr--~~vl~~~~~~eee~~ria~e~~~~~-~~~~~~~~~~e~~r~~~~e~~r~~~ 523 (612) ++++.-+.+.+-.....+.+.|+- +++ ....+.+..++++....+... ++++.....+..+-+.+.++... 
T Consensus 524 ~~~~qq~~pl~~q~p~~~p~l~Ellk~~~~~f-~~~~qie~ai~~~~~~~e~aa~~~~~~~pa~~~~~~k~~~~q~k~-- 600 (663) T protein:vir:34 524 ASFMQGVAPLAQQVPGSAPFLLQMLKWSVSGL-RGSSTIEGVLDKAIAAAEEAQKQAAQQSPAPQQPDPKVVAQAMKG-- 600 (663) T ss_pred HHHHHHHHHHHHhhhhhHHHHHHHHHHHhhcC-ChhhhHHHHHHHHHhhhHHHhhccCCCCcccchhhHHHHHHHHHH-- Confidence 122222334555555544443331 233 335566666666654322111 11111111111111111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCchhhcCC Q lcl|NC_019408. 524 EADFTQQKIDIQERSVAVQEGHAEVAHAAGSTSISGSRKLGDPEQAKPAVADQATIDNAKKQTANAAKVAAQPPAPAAPG 603 (612) Q Consensus 524 e~e~~~q~~e~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~e~q~k~~~~eq~~~~~~~k~~~~~a~~~~~~~~~~~~~ 603 (612) +.+.+....+.|......+.+..+...++.++ +++.++++-+..-+...-|+- .+.-.-.| T Consensus 601 q~~~aeAq~e~q~~~~~~ql~~~~~~~k~~~~------------------a~~~~~~a~q~~~~~~~~r~~-~~~a~~~~ 661 (663) T protein:vir:34 601 QQEMAKVQAEVQGDLLRIQAETQANETKERQQ------------------AEWNVREAAQKNLISQAARAM-NPQARNGG 661 (663) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------------------HHHHHHHHHHhhHHHHHHHhh-chhhhcCC Confidence 11111111111111111111111111111111 111111111111000000000 01111122 Q ss_pred CC Q lcl|NC_019408. 604 AP 605 (612) Q Consensus 604 ~~ 605 (612) -| T Consensus 662 ~~ 663 (663) T protein:vir:34 662 MP 663 (663) T ss_pred CC Confidence 22 No 99 >protein:vir:104437 Length: 714 # NCBI annotation: putative phage portal protein # Family: family:all:487 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794061;genbank:gi:116222006;genbank:GeneID:4397502 Probab=98.55 E-value=2.9e-07 Score=56.39 Aligned_cols=566 Identities=11% Similarity=0.037 Sum_probs=181.8 Q ss_pred CCCcHHHHH----HHHHHHHHHHHhcChHHHHhc---ccccC--CCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhh Q lcl|NC_019408. 1 MVTHPEYQY----WRPEWTKLRDVMAGQREIKRK---AEAYL--PAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVF 71 (612) Q Consensus 1 ~~~hP~y~~----~~~~W~~i~d~~~G~~~vr~~---g~~YL--Pk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf 71 (612) .-.+|+=.. ....|..+...+.+...+|.. -..|. 
=+|+.+....-+.+-.-.+.+|.++++|+..+|.-= T Consensus 8 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g~~~ 87 (714) T protein:vir:10 8 TAMKNDHGSTPRFSQRQLLSLCSDIDSQPLWRDAANKACAYYDGDQLAPEVIQVLKDRGQPMTIHNLIAPTVDGVLGMEA 87 (714) T ss_pred ccCCCcchhhhhhhHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccHHHHHHHHHHHHH Confidence 111111100 011122222223344444321 11122 255555555556666677889999999999999998 Q ss_pred cCCceeecCCH-----------HHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEE--EEecCcchhhhhccCceEEEe Q lcl|NC_019408. 72 RRDPIVKNLPP-----------KFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGV--LVDVVDNPRKGAVATSFAVGY 138 (612) Q Consensus 72 ~k~p~~~~~p~-----------~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~v--lVD~p~a~~~~~~~rPy~~~~ 138 (612) +..+.+.-.|. .|..++. ...+-+..+.-+..+|..++.+|.+|+ .+||-. .+..+++..+ T Consensus 88 ~nr~~~~v~pr~~~~~~~~~Ae~l~~~~~-~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~d~d~-----~~~~i~i~~v 161 (714) T protein:vir:10 88 KTRTDLIVMSDDPNDETEKLAEAINAEFA-DACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSEP-----FGPEFKVSTV 161 (714) T ss_pred hCCcceEEecCCCChhhHHHHHHHHHHHH-HHHHhhchhHHHHHHHHHhhhcccceEEeeeccCC-----CCCCeEEEec Confidence 88887742331 1222222 222234566778899999999998887 566532 2456778888 Q ss_pred chhhhhcchhhhccCCccceeEEEEEEEeeccc----cccC----------CCcccc--------------------cce Q lcl|NC_019408. 139 SAENILDWDEVVDMGGFYVPSRVLLREFVRDLR----WKSD----------IEPLTT--------------------AQA 184 (612) Q Consensus 139 ~ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~----~~~~----------~d~f~~--------------------~~~ 184 (612) +|.+|+ |+....-.......|+..+..+.... +.+. ..+|.. .+. 
T Consensus 162 ~p~~v~-~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fp~~a~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 240 (714) T protein:vir:10 162 SRNEVF-WDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDR 240 (714) T ss_pred Chhhee-eccccccCChhhhhhhhhhccCCHHHHHHhcCCchhhhhccchhhcCcccchhhhhhcccccccchhhccccc Confidence 887765 43221111111122222221111000 0000 000000 000 Q ss_pred e--------eeeeEeeecccccccceeecccccccccceeeeeeeec--------cccccccccccceeEEEEEeeCCCc Q lcl|NC_019408. 185 R--------KARAAALASGSASSPMVRQTARTLGGYSYITVYRELKL--------EEIEWPSGEVKLAYVQYLYEEDPES 248 (612) Q Consensus 185 ~--------q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~--------~~~~~~~g~~~~~~~~~~~~~~~~~ 248 (612) . .-|++...-+ +...+.........|. +..|-.... ..+......+. ...+.++. T Consensus 241 ~~~~~~~~~~~rV~v~E~w-~k~~~~~~~~~~~~g~--~~~~d~~~~~~~~~~~~g~~~~~~~~~~-rv~~~~~~----- 311 (714) T protein:vir:10 241 QQNEWLQRERRRVLLQVVY-YRTFERLPVIELSNGR--VVAFDKNNLMQAVAVASGRVQVKVGRVS-RIREAWFV----- 311 (714) T ss_pred ccccccccCcceEEEEEEE-EeEEEEEEeecCCCCC--eeeeCccCHHHHHHHHhccceeccccee-eEEEEEEe----- Confidence 0 0000000000 0000000000000000 000000000 00000000000 00011111 Q ss_pred eecceeeeccCCcc--ccceeEEEeecC---CCCCCcC-cCchHHHHHHHHHHHhhhHHHHHHHHHhccceeeeecCCCC Q lcl|NC_019408. 249 RPIARIVPTVRGEP--LDFIPFKFFGAS---GNTADVE-KPPLLDICDLNLSHYRTYAELEYGRLFTALPVYYAPGTDSE 322 (612) Q Consensus 249 ~~~~~~~p~~~g~~--l~~IP~v~~~~~---~~~~~~~-~pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~~~~ 322 (612) +..+...+-.| -+++|||+|... ..+...| .-.+.|.-. .+.++++. +.++|.-.+ +.+..|.... 
T Consensus 312 ---g~~~L~~~~~p~p~~~fp~vP~~g~~~~~~g~~~G~vr~~~d~Qr-~~N~~~s~--~~~~l~~~~--~~~~~gav~~ 383 (714) T protein:vir:10 312 ---GPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQD-EVNFRRIK--LTWLLQAKR--VIMDEDATQL 383 (714) T ss_pred ---cchhhhcCCCCCCCCceeeEEecceeeeccCccceehhhhhhHHH-HHHHHHHH--HHHHHhCCc--eeeccccccc Confidence 01111111122 234555544222 1111111 112333321 22234444 344553222 2333443322 Q ss_pred CCceEE--E-eccccccC-C--C-----CCceeEEecCchhHHHHHHHHHHHHHHHHHH-HHH--hhhccccchhHHHHH Q lcl|NC_019408. 323 GTGEYH--I-GPNMVWEV-P--Q-----GSEPGILEYTGQGLKALETALNDKERQIAAI-GGR--MMPGASKSVSESNNQ 388 (612) Q Consensus 323 ~~~~l~--i-G~~~~~~l-p--~-----~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~l-Ga~--ll~~~~~~~~esa~~ 388 (612) ++..+. + =++..+.+ | . ++.+... +...-...+...|+...+.|..+ |.. ++-..+ .+.|+.+ T Consensus 384 ~d~~~~e~~~rp~~vi~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~--na~SGvA 460 (714) T protein:vir:10 384 SDNDLMEQLERPDGIIKLNPVRKNQKSVADVFRVE-QDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDS--GATSGVA 460 (714) T ss_pred cHHHHHHhccCCCCeEEecccccccCCcccccccc-CCCCCcHHHHHHHHHHHHHHHHhhCCCHHHcCCCc--chhHHHH Confidence 221110 1 11223322 1 1 1223322 22222334556666666655544 322 221222 2345665 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHcCCcC---------CCCcceEEEeecc--------------cc Q lcl|NC_019408. 389 TVLREANEQSLLLNIIQACESGMTD----VVRWWLMWRDVPL---------ADTENLRYEVNTD--------------FL 441 (612) Q Consensus 389 ~~~~~~~~~s~L~~~a~~~~~a~~~----~l~~~a~w~g~~~---------~~~~~~~v~ln~d--------------F~ 441 (612) ...+..+..-.|..+..++..+... 
+|.++..|++..- .....-.+.+|.+ |+ T Consensus 461 I~~r~~qg~~~l~~~~dnl~~~~~~~g~~ll~li~~~~~~~rv~RI~~e~~~~~~~~~~~~n~~~~~~~~~nDi~~~~~d 540 (714) T protein:vir:10 461 ISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTH 540 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcccceeEeeccccCCccccccceeeeEE Confidence 5555555555677777777777654 5666667654211 0000112333321 11 Q ss_pred cc--------CCCHHHHHHHHHHHHcCC--CCHHHHHHHHHhcCccchhhhhHHHHHHhhccccccccchhHH-hhhhhh Q lcl|NC_019408. 442 ST--------PIGAREMRAIQLMANDGL--LPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFINNPDAQA-RQRGYT 510 (612) Q Consensus 442 ~~--------~~d~~~~~al~~~~~~G~--is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~-~~~~e~ 510 (612) .. ....+.+.+|++++.+.. +...++--.+.-..+. ..++..+++.+..+....++... .+.... T Consensus 541 v~i~~~p~~~s~r~~~~~~l~ql~~~~~p~~~~~~~~~~le~~d~p----~~~ei~~~ir~~~~~~~~~~~~~~e~q~~q 616 (714) T protein:vir:10 541 IALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVP----QKQEFVERIRAALGTPKSPDEMTPEEQEVA 616 (714) T ss_pred EEEeeccCcHHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCc----CHHHHHHHHHHHcCCCCCccccCcchhHHH Confidence 10 111223556666665421 1111111112222221 12334444433222111111110 000000 Q ss_pred HHHHhHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHH--HHHHHHHHHHHHHHHHH Q lcl|NC_019408. 511 NRGQELEQSR-----MAREADFTQQKIDIQERSVAVQEGHAEVAH----AAGSTSISGSRK--LGDPEQAKPAVADQATI 579 (612) Q Consensus 511 ~r~~~~e~~r-----~~~e~e~~~q~~e~~~r~~~~~~~r~~~e~----~~~~~~~~~~r~--~~~e~q~k~~~~eq~~~ 579 (612) +.++.++++. .+.+++.++.+++.++-+++......++.. ++.+....+... .++..++... .+|.-. 
T Consensus 617 ~~~~~~~~~q~~l~~~e~~a~~~k~eaea~~~~aqa~~~~~~a~~~~~~~~~q~~~~~~~~a~~a~~l~~~~~-~~q~~~ 695 (714) T protein:vir:10 617 AQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQN-MEQEQD 695 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-hhhhHH Confidence 1111111111 111111111111111111111111000000 000000000000 0000000000 000000 Q ss_pred HHHHHHHHhhccccCCCch Q lcl|NC_019408. 580 DNAKKQTANAAKVAAQPPA 598 (612) Q Consensus 580 ~~~~k~~~~~a~~~~~~~~ 598 (612) ..+++..+..++...+=+- T Consensus 696 ~~~q~~~q~~~~~~~~~~~ 714 (714) T protein:vir:10 696 VLQQQMLYTLQQRMNEMSL 714 (714) T ss_pred HHHHHHHHHHHHHHHhcCC Confidence 0000000000111111111 No 100 >protein:vir:95449 Length: 584 # NCBI annotation: hypothetical protein ORF047 # Family: family:all:1548 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294640;genbank:gi:149408206;genbank:GeneID:5237016 Probab=98.55 E-value=9.1e-08 Score=59.17 Aligned_cols=503 Identities=12% Similarity=0.097 Sum_probs=203.3 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcC-hHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchh----hcCCc Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAG-QREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMV----FRRDP 75 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G-~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~v----f~k~p 75 (612) |.+|-.=...+..|+...+...= ...|++.. .|+-.+.-.....-+...+..+|.|.+..+++.++..+ |-.+- T Consensus 12 ~~~~~~~~~v~~~~~~~~~~r~~~~~~w~el~-~y~~a~~~~~~~~~~~~~r~~~~~~k~~~~~~~i~~~l~~~~Fp~~~ 90 (584) T protein:vir:95 12 LVRDSSAQWVAYLWDRFNNQRRQKIEEWKELR-NYVFATDTTTTSNQGLPWKNSTTLPKLCQIRDNLHSNYFSSLFPNDD 90 (584) T ss_pred ccccchHHHHHHHHHHHHhhhchhhccCHHHH-HHHHhhhhhhhhhcccccccccchhHHHHHHHHHHHHHHHhhcCccc Confidence 66777766777777776665432 22233221 12212222222222233355678888887776655544 43332 Q ss_pred eee---cCC--------HHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcc----hh---hhhccCceEEE Q lcl|NC_019408. 
76 IVK---NLP--------PKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDN----PR---KGAVATSFAVG 137 (612) Q Consensus 76 ~~~---~~p--------~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a----~~---~~~~~rPy~~~ 137 (612) -++ .+| ..++.+..|==-+ .++..-++.++..++.+|-|.+=|.+-.. .+ .-.-.+|.+.. T Consensus 91 w~~~v~~~~~~~~~~~~~ai~~~i~dkl~e-~~~~~~~~~~i~d~~~~G~~~~k~~~~~~~~e~~e~~~v~~~~~prier 169 (584) T protein:vir:95 91 WLRWVGYGKGDSTKTKAKAIQAYMSNKCRE-SHFRTEVSKLIYDYIDYGNAFATVSFEAKYKEMTDGTLVPDYIGPRLVR 169 (584) T ss_pred eeeeecCCCchhhHHHHHHHHHHHhhhhhh-ccHHHHHHHHHHhhccCCceEEEEeEeecceeeeccccccccccceEEe Confidence 111 112 1233333221112 27888899999999999988877765321 11 01123799999 Q ss_pred echhhhhcchhhhccCCccceeEEEEEEEe--ecccccc---CCCcccccceeeeeeE--eeecccccccceeec-cccc Q lcl|NC_019408. 138 YSAENILDWDEVVDMGGFYVPSRVLLREFV--RDLRWKS---DIEPLTTAQARKARAA--ALASGSASSPMVRQT-ARTL 209 (612) Q Consensus 138 ~~ae~IinW~~~~~v~g~~~Lt~v~l~E~v--~~~~~~~---~~d~f~~~~~~q~r~l--~l~~g~~~~~~~~~~-~~~~ 209 (612) ++|.+|+ |+... ++...-..|+ |..+ .+..-.. ....+....+.....- .+.+.+.. -+... ...+ T Consensus 170 iSP~d~~-~Dpsa--~~i~d~~fiv-rs~~T~~~L~~l~~~~~~~~y~~d~v~~~~~~~~~~~~~~~~--~~~~~~~~~~ 243 (584) T protein:vir:95 170 ISPLDIV-FNPLA--TSISDTFKIV-RSVKTKGELMRLAQDEPEQSYWLEALKRREEICRHLGGYSVE--DFDKAAGFDV 243 (584) T ss_pred eChhhee-ecCCC--CCccchhhhh-hhhhhHHHHHHHHhhcCccccchHHHHHHHHhccCCCCCccc--cccccccccc Confidence 9999998 76433 2222222222 2111 0000000 0000111000000000 00000000 00000 0011 Q ss_pred ccccceeeee----ee-ecccccc---ccccccceeEEEEEeeCCCceecceeee-ccCCccccceeEEEeec-CCCCCC Q lcl|NC_019408. 210 GGYSYITVYR----EL-KLEEIEW---PSGEVKLAYVQYLYEEDPESRPIARIVP-TVRGEPLDFIPFKFFGA-SGNTAD 279 (612) Q Consensus 210 ~g~~~~~~~R----~~-~~~~~~~---~~g~~~~~~~~~~~~~~~~~~~~~~~~p-~~~g~~l~~IP~v~~~~-~~~~~~ 279 (612) .|++...-|. |. ..-|+.+ .+++.+..+...++. + ...+- ..+-.|.+.+||+.++- ...+.. 
T Consensus 244 d~~~~~~ey~~~~~V~vl~~~g~~~~~~~~e~~~~~iv~v~~--g-----~~iIR~~~np~~~~~~PF~~~~~~p~~~s~ 316 (584) T protein:vir:95 244 DGFGNLYEYYMSDWVEILEFYGDYHDKETGELQTNRIITVVD--R-----STEVRNESIPTWFGSAPIYHVGWRFRPDNL 316 (584) T ss_pred ccccccccccCCceeEEEeecccccccccCCCcccceEEEEe--c-----cEEEEeeecCCCCCCCCEEEEcceeeeccc Confidence 1111111000 00 0011211 112222222222221 0 11110 12234568899997643 344455 Q ss_pred cCcCchHHHHHHHH---HHHhhhHHHHHHHHHhccceeeeecCCCCCCceEEEeccccccCCCCCceeEEecCchhHHHH Q lcl|NC_019408. 280 VEKPPLLDICDLNL---SHYRTYAELEYGRLFTALPVYYAPGTDSEGTGEYHIGPNMVWEVPQGSEPGILEYTGQGLKAL 356 (612) Q Consensus 280 ~~~pPLldLA~lnl---~HY~~~sD~~~~l~~~~~P~l~i~G~~~~~~~~l~iG~~~~~~lp~~~~~~~lE~~g~~l~~~ 356 (612) .|.+++.-|.++.- ...|.-- ++ +.....|++...+- ...+.-||+..|.....+...|++|+.+.+... T Consensus 317 yG~gi~~ll~d~Q~~lna~~r~~i--Dn-l~l~~~pv~k~~~~----~~~~~~~pg~~~~~~~~~~~q~~~p~a~~~~s~ 389 (584) T protein:vir:95 317 WAMGPLDNLVGMQYRIDHLENAKA--DA-VDLIIQPPLKIIGE----VEEFVWGPGAEIHLDQGGDVQEIAKNVNYIINA 389 (584) T ss_pred cCCCchhhhhhHHHHHhHHHHHHH--HH-HHHhcCcceeeccc----cchhcccCCceeecCCCCCcceecCchhhhhHH Confidence 67777665555432 1233333 33 45556676665532 223456888888887777889999987666666 Q ss_pred HHHHHHHHHHHHH-HHHHhhh-ccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHcCCcCCCCcceE Q lcl|NC_019408. 357 ETALNDKERQIAA-IGGRMMP-GASKSVSESNNQTVLREANEQSLLLNIIQACESGM-TDVVRWWLMWRDVPLADTENLR 433 (612) Q Consensus 357 ~~~l~~~e~qm~~-lGa~ll~-~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~-~~~l~~~a~w~g~~~~~~~~~~ 433 (612) ...|.-++..|-. .|+.... 
.....+++||+..+.=-...+..+..++...++.+ ++.+.++-+|-.......+.++ T Consensus 390 ~~~lq~~e~~me~~sGvp~~~~G~~~~~~~TAtg~s~l~naa~~~~r~~~~~f~~~ll~~l~~ll~~~~~~nmd~~~~vr 469 (584) T protein:vir:95 390 DNQIQMLEDRMELYAGAPREAMGIRTPGEKTAFEVQQLGNAAGRIFQEKVTTFEVELLEPVLNAMLETATRNMDGSDVIR 469 (584) T ss_pred HHHHHHHHHHHHhhhCCChhhcccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCcee Confidence 6677777788754 3543321 11234466666665555556667777777777766 6655555444211111122333 Q ss_pred EEe----------------eccccccCCCH-------HHHHHHHHHHHcCCCCHHHHHHHHHhcCccchhhhhHHHHHHh Q lcl|NC_019408. 434 YEV----------------NTDFLSTPIGA-------REMRAIQLMANDGLLPDPVFYEYMRKAEVISSDMTFEEFQALR 490 (612) Q Consensus 434 v~l----------------n~dF~~~~~d~-------~~~~al~~~~~~G~is~et~~~~lqr~~vl~~~~~~eee~~ri 490 (612) +.. ..||......+ +..+.++...++ .+ +-.+.+.....+...-+ T Consensus 470 ~~n~e~~~~~f~~i~r~Dl~g~~~~va~Ga~~~~~keq~~q~l~~ilq~-~~-----------~~~i~p~~~~~~l~~~l 537 (584) T protein:vir:95 470 VMDTDLGVKEFMSVTREDITANGKIRPIGARHFGKQAQDLQNLVGIFNS-QI-----------GQMILPHTSGKALATFV 537 (584) T ss_pred eeccccccccccccChhhhccCeeEEeehhhHHHHHHHHHHHHHHHHHh-hh-----------hhhccccchHHHHHHHH Confidence 321 11222222111 111112222111 11 11122222212211212 Q ss_pred hccccccccchhHHhhhhhhHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 491 ADENSFINNPDAQARQRGYTNRGQELEQSRMAREADFTQQKIDIQERSVAVQEGHAEVAHAAGSTSISGSRKLGD 565 (612) Q Consensus 491 a~e~~~~~~~~~~~~~~~e~~r~~~~e~~r~~~e~e~~~q~~e~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~ 565 (612) ++......- .........++|. +.|++.+ ++| +.... +.+..+++- . 
T Consensus 538 adl~~~p~~--~~~~~~~~~~~Q~-~~q~~~~-------------------~~q--~~~~~--~~~~~~~~~--~ 584 (584) T protein:vir:95 538 DDVTGLQGY--EIFRPNVAVAEQA-ETQSLVA-------------------QAQ--EDLQL--QAQMPAEGA--I 584 (584) T ss_pred HHHhCCCcc--cccCCCcccchhH-HHHhhhH-------------------HHH--HHHHH--HHhhhhccC--C Confidence 211100000 0011111111111 1111110 001 00000 000011110 0 No 101 >protein:vir:94599 Length: 641 # NCBI annotation: PfWMP4_39 # Family: family:all:1548 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762669;genbank:gi:115304377;genbank:GeneID:5142299 Probab=98.38 E-value=9.5e-07 Score=53.60 Aligned_cols=526 Identities=11% Similarity=-0.004 Sum_probs=182.0 Q ss_pred CCCcHHHHHHHH--------------HHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHh Q lcl|NC_019408. 1 MVTHPEYQYWRP--------------EWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGM 66 (612) Q Consensus 1 ~~~hP~y~~~~~--------------~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~ 66 (612) +.+|---...+. +|+-+.+.|.+....+ .-++|+...-....-..+ ...+|.+-...+++.| T Consensus 20 ~~~~~~~~~l~~~~~~~~~~R~~~e~~W~e~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~-r~ki~~~~~~~~~~~l 95 (641) T protein:vir:94 20 LSTDRIGGVVISKWQESRDKRNTVENNWDETYELYRASAIDR---QNTRARNFQTTGADDADW-RHRINTGHTFEVVETL 95 (641) T ss_pred CCchhHHHHHHHHHHHHHHhhcchHHHHHHHHHHhhcchhhh---hhcccccccccccchhcc-cccccchhHHHHHHHH Confidence 444443333333 4443333222221111 112343322111111111 2246666666666655 Q ss_pred hchhhc----CCceeecCC---H------HHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcc-----hhh- Q lcl|NC_019408. 67 TGMVFR----RDPIVKNLP---P------KFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDN-----PRK- 127 (612) Q Consensus 67 ~G~vf~----k~p~~~~~p---~------~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a-----~~~- 127 (612) +..+++ .++-++-.| + .+..++.+.-.++ ++...+..++++++.+|.+.+-|++-.. .+. 
T Consensus 96 ~s~Lm~~~~p~~~wf~~~p~~~ed~~~A~~~~~~~~~~l~~~-~~~~~~~~~~~d~~~~g~~iv~~~w~~~~~~~~~~~~ 174 (641) T protein:vir:94 96 VAYFKGATFPSDDWFDLKGMVPELADAARVVKQLTKTKLEAA-SIRDIFETYVRNLVLYGVSTYRLGWDTSMERQFKRTF 174 (641) T ss_pred hhHHhhhhcCCCceEEEecCCCChHHHHHHHHHHHHHHHhhc-chHHHHHHHHHHHhhcCceEEEeehhhHHHHhhhhhc Confidence 544433 222222112 1 1223333332233 3344446999999999988887775321 000 Q ss_pred -----hhc-----------cCceEEEechhhhhcchhhhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEe Q lcl|NC_019408. 128 -----GAV-----------ATSFAVGYSAENILDWDEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAA 191 (612) Q Consensus 128 -----~~~-----------~rPy~~~~~ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~ 191 (612) -.+ ..+.+..+++.+|. |+-..+ ...-+.+++|++..+.. .| T Consensus 175 ~~~~~~~~~~~~~~v~~~~~~~r~~~v~~~di~-~dps~~---~~~~~f~~~r~t~~t~~-----------------~l- 232 (641) T protein:vir:94 175 VETGDIFGGWEDVAVNRQRSELRIEPLSPYDVW-LDTSGG---KNTGTFVRLRHTREELH-----------------EL- 232 (641) T ss_pred ccchhhcccccccceecccceeeEEecchhhee-ecCCCC---cccccceehhhhHHHHH-----------------HH- Confidence 000 11222233333332 111111 11111222332221100 00 Q ss_pred eecccccccceeeccc------------ccccccceeeeeeeeccccccccccccceeEEEEEeeCCCceecceeeeccC Q lcl|NC_019408. 192 LASGSASSPMVRQTAR------------TLGGYSYITVYRELKLEEIEWPSGEVKLAYVQYLYEEDPESRPIARIVPTVR 259 (612) Q Consensus 192 l~~g~~~~~~~~~~~~------------~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~p~~~ 259 (612) ...|.++...+..... ...+. ....|++...-...+.+| .-.+.++...+ +..++... T Consensus 233 ~~eg~~~~d~v~~~~~~~~~~~~~d~~~d~~~~-~~~~~~~~e~~gd~~~d~----~~~~~~~~~~~-----g~~il~~~ 302 (641) T protein:vir:94 233 VTSGYYDLDLTQVEQYVDYKFADPDTPKDVNGT-DTSGWDIIEYYGPLLVEG----VQFWCVHAVFY-----GKQLIRLS 302 (641) T ss_pred HhcCCCChhhcchhhcccccccccccccccccc-cccccceeeeeeeeccCC----CceeeEEEEEe-----CCEEeecc Confidence 0001111100000000 00000 011122110000000111 11112222111 12334444 Q ss_pred Cc-cccceeEEEeecC-CCCCCcCcCchHHHHHHHHHH-HhhhHHHHHHHHHhccceeee-ecCCCCCCceEEEeccccc Q lcl|NC_019408. 
260 GE-PLDFIPFKFFGAS-GNTADVEKPPLLDICDLNLSH-YRTYAELEYGRLFTALPVYYA-PGTDSEGTGEYHIGPNMVW 335 (612) Q Consensus 260 g~-~l~~IP~v~~~~~-~~~~~~~~pPLldLA~lnl~H-Y~~~sD~~~~l~~~~~P~l~i-~G~~~~~~~~l~iG~~~~~ 335 (612) |+ +++..||+++... ..+...|.+|..++...-... =-..+-++ .++.+..|.+.+ .+... ....+..+|+..+ T Consensus 303 ~~~~~d~~Pf~~~r~~~~~~~~YG~gp~~~~l~dqk~ln~l~r~~ld-~~~~~~~p~~~~~~~~~~-~~~~l~~~PG~ii 380 (641) T protein:vir:94 303 DSKYWCGSPFVTTTLLPDRDSVYGMSVLHPNLGALHVLNVLTNGRLD-NLVLHINKMWTLVEDGIL-KREDVKAKPGAVF 380 (641) T ss_pred cccccCcCCeEEecceecCCcccCCChHHHHHHHHHHHHHHHHHHHH-HHHHHhCCeeeecccccc-ccceeeccCCcce Confidence 44 4667788866544 334455677754332221111 12222233 345555666544 22211 2345888999988 Q ss_pred cCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHH-HHHhh-hcccc--chhHHHHHHHHHHHHHHHHHHHHHHHHHH-- Q lcl|NC_019408. 336 EVPQGSEPGILEYTGQGLKALETALNDKERQIAAI-GGRMM-PGASK--SVSESNNQTVLREANEQSLLLNIIQACES-- 409 (612) Q Consensus 336 ~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~l-Ga~ll-~~~~~--~~~esa~~~~~~~~~~~s~L~~~a~~~~~-- 409 (612) .....+..+++.+....+......++.++..++.. +...+ ..... ....||+..+.........|..+++++++ T Consensus 381 ~~~~~~~v~pl~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~TAtEV~~~~~e~~~~l~~i~r~l~~e~ 460 (641) T protein:vir:94 381 KVAQHGSLQPIDMGRQDFVVTYQEAQVQESSVYRNTSTGPLIGNAAPRGGERVTAAEIQGVRDAGGNRLSSVHTHIEDSS 460 (641) T ss_pred eeCCCCcceeecCCccccchhHHHHHHHHHHHHHhhhhhhhhcccccccchhccHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 77766778888665444555566666666666533 22222 11111 11247888877777777778888888875 Q ss_pred ---HHHHHHHHHHHHc----------------CCcCCCCcceEEEeeccccccCC-----CHHHHHHHHHHHHcCCCCHH Q lcl|NC_019408. 410 ---GMTDVVRWWLMWR----------------DVPLADTENLRYEVNTDFLSTPI-----GAREMRAIQLMANDGLLPDP 465 (612) Q Consensus 410 ---a~~~~l~~~a~w~----------------g~~~~~~~~~~v~ln~dF~~~~~-----d~~~~~al~~~~~~G~is~e 465 (612) .++.++.++.+.. |......+++++ +-++.+... .++.++.|+.+++....- . 
T Consensus 461 l~pll~~~~~~~~~~~~~p~i~R~~~~~~~~~~~~~~~p~~L~~--~~~iv~l~~~q~~~~~~~i~~l~~~~~~~a~~-P 537 (641) T protein:vir:94 461 TLPLLNKVFSLLQQFYVTPETIRMYVPEEQMDGFFEVSPEYLHY--PYKFLALGANYVVERERMVTDLLQLLDISGRV-P 537 (641) T ss_pred HHHHHHHHHHHHHHhccchhhhhhhchhhhcccCCCCCccceee--eeeEeecchhHHHHHHHHHHHHHHHHHHhhcC-h Confidence 3344444443321 100011222222 212221111 111233333333321110 0 Q ss_pred HHHHHHHhcCccchhhhhHHHHHHhhccccccccchhHHhhhhhhHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 466 VFYEYMRKAEVISSDMTFEEFQALRADENSFINNPDAQARQRGYTNRGQELEQSRMAREADFTQQKIDIQERSVAVQEGH 545 (612) Q Consensus 466 t~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~e~~r~~~e~e~~~q~~e~~~r~~~~~~~r 545 (612) . +.+..++......+.+. -..+.+...-+.+.. +. +.+..++++++++-.+ +++.-.+ T Consensus 538 ~----------v~d~~d~~~~~~~~~~~-~g~~~p~~~ir~~~~---~~---~~~~~~~~~~q~~~~~----~a~~~~~- 595 (641) T protein:vir:94 538 Q----------IGQSLDYALILEDLLRQ-MRFTDPMRYIKKAEA---PP---AAPPIAPAEPGALPPE----MMNSVGG- 595 (641) T ss_pred h----------hhhcCCHHHHHHHHHHH-hCCCCchhhccCccC---ch---hHHHHHHHHHHHHHHH----HHHHHHh- Confidence 0 01111222212222211 112222221111100 00 0001011111100000 0000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhhccccCCCchhhcC Q lcl|NC_019408. 546 AEVAHAAGSTSISGSRKLGDPEQAKPAVADQATI-DNAKKQTANAAKVAAQPPAPAAP 602 (612) Q Consensus 546 ~~~e~~~~~~~~~~~r~~~~e~q~k~~~~eq~~~-~~~~k~~~~~a~~~~~~~~~~~~ 602 (612) ....+..+.+. .++.+ +..++... .....+++.+ .+.|...+.+. 
T Consensus 596 -------~~~~~a~~~~~--~~~~~-~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~ 641 (641) T protein:vir:94 596 -------GLNDQAIAGMT--PEDVS-DLASRIGIDTSDVAPEAMA--AATQQITSGAL 641 (641) T ss_pred -------hhHHHHHHHhh--HHHHH-HHHHhhcCCchhhhHHHHh--cccccccccCC Confidence 01111111111 11111 11111111 1122233322 34444444444 No 102 >protein:vir:3520 Length: 720 # NCBI annotation: P19 # Family: family:all:487 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050980;genbank:gi:9633566;genbank:GeneID:1262313 Probab=97.83 E-value=1.6e-05 Score=46.83 Aligned_cols=576 Identities=11% Similarity=0.021 Sum_probs=169.4 Q ss_pred CCCcHHHHHHH-HHHHHHHHHhcChHHHHhcc---cccC----CCCCCCCHHHHHHHH---hh-ccCCchHHHHHHHhhc Q lcl|NC_019408. 1 MVTHPEYQYWR-PEWTKLRDVMAGQREIKRKA---EAYL----PAMKGADGDDYAIYL---QR-ATFFNMLAQTRDGMTG 68 (612) Q Consensus 1 ~~~hP~y~~~~-~~W~~i~d~~~G~~~vr~~g---~~YL----Pk~~~e~~~~Y~~rl---~r-A~~~n~~~~tv~~~~G 68 (612) |=- .....+ ..+..++.+......+|... ..|- -+|+.+.....+.-| .+ .+-+|.++.+|+..+| T Consensus 1 ma~--~~~~~l~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~~~~~l~~~~~P~~~~N~i~~~v~~v~g 78 (720) T protein:vir:35 1 MAE--TLQKRHEQIMRKFDRAHSPQEAVREKCLEATRFARVPGGQWEGATAAGSELGKHFEKYPKFEINKISTELNRIIS 78 (720) T ss_pred Cch--HHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhhccCCCCCCHHHHHHHHHHHhhCCCCeEEEccHHHHHHHHHh Confidence 000 000000 11122333333333443211 1121 255555544222112 22 2557999999999999 Q ss_pred hhhcCCceeecCCH----------HHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEE--ecCcchhhhhcc-CceE Q lcl|NC_019408. 69 MVFRRDPIVKNLPP----------KFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLV--DVVDNPRKGAVA-TSFA 135 (612) Q Consensus 69 ~vf~k~p~~~~~p~----------~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlV--D~p~a~~~~~~~-rPy~ 135 (612) .-=+..|.+.-.|. .|..++..+-- -++.+.-+..+|..++.+|.+|+=| ||...++..... 
+..+ T Consensus 79 ~~~~nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~-~~~~~~~~s~Af~~~i~~G~G~~~v~~d~~~~~d~~~~~~~i~i 157 (720) T protein:vir:35 79 EYRHNRITVKFRPGDKTASEALANKLNGLFRADYE-ETDGGEACDNAFDDGSTGGFGCFRLTTNLVNALDPMDERQRICL 157 (720) T ss_pred HHHhCCCceEEEcCCCcchHHHHHHHHHHHHHHHH-hcCchHHHhHHHHHhhhccceeEEeeecccccCCCCcccceeeE Confidence 98776666632231 23333322222 2345566899999999999888755 553322111110 0000 Q ss_pred E-Eec-hhhh-hcchhhhccCCccceeEEEEEEEeecc----ccc-------------cCCCcccccceeeeeeEeeecc Q lcl|NC_019408. 136 V-GYS-AENI-LDWDEVVDMGGFYVPSRVLLREFVRDL----RWK-------------SDIEPLTTAQARKARAAALASG 195 (612) Q Consensus 136 ~-~~~-ae~I-inW~~~~~v~g~~~Lt~v~l~E~v~~~----~~~-------------~~~d~f~~~~~~q~r~l~l~~g 195 (612) - +|. +.+| +||.... .+.. .--|+..+..+... .|. ...|.|....+. +... T Consensus 158 ~~v~~~~~~v~~Dp~a~~-~D~s-Dar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~~d~~~~~~v~------i~E~ 229 (720) T protein:vir:35 158 EPIYDPARSVWFDPDAKK-YDKS-DAEWAFCMYSLSAEKYKAEYNKDPATLMSGIERSWDYDWYDVDVVY------IAKY 229 (720) T ss_pred ecccCchhheeecccccc-cChh-hhhhhhhhcCCCHHHHHHhCCCccccccccccccccccccCCCceE------EEEe Confidence 0 011 1122 3332211 1110 01111111110000 000 001111111111 0000 Q ss_pred cccccceeeccccccccc-ceeeeeeeeccccc----------cccccccceeEEEEEeeCCCceecceeeeccCCcccc Q lcl|NC_019408. 196 SASSPMVRQTARTLGGYS-YITVYRELKLEEIE----------WPSGEVKLAYVQYLYEEDPESRPIARIVPTVRGEPLD 264 (612) Q Consensus 196 ~~~~~~~~~~~~~~~g~~-~~~~~R~~~~~~~~----------~~~g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g~~l~ 264 (612) -+..++.........+.. .+..|.....+... .....+.... +..+..+|.... ... 
.--|.+ T Consensus 230 ~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~-v~~~~~~g~~~l-~~~----~~~p~~ 303 (720) T protein:vir:35 230 YEVKKESVDVVSFQNPLTSETVTYDSDQLELVEDELADIGFIEAARRTIKRRR-VYVSVVDGEGFL-EKA----QRIPGE 303 (720) T ss_pred eEEEEEEEEEEEeecCCCCCeeecCCccHHHHHHHHhhhccccccccceeEEE-EEEEeeccchhc-ccC----CCCCCC Confidence 000000000000000000 01111111110000 0000011111 111111111100 000 011345 Q ss_pred ceeEEEeecCC----CC---CCcCcCchHHHH-HHHHHHHhhhHHHHHHHHHhccceeeeecCC-------CCCCce--- Q lcl|NC_019408. 265 FIPFKFFGASG----NT---ADVEKPPLLDIC-DLNLSHYRTYAELEYGRLFTALPVYYAPGTD-------SEGTGE--- 326 (612) Q Consensus 265 ~IP~v~~~~~~----~~---~~~~~pPLldLA-~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~~-------~~~~~~--- 326 (612) .||||+|.... +. ++.-. ++.|.= .+| |+.|. +-+++ +..|+..-.|.. .+|... T Consensus 304 ~fP~vP~~g~r~~~d~~~~~~G~vr-~~kd~Q~~~N---~~~s~-~~~~~--~~~~~~~~~~a~~~~~~~~~~~a~~~~~ 376 (720) T protein:vir:35 304 HIPLIPVYGKRWFIDDIERVEGHIA-KAMDAQRLYN---LQVSM-LADSA--TQDTGSIPIVGKSQIKTLEKYWANRNKN 376 (720) T ss_pred ccceEEEEeeeeccCCCcccceeee-cchhHHHHHH---HHHHH-HHHHH--HcCCccccccCcchHHHHHHHhhccccc Confidence 56666543211 11 11111 122211 112 12221 22232 233332222211 111111 Q ss_pred ----E---EEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHH-HHH--hhhccccchhHHHHHHHHHHHHH Q lcl|NC_019408. 327 ----Y---HIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAI-GGR--MMPGASKSVSESNNQTVLREANE 396 (612) Q Consensus 327 ----l---~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~l-Ga~--ll~~~~~~~~esa~~~~~~~~~~ 396 (612) + .++..+....+..+.+++.++..-+ ......|+.-...|... |.. ++-.. ++.||.+...+..+. 
T Consensus 377 ~~~~l~~~~~~~~~G~~~~~~~~~~~~~~~~~~-~~~~~llq~~~~~i~~vsGi~~~~lG~~---sn~SG~Ai~~rq~qg 452 (720) T protein:vir:35 377 RPAFLPLNEIVDKQGNIIAPPTPVGYTQPQPLN-QAMAALLQQTGADIQEVTGSSQAMQPMP---SNIAKETVNHLMHRS 452 (720) T ss_pred cccccccccccccCcccccCCCcccccCCCCCc-hHHHHHHHHHHHHHHHHhCCChHHcCcc---cchHHHHHHHHHHHH Confidence 0 1222222222223456666654322 23344444444444333 332 23222 234676666666666 Q ss_pred HHHHHHHHHHHHHHHH----HHHHHHHHHcCCc------CCCCcceEEEeec--------------c-------ccc--- Q lcl|NC_019408. 397 QSLLLNIIQACESGMT----DVVRWWLMWRDVP------LADTENLRYEVNT--------------D-------FLS--- 442 (612) Q Consensus 397 ~s~L~~~a~~~~~a~~----~~l~~~a~w~g~~------~~~~~~~~v~ln~--------------d-------F~~--- 442 (612) .-.|..+-.|+..+.. .+|.++..|++.. ..++..-.+.+|. | +.. T Consensus 453 ~~~~~~~~Dnl~~~~~~~g~~lL~lI~~~y~~er~~RI~~ed~~~~~v~~n~~~~d~~~g~~v~~NDi~~g~yDv~v~~~ 532 (720) T protein:vir:35 453 DMSSFIYLDNMAKSLKRAGEVWLSMAREVYGSDRQVRIVNADGTDDIALMSVVINDNQTGQVVAMNDLSSGRYDVTVDVG 532 (720) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceEeechhhhccCCCceeeeecceeeeeEEEEecc Confidence 6667777777776665 4667777776411 0011111111110 1 111 Q ss_pred ---cCCCHHHHHHHHHHHHcCCCCHHH----HHHH-HHhcCccchhhhhHHHHHHhhccccccccchh-----HHhhhhh Q lcl|NC_019408. 443 ---TPIGAREMRAIQLMANDGLLPDPV----FYEY-MRKAEVISSDMTFEEFQALRADENSFINNPDA-----QARQRGY 509 (612) Q Consensus 443 ---~~~d~~~~~al~~~~~~G~is~et----~~~~-lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~-----~~~~~~e 509 (612) .....+.+.+++++..+ ..+... +... +.-..+. ..++..+++....+.....++ +...... 
T Consensus 533 p~~~s~req~~~~m~qll~~-~~p~~~~~~~~~~~ile~~d~p----~~~e~~erirk~~~~~~~~~~~~~e~qq~~a~~ 607 (720) T protein:vir:35 533 PSYTARRDATVSVLTNLLAG-MLPQDPMRQVLQGIILDNMEGE----GLDEFKEYNRKQLLTQGVVKPRNTEEEQMVAQM 607 (720) T ss_pred cCcccHHHHHHHHHHHHHHh-cCCCchhHHHHHHHHHHhcCch----hHHHHHHHHHhhcchhcccCccChhHHHHHHHH Confidence 11123345555555542 111111 1111 1111111 112333444333221111111 1100000 Q ss_pred hHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHH-------H Q lcl|NC_019408. 510 TNRGQELEQSRMAREADFTQQKIDIQERSVAVQEGHAEVAHAAGSTSISGSRKLGDPEQAKPA------VAD-------Q 576 (612) Q Consensus 510 ~~r~~~~e~~r~~~e~e~~~q~~e~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~e~q~k~~------~~e-------q 576 (612) .+..+....+..+..+++++..++.++-+++....+.++.+++.+++.+..+...+-.|...+ ++. + T Consensus 608 qq~~qq~~~e~~~aqa~l~qaqae~~kaqa~~~~~qa~a~~aqa~a~~~~a~~~~~~aq~~~~~q~~i~qalq~~~~~q~ 687 (720) T protein:vir:35 608 IQQAQQPNAELVAAQGVLMQGQAEVQKAKNEELAIQVKAFQAQTEARVAEAKMVQILASADSAKRAEIREALKMLHQFQK 687 (720) T ss_pred HHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 000000001111111222222222221111111111111121111111111111000000000 000 0 Q ss_pred HHHHHHHHHHHhhccccCCCchhhcCCCCCccc Q lcl|NC_019408. 
577 ATIDNAKKQTANAAKVAAQPPAPAAPGAPPTNR 609 (612) Q Consensus 577 ~~~~~~~k~~~~~a~~~~~~~~~~~~~~~~~~~ 609 (612) ++.+.++-.+++..+.....+....-...-.+= T Consensus 688 ~q~~~eqa~~el~~~~~~~~~~~~~~~~~~~~~ 720 (720) T protein:vir:35 688 EQGDASRADAELILKATDTQHKQNRDAAKNHSI 720 (720) T ss_pred hcchHHHHHHHHhhcccchhhhhhHHHhhccCC Confidence 111111111111111111111111100000000 No 103 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=97.50 E-value=5.5e-05 Score=43.92 Aligned_cols=422 Identities=15% Similarity=0.116 Sum_probs=179.5 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCH--HHHHHHH-hhc--c--CCchHHH----HHHHhhch Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADG--DDYAIYL-QRA--T--FFNMLAQ----TRDGMTGM 69 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~--~~Y~~rl-~rA--~--~~n~~~~----tv~~~~G~ 69 (612) =..-|...+...........|.|...-+.. .+.|..-.-+. ..+...| .|| . =.++... .++..+|- T Consensus 9 ~~~sP~~~~~R~~ar~~~~~y~aa~~~r~~--~~~~~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~nvVG~ 86 (502) T protein:vir:79 9 GVFSPGWKAARLRSRAVIQAYEAVKTTRTH--KARRENRTADQLSQYGAVSLREQARYLDNNHDLVIGVFDKLEERVVGK 86 (502) T ss_pred hhcChHHHHHHHhhHHHHhhccccCccccc--CCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhccC Confidence 224577777777777777777776543322 23332211111 1121111 111 1 2234444 44455553 Q ss_pred -hhcCCceee--c------CCH----HHHHHHhccCCCCC-CHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccC--c Q lcl|NC_019408. 70 -VFRRDPIVK--N------LPP----KFKDAVRRFAKDGS-SHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVAT--S 133 (612) Q Consensus 70 -vf~k~p~~~--~------~p~----~l~~~~~d~D~~G~-~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~r--P 133 (612) -+.-.|... + +-. ....|.++||-.|. +++.+.+.+++..+..|=|++..-+...... .... 
| T Consensus 87 ggi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~~~~~~-~~g~~~~ 165 (502) T protein:vir:79 87 NGIIVEPHPVLRNGAIARDLAAEIRTRWSEWSVSPEVTGQFTRPMLERLMLRTWLRDGEVFAQMVSGRINSL-TPSAGVH 165 (502) T ss_pred CceeeeeccCCCChhHHHHHHHHHHHHHHHhhcCcCccccCCHHHHHHHHHHHHHhCCceEEEEeecccCcc-CCCcccc Confidence 222222221 1 111 34556789999986 8999999999999999999988755322100 0111 1 Q ss_pred -eEEEechhhhhcchhhhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccc Q lcl|NC_019408. 134 -FAVGYSAENILDWDEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGY 212 (612) Q Consensus 134 -y~~~~~ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~ 212 (612) -|-+|.|+.|=+-. -+|.. +.+-. .+ ...|.++ T Consensus 166 l~lq~iepd~l~~~~----~~~~~---------------------------i~~GV--e~--d~~Gr~~----------- 199 (502) T protein:vir:79 166 FWLEALEPDFIPMTS----DESNR---------------------------LNQGV--FV--DDWGRPE----------- 199 (502) T ss_pred eEEEEecchhcCCCC----CCCCe---------------------------eEeee--EE--CCCCceE----------- Confidence 13344444432210 01100 00000 00 0001111 Q ss_pred cceeeeeeeeccccccccccccceeEEEEEeeCCCceecceeeeccCCcccccee---EEEe-ecCCCCCCcCcCchHHH Q lcl|NC_019408. 213 SYITVYRELKLEEIEWPSGEVKLAYVQYLYEEDPESRPIARIVPTVRGEPLDFIP---FKFF-GASGNTADVEKPPLLDI 288 (612) Q Consensus 213 ~~~~~~R~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP---~v~~-~~~~~~~~~~~pPLldL 288 (612) -|+ ++...++.. .+..+..|| ++.+ ...+.+-.-|.|.|..+ T Consensus 200 ----aY~---------------------i~~~hPgd~---------~~~~~~rvpA~~vlH~f~~~r~gQ~RGis~lapv 245 (502) T protein:vir:79 200 ----KYL---------------------VYKSRPVSG---------RQMETKEVDAERMLHLKFVRRLHQMRGTSLLSGV 245 (502) T ss_pred ----EEE---------------------EeecCCCCC---------cccceeEechhheEEeecccCCccccCCchHHHH Confidence 111 111111110 011233455 4433 23344445577777654 Q ss_pred HHH--HHHHHhhhHHHHHHHHHhccceeeee-cCCC----------CCCceEEEeccccc-cCCCCCceeEEecCchhHH Q lcl|NC_019408. 
289 CDL--NLSHYRTYAELEYGRLFTALPVYYAP-GTDS----------EGTGEYHIGPNMVW-EVPQGSEPGILEYTGQGLK 354 (612) Q Consensus 289 A~l--nl~HY~~~sD~~~~l~~~~~P~l~i~-G~~~----------~~~~~l~iG~~~~~-~lp~~~~~~~lE~~g~~l~ 354 (612) ... ++..|.. +.+....-.+.+ +.+|+ +... .....+.+++++.+ .|++|-+++|+.++..+- T Consensus 246 l~~l~~l~~~~d-ael~~a~i~A~~-~~fi~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~~L~pGe~i~~~~p~~p~~- 322 (502) T protein:vir:79 246 LIRLSALKEYED-SELTAARIAAAL-GMYIRKGDGQSYEPDGNGSKENERELTIQPGIIYDDLKPGEEIGMVKSDRPNP- 322 (502) T ss_pred HHHHHHHhHHHH-HHHHHHHHhhhh-eeeeecCCCcccccccCCCCCccccccccCCccccccCCCceeeeeCCCCCCC- Confidence 322 3344553 334444444444 44444 2211 11123678898877 589999999999764321 Q ss_pred HHHHHHHHHHHHHHH-HHH--HhhhccccchhHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHcCCc-CCCC Q lcl|NC_019408. 355 ALETALNDKERQIAA-IGG--RMMPGASKSVSESNNQTVLREANEQSLLLN-IIQACESGMTDVVRWWLMWRDVP-LADT 429 (612) Q Consensus 355 ~~~~~l~~~e~qm~~-lGa--~ll~~~~~~~~esa~~~~~~~~~~~s~L~~-~a~~~~~a~~~~l~~~a~w~g~~-~~~~ 429 (612) .....++.+...|.+ +|. .+|...-...=.|+.+..+++-......+. ++..++.-+-..+--++...|.- +++. T Consensus 323 ~~~~f~~~~lr~iaaglGi~ye~lt~D~s~nySs~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~ 402 (502) T protein:vir:79 323 NLETFRNGQLRAVAAGSRLSFSSTARNYNGTYSAQRQELVESTDGYLILQDWFIGAVTRPMYRAWLKQAVASGVIRLPRD 402 (502) T ss_pred CHHHHHHHHHHHHHhhcCCCHHHHhccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCC Confidence 123444444444432 222 122222111123444555555444443333 33333333222222222233421 1110 Q ss_pred cceEEEeeccccccC---CCHH-HHHHHHHHHHcCCCCHHHHHHHHHhcCccchhhhhHHHHHHhhcccc-------ccc Q lcl|NC_019408. 430 ENLRYEVNTDFLSTP---IGAR-EMRAIQLMANDGLLPDPVFYEYMRKAEVISSDMTFEEFQALRADENS-------FIN 498 (612) Q Consensus 430 ~~~~v~ln~dF~~~~---~d~~-~~~al~~~~~~G~is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~-------~~~ 498 (612) ..-.-.++-+|.... +|+. ++++.+.++.+|..|++....+ +|. |++++.+.++.+.. ... 
T Consensus 403 ~~~~~~~~~~W~~p~~~~iDP~Ke~~a~~~~i~~Gl~t~~~~~a~---~G~-----D~~~v~~q~a~e~~~~~~~Gl~~~ 474 (502) T protein:vir:79 403 LDRSSLYTAVYSGPVMPWIDPVKEAEAWKIQIRGGAATESDWVRA---GGR-----NPDDVKRRRKAEIDENRKLDLVFD 474 (502) T ss_pred CCchhhcceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHH---cCC-----CHHHHHHHHHHHHHHHHHcCCCCC Confidence 011111122233332 3454 8999999999999999876654 343 44444444443321 000 Q ss_pred c---------chhHHh-hhhhhHHHHhH Q lcl|NC_019408. 499 N---------PDAQAR-QRGYTNRGQEL 516 (612) Q Consensus 499 ~---------~~~~~~-~~~e~~r~~~~ 516 (612) . ...... ...+...+.|+ T Consensus 475 ~~~~~~~~~~~~~~~~~e~~~~~~~~e~ 502 (502) T protein:vir:79 475 TDPASDKGGSSAATKRQEPQHTDDQSEE 502 (502) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCC Confidence 0 000000 00010000000 No 104 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=97.13 E-value=0.00016 Score=41.44 Aligned_cols=410 Identities=10% Similarity=-0.010 Sum_probs=177.1 Q ss_pred CCCc-----HHHHHHHHHHHHHHHHhcChHHHHh-cccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCC Q lcl|NC_019408. 1 MVTH-----PEYQYWRPEWTKLRDVMAGQREIKR-KAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRD 74 (612) Q Consensus 1 ~~~h-----P~y~~~~~~W~~i~d~~~G~~~vr~-~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~ 74 (612) |-.. +.....+....-+-.-+|....... ....|....----...+..|. ...+.+..|+..++..+++. T Consensus 1 ~~~~~~a~~~~~~~~a~~~~~~~~~~g~~~~~d~~~~~~~~~~~~~~~~~l~~lY~----~~~l~r~iVd~~a~d~~r~g 76 (461) T protein:vir:80 1 MYSIDKAKQAKIDSKIVNRNDFMVGHGKANSRDKLTRQTPGNGQKLDLKACENLYA----SNSIAMNIVDIISEDMVRAG 76 (461) T ss_pred CccchhhhhhhhhhhhhhhhHHHhhcCCcchhhhhhccccCcccccCHHHHHHHHH----hCCccchhhccchHHhhcCC Confidence 1000 0000000001101111111111111 112233211111222334443 33566788999999999999 Q ss_pred ceeecCCHH----HHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchh-hhh----------ccCceEEEec Q lcl|NC_019408. 
75 PIVKNLPPK----FKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPR-KGA----------VATSFAVGYS 139 (612) Q Consensus 75 p~~~~~p~~----l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~-~~~----------~~rPy~~~~~ 139 (612) +.|+...+. +..++++ ..+..-++.+++.+..+|.++|+|..-.... ... +.-.||..|. T Consensus 77 ~~i~~~~~~~~~~~~~~~~~-----l~~~~~l~~~~~~~rl~G~a~i~i~v~d~~~~~~~~~~pl~~~~~~~~~~l~~~~ 151 (461) T protein:vir:80 77 WSLKTDNKEMKKNIESKWRK-----LKTKDRFQKLYADKRLYGDGFLSIGVVSSNREQADLSTAIDPKTIKSIPYINTFN 151 (461) T ss_pred eeeecCCHHHHHHHHHHHHH-----hhHHHHHHHHHHhhcccccEEEEEEeecCCccccCccCCcccccccceeEEEecc Confidence 988543333 4444544 3578889999999999999999986421100 000 0011222221 Q ss_pred hhhhhcchhhhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeee Q lcl|NC_019408. 140 AENILDWDEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYR 219 (612) Q Consensus 140 ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R 219 (612) +.+|..= + ...|+ ..+.|+....|+ T Consensus 152 ~~~i~~~------------------~--------~~~dp-----------------------------~sp~fg~P~~y~ 176 (461) T protein:vir:80 152 TQKVTQL------------------Y--------LNQDM-----------------------------FSEHFGEVEFFE 176 (461) T ss_pred ccccchh------------------h--------hcccC-----------------------------cCcccccceEEE Confidence 1111100 0 00011 112222233333 Q ss_pred eeeccccccccccccceeEEEEEeeCCCceecceeeeccCCccccceeEEEe-ecCCCCCCcCcCchHHHHHHHHHHHhh Q lcl|NC_019408. 220 ELKLEEIEWPSGEVKLAYVQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFF-GASGNTADVEKPPLLDICDLNLSHYRT 298 (612) Q Consensus 220 ~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~-~~~~~~~~~~~pPLldLA~lnl~HY~~ 298 (612) +.......... ..+.. ...+..++.-+++.+ +..-.+...|.| ++..+.=-|..|.. 
T Consensus 177 i~~~~~~~~~~------------~~~~~---------~~~~~~iH~SRii~~~~~~~~~~~~G~S-~le~~~~~l~~~~~ 234 (461) T protein:vir:80 177 VNRVSQLGEEI------------LSGTT---------ASTSEQIHRSRIIHEQGLRFEGETKGRS-IFESLYDIITVMDT 234 (461) T ss_pred Eeccccccccc------------ccccc---------CccceEEccccEEEecCCCCCccccCcc-hHHHHHHHHHHHHH Confidence 32111000000 00000 000112344454443 222122222444 55555556666666 Q ss_pred hHH-HHHHHHHhccceeeeecCCCCCCce---------EEEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHH Q lcl|NC_019408. 299 YAE-LEYGRLFTALPVYYAPGTDSEGTGE---------YHIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIA 368 (612) Q Consensus 299 ~sD-~~~~l~~~~~P~l~i~G~~~~~~~~---------l~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~ 368 (612) ... -.++++...++++.+.|+..-..+. ...+..+.+.+.++.++..+..+-+++. +.++-..++|. T Consensus 235 ~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~~~~~~~~g~~~~d~~e~~e~~~~~lsgl~---~~l~~~~~~ia 311 (461) T protein:vir:80 235 SLWSVGQILYDFAFKVYKTDDIDALNKDDKANLTAMLDFMFRTEALAIIKGDEQLTKESTNVSGMK---DLLDYGWDYLA 311 (461) T ss_pred HHHHHHHHHHHhCCCceecchHHhhhchHHHHHHHHHHHhcCCceEEEEcCCcceEEEecCcCCHH---HHHHHHHHHHh Confidence 554 3568888899998888764322211 2234444555677778888887767664 45555555555 Q ss_pred HHHHHh---hhccccchhHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHH--H-cCCcC-CCCcceEEEeeccc Q lcl|NC_019408. 369 AIGGRM---MPGASKSVSESNNQTVLREANEQSLLLNII-QACESGMTDVVRWWLM--W-RDVPL-ADTENLRYEVNTDF 440 (612) Q Consensus 369 ~lGa~l---l~~~~~~~~esa~~~~~~~~~~~s~L~~~a-~~~~~a~~~~l~~~a~--w-~g~~~-~~~~~~~v~ln~dF 440 (612) ..-.-. |..++....-|+ ..+...=...+.++- ..+...++.+++++.+ | ++..+ ++..+++|..|.=. 
T Consensus 312 a~s~iP~t~L~G~s~g~~asg---e~D~~~yyd~i~~~qe~~l~p~le~l~~~i~~s~~~~~~~~~p~~~~~~i~f~~L~ 388 (461) T protein:vir:80 312 GAVRMPKTVLKGQEAGTLTGA---QYDVMNYYARVSSIQENRLRPQLEYLTRLLMWASDDCGPSIDPDSFEWAIEFNPLW 388 (461) T ss_pred hhhcCCeeeeecccCCccccc---hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccCccccceEEEeCCCC Confidence 432111 212221112121 222222223344443 3456666777766653 3 22211 12346666665322 Q ss_pred cccCCCHHH-----HHHHHHHHHcCCCCHHHHHHHHHhcCccchh-----hhhH-HHHHHhhccccccccchh Q lcl|NC_019408. 441 LSTPIGARE-----MRAIQLMANDGLLPDPVFYEYMRKAEVISSD-----MTFE-EFQALRADENSFINNPDA 502 (612) Q Consensus 441 ~~~~~d~~~-----~~al~~~~~~G~is~et~~~~lqr~~vl~~~-----~~~e-ee~~ria~e~~~~~~~~~ 502 (612) ....-.-.+ .+++..++++|.||-++.+++|..+..+++. .+++ ++..++..+.+..+..+- T Consensus 389 ~~s~kekAe~~~~~a~a~~~~~~~g~is~~e~r~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~g 461 (461) T protein:vir:80 389 NLDSKTDAEVRKLTAEADQIYIVNGVLDPDEVKETRFGRFGLENSSKFSGDSAEIDKLAKLVYDAYAKKNADG 461 (461) T ss_pred CCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHhcCCCCCccCCCCCchhhhhhhhccccccccCCCC Confidence 221111112 3667788999999999999998643322221 1222 111222222221111111 No 105 >protein:vir:10447 Length: 536 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848294;genbank:gi:30387485;genbank:GeneID:1733984 Probab=96.99 E-value=0.00022 Score=40.62 Aligned_cols=471 Identities=10% Similarity=0.056 Sum_probs=181.9 Q ss_pred CCCc-----HHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHH----Hhhchhh Q lcl|NC_019408. 1 MVTH-----PEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRD----GMTGMVF 71 (612) Q Consensus 1 ~~~h-----P~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~----~~~G~vf 71 (612) +... -+-..+.++|+-|.+.+--. +-...+.+.... . 
.-.|-+.-.+.++ .|.+.+| T Consensus 13 ~~~r~~~l~~~R~~~e~~w~e~~~~~lP~----------~~~~~~~~~~~~---~-~~~~dst~~~a~~~Laa~l~~~lt 78 (536) T protein:vir:10 13 AKSVYERLKNDRAPYETRAQNCAQYTIPS----------LFPKDSDNASTD---Y-QTPWQAVGARGLNNLASKLMLALF 78 (536) T ss_pred HHHHHHHHHHHhhHHHHHHHHHHHHhccc----------ccCCCCCccccc---c-cccccccHHHHHHHHHHHHHhhhc Confidence 1110 01122344555555543221 101111111111 1 1123333333443 3444444 Q ss_pred cCCcee-e-cC-----------C---HHHHHHHhccC------CCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhh Q lcl|NC_019408. 72 RRDPIV-K-NL-----------P---PKFKDAVRRFA------KDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGA 129 (612) Q Consensus 72 ~k~p~~-~-~~-----------p---~~l~~~~~d~D------~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~ 129 (612) =-.| + . .+ + ..++.|++.|. ...++.+.-+-.++.+.+.+|-+.+++|-+.. T Consensus 79 P~~~-WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~e~~~----- 152 (536) T protein:vir:10 79 PMQT-WMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEG----- 152 (536) T ss_pred CCCc-ccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEEeeCCC----- Confidence 1112 1 0 00 0 12333443321 12355777788889999999999899885432 Q ss_pred ccCceEEEechhhhhcchhhhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeeccccc Q lcl|NC_019408. 130 VATSFAVGYSAENILDWDEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTL 209 (612) Q Consensus 130 ~~rPy~~~~~ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~ 209 (612) +..-++..|+-.++. + ..|+...+.-|+.++.+.- +.-...|+........ + . 
T Consensus 153 ~~~~~~~~~pl~~~~-v----~~d~~G~vd~i~r~~~~t~---~~l~~~fg~~~~~~~~--------------~-----~ 205 (536) T protein:vir:10 153 SNYNPMKLYRLSSYV-V----QRDAFGNVLQMVTRDQIAF---GALPEDIRKAVEGQGG--------------E-----K 205 (536) T ss_pred CceeeEEEEEcCeEE-E----eeCCCCCeeEEeeeeeccH---HHHHHhhhhhhccccc--------------c-----c Confidence 112245566644432 1 2234444444555554331 1112223221111000 0 0 Q ss_pred ccccceeeeeeeeccccccccccccceeEEEEEeeCCCceecceeeeccCCc----cccceeEEEeecCCCCCCcCcCc- Q lcl|NC_019408. 210 GGYSYITVYRELKLEEIEWPSGEVKLAYVQYLYEEDPESRPIARIVPTVRGE----PLDFIPFKFFGASGNTADVEKPP- 284 (612) Q Consensus 210 ~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g~----~l~~IP~v~~~~~~~~~~~~~pP- 284 (612) .....+++|...... ..++ .+. +|.+.. +..++...|. .+++||+.|.-..+..+ |..| T Consensus 206 ~~~~~v~v~~~V~~~---~~~~----~~~--~~~e~~-----g~~v~~~~g~~~f~~~P~i~~Rw~~~~ge~Y--Grgp~ 269 (536) T protein:vir:10 206 KADETIDVYTHIYLD---EASG----EYL--RYEEVE-----GMEVQGSDGTYPKEACPYIPIRMVRLDGESY--GRSYI 269 (536) T ss_pred CcccceEEEEEEEEe---cCCC----cEE--EEEeec-----CccccccccccccccCCceeeeeeecCCCcc--ccchH Confidence 011122333222111 0011 111 222111 1122222332 35666666765555554 4445 Q ss_pred ---hHHHHHHHHHHHhhhHHHHHHHHHhccceeeeecCCCCCCceEEEeccccccCCCCCceeEEecC-chhHHHHHHHH Q lcl|NC_019408. 285 ---LLDICDLNLSHYRTYAELEYGRLFTALPVYYAPGTDSEGTGEYHIGPNMVWEVPQGSEPGILEYT-GQGLKALETAL 360 (612) Q Consensus 285 ---LldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~~~~~~~~l~iG~~~~~~lp~~~~~~~lE~~-g~~l~~~~~~l 360 (612) |-|+..||. -+.+-+..+......|.++-++.......-+.-|++.++.. ..++.+.+... 
+..+..+.+.| T Consensus 270 ~~~l~D~k~L~~---l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~~~~~g~~v~g-~~~~v~~~~~~~~~~~~~~~~~i 345 (536) T protein:vir:10 270 EEYLGDLRSLEN---LQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQTGDFVTG-RPEDISFLQLEKQADFTVAKAVS 345 (536) T ss_pred HHHHHHHHHHHH---HHHHHHHHHHHHhcCCcccCcccccchhhhccCCCcceecC-CcccceeeeccccccchHHHHHH Confidence 447777774 24444555666666666665432222111123344444332 22445555533 56688889999 Q ss_pred HHHHHHHHHHH-HHhhhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHcCCcCC-CCcceE Q lcl|NC_019408. 361 NDKERQIAAIG-GRMMPGASKSVSESNNQTVLREANEQSLLLNIIQACESGMT-----DVVRWWLMWRDVPLA-DTENLR 433 (612) Q Consensus 361 ~~~e~qm~~lG-a~ll~~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~-----~~l~~~a~w~g~~~~-~~~~~~ 433 (612) ++++..++.+= ..++.... ...-||+....+...-...|..+-.++.+-+- .++.++-+ .|+-.. ..+.+. T Consensus 346 ~~~~~rI~~af~~~~l~~~~-~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r-~g~lP~~p~~~v~ 423 (536) T protein:vir:10 346 DAIEARLSFAFMLNSAVQRT-GERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQA-TQQIPELPKEAVE 423 (536) T ss_pred HHHHHHHHHHHhhhhcccCC-CCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh-CCCCCCCChhhcc Confidence 99999997532 11221111 12348888888888888888887777765443 33333322 232111 112222 Q ss_pred EEeeccccccCCCH----HHHHHHHHHHHcCCCCHHHHHHHHHhc--CccchhhhhHHHHHHhhccccccccchhHHhhh Q lcl|NC_019408. 434 YEVNTDFLSTPIGA----REMRAIQLMANDGLLPDPVFYEYMRKA--EVISSDMTFEEFQALRADENSFINNPDAQARQR 507 (612) Q Consensus 434 v~ln~dF~~~~~d~----~~~~al~~~~~~G~is~et~~~~lqr~--~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~ 507 (612) + +|. ..+.+ +++..++. |...+..- .++++.++++...+.+++--.. .|...=+.. 
T Consensus 424 ~----~~v-s~l~~l~r~~~~~~l~~-----------~~~~la~~~P~~ld~~id~d~~~~~~a~~~Gv--~p~~~irt~ 485 (536) T protein:vir:10 424 P----TIS-TGLEAIGRGQDLDKLER-----------CVTAWAALAPMRDDPDINLAMIKLRIANAIGI--DTSGILLTE 485 (536) T ss_pred c----eEE-ecHHHHHHHHHHHHHHH-----------HHHHHHhhchhhhcccCCHHHHHHHHHHHcCC--CchhhcCCH Confidence 2 331 22211 12222222 22222221 2344456777777777654211 011111111 Q ss_pred hhhHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 508 GYTNRGQELEQSRMAREADFTQQKIDIQERSVAVQEGHAEVAHAAGSTSISGSRKLGDPEQAKPAVADQATIDNAKKQTA 587 (612) Q Consensus 508 ~e~~r~~~~e~~r~~~e~e~~~q~~e~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~e~q~k~~~~eq~~~~~~~k~~~ 587 (612) . +..+.|+++.. +++ + .+.+.... +. ...+-+.+ .+.- + ++. T Consensus 486 e------ev~~~r~q~~~--~~~----~-~~~a~~~~---------~~-~~~~~~~~-~~~~------~--------~~~ 527 (536) T protein:vir:10 486 E------QKQQKMAQQSM--QMG----M-DNGAAALA---------QG-MAAQATAS-PEAM------A--------AAA 527 (536) T ss_pred H------HHHHHHHHHHH--HHH----H-HHHHHHHH---------HH-HHHHHhcC-chhH------H--------hhh Confidence 1 11111111000 000 0 00000000 00 00000000 0000 0 001 Q ss_pred hhccccCCC Q lcl|NC_019408. 588 NAAKVAAQP 596 (612) Q Consensus 588 ~~a~~~~~~ 596 (612) ..+.-.|.- T Consensus 528 ~~~g~~~~~ 536 (536) T protein:vir:10 528 DSVGLQPGI 536 (536) T ss_pred hccccCCCC Confidence 111111111 No 106 >protein:vir:100039 Length: 522 # NCBI annotation: T7-like head-to-tail connector # Family: family:all:481 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214201;genbank:gi:61806424;genbank:GeneID:3294719 Probab=96.93 E-value=0.00025 Score=40.33 Aligned_cols=464 Identities=12% Similarity=0.101 Sum_probs=188.0 Q ss_pred CCCcHHH-------HHHHHHHHHHHHHhcChHHHHhcccccCCCCC-CCCHH-HHHHHHhhccCCchHHHHHHHhh---- Q lcl|NC_019408. 
1 MVTHPEY-------QYWRPEWTKLRDVMAGQREIKRKAEAYLPAMK-GADGD-DYAIYLQRATFFNMLAQTRDGMT---- 67 (612) Q Consensus 1 ~~~hP~y-------~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~-~e~~~-~Y~~rl~rA~~~n~~~~tv~~~~---- 67 (612) |=-.--| ..+..+|+-|.+.+ ||..- ..+.. ....++. -.|-+.-.+.++.++ T Consensus 1 m~~~~r~~~L~~~R~~~e~~w~e~~~~t-------------lP~~~~~~~~~~~~~~~~~-~~~dstg~~a~~~LAa~l~ 66 (522) T protein:vir:10 1 MKARERYNQLTTARQMFLDKAVECSELT-------------LPYLIDDDISSRPNHKSLT-VPWQSVGAKCCVTLAAKLM 66 (522) T ss_pred CchHHHHHHHHHHhhHHHHHHHHHHHHh-------------hhcccCCCCCCCccccccc-ccccchHHHHHHHHHHHHH Confidence 1111111 22344454444443 34221 11111 1111111 234444444554443 Q ss_pred chhhcC-Cce--ee--------cCC----HHHHHHHhccC------CCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchh Q lcl|NC_019408. 68 GMVFRR-DPI--VK--------NLP----PKFKDAVRRFA------KDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPR 126 (612) Q Consensus 68 G~vf~k-~p~--~~--------~~p----~~l~~~~~d~D------~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~ 126 (612) +.+|.- .|= +. +.+ ..++.+++.|+ ...++.+.-+-.++.+.+.+|-+-+++|-.. T Consensus 67 ~~ltpp~~~WF~l~~~d~~l~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~~~~--- 143 (522) T protein:vir:10 67 LAVLPPQTSFFKLQVRDDKLGEELDPQIRSELDLSFSKMERMIMDYIAASNDRVAVHQALKHLIVGGNALIFMGKDG--- 143 (522) T ss_pred HhhcCCCCccccccCChHHHhhhcChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCceeEEEcCCC--- Confidence 333321 111 10 111 23556666655 4567888889999999999999999988532 Q ss_pred hhhccCceEEEechhhhhcchhhhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecc Q lcl|NC_019408. 127 KGAVATSFAVGYSAENILDWDEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTA 206 (612) Q Consensus 127 ~~~~~rPy~~~~~ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~ 206 (612) +..|+-.++. -..|+...+.-|+.++.+.-. .-...|+....... ++. 
T Consensus 144 --------~~~~pl~~y~-----v~~d~~G~vd~i~r~~~~t~~---ql~~~fg~~~~~~~--------------~~~-- 191 (522) T protein:vir:10 144 --------LKTFPLTRYV-----INRDGDGNVLEIVTKELISRK---VLDIELPEPKPNTG--------------IDE-- 191 (522) T ss_pred --------ceEEEcceEE-----EeeCCCCCeeEEEeeeeccHH---HHHHhcchhccchh--------------hhc-- Confidence 3445443321 223555555556666544311 11112222111100 000 Q ss_pred cccccccceeeeeeeeccccccccccccceeEEEEEeeCCCceecceeeec---cCC-ccccceeEEEeecCCCCCCcCc Q lcl|NC_019408. 207 RTLGGYSYITVYRELKLEEIEWPSGEVKLAYVQYLYEEDPESRPIARIVPT---VRG-EPLDFIPFKFFGASGNTADVEK 282 (612) Q Consensus 207 ~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~p~---~~g-~~l~~IP~v~~~~~~~~~~~~~ 282 (612) .......+++|..... +...+ .+. ++.+..+ ..++. .+| ..+++||+.|.-..+..+ |. T Consensus 192 -~~~~~~~v~v~~~v~p------~~~~~-~~~--~~~~~~~-----~~~~~~~s~~g~~~~P~~~~Rw~~~~ge~Y--Gr 254 (522) T protein:vir:10 192 -SSTTNDDVTIYTYVKL------DKSSG-RWV--WHQEAFD-----KIIPDSRSTAPKNASPWLPLRFNTVDGEDY--GR 254 (522) T ss_pred -ccCCCCceEEEEEEEe------eccCC-ceE--EEEccCC-----ccccccccccccccCCceeeeeeecCCCcc--cc Confidence 0011112333332211 11111 111 1211111 11111 112 235677777776555555 44 Q ss_pred Cc----hHHHHHHHHHHHhhhHHHHHHHHHhccceeeee--cCCCCCCceEEEeccccccCCCCCceeEEecC-chhHHH Q lcl|NC_019408. 283 PP----LLDICDLNLSHYRTYAELEYGRLFTALPVYYAP--GTDSEGTGEYHIGPNMVWEVPQGSEPGILEYT-GQGLKA 355 (612) Q Consensus 283 pP----LldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i~--G~~~~~~~~l~iG~~~~~~lp~~~~~~~lE~~-g~~l~~ 355 (612) .| |-|+..||.- +. ..-...+.+.-|.+.+. |.. ....+.-|.++.+.-...++...++.. +..+.. 
T Consensus 255 gp~~~~l~D~k~L~~l---~~-~~~~~~~~a~~p~~lv~~~~~~--~~~~l~~~~~~~~v~g~~~~v~~~~~~~~~d~~~ 328 (522) T protein:vir:10 255 GRVEEFLGDLKSLDGL---SQ-SLIEGAAAASKVVFLVSPSSTT--KPATIAKAGNGAIVQGRPEDVAVIQVGKTADFST 328 (522) T ss_pred chHHHHHHHHHHHHHH---HH-HHHHHHHHhcCCceeecccccc--ccccccCCCCcceecCCCccceeecccccccchH Confidence 45 4477777742 22 23445555555655552 221 122344465666654455667777654 456888 Q ss_pred HHHHHHHHHHHHHHHHHHhhhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHcCCcCCCCc Q lcl|NC_019408. 356 LETALNDKERQIAAIGGRMMPGASKSVSESNNQTVLREANEQSLLLNIIQACESGMT-----DVVRWWLMWRDVPLADTE 430 (612) Q Consensus 356 ~~~~l~~~e~qm~~lGa~ll~~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~-----~~l~~~a~w~g~~~~~~~ 430 (612) ....+++++..++.+-.-+ .......-||++...+...-...|..+-.++.+-+- ++|.++.+ .|.-..-+. T Consensus 329 ~~~~i~~~~~ri~~aFl~~--~~~d~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r-~g~lP~~p~ 405 (522) T protein:vir:10 329 AANMATAIEKRLLEAFLVM--NVRNAERVTAEEVRLTQLELEQQLGGIFSLLVIEFLIPYLNRTLLVLQR-SNQIPKLPK 405 (522) T ss_pred HHHHHHHHHHHHHHHHhhc--cCCCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh-cCCCCCCCc Confidence 8999999999998753322 222333458999999988888888888888755443 34444433 332111111 Q ss_pred ceEEEeeccccccCCCH----HHHHHHHHHHHcCCCCHHHHHHHHHh-c--CccchhhhhHHHHHHhhccccccccc-hh Q lcl|NC_019408. 431 NLRYEVNTDFLSTPIGA----REMRAIQLMANDGLLPDPVFYEYMRK-A--EVISSDMTFEEFQALRADENSFINNP-DA 502 (612) Q Consensus 431 ~~~v~ln~dF~~~~~d~----~~~~al~~~~~~G~is~et~~~~lqr-~--~vl~~~~~~eee~~ria~e~~~~~~~-~~ 502 (612) ++ + .. ..+..+++ +.+..++ .|...+.. . ..+.+.+++++..+.+++-.. .+ .. 
T Consensus 406 ~~-~--~~-~~v~~is~Laraq~~~~l~-----------~~~~~i~~~~~p~~~~~~id~d~~~~~~a~~~G---vp~~~ 467 (522) T protein:vir:10 406 DI-V--RP-TIVAGVNALGRGQDRESLT-----------AFVGTIAQTLGPEALMQYLNPLEAIKRLAAAQG---IDVLN 467 (522) T ss_pred cc-c--cc-ccccchhHHHHHHHHHHHH-----------HHHHHHHHhhCchhhhhcCCHHHHHHHHHHHhC---CChhh Confidence 22 1 11 11222211 1111111 12222211 1 112234566666777765422 11 11 Q ss_pred HHhhhhhhHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 503 QARQRGYTNRGQELEQSRMAREADFTQQKIDIQERSVAVQEGHAEVAHAAGSTSISGSRKLGDPEQAKPAVADQATIDNA 582 (612) Q Consensus 503 ~~~~~~e~~r~~~~e~~r~~~e~e~~~q~~e~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~e~q~k~~~~eq~~~~~~ 582 (612) .-+.+++ ..+.|+++++.++++ ....++.+. ..... +. -+....-.+|-+..-+ T Consensus 468 ivrt~ee------v~~~~q~~q~~~~~~-------~~~~~a~~~---------~~~~~---~~-~~~~~~~~~~~~~~~~ 521 (522) T protein:vir:10 468 LVKTEQQ------LAEEQQAAQQQAAQQ-------SLVDQAGQM---------TGSPL---MD-PTKNPQLMDEEQPPME 521 (522) T ss_pred hcCCHHH------HHHHHHHHHHHHHHH-------HHHHHHHHH---------hcccc---cC-ccccHHHHHHhCCCCC Confidence 1111111 111111111000000 000000000 00000 00 0000000000000000 Q ss_pred H Q lcl|NC_019408. 583 K 583 (612) Q Consensus 583 ~ 583 (612) + T Consensus 522 ~ 522 (522) T protein:vir:10 522 E 522 (522) T ss_pred C Confidence 0 No 107 >protein:vir:80211 Length: 514 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522882;genbank:gi:158345175;genbank:GeneID:5687474 Probab=96.78 E-value=0.00034 Score=39.58 Aligned_cols=455 Identities=9% Similarity=0.010 Sum_probs=185.4 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCC---CCCCHHHHHHHHhhccCCchHH-HHHHHhh----chhhc Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAM---KGADGDDYAIYLQRATFFNMLA-QTRDGMT----GMVFR 72 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~---~~e~~~~Y~~rl~rA~~~n~~~-~tv~~~~----G~vf~ 72 (612) +--.-+-..+.++|+-|.+.+ ||.. 
+.++...- -.+...|..|+ +.++.++ +.+|. T Consensus 8 l~~k~~R~~~e~~w~e~a~~~-------------lP~~~~~~~~~~~~~---~~~~~~~dstg~~a~~~LAa~l~~~ltp 71 (514) T protein:vir:80 8 MWAEYRDSTAIRKAEDFAKFT-------------IASLMVDPLDKTHQA---EVVEYDFQSAGAFLVNNLTAKLALTLFP 71 (514) T ss_pred HHHHhhcchHHHHHHHHHHHh-------------cccccCCCCCCcccc---cccccccchhHHHHHHHHHHHHHhhhcC Confidence 111112223555665555554 3421 22221111 11112223333 5544443 33332 Q ss_pred -CCceee-cCC--------------HHHHHHHhccCC------CCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhc Q lcl|NC_019408. 73 -RDPIVK-NLP--------------PKFKDAVRRFAK------DGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAV 130 (612) Q Consensus 73 -k~p~~~-~~p--------------~~l~~~~~d~D~------~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~ 130 (612) ..|=+. .+. ..++.|++.|+. ..++++.-+-.++.+...+|-+.+++|-. T Consensus 72 p~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~-------- 143 (514) T protein:vir:80 72 PGRPSFQIELDDTLQELAAANGIDQSELHSRTADLERRATRRLFVNASLSKLHRILKLLVVTGNALFYREPG-------- 143 (514) T ss_pred CCCcccccccCchhhhhccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEEecC-------- Confidence 111111 000 135555554443 45788888889999999999998998621 Q ss_pred cCceEEEechhhhhcchhhhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccc Q lcl|NC_019408. 131 ATSFAVGYSAENILDWDEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLG 210 (612) Q Consensus 131 ~rPy~~~~~ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~ 210 (612) ..+ +..|+-.++. -..|+...+.-|+.++.+.... -...|... .... ..... 
T Consensus 144 ~~~-~~~~pl~~y~-----v~~d~~G~v~~i~rr~~~~~~~---l~~~~~~~------------------~~~~-~~~~~ 195 (514) T protein:vir:80 144 TGK-MLVWTMQSYT-----VRRTSHGDPAVVVLRQQMPFRE---LTPEIQAD------------------AQAK-QIAKR 195 (514) T ss_pred CCc-EEEEEcCeEE-----EeeCCCcCeEEEEeeeeecHHH---hhhhhhhh------------------hhhh-hccCC Confidence 122 4566644422 2235555555566666543211 11111111 0000 00001 Q ss_pred cccceeeeeeeeccccccccccccceeEEEEEeeCCCceecceeeeccCCc---cccceeEEEeecCCCCCCcCcCc--- Q lcl|NC_019408. 211 GYSYITVYRELKLEEIEWPSGEVKLAYVQYLYEEDPESRPIARIVPTVRGE---PLDFIPFKFFGASGNTADVEKPP--- 284 (612) Q Consensus 211 g~~~~~~~R~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g~---~l~~IP~v~~~~~~~~~~~~~pP--- 284 (612) ....+.+|-..... +++. ...+.+|.+..+ ..+...+|- -+++||+.|.-..+..+ |..| T Consensus 196 ~~~~v~v~~~v~~~----~~~~---~~~~sv~~e~~g-----~~i~~es~y~~~e~P~i~~Rw~~~~ge~Y--Grgp~~~ 261 (514) T protein:vir:80 196 DSDKCDLYTVIEWQ----PTPN---GKRCAVWHELEG-----KRVGPESSYPAHLCPYVPVAWNVPDGEHY--GRGYVEE 261 (514) T ss_pred CCCceEEEEEEEee----cCCC---CeEEEEEEeccc-----eeecccCccccccCCeeeeeeEecCCCCc--ccchHHH Confidence 11122222221111 0111 112333432211 112233443 35777777776555555 3344 Q ss_pred -hHHHHHHHHHHHhhhHHHHHHHHHhccceeee-e-cCCCCCCceEEEeccccccCCCCCceeEEecC-chhHHHHHHHH Q lcl|NC_019408. 285 -LLDICDLNLSHYRTYAELEYGRLFTALPVYYA-P-GTDSEGTGEYHIGPNMVWEVPQGSEPGILEYT-GQGLKALETAL 360 (612) Q Consensus 285 -LldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i-~-G~~~~~~~~l~iG~~~~~~lp~~~~~~~lE~~-g~~l~~~~~~l 360 (612) |-|+..||. -+.+-+.. .+.+.-|.+.+ . |.. ....+.-|.++.+.-....+.+.++.. 
+..+..+.+.| T Consensus 262 al~D~k~L~~---l~~~~l~~-~~~a~~~~~~v~~~g~~--~~~~l~~~~~g~~v~g~~~~v~~~~~~~~~d~~~~~~~i 335 (514) T protein:vir:80 262 YSGDFARLSI---LSERLGLY-EFEALSLLNLVDEAKGG--AVDDYRDAETGDFVPGQVGSVASYERGDYNKIAQASASV 335 (514) T ss_pred HHHHHHHHHH---HHHHHHHH-HHHhcCCCceeCccccc--chhhhcccCCceeecCCCccceeeecCcccchHHHHHHH Confidence 457777773 22232333 34444344333 2 221 123355555444432223456666654 45688889999 Q ss_pred HHHHHHHHHHHHHhhhcccc-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHH-cCCcCCCCcceE Q lcl|NC_019408. 361 NDKERQIAAIGGRMMPGASK-SVSESNNQTVLREANEQSLLLNIIQACESGMTD-----VVRWWLMW-RDVPLADTENLR 433 (612) Q Consensus 361 ~~~e~qm~~lGa~ll~~~~~-~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~-----~l~~~a~w-~g~~~~~~~~~~ 433 (612) ++++..++.+- |+..... ...-||++...+...-...|.-+-.++.+=+-. ++.++-+- .|.-.+-+.+. T Consensus 336 ~~~~~rI~~aF--ml~~~~rd~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~il~r~~~g~lP~~p~~l- 412 (514) T protein:vir:80 336 ESIVMRLNRAF--MYTGQVRDAERVTVEEIRTVAEEAENLLGGVYSLLAETLQAPLAYLTMYEASRGNGGMLLGIAQGV- 412 (514) T ss_pred HHHHHHHHHHH--hhhccCCCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCchh- Confidence 99999997642 2222111 122488899888888888888877777655443 22222221 12211111111 Q ss_pred EEeeccccccCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhc-Cc---cchhhhhHHHHHHhhccccccccch-hHHhhhh Q lcl|NC_019408. 434 YEVNTDFLSTPIGAREMRAIQLMANDGLLPDPVFYEYMRKA-EV---ISSDMTFEEFQALRADENSFINNPD-AQARQRG 508 (612) Q Consensus 434 v~ln~dF~~~~~d~~~~~al~~~~~~G~is~et~~~~lqr~-~v---l~~~~~~eee~~ria~e~~~~~~~~-~~~~~~~ 508 (612) +..+|.. + +.++..+.....| ..+...+... .+ +.+.+++++..+.+++- ++-|. 
...+.++ T Consensus 413 --~~~~~vs-~-----la~l~r~~~~~~l--~~~~~~i~~l~~~~p~v~d~id~d~~~~~~a~~---~Gvp~~~i~~~~e 479 (514) T protein:vir:80 413 --YRPSIIT-G-----IPALTRNIETANI--LRATQEASAIVPALVQLSKRFDPEKLVERIFAN---NSVDLSTLSKDPD 479 (514) T ss_pred --hcceeee-c-----HHHHHHHHHHHHH--HHHHHHHHHHhccchhhhhcCCHHHHHHHHHHH---hCCCHhhccCCHH Confidence 2233322 2 2222222222111 2223333221 11 12345667777777653 33332 1222222 Q ss_pred hhHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 509 YTNRGQELEQSRMAREADFTQQKIDIQERSVAVQEGHAEVAHAAGSTSIS 558 (612) Q Consensus 509 e~~r~~~~e~~r~~~e~e~~~q~~e~~~r~~~~~~~r~~~e~~~~~~~~~ 558 (612) +.+ .|++++..++|| .++..+.... ++.+.+.+ -. T Consensus 480 ~~~-------~~~~~~~~~~~~------~~~~~~~~~~-~~~~~~~~-~~ 514 (514) T protein:vir:80 480 VVA-------AEAEQEAALAQQ------QLDVASGALA-AETSAGVL-TS 514 (514) T ss_pred HHH-------HHHHHHHHHHHH------HHHHHHHHHH-Hhhhcccc-CC Confidence 111 111111000000 0000000000 00000000 00 No 108 >protein:vir:2198 Length: 536 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041995;swissprot:sw:p03728;genbank:gi:9627467;goa:P03728;uniprot:P03728;genbank:GeneID:1261033 Probab=96.56 E-value=0.00052 Score=38.61 Aligned_cols=467 Identities=10% Similarity=0.068 Sum_probs=182.8 Q ss_pred CC---CcHHH--------------HHHHHHHHHHHHHhcChHHHHhcccccCCCC---CCCCHHHHHHHHhhccCCchHH Q lcl|NC_019408. 1 MV---THPEY--------------QYWRPEWTKLRDVMAGQREIKRKAEAYLPAM---KGADGDDYAIYLQRATFFNMLA 60 (612) Q Consensus 1 ~~---~hP~y--------------~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~---~~e~~~~Y~~rl~rA~~~n~~~ 60 (612) |- +...- ..+.++|+-|.+.+ ||.. .+.+... +. .-.|-+.-. 
T Consensus 1 m~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~-------------lP~~~~~~~~~~~~---~~-~~~~dst~~ 63 (536) T protein:vir:21 1 MAEKRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYT-------------IPSLFPKDSDNAST---DY-QTPWQAVGA 63 (536) T ss_pred CcchhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHh-------------cccccCCCCCcccc---cc-cccccccHH Confidence 10 01111 22344455444443 3321 1111111 11 112333333 Q ss_pred HHHHH----hhchhhcCCcee-e-cCC--------------HHHHHHHhccC------CCCCCHHHHHHHHHHHHHHhCC Q lcl|NC_019408. 61 QTRDG----MTGMVFRRDPIV-K-NLP--------------PKFKDAVRRFA------KDGSSHATFAKAVLSEQAGVGR 114 (612) Q Consensus 61 ~tv~~----~~G~vf~k~p~~-~-~~p--------------~~l~~~~~d~D------~~G~~l~~f~~~~~~~~l~~Gr 114 (612) +.++. |.+.+|=-.| + . .++ ..++.|++.|. ...++.+.-+-.++.+.+.+|- T Consensus 64 ~a~~~Laa~l~~~ltP~~~-WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~ 142 (536) T protein:vir:21 64 RGLNNLASKLMLALFPMQT-WMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGN 142 (536) T ss_pred HHHHHHHHHHHHhhcCCCc-ccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCc Confidence 44443 4444441112 1 0 000 12333443321 1235577778888999999999 Q ss_pred eEEEEecCcchhhhhccCceEEEechhhhhcchhhhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeec Q lcl|NC_019408. 115 FGVLVDVVDNPRKGAVATSFAVGYSAENILDWDEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALAS 194 (612) Q Consensus 115 ~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~ 194 (612) +.+++|-+.. +..-++..|+-.++. + ..|+...+.-|+.++.+.- +.-...|+........ T Consensus 143 a~ly~~e~~~-----~~~~~f~~~pl~~~~-v----~~d~~G~vd~i~r~~~~t~---~~l~~~fg~~~~~~~~------ 203 (536) T protein:vir:21 143 VLLYLPEPEG-----SNYNPMKLYRLSSYV-V----QRDAFGNVLQMVTRDQIAF---GALPEDIRKAVEGQGG------ 203 (536) T ss_pred EeEEEeeCCC-----CceeeEEEEEcCeEE-E----eeCCCCCeeEEeeeeeccH---HHHHHhhhhhhccccc------ Confidence 9899885432 112245566644432 1 2234444444554554332 1122223221111000 Q ss_pred ccccccceeecccccccccceeeeeeeeccccccccccccceeEEEEEeeCCCceecceeeeccCCc-c---ccceeEEE Q lcl|NC_019408. 
195 GSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAYVQYLYEEDPESRPIARIVPTVRGE-P---LDFIPFKF 270 (612) Q Consensus 195 g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g~-~---l~~IP~v~ 270 (612) + ......+++|...... ++ +..+ .+|.+..+ ..++...|. + +++||+.| T Consensus 204 --------~-----~~~~~~v~v~~~v~~~----~~---~~~~--~~~~e~~g-----~~v~~~~g~~~f~~~P~i~~Rw 256 (536) T protein:vir:21 204 --------E-----KKADETIDVYTHIYLD----ED---SGEY--LRYEEVEG-----MEVQGSDGTYPKEACPYIPIRM 256 (536) T ss_pred --------c-----cccccceeEEEEEEEe----cC---CCcE--EEEeccCC-----eeeccccCccccccCCeeeeee Confidence 0 0011122333322111 01 1111 22222111 122223332 3 45566666 Q ss_pred eecCCCCCCcCcCc----hHHHHHHHHHHHhhhHHHHHHHHHhccceeeeecCCCCCCceEEEeccccccCCCCCceeEE Q lcl|NC_019408. 271 FGASGNTADVEKPP----LLDICDLNLSHYRTYAELEYGRLFTALPVYYAPGTDSEGTGEYHIGPNMVWEVPQGSEPGIL 346 (612) Q Consensus 271 ~~~~~~~~~~~~pP----LldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~~~~~~~~l~iG~~~~~~lp~~~~~~~l 346 (612) .-..+..+ |..| |-|+..||. -+.+-+..+......|.++-++.......-+.-|++.++.. ..++.+.+ T Consensus 257 ~~~~ge~Y--Grgp~~~~l~D~k~L~~---l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~~~~~g~~v~g-~~~~v~~~ 330 (536) T protein:vir:21 257 VRLDGESY--GRSYIEEYLGDLRSLEN---LQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQTGDFVTG-RPEDISFL 330 (536) T ss_pred eecCCCcc--ccchHHHHHHHHHHHHH---HHHHHHHHHHHHhcCCcccCcccccchhhhccCCCcceecC-Ccccceee Confidence 65555544 4445 447777774 24444555666666666665432222111123344444332 22445555 Q ss_pred ecC-chhHHHHHHHHHHHHHHHHHHH-HHhhhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHH Q lcl|NC_019408. 347 EYT-GQGLKALETALNDKERQIAAIG-GRMMPGASKSVSESNNQTVLREANEQSLLLNIIQACESGMT-----DVVRWWL 419 (612) Q Consensus 347 E~~-g~~l~~~~~~l~~~e~qm~~lG-a~ll~~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~-----~~l~~~a 419 (612) ... +..+..+.+.|++++..++.+= ..++.... 
...-||+....+...-...|..+-.++.+-+- .++.++- T Consensus 331 ~~~~~~~~~~~~~~i~~~~~rI~~af~~~~l~~~~-~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~ 409 (536) T protein:vir:21 331 QLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRT-GERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQ 409 (536) T ss_pred eccccccchHHHHHHHHHHHHHHHHHhhhhcccCC-CCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHH Confidence 533 5668888999999999997532 11221111 12348888888888888888887777765443 3333332 Q ss_pred HHcCCcC-CCCcceEEEeeccccccCCCH----HHHHHHHHHHHcCCCCHHHHHHHHHhc--CccchhhhhHHHHHHhhc Q lcl|NC_019408. 420 MWRDVPL-ADTENLRYEVNTDFLSTPIGA----REMRAIQLMANDGLLPDPVFYEYMRKA--EVISSDMTFEEFQALRAD 492 (612) Q Consensus 420 ~w~g~~~-~~~~~~~v~ln~dF~~~~~d~----~~~~al~~~~~~G~is~et~~~~lqr~--~vl~~~~~~eee~~ria~ 492 (612) + .|+-. ...+.+.+ +|. ..+.+ +++..++. |+..+..- .++++.++++...+.+++ T Consensus 410 r-~g~lP~~p~~~v~~----~~v-s~l~~l~r~~~~~~l~~-----------~~~~la~~~Pe~ld~~id~d~~~~~~a~ 472 (536) T protein:vir:21 410 A-TQQIPELPKEAVEP----TIS-TGLEAIGRGQDLDKLER-----------CVTAWAALAPMRDDPDINLAMIKLRIAN 472 (536) T ss_pred h-CCCCCCCChhhccc----eEE-ecHHHHHHHHHHHHHHH-----------HHHHHHhhchhhhcccCCHHHHHHHHHH Confidence 2 23211 11122222 331 22211 12222222 22222221 134445677777777765 Q ss_pred cccccccchhHHhhhhhhHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 493 ENSFINNPDAQARQRGYTNRGQELEQSRMAREADFTQQKIDIQERSVAVQEGHAEVAHAAGSTSISGSRKLGDPEQAKPA 572 (612) Q Consensus 493 e~~~~~~~~~~~~~~~e~~r~~~~e~~r~~~e~e~~~q~~e~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~e~q~k~~ 572 (612) --.. .|...=+.. ++..+.|+++.. +++ .+++ ..++...++.+ .+.. T Consensus 473 ~~Gv--~p~~~irt~------eev~~~r~q~~~--~~~----------~~~~-------a~~~~~~~~~~------~~~~ 519 (536) T protein:vir:21 473 AIGI--DTSGILLTE------EQKQQKMAQQSM--QMG----------MDNG-------AAALAQGMAAQ------ATAS 519 (536) T ss_pred HcCC--ChhhhcCCH------HHHHHHHHHHHH--HHH----------HHHH-------HHHHHHHHHHH------HhcC Confidence 4211 011111111 111111110000 000 0000 00000000000 0000 Q ss_pred HHHHHHHHHHH-HHHHhhccccCCC Q lcl|NC_019408. 
573 VADQATIDNAK-KQTANAAKVAAQP 596 (612) Q Consensus 573 ~~eq~~~~~~~-k~~~~~a~~~~~~ 596 (612) .+. .++...+.-.|.- T Consensus 520 --------~~~~~~~~~~~g~~~~~ 536 (536) T protein:vir:21 520 --------PEAMAAAADSVGLQPGI 536 (536) T ss_pred --------hhhHHhhhhccccCCCC Confidence 000 0111111111111 No 109 >protein:vir:78942 Length: 510 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522822;genbank:gi:158345057;genbank:GeneID:5687432 Probab=96.33 E-value=0.00073 Score=37.78 Aligned_cols=459 Identities=11% Similarity=0.037 Sum_probs=188.7 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhc----hhhcCC-c Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTG----MVFRRD-P 75 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G----~vf~k~-p 75 (612) |=+. .+..+|+-|.+.+ ||..-..+...-...+.+ .|-+.-.+.++.++. .+|.-- | T Consensus 12 lkr~----~~e~~w~e~a~~t-------------lP~~~~~~~~~~~~~~~~-~~dstg~~a~~~LAa~l~~~ltpp~~~ 73 (510) T protein:vir:78 12 LRDG----SVEQRAIEFAKTT-------------LPYLMVDPMSGSRGVVEH-DFQSAGALLVNNLAAKLARSLFPTGIP 73 (510) T ss_pred Hhcc----chHHHHHHHHHhh-------------ccccccCCCCcccccccC-cccchHHHHHHHHHHHHHHhhcCCCCc Confidence 2111 1344555555443 342111111111112222 244444445544443 333211 1 Q ss_pred eee-cC--------------CHHHHHHHhccC------CCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCce Q lcl|NC_019408. 76 IVK-NL--------------PPKFKDAVRRFA------KDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSF 134 (612) Q Consensus 76 ~~~-~~--------------p~~l~~~~~d~D------~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy 134 (612) =+. .+ ...++.|++.|. ...++++.-+-.++.+...+|-+.+++|-+. . . 
T Consensus 74 WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~--------~-~ 144 (510) T protein:vir:78 74 FFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDE--------A-T 144 (510) T ss_pred ccccCCChHHhhhcccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEEeCCC--------C-e Confidence 110 11 123565655542 3456778888888889889999888887432 1 2 Q ss_pred EEEechhhhhcchhhhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccc Q lcl|NC_019408. 135 AVGYSAENILDWDEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSY 214 (612) Q Consensus 135 ~~~~~ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~ 214 (612) +..|+-.++. -..|+...+.-|+.++.+.- +.-...|..... .... ....... T Consensus 145 ~~~~pl~~y~-----v~~d~~G~vd~i~rr~~~t~---~~l~~~~~~~~~------------------~~~~-~~~~~~~ 197 (510) T protein:vir:78 145 VVAWSLRSYA-----VRRDATGRWMDIVLKQRYKS---KDLDDVYKQDLM------------------RAGR-NLSGSGS 197 (510) T ss_pred EEEEEcceeE-----EeeCCCcCeeEEEeeeeccH---HHHHHHhhHHhh------------------hhhh-ccCCCce Confidence 5566644422 12344444544555554431 111222221100 0000 0011112 Q ss_pred eeeeeeeeccccccccccccceeEEEEEeeCCCceecceeeeccCCc---cccceeEEEeecCCCCCCcC--cCchHHHH Q lcl|NC_019408. 215 ITVYRELKLEEIEWPSGEVKLAYVQYLYEEDPESRPIARIVPTVRGE---PLDFIPFKFFGASGNTADVE--KPPLLDIC 289 (612) Q Consensus 215 ~~~~R~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g~---~l~~IP~v~~~~~~~~~~~~--~pPLldLA 289 (612) +++|...... ++ .....+++|.+-.+. .+...++- -+++||+.|.-..+..+..+ .--|-|+. T Consensus 198 v~v~~~V~~~-----~~--~~~~~~sv~~e~dg~-----~i~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k 265 (510) T protein:vir:78 198 VDLYTHVQRR-----KG--TAMDYAEMYHEIDGV-----RVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFA 265 (510) T ss_pred EEEEEEEEee-----cC--CCCcEEEEEEEecCe-----eeccccccccccCCeeeeeeeecCCCccccchHHHHHHHHH Confidence 2333322211 11 112234444432111 11223332 35677777776555555443 11245777 Q ss_pred HHHHHHHhhhHHHHHHHHHhccceeeeecCCCCCCceEEEeccccccCCCC-CceeEEecC-chhHHHHHHHHHHHHHHH Q lcl|NC_019408. 
290 DLNLSHYRTYAELEYGRLFTALPVYYAPGTDSEGTGEYHIGPNMVWEVPQG-SEPGILEYT-GQGLKALETALNDKERQI 367 (612) Q Consensus 290 ~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~~~~~~~~l~iG~~~~~~lp~~-~~~~~lE~~-g~~l~~~~~~l~~~e~qm 367 (612) .||. -+.+.+..+.+....|.++-.+.... .+.+.-|.+..+ .|.+ .+.+.++.. +..+..+.+.|++++..+ T Consensus 266 ~L~~---l~~~~l~~a~~a~~~~~lv~p~g~~~-~~~l~~~~~g~~-v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI 340 (510) T protein:vir:78 266 KLSL---LSEKLGLYELESLEVLNLVDEAKGAV-VDDYQDAEMGDY-VPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRL 340 (510) T ss_pred HHHH---HHHHHHHHHHHhhcCCcccCCccccc-hhhhccCCCcee-ecCCcccccccccCcccchHHHHHHHHHHHHHH Confidence 7774 34455666777777776665432111 223454554444 2322 335555533 345788899999999999 Q ss_pred HHHHHHhhhccc-cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHcCCcCCCCcceEEEeecccc Q lcl|NC_019408. 368 AAIGGRMMPGAS-KSVSESNNQTVLREANEQSLLLNIIQACESGM-----TDVVRWWLMWRDVPLADTENLRYEVNTDFL 441 (612) Q Consensus 368 ~~lGa~ll~~~~-~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~-----~~~l~~~a~w~g~~~~~~~~~~v~ln~dF~ 441 (612) +.+= |+.... .+..-||++...+...-...|..+-.++.+-+ ..++.++.+ .|+...-++.++.. . T Consensus 341 ~~aF--~~~l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r-~gl~p~p~~~~~~~-----~ 412 (510) T protein:vir:78 341 NQAF--MYGANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDD-ALLQGLITKQHKPA-----I 412 (510) T ss_pred HHHH--hhccccCCCCCcCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh-ccCCCCCcccccce-----e Confidence 8742 332111 11224888888888888888888777755433 344554433 34322222222211 1 Q ss_pred ccCCCHHHHHHHHHHHHc-CCCCHHHHHHHHHhcCccchhhhhHHHHHHhhccccccccchhHHhhhhhhHHHHhHHHHH Q lcl|NC_019408. 442 STPIGAREMRAIQLMAND-GLLPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFINNPDAQARQRGYTNRGQELEQSR 520 (612) Q Consensus 442 ~~~~d~~~~~al~~~~~~-G~is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~e~~r 520 (612) +..+++ |-.+... +..+...++..+-.-.-+.+.+++++..+.+++--.. 
.+...=+.+++.+ +.| T Consensus 413 v~~is~-----Laraq~~~~l~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gv--~p~~ivrs~eev~------a~~ 479 (510) T protein:vir:78 413 ETGLPA-----LSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSV--DTSQFYKSADELQ------AEA 479 (510) T ss_pred eecccH-----HHHHHHHHHHHHHHHHHHHhcChhhhhhcCCHHHHHHHHHHHhCC--ChhhhcCCHHHHH------HHH Confidence 222222 1111111 0111111111111111245567778777777754221 0111111111111 001 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 521 MAREADFTQQKIDIQERSVAVQEGHAEVAHAAGSTSISGSRKLGD 565 (612) Q Consensus 521 ~~~e~e~~~q~~e~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~ 565 (612) +++ +||++.++.-+.+. -++-.+.. --..+= T Consensus 480 ~~~----~~q~~~~~~~~~a~-------~~~~~~~~---~~~~g~ 510 (510) T protein:vir:78 480 EEQ----RRQAAQAQAAQETL-------LEGASDMT---NALAGV 510 (510) T ss_pred HHH----HHHHHHHHHHHHHH-------HHhhhhhc---ccCCCC Confidence 000 00000000000000 00000000 000000 No 110 >protein:vir:78161 Length: 355 # NCBI annotation: hypothetical protein # Family: family:all:2372 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294798;genbank:gi:149882819;genbank:GeneID:5309189 Probab=96.18 E-value=0.0009 Score=37.27 Aligned_cols=318 Identities=14% Similarity=0.076 Sum_probs=119.6 Q ss_pred EEEEeeccccccCCCcccccceeeeeeEeeecccccccceeeccccccccccee--eeeeeeccccccccccccceeEEE Q lcl|NC_019408. 163 LREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYIT--VYRELKLEEIEWPSGEVKLAYVQY 240 (612) Q Consensus 163 l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~--~~R~~~~~~~~~~~g~~~~~~~~~ 240 (612) +.|. .| .. .+|.+... .||-.. 
...+- T Consensus 1 v~Ei----vw------------------------------~~----~~g~~~~~~l~~r~~~-------------~~~~f 29 (355) T protein:vir:78 1 MFEQ----VY------------------------------RI----ENGRARLGKLAWRPPR-------------TISRF 29 (355) T ss_pred CeEE----EE------------------------------Ee----eCCeEEEeeeeecCcc-------------ceeee Confidence 1121 11 10 01111100 111100 00000 Q ss_pred EEeeCCCceecceeeeccCCcccccee---EEEe-ecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHHHHh--cccee Q lcl|NC_019408. 241 LYEEDPESRPIARIVPTVRGEPLDFIP---FKFF-GASGNTADVEKPPLLDICDLNLSHYRTYAELEYGRLFT--ALPVY 314 (612) Q Consensus 241 ~~~~~~~~~~~~~~~p~~~g~~l~~IP---~v~~-~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l~~~--~~P~l 314 (612) .+..++..... ...+. .|.---.|| |+++ +....+--.+.+.|..++..-+ |.+.+--.+..|.- +.|+| T Consensus 30 ~~~~~~~l~~~-~~~~~-~g~~~~~lp~~kfi~~~~~~~~g~p~G~gLlr~~~w~~~--fK~~~~~~w~~f~Er~g~g~p 105 (355) T protein:vir:78 30 DVAPDGGLVAI-EQWGV-FGKATVRIPVDRLVVFVNEREGANWLGQSLLRQAYKNWL--LKDRFLRIQALVGERNGLGVP 105 (355) T ss_pred eeccCCceeEE-EecCC-CCCCcceeccCCEEEEEeCCCCCCccchhhHHHHHHHHH--HHHhhHHHHHHHHHHcCCCce Confidence 01111110000 00000 111011233 3322 2222233334444445544333 23333334444444 45888 Q ss_pred eeecCCCCC---Cc--------------------eEEEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHH--H Q lcl|NC_019408. 315 YAPGTDSEG---TG--------------------EYHIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIA--A 369 (612) Q Consensus 315 ~i~G~~~~~---~~--------------------~l~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~--~ 369 (612) +..|..... .+ -+..|+.++..+|.|.++.|++..|++.. ..+.++...++|. . T Consensus 106 ~~~~~~~~~~~~~d~~~~~~~~~~~~~~l~~~~~~i~~g~~a~~iip~g~~ie~~ea~g~~~~-~~~~i~~~d~~Isk~i 184 (355) T protein:vir:78 106 IYQGAPLPEAIARDTARAEQWLNDQKEEGLQLAKEFRAGEAAGGYIPHGANFTLTGVQGKLPE-MDGPIRYHDEQIARAV 184 (355) T ss_pred EEEecCCCCcccchhhhHHHHHHHHHHHHHHHHHHhhCCcceeEeecCCceEEEeecCCCccc-HHHHHHHHHHHHHHHH Confidence 887642110 00 13458888999999999999998887543 4556666666663 3 Q ss_pred HHHHhhhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHcCCcCCCCcceEEEeeccccccCCCHH Q lcl|NC_019408. 
370 IGGRMMPGASKSVSESNNQTVLREANEQSLLLNIIQACESGMTD-VVRWWLMWRDVPLADTENLRYEVNTDFLSTPIGAR 448 (612) Q Consensus 370 lGa~ll~~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~-~l~~~a~w~g~~~~~~~~~~v~ln~dF~~~~~d~~ 448 (612) +| .+|.........|--.......-...++.+-+..+++++++ ++.+++.|-.-+ ...-..|.+.. ... -+.. T Consensus 185 LG-qtlTs~~~~~gGS~Alg~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~--~~~~P~~~~~~--~~~-~~~~ 258 (355) T protein:vir:78 185 LA-HFLTLGGDKSTGSYALGDTFASFFTGSLNAVMKHIADVTQQHVVEDLVDQNWGP--EEPAPRLVPAQ--LGK-EQPV 258 (355) T ss_pred hh-hhhccccCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC--CCCCCEEEecC--cCh-hHHH Confidence 44 44432111111222233445555566788888999999974 888888875211 12223454421 111 1223 Q ss_pred HHHHHHHHHHcCCC-CHHHHHHHHH-hcCccchhhhhHHHHHHhhccccccccchhHHhhhhhhHHHHhHHHHHHHHHHH Q lcl|NC_019408. 449 EMRAIQLMANDGLL-PDPVFYEYMR-KAEVISSDMTFEEFQALRADENSFINNPDAQARQRGYTNRGQELEQSRMAREAD 526 (612) Q Consensus 449 ~~~al~~~~~~G~i-s~et~~~~lq-r~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~e~~r~~~e~e 526 (612) .+..+..+...|.+ +.+....+++ +.|+..+.-..+... ...+........+.... .....+...+.- .++ T Consensus 259 ~a~~~~~l~~~G~~~~~~~~~~~~~e~~gip~p~~~~~~~~----~~~~~~~~~~~~~~~~~--~~~~~~~~a~~~-~a~ 331 (355) T protein:vir:78 259 TAEAIRALVECGAFTADPELEKDLRARYGLPAPAERDDGAD----AAAAKAAGRRRAKRLPG--QRQGAALPSRSP-RAD 331 (355) T ss_pred HHHHHHHHHhCCCccccHHHHHHHHHHhCCCCCCCCCcccC----CccccccccccccccCC--ccccccccccCC-CCC Confidence 45667788888874 4343334444 567754432211111 10000000000000000 000000000000 000 Q ss_pred HHHHHHHHHHHHHHHH-HHHHHHHHH Q lcl|NC_019408. 
527 FTQQKIDIQERSVAVQ-EGHAEVAHA 551 (612) Q Consensus 527 ~~~q~~e~~~r~~~~~-~~r~~~e~~ 551 (612) -..+..+.|....+ +-+.-...- T Consensus 332 --~~~~~~~~~~~~~~~~~~~~~~~~ 355 (355) T protein:vir:78 332 --PPRRRGPLRRRPRHPAHRRCAPDG 355 (355) T ss_pred --ChhhhHHHHHHhhccccCCCCCCC Confidence 00000000111110 000000000 No 111 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=96.16 E-value=0.00093 Score=37.20 Aligned_cols=407 Identities=11% Similarity=0.007 Sum_probs=170.5 Q ss_pred CCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceee--c Q lcl|NC_019408. 2 VTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVK--N 79 (612) Q Consensus 2 ~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~--~ 79 (612) ++.-+ -...+..|...-+ ....|-|..+. +. .|......=-...+++..|+....-++|+...|. + T Consensus 1 ~~~~D---------~~~~~~~~~g~~~-~~~~~~~~~~~-~~-~~~~l~a~Y~~~~l~~~~vd~~a~d~~r~~~~i~~~d 68 (437) T protein:vir:52 1 MKFFD---------GIKSLALKLGSKQ-EQTYYSPSLSL-TD-DLVQLEALWRDNWIANKVCIKRPEDMVRNWREIYSND 68 (437) T ss_pred Cchhh---------hhHhHHhcCCCcc-ccceeecCccc-cc-cHHHHHHHHHhCchhhHHhhcchHHhhcCCceEecCC Confidence 11000 0111222211111 11233333322 22 1222222212345667788888888999999883 2 Q ss_pred CCH----HHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhcchhhhccCCc Q lcl|NC_019408. 80 LPP----KFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILDWDEVVDMGGF 155 (612) Q Consensus 80 ~p~----~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW~~~~~v~g~ 155 (612) .++ .+...++.. .+.+-++.+++.+-.||.++|||.--..+ -..|. +.. 
T Consensus 69 ~~~~~~~~~~~~~~~l-----~~~~~l~~a~~~~rl~G~a~i~i~~d~~~----~~~pl------------------~~~ 121 (437) T protein:vir:52 69 LNSKQLDLFTKFERSL-----KLRETLTKALQWSSLYGSVGLLVVTDSQN----TSAPL------------------KPT 121 (437) T ss_pred CCHHHHHHHHHHHHhh-----cHHHHHHHHHHhcccccceEEEEEecCCC----ccccc------------------ccC Confidence 332 344445544 45677777777777799999888652210 00111 000 Q ss_pred cceeEEEEEEEeec-cccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeecccccccccccc Q lcl|NC_019408. 156 YVPSRVLLREFVRD-LRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVK 234 (612) Q Consensus 156 ~~Lt~v~l~E~v~~-~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~ 234 (612) ..+.-|++-..... +......|+ ..+.|+....|++... .. T Consensus 122 ~~~~~~~v~~~~~v~~~~~~~~dp-----------------------------~s~~fg~p~~y~v~~~---------~~ 163 (437) T protein:vir:52 122 ERLKRLIILPKWKISPTGTKDDDV-----------------------------LSPNFGRYSEYSILGG---------SQ 163 (437) T ss_pred CceeEEEEechhhccccccccccc-----------------------------cccccCcceEEEEecC---------Cc Confidence 11111111110000 000000111 1122223333333100 00 Q ss_pred ceeEEEEEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHH-HHHHHHHhccce Q lcl|NC_019408. 235 LAYVQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAE-LEYGRLFTALPV 313 (612) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD-~~~~l~~~~~P~ 313 (612) . ..+|.+ . +....|.+ +| ... ....|.|+|.. +.=-|..|...+. -..++|...+++ T Consensus 164 ~---~~iH~S--------R-ii~~~~~~---~~----~~~--~~~~G~s~le~-~~~~i~~~~~~~~~~~~l~~~~~~~v 221 (437) T protein:vir:52 164 S---ITVHHS--------R-LIILNAND---AP----LSD--NDIWGVSDLEK-IIDVLKRFDSASVNVGDLIFESKIDI 221 (437) T ss_pred c---eeEccc--------e-eEEecCcc---CC----Ccc--ccccCCchHHH-HHHHHHHHHHHHHHHHHHHHHcCCCc Confidence 0 011110 0 00011111 22 112 22235665543 4555556665554 366788888888 Q ss_pred eeeecCCC----CCCce-------E--EEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHHH---HHhhhc Q lcl|NC_019408. 
314 YYAPGTDS----EGTGE-------Y--HIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAIG---GRMMPG 377 (612) Q Consensus 314 l~i~G~~~----~~~~~-------l--~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lG---a~ll~~ 377 (612) +-+.|+.. ..... + .-+..+.+.+..+.++..+..+-+++. +.++...++|..+- +..|.. T Consensus 222 ~k~~~l~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~e~~~~~~sgl~---~~l~~~~~~iaaa~~iP~t~L~G 298 (437) T protein:vir:52 222 FKIAGLSDKIAAGMENEVASVISAVQEIKSATNSLLLDAENEYDRKELTFTGLK---DLLTEFRNAVAGAADMPVTILFG 298 (437) T ss_pred eecchHHHHhcCCcHHHHHHHHHHHHHhcCCCceEEEcCCcceEEEecCcCCHH---HHHHHHHHHHHHHhcCchhhhcC Confidence 88877421 11100 1 124456667777878888887777765 44455555655432 122222 Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHcCCcCCCCcceEEEeeccccccCCCHH-----HHH Q lcl|NC_019408. 378 ASKSVSESNNQTVLREANEQSLLLNII-QACESGMTDVVRWWLMWRDVPLADTENLRYEVNTDFLSTPIGAR-----EMR 451 (612) Q Consensus 378 ~~~~~~esa~~~~~~~~~~~s~L~~~a-~~~~~a~~~~l~~~a~w~g~~~~~~~~~~v~ln~dF~~~~~d~~-----~~~ 451 (612) ++..+-.|+ ..+...=...+.++- ..+...++.++.++..=.+-.. ..+++|.+|.=......+.. ..+ T Consensus 299 ~s~~Glasg---e~D~~~yyd~i~~~Qe~~l~p~le~l~~~i~~~~~g~~--~~~~~~~f~pL~~~s~kekae~~~~~a~ 373 (437) T protein:vir:52 299 QSVSGLASG---DEDIQNYHEAIRRLQETRLRPIFEIIDPLICNELFGGL--PADWWFEFVPLTTVKQEQQINMLNTFAT 373 (437) T ss_pred cCccccccc---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC--CCcceEEeCCcCCcCHHHHHHHHHHHHH Confidence 221111111 122222222333333 2355566666665543222222 23677777643333222111 234 Q ss_pred HHHHHHHcCCCCHHHHHHHHHhcCccchhhhhHHHHHHhhccccccccchhHHhhhhhhHHHHhH Q lcl|NC_019408. 452 AIQLMANDGLLPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFINNPDAQARQRGYTNRGQEL 516 (612) Q Consensus 452 al~~~~~~G~is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~ 516 (612) +...++++|.|+-...+.+|+..|+.+ ..+.++.......+....+-...........+.-+.. 
T Consensus 374 a~~~~~~~g~i~~~e~r~~L~~~g~~~-~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 437 (437) T protein:vir:52 374 AANTLIQNGVLNEYQIANELRESGLFA-NISAEHIEELKNADEFAGNFEEPEKMEGAQVQNSEDQ 437 (437) T ss_pred HHHHHHhcCCCCHHHHHHHHHhcCCCC-CCCccccccccCCCCCCCccCCCCCCCCCCCCCCCCC Confidence 566778999999999999999877754 2332222222111111000000001000111100000 No 112 >protein:vir:1538 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052106;swissprot:trembl:q9t110;genbank:gi:9634032;uniprot:Q9T110;genbank:GeneID:1262384 Probab=95.74 E-value=0.0015 Score=36.01 Aligned_cols=469 Identities=12% Similarity=0.079 Sum_probs=186.4 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCC-CCCCHHHHHHHHhhccCCchHHHHHH----HhhchhhcCCc Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAM-KGADGDDYAIYLQRATFFNMLAQTRD----GMTGMVFRRDP 75 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~-~~e~~~~Y~~rl~rA~~~n~~~~tv~----~~~G~vf~k~p 75 (612) |-. +-..+.++|+-|.+.+ ||.. +.++.. =..++. -.|-+...+.++ .|.+.+|=-.| T Consensus 21 l~~--~R~~~e~~w~e~~~~~-------------lP~~~~~~~~~-~~~~~~-~~~dst~~~a~~~Laa~l~~~ltP~~~ 83 (535) T protein:vir:15 21 LTN--DRRAYETRAENCAQYT-------------IPSLFPKESDN-ESTDYT-TPWQAVGARGLNNLASKLMLALFPMQS 83 (535) T ss_pred HHH--HhhHHHHHHHHHHHHh-------------cccccCCCCCc-cccccc-ccccccHHHHHHHHHHHHHHhhcCCCc Confidence 111 1122355565555544 2321 111110 000001 123333334444 34444441111 Q ss_pred eee-cCC--------------HHHHHHHhccC------CCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCce Q lcl|NC_019408. 76 IVK-NLP--------------PKFKDAVRRFA------KDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSF 134 (612) Q Consensus 76 ~~~-~~p--------------~~l~~~~~d~D------~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy 134 (612) =+. .+. +.++.|++.|. ...++.+.-+-.++.+.+.+|-+-++||-+.. .... 
T Consensus 84 WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~------~~~~ 157 (535) T protein:vir:15 84 WMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALLYLPEPEG------SYNP 157 (535) T ss_pred ccccccChHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeEEeecCCC------Ccee Confidence 011 111 23455554442 23566888888999999999999888875431 1222 Q ss_pred EEEechhhhhcchhhhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccc Q lcl|NC_019408. 135 AVGYSAENILDWDEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSY 214 (612) Q Consensus 135 ~~~~~ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~ 214 (612) +..|+-.++ .-..|+...+.-|+.++.+.-. .-...|... ...... ....... T Consensus 158 f~~~pl~~~-----~v~~d~~G~vd~i~r~~~~t~~---~l~~~~~~~------------------~~~~~~-~~~~~~~ 210 (535) T protein:vir:15 158 MKLYRLSSY-----VVQRDAYGNVLQIVTRDQIAFG---ALPEDVRSA------------------VEKAGG-EKKMDEM 210 (535) T ss_pred eEEEEcCee-----EEeeCCCCCeeEEEEeEeecHH---HHHHHHhHh------------------hhcccc-ccCCCCc Confidence 444544332 1123455555555555544321 111122211 000000 0011122 Q ss_pred eeeeeeeeccccccccccccceeEEEEEeeCCCceecceeeeccCCcc---ccceeEEEeecCCCCCCcCcCc----hHH Q lcl|NC_019408. 215 ITVYRELKLEEIEWPSGEVKLAYVQYLYEEDPESRPIARIVPTVRGEP---LDFIPFKFFGASGNTADVEKPP----LLD 287 (612) Q Consensus 215 ~~~~R~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g~~---l~~IP~v~~~~~~~~~~~~~pP----Lld 287 (612) +.+|...... .. ++.+.+....++. .+....++.+ +++||+.|.-..+..+ |..| |-| T Consensus 211 v~v~~~v~~~------~~-~~~~~~~~e~~g~------~~~~~~~~~~~~~~P~i~~Rw~~~~ge~Y--Grgp~~~~l~D 275 (535) T protein:vir:15 211 VDVYTHVYLD------EE-SGDYLKYEEVEDV------EIDGSDATYPTDAMPYIPVRMVRIDGESY--GRSYCEEYLGD 275 (535) T ss_pred eeEEEEEEEe------cC-CCcEEEEEEeeCc------cccccccccccccCCceeeeeeecCCCcc--ccchHHHHHHH Confidence 3344322111 11 1122222211111 1111122223 5666666765555544 3444 457 Q ss_pred HHHHHHHHHhhhHHHHHHHHHhccceeee-ecCCCCCCceEEEeccccccCCCCCceeEEecC-chhHHHHHHHHHHHHH Q lcl|NC_019408. 
288 ICDLNLSHYRTYAELEYGRLFTALPVYYA-PGTDSEGTGEYHIGPNMVWEVPQGSEPGILEYT-GQGLKALETALNDKER 365 (612) Q Consensus 288 LA~lnl~HY~~~sD~~~~l~~~~~P~l~i-~G~~~~~~~~l~iG~~~~~~lp~~~~~~~lE~~-g~~l~~~~~~l~~~e~ 365 (612) +..||.- +.+-+. ..+.+.-|.+.+ ...... ...+.-|..+.+.-...++.+.++.. +..+..+.+.|++++. T Consensus 276 ~k~L~~l---~~~~l~-~~~~~~~p~~lv~~~g~~~-~~~l~~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~ 350 (535) T protein:vir:15 276 LRSLENL---QEAIVK-MSMISAKVIGLVNPAGITQ-PRRLTKAQTGDFVPGRREDIDFLQLEKQADFTVAKAVSDQIEA 350 (535) T ss_pred HHHHHHH---HHHHHH-HHHHHhcCceeeccccccc-chhcccCCceeeecCCcccceeeecccccchhHHHHHHHHHHH Confidence 7777752 223234 444444555444 222111 22344444444433333556666543 4568889999999999 Q ss_pred HHHHHH--HHhhhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHcCC-cCCCCcceEEEee Q lcl|NC_019408. 366 QIAAIG--GRMMPGASKSVSESNNQTVLREANEQSLLLNIIQACESGMT-----DVVRWWLMWRDV-PLADTENLRYEVN 437 (612) Q Consensus 366 qm~~lG--a~ll~~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~-----~~l~~~a~w~g~-~~~~~~~~~v~ln 437 (612) .++.+= -.+.... ...-||++...+...-...|..+-.++.+-+- .++.++-+ .|+ +....+.++++|- T Consensus 351 ~I~~af~~~~~~~~~--~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r-~g~lP~~p~~~v~~~yi 427 (535) T protein:vir:15 351 RLSYAFMLNSAVQRT--GERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQA-TSQIPELPKEAVEPTIS 427 (535) T ss_pred HHHHHHhhhhcccCC--CccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHh-cCCCCCCCccceeEEEe Confidence 997642 1121121 22358899998888888888888888765443 33333322 122 1112233444432 Q ss_pred ccccccCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhc--CccchhhhhHHHHHHhhccccccccch-hHHhhhhhhHHHH Q lcl|NC_019408. 438 TDFLSTPIGAREMRAIQLMANDGLLPDPVFYEYMRKA--EVISSDMTFEEFQALRADENSFINNPD-AQARQRGYTNRGQ 514 (612) Q Consensus 438 ~dF~~~~~d~~~~~al~~~~~~G~is~et~~~~lqr~--~vl~~~~~~eee~~ria~e~~~~~~~~-~~~~~~~e~~r~~ 514 (612) .++ .++..+. +.-+-..|+..+... .++++.+++++..+.+++- ++-+. ..-+.+++.++. 
T Consensus 428 -----s~L-----a~aqr~~--~~~~l~~~~~~la~~~P~~ld~~id~d~~~~~~a~~---~Gvp~~~i~~~~eev~~~- 491 (535) T protein:vir:15 428 -----TGL-----EAIGRGQ--DLDKLERCISAWAALAPMQGDPDINLAVIKLRIANA---IGIDTSGILLTDEQKQAL- 491 (535) T ss_pred -----cHH-----HHHHHHH--HHHHHHHHHHHHHhcChhhhhccCCHHHHHHHHHHH---cCCChhhhcCCHHHHHHH- Confidence 122 1111111 111111233333221 2344456777777777654 22221 122222221110 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccC Q lcl|NC_019408. 515 ELEQSRMAREADFTQQKIDIQERSVAVQEGHAEVAHAAGSTSISGSRKLGDPEQAKPAVADQATIDNAKKQTANAAKVAA 594 (612) Q Consensus 515 ~~e~~r~~~e~e~~~q~~e~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~e~q~k~~~~eq~~~~~~~k~~~~~a~~~~ 594 (612) |+ +++.++.+. +.+.+..+.... +-+ +..+... ...+.+.-.+ T Consensus 492 -----~~------q~~~~~~~~-~~a~~~g~~~~~----------~~~-~~p~~~~--------------~~~~~~g~~~ 534 (535) T protein:vir:15 492 -----MM------QDAAQTGIE-NAAATGGAGVGA----------LAT-SSPEAMQ--------------GAAAQAGLDA 534 (535) T ss_pred -----HH------HHHHHHHHH-HHHHHHHhhccc----------hhc-cChHHHH--------------HHHhccCCCC Confidence 00 001000000 001111111000 000 0011110 1111111111 Q ss_pred C Q lcl|NC_019408. 595 Q 595 (612) Q Consensus 595 ~ 595 (612) . T Consensus 535 ~ 535 (535) T protein:vir:15 535 T 535 (535) T ss_pred C Confidence 1 No 113 >protein:vir:103765 Length: 549 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024925;genbank:gi:48697195;genbank:GeneID:2846089 Probab=95.63 E-value=0.0017 Score=35.75 Aligned_cols=478 Identities=13% Similarity=0.088 Sum_probs=179.3 Q ss_pred CCC-----------cH-----HHHHHHHHHHHHHHHhcChHHHHhcccccCCCC---------CCCCHHHHHHHHhhccC Q lcl|NC_019408. 1 MVT-----------HP-----EYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAM---------KGADGDDYAIYLQRATF 55 (612) Q Consensus 1 ~~~-----------hP-----~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~---------~~e~~~~Y~~rl~rA~~ 55 (612) |-+ += +-..+.++|+-|.+.+ ||.. 
.......+..+ .| T Consensus 1 m~~d~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~-------------lP~~~~~~~~~~~~~~~~~~~~~~----~~ 63 (549) T protein:vir:10 1 MTNDDAKILQALNADHGRMKEKRQSYEAVWNDVIDYL-------------MPRLDKFGQLPRPDSEKGRERSQK----MF 63 (549) T ss_pred CCcchHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHh-------------ccccccccccCCCCCCcccccccc----cc Confidence 221 11 1122344455444443 3332 11111111111 22 Q ss_pred CchHHHHHHH----hhchhhcC-Cce--e----ecCC--HHHHHHHhccC--------CCCCCHHHHHHHHHHHHHHhCC Q lcl|NC_019408. 56 FNMLAQTRDG----MTGMVFRR-DPI--V----KNLP--PKFKDAVRRFA--------KDGSSHATFAKAVLSEQAGVGR 114 (612) Q Consensus 56 ~n~~~~tv~~----~~G~vf~k-~p~--~----~~~p--~~l~~~~~d~D--------~~G~~l~~f~~~~~~~~l~~Gr 114 (612) -+.-.+.++. |.+.+|.- .|= + +.+. ...+.|++.|+ ...++++.-+-.++.+.+.+|- T Consensus 64 dstg~~a~~~LAs~l~~~ltpp~~~wF~l~~~~~~~~e~~~v~~~l~~ve~~~~~~~~~~~snf~~~~~~~~~~L~~~Gt 143 (549) T protein:vir:10 64 DSTAPLALRNFVAAMDSMITPATQLWHRLKTGNDALNEIASVKAYLQGVVRTLFAARYRWQGGFVTQMGATYQSIGLFGP 143 (549) T ss_pred cchHHHHHHHHHHHHHhhccCCCCccccccCCccchhhhhHHHHHHHHHHHHHHHHHhhhhcChHHHHHHHHHHHHhhcc Confidence 2222333333 33333331 110 1 1111 12344554332 2356788888889999999999 Q ss_pred eEEEEecCcchhhhhccCceEEEechhhhhcchhhhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeec Q lcl|NC_019408. 115 FGVLVDVVDNPRKGAVATSFAVGYSAENILDWDEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALAS 194 (612) Q Consensus 115 ~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~ 194 (612) +-+++|-.. +....+..|+-.++.=. .|+...+.-|+ |++ ....+.-...|+..... T Consensus 144 a~l~~~~~~------~~~~~f~~~pl~~~~v~-----~d~~G~vd~i~-r~~--~~t~~ql~~~fg~~~l~--------- 200 (549) T protein:vir:10 144 GALMIEHDV------GKGIVYRNVPMQRLWFA-----ENNSGLIDKTH-VQW--ELTLRQAAQRFGRENLS--------- 200 (549) T ss_pred eeeEEeecC------CCeeEEEEEEcCeEEEe-----eCCCCCeEEEE-EEe--ecCHHHHHHhcCcccCC--------- Confidence 999987321 11123444555543321 12333332222 221 01111111222211110 Q ss_pred ccccccceeecccccccccceeeeeeeecccccccc--ccccceeEEEEEeeCCCceecceeeeccCC-ccccceeEEEe Q lcl|NC_019408. 
195 GSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPS--GEVKLAYVQYLYEEDPESRPIARIVPTVRG-EPLDFIPFKFF 271 (612) Q Consensus 195 g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~--g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g-~~l~~IP~v~~ 271 (612) .-++..... .....+++|.........++. +..+-.+.+..++.++ . .+...+| ..+++||+.|. T Consensus 201 -----~~v~~~~~~-~~~~~~~v~~~V~pr~~~~~~~~~~~~~pf~sv~~e~~~-~-----~il~esg~~e~P~~~~Rw~ 268 (549) T protein:vir:10 201 -----PSMQSTLEK-DPEKSAIFYHAVEPRADRDPRKLDGRNMQFASYWLDEGR-D-----RIVQNSGFRTFPFAIGRFY 268 (549) T ss_pred -----HHHHHHhhc-CCCceEEEEEEeecCCCCCccccccccCceEEEEEEecC-C-----EeeccCCcccCCcceeeee Confidence 000000000 111223333322111111111 1111122222233322 1 2222333 45788888887 Q ss_pred ecCCCCCCcCcCc----hHHHHHHHHHHHhhhHHHHHHHHHhccceeeee--cCCCCCCceEEEeccccccCCCCCceeE Q lcl|NC_019408. 272 GASGNTADVEKPP----LLDICDLNLSHYRTYAELEYGRLFTALPVYYAP--GTDSEGTGEYHIGPNMVWEVPQGSEPGI 345 (612) Q Consensus 272 ~~~~~~~~~~~pP----LldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i~--G~~~~~~~~l~iG~~~~~~lp~~~~~~~ 345 (612) -..+..+ |..| |-|+..||.- +-..-...+.+..|.+.++ |... ...+.-|..+.+....+++..+ T Consensus 269 ~~~ge~Y--Grgp~~~~l~D~k~L~~l----~~~~l~~~~~~~~p~~~v~~~g~~~--~~~l~pgg~~~~~~~~~~~~~~ 340 (549) T protein:vir:10 269 VGTDDVY--GGSPAYDAMPDVRMANDM----AKTNIRGAQKLVDPPLLANEDGVLD--GFDLRSGALNWGGLNDKGEEMV 340 (549) T ss_pred ecCCCcc--ccchHHHHHHHHHHHHHH----HHHHHHHHHHHhcCceeeccccccc--cceeccCCccccccCCCCccce Confidence 6666555 3444 4577777752 2223445555556665553 2211 1234445555444433444333 Q ss_pred Ee-cCchhHHHHHHHHHHHHHHHHHHH-HHhhhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHH Q lcl|NC_019408. 346 LE-YTGQGLKALETALNDKERQIAAIG-GRMMPGASKSVSESNNQTVLREANEQSLLLNIIQACESGM-----TDVVRWW 418 (612) Q Consensus 346 lE-~~g~~l~~~~~~l~~~e~qm~~lG-a~ll~~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~-----~~~l~~~ 418 (612) .. 
-.+..+..+...|+++++.+..+= ..++........-||++...+...-...|..+-.++.+=+ .++|.++ T Consensus 341 ~pl~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~~~~~~~~TAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~R~~~il 420 (549) T protein:vir:10 341 KPLLTGKQAQIGIEFAQDTRQTINQWFYVTLFQILVDSGDMTATEVLQRAQEKGVLLAPTLGRTQSELLGPMIAREVDIL 420 (549) T ss_pred eeeccccchhHHHHHHHHHHHHHHHHHhhhhhhhhcCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHH Confidence 33 245577888899999999887642 1111111112346888888888888888888877776432 3444444 Q ss_pred HHHcCCcCCCCcce---EEEeeccccccCCC----HHHHHHHHHHHHcCCCCHHHHHHHHHhc--CccchhhhhHHHHHH Q lcl|NC_019408. 419 LMWRDVPLADTENL---RYEVNTDFLSTPIG----AREMRAIQLMANDGLLPDPVFYEYMRKA--EVISSDMTFEEFQAL 489 (612) Q Consensus 419 a~w~g~~~~~~~~~---~v~ln~dF~~~~~d----~~~~~al~~~~~~G~is~et~~~~lqr~--~vl~~~~~~eee~~r 489 (612) -+ .|.-..-+.+. .+.++-.|.. .+. ...+.++....+ ++..+... .++ +.+++++..+. T Consensus 421 ~r-~g~lP~~p~~l~~~~~~~~i~yis-~La~aq~~~~~~~i~~~~~--------~~~~laq~~Pe~l-d~id~d~~~~~ 489 (549) T protein:vir:10 421 AE-AGQLPDMPQELIDAGADVDVEYDS-PLNKAMRAGEGAAILQWLQ--------QLGIVSQFDPAAA-KVPNGARIARL 489 (549) T ss_pred Hh-cCCCCCCChhhhcCCceeEEEeec-HHHHHHHHHHHHHHHHHHH--------HHHHHhccChhHH-hcCCHHHHHHH Confidence 44 34311101111 1222223321 110 111221111111 11111110 111 23566666666 Q ss_pred hhccccccccchhHHhhhhhhHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 490 RADENSFINNPDAQARQRGYTNRGQELEQSRMAREADFTQQKIDIQERSVAVQEGHAEVAHAAGSTSISGSRKLGDPEQA 569 (612) Q Consensus 490 ia~e~~~~~~~~~~~~~~~e~~r~~~~e~~r~~~e~e~~~q~~e~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~e~q~ 569 (612) +++- ++-|...=+..++ .++.|++++ +++|+. .+ -+++..+... .....++++. 
T Consensus 490 ~a~~---~Gvp~~~irs~ee------v~~~r~~~~-~qqq~~--~~-~~~a~~a~~~--a~~~~~~~ta----------- 543 (549) T protein:vir:10 490 LADY---GGVPVEAMSTDEE------LQAQQAAEA-QAAQMQ--QM-LAAAPVAAGA--IKDLSDAQTA----------- 543 (549) T ss_pred HHHh---cCCCccccCCHHH------HHHHHHHHH-HHHHHH--HH-HHHHHHHHHH--HHhhhhhcCC----------- Confidence 6654 2222211111111 112222111 101110 00 0000000000 0000000000 Q ss_pred HHHHHHHHHHHHHHHHHHhhccc Q lcl|NC_019408. 570 KPAVADQATIDNAKKQTANAAKV 592 (612) Q Consensus 570 k~~~~eq~~~~~~~k~~~~~a~~ 592 (612) .-.|.. T Consensus 544 -----------------~~~~~~ 549 (549) T protein:vir:10 544 -----------------AQTARV 549 (549) T ss_pred -----------------CcccCC Confidence 000111 No 114 >protein:vir:6322 Length: 510 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877469;genbank:gi:33300841;uniprot:Q7Y2D5;genbank:GeneID:1482611 Probab=95.55 E-value=0.0019 Score=35.56 Aligned_cols=462 Identities=13% Similarity=0.054 Sum_probs=188.9 Q ss_pred CC---cHHH-----HHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhc----h Q lcl|NC_019408. 2 VT---HPEY-----QYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTG----M 69 (612) Q Consensus 2 ~~---hP~y-----~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G----~ 69 (612) .. ..-| ..+..+|+-|.+.+--. +...++.+. ..++.+ .|-+.-.+.++.++. . T Consensus 1 mk~~~~~~~~~lkR~~~e~~w~e~a~~tlP~----------~~~~~~~~~---~~~~~~-~~dstg~~a~~~LAa~l~~~ 66 (510) T protein:vir:63 1 MKTTAAMLWEKLRDGSVEQRAIEFAKTTLPY----------LMVDPMSGS---RGVVEH-DFQSAGALLVNNLAAKLARS 66 (510) T ss_pred ChhHHHHHHHHHhccchHHHHHHHHHhhccc----------cCCCCCCcc---ccccCC-CccchHHHHHHHHHHHHHhh Confidence 00 0111 11345555555544221 111122111 112222 244444455554443 3 Q ss_pred hhcCC-ceee-cC--------------CHHHHHHHhccCC------CCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhh Q lcl|NC_019408. 
70 VFRRD-PIVK-NL--------------PPKFKDAVRRFAK------DGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRK 127 (612) Q Consensus 70 vf~k~-p~~~-~~--------------p~~l~~~~~d~D~------~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~ 127 (612) +|.-- |=+. .+ ...++.|++.|.. ..++++.-+-.++.+.+.+|-+.+++|-- T Consensus 67 ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Li~~G~a~l~~~~~----- 141 (510) T protein:vir:63 67 LFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRDSD----- 141 (510) T ss_pred hcCCCCcccccCCChHHhhcccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEEcCC----- Confidence 33211 1110 11 1235555443322 34678888888899999999998888721 Q ss_pred hhccCceEEEechhhhhcchhhhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeeccc Q lcl|NC_019408. 128 GAVATSFAVGYSAENILDWDEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTAR 207 (612) Q Consensus 128 ~~~~rPy~~~~~ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~ 207 (612) .. .+..|+-.++. -..|+...+.-|+.++.+.... -...|... ..+.. . T Consensus 142 ---~~-~~~~~pl~~y~-----v~~d~~G~vd~i~rr~~~t~~~---l~e~~~~~------------------~~~~~-~ 190 (510) T protein:vir:63 142 ---AA-TVVAWSLRSYA-----VRRDATGRWMDIVLKQRYKSKD---LDEEYKQD------------------LMRAG-R 190 (510) T ss_pred ---Cc-EEEEEEcceeE-----EeeCCCcCeeEEEeeeeccHHH---HhHHhhhh------------------hhccc-c Confidence 12 25566654422 2235555555566666543211 11112111 00000 0 Q ss_pred ccccccceeeeeeeeccccccccccccceeEEEEEeeCCCceecceeeeccCC---ccccceeEEEeecCCCCCCcC--c Q lcl|NC_019408. 208 TLGGYSYITVYRELKLEEIEWPSGEVKLAYVQYLYEEDPESRPIARIVPTVRG---EPLDFIPFKFFGASGNTADVE--K 282 (612) Q Consensus 208 ~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g---~~l~~IP~v~~~~~~~~~~~~--~ 282 (612) .......+++|...... ++ .....+++|.+-.+.. +...++ +-+++||+.|.-..+..+..+ . 
T Consensus 191 ~~~~~~~v~v~~~V~~~-----~~--~~~~~~sv~~e~dg~~-----~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~ 258 (510) T protein:vir:63 191 NLSGSGSVDLYTHVQRK-----KG--TAMEYAELYHEIDGVR-----VGKEGRWPIHLCPYIVPTWNLAPGEHYGRGHVE 258 (510) T ss_pred ccCCCcceEEEEEEEee-----cC--CCceEEEEEEEecCce-----eccccccccccCceeeeeeeecCCCccccchHH Confidence 00111122333322211 11 1123444554322211 112233 235677777766555555443 1 Q ss_pred CchHHHHHHHHHHHhhhHHHHHHHHHhccceeeeecCCCCCCceEEEeccccccCCC-CCceeEEecC-chhHHHHHHHH Q lcl|NC_019408. 283 PPLLDICDLNLSHYRTYAELEYGRLFTALPVYYAPGTDSEGTGEYHIGPNMVWEVPQ-GSEPGILEYT-GQGLKALETAL 360 (612) Q Consensus 283 pPLldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~~~~~~~~l~iG~~~~~~lp~-~~~~~~lE~~-g~~l~~~~~~l 360 (612) --|-|+..||. -+.+.+..+.+....|.++-.+.... .+.+.-|.+..+ .|. -.+.+.++.. +..+..+.+.| T Consensus 259 ~~l~D~k~L~~---l~~~~l~~a~~a~~~~~lv~p~g~~~-~~~~~~~~~g~~-v~g~~~~v~~~~~~~~~d~~~~~~~i 333 (510) T protein:vir:63 259 DYIGDFAKLSL---LSEKLGLYELESLEVLNLVDEAKGAV-VDDYQDAEMGDY-VPGGAEAVRAYERGDYNKMAAIQQSL 333 (510) T ss_pred HHHHHHHHHHH---HHHHHHHHHHHhccCCcccCcccccc-hhhhccCCCcee-ecCCcccceeeecCcccchHHHHHHH Confidence 12457777764 34455666677777776665422111 223444544333 232 2335555544 34588889999 Q ss_pred HHHHHHHHHHHHHhhhccc-cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHcCCcCCCCcceEE Q lcl|NC_019408. 361 NDKERQIAAIGGRMMPGAS-KSVSESNNQTVLREANEQSLLLNIIQACESGM-----TDVVRWWLMWRDVPLADTENLRY 434 (612) Q Consensus 361 ~~~e~qm~~lGa~ll~~~~-~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~-----~~~l~~~a~w~g~~~~~~~~~~v 434 (612) ++++..++.+= |+.... .+..-||++...+...-...|..+-.++.+-+ ..++.++.+ .|+....++.+.. 
T Consensus 334 ~~~~~rI~~af--~~~l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r-~gl~p~p~~~~~~ 410 (510) T protein:vir:63 334 QAVVVRLNQAF--MYGANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDD-ALLQGLITKQHKP 410 (510) T ss_pred HHHHHHHHHHH--HhhcccCCCCCcCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh-ccCCCCCchhccc Confidence 99999988753 322111 11224888888888888888888777755433 344554433 3432222232221 Q ss_pred EeeccccccCCCHHHHHHHHHHHHc-CCCCHHHHHHHHHhcCccchhhhhHHHHHHhhccccccccchhHHhhhhhhHHH Q lcl|NC_019408. 435 EVNTDFLSTPIGAREMRAIQLMAND-GLLPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFINNPDAQARQRGYTNRG 513 (612) Q Consensus 435 ~ln~dF~~~~~d~~~~~al~~~~~~-G~is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~ 513 (612) . .+..+++ |-.+... +..+-..++..+..-.-+.+.+++++..+.+++--.. .+...=+.+++.+ T Consensus 411 ~-----~v~~is~-----Laraq~~~~l~~~~q~l~~~~~~aq~~~~id~d~~~~~~a~~~Gv--~p~~ivrs~eev~-- 476 (510) T protein:vir:63 411 A-----IETGLPA-----LSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSV--DTSQFYKSADELQ-- 476 (510) T ss_pred c-----eecchhH-----HHHHHHHHHHHHHHHHHHHhcCchhhhccCCHHHHHHHHHHHhCC--ChhHhcCCHHHHH-- Confidence 1 1222211 1111111 0111111222221111234667788877877754221 0111111111111 Q ss_pred HhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 514 QELEQSRMAREADFTQQKIDIQERSVAVQEGHAEVAHAAGSTSISGSRKLGD 565 (612) Q Consensus 514 ~~~e~~r~~~e~e~~~q~~e~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~ 565 (612) +..++++ ||.+.+++.++.. .+.-.+...... 
+= T Consensus 477 a~~~~~~--------qq~~~~~~~~~~~-------~~~a~~~~~~~~---g~ 510 (510) T protein:vir:63 477 AEAEQQR--------QQAAQAQAAQETL-------LEGASDMTNALA---GV 510 (510) T ss_pred HHHHHHH--------HHHHHHHHHHHHH-------HHHHHhhccccc---CC Confidence 0000000 0000000000000 000000000000 00 No 115 >protein:vir:3361 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523332;genbank:gi:17570823;genbank:GeneID:927409 Probab=95.52 E-value=0.0019 Score=35.49 Aligned_cols=467 Identities=10% Similarity=0.063 Sum_probs=182.5 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCC-CCCCHHHHHHHHhh-ccCCchHHHHHHH----hhchhhcCC Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAM-KGADGDDYAIYLQR-ATFFNMLAQTRDG----MTGMVFRRD 74 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~-~~e~~~~Y~~rl~r-A~~~n~~~~tv~~----~~G~vf~k~ 74 (612) |-. +-..+.++|+-|.+.+- |.. +.++.. ..-.+ -.|-+...+.++. |.+.+|=-. T Consensus 21 l~~--~R~~~e~~w~e~~~~~l-------------P~~~~~~~~~---~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~~ 82 (535) T protein:vir:33 21 LTN--DRRAYETRAENCAQYTI-------------PSLFPKESDN---ESTDYTTPWQAVGARGLNNLASKLMLALFPMQ 82 (535) T ss_pred HHH--HhhHHHHHHHHHHHHhc-------------ccccCCCCCc---ccccccccccccHHHHHHHHHHHHHHhhcCCC Confidence 211 11234555655555542 321 111100 00010 1233333344443 444444111 Q ss_pred ceee-cCC--------------HHHHHHHhccC------CCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCc Q lcl|NC_019408. 75 PIVK-NLP--------------PKFKDAVRRFA------KDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATS 133 (612) Q Consensus 75 p~~~-~~p--------------~~l~~~~~d~D------~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rP 133 (612) |=+. .++ +.++.|++.|. ...++.+.-+-.++.+.+.+|-+.+++|-+.. ... 
T Consensus 83 ~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~------~~~ 156 (535) T protein:vir:33 83 SWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALLYLPEPEG------SYN 156 (535) T ss_pred cccccccChHHHhccccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeEEeecCCC------Cce Confidence 1001 111 12444544432 23466888888889999999999999874421 122 Q ss_pred eEEEechhhhhcchhhhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeeccccccccc Q lcl|NC_019408. 134 FAVGYSAENILDWDEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYS 213 (612) Q Consensus 134 y~~~~~ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~ 213 (612) .+..|+-.++.= ..|+...+.-|+.++.+.- ..-...|....... ..++.. .. T Consensus 157 ~f~~~pl~~~~v-----~~d~~G~vd~i~r~~~~t~---~ql~~~~~~~~~~~--------------~~~k~~-----~~ 209 (535) T protein:vir:33 157 PMKLYRLSSYVV-----QRDAYGNVLQIVTRDQIAF---GALPEDVRSAVEKS--------------GGEKKM-----DE 209 (535) T ss_pred eeEEEEcCeeEE-----eeCCCCCeeEEEeeEeecH---HHHHHHhhhhhccc--------------cccccc-----cc Confidence 344555444321 2244444444555554331 11112222111100 000000 01 Q ss_pred ceeeeeeeeccccccccccccceeEEEEEeeCCCceecceeee-ccCCcc---ccceeEEEeecCCCCCCcCcCc----h Q lcl|NC_019408. 214 YITVYRELKLEEIEWPSGEVKLAYVQYLYEEDPESRPIARIVP-TVRGEP---LDFIPFKFFGASGNTADVEKPP----L 285 (612) Q Consensus 214 ~~~~~R~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~p-~~~g~~---l~~IP~v~~~~~~~~~~~~~pP----L 285 (612) .+.+|..... .+. ++.+.+....++. .++ ..++.+ +++||+.|.-..+..+ |..| | T Consensus 210 ~~~v~~~v~~------~~~-~~~~~~~~~~~~~-------~~~~~~~~~~~~~~P~i~~Rw~~~~ge~Y--Grgp~~~~l 273 (535) T protein:vir:33 210 MVDVYTHVYL------DEE-SGDYLKYEEVEDV-------EIDGSDATYPTDAMPYIPVRMVRIDGESY--GRSYCEEYL 273 (535) T ss_pred CCeEEEEEEe------eCC-CCcEEEEEEEeCc-------cccccccccccccCCceeeeeeecCCCcc--ccchHHHHH Confidence 1222222111 011 1112221111111 111 122223 5666666765555544 4444 4 Q ss_pred HHHHHHHHHHHhhhHHHHHHHHHhccceeee-ecCCCCCCceEEEeccccccCCCCCceeEEecC-chhHHHHHHHHHHH Q lcl|NC_019408. 
286 LDICDLNLSHYRTYAELEYGRLFTALPVYYA-PGTDSEGTGEYHIGPNMVWEVPQGSEPGILEYT-GQGLKALETALNDK 363 (612) Q Consensus 286 ldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i-~G~~~~~~~~l~iG~~~~~~lp~~~~~~~lE~~-g~~l~~~~~~l~~~ 363 (612) -|+..||.- +.+-+. ..+.+.-|.+.+ ...... ...+.-|..+.+.-...++.+.++.. +..+..+.+.|+++ T Consensus 274 ~D~k~L~~l---~~~~l~-~~~~~~~p~~lv~~~g~~~-~~~~~~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~ 348 (535) T protein:vir:33 274 GDLRSLENL---QEAIVK-MSMISAKVIGLVNPAGITQ-PRRLTKAQTGDFVPGRREDIDFLQLEKQADFTVAKAVSDQI 348 (535) T ss_pred HHHHHHHHH---HHHHHH-HHHHHhcCceeeccccccc-hhhcccCCceeeecCCcccceeeecccccchhHHHHHHHHH Confidence 577777752 223334 444444554444 222111 22344444444433333556666543 45688899999999 Q ss_pred HHHHHHHH--HHhhhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHcCC-cCCCCcceEEE Q lcl|NC_019408. 364 ERQIAAIG--GRMMPGASKSVSESNNQTVLREANEQSLLLNIIQACESGMT-----DVVRWWLMWRDV-PLADTENLRYE 435 (612) Q Consensus 364 e~qm~~lG--a~ll~~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~-----~~l~~~a~w~g~-~~~~~~~~~v~ 435 (612) +..++.+= -.+.... ...-||+....+...-...|..+-.++.+-+- .++.++-+ .|+ +....+.++++ T Consensus 349 ~~~I~~af~~~~~~~~~--~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r-~g~lP~~p~~~v~~~ 425 (535) T protein:vir:33 349 EARLSYAFMLNSAVQRT--GERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQA-TSQIPELPKEAVEPT 425 (535) T ss_pred HHHHHHHHhhhhcccCC--CccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHh-cCCCCCCCccceeEE Confidence 99997642 1121121 22358899998888888888888888765443 33333322 122 11123334444 Q ss_pred eeccccccCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhc--CccchhhhhHHHHHHhhccccccccch-hHHhhhhhhHH Q lcl|NC_019408. 436 VNTDFLSTPIGAREMRAIQLMANDGLLPDPVFYEYMRKA--EVISSDMTFEEFQALRADENSFINNPD-AQARQRGYTNR 512 (612) Q Consensus 436 ln~dF~~~~~d~~~~~al~~~~~~G~is~et~~~~lqr~--~vl~~~~~~eee~~ria~e~~~~~~~~-~~~~~~~e~~r 512 (612) |- .++ .++..+. +.-+-..|+..+... .++++.+++++..+.+++-. +-+. 
..-+.+++.++ T Consensus 426 yi-----s~L-----a~aqr~~--~~~~l~~~~~~la~~~P~~~d~~id~d~~~~~~a~~~---Gvp~~~i~~~~ee~~~ 490 (535) T protein:vir:33 426 IS-----TGL-----EAIGRGQ--DLDKLERCISAWAALAPMQGDPDINLAVIKLRIANAI---GIDTSGILLTDEQKQA 490 (535) T ss_pred Ee-----cHH-----HHHHHHH--HHHHHHHHHHHHHhhChhhhhccCCHHHHHHHHHHHc---CCCHhHhcCCHHHHHH Confidence 32 122 1111111 111111223333221 23444567777777776542 2221 12222111110 Q ss_pred HHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc Q lcl|NC_019408. 513 GQELEQSRMAREADFTQQKIDIQERSVAVQEGHAEVAHAAGSTSISGSRKLGDPEQAKPAVADQATIDNAKKQTANAAKV 592 (612) Q Consensus 513 ~~~~e~~r~~~e~e~~~q~~e~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~e~q~k~~~~eq~~~~~~~k~~~~~a~~ 592 (612) .|+ +++++..+..+++ +.. +.. ..+-+..- +..+ ++...+.- T Consensus 491 ------~~~------q~~~~~~~~~~~~-~~g---------~~~-~~~~~~~~--------------~~~~-~~~~~~g~ 532 (535) T protein:vir:33 491 ------LMM------QDAAQTGVENAAA-AGG---------AGV-GALATSSP--------------EAMQ-GAAAKAGL 532 (535) T ss_pred ------HHH------HHHHHHHHHHHHH-hhh---------hhh-cchhhcCC--------------hhHH-HHHHhccC Confidence 000 0000000000000 000 000 00000000 0000 11111111 Q ss_pred cCC Q lcl|NC_019408. 593 AAQ 595 (612) Q Consensus 593 ~~~ 595 (612) -+. T Consensus 533 ~~~ 535 (535) T protein:vir:33 533 NAT 535 (535) T ss_pred CCC Confidence 111 No 116 >protein:vir:96988 Length: 516 # NCBI annotation: 29 # Family: family:all:481 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654130;genbank:gi:108862014;genbank:GeneID:5075937 Probab=95.22 E-value=0.0025 Score=34.85 Aligned_cols=455 Identities=10% Similarity=0.050 Sum_probs=182.8 Q ss_pred CCC-----cHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhh----chhh Q lcl|NC_019408. 1 MVT-----HPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMT----GMVF 71 (612) Q Consensus 1 ~~~-----hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~----G~vf 71 (612) +.. ..+...+..+|+-|.+.+--.. +|... .. 
.+.++ .|-+.-.+.++.++ +.+| T Consensus 16 l~~r~~~L~~~R~~~e~~w~e~a~~~lP~~---------~~~~~---~~---~~~~~-~~dstg~~a~~~LAa~l~~~lt 79 (516) T protein:vir:96 16 IPKLWEKFSNKRSSFLDRAKHYSKLTLPYL---------MNDKG---DN---ETSQN-GWQGVGAQATNHLANKLAQVLF 79 (516) T ss_pred HHHHHHHHHHHhhHHHHHHHHHHHhhcccc---------cCCCC---Cc---cccCC-cccchHHHHHHHHHHHHHhhhc Confidence 111 1233344556666666543311 12111 11 11111 34444445555444 3333 Q ss_pred c-CCceee-cCC--------------HHHHHHHhccCC------CCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhh Q lcl|NC_019408. 72 R-RDPIVK-NLP--------------PKFKDAVRRFAK------DGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGA 129 (612) Q Consensus 72 ~-k~p~~~-~~p--------------~~l~~~~~d~D~------~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~ 129 (612) . ..|=+. .++ ..++.|++.|.. ..++++.-+-.++.+.+.+|-+.+++|.+. T Consensus 80 pp~~~WF~L~~~~~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~d~~~------ 153 (516) T protein:vir:96 80 PAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSCMLYKPSKG------ 153 (516) T ss_pred CCCCcccccccChhHHhhccccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEeEEecCCC------ Confidence 2 111110 111 135555544432 446788888888899999999989887432 Q ss_pred ccCceEEEechhhhhcchhhhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeeccccc Q lcl|NC_019408. 130 VATSFAVGYSAENILDWDEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTL 209 (612) Q Consensus 130 ~~rPy~~~~~ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~ 209 (612) + +..|+-.++. -..|+...+.-|+.++.+.... -...|.. .....+.. ... T Consensus 154 ---~-~~~~pl~~y~-----v~~d~~G~v~~i~rr~~~~~~~---l~~~~~~-~~~~~~~~----------------~~~ 204 (516) T protein:vir:96 154 ---A-ISAIPMHHYV-----VNRDTNGDLLDIILLQEKALRT---FDPATRA-VVEVGLKG----------------KKC 204 (516) T ss_pred ---C-EEEEEcCeEE-----EeeCCCCCeeeehhhhHhhHHH---HHHhhhh-hhhhhhhh----------------hhc Confidence 1 3455544422 1224444444455555322110 0011100 00000000 000 Q ss_pred ccccceeeeeeeeccccccccccccceeEEEEEeeCCCceecceeeeccCC---ccccceeEEEeecCCCCCCcC--cCc Q lcl|NC_019408. 
210 GGYSYITVYRELKLEEIEWPSGEVKLAYVQYLYEEDPESRPIARIVPTVRG---EPLDFIPFKFFGASGNTADVE--KPP 284 (612) Q Consensus 210 ~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g---~~l~~IP~v~~~~~~~~~~~~--~pP 284 (612) .....+++|-..... ++ +. +.+|.+.++. .+...+| ..+++||+.|.-..+..+..+ .-- T Consensus 205 ~~~~~v~v~~~v~~~----~~----~~--~~~~~~~d~~-----~~~~es~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~ 269 (516) T protein:vir:96 205 KEDDSVKLYTHAKYL----GD----GF--WELKQSADDI-----PVGKVSKIKSEKLPFIPLTWKRSYGEDWGRPLAEDY 269 (516) T ss_pred CCCCceEEEEeeeee----CC----ce--eEEEEEeCce-----eeccccccccccCCeeeeeeeecCCCCcccchHHHh Confidence 011112233211110 11 11 2233322211 1222334 246777777776655555443 112 Q ss_pred hHHHHHHHHHHHhhhHHHHHHHHHhccceeee-e-cCCCCCCceEEEeccccccCCCCCceeEEecC-chhHHHHHHHHH Q lcl|NC_019408. 285 LLDICDLNLSHYRTYAELEYGRLFTALPVYYA-P-GTDSEGTGEYHIGPNMVWEVPQGSEPGILEYT-GQGLKALETALN 361 (612) Q Consensus 285 LldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i-~-G~~~~~~~~l~iG~~~~~~lp~~~~~~~lE~~-g~~l~~~~~~l~ 361 (612) |-|+..||.- +.+ .-...+.+.-|.+.+ . |.. ....+.-|.++.+.-...++.+.++.. +..+..+...|+ T Consensus 270 L~D~k~L~~l---~~~-~l~~~~~a~~~~~lv~p~g~~--~~~~l~~~~~g~i~~g~~~~v~~~q~~~~~d~~~~~~~i~ 343 (516) T protein:vir:96 270 SGDLFVIQFL---SEA-VARGAALMADIKYLIRPGAQT--DVDHFVNSGTGEVVTGVEEDIHIVQLGKYADLTPISAVLE 343 (516) T ss_pred hHHHHHHHHH---HHH-HHHHHHHhcCCccccCccccc--chhhhccCCCceeecCCcccceeeecCcccchhHHHHHHH Confidence 5577777742 222 333445555454444 3 221 123455565555532222445666654 335888899999 Q ss_pred HHHHHHHHHHH--HhhhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHcCCcCCCCcceEEEeec Q lcl|NC_019408. 362 DKERQIAAIGG--RMMPGASKSVSESNNQTVLREANEQSLLLNIIQACESGMTD-VVRWWLMWRDVPLADTENLRYEVNT 438 (612) Q Consensus 362 ~~e~qm~~lGa--~ll~~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~-~l~~~a~w~g~~~~~~~~~~v~ln~ 438 (612) +++..++.+=. .+..... ..-||++...+...-...|.-+-.++.+-+-. ++..+..-++..++ ...+ +. 
T Consensus 344 ~~~~rI~~af~~~~l~~r~~--~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~l~~~~p~lp-~~~v----~~ 416 (516) T protein:vir:96 344 VYTRRIGVVFMMETMTRRDA--ERVTAVEIQRDALEIEQNMGGVYSLFATTMQSPVAMWGLLEAGESFT-SDLV----DP 416 (516) T ss_pred HHHHHHHHHHhhhhhccCCC--ccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhcCCCCc-cccc----cc Confidence 99999876321 1222221 22588888888888888888877776554433 33332222332222 1122 22 Q ss_pred cccccCCCHHHHHHHHHHHHcCCCCHHHHHHHHHh-cCc---cchhhhhHHHHHHhhccccccccchhHHhhhhhhHHHH Q lcl|NC_019408. 439 DFLSTPIGAREMRAIQLMANDGLLPDPVFYEYMRK-AEV---ISSDMTFEEFQALRADENSFINNPDAQARQRGYTNRGQ 514 (612) Q Consensus 439 dF~~~~~d~~~~~al~~~~~~G~is~et~~~~lqr-~~v---l~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~ 514 (612) ++. .. +.+|..+.....|. .+...+.. .++ +.+.+++++..+.+++-- +-|...=+..++ T Consensus 417 ~~v-s~-----l~~l~r~~~~~~i~--~~~~~i~~~~~~~p~v~d~id~d~~~~~~a~~~---Gvp~~~irs~ee----- 480 (516) T protein:vir:96 417 VII-TG-----IEALGRMAELDKLA--NFAQYMSLPLQWPEPVLAAVKWPDYMDWVRGQI---SAELPFLKSAEE----- 480 (516) T ss_pred eee-ch-----HHHHHHHHHHHHHH--HHHHHHHHHhcCChhHHhcCCHHHHHHHHHHHh---CCCccccCCHHH----- Confidence 222 12 22333332222221 22222221 111 114456666667766542 222111111111 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 515 ELEQSRMAREADFTQQKIDIQERSVAVQEGHAEVAHAAGSTSIS 558 (612) Q Consensus 515 ~~e~~r~~~e~e~~~q~~e~~~r~~~~~~~r~~~e~~~~~~~~~ 558 (612) -++.|+++. +++++. ..+...++.-....+.+.++. 
T Consensus 481 -v~~~~~~~~-~~q~~~------~~a~~~~~~~~~~~~~~~~~~ 516 (516) T protein:vir:96 481 -MAQEQEAQM-QAQQAQ------MLEEGVAKAVPGVIQQELKEA 516 (516) T ss_pred -HHHHHHHHH-HHHHHH------HHHHHhhhhhhHHhhcccccC Confidence 111111100 000000 001111111111110010000 No 117 >protein:vir:98816 Length: 446 # NCBI annotation: hypothetical protein # Family: family:all:32558 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851097;genbank:gi:117530254;genbank:GeneID:4484480 Probab=95.21 E-value=0.0025 Score=34.83 Aligned_cols=389 Identities=12% Similarity=0.041 Sum_probs=158.9 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCC------CC---CHHHHHHHHhhccCCchHHHHHHHhhchhh Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMK------GA---DGDDYAIYLQRATFFNMLAQTRDGMTGMVF 71 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~------~e---~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf 71 (612) +=+.|+....... ...|-.- ..| ||=..+ +- .-.-|+.-.. --.++...++..-..|. T Consensus 7 ~~p~~~~~~~~~~------~~~~~~~--~~g--~~~~D~~lr~~gg~~~~~~~l~~~m~e---~D~~v~s~l~~Rk~av~ 73 (446) T protein:vir:98 7 NAPTPAIRRRTIY------AMEHLGL--ATS--YLSEDGGYKRAGKPTYQQLSAWDEAAQ---TEPIIAQGLDSIALSVL 73 (446) T ss_pred CCCchhhhhhhhh------ccccchh--hcc--cCCcchHhhhcCCChHHHHHHHHHHHh---cchHHHHHHHHHHHHhh Confidence 4456654433221 0111110 112 431000 00 1123433332 24667777777777788 Q ss_pred cCCceeecCCHHH----HHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhcch Q lcl|NC_019408. 72 RRDPIVKNLPPKF----KDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILDWD 147 (612) Q Consensus 72 ~k~p~~~~~p~~l----~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW~ 147 (612) +.+..++-.++.. ..++.+++ ++.++.. +..++.+|.+.+=+.|-..+. 
+.+|--+ -..++.|+ T Consensus 74 ~~~w~V~p~~~~~a~~v~~~l~~~~-----~~~~~~~-~ldai~~G~s~~Eivw~~~~g---~~~p~~~---~d~~~~~~ 141 (446) T protein:vir:98 74 NKVGPYQHGDKRIKKFIDDQLRNRA-----KTWISHC-VKSIMTYGFSLSEQIYAHGAR---DNMPATV---LDDIVNYH 141 (446) T ss_pred cCCceecCccHHHHHHHHHHHhhcC-----chhHHHH-HHHHHhhCceeeeEEEeeccc---ccccchh---hccccccc Confidence 8888885223333 44444443 3333333 457777887766555432211 0111100 00111221 Q ss_pred hhhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccc Q lcl|NC_019408. 148 EVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIE 227 (612) Q Consensus 148 ~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~ 227 (612) . +.++ ++-..+ + + +.+|.. +++.......+.. T Consensus 142 ~------------~~~r-~~~~~~-----~----------~---~~~~~~-----------------~~~~~~~~~~~~~ 173 (446) T protein:vir:98 142 P------------LQVM-LIANDN-----G----------R---IVDGDT-----------------VTASQYKSGYWVP 173 (446) T ss_pred c------------ccce-eeeccC-----C----------c---cccccc-----------------cchhhcccccccC Confidence 0 0000 000000 0 0 000000 0000000000000 Q ss_pred cccccccceeEEEEEeeCCCceecceeeeccCCcccccee---EEE-eecCCCCCCcCcCchHHHHHHHH-HHHhhhHHH Q lcl|NC_019408. 228 WPSGEVKLAYVQYLYEEDPESRPIARIVPTVRGEPLDFIP---FKF-FGASGNTADVEKPPLLDICDLNL-SHYRTYAEL 302 (612) Q Consensus 228 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP---~v~-~~~~~~~~~~~~pPLldLA~lnl-~HY~~~sD~ 302 (612) . +.....+... .. ...|.. -.|| |++ .+....+.-.+.+.|..++..-+ ++|-. -+. T Consensus 174 ~----------~~~~~~~~~~-----~~-~~~g~~-~~iP~~kfi~~~~~~~~~~p~G~gLlr~~~w~~~fK~~~~-~~w 235 (446) T protein:vir:98 174 L----------PPYRIGDPPK-----KV-DVVGSH-VRLPSHKRLFINYNTKGNNPWGTSCLTSVLDYSIFKRAFR-DMM 235 (446) T ss_pred c----------ccchhhhhhh-----hc-ccCccc-ccccccceEEEEecCCCCCccccchHHHHHHHHHHHHhhH-HHH Confidence 0 0000000000 00 000110 1244 333 33333333445554445444332 33222 334 Q ss_pred HHHHHHhccceeeee---cCCCCCCce------------------EEEeccccccC-----CCCCceeEEecCchhHHHH Q lcl|NC_019408. 
303 EYGRLFTALPVYYAP---GTDSEGTGE------------------YHIGPNMVWEV-----PQGSEPGILEYTGQGLKAL 356 (612) Q Consensus 303 ~~~l~~~~~P~l~i~---G~~~~~~~~------------------l~iG~~~~~~l-----p~~~~~~~lE~~g~~l~~~ 356 (612) -.-+-.=++|+++.. |.+....+. ..+|+.++..+ |.|..+.|++..+++-... T Consensus 236 ~~f~E~yG~P~~vGkyp~ga~~~~~~~~~~~~~~~~~~~~L~~av~~~~~da~~ii~~~~~P~g~eie~~ea~~~~~~~~ 315 (446) T protein:vir:98 236 LIALDRYGTPLIYVIVPPGNTGVVEEAPDGTEITTTIAEQAEDALRRLSTDSGLVLTQLSKEQPVQVGALTTGNNFSDSF 315 (446) T ss_pred HHHHhHcCCceeEEeecCCCCcccccchhHHHHHHHHHHHHHHHHHhccccceeeeecccCCCCceEEeeccccCChhhH Confidence 445666678888875 333222210 13566665554 9999999999988764445 Q ss_pred HHHHHHHHHHHHH--HHHHhhhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHcCCcCCCCcceE Q lcl|NC_019408. 357 ETALNDKERQIAA--IGGRMMPGASKSVSESNNQTVLREANEQSLLLNIIQACESGMT-DVVRWWLMWRDVPLADTENLR 433 (612) Q Consensus 357 ~~~l~~~e~qm~~--lGa~ll~~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~-~~l~~~a~w~g~~~~~~~~~~ 433 (612) ...++-...+|.. +|-.|.-.+......|-........-..-++.+-+..+++.++ +++.+++.|-+-+....-.+. T Consensus 316 ~~~i~~~d~~IskaiLg~~Ltl~~~~~~~GS~ala~vh~~V~~d~~~aDa~~i~~tln~~Li~~l~~lNf~~~~~~~~~~ 395 (446) T protein:vir:98 316 ERAISLCDNNMLMGMGIPNLLVQNRETTFGTGRASEIQLELFDGKINSIFDTVIHAFTEQVIGNLIRLNFDPALYPLASN 395 (446) T ss_pred HHHHHHHHHHHHHHHhcccccccccccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccc Confidence 6666666666643 3432311111111122222333433344567788889999997 588999888653211100000 Q ss_pred EEeeccccccCCCHHH----HHHHHHHHHcCCCCHHHHHHHHHhcCccchhhhhHH Q lcl|NC_019408. 434 YEVNTDFLSTPIGARE----MRAIQLMANDGLLPDPVFYEYMRKAEVISSDMTFEE 485 (612) Q Consensus 434 v~ln~dF~~~~~d~~~----~~al~~~~~~G~is~et~~~~lqr~~vl~~~~~~ee 485 (612) ..+. .|... ++.+ +.++..+++.|.+-..+--...++.|+.+.+-+ . 
T Consensus 396 ~~~~-~~~~~--e~eDl~~~a~~~~~L~~~G~~~p~~~~~ire~~giP~~~~~--~ 446 (446) T protein:vir:98 396 TGYI-TRLPG--RATDLAALVEAIKQMHDMGFLVDGDKDHIRSITGLPDAISS--T 446 (446) T ss_pred cccc-eeccC--ChhhHHHHHHHHHHHHhCCccccccHHHHHHHhCcCCCCCC--C Confidence 0000 12211 2334 444556677887533221123345677443322 2 No 118 >protein:vir:94572 Length: 535 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919010;genbank:gi:119637774;genbank:GeneID:5179332 Probab=95.09 E-value=0.0028 Score=34.60 Aligned_cols=474 Identities=10% Similarity=0.054 Sum_probs=184.3 Q ss_pred CCCc----------HHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHH----Hh Q lcl|NC_019408. 1 MVTH----------PEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRD----GM 66 (612) Q Consensus 1 ~~~h----------P~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~----~~ 66 (612) |--| -+-..+.++|+-|.+.+--.. .| ..+..... ++. -.|-+.-.+.++ .| T Consensus 10 ~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~---------~~-~~~~~~~~---~~~-~~~dst~~~a~~~Laa~l 75 (535) T protein:vir:94 10 FAENGAKAVYDALKNDRNSYETRAENCAKYTIPSL---------FP-KDSDNAST---DYT-TPWQAVGARGLNNLASKL 75 (535) T ss_pred HHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccc---------CC-CCCCcccc---ccC-CcccccHHHHHHHHHHHH Confidence 1111 111234566666655543211 01 11111110 011 123333333333 34 Q ss_pred hchhhcCCceee-cCC--------------HHHHHHHhccC------CCCCCHHHHHHHHHHHHHHhCCeEEEEecCcch Q lcl|NC_019408. 67 TGMVFRRDPIVK-NLP--------------PKFKDAVRRFA------KDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNP 125 (612) Q Consensus 67 ~G~vf~k~p~~~-~~p--------------~~l~~~~~d~D------~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~ 125 (612) .+.+|=-.|=+. .++ ..++.+++.|+ ...++++.-+-.++.+.+.+|-+-+++|-+... 
T Consensus 76 ~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~ 155 (535) T protein:vir:94 76 MLALFPMQTWMKLTISEFEAKQLVAQPAELAKVEEGLSMVERILMNYIESNSYRVTLFETLKQLVVAGNALLYIPEPEGT 155 (535) T ss_pred HhhhcCCCCccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEeeccCcCc Confidence 444441111111 011 12555555431 346678888888899999999999998865421 Q ss_pred hhhhccCceEEEechhhhhcchhhhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeec Q lcl|NC_019408. 126 RKGAVATSFAVGYSAENILDWDEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQT 205 (612) Q Consensus 126 ~~~~~~rPy~~~~~ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~ 205 (612) ...+..|+-.++. -..|+...+.-|+.++.+.... -...|... +.... T Consensus 156 ------~~~f~~~pl~~y~-----v~~d~~G~vd~i~r~~~~~~~~---l~~~~~~~------------------~~~~~ 203 (535) T protein:vir:94 156 ------YNPMKLYRLSSYV-----VQRDAFGTVLQIVTLDKTAYAA---LPEDVRNS------------------MDSSQ 203 (535) T ss_pred ------ccceEEEEcCeEE-----EeeCCCCCeEEEEeeeeccHHH---hhHHHHHH------------------HHhcc Confidence 1124456544432 2234555565666665443211 11111110 00000 Q ss_pred ccccccccceeeeeeeeccccccccccccceeEEEEEeeCCCceecceeeeccCC-ccccceeEEEeecCCCCCCcCcCc Q lcl|NC_019408. 206 ARTLGGYSYITVYRELKLEEIEWPSGEVKLAYVQYLYEEDPESRPIARIVPTVRG-EPLDFIPFKFFGASGNTADVEKPP 284 (612) Q Consensus 206 ~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g-~~l~~IP~v~~~~~~~~~~~~~pP 284 (612) ...+...+.+|...... .. +..+.+..+. ++... ... ....| ..+++||+.|.-..+..+. ..| T Consensus 204 --~~~~~~~v~v~~~v~~~------~~-~~~~~~~~e~-~g~~~--~~~-~~~~g~~~~P~~~~Rw~~~~ge~YG--rgp 268 (535) T protein:vir:94 204 --EHKGDEMIDVYTHIYLD------EE-SGEYLKYEEI-DGVEV--EGT-DASYPVDACPYIPVRMVRIDGESYG--RSY 268 (535) T ss_pred --ccCCCceeEEEEEEEee------CC-CCcEEEEEEe-cCeee--ccc-cccCccccCCceeeeeeecCCCccc--cch Confidence 01112223333322111 00 1122221111 11110 000 01112 2356777777766555553 344 Q ss_pred ----hHHHHHHHHHHHhhhHHHHHHHHHhccceeeeecCCCCCCceEEE-eccccccCCCCCceeEEecC-chhHHHHHH Q lcl|NC_019408. 
285 ----LLDICDLNLSHYRTYAELEYGRLFTALPVYYAPGTDSEGTGEYHI-GPNMVWEVPQGSEPGILEYT-GQGLKALET 358 (612) Q Consensus 285 ----LldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~~~~~~~~l~i-G~~~~~~lp~~~~~~~lE~~-g~~l~~~~~ 358 (612) |-|+..||.- +.+-+..+......|.++-...... ...+.- |++.++. ...++.+.++.. +..+..+.. T Consensus 269 ~~~~l~D~k~L~~l---~~~~l~~~~~a~~~~~lv~p~g~~~-~~~~~~~~~g~~v~-g~~~~v~~~~~~~~~~~~~~~~ 343 (535) T protein:vir:94 269 CEEYLGDLRSLENL---QEAIVKMSMISAKVIGLVNPAGITQ-VRRLTKAQTGDFVS-GRPEDISFLQLEKAADFSVARA 343 (535) T ss_pred HHHHHHHHHHHHHH---HHHHHHHHHHhccCCcccccccccc-hhhcccCCCceeec-CCcccceeeecccccchhHHHH Confidence 4577777742 2333444444455555554321111 123333 3344332 223445556543 456888899 Q ss_pred HHHHHHHHHHHHHH--HhhhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHcCCcCCCCcc Q lcl|NC_019408. 359 ALNDKERQIAAIGG--RMMPGASKSVSESNNQTVLREANEQSLLLNIIQACESGMT-----DVVRWWLMWRDVPLADTEN 431 (612) Q Consensus 359 ~l~~~e~qm~~lGa--~ll~~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~-----~~l~~~a~w~g~~~~~~~~ 431 (612) .|++++..++.+-. .+... . ...-||++...+...-...|..+-.++.+-+- .+|.++-+- |.-..-+++ T Consensus 344 ~i~~~~~rI~~af~~~~~~~~-d-~~rvTAtEV~~r~~E~~~~LGpv~~rl~~ElL~Pli~r~~~il~r~-g~lP~~p~~ 420 (535) T protein:vir:94 344 VSEQIEGRLSYAFMLNSAVQR-T-GERVTAEEIRYVASELEDTLGGVYSILSQELQLPMVRVLLKQLQAT-NQIPELPKE 420 (535) T ss_pred HHHHHHHHHHHHHhHhhhccC-C-CCCccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhC-CCCCCCChh Confidence 99999999876421 11111 1 12248889888888888888888777755443 334433321 321111111 Q ss_pred eEEEeeccccccCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhc--CccchhhhhHHHHHHhhccccccccc-hhHHhhhh Q lcl|NC_019408. 432 LRYEVNTDFLSTPIGAREMRAIQLMANDGLLPDPVFYEYMRKA--EVISSDMTFEEFQALRADENSFINNP-DAQARQRG 508 (612) Q Consensus 432 ~~v~ln~dF~~~~~d~~~~~al~~~~~~G~is~et~~~~lqr~--~vl~~~~~~eee~~ria~e~~~~~~~-~~~~~~~~ 508 (612) . ++.+|. .++. ++..+. 
..-+-..|+..+..- .++++.+++++..+.+++- ++-+ ...=+.++ T Consensus 421 ~---v~~~~v-s~la-----~l~r~~--~~~~l~~~~~~laq~~P~~ld~~id~d~~~~~~a~~---~Gvp~~~i~rs~e 486 (535) T protein:vir:94 421 A---VEPTIS-TGME-----ALGRGQ--DLDKLERCIAAWSALAPMQGDPDINIATIKLRIANA---IGIDTSGILKTPE 486 (535) T ss_pred h---ccceEe-ehHH-----HHHHHH--HHHHHHHHHHHHHhhChHHhhhcCCHHHHHHHHHHH---hCCChhhhcCCHH Confidence 1 233442 2221 111110 001111122222221 2333445666666666553 2222 11111111 Q ss_pred hhHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019408. 509 YTNRGQELEQSRMAREADFTQQKIDIQERSVAVQEGHAEVAHAAGSTSISGSRKLGDPEQAKPAVADQATIDNAKKQTAN 588 (612) Q Consensus 509 e~~r~~~~e~~r~~~e~e~~~q~~e~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~e~q~k~~~~eq~~~~~~~k~~~~ 588 (612) +. ++.|++++.+ +++..+..++. +.... .+ . ...+ .++. T Consensus 487 ev------~~~~~q~~~~-------~~~~~~~~~~g---------~~~~~----------~~-------~-~~~~-~~~~ 525 (535) T protein:vir:94 487 EK------QQEMAEAAQG-------TAMQNAAASAG---------AGAGT----------MA-------T-ASPE-NMKA 525 (535) T ss_pred HH------HHHHHHHHHH-------HHHHHHHHHHH---------Hhhhc----------cc-------c-cChH-HHHH Confidence 10 0111000000 00000000000 00000 00 0 0000 1111 Q ss_pred hccccCCCch Q lcl|NC_019408. 589 AAKVAAQPPA 598 (612) Q Consensus 589 ~a~~~~~~~~ 598 (612) .+.-..-+|+ T Consensus 526 ~~~~~g~~~~ 535 (535) T protein:vir:94 526 AAAQAGMAPN 535 (535) T ss_pred HHHHhccCCC Confidence 1122222222 No 119 >protein:vir:99672 Length: 532 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249587;genbank:gi:68299738;genbank:GeneID:3799987 Probab=94.83 E-value=0.0034 Score=34.12 Aligned_cols=466 Identities=10% Similarity=0.099 Sum_probs=181.5 Q ss_pred CCCcHH-----------HHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhh-- Q lcl|NC_019408. 
1 MVTHPE-----------YQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMT-- 67 (612) Q Consensus 1 ~~~hP~-----------y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~-- 67 (612) -+.-+. -..+.++|+-|.+.+--.+. +. .+..... ++.+ .|-+.-.+.++.++ T Consensus 8 ~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~---------~~-~~~~~~~---~~~~-~~dst~~~a~~~LAa~ 73 (532) T protein:vir:99 8 GFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVF---------PS-ATADGST---SYTT-PWQSIGARGLNNLASK 73 (532) T ss_pred cccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhhccc---------CC-CCCcchh---hccc-cccchHHHHHHHHHHH Confidence 111111 12234555555554432210 10 1111111 1111 33444444444443 Q ss_pred --chhhc-CCceee-cCC--------------HHHHHHHhcc------CCCCCCHHHHHHHHHHHHHHhCCeEEEEecCc Q lcl|NC_019408. 68 --GMVFR-RDPIVK-NLP--------------PKFKDAVRRF------AKDGSSHATFAKAVLSEQAGVGRFGVLVDVVD 123 (612) Q Consensus 68 --G~vf~-k~p~~~-~~p--------------~~l~~~~~d~------D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~ 123 (612) +.+|. ..|=+. .++ +.++.|++.| -...++++.-+-.++.+.+.+|-+-+++|-+. T Consensus 74 L~~~ltpp~~~WF~l~~~d~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~ 153 (532) T protein:vir:99 74 LMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTE 153 (532) T ss_pred HHHhhcCCCCccccccCCHHHHhccCCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEecccc Confidence 34443 111111 011 2356665443 23446788888889999999999999887543 Q ss_pred chhhhhccCceEEEechhhhhcchhhhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeeccccccccee Q lcl|NC_019408. 124 NPRKGAVATSFAVGYSAENILDWDEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVR 203 (612) Q Consensus 124 a~~~~~~~rPy~~~~~ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~ 203 (612) .+. +....+..|+-.++. + ..|+...+.-|+.++.+...+. .+.|... +.++.. 
T Consensus 154 ~~~---~~~~~f~~~pl~~y~-v----~~d~~G~v~~ivrr~~~~~~~l---~e~~~~~---------~~~~~~------ 207 (532) T protein:vir:99 154 QVE---GQSNAPKLYKLHNFV-V----ERDAYDNVLQIVTEDKIARAAL---PEDVRKS---------LEDAQG------ 207 (532) T ss_pred ccc---CcccceEEEEcCeEE-E----eeCCCCCeeeEeeeeeecHHhc---ChHHHHH---------hhcccc------ Confidence 221 223335555544432 1 2244455555666654432211 1111100 000000 Q ss_pred ecccccccccceeeeeeeeccccccccccccceeEEEEEeeCCCceecceeeeccCCcc---ccceeEEEeecCCCCCCc Q lcl|NC_019408. 204 QTARTLGGYSYITVYRELKLEEIEWPSGEVKLAYVQYLYEEDPESRPIARIVPTVRGEP---LDFIPFKFFGASGNTADV 280 (612) Q Consensus 204 ~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g~~---l~~IP~v~~~~~~~~~~~ 280 (612) .......+++|...... +++ ..+.+..+.++ . .+.-..++-+ +++||+.|.-..+..+ T Consensus 208 ----~~~p~~~v~v~~~v~~~----~~~---~~~~~~~~~~g-~-----~~~~~~~~~~~~e~P~~~~Rw~~~~ge~Y-- 268 (532) T protein:vir:99 208 ----DQNPSEEVTIYTHVYRD----PEA---MVFRSYQEIDG-E-----IVAGTEGEYPLDSCPWIPVRLIKMPNEDY-- 268 (532) T ss_pred ----ccCCCcceEEEEEEEec----CCC---CeeEEEEeecC-c-----eecccccccccccCCceeeeeeecCCCcc-- Confidence 00111123333322111 011 11222222211 1 1111223333 4566666665555444 Q ss_pred CcCc----hHHHHHHHHHHHhhhHHHHHHHHHhccceeeeecCCCCCCceEEE-eccccccCCCCCceeEEecC-chhHH Q lcl|NC_019408. 281 EKPP----LLDICDLNLSHYRTYAELEYGRLFTALPVYYAPGTDSEGTGEYHI-GPNMVWEVPQGSEPGILEYT-GQGLK 354 (612) Q Consensus 281 ~~pP----LldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~~~~~~~~l~i-G~~~~~~lp~~~~~~~lE~~-g~~l~ 354 (612) |..| |-|+..||. -+.+-+..+......|.++-.+.... ...+.- |++.++.- ..++.+.++.. +..+. 
T Consensus 269 Grgp~~~~l~D~k~L~~---l~~~~l~~~~~a~~~~~lv~p~g~~~-~~~~~~~~~g~~v~g-~~~~i~~~~~~~~~~~~ 343 (532) T protein:vir:99 269 GRSFVEEYLGDLKSLEN---LYEAIVKMSMISSKVLFFVNPNGVTQ-IRRVAKANTGDFVAG-RKQDVEVFQLEKYNDFQ 343 (532) T ss_pred ccchHHHHHHHHHHHHH---HHHHHHHHHHHHcCCCceeccccccc-hhhhccCCCcceecC-Ccccceeeecccccchh Confidence 4444 457777774 23344454555555555554322211 122333 44343321 22345555533 45688 Q ss_pred HHHHHHHHHHHHHHHHHHH-hhhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHcCCcCCC Q lcl|NC_019408. 355 ALETALNDKERQIAAIGGR-MMPGASKSVSESNNQTVLREANEQSLLLNIIQACESGMT-----DVVRWWLMWRDVPLAD 428 (612) Q Consensus 355 ~~~~~l~~~e~qm~~lGa~-ll~~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~-----~~l~~~a~w~g~~~~~ 428 (612) .+.+.|++++..++.+=.- ++... ....-||+....+...-...|..+-.++.+-+- .++.++-+ .|. ++. T Consensus 344 ~~~~~i~~~~~rI~~af~~~~~~~~-d~~r~TAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r-~g~-lP~ 420 (532) T protein:vir:99 344 VAKATADDIEKRLSYAFMLNSAVQR-GGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQA-TSK-IPN 420 (532) T ss_pred HHHHHHHHHHHHHHHHHhhhhcccC-CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh-cCC-CCC Confidence 8999999999999764211 11111 112248889888888888888888777755443 33444333 232 111 Q ss_pred -CcceEEEeeccccccCCCH----HHHHHHHHHHHcCCCCHHHHHHHHHh-cCccchhhhhHHHHHHhhcccccccc-ch Q lcl|NC_019408. 429 -TENLRYEVNTDFLSTPIGA----REMRAIQLMANDGLLPDPVFYEYMRK-AEVISSDMTFEEFQALRADENSFINN-PD 501 (612) Q Consensus 429 -~~~~~v~ln~dF~~~~~d~----~~~~al~~~~~~G~is~et~~~~lqr-~~vl~~~~~~eee~~ria~e~~~~~~-~~ 501 (612) +.++ +. .+. ...+++ +.+..++ .+...+.. .+-+.+.+++++..+.+++-. +- +. 
T Consensus 421 ~p~~~-~~--~~i-v~~is~Laraq~~~~l~-----------~~~~~laq~~p~~~d~id~d~~~~~~a~~~---GV~~~ 482 (532) T protein:vir:99 421 LPKEA-VE--PAI-ATGLEALGRGHDLNKLN-----------VFIDYMIKLAGLQDDDINLLDVKMRLANSL---GMDTT 482 (532) T ss_pred CChhh-cc--cce-eecchHHHHHHHHHHHH-----------HHHHHHHhhcchhhhhCCHHHHHHHHHHHh---CCChh Confidence 1111 11 111 112211 1111111 12222221 122334556666666665531 11 11 Q ss_pred hHHhhhhhhHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 502 AQARQRGYTNRGQELEQSRMAREADFTQQKIDIQERSVAVQEGHAEVAHAAGSTSISGSRKLGDPEQAKPAVADQATIDN 581 (612) Q Consensus 502 ~~~~~~~e~~r~~~~e~~r~~~e~e~~~q~~e~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~e~q~k~~~~eq~~~~~ 581 (612) ..=+.+++.+ +.|++ +++++. + +...++..+...++. +...+.+.. T Consensus 483 ~i~r~~ee~~------~~~~q---~~~~~~---~----~~a~~~~~~~~~~~~---------~~~~~~~~~--------- 528 (532) T protein:vir:99 483 GLILTQQDKQ------AKMAE---ASTAAG---M----VTAGQQMGAAGGQAA---------AAMMQQQAG--------- 528 (532) T ss_pred hccCCHHHHH------HHHHH---HHHHHH---H----HHHHHHHHHHHHHhc---------chhHHhhcC--------- Confidence 1111111100 00100 000000 0 000000000000000 000110000 Q ss_pred HHHHHHhhcc Q lcl|NC_019408. 582 AKKQTANAAK 591 (612) Q Consensus 582 ~~k~~~~~a~ 591 (612) ..-+ T Consensus 529 ------~~~~ 532 (532) T protein:vir:99 529 ------MPTQ 532 (532) T ss_pred ------CCCC Confidence 0000 No 120 >protein:vir:103860 Length: 528 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938234;genbank:gi:38229139;genbank:GeneID:2648175 Probab=94.82 E-value=0.0034 Score=34.10 Aligned_cols=467 Identities=12% Similarity=0.072 Sum_probs=182.9 Q ss_pred CCCcHHHHHHHHHHHHHHHHhc-----C------hHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhch Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMA-----G------QREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGM 69 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~-----G------~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~ 69 (612) -+..|.-... .-+++.+. | -..++.+..-+++. 
--+-|+..+.+ -.++...++..... T Consensus 16 ~~~~~~~~~~----~~~~~~~~~~~~~gltp~~l~~il~~a~~gd~~~----~~~L~~~m~e~---D~~i~s~l~~Rk~a 84 (528) T protein:vir:10 16 QLRKQQTAHL----AGLAKEFANHPAKGLTPAKLAHILIEAEQGHLQA----QAELFMDMEER---DAHLFAEMSKRKRA 84 (528) T ss_pred cccchhhhhh----hhhhhhhcccCCCCCCHHHHHHHHHhhhCCCHHH----HHHHHHHHHhh---ChHHHHHHHHHHHH Confidence 2223321111 11122111 1 01222222111110 01234444332 56777777888888 Q ss_pred hhcCCceeec----CC--HH----HHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEec Q lcl|NC_019408. 70 VFRRDPIVKN----LP--PK----FKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYS 139 (612) Q Consensus 70 vf~k~p~~~~----~p--~~----l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ 139 (612) |...+..|+. -| .. +..++.++ .+++.++..++. ++-+|.+.+=+ T Consensus 85 v~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~----~~f~~~i~~~ld-a~~~G~s~~Ei-------------------- 139 (528) T protein:vir:10 85 VLGLDWTIEPPRNASAAEKADAEYLHELLLDL----EGIEDLMLDCMD-GVGHGYSAIEL-------------------- 139 (528) T ss_pred HhcCCceEecCCCCCHHHHHHHHHHHHHHhCC----ccHHHHHHHHHh-hhhhcceeEEE-------------------- Confidence 8888877741 11 12 23333333 247777777663 66677654432 Q ss_pred hhhhhcchhhhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeee Q lcl|NC_019408. 140 AENILDWDEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYR 219 (612) Q Consensus 140 ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R 219 (612) .|+. .+|...+.-|..+.. .-|.... + ++ .++| T Consensus 140 -----~w~~---~~g~~~~~~~~~r~~----------~~f~~~~----------~---------------~~----~~l~ 172 (528) T protein:vir:10 140 -----DWSL---QGREWLPQAFDHRPQ----------SWFQLNP----------D---------------DQ----DELR 172 (528) T ss_pred -----EEee---cCCceeEEEeeeecc----------cceeecc----------C---------------CC----cEEe Confidence 2432 134333333322210 0000000 0 00 0001 Q ss_pred eeeccccccccccccceeEEEEEeeCCCceecceeeeccCCccccceeEE-EeecCCCCCCcCcCchHHHHHHHH-HHHh Q lcl|NC_019408. 
220 ELKLEEIEWPSGEVKLAYVQYLYEEDPESRPIARIVPTVRGEPLDFIPFK-FFGASGNTADVEKPPLLDICDLNL-SHYR 297 (612) Q Consensus 220 ~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v-~~~~~~~~~~~~~pPLldLA~lnl-~HY~ 297 (612) . .++ ...|.+|+.-=|+ +.+....+.-.+...|..++..-+ ++| T Consensus 173 ~-----------------------~~~----------~~~g~~l~~~k~iv~~~~~~~g~p~g~gLlr~~~w~~~fK~~- 218 (528) T protein:vir:10 173 L-----------------------RDN----------SIAGEVLQPFGWIMHKPRSRSGYVARSGLFRVLAWPYLFKHY- 218 (528) T ss_pred c-----------------------cCC----------CCCceeecCCCeEEEeecCCCCCccccchHHHHHHHHHHHHh- Confidence 0 000 0012222211122 223333344445666666665544 333 Q ss_pred hhHHHHHHHHHhccceeeee---cCCCCCCce-----EEEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHH- Q lcl|NC_019408. 298 TYAELEYGRLFTALPVYYAP---GTDSEGTGE-----YHIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIA- 368 (612) Q Consensus 298 ~~sD~~~~l~~~~~P~l~i~---G~~~~~~~~-----l~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~- 368 (612) .-.+.-.-+..-|+|+++.. |.+++..+. ..||++++..+|.|..+.|++.++.+......-++-...+|. T Consensus 219 ~~~~w~~f~E~yG~P~~igky~~~a~~~ek~~L~~al~~i~~~~~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk 298 (528) T protein:vir:10 219 STADLAEMLEIYGLPIRLGKYPPGTPDEEKVTLLRAVTGLGHAAAGIIPESMSIDFQEASKGSAEPFMAMMRWCDDSMSK 298 (528) T ss_pred hHHHHHHHHHHcCCCeEEEecCCCCCHHHHHHHHHHHHHHhhCcEEEecCCceeEEeecCCCChhHHHHHHHHHHHHHHH Confidence 34566777888899998885 222221111 348999999999999999999887776766777777777764 Q ss_pred -HHHHHhhhcccc-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHcCCcCCC-CcceEEEeeccccccC Q lcl|NC_019408. 369 -AIGGRMMPGASK-SVSESNNQTVLREANEQSLLLNIIQACESGMTD-VVRWWLMWRDVPLAD-TENLRYEVNTDFLSTP 444 (612) Q Consensus 369 -~lGa~ll~~~~~-~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~-~l~~~a~w~g~~~~~-~~~~~v~ln~dF~~~~ 444 (612) .+|..| ....+ ....|--.......-...++.+-+..+++.+++ ++.+++.|-.-...+ ..-..|.+... 
...+ T Consensus 299 ~iLGqtl-Ts~~~~g~~gS~Alg~vh~~v~~di~~aDa~~i~~tln~~li~~l~~~N~~~~~~~~~~p~~~~~~~-e~eD 376 (528) T protein:vir:10 299 AILGGTL-TSQTSESGGGAYALGQVHNEVRHDLLAADARQLAATLSRDLLWPLLVLNRSGNLDARRAPRLVFDLK-DRAD 376 (528) T ss_pred HHhhhhh-hccccccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCccccceEEecCC-Cccc Confidence 355444 32111 111122233445555666788899999999984 889888885422111 11123433211 1111 Q ss_pred CCHHHHHHHHHHHHcCC-CCHHHHHHHHHhcCccchhhhhHHHHHHhhcccccc-cc------ch-hHHhhhhhhHHHHh Q lcl|NC_019408. 445 IGAREMRAIQLMANDGL-LPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFI-NN------PD-AQARQRGYTNRGQE 515 (612) Q Consensus 445 ~d~~~~~al~~~~~~G~-is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~-~~------~~-~~~~~~~e~~r~~~ 515 (612) + ...+..+..+...|. |+.+.+.+ +.|+..+. +.++.........+.. +. .. ..........++.. T Consensus 377 l-~~~a~~~~~L~~~G~~i~~~~i~e---~~gip~p~-~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 451 (528) T protein:vir:10 377 L-AAMATSLPPLVKLGVQVPVNWVQE---QLGIPLPA-NGEAVLGDQAGAGIAQLSRRPGPRIAALAQVIGPRYRDQEAL 451 (528) T ss_pred H-HHHHHHHHHHHhCCCCCCHHHHHH---HhCCCCCC-CCcccccCCCcccccccCcccccccccccccccccccccchH Confidence 1 223555667888887 77664433 44764332 1122211100000000 00 00 00000001111110 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhccccC Q lcl|NC_019408. 516 LEQSRMAREADFTQQKIDIQERSVAVQEGHAEVAHAAGSTSISGSRKLGDPE-QAKPAVADQATIDNAKKQTANAAKVAA 594 (612) Q Consensus 516 ~e~~r~~~e~e~~~q~~e~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~e~-q~k~~~~eq~~~~~~~k~~~~~a~~~~ 594 (612) .+........+.+..-..-- +.=++.++ +...=++.+.+..+-- ..-.++.++ .-+.-=..+.-+-+.. T Consensus 452 d~~~~~~~~~~~~~~~~~~l------~~i~~~l~--~~~s~ee~~~~L~~l~~~~d~~~l~~--~l~~a~~~A~l~G~~~ 521 (528) T protein:vir:10 452 DQVLASLPAQDMQNQADSLV------APLLDVIS--RGGSEAELLGALAEAFPDMDDSALAD--ALHRLLFVADTWGRLN 521 (528) T ss_pred HHHHHHHHHHHHHHHHHHHH------HHHHHHHH--hcCCHHHHHHHHHHHhhcCCHHHHHH--HHHHHHHHHHHhhhhh Confidence 00000000000010000000 00000000 0000011111111000 000000000 0000000111111111 Q ss_pred CCchhhc Q lcl|NC_019408. 
595 QPPAPAA 601 (612) Q Consensus 595 ~~~~~~~ 601 (612) ...+..- T Consensus 522 ~~~e~~~ 528 (528) T protein:vir:10 522 GTLDRID 528 (528) T ss_pred ccccccC Confidence 1000000 No 121 >protein:vir:103330 Length: 517 # NCBI annotation: head portal-like protein # Family: family:all:481 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039666;genbank:gi:125999995;genbank:GeneID:4818406 Probab=94.78 E-value=0.0035 Score=34.04 Aligned_cols=474 Identities=11% Similarity=0.073 Sum_probs=186.5 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcC-hHHHHhcccccCCCC-CCCCHHHHHHHHhhccCCchHHHHHHHhhc----hhhc-C Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAG-QREIKRKAEAYLPAM-KGADGDDYAIYLQRATFFNMLAQTRDGMTG----MVFR-R 73 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G-~~~vr~~g~~YLPk~-~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G----~vf~-k 73 (612) |+--=+-.....+|..++.--.- ...|+....-.||-. +..++. .+.+ -.|-+.-.+.++.++. .+|. . T Consensus 3 ~~~~~e~~~l~~r~~~Lk~~R~~~e~~w~e~~~~~lP~~~~~~~~~---~~~~-~~~dstg~~a~~~LAa~l~~~ltpp~ 78 (517) T protein:vir:10 3 MRFAGNKSKIPKLYEQLVGKRSPFLSRAENYSRFTLPYLMADVNDD---LSSQ-NAWQDDGASATNFLSNKLSQVLFPAQ 78 (517) T ss_pred ccccccHHHHHHHHHHHHHhhhHHHHHHHHHHHHhccccccCCCCC---cccc-ccccchHHHHHHHHHHHHHHhhcCCC Confidence 44333333344444433211100 223333333334422 111111 1111 1344444455555443 3332 1 Q ss_pred Cceee-cCC--------------HHHHHHHhcc------CCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccC Q lcl|NC_019408. 74 DPIVK-NLP--------------PKFKDAVRRF------AKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVAT 132 (612) Q Consensus 74 ~p~~~-~~p--------------~~l~~~~~d~------D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~r 132 (612) .|=+. .++ +.++.|++.| -...++.+.-+-.++.+...+|-+.+++|=. .. 
T Consensus 79 ~~WF~l~~~~~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~~~--------~~ 150 (517) T protein:vir:10 79 RSFFRIDLTPEGIKQLDNEAMTQSTAQKLLSDVEKAAMLYGESLQFRPAVVEAFKHLIVTGNVMMYHPDK--------TS 150 (517) T ss_pred CccccccCCHHHHHhhccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEEeCC--------CC Confidence 11111 111 2356565554 2345688888888999999999877776411 11 Q ss_pred ceEEEechhhhhcchhhhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccc Q lcl|NC_019408. 133 SFAVGYSAENILDWDEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGY 212 (612) Q Consensus 133 Py~~~~~ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~ 212 (612) .+..|+-.++. -..|+...+.-|+.++.+.. ..-...|....... +.. .. ..+ . T Consensus 151 -~~~~~pl~~y~-----v~~d~~G~v~~ivrr~~~~~---~~l~~~~~~~~~~~-~~~------------~~---~~~-~ 204 (517) T protein:vir:10 151 -PIQAVPLHHYC-----VRRDNNGTVLDIVFLQEKAL---ETFEPSIRMAIQAS-RKG------------KQ---YKD-K 204 (517) T ss_pred -cEEEEEcCeEE-----EeeCCCcCeEEEEeeeeccH---HHHHHHhhhhcchh-hhh------------hc---cCC-c Confidence 34556544422 22345555555666654332 12222232211000 000 00 000 0 Q ss_pred cceeeeeeeeccccccccccccceeEEEEEeeCCCceecceeeeccCC---ccccceeEEEeecCCCCCCcC--cCchHH Q lcl|NC_019408. 213 SYITVYRELKLEEIEWPSGEVKLAYVQYLYEEDPESRPIARIVPTVRG---EPLDFIPFKFFGASGNTADVE--KPPLLD 287 (612) Q Consensus 213 ~~~~~~R~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g---~~l~~IP~v~~~~~~~~~~~~--~pPLld 287 (612) ..+++|...... 
.+ +.++ +|.+.++ ..+...+| +.+++||+.|.-..+..+..+ .--|-| T Consensus 205 ~~v~v~~~v~~~----~~----~~~~--~~~~~d~-----~~~~~~s~y~~~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D 269 (517) T protein:vir:10 205 DNVKLYTHAKRT----KD----GKYL--IRQSADD-----VPVGKESTVTEDKSPFLILTWKRSYGEDYGRGMAEDHAGA 269 (517) T ss_pred CceEEEEEEEEe----CC----CceE--EEEEeCc-----eeeccccccccccCCeeeeeeeecCCCCcccchHHHhHHH Confidence 112222211110 11 1222 2222211 11112233 345677777776655555433 112557 Q ss_pred HHHHHHHHHhhhHHHHHHHHHhccceeeeecCCCCCCceEEEeccccccCCCCCceeEEecC-chhHHHHHHHHHHHHHH Q lcl|NC_019408. 288 ICDLNLSHYRTYAELEYGRLFTALPVYYAPGTDSEGTGEYHIGPNMVWEVPQGSEPGILEYT-GQGLKALETALNDKERQ 366 (612) Q Consensus 288 LA~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~~~~~~~~l~iG~~~~~~lp~~~~~~~lE~~-g~~l~~~~~~l~~~e~q 366 (612) +..||. -+.+-+..+......|.++-...... ...+.-|.++.+.-..-++...++.. +..+..+.+.|++++.. T Consensus 270 ~k~L~~---l~~~~~~~~~~a~~~~~lv~~~~~~~-~~~l~~~~~g~~~~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~r 345 (517) T protein:vir:10 270 FFVIQF---LSEALARGMALMADVKYLVKPGSYTD-INQFVEGGSGAVLHGVEGDIHIVQLGKYADYTPIQAVLNDYRQR 345 (517) T ss_pred HHHHHH---HHHHHHHHHHHhccCCcccCcccccc-hhhccCCCccccccCCcccceeeecccccchhHHHHHHHHHHHH Confidence 777774 33444555555555666654322111 22344444433321112345555533 34578889999999999 Q ss_pred HHHHHH--HhhhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHcCCcCCCCcceEEEeecccccc Q lcl|NC_019408. 367 IAAIGG--RMMPGASKSVSESNNQTVLREANEQSLLLNIIQACESGMTDV-VRWWLMWRDVPLADTENLRYEVNTDFLST 443 (612) Q Consensus 367 m~~lGa--~ll~~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~~-l~~~a~w~g~~~~~~~~~~v~ln~dF~~~ 443 (612) ++.+=. .+..... ..-||++...+...-...|.-+-.++.+-+-.- +..+..-++... ....+.++| . . 
T Consensus 346 I~~af~~~~l~~~~~--~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~~l~~~l-~~~~v~~~~----~-s 417 (517) T protein:vir:10 346 IGRVFMMEAMTRRDA--ERVTAYEIQRDAMLVEQSLGGVYSLFATTFQGPLARWFMNGISSIL-TSKNVSPTI----L-T 417 (517) T ss_pred HHHHHhhhhhhccCC--ccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHhhhhc-CCCCcccee----e-c Confidence 876421 1221221 235888888888888888888777765543221 111111112111 122333332 1 1 Q ss_pred CCCHHHHHHHHHHHHcCCCCHHHHHHHHHhc----CccchhhhhHHHHHHhhccccccccchhHHhhhhhhHHHHhHHHH Q lcl|NC_019408. 444 PIGAREMRAIQLMANDGLLPDPVFYEYMRKA----EVISSDMTFEEFQALRADENSFINNPDAQARQRGYTNRGQELEQS 519 (612) Q Consensus 444 ~~d~~~~~al~~~~~~G~is~et~~~~lqr~----~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~e~~ 519 (612) . +.++..+.... +-..+...+..- ..+.+.+++++..+.+++-- +-|...=+.++|.+ +. T Consensus 418 ~-----la~l~r~~~~~--~i~~~~~~i~~~a~~~~~~~~~id~d~~~~~~a~~~---Gvp~~~irs~~ev~------~~ 481 (517) T protein:vir:10 418 G-----IEALGRMAELD--KLGTFNGYVSMTAQWPEPLQQAIKWPDFTDWVQGQI---SANFPFFKTQDELN------AE 481 (517) T ss_pred c-----HHHHHHHHHHH--HHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHHh---CCChhhcCCHHHHH------HH Confidence 2 22232222221 122333333221 12233456777777777642 22221111111111 11 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|NC_019408. 520 RMAREADFTQQKIDIQERSVAVQEGHAEVAHAAGSTSISGSRKLGDPEQAKPAVADQATIDNAKKQTANAAKVA 593 (612) Q Consensus 520 r~~~e~e~~~q~~e~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~e~q~k~~~~eq~~~~~~~k~~~~~a~~~ 593 (612) |+++ +++ ++....+.++.+.- ...... .|+..+ -.. 
T Consensus 482 ~~~~---~~~----~~~~~~~~~ag~~~-----~~~~~~--------~~~~~~------------------~~~ 517 (517) T protein:vir:10 482 AQAQ---QEQ----EATKYAAEQAGKAI-----PDMVKN--------GQINPQ------------------GGQ 517 (517) T ss_pred HHHH---HHH----HHHHHHHHHHHHHH-----HHHHhC--------CCCCCC------------------CCC Confidence 1000 000 00000001111000 000000 000000 000 No 122 >protein:vir:7017 Length: 515 # NCBI annotation: head portal protein # Family: family:all:481 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853590;genbank:gi:31711672;genbank:GeneID:1481798 Probab=94.65 E-value=0.0038 Score=33.82 Aligned_cols=466 Identities=12% Similarity=0.069 Sum_probs=181.3 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcC-hHHHHhcccccCCCC-CCCCHHHHHHHHhhccCCchHHHHHHHhh----chhhc-C Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAG-QREIKRKAEAYLPAM-KGADGDDYAIYLQRATFFNMLAQTRDGMT----GMVFR-R 73 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G-~~~vr~~g~~YLPk~-~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~----G~vf~-k 73 (612) |+.-=+-.....+|+.++.--.- ...|+....-.||-. +..++.. +.++ .|-+.-.+.++.++ +.+|. . T Consensus 6 ~~~~~~~~~l~~r~~~Lk~~R~~~e~~w~e~~~~tlP~~~~~~~~~~---~~~~-~~dstg~~a~~~LAa~l~~~ltpp~ 81 (515) T protein:vir:70 6 LEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGDNE---TSQN-GWQGVGAQATNHLANKLAQVLFPAQ 81 (515) T ss_pred hhhcCCHHHHHHHHHHHHHhhhHHHHHHHHHHHHhcccccCCCCCcc---cccc-cccchHHHHHHHHHHHHHHhhcCCC Confidence 33322333333333333211111 222232222233311 1111111 1111 33344444444443 33333 1 Q ss_pred Cceee-cCC--------------HHHHHHHhccC------CCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccC Q lcl|NC_019408. 74 DPIVK-NLP--------------PKFKDAVRRFA------KDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVAT 132 (612) Q Consensus 74 ~p~~~-~~p--------------~~l~~~~~d~D------~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~r 132 (612) .|=+. .++ ..++.|++.|. ...++.+.-+-.++.+.+.+|-+.+++|-+. 
T Consensus 82 ~~WF~l~~~d~~~~~l~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~d~~~--------- 152 (515) T protein:vir:70 82 RSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKPSKG--------- 152 (515) T ss_pred CcccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHHhHCeEEEEEeCCC--------- Confidence 11110 111 12444444432 3356788888889999999999999998431 Q ss_pred ceEEEechhhhhcchhhhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccc Q lcl|NC_019408. 133 SFAVGYSAENILDWDEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGY 212 (612) Q Consensus 133 Py~~~~~ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~ 212 (612) + +..|+-.++. -..|+...+.-|+.++.+.. +.-...|+....... ... ..... T Consensus 153 ~-~~~~pl~~y~-----v~~d~~G~v~~i~rr~~~t~---~~l~~~f~~~~~~~~-------------~~~----~~~~~ 206 (515) T protein:vir:70 153 A-MSAVPMHHYV-----VNRDTNGDLMDVILLQEKAL---RTFDPATRMAIEVGM-------------KGK----KCKED 206 (515) T ss_pred C-eEEEEcCeEE-----EeeCCCcCeeEEEeeeeccH---HHHHHhhhhhhhhhh-------------hhh----hcCCC Confidence 2 3455544422 12345555555555554322 111122221110000 000 00001 Q ss_pred cceeeeeeeeccccccccccccceeEEEEEeeCCCceecceeeeccCC---ccccceeEEEeecCCCCCCcCcCc----h Q lcl|NC_019408. 213 SYITVYRELKLEEIEWPSGEVKLAYVQYLYEEDPESRPIARIVPTVRG---EPLDFIPFKFFGASGNTADVEKPP----L 285 (612) Q Consensus 213 ~~~~~~R~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g---~~l~~IP~v~~~~~~~~~~~~~pP----L 285 (612) ..+++|-..... ++ + .+.+|.+..+. .+...+| +.+++||+.|.-..+..+ |..| | T Consensus 207 ~~v~i~~~v~~~----~~----~--~~~~~~e~d~~-----~~~~es~y~~~e~P~~~~Rw~~~~ge~Y--Grgp~~~~l 269 (515) T protein:vir:70 207 DNVKLYTHAQYA----GE----G--FWKINQSADDI-----PVGKESRIKSEKLPFIPLTWKRSYGEDW--GRPLAEDYS 269 (515) T ss_pred CceEEEEEEEec----CC----C--ceEEEEecCce-----eeccccccccccCCceeeeeeecCCCCc--ccchHHHhh Confidence 122333221111 01 1 12333333221 1122334 346777777776655555 3444 5 Q ss_pred HHHHHHHHHHHhhhHHHHHHHHHhccceeee-ecCCCCCCceEEEeccccccCCCCCceeEEecC-chhHHHHHHHHHHH Q lcl|NC_019408. 
286 LDICDLNLSHYRTYAELEYGRLFTALPVYYA-PGTDSEGTGEYHIGPNMVWEVPQGSEPGILEYT-GQGLKALETALNDK 363 (612) Q Consensus 286 ldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i-~G~~~~~~~~l~iG~~~~~~lp~~~~~~~lE~~-g~~l~~~~~~l~~~ 363 (612) -|+..||.- +.+-+..+ +.+.-|.+.+ .... .....+.-|.++.+.-...++.+.++.. +..+..+...|+++ T Consensus 270 ~D~k~L~~l---~~~~l~~~-~~a~~p~~lv~~~g~-~~~~~l~~~~~g~iv~g~~~~v~~~~~~~~~d~~~~~~~i~~~ 344 (515) T protein:vir:70 270 GDLFVIQFL---SEAMARGA-ALMADIKYLIRPGSQ-TDVDHFVNSGTGEVITGVAEDIHIVQLGKYADLTPISAVLEVY 344 (515) T ss_pred HHHHHHHHH---HHHHHHHH-HHhcCCCeeeCcccc-cchhhccccCCceeecCCcccceeeecCcccchhHHHHHHHHH Confidence 577777742 22334444 4444444444 3211 1123355565544432233456666644 34588889999999 Q ss_pred HHHHHHHHH--HhhhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-cCCcCCCCcceEEEeeccc Q lcl|NC_019408. 364 ERQIAAIGG--RMMPGASKSVSESNNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMW-RDVPLADTENLRYEVNTDF 440 (612) Q Consensus 364 e~qm~~lGa--~ll~~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w-~g~~~~~~~~~~v~ln~dF 440 (612) +..++.+-. .+..... -.-||+....+...-...|.-+-.++.+=+-.=|-.+++- .+...+ .+.+ +.++ T Consensus 345 ~~rI~~af~~~~l~~rd~--~rvTAtEV~~r~~E~~~~LGpv~srL~~Ell~Pli~r~~~~~~p~~P-~~~v----~~~~ 417 (515) T protein:vir:70 345 TRRIGVIFMMETMTRRDA--ERVTAVEIQRDALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFT-SELV----DPVI 417 (515) T ss_pred HHHHHHHHhhhhhhccCC--ccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHhhCCCCC-hhhc----ccce Confidence 999977432 1222222 1348888888888888888888888766554433222221 121111 1112 2222 Q ss_pred cccCCCHHHHHHHHHHHHcCCCCHHHHHHHHHh-cCcc---chhhhhHHHHHHhhccccccccchhHHhhhhhhHHHHhH Q lcl|NC_019408. 441 LSTPIGAREMRAIQLMANDGLLPDPVFYEYMRK-AEVI---SSDMTFEEFQALRADENSFINNPDAQARQRGYTNRGQEL 516 (612) Q Consensus 441 ~~~~~d~~~~~al~~~~~~G~is~et~~~~lqr-~~vl---~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~ 516 (612) +..+ .+|..+.....|. .+..++.- ..+. .+-+++++..+.++... +.+...=+.+++ . 
T Consensus 418 -vs~l-----~~L~r~q~~~~i~--~~~q~i~~~~~~~p~~~~~id~d~~~~~~a~~~---g~p~~~~rs~ee------v 480 (515) T protein:vir:70 418 -VTGI-----EALGRMAELDKLA--NFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQI---SAELPFLKSEEE------M 480 (515) T ss_pred -ehhH-----HHHHHHHHHHHHH--HHHHHHHHHhccChhHHhhCCHHHHHHHHHHHh---CCCccccCCHHH------H Confidence 1122 2222222211111 12223221 1111 12234555555554331 222111111111 1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 517 EQSRMAREADFTQQKIDIQERSVAVQEGHAEVAHAAGSTSISGSRKLGDPEQAKPA 572 (612) Q Consensus 517 e~~r~~~e~e~~~q~~e~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~e~q~k~~ 572 (612) ++.|++++..++++ ..+.... ++.... ..+..|.+ T Consensus 481 ~~~r~q~~~~~~~~-------~~~~~~~---------~a~~~~-----~~~~~~~~ 515 (515) T protein:vir:70 481 QQEMAQQAQAQQEA-------MLNEGVA---------KAVPGV-----IQQEMKEG 515 (515) T ss_pred HHHHHHHHHHHHHH-------HHHHhhh---------hhcccc-----hhhhhccC Confidence 11111111000000 0000111 000000 00001100 No 123 >protein:vir:78696 Length: 542 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285446;genbank:gi:148724480;genbank:GeneID:5220167 Probab=94.64 E-value=0.0039 Score=33.80 Aligned_cols=482 Identities=12% Similarity=0.094 Sum_probs=181.3 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhh----chhhc-CCc Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMT----GMVFR-RDP 75 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~----G~vf~-k~p 75 (612) |-.+ -..+.++|+-|.+.+ ||..-..+...=..++. -.|-+.-.+.++.++ +.+|. 
..| T Consensus 12 l~~~--R~~~e~~w~e~~~y~-------------lP~~~~~~~~~~~~~~~-~~~dstg~~a~~~Laa~l~~~ltpp~~~ 75 (542) T protein:vir:78 12 MRAD--REDFLDMARRCAALT-------------LPYLLTEDGHASGGRLQ-QPYQSLGSKGVNALSSKLMLSLFPIQTS 75 (542) T ss_pred HHHH--hhHHHHHHHHHHHHh-------------ccccCCCCCCccccccc-ccccchHHHHHHHHHHHHHHhhcCCCCc Confidence 1111 122334444444433 33321111100011111 234444445555444 33333 122 Q ss_pred eee-cCC---------------HHHHHHHhcc------CCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCc Q lcl|NC_019408. 76 IVK-NLP---------------PKFKDAVRRF------AKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATS 133 (612) Q Consensus 76 ~~~-~~p---------------~~l~~~~~d~------D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rP 133 (612) =+. .++ ..++.|++.| -..-++.+.-+-.++.+.+.+|-+.+++|-++ T Consensus 76 WF~l~~~d~~l~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~---------- 145 (542) T protein:vir:78 76 FFKLQINDAEIASVPELTPEVRSEIDMNLSKMEKMVMQQIAESSDRVQLTAAMKHLIVTGNVLVFAGKKT---------- 145 (542) T ss_pred cccccCCHHHHHhhccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEecCCC---------- Confidence 111 011 1244444432 22345677778888999999999999988542 Q ss_pred eEEEechhhhhcchhhhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeeccccccccc Q lcl|NC_019408. 134 FAVGYSAENILDWDEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYS 213 (612) Q Consensus 134 y~~~~~ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~ 213 (612) +..|+-.++. -..|+...+.-|+.++.+.- ..-...|+........ +......++.. T Consensus 146 -~~~~pl~~y~-----v~~d~~G~vd~v~r~~~~t~---~ql~~~fg~~~l~~~~--------------~~~~~~~~~~~ 202 (542) T protein:vir:78 146 -LKVYPLDRYV-----IERDGDGNVIEIITRELVDR---SLLPAEFQKQSLLEGK--------------DSNAVGEDGPK 202 (542) T ss_pred -ceEEecceeE-----EeeCCCCCeEEEeeeeecCH---HHHHHhhccccCchHH--------------HhhccccCCCe Confidence 3445544421 12244444444444443221 1222223221111100 00000011100 Q ss_pred ceeeeeeeeccccc--cccccccceeEEEEEeeCCCceecceeee---ccCC-ccccceeEEEeecCCCCCCcCcCc--- Q lcl|NC_019408. 
214 YITVYRELKLEEIE--WPSGEVKLAYVQYLYEEDPESRPIARIVP---TVRG-EPLDFIPFKFFGASGNTADVEKPP--- 284 (612) Q Consensus 214 ~~~~~R~~~~~~~~--~~~g~~~~~~~~~~~~~~~~~~~~~~~~p---~~~g-~~l~~IP~v~~~~~~~~~~~~~pP--- 284 (612) ..-++.+......+ .........+ ++|.+-.+. .++ ..+| ..+++||+.|.-..+..+ |..| T Consensus 203 ~~v~~~v~pr~~~~~~~~~~~~~~~~--s~~~e~~g~-----~v~~~~~e~g~~~~P~i~~Rw~~~~ge~Y--Grgp~~~ 273 (542) T protein:vir:78 203 FGVAQGKGGRNDAEVFTCCKLVDGQH--RWHQECDGK-----EIKGSRSSSPLKHSPWLPLRFNVVDGESY--GRGRVEE 273 (542) T ss_pred EEEEEEeecccCCccccccccCCCeE--EEEEEeccc-----cccccccccccccCCceeeeeeecCCCcc--ccchHHH Confidence 00011111000000 0000011112 222221111 111 1112 236777777776655555 4444 Q ss_pred -hHHHHHHHHHHHhhhHHHHHHHHHhccceeee-ecCCCCCCceEEEeccccccCCCCCceeEEecC-chhHHHHHHHHH Q lcl|NC_019408. 285 -LLDICDLNLSHYRTYAELEYGRLFTALPVYYA-PGTDSEGTGEYHIGPNMVWEVPQGSEPGILEYT-GQGLKALETALN 361 (612) Q Consensus 285 -LldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i-~G~~~~~~~~l~iG~~~~~~lp~~~~~~~lE~~-g~~l~~~~~~l~ 361 (612) |-|+..||. -+.+-+. ..+.+.-|.+.+ ...... ...+.-|.++.+.-...++.+.++.. +..+..+.+.|+ T Consensus 274 ~l~D~k~L~~---l~~~~l~-~~~~a~~pp~lv~~~g~~~-~~~~~~~~~g~iv~g~~~~v~~~~~~~~~~~~~~~~~i~ 348 (542) T protein:vir:78 274 FFGDLSSLDA---LTRSLIE-GSAAAAKVVFMVSPSATTK-PQSLARAGTGAIIQGRAEDVSVVQANKGADFRTVQEMIR 348 (542) T ss_pred HHHHHHHHHH---HHHHHHH-HHHHHhcCceeeccccccc-hhhcccCCCceeecCCccceeeeecccccchhHHHHHHH Confidence 447777775 2223344 444444454443 321111 22333343444433334556666633 556888999999 Q ss_pred HHHHHHHHHHHHhhhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHcCC-cCCCCcceEEE Q lcl|NC_019408. 362 DKERQIAAIGGRMMPGASKSVSESNNQTVLREANEQSLLLNIIQACESGMT-----DVVRWWLMWRDV-PLADTENLRYE 435 (612) Q Consensus 362 ~~e~qm~~lGa~ll~~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~-----~~l~~~a~w~g~-~~~~~~~~~v~ 435 (612) +++..|+.+- |+........-||++...+...-...|..+-.++.+-+- ++|.++.+ .|. 
+....+-+++ T Consensus 349 ~~~~rI~~aF--l~~~~~d~~rvTAtEV~~r~~E~~~~LG~v~~rl~~E~L~Pli~R~~~il~r-~g~lP~~p~~lv~~- 424 (542) T protein:vir:78 349 DLSQRISDAF--LILNVRQSERTTATEVREVQMELDRQLSGIYGSLTVELLTPYLNRKLHLMQR-SKQLPSLPKGLVMP- 424 (542) T ss_pred HHHHHHHHHh--cccccCCcccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh-cCCCCCCchhceee- Confidence 9999997642 322222233358899888888888888888888755333 34444444 232 1111222233 Q ss_pred eeccccccCCCHHHHHHHHHHHHcCCCCHHHHHHHHHh-c--CccchhhhhHHHHHHhhccccccccch-hHHhhhhhhH Q lcl|NC_019408. 436 VNTDFLSTPIGAREMRAIQLMANDGLLPDPVFYEYMRK-A--EVISSDMTFEEFQALRADENSFINNPD-AQARQRGYTN 511 (612) Q Consensus 436 ln~dF~~~~~d~~~~~al~~~~~~G~is~et~~~~lqr-~--~vl~~~~~~eee~~ria~e~~~~~~~~-~~~~~~~e~~ 511 (612) +|.. + +.++..+.....| ..|...+-. . ..+.+.++++...+.+++- ++-|. ..-+.+++.+ T Consensus 425 ---~~~s-~-----La~~~r~~~~~~l--~~~~~~i~~~~~p~~l~~~id~d~~~~~~a~~---~Gvp~~~i~~s~e~~~ 490 (542) T protein:vir:78 425 ---TVVA-G-----LGGVGRGEDRAAL--IEFMQTVGQAMGPEALQQFIDPTEFLKRLAAA---SGIDTLNLVKSPETMA 490 (542) T ss_pred ---eeec-h-----HHHHHHHHHHHHH--HHHHHHHHHhcCChhHHhcCCHHHHHHHHHHH---cCCCHhhccCCHHHHH Confidence 3322 2 2222222221111 122222211 1 1122335666666666553 22221 1111111100 Q ss_pred HHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_019408. 512 RGQELEQSRMAREADFTQQKIDIQERSVAVQEGHAEVAHAAGSTSISGSRKLGDPEQAKPAVADQATIDNAKKQTANAAK 591 (612) Q Consensus 512 r~~~~e~~r~~~e~e~~~q~~e~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~e~q~k~~~~eq~~~~~~~k~~~~~a~ 591 (612) + +++...+++.++ +. .+|.... +. ..+. ..-.++..|. T Consensus 491 ------~---------~~~q~q~~~~~~------------------al------~~~a~~~-a~-~~~~-~~~~~~~~a~ 528 (542) T protein:vir:78 491 ------N---------EAQQAQQQQMTA------------------SL------MGQAGQL-AK-SPIG-EKMMQQINAP 528 (542) T ss_pred ------H---------HHHHHHHHHHHH------------------HH------HHhhhhc-cc-cccc-cchhhhcCCC Confidence 0 000000000000 00 0000000 00 0000 0111111111 Q ss_pred ccCCCchhhcCCCCC Q lcl|NC_019408. 
592 VAAQPPAPAAPGAPP 606 (612) Q Consensus 592 ~~~~~~~~~~~~~~~ 606 (612) -...|+++...-. - T Consensus 529 ~~~~~~~~~~~~~-~ 542 (542) T protein:vir:78 529 GQEAPAGPQTGED-L 542 (542) T ss_pred CcCCCCCCccccc-C Confidence 1111111111110 1 No 124 >protein:vir:107880 Length: 491 # NCBI annotation: gp29 # Family: family:all:313 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024702;genbank:gi:48696939;genbank:GeneID:2845968 Probab=94.45 E-value=0.0044 Score=33.52 Aligned_cols=454 Identities=12% Similarity=0.049 Sum_probs=186.2 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCC------CCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCC Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMK------GADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRD 74 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~------~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~ 74 (612) +|..+.-.. +.=..|...-.|..-.. .| .++|+.. +-+...|+..+. -.++...++.....|...+ T Consensus 12 p~~~~~~~~--~~~~~ia~~~~~~~~~~-~~-~~~~~~~~iLr~~~~~~~~y~~m~~----D~~i~s~l~~Rk~av~~~~ 83 (491) T protein:vir:10 12 FVTFGEPDK--SLSSQIATRARSIDFFA-LG-MYLPNPDPVLKALGKDIRVYRELRA----DAHVGGCVRRRKAAVKALE 83 (491) T ss_pred ccCcccCCh--HHHHHHHhhhccccccc-cc-CCccchHHHHHhcCCCHHHHHHHhh----ChHHHHHHHHHHHHHhCCC Confidence 333222110 00011111111111110 11 1222221 124567777653 5667777777777777777 Q ss_pred ceee--cCCHHHHHH-HhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhcchhhhc Q lcl|NC_019408. 75 PIVK--NLPPKFKDA-VRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILDWDEVVD 151 (612) Q Consensus 75 p~~~--~~p~~l~~~-~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW~~~~~ 151 (612) ..|+ +-++....+ .+.+ .+.+++.++..++ .++-+|.+.+=+ .|+. 
T Consensus 84 w~i~~~~~~~~~~e~v~e~l--~~~~~~~~l~~~l-da~~~G~s~~Ei-------------------------~w~~--- 132 (491) T protein:vir:10 84 WGLDRGKAKSRVAKSIADVF--ADLDLSRIVTEML-DAVLYGYQPMEI-------------------------TWGK--- 132 (491) T ss_pred cEEecCCCCHHHHHHHHHHH--hcCCHHHHHHHHH-HhhhhcceeEEE-------------------------EEee--- Confidence 7773 112233333 3333 2457999999987 577788664432 2532 Q ss_pred cCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccc Q lcl|NC_019408. 152 MGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSG 231 (612) Q Consensus 152 v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g 231 (612) .+|...+..+..+.. .-|.... ++ + -+||. T Consensus 133 ~~g~~~~~~l~~r~~----------~~f~~d~----------~~---------------~----l~~~~----------- 162 (491) T protein:vir:10 133 VGNYIVPIDVVGKPA----------DWFVYDP----------EN---------------Q----LRFRS----------- 162 (491) T ss_pred cCCeeEEEEeeeecc----------cceeecc----------CC---------------c----eEEec----------- Confidence 244433433333321 0000000 00 0 01110 Q ss_pred cccceeEEEEEeeCCCceecceeeeccCCccccceeEEE-eecCCCCCCcCcCchHHHHHHHH-HHHhhhHHHHHHHHHh Q lcl|NC_019408. 232 EVKLAYVQYLYEEDPESRPIARIVPTVRGEPLDFIPFKF-FGASGNTADVEKPPLLDICDLNL-SHYRTYAELEYGRLFT 309 (612) Q Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~-~~~~~~~~~~~~pPLldLA~lnl-~HY~~~sD~~~~l~~~ 309 (612) .+. ...|.+|..-=|+. .+....+.-.+.+-|..++..-+ ++| .-.+.-.-+..- T Consensus 163 ------------~~~----------~~~g~~l~~~k~i~~~~~~~~~~p~g~gLl~~~~w~~~fK~~-~~~~w~~f~E~y 219 (491) T protein:vir:10 163 ------------KDH----------WMQGEELPARKFLVPRQEATYLNPYGFPDLSMCFWPTTFKKG-GLKFWVQFTEKY 219 (491) T ss_pred ------------CCC----------CCCcceecCCCEEEEEecCCCCCcccchhHHHHHHHHHHHHH-HHHHHHHHHHHc Confidence 000 00111221111222 22222333334444555544433 333 334566777778 Q ss_pred ccceeeeec---CCCCCCce-----EEEeccccccCCCCCceeEEecCchh--HHHHHHHHHHHHHHHH--HHHHHhhhc Q lcl|NC_019408. 
310 ALPVYYAPG---TDSEGTGE-----YHIGPNMVWEVPQGSEPGILEYTGQG--LKALETALNDKERQIA--AIGGRMMPG 377 (612) Q Consensus 310 ~~P~l~i~G---~~~~~~~~-----l~iG~~~~~~lp~~~~~~~lE~~g~~--l~~~~~~l~~~e~qm~--~lGa~ll~~ 377 (612) |+|+++..- .+++..+. ..||.++++.+|.|.++.|++..+++ .....+-++-..++|. -+|-.| .. T Consensus 220 G~P~~igky~~~a~~~ek~~l~~al~~~~~~a~~viP~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~iLGqtl-Tt 298 (491) T protein:vir:10 220 GSPMLVGKHPRSASDGEKNLLLDCLEDMVQDAVAVVPDDSSIEIKEAAGKTGSADVYERLLHFCRGEVSIALLGQNQ-TT 298 (491) T ss_pred CCCeEEEecCCCCCHHHHHHHHHHHHHHhcCcEEEecCCceeEEEecCCCCCChhHHHHHHHHHHHHHHHHHhhhhc-cc Confidence 999988862 22211111 34899999999999999999987643 3445555665566653 355443 33 Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCcceEEEeeccccccCCCHHHHHHHHHHH Q lcl|NC_019408. 378 ASKSVSESNNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLADTENLRYEVNTDFLSTPIGAREMRAIQLMA 457 (612) Q Consensus 378 ~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~~~~~v~ln~dF~~~~~d~~~~~al~~~~ 457 (612) .. . .|--.......-...++.+-+..+++.+++.+.+++.|.+-+ ....+|.+.. ....+...+..+..+. T Consensus 299 ~~-~--gs~a~~~vh~~v~~di~~~D~~~i~~tln~li~~l~~~N~~~---~~~p~f~~~~---~~e~~~~~a~~~~~L~ 369 (491) T protein:vir:10 299 EA-T--STRASAQAGLEVTDDIRDGDKAVVSEAMNMLIRWICDLNFDG---ADRPVFDMWE---QEQVDEIQAGRDQKLT 369 (491) T ss_pred Cc-c--cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC---CCcceEEecC---cCchhHHHHHHHHHHH Confidence 22 1 222234455555667788899999999999999999987532 2345555532 2222333456667778 Q ss_pred HcCC-CCHHHHHHHHHhcCccchhhhhHHHHHHhhccccccccchhHHhhhhhhHHHHhHHHHHHHHHH-HHHHHHHHHH Q lcl|NC_019408. 458 NDGL-LPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFINNPDAQARQRGYTNRGQELEQSRMAREA-DFTQQKIDIQ 535 (612) Q Consensus 458 ~~G~-is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~e~~r~~~e~-e~~~q~~e~~ 535 (612) +.|. ++...+ .++.|+..+..+.. -+....+....+...+.... .-+.+.+.--.+... 
+.+..-..- T Consensus 370 ~~G~~i~~~~i---~e~~Gip~~~~~~~----~~~~~~~~~~~~~~~~~~~~--~~~~~~d~~~~~~~~~~~~~~~~~~- 439 (491) T protein:vir:10 370 QAGARFTPAYF---KRAYNLQDGDLDER----PLPVSAVDTVGAASFAEFEA--PDQDALDAALNTLSARDLNADAQAL- 439 (491) T ss_pred hCCCcCCHHHH---HHHhCCCCCCcCcc----ccccCCCCCcccccccccCC--CCCCchHHHHHHHHHHHHHHHHHHH- Confidence 8887 665433 34557754432211 11111111111111110000 000111100000000 000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhccccCC Q lcl|NC_019408. 536 ERSVAVQEGHAEVAHAAGSTSISGSRKLGDPE-QAKPAVADQATIDNAKKQTANAAKVAAQ 595 (612) Q Consensus 536 ~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~e~-q~k~~~~eq~~~~~~~k~~~~~a~~~~~ 595 (612) .+.=++.++ +...=++.+.+..+-- ..-.++.++ .-+.-=..+.-.-+..+ T Consensus 440 -----~~~i~~~l~--~~~s~~e~~~~L~~l~~~~d~~~l~~--~l~~a~~~A~l~G~~~a 491 (491) T protein:vir:10 440 -----VAPLLKRIA--NGASADELLGMLAELYPSLDADALQE--RLARAIFVANLWGRLHA 491 (491) T ss_pred -----HHHHHHHHH--hcCCHHHHHHHHHHHhhcCCHHHHHH--HHHHHHHHHHHhhhccC Confidence 000000000 0011111111111000 000001010 00000011122222222 No 125 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=94.36 E-value=0.0046 Score=33.38 Aligned_cols=430 Identities=11% Similarity=0.031 Sum_probs=166.7 Q ss_pred CCcHHHHHHHH-----HHHHHHHHhcChHHHHhcccccCCCCCCCCH--HHHHHHH-hhc----cCCchHHHHHHHhhch Q lcl|NC_019408. 2 VTHPEYQYWRP-----EWTKLRDVMAGQREIKRKAEAYLPAMKGADG--DDYAIYL-QRA----TFFNMLAQTRDGMTGM 69 (612) Q Consensus 2 ~~hP~y~~~~~-----~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~--~~Y~~rl-~rA----~~~n~~~~tv~~~~G~ 69 (612) |..|...+.+. .-..+...++|.......-..+.|..-.-+. ..+..+| .|| .=.++.+..|+.++.. 
T Consensus 1 ~~~p~~~~~~~~~~~~~~~~~~~y~~~a~~~~~~~~~w~p~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~n 80 (533) T protein:vir:34 1 MKTPTIPTLLGPDGMTSLREYAGYHGGGSGFGGQLRSWNPPSESVDAALLPNFTRGNARADDLVRNNGYAANAIQLHQDH 80 (533) T ss_pred CCCchhhhhhcccccchHHHHHhhhhccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHH Confidence 77775553222 1112222333322111112234443321111 1122222 111 2234555555555444 Q ss_pred hhcCCceeecCC----------------HHHHH----HHhc----cCCCCC-CHHHHHHHHHHHHHHhCCeEEEEecCcc Q lcl|NC_019408. 70 VFRRDPIVKNLP----------------PKFKD----AVRR----FAKDGS-SHATFAKAVLSEQAGVGRFGVLVDVVDN 124 (612) Q Consensus 70 vf~k~p~~~~~p----------------~~l~~----~~~d----~D~~G~-~l~~f~~~~~~~~l~~Gr~~vlVD~p~a 124 (612) |=-.-+++...| ..++. |+++ ||-.|. +++++.+.+++..+..|=|++..-+-.. T Consensus 81 vVG~Gi~~~~~p~~~~lg~~~~~~~~~~~~ie~~w~~w~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~~~ 160 (533) T protein:vir:34 81 IVGSFFRLSHRPSWRYLGIGEEEARAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIREGVAMHAFNGELFVQATWDTS 160 (533) T ss_pred hhCCCceeeeccchhhcCCChhHHHHHHHHHHHHHHHhhcCccceeccccccCHHHHHHHHHHHHHhCCceEEEeeeccC Confidence 433222221111 23333 3333 566665 9999999999999999999888655321 Q ss_pred hhhhhccCce---EEEechhhhhcchhhhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccc Q lcl|NC_019408. 125 PRKGAVATSF---AVGYSAENILDWDEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPM 201 (612) Q Consensus 125 ~~~~~~~rPy---~~~~~ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~ 201 (612) . ..|| +-+|.|+.|=+.. .. -+|.. +.+-. .. ...|.+ T Consensus 161 ~-----g~~~~~~lq~ie~d~l~~~~-~~-~~~~~---------------------------i~~GI--e~--d~~Gr~- 201 (533) T protein:vir:34 161 S-----SRLFRTQFRMVSPKRISNPN-NT-GDSRN---------------------------CRAGV--QI--NDSGAA- 201 (533) T ss_pred C-----CCccceEEEEechhhcCCCC-CC-CCCCc---------------------------eEeee--EE--CCCCCe- Confidence 1 1222 3445554443321 00 01110 00000 00 000000 Q ss_pred eeecccccccccceeeeeeeeccccccccccccceeEEEEEeeCCCce--ecceeeeccCCcccccee---EEEe-ecCC Q lcl|NC_019408. 
202 VRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAYVQYLYEEDPESR--PIARIVPTVRGEPLDFIP---FKFF-GASG 275 (612) Q Consensus 202 ~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~--~~~~~~p~~~g~~l~~IP---~v~~-~~~~ 275 (612) .-|+ ++.....+. ...+.+|. ...|| ++.+ ...+ T Consensus 202 --------------~aY~---------------------i~~~~~~~~~~~~~~~~~~-----~~~v~a~~VlH~f~~~r 241 (533) T protein:vir:34 202 --------------LGYY---------------------VSEDGYPGWMPQKWTWIPR-----ELPGGRASFIHVFEPVE 241 (533) T ss_pred --------------EEEE---------------------EeecCCCCccccccceeee-----eeccChhHeeeeccccC Confidence 1111 111111000 01111111 11122 2222 2233 Q ss_pred CCCCcCcCchHHHHHH--HHHHHhhhHHHHHHHHHhccceeeee-cCCC------------C-----------------C Q lcl|NC_019408. 276 NTADVEKPPLLDICDL--NLSHYRTYAELEYGRLFTALPVYYAP-GTDS------------E-----------------G 323 (612) Q Consensus 276 ~~~~~~~pPLldLA~l--nl~HY~~~sD~~~~l~~~~~P~l~i~-G~~~------------~-----------------~ 323 (612) .+-.-|.|.|...... .+..|.. |.+..+.--+++ +.+|+ .... + . T Consensus 242 ~gQ~RGis~lapvl~~l~~l~~y~d-ael~~a~i~A~~-a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (533) T protein:vir:34 242 DGQTRGANVFYSVMEQMKMLDTLQN-TQLQSAIVKAMY-AATIESELDTQSAMDFILGANSQEQRERLTGWIGEIAAYYA 319 (533) T ss_pred CCcccCCchHHHHHHHHHHHHHHHH-HHHHHHHHhhhh-eeeeecCCCcccccccccCCCcccccccccccchhhhhccC Confidence 4444466766544221 3334432 334433333333 44443 1110 0 0 Q ss_pred CceEEEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHH-HHHH--Hhhhcc-ccchhHHHHHHHHHHHHHHHH Q lcl|NC_019408. 324 TGEYHIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIA-AIGG--RMMPGA-SKSVSESNNQTVLREANEQSL 399 (612) Q Consensus 324 ~~~l~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~-~lGa--~ll~~~-~~~~~esa~~~~~~~~~~~s~ 399 (612) ...+.+++++++.|++|.+++|+.++..+-. ....+..+...+. .+|. .+|... ++..=.|+.+..+++-..... 
T Consensus 320 ~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~-~~~f~~~~lr~iAaglGi~ye~lt~D~s~~nYSS~R~~~~e~~r~~~~ 398 (533) T protein:vir:34 320 AAPVRLGGAKVPHLMPGDSLNLQTAQDTDNG-YSVFEQSLLRYIAAGLGVSYEQLSRNYAQMSYSTARASANESWAYFMG 398 (533) T ss_pred cceeeccCceeeecCCCCeeeecCCCCCCCC-HHHHHHHHHHHHHhhcCCCHHHHhhhcccccHHHHHHHHHHHHHHHHH Confidence 0125689999999999999999998743211 2333334433332 2221 122221 121122445555555555444 Q ss_pred HHH-HHHHHHHHHHHHHHHHHHHcCCcCCCCcceE--EE------eeccccccC---CCHH-HHHHHHHHHHcCCCCHHH Q lcl|NC_019408. 400 LLN-IIQACESGMTDVVRWWLMWRDVPLADTENLR--YE------VNTDFLSTP---IGAR-EMRAIQLMANDGLLPDPV 466 (612) Q Consensus 400 L~~-~a~~~~~a~~~~l~~~a~w~g~~~~~~~~~~--v~------ln~dF~~~~---~d~~-~~~al~~~~~~G~is~et 466 (612) ++. ++..++.-+-..+--.|...|. +.-+..+. +. ++-+|.... +|+. ++++.+.++.+|..|++. T Consensus 399 ~q~~~~~~~~~pi~~~wl~~ail~G~-i~~p~~~~~~~~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~~~ 477 (533) T protein:vir:34 399 RRKFVASRQASQMFLCWLEEAIVRRV-VTLPSKARFSFQEARSAWGNCDWIGSGRMAIDGLKEVQEAVMLIEAGLSTYEK 477 (533) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHcCc-ccCCCccCCCchhhHHhhhceeeccCCccccChHHHHHHHHHHHHcCCCCHHH Confidence 443 2222222221111112222332 11011000 00 011222222 2443 899999999999999987 Q ss_pred HHHHHHhcCccchhhhhHHHHHHhhccccc-----cccchhHHhhhh---hhHHHHhHHHHHHH Q lcl|NC_019408. 467 FYEYMRKAEVISSDMTFEEFQALRADENSF-----INNPDAQARQRG---YTNRGQELEQSRMA 522 (612) Q Consensus 467 ~~~~lqr~~vl~~~~~~eee~~ria~e~~~-----~~~~~~~~~~~~---e~~r~~~~e~~r~~ 522 (612) ...+ +|. |+++..+.++.+... +..+..-+.... 
.....+..+..+.+ T Consensus 478 ~~a~---~G~-----D~~ev~~q~a~e~~~~~~~gl~~~~~~~~~~~s~~~~~~~~~~~~~~~~ 533 (533) T protein:vir:34 478 ECAK---RGD-----DYQEIFAQQVRETMERRAAGLKPPAWAAAAFESGLRQSTEEEKSDSRAA 533 (533) T ss_pred HHHH---cCC-----CHHHHHHHHHHHHHHHHhcCCCCCCCCCcCccCCCCCCCCCCcccCCCC Confidence 7554 343 455555444433211 100000000000 00000000011111 No 126 >protein:vir:1785 Length: 555 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570351;genbank:gi:18640510;genbank:GeneID:932723 Probab=94.18 E-value=0.0051 Score=33.13 Aligned_cols=500 Identities=11% Similarity=0.064 Sum_probs=179.6 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhch----hhc-CCc Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGM----VFR-RDP 75 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~----vf~-k~p 75 (612) |-.+. ..+.++|+-|.+.+ ||..-..+...=. ....-.|-+...+.++.++.. +|. ..| T Consensus 12 l~~~R--~~~e~~w~e~~~y~-------------lP~~~~~~~~~~~-~~~~~~~dst~~~a~~~Laa~l~~~ltpp~~~ 75 (555) T protein:vir:17 12 LRADR--EDYLDSGRQSARLT-------------LPYILTDEGHVQG-GYLPTPWQSVGSKGVNVLASKLMLSLFPVNTS 75 (555) T ss_pred HHHHh--hHHHHHHHHHHHHh-------------cccccCCCCCccc-ccccccccccHHHHHHHHHHHHHHhhcCCCCc Confidence 11111 12344455444443 3432111110000 111123444445555554433 333 112 Q ss_pred eee-cCC--------------HHHHHHHhccCCC------CCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCce Q lcl|NC_019408. 76 IVK-NLP--------------PKFKDAVRRFAKD------GSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSF 134 (612) Q Consensus 76 ~~~-~~p--------------~~l~~~~~d~D~~------G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy 134 (612) =+. .++ ..++.+++.|... .++++.-+-.++.+.+.+|-+-+++|-.. 
T Consensus 76 WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~~~~----------- 144 (555) T protein:vir:17 76 FFKLQINDAEIDNLGMDEQARSEIDLSLSRIERIVTQDIAESSDRVHLEMAMKHLIVTGNALLYQGKKN----------- 144 (555) T ss_pred ccccccCHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEecCCc----------- Confidence 111 011 1144433333222 44688888888999999999888887432 Q ss_pred EEEechhhhhcchhhhccCCccceeEEEEEEEeeccccccCCCccccccee-eeeeEeeeccccccccee--eccccccc Q lcl|NC_019408. 135 AVGYSAENILDWDEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQAR-KARAAALASGSASSPMVR--QTARTLGG 211 (612) Q Consensus 135 ~~~~~ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~-q~r~l~l~~g~~~~~~~~--~~~~~~~g 211 (612) +..|+-.++. -..|+...+.-|+.++.+.- ..-...|+..... ..+.. ....+...+.+. .......+ T Consensus 145 ~~~~pl~~y~-----v~~d~~G~vd~v~rk~~~t~---~ql~~~fg~~~l~~~~~~~-~~~~~d~~~~~~~~~~~~~~~~ 215 (555) T protein:vir:17 145 LKLYPLDRFV-----VSRDGEGNVMEIVTEEQIDR---SLLPEEFQKVGGLEGAPDS-NAVGEDGPKMGVTAPGGRDKGK 215 (555) T ss_pred eeEEEcCeEE-----EeeCCCcCeeEEEeeeeecH---HHHHHHhhhccccchhhhh-hhccccchhhhhhhhcccccCC Confidence 3344433211 12234444444444443221 1111122111000 00000 000000000000 00000001 Q ss_pred ccceeeeeeeeccccccccccccceeEEEEEeeCCCceecceeeeccCC-ccccceeEEEeecCCCCCCcCcCc----hH Q lcl|NC_019408. 212 YSYITVYRELKLEEIEWPSGEVKLAYVQYLYEEDPESRPIARIVPTVRG-EPLDFIPFKFFGASGNTADVEKPP----LL 286 (612) Q Consensus 212 ~~~~~~~R~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g-~~l~~IP~v~~~~~~~~~~~~~pP----Ll 286 (612) ...+++|-.... +. +. +.+|.+..+.. +... 
...+| ..+++||+.|.-..+..+ |..| |- T Consensus 216 ~~~~~v~t~~~~-----~~----~~--~~~~~e~~~~~-v~~~-l~e~g~~e~P~i~~Rw~~~~ge~Y--Grgp~~~~l~ 280 (555) T protein:vir:17 216 SNDALVYTYVCR-----KD----GQ--VKWHQECDGKV-IPGS-NSSAPYTHNPWIPLRFNIVDGEAY--GRGRVEEFMG 280 (555) T ss_pred CcceeEeecccc-----cC----Ce--eEEEEecCcee-cccc-ccccCcccCCeeeeeeeecCCCcc--ccchHHHHHH Confidence 111222211111 00 11 12222221110 0000 01111 235677777765555555 4444 44 Q ss_pred HHHHHHHHHHhhhHHHHHHHHHhccceeeeecCCCCCCceEEEeccccccCCCCCceeEEecC-chhHHHHHHHHHHHHH Q lcl|NC_019408. 287 DICDLNLSHYRTYAELEYGRLFTALPVYYAPGTDSEGTGEYHIGPNMVWEVPQGSEPGILEYT-GQGLKALETALNDKER 365 (612) Q Consensus 287 dLA~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~~~~~~~~l~iG~~~~~~lp~~~~~~~lE~~-g~~l~~~~~~l~~~e~ 365 (612) |+..||.- +.+-+..+-.....|.++-+..-. ....+.-|+++.+.-...++.+-++.. +..+..+.+.|++++. T Consensus 281 D~k~L~~l---~~~~l~~~~~~~~pp~lv~~~g~~-~~~~l~~~~~g~v~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~ 356 (555) T protein:vir:17 281 DLKSLEAL---SQAMVEGSAASAKVVFMVSPSATT-KPQNLALAANGAIIQGRPDDVSVVQANKAADFRTVLEMIQKLEQ 356 (555) T ss_pred HHHHHHHH---HHHHHHHHHHHhCCceeecccccc-CcceeecCCCceeecCCcccceeeeccccchhhHHHHHHHHHHH Confidence 77777752 333344444444444444332211 223456666666543222345555533 3457888999999999 Q ss_pred HHHHHHHHhhhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHcCC-cCCCCcceEEEeecc Q lcl|NC_019408. 366 QIAAIGGRMMPGASKSVSESNNQTVLREANEQSLLLNIIQACESGMT-----DVVRWWLMWRDV-PLADTENLRYEVNTD 439 (612) Q Consensus 366 qm~~lGa~ll~~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~-----~~l~~~a~w~g~-~~~~~~~~~v~ln~d 439 (612) .++.+-.-+ .......-||++...+...-...|..+-.++.+-+- ++|.++.+ .|+ +....+-+.+++... 
T Consensus 357 ~I~~aFm~~--~~~d~~r~TAtEV~~r~~E~~~~LGpv~~rl~~E~L~Pli~R~~~il~r-~g~lP~~p~~~v~~~i~~~ 433 (555) T protein:vir:17 357 RISDAFLML--QVRQSERTTATEVQATVQELNEQIGGIYSNLTTELLQPYLARKLHLLQK-QRKLPQLPKDLVQPTVVAG 433 (555) T ss_pred HHHHHHhhc--CCCCcccchHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHh-CCCCCCCCHhhhccceeeh Confidence 987653212 222334468999999988888888888888764333 34444444 232 111111222222211 Q ss_pred ccccCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhc-C--ccchhhhhHHHHHHhhccccccccchhHHhhhhhhHHHHhH Q lcl|NC_019408. 440 FLSTPIGAREMRAIQLMANDGLLPDPVFYEYMRKA-E--VISSDMTFEEFQALRADENSFINNPDAQARQRGYTNRGQEL 516 (612) Q Consensus 440 F~~~~~d~~~~~al~~~~~~G~is~et~~~~lqr~-~--vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~ 516 (612) +.++....+ .-+-..|+..+... + -+-+.+++++..+++++--... +...=+.+ ++. T Consensus 434 ----------l~~l~r~~~--~~~l~~~~~~laq~~~~p~~~d~id~d~~~~~~a~~~Gv~--p~~ivrs~------eev 493 (555) T protein:vir:17 434 ----------LWGVGRGQD--KQQLMEFITTLAQTMGPEIAMKYINPTEFIKRLAAAQGID--TLQLINSP------ETM 493 (555) T ss_pred ----------HHHHHHHHH--HHHHHHHHHHHHhhcCchhHhhcCCHHHHHHHHHHHcCCC--hhhhcCCH------HHH Confidence 111111111 00111122222111 1 0112345556666665432110 11111111 111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCCC Q lcl|NC_019408. 517 EQSRMAREADFTQQKIDIQERSVAVQEGHAEVAHAAGSTSISGSRKLGDPEQAKPAVADQATIDNAKKQTANAAKVAAQP 596 (612) Q Consensus 517 e~~r~~~e~e~~~q~~e~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~e~q~k~~~~eq~~~~~~~k~~~~~a~~~~~~ 596 (612) ++.|+++.++ .+ +++. .++..+...... ....-....+.+..+..+=++..+- .. T Consensus 494 ~~~rq~~~~~-------~~-q~~~--------~~qa~~~~~~~~------~~~~~~~~~~~~~~a~~~~~a~~~~---~~ 548 (555) T protein:vir:17 494 KQLGDQQKQD-------MV-QASL--------INQAGQLAKTPM------AEQAMQLIQQQQEGAQDAGAAESET---SS 548 (555) T ss_pred HHHHHHHHHH-------HH-HHHH--------HHHHHHHHhhhh------hhhHHhccccchhhhhHHHHHHhhc---CC Confidence 1111111000 00 0000 000000000000 0000011112222222322111111 11 Q ss_pred chhhcCCC Q lcl|NC_019408. 
597 PAPAAPGA 604 (612) Q Consensus 597 ~~~~~~~~ 604 (612) | +.+-++ T Consensus 549 ~-~~~~~~ 555 (555) T protein:vir:17 549 A-EAQAGA 555 (555) T ss_pred c-ccccCC Confidence 1 111122 No 127 >protein:vir:79063 Length: 491 # NCBI annotation: gp3 # Family: family:all:313 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111203;genbank:gi:134288841;genbank:GeneID:4960737 Probab=93.93 E-value=0.0059 Score=32.79 Aligned_cols=449 Identities=13% Similarity=0.053 Sum_probs=185.7 Q ss_pred CCCcHH----HHHHHHHHHHHHHHhcChHHHHhcccccCCCCC------CCCHHHHHHHHhhccCCchHHHHHHHhhchh Q lcl|NC_019408. 1 MVTHPE----YQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMK------GADGDDYAIYLQRATFFNMLAQTRDGMTGMV 70 (612) Q Consensus 1 ~~~hP~----y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~------~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~v 70 (612) +|.+.. -......+...-|...+ ..++|... +-+-..|+..+. -.++...++.....| T Consensus 12 ~~~~~~~~~~~~~~ia~~~~~~~~~~~--------~~~~p~~~~il~~~~~~~~~y~~m~~----D~~i~s~l~~Rk~av 79 (491) T protein:vir:79 12 FVKFGEPDKSLSSQIATRARSIDFFAL--------GMYLPNPDPVLKALGKDIRVYRELRA----DAHVGGCVRRRKAAV 79 (491) T ss_pred cccccccchhHHHHHhhhccccccccc--------cccCcchhHHHhhccCCHHHHHHHhh----ChHHHHHHHHHHHHH Confidence 554432 11222222211111111 11233221 123456777654 455555555555556 Q ss_pred hcCCceee--cCCHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhcchh Q lcl|NC_019408. 71 FRRDPIVK--NLPPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILDWDE 148 (612) Q Consensus 71 f~k~p~~~--~~p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW~~ 148 (612) ...+..|+ +-++....++.++ ..+.+++.++..++ .++-+|.+.+=+ .|+. 
T Consensus 80 ~~~~w~i~~~~~~~~~a~~i~e~-l~~~~~~~~i~~~l-da~~~G~s~~Ei-------------------------~w~~ 132 (491) T protein:vir:79 80 KALEWGLDRGKAKSRVAKSIADV-FADLDLSRIATEML-DAVLYGYQPMEI-------------------------TWGK 132 (491) T ss_pred hCCCcEEecCCCCHHHHHHHHHH-HhcCCHHHHHHHHH-HhhhhcceeEEE-------------------------EEee Confidence 66666663 1122332332221 23357888888886 477788664433 2532 Q ss_pred hhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeecccccc Q lcl|NC_019408. 149 VVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEW 228 (612) Q Consensus 149 ~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~ 228 (612) .+|...+.-|..+.. .-|.... ++ + -++|.. T Consensus 133 ---~~g~~~~~~l~~r~~----------~~f~~d~----------~~---------------~----l~l~~~------- 163 (491) T protein:vir:79 133 ---VGNYIVPIDVVGKPA----------DWFVYDP----------EN---------------Q----LRFRSK------- 163 (491) T ss_pred ---cCCeeeEEeeeeecc----------cceeecc----------CC---------------c----eEEeec------- Confidence 245444444433321 0000000 00 0 011100 Q ss_pred ccccccceeEEEEEeeCCCceecceeeeccCCccc---cceeEEEeecCCCCCCcCcCchHHHHHHHH-HHHhhhHHHHH Q lcl|NC_019408. 229 PSGEVKLAYVQYLYEEDPESRPIARIVPTVRGEPL---DFIPFKFFGASGNTADVEKPPLLDICDLNL-SHYRTYAELEY 304 (612) Q Consensus 229 ~~g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g~~l---~~IP~v~~~~~~~~~~~~~pPLldLA~lnl-~HY~~~sD~~~ 304 (612) .+ ...|.+| .+|- +.+....+.-.+.+.|..++..-+ ++|- -.+.-. T Consensus 164 ---------------~~-----------~~~g~~lp~~k~i~--~~~~~~~g~p~g~gLl~~~~w~~~fK~~~-~~~w~~ 214 (491) T protein:vir:79 164 ---------------EH-----------WVQGEELPARKFLV--PRQEATYLNPYGFPDLSMCFWPTTFKKGG-LKFWVQ 214 (491) T ss_pred ---------------CC-----------CCCceeecCCCeEE--EEecCCCCCcccchhHHHHHHHHHHHHhh-HHHHHH Confidence 00 0011122 2222 222223333445555666655433 4444 356667 Q ss_pred HHHHhccceeeee---cCCCCCCce-----EEEeccccccCCCCCceeEEecCch--hHHHHHHHHHHHHHHHH--HHHH Q lcl|NC_019408. 
305 GRLFTALPVYYAP---GTDSEGTGE-----YHIGPNMVWEVPQGSEPGILEYTGQ--GLKALETALNDKERQIA--AIGG 372 (612) Q Consensus 305 ~l~~~~~P~l~i~---G~~~~~~~~-----l~iG~~~~~~lp~~~~~~~lE~~g~--~l~~~~~~l~~~e~qm~--~lGa 372 (612) -+..-|+|+++.. |.+++.... ..||+++++.+|.|.++.|++..+. +.....+-++-..++|. -+|- T Consensus 215 f~E~~G~P~~igky~~~a~~~ek~~l~~al~~~~~~a~~viP~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~iLGq 294 (491) T protein:vir:79 215 FTEKYGSPMLVGKHPRSASDAETNLLLDRLEDMVQDAVAVIPDDSSIEIKEAAGKSGSADVYERLLHFCRGEVSIALLGQ 294 (491) T ss_pred HHHHcCCCeEEEecCCCCCHHHHHHHHHHHHHHhcCeEEEecCCceeEEEeccCCCCChhHHHHHHHHHHHHHHHHHhhh Confidence 7777899988885 222221111 3489999999999999999998753 34445555555555553 3554 Q ss_pred HhhhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCcceEEEeeccccccCCCHHHHHH Q lcl|NC_019408. 373 RMMPGASKSVSESNNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLADTENLRYEVNTDFLSTPIGAREMRA 452 (612) Q Consensus 373 ~ll~~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~~~~~v~ln~dF~~~~~d~~~~~a 452 (612) .| ....+ .|--.......-...++.+-+..+++.+++.+.+++.|.+- +...+.|.+.. ....+...... T Consensus 295 tl-Tt~~~---gs~a~~~vh~~v~~~i~~~D~~~i~~tln~li~~l~~~N~~---~~~~p~f~~~e---~ee~~~~~a~~ 364 (491) T protein:vir:79 295 NQ-TTEAT---STRASAQAGLEVTDDIRDGDKAIVVEAMNMLIRWICDLNFD---GAARPVFDMWE---QEQVDEIQAGR 364 (491) T ss_pred hh-ccCcc---cchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC---CCCcceEeecC---cCchhHHHHHH Confidence 43 33221 22223445555666788899999999999999999999763 23344554322 12222223455 Q ss_pred HHHHHHcCC-CCHHHHHHHHHhcCccchhhhhHHHHHHhhccccccccchhHHhhhhhhHHHHhHHHHHHHHHH-HHHHH Q lcl|NC_019408. 453 IQLMANDGL-LPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFINNPDAQARQRGYTNRGQELEQSRMAREA-DFTQQ 530 (612) Q Consensus 453 l~~~~~~G~-is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~e~~r~~~e~-e~~~q 530 (612) +..+.+.|. |+..-+. ++.||..+..+ ++ -+....+........+ ......+...+.--.+... +.+.. 
T Consensus 365 ~~~L~~~G~~i~~~~~~---e~~Gip~~~~~-e~---~~~~~~~~~~~~~~~~--~~~~~~~~~~d~~~~~~~~~~~~~~ 435 (491) T protein:vir:79 365 DEKLTRAGARFTPAYFK---RAYNLQDGDLD-ER---PLPVSAVDAVGAASFA--EFEAPDQDALDAALNALSARDLNAD 435 (491) T ss_pred HHHHHhCCCccCHHHHH---HHhCCCCCCCC-cc---ccCcCccccccccccc--ccCCCCCcchHHHHHHHHHHHHHHH Confidence 667778777 6654332 35677544322 11 1111111111111111 0110111111110000000 00000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhccccCC Q lcl|NC_019408. 531 KIDIQERSVAVQEGHAEVAHAAGSTSISGSRKLGDPE-QAKPAVADQATIDNAKKQTANAAKVAAQ 595 (612) Q Consensus 531 ~~e~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~e~-q~k~~~~eq~~~~~~~k~~~~~a~~~~~ 595 (612) -.. - .+.=++.++ +...-++.+.+..+-- ..-..+.++ .-+.-=..+.-.-++.+ T Consensus 436 ~~~----~--~~~i~~~l~--~~~s~~e~~~~L~~l~~~~d~~~l~~--~l~~a~~~A~l~Gr~~a 491 (491) T protein:vir:79 436 AQA----L--VAPLLKRIA--NGASADELLGMLAELYPSLDTDALQE--RLARAIFVANLWGRLHA 491 (491) T ss_pred HHH----H--HHHHHHHHH--hcCCHHHHHHHHHHHhhcCCHHHHHH--HHHHHHHHHHHhhhccC Confidence 000 0 000000000 0011111121211000 000000010 00011112222222222 No 128 >protein:vir:108215 Length: 469 # NCBI annotation: gp6 # Family: family:all:2372 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552335;genbank:gi:160700655;genbank:GeneID:5758935 Probab=93.90 E-value=0.006 Score=32.76 Aligned_cols=421 Identities=10% Similarity=-0.008 Sum_probs=181.0 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCC------C-CCCHHHHHHHHhhccCCchHHHHHHHhhchhhcC Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAM------K-GADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRR 73 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~------~-~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k 73 (612) ++.-|.+.--... .|+..+.-.+ ..|... - +.+-+-|+....+ -.++...++.....|.+. 
T Consensus 4 ~~~~~~p~~~~g~--------~~~~~~~~~~-~~~~~~e~~~~lr~~~~~~ly~~m~e~---D~~i~s~l~~rk~av~~~ 71 (469) T protein:vir:10 4 RVKTAAPVSEAGY--------VFGSGVVDGW-TVWDPFEQTPELQWPQSVAVYSRMDNE---DSRVTSLLEAISLPIRST 71 (469) T ss_pred cccCCCCccchhh--------hhhcccccch-hhccccccccccccccchHHHHHHHhh---ChHHHHHHHHHHHHHhcC Confidence 3333333211110 1111111111 111100 0 1233467766554 366677777777777777 Q ss_pred Cceee--cCCHH----HHHHH-hcc-----------CCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceE Q lcl|NC_019408. 74 DPIVK--NLPPK----FKDAV-RRF-----------AKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFA 135 (612) Q Consensus 74 ~p~~~--~~p~~----l~~~~-~d~-----------D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~ 135 (612) +..|+ +.++. +...+ ..+ |....+...++...+..++.||.+.+=+. T Consensus 72 ~w~v~p~~~~~e~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~w~~~l~~~l~~a~~~G~s~~Eiv--------------- 136 (469) T protein:vir:10 72 PWRIRANGASDEVTEFVSRNLMVPIDGEDDVRNPGRSRGRFSWAEHLEEVTSPTLQFGHAVFEQV--------------- 136 (469) T ss_pred CceEecCCCCHHHHHHHHHHHHhhhhhhhhhhhhhhhhccccHHHHHHHHHHHhhhhCceeeeee--------------- Confidence 77773 11222 22211 111 12245677888888888888997765443 Q ss_pred EEechhhhhcchhhh-ccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccc Q lcl|NC_019408. 136 VGYSAENILDWDEVV-DMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSY 214 (612) Q Consensus 136 ~~~~ae~IinW~~~~-~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~ 214 (612) |+... ..+|+..+..+..|-...-..|..+. ++ |. T Consensus 137 ----------w~~~~~~~dG~~~~~~l~~rp~~~i~~~~~~~-----------------~~---------------~l-- 172 (469) T protein:vir:10 137 ----------YRPRNQSPDGRFWLRKLAPRPQWTISKFNVAP-----------------DG---------------GL-- 172 (469) T ss_pred ----------eecccccCCCceeeeeeeecCcccceeeeecc-----------------CC---------------ce-- Confidence 32111 12344333333333110000000000 00 00 Q ss_pred eeeeeeeeccccccccccccceeEEEEEeeCCCceecceeeeccCCccccceeEEE-eecCCCCCCcCcCchHHHHHHHH Q lcl|NC_019408. 
215 ITVYRELKLEEIEWPSGEVKLAYVQYLYEEDPESRPIARIVPTVRGEPLDFIPFKF-FGASGNTADVEKPPLLDICDLNL 293 (612) Q Consensus 215 ~~~~R~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~-~~~~~~~~~~~~pPLldLA~lnl 293 (612) ..+|.... .+.+. ...+.. ..+|.+|+.==|++ .+....+.-.+.+.|..++..-+ T Consensus 173 -~~~~~~~~------~~~~~----~~~~~~------------~~~~~~lp~~k~i~~~~~~~~g~p~g~gLlr~~~~~~~ 229 (469) T protein:vir:10 173 -ESIEQIAP------PARTR----GSLYVA------------NIAPPEIPVNRLVVYTRNKRPGQWQGKSILRSAYKHWL 229 (469) T ss_pred -eeeeecCc------ccccc----cccccC------------CCCccccccCcEEEEEecCCCCCcccchhHHHHHHHHH Confidence 00000000 00000 000000 00111111111332 22233333445666666655543 Q ss_pred -HHHhhhHHHHHHHHHhccceeeeecCCCCCCce----------EEEeccccccCCCCCceeEEecCchhHHHHHHHHHH Q lcl|NC_019408. 294 -SHYRTYAELEYGRLFTALPVYYAPGTDSEGTGE----------YHIGPNMVWEVPQGSEPGILEYTGQGLKALETALND 362 (612) Q Consensus 294 -~HY~~~sD~~~~l~~~~~P~l~i~G~~~~~~~~----------l~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~ 362 (612) ++| ...+.-.-+..-++|+++..-......++ +..|+++++.+|.|.++.|++.+|++. .....++. T Consensus 230 fK~~-~~~~w~~f~EryG~P~~vgky~~~a~~~ek~~l~~a~~~~~~g~~a~~iip~~~~ie~~ea~g~~~-~~~~li~~ 307 (469) T protein:vir:10 230 LKDK-LLRIEAATAERNGMGIPVGTASSATDEDEVRKMAALARSVRGGINAGVGLAQGQILELLGVSGNLP-DIRRAIEG 307 (469) T ss_pred HHHH-HHHHHHHHHHHcCCcceEEecCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEeecCCCch-HHHHHHHH Confidence 444 44556667777789999886332211111 345888899999999999999988764 56777777 Q ss_pred HHHHHHH--HHHHhhhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHcCCcCCCCcceEEEeecc Q lcl|NC_019408. 363 KERQIAA--IGGRMMPGASKSVSESNNQTVLREANEQSLLLNIIQACESGMT-DVVRWWLMWRDVPLADTENLRYEVNTD 439 (612) Q Consensus 363 ~e~qm~~--lGa~ll~~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~-~~l~~~a~w~g~~~~~~~~~~v~ln~d 439 (612) ..++|.. +|-. +...++.+ |--.......-..-++.+.+..++++++ +++.+++.|-.- .+..-.+|.+.. 
T Consensus 308 ~d~~Isk~iLG~t-lTs~~~gG--S~a~~~vh~ev~~d~~~sDa~~i~~tln~~li~~l~~lN~g--~~~~~P~~~~~~- 381 (469) T protein:vir:10 308 HDRSIALSGLAHF-LNLDGKGG--SYALASVLEDPFTQAVHAYATSICRIANQHIIEDLVDINFG--VDTPAPVLTFDP- 381 (469) T ss_pred HHHHHHHHHhccc-ccccCccc--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC--CCCCccEEEecC- Confidence 7777643 4533 33322222 2223444555555678889999999997 588888887421 112224555531 Q ss_pred ccccCCCHHHHHHHHHHHHcCCCCH-HHHHHHH-HhcCccchhhhhHHHHHHhhc-cccccc-cchhHHhhhhhhHHHHh Q lcl|NC_019408. 440 FLSTPIGAREMRAIQLMANDGLLPD-PVFYEYM-RKAEVISSDMTFEEFQALRAD-ENSFIN-NPDAQARQRGYTNRGQE 515 (612) Q Consensus 440 F~~~~~d~~~~~al~~~~~~G~is~-et~~~~l-qr~~vl~~~~~~eee~~ria~-e~~~~~-~~~~~~~~~~e~~r~~~ 515 (612) . ...+...+.++..+.+.|.+.. +....++ ++.|+..+... +......+. ..+... .+........... T Consensus 382 ~--e~~~~~~a~~i~~l~~~G~~~~~~~~~~~~~e~~gip~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---- 454 (469) T protein:vir:10 382 I--GSRQDLTAAAVKLLYDAGVFDDDPAVKRAIRQRFNLPSELND-TPSAEPEEPAAVPNQSAAPARTRSSGNADA---- 454 (469) T ss_pred C--CCcHHHHHHHHHHHHhcCCccCccccHHHHHHHhCCCCCCCC-cccccchhcccCCCCCccccccCCCCCccc---- Confidence 1 1122334666778888888432 2122233 45677544322 111111111 000000 0000000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 516 LEQSRMAREADFTQQKIDIQERSVAVQE 543 (612) Q Consensus 516 ~e~~r~~~e~e~~~q~~e~~~r~~~~~~ 543 (612) ..+.. +.+..+... + T Consensus 455 --~~~~~---~~~~~~l~d--------a 469 (469) T protein:vir:10 455 --RARAP---KADQGVLFD--------A 469 (469) T ss_pred --ccccC---CChHHhhcc--------C Confidence 00000 000000000 0 No 129 >protein:vir:7321 Length: 556 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848212;genbank:gi:30387383;genbank:GeneID:2641872 Probab=93.67 E-value=0.0068 Score=32.48 Aligned_cols=507 Identities=10% Similarity=0.036 Sum_probs=193.6 Q ss_pred CCcHHHHHHHHHHHHHHHHhcC-hHHHHhcccccCCCCCC-CCHHHHHHHHhh-ccCCchHHHHHHHhh----chhhc-C Q lcl|NC_019408. 
2 VTHPEYQYWRPEWTKLRDVMAG-QREIKRKAEAYLPAMKG-ADGDDYAIYLQR-ATFFNMLAQTRDGMT----GMVFR-R 73 (612) Q Consensus 2 ~~hP~y~~~~~~W~~i~d~~~G-~~~vr~~g~~YLPk~~~-e~~~~Y~~rl~r-A~~~n~~~~tv~~~~----G~vf~-k 73 (612) ..+++-..+..+|+.+..--.= ...|+....-.||.... .++......... -.|-+.-.+.++.++ +.+|. . T Consensus 1 m~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~Las~l~~~ltpp~ 80 (556) T protein:vir:73 1 MAETEKERLLKQLAQLKNERTSFESHWLDLSDFINPRGSRFLTSDVNRDDRRNTKIVDPTGSMAQRILSSGMMSGITSPA 80 (556) T ss_pred CChhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCcCCCCCCcchhhcCccccchHHHHHHHHHHHHHHhhcCCC Confidence 4555555555555554443211 34445544444564321 222222221222 234444445554444 44443 1 Q ss_pred Cceee------cCC--HHHHHHHhccC------CCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEec Q lcl|NC_019408. 74 DPIVK------NLP--PKFKDAVRRFA------KDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYS 139 (612) Q Consensus 74 ~p~~~------~~p--~~l~~~~~d~D------~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ 139 (612) .|=+. ++. ..++.|++.|. ....+++.-+..++.+.+.+|-+-++||.... ....+..|+ T Consensus 81 ~~WF~l~~~d~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~------~~~r~~~~~ 154 (556) T protein:vir:73 81 RPWFKLATPDPDMMDYGPVKIWLEVVQRRMNEVFNKSNLYQSLPVMYASLGTFGTGAMAVMEDDQ------DVIRTMPFP 154 (556) T ss_pred CcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeeeeecCC------ceEEEEEee Confidence 11110 000 23444544422 23456888888889999999999999885431 112233344 Q ss_pred hhhhhcchhhhccCCccceeEEEEEEEeeccccccCCCccccccee-eeeeEeeecccccccceeecccccccccceee- Q lcl|NC_019408. 140 AENILDWDEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQAR-KARAAALASGSASSPMVRQTARTLGGYSYITV- 217 (612) Q Consensus 140 ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~-q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~- 217 (612) ..++. | ..|+...+.-|+-++.+. .+.-.+.|+..... ..+... ..+.. 
...+++ T Consensus 155 l~~~~-~----~~d~~G~vd~i~r~~~~t---~~ql~~~fg~~~l~~~v~~~~-~~~~~--------------~~~~~v~ 211 (556) T protein:vir:73 155 IGSYY-L----ANSPRGSVDTCIRQFSMT---VRQMVQEFGLDNVSTSVKGMW-ENGTY--------------ETWVEVN 211 (556) T ss_pred cceeE-E----eeCCCCCeEEEEEEEecc---HHHHHHHcCcccCCHHHHHHH-hcCCc--------------cceEEEE Confidence 43332 1 112333332222222111 11112222211110 000000 00100 001111 Q ss_pred eeeeecccc-ccccccccceeEEEEEeeCCCceecceeeeccCC-ccccceeEEEeecCCCCCCcCcCc---hHHHHHHH Q lcl|NC_019408. 218 YRELKLEEI-EWPSGEVKLAYVQYLYEEDPESRPIARIVPTVRG-EPLDFIPFKFFGASGNTADVEKPP---LLDICDLN 292 (612) Q Consensus 218 ~R~~~~~~~-~~~~g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g-~~l~~IP~v~~~~~~~~~~~~~pP---LldLA~ln 292 (612) |.+...... ..+.+..+-.+....|..+..+ ..+...+| ..+++||+.|.-..+..+..+.|- |-|+..|| T Consensus 212 ~~V~pr~~~~~~~~~~~~~p~~s~~~~~~~~~----~~vl~esg~~e~P~~~~Rw~~~~ge~YGrg~P~~~~lgD~k~L~ 287 (556) T protein:vir:73 212 HCITPNVNRDSGKMDSKNKPYRSVYFESGGDS----DKLLRESGFDEFPILAPRWEVNGEDVYASSCPGMLALGQVKALQ 287 (556) T ss_pred EEEeccccccccccCcccceEEEEEEEecCCC----ceecccCCcccCCceeeeeeecCCcccccCccHHHhHHHHHHHH Confidence 111111000 0011111111211222222221 12233344 357888888887777766554432 45676666 Q ss_pred HHHHhhhHHHHHHHHHhccceeeeecCCCCCCceEEEeccccccC--CCC-CceeEEecCchhHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 293 LSHYRTYAELEYGRLFTALPVYYAPGTDSEGTGEYHIGPNMVWEV--PQG-SEPGILEYTGQGLKALETALNDKERQIAA 369 (612) Q Consensus 293 l~HY~~~sD~~~~l~~~~~P~l~i~G~~~~~~~~l~iG~~~~~~l--p~~-~~~~~lE~~g~~l~~~~~~l~~~e~qm~~ 369 (612) .-+ -..-...+.+..|.+.+++. .....+.+.|+..+.. +.+ ..+.-+......+..+.+.|++++..++. 
T Consensus 288 ~l~----~~~l~~~~~~~~pp~~v~~~--~~~~~~~~~pgg~~~~~~~~~~~~i~p~~~~~~d~~~~~~~i~~~~~rI~~ 361 (556) T protein:vir:73 288 VEQ----KRKAQLIDKATNPPMVAPTS--LKNQRVSLLPGDVTYLDVISGQDGFKPAYLVNPNTADLLADIQDTRQTINS 361 (556) T ss_pred HHH----HHHHHHHHHHhcCceecccc--ccccceeeccCccccccCCCCccceeeeccccccHHHHHHHHHHHHHHHHH Confidence 422 22234555566665555321 2234567777765433 322 23443332223477778889999988875 Q ss_pred HH-HH---hhhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHcCC-cCC----CCcceEEE Q lcl|NC_019408. 370 IG-GR---MMPGASKSVSESNNQTVLREANEQSLLLNIIQACESGM-----TDVVRWWLMWRDV-PLA----DTENLRYE 435 (612) Q Consensus 370 lG-a~---ll~~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~-----~~~l~~~a~w~g~-~~~----~~~~~~v~ 435 (612) += +. ++.. .....-||++...+...-...|..+..++.+=+ +++|.++.+ .|. +.. .+.+++|+ T Consensus 362 af~~d~~~~l~~-~~~~r~TAtEv~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r-~g~lP~~P~~l~~~~i~v~ 439 (556) T protein:vir:73 362 AYFVDLFMMLQN-INTRSMPVEAVIEMKEEKLLMLGPVLERLNDEALNPLIDRVFSIMAR-KNMLPEPPDVLQGMPLRIE 439 (556) T ss_pred Hhhcchhhhhcc-CCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh-cCCCCCCchhhcCceeEEE Confidence 42 11 1211 122235899999888888888888888875542 344444444 232 110 01122222 Q ss_pred eeccccccCCCHHHHHHHHHHHHcC-CCCHHHHHHHHHhc--CccchhhhhHHHHHHhhccccccccchhHHhhhhhhHH Q lcl|NC_019408. 436 VNTDFLSTPIGAREMRAIQLMANDG-LLPDPVFYEYMRKA--EVISSDMTFEEFQALRADENSFINNPDAQARQRGYTNR 512 (612) Q Consensus 436 ln~dF~~~~~d~~~~~al~~~~~~G-~is~et~~~~lqr~--~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r 512 (612) |.. + +.+...+.... .+..-.++..+... .++ +.+++++..+.+++- ++-|...=+.+++. 
T Consensus 440 ----yis----~--La~aqk~~~~~~i~~~~~~~~~laq~~Pe~~-d~id~d~~~~~~a~~---~Gvp~~~irs~eev-- 503 (556) T protein:vir:73 440 ----YIS----V--MAQAQKSIGLTSLSQTVGFIGQLAQFKPEAL-DKLDVDQAIDAFSEM---SGVSPTVIVPQEQV-- 503 (556) T ss_pred ----eec----H--HHHHHHHHHHHHHHHHHHHHHHHhccChhhH-hcCCHHHHHHHHHHH---cCCChhhcCCHHHH-- Confidence 211 1 22222221111 11111122222111 111 235666666776654 22232111111110 Q ss_pred HHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc Q lcl|NC_019408. 513 GQELEQSRMAREADFTQQKIDIQERSVAVQEGHAEVAHAAGSTSISGSRKLGDPEQAKPAVADQATIDNAKKQTANAAKV 592 (612) Q Consensus 513 ~~~~e~~r~~~e~e~~~q~~e~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~e~q~k~~~~eq~~~~~~~k~~~~~a~~ 592 (612) ++.|+++. ++|+.+.+.+.. +...+..... ++..-.--.+. ++...+-= T Consensus 504 ----~~~rq~r~--------~~qq~~~~~~~~---------~~a~~~~~~~--------~~~~~~~~~~l--~~~~~~~g 552 (556) T protein:vir:73 504 ----QGIREERA--------KQAQAAQAMAMG---------QAAAQGAKTL--------SETQTSDPSAL--TAIANAAG 552 (556) T ss_pred ----HHHHHHHH--------HHHHHHHHHHHH---------HHHHHHHHHh--------hhccCCCHHHH--HHHHHhhc Confidence 11111100 000000000000 0000000010 10000000000 01111111 Q ss_pred cCCC Q lcl|NC_019408. 593 AAQP 596 (612) Q Consensus 593 ~~~~ 596 (612) +++. T Consensus 553 ~~~~ 556 (556) T protein:vir:73 553 APQQ 556 (556) T ss_pred CCCC Confidence 1111 No 130 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=93.41 E-value=0.0077 Score=32.18 Aligned_cols=456 Identities=12% Similarity=0.080 Sum_probs=178.6 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhh----chhhcCCce Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMT----GMVFRRDPI 76 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~----G~vf~k~p~ 76 (612) |-.+. ..+.++|+-|.+.+--.. .|. .+.+.. .++. 
-.|-+...+.++.++ +.+|=..|= T Consensus 19 l~~~R--~~~e~~w~e~~~y~lP~~---------~~~-~~~~~~---~~~~-~~~dst~~~a~~~Las~l~~~ltP~~~W 82 (522) T protein:vir:94 19 LKNGR--QPYETRAQNCAAVTIPSL---------FPK-ESDNSS---TEYT-TPWQAVGARCLNNLAAKLMLALFPQSPW 82 (522) T ss_pred HHHHh--hHHHHHHHHHHHHhcccc---------cCC-CCCccc---cccc-ccccccHHHHHHHHHHHHHhhcCCCCcc Confidence 21111 224555655555542211 111 111111 1111 134444445554443 333311120 Q ss_pred ee---------------cCCHHHHHHHhcc------CCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceE Q lcl|NC_019408. 77 VK---------------NLPPKFKDAVRRF------AKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFA 135 (612) Q Consensus 77 ~~---------------~~p~~l~~~~~d~------D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~ 135 (612) +. +....++.|++.| -...++.+.-+-.++.+.+.+|-+.++|+=+.. +.-..+ T Consensus 83 Frl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~-----~~~~~~ 157 (522) T protein:vir:94 83 MRLTVSEYEAKTLSQDSEAAARVDEGLAMVERVLMAYMETNSFRVPLFEALKQLIVSGNCLLYIPEPEQ-----GTYSPM 157 (522) T ss_pred cccccchhhhhccCcccchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEeeeccCC-----CceeeE Confidence 00 0112355555543 223456788888889999999998888864332 111124 Q ss_pred EEechhhhhcchhhhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccce Q lcl|NC_019408. 136 VGYSAENILDWDEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYI 215 (612) Q Consensus 136 ~~~~ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~ 215 (612) ..|+-.++. -..|+...+.-|+.++.+..... .+.|.... 
..+ ...+...+ T Consensus 158 ~~~pl~~y~-----v~~d~~G~vd~i~r~~~~~~~~l---~~~~~~~~--------~~~-------------~~~p~~~v 208 (522) T protein:vir:94 158 RMYRLVSYV-----VQRDAFGNILQIVTIDKVAFSAL---PEDVKSQL--------NAD-------------DYEPDTEL 208 (522) T ss_pred EEEEcceEE-----EeeCCCcCeEEEeeeeeccHHhc---chHHHHHH--------hcc-------------cCCccceE Confidence 556544422 12244444555555554332111 11111000 000 00112223 Q ss_pred eeeeeeeccccccccccccceeEEEEEeeCCCceecceeeeccCCc-c---ccceeEEEeecCCCCCCcCcCc----hHH Q lcl|NC_019408. 216 TVYRELKLEEIEWPSGEVKLAYVQYLYEEDPESRPIARIVPTVRGE-P---LDFIPFKFFGASGNTADVEKPP----LLD 287 (612) Q Consensus 216 ~~~R~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g~-~---l~~IP~v~~~~~~~~~~~~~pP----Lld 287 (612) ++|...... + + .+. +|.+.. +..++...|. + +++||+.|.-..+..+ |..| |-| T Consensus 209 ~v~~~v~~~------~--~-~~~--~~~~~~-----g~~~~~~~~~~~~~e~P~~~~Rw~~~~ge~Y--Grgp~~~~l~D 270 (522) T protein:vir:94 209 EVYTHIYRQ------D--D-EYL--RYEEVE-----GIEVTGTDGSYPLTACPYIPVRMVRLDGEDY--GRSYCEEYLGD 270 (522) T ss_pred EEEEEEEee------C--C-cee--EEeecc-----CceecccCCCCccccCCceeeeeeecCCCcc--ccchHHHHHHH Confidence 444332221 0 0 111 111111 1122222222 3 5566666665555444 4444 446 Q ss_pred HHHHHHHHHhhhHHHHHHHHHhccceeee-ecCCCCCCceEEEeccccccCCCCCceeEEecC-chhHHHHHHHHHHHHH Q lcl|NC_019408. 288 ICDLNLSHYRTYAELEYGRLFTALPVYYA-PGTDSEGTGEYHIGPNMVWEVPQGSEPGILEYT-GQGLKALETALNDKER 365 (612) Q Consensus 288 LA~lnl~HY~~~sD~~~~l~~~~~P~l~i-~G~~~~~~~~l~iG~~~~~~lp~~~~~~~lE~~-g~~l~~~~~~l~~~e~ 365 (612) +..||. -+.+-+. ..+.+..|.+.+ .+.... ...+.-|.++.+.-...++.+.++.. +..+..+...|++++. 
T Consensus 271 ~k~L~~---l~~~~l~-~~~~~~~p~~~v~~~g~~~-~~~~~~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~ 345 (522) T protein:vir:94 271 LNSLET---ITEAITK-MAKVASKVVGLVNPNGITQ-PRRLNKAATGEFVAGRVEDINFLQLTKGQDFTIAKSVADAIEQ 345 (522) T ss_pred HHHHHH---HHHHHHH-HHHHHhCCceeeccccccc-chheeccCCceeecCCcccceeeecccccchhHHHHHHHHHHH Confidence 666664 3333344 444455555444 322111 22344443334433333445555532 4468888999999999 Q ss_pred HHHHHHH-HhhhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHcCC-cCCCCcceEEEeec Q lcl|NC_019408. 366 QIAAIGG-RMMPGASKSVSESNNQTVLREANEQSLLLNIIQACESGMT-----DVVRWWLMWRDV-PLADTENLRYEVNT 438 (612) Q Consensus 366 qm~~lGa-~ll~~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~-----~~l~~~a~w~g~-~~~~~~~~~v~ln~ 438 (612) .++.+=. ..+... ....-||+....+...-...|..+-.++.+-+- .++.++.+ .|. +....+.++++| T Consensus 346 rI~~af~~~~~~~~-~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r-~g~lP~~p~~~v~v~~-- 421 (522) T protein:vir:94 346 RLGWAFLLNSAVQR-NAERVTAEEIRYVAGELEATLGGVYSVQSQELQLPIVRVLMNQLQS-AGMIPDLPKEAVEPTV-- 421 (522) T ss_pred HHHHHHhhhhhccC-CCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHh-cCCCCCCCcccEEeeE-- Confidence 9876421 112111 122358889888888888888888777755443 34444433 232 111223334333 Q ss_pred cccccCCC----HHHHHHHHHHHHcCCCCHHHHHHHHHhc--CccchhhhhHHHHHHhhccccccccchhHHhhhhhhHH Q lcl|NC_019408. 439 DFLSTPIG----AREMRAIQLMANDGLLPDPVFYEYMRKA--EVISSDMTFEEFQALRADENSFINNPDAQARQRGYTNR 512 (612) Q Consensus 439 dF~~~~~d----~~~~~al~~~~~~G~is~et~~~~lqr~--~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r 512 (612) . .++. .++++.|+.. +..+... .++++.+++++..+.+++-.... 
+...=+.++|.+ T Consensus 422 --~-s~La~~qr~~~~~~l~~~-----------~~~ia~l~P~~~~~~id~d~~~~~~a~~~Gv~--~~~ivr~~ee~~- 484 (522) T protein:vir:94 422 --S-TGLEALGRGQDLEKLTQA-----------VNMMTGLQPLSQDPDINLPTLKLRLLNALGID--TAGLLLTQDEKI- 484 (522) T ss_pred --e-cHHHHHHHHHHHHHHHHH-----------HHHHHhccchhhhhcCCHHHHHHHHHHHcCCC--hhhccCCHHHHH- Confidence 2 1221 1122222222 2222111 11223356666667766542210 111111111100 Q ss_pred HHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 513 GQELEQSRMAREADFTQQKIDIQERSVAVQEGHAEVAHAAGSTSISGSRKLGDPEQAKPAVADQA 577 (612) Q Consensus 513 ~~~~e~~r~~~e~e~~~q~~e~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~e~q~k~~~~eq~ 577 (612) +.|++ |+.++.+ .+++..+. + ......+ .++- ....++ T Consensus 485 -----~~~~q------~~~~~~~-~~~~~~~~-~-----------~~~a~~~-~~~~--~~~~~~ 522 (522) T protein:vir:94 485 -----QRMAE------QSSQQAV-VQGASAAG-A-----------NMGAAVG-QGAG--EDMAQA 522 (522) T ss_pred -----HHHHH------HHHHHHH-HHHHHHHH-H-----------Hhhhhhh-cccc--hhhhcC Confidence 10100 0000000 00000000 0 0000000 0000 000000 No 131 >protein:vir:79511 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468055;genbank:gi:157265497;genbank:GeneID:5600628 Probab=93.38 E-value=0.0078 Score=32.15 Aligned_cols=398 Identities=9% Similarity=-0.043 Sum_probs=159.3 Q ss_pred CCCcHHH--HHHHHHHHHHHHHhcChHHHHhcccccCCCC-----CCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcC Q lcl|NC_019408. 1 MVTHPEY--QYWRPEWTKLRDVMAGQREIKRKAEAYLPAM-----KGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRR 73 (612) Q Consensus 1 ~~~hP~y--~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~-----~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k 73 (612) +++-|.= ....+.|..++.-..+... .| ..+++. ...+-+-|+..+.- .++...++.....|... 
T Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~g-~~~~~~~~iLr~~~~~~ly~~m~~D----~hi~s~l~~Rk~av~~~ 82 (448) T protein:vir:79 11 LVPGPGSIDPSDVPKLEGASVPVMSTSY---DV-VVDREFDELLQGKDGLLVYHKMLSD----GTVKNALNYIFGRIRSA 82 (448) T ss_pred ccCcccccccccchhhhhhhhhhccccc---cc-ccccchhHhhccccchHHHHHHhhC----hHHHHHHHHHHHHHhcC Confidence 4443311 1223444444443322111 11 000110 01234567776543 55666666666666666 Q ss_pred Cceee--cCCHH-------HHHHHhccCC--CCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhh Q lcl|NC_019408. 74 DPIVK--NLPPK-------FKDAVRRFAK--DGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAEN 142 (612) Q Consensus 74 ~p~~~--~~p~~-------l~~~~~d~D~--~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~ 142 (612) +..|+ +.++. +..++...|. .-.+++.++..++ .++-+|.+.+=+. T Consensus 83 ~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~~~~f~~~~~~~l-da~~~G~s~~Eiv---------------------- 139 (448) T protein:vir:79 83 KWYVEPASTDPEDIAIAAFIHAQLGIDDASVGKYPFGRLFAIYE-NAYIYGMAAGEIV---------------------- 139 (448) T ss_pred CceEecCCCCHHHHHHHHHHHHHhhhhhhhhccCCHHHHHHHHH-HhhhhcceeEEEE---------------------- Confidence 66663 11111 2222222221 1245777777766 4667776544332 Q ss_pred hhcchhhhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeee Q lcl|NC_019408. 143 ILDWDEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELK 222 (612) Q Consensus 143 IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~ 222 (612) |+. ..+|+..+..+..+.... ..-|. +..+ + + -++|.. T Consensus 140 ---w~~--~~~g~~~~~~l~~r~~~~-------~~~f~---------~~~d-~---------------~----l~~~~~- 177 (448) T protein:vir:79 140 ---LTL--GADGKLILDKIVPIHPFN-------IDEVL---------YDEE-G---------------G----PKALKL- 177 (448) T ss_pred ---eee--cCCCceecccccccCCcc-------cccee---------eecC-C---------------c----eEEeec- Confidence 321 113332222222221000 00000 0000 0 0 000000 Q ss_pred ccccccccccccceeEEEEEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHH-HHHhhhHH Q lcl|NC_019408. 
223 LEEIEWPSGEVKLAYVQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNL-SHYRTYAE 301 (612) Q Consensus 223 ~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl-~HY~~~sD 301 (612) .++. ....+..+|.++++=-|+.+.....+.-.+.+.|..++..-+ ++|- ..+ T Consensus 178 ---------------------~~~~----~~~~~~~~~~~lP~~~~i~~~~~~~g~p~g~gLlr~~~w~~~fK~~~-~~~ 231 (448) T protein:vir:79 178 ---------------------SGEV----KGGSQFVSGLEIPIWKTVVFLHNDDGSFTGQSALRAAVPHWLAKRAL-ILL 231 (448) T ss_pred ---------------------CCcc----cccccCCCccccccceEEEEecCccCCcccchhHHHHHHHHHHHHHH-HHH Confidence 0000 000011122222211233332222333334444445554333 3433 345 Q ss_pred HHHHHHHhccceeeee---cCCCCCCc---------eEEEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHH- Q lcl|NC_019408. 302 LEYGRLFTALPVYYAP---GTDSEGTG---------EYHIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIA- 368 (612) Q Consensus 302 ~~~~l~~~~~P~l~i~---G~~~~~~~---------~l~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~- 368 (612) .-.-+..-++|+++.. |.+....+ .|..|++++..+|.|.++.|++..|++.. ..+.++-...+|. T Consensus 232 w~~f~E~yG~P~~vgky~~ga~~~~~~~~~l~~av~~i~~g~~a~~iiP~~~~ie~~ea~~~~~~-~~~~i~~~d~~Isk 310 (448) T protein:vir:79 232 INHGLERFMIGVPTLTIPKSVRQGTKQWEAAKEIVKNFVQKPRHGIILPDDWKFDTVDLKSAMPD-AIPYLTYHDAGIAR 310 (448) T ss_pred HHHHHHHcCCceEEEecCCCCCcCHHHHHHHHHHHHHHhcCCceEEEecCCceEEEEecCCCccc-HHHHHHHHHHHHHH Confidence 6677777899999886 33321111 13468889999999999999999887643 4556665556653 Q ss_pred -HHHHHhhhccccchhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHcCCcCCCCcceEEEeeccccccCC Q lcl|NC_019408. 369 -AIGGRMMPGASKSVSESNNQTVLR-EANEQSLLLNIIQACESGMTD-VVRWWLMWRDVPLADTENLRYEVNTDFLSTPI 445 (612) Q Consensus 369 -~lGa~ll~~~~~~~~esa~~~~~~-~~~~~s~L~~~a~~~~~a~~~-~l~~~a~w~g~~~~~~~~~~v~ln~dF~~~~~ 445 (612) .|| .++....+. .++....-. ..-..-.+.+-+..+++++++ ++.+++.|-.-+ +..-..|.+. .. 
T Consensus 311 ~iLG-qtlTs~~~~--g~~~~~~~~~~~v~~~~~~aDa~~i~~tln~~li~~l~~lNfg~--~~~~P~~~f~------~~ 379 (448) T protein:vir:79 311 ALGI-DFNTVQLNM--GVQAINIGEFVSLTQQTIISLQREFASAVNLYLIPKLVLPNWPS--ATRFPRLTFE------ME 379 (448) T ss_pred HHhh-hhhcccccc--chhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC--cCCCcEEEec------CC Confidence 345 444433221 222222111 122234557788889999985 788888874211 1111244432 12 Q ss_pred CHHHHHHHHHH----HHcCCCCHHHHHHHHHh-cCccchhhhhHHHHHHhhccccccccchhHHhhhhhhHHHHhHHHHH Q lcl|NC_019408. 446 GAREMRAIQLM----ANDGLLPDPVFYEYMRK-AEVISSDMTFEEFQALRADENSFINNPDAQARQRGYTNRGQELEQSR 520 (612) Q Consensus 446 d~~~~~al~~~----~~~G~is~et~~~~lqr-~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~e~~r 520 (612) ++.++.++.+. ...+.+. +. +.+. .++.++ ...+..+......+.. .+...+. ..+-.=+++| T Consensus 380 e~~Dl~~~a~~~~~l~~~~~~~-~~---~~~~~~~~p~~--~~~~~~~a~~~~~~~~-----~~~~~~~-~~~~~~~~~~ 447 (448) T protein:vir:79 380 ERNDFSAAANLMGMLINAVKDS-ED---IPTELKALIDA--LPSKMRRALGVVDEVR-----EAVRQPA-DSRYLYTRRR 447 (448) T ss_pred ChHHHHHHHHHhhhhhccchhh-HH---HHHHhhcCCCC--CCCccccccCCCCccc-----ccccCCc-cccchhhccc Confidence 34455444333 2222221 11 2222 233221 1111111111111000 0000000 0000011111 Q ss_pred H Q lcl|NC_019408. 521 M 521 (612) Q Consensus 521 ~ 521 (612) . T Consensus 448 ~ 448 (448) T protein:vir:79 448 R 448 (448) T ss_pred C Confidence 1 No 132 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=93.13 E-value=0.0087 Score=31.89 Aligned_cols=387 Identities=11% Similarity=0.050 Sum_probs=172.3 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHH---HHHHhhccCCchHHHHHHHhhchhhcCCcee Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDY---AIYLQRATFFNMLAQTRDGMTGMVFRRDPIV 77 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y---~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~ 77 (612) ++++.-|. ++++|.. .|..|-+ ..+...| ..|... 
..++..|+..+--+.|+...| T Consensus 3 ~~~~d~~~----------~~~~~~~----~~~~~~~---~~~~~~~~l~a~Y~~~----~l~~~~Vd~~aed~~r~g~~i 61 (427) T protein:vir:10 3 IVKHDGYN----------DIFNGGA----DGSPKPF---FMSDASYHVGSFYNDN----ATAKRIVDVIPEEMVTAGFKM 61 (427) T ss_pred ccccchHH----------HHhhcCC----CCcccCc---cccCchHHHHHHHHcC----chhhhhhccchHHhhcCCccc Confidence 55555553 2344421 2222322 1222333 444433 345556667777777888888 Q ss_pred ecCC--HHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhcchhhhccCCc Q lcl|NC_019408. 78 KNLP--PKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILDWDEVVDMGGF 155 (612) Q Consensus 78 ~~~p--~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW~~~~~v~g~ 155 (612) +... +.++..++. ..+..-++.+++.+..+|.++||++.... +| |.. -+.+. T Consensus 62 ~g~~~~~~~~~~~~~-----l~~~~~l~~a~~~~rl~G~a~i~i~v~d~-------~~------------l~~--p~~~~ 115 (427) T protein:vir:10 62 SGVKDEKEFKSLWDS-----YKLDSSLVDLLCWARLYGGAAMVAIIKDN-------RM------------LTS--QAKPG 115 (427) T ss_pred cCccHHHHHHHHHHH-----hhHHHHHHHHHHhccccceeEEEEEecCC-------Cc------------ccc--ccCCC Confidence 5432 234444443 36778888999999999999999875321 11 100 00111 Q ss_pred cceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccc Q lcl|NC_019408. 156 YVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKL 235 (612) Q Consensus 156 ~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~ 235 (612) ..|.-+...............|+ ..+.|+....|++... +. T Consensus 116 g~l~~l~v~d~~~~~~~~~~~dp-----------------------------~s~~fg~P~~y~v~~~-------~~--- 156 (427) T protein:vir:10 116 AKLEGVRVYDRFAITVEKRVTNA-----------------------------RSPRYGEPEIYKVSPG-------DN--- 156 (427) T ss_pred cceeEEEEechhcccccccccCc-----------------------------cccccCcceEEEEecC-------CC--- Confidence 11221111100000000000111 1222233333433110 00 Q ss_pred eeEEEEEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHH-HHHHHHHhcccee Q lcl|NC_019408. 
236 AYVQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAE-LEYGRLFTALPVY 314 (612) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD-~~~~l~~~~~P~l 314 (612) ...+.+|.+ .+ ....|.+++++. .......+.+||....+=.|..|.+.+. -.+++|...+.++ T Consensus 157 ~~~~~iH~S--------Rl-i~~~g~~~p~~~------~~~~~~~G~S~l~~~~~~~i~~~~~~~~~~~~l~~k~~~~v~ 221 (427) T protein:vir:10 157 MQPYLIHHS--------RV-FIADGERVAQQA------RKQNQGWGASVLNKSLIDAICDYDYCESLATQILRRKQQAVW 221 (427) T ss_pred CcceEEccc--------cE-EEecCCCchhhh------cccCCcccchhhhHHHHHHHHHHHHHHHHHHHHHHHhccccc Confidence 000111110 01 111233332221 1122234667777655444677766655 5778888888888 Q ss_pred eeecCCC----CCCc-e---------EEEeccccccCC-CCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHHh---hh Q lcl|NC_019408. 315 YAPGTDS----EGTG-E---------YHIGPNMVWEVP-QGSEPGILEYTGQGLKALETALNDKERQIAAIGGRM---MP 376 (612) Q Consensus 315 ~i~G~~~----~~~~-~---------l~iG~~~~~~lp-~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~l---l~ 376 (612) -+.|+.. .... . ...|.++.+.+. ++.++.-+..+-+++. +.++...++|...-.-. |. T Consensus 222 k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~l~~~~e~~e~~~~~lsgl~---~~~~~~~~~iaaa~~IP~t~L~ 298 (427) T protein:vir:10 222 KVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVLNSDISGVP---EFLSSKMDRIVSLSGIHEIIIK 298 (427) T ss_pred cchhHHHHhcCccchHHHHHHHHHHHHhcCcccceeeecCCCceeEEecccCChH---HHHHHHHHHHHhhhCCCeeeec Confidence 8877532 1111 0 123445555554 4567777776666664 34445555554432111 21 Q ss_pred ccc-cchhHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHcCCcCCCCcceEEEeeccccccCCCHHH----- Q lcl|NC_019408. 377 GAS-KSVSESNNQTVLREANEQSLLLNII-QACESGMTDVVRWWLMWRDVPLADTENLRYEVNTDFLSTPIGARE----- 449 (612) Q Consensus 377 ~~~-~~~~esa~~~~~~~~~~~s~L~~~a-~~~~~a~~~~l~~~a~w~g~~~~~~~~~~v~ln~dF~~~~~d~~~----- 449 (612) .++ +.-+.|+. 
.+...=...+.++- ..+..+++..++++.+ .++++|..|+-+.....+..+ T Consensus 299 G~sp~Glnstgd---~D~~nyyd~i~~~Qe~~l~p~l~~l~~~i~~--------s~~~~~~f~pL~~~s~kEkaei~~~~ 367 (427) T protein:vir:10 299 NKNVGGVSASQN---TALETFYKLVDRKREEDYRPLLEFLLPFIVD--------EEEWSIEFEPLSVPSKKEESEITKNN 367 (427) T ss_pred cCCccccccchh---HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc--------CCCcEEEeCCCCCCCHHHHHHHHHHH Confidence 211 11111221 12222222233322 2345555555655431 246888888655554433222 Q ss_pred HHHHHHHHHcCCCCHHHHHHHHHhcCccch-----hhhhHHHHHHhhcccccc--ccchhH Q lcl|NC_019408. 450 MRAIQLMANDGLLPDPVFYEYMRKAEVISS-----DMTFEEFQALRADENSFI--NNPDAQ 503 (612) Q Consensus 450 ~~al~~~~~~G~is~et~~~~lqr~~vl~~-----~~~~eee~~ria~e~~~~--~~~~~~ 503 (612) ..+...++++|.|+.+...+.|+..+.... +.++++ .....+.+|.. +..++- T Consensus 368 a~a~~~~~~~gvi~~~e~r~~L~~~~~~~~~~~~~~~~~e~-~~~~~e~~p~~~e~~~d~~ 427 (427) T protein:vir:10 368 VESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIRE-PEETTEPEPGLGEKLEDEN 427 (427) T ss_pred HHHHHHHHhcCCCCHHHHHHHHHhhhccccCCCCccccccc-cchhcCCCCCCCCCCCCCC Confidence 356678899999999999999976433221 122221 11111111111 111111 No 133 >protein:vir:105641 Length: 516 # NCBI annotation: putative head-tail connector # Family: family:all:481 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425006;genbank:gi:83571754;uniprot:Q2WC46;genbank:GeneID:3837282 Probab=93.04 E-value=0.009 Score=31.80 Aligned_cols=455 Identities=11% Similarity=0.069 Sum_probs=179.9 Q ss_pred CCC-----cHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhh----chhh Q lcl|NC_019408. 1 MVT-----HPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMT----GMVF 71 (612) Q Consensus 1 ~~~-----hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~----G~vf 71 (612) +.. -.+...+..+|+-|.+.+--+. +| ..+.. 
.+.++ .|-+.-.+.++.++ +.+| T Consensus 16 l~~r~~~L~~~R~~~e~~w~e~a~~~lP~~---------~~---~~~~~---~~~~~-~~dstg~~a~~~LAa~l~~~lt 79 (516) T protein:vir:10 16 IPKLWEKFSTKRSSFLDRAKHYSKLTLPYL---------MN---DKGDN---ETSQN-GWQGVGAQATNHLANKLAQVLF 79 (516) T ss_pred HHHHHHHHHHhhhHHHHHHHHHHHhhcccc---------cC---CCCCc---ccccc-cccchHHHHHHHHHHHHHhhhc Confidence 111 1122344556665555543311 11 11111 11111 34444444555443 3333 Q ss_pred c-CCceee-cCC--------------HHHHHHHhcc------CCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhh Q lcl|NC_019408. 72 R-RDPIVK-NLP--------------PKFKDAVRRF------AKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGA 129 (612) Q Consensus 72 ~-k~p~~~-~~p--------------~~l~~~~~d~------D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~ 129 (612) . ..|=+. .++ ..++.|++.| -...++++.-+-.++.+.+.+|-+.+++|.+. T Consensus 80 pp~~~WF~L~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~d~~~------ 153 (516) T protein:vir:10 80 PAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSCMLYKPSKG------ 153 (516) T ss_pred CCCCccccccCChhhHhhhhccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEeEEecCCC------ Confidence 3 111110 111 2356666655 33556788888888999999999988887432 Q ss_pred ccCceEEEechhhhhcchhhhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeeccccc Q lcl|NC_019408. 130 VATSFAVGYSAENILDWDEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTL 209 (612) Q Consensus 130 ~~rPy~~~~~ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~ 209 (612) + +..|+-.++. -..|+...+.-|+.++.+..- .-...|.. .....+.. ... T Consensus 154 ---~-~~~~pl~~y~-----v~~d~~G~v~~ivrr~~~~~~---~l~e~~~~-~~~~~~~~----------------~~~ 204 (516) T protein:vir:10 154 ---A-ISAIPMHHYV-----VNRDTNGDLLDIILLQEKSLR---TFDPATRA-VVEVGLKG----------------KKC 204 (516) T ss_pred ---C-eEEEEcCeEE-----EeeCCCCCeEEEeeeecccHH---HHHHHhhh-hhhhhhhh----------------hcc Confidence 1 3455544422 122444445556666543221 11111110 00000000 000 Q ss_pred ccccceeeeeeeeccccccccccccceeEEEEEeeCCCceecceeeeccCC---ccccceeEEEeecCCCCCCcC--cCc Q lcl|NC_019408. 
210 GGYSYITVYRELKLEEIEWPSGEVKLAYVQYLYEEDPESRPIARIVPTVRG---EPLDFIPFKFFGASGNTADVE--KPP 284 (612) Q Consensus 210 ~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g---~~l~~IP~v~~~~~~~~~~~~--~pP 284 (612) .....+.+|-..... +++ .|.+|.+.++. .+-..+| ..+++||+.|.-..+..+..+ .-- T Consensus 205 ~~~~~~~i~t~v~~~----~~~------~~~~~~~~d~~-----~~~~~s~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~ 269 (516) T protein:vir:10 205 KEDDSIKLYTHAKYL----GEG------FWELKQSADDI-----PVGKVSKIKSEKLPFIPLTWKRSYGEDWGRPLAEDY 269 (516) T ss_pred CCCCceEEEEEEEec----CCC------ceEEEEeeCce-----eeccccccccccCCeeeeeeeecCCCCcccchHHHh Confidence 001112222111100 011 12223222111 1112233 245777777776655555443 112 Q ss_pred hHHHHHHHHHHHhhhHHHHHHHHHhccceeeeecCCCCCCceEEEeccccccCCCCCceeEEecC-chhHHHHHHHHHHH Q lcl|NC_019408. 285 LLDICDLNLSHYRTYAELEYGRLFTALPVYYAPGTDSEGTGEYHIGPNMVWEVPQGSEPGILEYT-GQGLKALETALNDK 363 (612) Q Consensus 285 LldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~~~~~~~~l~iG~~~~~~lp~~~~~~~lE~~-g~~l~~~~~~l~~~ 363 (612) |-|+..||. -+.+-+..+.+....|.++-..... ....+.-|.++.+.-...++.+.++.. +..+..+.+.|+++ T Consensus 270 L~D~k~L~~---l~~~~l~~~~~a~~~~~lv~p~g~~-~~~~l~~~~~g~~~~g~~~~v~~~q~~~~~d~~~~~~~i~~~ 345 (516) T protein:vir:10 270 SGDLFVIQF---LSEAVARGAALMADIKYLIRPGAQT-DVDHFVNSGTGEVVTGVEEDIHIVQLGKYADLTPISAVLEVY 345 (516) T ss_pred hHHHHHHHH---HHHHHHHHHHHhcCCCcccCccccc-chhhhccCCCceeecCCcccceeeecCcccchHHHHHHHHHH Confidence 557777774 2333344454555555555432211 123355555545432223345566654 33588889999999 Q ss_pred HHHHHHHHH--HhhhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-cCCcCCCCcceEEEeeccc Q lcl|NC_019408. 364 ERQIAAIGG--RMMPGASKSVSESNNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMW-RDVPLADTENLRYEVNTDF 440 (612) Q Consensus 364 e~qm~~lGa--~ll~~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w-~g~~~~~~~~~~v~ln~dF 440 (612) +..++.+-. .+..... ..-||++...+...-...|.-+-.++.+=+-.-|-..+.+ ++-.+ +.+. 
++.++ T Consensus 346 ~~rI~~af~~~~l~~rd~--~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~~~~p~~--P~~l---v~~~~ 418 (516) T protein:vir:10 346 TRRIGVVFMMETMTRRDA--ERVTAVEIQRDALEIEQNMGGVYSLFATTMQSPVAMWGLLEAGDSF--TSDL---VDPVI 418 (516) T ss_pred HHHHHHHHhhhhhhccCC--ccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhhCCCC--Chhh---cCcce Confidence 999876422 1222222 2348889888888888888888887766554333222222 11111 1111 22232 Q ss_pred cccCCCHHHHHHHHHHHHcCCCCHHHHHHHHHh-cCccc---hhhhhHHHHHHhhccccccccchhHHhhhhhhHHHHhH Q lcl|NC_019408. 441 LSTPIGAREMRAIQLMANDGLLPDPVFYEYMRK-AEVIS---SDMTFEEFQALRADENSFINNPDAQARQRGYTNRGQEL 516 (612) Q Consensus 441 ~~~~~d~~~~~al~~~~~~G~is~et~~~~lqr-~~vl~---~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~ 516 (612) +.. +.+|..+.....|+ .+...+-. .++.+ +-+++++..+.++.-. +.+...=+..++ . T Consensus 419 -v~~-----i~~L~raq~~~~i~--~~~q~i~~~~q~~p~v~d~id~d~~~~~~a~~~---gvp~~~irs~ee------v 481 (516) T protein:vir:10 419 -ITG-----IEALGRMAELDKLA--NFAQYMSLPLQWPEPVLAAVKWPDYMDWVRGQI---SAELPFLKSAEE------M 481 (516) T ss_pred -ehh-----HHHHHHHHHHHHHH--HHHHHHHHHhcCChHHHhhcCHHHHHHHHHHHh---CCChhccCCHHH------H Confidence 222 22222222211111 11111111 11111 2334455455554431 111111111111 1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHH Q lcl|NC_019408. 517 EQSRMAREADFTQQKIDIQERSVAVQ--EGHAEVAHA 551 (612) Q Consensus 517 e~~r~~~e~e~~~q~~e~~~r~~~~~--~~r~~~e~~ 551 (612) ++.|+++. +.||....++ ..++.. .=..+..+. 
T Consensus 482 ~~~r~~~~-~~q~~~~~~~-~~~~~~~~~~~~~~~~~ 516 (516) T protein:vir:10 482 EQEQEAQM-QAQQAQMLEE-GVAKAVPGVIQQELKEA 516 (516) T ss_pred HHHHHHHH-HHHHHHHHHH-HhhhcccchhhhhhhcC Confidence 11111110 0000000000 000000 000011100 No 134 >protein:vir:99853 Length: 488 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164068;genbank:gi:56692600;genbank:GeneID:3192581 Probab=92.81 E-value=0.0099 Score=31.57 Aligned_cols=445 Identities=12% Similarity=0.034 Sum_probs=189.4 Q ss_pred CCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCC----C--CCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCc Q lcl|NC_019408. 2 VTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMK----G--ADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDP 75 (612) Q Consensus 2 ~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~----~--e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p 75 (612) |.+|+...-..--..++|.+.+ +. .| -++|... + -+-..|...+. -.++...++.....|...+. T Consensus 1 v~~~~l~~e~at~~~~~d~~~~---~~-~~-l~~~~~~il~~a~~g~~~~y~~l~~----D~~i~s~l~~rk~av~~~~w 71 (488) T protein:vir:99 1 MEKPALGREIATSGDGRDITRP---FI-SG-LQVPNDSILQRRGGNDLRVYEEILS----DAQVKTVWGQRQLAVVSREW 71 (488) T ss_pred CCccchhHHHHHHHhhhhhhcc---cc-CC-CCCCChHHHHhhccCCHHHHHHHhh----ChHHHHHHHHHHHHHhcCCc Confidence 8888876543322223444321 00 11 2334321 0 12356766554 45777777777777777777 Q ss_pred eee---cCCH--HHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhcchhhh Q lcl|NC_019408. 76 IVK---NLPP--KFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILDWDEVV 150 (612) Q Consensus 76 ~~~---~~p~--~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW~~~~ 150 (612) .|+ +-|. .+..++..+ ..+.+++.++..++ .++-+|.+.+=+ .|+. 
T Consensus 72 ~i~p~~~~~~~~~~ae~v~~~-l~~~~~~~~l~~~l-da~~~G~s~~Ei-------------------------~w~~-- 122 (488) T protein:vir:99 72 KVEAGGDRPIDQAAAEHLEQQ-LQRVGWDRVTSKML-FGVFYGYAVSEL-------------------------IYGR-- 122 (488) T ss_pred eEEcCCCChHHHHHHHHHHHH-HhCCCHHHHHHHHH-hhhhhcceeEEE-------------------------EEee-- Confidence 774 1111 222332221 23357899999988 477788665443 2431 Q ss_pred ccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeecccccccc Q lcl|NC_019408. 151 DMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPS 230 (612) Q Consensus 151 ~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~ 230 (612) .||...+..|..+... -|.+. .++ + -++|. T Consensus 123 -~~g~~~~~~l~~r~~~----------~f~~d----------~~~---------------~----l~~~~---------- 152 (488) T protein:vir:99 123 -DDRYITLEAIKVRNRR----------RFRYD----------QDG---------------G----LRLLT---------- 152 (488) T ss_pred -cCCeeeEeeeeeeccc----------ceeec----------CCC---------------c----eEEec---------- Confidence 2444333333332210 00000 000 0 00000 Q ss_pred ccccceeEEEEEeeCCCceecceeeeccCCccccceeEEE---eecCCCCCCcCcCchHHHHHHH-HHHHhhhHHHHHHH Q lcl|NC_019408. 231 GEVKLAYVQYLYEEDPESRPIARIVPTVRGEPLDFIPFKF---FGASGNTADVEKPPLLDICDLN-LSHYRTYAELEYGR 306 (612) Q Consensus 231 g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~---~~~~~~~~~~~~pPLldLA~ln-l~HY~~~sD~~~~l 306 (612) ..+ ...|.+|+ .|+-| .+....+.-.+.+.|..++..- .++|- -.+.-.-+ T Consensus 153 ------------~~~-----------~~~g~~lp-~~~~~i~~~~~~~~g~p~g~gLl~~~~w~~~fK~~~-~~~w~~f~ 207 (488) T protein:vir:99 153 ------------PNN-----------MFEGEPCP-APYFWHFSTGADNDDEPYGLGLAHWLYWPVFFKRNG-IKFWLIFL 207 (488) T ss_pred ------------cCC-----------CCCccccc-cCceEEEEeecCCCCCcccchHHHHHHHHHHHHHhh-HHHHHHHH Confidence 000 01122232 23222 2222333333455555555543 34444 35566667 Q ss_pred HHhccceeeeecC----CCCCCce-----EEEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHH--HHHHhh Q lcl|NC_019408. 
307 LFTALPVYYAPGT----DSEGTGE-----YHIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAA--IGGRMM 375 (612) Q Consensus 307 ~~~~~P~l~i~G~----~~~~~~~-----l~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~--lGa~ll 375 (612) ..-|+|+++..-. +++..+. ..||++++..+|.|.++.|++.++.+.......++-..++|.. +|.. + T Consensus 208 E~yG~P~~igky~~~~a~~~ek~~l~~av~~~~~~~~~viP~~~~ie~~ea~~~~~~~~~~li~~~d~~Isk~iLGqt-l 286 (488) T protein:vir:99 208 DKFGMPTAVGRYDDKTATPEDKAKLLAALHAIQTDSAIIMPAGMQAELLEAGRSGTADYKTLHDTMDATIAKVGLGQV-A 286 (488) T ss_pred HHcCCceeeeecCCCCCCHHHHHHHHHHHHHHhcCcEEEecCCceeEEeecCCCChHHHHHHHHHHHHHHHHHHhhhh-h Confidence 7779999988621 1111111 3479999999999999999998877766667777777777743 4544 4 Q ss_pred hccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHcCCcCCCCcceEEEeeccccccCCC-HHHHHHH Q lcl|NC_019408. 376 PGASKSVSESNNQTVLREANEQSLLLNIIQACESGMTD-VVRWWLMWRDVPLADTENLRYEVNTDFLSTPIG-AREMRAI 453 (612) Q Consensus 376 ~~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~-~l~~~a~w~g~~~~~~~~~~v~ln~dF~~~~~d-~~~~~al 453 (612) ....+.+ |--.......-...++.+.+..+++.++. ++.+++.|-. ++..-..|.+.. .. ..| ...+..+ T Consensus 287 ts~~~~G--s~a~~~vh~~v~~d~~~aDa~~i~~tln~~li~~l~~~N~---~~~~~p~~~~~~--~e-~edl~~~a~~~ 358 (488) T protein:vir:99 287 STQGTPG--RLGNDDLQADVRLDLVKADADLICESFNLGPARWLTEWNF---PGAQPPRVYRVI--EE-PEDITAKAERD 358 (488) T ss_pred ccccccc--chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCc---CCcCCceeEecC--CC-cccHHHHHHHH Confidence 3332221 22234455555667788899999999974 8888888843 122223333321 11 112 2235556 Q ss_pred HHHHHc-CC-CCHHHHHHHHHhcCccchhhhhHHHHHHhhccccccccchhHHhhhhhhHHHHhHHHHHHHHHHHHHHHH Q lcl|NC_019408. 454 QLMAND-GL-LPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFINNPDAQARQRGYTNRGQELEQSRMAREADFTQQK 531 (612) Q Consensus 454 ~~~~~~-G~-is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~e~~r~~~e~e~~~q~ 531 (612) ..+.+. |. |+.+.+. ++.|+..+... ++. +. ..+........+.......-..+. +.+.+... 
T Consensus 359 ~~l~~~~G~~i~~~~i~---e~~Gip~~~~~-~~~---~~-~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~ 423 (488) T protein:vir:99 359 EKVFRMSGFRPTRGYVQ---ETYGVEVESTQ-AEA---TA-PTPSTEFAEGDQPSDPAAAMAPQL-------AEAMQPVV 423 (488) T ss_pred HHHHhhcCCCCCHHHHH---HHcCCCCcccc-ccc---cc-CCCcccCCCCCCCCCchHHHHHHH-------HHHHHHHH Confidence 666664 65 6655333 45577543321 111 11 111111100000000000000000 00000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHH------HHHHH-HHHHHHHHHHHHhhccccC Q lcl|NC_019408. 532 IDIQERSVAVQEGHAEVAHAAGSTSISGSRKLGD------PEQAK------PAVAD-QATIDNAKKQTANAAKVAA 594 (612) Q Consensus 532 ~e~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~------e~q~k------~~~~e-q~~~~~~~k~~~~~a~~~~ 594 (612) ..--+.- +..++ +...-++.+.+..+ ..+-. -.-++ .-+.+. ..+..+.... T Consensus 424 ~~~~~~i------~~~l~--~a~s~ee~~~~L~~l~~~~d~~~l~~~l~~a~~~a~l~G~~~~---~~e~~~~~~~ 488 (488) T protein:vir:99 424 GNWTTQL------RTLIE--QASSLEDLRERLLDLAPQLSLDQYAQAMAEGLEAAHLAGRNDV---QEELDGREQI 488 (488) T ss_pred HHHHHHH------HHHHH--hcCCHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHhhhhhH---hhhhcccCCC Confidence 0000000 00000 00000111111110 00000 00000 000000 0011111111 No 135 >protein:vir:8883 Length: 543 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813772;genbank:gi:29366727;genbank:GeneID:1258836 Probab=92.54 E-value=0.011 Score=31.33 Aligned_cols=487 Identities=10% Similarity=0.057 Sum_probs=180.3 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcC-hHHHHhcccccCCCC-CCCCHHHHHHHHhhccCCchHHHHHHH----hhchhhcCC Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAG-QREIKRKAEAYLPAM-KGADGDDYAIYLQRATFFNMLAQTRDG----MTGMVFRRD 74 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G-~~~vr~~g~~YLPk~-~~e~~~~Y~~rl~rA~~~n~~~~tv~~----~~G~vf~k~ 74 (612) ...-=........|..+++--.- ...|+....-.||.. +.+++..- ..+. -.|-+...+.++. 
|.+.+| | T Consensus 5 ~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~-~~~~-~~~dst~~~a~~~Laa~l~~~lt--P 80 (543) T protein:vir:88 5 KREGLAEEGAKAVYERLKNDRVPYETRAENCAKVTIPSLFPKDSDNSS-TDYT-TPWQAVGARGLNNLSAKVMLALF--P 80 (543) T ss_pred ccCcchHHHHHHHHHHHHHHHhHHHHHHHHHHHHhccccCCCCCCccc-cccc-ccccchHHHHHHHHHHHHHHhhc--C Confidence 11111122222333333321111 222333332233431 11111110 1111 1233333444444 444444 2 Q ss_pred ce-e-e-cCC--------------HHHHHHHhccC------CCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhcc Q lcl|NC_019408. 75 PI-V-K-NLP--------------PKFKDAVRRFA------KDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVA 131 (612) Q Consensus 75 p~-~-~-~~p--------------~~l~~~~~d~D------~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~ 131 (612) +. + . .++ +.++.|++.|. ...++++.-+-.++.+.+.+|-+-+++|-+... ..+. T Consensus 81 ~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~ly~~~~~~~--~~~~ 158 (543) T protein:vir:88 81 LQSWMKLKVSEWQAKQLVSDPSQLAVVEQGLGMVERILMSYMEANSYRVTLFELIRQLALAGTALIYLPPPDAS--SNSY 158 (543) T ss_pred CCcccccccChHHHhcccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeeeccCccc--ccee Confidence 21 1 0 011 23444443322 124557788888899999999999999854321 1111 Q ss_pred CceEEEechhhhhcchhhhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeeccccccc Q lcl|NC_019408. 132 TSFAVGYSAENILDWDEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGG 211 (612) Q Consensus 132 rPy~~~~~ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g 211 (612) +| +..|+-.+..- ..|+...+.-|+.++.+.... -...|... +. .. ..... 
T Consensus 159 ~~-~~~~pl~~y~v-----~~d~~G~v~~i~r~~~~~~~~---l~~~~~~~------------------v~-~~-~~~~p 209 (543) T protein:vir:88 159 NP-MKLYTLHNHVV-----QRDAFGNVLQIVTLDKVAYAA---LPEDVRNS------------------LS-GG-QEYKP 209 (543) T ss_pred cc-eEEeEcceEEE-----eeCCCCCeeeeeeeeeccHHH---HhHHhhHH------------------HH-HH-hhcCC Confidence 22 22333222211 124444455555555433211 11112110 00 00 00111 Q ss_pred ccceeeeeeeeccccccccccccceeEEEEEeeCCCceecceeeeccCC-cc---ccceeEEEeecCCCCCCcCcCc--- Q lcl|NC_019408. 212 YSYITVYRELKLEEIEWPSGEVKLAYVQYLYEEDPESRPIARIVPTVRG-EP---LDFIPFKFFGASGNTADVEKPP--- 284 (612) Q Consensus 212 ~~~~~~~R~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g-~~---l~~IP~v~~~~~~~~~~~~~pP--- 284 (612) +..+++|...... ...+ .+ .+|.+- .+.+++...| .+ +++||+.|.-..+..+ |..| T Consensus 210 ~~~~~v~~~V~pr------~~~~-~~--~~~~~~-----~~~~v~~~~~~~~~~e~P~i~~Rw~~~~ge~Y--Grgp~~~ 273 (543) T protein:vir:88 210 EQELEVYTHIYID------DESG-DF--LSYQEI-----EGVEVDGSDGQYPQDALPWIAVRWTKRDGEHY--GRSHVEE 273 (543) T ss_pred ccceEEEEEEEee------cCCC-cc--cccccc-----cCeeeecCCCccccccCCceeeeeeecCCCcc--ccchHHH Confidence 1223344322110 0000 01 111100 0112222222 23 4556666665554444 4445 Q ss_pred -hHHHHHHHHHHHhhhHHHHHHHHHhccceeee-ecCCCCCCceEEEeccccccCCCCCceeEEecC-chhHHHHHHHHH Q lcl|NC_019408. 285 -LLDICDLNLSHYRTYAELEYGRLFTALPVYYA-PGTDSEGTGEYHIGPNMVWEVPQGSEPGILEYT-GQGLKALETALN 361 (612) Q Consensus 285 -LldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i-~G~~~~~~~~l~iG~~~~~~lp~~~~~~~lE~~-g~~l~~~~~~l~ 361 (612) |-|+..||. -+.+-+. .++.+.-|.+.+ ...-. ....+.-|..+.+.-...++...++.. 
+..+..+...|+ T Consensus 274 ~l~D~k~L~~---l~~~~l~-~~~~~~~pp~~v~~~g~~-~~~~~~~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~ 348 (543) T protein:vir:88 274 YLGDLNSLES---LNEAMIK-FAMISSKVVGLVNPNGIT-QVRRLVKAQTGDFVAGRKADIEFLQLEKTADFTVAKSVAD 348 (543) T ss_pred HHHHHHHHHH---HHHHHHH-HHHHHhcCceeecccccc-chhhcccCCCceeecCCCCcceeeecccccchhHHHHHHH Confidence 447777775 2223344 444444454444 22111 122344454444433334566666544 456888999999 Q ss_pred HHHHHHHHHHH-HhhhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHcCCc-CCCCcceEE Q lcl|NC_019408. 362 DKERQIAAIGG-RMMPGASKSVSESNNQTVLREANEQSLLLNIIQACESGMT-----DVVRWWLMWRDVP-LADTENLRY 434 (612) Q Consensus 362 ~~e~qm~~lGa-~ll~~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~-----~~l~~~a~w~g~~-~~~~~~~~v 434 (612) +++..|+.+=. .++.... ...-||+....+...-...|..+-.++.+-+- +++.++-+ .|+- ....+.+.+ T Consensus 349 ~~~~rI~~af~~~~~~~~~-~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r-~g~lP~~p~~~v~~ 426 (543) T protein:vir:88 349 AIEARLSYVFMLNSAVQRS-GERVTAEEIRYVASELEDTLGGVYSILSQELQLPIVRVLLNQLQA-TQQIPNLPQEAVEP 426 (543) T ss_pred HHHHHHHHHHhhhhhccCC-CCcccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHh-cCCCCCCchhceee Confidence 99999975321 1121111 22358999999888888888888888765443 33443333 2321 111223333 Q ss_pred EeeccccccCCC----HHHHHHHHHHHH-cCCCCHHHHHHHHHhcCccchhhhhHHHHHHhhcccccccc-chhHHhhhh Q lcl|NC_019408. 435 EVNTDFLSTPIG----AREMRAIQLMAN-DGLLPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFINN-PDAQARQRG 508 (612) Q Consensus 435 ~ln~dF~~~~~d----~~~~~al~~~~~-~G~is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~-~~~~~~~~~ 508 (612) +| . ..+. .+++..|....+ -|.|+.. ++ .+.++++...+.+++-. +- +...-+.+. 
T Consensus 427 ~~----v-s~l~~l~r~~~~~~l~~~~~~v~~~~~p---------~v-ld~id~d~~~~~~a~~~---Gv~~~~i~r~~~ 488 (543) T protein:vir:88 427 TV----T-TGAEALGRGQDLDKLTQFLNAVATVSQL---------NG-DPDLNVNNIKLRLANAI---GIDTAGLLLTEA 488 (543) T ss_pred eE----E-ecHHHHHHHHHHHHHHHHHHHHHhccch---------hh-hccCCHHHHHHHHHHHh---CCChhhhcCCHH Confidence 33 2 1221 112222222111 1122211 11 13456666666666432 21 111111111 Q ss_pred hhHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019408. 509 YTNRGQELEQSRMAREADFTQQKIDIQERSVAVQEGHAEVAHAAGSTSISGSRKLGDPEQAKPAVADQATIDNAKKQTAN 588 (612) Q Consensus 509 e~~r~~~~e~~r~~~e~e~~~q~~e~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~e~q~k~~~~eq~~~~~~~k~~~~ 588 (612) +.+ +.|+ +|+ +++ +...+. .++.. ...+.. ..- -++.+ ++-. T Consensus 489 e~~------~~~~------q~~--~q~----~~~~~~--------~~~~~--------~~~~~~-~~~--~~~~~-~~~~ 530 (543) T protein:vir:88 489 EKA------QAQS------QEM--LKQ----GGLNAA--------AGIGS--------GVAAQA-TAS--PEAME-SAMD 530 (543) T ss_pred HHH------HHHH------HHH--HHH----HHHHHH--------HHHhh--------chhhhh-ccC--hHHHH-HHhh Confidence 100 0000 000 000 000000 00000 000000 000 00111 1111 Q ss_pred hccccCCCchhhcCCC Q lcl|NC_019408. 589 AAKVAAQPPAPAAPGA 604 (612) Q Consensus 589 ~a~~~~~~~~~~~~~~ 604 (612) ++- -+|++..+.- T Consensus 531 ~~~---~~~~p~~~~~ 543 (543) T protein:vir:88 531 TAG---VQPGPIATQV 543 (543) T ss_pred hcC---CCCCCCCCCC Confidence 122 2222222222 No 136 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=92.44 E-value=0.011 Score=31.24 Aligned_cols=429 Identities=14% Similarity=0.041 Sum_probs=168.6 Q ss_pred CCCcHHHHHHHHHHHHH--HHHhcChHHHHhcccccCCCCCCCCH--HHHHHHHhhc----cCCchHHH----HHHHhhc Q lcl|NC_019408. 
1 MVTHPEYQYWRPEWTKL--RDVMAGQREIKRKAEAYLPAMKGADG--DDYAIYLQRA----TFFNMLAQ----TRDGMTG 68 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i--~d~~~G~~~vr~~g~~YLPk~~~e~~--~~Y~~rl~rA----~~~n~~~~----tv~~~~G 68 (612) ||+. .|.+..+....- .-.|.|....+. -...|.+....+ .....-..|| .=.++.+. .++..+| T Consensus 3 ~~~~-~~~a~~~~~~~~~~~~~y~aa~~~~~--~~~~~~~s~d~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~~vVG 79 (495) T protein:vir:10 3 MTPS-GYQSLASGLLVPVGASAYEGASGGHR--WQDIGDYGPDTAVASGIQTLRARSHHNVRNNPWATNAVATWVAAAVG 79 (495) T ss_pred cccc-cccccchhhhhHHHhhhhhccccCcc--cCCCCCCChhHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcC Confidence 3333 233222221111 112333222221 122332221111 1111111222 12234444 4444444 Q ss_pred hhhcCCceee------cCCHHHHHHHhccCCCCC-CHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCc-eEEEech Q lcl|NC_019408. 69 MVFRRDPIVK------NLPPKFKDAVRRFAKDGS-SHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATS-FAVGYSA 140 (612) Q Consensus 69 ~vf~k~p~~~------~~p~~l~~~~~d~D~~G~-~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rP-y~~~~~a 140 (612) -=|+-.+... .+-.....|.++||-.|. +++.+.+.+++..+..|=|++.+-+..... .+.-| =+-+|.| T Consensus 80 ~Gi~p~~~~~~~~~~~~ie~~w~~wa~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~--g~~~~~~lqliep 157 (495) T protein:vir:10 80 NGLTPRWRMKEQELRQELQELWGDWVNEADFDEVQSFYGLQALVVRTVINSGEAFVIKKPRPLSE--GLSVPLQLQIIEP 157 (495) T ss_pred CCcccccCCchHHHHHHHHHHHHHhhcCcccccccCHHHHHHHHHHHHHhCCceEEEEeecccCC--CCccceEEEEech Confidence 4343322221 122335566789999985 999999999999999999998876643210 00111 2445555 Q ss_pred hhhhcchh-hhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeee Q lcl|NC_019408. 141 ENILDWDE-VVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYR 219 (612) Q Consensus 141 e~IinW~~-~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R 219 (612) +.|=+-.- ...-+|.. 
|+ +-.+ ++ ..|.++ -|+ T Consensus 158 d~l~~~~~~~~~~~g~~------i~---------------------~GIe--~d--~~Gr~v---------------aY~ 191 (495) T protein:vir:10 158 DMLASDIPDETLPSGGY------VK---------------------GGIR--FS--NGGKRK---------------AYC 191 (495) T ss_pred hhcCCCCCCCCCCCCCE------EE---------------------eceE--EC--CCCceE---------------EEE Confidence 55533200 00000000 00 0000 00 111111 111 Q ss_pred eeeccccccccccccceeEEEEEeeCCCceecceeeeccCCcccccee---EEEeecCCCCCCcCcCchHHHHHH-HHHH Q lcl|NC_019408. 220 ELKLEEIEWPSGEVKLAYVQYLYEEDPESRPIARIVPTVRGEPLDFIP---FKFFGASGNTADVEKPPLLDICDL-NLSH 295 (612) Q Consensus 220 ~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP---~v~~~~~~~~~~~~~pPLldLA~l-nl~H 295 (612) + +...++.. .....+..+..|| ++.+...+.+-.-+.|-|..|-.| .+.. T Consensus 192 i---------------------~~~hpgd~-----~~~~~~~~~~rvpA~~vlH~f~~r~gQ~RGis~la~i~~l~~l~~ 245 (495) T protein:vir:10 192 F---------------------YRNHPAES-----SLIGDPVDTVWIKAEHVLHVTVLTVRSDAGAPWFQLLLRLNELDQ 245 (495) T ss_pred E---------------------eecCCCcc-----cccccccceeeechhheEeccccCCCcccCcchhHHHHHHHHhhH Confidence 1 11111100 0011111233344 222222233333355533332211 2223 Q ss_pred HhhhHHHHHHHHHhccceeeeecCCCC---------------CCceEEEeccccccCCCCCceeEEecCch--hHHHHHH Q lcl|NC_019408. 296 YRTYAELEYGRLFTALPVYYAPGTDSE---------------GTGEYHIGPNMVWEVPQGSEPGILEYTGQ--GLKALET 358 (612) Q Consensus 296 Y~~~sD~~~~l~~~~~P~l~i~G~~~~---------------~~~~l~iG~~~~~~lp~~~~~~~lE~~g~--~l~~~~~ 358 (612) |.. +.+.... +.+.-+.+|+.-+.. ....+.++++++..|++|-+++|+.++.. .+. . T Consensus 246 y~d-ael~~a~-i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~p~~~~~---~ 320 (495) T protein:vir:10 246 YED-AELVRKK-TAALFAAFIQEATADSTGGPTIGQPKRSKGGKRITGLNPGTLQYLQPGQEVKFSNPADVGTTYE---P 320 (495) T ss_pred HHH-HHHHHHH-HhhhheeeeecCCCccccccccCccccccCcccceecCCceeeecCCCCeeeeeCCCCCCCCHH---H Confidence 333 2233333 344445555421110 11135689999999999999999998733 333 3 Q ss_pred HHHHHHHHHHH-HHH--Hhhhcc-ccchhHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHcCC-cCCCCcc Q lcl|NC_019408. 
359 ALNDKERQIAA-IGG--RMMPGA-SKSVSESNNQTVLREANEQSLLL--NIIQACESGMTDVVRWWLMWRDV-PLADTEN 431 (612) Q Consensus 359 ~l~~~e~qm~~-lGa--~ll~~~-~~~~~esa~~~~~~~~~~~s~L~--~~a~~~~~a~~~~l~~~a~w~g~-~~~~~~~ 431 (612) .+..+...+.+ +|. .++... ++..=.|+.+..+++-.....++ .++..++.-+-..+--++...|. .+++--+ T Consensus 321 f~~~~lr~iaaglGi~Ye~ltgD~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~ 400 (495) T protein:vir:10 321 WLRYQLLSIAKGYGITYEMLTGDLRGVNYSSIRAGLLEFRRLCQQVQHHMIIHQFCRPVGRWFMDFAVASGAVVIPDYLQ 400 (495) T ss_pred HHHHHHHHHHhhcCCCHHHHhcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCchh Confidence 33333333321 221 112111 11112244555555555444433 24445555443333333333443 1111000 Q ss_pred eE-EEeeccccccC---CCHH-HHHHHHHHHHcCCCCHHHHHHHHHhcCccchhhhhHHHHHHhhccccc---cccc--- Q lcl|NC_019408. 432 LR-YEVNTDFLSTP---IGAR-EMRAIQLMANDGLLPDPVFYEYMRKAEVISSDMTFEEFQALRADENSF---INNP--- 500 (612) Q Consensus 432 ~~-v~ln~dF~~~~---~d~~-~~~al~~~~~~G~is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~---~~~~--- 500 (612) .. ...+-+|.... +|+. ++++.+.++.+|..|++....+ +|. |++++.+.++.+... ++.+ T Consensus 401 ~~~~~~~~~w~~p~~~~vDP~Ke~~A~~~~i~~G~~s~~~~~a~---~G~-----D~~~v~~q~a~e~~~~~~~Gl~~~~ 472 (495) T protein:vir:10 401 RRRYYNRVSWRTPRWEEVDPLKKHLADLGDVRAGFAPISDKQAE---RGY-----DMEELFDMISDANQLIDEYDLRLDS 472 (495) T ss_pred hhHhhhccccccCCccccChHHHHHHHHHHHHcCCCCHHHHHHH---cCC-----CHHHHHHHHHHHHHHHHHcCCCCCC Confidence 00 00112333333 3444 8999999999999999877654 343 444444444333110 0000 Q ss_pred hhHHhhh----hhhHHHHhHHHH Q lcl|NC_019408. 501 DAQARQR----GYTNRGQELEQS 519 (612) Q Consensus 501 ~~~~~~~----~e~~r~~~~e~~ 519 (612) +...... 
...........+ T Consensus 473 ~p~~~~~~~~~~~~~~~~~~~~e 495 (495) T protein:vir:10 473 DPRYVNGSGAEQKSVMEAALNNE 495 (495) T ss_pred CCCcCCCccCCCCCCCCCCCCCC Confidence 0000000 000000000000 No 137 >protein:vir:77981 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467939;genbank:gi:157265380;genbank:GeneID:5600471 Probab=92.42 E-value=0.011 Score=31.22 Aligned_cols=399 Identities=9% Similarity=-0.048 Sum_probs=159.5 Q ss_pred CCCcHHHH--HHHHHHHHHHHHhcChHHHHhcccccCCCC-----CCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcC Q lcl|NC_019408. 1 MVTHPEYQ--YWRPEWTKLRDVMAGQREIKRKAEAYLPAM-----KGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRR 73 (612) Q Consensus 1 ~~~hP~y~--~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~-----~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k 73 (612) .++-|.-. ...+.|..++.-..+-.. .|- .+++. ...+-+-|+..+.-+ ++...++.....|... T Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~g~-~~~~~~~iLr~~~~~~ly~~m~~D~----hi~s~l~~Rk~av~~~ 82 (448) T protein:vir:77 11 LVPGPGSIDPSDVPKLEGASVPVMSTSY---DVV-VDREFDELLQGKDGLLVYHKMLSDG----TVKNALNYIFGRIRSA 82 (448) T ss_pred cCCcccccchhhhhhhccchhhhccccc---ccc-cccchhHhhccccchHHHHHHhhCh----HHHHHHHHHHHHHhcC Confidence 55444322 333444444443322111 110 01110 012335677776544 4455555555555555 Q ss_pred Cceee---cCCH--H----HHHHHhccC--CCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhh Q lcl|NC_019408. 74 DPIVK---NLPP--K----FKDAVRRFA--KDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAEN 142 (612) Q Consensus 74 ~p~~~---~~p~--~----l~~~~~d~D--~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~ 142 (612) +..|+ .-|. . +..++.+.| ....+++.++..++ .++-+|.+.+=+. 
T Consensus 83 ~w~v~p~~~~~~d~~~ae~v~~~l~~~~~~~~~~~f~~~i~~~l-da~~~G~s~~Eiv---------------------- 139 (448) T protein:vir:77 83 KWYVEPASTDPEDIAIAAFIHAQLGIDDASVGKYPFGRLFAIYE-NAYIYGMAAGEIV---------------------- 139 (448) T ss_pred CceEecCCCCHHHHHHHHHHHHHhhchhhhhccCCHHHHHHHHH-HhhhhcceeEEEE---------------------- Confidence 55553 1111 1 223333222 23457888888885 6888887654432 Q ss_pred hhcchhhhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeee Q lcl|NC_019408. 143 ILDWDEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELK 222 (612) Q Consensus 143 IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~ 222 (612) |+. ..+|+..+..+..+.... ..-|.+ ..+ + + -++|.. T Consensus 140 ---w~~--~~dg~~~~~~l~~r~~~~-------~~~f~~---------~~~-~---------------~----l~~~~~- 177 (448) T protein:vir:77 140 ---LTL--GADGKLILDKIVPIHPFN-------IDEVLY---------DEE-G---------------G----PKALKL- 177 (448) T ss_pred ---Eee--cCCCceeeccccccCCCc-------cceeee---------ecC-C---------------c----eEEEec- Confidence 321 113333222222211000 000000 000 0 0 000000 Q ss_pred ccccccccccccceeEEEEEeeCCCceecceeeeccCCccccceeEEEe---ecCCCCCCcCcCchHHHHHHHH-HHHhh Q lcl|NC_019408. 223 LEEIEWPSGEVKLAYVQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFF---GASGNTADVEKPPLLDICDLNL-SHYRT 298 (612) Q Consensus 223 ~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~---~~~~~~~~~~~pPLldLA~lnl-~HY~~ 298 (612) .+. +.. ..+..+| -.||+-++ .-...+.-.+..-|..++..-+ ++|-. T Consensus 178 ---------------------~~~--~~~--~~~~~~~---~~lP~~~~i~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~ 229 (448) T protein:vir:77 178 ---------------------SGE--VKG--GSQFVNG---LEIPIWKTVVFLHNDDGSFTGQSALRAAVPHWLAKRALI 229 (448) T ss_pred ---------------------CCc--ccc--cccCCCc---cccccceEEEEecCCcCCcccchHHHHHHHHHHHHHhhH Confidence 000 000 0000111 13453322 1111222223333444444333 33333 Q ss_pred hHHHHHHHHHhccceeeee---cCCCCCCc---------eEEEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHH Q lcl|NC_019408. 
299 YAELEYGRLFTALPVYYAP---GTDSEGTG---------EYHIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQ 366 (612) Q Consensus 299 ~sD~~~~l~~~~~P~l~i~---G~~~~~~~---------~l~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~q 366 (612) .|.-.-+..-|+|+++.. |.+....+ .|..|+.+++.+|.|..+.|++..|++- ...+.++....+ T Consensus 230 -~~w~~f~E~yG~P~~vgky~~ga~~~~~~~~~l~~av~~i~~g~~a~~iiP~g~~ie~~ea~~~~~-~~~~~i~~~d~~ 307 (448) T protein:vir:77 230 -LLINHGLERFMIGVPTLTIPKSVRQGTKQWEAAKEIVKNFVQKPRHGIILPDDWKFDTVDLKSAMP-DAIPYLTYHDAG 307 (448) T ss_pred -HHHHHHHHHcCCceeEEecCCCCCCCHHHHHHHHHHHHHHhcCCceEEEecCCceEEEEecCCCcc-CHHHHHHHHHHH Confidence 556667777889999886 33221111 1345888899999999999999887754 345566655555 Q ss_pred HH--HHHHHhhhccccchhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHcCCcCCCCcceEEEeeccccc Q lcl|NC_019408. 367 IA--AIGGRMMPGASKSVSESNNQTVLR-EANEQSLLLNIIQACESGMTD-VVRWWLMWRDVPLADTENLRYEVNTDFLS 442 (612) Q Consensus 367 m~--~lGa~ll~~~~~~~~esa~~~~~~-~~~~~s~L~~~a~~~~~a~~~-~l~~~a~w~g~~~~~~~~~~v~ln~dF~~ 442 (612) |. .+| .++....+. .++..+.-. ..-..-.+.+-+..+++.+++ ++.+++.|-.-+ +..-..|.+. T Consensus 308 Isk~iLG-qtlTs~~~~--g~~~~~~~~~~~v~~~~~~aDa~~i~~tln~~Li~~l~~lNfg~--~~~~P~~~f~----- 377 (448) T protein:vir:77 308 IARALGI-DFNTVQLNM--GVQAVNIGEFVSLTQQTIISLQREFASAVNLYLIPKLVLPNWPG--ATRFPRLTFE----- 377 (448) T ss_pred HHHHHhc-ccccccccc--chhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC--CCCCCEEEec----- Confidence 53 344 444433322 222222111 122233456778888888885 777777764211 1222344432 Q ss_pred cCCCHHHHHHHHHHHHcCCCCHHHHHHHHH-hcCccchhhhhHHHHHHhhccccccccchhHHhhhhhhHHHHhHHHHHH Q lcl|NC_019408. 443 TPIGAREMRAIQLMANDGLLPDPVFYEYMR-KAEVISSDMTFEEFQALRADENSFINNPDAQARQRGYTNRGQELEQSRM 521 (612) Q Consensus 443 ~~~d~~~~~al~~~~~~G~is~et~~~~lq-r~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~e~~r~ 521 (612) ..++.++.++.+.... +..+.+ +.||..+.....+.... ...+..+. ..+.+.-.......|. 
T Consensus 378 -~~e~eDl~~~a~~~~~-------l~~~~~~~~~ip~~~~~~~~~~~~--~~~~~~~~------~~~~~~~~~~~~~~~~ 441 (448) T protein:vir:77 378 -MEERNDFSAAANLMGM-------LINAVKDSEDIPTELKALIDALPS--KMRRALGV------VDEVREAVRQPADSRY 441 (448) T ss_pred -CCChhhHHHHHHHhHH-------HHHHHHHHhcCCccCCcCCCCCch--hcccccCC------CCCCCchhhcchhhHH Confidence 1235666665554321 222222 34553221100000000 00000000 0000000000000010 Q ss_pred HHHHHHHHHHHHHHHHH Q lcl|NC_019408. 522 AREADFTQQKIDIQERS 538 (612) Q Consensus 522 ~~e~e~~~q~~e~~~r~ 538 (612) ....+|. T Consensus 442 ----------~~~r~~~ 448 (448) T protein:vir:77 442 ----------LYTRRRR 448 (448) T ss_pred ----------HHhhhcC Confidence 0000000 No 138 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=91.62 E-value=0.015 Score=30.59 Aligned_cols=384 Identities=13% Similarity=0.076 Sum_probs=176.3 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCC-HHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeec Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGAD-GDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKN 79 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~-~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~ 79 (612) |+.+=-|.. ++++|.. . .+|...+..-+ ..-+..|. ...+++..|+..+--++|+...|+. T Consensus 1 ~~~~D~~~n---------~~~gg~~----~-~~~~~~~~~~~~~~l~a~Y~----~~~l~~~~Vd~~aed~~r~g~~i~~ 62 (422) T protein:vir:10 1 MVKTDSYAN---------IFLGGSD----G-SEIYGSLQNQAPTILASLYA----DNALVRRIIDTIPETALAAGFHIDG 62 (422) T ss_pred CccchhhHH---------HHcCCCC----C-ccccCcccccCHHHHHHHHH----hChhhHHHHhhhhHHHhcCCccccC Confidence 433333322 2345543 2 22322222111 12233333 3345566777777778899888864 Q ss_pred CCH--HHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhh-----hhccCceEEEechhhhhcchhhhcc Q lcl|NC_019408. 
80 LPP--KFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRK-----GAVATSFAVGYSAENILDWDEVVDM 152 (612) Q Consensus 80 ~p~--~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~-----~~~~rPy~~~~~ae~IinW~~~~~v 152 (612) .++ .+..-++. ..+..-++.+++.+..+|.++|+++....... ..+.-.++..+++-+|.- T Consensus 63 ~~~~~~~~~~~~~-----l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~~~~Pl~~~g~~~~l~v~d~~~i~~------- 130 (422) T protein:vir:10 63 IDDEPAFWSRWDD-----LEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKV------- 130 (422) T ss_pred CCHHHHHHHHHHH-----hhHHHHHHHHHHhhccccceEEEEEecCCCCccccccccCceeeEEeeccccccc------- Confidence 443 33333443 46788899999999999999999876321000 001111122221111000 Q ss_pred CCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeecccccccccc Q lcl|NC_019408. 153 GGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGE 232 (612) Q Consensus 153 ~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~ 232 (612) .....|++ .+.|.....|++.... . T Consensus 131 -------------------~~~~~dp~-----------------------------s~~fg~P~~y~v~~~~-------~ 155 (422) T protein:vir:10 131 -------------------QTREENPR-----------------------------NARFGEPLTYRITTNE-------S 155 (422) T ss_pred -------------------hhcccCcc-----------------------------ccccCcceEEEEecCC-------C Confidence 00001111 1222222333321100 0 Q ss_pred ccceeEEEEEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHH-HHHHHHHhcc Q lcl|NC_019408. 233 VKLAYVQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAE-LEYGRLFTAL 311 (612) Q Consensus 233 ~~~~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD-~~~~l~~~~~ 311 (612) .. .+.+|.+ . +....|.+++.+ .....+ ..+.+||..++.=-|..|.+.+. 
-.+++|...+ T Consensus 156 ~~---~~~iH~S--------R-li~~~g~~~p~~----~~~~~~--~~G~S~l~~~~~~~i~~~~~~~~~~~~l~~~~~~ 217 (422) T protein:vir:10 156 DM---FYDVHYS--------R-IHIIDGERIPNV----MRRQND--GWGRSVLSSDILDSIKDYTNCERLATQLLKRKQQ 217 (422) T ss_pred Cc---ceeeccc--------e-eEEeCCCCchhh----hcccCC--cccchhHHHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 00 0111110 0 111123333221 122222 24678888877666777777766 4777888889 Q ss_pred ceeeeecCCCC---CCce-----------EEEeccccccC-CCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHH--- Q lcl|NC_019408. 312 PVYYAPGTDSE---GTGE-----------YHIGPNMVWEV-PQGSEPGILEYTGQGLKALETALNDKERQIAAIGGR--- 373 (612) Q Consensus 312 P~l~i~G~~~~---~~~~-----------l~iG~~~~~~l-p~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~--- 373 (612) .++.+.|+... .... ...|.++.+.+ ..+.++.-+..+-+++.. .++...++|...-.- T Consensus 218 ~v~~~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~l~~~~e~~e~~~~~lsgl~~---~~~~~~~~iaaa~~IP~t 294 (422) T protein:vir:10 218 AVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGGIDA---FLDKKFDRIVALSGIHEI 294 (422) T ss_pred ccccchhHHHhcCCccchHHHHHHHHHHHHhcCCccceeEecCCcceEEEecccCChHH---HHHHHHHHHHhhhCCCee Confidence 98888875221 1110 11344555544 456778888877777653 444555555433211 Q ss_pred hhhccc-cchhHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHcCCcCCCCcceEEEeeccccccCCCHH--- Q lcl|NC_019408. 374 MMPGAS-KSVSESNNQTVLREANEQSLLLNII-QACESGMTDVVRWWLMWRDVPLADTENLRYEVNTDFLSTPIGAR--- 448 (612) Q Consensus 374 ll~~~~-~~~~esa~~~~~~~~~~~s~L~~~a-~~~~~a~~~~l~~~a~w~g~~~~~~~~~~v~ln~dF~~~~~d~~--- 448 (612) .|..++ +.-+.|+ ..+...=...+.++- ..+..+++..++++.+ .++++|.+|+-......+-. 
T Consensus 295 ~L~G~s~~Glnatg---d~d~~~yyd~i~~~Qe~~l~p~l~~l~~~i~~--------s~~~~~~f~pL~~~sekekaei~ 363 (422) T protein:vir:10 295 ILKNKNVGGVSSSQ---NTALETFHKLVDRKRNAELLPILEFLIPFIVN--------AEEWSVEFNPLAQESSKDKAEIL 363 (422) T ss_pred eeccCCcccccccc---hHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc--------cCCcEEEeCCCCCCCHHHHHHHH Confidence 121211 1111121 112222222233332 2345566666666532 24678887754444433222 Q ss_pred --HHHHHHHHHHcCCCCHHHHHHHHHhc----CccchhhhhHHHHHHhhccccccccchh Q lcl|NC_019408. 449 --EMRAIQLMANDGLLPDPVFYEYMRKA----EVISSDMTFEEFQALRADENSFINNPDA 502 (612) Q Consensus 449 --~~~al~~~~~~G~is~et~~~~lqr~----~vl~~~~~~eee~~ria~e~~~~~~~~~ 502 (612) .+++...++++|.|+....++.|+.. ++. ++...++........++....++. T Consensus 364 ~~~a~a~~~~~~~g~i~~~e~r~~L~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~d 422 (422) T protein:vir:10 364 EKNVNSIAALIAAGAMDIDEARDTLRTIAPEVKIN-DGSVETEVTISETSNDPLEVPTDD 422 (422) T ss_pred HHHHHHHHHHHhcCCCCHHHHHHHhhhhcccccCC-CCCCccccchhhcCCCCCCCCCCC Confidence 23566788999999999999999752 222 232222222211112221111111 No 139 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=90.79 E-value=0.019 Score=30.02 Aligned_cols=352 Identities=12% Similarity=0.081 Sum_probs=148.1 Q ss_pred CC-----CcHHHHHHHHHHH--HHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcC Q lcl|NC_019408. 1 MV-----THPEYQYWRPEWT--KLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRR 73 (612) Q Consensus 1 ~~-----~hP~y~~~~~~W~--~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k 73 (612) |+ ...+.......|. .+....+|.. ..|+ +. 
+.+++ .+.+...|+.+++.+-.- T Consensus 3 ~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~------~~~v------~~---~~~l~----~~~v~~~i~~ia~~ia~~ 63 (383) T protein:vir:10 3 LLTPKNFSKRNAKNMVYPSNPAFFTTTVGGMQ------LSYV------SA---LSALQ----NTNVYSVINRIASDVSSA 63 (383) T ss_pred cccccccccccccccccccchhhhhhhccCcc------cccc------ch---hHhhc----chHHHHHHHHHHHhhccC Confidence 11 1111111111110 0001111100 0000 01 12222 344556666666666655 Q ss_pred CceeecCCHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhcchhhhccC Q lcl|NC_019408. 74 DPIVKNLPPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILDWDEVVDMG 153 (612) Q Consensus 74 ~p~~~~~p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW~~~~~v~ 153 (612) |..+.. ..+..++...+ ...+-.+|.+.++...+.+|-++++++- .+ +.+|.+..+ T Consensus 64 ~~~~~~--~~~~~ll~~PN-~~~t~~~f~~~~~~~l~l~Gn~~~~i~~----------~~-~~~~p~~~~---------- 119 (383) T protein:vir:10 64 HFKTEN--TATLNRLESPS-SLIGRFSFWQGALMQLCLSGNDYIPLVG----------QN-LEHIPNSDV---------- 119 (383) T ss_pred ceeecc--cchhhhhhCCC-CCCCHHHHHHHHHHHhhhcCCeEEEEEc----------Cc-eeEeecCcc---------- Confidence 655632 23456676665 5689999999999999999999999862 11 111211110 Q ss_pred CccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccc Q lcl|NC_019408. 154 GFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEV 233 (612) Q Consensus 154 g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~ 233 (612) .|.+.. . .+ + .+|++.... .|. T Consensus 120 ------~v~~~~---~-----------------------~~----------------~----~~~~~~~~~-----~~~- 141 (383) T protein:vir:10 120 ------QINYLP---G-----------------------NM----------------G----IVYTVLESN-----DRP- 141 (383) T ss_pred ------eEEEEE---c-----------------------CC----------------c----eEEEEEEcC-----Cce- Confidence 000000 0 00 0 011110000 000 Q ss_pred cceeEEEEEeeCCCceecceeeeccCCccccceeEEEe---ecCCCCCCcCcCchHHHHHHHHHHHhhhHHH-HHHHHHh Q lcl|NC_019408. 
234 KLAYVQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFF---GASGNTADVEKPPLLDICDLNLSHYRTYAEL-EYGRLFT 309 (612) Q Consensus 234 ~~~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~---~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~-~~~l~~~ 309 (612) ..+ +..=-++.+ .....+...|.||+.-+... |........+ ...+... T Consensus 142 -----------------~~~---------~~~~evih~r~~~~~~~~~~~G~s~l~~~~~~-i~~~~~~~~~~~~~f~ng 194 (383) T protein:vir:10 142 -----------------KMV---------LRQDQMLHFRLMPDPQYRYLIGRSPLESLQNA-LNLDDKASKSNMSAMENQ 194 (383) T ss_pred -----------------EEE---------EcccceEEeccCCCCcccccccccHHHHHHHH-HHHHHHHHHHHHHHHhcc Confidence 000 000012222 11222334578888765443 3333333333 3444556 Q ss_pred ccceeeeecCCCC-CCce----------EEEec--cccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHH-HH--H Q lcl|NC_019408. 310 ALPVYYAPGTDSE-GTGE----------YHIGP--NMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAI-GG--R 373 (612) Q Consensus 310 ~~P~l~i~G~~~~-~~~~----------l~iG~--~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~l-Ga--~ 373 (612) +.|-.++.--... ..+. ..-|. +.++.++.|.++.-+..+..-+.-..+.++...+++..+ |. . T Consensus 195 ~~~~~il~~~~~~~~~e~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~afgVPp~ 274 (383) T protein:vir:10 195 INPAGKLTISNYLSDGKDLESAREEFEKANTGDNSGRLMVLPDGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSD 274 (383) T ss_pred CCcceEEEeCCCCCCHHHHHHHHHHHHHHhCccccCCccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHH Confidence 7787776521111 1110 11232 245677777776666665554443344555555665443 32 2 Q ss_pred hhhccc--cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCcceEEEeeccccccCCCHHHHH Q lcl|NC_019408. 374 MMPGAS--KSVSESNNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLADTENLRYEVNTDFLSTPIGAREMR 451 (612) Q Consensus 374 ll~~~~--~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~~~~~v~ln~dF~~~~~d~~~~~ 451 (612) ++.... .....+..+..... ...|.-++..+++.++..| .+ ..+.|.++ ++...+ ....+. 
T Consensus 275 ~lg~~~~~~~~~sn~eq~~~~~---~~~l~P~~~~ie~~l~~~l------~~------~~~~f~~~-~l~~~d-~~~~~~ 337 (383) T protein:vir:10 275 ILGGGTSTESQHSNIDQIKATY---LANLNSYVNPIVDELRLKM------NA------PDLELDIK-DMLDVD-DSILIN 337 (383) T ss_pred HcCCccCCCCccccHHHHHHHH---HHHHHHHHHHHHHHHHHhh------CC------ceEEeech-hhhccC-HHHHHH Confidence 221111 00011112222111 1246666666666665433 12 13444332 122221 233466 Q ss_pred HHHHHHHcCCCCHHHHHHHHHhcCccchhhhhHHHHHHhhccccccccchh Q lcl|NC_019408. 452 AIQLMANDGLLPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFINNPDA 502 (612) Q Consensus 452 al~~~~~~G~is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~ 502 (612) ++-.++++|.|+....++.+-.-++.+.+.. .......+..+-.++ T Consensus 338 ~~~~~~~~G~~t~nE~R~~lg~~p~~~~d~~-----~~~~~~~~~~gGd~e 383 (383) T protein:vir:10 338 QVSNLAKSGVLGAEQAQFILTRSGFLPDNLP-----EFKPLTNETKGGDDK 383 (383) T ss_pred HHHHHHhCCCcCHHHHHHHhCCCcccCCccc-----ccCCCcccCCCCCCC Confidence 7889999999999988888765555443321 111111111111111 No 140 >protein:vir:99232 Length: 526 # NCBI annotation: putative portal protein # Family: family:all:313 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950451;genbank:gi:119953652;genbank:GeneID:4643092 Probab=90.49 E-value=0.021 Score=29.83 Aligned_cols=470 Identities=13% Similarity=0.113 Sum_probs=181.1 Q ss_pred CCCcHHHHHH-HHHHHHHHHHhcC-----------hHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhc Q lcl|NC_019408. 1 MVTHPEYQYW-RPEWTKLRDVMAG-----------QREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTG 68 (612) Q Consensus 1 ~~~hP~y~~~-~~~W~~i~d~~~G-----------~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G 68 (612) =|..+....- .+.-.-+++.+.+ -..++.+..-.++. --+-|+..+.+ -.++...++.... 
T Consensus 11 p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~iLr~a~~gd~~~----~~~L~e~m~e~---D~~i~s~l~~Rk~ 83 (526) T protein:vir:99 11 PIRTQQLREPQTSRLAGLAKEFAQHPAKGLTPAKLARILVEAEQGNLQA----QAELFMDMEER---DAHLFAEMSKRKR 83 (526) T ss_pred ccccccccchhhhhhhhhhhhhcccCcCCCCHHHHHHHHHhhhCCCHHH----HHHHHHHHHhh---ChHHHHHHHHHHH Confidence 1222211100 0111111122111 01222221111100 01234444432 4566666677777 Q ss_pred hhhcCCceeec----CC--HH----HHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEe Q lcl|NC_019408. 69 MVFRRDPIVKN----LP--PK----FKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGY 138 (612) Q Consensus 69 ~vf~k~p~~~~----~p--~~----l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~ 138 (612) .|...+..|+- -+ .. +..++.+. .+++.++..++ .++-+|.+.+=+ T Consensus 84 av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~----~~~~~~i~~~l-da~~~G~s~~Ei------------------- 139 (526) T protein:vir:99 84 AILGLDWAVEPPRNASAAEKADADYLHELLLDL----EGLEDLLLDAL-DGIGHGYSCIEL------------------- 139 (526) T ss_pred HHhCCCceEecCCCCCHHHHHHHHHHHHHHhcc----cCHHHHHHHHH-HhhhhcceeEEE------------------- Confidence 77777777741 11 12 33333332 25888888887 477788664433 Q ss_pred chhhhhcchhhhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeee Q lcl|NC_019408. 139 SAENILDWDEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVY 218 (612) Q Consensus 139 ~ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~ 218 (612) .|+. .+|...+.-+..|.. .-|.... . .+ ... T Consensus 140 ------vw~~---~~g~~~~~~l~~r~~----------~~f~~~~---------~----------------~~----~~l 171 (526) T protein:vir:99 140 ------EWAL---QGREWMPLAFHHRPQ----------SWFQLNP---------E----------------DQ----NEL 171 (526) T ss_pred ------EEee---cCCceeEEEeeeecc----------cceeecc---------C----------------CC----cEE Confidence 2432 134333333332210 0000000 0 00 000 Q ss_pred eeeeccccccccccccceeEEEEEeeCCCceecceeeeccCCccccceeEEE-eecCCCCCCcCcCchHHHHHHHH-HHH Q lcl|NC_019408. 
219 RELKLEEIEWPSGEVKLAYVQYLYEEDPESRPIARIVPTVRGEPLDFIPFKF-FGASGNTADVEKPPLLDICDLNL-SHY 296 (612) Q Consensus 219 R~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~-~~~~~~~~~~~~pPLldLA~lnl-~HY 296 (612) |. ..+ ...|.+|..-=|++ .+....+.-.+.+.|..++..-+ ++| T Consensus 172 ~~----------------------~~~-----------~~~g~~l~~~k~i~~~~~~~~g~p~g~gLlr~~~w~~~fK~~ 218 (526) T protein:vir:99 172 RL----------------------RDN-----------SPAGEALQPFGWIIHRPRARSGYVARSGLFRVLAWPYLFRHY 218 (526) T ss_pred Ee----------------------cCC-----------CCCceeecCCCeEEEeecCCcCCccccchHHHHHHHHHHHHh Confidence 00 000 01122222111222 23333344455666666665544 444 Q ss_pred hhhHHHHHHHHHhccceeeee---cCCCCCCce-----EEEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHH Q lcl|NC_019408. 297 RTYAELEYGRLFTALPVYYAP---GTDSEGTGE-----YHIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIA 368 (612) Q Consensus 297 ~~~sD~~~~l~~~~~P~l~i~---G~~~~~~~~-----l~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~ 368 (612) .-.+.-.-+..-|+|+++.. |.+++..+. ..||++++..+|.|..+.|++..+.+......-++-..++|. T Consensus 219 -~~~~w~~f~E~yG~P~~igky~~~a~~~ek~~L~~av~~i~~d~~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Is 297 (526) T protein:vir:99 219 -ATSDLAEMLEIYGLPIRLGKYPPGTADEEKATLLRAVTGLGHAAAGIIPETMAIDFQQAAQGSSEPFLAMMRQSEDAIS 297 (526) T ss_pred -hHHHHHHHHHHcCCceEEEecCCCCCHHHHHHHHHHHHHHhhCcEEEecCCceeEEeecCCCCHHHHHHHHHHHHHHHH Confidence 44566677777899999985 222221111 348999999999999999999877666666666677777764 Q ss_pred H--HHHHhhhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHcCCcCCCC-cceEEEeeccccccC Q lcl|NC_019408. 369 A--IGGRMMPGASKSVSESNNQTVLREANEQSLLLNIIQACESGMTD-VVRWWLMWRDVPLADT-ENLRYEVNTDFLSTP 444 (612) Q Consensus 369 ~--lGa~ll~~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~-~l~~~a~w~g~~~~~~-~~~~v~ln~dF~~~~ 444 (612) . +|..|-..+......|--.......-...++.+-+..+++.+++ ++.+++.|-+-..... .-..|.+... 
...+ T Consensus 298 k~iLGqtlTs~~~~g~~gS~a~g~vh~~v~~di~~aDa~~i~~tln~~Li~~l~~~N~~~~~~~~~~p~~~~~~~-e~eD 376 (526) T protein:vir:99 298 KAVLGGTLTSTTSQSGGGAFALGQVHNEVRHDLLASDARQLAATLSRDLLWPLLVLNRPGSPDVRRAPRLVFDLR-EQAD 376 (526) T ss_pred HHHhhhhhccccccCcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcCCccccceEEeCCC-Cccc Confidence 3 55444221111111222233445555666788899999999975 8899999854321111 1233433210 1111 Q ss_pred CCHHHHHHHHHHHHcCC-CCHHHHHHHHHhcCccchhhhhHHHHHHhhccccccccchhHH-------hhhhhhHHHHhH Q lcl|NC_019408. 445 IGAREMRAIQLMANDGL-LPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFINNPDAQA-------RQRGYTNRGQEL 516 (612) Q Consensus 445 ~d~~~~~al~~~~~~G~-is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~-------~~~~e~~r~~~~ 516 (612) + ...+..+..+...|. |+...+.+ +.|+..+. +.++......... ......... .......++ .. T Consensus 377 l-~~~a~~~~~L~~~G~~i~~~~i~e---~~Gip~~~-~~e~~l~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~-~~ 449 (526) T protein:vir:99 377 I-TSMAQSIPALVNVGLEIPSAWVYD---KLGIPQPA-KNEPVLRSAAQPA-ILSRQHGQRVAALATIVGPRYGDQQ-AL 449 (526) T ss_pred H-HHHHHHHHHHHhCCCccCHHHHHH---HhCCCCCC-CcccccCCCCCCc-ccccccccccccccccccccCcchh-hH Confidence 1 224556667888886 88765444 34763322 1111111111000 000000000 000000101 11 Q ss_pred HHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhccccC Q lcl|NC_019408. 517 EQSRMA-READFTQQKIDIQERSVAVQEGHAEVAHAAGSTSISGSRKLGDPE-QAKPAVADQATIDNAKKQTANAAKVAA 594 (612) Q Consensus 517 e~~r~~-~e~e~~~q~~e~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~e~-q~k~~~~eq~~~~~~~k~~~~~a~~~~ 594 (612) +.-..+ ...+.+..-..-- +.=+..++ ....=++.+.+..+-- ..-..+.++ .-+.-=..+.-.-+.. T Consensus 450 d~~l~~~~~~~~~~~~~~~l------~~i~~~l~--~~~s~ee~~~~L~~l~~~ld~~~l~~--~l~~a~~~A~l~Gr~~ 519 (526) T protein:vir:99 450 DKALADLPAKDMQNQANDLL------APLLEAVN--RGDSETELLGALAEAFPDMDDSALTD--ALHRLLFAADTWGRLH 519 (526) T ss_pred HHHHHHHHHHHHHHHHHHHH------HHHHHHHH--hcCCHHHHHHHHHHHhccCCHHHHHH--HHHHHHHHHHHhhhhh Confidence 100000 0000010000000 00000000 0000011111110000 000000000 0000000000000000 Q ss_pred CCchhhc Q lcl|NC_019408. 
595 QPPAPAA 601 (612) Q Consensus 595 ~~~~~~~ 601 (612) ...+..- T Consensus 520 ~~~e~~~ 526 (526) T protein:vir:99 520 GNLDRID 526 (526) T ss_pred hhhcccC Confidence 0000000 No 141 >protein:vir:98506 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996583;genbank:gi:45569514;genbank:GeneID:2767834 Probab=88.71 E-value=0.031 Score=28.88 Aligned_cols=508 Identities=12% Similarity=0.074 Sum_probs=193.8 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcC-hHHHHhcccccCCCCC---CCCHHHHHHHHhhccCCchHHHHHHHhhch----hhc Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAG-QREIKRKAEAYLPAMK---GADGDDYAIYLQRATFFNMLAQTRDGMTGM----VFR 72 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G-~~~vr~~g~~YLPk~~---~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~----vf~ 72 (612) |..+.+-.....+|+.++.--.= ...|+....-.||... ..+...=..+ ..-.|-+.-.+.++.++.. +|. T Consensus 1 M~~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~-~~~~~dst~~~a~~~LAa~L~~~ltp 79 (555) T protein:vir:98 1 MAEQTERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKR-HNNILDNTGTRALRVLAAGMMAGMTS 79 (555) T ss_pred CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhc-ccccccccHHHHHHHHHHHHHHhhcC Confidence 88777777777777766554211 4455555545566521 1111110111 1224444445555554433 332 Q ss_pred -CCcee--e----cC--CHHHHHHHhccC------CCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEE Q lcl|NC_019408. 73 -RDPIV--K----NL--PPKFKDAVRRFA------KDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVG 137 (612) Q Consensus 73 -k~p~~--~----~~--p~~l~~~~~d~D------~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~ 137 (612) ..|=+ . ++ -..++.|++.|. ...++++.-+-.++.+.+.+|-+-++++-... ..-.+.. 
T Consensus 80 p~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~------~~~rf~~ 153 (555) T protein:vir:98 80 PARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFD------AVVYHHS 153 (555) T ss_pred CCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCC------ceEEEEE Confidence 11111 0 00 123555555422 23477888888889999999998888874321 1123344 Q ss_pred echhhhhcchhhhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceee Q lcl|NC_019408. 138 YSAENILDWDEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITV 217 (612) Q Consensus 138 ~~ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~ 217 (612) |+..++. . ..|+...+.-|+-++.+. .+.-.+.|+...... .++......+....+++ T Consensus 154 ~pl~~~~---v--~~d~~G~vd~i~r~~~~t---~~ql~~~fg~~~l~~--------------~~~~~~~~~~~~~~v~v 211 (555) T protein:vir:98 154 LTAGEYA---I--AADNQGRVNTLYREFQIT---VAQMVREFGKDKCST--------------TVQSLFDRGALEQWVTV 211 (555) T ss_pred eecceeE---E--eeCCCCCEEEEEEEEecc---HHHHHHhcCcccCCH--------------HHHHHHhcCCCCceEEE Confidence 5544432 1 223333343333222111 111122232211110 00000000011111222 Q ss_pred eeeeecccccccc--ccccceeEEEEEeeCCCceecceeeeccCC-ccccceeEEEeecCCCCCCcC--cCchHHHHHHH Q lcl|NC_019408. 
218 YRELKLEEIEWPS--GEVKLAYVQYLYEEDPESRPIARIVPTVRG-EPLDFIPFKFFGASGNTADVE--KPPLLDICDLN 292 (612) Q Consensus 218 ~R~~~~~~~~~~~--g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g-~~l~~IP~v~~~~~~~~~~~~--~pPLldLA~ln 292 (612) +..........++ +..+-.+....+..+..+ ..+...+| ..+++||+.|.-..+..+..+ ..-|-|+..|| T Consensus 212 ~~~V~pr~~~~~~~~~~~~~p~~s~~~~~~~d~----~~vl~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~ 287 (555) T protein:vir:98 212 IHAIEPRADRDPSKRDDRNMAWKSVYFEPGADE----TRTLRESGYRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQ 287 (555) T ss_pred EEEEeeccCcCcCCCCccccceEEEEEEeccCC----ccccccCCcccCCceeeeeeecCCCccccchHHHHHHHHHHHH Confidence 2211110000011 111111111112111111 11222333 347788888876666555332 12255777777 Q ss_pred HHHHhhhHHHHHHHHHhccceeee-ecCCCCCCceEEEeccccccCCC--CCc--eeEEecCchhHHHHHHHHHHHHHHH Q lcl|NC_019408. 293 LSHYRTYAELEYGRLFTALPVYYA-PGTDSEGTGEYHIGPNMVWEVPQ--GSE--PGILEYTGQGLKALETALNDKERQI 367 (612) Q Consensus 293 l~HY~~~sD~~~~l~~~~~P~l~i-~G~~~~~~~~l~iG~~~~~~lp~--~~~--~~~lE~~g~~l~~~~~~l~~~e~qm 367 (612) .- +.+-+. .++.+..|.+.+ .+. ....+.+-|+....++. +++ .-.++. +..+....+.|++++..+ T Consensus 288 ~l---~~~~l~-~~~~~~~pp~~v~~~~---~~~~~~~~pgg~~~v~~g~~~d~~~~~~~~-~~d~~~~~~~i~~~~~rI 359 (555) T protein:vir:98 288 HE---QLRKAQ-AIDYKSNPPLQLPVSA---KNQDISTVPGGLSYVDAAAPNGGIRTAFEV-NLDLSHLLADIVDVRERI 359 (555) T ss_pred HH---HHHHHH-HHHHHhcCceeecccc---ccccceeccccccccccCCCCcceeccccc-ccchHHHHHHHHHHHHHH Confidence 52 222244 445555554444 322 22345555554433332 222 222333 235788889999999998 Q ss_pred HHHHH-H---hhhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHcCCcCCCCcceEEEeec Q lcl|NC_019408. 368 AAIGG-R---MMPGASKSVSESNNQTVLREANEQSLLLNIIQACESGMT-----DVVRWWLMWRDVPLADTENLRYEVNT 438 (612) Q Consensus 368 ~~lGa-~---ll~~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~-----~~l~~~a~w~g~~~~~~~~~~v~ln~ 438 (612) +.+-- . ++... ....-||++...+...-...|..+-.++.+=+- ++|.++.+ .|.-..-+.++ ... 
T Consensus 360 ~~af~~dlf~~l~~~-~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r-~g~lP~~P~~l---~~~ 434 (555) T protein:vir:98 360 KASFYADLFLMLANG-TNPQMTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVE-ANILPPPPQEM---QGV 434 (555) T ss_pred HHHhhcchhhhccCC-CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh-cCCCCCCchhh---cCc Confidence 76542 1 22211 223468999988888888888888877755433 34444443 23210001111 001 Q ss_pred cccccCCCHHHHHHHHHHHHcCCCCHHHHHHHHHh-cCccch----hhhhHHHHHHhhccccccccchhHHhhhhhhHHH Q lcl|NC_019408. 439 DFLSTPIGAREMRAIQLMANDGLLPDPVFYEYMRK-AEVISS----DMTFEEFQALRADENSFINNPDAQARQRGYTNRG 513 (612) Q Consensus 439 dF~~~~~d~~~~~al~~~~~~G~is~et~~~~lqr-~~vl~~----~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~ 513 (612) ++.+..+.+ +.+...+...+.| .-++..+.. .+ ++| -+++++..+.+++- .+-|...=+..++ T Consensus 435 ~i~v~yis~--La~aq~~~~~~~i--~~~l~~i~~laq-~~P~vld~id~d~~~~~~a~~---~Gvp~~~irs~ee---- 502 (555) T protein:vir:98 435 DLNVEFVSM--LAQAQRAIATNSV--DRFVGNLGAVAG-IKPEVLDKFDADRWADTYADM---LGIDPELIVPGNQ---- 502 (555) T ss_pred eeEEEeccH--HHHHHHHHHHHHH--HHHHHHHHHHhc-CChhhhhcCCHHHHHHHHHHH---hCCCccccCCHHH---- Confidence 111112211 2222222222111 112222211 11 122 35667767777654 2222111111111 Q ss_pred HhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|NC_019408. 514 QELEQSRMAREADFTQQKIDIQERSVAVQEGHAEVAHAAGSTSISGSRKLGDPEQAKPAVADQATIDNAKKQTANAAKVA 593 (612) Q Consensus 514 ~~~e~~r~~~e~e~~~q~~e~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~e~q~k~~~~eq~~~~~~~k~~~~~a~~~ 593 (612) -++.|+++. +++|+ +.+++.+.+ +-+..++-.+.+-... -....-+.|--- T Consensus 503 --v~~~r~qr~-~~~q~-----~~~a~~~~q-------------------~~~~~~~~~~~~~~~~--~~~~~~~~~~~~ 553 (555) T protein:vir:98 503 --VALIRKQRA-DQQQA-----AQQAALLNQ-------------------GADTAAKLGSVDTSKQ--NALTDVTRAFSG 553 (555) T ss_pred --HHHHHHHHH-HHHHH-----HHHHHHHHH-------------------HHHHHHHhcccccCcc--hhHHHHHhhhcc Confidence 011111100 00000 000000000 0000000000000000 000000000000 Q ss_pred CC Q lcl|NC_019408. 594 AQ 595 (612) Q Consensus 594 ~~ 595 (612) =. 
T Consensus 554 ~~ 555 (555) T protein:vir:98 554 YT 555 (555) T ss_pred CC Confidence 00 No 142 >protein:vir:107404 Length: 555 # NCBI annotation: Bbp21 # Family: family:all:481 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958690;genbank:gi:41179382;genbank:GeneID:2717198 Probab=88.71 E-value=0.031 Score=28.88 Aligned_cols=508 Identities=12% Similarity=0.074 Sum_probs=193.8 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcC-hHHHHhcccccCCCCC---CCCHHHHHHHHhhccCCchHHHHHHHhhch----hhc Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAG-QREIKRKAEAYLPAMK---GADGDDYAIYLQRATFFNMLAQTRDGMTGM----VFR 72 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G-~~~vr~~g~~YLPk~~---~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~----vf~ 72 (612) |..+.+-.....+|+.++.--.= ...|+....-.||... ..+...=..+ ..-.|-+.-.+.++.++.. +|. T Consensus 1 M~~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~-~~~~~dst~~~a~~~LAa~L~~~ltp 79 (555) T protein:vir:10 1 MAEQTERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKR-HNNILDNTGTRALRVLAAGMMAGMTS 79 (555) T ss_pred CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhc-ccccccccHHHHHHHHHHHHHHhhcC Confidence 88777777777777766554211 4455555545566521 1111110111 1224444445555554433 332 Q ss_pred -CCcee--e----cC--CHHHHHHHhccC------CCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEE Q lcl|NC_019408. 73 -RDPIV--K----NL--PPKFKDAVRRFA------KDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVG 137 (612) Q Consensus 73 -k~p~~--~----~~--p~~l~~~~~d~D------~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~ 137 (612) ..|=+ . ++ -..++.|++.|. ...++++.-+-.++.+.+.+|-+-++++-... ..-.+.. 
T Consensus 80 p~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~------~~~rf~~ 153 (555) T protein:vir:10 80 PARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFD------AVVYHHS 153 (555) T ss_pred CCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCC------ceEEEEE Confidence 11111 0 00 123555555422 23477888888889999999998888874321 1123344 Q ss_pred echhhhhcchhhhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceee Q lcl|NC_019408. 138 YSAENILDWDEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITV 217 (612) Q Consensus 138 ~~ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~ 217 (612) |+..++. . ..|+...+.-|+-++.+. .+.-.+.|+...... .++......+....+++ T Consensus 154 ~pl~~~~---v--~~d~~G~vd~i~r~~~~t---~~ql~~~fg~~~l~~--------------~~~~~~~~~~~~~~v~v 211 (555) T protein:vir:10 154 LTAGEYA---I--AADNQGRVNTLYREFQIT---VAQMVREFGKDKCST--------------TVQSLFDRGALEQWVTV 211 (555) T ss_pred eecceeE---E--eeCCCCCEEEEEEEEecc---HHHHHHhcCcccCCH--------------HHHHHHhcCCCCceEEE Confidence 5544432 1 223333343333222111 111122232211110 00000000011111222 Q ss_pred eeeeecccccccc--ccccceeEEEEEeeCCCceecceeeeccCC-ccccceeEEEeecCCCCCCcC--cCchHHHHHHH Q lcl|NC_019408. 
218 YRELKLEEIEWPS--GEVKLAYVQYLYEEDPESRPIARIVPTVRG-EPLDFIPFKFFGASGNTADVE--KPPLLDICDLN 292 (612) Q Consensus 218 ~R~~~~~~~~~~~--g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g-~~l~~IP~v~~~~~~~~~~~~--~pPLldLA~ln 292 (612) +..........++ +..+-.+....+..+..+ ..+...+| ..+++||+.|.-..+..+..+ ..-|-|+..|| T Consensus 212 ~~~V~pr~~~~~~~~~~~~~p~~s~~~~~~~d~----~~vl~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~ 287 (555) T protein:vir:10 212 IHAIEPRADRDPSKRDDRNMAWKSVYFEPGADE----TRTLRESGYRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQ 287 (555) T ss_pred EEEEeeccCcCcCCCCccccceEEEEEEeccCC----ccccccCCcccCCceeeeeeecCCCccccchHHHHHHHHHHHH Confidence 2211110000011 111111111112111111 11222333 347788888876666555332 12255777777 Q ss_pred HHHHhhhHHHHHHHHHhccceeee-ecCCCCCCceEEEeccccccCCC--CCc--eeEEecCchhHHHHHHHHHHHHHHH Q lcl|NC_019408. 293 LSHYRTYAELEYGRLFTALPVYYA-PGTDSEGTGEYHIGPNMVWEVPQ--GSE--PGILEYTGQGLKALETALNDKERQI 367 (612) Q Consensus 293 l~HY~~~sD~~~~l~~~~~P~l~i-~G~~~~~~~~l~iG~~~~~~lp~--~~~--~~~lE~~g~~l~~~~~~l~~~e~qm 367 (612) .- +.+-+. .++.+..|.+.+ .+. ....+.+-|+....++. +++ .-.++. +..+....+.|++++..+ T Consensus 288 ~l---~~~~l~-~~~~~~~pp~~v~~~~---~~~~~~~~pgg~~~v~~g~~~d~~~~~~~~-~~d~~~~~~~i~~~~~rI 359 (555) T protein:vir:10 288 HE---QLRKAQ-AIDYKSNPPLQLPVSA---KNQDISTVPGGLSYVDAAAPNGGIRTAFEV-NLDLSHLLADIVDVRERI 359 (555) T ss_pred HH---HHHHHH-HHHHHhcCceeecccc---ccccceeccccccccccCCCCcceeccccc-ccchHHHHHHHHHHHHHH Confidence 52 222244 445555554444 322 22345555554433332 222 222333 235788889999999998 Q ss_pred HHHHH-H---hhhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHcCCcCCCCcceEEEeec Q lcl|NC_019408. 368 AAIGG-R---MMPGASKSVSESNNQTVLREANEQSLLLNIIQACESGMT-----DVVRWWLMWRDVPLADTENLRYEVNT 438 (612) Q Consensus 368 ~~lGa-~---ll~~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~-----~~l~~~a~w~g~~~~~~~~~~v~ln~ 438 (612) +.+-- . ++... ....-||++...+...-...|..+-.++.+=+- ++|.++.+ .|.-..-+.++ ... 
T Consensus 360 ~~af~~dlf~~l~~~-~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r-~g~lP~~P~~l---~~~ 434 (555) T protein:vir:10 360 KASFYADLFLMLANG-TNPQMTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVE-ANILPPPPQEM---QGV 434 (555) T ss_pred HHHhhcchhhhccCC-CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh-cCCCCCCchhh---cCc Confidence 76542 1 22211 223468999988888888888888877755433 34444443 23210001111 001 Q ss_pred cccccCCCHHHHHHHHHHHHcCCCCHHHHHHHHHh-cCccch----hhhhHHHHHHhhccccccccchhHHhhhhhhHHH Q lcl|NC_019408. 439 DFLSTPIGAREMRAIQLMANDGLLPDPVFYEYMRK-AEVISS----DMTFEEFQALRADENSFINNPDAQARQRGYTNRG 513 (612) Q Consensus 439 dF~~~~~d~~~~~al~~~~~~G~is~et~~~~lqr-~~vl~~----~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~ 513 (612) ++.+..+.+ +.+...+...+.| .-++..+.. .+ ++| -+++++..+.+++- .+-|...=+..++ T Consensus 435 ~i~v~yis~--La~aq~~~~~~~i--~~~l~~i~~laq-~~P~vld~id~d~~~~~~a~~---~Gvp~~~irs~ee---- 502 (555) T protein:vir:10 435 DLNVEFVSM--LAQAQRAIATNSV--DRFVGNLGAVAG-IKPEVLDKFDADRWADTYADM---LGIDPELIVPGNQ---- 502 (555) T ss_pred eeEEEeccH--HHHHHHHHHHHHH--HHHHHHHHHHhc-CChhhhhcCCHHHHHHHHHHH---hCCCccccCCHHH---- Confidence 111112211 2222222222111 112222211 11 122 35667767777654 2222111111111 Q ss_pred HhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|NC_019408. 514 QELEQSRMAREADFTQQKIDIQERSVAVQEGHAEVAHAAGSTSISGSRKLGDPEQAKPAVADQATIDNAKKQTANAAKVA 593 (612) Q Consensus 514 ~~~e~~r~~~e~e~~~q~~e~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~e~q~k~~~~eq~~~~~~~k~~~~~a~~~ 593 (612) -++.|+++. +++|+ +.+++.+.+ +-+..++-.+.+-... -....-+.|--- T Consensus 503 --v~~~r~qr~-~~~q~-----~~~a~~~~q-------------------~~~~~~~~~~~~~~~~--~~~~~~~~~~~~ 553 (555) T protein:vir:10 503 --VALIRKQRA-DQQQA-----AQQAALLNQ-------------------GADTAAKLGSVDTSKQ--NALTDVTRAFSG 553 (555) T ss_pred --HHHHHHHHH-HHHHH-----HHHHHHHHH-------------------HHHHHHHhcccccCcc--hhHHHHHhhhcc Confidence 011111100 00000 000000000 0000000000000000 000000000000 Q ss_pred CC Q lcl|NC_019408. 594 AQ 595 (612) Q Consensus 594 ~~ 595 (612) =. 
T Consensus 554 ~~ 555 (555) T protein:vir:10 554 YT 555 (555) T ss_pred CC Confidence 00 No 143 >protein:vir:107822 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996631;genbank:gi:45580765;genbank:GeneID:2767898 Probab=88.71 E-value=0.031 Score=28.88 Aligned_cols=508 Identities=12% Similarity=0.074 Sum_probs=193.8 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcC-hHHHHhcccccCCCCC---CCCHHHHHHHHhhccCCchHHHHHHHhhch----hhc Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAG-QREIKRKAEAYLPAMK---GADGDDYAIYLQRATFFNMLAQTRDGMTGM----VFR 72 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G-~~~vr~~g~~YLPk~~---~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~----vf~ 72 (612) |..+.+-.....+|+.++.--.= ...|+....-.||... ..+...=..+ ..-.|-+.-.+.++.++.. +|. T Consensus 1 M~~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~-~~~~~dst~~~a~~~LAa~L~~~ltp 79 (555) T protein:vir:10 1 MAEQTERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKR-HNNILDNTGTRALRVLAAGMMAGMTS 79 (555) T ss_pred CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhc-ccccccccHHHHHHHHHHHHHHhhcC Confidence 88777777777777766554211 4455555545566521 1111110111 1224444445555554433 332 Q ss_pred -CCcee--e----cC--CHHHHHHHhccC------CCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEE Q lcl|NC_019408. 73 -RDPIV--K----NL--PPKFKDAVRRFA------KDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVG 137 (612) Q Consensus 73 -k~p~~--~----~~--p~~l~~~~~d~D------~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~ 137 (612) ..|=+ . ++ -..++.|++.|. ...++++.-+-.++.+.+.+|-+-++++-... ..-.+.. 
T Consensus 80 p~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~------~~~rf~~ 153 (555) T protein:vir:10 80 PARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFD------AVVYHHS 153 (555) T ss_pred CCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCC------ceEEEEE Confidence 11111 0 00 123555555422 23477888888889999999998888874321 1123344 Q ss_pred echhhhhcchhhhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceee Q lcl|NC_019408. 138 YSAENILDWDEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITV 217 (612) Q Consensus 138 ~~ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~ 217 (612) |+..++. . ..|+...+.-|+-++.+. .+.-.+.|+...... .++......+....+++ T Consensus 154 ~pl~~~~---v--~~d~~G~vd~i~r~~~~t---~~ql~~~fg~~~l~~--------------~~~~~~~~~~~~~~v~v 211 (555) T protein:vir:10 154 LTAGEYA---I--AADNQGRVNTLYREFQIT---VAQMVREFGKDKCST--------------TVQSLFDRGALEQWVTV 211 (555) T ss_pred eecceeE---E--eeCCCCCEEEEEEEEecc---HHHHHHhcCcccCCH--------------HHHHHHhcCCCCceEEE Confidence 5544432 1 223333343333222111 111122232211110 00000000011111222 Q ss_pred eeeeecccccccc--ccccceeEEEEEeeCCCceecceeeeccCC-ccccceeEEEeecCCCCCCcC--cCchHHHHHHH Q lcl|NC_019408. 
218 YRELKLEEIEWPS--GEVKLAYVQYLYEEDPESRPIARIVPTVRG-EPLDFIPFKFFGASGNTADVE--KPPLLDICDLN 292 (612) Q Consensus 218 ~R~~~~~~~~~~~--g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g-~~l~~IP~v~~~~~~~~~~~~--~pPLldLA~ln 292 (612) +..........++ +..+-.+....+..+..+ ..+...+| ..+++||+.|.-..+..+..+ ..-|-|+..|| T Consensus 212 ~~~V~pr~~~~~~~~~~~~~p~~s~~~~~~~d~----~~vl~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~ 287 (555) T protein:vir:10 212 IHAIEPRADRDPSKRDDRNMAWKSVYFEPGADE----TRTLRESGYRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQ 287 (555) T ss_pred EEEEeeccCcCcCCCCccccceEEEEEEeccCC----ccccccCCcccCCceeeeeeecCCCccccchHHHHHHHHHHHH Confidence 2211110000011 111111111112111111 11222333 347788888876666555332 12255777777 Q ss_pred HHHHhhhHHHHHHHHHhccceeee-ecCCCCCCceEEEeccccccCCC--CCc--eeEEecCchhHHHHHHHHHHHHHHH Q lcl|NC_019408. 293 LSHYRTYAELEYGRLFTALPVYYA-PGTDSEGTGEYHIGPNMVWEVPQ--GSE--PGILEYTGQGLKALETALNDKERQI 367 (612) Q Consensus 293 l~HY~~~sD~~~~l~~~~~P~l~i-~G~~~~~~~~l~iG~~~~~~lp~--~~~--~~~lE~~g~~l~~~~~~l~~~e~qm 367 (612) .- +.+-+. .++.+..|.+.+ .+. ....+.+-|+....++. +++ .-.++. +..+....+.|++++..+ T Consensus 288 ~l---~~~~l~-~~~~~~~pp~~v~~~~---~~~~~~~~pgg~~~v~~g~~~d~~~~~~~~-~~d~~~~~~~i~~~~~rI 359 (555) T protein:vir:10 288 HE---QLRKAQ-AIDYKSNPPLQLPVSA---KNQDISTVPGGLSYVDAAAPNGGIRTAFEV-NLDLSHLLADIVDVRERI 359 (555) T ss_pred HH---HHHHHH-HHHHHhcCceeecccc---ccccceeccccccccccCCCCcceeccccc-ccchHHHHHHHHHHHHHH Confidence 52 222244 445555554444 322 22345555554433332 222 222333 235788889999999998 Q ss_pred HHHHH-H---hhhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHcCCcCCCCcceEEEeec Q lcl|NC_019408. 368 AAIGG-R---MMPGASKSVSESNNQTVLREANEQSLLLNIIQACESGMT-----DVVRWWLMWRDVPLADTENLRYEVNT 438 (612) Q Consensus 368 ~~lGa-~---ll~~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~-----~~l~~~a~w~g~~~~~~~~~~v~ln~ 438 (612) +.+-- . ++... ....-||++...+...-...|..+-.++.+=+- ++|.++.+ .|.-..-+.++ ... 
T Consensus 360 ~~af~~dlf~~l~~~-~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r-~g~lP~~P~~l---~~~ 434 (555) T protein:vir:10 360 KASFYADLFLMLANG-TNPQMTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVE-ANILPPPPQEM---QGV 434 (555) T ss_pred HHHhhcchhhhccCC-CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh-cCCCCCCchhh---cCc Confidence 76542 1 22211 223468999988888888888888877755433 34444443 23210001111 001 Q ss_pred cccccCCCHHHHHHHHHHHHcCCCCHHHHHHHHHh-cCccch----hhhhHHHHHHhhccccccccchhHHhhhhhhHHH Q lcl|NC_019408. 439 DFLSTPIGAREMRAIQLMANDGLLPDPVFYEYMRK-AEVISS----DMTFEEFQALRADENSFINNPDAQARQRGYTNRG 513 (612) Q Consensus 439 dF~~~~~d~~~~~al~~~~~~G~is~et~~~~lqr-~~vl~~----~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~ 513 (612) ++.+..+.+ +.+...+...+.| .-++..+.. .+ ++| -+++++..+.+++- .+-|...=+..++ T Consensus 435 ~i~v~yis~--La~aq~~~~~~~i--~~~l~~i~~laq-~~P~vld~id~d~~~~~~a~~---~Gvp~~~irs~ee---- 502 (555) T protein:vir:10 435 DLNVEFVSM--LAQAQRAIATNSV--DRFVGNLGAVAG-IKPEVLDKFDADRWADTYADM---LGIDPELIVPGNQ---- 502 (555) T ss_pred eeEEEeccH--HHHHHHHHHHHHH--HHHHHHHHHHhc-CChhhhhcCCHHHHHHHHHHH---hCCCccccCCHHH---- Confidence 111112211 2222222222111 112222211 11 122 35667767777654 2222111111111 Q ss_pred HhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|NC_019408. 514 QELEQSRMAREADFTQQKIDIQERSVAVQEGHAEVAHAAGSTSISGSRKLGDPEQAKPAVADQATIDNAKKQTANAAKVA 593 (612) Q Consensus 514 ~~~e~~r~~~e~e~~~q~~e~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~e~q~k~~~~eq~~~~~~~k~~~~~a~~~ 593 (612) -++.|+++. +++|+ +.+++.+.+ +-+..++-.+.+-... -....-+.|--- T Consensus 503 --v~~~r~qr~-~~~q~-----~~~a~~~~q-------------------~~~~~~~~~~~~~~~~--~~~~~~~~~~~~ 553 (555) T protein:vir:10 503 --VALIRKQRA-DQQQA-----AQQAALLNQ-------------------GADTAAKLGSVDTSKQ--NALTDVTRAFSG 553 (555) T ss_pred --HHHHHHHHH-HHHHH-----HHHHHHHHH-------------------HHHHHHHhcccccCcc--hhHHHHHhhhcc Confidence 011111100 00000 000000000 0000000000000000 000000000000 Q ss_pred CC Q lcl|NC_019408. 594 AQ 595 (612) Q Consensus 594 ~~ 595 (612) =. 
T Consensus 554 ~~ 555 (555) T protein:vir:10 554 YT 555 (555) T ss_pred CC Confidence 00 No 144 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=88.23 E-value=0.034 Score=28.66 Aligned_cols=146 Identities=10% Similarity=0.126 Sum_probs=13.0 Q ss_pred CCHHHHHHHHHhcCccchhhh-hH-HHHHHhhccccccccchhHHhhhhhhHHHH---hHHHHHHHHHHH----HHHHHH Q lcl|NC_019408. 462 LPDPVFYEYMRKAEVISSDMT-FE-EFQALRADENSFINNPDAQARQRGYTNRGQ---ELEQSRMAREAD----FTQQKI 532 (612) Q Consensus 462 is~et~~~~lqr~~vl~~~~~-~e-ee~~ria~e~~~~~~~~~~~~~~~e~~r~~---~~e~~r~~~e~e----~~~q~~ 532 (612) |..+.+..+|.... .... .. +.+..+.+...... +..+...+....++ +++++..+.+.. .++.+. T Consensus 1 Mki~elk~el~~~~---~el~~~~~elr~~~~~~~~~~~--el~~~~~e~~~~~~ei~el~~~l~~~~~~~~~~~e~~~~ 75 (437) T protein:vir:10 1 MKIEKLKKDLATKT---AELNTKKAEIRSFTESEDKTID--EVKAGMTEIKEKEDEIKEIRSNIEVLEQASALKVEEKRD 75 (437) T ss_pred CCHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22222222222100 0000 00 00000000000000 00000000000000 000000000000 000000 Q ss_pred HHHHHHHHHHHH--HHHHHHHHHHH-----HHHHHHHHHHHHH-HHHHHHHHHHHHHHH--HHHHhhcccc-CCC-chhh Q lcl|NC_019408. 533 DIQERSVAVQEG--HAEVAHAAGST-----SISGSRKLGDPEQ-AKPAVADQATIDNAK--KQTANAAKVA-AQP-PAPA 600 (612) Q Consensus 533 e~~~r~~~~~~~--r~~~e~~~~~~-----~~~~~r~~~~e~q-~k~~~~eq~~~~~~~--k~~~~~a~~~-~~~-~~~~ 600 (612) .......+.+.. +.+....++.. ..+.+.+...+.. +.............. ......+... ... .... T Consensus 76 ~~~~~~~e~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 155 (437) T protein:vir:10 76 DSDLVAPELEENSADNEEDDPEKLKTETKSEAEKDKKTVKDEEKRDAGGLQDMKLKVGGEIADKKVTAFADYLKTGEVRD 155 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHhHHHHHHHHHHHHhhhhhhHHHHHhhhhhh Confidence 000000000000 00000000000 0000000000000 000000000000000 0000000000 000 0001 Q ss_pred cCCCCCcccC---CC Q lcl|NC_019408. 
601 APGAPPTNRR---PT 612 (612) Q Consensus 601 ~~~~~~~~~~---~~ 612 (612) .....+++.. |+ T Consensus 156 ~~~~~~~~~g~lvp~ 170 (437) T protein:vir:10 156 VTGIALKDGKVIIPE 170 (437) T ss_pred hhhcccccccccchH Confidence 1111122222 22 No 145 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=87.68 E-value=0.037 Score=28.42 Aligned_cols=123 Identities=13% Similarity=0.002 Sum_probs=12.8 Q ss_pred CCCCHHHHHHHHHhcCccchhhhhHHHHHHhhccccccccchhHHhhhhhhHHHHhHHHHHHHHHHHHH-------HHHH Q lcl|NC_019408. 460 GLLPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFINNPDAQARQRGYTNRGQELEQSRMAREADFT-------QQKI 532 (612) Q Consensus 460 G~is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~e~~r~~~e~e~~-------~q~~ 532 (612) =.|-.+++.++| ..++....++. ....++.++.++.+++.+.+.. .+.. T Consensus 1 ~~~~~~~~~~e~-----------------------~~~e~a~~~~~-~~~~~k~~e~~~~~ke~~~~~l~~~~e~~~k~~ 56 (458) T protein:vir:10 1 MTIDINKLKEEL-----------------------GLGDLAKSLEG-LTAAQKAQEAERMRKEQEEKELARMNDLVSKAV 56 (458) T ss_pred Cccchhhhhhhh-----------------------chhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111111111111 00000000000 0011111111111111111000 0000 Q ss_pred HHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCchhhcC--------- Q lcl|NC_019408. 533 DIQ-ERSVAVQEGHAEVAHAAGSTSISGSRKLGDPEQAKPAVADQATIDNAKKQTANAAKVAAQPPAPAAP--------- 602 (612) Q Consensus 533 e~~-~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~e~q~k~~~~eq~~~~~~~k~~~~~a~~~~~~~~~~~~--------- 602 (612) ++. +...+..++.+...+..++... .... ..+...+....++.+ .+.....+++.......... 
T Consensus 57 ~E~~~~le~~~ee~k~l~ee~~~~~~-~~a~-~~e~~~~~~~~~~~~----~~~~~~~~e~~~~~~~~~~~~~~~~~~~~ 130 (458) T protein:vir:10 57 GEDRKRLEEALELVKSLDEKSKKSNE-LFAQ-TVEKQQETIVGLQDE----IKSLLTAREGRSFVGDSVAKALYGTQENF 130 (458) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH-HHHH-HHHHHHHHHHHHHHH----HHHHHHHHHhhhhhhhhhhccchhhhhhH Confidence 000 0000000111111111000000 0000 000000000000000 00000000000000000000 Q ss_pred ----------------CCCCcccCCC Q lcl|NC_019408. 603 ----------------GAPPTNRRPT 612 (612) Q Consensus 603 ----------------~~~~~~~~~~ 612 (612) +.-++.++.. T Consensus 131 ~~~~e~~~~~~~~~~~~~~~~~~~~~ 156 (458) T protein:vir:10 131 EDEVEKLVLLSYVMEKGVFETEHGQR 156 (458) T ss_pred HHHHHHHHHHHHHHhhccchhhhhhh Confidence 0011111111 No 146 >protein:vir:102668 Length: 547 # NCBI annotation: Hypothetical protein # Family: family:all:481 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024419;genbank:gi:48696640;genbank:GeneID:2948135 Probab=87.44 E-value=0.039 Score=28.32 Aligned_cols=495 Identities=11% Similarity=0.036 Sum_probs=183.7 Q ss_pred HHHHHHHHHHHHHHHHhcC-hHHHHhcccccCCCCCC----CCHHHHHHH-HhhccCCchHHHHHHHhhc----hhhc-C Q lcl|NC_019408. 5 PEYQYWRPEWTKLRDVMAG-QREIKRKAEAYLPAMKG----ADGDDYAIY-LQRATFFNMLAQTRDGMTG----MVFR-R 73 (612) Q Consensus 5 P~y~~~~~~W~~i~d~~~G-~~~vr~~g~~YLPk~~~----e~~~~Y~~r-l~rA~~~n~~~~tv~~~~G----~vf~-k 73 (612) =+......+|..++.--.= ...|+....-.||.... .+...+... ...-.|-+...+.++.++. .+|. . T Consensus 1 ~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~~i~dst~~~a~~~Las~L~~~ltPp~ 80 (547) T protein:vir:10 1 MENSKIVKRLDFLKTDRKNVEQIWDCIRKYIMPMRSDFFSDLRSEGSINWNQNREVFDSTAGDGLETLSSSLHGSLTSPA 80 (547) T ss_pred CCHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccccccCCCCCcccccccccccccchHHHHHHHHHHHHHHhhcCCC Confidence 1222222222222221000 22222222223444321 111112111 1122344444444444433 3333 1 Q ss_pred Cceee------cC--CHHHHHHHhc--------cCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEE Q lcl|NC_019408. 
74 DPIVK------NL--PPKFKDAVRR--------FAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVG 137 (612) Q Consensus 74 ~p~~~------~~--p~~l~~~~~d--------~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~ 137 (612) .|=+. ++ ...++.|++. ++. .+++.-+-.++.+.+.+|-+-++++-++.. ..-+++.. T Consensus 81 ~~WF~l~~~d~~~~~~~~v~~~L~~ve~~i~~~l~~--snf~~~~~~~~~~L~~~G~a~l~~~~d~~~----~~~~r~~~ 154 (547) T protein:vir:10 81 TKWFELAFRDKELNSDDECRKWLENATHDVYSALQD--SNFNLEANETYIDLCGYGNAIMVEEEDEDE----EGSVVFQS 154 (547) T ss_pred CcccccccCCccccchHHHHHHHHHHHHHHHHHHHh--cCcHHHHHHHHHHHHhHCcEeEEeccCCCC----CCceeEEE Confidence 11110 00 1334555544 343 457777888899999999998888753221 12345666 Q ss_pred echhhhhcchhhhccCCccceeEEEEEEEeeccccccCCCccccccee-eeeeEeeecccccccceeeccccccccc--c Q lcl|NC_019408. 138 YSAENILDWDEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQAR-KARAAALASGSASSPMVRQTARTLGGYS--Y 214 (612) Q Consensus 138 ~~ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~-q~r~l~l~~g~~~~~~~~~~~~~~~g~~--~ 214 (612) |+..++. + ..|+...+.-|+-+..+ ..+.-.+.|+..... .++.. ....++.+ . T Consensus 155 ~pl~~~~-v----~~d~~G~v~~i~r~~~~---t~~qi~~~fg~~~l~~~v~~~---------------~~~~~~~~~~~ 211 (547) T protein:vir:10 155 SPIQDSY-F----EEDSRGQVVNFYRVFRW---TPAQIYDRFGDEGTPEAIIKK---------------AKEASNQAALK 211 (547) T ss_pred eecceEE-E----eeCCCcCeeeeeeeeec---cHHHHHHhcCcccCCHHHHHH---------------HhcCCCcccce Confidence 6665532 1 11233323222211111 011111222221110 00000 00000100 0 Q ss_pred e----eeeeeeecccccccccc--ccceeEEEEEeeCCCceecceeeeccCC-ccccceeEEEeecCCCCCCcC--cCch Q lcl|NC_019408. 215 I----TVYRELKLEEIEWPSGE--VKLAYVQYLYEEDPESRPIARIVPTVRG-EPLDFIPFKFFGASGNTADVE--KPPL 285 (612) Q Consensus 215 ~----~~~R~~~~~~~~~~~g~--~~~~~~~~~~~~~~~~~~~~~~~p~~~g-~~l~~IP~v~~~~~~~~~~~~--~pPL 285 (612) + -+|+............. 
.......++|.+..++ ..+...+| ..+++||+.|.-..+..+..+ ..-| T Consensus 212 ~~v~~~v~~~~~~~~~~~~~~~~~~~~~p~~s~~~e~~~~----~~~l~esg~~e~P~~~~Rw~~~~ge~YGrgp~~~~l 287 (547) T protein:vir:10 212 QEVVMCVFTRYDKKQNRNAGTVLAPTERPFGKKWILKEGA----VQLGEEGGYYEMPAYAIRWRKSAGSQWGFGPSHLAL 287 (547) T ss_pred EEEEEEEeeccCCCCCccccceeeccccceeEEEEEecCc----eeeeecCCcccCCeeeeeeeecCCcccccchHHHHH Confidence 1 11221111100000000 0001111233222211 11222222 347777777876655555332 1124 Q ss_pred HHHHHHHHHHHhhhHHHHHHHHHhccceeeeecCCCCCCceEEEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHH Q lcl|NC_019408. 286 LDICDLNLSHYRTYAELEYGRLFTALPVYYAPGTDSEGTGEYHIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKER 365 (612) Q Consensus 286 ldLA~lnl~HY~~~sD~~~~l~~~~~P~l~i~G~~~~~~~~l~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~ 365 (612) -|+..||.- +.+-+ ..++.+..|.+.+. ++.-..++.++++..+.......++=++.. ..+..+...|+++++ T Consensus 288 ~D~k~L~~l---~~~~l-~~~~~~~~pp~~v~--~~g~~~~~~~~pgg~~~~~~~~~v~pl~~~-~~~~~~~~~i~~~~~ 360 (547) T protein:vir:10 288 PDVLTANRY---VELVL-RSSEKVIDPAIMVT--ERGLISDIDLGASGLTVVRDMESMKPFESR-ARFDVSSIQLTDLRS 360 (547) T ss_pred HHHHHHHHH---HHHHH-HHHHHHhcCceecc--cccccccceecCCeeeecCCcccceeeecc-cchHHHHHHHHHHHH Confidence 577777752 22223 45555555655443 111234577788777755444455545544 467778899999999 Q ss_pred HHHHHHH--HhhhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHcCCcCCCCcceEEEe-e Q lcl|NC_019408. 366 QIAAIGG--RMMPGASKSVSESNNQTVLREANEQSLLLNIIQACESGM-----TDVVRWWLMWRDVPLADTENLRYEV-N 437 (612) Q Consensus 366 qm~~lGa--~ll~~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~-----~~~l~~~a~w~g~~~~~~~~~~v~l-n 437 (612) .|..+=- .+... ....-||+....+...-...|..+-..+.+=+ .++|.++.+ .|.-..-+.++ +.. 
- T Consensus 361 rI~~af~~d~~~~~--~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r-~g~lP~~p~~l-~~~~~ 436 (547) T protein:vir:10 361 AVRRIYYVDQLQMK--DSPAMTATEVQVRYELMQRLLGPTLGRLENDFLSPMIQRTFNIRFR-AGKLGELPSKL-LESGK 436 (547) T ss_pred HHHHHhhhhhhhcC--CCccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh-cCCCCCCchhh-hccCc Confidence 8875421 11212 22346899999988888888888877775433 334444433 23211101111 000 0 Q ss_pred ccccccCCCHHHHHHHHHHHHcCCCCHHHHHHHHHh-cCccch----hhhhHHHHHHhhccccccccchhHHhhhhhhHH Q lcl|NC_019408. 438 TDFLSTPIGAREMRAIQLMANDGLLPDPVFYEYMRK-AEVISS----DMTFEEFQALRADENSFINNPDAQARQRGYTNR 512 (612) Q Consensus 438 ~dF~~~~~d~~~~~al~~~~~~G~is~et~~~~lqr-~~vl~~----~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r 512 (612) .++.+..+++ +.....+.....| .-++..+.. .+ ++| .+++++..+.+++-- +-|...=+.+++ T Consensus 437 ~~~~v~~is~--Laraq~~~~~~~i--~~~~~~v~~laq-~~P~vld~id~d~~~~~~a~~~---Gvp~~~irs~ee--- 505 (547) T protein:vir:10 437 AAMDIVYTGP--LSRAQKIDQAASI--ERWAGSTAQLAE-INPEVLDIPDWDEMVRMLGSLL---GAPQTLMRPKAK--- 505 (547) T ss_pred ceEEEEeccH--HHHHHHHHHHHHH--HHHHHHHHHhhc-cChhhhhcCCHHHHHHHHHHHh---CCChhccCCHHH--- Confidence 1122222211 1111111111111 111221111 11 122 356677677776542 222211111111 Q ss_pred HHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 513 GQELEQSRMAREADFTQQKIDIQERSVAVQEGHAEVAHAAGSTSISGSRKLGDPEQAKPAVADQAT 578 (612) Q Consensus 513 ~~~~e~~r~~~e~e~~~q~~e~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~e~q~k~~~~eq~~ 578 (612) .++-|++++. +||. .++...++.+.+.-....... 
+-+.|-+ T Consensus 506 ---v~~~r~qr~~--~~q~--~~qaa~~~~~g~~m~~~~~~~-----------------a~~~~~~ 547 (547) T protein:vir:10 506 ---VTSIRKNRSQ--TQQK--AEQAAIAEAEGNAMEAQGKGQ-----------------AALKENQ 547 (547) T ss_pred ---HHHHHHHHHH--HHHH--HHHHHHHHHHHHHHHhhcCcc-----------------cchhccC Confidence 1111211110 0000 000000011110000000000 0000000 No 147 >protein:vir:79647 Length: 435 # NCBI annotation: PorT # Family: family:all:297 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285520;genbank:gi:148734503;genbank:GeneID:5220005 Probab=86.26 E-value=0.047 Score=27.87 Aligned_cols=392 Identities=13% Similarity=0.092 Sum_probs=166.2 Q ss_pred CCCcHHHHHHHHHHHH---HHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCcee Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTK---LRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIV 77 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~---i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~ 77 (612) |...- ...+.. +.+.+.|.......+.-|.+... ....+..|.. ..+++.+|+..+..++|+...| T Consensus 4 ~m~~~-----~~~~~~~D~~~~~~~~~~g~~~~~~~~~~~~~--~~~l~~~Y~~----~~l~~~~Vd~~aed~~r~g~~i 72 (435) T protein:vir:79 4 FMSDK-----VKAITKEDGYNEIFGSKDGTFRPNAFYMQRAA--FKALSQFYEE----DGMARRIVDVIPEEMVTPGFKV 72 (435) T ss_pred ccccc-----cccchhhcchhhhhcccccccccCcccCCcCC--HHHHHHHHhc----CchhhhhhccchHHhhcCCcee Confidence 21111 111111 11112221111111111111111 1122333333 3556777888888888888888 Q ss_pred ecC--CHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhh-----hccCceEEEechhhhhcchhhh Q lcl|NC_019408. 78 KNL--PPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKG-----AVATSFAVGYSAENILDWDEVV 150 (612) Q Consensus 78 ~~~--p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~-----~~~rPy~~~~~ae~IinW~~~~ 150 (612) +.. .+.++..++.. .+..-++.+++.+..+|.++|+|+........ 
.+.-.++..+++.+|..- T Consensus 73 ~g~~~~~~~~~~~~~l-----~~~~~l~~a~~~~rl~G~~~i~i~~~d~~~~~~Pl~~~g~i~~i~v~d~~~i~~~---- 143 (435) T protein:vir:79 73 DGVKNEKSFKSRWDEL-----RLNAKIIDALSWSRLFGGSAILAVVADNKMLKSPVKPGAQLEDIRVYDRYQITIH---- 143 (435) T ss_pred cCCChHHHHHHHHHHh-----hHHHHHHHHHHhhhccccEEEEEEecCCCCcccccccCCceeeEEeechhhccch---- Confidence 532 24455555543 57788899999999999999999864221100 011112222222211100 Q ss_pred ccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeecccccccc Q lcl|NC_019408. 151 DMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPS 230 (612) Q Consensus 151 ~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~ 230 (612) + ...|++ .+.|+....|++.. . T Consensus 144 --------------~--------~~~dp~-----------------------------sp~fg~P~~y~v~~-------~ 165 (435) T protein:vir:79 144 --------------E--------RETNAR-----------------------------SVRYGEPKLYKISP-------G 165 (435) T ss_pred --------------h--------hccCCc-----------------------------ccccCcceEEEEec-------C Confidence 0 001111 12222223333210 0 Q ss_pred ccccceeEEEEEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhhhHH-HHHHHHHh Q lcl|NC_019408. 231 GEVKLAYVQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRTYAE-LEYGRLFT 309 (612) Q Consensus 231 g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD-~~~~l~~~ 309 (612) +.... +.+|.+ -+....|.+++..+ ....-..+.+||+..++=.|..|.+.+. -.+++|.. T Consensus 166 ~~~~~---~~iH~S---------Rli~~~g~~~p~~~------~~~~~~~G~S~l~e~~~~~l~~~~~~~~~~~~l~~~~ 227 (435) T protein:vir:79 166 GDIPE---FFVHYS---------RICIIDGERVSNEK------RRQNDGWGASILNKRLIEAIVDYNYCQELATQLLRRK 227 (435) T ss_pred CCCCc---eEEcce---------eEEEecCCcchhhh------ccccCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 00000 011110 01112233333222 1112233677787766666777777666 46778888 Q ss_pred ccceeeeecCCCC-----CCce---------EEEeccccccC-CCCCceeEEecCchhHHHHHHHHHHHHHHHHHHHHH- Q lcl|NC_019408. 
310 ALPVYYAPGTDSE-----GTGE---------YHIGPNMVWEV-PQGSEPGILEYTGQGLKALETALNDKERQIAAIGGR- 373 (612) Q Consensus 310 ~~P~l~i~G~~~~-----~~~~---------l~iG~~~~~~l-p~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~lGa~- 373 (612) .++++.+.|+... .... ..-|.++.+.+ ..+.++..+..+-+++. +.++...++|...-.- T Consensus 228 ~~~v~~~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~i~~~~e~~e~~~~~lsgl~---~~~~~~~~~iaaa~~IP 304 (435) T protein:vir:79 228 QQAVWKARDLALMCDDEEGRYAARLRLAQVDDESGVGKAIGIDATDEEYEVLNSDVSGVP---EFLQEKIDRIVALTGIH 304 (435) T ss_pred cCccccchhHHHhhcCccchHHHHHHHHHHHHhcCCCCceeEecCCcceEEEecccCCHH---HHHHHHHHHHHhhhCCC Confidence 8888888765211 1111 11344555555 44556777777766664 4445555555443211 Q ss_pred --hhhccc-cchhHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHcCCcCCCCcceEEEeeccccccCCC--- Q lcl|NC_019408. 374 --MMPGAS-KSVSESNNQTVLREANEQSLLLNII-QACESGMTDVVRWWLMWRDVPLADTENLRYEVNTDFLSTPIG--- 446 (612) Q Consensus 374 --ll~~~~-~~~~esa~~~~~~~~~~~s~L~~~a-~~~~~a~~~~l~~~a~w~g~~~~~~~~~~v~ln~dF~~~~~d--- 446 (612) .|..++ +.-+.|+.. +...=...+.++- ..+...++..++++.. ..++.|.+|+=....... T Consensus 305 ~t~L~G~s~~glnstgd~---d~~~yyd~i~~~Qe~~l~p~l~~l~~li~~--------s~d~~~~f~pL~~~sekEkAe 373 (435) T protein:vir:79 305 EIIIKNKNTGGVSASQNT---ALETFYKLIDRKRVEDYKPILEFLLPFMIS--------ETEWSIEFEPLSVPSDKDKAE 373 (435) T ss_pred eeeeccCCccccccchhH---HHHHHHHHHHHHHHHHHHHHHHHHHHHhhc--------CCCCeEEeCCCCCCCHHHHHH Confidence 121211 111112211 2112222222221 1234444455554421 246777776433222211 Q ss_pred --HHHHHHHHHHHHcCCCCHHHHHHHHHhc----CccchhhhhHHHHHHhhccccccccchhHHhhhhhhH Q lcl|NC_019408. 447 --AREMRAIQLMANDGLLPDPVFYEYMRKA----EVISSDMTFEEFQALRADENSFINNPDAQARQRGYTN 511 (612) Q Consensus 447 --~~~~~al~~~~~~G~is~et~~~~lqr~----~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~ 511 (612) ...+++...++++|.|+.+..++.|+.. |+-+.... .+.+... . 
.+ ..-.+-++.+ T Consensus 374 i~~~~a~a~~~~~~~g~i~~~e~r~~L~~~~~~~~~~~~~~~------~~~~~~d-~-~~-~~~~e~g~~~ 435 (435) T protein:vir:79 374 IMAKNVESVVKLKAEQAINLKETRDTLRSICPDLKIMDNDNI------ELPEPED-L-DP-EPGQEGGLNK 435 (435) T ss_pred HHHHHHHHHHHHHhcCCCCHHHHHHHHHHhccccCCCCcccc------cCCcccc-C-CC-CCCCCCCCCC Confidence 1234556788999999999999988642 22211111 1111000 0 00 0000011111 No 148 >protein:vir:95315 Length: 559 # NCBI annotation: putative head-to-tail-joining protein # Family: family:all:481 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512261;genbank:gi:89152428;genbank:GeneID:3952984 Probab=84.05 E-value=0.063 Score=27.15 Aligned_cols=515 Identities=10% Similarity=0.029 Sum_probs=183.2 Q ss_pred CCcHHHHHHHHHHHHHHHHhcC-hHHHHhcccccCCCCCC-CCHHHHHHHHhh-ccCCchHHHHHHHhh----chhhc-C Q lcl|NC_019408. 2 VTHPEYQYWRPEWTKLRDVMAG-QREIKRKAEAYLPAMKG-ADGDDYAIYLQR-ATFFNMLAQTRDGMT----GMVFR-R 73 (612) Q Consensus 2 ~~hP~y~~~~~~W~~i~d~~~G-~~~vr~~g~~YLPk~~~-e~~~~Y~~rl~r-A~~~n~~~~tv~~~~----G~vf~-k 73 (612) .--+.-.....+|..+..--.= ...|+....-.||.... .++..-...... -.|-+.-.+.++.++ +.+|. . T Consensus 1 m~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~Las~l~~~ltpp~ 80 (559) T protein:vir:95 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITSPA 80 (559) T ss_pred CChhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCcCCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCCC Confidence 1112222222222222111000 22333333333454322 111111111111 124444444444443 33333 1 Q ss_pred Cceee------cCC--HHHHHHHhccC------CCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEec Q lcl|NC_019408. 74 DPIVK------NLP--PKFKDAVRRFA------KDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYS 139 (612) Q Consensus 74 ~p~~~------~~p--~~l~~~~~d~D------~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ 139 (612) .|=+. ++. ..++.|++.|. ....+++.-+-.++.+.+.+|-+-+++|.... 
....+..|+ T Consensus 81 ~~WF~l~~~d~~~~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~Gta~l~~~~d~~------~~~r~~~~~ 154 (559) T protein:vir:95 81 RPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLDDDE------DIIRTMPFP 154 (559) T ss_pred CcccccccCCccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeEeecCCC------ceeEEEEee Confidence 11110 111 33444543321 12356777788889999999998888875321 112344555 Q ss_pred hhhhhcchhhhccCCccceeEEEEEEEeeccccccCCCccccccee-eeeeEeeecccccccceeecccccccccceeee Q lcl|NC_019408. 140 AENILDWDEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQAR-KARAAALASGSASSPMVRQTARTLGGYSYITVY 218 (612) Q Consensus 140 ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~-q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~ 218 (612) ..++. | ..|+...+.-|+.++.+. .+.-.+.|+..... .++.. +..+ +....++++ T Consensus 155 l~~~~-v----~~d~~G~vd~i~r~~~~t---~~ql~~~fg~~~l~~~~~~~-~~~~--------------~~~~~v~v~ 211 (559) T protein:vir:95 155 IGSYY-L----ANSPRGSVDTCFRKFSMT---VRQLVQEFGLNNVSESVKSM-WESG--------------TYEKWIEVM 211 (559) T ss_pred cCeEE-E----eeCCCCCeEEEEEeEecC---HHHHHHHcCcccCCHHHHHH-HhcC--------------CCCCeEEEE Confidence 44432 1 223333343333332221 11112222211111 00000 0000 000111222 Q ss_pred e-eeeccccc-cccccccceeEEEEEeeCCCceecceeeeccCC-ccccceeEEEeecCCCCCCcCcCc---hHHHHHHH Q lcl|NC_019408. 219 R-ELKLEEIE-WPSGEVKLAYVQYLYEEDPESRPIARIVPTVRG-EPLDFIPFKFFGASGNTADVEKPP---LLDICDLN 292 (612) Q Consensus 219 R-~~~~~~~~-~~~g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g-~~l~~IP~v~~~~~~~~~~~~~pP---LldLA~ln 292 (612) . +....... 
.+.+..+-.+....|+.++.+ ..+...+| ..+++||+.|.-..+..+..+.|- |-|+..|| T Consensus 212 ~~V~pr~~~~~~~~~~~~~pf~s~~~e~~~~~----~~~l~esg~~e~P~~~~Rw~~~~ge~YGrg~P~~~al~d~k~L~ 287 (559) T protein:vir:95 212 HSVYPNIDRDTSKLDSKNKPFKSVYYEVGGDN----DKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQ 287 (559) T ss_pred EEEeccccccccccccccceEEEEEEEecCCC----ceeeecCCcccCCccceeeeecCCccccccchHHHhhHHHHHHH Confidence 1 11100000 000001111111222222221 11222333 357888888887766666554432 45666666 Q ss_pred HHHHhhhHHHHHHHHHhccceeeeecCCCCCCceEEEeccccccCCCCC--c-eeEEecCchhHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 293 LSHYRTYAELEYGRLFTALPVYYAPGTDSEGTGEYHIGPNMVWEVPQGS--E-PGILEYTGQGLKALETALNDKERQIAA 369 (612) Q Consensus 293 l~HY~~~sD~~~~l~~~~~P~l~i~G~~~~~~~~l~iG~~~~~~lp~~~--~-~~~lE~~g~~l~~~~~~l~~~e~qm~~ 369 (612) .- +-..-...+.+..|.+.+++ +.....+.+.|+..+..+.+. + +..+......+..+...|++++..++. T Consensus 288 ~l----~~~~l~~~~~~~~pp~~v~~--~~~~~~~~l~pgg~~~~~~~~~~~~i~p~~~~~~~~~~~~~~i~~~~~rI~~ 361 (559) T protein:vir:95 288 LL----QKRKSQLIDKATNPPMVAPT--SLKNQRASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINS 361 (559) T ss_pred HH----HHHHHHHHHHHhcCceeccc--cccccceeeeccceeeeCCCCCcccceeecccccchHHHHHHHHHHHHHHHH Confidence 42 22234456666666555532 222345667676665444322 1 222222123455566778888888865 Q ss_pred HH-HH---hhhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHcCCcCCCCcceEEEeeccc Q lcl|NC_019408. 370 IG-GR---MMPGASKSVSESNNQTVLREANEQSLLLNIIQACESGM-----TDVVRWWLMWRDVPLADTENLRYEVNTDF 440 (612) Q Consensus 370 lG-a~---ll~~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~-----~~~l~~~a~w~g~~~~~~~~~~v~ln~dF 440 (612) += .. ++... ....-||++...+...-...|..+..++.+=+ +++|.++-+- |.-..-+.++. 
..++ T Consensus 362 af~~d~~~~l~~r-~~~rvTAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~-g~lP~~p~~l~---~~~i 436 (559) T protein:vir:95 362 AYFVDLFMMLQNI-NTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRSFSMMVRK-NMLPPPPDVME---GMPL 436 (559) T ss_pred HhhhhhHHHhhcC-CCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCccccc---Ccce Confidence 42 11 12111 22234899999888888888888888875442 3444444442 32000011110 0122 Q ss_pred cccCCCHHHHHHHHHHHH-cCCCCHHHHHHHHHhc--CccchhhhhHHHHHHhhccccccccchhHHhhhhhhHHHHhHH Q lcl|NC_019408. 441 LSTPIGAREMRAIQLMAN-DGLLPDPVFYEYMRKA--EVISSDMTFEEFQALRADENSFINNPDAQARQRGYTNRGQELE 517 (612) Q Consensus 441 ~~~~~d~~~~~al~~~~~-~G~is~et~~~~lqr~--~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~e 517 (612) .+..+.+ +.+...+.. .+.+..-.++..+... .++ +.+++++..+.+++-- +-|...=+.+++ .++.- T Consensus 437 ~v~~is~--La~aqk~~~~~~i~~~~~~~~~laq~~Pevl-d~id~d~~~~~~a~~~---Gvp~~~irs~~e---v~~~r 507 (559) T protein:vir:95 437 KVEYISV--MAQAQKSIGLSSLASTVNFIGQLAQVKPEAL-DKLNVDQAIDAFADMS---GVSPTVIVPQEQ---VEQAR 507 (559) T ss_pred EEEeecH--HHHHHHHHHHHHHHHHHHHHHHHhccChhhh-hcCCHHHHHHHHHHHh---CCchhhcCCHHH---HHHHH Confidence 2222211 222222211 1112212222222221 111 2356667777776542 222211111111 00011 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCc Q lcl|NC_019408. 518 QSRMAREADFTQQKIDIQERSVAVQEGHAEVAHAAGSTSISGSRKLGDPEQAKPAVADQATIDNAKKQTANAAKVAAQPP 597 (612) Q Consensus 518 ~~r~~~e~e~~~q~~e~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~e~q~k~~~~eq~~~~~~~k~~~~~a~~~~~~~ 597 (612) ++|+++ +++ +| +++...+ . .+..+...+. +....+... +.+. -....- T Consensus 508 qqr~~~----qq~---~q--~~~~~~~-a--a~~~~~~~~~---~~~~~~~l~-~~~~----------------~~~~~~ 555 (559) T protein:vir:95 508 QQRAQQ----QQQ---QQ--MMAMGMA-A--AQGVKTLSEA---KTSDPSVLS-AMAN----------------AVSGQG 555 (559) T ss_pred HHHHHH----HHH---HH--HHHHHHH-H--HHhhhccccc---cCCChhHHH-HHHH----------------hhcCcc Confidence 111110 000 00 0000000 0 0000000000 000000000 0000 000000 Q ss_pred hhhc Q lcl|NC_019408. 
598 APAA 601 (612) Q Consensus 598 ~~~~ 601 (612) ++++ T Consensus 556 ~~~~ 559 (559) T protein:vir:95 556 GQSQ 559 (559) T ss_pred ccCC Confidence 0000 No 149 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=83.93 E-value=0.064 Score=27.11 Aligned_cols=410 Identities=12% Similarity=-0.027 Sum_probs=174.4 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCC-CCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeec Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLP-AMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKN 79 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLP-k~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~ 79 (612) +---|.+.. .+. .++|+..-.+...-|.+ .++ +.+-+..|. .....+.+|+..+...+++...|+. T Consensus 70 ~d~~~~~~~-----~~~--~~~~~~~~~~~~~~~~~~~~~--~~~l~a~Y~----~~~l~r~iVd~~A~d~~r~~~~i~~ 136 (537) T protein:vir:10 70 MDGLDVEGG-----TFS--AYANPNLSEGLVLWYAQQAFI--GHQMCALIA----THWLVNKACSQMPRDAMRKGYKIIS 136 (537) T ss_pred ccccccchh-----hhh--hhccccccchhhhhccccCCc--cHHHHHHHH----hCchhhhhhhhhhHHhhcCCceeec Confidence 222222111 111 22333332322222322 222 233344443 3457788889999999999998842 Q ss_pred C------CHH---HHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCc------------eEEEe Q lcl|NC_019408. 80 L------PPK---FKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATS------------FAVGY 138 (612) Q Consensus 80 ~------p~~---l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rP------------y~~~~ 138 (612) . |+. |+..++. ..+..-++.++..+..||.++++|.-...+.... 
..| |++.| T Consensus 137 ~~~~~~~~~~~~~l~~~~~~-----l~~~~~l~~a~~~~rlyG~~~i~i~v~~~D~~~~-~~Pl~~~~i~kg~~k~l~vi 210 (537) T protein:vir:10 137 DDGNELDPKDAKFIDRYDRA-----FNIKKHAIQFVRKGRIFGIRIALFKVDSPDPYYY-EKPFNIDGVMPGAYKGIVQI 210 (537) T ss_pred CCcccccHHHHHHHHHHHHH-----hhHHHHHHHHHHhcccccceEEEEeecCcCCccc-ccccccccccccceeEEEEe Confidence 1 122 3333333 3577778888888888998887776433221100 011 12222 Q ss_pred chhhhhcchhhhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeee Q lcl|NC_019408. 139 SAENILDWDEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVY 218 (612) Q Consensus 139 ~ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~ 218 (612) ++-.+-- . ...+. ..|+ ..+.|+....| T Consensus 211 dp~~~~~-----~----------~~~~~--------~~dp-----------------------------~sp~fg~P~~y 238 (537) T protein:vir:10 211 DPYWCAP-----L----------LDAQA--------SSNP-----------------------------VSMHFYEPTYW 238 (537) T ss_pred chhhccc-----c----------cchhh--------hccC-----------------------------CccccCCceee Confidence 2111100 0 00000 0011 11122222223 Q ss_pred eeeeccccccccccccceeEEEEEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCcCchHHHHHHHHHHHhh Q lcl|NC_019408. 219 RELKLEEIEWPSGEVKLAYVQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEKPPLLDICDLNLSHYRT 298 (612) Q Consensus 219 R~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~HY~~ 298 (612) ++. +. .+|. ..++ ...|.+++.+.--. .+ ..|.| ++..+.=.|..|.. T Consensus 239 ~v~-------------g~---~iH~--------SRli-~f~g~~~p~~~~~~----~~--~~G~S-vlq~~~~~l~~~~~ 286 (537) T protein:vir:10 239 LIN-------------GK---KYHR--------SHLA-IYINDEVVDFLKPS----YI--YGGVP-LPQQIMERVYAAER 286 (537) T ss_pred eec-------------Ce---Eecc--------eeEE-EecCCCCchhhhcc----cC--ccccc-HHHHHHHHHHHHHH Confidence 221 00 1110 0011 11233333332111 11 22444 44445556666665 Q ss_pred hHH-HHHHHHHhccceeeeecCCCCCCc-eE---------EEeccccccCCCC-CceeEEecCchhHHHHHHHHHHHHHH Q lcl|NC_019408. 
299 YAE-LEYGRLFTALPVYYAPGTDSEGTG-EY---------HIGPNMVWEVPQG-SEPGILEYTGQGLKALETALNDKERQ 366 (612) Q Consensus 299 ~sD-~~~~l~~~~~P~l~i~G~~~~~~~-~l---------~iG~~~~~~lp~~-~~~~~lE~~g~~l~~~~~~l~~~e~q 366 (612) .+. -..+++...++++-+.|+..-.++ .+ .-+....+.+..+ .++..+..+-+++. +.++...++ T Consensus 287 t~~~~~~l~~~~~~~v~k~~~~~~l~~~~~~~~r~~~~~~~r~n~g~~~id~e~e~~e~~~~~lsgl~---~~l~~~~~~ 363 (537) T protein:vir:10 287 TANEGPMLAMTKRQTVLKVDAAQVLANKQQFDETMSWWTATRDNYQVRVVDKDNEDVVQIDTTLNDLD---KVIMNQYQL 363 (537) T ss_pred HHHHHHHHHHhcCCceeeechHHhhcCHHHHHHHHHHHHhhcCCcceeEecCCCceeEEEeccCCCHH---HHHHHHHHH Confidence 554 567888888888888775322111 11 1233455666664 56666666666654 445555555 Q ss_pred HHHH-H--HHhhhccc-cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCcceEEEeeccccc Q lcl|NC_019408. 367 IAAI-G--GRMMPGAS-KSVSESNNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLADTENLRYEVNTDFLS 442 (612) Q Consensus 367 m~~l-G--a~ll~~~~-~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~~~~~v~ln~dF~~ 442 (612) |..+ | +..|..++ +.-+.|+ ..+...=+..+.++-..+.-+++.+++++.+-.+ +...+++|..|.=+.. T Consensus 364 iAa~~~IP~t~L~G~sp~GlnatG---e~D~~~yyd~I~~~Qe~l~p~l~~l~~ll~~~~~---~~~~~~~i~f~pL~~~ 437 (537) T protein:vir:10 364 VCAIARTPAPKMLGTVPTGFNSTG---DYEEASYHEECESTQDDMRPLIDRHHQLVCRSHL---RKRIRVKVEFPPMDAP 437 (537) T ss_pred HHhhhCCCceeeccCCccccccch---hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC---CCCcceEEEeCCCCCC Confidence 5443 2 11122222 1111222 2233333344555544567777777776654222 1233566665532212 Q ss_pred cCCCHH-----HHHHHHHHHHcCCCCHHHHHHHHHhc------CccchhhhhHHHHHH-hhccccccc------cch--h Q lcl|NC_019408. 443 TPIGAR-----EMRAIQLMANDGLLPDPVFYEYMRKA------EVISSDMTFEEFQAL-RADENSFIN------NPD--A 502 (612) Q Consensus 443 ~~~d~~-----~~~al~~~~~~G~is~et~~~~lqr~------~vl~~~~~~eee~~r-ia~e~~~~~------~~~--~ 502 (612) ....-. ..++...++++|.|+....++.|..- ++ .+..+.+++... +.++.+..+ .+. 
+ T Consensus 438 s~kEkAei~~~~a~a~~~~~~~G~i~~~Evr~~L~~~~~~g~~~l-~~~~~~ed~e~~~~~~~~~~~~~~~~~~~~~~~~ 516 (537) T protein:vir:10 438 KESERADTFLKKMQAAKLAFEMGAVDGVDVNEYLRMDPTLGFTSI-TPAMRPTDAEDIDVDDEGKPVRIIEDQPAPSEMF 516 (537) T ss_pred CHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHhccCccccccc-cCCCChhhhhcccCCccCCcCCCCCCCCCccccC Confidence 211111 23567788999999999999999863 33 233333333222 222221111 000 0 Q ss_pred HHhhhhhhHHHHhHHHHHHHHHHHHHH Q lcl|NC_019408. 503 QARQRGYTNRGQELEQSRMAREADFTQ 529 (612) Q Consensus 503 ~~~~~~e~~r~~~~e~~r~~~e~e~~~ 529 (612) ...+..+... ..+...+..+- T Consensus 517 ~~~~~~~~~~------~~~~~~a~~~~ 537 (537) T protein:vir:10 517 GATSSGESAN------DPRDSGAAFED 537 (537) T ss_pred CCCccccccC------CCccCccccCC Confidence 0111100000 00000000000 No 150 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=83.79 E-value=0.066 Score=27.07 Aligned_cols=365 Identities=13% Similarity=0.058 Sum_probs=146.6 Q ss_pred CCCcHH-HHHH----HHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCc Q lcl|NC_019408. 1 MVTHPE-YQYW----RPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDP 75 (612) Q Consensus 1 ~~~hP~-y~~~----~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p 75 (612) |...+. .... -+.| +..+.+|. 
.| .|+ ..+..|..+..+..+..+.+.++++.|+ T Consensus 3 ~f~~~~~~~~~~~~~~~~~--~~~~~~~~-----~~-~~v---------~~~~al~~~~V~~~v~~ia~~ia~~p~~--- 62 (397) T protein:vir:38 3 LLKLNKSHSQGFSLNDPDW--VNFLTGGE-----AQ-KYV---------SADTALKNSDIFSLIMQLSGDLAMVRYT--- 62 (397) T ss_pred chhhhhcccCcccCCchhh--hhhhcCCc-----CC-cee---------chHHhhccHHHHHHHHHHHHHHhhCccc--- Confidence 111000 0000 0001 00001110 00 111 1123455544555555555555555554 Q ss_pred eeecCCHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCc-eEEEechhhhhcchhhhccCC Q lcl|NC_019408. 76 IVKNLPPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATS-FAVGYSAENILDWDEVVDMGG 154 (612) Q Consensus 76 ~~~~~p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rP-y~~~~~ae~IinW~~~~~v~g 154 (612) .+ .+.+..|..+... ..+-.+|.+.++...+.+|-|++++.+... .+| -+..++|..|-=+ T Consensus 63 -~~--~~~~~~l~~~PN~-~~s~~~f~~~~~~~lll~Gna~~~i~r~~~------g~~~~l~~l~~~~v~i~-------- 124 (397) T protein:vir:38 63 -SE--SDRSQSIISNPSV-TANGYSFWQGMFAQLLLDGNCYAYRHKNTN------GVDLSWEYLRPSQVQPM-------- 124 (397) T ss_pred -cc--ccHHHHHHhcCCC-CCCHHHHHHHHHHHhhhcCCEEEEEEECCC------CcEEEEEEEcCceeEEE-------- Confidence 32 2345566666654 688999999999999999999999875421 122 2222333222000 Q ss_pred ccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeecccccccccccc Q lcl|NC_019408. 155 FYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVK 234 (612) Q Consensus 155 ~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~ 234 (612) ...+| + ...|++..... ..|. T Consensus 125 ------------------------------------~~~~~---------------~---~~~y~~~~~~~---~~~~-- 145 (397) T protein:vir:38 125 ------------------------------------LLQDG---------------S---GLIYNINFDEP---AIGY-- 145 (397) T ss_pred ------------------------------------EcCCC---------------c---eEEEEEEeccc---cccc-- Confidence 00000 0 00111100000 0000 Q ss_pred ceeEEEEEeeCCCceecceeeeccCCccccceeEEEe-ecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHHH-HHhccc Q lcl|NC_019408. 
235 LAYVQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFF-GASGNTADVEKPPLLDICDLNLSHYRTYAELEYGR-LFTALP 312 (612) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~-~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~l-~~~~~P 312 (612) ....| -+. ++.+ +....+...+.||+..+. ..|.......++...+ ...+.| T Consensus 146 -----------------~~~~~------~~e--iih~~~~~~~~~~~G~s~i~~~~-~~i~~~~~~~~~~~~~f~ng~~~ 199 (397) T protein:vir:38 146 -----------------MENVP------AAD--VIHIRLLSKNGGKTGISPLSALI-NEQQIKDASNELTLKALKQSVTA 199 (397) T ss_pred -----------------eeEec------Ccc--EEEecCCCCCCccccccHHHHHH-HHHHHHHHHHHHHHHHHhccCCc Confidence 00000 011 1111 112233345778776544 4555556655554444 445778 Q ss_pred eeeeecCCCCCCc----------eEEEe--ccccccCCCCCceeEEecCchhHHH-HHHHHHHHHHHHHHH-HH--Hhhh Q lcl|NC_019408. 313 VYYAPGTDSEGTG----------EYHIG--PNMVWEVPQGSEPGILEYTGQGLKA-LETALNDKERQIAAI-GG--RMMP 376 (612) Q Consensus 313 ~l~i~G~~~~~~~----------~l~iG--~~~~~~lp~~~~~~~lE~~g~~l~~-~~~~l~~~e~qm~~l-Ga--~ll~ 376 (612) ..++.-......+ ...-| ++.++.++.| ++|.+.+.++-.. ..+..+-...++..+ |. -++- T Consensus 200 ~~il~~~~~~~~e~~~~~~~~~~~~~~~~n~~~~~vl~~g--~~~~~l~~~~~d~~~~e~~~~~~~~Ia~afgVp~~~lg 277 (397) T protein:vir:38 200 SAVLTIQKGGLLDAETRIARSKEISKQIHNSDGPVVIDAL--EDYKPLEVKGNIASLLNQVDWTRDQIAKVYGVPDSYLN 277 (397) T ss_pred cEEEEeCCCCCHHHHHHHHHHHHHHhcccccCCceecCCC--ceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhC Confidence 8887622111111 01112 2234556655 5555555443322 245555555666544 42 2221 Q ss_pred ccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCcceEEEeeccccccCCCHHHHHHHHHH Q lcl|NC_019408. 377 GASKSVSESNNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLADTENLRYEVNTDFLSTPIGAREMRAIQLM 456 (612) Q Consensus 377 ~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~~~~~v~ln~dF~~~~~d~~~~~al~~~ 456 (612) ... ..+.+..+.... 
-...|.-++..++++++.-|- ...+ ++-+|...........++-.+ T Consensus 278 ~~~-~~~~~~e~~~~~---~~~~l~P~~~~ie~~ln~~l~-----------~~~~----~~~~~~~~~d~~~~~~~~~~~ 338 (397) T protein:vir:38 278 GQG-DQQSSITQISGQ---YAKSLNRYVQAIVGELNDKLH-----------ANIS----ANIRFAIDAMGDQYASTISSS 338 (397) T ss_pred CCC-CcccHHHHHHHH---HHHHHHHHHHHHHHHHHHhcc-----------Chhc----ccccccccCCHHHHHHHHHHH Confidence 111 111111111111 123455666666666654331 1111 122333333234456777889 Q ss_pred HHcCCCCHHHHHHHHHhcCccchhhhhHHHHHHhhc-ccc-----ccccchhHHhhhhh Q lcl|NC_019408. 457 ANDGLLPDPVFYEYMRKAEVISSDMTFEEFQALRAD-ENS-----FINNPDAQARQRGY 509 (612) Q Consensus 457 ~~~G~is~et~~~~lqr~~vl~~~~~~eee~~ria~-e~~-----~~~~~~~~~~~~~e 509 (612) ++.|.|+.-..++.+-.-.+.+.+.-.-+....... ..+ ..+....+...+++ T Consensus 339 ~~~G~~t~nE~R~~lg~~p~~~~d~~~~~~~~~~~~~~~~~~~g~~~~~~~~e~~~~~~ 397 (397) T protein:vir:38 339 VKGGTIAGNQARFILQNSGYLAKDLPDPEKEPQQAIQLIQQEGGENDGNNSDERGSDPE 397 (397) T ss_pred HhCCCcCHHHHHHHhCCCCCCCCccccccccccccccccccccCCCCCCCCCCCCCCCC Confidence 999999999888877554443332111110000000 000 00000000000011 No 151 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=81.22 E-value=0.088 Score=26.38 Aligned_cols=449 Identities=9% Similarity=0.011 Sum_probs=167.1 Q ss_pred CCCcHHH--HHHHHHHHHHHHHh-----cChHHHHhcccccCCCCCCCCHH--HHHHHH-hhc----cCCchHHHHHH-- Q lcl|NC_019408. 1 MVTHPEY--QYWRPEWTKLRDVM-----AGQREIKRKAEAYLPAMKGADGD--DYAIYL-QRA----TFFNMLAQTRD-- 64 (612) Q Consensus 1 ~~~hP~y--~~~~~~W~~i~d~~-----~G~~~vr~~g~~YLPk~~~e~~~--~Y~~rl-~rA----~~~n~~~~tv~-- 64 (612) |+.++.- ....+.|...+... 
.|..........|-|..-.-+.+ .+..+| .|| .=.++.+..|+ T Consensus 1 m~~~~~r~~~~~a~~~~~~~~~~~~~~y~gA~~~~r~~~~w~~~~~s~~~~~~~~~~~lr~RaRdL~rNn~~a~~av~~~ 80 (553) T protein:vir:63 1 MTKVTVRKLSEVTSGRPEQSASLGGGGLEGASRLSRETVSWNPSLRSPDALINPLKRIADARGRDMADNDGFTNGAVGYQ 80 (553) T ss_pred CcchhhhhhcccccccchhhhhhhcccccccccCCCcccccccCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHH Confidence 4443322 22233333322222 12111111112233432211111 122222 111 22334444444 Q ss_pred --HhhchhhcCCceee--cC-----------CHHH----HHHHhc----cCCCCC-CHHHHHHHHHHHHHHhCCeEEEEe Q lcl|NC_019408. 65 --GMTGMVFRRDPIVK--NL-----------PPKF----KDAVRR----FAKDGS-SHATFAKAVLSEQAGVGRFGVLVD 120 (612) Q Consensus 65 --~~~G~vf~k~p~~~--~~-----------p~~l----~~~~~d----~D~~G~-~l~~f~~~~~~~~l~~Gr~~vlVD 120 (612) ..+|-=|+-.++.+ -+ -..+ +.|.++ ||-.|. +++.+...+++..+..|=|++..- T Consensus 81 ~~nvVG~Gi~~~~~~~~~~l~g~~~~~~~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~~~ 160 (553) T protein:vir:63 81 RDSIVGAQYRLNSMPDINVIPGATEEWAEEYQTIVEAKFELYAESLACYIDNAAISTFTGLIRLGVVGYVKTGEVLATAE 160 (553) T ss_pred HHhhccCCceeeeccchhhhcCCCHHHHHHHHHHHHHHHHHhcCCccceeeccccCCHHHHHHHHHHHHHhCCceEEEee Confidence 44444333222210 00 1122 344442 465554 999999999999999999988876 Q ss_pred cCcchhhhhccCce---EEEechhhhhcchhhhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccc Q lcl|NC_019408. 121 VVDNPRKGAVATSF---AVGYSAENILDWDEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSA 197 (612) Q Consensus 121 ~p~a~~~~~~~rPy---~~~~~ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~ 197 (612) +.... ..|| +-+|.|+.|=+.. ... +|.. +.+-.++ ... 
T Consensus 161 ~~~~~-----~~~~~~~lq~ie~drl~~~~-~~~-~~~~---------------------------i~~GVE~----d~~ 202 (553) T protein:vir:63 161 WDRAA-----NRPYATCFQMVSTDRLSNPY-QQL-DTPT---------------------------LRRGVQY----DKR 202 (553) T ss_pred eccCC-----CCcccceEEEechhhcCCCC-CCC-CCCe---------------------------eEeeeEE----CCC Confidence 54321 1122 3455555554321 100 1110 0000000 000 Q ss_pred cccceeecccccccccceeeeeeeeccccccccccccceeEEEEEeeCCCceecceeeeccCCccccceeEEEeecCCCC Q lcl|NC_019408. 198 SSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAYVQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNT 277 (612) Q Consensus 198 ~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~ 277 (612) |.++ -|++.....+..- .. ......+..++...-.+-..|=- .+-..+.+ T Consensus 203 Gr~v---------------aY~i~~~hPgd~~-------------~~-~~~~~~~~r~~~~~~v~a~~vlH-~f~~~r~g 252 (553) T protein:vir:63 203 GRPQ---------------GYWIQVAHPGDLY-------------QM-APDMYKWKFVQQSKPWGRRQVIH-ILEPREPD 252 (553) T ss_pred CceE---------------EEEeeccCCCccc-------------cc-cccccceeeeccccccChhHhee-cccccCCC Confidence 1111 1211111100000 00 00000001111000001111111 12223344 Q ss_pred CCcCcCchHHHHHH--HHHHHhhhHHHHHHHHHhccceeeeecCCCC-------------------------------CC Q lcl|NC_019408. 278 ADVEKPPLLDICDL--NLSHYRTYAELEYGRLFTALPVYYAPGTDSE-------------------------------GT 324 (612) Q Consensus 278 ~~~~~pPLldLA~l--nl~HY~~~sD~~~~l~~~~~P~l~i~G~~~~-------------------------------~~ 324 (612) -.-|.|.|..+... .+..|.. +.+....-.+++...+-++.+.+ .. T Consensus 253 Q~RGis~lapvl~~l~~l~~y~d-aeL~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 331 (553) T protein:vir:63 253 QSRGIADIVSGLKDMRMAKRFKE-MSLQNAVINASYAAAIESELPPEFIHSQMSGGSPNADMVGIFGKYMDALKAYVGGA 331 (553) T ss_pred cccCCchHHHHHHHHHHHhHHHH-HHHHHHHHhhhheeeeecCCChhhhhhhcccccccccccccccccccccccccccc Confidence 44566766544222 3334443 33444444445544333322110 11 Q ss_pred ceEEEeccccccCCCCCceeEEecC-c-hhHHHHHHHHHHHHHHHHH-HHH--Hhhhcc-ccchhHHHHHHHHHHHHHHH Q lcl|NC_019408. 
325 GEYHIGPNMVWEVPQGSEPGILEYT-G-QGLKALETALNDKERQIAA-IGG--RMMPGA-SKSVSESNNQTVLREANEQS 398 (612) Q Consensus 325 ~~l~iG~~~~~~lp~~~~~~~lE~~-g-~~l~~~~~~l~~~e~qm~~-lGa--~ll~~~-~~~~~esa~~~~~~~~~~~s 398 (612) ..+.++++.+..|++|.+++|+.++ + ..+. ..++.+...|.+ +|. .+|... ++..=.|+.+..+++-.... T Consensus 332 ~~~~l~pG~i~~L~pGe~i~~~~p~~p~~~~~---~F~~~~lr~iaaglGi~Ye~lt~D~s~~nYSS~R~~~~e~~r~~~ 408 (553) T protein:vir:63 332 NNIQIDGAKIPHLFPGTKLNLKPMGTPGGVGS---EFEASLNRHLASAFGMSYEEFTRDFSKANYSSIQAGIAMTRRFLE 408 (553) T ss_pred cceeecCceeeecCCCCeeeecCCCCCCCCHH---HHHHHHHHHHHhhcCCCHHHHhhhcccccHHHHHHHHHHHHHHHH Confidence 2367899999999999999999987 3 3443 333333334322 221 112221 22222345555566555555 Q ss_pred HHHH-HHHHHHHHHHHHHHHHHHHcCC-cCCCCcceEE---------EeeccccccC---CCHH-HHHHHHHHHHcCCCC Q lcl|NC_019408. 399 LLLN-IIQACESGMTDVVRWWLMWRDV-PLADTENLRY---------EVNTDFLSTP---IGAR-EMRAIQLMANDGLLP 463 (612) Q Consensus 399 ~L~~-~a~~~~~a~~~~l~~~a~w~g~-~~~~~~~~~v---------~ln~dF~~~~---~d~~-~~~al~~~~~~G~is 463 (612) .++. ++..++.-+-..+--.|...|. +++....-.+ .++-+|.... +|+. ++++.+.++.+|..| T Consensus 409 ~~q~~~~~~~~~pi~~~wl~~a~l~G~i~~p~~~~~~~~~~p~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~t 488 (553) T protein:vir:63 409 GRKKMCADRLATEFFTLWLEEAIAAGEVPMPPGQTRDLFYQPLMKEALSKCEWIGASQGQIDQLKETQAAVMRIDAGLST 488 (553) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCcccchhhcchhhhhhhhceeeecCCccccChHHHHHHHHHHHHcCCCC Confidence 4444 2233333322222222223332 1111100000 0001122211 2444 899999999999999 Q ss_pred HHHHHHHHHhcCccchhhhhHHHHHHhhccccc---ccc--c-hhHH-hhhhhhHHHHhHHHHHHHHHHHHHHHH Q lcl|NC_019408. 464 DPVFYEYMRKAEVISSDMTFEEFQALRADENSF---INN--P-DAQA-RQRGYTNRGQELEQSRMAREADFTQQK 531 (612) Q Consensus 464 ~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~---~~~--~-~~~~-~~~~e~~r~~~~e~~r~~~e~e~~~q~ 531 (612) .+....++ |. |+++..+.++.+... ++. + +... ......+...+.+...... 
..+..| T Consensus 489 ~~~~~a~~---G~-----D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~e 553 (553) T protein:vir:63 489 YEREIARL---GG-----DFRKSFAQRAREDALLKKYGLTFNLSAKRSLGDGRDAATGIAEDPAAAQ--TSQQGE 553 (553) T ss_pred HHHHHHHh---CC-----CHHHHHHHHHHHHHHHHHcCCCCCCCCccccCCCcccCCCCCCCCCCCC--cccccC Confidence 98776554 43 445544444433210 000 0 0000 0000000000000000000 000000 No 152 >protein:vir:79233 Length: 526 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469155;genbank:gi:157834998;genbank:GeneID:5648814 Probab=79.39 E-value=0.1 Score=25.95 Aligned_cols=472 Identities=13% Similarity=0.100 Sum_probs=176.2 Q ss_pred CCCcHHHHHHHHHHHHHHHH-hcC------hHHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcC Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDV-MAG------QREIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRR 73 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~-~~G------~~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k 73 (612) -+.-|.=.....-|+.+..- ..| -..++.+..-.++.. -+-|+.-+.+ -.++...++.....|... T Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~il~~a~~gd~~~~----~~L~edm~e~---D~~i~s~l~~Rk~av~~~ 88 (526) T protein:vir:79 16 QLREPQTSRLAGLAKEFAQHPAKGLTPAKLARILVEAEQGNLQAQ----AELFMDMEER---DAHLFAEMSKRKRAILGL 88 (526) T ss_pred ccchhhhhhhhhhhhhcccCCCCCcCHHHHHHHHHHhhCCCHHHH----HHHHHHHHhh---ChHHHHHHHHHHHHHhCC Confidence 11222111111101111000 000 011111110000000 1223333321 345556666666666666 Q ss_pred Cceee----cCC--HH----HHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhh Q lcl|NC_019408. 74 DPIVK----NLP--PK----FKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENI 143 (612) Q Consensus 74 ~p~~~----~~p--~~----l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~I 143 (612) +..|+ +-+ .. +..++.+. .+++.++..++. 
++-+|.+.+=+ T Consensus 89 ~w~I~p~~~~~~~~~~~a~~v~~~l~~~----~~~~~~i~~~ld-A~~~G~s~~Ei------------------------ 139 (526) T protein:vir:79 89 DWAVEPPRNASAAEKADADYLHELLLDL----EGLEDLLLDALD-GIGHGYSCIEL------------------------ 139 (526) T ss_pred CceEecCCCCChHHHHHHHHHHHHHhcc----cCHHHHHHHHHh-hhhhcceeEEE------------------------ Confidence 66663 111 12 33333332 258888888775 77777554332 Q ss_pred hcchhhhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeec Q lcl|NC_019408. 144 LDWDEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKL 223 (612) Q Consensus 144 inW~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~ 223 (612) .|+. .+|...+..+..|.. .-|.... . .+ ...|. T Consensus 140 -~w~~---~~g~~~~~~l~~r~~----------~~F~~~~---------~----------------~~----~~l~~--- 173 (526) T protein:vir:79 140 -EWAL---QGREWMPLAFHHRPQ----------SWFQLNP---------E----------------DQ----NELRL--- 173 (526) T ss_pred -EEee---cCCceeEEEeeeecc----------cceEecc---------C----------------CC----cEEEe--- Confidence 2532 244333333322210 0000000 0 00 00000 Q ss_pred cccccccccccceeEEEEEeeCCCceecceeeeccCCccccceeEEE-eecCCCCCCcCcCchHHHHHHHH-HHHhhhHH Q lcl|NC_019408. 224 EEIEWPSGEVKLAYVQYLYEEDPESRPIARIVPTVRGEPLDFIPFKF-FGASGNTADVEKPPLLDICDLNL-SHYRTYAE 301 (612) Q Consensus 224 ~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~-~~~~~~~~~~~~pPLldLA~lnl-~HY~~~sD 301 (612) ..+ ...|.+|+.-=|++ .+....+.-.+.+.|..++..-+ ++| ...+ T Consensus 174 -------------------~~~-----------~~~g~~l~~~k~iv~~~~~~~g~p~g~gLlr~~~w~~~fK~~-~~~~ 222 (526) T protein:vir:79 174 -------------------RDN-----------SPAGEALQPFGWIIHRPRARSGYVARSGLFRVLAWPYLFRHY-ATSD 222 (526) T ss_pred -------------------cCC-----------CCCceeecCCceEEEeecCCcCCccccchHHHHHHHHHHHHh-hHHH Confidence 000 01122222111222 22233333445555555555543 455 4456 Q ss_pred HHHHHHHhccceeeee---cCCCCCCce-----EEEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHH--HH Q lcl|NC_019408. 
302 LEYGRLFTALPVYYAP---GTDSEGTGE-----YHIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAA--IG 371 (612) Q Consensus 302 ~~~~l~~~~~P~l~i~---G~~~~~~~~-----l~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~--lG 371 (612) .-.-+..-|+|+++.. |.+++..+. ..||++++..+|.|..+.|++..+.+......-++-...+|.. +| T Consensus 223 w~~F~E~yG~P~~igky~~~a~~~ek~~L~~av~~i~~da~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~iLG 302 (526) T protein:vir:79 223 LAEMLEIYGLPIRLGKYPPGTADEEKATLLRAVTGLGHAAAGIIPETMAIDFQQAAQGSSEPFLAMMRQSEDAISKAVLG 302 (526) T ss_pred HHHHHHHcCCceEEEecCCCCCHHHHHHHHHHHHHHhcCcEEEecCCceeEEeecCCCCHHHHHHHHHHHHHHHHHHHhh Confidence 6677777799999885 222221111 3489999999999999999998776666666666766676643 56 Q ss_pred HHhhhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHcCCcCCCCc-ceEEEeeccccccCCCHHH Q lcl|NC_019408. 372 GRMMPGASKSVSESNNQTVLREANEQSLLLNIIQACESGMT-DVVRWWLMWRDVPLADTE-NLRYEVNTDFLSTPIGARE 449 (612) Q Consensus 372 a~ll~~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~-~~l~~~a~w~g~~~~~~~-~~~v~ln~dF~~~~~d~~~ 449 (612) ..|-..+......|--.......-..-++.+-+..+++.++ +++.+++.|-+-...+.. -.+|.+... ...++ ... T Consensus 303 qtlTs~~~~g~~gS~a~g~vh~~v~~di~~aDa~~i~~tln~~Li~~l~~~N~~~~~~~~~~p~~~~~~~-e~eDl-~~~ 380 (526) T protein:vir:79 303 GTLTSTTSQSGGGAFALGQVHNEVRHDILASDARQLAATLSRDLLWPLLVLNRPGSPDVRRAPRLVFDLR-EQADI-TSM 380 (526) T ss_pred hhhccccccCcchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcCCccccceEEeCCC-CcccH-HHH Confidence 54432111111122223444555566678889999999997 488999998643221111 123433210 11111 223 Q ss_pred HHHHHHHHHcCC-CCHHHHHHHHHhcCccchhhhhHHHHHHhhccccccccchhHHh-----hhhhhHHHHhHHHHHHH- Q lcl|NC_019408. 450 MRAIQLMANDGL-LPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFINNPDAQAR-----QRGYTNRGQELEQSRMA- 522 (612) Q Consensus 450 ~~al~~~~~~G~-is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~-----~~~e~~r~~~~e~~r~~- 522 (612) +..+..+...|. |+.+.+.+ +.|+..+. +.+..........+....+..... 
..+....+...++--.+ T Consensus 381 a~~~~~L~~~G~~i~~~~i~e---~~gip~~~-~~e~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~l~~~ 456 (526) T protein:vir:79 381 AQSIPALVNVGLEIPSAWVYD---KLGIPQPA-KNEPVLRPAAQPAILSRQHGQRVAALATIVGPRYGDQQALDKALADL 456 (526) T ss_pred HHHHHHHHhCCCcCCHHHHHH---HhCCCCCC-CchhhccccCCccccccccccccccccccccccCchhhHHHHHHHHH Confidence 555667788887 77654433 45764332 112221111111111000000000 00000001011000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhccccCCCchhhc Q lcl|NC_019408. 523 READFTQQKIDIQERSVAVQEGHAEVAHAAGSTSISGSRKLGDPE-QAKPAVADQATIDNAKKQTANAAKVAAQPPAPAA 601 (612) Q Consensus 523 ~e~e~~~q~~e~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~e~-q~k~~~~eq~~~~~~~k~~~~~a~~~~~~~~~~~ 601 (612) ...+.+..-..-- .+-...-+....=++.+.+..+-- ..-..+.++ .-+.-=..+.-.-+.....+..- T Consensus 457 ~~~~~~~~~~~~~--------~~i~~~~~~~~s~ee~~~~L~~l~~~ld~~~l~~--~l~~a~~~A~l~Gr~~~~~e~~~ 526 (526) T protein:vir:79 457 PAKDMQNQANDLL--------APLLDAVNRGDSETELLGALAEAFPDMDDSALTD--ALHRLLFAADTWGRLHGNLDRID 526 (526) T ss_pred HHHHHHHHHHHHH--------HHHHHHHHhcCCHHHHHHHHHHHhccCCHHHHHH--HHHHHHHHHHHhhhhhhhhcccC Confidence 0000000000000 000000000000011111110000 000000000 00000000000000000000000 No 153 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=77.95 E-value=0.12 Score=25.64 Aligned_cols=357 Identities=12% Similarity=0.023 Sum_probs=147.4 Q ss_pred HHHHHhcChHHHHhccc---c-----c---------CCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceee Q lcl|NC_019408. 16 KLRDVMAGQREIKRKAE---A-----Y---------LPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVK 78 (612) Q Consensus 16 ~i~d~~~G~~~vr~~g~---~-----Y---------LPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~ 78 (612) |+.-+. ..++.... . + ++...+.+...|.. ..|.-.+.+...|+.+++.|-.-|..+. 
T Consensus 1 m~m~~f---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~--~~al~~~~v~~~i~~ia~~ia~lp~~~~ 75 (392) T protein:vir:39 1 MILPIL---NFINQTNDPPEVGSVQSYFPDGNDAQIMESLLGDNNEWVSA--RAALRNSDLFSIILQLSSDLAIVKINAE 75 (392) T ss_pred Ccchhh---hhhhcccccccccccccccccCchhhhhhhhcCCCCceech--HHhhccHHHHHHHHHHHHhhccCceeec Confidence 111111 11111000 0 0 00000000011111 1111234455566666666666665553 Q ss_pred cCCHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCc-eEEEechhhhhcchhhhccCCccc Q lcl|NC_019408. 79 NLPPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATS-FAVGYSAENILDWDEVVDMGGFYV 157 (612) Q Consensus 79 ~~p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rP-y~~~~~ae~IinW~~~~~v~g~~~ 157 (612) . .....|+..-+ ...+-.+|.+.++...+.+|-+++++..... ++| -++.+.|..|- T Consensus 76 ~--~~~~~l~~~PN-~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~------g~~~~L~~l~~~~v~------------- 133 (392) T protein:vir:39 76 K--KKNQGIIDNPS-TNANKHGFWQSMFAQLLLGGEAFAYRWRNAN------GADMKWEYLRPSQVN------------- 133 (392) T ss_pred c--chhhhHhhcCC-CCCCHHHHHHHHHHHhhhcCcEEEEEEECCC------CcEEEEEEEcCceeE------------- Confidence 2 22334555544 3688899999999999999999999865321 111 11111111110 Q ss_pred eeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeecccccccccccccee Q lcl|NC_019408. 158 PSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAY 237 (612) Q Consensus 158 Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~ 237 (612) |++ . .+ +| .-.|++.. T Consensus 134 ---~~~-----~-----------------------~~---------------~~---~~~y~~~~--------------- 149 (392) T protein:vir:39 134 ---TYY-----F-----------------------EY---------------EN---GMYYNITF--------------- 149 (392) T ss_pred ---EEE-----c-----------------------CC---------------Cc---eEEEEEEe--------------- Confidence 000 0 00 00 00111110 Q ss_pred EEEEEeeCCCceecceeeeccCCccccceeEEEe-ecCCCCCCcCcCchHHHHHHHHHHHhhhHHH-HHHHHHhccceee Q lcl|NC_019408. 
238 VQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFF-GASGNTADVEKPPLLDICDLNLSHYRTYAEL-EYGRLFTALPVYY 315 (612) Q Consensus 238 ~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~-~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~-~~~l~~~~~P~l~ 315 (612) .++.+... ... +-+.| +++ +....+...|.+|+..+... |..-....++ ...+...+.|-.+ T Consensus 150 ------~~~~~~~~-~~~------~~~ei--ih~~~~~~~~~~~G~s~i~~~~~~-i~~~~~~~~~~~~~f~ng~~p~gi 213 (392) T protein:vir:39 150 ------DDPKIEPI-LQA------PQSDL--IHMKLLSIDGGKTGISPLYSLRRE-SKIQRASDRLTISSLNSSLNVPGV 213 (392) T ss_pred ------cCccccee-EEE------ccccE--EEecCCCCCCccccccHHHHHHHH-HHHHHHHHHHHHHHHhccCCCceE Confidence 00000000 000 01112 211 22233445688888655442 2222222333 3344556778777 Q ss_pred ee--cCCCCCCce--------E--EEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHH-HH--Hhhhcccc Q lcl|NC_019408. 316 AP--GTDSEGTGE--------Y--HIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAI-GG--RMMPGASK 380 (612) Q Consensus 316 i~--G~~~~~~~~--------l--~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~l-Ga--~ll~~~~~ 380 (612) ++ |. ....+. + .-.++.++.||.|.++.=+..+..-+. ..+..+-..+++.++ |. .++-. .. T Consensus 214 l~~~~~-~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~~-~~e~~~~~~~~Ia~~fgVpp~~lg~-~~ 290 (392) T protein:vir:39 214 LTVKGG-GLLSDKDKASRSRSFMKRSRSGGPVVLDDLEEFTALEIKSNVAQ-LLSQTDWTSKQYAKVYGLPDSYIGG-QG 290 (392) T ss_pred EEeCCC-CCchHHHHHHHHHHHhccccCCCeeecCCCceEEEccCChhHHH-HHHHHHHHHHHHHHHhCCCHHHhCC-CC Confidence 64 21 111110 1 123334566777755554544433322 244455555566544 42 22211 11 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCcceEEEeeccccccCCCHHHHHHHHHHHHcC Q lcl|NC_019408. 381 SVSESNNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLADTENLRYEVNTDFLSTPIGAREMRAIQLMANDG 460 (612) Q Consensus 381 ~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G 460 (612) ..+ +..+.... --...|.-++..++++++..|- . .+.+.+..-|.... 
......+..++.+| T Consensus 291 ~~~-~~~~~~~~--f~~~~l~P~~~~ie~~l~~~L~-------~------~~~~d~~~~~~~d~--~~~~~~~~~l~~~g 352 (392) T protein:vir:39 291 DQQ-SSIQQISG--MYASALNRYLRPAISELEYKLS-------D------HISVNMRPAIDPLG--DNYLSTISTATRWG 352 (392) T ss_pred Ccc-cHHHHHHH--HHHHHHHHHHHHHHHHHHHhcc-------c------cccccchhhhccCH--HHHHHHHHHHHhCC Confidence 111 11111111 1234466677777777765441 1 12222222222111 22355677889999 Q ss_pred CCCHHHHHHHHHhcCccchhhhhHHHHHHhhccccccccchh Q lcl|NC_019408. 461 LLPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFINNPDA 502 (612) Q Consensus 461 ~is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~ 502 (612) .+|...+++.+.+.|+.+.++-..+-..-+..-+ .+.|.+ T Consensus 353 ~~t~nE~r~~l~~~g~~p~e~r~~e~l~~~~~Gd--~~~p~p 392 (392) T protein:vir:39 353 ALAENQATFVLQEAGYIPKDLPAPENTNKKTTGQ--SNEPVP 392 (392) T ss_pred CcCHHHHHHHHHhcCCCccccchhcCCCCCCCCC--CCCCCC Confidence 9999999999999998754432111111111111 111111 No 154 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=77.95 E-value=0.12 Score=25.64 Aligned_cols=357 Identities=12% Similarity=0.023 Sum_probs=147.4 Q ss_pred HHHHHhcChHHHHhccc---c-----c---------CCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceee Q lcl|NC_019408. 16 KLRDVMAGQREIKRKAE---A-----Y---------LPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVK 78 (612) Q Consensus 16 ~i~d~~~G~~~vr~~g~---~-----Y---------LPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~ 78 (612) |+.-+. ..++.... . + ++...+.+...|.. ..|.-.+.+...|+.+++.|-.-|..+. 
T Consensus 1 m~m~~f---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~--~~al~~~~v~~~i~~ia~~ia~lp~~~~ 75 (392) T protein:vir:10 1 MILPIL---NFINQTNDPPEVGSVQSYFPDGNDAQIMESLLGDNNEWVSA--RAALRNSDLFSIILQLSSDLAIVKINAE 75 (392) T ss_pred Ccchhh---hhhhcccccccccccccccccCchhhhhhhhcCCCCceech--HHhhccHHHHHHHHHHHHhhccCceeec Confidence 111111 11111000 0 0 00000000011111 1111234455566666666666665553 Q ss_pred cCCHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCc-eEEEechhhhhcchhhhccCCccc Q lcl|NC_019408. 79 NLPPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATS-FAVGYSAENILDWDEVVDMGGFYV 157 (612) Q Consensus 79 ~~p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rP-y~~~~~ae~IinW~~~~~v~g~~~ 157 (612) . .....|+..-+ ...+-.+|.+.++...+.+|-+++++..... ++| -++.+.|..|- T Consensus 76 ~--~~~~~l~~~PN-~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~------g~~~~L~~l~~~~v~------------- 133 (392) T protein:vir:10 76 K--KKNQGIIDNPS-TNANKHGFWQSMFAQLLLGGEAFAYRWRNAN------GADMKWEYLRPSQVN------------- 133 (392) T ss_pred c--chhhhHhhcCC-CCCCHHHHHHHHHHHhhhcCcEEEEEEECCC------CcEEEEEEEcCceeE------------- Confidence 2 22334555544 3688899999999999999999999865321 111 11111111110 Q ss_pred eeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeecccccccccccccee Q lcl|NC_019408. 158 PSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAY 237 (612) Q Consensus 158 Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~ 237 (612) |++ . .+ +| .-.|++.. T Consensus 134 ---~~~-----~-----------------------~~---------------~~---~~~y~~~~--------------- 149 (392) T protein:vir:10 134 ---TYY-----F-----------------------EY---------------EN---GMYYNITF--------------- 149 (392) T ss_pred ---EEE-----c-----------------------CC---------------Cc---eEEEEEEe--------------- Confidence 000 0 00 00 00111110 Q ss_pred EEEEEeeCCCceecceeeeccCCccccceeEEEe-ecCCCCCCcCcCchHHHHHHHHHHHhhhHHH-HHHHHHhccceee Q lcl|NC_019408. 
238 VQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFF-GASGNTADVEKPPLLDICDLNLSHYRTYAEL-EYGRLFTALPVYY 315 (612) Q Consensus 238 ~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~-~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~-~~~l~~~~~P~l~ 315 (612) .++.+... ... +-+.| +++ +....+...|.+|+..+... |..-....++ ...+...+.|-.+ T Consensus 150 ------~~~~~~~~-~~~------~~~ei--ih~~~~~~~~~~~G~s~i~~~~~~-i~~~~~~~~~~~~~f~ng~~p~gi 213 (392) T protein:vir:10 150 ------DDPKIEPI-LQA------PQSDL--IHMKLLSIDGGKTGISPLYSLRRE-SKIQRASDRLTISSLNSSLNVPGV 213 (392) T ss_pred ------cCccccee-EEE------ccccE--EEecCCCCCCccccccHHHHHHHH-HHHHHHHHHHHHHHHhccCCCceE Confidence 00000000 000 01112 211 22233445688888655442 2222222333 3344556778777 Q ss_pred ee--cCCCCCCce--------E--EEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHH-HH--Hhhhcccc Q lcl|NC_019408. 316 AP--GTDSEGTGE--------Y--HIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAI-GG--RMMPGASK 380 (612) Q Consensus 316 i~--G~~~~~~~~--------l--~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~l-Ga--~ll~~~~~ 380 (612) ++ |. ....+. + .-.++.++.||.|.++.=+..+..-+. ..+..+-..+++.++ |. .++-. .. T Consensus 214 l~~~~~-~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~~-~~e~~~~~~~~Ia~~fgVpp~~lg~-~~ 290 (392) T protein:vir:10 214 LTVKGG-GLLSDKDKASRSRSFMKRSRSGGPVVLDDLEEFTALEIKSNVAQ-LLSQTDWTSKQYAKVYGLPDSYIGG-QG 290 (392) T ss_pred EEeCCC-CCchHHHHHHHHHHHhccccCCCeeecCCCceEEEccCChhHHH-HHHHHHHHHHHHHHHhCCCHHHhCC-CC Confidence 64 21 111110 1 123334566777755554544433322 244455555566544 42 22211 11 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCcceEEEeeccccccCCCHHHHHHHHHHHHcC Q lcl|NC_019408. 381 SVSESNNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLADTENLRYEVNTDFLSTPIGAREMRAIQLMANDG 460 (612) Q Consensus 381 ~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G 460 (612) ..+ +..+.... --...|.-++..++++++..|- . .+.+.+..-|.... 
......+..++.+| T Consensus 291 ~~~-~~~~~~~~--f~~~~l~P~~~~ie~~l~~~L~-------~------~~~~d~~~~~~~d~--~~~~~~~~~l~~~g 352 (392) T protein:vir:10 291 DQQ-SSIQQISG--MYASALNRYLRPAISELEYKLS-------D------HISVNMRPAIDPLG--DNYLSTISTATRWG 352 (392) T ss_pred Ccc-cHHHHHHH--HHHHHHHHHHHHHHHHHHHhcc-------c------cccccchhhhccCH--HHHHHHHHHHHhCC Confidence 111 11111111 1234466677777777765441 1 12222222222111 22355677889999 Q ss_pred CCCHHHHHHHHHhcCccchhhhhHHHHHHhhccccccccchh Q lcl|NC_019408. 461 LLPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFINNPDA 502 (612) Q Consensus 461 ~is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~ 502 (612) .+|...+++.+.+.|+.+.++-..+-..-+..-+ .+.|.+ T Consensus 353 ~~t~nE~r~~l~~~g~~p~e~r~~e~l~~~~~Gd--~~~p~p 392 (392) T protein:vir:10 353 ALAENQATFVLQEAGYIPKDLPAPENTNKKTTGQ--SNEPVP 392 (392) T ss_pred CcCHHHHHHHHHhcCCCccccchhcCCCCCCCCC--CCCCCC Confidence 9999999999999998754432111111111111 111111 No 155 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=74.68 E-value=0.16 Score=25.02 Aligned_cols=360 Identities=11% Similarity=-0.007 Sum_probs=149.8 Q ss_pred HHHHHHHHhcChHHHHhcc-----cccCCCCCCCCHHHHHH----------HHhhccCCchHHHHHHHhhchhhcCCcee Q lcl|NC_019408. 13 EWTKLRDVMAGQREIKRKA-----EAYLPAMKGADGDDYAI----------YLQRATFFNMLAQTRDGMTGMVFRRDPIV 77 (612) Q Consensus 13 ~W~~i~d~~~G~~~vr~~g-----~~YLPk~~~e~~~~Y~~----------rl~rA~~~n~~~~tv~~~~G~vf~k~p~~ 77 (612) .|--+-+ .++... ..+-+-...-++...-. 
.-..|.-.+.+...|+.+++.|-.-|..+ T Consensus 1 m~m~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~v~~ia~~ia~lp~~~ 74 (392) T protein:vir:74 1 MILPILN------FINQTNDPPEAGSVQSYFPDGNDAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIVKINA 74 (392) T ss_pred Ccchhhh------hhhcccCcccccccccccccCchhhhhhhccCCCCcccchhhhhcchHHHHHHHHHHHhhccCceee Confidence 1111111 111110 00000000000100000 11122223445555666666665555555 Q ss_pred ecCCHHHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCc-eEEEechhhhhcchhhhccCCcc Q lcl|NC_019408. 78 KNLPPKFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATS-FAVGYSAENILDWDEVVDMGGFY 156 (612) Q Consensus 78 ~~~p~~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rP-y~~~~~ae~IinW~~~~~v~g~~ 156 (612) .. .....|++..+- ..+-.+|.+.++...+.+|-+++++..... ++| -+..+.|..|- T Consensus 75 ~~--~~~~~l~~~PN~-~~t~~~f~~~~~~~lll~Gna~~~i~r~~~------G~~~~L~~i~~~~v~------------ 133 (392) T protein:vir:74 75 EK--KKNQGIIDNPST-NANKHGFWQSMFAQLLLGGEAFAYRWRNAN------GADMKWEYLRPSQVN------------ 133 (392) T ss_pred cc--chhhhhhhhcCC-CCCHHHHHHHHHHHhhhcCCEEEEEEECCC------CcEEEEEEEcCceeE------------ Confidence 22 122345555443 588899999999999999999999864321 111 11111111110 Q ss_pred ceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccce Q lcl|NC_019408. 157 VPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLA 236 (612) Q Consensus 157 ~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~ 236 (612) |++ ..+ +| .-.|++... T Consensus 134 ----v~~----------------------------~~~---------------~~---~~~y~~~~~------------- 150 (392) T protein:vir:74 134 ----TYY----------------------------FEY---------------EN---GMYYNITFD------------- 150 (392) T ss_pred ----EEE----------------------------cCC---------------Cc---eEEEEEEec------------- Confidence 000 000 00 011221100 Q ss_pred eEEEEEeeCCCceecceeeeccCCccccceeEEEe-ecCCCCCCcCcCchHHHHHHHHHHHhhhHHH-HHHHHHhcccee Q lcl|NC_019408. 
237 YVQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFF-GASGNTADVEKPPLLDICDLNLSHYRTYAEL-EYGRLFTALPVY 314 (612) Q Consensus 237 ~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~-~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~-~~~l~~~~~P~l 314 (612) ++.......+ + -+.| +++ +...++...|.+|+..+... |..-.....+ ...+...+.|-. T Consensus 151 --------~~~~~~~~~~-~------~~ev--ih~~~~~~~~~~~G~s~i~~~~~~-i~~~~~~~~~~~~~f~ng~~p~~ 212 (392) T protein:vir:74 151 --------DPKIEPILQA-P------QSDL--IHMKLLSIDGGKTGISPLYSLRRE-SKIQRASDRLTISSLNSSLNVPG 212 (392) T ss_pred --------CCccceeEEE-c------CccE--EEecCCCCCCccccccHHHHHHHH-HHHHHHHHHHHHHHHhccCCCce Confidence 0000000000 0 0111 211 22223445678888665543 3332333333 334556677777 Q ss_pred eeecC-CCCCCce--------EE--EeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHH-HH--Hhhhcccc Q lcl|NC_019408. 315 YAPGT-DSEGTGE--------YH--IGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAI-GG--RMMPGASK 380 (612) Q Consensus 315 ~i~G~-~~~~~~~--------l~--iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~l-Ga--~ll~~~~~ 380 (612) +++=- +....+. +. -.++..+.|+.|.++.=+..+..-.. ..+..+-...++.++ |. .++-..+ T Consensus 213 il~~~~~~~~~~~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~q-~~e~~~~~~~~Ia~~fgVPp~~lg~~~- 290 (392) T protein:vir:74 213 VLTVKGGGLLSDKDKASRSRSFMKRSRSGGPVVLDDLEEFTALEIKSNVAQ-LLSQTDWTSKQYAKVYGLPDSYIGGQG- 290 (392) T ss_pred EEEeCCCCCchHHHHHHHHHHHhccccCCCeeecCCCceEEEccCChhHHH-HHHHHHHHHHHHHHHhCCCHHHhCCCC- Confidence 76511 1111111 11 12334566777766555554444322 244455555566543 32 2221111 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCcceEEEeeccccccCCCHHHHHHHHHHHHcC Q lcl|NC_019408. 381 SVSESNNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLADTENLRYEVNTDFLSTPIGAREMRAIQLMANDG 460 (612) Q Consensus 381 ~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G 460 (612) ..+ +..+.... --...|.-++..++++++..|- . .+.+.+..-|... 
.......+-.++.+| T Consensus 291 ~~~-~~~e~~~~--~~~~~l~p~~~~ie~~l~~~l~-------~------~~~~~~~~~~~~d--~~~~~~~~~~l~~~g 352 (392) T protein:vir:74 291 DQQ-SSIQQISG--MYASALNRYLRPAISELEYKLS-------D------HISVNMRPAIDPL--GDNYLSTISTATRWG 352 (392) T ss_pred Ccc-cHHHHHHH--HHHHHHHHHHHHHHHHHHHhcc-------c------hhcccchhhhcCC--HHHHHHHHHHHHhCC Confidence 111 11111111 1234466677777777766541 1 1222222222221 123456677899999 Q ss_pred CCCHHHHHHHHHhcCccchhhhhHHHHHHhhccccccccchh Q lcl|NC_019408. 461 LLPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFINNPDA 502 (612) Q Consensus 461 ~is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~ 502 (612) .+|....++.+.+.|+.+.+.-..+-...+..-+ .+.|.+ T Consensus 353 ~~t~near~~~~~~g~~pne~r~~enl~~~~~Gd--~~~p~p 392 (392) T protein:vir:74 353 ALAENQATFVLQEAGYIPKDLPAPENTNKKTTGQ--SNEPVP 392 (392) T ss_pred CcCHHHHHHHHHhCCCCccccchhcCCCCCCCCC--CCCCCC Confidence 9999999999999898754432111111111111 111212 No 156 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=72.31 E-value=0.18 Score=24.61 Aligned_cols=151 Identities=8% Similarity=0.010 Sum_probs=21.4 Q ss_pred cCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCccchhhhhHHHHHHhhccccccccchhHHhhhhhhHHHHhHHHHHHH Q lcl|NC_019408. 443 TPIGAREMRAIQLMANDGLLPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFINNPDAQARQRGYTNRGQELEQSRMA 522 (612) Q Consensus 443 ~~~d~~~~~al~~~~~~G~is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~e~~r~~ 522 (612) -.++-+.+.. .+........+ .++..... ..+.+++..+. .............+..++.|+. 
T Consensus 1 ~~~~~~~~~~--------e~~~~e~a~~~--~~~~~~~k--~~e~~~~~ke~------~~~~l~~~~e~~~k~~~E~~~~ 62 (458) T protein:vir:10 1 MTIDINKLKE--------ELGLGDLAKSL--EGLTAAQK--AQEAERMRKEQ------EEKELARMNDLVSKAVGEDRKR 62 (458) T ss_pred Cccchhhhhh--------hhchhhHHHHH--HHHHHHHH--HHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHH Confidence 2222222111 11111111111 11111111 11111111110 0000000000001111111111 Q ss_pred HHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHH----------HHHHHHHHHHHHHHHHHH-HHH--HHHHHHH-HHH Q lcl|NC_019408. 523 READFT-QQKIDIQERSVAVQEGHAEVAHAAGSTS----------ISGSRKLGDPEQAKPAVA-DQA--TIDNAKK-QTA 587 (612) Q Consensus 523 ~e~e~~-~q~~e~~~r~~~~~~~r~~~e~~~~~~~----------~~~~r~~~~e~q~k~~~~-eq~--~~~~~~k-~~~ 587 (612) .+...+ .+...++.++.. +......++...... +...++............ ... ..+.+.+ ... T Consensus 63 le~~~ee~k~l~ee~~~~~-~~~a~~~e~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~ 141 (458) T protein:vir:10 63 LEEALELVKSLDEKSKKSN-ELFAQTVEKQQETIVGLQDEIKSLLTAREGRSFVGDSVAKALYGTQENFEDEVEKLVLLS 141 (458) T ss_pred HHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhccchhhhhhHHHHHHHHHHHH Confidence 111111 111111111100 000000000000000 000000000000000000 000 0111111 111 Q ss_pred hhccccCCCchhhcCCCCCcccCCC Q lcl|NC_019408. 588 NAAKVAAQPPAPAAPGAPPTNRRPT 612 (612) Q Consensus 588 ~~a~~~~~~~~~~~~~~~~~~~~~~ 612 (612) ....+............+..+..-+ T Consensus 142 ~~~~~~~~~~~~~~~~~~a~~~~~~ 166 (458) T protein:vir:10 142 YVMEKGVFETEHGQRHLKAVNQSSS 166 (458) T ss_pred HHHhhccchhhhhhhhhhhhhhccc Confidence 1122222222222222232222222 No 157 >protein:vir:95254 Length: 488 # NCBI annotation: Phage conserved protein # Family: family:all:2372 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944885;genbank:gi:158267601;genbank:GeneID:2744039 Probab=70.32 E-value=0.21 Score=24.29 Aligned_cols=431 Identities=11% Similarity=0.003 Sum_probs=165.0 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCC-----CCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCc Q lcl|NC_019408. 
1 MVTHPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMK-----GADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDP 75 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~-----~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p 75 (612) |-..=+-..-++--++-..+.. +-+..+..|+..+. +..-+-|+.-+. -.++...++.....|...+. T Consensus 1 ~~~~~~~~~gl~p~rl~~i~~~---~~~~~~~~~~~~~~~~Lr~~~~~~ly~~m~~----D~hi~s~l~~Rk~av~~~~w 73 (488) T protein:vir:95 1 MADITETQESLPPFRMGEVGSL---GLKVKNGRIYEEPRQALRFPESIKTFQLMMR----DPAVAASVNIIKMFVRKVNW 73 (488) T ss_pred CCCccccCCCCCHHHHHHHHHH---hhccccchhhccchhhhcccchHHHHHHHhh----ChHHHHHHHHHHHHHhcCCc Confidence 1000000000000011111111 11222223443221 223456766553 46777777777777777777 Q ss_pred eee---cCCHH-----HHHHHhccCCCC-CCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhcc Q lcl|NC_019408. 76 IVK---NLPPK-----FKDAVRRFAKDG-SSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILDW 146 (612) Q Consensus 76 ~~~---~~p~~-----l~~~~~d~D~~G-~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW 146 (612) .|+ ..++. ...++..+-... .++..++..++ .++-+|.+.+=+-|-.... ...++.| T Consensus 74 ~v~p~~~~~~d~~~~~~a~~v~~~l~~~~~~~~~~i~~~l-da~~~G~s~~Eivw~~~~~-------------~~~~~~~ 139 (488) T protein:vir:95 74 RFVPPKGKEQDPKMLERADFFNSLMDDMEHDWADFINSVM-SFCTYGFCVNEKVYKKRQG-------------KKGKYQS 139 (488) T ss_pred eEecCCCCchhHHHHHHHHHHHHHHhccCccHHHHHHHHH-Hhhcccceeeeeeeecccc-------------ccccccc Confidence 774 11111 123333332222 46888888887 5888887765443311000 0011111 Q ss_pred hhhhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeee-eccc Q lcl|NC_019408. 147 DEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYREL-KLEE 225 (612) Q Consensus 147 ~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~-~~~~ 225 (612) . ..||+..+.-|..|.... ..-|.+.. ++. .+.+.. .... 
T Consensus 140 ~---~~dg~~~~~~i~~Rpq~~-------~~~f~~d~----------d~~-------------------l~~~~~~~~~~ 180 (488) T protein:vir:95 140 K---FDDGLIGWAKLPIRNQST-------LDKWYFDE----------DFR-------------------RVTGVRQNLRN 180 (488) T ss_pred c---ccCCeeeeeeeeecCccc-------ccceeecc----------CCC-------------------ceeeccccccc Confidence 1 113333333333221100 00000000 000 000000 0000 Q ss_pred cccccccccceeEEEEEeeCCCceecceeeeccCCcccccee---EEE-eecCCCCCCcCcCchHHHHHHHHHHHhhhHH Q lcl|NC_019408. 226 IEWPSGEVKLAYVQYLYEEDPESRPIARIVPTVRGEPLDFIP---FKF-FGASGNTADVEKPPLLDICDLNLSHYRTYAE 301 (612) Q Consensus 226 ~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP---~v~-~~~~~~~~~~~~pPLldLA~lnl~HY~~~sD 301 (612) +....+. .+ +.-...| ..|| |++ .+....+--.+...|..++..-+ |.+.+- T Consensus 181 ~~~~~~~---------------~~----~~~~~~~---~~lP~~kfi~~~~~~~~g~p~g~gLlr~~~w~~~--fK~~~~ 236 (488) T protein:vir:95 181 VSHIAGA---------------IN----LGERPLT---RKLPRAKFMLFKYDDEYGNPEGRSPLLNAYVPWK--YKVQIE 236 (488) T ss_pred ccccccc---------------cc----ccccccc---ccccccceEEEeecCCCCccchhhHHHHHHHHHH--HHHHHH Confidence 0000000 00 0000001 1345 332 22222233334444444443332 223333 Q ss_pred HHHHHHHh--ccceeeeecCC-----CCCCc-------------eEEEeccccccCCCCCceeE---------EecCchh Q lcl|NC_019408. 302 LEYGRLFT--ALPVYYAPGTD-----SEGTG-------------EYHIGPNMVWEVPQGSEPGI---------LEYTGQG 352 (612) Q Consensus 302 ~~~~l~~~--~~P~l~i~G~~-----~~~~~-------------~l~iG~~~~~~lp~~~~~~~---------lE~~g~~ 352 (612) -.+..|.- ++|+|++.|.. ....+ .+..|+.+++.+|.|....+ ++..|.+ T Consensus 237 ~~w~~f~Er~g~g~p~~~~p~~~~~~~~~~e~~~l~~a~~~i~~~~~~~~~ag~iiP~g~~~~~k~~~~e~~l~~~~~~~ 316 (488) T protein:vir:95 237 EYEAVGVSRDLVGMPKIGLPPDYLDENAEPEKKAFVQYCKTVVNDMIANDRAGLIWPRYIDPDTKEDIFEFSLVSRQGAK 316 (488) T ss_pred HHHHHHHHHhcccceeEeeccCCCCCcccHHHHHHHHHHHHHHHHhhccchhheeeccccccccchhhhhhhccccccCC Confidence 34444444 57888877631 11111 12235557778887765443 4555555 Q ss_pred HHHHHHHHHHHHHHH--HHHHHHhhhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHcCCcCCCC Q lcl|NC_019408. 
353 LKALETALNDKERQI--AAIGGRMMPGASKSVSESNNQTVLREANEQSLLLNIIQACESGMT-DVVRWWLMWRDVPLADT 429 (612) Q Consensus 353 l~~~~~~l~~~e~qm--~~lGa~ll~~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~-~~l~~~a~w~g~~~~~~ 429 (612) .......++-...+| ..+|-. |....+. ..|--.......-...++.+-+..++++++ +++.+++.|-.-+ .. T Consensus 317 ~~~~~~li~~~d~~Isk~iLGqt-LT~~~~~-~Gs~Al~~vh~ev~~~i~~aDa~~i~~tln~~li~~l~~~Nfg~--~~ 392 (488) T protein:vir:95 317 AYDTGSIIDRYSKQIMMAFMSDV-LAMGQSK-YGSFSLADSKTSLLAMSVDILLKQIKNVINRDLVAQTYALNMWD--DE 392 (488) T ss_pred chhHHHHHHHHHHHHHHHHhccc-cccccCc-chhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC--CC Confidence 444455555555555 334543 3322211 122224455555566778888999999997 5888888875311 12 Q ss_pred cceEEEeeccccccCCC-HHHHHHHHHHHHcCC-CCHHHHHHHHH-hcCccchhhhhHHHHHHhhccccccccchhHHhh Q lcl|NC_019408. 430 ENLRYEVNTDFLSTPIG-AREMRAIQLMANDGL-LPDPVFYEYMR-KAEVISSDMTFEEFQALRADENSFINNPDAQARQ 506 (612) Q Consensus 430 ~~~~v~ln~dF~~~~~d-~~~~~al~~~~~~G~-is~et~~~~lq-r~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~ 506 (612) .-.+|.+.. ....| .....++-.+...|. ++...+.++++ +.||..+..+ +........+.+. ..+...... T Consensus 393 ~~P~~~~~~---~e~~Dl~~~ae~~~~L~~~G~~i~~~~~~~~i~e~~gip~~~~~-e~~~~~~~~~~~~-~~~~~~~~~ 467 (488) T protein:vir:95 393 EHVQITYDD---IETPDLEAIGSYIQKTVAVGALEVDKELSNKLREHIGLPPADES-QPVSEKLSPNSQS-RSGDGYKTA 467 (488) T ss_pred CccEEEecC---cChhhHHHHHHHHHHHHhCCCccccHHHHHHHHHHhCCCCCCCC-ccccccCCCCCCC-CCCcccCCC Confidence 223444321 11112 122345666778887 45443444444 5677544322 2111111111000 000000000 Q ss_pred hhhhHHHHhHH---HHHHHHH Q lcl|NC_019408. 
507 RGYTNRGQELE---QSRMARE 524 (612) Q Consensus 507 ~~e~~r~~~~e---~~r~~~e 524 (612) .....+....+ ..+++.+ T Consensus 468 ~~~~~~~~~~~~~~~a~~~~~ 488 (488) T protein:vir:95 468 GEGTAKTPSAKDPSTANKANK 488 (488) T ss_pred cccCCcccccccchhhhhccC Confidence 00000000000 0000000 No 158 >protein:vir:3139 Length: 599 # NCBI annotation: hypothetical protein # Family: family:all:1548 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640321;genbank:gi:21234402;genbank:GeneID:956054 Probab=68.76 E-value=0.23 Score=24.06 Aligned_cols=517 Identities=14% Similarity=0.111 Sum_probs=186.6 Q ss_pred CC-----CcHHHHHHHHHHHHHHHHhcC-hHHHHhcccccCCCCC----CCCHHHHHHHHhhccCCchHHHHHHHhhchh Q lcl|NC_019408. 1 MV-----THPEYQYWRPEWTKLRDVMAG-QREIKRKAEAYLPAMK----GADGDDYAIYLQRATFFNMLAQTRDGMTGMV 70 (612) Q Consensus 1 ~~-----~hP~y~~~~~~W~~i~d~~~G-~~~vr~~g~~YLPk~~----~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~v 70 (612) ++ +|---......|+...|.-.= ...|++.. .|+-.-. +-++..+ +.++..|.+-..+..+.... T Consensus 11 ~~~~~~~~~~~~~~v~~~~~~~~~~r~~~~~~w~e~~-~yi~~~~tr~t~~~~~~w----~~s~t~~k~~~~~~~l~a~~ 85 (599) T protein:vir:31 11 MLEGRDDDRAFIDELVVLFTNMENARAQKDREDKELM-DYIDATDTRKTSNSKLPF----KNSTTINKLAHLHLMITTSY 85 (599) T ss_pred HhhccCchHHHHHHHHHHHHhhhhhhhhhhcccHHHH-HHHhhhcccccccCCCCc----ccccchHHHHHHHHHHHHHH Confidence 22 344334556667766664321 22232221 2321111 0111111 23445555555555544433 Q ss_pred ----hcCCceee---cCCH--------HHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEec----Ccchhhhh-- Q lcl|NC_019408. 71 ----FRRDPIVK---NLPP--------KFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDV----VDNPRKGA-- 129 (612) Q Consensus 71 ----f~k~p~~~---~~p~--------~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~----p~a~~~~~-- 129 (612) |-..-=++ -.|+ .++.+..|= +.-.++..-...++...+.+|-|..-|++ +..++.+. 
T Consensus 86 ~~~~fp~~~w~d~~~~~~~~~~~~~~~~i~~yi~~K-l~e~~~~~~~~~~v~d~i~~G~~vat~~~er~~~~~~d~~v~~ 164 (599) T protein:vir:31 86 MEHLLPNRNWVDFVGFDNDSVNAEKREIARSYVRGK-VEASNLEGVIERMVDDFAVRGFCVAHTRHVKRMTVTAENQVIK 164 (599) T ss_pred HhhhcCCccceEeeecCCchhHHHHHHHHHHHhhhh-hhhcchHHHHHHHHhhhcccCceeEeeeEEEcceeeccccccc Confidence 33222111 0121 233332221 11234555578888899999999998985 32222211 Q ss_pred -ccCceEEEechhhhhcchhhhccCCccceeEEEEEEEeecccccc-----CCCcccccceeeeeeEeeeccccccccee Q lcl|NC_019408. 130 -VATSFAVGYSAENILDWDEVVDMGGFYVPSRVLLREFVRDLRWKS-----DIEPLTTAQARKARAAALASGSASSPMVR 203 (612) Q Consensus 130 -~~rPy~~~~~ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~~-----~~d~f~~~~~~q~r~l~l~~g~~~~~~~~ 203 (612) -..|-+..++|.+|+ |+.+. +.-...-.++ |.......... ....++.........-..+.+.+-...+. T Consensus 165 ~~~~P~~ervsP~Di~-~Dp~A--~si~d~~fiv-Rs~~Tk~~L~~l~~~~~~~~y~~d~~~~~~~~~~~~~~~~~d~~~ 240 (599) T protein:vir:31 165 NYSGTVTERLSPSDVF-WDVTA--DSLPKAAKCI-RQLYTLGSLKREIEEGTFPLMSMEDFQKLREERRTIREALADGYN 240 (599) T ss_pred ccccceEEeeccccee-eCCCC--CCCCcceeee-ehhhhHHHHHHHhccCCccccchHHHHHHHhhccCCCccccchhh Confidence 246889999998876 43322 2222222122 32210000000 00000100000000000000000000000 Q ss_pred eccccc----ccccceeeeeeeeccccccccccccceeEEEEEeeCCCceecce--------eeeccCC--ccccceeEE Q lcl|NC_019408. 204 QTARTL----GGYSYITVYRELKLEEIEWPSGEVKLAYVQYLYEEDPESRPIAR--------IVPTVRG--EPLDFIPFK 269 (612) Q Consensus 204 ~~~~~~----~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~--------~~p~~~g--~~l~~IP~v 269 (612) ...... .|++.+..| .-..+++. ..||.-+|++.+..+.... 
++....- .+.+.+||+ T Consensus 241 ~~~g~D~~~~d~~~~~~eY--~~~~~Vev------LeywGd~ydee~d~~~~~~ViTi~g~~~liR~e~np~~~g~~Pyv 312 (599) T protein:vir:31 241 GRRKFDSLHKKGYGSMMNY--INEGVVEV------LTFMGDFYDEENDELWNNYEITVIDRKIIGRKQSKDTWDGSQNLH 312 (599) T ss_pred hhhhccccccccccchhhh--cccchhhh------hhhhhhhhcccCCccccceEEEEecCcEEeecccCCCCCCCCCeE Confidence 000000 111111111 00000000 0111123333222222111 1111122 245668988 Q ss_pred EeecC-CCCCCcCcCchH---HHHH-HHHHHHhhhHHHHHHHHHhccceeeeecCCCCCCceEEEeccccccCCCCCcee Q lcl|NC_019408. 270 FFGAS-GNTADVEKPPLL---DICD-LNLSHYRTYAELEYGRLFTALPVYYAPGTDSEGTGEYHIGPNMVWEVPQGSEPG 344 (612) Q Consensus 270 ~~~~~-~~~~~~~~pPLl---dLA~-lnl~HY~~~sD~~~~l~~~~~P~l~i~G~~~~~~~~l~iG~~~~~~lp~~~~~~ 344 (612) +.+-. ..+...+-.||. ++-+ ||+. +|..-|. +-....|++...|--. ..++.-+|+.+|.+...+++. T Consensus 313 v~~~~P~~~~~yG~G~l~~~~gaQ~~lN~~-~Ng~iD~---~~~~l~p~l~~~~dl~--~eD~~~~P~~v~~~~d~~~vq 386 (599) T protein:vir:31 313 IAVYEFQKDTLCPIGPLHRLTGMQYKLDKR-ENFREDL---HDRFLHPSLKKVGDVR--EKGMRGGPNHVFEVEETGDVQ 386 (599) T ss_pred EEEeeeeccccCCCCCchhcchHHHHHHHH-HHHhhhh---hhhhhccccccccccc--ccCccCCCCcceeecCCCccc Confidence 65422 222233333443 3322 3543 5555443 3344477777766422 234555789999998888888 Q ss_pred EEecCchhHHHHHHHHHHHHHHHHHH-HHHhhhc-cccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHH Q lcl|NC_019408. 345 ILEYTGQGLKALETALNDKERQIAAI-GGRMMPG-ASKSVSESNNQTVLREANEQSLLLNIIQACESGMTD-VVRWWLMW 421 (612) Q Consensus 345 ~lE~~g~~l~~~~~~l~~~e~qm~~l-Ga~ll~~-~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~~-~l~~~a~w 421 (612) |+.|+.+...+ .-.+.-.+..|-.+ |+..... ....+.+||...+.-....+...+......++.+.. 
++.-+-.| T Consensus 387 ~~~p~s~~~~a-~~~is~~e~~mee~sGvp~~~~G~~~ag~~TA~~is~l~naa~~~~~~~vr~~e~~~lepll~~l~e~ 465 (599) T protein:vir:31 387 YMTPPAEVLQP-DNQLSITLQLMEDLSGAPKESIGQRTAGEKTKFEVQLLDQGQNKVFRRKVKKFERELLTPVLNDYLEQ 465 (599) T ss_pred cccCchhhhhH-HHHHHHHHHHHHHhhccchhhcCCcccchhhHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHH Confidence 88888775543 33566677777544 6544321 123445676666666566665666666665555543 33322222 Q ss_pred cCCcCCCCcceEEEeecc-----ccccCCCHHHHHHHHHHHHcCC---CCHHHHHHHHHhcCccchhhhhHHHHHHhhcc Q lcl|NC_019408. 422 RDVPLADTENLRYEVNTD-----FLSTPIGAREMRAIQLMANDGL---LPDPVFYEYMRKAEVISSDMTFEEFQALRADE 493 (612) Q Consensus 422 ~g~~~~~~~~~~v~ln~d-----F~~~~~d~~~~~al~~~~~~G~---is~et~~~~lqr~~vl~~~~~~eee~~ria~e 493 (612) .-.-+.+.+.+++ +|.+ |... +..++..=..++.-|. +.++.+...|+. ++ T Consensus 466 ~~~f~D~~~tiri-~~~e~~~~~f~~i--~redl~~~~~~v~~Ga~~v~ere~~~q~l~~--il---------------- 524 (599) T protein:vir:31 466 GRNHLDASDTIKT-FNSELGTATFLDI--TADDLNLNGQMVAQGATLFAEKANTLQNLNA--IL---------------- 524 (599) T ss_pred HHhhcccccceee-ecccccceeeEEe--ehhhhhCCeeeeechhhHHHHHHHHHHHHHH--Hh---------------- Confidence 2111223333433 2333 2221 2223322222222221 112222111111 11 Q ss_pred ccccccchhHHhhhhhhHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 494 NSFINNPDAQARQRGYTNRGQELEQSRMAREADFTQQKIDIQERSVAVQEGHAEVAHAAGSTSISGSRKLGDPEQAKPAV 573 (612) Q Consensus 494 ~~~~~~~~~~~~~~~e~~r~~~~e~~r~~~e~e~~~q~~e~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~e~q~k~~~ 573 (612) .+..+++ ..+...+++-.. ..+...+ .++-.-++-.++.++++.++.-.|.+.+++..- +--+ T Consensus 525 ~~~~~q~-----~~P~~~~k~l~~--~l~~~~~--l~~~~~~~~~va~~eqq~~~~m~Q~~lq~~~~~--------~~~~ 587 (599) T protein:vir:31 525 GGPLGAA-----LAPHMSRTKLFN--AVEYLGD--LDAYGIFTFGIGVQEDQQLARMAQKSTQQTEET--------ALTQ 587 (599) T ss_pred cccCCCc-----cchhhHHHHHHH--HHHHHHh--ccccccCCCchhHHHHHHHHHHHHHHHHHhHhh--------hhhh Confidence 0111111 112222221111 0000000 000000000111111111111111111111000 0000 Q ss_pred HH-----HHHHH Q lcl|NC_019408. 
574 AD-----QATID 580 (612) Q Consensus 574 ~e-----q~~~~ 580 (612) ++ ..+.. T Consensus 588 ~~~~~~~~~~~~ 599 (599) T protein:vir:31 588 EEVGGPTTDTGQ 599 (599) T ss_pred hhcCCCCcccCC Confidence 00 00000 No 159 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=67.97 E-value=0.24 Score=23.94 Aligned_cols=458 Identities=12% Similarity=0.065 Sum_probs=173.3 Q ss_pred CC--CcHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHH---HHHHHH-hhc--------cCCchHHHHHHHh Q lcl|NC_019408. 1 MV--THPEYQYWRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGD---DYAIYL-QRA--------TFFNMLAQTRDGM 66 (612) Q Consensus 1 ~~--~hP~y~~~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~---~Y~~rl-~rA--------~~~n~~~~tv~~~ 66 (612) +| .-|...............|.|...-+. ....|.+ .-.+. .+..+| .|| +--+++...++.. T Consensus 7 ~i~~~sP~~a~~R~~ar~~~~~y~aa~~~r~--~~~~~~~-~s~~~~i~~~~~~lr~RaRdL~rNn~~a~~av~~~~~nv 83 (548) T protein:vir:95 7 LLEPLAPELVARRLAAREAIQAYEAARPGRT--HKAKRQP-LGADTSLQKSAVSMREQCRKLDEDHDLVTGLLDRLEERV 83 (548) T ss_pred HhhhcchHHHHHHHHhHHHhccccccCcccc--ccccCCC-CChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhc Confidence 11 146666655556666666766543222 2223322 21111 122221 111 2223444455566 Q ss_pred hchhh-cCCceeecCC------------HHHHHHHhccCCCCC-CHHHHHHHHHHHHHHhCCeEEEEecCcchh-hhhcc Q lcl|NC_019408. 67 TGMVF-RRDPIVKNLP------------PKFKDAVRRFAKDGS-SHATFAKAVLSEQAGVGRFGVLVDVVDNPR-KGAVA 131 (612) Q Consensus 67 ~G~vf-~k~p~~~~~p------------~~l~~~~~d~D~~G~-~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~-~~~~~ 131 (612) +|-.+ .-.|+.-... .....|.++||-.|. +++++.+.+++..+..|=|++..-+..... ..... 
T Consensus 84 VG~~G~~i~p~~l~~d~~~a~~l~~~ie~~w~~Wa~~~D~~g~~~f~~lq~l~~R~~~~dGE~f~~~~~~~~~~~~~g~~ 163 (548) T protein:vir:95 84 VGGSGIGVEPLPLRLDGSVHAELAMEIRSAWAEWSLSPETSGELTRPQVERLMCRTWLRDGEGLAQKLMGRVPNYTFATS 163 (548) T ss_pred cCccccceeeeecCCCHHHHHHHHHHHHHHHHHhhcCccccccCCHHHHHHHHHHHHHhCCceEEEeeecccccccCCcc Confidence 77311 1111110111 124567788998876 699999999999999999987765432211 00111 Q ss_pred Cce-EEEechhhhhcchhhhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccc Q lcl|NC_019408. 132 TSF-AVGYSAENILDWDEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLG 210 (612) Q Consensus 132 rPy-~~~~~ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~ 210 (612) -|+ |-+|.|+.|=+ .....+. .+.+- +.....+.++ T Consensus 164 ~~~~lqliepd~l~~--------------------------~~~~~~~----~i~~G----IE~D~~Grp~--------- 200 (548) T protein:vir:95 164 VPFALELLEPDYLPF--------------------------SYNNLSK----GIVQG----IERDTWRRKR--------- 200 (548) T ss_pred cceEEEEechhhcCC--------------------------CCCCCCC----ceeee----eEECCCCceE--------- Confidence 121 23333333311 0000000 00000 0000111111 Q ss_pred cccceeeeeeeeccccccccccccceeEEEEEeeCCCceecceeeeccCCcccccee---EEE-eecCCCCCCcCcCchH Q lcl|NC_019408. 211 GYSYITVYRELKLEEIEWPSGEVKLAYVQYLYEEDPESRPIARIVPTVRGEPLDFIP---FKF-FGASGNTADVEKPPLL 286 (612) Q Consensus 211 g~~~~~~~R~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP---~v~-~~~~~~~~~~~~pPLl 286 (612) -|+ ++...++.+.. ...+..+..|| ++. +...+.+-.-+.|.|. T Consensus 201 ------aY~---------------------i~~~hPgd~~~-----~~~~~~~~rvpA~~VlHif~~~r~gQ~RGvs~la 248 (548) T protein:vir:95 201 ------AYH---------------------LLKDHPGNLQT-----LGGSLAVKRVEAERIIHIAYRKRIGQNRGVPMLH 248 (548) T ss_pred ------EEE---------------------EeecCCCcccc-----cccccceeeechhHheecccccCCccccCcchHH Confidence 111 11111111100 00111233445 222 2233344444667665 Q ss_pred HHHHH--HHHHHhhhHHHHHHHHHhccceeeee-cCCC---------CCCceEEEeccccc-cCCCCCceeEEecCc--h Q lcl|NC_019408. 
287 DICDL--NLSHYRTYAELEYGRLFTALPVYYAP-GTDS---------EGTGEYHIGPNMVW-EVPQGSEPGILEYTG--Q 351 (612) Q Consensus 287 dLA~l--nl~HY~~~sD~~~~l~~~~~P~l~i~-G~~~---------~~~~~l~iG~~~~~-~lp~~~~~~~lE~~g--~ 351 (612) .+... .+..|.. +.+..+. +.+.-..+|+ +... .....+.++++..+ .|++|-+++|+.++- . T Consensus 249 pvl~~l~~l~~y~d-ael~~ak-i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~pG~iv~~L~pGe~i~~~~p~~p~~ 326 (548) T protein:vir:95 249 AVLIRLADLKDYEE-SERVAAR-ISAALAMYIKKGNPDSYTVEPGKDRKNRTIPIAPGMVFDDLEPGEDVGMIESNRPNP 326 (548) T ss_pred HHHHHHHHHhHHHH-HHHHHHH-HhhhheeeeecCCCccccCCCCcccccccccccCCccccccCCCceeeecCCCCCCC Confidence 44221 2333444 2233333 3344445554 2111 11223678899887 589999999999763 3 Q ss_pred hHHHHHHHHHHHHHHHHH-HHH--HhhhccccchhHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHcCC-cC Q lcl|NC_019408. 352 GLKALETALNDKERQIAA-IGG--RMMPGASKSVSESNNQTVLREANEQSLLLN-IIQACESGMTDVVRWWLMWRDV-PL 426 (612) Q Consensus 352 ~l~~~~~~l~~~e~qm~~-lGa--~ll~~~~~~~~esa~~~~~~~~~~~s~L~~-~a~~~~~a~~~~l~~~a~w~g~-~~ 426 (612) .+. ..+..+...|.+ +|. .++.......=.|+.+..+++-.....++. ++..++.-+-.++--+|...|. ++ T Consensus 327 ~~~---~f~~~~lr~IAaglGipYe~ltgD~s~nYSS~R~~l~e~~r~~~~~q~~~i~~~~~Pi~~~wle~a~l~G~i~l 403 (548) T protein:vir:95 327 FLE---GFRNGQLRMIGAGTRSTYSSVSRAYDGTYSAQRQELVEGWLGYDLLQHEFIDYWCRPVYRSWLQMYLLARKERL 403 (548) T ss_pred CHH---HHHHHHHHHHHhhcCCCHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCC Confidence 333 333333333321 221 112221111112344444444444443332 2222222221111111222332 11 Q ss_pred CCCcceEEEeeccccccC---CCHH-HHHHHHHHHHcCCCCHHHHHHHHHhcCccchhhhhHHHHHHhhcccc-----cc Q lcl|NC_019408. 427 ADTENLRYEVNTDFLSTP---IGAR-EMRAIQLMANDGLLPDPVFYEYMRKAEVISSDMTFEEFQALRADENS-----FI 497 (612) Q Consensus 427 ~~~~~~~v~ln~dF~~~~---~d~~-~~~al~~~~~~G~is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~-----~~ 497 (612) +........++-+|.... +|+. ++++.+.++.+|..|.+....+ +|. |++++.+.++.+.. 
.+ T Consensus 404 P~~~~~~~~~~~~W~~P~~~~iDP~Kea~A~~~~i~~Gl~T~~~~~a~---~G~-----D~~ev~~q~a~E~~~~~~~GL 475 (548) T protein:vir:95 404 PADVDHRTLYAAVYQGPVMPWINPMHEANAWELLVKAGFADEAEVARA---RGR-----DPRELKKSRETEIKANRAAGL 475 (548) T ss_pred CCCCCchhheeeeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHH---hCC-----CHHHHHHHHHHHHHHHHHcCC Confidence 111111111222333332 3444 8999999999999999866554 343 44444444433321 00 Q ss_pred -------ccchhHHhhhhhhHHHHhHHH-HHHH---HHHHHHHHHH----------HHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 498 -------NNPDAQARQRGYTNRGQELEQ-SRMA---READFTQQKI----------DIQERSVAVQEGHAEVAHA 551 (612) Q Consensus 498 -------~~~~~~~~~~~e~~r~~~~e~-~r~~---~e~e~~~q~~----------e~~~r~~~~~~~r~~~e~~ 551 (612) ..+........+..+++...- +.+. .|.+-.|--+ +|.+ ..-.+- +-.-... T Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~-~~~~~~~ 548 (548) T protein:vir:95 476 VFSSDAYHQLVKSGMDPVEAVQKVYLGVGKMLTADEARELVNRYGAGLPVPGPDFPNESN-NGGADG-QPSNPDP 548 (548) T ss_pred CCCCcccccccccccCCCCchhhhccccccccccchhHHhhccCCCCCcCCCCCCCcccc-cCCCCC-CCCCCCC Confidence 000001111111111111110 0000 0101000000 0000 000000 0000000 No 160 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=66.27 E-value=0.27 Score=23.70 Aligned_cols=111 Identities=14% Similarity=0.134 Sum_probs=8.9 Q ss_pred hHHHHHHhhccccccccchhHHhhhhhhHHHHhHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 483 FEEFQALRADENSFINNPDAQARQRGYTNRGQELEQSRMAREADFTQQKIDIQERS--VAVQEGHAEVAHAAGSTSISGS 560 (612) Q Consensus 483 ~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~e~~r~~~e~e~~~q~~e~~~r~--~~~~~~r~~~e~~~~~~~~~~~ 560 (612) ..--|..++.+ -.++..+...-++ ...+.+.+..+..++. +..+++.+..++.-.+. 
+.+ T Consensus 1 ~~~~~~~l~~~---------------~~~~~~~l~el~e-~~~~l~k~~~el~~~l~ea~~~ee~~~~ee~i~~l--~~~ 62 (466) T protein:vir:80 1 MALRQLMLAKK---------------IEQRKAALAELLE-QEKALQKRSEELEAAIDEANTDEEIAVVEDEINKL--EGE 62 (466) T ss_pred CchHHHHHHHH---------------HHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHH--HHH Confidence 00000111110 0000000000000 0000000000000000 00000000000000000 000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCchhhcCCCC------------------CcccCCC Q lcl|NC_019408. 561 RKLGDPEQAKPAVADQATIDNAKKQTANAAKVAAQPPAPAAPGAP------------------PTNRRPT 612 (612) Q Consensus 561 r~~~~e~q~k~~~~eq~~~~~~~k~~~~~a~~~~~~~~~~~~~~~------------------~~~~~~~ 612 (612) ..+. +++.+..+.|.++.+.+-++.........+.+........ ...|.-. T Consensus 63 ~~el-~e~~~~l~~ei~~le~el~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 131 (466) T protein:vir:80 63 KTEL-EEKKSKLEGEIKELENELEQLNNKEPKNNSEPAQVSGARTQQFVGGETRMKGFFRNMPYEQRAAL 131 (466) T ss_pred HHHH-HHHHHHHHHHHHHHHHHHHHHHHhhhccCchhHHHHhhhhhHHhhHHHHHHHHHHhhhhhhHHHH Confidence 0000 0011111111111111100000000000000000000000 0000000 No 161 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=63.92 E-value=0.31 Score=23.38 Aligned_cols=104 Identities=8% Similarity=-0.004 Sum_probs=9.8 Q ss_pred HHhhhhhh-HHHHhHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 503 QARQRGYT-NRGQELEQSRMAREADFTQ--QKIDIQERSVAVQEGHAEVAHAAGSTSISGSRKLGDPEQAKPAVADQATI 579 (612) Q Consensus 503 ~~~~~~e~-~r~~~~e~~r~~~e~e~~~--q~~e~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~e~q~k~~~~eq~~~ 579 (612) ..+..++. ....++.++|.+...+.+. .+.+...+.+..+.+.++.+.+-.+...+-.+ .++...+..+++. . 
T Consensus 1 ~~k~~eem~~~i~eL~e~r~~l~~e~~~l~d~ak~e~~~~~~~~e~~e~~a~~~el~~ei~~--le~~~~~~~~~~~--~ 76 (477) T protein:vir:84 1 MEKHLEELRALRAAAVEAVATLKAERQAIADGAKAEERAALSADETAEFRAKSASIKAELDK--VEDLDEQIRELES--E 76 (477) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHH--H Confidence 11110100 1111111112111100000 00000000001111111111110011000000 0000000000000 0 Q ss_pred HHHHHHHHhhccccCCCchhhcCCCCCcccCCC Q lcl|NC_019408. 580 DNAKKQTANAAKVAAQPPAPAAPGAPPTNRRPT 612 (612) Q Consensus 580 ~~~~k~~~~~a~~~~~~~~~~~~~~~~~~~~~~ 612 (612) ...........+..... ...-..++.+++-. T Consensus 77 ~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~ 107 (477) T protein:vir:84 77 IERSGKLEAETKTVRKA--TVEVNEALTYEKGN 107 (477) T ss_pred HHHhhcchhhhhhhccc--ccccccchhhhhhH Confidence 00000000000000000 00000000111000 No 162 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=59.92 E-value=0.38 Score=22.87 Aligned_cols=542 Identities=12% Similarity=0.056 Sum_probs=110.0 Q ss_pred CCCcHHHHHHHHHHH---HHHHHhcChHHHHhcccccCCCCCCCCH-----HHHHHHH--hh----ccCCchHHHHHHHh Q lcl|NC_019408. 1 MVTHPEYQYWRPEWT---KLRDVMAGQREIKRKAEAYLPAMKGADG-----DDYAIYL--QR----ATFFNMLAQTRDGM 66 (612) Q Consensus 1 ~~~hP~y~~~~~~W~---~i~d~~~G~~~vr~~g~~YLPk~~~e~~-----~~Y~~rl--~r----A~~~n~~~~tv~~~ 66 (612) =|+-|...... .|- +++.+++|..- ..|.|..++-.. ..|-+|+ .. -++++++.+.|..= T Consensus 59 ~~~~~~v~~~v-~~~~~~l~~~~~~~~~~-----~~~~p~~~~D~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~dal~~g 132 (705) T protein:vir:88 59 GIVSRDVQETV-DWIMPSLMKVFTSGGQV-----VKYEPDTAEDVEQAEQETEYVNYLFMRKNEGFKVMFDWFQDTLMMK 132 (705) T ss_pred ccccHHHHHHH-HHHHHHHHHhhcCCCce-----EEEeeCChhHHHHHHHHHHHHhHHHhhccchhHHHHHHHHHHhhcC Confidence 12222222111 111 23334444332 235565443222 1233332 11 12233444433222 Q ss_pred hchh---h--cCCcee---ecCCH-HHHHHHhccCCCCC--CHHH--HHHHHHHHHHHhCCeEE-EEe---c-Ccchhhh Q lcl|NC_019408. 
67 TGMV---F--RRDPIV---KNLPP-KFKDAVRRFAKDGS--SHAT--FAKAVLSEQAGVGRFGV-LVD---V-VDNPRKG 128 (612) Q Consensus 67 ~G~v---f--~k~p~~---~~~p~-~l~~~~~d~D~~G~--~l~~--f~~~~~~~~l~~Gr~~v-lVD---~-p~a~~~~ 128 (612) +|.+ | ...+++ +.+|+ .+..+..+-|.... +... ...-.+...-..|+.-+ -|+ | +..+.+. T Consensus 133 ~gi~kv~we~~~~~~~e~~~~~~~~~l~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i~~V~p~d~~~dp~a~~ 212 (705) T protein:vir:88 133 TGVVKVYVEEVLKPTFERFSGLSEDMVADILSDPDTSILAQSVDDDGTYTIKIRKDKKKREIKVLCVKPENFLVDRLATC 212 (705) T ss_pred CeEEEeccccccchhhhhhccCChhhhhhhhhhhhhhcccccccccceeeeEEeeeeecCceeeeeccHHHceecCCCCC Confidence 2211 2 222211 11221 12222222111000 0000 00000000011122211 122 1 0001111 Q ss_pred hccCceEEEe---chhhhhcchhhhccCCccceeEEEEEEEee---ccccccCCCcccccceeeeeeEeeecccccccce Q lcl|NC_019408. 129 AVATSFAVGY---SAENILDWDEVVDMGGFYVPSRVLLREFVR---DLRWKSDIEPLTTAQARKARAAALASGSASSPMV 202 (612) Q Consensus 129 ~~~rPy~~~~---~ae~IinW~~~~~v~g~~~Lt~v~l~E~v~---~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~ 202 (612) ..--+|+++. +-++++.+.+...+- ..+. ..+... ..+ ....+.|.... ...... T Consensus 213 ~~d~~~~~~~~~~t~~dl~~~g~~~~~~--~~~~---~~~~~~~~~~~e-~~~~~~~d~~~--~~~~~~----------- 273 (705) T protein:vir:88 213 IDDARFLCHREKYTVSDLRLLGVPEDVI--EELP---YDEYEFSDSQPE-RLVRDNFDMTG--QLQYNS----------- 273 (705) T ss_pred cccCcEEEEEEeccHHHHHhhcCChhHh--hhhh---cccccchhhhhh-hcccccccccc--cccccc----------- Confidence 1112444222 333454432211000 0000 000000 000 00000010000 000000 Q ss_pred eecccccccccceeeeeeeeccccccccccccceeEEEEEeeCCCceecceeeeccCCccccceeEEEeecCCCCCCcCc Q lcl|NC_019408. 203 RQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAYVQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFFGASGNTADVEK 282 (612) Q Consensus 203 ~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~~~~~~~~~~~~ 282 (612) .... .....+.+|-.... ....++|.. ..| .....+. .+..+.| .++-|+-.+|++++....-+.++.. 
T Consensus 274 --~~~~-~~~r~v~~~E~y~~-~d~~~d~~~-~~~---~~~~~g~--~il~~~~-~~~~PF~~~~~~p~~~~~~G~g~~~ 342 (705) T protein:vir:88 274 --GDDA-EANREVWASECYTL-LDVDGDGIS-ELR---RILYVGD--YIISNEP-WDCRPFADLNAYRIAHKFHGMSVYD 342 (705) T ss_pred --cccc-CCceeEEEEEeeeE-ecccCCcce-eeE---EEEEeCc--ccccccc-CCCCCEEEecceeecCccccCChHH Confidence 0000 00001122211110 011122211 111 1111111 1112222 3455666667776655555555543 Q ss_pred C--chHHHHHHHH----HHHhhhHHHHHHHHHhccceeeeecCCCCCCceEEE-eccccccCCCC----CceeEEecCch Q lcl|NC_019408. 283 P--PLLDICDLNL----SHYRTYAELEYGRLFTALPVYYAPGTDSEGTGEYHI-GPNMVWEVPQG----SEPGILEYTGQ 351 (612) Q Consensus 283 p--PLldLA~lnl----~HY~~~sD~~~~l~~~~~P~l~i~G~~~~~~~~l~i-G~~~~~~lp~~----~~~~~lE~~g~ 351 (612) . |+-.+..... .|-+.++.-..++-...+.. -.-++...+..+.+ +++....+|.. +-+..++ T Consensus 343 ~~~d~Q~~~n~~~~~~~d~~~~~~~~~~~~~~g~v~~--~d~~~~~pg~vv~~~~~~~i~~~~~~~~~~~~~~ll~---- 416 (705) T protein:vir:88 343 KIRDIQEIRSVLMRNIMDNIYRTNQGRSVVLDGQVNL--EDLLTNEAAGIVRVKSMNSITPLETPQLSGEVYGMLD---- 416 (705) T ss_pred HHhHHHHHHHHHHHHHHHHHHhccCCceeccccccCc--ccccccCCCeeEEecCCCccccccCCcCcHHHHHHHH---- Confidence 3 3333333322 22222222111111111100 00011122223333 22222223211 1111111 Q ss_pred hHHHHHHHHHHH---HHHHHHHHHHhhhc-cccch-------hHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHH--HH Q lcl|NC_019408. 352 GLKALETALNDK---ERQIAAIGGRMMPG-ASKSV-------SESNNQTVLREANEQ---SLLLNIIQACESGMTD--VV 415 (612) Q Consensus 352 ~l~~~~~~l~~~---e~qm~~lGa~ll~~-~~~~~-------~esa~~~~~~~~~~~---s~L~~~a~~~~~a~~~--~l 415 (612) -....+.+. -..+.-+....+.. .+..+ ..+......+.-.++ -+...+...+..-++. 
++ T Consensus 417 ---~~~~~~~~~tGi~~~~~G~~~~~~~~~~Ta~~i~~~~~~~~~r~~~~~r~~a~~~~~~l~~~~~~li~~~~~~~~~~ 493 (705) T protein:vir:88 417 ---RLEADRGKRTGITDRTRGLDQNTLHSNQAAMSVNQLMTAAEQQIDLIARMFAETGVKRLFQLLHDHAIKYQNQEEVF 493 (705) T ss_pred ---HHHHHHHHhhCCchHHcCCCcccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCceEE Confidence 111222111 11111100000000 00000 000000011100000 0001111111111110 01 Q ss_pred HH--------HHHHcCCcCCCCcceEEEe-ecc---------c----------ccc-CCCHHHHHHH-HHHHH-cCCCCH Q lcl|NC_019408. 416 RW--------WLMWRDVPLADTENLRYEV-NTD---------F----------LST-PIGAREMRAI-QLMAN-DGLLPD 464 (612) Q Consensus 416 ~~--------~a~w~g~~~~~~~~~~v~l-n~d---------F----------~~~-~~d~~~~~al-~~~~~-~G~is~ 464 (612) ++ -..|.|...-. ..+.+.. +++ + ... .+++..+..+ ..+.. .|.-.. T Consensus 494 ri~g~~v~v~~~~~~~~~~v~-v~v~~~~~~~eq~~a~l~~ll~~~q~l~~~~~~~~~~~~~~~~~~~~el~e~~~~k~~ 572 (705) T protein:vir:88 494 QLRGKWVAVNPANWRERSDLT-VTVGIGNMNKDQQMLHLMRIWEMAQAVVGGGGLGVLVSEQNLYNILKEVTENAGYKDP 572 (705) T ss_pred eeccchhccchHhhccCCceE-EeeccccchHHHHHHHHHHHHHHHHHhhcccchhhhcChHHHHHHHHHHHHhhhhhhH Confidence 11 01333321100 0000000 000 0 000 0112222111 11111 222222 Q ss_pred HHHHHHHHhcCccchhhhhHHHHHHhhcccccccc-------chhHHhhhhhhHHHHhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 465 PVFYEYMRKAEVISSDMTFEEFQALRADENSFINN-------PDAQARQRGYTNRGQELEQSRMAREADFTQQKIDIQER 537 (612) Q Consensus 465 et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~-------~~~~~~~~~e~~r~~~~e~~r~~~e~e~~~q~~e~~~r 537 (612) ..++...+ ..+..+.+...+...... ..+.++.+.+.+++ ..+.++++.|++.++.+++.++. T Consensus 573 ~~~~~~~~---------~~e~~~~~~~~~q~e~~~~~~~~~~q~e~~k~q~e~~~~-q~e~q~~q~E~q~~q~e~e~~~~ 642 (705) T protein:vir:88 573 DRFWTNPN---------SPEALQAKAIREQKEAQPKPEDIKAQADAQRAQSDALAK-QAEAQMKQVEAQIRLAEIELKKQ 642 (705) T ss_pred HHHhhhhh---------hHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHH Confidence 22221110 011111111110000000 00111111111111 12223333344433333332222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc--CCCchhhcCC Q lcl|NC_019408. 
538 SVAVQEGHAEVAHAAGSTSISGSRKLGDPEQAKPAVADQATIDNAKKQTANAAKVA--AQPPAPAAPG 603 (612) Q Consensus 538 ~~~~~~~r~~~e~~~~~~~~~~~r~~~~e~q~k~~~~eq~~~~~~~k~~~~~a~~~--~~~~~~~~~~ 603 (612) ++. +++++.+.++++.+ .++...+ .+++..++++...+.+.++.....++. ..-|.++.+. T Consensus 643 ~~~--~~~~e~~~~~a~~~--~~~~~~e-~e~~~~e~e~~~e~~q~~~~~~~~~~~~~~~k~~~~~rr 705 (705) T protein:vir:88 643 EAV--LQQREMALKEAELQ--LERDRFT-WERARNEAEYHLEATQARAAYIGDGKVPETKKPTKAVRR 705 (705) T ss_pred HHH--HHHHHHHHHHHHHH--HHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHhcC Confidence 222 22222111111111 1111111 111111111111111111111111111 1112233322 No 163 >protein:vir:1986 Length: 512 # NCBI annotation: Hypothetical protein # Family: family:all:313 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050633;genbank:gi:9633520;genbank:GeneID:2636304 Probab=55.97 E-value=0.47 Score=22.40 Aligned_cols=464 Identities=11% Similarity=0.022 Sum_probs=172.7 Q ss_pred CCCcHHHHHHHHHHHHHHHHhcCh-------HHHHhcccccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcC Q lcl|NC_019408. 1 MVTHPEYQYWRPEWTKLRDVMAGQ-------REIKRKAEAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRR 73 (612) Q Consensus 1 ~~~hP~y~~~~~~W~~i~d~~~G~-------~~vr~~g~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k 73 (612) -++.|+-.....-|+...+-.+.+ ..++.+..-.++.+ -+-|.....+ -.++...++.....|... T Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~iL~~a~~gd~~~~----~~L~~dm~~~---D~hi~s~l~~Rk~av~~~ 88 (512) T protein:vir:19 16 DEMQSRSDELAMVMKRTQEHPSSGVTPNRAAQMLRDAERGDLTAQ----ADLAFDMEEK---DTHLFSELSKRRLAIQAL 88 (512) T ss_pred cccccccchhcccchhhccccccCCCHHHHHHHHHHhhCCCHHHH----HHHHHHHHhh---ChHHHHHHHHHHHHHhCC Confidence 111222111111122111111110 11122211111100 0112222221 456667777777777777 Q ss_pred Cceee---cCCH---HHHHH-HhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCceEEEechhhhhcc Q lcl|NC_019408. 
74 DPIVK---NLPP---KFKDA-VRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSFAVGYSAENILDW 146 (612) Q Consensus 74 ~p~~~---~~p~---~l~~~-~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy~~~~~ae~IinW 146 (612) +..|+ +..+ .+..+ .+.+.+. -+++.++..++ .++-+|.+.+=+ .| T Consensus 89 ~w~I~p~~~~~~~~~~~a~~v~~~l~~~-~~f~~~~~~ll-dA~~~G~s~~Ei-------------------------~w 141 (512) T protein:vir:19 89 EWRIAPARDASAQEKKDADMLNEYLHDA-AWFEDALFDAG-DAILKGYSMQEI-------------------------EW 141 (512) T ss_pred CceEecCCCCCHHHHHHHHHHHHHHhcC-CCHHHHHHHHH-hhhhhcceeeee-------------------------Ee Confidence 77774 1111 12112 2222211 14888888877 477777554332 25 Q ss_pred hhhhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeecccc Q lcl|NC_019408. 147 DEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEI 226 (612) Q Consensus 147 ~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~ 226 (612) +. .+|...+..+..+.. . -|.. + T Consensus 142 ~~---~~g~~~~~~~~~r~~----~------~f~~-----------~--------------------------------- 164 (512) T protein:vir:19 142 GW---LGKMRVPVALHHRDP----A------LFCA-----------N--------------------------------- 164 (512) T ss_pred ee---eCCceeeeeeeeecc----c------ccee-----------c--------------------------------- Confidence 32 244443433333321 0 0000 0 Q ss_pred ccccccccceeEEEEEeeCCCceecceeeeccCCccccceeEEEe-ecCCCCCCcCcCchHHHHHHHH-HHHhhhHHHHH Q lcl|NC_019408. 227 EWPSGEVKLAYVQYLYEEDPESRPIARIVPTVRGEPLDFIPFKFF-GASGNTADVEKPPLLDICDLNL-SHYRTYAELEY 304 (612) Q Consensus 227 ~~~~g~~~~~~~~~~~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~-~~~~~~~~~~~pPLldLA~lnl-~HY~~~sD~~~ 304 (612) +++. ..+...++ ...|.+|..-=|+++ +....+.-.+...|..++..-+ ++| .-.+.-. 
T Consensus 165 --~~~~------~~lr~~~~----------~~~G~~l~~~k~i~~~~~~~~g~p~g~gLlr~~~w~~~fK~~-~~~~w~~ 225 (512) T protein:vir:19 165 --PDNL------NELRLRDA----------SYHGLELQPFGWFMHRAKSRTGYVGTNGLVRTLIWPFIFKNY-SVRDFAE 225 (512) T ss_pred --cCCC------cEEEecCC----------CCCceeecCCceEEEeccCCCCCcccccHHHHHHHHHHHHHH-HHHHHHH Confidence 0000 00000000 011222221013322 2222333344444555554433 333 3355666 Q ss_pred HHHHhccceeeee---cCCCCCCce-----EEEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHH--HHHHHh Q lcl|NC_019408. 305 GRLFTALPVYYAP---GTDSEGTGE-----YHIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIA--AIGGRM 374 (612) Q Consensus 305 ~l~~~~~P~l~i~---G~~~~~~~~-----l~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~--~lGa~l 374 (612) -+..-|+|+++.. |.+++..+. ..||++++..+|.|..+.|++.++.+......-++-...+|. .+|..| T Consensus 226 f~E~yG~P~~igky~~~a~~~ek~~L~~al~~~~~~a~~iiP~~~~ie~~ea~~~~~~~y~~li~~~d~~Isk~iLGqtl 305 (512) T protein:vir:19 226 FLEIYGLPMRVGKYPTGSTNREKATLMQAVMDIGRRAGGIIPMGMTLDFQSAADGQSDPFMAMIGWAEKAISKAILGGTL 305 (512) T ss_pred HHHHcCCCeeEEecCCCCCHHHHHHHHHHHHHHhhCcEEEecCCceEEEeecCCCCHHHHHHHHHHHHHHHHHHHhhhhh Confidence 7777899998875 222221111 347999999999999999999887766666666776777764 356544 Q ss_pred hhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHcCCcCCCC-cceEEEeeccccccCCCHHHHHH Q lcl|NC_019408. 375 MPGASKSVSESNNQTVLREANEQSLLLNIIQACESGMT-DVVRWWLMWRDVPLADT-ENLRYEVNTDFLSTPIGAREMRA 452 (612) Q Consensus 375 l~~~~~~~~esa~~~~~~~~~~~s~L~~~a~~~~~a~~-~~l~~~a~w~g~~~~~~-~~~~v~ln~dF~~~~~d~~~~~a 452 (612) -...++.+ |--.......-..-++.+.+..+++.++ +++.+++.|-+-...+. .-..|.+. .. 
.+.++.+ T Consensus 306 Ts~~g~~G--s~a~~~vh~ev~~di~~aDa~~i~~tln~~li~~l~~~N~~~~~~~~~~p~~~f~----~~--e~eDl~~ 377 (512) T protein:vir:19 306 TTEAGDKG--ARSLGEVHDEVRREIRNADVGQLARSINRDLIYPLLALNSDSTIDINRLPGIVFD----TS--EAGDITA 377 (512) T ss_pred cccccccc--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCccccceEEec----CC--ChhhHHH Confidence 22211111 2123455555566678899999999997 58898888854321111 11233332 11 2233322 Q ss_pred ----HHHHHHcCCCCHHHHHHHHHhcCccchhhhhHHHHHHhhccccccccchhHHhhhhhhHHHHhHHHHHHHHHHHHH Q lcl|NC_019408. 453 ----IQLMANDGLLPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFINNPDAQARQRGYTNRGQELEQSRMAREADFT 528 (612) Q Consensus 453 ----l~~~~~~G~is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~e~~r~~~e~e~~ 528 (612) +-.+...-.|+.+.+.+ +.||..+... +.... .....+..+.....+.......-+.+.+.--.+. .+.+ T Consensus 378 ~a~~~~~l~~G~~i~~~~i~e---~~Gip~~~~~-e~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~-~~~~ 451 (512) T protein:vir:19 378 LSDAIPKLAAGMRIPVSWIQE---KLHIPQPVGD-EAVFT-IQPVVPDNGSQKEAALSAEDIPQEDDIDRMGVSP-EDWQ 451 (512) T ss_pred HHHHHHHHhcCCCCCHHHHHH---HhCCCCCCCc-ccccc-CCCccccccccccccccccCCCchhhHhHHhhhH-HHHH Confidence 22333333476553333 4576433211 11111 1111111111111111100000111111100000 0111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhccccCCCchh Q lcl|NC_019408. 529 QQKIDIQERSVAVQEGHAEVAHAAGSTSISGSRKLGDPE-QAKPAVADQATIDNAKKQTANAAKVAAQPPAP 599 (612) Q Consensus 529 ~q~~e~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~e~-q~k~~~~eq~~~~~~~k~~~~~a~~~~~~~~~ 599 (612) +.-..-- +.-...+++ ... ++.+.+..+-- ..-..+.+ ..-+.-=..+.-.-+.....+. 
T Consensus 452 ~~~~~~~------~~i~~~~~~--~s~-ee~~~~L~~l~~~ld~~~l~--~~l~~a~~~A~l~G~~~~~~e~ 512 (512) T protein:vir:19 452 RSVDPLL------KPVIFSVLK--DGP-EAAMNKAASLYPQMDDAELI--DMLTRAIFVADIWGRLDAAADH 512 (512) T ss_pred HHHHHHH------HHHHHHHHh--CCH-HHHHHHHHHHhccCCHHHHH--HHHHHHHHHHHHhhhhhhhccC Confidence 1000000 000000000 001 11111110000 00000000 0000000111111111111111 No 164 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=55.58 E-value=0.48 Score=22.35 Aligned_cols=161 Identities=10% Similarity=-0.002 Sum_probs=10.1 Q ss_pred CCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCccchhhhhHHHHHHhhccccccccc-hh--HHhhhhhhHHHHhHHHHHH Q lcl|NC_019408. 445 IGAREMRAIQLMANDGLLPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFINNP-DA--QARQRGYTNRGQELEQSRM 521 (612) Q Consensus 445 ~d~~~~~al~~~~~~G~is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~-~~--~~~~~~e~~r~~~~e~~r~ 521 (612) |--.+++.-+.-...- + .....+++.. .+......++......+-...... +. ++........+...++.+. T Consensus 1 Mki~elk~el~~~~~e-l--~~~~~elr~~--~~~~~~~~~el~~~~~e~~~~~~ei~el~~~l~~~~~~~~~~~e~~~~ 75 (437) T protein:vir:10 1 MKIEKLKKDLATKTAE-L--NTKKAEIRSF--TESEDKTIDEVKAGMTEIKEKEDEIKEIRSNIEVLEQASALKVEEKRD 75 (437) T ss_pred CCHHHHHHHHHHHHHH-H--HHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1111111111000000 0 0000011000 000000000000000000000000 00 0000000000000000000 Q ss_pred HH-----HHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHH--HHHHHHHHHHHHhhcc Q lcl|NC_019408. 522 AR-----EADF--TQQKIDIQERSVAVQEGHAEVAHAAGSTSISGSRKLGDPEQ-AKPAVAD--QATIDNAKKQTANAAK 591 (612) Q Consensus 522 ~~-----e~e~--~~q~~e~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~e~q-~k~~~~e--q~~~~~~~k~~~~~a~ 591 (612) .. +.+. .+.+.++.+.. .....+..+...........|....... ......+ ........+....... 
T Consensus 76 ~~~~~~~e~~~~~~~~e~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~ 153 (437) T protein:vir:10 76 DSDLVAPELEENSADNEEDDPEKL--KTETKSEAEKDKKTVKDEEKRDAGGLQDMKLKVGGEIADKKVTAFADYLKTGEV 153 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHhHHHHhHHHHHHHHHHHHhhhhhhHHHHHhhhh Confidence 00 0000 00000000000 0000000000000000000000000000 0000000 0000000000000000 Q ss_pred ccCCC--chhhcCCCCCcccCCC Q lcl|NC_019408. 592 VAAQP--PAPAAPGAPPTNRRPT 612 (612) Q Consensus 592 ~~~~~--~~~~~~~~~~~~~~~~ 612 (612) ++... .....-.-|.+-..+. T Consensus 154 ~~~~~~~~~~~g~lvp~~~~~~i 176 (437) T protein:vir:10 154 RDVTGIALKDGKVIIPETILTPE 176 (437) T ss_pred hhhhhcccccccccchHHHHHHH Confidence 00000 0000001111111111 No 165 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=53.92 E-value=0.52 Score=22.16 Aligned_cols=96 Identities=16% Similarity=0.140 Sum_probs=17.6 Q ss_pred HHhhhhhhHHHHhHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 503 QARQRGYTNRGQELEQSRMAREAD---FTQQKIDIQERSVAVQEGHAEVAHAAGSTSISGSRKLGDPEQAKPAVADQATI 579 (612) Q Consensus 503 ~~~~~~e~~r~~~~e~~r~~~e~e---~~~q~~e~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~e~q~k~~~~eq~~~ 579 (612) .+ =||..+..+.+...++ ++.+..+.+++..++ .+..++.....+..+..+..+..+...++.++... T Consensus 1 ~~------~~~~~l~~~~~~~~~~l~el~e~~~~l~k~~~el---~~~l~ea~~~ee~~~~ee~i~~l~~~~~el~e~~~ 71 (466) T protein:vir:80 1 MA------LRQLMLAKKIEQRKAALAELLEQEKALQKRSEEL---EAAIDEANTDEEIAVVEDEINKLEGEKTELEEKKS 71 (466) T ss_pred Cc------hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 12 2222222222222222 222222222221111 11112211111100100111111111112221111 Q ss_pred HHHHHHHHhhccccCCCchhhcCCCCCcccCCC Q lcl|NC_019408. 
580 DNAKKQTANAAKVAAQPPAPAAPGAPPTNRRPT 612 (612) Q Consensus 580 ~~~~k~~~~~a~~~~~~~~~~~~~~~~~~~~~~ 612 (612) ..++.-..++++-. +.....+..+++|. T Consensus 72 ~l~~ei~~le~el~-----e~~~~~~~~~~~~~ 99 (466) T protein:vir:80 72 KLEGEIKELENELE-----QLNNKEPKNNSEPA 99 (466) T ss_pred HHHHHHHHHHHHHH-----HHHHhhhccCchhH Confidence 11111111111111 00111122222332 No 166 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=50.29 E-value=0.61 Score=21.75 Aligned_cols=122 Identities=17% Similarity=0.185 Sum_probs=13.3 Q ss_pred HHhcCccchhhhhHHHHHHhhccccccccchhHHhhhhhhHHHHhHHHHHHHHHHHHHHHH------------HHHHHHH Q lcl|NC_019408. 471 MRKAEVISSDMTFEEFQALRADENSFINNPDAQARQRGYTNRGQELEQSRMAREADFTQQK------------IDIQERS 538 (612) Q Consensus 471 lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~e~~r~~~e~e~~~q~------------~e~~~r~ 538 (612) |.+..+ ..+.....+.++ ....... .++.+.+..+.+...+..+ .+.+... T Consensus 1 m~~k~~-----~l~~~~~el~~~-----------l~eL~e~-~~~l~~~~~el~~~~ee~~~~e~~~~~~~~~~~l~~~i 63 (397) T protein:vir:96 1 MALKQL-----ILNKQIKERSSE-----------IDKLLSQ-RSDLEKQENDLERALEEAKTDEEISTVSDSADDLEKQV 63 (397) T ss_pred CcHHHH-----HHHHHHHHHHHH-----------HHHHHHH-HHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHH Confidence 222111 111111111110 0001101 1111111111111100000 0000001 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HhhccccCCCchhhcCCCCCcccCC-C Q lcl|NC_019408. 539 VAVQEGHAEVAHAAGSTSISGSRKLGDPEQAKPAVADQATIDNAKKQT-ANAAKVAAQPPAPAAPGAPPTNRRP-T 612 (612) Q Consensus 539 ~~~~~~r~~~e~~~~~~~~~~~r~~~~e~q~k~~~~eq~~~~~~~k~~-~~~a~~~~~~~~~~~~~~~~~~~~~-~ 612 (612) ...+++..+.+++.+..+.+.... .+....+..+.+........+.. .....+..-.. 
..+........- + T Consensus 64 ~~l~~~i~~~~~~~~~l~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~ 136 (397) T protein:vir:96 64 KDLDEKIAELQKEKQDLEDELAKA-ADPTDQKPKDGEKRKMKKFKVTEEELAEKRSAINA--FVKSKGAEKRDGFT 136 (397) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhh-hhhhhhhhHHHHHHHHHHHhhhhHHHHHHHHHHHH--HHHhhhhhhhhccc Confidence 111111111111111111111100 00000000001100000000000 00000000000 000000000100 0 No 167 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=46.58 E-value=0.73 Score=21.33 Aligned_cols=150 Identities=10% Similarity=0.138 Sum_probs=16.2 Q ss_pred eeccccccCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCccchhhhhHHH--HHHhhccccccccchhHHhhhhhhHHH Q lcl|NC_019408. 436 VNTDFLSTPIGAREMRAIQLMANDGLLPDPVFYEYMRKAEVISSDMTFEEF--QALRADENSFINNPDAQARQRGYTNRG 513 (612) Q Consensus 436 ln~dF~~~~~d~~~~~al~~~~~~G~is~et~~~~lqr~~vl~~~~~~eee--~~ria~e~~~~~~~~~~~~~~~e~~r~ 513 (612) .| .+++-++-+. +.||.-||=...--++.+ ......+.+.+.......+.++...+. T Consensus 1 ~~-~~~~~~~~~~--------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~ 59 (543) T protein:vir:81 1 MN-TLDTLPVHPR--------------------TGLRAIGMGKRGPIWPVMGASDDHKDDAPTLTYSQARNRADEVHARM 59 (543) T ss_pred CC-ccccCcCChh--------------------HHHHHHHhhccCccchhcccccchhhhhhhhhhhHHHHHHHHHHHHH Confidence 11 1122222222 222222221100000000 011111111111111111111111111 Q ss_pred HhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHH----HHHH- Q lcl|NC_019408. 514 QELEQSRMAREADFTQQKIDIQERSVAVQEGHAEVAHAAGSTSISGSRKLGDPEQAKPAVADQA--TIDNA----KKQT- 586 (612) Q Consensus 514 ~~~e~~r~~~e~e~~~q~~e~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~e~q~k~~~~eq~--~~~~~----~k~~- 586 (612) ++++.+.+..+.+ .++..+..++.++++.+.++.++. .+.+..++ +.+..++....++. +.+.. ++.. 
T Consensus 60 e~l~~~~~~~~~e-~~~~~~~~~e~~el~~~~~~l~~~---e~~~~~~e-~~~~~~~~~~~~~~e~r~e~~a~~~~~~~~ 134 (543) T protein:vir:81 60 EQIAELDKPTDEE-NEEFRALGAEFDSLVNHMSRLERA---AELARVRS-THEQIGKPQSGGQRRMRVEAGSSQGGRGDY 134 (543) T ss_pred HHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHH-HHHHHHHHHHHHHHHhhhhhhhHHHhhHHH Confidence 1111111000000 000000000011111111110000 00000000 00000000000000 00000 0000 Q ss_pred HhhccccCCCc--hhhcCCC------CCcccCCC Q lcl|NC_019408. 587 ANAAKVAAQPP--APAAPGA------PPTNRRPT 612 (612) Q Consensus 587 ~~~a~~~~~~~--~~~~~~~------~~~~~~~~ 612 (612) .+.+ ...... .-..... ....+.+. T Consensus 135 ~~~~-~~~~~~l~e~~~~~~~~~~e~k~~~e~~~ 167 (543) T protein:vir:81 135 DRDA-ILEPDSIEDCRFRDPWNLSEMRTFGRDAE 167 (543) T ss_pred HHhh-hccCccHHHHHHHHHHHHHHHHHHHHHHH Confidence 0000 000000 0000000 00000000 No 168 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=43.12 E-value=0.86 Score=20.95 Aligned_cols=427 Identities=12% Similarity=0.095 Sum_probs=160.2 Q ss_pred CCcHHHHH--HHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHH---HHHHHH--hhccCC--ch----HHHHHHHhhc Q lcl|NC_019408. 2 VTHPEYQY--WRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGD---DYAIYL--QRATFF--NM----LAQTRDGMTG 68 (612) Q Consensus 2 ~~hP~y~~--~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~---~Y~~rl--~rA~~~--n~----~~~tv~~~~G 68 (612) .--|.... ..+....+...+.|..........|.|..-.-+.+ ...... .|-.+- ++ +...++..+| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~av~~~~~nvVG 80 (530) T protein:vir:38 1 MKIPSLVGPDGKTSLREYAGYHGGGGGFGGQLRGWNPPSESADAALLPNYSRGNARADDLVRNNGYAANAVQLHQDHIVG 80 (530) T ss_pred CccceeecCccccchHHHhhhhcccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHhhC Confidence 11111111 01112222333333222122223355543221111 111112 222222 23 3444445555 Q ss_pred hhhcCCceee------------cCCHHHHHHH----hc----cCCCCC-CHHHHHHHHHHHHHHhCCeEEEEecCcchhh Q lcl|NC_019408. 
69 MVFRRDPIVK------------NLPPKFKDAV----RR----FAKDGS-SHATFAKAVLSEQAGVGRFGVLVDVVDNPRK 127 (612) Q Consensus 69 ~vf~k~p~~~------------~~p~~l~~~~----~d----~D~~G~-~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~ 127 (612) -=|+-.++.+ .+-..++.++ ++ ||-.|. +++++.+.+++..+..|=|++..-+-.. T Consensus 81 ~Gi~~~~~p~~~~l~~~~~~~~~~~~~ie~~w~~W~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~~~~~~~--- 157 (530) T protein:vir:38 81 SFFRLSYRPSWRYLGINEEDSRAFSRDVEAAWNEYAEDDFCGIDAERKRTFTMMIREGVAMHAFNGELCVQATWDSD--- 157 (530) T ss_pred CCceeeeccchhhcCCCHhHHHHHHHHHHHHHHHhhcCCCcEEeeeccCCHHHHHHHHHHHHhhCCceEEEeeeccC--- Confidence 4343222210 0112344443 32 576665 9999999999999999999888765432 Q ss_pred hhccCce---EEEechhhhhcchhhhccCCccceeEEEEEEEeeccccccCCCcccccceeeeeeEeeecccccccceee Q lcl|NC_019408. 128 GAVATSF---AVGYSAENILDWDEVVDMGGFYVPSRVLLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQ 204 (612) Q Consensus 128 ~~~~rPy---~~~~~ae~IinW~~~~~v~g~~~Lt~v~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~ 204 (612) ...|| +-+|.|+.|=+.. ... +|.. |+ +-.+ .+ ..|. T Consensus 158 --~g~~~~~~lq~ie~d~l~~~~-~~~-~~~~------i~---------------------~GIe--~d--~~Gr----- 197 (530) T protein:vir:38 158 --STRLFRTQFKMVSPKRVSNPN-NIG-DTRN------CR---------------------AGVK--IN--DSGA----- 197 (530) T ss_pred --CCCccceEEEEechhhcCCCC-CCC-CCCe------eE---------------------eeeE--EC--CCCc----- Confidence 12233 3445555443331 000 1110 00 0000 00 0000 Q ss_pred cccccccccceeeeeeeeccccccccccccceeEEEEEeeCCCce--ecceeeeccCCcccccee---EEEe-ecCCCCC Q lcl|NC_019408. 205 TARTLGGYSYITVYRELKLEEIEWPSGEVKLAYVQYLYEEDPESR--PIARIVPTVRGEPLDFIP---FKFF-GASGNTA 278 (612) Q Consensus 205 ~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~--~~~~~~p~~~g~~l~~IP---~v~~-~~~~~~~ 278 (612) ..-|+ ++.....+. .....+|.. 
..|| ++.+ ...+.+- T Consensus 198 ----------~~aY~---------------------i~~~~~~~~~~~~~~~~~~~-----~~v~a~~vlH~f~~~r~gQ 241 (530) T protein:vir:38 198 ----------ALGYY---------------------VSDDGYPGWMAQNWTYIPRE-----LPGGRPSFIHVFEPMEDGQ 241 (530) T ss_pred ----------eEEEE---------------------EeeccCCCccccccceeeee-----eccChhHeEeeccccCCCc Confidence 01111 111110000 011111111 1122 3322 2233344 Q ss_pred CcCcCchHHHHHHHHHHHhhh--HHHHHHHHHhccceeeeecCCC-------------C-----------------CCce Q lcl|NC_019408. 279 DVEKPPLLDICDLNLSHYRTY--AELEYGRLFTALPVYYAPGTDS-------------E-----------------GTGE 326 (612) Q Consensus 279 ~~~~pPLldLA~lnl~HY~~~--sD~~~~l~~~~~P~l~i~G~~~-------------~-----------------~~~~ 326 (612) .-+.|.|.... ..+.++..- |.+..+. +.+.-+.+|+.-.. . .... T Consensus 242 ~RGis~lapvl-~~l~~l~~y~dael~~a~-i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (530) T protein:vir:38 242 TRGANAFYSVM-EQMKMLDTLQNTQLQSAI-VKAMYAATIESELDTQSAMDFILGADNKEQQSKLTGWLGEMAAYYSAAP 319 (530) T ss_pred ccCCchHHHHH-HHHHHHhHHHHHHHHHHH-HhhhheeeeeccCCccccccccccCCcccccccccccchhhhhcccccc Confidence 44667665442 222222222 2233333 33333444431110 0 0113 Q ss_pred EEEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHH-HHHH--Hhhhcc-ccchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 327 YHIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIA-AIGG--RMMPGA-SKSVSESNNQTVLREANEQSLLLN 402 (612) Q Consensus 327 l~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~-~lGa--~ll~~~-~~~~~esa~~~~~~~~~~~s~L~~ 402 (612) +.++++.+..|++|-+++|+.++..+- .....++.+...+. .+|. .+|... ++..=.|+.+..+++.......+. 
T Consensus 320 ~~l~pG~i~~L~pGe~i~~~~p~~p~~-~~~~f~~~~lr~iaaglGi~ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~ 398 (530) T protein:vir:38 320 VRLGGARVPHLLPGDSLNLQSAQDTDN-GYSTFEQSLLRYIAAGLGVSYEQLSRNYSQMSYSTARASANESWAYFMGRRK 398 (530) T ss_pred eeccCceeeecCCCCeeeeeCCCCCCC-CHHHHHHHHHHHHHhhcCCCHHHHhcccccccHHHHHHHHHHHHHHHHHHHH Confidence 568999999999999999999874321 12344444444432 2221 122221 121122344555555444443333 Q ss_pred HHHHHHHHHHHHHHHHHHHc------CC-cCCCCcceEEE------eecccccc---CCCHH-HHHHHHHHHHcCCCCHH Q lcl|NC_019408. 403 IIQACESGMTDVVRWWLMWR------DV-PLADTENLRYE------VNTDFLST---PIGAR-EMRAIQLMANDGLLPDP 465 (612) Q Consensus 403 ~a~~~~~a~~~~l~~~a~w~------g~-~~~~~~~~~v~------ln~dF~~~---~~d~~-~~~al~~~~~~G~is~e 465 (612) .+...+ +-..+..|+ |. +++....+.+. ++-.|... .+|+. ++++...++.+|..|++ T Consensus 399 ---~~~~~~--~~pi~~~wl~~av~~G~i~~p~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~~ 473 (530) T protein:vir:38 399 ---FVASRQ--ACQMFLCWLEEAIVRRVVTLPSKARFSFQEARTAWGNANWIGSGRMAIDGLKEVQEAVMLIEAGLSTYE 473 (530) T ss_pred ---HHHHHH--hhHHHHHHHHHHHHcCCccCCCCCCCCchhhHHhhhceeeecCCccccChHHHHHHHHHHHHcCCCCHH Confidence 222221 112233332 31 11110000000 01112111 12454 89999999999999998 Q ss_pred HHHHHHHhcCccchhhhhHHHHHHhhccccc-----cccc-hhHH-hhhhhhHHHHhHHHHHHHH Q lcl|NC_019408. 466 VFYEYMRKAEVISSDMTFEEFQALRADENSF-----INNP-DAQA-RQRGYTNRGQELEQSRMAR 523 (612) Q Consensus 466 t~~~~lqr~~vl~~~~~~eee~~ria~e~~~-----~~~~-~~~~-~~~~e~~r~~~~e~~r~~~ 523 (612) ....+ +|. |++++.+.++.+... +..+ +... 
........+.+.+..-..+ T Consensus 474 ~~~a~---~G~-----D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~d~~~~a 530 (530) T protein:vir:38 474 KECAK---RGD-----DYQEIFAQQVRESMERRAAGLNPPAWAAAAFEAGVKKSNEEEQDGARAA 530 (530) T ss_pred HHHHH---cCC-----CHHHHHHHHHHHHHHHHHcCCCCCCCcccccCCCCCCCCCCCCCCCCCC Confidence 77554 343 444444444333210 0000 0000 0000000000000000000 No 169 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=42.13 E-value=0.9 Score=20.84 Aligned_cols=107 Identities=7% Similarity=-0.010 Sum_probs=7.7 Q ss_pred HHhhhhh-hHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHH-HHHHHHHHHHHHHH Q lcl|NC_019408. 503 QARQRGY-TNRGQELEQSRMAREADFTQQKIDIQERSVAVQEGHAEVAHAAGSTSI---SGSRKL-GDPEQAKPAVADQA 577 (612) Q Consensus 503 ~~~~~~e-~~r~~~~e~~r~~~e~e~~~q~~e~~~r~~~~~~~r~~~e~~~~~~~~---~~~r~~-~~e~q~k~~~~eq~ 577 (612) +++.... .++-++...+..+.. .++++.+++.+......+....++...+.+. +.+.+. .-+++.+..+.+.+ T Consensus 1 m~~k~~~l~~~~~el~~~l~eL~--e~~~~l~~~~~el~~~~ee~~~~e~~~~~~~~~~~l~~~i~~l~~~i~~~~~~~~ 78 (397) T protein:vir:96 1 MALKQLILNKQIKERSSEIDKLL--SQRSDLEKQENDLERALEEAKTDEEISTVSDSADDLEKQVKDLDEKIAELQKEKQ 78 (397) T ss_pred CcHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1111000 000011111111100 0011111110000000000000000000000 000000 00000000000000 Q ss_pred HHHHHHHHHHhhccccCCCchhhcCCCCCcc---cCCC Q lcl|NC_019408. 578 TIDNAKKQTANAAKVAAQPPAPAAPGAPPTN---RRPT 612 (612) Q Consensus 578 ~~~~~~k~~~~~a~~~~~~~~~~~~~~~~~~---~~~~ 612 (612) ....+.+......... +............. .... 
T Consensus 79 ~l~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 115 (397) T protein:vir:96 79 DLEDELAKAADPTDQK-PKDGEKRKMKKFKVTEEELAE 115 (397) T ss_pred HHHHHHHhhhhhhhhh-hHHHHHHHHHHHhhhhHHHHH Confidence 0000000000000000 00000000000000 0000 No 170 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=41.62 E-value=0.92 Score=20.78 Aligned_cols=99 Identities=14% Similarity=0.091 Sum_probs=10.4 Q ss_pred cCccchhhhhHHHHHHhhccccccccchhHHhhhhhhHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 474 AEVISSDMTFEEFQALRADENSFINNPDAQARQRGYTNRGQELEQSRMAREADFTQQKIDIQERSVAVQEGHAEVAHAAG 553 (612) Q Consensus 474 ~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~e~~r~~~e~e~~~q~~e~~~r~~~~~~~r~~~e~~~~ 553 (612) .+-+.. ..+.+++... .++.+++.++..++.+....+..+ ..++...+.++... T Consensus 1 ~~~~~~-------------------~~~~~~~~~~----~~el~~~~~e~~~~l~~~~~e~~~---~~e~~~~e~~~~~~ 54 (418) T protein:vir:10 1 MSHMNE-------------------PRQFGRKSGG----DSHPEQVLETVTKELKRIGDEVKS---AGEKALAEAKRAGD 54 (418) T ss_pred CCCchh-------------------HHHHHHHhcc----HHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHhhhh Confidence 000000 0001111100 111111111111111111111000 00000001000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCchhhcCCCCCcccCCC Q lcl|NC_019408. 554 STSISGSRKLGDPEQAKPAVADQATIDNAKKQTANAAKVAAQPPAPAAPGAPPTNRRPT 612 (612) Q Consensus 554 ~~~~~~~r~~~~e~q~k~~~~eq~~~~~~~k~~~~~a~~~~~~~~~~~~~~~~~~~~~~ 612 (612) .. +..++..++...+..+. .++++.-+++...... .. .+...++. 
T Consensus 55 ~~--~e~~~~~~~l~~~~~~l-~~~~~~~e~~~~~~~~-------~~----~~~~~~~~ 99 (418) T protein:vir:10 55 LG--VETKATVDELLIKQGEL-QARLLEAEQKLARGGG-------SA----ELETPKTL 99 (418) T ss_pred hh--HHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhccc-------cc----ccchhhhh Confidence 00 00000011111110111 1111111111000000 00 00000111 No 171 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=39.49 E-value=1 Score=20.55 Aligned_cols=126 Identities=10% Similarity=0.046 Sum_probs=9.8 Q ss_pred CCHHHHHHHHHhcCccchhhhhHHHHHHhhccccccccchhHHhhhhhhHHHHhHHHHHHHHHH-HHHHHHHHHHHHHHH Q lcl|NC_019408. 462 LPDPVFYEYMRKAEVISSDMTFEEFQALRADENSFINNPDAQARQRGYTNRGQELEQSRMAREA-DFTQQKIDIQERSVA 540 (612) Q Consensus 462 is~et~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~e~~r~~~e~-e~~~q~~e~~~r~~~ 540 (612) |.++ +++|+. .|.+= .++++....+.+.-.+..+.+.+. ..+.+..+...+..+ T Consensus 1 ~~k~--~eem~~---------------~i~eL--------~e~r~~l~~e~~~l~d~ak~e~~~~~~~~e~~e~~a~~~e 55 (477) T protein:vir:84 1 MEKH--LEELRA---------------LRAAA--------VEAVATLKAERQAIADGAKAEERAALSADETAEFRAKSAS 55 (477) T ss_pred CchH--HHHHHH---------------HHHHH--------HHHHHHHHHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHHH Confidence 1111 111110 00000 000000000000000000000000 000000000000111 Q ss_pred HHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHH-----HHHHHHHH------HHHH--HHHhhccccCCCchhhcCCC Q lcl|NC_019408. 541 VQEGHAEVAHA---AGSTSISGSRKLGDPEQAKPA-----VADQATID------NAKK--QTANAAKVAAQPPAPAAPGA 604 (612) Q Consensus 541 ~~~~r~~~e~~---~~~~~~~~~r~~~~e~q~k~~-----~~eq~~~~------~~~k--~~~~~a~~~~~~~~~~~~~~ 604 (612) .+++-...++. .++++....+....+.+.+.. +.+..+.. ..-+ ......+..........+.. 
T Consensus 56 l~~ei~~le~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 135 (477) T protein:vir:84 56 IKAELDKVEDLDEQIRELESEIERSGKLEAETKTVRKATVEVNEALTYEKGNGQSYFRDLAMQTVGMADEPAKERLRRHM 135 (477) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhhhhcccccccccchhhhhhHHHHHHHHHHHHHhhhhhhHHHHHHHHHH Confidence 11110000000 000000000000000000000 00000000 0000 00000000000000000011 Q ss_pred CCcccCCC Q lcl|NC_019408. 605 PPTNRRPT 612 (612) Q Consensus 605 ~~~~~~~~ 612 (612) -...+.+. T Consensus 136 ~~~~~~~~ 143 (477) T protein:vir:84 136 VDVESDKE 143 (477) T ss_pred hhhhhhhh Confidence 11111111 No 172 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=38.55 E-value=1.1 Score=20.44 Aligned_cols=363 Identities=11% Similarity=0.042 Sum_probs=145.5 Q ss_pred HHHHHHHHHHhcC----hHHHH-hcccccCCCCC-CCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeecCCHHH Q lcl|NC_019408. 11 RPEWTKLRDVMAG----QREIK-RKAEAYLPAMK-GADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKNLPPKF 84 (612) Q Consensus 11 ~~~W~~i~d~~~G----~~~vr-~~g~~YLPk~~-~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~~p~~l 84 (612) +.-|.+...--.. ...+. ..+..+++.+. +..-.. ...+. .+.+-..++.+++.|-+-|+.+.. ... T Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~-~~al~----~~~v~~~i~~ia~~ia~~p~~~~~--~~~ 73 (386) T protein:vir:49 1 MPIFNITNLATESPPINQESFFDIADSDFLASLNSSEWVSA-ENALK----NSDLFSIISQLSNDLATAKITTSR--KQL 73 (386) T ss_pred CchhhhhccCCCCcccchhhhhhhhhccccccccCCceech-hhhhc----cHHHHHHHHHHHHHhhhCceeecc--chh Confidence 1111111100000 00000 00000111110 000000 11122 234445666666666666665532 223 Q ss_pred HHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCc-eEEEechhhhhcchhhhccCCccceeEEEE Q lcl|NC_019408. 
85 KDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATS-FAVGYSAENILDWDEVVDMGGFYVPSRVLL 163 (612) Q Consensus 85 ~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rP-y~~~~~ae~IinW~~~~~v~g~~~Lt~v~l 163 (612) ..|+..... ..+-.+|.+.++...+.+|-+++++..... .+| -+..+.|..|- + T Consensus 74 ~~l~~~PN~-~~t~~~f~~~~~~~lll~Gna~~~i~r~~~------g~~~~l~~i~~~~v~------------------v 128 (386) T protein:vir:49 74 QGIVDNPSN-NANRFNFYQSIFAQMLLGGEAFAYRWRNDN------GRDMKWEYLRPSQVS------------------F 128 (386) T ss_pred hhhhhccCC-CCCHHHHHHHHHHHhhhcCCEEEEEEECCC------CcEEEEEEecCceeE------------------E Confidence 345555444 678899999999999999999999865321 122 12222222210 0 Q ss_pred EEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccceeEEEEEe Q lcl|NC_019408. 164 REFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAYVQYLYE 243 (612) Q Consensus 164 ~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~~~~~ 243 (612) . ...++ + .-.|++.. T Consensus 129 ~--------------------------~~~~~---------------~---~~~y~~~~--------------------- 143 (386) T protein:vir:49 129 N--------------------------RLDNQ---------------N---GLYYNITF--------------------- 143 (386) T ss_pred E--------------------------EcCCC---------------c---eEEEEEEE--------------------- Confidence 0 00000 0 00111100 Q ss_pred eCCCceecceeeeccCCccccceeEEEe-ecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHH-HHHHhccceeeeecCCC Q lcl|NC_019408. 244 EDPESRPIARIVPTVRGEPLDFIPFKFF-GASGNTADVEKPPLLDICDLNLSHYRTYAELEY-GRLFTALPVYYAPGTDS 321 (612) Q Consensus 244 ~~~~~~~~~~~~p~~~g~~l~~IP~v~~-~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~-~l~~~~~P~l~i~G~~~ 321 (612) .+..+... ...+ -+. ++++ +....+...|.||+..+.. .|........+.. .+...+.|-.++.--.. 
T Consensus 144 ~~~~~~~~-~~~~------~~e--vih~~~~~~~~~~~G~s~l~~~~~-~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~ 213 (386) T protein:vir:49 144 DDPHIAPK-QHVP------QND--ILHFRLLSVDGGLTSVSPLMALGR-EFNIQKASDKLTISALKNALNANGILKIKGG 213 (386) T ss_pred cCccccce-eEEc------ccc--EEEecCCCCCCccccccHHHHHHH-HHHHHHHHHHHHHHHHHccCCccEEEEeCCC Confidence 00000000 0000 011 1222 1122333457787765444 3344444444443 44455678877742111 Q ss_pred CCCc----------eEEEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHH-HH--HhhhccccchhHHHHH Q lcl|NC_019408. 322 EGTG----------EYHIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAI-GG--RMMPGASKSVSESNNQ 388 (612) Q Consensus 322 ~~~~----------~l~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~l-Ga--~ll~~~~~~~~esa~~ 388 (612) ...+ ...=.++.++.++.|.++.=+..+..-.. ..+..+-..+++..+ |. .++-. +.. .++.. T Consensus 214 ~~~~~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~~-~~e~~~~~~~~Ia~~fgVPp~~lg~-~~~--~~~~~ 289 (386) T protein:vir:49 214 GLLDFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQ-LLSQADWTTGQFAKVYGIPESIVGG-DGD--QQSSL 289 (386) T ss_pred CChHHHHHHHHHHHHhccCCCCceecCCCceEEEccCChhHHH-HHHHHHHHHHHHHHHhCCCHHHhCC-CCC--ccchH Confidence 1110 12223445667777766555544444332 244455555566544 32 22311 111 11111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCcceEEEeeccccccCCCHH-HHHHHHHHHHcCCCCHHHH Q lcl|NC_019408. 389 TVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLADTENLRYEVNTDFLSTPIGAR-EMRAIQLMANDGLLPDPVF 467 (612) Q Consensus 389 ~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~~~~~v~ln~dF~~~~~d~~-~~~al~~~~~~G~is~et~ 467 (612) ...+. .-...+.-+...+++.+++.| +- .+.+.++. .. ..|.. ....+-.++.+|.++.-.+ T Consensus 290 ~~~~~-~~~~~i~~~l~~i~~~~~~~l-------~~------~~~~~~~~--~~-~~d~~~~~~~~~~l~~~g~~t~nE~ 352 (386) T protein:vir:49 290 EMIYN-IYFKSVSRYLRPFVSEMSKKL-------SC------EVDVDISP--AV-DPTGSNYISLINSMVKSGTLAQNQG 352 (386) T ss_pred HHHHH-HHHHHHHHHHHHHHHHHHHHh-------cc------hhcccchh--hh-ccCHHHHHHHHHHHHhCCCcCHHHH Confidence 11121 122344555555555555544 11 12222211 11 12333 3455667889999999999 Q ss_pred HHHHHhcCccchhhhhHHHHHHhhccccccccch Q lcl|NC_019408. 
468 YEYMRKAEVISSDMTFEEFQALRADENSFINNPD 501 (612) Q Consensus 468 ~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~ 501 (612) ++.|.+.|+.+.+...-+.......+..-.+..+ T Consensus 353 r~~l~~~~~~~~~~~~~~~~~~~~~~gGd~~~~~ 386 (386) T protein:vir:49 353 LYILQQAEILPKELPDGKNPNRTSLKGGEINEQD 386 (386) T ss_pred HHHHhhCCCCCCcCcchhccCCCCCCCCCCCCCC Confidence 9999888887654322111111111110000000 No 173 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=37.55 E-value=1.1 Score=20.33 Aligned_cols=107 Identities=11% Similarity=-0.046 Sum_probs=10.1 Q ss_pred HHhhhhhhHHHHhHHHHHHHHHH--HHHHHHHHHHHHHHH-H-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---- Q lcl|NC_019408. 503 QARQRGYTNRGQELEQSRMAREA--DFTQQKIDIQERSVA-V-QEGHAEVAHAAGSTSISGSRKLGDPEQAKPAVA---- 574 (612) Q Consensus 503 ~~~~~~e~~r~~~~e~~r~~~e~--e~~~q~~e~~~r~~~-~-~~~r~~~e~~~~~~~~~~~r~~~~e~q~k~~~~---- 574 (612) +.+.+.+...++..++-++..+. +..+.+.++.++... + ..+..+...+.+..+ ++-...+++.+.... T Consensus 1 m~~~e~~~~~~~~~~~l~~~~~~~~~e~~~~~e~~~~~~~~~~~~~~~e~~~~~~~l~---~~~~~~e~~~~~~~~~~~~ 77 (379) T protein:vir:10 1 MEALEIKVALEAIKGQVDSKSSAQALEVKGLIEALEAKMTSEKDLAVNELKSDMAALQ---AHADKLDVKLKEKAKSEDK 77 (379) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhhHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHhccccccc Confidence 11100110111111111100000 000001111100000 0 000000000000000 000000000000000 Q ss_pred --HH-HHHHHHHHHH-HhhccccCCCchhhcCCCC--CcccCCC Q lcl|NC_019408. 575 --DQ-ATIDNAKKQT-ANAAKVAAQPPAPAAPGAP--PTNRRPT 612 (612) Q Consensus 575 --eq-~~~~~~~k~~-~~~a~~~~~~~~~~~~~~~--~~~~~~~ 612 (612) .. .......+.. 
+....+..........+.+ ...--|+ T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ip~ 121 (379) T protein:vir:10 78 SDSLVKSITENFNDIKEVRNGKSIQVKAVGDMTLPVNLTGAQPK 121 (379) T ss_pred chhHHHHHHHHHHhHHHHHhhhhhhhhhhcccccCCCCccccch Confidence 00 0000000000 0000011111111110111 0111222 No 174 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=37.51 E-value=1.1 Score=20.33 Aligned_cols=361 Identities=11% Similarity=0.035 Sum_probs=145.8 Q ss_pred HHHHHHHHHHhcChHHHHhcc--------cccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHhhchhhcCCceeecCCH Q lcl|NC_019408. 11 RPEWTKLRDVMAGQREIKRKA--------EAYLPAMKGADGDDYAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKNLPP 82 (612) Q Consensus 11 ~~~W~~i~d~~~G~~~vr~~g--------~~YLPk~~~e~~~~Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~~p~ 82 (612) +.-|.+..-- ........ ..++..+..-..-.....++.++ +...|+-+++.+-.-|..+. .. T Consensus 1 M~~f~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~----v~~~i~~ia~~ia~~p~~~~--~~ 71 (386) T protein:vir:48 1 MPIFNITNLA---TESPPISQGGFFDITDPDFLSTLNGSEWVSAESALRNSD----LFSIINQLSNDLATVKLTAS--RK 71 (386) T ss_pred Cccccccccc---ccccccccccccccccchhcccccCCceechhhhhcchH----HHHHHHHHHHhhccCceeec--cc Confidence 1111110000 00000000 00000000000000112233333 33444444444444444442 23 Q ss_pred HHHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCc-eEEEechhhhhcchhhhccCCccceeEE Q lcl|NC_019408. 83 KFKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATS-FAVGYSAENILDWDEVVDMGGFYVPSRV 161 (612) Q Consensus 83 ~l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rP-y~~~~~ae~IinW~~~~~v~g~~~Lt~v 161 (612) ....|+..... +.+-.+|.+.++...+.+|-+++++..... 
.+| -+..++|..| T Consensus 72 ~~~~l~~~pN~-~~t~~~f~~~~~~~lll~Gna~~~i~r~~~------g~~~~L~~l~~~~v------------------ 126 (386) T protein:vir:48 72 QLQGIIDNPSN-NANRFNFYQSIFAQMLLGGEAFAYRWRNEN------GRDMKWEYLRPSQV------------------ 126 (386) T ss_pred hhHHHhhcCCC-CCCHHHHHHHHHHHhhhcCcEEEEEEECCC------CcEEEEEEecCcee------------------ Confidence 44556666654 578889999999999999999998865321 122 1112222211 Q ss_pred EEEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccceeEEEE Q lcl|NC_019408. 162 LLREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAYVQYL 241 (612) Q Consensus 162 ~l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~~~ 241 (612) .+.. ..+| + ...|++... T Consensus 127 ~v~~--------------------------~~~~---------------~---~~~y~~~~~------------------ 144 (386) T protein:vir:48 127 SFNR--------------------------LDNK---------------D---GIYYNITFD------------------ 144 (386) T ss_pred EEEE--------------------------cCCC---------------c---eEEEEEEec------------------ Confidence 0000 0000 0 011221100 Q ss_pred EeeCCCceecceeeeccCCccccceeEEEe-ecCCCCCCcCcCchHHHHHHHHHHHhhhHHH-HHHHHHhccceeeeecC Q lcl|NC_019408. 242 YEEDPESRPIARIVPTVRGEPLDFIPFKFF-GASGNTADVEKPPLLDICDLNLSHYRTYAEL-EYGRLFTALPVYYAPGT 319 (612) Q Consensus 242 ~~~~~~~~~~~~~~p~~~g~~l~~IP~v~~-~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~-~~~l~~~~~P~l~i~G~ 319 (612) +..... ....| -+. ++.+ +...++...+.+|+..++. .|.......++ ...+...+.|..+++-- T Consensus 145 ---~~~~~~-~~~~~------~~e--vih~~~~~~~~~~~G~s~i~~~~~-~i~~~~~~~~~~~~~~~ng~~~~~ii~~~ 211 (386) T protein:vir:48 145 ---DPRIPP-KQHVP------QGD--VLHFKLLSVDGGLTSVSPLMALSR-ELNIQKASDKLTLNSLKNALNANGILKIK 211 (386) T ss_pred ---Cccccc-eeEec------Ccc--EEEecCCCCCCceeeccHHHHHHH-HHHHHHHHHHHHHHHHhccCCcceEEEeC Confidence 000000 00000 011 1111 1122333457788875543 33333333433 44556667888888622 Q ss_pred CCCCCc----------eEEEeccccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHH-HH--HhhhccccchhHHH Q lcl|NC_019408. 
320 DSEGTG----------EYHIGPNMVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAI-GG--RMMPGASKSVSESN 386 (612) Q Consensus 320 ~~~~~~----------~l~iG~~~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~l-Ga--~ll~~~~~~~~esa 386 (612) .....+ ...-+++.++.|+.|.++.=+..+..-+. ..+..+-..+++..+ |. .++-. + ....+. T Consensus 212 ~~~~~e~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~q-~~e~~~~~~~~Ia~~fgVPp~~lg~-~-~~~~~~ 288 (386) T protein:vir:48 212 GGGLLDFKTKLSRSRQAMKQMQGGPLVLDDLEEFTPLEIKSNVSQ-LLKQADWTTGQFAKVYGIPENVVGG-Q-GDQQSS 288 (386) T ss_pred CCCCHHHHHHHHHHHHHhhcCCCCceecCCCceEEEcCCChhHHH-HHHHHHHHHHHHHHHhCCCHHHhCC-C-CCcccH Confidence 111111 01223445566777665554444433332 244455555565443 42 22211 1 111122 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCcceEEEeeccccccCCCHH-HHHHHHHHHHcCCCCHH Q lcl|NC_019408. 387 NQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLADTENLRYEVNTDFLSTPIGAR-EMRAIQLMANDGLLPDP 465 (612) Q Consensus 387 ~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~~~~~v~ln~dF~~~~~d~~-~~~al~~~~~~G~is~e 465 (612) .+..... -...|.-++..++++++..|- . .+.+.+...|.. +.. ....+-.++.+|.++.- T Consensus 289 e~~~~~~--~~~~l~P~~~~ie~~l~~~l~-------~------~~~~~~~~~~~~---d~~~~~~~~~~l~~~g~~t~n 350 (386) T protein:vir:48 289 LEMSLDL--YNKAVSRYLRPFLSELSQKLS-------C------DVDADILPAVDP---TGSNSVSRINSMVKSGTLAQN 350 (386) T ss_pred HHHHHHH--HHHHHHHHHHHHHHHHHHhhc-------c------hhhcchhhhhcc---ChHHHHHHHHHHHhCCCcCHH Confidence 2333332 233466677777777765541 1 111111112221 222 33456678889999999 Q ss_pred HHHHHHHhcCccchhhhhHHHHHHhhccccccccchhHH Q lcl|NC_019408. 466 VFYEYMRKAEVISSDMTFEEFQALRADENSFINNPDAQA 504 (612) Q Consensus 466 t~~~~lqr~~vl~~~~~~eee~~ria~e~~~~~~~~~~~ 504 (612) ..++.+-+.++.+.+...-+ ........-+.++.+. 
T Consensus 351 E~r~~lg~~~~~~~~~~~~~---~~~~~~~~gGd~~~~~ 386 (386) T protein:vir:48 351 QGLYILQQAEILPKELPEGE---NPNKTTLKGGEINGED 386 (386) T ss_pred HHHHHhhcCCCCCccchhhc---CCCCCccCCCCCCCCC Confidence 99888877777654322111 0100000001111111 No 175 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=24.15 E-value=2.2 Score=18.69 Aligned_cols=94 Identities=5% Similarity=-0.095 Sum_probs=15.2 Q ss_pred hhhhhhHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 505 RQRGYTNRGQELEQSRMAREADFTQQKIDIQERSVAVQEGHAEVAHAAGSTSISGSRKLGDPEQAKPAVADQATIDNAKK 584 (612) Q Consensus 505 ~~~~e~~r~~~~e~~r~~~e~e~~~q~~e~~~r~~~~~~~r~~~e~~~~~~~~~~~r~~~~e~q~k~~~~eq~~~~~~~k 584 (612) +...... ++- +++.+...+.+++..+-.+...+...+-++..++ +..+.+......++.+.+..++..+..+.+.+ T Consensus 1 ~~~~~~~-~~~--~~~~~~~~el~~~~~e~~~~l~~~~~e~~~~~e~-~~~e~~~~~~~~~e~~~~~~~l~~~~~~l~~~ 76 (418) T protein:vir:10 1 MSHMNEP-RQF--GRKSGGDSHPEQVLETVTKELKRIGDEVKSAGEK-ALAEAKRAGDLGVETKATVDELLIKQGELQAR 76 (418) T ss_pred CCCchhH-HHH--HHHhccHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHH Confidence 2111111 111 1111111122222111111111111111111000 00001111111111111111111111111111 Q ss_pred HHHhhccccCCCchhhcCCCCCcccCCC Q lcl|NC_019408. 585 QTANAAKVAAQPPAPAAPGAPPTNRRPT 612 (612) Q Consensus 585 ~~~~~a~~~~~~~~~~~~~~~~~~~~~~ 612 (612) .... +.+...++.+..+. 
T Consensus 77 ~~~~----------e~~~~~~~~~~~~~ 94 (418) T protein:vir:10 77 LLEA----------EQKLARGGGSAELE 94 (418) T ss_pred HHHH----------HHHHhhcccccccc Confidence 1111 11112223333333 No 176 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=23.89 E-value=2.2 Score=18.66 Aligned_cols=333 Identities=13% Similarity=0.085 Sum_probs=131.4 Q ss_pred HHHHHHHHHHHhcChHHHHhcccccCCCCCCCCHHH------HHHHHhhccCCchHHHHHHHhhchhhcCCceeecCCHH Q lcl|NC_019408. 10 WRPEWTKLRDVMAGQREIKRKAEAYLPAMKGADGDD------YAIYLQRATFFNMLAQTRDGMTGMVFRRDPIVKNLPPK 83 (612) Q Consensus 10 ~~~~W~~i~d~~~G~~~vr~~g~~YLPk~~~e~~~~------Y~~rl~rA~~~n~~~~tv~~~~G~vf~k~p~~~~~p~~ 83 (612) |. -|....- +. ......|.|......... -+.-|+. +.+-..|+.+++.|-.-|. .+ .+. T Consensus 1 M~-~~~~f~~-----r~-~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~----~av~~cv~~ia~~ia~~p~-~~--~~~ 66 (359) T protein:vir:10 1 MS-ILNPFER-----RS-SITPNNYYPFMVQNGSIVPNSLVDATEALKN----SDLYAVTSLISSDIAGTRF-IG--NQV 66 (359) T ss_pred Cc-ccchhhc-----cc-cCCCCcchhhhhccccccCCcccCHHHhhcc----hHHHHHHHHHHHhhhcCcc-cc--chH Confidence 10 0000000 00 000111111111111000 0111222 2233445555554444433 22 133 Q ss_pred HHHHHhccCCCCCCHHHHHHHHHHHHHHhCCeEEEEecCcchhhhhccCce-EEEechhhhhcchhhhccCCccceeEEE Q lcl|NC_019408. 84 FKDAVRRFAKDGSSHATFAKAVLSEQAGVGRFGVLVDVVDNPRKGAVATSF-AVGYSAENILDWDEVVDMGGFYVPSRVL 162 (612) Q Consensus 84 l~~~~~d~D~~G~~l~~f~~~~~~~~l~~Gr~~vlVD~p~a~~~~~~~rPy-~~~~~ae~IinW~~~~~v~g~~~Lt~v~ 162 (612) +..|+..-.- -.+-.+|.+.++...+.+|-++++|..... .+|. +..+++..| . 
T Consensus 67 ~~~L~~~PN~-~~t~~~f~~~~~~~lll~Gnay~~i~r~~~------g~~~~l~~l~~~~v------------------~ 121 (359) T protein:vir:10 67 FTSVLNNPSH-LTNAFSFWQTAILNLLLNGNVFLAILKGDN------SLMKELRLIPSNAI------------------T 121 (359) T ss_pred HHHHhhcccc-cCCHHHHHHHHHHhccccCceEEEEEECCC------CeEEEEEEeCCceE------------------E Confidence 4555555544 478889999999999999999888753211 1111 111111110 0 Q ss_pred EEEEeeccccccCCCcccccceeeeeeEeeecccccccceeecccccccccceeeeeeeeccccccccccccceeEEEEE Q lcl|NC_019408. 163 LREFVRDLRWKSDIEPLTTAQARKARAAALASGSASSPMVRQTARTLGGYSYITVYRELKLEEIEWPSGEVKLAYVQYLY 242 (612) Q Consensus 163 l~E~v~~~~~~~~~d~f~~~~~~q~r~l~l~~g~~~~~~~~~~~~~~~g~~~~~~~R~~~~~~~~~~~g~~~~~~~~~~~ 242 (612) + .+.++. -.|++.... ++. .. T Consensus 122 i---------------------------~~~~~~-------------------~~y~~~~~~-----~~~------~~-- 142 (359) T protein:vir:10 122 I---------------------------DLTDDT-------------------LTYEVNQFD-----DYP------SA-- 142 (359) T ss_pred E---------------------------EEcCCe-------------------EEEEEEecC-----Cce------EE-- Confidence 0 000100 011110000 000 00 Q ss_pred eeCCCceecceeeeccCCccccceeEEEe--ecCCCCCCcCcCchHHHHHHHHHHHhhhHHHHHH-HHHhccceeeeecC Q lcl|NC_019408. 243 EEDPESRPIARIVPTVRGEPLDFIPFKFF--GASGNTADVEKPPLLDICDLNLSHYRTYAELEYG-RLFTALPVYYAPGT 319 (612) Q Consensus 243 ~~~~~~~~~~~~~p~~~g~~l~~IP~v~~--~~~~~~~~~~~pPLldLA~lnl~HY~~~sD~~~~-l~~~~~P~l~i~G~ 319 (612) ++ + +=+.|-|-.+ +....+...|.+|+..++. .|..-....++... +...+.|-.+++-. T Consensus 143 ----------~~-~-----~~evih~~~~~~~~~~~dg~~G~spi~~~~~-~i~~~~~~~~~~~~~f~ng~~~~gil~~~ 205 (359) T protein:vir:10 143 ----------KY-N-----ASEMIHVKIMAYGVDTLHNLVGHSPLESLTS-EIGQQKEANRLSLSTLKGALNPTSVVKVP 205 (359) T ss_pred ----------EE-c-----ccceEEeccCCCCCCccCccccccHHHHHHH-HHHHHHHHHHHHHHHHhccCCcceEEEeC Confidence 00 0 0011112111 1111233467888865544 33333333444444 44455688777521 Q ss_pred ----CCCCCc-------eEEEecc--ccccCCCCCceeEEecCchhHHHHHHHHHHHHHHHHHH-HH--Hhhhccccchh Q lcl|NC_019408. 
320 ----DSEGTG-------EYHIGPN--MVWEVPQGSEPGILEYTGQGLKALETALNDKERQIAAI-GG--RMMPGASKSVS 383 (612) Q Consensus 320 ----~~~~~~-------~l~iG~~--~~~~lp~~~~~~~lE~~g~~l~~~~~~l~~~e~qm~~l-Ga--~ll~~~~~~~~ 383 (612) +++..+ ...-|.+ ..+.|+.|.++.-++.+..-++ ..+..+-..+++..+ |- .++-... . T Consensus 206 ~~~l~~e~~~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~q-~le~~~~~~~~Ia~~fgVPp~~lg~~~---~ 281 (359) T protein:vir:10 206 QGTLSSEAKDSIRKEFEKANGGNNSGRVMVLDQSADFSTVSINADVAN-YLNSMNWGRTQIAKAFGVSDSYLNGTG---D 281 (359) T ss_pred CCCCCHHHHHHHHHHHHHHhCccccCCceecCCCcceeeecCCHHHHH-HHHHHHHHHHHHHHHhCCCHHHhCCCC---c Confidence 111000 1222222 3567787777666665554332 223333334444322 22 2221111 1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCcceEEEeeccccccCCCHHHHHHHHHHHHcCCCC Q lcl|NC_019408. 384 ESNNQTVLREANEQSLLLNIIQACESGMTDVVRWWLMWRDVPLADTENLRYEVNTDFLSTPIGAREMRAIQLMANDGLLP 463 (612) Q Consensus 384 esa~~~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~w~g~~~~~~~~~~v~ln~dF~~~~~d~~~~~al~~~~~~G~is 463 (612) .+++...++....+ .|..+...+...++..|.-- +.. ....-.+|+ +......+..++++|.++ T Consensus 282 ~~~~~~~~e~~~~~-~l~~~l~p~~~~l~~~l~~~---~~~--------~~~~~~~~d----~~~~~~~~~~~~~~G~~t 345 (359) T protein:vir:10 282 QQSSLDQIKDLYVN-ALNRFIEPLISELRIKCDSS---IGV--------DMSPITDYS----NSVFKADILNWVKEGIIE 345 (359) T ss_pred ccccHHHHHHHHHH-HHHHHHHHHHHHHHHHhhhh---hcc--------cchhhhhcC----HHHHHHHHHHHHhCCCcC Confidence 11122222222222 23334444444454444211 110 000011222 233455677899999999 Q ss_pred HHHHHHHHHhcCcc Q lcl|NC_019408. 
464 DPVFYEYMRKAEVI 477 (612) Q Consensus 464 ~et~~~~lqr~~vl 477 (612) .-..++.+-.-.|+ T Consensus 346 ~NE~R~~l~~~pv~ 359 (359) T protein:vir:10 346 PTEAKTLLESKGII 359 (359) T ss_pred HHHHHHHhCCCCCC Confidence 99998888777787 No 177 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=22.63 E-value=2.4 Score=18.48 Aligned_cols=121 Identities=11% Similarity=0.010 Sum_probs=10.0 Q ss_pred hhhHHHHHHhhccccccccchhHHhhhhhhHHHHhHHHHHHHHHH---HHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019408. 481 MTFEEFQALRADENSFINNPDAQARQRGYTNRGQELEQSRMAREA---DFTQQKIDI-QERSVAVQEGHAEVAHAAGSTS 556 (612) Q Consensus 481 ~~~eee~~ria~e~~~~~~~~~~~~~~~e~~r~~~~e~~r~~~e~---e~~~q~~e~-~~r~~~~~~~r~~~e~~~~~~~ 556 (612) ...++..+.+..+- ++...++.+..++-|...|. +...++.++ ..+....+.+-++++++..... T Consensus 1 ~~l~e~i~e~~~~l-----------~el~~~~~~~~~e~r~~~e~~~~~~~~~~~~e~~~~~~~l~~ei~~l~e~~~~~~ 69 (400) T protein:vir:38 1 MTLDEKLAAVKKQL-----------DEKRSALPAMKTELRSLLEGEDSEENLKKAEGVRAKYDKAGKEIKDLEEKRDLYE 69 (400) T ss_pred CChHHHHHHHHHHH-----------HHHHHHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 23333333322210 00000000000000000000 000000000 0001011111111110000000 Q ss_pred HHHHH------HH-HHHHHHHHHHHHHHHHHHHHHH-HHhhccccCCCchhhcCCCCCcccCCC Q lcl|NC_019408. 557 ISGSR------KL-GDPEQAKPAVADQATIDNAKKQ-TANAAKVAAQPPAPAAPGAPPTNRRPT 612 (612) Q Consensus 557 ~~~~r------~~-~~e~q~k~~~~eq~~~~~~~k~-~~~~a~~~~~~~~~~~~~~~~~~~~~~ 612 (612) ..... +. ....+.........-.....+. .....+.............+-..+.-. 
T Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 133 (400) T protein:vir:38 70 AALKGNEQSSGKKPDHPEEHSYRDALNAYLHTRGRNTDGVNFEKTDVGTFAVLRAVPTDASDAV 133 (400) T ss_pred HHHHHHhhcccccccchhhhhHHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHH Confidence 00000 00 0000000000000000000000 000000000000000000000000000 Done!