Query lcl|Aclame:protein:vir:108295|NCBI_annot:hypothetical protein|genbank:acc:YP_001552284;genbank:gi:160700609;genbank:GeneID:5758811 Match_columns 711 No_of_seqs 217 out of 332 Neff 9.4 Searched_HMMs 1612 Date Tue Dec 3 01:30:50 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_1 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_1_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:108295 Length: 711 100.0 2E-195 9E-199 1088.7 75.8 711 1-711 1-711 (711) 2 protein:vir:172 Length: 708 # 100.0 8E-159 5E-162 887.2 66.4 666 24-711 1-700 (708) 3 protein:vir:9263 Length: 725 # 100.0 2E-158 1E-161 884.9 64.9 660 25-711 1-715 (725) 4 protein:vir:3520 Length: 720 # 100.0 7E-158 4E-161 882.1 67.1 663 24-711 1-702 (720) 5 protein:vir:100920 Length: 725 100.0 3E-158 2E-161 884.5 64.5 660 25-711 1-715 (725) 6 protein:vir:105429 Length: 708 100.0 3E-157 2E-160 878.6 67.3 666 24-711 1-700 (708) 7 protein:vir:77597 Length: 725 100.0 1E-157 9E-161 880.6 65.0 662 25-711 1-715 (725) 8 protein:vir:105520 Length: 706 100.0 2E-156 1E-159 873.8 67.7 665 24-711 1-696 (706) 9 protein:vir:2764 Length: 714 # 100.0 1E-154 8E-158 864.3 72.3 662 10-711 1-707 (714) 10 protein:vir:817 Length: 714 # 100.0 1E-154 8E-158 864.3 72.3 662 10-711 1-707 (714) 11 protein:vir:10117 Length: 714 100.0 1E-154 8E-158 864.3 72.3 662 10-711 1-707 (714) 12 protein:vir:9950 Length: 714 # 100.0 1E-154 8E-158 864.3 72.3 662 10-711 1-707 (714) 13 protein:vir:3296 Length: 714 # 100.0 1E-154 8E-158 864.3 72.3 662 10-711 1-707 (714) 14 protein:vir:105619 Length: 772 100.0 3E-156 2E-159 873.5 63.0 657 8-711 1-697 (772) 15 protein:vir:104437 Length: 714 100.0 7E-153 5E-156 854.6 70.3 664 1-711 1-707 (714) 16 protein:vir:93630 Length: 776 100.0 2E-150 1E-153 841.4 63.9 666 1-711 13-715 (776) 17 protein:vir:8846 Length: 705 # 100.0 1E-87 6.3E-91 497.5 59.7 592 1-711 1-703 (705) 18 protein:vir:95821 Length: 763 100.0 2.1E-87 1.3E-90 495.8 60.4 609 1-711 1-713 (763) 19 protein:vir:80165 Length: 651 100.0 2.8E-62 1.7E-65 358.0 53.3 576 3-703 1-651 (651) 20 protein:vir:95449 Length: 584 100.0 4E-48 2.5E-51 280.4 40.1 532 1-656 1-584 (584) 21 protein:vir:345 Length: 663 # 100.0 1.1E-44 6.8E-48 261.6 45.3 591 1-709 1-663 (663) 22 protein:vir:94599 Length: 641 100.0 9.1E-44 5.6E-47 256.6 44.8 580 1-711 1-639 (641) 23 protein:vir:3139 Length: 599 # 100.0 3.5E-41 2.1E-44 242.4 32.1 558 1-670 1-599 (599) 24 protein:vir:103765 Length: 549 99.9 2.6E-22 1.6E-25 138.9 41.4 531 21-676 1-549 (549) 25 protein:vir:7321 Length: 556 # 99.9 1.1E-21 6.9E-25 135.4 43.5 542 21-688 1-556 (556) 26 protein:vir:95315 Length: 559 99.9 6.2E-22 3.8E-25 136.9 41.7 541 21-711 1-558 (559) 27 protein:vir:3361 Length: 535 # 99.9 2.1E-21 1.3E-24 133.9 42.1 524 1-704 1-535 (535) 28 protein:vir:102668 Length: 547 99.9 1.8E-21 1.1E-24 134.3 41.5 535 30-672 1-547 (547) 29 protein:vir:1538 Length: 535 # 99.9 2.5E-21 1.5E-24 133.5 41.6 517 1-665 1-535 (535) 30 protein:vir:107404 Length: 555 99.9 3E-21 1.9E-24 133.1 41.7 539 10-683 1-555 (555) 31 protein:vir:98506 Length: 555 99.9 3E-21 1.9E-24 133.1 41.7 539 10-683 1-555 (555) 32 protein:vir:107822 Length: 555 99.9 3E-21 1.9E-24 133.1 41.7 539 10-683 1-555 (555) 33 protein:vir:10447 Length: 536 99.9 1.1E-20 6.5E-24 130.1 41.7 520 1-694 1-536 (536) 34 protein:vir:96494 Length: 501 99.9 1.4E-22 8.9E-26 140.3 31.3 479 1-674 1-501 (501) 35 protein:vir:2198 Length: 536 # 99.9 1.9E-20 1.2E-23 128.7 41.4 520 1-694 1-536 (536) 36 protein:vir:1785 Length: 555 # 99.9 3.3E-20 2E-23 127.4 40.2 529 31-706 1-555 (555) 37 protein:vir:2732 Length: 501 # 99.9 3.5E-21 2.2E-24 132.7 33.9 475 1-668 1-501 (501) 38 protein:vir:4898 Length: 502 # 99.9 1.4E-21 8.7E-25 134.9 30.0 478 1-668 1-502 (502) 39 protein:vir:97171 Length: 512 99.9 1.1E-20 6.7E-24 130.1 34.1 480 1-654 1-512 (512) 40 protein:vir:94572 Length: 535 99.9 5.6E-19 3.5E-22 120.6 42.7 520 21-682 1-535 (535) 41 protein:vir:95806 Length: 440 99.8 1.3E-20 8.4E-24 129.5 32.4 427 31-651 1-440 (440) 42 protein:vir:96240 Length: 511 99.8 2E-20 1.2E-23 128.6 32.4 484 1-654 1-511 (511) 43 protein:vir:102950 Length: 471 99.8 6.6E-20 4.1E-23 125.7 35.0 452 23-653 1-471 (471) 44 protein:vir:94709 Length: 522 99.8 1.7E-18 1.1E-21 118.0 42.7 511 1-682 1-522 (522) 45 protein:vir:3609 Length: 452 # 99.8 5.7E-20 3.5E-23 126.1 34.5 446 1-665 1-452 (452) 46 protein:vir:99522 Length: 470 99.8 1.7E-19 1E-22 123.5 36.8 463 1-653 1-470 (470) 47 protein:vir:103951 Length: 511 99.8 7.1E-20 4.4E-23 125.6 34.2 484 1-654 1-511 (511) 48 protein:vir:78805 Length: 511 99.8 4.5E-20 2.8E-23 126.7 31.7 484 1-665 1-511 (511) 49 protein:vir:96366 Length: 511 99.8 4.5E-20 2.8E-23 126.7 31.7 484 1-665 1-511 (511) 50 protein:vir:9306 Length: 511 # 99.8 6.9E-20 4.3E-23 125.6 32.6 484 1-654 1-511 (511) 51 protein:vir:95113 Length: 474 99.8 7.1E-20 4.4E-23 125.6 32.6 463 1-651 1-474 (474) 52 protein:vir:9871 Length: 429 # 99.8 2.8E-19 1.7E-22 122.3 35.8 424 27-651 1-429 (429) 53 protein:vir:99672 Length: 532 99.8 2.8E-18 1.7E-21 116.8 41.0 516 1-678 1-532 (532) 54 protein:vir:105292 Length: 478 99.8 1.6E-19 1E-22 123.6 34.1 466 1-651 1-478 (478) 55 protein:vir:100039 Length: 522 99.8 1.3E-18 8.2E-22 118.6 38.4 506 33-700 1-522 (522) 56 protein:vir:3964 Length: 453 # 99.8 4.3E-19 2.6E-22 121.3 35.7 446 1-653 1-453 (453) 57 protein:vir:106639 Length: 481 99.8 3.1E-18 1.9E-21 116.6 40.1 463 1-643 6-481 (481) 58 protein:vir:99781 Length: 511 99.8 8.2E-20 5.1E-23 125.2 30.6 484 1-654 1-511 (511) 59 protein:vir:96179 Length: 468 99.8 1E-18 6.3E-22 119.2 36.5 456 1-662 1-468 (468) 60 protein:vir:93747 Length: 472 99.8 3.1E-19 1.9E-22 122.0 33.6 462 1-663 1-472 (472) 61 protein:vir:107112 Length: 478 99.8 5.7E-19 3.5E-22 120.6 35.0 466 1-663 1-478 (478) 62 protein:vir:105461 Length: 470 99.8 3E-19 1.9E-22 122.1 33.5 455 30-647 1-470 (470) 63 protein:vir:8883 Length: 543 # 99.8 2.4E-18 1.5E-21 117.2 38.4 528 1-694 1-543 (543) 64 protein:vir:96266 Length: 474 99.8 3.1E-19 2E-22 122.0 33.5 463 1-663 1-474 (474) 65 protein:vir:95899 Length: 474 99.8 3.1E-19 2E-22 122.0 33.5 463 1-663 1-474 (474) 66 protein:vir:80680 Length: 441 99.8 1.4E-18 8.5E-22 118.5 36.8 435 24-660 1-441 (441) 67 protein:vir:94101 Length: 474 99.8 7E-20 4.4E-23 125.6 28.8 454 1-651 1-474 (474) 68 protein:vir:105889 Length: 474 99.8 7E-20 4.4E-23 125.6 28.8 454 1-651 1-474 (474) 69 protein:vir:1236 Length: 483 # 99.8 4.9E-19 3.1E-22 120.9 33.4 465 1-653 1-483 (483) 70 protein:vir:97447 Length: 474 99.8 9.7E-19 6E-22 119.3 34.4 464 1-658 1-474 (474) 71 protein:vir:94498 Length: 474 99.8 9.7E-19 6E-22 119.3 34.4 464 1-658 1-474 (474) 72 protein:vir:5961 Length: 503 # 99.8 1.2E-19 7.3E-23 124.3 28.3 487 1-674 1-503 (503) 73 protein:vir:102330 Length: 451 99.8 4.4E-18 2.7E-21 115.8 36.8 438 26-643 1-451 (451) 74 protein:vir:94805 Length: 492 99.8 3.4E-18 2.1E-21 116.3 35.8 462 1-658 21-492 (492) 75 protein:vir:96988 Length: 516 99.8 4.7E-17 2.9E-20 110.1 41.8 507 20-669 1-516 (516) 76 protein:vir:9922 Length: 489 # 99.8 1.6E-18 9.9E-22 118.1 33.7 463 1-632 1-489 (489) 77 protein:vir:97336 Length: 492 99.8 6E-18 3.7E-21 115.0 35.1 463 1-672 20-492 (492) 78 protein:vir:103330 Length: 517 99.8 5.5E-17 3.4E-20 109.7 39.9 506 19-681 1-517 (517) 79 protein:vir:733 Length: 453 # 99.8 2.4E-17 1.5E-20 111.7 37.6 440 1-645 1-453 (453) 80 protein:vir:78942 Length: 510 99.8 4.4E-16 2.7E-19 104.8 42.6 498 31-671 1-510 (510) 81 protein:vir:78696 Length: 542 99.8 3.4E-17 2.1E-20 110.9 35.9 527 31-694 1-542 (542) 82 protein:vir:94546 Length: 506 99.8 1.2E-17 7.2E-21 113.4 32.9 467 1-661 1-506 (506) 83 protein:vir:96839 Length: 474 99.8 2.8E-17 1.8E-20 111.3 33.8 463 1-651 1-474 (474) 84 protein:vir:7017 Length: 515 # 99.8 6.4E-16 4E-19 103.9 40.6 505 21-669 1-515 (515) 85 protein:vir:79043 Length: 479 99.8 8.7E-17 5.4E-20 108.6 35.7 461 1-655 7-479 (479) 86 protein:vir:78537 Length: 480 99.8 7.2E-19 4.5E-22 120.0 24.0 461 21-665 1-480 (480) 87 protein:vir:6322 Length: 510 # 99.7 3.9E-15 2.4E-18 99.6 42.3 498 31-671 1-510 (510) 88 protein:vir:78227 Length: 480 99.7 3E-18 1.8E-21 116.7 24.8 461 23-665 1-480 (480) 89 protein:vir:2427 Length: 485 # 99.7 5.9E-17 3.7E-20 109.5 31.8 465 6-667 1-485 (485) 90 protein:vir:106571 Length: 499 99.7 8.6E-17 5.4E-20 108.6 31.6 482 1-663 1-499 (499) 91 protein:vir:2341 Length: 488 # 99.7 4.4E-17 2.8E-20 110.2 29.7 472 10-674 1-488 (488) 92 protein:vir:105641 Length: 516 99.7 6.6E-15 4.1E-18 98.3 41.0 506 21-669 1-516 (516) 93 protein:vir:1587 Length: 508 # 99.7 3.4E-16 2.1E-19 105.4 33.6 467 29-626 1-508 (508) 94 protein:vir:80211 Length: 514 99.7 1.3E-14 7.9E-18 96.8 42.2 498 35-662 1-514 (514) 95 protein:vir:38 Length: 496 # N 99.7 2.2E-15 1.4E-18 100.9 37.7 455 27-623 1-496 (496) 96 protein:vir:9751 Length: 422 # 99.7 1.1E-15 7.1E-19 102.5 35.7 408 27-610 1-422 (422) 97 protein:vir:94742 Length: 409 99.7 1.1E-15 7.1E-19 102.5 35.5 398 27-597 1-409 (409) 98 protein:vir:104082 Length: 485 99.7 1E-16 6.3E-20 108.3 28.7 468 6-667 1-485 (485) 99 protein:vir:78083 Length: 537 99.7 5.7E-16 3.6E-19 104.1 32.5 494 10-671 1-537 (537) 100 protein:vir:4223 Length: 486 # 99.7 7.5E-16 4.7E-19 103.5 32.4 467 6-666 1-486 (486) 101 protein:vir:79703 Length: 505 99.7 3.5E-15 2.2E-18 99.8 35.8 464 29-621 1-505 (505) 102 protein:vir:80959 Length: 499 99.7 2.6E-15 1.6E-18 100.5 34.5 465 27-626 1-499 (499) 103 protein:vir:7768 Length: 484 # 99.7 3.8E-16 2.4E-19 105.1 28.2 460 6-674 1-484 (484) 104 protein:vir:2500 Length: 501 # 99.7 6.2E-16 3.9E-19 103.9 29.4 492 6-671 1-501 (501) 105 protein:vir:9568 Length: 410 # 99.7 9.8E-15 6.1E-18 97.4 35.4 398 46-612 1-410 (410) 106 protein:vir:1634 Length: 409 # 99.7 1.2E-14 7.3E-18 97.0 35.4 398 27-597 1-409 (409) 107 protein:vir:105819 Length: 456 99.7 5.5E-15 3.4E-18 98.8 33.3 437 21-654 1-456 (456) 108 protein:vir:102602 Length: 456 99.7 5.5E-15 3.4E-18 98.8 33.3 437 21-654 1-456 (456) 109 protein:vir:78907 Length: 518 99.6 1E-14 6.4E-18 97.3 33.6 483 1-623 1-518 (518) 110 protein:vir:98883 Length: 517 99.6 4.9E-15 3E-18 99.1 31.0 471 29-628 1-517 (517) 111 protein:vir:7987 Length: 456 # 99.6 6.8E-15 4.2E-18 98.2 31.6 441 21-654 1-456 (456) 112 protein:vir:99916 Length: 504 99.6 5.2E-15 3.2E-18 98.9 30.6 461 1-647 1-504 (504) 113 protein:vir:3028 Length: 500 # 99.6 1.9E-14 1.2E-17 95.8 31.6 456 29-622 1-500 (500) 114 protein:vir:9815 Length: 500 # 99.6 1.9E-14 1.2E-17 95.8 31.6 456 29-622 1-500 (500) 115 protein:vir:99072 Length: 479 99.6 1E-13 6.4E-17 91.8 33.6 459 10-664 1-479 (479) 116 protein:vir:101494 Length: 527 99.6 1.4E-13 8.5E-17 91.1 32.6 502 1-633 3-527 (527) 117 protein:vir:102239 Length: 527 99.6 1.5E-13 9.4E-17 90.9 32.6 502 1-633 3-527 (527) 118 protein:vir:7430 Length: 563 # 99.6 3.5E-13 2.2E-16 88.9 33.9 525 1-647 3-563 (563) 119 protein:vir:8184 Length: 474 # 99.6 3.5E-13 2.2E-16 88.9 33.9 451 10-656 1-474 (474) 120 protein:vir:4782 Length: 522 # 99.5 1.9E-13 1.2E-16 90.4 29.7 487 29-632 1-522 (522) 121 protein:vir:98444 Length: 434 99.5 2.3E-13 1.5E-16 89.8 25.4 422 59-666 1-434 (434) 122 protein:vir:8846 Length: 705 # 98.5 4.7E-07 2.9E-10 55.3 29.3 613 21-711 1-677 (705) 123 protein:vir:3520 Length: 720 # 98.4 7.7E-07 4.8E-10 54.1 32.9 628 28-711 1-695 (720) 124 protein:vir:103385 Length: 666 98.3 1.2E-07 7.4E-11 58.5 15.0 584 1-673 1-666 (666) 125 protein:vir:96403 Length: 666 98.2 1.4E-07 8.6E-11 58.2 14.2 584 1-673 1-666 (666) 126 protein:vir:172 Length: 708 # 98.2 2.3E-06 1.4E-09 51.5 32.2 584 1-711 76-705 (708) 127 protein:vir:95821 Length: 763 98.2 3E-06 1.9E-09 50.9 35.7 590 1-711 54-724 (763) 128 protein:vir:100920 Length: 725 97.9 1.1E-05 6.9E-09 47.7 36.0 629 46-711 1-678 (725) 129 protein:vir:94956 Length: 452 97.7 3E-05 1.9E-08 45.4 29.5 443 21-622 1-452 (452) 130 protein:vir:108295 Length: 711 97.7 3E-05 1.9E-08 45.3 38.6 621 14-711 1-704 (711) 131 protein:vir:9263 Length: 725 # 97.5 6.1E-05 3.8E-08 43.7 38.5 624 46-711 1-678 (725) 132 protein:vir:78393 Length: 489 97.3 9.6E-05 6E-08 42.6 31.3 471 10-627 1-489 (489) 133 protein:vir:77597 Length: 725 97.0 0.00024 1.5E-07 40.4 38.2 613 46-711 1-678 (725) 134 protein:vir:95014 Length: 491 96.5 0.00061 3.8E-07 38.2 31.4 475 10-629 1-491 (491) 135 protein:vir:105520 Length: 706 96.3 0.00077 4.8E-07 37.7 36.5 632 28-711 1-692 (706) 136 protein:vir:9950 Length: 714 # 96.2 0.00086 5.3E-07 37.4 24.2 606 15-711 1-699 (714) 137 protein:vir:817 Length: 714 # 96.2 0.00086 5.3E-07 37.4 24.2 606 15-711 1-699 (714) 138 protein:vir:10117 Length: 714 96.2 0.00086 5.3E-07 37.4 24.2 606 15-711 1-699 (714) 139 protein:vir:2764 Length: 714 # 96.2 0.00086 5.3E-07 37.4 24.2 606 15-711 1-699 (714) 140 protein:vir:3296 Length: 714 # 96.2 0.00086 5.3E-07 37.4 24.2 606 15-711 1-699 (714) 141 protein:vir:95149 Length: 501 95.5 0.002 1.3E-06 35.3 32.6 462 21-626 1-501 (501) 142 protein:vir:97265 Length: 513 94.9 0.0033 2E-06 34.2 27.6 470 1-636 1-513 (513) 143 protein:vir:80453 Length: 535 94.3 0.0048 3E-06 33.3 33.2 489 1-657 1-535 (535) 144 protein:vir:104437 Length: 714 94.0 0.0056 3.5E-06 32.9 25.2 600 15-711 1-699 (714) 145 protein:vir:93630 Length: 776 92.5 0.011 6.9E-06 31.3 29.7 630 1-711 4-706 (776) 146 protein:vir:105429 Length: 708 91.9 0.014 8.5E-06 30.8 37.1 631 28-711 1-693 (708) 147 protein:vir:96783 Length: 488 86.7 0.044 2.7E-05 28.0 31.7 452 1-610 14-488 (488) 148 protein:vir:80128 Length: 466 73.4 0.17 0.00011 24.8 13.8 113 591-711 1-127 (466) 149 protein:vir:80128 Length: 466 72.0 0.19 0.00012 24.6 11.7 92 618-711 1-94 (466) 150 protein:vir:962 Length: 397 # 66.3 0.27 0.00017 23.7 12.0 113 591-711 1-116 (397) 151 protein:vir:1084 Length: 437 # 65.2 0.29 0.00018 23.6 13.8 118 591-711 1-128 (437) 152 protein:vir:1084 Length: 437 # 60.3 0.37 0.00023 22.9 13.4 101 601-711 1-120 (437) 153 protein:vir:100884 Length: 389 49.7 0.63 0.00039 21.7 10.1 95 600-711 1-106 (389) 154 protein:vir:100884 Length: 389 41.9 0.91 0.00056 20.8 9.1 74 631-711 1-74 (389) 155 protein:vir:962 Length: 397 # 40.8 0.96 0.00059 20.7 13.3 124 586-711 1-127 (397) 156 protein:vir:93881 Length: 387 30.6 1.6 0.00097 19.5 8.6 91 621-711 1-114 (387) 157 protein:vir:96978 Length: 387 28.0 1.8 0.0011 19.2 8.6 91 621-711 1-114 (387) 158 protein:vir:2685 Length: 387 # 28.0 1.8 0.0011 19.2 8.6 91 621-711 1-114 (387) 159 protein:vir:94424 Length: 387 28.0 1.8 0.0011 19.2 8.6 91 621-711 1-114 (387) 160 protein:vir:78641 Length: 278 26.0 2 0.0012 18.9 21.2 270 120-530 1-278 (278) 161 protein:vir:1383 Length: 421 # 23.3 2.3 0.0014 18.6 8.2 80 618-711 1-81 (421) 162 protein:vir:105619 Length: 772 22.9 2.4 0.0015 18.5 29.5 601 30-711 1-690 (772) No 1 >protein:vir:108295 Length: 711 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552284;genbank:gi:160700609;genbank:GeneID:5758811 Probab=100.00 E-value=1.5e-195 Score=1088.67 Aligned_cols=711 Identities=100% Similarity=1.403 Sum_probs=694.1 Q ss_pred CCcCCCCCCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCCc Q lcl|Aclame:pro 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRPC 80 (711) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p~ 80 (711) |+||+|+||++++|.+++|..++++++++++|.++++||+++++++++||+++.+|++||+|+||++++++.|+.+|||| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~ 80 (711) T protein:vir:10 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRPC 80 (711) T ss_pred CCcccccccccchhHHHHHhcccCcchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHhCCCCCCHHHHHHHHhcCCCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHH Q lcl|Aclame:pro 81 LVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYD 160 (711) Q Consensus 81 ~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~ 160 (711) +|||+|+|+|++|+|++++||++|+|+||+.....++..++..+.+.+.+++++|.++|++|+++++++++.|+++++++ T Consensus 81 ~~~N~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s 160 (711) T protein:vir:10 81 LVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYD 160 (711) T ss_pred EEEcchHHHHHHHhhhHhhCCcceEEecccccchhhhhhhhccccccccCCChhHHHHHHHHHHHHHHHHHhcChhHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCccceeeCCCccccCccccceeeeeecCCHHHHHHhcCCcccch Q lcl|Aclame:pro 161 IAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEP 240 (711) Q Consensus 161 ~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~ 240 (711) ++|+++++||+||++|++||..+++++++|+|.+|++|++|||||+|+++|+|||+|+|+++|||+++++++||+++.++ T Consensus 161 ~af~d~~~~G~G~~ev~~d~~~~d~~~~e~~i~~v~~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~yp~~a~~~ 240 (711) T protein:vir:10 161 IAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEP 240 (711) T ss_pred HHHHHhhhcCcceEEEEecccCCCCCCCCeEEeeecChhheeeCccccccChhhhcceeeeecCCHHHHHHhCCchhhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEEecC Q lcl|Aclame:pro 241 VYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGA 320 (711) Q Consensus 241 ~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~ 320 (711) +..++..+++.|+++++|||+|||+++++..+++.+.+|++++++..++.++.++..|...+..+.+++++|+|++|+|+ T Consensus 241 ~~~~~~~~~~~~~~~~~vrv~E~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~G~ 320 (711) T protein:vir:10 241 VYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGA 320 (711) T ss_pred hhcccccccCcccCcceeeEEEEEeeeeeeeEEEeecCCceeccCcchhHHHHHHhcCchhhhhhhhceeeEEEEEEecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHHHHh Q lcl|Aclame:pro 321 NVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWE 400 (711) Q Consensus 321 ~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~ 400 (711) ++|++++||+|++||||||||++.++++++++||+||.|+|+|+++|+++|+++|++++++++++++++|+|++.++.|. T Consensus 321 ~~L~~~~p~~~~~~P~vp~~g~r~~~d~~~~~~G~vr~~~d~Qr~~N~~~s~~~~~l~~~~~~~~~~~~gai~~~~~~~~ 400 (711) T protein:vir:10 321 NVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWE 400 (711) T ss_pred eeecCCCCCCCCcccEEEEeeeeeccccccccchhhhhhhhhHHHHHHHHHHHHHHHHhcCCCceeecCcccCChHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hcccCCCceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 401 QANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSF 480 (711) Q Consensus 401 ~~~~~~~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~ 480 (711) +.+.+||++|+++|+++++.+|++++++++|+++++|++++.+.|+++|||+++++|..+|++||+||++++++|++++. T Consensus 401 e~~~~~~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGi~~~~~G~~~n~~Sg~ai~~~q~qg~~~l~ 480 (711) T protein:vir:10 401 QANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSF 480 (711) T ss_pred hccccCCCeeEecccccCcCCccccCCCCCCHHHHHHHHHHHHHHHHHhCCChHHcCCCccchHHHHHHHHHHHHHHHHH Confidence 99999999999999999888999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEeecccChHH Q lcl|Aclame:pro 481 AFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFAT 560 (711) Q Consensus 481 ~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s 560 (711) +++|||+++++++|+++|+||++||+++|+|||+|++++.++|.||....++.+|..+++||+++++|||+|+++|++++ T Consensus 481 ~~~dn~~~~~~~~g~~ll~li~~~~~~er~~rI~ged~~~~~v~ln~~~~~~~~G~~~~~nDi~~g~~Dv~i~~~p~~~s 560 (711) T protein:vir:10 481 AFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFAT 560 (711) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEEecCCCCcceEEecccccccccccceeeeccceeeeEEEEeeccCchh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 561 QRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAK 640 (711) Q Consensus 561 ~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~ 640 (711) +|++.+..|+++++++|+.++++++++++++|+|+++++.+.+++..+++.+.++...+.++.+++++++..+++.++++ T Consensus 561 ~r~~~~~~l~ql~~~~p~~~~~~~~~il~~~d~p~~~el~e~lr~~~~~~~~~~~~~~~~qq~~~e~qq~~~~~q~~~~~ 640 (711) T protein:vir:10 561 QRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAK 640 (711) T ss_pred HHHHHHHHHHHHHhhcchhhhHHHHHHHHhcCCCCHHHHHHHHHhhcCcccCcchhhhHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999888888888888888888888888889999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccC Q lcl|Aclame:pro 641 SQADMAQAEADTAQAQADMLKAQLETEEAQKQLAMIEDMAQGGDVVYQQVRELVAQALAEITASQANVTEQ 711 (711) Q Consensus 641 ~q~~~~k~qae~~~aqae~~~~q~~~~~~q~q~~~~~~~~q~~~~~~~~~~~~~~~~~~e~~~~qa~~e~Q 711 (711) +++...+++++.+++++++.++++++.+++++++.+..+++..++.++++.+++++.++++++.|+++++| T Consensus 641 ~q~~~~qa~ae~~~Aqae~~qa~~e~~~~q~q~~~~~~~aq~~~~~~qq~~~~l~~~qaelq~~q~~~~q~ 711 (711) T protein:vir:10 641 SQADMAQAEADTAQAQADMLKAQLETEEAQKQLAMIEDMAQGGDVVYQQVRELVAQALAEITASQANVTEQ 711 (711) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 99999999999999999999999999999999998888898999999999999999999999999999999 No 2 >protein:vir:172 Length: 708 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112077;genbank:gi:13559867;genbank:GeneID:920970 Probab=100.00 E-value=8.5e-159 Score=887.18 Aligned_cols=666 Identities=25% Similarity=0.337 Sum_probs=552.1 Q ss_pred CcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHH--HHhCCCCCCHHHHHHHHHhC----CCceEehhhHHHHHHHhhhh Q lcl|Aclame:pro 24 NNDDDRALLATARERARDGATYWKDNWEAAEDDL--KFLGGEQWPSQVRTERELEQ----RPCLVNNVLPTFVDQVLGDQ 97 (711) Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~--~~y~G~Qw~~~~~~~~~~~g----~p~~~~N~i~~~v~~i~g~~ 97 (711) =.+...++|.+++++|+++++++++||.++.+|. +||+|+||+++++++|+++| |||+|||+|+|+|++|+|++ T Consensus 1 ma~~~~~~~~~~~~r~~~~~~~~~~~r~~~~~d~~f~~y~G~Qw~~~~~~~l~~~~q~~~rP~~~~N~i~~~i~~v~g~e 80 (708) T protein:vir:17 1 MAETLEKKHERIMLRFDRAYSPQQEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEY 80 (708) T ss_pred CchhHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhccCCCCCCHHHHHHHHhhhhhcCCCceEEcchHHHHHHHHhhH Confidence 2245557899999999999999999999998886 56999999999999998765 79999999999999999999 Q ss_pred hhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCccEEEEE Q lcl|Aclame:pro 98 RQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESGMGYLRVR 177 (711) Q Consensus 98 ~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~ 177 (711) ++||++++|+|+. .++|.++|++||++++++++.|++++++|+||+++++||+||++|+ T Consensus 81 ~~nr~d~~v~p~~---------------------~~~d~~~Ae~l~~l~~~~~~~~~~~~~~s~Af~~~i~~G~G~~~~~ 139 (708) T protein:vir:17 81 RNNRITVKFRPGD---------------------REASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLT 139 (708) T ss_pred hhCCcceEEecCC---------------------CcchHHHHHHHHHHHHHHHHhcCchhHHhHHHHHhhhcccceeeee Confidence 9999999999984 3578999999999999999999999999999999999999999999 Q ss_pred EeeccCC---CCCCcceEEEec-CccceeeCCCccccCccccceeeeeecCCHHHHHHhcCCcccchhhcccccc-cccC Q lcl|Aclame:pro 178 SDYLADD---SFEQDLIIEAIQ-NQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSVAD-YDTW 252 (711) Q Consensus 178 ~d~~~~~---~~~~~i~i~~v~-~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~-~~~~ 252 (711) ++|.+++ .+..+|.|.++. +|++|||||+|+++|+|||+|+|+++|||+++++++||+++...+.....++ .+.| T Consensus 140 ~d~~~e~d~~~~~~~i~i~~~~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~~~~~ 219 (708) T protein:vir:17 140 SMLVNEYDPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPASLDVTSMTSWEYDW 219 (708) T ss_pred ecccccCCCCCCccccceEeeccchhheecCccccccChhhhhhhhhhccCCHHHHHHhCccccchhhhhhhhccccccc Confidence 9998764 356778887765 4589999999999999999999999999999999999998766554433333 3579 Q ss_pred CCCCeEEEEEeeeeeeeceeEEEccC---CcEEEec--CcchhHHHHHhcCchhhhhcccceEEEEEEEEecCceeccCc Q lcl|Aclame:pro 253 FTEKSVRVSEYFTREPVIREIALLSD---GRSFWLD--ALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGPV 327 (711) Q Consensus 253 ~~~~~v~v~E~~~~~~~~~~~~~~~~---~~~~~~~--~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~le~~~ 327 (711) +++++|||+|||++.++...++.+.+ |..+.++ .....+..+...|...+..+.+++++|+|+.+.|+.+|++++ T Consensus 220 ~~~d~vrv~e~~~r~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~l~~~~ 299 (708) T protein:vir:17 220 FDADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFQEVARRSVKRRRVYVSVVDGDGFLEKPR 299 (708) T ss_pred cCCCeEEEEEEEEEeeeeeEEEEEecCccCceeeeCccchhhHHHHHHhcccccceeeeeeEEEEEEEeecccccccCCC Confidence 99999999999999999998887755 4444444 445666677778888889999999999999999999999999 Q ss_pred cCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHHHHhhcccCCC Q lcl|Aclame:pro 328 EIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNF 407 (711) Q Consensus 328 p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~~~~~~~~ 407 (711) |+||++||||||||++.++++.+++||+||.|+|+|+++|+++|+++|+++++++.+++++.+++.+.+..|.+.+...+ T Consensus 300 ~~p~~~fP~vP~~g~r~~~d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~a~~g~~~~~~~~~~~~~ 379 (708) T protein:vir:17 300 RIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRP 379 (708) T ss_pred CCCCCccceEEEecccccccCCCcccchhhhchhHHHHHHHHHHHHHHHHHhcCCcceeechhhhhhhHHhhhhcccchh Confidence 99999999999999999999999999999999999999999999999999999999999999999999888888777766 Q ss_pred ceEEecccccC-------cCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 408 SLLTYIPQYQG-------DPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSF 480 (711) Q Consensus 408 ~~i~~~~~~~~-------~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~ 480 (711) +++.+++.... ..++..++++++|+++++|++.+..+|+++|||+++++|+.+| +||+||++++++|++.++ T Consensus 380 ~~~~~~~~~~~~g~v~~~a~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGi~d~~~G~~sn-~SG~Ai~~rq~qg~~~~~ 458 (708) T protein:vir:17 380 AFLPLREVRDKYGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSN-IAQETVNNLMNRADMASF 458 (708) T ss_pred hhhhhhccCCcccccccccCCcccCCCccccHHHHHHHHHHHHHHHHhcCCChHHccCccc-hHHHHHHHHHHHHHHHHH Confidence 66666553222 1245678899999999999999999999999999999998665 899999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEeecccChHH Q lcl|Aclame:pro 481 AFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFAT 560 (711) Q Consensus 481 ~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s 560 (711) +++|||+.+++++|+++|+||++|||++|+|||+|++|+.+++.+|....++.+|..+++||+++|+|||+|+++|++++ T Consensus 459 ~~~Dnl~~~~~~~g~~lL~lI~~~y~~~R~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~~Dv~v~~~p~~~t 538 (708) T protein:vir:17 459 IYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTA 538 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceeeecceeccCCCccceeeccceeeeeeEEEecccCchh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhhcch---hHHHHHHHHHHhcCCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHH-HHHHHHH Q lcl|Aclame:pro 561 QRIEAAEAMIQFAQAVPS---AAAVMADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTE-PTPEQQV 636 (711) Q Consensus 561 ~r~~~~~~L~~l~~~~p~---~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~-~~~~~q~ 636 (711) +|++.++.|+++++++|. ..+.++++++++||+|+++++.+++++..+++...++..++.+++.+++++ ++.+.++ T Consensus 539 ~r~~~~~~l~qll~~~~~~~~~~~~~~~l~l~~~D~p~~~ei~e~ir~~~~~~~~~~~~~~e~~q~~~q~qq~~q~q~~~ 618 (708) T protein:vir:17 539 RRDATVSVLTNVLSSMLPADPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNP 618 (708) T ss_pred HHHHHHHHHHHHHHhcCCccchhHHHHHHHHHhcCCCChHHHHHHHHHHhhccccccCcchhhHHHHHHHHHHHHHHHHH Confidence 999999999999998764 355677889999999999999999999888877766655554444433333 3333445 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc Q lcl|Aclame:pro 637 EMAKSQADMAQAEADTAQAQADMLKAQLETEEAQKQLAMIED-------MAQGGDVVYQQVRELVAQALAEITASQANVT 709 (711) Q Consensus 637 ~~~~~q~~~~k~qae~~~aqae~~~~q~~~~~~q~q~~~~~~-------~~q~~~~~~~~~~~~~~~~~~e~~~~qa~~e 709 (711) ++.+++++..+++|++++++++..++++++.+++.+..+... .+...+...+....++-+..+.....++.+. T Consensus 619 ~~~eaqa~~~~~qAe~~ka~aea~~~q~~a~q~~~~~~~a~~~a~q~~~q~~~~~~~~~~~~~~~l~~~q~~q~q~~~a~ 698 (708) T protein:vir:17 619 EMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVMEAIRLLKDVAESQQQQFQSP 698 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHhcc Confidence 555677777777777777777766666665544433322221 1111111111111111111111122222222 Q ss_pred cC Q lcl|Aclame:pro 710 EQ 711 (711) Q Consensus 710 ~Q 711 (711) .| T Consensus 699 p~ 700 (708) T protein:vir:17 699 PQ 700 (708) T ss_pred cc Confidence 22 No 3 >protein:vir:9263 Length: 725 # NCBI annotation: 1 # Family: family:all:487 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720327;genbank:gi:24371585;genbank:GeneID:955785 Probab=100.00 E-value=2.3e-158 Score=884.86 Aligned_cols=660 Identities=20% Similarity=0.286 Sum_probs=529.7 Q ss_pred cchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCCceEehhhHHHHHHHhhhhhhcccce Q lcl|Aclame:pro 25 NDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRPCLVNNVLPTFVDQVLGDQRQNRPAI 104 (711) Q Consensus 25 ~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~v~~i~g~~~~~r~~~ 104 (711) =.+++.+|++++.||+++++++.+||.++.+|++||+|+||+++++++|+.+|+| +||+|+|+|++|+|++++||++| T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~q~rp--~~N~i~~~i~~v~g~e~~nr~d~ 78 (725) T protein:vir:92 1 MADNENRLESILSRFDADWTASDEARREAKNDLFFSRISQWDDWLSQYTTLQYRG--QFDVVRPVVRKLVSEMRQNPIDV 78 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCC--cccchHHHHHHHHhhHHhCCcce Confidence 2235667999999999999999999999999999999999999999999999998 48999999999999999999999 Q ss_pred eEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCccEEEEEEeeccCC Q lcl|Aclame:pro 105 KVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESGMGYLRVRSDYLADD 184 (711) Q Consensus 105 ~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~ 184 (711) +|+|++ ++|.++|++||++++++++.|++++++|++|+++++||+||++|++||.+++ T Consensus 79 ~v~P~~----------------------~~d~~~Ae~l~~~~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d 136 (725) T protein:vir:92 79 LYRPKD----------------------GASPDAADVLMGMYRTDMRHNTAKIAVNVAVREQIESGVGAWRLVTDYEDQS 136 (725) T ss_pred EEecCC----------------------ccHHHHHHHHHHHHHHHHHhhCchHHHHHHHHHHhhcCcceeeeeecccCCC Confidence 999974 7999999999999999999999999999999999999999999999999999 Q ss_pred CCCCcceEEE--ecC-ccceeeCCCccccCccccceeeeeecCCHHHHHH---hcCCcccchhhc-ccccccccCCCCCe Q lcl|Aclame:pro 185 SFEQDLIIEA--IQN-QFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKA---LYPDATAEPVYE-DSVADYDTWFTEKS 257 (711) Q Consensus 185 ~~~~~i~i~~--v~~-~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~---~~p~~~~~~~~~-~~~~~~~~~~~~~~ 257 (711) +|+++++|+. |++ +.+|||||+|+++|+|||+|+|+++|||+++++. .||....+.... ......+.|+++++ T Consensus 137 ~~~~~~~i~~~~i~~~~~~V~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 216 (725) T protein:vir:92 137 PTSNNQVIRREPIHSACSHVIWDSNSKLMDKSDSRHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQDT 216 (725) T ss_pred CCCCceeeEEeeccCChhhcccCchhhccChhhHHHHHHHhcCCHHHHHHHHhhcCcchhhhhhcccCCcccccccCCCe Confidence 9998887764 444 5569999999999999999999999999986665 555444333222 22233467899999 Q ss_pred EEEEEeeeeeeeceeEEEccC---CcEEEecC--cchhHHHHHhcCchhhhhcccceEEEEEEEEecCceeccCccCCCC Q lcl|Aclame:pro 258 VRVSEYFTREPVIREIALLSD---GRSFWLDA--LEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGPVEIPST 332 (711) Q Consensus 258 v~v~E~~~~~~~~~~~~~~~~---~~~~~~~~--~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~le~~~p~~~~ 332 (711) |||+|||++.++...++.+.+ |.++.++. +.+.++.+...|...+..+.+++++|+|++++|+++|++++||||+ T Consensus 217 vrv~e~~~r~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~~~~~~~~~ 296 (725) T protein:vir:92 217 IQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGE 296 (725) T ss_pred EEEEEEEEEEEEeeeEEeecCCCCCceeecChhhHHHHHHHHhccCchhhhhccceeeeEeeeeecchhhhcCCCCCCCC Confidence 999999999999888876654 55555543 4466778888999999999999999999999999999999999999 Q ss_pred ccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHHHHhhcccCCCceEEe Q lcl|Aclame:pro 333 TIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNFSLLTY 412 (711) Q Consensus 333 ~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~~~~~~~~~~i~~ 412 (711) +||||||||++.++++..++||+||.|+|+|+++|+++|+++|+++++++++++++++++++.++.|+..+.. .++.+ T Consensus 297 ~~P~vP~~g~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~--~~~~~ 374 (725) T protein:vir:92 297 HIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDY--PYYLL 374 (725) T ss_pred ceeeEEEEeeeeccCCcccccceeccchhHHHHHHHHHHHHHHHHHhccCcccccchhhhhHHHHHHhccCcc--ceeec Confidence 9999999999999999999999999999999999999999999999999999999999999887777654433 33333 Q ss_pred -----cccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 413 -----IPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLT 487 (711) Q Consensus 413 -----~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~ 487 (711) ++|.....+|++.+++++|+++++||+.+.++|+++|||+++++|..+|++||++|++++++|++.+++++|||+ T Consensus 375 ~~~~~~~g~~~~~~i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~l~~~~Dnl~ 454 (725) T protein:vir:92 375 NRTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLA 454 (725) T ss_pred cccccccccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhccCchhhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 344555567899999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEeecccChHHHHHHHHH Q lcl|Aclame:pro 488 KSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAE 567 (711) Q Consensus 488 ~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~ 567 (711) .+++++|+++|+||++|||++|++||+|++|..+++.||....++.+|..+++||+ +|+|||+|+++|+++|+|++++. T Consensus 455 ~~~~~~g~~lL~lI~~~~~~~r~~RI~~edg~~~~v~in~~~~~~~~G~~~~~Ndi-~g~~Dv~v~~~p~~~s~r~~~~~ 533 (725) T protein:vir:92 455 TAMRRDGEIYQSIVNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDI-RGRYECYTDVGPSFQSMKQQNRA 533 (725) T ss_pred HHHHHHHHHHHHHHHHhcCCCcEEEEecCCCCcceEEeccccccccccchhhhhcc-ccceeeEEeeccChHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999 48999999999999999999999 Q ss_pred HHHHHHhhcchhHHH---HHHHHHHhcCCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHH-HHHHHHHHHHHHHHHH Q lcl|Aclame:pro 568 AMIQFAQAVPSAAAV---MADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQ-TEPTPEQQVEMAKSQA 643 (711) Q Consensus 568 ~L~~l~~~~p~~~~~---~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~-q~~~~~~q~~~~~~q~ 643 (711) .|+++++++|+..++ ++..+++++++|+++++.+++++..+++...++...+.++..+++ +.+..+++....++++ T Consensus 534 ~l~ql~~~~~~~~~~~~~~l~~~~~~~d~~~~~e~~erirkq~~~~~~~~~~~~e~~q~~~~~qqa~~~q~~~e~~~~qa 613 (725) T protein:vir:92 534 EILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQG 613 (725) T ss_pred HHHHHHHhcccchhHHHHHHHHHhhcccchHHHHHHHHHHhhhchhccCCccchhhhHHHHHHHHHHHhhhHHHHHHHHH Confidence 999999999876544 344557889999999999999988877666555443333332222 2222233333444444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----H-HHH------HHHHHHHHHHHHH-------HHHHH----- Q lcl|Aclame:pro 644 DMAQAEADTAQAQADMLKAQLETEEAQKQLAMIED----M-AQG------GDVVYQQVRELVA-------QALAE----- 700 (711) Q Consensus 644 ~~~k~qae~~~aqae~~~~q~~~~~~q~q~~~~~~----~-~q~------~~~~~~~~~~~~~-------~~~~e----- 700 (711) ..+++++++++++++..++++++.+.+.+++...+ . .+. ...+..+..+.++ +..++ T Consensus 614 ~~~~~qae~~kaqaE~~k~q~~a~~~~~~a~~~aa~~~~~~~q~~~~q~~~~~~~~~~~~~~q~~~~~~a~~~ae~~l~~ 693 (725) T protein:vir:92 614 VLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDARANAELLLKG 693 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHhchHHHHH Confidence 44444444444444444443333222222111000 0 000 0000000111111 00111 Q ss_pred --------H---HHHHhhhccC Q lcl|Aclame:pro 701 --------I---TASQANVTEQ 711 (711) Q Consensus 701 --------~---~~~qa~~e~Q 711 (711) + ++.+++..+| T Consensus 694 ~~~~~~~~~d~~~~~~~~~~~~ 715 (725) T protein:vir:92 694 NEQTHKQRMDIANILQSQRQNQ 715 (725) T ss_pred HHHHHHHHHHHHHHhcchhccC Confidence 0 1111111111 No 4 >protein:vir:3520 Length: 720 # NCBI annotation: P19 # Family: family:all:487 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050980;genbank:gi:9633566;genbank:GeneID:1262313 Probab=100.00 E-value=7.1e-158 Score=882.12 Aligned_cols=663 Identities=21% Similarity=0.309 Sum_probs=554.0 Q ss_pred CcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhC--CCCCCHHHHH----HHHHhCCCceEehhhHHHHHHHhhhh Q lcl|Aclame:pro 24 NNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLG--GEQWPSQVRT----ERELEQRPCLVNNVLPTFVDQVLGDQ 97 (711) Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~--G~Qw~~~~~~----~~~~~g~p~~~~N~i~~~v~~i~g~~ 97 (711) =.+..+++|.+++.+|+++++++++||+++.+|++||+ |+||++++++ .++.+|+||+|||+|+|+|++|+|++ T Consensus 1 ma~~~~~~l~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~~~~~l~~~~~P~~~~N~i~~~v~~v~g~~ 80 (720) T protein:vir:35 1 MAETLQKRHEQIMRKFDRAHSPQEAVREKCLEATRFARVPGGQWEGATAAGSELGKHFEKYPKFEINKISTELNRIISEY 80 (720) T ss_pred CchHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhhccCCCCCCHHHHHHHHHHHhhCCCCeEEEccHHHHHHHHHhHH Confidence 34788899999999999999999999999999999984 9999999988 56678999999999999999999999 Q ss_pred hhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCccEEEEE Q lcl|Aclame:pro 98 RQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESGMGYLRVR 177 (711) Q Consensus 98 ~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~ 177 (711) ++||++++|+|+. .++|.++|++||++++++++.|++++++|++|+++++||+||++|+ T Consensus 81 ~~nr~d~~v~P~~---------------------~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~G~~~v~ 139 (720) T protein:vir:35 81 RHNRITVKFRPGD---------------------KTASEALANKLNGLFRADYEETDGGEACDNAFDDGSTGGFGCFRLT 139 (720) T ss_pred HhCCCceEEEcCC---------------------CcchHHHHHHHHHHHHHHHHhcCchHHHhHHHHHhhhccceeEEee Confidence 9999999999984 3578999999999999999999999999999999999999999999 Q ss_pred EeeccCCC---CCCcceEEEecC-ccceeeCCCccccCccccceeeeeecCCHHHHHHhcCCcccchhhcccccccccCC Q lcl|Aclame:pro 178 SDYLADDS---FEQDLIIEAIQN-QFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSVADYDTWF 253 (711) Q Consensus 178 ~d~~~~~~---~~~~i~i~~v~~-~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~ 253 (711) +||++++. +.++|++++|++ +++|||||+|+++|+|||+|+|+++|||+++++++||+++.........+.++.|+ T Consensus 140 ~d~~~~~d~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~~d~~ 219 (720) T protein:vir:35 140 TNLVNALDPMDERQRICLEPIYDPARSVWFDPDAKKYDKSDAEWAFCMYSLSAEKYKAEYNKDPATLMSGIERSWDYDWY 219 (720) T ss_pred ecccccCCCCcccceeeEecccCchhheeecccccccChhhhhhhhhhcCCCHHHHHHhCCCcccccccccccccccccc Confidence 99977644 345778888754 57999999999999999999999999999999999999998877777777788999 Q ss_pred CCCeEEEEEeeeeeeeceeEEEccC---CcEEEecCcc--hhHHHHHhcCchhhhhcccceEEEEEEEEecCceeccCcc Q lcl|Aclame:pro 254 TEKSVRVSEYFTREPVIREIALLSD---GRSFWLDALE--DIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGPVE 328 (711) Q Consensus 254 ~~~~v~v~E~~~~~~~~~~~~~~~~---~~~~~~~~~~--~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~le~~~p 328 (711) +++.|+++|||+++++...++.+.+ |..+.++... ..+.++...|...+..+.+++++|+|++++|+.+|++++| T Consensus 220 ~~~~v~i~E~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~v~~~~~~g~~~l~~~~~ 299 (720) T protein:vir:35 220 DVDVVYIAKYYEVKKESVDVVSFQNPLTSETVTYDSDQLELVEDELADIGFIEAARRTIKRRRVYVSVVDGEGFLEKAQR 299 (720) T ss_pred CCCceEEEEeeEEEEEEEEEEEeecCCCCCeeecCCccHHHHHHHHhhhccccccccceeEEEEEEEeeccchhcccCCC Confidence 9999999999999999888776544 5556655443 4666777777777788888999999999999999999999 Q ss_pred CCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHH---HHhhcccC Q lcl|Aclame:pro 329 IPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGRED---EWEQANTK 405 (711) Q Consensus 329 ~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~---~~~~~~~~ 405 (711) +||++||||||||++.++++.+++||+||.|||+|+++|+++|+++|+++++ +.+++.|++++.+. .|...+.. T Consensus 300 ~p~~~fP~vP~~g~r~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~---~~~~~~~a~~~~~~~~~~~a~~~~~ 376 (720) T protein:vir:35 300 IPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDAQRLYNLQVSMLADSATQD---TGSIPIVGKSQIKTLEKYWANRNKN 376 (720) T ss_pred CCCCccceEEEEeeeeccCCCcccceeeecchhHHHHHHHHHHHHHHHHHcC---CccccccCcchHHHHHHHhhccccc Confidence 9999999999999999999999999999999999999999999999999876 66778888877654 33333333 Q ss_pred CCceEEecc-----ccc--CcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHHHHHHH Q lcl|Aclame:pro 406 NFSLLTYIP-----QYQ--GDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRG 478 (711) Q Consensus 406 ~~~~i~~~~-----~~~--~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~ 478 (711) ...++.+++ |.. ...++.+++++++|++.++|++.+..+|+++|||+++++|..+| +||+||.+++++|++. T Consensus 377 ~~~~l~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~vsGi~~~~lG~~sn-~SG~Ai~~rq~qg~~~ 455 (720) T protein:vir:35 377 RPAFLPLNEIVDKQGNIIAPPTPVGYTQPQPLNQAMAALLQQTGADIQEVTGSSQAMQPMPSN-IAKETVNHLMHRSDMS 455 (720) T ss_pred cccccccccccccCcccccCCCcccccCCCCCchHHHHHHHHHHHHHHHHhCCChHHcCcccc-hHHHHHHHHHHHHHHH Confidence 333444443 222 12467789999999999999999999999999999999998776 8999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEeecccCh Q lcl|Aclame:pro 479 SFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAF 558 (711) Q Consensus 479 ~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~ 558 (711) +.+++|||+++++++|+++|+||++|||++|+|||+|++|.++++.+|....++.+|..+++||+++|+|||+|+++|++ T Consensus 456 ~~~~~Dnl~~~~~~~g~~lL~lI~~~y~~er~~RI~~ed~~~~~v~~n~~~~d~~~g~~v~~NDi~~g~yDv~v~~~p~~ 535 (720) T protein:vir:35 456 SFIYLDNMAKSLKRAGEVWLSMAREVYGSDRQVRIVNADGTDDIALMSVVINDNQTGQVVAMNDLSSGRYDVTVDVGPSY 535 (720) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceEeechhhhccCCCceeeeecceeeeeEEEEecccCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHhhcch---hHHHHHHHHHHhcCCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHHHHHHH Q lcl|Aclame:pro 559 ATQRIEAAEAMIQFAQAVPS---AAAVMADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQ 635 (711) Q Consensus 559 ~s~r~~~~~~L~~l~~~~p~---~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q 635 (711) +|+|++++..|+++++.+|+ ....+++++++++++|+++++.+++++..+++...++...+.++..++++++.++.+ T Consensus 536 ~s~req~~~~m~qll~~~~p~~~~~~~~~~~ile~~d~p~~~e~~erirk~~~~~~~~~~~~~e~qq~~a~~qq~~qq~~ 615 (720) T protein:vir:35 536 TARRDATVSVLTNLLAGMLPQDPMRQVLQGIILDNMEGEGLDEFKEYNRKQLLTQGVVKPRNTEEEQMVAQMIQQAQQPN 615 (720) T ss_pred ccHHHHHHHHHHHHHHhcCCCchhHHHHHHHHHHhcCchhHHHHHHHHHhhcchhcccCccChhHHHHHHHHHHHHHhHh Confidence 99999999999999987653 456788889999999999999999999998888777777776666666666666777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHH-------HHHHHHHHHHH Q lcl|Aclame:pro 636 VEMAKSQADMAQAEADTAQAQADMLKAQLETEEAQKQLAMIE----DMAQGGDVVYQQVREL-------VAQALAEITAS 704 (711) Q Consensus 636 ~~~~~~q~~~~k~qae~~~aqae~~~~q~~~~~~q~q~~~~~----~~~q~~~~~~~~~~~~-------~~~~~~e~~~~ 704 (711) .++.++++++.+++++.++++++....++++.+++.++...+ +.....+...+....+ .+..+.+..++ T Consensus 616 ~e~~~aqa~l~qaqae~~kaqa~~~~~qa~a~~aqa~a~~~~a~~~~~~aq~~~~~q~~i~qalq~~~~~q~~q~~~eqa 695 (720) T protein:vir:35 616 AELVAAQGVLMQGQAEVQKAKNEELAIQVKAFQAQTEARVAEAKMVQILASADSAKRAEIREALKMLHQFQKEQGDASRA 695 (720) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcchHHHH Confidence 777788888888888888888777766666555443322111 1111111111111111 11122222222 Q ss_pred HhhhccC Q lcl|Aclame:pro 705 QANVTEQ 711 (711) Q Consensus 705 qa~~e~Q 711 (711) .+++.+. T Consensus 696 ~~el~~~ 702 (720) T protein:vir:35 696 DAELILK 702 (720) T ss_pred HHHHhhc Confidence 2222221 No 5 >protein:vir:100920 Length: 725 # NCBI annotation: Gp1 # Family: family:all:487 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006406;genbank:gi:46358698;genbank:GeneID:2777070 Probab=100.00 E-value=2.6e-158 Score=884.51 Aligned_cols=660 Identities=20% Similarity=0.287 Sum_probs=532.0 Q ss_pred cchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCCceEehhhHHHHHHHhhhhhhcccce Q lcl|Aclame:pro 25 NDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRPCLVNNVLPTFVDQVLGDQRQNRPAI 104 (711) Q Consensus 25 ~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~v~~i~g~~~~~r~~~ 104 (711) =.+++.+|++++.||+++++++.+||.++.+|++||+|+||+++++++|+.+|+| +||+|+|+|++|+|++++||++| T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~QW~~~~~~~l~~q~rp--~~N~i~~~v~~v~g~e~~nr~d~ 78 (725) T protein:vir:10 1 MADNENRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRG--QFDVVRPVVRKLVSEMRQNPIDV 78 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCC--cccchHHHHHHHHhhHHhCCcce Confidence 2235677999999999999999999999999999999999999999999999998 58999999999999999999999 Q ss_pred eEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCccEEEEEEeeccCC Q lcl|Aclame:pro 105 KVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESGMGYLRVRSDYLADD 184 (711) Q Consensus 105 ~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~ 184 (711) +|+|++ ++|.++|++||++++++++.|++++++|++|+++++||+||++|++||.+++ T Consensus 79 ~v~p~~----------------------~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~G~~ev~~d~~~~d 136 (725) T protein:vir:10 79 LYRPKD----------------------GASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQS 136 (725) T ss_pred EEecCC----------------------cchHHHHHHHHHHHHHHHHhcCcchHHhHHHHHHhhcCcceeeeeccccCCC Confidence 999974 7999999999999999999999999999999999999999999999999999 Q ss_pred CCCCcceEEEe--c-CccceeeCCCccccCccccceeeeeecCCHH---HHHHhcCCcccchhh-cccccccccCCCCCe Q lcl|Aclame:pro 185 SFEQDLIIEAI--Q-NQFSVTIDPDAKKRDRSDMNWCLIDDTMSKE---KFKALYPDATAEPVY-EDSVADYDTWFTEKS 257 (711) Q Consensus 185 ~~~~~i~i~~v--~-~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~---e~~~~~p~~~~~~~~-~~~~~~~~~~~~~~~ 257 (711) +++++++|+.+ + ||.+|||||+|+++|+|||+|+|+++||+++ +|++.||..+.+... ......++.|+++++ T Consensus 137 ~~~~~~~i~~~~i~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~ 216 (725) T protein:vir:10 137 PTSNNQVIRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWDDFAEKYDLDADNIPSFQNPNDWVFPWLTQDT 216 (725) T ss_pred CCCCceeeeeeecccCHhHcccCchhhccChhhhhhhhhhccCCHHHHHHHHHhCCCcccccccccccccccccccCCCe Confidence 99999877654 3 5778999999999999999999999999974 577789877654332 223334578999999 Q ss_pred EEEEEeeeeeeeceeEEEccC---CcEEEec--CcchhHHHHHhcCchhhhhcccceEEEEEEEEecCceeccCccCCCC Q lcl|Aclame:pro 258 VRVSEYFTREPVIREIALLSD---GRSFWLD--ALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGPVEIPST 332 (711) Q Consensus 258 v~v~E~~~~~~~~~~~~~~~~---~~~~~~~--~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~le~~~p~~~~ 332 (711) |||+|||++.++...++.+.+ |.++.++ .+...++.+...|...+..+.+++++|+|++++|+++|++++||+|+ T Consensus 217 vrv~E~~~r~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~~~~~~~~~ 296 (725) T protein:vir:10 217 IQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGE 296 (725) T ss_pred EEEEEEEEEEEEeeEEEEeccCCCCceeecchhhhHHHHHHhhcccchhhhhccceeeEEEEEeecchhhhcCCCCCCCC Confidence 999999999999988876654 5555544 34466778888999999999999999999999999999999999999 Q ss_pred ccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHHHHhhcccCCCceEEe Q lcl|Aclame:pro 333 TIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNFSLLTY 412 (711) Q Consensus 333 ~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~~~~~~~~~~i~~ 412 (711) +||||||||++.++++.+++||+||.|+|+|+++|+++|+++|+++++++++++++.+++++.++.|+..+.. .++.+ T Consensus 297 ~fP~vP~~g~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~e~~~~~~~~~--~~~~~ 374 (725) T protein:vir:10 297 HIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDY--PYYLL 374 (725) T ss_pred ceeEEEEEeeeeccCCcceeeeeeccchhHHHHHHHHHHHHHHHHHhcCCccccccHhhhhHHHHHHhccCCc--eeeec Confidence 9999999999999999999999999999999999999999999999999999999999998877777654333 33433 Q ss_pred -----cccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 413 -----IPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLT 487 (711) Q Consensus 413 -----~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~ 487 (711) ++|....++|++.+++++|+++++||+.+.++|+++|||+++++|..+|++||++|++++++|++.+++++|||+ T Consensus 375 ~~~~~~~g~~~~~~i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~l~~~~Dnl~ 454 (725) T protein:vir:10 375 NRTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLA 454 (725) T ss_pred ccccccCcccccccCcccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhCcCchhhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 345555667899999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEeecccChHHHHHHHHH Q lcl|Aclame:pro 488 KSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAE 567 (711) Q Consensus 488 ~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~ 567 (711) .+++++|+++|+||++|||++|++||+|++|..+++.||....++.+|..+++||+ +|+|||+|+++|+++|+|++++. T Consensus 455 ~~~~~~g~~lL~lI~~~~~~er~~RI~~edg~~~~v~in~~~~d~~~G~~v~~Ndi-~g~~Dv~v~~~p~~~s~r~~~~~ 533 (725) T protein:vir:10 455 TAMRRDGEIYQSIVNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDI-RGRYECYTDVGPSFQSMKQQNRS 533 (725) T ss_pred HHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceeEeccccccccccchhhhhcc-ccceeEEEeeccCcHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999 58999999999999999999999 Q ss_pred HHHHHHhhcchhHH---HHHHHHHHhcCCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHH-HHHHHHHHHHHHHHHH Q lcl|Aclame:pro 568 AMIQFAQAVPSAAA---VMADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQ-TEPTPEQQVEMAKSQA 643 (711) Q Consensus 568 ~L~~l~~~~p~~~~---~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~-q~~~~~~q~~~~~~q~ 643 (711) .|+++++++|+..+ .++..+++++++|+++++.+++++..+++...++..++.++..+++ +.++++++.+..++++ T Consensus 534 ~l~qll~~~~~~~~~~~~~l~~~~~~~d~~~~~e~~erirkq~~~~~~~~~~~~e~~q~~~e~qq~~~~q~~~e~~q~~~ 613 (725) T protein:vir:10 534 EILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQG 613 (725) T ss_pred HHHHHHHhccccchhHHHHHHHHhhcCCchhHHHHHHHHHhhhhhhccCCccccchhHHHHHHHHHHHhhhHHHHHHHHH Confidence 99999999886543 3455567889999999999999998877766555444333222222 2222333333444444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHH---HHHH---HHHHHHHH---------HHHHHHH- Q lcl|Aclame:pro 644 DMAQAEADTAQAQADMLKAQLETEEAQKQLAMIED-----MAQGG---DVVY---QQVRELVA---------QALAEIT- 702 (711) Q Consensus 644 ~~~k~qae~~~aqae~~~~q~~~~~~q~q~~~~~~-----~~q~~---~~~~---~~~~~~~~---------~~~~e~~- 702 (711) ...++++++++++++..++++++.+.+.++..... ..+.. .... .++..+.+ ..+..+. T Consensus 614 ~~~~~qae~~ka~aE~~k~~~~a~~~~~~a~~~a~~~~~~~~q~~~~q~~~~~~~~~~~~~~q~~~~~~~~~~ae~~~~~ 693 (725) T protein:vir:10 614 VLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDARANAELLLKG 693 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHH Confidence 44444555444444444443333322221111000 00000 0000 00000000 0000000 Q ss_pred ---HHHhh----------hccC Q lcl|Aclame:pro 703 ---ASQAN----------VTEQ 711 (711) Q Consensus 703 ---~~qa~----------~e~Q 711 (711) +.+.+ .++| T Consensus 694 ~~~~~~~~~~~~~~~~~q~~~~ 715 (725) T protein:vir:10 694 NEQTHKQRMDIANILQSQRQNQ 715 (725) T ss_pred HHHHHHHHhhhhhccccccccC Confidence 00111 1111 No 6 >protein:vir:105429 Length: 708 # NCBI annotation: gene 3 protein # Family: family:all:487 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958179;genbank:gi:41057281;genbank:GeneID:2716676 Probab=100.00 E-value=3.1e-157 Score=878.59 Aligned_cols=666 Identities=25% Similarity=0.342 Sum_probs=553.6 Q ss_pred CcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHh--CCCCCCHHHHHHHHHh----CCCceEehhhHHHHHHHhhhh Q lcl|Aclame:pro 24 NNDDDRALLATARERARDGATYWKDNWEAAEDDLKFL--GGEQWPSQVRTERELE----QRPCLVNNVLPTFVDQVLGDQ 97 (711) Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y--~G~Qw~~~~~~~~~~~----g~p~~~~N~i~~~v~~i~g~~ 97 (711) =.++..++|.++++||.++++++++||+++.+|++|| +|+||+++++++|+++ ||||+|||+|+|+|++|+|++ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~D~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~i~~~v~~v~g~~ 80 (708) T protein:vir:10 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEY 80 (708) T ss_pred CchhHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCCCHHHHHHHHHhhhhcCCCceEEcchHHHHHHHHHHH Confidence 2356678999999999999999999999999999888 4999999999999876 679999999999999999999 Q ss_pred hhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCccEEEEE Q lcl|Aclame:pro 98 RQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESGMGYLRVR 177 (711) Q Consensus 98 ~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~ 177 (711) ++||++++|+|++ .++|.++|++||++++++++.|+++++++++|+++++||+||++|+ T Consensus 81 ~~nr~d~~v~P~~---------------------~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~d~i~~G~Gw~~~~ 139 (708) T protein:vir:10 81 RNNRITVKFRPGD---------------------REASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLT 139 (708) T ss_pred HhCCcceEEEcCC---------------------CCchHHHHHHHHHHHHHHHHhcCchHHHHHHHHhhhhcccceeeee Confidence 9999999999984 3578999999999999999999999999999999999999999999 Q ss_pred EeeccCC---CCCCcceEEEecCc-cceeeCCCccccCccccceeeeeecCCHHHHHHhcCCcccchhhcccccc-cccC Q lcl|Aclame:pro 178 SDYLADD---SFEQDLIIEAIQNQ-FSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSVAD-YDTW 252 (711) Q Consensus 178 ~d~~~~~---~~~~~i~i~~v~~~-~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~-~~~~ 252 (711) +||+++. .+..+|.|+++++| ++|||||.|+++|+|||+|+|+++|||+++++++||+++....+.....+ .+.| T Consensus 140 ~d~~~e~d~~~~~~~i~i~~~~~p~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~p~~a~~~~d~~~~~~~~~~~ 219 (708) T protein:vir:10 140 SMLVNEYDPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNW 219 (708) T ss_pred eccccccCCCCCccccceEEeecchhhcccCccccccChhhhhhhhhccCCCHHHHHHhCCCCcccccccccCCCccccc Confidence 9997763 45667888887765 78999999999999999999999999999999999998876554443333 4678 Q ss_pred CCCCeEEEEEeeeeeeeceeEEEccC---CcEEEe--cCcchhHHHHHhcCchhhhhcccceEEEEEEEEecCceeccCc Q lcl|Aclame:pro 253 FTEKSVRVSEYFTREPVIREIALLSD---GRSFWL--DALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGPV 327 (711) Q Consensus 253 ~~~~~v~v~E~~~~~~~~~~~~~~~~---~~~~~~--~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~le~~~ 327 (711) ++.+.|+|+|||++.++...++.+.+ |..+.+ +.....+..+...|...+..+.+++++|+|++++|+.+|++++ T Consensus 220 ~~~d~v~v~ey~~r~~~~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~le~~~ 299 (708) T protein:vir:10 220 FGADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPR 299 (708) T ss_pred cCCCceEEEEeeeEEEEEEEEEEEecCCCCceeeecchhhhhHHHHHHhcccchhheeeeeeEEEEEEeecchhhhccCC Confidence 99999999999999998888876644 444444 3455677788888888899999999999999999999999999 Q ss_pred cCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHHHHhhcccCCC Q lcl|Aclame:pro 328 EIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNF 407 (711) Q Consensus 328 p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~~~~~~~~ 407 (711) ||||++||||||||++.++++.+++||+||.|||+|+++|+++|+++++++++++..++++.+++.+.+..|.+.+.... T Consensus 300 ~~p~~~fP~vP~~g~r~~~d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~~i~~~~~~~~~~~~~~~ 379 (708) T protein:vir:10 300 RIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRP 379 (708) T ss_pred CCCCCceeeEEEeeeeeccCCCcccceeecccchhHHHHHHHHHHHHHHHHhcCCcccccChhhhhhHHHHHhhccccch Confidence 99999999999999999999999999999999999999999999999999999999999999999999888998888877 Q ss_pred ceEEecccccC-------cCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 408 SLLTYIPQYQG-------DPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSF 480 (711) Q Consensus 408 ~~i~~~~~~~~-------~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~ 480 (711) +++.+++.... ..++..++++++|+++++|++.+..+|+++||++++++|+.+| +||+||.+++++|++.++ T Consensus 380 ~~~~~~~~~~~~G~~~~~~~~~~~~q~~~~~~~~~~l~q~~~~~i~~vsG~~~~~lG~~sn-~SG~aI~~rq~qg~~~l~ 458 (708) T protein:vir:10 380 AFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSN-IAQETVNNLMNRADMASF 458 (708) T ss_pred hhhccccccccccccccccCCccccCCccchHHHHHHHHHHHHHHHHHhCcChhHccCccc-hHHHHHHHHHHHHHHHHH Confidence 77766543221 2246678899999999999999999999999999999997554 899999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEeecccChHH Q lcl|Aclame:pro 481 AFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFAT 560 (711) Q Consensus 481 ~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s 560 (711) +++|||+.+++++|+++|+||++|||++|++||+|++|+.+++.+|....++.+|..+++|||++|+|||+|+++|++++ T Consensus 459 ~~~Dnl~~~~~~~g~~lL~li~~~y~~er~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~yDv~i~~~p~~~s 538 (708) T protein:vir:10 459 IYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTA 538 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceEEecceeccCCCcceeeeeccceeeEEEEEecccCchh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhhcch---hHHHHHHHHHHhcCCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHH-HHHHHH Q lcl|Aclame:pro 561 QRIEAAEAMIQFAQAVPS---AAAVMADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEP-TPEQQV 636 (711) Q Consensus 561 ~r~~~~~~L~~l~~~~p~---~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~-~~~~q~ 636 (711) +|+++++.|+++++.+|. ..+.+++++++++|+|+++++.+++++..+++.+.++..++.+++.++++++ +.+++. T Consensus 539 ~r~~~~~~l~qll~~~~p~~~~~~~~~~~~l~~~D~p~~~ei~erir~~~~~~~~~~~~~~ee~q~~~~~q~~~q~q~~~ 618 (708) T protein:vir:10 539 RRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNP 618 (708) T ss_pred HHHHHHHHHHHHHHhcCCCchhhHHHHHHHHHhcCCcChHHHHHHHHHhhcccccccccchhhHHHHHHHHHHHHHHHHH Confidence 999999999999998764 3566788899999999999999999999888777666555544444443333 333444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc Q lcl|Aclame:pro 637 EMAKSQADMAQAEADTAQAQADMLKAQLETEEAQKQLAMI-------EDMAQGGDVVYQQVRELVAQALAEITASQANVT 709 (711) Q Consensus 637 ~~~~~q~~~~k~qae~~~aqae~~~~q~~~~~~q~q~~~~-------~~~~q~~~~~~~~~~~~~~~~~~e~~~~qa~~e 709 (711) ++.+++++..++++++++++++..+.++++.+++.+..+. .+.++..+...+....++.+..+.....++... T Consensus 619 ~~~e~qa~~~~~qAe~~ka~a~a~~~~~~a~q~~~~~~~a~~~a~q~~~~a~~~~~~~~~~~~q~l~~~q~~q~~~~~~~ 698 (708) T protein:vir:10 619 EMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVMEAIRLLKDVAESQQQQFQSP 698 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHhcc Confidence 5556667777777777777776666655554444332222 111111111111111111111111112222222 Q ss_pred cC Q lcl|Aclame:pro 710 EQ 711 (711) Q Consensus 710 ~Q 711 (711) -| T Consensus 699 p~ 700 (708) T protein:vir:10 699 PQ 700 (708) T ss_pred cc Confidence 22 No 7 >protein:vir:77597 Length: 725 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:YP_063735;genbank:gi:51236726;genbank:GeneID:2944241 Probab=100.00 E-value=1.4e-157 Score=880.56 Aligned_cols=662 Identities=20% Similarity=0.285 Sum_probs=533.7 Q ss_pred cchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCCceEehhhHHHHHHHhhhhhhcccce Q lcl|Aclame:pro 25 NDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRPCLVNNVLPTFVDQVLGDQRQNRPAI 104 (711) Q Consensus 25 ~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~v~~i~g~~~~~r~~~ 104 (711) =.+++..|++++.||+++++++.+||.++.+|++||+|+||+++++++|+.+|+| +||+|+|+|++|+|++++||++| T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~q~rp--~~N~i~~~i~~v~g~~~~nr~d~ 78 (725) T protein:vir:77 1 MADNENRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRG--QFDVVRPVVRKLVSEMRQNPIDV 78 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCC--ccccHHHHHHHHHhhHHhCCcce Confidence 2345677999999999999999999999999999999999999999999999998 57999999999999999999999 Q ss_pred eEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCccEEEEEEeeccCC Q lcl|Aclame:pro 105 KVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESGMGYLRVRSDYLADD 184 (711) Q Consensus 105 ~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~ 184 (711) +|+|++ ++|.++|++||++++++++.|++++++|+||+++++||+||++|++||.+++ T Consensus 79 ~v~P~~----------------------~~d~~~Ae~l~~~~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d 136 (725) T protein:vir:77 79 LYRPKD----------------------GARPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQS 136 (725) T ss_pred EEecCC----------------------ccHHHHHHHHHHHHHHHHHhhCchhHHHHHHHHHhhcCcceeeeeecccCCC Confidence 999974 7999999999999999999999999999999999999999999999999999 Q ss_pred CCCCcceEEEe--c-CccceeeCCCccccCccccceeeeeecCCHHHHHHhcCCc---ccchhhc-ccccccccCCCCCe Q lcl|Aclame:pro 185 SFEQDLIIEAI--Q-NQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDA---TAEPVYE-DSVADYDTWFTEKS 257 (711) Q Consensus 185 ~~~~~i~i~~v--~-~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~---~~~~~~~-~~~~~~~~~~~~~~ 257 (711) +|+++++|+.+ + ||.+|||||+|+++|+|||+|+|+++|||+++++.+||.. ..+.... .....++.|+++++ T Consensus 137 ~~~~~~~i~~~~~~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 216 (725) T protein:vir:77 137 PTSNNQVIRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQDT 216 (725) T ss_pred CCCCceeeEEeecccChhhceeCchhhccChhhHHHHHHHhcCCHHHHHHHHhhCCcchhhcccccccccccccccCCCe Confidence 99999887654 2 6788999999999999999999999999999877665543 3222221 22333567999999 Q ss_pred EEEEEeeeeeeeceeEEEccC---CcEEEec--CcchhHHHHHhcCchhhhhcccceEEEEEEEEecCceeccCccCCCC Q lcl|Aclame:pro 258 VRVSEYFTREPVIREIALLSD---GRSFWLD--ALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGPVEIPST 332 (711) Q Consensus 258 v~v~E~~~~~~~~~~~~~~~~---~~~~~~~--~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~le~~~p~~~~ 332 (711) |||+|||||.++...++.+.+ |.+..++ .+.+.+..+...|...+..+.+++++|+|++++|+++|++++||+|+ T Consensus 217 vrv~E~~~r~~~~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~rv~~~~~~g~~~l~~~~~~~~~ 296 (725) T protein:vir:77 217 IQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGE 296 (725) T ss_pred eEEEEEEEEEEEeeEEEEecCCCCcceeecChhhHHHHHHHhhhcCchhhhhcccceeeeeEeeecCceeeccCCcCCCC Confidence 999999999999988877765 3444443 44466677788999999999999999999999999999999999999 Q ss_pred ccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHHHHhhcccCCCc---e Q lcl|Aclame:pro 333 TIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNFS---L 409 (711) Q Consensus 333 ~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~~~~~~~~~---~ 409 (711) +||||||||++.++++.+++||+||.|+|+|+++|+++|+++|+++++++.++++.++++++.++.|...+..+.. . T Consensus 297 ~~P~vP~~g~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~ 376 (725) T protein:vir:77 297 HIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNR 376 (725) T ss_pred ccceEEEeeeeeccCCcccccchhhhhhhHHHHHHHHHHHHHHHHHhccccccccchhhhhHHHHHHHhccCCceecccc Confidence 9999999999999999999999999999999999999999999999999999999999999888888876554321 2 Q ss_pred EEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 410 LTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTKS 489 (711) Q Consensus 410 i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~ 489 (711) +..++|....++|..++++++|+++++|++.+..+|+++|||+++++|..+|++||++|++++++|++.+++++|||+.+ T Consensus 377 ~~~~~g~~~~~~i~~~~~~~lp~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~~~~~~Dnl~~~ 456 (725) T protein:vir:77 377 TDENSGDLPTQPLAYYENPEVPQANAYMLEAATSAVKEVATLGVDTEAVNGGQVAFDTVNQLNMRADLETYVFQDNLATA 456 (725) T ss_pred cccCCCcccccCccccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 44456666667888999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEeecccChHHHHHHHHHHH Q lcl|Aclame:pro 490 IRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAM 569 (711) Q Consensus 490 ~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L 569 (711) ++++|+++|+||++|||++|++||+|+++..+++.+|....++.+|..+++||++ |+|||+|+++|+++|+|++++..| T Consensus 457 ~~~~g~~lL~lI~~~~~~~rv~RI~~ed~~~~~v~in~~~~~~~~G~~~~~NDi~-g~~Dv~v~~~p~~~s~r~~~~~~l 535 (725) T protein:vir:77 457 MRRDGEIYQSIVNDIYDVPRNVTITLEDGSEKDVQLMAEVVDLATGEKQVLNDIR-GRYECYTDVGPSFQSMKQQNRAEI 535 (725) T ss_pred HHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceeeecccccccccchhHhhhhhc-cceeeEEeeccchHHHHHHHHHHH Confidence 9999999999999999999999999999999999999999999999999999995 899999999999999999999999 Q ss_pred HHHHhhcchhHHH---HHHHHHHhcCCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHH-HHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 570 IQFAQAVPSAAAV---MADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQ-TEPTPEQQVEMAKSQADM 645 (711) Q Consensus 570 ~~l~~~~p~~~~~---~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~-q~~~~~~q~~~~~~q~~~ 645 (711) +++++++|+..+. ++..+++++++|+++++.+++++..+++...++..+..++..+++ ++++.+++..+.++++.. T Consensus 536 ~qll~~~~~~~~~~~~~l~~~~~l~d~~~~~e~~erirkq~~~~~~~q~~~~~e~q~~~~~qq~~~~q~~~e~~q~q~~~ 615 (725) T protein:vir:77 536 LELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQGVL 615 (725) T ss_pred HHHHHhccccchhHHHHHHHhhccccchHHHHHHHHHHhhhhhhhccCCCChhhHHHHHHHHHHHHHhHHHHHHHHHHHH Confidence 9999998865544 444557788999999999999998887766555544333322222 222223333344444444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----H-HHHHH---HHHHHH---HHHHHHHHH-------HH------- Q lcl|Aclame:pro 646 AQAEADTAQAQADMLKAQLETEEAQKQLAMIE----D-MAQGG---DVVYQQ---VRELVAQAL-------AE------- 700 (711) Q Consensus 646 ~k~qae~~~aqae~~~~q~~~~~~q~q~~~~~----~-~~q~~---~~~~~~---~~~~~~~~~-------~e------- 700 (711) .++++++++++++..+++.++.+++.+++... + +.+.. .+..++ +...++..+ ++ T Consensus 616 ~~~qa~~~kaq~e~~k~q~~a~~~~~~a~~~aa~~~~~~~q~~~~q~a~~~~~~~~~~~~q~~~~~~~~~~ae~~~~~~~ 695 (725) T protein:vir:77 616 LQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDARANAELLLKGDE 695 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHhhh Confidence 45555555555544444333322222211000 0 00000 000000 000000000 00 Q ss_pred --------H-HHHHhhhccC Q lcl|Aclame:pro 701 --------I-TASQANVTEQ 711 (711) Q Consensus 701 --------~-~~~qa~~e~Q 711 (711) + ++.+++..+| T Consensus 696 ~~~~q~~~~~~~~~~~~~~~ 715 (725) T protein:vir:77 696 QTHKQRMDIANILQSQRQNQ 715 (725) T ss_pred HHHhhHHHHHHHHHHHHhcC Confidence 0 0111111111 No 8 >protein:vir:105520 Length: 706 # NCBI annotation: phage portal protein # Family: family:all:487 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516189;genbank:gi:89885992;genbank:GeneID:3964380 Probab=100.00 E-value=2.3e-156 Score=873.82 Aligned_cols=665 Identities=25% Similarity=0.354 Sum_probs=555.4 Q ss_pred CcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHh--CCCCCCHHHHHHHHHh----CCCceEehhhHHHHHHHhhhh Q lcl|Aclame:pro 24 NNDDDRALLATARERARDGATYWKDNWEAAEDDLKFL--GGEQWPSQVRTERELE----QRPCLVNNVLPTFVDQVLGDQ 97 (711) Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y--~G~Qw~~~~~~~~~~~----g~p~~~~N~i~~~v~~i~g~~ 97 (711) =.+++.++|.+++.||+++++++++||+++.+|++|| +|+||+++++++|+++ ||||+|||+|+|+|++|+|++ T Consensus 1 m~e~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~i~~~v~~v~g~~ 80 (706) T protein:vir:10 1 MAESRQKQHERVMLRFDRAWSPQQVVREKCIEATRFVRVPGGQWEGATVAGTKLDEQFEKYPKFEINKVATELNRIISEY 80 (706) T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccCCHHHHHHHHhhhhhcCCCceEecchHHHHHHHhhHH Confidence 2357888999999999999999999999999999999 5899999999999866 689999999999999999999 Q ss_pred hhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCccEEEEE Q lcl|Aclame:pro 98 RQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESGMGYLRVR 177 (711) Q Consensus 98 ~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~ 177 (711) ++||++++|+|+. +++|.++|++|+++++++++.|++++++++||+++++||+||++|+ T Consensus 81 ~~nr~~~~v~P~~---------------------~~~d~~~Ae~l~~l~~~~~~~~~~~~a~s~Af~d~i~~G~G~~ev~ 139 (706) T protein:vir:10 81 RNNRISVKFRPGD---------------------NAASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLT 139 (706) T ss_pred HhCCCceEEecCC---------------------CCchHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHhhcCcceEEee Confidence 9999999999963 5788999999999999999999999999999999999999999999 Q ss_pred EeeccC---CCCCCcceEEEecCcc-ceeeCCCccccCccccceeeeeecCCHHHHHHhcCCcccchhhcccccccccCC Q lcl|Aclame:pro 178 SDYLAD---DSFEQDLIIEAIQNQF-SVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSVADYDTWF 253 (711) Q Consensus 178 ~d~~~~---~~~~~~i~i~~v~~~~-~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~ 253 (711) +||+++ ..+.++|.|+.|++|. +|||||.|+++|+|||+|+|+++|||+++++++||+++.+.....+..+++.|. T Consensus 140 ~d~~~~~d~~~~~~~i~i~~v~~p~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~fp~~~~~~~~~~~~~~~~d~~ 219 (706) T protein:vir:10 140 TSFVNEYDPMDERQRIAVEPIYDPARSVWFDPDAKKYDKSDALWAFCMYSVSLEKYQSEYDKAPTSLDRVGSVSWQYDWF 219 (706) T ss_pred eccccccCCCCCCccceeeeeccchhceecCchhcccChhhcceEeeeecCCHHHHHHhcCCChhhhhhhcccccccccc Confidence 999765 3466889999888876 899999999999999999999999999999999999887665666666778899 Q ss_pred CCCeEEEEEeeeeeeeceeEEEc---cC--CcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEEecCceeccCcc Q lcl|Aclame:pro 254 TEKSVRVSEYFTREPVIREIALL---SD--GRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGPVE 328 (711) Q Consensus 254 ~~~~v~v~E~~~~~~~~~~~~~~---~~--~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~le~~~p 328 (711) +.+++++.|||.++++...+..+ .. +.+++.+...+.++.+...|...+..+.+++++|+|++++|+++|++++| T Consensus 220 ~~d~~~~~eyy~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~v~~~~~~g~~~l~~~~p 299 (706) T protein:vir:10 220 TPDVVYIAKYYEVRKESVDVISYRQPLTQEIATYDSEQIADIQDELEQAGFEEIGRRSVKRRRIYVAVVDGDGFLEKPRR 299 (706) T ss_pred CCCcceecccccccceeEEEEEeeccccCCceeeccchhhhhHHHHhhCCchhhhhcccceeeEEEEeeccccccccCCC Confidence 99999999999988766544332 23 33444445556667777788888889999999999999999999999999 Q ss_pred CCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHHHHhhcccCCCc Q lcl|Aclame:pro 329 IPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNFS 408 (711) Q Consensus 329 ~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~~~~~~~~~ 408 (711) |+|++||||||||++.++++++.+||+||.|+|+|+++|+++|+++|+++++.+...++..+.+.+.++.|...+..... T Consensus 300 ~~~~~~P~vP~~g~r~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~ 379 (706) T protein:vir:10 300 IPGEHIPLIPVYGKRWFIDDVERVEGHIAKAMDPQRLYNLQVSMLADAAAQDPGQTPIVDMEQIRGLEQHWEGRNRKRPA 379 (706) T ss_pred CCCCccceEEEeeccccccccCcccceeccchhhHHHHHHHHHHHHHHHHhcCCcccccchhHHHHHHHHhhhccccccc Confidence 99999999999999999999999999999999999999999999999999997776666666665566667766555555 Q ss_pred eEEecccccCc-------CCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 409 LLTYIPQYQGD-------PGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFA 481 (711) Q Consensus 409 ~i~~~~~~~~~-------~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~~ 481 (711) ++.+++.+..+ .++..++++++|+++++|++.+..+|+++|||+++++|+.+| +||+||++++++|++.+++ T Consensus 380 ~l~~~~~~~~~g~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~~i~~vsGi~~~~lG~~sn-~SG~Ai~~rq~qg~~~~~~ 458 (706) T protein:vir:10 380 FLPLRTVTDKTGNVVAPANVAGYTQAPVLNQALAALLQQTSADIQEVTGSSQAMQQMPSN-VARETVNSLLNRSDMASFI 458 (706) T ss_pred chhcccccCCCCcccccccccccCCCcchHHHHHHHHHHHHHHHHHHhCCCHHHcCCccc-hHHHHHHHHHHHHHHHHHH Confidence 55555432221 344667888999999999999999999999999999998766 8999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEeecccChHHH Q lcl|Aclame:pro 482 FIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQ 561 (711) Q Consensus 482 ~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~ 561 (711) ++|||+++++++|+++|+||++|||++|+|||+|++++.+++.+|....++.+|..+++|||++|+|||+|+++|+++++ T Consensus 459 ~~Dnl~~~~~~~g~~lL~li~~~y~~~R~~RI~~ed~~~~~v~in~~~~d~~~G~~~~~nDi~~g~yDv~i~~~p~~~t~ 538 (706) T protein:vir:10 459 YLDNMAKSLKRAGEIWLSMAREIYGSDREVRIVHEDGTDDIALMNAAVLDNQTGRVVALNDLSTGRYDVSVDVGPSYSAR 538 (706) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCccceeeccceeccccCceeeeecceeeeEEEEEecccCcchH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhhcch---hHHHHHHHHHHhcCCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHH-HHHHHHHHHH Q lcl|Aclame:pro 562 RIEAAEAMIQFAQAVPS---AAAVMADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQ-TEPTPEQQVE 637 (711) Q Consensus 562 r~~~~~~L~~l~~~~p~---~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~-q~~~~~~q~~ 637 (711) |+++++.|+++++.++. ..+.++++++++||+|+++++.+++++..+++...++.+++.++..+++ |.++.+++.+ T Consensus 539 r~~~~~~m~el~~~~~p~~~~~~~l~~~~~~~~d~p~~~e~~e~irk~~~~q~~~~~~~~~eq~~~~q~qq~q~~q~~~~ 618 (706) T protein:vir:10 539 RDATVNALTQLLQGMLPQDPMRPALMGIIIDNMEGEGLDDFKAFNRRQLLTQGIVKPRNQQEQAIVQQAQQAQATQPDPN 618 (706) T ss_pred HHHHHHHHHHHHHhcCCcchhhHHHHHHHHhhcCccchHHHHHHHHHhhcccCCccccchhHHHHHHHHHHHHHHHHHHH Confidence 99999999999987643 3556678889999999999999999998888777776665555554333 3444455666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHhhhccC Q lcl|Aclame:pro 638 MAKSQADMAQAEADTAQAQADMLKAQLETEEAQKQLAMIEDM-----AQGGDVVYQQVRELVAQALAEITASQANVTEQ 711 (711) Q Consensus 638 ~~~~q~~~~k~qae~~~aqae~~~~q~~~~~~q~q~~~~~~~-----~q~~~~~~~~~~~~~~~~~~e~~~~qa~~e~Q 711 (711) +.+++++..+++|++++++++..+.+.++.+++.++.+.... .+..+...+.+..+ .+.++++++.|+++..= T Consensus 619 ~~~~~aq~~~~qA~~~k~~a~~~q~~~~a~~a~~qa~~~~~~~~~~~~~a~~~~~~~~~q~-~q~l~~~~a~q~~~~~~ 696 (706) T protein:vir:10 619 MLLAQAQMVVAQAEAQKSQNETVQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVMET-LRLLKEVAASQQQTIPS 696 (706) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhccCCCCC Confidence 777777777888888888888777777666655544332211 11112222222222 23344555544433222 No 9 >protein:vir:2764 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612881;genbank:gi:20065798;genbank:GeneID:935623 Probab=100.00 E-value=1.3e-154 Score=864.28 Aligned_cols=662 Identities=19% Similarity=0.226 Sum_probs=529.1 Q ss_pred CCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCCceEehhhHHH Q lcl|Aclame:pro 10 VEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRPCLVNNVLPTF 89 (711) Q Consensus 10 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~ 89 (711) |.-+. .....+.+++...+.|.+++.+|..+++.+++||.++.++++||+|+||+++++++|+++||||+|||+|+|+ T Consensus 1 ~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~ 78 (714) T protein:vir:27 1 MKNET--NTMATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHNLIAPT 78 (714) T ss_pred CCccc--ccccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccHHHH Confidence 22111 1122233445677789999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhc Q lcl|Aclame:pro 90 VDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVES 169 (711) Q Consensus 90 v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~ 169 (711) |++|+|++++||++++|+|++ +++++.++|++|+++++++++.|+++++++++|+++++| T Consensus 79 v~~v~g~~~~nr~~~~v~p~~--------------------~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~ 138 (714) T protein:vir:27 79 VDGVLGMEAKTRTDLVVMSDE--------------------PDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKA 138 (714) T ss_pred HHHHHhHHHhCCcceEEecCC--------------------CCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhc Confidence 999999999999999999984 345667899999999999999999999999999999999 Q ss_pred CccEEEEEEeeccCCCCCCcceEEEecCccceeeCCCccccCccccceeeeeecCCHHHHHHhcCCcccchh-------- Q lcl|Aclame:pro 170 GMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPV-------- 241 (711) Q Consensus 170 G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~-------- 241 (711) |+||+++++++ ++++++|+|++| ||++|||||+|+++|+|||+|+|+++|||+++|+++||+++...- T Consensus 139 G~G~~~~~~~~---d~~~~~i~i~~v-~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~ 214 (714) T protein:vir:27 139 GLSWVEVRRNS---DPFGPEFKVSTV-SRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRG 214 (714) T ss_pred CcceEEecccc---CCCCCCeEEEec-chhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhcc Confidence 99999998874 678899999999 899999999999999999999999999999999999998652110 Q ss_pred --------------------hcccccccccCCC--CCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCc Q lcl|Aclame:pro 242 --------------------YEDSVADYDTWFT--EKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGI 299 (711) Q Consensus 242 --------------------~~~~~~~~~~~~~--~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 299 (711) ...+...++.|++ +++|+|+||||+.++...|+...+|++++++..+..+...+..|. T Consensus 215 ~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~ 294 (714) T protein:vir:27 215 FVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGR 294 (714) T ss_pred ccccccccccccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcc Confidence 0011122344554 468999999999999999999999999999999999988888888 Q ss_pred hhhhhcccceEEEEEEEEecCceec-cCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 300 SIVRTRKVKTFKTYWRKITGANVLE-GPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVA 378 (711) Q Consensus 300 ~~~~~~~~~~~~v~~~~~~g~~~le-~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~ 378 (711) ..+..+.++ ++++++|+|+++|+ +++||||++||||||||++.. ..+.+||+||.|+|+|+++|+++|+++++|+ T Consensus 295 ~~~~~~~~~--rv~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~--~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l~ 370 (714) T protein:vir:27 295 VQVKVGRVS--RIREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKD--KTGEPYGLISRAIPAQDEVNFRRIKLTWLLQ 370 (714) T ss_pred hhhhccccc--eEEEEEEecCcccccCCCCCCCCceeEEEEeeeeee--ccCceeehhhhchhHHHHHHHHHHHHHHhhc Confidence 887766654 57778899999995 689999999999999999874 4566899999999999999999999999874 Q ss_pred hcCCCceEecccccCChHHHHhhcccCCCceEEecccccC----cCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHH Q lcl|Aclame:pro 379 LAPKAPFIGSEGNVEGREDEWEQANTKNFSLLTYIPQYQG----DPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDA 454 (711) Q Consensus 379 ~~~~~~~~~~~~av~~~~~~~~~~~~~~~~~i~~~~~~~~----~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~ 454 (711) ++ ++++.+|++.+.++.+.+.+++|+++++++|+... ..+|++.+++++|+++++|++++.+.|+++|||+++ T Consensus 371 --~~-~~~~~~~a~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~ 447 (714) T protein:vir:27 371 --AK-RVIMDEDATQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSA 447 (714) T ss_pred --CC-ceeeecCcccccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChH Confidence 44 45688899988777788888999999999886443 356888999999999999999999999999999999 Q ss_pred HhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcch---heecchhhhh Q lcl|Aclame:pro 455 SLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETED---FVKLNEQIFD 531 (711) Q Consensus 455 ~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~---~v~~~~~~~~ 531 (711) ++|..+|++||+||++++++|++.+.+++|||+.+++.+|+++|+||++||+++|++||+|+++... ++.+| T Consensus 448 ~lG~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in----- 522 (714) T protein:vir:27 448 FLGQDSGATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLN----- 522 (714) T ss_pred HcCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeec----- Confidence 9999999999999999999999999999999999999999999999999999999999999876543 55555 Q ss_pred hhccceeeeehhhheeeeEEeecccChHHHHHHHHHHHHHHHhhcc-hhHHHHHHHHHHhcCCcchHHHHHHHHhhhcch Q lcl|Aclame:pro 532 EESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAVP-SAAAVMADLIAQNMDWPGADVIAERLKKIVPPN 610 (711) Q Consensus 532 ~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~~p-~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~ 610 (711) +.+|.....||+++++|||+|+++|+++++|++++..|+++++.+| ..+.++++++++++|+|+++++.+++++..+++ T Consensus 523 ~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~ 602 (714) T protein:vir:27 523 AEGDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTP 602 (714) T ss_pred cccCcceecccceeeeEEEEEeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCC Confidence 4678889999999999999999999999999999999999998764 456778889999999999999999999998877 Q ss_pred hhcchhhhhhhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 611 VLSKDEREAIEEDMPEQT-EPTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQLETEEA--QKQLAMIEDMAQGGDVVY 687 (711) Q Consensus 611 ~~~~~~~~~~~~~~~~~q-~~~~~~q~~~~~~q~~~~k~qae~~~aqae~~~~q~~~~~~--q~q~~~~~~~~q~~~~~~ 687 (711) ...++.+++.++.+++++ .+.++++.+..+++++.++.+|+++++++...+.+.+++.. +++.++..+.. ..+.+ T Consensus 603 ~~~~~~~~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~--~~a~~ 680 (714) T protein:vir:27 603 KSPDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDAL--NQAHT 680 (714) T ss_pred CCccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHH Confidence 655544433333222222 22233444555667777777777666666544443333221 11111111111 11111 Q ss_pred HHHHHHHHHHHHHHHHHH---hhhccC Q lcl|Aclame:pro 688 QQVRELVAQALAEITASQ---ANVTEQ 711 (711) Q Consensus 688 ~~~~~~~~~~~~e~~~~q---a~~e~Q 711 (711) .++...++..+.+..-.+ .++.+| T Consensus 681 a~~~~~~~~~~~~~~~~~~q~~q~~~~ 707 (714) T protein:vir:27 681 AEIITGVQNMEQEQDVLQQQMLYTLQQ 707 (714) T ss_pred HHHHHhHhhhhhhhHHHHHHHHHHHHH Confidence 111101111111111111 111111 No 10 >protein:vir:817 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050550;genbank:gi:9633447;genbank:GeneID:1262279 Probab=100.00 E-value=1.3e-154 Score=864.28 Aligned_cols=662 Identities=19% Similarity=0.226 Sum_probs=529.1 Q ss_pred CCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCCceEehhhHHH Q lcl|Aclame:pro 10 VEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRPCLVNNVLPTF 89 (711) Q Consensus 10 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~ 89 (711) |.-+. .....+.+++...+.|.+++.+|..+++.+++||.++.++++||+|+||+++++++|+++||||+|||+|+|+ T Consensus 1 ~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~ 78 (714) T protein:vir:81 1 MKNET--NTMATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHNLIAPT 78 (714) T ss_pred CCccc--ccccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccHHHH Confidence 22111 1122233445677789999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhc Q lcl|Aclame:pro 90 VDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVES 169 (711) Q Consensus 90 v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~ 169 (711) |++|+|++++||++++|+|++ +++++.++|++|+++++++++.|+++++++++|+++++| T Consensus 79 v~~v~g~~~~nr~~~~v~p~~--------------------~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~ 138 (714) T protein:vir:81 79 VDGVLGMEAKTRTDLVVMSDE--------------------PDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKA 138 (714) T ss_pred HHHHHhHHHhCCcceEEecCC--------------------CCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhc Confidence 999999999999999999984 345667899999999999999999999999999999999 Q ss_pred CccEEEEEEeeccCCCCCCcceEEEecCccceeeCCCccccCccccceeeeeecCCHHHHHHhcCCcccchh-------- Q lcl|Aclame:pro 170 GMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPV-------- 241 (711) Q Consensus 170 G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~-------- 241 (711) |+||+++++++ ++++++|+|++| ||++|||||+|+++|+|||+|+|+++|||+++|+++||+++...- T Consensus 139 G~G~~~~~~~~---d~~~~~i~i~~v-~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~ 214 (714) T protein:vir:81 139 GLSWVEVRRNS---DPFGPEFKVSTV-SRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRG 214 (714) T ss_pred CcceEEecccc---CCCCCCeEEEec-chhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhcc Confidence 99999998874 678899999999 899999999999999999999999999999999999998652110 Q ss_pred --------------------hcccccccccCCC--CCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCc Q lcl|Aclame:pro 242 --------------------YEDSVADYDTWFT--EKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGI 299 (711) Q Consensus 242 --------------------~~~~~~~~~~~~~--~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 299 (711) ...+...++.|++ +++|+|+||||+.++...|+...+|++++++..+..+...+..|. T Consensus 215 ~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~ 294 (714) T protein:vir:81 215 FVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGR 294 (714) T ss_pred ccccccccccccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcc Confidence 0011122344554 468999999999999999999999999999999999988888888 Q ss_pred hhhhhcccceEEEEEEEEecCceec-cCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 300 SIVRTRKVKTFKTYWRKITGANVLE-GPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVA 378 (711) Q Consensus 300 ~~~~~~~~~~~~v~~~~~~g~~~le-~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~ 378 (711) ..+..+.++ ++++++|+|+++|+ +++||||++||||||||++.. ..+.+||+||.|+|+|+++|+++|+++++|+ T Consensus 295 ~~~~~~~~~--rv~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~--~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l~ 370 (714) T protein:vir:81 295 VQVKVGRVS--RIREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKD--KTGEPYGLISRAIPAQDEVNFRRIKLTWLLQ 370 (714) T ss_pred hhhhccccc--eEEEEEEecCcccccCCCCCCCCceeEEEEeeeeee--ccCceeehhhhchhHHHHHHHHHHHHHHhhc Confidence 887766654 57778899999995 689999999999999999874 4566899999999999999999999999874 Q ss_pred hcCCCceEecccccCChHHHHhhcccCCCceEEecccccC----cCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHH Q lcl|Aclame:pro 379 LAPKAPFIGSEGNVEGREDEWEQANTKNFSLLTYIPQYQG----DPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDA 454 (711) Q Consensus 379 ~~~~~~~~~~~~av~~~~~~~~~~~~~~~~~i~~~~~~~~----~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~ 454 (711) ++ ++++.+|++.+.++.+.+.+++|+++++++|+... ..+|++.+++++|+++++|++++.+.|+++|||+++ T Consensus 371 --~~-~~~~~~~a~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~ 447 (714) T protein:vir:81 371 --AK-RVIMDEDATQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSA 447 (714) T ss_pred --CC-ceeeecCcccccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChH Confidence 44 45688899988777788888999999999886443 356888999999999999999999999999999999 Q ss_pred HhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcch---heecchhhhh Q lcl|Aclame:pro 455 SLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETED---FVKLNEQIFD 531 (711) Q Consensus 455 ~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~---~v~~~~~~~~ 531 (711) ++|..+|++||+||++++++|++.+.+++|||+.+++.+|+++|+||++||+++|++||+|+++... ++.+| T Consensus 448 ~lG~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in----- 522 (714) T protein:vir:81 448 FLGQDSGATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLN----- 522 (714) T ss_pred HcCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeec----- Confidence 9999999999999999999999999999999999999999999999999999999999999876543 55555 Q ss_pred hhccceeeeehhhheeeeEEeecccChHHHHHHHHHHHHHHHhhcc-hhHHHHHHHHHHhcCCcchHHHHHHHHhhhcch Q lcl|Aclame:pro 532 EESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAVP-SAAAVMADLIAQNMDWPGADVIAERLKKIVPPN 610 (711) Q Consensus 532 ~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~~p-~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~ 610 (711) +.+|.....||+++++|||+|+++|+++++|++++..|+++++.+| ..+.++++++++++|+|+++++.+++++..+++ T Consensus 523 ~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~ 602 (714) T protein:vir:81 523 AEGDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTP 602 (714) T ss_pred cccCcceecccceeeeEEEEEeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCC Confidence 4678889999999999999999999999999999999999998764 456778889999999999999999999998877 Q ss_pred hhcchhhhhhhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 611 VLSKDEREAIEEDMPEQT-EPTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQLETEEA--QKQLAMIEDMAQGGDVVY 687 (711) Q Consensus 611 ~~~~~~~~~~~~~~~~~q-~~~~~~q~~~~~~q~~~~k~qae~~~aqae~~~~q~~~~~~--q~q~~~~~~~~q~~~~~~ 687 (711) ...++.+++.++.+++++ .+.++++.+..+++++.++.+|+++++++...+.+.+++.. +++.++..+.. ..+.+ T Consensus 603 ~~~~~~~~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~--~~a~~ 680 (714) T protein:vir:81 603 KSPDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDAL--NQAHT 680 (714) T ss_pred CCccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHH Confidence 655544433333222222 22233444555667777777777666666544443333221 11111111111 11111 Q ss_pred HHHHHHHHHHHHHHHHHH---hhhccC Q lcl|Aclame:pro 688 QQVRELVAQALAEITASQ---ANVTEQ 711 (711) Q Consensus 688 ~~~~~~~~~~~~e~~~~q---a~~e~Q 711 (711) .++...++..+.+..-.+ .++.+| T Consensus 681 a~~~~~~~~~~~~~~~~~~q~~q~~~~ 707 (714) T protein:vir:81 681 AEIITGVQNMEQEQDVLQQQMLYTLQQ 707 (714) T ss_pred HHHHHhHhhhhhhhHHHHHHHHHHHHH Confidence 111101111111111111 111111 No 11 >protein:vir:10117 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859247;genbank:gi:32171003;genbank:GeneID:2653343 Probab=100.00 E-value=1.3e-154 Score=864.28 Aligned_cols=662 Identities=19% Similarity=0.226 Sum_probs=529.1 Q ss_pred CCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCCceEehhhHHH Q lcl|Aclame:pro 10 VEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRPCLVNNVLPTF 89 (711) Q Consensus 10 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~ 89 (711) |.-+. .....+.+++...+.|.+++.+|..+++.+++||.++.++++||+|+||+++++++|+++||||+|||+|+|+ T Consensus 1 ~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~ 78 (714) T protein:vir:10 1 MKNET--NTMATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHNLIAPT 78 (714) T ss_pred CCccc--ccccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccHHHH Confidence 22111 1122233445677789999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhc Q lcl|Aclame:pro 90 VDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVES 169 (711) Q Consensus 90 v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~ 169 (711) |++|+|++++||++++|+|++ +++++.++|++|+++++++++.|+++++++++|+++++| T Consensus 79 v~~v~g~~~~nr~~~~v~p~~--------------------~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~ 138 (714) T protein:vir:10 79 VDGVLGMEAKTRTDLVVMSDE--------------------PDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKA 138 (714) T ss_pred HHHHHhHHHhCCcceEEecCC--------------------CCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhc Confidence 999999999999999999984 345667899999999999999999999999999999999 Q ss_pred CccEEEEEEeeccCCCCCCcceEEEecCccceeeCCCccccCccccceeeeeecCCHHHHHHhcCCcccchh-------- Q lcl|Aclame:pro 170 GMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPV-------- 241 (711) Q Consensus 170 G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~-------- 241 (711) |+||+++++++ ++++++|+|++| ||++|||||+|+++|+|||+|+|+++|||+++|+++||+++...- T Consensus 139 G~G~~~~~~~~---d~~~~~i~i~~v-~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~ 214 (714) T protein:vir:10 139 GLSWVEVRRNS---DPFGPEFKVSTV-SRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRG 214 (714) T ss_pred CcceEEecccc---CCCCCCeEEEec-chhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhcc Confidence 99999998874 678899999999 899999999999999999999999999999999999998652110 Q ss_pred --------------------hcccccccccCCC--CCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCc Q lcl|Aclame:pro 242 --------------------YEDSVADYDTWFT--EKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGI 299 (711) Q Consensus 242 --------------------~~~~~~~~~~~~~--~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 299 (711) ...+...++.|++ +++|+|+||||+.++...|+...+|++++++..+..+...+..|. T Consensus 215 ~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~ 294 (714) T protein:vir:10 215 FVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGR 294 (714) T ss_pred ccccccccccccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcc Confidence 0011122344554 468999999999999999999999999999999999988888888 Q ss_pred hhhhhcccceEEEEEEEEecCceec-cCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 300 SIVRTRKVKTFKTYWRKITGANVLE-GPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVA 378 (711) Q Consensus 300 ~~~~~~~~~~~~v~~~~~~g~~~le-~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~ 378 (711) ..+..+.++ ++++++|+|+++|+ +++||||++||||||||++.. ..+.+||+||.|+|+|+++|+++|+++++|+ T Consensus 295 ~~~~~~~~~--rv~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~--~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l~ 370 (714) T protein:vir:10 295 VQVKVGRVS--RIREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKD--KTGEPYGLISRAIPAQDEVNFRRIKLTWLLQ 370 (714) T ss_pred hhhhccccc--eEEEEEEecCcccccCCCCCCCCceeEEEEeeeeee--ccCceeehhhhchhHHHHHHHHHHHHHHhhc Confidence 887766654 57778899999995 689999999999999999874 4566899999999999999999999999874 Q ss_pred hcCCCceEecccccCChHHHHhhcccCCCceEEecccccC----cCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHH Q lcl|Aclame:pro 379 LAPKAPFIGSEGNVEGREDEWEQANTKNFSLLTYIPQYQG----DPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDA 454 (711) Q Consensus 379 ~~~~~~~~~~~~av~~~~~~~~~~~~~~~~~i~~~~~~~~----~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~ 454 (711) ++ ++++.+|++.+.++.+.+.+++|+++++++|+... ..+|++.+++++|+++++|++++.+.|+++|||+++ T Consensus 371 --~~-~~~~~~~a~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~ 447 (714) T protein:vir:10 371 --AK-RVIMDEDATQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSA 447 (714) T ss_pred --CC-ceeeecCcccccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChH Confidence 44 45688899988777788888999999999886443 356888999999999999999999999999999999 Q ss_pred HhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcch---heecchhhhh Q lcl|Aclame:pro 455 SLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETED---FVKLNEQIFD 531 (711) Q Consensus 455 ~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~---~v~~~~~~~~ 531 (711) ++|..+|++||+||++++++|++.+.+++|||+.+++.+|+++|+||++||+++|++||+|+++... ++.+| T Consensus 448 ~lG~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in----- 522 (714) T protein:vir:10 448 FLGQDSGATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLN----- 522 (714) T ss_pred HcCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeec----- Confidence 9999999999999999999999999999999999999999999999999999999999999876543 55555 Q ss_pred hhccceeeeehhhheeeeEEeecccChHHHHHHHHHHHHHHHhhcc-hhHHHHHHHHHHhcCCcchHHHHHHHHhhhcch Q lcl|Aclame:pro 532 EESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAVP-SAAAVMADLIAQNMDWPGADVIAERLKKIVPPN 610 (711) Q Consensus 532 ~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~~p-~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~ 610 (711) +.+|.....||+++++|||+|+++|+++++|++++..|+++++.+| ..+.++++++++++|+|+++++.+++++..+++ T Consensus 523 ~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~ 602 (714) T protein:vir:10 523 AEGDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTP 602 (714) T ss_pred cccCcceecccceeeeEEEEEeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCC Confidence 4678889999999999999999999999999999999999998764 456778889999999999999999999998877 Q ss_pred hhcchhhhhhhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 611 VLSKDEREAIEEDMPEQT-EPTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQLETEEA--QKQLAMIEDMAQGGDVVY 687 (711) Q Consensus 611 ~~~~~~~~~~~~~~~~~q-~~~~~~q~~~~~~q~~~~k~qae~~~aqae~~~~q~~~~~~--q~q~~~~~~~~q~~~~~~ 687 (711) ...++.+++.++.+++++ .+.++++.+..+++++.++.+|+++++++...+.+.+++.. +++.++..+.. ..+.+ T Consensus 603 ~~~~~~~~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~--~~a~~ 680 (714) T protein:vir:10 603 KSPDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDAL--NQAHT 680 (714) T ss_pred CCccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHH Confidence 655544433333222222 22233444555667777777777666666544443333221 11111111111 11111 Q ss_pred HHHHHHHHHHHHHHHHHH---hhhccC Q lcl|Aclame:pro 688 QQVRELVAQALAEITASQ---ANVTEQ 711 (711) Q Consensus 688 ~~~~~~~~~~~~e~~~~q---a~~e~Q 711 (711) .++...++..+.+..-.+ .++.+| T Consensus 681 a~~~~~~~~~~~~~~~~~~q~~q~~~~ 707 (714) T protein:vir:10 681 AEIITGVQNMEQEQDVLQQQMLYTLQQ 707 (714) T ss_pred HHHHHhHhhhhhhhHHHHHHHHHHHHH Confidence 111101111111111111 111111 No 12 >protein:vir:9950 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859080;genbank:gi:32170835;genbank:GeneID:2653184 Probab=100.00 E-value=1.3e-154 Score=864.28 Aligned_cols=662 Identities=19% Similarity=0.226 Sum_probs=529.1 Q ss_pred CCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCCceEehhhHHH Q lcl|Aclame:pro 10 VEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRPCLVNNVLPTF 89 (711) Q Consensus 10 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~ 89 (711) |.-+. .....+.+++...+.|.+++.+|..+++.+++||.++.++++||+|+||+++++++|+++||||+|||+|+|+ T Consensus 1 ~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~ 78 (714) T protein:vir:99 1 MKNET--NTMATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHNLIAPT 78 (714) T ss_pred CCccc--ccccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccHHHH Confidence 22111 1122233445677789999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhc Q lcl|Aclame:pro 90 VDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVES 169 (711) Q Consensus 90 v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~ 169 (711) |++|+|++++||++++|+|++ +++++.++|++|+++++++++.|+++++++++|+++++| T Consensus 79 v~~v~g~~~~nr~~~~v~p~~--------------------~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~ 138 (714) T protein:vir:99 79 VDGVLGMEAKTRTDLVVMSDE--------------------PDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKA 138 (714) T ss_pred HHHHHhHHHhCCcceEEecCC--------------------CCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhc Confidence 999999999999999999984 345667899999999999999999999999999999999 Q ss_pred CccEEEEEEeeccCCCCCCcceEEEecCccceeeCCCccccCccccceeeeeecCCHHHHHHhcCCcccchh-------- Q lcl|Aclame:pro 170 GMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPV-------- 241 (711) Q Consensus 170 G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~-------- 241 (711) |+||+++++++ ++++++|+|++| ||++|||||+|+++|+|||+|+|+++|||+++|+++||+++...- T Consensus 139 G~G~~~~~~~~---d~~~~~i~i~~v-~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~ 214 (714) T protein:vir:99 139 GLSWVEVRRNS---DPFGPEFKVSTV-SRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRG 214 (714) T ss_pred CcceEEecccc---CCCCCCeEEEec-chhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhcc Confidence 99999998874 678899999999 899999999999999999999999999999999999998652110 Q ss_pred --------------------hcccccccccCCC--CCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCc Q lcl|Aclame:pro 242 --------------------YEDSVADYDTWFT--EKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGI 299 (711) Q Consensus 242 --------------------~~~~~~~~~~~~~--~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 299 (711) ...+...++.|++ +++|+|+||||+.++...|+...+|++++++..+..+...+..|. T Consensus 215 ~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~ 294 (714) T protein:vir:99 215 FVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGR 294 (714) T ss_pred ccccccccccccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcc Confidence 0011122344554 468999999999999999999999999999999999988888888 Q ss_pred hhhhhcccceEEEEEEEEecCceec-cCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 300 SIVRTRKVKTFKTYWRKITGANVLE-GPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVA 378 (711) Q Consensus 300 ~~~~~~~~~~~~v~~~~~~g~~~le-~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~ 378 (711) ..+..+.++ ++++++|+|+++|+ +++||||++||||||||++.. ..+.+||+||.|+|+|+++|+++|+++++|+ T Consensus 295 ~~~~~~~~~--rv~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~--~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l~ 370 (714) T protein:vir:99 295 VQVKVGRVS--RIREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKD--KTGEPYGLISRAIPAQDEVNFRRIKLTWLLQ 370 (714) T ss_pred hhhhccccc--eEEEEEEecCcccccCCCCCCCCceeEEEEeeeeee--ccCceeehhhhchhHHHHHHHHHHHHHHhhc Confidence 887766654 57778899999995 689999999999999999874 4566899999999999999999999999874 Q ss_pred hcCCCceEecccccCChHHHHhhcccCCCceEEecccccC----cCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHH Q lcl|Aclame:pro 379 LAPKAPFIGSEGNVEGREDEWEQANTKNFSLLTYIPQYQG----DPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDA 454 (711) Q Consensus 379 ~~~~~~~~~~~~av~~~~~~~~~~~~~~~~~i~~~~~~~~----~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~ 454 (711) ++ ++++.+|++.+.++.+.+.+++|+++++++|+... ..+|++.+++++|+++++|++++.+.|+++|||+++ T Consensus 371 --~~-~~~~~~~a~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~ 447 (714) T protein:vir:99 371 --AK-RVIMDEDATQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSA 447 (714) T ss_pred --CC-ceeeecCcccccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChH Confidence 44 45688899988777788888999999999886443 356888999999999999999999999999999999 Q ss_pred HhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcch---heecchhhhh Q lcl|Aclame:pro 455 SLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETED---FVKLNEQIFD 531 (711) Q Consensus 455 ~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~---~v~~~~~~~~ 531 (711) ++|..+|++||+||++++++|++.+.+++|||+.+++.+|+++|+||++||+++|++||+|+++... ++.+| T Consensus 448 ~lG~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in----- 522 (714) T protein:vir:99 448 FLGQDSGATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLN----- 522 (714) T ss_pred HcCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeec----- Confidence 9999999999999999999999999999999999999999999999999999999999999876543 55555 Q ss_pred hhccceeeeehhhheeeeEEeecccChHHHHHHHHHHHHHHHhhcc-hhHHHHHHHHHHhcCCcchHHHHHHHHhhhcch Q lcl|Aclame:pro 532 EESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAVP-SAAAVMADLIAQNMDWPGADVIAERLKKIVPPN 610 (711) Q Consensus 532 ~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~~p-~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~ 610 (711) +.+|.....||+++++|||+|+++|+++++|++++..|+++++.+| ..+.++++++++++|+|+++++.+++++..+++ T Consensus 523 ~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~ 602 (714) T protein:vir:99 523 AEGDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTP 602 (714) T ss_pred cccCcceecccceeeeEEEEEeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCC Confidence 4678889999999999999999999999999999999999998764 456778889999999999999999999998877 Q ss_pred hhcchhhhhhhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 611 VLSKDEREAIEEDMPEQT-EPTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQLETEEA--QKQLAMIEDMAQGGDVVY 687 (711) Q Consensus 611 ~~~~~~~~~~~~~~~~~q-~~~~~~q~~~~~~q~~~~k~qae~~~aqae~~~~q~~~~~~--q~q~~~~~~~~q~~~~~~ 687 (711) ...++.+++.++.+++++ .+.++++.+..+++++.++.+|+++++++...+.+.+++.. +++.++..+.. ..+.+ T Consensus 603 ~~~~~~~~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~--~~a~~ 680 (714) T protein:vir:99 603 KSPDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDAL--NQAHT 680 (714) T ss_pred CCccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHH Confidence 655544433333222222 22233444555667777777777666666544443333221 11111111111 11111 Q ss_pred HHHHHHHHHHHHHHHHHH---hhhccC Q lcl|Aclame:pro 688 QQVRELVAQALAEITASQ---ANVTEQ 711 (711) Q Consensus 688 ~~~~~~~~~~~~e~~~~q---a~~e~Q 711 (711) .++...++..+.+..-.+ .++.+| T Consensus 681 a~~~~~~~~~~~~~~~~~~q~~q~~~~ 707 (714) T protein:vir:99 681 AEIITGVQNMEQEQDVLQQQMLYTLQQ 707 (714) T ss_pred HHHHHhHhhhhhhhHHHHHHHHHHHHH Confidence 111101111111111111 111111 No 13 >protein:vir:3296 Length: 714 # NCBI annotation: putative portal protein # Family: family:all:487 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049512;genbank:gi:9632518;genbank:GeneID:1262005 Probab=100.00 E-value=1.3e-154 Score=864.28 Aligned_cols=662 Identities=19% Similarity=0.226 Sum_probs=529.1 Q ss_pred CCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCCceEehhhHHH Q lcl|Aclame:pro 10 VEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRPCLVNNVLPTF 89 (711) Q Consensus 10 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~ 89 (711) |.-+. .....+.+++...+.|.+++.+|..+++.+++||.++.++++||+|+||+++++++|+++||||+|||+|+|+ T Consensus 1 ~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~ 78 (714) T protein:vir:32 1 MKNET--NTMATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHNLIAPT 78 (714) T ss_pred CCccc--ccccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccHHHH Confidence 22111 1122233445677789999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhc Q lcl|Aclame:pro 90 VDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVES 169 (711) Q Consensus 90 v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~ 169 (711) |++|+|++++||++++|+|++ +++++.++|++|+++++++++.|+++++++++|+++++| T Consensus 79 v~~v~g~~~~nr~~~~v~p~~--------------------~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~ 138 (714) T protein:vir:32 79 VDGVLGMEAKTRTDLVVMSDE--------------------PDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKA 138 (714) T ss_pred HHHHHhHHHhCCcceEEecCC--------------------CCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhc Confidence 999999999999999999984 345667899999999999999999999999999999999 Q ss_pred CccEEEEEEeeccCCCCCCcceEEEecCccceeeCCCccccCccccceeeeeecCCHHHHHHhcCCcccchh-------- Q lcl|Aclame:pro 170 GMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPV-------- 241 (711) Q Consensus 170 G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~-------- 241 (711) |+||+++++++ ++++++|+|++| ||++|||||+|+++|+|||+|+|+++|||+++|+++||+++...- T Consensus 139 G~G~~~~~~~~---d~~~~~i~i~~v-~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~ 214 (714) T protein:vir:32 139 GLSWVEVRRNS---DPFGPEFKVSTV-SRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRG 214 (714) T ss_pred CcceEEecccc---CCCCCCeEEEec-chhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhcc Confidence 99999998874 678899999999 899999999999999999999999999999999999998652110 Q ss_pred --------------------hcccccccccCCC--CCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCc Q lcl|Aclame:pro 242 --------------------YEDSVADYDTWFT--EKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGI 299 (711) Q Consensus 242 --------------------~~~~~~~~~~~~~--~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 299 (711) ...+...++.|++ +++|+|+||||+.++...|+...+|++++++..+..+...+..|. T Consensus 215 ~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~ 294 (714) T protein:vir:32 215 FVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGR 294 (714) T ss_pred ccccccccccccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcc Confidence 0011122344554 468999999999999999999999999999999999988888888 Q ss_pred hhhhhcccceEEEEEEEEecCceec-cCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 300 SIVRTRKVKTFKTYWRKITGANVLE-GPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVA 378 (711) Q Consensus 300 ~~~~~~~~~~~~v~~~~~~g~~~le-~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~ 378 (711) ..+..+.++ ++++++|+|+++|+ +++||||++||||||||++.. ..+.+||+||.|+|+|+++|+++|+++++|+ T Consensus 295 ~~~~~~~~~--rv~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~--~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l~ 370 (714) T protein:vir:32 295 VQVKVGRVS--RIREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKD--KTGEPYGLISRAIPAQDEVNFRRIKLTWLLQ 370 (714) T ss_pred hhhhccccc--eEEEEEEecCcccccCCCCCCCCceeEEEEeeeeee--ccCceeehhhhchhHHHHHHHHHHHHHHhhc Confidence 887766654 57778899999995 689999999999999999874 4566899999999999999999999999874 Q ss_pred hcCCCceEecccccCChHHHHhhcccCCCceEEecccccC----cCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHH Q lcl|Aclame:pro 379 LAPKAPFIGSEGNVEGREDEWEQANTKNFSLLTYIPQYQG----DPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDA 454 (711) Q Consensus 379 ~~~~~~~~~~~~av~~~~~~~~~~~~~~~~~i~~~~~~~~----~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~ 454 (711) ++ ++++.+|++.+.++.+.+.+++|+++++++|+... ..+|++.+++++|+++++|++++.+.|+++|||+++ T Consensus 371 --~~-~~~~~~~a~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~ 447 (714) T protein:vir:32 371 --AK-RVIMDEDATQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSA 447 (714) T ss_pred --CC-ceeeecCcccccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChH Confidence 44 45688899988777788888999999999886443 356888999999999999999999999999999999 Q ss_pred HhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcch---heecchhhhh Q lcl|Aclame:pro 455 SLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETED---FVKLNEQIFD 531 (711) Q Consensus 455 ~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~---~v~~~~~~~~ 531 (711) ++|..+|++||+||++++++|++.+.+++|||+.+++.+|+++|+||++||+++|++||+|+++... ++.+| T Consensus 448 ~lG~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in----- 522 (714) T protein:vir:32 448 FLGQDSGATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLN----- 522 (714) T ss_pred HcCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeec----- Confidence 9999999999999999999999999999999999999999999999999999999999999876543 55555 Q ss_pred hhccceeeeehhhheeeeEEeecccChHHHHHHHHHHHHHHHhhcc-hhHHHHHHHHHHhcCCcchHHHHHHHHhhhcch Q lcl|Aclame:pro 532 EESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAVP-SAAAVMADLIAQNMDWPGADVIAERLKKIVPPN 610 (711) Q Consensus 532 ~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~~p-~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~ 610 (711) +.+|.....||+++++|||+|+++|+++++|++++..|+++++.+| ..+.++++++++++|+|+++++.+++++..+++ T Consensus 523 ~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~ 602 (714) T protein:vir:32 523 AEGDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTP 602 (714) T ss_pred cccCcceecccceeeeEEEEEeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCC Confidence 4678889999999999999999999999999999999999998764 456778889999999999999999999998877 Q ss_pred hhcchhhhhhhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 611 VLSKDEREAIEEDMPEQT-EPTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQLETEEA--QKQLAMIEDMAQGGDVVY 687 (711) Q Consensus 611 ~~~~~~~~~~~~~~~~~q-~~~~~~q~~~~~~q~~~~k~qae~~~aqae~~~~q~~~~~~--q~q~~~~~~~~q~~~~~~ 687 (711) ...++.+++.++.+++++ .+.++++.+..+++++.++.+|+++++++...+.+.+++.. +++.++..+.. ..+.+ T Consensus 603 ~~~~~~~~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~--~~a~~ 680 (714) T protein:vir:32 603 KSPDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDAL--NQAHT 680 (714) T ss_pred CCccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHH Confidence 655544433333222222 22233444555667777777777666666544443333221 11111111111 11111 Q ss_pred HHHHHHHHHHHHHHHHHH---hhhccC Q lcl|Aclame:pro 688 QQVRELVAQALAEITASQ---ANVTEQ 711 (711) Q Consensus 688 ~~~~~~~~~~~~e~~~~q---a~~e~Q 711 (711) .++...++..+.+..-.+ .++.+| T Consensus 681 a~~~~~~~~~~~~~~~~~~q~~q~~~~ 707 (714) T protein:vir:32 681 AEIITGVQNMEQEQDVLQQQMLYTLQQ 707 (714) T ss_pred HHHHHhHhhhhhhhHHHHHHHHHHHHH Confidence 111101111111111111 111111 No 14 >protein:vir:105619 Length: 772 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164304;genbank:gi:56692922;genbank:GeneID:3197230 Probab=100.00 E-value=2.7e-156 Score=873.46 Aligned_cols=657 Identities=15% Similarity=0.162 Sum_probs=529.4 Q ss_pred CCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCCceEehhhH Q lcl|Aclame:pro 8 SRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRPCLVNNVLP 87 (711) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p~~~~N~i~ 87 (711) -.|.+.+.+-.+..+ .+.+..++.+.+.+|.++++.+.+||.++.+|++||+|+||++++++.|+++|+||+|||+|+ T Consensus 1 ~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~q~~~r~~a~~d~~fy~G~QW~~~~~~~l~~~g~p~~~~N~i~ 78 (772) T protein:vir:10 1 MQITENDRQYLNGLP--PAGDTPLTVDEYADINYEIEDQPAWRAVADKEMDYADGNQLDTELLRRQQALGIPPAVEDLIG 78 (772) T ss_pred CCcchhhHHhhccCC--cccccccCHHHHHHHHHHHhccHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEEcchH Confidence 334444333333222 334566788889999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHH Q lcl|Aclame:pro 88 TFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAV 167 (711) Q Consensus 88 ~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~ 167 (711) |+|++|+|++++||++++|+|+. +.+|.++|++|+++++++++.|+++++++++|++++ T Consensus 79 ~~v~~v~g~~~~nr~d~~v~Pr~---------------------~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i 137 (772) T protein:vir:10 79 PALLSLQGYEAVTRTDWRVTPNG---------------------DVGGQEVADALNYRLNTAERQSGADRACSEAFRPQI 137 (772) T ss_pred HHHHHHHHHHHhcCcceEEecCC---------------------CchHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHhh Confidence 99999999999999999999973 468899999999999999999999999999999999 Q ss_pred hcCccEEEEEEeeccCCCCCCcceEEEecCccceeeCCCccccCccccceeeeeecCCHHHHHHhcCCcccchhh----- Q lcl|Aclame:pro 168 ESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVY----- 242 (711) Q Consensus 168 ~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~----- 242 (711) +||+||++++++ +++++++|+|++| ||++|||||.|++ |+|||+|+|+++|||+++++++||+++..... T Consensus 138 ~~G~Gw~e~~~~---~d~~~~~i~i~~v-~p~~v~~Dp~a~~-D~sDar~~~~~~~~~~d~~~~~fp~~a~~~~~~~~~~ 212 (772) T protein:vir:10 138 ACGIGWVEVSRE---SDPFKFPYRCRPI-RRDEIHWDMKCGD-DWEACRFLRRQRWLSPDRIALVFPEHAELIGMVGKYG 212 (772) T ss_pred hcCceeEEeccc---cCCCCCCeEEEee-CcccceecCCCCC-CHHHhhhhhhhccCCHHHHHHhCCCchhHHHhhhhhc Confidence 999999998664 5678889999998 8999999999976 99999999999999999999999986532110 Q ss_pred cc---------------------------cccccccCCC--CCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHH Q lcl|Aclame:pro 243 ED---------------------------SVADYDTWFT--EKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDE 293 (711) Q Consensus 243 ~~---------------------------~~~~~~~~~~--~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 293 (711) .+ +....+.|++ +++|||+|||||.++...++...+|+++.++..+..+.. T Consensus 213 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rVrv~E~w~r~~~~~~~~~~~~g~~~~~~~~~~~~~~ 292 (772) T protein:vir:10 213 STWWGQPDLGMMEGGTSTGLHNAWNEARAWTVQEDHWYNPTSKEICLVELWYRRWVQVHVLKSPDGRVVEYDPNNLAHNI 292 (772) T ss_pred ccccCcccccccccccccccccccchhhccccccccccccCCceEEEEEEeeeeeeeeeeeccCCCceEeeCcccHHHHH Confidence 00 0011223443 589999999999999999999999999999999999999 Q ss_pred HHhcCchhhhhcccceEEEEEEEEecCceec-cCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHH Q lcl|Aclame:pro 294 LLEAGISIVRTRKVKTFKTYWRKITGANVLE-GPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSA 372 (711) Q Consensus 294 ~~~~g~~~~~~~~~~~~~v~~~~~~g~~~le-~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~ 372 (711) .+..|...+. ....++|+|++|+|+++|+ +++||+|++||||||||++. +..+.+||+||.|+|+||++|+++|+ T Consensus 293 ~l~~g~~~~~--~~~~~rv~~~~~~g~~~L~~~~~p~~~~~fP~vP~~g~r~--~~~g~~~G~vr~~kd~Qr~~N~~~S~ 368 (772) T protein:vir:10 293 ALASGRISPK--KVTVSRVRRSYWLGPHCLHDGPTPYTHRHFPYVPFFGFRE--DATGIPYGYVRGMKYAQDSLNSGVSK 368 (772) T ss_pred HHhhcccchh--eeeeeEEEEEEEecceeeccCCCCCCCCccceEEEeeeEe--ccCCcccchhhhhhhHHHHHHHHHHH Confidence 9988876544 4455789999999999997 69999999999999999987 45666899999999999999999999 Q ss_pred HHHHHHhcCCCceEecccccCChHHHHhhcccCCCceEEecccccCc--CCccccCCccchHHHHHHHHHHHHHHHHHhC Q lcl|Aclame:pro 373 ATETVALAPKAPFIGSEGNVEGREDEWEQANTKNFSLLTYIPQYQGD--PGPRRQPPAAVPAAELTLGQNSVEKIKSTMG 450 (711) Q Consensus 373 ~~~~l~~~~~~~~~~~~~av~~~~~~~~~~~~~~~~~i~~~~~~~~~--~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tG 450 (711) ++|+|+++ ++++++|+|++.++.+.+.+++|+++|+++++.++. .+|++.+++++|+++++|++.+.++|+++|| T Consensus 369 ~~~~l~~~---~~~~~~gav~~~d~~~~e~~arp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~vsG 445 (772) T protein:vir:10 369 LRWGMSVA---RVERTKGAVAMTDAQFRRQIARPDADIVLDENHMAKPGARFDVKRDYTLTDQHFQMLQDNRATIERVSN 445 (772) T ss_pred HHHHHhcc---cccccCCCccchhHHHHHhccCCCCeEEeCCccccCCCCCccccCCccccHHHHHHHHHHHHHHHHHhC Confidence 99999887 589999999998888899999999999999987654 5678889999999999999999999999999 Q ss_pred CCHHHhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccC--cchheecchh Q lcl|Aclame:pro 451 MYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDE--TEDFVKLNEQ 528 (711) Q Consensus 451 v~~~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~--~~~~v~~~~~ 528 (711) |+++++|..+|++||+||.+++++|++.+++++|||+++++++|+++|+||++|||++|++||+|+++ .++++.||.. T Consensus 446 v~~~~lG~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~y~~er~~RI~~~d~~~~~~~v~in~~ 525 (772) T protein:vir:10 446 ITAGFQGRKGTATSGIQEQQQIEQSNQSIGRIMDNFRAGRTLVGELLLAMIVEDIGQERTEVVIEGDAVTADRVVVLNEP 525 (772) T ss_pred CCHHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCCCCceEEeccc Confidence 99999999999999999999999999999999999999999999999999999999999999999874 4799999999 Q ss_pred hhhhhccceeeeehhhheeeeEEeecccChHHHHHHHHHHHHHHHhhc-chhHHHHHHHHHHhcCCcchHHHHHHHHhhh Q lcl|Aclame:pro 529 IFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAV-PSAAAVMADLIAQNMDWPGADVIAERLKKIV 607 (711) Q Consensus 529 ~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~~-p~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~ 607 (711) ..++.+|..++.|||++++|||+|+++|+++++|+++++.|+++++.+ |+++..+++++++++|+|+++++.++++++. T Consensus 526 ~~d~~tg~~~~~NDi~~g~yDv~i~~~p~~~t~r~~~~~~m~ql~~~~~P~~~~~~~~~~le~~D~p~~~ei~~~ir~~~ 605 (772) T protein:vir:10 526 QRDPQTGAAYLSNDLLRTRIKVALEDVPSTNSYRGQQLNAMSEAVKSMPPQYQAAVLPFLVSLMDVPFKRDVVEAIRAVD 605 (772) T ss_pred eecccccccceeccceeeeEEEEeeccccchHHHHHHHHHHHHHHhccChhHHHHHHHHHHhhcCCCChHHHHHHHHHHh Confidence 999999999999999999999999999999999999999999998764 5667888899999999999999999999887 Q ss_pred cchhhcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 608 PPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQLETEEAQKQLAMIEDMAQGGDVVY 687 (711) Q Consensus 608 ~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~~~~k~qae~~~aqae~~~~q~~~~~~q~q~~~~~~~~q~~~~~~ 687 (711) +++.+.+.+....++.++++++++.+ .+..+++++..++++++++.+++..+...+++..+.++++....++.. T Consensus 606 ~~~~peq~~~~~~q~~qq~~~~~~~e--l~~~q~~a~~~~~~A~a~~~~aqa~~~~~~a~~~a~~aa~~~~q~~q~---- 679 (772) T protein:vir:10 606 QQQTPEQIQQQIDQAVQDALAKAGND--IKLRELEIKERKADSEISGLNAKAVQIGVQAAFSAMQAGAQIAQMPMI---- 679 (772) T ss_pred ccCChHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHhhhhh---- Confidence 66554433322222222222222222 223333333333333333333332222222222222222211111100 Q ss_pred HHHHHHHHHHHHHHHHHHhhhccC Q lcl|Aclame:pro 688 QQVRELVAQALAEITASQANVTEQ 711 (711) Q Consensus 688 ~~~~~~~~~~~~e~~~~qa~~e~Q 711 (711) .+ + +.+.+..+-+...+. T Consensus 680 a~----~--ad~~l~~~g~~~~~~ 697 (772) T protein:vir:10 680 AP----I--ADAVMQSAGYQRPNP 697 (772) T ss_pred hH----H--HHHHHHhcccccccc Confidence 00 0 000110111111000 No 15 >protein:vir:104437 Length: 714 # NCBI annotation: putative phage portal protein # Family: family:all:487 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794061;genbank:gi:116222006;genbank:GeneID:4397502 Probab=100.00 E-value=7.4e-153 Score=854.61 Aligned_cols=664 Identities=18% Similarity=0.221 Sum_probs=525.5 Q ss_pred CCcCCCCCCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCCc Q lcl|Aclame:pro 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRPC 80 (711) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p~ 80 (711) ||--. +. .....+..++..++.+++.+|.++++.+.+||+++.+|++||+|+||+++++++|+++|+|| T Consensus 1 ~~~~~----------~~-~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~ 69 (714) T protein:vir:10 1 MKNEI----------NT-TAMKNDHGSTPRFSQRQLLSLCSDIDSQPLWRDAANKACAYYDGDQLAPEVIQVLKDRGQPM 69 (714) T ss_pred CCcCc----------Cc-ccCCCcchhhhhhhHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCc Confidence 54311 11 11222334666778899999999999999999999999999999999999999999999999 Q ss_pred eEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHH Q lcl|Aclame:pro 81 LVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYD 160 (711) Q Consensus 81 ~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~ 160 (711) +|||+|+|+|++|+|++++||++++|+|++ +++++.++|++|+++++++++.|+++++++ T Consensus 70 ~~~N~i~~~v~~v~g~~~~nr~~~~v~pr~--------------------~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s 129 (714) T protein:vir:10 70 TIHNLIAPTVDGVLGMEAKTRTDLIVMSDD--------------------PNDETEKLAEAINAEFADACRLGNMNKARS 129 (714) T ss_pred EEeccHHHHHHHHHHHHHhCCcceEEecCC--------------------CChhhHHHHHHHHHHHHHHHHhhchhHHHH Confidence 999999999999999999999999999985 345667899999999999999999999999 Q ss_pred HHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCccceeeCCCccccCccccceeeeeecCCHHHHHHhcCCcccch Q lcl|Aclame:pro 161 IAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEP 240 (711) Q Consensus 161 ~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~ 240 (711) ++|+++++||+||++++++| ++++++|+|++| ||++|||||+|+++|+|||+|+|+++|||+++++++||+++... T Consensus 130 ~af~~~~~~G~G~~~~~~d~---d~~~~~i~i~~v-~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fp~~a~~i 205 (714) T protein:vir:10 130 DAYAEQIKAGLSWVEVRRNS---EPFGPEFKVSTV-SRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVI 205 (714) T ss_pred HHHHHhhhcccceEEeeecc---CCCCCCeEEEec-ChhheeeccccccCChhhhhhhhhhccCCHHHHHHhcCCchhhh Confidence 99999999999999999986 467899999999 89999999999999999999999999999999999999865321 Q ss_pred hh----------------------------cccccccccCCC--CCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchh Q lcl|Aclame:pro 241 VY----------------------------EDSVADYDTWFT--EKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDI 290 (711) Q Consensus 241 ~~----------------------------~~~~~~~~~~~~--~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 290 (711) .. ..+...++.|++ +++|+|+||||+.++...|+...+|++++++..+.. T Consensus 206 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rV~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~ 285 (714) T protein:vir:10 206 DYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLM 285 (714) T ss_pred hccchhhcCcccchhhhhhcccccccchhhcccccccccccccCcceEEEEEEEEeEEEEEEeecCCCCCeeeeCccCHH Confidence 10 001112234544 468999999999999999999999999999999999 Q ss_pred HHHHHhcCchhhhhcccceEEEEEEEEecCcee-ccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHH Q lcl|Aclame:pro 291 VDELLEAGISIVRTRKVKTFKTYWRKITGANVL-EGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYW 369 (711) Q Consensus 291 ~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l-e~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~ 369 (711) +...+..|...+..+.+ ++|+|++|+|+++| ++++||||++||||||||++. +..+.+||+||.|+|+|+++|++ T Consensus 286 ~~~~~~~g~~~~~~~~~--~rv~~~~~~g~~~L~~~~~p~p~~~fp~vP~~g~~~--~~~g~~~G~vr~~~d~Qr~~N~~ 361 (714) T protein:vir:10 286 QAVAVASGRVQVKVGRV--SRIREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRK--DKTGEPYGLISRAIPAQDEVNFR 361 (714) T ss_pred HHHHHHhccceecccce--eeEEEEEEecchhhhcCCCCCCCCceeeEEecceee--eccCccceehhhhhhHHHHHHHH Confidence 98888888877665554 57999999999999 568999999999999999987 44567899999999999999999 Q ss_pred HHHHHHHHHhcCCCceEecccccCChHHHHhhcccCCCceEEecccccC----cCCccccCCccchHHHHHHHHHHHHHH Q lcl|Aclame:pro 370 DSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNFSLLTYIPQYQG----DPGPRRQPPAAVPAAELTLGQNSVEKI 445 (711) Q Consensus 370 ~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~~~~~~~~~~i~~~~~~~~----~~~i~~~~~~~~~~~~~~ll~~~~~~~ 445 (711) +|+++|+|+.. ++++++|++.+.++.+.+.+++||++++++|+..+ ..+|++.+++++|+++++|++++.+.| T Consensus 362 ~s~~~~~l~~~---~~~~~~gav~~~d~~~~e~~~rp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i 438 (714) T protein:vir:10 362 RIKLTWLLQAK---RVIMDEDATQLSDNDLMEQLERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLI 438 (714) T ss_pred HHHHHHHHhCC---ceeeccccccccHHHHHHhccCCCCeEEecccccccCCccccccccCCCCCcHHHHHHHHHHHHHH Confidence 99999987533 67888999988777677788999999999886433 346889999999999999999999999 Q ss_pred HHHhCCCHHHhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcc---hh Q lcl|Aclame:pro 446 KSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETE---DF 522 (711) Q Consensus 446 ~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~---~~ 522 (711) +++|||+++++|+.+|++||+||.+++++|++.+.+++|||+++++.+|+++|+||++||+++|++||+|+++.. ++ T Consensus 439 ~~~tGv~~~~lG~~~na~SGvAI~~r~~qg~~~l~~~~dnl~~~~~~~g~~ll~li~~~~~~~rv~RI~~e~~~~~~~~~ 518 (714) T protein:vir:10 439 QDTMGVYSAFLGQDSGATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQT 518 (714) T ss_pred HHhhCCCHHHcCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCccccee Confidence 999999999999999999999999999999999999999999999999999999999999999999999987643 45 Q ss_pred eecchhhhhhhccceeeeehhhheeeeEEeecccChHHHHHHHHHHHHHHHhhc-chhHHHHHHHHHHhcCCcchHHHHH Q lcl|Aclame:pro 523 VKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAV-PSAAAVMADLIAQNMDWPGADVIAE 601 (711) Q Consensus 523 v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~~-p~~~~~~~~~~~~~~~~~~~~e~~~ 601 (711) +.+| ...+...+.||+++++|||+|+++|+++++|+++++.|+++++.+ |..+.++++++++++++|+++++.+ T Consensus 519 ~~~n-----~~~~~~~~~nDi~~~~~dv~i~~~p~~~s~r~~~~~~l~ql~~~~~p~~~~~~~~~~le~~d~p~~~ei~~ 593 (714) T protein:vir:10 519 IVLN-----AEGDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVE 593 (714) T ss_pred Eeec-----cccCCccccccceeeeEEEEEeeccCcHHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCcCHHHHHH Confidence 5555 346677889999999999999999999999999999999999876 5567788899999999999999999 Q ss_pred HHHhhhcchhhcchhhhhhhhhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHH Q lcl|Aclame:pro 602 RLKKIVPPNVLSKDEREAIEEDMPEQ-TEPTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQLETEE--AQKQLAMIED 678 (711) Q Consensus 602 ~l~~~~~~~~~~~~~~~~~~~~~~~~-q~~~~~~q~~~~~~q~~~~k~qae~~~aqae~~~~q~~~~~--~q~q~~~~~~ 678 (711) ++++..+++...++.+++.++.++++ +.+..+++++..+++++.++.+++++++++...+.+.+++. ++++.+.... T Consensus 594 ~ir~~~~~~~~~~~~~~e~q~~q~~~~~~~~~q~~l~~~e~~a~~~k~eaea~~~~aqa~~~~~~a~~~~~~~~~q~~~~ 673 (714) T protein:vir:10 594 RIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVD 673 (714) T ss_pred HHHHHcCCCCCccccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999887655444333322222222 22223344555566666666666666655544333322221 1111111111 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccC Q lcl|Aclame:pro 679 -MAQGGDVVYQQVRELVAQALAEITASQANVTEQ 711 (711) Q Consensus 679 -~~q~~~~~~~~~~~~~~~~~~e~~~~qa~~e~Q 711 (711) ..+...+.+-+....+++..+.+.+.-.+..+| T Consensus 674 ~~~~a~~a~~l~~~~~~~q~~~~~~q~~~q~~~~ 707 (714) T protein:vir:10 674 ALNQAHTAEIITGVQNMEQEQDVLQQQMLYTLQQ 707 (714) T ss_pred HHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHH Confidence 011000000000001111111111111111111 No 16 >protein:vir:93630 Length: 776 # NCBI annotation: Bcep22gp51 # Family: family:all:487 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944280;genbank:gi:38640357;genbank:GeneID:2658279 Probab=100.00 E-value=1.9e-150 Score=841.37 Aligned_cols=666 Identities=20% Similarity=0.244 Sum_probs=525.4 Q ss_pred CCcCCCCCCC---CcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhC Q lcl|Aclame:pro 1 MAKKQKKSRV---EQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQ 77 (711) Q Consensus 1 ~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g 77 (711) .-++.|.+++ .+.++++.+.++.++.+..++|.+++.||+++++++.+||+++.+|++||+|+||+++++++|+++| T Consensus 13 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~~g 92 (776) T protein:vir:93 13 VPARTDEGELSPGEDAAQREKPANPLDSEQAVELHSRLLSYYRQELSRQQDNRAEMAVDEDYYDNIQWSQDEIDELKERG 92 (776) T ss_pred cccccccccCCCCCcccchhcccCCCCCHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHhcC Confidence 3445566666 4444555566788888999999999999999999999999999999999999999999999999999 Q ss_pred CCceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHH Q lcl|Aclame:pro 78 RPCLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAET 157 (711) Q Consensus 78 ~p~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~ 157 (711) +||+|||+|+++|++|+|++++||++|+|+|++ ++|.++|++|+++++++++.|++++ T Consensus 93 ~p~~~~N~i~~~i~~v~g~~~~nr~~~~~~p~~----------------------~~d~~~Ae~l~~~~~~~~~~~~~~~ 150 (776) T protein:vir:93 93 QAPTVYNVISQSVNWIIGSEKRGRSDFKVLPRR----------------------KDGGKAAERKTALLKYLSDVNHTPF 150 (776) T ss_pred CceEEecchHHHHHHHHHHHHhCCcceEEecCC----------------------hhHHHHHHHHHHHHHHHHHhhcHHH Confidence 999999999999999999999999999999974 7899999999999999999999999 Q ss_pred HHHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCccceeeCCCccccCccccceeeeeecCCHHHHHHhcCCcc Q lcl|Aclame:pro 158 EYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDAT 237 (711) Q Consensus 158 ~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~ 237 (711) +++++|+++++||+||++|+++|+. ++++++.++++|++|||||.|+++|++||+|+|+++|||+++|+++||+++ T Consensus 151 ~~~~af~d~~~~G~G~~~v~~d~~~----~~~~~~~~~~~p~~i~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~p~~~ 226 (776) T protein:vir:93 151 ERSMAFEETTKAGIGWLESQVQDEN----DGEPIYAGAESWRNILWDSTYRRLDMDDCRYIFRVKWVDLDVMLAIFPERA 226 (776) T ss_pred HHHHHHHHhhhcCcceEEEEeeccC----CCCceEeeccChhheeeccccccCCHHHHhhhhhhccCCHHHHHHhcCCch Confidence 9999999999999999999998742 356677777799999999999999999999999999999999999999865 Q ss_pred cchhhcc----------------------------cccccccCCCCCeEEEEEeeeeeeeceeEEEc--cCCcEEEecCc Q lcl|Aclame:pro 238 AEPVYED----------------------------SVADYDTWFTEKSVRVSEYFTREPVIREIALL--SDGRSFWLDAL 287 (711) Q Consensus 238 ~~~~~~~----------------------------~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~--~~~~~~~~~~~ 287 (711) ....... .......|.++++|+|+|||||+++...++.+ ++++.+.++.. T Consensus 227 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~r~~~~~~~~~~~~~~~~~~~~d~~ 306 (776) T protein:vir:93 227 AQLRAAAVDNFETWGTDDIDGDDAMDSPEYERSMNSVTAGAVAYARKRVRMIEAWFRMPVRVQRLKGRNSDFRGEVFDPN 306 (776) T ss_pred HHHHHhhhhcccccchhcccccccccccccccccccccccccccCCCeEEEEEEEEeeeeehhhcccccccccceeeccc Confidence 3321100 00001233467899999999999988777754 77888999999 Q ss_pred chhHHHHHhcCchhhhhcccceEEEEEEEEecCceec-cCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHH Q lcl|Aclame:pro 288 EDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLE-GPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMA 366 (711) Q Consensus 288 ~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~le-~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~ 366 (711) ...+...+..|...+..+. ..+++|++++|+++|+ +++||+|++||||||||++. +++++|||+|+.|+|+|+++ T Consensus 307 ~~~~~~~~~~g~~~~~~~~--~~~v~~~~~~g~~~l~~~~~p~~~~~~Pfv~~~~~~~--~~~~~~~G~v~~~~d~Q~~~ 382 (776) T protein:vir:93 307 DERHVLEVESGRAVLAVSP--MMRMHCAIMTTRDLMWAGPSPYRHNRYPFTPIWGFRR--ARDGMPYGVIRFMRGMQDDV 382 (776) T ss_pred chHHHHHhhcCceeehhee--eeeeEEEEEecchhhhccCCCCCCCccceEEecCcee--cccccccchHHhhhHHHHHH Confidence 9988888888877665554 4688999999999985 58999999999999999986 45778999999999999999 Q ss_pred HHHHHHHHHHHHhcCCCceEecccccCChHHHHhhcccCCCceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHH Q lcl|Aclame:pro 367 NYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIK 446 (711) Q Consensus 367 N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~~~~~~~~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~ 446 (711) |+++|+++|+|+. .++++++|++++.++++.+ .++||++++++++... .+.+.+.+++++++++|++++.+.|+ T Consensus 383 N~~~s~~~~~l~~---~~~~~~~gav~~~d~~~~~-~~rp~~vi~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~i~ 456 (776) T protein:vir:93 383 NKRLSKALYILST---NKVLMEEGAVDDIDEFRRE-AARPDAVMTVKNGKLG--AVKMDVDRDLAPAHLELASRSIQMIQ 456 (776) T ss_pred HHHHHHHHHhhcC---CceeeccccccchHHHHHh-cccCCceeeeCCcccc--ccccccCcCccHHHHHHHHHHHHHHH Confidence 9999999999863 5899999999999876664 6889999999988654 45667788899999999999999999 Q ss_pred HHhCCCHHHhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecc Q lcl|Aclame:pro 447 STMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLN 526 (711) Q Consensus 447 ~~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~ 526 (711) ++|||+++++|..+|++||++|.+++++|++++++++|||+++++++|+++|+||++||+++|+|||+|+++..+||.|| T Consensus 457 ~~tGi~~~~~G~~~n~~Sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~~~~~~~~~v~in 536 (776) T protein:vir:93 457 QVGGVTDEMLGRTTNAVSGVAIQARQEQGSVATNKLFDNLRLAFQQHGEKELSLIEQYMTEEKQFRITNSRGNPEYVTVN 536 (776) T ss_pred HhhCcChHHhCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcceEEEEeecCCCcceEEec Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999998 Q ss_pred hhhhhhhccceeeeehhhheeeeEEeecccChHHHHHHHHHHHHHHHhh-cchhHHHHHHHHHHhcCCcchHHHHHHHHh Q lcl|Aclame:pro 527 EQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQA-VPSAAAVMADLIAQNMDWPGADVIAERLKK 605 (711) Q Consensus 527 ~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~-~p~~~~~~~~~~~~~~~~~~~~e~~~~l~~ 605 (711) .. ++.||+++++|||+|++++++.++|++++..|+++++. .|++++.+...+++++++|+.+++.+.+++ T Consensus 537 ~~---------~~~nd~~~~~~dv~v~~~~~~~s~r~~~~~~l~ql~~~~~p~~~~~~~~~~~e~~d~p~~~e~~~~l~~ 607 (776) T protein:vir:93 537 DG---------LPENDITRTKADFIIDEAEWRATMRQAAVAELMEVIGKMPPEIALTMLDLLVENMDIPNRDELVKRIRA 607 (776) T ss_pred cc---------chhhhhccceeeEEEeecccchhHHHHHHHHHHHHHhhcChhhHHHHHHHHHHhcCccchHHHHHHHHH Confidence 64 34589999999999999999999999999999998764 466788888999999999999999999998 Q ss_pred hhcchhhcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 606 IVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQLETEEAQKQLAMIEDMAQGGDV 685 (711) Q Consensus 606 ~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~~~~k~qae~~~aqae~~~~q~~~~~~q~q~~~~~~~~q~~~~ 685 (711) ..++..+.+......++++++.+++.++.+.+.+++++..+++++....++++...+++...++++.....+...+..+. T Consensus 608 ~~~~~~p~q~~~~~e~~~~qq~q~~~~q~q~~~~~a~~~~~qa~a~~~~aea~~~~aqa~~~~~~a~~~~~~a~q~a~qa 687 (776) T protein:vir:93 608 VNGQKDPDQDEPTPEEIAREQAQQQQQQYNDALAIATLEEQQAKARKAAAEAQVAEAKAKHISRMAIREGVGAVKDATDA 687 (776) T ss_pred hhcccccchhhcchhHHHHHHHhhHHHHHHHHHhhhhhhHhhHHHHHHHHHHHHHhhhhhhhhhcchhhhhhhhhhhhhh Confidence 87766555555444444444444444344433333333333333322222222222222211111111111110000000 Q ss_pred --HHHHHHHHHHHHHHHHHHHHhhhccC Q lcl|Aclame:pro 686 --VYQQVRELVAQALAEITASQANVTEQ 711 (711) Q Consensus 686 --~~~~~~~~~~~~~~e~~~~qa~~e~Q 711 (711) +..............+..+..+...+ T Consensus 688 ~~~~~~~~~~a~~a~~~~~~a~~~~p~~ 715 (776) T protein:vir:93 688 ATAIAFMPELAGLSDGILRESGWDDPNT 715 (776) T ss_pred hhhhhhhhhhhhhhhhhhcccccccccc Confidence 00000000000000000000000000 No 17 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=100.00 E-value=1e-87 Score=497.52 Aligned_cols=592 Identities=14% Similarity=0.127 Sum_probs=375.4 Q ss_pred CCcCCCCCCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHH-HHHHHHHHHHhCCCCCCHHHHHHHHHhCCC Q lcl|Aclame:pro 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDN-WEAAEDDLKFLGGEQWPSQVRTERELEQRP 79 (711) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p 79 (711) |||+++.+- .++.+++..+..+++.+..+.... ..++.++++||+|++|+.. ..|+. T Consensus 1 ~~k~~~~~~----------------~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~y~g~~~~~~------~~~~s 58 (705) T protein:vir:88 1 MAKRRKIKP----------------MDDEQVLRHLDQLVNDALDFNSSELSKQRSEALKYYFGEPFGNE------RPGKS 58 (705) T ss_pred CCccccccc----------------CCHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHhCCCCCcc------cCCCC Confidence 777665443 344557888889999999877644 4588999999999999763 47999 Q ss_pred ceEehhhHHHHHHHhhhhhh----cccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHH-HhhcC Q lcl|Aclame:pro 80 CLVNNVLPTFVDQVLGDQRQ----NRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNI-EYNCD 154 (711) Q Consensus 80 ~~~~N~i~~~v~~i~g~~~~----~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~-~~~~~ 154 (711) +++.|.|...|++++++..+ +..-++|.|+. ++|.++|+.++.+++|+ .+.|+ T Consensus 59 ~~~~~~v~~~v~~~~~~l~~~~~~~~~~~~~~p~~----------------------~~D~~~a~~~~~~~~~~~~~~~~ 116 (705) T protein:vir:88 59 GIVSRDVQETVDWIMPSLMKVFTSGGQVVKYEPDT----------------------AEDVEQAEQETEYVNYLFMRKNE 116 (705) T ss_pred ccccHHHHHHHHHHHHHHHHhhcCCCceEEEeeCC----------------------hhHHHHHHHHHHHHhHHHhhccc Confidence 99999999999999998776 45567888874 78999999999999996 67788 Q ss_pred HHHHHHHHHHHHHhcCccEEEEEEeeccCCC------------------------------------------CCCcceE Q lcl|Aclame:pro 155 AETEYDIAFQGAVESGMGYLRVRSDYLADDS------------------------------------------FEQDLII 192 (711) Q Consensus 155 ~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~~------------------------------------------~~~~i~i 192 (711) ....++++|++++++|+||++|+|+...... ..++|+| T Consensus 117 ~~~~~~~~~~dal~~g~gi~kv~we~~~~~~~e~~~~~~~~~l~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i 196 (705) T protein:vir:88 117 GFKVMFDWFQDTLMMKTGVVKVYVEEVLKPTFERFSGLSEDMVADILSDPDTSILAQSVDDDGTYTIKIRKDKKKREIKV 196 (705) T ss_pred hhHHHHHHHHHHhhcCCeEEEeccccccchhhhhhccCChhhhhhhhhhhhhhcccccccccceeeeEEeeeeecCceee Confidence 8889999999999999999999995421110 1156888 Q ss_pred EEecCccceeeCCCccccCccccceeeeeecCCHHHHHHhcCCccc-chhhcccc-----------ccccc--------- Q lcl|Aclame:pro 193 EAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATA-EPVYEDSV-----------ADYDT--------- 251 (711) Q Consensus 193 ~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~-~~~~~~~~-----------~~~~~--------- 251 (711) +.| +|++|+|||+|++ +.||.|++++.++|+++|++++++.+. +.+...+. .+++. T Consensus 197 ~~V-~p~d~~~dp~a~~--~~d~~~~~~~~~~t~~dl~~~g~~~~~~~~~~~~~~~~~~~~~e~~~~~~~d~~~~~~~~~ 273 (705) T protein:vir:88 197 LCV-KPENFLVDRLATC--IDDARFLCHREKYTVSDLRLLGVPEDVIEELPYDEYEFSDSQPERLVRDNFDMTGQLQYNS 273 (705) T ss_pred eec-cHHHceecCCCCC--cccCcEEEEEEeccHHHHHhhcCChhHhhhhhcccccchhhhhhhcccccccccccccccc Confidence 888 7999999999984 569999999999999999998665421 11111100 00000 Q ss_pred -CC--CCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEEecCceeccCcc Q lcl|Aclame:pro 252 -WF--TEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGPVE 328 (711) Q Consensus 252 -~~--~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~le~~~p 328 (711) |. ....|.++|||.+-.. .++ ...+.++++++|++|+..+ T Consensus 274 ~~~~~~~r~v~~~E~y~~~d~------~~d-----------------------------~~~~~~~~~~~g~~il~~~-- 316 (705) T protein:vir:88 274 GDDAEANREVWASECYTLLDV------DGD-----------------------------GISELRRILYVGDYIISNE-- 316 (705) T ss_pred ccccCCceeEEEEEeeeEecc------cCC-----------------------------cceeeEEEEEeCccccccc-- Confidence 11 1124555666554211 011 1235566778899998653 Q ss_pred CCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHHHHhhcccCCCc Q lcl|Aclame:pro 329 IPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNFS 408 (711) Q Consensus 329 ~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~~~~~~~~~ 408 (711) +.+++||+.+. +.|++++++|+|+++.++|+|+.+|+++|+++|+++++++|++++++|++. .++. ..++||+ T Consensus 317 -~~~~~PF~~~~--~~p~~~~~~G~g~~~~~~d~Q~~~n~~~~~~~d~~~~~~~~~~~~~~g~v~-~~d~---~~~~pg~ 389 (705) T protein:vir:88 317 -PWDCRPFADLN--AYRIAHKFHGMSVYDKIRDIQEIRSVLMRNIMDNIYRTNQGRSVVLDGQVN-LEDL---LTNEAAG 389 (705) T ss_pred -cCCCCCEEEec--ceeecCccccCChHHHHhHHHHHHHHHHHHHHHHHHhccCCceeccccccC-cccc---cccCCCe Confidence 45789999654 445678999999999999999999999999999999999999999999985 3333 3478999 Q ss_pred eEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhcccc----chhHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 409 LLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMG----NETSGRAIIARQRQGDRGSFAFID 484 (711) Q Consensus 409 ~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~----~~~sg~ai~~~~~~~~~~~~~~~d 484 (711) ++.++++ +.|.+++++++|+++++|+++..+.++++|||+++++|.++ ++.|+.++++++++|++++..+++ T Consensus 390 vv~~~~~----~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGi~~~~~G~~~~~~~~~~Ta~~i~~~~~~~~~r~~~~~r 465 (705) T protein:vir:88 390 IVRVKSM----NSITPLETPQLSGEVYGMLDRLEADRGKRTGITDRTRGLDQNTLHSNQAAMSVNQLMTAAEQQIDLIAR 465 (705) T ss_pred eEEecCC----CccccccCCcCcHHHHHHHHHHHHHHHHhhCCchHHcCCCcccccchhhHHHHHHHHHHHHHHHHHHHH Confidence 9999853 45888999999999999999999999999999999999764 356888999999999999999999 Q ss_pred HHHH-HHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEeecccChHHH-- Q lcl|Aclame:pro 485 NLTK-SIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQ-- 561 (711) Q Consensus 485 n~~~-~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~-- 561 (711) +|+. +++++|+++++||.+||++++++||+| +++.+++.. ..++|||.++.++++.++ T Consensus 466 ~~a~~~~~~l~~~~~~li~~~~~~~~~~ri~g-----~~v~v~~~~--------------~~~~~~v~v~v~~~~~~~eq 526 (705) T protein:vir:88 466 MFAETGVKRLFQLLHDHAIKYQNQEEVFQLRG-----KWVAVNPAN--------------WRERSDLTVTVGIGNMNKDQ 526 (705) T ss_pred HHHHHHHHHHHHHHHHHHHHhCCCceEEeecc-----chhccchHh--------------hccCCceEEeeccccchHHH Confidence 9985 789999999999999999999999998 466666533 245788888877766553 Q ss_pred HHHHHHHHHHHHhhcc---hhHHH-----HHHH---HHHhcCCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHH Q lcl|Aclame:pro 562 RIEAAEAMIQFAQAVP---SAAAV-----MADL---IAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEP 630 (711) Q Consensus 562 r~~~~~~L~~l~~~~p---~~~~~-----~~~~---~~~~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~ 630 (711) +.+....++++.+..+ ...+. ...+ +++.+.+.+..++.......... ....+..+...++++. T Consensus 527 ~~a~l~~ll~~~q~l~~~~~~~~~~~~~~~~~~~~el~e~~~~k~~~~~~~~~~~~e~~-----~~~~~~~q~e~~~~~~ 601 (705) T protein:vir:88 527 QMLHLMRIWEMAQAVVGGGGLGVLVSEQNLYNILKEVTENAGYKDPDRFWTNPNSPEAL-----QAKAIREQKEAQPKPE 601 (705) T ss_pred HHHHHHHHHHHHHHhhcccchhhhcChHHHHHHHHHHHHhhhhhhHHHHhhhhhhHHHH-----HHHHhhhhhhhhHHHH Confidence 3344444444333221 11111 1111 12222222222221110000000 0000000000111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---------HHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 631 TPEQQVEMAKSQADMAQAEADTAQAQADMLKAQLETEEAQKQL---------AMIEDMAQGGDVVYQQVRELVAQALAEI 701 (711) Q Consensus 631 ~~~~q~~~~~~q~~~~k~qae~~~aqae~~~~q~~~~~~q~q~---------~~~~~~~q~~~~~~~~~~~~~~~~~~e~ 701 (711) ...+|+..+++++++++++++++.+|.+....+++.+..++++ .+.+..++..+.+++....+.+..++.. T Consensus 602 ~~~~q~e~~k~q~e~~~~q~e~q~~q~E~q~~q~e~e~~~~~~~~~~~e~~~~~a~~~~~~~~~e~e~~~~e~e~~~e~~ 681 (705) T protein:vir:88 602 DIKAQADAQRAQSDALAKQAEAQMKQVEAQIRLAEIELKKQEAVLQQREMALKEAELQLERDRFTWERARNEAEYHLEAT 681 (705) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1111222222222222222222222222111111110000000 0000000000000000000000000000 Q ss_pred HHHH-----hhh-------ccC Q lcl|Aclame:pro 702 TASQ-----ANV-------TEQ 711 (711) Q Consensus 702 ~~~q-----a~~-------e~Q 711 (711) ...+ +.+ +++ T Consensus 682 q~~~~~~~~~~~~~~~k~~~~~ 703 (705) T protein:vir:88 682 QARAAYIGDGKVPETKKPTKAV 703 (705) T ss_pred HHHHHHHHHHhHHHHHHHHHHh Confidence 0000 000 000 No 18 >protein:vir:95821 Length: 763 # NCBI annotation: 94 kDa protein # Family: family:all:1548 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950537;genbank:gi:119952228;genbank:GeneID:5075648 Probab=100.00 E-value=2.1e-87 Score=495.77 Aligned_cols=609 Identities=13% Similarity=0.118 Sum_probs=407.7 Q ss_pred CCcCCCCCCCCcccCCCcccCCc-CcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHH--hCCCCCCHHHHHHHHHhC Q lcl|Aclame:pro 1 MAKKQKKSRVEQLYAKKAKVYAK-NNDDDRALLATARERARDGATYWKDNWEAAEDDLKF--LGGEQWPSQVRTERELEQ 77 (711) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~--y~G~Qw~~~~~~~~~~~g 77 (711) |-...-. +++-+++..+.. .+=+++.+|..|..++..+.....+.+.++...++| |.|+.- ..+..| T Consensus 1 ~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~g 70 (763) T protein:vir:95 1 MEQNTDS----MVPLPDPSQATKLTSWKNELSLQALKADLDAAKPSHTAMMIKVKEWNDLMRIEGKAK------PPKVKG 70 (763) T ss_pred CCcCccC----cCCCccccchhcCCCCCChHHHHHHHHHHHhhhcchhHHHHHHHHHHHhhhccccCc------ccccCC Confidence 5432211 111122222222 223667778888888777777666666665555554 556542 234469 Q ss_pred CCceEehhhHHHHHHHhhhhhh---cccc-eeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHH-hh Q lcl|Aclame:pro 78 RPCLVNNVLPTFVDQVLGDQRQ---NRPA-IKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIE-YN 152 (711) Q Consensus 78 ~p~~~~N~i~~~v~~i~g~~~~---~r~~-~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~-~~ 152 (711) |..++...|+..|+|+++...+ +..+ +.|.|++ ++|.++|+..+.+++|+. .. T Consensus 71 rs~vv~~~v~~~ve~~~~~l~~~f~~~~~~~~~~P~~----------------------~~D~~~A~q~t~~~n~~~~~~ 128 (763) T protein:vir:95 71 RSQVQPKLVRRQAEWRYSALTEPFLGSNKLFKVTPVT----------------------WEDVQGARQNELVLNYQFRTK 128 (763) T ss_pred CccccCHHHHHHHHHHHHHHHHhhcCCCcEEEEecCC----------------------cchHHHHHHHHHHHHHHHhhc Confidence 9999999999999999999888 3344 4899974 899999999999999965 56 Q ss_pred cCHHHHHHHHHHHHHhcCccEEEEEEeeccCC------------------------------------------C----- Q lcl|Aclame:pro 153 CDAETEYDIAFQGAVESGMGYLRVRSDYLADD------------------------------------------S----- 185 (711) Q Consensus 153 ~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~------------------------------------------~----- 185 (711) |+.....++++++++++|+|+++|+|+.+.+. . T Consensus 129 ~~~~~~~~~~~~~~l~~~~gv~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 208 (763) T protein:vir:95 129 LNRVSFIDNYVRSVVDDGTGIVRVGWNREIRKEKQEVPVFSLFPIQTQEQADALQQALQLRTDNPRGYEENVDEAIKESV 208 (763) T ss_pred CchhhHHHHHHHHHhhcCcceEEEeeeeeeeeeeeeehhhhhccccchhHHHHHHHHHHhhhhhhccccccccchhhhhh Confidence 77778888999999999999999998632100 0 Q ss_pred -------------------------CCCcceEEEecCccceeeCCCccccCccccceeeeeecCCHHHHHHh-cCCcccc Q lcl|Aclame:pro 186 -------------------------FEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKAL-YPDATAE 239 (711) Q Consensus 186 -------------------------~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~-~p~~~~~ 239 (711) ..+.++|+.| +|++|||||.|++ |++||+|||++.++|+++|.++ |+....+ T Consensus 209 ~~~~~~~~~~~~~~~~~~~~~~~~~~k~~p~ie~V-~p~d~~iDp~a~s-D~~Da~~~~~~~~~t~~dL~~~~~~y~~~~ 286 (763) T protein:vir:95 209 RFFDETGQATYAVQTGTTTTEVEVPLANHPTVEML-NPENIIIDPSCQG-DINKAMFAIVSFETCKADLLKEKDRYHNLN 286 (763) T ss_pred hhccccCcceeeecccceeEEEEEEecCceEEEee-cHHHheecCCCCC-chhhCceEeeEEeccHHHHHhccCCccccc Confidence 0134566666 8999999999987 7899999999999999999887 2222212 Q ss_pred hhhccccc------------cc--ccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhc Q lcl|Aclame:pro 240 PVYEDSVA------------DY--DTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTR 305 (711) Q Consensus 240 ~~~~~~~~------------~~--~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~ 305 (711) .+...... .. ......++|+|.|||.+... .++| T Consensus 287 ~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~V~v~E~y~~~d~------~gdg-------------------------- 334 (763) T protein:vir:95 287 KIDWQSSAPVNEPDHATTTPQEFQISDPMRKRVVAYEYWGFWDI------EGNG-------------------------- 334 (763) T ss_pred hhcchhccccccccccccchhhccCCCcccceEEEEEeeeeecc------CCcc-------------------------- Confidence 22111100 00 01112468888888876321 1111 Q ss_pred ccceEEEEEEEEecCceec-cCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCc Q lcl|Aclame:pro 306 KVKTFKTYWRKITGANVLE-GPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAP 384 (711) Q Consensus 306 ~~~~~~v~~~~~~g~~~le-~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~ 384 (711) ..+.+++++.|+++|+ .++||+|++|||+++++++ +.++++|+|+++.++|+|+++|+++|+++|+++++++++ T Consensus 335 ---~~~~~~v~~~g~~iL~~~~~p~~~~~~PFv~~~~~p--~~~~~~G~gi~~~~~d~Qr~~N~~~~~~~d~l~~~~~~~ 409 (763) T protein:vir:95 335 ---VLEPIVATWIGSTLIRLEKNPYPDGKLPFVLIPYMP--VKRDMYGEPDAELLGDNQAVLGAVMRGMIDLLGRSANGQ 409 (763) T ss_pred ---eeEEEEEEEEcCeeeecccccccCCCcCEEEeccee--ecCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhhcCCc Confidence 2345667788999885 5799999999999877654 578899999999999999999999999999999999999 Q ss_pred eEecccccCChHHHHhhcccCCCceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccch-- Q lcl|Aclame:pro 385 FIGSEGNVEGREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNE-- 462 (711) Q Consensus 385 ~~~~~~av~~~~~~~~~~~~~~~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~-- 462 (711) |++++|++.+.+ .+ .++||++++++++......++...++.+++..+.++++....++++|||++.++|..++. T Consensus 410 ~~v~~gav~~~d-~~---~~~pg~v~~v~~g~~~~~~~~~~~~p~~~~~~~~~l~~~~~~~e~~TGv~~~~~G~~~~~~~ 485 (763) T protein:vir:95 410 RGMPKGMLDALN-SR---RYREGEDYEYNPTQNPAQMIIEHKFPELPQSALTMATLQNQEAESLTGVKAFAGGVTGESYG 485 (763) T ss_pred EEeecccccchh-hh---cccCCceEEeeCCCChhhhcccccCCCCcchHHHHHHHHHHHHHHhhCcchhhcCcCccccc Confidence 999999986543 23 468999999999988888888888999999999999999999999999999999987653 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeeh Q lcl|Aclame:pro 463 TSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHD 542 (711) Q Consensus 463 ~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD 542 (711) .++.++..+++++++++..+++||+++++.+|+++++||++||+++++|||+|+ +|+.++.. T Consensus 486 ~tat~v~~l~qa~~~~~~~~~r~~~~~~k~l~~~~l~Li~q~~d~~rviRI~g~----e~v~v~~~-------------- 547 (763) T protein:vir:95 486 DVAAGIRGVLDAASKREMAILRRLAKGMSEIGNKIIAMNAVFLAEHEVVRITNE----EFVTIKRE-------------- 547 (763) T ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEeCC----ccccccHH-------------- Confidence 455668888999999999999999999999999999999999999999999986 46666543 Q ss_pred hhheeeeEEeecccChHHHHHHHHHHHHHHHhhc-chhHHHHHH-HHHHhcCCcchHHHHHHHHhhhcchhhcchhhhhh Q lcl|Aclame:pro 543 LNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAV-PSAAAVMAD-LIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAI 620 (711) Q Consensus 543 ~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~~-p~~~~~~~~-~~~~~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~ 620 (711) ...++|||+|+.+++ +.++++.+.|..+++.+ |...+.+.. ++.+.+++....++.+.++...+++.+.+.. T Consensus 548 ~~~~~~DV~V~~~~a--s~~~q~~~~l~~ll~~l~~~~~~~~~~~il~~~~d~~~~~~~~~~lr~~q~~~d~~~q~---- 621 (763) T protein:vir:95 548 DLKGNFDLEVDISTA--EVDNQKSQDLGFMLQTIGPNVDQQITLNILAEIADLKRMPKLAHDLRTWQPQPDPVQEQ---- 621 (763) T ss_pred HhcCCcceEEecccc--hHHHHHHHHHHHHHHHhccccChHHHHHHHHHHHhhhchhhhHHHHHhcCCCccchhhh---- Confidence 235789999998875 45555666666666654 333333323 3346667777777777777665543322111 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHH Q lcl|Aclame:pro 621 EEDMPEQTEPTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQLETEEAQKQLAMIEDMAQGG---DVVYQQVRELVAQA 697 (711) Q Consensus 621 ~~~~~~~q~~~~~~q~~~~~~q~~~~k~qae~~~aqae~~~~q~~~~~~q~q~~~~~~~~q~~---~~~~~~~~~~~~~~ 697 (711) +.+.++..++++++..++++++.++++....++++..++++.+++.+.+++...+.++.. +.+++.+.++++.. T Consensus 622 ---qaqle~~~~q~e~~~~~akaq~~qaqa~~~~aq~e~~~~d~~~~e~~~Q~~~e~~~~~~~~eaq~~l~~~~a~~~~~ 698 (763) T protein:vir:95 622 ---LKQLAVEKAQLENEELRSKIRLNDAQAQKAMAERDNKNLDYLEQESGTKHARDLEKMKAQSQGNQQLEITKALTKPR 698 (763) T ss_pred ---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 112222222333333334444444444444444444443333333333322211111100 11111111111100 Q ss_pred H-HHHHHHHhhhccC Q lcl|Aclame:pro 698 L-AEITASQANVTEQ 711 (711) Q Consensus 698 ~-~e~~~~qa~~e~Q 711 (711) . ++....++..-.+ T Consensus 699 ~ea~~~~~~~~~~~~ 713 (763) T protein:vir:95 699 KEGELPPNLSAAIGY 713 (763) T ss_pred HHhccChhHHHhhhh Confidence 0 0000000000000 No 19 >protein:vir:80165 Length: 651 # NCBI annotation: portal protein # Family: family:all:1548 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285799;genbank:gi:148747833;genbank:GeneID:5220441 Probab=100.00 E-value=2.8e-62 Score=358.03 Aligned_cols=576 Identities=11% Similarity=0.102 Sum_probs=347.7 Q ss_pred cCCCCCCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHH----------HHHHHHhCCCCCCHHHHHH Q lcl|Aclame:pro 3 KKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAA----------EDDLKFLGGEQWPSQVRTE 72 (711) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~----------~~~~~~y~G~Qw~~~~~~~ 72 (711) -|.-.++++. +.-+-.+.+.+...+..+|+++.+....+-.+| .++.+||+|..|... .- T Consensus 1 ~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~w~~~~~~~~~~~~~~~y~~~~~~~~~--~~ 70 (651) T protein:vir:80 1 MKLATTTTDK--------NRQTYDETHDVSSYVKKEYKRFCDARQVCEETWLEAWGMYLSTPEAQDYLRDQVLRSV--GD 70 (651) T ss_pred Ccccccccch--------hhhhhhhhHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHhhcccHHHHHhhcccccccc--CC Confidence 1112222221 112335556666677777777777665444444 367788888766332 12 Q ss_pred HHHhCCCceEehhhHHHHHHHhhhhhhc----ccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHH Q lcl|Aclame:pro 73 RELEQRPCLVNNVLPTFVDQVLGDQRQN----RPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKN 148 (711) Q Consensus 73 ~~~~g~p~~~~N~i~~~v~~i~g~~~~~----r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~ 148 (711) .+..||+.+++|.++..|++++++.-.. ..-++|.|.. ...+....+++++.++.+ T Consensus 71 ~~~~~rs~~~~~~v~~~ve~~~~~l~~~~~~~~~~~~~~p~~--------------------~~d~a~~~~~~~~~~~~~ 130 (651) T protein:vir:80 71 VNADWRHKITTGKAFEAIETIHAYLMSATFPNKNWFDVVPAK--------------------PGQDNLLVSRLIKRYVQD 130 (651) T ss_pred CCCCCCccccChhHHHHHHHHHHHHHHhhcCCCceeEeccCC--------------------chhHHHHHHHHHHHHHHH Confidence 3345889999999999999999887764 2225665642 112224466778888887 Q ss_pred HHhhcCHHHHHHHHHHHHHhcCccEEEEEEeecc----------------CCC---------CCCcceEEEecCccceee Q lcl|Aclame:pro 149 IEYNCDAETEYDIAFQGAVESGMGYLRVRSDYLA----------------DDS---------FEQDLIIEAIQNQFSVTI 203 (711) Q Consensus 149 ~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~----------------~~~---------~~~~i~i~~v~~~~~v~~ 203 (711) .+.+++|...++..++++++.|+|+++|+|+... +.. ..+.|+|++| +|++||| T Consensus 131 ~l~~~~~~~~~~~~~~d~l~~G~~i~kv~we~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~i~~v-~p~~~~~ 209 (651) T protein:vir:80 131 KLTEGKFRAAYANFLRQLLITGNSVLALPWRVETAEVKKKVQVRTPLFEDEPTFEVVSEEREVKSSPDFEVL-DMFDCFY 209 (651) T ss_pred HhhccCcHHHHHHHHHhhcccCceEEEEeecceeeeeehheeccccccccccceeeeccceeeeceeEEEEe-cHHHeee Confidence 7789999999999999999999999999986321 000 1245778888 7999999 Q ss_pred CCCccccCccccceeeeeecCCHHHHHHhcC-----Cccc----chhhc-------c-----cccccccCCCCCeEEEEE Q lcl|Aclame:pro 204 DPDAKKRDRSDMNWCLIDDTMSKEKFKALYP-----DATA----EPVYE-------D-----SVADYDTWFTEKSVRVSE 262 (711) Q Consensus 204 Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p-----~~~~----~~~~~-------~-----~~~~~~~~~~~~~v~v~E 262 (711) ||.|+ ++.||.|++++.+ +..++..+.. +... +.... . ...+...+...++|.|+| T Consensus 210 dp~a~--~~~d~~~v~~~~~-t~~~l~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~v~v~E 286 (651) T protein:vir:80 210 DPNVT--DPNRGAFIRKLTK-TKADILNLLSEGYYYGVDPLDVVEHKCKDTSDTKQDMLSTFQGVTTSLWSPHQNVELLE 286 (651) T ss_pred cCCCc--Cccccceeeeeee-eHHHHHHHHhcccccchhhHHHHhhhccccccCCccccccccCCCccccccccceEEEE Confidence 99987 5679999998865 4555544321 1000 00000 0 000111122356889999 Q ss_pred eeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEEecCceec-cCccCCCCccceEEEEe Q lcl|Aclame:pro 263 YFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLE-GPVEIPSTTIPVIPVWG 341 (711) Q Consensus 263 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~le-~~~p~~~~~~P~vp~~~ 341 (711) ||++... +|.. .+.+.+.+.|..+|. ..+||+++. ||+++. T Consensus 287 ~~~~~d~--------e~~~----------------------------~~~~~v~~~g~~il~~~~~~~~~~~-Pf~~~~- 328 (651) T protein:vir:80 287 YWGDIHL--------ENKT----------------------------YHDVVVTIMGNEVLRFEQNPYWCGR-PFVIGT- 328 (651) T ss_pred EEEEeec--------cCCc----------------------------eEEEEEEEcCcEEecccccCCCCCC-Ceeeec- Confidence 9876311 1110 112334556777774 467777654 999754 Q ss_pred eeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHHHHhhcccCCCceEEecccccCcCC Q lcl|Aclame:pro 342 KSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNFSLLTYIPQYQGDPG 421 (711) Q Consensus 342 ~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~~~~~~~~~~i~~~~~~~~~~~ 421 (711) +.+++++.||+|+++.+.|.|+.+|++.+.++++++++++++++++++++.+.+++ ...||++++++... . T Consensus 329 -~~~~~~~~yG~g~~~~~~~~q~~ln~l~~~~ld~~~~~~~~~~~v~~d~~~~~~~l----~~~pg~vi~~~~~~----~ 399 (651) T protein:vir:80 329 -YIPTARQPYAMGALQPNLGMLHELNIITNQRLDNLELAIDQMYTLRSDGLLQPEDV----YTEPGKVFLVSDHG----D 399 (651) T ss_pred -ceecCccccCCChHHHHhHHHHHHHHHHHHHHHHHHHHhCCcEEecCCccccHHHh----hcCCCceEEecCCC----C Confidence 45578899999999999999999999999999999999999999999999887764 25789998886432 2 Q ss_pred ccccCC-ccchHHHHHHHHHHHHHHHHHhCCCHHHhcccc---chhHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHH Q lcl|Aclame:pro 422 PRRQPP-AAVPAAELTLGQNSVEKIKSTMGMYDASLGAMG---NETSGRAIIARQRQGDRGSFAFIDNLTK-SIRRVGKI 496 (711) Q Consensus 422 i~~~~~-~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~---~~~sg~ai~~~~~~~~~~~~~~~dn~~~-~~~~~~~~ 496 (711) +.++++ +..++..++++++..+.++++|||++.++|..+ .+.|+.+|..+++++..++..++++|++ +++.++++ T Consensus 400 ~~~l~~~~~~~~~~~~~l~~l~~~~~~~~gv~~~~~g~~~~~~~~~TAteI~~~~~~~~~~l~~v~~~l~~e~l~pl~~r 479 (651) T protein:vir:80 400 LQPLANQSSNFSITYQESSFLESTIDKNFGTGNYVGANAARSGERVTAAEVAAVREAGGNRLSGIHKHIEETSLLVLLEK 479 (651) T ss_pred ceeeccCcccchhHHHHHHHHHHHHHHHhcCChHHhCCCccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 444333 446778899999999999999999999999655 3468889999999999999999999987 89999999 Q ss_pred HHHHHHhhcCccceEeeecccCcc-hheecchhhhhhhccceeeeehhhheeeeEEeecccChHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 497 LVEMIPHIYDTERVVRLKFPDETE-DFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQA 575 (711) Q Consensus 497 ~l~li~~~~~~~r~~ri~g~~~~~-~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~ 575 (711) ++.|+.+||+.++++||+|+.... .++.++. .|+. +.+++ +..|+.....|.+..+.|.++.+. T Consensus 480 ~l~l~~~~~~~~~~~ri~~~~~~~~~~~~i~~-------------~dl~-~~~~i-v~~g~~~~~~r~~~~~~l~~~~q~ 544 (651) T protein:vir:80 480 VMHLVQQFTDQPGMVRVAGDEAGAYEYYELDV-------------EDLQ-KEVRL-VPIGSDHVIERKQYIEDRLTFIQA 544 (651) T ss_pred HHHHHHHhcCcccceeecccccccccccccCc-------------ccee-eeeee-eeccHHHHHHHHHHHHHHHHHHHh Confidence 999999999999999999976432 3333322 2332 45665 344555555555555555555543 Q ss_pred ---cchhHHH-----HHHHHHHhcCCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 576 ---VPSAAAV-----MADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQ 647 (711) Q Consensus 576 ---~p~~~~~-----~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~~~~k 647 (711) .|++... ++..+++.+.+++.+.++.... +..+ +..++....|++...++++.+. T Consensus 545 ~~~~p~~~~~~~~~~~~~~l~~~~g~~~~~~~l~~~~-------q~~~----------~~~~~~~~~q~~~~~~~a~~~~ 607 (651) T protein:vir:80 545 VAQVPEMGQLVDYKRILVDLLQHWGFEEPEAYLKQQD-------QQAP----------ANPQEALLSQAKDVGGQAMSNM 607 (651) T ss_pred hccCCccchhhhHHHHHHHHHHHcCCCCcHHhcCCCc-------cchh----------hhhhHHHHhhHHHHHHHHHHHH Confidence 3333221 2233455666665554431100 0000 0000000001111111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 648 AEADTAQAQADMLKAQLETEEAQKQLAMIEDMAQGGDVVYQQVRELVAQALAEITA 703 (711) Q Consensus 648 ~qae~~~aqae~~~~q~~~~~~q~q~~~~~~~~q~~~~~~~~~~~~~~~~~~e~~~ 703 (711) +++.+++.+. .+.++++...+++. +.++.....++. -.++++.. T Consensus 608 ~~~~~~~~~~--~~~~~~~~~~~~~~-~~~~~~~~~~~~---------l~~~~~~~ 651 (651) T protein:vir:80 608 LQNQLQADGG--TQMMSEMYGTPNAD-QMQQELMATTPN---------VSEQQLTQ 651 (651) T ss_pred HHHHHHHHHH--HHHHHHHHHHHHHH-HHHHHHHHHHHH---------HHHhhccC Confidence 1111000000 00001111000000 000000000000 01111111 No 20 >protein:vir:95449 Length: 584 # NCBI annotation: hypothetical protein ORF047 # Family: family:all:1548 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294640;genbank:gi:149408206;genbank:GeneID:5237016 Probab=100.00 E-value=4e-48 Score=280.43 Aligned_cols=532 Identities=11% Similarity=0.061 Sum_probs=328.2 Q ss_pred CCcCCCCCCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCCc Q lcl|Aclame:pro 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRPC 80 (711) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p~ 80 (711) |+-|. +...+-.+.++ +-..+...|+...++.+.+..+|.+-++||.+- - ..-..-.+...+.. T Consensus 1 ~~~~~-----------~~~~~~~~~~~---~~~~v~~~~~~~~~~r~~~~~~w~el~~y~~a~-~-~~~~~~~~~~~r~~ 64 (584) T protein:vir:95 1 MSVKV-----------AELNSLLVRDS---SAQWVAYLWDRFNNQRRQKIEEWKELRNYVFAT-D-TTTTSNQGLPWKNS 64 (584) T ss_pred CCcch-----------hhhhhhccccc---hHHHHHHHHHHHHhhhchhhccCHHHHHHHHhh-h-hhhhhhcccccccc Confidence 33221 11111122223 234557778888888999999999999999872 1 12222345566778 Q ss_pred eEehhhHHHHHHHhhhhhhc----ccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHH Q lcl|Aclame:pro 81 LVNNVLPTFVDQVLGDQRQN----RPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAE 156 (711) Q Consensus 81 ~~~N~i~~~v~~i~g~~~~~----r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~ 156 (711) +++|+|...+++++.+.-.. +-=+++.|. ..+.++...+++++.++...+..+++. T Consensus 65 ~~~~k~~~~~~~i~~~l~~~~Fp~~~w~~~v~~--------------------~~~~~~~~~~~ai~~~i~dkl~e~~~~ 124 (584) T protein:vir:95 65 TTLPKLCQIRDNLHSNYFSSLFPNDDWLRWVGY--------------------GKGDSTKTKAKAIQAYMSNKCRESHFR 124 (584) T ss_pred cchhHHHHHHHHHHHHHHHhhcCccceeeeecC--------------------CCchhhHHHHHHHHHHHhhhhhhccHH Confidence 99999999999998765432 112333332 122233345999999999999999999 Q ss_pred HHHHHHHHHHHhcCccEEEEEEeeccCC-------CCCCcceEEEecCccceeeCCCccccCccccceeeeeecCCHHHH Q lcl|Aclame:pro 157 TEYDIAFQGAVESGMGYLRVRSDYLADD-------SFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKF 229 (711) Q Consensus 157 ~~~~~a~~~~~~~G~g~~~v~~d~~~~~-------~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~ 229 (711) .+....+++++++|.|++++.|...-.. ..-.+++|++| +|.+|||||.|+ +++|+.||+ +..+|+++| T Consensus 125 ~~~~~~i~d~~~~G~~~~k~~~~~~~~e~~e~~~v~~~~~prieri-SP~d~~~Dpsa~--~i~d~~fiv-rs~~T~~~L 200 (584) T protein:vir:95 125 TEVSKLIYDYIDYGNAFATVSFEAKYKEMTDGTLVPDYIGPRLVRI-SPLDIVFNPLAT--SISDTFKIV-RSVKTKGEL 200 (584) T ss_pred HHHHHHHHhhccCCceEEEEeEeecceeeeccccccccccceEEee-ChhheeecCCCC--Cccchhhhh-hhhhhHHHH Confidence 9999999999999999999987643211 11236899999 689999999997 456999999 666899999 Q ss_pred HHhc-----CCcccchhhc------------------ccccc------cccCCCCCeEEEEEeeeeeeeceeEEEccCCc Q lcl|Aclame:pro 230 KALY-----PDATAEPVYE------------------DSVAD------YDTWFTEKSVRVSEYFTREPVIREIALLSDGR 280 (711) Q Consensus 230 ~~~~-----p~~~~~~~~~------------------~~~~~------~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~ 280 (711) .++- |....+.+.. ....+ ...++.+..|.|.|||-. T Consensus 201 ~~l~~~~~~~~y~~d~v~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~ey~~~~~V~vl~~~g~-------------- 266 (584) T protein:vir:95 201 MRLAQDEPEQSYWLEALKRREEICRHLGGYSVEDFDKAAGFDVDGFGNLYEYYMSDWVEILEFYGD-------------- 266 (584) T ss_pred HHHHhhcCccccchHHHHHHHHhccCCCCCcccccccccccccccccccccccCCceeEEEeeccc-------------- Confidence 8874 2111111110 00000 011133445555555510 Q ss_pred EEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEEecCcee-ccCccCCCCccceEEEEeeeeccCCcccccchHHHh Q lcl|Aclame:pro 281 SFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVL-EGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHS 359 (711) Q Consensus 281 ~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l-e~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~ 359 (711) . ++..++ + ..... .+.++.|.+++ -..+|+|++..||+ ++.+.|..++.||+|+...+ T Consensus 267 ~--~~~~~~---e-------------~~~~~-iv~v~~g~~iIR~~~np~~~~~~PF~--~~~~~p~~~s~yG~gi~~ll 325 (584) T protein:vir:95 267 Y--HDKETG---E-------------LQTNR-IITVVDRSTEVRNESIPTWFGSAPIY--HVGWRFRPDNLWAMGPLDNL 325 (584) T ss_pred c--cccccC---C-------------Ccccc-eEEEEeccEEEEeeecCCCCCCCCEE--EEcceeeeccccCCCchhhh Confidence 0 010000 0 00011 23445777777 45789999999998 45567888999999999999 Q ss_pred hHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHHHHhhcccCCCceEEecccccCcCCccccCCcc-chHHHHHHH Q lcl|Aclame:pro 360 KDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAA-VPAAELTLG 438 (711) Q Consensus 360 ~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~~~~~~~~~~i~~~~~~~~~~~i~~~~~~~-~~~~~~~ll 438 (711) .|.|+.+|.+++.++||++++.+|.+ +..+. .++. ..+||+++..... +..+++.++. -....++.+ T Consensus 326 ~d~Q~~lna~~r~~iDnl~l~~~pv~---k~~~~-~~~~----~~~pg~~~~~~~~----~~~q~~~p~a~~~~s~~~~l 393 (584) T protein:vir:95 326 VGMQYRIDHLENAKADAVDLIIQPPL---KIIGE-VEEF----VWGPGAEIHLDQG----GDVQEIAKNVNYIINADNQI 393 (584) T ss_pred hhHHHHHhHHHHHHHHHHHHhcCcce---eeccc-cchh----cccCCceeecCCC----CCcceecCchhhhhHHHHHH Confidence 99999999999999999999999833 33332 2222 3568887776432 2355555442 223455678 Q ss_pred HHHHHHHHHHhCCCHHHhccccc-hhHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhhcCccceEeeecc Q lcl|Aclame:pro 439 QNSVEKIKSTMGMYDASLGAMGN-ETSGRAIIARQRQGDRGSFAFIDNLTKS-IRRVGKILVEMIPHIYDTERVVRLKFP 516 (711) Q Consensus 439 ~~~~~~~~~~tGv~~~~~G~~~~-~~sg~ai~~~~~~~~~~~~~~~dn~~~~-~~~~~~~~l~li~~~~~~~r~~ri~g~ 516 (711) ++....+++.|||+..++|.++. +.|+..++++.++++..++++.+.+... +++++..+++...++++..-++|++++ T Consensus 394 q~~e~~me~~sGvp~~~~G~~~~~~~TAtg~s~l~naa~~~~r~~~~~f~~~ll~~l~~ll~~~~~~nmd~~~~vr~~n~ 473 (584) T protein:vir:95 394 QMLEDRMELYAGAPREAMGIRTPGEKTAFEVQQLGNAAGRIFQEKVTTFEVELLEPVLNAMLETATRNMDGSDVIRVMDT 473 (584) T ss_pred HHHHHHHHhhhCCChhhcccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCceeeecc Confidence 99999999999999999997654 4577789999999999999999999885 488899999998999999999999997 Q ss_pred c-CcchheecchhhhhhhccceeeeehhhheeeeEEeecccChHHHHHHHHHHHHHHHhh------cchhHH-HHHHHHH Q lcl|Aclame:pro 517 D-ETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQA------VPSAAA-VMADLIA 588 (711) Q Consensus 517 ~-~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~------~p~~~~-~~~~~~~ 588 (711) . +...|+.+.+. |+ .+.|+++..-..... .|++..+.|.++++. .|.... .....+. T Consensus 474 e~~~~~f~~i~r~-------------Dl-~g~~~~va~Ga~~~~-~keq~~q~l~~ilq~~~~~~i~p~~~~~~l~~~la 538 (584) T protein:vir:95 474 DLGVKEFMSVTRE-------------DI-TANGKIRPIGARHFG-KQAQDLQNLVGIFNSQIGQMILPHTSGKALATFVD 538 (584) T ss_pred ccccccccccChh-------------hh-ccCeeEEeehhhHHH-HHHHHHHHHHHHHHhhhhhhccccchHHHHHHHHH Confidence 6 44556666442 23 466777655443333 456666666666553 111111 1111223 Q ss_pred HhcCCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 589 QNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEADTAQAQ 656 (711) Q Consensus 589 ~~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~~~~k~qae~~~aq 656 (711) +.+++|.-. ...+..... .+++.|+...++| +...+++++ .|+- |- T Consensus 539 dl~~~p~~~--------------~~~~~~~~~--~Q~~~q~~~~~~q-~~~~~~~~~---~~~~--~~ 584 (584) T protein:vir:95 539 DVTGLQGYE--------------IFRPNVAVA--EQAETQSLVAQAQ-EDLQLQAQM---PAEG--AI 584 (584) T ss_pred HHhCCCccc--------------ccCCCcccc--hhHHHHhhhHHHH-HHHHHHHhh---hhcc--CC Confidence 334444211 111111110 1111111111111 000111100 0000 00 No 21 >protein:vir:345 Length: 663 # NCBI annotation: virion structural protein # Family: family:all:3199 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203459;genbank:gi:15320615;genbank:GeneID:921720 Probab=100.00 E-value=1.1e-44 Score=261.58 Aligned_cols=591 Identities=15% Similarity=0.126 Sum_probs=338.0 Q ss_pred CCcCCCCCCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCCc Q lcl|Aclame:pro 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRPC 80 (711) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p~ 80 (711) |+.-++. + +.++++. +-.+..++...+..++.+|++......+-|.|..|+...... T Consensus 1 m~~~~~~---~----------~~~tpe~--la~~W~~~I~~a~~~~~~~h~r~~~~~k~y~~~~~~~~~~~~-------- 57 (663) T protein:vir:34 1 MNESQPT---D----------FADTPQG--WAQRWQEEMSAAREPLEKWHTQGKEIVKRYRDERDSAHDAET-------- 57 (663) T ss_pred CCccccc---c----------chhcchh--HHHHHHHHHHHHHhccchHHHHHHHHHHHhhccccCCCcccc-------- Confidence 7765544 1 1122322 355777888889999999999999999999998887654322 Q ss_pred eEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHH----HHHHHHHHHHHHH--hhcC Q lcl|Aclame:pro 81 LVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYE----LAEVFTGLIKNIE--YNCD 154 (711) Q Consensus 81 ~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~----~Ae~l~~~~~~~~--~~~~ 154 (711) -||+|...|..++.......|.|.|+|+-. ..|.. .+++++.++.... +..+ T Consensus 58 -r~nl~~sni~~i~P~iYar~P~p~V~~rf~---------------------d~d~~~~r~ase~leR~~~~~~~~D~~~ 115 (663) T protein:vir:34 58 -RWNLFSTNIQTQMASLYGQTPKVSVSRRFA---------------------DADDDVARVASELLERLLNTDIEKDSDT 115 (663) T ss_pred -ccchhhhhHHHHhhhhhcCCCcceeeeccc---------------------CcccchhhhHHHHHHHHHHHHHHhhHHH Confidence 289999999999999999999999999851 22333 4455555554333 6677 Q ss_pred HHHHHHHHHHHHHhcCccEEEEEEeeccC----------CCCCC---------------cceEEEecCccceeeCCCccc Q lcl|Aclame:pro 155 AETEYDIAFQGAVESGMGYLRVRSDYLAD----------DSFEQ---------------DLIIEAIQNQFSVTIDPDAKK 209 (711) Q Consensus 155 ~~~~~~~a~~~~~~~G~g~~~v~~d~~~~----------~~~~~---------------~i~i~~v~~~~~v~~Dp~a~~ 209 (711) ++.....+..+++.||+|+++|.++...+ +..+. .++|.+| +|.+|++|| |+. T Consensus 116 l~~~~~~~v~d~ll~~rG~~~v~Ye~~~~~~~~~~~~~D~~~~~~~a~~~~~~e~~a~E~v~id~v-~~~dfl~~p-Ar~ 193 (663) T protein:vir:34 116 FQQALEYALQDRLLPGFGLCRIRYEVEWEEVAGVDAILDEATGAELAAAVPPTQRKAYECVETDYL-HWQDVLWSP-ARV 193 (663) T ss_pred HHHHHHHHHHhhhccccceEEEEeecccchhccccccCCCccccchhcccccchhhcccceeeeee-chhhcccch-hhc Confidence 99999999999999999999998754221 11111 4778888 699999999 676 Q ss_pred cCccccceeeeeecCCHHHHHHhcCCcccchh----hc---ccccc-cccCCCCCeEEEEEeeeeeeeceeEEEccCCcE Q lcl|Aclame:pro 210 RDRSDMNWCLIDDTMSKEKFKALYPDATAEPV----YE---DSVAD-YDTWFTEKSVRVSEYFTREPVIREIALLSDGRS 281 (711) Q Consensus 210 ~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~----~~---~~~~~-~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~ 281 (711) + +++.|++.+.||++++++++|..+..... .. ...++ .......++++|.|.|-|.. T Consensus 194 W--~ev~wva~r~~mtk~e~~~rf~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~a~VwEIWdK~~------------- 258 (663) T protein:vir:34 194 W--HEVRWLAFRNLLDMREFNARFDADGSRNLWASVPKVGKPKDGKDGQSCHPWDRAEVWEIWDKGG------------- 258 (663) T ss_pred c--ccccceeeeccCCHHHHHHhhcCChhhhhhhhccCcCCccccCCCCCcchhcCcceeEEEecCC------------- Confidence 5 59999999999999999999954432111 11 00110 01111235888999997752 Q ss_pred EEecCcchhHHHHHhcCchhhhhcccceEEEEEEEEecCceeccCccCCCCccce--EEEEeeeeccCCcccccchHHHh Q lcl|Aclame:pro 282 FWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGPVEIPSTTIPV--IPVWGKSLIIKKKEIFRSIIRHS 359 (711) Q Consensus 282 ~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~le~~~p~~~~~~P~--vp~~~~~~~~~~~~~~~g~v~~~ 359 (711) .+|||++-+++.+|+. +|.+.|.--| +|++.+.....++..|-..+-.. T Consensus 259 ----------------------------~~V~w~~eg~~~~L~~-~~p~lgl~~ffPcPrpl~~~~~~ds~ipvpd~~~y 309 (663) T protein:vir:34 259 ----------------------------RKVDWYVEGYSAVLDT-QPDPLGLESFFPCPKPLLANWTTDKVVPRPDFVLA 309 (663) T ss_pred ----------------------------cEEEEEEcCcceeccc-CCCCCCCCCCCCCcccccceecCCCeecCCcHHHH Confidence 3566666666655542 3333332222 12222222234455665666699 Q ss_pred hHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHHHHhhcccCCCceEEeccc------ccCcCCccccCCccchHH Q lcl|Aclame:pro 360 KDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNFSLLTYIPQ------YQGDPGPRRQPPAAVPAA 433 (711) Q Consensus 360 ~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~~~~~~~~~~i~~~~~------~~~~~~i~~~~~~~~~~~ 433 (711) .+.|+.+|.++.++--+ ....+++++++.|+..++-+.+.... . +-++++... ++....|.+++-..+.+. T Consensus 310 ~~~~~E~n~~t~Rin~l-~d~ikv~gvy~~~~g~~i~~~l~~a~-~-n~lvpV~~~~~~~~~gg~~k~I~~~pi~~~~~a 386 (663) T protein:vir:34 310 QDLYKEIDLVSTRITLL-ERAIRVVGVYDKSSGLTIGRLLSEAA-Q-NDLIPVENWLTFADKGGLRGVVDWFPLEPVVAA 386 (663) T ss_pred HHHHHHHHHHHHHHHHH-HhhhhhceeeccccchhHHHHHHHhh-C-CCceecchhhhhhhhcCccchhhcccchhHHHH Confidence 99999999887776554 44578999999888776655554332 1 234444221 122245788888888888 Q ss_pred HHHHHHHHH---HHHHHHhCCCHHHhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccce Q lcl|Aclame:pro 434 ELTLGQNSV---EKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERV 510 (711) Q Consensus 434 ~~~ll~~~~---~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~ 510 (711) +..|.+... ..+.++||+.|++.|....+.|+.+.+.+++.|+.+++.+.+.+.++.++++++..+.|.+.++-+.+ T Consensus 387 I~~l~~~r~qir~d~~qITGiaDi~Rga~~a~ETatAQ~IKsq~gS~RIqe~qdevqR~arDi~ql~AEIl~~~~~~etl 466 (663) T protein:vir:34 387 LTSLRDYRRELVDALHQVTGMADIMRGASDPRETAMAQGVKAKFGSIRLQRLQDEVARFASDIQRLKAEVIAEHYDVASI 466 (663) T ss_pred HHHHHHHHHHHHHHHHHHHhHHHHhhcccCcchhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCHHHH Confidence 888776544 56778899999999988877888888889999999999999999999999999999999999999988 Q ss_pred EeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEeeccc----ChHHHHHHHHHHHHHHHhhc---------- Q lcl|Aclame:pro 511 VRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGP----AFATQRIEAAEAMIQFAQAV---------- 576 (711) Q Consensus 511 ~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~----~~~s~r~~~~~~L~~l~~~~---------- 576 (711) -+|+|..-.. -++|..... .+.||- ...|.|.|..+. +....++...+.|..+.... T Consensus 467 ~~m~~~elp~-~~ei~~~~~-------~L~n~~-~r~~~ldIe~dsT~~~D~~~eK~~~~E~l~~i~~~~qq~~pl~~q~ 537 (663) T protein:vir:34 467 LAQANAEFTF-DKELAPKAA-------ELIKSR-FSMYRVEVKPEAVSLQDFAALRNEKMEVLSGIASFMQGVAPLAQQV 537 (663) T ss_pred HHHhcCCCCc-ccchhHHHH-------HHhcCC-CcceeeeeccCCCCcCChHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 8888754321 222222111 122331 134555554442 23334444444443333322 Q ss_pred chhHHHHHHHH-HHhcCCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 577 PSAAAVMADLI-AQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEADTAQA 655 (711) Q Consensus 577 p~~~~~~~~~~-~~~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~~~~k~qae~~~a 655 (711) |...+.+..++ .-...+.....+...+.+..........++ .++.+..+....++.+++.+++.+.++| T Consensus 538 p~~~p~l~Ellk~~~~~f~~~~qie~ai~~~~~~~e~aa~~~----------~~~~pa~~~~~~k~~~~q~k~q~~~aeA 607 (663) T protein:vir:34 538 PGSAPFLLQMLKWSVSGLRGSSTIEGVLDKAIAAAEEAQKQA----------AQQSPAPQQPDPKVVAQAMKGQQEMAKV 607 (663) T ss_pred hhhHHHHHHHHHHHhhcCChhhhHHHHHHHHHhhhHHHhhcc----------CCCCcccchhhHHHHHHHHHHHHHHHHH Confidence 22222111111 112233333333222222221111000000 0011111111222222233333333333 Q ss_pred HHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---hhc Q lcl|Aclame:pro 656 QADM----LKAQLETEEAQKQLAMIEDMAQGGDVVYQQVRELVAQALAEITASQA---NVT 709 (711) Q Consensus 656 qae~----~~~q~~~~~~q~q~~~~~~~~q~~~~~~~~~~~~~~~~~~e~~~~qa---~~e 709 (711) |++. .+.|++++..+ ..++.+....+....+..+..+++..+..++ -+. T Consensus 608 q~e~q~~~~~~ql~~~~~~-----~k~~~~a~~~~~~a~q~~~~~~~~r~~~~~a~~~~~~ 663 (663) T protein:vir:34 608 QAEVQGDLLRIQAETQANE-----TKERQQAEWNVREAAQKNLISQAARAMNPQARNGGMP 663 (663) T ss_pred HHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHhhHHHHHHHhhchhhhcCCCC Confidence 3222 22222221111 1111111111111112222222211111000 011 No 22 >protein:vir:94599 Length: 641 # NCBI annotation: PfWMP4_39 # Family: family:all:1548 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762669;genbank:gi:115304377;genbank:GeneID:5142299 Probab=100.00 E-value=9.1e-44 Score=256.56 Aligned_cols=580 Identities=11% Similarity=0.081 Sum_probs=329.3 Q ss_pred CCcCCCCCCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCC-----CCCHHHH---HH Q lcl|Aclame:pro 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGE-----QWPSQVR---TE 72 (711) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~-----Qw~~~~~---~~ 72 (711) |.---|...++ .++....+...+++...+..+|+.+.+..+.|..+|.++++||.+. .....+. .. T Consensus 1 ~~~~~~~~~~~------~~~~~~~~~~~~~~~~~l~~~~~~~~~~R~~~e~~W~e~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (641) T protein:vir:94 1 MTIEMPTPIIE------DKESAKRKLSTDRIGGVVISKWQESRDKRNTVENNWDETYELYRASAIDRQNTRARNFQTTGA 74 (641) T ss_pred CccCCCccccc------CCcchhhcCCchhHHHHHHHHHHHHHHhhcchHHHHHHHHHHhhcchhhhhhccccccccccc Confidence 54444444444 2223333344555788888999999999999999999999988542 1100000 00 Q ss_pred HHHhCCCceEehhhHHHHHHHhhhhhh----cccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHH Q lcl|Aclame:pro 73 RELEQRPCLVNNVLPTFVDQVLGDQRQ----NRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKN 148 (711) Q Consensus 73 ~~~~g~p~~~~N~i~~~v~~i~g~~~~----~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~ 148 (711) -...+|..++.+.+.-.++++...... ++.-+++.|. +.+|.+.|++++..+++ T Consensus 75 ~~~~~r~ki~~~~~~~~~~~l~s~Lm~~~~p~~~wf~~~p~----------------------~~ed~~~A~~~~~~~~~ 132 (641) T protein:vir:94 75 DDADWRHRINTGHTFEVVETLVAYFKGATFPSDDWFDLKGM----------------------VPELADAARVVKQLTKT 132 (641) T ss_pred chhcccccccchhHHHHHHHHhhHHhhhhcCCCceEEEecC----------------------CCChHHHHHHHHHHHHH Confidence 122345678888888899888765544 2333466665 57899999999999999 Q ss_pred HHhhcCHHHHHHHHHHHHHhcCccEEEEEEeeccC-----------CCCC-----------CcceEEEecCccceeeCCC Q lcl|Aclame:pro 149 IEYNCDAETEYDIAFQGAVESGMGYLRVRSDYLAD-----------DSFE-----------QDLIIEAIQNQFSVTIDPD 206 (711) Q Consensus 149 ~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~-----------~~~~-----------~~i~i~~v~~~~~v~~Dp~ 206 (711) .+..|++...++..+++++..|+|++++.|+.... +.++ ..++++.| +|.+|||||. T Consensus 133 ~l~~~~~~~~~~~~~~d~~~~g~~iv~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~r~~~v-~~~di~~dps 211 (641) T protein:vir:94 133 KLEAASIRDIFETYVRNLVLYGVSTYRLGWDTSMERQFKRTFVETGDIFGGWEDVAVNRQRSELRIEPL-SPYDVWLDTS 211 (641) T ss_pred HHhhcchHHHHHHHHHHHhhcCceEEEeehhhHHHHhhhhhcccchhhcccccccceecccceeeEEec-chhheeecCC Confidence 99999999999999999999999999998764311 1111 23456666 7999999998 Q ss_pred ccccCccccceeeee-ecCCHHHHHHh--cCCcccchhhcccccc--cc-----cCCCCCeEEEEEeeeeeeeceeEEEc Q lcl|Aclame:pro 207 AKKRDRSDMNWCLID-DTMSKEKFKAL--YPDATAEPVYEDSVAD--YD-----TWFTEKSVRVSEYFTREPVIREIALL 276 (711) Q Consensus 207 a~~~d~~Da~~~~~~-~~~~~~e~~~~--~p~~~~~~~~~~~~~~--~~-----~~~~~~~v~v~E~~~~~~~~~~~~~~ 276 (711) ++.. +..|++++ +.+++.++... |+.............. .+ ...+..+.+++|||.. T Consensus 212 ~~~~---~~~f~~~r~t~~t~~~l~~eg~~~~d~v~~~~~~~~~~~~~d~~~d~~~~~~~~~~~~e~~gd---------- 278 (641) T protein:vir:94 212 GGKN---TGTFVRLRHTREELHELVTSGYYDLDLTQVEQYVDYKFADPDTPKDVNGTDTSGWDIIEYYGP---------- 278 (641) T ss_pred CCcc---cccceehhhhHHHHHHHHhcCCCChhhcchhhcccccccccccccccccccccccceeeeeee---------- Confidence 8542 44554433 45555555544 3222222111111000 00 0011111122232210 Q ss_pred cCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEEecCceec-cCccCCCCccceEEEEeeeeccCCcccccch Q lcl|Aclame:pro 277 SDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLE-GPVEIPSTTIPVIPVWGKSLIIKKKEIFRSI 355 (711) Q Consensus 277 ~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~le-~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~ 355 (711) . .........+ +..+.|.++|. +.++++ ..+||+.+ .+.+++++.||.|+ T Consensus 279 ---------~----------------~~d~~~~~~~-~~~~~g~~il~~~~~~~~-d~~Pf~~~--r~~~~~~~~YG~gp 329 (641) T protein:vir:94 279 ---------L----------------LVEGVQFWCV-HAVFYGKQLIRLSDSKYW-CGSPFVTT--TLLPDRDSVYGMSV 329 (641) T ss_pred ---------e----------------ccCCCceeeE-EEEEeCCEEeeccccccc-CcCCeEEe--cceecCCcccCCCh Confidence 0 0001111222 34456777774 455543 45699854 45567899999999 Q ss_pred HHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHHHHhhcccCCCceEEecccccCcCCccccCCcc-chHHH Q lcl|Aclame:pro 356 IRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAA-VPAAE 434 (711) Q Consensus 356 v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~~~~~~~~~~i~~~~~~~~~~~i~~~~~~~-~~~~~ 434 (711) +..+.+.|+.+|++.+.+++++..+++|++++..+++.+.++. ...||+++..+... .++++.+.. ..... T Consensus 330 ~~~~l~dqk~ln~l~r~~ld~~~~~~~p~~~~~~~~~~~~~~l----~~~PG~ii~~~~~~----~v~pl~~~~~~~~~~ 401 (641) T protein:vir:94 330 LHPNLGALHVLNVLTNGRLDNLVLHINKMWTLVEDGILKREDV----KAKPGAVFKVAQHG----SLQPIDMGRQDFVVT 401 (641) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhCCeeeecccccccccee----eccCCcceeeCCCC----cceeecCCccccchh Confidence 9999999999999999999999999999999988887665432 46689988765432 244443322 23334 Q ss_pred HHHHHHHHHHHHHHhCCCHHHhcccc---chhHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhcCccce Q lcl|Aclame:pro 435 LTLGQNSVEKIKSTMGMYDASLGAMG---NETSGRAIIARQRQGDRGSFAFIDNLTK-SIRRVGKILVEMIPHIYDTERV 510 (711) Q Consensus 435 ~~ll~~~~~~~~~~tGv~~~~~G~~~---~~~sg~ai~~~~~~~~~~~~~~~dn~~~-~~~~~~~~~l~li~~~~~~~r~ 510 (711) .+++++....+++.+|+....+|..+ ++.|+..+.++.++++.++..+.++|+. +++.+++.+++++.++++.+.+ T Consensus 402 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~TAtEV~~~~~e~~~~l~~i~r~l~~e~l~pll~~~~~~~~~~~~~p~i 481 (641) T protein:vir:94 402 YQEAQVQESSVYRNTSTGPLIGNAAPRGGERVTAAEIQGVRDAGGNRLSSVHTHIEDSSTLPLLNKVFSLLQQFYVTPET 481 (641) T ss_pred HHHHHHHHHHHHHhhhhhhhhcccccccchhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccchhh Confidence 56778888889999998888777554 3568899999999999999999999985 8999999999999999999999 Q ss_pred EeeecccCcc-hheecchhhhhhhccceeeeehhhheeeeEEeecccChHHHHHHHHHHHHHHHhhc---chhHHH---- Q lcl|Aclame:pro 511 VRLKFPDETE-DFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAV---PSAAAV---- 582 (711) Q Consensus 511 ~ri~g~~~~~-~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~~---p~~~~~---- 582 (711) +|+.|..... .++.+.+ .++ .+.+++ +..+.+....+.+..+.|.++++.. |++... T Consensus 482 ~R~~~~~~~~~~~~~~~p-------------~~L-~~~~~i-v~l~~~q~~~~~~~i~~l~~~~~~~a~~P~v~d~~d~~ 546 (641) T protein:vir:94 482 IRMYVPEEQMDGFFEVSP-------------EYL-HYPYKF-LALGANYVVERERMVTDLLQLLDISGRVPQIGQSLDYA 546 (641) T ss_pred hhhhchhhhcccCCCCCc-------------cce-eeeeeE-eecchhHHHHHHHHHHHHHHHHHHhhcChhhhhcCCHH Confidence 9999864221 2222222 122 356666 4555555555655555555555432 221110 Q ss_pred -HHHHHHHhcCCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 583 -MADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEADTAQAQADMLK 661 (711) Q Consensus 583 -~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~~~~k~qae~~~aqae~~~ 661 (711) ++..+++.+.++....+++ ..+ .+ +++.+.+++++| ++.. .++++-...+.. + T Consensus 547 ~~~~~~~~~~g~~~p~~~ir------------~~~-~~----~~~~~~~~~~~q----~~~~--~~a~~~~~~~~~---~ 600 (641) T protein:vir:94 547 LILEDLLRQMRFTDPMRYIK------------KAE-AP----PAAPPIAPAEPG----ALPP--EMMNSVGGGLND---Q 600 (641) T ss_pred HHHHHHHHHhCCCCchhhcc------------Ccc-Cc----hhHHHHHHHHHH----HHHH--HHHHHHHhhhHH---H Confidence 1111222222222221110 000 00 000000000000 0000 111110000000 0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccC Q lcl|Aclame:pro 662 AQLETEEAQKQLAMIEDMAQGGDVVYQQVRELVAQALAEITASQANVTEQ 711 (711) Q Consensus 662 ~q~~~~~~q~q~~~~~~~~q~~~~~~~~~~~~~~~~~~e~~~~qa~~e~Q 711 (711) +.+.+...+++. ...++. .+. ....+.++++.-.++.+| T Consensus 601 a~~~~~~~~~~~--~~~~~~-----~~~----~~~~~~~~~~~~~~~~~~ 639 (641) T protein:vir:94 601 AIAGMTPEDVSD--LASRIG-----IDT----SDVAPEAMAAATQQITSG 639 (641) T ss_pred HHHHhhHHHHHH--HHHhhc-----CCc----hhhhHHHHhccccccccc Confidence 000000001100 000000 000 011222222333344444 No 23 >protein:vir:3139 Length: 599 # NCBI annotation: hypothetical protein # Family: family:all:1548 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640321;genbank:gi:21234402;genbank:GeneID:956054 Probab=100.00 E-value=3.5e-41 Score=242.40 Aligned_cols=558 Identities=10% Similarity=0.051 Sum_probs=321.3 Q ss_pred CCcCCCCCCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhC--- Q lcl|Aclame:pro 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQ--- 77 (711) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g--- 77 (711) |. ++.+...+.--..++..++...+..+|.+..++.+.--++|.+.++|.+-. +....-..+ T Consensus 1 m~----------~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~r~~~~~~w~e~~~yi~~~-----~tr~t~~~~~~w 65 (599) T protein:vir:31 1 MS----------TDIKTLQKMLEGRDDDRAFIDELVVLFTNMENARAQKDREDKELMDYIDAT-----DTRKTSNSKLPF 65 (599) T ss_pred Cc----------cchHHHHHHhhccCchHHHHHHHHHHHHhhhhhhhhhhcccHHHHHHHhhh-----cccccccCCCCc Confidence 21 111111111113567777777888888888777777777888888886421 000111111 Q ss_pred CCceEehhhHHHHHHHhhhhhh----cccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 78 RPCLVNNVLPTFVDQVLGDQRQ----NRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNC 153 (711) Q Consensus 78 ~p~~~~N~i~~~v~~i~g~~~~----~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~ 153 (711) +-.++.|++..+++.++.++-. |+.=++|.|-. .+.+-...++++..+++.-+..+ T Consensus 66 ~~s~t~~k~~~~~~~l~a~~~~~~fp~~~w~d~~~~~--------------------~~~~~~~~~~~i~~yi~~Kl~e~ 125 (599) T protein:vir:31 66 KNSTTINKLAHLHLMITTSYMEHLLPNRNWVDFVGFD--------------------NDSVNAEKREIARSYVRGKVEAS 125 (599) T ss_pred ccccchHHHHHHHHHHHHHHHhhhcCCccceEeeecC--------------------CchhHHHHHHHHHHHhhhhhhhc Confidence 2257899999999999876543 22223444421 12223456788889999999999 Q ss_pred CHHHHHHHHHHHHHhcCccEEEEEE-----eeccCCCCC--CcceEEEecCccceeeCCCccccCccccceeeeeecCCH Q lcl|Aclame:pro 154 DAETEYDIAFQGAVESGMGYLRVRS-----DYLADDSFE--QDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSK 226 (711) Q Consensus 154 ~~~~~~~~a~~~~~~~G~g~~~v~~-----d~~~~~~~~--~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~ 226 (711) ++..+++..+.+.+..|.++..+.+ .+++...+. -.+++++| +|.+|||||.|++ ++|+.||+ |...|+ T Consensus 126 ~~~~~~~~~v~d~i~~G~~vat~~~er~~~~~~d~~v~~~~~~P~~erv-sP~Di~~Dp~A~s--i~d~~fiv-Rs~~Tk 201 (599) T protein:vir:31 126 NLEGVIERMVDDFAVRGFCVAHTRHVKRMTVTAENQVIKNYSGTVTERL-SPSDVFWDVTADS--LPKAAKCI-RQLYTL 201 (599) T ss_pred chHHHHHHHHhhhcccCceeEeeeEEEcceeecccccccccccceEEee-cccceeeCCCCCC--CCcceeee-ehhhhH Confidence 9999999999999999988765443 222222221 34788888 7999999999975 56998887 888889 Q ss_pred HHHHHhcCC-----cccchhhcc---c-------cccccc--CCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcch Q lcl|Aclame:pro 227 EKFKALYPD-----ATAEPVYED---S-------VADYDT--WFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALED 289 (711) Q Consensus 227 ~e~~~~~p~-----~~~~~~~~~---~-------~~~~~~--~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~ 289 (711) .+|..+-.+ ...+.+... . ...+.. |++. . ..++.+..+++..+ T Consensus 202 ~~L~~l~~~~~~~~y~~d~~~~~~~~~~~~~~~~~d~~~~~~g~D~------~-------------~~d~~~~~~eY~~~ 262 (599) T protein:vir:31 202 GSLKREIEEGTFPLMSMEDFQKLREERRTIREALADGYNGRRKFDS------L-------------HKKGYGSMMNYINE 262 (599) T ss_pred HHHHHHhccCCccccchHHHHHHHhhccCCCccccchhhhhhhccc------c-------------ccccccchhhhccc Confidence 999886432 221111110 0 000000 1110 0 00111111111111 Q ss_pred hHHHHHhcCchhhhh-cccceEEEEEEEEecC-cee-ccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHH Q lcl|Aclame:pro 290 IVDELLEAGISIVRT-RKVKTFKTYWRKITGA-NVL-EGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMA 366 (711) Q Consensus 290 ~~~~~~~~g~~~~~~-~~~~~~~v~~~~~~g~-~~l-e~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~ 366 (711) ...+.++-. +.... ...+...-+...+.|+ +++ ...+|+|++++||+ ++.+.|..++.||+|+...+.+.|..+ T Consensus 263 ~~VevLeyw-Gd~ydee~d~~~~~~ViTi~g~~~liR~e~np~~~g~~Pyv--v~~~~P~~~~~yG~G~l~~~~gaQ~~l 339 (599) T protein:vir:31 263 GVVEVLTFM-GDFYDEENDELWNNYEITVIDRKIIGRKQSKDTWDGSQNLH--IAVYEFQKDTLCPIGPLHRLTGMQYKL 339 (599) T ss_pred chhhhhhhh-hhhhcccCCccccceEEEEecCcEEeecccCCCCCCCCCeE--EEEeeeeccccCCCCCchhcchHHHHH Confidence 111111110 00000 0011111113445564 444 45799999999999 455667788999999999999999999 Q ss_pred HHHHHHHHHHHHhcCCCceEecccccCChHHHHhhcccCCCceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHH Q lcl|Aclame:pro 367 NYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIK 446 (711) Q Consensus 367 N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~~~~~~~~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~ 446 (711) |.+.+.+++++.+...+ ++...+.+...+ ....|++++...... ..+++.+++-......++++....++ T Consensus 340 N~~~Ng~iD~~~~~l~p-~l~~~~dl~~eD-----~~~~P~~v~~~~d~~----~vq~~~p~s~~~~a~~~is~~e~~me 409 (599) T protein:vir:31 340 DKRENFREDLHDRFLHP-SLKKVGDVREKG-----MRGGPNHVFEVEETG----DVQYMTPPAEVLQPDNQLSITLQLME 409 (599) T ss_pred HHHHHHhhhhhhhhhcc-cccccccccccC-----ccCCCCcceeecCCC----ccccccCchhhhhHHHHHHHHHHHHH Confidence 99999999999988766 333333343321 113478888776432 34556665555556668888889999 Q ss_pred HHhCCCHHHhccccc-hhHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhcCccceEeeeccc-Ccchhe Q lcl|Aclame:pro 447 STMGMYDASLGAMGN-ETSGRAIIARQRQGDRGSFAFIDNLTK-SIRRVGKILVEMIPHIYDTERVVRLKFPD-ETEDFV 523 (711) Q Consensus 447 ~~tGv~~~~~G~~~~-~~sg~ai~~~~~~~~~~~~~~~dn~~~-~~~~~~~~~l~li~~~~~~~r~~ri~g~~-~~~~~v 523 (711) +.||++.++.|..+. ..++..++++.++++.+.+.+.+.+.+ ..+.+.+.++++..+|+|++-++||+++. |...|+ T Consensus 410 e~sGvp~~~~G~~~ag~~TA~~is~l~naa~~~~~~~vr~~e~~~lepll~~l~e~~~~f~D~~~tiri~~~e~~~~~f~ 489 (599) T protein:vir:31 410 DLSGAPKESIGQRTAGEKTKFEVQLLDQGQNKVFRRKVKKFERELLTPVLNDYLEQGRNHLDASDTIKTFNSELGTATFL 489 (599) T ss_pred HhhccchhhcCCcccchhhHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccceeeecccccceeeE Confidence 999999999996654 468899999999999999999999988 56669999999999999999999999976 667888 Q ss_pred ecchhhhhhhccceeeeehhhheeeeEEeecccChHHHHHHHHHHHHHHHhh--cchhHHHHHHHHHHhcCCcchHHHHH Q lcl|Aclame:pro 524 KLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQA--VPSAAAVMADLIAQNMDWPGADVIAE 601 (711) Q Consensus 524 ~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~--~p~~~~~~~~~~~~~~~~~~~~e~~~ 601 (711) .+.+..+ .+.+++ +..|...-..|++..+-|.++++. .+...+.+.. .++.. T Consensus 490 ~i~redl--------------~~~~~~-v~~Ga~~v~ere~~~q~l~~il~~~~~q~~~P~~~~-----------k~l~~ 543 (599) T protein:vir:31 490 DITADDL--------------NLNGQM-VAQGATLFAEKANTLQNLNAILGGPLGAALAPHMSR-----------TKLFN 543 (599) T ss_pred Eeehhhh--------------hCCeee-eechhhHHHHHHHHHHHHHHHhcccCCCccchhhHH-----------HHHHH Confidence 8865433 244666 555554444566666666666642 1111111111 12222 Q ss_pred HHHhhhc--chhhcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 602 RLKKIVP--PNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQLETEEAQ 670 (711) Q Consensus 602 ~l~~~~~--~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~~~~k~qae~~~aqae~~~~q~~~~~~q 670 (711) .+..... ..+..++.....++ ++++++ +|++++ ++-+.+..+-+--.--++ +.| T Consensus 544 ~l~~~~~l~~~~~~~~~va~~eq----------q~~~~m--~Q~~lq-~~~~~~~~~~~~~~~~~~--~~~ 599 (599) T protein:vir:31 544 AVEYLGDLDAYGIFTFGIGVQED----------QQLARM--AQKSTQ-QTEETALTQEEVGGPTTD--TGQ 599 (599) T ss_pred HHHHHHhccccccCCCchhHHHH----------HHHHHH--HHHHHH-HhHhhhhhhhhcCCCCcc--cCC Confidence 2211110 11111111111110 011111 111110 000000000000000000 000 No 24 >protein:vir:103765 Length: 549 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024925;genbank:gi:48697195;genbank:GeneID:2846089 Probab=99.91 E-value=2.6e-22 Score=138.92 Aligned_cols=531 Identities=12% Similarity=0.086 Sum_probs=275.7 Q ss_pred CCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCC---C--CCCHHHHHHHHHhCCCceEehhhHHHHHHHhh Q lcl|Aclame:pro 21 YAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGG---E--QWPSQVRTERELEQRPCLVNNVLPTFVDQVLG 95 (711) Q Consensus 21 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G---~--Qw~~~~~~~~~~~g~p~~~~N~i~~~v~~i~g 95 (711) +.. ++..++.++..+|.........|...|.++.+|..- . -++..+...-..+. +.+.-..-...++...+ T Consensus 1 m~~---d~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~-~~~~dstg~~a~~~LAs 76 (549) T protein:vir:10 1 MTN---DDAKILQALNADHGRMKEKRQSYEAVWNDVIDYLMPRLDKFGQLPRPDSEKGRERS-QKMFDSTAPLALRNFVA 76 (549) T ss_pred CCc---chHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccccccccCCCCCCcccccc-cccccchHHHHHHHHHH Confidence 443 456788899999999999999999999999999742 1 12221111111010 11111222222333322 Q ss_pred hhhh-----cccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHH---HHHHH--HhhcCHHHHHHHHHHH Q lcl|Aclame:pro 96 DQRQ-----NRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTG---LIKNI--EYNCDAETEYDIAFQG 165 (711) Q Consensus 96 ~~~~-----~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~---~~~~~--~~~~~~~~~~~~a~~~ 165 (711) ..-. +++=.++.+.+. ...+.....+-|.. .+..+ ...++|......++.+ T Consensus 77 ~l~~~ltpp~~~wF~l~~~~~-------------------~~~e~~~v~~~l~~ve~~~~~~~~~~~snf~~~~~~~~~~ 137 (549) T protein:vir:10 77 AMDSMITPATQLWHRLKTGND-------------------ALNEIASVKAYLQGVVRTLFAARYRWQGGFVTQMGATYQS 137 (549) T ss_pred HHHhhccCCCCccccccCCcc-------------------chhhhhHHHHHHHHHHHHHHHHHhhhhcChHHHHHHHHHH Confidence 2111 111122222110 00111112222333 32222 2368899999999999 Q ss_pred HHhcCccEEEEEEeeccCCCCCCcceEEEecCccceeeCCCccccCccccceeeeeecCCHHHHHHhcCCcccc-hhhcc Q lcl|Aclame:pro 166 AVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAE-PVYED 244 (711) Q Consensus 166 ~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~-~~~~~ 244 (711) .+..|+|++-+.. ++ ++-+++..+ +..++++..++. -+ ..-+|++..||...+.++||.++.. .+... T Consensus 138 L~~~Gta~l~~~~-----~~-~~~~~f~~~-pl~~~~v~~d~~-G~---vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~ 206 (549) T protein:vir:10 138 IGLFGPGALMIEH-----DV-GKGIVYRNV-PMQRLWFAENNS-GL---IDKTHVQWELTLRQAAQRFGRENLSPSMQST 206 (549) T ss_pred HHhhcceeeEEee-----cC-CCeeEEEEE-EcCeEEEeeCCC-CC---eEEEEEEeecCHHHHHHhcCcccCCHHHHHH Confidence 9999999875532 11 234567776 578888877654 22 2338899999999999999986543 22222 Q ss_pred cccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEEecCceec Q lcl|Aclame:pro 245 SVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLE 324 (711) Q Consensus 245 ~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~le 324 (711) ...+ ..+.+.|+++=|.+..... .. ...+ .+.+..+|+-..+.+++. T Consensus 207 ~~~~-----~~~~~~v~~~V~pr~~~~~--~~-------------------------~~~~-~~pf~sv~~e~~~~~il~ 253 (549) T protein:vir:10 207 LEKD-----PEKSAIFYHAVEPRADRDP--RK-------------------------LDGR-NMQFASYWLDEGRDRIVQ 253 (549) T ss_pred hhcC-----CCceEEEEEEeecCCCCCc--cc-------------------------cccc-cCceEEEEEEecCCEeec Confidence 2111 1345555443222111000 00 0011 122333444455666664 Q ss_pred cCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHHHHhhccc Q lcl|Aclame:pro 325 GPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANT 404 (711) Q Consensus 325 ~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~~~~~ 404 (711) . +.| .++||+|+- +..+++..||.|.+....+-.+.+|.+...++..+..+.+|+++++.+.+.+..+ . T Consensus 254 e-sg~--~e~P~~~~R--w~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~~~g~~~~~~------l 322 (549) T protein:vir:10 254 N-SGF--RTFPFAIGR--FYVGTDDVYGGSPAYDAMPDVRMANDMAKTNIRGAQKLVDPPLLANEDGVLDGFD------L 322 (549) T ss_pred c-CCc--ccCCcceee--eeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccccce------e Confidence 3 333 568999764 4456889999999999999999999999999999999999999998877665432 3 Q ss_pred CCCceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 405 KNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFID 484 (711) Q Consensus 405 ~~~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~d 484 (711) .||++..+..+...+..+.++....-......+++...+.|....=+....+-.++.+.|+.-|..+.+.....+..... T Consensus 323 ~pgg~~~~~~~~~~~~~~~pl~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~~~~~~~~TAtEV~~r~~E~~~~LGpv~~ 402 (549) T protein:vir:10 323 RSGALNWGGLNDKGEEMVKPLLTGKQAQIGIEFAQDTRQTINQWFYVTLFQILVDSGDMTATEVLQRAQEKGVLLAPTLG 402 (549) T ss_pred ccCCccccccCCCCccceeeeccccchhHHHHHHHHHHHHHHHHHhhhhhhhhcCCCCccHHHHHHHHHHHHHHhhHHHH Confidence 57776555444333344554444444455666777777777776533332333456678999999999988889888888 Q ss_pred HHH-HHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEeecccChHHHHH Q lcl|Aclame:pro 485 NLT-KSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRI 563 (711) Q Consensus 485 n~~-~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~ 563 (711) +|. ++...+.+..++++.+.---|. +-....+ .| ..++|.. .++-...++. T Consensus 403 rl~~E~l~Pli~R~~~il~r~g~lP~---------------~p~~l~~--~~----------~~~~i~y-is~La~aq~~ 454 (549) T protein:vir:10 403 RTQSELLGPMIAREVDILAEAGQLPD---------------MPQELID--AG----------ADVDVEY-DSPLNKAMRA 454 (549) T ss_pred HHHHHHHHHHHHHHHHHHHhcCCCCC---------------CChhhhc--CC----------ceeEEEe-ecHHHHHHHH Confidence 885 5666776666666655310000 0000000 00 0122222 2233344555 Q ss_pred HHHHHHHHHHhhcchhHHHHHHHHHHhcCCcchHHHHHHHHhhhcchhhcchhhhhhhhhHH-HHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 564 EAAEAMIQFAQAVPSAAAVMADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMP-EQTEPTPEQQVEMAKSQ 642 (711) Q Consensus 564 ~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~-~~q~~~~~~q~~~~~~q 642 (711) .....+.++++....+.++ .+.+ ++.-+.+++.+.+....+-....--..++.++..+ .+++++.+++.+.+... T Consensus 455 ~~~~~i~~~~~~~~~laq~-~Pe~---ld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~~~~~qqq~~~~~~~a~~a 530 (549) T protein:vir:10 455 GEGAAILQWLQQLGIVSQF-DPAA---AKVPNGARIARLLADYGGVPVEAMSTDEELQAQQAAEAQAAQMQQMLAAAPVA 530 (549) T ss_pred HHHHHHHHHHHHHHHHhcc-ChhH---HhcCCHHHHHHHHHHhcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 5555555555544333221 2222 23345567777776665543211111111111111 11111110000010000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 643 ADMAQAEADTAQAQADMLKAQLETEEAQKQLAMI 676 (711) Q Consensus 643 ~~~~k~qae~~~aqae~~~~q~~~~~~q~q~~~~ 676 (711) +..++. .+++..+ .+-+.. T Consensus 531 ~~~a~~--------------~~~~~ta-~~~~~~ 549 (549) T protein:vir:10 531 AGAIKD--------------LSDAQTA-AQTARV 549 (549) T ss_pred HHHHHh--------------hhhhcCC-CcccCC Confidence 000000 0000000 010111 No 25 >protein:vir:7321 Length: 556 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848212;genbank:gi:30387383;genbank:GeneID:2641872 Probab=99.90 E-value=1.1e-21 Score=135.44 Aligned_cols=542 Identities=11% Similarity=0.028 Sum_probs=269.2 Q ss_pred CCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhC--CCCCCHHHHHHHHHhCCCceEehhhHHHHHHHhhhhh Q lcl|Aclame:pro 21 YAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLG--GEQWPSQVRTERELEQRPCLVNNVLPTFVDQVLGDQR 98 (711) Q Consensus 21 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~--G~Qw~~~~~~~~~~~g~p~~~~N~i~~~v~~i~g~~~ 98 (711) +.. ..-.++.++|+......+.|...|.++.+|.. ..-|...+...-. +..+.+..+.....++.+.+..- T Consensus 1 m~~------~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~-~~~~~~~dst~~~a~~~Las~l~ 73 (556) T protein:vir:73 1 MAE------TEKERLLKQLAQLKNERTSFESHWLDLSDFINPRGSRFLTSDVNRDD-RRNTKIVDPTGSMAQRILSSGMM 73 (556) T ss_pred CCh------hhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCcCCCCCCcch-hhcCccccchHHHHHHHHHHHHH Confidence 111 11446777888888889999999999999972 2224332222111 11233344444444544433222 Q ss_pred h-----cccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCccE Q lcl|Aclame:pro 99 Q-----NRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESGMGY 173 (711) Q Consensus 99 ~-----~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~g~ 173 (711) . +++=+++.+.+.. ......-..--+.++..+......++|......++.+.+..|+|+ T Consensus 74 ~~ltpp~~~WF~l~~~d~~----------------~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ 137 (556) T protein:vir:73 74 SGITSPARPWFKLATPDPD----------------MMDYGPVKIWLEVVQRRMNEVFNKSNLYQSLPVMYASLGTFGTGA 137 (556) T ss_pred HhhcCCCCcccccccCccc----------------ccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcee Confidence 1 2222344332200 000000011123355566666778999999999999999999999 Q ss_pred EEEEEeeccCCCCCCcceEEEecCccceeeCCCccccCccccceeeeeecCCHHHHHHhcCCcccc-hhhcccccccccC Q lcl|Aclame:pro 174 LRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAE-PVYEDSVADYDTW 252 (711) Q Consensus 174 ~~v~~d~~~~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~-~~~~~~~~~~~~~ 252 (711) +-+.. ++ ++-+++..+ +..++++..++.- + ..=++++..|+..++.++|+.++.. .+......+ T Consensus 138 l~~~~-----~~-~~~~r~~~~-~l~~~~~~~d~~G-~---vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~---- 202 (556) T protein:vir:73 138 MAVME-----DD-QDVIRTMPF-PIGSYYLANSPRG-S---VDTCIRQFSMTVRQMVQEFGLDNVSTSVKGMWENG---- 202 (556) T ss_pred eeeee-----cC-CceEEEEEe-ecceeEEeeCCCC-C---eEEEEEEEeccHHHHHHHcCcccCCHHHHHHHhcC---- Confidence 74432 21 133677777 6889999887642 2 2237888999999999999976533 222221111 Q ss_pred CCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEE-ecCceeccCccCCC Q lcl|Aclame:pro 253 FTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKI-TGANVLEGPVEIPS 331 (711) Q Consensus 253 ~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~-~g~~~le~~~p~~~ 331 (711) .....+.|+++=|.+.... +++ ...+......+||... .+.+++.. +.| T Consensus 203 ~~~~~~~v~~~V~pr~~~~-----~~~----------------------~~~~~~p~~s~~~~~~~~~~~vl~e-sg~-- 252 (556) T protein:vir:73 203 TYETWVEVNHCITPNVNRD-----SGK----------------------MDSKNKPYRSVYFESGGDSDKLLRE-SGF-- 252 (556) T ss_pred CccceEEEEEEEecccccc-----ccc----------------------cCcccceEEEEEEEecCCCceeccc-CCc-- Confidence 1123455544322211000 000 0011112223333322 23445532 334 Q ss_pred CccceEEEEeeeeccCCcccccc-hHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHHHHhhcccCCCceE Q lcl|Aclame:pro 332 TTIPVIPVWGKSLIIKKKEIFRS-IIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNFSLL 410 (711) Q Consensus 332 ~~~P~vp~~~~~~~~~~~~~~~g-~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~~~~~~~~~~i 410 (711) .++||+|+- +..+++..||.| .+....+-.+.+|.+...++..+..+++|++.++.+..... ....||+++ T Consensus 253 ~e~P~~~~R--w~~~~ge~YGrg~P~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~~~------~~~~pgg~~ 324 (556) T protein:vir:73 253 DEFPILAPR--WEVNGEDVYASSCPGMLALGQVKALQVEQKRKAQLIDKATNPPMVAPTSLKNQR------VSLLPGDVT 324 (556) T ss_pred ccCCceeee--eeecCCcccccCccHHHhHHHHHHHHHHHHHHHHHHHHHhcCceeccccccccc------eeeccCccc Confidence 668999764 344689999999 49999999999999999999999999999999987754321 235678866 Q ss_pred EecccccCcCCccccCCc-cchHHHHHHHHHHHHHHHHHhCCCH-HHhc-cccchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 411 TYIPQYQGDPGPRRQPPA-AVPAAELTLGQNSVEKIKSTMGMYD-ASLG-AMGNETSGRAIIARQRQGDRGSFAFIDNLT 487 (711) Q Consensus 411 ~~~~~~~~~~~i~~~~~~-~~~~~~~~ll~~~~~~~~~~tGv~~-~~~G-~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~ 487 (711) ....... ...++++... .--..+.+.++...+.|....-.+- .+++ .++.+.|+..|..+.+.....+..+..+|. T Consensus 325 ~~~~~~~-~~~i~p~~~~~~d~~~~~~~i~~~~~rI~~af~~d~~~~l~~~~~~r~TAtEv~~r~~E~~~~LG~v~~rl~ 403 (556) T protein:vir:73 325 YLDVISG-QDGFKPAYLVNPNTADLLADIQDTRQTINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLN 403 (556) T ss_pred cccCCCC-ccceeeeccccccHHHHHHHHHHHHHHHHHHhhcchhhhhccCCCCCccHHHHHHHHHHHHHHhhHHHHHHH Confidence 5543222 2234433211 1233445556666666766543322 1234 344568999999999988888898888885 Q ss_pred -HHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEeecccChHHHHHHHH Q lcl|Aclame:pro 488 -KSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAA 566 (711) Q Consensus 488 -~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~ 566 (711) ++...+....++++.+.---|.. +.. +....+.|... ++-...++.... T Consensus 404 ~E~l~Pli~r~~~il~r~g~lP~~----------------P~~-------------l~~~~i~v~yi-s~La~aqk~~~~ 453 (556) T protein:vir:73 404 DEALNPLIDRVFSIMARKNMLPEP----------------PDV-------------LQGMPLRIEYI-SVMAQAQKSIGL 453 (556) T ss_pred HHHHHHHHHHHHHHHHhcCCCCCC----------------chh-------------hcCceeEEEee-cHHHHHHHHHHH Confidence 46666777666666653110000 000 10111222221 222333444444 Q ss_pred HHHHHHHhhcchhHHHHHHHHHHhcCCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 567 EAMIQFAQAVPSAAAVMADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQADMA 646 (711) Q Consensus 567 ~~L~~l~~~~p~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~~~~ 646 (711) ..+.++++....+.++ .+. .++.-+.+++.+.+....+-..-.--.+++.++..++.++++++++ ++++++ + T Consensus 454 ~~i~~~~~~~~~laq~-~Pe---~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~r~~~qq~~~-~~~~~~--~- 525 (556) T protein:vir:73 454 TSLSQTVGFIGQLAQF-KPE---ALDKLDVDQAIDAFSEMSGVSPTVIVPQEQVQGIREERAKQAQAAQ-AMAMGQ--A- 525 (556) T ss_pred HHHHHHHHHHHHHhcc-Chh---hHhcCCHHHHHHHHHHHcCCChhhcCCHHHHHHHHHHHHHHHHHHH-HHHHHH--H- Confidence 4455544443332221 122 2334456777777766654432111111111111111111100000 000000 0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 647 QAEADTAQAQADMLKAQLETEEAQKQLAMIEDMAQGGDVVYQ 688 (711) Q Consensus 647 k~qae~~~aqae~~~~q~~~~~~q~q~~~~~~~~q~~~~~~~ 688 (711) - ++..+.-+ .+...-..+ ++.+..++-+-.+ T Consensus 526 a--~~~~~~~~---~~~~~~~~~------l~~~~~~~g~~~~ 556 (556) T protein:vir:73 526 A--AQGAKTLS---ETQTSDPSA------LTAIANAAGAPQQ 556 (556) T ss_pred H--HHHHHHhh---hccCCCHHH------HHHHHHhhcCCCC Confidence 0 00000000 000000000 0000000000000 No 26 >protein:vir:95315 Length: 559 # NCBI annotation: putative head-to-tail-joining protein # Family: family:all:481 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512261;genbank:gi:89152428;genbank:GeneID:3952984 Probab=99.90 E-value=6.2e-22 Score=136.86 Aligned_cols=541 Identities=9% Similarity=0.018 Sum_probs=272.0 Q ss_pred CCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhC--CCCCCHHHHHHHHHhCCCceEehhhHHHHHHHhhhhh Q lcl|Aclame:pro 21 YAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLG--GEQWPSQVRTERELEQRPCLVNNVLPTFVDQVLGDQR 98 (711) Q Consensus 21 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~--G~Qw~~~~~~~~~~~g~p~~~~N~i~~~v~~i~g~~~ 98 (711) +. .....++..+|+......+.|...|.++.+|.. ...+...+...-. +..+.+..+.....++.+.+..- T Consensus 1 m~------~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~-~~~~~~~dst~~~a~~~Las~l~ 73 (559) T protein:vir:95 1 MA------ETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRND-RRNTRIIDSTGTMAARTLASGMM 73 (559) T ss_pred CC------hhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCcCCCCCCccc-ccccccccchHHHHHHHHHHHHH Confidence 22 122567788899999999999999999999972 2223322211111 11233334444444544433222 Q ss_pred h-----cccceeEecchhhhhhhhhcccccccccccCCCchhHHHH---HHHHHHHHHHHhhcCHHHHHHHHHHHHHhcC Q lcl|Aclame:pro 99 Q-----NRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELA---EVFTGLIKNIEYNCDAETEYDIAFQGAVESG 170 (711) Q Consensus 99 ~-----~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~A---e~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G 170 (711) . +++=.++.+.+. ......+.. +.++..+......++|..+...++.+.+..| T Consensus 74 ~~ltpp~~~WF~l~~~d~-------------------~~~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G 134 (559) T protein:vir:95 74 SGITSPARPWFRLATPDP-------------------EMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYS 134 (559) T ss_pred HhhcCCCCcccccccCCc-------------------cccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhC Confidence 1 222233333210 001112222 2334555566678999999999999999999 Q ss_pred ccEEEEEEeeccCCCCCCcceEEEecCccceeeCCCccccCccccceeeeeecCCHHHHHHhcCCcccc-hhhccccccc Q lcl|Aclame:pro 171 MGYLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAE-PVYEDSVADY 249 (711) Q Consensus 171 ~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~-~~~~~~~~~~ 249 (711) +|++-+..+ + +.-+++..+ +..++++..++.- ...=++++..||..++.++|+..... .+...... T Consensus 135 ta~l~~~~d-----~-~~~~r~~~~-~l~~~~v~~d~~G----~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~-- 201 (559) T protein:vir:95 135 TGAMAVLDD-----D-EDIIRTMPF-PIGSYYLANSPRG----SVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWES-- 201 (559) T ss_pred ceeeEeecC-----C-CceeEEEEe-ecCeEEEeeCCCC----CeEEEEEeEecCHHHHHHHcCcccCCHHHHHHHhc-- Confidence 998644322 1 234677777 6889999876642 23347788999999999999976543 22222211 Q ss_pred ccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEEec-CceeccCcc Q lcl|Aclame:pro 250 DTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITG-ANVLEGPVE 328 (711) Q Consensus 250 ~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g-~~~le~~~p 328 (711) ....+.+.|+++-|.+...... ....+.+....+||..-.. .+++.. +. T Consensus 202 --~~~~~~v~v~~~V~pr~~~~~~---------------------------~~~~~~~pf~s~~~e~~~~~~~~l~e-sg 251 (559) T protein:vir:95 202 --GTYEKWIEVMHSVYPNIDRDTS---------------------------KLDSKNKPFKSVYYEVGGDNDKLLRE-SG 251 (559) T ss_pred --CCCCCeEEEEEEEecccccccc---------------------------ccccccceEEEEEEEecCCCceeeec-CC Confidence 1112345555543322110000 0011112223344433222 345533 33 Q ss_pred CCCCccceEEEEeeeeccCCcccccc-hHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHHHHhhcccCCC Q lcl|Aclame:pro 329 IPSTTIPVIPVWGKSLIIKKKEIFRS-IIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNF 407 (711) Q Consensus 329 ~~~~~~P~vp~~~~~~~~~~~~~~~g-~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~~~~~~~~ 407 (711) | .++||+|+- +..+++..||.| .+....+-.+.+|.+....+..+..+.+|++.++.+..... ....|| T Consensus 252 ~--~e~P~~~~R--w~~~~ge~YGrg~P~~~al~d~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~~~------~~l~pg 321 (559) T protein:vir:95 252 F--DEFPIMAPR--WEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLKNQR------ASLLPG 321 (559) T ss_pred c--ccCCcccee--eeecCCccccccchHHHhhHHHHHHHHHHHHHHHHHHHHhcCceeccccccccc------eeeecc Confidence 4 568999764 445688999999 59999999999999999999999999999999987765322 235688 Q ss_pred ceEEecccccCcCCccccCC-ccchHHHHHHHHHHHHHHHHHhCCCH-HHhc-cccchhHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 408 SLLTYIPQYQGDPGPRRQPP-AAVPAAELTLGQNSVEKIKSTMGMYD-ASLG-AMGNETSGRAIIARQRQGDRGSFAFID 484 (711) Q Consensus 408 ~~i~~~~~~~~~~~i~~~~~-~~~~~~~~~ll~~~~~~~~~~tGv~~-~~~G-~~~~~~sg~ai~~~~~~~~~~~~~~~d 484 (711) ++..+.++.. ...+++... ..-...+...++...+.|....-.+- .+++ .++.+.|+.-|..+.+.....+..+.. T Consensus 322 g~~~~~~~~~-~~~i~p~~~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~l~~r~~~rvTAtEV~~r~~E~~~~LG~v~~ 400 (559) T protein:vir:95 322 DITYIDQITG-QDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLE 400 (559) T ss_pred ceeeeCCCCC-cccceeecccccchHHHHHHHHHHHHHHHHHhhhhhHHHhhcCCCCCCCHHHHHHHHHHHHHHhhHHHH Confidence 8877765432 223433221 11222333445666666666553322 1223 455567999999999998889998888 Q ss_pred HHH-HHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEeecccChHHHHH Q lcl|Aclame:pro 485 NLT-KSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRI 563 (711) Q Consensus 485 n~~-~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~ 563 (711) +|. ++...+.+..++++.+.---|.. +.. +....+.|..- ++-...++. T Consensus 401 rl~~E~l~Pli~r~~~il~r~g~lP~~----------------p~~-------------l~~~~i~v~~i-s~La~aqk~ 450 (559) T protein:vir:95 401 RLNDECLNPLIDRSFSMMVRKNMLPPP----------------PDV-------------MEGMPLKVEYI-SVMAQAQKS 450 (559) T ss_pred HHHHHHHHHHHHHHHHHHHhcCCCCCC----------------ccc-------------ccCcceEEEee-cHHHHHHHH Confidence 885 46666777666666654210000 000 00011222222 222233445 Q ss_pred HHHHHHHHHHhhcchhHHHHHHHHHHhcCCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 564 EAAEAMIQFAQAVPSAAAVMADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQA 643 (711) Q Consensus 564 ~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~ 643 (711) .....+.++++....++++ .+ +.++.-+.+++.+.+....+-..-.--..++.++..++.+++++++| +++ T Consensus 451 ~~~~~i~~~~~~~~~laq~-~P---evld~id~d~~~~~~a~~~Gvp~~~irs~~ev~~~rqqr~~~qq~~q----~~~- 521 (559) T protein:vir:95 451 IGLSSLASTVNFIGQLAQV-KP---EALDKLNVDQAIDAFADMSGVSPTVIVPQEQVEQARQQRAQQQQQQQ----MMA- 521 (559) T ss_pred HHHHHHHHHHHHHHHHhcc-Ch---hhhhcCCHHHHHHHHHHHhCCchhhcCCHHHHHHHHHHHHHHHHHHH----HHH- Confidence 4444555555443333222 12 22344566777777766654432111111111111111111100000 000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccC Q lcl|Aclame:pro 644 DMAQAEADTAQAQADMLKAQLETEEAQKQLAMIEDMAQGGDVVYQQVRELVAQALAEITASQANVTEQ 711 (711) Q Consensus 644 ~~~k~qae~~~aqae~~~~q~~~~~~q~q~~~~~~~~q~~~~~~~~~~~~~~~~~~e~~~~qa~~e~Q 711 (711) ...+.+++-..-+.+... ... .++.+ ..+......+| T Consensus 522 ------~~~~aa~~~~~~~~~~~~--~~~-------------~l~~~----------~~~~~~~~~~~ 558 (559) T protein:vir:95 522 ------MGMAAAQGVKTLSEAKTS--DPS-------------VLSAM----------ANAVSGQGGQS 558 (559) T ss_pred ------HHHHHHHhhhccccccCC--Chh-------------HHHHH----------HHhhcCccccC Confidence 000000000000000000 000 00000 00001111111 No 27 >protein:vir:3361 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523332;genbank:gi:17570823;genbank:GeneID:927409 Probab=99.90 E-value=2.1e-21 Score=133.91 Aligned_cols=524 Identities=11% Similarity=0.056 Sum_probs=264.5 Q ss_pred CCcCCCCCCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCCc Q lcl|Aclame:pro 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRPC 80 (711) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p~ 80 (711) |++++.... .++ .++.+|....+..+.|...|.++.+|....-++++...... ++ .. T Consensus 1 m~~~~~~~~--------------~~~-------~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~-~~-~~ 57 (535) T protein:vir:33 1 MADSKRTGL--------------GED-------GAKATYDRLTNDRRAYETRAENCAQYTIPSLFPKESDNEST-DY-TT 57 (535) T ss_pred CChhhhhcc--------------Chh-------HHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccc-cc-cc Confidence 776653221 111 23456666666677889999999999643211111000000 00 11 Q ss_pred eEehhhHHHHHHHhhhhhh----cccceeEecchhhhhhhhhcccccccccccCCCchhHHHHH---HHHHHHHHHHhhc Q lcl|Aclame:pro 81 LVNNVLPTFVDQVLGDQRQ----NRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAE---VFTGLIKNIEYNC 153 (711) Q Consensus 81 ~~~N~i~~~v~~i~g~~~~----~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae---~l~~~~~~~~~~~ 153 (711) +....-...++.+.+..-. +++=.++.+.+.. ..........-.+..+ .++..+......| T Consensus 58 ~~dst~~~a~~~Laa~l~~~ltP~~~WF~l~~~d~~------------~~~~~~~~~~~~~v~~~l~~ve~~~~~~~~~s 125 (535) T protein:vir:33 58 PWQAVGARGLNNLASKLMLALFPMQSWMKLTISEYE------------AKQLVGDPDGLAKVDEGLSMVERIIMNYIESN 125 (535) T ss_pred cccccHHHHHHHHHHHHHHhhcCCCcccccccChHH------------HhccccCcchHHHHHHHHHHHHHHHHHHHHhc Confidence 1122222233333222111 1111122211100 0000000011112222 3344455556789 Q ss_pred CHHHHHHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCccceeeCCCccccCccccceeeeeecCCHHHHHHhc Q lcl|Aclame:pro 154 DAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALY 233 (711) Q Consensus 154 ~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~ 233 (711) +|......++.+.+..|+|++.+-.+ .++-++++.+ +..++++..++.- ...-++++..+|..++.+.| T Consensus 126 nf~~~~~~~~~~L~~~G~a~l~~~~~------~~~~~~f~~~-pl~~~~v~~d~~G----~vd~i~r~~~~t~~ql~~~~ 194 (535) T protein:vir:33 126 SYRVTLFECLKQLIVAGNALLYLPEP------EGSYNPMKLY-RLSSYVVQRDAYG----NVLQIVTRDQIAFGALPEDV 194 (535) T ss_pred CcHHHHHHHHHHHHhhCceeEEeecC------CCCceeeEEE-EcCeeEEeeCCCC----CeeEEEeeEeecHHHHHHHh Confidence 99999999999999999998754322 1234566666 5778888766532 23448899999999999999 Q ss_pred CCcccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEE Q lcl|Aclame:pro 234 PDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTY 313 (711) Q Consensus 234 p~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~ 313 (711) +......... ....+.+.++.+.+++.. ++ .+.++ T Consensus 195 ~~~~~~~~~~--------k~~~~~~~v~~~v~~~~~--------~~-----------------------------~~~~~ 229 (535) T protein:vir:33 195 RSAVEKSGGE--------KKMDEMVDVYTHVYLDEE--------SG-----------------------------DYLKY 229 (535) T ss_pred hhhhcccccc--------cccccCCeEEEEEEeeCC--------CC-----------------------------cEEEE Confidence 8653221111 011223334443332211 01 11122 Q ss_pred EEEEecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccC Q lcl|Aclame:pro 314 WRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVE 393 (711) Q Consensus 314 ~~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~ 393 (711) ....|..+....+.|+++.+||+|+. +..+++..||.|.+....+-.+.+|.+....+.....+.+++++++++.+. T Consensus 230 -~~~~~~~~~~~~~~~~~~~~P~i~~R--w~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~~~g~~ 306 (535) T protein:vir:33 230 -EEVEDVEIDGSDATYPTDAMPYIPVR--MVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGIT 306 (535) T ss_pred -EEEeCccccccccccccccCCceeee--eeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecccccc Confidence 12234443333456788889999764 445688999999999999999999999999999999999999999988887 Q ss_pred ChHHHHhhcccCCCceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHH Q lcl|Aclame:pro 394 GREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQR 473 (711) Q Consensus 394 ~~~~~~~~~~~~~~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~ 473 (711) +..+... ..+|.++.-++ .+..+-......-.......++...+.|.... ..+.+...++.+.|+.-|..+.+ T Consensus 307 ~~~~~~~---~~~g~~v~g~~---~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af-~~~~~~~~~~~r~TAtEV~~r~~ 379 (535) T protein:vir:33 307 QPRRLTK---AQTGDFVPGRR---EDIDFLQLEKQADFTVAKAVSDQIEARLSYAF-MLNSAVQRTGERVTAEEIRYVAS 379 (535) T ss_pred chhhccc---CCceeeecCCc---ccceeeecccccchhHHHHHHHHHHHHHHHHH-hhhhcccCCCccccHHHHHHHHH Confidence 7654321 23344433222 22222223333445567777788888887654 33333335666789999999999 Q ss_pred HHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEe Q lcl|Aclame:pro 474 QGDRGSFAFIDNLTK-SIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVV 552 (711) Q Consensus 474 ~~~~~~~~~~dn~~~-~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v 552 (711) .....+..++.+|.. +...+.+..+.++.+..--+ .+ | ...+.+.+ T Consensus 380 E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP---------------~~------p------------~~~v~~~y 426 (535) T protein:vir:33 380 ELEDTLGGVYSILSQELQLPLVRVLLKQLQATSQIP---------------EL------P------------KEAVEPTI 426 (535) T ss_pred HHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCC---------------CC------C------------ccceeEEE Confidence 999999998888864 66667777666665421000 00 0 00123333 Q ss_pred ecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcC-CcchHHHHHHHHhhhcchh--hcchhhhhhhhhHHHHHH Q lcl|Aclame:pro 553 TTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMD-WPGADVIAERLKKIVPPNV--LSKDEREAIEEDMPEQTE 629 (711) Q Consensus 553 ~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~-~~~~~e~~~~l~~~~~~~~--~~~~~~~~~~~~~~~~q~ 629 (711) ..+ -...+|.+..+.|.++++.+.++.+ +.++ .-+.+++.+.+....+-.. ....+. +.++..+++++ T Consensus 427 is~-La~aqr~~~~~~l~~~~~~la~~~P-------~~~d~~id~d~~~~~~a~~~Gvp~~~i~~~~e-e~~~~~~q~~~ 497 (535) T protein:vir:33 427 STG-LEAIGRGQDLDKLERCISAWAALAP-------MQGDPDINLAVIKLRIANAIGIDTSGILLTDE-QKQALMMQDAA 497 (535) T ss_pred ecH-HHHHHHHHHHHHHHHHHHHHHhhCh-------hhhhccCCHHHHHHHHHHHcCCCHhHhcCCHH-HHHHHHHHHHH Confidence 322 2344556666666666554322222 1222 2366777777766655432 111111 11111111100 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 630 PTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQLETEEAQKQLAMIEDMAQGGDVVYQQVRELVAQALAEITAS 704 (711) Q Consensus 630 ~~~~~q~~~~~~q~~~~k~qae~~~aqae~~~~q~~~~~~q~q~~~~~~~~q~~~~~~~~~~~~~~~~~~e~~~~ 704 (711) ++++++++.++- ...+...+...+..+..+.+.-+.. + T Consensus 498 -~~~~~~~~~~~g-----------~~~~~~~~~~~~~~~~~~~~~g~~~-------------------------~ 535 (535) T protein:vir:33 498 -QTGVENAAAAGG-----------AGVGALATSSPEAMQGAAAKAGLNA-------------------------T 535 (535) T ss_pred -HHHHHHHHHhhh-----------hhhcchhhcCChhHHHHHHhccCCC-------------------------C Confidence 000000000000 0000000000000000000000000 0 No 28 >protein:vir:102668 Length: 547 # NCBI annotation: Hypothetical protein # Family: family:all:481 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024419;genbank:gi:48696640;genbank:GeneID:2948135 Probab=99.90 E-value=1.8e-21 Score=134.34 Aligned_cols=535 Identities=9% Similarity=0.003 Sum_probs=270.8 Q ss_pred HHHHHHHHHHHHHHhhchHHHHHHHHHHHHhC---CCCCCHHHHHHHHH-hCCCceEehhhHHHHHHHhhhhhh-----c Q lcl|Aclame:pro 30 ALLATARERARDGATYWKDNWEAAEDDLKFLG---GEQWPSQVRTEREL-EQRPCLVNNVLPTFVDQVLGDQRQ-----N 100 (711) Q Consensus 30 ~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~---G~Qw~~~~~~~~~~-~g~p~~~~N~i~~~v~~i~g~~~~-----~ 100 (711) ....++.++|+........|...|.++.+|.. +.-+.+........ .....+.-+.-...++.+.+..-. + T Consensus 1 ~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~~i~dst~~~a~~~Las~L~~~ltPp~ 80 (547) T protein:vir:10 1 MENSKIVKRLDFLKTDRKNVEQIWDCIRKYIMPMRSDFFSDLRSEGSINWNQNREVFDSTAGDGLETLSSSLHGSLTSPA 80 (547) T ss_pred CCHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccccccCCCCCcccccccccccccchHHHHHHHHHHHHHHhhcCCC Confidence 34667888888888889999999999999983 22222211000000 001112223333334433322211 2 Q ss_pred ccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCccEEEEEEee Q lcl|Aclame:pro 101 RPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESGMGYLRVRSDY 180 (711) Q Consensus 101 r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d~ 180 (711) ++=.++.+.+.. ....++. ..--+..+..+....+.++|..+...++.+.+..|+|++.+..+. T Consensus 81 ~~WF~l~~~d~~---------------~~~~~~v-~~~L~~ve~~i~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~d~ 144 (547) T protein:vir:10 81 TKWFELAFRDKE---------------LNSDDEC-RKWLENATHDVYSALQDSNFNLEANETYIDLCGYGNAIMVEEEDE 144 (547) T ss_pred CcccccccCCcc---------------ccchHHH-HHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEeccCC Confidence 222333332110 0000111 112223455555666789999999999999999999987665432 Q ss_pred ccCCCCCCcceEEEecCccceeeCCCccccCccccceeeeeecCCHHHHHHhcCCcccchhhcccccccccCCCCCeEEE Q lcl|Aclame:pro 181 LADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSVADYDTWFTEKSVRV 260 (711) Q Consensus 181 ~~~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v 260 (711) ...+.++++.+ +..++++..++.- + ..=++++..||..++.++||.+...+-...... .+.......+.+ T Consensus 145 ----~~~~~~r~~~~-pl~~~~v~~d~~G-~---v~~i~r~~~~t~~qi~~~fg~~~l~~~v~~~~~-~~~~~~~~~~~v 214 (547) T protein:vir:10 145 ----DEEGSVVFQSS-PIQDSYFEEDSRG-Q---VVNFYRVFRWTPAQIYDRFGDEGTPEAIIKKAK-EASNQAALKQEV 214 (547) T ss_pred ----CCCCceeEEEe-ecceEEEeeCCCc-C---eeeeeeeeeccHHHHHHhcCcccCCHHHHHHHh-cCCCcccceEEE Confidence 22345677777 6789999876642 2 233678899999999999998764322211111 111111235666 Q ss_pred EEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEEec-CceeccCccCCCCccceEEE Q lcl|Aclame:pro 261 SEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITG-ANVLEGPVEIPSTTIPVIPV 339 (711) Q Consensus 261 ~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g-~~~le~~~p~~~~~~P~vp~ 339 (711) +.+.+.+...... ..+++.+. ... +.+..+|.-..| .+++. ++.| .++||+++ T Consensus 215 ~~~v~~~~~~~~~--~~~~~~~~--------------------~~~-~p~~s~~~e~~~~~~~l~-esg~--~e~P~~~~ 268 (547) T protein:vir:10 215 VMCVFTRYDKKQN--RNAGTVLA--------------------PTE-RPFGKKWILKEGAVQLGE-EGGY--YEMPAYAI 268 (547) T ss_pred EEEEeeccCCCCC--ccccceee--------------------ccc-cceeEEEEEecCceeeee-cCCc--ccCCeeee Confidence 6665544321110 00000000 001 112222222333 44553 3334 56899976 Q ss_pred EeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHHHHhhcccCCCceEEecccccCc Q lcl|Aclame:pro 340 WGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNFSLLTYIPQYQGD 419 (711) Q Consensus 340 ~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~~~~~~~~~~i~~~~~~~~~ 419 (711) - +..+++..||.|.+....+-.+.+|.+...++..+.++.+++++++.+.+.+. .+..||+++.+.+. T Consensus 269 R--w~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~~v~~~g~~~~------~~~~pgg~~~~~~~---- 336 (547) T protein:vir:10 269 R--WRKSAGSQWGFGPSHLALPDVLTANRYVELVLRSSEKVIDPAIMVTERGLISD------IDLGASGLTVVRDM---- 336 (547) T ss_pred e--eeecCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeccccccccc------ceecCCeeeecCCc---- Confidence 4 44568899999999999999999999999999999999999999987766543 23568888876432 Q ss_pred CCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHH Q lcl|Aclame:pro 420 PGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLT-KSIRRVGKILV 498 (711) Q Consensus 420 ~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~-~~~~~~~~~~l 498 (711) ..+++++...-.......++...+.|....= .+.++=.++...|+.-|..+.+.....+......|. ++...+....+ T Consensus 337 ~~v~pl~~~~~~~~~~~~i~~~~~rI~~af~-~d~~~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~ 415 (547) T protein:vir:10 337 ESMKPFESRARFDVSSIQLTDLRSAVRRIYY-VDQLQMKDSPAMTATEVQVRYELMQRLLGPTLGRLENDFLSPMIQRTF 415 (547) T ss_pred ccceeeecccchHHHHHHHHHHHHHHHHHhh-hhhhhcCCCccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHH Confidence 2344444444444455666766666666532 222211344568999999999998889888888885 46666666666 Q ss_pred HHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEeecccChHHHHHHHHHHHHHHHhhcch Q lcl|Aclame:pro 499 EMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAVPS 578 (711) Q Consensus 499 ~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~~p~ 578 (711) .++.+.---|. +-.....+ ....++|..-. +-...++......+.++++...+ T Consensus 416 ~il~r~g~lP~---------------~p~~l~~~-----------~~~~~~v~~is-~Laraq~~~~~~~i~~~~~~v~~ 468 (547) T protein:vir:10 416 NIRFRAGKLGE---------------LPSKLLES-----------GKAAMDIVYTG-PLSRAQKIDQAASIERWAGSTAQ 468 (547) T ss_pred HHHHhcCCCCC---------------Cchhhhcc-----------CcceEEEEecc-HHHHHHHHHHHHHHHHHHHHHHH Confidence 66554310000 00000000 01112222211 11222333334444444443322 Q ss_pred hHHHHHHHHHHhcCCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 579 AAAVMADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEP-TPEQQVEMAKSQADMAQAEADTAQAQA 657 (711) Q Consensus 579 ~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~-~~~~q~~~~~~q~~~~k~qae~~~aqa 657 (711) +.++ .+ +.++.-+.+++.+.+....+-....--..++.++..++.+++ +.++|+.+.++. .... + T Consensus 469 laq~-~P---~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~qaa~~~~~-------g~~m---~ 534 (547) T protein:vir:10 469 LAEI-NP---EVLDIPDWDEMVRMLGSLLGAPQTLMRPKAKVTSIRKNRSQTQQKAEQAAIAEAE-------GNAM---E 534 (547) T ss_pred hhcc-Ch---hhhhcCCHHHHHHHHHHHhCCChhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHH---H Confidence 2221 11 223445667777776666544321111111111111111110 000010000000 0000 0 Q ss_pred HHHHHHHHHHHHHHH Q lcl|Aclame:pro 658 DMLKAQLETEEAQKQ 672 (711) Q Consensus 658 e~~~~q~~~~~~q~q 672 (711) .+....+.+.+ .+ T Consensus 535 ~~~~~~a~~~~--~~ 547 (547) T protein:vir:10 535 AQGKGQAALKE--NQ 547 (547) T ss_pred hhcCcccchhc--cC Confidence 00000000000 00 No 29 >protein:vir:1538 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052106;swissprot:trembl:q9t110;genbank:gi:9634032;uniprot:Q9T110;genbank:GeneID:1262384 Probab=99.89 E-value=2.5e-21 Score=133.54 Aligned_cols=517 Identities=11% Similarity=0.038 Sum_probs=266.7 Q ss_pred CCcCCCCCCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCC----CCCCHHHHHHHHHh Q lcl|Aclame:pro 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGG----EQWPSQVRTERELE 76 (711) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G----~Qw~~~~~~~~~~~ 76 (711) ||++++. . +.+ +.++.+|....+....|...|.++.+|... +.+...- .+ T Consensus 1 m~~~~~~----~----------~~~-------~~~k~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~-----~~ 54 (535) T protein:vir:15 1 MADSKRT----G----------LGE-------DGAKATYDRLTNDRRAYETRAENCAQYTIPSLFPKESDNES-----TD 54 (535) T ss_pred CCccchh----c----------cch-------HHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCccc-----cc Confidence 7766521 1 111 123456666666677888999999999643 2221100 00 Q ss_pred CCCceEehhhHHHHHHHhhhhhh----cccceeEecchhhhhhhhhcccccccccccCCCchhHHHHH---HHHHHHHHH Q lcl|Aclame:pro 77 QRPCLVNNVLPTFVDQVLGDQRQ----NRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAE---VFTGLIKNI 149 (711) Q Consensus 77 g~p~~~~N~i~~~v~~i~g~~~~----~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae---~l~~~~~~~ 149 (711) + ..+....-...++.+.+..-. +++=.++.+.+.. ..-.......-.+..+ .++..+... T Consensus 55 ~-~~~~dst~~~a~~~Laa~l~~~ltP~~~WF~l~~~d~~------------~~~~~~~~~~~~~v~~~L~~ve~~~~~~ 121 (535) T protein:vir:15 55 Y-TTPWQAVGARGLNNLASKLMLALFPMQSWMKLTISEYE------------AKQLVGDPDGLAKVDEGLSMVERIIMNY 121 (535) T ss_pred c-cccccccHHHHHHHHHHHHHHhhcCCCcccccccChHH------------HhccCCCcchHHHHHHHHHHHHHHHHHH Confidence 0 111222222233333222111 1111122221100 0000000011112222 334445555 Q ss_pred HhhcCHHHHHHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCccceeeCCCccccCccccceeeeeecCCHHHH Q lcl|Aclame:pro 150 EYNCDAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKF 229 (711) Q Consensus 150 ~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~ 229 (711) ...|+|..+...++.+.+..|+|++.+..+ .++-++++.+ +..++++..++.- ...-++++..||..++ T Consensus 122 l~~snf~~~~~~~~~~L~~~G~a~l~~~~~------~~~~~~f~~~-pl~~~~v~~d~~G----~vd~i~r~~~~t~~~l 190 (535) T protein:vir:15 122 IESNSYRVTLFECLKQLIVAGNALLYLPEP------EGSYNPMKLY-RLSSYVVQRDAYG----NVLQIVTRDQIAFGAL 190 (535) T ss_pred HHhcCcHHHHHHHHHHHHhhCceeEEeecC------CCCceeeEEE-EcCeeEEeeCCCC----CeeEEEEeEeecHHHH Confidence 678999999999999999999998654321 1234566666 5778888766542 3445889999999999 Q ss_pred HHhcCCcccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccce Q lcl|Aclame:pro 230 KALYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKT 309 (711) Q Consensus 230 ~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~ 309 (711) .+.|+....... ......+.|.++++.+.+... + . T Consensus 191 ~~~~~~~~~~~~--------~~~~~~~~v~v~~~v~~~~~~--------~-----------------------------~ 225 (535) T protein:vir:15 191 PEDVRSAVEKAG--------GEKKMDEMVDVYTHVYLDEES--------G-----------------------------D 225 (535) T ss_pred HHHHhHhhhccc--------cccCCCCceeEEEEEEEecCC--------C-----------------------------c Confidence 888875432111 011123456666665543211 1 1 Q ss_pred EEEEEEEEecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecc Q lcl|Aclame:pro 310 FKTYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSE 389 (711) Q Consensus 310 ~~v~~~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~ 389 (711) +.+|..+ .|..+....+.|++..+||+++. +..+++..||.|.+....+-.+.+|.+....+.....+.++++++++ T Consensus 226 ~~~~~e~-~g~~~~~~~~~~~~~~~P~i~~R--w~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~~ 302 (535) T protein:vir:15 226 YLKYEEV-EDVEIDGSDATYPTDAMPYIPVR--MVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNP 302 (535) T ss_pred EEEEEEe-eCccccccccccccccCCceeee--eeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecc Confidence 1122112 23333222355788899999764 44568899999999999999999999999999999999999999988 Q ss_pred cccCChHHHHhhcccCCCceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHH Q lcl|Aclame:pro 390 GNVEGREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAII 469 (711) Q Consensus 390 ~av~~~~~~~~~~~~~~~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~ 469 (711) +.+.+..+.. ...+|.++.-++ .+..+-......-.......++...+.|.... ..+.+...++.+.|+.-|. T Consensus 303 ~g~~~~~~l~---~~~~g~~v~g~~---~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af-~~~~~~~~~~~r~TAtEV~ 375 (535) T protein:vir:15 303 AGITQPRRLT---KAQTGDFVPGRR---EDIDFLQLEKQADFTVAKAVSDQIEARLSYAF-MLNSAVQRTGERVTAEEIR 375 (535) T ss_pred cccccchhcc---cCCceeeecCCc---ccceeeecccccchhHHHHHHHHHHHHHHHHH-hhhhcccCCCccccHHHHH Confidence 8887765432 123344443322 22222223333345557777787888887754 3333333566678999999 Q ss_pred HHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheee Q lcl|Aclame:pro 470 ARQRQGDRGSFAFIDNLTK-SIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKY 548 (711) Q Consensus 470 ~~~~~~~~~~~~~~dn~~~-~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~ 548 (711) .+.+.....+..++.+|.. +...+.+..+.++.+..--+ .+ | ...+ T Consensus 376 ~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP---------------~~------p------------~~~v 422 (535) T protein:vir:15 376 YVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATSQIP---------------EL------P------------KEAV 422 (535) T ss_pred HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCC---------------CC------C------------ccce Confidence 9999999999998888864 66667777666665421000 00 0 0012 Q ss_pred eEEeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcC-CcchHHHHHHHHhhhcchh--hcchhhhhhhhhHH Q lcl|Aclame:pro 549 DVVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMD-WPGADVIAERLKKIVPPNV--LSKDEREAIEEDMP 625 (711) Q Consensus 549 dv~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~-~~~~~e~~~~l~~~~~~~~--~~~~~~~~~~~~~~ 625 (711) .+.+..+ -...+|.+..+.|.++++.+.++.+ +.++ .-+.+++.+.+....+-.. ....+. +.++.++ T Consensus 423 ~~~yis~-La~aqr~~~~~~l~~~~~~la~~~P-------~~ld~~id~d~~~~~~a~~~Gvp~~~i~~~~e-ev~~~~~ 493 (535) T protein:vir:15 423 EPTISTG-LEAIGRGQDLDKLERCISAWAALAP-------MQGDPDINLAVIKLRIANAIGIDTSGILLTDE-QKQALMM 493 (535) T ss_pred eEEEecH-HHHHHHHHHHHHHHHHHHHHHhcCh-------hhhhccCCHHHHHHHHHHHcCCChhhhcCCHH-HHHHHHH Confidence 3333322 2344556666666666554322222 1222 2366777777776655432 111111 1111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 626 EQTEPTPEQQVEMAKSQADMA---QAEADTAQAQADMLKAQLE 665 (711) Q Consensus 626 ~~q~~~~~~q~~~~~~q~~~~---k~qae~~~aqae~~~~q~~ 665 (711) ++++++. ++++..++.+... +..-+...+.+.....++. T Consensus 494 q~~~~~~-~~~~a~~~g~~~~~~~~~~p~~~~~~~~~~g~~~~ 535 (535) T protein:vir:15 494 QDAAQTG-IENAAATGGAGVGALATSSPEAMQGAAAQAGLDAT 535 (535) T ss_pred HHHHHHH-HHHHHHHHHhhccchhccChHHHHHHHhccCCCCC Confidence 1111110 0000000000000 0000000000000000000 No 30 >protein:vir:107404 Length: 555 # NCBI annotation: Bbp21 # Family: family:all:481 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958690;genbank:gi:41179382;genbank:GeneID:2717198 Probab=99.89 E-value=3e-21 Score=133.06 Aligned_cols=539 Identities=9% Similarity=-0.003 Sum_probs=272.4 Q ss_pred CCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHh---CCCCCCHHHHHHHHHhCCCceEehhh Q lcl|Aclame:pro 10 VEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFL---GGEQWPSQVRTERELEQRPCLVNNVL 86 (711) Q Consensus 10 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y---~G~Qw~~~~~~~~~~~g~p~~~~N~i 86 (711) |.+ ......+..+|+........|...|.++.+|. .|.=|.+ +...-..+ .+.+....- T Consensus 1 M~~----------------~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~-~~~~~~~~-~~~~~dst~ 62 (555) T protein:vir:10 1 MAE----------------QTERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQ-DRNRGEKR-HNNILDNTG 62 (555) T ss_pred CCC----------------cccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCC-CCCcchhc-ccccccccH Confidence 111 11255678888888888899999999999998 3332322 11111111 122233333 Q ss_pred HHHHHHHhhhhhh-----cccceeEecchhhhhhhhhcccccccccccCCCchhHHHH---HHHHHHHHHHHhhcCHHHH Q lcl|Aclame:pro 87 PTFVDQVLGDQRQ-----NRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELA---EVFTGLIKNIEYNCDAETE 158 (711) Q Consensus 87 ~~~v~~i~g~~~~-----~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~A---e~l~~~~~~~~~~~~~~~~ 158 (711) ...++.+.+..-. +++=.++.+.+. ...+..+.. +.++..+......++|..+ T Consensus 63 ~~a~~~LAa~L~~~ltpp~~~WF~l~~~d~-------------------~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~ 123 (555) T protein:vir:10 63 TRALRVLAAGMMAGMTSPARPWFRLTTSIP-------------------ELDESAAVKAWLANVTRLMLMIFAKSNTYRA 123 (555) T ss_pred HHHHHHHHHHHHHhhcCCCCcccccccCcc-------------------cccchHHHHHHHHHHHHHHHHHHHhcCcHHH Confidence 3334443332221 222233333210 000111122 2244555566678999999 Q ss_pred HHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCccceeeCCCccccCccccceeeeeecCCHHHHHHhcCCccc Q lcl|Aclame:pro 159 YDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATA 238 (711) Q Consensus 159 ~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~ 238 (711) ...++.+.+..|+|++-+..+ + ++-+++..+ +..++++..++.- ...=++++..||..++.++||.++. T Consensus 124 ~~~~~~~Lv~~G~a~l~~~~d-----~-~~~~rf~~~-pl~~~~v~~d~~G----~vd~i~r~~~~t~~ql~~~fg~~~l 192 (555) T protein:vir:10 124 LHSMYEELGAFGTASSIVLPD-----F-DAVVYHHSL-TAGEYAIAADNQG----RVNTLYREFQITVAQMVREFGKDKC 192 (555) T ss_pred HHHHHHHHHhhCceEEEEecC-----C-CceEEEEEe-ecceeEEeeCCCC----CEEEEEEEEeccHHHHHHhcCcccC Confidence 999999999999998644322 1 234677777 6888998765542 3445668889999999999998764 Q ss_pred chhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEE- Q lcl|Aclame:pro 239 EPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKI- 317 (711) Q Consensus 239 ~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~- 317 (711) .+....... .......+.|+++.|.+..... ++ ...+..+...+||.-- T Consensus 193 ~~~~~~~~~---~~~~~~~v~v~~~V~pr~~~~~-----~~----------------------~~~~~~p~~s~~~~~~~ 242 (555) T protein:vir:10 193 STTVQSLFD---RGALEQWVTVIHAIEPRADRDP-----SK----------------------RDDRNMAWKSVYFEPGA 242 (555) T ss_pred CHHHHHHHh---cCCCCceEEEEEEEeeccCcCc-----CC----------------------CCccccceEEEEEEecc Confidence 332222111 1112245777777654321100 00 0011111222333221 Q ss_pred ecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHH Q lcl|Aclame:pro 318 TGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGRED 397 (711) Q Consensus 318 ~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~ 397 (711) .|.+++. .+.| ..+||+|+- +...++..||.|.+....+-.+.+|++...++..+..+.++++.++.+..... T Consensus 243 d~~~vl~-esgy--~e~P~i~~R--w~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~~~-- 315 (555) T protein:vir:10 243 DETRTLR-ESGY--RSFRALCPR--WALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQLPVSAKNQD-- 315 (555) T ss_pred CCccccc-cCCc--ccCCceeee--eeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecccccccc-- Confidence 2344553 2334 578999764 44568899999999999999999999999999999999999999988765322 Q ss_pred HHhhcccCCCceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHH--Hhc-cccchhHHHHHHHHHHH Q lcl|Aclame:pro 398 EWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDA--SLG-AMGNETSGRAIIARQRQ 474 (711) Q Consensus 398 ~~~~~~~~~~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~--~~G-~~~~~~sg~ai~~~~~~ 474 (711) ....||++..+.++...+......+....-+...+.++...+.|.... ..+. +++ .++...|+..|..+.+- T Consensus 316 ----~~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af-~~dlf~~l~~~~~~~~TAtEV~~r~~E 390 (555) T protein:vir:10 316 ----ISTVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASF-YADLFLMLANGTNPQMTATEVAERHEE 390 (555) T ss_pred ----ceeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHh-hcchhhhccCCCCCcccHHHHHHHHHH Confidence 246688876666554433222222333344556677777777777655 3442 222 34456899999999888 Q ss_pred HHHHHHHHHHHHH-HHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEee Q lcl|Aclame:pro 475 GDRGSFAFIDNLT-KSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVT 553 (711) Q Consensus 475 ~~~~~~~~~dn~~-~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~ 553 (711) ....+..++.+|. ++...+.+..+.++.+.---|. -+.. +....+.|..- T Consensus 391 ~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~----------------~P~~-------------l~~~~i~v~yi 441 (555) T protein:vir:10 391 KLLMLGPVLERMHNEILDPLIELTFQRMVEANILPP----------------PPQE-------------MQGVDLNVEFV 441 (555) T ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCC----------------Cchh-------------hcCceeEEEec Confidence 8889988888875 4666666666665555310000 0000 00011222222 Q ss_pred cccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHHHHH Q lcl|Aclame:pro 554 TGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPE 633 (711) Q Consensus 554 ~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~ 633 (711) . +-...++......+.++++....+.++ .+ +.++.-+.+++.+.+....+-....--..++.++..++.++++++ T Consensus 442 s-~La~aq~~~~~~~i~~~l~~i~~laq~-~P---~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~ 516 (555) T protein:vir:10 442 S-MLAQAQRAIATNSVDRFVGNLGAVAGI-KP---EVLDKFDADRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQA 516 (555) T ss_pred c-HHHHHHHHHHHHHHHHHHHHHHHHhcC-Ch---hhhhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHH Confidence 2 222334444444444444443222221 11 223444667777776666544321111111111111111111100 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 634 QQVEMAKSQADMAQAEADTAQAQADMLKAQLETEEAQKQLAMIEDMAQGG 683 (711) Q Consensus 634 ~q~~~~~~q~~~~k~qae~~~aqae~~~~q~~~~~~q~q~~~~~~~~q~~ 683 (711) ++++ .++.+.....+.+.. +...-. ...........-=. T Consensus 517 ~~~a--~~~~q~~~~~~~~~~-------~~~~~~--~~~~~~~~~~~~~~ 555 (555) T protein:vir:10 517 AQQA--ALLNQGADTAAKLGS-------VDTSKQ--NALTDVTRAFSGYT 555 (555) T ss_pred HHHH--HHHHHHHHHHHHhcc-------cccCcc--hhHHHHHhhhccCC Confidence 0000 000000000000000 000000 00000000000000 No 31 >protein:vir:98506 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996583;genbank:gi:45569514;genbank:GeneID:2767834 Probab=99.89 E-value=3e-21 Score=133.06 Aligned_cols=539 Identities=9% Similarity=-0.003 Sum_probs=272.4 Q ss_pred CCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHh---CCCCCCHHHHHHHHHhCCCceEehhh Q lcl|Aclame:pro 10 VEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFL---GGEQWPSQVRTERELEQRPCLVNNVL 86 (711) Q Consensus 10 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y---~G~Qw~~~~~~~~~~~g~p~~~~N~i 86 (711) |.+ ......+..+|+........|...|.++.+|. .|.=|.+ +...-..+ .+.+....- T Consensus 1 M~~----------------~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~-~~~~~~~~-~~~~~dst~ 62 (555) T protein:vir:98 1 MAE----------------QTERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQ-DRNRGEKR-HNNILDNTG 62 (555) T ss_pred CCC----------------cccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCC-CCCcchhc-ccccccccH Confidence 111 11255678888888888899999999999998 3332322 11111111 122233333 Q ss_pred HHHHHHHhhhhhh-----cccceeEecchhhhhhhhhcccccccccccCCCchhHHHH---HHHHHHHHHHHhhcCHHHH Q lcl|Aclame:pro 87 PTFVDQVLGDQRQ-----NRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELA---EVFTGLIKNIEYNCDAETE 158 (711) Q Consensus 87 ~~~v~~i~g~~~~-----~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~A---e~l~~~~~~~~~~~~~~~~ 158 (711) ...++.+.+..-. +++=.++.+.+. ...+..+.. +.++..+......++|..+ T Consensus 63 ~~a~~~LAa~L~~~ltpp~~~WF~l~~~d~-------------------~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~ 123 (555) T protein:vir:98 63 TRALRVLAAGMMAGMTSPARPWFRLTTSIP-------------------ELDESAAVKAWLANVTRLMLMIFAKSNTYRA 123 (555) T ss_pred HHHHHHHHHHHHHhhcCCCCcccccccCcc-------------------cccchHHHHHHHHHHHHHHHHHHHhcCcHHH Confidence 3334443332221 222233333210 000111122 2244555566678999999 Q ss_pred HHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCccceeeCCCccccCccccceeeeeecCCHHHHHHhcCCccc Q lcl|Aclame:pro 159 YDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATA 238 (711) Q Consensus 159 ~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~ 238 (711) ...++.+.+..|+|++-+..+ + ++-+++..+ +..++++..++.- ...=++++..||..++.++||.++. T Consensus 124 ~~~~~~~Lv~~G~a~l~~~~d-----~-~~~~rf~~~-pl~~~~v~~d~~G----~vd~i~r~~~~t~~ql~~~fg~~~l 192 (555) T protein:vir:98 124 LHSMYEELGAFGTASSIVLPD-----F-DAVVYHHSL-TAGEYAIAADNQG----RVNTLYREFQITVAQMVREFGKDKC 192 (555) T ss_pred HHHHHHHHHhhCceEEEEecC-----C-CceEEEEEe-ecceeEEeeCCCC----CEEEEEEEEeccHHHHHHhcCcccC Confidence 999999999999998644322 1 234677777 6888998765542 3445668889999999999998764 Q ss_pred chhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEE- Q lcl|Aclame:pro 239 EPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKI- 317 (711) Q Consensus 239 ~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~- 317 (711) .+....... .......+.|+++.|.+..... ++ ...+..+...+||.-- T Consensus 193 ~~~~~~~~~---~~~~~~~v~v~~~V~pr~~~~~-----~~----------------------~~~~~~p~~s~~~~~~~ 242 (555) T protein:vir:98 193 STTVQSLFD---RGALEQWVTVIHAIEPRADRDP-----SK----------------------RDDRNMAWKSVYFEPGA 242 (555) T ss_pred CHHHHHHHh---cCCCCceEEEEEEEeeccCcCc-----CC----------------------CCccccceEEEEEEecc Confidence 332222111 1112245777777654321100 00 0011111222333221 Q ss_pred ecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHH Q lcl|Aclame:pro 318 TGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGRED 397 (711) Q Consensus 318 ~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~ 397 (711) .|.+++. .+.| ..+||+|+- +...++..||.|.+....+-.+.+|++...++..+..+.++++.++.+..... T Consensus 243 d~~~vl~-esgy--~e~P~i~~R--w~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~~~-- 315 (555) T protein:vir:98 243 DETRTLR-ESGY--RSFRALCPR--WALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQLPVSAKNQD-- 315 (555) T ss_pred CCccccc-cCCc--ccCCceeee--eeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecccccccc-- Confidence 2344553 2334 578999764 44568899999999999999999999999999999999999999988765322 Q ss_pred HHhhcccCCCceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHH--Hhc-cccchhHHHHHHHHHHH Q lcl|Aclame:pro 398 EWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDA--SLG-AMGNETSGRAIIARQRQ 474 (711) Q Consensus 398 ~~~~~~~~~~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~--~~G-~~~~~~sg~ai~~~~~~ 474 (711) ....||++..+.++...+......+....-+...+.++...+.|.... ..+. +++ .++...|+..|..+.+- T Consensus 316 ----~~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af-~~dlf~~l~~~~~~~~TAtEV~~r~~E 390 (555) T protein:vir:98 316 ----ISTVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASF-YADLFLMLANGTNPQMTATEVAERHEE 390 (555) T ss_pred ----ceeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHh-hcchhhhccCCCCCcccHHHHHHHHHH Confidence 246688876666554433222222333344556677777777777655 3442 222 34456899999999888 Q ss_pred HHHHHHHHHHHHH-HHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEee Q lcl|Aclame:pro 475 GDRGSFAFIDNLT-KSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVT 553 (711) Q Consensus 475 ~~~~~~~~~dn~~-~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~ 553 (711) ....+..++.+|. ++...+.+..+.++.+.---|. -+.. +....+.|..- T Consensus 391 ~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~----------------~P~~-------------l~~~~i~v~yi 441 (555) T protein:vir:98 391 KLLMLGPVLERMHNEILDPLIELTFQRMVEANILPP----------------PPQE-------------MQGVDLNVEFV 441 (555) T ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCC----------------Cchh-------------hcCceeEEEec Confidence 8889988888875 4666666666665555310000 0000 00011222222 Q ss_pred cccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHHHHH Q lcl|Aclame:pro 554 TGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPE 633 (711) Q Consensus 554 ~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~ 633 (711) . +-...++......+.++++....+.++ .+ +.++.-+.+++.+.+....+-....--..++.++..++.++++++ T Consensus 442 s-~La~aq~~~~~~~i~~~l~~i~~laq~-~P---~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~ 516 (555) T protein:vir:98 442 S-MLAQAQRAIATNSVDRFVGNLGAVAGI-KP---EVLDKFDADRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQA 516 (555) T ss_pred c-HHHHHHHHHHHHHHHHHHHHHHHHhcC-Ch---hhhhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHH Confidence 2 222334444444444444443222221 11 223444667777776666544321111111111111111111100 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 634 QQVEMAKSQADMAQAEADTAQAQADMLKAQLETEEAQKQLAMIEDMAQGG 683 (711) Q Consensus 634 ~q~~~~~~q~~~~k~qae~~~aqae~~~~q~~~~~~q~q~~~~~~~~q~~ 683 (711) ++++ .++.+.....+.+.. +...-. ...........-=. T Consensus 517 ~~~a--~~~~q~~~~~~~~~~-------~~~~~~--~~~~~~~~~~~~~~ 555 (555) T protein:vir:98 517 AQQA--ALLNQGADTAAKLGS-------VDTSKQ--NALTDVTRAFSGYT 555 (555) T ss_pred HHHH--HHHHHHHHHHHHhcc-------cccCcc--hhHHHHHhhhccCC Confidence 0000 000000000000000 000000 00000000000000 No 32 >protein:vir:107822 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996631;genbank:gi:45580765;genbank:GeneID:2767898 Probab=99.89 E-value=3e-21 Score=133.06 Aligned_cols=539 Identities=9% Similarity=-0.003 Sum_probs=272.4 Q ss_pred CCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHh---CCCCCCHHHHHHHHHhCCCceEehhh Q lcl|Aclame:pro 10 VEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFL---GGEQWPSQVRTERELEQRPCLVNNVL 86 (711) Q Consensus 10 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y---~G~Qw~~~~~~~~~~~g~p~~~~N~i 86 (711) |.+ ......+..+|+........|...|.++.+|. .|.=|.+ +...-..+ .+.+....- T Consensus 1 M~~----------------~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~-~~~~~~~~-~~~~~dst~ 62 (555) T protein:vir:10 1 MAE----------------QTERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQ-DRNRGEKR-HNNILDNTG 62 (555) T ss_pred CCC----------------cccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCC-CCCcchhc-ccccccccH Confidence 111 11255678888888888899999999999998 3332322 11111111 122233333 Q ss_pred HHHHHHHhhhhhh-----cccceeEecchhhhhhhhhcccccccccccCCCchhHHHH---HHHHHHHHHHHhhcCHHHH Q lcl|Aclame:pro 87 PTFVDQVLGDQRQ-----NRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELA---EVFTGLIKNIEYNCDAETE 158 (711) Q Consensus 87 ~~~v~~i~g~~~~-----~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~A---e~l~~~~~~~~~~~~~~~~ 158 (711) ...++.+.+..-. +++=.++.+.+. ...+..+.. +.++..+......++|..+ T Consensus 63 ~~a~~~LAa~L~~~ltpp~~~WF~l~~~d~-------------------~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~ 123 (555) T protein:vir:10 63 TRALRVLAAGMMAGMTSPARPWFRLTTSIP-------------------ELDESAAVKAWLANVTRLMLMIFAKSNTYRA 123 (555) T ss_pred HHHHHHHHHHHHHhhcCCCCcccccccCcc-------------------cccchHHHHHHHHHHHHHHHHHHHhcCcHHH Confidence 3334443332221 222233333210 000111122 2244555566678999999 Q ss_pred HHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCccceeeCCCccccCccccceeeeeecCCHHHHHHhcCCccc Q lcl|Aclame:pro 159 YDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATA 238 (711) Q Consensus 159 ~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~ 238 (711) ...++.+.+..|+|++-+..+ + ++-+++..+ +..++++..++.- ...=++++..||..++.++||.++. T Consensus 124 ~~~~~~~Lv~~G~a~l~~~~d-----~-~~~~rf~~~-pl~~~~v~~d~~G----~vd~i~r~~~~t~~ql~~~fg~~~l 192 (555) T protein:vir:10 124 LHSMYEELGAFGTASSIVLPD-----F-DAVVYHHSL-TAGEYAIAADNQG----RVNTLYREFQITVAQMVREFGKDKC 192 (555) T ss_pred HHHHHHHHHhhCceEEEEecC-----C-CceEEEEEe-ecceeEEeeCCCC----CEEEEEEEEeccHHHHHHhcCcccC Confidence 999999999999998644322 1 234677777 6888998765542 3445668889999999999998764 Q ss_pred chhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEE- Q lcl|Aclame:pro 239 EPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKI- 317 (711) Q Consensus 239 ~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~- 317 (711) .+....... .......+.|+++.|.+..... ++ ...+..+...+||.-- T Consensus 193 ~~~~~~~~~---~~~~~~~v~v~~~V~pr~~~~~-----~~----------------------~~~~~~p~~s~~~~~~~ 242 (555) T protein:vir:10 193 STTVQSLFD---RGALEQWVTVIHAIEPRADRDP-----SK----------------------RDDRNMAWKSVYFEPGA 242 (555) T ss_pred CHHHHHHHh---cCCCCceEEEEEEEeeccCcCc-----CC----------------------CCccccceEEEEEEecc Confidence 332222111 1112245777777654321100 00 0011111222333221 Q ss_pred ecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHH Q lcl|Aclame:pro 318 TGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGRED 397 (711) Q Consensus 318 ~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~ 397 (711) .|.+++. .+.| ..+||+|+- +...++..||.|.+....+-.+.+|++...++..+..+.++++.++.+..... T Consensus 243 d~~~vl~-esgy--~e~P~i~~R--w~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~~~-- 315 (555) T protein:vir:10 243 DETRTLR-ESGY--RSFRALCPR--WALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQLPVSAKNQD-- 315 (555) T ss_pred CCccccc-cCCc--ccCCceeee--eeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecccccccc-- Confidence 2344553 2334 578999764 44568899999999999999999999999999999999999999988765322 Q ss_pred HHhhcccCCCceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHH--Hhc-cccchhHHHHHHHHHHH Q lcl|Aclame:pro 398 EWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDA--SLG-AMGNETSGRAIIARQRQ 474 (711) Q Consensus 398 ~~~~~~~~~~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~--~~G-~~~~~~sg~ai~~~~~~ 474 (711) ....||++..+.++...+......+....-+...+.++...+.|.... ..+. +++ .++...|+..|..+.+- T Consensus 316 ----~~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af-~~dlf~~l~~~~~~~~TAtEV~~r~~E 390 (555) T protein:vir:10 316 ----ISTVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASF-YADLFLMLANGTNPQMTATEVAERHEE 390 (555) T ss_pred ----ceeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHh-hcchhhhccCCCCCcccHHHHHHHHHH Confidence 246688876666554433222222333344556677777777777655 3442 222 34456899999999888 Q ss_pred HHHHHHHHHHHHH-HHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEee Q lcl|Aclame:pro 475 GDRGSFAFIDNLT-KSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVT 553 (711) Q Consensus 475 ~~~~~~~~~dn~~-~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~ 553 (711) ....+..++.+|. ++...+.+..+.++.+.---|. -+.. +....+.|..- T Consensus 391 ~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~----------------~P~~-------------l~~~~i~v~yi 441 (555) T protein:vir:10 391 KLLMLGPVLERMHNEILDPLIELTFQRMVEANILPP----------------PPQE-------------MQGVDLNVEFV 441 (555) T ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCC----------------Cchh-------------hcCceeEEEec Confidence 8889988888875 4666666666665555310000 0000 00011222222 Q ss_pred cccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHHHHH Q lcl|Aclame:pro 554 TGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPE 633 (711) Q Consensus 554 ~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~ 633 (711) . +-...++......+.++++....+.++ .+ +.++.-+.+++.+.+....+-....--..++.++..++.++++++ T Consensus 442 s-~La~aq~~~~~~~i~~~l~~i~~laq~-~P---~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~ 516 (555) T protein:vir:10 442 S-MLAQAQRAIATNSVDRFVGNLGAVAGI-KP---EVLDKFDADRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQA 516 (555) T ss_pred c-HHHHHHHHHHHHHHHHHHHHHHHHhcC-Ch---hhhhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHH Confidence 2 222334444444444444443222221 11 223444667777776666544321111111111111111111100 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 634 QQVEMAKSQADMAQAEADTAQAQADMLKAQLETEEAQKQLAMIEDMAQGG 683 (711) Q Consensus 634 ~q~~~~~~q~~~~k~qae~~~aqae~~~~q~~~~~~q~q~~~~~~~~q~~ 683 (711) ++++ .++.+.....+.+.. +...-. ...........-=. T Consensus 517 ~~~a--~~~~q~~~~~~~~~~-------~~~~~~--~~~~~~~~~~~~~~ 555 (555) T protein:vir:10 517 AQQA--ALLNQGADTAAKLGS-------VDTSKQ--NALTDVTRAFSGYT 555 (555) T ss_pred HHHH--HHHHHHHHHHHHhcc-------cccCcc--hhHHHHHhhhccCC Confidence 0000 000000000000000 000000 00000000000000 No 33 >protein:vir:10447 Length: 536 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848294;genbank:gi:30387485;genbank:GeneID:1733984 Probab=99.88 E-value=1.1e-20 Score=130.11 Aligned_cols=520 Identities=12% Similarity=0.051 Sum_probs=267.1 Q ss_pred CCcCCCCCCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCC--CHHHHHHHHHhCC Q lcl|Aclame:pro 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQW--PSQVRTERELEQR 78 (711) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw--~~~~~~~~~~~g~ 78 (711) ||+++ ....-+.+..+|+...+....|...|.++.+|..-.=. +..... +.. T Consensus 1 m~~~~----------------------~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~----~~~ 54 (536) T protein:vir:10 1 MAEKR----------------------TGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNAS----TDY 54 (536) T ss_pred Ccchh----------------------hchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCccc----ccc Confidence 44321 11123456666777666778899999999999743211 111100 111 Q ss_pred CceEehhhHHHHHHHhhhhhhc----ccceeEecchhhhhhhhhcccccccccccCCCchhHHH------HHHHHHHHHH Q lcl|Aclame:pro 79 PCLVNNVLPTFVDQVLGDQRQN----RPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYEL------AEVFTGLIKN 148 (711) Q Consensus 79 p~~~~N~i~~~v~~i~g~~~~~----r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~------Ae~l~~~~~~ 148 (711) ..+.-+.-...++.+.+..-.. ++=.++.+.+.. .. .....+... -+.++..+.. T Consensus 55 ~~~~dst~~~a~~~Laa~l~~~ltP~~~WFrl~~~d~~------------~~---~~~~~~~~~~~v~~~L~~ve~~~~~ 119 (536) T protein:vir:10 55 QTPWQAVGARGLNNLASKLMLALFPMQTWMRLTISEYE------------AK---QLLSDPDGLAKVDEGLSMVERIIMN 119 (536) T ss_pred cccccccHHHHHHHHHHHHHhhhcCCCcccccccChhh------------hh---ccccchhhHHHHHHHHHHHHHHHHH Confidence 1222333333344333222111 110111111000 00 000001111 2234555666 Q ss_pred HHhhcCHHHHHHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCccceeeCCCccccCccccceeeeeecCCHHH Q lcl|Aclame:pro 149 IEYNCDAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEK 228 (711) Q Consensus 149 ~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e 228 (711) ....|+|......++.+.+..|+|+.-+ + +++-.+-..++.+ +..++++..++.- ...-++++..||... T Consensus 120 ~l~~snf~~~~~~~~~~L~~~G~a~ly~--~---e~~~~~~~~~~~~-pl~~~~v~~d~~G----~vd~i~r~~~~t~~~ 189 (536) T protein:vir:10 120 YIESNSYRVTLFEALKQLVVAGNVLLYL--P---EPEGSNYNPMKLY-RLSSYVVQRDAFG----NVLQMVTRDQIAFGA 189 (536) T ss_pred HHHhcCcHHHHHHHHHHHHhHCcEeEEE--e---eCCCCceeeEEEE-EcCeEEEeeCCCC----CeeEEeeeeeccHHH Confidence 6678999999999999999999998533 2 2222222335555 5678888765431 344578999999999 Q ss_pred HHHhcCCcccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccc Q lcl|Aclame:pro 229 FKALYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVK 308 (711) Q Consensus 229 ~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 308 (711) +.+.||......... . ...+.|.|+++-+.+.. ++. T Consensus 190 l~~~fg~~~~~~~~~---~-----~~~~~v~v~~~V~~~~~--------~~~---------------------------- 225 (536) T protein:vir:10 190 LPEDIRKAVEGQGGE---K-----KADETIDVYTHIYLDEA--------SGE---------------------------- 225 (536) T ss_pred HHHhhhhhhcccccc---c-----CcccceEEEEEEEEecC--------CCc---------------------------- Confidence 999998754222111 1 11245566655443321 111 Q ss_pred eEEEEEEEEecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEec Q lcl|Aclame:pro 309 TFKTYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGS 388 (711) Q Consensus 309 ~~~v~~~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~ 388 (711) + .++.-..|..++...+.|+...+||+++. +..+++..||.|.+....+-.+.+|.+...++.....+.++.++++ T Consensus 226 -~-~~~~e~~g~~v~~~~g~~~f~~~P~i~~R--w~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~ 301 (536) T protein:vir:10 226 -Y-LRYEEVEGMEVQGSDGTYPKEACPYIPIR--MVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVN 301 (536) T ss_pred -E-EEEEeecCccccccccccccccCCceeee--eeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccC Confidence 1 12223345555555566778899999764 4456889999999999999999999999999999999999999999 Q ss_pred ccccCChHHHHhhcccCCCceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHH Q lcl|Aclame:pro 389 EGNVEGREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAI 468 (711) Q Consensus 389 ~~av~~~~~~~~~~~~~~~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai 468 (711) ++.+.+..+.. ...+|.++..++ ++..+.........+.....++...+.|.+..=+ +.+.-.++.+.|+.-| T Consensus 302 p~g~~~~~~~~---~~~~g~~v~g~~---~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~-~~l~~~~~~r~TAtEV 374 (536) T protein:vir:10 302 PAGITQPRRLT---KAQTGDFVTGRP---EDISFLQLEKQADFTVAKAVSDAIEARLSFAFML-NSAVQRTGERVTAEEI 374 (536) T ss_pred cccccchhhhc---cCCCcceecCCc---ccceeeeccccccchHHHHHHHHHHHHHHHHHhh-hhcccCCCCCccHHHH Confidence 88887766432 244566654333 2222333444455566677888888888776632 2222255667899999 Q ss_pred HHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhhee Q lcl|Aclame:pro 469 IARQRQGDRGSFAFIDNLTK-SIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQK 547 (711) Q Consensus 469 ~~~~~~~~~~~~~~~dn~~~-~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~ 547 (711) ..+.+.....+...+.+|.. +...+.+..+.++...- . +-.+-. ++ T Consensus 375 ~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g-------------~--lP~~p~--------------~~---- 421 (536) T protein:vir:10 375 RYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQ-------------Q--IPELPK--------------EA---- 421 (536) T ss_pred HHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhCC-------------C--CCCCCh--------------hh---- Confidence 99999988888888888764 55556666665553321 0 000000 00 Q ss_pred eeEEeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcC-CcchHHHHHHHHhhhcc--hhhcchhhhhhhhhH Q lcl|Aclame:pro 548 YDVVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMD-WPGADVIAERLKKIVPP--NVLSKDEREAIEEDM 624 (711) Q Consensus 548 ~dv~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~-~~~~~e~~~~l~~~~~~--~~~~~~~~~~~~~~~ 624 (711) +.+.+..+. ....|.+..+.+.++++.+.++.+. .++ .-+.+++.+.+....+- ......+.+..+..+ T Consensus 422 v~~~~vs~l-~~l~r~~~~~~l~~~~~~la~~~P~-------~ld~~id~d~~~~~~a~~~Gv~p~~~irt~eev~~~r~ 493 (536) T protein:vir:10 422 VEPTISTGL-EAIGRGQDLDKLERCVTAWAALAPM-------RDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMA 493 (536) T ss_pred ccceEEecH-HHHHHHHHHHHHHHHHHHHHhhchh-------hhcccCCHHHHHHHHHHHcCCCchhhcCCHHHHHHHHH Confidence 122222222 2345555556666655543322221 222 23677777777666543 222222211111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 625 PEQTEPTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQLETEEAQKQLAMIEDMAQGGDVVYQQVRELV 694 (711) Q Consensus 625 ~~~q~~~~~~q~~~~~~q~~~~k~qae~~~aqae~~~~q~~~~~~q~q~~~~~~~~q~~~~~~~~~~~~~ 694 (711) +++++++.+++ +.+..+. +.+++. .+ .+...+....+ -++..+ T Consensus 494 q~~~~~~~~~~-a~~~~~~---------~~~~~~--~~-~~~~~~~~~~~--------------g~~~~~ 536 (536) T protein:vir:10 494 QQSMQMGMDNG-AAALAQG---------MAAQAT--AS-PEAMAAAADSV--------------GLQPGI 536 (536) T ss_pred HHHHHHHHHHH-HHHHHHH---------HHHHHh--cC-chhHHhhhhcc--------------ccCCCC Confidence 11110000000 0000000 000000 00 00000000000 000000 No 34 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=99.88 E-value=1.4e-22 Score=140.34 Aligned_cols=479 Identities=11% Similarity=0.023 Sum_probs=244.3 Q ss_pred CCcCCCCCCCCcccC-----CCcccCCcC-------cchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHH Q lcl|Aclame:pro 1 MAKKQKKSRVEQLYA-----KKAKVYAKN-------NDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQ 68 (711) Q Consensus 1 ~~~~~~~~~~~~~~~-----~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~ 68 (711) |-++-+.-..-.... .+.....+. ..+..+.+.++...++ ..-.....+..+||.|+++.-. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~------~~~~~r~~~~~~yY~g~~~~i~ 74 (501) T protein:vir:96 1 MEQTLFTDSTGQERVLNLRFHRESRIRYRADNLEELMVNNWELLKNFINHHK------LRQAPRIQELLDYARGENHDVL 74 (501) T ss_pred CceeeeeecccceeccccccchhHHhhhcccccccccCChHHHHHHHHHHHH------HHHHHHHHHHHHHhcCCCCccc Confidence 555432211111111 011111111 1122223333333222 1223355677899999987543 Q ss_pred HHHHHHHhCCC--ceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHH Q lcl|Aclame:pro 69 VRTERELEQRP--CLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLI 146 (711) Q Consensus 69 ~~~~~~~~g~p--~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~ 146 (711) ........++| .+++|..+.+|+..+|+.-.+.+.+.+. +....+.+...+ T Consensus 75 ~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~g~p~~~~~~---------------------------~~~~~~~~~~~l 127 (501) T protein:vir:96 75 KSGRRKDNEMADKRAVHNYGRMISKFKTGYLAGNPIRVEYD---------------------------DNDDNSQNDDAI 127 (501) T ss_pred CccccCccccccceeecchHHHHHHHHhhhhcccCeeEeeC---------------------------CccchhHHHHHH Confidence 22223333444 4789999999999999988776666442 112234456667 Q ss_pred HHHHhhcCHHHHHHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCcccee--eCCCccccCccccceeeeeecC Q lcl|Aclame:pro 147 KNIEYNCDAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDTM 224 (711) Q Consensus 147 ~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~ 224 (711) ..+++.|+++.....+..+++++|.||.-++.+. ++++++..+ +|.+++ ||+.... ...+ +++.|. T Consensus 128 ~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~de------dg~~~i~~~-~p~~~~~v~d~~~~~----~~~~-~v~~~~ 195 (501) T protein:vir:96 128 KRIGRINDLDSLNRTLIRDLSQTGRAYEVIYRSE------YDETRIKRL-SPLETFVIYDNSLED----NSIA-AVRYYN 195 (501) T ss_pred HHHHHhcCHHHHHHHHHHHHhhcCeEEEEEEEcC------CCceEEEEE-ccceeEEEEcCCCCC----ceEE-EEEEEE Confidence 7788889999999999999999999998877642 256788776 787765 5543210 1111 111110 Q ss_pred CHHHHHHhcCCcccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhh Q lcl|Aclame:pro 225 SKEKFKALYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRT 304 (711) Q Consensus 225 ~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~ 304 (711) .. ...+.+..+++|.... + T Consensus 196 ----------~~----------------~~~~~~~~~~vyt~~~------------i----------------------- 214 (501) T protein:vir:96 196 ----------RG----------------TLQSAKDVVEIYTDEH------------I----------------------- 214 (501) T ss_pred ----------ee----------------cCCCcEEEEEEEcCCc------------E----------------------- Confidence 00 0011233344443321 1 Q ss_pred cccceEEEEEEEEecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCc Q lcl|Aclame:pro 305 RKVKTFKTYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAP 384 (711) Q Consensus 305 ~~~~~~~v~~~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~ 384 (711) +.+...+...+.+..|.+.|.+|+|+|. +...|.|.+..++++++.+|..+|.+...+...+.+. T Consensus 215 --------~~~~~~~~~~~~~~~~~~~g~vPvv~~~-------nn~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~~~ 279 (501) T protein:vir:96 215 --------YTLDASDDFNEISVTTHAFGTVPITEYL-------NNIDGIGDYETELYLIDLYDSAESDTANHMSDMADAI 279 (501) T ss_pred --------EEEeeCCCceeccccccCCCccceEEec-------CCccCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCce Confidence 1111112222223345566778877652 2345789999999999999999999999999888887 Q ss_pred eEecccccCChHHHHhhcccCCCceEEeccc-----ccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccc Q lcl|Aclame:pro 385 FIGSEGNVEGREDEWEQANTKNFSLLTYIPQ-----YQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAM 459 (711) Q Consensus 385 ~~~~~~av~~~~~~~~~~~~~~~~~i~~~~~-----~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~ 459 (711) +++.-....+..+.... .+...++.+... ...+..++++..+.-...+...++.....|-.+|++++.+.|.. T Consensus 280 l~i~G~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~ 357 (501) T protein:vir:96 280 LAIYGDLALPKGMQASD--MKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNTPDMSDTNF 357 (501) T ss_pred eeeecccccCcccchhh--hhhcCeeeecccccccccccCcceeeEeccCCHHHHHHHHHHHHHHHHHHhCCcccCcccc Confidence 77643323222222221 222333443321 11223456666665667788888999999999999999888876 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceee Q lcl|Aclame:pro 460 GNETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVT 539 (711) Q Consensus 460 ~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~ 539 (711) +++.||.|+..+............+.|..+++++.++++.++.... +....++ T Consensus 358 ~~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~----------~~~~~d~----------------- 410 (501) T protein:vir:96 358 SGNTSGEALKYKLFGLDQDRVDTQSQFTKGLKRRYRLAARIGSLVN----------EFKDFDE----------------- 410 (501) T ss_pred cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc----------ccccccc----------------- Confidence 6778999998876666666666666777777777776666543321 1001010 Q ss_pred eehhhheeeeEEeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCC-cchHHHHHHHHhhhcchhhcchhhh Q lcl|Aclame:pro 540 IHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDW-PGADVIAERLKKIVPPNVLSKDERE 618 (711) Q Consensus 540 ~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~-~~~~e~~~~l~~~~~~~~~~~~~~~ 618 (711) .+|.+.=.+..+....+..+.+..+...++. ..+++.+++ .+.+.-.+++.+............. T Consensus 411 --------~~i~i~f~~~~p~n~~e~ad~~~kl~g~iS~------et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~ 476 (501) T protein:vir:96 411 --------SLLKITFTPNLPKSLNEQVSILTGLGGQVSQ------ETALSLSGLVESPNEELDKINKEMSEIDFKGYSND 476 (501) T ss_pred --------ccceEEeCCCCCcCHHHHHHHHHHHhccCch------HHHHHhCCCCCCHHHHHHHHHHHHHHhhccccccc Confidence 0122222333333334444455555433322 233444433 2222223333222111000000000 Q ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 619 AIEEDMPEQTEPTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQLETEEAQKQLA 674 (711) Q Consensus 619 ~~~~~~~~~q~~~~~~q~~~~~~q~~~~k~qae~~~aqae~~~~q~~~~~~q~q~~ 674 (711) .. .. ......++...+++ .-+.+- + T Consensus 477 ~~----~~-----------~~~~~~~~~e~~~d--~~e~~~--------------~ 501 (501) T protein:vir:96 477 FN----EH-----------VGKYTDEVKETHTD--DFEREY--------------E 501 (501) T ss_pred hh----hc-----------ccccCCcCCCCCCC--cccccc--------------C Confidence 00 00 00000000000000 000000 0 No 35 >protein:vir:2198 Length: 536 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041995;swissprot:sw:p03728;genbank:gi:9627467;goa:P03728;uniprot:P03728;genbank:GeneID:1261033 Probab=99.88 E-value=1.9e-20 Score=128.74 Aligned_cols=520 Identities=12% Similarity=0.040 Sum_probs=265.7 Q ss_pred CCcCCCCCCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCC--CCCHHHHHHHHHhCC Q lcl|Aclame:pro 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGE--QWPSQVRTERELEQR 78 (711) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~--Qw~~~~~~~~~~~g~ 78 (711) ||+++ ....-+.+..+|+...+....|...|.++.+|..-. ..+..... +.. T Consensus 1 m~~~~----------------------~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~----~~~ 54 (536) T protein:vir:21 1 MAEKR----------------------TGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNAS----TDY 54 (536) T ss_pred Ccchh----------------------hchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCccc----ccc Confidence 44321 111234566677777777788999999999997432 11111111 111 Q ss_pred CceEehhhHHHHHHHhhhhhhc----ccceeEecchhhhhhhhhcccccccccccCCCchhHHH------HHHHHHHHHH Q lcl|Aclame:pro 79 PCLVNNVLPTFVDQVLGDQRQN----RPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYEL------AEVFTGLIKN 148 (711) Q Consensus 79 p~~~~N~i~~~v~~i~g~~~~~----r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~------Ae~l~~~~~~ 148 (711) ..+.-+.-...++.+.+..-.. ++=.++.+.+.. .. .....+... -+.++..+.. T Consensus 55 ~~~~dst~~~a~~~Laa~l~~~ltP~~~WFrl~~~d~~------------~~---~~~~~~~~~~~v~~~L~~ve~~~~~ 119 (536) T protein:vir:21 55 QTPWQAVGARGLNNLASKLMLALFPMQTWMRLTISEYE------------AK---QLLSDPDGLAKVDEGLSMVERIIMN 119 (536) T ss_pred cccccccHHHHHHHHHHHHHHhhcCCCcccccccChhh------------hh---ccccchhhHHHHHHHHHHHHHHHHH Confidence 1223333333344333222111 110111111000 00 000011111 2234556666 Q ss_pred HHhhcCHHHHHHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCccceeeCCCccccCccccceeeeeecCCHHH Q lcl|Aclame:pro 149 IEYNCDAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEK 228 (711) Q Consensus 149 ~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e 228 (711) ....|+|......++.+.+..|+|+.-+ + +++-.+-..++.+ +..++++..++.- ...-++++..||... T Consensus 120 ~l~~snf~~~~~~~~~~L~~~G~a~ly~--~---e~~~~~~~~f~~~-pl~~~~v~~d~~G----~vd~i~r~~~~t~~~ 189 (536) T protein:vir:21 120 YIESNSYRVTLFEALKQLVVAGNVLLYL--P---EPEGSNYNPMKLY-RLSSYVVQRDAFG----NVLQMVTRDQIAFGA 189 (536) T ss_pred HHHhcCcHHHHHHHHHHHHhHCcEeEEE--e---eCCCCceeeEEEE-EcCeEEEeeCCCC----CeeEEeeeeeccHHH Confidence 6678999999999999999999998533 2 2222222335555 5678888765431 344588999999999 Q ss_pred HHHhcCCcccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccc Q lcl|Aclame:pro 229 FKALYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVK 308 (711) Q Consensus 229 ~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 308 (711) +.+.||+........ . ...+.|.++++-+.+.. ++. T Consensus 190 l~~~fg~~~~~~~~~---~-----~~~~~v~v~~~v~~~~~--------~~~---------------------------- 225 (536) T protein:vir:21 190 LPEDIRKAVEGQGGE---K-----KADETIDVYTHIYLDED--------SGE---------------------------- 225 (536) T ss_pred HHHhhhhhhcccccc---c-----ccccceeEEEEEEEecC--------CCc---------------------------- Confidence 999999754322111 0 11245555554433221 011 Q ss_pred eEEEEEEEEecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEec Q lcl|Aclame:pro 309 TFKTYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGS 388 (711) Q Consensus 309 ~~~v~~~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~ 388 (711) +.+ +.-..|..++.....|+...+||+++. +..+++..||.|.+....+-.+.+|.+...++.....+.++.++++ T Consensus 226 -~~~-~~e~~g~~v~~~~g~~~f~~~P~i~~R--w~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~ 301 (536) T protein:vir:21 226 -YLR-YEEVEGMEVQGSDGTYPKEACPYIPIR--MVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVN 301 (536) T ss_pred -EEE-EeccCCeeeccccCccccccCCeeeee--eeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccC Confidence 111 122334445444556778899999764 4456889999999999999999999999999999999999999999 Q ss_pred ccccCChHHHHhhcccCCCceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHH Q lcl|Aclame:pro 389 EGNVEGREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAI 468 (711) Q Consensus 389 ~~av~~~~~~~~~~~~~~~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai 468 (711) ++.+.+..+.. ...+|.++.-++ ++..+.........+.....++...+.|.+..=+ +...-.++.+.|+.-| T Consensus 302 p~g~~~~~~~~---~~~~g~~v~g~~---~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~-~~l~~~~~~r~TAtEV 374 (536) T protein:vir:21 302 PAGITQPRRLT---KAQTGDFVTGRP---EDISFLQLEKQADFTVAKAVSDAIEARLSFAFML-NSAVQRTGERVTAEEI 374 (536) T ss_pred cccccchhhhc---cCCCcceecCCc---ccceeeeccccccchHHHHHHHHHHHHHHHHHhh-hhcccCCCCCccHHHH Confidence 88887766432 244566654333 2222333444455566677888888888776632 2222255667899999 Q ss_pred HHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhhee Q lcl|Aclame:pro 469 IARQRQGDRGSFAFIDNLTK-SIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQK 547 (711) Q Consensus 469 ~~~~~~~~~~~~~~~dn~~~-~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~ 547 (711) ..+.+.....+...+.+|.. +...+.+..+.++...- . +-.+-. ++ T Consensus 375 ~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g-------------~--lP~~p~--------------~~---- 421 (536) T protein:vir:21 375 RYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQ-------------Q--IPELPK--------------EA---- 421 (536) T ss_pred HHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhCC-------------C--CCCCCh--------------hh---- Confidence 99999988888888888764 55556666665553321 0 000000 00 Q ss_pred eeEEeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcC-CcchHHHHHHHHhhhcc--hhhcchhhhhhhhhH Q lcl|Aclame:pro 548 YDVVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMD-WPGADVIAERLKKIVPP--NVLSKDEREAIEEDM 624 (711) Q Consensus 548 ~dv~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~-~~~~~e~~~~l~~~~~~--~~~~~~~~~~~~~~~ 624 (711) +.+.+..+. ....|.+..+.+.++++.+.++.+. .++ .-+.+.+.+.+....+- ......+.+..+..+ T Consensus 422 v~~~~vs~l-~~l~r~~~~~~l~~~~~~la~~~Pe-------~ld~~id~d~~~~~~a~~~Gv~p~~~irt~eev~~~r~ 493 (536) T protein:vir:21 422 VEPTISTGL-EAIGRGQDLDKLERCVTAWAALAPM-------RDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMA 493 (536) T ss_pred ccceEEecH-HHHHHHHHHHHHHHHHHHHHhhchh-------hhcccCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHH Confidence 122222222 2345555556666655543222221 122 23667777777665543 222222211111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 625 PEQTEPTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQLETEEAQKQLAMIEDMAQGGDVVYQQVRELV 694 (711) Q Consensus 625 ~~~q~~~~~~q~~~~~~q~~~~k~qae~~~aqae~~~~q~~~~~~q~q~~~~~~~~q~~~~~~~~~~~~~ 694 (711) +++++++.++ + +.+..+.+++....+ .+...+....+ -++..+ T Consensus 494 q~~~~~~~~~-------~-----a~~~~~~~~~~~~~~-~~~~~~~~~~~--------------g~~~~~ 536 (536) T protein:vir:21 494 QQSMQMGMDN-------G-----AAALAQGMAAQATAS-PEAMAAAADSV--------------GLQPGI 536 (536) T ss_pred HHHHHHHHHH-------H-----HHHHHHHHHHHHhcC-hhhHHhhhhcc--------------ccCCCC Confidence 1110000000 0 000000000000000 00000000000 000000 No 36 >protein:vir:1785 Length: 555 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570351;genbank:gi:18640510;genbank:GeneID:932723 Probab=99.87 E-value=3.3e-20 Score=127.39 Aligned_cols=529 Identities=14% Similarity=0.101 Sum_probs=253.0 Q ss_pred HHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCCceEehhhHHHHHHHhhhhhh-----ccccee Q lcl|Aclame:pro 31 LLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRPCLVNNVLPTFVDQVLGDQRQ-----NRPAIK 105 (711) Q Consensus 31 ~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~v~~i~g~~~~-----~r~~~~ 105 (711) .-+.+..+|+...+....|...|.++.+|..-.-...+- .. .......+.-+.-...++.+.+..-. +++=.+ T Consensus 1 m~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~-~~-~~~~~~~~~dst~~~a~~~Laa~l~~~ltpp~~~WF~ 78 (555) T protein:vir:17 1 MKHSAQAKYMMLRADREDYLDSGRQSARLTLPYILTDEG-HV-QGGYLPTPWQSVGSKGVNVLASKLMLSLFPVNTSFFK 78 (555) T ss_pred ChhHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCC-Cc-ccccccccccccHHHHHHHHHHHHHHhhcCCCCcccc Confidence 334566777777777888999999999997432111100 00 00111222333333344443332211 122223 Q ss_pred EecchhhhhhhhhcccccccccccCCCchh-HHHHHH---HHHHHHHHHhhcCHHHHHHHHHHHHHhcCccEEEEEEeec Q lcl|Aclame:pro 106 VSSTEVTRVPDAESGEDTTLKISNVAGKND-YELAEV---FTGLIKNIEYNCDAETEYDIAFQGAVESGMGYLRVRSDYL 181 (711) Q Consensus 106 ~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d-~~~Ae~---l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~ 181 (711) +.+.+... . ....++.. ..+.+. ++..+......|+|......++.+.+..|+|++ |. T Consensus 79 l~~~d~~~-~------------~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l-----y~ 140 (555) T protein:vir:17 79 LQINDAEI-D------------NLGMDEQARSEIDLSLSRIERIVTQDIAESSDRVHLEMAMKHLIVTGNALL-----YQ 140 (555) T ss_pred cccCHHHH-h------------hccCCHHHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEE-----Ee Confidence 33221000 0 00000111 112222 445566666789999999999999999999985 22 Q ss_pred cCCCCCCcceEEEecCccceeeCCCccccCccccceeeeeecCCHHHHHHhcCCcccchhhcccc-ccccc--------- Q lcl|Aclame:pro 182 ADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSV-ADYDT--------- 251 (711) Q Consensus 182 ~~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~-~~~~~--------- 251 (711) +++++ +.+ +..++++..++.- ...-++++.+||..++.+.|++....+-..... ...+. T Consensus 141 ~~~~~------~~~-pl~~y~v~~d~~G----~vd~v~rk~~~t~~ql~~~fg~~~l~~~~~~~~~~~~d~~~~~~~~~~ 209 (555) T protein:vir:17 141 GKKNL------KLY-PLDRFVVSRDGEG----NVMEIVTEEQIDRSLLPEEFQKVGGLEGAPDSNAVGEDGPKMGVTAPG 209 (555) T ss_pred cCCce------eEE-EcCeEEEeeCCCc----CeeEEEeeeeecHHHHHHHhhhccccchhhhhhhccccchhhhhhhhc Confidence 33332 223 4566777665531 345588999999999999998754321111100 00000 Q ss_pred ---CCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEEecCcee-ccCc Q lcl|Aclame:pro 252 ---WFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVL-EGPV 327 (711) Q Consensus 252 ---~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l-e~~~ 327 (711) +.....+.+..++.+ .++ +++|+.-..+..+ ...+ T Consensus 210 ~~~~~~~~~~~v~t~~~~----------~~~-------------------------------~~~~~~e~~~~~v~~~l~ 248 (555) T protein:vir:17 210 GRDKGKSNDALVYTYVCR----------KDG-------------------------------QVKWHQECDGKVIPGSNS 248 (555) T ss_pred ccccCCCcceeEeecccc----------cCC-------------------------------eeEEEEecCceecccccc Confidence 001111111111111 011 1223333333322 2123 Q ss_pred cCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHHHHhhcccCCC Q lcl|Aclame:pro 328 EIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNF 407 (711) Q Consensus 328 p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~~~~~~~~ 407 (711) .++...+||+++. +..+++..||.|.+....+-.+.+|.+...++..+..+.+++++++++.+.+..+.. |+ T Consensus 249 e~g~~e~P~i~~R--w~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~lv~~~g~~~~~~l~------~~ 320 (555) T protein:vir:17 249 SAPYTHNPWIPLR--FNIVDGEAYGRGRVEEFMGDLKSLEALSQAMVEGSAASAKVVFMVSPSATTKPQNLA------LA 320 (555) T ss_pred ccCcccCCeeeee--eeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccCcceee------cC Confidence 3455689999764 445688999999999999999999999999999999999999999888877665432 22 Q ss_pred ceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 408 SLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLT 487 (711) Q Consensus 408 ~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~ 487 (711) +.-.+.++...+..+-....+..-+.....++...+.|.+..-+. .-.++.+.|+.-|..+.+.....+..++..|. T Consensus 321 ~~g~v~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~aFm~~---~~~d~~r~TAtEV~~r~~E~~~~LGpv~~rl~ 397 (555) T protein:vir:17 321 ANGAIIQGRPDDVSVVQANKAADFRTVLEMIQKLEQRISDAFLML---QVRQSERTTATEVQATVQELNEQIGGIYSNLT 397 (555) T ss_pred CCceeecCCcccceeeeccccchhhHHHHHHHHHHHHHHHHHhhc---CCCCcccchHHHHHHHHHHHHHHHhHHHHHHH Confidence 211122333222222222233334445666676667776654321 12445668999999999999999999999885 Q ss_pred -HHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEeecccChHHHHHHHH Q lcl|Aclame:pro 488 -KSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAA 566 (711) Q Consensus 488 -~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~ 566 (711) +++..+.+..+.++.+.---|. + ..+. ...++ ..+ .....|++.. T Consensus 398 ~E~L~Pli~R~~~il~r~g~lP~---~-----p~~~-----------------------v~~~i--~~~-l~~l~r~~~~ 443 (555) T protein:vir:17 398 TELLQPYLARKLHLLQKQRKLPQ---L-----PKDL-----------------------VQPTV--VAG-LWGVGRGQDK 443 (555) T ss_pred HHHHHHHHHHHHHHHHhCCCCCC---C-----CHhh-----------------------hccce--eeh-HHHHHHHHHH Confidence 5777777777777666421100 0 0000 01122 222 2234455555 Q ss_pred HHHHHHHhhcchhHHHHHHHHHHhcCCcchHHHHHHHHhhhcc--hhhcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 567 EAMIQFAQAVPSAAAVMADLIAQNMDWPGADVIAERLKKIVPP--NVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQAD 644 (711) Q Consensus 567 ~~L~~l~~~~p~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~--~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~~ 644 (711) +.+.++++.+.++.. + ...++.-+.+++.+.+....+- ......+ ++.++..++++++++ +++..++.++ T Consensus 444 ~~l~~~~~~laq~~~---~--p~~~d~id~d~~~~~~a~~~Gv~p~~ivrs~-eev~~~rq~~~~~~~--q~~~~~qa~~ 515 (555) T protein:vir:17 444 QQLMEFITTLAQTMG---P--EIAMKYINPTEFIKRLAAAQGIDTLQLINSP-ETMKQLGDQQKQDMV--QASLINQAGQ 515 (555) T ss_pred HHHHHHHHHHHhhcC---c--hhHhhcCCHHHHHHHHHHHcCCChhhhcCCH-HHHHHHHHHHHHHHH--HHHHHHHHHH Confidence 556555544322211 0 2344555667777777665543 1121111 111111111111000 0000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 645 MAQAEADTAQAQADMLKAQLETEEAQKQLAMIEDMAQGGDVVYQQVRELVAQALAEITASQA 706 (711) Q Consensus 645 ~~k~qae~~~aqae~~~~q~~~~~~q~q~~~~~~~~q~~~~~~~~~~~~~~~~~~e~~~~qa 706 (711) ++.+-... ++ ..+...+...++..-+ .++. ....-..+.+ T Consensus 516 ~~~~~~~~---~~-~~~~~~~~~~a~~~~~----a~~~--------------~~~~~~~~~~ 555 (555) T protein:vir:17 516 LAKTPMAE---QA-MQLIQQQQEGAQDAGA----AESE--------------TSSAEAQAGA 555 (555) T ss_pred HHhhhhhh---hH-HhccccchhhhhHHHH----HHhh--------------cCCcccccCC Confidence 00000000 00 0000000000000000 0000 0000000000 No 37 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=99.87 E-value=3.5e-21 Score=132.72 Aligned_cols=475 Identities=11% Similarity=0.018 Sum_probs=238.7 Q ss_pred CCcC---------------CCC-CCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCC Q lcl|Aclame:pro 1 MAKK---------------QKK-SRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQ 64 (711) Q Consensus 1 ~~~~---------------~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Q 64 (711) |-.| ++. ++..=.. +..+. ......+.+..+... +....+.+..+-.+||.|++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~---~~~~~~~~l~~~i~~------~~~~~~~r~~~l~~yY~g~~ 70 (501) T protein:vir:27 1 MEQTLFTDSTGQDLVLNLRFHRESRIRYRA-DNLEE---LMVNNWELLKNFINH------HKLRQAPRIQELLDYARGEN 70 (501) T ss_pred CCceeEEeccchhhhhhcccChhHHHhhcc-ccccc---cccccHHHHHHHHHH------HHHHHHHHHHHHHHHhcCCC Confidence 3222 111 1110000 01011 011222223333221 11222345567789999987 Q ss_pred CCHHHHHHHHHhCCC--ceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHH Q lcl|Aclame:pro 65 WPSQVRTERELEQRP--CLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVF 142 (711) Q Consensus 65 w~~~~~~~~~~~g~p--~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l 142 (711) ..-......+..++| .+++|..+.+|+..+|+.-.+.+.+.... ....+.+ T Consensus 71 ~~i~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~~d---------------------------~~~~~~~ 123 (501) T protein:vir:27 71 HDVLQFGRRKDREMADKRAVHNYGRMISKFKTGYLAGNPIRVEYDD---------------------------NDNNSQN 123 (501) T ss_pred ccccccCccCccccccceeccchHHHHHHHHhhhhcccCeeEecCC---------------------------ccchHHH Confidence 643222223334444 57889999999999999887776665422 1223344 Q ss_pred HHHHHHHHhhcCHHHHHHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCcccee--eCCCccccCccccceeee Q lcl|Aclame:pro 143 TGLIKNIEYNCDAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLI 220 (711) Q Consensus 143 ~~~~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~ 220 (711) ...+..+++.|+++.....+..+++++|.+|..|+.+. ++++++..+ +|.+++ ||+.... +.-+ ++ T Consensus 124 ~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~de------d~~~~i~~~-~p~~~~~v~d~~~~~----~~~~-~i 191 (501) T protein:vir:27 124 DDTIKRIGRINDIDSHNRTLIRDLSQTGRAYEVIYRNE------YDETRIKRL-NPLETFVIYDNSLED----NSIA-AV 191 (501) T ss_pred HHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEEeCC------CCceEEEEE-ccceeEEEecCCCCC----ceEE-EE Confidence 56677788889999999999999999999998877652 256788776 787776 4543210 1111 22 Q ss_pred eecCCHHHHHHhcCCcccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCch Q lcl|Aclame:pro 221 DDTMSKEKFKALYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGIS 300 (711) Q Consensus 221 ~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ 300 (711) +.|.. +...+.+..+++|..... T Consensus 192 r~~~~--------------------------~~~~~~~~~~~vyt~~~v------------------------------- 214 (501) T protein:vir:27 192 RYYNR--------------------------GTLQNAKDVVEIYTNEHI------------------------------- 214 (501) T ss_pred EEEEe--------------------------eecCCcEEEEEEEeCCeE------------------------------- Confidence 22210 001122344455433211 Q ss_pred hhhhcccceEEEEEEEEecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhc Q lcl|Aclame:pro 301 IVRTRKVKTFKTYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALA 380 (711) Q Consensus 301 ~~~~~~~~~~~v~~~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~ 380 (711) +.+...|...+.+..|.+.|.+|+|+|. +...+.|.+..++++++.+|..+|.+.+.+... T Consensus 215 ------------~~~~~~~~~~~~~~~~~~~g~vPvv~~~-------nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~ 275 (501) T protein:vir:27 215 ------------YTLDASDDFNEISVTTHAFGTVPITEFL-------NNVDGIGDYETELYLIDLYDSAESDTANHMSDM 275 (501) T ss_pred ------------EEEEeCCceeeccccccCCCcccEEEec-------CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHh Confidence 1111112222223345556777877542 234578999999999999999999999999988 Q ss_pred CCCceEecccccCChHHHHhhcccCCCceEEec-ccc----cCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHH Q lcl|Aclame:pro 381 PKAPFIGSEGNVEGREDEWEQANTKNFSLLTYI-PQY----QGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDAS 455 (711) Q Consensus 381 ~~~~~~~~~~av~~~~~~~~~~~~~~~~~i~~~-~~~----~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~ 455 (711) .++.+++......+..+..... .. ...+.+. ++. ..+..++++..+.-...+..+++.....|-.+|++++.+ T Consensus 276 ~~~~~v~~g~~~~~~~~~~~~~-~~-~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~ 353 (501) T protein:vir:27 276 ADAILAIYGDLALPKGMQASDM-KR-TRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNIPDMS 353 (501) T ss_pred cCceeeeecCccCCcccchhhh-hh-cCceeecccccccCCCCCcceeeeeccCCHHHHHHHHHHHHHHHHHHhCCcccC Confidence 8877776432332222222211 22 2333332 221 122346666666666677888899999999999999888 Q ss_pred hccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhcc Q lcl|Aclame:pro 456 LGAMGNETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESG 535 (711) Q Consensus 456 ~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g 535 (711) .|..+++.||.|+..+............+.|..+++++.++++.++.... .....++ T Consensus 354 ~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~----------~~~~~d~------------- 410 (501) T protein:vir:27 354 DTNFSGNTSGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVN----------EFKDFDE------------- 410 (501) T ss_pred ccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc----------ccccccc------------- Confidence 87666678999998876666666666667777777777776665543221 1001111 Q ss_pred ceeeeehhhheeeeEEeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCC-cchHHHHHHHHhhhcchhhcc Q lcl|Aclame:pro 536 EWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDW-PGADVIAERLKKIVPPNVLSK 614 (711) Q Consensus 536 ~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~-~~~~e~~~~l~~~~~~~~~~~ 614 (711) .+|.+.=.+..+....+..+.+..+...++. ..+++.+++ .+.++-.+++++........ T Consensus 411 ------------~~i~v~f~~~~p~n~~e~ad~~~kl~g~iS~------et~l~~l~~v~D~~~E~eri~~E~~e~~~~- 471 (501) T protein:vir:27 411 ------------SLLKITFTPNLPKSLNEQVSILTGLGGQVSQ------ETALSLSGLVESPNEELDKINKEVSEIDFK- 471 (501) T ss_pred ------------ccceEEeCCCCCcCHHHHHHHHHHHhccCcH------HHHHHhCCCCCCHHHHHHHHHHHHHhhhHh- Confidence 0122222333333333444444444433222 233344433 22222222232211100000 Q ss_pred hhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 615 DEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQLETEE 668 (711) Q Consensus 615 ~~~~~~~~~~~~~q~~~~~~q~~~~~~q~~~~k~qae~~~aqae~~~~q~~~~~ 668 (711) . ....+.... ....+.. .......+-++-+ T Consensus 472 -------------------~--~~~~~~~~~-~~~~d~~--~~~~~d~~e~~~~ 501 (501) T protein:vir:27 472 -------------------G--YSNDFNEHV-GKYTDEV--KETHTDDFERAYE 501 (501) T ss_pred -------------------h--hcCcccccc-ccccCCC--CCCccccccccCC Confidence 0 000000000 0000000 0000000000000 No 38 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=99.86 E-value=1.4e-21 Score=134.92 Aligned_cols=478 Identities=11% Similarity=0.025 Sum_probs=236.9 Q ss_pred CCcCCCCCCCCccc-------CCCccc------CCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCH Q lcl|Aclame:pro 1 MAKKQKKSRVEQLY-------AKKAKV------YAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPS 67 (711) Q Consensus 1 ~~~~~~~~~~~~~~-------~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~ 67 (711) |.++-.+..-+... -+++-. -........+.+..+.+.+ ...-+.+..+-.+||.|+++.- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~h------~~~~~~rl~~l~~yY~g~~~~i 74 (502) T protein:vir:48 1 MMEQTLFTDSTGQDLVLNLRFHRESRIRYRADNLEELMVNNWELLKNFINHH------KLRQAPRIQELLDYARGENHDV 74 (502) T ss_pred CceeEEEEecchhHHHhhcccChhHHhhhcccchhhhccccHHHHHHHHHHH------HHHHHHHHHHHHHHhcCCCccc Confidence 55543222211110 011100 0000111112222222221 1122334566789999987643 Q ss_pred HHHHHHHHhCCC--ceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHH Q lcl|Aclame:pro 68 QVRTERELEQRP--CLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGL 145 (711) Q Consensus 68 ~~~~~~~~~g~p--~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~ 145 (711) .........+++ .+++|..+.+|+..+|+.-.+.+.+.+. |....+.+... T Consensus 75 ~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~~---------------------------d~~~~~~~~~~ 127 (502) T protein:vir:48 75 LKSGRRKDNEMADKRAVHNYGRMISKFKTGYLAGNPIRVEYD---------------------------DNEDNSQNDDA 127 (502) T ss_pred cccccccccccccceeecchHHHHHHHHhhhhcccCeeEecC---------------------------CccchhHHHHH Confidence 222222333443 5788999999999999988777766542 11122334555 Q ss_pred HHHHHhhcCHHHHHHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCcccee--eCCCccccCccccceeeeeec Q lcl|Aclame:pro 146 IKNIEYNCDAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDT 223 (711) Q Consensus 146 ~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~ 223 (711) +..+++.|+++.....+..+++++|.||+.++.+. ++++++..+ +|.+++ ||+... .+. .++++.| T Consensus 128 l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~de------dg~~~i~~~-~p~~~~~vydd~~~----~~~-~~~ir~~ 195 (502) T protein:vir:48 128 IKRIGRINDIDTHNRNLIRDLSQTGRAYEVIYRSE------YDETRIKRL-SPLETFVIYDNSLE----DNS-IAAVRYY 195 (502) T ss_pred HHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCC------CCceEEEEE-cccceEEEEcCCCC----Cce-EEEEEEE Confidence 66778889999999999999999999998777642 256777776 677765 443221 011 1222221 Q ss_pred CCHHHHHHhcCCcccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhh Q lcl|Aclame:pro 224 MSKEKFKALYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVR 303 (711) Q Consensus 224 ~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 303 (711) .. . ...+.+.++++|..... T Consensus 196 ~~----------~----------------~~~~~~~~~~iyt~~~i---------------------------------- 215 (502) T protein:vir:48 196 NR----------G----------------TLQNAKDVVEIYTNQHI---------------------------------- 215 (502) T ss_pred EE----------e----------------ecCCcEEEEEEEeCCeE---------------------------------- Confidence 10 0 00122334455433211 Q ss_pred hcccceEEEEEEEEecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCC Q lcl|Aclame:pro 304 TRKVKTFKTYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKA 383 (711) Q Consensus 304 ~~~~~~~~v~~~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~ 383 (711) +.+...|...+....|.+.|.+|+|+|. +...+.|.+..+++.++.+|..+|.+...+...+.+ T Consensus 216 ---------~~~~~~~~~~~~~~~~~~~g~vPvv~~~-------nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~ 279 (502) T protein:vir:48 216 ---------YTLDASDSFNEISVTPHAFGTVPITEFL-------NNADGIGDYETELYLIDLYDSAESDTANHMSDMADA 279 (502) T ss_pred ---------EEEEeCCceeeccceecCCCccceEEec-------CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCc Confidence 1111112222333445566777877542 234578999999999999999999999999988888 Q ss_pred ceEecccccCChHHHHhhcccCCCceEEecc-----cccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhcc Q lcl|Aclame:pro 384 PFIGSEGNVEGREDEWEQANTKNFSLLTYIP-----QYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGA 458 (711) Q Consensus 384 ~~~~~~~av~~~~~~~~~~~~~~~~~i~~~~-----~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~ 458 (711) .+++......+.++... ..+....+...+ +...+..++++..+.-..++...++.....|-..|++++.+.|. T Consensus 280 ~lv~~g~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~ 357 (502) T protein:vir:48 280 ILAIYGDLALPQGMQAS--DMKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNKDIHVFTNTPDMSDNH 357 (502) T ss_pred eeeeecCcccccccchh--hhhhcceeeccccccccccccCcceeEeeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccc Confidence 77765332222221111 111222333221 11123346666655555677778899999999999999888876 Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhcccee Q lcl|Aclame:pro 459 MGNETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWV 538 (711) Q Consensus 459 ~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~ 538 (711) -+++.||.|+..+............+.|..+++++.++++.++... +.....++. T Consensus 358 ~~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~----------~~~~~~d~~--------------- 412 (502) T protein:vir:48 358 FSGNASGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLV----------NEFKDFDES--------------- 412 (502) T ss_pred cccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc----------ccccccccc--------------- Confidence 6667899999887766666666666666667777666666554322 111111110 Q ss_pred eeehhhheeeeEEeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCC-cchHHHHHHHHhhhcchhhcchhh Q lcl|Aclame:pro 539 TIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDW-PGADVIAERLKKIVPPNVLSKDER 617 (711) Q Consensus 539 ~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~-~~~~e~~~~l~~~~~~~~~~~~~~ 617 (711) ++.+.=.+..+....+..+.+..+...++. ..+++.+++ ...++-.+++.+...+........ T Consensus 413 ----------~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~------et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~ 476 (502) T protein:vir:48 413 ----------RLKITFTPNLPKSLYEQVSILNDLGGQVSQ------ETALSLSGLVENPTEELDKINEESSKIDFKGYPS 476 (502) T ss_pred ----------cceEEeCCCCCcCHHHHHHHHHHHhccCcH------HHHHHhCCCCCCHHHHHHHHHHHHHhhhhhcccc Confidence 111111223232233344444444333332 233444443 222222222222111000000000 Q ss_pred hhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHH Q lcl|Aclame:pro 618 EAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEADTA-QAQADMLKAQLETEE 668 (711) Q Consensus 618 ~~~~~~~~~~q~~~~~~q~~~~~~q~~~~k~qae~~-~aqae~~~~q~~~~~ 668 (711) ....... ...+.. ....+ ....+-+ T Consensus 477 ----------------------~~~~~~~-~~~d~~~e~~~~---~~~~~~~ 502 (502) T protein:vir:48 477 ----------------------YFYDNVG-KYTDEVKETHTD---DFERVYE 502 (502) T ss_pred ----------------------ccccccc-ccCCCccCCCCc---CcCCCCC Confidence 0000000 000000 00000 0000000 No 39 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=99.86 E-value=1.1e-20 Score=130.05 Aligned_cols=480 Identities=12% Similarity=0.031 Sum_probs=236.4 Q ss_pred CCc--------------CCCCCCCCcccCCCcccCCcC---cchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCC Q lcl|Aclame:pro 1 MAK--------------KQKKSRVEQLYAKKAKVYAKN---NDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGE 63 (711) Q Consensus 1 ~~~--------------~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~ 63 (711) |-| ++.+++...+.- ..+.. ..++.+.+..+...+ . ...+....+-.+||.|. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~e~~~~~~~~~i~~~i~~~---~---~~~~~r~~~l~~YY~g~ 70 (512) T protein:vir:97 1 MLKANEFETDTDLRENRNYLFNDEANVVY----TYDGTESDLLQNINEVSKYIEHH---M---DYQRPRLKVLSDYYEGK 70 (512) T ss_pred CccceeccCceeeeeCceeeecccccccc----ccCchhhhhhhhHHHHHHHHHHH---H---HhhHHHHHHHHHHhccc Confidence 332 222222111100 00000 011112222222221 1 12234456678899998 Q ss_pred CCCHHHHHHHHHhCCC--ceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHH Q lcl|Aclame:pro 64 QWPSQVRTERELEQRP--CLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEV 141 (711) Q Consensus 64 Qw~~~~~~~~~~~g~p--~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~ 141 (711) +..-.........+++ .+++|..+.+|+..+|+.-.+.+.+.. ++.+ T Consensus 71 ~~i~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~---------------------------~d~~---- 119 (512) T protein:vir:97 71 TKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQD---------------------------DDKD---- 119 (512) T ss_pred CccccccCcccccccCcceeecchHHHHHHHHhhhhcccCceecc---------------------------CChH---- Confidence 7642222222223333 478899999999999987666544421 2222 Q ss_pred HHHHHHHHHhhcCHHHHHHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCcccee--eCCCccccCccccceee Q lcl|Aclame:pro 142 FTGLIKNIEYNCDAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCL 219 (711) Q Consensus 142 l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~ 219 (711) ....+..+++.|+++.....+..+++++|.+|..++.+. ++++++..+ +|.++| ||+... .-...+ T Consensus 120 ~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~de------d~~~~i~~~-~p~~~~~iyd~~~~-----~~~~~~ 187 (512) T protein:vir:97 120 VLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQ------DDETRLYKS-DAMSTFVIYDNTIE-----RNSIAG 187 (512) T ss_pred HHHHHHHHHhhcCHHHHHHHHHHHHHhcCeEEEEEEeCC------CCceEEEEE-cccceEEEEcCCCC-----CceEEE Confidence 245577778889999999999999999999998777642 256788766 787776 565432 112233 Q ss_pred eeecCCHHHHHHhcCCcccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCc Q lcl|Aclame:pro 220 IDDTMSKEKFKALYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGI 299 (711) Q Consensus 220 ~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 299 (711) ++.|.+.. ......+.+..+++|....... +....+.... T Consensus 188 vr~~~~~~----------------------~~~~~~~~~~~~~vyt~~~i~~--~~~~~~~~~~---------------- 227 (512) T protein:vir:97 188 VRYLRTKP----------------------IDKTDEDEVFTVDLFTSHGVYR--YLTSRTNGLK---------------- 227 (512) T ss_pred EEEEEeee----------------------ccccccceEEEEEEEeCCcEEE--EEecCCCccc---------------- Confidence 33332100 0001123445556665442211 0001110000 Q ss_pred hhhhhcccceEEEEEEEEecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 300 SIVRTRKVKTFKTYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVAL 379 (711) Q Consensus 300 ~~~~~~~~~~~~v~~~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~ 379 (711) .......+.|.+.+.+|+++|. +...+.|.+..++++++.+|...|.+.+.+.. T Consensus 228 -------------------~~~~~~~~~~~~~g~vPvv~~~-------nn~~~~gd~e~v~~liDa~d~~~S~~~~~~~~ 281 (512) T protein:vir:97 228 -------------------LTPRENGFESHSFERMPITEFS-------NNERRKGDYEKVITLIDLYDNAESDTANYMSD 281 (512) T ss_pred -------------------ccccccccccccCcccceEeec-------CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHH Confidence 0000123455667777877642 23446799999999999999999999999988 Q ss_pred cCCCceEecccccCChHHHHhhcccCCCceEEecc----------cccCcCCccccCCccchHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 380 APKAPFIGSEGNVEGREDEWEQANTKNFSLLTYIP----------QYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTM 449 (711) Q Consensus 380 ~~~~~~~~~~~av~~~~~~~~~~~~~~~~~i~~~~----------~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~t 449 (711) .+++.+++.-....+..+... .+.+.++...+ +...++.+.++..+.-..++...+......|-.+| T Consensus 282 ~~~~~lv~~G~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s 358 (512) T protein:vir:97 282 LNDAMLLIKGNLNLDPVEVRK---QKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFT 358 (512) T ss_pred hcCceeeeecCccCCchhhhh---hhhcccccccccchhhcccccCCCCCcceEEEeecCCHHHHHHHHHHHHHHHHHHh Confidence 888877664322222222111 11222222211 11222345666665566777888999999999999 Q ss_pred CCCHHHhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhh Q lcl|Aclame:pro 450 GMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQI 529 (711) Q Consensus 450 Gv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~ 529 (711) ++++.+.|..+++.||.|+..+............+.|..++++++++++.++...-.. ....++. T Consensus 359 ~~p~~~~~~~~gn~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~li~~~~~~~~~~---------~~~~d~~------ 423 (512) T protein:vir:97 359 NTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSI---------DANKDFN------ 423 (512) T ss_pred CCcccCcccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc---------ccccccc------ Confidence 9999888766667899999888776666666777777777777777766654322100 0011110 Q ss_pred hhhhccceeeeehhhheeeeEEeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCC-cchHHHHHHHHhhhc Q lcl|Aclame:pro 530 FDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDW-PGADVIAERLKKIVP 608 (711) Q Consensus 530 ~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~-~~~~e~~~~l~~~~~ 608 (711) ++.+.=.+..+....+..+.+..+...++. ..+++.+++ .+.++-++++.+... T Consensus 424 -------------------~i~~~f~~~~p~~~~e~~~~~~kl~giiS~------et~~~~l~~v~d~~~E~eri~~E~~ 478 (512) T protein:vir:97 424 -------------------TVRYVYNRNLPKSLIEELKAYIDSGGKISQ------TTLMSLFSFFQDPELEVKKIEEDEK 478 (512) T ss_pred -------------------cceEEeCCCCCcCHHHHHHHHHHHhccCch------HHHHHhCCCCCCHHHHHHHHHHHHH Confidence 111222223333333334444444333332 223344433 222222332322211 Q ss_pred chhhcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 609 PNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEADTAQ 654 (711) Q Consensus 609 ~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~~~~k~qae~~~ 654 (711) ............... ......+....+-.++.++ T Consensus 479 ~~~~~~~~~~~~~~~------------~~~~~~~~~~~~~~~~~~~ 512 (512) T protein:vir:97 479 ESIKKAQKGIYKDPR------------DINDDEQDDDTKDTVDKKE 512 (512) T ss_pred HHHHHHhhcccCCCC------------CCCCCCCCCCccccccccC Confidence 000000000000000 0000000000000000000 No 40 >protein:vir:94572 Length: 535 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919010;genbank:gi:119637774;genbank:GeneID:5179332 Probab=99.85 E-value=5.6e-19 Score=120.63 Aligned_cols=520 Identities=13% Similarity=0.065 Sum_probs=263.8 Q ss_pred CCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCCceEehhhHHHHHHHhhhhhh- Q lcl|Aclame:pro 21 YAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRPCLVNNVLPTFVDQVLGDQRQ- 99 (711) Q Consensus 21 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~v~~i~g~~~~- 99 (711) +..++.....--+.+..+|+...+....|...|.++.+|..-.=.+.+.- ......+.+.-..-...++.+.+..-. T Consensus 1 ~~~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~--~~~~~~~~~~dst~~~a~~~Laa~l~~~ 78 (535) T protein:vir:94 1 MASSQKREGFAENGAKAVYDALKNDRNSYETRAENCAKYTIPSLFPKDSD--NASTDYTTPWQAVGARGLNNLASKLMLA 78 (535) T ss_pred CCchhhhhhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCC--ccccccCCcccccHHHHHHHHHHHHHhh Confidence 33333333223345777888888888889999999999974211111000 001111122222222333333222111 Q ss_pred ---cccceeEecchhhhhhhhhcccccccccccCCCchh-HHHHHHH---HHHHHHHHhhcCHHHHHHHHHHHHHhcCcc Q lcl|Aclame:pro 100 ---NRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKND-YELAEVF---TGLIKNIEYNCDAETEYDIAFQGAVESGMG 172 (711) Q Consensus 100 ---~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d-~~~Ae~l---~~~~~~~~~~~~~~~~~~~a~~~~~~~G~g 172 (711) +++=.++.+.+.. .... ..++.+ .+..+.| +..+......|+|......++.+.+..|+| T Consensus 79 ltP~~~WF~l~~~d~~------------~~~~-~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a 145 (535) T protein:vir:94 79 LFPMQTWMKLTISEFE------------AKQL-VAQPAELAKVEEGLSMVERILMNYIESNSYRVTLFETLKQLVVAGNA 145 (535) T ss_pred hcCCCCccccccChhh------------hhcc-ccchhHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcE Confidence 1221122221100 0000 000011 1233333 334445556899999999999999999999 Q ss_pred EEEEEEeeccCCCCCCcceEEEecCccceeeCCCccccCccccceeeeeecCCHHHHHHhcCCcccchhhcccccccccC Q lcl|Aclame:pro 173 YLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSVADYDTW 252 (711) Q Consensus 173 ~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~ 252 (711) ++.+..+. +..+.++.+ +..++++..++.- ...-++++..++.+.+-..|++.... ... . T Consensus 146 ~l~~~~~~------~~~~~f~~~-pl~~y~v~~d~~G----~vd~i~r~~~~~~~~l~~~~~~~~~~----~~~-----~ 205 (535) T protein:vir:94 146 LLYIPEPE------GTYNPMKLY-RLSSYVVQRDAFG----TVLQIVTLDKTAYAALPEDVRNSMDS----SQE-----H 205 (535) T ss_pred eEeeccCc------CcccceEEE-EcCeEEEeeCCCC----CeEEEEeeeeccHHHhhHHHHHHHHh----ccc-----c Confidence 87553321 112345555 5677887654431 24456788899999998877653211 111 1 Q ss_pred CCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEEecCceeccCccCCCC Q lcl|Aclame:pro 253 FTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGPVEIPST 332 (711) Q Consensus 253 ~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~le~~~p~~~~ 332 (711) ...+.|.++++-+++... + .+.+| +...|..+....+.++.. T Consensus 206 ~~~~~v~v~~~v~~~~~~--------~-----------------------------~~~~~-~e~~g~~~~~~~~~~g~~ 247 (535) T protein:vir:94 206 KGDEMIDVYTHIYLDEES--------G-----------------------------EYLKY-EEIDGVEVEGTDASYPVD 247 (535) T ss_pred CCCceeEEEEEEEeeCCC--------C-----------------------------cEEEE-EEecCeeeccccccCccc Confidence 123456666654433210 0 11122 222343332233556788 Q ss_pred ccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHHHHhhcccCCCceEEe Q lcl|Aclame:pro 333 TIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNFSLLTY 412 (711) Q Consensus 333 ~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~~~~~~~~~~i~~ 412 (711) .+||+++- +...++..||.|.+....+-.+.+|.+...++.....+.++.++++++.+.+...+. ...+|.++.. T Consensus 248 ~~P~~~~R--w~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~---~~~~g~~v~g 322 (535) T protein:vir:94 248 ACPYIPVR--MVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQVRRLT---KAQTGDFVSG 322 (535) T ss_pred cCCceeee--eeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccccchhhcc---cCCCceeecC Confidence 89999764 445688999999999999999999999999999999999999999988887765432 2345555433 Q ss_pred cccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHHHHHHHHHHHHHHHHH-HHH Q lcl|Aclame:pro 413 IPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTK-SIR 491 (711) Q Consensus 413 ~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~-~~~ 491 (711) . ..+..+.......-.+....+++...+.|.... ..+.+...++.+.|+.-|..+.+.....+...+.+|.. +.. T Consensus 323 ~---~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af-~~~~~~~~d~~rvTAtEV~~r~~E~~~~LGpv~~rl~~ElL~ 398 (535) T protein:vir:94 323 R---PEDISFLQLEKAADFSVARAVSEQIEGRLSYAF-MLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQL 398 (535) T ss_pred C---cccceeeecccccchhHHHHHHHHHHHHHHHHH-hHhhhccCCCCCccHHHHHHHHHHHHHHhhhHHHHHHHHHHH Confidence 2 222223334444455556777777777777655 22222235566789999999998888888888888753 666 Q ss_pred HHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEeecccChHHHHHHHHHHHHH Q lcl|Aclame:pro 492 RVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQ 571 (711) Q Consensus 492 ~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~~ 571 (711) .+.+..+.++.+.---+ .+-. ++ .++.+.. +-....|.+..+.|.+ T Consensus 399 Pli~r~~~il~r~g~lP---------------~~p~--------------~~----v~~~~vs-~la~l~r~~~~~~l~~ 444 (535) T protein:vir:94 399 PMVRVLLKQLQATNQIP---------------ELPK--------------EA----VEPTIST-GMEALGRGQDLDKLER 444 (535) T ss_pred HHHHHHHHHHHhCCCCC---------------CCCh--------------hh----ccceEee-hHHHHHHHHHHHHHHH Confidence 66666666654431000 0000 00 1222222 2233455555566666 Q ss_pred HHhhcchhHHHHHHHHHHhcC-CcchHHHHHHHHhhhcch--hhcchhhhhhhhhHHHHHHHHHHHHHHHHHHH---HHH Q lcl|Aclame:pro 572 FAQAVPSAAAVMADLIAQNMD-WPGADVIAERLKKIVPPN--VLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQ---ADM 645 (711) Q Consensus 572 l~~~~p~~~~~~~~~~~~~~~-~~~~~e~~~~l~~~~~~~--~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q---~~~ 645 (711) +++.+.++.+ +.++ .-+.+++.+.+....+.. .....+.+..+.. +++++++ +++.++.++. ..+ T Consensus 445 ~~~~laq~~P-------~~ld~~id~d~~~~~~a~~~Gvp~~~i~rs~eev~~~~-~q~~~~~-~~~~~~~~~g~~~~~~ 515 (535) T protein:vir:94 445 CIAAWSALAP-------MQGDPDINIATIKLRIANAIGIDTSGILKTPEEKQQEM-AEAAQGT-AMQNAAASAGAGAGTM 515 (535) T ss_pred HHHHHHhhCh-------HHhhhcCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHH-HHHHHHH-HHHHHHHHHHHhhhcc Confidence 6554322222 2222 346677777776665543 1222221111111 1111110 0000000000 000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 646 AQAEADTAQAQADMLKAQLETEEAQKQLAMIEDMAQG 682 (711) Q Consensus 646 ~k~qae~~~aqae~~~~q~~~~~~q~q~~~~~~~~q~ 682 (711) .++.++.+.+.+ .++. +.+. T Consensus 516 ~~~~~~~~~~~~-------------~~~g----~~~~ 535 (535) T protein:vir:94 516 ATASPENMKAAA-------------AQAG----MAPN 535 (535) T ss_pred cccChHHHHHHH-------------HHhc----cCCC Confidence 000000000000 0111 1111 No 41 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=99.85 E-value=1.3e-20 Score=129.52 Aligned_cols=427 Identities=13% Similarity=0.087 Sum_probs=223.6 Q ss_pred HHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCC--ceEehhhHHHHHHHhhhhhhcccceeEec Q lcl|Aclame:pro 31 LLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRP--CLVNNVLPTFVDQVLGDQRQNRPAIKVSS 108 (711) Q Consensus 31 ~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p--~~~~N~i~~~v~~i~g~~~~~r~~~~~~p 108 (711) +|.. | ....+....+..+||.|+|+.-.........++| .+.+|..+.+|+..+|+.-.+.+.+.+. T Consensus 1 ~~~~----~------~~~~~~r~~~l~~yy~g~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~- 69 (440) T protein:vir:95 1 MLAA----F------LGSQKQRLAILASYAQGDNFSILSGHRRLDDEKADYRVRHKWGGYISSFATGYVIGNPVSIGVM- 69 (440) T ss_pred Chhh----H------HHHHHHHHHHHHHHhccCCcccccccccccccCCcceeecchHHHHHHhhhhheeccCceEeeC- Confidence 1111 1 1234556677789999998753333333344444 5788999999999999876666555432 Q ss_pred chhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCccEEEEEEeeccCCCCCC Q lcl|Aclame:pro 109 TEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQ 188 (711) Q Consensus 109 ~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~ 188 (711) +.++.+.. ..+..+++.|+++.....+.++++++|.+|..++.+. ++ T Consensus 70 -----------------------~~~~~~~~----~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~------~~ 116 (440) T protein:vir:95 70 -----------------------EGGSADQL----STIKDIEWQNDINALNSDLAFDASVYGRAYEYHFRDK------DK 116 (440) T ss_pred -----------------------CCccHHHH----HHHHHHHHhcCHhHHHHHHHHHHhhcCeEEEEEEecC------CC Confidence 12333332 2356678889999999999999999999998887642 25 Q ss_pred cceEEEecCcccee--eCCCccccCccccceeeeeecCCHHHHHHhcCCcccchhhcccccccccCCCCCeEEEEEeeee Q lcl|Aclame:pro 189 DLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTR 266 (711) Q Consensus 189 ~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~ 266 (711) ++.+..+ +|.+++ ||+.... .. ..+++.|...+ ...+++|.. T Consensus 117 ~~~i~~~-~p~~~~~~~d~~~~~----~~-~~~i~~~~~~~------------------------------~~~~~vyt~ 160 (440) T protein:vir:95 117 VDRVVLI-SPLEMFVIRDLTVEQ----NI-IAAVHLPIYAD------------------------------KVNMTVYTK 160 (440) T ss_pred ceEEEEE-cccceEEEEcCCCCC----ce-EEEEEEEEecC------------------------------ceEEEEEeC Confidence 6777776 687776 4553311 11 22233332100 001223322 Q ss_pred eeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEEecCceeccCccCCCCccceEEEEeeeecc Q lcl|Aclame:pro 267 EPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLII 346 (711) Q Consensus 267 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~ 346 (711) ....... ...++ .+...+.++.|.+.+.+|+|+|. T Consensus 161 ~~~~~~~-~~~~~--------------------------------------~~~~~~~~~~~~~~g~vPvv~~~------ 195 (440) T protein:vir:95 161 DKVITYK-PYSNN--------------------------------------SVRLVVDDVKKHSYNDVPVVEWW------ 195 (440) T ss_pred CeEEEEE-EecCC--------------------------------------ccceeecceeeccCceeeEEEee------ Confidence 1110000 00000 01112233445555666766542 Q ss_pred CCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCC---hHHHHhhcccCCCceEEeccc-----ccC Q lcl|Aclame:pro 347 KKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEG---REDEWEQANTKNFSLLTYIPQ-----YQG 418 (711) Q Consensus 347 ~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~---~~~~~~~~~~~~~~~i~~~~~-----~~~ 418 (711) +...|.|.+..+++.++.+|..+|.+...+.....+.+++. |.... ..+.... .+..+.+..... ... T Consensus 196 -n~~~g~sd~e~v~~lida~~~~~s~~~~~~~~~~~~~~v~~-g~~~~~~~~~e~~~~--~~~~~~~~~~~~~~~~~~~~ 271 (440) T protein:vir:95 196 -NNRFRMGDYESEISLIDAYDAGQSDTANYMSDLNDAMLLVK-GDLDGIKLSPEDAAK--MKDANMLFLKTGISTTGQQT 271 (440) T ss_pred -CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcceeeee-cccccCCCCccchhh--hhhccceecccccccccCCC Confidence 22346799999999999999999999999988888776653 32110 1111111 111122222111 112 Q ss_pred cCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 419 DPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILV 498 (711) Q Consensus 419 ~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l 498 (711) ++.++++..+.-..++...++.....|...|++++.+.+.-+++.||.|+..+..............|..+.+++++++. T Consensus 272 ~~~~~~lt~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~ 351 (440) T protein:vir:95 272 TADASYIYKQYDVNGTEAYKNRLANDIHRFSRIPNLDDDRFNSTSSGIALLYKMIGLEQVRKDKETYFTKALRRRYELIS 351 (440) T ss_pred CcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 23466666665567788889999999999999999887766667899999887666666666667777777777777665 Q ss_pred HHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEeecccChHHHHHHHHHHHHHHHhhcch Q lcl|Aclame:pro 499 EMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAVPS 578 (711) Q Consensus 499 ~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~~p~ 578 (711) .++..... ...++ .++.+.=.+..+....+..+.+..+...++. T Consensus 352 ~~~~~~~~-----------~~~~~-------------------------~~v~i~f~~~~p~~~~~~ad~~~kl~g~iS~ 395 (440) T protein:vir:95 352 NIHKAING-----------PVIEA-------------------------NKLTFTFHPNIPQDVWTEIKAYIEAGGEISQ 395 (440) T ss_pred HHHhhcCC-----------ccccc-------------------------ccceEEeCCCCCCCHHHHHHHHHHHhccCcH Confidence 55432210 00000 1222222333333233344444444333221 Q ss_pred hHHHHHHHHHHhcCCcc-hHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 579 AAAVMADLIAQNMDWPG-ADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEAD 651 (711) Q Consensus 579 ~~~~~~~~~~~~~~~~~-~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~~~~k~qae 651 (711) ..+++.+++-. ..++.+ +..... ....+.+.... ...-...++| T Consensus 396 ------et~~~~l~~~d~~~E~~r-i~~E~~--------------------~~~~~~~~~~~--~~~~~~~~~e 440 (440) T protein:vir:95 396 ------ETLMENASFTDYKTEHSR-ILKQGG--------------------SSDLEIGQIVG--DADVGQADTE 440 (440) T ss_pred ------HHHHHhCCCCCcHHHHHH-HHHHHH--------------------HhhhhHHhhcc--CCCCCCcCCC Confidence 22333433322 112111 111000 00000000000 0000000011 No 42 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=99.84 E-value=2e-20 Score=128.63 Aligned_cols=484 Identities=12% Similarity=0.049 Sum_probs=238.9 Q ss_pred CCcCC-------CCCCCCcccCCCcccCCc------CcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCH Q lcl|Aclame:pro 1 MAKKQ-------KKSRVEQLYAKKAKVYAK------NNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPS 67 (711) Q Consensus 1 ~~~~~-------~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~ 67 (711) |-|-- ..-++..-..+++..... +...+.+.+..+.+.+. ...+....+..+||.|.|..- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~------~~~~~r~~~l~~Yy~g~~~i~ 74 (511) T protein:vir:96 1 MLKVNEFETDTDLRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHM------DYQRPRLKVLSDYYEGKTKNL 74 (511) T ss_pred CccccchhhhhhhhhhhhhhhhhhhCCccccchhhhhhhccHHHHHHHHHHHH------HhhHHHHHHHHHHhcccCccc Confidence 22210 111111111111111000 01111222333332211 223455667889999987643 Q ss_pred HHHHHHHHhCCC--ceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHH Q lcl|Aclame:pro 68 QVRTERELEQRP--CLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGL 145 (711) Q Consensus 68 ~~~~~~~~~g~p--~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~ 145 (711) .........++| .+++|..+.+|+..+|+.-.+.+.+.. ++.+ .... T Consensus 75 ~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~---------------------------~~~~----~~~~ 123 (511) T protein:vir:96 75 VELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQD---------------------------DDKD----VLEA 123 (511) T ss_pred cccCcCcccccCcceeecchHHHHHHHHHhhhccCCceeec---------------------------CchH----HHHH Confidence 222222333333 578899999999999998777666532 2222 2346 Q ss_pred HHHHHhhcCHHHHHHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCcccee--eCCCccccCccccceeeeeec Q lcl|Aclame:pro 146 IKNIEYNCDAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDT 223 (711) Q Consensus 146 ~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~ 223 (711) +..+++.|+++.....+..++.++|.+|..++.+. ++++++..+ +|.+++ ||.... .-..++++.+ T Consensus 124 l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~de------d~~~~i~~~-~p~~~~~vydd~~~-----~~~~~~vr~~ 191 (511) T protein:vir:96 124 IEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQ------DDETRLYKS-DAMSTFVIYDNTIE-----RNSIAGVRYL 191 (511) T ss_pred HHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEeCC------CCceEEEEE-ccceeEEEEcCCCC-----CceEEEEEEE Confidence 77778889999999999999999999998877642 256788776 787777 443221 1122333333 Q ss_pred CCHHHHHHhcCCcccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhh Q lcl|Aclame:pro 224 MSKEKFKALYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVR 303 (711) Q Consensus 224 ~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 303 (711) .... ......+.+..+++|....... +....+.... T Consensus 192 ~~~~----------------------~d~~~~~~~~~~~iyt~~~i~~--~~~~~~~~~~-------------------- 227 (511) T protein:vir:96 192 RTKP----------------------IDKTDEDEVFTVDLFTSHGVYR--YLTSRTNGLK-------------------- 227 (511) T ss_pred Eeee----------------------ccccccceEEEEEEEeCCcEEE--EEecCCCccc-------------------- Confidence 1100 0001123444455554432111 0000000000 Q ss_pred hcccceEEEEEEEEecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCC Q lcl|Aclame:pro 304 TRKVKTFKTYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKA 383 (711) Q Consensus 304 ~~~~~~~~v~~~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~ 383 (711) .......+.|.+.+.+|+++|. +...+.|.+..++++++.+|...|.+.+.+...+++ T Consensus 228 ---------------~~~~~~~~~~~~~~~vPvv~~~-------nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~ 285 (511) T protein:vir:96 228 ---------------LTPRENGFESHSFERMPITEFS-------NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDA 285 (511) T ss_pred ---------------ccccccccccccCCceeeEEec-------CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCc Confidence 0000113345566777776542 233578999999999999999999999999888777 Q ss_pred ceEecccccCChHHHHhhcccCCCceEEecc---------cccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHH Q lcl|Aclame:pro 384 PFIGSEGNVEGREDEWEQANTKNFSLLTYIP---------QYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDA 454 (711) Q Consensus 384 ~~~~~~~av~~~~~~~~~~~~~~~~~i~~~~---------~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~ 454 (711) .+++......+..+. . ....+.++.+.+ ....+..+.++..+.-..++...+......|..+|++++. T Consensus 286 ~lv~~g~~~~~~~~~-~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~ 362 (511) T protein:vir:96 286 MLLIKGNLNLDPVEV-R--KQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNM 362 (511) T ss_pred eeeeecCccCCchhh-c--ccccccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccc Confidence 776643233332221 1 112223333322 1122344666666666777888889999999999999998 Q ss_pred HhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhc Q lcl|Aclame:pro 455 SLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEES 534 (711) Q Consensus 455 ~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~ 534 (711) +.+.-+++.||.|+..+..............|..++++++++++.++...... ....++. T Consensus 363 ~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~---------~~~~d~~----------- 422 (511) T protein:vir:96 363 KDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTWSI---------DANKDFN----------- 422 (511) T ss_pred ccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCc---------ccccccc----------- Confidence 87766667899999888777777777777777778777777766654332110 0111111 Q ss_pred cceeeeehhhheeeeEEeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCC-cchHHHHHHHHhhhcchhhc Q lcl|Aclame:pro 535 GEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDW-PGADVIAERLKKIVPPNVLS 613 (711) Q Consensus 535 g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~-~~~~e~~~~l~~~~~~~~~~ 613 (711) ++.+.=.+..+....+..+.+..+...++. ..+++.+++ .+.++-++++.+........ T Consensus 423 --------------~i~~~f~~~~p~n~~e~~~~~~kl~G~iS~------et~l~~l~~v~D~~~E~~ri~~E~~~~~~~ 482 (511) T protein:vir:96 423 --------------TVRYVYNRNLPKSLIEELKAYIDSGGKISQ------TTLMSLFSFFQDPELEVKKIEEDEKESIKK 482 (511) T ss_pred --------------cceEEeCCCCCCCHHHHHHHHHHHhccCCh------HHHHHhCCCCCCHHHHHHHHHHHHHHHHHH Confidence 111212223333233333444444333322 223334432 22222223222211100000 Q ss_pred chhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 614 KDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEADTAQ 654 (711) Q Consensus 614 ~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~~~~k~qae~~~ 654 (711) ........... .....+....+-..+..+ T Consensus 483 ~~~~~~~~~~~------------~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:96 483 AQKGIYKDPRD------------INDDEQDDDTKDTVDKKE 511 (511) T ss_pred HhhccccCCCC------------CCCCCCCCcccccccccC Confidence 00000000000 000000000000000000 No 43 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=99.84 E-value=6.6e-20 Score=125.73 Aligned_cols=452 Identities=12% Similarity=0.070 Sum_probs=231.1 Q ss_pred cCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCC------CHHHH-------HHHHHhCCC--ceEehhhH Q lcl|Aclame:pro 23 KNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQW------PSQVR-------TERELEQRP--CLVNNVLP 87 (711) Q Consensus 23 ~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw------~~~~~-------~~~~~~g~p--~~~~N~i~ 87 (711) .+.....+++. .......+.+....+..+||.|.+= ..... .....+++| .+.+|..+ T Consensus 1 ~~~e~~~~~i~-------~~~~~~~~~~~~~~~~~~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~ 73 (471) T protein:vir:10 1 MEIEVIKKIIS-------SQMVKHGKFVSQAAEAEKYYRNENDIKRKRKPADKKGAENEAKAEDNAFRNADNRISHNWHQ 73 (471) T ss_pred CCHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHhccccccccccchhhhhcccccccccccccccccceeccchhH Confidence 22222222222 2223334456677888999999741 00000 000111222 37889999 Q ss_pred HHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHH Q lcl|Aclame:pro 88 TFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAV 167 (711) Q Consensus 88 ~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~ 167 (711) .+|+..+|+.-.+.+.+. + .+.+..+. +..+.+ |+++.....+..++. T Consensus 74 ~Ivd~~~~yl~G~p~~~~--~-------------------------~~~~~~~~----l~~~~~-n~~~~~~~~~~~~~~ 121 (471) T protein:vir:10 74 LLLDQKKAYALTYPPTFD--V-------------------------DDKKVNDM----IVDVLG-DDYERISKQLCVNAG 121 (471) T ss_pred HHHHhhhhhhcccCceec--c-------------------------CChHHHHH----HHHHHh-cCHHHHHHHHHHHHh Confidence 999999999866544432 1 23333333 343343 789999999999999 Q ss_pred hcCccEEEEEEeeccCCCCCCcceEEEecCcccee--eCCCccccCccccceeeeeecCCHHHHHHhcCCcccchhhccc Q lcl|Aclame:pro 168 ESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYEDS 245 (711) Q Consensus 168 ~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~ 245 (711) ++|.||..++++.. ++++++..+ +|.+++ ||+... +-...+++.|...... T Consensus 122 ~~G~~~~~v~~d~~-----~g~~~~~~~-~p~~~~~i~d~~~~-----~~~~~~ir~~~~~~~~---------------- 174 (471) T protein:vir:10 122 NAGIAWLHVWKDAS-----DNSFRYACV-DSKEVIPIYSKSLD-----KKSIGVLRVYSSIDET---------------- 174 (471) T ss_pred hCCeEEEEEEeeCC-----CCeeEEEEE-cccceEEEEcCCCC-----CceEEEEEEEEeeccC---------------- Confidence 99999988877532 357888877 787765 443221 1222333333221110 Q ss_pred ccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEEecCceecc Q lcl|Aclame:pro 246 VADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEG 325 (711) Q Consensus 246 ~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~le~ 325 (711) ..+.+..+++|...... .+....+.....-.. .. .........|.....+ T Consensus 175 --------~~~~~~~~~vy~~~~~~--~y~~~~~~~~~~~~~----------------~~----~~~~~~~~~~~~~~~~ 224 (471) T protein:vir:10 175 --------DGKNYTVYEYWNDKECS--FYRHEKEKPLEELET----------------FQ----AISLIDTMNGDRSSDN 224 (471) T ss_pred --------CCceeEEEEEEeCCcEE--EEEecCCcccccccc----------------cc----cccccccccccccccc Confidence 12344455555433221 111111111000000 00 0000112234444455 Q ss_pred CccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHHHHhhcccC Q lcl|Aclame:pro 326 PVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTK 405 (711) Q Consensus 326 ~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~~~~~~ 405 (711) +.|.+.+.+|+|+|. +...+.|.+..+++.++.+|..+|.+.+.+...+++.+++.-....+..+.... .+ T Consensus 225 ~~~~~~g~iPvv~~~-------n~~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~--~~ 295 (471) T protein:vir:10 225 SFKHDFGLVPFIPFK-------NNEIETNDLKPIKDLVDVYDKVFSGFVNDTDDVQEVIFVLTNYGGQDKQEFLED--LK 295 (471) T ss_pred cccCCCCceeEEEec-------cCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccchhHHH--hh Confidence 556666777777542 234467999999999999999999999999999888776643222333333222 23 Q ss_pred CCceEEecc-cccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 406 NFSLLTYIP-QYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFID 484 (711) Q Consensus 406 ~~~~i~~~~-~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~d 484 (711) .++.+.+.. +...+..+.++....-..+....++...+.|-..|+..+.+.+..+ +.||.|+..+............. T Consensus 296 ~~~~i~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~tp~~~~~~~g-n~Sg~Alk~~~~~l~~k~~~~~~ 374 (471) T protein:vir:10 296 RYKMIKMDNDGMGDQSGVTTIAIDIPTEARNLILERTKKQIFISGQGVNPETDKLG-NSSGVALKFLYSLLELKAGNMET 374 (471) T ss_pred cCCeEEecCCCCccCccceEEeecCChHHHHHHHHHHHHHHHHHhCCcCCCccccc-CccHHHHHHHHHHHHHHHHHHHH Confidence 344555542 2233445777777766778888999999999999998877666543 46999988876666666666666 Q ss_pred HHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEeecccChHHHHHH Q lcl|Aclame:pro 485 NLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIE 564 (711) Q Consensus 485 n~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~ 564 (711) .|..+++++.++++.++..+ ++. ++.+.=.+..+....+ T Consensus 375 ~~~~~l~~~~~li~~~~~~~----------------d~~-------------------------~i~i~f~~~~p~n~~e 413 (471) T protein:vir:10 375 QFRSGYATLVKMILKHLGLS----------------DKL-------------------------KIKQTWTRNSINNDTE 413 (471) T ss_pred HHHHHHHHHHHHHHHHhccC----------------CCc-------------------------eeEEEeCCCCCCCHHH Confidence 66666666666555543111 000 1111112222222223 Q ss_pred HHHHHHHHHhhcchhHHHHHHHHHHhcCC-cchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 565 AAEAMIQFAQAVPSAAAVMADLIAQNMDW-PGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQA 643 (711) Q Consensus 565 ~~~~L~~l~~~~p~~~~~~~~~~~~~~~~-~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~ 643 (711) ..+.++.+...++ ...+++.+++ .+++.-.+++++.........+..... T Consensus 414 ~~~~~~kl~g~iS------~et~~~~~p~v~D~~~E~eri~~E~~~~~~~~~~~~~~----------------------- 464 (471) T protein:vir:10 414 MAQVVSTLATITS------RENVAKSNPIVEDWQDELRLQKAEQEGRSEKLYDMEEV----------------------- 464 (471) T ss_pred HHHHHHHHhccCc------hHHHHHhCCCCCCHHHHHHHHHHHHHHHHhcccccCCC----------------------- Confidence 3333443332222 1223333332 222222233322111100000000000 Q ss_pred HHHHHHHHHH Q lcl|Aclame:pro 644 DMAQAEADTA 653 (711) Q Consensus 644 ~~~k~qae~~ 653 (711) ..+.+.+ T Consensus 465 ---~~~~e~~ 471 (471) T protein:vir:10 465 ---EHESEVE 471 (471) T ss_pred ---CCccccC Confidence 0000000 No 44 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=99.84 E-value=1.7e-18 Score=117.97 Aligned_cols=511 Identities=12% Similarity=0.023 Sum_probs=259.0 Q ss_pred CCcCCCCCCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCCc Q lcl|Aclame:pro 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRPC 80 (711) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p~ 80 (711) |+++ +-. --+.+..+|+...+....|...|.++.+|..-.=+..+.-... ..... T Consensus 1 ~~~~--------------------~~~---~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~--~~~~~ 55 (522) T protein:vir:94 1 MAER--------------------EGF---AAEGAKAVYDRLKNGRQPYETRAQNCAAVTIPSLFPKESDNSS--TEYTT 55 (522) T ss_pred Cccc--------------------chh---hHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCccc--ccccc Confidence 3331 111 1334556677666677789999999999974321111000000 01111 Q ss_pred eEehhhHHHHHHHhhhhhh----cccceeEecchhhhhhhhhcccccccccccCCCchhHHH---HHHHHHHHHHHHhhc Q lcl|Aclame:pro 81 LVNNVLPTFVDQVLGDQRQ----NRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYEL---AEVFTGLIKNIEYNC 153 (711) Q Consensus 81 ~~~N~i~~~v~~i~g~~~~----~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~---Ae~l~~~~~~~~~~~ 153 (711) +.-+.-...++.+.+..-. +++=.++.+.+.. ............+. -+.++..+......| T Consensus 56 ~~dst~~~a~~~Las~l~~~ltP~~~WFrl~~~d~~------------~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~s 123 (522) T protein:vir:94 56 PWQAVGARCLNNLAAKLMLALFPQSPWMRLTVSEYE------------AKTLSQDSEAAARVDEGLAMVERVLMAYMETN 123 (522) T ss_pred cccccHHHHHHHHHHHHHhhcCCCCcccccccchhh------------hhccCcccchhHHHHHHHHHHHHHHHHHHHhc Confidence 2233333333333322211 1111122111000 00000000111112 233455555666789 Q ss_pred CHHHHHHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCccceeeCCCccccCccccceeeeeecCCHHHHHHhc Q lcl|Aclame:pro 154 DAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALY 233 (711) Q Consensus 154 ~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~ 233 (711) +|..+...++.+.+..|+|+.-+.- +.-+.-..++.+ +..++++..++.- ...-++++..++.+.+-+.+ T Consensus 124 nf~~~~~~~~~~L~~~G~a~l~~~~-----~~~~~~~~~~~~-pl~~y~v~~d~~G----~vd~i~r~~~~~~~~l~~~~ 193 (522) T protein:vir:94 124 SFRVPLFEALKQLIVSGNCLLYIPE-----PEQGTYSPMRMY-RLVSYVVQRDAFG----NILQIVTIDKVAFSALPEDV 193 (522) T ss_pred CcHHHHHHHHHHHHhhCcEeEeeec-----cCCCceeeEEEE-EcceEEEeeCCCc----CeEEEeeeeeccHHhcchHH Confidence 9999999999999999999864332 221122335545 5667777654421 23456677888888765555 Q ss_pred CCcccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEE Q lcl|Aclame:pro 234 PDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTY 313 (711) Q Consensus 234 p~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~ 313 (711) +.... . +.....+.|.|+++.+++.. ++. + T Consensus 194 ~~~~~----~------~~~~p~~~v~v~~~v~~~~~----------~~~------------------------------~ 223 (522) T protein:vir:94 194 KSQLN----A------DDYEPDTELEVYTHIYRQDD----------EYL------------------------------R 223 (522) T ss_pred HHHHh----c------ccCCccceEEEEEEEEeeCC----------cee------------------------------E Confidence 43220 0 11112357777777665421 111 1 Q ss_pred EEEEecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccC Q lcl|Aclame:pro 314 WRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVE 393 (711) Q Consensus 314 ~~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~ 393 (711) +.-..|..+....+-|+...+||+++. +..+++..||.|.+....+-.+.+|.+...++.....+.+|+++++++.+. T Consensus 224 ~~~~~g~~~~~~~~~~~~~e~P~~~~R--w~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~~~g~~ 301 (522) T protein:vir:94 224 YEEVEGIEVTGTDGSYPLTACPYIPVR--MVRLDGEDYGRSYCEEYLGDLNSLETITEAITKMAKVASKVVGLVNPNGIT 301 (522) T ss_pred EeeccCceecccCCCCccccCCceeee--eeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecccccc Confidence 111223333333345677889999764 445688999999999999999999999999999999999999999988887 Q ss_pred ChHHHHhhcccCCCceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhc-cccchhHHHHHHHHH Q lcl|Aclame:pro 394 GREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLG-AMGNETSGRAIIARQ 472 (711) Q Consensus 394 ~~~~~~~~~~~~~~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G-~~~~~~sg~ai~~~~ 472 (711) +..+.. ...+|.++. +...+..+-......-.+.....++...+.|....-+. +++ .++.+.|+.-|..+. T Consensus 302 ~~~~~~---~~~~g~~v~---g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~--~~~~~~~~r~TAtEV~~r~ 373 (522) T protein:vir:94 302 QPRRLN---KAATGEFVA---GRVEDINFLQLTKGQDFTIAKSVADAIEQRLGWAFLLN--SAVQRNAERVTAEEIRYVA 373 (522) T ss_pred cchhee---ccCCceeec---CCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhh--hhccCCCccccHHHHHHHH Confidence 766432 234455443 22222222233333345556777888888888776433 344 455668999999999 Q ss_pred HHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEE Q lcl|Aclame:pro 473 RQGDRGSFAFIDNLTK-SIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVV 551 (711) Q Consensus 473 ~~~~~~~~~~~dn~~~-~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~ 551 (711) +.....+..++..|.. +...+.+..+.++.+.---|. + | ..-+.+. T Consensus 374 ~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~---------------~------p------------~~~v~v~ 420 (522) T protein:vir:94 374 GELEATLGGVYSVQSQELQLPIVRVLMNQLQSAGMIPD---------------L------P------------KEAVEPT 420 (522) T ss_pred HHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCC---------------C------C------------cccEEee Confidence 9999999988888754 666666666666544321000 0 0 0002233 Q ss_pred eecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCCcchHHHHHHHHhhhcc--hhhcchhhhhhhhhHHHHHH Q lcl|Aclame:pro 552 VTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDWPGADVIAERLKKIVPP--NVLSKDEREAIEEDMPEQTE 629 (711) Q Consensus 552 v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~--~~~~~~~~~~~~~~~~~~q~ 629 (711) +.. +-...+|.+-.+.|.++++.+.++.+..+ ++.-+.+++.+.+....+- ......+.+..+..+ ++++ T Consensus 421 ~~s-~La~~qr~~~~~~l~~~~~~ia~l~P~~~------~~~id~d~~~~~~a~~~Gv~~~~ivr~~ee~~~~~~-q~~~ 492 (522) T protein:vir:94 421 VST-GLEALGRGQDLEKLTQAVNMMTGLQPLSQ------DPDINLPTLKLRLLNALGIDTAGLLLTQDEKIQRMA-EQSS 492 (522) T ss_pred Eec-HHHHHHHHHHHHHHHHHHHHHHhccchhh------hhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHH-HHHH Confidence 322 23345566666666666665433333221 1223567777777666543 222222111111111 1000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 630 PTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQLETEEAQKQLAMIEDMAQG 682 (711) Q Consensus 630 ~~~~~q~~~~~~q~~~~k~qae~~~aqae~~~~q~~~~~~q~q~~~~~~~~q~ 682 (711) ++ +++ +.+. +..+...+.. ..+ ..+.+.++ T Consensus 493 ~~-~~~---~~~~-------~~~~~~~a~~-~~~-----------~~~~~~~~ 522 (522) T protein:vir:94 493 QQ-AVV---QGAS-------AAGANMGAAV-GQG-----------AGEDMAQA 522 (522) T ss_pred HH-HHH---HHHH-------HHHHHhhhhh-hcc-----------cchhhhcC Confidence 00 000 0000 0000000000 000 00000000 No 45 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=99.84 E-value=5.7e-20 Score=126.08 Aligned_cols=446 Identities=14% Similarity=0.064 Sum_probs=230.1 Q ss_pred CCcCCCCCCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCC- Q lcl|Aclame:pro 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRP- 79 (711) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p- 79 (711) |--+.+..=+.+ .+.+-+.+.+.++.+.+ ...+.+..+..+||.|.| +-.....+..++| T Consensus 1 ~~~~~~~~~~~~----------~~~~~~~~~i~~~i~~~-------~~~~~r~~~~~~Yy~g~~--~i~~~~~~~~~~~~ 61 (452) T protein:vir:36 1 MKYKPPKLMTFS----------KDEPITVEVVTKFMEKH-------KLEVARYEYLKNMYLGIM--AIDDEPAKDSWKPD 61 (452) T ss_pred CcccCceeEEcC----------CccCCCHHHHHHHHHHH-------HHHHHHHHHHHHHhcccc--ccccCccccccCcc Confidence 432222211111 11112223344444432 223445567899999975 1111112233333 Q ss_pred -ceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHH Q lcl|Aclame:pro 80 -CLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETE 158 (711) Q Consensus 80 -~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~ 158 (711) .+.+|..+.+|+..+|+.-.+.+.+. + .|.+. ...+..+++.|+++.. T Consensus 62 ~ki~~n~~~~ivd~~~~~l~g~~~~~~--~-------------------------~d~~~----~~~l~~~~~~n~~~~~ 110 (452) T protein:vir:36 62 NRLAVNFTKYIVDTFTGYFNGIPVKKS--H-------------------------SDKEI----LTKLQEFDNLNDMEDE 110 (452) T ss_pred ceeecchHHHHHHHHhhhhcccCceee--c-------------------------CChhH----HHHHHHHHhhcChhHH Confidence 47789999999999998766654432 1 12222 3456777788999999 Q ss_pred HHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCcccee--eCCCccccCccccceeeeeecCCHHHHHHhcCCc Q lcl|Aclame:pro 159 YDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDA 236 (711) Q Consensus 159 ~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~ 236 (711) ...+..+++++|.||+.++.+. ++.+++..+ +|.+++ ||+.... .. ..+++.+.+ T Consensus 111 ~~~~~~~~~~~G~~~~~v~~d~------~g~~~i~~~-~p~~~~~v~d~~~~~----~~-~~~i~~~~~----------- 167 (452) T protein:vir:36 111 ESELAKMACIYGRAFEFLYQDE------DTQTNVVYN-SPENMFMVYDDTVKQ----EP-LFAVRYGVD----------- 167 (452) T ss_pred HHHHHHHHHhcCeEEEEEEecC------CCeeEEEEE-cccceEEEEcCCCCC----ce-EEEEEEEEe----------- Confidence 9999999999999998887642 256777766 777775 4542211 11 122222211 Q ss_pred ccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEE Q lcl|Aclame:pro 237 TAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRK 316 (711) Q Consensus 237 ~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~ 316 (711) .+....+++|.... ++. |.. T Consensus 168 ------------------~~~~~~~~vyt~~~------------i~~------------------------------~~~ 187 (452) T protein:vir:36 168 ------------------EDKKLQGEVYTLLE------------TIK------------------------------ISG 187 (452) T ss_pred ------------------cCceEEEEEEecCe------------EEE------------------------------EEE Confidence 01112233332211 110 000 Q ss_pred EecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChH Q lcl|Aclame:pro 317 ITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGRE 396 (711) Q Consensus 317 ~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~ 396 (711) -.+...+....|.+.|.+|+|++. +...+.|.+..+++.++.+|..+|.+.+.+...+++.+++....+. .+ T Consensus 188 ~~~~~~~~~~~~~~~g~iPvv~~~-------n~~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~p~~~~~g~~~~-~~ 259 (452) T protein:vir:36 188 ENDEISFGEGTYNPYPDLPVVEFY-------FNEERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLGAAVE-EE 259 (452) T ss_pred cCCceEEecceeccCCcccEEEec-------CCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCcC-ch Confidence 111112233455566677776542 1234678999999999999999999999998888887777533332 22 Q ss_pred HHHhhcccCCCceEEeccccc-CcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHHHH Q lcl|Aclame:pro 397 DEWEQANTKNFSLLTYIPQYQ-GDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQG 475 (711) Q Consensus 397 ~~~~~~~~~~~~~i~~~~~~~-~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~ 475 (711) + .. ..+++.++.+.++.. .++.+.++..+.-...+...++...+.|-..|++++.+.+..+ +.||.|+..+-... T Consensus 260 ~-~~--~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~g-n~Sg~Al~~~~~~l 335 (452) T protein:vir:36 260 D-LK--NIRSNRVINYYADGEGKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVANISDESFG-SSSGVSLAYKLQAM 335 (452) T ss_pred h-hh--hhhhcceEEecCCCCccCCcceeEeecCCHHHHHHHHHHHHHHHHHHhCccccCccccc-CCcHHHHHHHHHHH Confidence 2 12 133445566654332 2344666666666777788889999999999999887766554 46999988877666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEeecc Q lcl|Aclame:pro 476 DRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTG 555 (711) Q Consensus 476 ~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~ 555 (711) ..........|..+++++.++++.+.... |. ..++. +|.|.=. T Consensus 336 ~~k~~~~~~~~~~~l~~~~~li~~~~~~~----------~~--~~~~~-------------------------~i~i~f~ 378 (452) T protein:vir:36 336 SNLALSFQRKFQSSLNSRYKLFCELSTNV----------SN--KDSWK-------------------------DIEYTFT 378 (452) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcc----------CC--ccccc-------------------------cceEEeC Confidence 66666666777777777777766654321 10 11111 1122222 Q ss_pred cChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCC-cchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHHHHHH Q lcl|Aclame:pro 556 PAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDW-PGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQ 634 (711) Q Consensus 556 ~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~-~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~ 634 (711) +..+....+..+.+..+...++. ..+++.+++ .+.++-.+++++......... T Consensus 379 ~~~p~d~~~~a~~~~k~~g~iS~------et~~~~~~~~~d~~~E~~ri~~E~~~~~~~~-------------------- 432 (452) T protein:vir:36 379 RNEPKDIKEQAETANILMGITSQ------ETALSVISVIPDVQAEMEKIKKEEASTAIFD-------------------- 432 (452) T ss_pred CCCCcCHHHHHHHHHHHhccCCh------HHHHHhCCCCCCHHHHHHHHHHHHHHHHHHH-------------------- Confidence 22222233333444444333221 233344433 222222222222111000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 635 QVEMAKSQADMAQAEADTAQAQADMLKAQLE 665 (711) Q Consensus 635 q~~~~~~q~~~~k~qae~~~aqae~~~~q~~ 665 (711) . ..+.... -.+.+......+ T Consensus 433 ---~-~~~~~~~-------~~~~~~~~~~~e 452 (452) T protein:vir:36 433 ---K-DKQPSEK-------GTDTVVSETNEE 452 (452) T ss_pred ---h-hccCCCC-------cccccCccccCC Confidence 0 0000000 000000000000 No 46 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=99.84 E-value=1.7e-19 Score=123.54 Aligned_cols=463 Identities=11% Similarity=-0.008 Sum_probs=229.2 Q ss_pred CCcCCCCCCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCC- Q lcl|Aclame:pro 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRP- 79 (711) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p- 79 (711) |+-.-+.+.-.+.. +.=..|.+..=..+.+.++.+.+ ....+....+-.+||.|+| .........++| T Consensus 1 ~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~i~~~i~~~------~~~~~~~~~~l~~Yy~g~~---~i~~~~~~~~~~~ 69 (470) T protein:vir:99 1 MKDINYGRDKVTGN--SSFIFPKGEKLTSNELLGFIAYN------ETVLKPRYRENMKLYLGKH---KILTAPEKETGAD 69 (470) T ss_pred CccccCCcccccCC--ceEEeCCCCCcCHHHHHHHHHHH------HHhhHHHHHHHHHHhcccc---ccccCcccccCCc Confidence 54333222111111 11001111111112233333222 2233445567789999975 111111222333 Q ss_pred -ceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHH Q lcl|Aclame:pro 80 -CLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETE 158 (711) Q Consensus 80 -~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~ 158 (711) .+++|..+.+|+..+|+.-.+.+.+.+ +.|....+ .+..+++.|+++.. T Consensus 70 ~ki~~n~~~~Ivd~~~~~l~g~p~~~~~--------------------------~~d~~~~~----~l~~~~~~n~~~~~ 119 (470) T protein:vir:99 70 NRIVVNSAKYVVDVYNGYFCGIEPKLAL--------------------------LNDSSKID----EIARWNRQENFFDT 119 (470) T ss_pred ceeecchHHHHHHHHhhhhccCCeeEee--------------------------CCchhHHH----HHHHHHHhcCHhHH Confidence 578899999999999987766544432 12222222 34556778999999 Q ss_pred HHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCcccee--eCCCccccCccccceeeeeecCCHHHHHHhcCCc Q lcl|Aclame:pro 159 YDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDA 236 (711) Q Consensus 159 ~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~ 236 (711) ...+..+++++|.+|..++.+. ++++.+..+ +|.+++ ||+.... -..++++.+.... T Consensus 120 ~~~~~~~~~~~G~~~~~v~~d~------dg~~~i~~~-~p~~~~~i~d~~~~~-----~~~~~vr~~~~~~--------- 178 (470) T protein:vir:99 120 INEISKQCDIFGRSIASIYQGE------DARPHLMYS-SPNHAFIIYDDTVQR-----QPLAFVHYQIDNS--------- 178 (470) T ss_pred HHHHHHHHHhcCeeEEEEEeCC------CCeEEEEEE-ccceeEEEEcCCCCc-----ceEEEEEEEEEec--------- Confidence 9999999999999988776642 356777766 787765 5543211 0111222222100 Q ss_pred ccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEE Q lcl|Aclame:pro 237 TAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRK 316 (711) Q Consensus 237 ~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~ 316 (711) ........+.|+.. .++.+... - T Consensus 179 -----------------~~~~~~~~~~~~~~------------~~~~~~~~----------------------------~ 201 (470) T protein:vir:99 179 -----------------NNWTDAYGVIQYAD------------KFYKFKGY----------------------------D 201 (470) T ss_pred -----------------CCeeEEEEEEEecC------------eEEEEEec----------------------------c Confidence 00111111222111 01110000 0 Q ss_pred EecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChH Q lcl|Aclame:pro 317 ITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGRE 396 (711) Q Consensus 317 ~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~ 396 (711) ......+.+..|.+.+.+|++++. +...|.|.+..+++.++.+|..+|.+...+...+++.+++......+.+ T Consensus 202 ~~~~~~~~~~~~~~~g~vPvv~~~-------n~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~ 274 (470) T protein:vir:99 202 IEEDTNAAGYAINPYGLVPAVEFF-------ENEERQGIFDSIKTLINALDKVISQKANQVEYFDNAYMYMIGFKLPEDD 274 (470) T ss_pred cccccccccccccCCCccceEeec-------CCCCCCcchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCccccc Confidence 000111122344555677777542 2345779999999999999999999999999888888877543332211 Q ss_pred --HHHhhcccCCCceEEeccc-ccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHH Q lcl|Aclame:pro 397 --DEWEQANTKNFSLLTYIPQ-YQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQR 473 (711) Q Consensus 397 --~~~~~~~~~~~~~i~~~~~-~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~ 473 (711) +... ......++.+.+. ...++.+.++..+.....+...++.....|-..||+++.+.+..+++.||.|+..+.. T Consensus 275 ~g~~~~--~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~ 352 (470) T protein:vir:99 275 EGNPKF--DFKNNRVLYVSQLDPDTNPQIGFIAKPDADQMQENLIQHLTDFIFMMAMVPNIQDKNFAGNSSGVALQYKLF 352 (470) T ss_pred ccchhh--hhhhcceeeecCCCCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCCccccccccccCchHHHHHHHHH Confidence 1111 1222334444322 2233457777766666677788899999999999999887777666789999988777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEee Q lcl|Aclame:pro 474 QGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVT 553 (711) Q Consensus 474 ~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~ 553 (711) .........-..|..+++++.++++.++...... ..++ .++.+. T Consensus 353 ~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~-----------~~~~-------------------------~~i~v~ 396 (470) T protein:vir:99 353 AMKNKADSKERKFDKSLMQLYRIVLATLFNNKQD-----------QELW-------------------------SELDFK 396 (470) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCc-----------cccc-------------------------ccceEE Confidence 6666666777777777777777666554332110 0010 112222 Q ss_pred cccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHHHHH Q lcl|Aclame:pro 554 TGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPE 633 (711) Q Consensus 554 ~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~ 633 (711) =.+..+....+..+.+..+...++. ..+++.+++-..++-.+++.+..... ... T Consensus 397 f~~~~p~~~~e~a~~~~kl~giis~------et~l~~l~~vd~~~E~eri~~E~~~~--------------------~~~ 450 (470) T protein:vir:99 397 FTRNLPEDMASAIDNAKNAEGIVSK------KTQLGMIPDIEPDAEMKQIAKEKADA--------------------IKQ 450 (470) T ss_pred eCCCCCcCHHHHHHHHHHHhccCCH------HHHHHhCCCCCHHHHHHHHHHHHHHH--------------------HHH Confidence 2233332233333444444332222 22334444333222222222111000 000 Q ss_pred HHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 634 QQVEMAKSQADMAQAEADTA 653 (711) Q Consensus 634 ~q~~~~~~q~~~~k~qae~~ 653 (711) .+...............+-+ T Consensus 451 ~~~~~~~~d~~~~d~~~ee~ 470 (470) T protein:vir:99 451 TQQLSMPIDILKRDNNAEEE 470 (470) T ss_pred HHhhcCCCCcCCCCCCccCC Confidence 00000000000000000000 No 47 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=99.84 E-value=7.1e-20 Score=125.57 Aligned_cols=484 Identities=12% Similarity=0.048 Sum_probs=236.7 Q ss_pred CCcCC-------CCCCCCcccCCCcccCCc------CcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCH Q lcl|Aclame:pro 1 MAKKQ-------KKSRVEQLYAKKAKVYAK------NNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPS 67 (711) Q Consensus 1 ~~~~~-------~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~ 67 (711) |-|-- ..-++..-..+++..... +...+.+.+.++...+ ....+....+..+||.|.|..- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~~i~~~i~~~------~~~~~~r~~~l~~Yy~g~~~i~ 74 (511) T protein:vir:10 1 MLKVNEFETDTDLRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKCIEHH------MDYQRPRLKVLSDYYEGKTKNL 74 (511) T ss_pred CccccchhhhhhhhhhhhhhhhhhhcCCccCchhhhhcccCHHHHHHHHHHH------HHhhHHHHHHHHHHhcccCccc Confidence 22211 111111111111111111 1111222233333221 1223455667789999987642 Q ss_pred HHHHHHHHhCCC--ceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHH Q lcl|Aclame:pro 68 QVRTERELEQRP--CLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGL 145 (711) Q Consensus 68 ~~~~~~~~~g~p--~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~ 145 (711) .........++| .+++|..+.+|+..+|+.-.+.+.+.. ++.+ .... T Consensus 75 ~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~---------------------------~d~~----~~~~ 123 (511) T protein:vir:10 75 VELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQD---------------------------DDKD----VLEA 123 (511) T ss_pred cccCcccccccCcceeecchHHHHHHHHhhhhcccCceeec---------------------------CchH----HHHH Confidence 221222223333 577899999999999998766555432 2222 2356 Q ss_pred HHHHHhhcCHHHHHHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCcccee--eCCCccccCccccceeeeeec Q lcl|Aclame:pro 146 IKNIEYNCDAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDT 223 (711) Q Consensus 146 ~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~ 223 (711) +..+++.|+++.....+..+++++|.+|..++.+ + ++++++..+ +|.++| ||+... .-..++++.| T Consensus 124 l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~d---e---dg~~~i~~~-~p~~~~~vydd~~~-----~~~~~~vr~~ 191 (511) T protein:vir:10 124 IEAFNDLNDVESHNRSLGLDLSIYGKAYEIMIRN---Q---DDETRLYKS-DAMSTFVIYDNTIE-----RNSIAGVRYL 191 (511) T ss_pred HHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEeC---C---CCceEEEEE-ccceeEEEEcCCCC-----CceEEEEEEE Confidence 7777888999999999999999999998877664 2 256788776 787776 443221 1122333333 Q ss_pred CCHHHHHHhcCCcccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhh Q lcl|Aclame:pro 224 MSKEKFKALYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVR 303 (711) Q Consensus 224 ~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 303 (711) ... . ......+.+..+++|....... + ....+.... T Consensus 192 ~~~------~----------------~d~~~~~~~~~~~iyt~~~i~~-~-~~~~~~~~~-------------------- 227 (511) T protein:vir:10 192 RTK------P----------------IDKTDEDEVFTVDLFTSHGVYR-Y-LTSRTNGLK-------------------- 227 (511) T ss_pred Eee------e----------------cccCccceEEEEEEEeCCcEEE-E-EecCCCccc-------------------- Confidence 110 0 0001223445556654432111 0 000000000 Q ss_pred hcccceEEEEEEEEecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCC Q lcl|Aclame:pro 304 TRKVKTFKTYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKA 383 (711) Q Consensus 304 ~~~~~~~~v~~~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~ 383 (711) .......+.|.+.+.+|+++|. +...+.|.+..++++++.+|...|.+.+.+...+++ T Consensus 228 ---------------~~~~~~~~~~~~~~~vPvv~f~-------nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~ 285 (511) T protein:vir:10 228 ---------------LTPRENGFESHSFERMPITEFS-------NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDA 285 (511) T ss_pred ---------------ccccccccccccCcceeEEEec-------CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCc Confidence 0000112345566667776542 223477999999999999999999999999888877 Q ss_pred ceEecccccCChHHHHhhcccCCCceEEecc---------cccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHH Q lcl|Aclame:pro 384 PFIGSEGNVEGREDEWEQANTKNFSLLTYIP---------QYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDA 454 (711) Q Consensus 384 ~~~~~~~av~~~~~~~~~~~~~~~~~i~~~~---------~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~ 454 (711) .+++......+..+. . ..+.+.++...+ +...+..+.++..+.-..++...+......|..+|++++. T Consensus 286 ~lv~~g~~~~~~~~~-~--~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~ 362 (511) T protein:vir:10 286 MLLIKGNLNLDPVEV-R--KQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNM 362 (511) T ss_pred eeeeeccccCCchhh-c--cchhccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccc Confidence 776643232232221 1 112223333322 1122334566666556677888889999999999999988 Q ss_pred HhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhc Q lcl|Aclame:pro 455 SLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEES 534 (711) Q Consensus 455 ~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~ 534 (711) +.+.-+++.||.|+..+-..........-..|..++++++++++.++...-. .....++ T Consensus 363 ~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~~~---------~~~~~d~------------ 421 (511) T protein:vir:10 363 KDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRS---------IDANKDF------------ 421 (511) T ss_pred ccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCC---------ccccccc------------ Confidence 7776656789999988877666666666667777777777766665432210 0001111 Q ss_pred cceeeeehhhheeeeEEeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCC-cchHHHHHHHHhhhcchhhc Q lcl|Aclame:pro 535 GEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDW-PGADVIAERLKKIVPPNVLS 613 (711) Q Consensus 535 g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~-~~~~e~~~~l~~~~~~~~~~ 613 (711) .++.+.=.+..+....+..+.+..+...++. ..+++.+++ .+.++-++++.+........ T Consensus 422 -------------~~i~i~f~~~~p~d~~~~~~~~~kl~G~iS~------et~~~~l~~v~d~~~E~~ri~~E~~~~~~~ 482 (511) T protein:vir:10 422 -------------NTVRYVYNRNLPKSLIEELKAYIDSGGKISQ------TTLMSLFSFFQDPELEVKKIEEDEKESIKK 482 (511) T ss_pred -------------ceeeEEeCCCCCcCHHHHHHHHHHHhccCcH------HHHHHhCCCCCCHHHHHHHHHHHHHHHHHH Confidence 1122222333333333344444444333222 223333332 22222222222211100000 Q ss_pred chhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 614 KDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEADTAQ 654 (711) Q Consensus 614 ~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~~~~k~qae~~~ 654 (711) .......... ...........+-.++.++ T Consensus 483 ~~~~~~~~~~------------~~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:10 483 AQKGIYKDPR------------DINDDEQDDDTKDTVDKKE 511 (511) T ss_pred HhhhcccCCC------------CCCCCCCCCcccCcccccC Confidence 0000000000 0000000000000000000 No 48 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=99.83 E-value=4.5e-20 Score=126.67 Aligned_cols=484 Identities=12% Similarity=0.042 Sum_probs=235.6 Q ss_pred CCcCC-------CCCCCCcccCCCcccCCc------CcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCH Q lcl|Aclame:pro 1 MAKKQ-------KKSRVEQLYAKKAKVYAK------NNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPS 67 (711) Q Consensus 1 ~~~~~-------~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~ 67 (711) |-|-- +.-++..-..+++..... +...+...+.++.+ .... ..+....+..+||.|.|..- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~---~~~~---~~~~r~~~l~~Yy~g~~~il 74 (511) T protein:vir:78 1 MLKVNEFETDTDLRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIE---HHMD---YQRPRLKVLSDYYEGKTKNL 74 (511) T ss_pred CccccchhhhhhhhhhhhhhhhhhhCCcccccchhhhhhcCHHHHHHHHH---HHHH---hhhHHHHHHHHHhhccCccc Confidence 22211 111111111111111110 01111122223222 2211 23344556788999987642 Q ss_pred HHHHHHHHhCC--CceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHH Q lcl|Aclame:pro 68 QVRTERELEQR--PCLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGL 145 (711) Q Consensus 68 ~~~~~~~~~g~--p~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~ 145 (711) .........++ ..+++|..+.+|+..+|+.-.+.+.+.. ++.+. ... T Consensus 75 ~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~---------------------------~d~~~----~~~ 123 (511) T protein:vir:78 75 VELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQD---------------------------DDKDV----LEA 123 (511) T ss_pred cccCcccccccCcceeecchHHHHHHHHhhhhcccCceeec---------------------------CchHH----HHH Confidence 22222222333 3578899999999999998766555532 22222 345 Q ss_pred HHHHHhhcCHHHHHHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCcccee--eCCCccccCccccceeeeeec Q lcl|Aclame:pro 146 IKNIEYNCDAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDT 223 (711) Q Consensus 146 ~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~ 223 (711) +..+++.|+++.....+..+++++|.+|..++.+. ++++++..+ +|.++| ||+... .-..++++.| T Consensus 124 l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~d~------dg~~~i~~~-~p~~~~~v~dd~~~-----~~~~~~vr~~ 191 (511) T protein:vir:78 124 IEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQ------DDETRLYKS-DAMSTFIIYDNTVE-----RNSIAGVRYL 191 (511) T ss_pred HHHHHhhcChhHHHHHHHHHHHhcCeeEEEEEeCC------CCceEEEEE-cccceEEEEcCCCC-----CceEEEEEEE Confidence 77778889999999999999999999998777642 256788776 788776 554331 1122333333 Q ss_pred CCHHHHHHhcCCcccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhh Q lcl|Aclame:pro 224 MSKEKFKALYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVR 303 (711) Q Consensus 224 ~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 303 (711) .... ......+.+..+++|....... + ....+.... T Consensus 192 ~~~~----------------------~~~~~~~~~~~~~vyt~~~i~~-~-~~~~~~~~~-------------------- 227 (511) T protein:vir:78 192 RTKP----------------------IDKTDEDEVFTVDLFTSHGVYR-Y-LTNRTNGLK-------------------- 227 (511) T ss_pred Eeee----------------------ccccccceEEEEEEEeCCcEEE-E-EecCCCccc-------------------- Confidence 2110 0001123444455554432110 0 000110000 Q ss_pred hcccceEEEEEEEEecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCC Q lcl|Aclame:pro 304 TRKVKTFKTYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKA 383 (711) Q Consensus 304 ~~~~~~~~v~~~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~ 383 (711) .........|.+.+.+|+++|. +...+.|.+..+++.++.+|...|.+.+.+...+++ T Consensus 228 ---------------~~~~~~~~~~~~~g~vPvv~~~-------n~~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~ 285 (511) T protein:vir:78 228 ---------------LTPRENSFESHSFERMPITEFS-------NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDA 285 (511) T ss_pred ---------------ccccccccccCcCcccceEEec-------CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcc Confidence 0000113355666777776542 234577999999999999999999999999888887 Q ss_pred ceEecccccCChHHHHhhcccCCCceEEecc---------cccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHH Q lcl|Aclame:pro 384 PFIGSEGNVEGREDEWEQANTKNFSLLTYIP---------QYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDA 454 (711) Q Consensus 384 ~~~~~~~av~~~~~~~~~~~~~~~~~i~~~~---------~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~ 454 (711) .+++......+.++ ... ...+.++...+ +...+..+.++..+.-...+...+......|-.+|++++. T Consensus 286 ~lv~~G~~~~~~~~-~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~ 362 (511) T protein:vir:78 286 MLLIKGNLNLDPVE-VRK--QKEANVLFLEPTVYVDAEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNM 362 (511) T ss_pred hhheecCccCCchh-hcc--cccccceeccccceeccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccc Confidence 77664322222222 111 11222222211 1122344566666555677788889899999999999998 Q ss_pred HhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhc Q lcl|Aclame:pro 455 SLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEES 534 (711) Q Consensus 455 ~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~ 534 (711) +.+.-+++.||.|+..+..............|..++++++++++.++...-... ...++. T Consensus 363 ~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~---------~~~~~~----------- 422 (511) T protein:vir:78 363 KDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSID---------ANKDFN----------- 422 (511) T ss_pred cccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc---------cccccc----------- Confidence 877666678999998887666666666677777777777777666543321100 001110 Q ss_pred cceeeeehhhheeeeEEeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCC-cchHHHHHHHHhhhcchhhc Q lcl|Aclame:pro 535 GEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDW-PGADVIAERLKKIVPPNVLS 613 (711) Q Consensus 535 g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~-~~~~e~~~~l~~~~~~~~~~ 613 (711) ++.+.=.+..+....+..+.+..+...++. ..+++.+++ .+.++-++++.+........ T Consensus 423 --------------~i~~~f~~~~p~n~~e~~d~~~kl~G~iS~------et~l~~l~~v~d~~~El~ri~~E~~~~~~~ 482 (511) T protein:vir:78 423 --------------TVRYVYNRNLPKSLIEELKAYIDSGGKISQ------TTLMSLFSFFQDPELEVKKIEEDEKESIKK 482 (511) T ss_pred --------------cceEEeCCCCCcCHHHHHHHHHHHhccCCh------HHHHHhCCCCCCHHHHHHHHHHHHHHHHHH Confidence 111222222233233334444444333332 223333332 22222223232211100000 Q ss_pred chhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 614 KDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQLE 665 (711) Q Consensus 614 ~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~~~~k~qae~~~aqae~~~~q~~ 665 (711) .... ..........+...-+.+-...+.+ T Consensus 483 ~~~~-----------------------~~~~~~~~~~~~~~~~~~~~~~e~~ 511 (511) T protein:vir:78 483 AQKG-----------------------IYKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred Hhhc-----------------------cccCCCCCCCCCCCCCccCcccccC Confidence 0000 0000000000000000000000000 No 49 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=99.83 E-value=4.5e-20 Score=126.67 Aligned_cols=484 Identities=12% Similarity=0.042 Sum_probs=235.6 Q ss_pred CCcCC-------CCCCCCcccCCCcccCCc------CcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCH Q lcl|Aclame:pro 1 MAKKQ-------KKSRVEQLYAKKAKVYAK------NNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPS 67 (711) Q Consensus 1 ~~~~~-------~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~ 67 (711) |-|-- +.-++..-..+++..... +...+...+.++.+ .... ..+....+..+||.|.|..- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~---~~~~---~~~~r~~~l~~Yy~g~~~il 74 (511) T protein:vir:96 1 MLKVNEFETDTDLRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIE---HHMD---YQRPRLKVLSDYYEGKTKNL 74 (511) T ss_pred CccccchhhhhhhhhhhhhhhhhhhCCcccccchhhhhhcCHHHHHHHHH---HHHH---hhhHHHHHHHHHhhccCccc Confidence 22211 111111111111111110 01111122223222 2211 23344556788999987642 Q ss_pred HHHHHHHHhCC--CceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHH Q lcl|Aclame:pro 68 QVRTERELEQR--PCLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGL 145 (711) Q Consensus 68 ~~~~~~~~~g~--p~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~ 145 (711) .........++ ..+++|..+.+|+..+|+.-.+.+.+.. ++.+. ... T Consensus 75 ~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~---------------------------~d~~~----~~~ 123 (511) T protein:vir:96 75 VELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQD---------------------------DDKDV----LEA 123 (511) T ss_pred cccCcccccccCcceeecchHHHHHHHHhhhhcccCceeec---------------------------CchHH----HHH Confidence 22222222333 3578899999999999998766555532 22222 345 Q ss_pred HHHHHhhcCHHHHHHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCcccee--eCCCccccCccccceeeeeec Q lcl|Aclame:pro 146 IKNIEYNCDAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDT 223 (711) Q Consensus 146 ~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~ 223 (711) +..+++.|+++.....+..+++++|.+|..++.+. ++++++..+ +|.++| ||+... .-..++++.| T Consensus 124 l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~d~------dg~~~i~~~-~p~~~~~v~dd~~~-----~~~~~~vr~~ 191 (511) T protein:vir:96 124 IEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQ------DDETRLYKS-DAMSTFIIYDNTVE-----RNSIAGVRYL 191 (511) T ss_pred HHHHHhhcChhHHHHHHHHHHHhcCeeEEEEEeCC------CCceEEEEE-cccceEEEEcCCCC-----CceEEEEEEE Confidence 77778889999999999999999999998777642 256788776 788776 554331 1122333333 Q ss_pred CCHHHHHHhcCCcccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhh Q lcl|Aclame:pro 224 MSKEKFKALYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVR 303 (711) Q Consensus 224 ~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 303 (711) .... ......+.+..+++|....... + ....+.... T Consensus 192 ~~~~----------------------~~~~~~~~~~~~~vyt~~~i~~-~-~~~~~~~~~-------------------- 227 (511) T protein:vir:96 192 RTKP----------------------IDKTDEDEVFTVDLFTSHGVYR-Y-LTNRTNGLK-------------------- 227 (511) T ss_pred Eeee----------------------ccccccceEEEEEEEeCCcEEE-E-EecCCCccc-------------------- Confidence 2110 0001123444455554432110 0 000110000 Q ss_pred hcccceEEEEEEEEecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCC Q lcl|Aclame:pro 304 TRKVKTFKTYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKA 383 (711) Q Consensus 304 ~~~~~~~~v~~~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~ 383 (711) .........|.+.+.+|+++|. +...+.|.+..+++.++.+|...|.+.+.+...+++ T Consensus 228 ---------------~~~~~~~~~~~~~g~vPvv~~~-------n~~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~ 285 (511) T protein:vir:96 228 ---------------LTPRENSFESHSFERMPITEFS-------NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDA 285 (511) T ss_pred ---------------ccccccccccCcCcccceEEec-------CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcc Confidence 0000113355666777776542 234577999999999999999999999999888887 Q ss_pred ceEecccccCChHHHHhhcccCCCceEEecc---------cccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHH Q lcl|Aclame:pro 384 PFIGSEGNVEGREDEWEQANTKNFSLLTYIP---------QYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDA 454 (711) Q Consensus 384 ~~~~~~~av~~~~~~~~~~~~~~~~~i~~~~---------~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~ 454 (711) .+++......+.++ ... ...+.++...+ +...+..+.++..+.-...+...+......|-.+|++++. T Consensus 286 ~lv~~G~~~~~~~~-~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~ 362 (511) T protein:vir:96 286 MLLIKGNLNLDPVE-VRK--QKEANVLFLEPTVYVDAEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNM 362 (511) T ss_pred hhheecCccCCchh-hcc--cccccceeccccceeccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccc Confidence 77664322222222 111 11222222211 1122344566666555677788889899999999999998 Q ss_pred HhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhc Q lcl|Aclame:pro 455 SLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEES 534 (711) Q Consensus 455 ~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~ 534 (711) +.+.-+++.||.|+..+..............|..++++++++++.++...-... ...++. T Consensus 363 ~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~---------~~~~~~----------- 422 (511) T protein:vir:96 363 KDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSID---------ANKDFN----------- 422 (511) T ss_pred cccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc---------cccccc----------- Confidence 877666678999998887666666666677777777777777666543321100 001110 Q ss_pred cceeeeehhhheeeeEEeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCC-cchHHHHHHHHhhhcchhhc Q lcl|Aclame:pro 535 GEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDW-PGADVIAERLKKIVPPNVLS 613 (711) Q Consensus 535 g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~-~~~~e~~~~l~~~~~~~~~~ 613 (711) ++.+.=.+..+....+..+.+..+...++. ..+++.+++ .+.++-++++.+........ T Consensus 423 --------------~i~~~f~~~~p~n~~e~~d~~~kl~G~iS~------et~l~~l~~v~d~~~El~ri~~E~~~~~~~ 482 (511) T protein:vir:96 423 --------------TVRYVYNRNLPKSLIEELKAYIDSGGKISQ------TTLMSLFSFFQDPELEVKKIEEDEKESIKK 482 (511) T ss_pred --------------cceEEeCCCCCcCHHHHHHHHHHHhccCCh------HHHHHhCCCCCCHHHHHHHHHHHHHHHHHH Confidence 111222222233233334444444333332 223333332 22222223232211100000 Q ss_pred chhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 614 KDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQLE 665 (711) Q Consensus 614 ~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~~~~k~qae~~~aqae~~~~q~~ 665 (711) .... ..........+...-+.+-...+.+ T Consensus 483 ~~~~-----------------------~~~~~~~~~~~~~~~~~~~~~~e~~ 511 (511) T protein:vir:96 483 AQKG-----------------------IYKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred Hhhc-----------------------cccCCCCCCCCCCCCCccCcccccC Confidence 0000 0000000000000000000000000 No 50 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=99.83 E-value=6.9e-20 Score=125.63 Aligned_cols=484 Identities=12% Similarity=0.056 Sum_probs=235.7 Q ss_pred CCcCC-------CCCCCCcccCCCcccCCc------CcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCH Q lcl|Aclame:pro 1 MAKKQ-------KKSRVEQLYAKKAKVYAK------NNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPS 67 (711) Q Consensus 1 ~~~~~-------~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~ 67 (711) |-|-- ..-++..-..+++..-.. +...+.+.+..+.+.+. ...+....+-.+||.|.|..- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~------~~~~~r~~~l~~Yy~g~~~il 74 (511) T protein:vir:93 1 MLKVNEFETDTDLRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHM------DYQRPRLKVLSDYYEGKTKNL 74 (511) T ss_pred CccccchhhhhhhhhhhhhhhhhhhCCcccccchhhhhhccHHHHHHHHHHHH------HhhHHHHHHHHHHhcccCccc Confidence 22210 111111111111111000 01111222333333221 123445566789999987532 Q ss_pred HHHHHHHHhCC--CceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHH Q lcl|Aclame:pro 68 QVRTERELEQR--PCLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGL 145 (711) Q Consensus 68 ~~~~~~~~~g~--p~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~ 145 (711) .........++ -.+++|..+.+|+..+|+.-.+.+.+.. ++.+ .... T Consensus 75 ~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~---------------------------~d~~----~~~~ 123 (511) T protein:vir:93 75 VELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQD---------------------------DDKD----VLEV 123 (511) T ss_pred cccCcCcccccCcceeecchHHHHHHHHhhhhcccCeeecc---------------------------CChH----HHHH Confidence 11111122222 2478899999999999988666554421 2222 2456 Q ss_pred HHHHHhhcCHHHHHHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCcccee--eCCCccccCccccceeeeeec Q lcl|Aclame:pro 146 IKNIEYNCDAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDT 223 (711) Q Consensus 146 ~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~ 223 (711) +..+++.|+++.....+..+++++|.+|..|+.+. ++++++..+ +|.++| ||+... .-..++++.| T Consensus 124 l~~~~~~n~~~~~~~~~~~~~~~~G~ay~~vy~de------~~~~~i~~~-~p~~~~~vydd~~~-----~~~~~~vr~~ 191 (511) T protein:vir:93 124 IEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQ------DDETRLYKS-DAMSTFVIYDNTIE-----RNSIAGVRYL 191 (511) T ss_pred HHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEeCC------CCceEEEEE-ccceeEEEEcCCCC-----CceEEEEEEE Confidence 77778889999999999999999999998877642 256777776 787776 554332 1223344443 Q ss_pred CCHHHHHHhcCCcccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhh Q lcl|Aclame:pro 224 MSKEKFKALYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVR 303 (711) Q Consensus 224 ~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 303 (711) ..... + ....+.+..+++|....... +....+... T Consensus 192 ~~~~~------~----------------~~~~~~~~~~~iyt~~~i~~--~~~~~~~~~--------------------- 226 (511) T protein:vir:93 192 RTKPI------D----------------KTDEDEVFTVDLFTSHGVYR--YLTSRTNGL--------------------- 226 (511) T ss_pred Eeeec------c----------------ccccceEEEEEEEeCCcEEE--EEecCCCcc--------------------- Confidence 21100 0 01123344455554432211 000000000 Q ss_pred hcccceEEEEEEEEecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCC Q lcl|Aclame:pro 304 TRKVKTFKTYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKA 383 (711) Q Consensus 304 ~~~~~~~~v~~~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~ 383 (711) ........+.|.+.+.+|+++|. +...+.|.+..+++.++.+|..+|.+.+.+...+++ T Consensus 227 --------------~~~~~~~~~~~~~~g~vPvv~~~-------nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~ 285 (511) T protein:vir:93 227 --------------KLTPRENGFESHSFERMPITEFS-------NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDA 285 (511) T ss_pred --------------ccccccccccccCCCccceEEec-------CCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhhCc Confidence 00001112344555667766542 233467999999999999999999999999888877 Q ss_pred ceEecccccCChHHHHhhcccCCCceEEecc---------cccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHH Q lcl|Aclame:pro 384 PFIGSEGNVEGREDEWEQANTKNFSLLTYIP---------QYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDA 454 (711) Q Consensus 384 ~~~~~~~av~~~~~~~~~~~~~~~~~i~~~~---------~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~ 454 (711) .+++.-....+..+. . ..+.+.++...+ +...++.+.++..+.-..++...++.....|-.+|++++. T Consensus 286 ~lv~~G~~~~~~~~~-~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~P~~ 362 (511) T protein:vir:93 286 MLLIKGNLNLDPVEV-R--KQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNM 362 (511) T ss_pred ceeeecCcccCchhh-c--ccccccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccc Confidence 766643232222221 1 112223333222 1122344566665556677888889999999999999988 Q ss_pred HhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhc Q lcl|Aclame:pro 455 SLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEES 534 (711) Q Consensus 455 ~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~ 534 (711) +.+..+++.||.|+..+..............|..++++++++++.++....... ...++. T Consensus 363 ~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~l~~~~~~~---------~~~d~~----------- 422 (511) T protein:vir:93 363 KDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTWSID---------ANKDFN----------- 422 (511) T ss_pred ccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcc---------cccccc----------- Confidence 877666678999998887777777777777777777777777766543322110 011110 Q ss_pred cceeeeehhhheeeeEEeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCC-cchHHHHHHHHhhhcchhhc Q lcl|Aclame:pro 535 GEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDW-PGADVIAERLKKIVPPNVLS 613 (711) Q Consensus 535 g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~-~~~~e~~~~l~~~~~~~~~~ 613 (711) ++.+.=.+..+....+..+.+..+...++. ..+++.+++ .+.++-.+++.+........ T Consensus 423 --------------~i~~~f~~~~p~n~~e~~~~~~kl~g~iS~------et~~~~l~~v~d~~~E~~ri~~E~~~~~~~ 482 (511) T protein:vir:93 423 --------------TVRYVYNRNLPKSLIEELKAYIDSGGKISQ------TTLMSLFSFFQDPELEVKKIEEDEKESIKK 482 (511) T ss_pred --------------cceEEeCCCCCCCHHHHHHHHHHHhccCch------HHHHHhCCCCCCHHHHHHHHHHHHHHHHHH Confidence 111111223333333334444444433332 223334332 22222222222211100000 Q ss_pred chhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 614 KDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEADTAQ 654 (711) Q Consensus 614 ~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~~~~k~qae~~~ 654 (711) ........... ..........+-.++.++ T Consensus 483 ~~~~~~~~~~~------------~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:93 483 AQKGIYKDPRD------------INDDEQDDDTKDTVDKKE 511 (511) T ss_pred HhhhcccCCCC------------CCCCCCCCcccccccccC Confidence 00000000000 000000000000000000 No 51 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=99.83 E-value=7.1e-20 Score=125.58 Aligned_cols=463 Identities=11% Similarity=0.033 Sum_probs=229.5 Q ss_pred CCcCCCCCCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHH------HHHH Q lcl|Aclame:pro 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVR------TERE 74 (711) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~------~~~~ 74 (711) |-.+.+.+...+--+..=+........+.+.+.++.+.+. ..+....+..+||.|.| +-.-+ .... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~-------~~~~~~~~~~~Yy~g~~-~i~~r~~~~~~~~~~ 72 (474) T protein:vir:95 1 MFNIIRMPWDKPYGEEVVEQLKPQFETQEEMIIRLIDDHR-------KQLDKITVGQRYYDKDN-DIVKQMKKVDVYGNI 72 (474) T ss_pred CcceeecCCCCchhhHHHHhhhhccCChHHHHHHHHHHHH-------HHHHHHHHHHHHhcccC-chhcccccccccccc Confidence 5554433332221100001111112233345555554433 23444567788999975 11000 0111 Q ss_pred HhCCC--ceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 75 LEQRP--CLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYN 152 (711) Q Consensus 75 ~~g~p--~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~ 152 (711) ..++| .+++|..+.+|+..+++.-.+.+.+. .+|.+..+ .++.+.+ T Consensus 73 ~~~~~~~ki~~n~~~~Ivd~~~~~l~g~p~~~~---------------------------~~d~~~~~----~l~~~~~- 120 (474) T protein:vir:95 73 DYDKPDWRITTNFHQNLVDQKVSYVASKPVTYS---------------------------CEDESVLK----IIHDVLD- 120 (474) T ss_pred ccccccceeccchHHHHHHHHHhhhccCCceec---------------------------cCchHHHH----HHHHHHh- Confidence 22333 46799999999999998766654432 13333333 3444444 Q ss_pred cCHHHHHHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCcccee--eCCCccccCccccceeeeeecCCHHHHH Q lcl|Aclame:pro 153 CDAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDTMSKEKFK 230 (711) Q Consensus 153 ~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~ 230 (711) ++++.....+..++.++|.||..++++. ++++++..+ +|.+++ ||+... .+..++ ++.+...+ T Consensus 121 n~~~~~~~e~~~~~~~~G~~~~~v~~d~------~~~~~i~~~-~p~~~~~v~d~~~~----~~~~~~-i~~~~~~~--- 185 (474) T protein:vir:95 121 TRWDNKLIDILTATSNKGIDWLQVYINE------NGEMKLFRV-PAEQAIPIWVDKER----EELKSF-IRYYKFNN--- 185 (474) T ss_pred ccHHHHHHHHHHHHhhcCcEEEEEEecC------CCceEEEEE-cccceEEEEcCCCC----CceEEE-EEEEEEcC--- Confidence 6799999999999999999998877642 256777776 787777 444221 122222 33221100 Q ss_pred HhcCCcccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceE Q lcl|Aclame:pro 231 ALYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTF 310 (711) Q Consensus 231 ~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 310 (711) ...+++|....... +....+.... . T Consensus 186 ---------------------------~~~~~~y~~~~~~~--~~~~~~~~~~-~------------------------- 210 (474) T protein:vir:95 186 ---------------------------EEKVEFWTDTTVTY--YVLENGGLIP-D------------------------- 210 (474) T ss_pred ---------------------------eeEEEEEeCCeEEE--EEEcCCcccc-c------------------------- Confidence 01123333321111 1111111000 0 Q ss_pred EEEEEEEecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEeccc Q lcl|Aclame:pro 311 KTYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEG 390 (711) Q Consensus 311 ~v~~~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~ 390 (711) .......+.....+.+.+.+|+++|.. ...+.|.+..+++.++.+|..+|.+.+.+.....+.+++... T Consensus 211 ----~~~~~~~~~~~~~~~~~g~iPvv~~~n-------n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~ 279 (474) T protein:vir:95 211 ----YYYGANHIQSHFSNGNWGRVPFIAFKN-------NPEEVSDIWMYKSLIDAIDKRLSDAQNMFDESVELIYILKGY 279 (474) T ss_pred ----cccCcccccccccccCCCccceEeecC-------CCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecC Confidence 000111122233455667788876532 234679999999999999999999999998888888777544 Q ss_pred ccCChHHHHhhcccCCCceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHH Q lcl|Aclame:pro 391 NVEGREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIA 470 (711) Q Consensus 391 av~~~~~~~~~~~~~~~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~ 470 (711) ..++..+... ....+.++...++ +.++++..+.-..++...++.....|-..+++++.+.|..+++.||.|+.. T Consensus 280 ~~~~~~~~~~--~~~~~~~i~~~~~----~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Alk~ 353 (474) T protein:vir:95 280 EGQDLEEFMR--GLKYYKAINVDGD----GGVETIQVEVPVSSTKEYIDLMRAYIMEFGQGVDFQTDKFGSAPSGIALKF 353 (474) T ss_pred Ccccchhhhh--hhhccceeeccCC----CceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHH Confidence 4333333222 2233455655433 235566655566777888899999999999999877776667789999988 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeE Q lcl|Aclame:pro 471 RQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDV 550 (711) Q Consensus 471 ~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv 550 (711) +..............|..+++++.++++++. . . ..++. ++ T Consensus 354 ~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~----g---------~--~~d~~-------------------------~i 393 (474) T protein:vir:95 354 LYGNLDLKANKLKNKATVAIQELIGFIIDFN----N---------L--KMDVK-------------------------DI 393 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----C---------C--Ccccc-------------------------ee Confidence 7766666666666666667766666655442 1 0 00110 11 Q ss_pred EeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcC-CcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHH Q lcl|Aclame:pro 551 VVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMD-WPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTE 629 (711) Q Consensus 551 ~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~-~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~ 629 (711) .+.=.++.+....+..+.+.++ ..++ ...++..++ ..+.++-.+++.+....................+... T Consensus 394 ~v~f~~~~p~d~~e~a~~~~~~-g~iS------~et~i~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~~~~ 466 (474) T protein:vir:95 394 EISFNFNRMMNDAEQSQIIAQS-QYLS------RETLVKSSPLVDDYKAELERIEQEQMEYNKQLPNLDDGGADGAQQQE 466 (474) T ss_pred eEEeccCCCcCHHHHHHHHHhc-CCCc------hHHHHHhCCCCCCHHHHHHHHHHHHHHHHhcccccccccCCCCcCCC Confidence 1111222222222222333321 1111 122333333 2333333333332211100000000000000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 630 PTPEQQVEMAKSQADMAQAEAD 651 (711) Q Consensus 630 ~~~~~q~~~~~~q~~~~k~qae 651 (711) . ..-. +.+ T Consensus 467 ~------------~~~~--~~~ 474 (474) T protein:vir:95 467 R------------SNDK--ESE 474 (474) T ss_pred C------------CccC--CCC Confidence 0 0000 000 No 52 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=99.83 E-value=2.8e-19 Score=122.33 Aligned_cols=424 Identities=12% Similarity=0.054 Sum_probs=224.1 Q ss_pred hHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCC--ceEehhhHHHHHHHhhhhhhcccce Q lcl|Aclame:pro 27 DDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRP--CLVNNVLPTFVDQVLGDQRQNRPAI 104 (711) Q Consensus 27 ~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p--~~~~N~i~~~v~~i~g~~~~~r~~~ 104 (711) =+.+.|.++.+.+. ....+..+-.+||.|+| +-.....+..+++ .+++|..+.+|+..+|+.-.+.+.+ T Consensus 1 l~~~~l~~~i~~~~-------~~~~r~~~l~~yy~g~~--~il~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~ 71 (429) T protein:vir:98 1 MTKDLLSELIQKHR-------SFNLSYSAYKQLYEGDH--AILQQKQKEQYKPDNRLVVNFAKYIVDTFNGYFIGVPVQT 71 (429) T ss_pred CCHHHHHHHHHHHH-------HHHHHHHHHHHHhcccc--ccccccccccCCCcceeecchHHHHHHHHhhhhcccCcee Confidence 12223444444332 23455556788999986 1111122233333 5789999999999999876654333 Q ss_pred eEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCccEEEEEEeeccCC Q lcl|Aclame:pro 105 KVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESGMGYLRVRSDYLADD 184 (711) Q Consensus 105 ~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~ 184 (711) . + ++. .....+..+++.|+++.....+..+++++|.||..++.+. T Consensus 72 ~--~-------------------------~~~----~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~---- 116 (429) T protein:vir:98 72 S--H-------------------------ENK----QVSNYLELLDGYNDQDDNNAELSKICSIYGHGYELVFNDE---- 116 (429) T ss_pred e--c-------------------------CCh----HHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEecC---- Confidence 2 1 111 2334567777889999999999999999999998876642 Q ss_pred CCCCcceEEEecCcccee--eCCCccccCccccceeeeeecCCHHHHHHhcCCcccchhhcccccccccCCCCCeEEEEE Q lcl|Aclame:pro 185 SFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSVADYDTWFTEKSVRVSE 262 (711) Q Consensus 185 ~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~E 262 (711) +|.+.+..+ +|.+++ ||.... .-...+++.+.+ .+.+...+ T Consensus 117 --~g~~~~~~~-~p~~~~~v~dd~~~-----~~~~~~i~~~~~-----------------------------~~~~~~~~ 159 (429) T protein:vir:98 117 --NAEAGITYL-TPLEAFIVYDDSIR-----QKPLFAVRYFYN-----------------------------KGGVLEGS 159 (429) T ss_pred --CCcEEEEEE-cccceEEEEeCCCC-----CceEEEEEEEEe-----------------------------cCceEEEE Confidence 256777766 677664 442211 111222222211 12233344 Q ss_pred eeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEEecCceeccCccCCCCccceEEEEee Q lcl|Aclame:pro 263 YFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGPVEIPSTTIPVIPVWGK 342 (711) Q Consensus 263 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~le~~~p~~~~~~P~vp~~~~ 342 (711) +|..... ..+.. ..+...+.+..|.+.+.+|+|+|. T Consensus 160 ~~~~~~~----~~~~~--------------------------------------~~~~~~~~~~~~~~~g~vPvv~~~-- 195 (429) T protein:vir:98 160 YSDASNI----TYFKD--------------------------------------GEKGIEIGESEPHPFDGVPMIEYV-- 195 (429) T ss_pred EEeCceE----EEEEe--------------------------------------cCCceEecccccccCCccceEEec-- Confidence 4432211 00000 011112234455666777777542 Q ss_pred eeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHHHHhhcccCCCceEEecccccCcCCc Q lcl|Aclame:pro 343 SLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNFSLLTYIPQYQGDPGP 422 (711) Q Consensus 343 ~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~~~~~~~~~~i~~~~~~~~~~~i 422 (711) +...|.|.+..+++.++.+|...|.+.+.+.....+.+++.-...+ ++.... ...+.++.+..+....+.+ T Consensus 196 -----n~~~g~sd~e~v~~liD~~d~~~s~~~~~~~~~~~p~~~i~g~~~~--~~~~~~--~~~~~~~~~~~~~~~~~~~ 266 (429) T protein:vir:98 196 -----ENEERQSLLASVVTLINAFNKAISEKANDVEYFADAYLKILGAELD--DETLKS--LRDTRIINLKDTDAQQLTV 266 (429) T ss_pred -----CCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCCC--cchhhh--HhhCceeeccCCCCCCcce Confidence 2345789999999999999999999999999988888776532222 222222 2334566665433233345 Q ss_pred cccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 423 RRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIP 502 (711) Q Consensus 423 ~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~ 502 (711) .++..+.-...+...++...+.|-..|++++.+.+.. ++.||.|+..+-.............|..+++++.++++.++. T Consensus 267 ~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~ 345 (429) T protein:vir:98 267 EFLQKPDADATQEHLLDRLENLIFRTAMVANISDESF-GTASGIALRYRLQAMDNLAKTKERKFMSGMNRRYKLIASYPT 345 (429) T ss_pred eEEeecCCHHHHHHHHHHHHHHHHHHhCccccCcccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 6666555566677788999999999999887666544 346999988876665566666666666666666665555422 Q ss_pred hhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEeecccChHHHHHHHHHHHHHHHhhcchhHHH Q lcl|Aclame:pro 503 HIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAV 582 (711) Q Consensus 503 ~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~ 582 (711) +.+. ..++ .++.+.=.+..+....+..+.+..+...++. T Consensus 346 ----------~~~~--~~d~-------------------------~~i~v~f~~~~p~~~~~~a~~~~kl~g~is~---- 384 (429) T protein:vir:98 346 ----------SKIG--PKDW-------------------------IGIKYKFTRNLPANLLEESQIAGNLAGIVSE---- 384 (429) T ss_pred ----------cCCC--cccc-------------------------ccceEEeCCCCCcCHHHHHHHHHHHhccCch---- Confidence 1111 1111 0122222333333333444455554433332 Q ss_pred HHHHHHHhcCC-cchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 583 MADLIAQNMDW-PGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEAD 651 (711) Q Consensus 583 ~~~~~~~~~~~-~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~~~~k~qae 651 (711) ..+++++++ ++.++-.+++++..... . +.+......+-...-++ T Consensus 385 --et~~~~l~~v~d~~~E~~ri~~E~~~~--------------------~---~~~~~~~~~~~~~~~~~ 429 (429) T protein:vir:98 385 --ETQVGVLSIVENPQKEIERKNSDKSTL--------------------I---SRQAGGLNGQNTTTILE 429 (429) T ss_pred --HHHHHhCCCCCCHHHHHHHHHHHHHHH--------------------H---HHHHhhhcCCCCCCCCC Confidence 223344432 23222222222211100 0 00000000000000000 No 53 >protein:vir:99672 Length: 532 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249587;genbank:gi:68299738;genbank:GeneID:3799987 Probab=99.83 E-value=2.8e-18 Score=116.84 Aligned_cols=516 Identities=12% Similarity=0.053 Sum_probs=256.0 Q ss_pred CCcCCCCCCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCC----CCCHHHHHHHHHh Q lcl|Aclame:pro 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGE----QWPSQVRTERELE 76 (711) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~----Qw~~~~~~~~~~~ 76 (711) ||++++.+- . -+.+..+|+...+....|...|.++.+|.... .+..... T Consensus 1 m~~~~~~~~--------------~-------~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~------ 53 (532) T protein:vir:99 1 MAEVEKTGF--------------A-------ADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGST------ 53 (532) T ss_pred Ccchhhccc--------------c-------HHHHHHHHHHHHHHhhHHHHHHHHHHHHhhhcccCCCCCcchh------ Confidence 776653211 0 12355666666666778888899999998432 2221110 Q ss_pred CCCceEehhhHHHHHHHhhhhhh-----cccceeEecchhhhhhhhhcccccccccccCCCchh-HHHHH---HHHHHHH Q lcl|Aclame:pro 77 QRPCLVNNVLPTFVDQVLGDQRQ-----NRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKND-YELAE---VFTGLIK 147 (711) Q Consensus 77 g~p~~~~N~i~~~v~~i~g~~~~-----~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d-~~~Ae---~l~~~~~ 147 (711) ....+.-..-...++.+.+..-. +++=.++.+.+.... . ...++++ .+..+ .++..+. T Consensus 54 ~~~~~~dst~~~a~~~LAa~L~~~ltpp~~~WF~l~~~d~~l~----------~---~~~~~~~~~~v~~~L~~ve~~~~ 120 (532) T protein:vir:99 54 SYTTPWQSIGARGLNNLASKLMLALFPVGSSFFKLNVSELEVK----------Q---SITSPEELTEIATGLAMVERICM 120 (532) T ss_pred hccccccchHHHHHHHHHHHHHHhhcCCCCccccccCCHHHHh----------c---cCCChhhHHHHHHHHHHHHHHHH Confidence 00112222222333333222111 122223322210000 0 0000000 11222 2344555 Q ss_pred HHHhhcCHHHHHHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCccceeeCCCccccCccccceeeeeecCCHH Q lcl|Aclame:pro 148 NIEYNCDAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKE 227 (711) Q Consensus 148 ~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~ 227 (711) .....|+|......++.+.+..|.|+.-+..+... .+....++.+ +..++++..++.- ...-++++..++.+ T Consensus 121 ~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~---~~~~~~f~~~-pl~~y~v~~d~~G----~v~~ivrr~~~~~~ 192 (532) T protein:vir:99 121 NYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQV---EGQSNAPKLY-KLHNFVVERDAYD----NVLQIVTEDKIARA 192 (532) T ss_pred HHHHhcCcHHHHHHHHHHHHhHCcEeEEecccccc---cCcccceEEE-EcCeEEEeeCCCC----CeeeEeeeeeecHH Confidence 66678999999999999999999998754433211 1233445555 5678888765531 23345677778877 Q ss_pred HHHHhcCCcccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhccc Q lcl|Aclame:pro 228 KFKALYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKV 307 (711) Q Consensus 228 e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~ 307 (711) .+-+.++..... ....+.....|.|+++.+++... T Consensus 193 ~l~e~~~~~~~~--------~~~~~~p~~~v~v~~~v~~~~~~------------------------------------- 227 (532) T protein:vir:99 193 ALPEDVRKSLED--------AQGDQNPSEEVTIYTHVYRDPEA------------------------------------- 227 (532) T ss_pred hcChHHHHHhhc--------cccccCCCcceEEEEEEEecCCC------------------------------------- Confidence 774444332111 00111223567777766554321 Q ss_pred ceEEEEEEEEecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEe Q lcl|Aclame:pro 308 KTFKTYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIG 387 (711) Q Consensus 308 ~~~~v~~~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~ 387 (711) ..+.+|.. ..|..+....+-|+...+||+++. +..+++..||.|.+....+-.+.+|.+...++.....+.++.+++ T Consensus 228 ~~~~~~~~-~~g~~~~~~~~~~~~~e~P~~~~R--w~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv 304 (532) T protein:vir:99 228 MVFRSYQE-IDGEIVAGTEGEYPLDSCPWIPVR--LIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFV 304 (532) T ss_pred CeeEEEEe-ecCceecccccccccccCCceeee--eeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcee Confidence 00111211 234333333455677789999764 445688999999999999999999999999999999999999999 Q ss_pred cccccCChHHHHhhcccCCCceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHH Q lcl|Aclame:pro 388 SEGNVEGREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRA 467 (711) Q Consensus 388 ~~~av~~~~~~~~~~~~~~~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~a 467 (711) +++.+.+..... ...+|.++. |..++..+-......-.+.....++...+.|.... ..+...-.++.+.|+.- T Consensus 305 ~p~g~~~~~~~~---~~~~g~~v~---g~~~~i~~~~~~~~~~~~~~~~~i~~~~~rI~~af-~~~~~~~~d~~r~TAtE 377 (532) T protein:vir:99 305 NPNGVTQIRRVA---KANTGDFVA---GRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAF-MLNSAVQRGGDRVTAEE 377 (532) T ss_pred ccccccchhhhc---cCCCcceec---CCcccceeeecccccchhHHHHHHHHHHHHHHHHH-hhhhcccCCCCcccHHH Confidence 988887766432 234555543 32222223233333445556677777777777654 23322235566789999 Q ss_pred HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhhe Q lcl|Aclame:pro 468 IIARQRQGDRGSFAFIDNLTK-SIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQ 546 (711) Q Consensus 468 i~~~~~~~~~~~~~~~dn~~~-~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~ 546 (711) |..+.+-....+..++.+|.. +...+.+..+.++.+.---+ .+-.. + . T Consensus 378 V~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lP---------------~~p~~--------------~--~ 426 (532) T protein:vir:99 378 IRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIP---------------NLPKE--------------A--V 426 (532) T ss_pred HHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCC---------------CCChh--------------h--c Confidence 999999989999988888754 66666666666665421000 00000 0 0 Q ss_pred eeeEEeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCCcchHHHHHHHHhhhcc--hhhcchhhhhhhhhH Q lcl|Aclame:pro 547 KYDVVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDWPGADVIAERLKKIVPP--NVLSKDEREAIEEDM 624 (711) Q Consensus 547 ~~dv~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~--~~~~~~~~~~~~~~~ 624 (711) ..+++.-..+- -|.+..+.+..+++.+.+ +.. ..++.-+.+++.+.+....+- ......+.+..+..+ T Consensus 427 ~~~iv~~is~L---araq~~~~l~~~~~~laq---~~p----~~~d~id~d~~~~~~a~~~GV~~~~i~r~~ee~~~~~~ 496 (532) T protein:vir:99 427 EPAIATGLEAL---GRGHDLNKLNVFIDYMIK---LAG----LQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMA 496 (532) T ss_pred ccceeecchHH---HHHHHHHHHHHHHHHHHh---hcc----hhhhhCCHHHHHHHHHHHhCCChhhccCCHHHHHHHHH Confidence 01222211111 123333344444333222 111 123445667777777666543 222222222111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 625 PEQTEPTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQLETEEAQKQLAMIED 678 (711) Q Consensus 625 ~~~q~~~~~~q~~~~~~q~~~~k~qae~~~aqae~~~~q~~~~~~q~q~~~~~~ 678 (711) ++++++. +++ ...++.+.+ .++.+...+++.+.-.+ T Consensus 497 q~~~~~~--~~~-----a~~~~~~~~-----------~~~~~~~~~~~~~~~~~ 532 (532) T protein:vir:99 497 EASTAAG--MVT-----AGQQMGAAG-----------GQAAAAMMQQQAGMPTQ 532 (532) T ss_pred HHHHHHH--HHH-----HHHHHHHHH-----------HHhcchhHHhhcCCCCC Confidence 1111100 000 000000000 00000000111100000 No 54 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=99.83 E-value=1.6e-19 Score=123.56 Aligned_cols=466 Identities=12% Similarity=0.069 Sum_probs=230.0 Q ss_pred CCcCCCCCCCCcccCCCc--ccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHH-----HHHH Q lcl|Aclame:pro 1 MAKKQKKSRVEQLYAKKA--KVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQV-----RTER 73 (711) Q Consensus 1 ~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~-----~~~~ 73 (711) |+-=- -|+++.+..+- .-.+... ...++|.++.. ..........+..+||.|.|=.-.. .... T Consensus 1 ~~~~~--~~~~~~~~~e~~~~~~~~~~-~~~~~i~~~i~-------~~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~ 70 (478) T protein:vir:10 1 MISIN--WPWDKPYHEQVVEQIKPKYE-TQEEMILRLVR-------EHKENIDNITMGERYYNHHPDILDAPPKRDVNGD 70 (478) T ss_pred Ccccc--CCCCchhHHHHHHHHhhccC-CcHHHHHHHHH-------HHHHHHHHHHHHHHHhcCCCchhccccccccccc Confidence 54321 22222111110 0011111 12233444333 2334456677889999997511000 0001 Q ss_pred HHhCCC--ceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 74 ELEQRP--CLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEY 151 (711) Q Consensus 74 ~~~g~p--~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~ 151 (711) ...++| .+.+|..+.+|+..+|+.-.+.+.+.. ++.+..+.+..+ ++ T Consensus 71 ~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~---------------------------~~d~~~~~l~~~----~~ 119 (478) T protein:vir:10 71 YDETKPDWRMYTNYHQNLVDQKVAYAVANPVTFGV---------------------------DNDKALKQIQHT----LN 119 (478) T ss_pred cccccccceeccchHHHHHHHHHhhhccCCeeeec---------------------------CChHHHHHHHHH----Hh Confidence 112222 378899999999999987665544421 233344444443 33 Q ss_pred hcCHHHHHHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCcccee--eCCCccccCccccceeeeeecCCHHHH Q lcl|Aclame:pro 152 NCDAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDTMSKEKF 229 (711) Q Consensus 152 ~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~ 229 (711) +++......+.++++++|.||..++.+. ++++++..+ +|.+++ ||+... .+..+ +++.|-.. T Consensus 120 -n~~~~~~~~~~~~~~~~G~~~~~~~~d~------~g~~~~~~~-~p~~~~~i~d~~~~----~~~~~-~v~~~~~~--- 183 (478) T protein:vir:10 120 -HKWDDKLVDILTAASNKGIEWVQPYVDE------EGEFKTFRV-PAEQAVPIWTNKER----DELQA-FIRVYELD--- 183 (478) T ss_pred -cCHHHHHHHHHHHHHhcCeEEEEEEecC------CCeeEEEEE-cccceEEEEcCCCC----CceEE-EEEEEEec--- Confidence 6899999999999999999998887653 256777766 777776 454321 12222 23332100 Q ss_pred HHhcCCcccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccce Q lcl|Aclame:pro 230 KALYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKT 309 (711) Q Consensus 230 ~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~ 309 (711) . ...+++|...... .+....+......... T Consensus 184 -------~--------------------~~~~~~y~~~~i~--~~~~~~~~~~~~~~~~--------------------- 213 (478) T protein:vir:10 184 -------G--------------------AERVEYWTKDDVT--YYELKEGQLIPDFYRS--------------------- 213 (478) T ss_pred -------C--------------------ceEEEEEeCCeEE--EEEEcCCeeecccccc--------------------- Confidence 0 0112333221111 0111111110000000 Q ss_pred EEEEEEEEecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecc Q lcl|Aclame:pro 310 FKTYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSE 389 (711) Q Consensus 310 ~~v~~~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~ 389 (711) ..-.....+....|.+.+.+|+++|. +...|.|.+..++++++.+|...|.+...+...+.+.+++.- T Consensus 214 -----~~~~~~~~~~~~~~~~~~~vPvv~~~-------n~~~g~sd~~~v~~liDa~~~~~S~~~~~~~~~~~p~~~~~g 281 (478) T protein:vir:10 214 -----DDHIQPHYYQGNKLMSWGRVPFIPFK-------NNPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILKG 281 (478) T ss_pred -----ccccccceecccccccCCccceEEec-------cCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeec Confidence 00001111223456667778877652 245578999999999999999999999999988888766543 Q ss_pred cccCChHHHHhhcccCCCceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHH Q lcl|Aclame:pro 390 GNVEGREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAII 469 (711) Q Consensus 390 ~av~~~~~~~~~~~~~~~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~ 469 (711) ...++..+... ....+.++.+.... ++.+.++..+.-.......++.....|-..|++.+.+.+..+++.||.|+. T Consensus 282 ~~~~~~~~~~~--~~~~~~~~~~~~~~--~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~ 357 (478) T protein:vir:10 282 YEGEDMKDFMH--NLKYYKAISVAGES--GSGVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKFGNSPSGIALK 357 (478) T ss_pred CCccccchhhh--hhhhcceEEecCCC--CCcceEEeecCChHHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHH Confidence 22222222211 23344556554322 234566665555677788899999999999999988877666778999998 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeee Q lcl|Aclame:pro 470 ARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYD 549 (711) Q Consensus 470 ~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~d 549 (711) .+..............|..+++++.++++.+ +. . ..++. + T Consensus 358 ~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~----~g---------~--~~~~~-------------------------~ 397 (478) T protein:vir:10 358 FMYSNLDLKANKLKNKTLTALQELLQYIIDF----YR---------L--DVKVQ-------------------------D 397 (478) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----hC---------C--Ccccc-------------------------c Confidence 8766666666666666666666666555543 21 0 01110 1 Q ss_pred EEeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCC-cchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHH Q lcl|Aclame:pro 550 VVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDW-PGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQT 628 (711) Q Consensus 550 v~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~-~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q 628 (711) +.+.=.+..+....+..+.+..+...++. ..+++.+++ ...++-.+++++......+........... T Consensus 398 i~i~f~~~~p~d~~e~a~~~~kl~g~iS~------et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~----- 466 (478) T protein:vir:10 398 IEITFNFNVMVNELENSQIAMNSTGLLSK------ETILSNHAWVEDPVAEMERIEQENIELNQQLPDIEEGLNG----- 466 (478) T ss_pred ceEEecCCCCCCHHHHHHHHHHHhCCCCh------HHHHHhCCCCCCHHHHHHHHHHHHHHHHhhccccccccCC----- Confidence 11111222222223334444444333322 233444443 333333444432221111000000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 629 EPTPEQQVEMAKSQADMAQAEAD 651 (711) Q Consensus 629 ~~~~~~q~~~~~~q~~~~k~qae 651 (711) ..+.+..-.+.| T Consensus 467 -----------~~~~~~~~~~~~ 478 (478) T protein:vir:10 467 -----------EQQRQSENNQPE 478 (478) T ss_pred -----------CCCCCCCCCCCC Confidence 000000000000 No 55 >protein:vir:100039 Length: 522 # NCBI annotation: T7-like head-to-tail connector # Family: family:all:481 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214201;genbank:gi:61806424;genbank:GeneID:3294719 Probab=99.83 E-value=1.3e-18 Score=118.60 Aligned_cols=506 Identities=11% Similarity=0.073 Sum_probs=251.8 Q ss_pred HHHHHHHHHHHhhchHHHHHHHHHHHHhC---CCCCCHHHHHHHHHhCCCceEehhhHHHHHHHhhhhhh-----cccce Q lcl|Aclame:pro 33 ATARERARDGATYWKDNWEAAEDDLKFLG---GEQWPSQVRTERELEQRPCLVNNVLPTFVDQVLGDQRQ-----NRPAI 104 (711) Q Consensus 33 ~~~~~~~~~~~~~~~~~r~~~~~~~~~y~---G~Qw~~~~~~~~~~~g~p~~~~N~i~~~v~~i~g~~~~-----~r~~~ 104 (711) -+++.+|........+|...|.++.+|.. |.-........ + + ...+.-..-...++...+..-. +++=. T Consensus 1 m~~~~r~~~L~~~R~~~e~~w~e~~~~tlP~~~~~~~~~~~~~-~-~-~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF 77 (522) T protein:vir:10 1 MKARERYNQLTTARQMFLDKAVECSELTLPYLIDDDISSRPNH-K-S-LTVPWQSVGAKCCVTLAAKLMLAVLPPQTSFF 77 (522) T ss_pred CchHHHHHHHHHHhhHHHHHHHHHHHHhhhcccCCCCCCCccc-c-c-ccccccchHHHHHHHHHHHHHHhhcCCCCccc Confidence 44667777777778889999999999984 22111111000 0 0 0112222222233333222111 12222 Q ss_pred eEecchhhhhhhhhcccccccccccCCCchh----HHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCccEEEEEEee Q lcl|Aclame:pro 105 KVSSTEVTRVPDAESGEDTTLKISNVAGKND----YELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESGMGYLRVRSDY 180 (711) Q Consensus 105 ~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d----~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d~ 180 (711) ++.+.+.. ... ..+++. ...-+.++..+......|+|......++.+.+..|+|+. +. T Consensus 78 ~l~~~d~~-l~~-------------~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l--y~-- 139 (522) T protein:vir:10 78 KLQVRDDK-LGE-------------ELDPQIRSELDLSFSKMERMIMDYIAASNDRVAVHQALKHLIVGGNALI--FM-- 139 (522) T ss_pred cccCChHH-Hhh-------------hcChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCceeE--EE-- Confidence 33332110 000 001111 112233555666667789999999999999999999985 22 Q ss_pred ccCCCCCCcceEEEecCccceeeCCCccccCccccceeeeeecCCHHHHHHhcCCcccchhhcccccccccCCCCCeEEE Q lcl|Aclame:pro 181 LADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSVADYDTWFTEKSVRV 260 (711) Q Consensus 181 ~~~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v 260 (711) +++++ +.+ +..++++..++.- ...-++++.+|+...+.+.||........... ....+.+.| T Consensus 140 -~~~~~------~~~-pl~~y~v~~d~~G----~vd~i~r~~~~t~~ql~~~fg~~~~~~~~~~~------~~~~~~v~v 201 (522) T protein:vir:10 140 -GKDGL------KTF-PLTRYVINRDGDG----NVLEIVTKELISRKVLDIELPEPKPNTGIDES------STTNDDVTI 201 (522) T ss_pred -cCCCc------eEE-EcceEEEeeCCCC----CeeEEEeeeeccHHHHHHhcchhccchhhhcc------cCCCCceEE Confidence 23432 233 5567888765532 34458899999999999999986543322211 122345777 Q ss_pred EEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEE-ecCceeccCccCCCCccceEEE Q lcl|Aclame:pro 261 SEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKI-TGANVLEGPVEIPSTTIPVIPV 339 (711) Q Consensus 261 ~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~-~g~~~le~~~p~~~~~~P~vp~ 339 (711) +++.+.+... +++. |+-. .|..+....+-+++..+||+++ T Consensus 202 ~~~v~p~~~~--------~~~~-------------------------------~~~~~~~~~~~~~~s~~g~~~~P~~~~ 242 (522) T protein:vir:10 202 YTYVKLDKSS--------GRWV-------------------------------WHQEAFDKIIPDSRSTAPKNASPWLPL 242 (522) T ss_pred EEEEEeeccC--------CceE-------------------------------EEEccCCccccccccccccccCCceee Confidence 7766544211 1111 1111 1222222234567788999976 Q ss_pred EeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHHHHhhcccCCCceEEecccccCc Q lcl|Aclame:pro 340 WGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNFSLLTYIPQYQGD 419 (711) Q Consensus 340 ~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~~~~~~~~~~i~~~~~~~~~ 419 (711) .+ ..+++..||.|.+....+-.+.+|.+...++.....+.++.++++++.+.+..+... ..++.++ ++...+ T Consensus 243 Rw--~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~~~~~~~a~~p~~lv~~~~~~~~~~l~~---~~~~~~v---~g~~~~ 314 (522) T protein:vir:10 243 RF--NTVDGEDYGRGRVEEFLGDLKSLDGLSQSLIEGAAAASKVVFLVSPSSTTKPATIAK---AGNGAIV---QGRPED 314 (522) T ss_pred ee--eecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeeccccccccccccC---CCCccee---cCCCcc Confidence 54 446889999999999999999999999999999999999999998888776654321 2223332 333222 Q ss_pred CCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHH Q lcl|Aclame:pro 420 PGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLT-KSIRRVGKILV 498 (711) Q Consensus 420 ~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~-~~~~~~~~~~l 498 (711) ..+-........+.....++...+.|.+..-+ +...++...|+..|..+.+-....+...+.+|. ++...+.+..+ T Consensus 315 v~~~~~~~~~d~~~~~~~i~~~~~ri~~aFl~---~~~~d~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~ 391 (522) T protein:vir:10 315 VAVIQVGKTADFSTAANMATAIEKRLLEAFLV---MNVRNAERVTAEEVRLTQLELEQQLGGIFSLLVIEFLIPYLNRTL 391 (522) T ss_pred ceeecccccccchHHHHHHHHHHHHHHHHHhh---ccCCCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHH Confidence 22222223334445566777777777665321 123445567999999999998889998888885 46666666666 Q ss_pred HHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEeecccChHHHHHHHHHHHHHHHhhcch Q lcl|Aclame:pro 499 EMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAVPS 578 (711) Q Consensus 499 ~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~~p~ 578 (711) .++.+- |. +. . . | . ++ .+.++ -++.+.- -|.+..+.+..+++.+. T Consensus 392 ~il~r~----------g~------lP--~-~--p-~-------~~--~~~~~--v~~is~L-araq~~~~l~~~~~~i~- 436 (522) T protein:vir:10 392 LVLQRS----------NQ------IP--K-L--P-K-------DI--VRPTI--VAGVNAL-GRGQDRESLTAFVGTIA- 436 (522) T ss_pred HHHHhc----------CC------CC--C-C--C-c-------cc--ccccc--ccchhHH-HHHHHHHHHHHHHHHHH- Confidence 655432 10 00 0 0 0 0 00 01111 1122211 23333444444444322 Q ss_pred hHHHHHHHHHHhcCCcchHHHHHHHHhhhcchh--hcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 579 AAAVMADLIAQNMDWPGADVIAERLKKIVPPNV--LSKDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEADTAQAQ 656 (711) Q Consensus 579 ~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~--~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~~~~k~qae~~~aq 656 (711) ..+.+ ...++.-+.+++.+.+....+... .... +++..+.+++.++.+++++.+.+..+ .+. T Consensus 437 --~~~~p--~~~~~~id~d~~~~~~a~~~Gvp~~~ivrt-~eev~~~~q~~q~~~~~~~~~~~a~~-----------~~~ 500 (522) T protein:vir:10 437 --QTLGP--EALMQYLNPLEAIKRLAAAQGIDVLNLVKT-EQQLAEEQQAAQQQAAQQSLVDQAGQ-----------MTG 500 (522) T ss_pred --HhhCc--hhhhhcCCHHHHHHHHHHHhCCChhhhcCC-HHHHHHHHHHHHHHHHHHHHHHHHHH-----------Hhc Confidence 11111 112344466777777766655321 1111 11111111111111000000000000 000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 657 ADMLKAQLETEEAQKQLAMIEDMAQGGDVVYQQVRELVAQALAE 700 (711) Q Consensus 657 ae~~~~q~~~~~~q~q~~~~~~~~q~~~~~~~~~~~~~~~~~~e 700 (711) +-....+ +..+...++. +.. .+ T Consensus 501 ~~~~~~~-~~~~~~~~~~-----~~~----------------~~ 522 (522) T protein:vir:10 501 SPLMDPT-KNPQLMDEEQ-----PPM----------------EE 522 (522) T ss_pred ccccCcc-ccHHHHHHhC-----CCC----------------CC Confidence 0000000 0000000000 000 00 No 56 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=99.83 E-value=4.3e-19 Score=121.30 Aligned_cols=446 Identities=14% Similarity=0.090 Sum_probs=228.7 Q ss_pred CCcCCCCCCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCC- Q lcl|Aclame:pro 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRP- 79 (711) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p- 79 (711) |--+++..=+.| .+..-..+.|..+.+.+. ..+.+..+..+||.|.| +-.....+..+++ T Consensus 1 ~~~~~~~~~~~p----------~d~~~~~~~l~~~i~~~~-------~~~~r~~~~~~yy~g~~--~i~~~~~~~~~~~~ 61 (453) T protein:vir:39 1 MKYKPPKLMTFP----------KDEPITNEVVTKFMEKHR-------LEVARYEYLKNMYRGIM--AIDAEPTKDLWKPD 61 (453) T ss_pred CeecCCcceEcC----------CCCCCCHHHHHHHHHHHH-------HHHHHHHHHHHHhhccC--chhcCCCccccCcc Confidence 433333222222 222333344555554332 23445567788999975 1111111222322 Q ss_pred -ceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHH Q lcl|Aclame:pro 80 -CLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETE 158 (711) Q Consensus 80 -~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~ 158 (711) .+++|..+.+|+..+|+.-.+.+.+. + ++.+. ...+..++..|+++.. T Consensus 62 ~ki~~n~~~~ivd~~~~~l~g~~~~~~--~-------------------------~d~~~----~~~l~~i~~~N~~~~~ 110 (453) T protein:vir:39 62 NRLTVNFTKYIVDTFTGYFNGIPVKKS--H-------------------------SDKET----LSKLQEFDNLNDMEDE 110 (453) T ss_pred ceeecchHHHHHHHHhhhhcccCceec--c-------------------------CChHH----HHHHHHHHHhcChhHH Confidence 47789999999999998765543332 1 12222 3457777888999999 Q ss_pred HHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCcccee--eCCCccccCccccceeeeeecCCHHHHHHhcCCc Q lcl|Aclame:pro 159 YDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDA 236 (711) Q Consensus 159 ~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~ 236 (711) ...+..+++++|.||+.|+.+. ++++++..+ +|.+++ ||+... + ...+++ +.+.. T Consensus 111 ~~~~~~~~~~~G~~~~~v~~d~------~g~~~i~~~-~p~~~~~v~d~~~~--~--~~~~~i-r~~~~----------- 167 (453) T protein:vir:39 111 ESELAKMACIYGRAFELLYQNE------ETQTNVIYN-TPENMFMVYDDTIK--Q--EPLFAV-RYGYD----------- 167 (453) T ss_pred HHHHHHHHhhcCeEEEEEEecC------CCceEEEEE-cccceEEEecCCCC--C--eEEEEE-EEEEe----------- Confidence 9999999999999998887653 256777776 677765 544221 1 122222 22110 Q ss_pred ccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEE Q lcl|Aclame:pro 237 TAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRK 316 (711) Q Consensus 237 ~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~ 316 (711) .+....+++|.... ++.+.. T Consensus 168 ------------------~~~~~~~~~yt~~~------------i~~~~~------------------------------ 187 (453) T protein:vir:39 168 ------------------DDYKLYGEVYTKET------------TYALNG------------------------------ 187 (453) T ss_pred ------------------CCeEEEEEEEeCCe------------EEEEEe------------------------------ Confidence 11223344443321 111100 Q ss_pred EecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChH Q lcl|Aclame:pro 317 ITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGRE 396 (711) Q Consensus 317 ~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~ 396 (711) -.+...+.++.|.+.|.+|+|+|. +...+.|.+..++++++.+|+.+|.+...+...+.+.+++.-..+++ T Consensus 188 ~~~~~~~~~~~~~~~g~vPvv~~~-------n~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~p~~~~~g~~~~~-- 258 (453) T protein:vir:39 188 TMGFYNMTEQAPNPFDDLPVVEFY-------FNEERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLGAAVEE-- 258 (453) T ss_pred cCCceeeecccccCCCceeEEEec-------CCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCCCc-- Confidence 001111223445556677777653 22357799999999999999999999999988888877775333322 Q ss_pred HHHhhcccCCCceEEecccc--cCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHHH Q lcl|Aclame:pro 397 DEWEQANTKNFSLLTYIPQY--QGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQ 474 (711) Q Consensus 397 ~~~~~~~~~~~~~i~~~~~~--~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~ 474 (711) +.... .+.++++.+.... ..++.+.++..+.-...+...++.....|-.+|++.+.+.+..+ +.||.|+..+... T Consensus 259 ~~~~~--~~~~~~~~~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~g-n~Sg~Al~~~~~~ 335 (453) T protein:vir:39 259 EDLKN--IRSNRVINYYGESSEAKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVANISDESFG-SSSGVSLAYKLQA 335 (453) T ss_pred hhhhh--hhhcceeeecCCCCCCCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccc-CChHHHHHHHHHH Confidence 11121 2334444443221 12334566665555667777888888889899998876665443 4699998877666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEeec Q lcl|Aclame:pro 475 GDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTT 554 (711) Q Consensus 475 ~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~ 554 (711) ...........|..++++++++++.+.... |. ..++. +|.+.= T Consensus 336 l~~ka~~~~~~~~~~l~~~~~li~~~~~~~----------~~--~~~~~-------------------------~i~v~f 378 (453) T protein:vir:39 336 MSNLALSFQRKFQSSLNSRYKLYCELSTNV----------SN--KEAWK-------------------------DIEYTF 378 (453) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcc----------CC--ccccc-------------------------cceEEe Confidence 556666666666667776666665543211 11 11110 122222 Q ss_pred ccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCC-cchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHHHHH Q lcl|Aclame:pro 555 GPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDW-PGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPE 633 (711) Q Consensus 555 ~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~-~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~ 633 (711) .+..+....+..+.+..+...++. ..+++.+++ ++.++-.+++.+................... T Consensus 379 ~~~~p~~~~~~a~~~~kl~g~is~------et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~--------- 443 (453) T protein:vir:39 379 TRNEPKDIKEQAETANILMGITSQ------ETALSVISVIPDVQAEMEKIKKEEASTAIFDKDKQPSEKGT--------- 443 (453) T ss_pred CCCCCcCHHHHHHHHHHHhccCCh------HHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhccCCCCCC--------- Confidence 233333334444455555443332 223344432 2222222333222111100000000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 634 QQVEMAKSQADMAQAEADTA 653 (711) Q Consensus 634 ~q~~~~~~q~~~~k~qae~~ 653 (711) +.+ ..+...+ T Consensus 444 --------~~~--~~~~~~e 453 (453) T protein:vir:39 444 --------DTV--VPETNEE 453 (453) T ss_pred --------CCC--CCCcCCC Confidence 000 0000000 No 57 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=99.83 E-value=3.1e-18 Score=116.55 Aligned_cols=463 Identities=12% Similarity=0.050 Sum_probs=232.5 Q ss_pred CCcCCCCCCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCH--HHHHHHHHhCC Q lcl|Aclame:pro 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPS--QVRTERELEQR 78 (711) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~--~~~~~~~~~g~ 78 (711) |+.-.-...... -++--.......-+.+.+..+.+.+. .+.+..+.+-.+||.|.+-.- ......+..++ T Consensus 6 ~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~i~~~i~~~~------~~~~~~~~~~~~yY~g~~~~i~~~~~~~~~~~~~ 77 (481) T protein:vir:10 6 INNINTKFSPLA--NDDFVVSDLAELLKEENLRNFISRHQ------TEQVPRLEMLESYYLNRNTDILAGERRLQKYGDK 77 (481) T ss_pred eehhchhccccc--CceeeeecchhhcCHHHHHHHHHHHH------HHHHHHHHHHHHHhcCCCcccccCcccccccccc Confidence 443221111000 00110001111112223444444322 234455778889999986431 11122233444 Q ss_pred C--ceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHH Q lcl|Aclame:pro 79 P--CLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAE 156 (711) Q Consensus 79 p--~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~ 156 (711) | .+.+|..+.+|+..+|+.-.+.+.+ .+ .+.+. ...+..+++.|+++ T Consensus 78 ~~~ki~~n~~~~ivd~~~~~l~g~~~~~--~~-------------------------~d~~~----~~~l~~~~~~n~~~ 126 (481) T protein:vir:10 78 ADHRAVHNYAKYVSRFIVGYLTGNPITI--TH-------------------------QDNQT----NDKIIELNDLNDAD 126 (481) T ss_pred ccceeecchHHHHHHHHHhhhccCCceE--ec-------------------------CChhH----HHHHHHHHHhcChh Confidence 4 3788999999999999876554433 22 22222 23456677889999 Q ss_pred HHHHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCcccee--eCCCccccCccccceeeeeecCCHHHHHHhcC Q lcl|Aclame:pro 157 TEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDTMSKEKFKALYP 234 (711) Q Consensus 157 ~~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p 234 (711) .....+..+++++|.+|+.++.+. ++++++..+ +|.+++ ||+... ....++++.|...+ T Consensus 127 ~~~~~~~~~~~~~G~~~~~~~~d~------dg~~~i~~~-~p~~~~~v~d~~~~-----~~~~~~i~~~~~~~------- 187 (481) T protein:vir:10 127 EVNSDLALNLSIYGRAYEIVYRDF------EDRDTFKVL-DPKSTFVVYDQTLD-----KKVVAGVRYFEKQD------- 187 (481) T ss_pred HHHHHHHHHHHhcCeEEEEEEeCC------CCeEEEEEE-cccceEEEEcCCCC-----CceEEEEEEEEEee------- Confidence 999999999999999998877642 256777766 787776 544321 11122222221000 Q ss_pred CcccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEE Q lcl|Aclame:pro 235 DATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYW 314 (711) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~ 314 (711) .+...+..+++|..... +. T Consensus 188 ------------------~~~~~~~~~~~y~~~~i------------~~------------------------------- 206 (481) T protein:vir:10 188 ------------------KDKVPVQHVEVYTTDKI------------YY------------------------------- 206 (481) T ss_pred ------------------CCCceEEEEEEEecCeE------------EE------------------------------- Confidence 01123444455533221 10 Q ss_pred EEEecCc-eeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccC Q lcl|Aclame:pro 315 RKITGAN-VLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVE 393 (711) Q Consensus 315 ~~~~g~~-~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~ 393 (711) +...+.. .+.++.|.+.+.+|+|+|. +...|.|.+..+++.++.+|..+|.+...+...+.+.+++...... T Consensus 207 ~~~~~~~~~~~~~~~~~~g~vPvv~~~-------n~~~g~~~~~~v~~lida~~~~~s~~~~~~~~~~~~~~~~~g~~~~ 279 (481) T protein:vir:10 207 IEIKGGTYHRVEEVEHYYNDVPIIEYL-------NDQFKQGDFENVIALIDLYDSAQSDTANYMTDLNDAMLAIIGNVDL 279 (481) T ss_pred EEecCCceeecccccccCCceeEEEee-------cCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCcCC Confidence 1111111 1123345555677777542 2334789999999999999999999999999888888877533222 Q ss_pred ChHHHHhhcccCCCceEEeccc-----ccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHH Q lcl|Aclame:pro 394 GREDEWEQANTKNFSLLTYIPQ-----YQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAI 468 (711) Q Consensus 394 ~~~~~~~~~~~~~~~~i~~~~~-----~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai 468 (711) +.++ .. ..+.+..+..-++ ...++.++++....-...+...++.....|-.+|++++.+.|..+++.||.|+ T Consensus 280 ~~~~-~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al 356 (481) T protein:vir:10 280 DSED-AK--AFRDANMIHLEPGTNANGSEGKAEVKYVYKQYDVAGVEAYKKRLQNDIHKYTNTPDLNDEQFSGVQSGESM 356 (481) T ss_pred Cccc-hh--hhhhccceeccccccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHH Confidence 2221 11 1122222222111 11223455665555556777788999999999999999888877677899998 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheee Q lcl|Aclame:pro 469 IARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKY 548 (711) Q Consensus 469 ~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~ 548 (711) .................|..+++++.++++.++... +. ...++ . T Consensus 357 ~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~----------~~-~~~~~-------------------------~ 400 (481) T protein:vir:10 357 KYKLFGLEQVRAIKERLFKKGLMKRYKLLLNNVNLT----------GL-KQHNY-------------------------A 400 (481) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc----------CC-Ccccc-------------------------c Confidence 777665555555556666666666666655543221 10 00000 1 Q ss_pred eEEeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCC-cchHHHHHHHHhhhcchhhcchhhhhhhhhHHHH Q lcl|Aclame:pro 549 DVVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDW-PGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQ 627 (711) Q Consensus 549 dv~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~-~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 627 (711) ++.+.=.+..+....+..+.+..+...++. ..+++.+++ .+.++-.+++++................... .. T Consensus 401 ~i~v~f~~~~~~~~~~~a~~~~kl~g~is~------et~~~~l~~i~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~-~~ 473 (481) T protein:vir:10 401 ELTITFTPNLPKSMMESINAFNALSGGVSE------STRLSLLDFIDNPKEELEKMQEEEAQREKQADKRGYGEAFE-NH 473 (481) T ss_pred eeeEEeCCCCCcCHHHHHHHHHHHhccCCh------HHHHHhCCCCCCHHHHHHHHHHHHHHHHhhhhhccCCccCC-CC Confidence 222322333333344444455554433332 223344443 2233333333222211111000000000000 00 Q ss_pred HHHHHHHHHHHHHHHH Q lcl|Aclame:pro 628 TEPTPEQQVEMAKSQA 643 (711) Q Consensus 628 q~~~~~~q~~~~~~q~ 643 (711) .. ...-+. T Consensus 474 ~~--------~dd~~g 481 (481) T protein:vir:10 474 LN--------VDDSNG 481 (481) T ss_pred CC--------CCCCCC Confidence 00 000000 No 58 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=99.82 E-value=8.2e-20 Score=125.21 Aligned_cols=484 Identities=12% Similarity=0.053 Sum_probs=237.0 Q ss_pred CCcCC-------CCCCCCcccCCCcccCCc------CcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCH Q lcl|Aclame:pro 1 MAKKQ-------KKSRVEQLYAKKAKVYAK------NNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPS 67 (711) Q Consensus 1 ~~~~~-------~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~ 67 (711) |-|-- ..-++..-..+++..... +...+.+.+..+.+.+. ...+....+..+||.|.|..- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~------~~~~~r~~~l~~Yy~g~~~i~ 74 (511) T protein:vir:99 1 MLKVNEFETDTDLRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHM------DYQRPRLKVLSDYYEGKTKNL 74 (511) T ss_pred CccccchhhhhhhhhhhhhhhhhhhCCccccchhhhhhhccHHHHHHHHHHHH------HhhHHHHHHHHHHhcccCccc Confidence 21110 000111111111111000 01111222333333221 223455667889999987643 Q ss_pred HHHHHHHHhCCC--ceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHH Q lcl|Aclame:pro 68 QVRTERELEQRP--CLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGL 145 (711) Q Consensus 68 ~~~~~~~~~g~p--~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~ 145 (711) .........++| .+++|..+.+|+..+|+.-.+.+.+.. ++.+ .... T Consensus 75 ~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~---------------------------~d~~----~~~~ 123 (511) T protein:vir:99 75 VELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQD---------------------------DDKD----VLEA 123 (511) T ss_pred cccCcccccccCcceeecchHHHHHHHHHhhhcccCceeec---------------------------CchH----HHHH Confidence 222222333333 478899999999999998766555432 2222 2456 Q ss_pred HHHHHhhcCHHHHHHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCcccee--eCCCccccCccccceeeeeec Q lcl|Aclame:pro 146 IKNIEYNCDAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDT 223 (711) Q Consensus 146 ~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~ 223 (711) +..+++.|+++.....+.++++++|.+|..++.+. ++++++..+ +|.++| ||+... .-...+++.| T Consensus 124 l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~de------d~~~~i~~~-~p~~~~~vyd~~~~-----~~~~~~vr~~ 191 (511) T protein:vir:99 124 IEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQ------DDETRLYKS-DAMSTFVIYDNTIE-----RNSIAGVRYL 191 (511) T ss_pred HHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEeCC------CCceEEEEE-ccceeEEEEcCCCC-----CceEEEEEEE Confidence 77778889999999999999999999998887652 256788776 788876 554321 1122333333 Q ss_pred CCHHHHHHhcCCcccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhh Q lcl|Aclame:pro 224 MSKEKFKALYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVR 303 (711) Q Consensus 224 ~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 303 (711) .... .+ ....+.+..+++|........ ....+... T Consensus 192 ~~~~------~~----------------~~~~~~~~~~~vyt~~~i~~~--~~~~~~~~--------------------- 226 (511) T protein:vir:99 192 RTKP------ID----------------KTDEDEVFTVDLFTSHGVYRY--LTSRTNGL--------------------- 226 (511) T ss_pred Eeee------cc----------------cCccceEEEEEEEeCCcEEEE--EecCCccc--------------------- Confidence 2110 00 011234445566644322110 00000000 Q ss_pred hcccceEEEEEEEEecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCC Q lcl|Aclame:pro 304 TRKVKTFKTYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKA 383 (711) Q Consensus 304 ~~~~~~~~v~~~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~ 383 (711) ..........|.+.+.+|+|+|. +...+.|.+..+++.++.+|..+|.+.+.+...+++ T Consensus 227 --------------~~~~~~~~~~~~~~g~vPvv~~~-------nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~ 285 (511) T protein:vir:99 227 --------------KLTPRENGFESHSFERMPITEFS-------NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDA 285 (511) T ss_pred --------------cccccccccccCCCCccceEEec-------CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhch Confidence 00000113345566677777542 233578999999999999999999999999887777 Q ss_pred ceEecccccCChHHHHhhcccCCCceEEecc---------cccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHH Q lcl|Aclame:pro 384 PFIGSEGNVEGREDEWEQANTKNFSLLTYIP---------QYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDA 454 (711) Q Consensus 384 ~~~~~~~av~~~~~~~~~~~~~~~~~i~~~~---------~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~ 454 (711) .+++......+..+. .. .+.+.++...+ +...+..+.++..+.-..++...++...+.|-.+|++++. T Consensus 286 ~lv~~G~~~~~~~~~-~~--~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~ 362 (511) T protein:vir:99 286 MLLIKGNLNLDPVEV-RK--QKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNM 362 (511) T ss_pred hhhhccCcccCchhh-cc--cccccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccc Confidence 666543222222221 11 11222222211 1122334666666555677788889999999999999988 Q ss_pred HhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhc Q lcl|Aclame:pro 455 SLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEES 534 (711) Q Consensus 455 ~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~ 534 (711) +.+.-+++.||.|+..+..............|..++++++++++.++...-... ...++. T Consensus 363 ~~~~~~gn~Sg~Alk~~~~~l~~ka~~k~~~~~~~l~~~~~li~~~~~~~~~~~---------~~~~~~----------- 422 (511) T protein:vir:99 363 KDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSID---------VSKDFN----------- 422 (511) T ss_pred ccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcc---------cccccc----------- Confidence 777655678999999887777777777777777788888877776654422100 000110 Q ss_pred cceeeeehhhheeeeEEeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCC-cchHHHHHHHHhhhcchhhc Q lcl|Aclame:pro 535 GEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDW-PGADVIAERLKKIVPPNVLS 613 (711) Q Consensus 535 g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~-~~~~e~~~~l~~~~~~~~~~ 613 (711) ++.+.=.++.+....+..+.+..+...++. ..+++.+++ .+.++-++++++........ T Consensus 423 --------------~i~i~f~~~~p~n~~e~~~~~~kl~GiiS~------et~l~~l~~v~D~~~E~~ri~~E~~~~~~~ 482 (511) T protein:vir:99 423 --------------TVRYVYNRNLPKSLIEELKAYIDSGGKISQ------TTLMSLFSFFQDPELEVKKIEEDEKESIKK 482 (511) T ss_pred --------------cceEEeCCCCCcCHHHHHHHHHHHhccCCH------HHHHHhCCCCCCHHHHHHHHHHHHHHHHHH Confidence 111111222222233333444444332222 223333332 22333333332221100000 Q ss_pred chhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 614 KDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEADTAQ 654 (711) Q Consensus 614 ~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~~~~k~qae~~~ 654 (711) ........... .......+..+.+.+... T Consensus 483 ~~~~~~~~~~~------------~~~~~~~~~~~~~~d~~e 511 (511) T protein:vir:99 483 AQKNMYQDPRN------------INDDEQDDSTKDSIDKKE 511 (511) T ss_pred HhhcccccCCC------------CCCCCCCCCCcCcccccC Confidence 00000000000 000000000000000000 No 59 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=99.82 E-value=1e-18 Score=119.24 Aligned_cols=456 Identities=12% Similarity=0.063 Sum_probs=228.5 Q ss_pred CCcCCCCCCC--CcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCC--CCHHHHH---HH Q lcl|Aclame:pro 1 MAKKQKKSRV--EQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQ--WPSQVRT---ER 73 (711) Q Consensus 1 ~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Q--w~~~~~~---~~ 73 (711) |++..+.-.. ....-+..+... .-..+.+.++.+.+ ...+....+..+||.|.| +...... .. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~i~~~i~~~-------~~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~ 70 (468) T protein:vir:96 1 MIDIFWPNEKPYHERVVEQIKPQY---ETQEEMILRLITKH-------KENVEDITVGERYYNHQPDVLFNAPKRNVKGE 70 (468) T ss_pred CccccCCcCceeehheeecccccc---cCcHHHHHHHHHHH-------HHHHHHHHHHHHHhcCCCcccccccccccccc Confidence 9888643222 222222222222 22233344444332 233455677899999985 1110000 00 Q ss_pred HHhCCC--ceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 74 ELEQRP--CLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEY 151 (711) Q Consensus 74 ~~~g~p--~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~ 151 (711) ....+| .+++|..+.+|+..+|+.-.+.+.+.. +|.+..+.+..+ ++ T Consensus 71 ~~~~~~~~ki~~n~~~~Iv~~~~~~l~g~p~~~~~---------------------------~d~~~~~~l~~~----~~ 119 (468) T protein:vir:96 71 IDPFKPDWRMYTNYHQNLVDQKVAYAVANPVTYGT---------------------------EDEKSLKTIQEV----LN 119 (468) T ss_pred ccccccccccccchHHHHHHHHHhhhccCCceecc---------------------------CChHHHHHHHHH----Hh Confidence 112222 478999999999999998765544421 233333444333 33 Q ss_pred hcCHHHHHHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCcccee--eCCCccccCccccceeeeeecCCHHHH Q lcl|Aclame:pro 152 NCDAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDTMSKEKF 229 (711) Q Consensus 152 ~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~ 229 (711) +++......+..++.++|.||..|+++. ++++++..+ +|.+++ ||+... .+..+ +++.|...+ T Consensus 120 -n~~~~~~~~~~~~~~~~G~~~~~v~~d~------~~~~~i~~~-~p~~~~~v~~~~~~----~~~~~-~ir~~~~~~-- 184 (468) T protein:vir:96 120 -HKWDDKLVDILTAASNKGVEWIQPYVDE------QGEFKTFRV-PAEQAIPIWTNKER----DELKA-FIRLYELDG-- 184 (468) T ss_pred -cCHHHHHHHHHHHHhhcCeEEEEEEEcC------CCceEEEEE-cccceEEEEcCCCC----CceEE-EEEEEEecC-- Confidence 6788888999999999999998887653 256788777 788876 443221 12222 233331000 Q ss_pred HHhcCCcccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccce Q lcl|Aclame:pro 230 KALYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKT 309 (711) Q Consensus 230 ~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~ 309 (711) ..-+++|+.... .++...++.... .... T Consensus 185 ----------------------------~~~~~~~~~~~~--~~~~~~~~~~~~-~~~~--------------------- 212 (468) T protein:vir:96 185 ----------------------------GERVEYWTANDV--TFYELKDGQLIP-DYYQ--------------------- 212 (468) T ss_pred ----------------------------ceEEEEEeCCeE--EEEEEcCCceee-cccc--------------------- Confidence 001223322211 111111111000 0000 Q ss_pred EEEEEEEEecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecc Q lcl|Aclame:pro 310 FKTYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSE 389 (711) Q Consensus 310 ~~v~~~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~ 389 (711) .........+....|.+.+.+|+|+|. +...|.|.+..++++++.+|...|.+.+.+...+++.+++.. T Consensus 213 ----~~~~~~~~~~~~~~~~~~~~iPvv~~~-------n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g 281 (468) T protein:vir:96 213 ----GEEHVQAHYYVGNKSMSWNRVPFIPFK-------NNPQEVSDLFMYKTIIDAMDKRLSDTQNTFDEATELIYVLKG 281 (468) T ss_pred ----cccccccceeeccccccCCcccEEEec-------CCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeec Confidence 000011122334456777888888652 234478999999999999999999999999888888877754 Q ss_pred cccCChHHHHhhcccCCCceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHH Q lcl|Aclame:pro 390 GNVEGREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAII 469 (711) Q Consensus 390 ~av~~~~~~~~~~~~~~~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~ 469 (711) ...++..+... ..+.+.++.+.... ++.++++....-...+...++.....|-..|++.+.+.+..+++.||.|+. T Consensus 282 ~~~~~~~~~~~--~~~~~~~i~~~~d~--~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk 357 (468) T protein:vir:96 282 YEGEDLEEFMY--NLKYYKAINVDGDG--SGGVDTIQIDVPVQSAKEYLDMLRDYVIEFGQGVDFQQDKFGNSPSGIALK 357 (468) T ss_pred CCccccchhhh--hhhcCceEEecCCC--CCcceEEeecCChHHHHHHHHHHHHHHHHHhCcccccccccccchHHHHHH Confidence 33333323222 22345566665432 234677776666777888899999999999999887776666778999988 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeee Q lcl|Aclame:pro 470 ARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYD 549 (711) Q Consensus 470 ~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~d 549 (711) .+..............|..+++++.++++.+ +. .. .++. + T Consensus 358 ~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~----~g---------~~--~d~~-------------------------~ 397 (468) T protein:vir:96 358 FMYSNLDLKANKLKNKTLTALQELLQYIIDF----YK---------LS--IKVQ-------------------------D 397 (468) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----hC---------CC--cccc-------------------------e Confidence 7766666666666666666666666555543 21 10 0100 1 Q ss_pred EEeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCCc-chHHHHHHHHhhhcchhhcchhhhhhhhhHHHHH Q lcl|Aclame:pro 550 VVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDWP-GADVIAERLKKIVPPNVLSKDEREAIEEDMPEQT 628 (711) Q Consensus 550 v~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~~-~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q 628 (711) +.+.=.++.+....+..+.+... ..+ ....+++.+++- +.++-++++.+........+. T Consensus 398 i~i~f~~~~p~d~~e~a~~~~~~-g~i------S~et~i~~l~~v~D~~~E~~ri~~E~~~~~~~~~------------- 457 (468) T protein:vir:96 398 VEITFNFNVMVNELEQSQIGVNS-QYL------SKETVVTNHPWVDDPVAEMERIDQEELALPSIEE------------- 457 (468) T ss_pred eeEEecCCCCcCHHHHHHHHHhc-CCC------chHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHhh------------- Confidence 11111112121112222222211 111 112223333221 222222222211110000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 629 EPTPEQQVEMAKSQADMAQAEADTAQAQADMLKA 662 (711) Q Consensus 629 ~~~~~~q~~~~~~q~~~~k~qae~~~aqae~~~~ 662 (711) ......... -+ T Consensus 458 --------~~~~~~~~~---------------~~ 468 (468) T protein:vir:96 458 --------GLNGKENNE---------------PT 468 (468) T ss_pred --------ccCCCCCCC---------------CC Confidence 000000000 00 No 60 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=99.82 E-value=3.1e-19 Score=122.02 Aligned_cols=462 Identities=12% Similarity=0.028 Sum_probs=224.5 Q ss_pred CCcCCCCCCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCC--CCHHHH---HHHHH Q lcl|Aclame:pro 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQ--WPSQVR---TEREL 75 (711) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Q--w~~~~~---~~~~~ 75 (711) |--.-+.+. +-.......+.+.+++.++.. ...+.....+....+..+||.|++ |..... ..... T Consensus 1 ~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~i~---~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~ 70 (472) T protein:vir:93 1 MYPSQPTQT-------EIFDAIVRTNNKPETLEEMIV---RYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVD 70 (472) T ss_pred CCCCCCcch-------hhhhceeeecCchhhHHHHHH---HHHHHHHHHHHHHHHHHHHhccccccccccchhhcccccc Confidence 211111110 111111122222222333222 222334456677788899999974 111110 01111 Q ss_pred hCCC--ceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 76 EQRP--CLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNC 153 (711) Q Consensus 76 ~g~p--~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~ 153 (711) ..++ .+++|..+.+|+..+|+.-.+.+.+. .+|.+..+.+ +.++. | T Consensus 71 ~~~~~~ri~~n~~~~ivd~~~~~l~g~~~~~~---------------------------~~d~~~~~~l----~~~~~-n 118 (472) T protein:vir:93 71 PLKPDDRMITNFHANLVDQKVSYIVGKPIAFK---------------------------HTDDEVVKRI----DEVLG-N 118 (472) T ss_pred ccccccccccchHHHHHHHHhhhhcccCeeec---------------------------cCChHHHHHH----HHHHh-c Confidence 2222 46789999999999998765544432 1333344443 33343 6 Q ss_pred CHHHHHHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCcccee--eCCCccccCccccceeeeeecCCHHHHHH Q lcl|Aclame:pro 154 DAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDTMSKEKFKA 231 (711) Q Consensus 154 ~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~ 231 (711) +++.....+..+++++|.||.-|+.+. ++++++..+ +|.+++ ||+... .+..+ +++.|...+. T Consensus 119 ~~~~~~~~~~~~~~~~G~~~~~v~~d~------d~~~~i~~~-~p~~~~~i~d~~~~----~~~~~-~ir~~~~~~~--- 183 (472) T protein:vir:93 119 RFDDKLHSVLTGASNKGIEWLHPYLDE------EGEFKLFRV-PAEQGIPIWTDKEH----EELEA-FIRMYKLENE--- 183 (472) T ss_pred cHHHHHHHHHHHHhhcCeEEEEEEECC------CCceEEEEE-cccceEEEEcCCCC----CceEE-EEEEEEeecc--- Confidence 899999999999999999998776642 256777776 787776 453221 12222 2333321100 Q ss_pred hcCCcccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEE Q lcl|Aclame:pro 232 LYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFK 311 (711) Q Consensus 232 ~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~ 311 (711) . .+++|...... .+.+..+........ T Consensus 184 ------------------------~---~~~~~~~~~~~--~~~~~~~~~~~~~~~------------------------ 210 (472) T protein:vir:93 184 ------------------------T---KVEYWDKVTVN--YYVYENGSLIPDYSN------------------------ 210 (472) T ss_pred ------------------------e---eEEEEecCeEE--EEEEecCeeeecccc------------------------ Confidence 0 12222221111 111111111000000 Q ss_pred EEEEEEecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccc Q lcl|Aclame:pro 312 TYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGN 391 (711) Q Consensus 312 v~~~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~a 391 (711) ..........|.+.+.+|+|+|. +...|.|.+..+++.++.+|..+|.+...+...+.+.+++.-.. T Consensus 211 ------~~~~~~~~~~~~~~~~vPvv~~~-------nn~~g~s~~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~~~g~~ 277 (472) T protein:vir:93 211 ------NLENSKTHFSTGSWGKIPFIPFK-------NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYD 277 (472) T ss_pred ------cccccccccccCCCCCcceEEec-------CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeEeecCC Confidence 00000112234556677777652 23357799999999999999999999999998888877764322 Q ss_pred cCChHHHHhhcccCCCceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHH Q lcl|Aclame:pro 392 VEGREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIAR 471 (711) Q Consensus 392 v~~~~~~~~~~~~~~~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~ 471 (711) ..+..+ +... .+.++++.+..+ +.+.++..+.-..++...++...+.|-..+++++.+.+..+++.||.|+..+ T Consensus 278 ~~~~~~-~~~~-~~~~~~~~~~~~----~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~ 351 (472) T protein:vir:93 278 DQELPE-FKRL-LRYYGAIKVSDN----GGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFL 351 (472) T ss_pred cccchh-hHHH-HhhccccccCCC----CcceeEeecCCHHHHHHHHHHHHHHHHHHhCCCCCCccccccCchHHHHHHH Confidence 222222 2221 233345554432 2355555455567788888999999999999998887776677899998877 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEE Q lcl|Aclame:pro 472 QRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVV 551 (711) Q Consensus 472 ~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~ 551 (711) -.............|..+++++.++++.++..- + ++ .++. T Consensus 352 ~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~----------~-----~~-------------------------~~i~ 391 (472) T protein:vir:93 352 YTNLNLKADKLARKAKVAIQELLWFVFEHFDIK----------G-----EH-------------------------KDVD 391 (472) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC----------c-----cc-------------------------ceee Confidence 666666666666666666666666655543210 0 10 0111 Q ss_pred eecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCC-cchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHH Q lcl|Aclame:pro 552 VTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDW-PGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEP 630 (711) Q Consensus 552 v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~-~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~ 630 (711) +.=.+..+....+..+.+..+...++. ..+++.+++ .+.+...+++++................... T Consensus 392 v~f~~~~p~~~~~~~~~~~k~~giis~------et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~d~------ 459 (472) T protein:vir:93 392 ISFNYNKVANTELQVQTAQQSMGIVSH------ETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADG------ 459 (472) T ss_pred EEeCCCCCCCHHHHHHHHHHHhccCch------HHHHHhCCCCCCHHHHHHHHHHHHHHHHHhccCcCcccCCC------ Confidence 222233333233344444444433332 223444433 3333333333221110000000000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 631 TPEQQVEMAKSQADMAQAEADTAQAQADMLKAQ 663 (711) Q Consensus 631 ~~~~q~~~~~~q~~~~k~qae~~~aqae~~~~q 663 (711) .......-+.+ .| T Consensus 460 ----------~~~~~~~~~~~----------~e 472 (472) T protein:vir:93 460 ----------AQQQERSNNKE----------SE 472 (472) T ss_pred ----------CCCCCCCCccc----------CC Confidence 00000000000 00 No 61 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=99.82 E-value=5.7e-19 Score=120.62 Aligned_cols=466 Identities=12% Similarity=0.069 Sum_probs=229.1 Q ss_pred CCcCCCCCCCCcccCCCc--ccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCC--CH---HHHHHH Q lcl|Aclame:pro 1 MAKKQKKSRVEQLYAKKA--KVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQW--PS---QVRTER 73 (711) Q Consensus 1 ~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw--~~---~~~~~~ 73 (711) |+.=-| ++++.+...- .-.+... ...+.|.++.+.+ ...+....+..+||.|.|= .. ...... T Consensus 1 ~~~~~~--~~~~~~~~~~~~~~~~~~~-~~~~~i~~~i~~~-------~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~ 70 (478) T protein:vir:10 1 MISINW--PWDKPYHEQVVEQIKPKYE-TQEEMILRLVREH-------KENIDNITMGERYYNHHPDILDAPFKRDVNGD 70 (478) T ss_pred Cccccc--cCCchhhhHHHHHhhhccC-ChHHHHHHHHHHH-------HHHHHHHHHHHHHhcccccccccchhhhcccc Confidence 554321 1111111110 0001111 1222344444332 3345667788999999751 00 001111 Q ss_pred HHhCCC--ceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 74 ELEQRP--CLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEY 151 (711) Q Consensus 74 ~~~g~p--~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~ 151 (711) ...++| .+++|..+.+|+..+|+.-.+.+.+.. ++.+..+.+.. +++ T Consensus 71 ~~~~~~~~ki~~n~~k~ivd~~~~yl~g~p~~~~~---------------------------~~~~~~~~l~~----~~~ 119 (478) T protein:vir:10 71 YDETKPDWRMYTNYHQNLVDQKVAYAVANPVTFGV---------------------------DNDKALKQIQH----TLN 119 (478) T ss_pred cccccccceeccchHHHHHHHHhhhhcccCceeec---------------------------CChHHHHHHHH----HHh Confidence 223344 367999999999999998766554421 23334343333 333 Q ss_pred hcCHHHHHHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCcccee--eCCCccccCccccceeeeeecCCHHHH Q lcl|Aclame:pro 152 NCDAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDTMSKEKF 229 (711) Q Consensus 152 ~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~ 229 (711) ++++.....+.+++.++|.||..|+.+. ++++++..+ +|.+++ ||+... .+..++ ++.+-..+ T Consensus 120 -n~~~~~~~~~~~~~~~~G~~~~~v~~d~------~~~~~~~~~-~p~~~~~v~d~~~~----~~~~~~-ir~~~~~~-- 184 (478) T protein:vir:10 120 -HKWDDKLVDILTAASNKGIEWVQPYVDE------EGEFKTFRV-PAEQAVPIWTNKER----DELQAF-IRVYELDG-- 184 (478) T ss_pred -ccHHHHHHHHHHHHhhCCeEEEEEEecC------CCceEEEEE-cccceEEEEcCCCC----CceEEE-EEEEeeeC-- Confidence 7899999999999999999998887753 256777776 788765 453221 122222 22221100 Q ss_pred HHhcCCcccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccce Q lcl|Aclame:pro 230 KALYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKT 309 (711) Q Consensus 230 ~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~ 309 (711) ...+++|..... .++....+...+-.. T Consensus 185 ----------------------------~~~~~~y~~~~i--~~~~~~~~~~~~~~~----------------------- 211 (478) T protein:vir:10 185 ----------------------------AERVEYWTKDDV--TFYELKEGQLIPDFY----------------------- 211 (478) T ss_pred ----------------------------ceEEEEEeCCcE--EEEEecCCeeecccc----------------------- Confidence 001233322211 111111111110000 Q ss_pred EEEEEEEEecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecc Q lcl|Aclame:pro 310 FKTYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSE 389 (711) Q Consensus 310 ~~v~~~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~ 389 (711) ..........+.+..|.+.+.+|+++|. +...+.|.+..++++++.+|...|.+.+.+.....+.+++.- T Consensus 212 ---~~~~~~~~~~~~~~~~~~~g~vPvv~~~-------n~~~g~sd~e~v~~liDa~~~~~S~~~~~~~~~~~~~~~~~g 281 (478) T protein:vir:10 212 ---RSEDHIQPHYYQGNKLMSWGRVPFIPFK-------NNPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILKG 281 (478) T ss_pred ---ccccccccceecccccccCCcceEEEec-------cCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCcceeeec Confidence 0000011122334456677888887653 234577999999999999999999999999888888766542 Q ss_pred cccCChHHHHhhcccCCCceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHH Q lcl|Aclame:pro 390 GNVEGREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAII 469 (711) Q Consensus 390 ~av~~~~~~~~~~~~~~~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~ 469 (711) -..++..+... +.....++.+.+.. ++.+.++....-..++...++...+.|-..|++++.+.+..+++.||.|+. T Consensus 282 ~~~~~~~~~~~--~~~~~~~~~~~~~~--~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Ai~ 357 (478) T protein:vir:10 282 YEGEDMKDFMH--NLKYYKAISVAGES--GSGVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKFGNSPSGIALK 357 (478) T ss_pred CCcccccchhh--hhhhCceeEecCCC--CCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCcCcCccccccchHHHHHH Confidence 22222222211 12233455554332 234666766666777888899999999999999887777666778999998 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeee Q lcl|Aclame:pro 470 ARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYD 549 (711) Q Consensus 470 ~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~d 549 (711) .+..............|..+++++.++++.+...- .++ .+ T Consensus 358 ~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~---------------~d~-------------------------~~ 397 (478) T protein:vir:10 358 FMYSNLDLKANKLKNKTLTALQELLQYIIDFYRLD---------------VRV-------------------------QD 397 (478) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC---------------ccc-------------------------cc Confidence 87666666666666666666666666555432110 110 01 Q ss_pred EEeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCC-cchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHH Q lcl|Aclame:pro 550 VVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDW-PGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQT 628 (711) Q Consensus 550 v~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~-~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q 628 (711) +.+.=.+..+....+..+.+..+...++. ..+++.+++ ...+.-.+++++.........+. T Consensus 398 i~i~f~~~~p~~~~e~~~~~~~~~g~iS~------et~i~~~~~v~d~~~E~~ri~~E~~~~~~~~~~------------ 459 (478) T protein:vir:10 398 IEITFNFNVMVNELENSQIAMNSTGLLSK------ETILGNHSWVQDPVAEMERIEQENIELNQQLPD------------ 459 (478) T ss_pred ceEEeCCCCCCCHHHHHHHHHHHhCCCCh------HHHHHhCCCCCCHHHHHHHHHHHHHHHHHhccc------------ Confidence 11222223222222333333333322221 222333332 22333333332221111000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 629 EPTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQ 663 (711) Q Consensus 629 ~~~~~~q~~~~~~q~~~~k~qae~~~aqae~~~~q 663 (711) ..+...+.+..+.+-...+ T Consensus 460 ----------------~~~~~~d~~~~~~~d~~~e 478 (478) T protein:vir:10 460 ----------------IEEGLNDEQQRQSEDNQSE 478 (478) T ss_pred ----------------cCCCCcccccccCcCCCCC Confidence 0000000000000000000 No 62 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=99.82 E-value=3e-19 Score=122.13 Aligned_cols=455 Identities=10% Similarity=0.020 Sum_probs=227.9 Q ss_pred HHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCC--CCHHHHH-------HHHHhCCC--ceEehhhHHHHHHHhhhhh Q lcl|Aclame:pro 30 ALLATARERARDGATYWKDNWEAAEDDLKFLGGEQ--WPSQVRT-------ERELEQRP--CLVNNVLPTFVDQVLGDQR 98 (711) Q Consensus 30 ~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Q--w~~~~~~-------~~~~~g~p--~~~~N~i~~~v~~i~g~~~ 98 (711) ..+..+.+.................+..+||.|.+ |...... .....++| .+++|..+.+|+..+|+.- T Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~I~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~ 80 (470) T protein:vir:10 1 MELDALKKLIQNTSTSRNDLINNYKQAVNYYENKTDITTRNNGKAKLNKEGKKDPLRSADNRIPSNFYQLLVDQEAGYVA 80 (470) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhccccchhcccccccccccccCCcccccchHHHHHHhhhhhee Confidence 22223333333333333445566778899999975 2111110 11122333 4789999999999999987 Q ss_pred hcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCccEEEEEE Q lcl|Aclame:pro 99 QNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESGMGYLRVRS 178 (711) Q Consensus 99 ~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~ 178 (711) .+.+.+.+ ++.+..+.+..++. +++....+....+++++|.+|..+++ T Consensus 81 G~p~~~~~---------------------------~d~~~~~~l~~~~~-----~~~~~~~~~l~~~~~~~G~a~~~~y~ 128 (470) T protein:vir:10 81 SVFPDIDV---------------------------GKDADNKKIIDVLG-----DDRALTLNGLLVDSSNAGRAWLHYWI 128 (470) T ss_pred ccceeeec---------------------------CchHHHHHHHHHHh-----hhHHHHHHHHHHHHhhcCeeEEEEEe Confidence 76555432 23334444444443 35677778888999999999998876 Q ss_pred eeccCCCCCCcceEEEecCcccee--eCCCccccCccccceeeeeecCCHHHHHHhcCCcccchhhcccccccccCCCCC Q lcl|Aclame:pro 179 DYLADDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSVADYDTWFTEK 256 (711) Q Consensus 179 d~~~~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~ 256 (711) +. ++++++..+ +|..+| ||+... ... .++++.|...+ +.... T Consensus 129 d~------~~~~~~~~~-~p~~~~~v~d~~~~----~~~-~a~ir~y~~~~------------------------~~~~~ 172 (470) T protein:vir:10 129 DE------DGNFRYGII-QPDQITPIYATTLD----NKL-LGILRSYKQLD------------------------PDSGK 172 (470) T ss_pred cC------CCceEEEEE-cccceEEEEcCCCC----Cce-EEEEEEEEeee------------------------cCCce Confidence 53 256777766 787777 333211 111 22222222110 01122 Q ss_pred eEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEEecCceeccCccCCCCccce Q lcl|Aclame:pro 257 SVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGPVEIPSTTIPV 336 (711) Q Consensus 257 ~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~le~~~p~~~~~~P~ 336 (711) .+..+|+|....... +....+.......... ... . -....+.....+..+.+.+.+|+ T Consensus 173 ~~~~~e~yt~~~~~~--~~~~~~~~~~~~~~~~------------~~~-----~---~~~~~~~~~~~~~~~~~~g~vPv 230 (470) T protein:vir:10 173 YFTVHEYWTDKEAQF--FRTNATDSTVIEPYNI------------ITS-----Y---DLSAGYETGQSNTLKHNFGRVPF 230 (470) T ss_pred EEEEEEEEcCCcEEE--EEeecCcceecccccc------------ccc-----c---ccccccccccccccccCCCeeeE Confidence 344556554332211 1111111110000000 000 0 00000111111222334455666 Q ss_pred EEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHHHHhhcccCCCceEEeccc- Q lcl|Aclame:pro 337 IPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNFSLLTYIPQ- 415 (711) Q Consensus 337 vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~~~~~~~~~~i~~~~~- 415 (711) ++| + +...+.|.+..++++++.+|..+|.+.+.+.-.+++.+++.-...++..+.... .+.++.+.+... T Consensus 231 v~~---~----nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lvl~g~~~~~~~~~~~~--~~~~~~i~~~~~~ 301 (470) T protein:vir:10 231 IEF---S----KNKYRLPELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGADLHQFMND--LRKYKSIKINNTG 301 (470) T ss_pred EEe---e----cCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHhcCcceeeecCCccccchhhhh--hhhcCeEeccCCC Confidence 544 2 223467999999999999999999999999999888888764444444444332 233444555432 Q ss_pred ccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 416 YQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGK 495 (711) Q Consensus 416 ~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~ 495 (711) ...++.++++..+.-.......++.....|-..|++++.+.+.. ++.||+|+..+...........-..|..+++++.+ T Consensus 302 ~~~~~~~~~lt~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~-gn~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~ 380 (470) T protein:vir:10 302 NGDNSGVDKLQIDIPVEARDDALKITRKNIFLFGQGIDPANFES-SNASGVAIKMLYSHLELKAAKTQTYFEHAINELVR 380 (470) T ss_pred CCcCceeEEEeecCChHHHHHHHHHHHHHHHHHhCCCCCCcccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22334577777777778888899999999999999988776554 46899999888777777766666666666666666 Q ss_pred HHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEeecccChHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 496 ILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQA 575 (711) Q Consensus 496 ~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~ 575 (711) +++.++ .- .+ .++ .++.+.=.+..+....+..+.+..+... T Consensus 381 ~i~~~l----~~------~~----~d~-------------------------~~i~i~f~~~~p~d~~e~~~~~~~~~g~ 421 (470) T protein:vir:10 381 AIMRYL----NF------SD----ADK-------------------------RHISQHWTRTKVEDSLTKAQIVSTVANY 421 (470) T ss_pred HHHHHh----cc------cC----ccc-------------------------ceeeEEeccCCCCCHHHHHHHHHHHhcc Confidence 555433 11 00 011 0111111222222222222333333222 Q ss_pred cchhHHHHHHHHHHhcCC-cchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 576 VPSAAAVMADLIAQNMDW-PGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQ 647 (711) Q Consensus 576 ~p~~~~~~~~~~~~~~~~-~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~~~~k 647 (711) ++ ...+++.+++ .+.++-.+++.+......+....... .......-.+ T Consensus 422 iS------~et~l~~~p~v~D~~~E~eri~~E~~e~~~~~~~~~~------------------~~~~~~dde~ 470 (470) T protein:vir:10 422 SS------KEAVAKANPIVDDWQQELKDLAKDKEENDPYSNQADE------------------LNGKGVNDEQ 470 (470) T ss_pred Cc------HHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhccccc------------------cCCCCCCCCC Confidence 21 1223344432 23333333332211110000000000 0000000000 No 63 >protein:vir:8883 Length: 543 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813772;genbank:gi:29366727;genbank:GeneID:1258836 Probab=99.82 E-value=2.4e-18 Score=117.16 Aligned_cols=528 Identities=12% Similarity=0.030 Sum_probs=258.9 Q ss_pred CCcCCCCCCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCC----CCCHHHHHHHHHh Q lcl|Aclame:pro 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGE----QWPSQVRTERELE 76 (711) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~----Qw~~~~~~~~~~~ 76 (711) ||++++.. .. -+.+..+|....+....|...|.++.+|..-. ++... .. + T Consensus 1 ~~~~~~~~------------------~~---~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~----~~-~ 54 (543) T protein:vir:88 1 MAETKREG------------------LA---EEGAKAVYERLKNDRVPYETRAENCAKVTIPSLFPKDSDNS----ST-D 54 (543) T ss_pred CcccccCc------------------ch---HHHHHHHHHHHHHHHhHHHHHHHHHHHHhccccCCCCCCcc----cc-c Confidence 77665211 11 12345666667777788999999999998532 22211 00 1 Q ss_pred CCCceEehhhHHHHHHHhhhhhh----cccceeEecchhhhhhhhhcccccccccccCCCchh-HHHHH---HHHHHHHH Q lcl|Aclame:pro 77 QRPCLVNNVLPTFVDQVLGDQRQ----NRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKND-YELAE---VFTGLIKN 148 (711) Q Consensus 77 g~p~~~~N~i~~~v~~i~g~~~~----~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d-~~~Ae---~l~~~~~~ 148 (711) . ..+....-...++.+.+..-. .++=.++.+.+... .. ...++.+ .+..+ .++..+.. T Consensus 55 ~-~~~~dst~~~a~~~Laa~l~~~ltP~~~WF~l~~~d~~~------------~~-~~~~~~~~~~v~~~L~~ve~~~~~ 120 (543) T protein:vir:88 55 Y-TTPWQAVGARGLNNLSAKVMLALFPLQSWMKLKVSEWQA------------KQ-LVSDPSQLAVVEQGLGMVERILMS 120 (543) T ss_pred c-cccccchHHHHHHHHHHHHHHhhcCCCcccccccChHHH------------hc-ccCChhhHHHHHHHHHHHHHHHHH Confidence 0 112222222333333222111 11111221111000 00 0001111 11222 23445555 Q ss_pred HHhhcCHHHHHHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCccceeeCCCccccCccccceeeeeecCCHHH Q lcl|Aclame:pro 149 IEYNCDAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEK 228 (711) Q Consensus 149 ~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e 228 (711) ....|+|......++.+.+..|+|+.-+. ...+... ..-.+..+ +..++++..++. -...-++++..++... T Consensus 121 ~~~~snf~~~~~~~~~~L~~~G~a~ly~~--~~~~~~~-~~~~~~~~-pl~~y~v~~d~~----G~v~~i~r~~~~~~~~ 192 (543) T protein:vir:88 121 YMEANSYRVTLFELIRQLALAGTALIYLP--PPDASSN-SYNPMKLY-TLHNHVVQRDAF----GNVLQIVTLDKVAYAA 192 (543) T ss_pred HHHhcCcHHHHHHHHHHHHhhCceeeeec--cCccccc-eecceEEe-EcceEEEeeCCC----CCeeeeeeeeeccHHH Confidence 56789999999999999999999985332 2111111 11112223 445666654442 1345677889999999 Q ss_pred HHHhcCCcccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccc Q lcl|Aclame:pro 229 FKALYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVK 308 (711) Q Consensus 229 ~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 308 (711) +.+.||+...... +.+ ..+.+.|+++-+.+.... .+. T Consensus 193 l~~~~~~~v~~~~------~~~---p~~~~~v~~~V~pr~~~~--------~~~-------------------------- 229 (543) T protein:vir:88 193 LPEDVRNSLSGGQ------EYK---PEQELEVYTHIYIDDESG--------DFL-------------------------- 229 (543) T ss_pred HhHHhhHHHHHHh------hcC---CccceEEEEEEEeecCCC--------ccc-------------------------- Confidence 9888875421111 111 124566666544332111 000 Q ss_pred eEEEEEEEEecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEec Q lcl|Aclame:pro 309 TFKTYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGS 388 (711) Q Consensus 309 ~~~v~~~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~ 388 (711) ++.-+.|..+....+.|+..++||+++. +..+++..||.|.+....+-.+.+|.+....+..+..+.+++++++ T Consensus 230 ----~~~~~~~~~v~~~~~~~~~~e~P~i~~R--w~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~~v~ 303 (543) T protein:vir:88 230 ----SYQEIEGVEVDGSDGQYPQDALPWIAVR--WTKRDGEHYGRSHVEEYLGDLNSLESLNEAMIKFAMISSKVVGLVN 303 (543) T ss_pred ----ccccccCeeeecCCCccccccCCceeee--eeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeec Confidence 0011122222223345667789999764 4456889999999999999999999999999999999999999998 Q ss_pred ccccCChHHHHhhcccCCCceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHH Q lcl|Aclame:pro 389 EGNVEGREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAI 468 (711) Q Consensus 389 ~~av~~~~~~~~~~~~~~~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai 468 (711) ++.+.+..+... ..+|.++ ++...+..+-......-.......++...+.|.... ..+.+...++.+.|+.-| T Consensus 304 ~~g~~~~~~~~~---~~~g~~v---~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af-~~~~~~~~~~~r~TAtEV 376 (543) T protein:vir:88 304 PNGITQVRRLVK---AQTGDFV---AGRKADIEFLQLEKTADFTVAKSVADAIEARLSYVF-MLNSAVQRSGERVTAEEI 376 (543) T ss_pred cccccchhhccc---CCCceee---cCCCCcceeeecccccchhHHHHHHHHHHHHHHHHH-hhhhhccCCCCcccHHHH Confidence 888877654321 2233333 333333223223333345557778888888888766 233333356677899999 Q ss_pred HHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhhee Q lcl|Aclame:pro 469 IARQRQGDRGSFAFIDNLTK-SIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQK 547 (711) Q Consensus 469 ~~~~~~~~~~~~~~~dn~~~-~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~ 547 (711) ..+.+.....+..++.+|.. +...+.+..+.++.+.---+. + | . .. T Consensus 377 ~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~---------------~------p-~-----------~~ 423 (543) T protein:vir:88 377 RYVASELEDTLGGVYSILSQELQLPIVRVLLNQLQATQQIPN---------------L------P-Q-----------EA 423 (543) T ss_pred HHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCC---------------C------c-h-----------hc Confidence 99999999999998888864 666677666666655321100 0 0 0 00 Q ss_pred eeEEeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCCcchHHHHHHHHhhhcch--hhcchhhhhhhhhHH Q lcl|Aclame:pro 548 YDVVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDWPGADVIAERLKKIVPPN--VLSKDEREAIEEDMP 625 (711) Q Consensus 548 ~dv~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~--~~~~~~~~~~~~~~~ 625 (711) +.+.+.. +-.+-.|.+..+.|..+++....+++ + ..++.-+.+++...+....+-. .....+.+..+..++ T Consensus 424 v~~~~vs-~l~~l~r~~~~~~l~~~~~~v~~~~~---p---~vld~id~d~~~~~~a~~~Gv~~~~i~r~~~e~~~~~~q 496 (543) T protein:vir:88 424 VEPTVTT-GAEALGRGQDLDKLTQFLNAVATVSQ---L---NGDPDLNVNNIKLRLANAIGIDTAGLLLTEAEKAQAQSQ 496 (543) T ss_pred eeeeEEe-cHHHHHHHHHHHHHHHHHHHHHhccc---h---hhhccCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHH Confidence 1222222 22334556666666666655432222 2 2344556777777776665542 121211111111000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 626 EQTEPTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQLETEEAQKQLAMIEDMAQGGDVVYQQVRELV 694 (711) Q Consensus 626 ~~q~~~~~~q~~~~~~q~~~~k~qae~~~aqae~~~~q~~~~~~q~q~~~~~~~~q~~~~~~~~~~~~~ 694 (711) + +.+ +++.+++.++. +... +..... - ++.. ++++.+..+-.-+..++ T Consensus 497 ~--~~q---~~~~~~~~~~~----~~~~---~~~~~~-~---~~~~------~~~~~~~~~~~p~~~~~ 543 (543) T protein:vir:88 497 E--MLK---QGGLNAAAGIG----SGVA---AQATAS-P---EAME------SAMDTAGVQPGPIATQV 543 (543) T ss_pred H--HHH---HHHHHHHHHHh----hchh---hhhccC-h---HHHH------HHhhhcCCCCCCCCCCC Confidence 0 000 00000000000 0000 000000 0 0000 00000000000000001 No 64 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=99.82 E-value=3.1e-19 Score=122.02 Aligned_cols=463 Identities=12% Similarity=0.070 Sum_probs=233.8 Q ss_pred CCcCCCCCCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCH-----HHHHHHHH Q lcl|Aclame:pro 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPS-----QVRTEREL 75 (711) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~-----~~~~~~~~ 75 (711) |-.+-+.+-.++-.+.--+....+.+...++|.++.+.+. ..+....+..+||.|+|=-. ........ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~-------~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~ 73 (474) T protein:vir:96 1 MINIIRMPWDKPYGEEVVEQMKPKVETQEEMIIRLINNHK-------QKLKDINVGQKYYDKDNDINYQAYKQDLHGNID 73 (474) T ss_pred CcccccCCCCCCCCcchhhhccccccchHHHHHHHHHHHH-------HHHHHHHHHHHHhcccCccccccchhhhccccc Confidence 7776655555544444444444555555556666655433 23455677899999986100 00111111 Q ss_pred hCCC--ceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 76 EQRP--CLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNC 153 (711) Q Consensus 76 ~g~p--~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~ 153 (711) ..+| .+++|..+.+|+..+|+.-.+.+.+.. .+.+..+.+... . .+ T Consensus 74 ~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~---------------------------~~~~~~~~l~~~----~-~n 121 (474) T protein:vir:96 74 YTKPDWRITTNFHQNLVDQKVSYVAGKPVTYAH---------------------------DDDKVLDVIHQV----L-DT 121 (474) T ss_pred ccccccccccchHHHHHHhhhhhhcccCceecc---------------------------CChHHHHHHHHH----H-hc Confidence 2223 478999999999999998766544421 223333444333 3 37 Q ss_pred CHHHHHHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCcccee--eCCCccccCccccceeeeeecCCHHHHHH Q lcl|Aclame:pro 154 DAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDTMSKEKFKA 231 (711) Q Consensus 154 ~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~ 231 (711) +++.....+.++++++|.||..++.+. ++++++..+ +|.++| ||+... .+. ..+++.|... T Consensus 122 ~~~~~~~~l~~~~~~~G~~~~~~~~d~------~~~~~i~~~-~p~~~~~v~d~~~~----~~~-~a~ir~~~~~----- 184 (474) T protein:vir:96 122 RWDNKLIDILTAASNKGIDWLQVYINE------DGELKLFRV-PAEQAIPIWTDKER----EQL-NAFIRIFTFN----- 184 (474) T ss_pred cHHHHHHHHHHHHhhCCeEEEEeeeCC------CCceEEEEE-cccceEEEEcCCCC----Cce-EEEEEEEeec----- Confidence 899999999999999999998877642 256777776 787777 443221 122 3334433210 Q ss_pred hcCCcccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEE Q lcl|Aclame:pro 232 LYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFK 311 (711) Q Consensus 232 ~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~ 311 (711) ....+++|...... .+...++.... T Consensus 185 -------------------------~~~~~~vy~~~~i~--~~~~~~~~~~~---------------------------- 209 (474) T protein:vir:96 185 -------------------------GETKVEYWTAETVT--YYVYENGGLIP---------------------------- 209 (474) T ss_pred -------------------------CeeEEEEEeCCeEE--EEEEcCCceee---------------------------- Confidence 00112333222111 11111111000 Q ss_pred EEEEEEecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccc Q lcl|Aclame:pro 312 TYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGN 391 (711) Q Consensus 312 v~~~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~a 391 (711) .....+........|.+.+.+|+++|. +...+.|.+..+++.++.+|...|.+.+.+...+.+.+++. |. T Consensus 210 --~~~~~~~~~~~~~~~~~~~~vPvv~~~-------nn~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~-g~ 279 (474) T protein:vir:96 210 --DFYYGDEHIQTHFSTGSWERVPFIAFK-------NNPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILR-GY 279 (474) T ss_pred --ccccccccccCcccccCCCccceEEec-------CCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhc-CC Confidence 000011111122334455666766543 22346799999999999999999999999999888876653 43 Q ss_pred -cCChHHHHhhcccCCCceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHH Q lcl|Aclame:pro 392 -VEGREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIA 470 (711) Q Consensus 392 -v~~~~~~~~~~~~~~~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~ 470 (711) .++..+... ..+...++.+..+ +.+.++..+.-..+....++.....|-..|++.+.+.+..+++.||.|+.. T Consensus 280 ~~~~~~~~~~--~~~~~~~i~~~~~----~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~ 353 (474) T protein:vir:96 280 EGEDLSEFME--GLKYYKAINVSSD----GGVETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTDKFGSATSGIALKF 353 (474) T ss_pred Ccccccchhh--hhhccceeeccCC----CceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCccccccccccHHHHHHH Confidence 222222211 1223345555433 346667666677888889999999999999998877766667789999988 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeE Q lcl|Aclame:pro 471 RQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDV 550 (711) Q Consensus 471 ~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv 550 (711) +..............|..+++++.++++. +.. +.. ++. +| T Consensus 354 ~~~~l~~k~~~~~~~~~~~l~~~~~~i~~----~~g------~~~-----d~~-------------------------~i 393 (474) T protein:vir:96 354 LYTNLNLKANKLKNKANVALQELMQFILD----FNK------IKL-----DAK-------------------------EI 393 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HhC------CCc-----ccc-------------------------ee Confidence 76666666666666666666666665544 321 000 100 11 Q ss_pred EeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCC-cchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHH Q lcl|Aclame:pro 551 VVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDW-PGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTE 629 (711) Q Consensus 551 ~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~-~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~ 629 (711) .+.=.+..+..-.+..+.+.+ +..+ ....+++.+++ .+.+.-.+++++..... T Consensus 394 ~i~f~~~~p~~~~e~a~~~~~-~gii------S~et~~~~lp~v~D~~~E~eri~~E~~~~------------------- 447 (474) T protein:vir:96 394 EITFNFNVMVNDLEQSQIGAQ-SQYL------SKETLVRHHPWVDDPKAELERLDEEQLEL------------------- 447 (474) T ss_pred eEEecCCCccCHHHHHHHHHH-cCCC------ChHHHHHhCCCCCCHHHHHHHHHHHHHHH------------------- Confidence 111111222211222222221 1111 11222333332 22222222222111000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 630 PTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQ 663 (711) Q Consensus 630 ~~~~~q~~~~~~q~~~~k~qae~~~aqae~~~~q 663 (711) ..+++ ........ ...+..+.+..+.+ T Consensus 448 -~~~~~-~~~~~~~~-----~~~~~~~~~~~e~~ 474 (474) T protein:vir:96 448 -NKQLP-NLDDGGAD-----GAQQQQQSENNQSK 474 (474) T ss_pred -Hhhcc-ccccccCC-----CCCCcCCCCccccC Confidence 00000 00000000 00000000000000 No 65 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=99.82 E-value=3.1e-19 Score=122.02 Aligned_cols=463 Identities=12% Similarity=0.070 Sum_probs=233.8 Q ss_pred CCcCCCCCCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCH-----HHHHHHHH Q lcl|Aclame:pro 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPS-----QVRTEREL 75 (711) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~-----~~~~~~~~ 75 (711) |-.+-+.+-.++-.+.--+....+.+...++|.++.+.+. ..+....+..+||.|+|=-. ........ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~-------~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~ 73 (474) T protein:vir:95 1 MINIIRMPWDKPYGEEVVEQMKPKVETQEEMIIRLINNHK-------QKLKDINVGQKYYDKDNDINYQAYKQDLHGNID 73 (474) T ss_pred CcccccCCCCCCCCcchhhhccccccchHHHHHHHHHHHH-------HHHHHHHHHHHHhcccCccccccchhhhccccc Confidence 7776655555544444444444555555556666655433 23455677899999986100 00111111 Q ss_pred hCCC--ceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 76 EQRP--CLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNC 153 (711) Q Consensus 76 ~g~p--~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~ 153 (711) ..+| .+++|..+.+|+..+|+.-.+.+.+.. .+.+..+.+... . .+ T Consensus 74 ~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~---------------------------~~~~~~~~l~~~----~-~n 121 (474) T protein:vir:95 74 YTKPDWRITTNFHQNLVDQKVSYVAGKPVTYAH---------------------------DDDKVLDVIHQV----L-DT 121 (474) T ss_pred ccccccccccchHHHHHHhhhhhhcccCceecc---------------------------CChHHHHHHHHH----H-hc Confidence 2223 478999999999999998766544421 223333444333 3 37 Q ss_pred CHHHHHHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCcccee--eCCCccccCccccceeeeeecCCHHHHHH Q lcl|Aclame:pro 154 DAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDTMSKEKFKA 231 (711) Q Consensus 154 ~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~ 231 (711) +++.....+.++++++|.||..++.+. ++++++..+ +|.++| ||+... .+. ..+++.|... T Consensus 122 ~~~~~~~~l~~~~~~~G~~~~~~~~d~------~~~~~i~~~-~p~~~~~v~d~~~~----~~~-~a~ir~~~~~----- 184 (474) T protein:vir:95 122 RWDNKLIDILTAASNKGIDWLQVYINE------DGELKLFRV-PAEQAIPIWTDKER----EQL-NAFIRIFTFN----- 184 (474) T ss_pred cHHHHHHHHHHHHhhCCeEEEEeeeCC------CCceEEEEE-cccceEEEEcCCCC----Cce-EEEEEEEeec----- Confidence 899999999999999999998877642 256777776 787777 443221 122 3334433210 Q ss_pred hcCCcccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEE Q lcl|Aclame:pro 232 LYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFK 311 (711) Q Consensus 232 ~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~ 311 (711) ....+++|...... .+...++.... T Consensus 185 -------------------------~~~~~~vy~~~~i~--~~~~~~~~~~~---------------------------- 209 (474) T protein:vir:95 185 -------------------------GETKVEYWTAETVT--YYVYENGGLIP---------------------------- 209 (474) T ss_pred -------------------------CeeEEEEEeCCeEE--EEEEcCCceee---------------------------- Confidence 00112333222111 11111111000 Q ss_pred EEEEEEecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccc Q lcl|Aclame:pro 312 TYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGN 391 (711) Q Consensus 312 v~~~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~a 391 (711) .....+........|.+.+.+|+++|. +...+.|.+..+++.++.+|...|.+.+.+...+.+.+++. |. T Consensus 210 --~~~~~~~~~~~~~~~~~~~~vPvv~~~-------nn~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~-g~ 279 (474) T protein:vir:95 210 --DFYYGDEHIQTHFSTGSWERVPFIAFK-------NNPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILR-GY 279 (474) T ss_pred --ccccccccccCcccccCCCccceEEec-------CCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhc-CC Confidence 000011111122334455666766543 22346799999999999999999999999999888876653 43 Q ss_pred -cCChHHHHhhcccCCCceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHH Q lcl|Aclame:pro 392 -VEGREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIA 470 (711) Q Consensus 392 -v~~~~~~~~~~~~~~~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~ 470 (711) .++..+... ..+...++.+..+ +.+.++..+.-..+....++.....|-..|++.+.+.+..+++.||.|+.. T Consensus 280 ~~~~~~~~~~--~~~~~~~i~~~~~----~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~ 353 (474) T protein:vir:95 280 EGEDLSEFME--GLKYYKAINVSSD----GGVETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTDKFGSATSGIALKF 353 (474) T ss_pred Ccccccchhh--hhhccceeeccCC----CceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCccccccccccHHHHHHH Confidence 222222211 1223345555433 346667666677888889999999999999998877766667789999988 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeE Q lcl|Aclame:pro 471 RQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDV 550 (711) Q Consensus 471 ~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv 550 (711) +..............|..+++++.++++. +.. +.. ++. +| T Consensus 354 ~~~~l~~k~~~~~~~~~~~l~~~~~~i~~----~~g------~~~-----d~~-------------------------~i 393 (474) T protein:vir:95 354 LYTNLNLKANKLKNKANVALQELMQFILD----FNK------IKL-----DAK-------------------------EI 393 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HhC------CCc-----ccc-------------------------ee Confidence 76666666666666666666666665544 321 000 100 11 Q ss_pred EeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCC-cchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHH Q lcl|Aclame:pro 551 VVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDW-PGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTE 629 (711) Q Consensus 551 ~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~-~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~ 629 (711) .+.=.+..+..-.+..+.+.+ +..+ ....+++.+++ .+.+.-.+++++..... T Consensus 394 ~i~f~~~~p~~~~e~a~~~~~-~gii------S~et~~~~lp~v~D~~~E~eri~~E~~~~------------------- 447 (474) T protein:vir:95 394 EITFNFNVMVNDLEQSQIGAQ-SQYL------SKETLVRHHPWVDDPKAELERLDEEQLEL------------------- 447 (474) T ss_pred eEEecCCCccCHHHHHHHHHH-cCCC------ChHHHHHhCCCCCCHHHHHHHHHHHHHHH------------------- Confidence 111111222211222222221 1111 11222333332 22222222222111000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 630 PTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQ 663 (711) Q Consensus 630 ~~~~~q~~~~~~q~~~~k~qae~~~aqae~~~~q 663 (711) ..+++ ........ ...+..+.+..+.+ T Consensus 448 -~~~~~-~~~~~~~~-----~~~~~~~~~~~e~~ 474 (474) T protein:vir:95 448 -NKQLP-NLDDGGAD-----GAQQQQQSENNQSK 474 (474) T ss_pred -Hhhcc-ccccccCC-----CCCCcCCCCccccC Confidence 00000 00000000 00000000000000 No 66 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=99.82 E-value=1.4e-18 Score=118.51 Aligned_cols=435 Identities=13% Similarity=0.052 Sum_probs=208.6 Q ss_pred CcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHH-HhCCCceEehhhHHHHHHHhhhhhhccc Q lcl|Aclame:pro 24 NNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERE-LEQRPCLVNNVLPTFVDQVLGDQRQNRP 102 (711) Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~-~~g~p~~~~N~i~~~v~~i~g~~~~~r~ 102 (711) =+++..+.+..+...+.. .+....+-.+||.|+|.......... ....-.++.|..+-+|+..++...-+ T Consensus 1 ~~~~~~~~i~~l~~~~~~-------~~~r~~~l~~Yy~G~~~i~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~-- 71 (441) T protein:vir:80 1 MNSDELALIEGMYDRIQR-------LSSWHCCIEGYYEGSNRVRDLGVAIPPELQRVQTVVSWPGIAVDALEERLDWL-- 71 (441) T ss_pred CCccHHHHHHHHHHHHHH-------HHHHHHHHHHHHhcCCcchhcCcccchhhhhhhhhcchHHHHHHHHHhhhccc-- Confidence 234555556666665433 23344455799999986432211100 00112367899999999887754200 Q ss_pred ceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCccEEEEEEeecc Q lcl|Aclame:pro 103 AIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESGMGYLRVRSDYLA 182 (711) Q Consensus 103 ~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~ 182 (711) .| ..+++. -+..+++.|++......++.++++.|.||.-|+.+ T Consensus 72 --g~------------------------~~~d~~--------~l~~i~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d--- 114 (441) T protein:vir:80 72 --GW------------------------TNGDGY--------GLDGVYAANRLATASCDVHLDALIFGLSFVAIIPH--- 114 (441) T ss_pred --cc------------------------cCCChH--------HHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEEeC--- Confidence 01 011221 24566778999999999999999999999877653 Q ss_pred CCCCCCcceEEEecCcccee--eCCCccccCccccceeeeeecCCHHHHHHhcCCcccchhhcccccccccCCCCCeEEE Q lcl|Aclame:pro 183 DDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSVADYDTWFTEKSVRV 260 (711) Q Consensus 183 ~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v 260 (711) + ++++++..+ +|.+++ ||+...... + ++.+..+- . +.+.. T Consensus 115 ~---~g~~~i~~~-~p~~~~~i~d~~~~~~~---~-~~~~~~~~-----------~-------------------~~~~~ 156 (441) T protein:vir:80 115 G---DGTVSVRPQ-SPKNCTGKFSADGSRLD---A-GLVVQQTC-----------D-------------------PEVVE 156 (441) T ss_pred C---CCceEEEEE-ccceEEEEEeCCCCcee---E-EEEEEEEe-----------c-------------------CceEE Confidence 2 356777766 787765 777543221 1 11111110 0 00111 Q ss_pred EEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEEecCceeccCccCCCCccceEEEE Q lcl|Aclame:pro 261 SEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGPVEIPSTTIPVIPVW 340 (711) Q Consensus 261 ~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~le~~~p~~~~~~P~vp~~ 340 (711) .+.|+.. .++.+... -.+.....++.|.+.|.+|+|||. T Consensus 157 ~~vy~~~------------~~~~~~~~-----------------------------~~~~~~~~~~~~~~~g~vPvv~~~ 195 (441) T protein:vir:80 157 AELLLPD------------VIVQVERR-----------------------------GSREWVEVDRIPNVLGAVPLVPIV 195 (441) T ss_pred EEEEecC------------eEEEEEEc-----------------------------CCcceeeccccccCCCceeEEEee Confidence 1222211 01110000 000111223456667888888875 Q ss_pred eeeeccCCcccccc-hHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHHHHhhcccCCCceEEecccccCc Q lcl|Aclame:pro 341 GKSLIIKKKEIFRS-IIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNFSLLTYIPQYQGD 419 (711) Q Consensus 341 ~~~~~~~~~~~~~g-~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~~~~~~~~~~i~~~~~~~~~ 419 (711) -... .+..+|.| +.+.+++.++.+|..+|.+...+...+.+...+. |+-.+... .......+++++.+.++..+. T Consensus 196 n~~~--~~~~~G~s~l~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~-G~~~~~~~-~~~~~~~~~~i~~~~~~~~~~ 271 (441) T protein:vir:80 196 NRRR--TSRIDGRSEITRSIRAYTDEAVRTLLGQSVNRDFYAYPQRWVT-GVSADEFS-QPGWVLSMASVWAVDKDDDGD 271 (441) T ss_pred cccc--CCccCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcCceeeee-cCCccccc-cchhhhcccccccCCCCCCCC Confidence 3322 23344555 4467999999999999999999988888876663 43211110 111124566666655443332 Q ss_pred CCccccCCc-cchHHHHHHHHHHHHHHHHHhCCCHHHhccccch-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 420 PGPRRQPPA-AVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNE-TSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGKIL 497 (711) Q Consensus 420 ~~i~~~~~~-~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~-~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~ 497 (711) .+.+.+.+ .....+...+......+-.++++++..+|..+++ .||.|+..+...-........+.|..+++++++++ T Consensus 272 -~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~ 350 (441) T protein:vir:80 272 -TPNVGSFPVNSPTPYSDQMRLLAQLTAGEAAVPERYFGFITSNPPSGEALAAEESRLVKRAERRQTSFGQGWLSVGFLA 350 (441) T ss_pred -cceeEecCccchHHHHHHHHHHHHHHhcccCCCHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 23322211 1223333444444445555588888888877655 69999988776666666666666666666666655 Q ss_pred HHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEeecccChHHHHHHHHHHHHHHHhhcc Q lcl|Aclame:pro 498 VEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAVP 577 (711) Q Consensus 498 l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~~p 577 (711) +.+ +... +. ....+ .++.+.=.+..+....+..+.+..+.+... T Consensus 351 ~~~----~~~~------~~-~~~~~-------------------------~~i~~~f~~~~~~~~~e~ad~~~kl~~~g~ 394 (441) T protein:vir:80 351 AKA----LDSR------VD-EADFF-------------------------GDVGLRWRDASTPTRAATADAVTKLVGAGI 394 (441) T ss_pred HHH----hcCC------Cc-ccccc-------------------------eeeeEEeCCCCCcCHHHHHHHHHHHHhcCc Confidence 433 2110 00 00000 111111122222223344444555544321 Q ss_pred hhHHHHHHHHHHhcCCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 578 SAAAVMADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEADTAQAQA 657 (711) Q Consensus 578 ~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~~~~k~qae~~~aqa 657 (711) .... ...+++.+++.. +++. ++.+. ..+++.+. . +. +.....+. T Consensus 395 ~~~s--~~~~~~~l~~~~-~e~~-~~~~e-----------------~~e~~~~~-------~-------~~-~~~~~~~~ 438 (441) T protein:vir:80 395 LPAD--SRTVLEMLGLDD-VQVE-AVMRH-----------------RAESSDPL-------A-------VL-AGAISRQT 438 (441) T ss_pred cccc--HHHHHHhCCCCH-HHHH-HHHHH-----------------HHHHHHHH-------H-------HH-hhhhhccc Confidence 1100 112233443321 1111 11100 00000000 0 00 00000111 Q ss_pred HHH Q lcl|Aclame:pro 658 DML 660 (711) Q Consensus 658 e~~ 660 (711) ++. T Consensus 439 ~~~ 441 (441) T protein:vir:80 439 NEV 441 (441) T ss_pred ccC Confidence 111 No 67 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=99.81 E-value=7e-20 Score=125.59 Aligned_cols=454 Identities=11% Similarity=0.052 Sum_probs=232.8 Q ss_pred CCcCCCCCCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCC-C-CHHH--------- Q lcl|Aclame:pro 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQ-W-PSQV--------- 69 (711) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Q-w-~~~~--------- 69 (711) |+=-|+-.-+.+ +. -+.+.|.++.+. ....+++..+..+||.|.+ + .... T Consensus 1 ~~~~~~~~~~~~---~~---------~~~e~i~~~i~~-------~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~ 61 (474) T protein:vir:94 1 MTLYKLIDDIEA---QG---------ILPKHIEALIES-------HKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKED 61 (474) T ss_pred CchHHHHhhccc---cC---------CCHHHHHHHHHH-------hhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhh Confidence 443333222222 11 111123333222 2223444455555555421 1 0000 Q ss_pred ----HHHHHHhCCC--ceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHH Q lcl|Aclame:pro 70 ----RTERELEQRP--CLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFT 143 (711) Q Consensus 70 ----~~~~~~~g~p--~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~ 143 (711) ....+..++| .+++|..+.+|+..+|+.-.+.+.+.+. .+....+.+. T Consensus 62 ~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~yl~g~pv~~~~~--------------------------~~~~~~e~~~ 115 (474) T protein:vir:94 62 FETGGNVRRLDVSVNNKLNNSFDSEIVDTRVGYLHGVPVTYDLD--------------------------ENAEKNEKLK 115 (474) T ss_pred hhhcccccccccCcccccccchHHHHHHhHhhheeccceeEeeC--------------------------CCCcchHHHH Confidence 0112334444 4789999999999999987776655442 2223334456 Q ss_pred HHHHHHHhhcCHHHHHHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCcccee--eCCCccccCccccceeeee Q lcl|Aclame:pro 144 GLIKNIEYNCDAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLID 221 (711) Q Consensus 144 ~~~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~ 221 (711) ..+..+++.|+++.....+..+++++|.+|..++.+. ++++++..+ +|.+++ || +.. +.-+ +.+ T Consensus 116 ~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~------~~~~~~~~i-~p~~~~~v~d-~~~-----~~~~-~i~ 181 (474) T protein:vir:94 116 KFITNFAIRNSVDDEDSEIGKMAAICGYGARLAYIDT------NGDIRIKNI-DPYNVIFVGD-NIL-----EPTY-SLR 181 (474) T ss_pred HHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCC------CCeeEEEEE-cccceEEEEc-CCC-----ceEE-EEE Confidence 6777778889999999999999999999988776542 256777766 677765 33 111 1122 222 Q ss_pred ecCCHHHHHHhcCCcccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchh Q lcl|Aclame:pro 222 DTMSKEKFKALYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISI 301 (711) Q Consensus 222 ~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 301 (711) .|...++ .....+..+++|.... + +.+... T Consensus 182 ~~~~~~~------------------------~~~~~~~~~~~y~~~~----~--------~~~~~~-------------- 211 (474) T protein:vir:94 182 YFYEKDD------------------------DNGTDYVYAEFYDNAY----Y--------YVFRGE-------------- 211 (474) T ss_pred EEEEeeC------------------------CCceEEEEEEEEcCce----E--------EEEeec-------------- Confidence 2211000 0112223344442221 1 111000 Q ss_pred hhhcccceEEEEEEEEecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcC Q lcl|Aclame:pro 302 VRTRKVKTFKTYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAP 381 (711) Q Consensus 302 ~~~~~~~~~~v~~~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~ 381 (711) ..+...+.++.|.+.|.+|+|+|. +...+.|.+..+++.++.+|...|.+...+...+ T Consensus 212 ---------------~~~~~~~~~~~~~~~g~vPvv~~~-------n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~ 269 (474) T protein:vir:94 212 ---------------GIDALQEVGRYEHLFDYNPLFGVP-------NNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTR 269 (474) T ss_pred ---------------CCCcccccccccCCCCccceEEec-------CCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 001111223345555667776542 2345789999999999999999999999999888 Q ss_pred CCceEecccccCChHHHHhhcccCCCceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccc Q lcl|Aclame:pro 382 KAPFIGSEGNVEGREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGN 461 (711) Q Consensus 382 ~~~~~~~~~av~~~~~~~~~~~~~~~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~ 461 (711) ++.+++. |.- ..++.... ....+.+.+.++ ++.+.++..+.-..++...++...+.|-..|++++.+.+..++ T Consensus 270 ~~~l~i~-g~~-~~~~~~~~--~~~~~~i~~~~~---~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~ 342 (474) T protein:vir:94 270 LAYLVLR-GMG-MSEEMIQE--TQKSGAFELFDK---DMDVKYLTKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNG 342 (474) T ss_pred cchhhhc-cCC-CCchhhhh--hhhcceeEecCC---CCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcccccccccc Confidence 8877663 432 11222221 223345555432 2346677666666778888999999999999999888776666 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeee Q lcl|Aclame:pro 462 ETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIH 541 (711) Q Consensus 462 ~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~n 541 (711) +.||.|+..+..............|..++++++++++.++..-.... ...++ T Consensus 343 n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~---------~~~~~------------------- 394 (474) T protein:vir:94 343 NVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKGYNL---------DDDSY------------------- 394 (474) T ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCC---------Ccccc------------------- Confidence 78999998887776777777777777788777777776544321100 00010 Q ss_pred hhhheeeeEEeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCC-cchHHHHHHHHhhhcchhhcchhhhhh Q lcl|Aclame:pro 542 DLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDW-PGADVIAERLKKIVPPNVLSKDEREAI 620 (711) Q Consensus 542 D~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~-~~~~e~~~~l~~~~~~~~~~~~~~~~~ 620 (711) .++.+.=.+..+....+..+.+..+...++ ...+++.+++ .+.+...+++++.........+..... T Consensus 395 ------~~i~~~f~~~~p~d~~e~a~~~~kl~g~iS------~et~~~~l~~v~d~~~E~eri~~E~~e~~~~~~~~~~~ 462 (474) T protein:vir:94 395 ------LNLIFKFTRNIPVNKLEESQVLINLKGQVS------ERTRLGQSQLVDDVDYELDEMEKESLEFNDKLPDIDEG 462 (474) T ss_pred ------ccceEEeCCCCCCCHHHHHHHHHHHhccCc------hHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhcccccCC Confidence 011222222222223333344444432222 1233344432 344333333332221110000000000 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 621 EEDMPEQTEPTPEQQVEMAKSQADMAQAEAD 651 (711) Q Consensus 621 ~~~~~~~q~~~~~~q~~~~~~q~~~~k~qae 651 (711) ....+....+.+ T Consensus 463 -------------------~~~~~~~~~~s~ 474 (474) T protein:vir:94 463 -------------------DANDKSQNNQSE 474 (474) T ss_pred -------------------CcCCCCccccCC Confidence 000000000000 No 68 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=99.81 E-value=7e-20 Score=125.59 Aligned_cols=454 Identities=11% Similarity=0.052 Sum_probs=232.8 Q ss_pred CCcCCCCCCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCC-C-CHHH--------- Q lcl|Aclame:pro 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQ-W-PSQV--------- 69 (711) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Q-w-~~~~--------- 69 (711) |+=-|+-.-+.+ +. -+.+.|.++.+. ....+++..+..+||.|.+ + .... T Consensus 1 ~~~~~~~~~~~~---~~---------~~~e~i~~~i~~-------~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~ 61 (474) T protein:vir:10 1 MTLYKLIDDIEA---QG---------ILPKHIEALIES-------HKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKED 61 (474) T ss_pred CchHHHHhhccc---cC---------CCHHHHHHHHHH-------hhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhh Confidence 443333222222 11 111123333222 2223444455555555421 1 0000 Q ss_pred ----HHHHHHhCCC--ceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHH Q lcl|Aclame:pro 70 ----RTERELEQRP--CLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFT 143 (711) Q Consensus 70 ----~~~~~~~g~p--~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~ 143 (711) ....+..++| .+++|..+.+|+..+|+.-.+.+.+.+. .+....+.+. T Consensus 62 ~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~yl~g~pv~~~~~--------------------------~~~~~~e~~~ 115 (474) T protein:vir:10 62 FETGGNVRRLDVSVNNKLNNSFDSEIVDTRVGYLHGVPVTYDLD--------------------------ENAEKNEKLK 115 (474) T ss_pred hhhcccccccccCcccccccchHHHHHHhHhhheeccceeEeeC--------------------------CCCcchHHHH Confidence 0112334444 4789999999999999987776655442 2223334456 Q ss_pred HHHHHHHhhcCHHHHHHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCcccee--eCCCccccCccccceeeee Q lcl|Aclame:pro 144 GLIKNIEYNCDAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLID 221 (711) Q Consensus 144 ~~~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~ 221 (711) ..+..+++.|+++.....+..+++++|.+|..++.+. ++++++..+ +|.+++ || +.. +.-+ +.+ T Consensus 116 ~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~------~~~~~~~~i-~p~~~~~v~d-~~~-----~~~~-~i~ 181 (474) T protein:vir:10 116 KFITNFAIRNSVDDEDSEIGKMAAICGYGARLAYIDT------NGDIRIKNI-DPYNVIFVGD-NIL-----EPTY-SLR 181 (474) T ss_pred HHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCC------CCeeEEEEE-cccceEEEEc-CCC-----ceEE-EEE Confidence 6777778889999999999999999999988776542 256777766 677765 33 111 1122 222 Q ss_pred ecCCHHHHHHhcCCcccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchh Q lcl|Aclame:pro 222 DTMSKEKFKALYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISI 301 (711) Q Consensus 222 ~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 301 (711) .|...++ .....+..+++|.... + +.+... T Consensus 182 ~~~~~~~------------------------~~~~~~~~~~~y~~~~----~--------~~~~~~-------------- 211 (474) T protein:vir:10 182 YFYEKDD------------------------DNGTDYVYAEFYDNAY----Y--------YVFRGE-------------- 211 (474) T ss_pred EEEEeeC------------------------CCceEEEEEEEEcCce----E--------EEEeec-------------- Confidence 2211000 0112223344442221 1 111000 Q ss_pred hhhcccceEEEEEEEEecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcC Q lcl|Aclame:pro 302 VRTRKVKTFKTYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAP 381 (711) Q Consensus 302 ~~~~~~~~~~v~~~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~ 381 (711) ..+...+.++.|.+.|.+|+|+|. +...+.|.+..+++.++.+|...|.+...+...+ T Consensus 212 ---------------~~~~~~~~~~~~~~~g~vPvv~~~-------n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~ 269 (474) T protein:vir:10 212 ---------------GIDALQEVGRYEHLFDYNPLFGVP-------NNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTR 269 (474) T ss_pred ---------------CCCcccccccccCCCCccceEEec-------CCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 001111223345555667776542 2345789999999999999999999999999888 Q ss_pred CCceEecccccCChHHHHhhcccCCCceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccc Q lcl|Aclame:pro 382 KAPFIGSEGNVEGREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGN 461 (711) Q Consensus 382 ~~~~~~~~~av~~~~~~~~~~~~~~~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~ 461 (711) ++.+++. |.- ..++.... ....+.+.+.++ ++.+.++..+.-..++...++...+.|-..|++++.+.+..++ T Consensus 270 ~~~l~i~-g~~-~~~~~~~~--~~~~~~i~~~~~---~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~ 342 (474) T protein:vir:10 270 LAYLVLR-GMG-MSEEMIQE--TQKSGAFELFDK---DMDVKYLTKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNG 342 (474) T ss_pred cchhhhc-cCC-CCchhhhh--hhhcceeEecCC---CCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcccccccccc Confidence 8877663 432 11222221 223345555432 2346677666666778888999999999999999888776666 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeee Q lcl|Aclame:pro 462 ETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIH 541 (711) Q Consensus 462 ~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~n 541 (711) +.||.|+..+..............|..++++++++++.++..-.... ...++ T Consensus 343 n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~---------~~~~~------------------- 394 (474) T protein:vir:10 343 NVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKGYNL---------DDDSY------------------- 394 (474) T ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCC---------Ccccc------------------- Confidence 78999998887776777777777777788777777776544321100 00010 Q ss_pred hhhheeeeEEeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCC-cchHHHHHHHHhhhcchhhcchhhhhh Q lcl|Aclame:pro 542 DLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDW-PGADVIAERLKKIVPPNVLSKDEREAI 620 (711) Q Consensus 542 D~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~-~~~~e~~~~l~~~~~~~~~~~~~~~~~ 620 (711) .++.+.=.+..+....+..+.+..+...++ ...+++.+++ .+.+...+++++.........+..... T Consensus 395 ------~~i~~~f~~~~p~d~~e~a~~~~kl~g~iS------~et~~~~l~~v~d~~~E~eri~~E~~e~~~~~~~~~~~ 462 (474) T protein:vir:10 395 ------LNLIFKFTRNIPVNKLEESQVLINLKGQVS------ERTRLGQSQLVDDVDYELDEMEKESLEFNDKLPDIDEG 462 (474) T ss_pred ------ccceEEeCCCCCCCHHHHHHHHHHHhccCc------hHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhcccccCC Confidence 011222222222223333344444432222 1233344432 344333333332221110000000000 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 621 EEDMPEQTEPTPEQQVEMAKSQADMAQAEAD 651 (711) Q Consensus 621 ~~~~~~~q~~~~~~q~~~~~~q~~~~k~qae 651 (711) ....+....+.+ T Consensus 463 -------------------~~~~~~~~~~s~ 474 (474) T protein:vir:10 463 -------------------DANDKSQNNQSE 474 (474) T ss_pred -------------------CcCCCCccccCC Confidence 000000000000 No 69 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=99.81 E-value=4.9e-19 Score=120.94 Aligned_cols=465 Identities=12% Similarity=0.033 Sum_probs=227.2 Q ss_pred CCcCC--------CCCCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCC--CCHHH- Q lcl|Aclame:pro 1 MAKKQ--------KKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQ--WPSQV- 69 (711) Q Consensus 1 ~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Q--w~~~~- 69 (711) ||..- +.+++--.. .+...-.+.+.+.+.+.. ....+...+.+....+..+||.|.| |.... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~e~~~~~i---~~~i~~~~~~~~r~~~l~~YY~g~~~i~~~~~~ 73 (483) T protein:vir:12 1 MAQALIKGGNILYPSQPTQTEI----FDAIVRTNNKPETLEEMI---VRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKP 73 (483) T ss_pred CccchhcCCceeecCcchhhhh----hhcccccCCchhhHHHHH---HHHHHHHHHHHHHHHHHHHHhcccccccccccc Confidence 65432 122211111 111111112222222222 2222233345667778899999975 11110 Q ss_pred --HHHHHHhCCC--ceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHH Q lcl|Aclame:pro 70 --RTERELEQRP--CLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGL 145 (711) Q Consensus 70 --~~~~~~~g~p--~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~ 145 (711) ........+| .+++|..+.+|+..+|+.-.+.+.+. ..|.+..+.+ T Consensus 74 ~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l~G~p~~~~---------------------------~~d~~~~~~l--- 123 (483) T protein:vir:12 74 VDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFK---------------------------HTDDEVVKRI--- 123 (483) T ss_pred ccccccccccccccccccchHHHHHHHHhhhhcccCceec---------------------------cCChHHHHHH--- Confidence 0011112222 37799999999999998766544432 1333443433 Q ss_pred HHHHHhhcCHHHHHHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCcccee--eCCCccccCccccceeeeeec Q lcl|Aclame:pro 146 IKNIEYNCDAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDT 223 (711) Q Consensus 146 ~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~ 223 (711) +.++. ++++.....+..+++++|.||.-++.+. ++++++..+ +|.+++ ||+.... +. ..+++.| T Consensus 124 -~~~~~-n~~~~~~~~~~~~~~~~G~~y~~v~~d~------d~~~~i~~~-~p~~~~~v~d~~~~~----~~-~~~ir~~ 189 (483) T protein:vir:12 124 -DEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDE------EGEFKLFRV-PAEQGIPIWTDKEHE----EL-EAFIRMY 189 (483) T ss_pred -HHHHh-ccHHHHHHHHHHHHhhCCeEEEEEEEcC------CCceEEEEE-cccceEEEEcCCCCC----ce-EEEEEEE Confidence 33333 6788999999999999999998887652 256777776 788875 4542211 12 2223333 Q ss_pred CCHHHHHHhcCCcccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhh Q lcl|Aclame:pro 224 MSKEKFKALYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVR 303 (711) Q Consensus 224 ~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 303 (711) ...+. . .+++|...... .+....+........ T Consensus 190 ~~~~~---------------------------~---~~~~y~~~~v~--~~~~~~~~~~~~~~~---------------- 221 (483) T protein:vir:12 190 KLENE---------------------------T---KVEYWDKVTVN--YYVYENGSLIPDYSN---------------- 221 (483) T ss_pred Eeecc---------------------------e---EEEEEecCeEE--EEEEeCCeeeecccc---------------- Confidence 21100 0 12333221111 111111111100000 Q ss_pred hcccceEEEEEEEEecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCC Q lcl|Aclame:pro 304 TRKVKTFKTYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKA 383 (711) Q Consensus 304 ~~~~~~~~v~~~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~ 383 (711) ..........|.+.+.+|+|+|. +...+.|.+..+++.++.+|..+|.+.+.+...+.+ T Consensus 222 --------------~~~~~~~~~~~~~~g~vPvv~~~-------nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~ 280 (483) T protein:vir:12 222 --------------NLENSKTHFSTGSWGKIPFIPFK-------NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNEL 280 (483) T ss_pred --------------cccccccccccCCCCccceEEec-------CCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCc Confidence 00000011234455667776542 234577999999999999999999999999988888 Q ss_pred ceEecccccCChHHHHhhcccCCCceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchh Q lcl|Aclame:pro 384 PFIGSEGNVEGREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNET 463 (711) Q Consensus 384 ~~~~~~~av~~~~~~~~~~~~~~~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~ 463 (711) .+++.-...++..+ +... .+.++++.+..+ +.+.++..+.-...+...++...+.|-..|++.+.+.+..+++. T Consensus 281 ~lv~~g~~~~~~~~-~~~~-~~~~~~~~~~~~----~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~ 354 (483) T protein:vir:12 281 TYVLTNYDDQELPE-FKRL-LRYYGAIKVSDN----GGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAP 354 (483) T ss_pred eeeeecCCcccchh-HHHh-hhhccccccCCC----CcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCCCccccccCc Confidence 77764333332222 2221 233445554432 33566665556677788889999999999999988877666778 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehh Q lcl|Aclame:pro 464 SGRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDL 543 (711) Q Consensus 464 sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~ 543 (711) ||.|+..+............+.|..+++++.+++++++. +.+ ++ T Consensus 355 Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~~~----------~~~-----~~--------------------- 398 (483) T protein:vir:12 355 SGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFD----------IKG-----EH--------------------- 398 (483) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc----------CCC-----cc--------------------- Confidence 999988876666666666666666666666666554421 111 10 Q ss_pred hheeeeEEeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCC-cchHHHHHHHHhhhcchhhcchhhhhhhh Q lcl|Aclame:pro 544 NVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDW-PGADVIAERLKKIVPPNVLSKDEREAIEE 622 (711) Q Consensus 544 ~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~-~~~~e~~~~l~~~~~~~~~~~~~~~~~~~ 622 (711) .++.|.=.+..+....+..+.+..+...++. ..+++.+++ .+.+.-.+++++.........+....... T Consensus 399 ----~~i~v~f~~~~p~~~~~~a~~~~kl~GiiS~------et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~ 468 (483) T protein:vir:12 399 ----KDVDISFNYNKVANTELQVQTAQQSMGIVSH------ETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGA 468 (483) T ss_pred ----ceeeEEeCCCCCCCHHHHHHHHHHHhccCch------HHHHHhCCCCCCHHHHHHHHHHHHHHHHhhccccccccc Confidence 0122222333333333344444444333222 223344332 33333333332221110000000000000 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 623 DMPEQTEPTPEQQVEMAKSQADMAQAEADTA 653 (711) Q Consensus 623 ~~~~~q~~~~~~q~~~~~~q~~~~k~qae~~ 653 (711) ... +.....-+.+.+ T Consensus 469 d~~----------------~~~~~~~~~e~e 483 (483) T protein:vir:12 469 DGA----------------QQQERSNNKESE 483 (483) T ss_pred CCc----------------ccCCCCCcccCC Confidence 000 000000000000 No 70 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=99.81 E-value=9.7e-19 Score=119.35 Aligned_cols=464 Identities=11% Similarity=0.039 Sum_probs=228.6 Q ss_pred CCcCCCCCCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHH-----HHHH Q lcl|Aclame:pro 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRT-----EREL 75 (711) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~-----~~~~ 75 (711) |-+.....+-.+--+..-+..........++|.++...+. ..+....+..+||.|+|..-.-.. .... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~-------~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~ 73 (474) T protein:vir:97 1 MFNIIRMPWDKPYGEEVVEQLKPQFETQEEMIVRLIDDHR-------KQLDKITVGQRYYDKDNDIVKQMKKVDVHGNID 73 (474) T ss_pred CcccccccCCCchhhHHHHhhhhcccCHHHHHHHHHHHHH-------HHHHHHHHHHHHhccccchhcccchhccccccc Confidence 6555544333332222222222222334445555544332 334556677889999863211100 1112 Q ss_pred hCCC--ceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 76 EQRP--CLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNC 153 (711) Q Consensus 76 ~g~p--~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~ 153 (711) .++| .+++|..+.+|+..+|+.-.+.+.+. ..|.+..+. ++.+.+ + T Consensus 74 ~~~~~~ki~~n~~k~Ivd~~~~~l~g~p~~~~---------------------------~~d~~~~~~----l~~~~~-n 121 (474) T protein:vir:97 74 YDKPDWRITTNFHQNLVDQKVSYVASKPVTYS---------------------------CEDENVLKV----IHDVLD-T 121 (474) T ss_pred cccCcceeecchHHHHHHHHHhhhhcCCceec---------------------------cCcHHHHHH----HHHHHh-c Confidence 3333 36899999999999999876655442 123334333 344444 6 Q ss_pred CHHHHHHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCcccee--eCCCccccCccccceeeeeecCCHHHHHH Q lcl|Aclame:pro 154 DAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDTMSKEKFKA 231 (711) Q Consensus 154 ~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~ 231 (711) ++......+.++++++|.||..++.+. ++.+++..+ +|..++ ||+... .+..+ +++.|... T Consensus 122 ~~~~~~~e~~~~~~~~G~~~~~~~~d~------~~~~~i~~~-~p~~~~~v~d~~~~----~~~~~-~ir~~~~~----- 184 (474) T protein:vir:97 122 RWDNKLIDILTATSNKGIDWLQVYINE------NGEMKLFRV-PAEQAIPIWVDKER----EELKS-FIRYYKFN----- 184 (474) T ss_pred cHHHHHHHHHHHHhhcCceEEEEEecC------CCeeEEEEE-cccceEEEEcCCCC----CceEE-EEEEEEec----- Confidence 899999999999999999998776642 256777766 787777 443221 12222 23332100 Q ss_pred hcCCcccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEE Q lcl|Aclame:pro 232 LYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFK 311 (711) Q Consensus 232 ~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~ 311 (711) ....+++|....... +....+.... . T Consensus 185 -------------------------~~~~~~~yt~~~~~~--y~~~~~~~~~-~-------------------------- 210 (474) T protein:vir:97 185 -------------------------NEEKVEFWTDTTVTY--YVLENGGLIP-D-------------------------- 210 (474) T ss_pred -------------------------CeEEEEEEeCCeEEE--EEEcCCcccc-c-------------------------- Confidence 001123333221110 1111111000 0 Q ss_pred EEEEEEecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccc Q lcl|Aclame:pro 312 TYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGN 391 (711) Q Consensus 312 v~~~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~a 391 (711) .......+.....|.+.+.+|+++|. +...|.|.+..+++.++.+|...|.+.+.+...+.+.+++.... T Consensus 211 ---~~~~~~~~~~~~~~~~~g~vPvv~~~-------nn~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~ 280 (474) T protein:vir:97 211 ---YYYGANHVQSHFSNGNWGRVPFIAFK-------NNPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYE 280 (474) T ss_pred ---cccCcCcccccccccCCCccceEEec-------CCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCC Confidence 00011112223344456667776542 23457899999999999999999999999998888888776444 Q ss_pred cCChHHHHhhcccCCCceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHH Q lcl|Aclame:pro 392 VEGREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIAR 471 (711) Q Consensus 392 v~~~~~~~~~~~~~~~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~ 471 (711) .++..+... +...+.++.+..+ +.+.++..+.-...+...++.....|-..|++++.+.+.-+++.||.|+..+ T Consensus 281 ~~~~~~~~~--~~~~~~~i~~~~~----~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~ 354 (474) T protein:vir:97 281 GEDLEEFMR--GLKYYKAINVDGD----GGVETIQVEVPVSSTKEYIDLMRVYIMEFGQGVDFQTDKFGSAPSGIALKFL 354 (474) T ss_pred cccchhhhh--hhhccceeeccCC----CceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHHH Confidence 443333332 2234455655443 3366666665667777888999999999999988776665667899998877 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEE Q lcl|Aclame:pro 472 QRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVV 551 (711) Q Consensus 472 ~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~ 551 (711) ..............|..+++++.++++ .+... . .++. ++. T Consensus 355 ~~~l~~k~~~k~~~~~~~l~~~~~li~----~~~~~------~-----~d~~-------------------------~i~ 394 (474) T protein:vir:97 355 YGNLDLKANKLKNKATVAIQELISFII----DFNNL------K-----TDVK-------------------------DIE 394 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH----HHhCC------C-----cccc-------------------------eee Confidence 665555555555555556666555544 44321 0 1110 011 Q ss_pred eecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCC-cchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHH Q lcl|Aclame:pro 552 VTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDW-PGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEP 630 (711) Q Consensus 552 v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~-~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~ 630 (711) +.=.++.+....+..+.+... ..++ ...++..+++ ...+.-.++++..........+........ T Consensus 395 v~f~~~~p~~~~e~a~~~~~~-g~iS------~et~l~~l~~v~D~~~E~eri~~E~~~~~~~~~~~~~~~~~------- 460 (474) T protein:vir:97 395 ISFNFNRMMNDAEQSQIIAQS-QYLS------RETLVKSSPLVDDYKAELERIEQEQMEYNKQLPNLDDGGAD------- 460 (474) T ss_pred EEeccCcccCHHHHHHHHHHc-CCCC------HHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhccccCCCCCC------- Confidence 111112221112222223221 1111 1222333332 222222333322111100000000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 631 TPEQQVEMAKSQADMAQAEADTAQAQAD 658 (711) Q Consensus 631 ~~~~q~~~~~~q~~~~k~qae~~~aqae 658 (711) ......+.+.. +.| T Consensus 461 ------------~~~~~~~~~~~--~~e 474 (474) T protein:vir:97 461 ------------GAQQQEGSNNK--ESE 474 (474) T ss_pred ------------CcccCCCCccc--ccC Confidence 00000000000 000 No 71 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=99.81 E-value=9.7e-19 Score=119.35 Aligned_cols=464 Identities=11% Similarity=0.039 Sum_probs=228.6 Q ss_pred CCcCCCCCCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHH-----HHHH Q lcl|Aclame:pro 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRT-----EREL 75 (711) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~-----~~~~ 75 (711) |-+.....+-.+--+..-+..........++|.++...+. ..+....+..+||.|+|..-.-.. .... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~-------~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~ 73 (474) T protein:vir:94 1 MFNIIRMPWDKPYGEEVVEQLKPQFETQEEMIVRLIDDHR-------KQLDKITVGQRYYDKDNDIVKQMKKVDVHGNID 73 (474) T ss_pred CcccccccCCCchhhHHHHhhhhcccCHHHHHHHHHHHHH-------HHHHHHHHHHHHhccccchhcccchhccccccc Confidence 6555544333332222222222222334445555544332 334556677889999863211100 1112 Q ss_pred hCCC--ceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 76 EQRP--CLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNC 153 (711) Q Consensus 76 ~g~p--~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~ 153 (711) .++| .+++|..+.+|+..+|+.-.+.+.+. ..|.+..+. ++.+.+ + T Consensus 74 ~~~~~~ki~~n~~k~Ivd~~~~~l~g~p~~~~---------------------------~~d~~~~~~----l~~~~~-n 121 (474) T protein:vir:94 74 YDKPDWRITTNFHQNLVDQKVSYVASKPVTYS---------------------------CEDENVLKV----IHDVLD-T 121 (474) T ss_pred cccCcceeecchHHHHHHHHHhhhhcCCceec---------------------------cCcHHHHHH----HHHHHh-c Confidence 3333 36899999999999999876655442 123334333 344444 6 Q ss_pred CHHHHHHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCcccee--eCCCccccCccccceeeeeecCCHHHHHH Q lcl|Aclame:pro 154 DAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDTMSKEKFKA 231 (711) Q Consensus 154 ~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~ 231 (711) ++......+.++++++|.||..++.+. ++.+++..+ +|..++ ||+... .+..+ +++.|... T Consensus 122 ~~~~~~~e~~~~~~~~G~~~~~~~~d~------~~~~~i~~~-~p~~~~~v~d~~~~----~~~~~-~ir~~~~~----- 184 (474) T protein:vir:94 122 RWDNKLIDILTATSNKGIDWLQVYINE------NGEMKLFRV-PAEQAIPIWVDKER----EELKS-FIRYYKFN----- 184 (474) T ss_pred cHHHHHHHHHHHHhhcCceEEEEEecC------CCeeEEEEE-cccceEEEEcCCCC----CceEE-EEEEEEec----- Confidence 899999999999999999998776642 256777766 787777 443221 12222 23332100 Q ss_pred hcCCcccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEE Q lcl|Aclame:pro 232 LYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFK 311 (711) Q Consensus 232 ~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~ 311 (711) ....+++|....... +....+.... . T Consensus 185 -------------------------~~~~~~~yt~~~~~~--y~~~~~~~~~-~-------------------------- 210 (474) T protein:vir:94 185 -------------------------NEEKVEFWTDTTVTY--YVLENGGLIP-D-------------------------- 210 (474) T ss_pred -------------------------CeEEEEEEeCCeEEE--EEEcCCcccc-c-------------------------- Confidence 001123333221110 1111111000 0 Q ss_pred EEEEEEecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccc Q lcl|Aclame:pro 312 TYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGN 391 (711) Q Consensus 312 v~~~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~a 391 (711) .......+.....|.+.+.+|+++|. +...|.|.+..+++.++.+|...|.+.+.+...+.+.+++.... T Consensus 211 ---~~~~~~~~~~~~~~~~~g~vPvv~~~-------nn~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~ 280 (474) T protein:vir:94 211 ---YYYGANHVQSHFSNGNWGRVPFIAFK-------NNPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYE 280 (474) T ss_pred ---cccCcCcccccccccCCCccceEEec-------CCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCC Confidence 00011112223344456667776542 23457899999999999999999999999998888888776444 Q ss_pred cCChHHHHhhcccCCCceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHH Q lcl|Aclame:pro 392 VEGREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIAR 471 (711) Q Consensus 392 v~~~~~~~~~~~~~~~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~ 471 (711) .++..+... +...+.++.+..+ +.+.++..+.-...+...++.....|-..|++++.+.+.-+++.||.|+..+ T Consensus 281 ~~~~~~~~~--~~~~~~~i~~~~~----~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~ 354 (474) T protein:vir:94 281 GEDLEEFMR--GLKYYKAINVDGD----GGVETIQVEVPVSSTKEYIDLMRVYIMEFGQGVDFQTDKFGSAPSGIALKFL 354 (474) T ss_pred cccchhhhh--hhhccceeeccCC----CceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHHH Confidence 443333332 2234455655443 3366666665667777888999999999999988776665667899998877 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEE Q lcl|Aclame:pro 472 QRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVV 551 (711) Q Consensus 472 ~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~ 551 (711) ..............|..+++++.++++ .+... . .++. ++. T Consensus 355 ~~~l~~k~~~k~~~~~~~l~~~~~li~----~~~~~------~-----~d~~-------------------------~i~ 394 (474) T protein:vir:94 355 YGNLDLKANKLKNKATVAIQELISFII----DFNNL------K-----TDVK-------------------------DIE 394 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH----HHhCC------C-----cccc-------------------------eee Confidence 665555555555555556666555544 44321 0 1110 011 Q ss_pred eecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCC-cchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHH Q lcl|Aclame:pro 552 VTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDW-PGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEP 630 (711) Q Consensus 552 v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~-~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~ 630 (711) +.=.++.+....+..+.+... ..++ ...++..+++ ...+.-.++++..........+........ T Consensus 395 v~f~~~~p~~~~e~a~~~~~~-g~iS------~et~l~~l~~v~D~~~E~eri~~E~~~~~~~~~~~~~~~~~------- 460 (474) T protein:vir:94 395 ISFNFNRMMNDAEQSQIIAQS-QYLS------RETLVKSSPLVDDYKAELERIEQEQMEYNKQLPNLDDGGAD------- 460 (474) T ss_pred EEeccCcccCHHHHHHHHHHc-CCCC------HHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhccccCCCCCC------- Confidence 111112221112222223221 1111 1222333332 222222333322111100000000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 631 TPEQQVEMAKSQADMAQAEADTAQAQAD 658 (711) Q Consensus 631 ~~~~q~~~~~~q~~~~k~qae~~~aqae 658 (711) ......+.+.. +.| T Consensus 461 ------------~~~~~~~~~~~--~~e 474 (474) T protein:vir:94 461 ------------GAQQQEGSNNK--ESE 474 (474) T ss_pred ------------CcccCCCCccc--ccC Confidence 00000000000 000 No 72 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=99.80 E-value=1.2e-19 Score=124.35 Aligned_cols=487 Identities=12% Similarity=0.065 Sum_probs=233.3 Q ss_pred CCcCCCC--CCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHH-H----- Q lcl|Aclame:pro 1 MAKKQKK--SRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRT-E----- 72 (711) Q Consensus 1 ~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~-~----- 72 (711) ||--.+. .-+..++.-..+.-......+.+.+.+ | .+.. .+....+..+||.|+|.-..... . T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~----~---i~~~--~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~ 71 (503) T protein:vir:59 1 MADIYPLGKTHTEELNEIIVESAKEIAEPDTTMIQK----L---IDEH--NPEPLLKGVRYYMCENDIEKKRRTYYDAAG 71 (503) T ss_pred CcccccCChhhHHhHHHhhhhhhhhccchhHHHHHH----H---HHhh--cHHHHHHHHHHhccccchhhccchhccccc Confidence 5432210 001111000000000011111112222 2 1111 23456788899999874221111 1 Q ss_pred --HHHhCCC--ceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHH Q lcl|Aclame:pro 73 --RELEQRP--CLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKN 148 (711) Q Consensus 73 --~~~~g~p--~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~ 148 (711) ....++| .+.+|..+.+|+..+|+.-.+.+.+. .+|.+..+.+ +. T Consensus 72 ~~~~~~~~~~~ri~~n~~~~ivd~~~~yl~g~~~~~~---------------------------~~d~~~~~~l----~~ 120 (503) T protein:vir:59 72 QQLVDDTKTNNRTSHAWHKLFVDQKTQYLVGEPVTFT---------------------------SDNKTLLEYV----NE 120 (503) T ss_pred ccccccccccceeecchHHHHHHHHHhhhhcCCeeec---------------------------cCcHHHHHHH----HH Confidence 1122233 46789999999999999876655442 1333444433 43 Q ss_pred HHhhcCHHHHHHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCcccee--eCCCccccCccccceeeeeecCCH Q lcl|Aclame:pro 149 IEYNCDAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDTMSK 226 (711) Q Consensus 149 ~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~ 226 (711) .. .++++.....+.++++++|.+|+.++++. ++++++..+ +|.+++ ||+... . .. .++++.|... T Consensus 121 ~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d~------dg~~~i~~~-~p~~~~~i~d~~~~-~---~~-~~~ir~~~~~ 187 (503) T protein:vir:59 121 LA-DDDFDDILNETVKNMSNKGIEYWHPFVDE------EGEFDYVIF-PAEEMIVVYKDNTR-R---DI-LFALRYYSYK 187 (503) T ss_pred HH-hcCHHHHHHHHHHHHhhCCeEEEEEeecC------CCceEEEEE-ccceeEEEEeCCCC-C---ce-EEEEEEEEEe Confidence 33 37899999999999999999998887653 257888777 787776 554321 1 12 2223333211 Q ss_pred HHHHHhcCCcccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcc Q lcl|Aclame:pro 227 EKFKALYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRK 306 (711) Q Consensus 227 ~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~ 306 (711) + ...+.+..+|+|....... +....+.+..-.... .. T Consensus 188 --------~-----------------~~~~~~~~~evy~~~~i~~--~~~~~~~~~~~~~~~--------------~~-- 224 (503) T protein:vir:59 188 --------G-----------------IMGEETQKAELYTDTHVYY--YEKIDGVYQMDYSYG--------------EN-- 224 (503) T ss_pred --------c-----------------CCCceEEEEEEEeCCcEEE--EEEcCCccccccccc--------------cc-- Confidence 0 0112344556665543211 111111111000000 00 Q ss_pred cceEEEEEEEEecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceE Q lcl|Aclame:pro 307 VKTFKTYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFI 386 (711) Q Consensus 307 ~~~~~v~~~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~ 386 (711) -....+.....|++.+.+||++|. +...+.|.+..+++.++.+|..+|.+.+.+...+.+.++ T Consensus 225 ----------~~~~~~~~~~~~~~~~~vPiv~~~-------nn~~~~sd~~~~~~liDa~d~~~s~~~~~~~~~~~~~~v 287 (503) T protein:vir:59 225 ----------NPRPHMTKGGQAIGWGRVPIIPFK-------NNEEMVSDLKFYKDLIDNYDSITSSTMDSFSDFQQIVYV 287 (503) T ss_pred ----------ccccceeecceeccCCccceEEec-------CCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhcCCeeE Confidence 001112234456677888887653 233478999999999999999999999999998888877 Q ss_pred ecccccCChHHHHhhcccCCCceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHH Q lcl|Aclame:pro 387 GSEGNVEGREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGR 466 (711) Q Consensus 387 ~~~~av~~~~~~~~~~~~~~~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ 466 (711) +...-.++..+... +.+.+.++.+..+ +.+.++....-.......++.....|...+++++.+.+..+++.||. T Consensus 288 ~~g~~~~~~~~~~~--~~~~~~~~~~~~~----~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~~Sg~ 361 (503) T protein:vir:59 288 LKNYDGENPKEFTA--NLRYHSVIKVSGD----GGVDTLRAEIPVDSAAKELERIQDELYKSAQAVDNSPETIGGGATGP 361 (503) T ss_pred eecCCccccchhhh--hhhcccceeccCC----CcceeEeccCCHHHHHHHHHHHHHHHHHHhcccCCCcccccccccHH Confidence 65333333222222 1233445555432 23556655555667778888889999999998887766666778999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhhe Q lcl|Aclame:pro 467 AIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQ 546 (711) Q Consensus 467 ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~ 546 (711) |+..+..............|..+++++.++++.++....... ..+ T Consensus 362 Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~----------~~~------------------------- 406 (503) T protein:vir:59 362 ALENLYALLDLKANMAERKIRAGLRLFFWFFAEYLRNTGKGD----------FNP------------------------- 406 (503) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcc----------ccc------------------------- Confidence 998876666666666666666677766666665553322110 000 Q ss_pred eeeEEeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCC-cchHHHHHHHHhhhcchhhcchhhhhhhhhHH Q lcl|Aclame:pro 547 KYDVVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDW-PGADVIAERLKKIVPPNVLSKDEREAIEEDMP 625 (711) Q Consensus 547 ~~dv~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~-~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~ 625 (711) ..+|.+.=.+..+....+..+.+..+..+. -+....+++.+++ ++.++-.+++.+.......... T Consensus 407 ~~~i~i~f~~~~p~d~~~~~~~~~kl~~~G----iiS~et~l~~l~~v~d~~~E~~ri~~E~~~~~~~~~---------- 472 (503) T protein:vir:59 407 DKELTMTFTRTRIQNDSEIVQSLVQGVTGG----IMSKETAVARNPFVQDPEEELARIEEEMNQYAEMQG---------- 472 (503) T ss_pred ccceeEEeCCCCCCCHHHHHHHHHHHHhCC----CCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhhc---------- Confidence 012222222333333334444555543321 0111223333332 2222222222221100000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 626 EQTEPTPEQQVEMAKSQADMAQAEADT-AQAQADMLKAQLETEEAQKQLA 674 (711) Q Consensus 626 ~~q~~~~~~q~~~~~~q~~~~k~qae~-~~aqae~~~~q~~~~~~q~q~~ 674 (711) ............+.+. ..-+++...+ -++ + T Consensus 473 -----------~~~~~~~~~~~~~~~~~~~~~~~~~~~-g~~-------~ 503 (503) T protein:vir:59 473 -----------NLLDDEGGDDDLEEDDPNAGAAESGGA-GQV-------S 503 (503) T ss_pred -----------cccCccCCCCCCCcCCCCCCcccCCCC-CCc-------C Confidence 0000000000000000 0000000000 000 0 No 73 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=99.80 E-value=4.4e-18 Score=115.77 Aligned_cols=438 Identities=10% Similarity=0.041 Sum_probs=227.6 Q ss_pred chHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHH-----HHHHHhCCC--ceEehhhHHHHHHHhhhhh Q lcl|Aclame:pro 26 DDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVR-----TERELEQRP--CLVNNVLPTFVDQVLGDQR 98 (711) Q Consensus 26 ~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~-----~~~~~~g~p--~~~~N~i~~~v~~i~g~~~ 98 (711) .. .+.+.++.+. ....+.+..+..+||.|++....-. ......+.| .+.+|..+.+|+..+|+.- T Consensus 1 l~-~~~i~~~i~~-------~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~yl~ 72 (451) T protein:vir:10 1 ME-LEKIRAIISA-------DAARRQEILQAKSYYYNKNDILKKGVVVQNRDENPLRNADNRISHNFHEILVDEKASYMF 72 (451) T ss_pred CC-HHHHHHHHHH-------HHHHHHHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHhhhhhee Confidence 11 1123333222 2334566778899999986422110 011112223 4778999999999999887 Q ss_pred hcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCccEEEEEE Q lcl|Aclame:pro 99 QNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESGMGYLRVRS 178 (711) Q Consensus 99 ~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~ 178 (711) .+.+.+.. +++.+..+ +++... .|+++.....+.++++++|.||.-++. T Consensus 73 G~p~~~~~--------------------------~~~~~~~~----~~~~~~-~n~~~~~~~~~~~~~~~~G~a~~~~y~ 121 (451) T protein:vir:10 73 TYPVLFDI--------------------------DNNKELNE----KVTDVL-GNEFTRKAKNLAIEASNCGSAWLHYWI 121 (451) T ss_pred cccceeec--------------------------CCcHHHHH----HHHHHh-ccCHHHHHHHHHHHHhhcCeEEEEEee Confidence 66554432 22333333 334444 478999999999999999999988877 Q ss_pred eeccCC--CCCCcceEEEecCcccee--eCCCccccCccccceeeeeecCCHHHHHHhcCCcccchhhcccccccccCCC Q lcl|Aclame:pro 179 DYLADD--SFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSVADYDTWFT 254 (711) Q Consensus 179 d~~~~~--~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~ 254 (711) +....+ +..+++.+..+ +|.+++ ||.... .-...+++.|...++- ..-.. T Consensus 122 de~~~~~~~~~~~~~~~~i-~p~~~~~vydd~~~-----~~~~~~ir~~~~~~~~--------------------~~~~~ 175 (451) T protein:vir:10 122 DEEYSGEQVTNQTFKYGVV-NTEEIIPIYRNGIE-----RELEAVIRYYIQLEDV--------------------KGQIQ 175 (451) T ss_pred cCCcccccccccceeEEEE-cccceEEEEcCCCC-----CceEEEEEEEEeeecc--------------------ccccc Confidence 643322 23467778777 788776 443221 1123333333221110 00011 Q ss_pred CCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEEecCceeccCccCCCCcc Q lcl|Aclame:pro 255 EKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGPVEIPSTTI 334 (711) Q Consensus 255 ~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~le~~~p~~~~~~ 334 (711) .+.+..+|+|..... +.+... . ....+..++..+.|.+.+.+ T Consensus 176 ~~~~~~~e~yt~~~~----~~~~~~------~----------------------------~~~~~~~~~~~~~~~~~g~v 217 (451) T protein:vir:10 176 KQAYTYVEFWTDKIL----DKYKFF------G----------------------------VSCCGSQIEHITVQHRFNSV 217 (451) T ss_pred ceEEEEEEEEeCCeE----EEEEec------c----------------------------cCccccccccccccCCCCee Confidence 223344455433211 000000 0 00112223334445555666 Q ss_pred ceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHHHHhhcccCCCceEEecc Q lcl|Aclame:pro 335 PVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNFSLLTYIP 414 (711) Q Consensus 335 P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~~~~~~~~~~i~~~~ 414 (711) |+++|. +...+.|.+..++++++.+|.++|.+.+.+.-.+++.+++.-.......+.... .+..+++.+.+ T Consensus 218 Pvv~~~-------nn~~~~~d~e~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~--~~~~~~i~~~~ 288 (451) T protein:vir:10 218 PFVEFS-------NNIKKQSDLSKYKKILDLYDRVMSGFANDLEDIQQIIYILENFGGEDTSEFLKE--LKRYKTIKTET 288 (451) T ss_pred eEEEec-------cCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccchhhHHH--HhhCCeEEecC Confidence 666542 223467999999999999999999999999999888777643222222232222 23445555554 Q ss_pred ccc-CcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 415 QYQ-GDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTKSIRRV 493 (711) Q Consensus 415 ~~~-~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~ 493 (711) ... .++.+.++..+.-..+....++.....|-..|++++.+.+.. ++.||.|+..+-..........-..|..+.+++ T Consensus 289 ~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-gn~Sg~Alk~~~~~l~~k~~~k~~~f~~~l~~~ 367 (451) T protein:vir:10 289 DSEGDSGGLKTMQIEIPTEARKIILEILKKQIYESGQGLQQDTENF-GNASGVALKFFYRKLELKSGLLETEFRTSFDKL 367 (451) T ss_pred cCCccCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccc-ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 322 234577777777778888899999999999999887665544 357999998887666666666666666666666 Q ss_pred HHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEeecccChHHHHHHHHHHHHHHH Q lcl|Aclame:pro 494 GKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFA 573 (711) Q Consensus 494 ~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~ 573 (711) .++++.++..+ ++. ++.+.=.+..+....+..+.+..+. T Consensus 368 ~~li~~~~~~~----------------d~~-------------------------~i~i~f~~~~p~n~~e~~~~~~kl~ 406 (451) T protein:vir:10 368 IKAILYFLGVT----------------DYK-------------------------KIQQTYTRNMMSNDLEDADIATKSV 406 (451) T ss_pred HHHHHHHhCCC----------------Ccc-------------------------ceeEEecCCCCCCHHHHHHHHHHHh Confidence 66665543210 000 1111112222222223333444433 Q ss_pred hhcchhHHHHHHHHHHhcCCc-chHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 574 QAVPSAAAVMADLIAQNMDWP-GADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQA 643 (711) Q Consensus 574 ~~~p~~~~~~~~~~~~~~~~~-~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~ 643 (711) ..++. ..++..+++. +.++..+.+.+. .+....+.+......-. T Consensus 407 g~iS~------et~~~~~p~v~d~~~e~~~~~ee--------------------~~~~~~~~~~~~~~~~~ 451 (451) T protein:vir:10 407 GIIPT------KIILRHHPWVDDVEEAEKLYLEE--------------------KKIQASKVSDDYNNFTE 451 (451) T ss_pred ccCch------HHHHHhCCCCCCHHHHHHHHHHH--------------------HHHHHHHHHhhcCCCCC Confidence 22221 2223333321 111111111100 00000000000000000 No 74 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=99.80 E-value=3.4e-18 Score=116.33 Aligned_cols=462 Identities=13% Similarity=0.051 Sum_probs=225.0 Q ss_pred CCcCCCCCCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCC--CCHHHH---HHHHH Q lcl|Aclame:pro 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQ--WPSQVR---TEREL 75 (711) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Q--w~~~~~---~~~~~ 75 (711) |-+-.++.+ +. ..+--.-..+.+...+.+.++.+ ...+.+....+..+||.|++ +..... ..... T Consensus 21 ~~~~~~~~~-~~--~~~~~~~~~~~~~~~~~i~~~i~-------~~~~~~~r~~~l~~YY~g~~~I~~~~~~~~~~~~~~ 90 (492) T protein:vir:94 21 LYPSQPTQT-EI--FDAIVRTNNKPETLEEMIVRYIK-------QHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVD 90 (492) T ss_pred eecCccchh-hh--hhcccccCCchhhHHHHHHHHHH-------HHHHHHHHHHHHHHHhcccccccccccccccccccc Confidence 221112211 11 11111111222223333333332 22344566778899999974 111000 00111 Q ss_pred hCCC--ceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 76 EQRP--CLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNC 153 (711) Q Consensus 76 ~g~p--~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~ 153 (711) ..+| .+++|..+.+|+..+|+.-.+.+.+. .+|.+..+.+.. ++ .+ T Consensus 91 ~~~~~~ri~~n~~k~Ivd~~~~yl~G~p~~~~---------------------------~~d~~~~~~l~~----~~-~n 138 (492) T protein:vir:94 91 PLKPDDRMITNFHANLVDQKVSYIVGKPIAFK---------------------------HTDDEVVKRIDE----VL-GN 138 (492) T ss_pred ccccccccccchHHHHHHHHHhhhcccCceec---------------------------cCchHHHHHHHH----HH-hc Confidence 2222 36789999999999998765544432 133344444443 33 36 Q ss_pred CHHHHHHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCcccee--eCCCccccCccccceeeeeecCCHHHHHH Q lcl|Aclame:pro 154 DAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDTMSKEKFKA 231 (711) Q Consensus 154 ~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~ 231 (711) +++.....+..+++++|.||.-++.+. ++++++..+ +|.+++ ||+.... +.. .+++.|-..+ T Consensus 139 ~~~~~~~~~~~~a~~~G~a~~~v~~d~------dg~~~~~~~-~p~~~~~v~d~~~~~----~~~-a~ir~~~~~~---- 202 (492) T protein:vir:94 139 RFDDKLHSVLTGASNKGIEWLHPYLDE------EGEFKLFRV-PAEQGIPIWTDKEHE----ELE-AFIRMYKLEN---- 202 (492) T ss_pred cHHHHHHHHHHHHhhCCeEEEEEEecC------CCceEEEEE-cccceEEEEcCCCCC----ceE-EEEEEEeecc---- Confidence 899999999999999999998877642 256778777 787765 5542211 122 2333332100 Q ss_pred hcCCcccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEE Q lcl|Aclame:pro 232 LYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFK 311 (711) Q Consensus 232 ~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~ 311 (711) . ..+++|...... .+....+.... +. T Consensus 203 ---------------~-----------~~~~~y~~~~v~--~~~~~~~~~~~-~~------------------------- 228 (492) T protein:vir:94 203 ---------------E-----------TKVEYWDKVTVN--YYVYENGSLIP-DY------------------------- 228 (492) T ss_pred ---------------c-----------eeEEEEecCeEE--EEEEecCeeee-cc------------------------- Confidence 0 012333221111 11111111110 00 Q ss_pred EEEEEEecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccc Q lcl|Aclame:pro 312 TYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGN 391 (711) Q Consensus 312 v~~~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~a 391 (711) ...+........|.+.+.+|+|+|. +...+.|.+..+++.++.+|..+|.+.+.+...+.+.+++.-.. T Consensus 229 ----~~~~~~~~~~~~~~~~g~vPvv~~~-------nn~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~ 297 (492) T protein:vir:94 229 ----SNNLENSKTHFSTGSWGKIPFIPFK-------NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKNYD 297 (492) T ss_pred ----ccccccccccccccCCCccceEEec-------CCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCC Confidence 0000011112244556677777653 22347799999999999999999999999998888877764222 Q ss_pred cCChHHHHhhcccCCCceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHH Q lcl|Aclame:pro 392 VEGREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIAR 471 (711) Q Consensus 392 v~~~~~~~~~~~~~~~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~ 471 (711) ..+..+ +.. ..+...++.+..+ +.+.++..+.-..++...++...+.|..+|++++.+.+.-+++.||.|+..+ T Consensus 298 ~~~~~~-~~~-~~~~~~~~~~~~~----~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~ 371 (492) T protein:vir:94 298 DQELPE-FKR-LLRYYGAIKVSDN----GGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFL 371 (492) T ss_pred cccchh-hHH-HHhhccceecCCC----CcceeEeccCCHHHHHHHHHHHHHHHHHHhCCcCCCccccccCchHHHHHHH Confidence 222222 221 1223344544332 2355665555666778888999999999999988877766677899998887 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEE Q lcl|Aclame:pro 472 QRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVV 551 (711) Q Consensus 472 ~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~ 551 (711) -...........+.|..+++++.++++.++.. .++ + .++. T Consensus 372 ~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~----------~~~-----~-------------------------~~i~ 411 (492) T protein:vir:94 372 YTNLNLKADKLARKAKVAIQELLWFVFEHFDI----------KGE-----H-------------------------KDVD 411 (492) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC----------Ccc-----c-------------------------ceee Confidence 66666666666666777777766665554321 110 0 0122 Q ss_pred eecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCC-cchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHH Q lcl|Aclame:pro 552 VTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDW-PGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEP 630 (711) Q Consensus 552 v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~-~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~ 630 (711) +.=.++.+....+..+.+..+...++. ..+++.+++ .+.+.-.+++++.........+.. T Consensus 412 v~f~~~~p~~~~e~~~~~~kl~giiS~------et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~------------- 472 (492) T protein:vir:94 412 ISFNYNKVANTELQVQTAQQSMGIVSH------ETVLENHPFVEDLQAELERIEQEQMEYNKQLPNL------------- 472 (492) T ss_pred EEecCCCCCCHHHHHHHHHHHhccCch------HHHHHhCCCCCCHHHHHHHHHHHHHHHHhhcccc------------- Confidence 222333333333344444444332222 223344432 233333333322111000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 631 TPEQQVEMAKSQADMAQAEADTAQAQAD 658 (711) Q Consensus 631 ~~~~q~~~~~~q~~~~k~qae~~~aqae 658 (711) ......-..-..+....+.| T Consensus 473 --------~~~~~~~~~~~~~~~~~e~e 492 (492) T protein:vir:94 473 --------DDGGADSAQQQERSNNKESE 492 (492) T ss_pred --------ccccCCCCccccCCccccCC Confidence 00000000000000000000 No 75 >protein:vir:96988 Length: 516 # NCBI annotation: 29 # Family: family:all:481 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654130;genbank:gi:108862014;genbank:GeneID:5075937 Probab=99.80 E-value=4.7e-17 Score=110.08 Aligned_cols=507 Identities=13% Similarity=0.023 Sum_probs=248.6 Q ss_pred cCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCCceEehhhHHHHHHHhhhhhh Q lcl|Aclame:pro 20 VYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRPCLVNNVLPTFVDQVLGDQRQ 99 (711) Q Consensus 20 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~v~~i~g~~~~ 99 (711) .+..-+-+...--..+..+|+...+....|...|.++.+|..-.=+++.... ++...+.-..-...++.+.+..-. T Consensus 1 ~~~~~~~~~~~~~~~l~~r~~~L~~~R~~~e~~w~e~a~~~lP~~~~~~~~~----~~~~~~~dstg~~a~~~LAa~l~~ 76 (516) T protein:vir:96 1 MKQSIDLEYGGKRSKIPKLWEKFSNKRSSFLDRAKHYSKLTLPYLMNDKGDN----ETSQNGWQGVGAQATNHLANKLAQ 76 (516) T ss_pred CcchhhhhhhhhHHHHHHHHHHHHHHhhHHHHHHHHHHHhhcccccCCCCCc----cccCCcccchHHHHHHHHHHHHHh Confidence 1121122222334678888888888889999999999999854322221100 111111122222223333222111 Q ss_pred -----cccceeEecchhhhhhhhhcccccccccccCCCchhHHHH---HHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCc Q lcl|Aclame:pro 100 -----NRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELA---EVFTGLIKNIEYNCDAETEYDIAFQGAVESGM 171 (711) Q Consensus 100 -----~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~A---e~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~ 171 (711) +++=.++.+.+.. .......+.+..++. +.++..+......|+|......++.+.+..|+ T Consensus 77 ~ltpp~~~WF~L~~~~~~------------~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~ 144 (516) T protein:vir:96 77 VLFPAQRSFFRVDLTAQG------------EKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGS 144 (516) T ss_pred hhcCCCCcccccccChhH------------HhhccccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCe Confidence 1111233222100 000000001111222 23455566667789999999999999999999 Q ss_pred cEEEEEEeeccCCCCCCcceEEEecCccceeeCCCccccCccccceeeeeecCCHHHHHHhcCCcccchhhccccccccc Q lcl|Aclame:pro 172 GYLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSVADYDT 251 (711) Q Consensus 172 g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~ 251 (711) |++.+ + +++ .++.+ +..++++..++.- ...-++++.+++..+|.+.|+... ....... . T Consensus 145 a~l~~--d---~~~-----~~~~~-pl~~y~v~~d~~G----~v~~i~rr~~~~~~~l~~~~~~~~-~~~~~~~-----~ 203 (516) T protein:vir:96 145 CMLYK--P---SKG-----AISAI-PMHHYVVNRDTNG----DLLDIILLQEKALRTFDPATRAVV-EVGLKGK-----K 203 (516) T ss_pred EeEEe--c---CCC-----CEEEE-EcCeEEEeeCCCC----CeeeehhhhHhhHHHHHHhhhhhh-hhhhhhh-----h Confidence 87532 2 221 14444 5667777655531 133477888999999988885432 1111000 0 Q ss_pred CCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEEecCceeccCccCCC Q lcl|Aclame:pro 252 WFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGPVEIPS 331 (711) Q Consensus 252 ~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~le~~~p~~~ 331 (711) ....+.+.|..+-++++ ++ +.+++.-..|.+++ ..+-|+. T Consensus 204 ~~~~~~v~v~~~v~~~~---------~~------------------------------~~~~~~~~d~~~~~-~es~~~~ 243 (516) T protein:vir:96 204 CKEDDSVKLYTHAKYLG---------DG------------------------------FWELKQSADDIPVG-KVSKIKS 243 (516) T ss_pred cCCCCceEEEEeeeeeC---------Cc------------------------------eeEEEEEeCceeec-ccccccc Confidence 01122333333222211 11 11222222333333 3345666 Q ss_pred CccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHHHHhhcccCCCceEE Q lcl|Aclame:pro 332 TTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNFSLLT 411 (711) Q Consensus 332 ~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~~~~~~~~~~i~ 411 (711) ..+||+++- +...++..||.|.+....+--+.+|.+...++.....+.++.++++++.+.+..+... ..+|.++ T Consensus 244 ~e~P~~~~R--w~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~l~~---~~~g~i~- 317 (516) T protein:vir:96 244 EKLPFIPLT--WKRSYGEDWGRPLAEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGAQTDVDHFVN---SGTGEVV- 317 (516) T ss_pred ccCCeeeee--eeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCccccCcccccchhhhcc---CCCceee- Confidence 789999764 4456889999999999999999999999999999999999999998888876654321 2233332 Q ss_pred ecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHHHHHHHHHHHHHHHHH-HH Q lcl|Aclame:pro 412 YIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTK-SI 490 (711) Q Consensus 412 ~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~-~~ 490 (711) +|...+..+-......-.+.....++...+.|.... ..+.+.-.++.+.|+.-|..+.+--...+...+..|.. +. T Consensus 318 --~g~~~~v~~~q~~~~~d~~~~~~~i~~~~~rI~~af-~~~~l~~r~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell 394 (516) T protein:vir:96 318 --TGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVVF-MMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFATTMQ 394 (516) T ss_pred --cCCcccceeeecCcccchhHHHHHHHHHHHHHHHHH-hhhhhccCCCccccHHHHHHHHHHHHHHhhhHHHHHHHHHH Confidence 333322222122233334556677777777776654 22222224455678888988888777777777777653 33 Q ss_pred HHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEeecccChHHHHHHHHHHHH Q lcl|Aclame:pro 491 RRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMI 570 (711) Q Consensus 491 ~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~ 570 (711) ..+.+.++.. .+. . -| . +...+.+..+ -.+-.|.+..+.+. T Consensus 395 ~Pli~r~l~~-------------~~p----~---------lp-~-----------~~v~~~~vs~-l~~l~r~~~~~~i~ 435 (516) T protein:vir:96 395 SPVAMWGLLE-------------AGE----S---------FT-S-----------DLVDPVIITG-IEALGRMAELDKLA 435 (516) T ss_pred HHHHHHHHHh-------------cCC----C---------Cc-c-----------ccccceeech-HHHHHHHHHHHHHH Confidence 3333322211 111 0 00 0 0012222222 22344555555566 Q ss_pred HHHhhcchhHHHHHHHHHHhcCCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 571 QFAQAVPSAAAVMADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEA 650 (711) Q Consensus 571 ~l~~~~p~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~~~~k~qa 650 (711) .+++.+..+++.. -+.++.-+.+++.+.+....+-....--..++..+..++++++++.++++....++....+++ T Consensus 436 ~~~~~i~~~~~~~----p~v~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~~~~~~~~q~~~~~a~~~~~~~~~~~~~ 511 (516) T protein:vir:96 436 NFAQYMSLPLQWP----EPVLAAVKWPDYMDWVRGQISAELPFLKSAEEMAQEQEAQMQAQQAQMLEEGVAKAVPGVIQQ 511 (516) T ss_pred HHHHHHHHHhcCC----hhHHhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHhhc Confidence 5555433222111 223455566777777766655432111111111111111111111111000001111111111 Q ss_pred HHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 651 DTAQAQADMLKAQLETEEA 669 (711) Q Consensus 651 e~~~aqae~~~~q~~~~~~ 669 (711) + .+++ T Consensus 512 ~--------------~~~~ 516 (516) T protein:vir:96 512 E--------------LKEA 516 (516) T ss_pred c--------------cccC Confidence 1 1111 No 76 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=99.80 E-value=1.6e-18 Score=118.14 Aligned_cols=463 Identities=11% Similarity=0.017 Sum_probs=228.8 Q ss_pred CCcCCCCCCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCC- Q lcl|Aclame:pro 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRP- 79 (711) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p- 79 (711) |.+.+ -.-.+.+..-+.+.+..+.+++.. ..+....+-.+||.|+| +-.-+......++| T Consensus 1 ~~~~~------------~~~~~~~~~~~~~~~~~~i~~~~~------~~~~r~~~~~~yy~g~~-~i~~~~~~~~~~~~~ 61 (489) T protein:vir:99 1 MLQED------------FEAIDYESKLWIDQLKNYISRFKA------EQLERLKELKRYYLGDN-NIKYRPAKTDKYAAD 61 (489) T ss_pred CCccc------------eeeeCCCCCCCHHHHHHHHHHHHH------HHHHHHHHHHHHhcccC-ccccccccccccCCc Confidence 33332 111111112222224444444321 22344567788999985 11111111122333 Q ss_pred -ceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHH Q lcl|Aclame:pro 80 -CLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETE 158 (711) Q Consensus 80 -~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~ 158 (711) .++.|..+.+|+..+|+.-.+.+.+. + +|.+ ....+..+++.|+++.. T Consensus 62 ~ki~~n~~~~iv~~~~~~l~g~~~~~~--~-------------------------~d~~----~~~~l~~~~~~n~~~~~ 110 (489) T protein:vir:99 62 NRIASDFAKYITVFEQGYMLGVPVEYK--N-------------------------ENKD----LQAAIDLMSVRNNEDYH 110 (489) T ss_pred ceeecchHHHHHHHHhhhhccCCceee--c-------------------------CChh----HHHHHHHHHhhcChhHH Confidence 48899999999999998766544432 1 2222 35567778888999999 Q ss_pred HHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCcccee--eCCCccccCccccceeeeeecCCHHHHHHhcCCc Q lcl|Aclame:pro 159 YDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDA 236 (711) Q Consensus 159 ~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~ 236 (711) ...+.++++++|.||.-++.....+ .+++++|..+ +|.+++ ||+... .+..++ ++.|.. T Consensus 111 ~~~~~~~~~~~G~~~~~v~~~~~~d--~~~~~~i~~~-~p~~~~~v~dd~~~----~~~~~~-i~~~~~----------- 171 (489) T protein:vir:99 111 NVKIKTDLSIYGRAYELLTVEKIDD--KKTEVKLYQL-PAEQTFVIYDDTYQ----RNSLMA-VHFYDI----------- 171 (489) T ss_pred HHHHHHHHhhCCeEEEEEeeccCcC--CCcceEEEEE-cccceEEEEcCCCC----CceEEE-EEEEEE----------- Confidence 9999999999999998887654322 2467888877 787775 443221 112222 222210 Q ss_pred ccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEE Q lcl|Aclame:pro 237 TAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRK 316 (711) Q Consensus 237 ~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~ 316 (711) .+.+...+..+++|.... ++.+..... T Consensus 172 --------------~~~~~~~~~~~~~y~~~~------------i~~~~~~~~--------------------------- 198 (489) T protein:vir:99 172 --------------DYGSGKRKQIIKAYTSDT------------IYTYEDYNL--------------------------- 198 (489) T ss_pred --------------ecCCCceEEEEEEEeCCc------------EEEEEecCC--------------------------- Confidence 000112334455553321 111100000 Q ss_pred EecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccC-Ch Q lcl|Aclame:pro 317 ITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVE-GR 395 (711) Q Consensus 317 ~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~-~~ 395 (711) ..+...+..+.|.+.+.+|+++|. +...+.|.+..+++.++.+|..+|.+.+.+...+.+..++- |... .. T Consensus 199 ~~~~~~~~~~~~~~~g~vPvv~~~-------n~~~~~s~~~~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~-g~~~~~~ 270 (489) T protein:vir:99 199 ETKGMRLKDYEGHFFKGVPVNEYA-------NNEERTGAYESVLDNIDAYDLSQSELANFQQDSVNALLVIA-GNAYTGA 270 (489) T ss_pred CcccceecccccccCCceeEEEee-------cCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhhc-cCCcccc Confidence 001111223445555777777652 22346789999999999999999999999887777665553 3211 11 Q ss_pred H--HHHhhcccCC------------CceEEecccccC---cCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhcc Q lcl|Aclame:pro 396 E--DEWEQANTKN------------FSLLTYIPQYQG---DPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGA 458 (711) Q Consensus 396 ~--~~~~~~~~~~------------~~~i~~~~~~~~---~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~ 458 (711) + +........+ +.++...++... +..++++..+.-...+...++.....|-..||+++.+.+. T Consensus 271 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~ 350 (489) T protein:vir:99 271 DENDYLDDGRLNPNGRLAISIGFKKAQVLILDDNPNPNGVKPQAYFLKKEYDTAGSEAYKNRLVADILRFTFTPDTQDMK 350 (489) T ss_pred cchhhhhhcccccccccccccccccceeeeeccccCccccccceeeeeecCChHHHHHHHHHHHHHHHHHhCCccccccc Confidence 1 1111111111 122222222111 2234555555556677778888889999999988876654 Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhcccee Q lcl|Aclame:pro 459 MGNETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWV 538 (711) Q Consensus 459 ~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~ 538 (711) .+++.||.|+..+..............|..+.+++.++++.++...... ......+ T Consensus 351 ~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~--------~~~~~~~---------------- 406 (489) T protein:vir:99 351 FSGVQSGESMKYKLMASDNYREKQERLFKKGLMRRLRLAANIWAIKGNE--------ATTYSLV---------------- 406 (489) T ss_pred ccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCc--------ccccccc---------------- Confidence 4556799998877665555556666666667777777666655322110 0000000 Q ss_pred eeehhhheeeeEEeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCC---cchHHHHHHHHhhhcchhh-cc Q lcl|Aclame:pro 539 TIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDW---PGADVIAERLKKIVPPNVL-SK 614 (711) Q Consensus 539 ~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~---~~~~e~~~~l~~~~~~~~~-~~ 614 (711) .++.+.=.+..+....+..+.+..+...++. ..+++.+++ +.+++..+++++....... .+ T Consensus 407 ---------~~i~v~f~~~~p~d~~~~~~~~~kl~giis~------et~~~~l~~v~~~d~~~E~~ri~~E~~~~~~~~~ 471 (489) T protein:vir:99 407 ---------NDTSIVFTPNLPQNDNEIVTAAQNLYGIVSD------QTIFEILNTVTGVDAEAELKRLKEEADKKQSLPE 471 (489) T ss_pred ---------ccceEEeCCCCCcCHHHHHHHHHHHhccCCH------HHHHHhcCCCCchhHHHHHHHHHHHHHHHhcccc Confidence 1222222333333344444555555443332 222333332 2333333333332211110 00 Q ss_pred hhhhhhhhhHHHHHHHHH Q lcl|Aclame:pro 615 DEREAIEEDMPEQTEPTP 632 (711) Q Consensus 615 ~~~~~~~~~~~~~q~~~~ 632 (711) .......-.+.+.....+ T Consensus 472 ~~~~~~~~~~~~~~~~~p 489 (489) T protein:vir:99 472 PRLVGDASGQEEPTAEKP 489 (489) T ss_pred ccccCCCCCCcCCCCCCC Confidence 000000000000000000 No 77 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=99.79 E-value=6e-18 Score=114.99 Aligned_cols=463 Identities=12% Similarity=0.054 Sum_probs=223.7 Q ss_pred CCcCCCCCCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCC-CHHHH----HHHHH Q lcl|Aclame:pro 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQW-PSQVR----TEREL 75 (711) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw-~~~~~----~~~~~ 75 (711) .-++.-.+..+.. .+--..+.+.+...+.+.++.+ .....+.+..+..+||.|++= ..... ..... T Consensus 20 ~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~i~~~i~-------~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~ 90 (492) T protein:vir:97 20 ILYPSQPTQTEIF--DAIVRTNNKPETLEEMIVRYIK-------QHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVD 90 (492) T ss_pred eeeccchhhhhHh--hhcccCCCchhhHHHHHHHHHH-------HHHHHHHHHHHHHHHhcccCcccccccccccccccc Confidence 1122211111110 1111111122222333333333 233455667788999999741 00000 00111 Q ss_pred hCCC--ceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 76 EQRP--CLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNC 153 (711) Q Consensus 76 ~g~p--~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~ 153 (711) ..+| .+++|..+.+|+..+|+.-.+.+.+. .+|.+..+.+ +.+++ | T Consensus 91 ~~~~~~ri~~n~~k~Ivd~~~~yl~g~p~~~~---------------------------~~d~~~~~~l----~~~~~-n 138 (492) T protein:vir:97 91 PLKPDDRMITNFHANLVDQKVSYIVGKPIAFK---------------------------HTDDEVVKRI----DEVLG-N 138 (492) T ss_pred ccccccccccchHHHHHHHHhhhhcccCceec---------------------------cCchHHHHHH----HHHHh-c Confidence 2222 46799999999999998766554432 1333343433 33333 6 Q ss_pred CHHHHHHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCcccee--eCCCccccCccccceeeeeecCCHHHHHH Q lcl|Aclame:pro 154 DAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDTMSKEKFKA 231 (711) Q Consensus 154 ~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~ 231 (711) +++.....+..+++++|.||.-++.+. ++++++..+ +|.+++ ||+... . +. ..+++.|-..+ T Consensus 139 ~~~~~~~~~~~~~~~~G~a~~~v~~d~------dg~~~~~~~-~p~~~~~i~d~~~~-~---~~-~~~vr~~~~~~---- 202 (492) T protein:vir:97 139 RFDDKLHSVLTGASNKGIEWLHPYLDE------EGEFKLFRV-PAEQGIPIWTDKEH-E---EL-EAFIRMYKLEN---- 202 (492) T ss_pred cHHHHHHHHHHHHhhcCeEEEEEEecC------CCceEEEEE-cccceEEEEcCCCC-C---ce-EEEEEEEeecc---- Confidence 899999999999999999988776542 256778776 787776 443221 1 12 22333332100 Q ss_pred hcCCcccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEE Q lcl|Aclame:pro 232 LYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFK 311 (711) Q Consensus 232 ~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~ 311 (711) ...+++|...... .+.+..+....... T Consensus 203 --------------------------~~~~~~y~~~~v~--~~~~~~~~~~~~~~------------------------- 229 (492) T protein:vir:97 203 --------------------------ETKVEYWDKVTVN--YYVYENGSLIPDYS------------------------- 229 (492) T ss_pred --------------------------ceeEEEEecCeEE--EEEEecCeeeeccc------------------------- Confidence 0012333222111 11111111110000 Q ss_pred EEEEEEecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccc Q lcl|Aclame:pro 312 TYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGN 391 (711) Q Consensus 312 v~~~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~a 391 (711) ...+.. .....|.+.+.+|+|+|. +...+.|.+..+++.++.+|..+|.+...+...+.+.+++.... T Consensus 230 ----~~~~~~-~~~~~~~~~g~vPvv~~~-------nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~ 297 (492) T protein:vir:97 230 ----NNLENS-KTHFSTGSWGKIPFIPFK-------NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKNYD 297 (492) T ss_pred ----cccccc-ccccccCCCCCcceEEec-------CCCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCC Confidence 000000 112234455667776542 23347799999999999999999999999999888877764322 Q ss_pred cCChHHHHhhcccCCCceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHH Q lcl|Aclame:pro 392 VEGREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIAR 471 (711) Q Consensus 392 v~~~~~~~~~~~~~~~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~ 471 (711) ..+..+ +.. ..+...++.+..+ +.+.++..+.-...+...++...+.|-..|++++.+.+..+++.||.|+..+ T Consensus 298 ~~~~~~-~~~-~~~~~~~~~~~~~----~~~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~ 371 (492) T protein:vir:97 298 DQELPE-FKR-LLRYYGAIKVSDN----GGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFL 371 (492) T ss_pred cccchh-HHH-HHhhccceecCCC----CcceeEeccCCHHHHHHHHHHHHHHHHHHhCCCCCCccccccCcHHHHHHHH Confidence 222222 221 1233345555433 2355665555566788888999999999999988877766677899998877 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEE Q lcl|Aclame:pro 472 QRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVV 551 (711) Q Consensus 472 ~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~ 551 (711) -...........+.|..+++++.++++.++. +.+ ++ .++. T Consensus 372 ~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~----------~~~-----~~-------------------------~~i~ 411 (492) T protein:vir:97 372 YTNLNLKADKLARKAKVAIQELLWFVFEHFD----------IKG-----EH-------------------------KDVD 411 (492) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc----------CCc-----cc-------------------------ceee Confidence 6666666666666666666666666554431 111 11 0111 Q ss_pred eecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCC-cchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHH Q lcl|Aclame:pro 552 VTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDW-PGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEP 630 (711) Q Consensus 552 v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~-~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~ 630 (711) +.=.+..+....+..+.+..+...++ ...+++.+++ .+.++-.+++++.........+. T Consensus 412 v~f~~~~p~~~~e~a~~~~kl~G~iS------~et~l~~l~~v~d~~~Eleri~~E~~~~~~~~~~-------------- 471 (492) T protein:vir:97 412 ISFNYNKVANTELQVQTAQQSMGIVS------HETVLENHPFVEDLQAELERIEQEQTEYNKQLPN-------------- 471 (492) T ss_pred EEecCCCCCCHHHHHHHHHHHhccCc------hHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhc-------------- Confidence 22223333323333444444433322 2223444432 23333333332211100000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 631 TPEQQVEMAKSQADMAQAEADTAQAQADMLKAQLETEEAQKQ 672 (711) Q Consensus 631 ~~~~q~~~~~~q~~~~k~qae~~~aqae~~~~q~~~~~~q~q 672 (711) ....... ...+....+ ..+.+ T Consensus 472 -------~~~~~~~-----~~~~~~~~~---------~~~~e 492 (492) T protein:vir:97 472 -------LDDGGAD-----SAQQQERSN---------NKESE 492 (492) T ss_pred -------cccCCCC-----CCccccccc---------ccccC Confidence 0000000 000000000 00000 No 78 >protein:vir:103330 Length: 517 # NCBI annotation: head portal-like protein # Family: family:all:481 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039666;genbank:gi:125999995;genbank:GeneID:4818406 Probab=99.79 E-value=5.5e-17 Score=109.71 Aligned_cols=506 Identities=10% Similarity=-0.014 Sum_probs=249.9 Q ss_pred ccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCCceEehhhHHHHHHHhhhhh Q lcl|Aclame:pro 19 KVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRPCLVNNVLPTFVDQVLGDQR 98 (711) Q Consensus 19 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~v~~i~g~~~ 98 (711) -++.+. ....++..+|+........|...|.++.+|..-.-+++..-..+. ..+....-...++...+..- T Consensus 1 ~~~~~~-----~e~~~l~~r~~~Lk~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~----~~~~dstg~~a~~~LAa~l~ 71 (517) T protein:vir:10 1 MDMRFA-----GNKSKIPKLYEQLVGKRSPFLSRAENYSRFTLPYLMADVNDDLSS----QNAWQDDGASATNFLSNKLS 71 (517) T ss_pred Cccccc-----ccHHHHHHHHHHHHHhhhHHHHHHHHHHHHhccccccCCCCCccc----cccccchHHHHHHHHHHHHH Confidence 111111 125678888888888889999999999999854222111100000 11112222223333322211 Q ss_pred h-----cccceeEecchhhhhhhhhcccccccccccCCCchh-HHH---HHHHHHHHHHHHhhcCHHHHHHHHHHHHHhc Q lcl|Aclame:pro 99 Q-----NRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKND-YEL---AEVFTGLIKNIEYNCDAETEYDIAFQGAVES 169 (711) Q Consensus 99 ~-----~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d-~~~---Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~ 169 (711) . +++=.++.+.+..... ...+..+ .++ -+.++..+......|+|......++.+.+.. T Consensus 72 ~~ltpp~~~WF~l~~~~~~l~~-------------~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~ 138 (517) T protein:vir:10 72 QVLFPAQRSFFRIDLTPEGIKQ-------------LDNEAMTQSTAQKLLSDVEKAAMLYGESLQFRPAVVEAFKHLIVT 138 (517) T ss_pred HhhcCCCCccccccCCHHHHHh-------------hccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhH Confidence 1 1111233222100000 0001111 111 2233555566678899999999999999999 Q ss_pred CccEEEEEEeeccCCCCCCcceEEEecCccceeeCCCccccCccccceeeeeecCCHHHHHHhcCCcccchhhccccccc Q lcl|Aclame:pro 170 GMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSVADY 249 (711) Q Consensus 170 G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~ 249 (711) |+|+. +.+ +. ...++.+ +..++++..++.- + ..-++++.+++..++.+.|+......- . . T Consensus 139 G~a~l--y~~---~~----~~~~~~~-pl~~y~v~~d~~G-~---v~~ivrr~~~~~~~l~~~~~~~~~~~~----~--~ 198 (517) T protein:vir:10 139 GNVMM--YHP---DK----TSPIQAV-PLHHYCVRRDNNG-T---VLDIVFLQEKALETFEPSIRMAIQASR----K--G 198 (517) T ss_pred CeEEE--EEe---CC----CCcEEEE-EcCeEEEeeCCCc-C---eEEEEeeeeccHHHHHHHhhhhcchhh----h--h Confidence 99874 222 22 1234444 5667887655531 1 223678899999999999986532111 0 0 Q ss_pred ccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEEecCceeccCccC Q lcl|Aclame:pro 250 DTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGPVEI 329 (711) Q Consensus 250 ~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~le~~~p~ 329 (711) ..+...+.|.++++-++.. +|.+ ++|.-..|..+ ...+-| T Consensus 199 ~~~~~~~~v~v~~~v~~~~---------~~~~------------------------------~~~~~~d~~~~-~~~s~y 238 (517) T protein:vir:10 199 KQYKDKDNVKLYTHAKRTK---------DGKY------------------------------LIRQSADDVPV-GKESTV 238 (517) T ss_pred hccCCcCceEEEEEEEEeC---------CCce------------------------------EEEEEeCceee-cccccc Confidence 1112234455554433321 1111 11222233333 234567 Q ss_pred CCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHHHHhhcccCCCce Q lcl|Aclame:pro 330 PSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNFSL 409 (711) Q Consensus 330 ~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~~~~~~~~~~ 409 (711) +...+||+|+- +...++..||.|.+....+--+.+|++...++.....+.+++++++.+.+.+..... ...+|.+ T Consensus 239 ~~~e~P~~~~R--w~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~~~~~~~a~~~~~lv~~~~~~~~~~l~---~~~~g~~ 313 (517) T protein:vir:10 239 TEDKSPFLILT--WKRSYGEDYGRGMAEDHAGAFFVIQFLSEALARGMALMADVKYLVKPGSYTDINQFV---EGGSGAV 313 (517) T ss_pred ccccCCeeeee--eeecCCCCcccchHHHhHHHHHHHHHHHHHHHHHHHHhccCCcccCcccccchhhcc---CCCcccc Confidence 78899999765 444688999999999999999999999999999999999999999988887665432 1222333 Q ss_pred EEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhc-cccchhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 410 LTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLG-AMGNETSGRAIIARQRQGDRGSFAFIDNLTK 488 (711) Q Consensus 410 i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G-~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~ 488 (711) + +|...+..+-........+.....++...+.|....= .+. ++ .++.+.|+.-|..+.+--...+...+.+|.. T Consensus 314 ~---~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~-~~~-l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~ 388 (517) T protein:vir:10 314 L---HGVEGDIHIVQLGKYADYTPIQAVLNDYRQRIGRVFM-MEA-MTRRDAERVTAYEIQRDAMLVEQSLGGVYSLFAT 388 (517) T ss_pred c---cCCcccceeeecccccchhHHHHHHHHHHHHHHHHHh-hhh-hhccCCccccHHHHHHHHHHHHHHhhhHHHHHHH Confidence 2 3332222222233444556677788888888877652 222 23 3445688888988888877788887777753 Q ss_pred -HHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEeecccChHHHHHHHHH Q lcl|Aclame:pro 489 -SIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAE 567 (711) Q Consensus 489 -~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~ 567 (711) +...+.+.++..+.... .++ ...+.+..+. ..-.|.+..+ T Consensus 389 Ell~Pli~r~~~~l~~~l--------~~~------------------------------~v~~~~~s~l-a~l~r~~~~~ 429 (517) T protein:vir:10 389 TFQGPLARWFMNGISSIL--------TSK------------------------------NVSPTILTGI-EALGRMAELD 429 (517) T ss_pred HHHHHHHHHHHHHhhhhc--------CCC------------------------------CccceeeccH-HHHHHHHHHH Confidence 44444444333221110 000 0111122222 2334455555 Q ss_pred HHHHHHhhcchhHHHHHHHHHHhcCCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 568 AMIQFAQAVPSAAAVMADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQ 647 (711) Q Consensus 568 ~L~~l~~~~p~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~~~~k 647 (711) .|.++++....+.+. .+. .++.-+.+++.+.+....+-....--..++.++.++++++++ +...+++ T Consensus 430 ~i~~~~~~i~~~a~~-~~~---~~~~id~d~~~~~~a~~~Gvp~~~irs~~ev~~~~~~~~~~~---------~~~~~~~ 496 (517) T protein:vir:10 430 KLGTFNGYVSMTAQW-PEP---LQQAIKWPDFTDWVQGQISANFPFFKTQDELNAEAQAQQEQE---------ATKYAAE 496 (517) T ss_pred HHHHHHHHHHHhhcC-ChH---HHhcCCHHHHHHHHHHHhCCChhhcCCHHHHHHHHHHHHHHH---------HHHHHHH Confidence 555555443222211 111 122335667776666665433211111111111110000000 0000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 648 AEADTAQAQADMLKAQLETEEAQKQLAMIEDMAQ 681 (711) Q Consensus 648 ~qae~~~aqae~~~~q~~~~~~q~q~~~~~~~~q 681 (711) +-... +.+.+.+ .+.. .+..| T Consensus 497 ~ag~~--~~~~~~~--------~~~~---~~~~~ 517 (517) T protein:vir:10 497 QAGKA--IPDMVKN--------GQIN---PQGGQ 517 (517) T ss_pred HHHHH--HHHHHhC--------CCCC---CCCCC Confidence 00000 0000000 0000 00000 No 79 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=99.79 E-value=2.4e-17 Score=111.67 Aligned_cols=440 Identities=13% Similarity=0.053 Sum_probs=217.6 Q ss_pred CCcCCCCCCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCC- Q lcl|Aclame:pro 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRP- 79 (711) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p- 79 (711) |+-...+.=..+. +..-..+.+.++.+. ....+.+..+-.+||.|.|--.. ...+..+++ T Consensus 1 ~~~~~~~~~~~~~----------~~~~~~~~i~~~i~~-------~~~~~~r~~~~~~yy~g~~~i~~--~~~~~~~~~~ 61 (453) T protein:vir:73 1 MNLKPIKLMTYSR----------DEEITDKVVNDFMKK-------HQEEVERYEYLGNMYKGIMEISS--QKAKDSWKPD 61 (453) T ss_pred Cccccceeeeccc----------cccCCHHHHHHHHHH-------HHHHHHHHHHHHHHhccccchhc--CCCCCccCcc Confidence 4433221111110 111111223333332 22334455566889999874211 112223333 Q ss_pred -ceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHH Q lcl|Aclame:pro 80 -CLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETE 158 (711) Q Consensus 80 -~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~ 158 (711) .+++|..+.+|+..+|+.-.+.+.+. + .+.. ....+..+++.|+++.. T Consensus 62 ~ki~~n~~~~ivd~~~~~l~g~~~~~~--~-------------------------~d~~----~~~~l~~~~~~n~~~~~ 110 (453) T protein:vir:73 62 NRLTNNFAKYIVDTFVGYFNGIPIKKT--H-------------------------DDKS----VLEAMQLFDNLNDMEDE 110 (453) T ss_pred ceeecchHHHHHHHhhhhhcccCceee--c-------------------------CChH----HHHHHHHHHHhcChhHH Confidence 47899999999999998766544432 2 1222 23456777788999999 Q ss_pred HHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCcccee--eCCCccccCccccceeeee-ecCCHHHHHHhcCC Q lcl|Aclame:pro 159 YDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLID-DTMSKEKFKALYPD 235 (711) Q Consensus 159 ~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~-~~~~~~e~~~~~p~ 235 (711) ...+.++++++|.||..++.+. ++.+++..+ +|.+++ ||+... -..++.. .+.+ T Consensus 111 ~~~~~~~~~~~G~~~~~v~~d~------~~~~~i~~~-~p~~~~~v~dd~~~------~~~~~~i~~~~~---------- 167 (453) T protein:vir:73 111 ESELAKIACVYGRAYELMYQNE------STESEVIYC-SPLNVFMVYDDSIK------QKPLFAVYYGFD---------- 167 (453) T ss_pred HHHHHHHHHhcCeEEEEEEeCC------CCceEEEEE-cccceEEEEeCCCC------ceeEEEEEEEEe---------- Confidence 9999999999999998877642 256777766 677665 443221 1122222 2211 Q ss_pred cccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEE Q lcl|Aclame:pro 236 ATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWR 315 (711) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~ 315 (711) .+....+++|.... ++.+.. T Consensus 168 -------------------~~~~~~~~vyt~~~------------i~~~~~----------------------------- 187 (453) T protein:vir:73 168 -------------------EEGNLSGTVYTLLE------------TISITG----------------------------- 187 (453) T ss_pred -------------------cCceEEEEEEeCCe------------EEEEEe----------------------------- Confidence 01112234443321 111000 Q ss_pred EEecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCCh Q lcl|Aclame:pro 316 KITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGR 395 (711) Q Consensus 316 ~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~ 395 (711) -.+...+.++.|.+.|.+|+|+|. +...+.|.+..+++.++.+|..+|.+.+.+...+.+.+++....+++ T Consensus 188 -~~~~~~~~~~~~~~~g~vPvv~~~-------n~~~g~s~~~~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~~~- 258 (453) T protein:vir:73 188 -KAGEVKFGESTYNVYSDLPIVEYN-------FNEERQSIFEPVHSLINSYNKVTSEKANDVEYFSDQYLVFLGAEVDE- 258 (453) T ss_pred -cCCceEEccceeccCCceeEEEec-------CCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCc- Confidence 011111223445566777777642 22346789999999999999999999999988888877764222221 Q ss_pred HHHHhhcccCCCceEEec---ccc----cCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHH Q lcl|Aclame:pro 396 EDEWEQANTKNFSLLTYI---PQY----QGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAI 468 (711) Q Consensus 396 ~~~~~~~~~~~~~~i~~~---~~~----~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai 468 (711) +.... .+.+.++... ++. ..+..++++..+.-...+...++.....|-..|++++.+.+.. ++.||.|+ T Consensus 259 -~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-gn~Sg~Al 334 (453) T protein:vir:73 259 -EDAKN--IKDNRLINFFDKNSNGQGTNAAKVDVKFLDKPDSDVQTENLLNRLERSIFQFTMAANISDENF-GNSSGVAL 334 (453) T ss_pred -hhhhc--ccccccccccccccccccccccCceeEEeeecCCHHHHHHHHHHHHHHHHHHhCCcccCcccc-cCccHHHH Confidence 11111 1111111111 111 1122255665555566677788888999999999888666554 34799998 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheee Q lcl|Aclame:pro 469 IARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKY 548 (711) Q Consensus 469 ~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~ 548 (711) ..+-.............|..+++++.++++.+... .| ...++. T Consensus 335 ~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~----------~~--~~~~~~------------------------- 377 (453) T protein:vir:73 335 AYKLQAMSNLALSFQRKFQSALNRRYSLWSSLSTN----------AS--NKDAWK------------------------- 377 (453) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc----------cC--Cccccc------------------------- Confidence 88766666666666666666666666655543211 01 011110 Q ss_pred eEEeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCC-cchHHHHHHHHhhhcchhhcchhhhhhhhhHHHH Q lcl|Aclame:pro 549 DVVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDW-PGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQ 627 (711) Q Consensus 549 dv~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~-~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 627 (711) ++.+.=.+..+....+..+.+..+...++. ..+++.+++ .+.++-.+++++..... ..+++ T Consensus 378 ~i~v~f~~~~p~~~~~~a~~~~k~~giis~------et~~~~~~~~~d~~~E~~ri~~E~~~~------------~~~~~ 439 (453) T protein:vir:73 378 DIEYTFTRNEPKDIKEQAETANILKGITSE------ETALSVISVIPDVQAEMEKIKKKKLLQ------------LSLTR 439 (453) T ss_pred cceEEeCCCCCCCHHHHHHHHHHHhccCcH------HHHHHhCCCCCCHHHHHHHHHHHHHHH------------HHHHH Confidence 111222233333333444444444333222 223333333 12222222221110000 00000 Q ss_pred HHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 628 TEPTPEQQVEMAKSQADM 645 (711) Q Consensus 628 q~~~~~~q~~~~~~q~~~ 645 (711) .... .+..+....+ T Consensus 440 ~~~~----~~~~~~~~~~ 453 (453) T protein:vir:73 440 TSNL----VRMKQMRGNL 453 (453) T ss_pred hccC----CcchhhhcCC Confidence 0000 0000000000 No 80 >protein:vir:78942 Length: 510 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522822;genbank:gi:158345057;genbank:GeneID:5687432 Probab=99.77 E-value=4.4e-16 Score=104.78 Aligned_cols=498 Identities=11% Similarity=0.017 Sum_probs=243.6 Q ss_pred HHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCCceEehhhHHHHHHHhhhhhh-----ccccee Q lcl|Aclame:pro 31 LLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRPCLVNNVLPTFVDQVLGDQRQ-----NRPAIK 105 (711) Q Consensus 31 ~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~v~~i~g~~~~-----~r~~~~ 105 (711) .-..+..+|.+.. .+.|...|.++.+|..-.=+.+.--.......+ ..-..-...++...+..-. +++=.+ T Consensus 1 mk~~~~~~~~~lk--r~~~e~~w~e~a~~tlP~~~~~~~~~~~~~~~~--~~dstg~~a~~~LAa~l~~~ltpp~~~WF~ 76 (510) T protein:vir:78 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEH--DFQSAGALLVNNLAAKLARSLFPTGIPFFR 76 (510) T ss_pred ChhHHHHHHHHHh--ccchHHHHHHHHHhhccccccCCCCcccccccC--cccchHHHHHHHHHHHHHHhhcCCCCcccc Confidence 2223444444442 456888899999998542121100000000111 1222222333333222111 111223 Q ss_pred EecchhhhhhhhhcccccccccccCCCchh-HHHH---HHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCccEEEEEEeec Q lcl|Aclame:pro 106 VSSTEVTRVPDAESGEDTTLKISNVAGKND-YELA---EVFTGLIKNIEYNCDAETEYDIAFQGAVESGMGYLRVRSDYL 181 (711) Q Consensus 106 ~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d-~~~A---e~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~ 181 (711) +.+.+.... . ...+..+ .++. +.++..+......|+|......++.+.+..|++++ +.+ T Consensus 77 l~~~d~~~~------------~-~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l--~~~-- 139 (510) T protein:vir:78 77 SELTDAIRR------------E-ADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALL--YRN-- 139 (510) T ss_pred cCCChHHhh------------h-cccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEE--EEe-- Confidence 322211000 0 0000111 1122 22345555666789999999999999998888764 222 Q ss_pred cCCCCCCcceEEEecCccceeeCCCccccCccccceeeeeecCCHHHHHHhcCCcccchhhcccccccccCCCCCeEEEE Q lcl|Aclame:pro 182 ADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSVADYDTWFTEKSVRVS 261 (711) Q Consensus 182 ~~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~ 261 (711) +++ + .++.+ +..++++..++. - ...-++++..|+..++.+.||......... ....+.|.++ T Consensus 140 -~~~--~--~~~~~-pl~~y~v~~d~~-G---~vd~i~rr~~~t~~~l~~~~~~~~~~~~~~--------~~~~~~v~v~ 201 (510) T protein:vir:78 140 -SDE--A--TVVAW-SLRSYAVRRDAT-G---RWMDIVLKQRYKSKDLDDVYKQDLMRAGRN--------LSGSGSVDLY 201 (510) T ss_pred -CCC--C--eEEEE-EcceeEEeeCCC-c---CeeEEEeeeeccHHHHHHHhhHHhhhhhhc--------cCCCceEEEE Confidence 221 1 34455 566777765443 1 233478889999999999998754221111 1123456666 Q ss_pred EeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEEecCceeccCccCCCCccceEEEEe Q lcl|Aclame:pro 262 EYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGPVEIPSTTIPVIPVWG 341 (711) Q Consensus 262 E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~le~~~p~~~~~~P~vp~~~ 341 (711) ++.+++... ....+.+|+ -..|..++ ..+-|++.++||+|+- T Consensus 202 ~~V~~~~~~-----------------------------------~~~~~sv~~-e~dg~~i~-~~~~~~~~e~P~~~~R- 243 (510) T protein:vir:78 202 THVQRRKGT-----------------------------------AMDYAEMYH-EIDGVRVG-ETGRWPIHLCPYIVPT- 243 (510) T ss_pred EEEEeecCC-----------------------------------CCcEEEEEE-EecCeeec-cccccccccCCeeeee- Confidence 666554210 111122222 23455554 3356788889999764 Q ss_pred eeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHHHHhhcccCCCceEEecccccCcCC Q lcl|Aclame:pro 342 KSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNFSLLTYIPQYQGDPG 421 (711) Q Consensus 342 ~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~~~~~~~~~~i~~~~~~~~~~~ 421 (711) +...++..||.|.+....+--+.+|++....+.....+.++.++++++.+.+.+.... ..+|.++ +|...+.. T Consensus 244 -w~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~p~g~~~~~~l~~---~~~g~~v---~g~~~~v~ 316 (510) T protein:vir:78 244 -WNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQD---AEMGDYV---PGGAEAVR 316 (510) T ss_pred -eeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccCCccccchhhhcc---CCCceee---cCCccccc Confidence 4456889999999999999999999999999999999999999999888766654321 2233343 44333322 Q ss_pred ccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHH Q lcl|Aclame:pro 422 PRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTK-SIRRVGKILVEM 500 (711) Q Consensus 422 i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~-~~~~~~~~~l~l 500 (711) +-..............++...+.|....=+ + ....++.+.|+.-|..+.+-....+...+.+|.. +...+.+..+.+ T Consensus 317 ~~~~~~~~d~~~~~~~i~~~~~rI~~aF~~-~-l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~i 394 (510) T protein:vir:78 317 AYERGDYNKMAAIQQSLQAVVVRLNQAFMY-G-ANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSE 394 (510) T ss_pred ccccCcccchHHHHHHHHHHHHHHHHHHhh-c-cccCCCCCcCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHH Confidence 222233344555667777777777765311 1 1113344578989999988888888888777753 666666666665 Q ss_pred HHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEeecccChHHHHHHHHHHHHHHHhhcchhH Q lcl|Aclame:pro 501 IPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAVPSAA 580 (711) Q Consensus 501 i~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~ 580 (711) +.... ++-+.- ..+ +-.++. +.+ +--|.+..+.+..+.+.+..+. T Consensus 395 l~r~g----l~p~p~-----------------~~~-----------~~~~v~--~is-~Laraq~~~~l~~~~q~l~~~~ 439 (510) T protein:vir:78 395 VDDAL----LQGLIT-----------------KQH-----------KPAIET--GLP-ALSRSAAVQSMLNASQVIAGLA 439 (510) T ss_pred HHhcc----CCCCCc-----------------ccc-----------cceeee--ccc-HHHHHHHHHHHHHHHHHHHHhc Confidence 54321 110000 000 001111 111 1122333333333333222211 Q ss_pred HHHHHHHHHhcCCcchHHHHHHHHhhhcc--hhhcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 581 AVMADLIAQNMDWPGADVIAERLKKIVPP--NVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEADTAQAQAD 658 (711) Q Consensus 581 ~~~~~~~~~~~~~~~~~e~~~~l~~~~~~--~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~~~~k~qae~~~aqae 658 (711) +. ....+.-+.+++.+.+....+. ......+.+. ++..+++++++.++++ . +++-+.++ +.... T Consensus 440 ~~-----~q~~~~id~d~~~~~~a~~~Gv~p~~ivrs~eev-~a~~~~~~~q~~~~~~-~--~~a~~~~~-~~~~~---- 505 (510) T protein:vir:78 440 PI-----AQLDPRISLPKMMDTIWAAFSVDTSQFYKSADEL-QAEAEEQRRQAAQAQA-A--QETLLEGA-SDMTN---- 505 (510) T ss_pred Ch-----hhhhhcCCHHHHHHHHHHHhCCChhhhcCCHHHH-HHHHHHHHHHHHHHHH-H--HHHHHHhh-hhhcc---- Confidence 11 1112223667777777666553 2222211111 1111100000000000 0 00000000 00000 Q ss_pred HHHHHHHHHHHHH Q lcl|Aclame:pro 659 MLKAQLETEEAQK 671 (711) Q Consensus 659 ~~~~q~~~~~~q~ 671 (711) +.+ .+ T Consensus 506 ---~~~-----g~ 510 (510) T protein:vir:78 506 ---ALA-----GV 510 (510) T ss_pred ---cCC-----CC Confidence 000 00 No 81 >protein:vir:78696 Length: 542 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285446;genbank:gi:148724480;genbank:GeneID:5220167 Probab=99.77 E-value=3.4e-17 Score=110.89 Aligned_cols=527 Identities=12% Similarity=0.048 Sum_probs=260.4 Q ss_pred HHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCCceEehhhHHHHHHHhhhhhh-----ccccee Q lcl|Aclame:pro 31 LLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRPCLVNNVLPTFVDQVLGDQRQ-----NRPAIK 105 (711) Q Consensus 31 ~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~v~~i~g~~~~-----~r~~~~ 105 (711) .-..+..+|+...+....|...|.++.+|..-.-...+- .....+. ..+.-+.-...++.+.+..-. +++=.+ T Consensus 1 mk~~a~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~-~~~~~~~-~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~ 78 (542) T protein:vir:78 1 MKGLAQARYSAMRADREDFLDMARRCAALTLPYLLTEDG-HASGGRL-QQPYQSLGSKGVNALSSKLMLSLFPIQTSFFK 78 (542) T ss_pred ChhHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCC-Ccccccc-cccccchHHHHHHHHHHHHHHhhcCCCCcccc Confidence 334467788888888889999999999997432111100 0000111 111222233333333322111 111122 Q ss_pred EecchhhhhhhhhcccccccccccCCCchh---HHHHHH---HHHHHHHHHhhcCHHHHHHHHHHHHHhcCccEEEEEEe Q lcl|Aclame:pro 106 VSSTEVTRVPDAESGEDTTLKISNVAGKND---YELAEV---FTGLIKNIEYNCDAETEYDIAFQGAVESGMGYLRVRSD 179 (711) Q Consensus 106 ~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d---~~~Ae~---l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d 179 (711) +.+.+..... ..+.++ .++... ++..+......|+|......++.+.+..|.|++-+ T Consensus 79 l~~~d~~l~~--------------~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~--- 141 (542) T protein:vir:78 79 LQINDAEIAS--------------VPELTPEVRSEIDMNLSKMEKMVMQQIAESSDRVQLTAAMKHLIVTGNVLVFA--- 141 (542) T ss_pred ccCCHHHHHh--------------hccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEe--- Confidence 2222100000 000011 112222 34556666678999999999999999999987522 Q ss_pred eccCCCCCCcceEEEecCccceeeCCCccccCccccceeeeeecCCHHHHHHhcCCcccchhhc-ccccccccCCCCCeE Q lcl|Aclame:pro 180 YLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYE-DSVADYDTWFTEKSV 258 (711) Q Consensus 180 ~~~~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~-~~~~~~~~~~~~~~v 258 (711) ++++ ++.+ +..++++..++.- ...-+|++..||..++.+.|++........ ....+ ....+ T Consensus 142 --~~~~------~~~~-pl~~y~v~~d~~G----~vd~v~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~-----~~~~~ 203 (542) T protein:vir:78 142 --GKKT------LKVY-PLDRYVIERDGDG----NVIEIITRELVDRSLLPAEFQKQSLLEGKDSNAVGE-----DGPKF 203 (542) T ss_pred --cCCC------ceEE-ecceeEEeeCCCC----CeEEEeeeeecCHHHHHHhhccccCchHHHhhcccc-----CCCeE Confidence 2333 2333 4567777665431 233388999999999999998765432221 11111 12344 Q ss_pred EEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEEecCceeccCccCCCCccceEE Q lcl|Aclame:pro 259 RVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGPVEIPSTTIPVIP 338 (711) Q Consensus 259 ~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~le~~~p~~~~~~P~vp 338 (711) .+++.++..... .++.. ..+....+.+|+ ...|..+-...+-+++..+||++ T Consensus 204 ~v~~~v~pr~~~-~~~~~--------------------------~~~~~~~~s~~~-e~~g~~v~~~~~e~g~~~~P~i~ 255 (542) T protein:vir:78 204 GVAQGKGGRNDA-EVFTC--------------------------CKLVDGQHRWHQ-ECDGKEIKGSRSSSPLKHSPWLP 255 (542) T ss_pred EEEEEeecccCC-ccccc--------------------------cccCCCeEEEEE-EeccccccccccccccccCCcee Confidence 455554433211 00000 001111122221 12233332222345677899997 Q ss_pred EEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHHHHhhcccCCCceEEecccccC Q lcl|Aclame:pro 339 VWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNFSLLTYIPQYQG 418 (711) Q Consensus 339 ~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~~~~~~~~~~i~~~~~~~~ 418 (711) +. +..+++..||.|.+....+-.+.+|.+....+.....+.++.++++++.+.+..+.. ...+|.++.-++ . T Consensus 256 ~R--w~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~pp~lv~~~g~~~~~~~~---~~~~g~iv~g~~---~ 327 (542) T protein:vir:78 256 LR--FNVVDGESYGRGRVEEFFGDLSSLDALTRSLIEGSAAAAKVVFMVSPSATTKPQSLA---RAGTGAIIQGRA---E 327 (542) T ss_pred ee--eeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccchhhcc---cCCCceeecCCc---c Confidence 64 445688999999999999999999999999999999999999999888776665432 234555543322 2 Q ss_pred cCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHH Q lcl|Aclame:pro 419 DPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLT-KSIRRVGKIL 497 (711) Q Consensus 419 ~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~-~~~~~~~~~~ 497 (711) +..+-++..+.-.......++...+.|.+..-+. .-.++...|+.-|..+.+.....+...+.+|. +++..+.+.. T Consensus 328 ~v~~~~~~~~~~~~~~~~~i~~~~~rI~~aFl~~---~~~d~~rvTAtEV~~r~~E~~~~LG~v~~rl~~E~L~Pli~R~ 404 (542) T protein:vir:78 328 DVSVVQANKGADFRTVQEMIRDLSQRISDAFLIL---NVRQSERTTATEVREVQMELDRQLSGIYGSLTVELLTPYLNRK 404 (542) T ss_pred ceeeeecccccchhHHHHHHHHHHHHHHHHhccc---ccCCcccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHH Confidence 2222233333345556777888888887765332 12445567999999999998889999888884 4666676666 Q ss_pred HHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEeecccChHHHHHHHHHHHHHHHhhcc Q lcl|Aclame:pro 498 VEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAVP 577 (711) Q Consensus 498 l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~~p 577 (711) +.++.+.---+. + |. ++ +.+.+..+. ...+|.+..+.|.++++.+. T Consensus 405 ~~il~r~g~lP~---------------~------p~--------~l----v~~~~~s~L-a~~~r~~~~~~l~~~~~~i~ 450 (542) T protein:vir:78 405 LHLMQRSKQLPS---------------L------PK--------GL----VMPTVVAGL-GGVGRGEDRAALIEFMQTVG 450 (542) T ss_pred HHHHHhcCCCCC---------------C------ch--------hc----eeeeeechH-HHHHHHHHHHHHHHHHHHHH Confidence 666555321110 0 00 00 123333332 33455555555555555432 Q ss_pred hhHHHHHHHHHHhcCCcchHHHHHHHHhhhcchh--hcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 578 SAAAVMADLIAQNMDWPGADVIAERLKKIVPPNV--LSKDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEADTAQA 655 (711) Q Consensus 578 ~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~--~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~~~~k~qae~~~a 655 (711) ++ +.+ -..++.-+.+++...+....+... ....+. +.++.++++++ ++.++.+. .++.. -|.... T Consensus 451 ~~---~~p--~~l~~~id~d~~~~~~a~~~Gvp~~~i~~s~e-~~~~~~~q~q~--~~~~~al~-~~a~~---~a~~~~- 517 (542) T protein:vir:78 451 QA---MGP--EALQQFIDPTEFLKRLAAASGIDTLNLVKSPE-TMANEAQQAQQ--QQMTASLM-GQAGQ---LAKSPI- 517 (542) T ss_pred Hh---cCC--hhHHhcCCHHHHHHHHHHHcCCCHhhccCCHH-HHHHHHHHHHH--HHHHHHHH-Hhhhh---cccccc- Confidence 21 111 112344466777777766655432 121111 11111111110 00000000 00000 000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 656 QADMLKAQLETEEAQKQLAMIEDMAQGGDVVYQQVRELV 694 (711) Q Consensus 656 qae~~~~q~~~~~~q~q~~~~~~~~q~~~~~~~~~~~~~ 694 (711) ......+..+ ..+.+=+ .. +..+.+ T Consensus 518 -~~~~~~~~~a-~~~~~~~----~~--------~~~~~~ 542 (542) T protein:vir:78 518 -GEKMMQQINA-PGQEAPA----GP--------QTGEDL 542 (542) T ss_pred -ccchhhhcCC-CCcCCCC----CC--------cccccC Confidence 0000000000 0000000 00 000000 No 82 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=99.77 E-value=1.2e-17 Score=113.43 Aligned_cols=467 Identities=12% Similarity=0.035 Sum_probs=229.7 Q ss_pred CCcCC---CCCC-CCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHH-HHHHH Q lcl|Aclame:pro 1 MAKKQ---KKSR-VEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVR-TEREL 75 (711) Q Consensus 1 ~~~~~---~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~-~~~~~ 75 (711) |-=.. ++++ +.+. +-+. ...+ .+..+.+++. ...+....+..+||.|.|..-..+ ..+.. T Consensus 1 ~~~~~~~~~~~~~~~~~---~~~~-----l~~~-~i~~li~~~~------~~~~~r~~~l~~YY~g~~~~i~~~~~~~~~ 65 (506) T protein:vir:94 1 MDYDLTEHKQANLIYQE---SLEN-----LTPN-KIMKFITHHF------NYQRPRLEMLDDYYQGYNLKILDKQSRRHE 65 (506) T ss_pred CCcchhhhhcceeeccc---chhc-----CCHH-HHHHHHHHHH------HHHHHHHHHHHHHhcCCCcccccccccccc Confidence 32111 1111 1111 1011 1111 2333333322 123344567788999987532111 22333 Q ss_pred hCCC--ceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 76 EQRP--CLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNC 153 (711) Q Consensus 76 ~g~p--~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~ 153 (711) .++| .+++|..+.+|+..+|+.-.+.+.+. + ++.+. ...+..+++.| T Consensus 66 ~~~~~~ki~~n~~~~Iv~~~~~~l~G~p~~~~--~-------------------------~d~~~----~~~l~~~~~~N 114 (506) T protein:vir:94 66 DGKADHRATHSFAKYIADFQTSYSVGNPINVK--L-------------------------PDDGS----NSGFDTFNKAN 114 (506) T ss_pred ccCCcceeecchHHHHHHHhhhhhcccCceee--c-------------------------CcchH----HHHHHHHHhcc Confidence 4454 47899999999999998766644432 2 11122 34577788889 Q ss_pred CHHHHHHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCcccee--eCCCccccCccccceeeeeecCCHHHHHH Q lcl|Aclame:pro 154 DAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDTMSKEKFKA 231 (711) Q Consensus 154 ~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~ 231 (711) +++.....+..+++++|.+|..++++. ++++++..+ +|..++ ||+... .....+++.|..... T Consensus 115 ~~~~~~~~~~~~~~~~G~a~~~v~~de------d~~~~i~~~-~p~~~~~v~dd~~~-----~~~~~~v~~~~~~~~--- 179 (506) T protein:vir:94 115 DVDAENYDLFLDMSRYGRAYEYVYRGE------DNEEHLAKL-DPLDTFVIYSTDVD-----PKPIMAVRYHQIELV--- 179 (506) T ss_pred CHhHHHHHHHHHHHhcCeEEEEEEecC------CCeeEEEEE-cccceEEEecCCCC-----CceEEEEEEEeeeec--- Confidence 999999999999999999998887652 256777776 777765 443221 122333444332111 Q ss_pred hcCCcccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEE Q lcl|Aclame:pro 232 LYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFK 311 (711) Q Consensus 232 ~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~ 311 (711) ..+.+....+|..... ..+++.+... T Consensus 180 ----------------------~~~~~~~~~~~~~~yt--------~~~~~~~~~~------------------------ 205 (506) T protein:vir:94 180 ----------------------DDNQVSTINYVPETWT--------ADTYTLYNPT------------------------ 205 (506) T ss_pred ----------------------cCCceeEEEEEEEEEe--------CceEEEeccc------------------------ Confidence 0111111111111100 1111111000 Q ss_pred EEEEEEecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccc Q lcl|Aclame:pro 312 TYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGN 391 (711) Q Consensus 312 v~~~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~a 391 (711) .+...+....+.+.+.+|+++|. +...+.|.+..++++++.+|..+|.+.+.+.-.+++.+++.... T Consensus 206 ------~~~~~~~~~~~~~~g~vPvv~~~-------n~~~~~sd~e~~~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~ 272 (506) T protein:vir:94 206 ------PIMGKMQVDTTKPITTFPVVEFK-------NSNFRLGDFENVLPLIDLYDAAQSDTANYMTDLNEAMLIIQGDI 272 (506) T ss_pred ------cCccceeccccccCCccceEEec-------CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhHHHHHhcCc Confidence 00011122344556777877653 12346799999999999999999999998876666554442111 Q ss_pred cCC------------------------hHHHHhhcccCCCceEEeccccc-----CcCCccccCCccchHHHHHHHHHHH Q lcl|Aclame:pro 392 VEG------------------------REDEWEQANTKNFSLLTYIPQYQ-----GDPGPRRQPPAAVPAAELTLGQNSV 442 (711) Q Consensus 392 v~~------------------------~~~~~~~~~~~~~~~i~~~~~~~-----~~~~i~~~~~~~~~~~~~~ll~~~~ 442 (711) ... ..+... ..+-+.++.+.++.. .+..++++..+.-..++...++... T Consensus 273 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~ 350 (506) T protein:vir:94 273 DTLFEGSDMMNTIDPNDEDAMAKLAKDKLELIK--EMKDANMLLLKSGMTVNGTQTSVDAKYINKTYDVVGSEAYKKRVA 350 (506) T ss_pred cccccchhccccccccccccccccccchhHHHh--hhhhcCeeeecccccccCccccccceeeeecCCHHHHHHHHHHHH Confidence 100 001110 112223444443321 2234666777777888889999999 Q ss_pred HHHHHHhCCCHHHhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchh Q lcl|Aclame:pro 443 EKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDF 522 (711) Q Consensus 443 ~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~ 522 (711) ..|-..|++++.+.+.-+++.||.|+..+..............|..+++++.++++.++..... ....++ T Consensus 351 ~~I~~~s~~p~~~~~~~~~n~Sg~Aik~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~----------~~~~d~ 420 (506) T protein:vir:94 351 GDIHKFSHTPDLTDENFASNSSGVAMQYKVLGTVELASTKRRMFERGLYARYQIISDIENSIHG----------DWTFDP 420 (506) T ss_pred HHHHHHhCccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC----------cccccc Confidence 9999999999877666567789999988877766666666677777777777777766543221 011111 Q ss_pred eecchhhhhhhccceeeeehhhheeeeEEeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCC-cchHHHHH Q lcl|Aclame:pro 523 VKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDW-PGADVIAE 601 (711) Q Consensus 523 v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~-~~~~e~~~ 601 (711) . ++.|.=.+..+....+..+.+..+...++. ..+++.+++ .+.++-.+ T Consensus 421 ~-------------------------~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~------et~~~~lp~v~d~~~E~~ 469 (506) T protein:vir:94 421 Q-------------------------ELTFTFRDNLPADNISQIKALVQAGATLPQ------KYLYQQLPGVTNPQDIVD 469 (506) T ss_pred c-------------------------cceEEeCCCCCcCHHHHHHHHHHHhccCCh------HHHHHhCCCCCCHHHHHH Confidence 0 111222333333333444444444333322 222333332 22222222 Q ss_pred HHHhhhcchhhcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 602 RLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEADTAQAQADMLK 661 (711) Q Consensus 602 ~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~~~~k~qae~~~aqae~~~ 661 (711) ++.+................... .. -+......+-.+ T Consensus 470 ri~~E~~~~~~~~~~~~~~~~~~----------------------~~-~~~~~~~~~e~~ 506 (506) T protein:vir:94 470 MMKEQSANGDYSFDQNGVISNDG----------------------QT-NTTATQTDEEVR 506 (506) T ss_pred HHHHHHHHHhhcchhhcCCCccc----------------------Cc-cccccccccCCC Confidence 23222111000000000000000 00 000000000000 No 83 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=99.76 E-value=2.8e-17 Score=111.29 Aligned_cols=463 Identities=11% Similarity=0.046 Sum_probs=228.7 Q ss_pred CCcCCCCCCCCcccCCCc-ccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCC-HHHHHH----HH Q lcl|Aclame:pro 1 MAKKQKKSRVEQLYAKKA-KVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWP-SQVRTE----RE 74 (711) Q Consensus 1 ~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~-~~~~~~----~~ 74 (711) |+.=.|. +++...... +....+.....+++.++.+.++ ....+..+..+||.|+|=- ...... .. T Consensus 1 ~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~-------~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~ 71 (474) T protein:vir:96 1 MIVIFWP--NEKPYHERVVEQIKPKYETQEEMIIRLINDHK-------PKIDDITVGERYYNHDPDVLRLAPKLDNKGEI 71 (474) T ss_pred CeeeccC--CCchhhhhHHHHhhhccCChHHHHHHHHHHHH-------HHHHHHHHHHHHhccCCcchhccchhcccccc Confidence 7655431 121111111 1111112233444555554432 3445667888999998510 000000 01 Q ss_pred HhCCC--ceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 75 LEQRP--CLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYN 152 (711) Q Consensus 75 ~~g~p--~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~ 152 (711) ...+| .+.+|..+.+|+..+|+.-.+.+.+. ..+.+..+.+..++ + T Consensus 72 ~~~~~~~ki~~n~~~~Ivd~~~~~l~g~p~~~~---------------------------~~d~~~~~~l~~~~----~- 119 (474) T protein:vir:96 72 DPLKPDWRMFTNYHQNLVDQKVAYAVANPVTFS---------------------------SDDDKSLKTIQEVL----N- 119 (474) T ss_pred cccccchhcccchHHHHHHhhhhhhcccCceee---------------------------cCchHHHHHHHHHH----h- Confidence 11222 36789999999999998766554432 13344444444433 3 Q ss_pred cCHHHHHHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCcccee--eCCCccccCccccceeeeeecCCHHHHH Q lcl|Aclame:pro 153 CDAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDTMSKEKFK 230 (711) Q Consensus 153 ~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~ 230 (711) +++......+..++.++|.||..++.+. ++++++..+ +|..++ ||+... .+. .++++.|... T Consensus 120 n~~~~~~~~~~~~~~~~G~~~~~~y~d~------~~~~~i~~~-~p~~~~~v~d~~~~----~~~-~~~vr~~~~~---- 183 (474) T protein:vir:96 120 HKWDDKLVDILTAASNKGIEWLQPYIDE------NGEFKTFRV-PAEQAIPIWTNKER----DTL-KAFIRYYRLD---- 183 (474) T ss_pred cCHHHHHHHHHHHHHhcCeeEEEEEecC------CCceEEEEE-cccceEEEEcCCCC----Cce-EEEEEEEeec---- Confidence 5788888889999999999998887653 256788777 788877 444221 122 2333333110 Q ss_pred HhcCCcccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceE Q lcl|Aclame:pro 231 ALYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTF 310 (711) Q Consensus 231 ~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 310 (711) . ..-+++|..... ..+...++.... +... T Consensus 184 ------~--------------------~~~~~~yt~~~v--~~~~~~~~~~~~-~~~~---------------------- 212 (474) T protein:vir:96 184 ------G--------------------AERVEYWTDSDV--TYYEYQDGILIP-DYYH---------------------- 212 (474) T ss_pred ------C--------------------ceEEEEEeCCeE--EEEEecCCceee-cccc---------------------- Confidence 0 001222322111 111111111110 0000 Q ss_pred EEEEEEEecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEeccc Q lcl|Aclame:pro 311 KTYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEG 390 (711) Q Consensus 311 ~v~~~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~ 390 (711) .......+.+....|.+.+.+|+++|. +...|.|.+..+++.++.+|...|.+.+.+...+++.+++.-. T Consensus 213 ---~~~~~~~~~~~~~~~~~~g~iPvv~~~-------nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~ 282 (474) T protein:vir:96 213 ---GEEHIQSHYYVGNKRVSWGRVPFIPFK-------NNPQEMSDLFMYKTIIDAMDKRLSDTQNTFDESTELIYILKGY 282 (474) T ss_pred ---ccccccccccccccccCCCceeEEEec-------cCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecC Confidence 000001111223456677888887653 2345789999999999999999999999999988887776533 Q ss_pred ccCChHHHHhhcccCCCceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHH Q lcl|Aclame:pro 391 NVEGREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIA 470 (711) Q Consensus 391 av~~~~~~~~~~~~~~~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~ 470 (711) ...+..+... ..+.+.++.+.+ .++.++++..+.-..+....++...+.|-..|++.+.+.+..+++.||.|+.. T Consensus 283 ~~~~~~~~~~--~~~~~~~i~~~~---~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~ 357 (474) T protein:vir:96 283 EGQDLDEFMR--NLKYYKAINVDG---DGSGVDTIQIEVPVQSSKEYLDMLRDYVIEFGQGVDFQQDKFGNSPSGIALKF 357 (474) T ss_pred Ccccccchhh--hhhcCceEEecC---CCCceeEEeecCChHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHH Confidence 3333223222 223445565542 12346777766666788888899999999999999888776677789999988 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeE Q lcl|Aclame:pro 471 RQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDV 550 (711) Q Consensus 471 ~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv 550 (711) +..............|..+++++.++++.+....+ ++. ++ T Consensus 358 ~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~---------------~~~-------------------------~i 397 (474) T protein:vir:96 358 MYSNLDLKANKLKNKTLTALQELLQYIIDFYKLNI---------------KVQ-------------------------DV 397 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCc---------------ccc-------------------------ee Confidence 77666666666666666677776666555431111 100 01 Q ss_pred EeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcC-CcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHH Q lcl|Aclame:pro 551 VVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMD-WPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTE 629 (711) Q Consensus 551 ~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~-~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~ 629 (711) .+.=.++.+....+..+.+.+ +.. +....++..++ +.+.+.-.+++.+................ T Consensus 398 ~i~f~~~~p~~~~e~~~~~~~-ag~------iS~et~~~~~~~v~d~~~E~~ri~~E~~e~~~~~~~~~~~~-------- 462 (474) T protein:vir:96 398 EITFNFNVMVNELEQSQIGVQ-SQY------LSKETVVTNHPWVDDPVAELERIEQDNIDFNKQLPPLEGDA-------- 462 (474) T ss_pred eEEeccCCCcCHHHHHHHHHh-cCC------CchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhccccccccc-------- Confidence 111111111111122222221 111 11122233333 22333333333222111000000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 630 PTPEQQVEMAKSQADMAQAEAD 651 (711) Q Consensus 630 ~~~~~q~~~~~~q~~~~k~qae 651 (711) ......+.+ +.. T Consensus 463 --------~~~~~d~~~--e~~ 474 (474) T protein:vir:96 463 --------NGRAQDNES--ETN 474 (474) T ss_pred --------ccccCCCcc--cCC Confidence 000000000 000 No 84 >protein:vir:7017 Length: 515 # NCBI annotation: head portal protein # Family: family:all:481 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853590;genbank:gi:31711672;genbank:GeneID:1481798 Probab=99.76 E-value=6.4e-16 Score=103.87 Aligned_cols=505 Identities=11% Similarity=0.013 Sum_probs=243.5 Q ss_pred CCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCCceEehhhHHHHHHHhhhhhh- Q lcl|Aclame:pro 21 YAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRPCLVNNVLPTFVDQVLGDQRQ- 99 (711) Q Consensus 21 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~v~~i~g~~~~- 99 (711) +.......--....+..+|.........|...|.++.+|..-.-+++..... +...+.-..-...++...+..-. T Consensus 1 ~~~~~~~~~~~~~~l~~r~~~Lk~~R~~~e~~w~e~~~~tlP~~~~~~~~~~----~~~~~~dstg~~a~~~LAa~l~~~ 76 (515) T protein:vir:70 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGDNE----TSQNGWQGVGAQATNHLANKLAQV 76 (515) T ss_pred CcchhhhhcCCHHHHHHHHHHHHHhhhHHHHHHHHHHHHhcccccCCCCCcc----cccccccchHHHHHHHHHHHHHHh Confidence 1111111111256788888888888899999999999999653332211111 11111222222233333222111 Q ss_pred ----cccceeEecchhhhhhhhhcccccccccccCCCchhH-HHHH---HHHHHHHHHHhhcCHHHHHHHHHHHHHhcCc Q lcl|Aclame:pro 100 ----NRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDY-ELAE---VFTGLIKNIEYNCDAETEYDIAFQGAVESGM 171 (711) Q Consensus 100 ----~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~-~~Ae---~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~ 171 (711) +++=.++.+.+... + .....+.+. ++.+ .++..+......|+|......++.+.+..|+ T Consensus 77 ltpp~~~WF~l~~~d~~~------------~-~l~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~ 143 (515) T protein:vir:70 77 LFPAQRSFFRVDLTAKGE------------K-VLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGN 143 (515) T ss_pred hcCCCCcccccccChhhh------------h-ccccchhHHHHHHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHHhHCe Confidence 11112222211000 0 000001111 2222 3455566667789999999999999999999 Q ss_pred cEEEEEEeeccCCCCCCcceEEEecCccceeeCCCccccCccccceeeeeecCCHHHHHHhcCCcccchhhccccccccc Q lcl|Aclame:pro 172 GYLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSVADYDT 251 (711) Q Consensus 172 g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~ 251 (711) |++-+ + ++ +. ++.+ +..++++..++.- ...=++++..|+..+|.+.|+.......... . T Consensus 144 a~l~~--d---~~---~~--~~~~-pl~~y~v~~d~~G----~v~~i~rr~~~t~~~l~~~f~~~~~~~~~~~---~--- 202 (515) T protein:vir:70 144 CLLYK--P---SK---GA--MSAV-PMHHYVVNRDTNG----DLMDVILLQEKALRTFDPATRMAIEVGMKGK---K--- 202 (515) T ss_pred EEEEE--e---CC---CC--eEEE-EcCeEEEeeCCCc----CeeEEEeeeeccHHHHHHhhhhhhhhhhhhh---h--- Confidence 87533 2 21 11 3334 5667777655531 2333788999999999999986431111100 0 Q ss_pred CCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEEecCceeccCccCCC Q lcl|Aclame:pro 252 WFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGPVEIPS 331 (711) Q Consensus 252 ~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~le~~~p~~~ 331 (711) ....+.|.++.+-++. .++ +..+|.-..|.++ ...+-|+. T Consensus 203 ~~~~~~v~i~~~v~~~---------~~~------------------------------~~~~~~e~d~~~~-~~es~y~~ 242 (515) T protein:vir:70 203 CKEDDNVKLYTHAQYA---------GEG------------------------------FWKINQSADDIPV-GKESRIKS 242 (515) T ss_pred cCCCCceEEEEEEEec---------CCC------------------------------ceEEEEecCceee-cccccccc Confidence 1112334333221111 111 1122233334333 34466788 Q ss_pred CccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHHHHhhcccCCCceEE Q lcl|Aclame:pro 332 TTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNFSLLT 411 (711) Q Consensus 332 ~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~~~~~~~~~~i~ 411 (711) ..+||+++- +...++..||.|.+....+--+.+|.+...++.....+.++.++++++.+.+...... ..+|.+ T Consensus 243 ~e~P~~~~R--w~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~l~~---~~~g~i-- 315 (515) T protein:vir:70 243 EKLPFIPLT--WKRSYGEDWGRPLAEDYSGDLFVIQFLSEAMARGAALMADIKYLIRPGSQTDVDHFVN---SGTGEV-- 315 (515) T ss_pred ccCCceeee--eeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCCeeeCcccccchhhccc---cCCcee-- Confidence 899999764 4446888999999999999999999999999999999999999999988876654321 222333 Q ss_pred ecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHHHHHHHHHHHHHHHHH-HH Q lcl|Aclame:pro 412 YIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTK-SI 490 (711) Q Consensus 412 ~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~-~~ 490 (711) .+|...+..+-......-.+.....++...+.|....= .+.+.-.++.+.|+.-|..+.+--...+...+.+|.. +. T Consensus 316 -v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~-~~~l~~rd~~rvTAtEV~~r~~E~~~~LGpv~srL~~Ell 393 (515) T protein:vir:70 316 -ITGVAEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFM-METMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMTMQ 393 (515) T ss_pred -ecCCcccceeeecCcccchhHHHHHHHHHHHHHHHHHh-hhhhhccCCccccHHHHHHHHHHHHHHhhHHHHHHHHHHH Confidence 33332222222233333445666777777777766542 2222223444578888888887777777777777653 33 Q ss_pred HHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEeecccChHHHHHHHHHHHH Q lcl|Aclame:pro 491 RRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMI 570 (711) Q Consensus 491 ~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~ 570 (711) ..+..+++ +..+.. -|+ ++ .++.+..+ -.+-.|.+..+.+. T Consensus 394 ~Pli~r~~-------------~~~~p~-------------~P~--------~~----v~~~~vs~-l~~L~r~q~~~~i~ 434 (515) T protein:vir:70 394 TPIAMWGL-------------QEAGDS-------------FTS--------EL----VDPVIVTG-IEALGRMAELDKLA 434 (515) T ss_pred HHHHHHHH-------------HhhCCC-------------CCh--------hh----cccceehh-HHHHHHHHHHHHHH Confidence 33222110 000100 000 00 11222222 22334454555555 Q ss_pred HHHhhcchhHHHHHHHHHHhcCCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 571 QFAQAVPSAAAVMADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEA 650 (711) Q Consensus 571 ~l~~~~p~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~~~~k~qa 650 (711) .+++.+...+.. .+. .++.-+.+++.+.+..........-...++.++..++.+++++ .+...++ T Consensus 435 ~~~q~i~~~~~~-~p~---~~~~id~d~~~~~~a~~~g~p~~~~rs~eev~~~r~q~~~~~~---~~~~~~~-------- 499 (515) T protein:vir:70 435 NFAQYMSLPQTW-PEP---AQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQ---EAMLNEG-------- 499 (515) T ss_pred HHHHHHHHHhcc-Chh---HHhhCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHH---HHHHHHh-------- Confidence 555543211111 112 2333455555555544333221111111111111111111000 0000000 Q ss_pred HHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 651 DTAQAQADMLKAQLETEEA 669 (711) Q Consensus 651 e~~~aqae~~~~q~~~~~~ 669 (711) ..++...-++-.++++ T Consensus 500 ---~~~a~~~~~~~~~~~~ 515 (515) T protein:vir:70 500 ---VAKAVPGVIQQEMKEG 515 (515) T ss_pred ---hhhhcccchhhhhccC Confidence 0000000000011111 No 85 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=99.75 E-value=8.7e-17 Score=108.63 Aligned_cols=461 Identities=13% Similarity=0.065 Sum_probs=223.5 Q ss_pred CCcCCCCCCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHH-------HHHHH Q lcl|Aclame:pro 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQ-------VRTER 73 (711) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~-------~~~~~ 73 (711) |..-.. ..+...+....+...+.+ +.. . .......+..+||.|.|=... ..... T Consensus 7 ~~~~~~-------------~~~~~~~~~~~~~~~i~~-~~~---~--~~~~~~~~~~~yy~g~~~i~~~~~~~~~~~~~~ 67 (479) T protein:vir:79 7 SETDLI-------------KVQLKKESTINLVKVIEH-YIL---K--HRPEKYKQGEEYYYGNTDVNNKRRYYLLDGAKV 67 (479) T ss_pred cccceE-------------eeccccCChhHHHHHHHH-HHh---h--hhHHHHHHHHHHhccCCcccccccccccccccc Confidence 221111 111111222222222222 211 1 123456677899999751100 00111 Q ss_pred HHhCCC--ceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 74 ELEQRP--CLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEY 151 (711) Q Consensus 74 ~~~g~p--~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~ 151 (711) +...+| .+++|..+-+|+..+|+.-.+.+.+.. .+.+..+ +++... T Consensus 68 ~~~~~~~~ki~~~~~~~Ivd~~~~~l~g~p~~~~~---------------------------~~~~~~~----~~~~~~- 115 (479) T protein:vir:79 68 DDFTKVNNKAINNYHKLLVDQKVGYSVGNPIVFNA---------------------------DDDNLTK----LLNDLL- 115 (479) T ss_pred cccccCcceeecchHHHHHHHHHhhhhcCCceecc---------------------------CCHHHHH----HHHHHH- Confidence 222233 478999999999999998766544421 2223323 334333 Q ss_pred hcCHHHHHHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCcccee--eCCCccccCccccceeeeeecCCHHHH Q lcl|Aclame:pro 152 NCDAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDTMSKEKF 229 (711) Q Consensus 152 ~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~ 229 (711) .|+++.....+.++++++|.||..++++. ++++++..+ +|..++ ||+... +..-++++.|...+ T Consensus 116 ~n~~~~~~~~~~~~~~~~G~~~~~v~~d~------~~~~~i~~~-~p~~~~~v~d~~~~-----~~~~~~ir~y~~~~-- 181 (479) T protein:vir:79 116 GEEFDDTITELYLNASNKGVEWLHPYINR------KGEFKYVII-PAEEAIPIWDSKRQ-----RELVAFIRFYYIED-- 181 (479) T ss_pred hcCHHHHHHHHHHHHHhcCeEEEEEEeCC------CCceEEEEE-ccceeEEEEeCCCC-----CceEEEEEEEEEee-- Confidence 47999999999999999999998887642 256788777 787775 444321 11122222222110 Q ss_pred HHhcCCcccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccce Q lcl|Aclame:pro 230 KALYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKT 309 (711) Q Consensus 230 ~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~ 309 (711) .+.+.+..+|+|....... +....+.....-... . T Consensus 182 -----------------------~~~~~~~~~e~y~~~~i~~--~~~~~~~~~~~~~~~--------------------~ 216 (479) T protein:vir:79 182 -----------------------IDGNKIKRVEYYTENDITY--FIERGNSFIQEFLYD--------------------E 216 (479) T ss_pred -----------------------cCCceEEEEEEEeCCcEEE--EEecCCccccccccc--------------------c Confidence 0112333445554432211 111111111000000 0 Q ss_pred EEEEEEEEecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecc Q lcl|Aclame:pro 310 FKTYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSE 389 (711) Q Consensus 310 ~~v~~~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~ 389 (711) . ...........+..+.|.+.+.+||++|. +...+.|.+..+++.++.+|...|.+.+.+....++.+++.. T Consensus 217 ~-~~~~~~~~~~~~~~~~~~~~~~vPvv~~~-------nn~~g~sd~~~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g 288 (479) T protein:vir:79 217 Y-GKMTDIQEGHFRINNKEQGWGKVPFIPFK-------NNEKCVSDLTFYKSLIDIYDNNISTLADNLDEIQEVIYVLKE 288 (479) T ss_pred c-ccccccccccccccccccCCCcccEEEec-------CCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeec Confidence 0 00000111112234455566677777542 234477999999999999999999999999988888777643 Q ss_pred cccCChHHHHhhcccCCCceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHH Q lcl|Aclame:pro 390 GNVEGREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAII 469 (711) Q Consensus 390 ~av~~~~~~~~~~~~~~~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~ 469 (711) .......+... ..+.+.++.+..+ +.++++..+.-..++...++.....|-..|++++.+.+..+ +.||.|+. T Consensus 289 ~~~~~~~~~~~--~~~~~~~i~~~~~----~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g-n~Sg~Ai~ 361 (479) T protein:vir:79 289 YPGTSLQEFID--NIRYYKSIKVDGG----GGVDKLEINIPVEAKKELLDRLEKNIIIFGQGVNPESQNTG-DKSGVALK 361 (479) T ss_pred CCccccccchh--hhhhccceecCCC----CcceEEeccCCHHHHHHHHHHHHHHHHHHhCcccccccccc-chhHHHHH Confidence 22222222221 2334456666543 23566665555677788888889999999999888776554 46999988 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeee Q lcl|Aclame:pro 470 ARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYD 549 (711) Q Consensus 470 ~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~d 549 (711) .+...........-..|..+++++.++++.++... +. ...++ .+ T Consensus 362 ~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~----------~~-~~~~~-------------------------~~ 405 (479) T protein:vir:79 362 FLYSLLDLKCSKTEKKFKKAIRELLWFVCEYLKIS----------GN-KSYDY-------------------------KT 405 (479) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc----------CC-Ccccc-------------------------cc Confidence 87666655555555666666666666555443211 10 00000 12 Q ss_pred EEeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCC-cchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHH Q lcl|Aclame:pro 550 VVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDW-PGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQT 628 (711) Q Consensus 550 v~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~-~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q 628 (711) +.+.=.+..+....+..+.+..+...++ ...+++.+++ .+.++-.+++++........... T Consensus 406 i~i~f~~~~p~~~~~~a~~~~kl~g~iS------~et~l~~l~~v~d~~~E~~ri~~E~~~~~~~~~~------------ 467 (479) T protein:vir:79 406 VQITFNHSMIINEAEKIDMAAKSTGIVS------DETIVSNHPWVEDVNDELERLKKQEDTQKEYDDL------------ 467 (479) T ss_pred ceEEeCCCCCcCHHHHHHHHHHHhccCc------HHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhc------------ Confidence 2222222322222333334444433222 1223333332 22222222222211100000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 629 EPTPEQQVEMAKSQADMAQAEADTAQA 655 (711) Q Consensus 629 ~~~~~~q~~~~~~q~~~~k~qae~~~a 655 (711) ..... +.....+ T Consensus 468 --------~~~~~-------~~~~~e~ 479 (479) T protein:vir:79 468 --------IPNNQ-------DGVIDET 479 (479) T ss_pred --------cCccc-------CCCcCcC Confidence 00000 0000000 No 86 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=99.75 E-value=7.2e-19 Score=120.04 Aligned_cols=461 Identities=11% Similarity=0.042 Sum_probs=200.4 Q ss_pred CCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCC----CHHHHHHHHHhCCCceEehhhHHHHHHHhhh Q lcl|Aclame:pro 21 YAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQW----PSQVRTERELEQRPCLVNNVLPTFVDQVLGD 96 (711) Q Consensus 21 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw----~~~~~~~~~~~g~p~~~~N~i~~~v~~i~g~ 96 (711) ++ -..+++.++...+ ...+.+..+-.+||+|+|= .......++ .-.++.|..+-+|+..+++ T Consensus 1 ~~----t~~d~i~~L~~~~-------~~~~~r~~~~~~Yy~G~~~i~~~~~~~~~~~~---~~~~~~n~~~~ivd~~~~~ 66 (480) T protein:vir:78 1 MT----TYHEHVERLQGLL-------ARDLPNLLEAEAYRNGTRRLKTIGIGAPPELA---YLDVQPGWVATYLRTLSDR 66 (480) T ss_pred CC----CHHHHHHHHHHHH-------HHHHHHHHHHHHHHhccccchhcccccchhhh---hhhhhcchHHHHHHHHHhh Confidence 11 1222344444432 2345555677899999852 111111111 1125689999999998886 Q ss_pred hhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCccEEEE Q lcl|Aclame:pro 97 QRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESGMGYLRV 176 (711) Q Consensus 97 ~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v 176 (711) ..-+ - |+ .++|.+. ...+..+++.|+++.....++.+++++|.+|+-| T Consensus 67 l~~~---g-~~------------------------~~~d~~~----~~~l~~i~~~N~~~~~~~~~~~~a~~~G~ay~~v 114 (480) T protein:vir:78 67 LDIE---G-FR------------------------ISEDSEG----LEELWNWWQANDLDEESVLGHDDSLTFGRAYITV 114 (480) T ss_pred hccC---c-ee------------------------cCCCchh----HHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEe Confidence 4211 0 10 1123232 3445667788999999999999999999999877 Q ss_pred EEeeccCCCCCCcceEEEecCcccee--eCCCccccCccccceeeeeecCCHHHHHHhcCCcccchhhcccccccccCCC Q lcl|Aclame:pro 177 RSDYLADDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSVADYDTWFT 254 (711) Q Consensus 177 ~~d~~~~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~ 254 (711) +.....+...++.++|..+ +|.+++ |||.... ...+. ++.+...+ . T Consensus 115 ~~~~~~~~d~~~~~~i~~~-~p~~~~~i~D~~~~~----~~~~~-i~~~~~~d--------------------------~ 162 (480) T protein:vir:78 115 SHPDVESGDPAGIPLIRVE-SPLYMYAELDPRNTR----RVTRA-VRLYTTRD--------------------------D 162 (480) T ss_pred ecCccccCCCCCeeEEEEE-cccceEEEEcCCCcc----ceEEE-EEEEEeec--------------------------C Confidence 6422112223466777766 787755 6764431 11222 22221110 0 Q ss_pred CCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEEecCceeccCccCCCCcc Q lcl|Aclame:pro 255 EKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGPVEIPSTTI 334 (711) Q Consensus 255 ~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~le~~~p~~~~~~ 334 (711) .+.+..+++|... .++.+..... . ..+.....++.|.+.|.+ T Consensus 163 ~~~~~~~~~y~~~------------~~~~~~~~~~---------~-----------------~~~~~~~~~~~~~~~g~v 204 (480) T protein:vir:78 163 VAVPDRATLYLPD------------ETVPLRRNGG---------L-----------------NDQWVVDGDVIKHGLGVV 204 (480) T ss_pred CcceEEEEEEeCC------------eEEEEEecCC---------C-----------------cccccccccccccCCCCc Confidence 1122233333221 1111110000 0 000001112334556778 Q ss_pred ceEEEEeeeeccCCcccccchHH-HhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHHHHhh-----cccCCCc Q lcl|Aclame:pro 335 PVIPVWGKSLIIKKKEIFRSIIR-HSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQ-----ANTKNFS 408 (711) Q Consensus 335 P~vp~~~~~~~~~~~~~~~g~v~-~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~~-----~~~~~~~ 408 (711) |+|||.... ..+..+|.|-+. .+++.++.+|+.+|.+...+...+.+..++. |.- .+++..+ .....+. T Consensus 205 Pvv~f~n~~--~~~~~~G~sdi~~~i~~l~Da~~~~~s~~~~~~~~~a~p~~~i~-G~~--~~~~~~~~~~~~~~~~~~~ 279 (480) T protein:vir:78 205 PVVPLTNDP--RLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS-GVT--TDELTNDGENTTLDIYYGR 279 (480) T ss_pred ceEEeeccc--ccCCccCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchhhhhh-CCC--ccccccccccchhhhhhhh Confidence 888775332 233445666665 5899999999999999999888888766553 321 1111111 0111222 Q ss_pred eEEecccccCcCCccccCCcc-chHHHHHHHHHHHHHHHHHhCCCHHHhcccc-chhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 409 LLTYIPQYQGDPGPRRQPPAA-VPAAELTLGQNSVEKIKSTMGMYDASLGAMG-NETSGRAIIARQRQGDRGSFAFIDNL 486 (711) Q Consensus 409 ~i~~~~~~~~~~~i~~~~~~~-~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~-~~~sg~ai~~~~~~~~~~~~~~~dn~ 486 (711) ++... + ..+.+.+.+. ....+...+......+-.++|+++..+|..+ |..||.|+..+...-.......-+.| T Consensus 280 ~~~~~-~----~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~f 354 (480) T protein:vir:78 280 ILTLA-S----EAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIF 354 (480) T ss_pred hccCC-C----CCceEEecCccCHHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHH Confidence 22222 1 1122332222 2233444455555555556788888888554 45799999887666555555555566 Q ss_pred HHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEeecccChHHHHHHHH Q lcl|Aclame:pro 487 TKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAA 566 (711) Q Consensus 487 ~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~ 566 (711) ..++++++++++. +.. .....++..+ .+.=.+...-...+.. T Consensus 355 ~~~l~~~~rl~~~----~~~---------~~~~~~~~~i-------------------------~v~w~~~~~~s~~~~a 396 (480) T protein:vir:78 355 GGAWERAMRIAMQ----IMG---------REVTEEYTRL-------------------------ETVWRDPSTPTVAAKA 396 (480) T ss_pred HHHHHHHHHHHHH----HcC---------CCccccceee-------------------------eEEecCCCCCCHHHHH Confidence 6666666665443 321 1111111111 0100000000112233 Q ss_pred HHHHHHHhhcchhHHHHHHHHHHhcCCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHHHHHHHH-HHHHHHHH- Q lcl|Aclame:pro 567 EAMIQFAQAVPSAAAVMADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQV-EMAKSQAD- 644 (711) Q Consensus 567 ~~L~~l~~~~p~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~-~~~~~q~~- 644 (711) +.+..+.+..... +....+++.+++. +.-.+.+++....... ........ ....+.+. T Consensus 397 d~~~kl~~~g~~~--~s~et~~~~lg~~--~d~~~e~~~~~~~~~~----------------~~~~~~~~~~~~~~~~~~ 456 (480) T protein:vir:78 397 DAVSKLYANGQGP--IPKEQARIDLGYT--ATQREQMRDWDKQETE----------------DMIDTLYSTTKAQADATP 456 (480) T ss_pred HHHHHHHHhcccC--CCHHHHHhcCCCC--HhHHHHHHHHHHHHHH----------------HHHHHhhccccCCCcccc Confidence 3444444322110 1111223333332 1111111100000000 00000000 00000000 Q ss_pred ---HHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 645 ---MAQAEADTAQAQADMLKAQLE 665 (711) Q Consensus 645 ---~~k~qae~~~aqae~~~~q~~ 665 (711) ......+.+.+...+-++... T Consensus 457 ~~~~~~~~~~~~~~~~~~~~~~~~ 480 (480) T protein:vir:78 457 KPTVTETKTETQTSPSGFNRTKTR 480 (480) T ss_pred CCCCCCCCCccCCCcccCCCcCCC Confidence 000000000000000000000 No 87 >protein:vir:6322 Length: 510 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877469;genbank:gi:33300841;uniprot:Q7Y2D5;genbank:GeneID:1482611 Probab=99.74 E-value=3.9e-15 Score=99.60 Aligned_cols=498 Identities=11% Similarity=0.015 Sum_probs=240.3 Q ss_pred HHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCCceEehhhHHHHHHHhhhhhh-----ccccee Q lcl|Aclame:pro 31 LLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRPCLVNNVLPTFVDQVLGDQRQ-----NRPAIK 105 (711) Q Consensus 31 ~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~v~~i~g~~~~-----~r~~~~ 105 (711) .-.++..+|.+.. ...|...|.++.+|..-.=..+.--.......+ ..-..-...++...+..-. +++=.+ T Consensus 1 mk~~~~~~~~~lk--R~~~e~~w~e~a~~tlP~~~~~~~~~~~~~~~~--~~dstg~~a~~~LAa~l~~~ltpp~~~WF~ 76 (510) T protein:vir:63 1 MKTTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEH--DFQSAGALLVNNLAAKLARSLFPTGIPFFR 76 (510) T ss_pred ChhHHHHHHHHHh--ccchHHHHHHHHHhhccccCCCCCCccccccCC--CccchHHHHHHHHHHHHHhhhcCCCCcccc Confidence 2234555555442 557888899999988532111100000000111 1122222223333222111 111122 Q ss_pred EecchhhhhhhhhcccccccccccCCCchh-HHHH---HHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCccEEEEEEeec Q lcl|Aclame:pro 106 VSSTEVTRVPDAESGEDTTLKISNVAGKND-YELA---EVFTGLIKNIEYNCDAETEYDIAFQGAVESGMGYLRVRSDYL 181 (711) Q Consensus 106 ~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d-~~~A---e~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~ 181 (711) +.+.+.... . ...+..+ .+.. +.++..+......|+|..+...++.+.+..|++++-+ + T Consensus 77 l~~~d~~~~------------~-~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Li~~G~a~l~~--~-- 139 (510) T protein:vir:63 77 SELTDAIRR------------E-ADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR--D-- 139 (510) T ss_pred cCCChHHhh------------c-ccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEE--c-- Confidence 322210000 0 0000111 1122 2345556666778999999999999999989886532 2 Q ss_pred cCCCCCCcceEEEecCccceeeCCCccccCccccceeeeeecCCHHHHHHhcCCcccchhhcccccccccCCCCCeEEEE Q lcl|Aclame:pro 182 ADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSVADYDTWFTEKSVRVS 261 (711) Q Consensus 182 ~~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~ 261 (711) +++ ..++.+ +..++++..++.- ...=++++.+|+..+|-+.|+........ .....+.|.++ T Consensus 140 -~~~----~~~~~~-pl~~y~v~~d~~G----~vd~i~rr~~~t~~~l~e~~~~~~~~~~~--------~~~~~~~v~v~ 201 (510) T protein:vir:63 140 -SDA----ATVVAW-SLRSYAVRRDATG----RWMDIVLKQRYKSKDLDEEYKQDLMRAGR--------NLSGSGSVDLY 201 (510) T ss_pred -CCC----cEEEEE-EcceeEEeeCCCc----CeeEEEeeeeccHHHHhHHhhhhhhcccc--------ccCCCcceEEE Confidence 222 235555 5667777655431 22337889999999998777654322110 00112344455 Q ss_pred EeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEEecCceeccCccCCCCccceEEEEe Q lcl|Aclame:pro 262 EYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGPVEIPSTTIPVIPVWG 341 (711) Q Consensus 262 E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~le~~~p~~~~~~P~vp~~~ 341 (711) .+-++.+. +.. .+..++.-..|..+. ..+-|++.++||+|+- T Consensus 202 ~~V~~~~~-----------------------------------~~~-~~~sv~~e~dg~~~~-~~~~~~~~e~P~~~~R- 243 (510) T protein:vir:63 202 THVQRKKG-----------------------------------TAM-EYAELYHEIDGVRVG-KEGRWPIHLCPYIVPT- 243 (510) T ss_pred EEEEeecC-----------------------------------CCc-eEEEEEEEecCceec-cccccccccCceeeee- Confidence 44433211 011 122222223444444 2345778889999764 Q ss_pred eeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHHHHhhcccCCCceEEecccccCcCC Q lcl|Aclame:pro 342 KSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNFSLLTYIPQYQGDPG 421 (711) Q Consensus 342 ~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~~~~~~~~~~i~~~~~~~~~~~ 421 (711) +...++..||.|.+....+--+.+|++....+.....+.++.++++++.+.+.+... ...+|.++ +|...+.. T Consensus 244 -w~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~p~g~~~~~~~~---~~~~g~~v---~g~~~~v~ 316 (510) T protein:vir:63 244 -WNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQ---DAEMGDYV---PGGAEAVR 316 (510) T ss_pred -eeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccCcccccchhhhc---cCCCceee---cCCcccce Confidence 344688999999999999999999999999999999999999999988876665432 23344443 33332222 Q ss_pred ccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHH Q lcl|Aclame:pro 422 PRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTK-SIRRVGKILVEM 500 (711) Q Consensus 422 i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~-~~~~~~~~~l~l 500 (711) +-........+.....++...+.|....=+ + ....++.+.|+.-|..+.+-....+...+.+|.. +...+.+..+.+ T Consensus 317 ~~~~~~~~d~~~~~~~i~~~~~rI~~af~~-~-l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~i 394 (510) T protein:vir:63 317 AYERGDYNKMAAIQQSLQAVVVRLNQAFMY-G-ANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSE 394 (510) T ss_pred eeecCcccchHHHHHHHHHHHHHHHHHHHh-h-cccCCCCCcCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHH Confidence 222233344555667777777777765311 1 1113344578889999888888888887777753 666666666655 Q ss_pred HHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEeecccChHHHHHHHHHHHHHHHhhcchhH Q lcl|Aclame:pro 501 IPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAVPSAA 580 (711) Q Consensus 501 i~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~ 580 (711) +.... ++-+.- + . + +-.++ ++.+ +-.|.+..+.+..+.+.+..++ T Consensus 395 l~r~g----l~p~p~-----~-------~-------------~---~~~~v--~~is-~Laraq~~~~l~~~~q~l~~~~ 439 (510) T protein:vir:63 395 VDDAL----LQGLIT-----K-------Q-------------H---KPAIE--TGLP-ALSRSAAVQSMLNASQVIAGLA 439 (510) T ss_pred HHhcc----CCCCCc-----h-------h-------------c---cccee--cchh-HHHHHHHHHHHHHHHHHHHHhc Confidence 54321 110100 0 0 0 00111 1111 1122223333333333222111 Q ss_pred HHHHHHHHHhcCCcchHHHHHHHHhhhcc--hhhcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 581 AVMADLIAQNMDWPGADVIAERLKKIVPP--NVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEADTAQAQAD 658 (711) Q Consensus 581 ~~~~~~~~~~~~~~~~~e~~~~l~~~~~~--~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~~~~k~qae~~~aqae 658 (711) +. ....+.-+.+++.+.+....+- ......+.+. ++..++++++.++++++.+.+ + .++ T Consensus 440 ~~-----aq~~~~id~d~~~~~~a~~~Gv~p~~ivrs~eev-~a~~~~~~qq~~~~~~~~~~~---~---------~~a- 500 (510) T protein:vir:63 440 PI-----AQLDPRISLPKMMDTIWAAFSVDTSQFYKSADEL-QAEAEQQRQQAAQAQAAQETL---L---------EGA- 500 (510) T ss_pred Cc-----hhhhccCCHHHHHHHHHHHhCCChhHhcCCHHHH-HHHHHHHHHHHHHHHHHHHHH---H---------HHH- Confidence 11 1122333677777777666553 1222121111 111110000000000000000 0 000 Q ss_pred HHHHHHHHHHHHH Q lcl|Aclame:pro 659 MLKAQLETEEAQK 671 (711) Q Consensus 659 ~~~~q~~~~~~q~ 671 (711) .++....+.+ T Consensus 501 ---~~~~~~~~g~ 510 (510) T protein:vir:63 501 ---SDMTNALAGV 510 (510) T ss_pred ---HhhcccccCC Confidence 0000000011 No 88 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=99.73 E-value=3e-18 Score=116.67 Aligned_cols=461 Identities=12% Similarity=0.068 Sum_probs=199.6 Q ss_pred cCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCC----CCHHHHHHHHHhCCCceEehhhHHHHHHHhhhhh Q lcl|Aclame:pro 23 KNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQ----WPSQVRTERELEQRPCLVNNVLPTFVDQVLGDQR 98 (711) Q Consensus 23 ~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Q----w~~~~~~~~~~~g~p~~~~N~i~~~v~~i~g~~~ 98 (711) -. -.++++.++...+ ...+.+..+-.+||+|+| +.......++. -.++.|..+-+|+..+++.. T Consensus 1 ~~--t~~~~i~~L~~~~-------~~~~~r~~~l~~Yy~G~~~i~~~~~~~~~~~~~---~~~~~n~~~~ivd~~~~~l~ 68 (480) T protein:vir:78 1 MT--TYHEHVERLQGLL-------ARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAY---LDVQPGWVATYLRTLSDRLD 68 (480) T ss_pred CC--CHHHHHHHHHHHH-------HHHHHHHHHHHHHHhccccccccccccchhHhh---hhhhcchHHHHHHHHHhhhc Confidence 11 2223445554433 234555667789999975 11111111111 12678999999998888642 Q ss_pred hcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCccEEEEEE Q lcl|Aclame:pro 99 QNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESGMGYLRVRS 178 (711) Q Consensus 99 ~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~ 178 (711) -+ - |. ..+|.+.. ..+..+++.|+++.....++.+++++|.+|.-|+. T Consensus 69 ~~---g-~~------------------------~~~d~~~~----~~l~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~ 116 (480) T protein:vir:78 69 IE---G-FR------------------------ISEDSEGL----EELWNWWQANDLDEESVLGHDDSLTFGRSYITVSH 116 (480) T ss_pred cC---c-ee------------------------cCCCchhH----HHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEEec Confidence 11 1 10 12233332 34456778899999999999999999999887764 Q ss_pred eeccCCCCCCcceEEEecCcccee--eCCCccccCccccceeeeeecCCHHHHHHhcCCcccchhhcccccccccCCCCC Q lcl|Aclame:pro 179 DYLADDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSVADYDTWFTEK 256 (711) Q Consensus 179 d~~~~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~ 256 (711) ....+..-++.+++..+ +|.+++ |||.... ...+. ++.+.+.+ ..+ T Consensus 117 ~~~~~~d~~g~~~i~~~-~p~~~~~~~D~~~~~----~~~~~-i~~~~~~~--------------------------~~~ 164 (480) T protein:vir:78 117 PDVESGDPAGIPLIRVE-SPLYMYAELDPRNTR----RVTRA-VRLYTTRD--------------------------DVA 164 (480) T ss_pred CccccCCCCCeeEEEEE-cccceEEEEcCCCcc----ceEEE-EEEEEeec--------------------------CCC Confidence 32222233467777766 787766 6664321 11111 22221100 011 Q ss_pred eEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEEecCceeccCccCCCCccce Q lcl|Aclame:pro 257 SVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGPVEIPSTTIPV 336 (711) Q Consensus 257 ~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~le~~~p~~~~~~P~ 336 (711) .+...++|.... ++.+..... . ..+.....++.|.+.+.+|+ T Consensus 165 ~~~~~~~y~~~~------------~~~~~~~~~---------~-----------------~~~~~~~~~~~~~~~g~vPv 206 (480) T protein:vir:78 165 VPDRATLYLPDE------------TVPLRRNGG---------L-----------------NDQWVVDGDVIKHGLGVVPV 206 (480) T ss_pred ceEEEEEEeCCe------------EEEEEecCC---------C-----------------ccccccccccccCCCCCcce Confidence 222333332211 111110000 0 00000011233455677888 Q ss_pred EEEEeeeeccCCcccccchHH-HhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHHHHhhc-----ccCCCceE Q lcl|Aclame:pro 337 IPVWGKSLIIKKKEIFRSIIR-HSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQA-----NTKNFSLL 410 (711) Q Consensus 337 vp~~~~~~~~~~~~~~~g~v~-~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~~~-----~~~~~~~i 410 (711) +||...+ ..+..+|.|.+. .+++.++.+|+.+|.+...+...+.+..++. |.- .+++..+. ....+.++ T Consensus 207 v~f~n~~--~~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~-G~~--~~~~~~~~~~~~~~~~~~~~~ 281 (480) T protein:vir:78 207 VPLTNDP--RLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS-GVT--TDELTNDGENTTLDIYYGRIL 281 (480) T ss_pred EEeeccc--ccCCccCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhh-cCC--ccccccccccchhhhhhhhhc Confidence 8775332 233445666665 5899999999999999999887777765553 321 11111110 11122222 Q ss_pred EecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhcccc-chhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 411 TYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMG-NETSGRAIIARQRQGDRGSFAFIDNLTKS 489 (711) Q Consensus 411 ~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~-~~~sg~ai~~~~~~~~~~~~~~~dn~~~~ 489 (711) ... + .+..+...+... ...+...+......+-.++|+++..+|..+ |..||.|+..+...-..........|..+ T Consensus 282 ~~~-~--~~~~~~~~~~~~-~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Alk~~~~~l~~ka~~~~~~f~~~ 357 (480) T protein:vir:78 282 TLA-S--EAAKISEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGA 357 (480) T ss_pred cCC-C--CCceEEecCccC-HHHHHHHHHHHHHHHhcccCCChHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 221 1 112232222221 233444455555555556889988988654 45799998877655555555555555556 Q ss_pred HHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEeecccChHHHHHHHHHHH Q lcl|Aclame:pro 490 IRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAM 569 (711) Q Consensus 490 ~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L 569 (711) +++++++++ .+.. ..+..++..+ +++-. -+...+ ..+..+.+ T Consensus 358 l~~~~~l~~----~~~g---------~~~~~~~~~i-----------------------~v~f~-~~~~~s-~~~~ad~~ 399 (480) T protein:vir:78 358 WERAMRIAM----QIMG---------REVTEEYTRL-----------------------ETVWR-DPSTPT-VAAKADAV 399 (480) T ss_pred HHHHHHHHH----HHcC---------CCccccceee-----------------------eEEec-CCCCCC-HHHHHHHH Confidence 666655544 3322 1111111111 11100 001111 12233334 Q ss_pred HHHHhhcchhHHHHHHHHHHhcCCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHH--- Q lcl|Aclame:pro 570 IQFAQAVPSAAAVMADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQADMA--- 646 (711) Q Consensus 570 ~~l~~~~p~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~~~~--- 646 (711) .++.+..... +....+++.+++. +.-.+.+++.... ..+..........+..+... T Consensus 400 ~kl~~~g~~~--~s~et~~~~lg~~--~d~~~~~~~~~~e-----------------~~~~~~~~~~~~~~~~~~~~~~~ 458 (480) T protein:vir:78 400 SKLYANGQGP--IPKEQARIDLGYT--ATQREQMRDWDKQ-----------------ETEDMIDTLYSTTKAQADATPKP 458 (480) T ss_pred HHHHHhcccc--CCHHHHHhcCCCC--HhHHHHHHHHHHH-----------------HHHHHHHHhhccccccCCCCCCC Confidence 4443322111 1112223333332 1111111110000 00000000000000000000 Q ss_pred ---HHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 647 ---QAEADTAQAQADMLKAQLE 665 (711) Q Consensus 647 ---k~qae~~~aqae~~~~q~~ 665 (711) ....+.+.+-...-+++.. T Consensus 459 ~~~~~~~~~~~~~~~~~~~~~~ 480 (480) T protein:vir:78 459 TVTETKTETQTSPSGFNRTKTR 480 (480) T ss_pred CCCCCCCccccccCCCCcccCC Confidence 0000000000000000000 No 89 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=99.73 E-value=5.9e-17 Score=109.54 Aligned_cols=465 Identities=11% Similarity=0.029 Sum_probs=209.5 Q ss_pred CCCCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHH----HHHHHHhCCCce Q lcl|Aclame:pro 6 KKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQV----RTERELEQRPCL 81 (711) Q Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~----~~~~~~~g~p~~ 81 (711) ..++|+++++.. .+...+..+...+.. .+.+-.+-.+||.|+|.-... ...++ .-.+ T Consensus 1 ~~~~i~~~~~~~---------~~~~~~~~L~~~~~~-------~~~r~~~~~~YY~G~~~i~~~~~~~~~~~~---~~~~ 61 (485) T protein:vir:24 1 MTAPLPGQEEIA---------DPAIARDEMVSAFED-------QNQNLRSNTSYYEAERRPEAIGVTVPVQMQ---SLLA 61 (485) T ss_pred CCCCCCCCCccc---------chHHHHHHHHHHHHH-------HHHHHHHHHHHHhccCchhhcCcccchhhh---hhhh Confidence 777777764433 223334444443322 223333456899999853221 11111 1135 Q ss_pred EehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHH Q lcl|Aclame:pro 82 VNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDI 161 (711) Q Consensus 82 ~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~ 161 (711) +.|..+.+|+..+++..-+ -|+. +++... ...+..++..|+++...+. T Consensus 62 ~~n~~~~ivd~~~~~l~~~----g~~~------------------------~~~~~~----~~~l~~i~~~N~~d~~~~~ 109 (485) T protein:vir:24 62 HVGYPRLYVDSIAERQAVE----GFRL------------------------GDADEA----DEELWQWWQANNLDIEAPL 109 (485) T ss_pred ccchHHHHHHHHhhhhccC----ceec------------------------CCCchh----HHHHHHHHHhcChhHHHHH Confidence 6799999999888764211 0111 122222 2334566778999999999 Q ss_pred HHHHHHhcCccEEEEEEeeccCCC--CCCcceEEEecCccce--eeCCCccccCccccceeeeeecCCHHHHHHhcCCcc Q lcl|Aclame:pro 162 AFQGAVESGMGYLRVRSDYLADDS--FEQDLIIEAIQNQFSV--TIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDAT 237 (711) Q Consensus 162 a~~~~~~~G~g~~~v~~d~~~~~~--~~~~i~i~~v~~~~~v--~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~ 237 (711) +..+++++|++|.-|+.+...... ..+.++|..+ +|.++ +||+....+ .++.+.+-+ . T Consensus 110 ~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~~i~~~-~p~~~~~i~D~~~~~~------~~~~~~~~~----------~- 171 (485) T protein:vir:24 110 GYTDAYVHGRSYITISRPDPQIDLGWDPNVPLIRVE-PPTRMYAEIDPRIGRP------AKAIRVAYD----------A- 171 (485) T ss_pred HHHHHhhcCceEEEEecCCcccccccCCCcceEEEe-ccceeEEEeeCCcCce------eEEEEEEEe----------e- Confidence 999999999999988876543322 2355677766 78777 477654321 112221110 0 Q ss_pred cchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEE Q lcl|Aclame:pro 238 AEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKI 317 (711) Q Consensus 238 ~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~ 317 (711) ..+.+..+++|.... ++.+ .-. T Consensus 172 ----------------~~~~~~~~~~y~~~~------------~~~~------------------------------~~~ 193 (485) T protein:vir:24 172 ----------------EGNEIQAATLYTPNE------------TFGW------------------------------FRA 193 (485) T ss_pred ----------------cCCeEEEEEEEcCCc------------EEEE------------------------------Eec Confidence 012233333332221 1111 001 Q ss_pred ecCceeccCccCCCCccceEEEEeeeeccCCcccccchHH-HhhHHHHHHHHHHHHHHHHHHhcCCCceEecccc-cCC- Q lcl|Aclame:pro 318 TGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIR-HSKDAQRMANYWDSAATETVALAPKAPFIGSEGN-VEG- 394 (711) Q Consensus 318 ~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~-~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~a-v~~- 394 (711) .|...+....|.+.+.+|+|||...+. .+..+|.|-+. .+++.++.+|+.+|.+..++...+.+..++- |. ... T Consensus 194 ~~~~~~~~~~~h~~g~vPvv~f~n~~~--~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~-G~~~~~~ 270 (485) T protein:vir:24 194 EGEWVEWFSDPHGLGAVPVVPLPNRTR--LSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIF-GIKPEEI 270 (485) T ss_pred CCceEeecccccCCCcccEEEeccCcc--cCCcCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhc-cCCcccc Confidence 111122233455667888888753322 23334445443 6899999999999999999888887766553 21 111 Q ss_pred --hHHHHhh-cccCCCceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHH---hCCCHHHhcccc-chhHHHH Q lcl|Aclame:pro 395 --REDEWEQ-ANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKST---MGMYDASLGAMG-NETSGRA 467 (711) Q Consensus 395 --~~~~~~~-~~~~~~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~---tGv~~~~~G~~~-~~~sg~a 467 (711) .++-... ....++.++.. ++ .+..+...+.. .+-.+++.....|..+ +++++..+|..+ |..||.| T Consensus 271 ~~~~~~~~~~~~~~~~~i~~~-~~--~~~~~~q~~~~----~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~A 343 (485) T protein:vir:24 271 GVDPETGQTLFDAYLARILAF-ED--AEGKIQQFSAA----ELANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEA 343 (485) T ss_pred ccccccccchhhhcccceecc-CC--CCceEEeeccc----chHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHH Confidence 0000000 01223333322 22 12223223222 2334455555555554 678888888555 5579999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhhee Q lcl|Aclame:pro 468 IIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQK 547 (711) Q Consensus 468 i~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~ 547 (711) +..+...-..........|..++++++++++.+...- + ...++.. T Consensus 344 l~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~~~----------~--~~~d~~~----------------------- 388 (485) T protein:vir:24 344 IRAAESRLIKKVERKNAIFGGAWEEAMRLAYRLMKGG----------D--VPPDMLR----------------------- 388 (485) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC----------C--Cccccce----------------------- Confidence 9988777777777777777777777777766542210 0 0001100 Q ss_pred eeEEeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHH Q lcl|Aclame:pro 548 YDVVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQ 627 (711) Q Consensus 548 ~dv~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 627 (711) +.+.=.+.......+..+.+..+.+.... .+....+++++++.. ...+.+++............ T Consensus 389 --i~v~f~~~~~~s~~~~ad~~~kl~~~g~~--~~s~et~~~~l~~~~--d~~~e~~~~~ee~~~~~~~~---------- 452 (485) T protein:vir:24 389 --METVWRDPSTPTYAAKADAATKLYGNGQG--VIPRERARKDMGYSI--AEREEMRRWDEEEAAMGLGL---------- 452 (485) T ss_pred --eeEEecCCCCCCHHHHHHHHHHHHhcccc--cCCHHHHHhhCCCCH--hHHHHHHHHHHHHhhhhhhH---------- Confidence 00000101111112222333333322110 011122234444421 11111111110000000000 Q ss_pred HHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 628 TEPTPEQQVEMAKSQADM--AQAEADTAQAQADMLKAQLETE 667 (711) Q Consensus 628 q~~~~~~q~~~~~~q~~~--~k~qae~~~aqae~~~~q~~~~ 667 (711) . ..+....... .....+....+.. ..-.+.+ T Consensus 453 ------~-~~~~~~~~~~~~~~~~~e~~~~~~~--~~~~~~a 485 (485) T protein:vir:24 453 ------L-GTMVDADPTVPGSPNPTPAPKPQPA--IEGGDSA 485 (485) T ss_pred ------H-HhhcccCCCCCCCCCCCCCCCCccC--CCCCCCC Confidence 0 0000000000 0000000000000 0000000 No 90 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=99.73 E-value=8.6e-17 Score=108.65 Aligned_cols=482 Identities=11% Similarity=0.027 Sum_probs=225.1 Q ss_pred CCcCCCCCCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCC- Q lcl|Aclame:pro 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRP- 79 (711) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p- 79 (711) ||=-=...=++.+ +..+.+++.++.+. ......+..+-.+||.|.| +-.....+..+++ T Consensus 1 ~~~~~~~~~~~~~-----------~~~~~~~i~~~i~~-------~~~~~~~~~~l~~Yy~g~~--~i~~~~~~~~~~~~ 60 (499) T protein:vir:10 1 MAVVIDKDLLDDV-----------NEPNIEAINYAIRE-------LQNRKKRLDKLSDYYNGKQ--EIEKHEFDNATVEA 60 (499) T ss_pred CccchhhhHHhhh-----------hcCCHHHHHHHHHH-------HHHHHHHHHHHHHHhcccc--chhcCCcCcCCCCc Confidence 3211000000000 01112234443332 2333455566789999975 1111111222333 Q ss_pred -ceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHH Q lcl|Aclame:pro 80 -CLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETE 158 (711) Q Consensus 80 -~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~ 158 (711) .+++|..+.+|+..+|+.-.+.+.+.. .+.+..+ .+..+++.|+++.. T Consensus 61 ~ki~~n~~~~Iv~~~~~~l~g~p~~~~~---------------------------~~~~~~~----~l~~~~~~n~~~~~ 109 (499) T protein:vir:10 61 ANVMVNHAKYITDMNVGFMTGNPVKYVA---------------------------EKGKNID----DILEVFNQIDIHKH 109 (499) T ss_pred ceeecchHHHHHHHHhhhhcccCceeec---------------------------CChhHHH----HHHHHHhhcCHhHH Confidence 467899999999999988766554432 1222222 34556777999999 Q ss_pred HHHHHHHHHhcCccEEEEEEeeccCCCC-----------CCcceEEEecCcccee--eCCCccccCccccceeeeeecCC Q lcl|Aclame:pro 159 YDIAFQGAVESGMGYLRVRSDYLADDSF-----------EQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDTMS 225 (711) Q Consensus 159 ~~~a~~~~~~~G~g~~~v~~d~~~~~~~-----------~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~ 225 (711) ...+..+++++|.+|..++.+......+ ..++++..| +|+++| ||.... .-...+++.+.+ T Consensus 110 ~~~~~~~~~~~G~~~~~v~~~~~g~~~~~~~~~~~~~~~~~~~~~~~v-~p~~~~~v~~d~~~-----~~~~~~i~~~~~ 183 (499) T protein:vir:10 110 DIELEKDLSVFGYGYELLYLKKTDPISVRDELGNEKLTPNTELKIEVI-DPRATVVVCDDTVE-----HDPLFAVFTQEK 183 (499) T ss_pred HHHHHHHHHhcCceEEEEEecccccccccccccccccccccceEEEEE-cccceEEEecCCCC-----cceEEEEEEEEE Confidence 9999999999999998777654322111 123445444 455443 211110 001111111111 Q ss_pred HHHHHHhcCCcccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhc Q lcl|Aclame:pro 226 KEKFKALYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTR 305 (711) Q Consensus 226 ~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~ 305 (711) .+ ....+.++.+++|..... +.+...... . T Consensus 184 ~~------------------------~~~~~~~~~~~iyt~~~i----~~~~~~~~~------~---------------- 213 (499) T protein:vir:10 184 KD------------------------LEGNTNGYSITVYMPQRI----VEYRTKTTM------E---------------- 213 (499) T ss_pred ee------------------------cCCCceEEEEEEEeCCeE----EEEEecCCc------c---------------- Confidence 00 001223444455543321 111000000 0 Q ss_pred ccceEEEEEEEEecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCce Q lcl|Aclame:pro 306 KVKTFKTYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPF 385 (711) Q Consensus 306 ~~~~~~v~~~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~ 385 (711) ..+...+....|.+.|.+|+|+|. +...+.|.+..++++++.+|...|.+.+.+...+.+.+ T Consensus 214 -----------~~~~~~~~~~~~~~~g~vPvv~~~-------n~~~~~~d~e~v~~liD~~~~~~S~~~~~~~~~~~~~l 275 (499) T protein:vir:10 214 -----------VSANDPIVYDGENLFGAVPIIEFR-------NNEERQGDFEQLISLIDAYNLLQTDRISDKEAFVDALL 275 (499) T ss_pred -----------ccCcceecccccCCCCccceEEec-------CCCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhcCcee Confidence 001111112234455677776542 23457899999999999999999999999998888888 Q ss_pred EecccccCChHHHHhhcccCCCceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHH Q lcl|Aclame:pro 386 IGSEGNVEGREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSG 465 (711) Q Consensus 386 ~~~~~av~~~~~~~~~~~~~~~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg 465 (711) ++.-..+....+... ....+.++....+ .+..++++..+.-..++...++...+.|...|++++.+-+.-+++.|| T Consensus 276 v~~G~~~~~~~~~~~--~~~~~~~~~~~~~--~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~gn~Sg 351 (499) T protein:vir:10 276 VTFGFGLGDDKDDIQ--RLKRGAIEAPPRE--EGADIEWLTKSFDETQVNLLSQSIENDIHKISYVPNMNDEKFMGNVSG 351 (499) T ss_pred eeecCccccccchhh--hhhhcceeccCCC--CCCcceEEeccCCHHHHHHHHHHHHHHHHHHhCcccCCchhhcccchH Confidence 775333332221111 1234444433322 223466776666677788888999999999999887666655566899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhh Q lcl|Aclame:pro 466 RAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNV 545 (711) Q Consensus 466 ~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~ 545 (711) .|+..+..............|..+++++.++++.++. +.|. ..++. T Consensus 352 ~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~----------~~~~--~~d~~---------------------- 397 (499) T protein:vir:10 352 EAMKFKLFGLENLLSIKQRYFFDGLRRRLKLIQTIVN----------IKGA--NDDAS---------------------- 397 (499) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----------ccCC--ccccc---------------------- Confidence 9998887766666666666666666666666665432 1121 11110 Q ss_pred eeeeEEeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCC-cchHHHHHHHHhhhcchhhcchhhhhhh-hh Q lcl|Aclame:pro 546 QKYDVVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDW-PGADVIAERLKKIVPPNVLSKDEREAIE-ED 623 (711) Q Consensus 546 ~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~-~~~~e~~~~l~~~~~~~~~~~~~~~~~~-~~ 623 (711) ++.+.=.+..+....+..+.+..+...++. ..+++.+++ .+.++-.+++.+................ .. T Consensus 398 ---~i~i~f~~~~p~n~~e~~~~~~kl~g~iS~------et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~ 468 (499) T protein:vir:10 398 ---GCKISLVANIPSNLSDVVNNVKNADGIIPR------KYTYSWLPDVDNPQDVIDEMNQQDAETIKKNQEALRGQDPD 468 (499) T ss_pred ---cceEEeCCCCCCCHHHHHHHHHHHhccCCh------HHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccCCCC Confidence 111222222232233344444444332222 233344433 2333333444332211100000000000 00 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 624 MPEQTEPTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQ 663 (711) Q Consensus 624 ~~~~q~~~~~~q~~~~~~q~~~~k~qae~~~aqae~~~~q 663 (711) ...........+... .+...+.++.-+.++- T Consensus 469 ~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~ 499 (499) T protein:vir:10 469 RLELEDKQDDSSEND---------KEAGSNHNQSHRTRAV 499 (499) T ss_pred CCCCCCCCcccCCCC---------CCCccccccCCCCCCC Confidence 000000000000000 0000000000000000 No 91 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=99.72 E-value=4.4e-17 Score=110.23 Aligned_cols=472 Identities=11% Similarity=0.047 Sum_probs=203.0 Q ss_pred CCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCC----CCHHHHHHHHHhCCCceEehh Q lcl|Aclame:pro 10 VEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQ----WPSQVRTERELEQRPCLVNNV 85 (711) Q Consensus 10 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Q----w~~~~~~~~~~~g~p~~~~N~ 85 (711) |.+. ...+...++.++...+.. .+.+...-.+||+|+| +........+. . .++.|. T Consensus 1 ~~~~----------~~~d~~~~i~~L~~~~~~-------~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~-~--~~~~n~ 60 (488) T protein:vir:23 1 MAET----------ESIDPEKLRDQLLDAFEN-------KQNELKSSKAYYDAERRPDAIGLAVPLDMRK-Y--LAHVGY 60 (488) T ss_pred CCcc----------cCCCHHHHHHHHHHHHHH-------HHHHHHHHHHHHhcccchhhcCcccchhhhh-h--hhhcch Confidence 2221 123344556665543333 2334445578999986 22222222211 1 256788 Q ss_pred hHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccC-CCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHH Q lcl|Aclame:pro 86 LPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNV-AGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQ 164 (711) Q Consensus 86 i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~-~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~ 164 (711) .+-+|+..+....-+- + +.|.. +... ...+|.+..+. +..+++.|+++.....+.. T Consensus 61 ~~~ivd~~a~~l~~~G--f-~~~~~----------------~~~~~~~~~d~~~~~~----l~~i~~~N~~~~~~~~~~~ 117 (488) T protein:vir:23 61 PRTYVDAIAERQELEG--F-RIPSA----------------NGEEPESGGENDPASE----LWDWWQANNLDIEATLGHT 117 (488) T ss_pred HHHHHHHHHHhhhccc--e-eccCC----------------cccccccccchhHHHH----HHHHHHhcChhHHHHHHHH Confidence 8888887764221110 0 01100 0000 12234444333 4566889999999999999 Q ss_pred HHHhcCccEEEEEEeeccCC--CCCCcceEEEecCcccee--eCCCccccCccccceeeeeecCCHHHHHHhcCCcccch Q lcl|Aclame:pro 165 GAVESGMGYLRVRSDYLADD--SFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEP 240 (711) Q Consensus 165 ~~~~~G~g~~~v~~d~~~~~--~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~ 240 (711) +++++|++|+-|+....... +..+.++|..+ +|.+++ |||.... ...+.+.+-+. T Consensus 118 ~a~i~G~a~~~v~~~~~~~~~~~~~~~~~i~~~-~p~~~~~~~d~~~~~------~~~~~~~~~~~-------------- 176 (488) T protein:vir:23 118 DALIYGTAYITISMPDPEVDFDVDPEVPLIRVE-PPTALYAEVDPRTRK------VLYAIRAIYGA-------------- 176 (488) T ss_pred HHhhcCceEEEEecCCcccccCCCCCcceEEEe-ccceeEEEEecCCCc------eEEEEEEEEec-------------- Confidence 99999999987765432111 12344566555 787665 7764322 12222222100 Q ss_pred hhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEEecC Q lcl|Aclame:pro 241 VYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGA 320 (711) Q Consensus 241 ~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~ 320 (711) +...+..+++|.... ++++ ....|. T Consensus 177 -------------~~~~~~~~~~y~~~~------------~~~~------------------------------~~~~~~ 201 (488) T protein:vir:23 177 -------------DGNEIVSATLYLPDT------------TMTW------------------------------LRAEGE 201 (488) T ss_pred -------------CCCcEEEEEEEecCc------------EEEE------------------------------EecCCc Confidence 011222233332211 1100 001111 Q ss_pred ceeccCccCCCCccceEEEEeeeeccCCcccccchH-HHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccC-ChH-- Q lcl|Aclame:pro 321 NVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSII-RHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVE-GRE-- 396 (711) Q Consensus 321 ~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v-~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~-~~~-- 396 (711) -.+++..|.+.+.+|+|||...+. ....+|.|-+ +.++++++.+|+.+|.+...+...+.+..++- |... +.. T Consensus 202 ~~~~~~~~h~~g~vPvv~f~n~~~--~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~-G~~~~~~~~~ 278 (488) T protein:vir:23 202 WEAPTSTPHGLEMVPVIPISNRTR--LSDLYGTSEISPELRSVTDAAAQILMNMQGTANLMAIPQRLIF-GAKPEELGIN 278 (488) T ss_pred eEeccccccCCCCcceEEeccccc--cCCcCCccchhhhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHh-CCCccccccc Confidence 123345567778889988754332 2334555655 46899999999999999999887777655442 2211 100 Q ss_pred -H-HHhhcccCCCceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhcccc-chhHHHHHHHHHH Q lcl|Aclame:pro 397 -D-EWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMG-NETSGRAIIARQR 473 (711) Q Consensus 397 -~-~~~~~~~~~~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~-~~~sg~ai~~~~~ 473 (711) + ...-....++.++...++. +..+..++..+ ...+...+......+-.++|+++..+|..+ |..||.|+..... T Consensus 279 ~~~~~~~~~~~~~~v~~~~~g~--~~~~~q~~~~~-~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~ 355 (488) T protein:vir:23 279 AETGQRMFDAYMARILAFEGGE--GAHAEQFSAAE-LRNFVDALDALDRKAASYSGLPPQYLSSSSDNPASAEAIKAAES 355 (488) T ss_pred ccccchhhhhhhhhhccCCCCC--CceeEecCCCC-hHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHH Confidence 0 0000011233333332221 12232233222 233444444444455556788888888554 5579999988776 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEee Q lcl|Aclame:pro 474 QGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVT 553 (711) Q Consensus 474 ~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~ 553 (711) .-..........|..+++++.++++.+. +. .....++..+ .+ . T Consensus 356 ~l~~k~~~~~~~f~~~l~~~~~l~~~~~----~~--------~~~~~~~~~i-----------------------~v--~ 398 (488) T protein:vir:23 356 RLVKKVERKNKIFGGAWEQAMRLAYKMV----KG--------GDIPTEYYRM-----------------------ET--V 398 (488) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHh----cC--------CCcchhhccc-----------------------eE--E Confidence 6666666666666667777776665432 11 0011111110 00 0 Q ss_pred cccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHHHHH Q lcl|Aclame:pro 554 TGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPE 633 (711) Q Consensus 554 ~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~ 633 (711) =.+..+....+..+.+..+.+.... .+....+++.+++- +.-.+.+++... ++..+...+ T Consensus 399 f~~~~~~s~~~~ada~~kl~~~g~~--~~s~et~~~~l~~~--~d~~~~~~~~~~----------------~~~~~~~~~ 458 (488) T protein:vir:23 399 WRDPSTPTYAAKADAAAKLFANGAG--LIPRERGWVDMGYT--IVEREQMRQWLE----------------QDQKQGLGL 458 (488) T ss_pred ecCCCCCCHHHHHHHHHHHHhcccc--cCCHHHHHHhCCCC--chHHHHHHHHHH----------------HHHHHHHHH Confidence 0111111122333334444332110 01111223333321 111111110000 000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 634 QQVEMAKSQADMAQAEADTAQAQADMLKAQLETEEAQKQLA 674 (711) Q Consensus 634 ~q~~~~~~q~~~~k~qae~~~aqae~~~~q~~~~~~q~q~~ 674 (711) .......++.. .....+... ..... +..+| T Consensus 459 ~~~~~~~~~~~-----~~~~~~~~~----~~~~~--e~~~a 488 (488) T protein:vir:23 459 IGSLYGASTPE-----GKPGEAPVG----EPPAP--EPDAA 488 (488) T ss_pred HHHHhccCCCc-----ccCCCCCCC----CCCCC--CCCCC Confidence 00000000000 000000000 00000 00000 No 92 >protein:vir:105641 Length: 516 # NCBI annotation: putative head-tail connector # Family: family:all:481 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425006;genbank:gi:83571754;uniprot:Q2WC46;genbank:GeneID:3837282 Probab=99.72 E-value=6.6e-15 Score=98.31 Aligned_cols=506 Identities=13% Similarity=0.037 Sum_probs=239.0 Q ss_pred CCcCcc-hHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCCceEehhhHHHHHHHhhhhhh Q lcl|Aclame:pro 21 YAKNND-DDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRPCLVNNVLPTFVDQVLGDQRQ 99 (711) Q Consensus 21 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~v~~i~g~~~~ 99 (711) +..+.+ ...-....+..+|+........|...|.++.+|..-.=+++.--. ++...+.-..-...++.+.+..-. T Consensus 1 ~~~~~~~~~~~~~~~l~~r~~~L~~~R~~~e~~w~e~a~~~lP~~~~~~~~~----~~~~~~~dstg~~a~~~LAa~l~~ 76 (516) T protein:vir:10 1 MKQSTDLEYGGKRSKIPKLWEKFSTKRSSFLDRAKHYSKLTLPYLMNDKGDN----ETSQNGWQGVGAQATNHLANKLAQ 76 (516) T ss_pred CCchhhHhhhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhcccccCCCCCc----ccccccccchHHHHHHHHHHHHHh Confidence 222222 222234578888888888889999999999999854322211000 011111122222223332222111 Q ss_pred -----cccceeEecchhhhhhhhhcccccccccccCCCchhHHHHH---HHHHHHHHHHhhcCHHHHHHHHHHHHHhcCc Q lcl|Aclame:pro 100 -----NRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAE---VFTGLIKNIEYNCDAETEYDIAFQGAVESGM 171 (711) Q Consensus 100 -----~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae---~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~ 171 (711) +++=.++.+.+.. .......+.+-.++.+ .++..+......|+|......++.+.+..|+ T Consensus 77 ~ltpp~~~WF~L~~~d~~------------~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~ 144 (516) T protein:vir:10 77 VLFPAQRSFFRVDLTAQG------------EKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGS 144 (516) T ss_pred hhcCCCCccccccCChhh------------HhhhhccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCe Confidence 1111222221100 0000000011112222 3455566667789999999999999999999 Q ss_pred cEEEEEEeeccCCCCCCcceEEEecCccceeeCCCccccCccccceeeeeecCCHHHHHHhcCCcccchhhccccccccc Q lcl|Aclame:pro 172 GYLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSVADYDT 251 (711) Q Consensus 172 g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~ 251 (711) |+. +.+ +++ . ++.+ +..++++..++.- ...-++++..++..++.+.|++.. ......... T Consensus 145 a~l--~~d---~~~---~--~~~~-pl~~y~v~~d~~G----~v~~ivrr~~~~~~~l~e~~~~~~-~~~~~~~~~---- 204 (516) T protein:vir:10 145 CML--YKP---SKG---A--ISAI-PMHHYVVNRDTNG----DLLDIILLQEKSLRTFDPATRAVV-EVGLKGKKC---- 204 (516) T ss_pred EeE--Eec---CCC---C--eEEE-EcCeEEEeeCCCC----CeEEEeeeecccHHHHHHHhhhhh-hhhhhhhcc---- Confidence 874 333 221 1 3444 5667887655531 123367888999999999986532 111111111 Q ss_pred CCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEEecCceeccCccCCC Q lcl|Aclame:pro 252 WFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGPVEIPS 331 (711) Q Consensus 252 ~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~le~~~p~~~ 331 (711) ...+.+.++.+=++. .++. . .++.-.+...+...+-|+. T Consensus 205 -~~~~~~~i~t~v~~~---------~~~~------------------------------~-~~~~~~d~~~~~~~s~~~~ 243 (516) T protein:vir:10 205 -KEDDSIKLYTHAKYL---------GEGF------------------------------W-ELKQSADDIPVGKVSKIKS 243 (516) T ss_pred -CCCCceEEEEEEEec---------CCCc------------------------------e-EEEEeeCceeecccccccc Confidence 012233332221111 1111 1 1222223333333445667 Q ss_pred CccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHHHHhhcccCCCceEE Q lcl|Aclame:pro 332 TTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNFSLLT 411 (711) Q Consensus 332 ~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~~~~~~~~~~i~ 411 (711) ..+||+++-+ ...++..||.|.+....+--+.+|.+...++.....+.++.++++++.+.+...... ..+|.++ T Consensus 244 ~e~P~~~~Rw--~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~l~~---~~~g~~~- 317 (516) T protein:vir:10 244 EKLPFIPLTW--KRSYGEDWGRPLAEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGAQTDVDHFVN---SGTGEVV- 317 (516) T ss_pred ccCCeeeeee--eecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCCcccCcccccchhhhcc---CCCceee- Confidence 7899997654 446889999999999999999999999999999999999999998888876654321 2223332 Q ss_pred ecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHHHHHHHHHHHHHHHHH-HH Q lcl|Aclame:pro 412 YIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTK-SI 490 (711) Q Consensus 412 ~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~-~~ 490 (711) +|...+..+-......-.+.....++...+.|....=+ +.+.-.++.+.|+.-|..+.+--...+...+..|.. +. T Consensus 318 --~g~~~~v~~~q~~~~~d~~~~~~~i~~~~~rI~~af~~-~~l~~rd~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell 394 (516) T protein:vir:10 318 --TGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVVFMM-ETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFATTMQ 394 (516) T ss_pred --cCCcccceeeecCcccchHHHHHHHHHHHHHHHHHHhh-hhhhccCCccccHHHHHHHHHHHHHHhhhHHHHHHHHHH Confidence 44332222222223333455666777777777665422 222223455678888888887777777777776643 33 Q ss_pred HHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEeecccChHHHHHHHHHHHH Q lcl|Aclame:pro 491 RRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMI 570 (711) Q Consensus 491 ~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~ 570 (711) ..+....+. ..+.. + |+ .+ .++.+.++. ..-.|.+..+.+. T Consensus 395 ~Pli~r~~~-------------~~~p~-------~------P~--------~l----v~~~~v~~i-~~L~raq~~~~i~ 435 (516) T protein:vir:10 395 SPVAMWGLL-------------EAGDS-------F------TS--------DL----VDPVIITGI-EALGRMAELDKLA 435 (516) T ss_pred HHHHHHHHH-------------hhCCC-------C------Ch--------hh----cCcceehhH-HHHHHHHHHHHHH Confidence 333222210 01100 0 00 00 112222222 2233444445555 Q ss_pred HHHhhcchhHHHHHHHHHHhcCCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 571 QFAQAVPSAAAVMADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEA 650 (711) Q Consensus 571 ~l~~~~p~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~~~~k~qa 650 (711) .+++.+..+++. .+ ..++.-+.++..+.+..........--...+..+..+++++.++.+.++. T Consensus 436 ~~~q~i~~~~q~-~p---~v~d~id~d~~~~~~a~~~gvp~~~irs~eev~~~r~~~~~~q~~~~~~~------------ 499 (516) T protein:vir:10 436 NFAQYMSLPLQW-PE---PVLAAVKWPDYMDWVRGQISAELPFLKSAEEMEQEQEAQMQAQQAQMLEE------------ 499 (516) T ss_pred HHHHHHHHHhcC-Ch---HHHhhcCHHHHHHHHHHHhCCChhccCCHHHHHHHHHHHHHHHHHHHHHH------------ Confidence 554443222111 11 12333344444444444433221111111111111111111100000000 Q ss_pred HHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 651 DTAQAQADMLKAQLETEEA 669 (711) Q Consensus 651 e~~~aqae~~~~q~~~~~~ 669 (711) +.+++...-.+.++.++ T Consensus 500 --~~~~~~~~~~~~~~~~~ 516 (516) T protein:vir:10 500 --GVAKAVPGVIQQELKEA 516 (516) T ss_pred --HhhhcccchhhhhhhcC Confidence 00000000011111111 No 93 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=99.72 E-value=3.4e-16 Score=105.38 Aligned_cols=467 Identities=9% Similarity=0.080 Sum_probs=235.1 Q ss_pred HHHHHHHHHHHHHHHhh------------------chHHHHHHHHHHHHhCCC-CCCHHHHHHHHHhC----CCceEehh Q lcl|Aclame:pro 29 RALLATARERARDGATY------------------WKDNWEAAEDDLKFLGGE-QWPSQVRTERELEQ----RPCLVNNV 85 (711) Q Consensus 29 ~~~~~~~~~~~~~~~~~------------------~~~~r~~~~~~~~~y~G~-Qw~~~~~~~~~~~g----~p~~~~N~ 85 (711) =.++.+++++|++.... ..+.........+||.|+ +|-. .....| +.....|+ T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~ri~~~~~~y~g~~~~~~----~~~~~~~~~~~~~~sln~ 76 (508) T protein:vir:15 1 MGLIQRIKDLFWKGAAATGVTGSLSKITDDPRISIDPDEYVRIQTDLDYYSDKLQYIH----YQASDGIKKKRLKNTINM 76 (508) T ss_pred CChHHHHHHHHHHHHHHhccccchHHhhcccccccCHHHHHHHHHHHHHhcCCCcccc----cccCCCCccccceeecch Confidence 22455666666553322 233445567788999986 2211 111112 22356788 Q ss_pred hHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHH Q lcl|Aclame:pro 86 LPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQG 165 (711) Q Consensus 86 i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~ 165 (711) -+.+++...+..-.-.+.+.+. ++. .....+..+++.|++......++++ T Consensus 77 ~~~i~~~~A~lv~~e~~~i~v~--------------------------~~~----~~~e~l~~il~~n~f~~~~~~~~e~ 126 (508) T protein:vir:15 77 AKTAARRIASVVFNEKAEIHVK--------------------------DNN----EADKFLNDVLEDNDFKNKFEEALEK 126 (508) T ss_pred HHHHHHHHHhhhhCCCceEEeC--------------------------Cch----HHHHHHHHHHHhccHHHHHHHHHHH Confidence 8888887777654444444431 111 2234456667789999999999999 Q ss_pred HHhcCccEEEEEEeeccCCCCCCcceEEEecCccceee-CCCccccCccccceeeeeecCCHHHHHHhcCCcccchhhcc Q lcl|Aclame:pro 166 AVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVTI-DPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYED 244 (711) Q Consensus 166 ~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~~-Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~ 244 (711) ++..|.||+++++|. +.++|..| ++..||+ ..+. -+...|-|+......+ . T Consensus 127 a~a~G~~~~k~~~d~-------~~~~i~~v-~ad~~~P~~~d~--~~~~~~af~~~~~~~~----------~-------- 178 (508) T protein:vir:15 127 GVALGGFAMRPYIDG-------NHIKIAWV-RADQFYPLQSNT--NDISEAAIASRTQRTE----------S-------- 178 (508) T ss_pred HhhcCceEEEEEEeC-------CeeEEEEE-cCCeeEEEEEcC--CCeEEEEEEEEEEeec----------C-------- Confidence 999999999999862 46788777 7878773 1111 2344454433332210 0 Q ss_pred cccccccCCCCCeEEEEEeeeeeee-----ceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEEec Q lcl|Aclame:pro 245 SVADYDTWFTEKSVRVSEYFTREPV-----IREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITG 319 (711) Q Consensus 245 ~~~~~~~~~~~~~v~v~E~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g 319 (711) ...+..+..|++++... ...++...+.. .-|..+ ....+.. |.-+.. T Consensus 179 --------~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~---------------~lG~~v-~l~~~~e----~~~l~~ 230 (508) T protein:vir:15 179 --------NQTKYYTLLEFHQWQDNGSYQITNELYKSDSPD---------------IVGNQV-PLSTLPV----YKELAP 230 (508) T ss_pred --------CCceEEEEEEEEEEecCcceEEEEEEEecCCch---------------hcCccc-chhhccc----ccCCCc Confidence 01112233444332110 00111110000 001000 0000000 000000 Q ss_pred CceeccCccCCCCccceEEEEeeeec-----cCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCC Q lcl|Aclame:pro 320 ANVLEGPVEIPSTTIPVIPVWGKSLI-----IKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEG 394 (711) Q Consensus 320 ~~~le~~~p~~~~~~P~vp~~~~~~~-----~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~ 394 (711) .+.+.+ ....||+.| +.| ...+++|.|++..+++.++.+|...|.+.+.+ ..+..++.++++.+.. T Consensus 231 ~~~~~g-----~~~p~f~y~---~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~-~~~~~~i~v~~~~l~~ 301 (508) T protein:vir:15 231 QVTISG-----LQRPLFAYF---KTPGANNINIESPLGLGVVDNAKHVLDDINDTHDQFIWEI-RLGQKHIAVQPGMLRF 301 (508) T ss_pred ceEecC-----CCcceeEEe---cCCccccccCCCCcCCchHhhhHHHHHHHHHHHHHHHHHH-HhcccceeechHHhcC Confidence 000000 111233211 222 23567789999999999999999999999999 5678899998888752 Q ss_pred hHHHHhhcccCCCc--eEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccch-hHHHHHHHH Q lcl|Aclame:pro 395 REDEWEQANTKNFS--LLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNE-TSGRAIIAR 471 (711) Q Consensus 395 ~~~~~~~~~~~~~~--~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~-~sg~ai~~~ 471 (711) -.+- ....+++. +..++.+...+..|+.+++.--...+...++.....+....|++....|.++++ .||.+|... T Consensus 302 d~~~--~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~~~~~~gls~~~f~~~~~~~~TAtei~s~ 379 (508) T protein:vir:15 302 DDEH--KPTFDTEQNVYVGVLSDDNNGLGVKDMTTPIRTVQYKDAIDHFIKEFEVQIGLSTGTFSYSNDGVKTATEVVSN 379 (508) T ss_pred CCCC--ccccCCCCeeEEeccCCCCCCCceeEeecccChHHHHHHHHHHHHHHHHHhCCCchhcccccCccccHHHHHHH Confidence 1110 11122222 222333333334566666554556678888888889999999999999977654 578888888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEE Q lcl|Aclame:pro 472 QRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVV 551 (711) Q Consensus 472 ~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~ 551 (711) .+..-.....+...+..+++++.+.++.+..-++-.. + |.. .... ++....++|+ T Consensus 380 ~~~~~~t~~~~~~~~~~al~~lv~~il~l~~~~~~~~------~--g~~-~~~~----------------~~~~~~~~v~ 434 (508) T protein:vir:15 380 NSMTYQTRSSYLTMVEKAIDELCQSIFELANAGALFD------D--GKP-LFTL----------------DSASQPLDIE 434 (508) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccc------c--ccc-cccc----------------ccccCCcceE Confidence 8777777788888889999999999988865543110 0 000 0000 0001123333 Q ss_pred eecccChHHHHHHHHHHHHHHHhh--cchhHHHHHHHHHHhcCC--cchHHHHHHHHhhhcchhhcchhhhhhhhhHHH Q lcl|Aclame:pro 552 VTTGPAFATQRIEAAEAMIQFAQA--VPSAAAVMADLIAQNMDW--PGADVIAERLKKIVPPNVLSKDEREAIEEDMPE 626 (711) Q Consensus 552 v~~~~~~~s~r~~~~~~L~~l~~~--~p~~~~~~~~~~~~~~~~--~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 626 (711) |+=+.+.....++..+.++++..+ ++.. ..+.+.-.+ +.+++...+++...+.............-.--+ T Consensus 435 v~f~D~i~~d~~~~~~~~~~~v~aGi~s~e-----~~i~~~~g~~deea~~el~ri~~E~~~~~~~~~~~~~~~g~~ge 508 (508) T protein:vir:15 435 CHFDDGVFVNKDKQLEEDAKVLAIGALSKQ-----TFLQRNYGMTDEQAAEELAKIQSEAPTDTFEGGRSAILNGGDGE 508 (508) T ss_pred EEeCCCCCCCHHHHHHHHHHHHhcCCCCHH-----HHHHhcCCCChHHHHHHHHHHHHhccccCccccccccCCCCCCC Confidence 333333222233333333333321 1111 111122122 223333444433332211111100000000000 No 94 >protein:vir:80211 Length: 514 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522882;genbank:gi:158345175;genbank:GeneID:5687474 Probab=99.72 E-value=1.3e-14 Score=96.77 Aligned_cols=498 Identities=11% Similarity=0.060 Sum_probs=242.0 Q ss_pred HHHHHHHHHh--hchHHHHHHHHHHHHhCCC--CCCHHHHHHHHHhCCCceEehhhHHHHHHHhhhhhh-----ccccee Q lcl|Aclame:pro 35 ARERARDGAT--YWKDNWEAAEDDLKFLGGE--QWPSQVRTERELEQRPCLVNNVLPTFVDQVLGDQRQ-----NRPAIK 105 (711) Q Consensus 35 ~~~~~~~~~~--~~~~~r~~~~~~~~~y~G~--Qw~~~~~~~~~~~g~p~~~~N~i~~~v~~i~g~~~~-----~r~~~~ 105 (711) .+.++..++. ....|...|.++.+|.... ..+.+.........++ .-..-...++.+.+..-. +++=.+ T Consensus 1 m~~~~~~l~~k~~R~~~e~~w~e~a~~~lP~~~~~~~~~~~~~~~~~~~--~dstg~~a~~~LAa~l~~~ltpp~~~WF~ 78 (514) T protein:vir:80 1 MRQQASAMWAEYRDSTAIRKAEDFAKFTIASLMVDPLDKTHQAEVVEYD--FQSAGAFLVNNLTAKLALTLFPPGRPSFQ 78 (514) T ss_pred CccchHHHHHHhhcchHHHHHHHHHHHhcccccCCCCCCcccccccccc--cchhHHHHHHHHHHHHHhhhcCCCCcccc Confidence 3333333322 2446888888888887431 1221111111111111 122222233333222111 122223 Q ss_pred EecchhhhhhhhhcccccccccccCCCchhHHHHHH------HHHHHHHHHhhcCHHHHHHHHHHHHHhcCccEEEEEEe Q lcl|Aclame:pro 106 VSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEV------FTGLIKNIEYNCDAETEYDIAFQGAVESGMGYLRVRSD 179 (711) Q Consensus 106 ~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~------l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d 179 (711) +.|.+.. ... ....+.+.+++ ++..+......|+|......++.+.+..|+|++-+ + T Consensus 79 l~~~d~~------------~~~---~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~--~ 141 (514) T protein:vir:80 79 IELDDTL------------QEL---AAANGIDQSELHSRTADLERRATRRLFVNASLSKLHRILKLLVVTGNALFYR--E 141 (514) T ss_pred cccCchh------------hhh---ccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEE--e Confidence 3332100 000 01122222222 34455566678999999999999999999987543 2 Q ss_pred eccCCCCCCcceEEEecCccceeeCCCccccCccccceeeeeecCCHHHHHHhcCCcccchhhcccccccccCCCCCeEE Q lcl|Aclame:pro 180 YLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSVADYDTWFTEKSVR 259 (711) Q Consensus 180 ~~~~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~ 259 (711) ++. . .++.+ +..++++..++.- + ..=++++.+|+..+|-+.|+...... . ......+.|. T Consensus 142 ---~~~--~--~~~~~-pl~~y~v~~d~~G-~---v~~i~rr~~~~~~~l~~~~~~~~~~~----~----~~~~~~~~v~ 201 (514) T protein:vir:80 142 ---PGT--G--KMLVW-TMQSYTVRRTSHG-D---PAVVVLRQQMPFRELTPEIQADAQAK----Q----IAKRDSDKCD 201 (514) T ss_pred ---cCC--C--cEEEE-EcCeEEEeeCCCc-C---eEEEEeeeeecHHHhhhhhhhhhhhh----h----ccCCCCCceE Confidence 221 1 23445 5667777655431 1 22377889999998876665432111 1 0111234566 Q ss_pred EEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEEecCceeccCccCCCCccceEEE Q lcl|Aclame:pro 260 VSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGPVEIPSTTIPVIPV 339 (711) Q Consensus 260 v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~le~~~p~~~~~~P~vp~ 339 (711) |+.+.++.+.. . ..+..+|.-..|.+++ ..+-|++.++||+|+ T Consensus 202 v~~~v~~~~~~-----------------------------------~-~~~~sv~~e~~g~~i~-~es~y~~~e~P~i~~ 244 (514) T protein:vir:80 202 LYTVIEWQPTP-----------------------------------N-GKRCAVWHELEGKRVG-PESSYPAHLCPYVPV 244 (514) T ss_pred EEEEEEeecCC-----------------------------------C-CeEEEEEEeccceeec-ccCccccccCCeeee Confidence 66665544321 0 0112222233445554 235577788999976 Q ss_pred EeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHHHHhhcccCCCceEEecccccCc Q lcl|Aclame:pro 340 WGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNFSLLTYIPQYQGD 419 (711) Q Consensus 340 ~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~~~~~~~~~~i~~~~~~~~~ 419 (711) - +...++..||.|.+....+--+.+|++...++.....+.++.++++.+.+.+...... ..+|.++ +|...+ T Consensus 245 R--w~~~~ge~YGrgp~~~al~D~k~L~~l~~~~l~~~~~a~~~~~~v~~~g~~~~~~l~~---~~~g~~v---~g~~~~ 316 (514) T protein:vir:80 245 A--WNVPDGEHYGRGYVEEYSGDFARLSILSERLGLYEFEALSLLNLVDEAKGGAVDDYRD---AETGDFV---PGQVGS 316 (514) T ss_pred e--eEecCCCCcccchHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceeCcccccchhhhcc---cCCceee---cCCCcc Confidence 4 4446889999999999999999999999999999999999999999888766654321 2233333 333222 Q ss_pred CCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHH Q lcl|Aclame:pro 420 PGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTK-SIRRVGKILV 498 (711) Q Consensus 420 ~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~-~~~~~~~~~l 498 (711) ..+-........+.....++...+.|....=++ ..+.++.+.|+..|..+.+--...+...+..|.. +...+.+..+ T Consensus 317 v~~~~~~~~~d~~~~~~~i~~~~~rI~~aFml~--~~~rd~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~ 394 (514) T protein:vir:80 317 VASYERGDYNKIAQASASVESIVMRLNRAFMYT--GQVRDAERVTVEEIRTVAEEAENLLGGVYSLLAETLQAPLAYLTM 394 (514) T ss_pred ceeeecCcccchHHHHHHHHHHHHHHHHHHhhh--ccCCCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHH Confidence 222112223344555677777777776543111 1224555578999988888877778777777653 4444444444 Q ss_pred HHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEeecccChHHHHHHHHHHHHHHHhhcch Q lcl|Aclame:pro 499 EMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAVPS 578 (711) Q Consensus 499 ~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~~p~ 578 (711) .++.... .| .+-.+- . ++ ..+.+..+ -..-.|.+..+.|..+++.... T Consensus 395 ~il~r~~--------~g-----~lP~~p-------~-------~l----~~~~~vs~-la~l~r~~~~~~l~~~~~~i~~ 442 (514) T protein:vir:80 395 YEASRGN--------GG-----MLLGIA-------Q-------GV----YRPSIITG-IPALTRNIETANILRATQEASA 442 (514) T ss_pred HHHhhhc--------cC-----CCCCCC-------c-------hh----hcceeeec-HHHHHHHHHHHHHHHHHHHHHH Confidence 4433210 00 000000 0 01 11222222 2334455555566665554332 Q ss_pred hHHHHHHHHHHhcCCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 579 AAAVMADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEADTAQAQAD 658 (711) Q Consensus 579 ~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~~~~k~qae~~~aqae 658 (711) +.+. ..+.++.-+.+++.+.+....+.....--..++..+..++..+++++.+++.+ +. .+. +.+++- T Consensus 443 l~~~----~p~v~d~id~d~~~~~~a~~~Gvp~~~i~~~~e~~~~~~~~~~~~~~~~~~~~--~~-----~~~-~~~~~~ 510 (514) T protein:vir:80 443 IVPA----LVQLSKRFDPEKLVERIFANNSVDLSTLSKDPDVVAAEAEQEAALAQQQLDVA--SG-----ALA-AETSAG 510 (514) T ss_pred Hhcc----chhhhhcCCHHHHHHHHHHHhCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHH--HH-----HHH-Hhhhcc Confidence 2222 22345666777777777666554422111111111111110000000000000 00 000 000000 Q ss_pred HHHH Q lcl|Aclame:pro 659 MLKA 662 (711) Q Consensus 659 ~~~~ 662 (711) ..-+ T Consensus 511 ~~~~ 514 (514) T protein:vir:80 511 VLTS 514 (514) T ss_pred ccCC Confidence 0000 No 95 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=99.71 E-value=2.2e-15 Score=100.89 Aligned_cols=455 Identities=10% Similarity=0.035 Sum_probs=223.7 Q ss_pred hHHHHHHHHHHHHHHH--------------HhhchHHHHHHHHHHHHhCCC--CCCHHHHHHHHHhCC----CceEehhh Q lcl|Aclame:pro 27 DDRALLATARERARDG--------------ATYWKDNWEAAEDDLKFLGGE--QWPSQVRTERELEQR----PCLVNNVL 86 (711) Q Consensus 27 ~~~~~~~~~~~~~~~~--------------~~~~~~~r~~~~~~~~~y~G~--Qw~~~~~~~~~~~g~----p~~~~N~i 86 (711) =-+.+...+++++++- .....+.+....+..+||.|. .|.... ....+. ..++.|.- T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~yy~g~~~~~~~~~---~~~~~~~~~~~~~~~n~~ 77 (496) T protein:vir:38 1 MINQIIAGVKGVMRRMGLLKALKDVKDHKKVNANDEDYKYIDMWKRLYQGHYAEWHNLN---YEHNGNPVNRRQLSMNLP 77 (496) T ss_pred ChhHHHHHHHHHHHHhccchhhHHHHhcCCCcCCHHHHHHHHHHHHHhcCCCchhhcch---hccCCCccccceeecchH Confidence 1111222333333321 001122334456778999984 453321 111222 24678988 Q ss_pred HHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHH Q lcl|Aclame:pro 87 PTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGA 166 (711) Q Consensus 87 ~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~ 166 (711) +-+++...++.....+.+.+ +|.+.++. +..+++.++|......++..+ T Consensus 78 k~i~~~~a~~l~~~p~~i~~---------------------------~d~~~~e~----l~~~~~~n~f~~~~~~~~~~a 126 (496) T protein:vir:38 78 KVTAKYMSKLLFNEKVKINI---------------------------DDKAAEEF----VLNVLKTNGFTKNMERYIEYG 126 (496) T ss_pred HHHHHHHhhhhhCCcceEee---------------------------CChHHHHH----HHHHHhccCHHHHHHHHHHHH Confidence 99999988887666555543 34445454 444556789999999999999 Q ss_pred HhcCccEEEEEEeeccCCCCCCcceEEEecCccceee--CCCccccCccccceeeeeecCCHHHHHHhcCCcccchhhcc Q lcl|Aclame:pro 167 VESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVTI--DPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYED 244 (711) Q Consensus 167 ~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~~--Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~ 244 (711) +..|.||+++++|. ++.++++.+ +|..+|+ +. .. ++..+-|+ ..+ + T Consensus 127 ~~~G~~~~~~~~D~------~~~~~i~~v-~~~~~~P~~~~-~~--~~~~~~f~--~~~-~------------------- 174 (496) T protein:vir:38 127 EAMGGFVIKVYHDG------NKNVKVSFA-TADCMYPLSND-SE--NVDECVIA--NSF-H------------------- 174 (496) T ss_pred hhhCcEEEEEEEcC------CCcEEEEEE-cccceEEEEec-CC--cEEEEEEE--EEE-E------------------- Confidence 99999999998863 256788776 7888772 21 11 22333332 111 0 Q ss_pred cccccccCCCCCeEEEEEeeeeeeece----eEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEEecC Q lcl|Aclame:pro 245 SVADYDTWFTEKSVRVSEYFTREPVIR----EIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGA 320 (711) Q Consensus 245 ~~~~~~~~~~~~~v~v~E~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~ 320 (711) .+.+..+.+|+|+...... +++...++... |..+ .. ..+ .. T Consensus 175 --------~~~~~y~~le~h~~~~~~~~I~~~~y~~~~~~~~---------------g~~v-~~-----~~~------~~ 219 (496) T protein:vir:38 175 --------KNNKYYTLLEWNEWQGDVYTVTTELYQSDDPNEL---------------GTKV-SL-----TLL------FD 219 (496) T ss_pred --------eCCeEEEEEEEEEEeCceEEEEEEEEecCCcccc---------------Cccc-cc-----ccc------cc Confidence 0123455566666543221 12222221100 0000 00 000 00 Q ss_pred ceeccCccCC-CCccceEEEEeeee---ccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChH Q lcl|Aclame:pro 321 NVLEGPVEIP-STTIPVIPVWGKSL---IIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGRE 396 (711) Q Consensus 321 ~~le~~~p~~-~~~~P~vp~~~~~~---~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~ 396 (711) . ++....+. ....||+.+ .... ....++.|.|.+..+++.++.+|...|.+.+.+.. ...+++++.+.+.... T Consensus 220 ~-~~~~~~~~~~~~~~f~~~-~~~~~N~~~~~~p~G~Sd~~~~~~lid~ld~~~s~~~~~~~~-~~~~i~v~~~~l~~~~ 296 (496) T protein:vir:38 220 D-IEPVVPLPDFTRPTFIYI-KPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKL-GKKKVLVPSSFVKTAV 296 (496) T ss_pred c-cccceeecCCCcceEEEe-cCCcccccccCCcCCCchHhhHHHHHHHHHHHHHHHHHHHhh-cccceecchHHhhccC Confidence 0 00000111 123344322 1110 12345678899999999999999999999999876 5777888766653221 Q ss_pred HHHhhc----ccCCCce-EEeccccc-CcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccch-hHHHHHH Q lcl|Aclame:pro 397 DEWEQA----NTKNFSL-LTYIPQYQ-GDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNE-TSGRAII 469 (711) Q Consensus 397 ~~~~~~----~~~~~~~-i~~~~~~~-~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~-~sg~ai~ 469 (711) +. ... ...+..+ ........ ....++...+.-........++.....+...+|++..+.|.++++ .||.++. T Consensus 297 ~~-~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~i~~e~~~~~l~~~l~~i~~~~g~~~~~f~~~~~g~~tAtei~ 375 (496) T protein:vir:38 297 NL-DGSTTQYFDSTDEAFFLYQGDQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVV 375 (496) T ss_pred CC-CCccccCCCCccceEEEeecCCCcccccceeeccccCHHHHHHHHHHHHHHHHHhhCCChhhcCCCccccchHHHHH Confidence 10 000 0111111 12221111 122455555443446677888888888989999999999976654 4788887 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeee Q lcl|Aclame:pro 470 ARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYD 549 (711) Q Consensus 470 ~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~d 549 (711) ...+........+...+..+++++++.++.+...+..- .|.... ..+ T Consensus 376 ~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~------~g~~~~---------------------------~~~ 422 (496) T protein:vir:38 376 SEKSETYQTKNSHSQLIEQGIKEMIVSILEVGKFIEAY------SGEVVE---------------------------LDT 422 (496) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh------cCCCCC---------------------------ccc Confidence 76666566666677778888999988888776543210 010000 011 Q ss_pred EEeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcC-C--cchHHHHHHHHhhhcchhhcchhhh-hhhhh Q lcl|Aclame:pro 550 VVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMD-W--PGADVIAERLKKIVPPNVLSKDERE-AIEED 623 (711) Q Consensus 550 v~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~-~--~~~~e~~~~l~~~~~~~~~~~~~~~-~~~~~ 623 (711) +.+.=..+.+....+..+.++++..+ +.+....++.... . +.+++.+++++.......+..+.-. ....+ T Consensus 423 i~v~f~d~i~~d~~~~~~~~~~~~~~----GiiS~et~l~~~~~~~d~ea~~el~ri~~E~~~~~~~~d~~~~~~~~e 496 (496) T protein:vir:38 423 ITVDFDDSIAQDEDTTINRYTNAKNQ----GMIPLKIALQRAWNITEAEADEWAEMLAKEKQAEMPNNDMNGIFGEEE 496 (496) T ss_pred eEEEeCCCCCCCHHHHHHHHHHHHhc----CCCCHHHHHHhcCCCChHHHHHHHHHHHHhhhccCccccccCCCCCCC Confidence 11111111111122233333333211 0011111222221 2 1222333344333222111111000 00000 No 96 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=99.71 E-value=1.1e-15 Score=102.49 Aligned_cols=408 Identities=14% Similarity=0.096 Sum_probs=213.0 Q ss_pred hHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHH----HHHHHHHhCCCceEehhhHHHHHHHhhhhhhccc Q lcl|Aclame:pro 27 DDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQ----VRTERELEQRPCLVNNVLPTFVDQVLGDQRQNRP 102 (711) Q Consensus 27 ~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~----~~~~~~~~g~p~~~~N~i~~~v~~i~g~~~~~r~ 102 (711) =+...+.++...+.. .+..-.+-.+||.|+|-... ....++...+ ++.|..+..|+.+.+-.. T Consensus 1 m~~~~i~~L~~~~~~-------~~~r~~~~~~yy~g~~~~~~~~~~~p~~~~~~~~--~v~nw~~~~Vd~~a~rl~---- 67 (422) T protein:vir:97 1 MNYMGMGYLRRKLAL-------FKTGVDKRYRYYAMDDRDDTRSIVMPNNVREMYR--SVLEWTAKGVDSLADRII---- 67 (422) T ss_pred CChHHHHHHHHHHHH-------HHHHHHHHHHHHhcCCChhhcCccccHHHHHHHH--hhcchhHHHHHHHHhccc---- Confidence 112224444332222 33445567899999876432 2334444444 356888888888765211 Q ss_pred ceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCccEEEEEEeecc Q lcl|Aclame:pro 103 AIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESGMGYLRVRSDYLA 182 (711) Q Consensus 103 ~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~ 182 (711) |.. .+-.|.+ +..+++.|+++...+.++.+++++|++++-|+.+.. T Consensus 68 ---~~G----------------------f~~~d~~--------l~~~w~~N~ld~~~~~~~~~al~~G~sf~~v~~~~~- 113 (422) T protein:vir:97 68 ---FRE----------------------FTNDDFN--------AWEIFKANNPDIFFDTAIQSALIASCCFVYIMPGAE- 113 (422) T ss_pred ---cce----------------------eeCCchh--------HHHHHHhcChHHHHHHHHHHHHHhcceeEEEeeCCC- Confidence 100 0112222 356788899999999999999999999987765421 Q ss_pred CCCCCCcceEEEecCcccee--eCCCccccCccccceeeeeecCCHHHHHHhcCCcccchhhcccccccccCCCCCeEEE Q lcl|Aclame:pro 183 DDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSVADYDTWFTEKSVRV 260 (711) Q Consensus 183 ~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v 260 (711) ++.+.|..+ +|.+++ |||..+.+. + .+.+...+ .+. ..+. T Consensus 114 ----~~~p~i~~~-sp~~~~~i~D~~~~~~~---~--a~~~~~~~---------------------------~~~-~~~~ 155 (422) T protein:vir:97 114 ----DGLPKMQVI-EASKATGILDPTTFLLT---E--GYAILESD---------------------------SNG-NPTL 155 (422) T ss_pred ----CCeeEEEEe-chhhEEEEEeCCCCcce---e--eEEEEEec---------------------------CCC-cEEE Confidence 245666544 776655 777543221 1 11110000 000 0111 Q ss_pred EEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEEecCceeccCccCCCCccceEEEE Q lcl|Aclame:pro 261 SEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGPVEIPSTTIPVIPVW 340 (711) Q Consensus 261 ~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~le~~~p~~~~~~P~vp~~ 340 (711) ..+ +.+..++.+... | ..... |-+.|..|+|||+ T Consensus 156 ~~~------------~~~~~~~~~~~~----------~----------------------~~~~~--~~~~g~vPvv~~~ 189 (422) T protein:vir:97 156 EAY------------FTDKDIWYYPKK----------G----------------------KPYNI--KNPTGHPLLVPII 189 (422) T ss_pred EEE------------EcCceEEEEcCC----------C----------------------ccccc--cCCCCCcceEEec Confidence 111 111111111100 0 00011 3345678999987 Q ss_pred eeeeccCCcccccchH-HHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCC--hHHHHhhcccCCCceEEeccccc Q lcl|Aclame:pro 341 GKSLIIKKKEIFRSII-RHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEG--REDEWEQANTKNFSLLTYIPQYQ 417 (711) Q Consensus 341 ~~~~~~~~~~~~~g~v-~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~--~~~~~~~~~~~~~~~i~~~~~~~ 417 (711) -.+. .....|.|-| +.+++.|+.+|+.++.++-.....+.++..+- |.-.+ ..+.|. ...+.++.+.+... T Consensus 190 n~~~--~~~~~G~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~-G~d~d~~~~~~~~---~~~~~i~~~~~de~ 263 (422) T protein:vir:97 190 HRPD--AVRPFGRSRITKAGMYHQKAAKRTLERAEVTAEFYSFPQKYVL-GMDPDAKPMEKWR---ATVSTLLEISKDED 263 (422) T ss_pred ccCC--CccccCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhhhhc-ccCcccccCchhh---hhhhhhhccCCCCC Confidence 4432 3344555544 78999999999999999999888888876552 32111 111221 23345555543322 Q ss_pred C-cCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccch-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 418 G-DPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNE-TSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGK 495 (711) Q Consensus 418 ~-~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~-~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~ 495 (711) + ...+..++...+. .+...+......+-.+||++...+|..+++ .||.||.+....=........+.|..+.+++++ T Consensus 264 ~~~~~v~q~~~~~l~-~~~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~r 342 (422) T protein:vir:97 264 GDKPTVGQFTTASMA-PFMEHLKMYASLFAGGSGLTLDDLGFPSDNPSSVESIKAAHENLRAAGRKAQRSFSSGFLNVAY 342 (422) T ss_pred CCcceeeecCCCChh-HHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2 2234445544444 345566666666666789999999976654 799999876655555556666666667777777 Q ss_pred HHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEeecc---cChHHHHHHHHHHHHHH Q lcl|Aclame:pro 496 ILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTG---PAFATQRIEAAEAMIQF 572 (711) Q Consensus 496 ~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~---~~~~s~r~~~~~~L~~l 572 (711) +++.+.-..-+ ..+++. ++.+.=. |.......+..+.+..+ T Consensus 343 la~~~~~~~~~-----------~~~~~~-------------------------~~~~~w~p~~~~~~~s~a~~aDa~~Kl 386 (422) T protein:vir:97 343 IAVCLRDEFPY-----------LRNQFM-------------------------DTVIKWEPLFEADANMLTLVGDGAIKL 386 (422) T ss_pred HHHHHhcCCcc-----------cchhhc-------------------------cceEEEccCCCCChHHHHHHHHHHHHH Confidence 66544322110 011111 1111101 11222234455666666 Q ss_pred HhhcchhHHHHHHHHHHhcCCcchHHHHHHHHhhhcch Q lcl|Aclame:pro 573 AQAVPSAAAVMADLIAQNMDWPGADVIAERLKKIVPPN 610 (711) Q Consensus 573 ~~~~p~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~ 610 (711) +++.|.+.. ...+++.+++...+.-..++.+....- T Consensus 387 ~~a~~~~~~--~~~~~~~lg~~~~~~~~~~~~~~~~d~ 422 (422) T protein:vir:97 387 NQAIPGFMD--ADVIRDLTGVKGADKPIPAITEVTTDG 422 (422) T ss_pred Hhhcccccc--HHHHHHHcCCCchhHHHHHHHhhhccC Confidence 666553321 223456667765555444443332211 No 97 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=99.71 E-value=1.1e-15 Score=102.51 Aligned_cols=398 Identities=15% Similarity=0.079 Sum_probs=211.0 Q ss_pred hHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHH----HHHHHHHhCCCceEehhhHHHHHHHhhhhhhccc Q lcl|Aclame:pro 27 DDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQ----VRTERELEQRPCLVNNVLPTFVDQVLGDQRQNRP 102 (711) Q Consensus 27 ~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~----~~~~~~~~g~p~~~~N~i~~~v~~i~g~~~~~r~ 102 (711) =+.+.|.++...+.. .+..-.+-.+||+|+|.... ....+....+ ++.|..+.+|+.+.+... T Consensus 1 ~~~~~i~~L~~~~~~-------~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~--~v~nw~~~iVds~a~rl~---- 67 (409) T protein:vir:94 1 MTEKGIGYLRFKLSV-------HKRRAEMRYDQYAMKYVDRFKGITIPQALSQQYR--SILGWCAKGVDSLADRLV---- 67 (409) T ss_pred CCHHHHHHHHHHHHH-------HhHHHHHHHHHhcccCchhhcChhhhHHHHHHHh--hhcchhHHHHHHhHhhcc---- Confidence 223345555544322 23334455789999986432 3333433333 467999999998765321 Q ss_pred ceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCccEEEEEEeecc Q lcl|Aclame:pro 103 AIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESGMGYLRVRSDYLA 182 (711) Q Consensus 103 ~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~ 182 (711) |. +....|. -+..+++.|+++...+.+..+++++|++++-|+-+ T Consensus 68 ---~~----------------------Gf~~~d~--------~l~~i~~~N~ld~~~~~~~~~aliyG~sf~~v~~~--- 111 (409) T protein:vir:94 68 ---FR----------------------EFENDDF--------TVNEIFEENNPDIFFDSAVLSSLIASCSFTYISKG--- 111 (409) T ss_pred ---cC----------------------cccCCch--------HHHHHHHhcChhHHHHHHHHHHHHhcceeEEEecC--- Confidence 10 0111222 24677899999999999999999999999877543 Q ss_pred CCCCCCcceEEEecCcccee--eCCCccccCccccceeeeeecCCHHHHHHhcCCcccchhhcccccccccCCCCCeEEE Q lcl|Aclame:pro 183 DDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSVADYDTWFTEKSVRV 260 (711) Q Consensus 183 ~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v 260 (711) + +++++|..+ +|.+++ |||..+.+- + ..+.|-+ +. ....+ . T Consensus 112 ~---dg~~~i~~~-sp~~~~~i~D~~~~~~~-----~-a~~~~~~---------d~-----------------~~~~~-~ 154 (409) T protein:vir:94 112 E---NDAVRLQVI-EAVNATGIIDPITGLLT-----E-GYAVLER---------DE-----------------NNNVV-L 154 (409) T ss_pred C---CCceEEEEe-ccceEEEEEecCCCcee-----e-eEEEEEe---------cC-----------------CCceE-E Confidence 2 356777655 676544 777443221 1 1111100 00 00011 1 Q ss_pred EEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEEecCceeccCccCCCCccceEEEE Q lcl|Aclame:pro 261 SEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGPVEIPSTTIPVIPVW 340 (711) Q Consensus 261 ~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~le~~~p~~~~~~P~vp~~ 340 (711) ..+|. .+.++.+.. .+...... |.+.|..|+|||+ T Consensus 155 ~~~~~------------~~~~~~~~~-------------------------------~~~~~~~~--~n~~g~vPvV~f~ 189 (409) T protein:vir:94 155 EAHFL------------PDRTDYYYR-------------------------------DSRNNISI--ANPTGHPLLVPII 189 (409) T ss_pred EEEEe------------cCcEEEEEe-------------------------------cCceeEee--eCCCCCcceEEec Confidence 11111 111111100 00001112 3356788999986 Q ss_pred eeeeccCCcccccchH-HHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCC--hHHHHhhcccCCCceEEeccccc Q lcl|Aclame:pro 341 GKSLIIKKKEIFRSII-RHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEG--REDEWEQANTKNFSLLTYIPQYQ 417 (711) Q Consensus 341 ~~~~~~~~~~~~~g~v-~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~--~~~~~~~~~~~~~~~i~~~~~~~ 417 (711) -.+. .+...|.|-| +.+++.|+.+|+.++.++......+.|+..+- |.-.+ ..+.|. ..++.++.+..... T Consensus 190 n~~~--~~~~~G~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~-G~d~d~~~~~~~~---~~~~~i~~~~~d~d 263 (409) T protein:vir:94 190 HRPD--AVRPFGRSRITRSGMYWQSNAKRTLERADVTAEFYSFPQKYVT-GLSDDAEPMETWK---ATVSSMLQFTKDED 263 (409) T ss_pred cccc--cccccCccccchhHHHHHHHHHHHHHHHHHHHHHhcChhheeE-ecCCCCcccchhh---hhHHHhhcCCCCCC Confidence 4432 3344555544 78999999999999999999888888866552 22111 112232 22344555533222 Q ss_pred -CcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhcccc-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 418 -GDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMG-NETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGK 495 (711) Q Consensus 418 -~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~-~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~ 495 (711) ....+..++...+. .+...+......+-.+||+++..+|..+ |..||.|+.+....-........+.|..+.+++++ T Consensus 264 g~~~~v~q~~~~~l~-~~~~~l~~~~~~~a~~t~lP~~~lg~~~~NpsSa~Al~a~~~~L~~~a~~k~~~fg~~~~~~~r 342 (409) T protein:vir:94 264 GDKPTLGQFTQPSMS-PFTEQLRTAAAGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAY 342 (409) T ss_pred CCCceEEecCCCChh-HHHHHHHHHHHHHhhhcCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 12334445555443 3456666666666677899999999655 45899999876555444555555556667777777 Q ss_pred HHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEeecccChHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 496 ILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQA 575 (711) Q Consensus 496 ~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~ 575 (711) +.+.+.-..-.. .+++..+ ++... ..-+.......+..+.+..+.+. T Consensus 343 la~~i~~~~~~~-----------~~~~~~~---------------------~v~W~-p~~~~~~~~~a~~aDa~~Kl~~a 389 (409) T protein:vir:94 343 LAACLRDDAPYL-----------REQFRKT---------------------KPKWE-PLFEADASMLSLIGDGAIKLNQA 389 (409) T ss_pred HHHHHhCCCCcc-----------ccccccc---------------------eEEec-cCCCcchHHHHHHHHHHHHHHHh Confidence 666553322100 1111110 00000 00011122234555667777766 Q ss_pred cchhHHHHHHHHHHhcCCcchH Q lcl|Aclame:pro 576 VPSAAAVMADLIAQNMDWPGAD 597 (711) Q Consensus 576 ~p~~~~~~~~~~~~~~~~~~~~ 597 (711) .|.+.. -..+++.+++...+ T Consensus 390 g~~~~~--~~~~~~~lG~~~~d 409 (409) T protein:vir:94 390 IPEFIN--KDTIRDLTGIEGGE 409 (409) T ss_pred cccccc--hhHHHHHcCCCCCC Confidence 543321 12345667776665 No 98 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=99.70 E-value=1e-16 Score=108.27 Aligned_cols=468 Identities=11% Similarity=0.026 Sum_probs=204.1 Q ss_pred CCCCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHh-CCCceEeh Q lcl|Aclame:pro 6 KKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELE-QRPCLVNN 84 (711) Q Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~-g~p~~~~N 84 (711) -.-+|+. ..+.++...++.++...+.. .+..-.+-.+||+|+++........... ..-.++.| T Consensus 1 ~~~~i~~---------~~~~~~~~~~~~~l~~~~~~-------~~~r~~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n 64 (485) T protein:vir:10 1 MTAPLPG---------QEEIEDPAIARDEMVSAFED-------STQNLKTNTSYYEAERRPEAIGVTVPIQMQSLLAHVG 64 (485) T ss_pred CCCCCCC---------CCCCCCHHHHHHHHHHHHHH-------HHHHHHHHHHHHhcCCcchhcCCCCChhhhhhhhhcC Confidence 1122222 11233444555555543322 3344556689999998753321110000 01124569 Q ss_pred hhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHH Q lcl|Aclame:pro 85 VLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQ 164 (711) Q Consensus 85 ~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~ 164 (711) ..+.+|+.+++...-+ -|+. +++.+..+ .+..++..|+++.....+.. T Consensus 65 ~~~~ivd~~~~~l~~~----g~~~------------------------~~~~~~~~----~~~~i~~~N~~d~~~~~~~~ 112 (485) T protein:vir:10 65 YPRLYVDSIAERQAVE----GFRF------------------------GDADEADE----ELWQWWQANNLDIEAPLGYT 112 (485) T ss_pred cHHHHHHHHHhhhccc----ceec------------------------CCCchhHH----HHHHHHHhcCHhHHHHHHHH Confidence 9999999887754211 1111 22333333 34556788999999999999 Q ss_pred HHHhcCccEEEEEEeeccCC--CCCCcceEEEecCcccee--eCCCccccCccccceeeeeecCCHHHHHHhcCCcccch Q lcl|Aclame:pro 165 GAVESGMGYLRVRSDYLADD--SFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEP 240 (711) Q Consensus 165 ~~~~~G~g~~~v~~d~~~~~--~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~ 240 (711) +++++|++|+-|+.+....+ ..++.++|..+ +|.+++ |||....+ .+.+++.+ +. T Consensus 113 ~a~i~G~ay~~v~~~e~~~~~~~~~~~~~i~~~-~p~~~~~~~D~~~~~~-----~~~~~~~~-----------~~---- 171 (485) T protein:vir:10 113 DAYVHGRSYITISRPDPQIDLGWDPNTPIIRVE-PPTRMYAEIDPRIGRV-----SKAIRVAY-----------DA---- 171 (485) T ss_pred HHhhcCceEEEEeeCCcccccccCCCeeEEEEE-ccceeEEEEcCCCCce-----eEEEEEEE-----------ee---- Confidence 99999999998877643222 12456777666 787764 77744321 12222111 00 Q ss_pred hhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEEecC Q lcl|Aclame:pro 241 VYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGA 320 (711) Q Consensus 241 ~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~ 320 (711) ..+.+..+++|.... ++.+. ...|. T Consensus 172 -------------~~~~~~~~~~y~~~~------------~~~~~------------------------------~~~~~ 196 (485) T protein:vir:10 172 -------------EGNEIQAATLYTPND------------IFGWY------------------------------RVENE 196 (485) T ss_pred -------------CCCeEEEEEEEeCCe------------EEEEE------------------------------EcCCc Confidence 012233334443221 11110 00111 Q ss_pred ceeccCccCCCCccceEEEEeeeeccCCcccccchHH-HhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCCh---H Q lcl|Aclame:pro 321 NVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIR-HSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGR---E 396 (711) Q Consensus 321 ~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~-~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~---~ 396 (711) .......|.+.+.+|+|+|.-... .+..+|.|-+. .++++++.+|+.+|.+...+...+.+..++.-....+. + T Consensus 197 ~~~~~~~~~~~g~vPvv~~~n~~~--~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~ 274 (485) T protein:vir:10 197 WQEWFNNPHGLGVVPVVPIPNRTR--LSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDP 274 (485) T ss_pred eEEeccccCCCCcccEEEeccccc--cCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchHHHHhcCCcccccccc Confidence 112234566677888888754332 23344555554 68999999999999999998888777655431111111 0 Q ss_pred H-HHhhcccCCCceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHH---hCCCHHHhcccc-chhHHHHHHHH Q lcl|Aclame:pro 397 D-EWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKST---MGMYDASLGAMG-NETSGRAIIAR 471 (711) Q Consensus 397 ~-~~~~~~~~~~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~---tGv~~~~~G~~~-~~~sg~ai~~~ 471 (711) + -..-....++.++... + .+..+..++..+ +..+++.....++.+ +++++..+|..+ |..||.|+... T Consensus 275 ~~~~~~~~~~~~~i~~~~-~--~d~k~~q~~~~~----~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al~~~ 347 (485) T protein:vir:10 275 ETGQTLFDAYLARILAFE-D--AEGKIQQFSAAE----LANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAA 347 (485) T ss_pred cccchhhhhcccceeccC-C--CCceEEeecccc----hHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHH Confidence 0 0000112234443332 2 122333332222 233444445555554 777888888554 55799999887 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEE Q lcl|Aclame:pro 472 QRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVV 551 (711) Q Consensus 472 ~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~ 551 (711) ...-..........|..+++++.++++.+...-. ...++. ++. T Consensus 348 ~~~l~~k~~~k~~~f~~~l~~~~~l~~~~~~~~~------------~~~~~~-------------------------~i~ 390 (485) T protein:vir:10 348 ESRLIKKVERKNSIFGGAWEEAMRLAYRMMKGGD------------VPPDML-------------------------RME 390 (485) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC------------Ccccce-------------------------eee Confidence 6666666666666666666666665544321100 000110 011 Q ss_pred eecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHHH Q lcl|Aclame:pro 552 VTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPT 631 (711) Q Consensus 552 v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~ 631 (711) +.=.+..+....+..+.+..+.+....+ +....+++.+++.. .-.+.++..... + .. T Consensus 391 v~w~~~~~~~~~~~ada~~kl~~ag~~~--~s~et~~~~lg~~~--~~~~~~~~~~ee-----------------~-~~- 447 (485) T protein:vir:10 391 TVWRDPSTPTYAAKADAASKLYNGGTGV--IPRERARKDMGYSI--AEREEMRRWDEE-----------------E-AA- 447 (485) T ss_pred EEecCCCCCCHHHHHHHHHHHHhccccC--CCHHHHHHhCCCCH--hHHHHHHHHHHH-----------------H-HH- Confidence 1101111111222333333333221100 01111223333321 111111111000 0 00 Q ss_pred HHHHHHHHHHHHHHHHH--H-HHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 632 PEQQVEMAKSQADMAQA--E-ADTAQAQADMLKAQLETE 667 (711) Q Consensus 632 ~~~q~~~~~~q~~~~k~--q-ae~~~aqae~~~~q~~~~ 667 (711) +....+..+-...... + ...+.........-.+.+ T Consensus 448 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 485 (485) T protein:vir:10 448 -MGLGLIGTMVDPNPTVPGSPSPAPAPKPAALESGGDAA 485 (485) T ss_pred -HHHHHHHHhhccCCCCCCCCCccccccCcCCCCCCCCC Confidence 0000000000000000 0 000000000000000000 No 99 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=99.70 E-value=5.7e-16 Score=104.14 Aligned_cols=494 Identities=12% Similarity=0.059 Sum_probs=225.4 Q ss_pred CCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHh-hc-hHHHHHHHHHHHHhCCCCCCHHHHH-----------HHHHh Q lcl|Aclame:pro 10 VEQLYAKKAKVYAKNNDDDRALLATARERARDGAT-YW-KDNWEAAEDDLKFLGGEQWPSQVRT-----------ERELE 76 (711) Q Consensus 10 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~-~~~r~~~~~~~~~y~G~Qw~~~~~~-----------~~~~~ 76 (711) +.+ .--.++ +..+...|.+... +. ...+....+..+||.|++ ..+. ..+.. T Consensus 1 ~~~----~~~~~~---------~~~~~~~~~~~i~~~~~~~~~~~~~~~~~YY~g~h---~Il~r~~~~~~~~~~~~~d~ 64 (537) T protein:vir:78 1 MTS----PLLNKP---------IDQLGGLLNTEITTYMASNHIKWAHIGENYYNQEN---DIEKSRIFYMNDKGQLREDN 64 (537) T ss_pred CCc----cccccc---------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccc---hhhhcccccccccccccccc Confidence 111 111111 1111222222221 11 234566778899999985 1111 11222 Q ss_pred CCC--ceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcC Q lcl|Aclame:pro 77 QRP--CLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCD 154 (711) Q Consensus 77 g~p--~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~ 154 (711) .+| .+.+|..+.+|+..+|+.-.+.+.+.. . +.++.+..+.++ ...+ ++ T Consensus 65 ~~~nnki~~nf~k~Ivd~~~~yl~G~Pv~~~~--~----------------------d~~~~e~~~~l~----~~~~-~~ 115 (537) T protein:vir:78 65 YASNVKISHGFFTELVDQLAQYLLSNGVEVKV--K----------------------DEDNTQLDEILQ----EYFD-ED 115 (537) T ss_pred cccccccccchHHHHHHHHhhhhcccCceeec--C----------------------cchhHHHHHHHH----HHhh-cc Confidence 233 488999999999999998777555432 1 233334434333 3333 67 Q ss_pred HHHHHHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCcccee--eCCCccccCccccceeeeeecCCHHHHHHh Q lcl|Aclame:pro 155 AETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDTMSKEKFKAL 232 (711) Q Consensus 155 ~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~ 232 (711) +.........++.++|.+|..++++. + +++++..+ +|.++| ||.. . +...+++.......+.. T Consensus 116 ~~~~~~el~~~~s~~G~ay~~~y~de---~---~~~~~~~i-~p~~~~pv~d~~-~-----~~~~~~~~y~~~~~~~~-- 180 (537) T protein:vir:78 116 FQATIDTLVTNASKKGFEGIFARTTS---E---GKLKFQTV-DGLTLIPVFDDY-G-----VLKMIIRWYSEIRYSTK-- 180 (537) T ss_pred HHHHHHHHHHHHhhcCeeEEEeeecC---C---CceEEEEE-ccceeEEEEcCC-C-----CceeEEEEEeeeecccc-- Confidence 88888899999999999998877653 2 56788777 788865 5532 1 12222222211111000 Q ss_pred cCCcccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEE Q lcl|Aclame:pro 233 YPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKT 312 (711) Q Consensus 233 ~p~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v 312 (711) ..+.+.+..+|+|....... +....+........... ........+ T Consensus 181 -------------------~~~~~~~~~~evyt~~~i~~--y~~~~~~~~~~~~~~~~-------------~~~~~i~~~ 226 (537) T protein:vir:78 181 -------------------QQSTETIWHADVWNEEAVCY--YIQDDEGVSTTYKLDEA-------------YNPNPAPHV 226 (537) T ss_pred -------------------ccCcceEEEEEEEcCCcEEE--EEecCCccccccccccc-------------cccccccee Confidence 01223445556665443321 11122211110000000 000000001 Q ss_pred EE-----EEEecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEe Q lcl|Aclame:pro 313 YW-----RKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIG 387 (711) Q Consensus 313 ~~-----~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~ 387 (711) ++ ....+........|.+.|.+||++|. +...+.|.+..++++++.+|.+.|.+.+.+...+++.+++ T Consensus 227 ~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~f~-------nn~~~~sd~e~v~~LiDayd~~~S~~an~~~~~~~~ilvi 299 (537) T protein:vir:78 227 LAIEESTDADFEDTDGYQVLGRSYSKFPFQLLY-------NNKDGMSDVKRVKSIIDDYDVMNCFLSNNLQDFSEAIYVV 299 (537) T ss_pred eeccccccccccccccccccccCCcceeEEEec-------cCccCCCchhhhHHHHHHHHHHHHhhhhHHHHhcCceeee Confidence 10 00111122223445566777776543 2345679999999999999999999999999999888777 Q ss_pred cccccCChHHHHhhcccCCCceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHH Q lcl|Aclame:pro 388 SEGNVEGREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRA 467 (711) Q Consensus 388 ~~~av~~~~~~~~~~~~~~~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~a 467 (711) .-..+++..+..... +..+++.+.. .++.+.++..+.-..+....++...+.|-..|.+.+.... ..++.||+| T Consensus 300 ~g~~~~~~~~~~~~l--~~~~~i~v~~---d~~~v~~l~~~~~~~~~e~~ld~L~~~I~~~s~~~~~~~~-~~gn~SGvA 373 (537) T protein:vir:78 300 KGFSGDSTDKLRQNI--KAKKMIGVNG---DNAGMEIQTVSIPYEARKAKMDIDVENIYRSGMGFNSTAV-GDGNVTNVV 373 (537) T ss_pred ecCCCccchhHHHHH--hhcCceeecC---CCCceeEEEecCCHHHHHHHHHHHHHHHHHhcCCCCCccc-cccCCcHHH Confidence 544444433333322 2234455532 1234677777777788888999999999988866554432 234579999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhhee Q lcl|Aclame:pro 468 IIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQK 547 (711) Q Consensus 468 i~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~ 547 (711) +..+-..........-.-|..+++++.++++.++.... . +..++. T Consensus 374 lk~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~~~~~~----------~-~~~d~~------------------------ 418 (537) T protein:vir:78 374 IKSRYTLLAMKARKMETSLRKVLRWCADMVVSDIALRG----------L-GEYDSN------------------------ 418 (537) T ss_pred HHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC----------C-cccccc------------------------ Confidence 98886666665555555566666666666655543221 0 000100 Q ss_pred eeEEeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHH Q lcl|Aclame:pro 548 YDVVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQ 627 (711) Q Consensus 548 ~dv~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 627 (711) +|.+.=.+..+.-..+..+.+..+.+. +.+....++..+++-.-.+..+......... ... T Consensus 419 -~i~i~f~~~~P~n~~e~a~~~~~l~~~----giiS~eT~l~~~p~vdd~e~ek~~~ee~~~~-------------~~~- 479 (537) T protein:vir:78 419 -DICFEIEPHVLANELDIATTRKTEAET----EALKIGNIMTVAPRIGDDETLKLIAEELDLD-------------YNE- 479 (537) T ss_pred -eeeEEeccCCCCCHHHHHHHHHHHHhc----CcchHHHHHHhCCCCCCHHHHHHHHHHHHhh-------------hhh- Confidence 111111122221112222222222211 1112222333333321111111111100000 000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHH----------------HHHH Q lcl|Aclame:pro 628 TEPTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQ-----LETE----------------EAQK 671 (711) Q Consensus 628 q~~~~~~q~~~~~~q~~~~k~qae~~~aqae~~~~q-----~~~~----------------~~q~ 671 (711) ......+.+++........+ ...+-...+ .+-. --+. T Consensus 480 ------~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~ 537 (537) T protein:vir:78 480 ------LKDALAEQDAQSLDVSPDVQ-AMLDGLPVNANQPPVDPNQPVADPNVVPPTDPNAVPQT 537 (537) T ss_pred ------hhhhhhhhcccccCcCcchh-hhcCCCCCCCCCCCCCccCCCCCCCCCCCCCCccCCCC Confidence 00000000000000000000 000000000 0000 0000 No 100 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=99.69 E-value=7.5e-16 Score=103.51 Aligned_cols=467 Identities=11% Similarity=0.012 Sum_probs=207.5 Q ss_pred CCCCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCC----CCHHHHHHHHHhCCCce Q lcl|Aclame:pro 6 KKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQ----WPSQVRTERELEQRPCL 81 (711) Q Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Q----w~~~~~~~~~~~g~p~~ 81 (711) ..++|+.+.+.+ ....++..+...+.. .+.+..+-.+||+|++ ++......++ .-.+ T Consensus 1 ~~~~~~~~~e~~---------~~~~~~~~l~~~~~~-------~~~r~~~l~~YY~G~~~i~~~~~~~~~~~~---~~~~ 61 (486) T protein:vir:42 1 MTAPLPGMEEIE---------DPAVVREEMISAFED-------ASKDLASNTSYYDAERRPEAIGVTVPREMQ---QLLA 61 (486) T ss_pred CCCCCCCCCCcc---------cHHHHHHHHHHHHHH-------HHHHHHHHHHHhcccCcchhcccccchhHh---hhhh Confidence 788888875544 333456565554433 2333344578999986 1111111111 1134 Q ss_pred EehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHH Q lcl|Aclame:pro 82 VNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDI 161 (711) Q Consensus 82 ~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~ 161 (711) +.|..+-+|+..++...-+ -|+. +++.... ..+..+++.|+++..... T Consensus 62 v~n~~~~iVd~~~~~l~~~----g~~~------------------------~~~~~~~----~~~~~i~~~N~~d~~~~~ 109 (486) T protein:vir:42 62 HVGYPRLYVDSVAERQAVE----GFRL------------------------GDADEAD----EELWQWWQANNLDIEAPL 109 (486) T ss_pred ccchHHHHHHHHHhhhccc----ceec------------------------CCCchhH----HHHHHHHHhcChhHHHHH Confidence 6799999998887754211 1111 1222222 224556778999999999 Q ss_pred HHHHHHhcCccEEEEEEeeccCC--CCCCcceEEEecCcccee--eCCCccccCccccceeeeeecCCHHHHHHhcCCcc Q lcl|Aclame:pro 162 AFQGAVESGMGYLRVRSDYLADD--SFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDAT 237 (711) Q Consensus 162 a~~~~~~~G~g~~~v~~d~~~~~--~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~ 237 (711) +..+++++|++|.-|+.+..... ..++.++|..+ +|.+++ |||.... -.++.+.+-+ . T Consensus 110 ~~~~a~~~G~ay~~v~~~e~~~~~~~~~~~~~i~~~-~p~~~~~i~d~~~~~------~~~~~~~~~~----------~- 171 (486) T protein:vir:42 110 GYTDAYVHGRSFITISKPDPQLDLGWDQNVPIIRVE-PPTRMHAEIDPRINR------VSKAIRVAYD----------K- 171 (486) T ss_pred HHHHHhhcCceEEEEecCCcccccccCCCeeEEEEe-cccceEEEEeCCCCC------eEEEEEEEEe----------c- Confidence 99999999999988876543221 22355666655 787765 7764322 1122222210 0 Q ss_pred cchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEE Q lcl|Aclame:pro 238 AEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKI 317 (711) Q Consensus 238 ~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~ 317 (711) ..+.+...++|.... ++++.. . T Consensus 172 ----------------~~~~~~~~~~y~~~~------------~~~~~~------------------------------~ 193 (486) T protein:vir:42 172 ----------------EGNEIQAATLYTPME------------TIGWFR------------------------------A 193 (486) T ss_pred ----------------CCCeEEEEEEEcCCc------------EEEEEe------------------------------c Confidence 012333444443221 111100 0 Q ss_pred ecCceeccCccCCCCccceEEEEeeeeccCCcccccchHH-HhhHHHHHHHHHHHHHHHHHHhcCCCceEeccccc---- Q lcl|Aclame:pro 318 TGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIR-HSKDAQRMANYWDSAATETVALAPKAPFIGSEGNV---- 392 (711) Q Consensus 318 ~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~-~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av---- 392 (711) .|.-.+++..|.+.+.+|+|||.-.+ ..+..+|.|-+. .++++++.+|+.+|.+.......+.+..++. |.- T Consensus 194 ~~~~~~~~~~~h~~g~vPvv~~~n~~--~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~e~~a~p~~~i~-G~~~~~~ 270 (486) T protein:vir:42 194 DGEWAEWFNVPHGLGVVPVVPLPNRT--RLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIF-GIKPEEI 270 (486) T ss_pred CCcEEeecceecCCCCceEEEecccc--ccCCCCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchHHHhh-cCCcccc Confidence 11111223345566778888764322 223445667676 5889999999999999998887777665543 221 Q ss_pred CChHHHHhh-cccCCCceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHH---hCCCHHHhcccc-chhHHHH Q lcl|Aclame:pro 393 EGREDEWEQ-ANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKST---MGMYDASLGAMG-NETSGRA 467 (711) Q Consensus 393 ~~~~~~~~~-~~~~~~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~---tGv~~~~~G~~~-~~~sg~a 467 (711) ...++-... ....++.++.. ++ .+..+..++.. .+..+++.....|..+ +++++..+|..+ |..||.| T Consensus 271 ~~~~~~~~~~~~~~~~~~~~~-~~--~~~~~~q~~~~----~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~A 343 (486) T protein:vir:42 271 GVDSETGQTLFDAYLARILAF-ED--AEGKIQQFSAA----ELANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEA 343 (486) T ss_pred ccccccccchhhhhhchhccc-CC--CCceEEeeccc----CHHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHH Confidence 100000000 01123333322 11 12223222221 2334455555555554 788888888554 5579999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhhee Q lcl|Aclame:pro 468 IIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQK 547 (711) Q Consensus 468 i~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~ 547 (711) +......-..........|..+++++.++++.+...- ....++.. T Consensus 344 l~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~~~------------~~~~d~~~----------------------- 388 (486) T protein:vir:42 344 IRAAESRLIKKVERKNLMFGGAWEEAMRIAYRIMKGG------------DVPPDMLR----------------------- 388 (486) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC------------Ccccccee----------------------- Confidence 9887766666666666777777777776655432100 00011111 Q ss_pred eeEEeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHH Q lcl|Aclame:pro 548 YDVVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQ 627 (711) Q Consensus 548 ~dv~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 627 (711) +.+.=.+..+....+..+.+..+.+....+ +....+++.+++- +...+.+++........ T Consensus 389 --i~v~w~~~~~~s~~~~ad~~~kl~~~~~g~--~s~et~~~~lg~~--~d~~~e~~~~~~e~~~~-------------- 448 (486) T protein:vir:42 389 --METVWRDPSTPTYAAKADAATKLYGNGQGV--IPRERARIDMGYS--VKEREEMRRWDEEEAAM-------------- 448 (486) T ss_pred --eeEEecCCCCCCHHHHHHHHHHHHhcccCC--CCHHHHHhcCCCC--hhHHHHHHHHHHHHHHH-------------- Confidence 101101111111223333344443321110 0111222333331 11111111110000000 Q ss_pred HHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 628 TEPTPEQQVE-MAKSQADMAQAEADTAQAQADMLKAQLET 666 (711) Q Consensus 628 q~~~~~~q~~-~~~~q~~~~k~qae~~~aqae~~~~q~~~ 666 (711) ........ ......+...+..+...-+.....++.+. T Consensus 449 --~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 486 (486) T protein:vir:42 449 --GLGLLGTMVDADPTVPGSPSPTAPPKPQPAIESSGGDA 486 (486) T ss_pred --HHHHHHHhhcCCCCCCCCCCCCCCCCCCcccCCCCCCC Confidence 00000000 00000000000000000000000000000 No 101 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=99.69 E-value=3.5e-15 Score=99.85 Aligned_cols=464 Identities=11% Similarity=0.048 Sum_probs=229.4 Q ss_pred HHHHHHHHHHHHHH------------------HhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCC----ceEehhh Q lcl|Aclame:pro 29 RALLATARERARDG------------------ATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRP----CLVNNVL 86 (711) Q Consensus 29 ~~~~~~~~~~~~~~------------------~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p----~~~~N~i 86 (711) =.++.+++++|++- .....+.+....+..+||.|+.+. .......|++ ....|+- T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~i~d~~~i~~~~~~~~~i~~~~~~Y~g~~~~---l~~~~~~~~~~~~~~~slnl~ 77 (505) T protein:vir:79 1 MAFWDTLKNLFRKGSAAVGMTKSLGQIIDDPRINLPADEVERIARDKRYYMDDFKQ---VTHKNSYGDTQKHELQSVNVT 77 (505) T ss_pred CchHHHHHHHHHHhhhhhcchhhhhhhhcccCCCCCHHHHHHHHHHHHHhcCCCcc---ccccccCCCccccceeecchH Confidence 11233333333331 111223344455667888886331 1111122332 3566887 Q ss_pred HHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHH Q lcl|Aclame:pro 87 PTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGA 166 (711) Q Consensus 87 ~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~ 166 (711) +.+++...+..-...+.+.+ +|.+.+ ..+..+.+.|+|......+++.+ T Consensus 78 ~~i~~~~A~ll~~e~~~i~~---------------------------~d~~~~----e~l~~i~~~n~f~~~~~~~~e~a 126 (505) T protein:vir:79 78 KLASAKLASLIFNEQCQVTV---------------------------SDETAN----DFLDDVFQQNDFYTTFEEKLEEW 126 (505) T ss_pred HHHHHHHHhhhcCCCceeec---------------------------CChHHH----HHHHHHHHhccHHHHHHHHHHHH Confidence 88888877765544444433 233444 44566667789999999999999 Q ss_pred HhcCccEEEEEEeeccCCCCCCcceEEEecCccceeeCCCccccCccccceeeeeecCCHHHHHHhcCCcccchhhcccc Q lcl|Aclame:pro 167 VESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSV 246 (711) Q Consensus 167 ~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~ 246 (711) +..|.+|+++++|. +.++|..| ++..|++=. ...-+..++-|+.+.+..... T Consensus 127 ~a~G~~~~k~~~D~-------~~~~i~~v-~ad~~~P~~-~d~~~~~~~a~~~~~~~~~~~------------------- 178 (505) T protein:vir:79 127 IALGSGCVRPYVDS-------GKIKLAWA-TADQVYPLQ-ADTNQVNELAIASRTTEVENH------------------- 178 (505) T ss_pred hhcCCeEEEEEEeC-------CceEEEEE-cCCeeEEEE-EcCCCeEEEEEEEEEEEecCC------------------- Confidence 99999999999862 46778777 687777311 111233455554332211100 Q ss_pred cccccCCCCCeEEEEEeeeeeeece----eEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEEecCce Q lcl|Aclame:pro 247 ADYDTWFTEKSVRVSEYFTREPVIR----EIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANV 322 (711) Q Consensus 247 ~~~~~~~~~~~v~v~E~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~ 322 (711) ...-.+..|+++...... +++...++..+ |..+ ....+.. |......+. T Consensus 179 -------~~~~yt~lE~h~~~~~~~~I~n~ly~~~~~~~l---------------G~~v-~l~~~~~----~~~l~~~~~ 231 (505) T protein:vir:79 179 -------RTIYYTLLEFHQWDHGDYVITNELYRSEAAETV---------------GINV-PLNSLEQ----YEGLEPQVK 231 (505) T ss_pred -------cceEEEEEEEEEecCceEEEEEEEEecCCCCcc---------------Cccc-chhhccc----ccccCccee Confidence 001233455554332211 11111111000 0000 0000000 000000000 Q ss_pred eccCccCCCCccceEEEE--eeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHH--- Q lcl|Aclame:pro 323 LEGPVEIPSTTIPVIPVW--GKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGRED--- 397 (711) Q Consensus 323 le~~~p~~~~~~P~vp~~--~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~--- 397 (711) +. .....+|+.|- ..-.-...+++|.|++..+++..+.+|...|++.+.+.. .+.++.++++++..... T Consensus 232 ~~-----g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~-g~~~i~v~~~~l~~~~~~~~ 305 (505) T protein:vir:79 232 IT-----GLKHPLFAFYRNKGANNKNFTSPMGMSLIDNSYTVIDAINRTHDQFVDEVKK-GQRRLIVPAEWLKTGSSYGG 305 (505) T ss_pred ec-----CCCcceEEEecCCcccccccCCccCCchhhhhHHHHHHHHHHHHHHHHHHHh-cccceeechHHhcccCCCCc Confidence 00 11122232210 000012356678999999999999999999999999875 56778887776632110 Q ss_pred -HHh--hcccCCCceEEec-ccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccch-hHHHHHHHHH Q lcl|Aclame:pro 398 -EWE--QANTKNFSLLTYI-PQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNE-TSGRAIIARQ 472 (711) Q Consensus 398 -~~~--~~~~~~~~~i~~~-~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~-~sg~ai~~~~ 472 (711) ... ......+..+-.. .+..+...++.+++.-...++...++.....+...+|++....|.++++ .||.+|.+.. T Consensus 306 ~~~~~~~~~fd~~~~~y~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~TAtei~s~~ 385 (505) T protein:vir:79 306 QASETHPPMFDPDETVYQAMYGDASEVGFHDATSPIRVADYQATMDFFLREFENQTGLSQGTFTTSPSGIQTATEVVTNN 385 (505) T ss_pred ccccccccCCCccceeeeeccCCCCCCceEEecccCCHHHHHHHHHHHHHHHHHHhCCChhhcCCCccccchHHHHHHHH Confidence 000 0001111111111 1122334577776654556678888888889999999999999977654 5788888887 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccc-eEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEE Q lcl|Aclame:pro 473 RQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTER-VVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVV 551 (711) Q Consensus 473 ~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r-~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~ 551 (711) +........+...+..+++++.+.++.+..-+.-..- ..+-.+ ....++++ T Consensus 386 ~~l~~t~~~~~~~~~~al~~li~~i~~~~~~~~~~~~g~~~~~~----------------------------~~~~~~i~ 437 (505) T protein:vir:79 386 SQTYQTRSSYITQVEKTIKALTYAILELASVPSFYADGQARWTG----------------------------DVDSLDIT 437 (505) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccC----------------------------CCCceeEE Confidence 7777777788888888999999888888666542110 000000 01123444 Q ss_pred eecccChHHHHHHHHHHHHHHHhh--cchhHHHHHHHHHHhcCCc--chHHHHHHHHhhhcchhhcchhhhhhh Q lcl|Aclame:pro 552 VTTGPAFATQRIEAAEAMIQFAQA--VPSAAAVMADLIAQNMDWP--GADVIAERLKKIVPPNVLSKDEREAIE 621 (711) Q Consensus 552 v~~~~~~~s~r~~~~~~L~~l~~~--~p~~~~~~~~~~~~~~~~~--~~~e~~~~l~~~~~~~~~~~~~~~~~~ 621 (711) ++=+.+....+++..+..+++... ++.. ..+.+.-.+. .+.+.+.+++.......+.. ...... T Consensus 438 v~f~d~i~~d~~~~~~~~~~~v~~Gi~s~e-----~~l~~~~~~~eeea~~el~ri~~E~~~~~p~~-~~~gg~ 505 (505) T protein:vir:79 438 INFNDGVFVDQESKRAADLQAVQAQVMPKK-----QFLMRNYGLDEEEADEWLAQIDAENSTAEPEF-NQFGGD 505 (505) T ss_pred EEeCCCCCCCHHHHHHHHHHHHHcCCCCHH-----HHHHhcCCCChHHHHHHHHHHHHhccccCCCc-hhccCC Confidence 444433333333333444443322 1111 1112222222 12233333333221111110 000011 No 102 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=99.68 E-value=2.6e-15 Score=100.50 Aligned_cols=465 Identities=10% Similarity=0.031 Sum_probs=227.5 Q ss_pred hHHHHHHHHHHHHHH---------HH-----hhchHHHHHHHHHHHHhCCC--CCCHHHHHH-HHHhCCCceEehhhHHH Q lcl|Aclame:pro 27 DDRALLATARERARD---------GA-----TYWKDNWEAAEDDLKFLGGE--QWPSQVRTE-RELEQRPCLVNNVLPTF 89 (711) Q Consensus 27 ~~~~~~~~~~~~~~~---------~~-----~~~~~~r~~~~~~~~~y~G~--Qw~~~~~~~-~~~~g~p~~~~N~i~~~ 89 (711) =-+.+...+++++++ .. ....+.+.......+||.|+ .|....... .....+..++.|.-+-+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~s~n~~~~i 80 (499) T protein:vir:80 1 MINQIIAGVKGVMRRMGLLKSLKDVTDHKKVNANDEDYKYIDMWKRLYQGNYAEWHNLNYEHNGNPVNRRQLSMNLPKVT 80 (499) T ss_pred ChhHHHHHHHHHHHHhccccchhhhhcCCCCcCCHHHHHHHHHHHHHhcCCcchhhccccccCCCccccceeecchHHHH Confidence 111223333343332 11 11234456667788999985 564321110 00112335778999999 Q ss_pred HHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhc Q lcl|Aclame:pro 90 VDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVES 169 (711) Q Consensus 90 v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~ 169 (711) |+...++.-...+.+.+ +|.+.++.|+ .+.+.+++......+.+.++.. T Consensus 81 v~~~a~~l~~ep~~i~~---------------------------~d~~~~e~l~----~~~~~n~f~~~~~~~~~~a~~~ 129 (499) T protein:vir:80 81 AKYMSKLLFNEKVKINI---------------------------DDETAEEFVL----NVLKTNGFTKNMERYIEYGEAM 129 (499) T ss_pred HHHHHHhhhCCcceEee---------------------------CCHHHHHHHH----HHHhhccHHHHHHHHHHHHhhc Confidence 99988887665555543 3445555544 4556689999999999999999 Q ss_pred CccEEEEEEeeccCCCCCCcceEEEecCccceeeCCCccccCccccceeeeeecCCHHHHHHhcCCcccchhhccccccc Q lcl|Aclame:pro 170 GMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSVADY 249 (711) Q Consensus 170 G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~ 249 (711) |.||+++++|. +++++|..+ +|..||+=... .-++..|-|+-... . T Consensus 130 G~~~~~~~~D~------~~~~~i~~v-~a~~~~Pi~~d-~~~~~~~~f~~~~~---~----------------------- 175 (499) T protein:vir:80 130 GGFVIKVYHDG------NKNVKVSFA-TADCMYPLSND-SENVDECLIANSFH---K----------------------- 175 (499) T ss_pred CcEEEEEEECC------CCcEEEEEE-cCCceEEEEec-CCCeEEEEEEEEEe---e----------------------- Confidence 99999999874 267888777 78888731111 12344454432211 0 Q ss_pred ccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEEecCceeccCccC Q lcl|Aclame:pro 250 DTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGPVEI 329 (711) Q Consensus 250 ~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~le~~~p~ 329 (711) ..+..+.+|++++.......+...+ ..+..+.... .|..+... .+ + .-++...++ T Consensus 176 ----~~~~y~~lE~h~~~~~~~~~y~I~n-~~~~~~~~~~-------lG~~v~l~------~~-~------~~~~~~~~~ 230 (499) T protein:vir:80 176 ----NNKYYKLLEWNEWKGEKEEVYTVTT-ELYQSDDPNE-------LGGKVSLK------LL-F------NDIEPVVPL 230 (499) T ss_pred ----cCeEEEEEEEEEecccceeeEEEEE-EEEeccCccc-------cCcccchh------hh-c------cCcCCceee Confidence 0112333444333221111111110 0000000000 00000000 00 0 000000111 Q ss_pred C-CCccceEEEEeeeec-----cCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHHHHhhc- Q lcl|Aclame:pro 330 P-STTIPVIPVWGKSLI-----IKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQA- 402 (711) Q Consensus 330 ~-~~~~P~vp~~~~~~~-----~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~~~- 402 (711) . .+..||+.| +.+ ..+++.|.|++..+++..+.+|...|.+.+.+.. ...++.++.+.+....+ .... T Consensus 231 ~~~~~p~f~~~---~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~-~~~~i~v~~~~l~~~~~-~~g~~ 305 (499) T protein:vir:80 231 PSLTRPTFIYI---KPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKL-GKKKVLVPSSFVKTAVN-LDGST 305 (499) T ss_pred cCCCccceEee---cCCccccccCCCccCCchHhhHHHHHHHHHHHHHHHHHHHHh-cccceecchhhhhccCC-CCCCc Confidence 1 123344322 222 2356678899999999999999999999999876 56777787666642111 0000 Q ss_pred ---ccCCCceEEecccc--cCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccch-hHHHHHHHHHHHHH Q lcl|Aclame:pro 403 ---NTKNFSLLTYIPQY--QGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNE-TSGRAIIARQRQGD 476 (711) Q Consensus 403 ---~~~~~~~i~~~~~~--~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~-~sg~ai~~~~~~~~ 476 (711) ...+..++....+. .++..++..++.-...+....++.....+....|++....|.++++ .||.++....+... T Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~fg~~~~g~~TAtei~s~~~~l~ 385 (499) T protein:vir:80 306 TQYFDSTDEAFFLYQGEQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSEKSETY 385 (499) T ss_pred ccCCCcccceeeEeeccCCCCcCceeEecCcCChHHHHHHHHHHHHHHHHhcCCChhhcCCCcccchhHHHHHHHHHHHH Confidence 00111122221211 1223466666555556677888888889999999999999876654 57888887776666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEeeccc Q lcl|Aclame:pro 477 RGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGP 556 (711) Q Consensus 477 ~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~ 556 (711) .....+...+..+++++.+.++.+...+--. .|.... ..++++.=.. T Consensus 386 ~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~------~~~~~~---------------------------~~~v~v~f~d 432 (499) T protein:vir:80 386 QTKNSHSQLIEQGIKEMIVSILEVGKLIKAY------DGDTVE---------------------------LDTITVDFDD 432 (499) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhccc------cCCCCC---------------------------ccceEEEeCC Confidence 6667777788888888888888775543210 010000 0112221121 Q ss_pred ChHHHHHHHHHHHHHHHh--hcchhHHHHHHHHHHhcCCc--chHHHHHHHHhhhcchhhcchhhhhhhhhHHH Q lcl|Aclame:pro 557 AFATQRIEAAEAMIQFAQ--AVPSAAAVMADLIAQNMDWP--GADVIAERLKKIVPPNVLSKDEREAIEEDMPE 626 (711) Q Consensus 557 ~~~s~r~~~~~~L~~l~~--~~p~~~~~~~~~~~~~~~~~--~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 626 (711) +......+..+.++++.. .++... .+.+.-... .+++.+.+++.......+..+.... ....+ T Consensus 433 ~i~~d~~~~~~~~~~~~~~Gi~S~et-----~l~~~~~~~d~ea~~el~~i~~E~~~~~~~~d~~g~--~ge~e 499 (499) T protein:vir:80 433 SIAQDEDTTINRYTTAKNQGMIPLKI-----ALQRAWNITEAEADEWAEMLAKEKQAEIPNNDMTGI--FGEEE 499 (499) T ss_pred CCCCCHHHHHHHHHHHHHcCCCCHHH-----HHhhcCCCChHHHHHHHHHHHHHhhcCCCCCCcccc--CCCCC Confidence 211122233333333321 122111 111121111 1223333333322211111110000 00000 No 103 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=99.67 E-value=3.8e-16 Score=105.11 Aligned_cols=460 Identities=11% Similarity=0.044 Sum_probs=197.3 Q ss_pred CCCCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHH----HHHHHHhCCCce Q lcl|Aclame:pro 6 KKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQV----RTERELEQRPCL 81 (711) Q Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~----~~~~~~~g~p~~ 81 (711) ..++++... ..+.++++.++...+.. ++..-.+-.+||.|.|..... ...++ .-.. T Consensus 1 ~~~~~~~~~----------~~~~~~~~~~l~~~~~~-------~~~rl~~l~~Yy~G~~~i~~~~~~~~~~~~---~~~~ 60 (484) T protein:vir:77 1 MTSPLQKQE----------NVDPEKAREEMLNLFTE-------RTQDLGDNTAYYESERRPDAVGVTVPQQMQ---KLLA 60 (484) T ss_pred CCCcccccC----------CCCHHHHHHHHHHHHHH-------HHHHHHHHHHHHhccccchhcccccchhHH---hhhh Confidence 223333321 12333455556555443 222333557899998864321 11111 1124 Q ss_pred EehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHH Q lcl|Aclame:pro 82 VNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDI 161 (711) Q Consensus 82 ~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~ 161 (711) +.|..+-+|+..++...-+ -|+ .+++.+. ...+..+++.|+++..... T Consensus 61 ~~n~~~~ivd~~~~~l~~~----g~~------------------------~~~~~~~----~~~l~~i~~~N~~d~~~~~ 108 (484) T protein:vir:77 61 HVGYPRLYIDAIAARQELE----GFR------------------------LGGADKA----DEQLWDWWQANDLDIESTL 108 (484) T ss_pred hcCcHHHHHHHHHhhhccC----cee------------------------cCCcchh----HHHHHHHHHhcCHhHHHHH Confidence 6799999999888754211 011 1122222 2335567788999999999 Q ss_pred HHHHHHhcCccEEEEEEeeccCCCCC--CcceEEEecCcccee--eCCCccccCccccceeeeeecCCHHHHHHhcCCcc Q lcl|Aclame:pro 162 AFQGAVESGMGYLRVRSDYLADDSFE--QDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDAT 237 (711) Q Consensus 162 a~~~~~~~G~g~~~v~~d~~~~~~~~--~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~ 237 (711) +..+++++|.+|+-|+.+........ +.++|..+ +|.+++ |||..+. ..+ +++.+.+. T Consensus 109 ~~~~a~~~G~a~~~v~~~~~~~~~~~~~~~~~i~~~-~p~~~~~~~D~~~~~-----~~~-a~~~~~~~----------- 170 (484) T protein:vir:77 109 GHTDSLVHGRSYITISKPDPNIDPGVDPEVPIIRVE-PPTNLYAQIDPRTRQ-----VMR-AIRAIEDE----------- 170 (484) T ss_pred HHHHHhhcCceEEEEecCCCCcccccccccceEEEe-ccceeEEEecCCCCc-----eEE-EEEEEEee----------- Confidence 99999999999998887654332221 23455544 787775 6764332 111 12211110 Q ss_pred cchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEE Q lcl|Aclame:pro 238 AEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKI 317 (711) Q Consensus 238 ~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~ 317 (711) ....+..+++|+... ++.+ ... T Consensus 171 ----------------~~~~~~~~~~y~~~~------------~~~~------------------------------~~~ 192 (484) T protein:vir:77 171 ----------------EGNEVIGATLYLPNN------------TVIW------------------------------NRE 192 (484) T ss_pred ----------------cCCcEEEEEEEecCe------------EEEE------------------------------Eec Confidence 011122223332211 1110 001 Q ss_pred ecCceeccCccCCCCccceEEEEeeeeccCCcccccchHH-HhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChH Q lcl|Aclame:pro 318 TGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIR-HSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGRE 396 (711) Q Consensus 318 ~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~-~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~ 396 (711) .|.-.+.+..|.+.|.+|+|||.-.. ..+..+|.|.+. .++++++.+|+.+|.+...+...+.+..++- |.-. + T Consensus 193 ~~~~~~~~~~~~~~g~vPvv~f~N~~--~~~~~~G~s~i~~~v~~L~Da~~~~~s~~~~~~~~~a~p~~~i~-G~~~--~ 267 (484) T protein:vir:77 193 DGQWVQVANVAHNLEMVPVIPIPNRT--RLSDLYGTTEITPELRSVTDAAARTLMLMQATAELMGVPQRLLF-GVKG--E 267 (484) T ss_pred CCceEeeccccCCCCCcceEEecccc--ccCccCCcccchHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHh-CCCc--c Confidence 11111223345566788888875322 234445566664 6999999999999999999887777765553 2211 1 Q ss_pred HHHhh-------cccCCCceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHH---hCCCHHHhcccc-chhHH Q lcl|Aclame:pro 397 DEWEQ-------ANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKST---MGMYDASLGAMG-NETSG 465 (711) Q Consensus 397 ~~~~~-------~~~~~~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~---tGv~~~~~G~~~-~~~sg 465 (711) ++-.. ....++.++.. ++ .+..+..++..+ ...++......+..+ +++++..+|..+ |..|| T Consensus 268 ~~~~~~~~~~~~~~~~~~~~~~~-~~--~~~~~~q~~~~~----~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg 340 (484) T protein:vir:77 268 ELGVDPETGQTLFDAYLARILAF-ED--HESKAQQFSAAE----LRNFVDALDALDRKAAAYTGLPPYYLSFSSENPASA 340 (484) T ss_pred hhcccccccchhhhhhhhhhccc-CC--CCceeEeecCCC----hHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHH Confidence 11000 00112222222 21 122233332222 223445555555554 678888888554 55799 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhh Q lcl|Aclame:pro 466 RAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNV 545 (711) Q Consensus 466 ~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~ 545 (711) .|+......-..........|..+++++.++++.+ ... .....++.. T Consensus 341 ~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~l~~~~----~~~--------~~~~~~~~~--------------------- 387 (484) T protein:vir:77 341 EAIRSSESRLVKTVERKNKIFGGAWEQAMRVAYKV----MNG--------GDIPPEYYR--------------------- 387 (484) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----hCC--------CCccccccc--------------------- Confidence 99887665544455555555555665555554433 211 001111111 Q ss_pred eeeeEEeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCCcc--hHHHHHHHHhhhcchhhcchhhhhhhhh Q lcl|Aclame:pro 546 QKYDVVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDWPG--ADVIAERLKKIVPPNVLSKDEREAIEED 623 (711) Q Consensus 546 ~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~~~--~~e~~~~l~~~~~~~~~~~~~~~~~~~~ 623 (711) +.+.=.+.......+..+.+..+++....+. ....+++.+++-. .+++ ++++...... T Consensus 388 ----i~v~w~~~~~~s~~~~ad~~~kl~~~g~gi~--s~et~~~~l~~~~~~~~e~-~~~~~ee~~~------------- 447 (484) T protein:vir:77 388 ----MESIWRDPSTPTYAAKADAATKLYNNGQGVI--PKERARIDMGYSITEREEM-RKWDEEEQAQ------------- 447 (484) T ss_pred ----ceEEecCCCCCCHHHHHHHHHHHHhccCCCC--CHHHHHhcCCCChhHHHHH-HHHHHHHHHH------------- Confidence 1111011111112233344444443211110 0112233333311 1111 1111000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHH-HHHH-HHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 624 MPEQTEPTPEQQVEMAKSQAD-MAQA-EADTAQAQADMLKAQLETEEAQKQLA 674 (711) Q Consensus 624 ~~~~q~~~~~~q~~~~~~q~~-~~k~-qae~~~aqae~~~~q~~~~~~q~q~~ 674 (711) .+.....+... .... ..... +....+.+.. +...+ T Consensus 448 ----------~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~--~~~~~ 484 (484) T protein:vir:77 448 ----------GLGLMGTMFGTDPSGGGNPDNP----ETPEPQPNPA--EEAAA 484 (484) T ss_pred ----------HHHHHhhhccccccCCCCCCCC----CcccccCCCc--cccCC Confidence 00000000000 0000 00000 0000000000 00000 No 104 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=99.67 E-value=6.2e-16 Score=103.94 Aligned_cols=492 Identities=12% Similarity=0.072 Sum_probs=207.3 Q ss_pred CCCCCCcccCCCcccC-Cc-CcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCC---c Q lcl|Aclame:pro 6 KKSRVEQLYAKKAKVY-AK-NNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRP---C 80 (711) Q Consensus 6 ~~~~~~~~~~~~~~~~-~~-~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p---~ 80 (711) -.-++++...-.++.- -. +..+.+.++.-+.+++.. ....+.+-.+-.+||.|+|.............++ . T Consensus 1 ~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~l~~~l~~~----~~~~~~rl~~l~~YY~G~~~~~~~~~~~~~~~~~~~~~ 76 (501) T protein:vir:25 1 MTVPVDVIADAPAADVEFPEDSMSREQLGALVADMWRL----HISERQWLDRIYEYTKGLRGRPEVPEGASDEVKELAKL 76 (501) T ss_pred CcccchhhhccCcccccCCcccCChHHHHHHHHHHHHH----HHHHHHHHHHHHHHHhcCCCchhccccCChhhhhhHhh Confidence 2334444433333332 22 222333333333333322 2223334445578999998643322222222222 2 Q ss_pred eEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHH Q lcl|Aclame:pro 81 LVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYD 160 (711) Q Consensus 81 ~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~ 160 (711) .+.|..+-+|+..++... ++ -|+- + +..+. ..+..+++.|+++.... T Consensus 77 ~v~n~~~~ivd~~a~~l~---~~-gf~~---------------------~-d~~~~-------~~l~~i~~~N~~d~~~~ 123 (501) T protein:vir:25 77 SVKNVLSLVRDSFAQNLS---VV-GYRN---------------------A-LAKEN-------DPAWEMWQRNRMDARQA 123 (501) T ss_pred hhcChHHHHHHHHHhhhc---cc-ceec---------------------C-Cccch-------HHHHHHHHhcChhHHHH Confidence 456899999998777432 11 0111 1 11111 12356788999999999 Q ss_pred HHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCcccee--e-CCCccccCccccceeeeeecCCHHHHHHhcCCcc Q lcl|Aclame:pro 161 IAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVT--I-DPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDAT 237 (711) Q Consensus 161 ~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~--~-Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~ 237 (711) .+..+++++|+||+-|+.+ ++ + .+|+.+ +|.+++ | ||..... ..++ ++.+....+ T Consensus 124 ~~~~~a~i~G~ay~~v~~d---e~---~-~~i~~~-sp~~~~~iy~D~~~~~~----~~~a-i~~~~~~~~--------- 181 (501) T protein:vir:25 124 EVHRPALTYGASYVTVTPT---DE---G-PVFRTR-SPRQILAVYADPSVDAW----PQYA-LETWVAQKD--------- 181 (501) T ss_pred HHHHHHhhcCceEEEEecC---CC---C-CeEEEe-ccccEEEEEecCCCCcc----eeEE-EEEEeeccc--------- Confidence 9999999999999877654 22 2 245544 677664 4 5644321 2222 222221110 Q ss_pred cchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEE Q lcl|Aclame:pro 238 AEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKI 317 (711) Q Consensus 238 ~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~ 317 (711) .+.....++|... .++.+.....+...... +.. .... ..... T Consensus 182 -----------------~~~~~~~~~y~~~----~~~~~~~~~~~~~~~~~---------~~~--~~~~------~~~~~ 223 (501) T protein:vir:25 182 -----------------AKPHRRGVLYDDT----YMYELDLGEVVLGDAGG---------GQA--TQQP------VNVRE 223 (501) T ss_pred -----------------cCcceeEEEecCe----eEEEEecCceeeeeccc---------ccc--cccc------ccccc Confidence 0011111111110 00001000000000000 000 0000 00000 Q ss_pred ecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHH Q lcl|Aclame:pro 318 TGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGRED 397 (711) Q Consensus 318 ~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~ 397 (711) .++..-+...|.+.+..|||+|.-.+ ...+.+.|.+..++++++.+|+.++.+...+...+.+...+- |.-.+..+ T Consensus 224 ~~~~~~~~~~~~~~~~vPiv~f~N~~---~~~~~g~sdie~v~~l~Da~~~~~s~~~~~~e~~a~p~~~i~-G~~~~~~~ 299 (501) T protein:vir:25 224 VTDVIEHGATFEGKPVCPVVRFVNGR---DADDMIVGEVAPLILLQQAINSVNFDRLIVSRFGANPQRVIS-GWTGSKAE 299 (501) T ss_pred cccccccccccCCccceeeEeccCcc---ccCccccchhhhhHHHHHHHHHHHHHHHHHHHhhccHHHHHh-CCCCCccc Confidence 11111122234445566666643222 334567899999999999999999999988887777654442 43222222 Q ss_pred HHhhcccCCCceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHHHHHH Q lcl|Aclame:pro 398 EWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDR 477 (711) Q Consensus 398 ~~~~~~~~~~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~ 477 (711) .|. ...+.++... +. +..+..++...+ ..+...+......|-..|++++...|...++.||.|+......-.. T Consensus 300 ~~~---~~~~~i~~~~-~~--~~~~~q~~~~~~-~~~~~~l~~~i~~i~~~s~~P~~~~~~~~~N~Sg~Al~~~~~~l~~ 372 (501) T protein:vir:25 300 VLK---ASALRVWTFE-DP--EVKAQAFPPASV-EPYNLILEEMLQHVAMVAQISPAQVTGKMINVSAEALAAAEANQQR 372 (501) T ss_pred hhh---hcccceeccC-CC--CceEEEecccCh-HHHHHHHHHHHHHHHhhcCCChhhhccccCChHHHHHHHHHHHHHH Confidence 332 3445555442 21 223433333222 3355556666666666788999999866666799999887766666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEeecccC Q lcl|Aclame:pro 478 GSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPA 557 (711) Q Consensus 478 ~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~ 557 (711) ......+.|..++++++++++ .+.+.. ...+. +++.+.=.+. T Consensus 373 ka~~k~~~f~~~l~~~~rl~~----~~~~~~---------~~~~~-------------------------~~i~v~w~~~ 414 (501) T protein:vir:25 373 KLAAKRESFGESWEQLLRLAA----EMDDDP---------DTAAD-------------------------SGAEVLWRDT 414 (501) T ss_pred HHHHHHHHHHHHHHHHHHHHH----HHhCCC---------ccccc-------------------------eeeeEEecCC Confidence 666666666666666666544 332211 00010 1111111111 Q ss_pred hHHHHHHHHHHHHHHHhh-cchhHHHHHHHHHHhcCCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHHHHHHHH Q lcl|Aclame:pro 558 FATQRIEAAEAMIQFAQA-VPSAAAVMADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQV 636 (711) Q Consensus 558 ~~s~r~~~~~~L~~l~~~-~p~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~ 636 (711) .+....+..+.+..+.+. .|. ..++..+..-...++. ++......... ...... T Consensus 415 ~~~s~~~~ada~~kl~~~gis~------et~~~~~~g~~~~~ie-~~~~~~~e~~~------------------~~~~~~ 469 (501) T protein:vir:25 415 EARSFGAVVDGITKLASAGIPI------EHLLSMVPGMTQQTIQ-AIKDSLRGGEV------------------KSLVDK 469 (501) T ss_pred CCCCHHHHHHHHHHHHhcCCCH------HHHHHHcCCCCHHHHH-HHHHHHHHHhH------------------HHHHHH Confidence 111123333444444432 221 1122222111112211 11110000000 000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 637 EMAKSQADMAQAEADTAQAQADMLKAQLETEEAQK 671 (711) Q Consensus 637 ~~~~~q~~~~k~qae~~~aqae~~~~q~~~~~~q~ 671 (711) ..............+. ..+... ..+.. -...+ T Consensus 470 ~~~~~~~~~~~~~~~~-~~~~~~-~~~~~-~~~g~ 501 (501) T protein:vir:25 470 LLSNEPAPVPPPPPQA-AAQALN-EGGVN-GNGGA 501 (501) T ss_pred hhccCcCCCCCCCCCC-Cccccc-cccCC-CCCCC Confidence 0000000000000000 000000 00000 00000 No 105 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=99.66 E-value=9.8e-15 Score=97.38 Aligned_cols=398 Identities=15% Similarity=0.107 Sum_probs=207.0 Q ss_pred chHHHHHHHHHHHHhCCCCCC----HHHHHHHHHhCCCceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhccc Q lcl|Aclame:pro 46 WKDNWEAAEDDLKFLGGEQWP----SQVRTERELEQRPCLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGE 121 (711) Q Consensus 46 ~~~~r~~~~~~~~~y~G~Qw~----~~~~~~~~~~g~p~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~ 121 (711) ..-++.+-..-.+||.|+|=. ......++...+ ++.|+.+.+|+.+.+-..=+ -| T Consensus 1 l~~~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~--~v~nw~~~~Vds~a~rl~~~----Gf--------------- 59 (410) T protein:vir:95 1 MNLYQSRVNLRYKHYAMQHYEAPTGITIPAHIRAKYQ--AVLGWAAKGVDSLADRLIFR----AF--------------- 59 (410) T ss_pred CCcchhhHHHHHHHhcCCCCccccchhccHHHHhHHH--hhcchhHHHHHHhHhhhccc----cc--------------- Confidence 222334444567999998632 233344444333 46799999999886532100 01 Q ss_pred ccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCccce Q lcl|Aclame:pro 122 DTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSV 201 (711) Q Consensus 122 ~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v 201 (711) ..+|.+ +..+++.|+++...+.++.+++++|++|+-|+-+ + ++.++|..+ +|.++ T Consensus 60 ----------~~~d~~--------l~~i~~~N~ld~~~~~~~~~al~~G~sf~~v~~~---~---d~~~~i~~~-sP~~~ 114 (410) T protein:vir:95 60 ----------ANDDFN--------VTEIFDRNNPDIFFDSAILSALIGSCSFVYISKG---E---DDEVRLQVI-ESSNA 114 (410) T ss_pred ----------cCCCch--------HHHHHhhcChHHHHHHHHHHHHHhCceeEEEecC---C---CCceEEEEE-cccce Confidence 112222 4667889999999999999999999999877543 2 245677655 67655 Q ss_pred e--eCCCccccCccccceeeeeecCCHHHHHHhcCCcccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCC Q lcl|Aclame:pro 202 T--IDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDG 279 (711) Q Consensus 202 ~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~ 279 (711) + |||..+.+ .+..+.+ + .+ .........+|. ++ T Consensus 115 ~~i~Dp~~~~~------~~al~~~-----------~---------------~~-~~~~~~~~~~~~------------~~ 149 (410) T protein:vir:95 115 TGVIDPITGLL------VEGYAVL-----------A---------------RD-DYNRPTLEAYFE------------PN 149 (410) T ss_pred EEEEeCCCCce------EEEEEEE-----------E---------------ec-CCCeEEEEEEEe------------CC Confidence 4 77743322 1111110 0 00 001111112221 11 Q ss_pred cEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEEecCceeccCccCCCCccceEEEEeeeeccCCcccccc-hHHH Q lcl|Aclame:pro 280 RSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRS-IIRH 358 (711) Q Consensus 280 ~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g-~v~~ 358 (711) .++.+... |. .| ..|.+.|..|+|||+-.+ ..+.+.|.| +.+. T Consensus 150 ~~~~~~~~----------~~-------------~~-----------~~~~~~g~vPvV~f~n~~--~l~~~~G~s~I~~~ 193 (410) T protein:vir:95 150 ATHFIPKD----------GE-------------PY-----------SVTNETGIPLLVPVIHRP--DAVRPFGRSRITRA 193 (410) T ss_pred cEEEEeeC----------Cc-------------cc-----------cccCCCCCcceEEecccc--cCCccCCccccchh Confidence 12111100 00 00 113345778888876332 223444545 5588 Q ss_pred hhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCC--hHHHHhhcccCCCceEEecccccC-cCCccccCCccchHHHH Q lcl|Aclame:pro 359 SKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEG--REDEWEQANTKNFSLLTYIPQYQG-DPGPRRQPPAAVPAAEL 435 (711) Q Consensus 359 ~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~--~~~~~~~~~~~~~~~i~~~~~~~~-~~~i~~~~~~~~~~~~~ 435 (711) +++.|+.+|+.++.+.-.....+.|+..+- |.-.+ ..+.|. ...+.++.+.....+ ...+..++...+. .+. T Consensus 194 v~~l~da~~r~~~~~~~~~e~~a~pqr~i~-G~d~d~~~~~~~~---~~~~~i~~~~~~~~~~~~~v~q~~~~~l~-~~~ 268 (410) T protein:vir:95 194 GMYYQKYAKRTLERADITAEFYSWPQKYIL-GLDPDAEPMEKWK---ATVSSLLTISSSDKGVKPSVGQFTTASMS-PFT 268 (410) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcchhheee-ccCCCCCcCchhh---hhhhhheeccCCCCCCcceEEecCCCChH-HHH Confidence 999999999999999999888888866652 32111 111222 334555655433222 2234445555554 355 Q ss_pred HHHHHHHHHHHHHhCCCHHHhcccc-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcc-ceEee Q lcl|Aclame:pro 436 TLGQNSVEKIKSTMGMYDASLGAMG-NETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTE-RVVRL 513 (711) Q Consensus 436 ~ll~~~~~~~~~~tGv~~~~~G~~~-~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~-r~~ri 513 (711) ..+......+-.+||++...+|..+ |..||.||.+....=........+.|..+.++++++.+.+.-.+=..+ ...++ T Consensus 269 ~~l~~l~~~~a~~s~lP~~~lg~~~~NpsSa~Al~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~i~~~~~~~~~~~~~~ 348 (410) T protein:vir:95 269 EQLRTAAAGFAGEMGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYVAACLRDEFRYTRSQFVRT 348 (410) T ss_pred HHHHHHHHHHhhhcCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCccccccee Confidence 6666666677777899999999655 458999998776665555556666666777777777666543221100 00000 Q ss_pred ecccCcchheecchhhhhhhccceeeeehhhheeeeEEeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCC Q lcl|Aclame:pro 514 KFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDW 593 (711) Q Consensus 514 ~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~ 593 (711) ...|..+ . -+.+.|. .+..+.+..+.++.|-+. ....+++.+++ T Consensus 349 -----~v~W~p~---------------~-------------d~~~~s~-a~~aDa~~Kl~~a~~g~~--~~~~~~~~lg~ 392 (410) T protein:vir:95 349 -----AVKWEPL---------------F-------------EADANTM-TMIGDGVVKLNQALPGYI--NAETIRDLTGI 392 (410) T ss_pred -----eEEeeec---------------C-------------CcchhhH-HHHHHHHHHHHHhccCCc--cHHHHHHhcCC Confidence 0011100 0 1122222 445556666666544222 11234555666 Q ss_pred cchHHHHHHHHhhhcchhh Q lcl|Aclame:pro 594 PGADVIAERLKKIVPPNVL 612 (711) Q Consensus 594 ~~~~e~~~~l~~~~~~~~~ 612 (711) ...+ +.+.........++ T Consensus 393 ~~~~-~~~~~~~e~~~~g~ 410 (410) T protein:vir:95 393 AGDM-SAKPVVSEGGSNGE 410 (410) T ss_pred ChHH-HHHHHHHHHHhCCC Confidence 4332 22211111111111 No 106 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=99.66 E-value=1.2e-14 Score=96.95 Aligned_cols=398 Identities=14% Similarity=0.097 Sum_probs=211.0 Q ss_pred hHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHH----HHHHHHHhCCCceEehhhHHHHHHHhhhhhhccc Q lcl|Aclame:pro 27 DDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQ----VRTERELEQRPCLVNNVLPTFVDQVLGDQRQNRP 102 (711) Q Consensus 27 ~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~----~~~~~~~~g~p~~~~N~i~~~v~~i~g~~~~~r~ 102 (711) =+.+.|.++...+.. .+..-.+-.+||.|+|.... ....+....+ ++.|..+.+|+.+.+... T Consensus 1 ~~~~~i~~L~~~~~~-------~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~--~v~nw~~~iVds~a~rl~---- 67 (409) T protein:vir:16 1 MTEKGIGYLRFKLSV-------HKRRAEMRYEQYAMKHVDRFKGITIPQALSQQYR--SILGWCAKGVDSLADRLV---- 67 (409) T ss_pred CCHHHHHHHHHHHHH-------HhHHHHHHHHHHhccCchhhcchhhhHHHHHHHh--hhcChhHHHHHHhHhhcc---- Confidence 223345555544322 33445567889999986532 3344443333 457999999998865321 Q ss_pred ceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCccEEEEEEeecc Q lcl|Aclame:pro 103 AIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESGMGYLRVRSDYLA 182 (711) Q Consensus 103 ~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~ 182 (711) |. +...+|.+ +..+++.|+++...+.+..+++++|++++-|+-+ T Consensus 68 ---~~----------------------Gf~~~d~~--------l~~i~~~N~ld~~~~~~~~~al~yG~sf~~v~~~--- 111 (409) T protein:vir:16 68 ---FR----------------------EFENDDFT--------VNEIFEENNPDIFFDSTVLSALIASCSFTYISKG--- 111 (409) T ss_pred ---cc----------------------cccCcchH--------HHHHHHhcChhHHHHHHHHHHHHhCceeEEEecC--- Confidence 10 01112222 4667889999999999999999999999866543 Q ss_pred CCCCCCcceEEEecCcccee--eCCCccccCccccceeeeeecCCHHHHHHhcCCcccchhhcccccccccCCCCCeEEE Q lcl|Aclame:pro 183 DDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSVADYDTWFTEKSVRV 260 (711) Q Consensus 183 ~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v 260 (711) + ++.+.|..+ +|.+++ |||..+.+. +-+ +.|-. + .....+. T Consensus 112 ~---dg~~~i~~~-sP~~~~~i~D~~~~~~~---~a~---~~~~~---------d-----------------~~~~~~~- 154 (409) T protein:vir:16 112 E---NDAVRLQVI-EATNATGIIDPITGLLT---EGY---AVLER---------D-----------------ENNNVVL- 154 (409) T ss_pred C---CCceEEEEE-cccceEEEeecccccce---eee---EEEEe---------c-----------------CCCceEE- Confidence 2 345677655 665544 777554331 111 11100 0 0000111 Q ss_pred EEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEEecCceeccCccCCCCccceEEEE Q lcl|Aclame:pro 261 SEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGPVEIPSTTIPVIPVW 340 (711) Q Consensus 261 ~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~le~~~p~~~~~~P~vp~~ 340 (711) ..+|. .+.++++.. .+... ...|.+.|..|+|||+ T Consensus 155 ~~~~~------------~~~~~~~~~-------------------------------~~~~~--~~~~~~~g~vPvV~f~ 189 (409) T protein:vir:16 155 EAHFL------------PDRTDYYYR-------------------------------DSRNN--ISIANPTGNPLLVPII 189 (409) T ss_pred EEEEe------------cCcEEEEEe-------------------------------cCccc--cceecCCCCcceEEec Confidence 11111 111111100 00000 1123456788999886 Q ss_pred eeeeccCCcccccc-hHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCC--hHHHHhhcccCCCceEEeccccc Q lcl|Aclame:pro 341 GKSLIIKKKEIFRS-IIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEG--REDEWEQANTKNFSLLTYIPQYQ 417 (711) Q Consensus 341 ~~~~~~~~~~~~~g-~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~--~~~~~~~~~~~~~~~i~~~~~~~ 417 (711) -.+. .....|.| +.+.+++.|+.+|+.++.+.-.....+.|+..+- |.-.+ ..+.|. ..++.++.+..... T Consensus 190 n~~~--~~~~~G~seI~~~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~-G~d~d~~~~~~~~---~~~~~i~~~~~d~~ 263 (409) T protein:vir:16 190 HRPD--AVRPFGRSRITRSGMYWQSNAKRTLERADVTAEFYSFPQKYVT-GLSDDAEPMETWK---ATVSSMLQFTKDED 263 (409) T ss_pred cccc--ccccCCccccchhHHHHHHHHHHHHHHHHHHHHHhcChhheeE-ecCCCCCccchhh---hhhhHhhccCCCCC Confidence 4432 23334444 3478999999999999999999888888876662 32111 112222 23445555543322 Q ss_pred -CcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccch-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 418 -GDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNE-TSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGK 495 (711) Q Consensus 418 -~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~-~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~ 495 (711) ....+..++..++. .+...+......+-.+||++...+|..+.+ .||.|+.+....=........+.|..+.+++++ T Consensus 264 g~~~~v~q~~~~~l~-~~~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~r 342 (409) T protein:vir:16 264 GDKPTLGQFTQPSMS-PFTEQLRTAAAGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAY 342 (409) T ss_pred CCCceEEecCCCChh-HHHHHHHHHHHHHhhhcCCCHHHcccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22335445554444 356666666667777789999999966554 799998876555444555555556666666666 Q ss_pred HHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEeecccChHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 496 ILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQA 575 (711) Q Consensus 496 ~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~ 575 (711) +++.+.-..=. ..+++..+.. .+ .-..-+++.+ ..+..+.+..+.+. T Consensus 343 la~~~~~~~~~-----------~~~~~~~~~v-------------------~W--~~~~~~~~~s-~a~~aDa~~Kl~~a 389 (409) T protein:vir:16 343 LAACLRDDVPY-----------LREQFSKTKP-------------------KW--EPLFEADASM-LSLIGDGAIKLNQA 389 (409) T ss_pred HHHHHhcCCCc-----------cchhhccceE-------------------Ee--cCCCCcchhh-HHHHHHHHHHHHhh Confidence 66555322100 0111111000 00 0000112222 24556677777766 Q ss_pred cchhHHHHHHHHHHhcCCcchH Q lcl|Aclame:pro 576 VPSAAAVMADLIAQNMDWPGAD 597 (711) Q Consensus 576 ~p~~~~~~~~~~~~~~~~~~~~ 597 (711) .|.+.. ...+.+.+++...+ T Consensus 390 ~~~~~~--~~v~~~~~g~~~~d 409 (409) T protein:vir:16 390 IPEFIN--KDTIRDLTGIKGAE 409 (409) T ss_pred cccccc--hhHHHHhccCCCCC Confidence 543321 12335566666555 No 107 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=99.66 E-value=5.5e-15 Score=98.78 Aligned_cols=437 Identities=12% Similarity=0.006 Sum_probs=196.7 Q ss_pred CCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCC---ceEehhhHHHHHHHhhhh Q lcl|Aclame:pro 21 YAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRP---CLVNNVLPTFVDQVLGDQ 97 (711) Q Consensus 21 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p---~~~~N~i~~~v~~i~g~~ 97 (711) ++ .+...+++.++...+. ..+.+..+-.+||+|+|.-..........++. .++.|..+-+|+..+|+. T Consensus 1 ~~--~~t~~~~~~~l~~~~~-------~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l 71 (456) T protein:vir:10 1 MT--ASTPAEWLPVLTKRID-------DGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRI 71 (456) T ss_pred CC--CCCHHHHHHHHHHHHH-------HHHHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhh Confidence 33 2334445555544332 23444456688999988432211111122222 367899999999999986 Q ss_pred hhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCccEEEEE Q lcl|Aclame:pro 98 RQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESGMGYLRVR 177 (711) Q Consensus 98 ~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~ 177 (711) .-+...+ . ...|.+..+ .+..+++.|+++...+.+..+++++|.+|.-|+ T Consensus 72 ~~~~~~~-------------------------~-~~~d~~~~~----~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~ 121 (456) T protein:vir:10 72 IPNGITV-------------------------G-GSADSDLAL----RARRIWRDNRMDSVCKQWVKYGLDFGESYLTCW 121 (456) T ss_pred ccCCeec-------------------------C-CCCCcchHH----HHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEe Confidence 5432211 1 112222222 244566789999999999999999999987655 Q ss_pred EeeccCCCCCCcceEEEecCcccee--eCCCccccCccccceeeeeecCCHHHHHHhcCCcccchhhcccccccccCCCC Q lcl|Aclame:pro 178 SDYLADDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSVADYDTWFTE 255 (711) Q Consensus 178 ~d~~~~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~ 255 (711) .+ + +++++++.+ +|.+++ |||.... ...++++. +.+.+ . T Consensus 122 ~d---~---~g~~~i~~~-~p~~~~~i~d~~~~~----~~~~~i~~-~~~~d---------------------------~ 162 (456) T protein:vir:10 122 RR---D---DGTATITAD-SPETMVVSVDPLQPW----RIRAAMRW-WRDLD---------------------------A 162 (456) T ss_pred eC---C---CCceEEEEE-ccceeEEEEcCCCCc----ceEEEEEE-EEecC---------------------------C Confidence 43 2 356777766 787754 6764432 12222222 11110 0 Q ss_pred CeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEEecCceeccCccCCCCccc Q lcl|Aclame:pro 256 KSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGPVEIPSTTIP 335 (711) Q Consensus 256 ~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~le~~~p~~~~~~P 335 (711) ......+++-.... +.+.. ++..+... +.......|.....+..|...+..| T Consensus 163 ~~~~~~~~~~~~~~--~~~~~----~~~~~~~~----------------------~~~~~~~~~~~~~~~~~~~~~~~~p 214 (456) T protein:vir:10 163 ESDFAIVWSGDGWQ--KFARP----CFVQSSSR----------------------RRLVTRISDSWVPVGDAVVTGSPPP 214 (456) T ss_pred ceeEEEEEecccee--EEEEE----EEEeeccc----------------------ceeeeecCCceeeccccCCCCCcee Confidence 00001111000000 00000 00000000 0000001111111222233334445 Q ss_pred eEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCC---hHH------HHhhcccCC Q lcl|Aclame:pro 336 VIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEG---RED------EWEQANTKN 406 (711) Q Consensus 336 ~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~---~~~------~~~~~~~~~ 406 (711) ++++ ....+.|.+..+++.++.+|+..|.++..+...+.+...+. |.... .++ ........+ T Consensus 215 vv~~--------~N~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~-G~~~~~~~~d~~g~~~~~~~~~~~~~ 285 (456) T protein:vir:10 215 VVVY--------QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALK-STEHGLPNVDENGNAIDYASIFEAAP 285 (456) T ss_pred EEEe--------cCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhh-ccCcccccccccccccchhhhhhhhc Confidence 5432 12346688999999999999999988766655555443331 21100 000 000011223 Q ss_pred CceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 407 FSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNL 486 (711) Q Consensus 407 ~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~ 486 (711) +.++...++ ..+..++.. ....+...+......+-.+||+++..+|...++.||.|+......-..........| T Consensus 286 ~~~~~~~~~----~~~~q~~~~-~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~N~Sg~Ai~~~~~~l~~k~~~~~~~f 360 (456) T protein:vir:10 286 GALWELPPG----VDIWESQAN-DFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIA 360 (456) T ss_pred cccccCCCC----cceEEeccc-ChhHHHHHHHHHHHHHHhccCCChHHhcccccChHHHHHHHHHHHHHHHHHHHHHHH Confidence 333333322 224334332 234455666666666777789999999877667899999888777777777777777 Q ss_pred HHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEeecccChHHHHHHHH Q lcl|Aclame:pro 487 TKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAA 566 (711) Q Consensus 487 ~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~ 566 (711) ..+++++.++++.+ . |. .+... +.++- .+..+....+.. T Consensus 361 ~~~l~~~~rl~~~~----~---------g~---~~~~~-----------------------~~v~w--~~~~~~~~~~~a 399 (456) T protein:vir:10 361 KIGLEAILVKALQI----E---------GE---SVEDT-----------------------VDVSF--ESPDRVTLGEKY 399 (456) T ss_pred HHHHHHHHHHHHHh----c---------CC---Ccccc-----------------------eeEEe--cCCCCcCHHHHH Confidence 77777777766532 1 11 00000 00000 000000112223 Q ss_pred HHHHHHHhh-cchhHHHHHHHHHHhcCCcchHHHH----HHHHhhhcchhhcchhhhhhhhhHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 567 EAMIQFAQA-VPSAAAVMADLIAQNMDWPGADVIA----ERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKS 641 (711) Q Consensus 567 ~~L~~l~~~-~p~~~~~~~~~~~~~~~~~~~~e~~----~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~ 641 (711) +.+..+.+. .+.. ..+.+.+++.. +++. ++++..... + + T Consensus 400 da~~kl~~~gi~~~-----~~~~~~lg~~~-~~i~~~e~er~~~e~~~----------------------------~--~ 443 (456) T protein:vir:10 400 SAASLAKAAGESWA-----SIRRNILNYNA-DQIKQDDLDRAREQITL----------------------------F--A 443 (456) T ss_pred HHHHHHHHcCCChH-----HHHHhhCCCCH-HHHHHHHHHHHHHHHHH----------------------------H--h Confidence 334333322 1111 11112222211 1110 000000000 0 0 Q ss_pred HHHHHHHHHHHHH Q lcl|Aclame:pro 642 QADMAQAEADTAQ 654 (711) Q Consensus 642 q~~~~k~qae~~~ 654 (711) ..-.+..+-+..+ T Consensus 444 ~~~~~~~~~~~~~ 456 (456) T protein:vir:10 444 GNPVQRPQEDGSR 456 (456) T ss_pred hhhhhcCCCCCCC Confidence 0000000000000 No 108 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=99.66 E-value=5.5e-15 Score=98.78 Aligned_cols=437 Identities=12% Similarity=0.006 Sum_probs=196.7 Q ss_pred CCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCC---ceEehhhHHHHHHHhhhh Q lcl|Aclame:pro 21 YAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRP---CLVNNVLPTFVDQVLGDQ 97 (711) Q Consensus 21 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p---~~~~N~i~~~v~~i~g~~ 97 (711) ++ .+...+++.++...+. ..+.+..+-.+||+|+|.-..........++. .++.|..+-+|+..+|+. T Consensus 1 ~~--~~t~~~~~~~l~~~~~-------~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l 71 (456) T protein:vir:10 1 MT--ASTPAEWLPVLTKRID-------DGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRI 71 (456) T ss_pred CC--CCCHHHHHHHHHHHHH-------HHHHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhh Confidence 33 2334445555544332 23444456688999988432211111122222 367899999999999986 Q ss_pred hhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCccEEEEE Q lcl|Aclame:pro 98 RQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESGMGYLRVR 177 (711) Q Consensus 98 ~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~ 177 (711) .-+...+ . ...|.+..+ .+..+++.|+++...+.+..+++++|.+|.-|+ T Consensus 72 ~~~~~~~-------------------------~-~~~d~~~~~----~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~ 121 (456) T protein:vir:10 72 IPNGITV-------------------------G-GSADSDLAL----RARRIWRDNRMDSVCKQWVKYGLDFGESYLTCW 121 (456) T ss_pred ccCCeec-------------------------C-CCCCcchHH----HHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEe Confidence 5432211 1 112222222 244566789999999999999999999987655 Q ss_pred EeeccCCCCCCcceEEEecCcccee--eCCCccccCccccceeeeeecCCHHHHHHhcCCcccchhhcccccccccCCCC Q lcl|Aclame:pro 178 SDYLADDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSVADYDTWFTE 255 (711) Q Consensus 178 ~d~~~~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~ 255 (711) .+ + +++++++.+ +|.+++ |||.... ...++++. +.+.+ . T Consensus 122 ~d---~---~g~~~i~~~-~p~~~~~i~d~~~~~----~~~~~i~~-~~~~d---------------------------~ 162 (456) T protein:vir:10 122 RR---D---DGTATITAD-SPETMVVSVDPLQPW----RIRAAMRW-WRDLD---------------------------A 162 (456) T ss_pred eC---C---CCceEEEEE-ccceeEEEEcCCCCc----ceEEEEEE-EEecC---------------------------C Confidence 43 2 356777766 787754 6764432 12222222 11110 0 Q ss_pred CeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEEecCceeccCccCCCCccc Q lcl|Aclame:pro 256 KSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGPVEIPSTTIP 335 (711) Q Consensus 256 ~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~le~~~p~~~~~~P 335 (711) ......+++-.... +.+.. ++..+... +.......|.....+..|...+..| T Consensus 163 ~~~~~~~~~~~~~~--~~~~~----~~~~~~~~----------------------~~~~~~~~~~~~~~~~~~~~~~~~p 214 (456) T protein:vir:10 163 ESDFAIVWSGDGWQ--KFARP----CFVQSSSR----------------------RRLVTRISDSWVPVGDAVVTGSPPP 214 (456) T ss_pred ceeEEEEEecccee--EEEEE----EEEeeccc----------------------ceeeeecCCceeeccccCCCCCcee Confidence 00001111000000 00000 00000000 0000001111111222233334445 Q ss_pred eEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCC---hHH------HHhhcccCC Q lcl|Aclame:pro 336 VIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEG---RED------EWEQANTKN 406 (711) Q Consensus 336 ~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~---~~~------~~~~~~~~~ 406 (711) ++++ ....+.|.+..+++.++.+|+..|.++..+...+.+...+. |.... .++ ........+ T Consensus 215 vv~~--------~N~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~-G~~~~~~~~d~~g~~~~~~~~~~~~~ 285 (456) T protein:vir:10 215 VVVY--------QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALK-STEHGLPNVDENGNAIDYASIFEAAP 285 (456) T ss_pred EEEe--------cCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhh-ccCcccccccccccccchhhhhhhhc Confidence 5432 12346688999999999999999988766655555443331 21100 000 000011223 Q ss_pred CceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 407 FSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNL 486 (711) Q Consensus 407 ~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~ 486 (711) +.++...++ ..+..++.. ....+...+......+-.+||+++..+|...++.||.|+......-..........| T Consensus 286 ~~~~~~~~~----~~~~q~~~~-~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~N~Sg~Ai~~~~~~l~~k~~~~~~~f 360 (456) T protein:vir:10 286 GALWELPPG----VDIWESQAN-DFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIA 360 (456) T ss_pred cccccCCCC----cceEEeccc-ChhHHHHHHHHHHHHHHhccCCChHHhcccccChHHHHHHHHHHHHHHHHHHHHHHH Confidence 333333322 224334332 234455666666666777789999999877667899999888777777777777777 Q ss_pred HHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEeecccChHHHHHHHH Q lcl|Aclame:pro 487 TKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAA 566 (711) Q Consensus 487 ~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~ 566 (711) ..+++++.++++.+ . |. .+... +.++- .+..+....+.. T Consensus 361 ~~~l~~~~rl~~~~----~---------g~---~~~~~-----------------------~~v~w--~~~~~~~~~~~a 399 (456) T protein:vir:10 361 KIGLEAILVKALQI----E---------GE---SVEDT-----------------------VDVSF--ESPDRVTLGEKY 399 (456) T ss_pred HHHHHHHHHHHHHh----c---------CC---Ccccc-----------------------eeEEe--cCCCCcCHHHHH Confidence 77777777766532 1 11 00000 00000 000000112223 Q ss_pred HHHHHHHhh-cchhHHHHHHHHHHhcCCcchHHHH----HHHHhhhcchhhcchhhhhhhhhHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 567 EAMIQFAQA-VPSAAAVMADLIAQNMDWPGADVIA----ERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKS 641 (711) Q Consensus 567 ~~L~~l~~~-~p~~~~~~~~~~~~~~~~~~~~e~~----~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~ 641 (711) +.+..+.+. .+.. ..+.+.+++.. +++. ++++..... + + T Consensus 400 da~~kl~~~gi~~~-----~~~~~~lg~~~-~~i~~~e~er~~~e~~~----------------------------~--~ 443 (456) T protein:vir:10 400 SAASLAKAAGESWA-----SIRRNILNYNA-DQIKQDDLDRAREQITL----------------------------F--A 443 (456) T ss_pred HHHHHHHHcCCChH-----HHHHhhCCCCH-HHHHHHHHHHHHHHHHH----------------------------H--h Confidence 334333322 1111 11112222211 1110 000000000 0 0 Q ss_pred HHHHHHHHHHHHH Q lcl|Aclame:pro 642 QADMAQAEADTAQ 654 (711) Q Consensus 642 q~~~~k~qae~~~ 654 (711) ..-.+..+-+..+ T Consensus 444 ~~~~~~~~~~~~~ 456 (456) T protein:vir:10 444 GNPVQRPQEDGSR 456 (456) T ss_pred hhhhhcCCCCCCC Confidence 0000000000000 No 109 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=99.65 E-value=1e-14 Score=97.27 Aligned_cols=483 Identities=9% Similarity=-0.029 Sum_probs=220.0 Q ss_pred CCcCCCCCCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhc----------------hHHHHHHHHHHHHhCCCC Q lcl|Aclame:pro 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYW----------------KDNWEAAEDDLKFLGGEQ 64 (711) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------------~~~r~~~~~~~~~y~G~Q 64 (711) |+ +...+++..+.-+... .+.+.++. .++|.|.+ T Consensus 1 ~~----------------------------~~~~~~~~i~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~ 50 (518) T protein:vir:78 1 MG----------------------------VWSVMTRFIKGWLNGKPNGSEPELIPKYLPLVPDNQKEWS--KDSYLTSL 50 (518) T ss_pred Cc----------------------------chhhHHHHHHHhhcCCCCccchhccHHHhhhcccchhhhh--hhhhhhhh Confidence 11 1222222111111111 11111111 33455666 Q ss_pred CCHHHHHHHHHhCCCceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHH Q lcl|Aclame:pro 65 WPSQVRTERELEQRPCLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTG 144 (711) Q Consensus 65 w~~~~~~~~~~~g~p~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~ 144 (711) |..--...+ .+-.+..|+-+.+++......-.-.+.|.|... +..| .+.++. T Consensus 51 w~~~~~~~~---~~~~~~~~l~~~i~~~~A~ll~~e~~~i~v~~~----------------------~~~d---~e~~~~ 102 (518) T protein:vir:78 51 WAQGYVPTV---HDKLMNSGTGNEIVVVAAEYISGKPLSIDVTGV----------------------NGSK---DENLTK 102 (518) T ss_pred cccCCCCcc---ccccccCChHHHHHHHHHHhhcCCCceEEecCc----------------------cccC---cHHHHH Confidence 644211111 112234455555666665554444445554321 1222 233466 Q ss_pred HHHHHHhhcCHHHHHHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCccceeeCCCccccCccccceeeeeecC Q lcl|Aclame:pro 145 LIKNIEYNCDAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTM 224 (711) Q Consensus 145 ~~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~ 224 (711) .+..+.+.++|.....+.++.++..|.+|++++++ +++++|..| ++..|++.. ..-++..| +|..... T Consensus 103 ~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d-------~~~~~i~~v-~ad~~~P~~--~~g~~~~~--~f~~~~~ 170 (518) T protein:vir:78 103 QLKEALRIDNFDSKSVKIVELAGGSGVSAVKINIL-------NGRPSISVH-SSSQFWIDF--KNNEPFRF--NFFEEIP 170 (518) T ss_pred HHHHHHHhccHHHHHHHHHHHhhccCceEEEEEEE-------CCeeEEEEE-cCCeeEEEe--ecCcEEEE--EEEEEee Confidence 66777788999999999999999999999999885 256788887 677887532 22233333 3322111 Q ss_pred CHHHHHHhcCCcccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhh Q lcl|Aclame:pro 225 SKEKFKALYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRT 304 (711) Q Consensus 225 ~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~ 304 (711) . ..-...|--. +........+..|... .-..+.+++....+..+...... T Consensus 171 ~-~~k~~~y~~l---E~he~~~~~~~~~~~~----------~~~I~n~ly~~~~~~~v~~~~~~---------------- 220 (518) T protein:vir:78 171 T-SNKADIYYLV---ESREIKQWDKEGKKLS----------GGFVTYSVIKIDGDKTTPISAER---------------- 220 (518) T ss_pred c-CCcceeEEEE---Eeeccccccceeeccc----------ceeEEEEEeeecCcccccccccc---------------- Confidence 1 0000000000 0000000000000000 00001111111111111000000 Q ss_pred cccceEEEEEEEEecCceeccCccCCCCccceEEEEeeee---ccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcC Q lcl|Aclame:pro 305 RKVKTFKTYWRKITGANVLEGPVEIPSTTIPVIPVWGKSL---IIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAP 381 (711) Q Consensus 305 ~~~~~~~v~~~~~~g~~~le~~~p~~~~~~P~vp~~~~~~---~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~ 381 (711) .......+.....+.+...-..+...||++++.... -..+++.|.|++..+++.++.+|...+++.+.+.. + T Consensus 221 ----~~~~l~~~~~~~~~~e~~~~~tg~~~~~~~~~~n~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~-g 295 (518) T protein:vir:78 221 ----LPEQITSYLHTNDIQLNHSVSIGLKSMGAYLINNSPSNTRYPHLNLGESDLSQCTNYLFAVDYFFTVYMREGEK-T 295 (518) T ss_pred ----cccccccccccccCccceeeccCCccceEEeeccccccccccCCCcCcchHhhhhHHHHHHHHHHHHHHHHHHh-C Confidence 000000000111111110111223456665542211 11356778999999999999999999999999965 7 Q ss_pred CCceEecccccCChHHH-----HhhcccCCCceEEeccccc--C--cCCccccCCccchHHHHHHHHHHHHHHHHHhCCC Q lcl|Aclame:pro 382 KAPFIGSEGNVEGREDE-----WEQANTKNFSLLTYIPQYQ--G--DPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMY 452 (711) Q Consensus 382 ~~~~~~~~~av~~~~~~-----~~~~~~~~~~~i~~~~~~~--~--~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~ 452 (711) ..++.++++.+....+- ..........++.++...+ . ...|+.+++.--...+...++.....+..-.|++ T Consensus 296 ~~~i~v~~~~l~~~~~~~~~~~~~~fd~~~~~y~~i~~~~~~~~~~~~~i~~~~~~Ir~e~~~~~~~~~l~~~~~~~G~s 375 (518) T protein:vir:78 296 KTKIAASERMFRKKVNKSTDKEEWSMNVDEDYFMQFKGTLDAGAKLNDMIQFMQGDFRDGSYRETMEYFAQKAVSKSGYN 375 (518) T ss_pred CceeeechhHhccCCCCCCCccccccCCCCceEEEecCcCCCCCccccceeeeecccChHHHHHHHHHHHHHHHHhhCCC Confidence 88999988776411100 0000011122333332111 1 1125555544344567778888888888889999 Q ss_pred HHHhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhh Q lcl|Aclame:pro 453 DASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDE 532 (711) Q Consensus 453 ~~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~ 532 (711) ..+.|.++...||++|.+..+..-.....+...+..+++++-+.++.+..-++...... T Consensus 376 ~~tfg~~~~~~TATei~s~~~~~~~t~~~~~~~~e~al~~l~~~i~~l~~~~~~~~~~~--------------------- 434 (518) T protein:vir:78 376 PATFNLGNREVKATEIWSLQDATVRKIEKKKRLIQNVYEQMLWDFLYLLTGGTNNKEKA--------------------- 434 (518) T ss_pred hhhcCcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccccc--------------------- Confidence 99999776678999998888777777777777777888888887777766553211000 Q ss_pred hccceeeeehhhheeeeEEeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhc--CCc--chHHHHHHHHhhhc Q lcl|Aclame:pro 533 ESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNM--DWP--GADVIAERLKKIVP 608 (711) Q Consensus 533 ~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~--~~~--~~~e~~~~l~~~~~ 608 (711) .....++++|+=+.+....+.+..+.+.++... +.......++.+ ++. .+++..++++.... T Consensus 435 ----------~~~~~~~v~i~f~D~i~~D~~~~~~~~~~~v~a----GimS~e~~i~~~~~~~~deea~~e~~ri~~E~~ 500 (518) T protein:vir:78 435 ----------IMRDEIRVIIEFPDPMSVNLNELSSTLNNMNSA----LAMSVEEKVKLIHPKWEDEEIQAEVKRIYLENA 500 (518) T ss_pred ----------cCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhc----CCCCHHHHHHHhCCCCCHHHHHHHHHHHHHHhc Confidence 001123333333333333334444444433222 001011112221 222 12223333333322 Q ss_pred chhhcchh-h--hhhhhh Q lcl|Aclame:pro 609 PNVLSKDE-R--EAIEED 623 (711) Q Consensus 609 ~~~~~~~~-~--~~~~~~ 623 (711) ...+..+. . +...+. T Consensus 501 ~~~~~~p~~~~g~~~~~g 518 (518) T protein:vir:78 501 IGEVPDPEAIGGMETKGG 518 (518) T ss_pred ccCCCCCccccCCCCCCC Confidence 11111110 0 111111 No 110 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=99.64 E-value=4.9e-15 Score=99.06 Aligned_cols=471 Identities=10% Similarity=0.028 Sum_probs=235.5 Q ss_pred HHHHHHHHHHHHHHHh-----------------hchHHHHHHHHHHHHhCCCCCCHHHHHH-HHHhCCCceEehhhHHHH Q lcl|Aclame:pro 29 RALLATARERARDGAT-----------------YWKDNWEAAEDDLKFLGGEQWPSQVRTE-RELEQRPCLVNNVLPTFV 90 (711) Q Consensus 29 ~~~~~~~~~~~~~~~~-----------------~~~~~r~~~~~~~~~y~G~Qw~~~~~~~-~~~~g~p~~~~N~i~~~v 90 (711) =.++.+++++|++... -..+.+....+..++|.|++|.=..... -..+.+..+..|+-+.++ T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~I~~w~~~Y~g~~~~~~~~~~~~~~~~~~~~sl~~~~~i~ 80 (517) T protein:vir:98 1 MKVIQRIKNFFKRGGYALSGQTLKSINDHEKINIDPNELARIERNLRQYEGDYPQVEYINSQGKIQERDYMTLNLRKLSA 80 (517) T ss_pred CchHHHHHHHHHHHHHHhcccchhHhhcCCceecCHHHHHHHHHHHHHhcCCCcccccccccccccccceeecCcHHHHH Confidence 2346666666654311 1234455566678889998663211111 112334467788888777 Q ss_pred HHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhcC Q lcl|Aclame:pro 91 DQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESG 170 (711) Q Consensus 91 ~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G 170 (711) ..+.+..-.-.+.|.+...+ ..+.+.......+..+..+.+.|++.....+++++++..| T Consensus 81 ~~~A~Ll~~e~~~i~v~d~~--------------------~~~~~~~~~~~~~e~l~~i~~~n~f~~~~~~~~e~a~a~G 140 (517) T protein:vir:98 81 DVLSGLVFNEQCEVYVSDAK--------------------DEEKKDNSFKTAHEFIQHVFQHNKFIKNLSDYLEPTFALG 140 (517) T ss_pred HHhhhhhcCCcceEEecccc--------------------cccccccchhHHHHHHHHHHHhccHHHHHHHHHHHHhhhC Confidence 77766655555566553321 1111222333455666677778999999999999999999 Q ss_pred ccEEEEEEeeccCCCCCCcceEEEecCccceeeCCCcc-ccCccccceeeeeecCCHHHHHHhcCCcccchhhccccccc Q lcl|Aclame:pro 171 MGYLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAK-KRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSVADY 249 (711) Q Consensus 171 ~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~~Dp~a~-~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~ 249 (711) -|++++++| .+.++|+.| ++..||+ ... .-+...|-++++.... .+. . T Consensus 141 ~~a~k~~~d-------~~~~~I~~v-~ad~~~P--l~~~~~~v~~~ai~~~~~~~-~~~--------~------------ 189 (517) T protein:vir:98 141 GLTVRPYVD-------NGEIEFSWA-LANAFYP--LRSNSNGISEGVMKSVTTKV-IGN--------K------------ 189 (517) T ss_pred CEEEEEEEe-------CCeeEEEEE-cCCeeEE--EEecCCCeEEEEEEEEEEEe-ecC--------C------------ Confidence 999999987 246778777 6777773 211 1122334443332221 000 0 Q ss_pred ccCCCCCeEEEEEeeeeee---------eceeEEEccC----CcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEE Q lcl|Aclame:pro 250 DTWFTEKSVRVSEYFTREP---------VIREIALLSD----GRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRK 316 (711) Q Consensus 250 ~~~~~~~~v~v~E~~~~~~---------~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~ 316 (711) ....+..|+.+... .+.+++...+ |.-+.... + |.- T Consensus 190 -----~~~Yt~lE~H~~~~~~~~~~~y~I~n~ly~s~~~~~lG~~v~L~~-------------------------~-~e~ 238 (517) T protein:vir:98 190 -----TVYYTLLEFHEWEKTEEGESLYVITNELYKSDNEGEIGKRIPLEE-------------------------L-YEG 238 (517) T ss_pred -----ceEEEEEEEEecCceeccCCcEEEEEEEEecCCCccccccccccc-------------------------c-ccC Confidence 00011112111100 0001111000 00000000 0 000 Q ss_pred EecCceeccCccCCCCccceEEEEeeee---ccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccC Q lcl|Aclame:pro 317 ITGANVLEGPVEIPSTTIPVIPVWGKSL---IIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVE 393 (711) Q Consensus 317 ~~g~~~le~~~p~~~~~~P~vp~~~~~~---~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~ 393 (711) ....+.+ .+=..|.+.++..+. ...++++|.|++..++|..+.+|...+++++.+.+ .+.+++++++++. T Consensus 239 l~~~~~~------~g~~~Plf~y~~~p~~N~~~~~splG~S~~~~a~~~~d~lD~~~s~~~~e~~~-g~~~i~vp~~~l~ 311 (517) T protein:vir:98 239 MQEKTYI------QGLSRPLFNYLKPSGFNNINPHSPLGLGITDNSVSTLKKINDTYDQFWWEIKM-GQRTVFVSDVMLR 311 (517) T ss_pred CCcceeE------CCCCcceEEEecCCcccccccCCCCCCchhhhhHHHHHHHHHHHHHHHHHHHh-CCcceecChhhhc Confidence 0000011 111113222211110 12257789999999999999999999999999887 6778889888873 Q ss_pred ChHHH---HhhcccCC-CceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccch-hHHHHH Q lcl|Aclame:pro 394 GREDE---WEQANTKN-FSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNE-TSGRAI 468 (711) Q Consensus 394 ~~~~~---~~~~~~~~-~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~-~sg~ai 468 (711) ...+. .......+ .-++..-.+......++.+++.-...++++.++...+.+....|++....|.++.. .||.+| T Consensus 312 ~~~~~~g~~~~~~~d~~~~~y~~~~~~~~~~~i~~~~~~iR~e~~~~~~~~~L~~i~~~~Gls~~t~~~~~~~~kTATEi 391 (517) T protein:vir:98 312 TVPDESGMPPPQVFDPDVNVYKSIRMGTDEEFVKDVTHDIRTEQYKEAINQALRTLEMELKLSVGTFSFDGRSMKTATEI 391 (517) T ss_pred cccCCCCcccCCCCCcccceeeeccCCCCCCceeeeccccchHHHHHHHHHHHHHHHHHhCCCcccccccccccccHHHH Confidence 21110 00000000 11112112223334566555544456788888999999999999999999977654 478888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh--cCccceEeeecccCcchheecchhhhhhhccceeeeehhhhe Q lcl|Aclame:pro 469 IARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHI--YDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQ 546 (711) Q Consensus 469 ~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~--~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~ 546 (711) .+..+..-.....+...+..+++++.+.++.+..-+ |... . .. T Consensus 392 ~s~~~~~~~t~~~~~~~~~~aL~~lv~~i~~l~~~~~~~~~~---------~--------------------------~~ 436 (517) T protein:vir:98 392 VSENDLTYRTRNDHVYEVEQFIKGLVISVLELAKTYKLFGGE---------I--------------------------PS 436 (517) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC---------C--------------------------CC Confidence 887777777777888888889999999888776543 2110 0 01 Q ss_pred eeeEEeecccChHHHHHHHHHHHHHHHhh--cchhHHHHHHHHHHhcCCc--chHHHHHHHHhhhcchhhcchhhhhhhh Q lcl|Aclame:pro 547 KYDVVVTTGPAFATQRIEAAEAMIQFAQA--VPSAAAVMADLIAQNMDWP--GADVIAERLKKIVPPNVLSKDEREAIEE 622 (711) Q Consensus 547 ~~dv~v~~~~~~~s~r~~~~~~L~~l~~~--~p~~~~~~~~~~~~~~~~~--~~~e~~~~l~~~~~~~~~~~~~~~~~~~ 622 (711) .++++|+=+.+....+++..+.++++... ++.. ..+.+..++. .+++.+.+++.......+....+.+... T Consensus 437 ~~~v~v~f~D~i~~D~~~~~~~~~~~v~aG~ms~~-----~~i~~~~g~~eeeA~~e~~~i~~E~~~~~~~~~~~~~~~~ 511 (517) T protein:vir:98 437 AEHIGVDFDDGVFQDRSALLRFYGQAKTFGFIPTV-----EAIQRIFKVPKKTAEQWLEEIRKDQIELDPVTISQRAQKR 511 (517) T ss_pred CcceEEEcCCCCCCCHHHHHHHHHHHHhcCCCCHH-----HHHHHhCCCChHHHHHHHHHHHHhccccCCCCccccccCC Confidence 12333333333333333444444443322 1211 1122222332 1223333333322221111111100000 Q ss_pred hHHHHH Q lcl|Aclame:pro 623 DMPEQT 628 (711) Q Consensus 623 ~~~~~q 628 (711) .....+ T Consensus 512 ~~gd~e 517 (517) T protein:vir:98 512 MFGDEE 517 (517) T ss_pred CCCCCC Confidence 000000 No 111 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=99.63 E-value=6.8e-15 Score=98.24 Aligned_cols=441 Identities=12% Similarity=0.017 Sum_probs=196.8 Q ss_pred CCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCC---ceEehhhHHHHHHHhhhh Q lcl|Aclame:pro 21 YAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRP---CLVNNVLPTFVDQVLGDQ 97 (711) Q Consensus 21 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p---~~~~N~i~~~v~~i~g~~ 97 (711) .+. ....+++..+...+. ..+.+..+-.+||.|++=-...........+. .++.|..+-+|+..+|+. T Consensus 1 ~~~--~t~~~~~~~l~~~~~-------~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l 71 (456) T protein:vir:79 1 MTA--STPAEWLPVLTKRID-------DGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRI 71 (456) T ss_pred CCC--CCHHHHHHHHHHHHH-------HHHHHHHHHHHHHhccCChhhcCcccChhhchhhhhhhcchHHHHHHHHHhhh Confidence 222 233334555554322 23334456689999975111100011111111 256899999999999886 Q ss_pred hhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCccEEEEE Q lcl|Aclame:pro 98 RQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESGMGYLRVR 177 (711) Q Consensus 98 ~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~ 177 (711) .-+...+. ...|.+..+. +..+++.|+++...+.+..+++++|.+|.-++ T Consensus 72 ~~~g~~~~--------------------------~~~d~~~~~~----~~~~~~~n~~d~~~~~~~~~a~~~G~a~~~~~ 121 (456) T protein:vir:79 72 IPNGITVG--------------------------GSADSDLALR----ARRIWRDNRMDSVCKQWVKYGLDFGESYLTCW 121 (456) T ss_pred ccCCeecC--------------------------CCCCccHHHH----HHHHHHhcChhHHHHHHHHHHhhcCeeEEEEe Confidence 54422211 1223333332 34556779999999999999999999987665 Q ss_pred EeeccCCCCCCcceEEEecCcccee--eCCCccccCccccceeeeeecCCHHHHHHhcCCcccchhhcccccccccCCCC Q lcl|Aclame:pro 178 SDYLADDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSVADYDTWFTE 255 (711) Q Consensus 178 ~d~~~~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~ 255 (711) .+ + ++++++..+ +|.+++ |||.... ....+++ .+-+.++ . ......|..+ T Consensus 122 ~~---e---dg~~~i~~~-~p~~~~~i~d~~~~~----~~~~~~~-~~~~~d~----~------------~~~~~~~~~~ 173 (456) T protein:vir:79 122 RR---D---DGTATITAD-SPETMVVSVDPLQPW----RIRSAMR-WWRDLDA----E------------SDFAIVWSGD 173 (456) T ss_pred eC---C---CCceEEEEe-ccceeEEEEcCCCCC----ceEEEEE-EEEecCC----c------------eeEEEEEcCC Confidence 43 2 256677666 677654 6664432 1112211 1111100 0 0000111112 Q ss_pred CeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEEecCceeccCccCCCCccc Q lcl|Aclame:pro 256 KSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGPVEIPSTTIP 335 (711) Q Consensus 256 ~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~le~~~p~~~~~~P 335 (711) ..+....+|+....... .......+.-......|...+..| T Consensus 174 ~~~~~~~~~~~~~~~~~---------------------------------------~~~~~~~~~~~~~~~~~~~~~~~p 214 (456) T protein:vir:79 174 GWQKFARPCFVQSSSRR---------------------------------------RLVTRISDSWVPVGDAVVTGSPPP 214 (456) T ss_pred ceEEEEEEEEeeccccc---------------------------------------eeeeccCCceeecccccCCCCcee Confidence 22221111111100000 000000011111122233445666 Q ss_pred eEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCC---hHH------HHhhcccCC Q lcl|Aclame:pro 336 VIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEG---RED------EWEQANTKN 406 (711) Q Consensus 336 ~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~---~~~------~~~~~~~~~ 406 (711) ++++ . ...+.|.+..+++.++.+|+.++.+...+...+.+...+. |.-.. .++ ........+ T Consensus 215 vv~~---~-----N~~~~gd~e~v~~liD~~~~~~s~~~~~~~~~a~~~~~~~-G~~~~~~~~d~~g~~i~~~~~~~~~~ 285 (456) T protein:vir:79 215 VVVY---Q-----NPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALK-SSEHRLPKVDENGNAIDYASIFEAAP 285 (456) T ss_pred EEEe---c-----CCCCCchhhhhHHHHHHHHHHHHHHHHHHHHHhhHHHHHh-cCCcccccccccccccchhhhhhhhc Confidence 6543 1 2345788999999999999999998777666555544432 21100 000 001111233 Q ss_pred CceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 407 FSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNL 486 (711) Q Consensus 407 ~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~ 486 (711) +.++...++ ..+..++..+ ...+...+......+-..||+++...|...++.||.|+..+...-..........| T Consensus 286 ~~~~~~~~~----~~~~q~~~~~-~~~~~~~l~~~i~~i~~~t~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f 360 (456) T protein:vir:79 286 GALWELPPG----VDIWESQTND-FTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIA 360 (456) T ss_pred cccccCCCC----cceeeecccC-hHHHHHHHHHHHHHHHhhcCCChhHhcccccCcHHHHHHHHHHHHHHHHHHHHHHH Confidence 444333322 2233333333 34466667777777778889999999977777899999888766666666666677 Q ss_pred HHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEeecccChHHHHHHHH Q lcl|Aclame:pro 487 TKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAA 566 (711) Q Consensus 487 ~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~ 566 (711) ..+++++.++++. +.+.. +...+ +++=.. +...+ ..+.. T Consensus 361 ~~~l~~~~~l~~~----~~g~~------------~~~~i-----------------------~v~w~~-~~~~s-~~~~a 399 (456) T protein:vir:79 361 KIGLEAILVKALQ----IEGES------------VEDTV-----------------------DVSFES-PDRVT-LGEKY 399 (456) T ss_pred HHHHHHHHHHHHH----hcCCC------------ccccc-----------------------eEEeCC-CCCcC-HHHHH Confidence 7777776665543 32210 00000 000000 00111 12223 Q ss_pred HHHHHHHhh-cchhHHHHHHHHHHhcCCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 567 EAMIQFAQA-VPSAAAVMADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQADM 645 (711) Q Consensus 567 ~~L~~l~~~-~p~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~~~ 645 (711) +.+..+.+. .+.. ...++.+++.. +++ ...+.+....+ ... + +..-. T Consensus 400 da~~kl~~~G~~~~-----~~~~~~lg~~~-~~i------------------~~~e~~r~~~e-----~~~-~--~~~~~ 447 (456) T protein:vir:79 400 SAASLAKAAGESWA-----SIRRNILNYNA-DQI------------------KQDDLDRAREQ-----ITL-F--AGNPV 447 (456) T ss_pred HHHHHHHhcCCChH-----HHHHhcCCCCH-HHH------------------HHHHHHHHHHH-----HHH-H--hhhHh Confidence 333333222 1110 11111222110 000 00000000000 000 0 00000 Q ss_pred HHHHHHHHH Q lcl|Aclame:pro 646 AQAEADTAQ 654 (711) Q Consensus 646 ~k~qae~~~ 654 (711) +..+.+..+ T Consensus 448 ~~~~~~~~~ 456 (456) T protein:vir:79 448 QRPQEDGSR 456 (456) T ss_pred hcCCCCCCC Confidence 001111111 No 112 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=99.63 E-value=5.2e-15 Score=98.91 Aligned_cols=461 Identities=12% Similarity=0.012 Sum_probs=201.8 Q ss_pred CCcCCCCCCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHH----HHHHHHHh Q lcl|Aclame:pro 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQ----VRTERELE 76 (711) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~----~~~~~~~~ 76 (711) |-. |+-+-..-+..-+--.++....+.++...+.. .+..-.+-.+||+|+|.... ....++.. T Consensus 1 ~~~------~~~~~~~~~~~~~~l~~~e~~~i~~L~~~~~~-------~~~r~~~l~~YY~G~~~i~~~~~~~p~~~~~~ 67 (504) T protein:vir:99 1 MTE------ETTSASKFTFRIPELNDDVVDKVNGLYQQLVD-------RTPRNLLRASFYDGKYAIRQIGNLIPPEYLRT 67 (504) T ss_pred CCc------cCCcccccccccCCCCHHHHHHHHHHHHHHHH-------HhHHHHHHHHHHhccccchhccccccHHHHHH Confidence 322 22211111111111123444556666554433 23344455789999875322 22222211 Q ss_pred CCCceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHH Q lcl|Aclame:pro 77 QRPCLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAE 156 (711) Q Consensus 77 g~p~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~ 156 (711) ..+.|..+-+|+.+.....-+ -|+ .+++.+. ...+..+++.|+++ T Consensus 68 ---~~v~n~~~~iVd~~a~rl~~~----Gf~------------------------~~d~~~~----~~~l~~i~~~N~ld 112 (504) T protein:vir:99 68 ---ATVLGWSAKAVDTLARRCNLE----SFV------------------------WPDGDYG----SIGGPDVWDENFFA 112 (504) T ss_pred ---hhccCcHHHHHHHHHhhhccc----eee------------------------CCCCChh----hHHHHHHHHhcChh Confidence 256799998888876432111 011 0122222 22356678899999 Q ss_pred HHHHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCcccee--eCCCccccCccccceeeeeecCCHHHHHHhcC Q lcl|Aclame:pro 157 TEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDTMSKEKFKALYP 234 (711) Q Consensus 157 ~~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p 234 (711) ...+.+..+++++|++|+-|+.+. + ....+.|+.+ +|.+++ |||....+ ...+++.. T Consensus 113 ~~~~~~~~~a~iyG~af~~v~~~~---d-~~~~~~I~~~-sP~~~~~iyD~~~~~~-----~~a~~~~~----------- 171 (504) T protein:vir:99 113 TKANNAMVSSLIHGPAFLINTEGG---A-GEPDSLIHVK-SAMQATGEWNSRRNAM-----DSLLSITS----------- 171 (504) T ss_pred hHHHHHHHHHHhhCceeEEEecCC---C-CCceeEEEEe-ccceeEEEEeCCCCce-----eEEEEEEE----------- Confidence 999999999999999998776442 1 1124556655 777664 88754322 11111110 Q ss_pred CcccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEE Q lcl|Aclame:pro 235 DATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYW 314 (711) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~ 314 (711) .+.+ .....+++|+.. .++.+.. T Consensus 172 ----------------~d~~-g~~~~~~~y~~~------------~~~~~~~---------------------------- 194 (504) T protein:vir:99 172 ----------------RDAE-GHPTGIALYEDG------------VTVTADM---------------------------- 194 (504) T ss_pred ----------------ecCC-CeEEEEEEEcCC------------cEEEEEE---------------------------- Confidence 0000 112223333221 1111100 Q ss_pred EEEecCceeccCccCCCCccceEEEEeeeeccCCcccccc-hHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccC Q lcl|Aclame:pro 315 RKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRS-IIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVE 393 (711) Q Consensus 315 ~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g-~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~ 393 (711) ..+.....+..|.+.| .|+|||+-.+. .....|.| +.+.+++.++.+|+.++.++-.....+.++..+- |+-. T Consensus 195 --~~~~~~~~~~~~~~~g-vPvV~~~n~~~--~~~~~G~sei~~~v~~l~Da~~~~~~~~~~~~e~~a~p~r~i~-G~~~ 268 (504) T protein:vir:99 195 --DDDGDWHADVRTHKLG-VPVEVLPYKPR--EDRPLGSSRITRPVMSLQQRALKGCIRMDGHADVYSFPQLILL-GADA 268 (504) T ss_pred --cCCceeeeccccCCCC-cceEEeccccc--CccccCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhc-cCCc Confidence 0011111123344455 78888753322 22333333 4468999999999999999988887777765542 2211 Q ss_pred C--------hHHHHhhcccCCCceEEecccccC------cCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccc Q lcl|Aclame:pro 394 G--------REDEWEQANTKNFSLLTYIPQYQG------DPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAM 459 (711) Q Consensus 394 ~--------~~~~~~~~~~~~~~~i~~~~~~~~------~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~ 459 (711) . ....|. ...+.++.+.+.... ...+..++...+ ..+...+......+-.+||+++..+|.. T Consensus 269 ~~~~~~d~~~~~~~~---~~~~~i~~~~~~~~~~~~~~~~~~~~q~~~~~l-~~~~~~l~~~i~~~a~~t~~P~~~lG~~ 344 (504) T protein:vir:99 269 KNFRNKDGSMKPAWQ---IALARVFALPDDEDEPDAARARADVKQFPASSP-QPHIEMLEQIAMMFSGETSIPVESLGFS 344 (504) T ss_pred cccccccccccchhh---hhhhhhhcCCCccccccccCccceeeecCCCCh-HHHHHHHHHHHHHHHhhhCCCHHHhccc Confidence 0 001111 112333333222111 122333333322 2344455555555555699999999965 Q ss_pred c--chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC-ccceEeeecccCcchheecchhhhhhhccc Q lcl|Aclame:pro 460 G--NETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYD-TERVVRLKFPDETEDFVKLNEQIFDEESGE 536 (711) Q Consensus 460 ~--~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~-~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~ 536 (711) + |+.||.|+......-........+-|..+.++++++.+.+....-. .....++. +.|.. T Consensus 345 ~~~n~sSa~Ai~~~~~~L~~ka~~k~~~f~~~l~~~~rla~~~~~~~~~~~~~~~~~~--------v~w~d--------- 407 (504) T protein:vir:99 345 NRANPTSADAYIASREDLIAEAEGATDDWSPAFRRSMIRALAIKNGLDRIPPEWKTID--------SKFRS--------- 407 (504) T ss_pred ccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccce--------eEecC--------- Confidence 4 5679999988766666666667777777888888877655433210 00000000 00100 Q ss_pred eeeeehhhheeeeEEeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCCcchHHHHHHH---Hhhhc----- Q lcl|Aclame:pro 537 WVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDWPGADVIAERL---KKIVP----- 608 (711) Q Consensus 537 ~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~~~~~e~~~~l---~~~~~----- 608 (711) +...+ ..+..+.+..+.+..+.+.. .-..+++.+++. .+++.+.. ++... T Consensus 408 -------------------~~~~s-~a~~aDa~~Kl~~ag~~l~~-~~~~l~~~lg~~-~~ei~r~~~e~~~~~~~~~~~ 465 (504) T protein:vir:99 408 -------------------PLYLS-KAAQADAGAKMLGAGPEWLK-ETEVGLELLGLT-PQQAKRALAERRRASSVSIIE 465 (504) T ss_pred -------------------CCccC-HHHHHHHHHHHHhhcccccc-chHHHHhhcCCC-HHHHHHHHHHHHHHhhHHHHH Confidence 01111 12222333333332211100 001122333332 11111100 00000 Q ss_pred -------ch-hhc--chhhhhhhhh-HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 609 -------PN-VLS--KDEREAIEED-MPEQTEPTPEQQVEMAKSQADMAQ 647 (711) Q Consensus 609 -------~~-~~~--~~~~~~~~~~-~~~~q~~~~~~q~~~~~~q~~~~k 647 (711) .+ ... .+........ .+-..-..+.+. - T Consensus 466 ~l~~~~~~~~~~~~~~~~~~~e~a~~~~~~~~~~p~~~-----------~ 504 (504) T protein:vir:99 466 ALNRRQQEAATAGEDQDQGAGEPPANEPPAALGRPTLV-----------G 504 (504) T ss_pred HHhcccCCCCCCCCCCCcCCCCCCCCCCCccCCCcccC-----------C Confidence 00 000 0000000000 000000000000 0 No 113 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=99.61 E-value=1.9e-14 Score=95.79 Aligned_cols=456 Identities=10% Similarity=0.047 Sum_probs=226.8 Q ss_pred HHHHHHHHHHHHHHHh-----------------hchHHHHHHHHHHHHhCCCCCCHHHHHHH-HHhCCCceEehhhHHHH Q lcl|Aclame:pro 29 RALLATARERARDGAT-----------------YWKDNWEAAEDDLKFLGGEQWPSQVRTER-ELEQRPCLVNNVLPTFV 90 (711) Q Consensus 29 ~~~~~~~~~~~~~~~~-----------------~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~-~~~g~p~~~~N~i~~~v 90 (711) =.++.+++++|+.... ...+.........+||.|+.+.-.....- .-..+..+..|+-+.++ T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~ 80 (500) T protein:vir:30 1 MGVIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKRDLNHLPIARTAA 80 (500) T ss_pred CchHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccCceeecchHHHHH Confidence 2245555555544221 12334455667789999974422111000 00123346778888888 Q ss_pred HHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhcC Q lcl|Aclame:pro 91 DQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESG 170 (711) Q Consensus 91 ~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G 170 (711) +...+..-.-.+.+.+ +|... +..+..+.+.|+|.....++++.++..| T Consensus 81 ~~~A~lv~~e~~~i~~---------------------------~d~~~----~~~l~~il~~n~f~~~~~~~~e~a~a~G 129 (500) T protein:vir:30 81 KKIASLVFNEQAEIKV---------------------------DDDAA----NEFISETLKNDRFNKNFERYLESCLALG 129 (500) T ss_pred HHHhhhhcCCcceEec---------------------------CChHH----HHHHHHHHhhccHHHHHHHHHHHHhhcC Confidence 8777765444333332 23344 4455556667999999999999999999 Q ss_pred ccEEEEEEeeccCCCCCCcceEEEecCccceeeCCCcc-ccCccccceeeeeecCCHHHHHHhcCCcccchhhccccccc Q lcl|Aclame:pro 171 MGYLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAK-KRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSVADY 249 (711) Q Consensus 171 ~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~~Dp~a~-~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~ 249 (711) .|++++++|. +.|.|+.| ++..|++ -.. .-+...+-++++.... . + T Consensus 130 ~~~~k~~~d~-------~~~~I~~v-~ad~~~P--~~~d~~~~~~~a~~~~~~~~-~--------~-------------- 176 (500) T protein:vir:30 130 GLAMRPYVDG-------DKVRVAFV-QAPVFLP--LQSNTQDVSSAAVVIKSVKT-I--------N-------------- 176 (500) T ss_pred CEEEEEEEeC-------CceEEEEE-cCCeeEE--EEEcCCCeEEEEEEEEEeee-e--------c-------------- Confidence 9999999862 45778776 7878773 111 1122233333221100 0 0 Q ss_pred ccCCCCCeEEEEEeeeeeeec-----eeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEEecCceec Q lcl|Aclame:pro 250 DTWFTEKSVRVSEYFTREPVI-----REIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLE 324 (711) Q Consensus 250 ~~~~~~~~v~v~E~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~le 324 (711) ......+..|+++..... .+++...+. . ..|..+ ....+ |.-..+.+.+. T Consensus 177 ---~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~-------~--------~lG~~v------~l~~~-~~~l~~~~~~~ 231 (500) T protein:vir:30 177 ---GKEVYYTLIEFHEWQSSDDYVISNELYRSDDK-------A--------KVGSRV------PLSEV-YKDLKDEAKVT 231 (500) T ss_pred ---CCceEEEEEEEEEEeCCceeEEEEEEEecccc-------c--------ccCccc------ccccc-cCCcCcceEec Confidence 001122344554432211 111111000 0 001000 00000 00000111111 Q ss_pred cCccCCCCccceEEEEeeeec-----cCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHHHH Q lcl|Aclame:pro 325 GPVEIPSTTIPVIPVWGKSLI-----IKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEW 399 (711) Q Consensus 325 ~~~p~~~~~~P~vp~~~~~~~-----~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~ 399 (711) + ....||+ ++ +.| ..++++|.|++..+++..+.+|...|.+.+.+.. ...++.++++.+....+-. T Consensus 232 ~-----~~~p~f~-~~--~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~-g~~~i~v~~~~l~~~~~~~ 302 (500) T protein:vir:30 232 D-----VTRPIFT-YL--KTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKM-GQRRVAVPESLTALTVRTT 302 (500) T ss_pred c-----CCCccEE-Ee--cCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHh-CcceeeechHHhcccCCCC Confidence 1 1112222 11 122 2356778899999999999999999999999976 6778888877764211100 Q ss_pred h-----hcccCCC--ceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccch-hHHHHHHHH Q lcl|Aclame:pro 400 E-----QANTKNF--SLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNE-TSGRAIIAR 471 (711) Q Consensus 400 ~-----~~~~~~~--~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~-~sg~ai~~~ 471 (711) . .....++ .++.++.+......++.+++.-...++...++.....+....|++....|.++++ .||.+|.+. T Consensus 303 ~g~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAtei~s~ 382 (500) T protein:vir:30 303 DGDVVPRPRFESDQNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTATEIVSE 382 (500) T ss_pred CccccCCcccCCCcceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHHHHHHH Confidence 0 0001111 1222222222334576665544456677888888888888899999999876654 578888888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh--cCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeee Q lcl|Aclame:pro 472 QRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHI--YDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYD 549 (711) Q Consensus 472 ~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~--~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~d 549 (711) .+..-.....+...+..+++++.+.++.+..-+ +...- ...++ T Consensus 383 ~~~~~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~~~~~-----------------------------------~~~~~ 427 (500) T protein:vir:30 383 NSDTYQMRNSIVALVEQSLKELVISIFEIAKAYDLYQSEV-----------------------------------PSMDN 427 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCC-----------------------------------CCCcc Confidence 877777888888888889999999888765432 21100 01122 Q ss_pred EEeecccChHHHHHHHHHHHHHHHhh--cchhHHHHHHHHHHhcCCcc--hHHHHHHHHhhhcchhhc-chhhhhhhh Q lcl|Aclame:pro 550 VVVTTGPAFATQRIEAAEAMIQFAQA--VPSAAAVMADLIAQNMDWPG--ADVIAERLKKIVPPNVLS-KDEREAIEE 622 (711) Q Consensus 550 v~v~~~~~~~s~r~~~~~~L~~l~~~--~p~~~~~~~~~~~~~~~~~~--~~e~~~~l~~~~~~~~~~-~~~~~~~~~ 622 (711) |+++=+.+.....++..+.++++..+ ++... .+.+.-++.. +.+++.+++....+..-. .+.....-+ T Consensus 428 v~v~f~d~i~~d~~~~~~~~~~~v~aGi~s~~~-----~i~~~~g~~eeea~~~l~~i~~E~~~~~~~~~~~~~~~g~ 500 (500) T protein:vir:30 428 ISISLDDGVFTDRDAELDYWIKVVNAGFGTREM-----AIQKVLNVTEEKAQEIAAEINTGIVDEINQQRTDTHLYGE 500 (500) T ss_pred eEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHH-----HHHhcCCCCHHHHHHHHHHHHHhccccCCCCCccccccCC Confidence 33322222222233333344443322 22111 1122222221 222233332221111000 000000000 No 114 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=99.61 E-value=1.9e-14 Score=95.79 Aligned_cols=456 Identities=10% Similarity=0.047 Sum_probs=226.8 Q ss_pred HHHHHHHHHHHHHHHh-----------------hchHHHHHHHHHHHHhCCCCCCHHHHHHH-HHhCCCceEehhhHHHH Q lcl|Aclame:pro 29 RALLATARERARDGAT-----------------YWKDNWEAAEDDLKFLGGEQWPSQVRTER-ELEQRPCLVNNVLPTFV 90 (711) Q Consensus 29 ~~~~~~~~~~~~~~~~-----------------~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~-~~~g~p~~~~N~i~~~v 90 (711) =.++.+++++|+.... ...+.........+||.|+.+.-.....- .-..+..+..|+-+.++ T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~ 80 (500) T protein:vir:98 1 MGVIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKRDLNHLPIARTAA 80 (500) T ss_pred CchHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccCceeecchHHHHH Confidence 2245555555544221 12334455667789999974422111000 00123346778888888 Q ss_pred HHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhcC Q lcl|Aclame:pro 91 DQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESG 170 (711) Q Consensus 91 ~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G 170 (711) +...+..-.-.+.+.+ +|... +..+..+.+.|+|.....++++.++..| T Consensus 81 ~~~A~lv~~e~~~i~~---------------------------~d~~~----~~~l~~il~~n~f~~~~~~~~e~a~a~G 129 (500) T protein:vir:98 81 KKIASLVFNEQAEIKV---------------------------DDDAA----NEFISETLKNDRFNKNFERYLESCLALG 129 (500) T ss_pred HHHhhhhcCCcceEec---------------------------CChHH----HHHHHHHHhhccHHHHHHHHHHHHhhcC Confidence 8777765444333332 23344 4455556667999999999999999999 Q ss_pred ccEEEEEEeeccCCCCCCcceEEEecCccceeeCCCcc-ccCccccceeeeeecCCHHHHHHhcCCcccchhhccccccc Q lcl|Aclame:pro 171 MGYLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAK-KRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSVADY 249 (711) Q Consensus 171 ~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~~Dp~a~-~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~ 249 (711) .|++++++|. +.|.|+.| ++..|++ -.. .-+...+-++++.... . + T Consensus 130 ~~~~k~~~d~-------~~~~I~~v-~ad~~~P--~~~d~~~~~~~a~~~~~~~~-~--------~-------------- 176 (500) T protein:vir:98 130 GLAMRPYVDG-------DKVRVAFV-QAPVFLP--LQSNTQDVSSAAVVIKSVKT-I--------N-------------- 176 (500) T ss_pred CEEEEEEEeC-------CceEEEEE-cCCeeEE--EEEcCCCeEEEEEEEEEeee-e--------c-------------- Confidence 9999999862 45778776 7878773 111 1122233333221100 0 0 Q ss_pred ccCCCCCeEEEEEeeeeeeec-----eeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEEecCceec Q lcl|Aclame:pro 250 DTWFTEKSVRVSEYFTREPVI-----REIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLE 324 (711) Q Consensus 250 ~~~~~~~~v~v~E~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~le 324 (711) ......+..|+++..... .+++...+. . ..|..+ ....+ |.-..+.+.+. T Consensus 177 ---~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~-------~--------~lG~~v------~l~~~-~~~l~~~~~~~ 231 (500) T protein:vir:98 177 ---GKEVYYTLIEFHEWQSSDDYVISNELYRSDDK-------A--------KVGSRV------PLSEV-YKDLKDEAKVT 231 (500) T ss_pred ---CCceEEEEEEEEEEeCCceeEEEEEEEecccc-------c--------ccCccc------ccccc-cCCcCcceEec Confidence 001122344554432211 111111000 0 001000 00000 00000111111 Q ss_pred cCccCCCCccceEEEEeeeec-----cCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHHHH Q lcl|Aclame:pro 325 GPVEIPSTTIPVIPVWGKSLI-----IKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEW 399 (711) Q Consensus 325 ~~~p~~~~~~P~vp~~~~~~~-----~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~ 399 (711) + ....||+ ++ +.| ..++++|.|++..+++..+.+|...|.+.+.+.. ...++.++++.+....+-. T Consensus 232 ~-----~~~p~f~-~~--~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~-g~~~i~v~~~~l~~~~~~~ 302 (500) T protein:vir:98 232 D-----VTRPIFT-YL--KTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKM-GQRRVAVPESLTALTVRTT 302 (500) T ss_pred c-----CCCccEE-Ee--cCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHh-CcceeeechHHhcccCCCC Confidence 1 1112222 11 122 2356778899999999999999999999999976 6778888877764211100 Q ss_pred h-----hcccCCC--ceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccch-hHHHHHHHH Q lcl|Aclame:pro 400 E-----QANTKNF--SLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNE-TSGRAIIAR 471 (711) Q Consensus 400 ~-----~~~~~~~--~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~-~sg~ai~~~ 471 (711) . .....++ .++.++.+......++.+++.-...++...++.....+....|++....|.++++ .||.+|.+. T Consensus 303 ~g~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAtei~s~ 382 (500) T protein:vir:98 303 DGDVVPRPRFESDQNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTATEIVSE 382 (500) T ss_pred CccccCCcccCCCcceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHHHHHHH Confidence 0 0001111 1222222222334576665544456677888888888888899999999876654 578888888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh--cCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeee Q lcl|Aclame:pro 472 QRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHI--YDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYD 549 (711) Q Consensus 472 ~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~--~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~d 549 (711) .+..-.....+...+..+++++.+.++.+..-+ +...- ...++ T Consensus 383 ~~~~~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~~~~~-----------------------------------~~~~~ 427 (500) T protein:vir:98 383 NSDTYQMRNSIVALVEQSLKELVISIFEIAKAYDLYQSEV-----------------------------------PSMDN 427 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCC-----------------------------------CCCcc Confidence 877777888888888889999999888765432 21100 01122 Q ss_pred EEeecccChHHHHHHHHHHHHHHHhh--cchhHHHHHHHHHHhcCCcc--hHHHHHHHHhhhcchhhc-chhhhhhhh Q lcl|Aclame:pro 550 VVVTTGPAFATQRIEAAEAMIQFAQA--VPSAAAVMADLIAQNMDWPG--ADVIAERLKKIVPPNVLS-KDEREAIEE 622 (711) Q Consensus 550 v~v~~~~~~~s~r~~~~~~L~~l~~~--~p~~~~~~~~~~~~~~~~~~--~~e~~~~l~~~~~~~~~~-~~~~~~~~~ 622 (711) |+++=+.+.....++..+.++++..+ ++... .+.+.-++.. +.+++.+++....+..-. .+.....-+ T Consensus 428 v~v~f~d~i~~d~~~~~~~~~~~v~aGi~s~~~-----~i~~~~g~~eeea~~~l~~i~~E~~~~~~~~~~~~~~~g~ 500 (500) T protein:vir:98 428 ISISLDDGVFTDRDAELDYWIKVVNAGFGTREM-----AIQKVLNVTEEKAQEIAAEINTGIVDEINQQRTDTHLYGE 500 (500) T ss_pred eEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHH-----HHHhcCCCCHHHHHHHHHHHHHhccccCCCCCccccccCC Confidence 33322222222233333344443322 22111 1122222221 222233332221111000 000000000 No 115 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=99.59 E-value=1e-13 Score=91.79 Aligned_cols=459 Identities=10% Similarity=-0.006 Sum_probs=193.1 Q ss_pred CCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHH----HHHHHhCCCceEehh Q lcl|Aclame:pro 10 VEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVR----TERELEQRPCLVNNV 85 (711) Q Consensus 10 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~----~~~~~~g~p~~~~N~ 85 (711) |=++ |+ ...+.+++...+..++-..+ .....+..+-.+||.|++.-.... ......-.-.++.|. T Consensus 1 ~~~~--p~------~~l~~~~~~~~~~~~l~~~~---~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~n~ 69 (479) T protein:vir:99 1 MIDL--PD------EDLSSEGLAKYLETKVFPKM---NTECERLDDFEAWTKNGQEVPDLATRHKNKEREVLQQLSRKPW 69 (479) T ss_pred CccC--Cc------ccCChhHHHHHHHHHHHHHH---HHHhHHHHHHHHHHhcCCcccccccccCChhHHHHHHHhhcCc Confidence 2222 11 11222222222221111111 122334445678999987532110 000000000135788 Q ss_pred hHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHH Q lcl|Aclame:pro 86 LPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQG 165 (711) Q Consensus 86 i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~ 165 (711) .+-+|+..++... ++ .|+ ..|.+..+ .+..+++.|+++.....+..+ T Consensus 70 ~~~iVd~~~~~l~---~~-gf~-------------------------~~d~~~~~----~~~~i~~~N~~d~~~~~~~~~ 116 (479) T protein:vir:99 70 MGLMVNSFAQQLI---VD-GYR-------------------------KTGTNENA----KGWDTWRLNQMDKQQFWLNRA 116 (479) T ss_pred HHHHHHHHHhhcc---cc-ccc-------------------------CCCchhhH----HHHHHHHhcChhHHHHHHHHH Confidence 8888888776431 00 111 11222222 235567789999999999999 Q ss_pred HHhcCccEEEEEEeeccCCCCCCcceEEEecCcccee--eCCCccccCccccceeeeeecCCHHHHHHhcCCcccchhhc Q lcl|Aclame:pro 166 AVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYE 243 (711) Q Consensus 166 ~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~ 243 (711) ++++|.+|+-|+..... ...++.+++..+ +|.+++ ||...+ +. -..|... . T Consensus 117 a~~~G~af~~v~~~~~~-~d~~g~~~i~~~-~p~~~~~iydd~~~--~~-~~~~~~~-----~----------------- 169 (479) T protein:vir:99 117 VLTFGYAFIKVTSGISP-LDGTTVARIKCI-DPRDAFAIWEDPYW--DE-WPKYLLE-----R----------------- 169 (479) T ss_pred HhhcCceEEEEecCCCC-cCCCCceEEEEe-chhheEEEecCCcc--cc-eeeEEEe-----e----------------- Confidence 99999998766532111 122356677666 787765 443221 10 0111000 0 Q ss_pred ccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEE-ecCce Q lcl|Aclame:pro 244 DSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKI-TGANV 322 (711) Q Consensus 244 ~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~-~g~~~ 322 (711) .... ...+|... .++++.. .|... T Consensus 170 ----------~~~~--~~~~~~~~-------------------------------------------~~~~~~~~~~~~~ 194 (479) T protein:vir:99 170 ----------QPNG--QYWWWTEE-------------------------------------------DYSIFEFKQGKFI 194 (479) T ss_pred ----------cCce--eEEEEecc-------------------------------------------eEEEEEecCCcee Confidence 0000 00111000 0011111 11222 Q ss_pred eccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccC-ChHH-HHh Q lcl|Aclame:pro 323 LEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVE-GRED-EWE 400 (711) Q Consensus 323 le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~-~~~~-~~~ 400 (711) +.++.|.+.|.+|++||...+. ....|.|.+..+++.++.+|+.+|.+...+...+.+..++. |... ...+ ... T Consensus 195 ~~~~~~h~~g~vPvv~f~n~~~---~~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~-G~~~~~~~~~~~~ 270 (479) T protein:vir:99 195 YRETVSHDYGHIPFVRYVNVMD---LRGVCYGDVEPLVTVAKAIDKTGLDILLVQHHQSFQIRWAT-GLMLPEGANADQE 270 (479) T ss_pred eccccccCCCCcceEEeecCCC---cCcCCcchhHHHHHHHHHHHHHHHHHHHHHHHhhchhhhhc-CCCcccccccchh Confidence 2344555567788888754432 22457899999999999999999999988888888765553 3211 1100 000 Q ss_pred hcccCCCceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 401 QANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSF 480 (711) Q Consensus 401 ~~~~~~~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~ 480 (711) ......+.++...+ . +..+..++... ...+...++.....+-.+||+++...|..+| .||.|+..+...-..... T Consensus 271 ~~~~~~~~i~~~~~-~--~~~~~q~~~~~-~~~~~~~l~~~i~~i~~~t~~p~~~~g~~~n-~Sg~Al~~~~~~l~~ka~ 345 (479) T protein:vir:99 271 KMRFAQESMLISQN-E--KASFGAIPAAP-LDGLLNAYKESLLEFLALAQLPPHIAGQIVN-VAADALAAGTRQTMQKLF 345 (479) T ss_pred ccccccccceeecC-C--CceEEEecccc-hHHHHHHHHHHHHHHhccCCCCHHHcccccc-hHHHHHHHHHHHHHHHHH Confidence 01122233443322 1 22233333222 2344455555555555567888899996555 799999887666666666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEeecc-cChH Q lcl|Aclame:pro 481 AFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTG-PAFA 559 (711) Q Consensus 481 ~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~-~~~~ 559 (711) ...+.|..++++++++++.+. |.....+.+. +.+.=. +.+. T Consensus 346 ~~~~~f~~al~~~~~l~~~~~-------------~~~~~~~~~~-------------------------i~~~w~~~~~~ 387 (479) T protein:vir:99 346 EKQATWKASHNQTMRLVNKIE-------------GRTEEATDLD-------------------------FTITWQDVTIQ 387 (479) T ss_pred HHHHHHHHHHHHHHHHHHHHc-------------CCCcccccee-------------------------eeEEecCCCCC Confidence 666666667777666654432 1111111111 111000 0111 Q ss_pred HHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 560 TQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMA 639 (711) Q Consensus 560 s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~ 639 (711) + ..+..+.+..+.++. -+....+++.+++-...++ ++++........ ..+ ..+.... T Consensus 388 s-~~~~ad~~~kl~~ag----~is~et~l~~l~gv~~~~~-e~~~~~~~~~~~--------------~~~---~~~~~~~ 444 (479) T protein:vir:99 388 S-LAQFADAWAKMVESL----KIPAEGVWDMIPNLDQSTV-NGWKEIYDREGD--------------FGK---YMRKLQN 444 (479) T ss_pred C-HHHHHHHHHHHHhcC----CCCHHHHHHhcCCCCHHHH-HHHHHHHHHHHH--------------HHH---HHHHHhc Confidence 1 112333333333220 0111122233211111111 111100000000 000 0000000 Q ss_pred -HHHHHHH---HHHHHHHHH------HHHHHHHHH Q lcl|Aclame:pro 640 -KSQADMA---QAEADTAQA------QADMLKAQL 664 (711) Q Consensus 640 -~~q~~~~---k~qae~~~a------qae~~~~q~ 664 (711) ...+++. .-..+.+.+ =|..-++-+ T Consensus 445 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 479 (479) T protein:vir:99 445 GPDPAEQRGGPNGATNMQQANNKTGEPASLNKSGA 479 (479) T ss_pred ccCcccccCCCCCCCCCCCCCCCCcchhccCCCCC Confidence 0000000 000000000 000001111 No 116 >protein:vir:101494 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655388;genbank:gi:109522576;genbank:GeneID:4157566 Probab=99.57 E-value=1.4e-13 Score=91.11 Aligned_cols=502 Identities=10% Similarity=-0.019 Sum_probs=236.9 Q ss_pred CCcCCCCCCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHH-HHHHHHHHHhCCC--CCCHHHHHHHHHhC Q lcl|Aclame:pro 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNW-EAAEDDLKFLGGE--QWPSQVRTERELEQ 77 (711) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r-~~~~~~~~~y~G~--Qw~~~~~~~~~~~g 77 (711) -|+|+|.++.+= .......|-- ..+ ..+.| .....-.+||.|+ +|.. .+..-.+++ T Consensus 3 ~~~~~~~~~~~~--~~g~~~~p~~-v~~-----------------~d~~Rl~aY~l~~~~y~n~~~~~~~-~lrg~~~~~ 61 (527) T protein:vir:10 3 QDKRQYGSTQQL--RAGEANFPNA-VTD-----------------FDKARLASYRLYEDMYLTNTSDYQV-ILRGGDEGD 61 (527) T ss_pred ccccccCCCcCc--CCccccCccc-CCH-----------------HHHHHHHHHHHHHHHhcCchhheee-ecCCccccc Confidence 467887665432 1222222111 111 11111 1122446788875 6642 222223445 Q ss_pred CCceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHH Q lcl|Aclame:pro 78 RPCLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAET 157 (711) Q Consensus 78 ~p~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~ 157 (711) +-++.++ .-..|+|..- .+.+-+- +..+...++-+..+++...+.++... T Consensus 62 ~r~~~~p----s~~~~~~~~~----~~~~~g~----------------------~~~~~~~~e~v~~~lr~~~~~e~l~~ 111 (527) T protein:vir:10 62 QRPIYVP----NGEKLIEAKM----RFLGQGL----------------------KWEFSKKDAKVDDAIKVLFDRENWEQ 111 (527) T ss_pred cceeeeh----hhHHhhCCcc----eeeccCc----------------------cccccchhHHHHHHHHHHHHHhhhHH Confidence 5566664 3356666421 1111111 11223344556777788888899999 Q ss_pred HHHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCccceeeCCCccccCccccceeeeee----cCCHHHHHHhc Q lcl|Aclame:pro 158 EYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDD----TMSKEKFKALY 233 (711) Q Consensus 158 ~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~----~~~~~e~~~~~ 233 (711) .+-.+-.++++-|-|++++.||..... +++|.+..+ ||.-+| |- ..+ .+.+++..++ |-..++-++-+ T Consensus 112 ~~~~~~r~~~vlGDg~f~l~wD~~k~~--~~R~~v~~~-DP~~~f--~~-ed~--d~~~~v~~v~~~~~~~~P~d~~~~~ 183 (527) T protein:vir:10 112 KFESLKRWTEIRGDYVLLLIGDDEKDE--GSRLSLHEV-DPSTYF--PY-EDP--RYPGQVLGVYLVDEYPHPDSEKKNE 183 (527) T ss_pred HHHHHHHhhhhhcceeEEEeeccCCCc--CCCceEeec-Ccceee--ee-ecC--CCCCceeeEEEeeeccCCccccccc Confidence 999999999999999999999864432 467888777 785544 32 323 3667766554 33333322211 Q ss_pred CCccc-chhh-cccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEE Q lcl|Aclame:pro 234 PDATA-EPVY-EDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFK 311 (711) Q Consensus 234 p~~~~-~~~~-~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~ 311 (711) --... .... .++.+. .+.....++..+.|.-.. +-..+..... .+. T Consensus 184 ~~ar~~~~~~~l~~~g~--~~~~G~~~yt~~~w~lg~------------w~d~~e~p~~-~~~----------------- 231 (527) T protein:vir:10 184 KCARVQKYMKTLDDDGK--PVPGGAIKYTEELYEPGK------------WDDRPESPLE-PDD----------------- 231 (527) T ss_pred eehhhhhhhhhcCcccc--cccCcceeeeeceeeccc------------cccccccccc-hhh----------------- Confidence 00000 0000 011110 111222333222443211 1000000000 000 Q ss_pred EEEEEEecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccc Q lcl|Aclame:pro 312 TYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGN 391 (711) Q Consensus 312 v~~~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~a 391 (711) ....++..++ ...|.|.+.+|||+|- -.+..++.+|+|-+.+++++++.+|+.+|-...++..+.+|.+...--. T Consensus 232 --~~~~~~~~~l-~~lp~pi~fiPvV~~~--t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi~~~tg~~ 306 (527) T protein:vir:10 232 --IKKLSTLTEE-EPLPEQITTLPVFHFR--GHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGFYATDSAP 306 (527) T ss_pred --hhhhcCceee-ecccCCCCccceEeec--CCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCceeeecccc Confidence 0011233333 3356677888888763 3455678899999999999999999999999999999988877774322 Q ss_pred cCChHHHHhhcccCCCceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhcc-c-cchhHHHHHH Q lcl|Aclame:pro 392 VEGREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGA-M-GNETSGRAII 469 (711) Q Consensus 392 v~~~~~~~~~~~~~~~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~-~-~~~~sg~ai~ 469 (711) ..+...........||+++.+-.+ ..+..+....-...+...+....+.|.+++|++..+.|. + ++..||.|+. T Consensus 307 ~vd~~G~~~~~~VgPG~iweL~e~----ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s~~~SG~ALe 382 (527) T protein:vir:10 307 PRDSRGNMVPWTISPLGMVEHGQN----NKIYRVNGVASLEPSQTHMTKAEEAMQQTKGIPDIAVGVVDAAVAESGIALD 382 (527) T ss_pred cccccCCcCccccCCceeEecCCC----cceeeccchhhhHHHHHHHHHHHHHHHHhhcCCeeeeccccCCcCcHHHHHH Confidence 221111111112346777766433 345555554455567778888889999999999999993 3 4567998876 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeee Q lcl|Aclame:pro 470 ARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYD 549 (711) Q Consensus 470 ~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~d 549 (711) .....- +.++-+ -++++..++.-+-...+.+-.- ..+-+.+. |. ...+. T Consensus 383 L~L~PL------lar~~r------k~L~~~~vqrq~~~~~~~~~L~---aye~v~~~---------------d~-~~~~~ 431 (527) T protein:vir:10 383 LKLSAI------LSSCAE------QELELKSVLKQFFYNLVTQWLP---AYEGVGID---------------DA-DKKLT 431 (527) T ss_pred HHHHHH------HHHHHH------HHHHHHHHHHHhhhhhHHHHHH---HhhhcccC---------------CC-ccccc Confidence 654332 111111 1112222221110000000000 00000000 00 11234 Q ss_pred EEeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCCcchHHHHHHHHhhhcchhhcchhh-----hhhh--- Q lcl|Aclame:pro 550 VVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDWPGADVIAERLKKIVPPNVLSKDER-----EAIE--- 621 (711) Q Consensus 550 v~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~~~~-----~~~~--- 621 (711) +.+.=++-.++.+.+..+++..+.+.-----..+..++.+.+.+...+.-++++......+....... .+.. T Consensus 432 v~ivf~p~lP~D~~avie~v~tL~~aGi~S~~tAv~~L~~~~g~eD~E~E~~~I~~era~~a~a~a~A~~~~~a~~~~~~ 511 (527) T protein:vir:10 432 VTITFRDPKPVNSEKRFNQLLQLWEAGLIPAKKLTEELSKIMGFELTEEDFKQATEDKKTQGIAQAEAADPFGAQMAAEQ 511 (527) T ss_pred eEEEecccCCCCHHHHHHHHHHHHHcCchhHHHHHHHHHhccCCCChHHHHHHHHHHHHHHhHHhhhhcCchhhhhcccc Confidence 56666777777777777777665543110011122222233333333322222221111110000000 0000 Q ss_pred ----hhHHHHHHHHHH Q lcl|Aclame:pro 622 ----EDMPEQTEPTPE 633 (711) Q Consensus 622 ----~~~~~~q~~~~~ 633 (711) .+.-++-...+- T Consensus 512 g~~~~~~d~~~~~~~~ 527 (527) T protein:vir:10 512 GIPDEEDDQALNGQPL 527 (527) T ss_pred CCCCCCcccccCCCCC Confidence 000000000000 No 117 >protein:vir:102239 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655205;genbank:gi:109522785;genbank:GeneID:4157478 Probab=99.57 E-value=1.5e-13 Score=90.86 Aligned_cols=502 Identities=10% Similarity=-0.020 Sum_probs=236.6 Q ss_pred CCcCCCCCCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHH-HHHHHHHHHhCCC--CCCHHHHHHHHHhC Q lcl|Aclame:pro 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNW-EAAEDDLKFLGGE--QWPSQVRTERELEQ 77 (711) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r-~~~~~~~~~y~G~--Qw~~~~~~~~~~~g 77 (711) -|+|+|.++.+= .......|-- ..+ ..+.| .....-.+||.|+ ||.. .+..-.+++ T Consensus 3 ~~~~~~~~~~~~--~~g~~~~p~~-v~~-----------------~d~~Rl~aY~l~~~~y~n~~~~~~~-~lrg~~~~~ 61 (527) T protein:vir:10 3 QDKRQYGSTQQL--RAGEANFPNA-VTD-----------------FDKARLASYRLYEDMYLTNTSDYQV-ILRGGDEGD 61 (527) T ss_pred ccccccCCCcCc--CCccccCccc-CCH-----------------HHHHHHHHHHHHHHHhcCchhheee-ecCCccccc Confidence 467887665432 1222222111 111 11111 1122446788875 6642 222223445 Q ss_pred CCceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHH Q lcl|Aclame:pro 78 RPCLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAET 157 (711) Q Consensus 78 ~p~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~ 157 (711) +-++.++ .-..|+|..- .+.+-+- +..+...++-+..+++...+.++... T Consensus 62 ~r~~~~p----s~~~~~~~~~----~~~~~g~----------------------~~~~~~~~e~v~~~lr~~~~~e~l~~ 111 (527) T protein:vir:10 62 QRPIYVP----NGEKLIEAKM----RFLGQGL----------------------KWEFSKKDAKVDDAIRVLFDRENWEQ 111 (527) T ss_pred cceeeeh----hhHHhhCCcc----eeeccCc----------------------cccccchhHHHHHHHHHHHHHhhhHH Confidence 5566664 3356666421 1111111 11223345556777788888899999 Q ss_pred HHHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCccceeeCCCccccCccccceeeeee----cCCHHHHHHhc Q lcl|Aclame:pro 158 EYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDD----TMSKEKFKALY 233 (711) Q Consensus 158 ~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~----~~~~~e~~~~~ 233 (711) .+-.+-.++++-|-|++++.||..... +++|.+..+ ||.-+| |- ..+ .+.+++..++ |-..++-++-+ T Consensus 112 ~~~~~~r~~~vlGDg~f~l~wD~~k~~--~~R~~v~~~-DP~~~f--~~-ed~--d~~~~v~~v~~~~~~~~P~d~~~~~ 183 (527) T protein:vir:10 112 KFESLKRWTEIRGDYVLLLIGDDEKDE--GSRLSLHEV-DPSTYF--PY-EDP--RYPGQVLGVYLVDEYPHPDSEKKNE 183 (527) T ss_pred HHHHHHHhhhhhcceeEEEeeccCCCc--CCCceEeec-Ccceee--ee-ecC--CCCCceeeEEEeeeccCCccccccc Confidence 999999999999999999999864432 467888777 785544 32 323 3667766554 33333322211 Q ss_pred CCccc-chhh-cccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEE Q lcl|Aclame:pro 234 PDATA-EPVY-EDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFK 311 (711) Q Consensus 234 p~~~~-~~~~-~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~ 311 (711) --... .... .++.+. .+.....++..+.|.-.. +-..+..... .+. T Consensus 184 ~~ar~~~~~~~l~~~g~--~~~~G~~~yt~~~w~lg~------------w~d~~e~p~~-~~~----------------- 231 (527) T protein:vir:10 184 KCARVQKYMKTLDDDGK--PVPGGAIKYTEELYEPGK------------WDDRPESPLE-PDD----------------- 231 (527) T ss_pred eehhhhhhhhhcCcccc--cccCcceeeeeceeeccc------------cccccccccc-hhh----------------- Confidence 00000 0000 011110 111222333222443211 1000000000 000 Q ss_pred EEEEEEecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccc Q lcl|Aclame:pro 312 TYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGN 391 (711) Q Consensus 312 v~~~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~a 391 (711) ....++..++ ...|.|.+.+|||+|- -.+..++.+|+|-+.+++++++.+|+.+|-...++..+.+|.+...--. T Consensus 232 --~~~~~~~~~l-~~lp~pi~fiPvV~~~--t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi~~~tg~~ 306 (527) T protein:vir:10 232 --IKKLSTLTEE-EPLPEQITTLPVFHFR--GHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGFYATDSAP 306 (527) T ss_pred --hhhhcCceee-ecccCCCCccceEeec--CCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCceeeecccc Confidence 0011233333 3356677888888763 3455678899999999999999999999999999999988877774322 Q ss_pred cCChHHHHhhcccCCCceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhcc-c-cchhHHHHHH Q lcl|Aclame:pro 392 VEGREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGA-M-GNETSGRAII 469 (711) Q Consensus 392 v~~~~~~~~~~~~~~~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~-~-~~~~sg~ai~ 469 (711) ..+...........||+++.+-.+ ..+..+....-...+...+....+.|.+++|++..+.|. + ++..||.|+. T Consensus 307 ~vd~~G~~~~~~VgPG~iweL~e~----ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s~~~SG~ALe 382 (527) T protein:vir:10 307 PRDSRGNMVPWTISPLGMVEHGQN----NKIYRVNGVASLEPSQTHMNKAEEAMQQTKGIPDIAVGVVDAAVAESGIALD 382 (527) T ss_pred cccccCCcCccccCCceeEecCCC----cceeeccchhhhHHHHHHHHHHHHHHHHhhcCCeeeeccccCCcCcHHHHHH Confidence 221111111112346777766433 345555554455567778888889999999999999993 3 4567998876 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeee Q lcl|Aclame:pro 470 ARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYD 549 (711) Q Consensus 470 ~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~d 549 (711) .....- +.++-+ -++++..++.-+-...+.+-.- ..+-+.+. |. ...+. T Consensus 383 L~L~PL------lar~~r------k~L~~~~Vqrq~~~~~~~~~L~---aye~v~~~---------------d~-~~~~~ 431 (527) T protein:vir:10 383 LKLSAI------LSSCAE------QELELKSVLKQFFYNLVTQWLP---AYEGVGID---------------DA-DKKLT 431 (527) T ss_pred HHHHHH------HHHHHH------HHHHHHHHHHHhhhhhHHHHHH---HhhhcccC---------------CC-ccccc Confidence 654332 111111 1122222221110000000000 00000000 00 11234 Q ss_pred EEeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCCcchHHHHHHHHhhhcchhhcchhh-----hhhh--- Q lcl|Aclame:pro 550 VVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDWPGADVIAERLKKIVPPNVLSKDER-----EAIE--- 621 (711) Q Consensus 550 v~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~~~~-----~~~~--- 621 (711) +.+.=++-.++.+.+..+++..+.+.-----..+..++.+.+.+...+.-++++......+....... .+.. T Consensus 432 v~ivf~p~lP~D~~avie~v~tL~~aGiiS~etAv~~L~~~~g~eD~E~E~~~I~~era~~a~a~a~a~~~~~a~~~~~~ 511 (527) T protein:vir:10 432 VTITFRDPKPVNNEKRFAQLLELWEAGLIPAKKLTEELSKIMGFELTEEDFRQATEDKKTQGIAQAEAADPFGAQMAAEQ 511 (527) T ss_pred eEEEecccCCCCHHHHHHHHHHHHHcCchhHHHHHHHHHhccCCCchHHHHHHHHHHHHHHhHHhhhhcCchhhhhcccc Confidence 56666777777777777776665543110011112222222333333222222221111110000000 0000 Q ss_pred ----hhHHHHHHHHHH Q lcl|Aclame:pro 622 ----EDMPEQTEPTPE 633 (711) Q Consensus 622 ----~~~~~~q~~~~~ 633 (711) .+.-++-...+- T Consensus 512 g~~~~~~d~~~~~~~~ 527 (527) T protein:vir:10 512 GIPDEEDDQALNGQPL 527 (527) T ss_pred CCCCCCcccccCCCCC Confidence 000000000000 No 118 >protein:vir:7430 Length: 563 # NCBI annotation: gp7 # Family: family:all:6920 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818545;genbank:gi:29566982;genbank:GeneID:1260216 Probab=99.56 E-value=3.5e-13 Score=88.87 Aligned_cols=525 Identities=9% Similarity=0.045 Sum_probs=243.1 Q ss_pred CCcCCCCCCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCCc Q lcl|Aclame:pro 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRPC 80 (711) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p~ 80 (711) -++++|.++-....+-.+.-=+ +.++. ++ .....-.+||.|+||+-.. .|.-+.+-+ T Consensus 3 ~~~~q~~p~~~~fp~~~a~wV~---~~D~~---Rl---------------aaY~ly~d~y~n~~~el~~--il~G~dr~~ 59 (563) T protein:vir:74 3 YNHKQYDPAKPFLRGGDDNIVD---ENDKN---RV---------------RAYDLYENIYLNSAETLKL--VLRGDDSVP 59 (563) T ss_pred ccccccCCCcccccccccccCC---HHHHH---HH---------------HHHHHHHHhhcCchhhhhh--hcCCCceee Confidence 3566655444333222221100 11111 11 1122447899999996432 232223334 Q ss_pred eEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHH Q lcl|Aclame:pro 81 LVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYD 160 (711) Q Consensus 81 ~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~ 160 (711) +.++--+.+|+++. ..-.....+.|-| ..+|+...+.++.+++...+.++....+. T Consensus 60 ~~~ps~r~~V~~~~-~~Lg~~~~~~Ve~-----------------------~~~de~~~~avq~~Lr~~~~~e~l~~~~~ 115 (563) T protein:vir:74 60 ILMPSGRKIVEAVH-RFLGVGFDYLVEP-----------------------DMGDEGIRQSLNAYFRTTFKREAIKAKFT 115 (563) T ss_pred eccchHHHHHHHHH-HhcCCCcEEecCc-----------------------cccCcchHHHHHHHHHHHHHHhhhHHHHH Confidence 44434556677643 3223333444444 34566666778999999999999999999 Q ss_pred HHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCccceeeCCCccccCccccceeeee---ecCCHHHHHH-hcCCc Q lcl|Aclame:pro 161 IAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLID---DTMSKEKFKA-LYPDA 236 (711) Q Consensus 161 ~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~---~~~~~~e~~~-~~p~~ 236 (711) .+-+++++-|-|+++|.||... .-++++++..| +|.-+| | +..+| .+..+..++ .|-..++.++ ++.-. T Consensus 116 ~~~r~a~vlGDgvf~l~wDp~K--~~g~R~rv~~v-DP~~~f--p-~~dpd-~v~g~~~v~v~~~~~~pdd~~~~~~r~~ 188 (563) T protein:vir:74 116 SNKRWGLIRGDAHFYIHADPNK--KAGERISVDEV-DPRQIF--L-IEDGS-TVVGFHMVDIVQDFRSPDDPSKKLARRR 188 (563) T ss_pred HHHHhhhhhcceeEEEeecccc--ccCCCceEeec-CCceee--e-ccCCC-CcccceeeecccCCCCCcchhccceeee Confidence 9999999999999999998643 33568888888 775444 2 33333 222222122 2222333221 11000 Q ss_pred ccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEE Q lcl|Aclame:pro 237 TAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRK 316 (711) Q Consensus 237 ~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~ 316 (711) ... .. . +...++...-.+..|.|.- ++ ++.....-.... ...-.+ T Consensus 189 ~~~---~~-l-ndeg~~~~~~~~dae~w~l----------g~-----wd~r~~~~~~~~---------------~~~~~~ 233 (563) T protein:vir:74 189 TFR---RV-R-NDEGMFTGRISSELTHWTL----------GN-----WDDRGAISDEQA---------------RRKEQV 233 (563) T ss_pred eee---ee-e-CCCCCccceeeeccchhcc----------cc-----ccccCccchhhh---------------cccchh Confidence 000 00 0 0001111111122222211 00 011100000000 000001 Q ss_pred EecCceec-cCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCC- Q lcl|Aclame:pro 317 ITGANVLE-GPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEG- 394 (711) Q Consensus 317 ~~g~~~le-~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~- 394 (711) +.-...+| ..-|-|.+.+||+-+ .-.|..++.+|.|-...+..+.+.+|..++-.-.++..+.+|.++.+..+-.+ T Consensus 234 ~~~~~d~e~~~LP~pi~~iPiv~~--~tip~~~s~WG~S~La~ll~~~~eLn~~~Td~s~i~~~tG~pi~vl~~~~p~d~ 311 (563) T protein:vir:74 234 RSAQHDEEEEELPEPISQLPLYRW--RNKPPQNSSWGTSQLEGMETLAYALNQSLTDEDATIVFQGLGMYVTNASAPVDP 311 (563) T ss_pred hhhhhhchhhhccccccCccEEEc--CCCCCcccccchhhHHHHHHHHHHHhhhhhHHHHHHHhcCCCeEEecccccccc Confidence 11112211 223556678888743 34456788899999999999999999999999999999998888876333211 Q ss_pred hHHHHhhcccCCCceEEecccccCcCCccccCCccchHHHHHHHHHHHH-HHHHHhCCCHHHhc--cccchhHHHHHHHH Q lcl|Aclame:pro 395 REDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVE-KIKSTMGMYDASLG--AMGNETSGRAIIAR 471 (711) Q Consensus 395 ~~~~~~~~~~~~~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~-~~~~~tGv~~~~~G--~~~~~~sg~ai~~~ 471 (711) .......-+..||+++++-... ....+..+...+--+.+-.-+..... .|.+++|++..++| -.+...||.|.... T Consensus 312 ~~g~~~~w~vgpG~i~El~~~~-~~g~l~~v~g~~~l~~~q~Hm~~l~eral~~~s~tPavA~G~vD~~~~~SGiALeL~ 390 (563) T protein:vir:74 312 NTGELTDWNIGPMQIVEIAGNR-NDNYFERVSGVQDVSPFQDHMKWIDEKGIAEGSGTPEVAIGRVDVTSAESGISLELQ 390 (563) T ss_pred ccccccccccCCceeEeccCCc-cccceeeecchhhhHHHHHHHHHHHHHHHHhhccCcceeecccccccccchhhhhhh Confidence 1111111235688877775322 22334444442222233333443333 67788999999999 34456799886554 Q ss_pred HHHHHH---HHH-HHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhhee Q lcl|Aclame:pro 472 QRQGDR---GSF-AFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQK 547 (711) Q Consensus 472 ~~~~~~---~~~-~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~ 547 (711) ..--.. +-+ .+...++.++.+...++|.+.+..|-... ..+|+-... .-.+ T Consensus 391 L~PL~a~~~ek~l~l~~~mr~~r~~~~~~lL~~~erl~~~g~---------~~~~~g~~~----------------~~~~ 445 (563) T protein:vir:74 391 LKPLLAANEEKELEMIVVMDQFLHDWMTMWLPAYESDFQEQD---------GSRPFASAD----------------LLNE 445 (563) T ss_pred hhHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhc---------ccccccccc----------------cCCc Confidence 332111 111 25556667777788888887777552211 111111111 0112 Q ss_pred eeEEeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHH---HHhcCCc--chHHHHHHHHhh--h---cchhhcchh- Q lcl|Aclame:pro 548 YDVVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLI---AQNMDWP--GADVIAERLKKI--V---PPNVLSKDE- 616 (711) Q Consensus 548 ~dv~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~---~~~~~~~--~~~e~~~~l~~~--~---~~~~~~~~~- 616 (711) .-|++.=+|-.++-+.+..+....+.+. .-+...+. +...+++ .++...+.++.. . .++...... T Consensus 446 ~~v~ivf~p~~P~d~~~vv~~~~tl~~a----GiiSretAv~~L~~~g~~~pdae~e~~~ie~~~i~~~~~a~a~ad~~~ 521 (563) T protein:vir:74 446 CSVVCIFADPMPVNKTQVTQDTLLLQQA----HLILRKMAVAKLRSIGWEYPEVDDQGNALTDDDIADMLLAEAEADASL 521 (563) T ss_pred eEEEEEeCCCCCccHHHHHHHHHHHHHc----CchhHHHHHHHHHhCCCCCCcHHHHHhhcCHHHHHHHHHHHhhccCcc Confidence 2344555666666666666665554443 11111111 1111332 222211111100 0 000000000 Q ss_pred --hhh-------hh---hhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 617 --REA-------IE---EDMPEQTEPTPEQQVEMAKSQADMAQ 647 (711) Q Consensus 617 --~~~-------~~---~~~~~~q~~~~~~q~~~~~~q~~~~k 647 (711) ++. .+ +.++--|-.. ..+..-.-.|.-+.- T Consensus 522 ~~~a~~~~g~~~~~~dd~g~p~~~~~~-~~~~~~~~~~~~~~~ 563 (563) T protein:vir:74 522 GLSAMDNGGAGEQQFDDQGNPIDQFGN-PVEIPPDVTQVPLSP 563 (563) T ss_pred cceecccCCCCcccccccCCchhHcCC-cccCCccccccCCCC Confidence 000 00 0000000000 000000000000000 No 119 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=99.56 E-value=3.5e-13 Score=88.87 Aligned_cols=451 Identities=13% Similarity=0.049 Sum_probs=204.9 Q ss_pred CCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCC----CHHHHHHHHHhCCCceEehh Q lcl|Aclame:pro 10 VEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQW----PSQVRTERELEQRPCLVNNV 85 (711) Q Consensus 10 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw----~~~~~~~~~~~g~p~~~~N~ 85 (711) |-+ .+.+.....+ ++....+.++...+.. .+..-.+-.+||+|+|= +......++.. ..+.|. T Consensus 1 ~~~--~~~~~~~gl~-~~~~~~~~~L~~~~~~-------~~~~~~~~~~Yy~G~~~~~~~~~~~p~~~r~~---~~v~nw 67 (474) T protein:vir:81 1 MIQ--QQTVRIPSLS-NDENALINGLLAQIEN-------LRWKNLLRTSYYENKRTIQYVGTLIPPQYFNL---GLVLGW 67 (474) T ss_pred CcC--CCcCcCCCCC-hhHHHHHHHHHHHHHH-------HhhHHHHHHHHhccCCChhhccccccHHHHHH---HhhcCh Confidence 222 1333332222 3334456666554333 23334456799999743 22222223221 146788 Q ss_pred hHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHH Q lcl|Aclame:pro 86 LPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQG 165 (711) Q Consensus 86 i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~ 165 (711) .+-.|+.+.....=+ .+ +.|. .+..+ .-+..+++.|+++...+.+..+ T Consensus 68 ~~~~Vd~~a~rl~~~--Gf-~~~d---------------------~~~~~--------~~l~~iw~~N~ld~~~~~~~~~ 115 (474) T protein:vir:81 68 TGKAVDALARRCNLE--GF-VWPD---------------------GDLDS--------LGGTEVVDDNHLLSEIDSAIVA 115 (474) T ss_pred HHHHHHHHHhhhccc--ce-ECCC---------------------CCccc--------hHHHHHHHhcChhHHHHHHHHH Confidence 888888875421100 01 1110 01111 1146778999999999999999 Q ss_pred HHhcCccEEEEEEeeccCCCCCCcceEEEecCcccee--eCCCccccCccccceeeeeecCCHHHHHHhcCCcccchhhc Q lcl|Aclame:pro 166 AVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYE 243 (711) Q Consensus 166 ~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~ 243 (711) ++++|++|+-|+.+. + ..+.+.|..+ +|.+++ |||..+.+ ...+.+...+ T Consensus 116 al~~G~sf~~V~~~~---d-~~~~~~i~~~-sp~~~~~~~D~~~~~~-----~~al~~~~~~------------------ 167 (474) T protein:vir:81 116 AMQHGPAFLINTVGE---D-DEPEALIHVK-DASEATGEWNRRRRGL-----NNLLSIIDKD------------------ 167 (474) T ss_pred HHhhCceeEEEecCC---C-CCceeEEEEe-ccceEEEEEeCCCCcc-----eeeeEEEEEc------------------ Confidence 999999998777542 1 1234666555 777666 88854322 1111111000 Q ss_pred ccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEEecCcee Q lcl|Aclame:pro 244 DSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVL 323 (711) Q Consensus 244 ~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l 323 (711) .+. ......+|.. +.++.+.... ++.... T Consensus 168 ---------~~g-~~~~~~ly~~------------~~~~~~~~~~-----------------------------~~~~w~ 196 (474) T protein:vir:81 168 ---------KEG-KVLSLALYLD------------NETVTAQRDK-----------------------------ATLKWQ 196 (474) T ss_pred ---------CCC-cEEEEEEEeC------------CcEEEEEEcC-----------------------------ccceee Confidence 000 1111111211 1111110000 000001 Q ss_pred ccCccCCCCccceEEEEeeeeccCCcccccc-hHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCC-------- Q lcl|Aclame:pro 324 EGPVEIPSTTIPVIPVWGKSLIIKKKEIFRS-IIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEG-------- 394 (711) Q Consensus 324 e~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g-~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~-------- 394 (711) .+..|.+.| .|+|||+-.+. -...+|.| +.+.+++.|+.+|+.++.++-.....+.++-.+- |+-.. T Consensus 197 ~~~~~~~~g-vPvV~~~n~~~--~~~~~G~s~i~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~-G~~~~~~~d~d~~ 272 (474) T protein:vir:81 197 VDRDEHVYG-VPAQVLPYKPA--PKRPFGQSRITKPMMGLQDAGVRELARREGHMDVFSYPEFWLL-GADESALKNADGT 272 (474) T ss_pred eccCCCCCC-cceEEeccccc--ccCcCCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhheee-cCChhhccccccc Confidence 122333445 68888764432 22334444 4479999999999999999999888888876553 32110 Q ss_pred hHHHHhhcccCCCceEEecccccCc------CCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccc--cchhHHH Q lcl|Aclame:pro 395 REDEWEQANTKNFSLLTYIPQYQGD------PGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAM--GNETSGR 466 (711) Q Consensus 395 ~~~~~~~~~~~~~~~i~~~~~~~~~------~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~--~~~~sg~ 466 (711) ....|. ...+.++.+.++..+. ..+..++..++. .+...+......+-.+||++...+|.. .|..||. T Consensus 273 ~~~~~~---~~~~~i~~~~~d~d~~~~~~~~~~~~q~~~a~l~-~~~~~l~~~~~~~a~~t~iP~~~lG~~~~~np~Sae 348 (474) T protein:vir:81 273 IKSVWE---ARLGRIKGLPDDADADIPQLARADVKQFPAASPD-AHWSDINGLAKLFAREASLPDTAVAISGLSNPTSAE 348 (474) T ss_pred ccchhh---hhHHHHhcCCCcccccccccccccccccCCCChh-HHHHHHHHHHHHHHhhhCCCHHHhcccccccccHHH Confidence 001111 1122233333222211 123334333322 344455555556666789999999953 5668999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhhe Q lcl|Aclame:pro 467 AIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQ 546 (711) Q Consensus 467 ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~ 546 (711) ||.+....-........+.|..+.++++++.+.+.-.+--+ ....++..+.. T Consensus 349 Ai~a~~~~l~~kae~k~~~fg~~l~~~~rla~~i~~~~~~~---------~~~~~~~~~~v------------------- 400 (474) T protein:vir:81 349 SYDASQYELIAEAEGAVDDFTPALRKAFIRALAMKNKVAID---------EIPDEWKSIDA------------------- 400 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcc---------ccchhhcccee------------------- Confidence 99887766666666677777777888887776554322100 00111111100 Q ss_pred eeeEEeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHH Q lcl|Aclame:pro 547 KYDVVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPE 626 (711) Q Consensus 547 ~~dv~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 626 (711) .= -.+.+.|. .+..+.+..+.++.+.+... ..+.+.+++. .+++.... ... T Consensus 401 ----~W-~d~~~~s~-a~~aDa~~Kl~~a~~~~~~~--~~~~~~lg~t-~~~i~~~~------------------~~~-- 451 (474) T protein:vir:81 401 ----KW-RDPRYLSK-SAQADAGMKQLAAVPWLAET--EVGLELIGLT-PQQARRAM------------------ADK-- 451 (474) T ss_pred ----Ee-cCCCccCH-HHHHHHHHHHHhcccCCCcH--HHHHhhcCCC-HHHHHHHH------------------HHH-- Confidence 00 00112222 33445555555543322111 1112222222 11111100 000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 627 QTEPTPEQQVEMAKSQADMAQAEADTAQAQ 656 (711) Q Consensus 627 ~q~~~~~~q~~~~~~q~~~~k~qae~~~aq 656 (711) +++ +.+..+..+- ....+...|| T Consensus 452 -~~~--~~~~~~~~l~----~~~~~~~~aq 474 (474) T protein:vir:81 452 -RRV--QGRGTLQALI----DRSNNGATAQ 474 (474) T ss_pred -HHH--hHHHHHHHHH----hcCCCCCCCC Confidence 000 0000000000 0000001111 No 120 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=99.52 E-value=1.9e-13 Score=90.35 Aligned_cols=487 Identities=11% Similarity=0.076 Sum_probs=225.4 Q ss_pred HHHHHHHHHHHHHHHhh-----------------chHHHHHHHHHHHHhCCCCCCHHHHHH-HHHhCCCceEehhhHHHH Q lcl|Aclame:pro 29 RALLATARERARDGATY-----------------WKDNWEAAEDDLKFLGGEQWPSQVRTE-RELEQRPCLVNNVLPTFV 90 (711) Q Consensus 29 ~~~~~~~~~~~~~~~~~-----------------~~~~r~~~~~~~~~y~G~Qw~~~~~~~-~~~~g~p~~~~N~i~~~v 90 (711) =.++.+++++|++-... ..+.+........||.|+.+.-.-... -....+.....|+-+.++ T Consensus 1 m~~~~~~k~~~~k~~~~~~~~~~~~i~~~~~i~~~~~~~~~i~~~~~~y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~ 80 (522) T protein:vir:47 1 MSLFQKVKDFFSRGRYYMQTSNLNSILEHPKIAVTQEEYDRIKRNLVYYQSKWDDVQYKNTDGDIKSRPMNHLPIARTAS 80 (522) T ss_pred CchHHHHHHHHHHHHHHhhcccchhccccCCCCCCHHHHHHHHHHHHHhcCCcccccccccCcchhcccceecchHHHHH Confidence 22445555555533211 233445566778889887442110000 000112245567777777 Q ss_pred HHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhcC Q lcl|Aclame:pro 91 DQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESG 170 (711) Q Consensus 91 ~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G 170 (711) +...+..-.-.+.+.+ +|....+ .+..+.+.++|......+++.++..| T Consensus 81 ~~~A~lv~~e~~~i~v---------------------------~d~~~~~----~l~~~l~~n~f~~~~~~~~e~a~a~G 129 (522) T protein:vir:47 81 KKIASLVYNEQATITT---------------------------KNEILQK----FLDDMLTNDRFNKNFERYLESCLALG 129 (522) T ss_pred HHHhhhhcCCcceeec---------------------------CChHHHH----HHHHHHhhcchHHHHHHHHHHhhccC Confidence 7666654433333332 2344444 44555567899999999999999999 Q ss_pred ccEEEEEEeeccCCCCCCcceEEEecCccceeeCCCccccCccccceeeeeecCCHHHHHHhcCCcccchhhcccccccc Q lcl|Aclame:pro 171 MGYLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSVADYD 250 (711) Q Consensus 171 ~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~ 250 (711) .++++++++. +.++|..| ++..|++= ....-+...|-++++........- ..|---..+++......... T Consensus 130 ~~a~k~~~d~-------~~~~i~~v-~ad~~~P~-~~~~~~~~e~a~~~~~~~~~~~~~-~~yt~lE~he~~~~~~~~~~ 199 (522) T protein:vir:47 130 GLAMRPYIDG-------DKVRVAFI-QAPVFFPL-ESNTQDVSSAAILTKTIKSEGRKN-VYYTLVEFHEWVTADGQETG 199 (522) T ss_pred CEEEEEEEcC-------CceEEEEE-cCCceEEE-EEcCCceEEEEEEEEEEeecccce-eEEEEEEEeeeccccccccc Confidence 9999999862 46788777 67677731 111112233444333322111100 00000000000000000000 Q ss_pred cCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEEecCceeccCccCC Q lcl|Aclame:pro 251 TWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGPVEIP 330 (711) Q Consensus 251 ~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~le~~~p~~ 330 (711) .-...+.-++...+|+..... .-|.-+.+..... | .-|....-+. T Consensus 200 ~~~~~~~~~I~n~ly~~~~~~-----~lG~~v~l~~~~e------------------------~------~~l~~~~~~~ 244 (522) T protein:vir:47 200 STNDKKYYRITNELYRSDVND-----VLGQRVNLSELDK------------------------Y------KNLEPVTVFE 244 (522) T ss_pred ccccCCceEEEEEEeecCCCc-----ccCcccccccccc------------------------c------cCCCCceEeC Confidence 000000111111111110000 0010000000000 0 0000001112 Q ss_pred CCccceEEEEee---eeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHHH-----Hhhc Q lcl|Aclame:pro 331 STTIPVIPVWGK---SLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDE-----WEQA 402 (711) Q Consensus 331 ~~~~P~vp~~~~---~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~-----~~~~ 402 (711) +-.-|++.++.. -....++++|.|++..+++..+.+|...+.+.+-+.. +..++++++..+.....- .... T Consensus 245 ~~~~Plf~y~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~-g~~~i~v~~~~l~~~~~~~~g~~~~~~ 323 (522) T protein:vir:47 245 NLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRSYDEFMWEVRM-GQRRVIVPEHLTQRQYQRPDGTIDFRP 323 (522) T ss_pred CCCcceEEEecCCcccccccCCCcCCchhhhhHHHHHHHHHHHHHHHHHHHh-ccceeecchHHhccCCCCCCccccccc Confidence 212232211111 1112367788999999999999999999999999875 455788877776432110 0000 Q ss_pred ccC--CCceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccch-hHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 403 NTK--NFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNE-TSGRAIIARQRQGDRGS 479 (711) Q Consensus 403 ~~~--~~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~-~sg~ai~~~~~~~~~~~ 479 (711) ... ...+..++........++.+++.--...+...++.....+....|++....|.+++. .||.+|.+..+..-... T Consensus 324 ~fd~~~~~f~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~gls~~tf~~~~~~~kTAtEi~s~~~~~~~t~ 403 (522) T protein:vir:47 324 RFDVEQNVYMQIGGSSMDAGGITDLTSPIRANDYILAISEGLKLFEMQIGVSSGMFTFDGQGMKTATEIVSENSDTYQMR 403 (522) T ss_pred ccCcccceEeecCCCCCCCCcceeeccccChHHHHHHHHHHHHHHHHHhCCCccccCccccccccHHHHHHHHHHHHHHH Confidence 011 111233332222334576666554555677788888888888899999888876654 57888888888888888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhh--cCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEeecccC Q lcl|Aclame:pro 480 FAFIDNLTKSIRRVGKILVEMIPHI--YDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPA 557 (711) Q Consensus 480 ~~~~dn~~~~~~~~~~~~l~li~~~--~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~ 557 (711) ..+...+..+++++.+.++.+...+ +.. . . ...++++++=+.+ T Consensus 404 ~~~~~~~~~al~~lv~~i~~l~~~~~~~~~--------~--~-------------------------~~~~~i~v~f~D~ 448 (522) T protein:vir:47 404 SSIVALVEQSIKELCVSMCELGKAVGVYSG--------E--I-------------------------PELDDISVNLDDG 448 (522) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhccC--------C--C-------------------------CCcceeEEEcCCC Confidence 8888899999999999888877532 111 0 0 0112233332322 Q ss_pred hHHHHHHHHHHHHHHHhh--cchhHHHHHHHHHHhcCCcc--hHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHHHH Q lcl|Aclame:pro 558 FATQRIEAAEAMIQFAQA--VPSAAAVMADLIAQNMDWPG--ADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTP 632 (711) Q Consensus 558 ~~s~r~~~~~~L~~l~~~--~p~~~~~~~~~~~~~~~~~~--~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~ 632 (711) ....+++..+.++++..+ ++.. ..+.+.-++.. +.+.+.+++.....+.+..........++++.--..- T Consensus 449 i~~D~~~~~~~~~~~v~aG~~s~e-----~~i~~~~g~~eeea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~d~~~ 522 (522) T protein:vir:47 449 VFTDRHAELDYWAKMVAAGFSTKK-----RAIGKTLNISGVEAEKELNAINSELLPMNDAELAIYGMHDQNEEKADDKG 522 (522) T ss_pred CCCCHHHHHHHHHHHHhcCCCCHH-----HHHHhcCCCChHHHHHHHHHHHHhhccCCCCCCCCCCCCCcccccCCCCC Confidence 222233333444443322 2211 11222223321 3334444443332221111111000000000000000 No 121 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=99.46 E-value=2.3e-13 Score=89.83 Aligned_cols=422 Identities=11% Similarity=-0.019 Sum_probs=186.6 Q ss_pred HhCCCCCCHHHHHHHHHhCCCceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHH Q lcl|Aclame:pro 59 FLGGEQWPSQVRTERELEQRPCLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYEL 138 (711) Q Consensus 59 ~y~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~ 138 (711) |.. .......+...+ .++.|..+.+|+.+++...-+ . |+ -.|.+. T Consensus 1 ~l~-----~~~~~~~~~~~~-~~v~n~~~~ivd~~~~~l~~~--g--f~-------------------------~~d~~~ 45 (434) T protein:vir:98 1 MLP-----KNAEQAFLDFQR-KARTNFCGLIANASVHRLLAL--G--VT-------------------------GPDGEP 45 (434) T ss_pred CCC-----CCccHHHHHhhh-hhhccchHHHHHHHHhhhccC--c--ee-------------------------cCCCch Confidence 221 111111221111 246799999999888753211 0 11 011111 Q ss_pred HHHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCccEEEEEEeeccC-CCCCCcceEEEecCccce--eeCCCccccCcccc Q lcl|Aclame:pro 139 AEVFTGLIKNIEYNCDAETEYDIAFQGAVESGMGYLRVRSDYLAD-DSFEQDLIIEAIQNQFSV--TIDPDAKKRDRSDM 215 (711) Q Consensus 139 Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~-~~~~~~i~i~~v~~~~~v--~~Dp~a~~~d~~Da 215 (711) ...+..+++.|+++...+.+..+++++|.||+-|+.+.... ......+.|+.+ +|.++ +|||....+ T Consensus 46 ----~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~I~~~-~p~~~~~i~D~~~~~~----- 115 (434) T protein:vir:98 46 ----DTRASRWWQANRLDSRQKLVWRMAMAQSAGYMLVGAHPTRTEDNGRPSPLITME-HPSECIVEYDPETGEP----- 115 (434) T ss_pred ----HHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCCcccccCCceeEEEEe-ccceeEEEEeCCCCce----- Confidence 12234567889999999999999999999999887653221 111245666655 77765 477754322 Q ss_pred ceeeeeecCCHHHHHHhcCCcccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHH Q lcl|Aclame:pro 216 NWCLIDDTMSKEKFKALYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELL 295 (711) Q Consensus 216 ~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 295 (711) .+.+++...+.+ ...+..-+++-............+.+...+. T Consensus 116 ~~ai~~~~~~~~----------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------- 158 (434) T protein:vir:98 116 LVGLKVWHNDID----------------------------GFGYARVFFDDTSFPYRTRERTGARLPWGPD--------- 158 (434) T ss_pred EEEEEEEEeccC----------------------------CceEEEEEEeCcEEEEEEeeccccccccccc--------- Confidence 122222111100 0000000111000000000000000000000 Q ss_pred hcCchhhhhcccceEEEEEEEEecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHH Q lcl|Aclame:pro 296 EAGISIVRTRKVKTFKTYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATE 375 (711) Q Consensus 296 ~~g~~~~~~~~~~~~~v~~~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~ 375 (711) . | .....+....|.+.|..|++||+-.+. -...|.|.++.+++.++.+|+.+|.+.. T Consensus 159 --------------~---~---~~~~~~~~~~~h~~g~vPvv~f~N~~~---~~~~g~sd~e~vi~liDa~~~~~s~~~~ 215 (434) T protein:vir:98 159 --------------S---W---VYTGTADSGDVHDLGGMQLVEFARMPD---LGEDPEPEFAGVLDIQDRVNLGILNRMA 215 (434) T ss_pred --------------c---c---eecccccccccCCCCccceEEeccCCC---cCcCCcchhhhHHHHHHHHHHHHHHHHH Confidence 0 0 001112233445667788888753322 2235789999999999999999999999 Q ss_pred HHHhcCCCceEeccccc-CCh-HH------HHhhcccCCCceEEecccccCcCCccccCCccchHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 376 TVALAPKAPFIGSEGNV-EGR-ED------EWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKS 447 (711) Q Consensus 376 ~l~~~~~~~~~~~~~av-~~~-~~------~~~~~~~~~~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~ 447 (711) .+...+.++.++. |+- ... ++ .+......++.++.. ++ .+..+..++.. -...+...+......+-. T Consensus 216 ~~~~~a~p~~~i~-G~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-~~--~~~~~~q~~~~-~~~~~~~~l~~~i~~~~~ 290 (434) T protein:vir:98 216 ASRFSGFRQKWIK-GHKFAKRTDPATGMTVVDQPFVPSPSAVWAS-EG--ENTQFGQLDAT-DLSGFLKEHASDVRDMLT 290 (434) T ss_pred HHHHhcchhhhhc-CCCcccccccccccchhhhhhhccccccccC-CC--CCceEEEecCc-chHHHHHHHHHHHHHHhc Confidence 9888777765553 221 110 00 011111222222222 21 11223223222 223344555555556666 Q ss_pred HhCCCHHHhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecch Q lcl|Aclame:pro 448 TMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNE 527 (711) Q Consensus 448 ~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~ 527 (711) +|++++...|-..++.||.|+......-........+.|..++++++++++.+ . |. ..+.. T Consensus 291 ~~~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~rl~~~~----~---------g~--~~~~~---- 351 (434) T protein:vir:98 291 ISQTPTYLYATDLVNISADTIGALDILHVAKVREHIASFSEGLESVLALAAAQ----A---------GV--PEDYT---- 351 (434) T ss_pred ccCCCHHHhccccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----c---------CC--Chhhe---- Confidence 68888889986667889999988776666666666677777777777665543 1 11 11111 Q ss_pred hhhhhhccceeeeehhhheeeeEEeecccChHHHHHHHHHHHHHHHhh-cchhHHHHHHHHHHhcCCcchHHHHHHHHhh Q lcl|Aclame:pro 528 QIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQA-VPSAAAVMADLIAQNMDWPGADVIAERLKKI 606 (711) Q Consensus 528 ~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~-~p~~~~~~~~~~~~~~~~~~~~e~~~~l~~~ 606 (711) ++.+.=.+..+....+..+.+..+.+. .|. ..++++++++. +++.+ +.+. T Consensus 352 ---------------------~~~v~w~~~~~~s~~~~ada~~kl~~~g~~~------e~~~~~lg~~~-~e~~r-~~~e 402 (434) T protein:vir:98 352 ---------------------EAEVRWANPAHVTMAVKADAATKLKSIGYPL------DVIAEELDESP-ARVRR-IVAG 402 (434) T ss_pred ---------------------eeeEEecCCCCCCHHHHHHHHHHHHhcCCcH------HHHHHhCCCCH-HHHHH-HHHH Confidence 111111111111123333444444432 221 22344444432 22211 1100 Q ss_pred hcchhhcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 607 VPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQLET 666 (711) Q Consensus 607 ~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~~~~k~qae~~~aqae~~~~q~~~ 666 (711) ...+ .......+. +..+..... ......-.+. T Consensus 403 ~~~~-------------------------~~~~~~~~~-~~~~~~~g~--~~~~~~~~dg 434 (434) T protein:vir:98 403 AASQ-------------------------ALLAASLLP-APGAPSAGN--VPDSGGAVDG 434 (434) T ss_pred HHHH-------------------------HHHHHhhhc-cCCCCCCCC--CCcccCCCCC Confidence 0000 000000000 000000000 0000000000 No 122 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=98.49 E-value=4.7e-07 Score=55.25 Aligned_cols=613 Identities=12% Similarity=0.017 Sum_probs=136.1 Q ss_pred CCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHH-HhCCCc--eEehh---hHHHHHHHh Q lcl|Aclame:pro 21 YAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERE-LEQRPC--LVNNV---LPTFVDQVL 94 (711) Q Consensus 21 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~-~~g~p~--~~~N~---i~~~v~~i~ 94 (711) +.+.+......-.++... +......+..|++|. +..+....+. -.|.+. -.-|+ +-+.|...+ T Consensus 1 ~~k~~~~~~~~~~~~~~~----------~~~~~~~a~~~~~~~-~~~~~~~~~~~y~g~~~~~~~~~~s~~~~~~v~~~v 69 (705) T protein:vir:88 1 MAKRRKIKPMDDEQVLRH----------LDQLVNDALDFNSSE-LSKQRSEALKYYFGEPFGNERPGKSGIVSRDVQETV 69 (705) T ss_pred CCcccccccCCHHHHHHH----------HHHHHHHHHhhhhhH-HHHHHHHHHHHHhCCCCCcccCCCCccccHHHHHHH Confidence 222222222222222222 334455667777764 2221111111 123322 11111 111121111 Q ss_pred hhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCc-cE Q lcl|Aclame:pro 95 GDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESGM-GY 173 (711) Q Consensus 95 g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~-g~ 173 (711) -...-.-..+ -.....+....+-...-+++..-+-.++.....-.....+++.+.+..++ +- T Consensus 70 ~~~~~~l~~~-----------------~~~~~~~~~~~p~~~~D~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~dal~~g 132 (705) T protein:vir:88 70 DWIMPSLMKV-----------------FTSGGQVVKYEPDTAEDVEQAEQETEYVNYLFMRKNEGFKVMFDWFQDTLMMK 132 (705) T ss_pred HHHHHHHHHh-----------------hcCCCceEEEeeCChhHHHHHHHHHHHHhHHHhhccchhHHHHHHHHHHhhcC Confidence 1111000000 00000111112322333332222111211111111111233333333332 11 Q ss_pred EEEEEeeccCCCCCCcceEEEecC-----ccceeeCCCccccCccccceeeeeecCCHHHHHHhcCCcccc--hhhcccc Q lcl|Aclame:pro 174 LRVRSDYLADDSFEQDLIIEAIQN-----QFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAE--PVYEDSV 246 (711) Q Consensus 174 ~~v~~d~~~~~~~~~~i~i~~v~~-----~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~--~~~~~~~ 246 (711) ..|..-+|... -+..++++.. .-.++.||.+.-.+-++-.+..+..+++....+....-...+ ++.-+.. T Consensus 133 ~gi~kv~we~~---~~~~~e~~~~~~~~~l~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i~~V~p~d~~~dp~ 209 (705) T protein:vir:88 133 TGVVKVYVEEV---LKPTFERFSGLSEDMVADILSDPDTSILAQSVDDDGTYTIKIRKDKKKREIKVLCVKPENFLVDRL 209 (705) T ss_pred CeEEEeccccc---cchhhhhhccCChhhhhhhhhhhhhhcccccccccceeeeEEeeeeecCceeeeeccHHHceecCC Confidence 22233223211 1111111110 012333454332222222222211111111111000000000 0000000 Q ss_pred cccccCCCCCeEEEEEeeeeeeeceeEEEcc-CC------cEEEecCcchhHHHHHhcCchh-------hh-hcccceEE Q lcl|Aclame:pro 247 ADYDTWFTEKSVRVSEYFTREPVIREIALLS-DG------RSFWLDALEDIVDELLEAGISI-------VR-TRKVKTFK 311 (711) Q Consensus 247 ~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~-~~------~~~~~~~~~~~~~~~~~~g~~~-------~~-~~~~~~~~ 311 (711) . -+|.+..- +++..+.... .+..++ +. ...+.+. .....+.+..+... .. ......++ T Consensus 210 a--~~~~d~~~--~~~~~~~t~~--dl~~~g~~~~~~~~~~~~~~~~-~~~~~e~~~~~~~d~~~~~~~~~~~~~~~~r~ 282 (705) T protein:vir:88 210 A--TCIDDARF--LCHREKYTVS--DLRLLGVPEDVIEELPYDEYEF-SDSQPERLVRDNFDMTGQLQYNSGDDAEANRE 282 (705) T ss_pred C--CCcccCcE--EEEEEeccHH--HHHhhcCChhHhhhhhcccccc-hhhhhhhccccccccccccccccccccCCcee Confidence 0 00111111 1111111000 000000 00 0000000 00000000000000 00 00111122 Q ss_pred EEE---EEE---ecCceecc-CccCCCCccceEEEEeeeeccCCccc--ccch-HHHhhHHHHHHHHHHHHHHHHHHhcC Q lcl|Aclame:pro 312 TYW---RKI---TGANVLEG-PVEIPSTTIPVIPVWGKSLIIKKKEI--FRSI-IRHSKDAQRMANYWDSAATETVALAP 381 (711) Q Consensus 312 v~~---~~~---~g~~~le~-~~p~~~~~~P~vp~~~~~~~~~~~~~--~~g~-v~~~~d~Q~~~N~~~s~~~~~l~~~~ 381 (711) |++ |+. .|+.+.+- ...|.++.+.-++.++.++|+..... +.++ -..+.+.-.-+....+.+...+.-+. T Consensus 283 v~~~E~y~~~d~~~d~~~~~~~~~~~g~~il~~~~~~~~PF~~~~~~p~~~~~~G~g~~~~~~d~Q~~~n~~~~~~~d~~ 362 (705) T protein:vir:88 283 VWASECYTLLDVDGDGISELRRILYVGDYIISNEPWDCRPFADLNAYRIAHKFHGMSVYDKIRDIQEIRSVLMRNIMDNI 362 (705) T ss_pred EEEEEeeeEecccCCcceeeEEEEEeCccccccccCCCCCEEEecceeecCccccCChHHHHhHHHHHHHHHHHHHHHHH Confidence 211 111 12221110 01122222222222222222211111 1111 12223333333333333332221110 Q ss_pred CCceEecccccCChHHHHhhcccCCCceEEecccccCcCCccc-----cCCccchHHHHHHHHHHHHHHHHHhCCCHHHh Q lcl|Aclame:pro 382 KAPFIGSEGNVEGREDEWEQANTKNFSLLTYIPQYQGDPGPRR-----QPPAAVPAAELTLGQNSVEKIKSTMGMYDASL 456 (711) Q Consensus 382 ~~~~~~~~~av~~~~~~~~~~~~~~~~~i~~~~~~~~~~~i~~-----~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~ 456 (711) ......+.+.+ +.. ..+...+...|| ..+.+ +.+.+.+.-...+. .+++.+...-.... T Consensus 363 --~~~~~~~~~~~-~g~-----v~~~d~~~~~pg----~vv~~~~~~~i~~~~~~~~~~~~~----~ll~~~~~~~~~~t 426 (705) T protein:vir:88 363 --YRTNQGRSVVL-DGQ-----VNLEDLLTNEAA----GIVRVKSMNSITPLETPQLSGEVY----GMLDRLEADRGKRT 426 (705) T ss_pred --HhccCCceecc-ccc-----cCcccccccCCC----eeEEecCCCccccccCCcCcHHHH----HHHHHHHHHHHHhh Confidence 00011111100 000 011111211122 12221 22222222222222 22322222223344 Q ss_pred cccc--chhHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHH----HHHHHHHHhhcCccceEeeecccC-cchheec Q lcl|Aclame:pro 457 GAMG--NETSGRAIIARQRQGDRGSFAFIDNLTK----SIRRVG----KILVEMIPHIYDTERVVRLKFPDE-TEDFVKL 525 (711) Q Consensus 457 G~~~--~~~sg~ai~~~~~~~~~~~~~~~dn~~~----~~~~~~----~~~l~li~~~~~~~r~~ri~g~~~-~~~~v~~ 525 (711) |... .+.++.+......++. +..+...-.. ..+.+. +.+..++... .-... ....+.+ T Consensus 427 Gi~~~~~G~~~~~~~~~~Ta~~--i~~~~~~~~~r~~~~~r~~a~~~~~~l~~~~~~l---------i~~~~~~~~~~ri 495 (705) T protein:vir:88 427 GITDRTRGLDQNTLHSNQAAMS--VNQLMTAAEQQIDLIARMFAETGVKRLFQLLHDH---------AIKYQNQEEVFQL 495 (705) T ss_pred CCchHHcCCCcccccchhhHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---------HHHhCCCceEEee Confidence 4332 1223333222222211 1111111111 111111 1111222111 11111 1111222 Q ss_pred chhhhhhhccceeeeehhhheeeeEEeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCCcchHHHHHHHHh Q lcl|Aclame:pro 526 NEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDWPGADVIAERLKK 605 (711) Q Consensus 526 ~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~~~~~e~~~~l~~ 605 (711) +|..+. +..-..--.-+........-...-+.+..+...+...+.+.. .....++.....+.+.+.. T Consensus 496 --------~g~~v~---v~~~~~~~~~~v~v~v~~~~~~~eq~~a~l~~ll~~~q~l~~--~~~~~~~~~~~~~~~~~~e 562 (705) T protein:vir:88 496 --------RGKWVA---VNPANWRERSDLTVTVGIGNMNKDQQMLHLMRIWEMAQAVVG--GGGLGVLVSEQNLYNILKE 562 (705) T ss_pred --------ccchhc---cchHhhccCCceEEeeccccchHHHHHHHHHHHHHHHHHhhc--ccchhhhcChHHHHHHHHH Confidence 111100 000000000011111111111111122222111111111110 0111222222223222222 Q ss_pred hhcchhhcch-------hhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HH Q lcl|Aclame:pro 606 IVPPNVLSKD-------EREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQLETEEAQKQLA--MI 676 (711) Q Consensus 606 ~~~~~~~~~~-------~~~~~~~~~~~~q~~~~~~q~~~~~~q~~~~k~qae~~~aqae~~~~q~~~~~~q~q~~--~~ 676 (711) .....+.... ...+..+..++..+...+++....++++++++++++.+.+++++...|.+++..+++++ +. T Consensus 563 l~e~~~~k~~~~~~~~~~~~e~~~~~~~~~q~e~~~~~~~~~~q~e~~k~q~e~~~~q~e~q~~q~E~q~~q~e~e~~~~ 642 (705) T protein:vir:88 563 VTENAGYKDPDRFWTNPNSPEALQAKAIREQKEAQPKPEDIKAQADAQRAQSDALAKQAEAQMKQVEAQIRLAEIELKKQ 642 (705) T ss_pred HHHhhhhhhHHHHhhhhhhHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2222111111 11111112222222233333444456666666777766666665554444332222222 22 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccC Q lcl|Aclame:pro 677 EDMAQGGDVVYQQVRELVAQALAEITASQANVTEQ 711 (711) Q Consensus 677 ~~~~q~~~~~~~~~~~~~~~~~~e~~~~qa~~e~Q 711 (711) +..++..+.+.++.+.+.++.+.+..+++.+.+.+ T Consensus 643 ~~~~~~~e~~~~~a~~~~~~~~~e~e~~~~e~e~~ 677 (705) T protein:vir:88 643 EAVLQQREMALKEAELQLERDRFTWERARNEAEYH 677 (705) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22222211122111111121222222222222222 No 123 >protein:vir:3520 Length: 720 # NCBI annotation: P19 # Family: family:all:487 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050980;genbank:gi:9633566;genbank:GeneID:1262313 Probab=98.41 E-value=7.7e-07 Score=54.09 Aligned_cols=628 Identities=10% Similarity=-0.018 Sum_probs=162.0 Q ss_pred HHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCCceEehhhHHHHHH--Hhhhhhhccccee Q lcl|Aclame:pro 28 DRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRPCLVNNVLPTFVDQ--VLGDQRQNRPAIK 105 (711) Q Consensus 28 ~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~v~~--i~g~~~~~r~~~~ 105 (711) --+.+...+.++...+++..++..++... .++...---+.-|...+.+.+ -++..++.+|-+. T Consensus 1 ma~~~~~~l~~~~~~~~~~~~~~~~~r~~---------------~~~d~~f~~~~G~QW~~~~~~~~~~~l~~~~~P~~~ 65 (720) T protein:vir:35 1 MAETLQKRHEQIMRKFDRAHSPQEAVREK---------------CLEATRFARVPGGQWEGATAAGSELGKHFEKYPKFE 65 (720) T ss_pred CchHHHHHHHHHHHHHHHHHhhhHHHHHH---------------HHHHHhhhccCCCCCCHHHHHHHHHHHhhCCCCeEE Confidence 22222222222222222222222222222 222211100112344454444 1234555666443 Q ss_pred EecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCccEEEEEE-eeccCC Q lcl|Aclame:pro 106 VSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESGMGYLRVRS-DYLADD 184 (711) Q Consensus 106 ~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~-d~~~~~ 184 (711) |-=. ........+..-...+-....+.+.+.-+.+.+++..+....-.......+..++...++.+ .++| +...+. T Consensus 66 ~N~i--~~~v~~v~g~~~~nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~-G~G~~~v~~d~ 142 (720) T protein:vir:35 66 INKI--STELNRIISEYRHNRITVKFRPGDKTASEALANKLNGLFRADYEETDGGEACDNAFDDGSTG-GFGCFRLTTNL 142 (720) T ss_pred EccH--HHHHHHHHhHHHhCCCceEEEcCCCcchHHHHHHHHHHHHHHHHhcCchHHHhHHHHHhhhc-cceeEEeeecc Confidence 2110 01112222222222333333455555445667777776665555555555666665555443 2222 211111 Q ss_pred CCCCcceEEEecCccceeeCCCccccCccccceeeeeecCCHHHHHHhcCCccc--chhhcccccc-cccCCCCCeEEEE Q lcl|Aclame:pro 185 SFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATA--EPVYEDSVAD-YDTWFTEKSVRVS 261 (711) Q Consensus 185 ~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~--~~~~~~~~~~-~~~~~~~~~v~v~ 261 (711) ..+.++.. .+..+.+.|-- .+....-|=-..+..+.++.+-.|-..-. +.+....-.+ ...|.. .. T Consensus 143 ~~~~d~~~----~~~~i~i~~v~--~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~~~~~-----~~ 211 (720) T protein:vir:35 143 VNALDPMD----ERQRICLEPIY--DPARSVWFDPDAKKYDKSDAEWAFCMYSLSAEKYKAEYNKDPATLMSG-----IE 211 (720) T ss_pred cccCCCCc----ccceeeEeccc--CchhheeecccccccChhhhhhhhhhcCCCHHHHHHhCCCcccccccc-----cc Confidence 11111110 00011111100 00001111122333444444322211100 0110000000 000000 00 Q ss_pred EeeeeeeeceeEEEccCCcEE--------Ee-cCcchhHHHHHhcCchhhh--hc---cc-ceEEEEEEEEecCceeccC Q lcl|Aclame:pro 262 EYFTREPVIREIALLSDGRSF--------WL-DALEDIVDELLEAGISIVR--TR---KV-KTFKTYWRKITGANVLEGP 326 (711) Q Consensus 262 E~~~~~~~~~~~~~~~~~~~~--------~~-~~~~~~~~~~~~~g~~~~~--~~---~~-~~~~v~~~~~~g~~~le~~ 326 (711) ..|+.++.....+.+....+. .+ +...-............+. .. .+ ..++.+....+...++.+. T Consensus 212 ~~~~~d~~~~~~v~i~E~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~v~~~~~~g~ 291 (720) T protein:vir:35 212 RSWDYDWYDVDVVYIAKYYEVKKESVDVVSFQNPLTSETVTYDSDQLELVEDELADIGFIEAARRTIKRRRVYVSVVDGE 291 (720) T ss_pred ccccccccCCCceEEEEeeEEEEEEEEEEEeecCCCCCeeecCCccHHHHHHHHhhhccccccccceeEEEEEEEeeccc Confidence 111111111111111100000 00 0000000000000000000 00 00 0001011111111111111 Q ss_pred ccCCC-Cccc--eEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChH---HHHh Q lcl|Aclame:pro 327 VEIPS-TTIP--VIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGRE---DEWE 400 (711) Q Consensus 327 ~p~~~-~~~P--~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~---~~~~ 400 (711) .-..+ +.+| .+|+++++-+. ....+......++..=+..=.+.+...-.+ -+++....+.... +.+. T Consensus 292 ~~l~~~~~~p~~~fP~vP~~g~r-~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~------~~~~~~~~~~~~~~a~~~~~ 364 (720) T protein:vir:35 292 GFLEKAQRIPGEHIPLIPVYGKR-WFIDDIERVEGHIAKAMDAQRLYNLQVSML------ADSATQDTGSIPIVGKSQIK 364 (720) T ss_pred hhcccCCCCCCCccceEEEEeee-eccCCCcccceeeecchhHHHHHHHHHHHH------HHHHHcCCccccccCcchHH Confidence 00000 1122 11222111000 000000000111111111111111111000 1111111110000 0000 Q ss_pred h---cccCCCceEEec-ccccC---cCCcc----ccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccc---hhHHH Q lcl|Aclame:pro 401 Q---ANTKNFSLLTYI-PQYQG---DPGPR----RQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGN---ETSGR 466 (711) Q Consensus 401 ~---~~~~~~~~i~~~-~~~~~---~~~i~----~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~---~~sg~ 466 (711) . .+.+++++-... .-... .+.+. .+...+.++-....++........+- ...|.... ..|+. T Consensus 365 ~~~~~~a~~~~~~~~~l~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~----~vsGi~~~~lG~~sn~ 440 (720) T protein:vir:35 365 TLEKYWANRNKNRPAFLPLNEIVDKQGNIIAPPTPVGYTQPQPLNQAMAALLQQTGADIQ----EVTGSSQAMQPMPSNI 440 (720) T ss_pred HHHHHhhccccccccccccccccccCcccccCCCcccccCCCCCchHHHHHHHHHHHHHH----HHhCCChHHcCcccch Confidence 0 000000000000 00000 00000 00011111111222333333333322 22333321 23432 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeee----h Q lcl|Aclame:pro 467 AIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIH----D 542 (711) Q Consensus 467 ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~n----D 542 (711) +--+....-.......+. |-..++...+.+-+++..+.. .--..+..+.|.... .+...+.+| | T Consensus 441 SG~Ai~~rq~qg~~~~~~-~~Dnl~~~~~~~g~~lL~lI~--------~~y~~er~~RI~~ed---~~~~~v~~n~~~~d 508 (720) T protein:vir:35 441 AKETVNHLMHRSDMSSFI-YLDNMAKSLKRAGEVWLSMAR--------EVYGSDRQVRIVNAD---GTDDIALMSVVIND 508 (720) T ss_pred HHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHH--------HHcCCCcEEEEecCC---CCcceEeechhhhc Confidence 111111111111111111 111222222233333332211 111222333332211 112223333 2 Q ss_pred hhhe----eeeEEee---cccChHHHHHH-HHHHHHHHHhhcchhHHHHH--HHHHHh-cCCcchHHHHHHHHhhhcchh Q lcl|Aclame:pro 543 LNVQ----KYDVVVT---TGPAFATQRIE-AAEAMIQFAQAVPSAAAVMA--DLIAQN-MDWPGADVIAERLKKIVPPNV 611 (711) Q Consensus 543 ~~~~----~~dv~v~---~~~~~~s~r~~-~~~~L~~l~~~~p~~~~~~~--~~~~~~-~~~~~~~e~~~~l~~~~~~~~ 611 (711) ...| ..|+++. +..+..-.... ..+.+..+.+.++.+.+... ..++.. +..-......+ ...... T Consensus 509 ~~~g~~v~~NDi~~g~yDv~v~~~p~~~s~req~~~~m~qll~~~~p~~~~~~~~~~~ile~~d~p~~~e----~~erir 584 (720) T protein:vir:35 509 NQTGQVVAMNDLSSGRYDVTVDVGPSYTARRDATVSVLTNLLAGMLPQDPMRQVLQGIILDNMEGEGLDE----FKEYNR 584 (720) T ss_pred cCCCceeeeecceeeeeEEEEecccCcccHHHHHHHHHHHHHHhcCCCchhHHHHHHHHHHhcCchhHHH----HHHHHH Confidence 2112 2455432 23333222232 22333333333322222110 000000 00000010001 111111 Q ss_pred hcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 612 LSKDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQLETEEAQKQLAMIEDMAQGGDVVYQQVR 691 (711) Q Consensus 612 ~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~~~~k~qae~~~aqae~~~~q~~~~~~q~q~~~~~~~~q~~~~~~~~~~ 691 (711) ...+.+...++..++.++..++++.+.+++++++++++++++++|++..+++++....+..+.+.+..++..++...+.. T Consensus 585 k~~~~~~~~~~~~~e~qq~~a~~qq~~qq~~~e~~~aqa~l~qaqae~~kaqa~~~~~qa~a~~aqa~a~~~~a~~~~~~ 664 (720) T protein:vir:35 585 KQLLTQGVVKPRNTEEEQMVAQMIQQAQQPNAELVAAQGVLMQGQAEVQKAKNEELAIQVKAFQAQTEARVAEAKMVQIL 664 (720) T ss_pred hhcchhcccCccChhHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11112222233344445555566666677888888888888888888877777666555544443333332222222211 Q ss_pred HHHH-HHHHHHHHHH----------hhhccC Q lcl|Aclame:pro 692 ELVA-QALAEITASQ----------ANVTEQ 711 (711) Q Consensus 692 ~~~~-~~~~e~~~~q----------a~~e~Q 711 (711) .+.. .+++.+..+. +..++. T Consensus 665 aq~~~~~q~~i~qalq~~~~~q~~q~~~eqa 695 (720) T protein:vir:35 665 ASADSAKRAEIREALKMLHQFQKEQGDASRA 695 (720) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcchHHHH Confidence 1111 1111111111 111111 No 124 >protein:vir:103385 Length: 666 # NCBI annotation: hypothetical protein # Family: family:all:11276 # MgeID: mge:1621 # MgeName: PaP2 # Cross-refs: genbank:acc:YP_024736;genbank:gi:48697078;genbank:GeneID:2846053 Probab=98.29 E-value=1.2e-07 Score=58.53 Aligned_cols=584 Identities=12% Similarity=0.060 Sum_probs=234.5 Q ss_pred CCcCCCCCCCCcccCCCcccCCcCcchHHHHHHHHHH-HHHHHHhhch----HHHHH-HHHHHHHhC--CCCCCHHHH-- Q lcl|Aclame:pro 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARE-RARDGATYWK----DNWEA-AEDDLKFLG--GEQWPSQVR-- 70 (711) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~----~~r~~-~~~~~~~y~--G~Qw~~~~~-- 70 (711) ||-.-..++|... --.+.+.++++ .++...++.. .+... -..|..|.+ --|=..+.+ T Consensus 1 maispsepninsf-------------vytqrvdellkahlkkildfsktnkanyiqKMD~ID~AYARY~~~~~N~~LlG~ 67 (666) T protein:vir:10 1 MAISPSEPNINSF-------------VYTQRVDELLKAHLKKILDFSKTNKANYIQKMDLIDKAYARYITAQENNELLGY 67 (666) T ss_pred CCcCCCCCcchhh-------------hhHHHHHHHHHHHHHHHhhhhccchhhHHHHhhhHHHhHHhhhhccCCCceeee Confidence 5543322222221 00111111111 1122222221 11111 122333321 111111221 Q ss_pred HHHHHhCCCceEehh--hH----HHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHH Q lcl|Aclame:pro 71 TERELEQRPCLVNNV--LP----TFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTG 144 (711) Q Consensus 71 ~~~~~~g~p~~~~N~--i~----~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~ 144 (711) .......-||-+||. +. +.|+.++|+...- -....+..||++ +++.-+-||.|++ T Consensus 68 ~~~~~A~V~C~V~~~~~V~PIViSQV~S~~~YLT~V-----------------F~SG~Pi~PVVS--~P~~K~~AE~LE~ 128 (666) T protein:vir:10 68 NQNIAAKVRCQVVNKATVNPIVISQVQSMTAYLTEV-----------------FASGYPILPVVS--TPDKKEQAEALEG 128 (666) T ss_pred cccccccCcceeeccccCCchhhhhHHHHHHHHHHH-----------------HhcCCccceeec--CCchhHHHHHHHH Confidence 112233445666553 33 3456666664321 112223334443 4556678999999 Q ss_pred HHHHHHhhcCHHHHHHHHHHHHHhcCccEEEEEEeeccC--------CCCCCc----------ceEEEecCccceeeCCC Q lcl|Aclame:pro 145 LIKNIEYNCDAETEYDIAFQGAVESGMGYLRVRSDYLAD--------DSFEQD----------LIIEAIQNQFSVTIDPD 206 (711) Q Consensus 145 ~~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~--------~~~~~~----------i~i~~v~~~~~v~~Dp~ 206 (711) ++..-+....+-...--+++|++++....|+.-|.-... +-..+. -+|+++ +|+++||||. T Consensus 129 ii~DH~t~~~~~~~LiL~L~D~~KYN~~~~ET~Ws~IE~~~~~~~i~~~~~~K~TlrR~~r~~~KIrRL-N~RN~~~D~~ 207 (666) T protein:vir:10 129 IIQDHMTMTSSIPELILCLQDAAKYNLVGWETEWSHIETYDPQKEITDLEPGKTTLRRNYRHVNKIRRL-NLRNVHWDPI 207 (666) T ss_pred HHHhhhhhhhhHHHHHHHHhhhhhcceeeeeeccccccccchhhhhhcCCCceeecccchhhhhhhhcc-ccccccccCC Confidence 998877777777777778889998887666665421110 111111 246666 8999999985 Q ss_pred ccccCcc-ccceeeeeecCCHHHHHHhcCCcccch----------hhcccccccccCCCC----------CeEEEEEeee Q lcl|Aclame:pro 207 AKKRDRS-DMNWCLIDDTMSKEKFKALYPDATAEP----------VYEDSVADYDTWFTE----------KSVRVSEYFT 265 (711) Q Consensus 207 a~~~d~~-Da~~~~~~~~~~~~e~~~~~p~~~~~~----------~~~~~~~~~~~~~~~----------~~v~v~E~~~ 265 (711) .--+|.. ...|+.....+++-.+++...--..+. ....+... .+|... ..+.-+. |- T Consensus 208 ~~~~~VA~~G~~~G~~~L~~R~~LKK~LN~LT~EKkltykkvV~~Al~~s~~~-sD~T~~P~IS~vY~~~~~~SDi~-WD 285 (666) T protein:vir:10 208 PDIPNVATEGSFLGETTLLNRIQLKKYLNYLTNEKKLTYKKVVNEALKSSFQG-SDWTDNPQISPVYQEMEMASDIN-WD 285 (666) T ss_pred CCCCchhhhhhhhhHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHHhhhccc-cccccCCccCccccccchhhccc-hh Confidence 4333332 356888888888877776532111000 00000000 011100 0000000 00 Q ss_pred eeeeceeEEEccCCcEEEecCcchhHHH-----HHhcCch-hhhhcccceEEEEEEE-EecCcee-ccCccCCCCccceE Q lcl|Aclame:pro 266 REPVIREIALLSDGRSFWLDALEDIVDE-----LLEAGIS-IVRTRKVKTFKTYWRK-ITGANVL-EGPVEIPSTTIPVI 337 (711) Q Consensus 266 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~g~~-~~~~~~~~~~~v~~~~-~~g~~~l-e~~~p~~~~~~P~v 337 (711) + +....-...+.|+.+...+....... .+-..+. -+..+. ...+|.++ +.|++++ .++.--.++.||+- T Consensus 286 ~-~G~~~T~~sS~~~rvpvneqg~Y~k~~~Y~RI~PSDF~~~~P~~N--~~QIWK~v~IN~~~iIS~~~~I~AY~~~~~~ 362 (666) T protein:vir:10 286 R-FGGFETETSSTNRRVPVNEQGVYCKHTMYLRIIPSDFEMNVPNRN--QVQIWKAVMINRDAIISFEPYIGAYGSFGMG 362 (666) T ss_pred h-cCcccccccccccccccccccceeeeeeeeeeccccceecCCCCC--cceeeeeeeeccceeEeeehhhhccchhhhh Confidence 0 00000000001110000000000000 0000000 011111 12233343 4466655 23322245666654 Q ss_pred EEEeeeeccCCcccc-cchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHHHHhhcccCCC--ceEEecc Q lcl|Aclame:pro 338 PVWGKSLIIKKKEIF-RSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNF--SLLTYIP 414 (711) Q Consensus 338 p~~~~~~~~~~~~~~-~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~~~~~~~~--~~i~~~~ 414 (711) +++... |+-++- .|+.+...+.|+...++++...-...+....+-++++..+. ....+.|- .-|.+++ T Consensus 363 --~~~~LE-DG~G~QTQ~~~E~~~P~Q~A~t~L~N~~~~~aRRAV~DRAl~~~S~i~------a~~iNSP~~~~KIP~~~ 433 (666) T protein:vir:10 363 --LAFALE-DGMGLQTQGYGEMAAPLQSATTELWNAYIQGARRAVMDRALYNPSMIR------ANDINSPIPQIKIPVVP 433 (666) T ss_pred --hhhhhh-hccccccccccccccchhhhhhHHhhhhhhhhhhhhhhhhccChhhhh------hhcccCCCCCcccceee Confidence 444432 333332 57788899999999988877666655555555555554443 22222222 2344454 Q ss_pred cccCcCCcc--ccCCccchHHHHHHHHH---HHHHHHHHhCCCHHHhccc--cchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 415 QYQGDPGPR--RQPPAAVPAAELTLGQN---SVEKIKSTMGMYDASLGAM--GNETSGRAIIARQRQGDRGSFAFIDNLT 487 (711) Q Consensus 415 ~~~~~~~i~--~~~~~~~~~~~~~ll~~---~~~~~~~~tGv~~~~~G~~--~~~~sg~ai~~~~~~~~~~~~~~~dn~~ 487 (711) ..+...... +-+-|...-+.-+.++. ..+.-++++|++...+|+= +| .|-+.-.-.+-.+..+++...=-+. T Consensus 434 ~sL~N~~~~~~Y~~IPFD~RG~E~~~Q~A~~l~~~~r~L~GMN~~~~GQFQKGN-Kt~~E~~~~MG~a~NR~RLPALiLE 512 (666) T protein:vir:10 434 QSLVNGTMDQAYRQIPFDSRGMETVMQNALMLTDWQRELSGMNSATRGQFQKGN-KTRAEFDTIMGNAENRMRLPALILE 512 (666) T ss_pred hhhcccchhhhhccCCccccchhHHHhhhHHHHhhHHHhhccCCcccccccccC-cceeehhhhcCCcccceehhhHHhh Confidence 443322221 22333344455555543 3445678899999999953 33 2222211222223334333322222 Q ss_pred -HHHHHHHHHHHHHHHhhcCccceEee-ecccCcchheecchhhhhhhccceeeeehhhheeeeEEeecccChHHHHHHH Q lcl|Aclame:pro 488 -KSIRRVGKILVEMIPHIYDTERVVRL-KFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEA 565 (711) Q Consensus 488 -~~~~~~~~~~l~li~~~~~~~r~~ri-~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~ 565 (711) +.+..+-+++.--|-+|-++..++.- +|+-...+.-+ +...-..+.+..|...+++ .+. T Consensus 513 H~~F~~iK~~L~LNl~~YG~DT~ViS~RtG~~~~vDi~~------------------L~~~~L~F~~~DG~TP~SK-~AS 573 (666) T protein:vir:10 513 HRMFTKIKEQLKLNLLMYGEDTEVISPRTGKGVRVDIKE------------------LQDLGLKFELGDGLTPASK-LAS 573 (666) T ss_pred hhhhhhHHHHHhhhhhhccccchhcccccCceeeeeHHH------------------HhhhhheeeeccCCCchhh-hhh Confidence 24444444444444555455444422 22211111111 1111223334555443333 333 Q ss_pred HHHHHHHHhhc-------chhHHHHHHHHHHhcCCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHHHHHHHHHH Q lcl|Aclame:pro 566 AEAMIQFAQAV-------PSAAAVMADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEM 638 (711) Q Consensus 566 ~~~L~~l~~~~-------p~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~ 638 (711) ...|..++++. ...++.+..++.-++.+-+...+.+......++-.+.-..+++.++...+.+++ T Consensus 574 s~~lT~~LQMI~sS~~~~~A~G~~~P~M~AH~~QLGGVRG~E~Y~daalP~~~~~~~~~Q~LQ~~~LQ~~~Q-------- 645 (666) T protein:vir:10 574 SDFLTALLQMIMSSETTLQAFGTQVPGMIAHLAQLGGVRGFEKYADAALPQWQITYGMQQQLQQMLLQLQQQ-------- 645 (666) T ss_pred hHHHHHHHHHHhhhhhhHhhhcccchHHHHHHHHhccccchhhhhhccCCccccccchhHHHHHHHHHHhhh-------- Confidence 34444444432 223344555665565555555555444444443333222221111111110000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 639 AKSQADMAQAEADTAQAQADMLKAQLETEEAQKQL 673 (711) Q Consensus 639 ~~~q~~~~k~qae~~~aqae~~~~q~~~~~~q~q~ 673 (711) ..+|.+.. |. +...++.+--+ T Consensus 646 SA~Q~~A~---------Q~-----~L~~~Q~~PSq 666 (666) T protein:vir:10 646 SAMQLQAR---------QG-----ELSNDQSQPSQ 666 (666) T ss_pred hhcccccc---------cc-----cCcccccCCCC Confidence 00111000 00 11111000000 No 125 >protein:vir:96403 Length: 666 # NCBI annotation: hypothetical protein # Family: family:all:11276 # MgeID: mge:1616 # MgeName: 119X # Cross-refs: genbank:acc:YP_001218810;genbank:gi:147917327;genbank:GeneID:5142606 Probab=98.24 E-value=1.4e-07 Score=58.19 Aligned_cols=584 Identities=12% Similarity=0.056 Sum_probs=233.1 Q ss_pred CCcCCCCCCCCcccCCCcccCCcCcchHHHHHHHHHH-HHHHHHhhch----HHHHH-HHHHHHHhC--CCCCCHHHHH- Q lcl|Aclame:pro 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARE-RARDGATYWK----DNWEA-AEDDLKFLG--GEQWPSQVRT- 71 (711) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~----~~r~~-~~~~~~~y~--G~Qw~~~~~~- 71 (711) ||-.-..++|... --.+.+.++++ .++...++.. .+... -..|..|.+ --|=..+.+. T Consensus 1 maispsepninsf-------------vytqrvdellkahlkkildfsktnkanyiqKMD~ID~AYARY~~~~~N~~LlG~ 67 (666) T protein:vir:96 1 MAISPSEPNINSF-------------VYTQRVDELLKAHLKKILDFSKTNKANYIQKMDLIDKAYARYITAQENNELLGY 67 (666) T ss_pred CccCCCCCcchhh-------------hhHHHHHHHHHHHHHHHhhhhccchhhHHHHhhHHHHhHHhhhhccCCCceeee Confidence 5543322222221 00111111111 1122222221 11111 122333321 1111112211 Q ss_pred -HHHHhCCCceEehh--hH----HHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHH Q lcl|Aclame:pro 72 -ERELEQRPCLVNNV--LP----TFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTG 144 (711) Q Consensus 72 -~~~~~g~p~~~~N~--i~----~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~ 144 (711) ......-||-+||. +. +.|+.++|+...- -....+..||++ +++.-+-||.|++ T Consensus 68 ~~~~~A~V~C~V~~~~~V~PIViSQV~S~~~YLT~V-----------------F~SG~Pi~PVVS--~P~~K~~AE~LE~ 128 (666) T protein:vir:96 68 NQNIAAKVRCQVVNKATVNPIVISQVQSMTAYLTEV-----------------FASGYPILPVVS--TPDKKEQAEALEG 128 (666) T ss_pred cccccccccceeeccccCCchhhhhHHHHHHHHHHH-----------------HhcCCccceeec--CCchhHHHHHHHH Confidence 12233345666553 33 4456666664321 112223334443 4556678999999 Q ss_pred HHHHHHhhcCHHHHHHHHHHHHHhcCccEEEEEEeeccC--------CCCCCc----------ceEEEecCccceeeCCC Q lcl|Aclame:pro 145 LIKNIEYNCDAETEYDIAFQGAVESGMGYLRVRSDYLAD--------DSFEQD----------LIIEAIQNQFSVTIDPD 206 (711) Q Consensus 145 ~~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~--------~~~~~~----------i~i~~v~~~~~v~~Dp~ 206 (711) ++..-+....+-...--+++|++++....|+.-|.-... +-..+. -+|+++ +|+++||||. T Consensus 129 ii~DH~t~~~~~~~LiL~L~D~~KYN~~~~ET~Ws~IE~~~~~~~i~~~~~~K~TlrR~~r~~~KIrRL-N~RN~~~D~~ 207 (666) T protein:vir:96 129 IIQDHMTMTSSIPELILCLQDAAKYNLVGWETEWSNIETYDPQKEITDLEPGKTTLRRNYRHVNKIRRL-NLRNVHWDPI 207 (666) T ss_pred HHHhhhhhhhhHHHHHHHHhhhhhcceeeeeeccccccccchhhhhhcCCCceeeeccchhhhhhhhcc-ccccccccCC Confidence 998877777777777778889998887666665421111 111111 245666 8999999985 Q ss_pred ccccCcc-ccceeeeeecCCHHHHHHhcCCcccch----------hhcccccccccCCCC----------CeEEEEEeee Q lcl|Aclame:pro 207 AKKRDRS-DMNWCLIDDTMSKEKFKALYPDATAEP----------VYEDSVADYDTWFTE----------KSVRVSEYFT 265 (711) Q Consensus 207 a~~~d~~-Da~~~~~~~~~~~~e~~~~~p~~~~~~----------~~~~~~~~~~~~~~~----------~~v~v~E~~~ 265 (711) .--+|.. ...|+.....+++-.+++...--..+. ....+... .+|... ..+.-+. |- T Consensus 208 ~~~~~VA~~G~~~G~~~L~~R~~LKK~LN~LT~EKkltykkvV~~Al~~s~~~-sD~T~~P~IS~vY~~~~~~SDi~-WD 285 (666) T protein:vir:96 208 PDIPNVATEGSFLGETTLLNRIQLKKYLNYLTNEKKLTYKKVVNEALKSSFQG-SDWTDNPQISPVYQEMEMASDIN-WD 285 (666) T ss_pred CCCCchhhhhhhhhhHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHHhhhccc-cccccCCcccccccccchhhccc-hh Confidence 4333332 356888888888877776532111000 00000000 011100 0000000 00 Q ss_pred eeeeceeEEEccCCcEEEecCcchhHHH-----HHhcCch-hhhhcccceEEEEEEE-EecCcee-ccCccCCCCccceE Q lcl|Aclame:pro 266 REPVIREIALLSDGRSFWLDALEDIVDE-----LLEAGIS-IVRTRKVKTFKTYWRK-ITGANVL-EGPVEIPSTTIPVI 337 (711) Q Consensus 266 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~g~~-~~~~~~~~~~~v~~~~-~~g~~~l-e~~~p~~~~~~P~v 337 (711) + +....-...+.|+.+...+....... .+-..+. -+..+. ...+|.++ +.|++++ .++.--.++.||+- T Consensus 286 ~-~G~~~T~~sS~~~rvpvneqg~Y~k~~mY~RI~PSDF~~~~P~~N--~~QIWK~v~IN~~~iIS~~~~I~AY~~~~~~ 362 (666) T protein:vir:96 286 R-FGGFETETSSTNRRVPVNEQGVYCKHTMYLRIIPSDFEMNVPNRN--QVQIWKAVMINRDAIISFEPYIGAYGSFGMG 362 (666) T ss_pred h-cCcccccccccccccccccccceeeeeeeeeeccccceecCCCCC--cceeeeeeeeccceeEeeehhhcccchhhhh Confidence 0 00000000000110000000000000 0000000 011111 12233333 4466655 23322245666654 Q ss_pred EEEeeeeccCCcccc-cchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHHHHhhcccCC--CceEEecc Q lcl|Aclame:pro 338 PVWGKSLIIKKKEIF-RSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKN--FSLLTYIP 414 (711) Q Consensus 338 p~~~~~~~~~~~~~~-~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~~~~~~~--~~~i~~~~ 414 (711) +++... |+-++- .|+.+...+.|+...++++...-...+....+-++++..+. ....+.| ..-|.+++ T Consensus 363 --~~~~LE-DGmG~QTQ~~~E~~~P~Q~A~t~L~N~~~~~aRRAV~DRAl~~~S~i~------a~~iNSP~~~~KIP~~~ 433 (666) T protein:vir:96 363 --LAFALE-DGMGLQTQGYGEMAAPLQSATTELWNAYIQGARRAVMDRALYNPSMIR------ANDINSPIPQIKIPVVP 433 (666) T ss_pred --hhhhhh-hccccccccccccccchhhhhhHHhhhhhhhhhhhhhhhhhcchhhhh------hhcccCCCCCcccceee Confidence 444432 333332 57788899999999988877766655555555555555443 2222222 22344454 Q ss_pred cccCcCCcc--ccCCccchHHHHHHHHH---HHHHHHHHhCCCHHHhccc--cchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 415 QYQGDPGPR--RQPPAAVPAAELTLGQN---SVEKIKSTMGMYDASLGAM--GNETSGRAIIARQRQGDRGSFAFIDNLT 487 (711) Q Consensus 415 ~~~~~~~i~--~~~~~~~~~~~~~ll~~---~~~~~~~~tGv~~~~~G~~--~~~~sg~ai~~~~~~~~~~~~~~~dn~~ 487 (711) ..+...... +-+-|...-+.-+.++. ..+.-++++|++...+|+= +| .|-+.-.-.+-.+..+++...=-+. T Consensus 434 ~sL~N~~m~~~Y~~IPFD~RG~E~~~Q~A~~l~~~~r~L~GMN~~~~GQFQKGN-Kt~~E~~~~MG~a~NRmRLPALiLE 512 (666) T protein:vir:96 434 QSLVNGTMDQAYRQIPFDSRGMETVMQNALMLTDWQRELSGMNSATRGQFQKGN-KTRAEFDTIMGNAENRMRLPALILE 512 (666) T ss_pred hhhhccchhhhhccCCccccchhHHHhhhHHHhhhHHHhhccCCcccccccccC-cceeehhhhcCCcccceehhhHHHh Confidence 443322221 22333344455555543 3445678899999999953 33 2222222222223334333322222 Q ss_pred -HHHHHHHHHHHHHHHhhcCccceEee-ecccCcchheecchhhhhhhccceeeeehhhheeeeEEeecccChHHHHHHH Q lcl|Aclame:pro 488 -KSIRRVGKILVEMIPHIYDTERVVRL-KFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEA 565 (711) Q Consensus 488 -~~~~~~~~~~l~li~~~~~~~r~~ri-~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~ 565 (711) +.+..+-+++.--+-+|-++..++.- +|+-...+.-+ +...-..+.+..|...++ +.+. T Consensus 513 H~~F~~iK~~L~LNl~~YG~DT~ViS~RtG~~~~vDi~~------------------L~~~~L~F~~~DGlTP~S-KlAS 573 (666) T protein:vir:96 513 HRMFTKIKEQLKLNLLMYGEDTEVISPRTGKGVRVDIKE------------------LQDLGLKFELGDGLTPAS-KLAS 573 (666) T ss_pred hhhhhhHHHHHhhhhhhccccchhcccccCceeeeeHHH------------------HhhhhheeeeccCCCchh-hhhh Confidence 24444444444444555455444422 12111111111 111222333455544333 3333 Q ss_pred HHHHHHHHhhc-------chhHHHHHHHHHHhcCCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHHHHHHHHHH Q lcl|Aclame:pro 566 AEAMIQFAQAV-------PSAAAVMADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEM 638 (711) Q Consensus 566 ~~~L~~l~~~~-------p~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~ 638 (711) ...|..++++. ...++.+..++.-++.+-+...+.+....+.++-+..=-.+++ .|+...+.+.+ T Consensus 574 s~~lT~~LQMI~sS~~~~~A~G~~~P~M~AHl~QLGGVRG~E~Y~~~ALPqwqitygm~Q~-------LQ~~~LQ~~~Q- 645 (666) T protein:vir:96 574 SDFLTALLQMIMSSETTLQAFGTQVPGMIAHLAQLGGVRGFEKYANAALPQWQITYGMQQQ-------LQQMLLQLQQQ- 645 (666) T ss_pred hHHHHHHHHHHhcchhhHhhhcccchHHHHHHHHhccccchhhcccccCcchhhhhhhhHH-------HHHHHHHHhhh- Confidence 34444444432 2234445556666666666555555443333322111111111 11100000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 639 AKSQADMAQAEADTAQAQADMLKAQLETEEAQKQL 673 (711) Q Consensus 639 ~~~q~~~~k~qae~~~aqae~~~~q~~~~~~q~q~ 673 (711) ..+|.+ |. |. +...++.+--+ T Consensus 646 SA~Q~~-----A~----Q~-----~L~~~Q~~PSq 666 (666) T protein:vir:96 646 SAMQLQ-----AR----QG-----ELSNDQSQPSQ 666 (666) T ss_pred hccccc-----cc----cc-----cCcccccCCCC Confidence 000100 00 10 11111100000 No 126 >protein:vir:172 Length: 708 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112077;genbank:gi:13559867;genbank:GeneID:920970 Probab=98.23 E-value=2.3e-06 Score=51.51 Aligned_cols=584 Identities=12% Similarity=0.035 Sum_probs=168.2 Q ss_pred CCcCCCCCCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhC----CCCCCHHHHHHHHHh Q lcl|Aclame:pro 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLG----GEQWPSQVRTERELE 76 (711) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~----G~Qw~~~~~~~~~~~ 76 (711) ..-..+.+|.+.-+.|...+. ..+..+.|..+..++..- ++.+...+..|.+ |-=|-.-...+..+- T Consensus 76 v~g~e~~nr~d~~v~p~~~~~---d~~~Ae~l~~l~~~~~~~------~~~~~~~s~Af~~~i~~G~G~~~~~~d~~~e~ 146 (708) T protein:vir:17 76 IIAEYRNNRITVKFRPGDREA---SEELANKLNGLFRADYEE------TDGGEACDNAFDDAATGGFGCFRLTSMLVNEY 146 (708) T ss_pred HHhhHhhCCcceEEecCCCcc---hHHHHHHHHHHHHHHHHh------cCchhHHhHHHHHhhhcccceeeeeecccccC Confidence 111223333333333321110 122333444444433332 2222223333332 322321111111000 Q ss_pred CCCceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHH Q lcl|Aclame:pro 77 QRPCLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAE 156 (711) Q Consensus 77 g~p~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~ 156 (711) + + .....++.+.++. +++..|.+...--+.++.++.=.+.....+...+. T Consensus 147 d--~-----------------~~~~~~i~i~~~~-----------~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~ 196 (708) T protein:vir:17 147 D--P-----------------MDDRQRIAIEPIY-----------DPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYE 196 (708) T ss_pred C--C-----------------CCCccccceEeec-----------cchhheecCccccccChhhhhhhhhhccCCHHHHH Confidence 0 0 0001111222211 00011111111111122221111111111111111 Q ss_pred HHH----HHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEe------cCccceeeCCCccccCccccceeeeeecCCH Q lcl|Aclame:pro 157 TEY----DIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAI------QNQFSVTIDPDAKKRDRSDMNWCLIDDTMSK 226 (711) Q Consensus 157 ~~~----~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v------~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~ 226 (711) ..+ ...+.....+. +++..-+ ...+++..+ .-+.-++.||.+.++ + ....-.. T Consensus 197 ~~yp~~a~~~~~~~~~~~-------~~~~~~~--~d~vrv~e~~~r~~~~~~~~~~~~~~~g~~-------~-~~~~~~~ 259 (708) T protein:vir:17 197 AEYGKKPPASLDVTSMTS-------WEYDWFD--ADVIYIAKYYEVRKESVDVISYRHPITGEI-------A-TYDSDQV 259 (708) T ss_pred HhCccccchhhhhhhhcc-------ccccccC--CCeEEEEEEEEEeeeeeEEEEEecCccCce-------e-eeCccch Confidence 111 01111000000 0110000 012221111 001112234433211 0 0000011 Q ss_pred HHHHHhcCCcccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcc Q lcl|Aclame:pro 227 EKFKALYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRK 306 (711) Q Consensus 227 ~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~ 306 (711) ..+...+.... ...-..+...+.-.+|+.- . |..+. + +......+. T Consensus 260 ~~~~~~~~~~g-----------~~~~~~r~~~r~~v~~~~~--------~--g~~~l-~------------~~~~~p~~~ 305 (708) T protein:vir:17 260 EDIEDELAIAG-----------FQEVARRSVKRRRVYVSVV--------D--GDGFL-E------------KPRRIPGEH 305 (708) T ss_pred hhHHHHHHhcc-----------cccceeeeeeEEEEEEEee--------c--ccccc-c------------CCCCCCCCc Confidence 12222111100 0000011111111222111 1 11000 0 000000000 Q ss_pred cceEEEEEEEEecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceE Q lcl|Aclame:pro 307 VKTFKTYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFI 386 (711) Q Consensus 307 ~~~~~v~~~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~ 386 (711) .. + .-+.|...--+..|.++|.+=.. +.+-+ .+-.....-...-.+.+.-.++++.++...+. T Consensus 306 fP---~--vP~~g~r~~~d~~~~~yG~vr~~-----kd~Q~-------~~N~~~S~~~~~~a~~~~~~~i~~~~a~~g~~ 368 (708) T protein:vir:17 306 IP---L--IPVYGKRWFIDDIERVEGHIAKA-----MDPQR-------LYNLQVSMLADTAAQDPGQIPIVGMEQIRGLE 368 (708) T ss_pred cc---e--EEEecccccccCCCcccchhhhc-----hhHHH-------HHHHHHHHHHHHHHhcCCcceeechhhhhhhH Confidence 01 0 11122211111112111111100 11111 11111111122222222334555555544443 Q ss_pred ecccccCChHHHHhhcccCCCceEEecccccCcCCccccCCccchHHHHHHHHHHHHHH----HHHhCCCHHHhccccch Q lcl|Aclame:pro 387 GSEGNVEGREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKI----KSTMGMYDASLGAMGNE 462 (711) Q Consensus 387 ~~~~av~~~~~~~~~~~~~~~~~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~----~~~tGv~~~~~G~~~~~ 462 (711) ...+...+.+..+.+.+..++.+-.+++++.....++..+.++....++++.....+.+ ....|......|..-+. T Consensus 369 ~~~~~~~~~~~~~~~~~~~~~~~g~v~~~a~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGi~d~~~G~~sn~SG~Ai~~ 448 (708) T protein:vir:17 369 KHWEARNKKRPAFLPLREVRDKYGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNN 448 (708) T ss_pred HhhhhcccchhhhhhhhccCCcccccccccCCcccCCCccccHHHHHHHHHHHHHHHHhcCCChHHccCccchHHHHHHH Confidence 32222222222333334444555555566666666666666777777777666555544 33456544444533221 Q ss_pred ---hHHHHHHHHHHH--------HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecc---cCcchheecchh Q lcl|Aclame:pro 463 ---TSGRAIIARQRQ--------GDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFP---DETEDFVKLNEQ 528 (711) Q Consensus 463 ---~sg~ai~~~~~~--------~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~---~~~~~~v~~~~~ 528 (711) .+..++...... |...+..+.+-|. -.+++++ +-+- ..++.+.|.+. ..+..++.+|.. T Consensus 449 rq~qg~~~~~~~~Dnl~~~~~~~g~~lL~lI~~~y~--~~R~~RI----~~ed-g~~~~v~in~~~~d~~~g~~~~~nDi 521 (708) T protein:vir:17 449 LMNRADMASFIYLDNMAKSLKRAGEVWLSMAREVYG--SEREVRI----VNED-GSDDIAVLSAQVVDRQTGAVVALNDL 521 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcC--CCcEEEE----ecCC-CCcceeeecceeccCCCccceeeccc Confidence 122222222211 2222211111110 0111111 1111 12344444331 112233444331 Q ss_pred hhhhhccceeeeehhhheeeeEEeecccChHHHHH-HHHHHHHHHH---hhcchhHHHHHHHHHHhcCCc-chHHHHHHH Q lcl|Aclame:pro 529 IFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRI-EAAEAMIQFA---QAVPSAAAVMADLIAQNMDWP-GADVIAERL 603 (711) Q Consensus 529 ~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~-~~~~~L~~l~---~~~p~~~~~~~~~~~~~~~~~-~~~e~~~~l 603 (711) . .| ..|+.. +.... .++...+.. ...+.|..+. ++.+.+...++..+ ..-... -.+.+...+ T Consensus 522 ~----~g----~~Dv~v---~~~p~-~~t~r~~~~~~l~qll~~~~~~~~~~~~~~~l~l~~~-D~p~~~ei~e~ir~~~ 588 (708) T protein:vir:17 522 S----VG----RYDVTV---DVGPS-YTARRDATVSVLTNVLSSMLPADPMRPAIQGIILDNI-DGEGLDDFKEYNRNQL 588 (708) T ss_pred e----ee----eeeEEE---ecccC-chhHHHHHHHHHHHHHHhcCCccchhHHHHHHHHHhc-CCCChHHHHHHHHHHh Confidence 1 11 123321 22222 223333222 3333322221 12222222222211 000000 012222222 Q ss_pred Hhhhcch--hhcchhh-hhh-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HH Q lcl|Aclame:pro 604 KKIVPPN--VLSKDER-EAI-EEDMPEQTEPTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQLETEEAQKQL----AM 675 (711) Q Consensus 604 ~~~~~~~--~~~~~~~-~~~-~~~~~~~q~~~~~~q~~~~~~q~~~~k~qae~~~aqae~~~~q~~~~~~q~q~----~~ 675 (711) ....... .+...++ .+. +.++++++..+.+++++..++|+++++++++..+++++..+++.+..+++.++ ++ T Consensus 589 ~~~~~~~~~~~e~~q~~~q~qq~~q~q~~~~~~eaqa~~~~~qAe~~ka~aea~~~q~~a~q~~~~~~~a~~~a~q~~~q 668 (708) T protein:vir:17 589 LISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQ 668 (708) T ss_pred hccccccCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1111111 1111111 111 11112222233345566666777777777777666666655555544433221 23 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc-cC Q lcl|Aclame:pro 676 IEDMAQGGDVVYQQVRELVAQALAEITASQANVT-EQ 711 (711) Q Consensus 676 ~~~~~q~~~~~~~~~~~~~~~~~~e~~~~qa~~e-~Q 711 (711) ..........+..++-...+..++...+++.+.+ ++ T Consensus 669 ~~~~~~~~~~~~~~~l~~~q~~q~q~~~a~p~~~~~~ 705 (708) T protein:vir:17 669 ARNIDDKAVMEAIRLLKDVAESQQQQFQSPPQSPADL 705 (708) T ss_pred HHHHHHHHHHHHHHHhhhhhhhHHHHHhccccCchhc Confidence 3222222333333333344455666777788887 34 No 127 >protein:vir:95821 Length: 763 # NCBI annotation: 94 kDa protein # Family: family:all:1548 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950537;genbank:gi:119952228;genbank:GeneID:5075648 Probab=98.18 E-value=3e-06 Score=50.86 Aligned_cols=590 Identities=13% Similarity=0.052 Sum_probs=172.1 Q ss_pred CCcCCCCCCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCC--------CHHHH-- Q lcl|Aclame:pro 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQW--------PSQVR-- 70 (711) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw--------~~~~~-- 70 (711) |...-+. ++.+.++.+.+++--..++-..+ + |+. ...-.-|+.|++| .+... T Consensus 54 ~~~~~~~-----~~~~~~~~~grs~vv~~~v~~~v-e-----------~~~-~~l~~~f~~~~~~~~~~P~~~~D~~~A~ 115 (763) T protein:vir:95 54 NDLMRIE-----GKAKPPKVKGRSQVQPKLVRRQA-E-----------WRY-SALTEPFLGSNKLFKVTPVTWEDVQGAR 115 (763) T ss_pred HHhhhcc-----ccCcccccCCCccccCHHHHHHH-H-----------HHH-HHHHHhhcCCCcEEEEecCCcchHHHHH Confidence 2111111 22244444455554333322222 2 221 1122346666666 22111 Q ss_pred ------HHHHHhCCC--ceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHH Q lcl|Aclame:pro 71 ------TERELEQRP--CLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVF 142 (711) Q Consensus 71 ------~~~~~~g~p--~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l 142 (711) +++-..... -++.|.++..+..-+|..+ +-|.+.-+.. .........-.+.. .+.+..+ T Consensus 116 q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~gv~k---~~W~~~~~~~---------~~~~~~~~~~~~~~-~~~~~~~ 182 (763) T protein:vir:95 116 QNELVLNYQFRTKLNRVSFIDNYVRSVVDDGTGIVR---VGWNREIRKE---------KQEVPVFSLFPIQT-QEQADAL 182 (763) T ss_pred HHHHHHHHHHhhcCchhhHHHHHHHHHhhcCcceEE---Eeeeeeeeee---------eeeehhhhhccccc-hhHHHHH Confidence 221111111 1233334444433333211 1111110000 00000000000011 1111111 Q ss_pred HHHHHHH-Hhh----cCHHHHHHHHHHHHHhcCccEE---------EEEEeeccCC--------CCCCcceEEE-ecCcc Q lcl|Aclame:pro 143 TGLIKNI-EYN----CDAETEYDIAFQGAVESGMGYL---------RVRSDYLADD--------SFEQDLIIEA-IQNQF 199 (711) Q Consensus 143 ~~~~~~~-~~~----~~~~~~~~~a~~~~~~~G~g~~---------~v~~d~~~~~--------~~~~~i~i~~-v~~~~ 199 (711) ...+..- .+. -+.+.....+.......|.+++ .+........ .|--++.++. +.+.. T Consensus 183 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~p~ie~V~p~d~~iDp~a~sD~~Da~ 262 (763) T protein:vir:95 183 QQALQLRTDNPRGYEENVDEAIKESVRFFDETGQATYAVQTGTTTTEVEVPLANHPTVEMLNPENIIIDPSCQGDINKAM 262 (763) T ss_pred HHHHHhhhhhhccccccccchhhhhhhhccccCcceeeecccceeEEEEEEecCceEEEeecHHHheecCCCCCchhhCc Confidence 1111111 111 1223333344444444454432 2222111110 0001111110 11111 Q ss_pred ceeeCCCccccCccccceeeeeecCCH---HH--HHHhcCCccc---chhhcc-----cccccccCC----CCCeEEEEE Q lcl|Aclame:pro 200 SVTIDPDAKKRDRSDMNWCLIDDTMSK---EK--FKALYPDATA---EPVYED-----SVADYDTWF----TEKSVRVSE 262 (711) Q Consensus 200 ~v~~Dp~a~~~d~~Da~~~~~~~~~~~---~e--~~~~~p~~~~---~~~~~~-----~~~~~~~~~----~~~~v~v~E 262 (711) -+++.-.-+.-|+-+..|.. .+++. ++ ... .++... ..+... ...-+..|. +++.+ .+ T Consensus 263 ~~~~~~~~t~~dL~~~~~~y--~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~d~~~~~V~v~E~y~~~d~~gdg~--~~ 337 (763) T protein:vir:95 263 FAIVSFETCKADLLKEKDRY--HNLNKIDWQSSAPVN-EPDHATTTPQEFQISDPMRKRVVAYEYWGFWDIEGNGV--LE 337 (763) T ss_pred eEeeEEeccHHHHHhccCCc--cccchhcchhccccc-cccccccchhhccCCCcccceEEEEEeeeeeccCCcce--eE Confidence 11111001111111111110 11000 00 000 000000 000000 000011111 12221 11 Q ss_pred eeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEEecCceeccCccCCCCccceEEEEee Q lcl|Aclame:pro 263 YFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGPVEIPSTTIPVIPVWGK 342 (711) Q Consensus 263 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~le~~~p~~~~~~P~vp~~~~ 342 (711) ||+ +. +..+........ + . ..+. +|+-.+|++|..+. T Consensus 338 -~~~------v~-~~g~~iL~~~~~-p-------------------~-------------~~~~--~PFv~~~~~p~~~~ 374 (763) T protein:vir:95 338 -PIV------AT-WIGSTLIRLEKN-P-------------------Y-------------PDGK--LPFVLIPYMPVKRD 374 (763) T ss_pred -EEE------EE-EEcCeeeecccc-c-------------------c-------------cCCC--cCEEEecceeecCc Confidence 111 11 111111111110 0 0 0011 12233455554433 Q ss_pred eeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHH---hcCCCceEecccccCChHHHHhhcccCCCceEEecccccCc Q lcl|Aclame:pro 343 SLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVA---LAPKAPFIGSEGNVEGREDEWEQANTKNFSLLTYIPQYQGD 419 (711) Q Consensus 343 ~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~---~~~~~~~~~~~~av~~~~~~~~~~~~~~~~~i~~~~~~~~~ 419 (711) .+ +.++..-+.+.-+..-.+.|.....+.-..+ ...++.+ ...+.+.+... .-...++|+....... T Consensus 375 ~~---G~gi~~~~~d~Qr~~N~~~~~~~d~l~~~~~~~~~v~~gav-~~~d~~~~~pg--~v~~v~~g~~~~~~~~---- 444 (763) T protein:vir:95 375 MY---GEPDAELLGDNQAVLGAVMRGMIDLLGRSANGQRGMPKGML-DALNSRRYREG--EDYEYNPTQNPAQMII---- 444 (763) T ss_pred cc---CCchHHHhhHHHHHHHHHHHHHHHHHHhhcCCcEEeecccc-cchhhhcccCC--ceEEeeCCCChhhhcc---- Confidence 21 3333333333333333334433322221100 1122222 11222211100 0011223332221111 Q ss_pred CCccccCCccchHHHHHHHHHHHHHHHHH----hCCCHHHhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 420 PGPRRQPPAAVPAAELTLGQNSVEKIKST----MGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGK 495 (711) Q Consensus 420 ~~i~~~~~~~~~~~~~~ll~~~~~~~~~~----tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~ 495 (711) ...+-+.++-+..++++++...+.+-.+ .|++....|.+.++.++.. ++.+.+...-+..+.+.+....+.+.. T Consensus 445 -~~~~p~~~~~~~~~l~~~~~~~e~~TGv~~~~~G~~~~~~~~tat~v~~l~-qa~~~~~~~~~r~~~~~~k~l~~~~l~ 522 (763) T protein:vir:95 445 -EHKFPELPQSALTMATLQNQEAESLTGVKAFAGGVTGESYGDVAAGIRGVL-DAASKREMAILRRLAKGMSEIGNKIIA 522 (763) T ss_pred -cccCCCCcchHHHHHHHHHHHHHHhhCcchhhcCcCcccccchhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1112234556667777777666655444 4777777787777777753 444555666778888888888888888 Q ss_pred HHHHHHHhh----cCccceEeeecccCcchh-eecchhhhhhhccceeeeehhhheeeeEEeecccChHHHHHHHHHHHH Q lcl|Aclame:pro 496 ILVEMIPHI----YDTERVVRLKFPDETEDF-VKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMI 570 (711) Q Consensus 496 ~~l~li~~~----~~~~r~~ri~g~~~~~~~-v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~ 570 (711) ++......- .+.+..+.|..++-..+| |.+.. ..... .. ++.+.+.+|. T Consensus 523 Li~q~~d~~rviRI~g~e~v~v~~~~~~~~~DV~V~~-----------------------~~as~-~~--q~~~~l~~ll 576 (763) T protein:vir:95 523 MNAVFLAEHEVVRITNEEFVTIKREDLKGNFDLEVDI-----------------------STAEV-DN--QKSQDLGFML 576 (763) T ss_pred HHHhhCCCCcEEEEeCCccccccHHHhcCCcceEEec-----------------------ccchH-HH--HHHHHHHHHH Confidence 877643221 011122333322111111 11100 00000 11 1233333333 Q ss_pred H-HHhhc-chhHHHHHHHHHHhcCCcchHHHHHHHHhhhcchhhcchhhhhh----hhhHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 571 Q-FAQAV-PSAAAVMADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAI----EEDMPEQTEPTPEQQVEMAKSQAD 644 (711) Q Consensus 571 ~-l~~~~-p~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~----~~~~~~~q~~~~~~q~~~~~~q~~ 644 (711) + +.+.. +.+...++..+.+..+++...+-.+.......+....+.+.+.. +....+++.+..+.+++...++++ T Consensus 577 ~~l~~~~~~~~~~~il~~~~d~~~~~~~~~~lr~~q~~~d~~~q~qaqle~~~~q~e~~~~~akaq~~qaqa~~~~aq~e 656 (763) T protein:vir:95 577 QTIGPNVDQQITLNILAEIADLKRMPKLAHDLRTWQPQPDPVQEQLKQLAVEKAQLENEELRSKIRLNDAQAQKAMAERD 656 (763) T ss_pred HHhccccChHHHHHHHHHHHhhhchhhhHHHHHhcCCCccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3 33333 23334455555555555554444333332222222222222111 111111111112222222333333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHhhhccC Q lcl|Aclame:pro 645 MAQAEADTAQAQADMLKAQLETEEAQKQLAM-IEDMAQGGDVVYQQVRELVAQALAEITA----SQANVTEQ 711 (711) Q Consensus 645 ~~k~qae~~~aqae~~~~q~~~~~~q~q~~~-~~~~~q~~~~~~~~~~~~~~~~~~e~~~----~qa~~e~Q 711 (711) +++++++.+..+.. +..+.+..+++.++.. ++.++... .+...++....+++... ..+.+.-. T Consensus 657 ~~~~d~~~~e~~~Q-~~~e~~~~~~~~eaq~~l~~~~a~~---~~~~ea~~~~~~~~~~~~~~~~~~~~~~~ 724 (763) T protein:vir:95 657 NKNLDYLEQESGTK-HARDLEKMKAQSQGNQQLEITKALT---KPRKEGELPPNLSAAIGYNALTNGEDTGI 724 (763) T ss_pred HHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHhccChhHHHhhhhcccccccCCCc Confidence 33333322221111 1111111111111111 11111000 01111111111111111 11111111 No 128 >protein:vir:100920 Length: 725 # NCBI annotation: Gp1 # Family: family:all:487 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006406;genbank:gi:46358698;genbank:GeneID:2777070 Probab=97.91 E-value=1.1e-05 Score=47.73 Aligned_cols=629 Identities=11% Similarity=-0.013 Sum_probs=188.3 Q ss_pred chHHHHHHHHHHHHhCC--CCCCHHHHHHHHHhCCCceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhccccc Q lcl|Aclame:pro 46 WKDNWEAAEDDLKFLGG--EQWPSQVRTERELEQRPCLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDT 123 (711) Q Consensus 46 ~~~~r~~~~~~~~~y~G--~Qw~~~~~~~~~~~g~p~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~ 123 (711) ..+.++...+.+.+|.= +.++++-...++.... +.-|...+.+..++ ..+.||-+...+ ...+...+..- T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~f--y~G~QW~~~~~~~l--~~q~rp~~N~i~----~~v~~v~g~e~ 72 (725) T protein:vir:10 1 MADNENRLESILSRFDADWTASDEARREAKNDLFF--SRVSQWDDWLSQYT--TLQYRGQFDVVR----PVVRKLVSEMR 72 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHh--hcCCCCCHHHHHHH--HhcCCCcccchH----HHHHHHHhhHH Confidence 33334333333333321 1122221222222111 11234444555544 223444322222 12222233333 Q ss_pred ccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCccEEEEEEeeccCCCCCCcce-EEEecCcccee Q lcl|Aclame:pro 124 TLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLI-IEAIQNQFSVT 202 (711) Q Consensus 124 ~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~-i~~v~~~~~v~ 202 (711) ...+-....+.+. .-..+..++..+....-...-...+..++...++.+ .++|-....+-.+.+.. .+-......++ T Consensus 73 ~nr~d~~v~p~~~-~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~-G~G~~ev~~d~~~~d~~~~~~~i~~~~i~ 150 (725) T protein:vir:10 73 QNPIDVLYRPKDG-ASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEA-GVGAWRLVTDYEDQSPTSNNQVIRREPIH 150 (725) T ss_pred hCCcceEEecCCc-chHHHHHHHHHHHHHHHHhcCcchHHhHHHHHHhhc-CcceeeeeccccCCCCCCCceeeeeeecc Confidence 3333344456443 334455555555544433333344444443333222 12221000000000000 00000000111 Q ss_pred eCCCccccCccccceeeeeecCCHHHHHH----hcCCcc-cchhhcccccccccCCCCCeE-EEEEeeeeeeeceeEEEc Q lcl|Aclame:pro 203 IDPDAKKRDRSDMNWCLIDDTMSKEKFKA----LYPDAT-AEPVYEDSVADYDTWFTEKSV-RVSEYFTREPVIREIALL 276 (711) Q Consensus 203 ~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~----~~p~~~-~~~~~~~~~~~~~~~~~~~~v-~v~E~~~~~~~~~~~~~~ 276 (711) +||...=+| -..+..++++.+= ++-+.. .+.+......+...|...... ....-|+... ..++..+ T Consensus 151 ~~~~~v~~D-------p~a~~~D~sDar~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~-~vrv~E~ 222 (725) T protein:vir:10 151 SACSHVIWD-------SNSKLMDKSDARHCTVIHSMSQNGWDDFAEKYDLDADNIPSFQNPNDWVFPWLTQD-TIQIAEF 222 (725) T ss_pred cCHhHcccC-------chhhccChhhhhhhhhhccCCHHHHHHHHHhCCCcccccccccccccccccccCCC-eEEEEEE Confidence 111111111 1223333333321 111110 011111111111111110000 0011122211 1111110 Q ss_pred ----cCCcEE-Ee-cCcchhHHHHHhc---Cc--hhhhhc--ccceEEEEEEEEecCceeccCccCCC---CccceEEEE Q lcl|Aclame:pro 277 ----SDGRSF-WL-DALEDIVDELLEA---GI--SIVRTR--KVKTFKTYWRKITGANVLEGPVEIPS---TTIPVIPVW 340 (711) Q Consensus 277 ----~~~~~~-~~-~~~~~~~~~~~~~---g~--~~~~~~--~~~~~~v~~~~~~g~~~le~~~p~~~---~~~P~vp~~ 340 (711) .....+ .. +...-........ +. ...... .+..+++....+.-..+ .+..-... -.+.++||+ T Consensus 223 ~~r~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~-~g~~~l~~~~~~~~~~fP~v 301 (725) T protein:vir:10 223 YEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSII-TCTAVLKDKQLIAGEHIPIV 301 (725) T ss_pred EEEEEEeeEEEEeccCCCCceeecchhhhHHHHHHhhcccchhhhhccceeeEEEEEee-cchhhhcCCCCCCCCceeEE Confidence 000000 00 1100000000000 00 000000 01111111111111111 11111111 112235665 Q ss_pred eeeeccCCcccccchHHHhhHHHHHHHHHHHHHHH-HHHhcCCCceEecccccCChHHHHhhcccCCCceEEecccccC- Q lcl|Aclame:pro 341 GKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATE-TVALAPKAPFIGSEGNVEGREDEWEQANTKNFSLLTYIPQYQG- 418 (711) Q Consensus 341 ~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~-~l~~~~~~~~~~~~~av~~~~~~~~~~~~~~~~~i~~~~~~~~- 418 (711) +++-+.. ...+...+..++..=+..=.+.+...- .+...+..+-....+....+++. .+.+.++ ....+-..... T Consensus 302 P~~g~r~-~~~g~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~-e~~~~~~-~~~~~~~~~~~~ 378 (725) T protein:vir:10 302 PVFGEWG-FVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGF-EHMYDGN-DDYPYYLLNRTD 378 (725) T ss_pred EEEeeee-ccCCcceeeeeeccchhHHHHHHHHHHHHHHHHHhcCCccccccHhhhhHH-HHHHhcc-CCceeeeccccc Confidence 4332110 011111122333333333333333322 23333334444444444444432 2222211 11111000000 Q ss_pred --cCC--ccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 419 --DPG--PRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVG 494 (711) Q Consensus 419 --~~~--i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~ 494 (711) ++. ...+.....++-...+++........+-=++...-...|..+++.+--+....-......... |-..++.-. T Consensus 379 ~~~g~~~~~~i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~l~~-~~Dnl~~~~ 457 (725) T protein:vir:10 379 ENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYV-FQDNLATAM 457 (725) T ss_pred ccCcccccccCcccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhCcCchhhHHHHHHHHHHHHHHHHHH-HHHHHHHHH Confidence 000 112222233333335555555444444322222211222333333222222222222222222 112222233 Q ss_pred HHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehh----hhee----eeEE--eecccChH-HHHH Q lcl|Aclame:pro 495 KILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDL----NVQK----YDVV--VTTGPAFA-TQRI 563 (711) Q Consensus 495 ~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~----~~~~----~dv~--v~~~~~~~-s~r~ 563 (711) +.+-+++..+.. .+. .....+.|.... .+-..+.+|.. ..|+ -|++ .+...+.. +.-. T Consensus 458 ~~~g~~lL~lI~-----~~~---~~er~~RI~~ed---g~~~~v~in~~~~d~~~G~~v~~Ndi~g~~Dv~v~~~p~~~s 526 (725) T protein:vir:10 458 RRDGEIYQSIVN-----DIY---DVPRNVTITLED---GSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQS 526 (725) T ss_pred HHHHHHHHHHHH-----HHc---CCCcEEEEecCC---CCcceeEeccccccccccchhhhhccccceeEEEeeccCcHH Confidence 333333333321 111 122333332211 11234444432 1222 1221 12222222 2222 Q ss_pred HHHHHHHHHHhhcchhHHHHHH---HHHHhcCCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 564 EAAEAMIQFAQAVPSAAAVMAD---LIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAK 640 (711) Q Consensus 564 ~~~~~L~~l~~~~p~~~~~~~~---~~~~~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~ 640 (711) ...+.+..|.+.++.+++.... +++..+++++.....+.......+..+.....+.. +.++++..++++++.++ T Consensus 527 ~r~~~~~~l~qll~~~~~~~~~~~~~l~~~~~~~d~~~~~e~~erirkq~~~~~~~~~~~---~e~~q~~~e~qq~~~~q 603 (725) T protein:vir:10 527 MKQQNRSEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPET---PEEQQWLVEAQQAKQGQ 603 (725) T ss_pred HHHHHHHHHHHHHHhccccchhHHHHHHHHhhcCCchhHHHHHHHHHhhhhhhccCCccc---cchhHHHHHHHHHHHhh Confidence 2333333444444333332222 22333444444444443333332222222222111 22222333344555667 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHhhhccC Q lcl|Aclame:pro 641 SQADMAQAEADTAQAQADMLKAQLETEEAQKQLAMIEDMAQGGDVVYQQVRELV----AQALAEITASQANVTEQ 711 (711) Q Consensus 641 ~q~~~~k~qae~~~aqae~~~~q~~~~~~q~q~~~~~~~~q~~~~~~~~~~~~~----~~~~~e~~~~qa~~e~Q 711 (711) +++++.++++++.++++++.+++++..+++.++++.+..++...+...++..+. ....++.....+.+++| T Consensus 604 ~~~e~~q~~~~~~~~qae~~ka~aE~~k~~~~a~~~~~~a~~~a~~~~~~~~q~~~~q~~~~~~~~~~~~~~q~~ 678 (725) T protein:vir:10 604 QDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQD 678 (725) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHH Confidence 777777788888888888888888777777666665555443333322221111 11111111111112222 No 129 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=97.67 E-value=3e-05 Score=45.39 Aligned_cols=443 Identities=8% Similarity=-0.033 Sum_probs=179.9 Q ss_pred CCcCcchHHHHHHHHHHH---HHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCCceEehhhHHHHHHHhhhh Q lcl|Aclame:pro 21 YAKNNDDDRALLATARER---ARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRPCLVNNVLPTFVDQVLGDQ 97 (711) Q Consensus 21 ~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~v~~i~g~~ 97 (711) ++-+.... .+...... ++.++.....+|.....-+-.+.|..+.. -+.+-.-...+|.++.+|+.++|.. T Consensus 1 m~V~~~hp--~y~a~~~~W~~~rd~~~G~~~~r~~g~~YLpk~~~E~~~~-----Y~~rl~rA~~~n~~~~t~~~~~G~v 73 (452) T protein:vir:94 1 MPIETKHP--EYLAYENDWIDCRVASLGQREVKKKGVRFLPKLSGQTDDM-----YNAYKQRALFYSITSKTLSALSGMV 73 (452) T ss_pred CCCCCcCH--HHHHHHHHHHHHHHHhcChHHHHcCCcccCCCCCCCCHHH-----HHHHHhhccCCchHHHHHHHHhchh Confidence 33221110 01222222 22233333333221111122223443322 2333233556799999999999986 Q ss_pred hhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCccEEEEE Q lcl|Aclame:pro 98 RQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESGMGYLRVR 177 (711) Q Consensus 98 ~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~ 177 (711) =+..+.+.+ | +.+..+ ....+-++.......++..++.+|.+++=| T Consensus 74 f~k~p~~~~-p-------------------------------~~l~~~-~~D~~G~~L~~~~~~~~~~~l~~G~~~ilV- 119 (452) T protein:vir:94 74 LDQPPVITH-P-------------------------------DAMSKY-FEDQSGIQFYEVFTRAVEETLLMGRVGVFI- 119 (452) T ss_pred hcCCceecc-c-------------------------------HHHHHH-HhcccCCCHHHHHHHHHHHHHhcCeEEEEE- Confidence 444333321 1 112222 112456788999999999999999877655 Q ss_pred EeeccCCCCCCcceEEEecCccceeeCCCccccCccccceeeeeecCCHHHHHHhcCCcccchhhcccccccccCCCCCe Q lcl|Aclame:pro 178 SDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSVADYDTWFTEKS 257 (711) Q Consensus 178 ~d~~~~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~ 257 (711) |+- ..+.++.+..| +|.+|+ |..... +.+...++. ++..-.. +. .+.|. .+. T Consensus 120 -D~p---~~g~rPy~~~~-~~~~Ii-~W~~~~----~g~l~~v~l-------re~~~~~--------d~--~d~f~-~~~ 171 (452) T protein:vir:94 120 -DRP---LTGGDPYISVY-TTENIL-NWEEDE----DGRLLMVVL-------REFYTVR--------DT--ADRYV-QNI 171 (452) T ss_pred -eec---cCCCceEEEEe-chhhhc-Cccccc----cCCeeEEEE-------EEEEEEe--------cC--CCccc-cee Confidence 442 23467888888 677777 332211 122111110 0000000 00 01111 122 Q ss_pred EEEEEeeeeeeeceeEEE--ccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEEecCceeccCccCCCCccc Q lcl|Aclame:pro 258 VRVSEYFTREPVIREIAL--LSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGPVEIPSTTIP 335 (711) Q Consensus 258 v~v~E~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~le~~~p~~~~~~P 335 (711) +..+-+|...+....|.. ..++..+. .+...+.+...-+.+.+| T Consensus 172 ~~~yRvL~l~~g~~~v~~~~~~~~~~~~----------------------------------~~~~~~~~~~~~~l~~IP 217 (452) T protein:vir:94 172 RVRYRCLELVDGLLQITVHETQDGKVWE----------------------------------LAKTSTIQNVGVTMDYIP 217 (452) T ss_pred EEEEEEEEEeCCeEEEEEEEccCCceee----------------------------------eccceeecCCCcccceeE Confidence 222111211111111111 11111100 000011111223456788 Q ss_pred eEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHHHHhhcccCCCceEEeccc Q lcl|Aclame:pro 336 VIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNFSLLTYIPQ 415 (711) Q Consensus 336 ~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~~~~~~~~~~i~~~~~ 415 (711) ||+++.... +...+.+..-.+-+++.......+-..+++..+..|..++. |. ++... ...-++.++.+- . T Consensus 218 ~v~~~~~~~---~~~~~~pPLl~LA~ln~~hy~~~sd~~~~l~~~~~P~l~~~-g~-~~~~~----i~iG~~~~~~lp-e 287 (452) T protein:vir:94 218 FFCITPSGL---SMTPAKPPMIDIVDINYSHYRTSADLEHGRHFTGLPTPWIT-GA-ESQST----MHIGSTKAWVIP-E 287 (452) T ss_pred EEEEcCCCC---CCCCCccchHHHHHHHHHHhcchhHHHHHHHHcccceeEee-cC-cCCCc----eEecccccccCC-C Confidence 876644331 23344566778888888888888889999999988877764 22 11111 112223333332 1 Q ss_pred ccCcCCccccCCccch-HHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 416 YQGDPGPRRQPPAAVP-AAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVG 494 (711) Q Consensus 416 ~~~~~~i~~~~~~~~~-~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~ 494 (711) .+..+.++.+..-+ .....-|+...+.|.. .|. ....+.....+|+.+......+....|..+..++..+. T Consensus 288 --~~~~~~yie~~g~~i~~~~~~l~~le~~m~~-~Ga-~ll~~~~~~~~s~ea~~~~~~~~~s~L~~~a~~~e~al---- 359 (452) T protein:vir:94 288 --VAAKVGFLEFTGQGLQSLEKALSEKQAQLAS-LSA-RLIDNSTRGSEATETVKLRYMSETASLKSVTRAVEALL---- 359 (452) T ss_pred --CCCcceEEccCchhHHHHHHHHHHHHHHHHH-HHH-HhhccCCCcchHHHHHHHHHHHhhHHHHHHHHHHHHHH---- Confidence 11235555533222 1223333444444433 232 22233334456776665555444567777777776665 Q ss_pred HHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEeecccChHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 495 KILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQ 574 (711) Q Consensus 495 ~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~ 574 (711) ..+|.++..|.... ... -+.+|. .|.. ..-.+... ++|.++.. T Consensus 360 ~~~l~~~a~w~g~~---------~~~-~v~~n~-------------------dF~~----~~~~~~~~----~al~~~~~ 402 (452) T protein:vir:94 360 NKAYSCIMDMESMG---------GTL-NIKLNS-------------------AFLD----SKLTAAEL----KAWVEAYL 402 (452) T ss_pred HHHHHHHHHHcCCC---------Cce-EEEecc-------------------cccc----ccCCHHHH----HHHHHHHh Confidence 45566667765421 110 122221 1110 00011111 12222221 Q ss_pred hcchhHHHHHHHHHHhcCCcchHHHHHHHHhhhcchhhc---chhhhhhhh Q lcl|Aclame:pro 575 AVPSAAAVMADLIAQNMDWPGADVIAERLKKIVPPNVLS---KDEREAIEE 622 (711) Q Consensus 575 ~~p~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~---~~~~~~~~~ 622 (711) . ..+....+...++...+...+.-.+.+....+.+++. .+....-.. T Consensus 403 ~-G~is~~t~~~~L~~~gvl~~~~e~~~i~~E~~~~~~~~~~~~~~~~~~~ 452 (452) T protein:vir:94 403 S-GGISKEIYIHALKVGKVLPPPGESMGVIPDPPAPEPSPSNTPPNPSSKA 452 (452) T ss_pred c-CCCcHHHHHHHHHhCCCCCCccCHHHHHHHhhccCcccCCCCCCCccCC Confidence 1 0111111111222222222211112222222211111 111111001 No 130 >protein:vir:108295 Length: 711 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552284;genbank:gi:160700609;genbank:GeneID:5758811 Probab=97.67 E-value=3e-05 Score=45.35 Aligned_cols=621 Identities=13% Similarity=0.012 Sum_probs=188.8 Q ss_pred cCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHh-CC-CCCCHHHHHHHHHhCCCceEehhhHHHHH Q lcl|Aclame:pro 14 YAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFL-GG-EQWPSQVRTERELEQRPCLVNNVLPTFVD 91 (711) Q Consensus 14 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y-~G-~Qw~~~~~~~~~~~g~p~~~~N~i~~~v~ 91 (711) -.||-+..+....- ...++. .-....+.+....+.+.+| .. ..|.++-...++.... +.-|...+.+. T Consensus 1 ~~~~~~~~~~~~~~----~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f--y~G~Qw~~~~~ 70 (711) T protein:vir:10 1 MAKKQKKSRVEQLY----AKKAKV----YAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKF--LGGEQWPSQVR 70 (711) T ss_pred CCcccccccccchh----HHHHHh----cccCcchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHH--hCCCCCCHHHH Confidence 22333322211111 111111 1111122233333333322 22 2233332222222110 01134445555 Q ss_pred HHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHH------------------HHHHHHHHHHHHhhc Q lcl|Aclame:pro 92 QVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYEL------------------AEVFTGLIKNIEYNC 153 (711) Q Consensus 92 ~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~------------------Ae~l~~~~~~~~~~~ 153 (711) +++-...+....+...+.. +...-...-.+-+...+.+++.+.... -.-+.+++..+.... T Consensus 71 ~~l~~~g~p~~~~N~i~~~-v~~v~g~~~~nr~~~~v~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~ 149 (711) T protein:vir:10 71 TERELEQRPCLVNNVLPTF-VDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNI 149 (711) T ss_pred HHHHhcCCCcEEEcchHHH-HHHHhhhHhhCCcceEEecccccchhhhhhhhccccccccCCChhHHHHHHHHHHHHHHH Confidence 5544333332333322221 222222333444555566665433221 123334443333322 Q ss_pred CHHHHHHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCccceeeCCCccccCccccceeee------------- Q lcl|Aclame:pro 154 DAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLI------------- 220 (711) Q Consensus 154 ~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~------------- 220 (711) -.......+..++...++.+ .+ -|.++++|+.+. |..+-.+++. T Consensus 150 ~~~~~~~~~~s~af~d~~~~-G~--------------------G~~ev~~d~~~~--d~~~~e~~i~~v~~p~~v~~Dp~ 206 (711) T protein:vir:10 150 EYNCDAETEYDIAFQGAVES-GM--------------------GYLRVRSDYLAD--DSFEQDLIIEAIQNQFSVTIDPD 206 (711) T ss_pred HHhcChhHHHHHHHHHhhhc-Cc--------------------ceEEEEecccCC--CCCCCCeEEeeecChhheeeCcc Confidence 22222222333333333221 11 222344443221 1111222221 Q ss_pred eecCCHHHHHHhcCCccc--chhhcc--cccccccCCCCCeEEEEEeeeeeeeceeEEEc--cCCcEEEecCcchhHHHH Q lcl|Aclame:pro 221 DDTMSKEKFKALYPDATA--EPVYED--SVADYDTWFTEKSVRVSEYFTREPVIREIALL--SDGRSFWLDALEDIVDEL 294 (711) Q Consensus 221 ~~~~~~~e~~~~~p~~~~--~~~~~~--~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~ 294 (711) .+..+.++++-.|-..-. +.+... ....... ....+.-...|+.+ ...++..+ .......+-...+..-. T Consensus 207 a~~~D~sDar~~~~~~~~~~~~~~~~yp~~a~~~~--~~~~~~~~~~~~~~-~~vrv~E~~~r~~~~~~~~~~~~~~~~- 282 (711) T protein:vir:10 207 AKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPV--YEDSVADYDTWFTE-KSVRVSEYFTREPVIREIALLSDGRSF- 282 (711) T ss_pred ccccChhhhcceeeeecCCHHHHHHhCCchhhhhh--hcccccccCcccCc-ceeeEEEEEeeeeeeeEEEeecCCcee- Confidence 122233333211110000 000000 0000000 00000001113222 11111110 00000000000000000 Q ss_pred HhcCchhh----hhc---ccceEEEEEEEEecCceeccCccCCC-Cccce--EEEEeeeeccCCcccccchHHHhhHHHH Q lcl|Aclame:pro 295 LEAGISIV----RTR---KVKTFKTYWRKITGANVLEGPVEIPS-TTIPV--IPVWGKSLIIKKKEIFRSIIRHSKDAQR 364 (711) Q Consensus 295 ~~~g~~~~----~~~---~~~~~~v~~~~~~g~~~le~~~p~~~-~~~P~--vp~~~~~~~~~~~~~~~g~v~~~~d~Q~ 364 (711) ...+...+ ... .+..+.+..+..... +..+..-+.. ..||. +||++++-+. ....+.|....+...=+ T Consensus 283 ~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~-~~~G~~~L~~~~p~~~~~~P~vp~~g~r-~~~d~~~~~~G~vr~~~ 360 (711) T protein:vir:10 283 WLDALEDIVDELLEAGISIVRTRKVKTFKTYWR-KITGANVLEGPVEIPSTTIPVIPVWGKS-LIIKKKEIFRSIIRHSK 360 (711) T ss_pred ccCcchhHHHHHHhcCchhhhhhhhceeeEEEE-EEecceeecCCCCCCCCcccEEEEeeee-eccccccccchhhhhhh Confidence 00000000 000 000011100000000 1112111111 11222 4444322110 00112233333222222 Q ss_pred HHHHHHHHHHHHHHhcCCCceEecccccCChHHHHhhcccCCCceEEeccccc-------------CcCCcc-------- Q lcl|Aclame:pro 365 MANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNFSLLTYIPQYQ-------------GDPGPR-------- 423 (711) Q Consensus 365 ~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~~~~~~~~~~i~~~~~~~-------------~~~~i~-------- 423 (711) ..=.+.+...-.+. +... ..+.+-+.+..|.. ++.-+. T Consensus 361 d~Qr~~N~~~s~~~------------------~~l~---~~~~~~~~~~~gai~~~~~~~~e~~~~~~~vi~~~~~~~~~ 419 (711) T protein:vir:10 361 DAQRMANYWDSAAT------------------ETVA---LAPKAPFIGSEGNVEGREDEWEQANTKNFSLLTYIPQYQGD 419 (711) T ss_pred hhHHHHHHHHHHHH------------------HHHH---hcCCCceeecCcccCChHHHHHhccccCCCeeEecccccCc Confidence 22222222221111 1111 11111111111111 111111 Q ss_pred -ccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 424 -RQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIP 502 (711) Q Consensus 424 -~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~ 502 (711) .+...+.++-....+++.......+.-++..+....|..+++.+-.+....-......+.. +.+.++...+.+.+++. T Consensus 420 ~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGi~~~~~G~~~n~~Sg~ai~~~q~qg~~~l~~-~~dn~~~~~~~~g~~ll 498 (711) T protein:vir:10 420 PGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFA-FIDNLTKSIRRVGKILV 498 (711) T ss_pred CCccccCCCCCCHHHHHHHHHHHHHHHHHhCCChHHcCCCccchHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHH Confidence 1222222333344555555555555443322222233344433333332222222222222 22333344444444444 Q ss_pred hhcCccceEeeecccCcchheecchhhhhhhccceeeeehh----hhe----eeeEEee---cccC-hHHHHHHHHHHHH Q lcl|Aclame:pro 503 HIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDL----NVQ----KYDVVVT---TGPA-FATQRIEAAEAMI 570 (711) Q Consensus 503 ~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~----~~~----~~dv~v~---~~~~-~~s~r~~~~~~L~ 570 (711) .+.-. +. ..++.+.|.... .+-..+.+|.- ..| ..|+++. +... .++.-....+.+. T Consensus 499 ~li~~-----~~---~~er~~rI~ged---~~~~~v~ln~~~~~~~~G~~~~~nDi~~g~~Dv~i~~~p~~~s~r~~~~~ 567 (711) T protein:vir:10 499 EMIPH-----IY---DTERVVRLKFPD---ETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAE 567 (711) T ss_pred HHHHH-----Hc---CCCeEEEEecCC---CCcceEEecccccccccccceeeeccceeeeEEEEeeccCchhHHHHHHH Confidence 43211 11 122334443211 12233344321 111 2344432 2222 2333333333444 Q ss_pred HHHhhcchhHHHHHHHHHHhcCCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 571 QFAQAVPSAAAVMADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEA 650 (711) Q Consensus 571 ~l~~~~p~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~~~~k~qa 650 (711) .|.+..+.++.....++.-.+........-+........ .+++.......++.++.+++++++..+++.+++++++ T Consensus 568 ~l~ql~~~~p~~~~~~~~~il~~~d~p~~~el~e~lr~~----~~~~~~~~~~~~~~qq~~~e~qq~~~~~q~~~~~~q~ 643 (711) T protein:vir:10 568 AMIQFAQAVPSAAAVMADLIAQNMDWPGADVIAERLKKI----VPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQA 643 (711) T ss_pred HHHHHHhhcchhhhHHHHHHHHhcCCCCHHHHHHHHHhh----cCcccCcchhhhHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 444433332221111111111111111111111111111 1111112223334444455556666667777778888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccC Q lcl|Aclame:pro 651 DTAQAQADMLKAQLETEEAQKQLAMIEDMAQGGDVVYQQVRELVAQALAEITASQANVTEQ 711 (711) Q Consensus 651 e~~~aqae~~~~q~~~~~~q~q~~~~~~~~q~~~~~~~~~~~~~~~~~~e~~~~qa~~e~Q 711 (711) +..+++++..+++++..+++.+..+............+.....+++..++++..+++++.. T Consensus 644 ~~~qa~ae~~~Aqae~~qa~~e~~~~q~q~~~~~~~aq~~~~~~qq~~~~l~~~qaelq~~ 704 (711) T protein:vir:10 644 DMAQAEADTAQAQADMLKAQLETEEAQKQLAMIEDMAQGGDVVYQQVRELVAQALAEITAS 704 (711) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 8888888877777776666666555544444444444444445555556666666666655 No 131 >protein:vir:9263 Length: 725 # NCBI annotation: 1 # Family: family:all:487 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720327;genbank:gi:24371585;genbank:GeneID:955785 Probab=97.47 E-value=6.1e-05 Score=43.68 Aligned_cols=624 Identities=12% Similarity=-0.009 Sum_probs=189.9 Q ss_pred chHHHHHHHHHHHHhC-C-CCCCHHHHHHHHHhCCCceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhccccc Q lcl|Aclame:pro 46 WKDNWEAAEDDLKFLG-G-EQWPSQVRTERELEQRPCLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDT 123 (711) Q Consensus 46 ~~~~r~~~~~~~~~y~-G-~Qw~~~~~~~~~~~g~p~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~ 123 (711) ..+.++...+.+.+|. . +.++++-...++.... +.-|.-.+.+..++ ..+.||-+...+ .......+..- T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f--y~G~Qw~~~~~~~l--~~q~rp~~N~i~----~~i~~v~g~e~ 72 (725) T protein:vir:92 1 MADNENRLESILSRFDADWTASDEARREAKNDLFF--SRISQWDDWLSQYT--TLQYRGQFDVVR----PVVRKLVSEMR 72 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHh--hcCCCCCHHHHHHH--HhcCCCcccchH----HHHHHHHhhHH Confidence 4344444444433332 1 1122222222222111 01233334444444 223444322222 12222333333 Q ss_pred ccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEe-cCccc-- Q lcl|Aclame:pro 124 TLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAI-QNQFS-- 200 (711) Q Consensus 124 ~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v-~~~~~-- 200 (711) ...+-....+.+. .-..+..++..+....-...-...+..++...++.+ .++|- +++.... .||++ T Consensus 73 ~nr~d~~v~P~~~-~d~~~Ae~l~~~~~~~~~~~~~~~a~s~Af~~~i~~-G~G~~---------ev~~d~~~~d~~~~~ 141 (725) T protein:vir:92 73 QNPIDVLYRPKDG-ASPDAADVLMGMYRTDMRHNTAKIAVNVAVREQIES-GVGAW---------RLVTDYEDQSPTSNN 141 (725) T ss_pred hCCcceEEecCCc-cHHHHHHHHHHHHHHHHHhhCchHHHHHHHHHHhhc-Cccee---------eeeecccCCCCCCCc Confidence 3334444456554 334555666655554444444444555554444332 22221 0100000 01111 Q ss_pred eeeCCCccccCccccceeeeeecCCHHHHHHhcCCccc-----chhhcccccccccCCCCCeE-EEEEeeeeeeeceeEE Q lcl|Aclame:pro 201 VTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATA-----EPVYEDSVADYDTWFTEKSV-RVSEYFTREPVIREIA 274 (711) Q Consensus 201 v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~-----~~~~~~~~~~~~~~~~~~~v-~v~E~~~~~~~~~~~~ 274 (711) +.+...+.-.++...-|--..+..++++.+-.|-..-. ..+......+...|...... .-..-|+.. ...++. T Consensus 142 ~~i~~~~i~~~~~~V~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-d~vrv~ 220 (725) T protein:vir:92 142 QVIRREPIHSACSHVIWDSNSKLMDKSDSRHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQ-DTIQIA 220 (725) T ss_pred eeeEEeeccCChhhcccCchhhccChhhHHHHHHHhcCCHHHHHHHHhhcCcchhhhhhcccCCcccccccCC-CeEEEE Confidence 00000000000111112222333444444322211100 11111111111111111000 000112221 111111 Q ss_pred EccCCcE-------EEe-cCcchhHHHHHhcCchh-----hhhc--ccceEEEEEEEEecCceeccCccCCC-Ccc--ce Q lcl|Aclame:pro 275 LLSDGRS-------FWL-DALEDIVDELLEAGISI-----VRTR--KVKTFKTYWRKITGANVLEGPVEIPS-TTI--PV 336 (711) Q Consensus 275 ~~~~~~~-------~~~-~~~~~~~~~~~~~g~~~-----~~~~--~~~~~~v~~~~~~g~~~le~~~p~~~-~~~--P~ 336 (711) ..+.. +.. +.......+........ .... .+..+++......-..+ .+..-... ..+ .+ T Consensus 221 --e~~~r~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~-~g~~~l~~~~~~~~~~ 297 (725) T protein:vir:92 221 --EFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSII-TCTAVLKDKQLIAGEH 297 (725) T ss_pred --EEEEEEEEeeeEEeecCCCCCceeecChhhHHHHHHHHhccCchhhhhccceeeeEeeeee-cchhhhcCCCCCCCCc Confidence 11100 000 11110000000000000 0000 01111111111111111 11111111 112 23 Q ss_pred EEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHH-HHhcCCCceEecccccCChHHHHhhcccCCCceEEeccc Q lcl|Aclame:pro 337 IPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATET-VALAPKAPFIGSEGNVEGREDEWEQANTKNFSLLTYIPQ 415 (711) Q Consensus 337 vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~-l~~~~~~~~~~~~~av~~~~~~~~~~~~~~~~~i~~~~~ 415 (711) +||++++-+.. ...+......++..=+..=.+.+...-. +...+..+-....+....+++. .+.+.++ -...+-.. T Consensus 298 ~P~vP~~g~r~-~~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~-~~~~~~~-~~~~~~~~ 374 (725) T protein:vir:92 298 IPIVPVFGEWG-FVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGF-EHMYDGN-DDYPYYLL 374 (725) T ss_pred eeeEEEEeeee-ccCCcccccceeccchhHHHHHHHHHHHHHHHHHhccCcccccchhhhhHH-HHHHhcc-Cccceeec Confidence 66654321110 0111111223333333333333333222 2222222322222233223322 2221110 11100000 Q ss_pred ccC---cC--CccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 416 YQG---DP--GPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTKSI 490 (711) Q Consensus 416 ~~~---~~--~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~ 490 (711) ... ++ +...+.....++-...++++.....+.+-=++..+-...|..+++.+-.+....-......+..-| ..+ T Consensus 375 ~~~~~~~g~~~~~~i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~l~~~~-Dnl 453 (725) T protein:vir:92 375 NRTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQ-DNL 453 (725) T ss_pred cccccccccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhccCchhhHHHHHHHHHHHHHHHHHHHH-HHH Confidence 000 00 011222223333334555555555554433322222222333333333333222222222222222 223 Q ss_pred HHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehh----hhee----eeEE--eecccCh-H Q lcl|Aclame:pro 491 RRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDL----NVQK----YDVV--VTTGPAF-A 559 (711) Q Consensus 491 ~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~----~~~~----~dv~--v~~~~~~-~ 559 (711) +.-.+.+-+++..+.- .+ -.....+.|.... ..-..+.+|.. ..|+ -|+. .+...+. + T Consensus 454 ~~~~~~~g~~lL~lI~-----~~---~~~~r~~RI~~ed---g~~~~v~in~~~~~~~~G~~~~~Ndi~g~~Dv~v~~~p 522 (725) T protein:vir:92 454 ATAMRRDGEIYQSIVN-----DI---YDVPRNVTITLED---GSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGP 522 (725) T ss_pred HHHHHHHHHHHHHHHH-----Hh---cCCCcEEEEecCC---CCcceEEeccccccccccchhhhhccccceeeEEeecc Confidence 3333333334333321 11 1122333332211 11233444432 1122 1221 1221222 2 Q ss_pred HHHHHHHHHHHHHHhhcchhHHHHHH---HHHHhcCCcchHHHHHHHHhhhcchhhcchhhhhhhh-hHHHHHHHHHHHH Q lcl|Aclame:pro 560 TQRIEAAEAMIQFAQAVPSAAAVMAD---LIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEE-DMPEQTEPTPEQQ 635 (711) Q Consensus 560 s~r~~~~~~L~~l~~~~p~~~~~~~~---~~~~~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~-~~~~~q~~~~~~q 635 (711) +.-....+.+..+.++++.+++.... .+...++++......+.......+..+ ....++ .+++.+...++++ T Consensus 523 ~~~s~r~~~~~~l~ql~~~~~~~~~~~~~~l~~~~~~~d~~~~~e~~erirkq~~~----~~~~~~~~~e~~q~~~~~qq 598 (725) T protein:vir:92 523 SFQSMKQQNRAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQ----MGVKKPETPEEQQWLVEAQQ 598 (725) T ss_pred ChHHHHHHHHHHHHHHHHhcccchhHHHHHHHHHhhcccchHHHHHHHHHHhhhch----hccCCccchhhhHHHHHHHH Confidence 22222223333333333333322222 112223333333333322222211111 111111 1223333344445 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHhhhccC Q lcl|Aclame:pro 636 VEMAKSQADMAQAEADTAQAQADMLKAQLETEEAQKQLAMIEDMAQGGDVVYQQVRELV----AQALAEITASQANVTEQ 711 (711) Q Consensus 636 ~~~~~~q~~~~k~qae~~~aqae~~~~q~~~~~~q~q~~~~~~~~q~~~~~~~~~~~~~----~~~~~e~~~~qa~~e~Q 711 (711) ++.+++++++.++++.++++++++++++++..+++.++++.+..++...++..++..+. ....++.....+.+++| T Consensus 599 a~~~q~~~e~~~~qa~~~~~qae~~kaqaE~~k~q~~a~~~~~~a~~~aa~~~~~~~q~~~~q~~~~~~~~~~~~~~q~~ 678 (725) T protein:vir:92 599 AKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQD 678 (725) T ss_pred HHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHH Confidence 66667777888888888888999888888888887777666655554433332221111 11111222222222222 No 132 >protein:vir:78393 Length: 489 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110831;genbank:gi:134288592;genbank:GeneID:5179656 Probab=97.32 E-value=9.6e-05 Score=42.60 Aligned_cols=471 Identities=10% Similarity=-0.011 Sum_probs=176.1 Q ss_pred CCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHh-CCCCCCHHHHHHHHHhCCCceEehhhHH Q lcl|Aclame:pro 10 VEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFL-GGEQWPSQVRTERELEQRPCLVNNVLPT 88 (711) Q Consensus 10 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y-~G~Qw~~~~~~~~~~~g~p~~~~N~i~~ 88 (711) |--...|..-+..++ ++ +..+...++...+....-+.-+... .|. ...+|+.+.. +..-..| ...+|.++. T Consensus 1 ~~~~~~~~~~V~~~h-p~----y~a~~~~W~~ird~~~G~~~~~~r~-~yl~~~~~~~~e~~-Y~~rl~r-A~~~n~~~~ 72 (489) T protein:vir:78 1 MLTENGQGSGVKTKH-RE----WLHYAPKWQKVRHALAGELVSYLRN-VGLNEPDKAYGEAR-QAEYEAG-GIVYNFTRR 72 (489) T ss_pred CccCCCccCCCCccC-HH----HHHHHHHHHHHHHHhcCcccccccC-CCCCCCCCCCChHH-HHHHHhc-cccCChHHH Confidence 211122222222222 11 2222222222222111111001111 122 2345544432 3322233 457899999 Q ss_pred HHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHH-hhcCHHHHHHHHHHHHH Q lcl|Aclame:pro 89 FVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIE-YNCDAETEYDIAFQGAV 167 (711) Q Consensus 89 ~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~-~~~~~~~~~~~a~~~~~ 167 (711) +|+.++|..=+..|.+.+ -+.|..++..+- +-++.......++..++ T Consensus 73 tl~~l~G~vfrk~p~~~~--------------------------------p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l 120 (489) T protein:vir:78 73 TLSGMVGSVMRKEPEINI--------------------------------PKELEYLLKNADGSGVGLIQHAQDTLMEID 120 (489) T ss_pred HHHHHhchhhcCCcceec--------------------------------cHHHHHHHhccCCCCCCHHHHHHHHHHHHH Confidence 999999986554443321 222455555554 34678999999999999 Q ss_pred hcCccEEEEEEeeccCCC--------CCCcceEEEecCccceeeCCCccccCc-cccceeeeeecCCHHHHHHhcCCccc Q lcl|Aclame:pro 168 ESGMGYLRVRSDYLADDS--------FEQDLIIEAIQNQFSVTIDPDAKKRDR-SDMNWCLIDDTMSKEKFKALYPDATA 238 (711) Q Consensus 168 ~~G~g~~~v~~d~~~~~~--------~~~~i~i~~v~~~~~v~~Dp~a~~~d~-~Da~~~~~~~~~~~~e~~~~~p~~~~ 238 (711) .+|.+++=| |+-..+. ..-+|.+..| +|.+|+ +......+. ....++..+..... T Consensus 121 ~~G~~~ilV--D~P~~~~~T~ade~~~~~rPy~~~~-~~~~Ii-nW~~~~v~G~~~Lt~v~lrE~~~~------------ 184 (489) T protein:vir:78 121 SVGRGGLLV--DAPETGAATAAEQNAGLLNPTIAFY-TTENIV-NWRLTRVGSVNRVTMVVLRETWEY------------ 184 (489) T ss_pred hcCeEEEEE--eeCCCCCcCHHHHHHhcCCcEEEEe-chhhhc-CceeeeeCCccceeEEEEEEeEEe------------ Confidence 999887543 4422221 1125667766 576665 332222221 01222222221100 Q ss_pred chhhcccccccccCCCCCeEEEEEe--eeeeeeceeEEE-ccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEE Q lcl|Aclame:pro 239 EPVYEDSVADYDTWFTEKSVRVSEY--FTREPVIREIAL-LSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWR 315 (711) Q Consensus 239 ~~~~~~~~~~~~~~~~~~~v~v~E~--~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~ 315 (711) .+..+.+.+.....+||.+. |.+- +.+++. ..+|... ...+ T Consensus 185 -----~d~~~~f~~~~~~q~RvL~~~~~g~~--~~~~~r~~~~g~~~----------------------------~~~~- 228 (489) T protein:vir:78 185 -----NEPGNEFETKYGEQYRVLDIDSDGNY--RQRLFRFDAEGGAQ----------------------------EDVV- 228 (489) T ss_pred -----ecCCCCccceeEEEEEEEecCCCcce--EEEEEEeecCCccc----------------------------ceee- Confidence 01111122222233333321 1000 000000 0011000 0000 Q ss_pred EEecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCCh Q lcl|Aclame:pro 316 KITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGR 395 (711) Q Consensus 316 ~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~ 395 (711) .++....--+.+.+|||+++.... +...+....-.+-.+....=...+-..+++..+.-|..++. |. ++. T Consensus 229 -----~~~~~~g~~~l~~IPfv~~~~~~~---~~~~~~pPLl~LA~lni~Hy~~ssd~~~~l~~~~~P~l~i~-G~-d~~ 298 (489) T protein:vir:78 229 -----EIYPDLGESLRGVIPFTFIGATNN---DATIDDAPLLPLAELNIGHYRNSADNEESSFVVGQPTLFIY-PG-ENL 298 (489) T ss_pred -----EEeccCCCCccCeeeEEEEecCCC---CCCCCcCchHHHHHHHHHHhhhhhHHHHHHHHcccceeeee-cC-ccC Confidence 001111112346677776653321 12222233444544443333334455666777766766653 32 222 Q ss_pred HHHHhhcccCCCceEEeccccc--CcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHH Q lcl|Aclame:pro 396 EDEWEQANTKNFSLLTYIPQYQ--GDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQR 473 (711) Q Consensus 396 ~~~~~~~~~~~~~~i~~~~~~~--~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~ 473 (711) .+.+.......+-++.-+.+.. .+....++++....-. ...|....+.|. ..|.. .. ..+.+.|+.+...... T Consensus 299 ~~~~~~~~~~~~i~~g~~~~~~lp~~~~~~~ie~~~~~~~-r~~l~~le~qm~-~lGa~--l~-~~~~~~Ta~~~~~~~~ 373 (489) T protein:vir:78 299 TPQAFKEANPNGIKFGSRRGHNLGYGGSAQLIQAGENNLA-RQNMLDKEQQAI-QIGAQ--LI-TPTQQITAQSARIQRG 373 (489) T ss_pred CcccccccCccceeeCCcccccCCCCCCcceeccCcchHH-HHHHHHHHHHHH-HHhhh--hc-cCCcchhHHHHHHHHH Confidence 2222222222222222111111 0112234443322221 222332233332 22322 12 2233577877777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEee Q lcl|Aclame:pro 474 QGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVT 553 (711) Q Consensus 474 ~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~ 553 (711) +....|..+..|+..++ ..+|.++..|...+ .++.. -+.+|. .|++ . T Consensus 374 ~~~S~L~~~a~~~e~al----~~~l~~~a~w~G~~-------~~~~~-~i~~n~-------------------dF~~--~ 420 (489) T protein:vir:78 374 ADTSVMATIARNVSQAY----TDALRWVAVMLGKP-------EDTEV-EFRLNM-------------------DFFL--E 420 (489) T ss_pred HhhHHHHHHHHHHHHHH----HHHHHHHHHHcCCC-------CCCce-EEEeec-------------------ccCc--c Confidence 77777777777776655 45555666664321 00000 111211 1111 0 Q ss_pred cccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCC--cchHHHHHHHHhhhcchhhcchhhhhhhhhHHHH Q lcl|Aclame:pro 554 TGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDW--PGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQ 627 (711) Q Consensus 554 ~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~--~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 627 (711) +-.+. ...+|..+...- .+....+-..++...+ +..+++...+....++.+......-....|++++ T Consensus 421 --~~d~~----~~~al~~~~~~G-~is~~t~~~~L~~~gv~d~~~e~~~~ei~~~~~~~~~~~~g~~~~~~q~~~~ 489 (489) T protein:vir:78 421 --PMTAQ----DRAAWMADINAG-LLPATAYYAALRKAGVTDWTDADIKDAVADQPLPVATEVQGEIPQSAQQQEK 489 (489) T ss_pred --cCCHH----HHHHHHHHHhcC-CCCHHHHHHHHHhCCCCCccHHHHHHHHhhcCCCcccCCcccCCCCcccccC Confidence 00011 122222222210 1111111111222222 2345555555544222211111110000000000 No 133 >protein:vir:77597 Length: 725 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:YP_063735;genbank:gi:51236726;genbank:GeneID:2944241 Probab=96.95 E-value=0.00024 Score=40.43 Aligned_cols=613 Identities=12% Similarity=-0.000 Sum_probs=195.2 Q ss_pred chHHHHHHHHHHHHhC-C-CCCCHHHHHHHHHhCCCceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhccccc Q lcl|Aclame:pro 46 WKDNWEAAEDDLKFLG-G-EQWPSQVRTERELEQRPCLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDT 123 (711) Q Consensus 46 ~~~~r~~~~~~~~~y~-G-~Qw~~~~~~~~~~~g~p~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~ 123 (711) ..+.++...+.+.+|. . +.++++-...++.... +.-|...+.+..++ ..+.||-+..... ......+..- T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f--y~G~Qw~~~~~~~l--~~q~rp~~N~i~~----~i~~v~g~~~ 72 (725) T protein:vir:77 1 MADNENRLESILSRFDADWTASDEARREAKNDLFF--SRVSQWDDWLSQYT--TLQYRGQFDVVRP----VVRKLVSEMR 72 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHh--hCCCCCCHHHHHHH--HhcCCCccccHHH----HHHHHHhhHH Confidence 4344444333333332 1 1122222222222111 01234444455444 2234442222111 1222333333 Q ss_pred ccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCccceee Q lcl|Aclame:pro 124 TLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVTI 203 (711) Q Consensus 124 ~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~~ 203 (711) ...+-....+.+. .-..+..++..+....-...-...+..++...++.+ .++| .+++. T Consensus 73 ~nr~d~~v~P~~~-~d~~~Ae~l~~~~~~~~~~~~~~~a~s~Af~~~i~~-G~G~--------------------~ev~~ 130 (725) T protein:vir:77 73 QNPIDVLYRPKDG-ARPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEA-GVGA--------------------WRLVT 130 (725) T ss_pred hCCcceEEecCCc-cHHHHHHHHHHHHHHHHHhhCchhHHHHHHHHHhhc-Ccce--------------------eeeee Confidence 3344444456554 333455566555554444444444444444443322 2222 22222 Q ss_pred CCCc---ccc-----------CccccceeeeeecCCHHHHHHhcCCccc-----chhhcccccccccCCCCCeEE-EEEe Q lcl|Aclame:pro 204 DPDA---KKR-----------DRSDMNWCLIDDTMSKEKFKALYPDATA-----EPVYEDSVADYDTWFTEKSVR-VSEY 263 (711) Q Consensus 204 Dp~a---~~~-----------d~~Da~~~~~~~~~~~~e~~~~~p~~~~-----~~~~~~~~~~~~~~~~~~~v~-v~E~ 263 (711) |... .+. |+...-|--..+..++++.+-.|-..-. ..+......+...|....... ...- T Consensus 131 d~~~~d~~~~~~~i~~~~~~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 210 (725) T protein:vir:77 131 DYEDQSPTSNNQVIRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFP 210 (725) T ss_pred cccCCCCCCCceeeEEeecccChhhceeCchhhccChhhHHHHHHHhcCCHHHHHHHHhhCCcchhhccccccccccccc Confidence 2111 000 0111111112233344443322211100 011110000111111110000 0011 Q ss_pred eeeeeeceeEEEc----cC-CcEEEe-cCcchhHHHHHhcCchhh-----hhcc--cceEEEEEEEEecCceeccCccCC Q lcl|Aclame:pro 264 FTREPVIREIALL----SD-GRSFWL-DALEDIVDELLEAGISIV-----RTRK--VKTFKTYWRKITGANVLEGPVEIP 330 (711) Q Consensus 264 ~~~~~~~~~~~~~----~~-~~~~~~-~~~~~~~~~~~~~g~~~~-----~~~~--~~~~~v~~~~~~g~~~le~~~p~~ 330 (711) |+.. ...++..+ .. ...+.. ++...............+ .... +..+.+......- .+..+..... T Consensus 211 ~~~~-d~vrv~E~~~r~~~~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~rv~~-~~~~g~~~l~ 288 (725) T protein:vir:77 211 WLTQ-DTIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYK-SIITCTAVLK 288 (725) T ss_pred ccCC-CeeEEEEEEEEEEEeeEEEEecCCCCcceeecChhhHHHHHHHhhhcCchhhhhcccceeeeeE-eeecCceeec Confidence 2211 11111100 00 000000 100000000000000000 0000 0111111000000 0011111111 Q ss_pred C-Ccc--ceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHH-HHhcCCCceEecccccCChHHHHhhcccCC Q lcl|Aclame:pro 331 S-TTI--PVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATET-VALAPKAPFIGSEGNVEGREDEWEQANTKN 406 (711) Q Consensus 331 ~-~~~--P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~-l~~~~~~~~~~~~~av~~~~~~~~~~~~~~ 406 (711) . ..+ .++||+.++-+. ....+...+..++..=+..=.+.+...-. +...+..+.....+....++. ..+.+.++ T Consensus 289 ~~~~~~~~~~P~vP~~g~r-~~~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~-~~~~~~~~ 366 (725) T protein:vir:77 289 DKQLIAGEHIPIVPVFGEW-GFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAG-FEHMYDGN 366 (725) T ss_pred cCCcCCCCccceEEEeeee-eccCCcccccchhhhhhhHHHHHHHHHHHHHHHHHhccccccccchhhhhH-HHHHHHhc Confidence 1 111 124554322110 00111111223333333333444443322 223333344444444434433 45566776 Q ss_pred CceEEeccccc----CcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhcccc----chhHHHHHHHHHHHHHHH Q lcl|Aclame:pro 407 FSLLTYIPQYQ----GDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMG----NETSGRAIIARQRQGDRG 478 (711) Q Consensus 407 ~~~i~~~~~~~----~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~----~~~sg~ai~~~~~~~~~~ 478 (711) +.+..+..... +..+.+.+...+.+.-....+++.......+. ...|... ..+++.+--+....-... T Consensus 367 ~~~~~~~~~~~~~~~g~~~~~~i~~~~~~~lp~~~~~ll~~~~~~i~----~~tGi~~~~lG~~~n~~SG~ai~~rq~qg 442 (725) T protein:vir:77 367 DDYPYYLLNRTDENSGDLPTQPLAYYENPEVPQANAYMLEAATSAVK----EVATLGVDTEAVNGGQVAFDTVNQLNMRA 442 (725) T ss_pred cCCceecccccccCCCcccccCccccCCCCchHHHHHHHHHHHHHHH----HHhCCCHHHhCCCchhhHHHHHHHHHHHH Confidence 66544322111 11112233333444333455555555555442 3335432 233332222222222222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhh----hee----eeE Q lcl|Aclame:pro 479 SFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLN----VQK----YDV 550 (711) Q Consensus 479 ~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~----~~~----~dv 550 (711) .... -.|-..++.-.+.+-+++..+.. .--...+.+.|.... ..-..+.+|... .|. -|+ T Consensus 443 ~~~~-~~~~Dnl~~~~~~~g~~lL~lI~--------~~~~~~rv~RI~~ed---~~~~~v~in~~~~~~~~G~~~~~NDi 510 (725) T protein:vir:77 443 DLET-YVFQDNLATAMRRDGEIYQSIVN--------DIYDVPRNVTITLED---GSEKDVQLMAEVVDLATGEKQVLNDI 510 (725) T ss_pred HHHH-HHHHHHHHHHHHHHHHHHHHHHH--------HHcCCCcEEEEecCC---CCcceeeecccccccccchhHhhhhh Confidence 2222 22222333333333444443321 111122333332211 111234444221 111 121 Q ss_pred E--eecccCh-HHHHHHHHHHHHHHHhhcchhHHHHHHH---HHHhcCCcchHHHHHHHHhhhcchhhcchhhhhhhhhH Q lcl|Aclame:pro 551 V--VTTGPAF-ATQRIEAAEAMIQFAQAVPSAAAVMADL---IAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDM 624 (711) Q Consensus 551 ~--v~~~~~~-~s~r~~~~~~L~~l~~~~p~~~~~~~~~---~~~~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~ 624 (711) . .+..... ++.-....+.+..+.++++.+++..... +...+++++...+.+..........+....+ ...+ T Consensus 511 ~g~~Dv~v~~~p~~~s~r~~~~~~l~qll~~~~~~~~~~~~~l~~~~~l~d~~~~~e~~erirkq~~~~~~~q---~~~~ 587 (725) T protein:vir:77 511 RGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKK---PETP 587 (725) T ss_pred ccceeeEEeeccchHHHHHHHHHHHHHHHHhccccchhHHHHHHHhhccccchHHHHHHHHHHhhhhhhhccC---CCCh Confidence 0 1111111 1222222233333344333333322221 1223334444444443333322221111111 1112 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHH Q lcl|Aclame:pro 625 PEQTEPTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQLETEEAQKQLAMIEDMAQGGDVVYQQVRELVA----QALAE 700 (711) Q Consensus 625 ~~~q~~~~~~q~~~~~~q~~~~k~qae~~~aqae~~~~q~~~~~~q~q~~~~~~~~q~~~~~~~~~~~~~~----~~~~e 700 (711) .+++..++.++++..++++++.++++.+.++++++.+++++...++.++++.+..++...++..++..+.. ...++ T Consensus 588 ~e~q~~~~~qq~~~~q~~~e~~q~q~~~~~~qa~~~kaq~e~~k~q~~a~~~~~~a~~~aa~~~~~~~q~~~~q~a~~~~ 667 (725) T protein:vir:77 588 EEQQWLVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFRE 667 (725) T ss_pred hhHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHH Confidence 23333344445556677777778888888888888888888777777776666555544333333222211 11111 Q ss_pred HHHHHhhhccC Q lcl|Aclame:pro 701 ITASQANVTEQ 711 (711) Q Consensus 701 ~~~~qa~~e~Q 711 (711) .....++++.| T Consensus 668 ~~~~~~~~q~~ 678 (725) T protein:vir:77 668 FLKTVASFQQD 678 (725) T ss_pred HHHHHHHHHHH Confidence 11111122211 No 134 >protein:vir:95014 Length: 491 # NCBI annotation: structural protein # Family: family:all:584 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224035;genbank:gi:62327322;genbank:GeneID:5176842 Probab=96.45 E-value=0.00061 Score=38.21 Aligned_cols=475 Identities=9% Similarity=-0.020 Sum_probs=174.0 Q ss_pred CCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHh-CCCCCCHHHHHHHHHhCCCceEehhhHH Q lcl|Aclame:pro 10 VEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFL-GGEQWPSQVRTERELEQRPCLVNNVLPT 88 (711) Q Consensus 10 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y-~G~Qw~~~~~~~~~~~g~p~~~~N~i~~ 88 (711) |--...|..-+..++ ++ +..+...++...+....-+..+.. ..|. ....++.+. .+ +.+-.-...+|.++. T Consensus 1 ~~~~~~~~~~V~~~h-p~----y~a~~~~W~~ird~~~G~~~~~~r-~~yl~~~~~~~~e~-~Y-~~rl~rA~~~n~~~~ 72 (491) T protein:vir:95 1 MLTANGQGSGVKTKH-RE----WLHYAPKWQKVRHALAGDLVGYLR-NVGLNEPDKAYGEA-RQ-AEYEAGGIVYNFTRR 72 (491) T ss_pred CcccCCccCCCCccC-HH----HHHHHHHHHHHHHHhcCcchhhcc-cCCCcCCCCCCCHH-HH-HHHHhcccCCChHHH Confidence 111112222222222 11 222222222222211111110000 0111 112333332 12 222223567899999 Q ss_pred HHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHH-hhcCHHHHHHHHHHHHH Q lcl|Aclame:pro 89 FVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIE-YNCDAETEYDIAFQGAV 167 (711) Q Consensus 89 ~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~-~~~~~~~~~~~a~~~~~ 167 (711) +|+.++|..=+..|.+.+ -+.|..++..+. +-++.......++..++ T Consensus 73 tl~~l~G~vfrk~p~~~~--------------------------------p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l 120 (491) T protein:vir:95 73 TLSGMVGSVMRKEPEINI--------------------------------PKELEYLLKNADGSGVGLIQHAQDTLMEID 120 (491) T ss_pred HHHHHhchhhcCCceeec--------------------------------cHHHHHHHhccCCCCCCHHHHHHHHHHHHH Confidence 999999986554444321 122455555554 34678999999999999 Q ss_pred hcCccEEEEEEeeccCCC--------CCCcceEEEecCccceeeCCCccccCc-cccceeeeeecCCHHHHHHhcCCccc Q lcl|Aclame:pro 168 ESGMGYLRVRSDYLADDS--------FEQDLIIEAIQNQFSVTIDPDAKKRDR-SDMNWCLIDDTMSKEKFKALYPDATA 238 (711) Q Consensus 168 ~~G~g~~~v~~d~~~~~~--------~~~~i~i~~v~~~~~v~~Dp~a~~~d~-~Da~~~~~~~~~~~~e~~~~~p~~~~ 238 (711) .+|.+++=| |+-.... ...+|.+..| +|.+|+ |......+. ....++..+...... T Consensus 121 ~~G~~~ilV--D~P~~~~~T~Ade~~~~~rPy~~~~-~~~~Ii-nW~~~~v~g~~~L~~v~l~E~~~~~----------- 185 (491) T protein:vir:95 121 SVGRGGLLV--DAPETAAATAAEQNAGLLNPTIAFY-TTENIV-NWRLTRVGSVNRVTMVVLRETWEYH----------- 185 (491) T ss_pred HcCeEEEEE--ecCCCcccCHHHHHHhcCCcEEEEe-chhhhc-CceeeeeCCceeeeEEEEEEeEEee----------- Confidence 999887543 4422211 1225667766 566665 332222221 012222222210000 Q ss_pred chhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEc-cCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEE Q lcl|Aclame:pro 239 EPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALL-SDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKI 317 (711) Q Consensus 239 ~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~ 317 (711) +..+.+.......+||.+.+....-..+++.. .+|.... .+.+.+. T Consensus 186 ------d~~~~f~~~~~~qyRvL~l~~~g~~~~~v~r~~~~g~~~~---------------------------~~~~~~~ 232 (491) T protein:vir:95 186 ------EPGNEFETKYGEQYRVLDIDTDGNYRQRLFRFDAEGGAQE---------------------------EVVEIYP 232 (491) T ss_pred ------cCCCCcccceEEEEEEEeecCCCceEEEEEEEcCCCccee---------------------------eeeeeee Confidence 00011111112233333321000000011100 0110000 0000000 Q ss_pred ecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHH Q lcl|Aclame:pro 318 TGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGRED 397 (711) Q Consensus 318 ~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~ 397 (711) -.+.. +.+.+|||+++.... +...+....-.+-.+....=...+-..+++..+.-|..++. |.- +..+ T Consensus 233 -----~~g~~--~l~~IPfv~~~~~~~---~~~~~~pPLl~LA~lni~Hy~~ssd~~~~l~~~~~P~l~~~-G~d-~~~~ 300 (491) T protein:vir:95 233 -----DLGES--LRGVIPFTFIGATNN---DATIDDAPLLPLAELNIGHYRNSADNEESSFVVGQPTLFIY-PGD-NLTP 300 (491) T ss_pred -----cCCCc--ccCeeEEEEEecCCC---CCCCCcCchHHHHHHHHHHhhhhhHHHHHHHHcccceeeee-cCc-ccCc Confidence 01111 336677776643321 12222233444444433332333445566666666666553 221 1111 Q ss_pred HHhhcccCCCceEEeccccc--CcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHHHH Q lcl|Aclame:pro 398 EWEQANTKNFSLLTYIPQYQ--GDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQG 475 (711) Q Consensus 398 ~~~~~~~~~~~~i~~~~~~~--~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~ 475 (711) .+.......+-++--+.+.. .+....++++....- ....|......|.. .|. .+...+.+.|+.+......+. T Consensus 301 ~~~~~~~~~~i~~g~~~~~~lP~~~~~~~ie~~~~~~-~~~~l~~~e~qm~~-~Ga---~l~~~~~~~Ta~~~~~~~~~~ 375 (491) T protein:vir:95 301 QSFKEANPNGIKFGSRCGHNLGYGGSAQLIQAGENNL-ARQNMLDKEQQAIQ-IGA---QLITPSQQITAESARIQRGAD 375 (491) T ss_pred chhhccCcceeEecCcCCcCCCCCCccceeecCcchH-HHHHHHHHHHHHHH-HHH---HhccCCcchhHHHHHHHHHHh Confidence 22222222221111111111 112234444332221 12333333333322 332 222223457888888877777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEeecc Q lcl|Aclame:pro 476 DRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTG 555 (711) Q Consensus 476 ~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~ 555 (711) ...|..+..|+..++.. +|.++..|...+ .++.. -+.+|. .|+. . T Consensus 376 ~S~L~~~a~~~e~al~~----~l~~~a~w~G~~-------~~~~v-~i~~n~-------------------dF~~--~-- 420 (491) T protein:vir:95 376 TSVMATIARNVSQAYTD----ALRWVAMMLGKP-------EDSEV-EFQLNM-------------------DFFL--Q-- 420 (491) T ss_pred hHHHHHHHHHHHHHHHH----HHHHHHHHcCCC-------CCCce-EEEeec-------------------cccc--c-- Confidence 77888888887766554 455666664321 00010 111221 1110 0 Q ss_pred cChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCCc--chHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHH Q lcl|Aclame:pro 556 PAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDWP--GADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTE 629 (711) Q Consensus 556 ~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~~--~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~ 629 (711) +-.+.. ..+|..+... ..+....+-..++...+. ..+++...++......+-..+......+..++.+. T Consensus 421 ~~~~~~----~~all~~~~~-G~is~~t~~~~L~~~~vl~~~~e~~~~~ie~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (491) T protein:vir:95 421 PMTAQD----RAAWMADINA-GLLPATAYYAALRKAGVTDWTDEDILNAIEDAPLPSGAVTQVAGEIPQAAQQQQE 491 (491) T ss_pred cCCHHH----HHHHHHHHhc-CCCCHHHHHHHHHhCCCCCccHHHHHHHHHhcCCCCCccccccccchhhhhhccC Confidence 000111 1222222221 011111111112222222 33455555544433222221111111111110000 No 135 >protein:vir:105520 Length: 706 # NCBI annotation: phage portal protein # Family: family:all:487 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516189;genbank:gi:89885992;genbank:GeneID:3964380 Probab=96.30 E-value=0.00077 Score=37.65 Aligned_cols=632 Identities=12% Similarity=-0.012 Sum_probs=103.4 Q ss_pred HHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCCceEehhhHHHHHHHhhhhhh--ccccee Q lcl|Aclame:pro 28 DRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRPCLVNNVLPTFVDQVLGDQRQ--NRPAIK 105 (711) Q Consensus 28 ~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~v~~i~g~~~~--~r~~~~ 105 (711) --+..+.+++++...++...+|..++..... ++....--.| |...+.+..++-..++ +||-+. T Consensus 1 m~e~~~~~~~~~~~~~~~~~~~~~~~r~~~~---------~d~~f~~~~G------~QW~~~~~~~l~~~~q~~grP~~~ 65 (706) T protein:vir:10 1 MAESRQKQHERVMLRFDRAWSPQQVVREKCI---------EATRFVRVPG------GQWEGATVAGTKLDEQFEKYPKFE 65 (706) T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHH---------HHHHhhccCC------ccCCHHHHHHHHhhhhhcCCCceE Confidence 0000000111100000000111111110000 0000000001 2222333333322111 222111 Q ss_pred EecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCccEEEEEE-eeccCC Q lcl|Aclame:pro 106 VSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESGMGYLRVRS-DYLADD 184 (711) Q Consensus 106 ~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~-d~~~~~ 184 (711) | .+ .........+......+-....+.+...-.-+.+++..+....-.......+..++...++.+ .++| +...+. T Consensus 66 ~-N~-i~~~v~~v~g~~~~nr~~~~v~P~~~~~d~~~Ae~l~~l~~~~~~~~~~~~a~s~Af~d~i~~-G~G~~ev~~d~ 142 (706) T protein:vir:10 66 I-NK-VATELNRIISEYRNNRISVKFRPGDNAASEELANKLNGLFRADYEETDGGEACDNAFDDAATG-GFGCFRLTTSF 142 (706) T ss_pred e-cc-hHHHHHHHhhHHHhCCCceEEecCCCCchHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHhhc-CcceEEeeecc Confidence 1 00 000011111111111111112222222222233333333333322223333333333322221 1111 000000 Q ss_pred CCCCcceEEEecCccceeeCCC-ccccCccccceeeeeecCCHHHHHHhcCCccc--chh---hcccccccccCCCCCeE Q lcl|Aclame:pro 185 SFEQDLIIEAIQNQFSVTIDPD-AKKRDRSDMNWCLIDDTMSKEKFKALYPDATA--EPV---YEDSVADYDTWFTEKSV 258 (711) Q Consensus 185 ~~~~~i~i~~v~~~~~v~~Dp~-a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~--~~~---~~~~~~~~~~~~~~~~v 258 (711) .-+.++.-....-....++||. +--+|. ..+..+.++.+-.|-.+-. +.+ .-....+.. .. T Consensus 143 ~~~~d~~~~~~~i~i~~v~~p~~~v~~Dp-------~a~~~D~sDar~~~~~~~~~~d~~~~~fp~~~~~~~-----~~- 209 (706) T protein:vir:10 143 VNEYDPMDERQRIAVEPIYDPARSVWFDP-------DAKKYDKSDALWAFCMYSVSLEKYQSEYDKAPTSLD-----RV- 209 (706) T ss_pred ccccCCCCCCccceeeeeccchhceecCc-------hhcccChhhcceEeeeecCCHHHHHHhcCCChhhhh-----hh- Confidence 0000000000000000000110 000000 0011111111110000000 000 000000000 00 Q ss_pred EEEEeeeeeeeceeEEEccC-C-------cEEEecCcchhHHHHHhcCc-----hhhhhcc---cceEEEEEEEEecCce Q lcl|Aclame:pro 259 RVSEYFTREPVIREIALLSD-G-------RSFWLDALEDIVDELLEAGI-----SIVRTRK---VKTFKTYWRKITGANV 322 (711) Q Consensus 259 ~v~E~~~~~~~~~~~~~~~~-~-------~~~~~~~~~~~~~~~~~~g~-----~~~~~~~---~~~~~v~~~~~~g~~~ 322 (711) ..-+|+..+....-+.... . ..+++.............+. ..+.... +..+++...-..-. + T Consensus 210 -~~~~~~~d~~~~d~~~~~eyy~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~v~~~-~ 287 (706) T protein:vir:10 210 -GSVSWQYDWFTPDVVYIAKYYEVRKESVDVISYRQPLTQEIATYDSEQIADIQDELEQAGFEEIGRRSVKRRRIYVA-V 287 (706) T ss_pred -ccccccccccCCCcceecccccccceeEEEEEeeccccCCceeeccchhhhhHHHHhhCCchhhhhcccceeeEEEE-e Confidence 0001221111100000000 0 00000000000000000000 0000000 00000000000000 0 Q ss_pred eccCccCCCCcc----ceEEEEeeeeccCCc-ccccchH-HHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChH Q lcl|Aclame:pro 323 LEGPVEIPSTTI----PVIPVWGKSLIIKKK-EIFRSII-RHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGRE 396 (711) Q Consensus 323 le~~~p~~~~~~----P~vp~~~~~~~~~~~-~~~~g~v-~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~ 396 (711) +.+ ...-.+.- ..+||++++- .+. ..+.+.. -.+++.-+.-...-..+.-++...+..+..++.+++++.+ T Consensus 288 ~~g-~~~l~~~~p~~~~~~P~vP~~g--~r~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~ 364 (706) T protein:vir:10 288 VDG-DGFLEKPRRIPGEHIPLIPVYG--KRWFIDDVERVEGHIAKAMDPQRLYNLQVSMLADAAAQDPGQTPIVDMEQIR 364 (706) T ss_pred ecc-ccccccCCCCCCCccceEEEee--ccccccccCcccceeccchhhHHHHHHHHHHHHHHHHhcCCcccccchhHHH Confidence 000 00000000 1122221110 000 0011111 1122222222222223333334444444445555444443 Q ss_pred HHHhhcccCCCceE-EecccccCcCCccccCCccchHH---HHHHHHHHHHHHHHHhCCCHHHhccccc---hhHHHHHH Q lcl|Aclame:pro 397 DEWEQANTKNFSLL-TYIPQYQGDPGPRRQPPAAVPAA---ELTLGQNSVEKIKSTMGMYDASLGAMGN---ETSGRAII 469 (711) Q Consensus 397 ~~~~~~~~~~~~~i-~~~~~~~~~~~i~~~~~~~~~~~---~~~ll~~~~~~~~~~tGv~~~~~G~~~~---~~sg~ai~ 469 (711) ++ .+.+.++...- .+-.-...+.....+..+.-+.+ ...+-+...++++.....-....|.... ..|+++-- T Consensus 365 ~~-~~~~~~~~~~~~~~l~~~~~~~~~g~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~~i~~vsGi~~~~lG~~sn~SG~ 443 (706) T protein:vir:10 365 GL-EQHWEGRNRKRPAFLPLRTVTDKTGNVVAPANVAGYTQAPVLNQALAALLQQTSADIQEVTGSSQAMQQMPSNVARE 443 (706) T ss_pred HH-HHHhhhcccccccchhcccccCCCCcccccccccccCCCcchHHHHHHHHHHHHHHHHHHhCCCHHHcCCccchHHH Confidence 32 22333221110 00000000000111100000000 0001111111111111111112222211 11221111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhh----- Q lcl|Aclame:pro 470 ARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLN----- 544 (711) Q Consensus 470 ~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~----- 544 (711) +.+..-....... -.|-..++...+.+-+++..+.- .+. ..++.+.|.... .+-.++.+|... T Consensus 444 Ai~~rq~qg~~~~-~~~~Dnl~~~~~~~g~~lL~li~-----~~y---~~~R~~RI~~ed---~~~~~v~in~~~~d~~~ 511 (706) T protein:vir:10 444 TVNSLLNRSDMAS-FIYLDNMAKSLKRAGEIWLSMAR-----EIY---GSDREVRIVHED---GTDDIALMNAAVLDNQT 511 (706) T ss_pred HHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHH-----HHc---CCCcEEEEecCC---CCccceeeccceecccc Confidence 1111100010010 00001111111111222221110 000 111122221100 011222222110 Q ss_pred ---heeeeEEe---ecccChHHHHH-HHHHHHHHHHhhcchhHHHHHH--HHH-HhcCCcchHHHHHHHHhhhcchhhcc Q lcl|Aclame:pro 545 ---VQKYDVVV---TTGPAFATQRI-EAAEAMIQFAQAVPSAAAVMAD--LIA-QNMDWPGADVIAERLKKIVPPNVLSK 614 (711) Q Consensus 545 ---~~~~dv~v---~~~~~~~s~r~-~~~~~L~~l~~~~p~~~~~~~~--~~~-~~~~~~~~~e~~~~l~~~~~~~~~~~ 614 (711) ....||++ +...+...... ...+++..+.+.++.+.+.... .++ -.++.-+.....+..+....... .+ T Consensus 512 G~~~~~nDi~~g~yDv~i~~~p~~~t~r~~~~~~m~el~~~~~p~~~~~~~l~~~~~~~~d~p~~~e~~e~irk~~~-~q 590 (706) T protein:vir:10 512 GRVVALNDLSTGRYDVSVDVGPSYSARRDATVNALTQLLQGMLPQDPMRPALMGIIIDNMEGEGLDDFKAFNRRQLL-TQ 590 (706) T ss_pred CceeeeecceeeeEEEEEecccCcchHHHHHHHHHHHHHHhcCCcchhhHHHHHHHHhhcCccchHHHHHHHHHhhc-cc Confidence 01122222 11111111111 1122222222222221111000 000 00000000000000000000000 00 Q ss_pred hhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHH Q lcl|Aclame:pro 615 DEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQLETEEAQKQLAMIEDMAQGGDVVYQQV-REL 693 (711) Q Consensus 615 ~~~~~~~~~~~~~q~~~~~~q~~~~~~q~~~~k~qae~~~aqae~~~~q~~~~~~q~q~~~~~~~~q~~~~~~~~~-~~~ 693 (711) .. .....+++++..++++|++.+++++++.++++++.++++++.++++++.+.+.+..+.+..+..+.+..... ..+ T Consensus 591 ~~--~~~~~~~eq~~~~q~qq~q~~q~~~~~~~~~aq~~~~qA~~~k~~a~~~q~~~~a~~a~~qa~~~~~~~~~~~~~a 668 (706) T protein:vir:10 591 GI--VKPRNQQEQAIVQQAQQAQATQPDPNMLLAQAQMVVAQAEAQKSQNETVQTQIKAFTAQQDAMESQANTVYKLAQA 668 (706) T ss_pred CC--ccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 00 000111111111122223333333444444444444444444444443333333322221111111110000 000 Q ss_pred HHHHH---HHHHHHHhh---hccC Q lcl|Aclame:pro 694 VAQAL---AEITASQAN---VTEQ 711 (711) Q Consensus 694 ~~~~~---~e~~~~qa~---~e~Q 711 (711) .+... .+.....++ .+.| T Consensus 669 ~~~~~~~~~q~~q~l~~~~a~q~~ 692 (706) T protein:vir:10 669 RNIDDKAVMETLRLLKEVAASQQQ 692 (706) T ss_pred HHHHHHHHHHHHHHHHHHHHhccC Confidence 00000 000000001 0011 No 136 >protein:vir:9950 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859080;genbank:gi:32170835;genbank:GeneID:2653184 Probab=96.22 E-value=0.00086 Score=37.38 Aligned_cols=606 Identities=12% Similarity=0.038 Sum_probs=170.1 Q ss_pred CCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCCceEehhhHHHHHHHh Q lcl|Aclame:pro 15 AKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRPCLVNNVLPTFVDQVL 94 (711) Q Consensus 15 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~v~~i~ 94 (711) -++...--.+. .+..+..+++.++...+....+....|..... .+..+. .| |...+.+..++ T Consensus 1 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~---------~d~~fy--~G------~Qw~~~~~~~l 62 (714) T protein:vir:99 1 MKNETNTMATK-NDNGATPRFSQRQLQALCSDIDSQPKWRDAAN---------KACAYY--DG------DQLPPEVLQVL 62 (714) T ss_pred CCcccccccCC-CCcchhHHHHHHHHHHHHHHHHhhHHHHHHHH---------HHHHhh--cC------CCCCHHHHHHH Confidence 34433333333 34455666666655554444333333332222 122222 23 34445555555 Q ss_pred hhhhhcccceeEecchhhhhhhhhcccccccccccCCCc--hhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCcc Q lcl|Aclame:pro 95 GDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGK--NDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESGMG 172 (711) Q Consensus 95 g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~--~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~g 172 (711) -...+....+...+.- .....+..-...+-....+ .|....+ +.+++..+....-.......+..++...++. T Consensus 63 ~~~g~p~~~~N~i~~~----v~~v~g~~~~nr~~~~v~p~~~~~~~~~-~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~ 137 (714) T protein:vir:99 63 KDRGQPMTIHNLIAPT----VDGVLGMEAKTRTDLVVMSDEPDDETEK-LAEAINAEFADACRLGNMNKARSDAYAEQIK 137 (714) T ss_pred HhcCCCcEEeccHHHH----HHHHHhHHHhCCcceEEecCCCCchhHH-HHHHHHHHHHHHHHhhchhHHHHHHHHHhhh Confidence 4433322222222211 1111111111222222223 3344333 4444444433332333344455555444433 Q ss_pred EEEEEEeeccCCCCCCcceEEEecCccceeeCCCccccCccccceeeeeecCCHHHHHHhcCCcccch--hhcccccccc Q lcl|Aclame:pro 173 YLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEP--VYEDSVADYD 250 (711) Q Consensus 173 ~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~--~~~~~~~~~~ 250 (711) + .++ |.++++|++....+ ++...++..+ +|.|..+.. +.+....-.. T Consensus 138 ~-G~G--------------------~~~~~~~~d~~~~~-------i~i~~v~p~~---v~~Dp~a~~~D~sDar~~~~~ 186 (714) T protein:vir:99 138 A-GLS--------------------WVEVRRNSDPFGPE-------FKVSTVSRNE---VFWDWLSREADLSDCRWLMRR 186 (714) T ss_pred c-Ccc--------------------eEEeccccCCCCCC-------eEEEecchhh---eeeccccccCChhhccceeee Confidence 2 122 22333333222111 1222222222 232221110 1000000011 Q ss_pred cCCCCC--------eEEEEE----eeeeeeeceeEEEccCCcEEEe--cCcchhHHHH----HhcCchhhh-hcccceEE Q lcl|Aclame:pro 251 TWFTEK--------SVRVSE----YFTREPVIREIALLSDGRSFWL--DALEDIVDEL----LEAGISIVR-TRKVKTFK 311 (711) Q Consensus 251 ~~~~~~--------~v~v~E----~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~----~~~g~~~~~-~~~~~~~~ 311 (711) .|.+.+ ...+++ .|.- . .++...+. +......... ...+..... .+.+.... T Consensus 187 ~~~~~~~~~~~fP~~a~~i~~~~~~~~~-~--------~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E 257 (714) T protein:vir:99 187 RWMDTDEAKATFPGMAQVIDYAIDDWRG-F--------VDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQV 257 (714) T ss_pred ecCCHHHHHHhcCCchhhhhhhhhhhcc-c--------cccccccccccccccchhhhccccccccccccccccEEEEEE Confidence 111110 000000 0000 0 00000000 0000000000 000000000 00010001 Q ss_pred EEEEE---------EecCceeccCccC----------------CCCccceEEEEee------eeccCCccc----ccchH Q lcl|Aclame:pro 312 TYWRK---------ITGANVLEGPVEI----------------PSTTIPVIPVWGK------SLIIKKKEI----FRSII 356 (711) Q Consensus 312 v~~~~---------~~g~~~le~~~p~----------------~~~~~P~vp~~~~------~~~~~~~~~----~~g~v 356 (711) +|+-. .+|..+..++..+ ...++....|.|. +.|.++..+ .+|+. T Consensus 258 ~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~ 337 (714) T protein:vir:99 258 VYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYR 337 (714) T ss_pred EEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCcccccCCCCCCCCceeEEEEeeee Confidence 11100 1122222111100 0111111111110 011111100 12322 Q ss_pred HHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHHHHhhcccCCCceEEeccc------------ccCcCCccc Q lcl|Aclame:pro 357 RHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNFSLLTYIPQ------------YQGDPGPRR 424 (711) Q Consensus 357 ~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~~~~~~~~~~i~~~~~------------~~~~~~i~~ 424 (711) +.. ....--+++.+.++-...++.... ..+. . + +.+++..+.+ +.++.-+.+ T Consensus 338 ~~~---~g~~~G~vr~~~d~Qr~~N~~~s~----------~~~~-l-~-~~~~~~~~~a~~~~d~~~~e~~arp~~vi~~ 401 (714) T protein:vir:99 338 KDK---TGEPYGLISRAIPAQDEVNFRRIK----------LTWL-L-Q-AKRVIMDEDATQLSDNDLMEQIERPDGIIKL 401 (714) T ss_pred eec---cCceeehhhhchhHHHHHHHHHHH----------HHHh-h-c-CCceeeecCcccccHHHHHHhccCCCCceee Confidence 211 111223334444432211110000 0000 0 1 1111211111 111111111 Q ss_pred -------------cCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 425 -------------QPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTKSIR 491 (711) Q Consensus 425 -------------~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~ 491 (711) +.+.+.++-....++........+--++..+....|..+++.+-.+....-......+. .+-..++ T Consensus 402 ~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~l~-~~~Dnl~ 480 (714) T protein:vir:99 402 NPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQGATTLA-EINDNYQ 480 (714) T ss_pred cccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHHHHHHHHH-HHHHHHH Confidence 22222333444455555444444433322222222233333322222211111111211 1222233 Q ss_pred HHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhh---heeeeEE---eecccChHHHHHH- Q lcl|Aclame:pro 492 RVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLN---VQKYDVV---VTTGPAFATQRIE- 564 (711) Q Consensus 492 ~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~---~~~~dv~---v~~~~~~~s~r~~- 564 (711) ...+.+-+++..+.- .+. ..++.+.|....-....-..+.+|... ....|++ .+........... T Consensus 481 ~~~~~~g~~lL~li~-----~~~---~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~ 552 (714) T protein:vir:99 481 FACQQVGRLLLAYLL-----DDL---KKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAF 552 (714) T ss_pred HHHHHHHHHHHHHHH-----HHc---CCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchHH Confidence 333333333333321 111 122323332111000111223333211 0112321 2222222222222 Q ss_pred HHHHHHHHHhhcchhHHHHHHHHHHhc-CCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 565 AAEAMIQFAQAVPSAAAVMADLIAQNM-DWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQA 643 (711) Q Consensus 565 ~~~~L~~l~~~~p~~~~~~~~~~~~~~-~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~ 643 (711) ..+.+..|.+....+.+.+...++..+ ..-+.....+.++......++..+.. ..++.++ +.+.++.+++.+++++ T Consensus 553 r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~-~~~~e~q--~~~~~~q~~~~~q~~l 629 (714) T protein:vir:99 553 KAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPD-EMTPEEQ--EVAAQQQALQQQQAEL 629 (714) T ss_pred HHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCcc-ccchhhH--HHHHHHHHHHHHHHHH Confidence 233333333333322222222222211 11111111122222222222222111 1111112 2222233344445566 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHhhhccC Q lcl|Aclame:pro 644 DMAQAEADTAQAQADMLKAQLETEEAQKQLAMIEDMAQGGDVVYQQVRELVAQAL--AEITASQANVTEQ 711 (711) Q Consensus 644 ~~~k~qae~~~aqae~~~~q~~~~~~q~q~~~~~~~~q~~~~~~~~~~~~~~~~~--~e~~~~qa~~e~Q 711 (711) +..+++++.++.+++..++++.+.+.+.+++.....++......+..++.++... .+..+..+++.+| T Consensus 630 q~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~ 699 (714) T protein:vir:99 630 QMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQNMEQEQDVLQQ 699 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhhhhHHHHH Confidence 6667777777888877777766555554443332222222222222222222111 2223334444444 No 137 >protein:vir:817 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050550;genbank:gi:9633447;genbank:GeneID:1262279 Probab=96.22 E-value=0.00086 Score=37.38 Aligned_cols=606 Identities=12% Similarity=0.038 Sum_probs=170.1 Q ss_pred CCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCCceEehhhHHHHHHHh Q lcl|Aclame:pro 15 AKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRPCLVNNVLPTFVDQVL 94 (711) Q Consensus 15 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~v~~i~ 94 (711) -++...--.+. .+..+..+++.++...+....+....|..... .+..+. .| |...+.+..++ T Consensus 1 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~---------~d~~fy--~G------~Qw~~~~~~~l 62 (714) T protein:vir:81 1 MKNETNTMATK-NDNGATPRFSQRQLQALCSDIDSQPKWRDAAN---------KACAYY--DG------DQLPPEVLQVL 62 (714) T ss_pred CCcccccccCC-CCcchhHHHHHHHHHHHHHHHHhhHHHHHHHH---------HHHHhh--cC------CCCCHHHHHHH Confidence 34433333333 34455666666655554444333333332222 122222 23 34445555555 Q ss_pred hhhhhcccceeEecchhhhhhhhhcccccccccccCCCc--hhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCcc Q lcl|Aclame:pro 95 GDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGK--NDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESGMG 172 (711) Q Consensus 95 g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~--~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~g 172 (711) -...+....+...+.- .....+..-...+-....+ .|....+ +.+++..+....-.......+..++...++. T Consensus 63 ~~~g~p~~~~N~i~~~----v~~v~g~~~~nr~~~~v~p~~~~~~~~~-~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~ 137 (714) T protein:vir:81 63 KDRGQPMTIHNLIAPT----VDGVLGMEAKTRTDLVVMSDEPDDETEK-LAEAINAEFADACRLGNMNKARSDAYAEQIK 137 (714) T ss_pred HhcCCCcEEeccHHHH----HHHHHhHHHhCCcceEEecCCCCchhHH-HHHHHHHHHHHHHHhhchhHHHHHHHHHhhh Confidence 4433322222222211 1111111111222222223 3344333 4444444433332333344455555444433 Q ss_pred EEEEEEeeccCCCCCCcceEEEecCccceeeCCCccccCccccceeeeeecCCHHHHHHhcCCcccch--hhcccccccc Q lcl|Aclame:pro 173 YLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEP--VYEDSVADYD 250 (711) Q Consensus 173 ~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~--~~~~~~~~~~ 250 (711) + .++ |.++++|++....+ ++...++..+ +|.|..+.. +.+....-.. T Consensus 138 ~-G~G--------------------~~~~~~~~d~~~~~-------i~i~~v~p~~---v~~Dp~a~~~D~sDar~~~~~ 186 (714) T protein:vir:81 138 A-GLS--------------------WVEVRRNSDPFGPE-------FKVSTVSRNE---VFWDWLSREADLSDCRWLMRR 186 (714) T ss_pred c-Ccc--------------------eEEeccccCCCCCC-------eEEEecchhh---eeeccccccCChhhccceeee Confidence 2 122 22333333222111 1222222222 232221110 1000000011 Q ss_pred cCCCCC--------eEEEEE----eeeeeeeceeEEEccCCcEEEe--cCcchhHHHH----HhcCchhhh-hcccceEE Q lcl|Aclame:pro 251 TWFTEK--------SVRVSE----YFTREPVIREIALLSDGRSFWL--DALEDIVDEL----LEAGISIVR-TRKVKTFK 311 (711) Q Consensus 251 ~~~~~~--------~v~v~E----~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~----~~~g~~~~~-~~~~~~~~ 311 (711) .|.+.+ ...+++ .|.- . .++...+. +......... ...+..... .+.+.... T Consensus 187 ~~~~~~~~~~~fP~~a~~i~~~~~~~~~-~--------~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E 257 (714) T protein:vir:81 187 RWMDTDEAKATFPGMAQVIDYAIDDWRG-F--------VDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQV 257 (714) T ss_pred ecCCHHHHHHhcCCchhhhhhhhhhhcc-c--------cccccccccccccccchhhhccccccccccccccccEEEEEE Confidence 111110 000000 0000 0 00000000 0000000000 000000000 00010001 Q ss_pred EEEEE---------EecCceeccCccC----------------CCCccceEEEEee------eeccCCccc----ccchH Q lcl|Aclame:pro 312 TYWRK---------ITGANVLEGPVEI----------------PSTTIPVIPVWGK------SLIIKKKEI----FRSII 356 (711) Q Consensus 312 v~~~~---------~~g~~~le~~~p~----------------~~~~~P~vp~~~~------~~~~~~~~~----~~g~v 356 (711) +|+-. .+|..+..++..+ ...++....|.|. +.|.++..+ .+|+. T Consensus 258 ~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~ 337 (714) T protein:vir:81 258 VYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYR 337 (714) T ss_pred EEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCcccccCCCCCCCCceeEEEEeeee Confidence 11100 1122222111100 0111111111110 011111100 12322 Q ss_pred HHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHHHHhhcccCCCceEEeccc------------ccCcCCccc Q lcl|Aclame:pro 357 RHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNFSLLTYIPQ------------YQGDPGPRR 424 (711) Q Consensus 357 ~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~~~~~~~~~~i~~~~~------------~~~~~~i~~ 424 (711) +.. ....--+++.+.++-...++.... ..+. . + +.+++..+.+ +.++.-+.+ T Consensus 338 ~~~---~g~~~G~vr~~~d~Qr~~N~~~s~----------~~~~-l-~-~~~~~~~~~a~~~~d~~~~e~~arp~~vi~~ 401 (714) T protein:vir:81 338 KDK---TGEPYGLISRAIPAQDEVNFRRIK----------LTWL-L-Q-AKRVIMDEDATQLSDNDLMEQIERPDGIIKL 401 (714) T ss_pred eec---cCceeehhhhchhHHHHHHHHHHH----------HHHh-h-c-CCceeeecCcccccHHHHHHhccCCCCceee Confidence 211 111223334444432211110000 0000 0 1 1111211111 111111111 Q ss_pred -------------cCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 425 -------------QPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTKSIR 491 (711) Q Consensus 425 -------------~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~ 491 (711) +.+.+.++-....++........+--++..+....|..+++.+-.+....-......+. .+-..++ T Consensus 402 ~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~l~-~~~Dnl~ 480 (714) T protein:vir:81 402 NPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQGATTLA-EINDNYQ 480 (714) T ss_pred cccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHHHHHHHHH-HHHHHHH Confidence 22222333444455555444444433322222222233333322222211111111211 1222233 Q ss_pred HHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhh---heeeeEE---eecccChHHHHHH- Q lcl|Aclame:pro 492 RVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLN---VQKYDVV---VTTGPAFATQRIE- 564 (711) Q Consensus 492 ~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~---~~~~dv~---v~~~~~~~s~r~~- 564 (711) ...+.+-+++..+.- .+. ..++.+.|....-....-..+.+|... ....|++ .+........... T Consensus 481 ~~~~~~g~~lL~li~-----~~~---~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~ 552 (714) T protein:vir:81 481 FACQQVGRLLLAYLL-----DDL---KKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAF 552 (714) T ss_pred HHHHHHHHHHHHHHH-----HHc---CCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchHH Confidence 333333333333321 111 122323332111000111223333211 0112321 2222222222222 Q ss_pred HHHHHHHHHhhcchhHHHHHHHHHHhc-CCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 565 AAEAMIQFAQAVPSAAAVMADLIAQNM-DWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQA 643 (711) Q Consensus 565 ~~~~L~~l~~~~p~~~~~~~~~~~~~~-~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~ 643 (711) ..+.+..|.+....+.+.+...++..+ ..-+.....+.++......++..+.. ..++.++ +.+.++.+++.+++++ T Consensus 553 r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~-~~~~e~q--~~~~~~q~~~~~q~~l 629 (714) T protein:vir:81 553 KAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPD-EMTPEEQ--EVAAQQQALQQQQAEL 629 (714) T ss_pred HHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCcc-ccchhhH--HHHHHHHHHHHHHHHH Confidence 233333333333322222222222211 11111111122222222222222111 1111112 2222233344445566 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHhhhccC Q lcl|Aclame:pro 644 DMAQAEADTAQAQADMLKAQLETEEAQKQLAMIEDMAQGGDVVYQQVRELVAQAL--AEITASQANVTEQ 711 (711) Q Consensus 644 ~~~k~qae~~~aqae~~~~q~~~~~~q~q~~~~~~~~q~~~~~~~~~~~~~~~~~--~e~~~~qa~~e~Q 711 (711) +..+++++.++.+++..++++.+.+.+.+++.....++......+..++.++... .+..+..+++.+| T Consensus 630 q~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~ 699 (714) T protein:vir:81 630 QMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQNMEQEQDVLQQ 699 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhhhhHHHHH Confidence 6667777777888877777766555554443332222222222222222222111 2223334444444 No 138 >protein:vir:10117 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859247;genbank:gi:32171003;genbank:GeneID:2653343 Probab=96.22 E-value=0.00086 Score=37.38 Aligned_cols=606 Identities=12% Similarity=0.038 Sum_probs=170.1 Q ss_pred CCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCCceEehhhHHHHHHHh Q lcl|Aclame:pro 15 AKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRPCLVNNVLPTFVDQVL 94 (711) Q Consensus 15 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~v~~i~ 94 (711) -++...--.+. .+..+..+++.++...+....+....|..... .+..+. .| |...+.+..++ T Consensus 1 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~---------~d~~fy--~G------~Qw~~~~~~~l 62 (714) T protein:vir:10 1 MKNETNTMATK-NDNGATPRFSQRQLQALCSDIDSQPKWRDAAN---------KACAYY--DG------DQLPPEVLQVL 62 (714) T ss_pred CCcccccccCC-CCcchhHHHHHHHHHHHHHHHHhhHHHHHHHH---------HHHHhh--cC------CCCCHHHHHHH Confidence 34433333333 34455666666655554444333333332222 122222 23 34445555555 Q ss_pred hhhhhcccceeEecchhhhhhhhhcccccccccccCCCc--hhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCcc Q lcl|Aclame:pro 95 GDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGK--NDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESGMG 172 (711) Q Consensus 95 g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~--~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~g 172 (711) -...+....+...+.- .....+..-...+-....+ .|....+ +.+++..+....-.......+..++...++. T Consensus 63 ~~~g~p~~~~N~i~~~----v~~v~g~~~~nr~~~~v~p~~~~~~~~~-~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~ 137 (714) T protein:vir:10 63 KDRGQPMTIHNLIAPT----VDGVLGMEAKTRTDLVVMSDEPDDETEK-LAEAINAEFADACRLGNMNKARSDAYAEQIK 137 (714) T ss_pred HhcCCCcEEeccHHHH----HHHHHhHHHhCCcceEEecCCCCchhHH-HHHHHHHHHHHHHHhhchhHHHHHHHHHhhh Confidence 4433322222222211 1111111111222222223 3344333 4444444433332333344455555444433 Q ss_pred EEEEEEeeccCCCCCCcceEEEecCccceeeCCCccccCccccceeeeeecCCHHHHHHhcCCcccch--hhcccccccc Q lcl|Aclame:pro 173 YLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEP--VYEDSVADYD 250 (711) Q Consensus 173 ~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~--~~~~~~~~~~ 250 (711) + .++ |.++++|++....+ ++...++..+ +|.|..+.. +.+....-.. T Consensus 138 ~-G~G--------------------~~~~~~~~d~~~~~-------i~i~~v~p~~---v~~Dp~a~~~D~sDar~~~~~ 186 (714) T protein:vir:10 138 A-GLS--------------------WVEVRRNSDPFGPE-------FKVSTVSRNE---VFWDWLSREADLSDCRWLMRR 186 (714) T ss_pred c-Ccc--------------------eEEeccccCCCCCC-------eEEEecchhh---eeeccccccCChhhccceeee Confidence 2 122 22333333222111 1222222222 232221110 1000000011 Q ss_pred cCCCCC--------eEEEEE----eeeeeeeceeEEEccCCcEEEe--cCcchhHHHH----HhcCchhhh-hcccceEE Q lcl|Aclame:pro 251 TWFTEK--------SVRVSE----YFTREPVIREIALLSDGRSFWL--DALEDIVDEL----LEAGISIVR-TRKVKTFK 311 (711) Q Consensus 251 ~~~~~~--------~v~v~E----~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~----~~~g~~~~~-~~~~~~~~ 311 (711) .|.+.+ ...+++ .|.- . .++...+. +......... ...+..... .+.+.... T Consensus 187 ~~~~~~~~~~~fP~~a~~i~~~~~~~~~-~--------~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E 257 (714) T protein:vir:10 187 RWMDTDEAKATFPGMAQVIDYAIDDWRG-F--------VDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQV 257 (714) T ss_pred ecCCHHHHHHhcCCchhhhhhhhhhhcc-c--------cccccccccccccccchhhhccccccccccccccccEEEEEE Confidence 111110 000000 0000 0 00000000 0000000000 000000000 00010001 Q ss_pred EEEEE---------EecCceeccCccC----------------CCCccceEEEEee------eeccCCccc----ccchH Q lcl|Aclame:pro 312 TYWRK---------ITGANVLEGPVEI----------------PSTTIPVIPVWGK------SLIIKKKEI----FRSII 356 (711) Q Consensus 312 v~~~~---------~~g~~~le~~~p~----------------~~~~~P~vp~~~~------~~~~~~~~~----~~g~v 356 (711) +|+-. .+|..+..++..+ ...++....|.|. +.|.++..+ .+|+. T Consensus 258 ~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~ 337 (714) T protein:vir:10 258 VYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYR 337 (714) T ss_pred EEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCcccccCCCCCCCCceeEEEEeeee Confidence 11100 1122222111100 0111111111110 011111100 12322 Q ss_pred HHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHHHHhhcccCCCceEEeccc------------ccCcCCccc Q lcl|Aclame:pro 357 RHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNFSLLTYIPQ------------YQGDPGPRR 424 (711) Q Consensus 357 ~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~~~~~~~~~~i~~~~~------------~~~~~~i~~ 424 (711) +.. ....--+++.+.++-...++.... ..+. . + +.+++..+.+ +.++.-+.+ T Consensus 338 ~~~---~g~~~G~vr~~~d~Qr~~N~~~s~----------~~~~-l-~-~~~~~~~~~a~~~~d~~~~e~~arp~~vi~~ 401 (714) T protein:vir:10 338 KDK---TGEPYGLISRAIPAQDEVNFRRIK----------LTWL-L-Q-AKRVIMDEDATQLSDNDLMEQIERPDGIIKL 401 (714) T ss_pred eec---cCceeehhhhchhHHHHHHHHHHH----------HHHh-h-c-CCceeeecCcccccHHHHHHhccCCCCceee Confidence 211 111223334444432211110000 0000 0 1 1111211111 111111111 Q ss_pred -------------cCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 425 -------------QPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTKSIR 491 (711) Q Consensus 425 -------------~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~ 491 (711) +.+.+.++-....++........+--++..+....|..+++.+-.+....-......+. .+-..++ T Consensus 402 ~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~l~-~~~Dnl~ 480 (714) T protein:vir:10 402 NPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQGATTLA-EINDNYQ 480 (714) T ss_pred cccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHHHHHHHHH-HHHHHHH Confidence 22222333444455555444444433322222222233333322222211111111211 1222233 Q ss_pred HHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhh---heeeeEE---eecccChHHHHHH- Q lcl|Aclame:pro 492 RVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLN---VQKYDVV---VTTGPAFATQRIE- 564 (711) Q Consensus 492 ~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~---~~~~dv~---v~~~~~~~s~r~~- 564 (711) ...+.+-+++..+.- .+. ..++.+.|....-....-..+.+|... ....|++ .+........... T Consensus 481 ~~~~~~g~~lL~li~-----~~~---~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~ 552 (714) T protein:vir:10 481 FACQQVGRLLLAYLL-----DDL---KKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAF 552 (714) T ss_pred HHHHHHHHHHHHHHH-----HHc---CCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchHH Confidence 333333333333321 111 122323332111000111223333211 0112321 2222222222222 Q ss_pred HHHHHHHHHhhcchhHHHHHHHHHHhc-CCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 565 AAEAMIQFAQAVPSAAAVMADLIAQNM-DWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQA 643 (711) Q Consensus 565 ~~~~L~~l~~~~p~~~~~~~~~~~~~~-~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~ 643 (711) ..+.+..|.+....+.+.+...++..+ ..-+.....+.++......++..+.. ..++.++ +.+.++.+++.+++++ T Consensus 553 r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~-~~~~e~q--~~~~~~q~~~~~q~~l 629 (714) T protein:vir:10 553 KAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPD-EMTPEEQ--EVAAQQQALQQQQAEL 629 (714) T ss_pred HHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCcc-ccchhhH--HHHHHHHHHHHHHHHH Confidence 233333333333322222222222211 11111111122222222222222111 1111112 2222233344445566 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHhhhccC Q lcl|Aclame:pro 644 DMAQAEADTAQAQADMLKAQLETEEAQKQLAMIEDMAQGGDVVYQQVRELVAQAL--AEITASQANVTEQ 711 (711) Q Consensus 644 ~~~k~qae~~~aqae~~~~q~~~~~~q~q~~~~~~~~q~~~~~~~~~~~~~~~~~--~e~~~~qa~~e~Q 711 (711) +..+++++.++.+++..++++.+.+.+.+++.....++......+..++.++... .+..+..+++.+| T Consensus 630 q~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~ 699 (714) T protein:vir:10 630 QMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQNMEQEQDVLQQ 699 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhhhhHHHHH Confidence 6667777777888877777766555554443332222222222222222222111 2223334444444 No 139 >protein:vir:2764 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612881;genbank:gi:20065798;genbank:GeneID:935623 Probab=96.22 E-value=0.00086 Score=37.38 Aligned_cols=606 Identities=12% Similarity=0.038 Sum_probs=170.1 Q ss_pred CCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCCceEehhhHHHHHHHh Q lcl|Aclame:pro 15 AKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRPCLVNNVLPTFVDQVL 94 (711) Q Consensus 15 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~v~~i~ 94 (711) -++...--.+. .+..+..+++.++...+....+....|..... .+..+. .| |...+.+..++ T Consensus 1 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~---------~d~~fy--~G------~Qw~~~~~~~l 62 (714) T protein:vir:27 1 MKNETNTMATK-NDNGATPRFSQRQLQALCSDIDSQPKWRDAAN---------KACAYY--DG------DQLPPEVLQVL 62 (714) T ss_pred CCcccccccCC-CCcchhHHHHHHHHHHHHHHHHhhHHHHHHHH---------HHHHhh--cC------CCCCHHHHHHH Confidence 34433333333 34455666666655554444333333332222 122222 23 34445555555 Q ss_pred hhhhhcccceeEecchhhhhhhhhcccccccccccCCCc--hhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCcc Q lcl|Aclame:pro 95 GDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGK--NDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESGMG 172 (711) Q Consensus 95 g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~--~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~g 172 (711) -...+....+...+.- .....+..-...+-....+ .|....+ +.+++..+....-.......+..++...++. T Consensus 63 ~~~g~p~~~~N~i~~~----v~~v~g~~~~nr~~~~v~p~~~~~~~~~-~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~ 137 (714) T protein:vir:27 63 KDRGQPMTIHNLIAPT----VDGVLGMEAKTRTDLVVMSDEPDDETEK-LAEAINAEFADACRLGNMNKARSDAYAEQIK 137 (714) T ss_pred HhcCCCcEEeccHHHH----HHHHHhHHHhCCcceEEecCCCCchhHH-HHHHHHHHHHHHHHhhchhHHHHHHHHHhhh Confidence 4433322222222211 1111111111222222223 3344333 4444444433332333344455555444433 Q ss_pred EEEEEEeeccCCCCCCcceEEEecCccceeeCCCccccCccccceeeeeecCCHHHHHHhcCCcccch--hhcccccccc Q lcl|Aclame:pro 173 YLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEP--VYEDSVADYD 250 (711) Q Consensus 173 ~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~--~~~~~~~~~~ 250 (711) + .++ |.++++|++....+ ++...++..+ +|.|..+.. +.+....-.. T Consensus 138 ~-G~G--------------------~~~~~~~~d~~~~~-------i~i~~v~p~~---v~~Dp~a~~~D~sDar~~~~~ 186 (714) T protein:vir:27 138 A-GLS--------------------WVEVRRNSDPFGPE-------FKVSTVSRNE---VFWDWLSREADLSDCRWLMRR 186 (714) T ss_pred c-Ccc--------------------eEEeccccCCCCCC-------eEEEecchhh---eeeccccccCChhhccceeee Confidence 2 122 22333333222111 1222222222 232221110 1000000011 Q ss_pred cCCCCC--------eEEEEE----eeeeeeeceeEEEccCCcEEEe--cCcchhHHHH----HhcCchhhh-hcccceEE Q lcl|Aclame:pro 251 TWFTEK--------SVRVSE----YFTREPVIREIALLSDGRSFWL--DALEDIVDEL----LEAGISIVR-TRKVKTFK 311 (711) Q Consensus 251 ~~~~~~--------~v~v~E----~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~----~~~g~~~~~-~~~~~~~~ 311 (711) .|.+.+ ...+++ .|.- . .++...+. +......... ...+..... .+.+.... T Consensus 187 ~~~~~~~~~~~fP~~a~~i~~~~~~~~~-~--------~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E 257 (714) T protein:vir:27 187 RWMDTDEAKATFPGMAQVIDYAIDDWRG-F--------VDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQV 257 (714) T ss_pred ecCCHHHHHHhcCCchhhhhhhhhhhcc-c--------cccccccccccccccchhhhccccccccccccccccEEEEEE Confidence 111110 000000 0000 0 00000000 0000000000 000000000 00010001 Q ss_pred EEEEE---------EecCceeccCccC----------------CCCccceEEEEee------eeccCCccc----ccchH Q lcl|Aclame:pro 312 TYWRK---------ITGANVLEGPVEI----------------PSTTIPVIPVWGK------SLIIKKKEI----FRSII 356 (711) Q Consensus 312 v~~~~---------~~g~~~le~~~p~----------------~~~~~P~vp~~~~------~~~~~~~~~----~~g~v 356 (711) +|+-. .+|..+..++..+ ...++....|.|. +.|.++..+ .+|+. T Consensus 258 ~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~ 337 (714) T protein:vir:27 258 VYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYR 337 (714) T ss_pred EEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCcccccCCCCCCCCceeEEEEeeee Confidence 11100 1122222111100 0111111111110 011111100 12322 Q ss_pred HHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHHHHhhcccCCCceEEeccc------------ccCcCCccc Q lcl|Aclame:pro 357 RHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNFSLLTYIPQ------------YQGDPGPRR 424 (711) Q Consensus 357 ~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~~~~~~~~~~i~~~~~------------~~~~~~i~~ 424 (711) +.. ....--+++.+.++-...++.... ..+. . + +.+++..+.+ +.++.-+.+ T Consensus 338 ~~~---~g~~~G~vr~~~d~Qr~~N~~~s~----------~~~~-l-~-~~~~~~~~~a~~~~d~~~~e~~arp~~vi~~ 401 (714) T protein:vir:27 338 KDK---TGEPYGLISRAIPAQDEVNFRRIK----------LTWL-L-Q-AKRVIMDEDATQLSDNDLMEQIERPDGIIKL 401 (714) T ss_pred eec---cCceeehhhhchhHHHHHHHHHHH----------HHHh-h-c-CCceeeecCcccccHHHHHHhccCCCCceee Confidence 211 111223334444432211110000 0000 0 1 1111211111 111111111 Q ss_pred -------------cCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 425 -------------QPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTKSIR 491 (711) Q Consensus 425 -------------~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~ 491 (711) +.+.+.++-....++........+--++..+....|..+++.+-.+....-......+. .+-..++ T Consensus 402 ~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~l~-~~~Dnl~ 480 (714) T protein:vir:27 402 NPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQGATTLA-EINDNYQ 480 (714) T ss_pred cccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHHHHHHHHH-HHHHHHH Confidence 22222333444455555444444433322222222233333322222211111111211 1222233 Q ss_pred HHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhh---heeeeEE---eecccChHHHHHH- Q lcl|Aclame:pro 492 RVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLN---VQKYDVV---VTTGPAFATQRIE- 564 (711) Q Consensus 492 ~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~---~~~~dv~---v~~~~~~~s~r~~- 564 (711) ...+.+-+++..+.- .+. ..++.+.|....-....-..+.+|... ....|++ .+........... T Consensus 481 ~~~~~~g~~lL~li~-----~~~---~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~ 552 (714) T protein:vir:27 481 FACQQVGRLLLAYLL-----DDL---KKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAF 552 (714) T ss_pred HHHHHHHHHHHHHHH-----HHc---CCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchHH Confidence 333333333333321 111 122323332111000111223333211 0112321 2222222222222 Q ss_pred HHHHHHHHHhhcchhHHHHHHHHHHhc-CCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 565 AAEAMIQFAQAVPSAAAVMADLIAQNM-DWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQA 643 (711) Q Consensus 565 ~~~~L~~l~~~~p~~~~~~~~~~~~~~-~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~ 643 (711) ..+.+..|.+....+.+.+...++..+ ..-+.....+.++......++..+.. ..++.++ +.+.++.+++.+++++ T Consensus 553 r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~-~~~~e~q--~~~~~~q~~~~~q~~l 629 (714) T protein:vir:27 553 KAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPD-EMTPEEQ--EVAAQQQALQQQQAEL 629 (714) T ss_pred HHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCcc-ccchhhH--HHHHHHHHHHHHHHHH Confidence 233333333333322222222222211 11111111122222222222222111 1111112 2222233344445566 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHhhhccC Q lcl|Aclame:pro 644 DMAQAEADTAQAQADMLKAQLETEEAQKQLAMIEDMAQGGDVVYQQVRELVAQAL--AEITASQANVTEQ 711 (711) Q Consensus 644 ~~~k~qae~~~aqae~~~~q~~~~~~q~q~~~~~~~~q~~~~~~~~~~~~~~~~~--~e~~~~qa~~e~Q 711 (711) +..+++++.++.+++..++++.+.+.+.+++.....++......+..++.++... .+..+..+++.+| T Consensus 630 q~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~ 699 (714) T protein:vir:27 630 QMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQNMEQEQDVLQQ 699 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhhhhHHHHH Confidence 6667777777888877777766555554443332222222222222222222111 2223334444444 No 140 >protein:vir:3296 Length: 714 # NCBI annotation: putative portal protein # Family: family:all:487 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049512;genbank:gi:9632518;genbank:GeneID:1262005 Probab=96.22 E-value=0.00086 Score=37.38 Aligned_cols=606 Identities=12% Similarity=0.038 Sum_probs=170.1 Q ss_pred CCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCCceEehhhHHHHHHHh Q lcl|Aclame:pro 15 AKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRPCLVNNVLPTFVDQVL 94 (711) Q Consensus 15 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~v~~i~ 94 (711) -++...--.+. .+..+..+++.++...+....+....|..... .+..+. .| |...+.+..++ T Consensus 1 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~---------~d~~fy--~G------~Qw~~~~~~~l 62 (714) T protein:vir:32 1 MKNETNTMATK-NDNGATPRFSQRQLQALCSDIDSQPKWRDAAN---------KACAYY--DG------DQLPPEVLQVL 62 (714) T ss_pred CCcccccccCC-CCcchhHHHHHHHHHHHHHHHHhhHHHHHHHH---------HHHHhh--cC------CCCCHHHHHHH Confidence 34433333333 34455666666655554444333333332222 122222 23 34445555555 Q ss_pred hhhhhcccceeEecchhhhhhhhhcccccccccccCCCc--hhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCcc Q lcl|Aclame:pro 95 GDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGK--NDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESGMG 172 (711) Q Consensus 95 g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~--~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~g 172 (711) -...+....+...+.- .....+..-...+-....+ .|....+ +.+++..+....-.......+..++...++. T Consensus 63 ~~~g~p~~~~N~i~~~----v~~v~g~~~~nr~~~~v~p~~~~~~~~~-~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~ 137 (714) T protein:vir:32 63 KDRGQPMTIHNLIAPT----VDGVLGMEAKTRTDLVVMSDEPDDETEK-LAEAINAEFADACRLGNMNKARSDAYAEQIK 137 (714) T ss_pred HhcCCCcEEeccHHHH----HHHHHhHHHhCCcceEEecCCCCchhHH-HHHHHHHHHHHHHHhhchhHHHHHHHHHhhh Confidence 4433322222222211 1111111111222222223 3344333 4444444433332333344455555444433 Q ss_pred EEEEEEeeccCCCCCCcceEEEecCccceeeCCCccccCccccceeeeeecCCHHHHHHhcCCcccch--hhcccccccc Q lcl|Aclame:pro 173 YLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEP--VYEDSVADYD 250 (711) Q Consensus 173 ~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~--~~~~~~~~~~ 250 (711) + .++ |.++++|++....+ ++...++..+ +|.|..+.. +.+....-.. T Consensus 138 ~-G~G--------------------~~~~~~~~d~~~~~-------i~i~~v~p~~---v~~Dp~a~~~D~sDar~~~~~ 186 (714) T protein:vir:32 138 A-GLS--------------------WVEVRRNSDPFGPE-------FKVSTVSRNE---VFWDWLSREADLSDCRWLMRR 186 (714) T ss_pred c-Ccc--------------------eEEeccccCCCCCC-------eEEEecchhh---eeeccccccCChhhccceeee Confidence 2 122 22333333222111 1222222222 232221110 1000000011 Q ss_pred cCCCCC--------eEEEEE----eeeeeeeceeEEEccCCcEEEe--cCcchhHHHH----HhcCchhhh-hcccceEE Q lcl|Aclame:pro 251 TWFTEK--------SVRVSE----YFTREPVIREIALLSDGRSFWL--DALEDIVDEL----LEAGISIVR-TRKVKTFK 311 (711) Q Consensus 251 ~~~~~~--------~v~v~E----~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~----~~~g~~~~~-~~~~~~~~ 311 (711) .|.+.+ ...+++ .|.- . .++...+. +......... ...+..... .+.+.... T Consensus 187 ~~~~~~~~~~~fP~~a~~i~~~~~~~~~-~--------~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E 257 (714) T protein:vir:32 187 RWMDTDEAKATFPGMAQVIDYAIDDWRG-F--------VDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQV 257 (714) T ss_pred ecCCHHHHHHhcCCchhhhhhhhhhhcc-c--------cccccccccccccccchhhhccccccccccccccccEEEEEE Confidence 111110 000000 0000 0 00000000 0000000000 000000000 00010001 Q ss_pred EEEEE---------EecCceeccCccC----------------CCCccceEEEEee------eeccCCccc----ccchH Q lcl|Aclame:pro 312 TYWRK---------ITGANVLEGPVEI----------------PSTTIPVIPVWGK------SLIIKKKEI----FRSII 356 (711) Q Consensus 312 v~~~~---------~~g~~~le~~~p~----------------~~~~~P~vp~~~~------~~~~~~~~~----~~g~v 356 (711) +|+-. .+|..+..++..+ ...++....|.|. +.|.++..+ .+|+. T Consensus 258 ~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~ 337 (714) T protein:vir:32 258 VYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYR 337 (714) T ss_pred EEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCcccccCCCCCCCCceeEEEEeeee Confidence 11100 1122222111100 0111111111110 011111100 12322 Q ss_pred HHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHHHHhhcccCCCceEEeccc------------ccCcCCccc Q lcl|Aclame:pro 357 RHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNFSLLTYIPQ------------YQGDPGPRR 424 (711) Q Consensus 357 ~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~~~~~~~~~~i~~~~~------------~~~~~~i~~ 424 (711) +.. ....--+++.+.++-...++.... ..+. . + +.+++..+.+ +.++.-+.+ T Consensus 338 ~~~---~g~~~G~vr~~~d~Qr~~N~~~s~----------~~~~-l-~-~~~~~~~~~a~~~~d~~~~e~~arp~~vi~~ 401 (714) T protein:vir:32 338 KDK---TGEPYGLISRAIPAQDEVNFRRIK----------LTWL-L-Q-AKRVIMDEDATQLSDNDLMEQIERPDGIIKL 401 (714) T ss_pred eec---cCceeehhhhchhHHHHHHHHHHH----------HHHh-h-c-CCceeeecCcccccHHHHHHhccCCCCceee Confidence 211 111223334444432211110000 0000 0 1 1111211111 111111111 Q ss_pred -------------cCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 425 -------------QPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTKSIR 491 (711) Q Consensus 425 -------------~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~ 491 (711) +.+.+.++-....++........+--++..+....|..+++.+-.+....-......+. .+-..++ T Consensus 402 ~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~l~-~~~Dnl~ 480 (714) T protein:vir:32 402 NPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQGATTLA-EINDNYQ 480 (714) T ss_pred cccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHHHHHHHHH-HHHHHHH Confidence 22222333444455555444444433322222222233333322222211111111211 1222233 Q ss_pred HHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhh---heeeeEE---eecccChHHHHHH- Q lcl|Aclame:pro 492 RVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLN---VQKYDVV---VTTGPAFATQRIE- 564 (711) Q Consensus 492 ~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~---~~~~dv~---v~~~~~~~s~r~~- 564 (711) ...+.+-+++..+.- .+. ..++.+.|....-....-..+.+|... ....|++ .+........... T Consensus 481 ~~~~~~g~~lL~li~-----~~~---~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~ 552 (714) T protein:vir:32 481 FACQQVGRLLLAYLL-----DDL---KKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAF 552 (714) T ss_pred HHHHHHHHHHHHHHH-----HHc---CCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchHH Confidence 333333333333321 111 122323332111000111223333211 0112321 2222222222222 Q ss_pred HHHHHHHHHhhcchhHHHHHHHHHHhc-CCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 565 AAEAMIQFAQAVPSAAAVMADLIAQNM-DWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQA 643 (711) Q Consensus 565 ~~~~L~~l~~~~p~~~~~~~~~~~~~~-~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~ 643 (711) ..+.+..|.+....+.+.+...++..+ ..-+.....+.++......++..+.. ..++.++ +.+.++.+++.+++++ T Consensus 553 r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~-~~~~e~q--~~~~~~q~~~~~q~~l 629 (714) T protein:vir:32 553 KAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPD-EMTPEEQ--EVAAQQQALQQQQAEL 629 (714) T ss_pred HHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCcc-ccchhhH--HHHHHHHHHHHHHHHH Confidence 233333333333322222222222211 11111111122222222222222111 1111112 2222233344445566 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHhhhccC Q lcl|Aclame:pro 644 DMAQAEADTAQAQADMLKAQLETEEAQKQLAMIEDMAQGGDVVYQQVRELVAQAL--AEITASQANVTEQ 711 (711) Q Consensus 644 ~~~k~qae~~~aqae~~~~q~~~~~~q~q~~~~~~~~q~~~~~~~~~~~~~~~~~--~e~~~~qa~~e~Q 711 (711) +..+++++.++.+++..++++.+.+.+.+++.....++......+..++.++... .+..+..+++.+| T Consensus 630 q~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~ 699 (714) T protein:vir:32 630 QMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQNMEQEQDVLQQ 699 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhhhhHHHHH Confidence 6667777777888877777766555554443332222222222222222222111 2223334444444 No 141 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=95.46 E-value=0.002 Score=35.35 Aligned_cols=462 Identities=10% Similarity=0.012 Sum_probs=178.3 Q ss_pred CCc-Cc--chHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCCceEehhhHHHHHHHhhhh Q lcl|Aclame:pro 21 YAK-NN--DDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRPCLVNNVLPTFVDQVLGDQ 97 (711) Q Consensus 21 ~~~-~~--~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~v~~i~g~~ 97 (711) +|. +. ++-.....+ -+.++.++.....+|.....-+-...+.+++.+....-+.+-.-...+|.++.+|+.++|.. T Consensus 1 m~~V~~~hp~y~~~~~~-W~~ird~~~G~~~~r~~g~~YLP~~~~e~~~~e~~~~Y~~rl~rA~~~n~~~~t~~~l~G~v 79 (501) T protein:vir:95 1 MPNVSFIRPELGKLLPL-YYLIRDAIAGEPTVKGARTTYLPMPNAEDQSKENKARYEAYLKRAVFYNVARRTLFGLVGQV 79 (501) T ss_pred CCCCCCCCHHHHHHHHH-HHHHHHHhcChHHHHhcccccCcCCCCCCCcccchHHHHHHhhccccCchHHHHHHHHhhhh Confidence 331 11 121111111 12234444444444432222222234567777655555555455788999999999999986 Q ss_pred hhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHH-hhcCHHHHHHHHHHHHHhcCccEEEE Q lcl|Aclame:pro 98 RQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIE-YNCDAETEYDIAFQGAVESGMGYLRV 176 (711) Q Consensus 98 ~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~-~~~~~~~~~~~a~~~~~~~G~g~~~v 176 (711) =+..|.+. +-..|..++..+- +-++.......++..++.+|.+++=| T Consensus 80 f~k~p~~~--------------------------------~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilV 127 (501) T protein:vir:95 80 FMRDPVVK--------------------------------VPALLNPLVANATGSGINLTQLAKRAVSLNLAYSRAGLLV 127 (501) T ss_pred hcCCccee--------------------------------CcHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEE Confidence 54433332 1233455555553 34678999999999999999887544 Q ss_pred EEeeccC---CCC--------CCcceEEEecCccceeeCCCccccCc-cccceeeeeecCCHHHHHHhcCCcccchhhcc Q lcl|Aclame:pro 177 RSDYLAD---DSF--------EQDLIIEAIQNQFSVTIDPDAKKRDR-SDMNWCLIDDTMSKEKFKALYPDATAEPVYED 244 (711) Q Consensus 177 ~~d~~~~---~~~--------~~~i~i~~v~~~~~v~~Dp~a~~~d~-~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~ 244 (711) ||-.. +.. .-.|.+..| +|.+|+ |......+. ....++..+.... +.. T Consensus 128 --D~P~~~~~~~~t~a~~~~~~~rPy~~~~-~~~~Ii-nW~~~~v~g~~~l~~v~l~E~~~---------~~d------- 187 (501) T protein:vir:95 128 --DYPTTEAEGGASIADLEAGRIRPTLYVY-SPTEII-NWRTTDRGAEEVLSLVVLFETWC---------AAD------- 187 (501) T ss_pred --eecCCCCcccccHHHHHhccCCcEEEEe-cHhhhc-CcceeccCCceeeeEEEEEEEEe---------ecC------- Confidence 44211 101 113666666 566665 232222220 1122222221111 000 Q ss_pred cccccccCCCCCeEEEE----------EeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEEEE Q lcl|Aclame:pro 245 SVADYDTWFTEKSVRVS----------EYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYW 314 (711) Q Consensus 245 ~~~~~~~~~~~~~v~v~----------E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~ 314 (711) + .+.......+|+. ++|.+.... .. .|....... + T Consensus 188 --~-~f~~~~~~q~RvL~~~~~g~~~~~v~r~~~~~----------~~--------------~~~~~~~~~--------~ 232 (501) T protein:vir:95 188 --D-GFEMKTSGQFRVLRLDEEGYYVHEIWREPQPT----------KA--------------DGSKIPKGN--------Y 232 (501) T ss_pred --C-CcccceeEEEEEEeeCCCceEEEEEEEecCCc----------cc--------------CcceecCCc--------c Confidence 0 0111111222222 222111100 00 000000000 0 Q ss_pred EEEecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCC Q lcl|Aclame:pro 315 RKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEG 394 (711) Q Consensus 315 ~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~ 394 (711) .......+.+..--+.+.+|||++..... +...+....-.+-++....=...+-..+++..+..|..++. |. T Consensus 233 -~~~~~~~~~~~g~~~l~~IPfv~~~~~~~---~~~~~~pPLl~lA~lni~hy~~ssd~~~~l~~~~~P~l~i~-G~--- 304 (501) T protein:vir:95 233 -QQYVVYKPTDAQGKRLTEIPFMFIGSENN---DSNPDNPNFYDLASLNMAHYRNSADYEESCYIVGQPTPVLI-GL--- 304 (501) T ss_pred -cccceeeeeccCCCcCCeeeEEEEecCCC---CCCCCccchHHHHHHHHHHHhhhhHHHHHHHHcccceeeee-CC--- Confidence 00000011111112446677765432211 11111222334444433221122335566666666666552 32 Q ss_pred hHHHHhhcccCCCceEEeccccc----CcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHH Q lcl|Aclame:pro 395 REDEWEQANTKNFSLLTYIPQYQ----GDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIA 470 (711) Q Consensus 395 ~~~~~~~~~~~~~~~i~~~~~~~----~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~ 470 (711) ++.+....... .|.+-+... .+..+.++.+....- ....|+...+.|..+ |.. .+...+.+.||.+... T Consensus 305 -~~~~~~~~~~~--~i~~G~~~~~~lP~~~~~~~ie~~~~~i-~~~~l~~l~~~m~~~-Ga~--ll~~~~~~~Ta~~~~~ 377 (501) T protein:vir:95 305 -TEEWVTNVLKG--SVNFGSRGGIPLPVGADAKLLQASENTM-LKEAMDTKERQMVAL-GAK--LVEQKEVQRTATEAEL 377 (501) T ss_pred -cccccccCCCC--ceeecccccccCCCCCceeEEecChhhH-HHHHHHHHHHHHHHH-HHh--hccCCccchhHHHHHH Confidence 22222211111 122222111 112244554422111 133444444455433 421 2222334477877777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeE Q lcl|Aclame:pro 471 RQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDV 550 (711) Q Consensus 471 ~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv 550 (711) ...+....|..+..|+..++.. +|.++..|... .+ .+.+ +.+++ .|.. T Consensus 378 ~~~~~~S~L~~~a~~le~al~~----~l~~~a~w~g~------~~--~~~~-v~i~~-------------------df~~ 425 (501) T protein:vir:95 378 EAASEGSTLSSATKNVSAAFEW----ALKWAARWVGQ------AD--SGVK-FELNT-------------------DFDI 425 (501) T ss_pred HHHHHhHHHHHHHHHHHHHHHH----HHHHHHHHcCC------CC--CceE-EEEec-------------------cccc Confidence 7777777788888887766554 55566666421 00 0001 22221 1110 Q ss_pred EeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCCcc--hHHHHHHHHhhhcchhhcchhhh-------hhh Q lcl|Aclame:pro 551 VVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDWPG--ADVIAERLKKIVPPNVLSKDERE-------AIE 621 (711) Q Consensus 551 ~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~~~--~~e~~~~l~~~~~~~~~~~~~~~-------~~~ 621 (711) . .. .+.. .++|..+... ..+....+-..+....++. .+...+.++.....+........ ... T Consensus 426 --~-~~-~~~~----~~al~~~~~~-G~is~~t~~~~L~~~~v~~~~~~~e~e~i~~~~~~~~~~~~~~~~~~~~~gg~~ 496 (501) T protein:vir:95 426 --A-RM-TPDE----RRSLVEEWQK-GAITFEEMRTGLRKAGVATEDDSKAKEKIAKDTAEAMALATPANVPGDGSGGDN 496 (501) T ss_pred --c-cC-CHHH----HHHHHHHHhC-CCCcHHHHHHHHHhCCCCChhHHHHHHHHHhhhcCcccccccCCCCCCCccccc Confidence 0 00 0111 1222222221 1112222222233334432 23333434332221111100000 000 Q ss_pred hhHHH Q lcl|Aclame:pro 622 EDMPE 626 (711) Q Consensus 622 ~~~~~ 626 (711) -.-.+ T Consensus 497 ~~~~~ 501 (501) T protein:vir:95 497 VGNSE 501 (501) T ss_pred ccCCC Confidence 00000 No 142 >protein:vir:97265 Length: 513 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:584 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294521;genbank:gi:149408242;genbank:GeneID:5237130 Probab=94.89 E-value=0.0033 Score=34.22 Aligned_cols=470 Identities=11% Similarity=0.035 Sum_probs=179.8 Q ss_pred CCcCCCCCCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCCc Q lcl|Aclame:pro 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRPC 80 (711) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p~ 80 (711) |+-++++++-.+ .++-...+.+. +.++.+.......|... ..|.. +|+.+....-+.+-.-. T Consensus 1 m~~~~~~~v~~~------------h~~y~a~~~~W-~~ird~~~G~~~~r~~g---~~YLP--k~~~E~~~~Y~~rl~rA 62 (513) T protein:vir:97 1 MADKDPKSPATT------------SGAYDQMLPRW-HVIETLLGGTEAMREAG---ETYLP--RHQEETDKGYQERLASA 62 (513) T ss_pred CCCCCCCCCCcC------------CHHHHHHHHHH-HHHHHHhcChHHHHhhc---ccCCC--CCCCCCHHHHHHHHhcc Confidence 666555442211 11111111111 11222222222222111 12221 34444444444444456 Q ss_pred eEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHH-HHHHHHH-hhcCHHHH Q lcl|Aclame:pro 81 LVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFT-GLIKNIE-YNCDAETE 158 (711) Q Consensus 81 ~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~-~~~~~~~-~~~~~~~~ 158 (711) ..+|.++.+|+.++|..=+..+.+. .+....+. .++..+- +-++.... T Consensus 63 ~~~n~~~~tl~~l~G~vf~k~p~~~------------------------------~~~p~~~~~~l~~d~D~~G~~L~~f 112 (513) T protein:vir:97 63 VLLNMVEQTLDTLSGKPFSEPIKLN------------------------------EDVPKAIEETILPDVDLQGNNLDVF 112 (513) T ss_pred cCCChHHHHHHHHhhhhhhcCcccC------------------------------cCchHHHHHHHhhccCCCCCCHHHH Confidence 7799999999999997543322110 11222233 2444433 45678999 Q ss_pred HHHHHHHHHhcCccEEEEEEeeccCCC--------------CCCcceEEEecCccceeeCCCccccCc-cccceeeeeec Q lcl|Aclame:pro 159 YDIAFQGAVESGMGYLRVRSDYLADDS--------------FEQDLIIEAIQNQFSVTIDPDAKKRDR-SDMNWCLIDDT 223 (711) Q Consensus 159 ~~~a~~~~~~~G~g~~~v~~d~~~~~~--------------~~~~i~i~~v~~~~~v~~Dp~a~~~d~-~Da~~~~~~~~ 223 (711) ...++..++.+|.+++= .||-...+ ..-.+.+..| .|.+|+ +......+. ....++..+.- T Consensus 113 ~~~~~~~~l~~G~~~il--VD~P~~~~~~~~~~~T~Ade~~~~~rPy~~~~-~~e~Ii-nW~~~~v~G~~~L~~v~l~E~ 188 (513) T protein:vir:97 113 ARQWFREGMAKALCHVL--IDMPRPAPREDGQPRTLADDRREGLRPYWVMI-KPECLL-FARSEVINGVEVLQHVRIIEH 188 (513) T ss_pred HHHHHHHHHhcCeEEEE--EecCCCCCccchhHHhHHHHHhhccCceEEEe-cHhhhc-CcceeccCcceeeeeEEEEEE Confidence 99999999999987753 34421111 1113556666 566665 332222221 01112211110 Q ss_pred CCHHHHHHhcCCcccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhh Q lcl|Aclame:pro 224 MSKEKFKALYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVR 303 (711) Q Consensus 224 ~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 303 (711) .. + .+. +..+.+. +||. +..|.+..+..... +... T Consensus 189 ~~---------~-------------~Dg-f~~~~~~----q~rv--------L~~g~~~v~r~~~~--------~~~~-- 223 (513) T protein:vir:97 189 YM---------E-------------QDG-FAEVCKR----RIRV--------LEPGLVQLWEPVKK--------SNAQ-- 223 (513) T ss_pred Ee---------e-------------cCC-CcceEEE----EEEE--------EeCceEEEEEeecC--------CCcc-- Confidence 00 0 000 1111111 1111 11111111000000 0000 Q ss_pred hcccceEEEEEEEEecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCC Q lcl|Aclame:pro 304 TRKVKTFKTYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKA 383 (711) Q Consensus 304 ~~~~~~~~v~~~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~ 383 (711) .+...+.....-..+.+|||+++.... +...+....-.+-.+....=...|-..+++..+..| T Consensus 224 --------------~~e~~~~~~g~~~l~~IP~v~~~~~~~---~~~~~~pPLl~LA~ln~~hy~~~Sd~~~il~~~~~P 286 (513) T protein:vir:97 224 --------------KEEWALADEWATGLNYVPLVTFYADRQ---GFMMGKPPLLDLAHLNVAHWQSASDQRHILTVSRFP 286 (513) T ss_pred --------------ccceEEecCCCCcCCceeEEEEecCCC---CCCCCccchHHHHHHHHHHHhhhhhHHHHHHhcccc Confidence 000011111112346788887764432 233344556677777766666777778888888888 Q ss_pred ceEecccccCChHHHHhh-cccCCCceEEecccccCcCCccccCCccch-HHHHHHHHHHHHHHHHHhCCCHHHhccccc Q lcl|Aclame:pro 384 PFIGSEGNVEGREDEWEQ-ANTKNFSLLTYIPQYQGDPGPRRQPPAAVP-AAELTLGQNSVEKIKSTMGMYDASLGAMGN 461 (711) Q Consensus 384 ~~~~~~~av~~~~~~~~~-~~~~~~~~i~~~~~~~~~~~i~~~~~~~~~-~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~ 461 (711) ..++. |.. +.+.+ ...-++.++.+ |+ .+..+.++.+..-+ .....-|....+.|. ..|.. .+...+. T Consensus 287 ~l~~~-G~~----~~~~~~i~iG~~~~~~l-pe--~~~~~~yie~~g~~i~~~~~~l~~le~qm~-~~Ga~--ll~~~~~ 355 (513) T protein:vir:97 287 ILACS-GAS----GEDSDPVVVGPNKVLYN-PD--PAGRFYYVEHTGQAIAAGRTDLKDLEEQMA-GYGAE--FLKRKTG 355 (513) T ss_pred eeeee-cCC----cCCCCceEeeccccccC-CC--CCCcceeeccCchhHHHHHHHHHHHHHHHH-HHHHH--hhccCCc Confidence 77774 321 11110 11112222222 21 11235555544222 222334444445553 34432 2222333 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeee Q lcl|Aclame:pro 462 ETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIH 541 (711) Q Consensus 462 ~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~n 541 (711) +.||.+......+....|..+..|+..++.. ++.++..|...+ ...--|.+|. T Consensus 356 ~~Ta~a~~~~~~~~~S~L~~~a~~le~al~~----~l~~~a~wlg~~---------~~~~~v~in~-------------- 408 (513) T protein:vir:97 356 GQTATARALDSAEATSDLSAMTGLFEDALAQ----ALDITADWLRLG---------PNGGTVELVK-------------- 408 (513) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHhCCC---------CCccEEEecc-------------- Confidence 5788888877777777888888877766554 455556664311 0000122222 Q ss_pred hhhheeeeEEeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCC--c--c----hHHHHHHHHhhhcchhh- Q lcl|Aclame:pro 542 DLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDW--P--G----ADVIAERLKKIVPPNVL- 612 (711) Q Consensus 542 D~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~--~--~----~~e~~~~l~~~~~~~~~- 612 (711) .|+. .. -.+. ...+|.++... ..+....+-..++...+ | . .+++.+++......... T Consensus 409 -----dF~~--~~--~~~~----~~~al~~a~~~-G~is~~t~~~~L~r~gvl~~d~d~~~~~e~~~~~~~~~~~~~~~d 474 (513) T protein:vir:97 409 -----DYDL--EE--MDAP----GLQALQVAREK-RDISRKTYLNGLRLRGVLPEDFDEDEDWEELMEEISEAMGRAGLD 474 (513) T ss_pred -----ccCc--cc--CCHH----HHHHHHHHHhC-CCCCHHHHHHHHHhccCCCccCCHHHHHHHHHHhhhhccCCCCcc Confidence 1211 00 0011 11122222211 00111111111111111 1 1 12333333322111000 Q ss_pred ----cchhhhhhh---hhHHHH---H---HH--HHHHHH Q lcl|Aclame:pro 613 ----SKDEREAIE---EDMPEQ---T---EP--TPEQQV 636 (711) Q Consensus 613 ----~~~~~~~~~---~~~~~~---q---~~--~~~~q~ 636 (711) ...+....+ ....+- + .. .+.-+. T Consensus 475 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) T protein:vir:97 475 LDPAQKNPPEGGEGEGEGEGEGGEGGEGGEGGGNPGGES 513 (513) T ss_pred ccccCCCCCCCCCCCCCCCCCCCCCCCccccCCCCCCCC Confidence 000000000 000000 0 00 000000 No 143 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=94.30 E-value=0.0048 Score=33.30 Aligned_cols=489 Identities=13% Similarity=0.076 Sum_probs=174.2 Q ss_pred CCcCC--------CCCCCCcccCCCc---ccCCcCcchHHHHHHHHHH---HHHHHHhhchHHHHHHHHHHHHh---CCC Q lcl|Aclame:pro 1 MAKKQ--------KKSRVEQLYAKKA---KVYAKNNDDDRALLATARE---RARDGATYWKDNWEAAEDDLKFL---GGE 63 (711) Q Consensus 1 ~~~~~--------~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~r~~~~~~~~~y---~G~ 63 (711) ||+|. -++.+.|---|.+ ..++ +-....-.+..+.. .++.++.....+|... ..|. .+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~-dV~~~hp~y~a~~~~W~~ird~~~G~~~~r~~g---~~YLP~~~~~ 76 (535) T protein:vir:80 1 MARKRTTIRRDVQSKVLIPPQAPPTSGLGPSLP-NVGYQRVEFGEMLPKWRKIMDCLSGQEAIKAKR---EEYLPMPSVD 76 (535) T ss_pred CCcchhhhhhhhhhhcccCCCCcCCCCCCCCCC-CCCcCCHHHHHHHHHHHHHHHHhcChHHHHhcc---cccCCCCCcc Confidence 88876 2233333211111 1222 10000001122211 2233333333333211 1232 233 Q ss_pred CCCHHHHHHHHHhCCCceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHH Q lcl|Aclame:pro 64 QWPSQVRTERELEQRPCLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFT 143 (711) Q Consensus 64 Qw~~~~~~~~~~~g~p~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~ 143 (711) +-+.+-.+.-+.+-.-...+|.++.+|+.++|..=+..+.+. +-..|. T Consensus 77 ~~~~E~~~~Y~~rl~rA~~~n~~~~tl~~l~G~vfrk~p~~~--------------------------------~p~~l~ 124 (535) T protein:vir:80 77 SRDEEQRRRYETYLQRAIFYNVTARTLDGMMGQVFSRDPIRQ--------------------------------LPPALE 124 (535) T ss_pred cCCcCCHHHHHHHHhhccCCChhHHHHHHHhchhhcCCccee--------------------------------ccHHHH Confidence 332222222333333367799999999999998443323222 123355 Q ss_pred HHHHHHH-hhcCHHHHHHHHHHHHHhcCccEEEEEEeeccCCC---------CCCcceEEEecCccceeeCCCccccC-c Q lcl|Aclame:pro 144 GLIKNIE-YNCDAETEYDIAFQGAVESGMGYLRVRSDYLADDS---------FEQDLIIEAIQNQFSVTIDPDAKKRD-R 212 (711) Q Consensus 144 ~~~~~~~-~~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~~---------~~~~i~i~~v~~~~~v~~Dp~a~~~d-~ 212 (711) .++..+- +-++.......++..++.+|.+++=| |+-.... ....+.+..| +|.+|+ +......+ . T Consensus 125 ~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~iLV--D~P~~~~~~t~ade~~~~~rPy~~~y-~ae~Ii-nW~~~~v~G~ 200 (535) T protein:vir:80 125 AIVEDIDGEGVSLDQQAKKALGYTMGFGRAAIFT--DYPNVGRPVTVLEQKLGLYRPTITLV-HPTSII-NWRTKLVGGK 200 (535) T ss_pred HHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEE--eecCCCCcccHHHHHhcCCCcEEEEe-chhhcc-CccccccCCc Confidence 5555553 34678999999999999999887544 4321111 1234667777 576766 33222222 1 Q ss_pred cccceeeeeecCCHHHHHHhcCCcccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEE---EecCcch Q lcl|Aclame:pro 213 SDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSF---WLDALED 289 (711) Q Consensus 213 ~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~---~~~~~~~ 289 (711) ....++..+.....++ +.+.......+||.+.- .+|.+. +..+... T Consensus 201 ~~Lt~v~lrE~~~~~d-------------------d~f~~~~~~q~RvL~~~------------~~G~y~v~~~~~~~~~ 249 (535) T protein:vir:80 201 SVISLVVIQENVLAQD-------------------DGFETTYVQQWRVLQLN------------AEGNYQVERWRRETQE 249 (535) T ss_pred cceeEEEEEEEEEecC-------------------CCcccceeEEEEEEEec------------CCceEEEEEEEeecCC Confidence 1222332222111000 00111111223332210 011100 0000000 Q ss_pred hHHHHHhcCchhhhhcccceEEEEEEEEecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHH Q lcl|Aclame:pro 290 IVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYW 369 (711) Q Consensus 290 ~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~ 369 (711) +.. ...+..+..+....+.+.+|||+|.... -+...+......+-.++...=.. T Consensus 250 --------~~~---------------~~~~~~~~~~~g~~~l~~IPfv~~~~~~---~~~~~~~pPLl~LA~lni~Hy~~ 303 (535) T protein:vir:80 250 --------EMY---------------YSYSKHVPTDGNGNPFKEIPFQFIGPLD---NNADIDHPPLLDLCEVNIGHYRN 303 (535) T ss_pred --------ccc---------------cccceeecccCCCcccCeeEEEEeecCC---CCCCCCccchHHHHHHHHHHhhc Confidence 000 0000011111111234566776543221 12223333456777776665555 Q ss_pred HHHHHHHHHhcCCCceEecccccCChHHHHhhcccCCCc-------eEEecccccCcCCccccCCccchHHHHHHHHHHH Q lcl|Aclame:pro 370 DSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNFS-------LLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSV 442 (711) Q Consensus 370 ~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~~~~~~~~~-------~i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~ 442 (711) .+-..+++..+..|..++. |. ++.+...+..+.+ .+.+-. ..+.++-.+.+..++.. .|+... T Consensus 304 ssd~~~il~~~~~P~l~i~-G~----~~~~~~~~~~~~~i~iG~~~~~~lP~--~~~~~~~e~~~~~~a~~---~l~~~e 373 (535) T protein:vir:80 304 SADYEEMAFVAGQPTAFFT-GL----TKDWVEDVFKDFKVHLGSRAIIPLPQ--GATAGILQITPNSVPFE---AMTHKE 373 (535) T ss_pred hhHHHHHHHHhcCceeeee-cC----chhhhhcCCCCcceEecCcccccCCC--CCCcceeeeccchhHHH---HHHHHH Confidence 5666777777777766653 32 2333222222222 121111 11223334444555433 344444 Q ss_pred HHHHHHhCCCHHHhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchh Q lcl|Aclame:pro 443 EKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDF 522 (711) Q Consensus 443 ~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~ 522 (711) +.|.. .|..- +...+.+.|+.+......+....|..+..|++.++. .+|.++..|.... . ..++.+ T Consensus 374 ~qM~~-lGa~l--l~~~~~~~Ta~~a~~~~~~~~S~L~~~a~~le~al~----~aL~~~A~w~G~~-----~-~~~~~~- 439 (535) T protein:vir:80 374 SQMIA-MGANL--LVKSGGNRTFGEAQQEEASEQSILSACTKNVSMAFR----KALRWANQFQTGI-----V-NDETVE- 439 (535) T ss_pred HHHHH-HHHHh--hccCcccccHHHHHHHHHHHhHHHHHHHHHHHHHHH----HHHHHHHHHcCCc-----c-CCCceE- Confidence 44443 23222 222222333333333344444556777777766654 4555666664210 0 000000 Q ss_pred eecchhhhhhhccceeeeehhhheeeeEEeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCC--cc--hHH Q lcl|Aclame:pro 523 VKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDW--PG--ADV 598 (711) Q Consensus 523 v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~--~~--~~e 598 (711) +.+|. .|. . .. -.+.. .++|..+.+. ..+....+-..++...+ +. .++ T Consensus 440 i~~n~-------------------dF~--~-~~-ld~~~----~~all~~~~~-G~Is~et~~~~L~r~gvl~~~~~~ee 491 (535) T protein:vir:80 440 YNLNT-------------------DFP--A-AR-LTPNE----RAELILEWQQ-GAITFKEMRAGLRRAGVASEDDAKAE 491 (535) T ss_pred EEecc-------------------ccc--c-cc-CCHHH----HHHHHHHHhc-CCCCHHHHHHHHHhCCCCCcccchHH Confidence 11111 010 0 00 00111 1122222221 00111111111222222 11 122 Q ss_pred HHHHHHhh----hcchhhcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 599 IAERLKKI----VPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEADTAQAQA 657 (711) Q Consensus 599 ~~~~l~~~----~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~~~~k~qae~~~aqa 657 (711) ...++... ....+...+...........-- ....+.++. . T Consensus 492 e~~ri~~E~~~~~~~~g~~~d~~~~g~~~~~~~~----------~~~~~~~~~---------~ 535 (535) T protein:vir:80 492 TEGKATVEFIAKTAAAGKVGDAASGGTNKAKLNN----------GNGGGNQAG---------N 535 (535) T ss_pred HHHHHHhhhhhccccCCCCCCCCCCCCCcCcccC----------CccccccCC---------C Confidence 22222221 1111111110000000000000 000000000 0 No 144 >protein:vir:104437 Length: 714 # NCBI annotation: putative phage portal protein # Family: family:all:487 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794061;genbank:gi:116222006;genbank:GeneID:4397502 Probab=94.02 E-value=0.0056 Score=32.92 Aligned_cols=600 Identities=11% Similarity=0.020 Sum_probs=162.9 Q ss_pred CCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCH---HHHHHHHHhCCCceEehhhHHHHH Q lcl|Aclame:pro 15 AKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPS---QVRTERELEQRPCLVNNVLPTFVD 91 (711) Q Consensus 15 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~---~~~~~~~~~g~p~~~~N~i~~~v~ 91 (711) -......+...++..+ -.++..+ .+. .|..++.... -|-. .+..+. .| |...+.+. T Consensus 1 ~~~~~~~~~~~~~~~~-~~~~~~~---~l~---~~~~~~~~~~------~~r~~a~~d~~fy--~G------~Qw~~~~~ 59 (714) T protein:vir:10 1 MKNEINTTAMKNDHGS-TPRFSQR---QLL---SLCSDIDSQP------LWRDAANKACAYY--DG------DQLAPEVI 59 (714) T ss_pred CCcCcCcccCCCcchh-hhhhhHH---HHH---HHHHHHhhhH------HHHHHHHHHHHhh--cC------CCCCHHHH Confidence 1222222223333332 1222211 111 1221111101 1211 111111 23 33344555 Q ss_pred HHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHH-HHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhcC Q lcl|Aclame:pro 92 QVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYE-LAEVFTGLIKNIEYNCDAETEYDIAFQGAVESG 170 (711) Q Consensus 92 ~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~-~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G 170 (711) .++-...+-...+...+. ......+..-...+-....+.+.. ..+.+.+++.......-.......+..++..+| T Consensus 60 ~~l~~~g~p~~~~N~i~~----~v~~v~g~~~~nr~~~~v~pr~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~ 135 (714) T protein:vir:10 60 QVLKDRGQPMTIHNLIAP----TVDGVLGMEAKTRTDLIVMSDDPNDETEKLAEAINAEFADACRLGNMNKARSDAYAEQ 135 (714) T ss_pred HHHHhcCCCcEEeccHHH----HHHHHHHHHHhCCcceEEecCCCChhhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHh Confidence 555333322222222221 111112222222222233343322 223344555555444434444555666666666 Q ss_pred ccEEEEEEeeccCCCCCCcceEEEecCccceeeCCCcc-------ccCccccceeeeeecCCHHHHHHhcCCccc--chh Q lcl|Aclame:pro 171 MGYLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAK-------KRDRSDMNWCLIDDTMSKEKFKALYPDATA--EPV 241 (711) Q Consensus 171 ~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~~Dp~a~-------~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~--~~~ 241 (711) +.+= ++ |.++++|++.. ..|..+.-|=-..+..++++.+-.|--.-. +.+ T Consensus 136 ~~~G-~G--------------------~~~~~~d~d~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~ 194 (714) T protein:vir:10 136 IKAG-LS--------------------WVEVRRNSEPFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEA 194 (714) T ss_pred hhcc-cc--------------------eEEeeeccCCCCCCeEEEecChhheeeccccccCChhhhhhhhhhccCCHHHH Confidence 5431 22 22333443211 111111101011122233333222100000 000 Q ss_pred hcccccccccCCCCCeEE-EEEeeeeeeecee--------EEEccCCcEEEecCcchhHHHHHhcCchhhhhcccceEEE Q lcl|Aclame:pro 242 YEDSVADYDTWFTEKSVR-VSEYFTREPVIRE--------IALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKT 312 (711) Q Consensus 242 ~~~~~~~~~~~~~~~~v~-v~E~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v 312 (711) ... +... .+.+. ....|........ ...+...+.+... ...+... ..+.+....+ T Consensus 195 ~~~-----fp~~-a~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~-----~~~rV~v~E~ 258 (714) T protein:vir:10 195 KAT-----FPGM-AQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQ-----QNEWLQR-----ERRRVLLQVV 258 (714) T ss_pred HHh-----cCCc-hhhhhccchhhcCcccchhhhhhcccccccchhhcccccc-----ccccccc-----CcceEEEEEE Confidence 000 0000 00000 0000100000000 0000000000000 0000000 0000000000 Q ss_pred E-------EE--EEecCceeccCcc--------------------------------CCCCc--cce--EEEEeeeeccC Q lcl|Aclame:pro 313 Y-------WR--KITGANVLEGPVE--------------------------------IPSTT--IPV--IPVWGKSLIIK 347 (711) Q Consensus 313 ~-------~~--~~~g~~~le~~~p--------------------------------~~~~~--~P~--vp~~~~~~~~~ 347 (711) | .+ ..+|..+..++.. ...+. ||+ +|++++ T Consensus 259 w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~p~~~fp~vP~----- 333 (714) T protein:vir:10 259 YYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPCSAPQGMFPLVPF----- 333 (714) T ss_pred EEeEEEEEEeecCCCCCeeeeCccCHHHHHHHHhccceecccceeeEEEEEEecchhhhcCCCCCCCCceeeEEe----- Confidence 0 00 1122222111100 01111 221 232211 Q ss_pred CcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCc----eEecccccCChHHHHhhcccCCCceEEec-----ccccC Q lcl|Aclame:pro 348 KKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAP----FIGSEGNVEGREDEWEQANTKNFSLLTYI-----PQYQG 418 (711) Q Consensus 348 ~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~----~~~~~~av~~~~~~~~~~~~~~~~~i~~~-----~~~~~ 418 (711) +|..+.. ....--+++.+.++-...++.. +++....+ ....|++.... ....+ T Consensus 334 -----~g~~~~~---~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l~~~~~----------~~~~gav~~~d~~~~e~~~rp 395 (714) T protein:vir:10 334 -----WGYRKDK---TGEPYGLISRAIPAQDEVNFRRIKLTWLLQAKRV----------IMDEDATQLSDNDLMEQLERP 395 (714) T ss_pred -----cceeeec---cCccceehhhhhhHHHHHHHHHHHHHHHHhCCce----------eeccccccccHHHHHHhccCC Confidence 2221111 1111123333333321111100 00100000 00011111000 00011 Q ss_pred cCCccc-------------cCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 419 DPGPRR-------------QPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDN 485 (711) Q Consensus 419 ~~~i~~-------------~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn 485 (711) +..+.+ +.+.+.++-....++........+--++..+....|..+++.+-.+....-......+.. T Consensus 396 ~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAI~~r~~qg~~~l~~- 474 (714) T protein:vir:10 396 DGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQGATTLAE- 474 (714) T ss_pred CCeEEecccccccCCccccccccCCCCCcHHHHHHHHHHHHHHHHhhCCCHHHcCCCcchhHHHHHHHHHHHHHHHHHH- Confidence 111111 122222233344555555555444433222222223333333332222222222222222 Q ss_pred HHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhh---eeeeEE---eecccCh- Q lcl|Aclame:pro 486 LTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNV---QKYDVV---VTTGPAF- 558 (711) Q Consensus 486 ~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~---~~~dv~---v~~~~~~- 558 (711) +-..++...+.+-+++..+.- .+.+ ....+.|-...-....-..+.+|.... .-.|+. .++.... T Consensus 475 ~~dnl~~~~~~~g~~ll~li~-----~~~~---~~rv~RI~~e~~~~~~~~~~~~n~~~~~~~~~nDi~~~~~dv~i~~~ 546 (714) T protein:vir:10 475 INDNYQFACQQVGRLLLAYLL-----DDLK---KRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPV 546 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHH-----HHcC---CCcEEEEeccCCCcccceeEeeccccCCccccccceeeeEEEEEeec Confidence 222233333334444444321 1111 122233321111111112233332111 112332 2222222 Q ss_pred HHHHHHHHHHHHHHHhhcchhHHHHHHHHHHh-cCCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHHHHHHHHH Q lcl|Aclame:pro 559 ATQRIEAAEAMIQFAQAVPSAAAVMADLIAQN-MDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVE 637 (711) Q Consensus 559 ~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~-~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~ 637 (711) ++.-....+.+..+.+.++.+.+.+...++.. ++.-+.....+.++......++..+.... ++.+++ .+..+.+++ T Consensus 547 p~~~s~r~~~~~~l~ql~~~~~p~~~~~~~~~~le~~d~p~~~ei~~~ir~~~~~~~~~~~~-~~e~q~--~q~~~~~~~ 623 (714) T protein:vir:10 547 QQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEM-TPEEQE--VAAQQQALQ 623 (714) T ss_pred cCcHHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCcCHHHHHHHHHHHcCCCCCcccc-CcchhH--HHHHHHHHH Confidence 22323344444455555444433333322221 11222212222233333222222221211 111222 222223334 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHhhhccC Q lcl|Aclame:pro 638 MAKSQADMAQAEADTAQAQADMLKAQLETEEAQKQLAMIEDMAQGGDVVYQQVRELVAQALAE--ITASQANVTEQ 711 (711) Q Consensus 638 ~~~~q~~~~k~qae~~~aqae~~~~q~~~~~~q~q~~~~~~~~q~~~~~~~~~~~~~~~~~~e--~~~~qa~~e~Q 711 (711) .++++++..+++++.++.+++..++++.+.+.+.+++.....++......+..+..++..... -.+...++.+| T Consensus 624 ~~q~~l~~~e~~a~~~k~eaea~~~~aqa~~~~~~a~~~~~~~~~q~~~~~~~~a~~a~~l~~~~~~~q~~~~~~q 699 (714) T protein:vir:10 624 QQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQNMEQEQDVLQQ 699 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHH Confidence 445566666777777777777777666555444444332222222111111122222111111 11223333333 No 145 >protein:vir:93630 Length: 776 # NCBI annotation: Bcep22gp51 # Family: family:all:487 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944280;genbank:gi:38640357;genbank:GeneID:2658279 Probab=92.51 E-value=0.011 Score=31.30 Aligned_cols=630 Identities=11% Similarity=0.009 Sum_probs=153.2 Q ss_pred CCcCC-----CCCCCCcccCCCcc----cCCcCcchHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCH---H Q lcl|Aclame:pro 1 MAKKQ-----KKSRVEQLYAKKAK----VYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPS---Q 68 (711) Q Consensus 1 ~~~~~-----~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~---~ 68 (711) .+.++ +.++-+..-.++.. .+..+..++.+... .|++... .+..++..+. -|-. + T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~l~~-------~~~~~~~~~~--~~r~~a~~ 70 (776) T protein:vir:93 4 LNDKDSTQLVPARTDEGELSPGEDAAQREKPANPLDSEQAVE----LHSRLLS-------YYRQELSRQQ--DNRAEMAV 70 (776) T ss_pred ccccccccccccccccccCCCCCcccchhcccCCCCCHHHHH----HHHHHHH-------HHHHHHhhch--HHHHHHHH Confidence 22222 11111111111111 11111112222221 2222211 1111111111 1211 1 Q ss_pred HHHHHHHhCCCceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHH Q lcl|Aclame:pro 69 VRTERELEQRPCLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKN 148 (711) Q Consensus 69 ~~~~~~~~g~p~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~ 148 (711) +..+. .|. .....+..++-...+....+.....- .....+..-...+.....+.+..- ..+.+++.. T Consensus 71 d~~fy--~G~------Qw~~~~~~~l~~~g~p~~~~N~i~~~----i~~v~g~~~~nr~~~~~~p~~~~d-~~~Ae~l~~ 137 (776) T protein:vir:93 71 DEDYY--DNI------QWSQDEIDELKERGQAPTVYNVISQS----VNWIIGSEKRGRSDFKVLPRRKDG-GKAAERKTA 137 (776) T ss_pred HHHHh--CCC------CCCHHHHHHHHhcCCceEEecchHHH----HHHHHHHHHhCCcceEEecCChhH-HHHHHHHHH Confidence 11121 232 22333433332222111112221111 111112212222233345654433 334555555 Q ss_pred HHhhcCHHHHHHHHHHHHHhcCccEEEEEEeeccCCCCCCcceEEEecCccceeeCCCccc-------cCccccceeeee Q lcl|Aclame:pro 149 IEYNCDAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKK-------RDRSDMNWCLID 221 (711) Q Consensus 149 ~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i~i~~v~~~~~v~~Dp~a~~-------~d~~Da~~~~~~ 221 (711) +....-.......++.++...++.+ .++ |.+++||++... .+..+.-|=-.. T Consensus 138 ~~~~~~~~~~~~~~~~~af~d~~~~-G~G--------------------~~~v~~d~~~~~~~~~~~~~~p~~i~~Dp~a 196 (776) T protein:vir:93 138 LLKYLSDVNHTPFERSMAFEETTKA-GIG--------------------WLESQVQDENDGEPIYAGAESWRNILWDSTY 196 (776) T ss_pred HHHHHHHhhcHHHHHHHHHHHhhhc-Ccc--------------------eEEEEeeccCCCCceEeeccChhheeecccc Confidence 4443323334444555554444322 111 223444432210 000000000011 Q ss_pred ecCCHHHHHHhcCCccc--chh----hccc-ccc---c---ccC--CCCCeEEEEE-------------eeeeeeeceeE Q lcl|Aclame:pro 222 DTMSKEKFKALYPDATA--EPV----YEDS-VAD---Y---DTW--FTEKSVRVSE-------------YFTREPVIREI 273 (711) Q Consensus 222 ~~~~~~e~~~~~p~~~~--~~~----~~~~-~~~---~---~~~--~~~~~v~v~E-------------~~~~~~~~~~~ 273 (711) +..++++++-.|-..-. +.+ .... ... . ..| .+........ .|+.. ...+| T Consensus 197 ~~~D~sDar~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~v 275 (776) T protein:vir:93 197 RRLDMDDCRYIFRVKWVDLDVMLAIFPERAAQLRAAAVDNFETWGTDDIDGDDAMDSPEYERSMNSVTAGAVAY-ARKRV 275 (776) T ss_pred ccCCHHHHhhhhhhccCCHHHHHHhcCCchHHHHHhhhhcccccchhccccccccccccccccccccccccccc-CCCeE Confidence 22233333222211000 000 0000 000 0 000 0000000000 00000 00000 Q ss_pred EEccCCcEEEecCcch------------hHHHHHhcCchhhhhcccceEEEEEEEEecCceeccCccCCCCccce----E Q lcl|Aclame:pro 274 ALLSDGRSFWLDALED------------IVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGPVEIPSTTIPV----I 337 (711) Q Consensus 274 ~~~~~~~~~~~~~~~~------------~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~le~~~p~~~~~~P~----v 337 (711) ...+. |+....... ..+.........+....+..++.....+... +..+..-+..+.-|| + T Consensus 276 ~v~E~--~~r~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~g~~~~~~~~~~~v~~~-~~~g~~~l~~~~~p~~~~~~ 352 (776) T protein:vir:93 276 RMIEA--WFRMPVRVQRLKGRNSDFRGEVFDPNDERHVLEVESGRAVLAVSPMMRMHCA-IMTTRDLMWAGPSPYRHNRY 352 (776) T ss_pred EEEEE--EEeeeeehhhcccccccccceeecccchHHHHHhhcCceeehheeeeeeEEE-EEecchhhhccCCCCCCCcc Confidence 00000 000000000 0000000000000000111111111111111 111222223333332 3 Q ss_pred EEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceEecccccCChHHHHhhcccCCCceEEeccccc Q lcl|Aclame:pro 338 PVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNFSLLTYIPQYQ 417 (711) Q Consensus 338 p~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~~~~av~~~~~~~~~~~~~~~~~i~~~~~~~ 417 (711) ||++.+ .-....-|+...+.+.=...-...+...-.+ ..++..+.+...++.+......-. -..+++.. T Consensus 353 Pfv~~~---~~~~~~~~~~~G~v~~~~d~Q~~~N~~~s~~------~~~l~~~~~~~~~gav~~~d~~~~--~~~rp~~v 421 (776) T protein:vir:93 353 PFTPIW---GFRRARDGMPYGVIRFMRGMQDDVNKRLSKA------LYILSTNKVLMEEGAVDDIDEFRR--EAARPDAV 421 (776) T ss_pred ceEEec---CceecccccccchHHhhhHHHHHHHHHHHHH------HHhhcCCceeeccccccchHHHHH--hcccCCce Confidence 443221 1111113333344444444444444332222 112222222211111110000000 00122211 Q ss_pred C---cCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 418 G---DPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVG 494 (711) Q Consensus 418 ~---~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~ 494 (711) . .+....+.....++-...++++.......+..++..+-...|..+++.+..+...........+.. +.+.+.+.. T Consensus 422 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~tGi~~~~~G~~~n~~Sg~ai~~~~~~~~~~~~~-~~dn~~~~~ 500 (776) T protein:vir:93 422 MTVKNGKLGAVKMDVDRDLAPAHLELASRSIQMIQQVGGVTDEMLGRTTNAVSGVAIQARQEQGSVATNK-LFDNLRLAF 500 (776) T ss_pred eeeCCccccccccccCcCccHHHHHHHHHHHHHHHHhhCcChHHhCCCcchhhHHHHHHHHHHHHHHHHH-HHHHHHHHH Confidence 1 112223333333444555666666666666555444444444445444433333333222222222 222233333 Q ss_pred HHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeeehhhheeeeEEe---ecccChHHHHHH-HHHHHH Q lcl|Aclame:pro 495 KILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVV---TTGPAFATQRIE-AAEAMI 570 (711) Q Consensus 495 ~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~nD~~~~~~dv~v---~~~~~~~s~r~~-~~~~L~ 570 (711) +.+.+++....- ... .....+.|.... .+.+++.+|+ .....|+.+ +...+....... ..+++. T Consensus 501 ~~~~~~~l~li~-----~~~---~~~r~~ri~~~~---~~~~~v~in~-~~~~nd~~~~~~dv~v~~~~~~~s~r~~~~~ 568 (776) T protein:vir:93 501 QQHGEKELSLIE-----QYM---TEEKQFRITNSR---GNPEYVTVND-GLPENDITRTKADFIIDEAEWRATMRQAAVA 568 (776) T ss_pred HHHHHHHHHHHH-----Hhc---CcceEEEEeecC---CCcceEEecc-cchhhhhccceeeEEEeecccchhHHHHHHH Confidence 333443333321 111 122333332211 1123444443 112234433 333333333222 333333 Q ss_pred HHHhhcchhHHHHHHHHHHh-cCCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 571 QFAQAVPSAAAVMADLIAQN-MDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAE 649 (711) Q Consensus 571 ~l~~~~p~~~~~~~~~~~~~-~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~~~~k~q 649 (711) .+.+.++.+.+.+...+... +...+.....+.++.....+.+..+.+.. ..+.+++.++.+++..+++.+++.++ T Consensus 569 ~l~ql~~~~~p~~~~~~~~~~~e~~d~p~~~e~~~~l~~~~~~~~p~q~~----~~~e~~~~qq~q~~~~q~q~~~~~a~ 644 (776) T protein:vir:93 569 ELMEVIGKMPPEIALTMLDLLVENMDIPNRDELVKRIRAVNGQKDPDQDE----PTPEEIAREQAQQQQQQYNDALAIAT 644 (776) T ss_pred HHHHHHhhcChhhHHHHHHHHHHhcCccchHHHHHHHHHhhcccccchhh----cchhHHHHHHHhhHHHHHHHHHhhhh Confidence 33333333322222211111 11111111111111111111110000100 00011111112222222222222222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHhhhccC Q lcl|Aclame:pro 650 ADTAQAQADMLKAQLETEEAQKQLAMIEDMA--QGGDVVYQQVRELVAQALAEITASQANVTEQ 711 (711) Q Consensus 650 ae~~~aqae~~~~q~~~~~~q~q~~~~~~~~--q~~~~~~~~~~~~~~~~~~e~~~~qa~~e~Q 711 (711) ++ .++++..+.+++++.++.++.+....+ +..+....+++...........+..++...+ T Consensus 645 ~~--~~qa~a~~~~aea~~~~aqa~~~~~~a~~~~~~a~q~a~qa~~~~~~~~~~a~~a~~~~~ 706 (776) T protein:vir:93 645 LE--EQQAKARKAAAEAQVAEAKAKHISRMAIREGVGAVKDATDAATAIAFMPELAGLSDGILR 706 (776) T ss_pred hh--HhhHHHHHHHHHHHHHhhhhhhhhhcchhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhc Confidence 22 223333333333333222222211111 1111111111111000000000111111101 No 146 >protein:vir:105429 Length: 708 # NCBI annotation: gene 3 protein # Family: family:all:487 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958179;genbank:gi:41057281;genbank:GeneID:2716676 Probab=91.89 E-value=0.014 Score=30.79 Aligned_cols=631 Identities=12% Similarity=-0.021 Sum_probs=157.9 Q ss_pred HHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHHhCCCceEehhhHHHHHHHhhhhhh--ccccee Q lcl|Aclame:pro 28 DRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRPCLVNNVLPTFVDQVLGDQRQ--NRPAIK 105 (711) Q Consensus 28 ~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~v~~i~g~~~~--~r~~~~ 105 (711) -.+.+..++.+++..++...++..++.+... ++....--.| |...+.+..++-...+ +||-+. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~---------~D~~f~~~~G------~QW~~~~~~~l~~~~q~~grP~~~ 65 (708) T protein:vir:10 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCI---------EATRFARVPG------GQWEGATAAGTKLDEQFEKYPKFE 65 (708) T ss_pred CchhHHHHHHHHHHHHHHHHHhhHHHHHHHH---------HHHHhhcCCC------CCCCHHHHHHHHHhhhhcCCCceE Confidence 4444444444444333333333333332221 1111110012 4444555555533322 334322 Q ss_pred E--ecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCccEEEEEEeeccC Q lcl|Aclame:pro 106 V--SSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESGMGYLRVRSDYLAD 183 (711) Q Consensus 106 ~--~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~ 183 (711) | .+. ......+..-...+-....+.+.+.-..+.+++..+....-.......+..++..+++.+ .++|-.... T Consensus 66 ~N~i~~----~v~~v~g~~~~nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~d~i~~-G~Gw~~~~~ 140 (708) T protein:vir:10 66 INKVAT----ELNRIIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATG-GFGCFRLTS 140 (708) T ss_pred EcchHH----HHHHHHHHHHhCCcceEEEcCCCCchHHHHHHHHHHHHHHHHhcCchHHHHHHHHhhhhc-ccceeeeee Confidence 1 111 111222222222333333455544324456666666655555555666666666666543 222211111 Q ss_pred C-CCCCcceEEEecCccceeeCC-CccccCccccceeeeeecCCHHHHHHhcCCc-----cc-chhhcccccccccCCCC Q lcl|Aclame:pro 184 D-SFEQDLIIEAIQNQFSVTIDP-DAKKRDRSDMNWCLIDDTMSKEKFKALYPDA-----TA-EPVYEDSVADYDTWFTE 255 (711) Q Consensus 184 ~-~~~~~i~i~~v~~~~~v~~Dp-~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~-----~~-~~~~~~~~~~~~~~~~~ 255 (711) + ..+.++....-..+....+|| .+.=+| -..+..+.++.+-.|-.. .. ..+.... .....|... T Consensus 141 d~~~e~d~~~~~~~i~i~~~~~p~~~v~~D-------p~a~~~D~sDar~~~~~~~~~~d~~~~~~p~~a-~~~~d~~~~ 212 (708) T protein:vir:10 141 MLVNEYDPMDDRQRIAIEPIYDPSRSVWFD-------PDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKP-PTSLDVTSM 212 (708) T ss_pred ccccccCCCCCccccceEEeecchhhcccC-------ccccccChhhhhhhhhccCCCHHHHHHhCCCCc-ccccccccC Confidence 1 111111000000000111121 011111 111222344333222110 00 0010000 011111110 Q ss_pred CeEEEEEeeeeeeeceeEEEcc-----CCcEEEe-cCcchhHHH----HHhcCchhhhh---cccceEEEEEEEEecCce Q lcl|Aclame:pro 256 KSVRVSEYFTREPVIREIALLS-----DGRSFWL-DALEDIVDE----LLEAGISIVRT---RKVKTFKTYWRKITGANV 322 (711) Q Consensus 256 ~~v~v~E~~~~~~~~~~~~~~~-----~~~~~~~-~~~~~~~~~----~~~~g~~~~~~---~~~~~~~v~~~~~~g~~~ 322 (711) .. ...-|+.. ...++..+= ...++++ ++..-.... ........... ..+..+++.++.+.-..+ T Consensus 213 ~~--~~~~~~~~-d~v~v~ey~~r~~~~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~ 289 (708) T protein:vir:10 213 TS--WEYNWFGA-DVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVV 289 (708) T ss_pred CC--ccccccCC-CceEEEEeeeEEEEEEEEEEEecCCCCceeeecchhhhhHHHHHHhcccchhheeeeeeEEEEEEee Confidence 00 00001110 001110000 0000000 000000000 00000000000 011111111111111111 Q ss_pred eccCccC-CCCccceE--EEEeeeeccCCcccccchH--HH-hhHHHHHHHHHHHHHHHHHHh-cCCCceEecccccCCh Q lcl|Aclame:pro 323 LEGPVEI-PSTTIPVI--PVWGKSLIIKKKEIFRSII--RH-SKDAQRMANYWDSAATETVAL-APKAPFIGSEGNVEGR 395 (711) Q Consensus 323 le~~~p~-~~~~~P~v--p~~~~~~~~~~~~~~~g~v--~~-~~d~Q~~~N~~~s~~~~~l~~-~~~~~~~~~~~av~~~ 395 (711) . +..-. ..+.+|+- |++.++- ...+..|.. .. +++. +..=.+.+...--+.. .+..+....-.....+ T Consensus 290 ~-g~~~le~~~~~p~~~fP~vP~~g---~r~~~d~~~~~yG~vr~~-kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~~i 364 (708) T protein:vir:10 290 D-GDGFLEKPRRIPGEHIPLIPVYG---KRWFIDDIERVEGHIAKA-MDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQI 364 (708) T ss_pred c-chhhhccCCCCCCCceeeEEEee---eeeccCCCcccceeeccc-chhHHHHHHHHHHHHHHHHhcCCcccccChhhh Confidence 1 10000 11223321 2211110 000000100 00 1111 0000011110000000 0000000000000000 Q ss_pred HHHHhhcccCCCce-EEecccccCcCCccccCCc-------cchHHHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHH Q lcl|Aclame:pro 396 EDEWEQANTKNFSL-LTYIPQYQGDPGPRRQPPA-------AVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRA 467 (711) Q Consensus 396 ~~~~~~~~~~~~~~-i~~~~~~~~~~~i~~~~~~-------~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~a 467 (711) .. +...+...... -.+-........+...... +.+.-...+++.......++.-++..+.+..| ..|+++ T Consensus 365 ~~-~~~~~~~~~~~~~~~~~~~~~~~~~G~~~~~~~~~~~~q~~~~~~~~~~l~q~~~~~i~~vsG~~~~~lG-~~sn~S 442 (708) T protein:vir:10 365 RG-LEKHWEARNKKRPAFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQ-MPSNIA 442 (708) T ss_pred hh-HHHHHhhccccchhhhccccccccccccccccCCccccCCccchHHHHHHHHHHHHHHHHHhCcChhHcc-CccchH Confidence 00 00000000000 0000000001111111111 11122233555555555555443222222222 233322 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcchheecchhhhhhhccceeeee----hh Q lcl|Aclame:pro 468 IIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIH----DL 543 (711) Q Consensus 468 i~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~~v~~~~~~~~~~~g~~~~~n----D~ 543 (711) -.+....-......+. .+-.-++...+.+-+++..+.. .+. ..++.+.|.... .+-..+.+| |. T Consensus 443 G~aI~~rq~qg~~~l~-~~~Dnl~~~~~~~g~~lL~li~-----~~y---~~er~~RI~~ed---g~~~~v~in~~~~d~ 510 (708) T protein:vir:10 443 QETVNNLMNRADMASF-IYLDNMAKSLKRAGEVWLSMAR-----EVY---GSEREVRIVNED---GSDDIAVLSAQVVDR 510 (708) T ss_pred HHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHH-----HHc---CCCcEEEEecCC---CCcceEEecceeccC Confidence 2112111111111111 1111222222223333333321 111 122333332211 111223333 33 Q ss_pred hhe----eeeEEe---ecccChH-HHHHHHHHHHHHHHhhcchhHHHHH--HHHHH-hcCCcchHHHHHHHHhhhcchhh Q lcl|Aclame:pro 544 NVQ----KYDVVV---TTGPAFA-TQRIEAAEAMIQFAQAVPSAAAVMA--DLIAQ-NMDWPGADVIAERLKKIVPPNVL 612 (711) Q Consensus 544 ~~~----~~dv~v---~~~~~~~-s~r~~~~~~L~~l~~~~p~~~~~~~--~~~~~-~~~~~~~~e~~~~l~~~~~~~~~ 612 (711) ..+ .+|+++ ++..... +.-....+.+..|.+.++.+.+... ..++. .++.-......+.++.......+ T Consensus 511 ~~g~~~~~nDi~~g~yDv~i~~~p~~~s~r~~~~~~l~qll~~~~p~~~~~~~~~~~~l~~~D~p~~~ei~erir~~~~~ 590 (708) T protein:vir:10 511 QTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLI 590 (708) T ss_pred CCcceeeeeccceeeEEEEEecccCchhHHHHHHHHHHHHHHhcCCCchhhHHHHHHHHHhcCCcChHHHHHHHHHhhcc Confidence 333 245433 3333222 2222233333333333332222100 00000 00000001101111111111111 Q ss_pred cchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 613 SKDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQLETEEAQKQLAMIEDMAQGGDVVYQQVRE 692 (711) Q Consensus 613 ~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~~~~k~qae~~~aqae~~~~q~~~~~~q~q~~~~~~~~q~~~~~~~~~~~ 692 (711) .... .+..+.++++.+++++++..+++..+.+++++..++|+++.++++++.+.+....+.+.....+.+...++-. T Consensus 591 ~~~~---~~~~~ee~q~~~~~q~~~q~q~~~~~~e~qa~~~~~qAe~~ka~a~a~~~~~~a~q~~~~~~~a~~~a~q~~~ 667 (708) T protein:vir:10 591 SGIA---KPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLA 667 (708) T ss_pred cccc---cccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0000 1111222223333334445555566666677777777777777766655555444333222111111111100 Q ss_pred H-----HH--HHHHHHHHHHhhhccC Q lcl|Aclame:pro 693 L-----VA--QALAEITASQANVTEQ 711 (711) Q Consensus 693 ~-----~~--~~~~e~~~~qa~~e~Q 711 (711) + .. ...+++.......++| T Consensus 668 ~a~~~~~~~~~~~~q~l~~~q~~q~~ 693 (708) T protein:vir:10 668 QARNIDDKAVMEAIRLLKDVAESQQQ 693 (708) T ss_pred HHHHHHHHHHHHHHHHhhhhhhhHHH Confidence 0 00 0011111111111112 No 147 >protein:vir:96783 Length: 488 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224240;genbank:gi:62362375;genbank:GeneID:3345722 Probab=86.66 E-value=0.044 Score=28.02 Aligned_cols=452 Identities=12% Similarity=0.060 Sum_probs=158.6 Q ss_pred CCcCCCCCCCCcccCCCcccCCcCcchHHHHHHHHHHHHHHHHhhchHHH------HHHHHHHHHhCCCCCCHHHHHHHH Q lcl|Aclame:pro 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNW------EAAEDDLKFLGGEQWPSQVRTERE 74 (711) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r------~~~~~~~~~y~G~Qw~~~~~~~~~ 74 (711) |. -++.-+.+..-...=....+ -.....++ ....+-+.+- .--.....|+... .... .++ T Consensus 14 m~----V~~~hp~y~a~~~~W~~~~d----~g~~~~k~--~g~~YLPk~~~~~~~~~~d~~y~~~~~~~-~~~y-~~~-- 79 (488) T protein:vir:96 14 ML----TPIYHPDYLVNAPQWLRNLD----CVMDNIKR--KKQTYLPNLGAIPPEAKTDPKVTALAAKI-EKDW-EDL-- 79 (488) T ss_pred ec----ccccCHHHHHHhhhhhHhhh----hhhHHHHH--hhhhcCCCCCCccccccCcchhhhhhccc-hhhh-Hhh-- Confidence 33 11111111111100000000 00000000 0000110000 0000000000000 0000 000 Q ss_pred HhCCCceEehhhHHHHHHHhhhhhhcccceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHH-hhc Q lcl|Aclame:pro 75 LEQRPCLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIE-YNC 153 (711) Q Consensus 75 ~~g~p~~~~N~i~~~v~~i~g~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~-~~~ 153 (711) +.+-.+.+|..+.+++.++|..=+..|.+.. | . ...|..++..+. +-+ T Consensus 80 -~~~rA~~~n~~~~tl~~l~G~vfrk~p~~~~-~-------------------------~----~~~l~~l~~d~D~~G~ 128 (488) T protein:vir:96 80 -TWRLANYVNIVNPTMNAITGAVMRREPEFDT-M-------------------------D----NPVLIGLRDNIDGKGN 128 (488) T ss_pred -hhhccccCchhHHHHHHhcchhhccCceecc-C-------------------------C----cHHHHHHHhccCCCCC Confidence 1112345799999999999976544333321 1 1 112455666554 446 Q ss_pred CHHHHHHHHHHHHHhcCccEEEEEEeeccCCC-------CCCcceEEEecCccceeeCCCccccCc-cccceeeeeecCC Q lcl|Aclame:pro 154 DAETEYDIAFQGAVESGMGYLRVRSDYLADDS-------FEQDLIIEAIQNQFSVTIDPDAKKRDR-SDMNWCLIDDTMS 225 (711) Q Consensus 154 ~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~~-------~~~~i~i~~v~~~~~v~~Dp~a~~~d~-~Da~~~~~~~~~~ 225 (711) +.......++..++.+|.+++=| |+-.+.. ..-+|.+..| +|.+|+ |......+. ....++..+.-++ T Consensus 129 ~L~~f~~~~~~~~l~~G~~~ilV--D~P~~~~T~ade~~~~~rPy~~~~-~a~~Ii-nW~~~~v~G~~~L~~v~lrE~~~ 204 (488) T protein:vir:96 129 GIDQECKQALNALQWGSRCGWLV--RSHPESATMADWNKGKKLPTAAFY-DALHII-DWEVEYIDGEEKLTYLSLLEDYQ 204 (488) T ss_pred CHHHHHHHHHHHHHhcCeEEEEE--ecCCCcCCHHHHHHhcCCcEEEEe-chhhhc-CcceeccCCceeeEEEEEEEEEE Confidence 78999999999999999887544 4421111 1234677777 577766 333322221 1122222222111 Q ss_pred HHHHHHhcCCcccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccCCcEEEecCcchhHHHHHhcCchhhhhc Q lcl|Aclame:pro 226 KEKFKALYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTR 305 (711) Q Consensus 226 ~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~ 305 (711) ..+.+.+.+...+++.. ..+...++....++.. T Consensus 205 --------------------~~D~~~~~~~~~~~~~~---l~~g~~~v~~~~~~~~------------------------ 237 (488) T protein:vir:96 205 --------------------ERDGGTYVSKQRLINHR---LVDGLCEFQEVTDDEY------------------------ 237 (488) T ss_pred --------------------eccCCCcccceEEEEEE---EECcEEEEEEEecCCc------------------------ Confidence 00111112222222111 1111111111111100 Q ss_pred ccceEEEEEEEEecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCce Q lcl|Aclame:pro 306 KVKTFKTYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPF 385 (711) Q Consensus 306 ~~~~~~v~~~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s~~~~~l~~~~~~~~ 385 (711) ++.....+..--..+.+|||++..... +...+....-.+-.++...=...|-..+++..+.-+.+ T Consensus 238 ------------~~e~~~~~~g~~~l~~IP~v~~~~~~~---~~~~~~pPLldLA~lnl~Hy~~ssd~~~il~~~~~p~l 302 (488) T protein:vir:96 238 ------------SDEWTPVLINSKQSDTIPFFLASSQSN---EWCIDSTPLTSLAEISLSIYVMNAYSNKAMILANEAKW 302 (488) T ss_pred ------------ccceEeecCCCcccCeeEEEEEecCCC---CCCCCCCchHHHHHHHHHHHhhhhHHHHHHHhcCCcee Confidence 001111111111346677776543221 22222333445555544333334445555655555555 Q ss_pred EecccccCChHHHHhhcccCCCceEEec--ccccCcCCccccCCccchHHHHHHHHHHHHHHHHHhCCCHHHhccccchh Q lcl|Aclame:pro 386 IGSEGNVEGREDEWEQANTKNFSLLTYI--PQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNET 463 (711) Q Consensus 386 ~~~~~av~~~~~~~~~~~~~~~~~i~~~--~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~ 463 (711) +..-+.. ...+.... .+.++-... +.....+...+.++ +..+-..+-|+...+.|.. .|.. +...+.+. T Consensus 303 v~~~~~~---~~~~~~~~-~~~g~~~~~~~~~~~~~g~~~~~e~-~~~~l~~~~l~~l~~qm~~-~Ga~---l~~~~~~~ 373 (488) T protein:vir:96 303 MVDMGDM---NKTMASEM-NPLGFTLAGRMPYYVKNGDVKVIQA-QFSPETENKVEKLFEQAVK-VGAS---LFTQQSNE 373 (488) T ss_pred eeccCCC---Cccccccc-ccceeeecccccccccCCceeecCC-chhHHHHHHHHHHHHHHHH-HhHh---hccCCCcc Confidence 5421111 11111111 111211111 11011112333322 1111123344444444432 3432 22223346 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeecccCcch--heecchhhhhhhccceeeee Q lcl|Aclame:pro 464 SGRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETED--FVKLNEQIFDEESGEWVTIH 541 (711) Q Consensus 464 sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g~~~~~~--~v~~~~~~~~~~~g~~~~~n 541 (711) ||.+......+....|..+..|+..+... +|.++.+|.... +....+. -+.+|. T Consensus 374 Ta~~~~~~~~~~~S~L~~~a~~le~al~~----~l~~~A~w~g~~------~~~~~~~~~~~~in~-------------- 429 (488) T protein:vir:96 374 TATGAAIRSGSSTASMATLGNNVEDTVRN----MLRFIMRYFEGT------NLYVNPDELVFKLNR-------------- 429 (488) T ss_pred hHHHHHHHHHHhhHHHHHHHHHHHHHHHH----HHHHHHHHcCCC------CCCcCccceEEEecc-------------- Confidence 78887777777777888888877766554 455666665321 0000000 011111 Q ss_pred hhhheeeeEEeecccChHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHhcCC--c--chHHHHHHHHhhhcch Q lcl|Aclame:pro 542 DLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDW--P--GADVIAERLKKIVPPN 610 (711) Q Consensus 542 D~~~~~~dv~v~~~~~~~s~r~~~~~~L~~l~~~~p~~~~~~~~~~~~~~~~--~--~~~e~~~~l~~~~~~~ 610 (711) .|. ......+..++|..+... ..+....+-..++...+ | ..+++..++...-... T Consensus 430 -----dF~--------~~~ld~~~~~al~~~~~~-G~Is~~t~~~~L~~~gvl~~d~~~e~~~~~ie~~g~~~ 488 (488) T protein:vir:96 430 -----DYF--------DVEVNPQMLQVAYAAMME-GNLPQVSWFELLKRARVVRGDMSKEEFDEHIAELGFGM 488 (488) T ss_pred -----CCC--------CccCCHHHHHHHHHHHhc-CCCCHHHHHHHHHhCCcCCccCCHHHHHHHHhhcCCCC Confidence 010 000001112223322221 01111111111222222 1 2244444444322211 No 148 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=73.41 E-value=0.17 Score=24.79 Aligned_cols=113 Identities=12% Similarity=0.133 Sum_probs=9.5 Q ss_pred cCCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHHHHHHHHHHHHHH---------HHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 591 MDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQ---------ADMAQAEADTAQAQADMLK 661 (711) Q Consensus 591 ~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q---------~~~~k~qae~~~aqae~~~ 661 (711) |-+.... +...++...... ........+.+.+..++.....++. .+..+.+++.........+ T Consensus 1 ~~~~~~~-l~~~~~~~~~~l-------~el~e~~~~l~k~~~el~~~l~ea~~~ee~~~~ee~i~~l~~~~~el~e~~~~ 72 (466) T protein:vir:80 1 MALRQLM-LAKKIEQRKAAL-------AELLEQEKALQKRSEELEAAIDEANTDEEIAVVEDEINKLEGEKTELEEKKSK 72 (466) T ss_pred CchHHHH-HHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1110000 111111100000 0000000000000000000000000 0001111110000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHhhhcc-C Q lcl|Aclame:pro 662 AQLETEEAQKQLAMIEDMAQGGDVVYQQVRE----LVAQALAEITASQANVTE-Q 711 (711) Q Consensus 662 ~q~~~~~~q~q~~~~~~~~q~~~~~~~~~~~----~~~~~~~e~~~~qa~~e~-Q 711 (711) .+.+....+.+++.+................ .+..............+. + T Consensus 73 l~~ei~~le~el~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 127 (466) T protein:vir:80 73 LEGEIKELENELEQLNNKEPKNNSEPAQVSGARTQQFVGGETRMKGFFRNMPYEQ 127 (466) T ss_pred HHHHHHHHHHHHHHHHHhhhccCchhHHHHhhhhhHHhhHHHHHHHHHHhhhhhh Confidence 0000000001111100000000000000000 000000000010101110 1 No 149 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=71.98 E-value=0.19 Score=24.56 Aligned_cols=92 Identities=8% Similarity=0.090 Sum_probs=11.9 Q ss_pred hhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 618 EAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEADTAQAQADM--LKAQLETEEAQKQLAMIEDMAQGGDVVYQQVRELVA 695 (711) Q Consensus 618 ~~~~~~~~~~q~~~~~~q~~~~~~q~~~~k~qae~~~aqae~--~~~q~~~~~~q~q~~~~~~~~q~~~~~~~~~~~~~~ 695 (711) ....+.. .+.+..+....+.+.+.+....+.......... .+...+....+.+...++......+.....+..+++ T Consensus 1 ~~~~~~~--l~~~~~~~~~~l~el~e~~~~l~k~~~el~~~l~ea~~~ee~~~~ee~i~~l~~~~~el~e~~~~l~~ei~ 78 (466) T protein:vir:80 1 MALRQLM--LAKKIEQRKAALAELLEQEKALQKRSEELEAAIDEANTDEEIAVVEDEINKLEGEKTELEEKKSKLEGEIK 78 (466) T ss_pred CchHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0000000 111111111111111111111100000000000 000000000011111111111111111111122222 Q ss_pred HHHHHHHHHHhhhccC Q lcl|Aclame:pro 696 QALAEITASQANVTEQ 711 (711) Q Consensus 696 ~~~~e~~~~qa~~e~Q 711 (711) ...+++.......+.. T Consensus 79 ~le~el~e~~~~~~~~ 94 (466) T protein:vir:80 79 ELENELEQLNNKEPKN 94 (466) T ss_pred HHHHHHHHHHHhhhcc Confidence 2122221111111111 No 150 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=66.29 E-value=0.27 Score=23.70 Aligned_cols=113 Identities=8% Similarity=0.150 Sum_probs=9.5 Q ss_pred cCCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 591 MDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKS--QADMAQAEADTAQAQADMLKAQLETEE 668 (711) Q Consensus 591 ~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~--q~~~~k~qae~~~aqae~~~~q~~~~~ 668 (711) |..... .+.+.++..... .......................++ ..+..+.+++....+.+.........+ T Consensus 1 m~~k~~-~l~~~~~el~~~-------l~eL~e~~~~l~~~~~el~~~~ee~~~~e~~~~~~~~~~~l~~~i~~l~~~i~~ 72 (397) T protein:vir:96 1 MALKQL-ILNKQIKERSSE-------IDKLLSQRSDLEKQENDLERALEEAKTDEEISTVSDSADDLEKQVKDLDEKIAE 72 (397) T ss_pred CcHHHH-HHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111100 001111110000 0000000000000000000000000 000011111111111111111100000 Q ss_pred HHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhhhccC Q lcl|Aclame:pro 669 AQKQLAMIEDMAQGGDVVY-QQVRELVAQALAEITASQANVTEQ 711 (711) Q Consensus 669 ~q~q~~~~~~~~q~~~~~~-~~~~~~~~~~~~e~~~~qa~~e~Q 711 (711) .+.+...+........... ......................+. T Consensus 73 ~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 116 (397) T protein:vir:96 73 LQKEKQDLEDELAKAADPTDQKPKDGEKRKMKKFKVTEEELAEK 116 (397) T ss_pred HHHHHHHHHHHHHhhhhhhhhhhHHHHHHHHHHHhhhhHHHHHH Confidence 1110000000000000000 000000000000000000000000 No 151 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=65.25 E-value=0.29 Score=23.56 Aligned_cols=118 Identities=6% Similarity=0.089 Sum_probs=14.3 Q ss_pred cCCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHH Q lcl|Aclame:pro 591 MDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEADTAQA----QADMLKAQLET 666 (711) Q Consensus 591 ~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~~~~k~qae~~~a----qae~~~~q~~~ 666 (711) |. .+++.+.+++..........+.......................+++.++...+.++... ++.....+... T Consensus 1 Mk---i~elk~el~~~~~el~~~~~elr~~~~~~~~~~~el~~~~~e~~~~~~ei~el~~~l~~~~~~~~~~~e~~~~~~ 77 (437) T protein:vir:10 1 MK---IEKLKKDLATKTAELNTKKAEIRSFTESEDKTIDEVKAGMTEIKEKEDEIKEIRSNIEVLEQASALKVEEKRDDS 77 (437) T ss_pred CC---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22 334444443332211111100000000000000000000001111111111111111100 00000000000 Q ss_pred HHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHhhhccC Q lcl|Aclame:pro 667 EEAQKQ--LAMIEDMAQGGDVVYQQVRELVAQAL----AEITASQANVTEQ 711 (711) Q Consensus 667 ~~~q~q--~~~~~~~~q~~~~~~~~~~~~~~~~~----~e~~~~qa~~e~Q 711 (711) .....+ .......................... ....+........ T Consensus 78 ~~~~~e~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 128 (437) T protein:vir:10 78 DLVAPELEENSADNEEDDPEKLKTETKSEAEKDKKTVKDEEKRDAGGLQDM 128 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHhHH Confidence 000000 00000000000000000000000000 0000000000000 No 152 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=60.34 E-value=0.37 Score=22.92 Aligned_cols=101 Identities=9% Similarity=0.159 Sum_probs=9.1 Q ss_pred HHHHhhhcchhhcchhhhhhhhhHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHH--HHHHHHHHHH--HHH Q lcl|Aclame:pro 601 ERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQ---ADMAQAEADTAQAQADML--KAQLETEEAQ--KQL 673 (711) Q Consensus 601 ~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q---~~~~k~qae~~~aqae~~--~~q~~~~~~q--~q~ 673 (711) -.++++ ..+..+...+......+.+....... .+..+...+....+.+.. +.+.+..... ..+ T Consensus 1 Mki~el----------k~el~~~~~el~~~~~elr~~~~~~~~~~~el~~~~~e~~~~~~ei~el~~~l~~~~~~~~~~~ 70 (437) T protein:vir:10 1 MKIEKL----------KKDLATKTAELNTKKAEIRSFTESEDKTIDEVKAGMTEIKEKEDEIKEIRSNIEVLEQASALKV 70 (437) T ss_pred CCHHHH----------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 001000 00000011111111111111100000 001111111111111110 0111100000 000 Q ss_pred HHHHHHHHH---------HHHHHHHHHHHHHHHHHHHHHHH---hhhccC Q lcl|Aclame:pro 674 AMIEDMAQG---------GDVVYQQVRELVAQALAEITASQ---ANVTEQ 711 (711) Q Consensus 674 ~~~~~~~q~---------~~~~~~~~~~~~~~~~~e~~~~q---a~~e~Q 711 (711) ......... ....................... ...++. T Consensus 71 e~~~~~~~~~~~e~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 120 (437) T protein:vir:10 71 EEKRDDSDLVAPELEENSADNEEDDPEKLKTETKSEAEKDKKTVKDEEKR 120 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 000000000 00000000000000000000000 000000 No 153 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=49.72 E-value=0.63 Score=21.68 Aligned_cols=95 Identities=8% Similarity=0.145 Sum_probs=8.1 Q ss_pred HHHHHhhhcchhhcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 600 AERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQLETEEAQKQLAMIEDM 679 (711) Q Consensus 600 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~~~~k~qae~~~aqae~~~~q~~~~~~q~q~~~~~~~ 679 (711) ++.|++. ..+.++...+.+....++..+..+...+..+...+....+.+......+...++.. T Consensus 1 meeL~~~-----------------~~~~~~~~~e~~~~l~~~~~~~~~~~e~~~~l~~ei~~~~~~~~~l~~~~~~~~~~ 63 (389) T protein:vir:10 1 MDKLQTL-----------------FNDVSAKCADLNAQLNAKLQDENASVDDFQKIKDDLTAAKARRDAINDQIKALEAE 63 (389) T ss_pred ChHHHHH-----------------HHHHHHHHHHHHHHHHHHHHhHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1111111 00111110111111111000000000111111111111111110000011100000 Q ss_pred HHHHH-------HHHH--HH-HHHHHHHHHHHHHH-HhhhccC Q lcl|Aclame:pro 680 AQGGD-------VVYQ--QV-RELVAQALAEITAS-QANVTEQ 711 (711) Q Consensus 680 ~q~~~-------~~~~--~~-~~~~~~~~~e~~~~-qa~~e~Q 711 (711) ..... .... .. ..........+... +..-+.. T Consensus 64 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~ 106 (389) T protein:vir:10 64 KPAEPKTEPKDDGSKKGTDLSKKPIDAKKKAINDFIHSHGKVI 106 (389) T ss_pred HHhhhhccccccccccccccchhHHHHHHHHHHHHhhcchhhh Confidence 00000 0000 00 00000000000000 0000000 No 154 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=41.90 E-value=0.91 Score=20.82 Aligned_cols=74 Identities=11% Similarity=0.061 Sum_probs=10.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcc Q lcl|Aclame:pro 631 TPEQQVEMAKSQADMAQAEADTAQAQADMLKAQLETEEAQKQLAMIEDMAQGGDVVYQQVRELVAQALAEITASQANVTE 710 (711) Q Consensus 631 ~~~~q~~~~~~q~~~~k~qae~~~aqae~~~~q~~~~~~q~q~~~~~~~~q~~~~~~~~~~~~~~~~~~e~~~~qa~~e~ 710 (711) ..+++....+.+...+++++++..+..+.....-+......+ +... ......+..+++...++.......... T Consensus 1 meeL~~~~~~~~~~~~e~~~~l~~~~~~~~~~~e~~~~l~~e---i~~~----~~~~~~l~~~~~~~~~~~~~~~~~~~~ 73 (389) T protein:vir:10 1 MDKLQTLFNDVSAKCADLNAQLNAKLQDENASVDDFQKIKDD---LTAA----KARRDAINDQIKALEAEKPAEPKTEPK 73 (389) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHhHhhhHHHHHHHHHH---HHHH----HHHHHHHHHHHHHHHHHHHhhhhcccc Confidence 111111122222222222222111111100000001010011 1110 111111111111111111110000000 Q ss_pred C Q lcl|Aclame:pro 711 Q 711 (711) Q Consensus 711 Q 711 (711) . T Consensus 74 ~ 74 (389) T protein:vir:10 74 D 74 (389) T ss_pred c Confidence 0 No 155 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=40.78 E-value=0.96 Score=20.69 Aligned_cols=124 Identities=9% Similarity=0.073 Sum_probs=10.6 Q ss_pred HHHHhcCCc-chHHHHHHHHhhhcchhhcchhhhhhhhhHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 586 LIAQNMDWP-GADVIAERLKKIVPPNVLSKDEREAIEEDMPEQT--EPTPEQQVEMAKSQADMAQAEADTAQAQADMLKA 662 (711) Q Consensus 586 ~~~~~~~~~-~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q--~~~~~~q~~~~~~q~~~~k~qae~~~aqae~~~~ 662 (711) |-.+.+-+. ...++.+.+.++...................... ......+....+++.+....+++..+.+.+.... T Consensus 1 m~~k~~~l~~~~~el~~~l~eL~e~~~~l~~~~~el~~~~ee~~~~e~~~~~~~~~~~l~~~i~~l~~~i~~~~~~~~~l 80 (397) T protein:vir:96 1 MALKQLILNKQIKERSSEIDKLLSQRSDLEKQENDLERALEEAKTDEEISTVSDSADDLEKQVKDLDEKIAELQKEKQDL 80 (397) T ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 000000000 0111111111111110000000000000000000 0000001111111111111111111111100000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccC Q lcl|Aclame:pro 663 QLETEEAQKQLAMIEDMAQGGDVVYQQVRELVAQALAEITASQANVTEQ 711 (711) Q Consensus 663 q~~~~~~q~q~~~~~~~~q~~~~~~~~~~~~~~~~~~e~~~~qa~~e~Q 711 (711) +..................... ................+....+... T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~ 127 (397) T protein:vir:96 81 EDELAKAADPTDQKPKDGEKRK--MKKFKVTEEELAEKRSAINAFVKSK 127 (397) T ss_pred HHHHHhhhhhhhhhhHHHHHHH--HHHHhhhhHHHHHHHHHHHHHHHhh Confidence 0000000000000000000000 0000000000000000000000000 No 156 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=30.64 E-value=1.6 Score=19.53 Aligned_cols=91 Identities=13% Similarity=0.076 Sum_probs=7.2 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----- Q lcl|Aclame:pro 621 EEDMPEQTEPTPEQQVEMAKSQADMAQAE-------ADTAQAQADMLKAQLETEEAQKQLAMIEDMAQGGDVVYQ----- 688 (711) Q Consensus 621 ~~~~~~~q~~~~~~q~~~~~~q~~~~k~q-------ae~~~aqae~~~~q~~~~~~q~q~~~~~~~~q~~~~~~~----- 688 (711) .....+.+....+...+..+...++.... -++.+.+.+....+.+....+.+...++........... T Consensus 1 Mk~l~el~~~~~e~~~~~~~~~~~~~~~~~~~~~~~ee~~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~ 80 (387) T protein:vir:93 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVKDIEEKEKAKVKDTGEAYQS 80 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccCCC Confidence 00000000000000000111111000000 000001111100000000001111111000000000000 Q ss_pred -HHHHHHHHHHHH-HHHHH---------hhhccC Q lcl|Aclame:pro 689 -QVRELVAQALAE-ITASQ---------ANVTEQ 711 (711) Q Consensus 689 -~~~~~~~~~~~e-~~~~q---------a~~e~Q 711 (711) ..........++ +.... ...... T Consensus 81 ~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~ 114 (387) T protein:vir:93 81 LNDHEKMVKAKAEFYRHAILPNEFEKPSMEAQRL 114 (387) T ss_pred cchhhHHHHHHHHHHHHHhhhhhhhhhhhhhHHH Confidence 000000000000 00000 000000 No 157 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=28.02 E-value=1.8 Score=19.20 Aligned_cols=91 Identities=13% Similarity=0.075 Sum_probs=7.5 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----- Q lcl|Aclame:pro 621 EEDMPEQTEPTPEQQVEMAKSQADMAQAE-------ADTAQAQADMLKAQLETEEAQKQLAMIEDMAQGGDVVYQ----- 688 (711) Q Consensus 621 ~~~~~~~q~~~~~~q~~~~~~q~~~~k~q-------ae~~~aqae~~~~q~~~~~~q~q~~~~~~~~q~~~~~~~----- 688 (711) .....+.++.......++.+...++.+.. -++...+.+....+.+......+...++........... T Consensus 1 Mk~l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~~eei~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~ 80 (387) T protein:vir:96 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQS 80 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccCCC Confidence 00000000000000000001000000000 001111111100111100011111110000000000000 Q ss_pred -HHHHHHHHHHHHHHHHHh----------hhccC Q lcl|Aclame:pro 689 -QVRELVAQALAEITASQA----------NVTEQ 711 (711) Q Consensus 689 -~~~~~~~~~~~e~~~~qa----------~~e~Q 711 (711) ..........++..+... ..... T Consensus 81 ~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~ 114 (387) T protein:vir:96 81 LSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRL 114 (387) T ss_pred CchhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHH Confidence 000000000011000000 00000 No 158 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=28.02 E-value=1.8 Score=19.20 Aligned_cols=91 Identities=13% Similarity=0.075 Sum_probs=7.5 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----- Q lcl|Aclame:pro 621 EEDMPEQTEPTPEQQVEMAKSQADMAQAE-------ADTAQAQADMLKAQLETEEAQKQLAMIEDMAQGGDVVYQ----- 688 (711) Q Consensus 621 ~~~~~~~q~~~~~~q~~~~~~q~~~~k~q-------ae~~~aqae~~~~q~~~~~~q~q~~~~~~~~q~~~~~~~----- 688 (711) .....+.++.......++.+...++.+.. -++...+.+....+.+......+...++........... T Consensus 1 Mk~l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~~eei~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~ 80 (387) T protein:vir:26 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQS 80 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccCCC Confidence 00000000000000000001000000000 001111111100111100011111110000000000000 Q ss_pred -HHHHHHHHHHHHHHHHHh----------hhccC Q lcl|Aclame:pro 689 -QVRELVAQALAEITASQA----------NVTEQ 711 (711) Q Consensus 689 -~~~~~~~~~~~e~~~~qa----------~~e~Q 711 (711) ..........++..+... ..... T Consensus 81 ~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~ 114 (387) T protein:vir:26 81 LSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRL 114 (387) T ss_pred CchhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHH Confidence 000000000011000000 00000 No 159 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=28.02 E-value=1.8 Score=19.20 Aligned_cols=91 Identities=13% Similarity=0.075 Sum_probs=7.5 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----- Q lcl|Aclame:pro 621 EEDMPEQTEPTPEQQVEMAKSQADMAQAE-------ADTAQAQADMLKAQLETEEAQKQLAMIEDMAQGGDVVYQ----- 688 (711) Q Consensus 621 ~~~~~~~q~~~~~~q~~~~~~q~~~~k~q-------ae~~~aqae~~~~q~~~~~~q~q~~~~~~~~q~~~~~~~----- 688 (711) .....+.++.......++.+...++.+.. -++...+.+....+.+......+...++........... T Consensus 1 Mk~l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~~eei~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~ 80 (387) T protein:vir:94 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQS 80 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccCCC Confidence 00000000000000000001000000000 001111111100111100011111110000000000000 Q ss_pred -HHHHHHHHHHHHHHHHHh----------hhccC Q lcl|Aclame:pro 689 -QVRELVAQALAEITASQA----------NVTEQ 711 (711) Q Consensus 689 -~~~~~~~~~~~e~~~~qa----------~~e~Q 711 (711) ..........++..+... ..... T Consensus 81 ~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~ 114 (387) T protein:vir:94 81 LSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRL 114 (387) T ss_pred CchhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHH Confidence 000000000011000000 00000 No 160 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=26.00 E-value=2 Score=18.94 Aligned_cols=270 Identities=12% Similarity=0.041 Sum_probs=107.1 Q ss_pred ccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCccEEEEEEeeccCCCCCCcc-eEEEecCc Q lcl|Aclame:pro 120 GEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDL-IIEAIQNQ 198 (711) Q Consensus 120 ~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~~~~~~~~i-~i~~v~~~ 198 (711) ...-+..+.-.....+..++..|+. .-........-....+.+.+.+|.+++.+..+. .|.+ .+..+ +| T Consensus 1 ia~l~~~~~~~~~~~~~~l~~lL~~---~PN~~~t~~~f~~~~~~~ll~~Gna~~~i~r~~------~G~~~~l~~l-~~ 70 (278) T protein:vir:78 1 MASLPLKMYEDYKVVNTEVSDLLTV---SPNNSLSSFDFINQIETIRNEKGNAYVLIERDI------YHQPSKLFLL-NP 70 (278) T ss_pred CccceeEEEecCcccccHHHHHHHh---cCCCCCCHHHHHHHHHHHHhhcCCEEEEEEECC------CCcEEEEEEE-CC Confidence 2222222222223334444433321 001122344567778889999999998877642 1332 23333 44 Q ss_pred cceeeCCCccccCccccceeeeeecCCHHHHHHhcCCcccchhhcccccccccCCCCCeEEEEEeeeeeeeceeEEEccC Q lcl|Aclame:pro 199 FSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSD 278 (711) Q Consensus 199 ~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~~~~~~~~~~~~~~ 278 (711) ..|-..+.. |-. . .||.. ...+ T Consensus 71 ~~v~v~~~~------~~~----------------------------------------~-----~~y~~-------~~~~ 92 (278) T protein:vir:78 71 DVVEMLIEN------QSR----------------------------------------E-----LYYSI-------HAAT 92 (278) T ss_pred ceeEEEEcC------CCc----------------------------------------e-----EEEEE-------EcCC Confidence 444322111 000 0 01110 0111 Q ss_pred CcEEEecCcchhHHHHHhcCchhhhhcccceEEEEEEEEecCceeccCccCCCCccceEEEEeeeeccCCcccccchHHH Q lcl|Aclame:pro 279 GRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRH 358 (711) Q Consensus 279 ~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~le~~~p~~~~~~P~vp~~~~~~~~~~~~~~~g~v~~ 358 (711) |.... +..+.+++-..+ . ..+..+|.|.+.. T Consensus 93 g~~~~---------------------------------~~~~evih~~~~---------------~-~~~~~~G~s~~~~ 123 (278) T protein:vir:78 93 GNKLI---------------------------------VHNMDMLHFKHI---------------V-ASNMVQGISPIDV 123 (278) T ss_pred ceEEE---------------------------------EccccEEEECCC---------------C-CCCCeeeccHHHH Confidence 11111 111112211100 0 1123345566666 Q ss_pred hhHHHHHHHHHHHHHHHHHHhcCCCceEe-cccccCC-----hHHHHhhcccCCCceEEecccccCcCCccccCCccchH Q lcl|Aclame:pro 359 SKDAQRMANYWDSAATETVALAPKAPFIG-SEGNVEG-----REDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPA 432 (711) Q Consensus 359 ~~d~Q~~~N~~~s~~~~~l~~~~~~~~~~-~~~av~~-----~~~~~~~~~~~~~~~i~~~~~~~~~~~i~~~~~~~~~~ 432 (711) +.+.-...+......+..... .+..++ ..+.++. ..+.|.......|+++.+..|. .+..+.....-. T Consensus 124 ~~~~i~~~~~~~~~~~~~~~~--~~~~i~~~~~~l~~e~~~~~~~~~~~~~~~~g~~~vl~~g~----~~~~l~~~~~d~ 197 (278) T protein:vir:78 124 LKNTTDFDNAVRTFNLTEMQK--PDSFMLKYGSNVGKEKRQQVLEDFKQYYEENGGILFQEPGV----EIEPLPKKYVSE 197 (278) T ss_pred HHHHHHHHHHHHHHHHHHhcC--CCcEEEEeCCCCCHHHHHHHHHHHHHHhccCCCceecCCCc----eEEEccCChhHH Confidence 665555444433333333222 233333 3333322 2234444444455666554332 244444333444 Q ss_pred HHHHHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HhhcCccceE Q lcl|Aclame:pro 433 AELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMI-PHIYDTERVV 511 (711) Q Consensus 433 ~~~~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li-~~~~~~~r~~ 511 (711) .+.+..+...+.|-...||++..+|...+++-+.+-++...--... +..+.+.+-+-+ .+.++.... T Consensus 198 ~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~~~~~~~~~~~~-----------l~P~~~~i~~~ln~~L~~~~e~- 265 (278) T protein:vir:78 198 DIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRFYLQHT-----------LLPIVKQYEEEFNRKLLTKTDR- 265 (278) T ss_pred HHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHH-----------HHHHHHHHHHHHHhhcCChhHh- Confidence 5556666777888888999999999654432111111111111122 333333332222 223322110 Q ss_pred eeecccCcchheecchhhh Q lcl|Aclame:pro 512 RLKFPDETEDFVKLNEQIF 530 (711) Q Consensus 512 ri~g~~~~~~~v~~~~~~~ 530 (711) ...-++.++-..+ T Consensus 266 ------~~g~~~~f~~~~l 278 (278) T protein:vir:78 266 ------EKIGILNLTLNLI 278 (278) T ss_pred ------cCCceEEEecccC Confidence 0112233332111 No 161 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=23.28 E-value=2.3 Score=18.57 Aligned_cols=80 Identities=8% Similarity=-0.026 Sum_probs=6.7 Q ss_pred hhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 618 EAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQLE-TEEAQKQLAMIEDMAQGGDVVYQQVRELVAQ 696 (711) Q Consensus 618 ~~~~~~~~~~q~~~~~~q~~~~~~q~~~~k~qae~~~aqae~~~~q~~-~~~~q~q~~~~~~~~q~~~~~~~~~~~~~~~ 696 (711) +-..+...+.++ ...+...+..... .++++.......+ ......+...++......+.........+.. T Consensus 1 Mn~~e~lkel~~-------~~~el~~~~~~~~---~~~~~~~~e~~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~ 70 (421) T protein:vir:13 1 MNLFERLKELRA-------KKKELEEKRCGIV---EEIRSLAKEKKEEEARSKALEREKIEARMEIIEEEIESVMTAIDE 70 (421) T ss_pred CCHHHHHHHHHH-------HHHHHHHHHHHHH---HHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 000000000101 1111111000000 0000000000000 0000001111100000000000000000000 Q ss_pred HHHHHHHHHhhhccC Q lcl|Aclame:pro 697 ALAEITASQANVTEQ 711 (711) Q Consensus 697 ~~~e~~~~qa~~e~Q 711 (711) .. ........+ T Consensus 71 ~~----~~~~~~~~~ 81 (421) T protein:vir:13 71 ER----KNTNFTGGR 81 (421) T ss_pred HH----hhhcccccc Confidence 00 000000000 No 162 >protein:vir:105619 Length: 772 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164304;genbank:gi:56692922;genbank:GeneID:3197230 Probab=22.85 E-value=2.4 Score=18.51 Aligned_cols=601 Identities=11% Similarity=0.013 Sum_probs=158.3 Q ss_pred HHHHHHHHHHHHHHhhch------HHHHHHHHHHHHhCC-CCCCHHHHHHHHHhCCCceEehhhHHHHHHHhhhhhhccc Q lcl|Aclame:pro 30 ALLATARERARDGATYWK------DNWEAAEDDLKFLGG-EQWPSQVRTERELEQRPCLVNNVLPTFVDQVLGDQRQNRP 102 (711) Q Consensus 30 ~~~~~~~~~~~~~~~~~~------~~r~~~~~~~~~y~G-~Qw~~~~~~~~~~~g~p~~~~N~i~~~v~~i~g~~~~~r~ 102 (711) .-+++=...|-...+... ..+..+..+.+ + .-|- ....+.... +.-|...+.+..++-...+-.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~q~~~r---~~a~~d~~f--y~G~QW~~~~~~~l~~~g~p~~ 72 (772) T protein:vir:10 1 MQITENDRQYLNGLPPAGDTPLTVDEYADINYEIE---DQPAWR---AVADKEMDY--ADGNQLDTELLRRQQALGIPPA 72 (772) T ss_pred CCcchhhHHhhccCCcccccccCHHHHHHHHHHHh---ccHHHH---HHHHHHHHh--hcCCCCCHHHHHHHHhcCCCcE Confidence 112222222222222111 11111111111 1 1121 111111110 0123444455555533322222 Q ss_pred ceeEecchhhhhhhhhcccccccccccCCCchhHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCccEEEEEEeecc Q lcl|Aclame:pro 103 AIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESGMGYLRVRSDYLA 182 (711) Q Consensus 103 ~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a~~~~~~~G~g~~~v~~d~~~ 182 (711) .+.....- .....+..-...+-....+.+...-..+.+++..+....-.......+..++..+++.+ .++|- T Consensus 73 ~~N~i~~~----v~~v~g~~~~nr~d~~v~Pr~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~-G~Gw~--- 144 (772) T protein:vir:10 73 VEDLIGPA----LLSLQGYEAVTRTDWRVTPNGDVGGQEVADALNYRLNTAERQSGADRACSEAFRPQIAC-GIGWV--- 144 (772) T ss_pred EEcchHHH----HHHHHHHHHhcCcceEEecCCCchHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHhhhc-CceeE--- Confidence 22222211 11112222222222223343332224566666666666555666666777777777654 34441 Q ss_pred CCCCCCcceEEEecCccceeeCCCccccCccccceeeeeecCCHHHHHHhcCCccc--ch----hhc-ccc----cc-cc Q lcl|Aclame:pro 183 DDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATA--EP----VYE-DSV----AD-YD 250 (711) Q Consensus 183 ~~~~~~~i~i~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~--~~----~~~-~~~----~~-~~ 250 (711) +..++.+..-..| ..-..||...=+|. +| + .+.++.+-.|-..-. +. +.. ... .+ .. T Consensus 145 e~~~~~d~~~~~i---~i~~v~p~~v~~Dp-~a------~-~D~sDar~~~~~~~~~~d~~~~~fp~~a~~~~~~~~~~~ 213 (772) T protein:vir:10 145 EVSRESDPFKFPY---RCRPIRRDEIHWDM-KC------G-DDWEACRFLRRQRWLSPDRIALVFPEHAELIGMVGKYGS 213 (772) T ss_pred EeccccCCCCCCe---EEEeeCcccceecC-CC------C-CCHHHhhhhhhhccCCHHHHHHhCCCchhHHHhhhhhcc Confidence 1122222111111 01112444443442 11 1 255554433211110 00 111 000 00 01 Q ss_pred cCCCC-----------CeEEEE----Ee-------eeee-eeceeEEEccCCcEEEecCcchhHHHHHhcCchh------ Q lcl|Aclame:pro 251 TWFTE-----------KSVRVS----EY-------FTRE-PVIREIALLSDGRSFWLDALEDIVDELLEAGISI------ 301 (711) Q Consensus 251 ~~~~~-----------~~v~v~----E~-------~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~------ 301 (711) .|... +.+... .. |+-. ..+.+++.+ |++......++ ....|... T Consensus 214 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rVrv~E~----w~r~~~~~~~~--~~~~g~~~~~~~~~ 287 (772) T protein:vir:10 214 TWWGQPDLGMMEGGTSTGLHNAWNEARAWTVQEDHWYNPTSKEICLVEL----WYRRWVQVHVL--KSPDGRVVEYDPNN 287 (772) T ss_pred cccCcccccccccccccccccccchhhccccccccccccCCceEEEEEE----eeeeeeeeeee--ccCCCceEeeCccc Confidence 11110 001000 01 1110 011111111 11100000000 01111111 Q ss_pred ------hhhcccceEEEEEEEEecCceeccCccCCCCccce----EEEEeeeeccCCcccccchHHHhhHHHHHHHHHHH Q lcl|Aclame:pro 302 ------VRTRKVKTFKTYWRKITGANVLEGPVEIPSTTIPV----IPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDS 371 (711) Q Consensus 302 ------~~~~~~~~~~v~~~~~~g~~~le~~~p~~~~~~P~----vp~~~~~~~~~~~~~~~g~v~~~~d~Q~~~N~~~s 371 (711) +....+..++++...+....++ +..-+..+.-|| +|+++++ .......|....++..=+..=.+.+ T Consensus 288 ~~~~~~l~~g~~~~~~~~~~rv~~~~~~-g~~~L~~~~~p~~~~~fP~vP~~---g~r~~~~g~~~G~vr~~kd~Qr~~N 363 (772) T protein:vir:10 288 LAHNIALASGRISPKKVTVSRVRRSYWL-GPHCLHDGPTPYTHRHFPYVPFF---GFREDATGIPYGYVRGMKYAQDSLN 363 (772) T ss_pred HHHHHHHhhcccchheeeeeEEEEEEEe-cceeeccCCCCCCCCccceEEEe---eeEeccCCcccchhhhhhhHHHHHH Confidence 1111111112221111111122 222233333332 3333211 1111113322222221111222222 Q ss_pred HHHHHHHhcCCCceEecccccCChHHHHhhcccCCCceEEec-----ccccCcCCcccc-----------CCccchHHHH Q lcl|Aclame:pro 372 AATETVALAPKAPFIGSEGNVEGREDEWEQANTKNFSLLTYI-----PQYQGDPGPRRQ-----------PPAAVPAAEL 435 (711) Q Consensus 372 ~~~~~l~~~~~~~~~~~~~av~~~~~~~~~~~~~~~~~i~~~-----~~~~~~~~i~~~-----------~~~~~~~~~~ 435 (711) +.. +. -.+++...++.. ..|++--.. ..+..+..+.+. ...+.+.-.. T Consensus 364 ~~~---S~---~~~~l~~~~~~~----------~~gav~~~d~~~~e~~arp~~vi~~~~~~~~~~~~~~~~~~~~~~~~ 427 (772) T protein:vir:10 364 SGV---SK---LRWGMSVARVER----------TKGAVAMTDAQFRRQIARPDADIVLDENHMAKPGARFDVKRDYTLTD 427 (772) T ss_pred HHH---HH---HHHHHhcccccc----------cCCCccchhHHHHHhccCCCCeEEeCCccccCCCCCccccCCccccH Confidence 111 11 022222222211 111111100 011111111111 1112233345 Q ss_pred HHHHHHHHHHHHHhCCCHHHhccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceEeeec Q lcl|Aclame:pro 436 TLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKF 515 (711) Q Consensus 436 ~ll~~~~~~~~~~tGv~~~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~g 515 (711) +++++....+.++.-++..+....|..+++..--+....-......+. .+-.-++...+.+-+++..+.- .+. T Consensus 428 ~~~~llq~~~~~i~~vsGv~~~~lG~~~na~SGvAi~~rq~qg~~~l~-~~~Dnl~~~~~~~g~~lL~li~-----~~y- 500 (772) T protein:vir:10 428 QHFQMLQDNRATIERVSNITAGFQGRKGTATSGIQEQQQIEQSNQSIG-RIMDNFRAGRTLVGELLLAMIV-----EDI- 500 (772) T ss_pred HHHHHHHHHHHHHHHHhCCCHHHcCCCcchhhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHH-----HHc- Confidence 666666666666554433222222333333222222211111111111 1111222233333333333321 111 Q ss_pred ccCcchheecchhhhhhhccceeeeehhh----hee----eeEEeec---ccC-hHHHHHHHHHHHHHHHhhcchhHHHH Q lcl|Aclame:pro 516 PDETEDFVKLNEQIFDEESGEWVTIHDLN----VQK----YDVVVTT---GPA-FATQRIEAAEAMIQFAQAVPSAAAVM 583 (711) Q Consensus 516 ~~~~~~~v~~~~~~~~~~~g~~~~~nD~~----~~~----~dv~v~~---~~~-~~s~r~~~~~~L~~l~~~~p~~~~~~ 583 (711) ..++.+.|... -......++.+|... .|. .||++.. ... .++.-....+++..+++.++.+.+.+ T Consensus 501 --~~er~~RI~~~-d~~~~~~~v~in~~~~d~~tg~~~~~NDi~~g~yDv~i~~~p~~~t~r~~~~~~m~ql~~~~~P~~ 577 (772) T protein:vir:10 501 --GQERTEVVIEG-DAVTADRVVVLNEPQRDPQTGAAYLSNDLLRTRIKVALEDVPSTNSYRGQQLNAMSEAVKSMPPQY 577 (772) T ss_pred --CCCcEEEEecC-CCCCCCceEEeccceecccccccceeccceeeeEEEEeeccccchHHHHHHHHHHHHHHhccChhH Confidence 12233333221 011123445555432 122 3444332 122 23333334444555555544443333 Q ss_pred HHHHHHh-cCCcchHHHHHHHHhhhcchhhcchhhhhhhhhHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 584 ADLIAQN-MDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQ--VEMAKSQADMAQAEADTAQAQADML 660 (711) Q Consensus 584 ~~~~~~~-~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q--~~~~~~q~~~~k~qae~~~aqae~~ 660 (711) ...++.. ++........+.++.......+..+ .+++++.+++.+ ++..+++++..+++++.+++.++.. T Consensus 578 ~~~~~~~~le~~D~p~~~ei~~~ir~~~~~~~p--------eq~~~~~~q~~qq~~~~~~~el~~~q~~a~~~~~~A~a~ 649 (772) T protein:vir:10 578 QAAVLPFLVSLMDVPFKRDVVEAIRAVDQQQTP--------EQIQQQIDQAVQDALAKAGNDIKLRELEIKERKADSEIS 649 (772) T ss_pred HHHHHHHHHhhcCCCChHHHHHHHHHHhccCCh--------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3322221 1111111111111111110000000 000111111111 1112222222222222222222222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccC Q lcl|Aclame:pro 661 KAQLETEEAQKQLAMIEDMAQGGDVVYQQVRELVAQALAEITASQANVTEQ 711 (711) Q Consensus 661 ~~q~~~~~~q~q~~~~~~~~q~~~~~~~~~~~~~~~~~~e~~~~qa~~e~Q 711 (711) +.++ ++.+....++...++..+...+. .+.+.+ +...+..+ T Consensus 650 ~~~a-------qa~~~~~~a~~~a~~aa~~~~q~-~q~a~~--ad~~l~~~ 690 (772) T protein:vir:10 650 GLNA-------KAVQIGVQAAFSAMQAGAQIAQM-PMIAPI--ADAVMQSA 690 (772) T ss_pred HHHH-------HHHHHHHHHHHHHhhhhhhHHhh-hhhhHH--HHHHHHhc Confidence 1111 11111110111111111110110 111222 22333344 Done!