Query lcl|Aclame:protein:vir:105429|NCBI_annot:gene 3 protein|genbank:acc:NP_958179;genbank:gi:41057281;genbank:GeneID:2716676 Match_columns 708 No_of_seqs 174 out of 270 Neff 9.2 Searched_HMMs 1612 Date Sat Nov 30 23:29:07 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_65 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_65_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:172 Length: 708 # 100.0 1E-199 9E-203 1110.8 74.3 708 1-708 1-708 (708) 2 protein:vir:105429 Length: 708 100.0 7E-199 4E-202 1107.0 75.2 708 1-708 1-708 (708) 3 protein:vir:105520 Length: 706 100.0 1E-189 8E-193 1056.0 74.3 706 1-708 1-706 (706) 4 protein:vir:3520 Length: 720 # 100.0 4E-183 3E-186 1020.4 72.8 706 1-708 1-709 (720) 5 protein:vir:100920 Length: 725 100.0 3E-172 2E-175 960.6 70.3 692 1-708 1-717 (725) 6 protein:vir:77597 Length: 725 100.0 1E-170 9E-174 951.8 71.1 692 1-708 1-723 (725) 7 protein:vir:9263 Length: 725 # 100.0 1E-170 7E-174 952.2 69.6 692 1-708 1-723 (725) 8 protein:vir:108295 Length: 711 100.0 4E-164 2E-167 916.6 75.0 666 1-697 22-711 (711) 9 protein:vir:105619 Length: 772 100.0 2E-152 1E-155 852.5 65.9 664 1-708 15-738 (772) 10 protein:vir:817 Length: 714 # 100.0 3E-150 2E-153 840.3 72.7 660 1-700 8-714 (714) 11 protein:vir:10117 Length: 714 100.0 3E-150 2E-153 840.3 72.7 660 1-700 8-714 (714) 12 protein:vir:9950 Length: 714 # 100.0 3E-150 2E-153 840.3 72.7 660 1-700 8-714 (714) 13 protein:vir:3296 Length: 714 # 100.0 3E-150 2E-153 840.3 72.7 660 1-700 8-714 (714) 14 protein:vir:2764 Length: 714 # 100.0 3E-150 2E-153 840.3 72.7 660 1-700 8-714 (714) 15 protein:vir:104437 Length: 714 100.0 2E-147 1E-150 825.3 69.7 660 1-700 1-714 (714) 16 protein:vir:93630 Length: 776 100.0 3E-142 2E-145 796.8 61.3 657 1-708 38-736 (776) 17 protein:vir:95821 Length: 763 100.0 2.3E-83 1.4E-86 473.6 62.2 623 1-708 20-750 (763) 18 protein:vir:8846 Length: 705 # 100.0 5E-81 3.1E-84 460.8 57.0 616 1-708 1-700 (705) 19 protein:vir:80165 Length: 651 100.0 1.8E-56 1.1E-59 326.2 54.6 586 1-685 15-651 (651) 20 protein:vir:345 Length: 663 # 100.0 8.2E-44 5.1E-47 256.8 41.7 597 1-705 1-663 (663) 21 protein:vir:95449 Length: 584 100.0 2.7E-41 1.6E-44 243.0 38.1 540 1-638 1-584 (584) 22 protein:vir:94599 Length: 641 100.0 6E-39 3.7E-42 230.1 42.7 581 1-705 20-641 (641) 23 protein:vir:3139 Length: 599 # 100.0 5.2E-35 3.2E-38 208.5 33.9 562 1-673 1-599 (599) 24 protein:vir:103765 Length: 549 99.9 3.5E-19 2.2E-22 121.7 44.5 530 1-657 1-549 (549) 25 protein:vir:7321 Length: 556 # 99.9 2.5E-19 1.6E-22 122.5 43.3 536 1-693 1-556 (556) 26 protein:vir:95315 Length: 559 99.8 1.2E-18 7.4E-22 118.9 42.2 539 1-693 1-559 (559) 27 protein:vir:2198 Length: 536 # 99.8 8.3E-18 5.1E-21 114.2 42.8 513 1-678 1-536 (536) 28 protein:vir:10447 Length: 536 99.8 1.1E-17 6.8E-21 113.6 42.3 513 1-678 1-536 (536) 29 protein:vir:94709 Length: 522 99.8 2.2E-17 1.4E-20 111.9 42.5 501 1-669 1-522 (522) 30 protein:vir:107404 Length: 555 99.8 4.7E-17 2.9E-20 110.1 43.1 536 1-685 1-555 (555) 31 protein:vir:107822 Length: 555 99.8 4.7E-17 2.9E-20 110.1 43.1 536 1-685 1-555 (555) 32 protein:vir:98506 Length: 555 99.8 4.7E-17 2.9E-20 110.1 43.1 536 1-685 1-555 (555) 33 protein:vir:96494 Length: 501 99.8 8.1E-19 5E-22 119.8 33.1 467 1-647 29-501 (501) 34 protein:vir:102668 Length: 547 99.8 1.3E-17 7.9E-21 113.2 39.4 531 7-654 1-547 (547) 35 protein:vir:1538 Length: 535 # 99.8 1.2E-16 7.6E-20 107.8 43.3 510 1-670 1-535 (535) 36 protein:vir:3361 Length: 535 # 99.8 9.9E-17 6.2E-20 108.3 42.8 510 1-687 1-535 (535) 37 protein:vir:97171 Length: 512 99.8 3E-18 1.9E-21 116.7 33.9 475 1-661 31-512 (512) 38 protein:vir:99522 Length: 470 99.8 3.7E-17 2.3E-20 110.7 37.4 447 1-635 1-470 (470) 39 protein:vir:2732 Length: 501 # 99.8 2.5E-17 1.6E-20 111.6 36.0 466 1-647 29-501 (501) 40 protein:vir:1785 Length: 555 # 99.8 7.1E-17 4.4E-20 109.1 36.8 531 5-699 1-555 (555) 41 protein:vir:4898 Length: 502 # 99.8 2.8E-17 1.7E-20 111.4 33.9 466 1-643 30-502 (502) 42 protein:vir:733 Length: 453 # 99.8 1.4E-16 9E-20 107.4 37.8 437 1-637 11-453 (453) 43 protein:vir:96240 Length: 511 99.8 5.4E-17 3.3E-20 109.8 35.3 472 1-643 31-511 (511) 44 protein:vir:99672 Length: 532 99.7 4.8E-16 3E-19 104.6 38.8 506 1-656 1-532 (532) 45 protein:vir:103951 Length: 511 99.7 1.3E-16 8.2E-20 107.6 35.1 474 1-643 31-511 (511) 46 protein:vir:94572 Length: 535 99.7 1.5E-15 9.5E-19 101.8 39.9 508 1-686 1-535 (535) 47 protein:vir:3964 Length: 453 # 99.7 5.9E-16 3.6E-19 104.1 37.2 438 1-633 11-453 (453) 48 protein:vir:96988 Length: 516 99.7 1.3E-15 8.4E-19 102.1 39.0 494 1-649 1-516 (516) 49 protein:vir:96366 Length: 511 99.7 6.3E-16 3.9E-19 103.9 36.7 469 1-633 31-511 (511) 50 protein:vir:78805 Length: 511 99.7 6.3E-16 3.9E-19 103.9 36.7 469 1-633 31-511 (511) 51 protein:vir:9871 Length: 429 # 99.7 1.3E-15 8E-19 102.2 38.2 426 4-642 1-429 (429) 52 protein:vir:3609 Length: 452 # 99.7 1.1E-15 7E-19 102.5 37.7 437 1-647 11-452 (452) 53 protein:vir:9306 Length: 511 # 99.7 1.1E-15 6.9E-19 102.5 37.5 472 1-633 31-511 (511) 54 protein:vir:102950 Length: 471 99.7 5.4E-16 3.3E-19 104.3 35.5 456 1-635 1-471 (471) 55 protein:vir:99781 Length: 511 99.7 5.1E-16 3.1E-19 104.4 35.3 470 1-638 31-511 (511) 56 protein:vir:95806 Length: 440 99.7 2.2E-16 1.4E-19 106.4 32.8 432 12-633 1-440 (440) 57 protein:vir:38 Length: 496 # N 99.7 3.5E-15 2.2E-18 99.8 38.7 466 1-601 1-496 (496) 58 protein:vir:8883 Length: 543 # 99.7 6.7E-15 4.1E-18 98.3 40.1 519 1-683 1-543 (543) 59 protein:vir:80680 Length: 441 99.7 7.7E-15 4.8E-18 98.0 39.6 435 1-642 1-441 (441) 60 protein:vir:106639 Length: 481 99.7 3.7E-15 2.3E-18 99.7 36.3 453 1-639 23-481 (481) 61 protein:vir:102330 Length: 451 99.7 1.9E-15 1.2E-18 101.3 34.5 446 4-640 1-451 (451) 62 protein:vir:105461 Length: 470 99.7 2.9E-15 1.8E-18 100.3 35.0 460 4-643 1-470 (470) 63 protein:vir:94805 Length: 492 99.7 3E-15 1.9E-18 100.2 33.9 447 1-649 37-492 (492) 64 protein:vir:105641 Length: 516 99.7 3.5E-14 2.2E-17 94.4 39.7 494 1-637 1-516 (516) 65 protein:vir:93747 Length: 472 99.7 6.6E-15 4.1E-18 98.3 35.0 448 1-645 17-472 (472) 66 protein:vir:9922 Length: 489 # 99.7 1.7E-14 1E-17 96.1 36.9 452 1-616 1-489 (489) 67 protein:vir:78907 Length: 518 99.7 1.3E-15 7.9E-19 102.2 30.6 495 1-614 1-518 (518) 68 protein:vir:94101 Length: 474 99.7 2.5E-15 1.6E-18 100.6 32.1 451 1-633 1-474 (474) 69 protein:vir:105889 Length: 474 99.7 2.5E-15 1.6E-18 100.6 32.1 451 1-633 1-474 (474) 70 protein:vir:97336 Length: 492 99.7 1.5E-14 9.1E-18 96.4 35.8 448 1-645 35-492 (492) 71 protein:vir:1236 Length: 483 # 99.7 1.4E-14 8.9E-18 96.5 35.7 447 1-635 29-483 (483) 72 protein:vir:105292 Length: 478 99.7 2.3E-15 1.4E-18 100.9 31.2 453 1-640 1-478 (478) 73 protein:vir:95113 Length: 474 99.7 1.1E-14 6.6E-18 97.2 34.4 436 1-617 24-474 (474) 74 protein:vir:5961 Length: 503 # 99.7 8.3E-15 5.1E-18 97.8 33.8 471 1-642 1-503 (503) 75 protein:vir:94546 Length: 506 99.6 5.7E-15 3.5E-18 98.7 32.3 462 1-643 16-506 (506) 76 protein:vir:96179 Length: 468 99.6 4.5E-14 2.8E-17 93.7 36.5 438 1-637 1-468 (468) 77 protein:vir:107112 Length: 478 99.6 6.3E-15 3.9E-18 98.4 31.8 453 1-638 1-478 (478) 78 protein:vir:100039 Length: 522 99.6 1E-13 6.3E-17 91.8 37.7 501 3-675 1-522 (522) 79 protein:vir:104082 Length: 485 99.6 1.4E-14 8.9E-18 96.5 32.3 467 1-649 10-485 (485) 80 protein:vir:2427 Length: 485 # 99.6 1.7E-14 1.1E-17 96.1 32.2 464 1-642 6-485 (485) 81 protein:vir:2341 Length: 488 # 99.6 1.7E-14 1.1E-17 96.1 32.1 470 1-642 1-488 (488) 82 protein:vir:106571 Length: 499 99.6 2.4E-15 1.5E-18 100.7 27.4 482 1-657 1-499 (499) 83 protein:vir:94498 Length: 474 99.6 2.1E-14 1.3E-17 95.6 32.2 450 1-638 13-474 (474) 84 protein:vir:97447 Length: 474 99.6 2.1E-14 1.3E-17 95.6 32.2 450 1-638 13-474 (474) 85 protein:vir:79043 Length: 479 99.6 5.3E-14 3.3E-17 93.4 34.1 457 1-637 14-479 (479) 86 protein:vir:1587 Length: 508 # 99.6 2.1E-13 1.3E-16 90.1 37.0 483 1-600 1-508 (508) 87 protein:vir:96839 Length: 474 99.6 8.6E-14 5.3E-17 92.2 33.5 451 1-640 1-474 (474) 88 protein:vir:3028 Length: 500 # 99.6 1.6E-13 9.8E-17 90.8 34.9 473 1-604 3-500 (500) 89 protein:vir:9815 Length: 500 # 99.6 1.6E-13 9.8E-17 90.8 34.9 473 1-604 3-500 (500) 90 protein:vir:99916 Length: 504 99.6 5.9E-13 3.7E-16 87.6 37.7 462 1-708 18-501 (504) 91 protein:vir:80959 Length: 499 99.6 6.6E-13 4.1E-16 87.4 42.0 473 1-606 1-499 (499) 92 protein:vir:7017 Length: 515 # 99.6 7.7E-13 4.8E-16 87.0 40.8 494 1-653 1-515 (515) 93 protein:vir:9751 Length: 422 # 99.6 1.4E-13 8.4E-17 91.1 33.3 408 1-591 1-422 (422) 94 protein:vir:78537 Length: 480 99.6 6.7E-14 4.1E-17 92.8 31.1 462 1-647 1-480 (480) 95 protein:vir:78227 Length: 480 99.6 6E-14 3.7E-17 93.1 30.6 466 1-654 1-480 (480) 96 protein:vir:79703 Length: 505 99.6 1E-12 6.2E-16 86.4 37.8 478 1-604 1-505 (505) 97 protein:vir:96266 Length: 474 99.6 8.2E-14 5.1E-17 92.3 31.1 446 1-638 20-474 (474) 98 protein:vir:95899 Length: 474 99.6 8.2E-14 5.1E-17 92.3 31.1 446 1-638 20-474 (474) 99 protein:vir:94742 Length: 409 99.6 1.3E-13 7.8E-17 91.3 31.8 399 1-578 1-409 (409) 100 protein:vir:4223 Length: 486 # 99.6 2.3E-13 1.4E-16 89.9 33.1 466 1-641 8-486 (486) 101 protein:vir:99072 Length: 479 99.6 4.8E-14 3E-17 93.6 28.7 464 1-646 1-479 (479) 102 protein:vir:80211 Length: 514 99.5 1.8E-12 1.1E-15 84.9 40.3 494 5-637 1-514 (514) 103 protein:vir:78083 Length: 537 99.5 1.8E-12 1.1E-15 85.0 36.6 504 1-708 1-528 (537) 104 protein:vir:98883 Length: 517 99.5 1.2E-12 7.5E-16 85.9 35.5 481 1-619 1-517 (517) 105 protein:vir:103330 Length: 517 99.5 2E-12 1.2E-15 84.7 41.5 498 1-677 1-517 (517) 106 protein:vir:78696 Length: 542 99.5 2.2E-12 1.4E-15 84.4 39.6 518 5-705 1-542 (542) 107 protein:vir:78942 Length: 510 99.5 2.6E-12 1.6E-15 84.1 39.3 489 5-648 1-510 (510) 108 protein:vir:105819 Length: 456 99.5 1.3E-12 8.2E-16 85.7 34.4 449 1-636 1-456 (456) 109 protein:vir:102602 Length: 456 99.5 1.3E-12 8.2E-16 85.7 34.4 449 1-636 1-456 (456) 110 protein:vir:7768 Length: 484 # 99.5 1.2E-13 7.4E-17 91.4 28.4 466 1-661 1-484 (484) 111 protein:vir:8184 Length: 474 # 99.5 2.5E-12 1.5E-15 84.2 35.2 443 1-638 12-474 (474) 112 protein:vir:7987 Length: 456 # 99.5 1.8E-12 1.1E-15 84.9 34.5 440 1-643 1-456 (456) 113 protein:vir:1634 Length: 409 # 99.5 6.7E-13 4.1E-16 87.3 31.0 399 1-578 1-409 (409) 114 protein:vir:9568 Length: 410 # 99.5 2.1E-12 1.3E-15 84.6 33.3 400 20-593 1-410 (410) 115 protein:vir:3520 Length: 720 # 99.5 6.5E-12 4E-15 81.9 40.6 650 5-708 1-702 (720) 116 protein:vir:6322 Length: 510 # 99.5 6.5E-12 4.1E-15 81.9 40.4 487 5-648 1-510 (510) 117 protein:vir:4782 Length: 522 # 99.4 2.3E-12 1.4E-15 84.4 30.3 487 1-620 3-522 (522) 118 protein:vir:2500 Length: 501 # 99.4 5.1E-13 3.2E-16 88.0 26.7 471 1-644 1-501 (501) 119 protein:vir:7430 Length: 563 # 99.3 5.8E-12 3.6E-15 82.2 25.3 536 1-708 1-555 (563) 120 protein:vir:98444 Length: 434 99.3 4.1E-11 2.5E-14 77.6 29.7 422 37-639 1-434 (434) 121 protein:vir:101494 Length: 527 99.0 6.2E-09 3.9E-12 65.6 32.2 506 1-662 1-527 (527) 122 protein:vir:102239 Length: 527 99.0 6.5E-09 4.1E-12 65.5 32.3 506 1-662 1-527 (527) 123 protein:vir:105520 Length: 706 99.0 9.7E-09 6E-12 64.5 38.5 637 5-708 1-701 (706) 124 protein:vir:105429 Length: 708 98.9 1.4E-08 8.4E-12 63.7 40.4 643 5-708 1-700 (708) 125 protein:vir:105619 Length: 772 98.9 1.5E-08 9E-12 63.6 33.1 615 1-708 1-714 (772) 126 protein:vir:100920 Length: 725 98.9 2.6E-08 1.6E-11 62.2 35.3 629 5-708 1-710 (725) 127 protein:vir:9263 Length: 725 # 98.7 8.5E-08 5.3E-11 59.3 40.6 629 5-708 1-717 (725) 128 protein:vir:93630 Length: 776 98.7 1.3E-07 8E-11 58.4 25.4 609 1-708 22-728 (776) 129 protein:vir:77597 Length: 725 98.6 1.9E-07 1.2E-10 57.5 36.7 629 5-708 1-718 (725) 130 protein:vir:108295 Length: 711 98.6 1.9E-07 1.2E-10 57.4 35.7 621 1-704 1-711 (711) 131 protein:vir:172 Length: 708 # 98.6 2.8E-07 1.7E-10 56.5 38.7 621 5-708 1-700 (708) 132 protein:vir:103385 Length: 666 98.4 5.6E-08 3.5E-11 60.3 16.3 579 1-655 1-666 (666) 133 protein:vir:3296 Length: 714 # 98.4 6.9E-07 4.3E-10 54.4 35.8 610 1-705 1-714 (714) 134 protein:vir:10117 Length: 714 98.4 6.9E-07 4.3E-10 54.4 35.8 610 1-705 1-714 (714) 135 protein:vir:9950 Length: 714 # 98.4 6.9E-07 4.3E-10 54.4 35.8 610 1-705 1-714 (714) 136 protein:vir:2764 Length: 714 # 98.4 6.9E-07 4.3E-10 54.4 35.8 610 1-705 1-714 (714) 137 protein:vir:817 Length: 714 # 98.4 6.9E-07 4.3E-10 54.4 35.8 610 1-705 1-714 (714) 138 protein:vir:96403 Length: 666 98.3 1.5E-07 9E-11 58.1 15.4 579 1-655 1-666 (666) 139 protein:vir:95821 Length: 763 98.2 2.7E-06 1.7E-09 51.1 28.9 602 1-708 79-754 (763) 140 protein:vir:8846 Length: 705 # 97.9 1.3E-05 8.1E-09 47.3 25.0 618 11-708 1-696 (705) 141 protein:vir:104437 Length: 714 97.6 3.6E-05 2.2E-08 45.0 37.4 618 4-705 1-714 (714) 142 protein:vir:94956 Length: 452 96.5 0.00052 3.2E-07 38.6 31.9 442 1-602 1-452 (452) 143 protein:vir:95014 Length: 491 96.1 0.001 6.5E-07 36.9 33.6 473 1-610 11-491 (491) 144 protein:vir:78393 Length: 489 96.0 0.0011 7.1E-07 36.7 30.0 471 1-606 11-489 (489) 145 protein:vir:95149 Length: 501 95.8 0.0015 9.3E-07 36.1 31.3 479 1-608 1-501 (501) 146 protein:vir:80165 Length: 651 94.5 0.0042 2.6E-06 33.6 27.7 550 1-664 21-651 (651) 147 protein:vir:80453 Length: 535 91.0 0.018 1.1E-05 30.2 31.9 481 1-639 32-535 (535) 148 protein:vir:96783 Length: 488 87.9 0.036 2.2E-05 28.5 31.3 451 1-591 14-488 (488) 149 protein:vir:97265 Length: 513 86.7 0.044 2.7E-05 28.0 27.6 471 1-632 1-513 (513) 150 protein:vir:345 Length: 663 # 31.0 1.5 0.00095 19.6 24.6 559 1-698 16-663 (663) No 1 >protein:vir:172 Length: 708 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112077;genbank:gi:13559867;genbank:GeneID:920970 Probab=100.00 E-value=1.4e-199 Score=1110.81 Aligned_cols=708 Identities=99% Similarity=1.413 Sum_probs=690.3 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEY 80 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~g~~ 80 (708) |||+++++|++++.+|.+++++++++|.+|++|++|+||+|+||+++++++|++++|++||||+|||+|+|+|++|+|++ T Consensus 1 ma~~~~~~~~~~~~r~~~~~~~~~~~r~~~~~d~~f~~y~G~Qw~~~~~~~l~~~~q~~~rP~~~~N~i~~~i~~v~g~e 80 (708) T protein:vir:17 1 MAETLEKKHERIMLRFDRAYSPQQEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEY 80 (708) T ss_pred CchhHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhccCCCCCCHHHHHHHHhhhhhcCCCceEEcchHHHHHHHHhhH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCcceeeEEe Q lcl|Aclame:pro 81 RNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAIEPI 160 (708) Q Consensus 81 ~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~i~~v 160 (708) ++||++++|+||++++|.++|++||++++++++.|++++++|+||+++++||+|||+++++|.++.|+.+++.+|+|+++ T Consensus 81 ~~nr~d~~v~p~~~~~d~~~Ae~l~~l~~~~~~~~~~~~~~s~Af~~~i~~G~G~~~~~~d~~~e~d~~~~~~~i~i~~~ 160 (708) T protein:vir:17 81 RNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAIEPI 160 (708) T ss_pred hhCCcceEEecCCCcchHHHHHHHHHHHHHHHHhcCchhHHhHHHHHhhhcccceeeeeecccccCCCCCCccccceEee Confidence 99999999999987789999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeeeecceEEEE Q lcl|Aclame:pro 161 YDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVRKESVDV 240 (708) Q Consensus 161 ~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~~~~~~~ 240 (708) ++|+++|||||+|+++|+|||+|||+++|||+++++++||+++....+.....++.++|.+.++++|+|||++..+...+ T Consensus 161 ~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~~~~~~~~d~vrv~e~~~r~~~~~~~ 240 (708) T protein:vir:17 161 YDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPASLDVTSMTSWEYDWFDADVIYIAKYYEVRKESVDV 240 (708) T ss_pred ccchhheecCccccccChhhhhhhhhhccCCHHHHHHhCccccchhhhhhhhccccccccCCCeEEEEEEEEEeeeeeEE Confidence 99999999999999999999999999999999999999999988888877888888999999999999999999999999 Q ss_pred EEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEeeeccCC Q lcl|Aclame:pro 241 ISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRWFIDD 320 (708) Q Consensus 241 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~~~~~d~ 320 (708) +++.++.+|+++++.+...+.+...+...|+..+..+.+++++|+|++|+|+.+|++++|+||++||||||||++.+++| T Consensus 241 ~~~~~~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~l~~~~~~p~~~fP~vP~~g~r~~~d~ 320 (708) T protein:vir:17 241 ISYRHPITGEIATYDSDQVEDIEDELAIAGFQEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRWFIDD 320 (708) T ss_pred EEEecCccCceeeeCccchhhHHHHHHhcccccceeeeeeEEEEEEEeecccccccCCCCCCCCccceEEEecccccccC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeecccccccccccccccc Q lcl|Aclame:pro 321 IERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKSGNIIAGATP 400 (708) Q Consensus 321 ~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 400 (708) ++++||+||+|||+||++|+++|+++|+++++++.+++++.++++|++.+|++.+.++.+++.+++..+..|.+.+++.+ T Consensus 321 ~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~a~~g~~~~~~~~~~~~~~~~~~~~~~~~~g~v~~~a~~ 400 (708) T protein:vir:17 321 IERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKYGNIIAGATP 400 (708) T ss_pred CCcccchhhhchhHHHHHHHHHHHHHHHHHhcCCcceeechhhhhhhHHhhhhcccchhhhhhhhccCCcccccccccCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 401 AGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAGEVWLSMAR 480 (708) Q Consensus 401 ~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~~~~~l~li~ 480 (708) +.+++++++|+++++|++.+..+|+++||++++++|+.+|+||+||++++++|++.+++++|||+.+++++|+++|+||+ T Consensus 401 ~~~~~~~~~~~~~~~llq~~~~~i~~~tGi~d~~~G~~sn~SG~Ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~~lL~lI~ 480 (708) T protein:vir:17 401 AGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAGEVWLSMAR 480 (708) T ss_pred cccCCCccccHHHHHHHHHHHHHHHHhcCCChHHccCccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHHHhccccCch Q lcl|Aclame:pro 481 EVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLPTDPM 560 (708) Q Consensus 481 ~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq~~~~~~p~ 560 (708) +|||++|+|||+|++|+.+++.||.+++|+.+|.++++|||++|+|||+|+++|+++++|+++++.|+++++++++..|+ T Consensus 481 ~~y~~~R~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~~Dv~v~~~p~~~t~r~~~~~~l~qll~~~~~~~~~ 560 (708) T protein:vir:17 481 EVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLPADPM 560 (708) T ss_pred HHcCCCcEEEEecCCCCcceeeecceeccCCCccceeeccceeeeeeEEEecccCchhHHHHHHHHHHHHHHhcCCccch Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999998888 Q ss_pred hHHHHHHHHhhccchhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 561 RPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQKATNE 640 (708) Q Consensus 561 ~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qae~~k~~~~ 640 (708) .+.+++++++++|+|+++++++++++..+.....+|..++++++.+++++.+++++++++.+++++.+++||++++++++ T Consensus 561 ~~~~~~l~l~~~D~p~~~ei~e~ir~~~~~~~~~~~~~~e~~q~~~q~qq~~q~q~~~~~~eaqa~~~~~qAe~~ka~ae 640 (708) T protein:vir:17 561 RPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQKATNE 640 (708) T ss_pred hHHHHHHHHHhcCCCChHHHHHHHHHHhhccccccCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 88899999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhcCCCCCCCCCCC Q lcl|Aclame:pro 641 TAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVMEAIRLLKDVAESQQQQFQSPPQSPADLMPS 708 (708) Q Consensus 641 ~~~~q~e~~~~~~~~~~~~a~~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~ 708 (708) +++.++++.+++++++++++++.+..++++.+++.++.+..+++.+.+..+++..+++|++|+++||| T Consensus 641 a~~~q~~a~q~~~~~~~a~~~a~q~~~q~~~~~~~~~~~~~~~l~~~q~~q~q~~~a~p~~~~~~~~~ 708 (708) T protein:vir:17 641 TAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVMEAIRLLKDVAESQQQQFQSPPQSPADLMPS 708 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHhccccCchhccCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999 No 2 >protein:vir:105429 Length: 708 # NCBI annotation: gene 3 protein # Family: family:all:487 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958179;genbank:gi:41057281;genbank:GeneID:2716676 Probab=100.00 E-value=6.9e-199 Score=1106.96 Aligned_cols=708 Identities=100% Similarity=1.424 Sum_probs=689.9 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEY 80 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~g~~ 80 (708) |||+++++|++++.+|.++.+++++||+++++|++|+||+|+||+++++++|++++|++||||+|||+|+|+|++|+|++ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~D~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~i~~~v~~v~g~~ 80 (708) T protein:vir:10 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEY 80 (708) T ss_pred CchhHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCCCHHHHHHHHHhhhhcCCCceEEcchHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCcceeeEEe Q lcl|Aclame:pro 81 RNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAIEPI 160 (708) Q Consensus 81 ~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~i~~v 160 (708) ++||++++|+|+++++|.++|++||++++++++.|++++++|+||+++++||+|||+++++|+++.+|.+++.+|+++++ T Consensus 81 ~~nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~d~i~~G~Gw~~~~~d~~~e~d~~~~~~~i~i~~~ 160 (708) T protein:vir:10 81 RNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAIEPI 160 (708) T ss_pred HhCCcceEEEcCCCCchHHHHHHHHHHHHHHHHhcCchHHHHHHHHhhhhcccceeeeeeccccccCCCCCccccceEEe Confidence 99999999999987789999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeeeecceEEEE Q lcl|Aclame:pro 161 YDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVRKESVDV 240 (708) Q Consensus 161 ~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~~~~~~~ 240 (708) ++|+++|||||.|+++|+|||+|||+++|||+++++++||+++....+.....++.++|.+.++++|+|||+++++.+.+ T Consensus 161 ~~p~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~p~~a~~~~d~~~~~~~~~~~~~~d~v~v~ey~~r~~~~~~~ 240 (708) T protein:vir:10 161 YDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVRKESVDV 240 (708) T ss_pred ecchhhcccCccccccChhhhhhhhhccCCCHHHHHHhCCCCcccccccccCCCccccccCCCceEEEEeeeEEEEEEEE Confidence 99999999999999999999999999999999999999999999888888888888999999999999999999999999 Q ss_pred EEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEeeeccCC Q lcl|Aclame:pro 241 ISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRWFIDD 320 (708) Q Consensus 241 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~~~~~d~ 320 (708) +++.++.||+++.++++..+.+...+...|+..+..+.+++++|+|++++|+.+|++++|+||++||||||||++.+++| T Consensus 241 ~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~le~~~~~p~~~fP~vP~~g~r~~~d~ 320 (708) T protein:vir:10 241 ISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRWFIDD 320 (708) T ss_pred EEEecCCCCceeeecchhhhhHHHHHHhcccchhheeeeeeEEEEEEeecchhhhccCCCCCCCceeeEEEeeeeeccCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeecccccccccccccccc Q lcl|Aclame:pro 321 IERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKSGNIIAGATP 400 (708) Q Consensus 321 ~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 400 (708) ++++||+||+|||+||++|+++|+++|+++++++.+++++.+++.+++.+|++.+.++.+++.+++..++.|.+.+++.+ T Consensus 321 ~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~~~~~~~ 400 (708) T protein:vir:10 321 IERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKSGNIIAGATP 400 (708) T ss_pred CcccceeecccchhHHHHHHHHHHHHHHHHhcCCcccccChhhhhhHHHHHhhccccchhhhccccccccccccccccCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 401 AGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAGEVWLSMAR 480 (708) Q Consensus 401 ~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~~~~~l~li~ 480 (708) +++++++++|+++++|++.+..+|+++||+|++++|+.+|+||+||++++++|++.+++++|||+.+++++|+++|+||+ T Consensus 401 ~~~~q~~~~~~~~~~l~q~~~~~i~~vsG~~~~~lG~~sn~SG~aI~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~ 480 (708) T protein:vir:10 401 AGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAGEVWLSMAR 480 (708) T ss_pred ccccCCccchHHHHHHHHHHHHHHHHHhCcChhHccCccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHHHhccccCch Q lcl|Aclame:pro 481 EVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLPTDPM 560 (708) Q Consensus 481 ~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq~~~~~~p~ 560 (708) +|||++|+|||+|++|+.++|.||.+++|+++|..+++|||++|+|||+|+++|+++|+|+++++.|++|+++++|..|+ T Consensus 481 ~~y~~er~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~yDv~i~~~p~~~s~r~~~~~~l~qll~~~~p~~~~ 560 (708) T protein:vir:10 481 EVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLPTDPM 560 (708) T ss_pred HHcCCCcEEEEecCCCCcceEEecceeccCCCcceeeeeccceeeEEEEEecccCchhHHHHHHHHHHHHHHhcCCCchh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999998888 Q ss_pred hHHHHHHHHhhccchhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 561 RPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQKATNE 640 (708) Q Consensus 561 ~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qae~~k~~~~ 640 (708) .+.+++++++++|+|+++++++++++.++.....++..++++++.++++++++.++++++.+++++.+++||++++++++ T Consensus 561 ~~~~~~~~l~~~D~p~~~ei~erir~~~~~~~~~~~~~~ee~q~~~~~q~~~q~q~~~~~~e~qa~~~~~qAe~~ka~a~ 640 (708) T protein:vir:10 561 RPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQKATNE 640 (708) T ss_pred hHHHHHHHHHhcCCcChHHHHHHHHHhhcccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 88899999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhcCCCCCCCCCCC Q lcl|Aclame:pro 641 TAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVMEAIRLLKDVAESQQQQFQSPPQSPADLMPS 708 (708) Q Consensus 641 ~~~~q~e~~~~~~~~~~~~a~~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~ 708 (708) +++.++++.++++++.+.++++.+..+++..+++..+.++++++++.+..+++..+++|++|+++||| T Consensus 641 a~~~~~~a~q~~~~~~~a~~~a~q~~~~a~~~~~~~~~~~~q~l~~~q~~q~~~~~~~p~~~~~~~p~ 708 (708) T protein:vir:10 641 TAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVMEAIRLLKDVAESQQQQFQSPPQSPADLMPS 708 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHhccccCchhccCC Confidence 99999999999999999999999999999999888889999999999999999999999999999999 No 3 >protein:vir:105520 Length: 706 # NCBI annotation: phage portal protein # Family: family:all:487 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516189;genbank:gi:89885992;genbank:GeneID:3964380 Probab=100.00 E-value=1.4e-189 Score=1056.01 Aligned_cols=706 Identities=81% Similarity=1.255 Sum_probs=673.3 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEY 80 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~g~~ 80 (708) |||+.+++|++++.+|+++.++++++|+++++|++|+|++|+||+++++++|+++++.+||||+|||+|+|+|++|+|++ T Consensus 1 m~e~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~i~~~v~~v~g~~ 80 (706) T protein:vir:10 1 MAESRQKQHERVMLRFDRAWSPQQVVREKCIEATRFVRVPGGQWEGATVAGTKLDEQFEKYPKFEINKVATELNRIISEY 80 (706) T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccCCHHHHHHHHhhhhhcCCCceEecchHHHHHHHhhHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCcceeeEEe Q lcl|Aclame:pro 81 RNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAIEPI 160 (708) Q Consensus 81 ~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~i~~v 160 (708) ++||++++|+|+++++|.++|++||++++++++.|++++++++||+++++||+||++++++|+++++|++++++|.|+.+ T Consensus 81 ~~nr~~~~v~P~~~~~d~~~Ae~l~~l~~~~~~~~~~~~a~s~Af~d~i~~G~G~~ev~~d~~~~~d~~~~~~~i~i~~v 160 (706) T protein:vir:10 81 RNNRISVKFRPGDNAASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTTSFVNEYDPMDERQRIAVEPI 160 (706) T ss_pred HhCCCceEEecCCCCchHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHhhcCcceEEeeeccccccCCCCCCccceeeee Confidence 99999999999866789999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeeeecceEEEE Q lcl|Aclame:pro 161 YDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVRKESVDV 240 (708) Q Consensus 161 ~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~~~~~~~ 240 (708) ++|+++|||||+|+++|+|||+|+|+++|||+++++++||+++.+. +.....++..+|...+++++++||.++.+.+.+ T Consensus 161 ~~p~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~fp~~~~~~-~~~~~~~~~~d~~~~d~~~~~eyy~~~~~~~~~ 239 (706) T protein:vir:10 161 YDPARSVWFDPDAKKYDKSDALWAFCMYSVSLEKYQSEYDKAPTSL-DRVGSVSWQYDWFTPDVVYIAKYYEVRKESVDV 239 (706) T ss_pred ccchhceecCchhcccChhhcceEeeeecCCHHHHHHhcCCChhhh-hhhccccccccccCCCcceecccccccceeEEE Confidence 9999999999999999999999999999999999999999987643 333455667789999999999999999999999 Q ss_pred EEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEeeeccCC Q lcl|Aclame:pro 241 ISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRWFIDD 320 (708) Q Consensus 241 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~~~~~d~ 320 (708) ++|.++.+++...++.+.+....+.+...|+..+..+.+++++|+|++++|+.+|++++||||++||||||||++.++|+ T Consensus 240 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~v~~~~~~g~~~l~~~~p~~~~~~P~vP~~g~r~~~d~ 319 (706) T protein:vir:10 240 ISYRQPLTQEIATYDSEQIADIQDELEQAGFEEIGRRSVKRRRIYVAVVDGDGFLEKPRRIPGEHIPLIPVYGKRWFIDD 319 (706) T ss_pred EEeeccccCCceeeccchhhhhHHHHhhCCchhhhhcccceeeEEEEeeccccccccCCCCCCCccceEEEeeccccccc Confidence 99999999999999999999888888889998999999999999999999999999999999999999999999999999 Q ss_pred cccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeecccccccccccccccc Q lcl|Aclame:pro 321 IERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKSGNIIAGATP 400 (708) Q Consensus 321 ~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 400 (708) ++.+||+||+|||+||++|+++|+++|+++++++...++..+.+++++..|...+.....++.+++.+...|.+++...+ T Consensus 320 ~~~~~G~vr~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~l~~~~~~~~~g~i~~~~~~ 399 (706) T protein:vir:10 320 VERVEGHIAKAMDPQRLYNLQVSMLADAAAQDPGQTPIVDMEQIRGLEQHWEGRNRKRPAFLPLRTVTDKTGNVVAPANV 399 (706) T ss_pred cCcccceeccchhhHHHHHHHHHHHHHHHHhcCCcccccchhHHHHHHHHhhhcccccccchhcccccCCCCcccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999888889 Q ss_pred cccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 401 AGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAGEVWLSMAR 480 (708) Q Consensus 401 ~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~~~~~l~li~ 480 (708) +.+++++++|+++++|++.+..+|+++||+|++++|+.+|+||+||++++++|++.+++++|||+++++++|+++|+||+ T Consensus 400 ~~~~~~~~~~~~~~~l~~~~~~~i~~vsGi~~~~lG~~sn~SG~Ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~~lL~li~ 479 (706) T protein:vir:10 400 AGYTQAPVLNQALAALLQQTSADIQEVTGSSQAMQQMPSNVARETVNSLLNRSDMASFIYLDNMAKSLKRAGEIWLSMAR 479 (706) T ss_pred cccCCCcchHHHHHHHHHHHHHHHHHHhCCCHHHcCCccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHHHhccccCch Q lcl|Aclame:pro 481 EVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLPTDPM 560 (708) Q Consensus 481 ~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq~~~~~~p~ 560 (708) +|||++|+|||+|++|+.+++.||..++|+.+|..+++|||++|+|||+|+++|+++|+|+++++.|++|+++++|..|+ T Consensus 480 ~~y~~~R~~RI~~ed~~~~~v~in~~~~d~~~G~~~~~nDi~~g~yDv~i~~~p~~~t~r~~~~~~m~el~~~~~p~~~~ 559 (706) T protein:vir:10 480 EIYGSDREVRIVHEDGTDDIALMNAAVLDNQTGRVVALNDLSTGRYDVSVDVGPSYSARRDATVNALTQLLQGMLPQDPM 559 (706) T ss_pred HHcCCCcEEEEecCCCCccceeeccceeccccCceeeeecceeeeEEEEEecccCcchHHHHHHHHHHHHHHhcCCcchh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999888 Q ss_pred hHHHHHHHHhhccchhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 561 RPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQKATNE 640 (708) Q Consensus 561 ~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qae~~k~~~~ 640 (708) .+.+++++++++|+|+++++++++++..+.+...+|.+++++++.++++|+++.++++++.+++++.+++||++++++++ T Consensus 560 ~~~l~~~~~~~~d~p~~~e~~e~irk~~~~q~~~~~~~~~eq~~~~q~qq~q~~q~~~~~~~~~aq~~~~qA~~~k~~a~ 639 (706) T protein:vir:10 560 RPALMGIIIDNMEGEGLDDFKAFNRRQLLTQGIVKPRNQQEQAIVQQAQQAQATQPDPNMLLAQAQMVVAQAEAQKSQNE 639 (706) T ss_pred hHHHHHHHHhhcCccchHHHHHHHHHhhcccCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 88899999999999999999999999999999999999999998888889999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhcCCCCCCCCCCC Q lcl|Aclame:pro 641 TAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVMEAIRLLKDVAESQQQQFQSPPQSPADLMPS 708 (708) Q Consensus 641 ~~~~q~e~~~~~~~~~~~~a~~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~ 708 (708) +++.++++.++++++..+++.+...+.++..+++.+..++.+++.+.+..+.+..++.| .|+++=|| T Consensus 640 ~~q~~~~a~~a~~qa~~~~~~~~~~~~~a~~~~~~~~~q~~q~l~~~~a~q~~~~~~~~-~~~~~~~~ 706 (706) T protein:vir:10 640 TVQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVMETLRLLKEVAASQQQTIPSPP-SPADIVPS 706 (706) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCCC-CCcccCCC Confidence 99999999999999999999999999999999999999999988888888777767776 55666666 No 4 >protein:vir:3520 Length: 720 # NCBI annotation: P19 # Family: family:all:487 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050980;genbank:gi:9633566;genbank:GeneID:1262313 Probab=100.00 E-value=4.3e-183 Score=1020.39 Aligned_cols=706 Identities=71% Similarity=1.104 Sum_probs=648.5 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEY 80 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~g~~ 80 (708) |||+++++|++++.+|+++.+++.+||+++++|++|+|++|+||++++++.++.-.+..|+||++||+|+|+|++|+|++ T Consensus 1 ma~~~~~~l~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~~~~~l~~~~~P~~~~N~i~~~v~~v~g~~ 80 (720) T protein:vir:35 1 MAETLQKRHEQIMRKFDRAHSPQEAVREKCLEATRFARVPGGQWEGATAAGSELGKHFEKYPKFEINKISTELNRIISEY 80 (720) T ss_pred CchHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhhccCCCCCCHHHHHHHHHHHhhCCCCeEEEccHHHHHHHHHhHH Confidence 99999999999999999999999999999999999999999999999998655444456899999999999999999999 Q ss_pred hcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCcceeeEEe Q lcl|Aclame:pro 81 RNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAIEPI 160 (708) Q Consensus 81 ~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~i~~v 160 (708) ++||++++|+||++++|+++|++||++++++++.|++++++|+||+++++||+||++|+++|+++.+|+.+.++|++.+| T Consensus 81 ~~nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~G~~~v~~d~~~~~d~~~~~~~i~i~~v 160 (720) T protein:vir:35 81 RHNRITVKFRPGDKTASEALANKLNGLFRADYEETDGGEACDNAFDDGSTGGFGCFRLTTNLVNALDPMDERQRICLEPI 160 (720) T ss_pred HhCCCceEEEcCCCcchHHHHHHHHHHHHHHHHhcCchHHHhHHHHHhhhccceeEEeeecccccCCCCcccceeeEecc Confidence 99999999999987789999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeeeecceEEEE Q lcl|Aclame:pro 161 YDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVRKESVDV 240 (708) Q Consensus 161 ~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~~~~~~~ 240 (708) ++|+++|||||+|+++|+|||+|+|+.+|||+++++++||+++.... .....++.++|.+.++++++|||+++++...+ T Consensus 161 ~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~~~-~~~~~~~~~d~~~~~~v~i~E~~~~~~~~~~~ 239 (720) T protein:vir:35 161 YDPARSVWFDPDAKKYDKSDAEWAFCMYSLSAEKYKAEYNKDPATLM-SGIERSWDYDWYDVDVVYIAKYYEVKKESVDV 239 (720) T ss_pred cCchhheeecccccccChhhhhhhhhhcCCCHHHHHHhCCCcccccc-ccccccccccccCCCceEEEEeeEEEEEEEEE Confidence 99999999999999999999999999999999999999999886533 33444556789999999999999999999999 Q ss_pred EEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEeeeccCC Q lcl|Aclame:pro 241 ISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRWFIDD 320 (708) Q Consensus 241 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~~~~~d~ 320 (708) ++|.++.+|.++.++++..+.+...+...|+..+..+.+++++|+|++++|+.+|++++|+||+|||||||||++++++| T Consensus 240 ~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~v~~~~~~g~~~l~~~~~~p~~~fP~vP~~g~r~~~d~ 319 (720) T protein:vir:35 240 VSFQNPLTSETVTYDSDQLELVEDELADIGFIEAARRTIKRRRVYVSVVDGEGFLEKAQRIPGEHIPLIPVYGKRWFIDD 319 (720) T ss_pred EEeecCCCCCeeecCCccHHHHHHHHhhhccccccccceeEEEEEEEeeccchhcccCCCCCCCccceEEEEeeeeccCC Confidence 99999999999999999999999999999998899999999999999999999999999999999999999999999999 Q ss_pred cccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeecccccccccccccccc Q lcl|Aclame:pro 321 IERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKSGNIIAGATP 400 (708) Q Consensus 321 ~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 400 (708) ++++||+||.|||+||++|+++|+++|++++++.....+..+.+++.++.|...+..+.+++++++..++.|.+.+.+.+ T Consensus 320 ~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~a~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~G~~~~~~~~ 399 (720) T protein:vir:35 320 IERVEGHIAKAMDAQRLYNLQVSMLADSATQDTGSIPIVGKSQIKTLEKYWANRNKNRPAFLPLNEIVDKQGNIIAPPTP 399 (720) T ss_pred CcccceeeecchhHHHHHHHHHHHHHHHHHcCCccccccCcchHHHHHHHhhccccccccccccccccccCcccccCCCc Confidence 99999999999999999999999999999999887777777777888889988888999999999999999998887788 Q ss_pred cccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 401 AGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAGEVWLSMAR 480 (708) Q Consensus 401 ~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~~~~~l~li~ 480 (708) +.+.+++++|+++++|++.+..+|+++||+|++++|+.+|+||+||++++++|++.+++++|||+++++++|+++|+||+ T Consensus 400 ~~~~~~~~~~~~~~~llq~~~~~i~~vsGi~~~~lG~~sn~SG~Ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~~lL~lI~ 479 (720) T protein:vir:35 400 VGYTQPQPLNQAMAALLQQTGADIQEVTGSSQAMQPMPSNIAKETVNHLMHRSDMSSFIYLDNMAKSLKRAGEVWLSMAR 479 (720) T ss_pred ccccCCCCCchHHHHHHHHHHHHHHHHhCCChHHcCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHHHhccccCch Q lcl|Aclame:pro 481 EVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLPTDPM 560 (708) Q Consensus 481 ~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq~~~~~~p~ 560 (708) +|||++|+|||+|++|..+++.+|..+.|+.+|.++++|||++|+|||+|+++|+++|+|+++++.|+++++.++|..++ T Consensus 480 ~~y~~er~~RI~~ed~~~~~v~~n~~~~d~~~g~~v~~NDi~~g~yDv~v~~~p~~~s~req~~~~m~qll~~~~p~~~~ 559 (720) T protein:vir:35 480 EVYGSDRQVRIVNADGTDDIALMSVVINDNQTGQVVAMNDLSSGRYDVTVDVGPSYTARRDATVSVLTNLLAGMLPQDPM 559 (720) T ss_pred HHcCCCcEEEEecCCCCcceEeechhhhccCCCceeeeecceeeeeEEEEecccCcccHHHHHHHHHHHHHHhcCCCchh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999987777 Q ss_pred hHHHHHHHHhhccchhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 561 RPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQKATNE 640 (708) Q Consensus 561 ~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qae~~k~~~~ 640 (708) ...+++++++++++|+++++++++++..+++...++..+++++..+++++ ++++++.++.++++++.++|+++++++++ T Consensus 560 ~~~~~~~ile~~d~p~~~e~~erirk~~~~~~~~~~~~~e~qq~~a~~qq-~~qq~~~e~~~aqa~l~qaqae~~kaqa~ 638 (720) T protein:vir:35 560 RQVLQGIILDNMEGEGLDEFKEYNRKQLLTQGVVKPRNTEEEQMVAQMIQ-QAQQPNAELVAAQGVLMQGQAEVQKAKNE 638 (720) T ss_pred HHHHHHHHHHhcCchhHHHHHHHHHhhcchhcccCccChhHHHHHHHHHH-HHHhHhHHHHHHHHHHHHHHHHHHHHHHH Confidence 77888999999999999999999999988888888877766555544433 33456667778888899999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---hhhhhhhhhhcCCCCCCCCCCC Q lcl|Aclame:pro 641 TAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVMEAIRLLKD---VAESQQQQFQSPPQSPADLMPS 708 (708) Q Consensus 641 ~~~~q~e~~~~~~~~~~~~a~~~~~~~q~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~e~~~~ 708 (708) ..+.++++.+++++++++++++.+..+|+...++....+++++... .+...++....+++++.+.--+ T Consensus 639 ~~~~qa~a~~aqa~a~~~~a~~~~~~aq~~~~~q~~i~qalq~~~~~q~~q~~~eqa~~el~~~~~~~~~~ 709 (720) T protein:vir:35 639 ELAIQVKAFQAQTEARVAEAKMVQILASADSAKRAEIREALKMLHQFQKEQGDASRADAELILKATDTQHK 709 (720) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcchHHHHHHHHhhcccchhhh Confidence 9999999999999999999999888888887777777666665544 4444566667778888777776 No 5 >protein:vir:100920 Length: 725 # NCBI annotation: Gp1 # Family: family:all:487 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006406;genbank:gi:46358698;genbank:GeneID:2777070 Probab=100.00 E-value=3.5e-172 Score=960.58 Aligned_cols=692 Identities=30% Similarity=0.477 Sum_probs=606.2 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEY 80 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~g~~ 80 (708) |||. +.+|++++.+|+++++++.+||.++.+|. +||+|+||++++++.|+.+ ||| +||+|+|+|++|+|++ T Consensus 1 m~d~-~~~~~~~~~~~~~~~~~~~~~R~~a~~d~--~fy~G~QW~~~~~~~l~~q----~rp--~~N~i~~~v~~v~g~e 71 (725) T protein:vir:10 1 MADN-ENRLESILSRFDADWTASDEARREAKNDL--FFSRVSQWDDWLSQYTTLQ----YRG--QFDVVRPVVRKLVSEM 71 (725) T ss_pred CCch-HHHHHHHHHHHHHHHHhhHHHHHHHHHHH--HhhcCCCCCHHHHHHHHhc----CCC--cccchHHHHHHHHhhH Confidence 9987 77999999999999999999999999885 5689999999999998764 677 5899999999999999 Q ss_pred hcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCcceeeEEe Q lcl|Aclame:pro 81 RNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAIEPI 160 (708) Q Consensus 81 ~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~i~~v 160 (708) ++||++++|+||+ ++|.++|++||++++++++.|++++++|+||+++++||+||++|+++|.++ |+.+++..|++..+ T Consensus 72 ~~nr~d~~v~p~~-~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~G~~ev~~d~~~~-d~~~~~~~i~~~~i 149 (725) T protein:vir:10 72 RQNPIDVLYRPKD-GASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQ-SPTSNNQVIRREPI 149 (725) T ss_pred HhCCcceEEecCC-cchHHHHHHHHHHHHHHHHhcCcchHHhHHHHHHhhcCcceeeeeccccCC-CCCCCceeeeeeec Confidence 9999999999997 589999999999999999999999999999999999999999999999754 66777778888889 Q ss_pred ecchhheecCCccccCChhccCeEEEeecCCHH---HHHHhCCCCcccccccccccccccCCCCCceeEEeeeeeecceE Q lcl|Aclame:pro 161 YDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPE---KYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVRKES 237 (708) Q Consensus 161 ~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~---e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~~~~ 237 (708) |+|+.+|||||+|+++|+|||+|||+++|||++ +|++.||..+....+......+.++|.+.++++|+|||+++.+. T Consensus 150 ~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~vrv~E~~~r~~~~ 229 (725) T protein:vir:10 150 HSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWDDFAEKYDLDADNIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKK 229 (725) T ss_pred ccCHhHcccCchhhccChhhhhhhhhhccCCHHHHHHHHHhCCCcccccccccccccccccccCCCeEEEEEEEEEEEEe Confidence 999999999999999999999999999999974 67778998887766666777778899999999999999999999 Q ss_pred EEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEeeec Q lcl|Aclame:pro 238 VDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRWF 317 (708) Q Consensus 238 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~~~~ 317 (708) +.++.+.++.+|+++.|+..++..+...+...|...+..+.++++||+|++|+|+.+|++++|+||++||||||||++.+ T Consensus 230 ~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~~~~~~~~~~fP~vP~~g~r~~ 309 (725) T protein:vir:10 230 ETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGF 309 (725) T ss_pred eEEEEeccCCCCceeecchhhhHHHHHHhhcccchhhhhccceeeEEEEEeecchhhhcCCCCCCCCceeEEEEEeeeec Confidence 99999999999999999999998888889999999999999999999999999999999999999999999999999999 Q ss_pred cCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeeccccccccccccc Q lcl|Aclame:pro 318 IDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKSGNIIAG 397 (708) Q Consensus 318 ~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 397 (708) ++|++++||+||+|||+||++|+++|+++|+++++++++++++.+++++.++.|...+ ...++.+++...++|.+. T Consensus 310 ~~g~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~e~~~~~~~--~~~~~~~~~~~~~~g~~~-- 385 (725) T protein:vir:10 310 VEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGND--DYPYYLLNRTDENNGEMP-- 385 (725) T ss_pred cCCcceeeeeeccchhHHHHHHHHHHHHHHHHHhcCCccccccHhhhhHHHHHHhccC--CceeeecccccccCcccc-- Confidence 9999999999999999999999999999999999999999999999999888887544 345667777776666653 Q ss_pred ccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 398 ATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSN-IAQETVNNLMNRADMASFIYLDNMAKSLKRAGEVWL 476 (708) Q Consensus 398 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n-~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~~~~~l 476 (708) ..++.+++++++|+++++|++.+..+|+++||++++++|..+| +||+||++++++|++.+++++|||+.+++++|+++| T Consensus 386 ~~~i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL 465 (725) T protein:vir:10 386 TQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQ 465 (725) T ss_pred cccCcccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhCcCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3467888899999999999999999999999999999999876 699999999999999999999999999999999999 Q ss_pred HHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHHHhccc Q lcl|Aclame:pro 477 SMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLP 556 (708) Q Consensus 477 ~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq~~~~ 556 (708) +||++|||++|+|||+|++|+.++|.||..+.++.+|+.+++||+ +|+|||+|+++|+++|+|++++..|++|++++++ T Consensus 466 ~lI~~~~~~er~~RI~~edg~~~~v~in~~~~d~~~G~~v~~Ndi-~g~~Dv~v~~~p~~~s~r~~~~~~l~qll~~~~~ 544 (725) T protein:vir:10 466 SIVNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDI-RGRYECYTDVGPSFQSMKQQNRSEILELLGKTPQ 544 (725) T ss_pred HHHHHHcCCCcEEEEecCCCCcceeEeccccccccccchhhhhcc-ccceeEEEeeccCcHHHHHHHHHHHHHHHHhccc Confidence 999999999999999999999999999999999999999999999 5899999999999999999999999999999999 Q ss_pred cCchhHHHHHHHHhhccchhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 557 TDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQK 636 (708) Q Consensus 557 ~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qae~~k 636 (708) ..|....+++.+++++|+|+++++.+++++..++....+|..++++++.++++++++.++..+..+++++.+++++++++ T Consensus 545 ~~~~~~~~l~~~~~~~d~~~~~e~~erirkq~~~~~~~~~~~~e~~q~~~e~qq~~~~q~~~e~~q~~~~~~~~qae~~k 624 (725) T protein:vir:10 545 GTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQGVLLQGQAELAK 624 (725) T ss_pred cchhHHHHHHHHhhcCCchhHHHHHHHHHhhhhhhccCCccccchhHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHH Confidence 88876777778889999999999999999999999999999888888888888888888888888888888888999888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---------------H---HHHHHH--HHHhhhh-hhhhhh Q lcl|Aclame:pro 637 ATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDK---------------A---VMEAIR--LLKDVAE-SQQQQF 695 (708) Q Consensus 637 ~~~~~~~~q~e~~~~~~~~~~~~a~~~~~~~q~~~~~~~---------------~---~~~~~~--~~~~~~~-~~~~~~ 695 (708) ++++..++++++.+.++++++++++..+...++...+.. . .+...+ +....++ .++... T Consensus 625 a~aE~~k~~~~a~~~~~~a~~~a~~~~~~~~q~~~~q~~~~~~~~~~~~~~q~~~~~~~~~~ae~~~~~~~~~~~~~~~~ 704 (725) T protein:vir:10 625 AQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDARANAELLLKGNEQTHKQRMDI 704 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHhhh Confidence 888888888888888888777766655444433221110 0 111111 1111111 112222 Q ss_pred hcCCCCCCCCCCC Q lcl|Aclame:pro 696 QSPPQSPADLMPS 708 (708) Q Consensus 696 ~~~~~~~~e~~~~ 708 (708) +...+++..-.|| T Consensus 705 ~~~~~~q~~~~~~ 717 (725) T protein:vir:10 705 ANILQSQRQNQPS 717 (725) T ss_pred hhccccccccCCC Confidence 2233333344444 No 6 >protein:vir:77597 Length: 725 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:YP_063735;genbank:gi:51236726;genbank:GeneID:2944241 Probab=100.00 E-value=1.4e-170 Score=951.79 Aligned_cols=692 Identities=30% Similarity=0.474 Sum_probs=605.3 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEY 80 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~g~~ 80 (708) |||. +.+|++++.+|+++++++.+||.++.+|. +||+|+||++++++.|+.+ ||| +||+|+|+|++|+|++ T Consensus 1 m~d~-~~~~~~~~~~~~~~~~~~~~~r~~a~~d~--~fy~G~Qw~~~~~~~l~~q----~rp--~~N~i~~~i~~v~g~~ 71 (725) T protein:vir:77 1 MADN-ENRLESILSRFDADWTASDEARREAKNDL--FFSRVSQWDDWLSQYTTLQ----YRG--QFDVVRPVVRKLVSEM 71 (725) T ss_pred CCch-HHHHHHHHHHHHHHHHhhHHHHHHHHHHH--HhhCCCCCCHHHHHHHHhc----CCC--ccccHHHHHHHHHhhH Confidence 9986 77899999999999999999999999885 5789999999999999764 677 5799999999999999 Q ss_pred hcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCcceeeEEe Q lcl|Aclame:pro 81 RNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAIEPI 160 (708) Q Consensus 81 ~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~i~~v 160 (708) ++||++++|+||+ ++|.++|++||++++|+++.|++++++|+||+++++||+||++|+++|..+ ++++++..|++.++ T Consensus 72 ~~nr~d~~v~P~~-~~d~~~Ae~l~~~~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~-d~~~~~~~i~~~~~ 149 (725) T protein:vir:77 72 RQNPIDVLYRPKD-GARPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQ-SPTSNNQVIRREPI 149 (725) T ss_pred HhCCcceEEecCC-ccHHHHHHHHHHHHHHHHHhhCchhHHHHHHHHHhhcCcceeeeeecccCC-CCCCCceeeEEeec Confidence 9999999999997 589999999999999999999999999999999999999999999999754 56677777788888 Q ss_pred ecchhheecCCccccCChhccCeEEEeecCCHHHHHHh---CCCCcccccccccccccccCCCCCceeEEeeeeeecceE Q lcl|Aclame:pro 161 YDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAE---YGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVRKES 237 (708) Q Consensus 161 ~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~---~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~~~~ 237 (708) ++|+.+|||||+|+++|+|||+|||+.+|||+++++.+ ||.+..+..+.....++.++|++.++++|+|||++++++ T Consensus 150 ~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~vrv~E~~~r~~~~ 229 (725) T protein:vir:77 150 HSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKK 229 (725) T ss_pred ccChhhceeCchhhccChhhHHHHHHHhcCCHHHHHHHHhhCCcchhhcccccccccccccccCCCeeEEEEEEEEEEEe Confidence 99999999999999999999999999999999977655 555555555556666778899999999999999999999 Q ss_pred EEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEeeec Q lcl|Aclame:pro 238 VDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRWF 317 (708) Q Consensus 238 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~~~~ 317 (708) +.++.+.++.||++..|+..++..+...+...|...+..+.++++||+|++++|+.+|++++||||++||||||||++.+ T Consensus 230 ~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~rv~~~~~~g~~~l~~~~~~~~~~~P~vP~~g~r~~ 309 (725) T protein:vir:77 230 ETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGF 309 (725) T ss_pred eEEEEecCCCCcceeecChhhHHHHHHHhhhcCchhhhhcccceeeeeEeeecCceeeccCCcCCCCccceEEEeeeeec Confidence 99999999999999999999888888888899999999999999999999999999999999999999999999999999 Q ss_pred cCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeeccccccccccccc Q lcl|Aclame:pro 318 IDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKSGNIIAG 397 (708) Q Consensus 318 ~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 397 (708) ++|++++||+||+|||+||++|+++|+++|+++++++.+++++++++++.+++|...+.. +++.++....++|... T Consensus 310 ~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~--~~~~~~~~~~~~g~~~-- 385 (725) T protein:vir:77 310 VEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDY--PYYLLNRTDENSGDLP-- 385 (725) T ss_pred cCCcccccchhhhhhhHHHHHHHHHHHHHHHHHhccccccccchhhhhHHHHHHHhccCC--ceecccccccCCCccc-- Confidence 999999999999999999999999999999999999999999999999999999876643 3556666666666642 Q ss_pred ccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 398 ATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSN-IAQETVNNLMNRADMASFIYLDNMAKSLKRAGEVWL 476 (708) Q Consensus 398 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n-~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~~~~~l 476 (708) ..++.+++++++|+++++|++.+..+|+++||++++++|..+| +||+||++++++|++.+++++|||+++++++|+++| T Consensus 386 ~~~i~~~~~~~lp~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~~lL 465 (725) T protein:vir:77 386 TQPLAYYENPEVPQANAYMLEAATSAVKEVATLGVDTEAVNGGQVAFDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQ 465 (725) T ss_pred ccCccccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3466788899999999999999999999999999999999887 699999999999999999999999999999999999 Q ss_pred HHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHHHhccc Q lcl|Aclame:pro 477 SMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLP 556 (708) Q Consensus 477 ~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq~~~~ 556 (708) +||++|||++|+|||+|++|+.+++.||..+.++.+|..+++|||+ |+|||+|+++|+++|+|+++++.|++|++++++ T Consensus 466 ~lI~~~~~~~rv~RI~~ed~~~~~v~in~~~~~~~~G~~~~~NDi~-g~~Dv~v~~~p~~~s~r~~~~~~l~qll~~~~~ 544 (725) T protein:vir:77 466 SIVNDIYDVPRNVTITLEDGSEKDVQLMAEVVDLATGEKQVLNDIR-GRYECYTDVGPSFQSMKQQNRAEILELLGKTPQ 544 (725) T ss_pred HHHHHHcCCCcEEEEecCCCCcceeeecccccccccchhHhhhhhc-cceeeEEeeccchHHHHHHHHHHHHHHHHhccc Confidence 9999999999999999999999999999999999999999999996 899999999999999999999999999999998 Q ss_pred cCchhHHHHHHHHhhccchhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 557 TDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQK 636 (708) Q Consensus 557 ~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qae~~k 636 (708) ..|....+++.+++++++|+++++.+++++..++....++.+++++++.+++++.++.++++++.++|+..+++|+++++ T Consensus 545 ~~~~~~~~l~~~~~l~d~~~~~e~~erirkq~~~~~~~q~~~~~e~q~~~~~qq~~~~q~~~e~~q~q~~~~~~qa~~~k 624 (725) T protein:vir:77 545 GTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQGVLLQGQAELAK 624 (725) T ss_pred cchhHHHHHHHhhccccchHHHHHHHHHHhhhhhhhccCCCChhhHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHH Confidence 88877777778889999999999999999999998888998888888888888888888888888888888888999889 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------------------HHHHHHHHhh--h--hhh--- Q lcl|Aclame:pro 637 ATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAV------------------MEAIRLLKDV--A--ESQ--- 691 (708) Q Consensus 637 ~~~~~~~~q~e~~~~~~~~~~~~a~~~~~~~q~~~~~~~~~------------------~~~~~~~~~~--~--~~~--- 691 (708) +++++.+++.++.++++++++++++..++..++...++... .+.+++.... + +++ T Consensus 625 aq~e~~k~q~~a~~~~~~a~~~aa~~~~~~~q~~~~q~a~~~~~~~~~~~~q~~~~~~~~~~ae~~~~~~~~~~~q~~~~ 704 (725) T protein:vir:77 625 AQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDARANAELLLKGDEQTHKQRMDI 704 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHhhhHHHhhHHHH Confidence 99888888888888888777776665554444332221111 1111110000 0 000 Q ss_pred --hhhhhcCCCCCCCCCCC Q lcl|Aclame:pro 692 --QQQFQSPPQSPADLMPS 708 (708) Q Consensus 692 --~~~~~~~~~~~~e~~~~ 708 (708) .-..+...|+|+++.-| T Consensus 705 ~~~~~~~~~~~~~~~~~~~ 723 (725) T protein:vir:77 705 ANILQSQRQNQPSGSVAET 723 (725) T ss_pred HHHHHHHHhcCCCcCcccC Confidence 01123455666666666 No 7 >protein:vir:9263 Length: 725 # NCBI annotation: 1 # Family: family:all:487 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720327;genbank:gi:24371585;genbank:GeneID:955785 Probab=100.00 E-value=1.2e-170 Score=952.22 Aligned_cols=692 Identities=30% Similarity=0.476 Sum_probs=600.2 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEY 80 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~g~~ 80 (708) |||. +++|++++.+|+++++++.+||.++.+|. +||+|+||++++++.|+.+ ||| +||+|+|+|++|+|++ T Consensus 1 m~d~-~~~~~~~~~~~~~~~~~~~~~r~~a~~d~--~fy~G~Qw~~~~~~~l~~q----~rp--~~N~i~~~i~~v~g~e 71 (725) T protein:vir:92 1 MADN-ENRLESILSRFDADWTASDEARREAKNDL--FFSRISQWDDWLSQYTTLQ----YRG--QFDVVRPVVRKLVSEM 71 (725) T ss_pred CCch-HHHHHHHHHHHHHHHHhhHHHHHHHHHHH--HhhcCCCCCHHHHHHHHhc----CCC--cccchHHHHHHHHhhH Confidence 9986 66999999999999999999999999885 5789999999999999764 677 5799999999999999 Q ss_pred hcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCcceeeEEe Q lcl|Aclame:pro 81 RNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAIEPI 160 (708) Q Consensus 81 ~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~i~~v 160 (708) ++||++++|+||+ ++|+++|++||++++|+++.|++++++|+||+++++||+||++|+++|..+ |+++++..|++.++ T Consensus 72 ~~nr~d~~v~P~~-~~d~~~Ae~l~~~~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~-d~~~~~~~i~~~~i 149 (725) T protein:vir:92 72 RQNPIDVLYRPKD-GASPDAADVLMGMYRTDMRHNTAKIAVNVAVREQIESGVGAWRLVTDYEDQ-SPTSNNQVIRREPI 149 (725) T ss_pred HhCCcceEEecCC-ccHHHHHHHHHHHHHHHHHhhCchHHHHHHHHHHhhcCcceeeeeecccCC-CCCCCceeeEEeec Confidence 9999999999997 589999999999999999999999999999999999999999999999754 66677777788889 Q ss_pred ecchhheecCCccccCChhccCeEEEeecCCHHHHHH---hCCCCcccccccccccccccCCCCCceeEEeeeeeecceE Q lcl|Aclame:pro 161 YDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEA---EYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVRKES 237 (708) Q Consensus 161 ~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~---~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~~~~ 237 (708) |+|+.+|||||+|+++|+|||+|+|+++|||+++++. .||.+..+..+.....++.++|++.++++|+|||++.++. T Consensus 150 ~~~~~~V~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~vrv~e~~~r~~~~ 229 (725) T protein:vir:92 150 HSACSHVIWDSNSKLMDKSDSRHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKK 229 (725) T ss_pred cCChhhcccCchhhccChhhHHHHHHHhcCCHHHHHHHHhhcCcchhhhhhcccCCcccccccCCCeEEEEEEEEEEEEe Confidence 9999999999999999999999999999999986555 6666666666666677778899999999999999999999 Q ss_pred EEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEeeec Q lcl|Aclame:pro 238 VDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRWF 317 (708) Q Consensus 238 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~~~~ 317 (708) +.++.+.++.+|+++.|+..++..+...+...|...+..+.++++||+|++++|+.+|++++|+||++||||||||++.+ T Consensus 230 ~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~~~~~~~~~~~P~vP~~g~r~~ 309 (725) T protein:vir:92 230 ETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGF 309 (725) T ss_pred eeEEeecCCCCCceeecChhhHHHHHHHHhccCchhhhhccceeeeEeeeeecchhhhcCCCCCCCCceeeEEEEeeeec Confidence 99999999999999999999988888889999999999999999999999999999999999999999999999999999 Q ss_pred cCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeeccccccccccccc Q lcl|Aclame:pro 318 IDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKSGNIIAG 397 (708) Q Consensus 318 ~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 397 (708) ++|++++||+||+|||+||++|+++|+++|+++++++++++++++++++.++.|...+. .+++.+++...++|... T Consensus 310 ~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~--~~~~~~~~~~~~~g~~~-- 385 (725) T protein:vir:92 310 VEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDD--YPYYLLNRTDENNGEMP-- 385 (725) T ss_pred cCCcccccceeccchhHHHHHHHHHHHHHHHHHhccCcccccchhhhhHHHHHHhccCc--cceeecccccccccccc-- Confidence 99999999999999999999999999999999999999999999999999888876443 45666777766666653 Q ss_pred ccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 398 ATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSN-IAQETVNNLMNRADMASFIYLDNMAKSLKRAGEVWL 476 (708) Q Consensus 398 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n-~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~~~~~l 476 (708) ..++.+++++++|+++++|++.+..+|+++||+|++++|..+| +||+||++++++|++.+++++|||+++++++|+++| T Consensus 386 ~~~i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL 465 (725) T protein:vir:92 386 TQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQ 465 (725) T ss_pred ccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhccCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3467888999999999999999999999999999999999776 699999999999999999999999999999999999 Q ss_pred HHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHHHhccc Q lcl|Aclame:pro 477 SMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLP 556 (708) Q Consensus 477 ~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq~~~~ 556 (708) +||++|||++|+|||+|++|+.++|.||..+.++.+|+.+++|||+ |+|||+|+++|+++|+|++++..|++|++++++ T Consensus 466 ~lI~~~~~~~r~~RI~~edg~~~~v~in~~~~~~~~G~~~~~Ndi~-g~~Dv~v~~~p~~~s~r~~~~~~l~ql~~~~~~ 544 (725) T protein:vir:92 466 SIVNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIR-GRYECYTDVGPSFQSMKQQNRAEILELLGKTPQ 544 (725) T ss_pred HHHHHhcCCCcEEEEecCCCCcceEEeccccccccccchhhhhccc-cceeeEEeeccChHHHHHHHHHHHHHHHHhccc Confidence 9999999999999999999999999999999999999999999995 899999999999999999999999999999998 Q ss_pred cCchhHHHHHHHHhhccchhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 557 TDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQK 636 (708) Q Consensus 557 ~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qae~~k 636 (708) ..|....+++.+++++++|+++++++++++..++....+|.+++++++.++++++++.+++.+..+++++.+++++++++ T Consensus 545 ~~~~~~~~l~~~~~~~d~~~~~e~~erirkq~~~~~~~~~~~~e~~q~~~~~qqa~~~q~~~e~~~~qa~~~~~qae~~k 624 (725) T protein:vir:92 545 GTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQGVLLQGQAELAK 624 (725) T ss_pred chhHHHHHHHHHhhcccchHHHHHHHHHHhhhchhccCCccchhhhHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHH Confidence 87776667777889999999999999999999999999999888888888888888888888888888888888888888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---------------HHH---HHHHH--HHHhhhhhhhhh-- Q lcl|Aclame:pro 637 ATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDD---------------KAV---MEAIR--LLKDVAESQQQQ-- 694 (708) Q Consensus 637 ~~~~~~~~q~e~~~~~~~~~~~~a~~~~~~~q~~~~~~---------------~~~---~~~~~--~~~~~~~~~~~~-- 694 (708) ++++..++++++.++++++++++++......++...++ +.+ ...++ +..+.+..+++. T Consensus 625 aqaE~~k~q~~a~~~~~~a~~~aa~~~~~~~q~~~~q~~~~~~~~~~~~~~q~~~~~~a~~~ae~~l~~~~~~~~~~~d~ 704 (725) T protein:vir:92 625 AQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDARANAELLLKGNEQTHKQRMDI 704 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHhchHHHHHHHHHHHHHHHH Confidence 88888888888877777777666555443333221111 000 00011 111111111111 Q ss_pred -----hhcCCCCCCCCCCC Q lcl|Aclame:pro 695 -----FQSPPQSPADLMPS 708 (708) Q Consensus 695 -----~~~~~~~~~e~~~~ 708 (708) .+...++|.++.-| T Consensus 705 ~~~~~~~~~~~~~~~~~~~ 723 (725) T protein:vir:92 705 ANILQSQRQNQPSGSVAET 723 (725) T ss_pred HHHhcchhccCCccccccC Confidence 11222233333333 No 8 >protein:vir:108295 Length: 711 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552284;genbank:gi:160700609;genbank:GeneID:5758811 Probab=100.00 E-value=3.7e-164 Score=916.59 Aligned_cols=666 Identities=24% Similarity=0.342 Sum_probs=551.2 Q ss_pred CC--cchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHHH Q lcl|Aclame:pro 1 MA--ETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIA 78 (708) Q Consensus 1 ma--~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~g 78 (708) -. ++.+.+|++++.+|+++.+++++||.++++|. .||+|+||++++++.|+.+ |+||+|||+|+|+|++|+| T Consensus 22 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~--~fy~G~Qw~~~~~~~l~~~----g~p~~~~N~i~~~v~~v~g 95 (711) T protein:vir:10 22 AKNNDDDRALLATARERARDGATYWKDNWEAAEDDL--KFLGGEQWPSQVRTERELE----QRPCLVNNVLPTFVDQVLG 95 (711) T ss_pred ccCcchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHH--HHhCCCCCCHHHHHHHHhc----CCCcEEEcchHHHHHHHhh Confidence 12 24566899999999999999999999999885 5689999999999999875 7899999999999999999 Q ss_pred HHhcCcceeEEecCC---------------------CcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEE Q lcl|Aclame:pro 79 EYRNNRITVKFRPGD---------------------REASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFR 137 (708) Q Consensus 79 ~~~~nr~~~~v~pr~---------------------~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~ 137 (708) ++++||++++|+||+ +++|.++|++||++++++++.|++++++++||+++++||+||++ T Consensus 96 ~~~~nr~~~~v~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~af~d~~~~G~G~~e 175 (711) T protein:vir:10 96 DQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESGMGYLR 175 (711) T ss_pred hHhhCCcceEEecccccchhhhhhhhccccccccCCChhHHHHHHHHHHHHHHHHHhcChhHHHHHHHHHhhhcCcceEE Confidence 999999999999985 46889999999999999999999999999999999999999999 Q ss_pred EEeeccccCCCCCCCcceeeEEeecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCCccccccccccccccc Q lcl|Aclame:pro 138 LTSMLVNEYDPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEY 217 (708) Q Consensus 138 v~~~~~~~~d~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~ 217 (708) |+++|..+ +.+.++++|.++++ +++|||||+|+++|+|||+|||+++|||+++|+++||+++...++..+. .+.+ T Consensus 176 v~~d~~~~---d~~~~e~~i~~v~~-p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~yp~~a~~~~~~~~~-~~~~ 250 (711) T protein:vir:10 176 VRSDYLAD---DSFEQDLIIEAIQN-QFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSV-ADYD 250 (711) T ss_pred EEecccCC---CCCCCCeEEeeecC-hhheeeCccccccChhhhcceeeeecCCHHHHHHhCCchhhhhhhcccc-cccC Confidence 99998654 35567888888775 5789999999999999999999999999999999999988766654443 3455 Q ss_pred CCCCCceeEEeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeec Q lcl|Aclame:pro 218 NWFGADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEK 297 (708) Q Consensus 218 ~~~~~~~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~ 297 (708) +|.+.++++|+|||+++.+...++.+.++. .++........+.+...|...+..+.+++++|+|++|+|+.+|++ T Consensus 251 ~~~~~~~vrv~E~~~r~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~G~~~L~~ 325 (711) T protein:vir:10 251 TWFTEKSVRVSEYFTREPVIREIALLSDGR-----SFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEG 325 (711) T ss_pred cccCcceeeEEEEEeeeeeeeEEEeecCCc-----eeccCcchhHHHHHHhcCchhhhhhhhceeeEEEEEEecceeecC Confidence 788999999999999988877766655432 234445556677777888888899999999999999999999999 Q ss_pred CCCCCCCCcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhccc Q lcl|Aclame:pro 298 PRRIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKK 377 (708) Q Consensus 298 ~~~~p~~~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~ 377 (708) ++||||++||||||||++.++|+++++||+||.|||+||++|+++|+++|++++++++++++++|++++.++.|.+.+.+ T Consensus 326 ~~p~~~~~~P~vp~~g~r~~~d~~~~~~G~vr~~~d~Qr~~N~~~s~~~~~l~~~~~~~~~~~~gai~~~~~~~~e~~~~ 405 (711) T protein:vir:10 326 PVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTK 405 (711) T ss_pred CCCCCCCcccEEEEeeeeeccccccccchhhhhhhhhHHHHHHHHHHHHHHHHhcCCCceeecCcccCChHHHHHhcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCceeeecccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccc-hhHHHHHHHHHHHHHH Q lcl|Aclame:pro 378 RPAFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSN-IAQETVNNLMNRADMA 456 (708) Q Consensus 378 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n-~sg~ai~~~q~q~~~~ 456 (708) ++++++++++.+.. .++++++++++|+++++|++++.++|+++||++++++|..+| +||+||++++++|+++ T Consensus 406 ~~~vi~~~~~~~~~-------~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGi~~~~~G~~~n~~Sg~ai~~~q~qg~~~ 478 (711) T protein:vir:10 406 NFSLLTYIPQYQGD-------PGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRG 478 (711) T ss_pred CCCeeEecccccCc-------CCccccCCCCCCHHHHHHHHHHHHHHHHHhCCChHHcCCCccchHHHHHHHHHHHHHHH Confidence 99999999865432 367788999999999999999999999999999999998776 6999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccc Q lcl|Aclame:pro 457 SFIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSY 536 (708) Q Consensus 457 ~~~~~dn~~~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~ 536 (708) +.+++|||+++++++|+++|+||++|||++|+|||+|++|+.++|.||..++++.+|..+++|||++|+|||+|+++|++ T Consensus 479 l~~~~dn~~~~~~~~g~~ll~li~~~~~~er~~rI~ged~~~~~v~ln~~~~~~~~G~~~~~nDi~~g~~Dv~i~~~p~~ 558 (711) T protein:vir:10 479 SFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAF 558 (711) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEEecCCCCcceEEecccccccccccceeeeccceeeeEEEEeeccCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhHHHHHHHHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 537 TARRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQP 616 (708) Q Consensus 537 ~~~r~~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~ 616 (708) +++|++++..|++|++++++ ..+.+.+++++++|+|+++++++++++..++....++..++.++...+ ++++..+. T Consensus 559 ~s~r~~~~~~l~ql~~~~p~---~~~~~~~~il~~~d~p~~~el~e~lr~~~~~~~~~~~~~~~~qq~~~e-~qq~~~~~ 634 (711) T protein:vir:10 559 ATQRIEAAEAMIQFAQAVPS---AAAVMADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPE-QTEPTPEQ 634 (711) T ss_pred hhHHHHHHHHHHHHHhhcch---hhhHHHHHHHHhcCCCCHHHHHHHHHhhcCcccCcchhhhHHHHHHHH-HHHHHHHH Confidence 99999999999999987654 345677889999999999999999998877666555554443333322 22222233 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhh Q lcl|Aclame:pro 617 NPEMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVMEAIRLLKDVAESQQQQFQ 696 (708) Q Consensus 617 ~~~~~~aq~~~~~~qae~~k~~~~~~~~q~e~~~~~~~~~~~~a~~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 696 (708) +.++.++++..++++++.+++++++.+++.++.+++++.....+.+... .+...+....++ ....+.+..+....+ T Consensus 635 q~~~~~~q~~~~qa~ae~~~Aqae~~qa~~e~~~~q~q~~~~~~~aq~~--~~~~qq~~~~l~--~~qaelq~~q~~~~q 710 (711) T protein:vir:10 635 QVEMAKSQADMAQAEADTAQAQADMLKAQLETEEAQKQLAMIEDMAQGG--DVVYQQVRELVA--QALAEITASQANVTE 710 (711) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHH--HHHHHHHHHHHHhhc Confidence 3444555556666666666666666666666665554443322211111 111111011111 111111111111111 Q ss_pred c Q lcl|Aclame:pro 697 S 697 (708) Q Consensus 697 ~ 697 (708) . T Consensus 711 ~ 711 (711) T protein:vir:10 711 Q 711 (711) T ss_pred C Confidence 1 No 9 >protein:vir:105619 Length: 772 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164304;genbank:gi:56692922;genbank:GeneID:3197230 Probab=100.00 E-value=1.8e-152 Score=852.47 Aligned_cols=664 Identities=18% Similarity=0.160 Sum_probs=501.3 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEY 80 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~g~~ 80 (708) =.++...+|.+.+.+|.++.+.+.+||.++.+| |+||+|+||+++++++|+.+ |+||+|||+|+|+|++|+|++ T Consensus 15 ~~~~~~~~~~~~~~~~~~~~~~q~~~r~~a~~d--~~fy~G~QW~~~~~~~l~~~----g~p~~~~N~i~~~v~~v~g~~ 88 (772) T protein:vir:10 15 PPAGDTPLTVDEYADINYEIEDQPAWRAVADKE--MDYADGNQLDTELLRRQQAL----GIPPAVEDLIGPALLSLQGYE 88 (772) T ss_pred CcccccccCHHHHHHHHHHHhccHHHHHHHHHH--HHhhcCCCCCHHHHHHHHhc----CCCcEEEcchHHHHHHHHHHH Confidence 112445678889999999999999999999887 55789999999999999875 789999999999999999999 Q ss_pred hcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCcceeeEEe Q lcl|Aclame:pro 81 RNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAIEPI 160 (708) Q Consensus 81 ~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~i~~v 160 (708) ++||++++|+||+..+|.++|++||++++++++.|++++++++||+++++||+||++++++. | .+.++|+|..+ T Consensus 89 ~~nr~d~~v~Pr~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~Gw~e~~~~~----d--~~~~~i~i~~v 162 (772) T protein:vir:10 89 AVTRTDWRVTPNGDVGGQEVADALNYRLNTAERQSGADRACSEAFRPQIACGIGWVEVSRES----D--PFKFPYRCRPI 162 (772) T ss_pred HhcCcceEEecCCCchHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHhhhcCceeEEecccc----C--CCCCCeEEEee Confidence 99999999999975688999999999999999999999999999999999999999986542 2 34557777776 Q ss_pred ecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCCccccccc-----c--------ccc-------------- Q lcl|Aclame:pro 161 YDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDV-----T--------SMT-------------- 213 (708) Q Consensus 161 ~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~-----~--------~~~-------------- 213 (708) ++++|||||.|++ |+|||+|+|+.+|||+++++++||++++..-.. . ... T Consensus 163 --~p~~v~~Dp~a~~-D~sDar~~~~~~~~~~d~~~~~fp~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 239 (772) T protein:vir:10 163 --RRDEIHWDMKCGD-DWEACRFLRRQRWLSPDRIALVFPEHAELIGMVGKYGSTWWGQPDLGMMEGGTSTGLHNAWNEA 239 (772) T ss_pred --CcccceecCCCCC-CHHHhhhhhhhccCCHHHHHHhCCCchhHHHhhhhhcccccCcccccccccccccccccccchh Confidence 6789999999866 999999999999999999999999876321000 0 000 Q ss_pred ----ccccCCC--CCceeEEeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEE Q lcl|Aclame:pro 214 ----SWEYNWF--GADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVS 287 (708) Q Consensus 214 ----~~~~~~~--~~~~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~ 287 (708) .....|. +.++|+|+|+|++..+... +..+.+|+.+.|+..+.... .....|...+ +....+||+|+ T Consensus 240 ~~~~~~~~~~~~~~~~rVrv~E~w~r~~~~~~---~~~~~~g~~~~~~~~~~~~~--~~l~~g~~~~--~~~~~~rv~~~ 312 (772) T protein:vir:10 240 RAWTVQEDHWYNPTSKEICLVELWYRRWVQVH---VLKSPDGRVVEYDPNNLAHN--IALASGRISP--KKVTVSRVRRS 312 (772) T ss_pred hccccccccccccCCceEEEEEEeeeeeeeee---eeccCCCceEeeCcccHHHH--HHHhhcccch--heeeeeEEEEE Confidence 0011222 2356778887776654432 22457788888877655432 2333343333 33445689999 Q ss_pred EEecceeee-cCCCCCCCCcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccc Q lcl|Aclame:pro 288 VVDGDGFLE-KPRRIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRG 366 (708) Q Consensus 288 ~~~~~~il~-~~~~~p~~~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~ 366 (708) +|+|+++|+ +++||||++||||||||++...+|. +||+||.|||+||++|+++|+++|+|++++ +++++|++++ T Consensus 313 ~~~g~~~L~~~~~p~~~~~fP~vP~~g~r~~~~g~--~~G~vr~~kd~Qr~~N~~~S~~~~~l~~~~---~~~~~gav~~ 387 (772) T protein:vir:10 313 YWLGPHCLHDGPTPYTHRHFPYVPFFGFREDATGI--PYGYVRGMKYAQDSLNSGVSKLRWGMSVAR---VERTKGAVAM 387 (772) T ss_pred EEecceeeccCCCCCCCCccceEEEeeeEeccCCc--ccchhhhhhhHHHHHHHHHHHHHHHHhccc---ccccCCCccc Confidence 999999997 6899999999999999999866665 789999999999999999999999998874 6889999999 Q ss_pred hHHHHHhhcccCCceeeecccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccc-hhHHH Q lcl|Aclame:pro 367 LEKHWEARNKKRPAFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSN-IAQET 445 (708) Q Consensus 367 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n-~sg~a 445 (708) .++.+.+....++++++++++... . + +.+|.+++++++|+++++|++.+.++|+++||++++++|..+| +||+| T Consensus 388 ~d~~~~e~~arp~~vi~~~~~~~~--~--~-~~~~~~~~~~~~~~~~~~llq~~~~~i~~vsGv~~~~lG~~~na~SGvA 462 (772) T protein:vir:10 388 TDAQFRRQIARPDADIVLDENHMA--K--P-GARFDVKRDYTLTDQHFQMLQDNRATIERVSNITAGFQGRKGTATSGIQ 462 (772) T ss_pred hhHHHHHhccCCCCeEEeCCcccc--C--C-CCCccccCCccccHHHHHHHHHHHHHHHHHhCCCHHHcCCCcchhhHHH Confidence 887777777888899998865432 1 2 3467888999999999999999999999999999999999887 59999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCC--CceEEEecccccccCCCceEEeeccce Q lcl|Aclame:pro 446 VNNLMNRADMASFIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDG--SDDIAVLSAQVVDRQTGAVVALNDLSV 523 (708) Q Consensus 446 i~~~q~q~~~~~~~~~dn~~~~~~~~~~~~l~li~~~y~~~r~irI~~~~~--~~~~v~in~~~~~~~~~~~~~~nDi~~ 523 (708) |++++++|++.++++||||+++++++|+++|+||++|||++|+|||+|+++ ..++|.||..+.++.+|..++.|||++ T Consensus 463 i~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~y~~er~~RI~~~d~~~~~~~v~in~~~~d~~tg~~~~~NDi~~ 542 (772) T protein:vir:10 463 EQQQIEQSNQSIGRIMDNFRAGRTLVGELLLAMIVEDIGQERTEVVIEGDAVTADRVVVLNEPQRDPQTGAAYLSNDLLR 542 (772) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCCCCceEEeccceecccccccceecccee Confidence 999999999999999999999999999999999999999999999999885 579999999999999999999999999 Q ss_pred eeEEEEEeecccchhHHHHHHHHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhhhhcccCcchHHHH Q lcl|Aclame:pro 524 GRYDVTVDVGPSYTARRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQ 603 (708) Q Consensus 524 g~~Dv~v~~~~~~~~~r~~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q 603 (708) |+|||+|+++|+++|+|+++++.|++|++.++|. +...+.+++++++|+|+++++.+++++..++. .| +++++ T Consensus 543 g~yDv~i~~~p~~~t~r~~~~~~m~ql~~~~~P~--~~~~~~~~~le~~D~p~~~ei~~~ir~~~~~~---~p--eq~~~ 615 (772) T protein:vir:10 543 TRIKVALEDVPSTNSYRGQQLNAMSEAVKSMPPQ--YQAAVLPFLVSLMDVPFKRDVVEAIRAVDQQQ---TP--EQIQQ 615 (772) T ss_pred eeEEEEeeccccchHHHHHHHHHHHHHHhccChh--HHHHHHHHHHhhcCCCChHHHHHHHHHHhccC---Ch--HHHHH Confidence 9999999999999999999999999999887653 44567788999999999999999999764321 11 11111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH- Q lcl|Aclame:pro 604 IVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVMEAIR- 682 (708) Q Consensus 604 ~~~~~qq~qq~~~~~~~~~aq~~~~~~qae~~k~~~~~~~~q~e~~~~~~~~~~~~a~~~~~~~q~~~~~~~~~~~~~~- 682 (708) +.+++ .+++.++.+++++..++++++++.++++++.++++.+...+++..++++.+...++.....-+...... T Consensus 616 ~~~q~-----~qq~~~~~~~el~~~q~~a~~~~~~A~a~~~~aqa~~~~~~a~~~a~~aa~~~~q~~q~a~~ad~~l~~~ 690 (772) T protein:vir:10 616 QIDQA-----VQDALAKAGNDIKLRELEIKERKADSEISGLNAKAVQIGVQAAFSAMQAGAQIAQMPMIAPIADAVMQSA 690 (772) T ss_pred HHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHhhhhhhHHHHHHHHhc Confidence 11111 111122233344444455555555555555555555555555555555554444443322111111000 Q ss_pred HHHh-hhhhhhhhhhcCCCCC---------------------CCCCCC Q lcl|Aclame:pro 683 LLKD-VAESQQQQFQSPPQSP---------------------ADLMPS 708 (708) Q Consensus 683 ~~~~-~~~~~~~~~~~~~~~~---------------------~e~~~~ 708 (708) .... .+.......+++++++ +-.+|+ T Consensus 691 g~~~~~~~~~~~~~p~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~ 738 (772) T protein:vir:10 691 GYQRPNPAGDDPNYPIADQTAAMNIRSPYIQGQGPAAEAEAESVSVRR 738 (772) T ss_pred ccccccccccCCCCCCCCCccCCCCCccCCCCCCCCCccccCCCCCcc Confidence 0000 0000000000000000 000011 No 10 >protein:vir:817 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050550;genbank:gi:9633447;genbank:GeneID:1262279 Probab=100.00 E-value=3.1e-150 Score=840.26 Aligned_cols=660 Identities=16% Similarity=0.178 Sum_probs=517.0 Q ss_pred CCcch-----HHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHH Q lcl|Aclame:pro 1 MAETL-----EKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNR 75 (708) Q Consensus 1 ma~~~-----~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~ 75 (708) |++++ .++|.+++.+|..+.+++.+||.++.+|. .||+|+||++++++.|+.+ |+||+|||+|+|+|++ T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~--~fy~G~Qw~~~~~~~l~~~----g~p~~~~N~i~~~v~~ 81 (714) T protein:vir:81 8 MATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKAC--AYYDGDQLPPEVLQVLKDR----GQPMTIHNLIAPTVDG 81 (714) T ss_pred ccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHH--HhhcCCCCCHHHHHHHHhc----CCCcEEeccHHHHHHH Confidence 88864 45789999999999999999999998874 5789999999999999875 7899999999999999 Q ss_pred HHHHHhcCcceeEEecCCC-cchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCcc Q lcl|Aclame:pro 76 IIAEYRNNRITVKFRPGDR-EASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQR 154 (708) Q Consensus 76 i~g~~~~nr~~~~v~pr~~-~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~ 154 (708) |+|++++||++++|+||++ +++.++|++|+++++++++.|++++++++||+++++||+||+++++++ | .+.++ T Consensus 82 v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~----d--~~~~~ 155 (714) T protein:vir:81 82 VLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNS----D--PFGPE 155 (714) T ss_pred HHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEecccc----C--CCCCC Confidence 9999999999999999974 355689999999999999999999999999999999999999998764 2 34567 Q ss_pred eeeEEeecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCCccccccc------------------------- Q lcl|Aclame:pro 155 IAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDV------------------------- 209 (708) Q Consensus 155 i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~------------------------- 209 (708) |++..+ |+++|||||+|+++|+|||+|||+++|||+++|+++||++++..-.. T Consensus 156 i~i~~v--~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~ 233 (714) T protein:vir:81 156 FKVSTV--SRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWE 233 (714) T ss_pred eEEEec--chhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchh Confidence 777766 68899999999999999999999999999999999999866321000 Q ss_pred --ccccccccCCCCC--ceeEEeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEE Q lcl|Aclame:pro 210 --TSMTSWEYNWFGA--DVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVY 285 (708) Q Consensus 210 --~~~~~~~~~~~~~--~~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~ 285 (708) ........+|.+. .+++|.|+|++..+. +.+.++.+|+++.|+..+...... ...|...+..+.++ +|+ T Consensus 234 ~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~---~~~~~~~~g~~~~~d~~~~~~~~~--~~~g~~~~~~~~~~--rv~ 306 (714) T protein:vir:81 234 EYQSWDRQQNEWLQRERRRVLLQVVYYRTFER---LPVIELSNGRVVAFDKNNLMQAVA--VASGRVQVKVGRVS--RIR 306 (714) T ss_pred hhccccccccccccccccEEEEEEEEEEEEEE---EEeeccCCCceEEeCccCHHHHHH--Hhhcchhhhccccc--eEE Confidence 0001112334443 356667888766553 334467899999998887655433 33455555555554 577 Q ss_pred EEEEecceeee-cCCCCCCCCcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhc Q lcl|Aclame:pro 286 VSVVDGDGFLE-KPRRIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQI 364 (708) Q Consensus 286 ~~~~~~~~il~-~~~~~p~~~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai 364 (708) +++|+|+.+|. +++||||++||||||||++..++|. +||+||.|||+||++|+++|+++|+|+ ++. +++.++++ T Consensus 307 ~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~--~~G~vr~~~d~Qr~~N~~~s~~~~~l~--~~~-~~~~~~a~ 381 (714) T protein:vir:81 307 EAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGE--PYGLISRAIPAQDEVNFRRIKLTWLLQ--AKR-VIMDEDAT 381 (714) T ss_pred EEEEecCcccccCCCCCCCCceeEEEEeeeeeeccCc--eeehhhhchhHHHHHHHHHHHHHHhhc--CCc-eeeecCcc Confidence 88899999885 6899999999999999999866665 799999999999999999999999874 443 56777887 Q ss_pred cchHHHHHhhcccCCceeeecccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccc-hhH Q lcl|Aclame:pro 365 RGLEKHWEARNKKRPAFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSN-IAQ 443 (708) Q Consensus 365 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n-~sg 443 (708) ...++...+....++++++++++....... +.++.+.+++++|+++++|++.+.++|+++||+|++++|..+| +|| T Consensus 382 ~~~d~~~~e~~arp~~vi~~~p~~~~~~~~---~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SG 458 (714) T protein:vir:81 382 QLSDNDLMEQIERPDGIIKLNPVRKNQKSV---ADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSG 458 (714) T ss_pred cccHHHHHHhccCCCCceeecccccccCCC---CccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhH Confidence 666554445556677899988765433322 3467778889999999999999999999999999999999887 599 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCC---ceEEEecccccccCCCceEEeec Q lcl|Aclame:pro 444 ETVNNLMNRADMASFIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGS---DDIAVLSAQVVDRQTGAVVALND 520 (708) Q Consensus 444 ~ai~~~q~q~~~~~~~~~dn~~~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~---~~~v~in~~~~~~~~~~~~~~nD 520 (708) +||+++++||++.+++++|||+.+++++|+++|+||++|||++|++||+|+++. .+++.+|+ .+|.....|| T Consensus 459 vAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~-----~~~~~~~~nD 533 (714) T protein:vir:81 459 VAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNA-----EGDNGELTND 533 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeecc-----ccCcceeccc Confidence 999999999999999999999999999999999999999999999999988654 45777764 4677788999 Q ss_pred cceeeEEEEEeecccchhHHHHHHHHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhhhhcccCcchH Q lcl|Aclame:pro 521 LSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEK 600 (708) Q Consensus 521 i~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~ 600 (708) |++|+|||+|+++|+++|+|+++++.|++|++.++|. +...+.+++++++|+|+++++++++++..+.....++.+++ T Consensus 534 i~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~~~p~--~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e 611 (714) T protein:vir:81 534 ISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQ--VQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPE 611 (714) T ss_pred ceeeeEEEEEeeccCchHHHHHHHHHHHHHHhhcCch--hhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchh Confidence 9999999999999999999999999999999987653 34556778899999999999999999988877777777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHH Q lcl|Aclame:pro 601 EQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQA-------NTVYKLAQARNID 673 (708) Q Consensus 601 ~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qae~~k~~~~~~~~q~e~~~~~~~~~~~~a-------~~~~~~~q~~~~~ 673 (708) ++++++++++.++++++++..+++++..+++|+++++++++.+.+.++++....+..+++ .+.+++...+.++ T Consensus 612 ~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~ 691 (714) T protein:vir:81 612 EQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQNME 691 (714) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhhhh Confidence 777666666667667777777778888888888888887776666655443333322222 2222221222222 Q ss_pred HHHHHHHHHHHHhhhhhhhhhhhcCCC Q lcl|Aclame:pro 674 DKAVMEAIRLLKDVAESQQQQFQSPPQ 700 (708) Q Consensus 674 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 700 (708) ++....+.++.+.+ ++...+.+. T Consensus 692 ~~~~~~~~q~~q~~----~~~~~~~~~ 714 (714) T protein:vir:81 692 QEQDVLQQQMLYTL----QQRMNEMSL 714 (714) T ss_pred hhhHHHHHHHHHHH----HHHHHhcCC Confidence 22222222222222 222222222 No 11 >protein:vir:10117 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859247;genbank:gi:32171003;genbank:GeneID:2653343 Probab=100.00 E-value=3.1e-150 Score=840.26 Aligned_cols=660 Identities=16% Similarity=0.178 Sum_probs=517.0 Q ss_pred CCcch-----HHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHH Q lcl|Aclame:pro 1 MAETL-----EKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNR 75 (708) Q Consensus 1 ma~~~-----~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~ 75 (708) |++++ .++|.+++.+|..+.+++.+||.++.+|. .||+|+||++++++.|+.+ |+||+|||+|+|+|++ T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~--~fy~G~Qw~~~~~~~l~~~----g~p~~~~N~i~~~v~~ 81 (714) T protein:vir:10 8 MATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKAC--AYYDGDQLPPEVLQVLKDR----GQPMTIHNLIAPTVDG 81 (714) T ss_pred ccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHH--HhhcCCCCCHHHHHHHHhc----CCCcEEeccHHHHHHH Confidence 88864 45789999999999999999999998874 5789999999999999875 7899999999999999 Q ss_pred HHHHHhcCcceeEEecCCC-cchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCcc Q lcl|Aclame:pro 76 IIAEYRNNRITVKFRPGDR-EASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQR 154 (708) Q Consensus 76 i~g~~~~nr~~~~v~pr~~-~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~ 154 (708) |+|++++||++++|+||++ +++.++|++|+++++++++.|++++++++||+++++||+||+++++++ | .+.++ T Consensus 82 v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~----d--~~~~~ 155 (714) T protein:vir:10 82 VLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNS----D--PFGPE 155 (714) T ss_pred HHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEecccc----C--CCCCC Confidence 9999999999999999974 355689999999999999999999999999999999999999998764 2 34567 Q ss_pred eeeEEeecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCCccccccc------------------------- Q lcl|Aclame:pro 155 IAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDV------------------------- 209 (708) Q Consensus 155 i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~------------------------- 209 (708) |++..+ |+++|||||+|+++|+|||+|||+++|||+++|+++||++++..-.. T Consensus 156 i~i~~v--~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~ 233 (714) T protein:vir:10 156 FKVSTV--SRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWE 233 (714) T ss_pred eEEEec--chhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchh Confidence 777766 68899999999999999999999999999999999999866321000 Q ss_pred --ccccccccCCCCC--ceeEEeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEE Q lcl|Aclame:pro 210 --TSMTSWEYNWFGA--DVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVY 285 (708) Q Consensus 210 --~~~~~~~~~~~~~--~~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~ 285 (708) ........+|.+. .+++|.|+|++..+. +.+.++.+|+++.|+..+...... ...|...+..+.++ +|+ T Consensus 234 ~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~---~~~~~~~~g~~~~~d~~~~~~~~~--~~~g~~~~~~~~~~--rv~ 306 (714) T protein:vir:10 234 EYQSWDRQQNEWLQRERRRVLLQVVYYRTFER---LPVIELSNGRVVAFDKNNLMQAVA--VASGRVQVKVGRVS--RIR 306 (714) T ss_pred hhccccccccccccccccEEEEEEEEEEEEEE---EEeeccCCCceEEeCccCHHHHHH--Hhhcchhhhccccc--eEE Confidence 0001112334443 356667888766553 334467899999998887655433 33455555555554 577 Q ss_pred EEEEecceeee-cCCCCCCCCcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhc Q lcl|Aclame:pro 286 VSVVDGDGFLE-KPRRIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQI 364 (708) Q Consensus 286 ~~~~~~~~il~-~~~~~p~~~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai 364 (708) +++|+|+.+|. +++||||++||||||||++..++|. +||+||.|||+||++|+++|+++|+|+ ++. +++.++++ T Consensus 307 ~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~--~~G~vr~~~d~Qr~~N~~~s~~~~~l~--~~~-~~~~~~a~ 381 (714) T protein:vir:10 307 EAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGE--PYGLISRAIPAQDEVNFRRIKLTWLLQ--AKR-VIMDEDAT 381 (714) T ss_pred EEEEecCcccccCCCCCCCCceeEEEEeeeeeeccCc--eeehhhhchhHHHHHHHHHHHHHHhhc--CCc-eeeecCcc Confidence 88899999885 6899999999999999999866665 799999999999999999999999874 443 56777887 Q ss_pred cchHHHHHhhcccCCceeeecccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccc-hhH Q lcl|Aclame:pro 365 RGLEKHWEARNKKRPAFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSN-IAQ 443 (708) Q Consensus 365 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n-~sg 443 (708) ...++...+....++++++++++....... +.++.+.+++++|+++++|++.+.++|+++||+|++++|..+| +|| T Consensus 382 ~~~d~~~~e~~arp~~vi~~~p~~~~~~~~---~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SG 458 (714) T protein:vir:10 382 QLSDNDLMEQIERPDGIIKLNPVRKNQKSV---ADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSG 458 (714) T ss_pred cccHHHHHHhccCCCCceeecccccccCCC---CccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhH Confidence 666554445556677899988765433322 3467778889999999999999999999999999999999887 599 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCC---ceEEEecccccccCCCceEEeec Q lcl|Aclame:pro 444 ETVNNLMNRADMASFIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGS---DDIAVLSAQVVDRQTGAVVALND 520 (708) Q Consensus 444 ~ai~~~q~q~~~~~~~~~dn~~~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~---~~~v~in~~~~~~~~~~~~~~nD 520 (708) +||+++++||++.+++++|||+.+++++|+++|+||++|||++|++||+|+++. .+++.+|+ .+|.....|| T Consensus 459 vAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~-----~~~~~~~~nD 533 (714) T protein:vir:10 459 VAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNA-----EGDNGELTND 533 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeecc-----ccCcceeccc Confidence 999999999999999999999999999999999999999999999999988654 45777764 4677788999 Q ss_pred cceeeEEEEEeecccchhHHHHHHHHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhhhhcccCcchH Q lcl|Aclame:pro 521 LSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEK 600 (708) Q Consensus 521 i~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~ 600 (708) |++|+|||+|+++|+++|+|+++++.|++|++.++|. +...+.+++++++|+|+++++++++++..+.....++.+++ T Consensus 534 i~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~~~p~--~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e 611 (714) T protein:vir:10 534 ISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQ--VQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPE 611 (714) T ss_pred ceeeeEEEEEeeccCchHHHHHHHHHHHHHHhhcCch--hhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchh Confidence 9999999999999999999999999999999987653 34556778899999999999999999988877777777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHH Q lcl|Aclame:pro 601 EQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQA-------NTVYKLAQARNID 673 (708) Q Consensus 601 ~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qae~~k~~~~~~~~q~e~~~~~~~~~~~~a-------~~~~~~~q~~~~~ 673 (708) ++++++++++.++++++++..+++++..+++|+++++++++.+.+.++++....+..+++ .+.+++...+.++ T Consensus 612 ~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~ 691 (714) T protein:vir:10 612 EQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQNME 691 (714) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhhhh Confidence 777666666667667777777778888888888888887776666655443333322222 2222221222222 Q ss_pred HHHHHHHHHHHHhhhhhhhhhhhcCCC Q lcl|Aclame:pro 674 DKAVMEAIRLLKDVAESQQQQFQSPPQ 700 (708) Q Consensus 674 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 700 (708) ++....+.++.+.+ ++...+.+. T Consensus 692 ~~~~~~~~q~~q~~----~~~~~~~~~ 714 (714) T protein:vir:10 692 QEQDVLQQQMLYTL----QQRMNEMSL 714 (714) T ss_pred hhhHHHHHHHHHHH----HHHHHhcCC Confidence 22222222222222 222222222 No 12 >protein:vir:9950 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859080;genbank:gi:32170835;genbank:GeneID:2653184 Probab=100.00 E-value=3.1e-150 Score=840.26 Aligned_cols=660 Identities=16% Similarity=0.178 Sum_probs=517.0 Q ss_pred CCcch-----HHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHH Q lcl|Aclame:pro 1 MAETL-----EKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNR 75 (708) Q Consensus 1 ma~~~-----~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~ 75 (708) |++++ .++|.+++.+|..+.+++.+||.++.+|. .||+|+||++++++.|+.+ |+||+|||+|+|+|++ T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~--~fy~G~Qw~~~~~~~l~~~----g~p~~~~N~i~~~v~~ 81 (714) T protein:vir:99 8 MATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKAC--AYYDGDQLPPEVLQVLKDR----GQPMTIHNLIAPTVDG 81 (714) T ss_pred ccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHH--HhhcCCCCCHHHHHHHHhc----CCCcEEeccHHHHHHH Confidence 88864 45789999999999999999999998874 5789999999999999875 7899999999999999 Q ss_pred HHHHHhcCcceeEEecCCC-cchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCcc Q lcl|Aclame:pro 76 IIAEYRNNRITVKFRPGDR-EASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQR 154 (708) Q Consensus 76 i~g~~~~nr~~~~v~pr~~-~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~ 154 (708) |+|++++||++++|+||++ +++.++|++|+++++++++.|++++++++||+++++||+||+++++++ | .+.++ T Consensus 82 v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~----d--~~~~~ 155 (714) T protein:vir:99 82 VLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNS----D--PFGPE 155 (714) T ss_pred HHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEecccc----C--CCCCC Confidence 9999999999999999974 355689999999999999999999999999999999999999998764 2 34567 Q ss_pred eeeEEeecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCCccccccc------------------------- Q lcl|Aclame:pro 155 IAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDV------------------------- 209 (708) Q Consensus 155 i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~------------------------- 209 (708) |++..+ |+++|||||+|+++|+|||+|||+++|||+++|+++||++++..-.. T Consensus 156 i~i~~v--~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~ 233 (714) T protein:vir:99 156 FKVSTV--SRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWE 233 (714) T ss_pred eEEEec--chhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchh Confidence 777766 68899999999999999999999999999999999999866321000 Q ss_pred --ccccccccCCCCC--ceeEEeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEE Q lcl|Aclame:pro 210 --TSMTSWEYNWFGA--DVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVY 285 (708) Q Consensus 210 --~~~~~~~~~~~~~--~~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~ 285 (708) ........+|.+. .+++|.|+|++..+. +.+.++.+|+++.|+..+...... ...|...+..+.++ +|+ T Consensus 234 ~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~---~~~~~~~~g~~~~~d~~~~~~~~~--~~~g~~~~~~~~~~--rv~ 306 (714) T protein:vir:99 234 EYQSWDRQQNEWLQRERRRVLLQVVYYRTFER---LPVIELSNGRVVAFDKNNLMQAVA--VASGRVQVKVGRVS--RIR 306 (714) T ss_pred hhccccccccccccccccEEEEEEEEEEEEEE---EEeeccCCCceEEeCccCHHHHHH--Hhhcchhhhccccc--eEE Confidence 0001112334443 356667888766553 334467899999998887655433 33455555555554 577 Q ss_pred EEEEecceeee-cCCCCCCCCcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhc Q lcl|Aclame:pro 286 VSVVDGDGFLE-KPRRIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQI 364 (708) Q Consensus 286 ~~~~~~~~il~-~~~~~p~~~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai 364 (708) +++|+|+.+|. +++||||++||||||||++..++|. +||+||.|||+||++|+++|+++|+|+ ++. +++.++++ T Consensus 307 ~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~--~~G~vr~~~d~Qr~~N~~~s~~~~~l~--~~~-~~~~~~a~ 381 (714) T protein:vir:99 307 EAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGE--PYGLISRAIPAQDEVNFRRIKLTWLLQ--AKR-VIMDEDAT 381 (714) T ss_pred EEEEecCcccccCCCCCCCCceeEEEEeeeeeeccCc--eeehhhhchhHHHHHHHHHHHHHHhhc--CCc-eeeecCcc Confidence 88899999885 6899999999999999999866665 799999999999999999999999874 443 56777887 Q ss_pred cchHHHHHhhcccCCceeeecccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccc-hhH Q lcl|Aclame:pro 365 RGLEKHWEARNKKRPAFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSN-IAQ 443 (708) Q Consensus 365 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n-~sg 443 (708) ...++...+....++++++++++....... +.++.+.+++++|+++++|++.+.++|+++||+|++++|..+| +|| T Consensus 382 ~~~d~~~~e~~arp~~vi~~~p~~~~~~~~---~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SG 458 (714) T protein:vir:99 382 QLSDNDLMEQIERPDGIIKLNPVRKNQKSV---ADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSG 458 (714) T ss_pred cccHHHHHHhccCCCCceeecccccccCCC---CccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhH Confidence 666554445556677899988765433322 3467778889999999999999999999999999999999887 599 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCC---ceEEEecccccccCCCceEEeec Q lcl|Aclame:pro 444 ETVNNLMNRADMASFIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGS---DDIAVLSAQVVDRQTGAVVALND 520 (708) Q Consensus 444 ~ai~~~q~q~~~~~~~~~dn~~~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~---~~~v~in~~~~~~~~~~~~~~nD 520 (708) +||+++++||++.+++++|||+.+++++|+++|+||++|||++|++||+|+++. .+++.+|+ .+|.....|| T Consensus 459 vAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~-----~~~~~~~~nD 533 (714) T protein:vir:99 459 VAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNA-----EGDNGELTND 533 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeecc-----ccCcceeccc Confidence 999999999999999999999999999999999999999999999999988654 45777764 4677788999 Q ss_pred cceeeEEEEEeecccchhHHHHHHHHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhhhhcccCcchH Q lcl|Aclame:pro 521 LSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEK 600 (708) Q Consensus 521 i~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~ 600 (708) |++|+|||+|+++|+++|+|+++++.|++|++.++|. +...+.+++++++|+|+++++++++++..+.....++.+++ T Consensus 534 i~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~~~p~--~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e 611 (714) T protein:vir:99 534 ISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQ--VQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPE 611 (714) T ss_pred ceeeeEEEEEeeccCchHHHHHHHHHHHHHHhhcCch--hhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchh Confidence 9999999999999999999999999999999987653 34556778899999999999999999988877777777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHH Q lcl|Aclame:pro 601 EQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQA-------NTVYKLAQARNID 673 (708) Q Consensus 601 ~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qae~~k~~~~~~~~q~e~~~~~~~~~~~~a-------~~~~~~~q~~~~~ 673 (708) ++++++++++.++++++++..+++++..+++|+++++++++.+.+.++++....+..+++ .+.+++...+.++ T Consensus 612 ~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~ 691 (714) T protein:vir:99 612 EQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQNME 691 (714) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhhhh Confidence 777666666667667777777778888888888888887776666655443333322222 2222221222222 Q ss_pred HHHHHHHHHHHHhhhhhhhhhhhcCCC Q lcl|Aclame:pro 674 DKAVMEAIRLLKDVAESQQQQFQSPPQ 700 (708) Q Consensus 674 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 700 (708) ++....+.++.+.+ ++...+.+. T Consensus 692 ~~~~~~~~q~~q~~----~~~~~~~~~ 714 (714) T protein:vir:99 692 QEQDVLQQQMLYTL----QQRMNEMSL 714 (714) T ss_pred hhhHHHHHHHHHHH----HHHHHhcCC Confidence 22222222222222 222222222 No 13 >protein:vir:3296 Length: 714 # NCBI annotation: putative portal protein # Family: family:all:487 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049512;genbank:gi:9632518;genbank:GeneID:1262005 Probab=100.00 E-value=3.1e-150 Score=840.26 Aligned_cols=660 Identities=16% Similarity=0.178 Sum_probs=517.0 Q ss_pred CCcch-----HHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHH Q lcl|Aclame:pro 1 MAETL-----EKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNR 75 (708) Q Consensus 1 ma~~~-----~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~ 75 (708) |++++ .++|.+++.+|..+.+++.+||.++.+|. .||+|+||++++++.|+.+ |+||+|||+|+|+|++ T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~--~fy~G~Qw~~~~~~~l~~~----g~p~~~~N~i~~~v~~ 81 (714) T protein:vir:32 8 MATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKAC--AYYDGDQLPPEVLQVLKDR----GQPMTIHNLIAPTVDG 81 (714) T ss_pred ccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHH--HhhcCCCCCHHHHHHHHhc----CCCcEEeccHHHHHHH Confidence 88864 45789999999999999999999998874 5789999999999999875 7899999999999999 Q ss_pred HHHHHhcCcceeEEecCCC-cchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCcc Q lcl|Aclame:pro 76 IIAEYRNNRITVKFRPGDR-EASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQR 154 (708) Q Consensus 76 i~g~~~~nr~~~~v~pr~~-~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~ 154 (708) |+|++++||++++|+||++ +++.++|++|+++++++++.|++++++++||+++++||+||+++++++ | .+.++ T Consensus 82 v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~----d--~~~~~ 155 (714) T protein:vir:32 82 VLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNS----D--PFGPE 155 (714) T ss_pred HHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEecccc----C--CCCCC Confidence 9999999999999999974 355689999999999999999999999999999999999999998764 2 34567 Q ss_pred eeeEEeecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCCccccccc------------------------- Q lcl|Aclame:pro 155 IAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDV------------------------- 209 (708) Q Consensus 155 i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~------------------------- 209 (708) |++..+ |+++|||||+|+++|+|||+|||+++|||+++|+++||++++..-.. T Consensus 156 i~i~~v--~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~ 233 (714) T protein:vir:32 156 FKVSTV--SRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWE 233 (714) T ss_pred eEEEec--chhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchh Confidence 777766 68899999999999999999999999999999999999866321000 Q ss_pred --ccccccccCCCCC--ceeEEeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEE Q lcl|Aclame:pro 210 --TSMTSWEYNWFGA--DVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVY 285 (708) Q Consensus 210 --~~~~~~~~~~~~~--~~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~ 285 (708) ........+|.+. .+++|.|+|++..+. +.+.++.+|+++.|+..+...... ...|...+..+.++ +|+ T Consensus 234 ~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~---~~~~~~~~g~~~~~d~~~~~~~~~--~~~g~~~~~~~~~~--rv~ 306 (714) T protein:vir:32 234 EYQSWDRQQNEWLQRERRRVLLQVVYYRTFER---LPVIELSNGRVVAFDKNNLMQAVA--VASGRVQVKVGRVS--RIR 306 (714) T ss_pred hhccccccccccccccccEEEEEEEEEEEEEE---EEeeccCCCceEEeCccCHHHHHH--Hhhcchhhhccccc--eEE Confidence 0001112334443 356667888766553 334467899999998887655433 33455555555554 577 Q ss_pred EEEEecceeee-cCCCCCCCCcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhc Q lcl|Aclame:pro 286 VSVVDGDGFLE-KPRRIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQI 364 (708) Q Consensus 286 ~~~~~~~~il~-~~~~~p~~~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai 364 (708) +++|+|+.+|. +++||||++||||||||++..++|. +||+||.|||+||++|+++|+++|+|+ ++. +++.++++ T Consensus 307 ~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~--~~G~vr~~~d~Qr~~N~~~s~~~~~l~--~~~-~~~~~~a~ 381 (714) T protein:vir:32 307 EAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGE--PYGLISRAIPAQDEVNFRRIKLTWLLQ--AKR-VIMDEDAT 381 (714) T ss_pred EEEEecCcccccCCCCCCCCceeEEEEeeeeeeccCc--eeehhhhchhHHHHHHHHHHHHHHhhc--CCc-eeeecCcc Confidence 88899999885 6899999999999999999866665 799999999999999999999999874 443 56777887 Q ss_pred cchHHHHHhhcccCCceeeecccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccc-hhH Q lcl|Aclame:pro 365 RGLEKHWEARNKKRPAFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSN-IAQ 443 (708) Q Consensus 365 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n-~sg 443 (708) ...++...+....++++++++++....... +.++.+.+++++|+++++|++.+.++|+++||+|++++|..+| +|| T Consensus 382 ~~~d~~~~e~~arp~~vi~~~p~~~~~~~~---~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SG 458 (714) T protein:vir:32 382 QLSDNDLMEQIERPDGIIKLNPVRKNQKSV---ADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSG 458 (714) T ss_pred cccHHHHHHhccCCCCceeecccccccCCC---CccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhH Confidence 666554445556677899988765433322 3467778889999999999999999999999999999999887 599 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCC---ceEEEecccccccCCCceEEeec Q lcl|Aclame:pro 444 ETVNNLMNRADMASFIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGS---DDIAVLSAQVVDRQTGAVVALND 520 (708) Q Consensus 444 ~ai~~~q~q~~~~~~~~~dn~~~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~---~~~v~in~~~~~~~~~~~~~~nD 520 (708) +||+++++||++.+++++|||+.+++++|+++|+||++|||++|++||+|+++. .+++.+|+ .+|.....|| T Consensus 459 vAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~-----~~~~~~~~nD 533 (714) T protein:vir:32 459 VAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNA-----EGDNGELTND 533 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeecc-----ccCcceeccc Confidence 999999999999999999999999999999999999999999999999988654 45777764 4677788999 Q ss_pred cceeeEEEEEeecccchhHHHHHHHHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhhhhcccCcchH Q lcl|Aclame:pro 521 LSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEK 600 (708) Q Consensus 521 i~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~ 600 (708) |++|+|||+|+++|+++|+|+++++.|++|++.++|. +...+.+++++++|+|+++++++++++..+.....++.+++ T Consensus 534 i~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~~~p~--~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e 611 (714) T protein:vir:32 534 ISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQ--VQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPE 611 (714) T ss_pred ceeeeEEEEEeeccCchHHHHHHHHHHHHHHhhcCch--hhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchh Confidence 9999999999999999999999999999999987653 34556778899999999999999999988877777777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHH Q lcl|Aclame:pro 601 EQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQA-------NTVYKLAQARNID 673 (708) Q Consensus 601 ~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qae~~k~~~~~~~~q~e~~~~~~~~~~~~a-------~~~~~~~q~~~~~ 673 (708) ++++++++++.++++++++..+++++..+++|+++++++++.+.+.++++....+..+++ .+.+++...+.++ T Consensus 612 ~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~ 691 (714) T protein:vir:32 612 EQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQNME 691 (714) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhhhh Confidence 777666666667667777777778888888888888887776666655443333322222 2222221222222 Q ss_pred HHHHHHHHHHHHhhhhhhhhhhhcCCC Q lcl|Aclame:pro 674 DKAVMEAIRLLKDVAESQQQQFQSPPQ 700 (708) Q Consensus 674 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 700 (708) ++....+.++.+.+ ++...+.+. T Consensus 692 ~~~~~~~~q~~q~~----~~~~~~~~~ 714 (714) T protein:vir:32 692 QEQDVLQQQMLYTL----QQRMNEMSL 714 (714) T ss_pred hhhHHHHHHHHHHH----HHHHHhcCC Confidence 22222222222222 222222222 No 14 >protein:vir:2764 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612881;genbank:gi:20065798;genbank:GeneID:935623 Probab=100.00 E-value=3.1e-150 Score=840.26 Aligned_cols=660 Identities=16% Similarity=0.178 Sum_probs=517.0 Q ss_pred CCcch-----HHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHH Q lcl|Aclame:pro 1 MAETL-----EKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNR 75 (708) Q Consensus 1 ma~~~-----~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~ 75 (708) |++++ .++|.+++.+|..+.+++.+||.++.+|. .||+|+||++++++.|+.+ |+||+|||+|+|+|++ T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~--~fy~G~Qw~~~~~~~l~~~----g~p~~~~N~i~~~v~~ 81 (714) T protein:vir:27 8 MATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKAC--AYYDGDQLPPEVLQVLKDR----GQPMTIHNLIAPTVDG 81 (714) T ss_pred ccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHH--HhhcCCCCCHHHHHHHHhc----CCCcEEeccHHHHHHH Confidence 88864 45789999999999999999999998874 5789999999999999875 7899999999999999 Q ss_pred HHHHHhcCcceeEEecCCC-cchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCcc Q lcl|Aclame:pro 76 IIAEYRNNRITVKFRPGDR-EASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQR 154 (708) Q Consensus 76 i~g~~~~nr~~~~v~pr~~-~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~ 154 (708) |+|++++||++++|+||++ +++.++|++|+++++++++.|++++++++||+++++||+||+++++++ | .+.++ T Consensus 82 v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~----d--~~~~~ 155 (714) T protein:vir:27 82 VLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNS----D--PFGPE 155 (714) T ss_pred HHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEecccc----C--CCCCC Confidence 9999999999999999974 355689999999999999999999999999999999999999998764 2 34567 Q ss_pred eeeEEeecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCCccccccc------------------------- Q lcl|Aclame:pro 155 IAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDV------------------------- 209 (708) Q Consensus 155 i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~------------------------- 209 (708) |++..+ |+++|||||+|+++|+|||+|||+++|||+++|+++||++++..-.. T Consensus 156 i~i~~v--~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~ 233 (714) T protein:vir:27 156 FKVSTV--SRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWE 233 (714) T ss_pred eEEEec--chhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchh Confidence 777766 68899999999999999999999999999999999999866321000 Q ss_pred --ccccccccCCCCC--ceeEEeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEE Q lcl|Aclame:pro 210 --TSMTSWEYNWFGA--DVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVY 285 (708) Q Consensus 210 --~~~~~~~~~~~~~--~~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~ 285 (708) ........+|.+. .+++|.|+|++..+. +.+.++.+|+++.|+..+...... ...|...+..+.++ +|+ T Consensus 234 ~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~---~~~~~~~~g~~~~~d~~~~~~~~~--~~~g~~~~~~~~~~--rv~ 306 (714) T protein:vir:27 234 EYQSWDRQQNEWLQRERRRVLLQVVYYRTFER---LPVIELSNGRVVAFDKNNLMQAVA--VASGRVQVKVGRVS--RIR 306 (714) T ss_pred hhccccccccccccccccEEEEEEEEEEEEEE---EEeeccCCCceEEeCccCHHHHHH--Hhhcchhhhccccc--eEE Confidence 0001112334443 356667888766553 334467899999998887655433 33455555555554 577 Q ss_pred EEEEecceeee-cCCCCCCCCcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhc Q lcl|Aclame:pro 286 VSVVDGDGFLE-KPRRIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQI 364 (708) Q Consensus 286 ~~~~~~~~il~-~~~~~p~~~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai 364 (708) +++|+|+.+|. +++||||++||||||||++..++|. +||+||.|||+||++|+++|+++|+|+ ++. +++.++++ T Consensus 307 ~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~--~~G~vr~~~d~Qr~~N~~~s~~~~~l~--~~~-~~~~~~a~ 381 (714) T protein:vir:27 307 EAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGE--PYGLISRAIPAQDEVNFRRIKLTWLLQ--AKR-VIMDEDAT 381 (714) T ss_pred EEEEecCcccccCCCCCCCCceeEEEEeeeeeeccCc--eeehhhhchhHHHHHHHHHHHHHHhhc--CCc-eeeecCcc Confidence 88899999885 6899999999999999999866665 799999999999999999999999874 443 56777887 Q ss_pred cchHHHHHhhcccCCceeeecccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccc-hhH Q lcl|Aclame:pro 365 RGLEKHWEARNKKRPAFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSN-IAQ 443 (708) Q Consensus 365 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n-~sg 443 (708) ...++...+....++++++++++....... +.++.+.+++++|+++++|++.+.++|+++||+|++++|..+| +|| T Consensus 382 ~~~d~~~~e~~arp~~vi~~~p~~~~~~~~---~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SG 458 (714) T protein:vir:27 382 QLSDNDLMEQIERPDGIIKLNPVRKNQKSV---ADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSG 458 (714) T ss_pred cccHHHHHHhccCCCCceeecccccccCCC---CccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhH Confidence 666554445556677899988765433322 3467778889999999999999999999999999999999887 599 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCC---ceEEEecccccccCCCceEEeec Q lcl|Aclame:pro 444 ETVNNLMNRADMASFIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGS---DDIAVLSAQVVDRQTGAVVALND 520 (708) Q Consensus 444 ~ai~~~q~q~~~~~~~~~dn~~~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~---~~~v~in~~~~~~~~~~~~~~nD 520 (708) +||+++++||++.+++++|||+.+++++|+++|+||++|||++|++||+|+++. .+++.+|+ .+|.....|| T Consensus 459 vAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~-----~~~~~~~~nD 533 (714) T protein:vir:27 459 VAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNA-----EGDNGELTND 533 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeecc-----ccCcceeccc Confidence 999999999999999999999999999999999999999999999999988654 45777764 4677788999 Q ss_pred cceeeEEEEEeecccchhHHHHHHHHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhhhhcccCcchH Q lcl|Aclame:pro 521 LSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEK 600 (708) Q Consensus 521 i~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~ 600 (708) |++|+|||+|+++|+++|+|+++++.|++|++.++|. +...+.+++++++|+|+++++++++++..+.....++.+++ T Consensus 534 i~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~~~p~--~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e 611 (714) T protein:vir:27 534 ISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQ--VQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPE 611 (714) T ss_pred ceeeeEEEEEeeccCchHHHHHHHHHHHHHHhhcCch--hhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchh Confidence 9999999999999999999999999999999987653 34556778899999999999999999988877777777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHH Q lcl|Aclame:pro 601 EQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQA-------NTVYKLAQARNID 673 (708) Q Consensus 601 ~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qae~~k~~~~~~~~q~e~~~~~~~~~~~~a-------~~~~~~~q~~~~~ 673 (708) ++++++++++.++++++++..+++++..+++|+++++++++.+.+.++++....+..+++ .+.+++...+.++ T Consensus 612 ~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~ 691 (714) T protein:vir:27 612 EQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQNME 691 (714) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhhhh Confidence 777666666667667777777778888888888888887776666655443333322222 2222221222222 Q ss_pred HHHHHHHHHHHHhhhhhhhhhhhcCCC Q lcl|Aclame:pro 674 DKAVMEAIRLLKDVAESQQQQFQSPPQ 700 (708) Q Consensus 674 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 700 (708) ++....+.++.+.+ ++...+.+. T Consensus 692 ~~~~~~~~q~~q~~----~~~~~~~~~ 714 (714) T protein:vir:27 692 QEQDVLQQQMLYTL----QQRMNEMSL 714 (714) T ss_pred hhhHHHHHHHHHHH----HHHHHhcCC Confidence 22222222222222 222222222 No 15 >protein:vir:104437 Length: 714 # NCBI annotation: putative phage portal protein # Family: family:all:487 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794061;genbank:gi:116222006;genbank:GeneID:4397502 Probab=100.00 E-value=1.6e-147 Score=825.32 Aligned_cols=660 Identities=16% Similarity=0.173 Sum_probs=513.0 Q ss_pred CC------------cchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecc Q lcl|Aclame:pro 1 MA------------ETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINK 68 (708) Q Consensus 1 ma------------~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~ 68 (708) |+ .+...++.+++.+|.++.+.+.+||.++.+|. .||+|+||++++++.|+.+ |+||+|||+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~r~~a~~d~--~fy~G~Qw~~~~~~~l~~~----g~p~~~~N~ 74 (714) T protein:vir:10 1 MKNEINTTAMKNDHGSTPRFSQRQLLSLCSDIDSQPLWRDAANKAC--AYYDGDQLAPEVIQVLKDR----GQPMTIHNL 74 (714) T ss_pred CCcCcCcccCCCcchhhhhhhHHHHHHHHHHHhhhHHHHHHHHHHH--HhhcCCCCCHHHHHHHHhc----CCCcEEecc Confidence 22 13445788999999999999999999998875 5689999999999999875 789999999 Q ss_pred hHHHHHHHHHHHhcCcceeEEecCCC-cchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCC Q lcl|Aclame:pro 69 VATELNRIIAEYRNNRITVKFRPGDR-EASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYD 147 (708) Q Consensus 69 i~~~i~~i~g~~~~nr~~~~v~pr~~-~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d 147 (708) |+|+|++|+|++++||++++|+||++ +++.++|++|+++++++++.|++++++++||+++++||+||++++++++ T Consensus 75 i~~~v~~v~g~~~~nr~~~~v~pr~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~d~d---- 150 (714) T protein:vir:10 75 IAPTVDGVLGMEAKTRTDLIVMSDDPNDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSE---- 150 (714) T ss_pred HHHHHHHHHHHHHhCCcceEEecCCCChhhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcccceEEeeeccC---- Confidence 99999999999999999999999975 3456899999999999999999999999999999999999999988862 Q ss_pred CCCCCcceeeEEeecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCCccccccc------------------ Q lcl|Aclame:pro 148 PMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDV------------------ 209 (708) Q Consensus 148 ~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~------------------ 209 (708) .+.++|+++.| |+++|||||+|+++|+|||+|||+++|||+++++++||++++..-.. T Consensus 151 --~~~~~i~i~~v--~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fp~~a~~i~~~~~~~~~~~~~~~~~~~~~ 226 (714) T protein:vir:10 151 --PFGPEFKVSTV--SRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPS 226 (714) T ss_pred --CCCCCeEEEec--ChhheeeccccccCChhhhhhhhhhccCCHHHHHHhcCCchhhhhccchhhcCcccchhhhhhcc Confidence 34567888877 67899999999999999999999999999999999999866421100 Q ss_pred cc---------ccccccCCCCC--ceeEEeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhhee Q lcl|Aclame:pro 210 TS---------MTSWEYNWFGA--DVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRS 278 (708) Q Consensus 210 ~~---------~~~~~~~~~~~--~~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 278 (708) .. .......|.+. ++++|.|+|++..+. +.|.++.+|+++.|+..+...... ...|...+..+. T Consensus 227 ~~~~~~~~~~~~~~~~~~~~~~~~~rV~v~E~w~k~~~~---~~~~~~~~g~~~~~d~~~~~~~~~--~~~g~~~~~~~~ 301 (714) T protein:vir:10 227 PLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFER---LPVIELSNGRVVAFDKNNLMQAVA--VASGRVQVKVGR 301 (714) T ss_pred cccccchhhcccccccccccccCcceEEEEEEEEeEEEE---EEeecCCCCCeeeeCccCHHHHHH--HHhccceecccc Confidence 00 00011234333 457788988876553 445568899999999877665443 234444444444 Q ss_pred eeeEEEEEEEEecceeee-cCCCCCCCCcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCce Q lcl|Aclame:pro 279 VKRRRVYVSVVDGDGFLE-KPRRIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIP 357 (708) Q Consensus 279 ~~~~~v~~~~~~~~~il~-~~~~~p~~~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~ 357 (708) + +||+|++|+|+++|+ +++||||++||||||||++..++|. +||+||.|||+||++|+++|+++|+|+.+ ++ T Consensus 302 ~--~rv~~~~~~g~~~L~~~~~p~p~~~fp~vP~~g~~~~~~g~--~~G~vr~~~d~Qr~~N~~~s~~~~~l~~~---~~ 374 (714) T protein:vir:10 302 V--SRIREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGE--PYGLISRAIPAQDEVNFRRIKLTWLLQAK---RV 374 (714) T ss_pred e--eeEEEEEEecchhhhcCCCCCCCCceeeEEecceeeeccCc--cceehhhhhhHHHHHHHHHHHHHHHHhCC---ce Confidence 3 469999999999885 6899999999999999999866654 89999999999999999999999987433 56 Q ss_pred eechhhccchHHHHHhhcccCCceeeecccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHccc Q lcl|Aclame:pro 358 IVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM 437 (708) Q Consensus 358 i~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~ 437 (708) ++.+|++....+...+....++++++++++....... +.++...+++++|+++++|++.+..+|+++|||+++++|. T Consensus 375 ~~~~gav~~~d~~~~e~~~rp~~vi~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~ 451 (714) T protein:vir:10 375 IMDEDATQLSDNDLMEQLERPDGIIKLNPVRKNQKSV---ADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQ 451 (714) T ss_pred eeccccccccHHHHHHhccCCCCeEEecccccccCCc---cccccccCCCCCcHHHHHHHHHHHHHHHHhhCCCHHHcCC Confidence 8888988765543334445666888887765433222 2467778889999999999999999999999999999999 Q ss_pred ccc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCC---CceEEEecccccccCCC Q lcl|Aclame:pro 438 PSN-IAQETVNNLMNRADMASFIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDG---SDDIAVLSAQVVDRQTG 513 (708) Q Consensus 438 ~~n-~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~~~~~l~li~~~y~~~r~irI~~~~~---~~~~v~in~~~~~~~~~ 513 (708) .+| +||+||+++++||++.+.+++|||+++++++|+++|+||++|||++|++||+|+++ ..+++.+|.. .+ T Consensus 452 ~~na~SGvAI~~r~~qg~~~l~~~~dnl~~~~~~~g~~ll~li~~~~~~~rv~RI~~e~~~~~~~~~~~~n~~-----~~ 526 (714) T protein:vir:10 452 DSGATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAE-----GD 526 (714) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcccceeEeeccc-----cC Confidence 877 59999999999999999999999999999999999999999999999999998865 4577877754 45 Q ss_pred ceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhhhhc Q lcl|Aclame:pro 514 AVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGI 593 (708) Q Consensus 514 ~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~ 593 (708) ...+.|||++|+|||+|+++|+++++|+++++.|++++++++|. +...+.+++++++|+|+++++++++++.++.... T Consensus 527 ~~~~~nDi~~~~~dv~i~~~p~~~s~r~~~~~~l~ql~~~~~p~--~~~~~~~~~le~~d~p~~~ei~~~ir~~~~~~~~ 604 (714) T protein:vir:10 527 NGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQ--VQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKS 604 (714) T ss_pred CccccccceeeeEEEEEeeccCcHHHHHHHHHHHHHHHhhcCch--hhhhHHHHHHHhcCCcCHHHHHHHHHHHcCCCCC Confidence 55678999999999999999999999999999999999987653 4456778889999999999999999998877766 Q ss_pred ccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHH Q lcl|Aclame:pro 594 AKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQ-------ANTVYKL 666 (708) Q Consensus 594 ~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qae~~k~~~~~~~~q~e~~~~~~~~~~~~-------a~~~~~~ 666 (708) .++.++++++++.++++.++++.+++..+++++..+++|+++++++++.+.+.++++...++..+. +.+..++ T Consensus 605 ~~~~~~e~q~~q~~~~~~~~~q~~l~~~e~~a~~~k~eaea~~~~aqa~~~~~~a~~~~~~~~~q~~~~~~~~a~~a~~l 684 (714) T protein:vir:10 605 PDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEII 684 (714) T ss_pred ccccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 666666666666666666666666677777888888888888887777665555544333222221 1111111 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhhhhhhhcCCC Q lcl|Aclame:pro 667 AQARNIDDKAVMEAIRLLKDVAESQQQQFQSPPQ 700 (708) Q Consensus 667 ~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 700 (708) .....+.++.....+++++.+++ ...+.+- T Consensus 685 ~~~~~~~q~~~~~~q~~~q~~~~----~~~~~~~ 714 (714) T protein:vir:10 685 TGVQNMEQEQDVLQQQMLYTLQQ----RMNEMSL 714 (714) T ss_pred HHHHhhhhhHHHHHHHHHHHHHH----HHHhcCC Confidence 12222222222222222222222 2222222 No 16 >protein:vir:93630 Length: 776 # NCBI annotation: Bcep22gp51 # Family: family:all:487 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944280;genbank:gi:38640357;genbank:GeneID:2658279 Probab=100.00 E-value=2.6e-142 Score=796.79 Aligned_cols=657 Identities=16% Similarity=0.164 Sum_probs=467.5 Q ss_pred CCc-chHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHHHH Q lcl|Aclame:pro 1 MAE-TLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAE 79 (708) Q Consensus 1 ma~-~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~g~ 79 (708) +.+ +..++|++++.+|+++.+.+.+||.++.+|.+ ||+|+||+++++++|+.+ |+||+|||+|+++|++|+|+ T Consensus 38 ~~~~~~~~~~~~l~~~~~~~~~~~~~~r~~a~~d~~--fy~G~Qw~~~~~~~l~~~----g~p~~~~N~i~~~i~~v~g~ 111 (776) T protein:vir:93 38 LDSEQAVELHSRLLSYYRQELSRQQDNRAEMAVDED--YYDNIQWSQDEIDELKER----GQAPTVYNVISQSVNWIIGS 111 (776) T ss_pred CCCHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHH--HhCCCCCCHHHHHHHHhc----CCceEEecchHHHHHHHHHH Confidence 332 55668999999999999999999999998854 688999999999999875 78999999999999999999 Q ss_pred HhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCcceeeEE Q lcl|Aclame:pro 80 YRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAIEP 159 (708) Q Consensus 80 ~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~i~~ 159 (708) +++||++++|+|++ ++|.++|++||++++++++.|++++++++||+++++||+||++|+++|+.++ ++.+. T Consensus 112 ~~~nr~~~~~~p~~-~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~af~d~~~~G~G~~~v~~d~~~~~-------~~~~~- 182 (776) T protein:vir:93 112 EKRGRSDFKVLPRR-KDGGKAAERKTALLKYLSDVNHTPFERSMAFEETTKAGIGWLESQVQDENDG-------EPIYA- 182 (776) T ss_pred HHhCCcceEEecCC-hhHHHHHHHHHHHHHHHHHhhcHHHHHHHHHHHhhhcCcceEEEEeeccCCC-------CceEe- Confidence 99999999999996 5899999999999999999999999999999999999999999999875432 23333 Q ss_pred eecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCCccccccccc---------------------------c Q lcl|Aclame:pro 160 IYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTS---------------------------M 212 (708) Q Consensus 160 v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~---------------------------~ 212 (708) .++++++|||||+|+++|++||+|||+++|||+++|+++||++++...+... . T Consensus 183 ~~~~p~~i~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 262 (776) T protein:vir:93 183 GAESWRNILWDSTYRRLDMDDCRYIFRVKWVDLDVMLAIFPERAAQLRAAAVDNFETWGTDDIDGDDAMDSPEYERSMNS 262 (776) T ss_pred eccChhheeeccccccCCHHHHhhhhhhccCCHHHHHHhcCCchHHHHHhhhhcccccchhccccccccccccccccccc Confidence 2346789999999999999999999999999999999999987643111000 0 Q ss_pred cccccCCCCCceeEEeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecc Q lcl|Aclame:pro 213 TSWEYNWFGADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGD 292 (708) Q Consensus 213 ~~~~~~~~~~~~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~ 292 (708) ....+.+...++++|+|||+|+.+...++... ...+..+.+... ...+......|...+..+. ..+|+|+++.|. T Consensus 263 ~~~~~~~~~~~~v~v~E~~~r~~~~~~~~~~~-~~~~~~~~~d~~--~~~~~~~~~~g~~~~~~~~--~~~v~~~~~~g~ 337 (776) T protein:vir:93 263 VTAGAVAYARKRVRMIEAWFRMPVRVQRLKGR-NSDFRGEVFDPN--DERHVLEVESGRAVLAVSP--MMRMHCAIMTTR 337 (776) T ss_pred ccccccccCCCeEEEEEEEEeeeeehhhcccc-cccccceeeccc--chHHHHHhhcCceeehhee--eeeeEEEEEecc Confidence 01112333456788888887776554333221 112222333322 2223334455555554444 347888889988 Q ss_pred eeee-cCCCCCCCCcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHH Q lcl|Aclame:pro 293 GFLE-KPRRIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHW 371 (708) Q Consensus 293 ~il~-~~~~~p~~~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~ 371 (708) .+|+ +++||||++|||||+||++... +++|||+||.|+|+||++|+++|+++|+|+ +.++++++|++++.++.+ T Consensus 338 ~~l~~~~~p~~~~~~Pfv~~~~~~~~~--~~~~~G~v~~~~d~Q~~~N~~~s~~~~~l~---~~~~~~~~gav~~~d~~~ 412 (776) T protein:vir:93 338 DLMWAGPSPYRHNRYPFTPIWGFRRAR--DGMPYGVIRFMRGMQDDVNKRLSKALYILS---TNKVLMEEGAVDDIDEFR 412 (776) T ss_pred hhhhccCCCCCCCccceEEecCceecc--cccccchHHhhhHHHHHHHHHHHHHHHhhc---CCceeeccccccchHHHH Confidence 8775 6799999999999999998754 557999999999999999999999999985 347999999999999988 Q ss_pred HhhcccCCceeeecccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccc-hhHHHHHHHH Q lcl|Aclame:pro 372 EARNKKRPAFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSN-IAQETVNNLM 450 (708) Q Consensus 372 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n-~sg~ai~~~q 450 (708) ++.++ +++++.+++.... .+.+...+.+++++++|++++..+|+++|||+++++|..+| +||+||++++ T Consensus 413 ~~~~r-p~~vi~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~i~~~tGi~~~~~G~~~n~~Sg~ai~~~~ 482 (776) T protein:vir:93 413 REAAR-PDAVMTVKNGKLG---------AVKMDVDRDLAPAHLELASRSIQMIQQVGGVTDEMLGRTTNAVSGVAIQARQ 482 (776) T ss_pred Hhccc-CCceeeeCCcccc---------ccccccCcCccHHHHHHHHHHHHHHHHhhCcChHHhCCCcchhhHHHHHHHH Confidence 87654 5566666543211 23445567899999999999999999999999999999877 6999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEE Q lcl|Aclame:pro 451 NRADMASFIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTV 530 (708) Q Consensus 451 ~q~~~~~~~~~dn~~~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v 530 (708) ++|++++.+++|||+++++++|+++|+||++|||++|+|||+|++|..+||.||... +.||+++|+|||+| T Consensus 483 ~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~~~~~~~~~v~in~~~---------~~nd~~~~~~dv~v 553 (776) T protein:vir:93 483 EQGSVATNKLFDNLRLAFQQHGEKELSLIEQYMTEEKQFRITNSRGNPEYVTVNDGL---------PENDITRTKADFII 553 (776) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcceEEEEeecCCCcceEEecccc---------hhhhhccceeeEEE Confidence 999999999999999999999999999999999999999999999999999999643 46999999999999 Q ss_pred eecccchhHHHHHHHHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHH Q lcl|Aclame:pro 531 DVGPSYTARRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQM 610 (708) Q Consensus 531 ~~~~~~~~~r~~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq 610 (708) ++||+++++|+++++.|+++++.++|. ....+.+.+++++++|++++++++++...+..........+++ +++++ T Consensus 554 ~~~~~~~s~r~~~~~~l~ql~~~~~p~--~~~~~~~~~~e~~d~p~~~e~~~~l~~~~~~~~p~q~~~~~e~---~~~qq 628 (776) T protein:vir:93 554 DEAEWRATMRQAAVAELMEVIGKMPPE--IALTMLDLLVENMDIPNRDELVKRIRAVNGQKDPDQDEPTPEE---IAREQ 628 (776) T ss_pred eecccchhHHHHHHHHHHHHHhhcChh--hHHHHHHHHHHhcCccchHHHHHHHHHhhcccccchhhcchhH---HHHHH Confidence 999999999999999999999887653 4456777889999999999999999876543322221111111 11112 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 611 AAQSQPNPEMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVY----KLAQARNIDDKAVMEAIRLLKD 686 (708) Q Consensus 611 ~qq~~~~~~~~~aq~~~~~~qae~~k~~~~~~~~q~e~~~~~~~~~~~~a~~~~----~~~q~~~~~~~~~~~~~~~~~~ 686 (708) .+++.++.++..+++++...++++.++++++.+.+.++.+.+.++......+.+ ...+...+...+.... +.+.. T Consensus 629 ~q~~~~q~q~~~~~a~~~~~qa~a~~~~aea~~~~aqa~~~~~~a~~~~~~a~q~a~qa~~~~~~~~~~a~~a~-~~~~~ 707 (776) T protein:vir:93 629 AQQQQQQYNDALAIATLEEQQAKARKAAAEAQVAEAKAKHISRMAIREGVGAVKDATDAATAIAFMPELAGLSD-GILRE 707 (776) T ss_pred HhhHHHHHHHHHhhhhhhHhhHHHHHHHHHHHHHhhhhhhhhhcchhhhhhhhhhhhhhhhhhhhhhhhhhhhh-hhhcc Confidence 222222222222222222223333333333222222222211111111111111 1111111111111000 00000 Q ss_pred hhhhhhhhhh--------cCCCCCCCCCCC Q lcl|Aclame:pro 687 VAESQQQQFQ--------SPPQSPADLMPS 708 (708) Q Consensus 687 ~~~~~~~~~~--------~~~~~~~e~~~~ 708 (708) . +...+..+ .+|.++.+..|+ T Consensus 708 a-~~~~p~~p~~~~~~~~~~~~~~~p~~p~ 736 (776) T protein:vir:93 708 S-GWDDPNTPQPASAASGMPPAPAQPAQPA 736 (776) T ss_pred c-cccccccccccccccCCCCCCCCCCCCC Confidence 0 00000000 011111111111 No 17 >protein:vir:95821 Length: 763 # NCBI annotation: 94 kDa protein # Family: family:all:1548 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950537;genbank:gi:119952228;genbank:GeneID:5075648 Probab=100.00 E-value=2.3e-83 Score=473.64 Aligned_cols=623 Identities=12% Similarity=0.088 Sum_probs=386.5 Q ss_pred CCc-chHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHHHH Q lcl|Aclame:pro 1 MAE-TLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAE 79 (708) Q Consensus 1 ma~-~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~g~ 79 (708) .-+ +.+..++.+...+..+..+....+.++-.-+.++||.|+.=+ ...+||.-++...++..|+|+++. T Consensus 20 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~grs~vv~~~v~~~ve~~~~~ 89 (763) T protein:vir:95 20 LTSWKNELSLQALKADLDAAKPSHTAMMIKVKEWNDLMRIEGKAKP----------PKVKGRSQVQPKLVRRQAEWRYSA 89 (763) T ss_pred CCCCCChHHHHHHHHHHHhhhcchhHHHHHHHHHHHhhhccccCcc----------cccCCCccccCHHHHHHHHHHHHH Confidence 222 456778888888888877777666665444556677776521 123477788888999999999988 Q ss_pred Hhc---Ccce-eEEecCCCcchHHHHHHHHHHHHHH-HHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCC------- Q lcl|Aclame:pro 80 YRN---NRIT-VKFRPGDREASEELANKLNGLFRAD-YEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYD------- 147 (708) Q Consensus 80 ~~~---nr~~-~~v~pr~~~~d~~~A~~l~~~~~~~-~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d------- 147 (708) .+. +..+ +.|.|++ .+|.+.|+..|.+++|+ ...|+.....+++|+++|++|+|+++|.|+.+.... T Consensus 90 l~~~f~~~~~~~~~~P~~-~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~gv~k~~W~~~~~~~~~~~~~~ 168 (763) T protein:vir:95 90 LTEPFLGSNKLFKVTPVT-WEDVQGARQNELVLNYQFRTKLNRVSFIDNYVRSVVDDGTGIVRVGWNREIRKEKQEVPVF 168 (763) T ss_pred HHHhhcCCCcEEEEecCC-cchHHHHHHHHHHHHHHHhhcCchhhHHHHHHHHHhhcCcceEEEeeeeeeeeeeeeehhh Confidence 777 4444 5999997 57999999999999995 778889899999999999999998888776322110 Q ss_pred -----------------------------------C--------CCC-------------------CcceeeEEeecchh Q lcl|Aclame:pro 148 -----------------------------------P--------MDD-------------------RQRIAIEPIYDPSR 165 (708) Q Consensus 148 -----------------------------------~--------~~~-------------------~~~i~i~~v~~~~~ 165 (708) + ..+ .+.++|+. +|+. T Consensus 169 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~p~ie~--V~p~ 246 (763) T protein:vir:95 169 SLFPIQTQEQADALQQALQLRTDNPRGYEENVDEAIKESVRFFDETGQATYAVQTGTTTTEVEVPLANHPTVEM--LNPE 246 (763) T ss_pred hhccccchhHHHHHHHHHHhhhhhhccccccccchhhhhhhhccccCcceeeecccceeEEEEEEecCceEEEe--ecHH Confidence 0 000 01223333 4788 Q ss_pred heecCCccccCChhccCeEEEeecCCHHHHHHh-CCCCcccccccccccccccCCCCCceeEEeeeeeecceEEEEEEEe Q lcl|Aclame:pro 166 SVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAE-YGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVRKESVDVISYR 244 (708) Q Consensus 166 ~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~-~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~~~~~~~~~~~ 244 (708) +|||||.|++ |++||+||+++.++|+++|.++ |+....+.++..................-..+.+..++++.+++|| T Consensus 247 d~~iDp~a~s-D~~Da~~~~~~~~~t~~dL~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~V~v~E~y 325 (763) T protein:vir:95 247 NIIIDPSCQG-DINKAMFAIVSFETCKADLLKEKDRYHNLNKIDWQSSAPVNEPDHATTTPQEFQISDPMRKRVVAYEYW 325 (763) T ss_pred HheecCCCCC-chhhCceEeeEEeccHHHHHhccCCccccchhcchhccccccccccccchhhccCCCcccceEEEEEee Confidence 9999998887 8999999999999999999887 3322212222221111111100001111112222233455555555 Q ss_pred cCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeee-cCCCCCCCCcceeeEEEeeeccCCccc Q lcl|Aclame:pro 245 HPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLE-KPRRIPGEHIPLIPVYGKRWFIDDIER 323 (708) Q Consensus 245 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~-~~~~~p~~~~p~~p~~~~~~~~d~~~~ 323 (708) .+. .+.++ ...++++.++.|..+|. ..+||||+.|||+++.+++. .++++ T Consensus 326 ~~~-----d~~gd----------------------g~~~~~~v~~~g~~iL~~~~~p~~~~~~PFv~~~~~p~--~~~~~ 376 (763) T protein:vir:95 326 GFW-----DIEGN----------------------GVLEPIVATWIGSTLIRLEKNPYPDGKLPFVLIPYMPV--KRDMY 376 (763) T ss_pred eee-----ccCCc----------------------ceeEEEEEEEEcCeeeecccccccCCCcCEEEecceee--cCccc Confidence 321 01110 11245556677777665 67899999999998888774 67889 Q ss_pred ccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeeccccccccccccccccccc Q lcl|Aclame:pro 324 VEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKSGNIIAGATPAGY 403 (708) Q Consensus 324 ~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 403 (708) |+|+++.++|+|+++|+++|+++|+++++++++|++++|+++.. +.....+++++.++++.+.. ..+.. T Consensus 377 G~gi~~~~~d~Qr~~N~~~~~~~d~l~~~~~~~~~v~~gav~~~----d~~~~~pg~v~~v~~g~~~~-------~~~~~ 445 (763) T protein:vir:95 377 GEPDAELLGDNQAVLGAVMRGMIDLLGRSANGQRGMPKGMLDAL----NSRRYREGEDYEYNPTQNPA-------QMIIE 445 (763) T ss_pred CCchHHHhhHHHHHHHHHHHHHHHHHHhhcCCcEEeecccccch----hhhcccCCceEEeeCCCChh-------hhccc Confidence 99999999999999999999999999999999999999987543 23345677778777654332 23444 Q ss_pred ccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccc---hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 404 TQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSN---IAQETVNNLMNRADMASFIYLDNMAKSLKRAGEVWLSMAR 480 (708) Q Consensus 404 ~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n---~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~~~~~l~li~ 480 (708) ..++.+++..+.++++....++.+|||++.++|..++ .++++++.+++++++++..+++||+++++++|+++++||+ T Consensus 446 ~~~p~~~~~~~~~l~~~~~~~e~~TGv~~~~~G~~~~~~~~tat~v~~l~qa~~~~~~~~~r~~~~~~k~l~~~~l~Li~ 525 (763) T protein:vir:95 446 HKFPELPQSALTMATLQNQEAESLTGVKAFAGGVTGESYGDVAAGIRGVLDAASKREMAILRRLAKGMSEIGNKIIAMNA 525 (763) T ss_pred ccCCCCcchHHHHHHHHHHHHHHhhCcchhhcCcCcccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 5677889999999999999999999999999996543 3566789999999999999999999999999999999999 Q ss_pred HhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHHHhccccCch Q lcl|Aclame:pro 481 EVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLPTDPM 560 (708) Q Consensus 481 ~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq~~~~~~p~ 560 (708) +|||++|+|||+|+ +|+.+++.. ..|+|||+|++++ ++.++++++.|++|++.+++..++ T Consensus 526 q~~d~~rviRI~g~----e~v~v~~~~--------------~~~~~DV~V~~~~--as~~~q~~~~l~~ll~~l~~~~~~ 585 (763) T protein:vir:95 526 VFLAEHEVVRITNE----EFVTIKRED--------------LKGNFDLEVDIST--AEVDNQKSQDLGFMLQTIGPNVDQ 585 (763) T ss_pred hhCCCCcEEEEeCC----ccccccHHH--------------hcCCcceEEeccc--chHHHHHHHHHHHHHHHhccccCh Confidence 99999999999996 577776533 3578999999976 466778888899999988876553 Q ss_pred hHHHHHHHHhhccchhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 561 RPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQKATNE 640 (708) Q Consensus 561 ~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qae~~k~~~~ 640 (708) . ....++...+++....++.+.++...++ |++.+ +++++. ++.+.+.+.+..++++++.++++....++++ T Consensus 586 ~-~~~~il~~~~d~~~~~~~~~~lr~~q~~-----~d~~~-q~qaql--e~~~~q~e~~~~~akaq~~qaqa~~~~aq~e 656 (763) T protein:vir:95 586 Q-ITLNILAEIADLKRMPKLAHDLRTWQPQ-----PDPVQ-EQLKQL--AVEKAQLENEELRSKIRLNDAQAQKAMAERD 656 (763) T ss_pred H-HHHHHHHHHHhhhchhhhHHHHHhcCCC-----ccchh-hhHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2 2233445666777676777777654321 11111 111111 1111111122222222222222222222222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHhhhhhhhhhhhcCCCC-----------CC---- Q lcl|Aclame:pro 641 TAQTQIKAFTAQQDAMESQANTVYKLAQA--RNIDDKAVMEAIRLLKDVAESQQQQFQSPPQS-----------PA---- 703 (708) Q Consensus 641 ~~~~q~e~~~~~~~~~~~~a~~~~~~~q~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~---- 703 (708) ..+.++.+...+.+....... .....++ +.+...+..+...-. ....+++....+.|-+ .. T Consensus 657 ~~~~d~~~~e~~~Q~~~e~~~-~~~~~eaq~~l~~~~a~~~~~~ea-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 734 (763) T protein:vir:95 657 NKNLDYLEQESGTKHARDLEK-MKAQSQGNQQLEITKALTKPRKEG-ELPPNLSAAIGYNALTNGEDTGIQSVSERDIAA 734 (763) T ss_pred HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHh-ccChhHHHhhhhcccccccCCCccchhhcccCc Confidence 222222221111111100000 0000000 000000111100000 0000001000001101 11 Q ss_pred -----------CCCCC Q lcl|Aclame:pro 704 -----------DLMPS 708 (708) Q Consensus 704 -----------e~~~~ 708 (708) ..=|| T Consensus 735 ~~~~~~~~~~~~~~~~ 750 (763) T protein:vir:95 735 EANPAYSLGSSQFDPT 750 (763) T ss_pred cccccccCCCCCCCCC Confidence 11112 No 18 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=100.00 E-value=5e-81 Score=460.80 Aligned_cols=616 Identities=15% Similarity=0.090 Sum_probs=370.1 Q ss_pred CCcc-------hHHHHHHHHHHHHHHHHhhHH-HHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHH Q lcl|Aclame:pro 1 MAET-------LEKKHERIMLRFDRAYSPQKE-VREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATE 72 (708) Q Consensus 1 ma~~-------~~~~~~~~~~~~~~~~~~~~~-~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~ 72 (708) ||-+ .++++..+...++.|.++... ...++.+. +.||.|++|+.. ..|+..++.|.|... T Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~--~~~y~g~~~~~~----------~~~~s~~~~~~v~~~ 68 (705) T protein:vir:88 1 MAKRRKIKPMDDEQVLRHLDQLVNDALDFNSSELSKQRSEA--LKYYFGEPFGNE----------RPGKSGIVSRDVQET 68 (705) T ss_pred CCcccccccCCHHHHHHHHHHHHHHHHhhhhhHHHHHHHHH--HHHHhCCCCCcc----------cCCCCccccHHHHHH Confidence 6632 456778888888888887663 33455544 467889999653 246778889999999 Q ss_pred HHHHHHHHhc----CcceeEEecCCCcchHHHHHHHHHHHHHH-HHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccC- Q lcl|Aclame:pro 73 LNRIIAEYRN----NRITVKFRPGDREASEELANKLNGLFRAD-YEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEY- 146 (708) Q Consensus 73 i~~i~g~~~~----nr~~~~v~pr~~~~d~~~A~~l~~~~~~~-~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~- 146 (708) |++++++... +..-+.|.|+. .+|.++|++++.+++|+ .+.|+.....+++|+++++||+||++|.|+..... T Consensus 69 v~~~~~~l~~~~~~~~~~~~~~p~~-~~D~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~dal~~g~gi~kv~we~~~~~~ 147 (705) T protein:vir:88 69 VDWIMPSLMKVFTSGGQVVKYEPDT-AEDVEQAEQETEYVNYLFMRKNEGFKVMFDWFQDTLMMKTGVVKVYVEEVLKPT 147 (705) T ss_pred HHHHHHHHHHhhcCCCceEEEeeCC-hhHHHHHHHHHHHHhHHHhhccchhHHHHHHHHHHhhcCCeEEEeccccccchh Confidence 9999887664 67789999997 57899999999999995 88888999999999999999999999998643221 Q ss_pred -----------------CC----------CCC-----------CcceeeEEeecchhheecCCccccCChhccCeEEEee Q lcl|Aclame:pro 147 -----------------DP----------MDD-----------RQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMY 188 (708) Q Consensus 147 -----------------d~----------~~~-----------~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~ 188 (708) +| ..+ .+.+++..| ||.+|||||+|+ +++|++|++++. T Consensus 148 ~e~~~~~~~~~l~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i~~V--~p~d~~~dp~a~--~~~d~~~~~~~~ 223 (705) T protein:vir:88 148 FERFSGLSEDMVADILSDPDTSILAQSVDDDGTYTIKIRKDKKKREIKVLCV--KPENFLVDRLAT--CIDDARFLCHRE 223 (705) T ss_pred hhhhccCChhhhhhhhhhhhhhcccccccccceeeeEEeeeeecCceeeeec--cHHHceecCCCC--CcccCcEEEEEE Confidence 11 000 134555554 788999999987 577999999999 Q ss_pred cCCHHHHHHhCCCCcc-cccccccccc--------cccCCCCCceeEEeeeeee-cceEEEEEEEecCccCceeEecCCc Q lcl|Aclame:pro 189 SLSPEKYEAEYGKKPP-TSLDVTSMTS--------WEYNWFGADVIYIAKYYEV-RKESVDVISYRHPITGEIATYDSDQ 258 (708) Q Consensus 189 ~~~~~e~~~~~p~~~~-~~~d~~~~~~--------~~~~~~~~~~~~v~e~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 258 (708) ++|++++.+++++... ..+....... ....+.......+.++|+. ....+.+++|+.+.. +..+ T Consensus 224 ~~t~~dl~~~g~~~~~~~~~~~~~~~~~~~~~e~~~~~~~d~~~~~~~~~~~~~~~~r~v~~~E~y~~~d-----~~~d- 297 (705) T protein:vir:88 224 KYTVSDLRLLGVPEDVIEELPYDEYEFSDSQPERLVRDNFDMTGQLQYNSGDDAEANREVWASECYTLLD-----VDGD- 297 (705) T ss_pred eccHHHHHhhcCChhHhhhhhcccccchhhhhhhccccccccccccccccccccCCceeEEEEEeeeEec-----ccCC- Confidence 9999999988654321 1111000000 0011111122333444443 334455666653211 1111 Q ss_pred ccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEeeeccCCcccccchHHhhhHHHHHH Q lcl|Aclame:pro 259 VEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLY 338 (708) Q Consensus 259 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~ 338 (708) ...++++.++.|+.++.. . |.+++||+.+.+++ +.++++|+|+++.++|+|+.+ T Consensus 298 ---------------------~~~~~~~~~~~g~~il~~-~--~~~~~PF~~~~~~p--~~~~~~G~g~~~~~~d~Q~~~ 351 (705) T protein:vir:88 298 ---------------------GISELRRILYVGDYIISN-E--PWDCRPFADLNAYR--IAHKFHGMSVYDKIRDIQEIR 351 (705) T ss_pred ---------------------cceeeEEEEEeCcccccc-c--cCCCCCEEEeccee--ecCccccCChHHHHhHHHHHH Confidence 112566777888888764 2 45778888765554 567889999999999999999 Q ss_pred HHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeecccccccccccccccccccccCccchHHHHHHHH Q lcl|Aclame:pro 339 NLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQ 418 (708) Q Consensus 339 N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 418 (708) |+++|+++++++++++++++++.|+++..+ .....+++++.++.. ..+.+++++.+|++.++|++ T Consensus 352 n~~~~~~~d~~~~~~~~~~~~~~g~v~~~d----~~~~~pg~vv~~~~~-----------~~i~~~~~~~~~~~~~~ll~ 416 (705) T protein:vir:88 352 SVLMRNIMDNIYRTNQGRSVVLDGQVNLED----LLTNEAAGIVRVKSM-----------NSITPLETPQLSGEVYGMLD 416 (705) T ss_pred HHHHHHHHHHHHhccCCceeccccccCccc----ccccCCCeeEEecCC-----------CccccccCCcCcHHHHHHHH Confidence 999999999999999999999999875432 334556666655431 13455778899999999999 Q ss_pred HHHHHHHHHhCCChhHccccc-----chhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhcCCCcEEEEe Q lcl|Aclame:pro 419 QTSADIQEVTGGSQAMQQMPS-----NIAQETVNNLMNRADMASFIYLDNMA-KSLKRAGEVWLSMAREVYGSEREVRIV 492 (708) Q Consensus 419 ~~~~~~~~~tGv~~~~~G~~~-----n~sg~ai~~~q~q~~~~~~~~~dn~~-~~~~~~~~~~l~li~~~y~~~r~irI~ 492 (708) +..+.++++|||+++++|..+ +.|+++|+.+++++++++..+++||+ .+++++|++++.||.+||+++++|||+ T Consensus 417 ~~~~~~~~~tGi~~~~~G~~~~~~~~~~Ta~~i~~~~~~~~~r~~~~~r~~a~~~~~~l~~~~~~li~~~~~~~~~~ri~ 496 (705) T protein:vir:88 417 RLEADRGKRTGITDRTRGLDQNTLHSNQAAMSVNQLMTAAEQQIDLIARMFAETGVKRLFQLLHDHAIKYQNQEEVFQLR 496 (705) T ss_pred HHHHHHHHhhCCchHHcCCCcccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCceEEeec Confidence 999999999999999999542 35888999999999999999999998 689999999999999999999999999 Q ss_pred ccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhH--HHHHHHHHHHHHHhccccCchhHHHHHHHHh Q lcl|Aclame:pro 493 NEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTAR--RDATVSVLTNVLSSMLPTDPMRPAIQGIILD 570 (708) Q Consensus 493 ~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~--r~~~~~~l~~llq~~~~~~p~~~~~~~~~~~ 570 (708) | .|+.+++..+ .++|||.|++++++.+. +.+.+..++++.+.+.+...+.+. +.. T Consensus 497 g-----~~v~v~~~~~--------------~~~~~v~v~v~~~~~~~eq~~a~l~~ll~~~q~l~~~~~~~~~----~~~ 553 (705) T protein:vir:88 497 G-----KWVAVNPANW--------------RERSDLTVTVGIGNMNKDQQMLHLMRIWEMAQAVVGGGGLGVL----VSE 553 (705) T ss_pred c-----chhccchHhh--------------ccCCceEEeeccccchHHHHHHHHHHHHHHHHHhhcccchhhh----cCh Confidence 8 5788876443 35799999888877663 334445555555555443222111 111 Q ss_pred hccchhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-- Q lcl|Aclame:pro 571 NIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQKATNETAQTQIKA-- 648 (708) Q Consensus 571 ~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qae~~k~~~~~~~~q~e~-- 648 (708) ........++.+.+....+.....++...++++..++..+.+. +++..+.++|++.+++|++.++.+++++..+.++ T Consensus 554 ~~~~~~~~el~e~~~~k~~~~~~~~~~~~e~~~~~~~~~q~e~-~~~~~~~~~q~e~~k~q~e~~~~q~e~q~~q~E~q~ 632 (705) T protein:vir:88 554 QNLYNILKEVTENAGYKDPDRFWTNPNSPEALQAKAIREQKEA-QPKPEDIKAQADAQRAQSDALAKQAEAQMKQVEAQI 632 (705) T ss_pred HHHHHHHHHHHHhhhhhhHHHHhhhhhhHHHHHHHHhhhhhhh-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1111222233333222222222222222222211111111111 1111111222222222222222222111111111 Q ss_pred -------HHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhcCCCCCCCCCCC Q lcl|Aclame:pro 649 -------FTAQQDAME-----SQANTVYKLAQARNIDDKAVMEAIRLLKDVAESQQQQFQSPPQSPADLMPS 708 (708) Q Consensus 649 -------~~~~~~~~~-----~~a~~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~ 708 (708) .+.+..++. .+++......+.+.+++...++. ++...... +...-....|.+-.|+ T Consensus 633 ~q~e~e~~~~~~~~~~~e~~~~~a~~~~~~~~~e~e~~~~e~e~-~~e~~q~~---~~~~~~~~~~~~~k~~ 700 (705) T protein:vir:88 633 RLAEIELKKQEAVLQQREMALKEAELQLERDRFTWERARNEAEY-HLEATQAR---AAYIGDGKVPETKKPT 700 (705) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHH---HHHHHHHhHHHHHHHH Confidence 111110000 00111101111111100000000 00000000 0000000111111122 No 19 >protein:vir:80165 Length: 651 # NCBI annotation: portal protein # Family: family:all:1548 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285799;genbank:gi:148747833;genbank:GeneID:5220441 Probab=100.00 E-value=1.8e-56 Score=326.23 Aligned_cols=586 Identities=11% Similarity=0.048 Sum_probs=329.6 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHH--------HhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATR--------FARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATE 72 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~--------~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~ 72 (708) |. ..+++...++.+++++.++...+-.+|.+..+ .+||.|..|.... .....||+.+++|.++.. T Consensus 15 ~~-~~~~~~~~~~~~~~~~~~~r~~~~~~w~~~~~~~~~~~~~~~y~~~~~~~~~~------~~~~~~rs~~~~~~v~~~ 87 (651) T protein:vir:80 15 YD-ETHDVSSYVKKEYKRFCDARQVCEETWLEAWGMYLSTPEAQDYLRDQVLRSVG------DVNADWRHKITTGKAFEA 87 (651) T ss_pred hh-hhHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHhhcccHHHHHhhccccccccC------CCCCCCCccccChhHHHH Confidence 44 34456777888888888887665555554322 3566675553221 112247888999999999 Q ss_pred HHHHHHHHhcC----cceeEEecCCCcch-HHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCC Q lcl|Aclame:pro 73 LNRIIAEYRNN----RITVKFRPGDREAS-EELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYD 147 (708) Q Consensus 73 i~~i~g~~~~n----r~~~~v~pr~~~~d-~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d 147 (708) |+++++..... ..-++|.|.+++++ ...+++++.++.+.+.+|++...++..++|+++.|.|+++|.|+...+.. T Consensus 88 ve~~~~~l~~~~~~~~~~~~~~p~~~~d~a~~~~~~~~~~~~~~l~~~~~~~~~~~~~~d~l~~G~~i~kv~we~~~~~~ 167 (651) T protein:vir:80 88 IETIHAYLMSATFPNKNWFDVVPAKPGQDNLLVSRLIKRYVQDKLTEGKFRAAYANFLRQLLITGNSVLALPWRVETAEV 167 (651) T ss_pred HHHHHHHHHHhhcCCCceeEeccCCchhHHHHHHHHHHHHHHHHhhccCcHHHHHHHHHhhcccCceEEEEeecceeeee Confidence 99999877764 33367778653222 23456666666666678999999999999999999999999887432111 Q ss_pred --------------CCC--------CCcceeeEEeecchhheecCCccccCChhccCeEEEeecCCHHHHHHhC----CC Q lcl|Aclame:pro 148 --------------PMD--------DRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEY----GK 201 (708) Q Consensus 148 --------------~~~--------~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~----p~ 201 (708) ++. -.+.++++.| |+.+|||||.|+ ++.|+.||+++.++ +.++..+. .. T Consensus 168 ~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~i~~v--~p~~~~~dp~a~--~~~d~~~v~~~~~t-~~~l~~l~~~g~~~ 242 (651) T protein:vir:80 168 KKKVQVRTPLFEDEPTFEVVSEEREVKSSPDFEVL--DMFDCFYDPNVT--DPNRGAFIRKLTKT-KADILNLLSEGYYY 242 (651) T ss_pred ehheeccccccccccceeeeccceeeeceeEEEEe--cHHHeeecCCCc--Cccccceeeeeeee-HHHHHHHHhccccc Confidence 100 0123445555 678999999886 57799999998765 44444432 11 Q ss_pred Cc-c-ccccccccccccc---CCCCCceeEEeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhh Q lcl|Aclame:pro 202 KP-P-TSLDVTSMTSWEY---NWFGADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVAR 276 (708) Q Consensus 202 ~~-~-~~~d~~~~~~~~~---~~~~~~~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 276 (708) .. . ...+........+ .....+..-.. .+. ...++.+++||.+.. .+++ T Consensus 243 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~-~~~-~~~~v~v~E~~~~~d-----~e~~------------------- 296 (651) T protein:vir:80 243 GVDPLDVVEHKCKDTSDTKQDMLSTFQGVTTS-LWS-PHQNVELLEYWGDIH-----LENK------------------- 296 (651) T ss_pred chhhHHHHhhhccccccCCccccccccCCCcc-ccc-cccceEEEEEEEEee-----ccCC------------------- Confidence 00 0 0000000000000 00000000001 111 122344455542211 0000 Q ss_pred eeeeeEEEEEEEEecceeee-cCCCCCCCCcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCC Q lcl|Aclame:pro 277 RSVKRRRVYVSVVDGDGFLE-KPRRIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQ 355 (708) Q Consensus 277 ~~~~~~~v~~~~~~~~~il~-~~~~~p~~~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~ 355 (708) ..+++...+.|..+|. ..+++++ .+||+++.+.+ ++|+.+|.|+++.+.|.|+.+|++.+.+++++++++++ T Consensus 297 ----~~~~~~v~~~g~~il~~~~~~~~~-~~Pf~~~~~~~--~~~~~yG~g~~~~~~~~q~~ln~l~~~~ld~~~~~~~~ 369 (651) T protein:vir:80 297 ----TYHDVVVTIMGNEVLRFEQNPYWC-GRPFVIGTYIP--TARQPYAMGALQPNLGMLHELNIITNQRLDNLELAIDQ 369 (651) T ss_pred ----ceEEEEEEEcCcEEecccccCCCC-CCCeeeeccee--cCccccCCChHHHHhHHHHHHHHHHHHHHHHHHHHhCC Confidence 0112223344555553 3455554 34877655443 68899999999999999999999999999999999999 Q ss_pred ceeechhhccchHHHHHhhcccCCceeeecccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHc Q lcl|Aclame:pro 356 IPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQ 435 (708) Q Consensus 356 ~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~ 435 (708) +|+++++++...++. ...+++++.++... .+. ++. ..+..+...++++++..+.++++||+++.++ T Consensus 370 ~~~v~~d~~~~~~~l----~~~pg~vi~~~~~~----~~~----~l~--~~~~~~~~~~~~l~~l~~~~~~~~gv~~~~~ 435 (651) T protein:vir:80 370 MYTLRSDGLLQPEDV----YTEPGKVFLVSDHG----DLQ----PLA--NQSSNFSITYQESSFLESTIDKNFGTGNYVG 435 (651) T ss_pred cEEecCCccccHHHh----hcCCCceEEecCCC----Cce----eec--cCcccchhHHHHHHHHHHHHHHHhcCChHHh Confidence 999998876543332 23456665544321 111 111 1123457778999999999999999999999 Q ss_pred cccc----chhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCCcEEEEeccCC-CceEEEecccccc Q lcl|Aclame:pro 436 QMPS----NIAQETVNNLMNRADMASFIYLDNMAK-SLKRAGEVWLSMAREVYGSEREVRIVNEDG-SDDIAVLSAQVVD 509 (708) Q Consensus 436 G~~~----n~sg~ai~~~q~q~~~~~~~~~dn~~~-~~~~~~~~~l~li~~~y~~~r~irI~~~~~-~~~~v~in~~~~~ 509 (708) |... +.|+++|+.+++++..++..++++|.. +.+.+++.++.|+.+||+.++++||+|++. ...++.+++ T Consensus 436 g~~~~~~~~~TAteI~~~~~~~~~~l~~v~~~l~~e~l~pl~~r~l~l~~~~~~~~~~~ri~~~~~~~~~~~~i~~---- 511 (651) T protein:vir:80 436 ANAARSGERVTAAEVAAVREAGGNRLSGIHKHIEETSLLVLLEKVMHLVQQFTDQPGMVRVAGDEAGAYEYYELDV---- 511 (651) T ss_pred CCCccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcccceeecccccccccccccCc---- Confidence 9632 358899999999999999999999996 899999999999999999999999999763 334444443 Q ss_pred cCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhh Q lcl|Aclame:pro 510 RQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLL 589 (708) Q Consensus 510 ~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~ 589 (708) .|++ ++|||+ ..|+..-..|.+..+.|.++++.+++..++.... . ....+.++.+.+....+ T Consensus 512 ---------~dl~-~~~~iv-~~g~~~~~~r~~~~~~l~~~~q~~~~~p~~~~~~-~------~~~~~~~l~~~~g~~~~ 573 (651) T protein:vir:80 512 ---------EDLQ-KEVRLV-PIGSDHVIERKQYIEDRLTFIQAVAQVPEMGQLV-D------YKRILVDLLQHWGFEEP 573 (651) T ss_pred ---------ccee-eeeeee-eccHHHHHHHHHHHHHHHHHHHhhccCCccchhh-h------HHHHHHHHHHHcCCCCc Confidence 3444 577774 4565555668888888888888777643322111 0 01122334444332222 Q ss_pred hhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 590 ISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQA 669 (708) Q Consensus 590 ~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qae~~k~~~~~~~~q~e~~~~~~~~~~~~a~~~~~~~q~ 669 (708) ......+.++.. +.+++..+.+ ++...++++.+.+++.+++.+ ..+.+.++.++++..+.++. T Consensus 574 ~~~l~~~~q~~~----~~~~~~~~~q--~~~~~~~a~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~--------- 636 (651) T protein:vir:80 574 EAYLKQQDQQAP----ANPQEALLSQ--AKDVGGQAMSNMLQNQLQADG--GTQMMSEMYGTPNADQMQQE--------- 636 (651) T ss_pred HHhcCCCccchh----hhhhHHHHhh--HHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHH--------- Confidence 222222211111 1111111111 111111111111111111110 00111111111111100000 Q ss_pred HHHHHHHHHHHHHHHH Q lcl|Aclame:pro 670 RNIDDKAVMEAIRLLK 685 (708) Q Consensus 670 ~~~~~~~~~~~~~~~~ 685 (708) ....+..++..++-+ T Consensus 637 -~~~~~~~l~~~~~~~ 651 (651) T protein:vir:80 637 -LMATTPNVSEQQLTQ 651 (651) T ss_pred -HHHHHHHHHHhhccC Confidence 000011111111111 No 20 >protein:vir:345 Length: 663 # NCBI annotation: virion structural protein # Family: family:all:3199 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203459;genbank:gi:15320615;genbank:GeneID:921720 Probab=100.00 E-value=8.2e-44 Score=256.81 Aligned_cols=597 Identities=14% Similarity=0.098 Sum_probs=315.8 Q ss_pred CCc--------chHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHH Q lcl|Aclame:pro 1 MAE--------TLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATE 72 (708) Q Consensus 1 ma~--------~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~ 72 (708) |+| .++-...+|..++..+..++.+|.++.....+ .|.|..|+..... -.||+|... T Consensus 1 m~~~~~~~~~~tpe~la~~W~~~I~~a~~~~~~~h~r~~~~~k--~y~~~~~~~~~~~-------------~r~nl~~sn 65 (663) T protein:vir:34 1 MNESQPTDFADTPQGWAQRWQEEMSAAREPLEKWHTQGKEIVK--RYRDERDSAHDAE-------------TRWNLFSTN 65 (663) T ss_pred CCccccccchhcchhHHHHHHHHHHHHHhccchHHHHHHHHHH--HhhccccCCCccc-------------cccchhhhh Confidence 666 45566779999999999999999999887665 3557777553311 138999999 Q ss_pred HHHHHHHHhcCcceeEEecCCCcchHHHH----HHHHHHHHHHH--HhcChHHHHHHHHHHHhhcCeeEEEEEeeccccC Q lcl|Aclame:pro 73 LNRIIAEYRNNRITVKFRPGDREASEELA----NKLNGLFRADY--EETDGGEACDNAFDDAATGGFGCFRLTSMLVNEY 146 (708) Q Consensus 73 i~~i~g~~~~nr~~~~v~pr~~~~d~~~A----~~l~~~~~~~~--~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~ 146 (708) |..++-......|.|.|+||....|..+| ++++.+++... +..+++.....+..++++||+|+++|+|+...+. T Consensus 66 i~~i~P~iYar~P~p~V~~rf~d~d~~~~r~ase~leR~~~~~~~~D~~~l~~~~~~~v~d~ll~~rG~~~v~Ye~~~~~ 145 (663) T protein:vir:34 66 IQTQMASLYGQTPKVSVSRRFADADDDVARVASELLERLLNTDIEKDSDTFQQALEYALQDRLLPGFGLCRIRYEVEWEE 145 (663) T ss_pred HHHHhhhhhcCCCcceeeecccCcccchhhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhhccccceEEEEeecccch Confidence 99999999999999999999865553344 55555444322 5677999999999999999999999999764431 Q ss_pred C--------CCCCC--------------cceeeEEeecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcc Q lcl|Aclame:pro 147 D--------PMDDR--------------QRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPP 204 (708) Q Consensus 147 d--------~~~~~--------------~~i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~ 204 (708) . +..+. ..++|..| +|.+|++|| |+. ++++.|++.+.||++++++++|+.+.+ T Consensus 146 ~~~~~~~~D~~~~~~~a~~~~~~e~~a~E~v~id~v--~~~dfl~~p-Ar~--W~ev~wva~r~~mtk~e~~~rf~~~~~ 220 (663) T protein:vir:34 146 VAGVDAILDEATGAELAAAVPPTQRKAYECVETDYL--HWQDVLWSP-ARV--WHEVRWLAFRNLLDMREFNARFDADGS 220 (663) T ss_pred hccccccCCCccccchhcccccchhhcccceeeeee--chhhcccch-hhc--cccccceeeeccCCHHHHHHhhcCChh Confidence 1 11111 13444444 699999999 564 569999999999999999999964432 Q ss_pred cccc--ccccccc-----ccCCCCCceeEEeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhhe Q lcl|Aclame:pro 205 TSLD--VTSMTSW-----EYNWFGADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARR 277 (708) Q Consensus 205 ~~~d--~~~~~~~-----~~~~~~~~~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 277 (708) .... ....... .......+...|.|.|+|...+ T Consensus 221 ~~~~a~~~~~~~~~~~~~~~~~~~~~~a~VwEIWdK~~~~---------------------------------------- 260 (663) T protein:vir:34 221 RNLWASVPKVGKPKDGKDGQSCHPWDRAEVWEIWDKGGRK---------------------------------------- 260 (663) T ss_pred hhhhhhccCcCCccccCCCCCcchhcCcceeEEEecCCcE---------------------------------------- Confidence 2111 1110010 1111122355666666655433 Q ss_pred eeeeEEEEEEEEecceeeecCCCCCCC---CcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCC Q lcl|Aclame:pro 278 SVKRRRVYVSVVDGDGFLEKPRRIPGE---HIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPG 354 (708) Q Consensus 278 ~~~~~~v~~~~~~~~~il~~~~~~p~~---~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~ 354 (708) |||+.=+++.+|+ .+|.|.+ +||+ |+..++....++.+|--.+-.-.+.|+++|-++.++--+ ...-+ T Consensus 261 ------V~w~~eg~~~~L~-~~~p~lgl~~ffPc-Prpl~~~~~~ds~ipvpd~~~y~~~~~E~n~~t~Rin~l-~d~ik 331 (663) T protein:vir:34 261 ------VDWYVEGYSAVLD-TQPDPLGLESFFPC-PKPLLANWTTDKVVPRPDFVLAQDLYKEIDLVSTRITLL-ERAIR 331 (663) T ss_pred ------EEEEEcCcceecc-cCCCCCCCCCCCCC-cccccceecCCCeecCCcHHHHHHHHHHHHHHHHHHHHH-Hhhhh Confidence 2332222232332 2444332 3343 333444333444444444448899999999887775444 44457 Q ss_pred CceeechhhccchHHHHHhhcccCCceeeec---ccccccccccccccccccccCccc---hHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 355 QIPIVGMEQIRGLEKHWEARNKKRPAFLPLR---EVRDKSGNIIAGATPAGYTQPAVM---NQALAALLQQTSADIQEVT 428 (708) Q Consensus 355 ~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~l~~~~~~~~~~~t 428 (708) .+++++.++.+++-.......... .+++. ......|.. ..+...+...+ ...+.+.......++..+| T Consensus 332 v~gvy~~~~g~~i~~~l~~a~~n~--lvpV~~~~~~~~~gg~~----k~I~~~pi~~~~~aI~~l~~~r~qir~d~~qIT 405 (663) T protein:vir:34 332 VVGVYDKSSGLTIGRLLSEAAQND--LIPVENWLTFADKGGLR----GVVDWFPLEPVVAALTSLRDYRRELVDALHQVT 405 (663) T ss_pred hceeeccccchhHHHHHHHhhCCC--ceecchhhhhhhhcCcc----chhhcccchhHHHHHHHHHHHHHHHHHHHHHHH Confidence 788888776655444333322221 22221 111111110 01112222222 3334555566778999999 Q ss_pred CCChhHccccc-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccc Q lcl|Aclame:pro 429 GGSQAMQQMPS-NIAQETVNNLMNRADMASFIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQV 507 (708) Q Consensus 429 Gv~~~~~G~~~-n~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~ 507 (708) |+.|++.|.+. |-+++|.+..++.|+.++..+.+.+.++.++++++.-+.|.+.|+-+.+-+|+|..-.. -+.|.+ T Consensus 406 GiaDi~Rga~~a~ETatAQ~IKsq~gS~RIqe~qdevqR~arDi~ql~AEIl~~~~~~etl~~m~~~elp~-~~ei~~-- 482 (663) T protein:vir:34 406 GMADIMRGASDPRETAMAQGVKAKFGSIRLQRLQDEVARFASDIQRLKAEVIAEHYDVASILAQANAEFTF-DKELAP-- 482 (663) T ss_pred hHHHHhhcccCcchhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCHHHHHHHhcCCCCc-ccchhH-- Confidence 99999999754 45888888888899999999999999999999999999999999988888888753221 111211 Q ss_pred cccCCCceE-EeeccceeeEEEEEeecccchhHHHHHHHHHHHHHHhccc-------cCchhHHHHHHHHhhcc-----c Q lcl|Aclame:pro 508 VDRQTGAVV-ALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLP-------TDPMRPAIQGIILDNID-----G 574 (708) Q Consensus 508 ~~~~~~~~~-~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq~~~~-------~~p~~~~~~~~~~~~~d-----~ 574 (708) .+- +.|| .+-.|.|-|..+........+.-+.++++++.+++ ...+.+...+++.+++. + T Consensus 483 ------~~~~L~n~-~~r~~~ldIe~dsT~~~D~~~eK~~~~E~l~~i~~~~qq~~pl~~q~p~~~p~l~Ellk~~~~~f 555 (663) T protein:vir:34 483 ------KAAELIKS-RFSMYRVEVKPEAVSLQDFAALRNEKMEVLSGIASFMQGVAPLAQQVPGSAPFLLQMLKWSVSGL 555 (663) T ss_pred ------HHHHHhcC-CCcceeeeeccCCCCcCChHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhcC Confidence 111 2233 22445555554432222222222334444333322 11111112222222221 1 Q ss_pred hhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 575 EGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQD 654 (708) Q Consensus 575 ~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qae~~k~~~~~~~~q~e~~~~~~~ 654 (708) .....+-..+.+.... .++.+.++.+ ++..++..+.+++++.+++|.+++++++. +|.+..+.+.+ T Consensus 556 ~~~~qie~ai~~~~~~----------~e~aa~~~~~-~~pa~~~~~~k~~~~q~k~q~~~aeAq~e---~q~~~~~~ql~ 621 (663) T protein:vir:34 556 RGSSTIEGVLDKAIAA----------AEEAQKQAAQ-QSPAPQQPDPKVVAQAMKGQQEMAKVQAE---VQGDLLRIQAE 621 (663) T ss_pred ChhhhHHHHHHHHHhh----------hHHHhhccCC-CCcccchhhHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHH Confidence 2211121111111000 0000000000 00001111112222333333333332221 22222222223 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhcCCCCCCCC Q lcl|Aclame:pro 655 AMESQANTVYKLAQARNIDDKAVMEAIRLLKDVAESQQQQFQSPPQSPADL 705 (708) Q Consensus 655 ~~~~~a~~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~ 705 (708) +++.+.+....+ +..+..+.++.-..+.. ....+++.+.+.+ T Consensus 622 ~~~~~~k~~~~a-------~~~~~~a~q~~~~~~~~--r~~~~~a~~~~~~ 663 (663) T protein:vir:34 622 TQANETKERQQA-------EWNVREAAQKNLISQAA--RAMNPQARNGGMP 663 (663) T ss_pred HHHHHHHHHHHH-------HHHHHHHHHhhHHHHHH--HhhchhhhcCCCC Confidence 222222221111 11112221111111111 1112223333322 No 21 >protein:vir:95449 Length: 584 # NCBI annotation: hypothetical protein ORF047 # Family: family:all:1548 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294640;genbank:gi:149408206;genbank:GeneID:5237016 Probab=100.00 E-value=2.7e-41 Score=243.04 Aligned_cols=540 Identities=14% Similarity=0.076 Sum_probs=315.2 Q ss_pred CCcchHHH---------HHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHH Q lcl|Aclame:pro 1 MAETLEKK---------HERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVAT 71 (708) Q Consensus 1 ma~~~~~~---------~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~ 71 (708) |+-+..++ ...+...|+.+.++.+.+..+|.+-. +||.+ +-...+. -. ...+|-.++.|++.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~r~~~~~~w~el~--~y~~a-~~~~~~~-~~----~~~~r~~~~~~k~~~ 72 (584) T protein:vir:95 1 MSVKVAELNSLLVRDSSAQWVAYLWDRFNNQRRQKIEEWKELR--NYVFA-TDTTTTS-NQ----GLPWKNSTTLPKLCQ 72 (584) T ss_pred CCcchhhhhhhccccchHHHHHHHHHHHHhhhchhhccCHHHH--HHHHh-hhhhhhh-hc----ccccccccchhHHHH Confidence 66543332 25567777877788777777775433 44543 2222111 11 123566889999999 Q ss_pred HHHHHHHHHh----cCcceeEEecCCCcc-hHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeecccc- Q lcl|Aclame:pro 72 ELNRIIAEYR----NNRITVKFRPGDREA-SEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNE- 145 (708) Q Consensus 72 ~i~~i~g~~~----~nr~~~~v~pr~~~~-d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~- 145 (708) .+++++.... .|+-=+++.+..+++ +...+++++.++..-..++++..+.+..|+++++.|.|++++.|...-. T Consensus 73 ~~~~i~~~l~~~~Fp~~~w~~~v~~~~~~~~~~~~~ai~~~i~dkl~e~~~~~~~~~~i~d~~~~G~~~~k~~~~~~~~e 152 (584) T protein:vir:95 73 IRDNLHSNYFSSLFPNDDWLRWVGYGKGDSTKTKAKAIQAYMSNKCRESHFRTEVSKLIYDYIDYGNAFATVSFEAKYKE 152 (584) T ss_pred HHHHHHHHHHHhhcCccceeeeecCCCchhhHHHHHHHHHHHhhhhhhccHHHHHHHHHHhhccCCceEEEEeEeeccee Confidence 9999987554 345556777775532 2334899999999988999999999999999999999999988764321 Q ss_pred ---CCCCCCCcceeeEEeecchhheecCCccccCChhccCeEEEeecCCHHHHHHhC-----CCCcccccccccc---cc Q lcl|Aclame:pro 146 ---YDPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEY-----GKKPPTSLDVTSM---TS 214 (708) Q Consensus 146 ---~d~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~-----p~~~~~~~d~~~~---~~ 214 (708) .+-.....+.+++++ +|.+|||||.|+ +++|+.||+ +..+|++++.++- |.-..+.+..... .. T Consensus 153 ~~e~~~v~~~~~prieri--SP~d~~~Dpsa~--~i~d~~fiv-rs~~T~~~L~~l~~~~~~~~y~~d~v~~~~~~~~~~ 227 (584) T protein:vir:95 153 MTDGTLVPDYIGPRLVRI--SPLDIVFNPLAT--SISDTFKIV-RSVKTKGELMRLAQDEPEQSYWLEALKRREEICRHL 227 (584) T ss_pred eeccccccccccceEEee--ChhheeecCCCC--Cccchhhhh-hhhhhHHHHHHHHhhcCccccchHHHHHHHHhccCC Confidence 111111224556655 456899999996 577999999 5668999998873 2111111100000 00 Q ss_pred cccCCC--------CCc-eeEEeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEE Q lcl|Aclame:pro 215 WEYNWF--------GAD-VIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVY 285 (708) Q Consensus 215 ~~~~~~--------~~~-~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~ 285 (708) .++.+. +.+ ...+.+||.+.++.+ ++||- . +.+.+.. + ...+++. T Consensus 228 ~~~~~~~~~~~~~~~~d~~~~~~ey~~~~~V~v--l~~~g----~---~~~~~~~----e-------------~~~~~iv 281 (584) T protein:vir:95 228 GGYSVEDFDKAAGFDVDGFGNLYEYYMSDWVEI--LEFYG----D---YHDKETG----E-------------LQTNRII 281 (584) T ss_pred CCCcccccccccccccccccccccccCCceeEE--Eeecc----c---ccccccC----C-------------CcccceE Confidence 011111 111 112344554444443 33331 1 1111100 0 0112233 Q ss_pred EEEEecceeeecCCCCCCCCcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhcc Q lcl|Aclame:pro 286 VSVVDGDGFLEKPRRIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIR 365 (708) Q Consensus 286 ~~~~~~~~il~~~~~~p~~~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~ 365 (708) ..++.++.|....+|+|++.+||+-+. ..+...+.+|+|+...+.|.|+.+|..++.++++++++.++.+... . T Consensus 282 ~v~~g~~iIR~~~np~~~~~~PF~~~~--~~p~~~s~yG~gi~~ll~d~Q~~lna~~r~~iDnl~l~~~pv~k~~---~- 355 (584) T protein:vir:95 282 TVVDRSTEVRNESIPTWFGSAPIYHVG--WRFRPDNLWAMGPLDNLVGMQYRIDHLENAKADAVDLIIQPPLKII---G- 355 (584) T ss_pred EEEeccEEEEeeecCCCCCCCCEEEEc--ceeeeccccCCCchhhhhhHHHHHhHHHHHHHHHHHHhcCcceeec---c- Confidence 333444444456789999999997433 3345778999999999999999999999999999999998733222 1 Q ss_pred chHHHHHhhcccCCceeeecccccccccccccccccccccCcc-chHHHHHHHHHHHHHHHHHhCCChhHcccccc--hh Q lcl|Aclame:pro 366 GLEKHWEARNKKRPAFLPLREVRDKSGNIIAGATPAGYTQPAV-MNQALAALLQQTSADIQEVTGGSQAMQQMPSN--IA 442 (708) Q Consensus 366 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n--~s 442 (708) +.++.. ..++..+... . ++ ..+++.++. --...++.+++....+...||++..++|..+. -+ T Consensus 356 ~~~~~~----~~pg~~~~~~----~-----~~--~~q~~~p~a~~~~s~~~~lq~~e~~me~~sGvp~~~~G~~~~~~~T 420 (584) T protein:vir:95 356 EVEEFV----WGPGAEIHLD----Q-----GG--DVQEIAKNVNYIINADNQIQMLEDRMELYAGAPREAMGIRTPGEKT 420 (584) T ss_pred ccchhc----ccCCceeecC----C-----CC--CcceecCchhhhhHHHHHHHHHHHHHHhhhCCChhhcccccchhhh Confidence 112111 1122221110 0 11 122233332 11234566889999999999999999997643 37 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhcCCCcEEEEeccC-CCceEEEecccccccCCCceEEeec Q lcl|Aclame:pro 443 QETVNNLMNRADMASFIYLDNMAKSL-KRAGEVWLSMAREVYGSEREVRIVNED-GSDDIAVLSAQVVDRQTGAVVALND 520 (708) Q Consensus 443 g~ai~~~q~q~~~~~~~~~dn~~~~~-~~~~~~~l~li~~~y~~~r~irI~~~~-~~~~~v~in~~~~~~~~~~~~~~nD 520 (708) ++.++++.++++....++.+.+...+ ++++..|++...++.+..-++||+++. +...|+.|.+. | T Consensus 421 Atg~s~l~naa~~~~r~~~~~f~~~ll~~l~~ll~~~~~~nmd~~~~vr~~n~e~~~~~f~~i~r~-------------D 487 (584) T protein:vir:95 421 AFEVQQLGNAAGRIFQEKVTTFEVELLEPVLNAMLETATRNMDGSDVIRVMDTDLGVKEFMSVTRE-------------D 487 (584) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCceeeeccccccccccccChh-------------h Confidence 77889999999999999999998755 999999999999999999999999986 55567777542 3 Q ss_pred cceeeEEEEEeecccchhHHHHHHHHHHHHHHh-ccc-cCchhH--HHHHHHHhhccchhHHHHHHHHHhhhhhhhcccC Q lcl|Aclame:pro 521 LSVGRYDVTVDVGPSYTARRDATVSVLTNVLSS-MLP-TDPMRP--AIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKP 596 (708) Q Consensus 521 i~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq~-~~~-~~p~~~--~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~ 596 (708) + .|+||++..-..... .|+++.+.+.++++. +++ ..|... .+...+.+.++.|+-. .-.+ T Consensus 488 l-~g~~~~va~Ga~~~~-~keq~~q~l~~ilq~~~~~~i~p~~~~~~l~~~ladl~~~p~~~--------------~~~~ 551 (584) T protein:vir:95 488 I-TANGKIRPIGARHFG-KQAQDLQNLVGIFNSQIGQMILPHTSGKALATFVDDVTGLQGYE--------------IFRP 551 (584) T ss_pred h-ccCeeEEeehhhHHH-HHHHHHHHHHHHHHhhhhhhccccchHHHHHHHHHHHhCCCccc--------------ccCC Confidence 3 478888877654444 478888888888873 222 111111 1111222233333211 1111 Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 597 RNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQKAT 638 (708) Q Consensus 597 ~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qae~~k~~ 638 (708) ....++|+..| +... +++ +..++|+++. |+ .+- T Consensus 552 ~~~~~~Q~~~q--~~~~-~~q-~~~~~~~~~~---~~--~~~ 584 (584) T protein:vir:95 552 NVAVAEQAETQ--SLVA-QAQ-EDLQLQAQMP---AE--GAI 584 (584) T ss_pred CcccchhHHHH--hhhH-HHH-HHHHHHHhhh---hc--cCC Confidence 11111111111 1100 000 1111121111 11 000 No 22 >protein:vir:94599 Length: 641 # NCBI annotation: PfWMP4_39 # Family: family:all:1548 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762669;genbank:gi:115304377;genbank:GeneID:5142299 Probab=100.00 E-value=6e-39 Score=230.14 Aligned_cols=581 Identities=12% Similarity=0.080 Sum_probs=306.5 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCC-----CCCCHHHHHHhhhhhhhcCCCceeecchHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPG-----GQWEGATAAGTKLDEQFEKYPKFEINKVATELNR 75 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G-----~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~ 75 (708) |++ +.+...++.+|+.+.++.+.|...|+++.+ ||.. +....-..+.. .....++|.-++.+.+...+++ T Consensus 20 ~~~--~~~~~~l~~~~~~~~~~R~~~e~~W~e~~~--~~~~~~~~~~~~~~~~~~~~-~~~~~~~r~ki~~~~~~~~~~~ 94 (641) T protein:vir:94 20 LST--DRIGGVVISKWQESRDKRNTVENNWDETYE--LYRASAIDRQNTRARNFQTT-GADDADWRHRINTGHTFEVVET 94 (641) T ss_pred CCc--hhHHHHHHHHHHHHHHhhcchHHHHHHHHH--Hhhcchhhhhhccccccccc-ccchhcccccccchhHHHHHHH Confidence 654 357788999999999998888888887644 3322 00000000000 0011223445666666666666 Q ss_pred HHH----HHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeecccc------ Q lcl|Aclame:pro 76 IIA----EYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNE------ 145 (708) Q Consensus 76 i~g----~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~------ 145 (708) +.. ....++.-+++.|++. +|.+.|++++..+++...+|++...++..+.+++..|.|++++.|+.... T Consensus 95 l~s~Lm~~~~p~~~wf~~~p~~~-ed~~~A~~~~~~~~~~l~~~~~~~~~~~~~~d~~~~g~~iv~~~w~~~~~~~~~~~ 173 (641) T protein:vir:94 95 LVAYFKGATFPSDDWFDLKGMVP-ELADAARVVKQLTKTKLEAASIRDIFETYVRNLVLYGVSTYRLGWDTSMERQFKRT 173 (641) T ss_pred HhhHHhhhhcCCCceEEEecCCC-ChHHHHHHHHHHHHHHHhhcchHHHHHHHHHHHhhcCceEEEeehhhHHHHhhhhh Confidence 654 4445666678999975 57788999999999999999999999999999999999999998764311 Q ss_pred ----CCCCCC---------CcceeeEEeecchhheecCCccccCChhccCeEEEe-ecCCHHHHHHh--CCCCccccccc Q lcl|Aclame:pro 146 ----YDPMDD---------RQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCM-YSLSPEKYEAE--YGKKPPTSLDV 209 (708) Q Consensus 146 ----~d~~~~---------~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~-~~~~~~e~~~~--~p~~~~~~~d~ 209 (708) .+.... ...++++++ ++.++||||.++. | +..|++++ ..++..++... |+...- +. T Consensus 174 ~~~~~~~~~~~~~~~v~~~~~~~r~~~v--~~~di~~dps~~~-~--~~~f~~~r~t~~t~~~l~~eg~~~~d~v---~~ 245 (641) T protein:vir:94 174 FVETGDIFGGWEDVAVNRQRSELRIEPL--SPYDVWLDTSGGK-N--TGTFVRLRHTREELHELVTSGYYDLDLT---QV 245 (641) T ss_pred cccchhhcccccccceecccceeeEEec--chhheeecCCCCc-c--cccceehhhhHHHHHHHHhcCCCChhhc---ch Confidence 110100 112334443 5678999997653 2 44454332 33344444433 322111 11 Q ss_pred ccccccccCCCCCceeEEeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEE Q lcl|Aclame:pro 210 TSMTSWEYNWFGADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVV 289 (708) Q Consensus 210 ~~~~~~~~~~~~~~~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~ 289 (708) .... ++.+.+.++..-..+-+..+ ..+++|+. .+..+. ...+.++ ..+ T Consensus 246 ~~~~--~~~~~~~d~~~d~~~~~~~~--~~~~e~~g-------d~~~d~--------------------~~~~~~~-~~~ 293 (641) T protein:vir:94 246 EQYV--DYKFADPDTPKDVNGTDTSG--WDIIEYYG-------PLLVEG--------------------VQFWCVH-AVF 293 (641) T ss_pred hhcc--cccccccccccccccccccc--cceeeeee-------eeccCC--------------------CceeeEE-EEE Confidence 1111 11111122211111111111 12233331 011110 0111222 233 Q ss_pred ecceeeecCCCCCCCCcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHH Q lcl|Aclame:pro 290 DGDGFLEKPRRIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEK 369 (708) Q Consensus 290 ~~~~il~~~~~~p~~~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~ 369 (708) .|..+|...+-.++..+||+.+... .++++.+|.|.+..+.+.|+.+|++.+.+++++.++.++++++..+.+-.- T Consensus 294 ~g~~il~~~~~~~~d~~Pf~~~r~~--~~~~~~YG~gp~~~~l~dqk~ln~l~r~~ld~~~~~~~p~~~~~~~~~~~~-- 369 (641) T protein:vir:94 294 YGKQLIRLSDSKYWCGSPFVTTTLL--PDRDSVYGMSVLHPNLGALHVLNVLTNGRLDNLVLHINKMWTLVEDGILKR-- 369 (641) T ss_pred eCCEEeecccccccCcCCeEEecce--ecCCcccCCChHHHHHHHHHHHHHHHHHHHHHHHHHhCCeeeecccccccc-- Confidence 4566664322222345576643322 368899999999999999999999999999999999999999877654321 Q ss_pred HHHhhcccCCceeeecccccccccccccccccccccCcc-chHHHHHHHHHHHHHHHHHhCCChhHcccc---c-chhHH Q lcl|Aclame:pro 370 HWEARNKKRPAFLPLREVRDKSGNIIAGATPAGYTQPAV-MNQALAALLQQTSADIQEVTGGSQAMQQMP---S-NIAQE 444 (708) Q Consensus 370 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~---~-n~sg~ 444 (708) ......+++++.... .+.+.| +.+.. -.....+++++....+...+|+...++|.. + +.|++ T Consensus 370 --~~l~~~PG~ii~~~~----~~~v~p-------l~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~TAt 436 (641) T protein:vir:94 370 --EDVKAKPGAVFKVAQ----HGSLQP-------IDMGRQDFVVTYQEAQVQESSVYRNTSTGPLIGNAAPRGGERVTAA 436 (641) T ss_pred --ceeeccCCcceeeCC----CCccee-------ecCCccccchhHHHHHHHHHHHHHhhhhhhhhcccccccchhccHH Confidence 112233444433221 111211 11111 112234667777778888898887766543 2 34888 Q ss_pred HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCc-eEEEecccccccCCCceEEeeccc Q lcl|Aclame:pro 445 TVNNLMNRADMASFIYLDNMA-KSLKRAGEVWLSMAREVYGSEREVRIVNEDGSD-DIAVLSAQVVDRQTGAVVALNDLS 522 (708) Q Consensus 445 ai~~~q~q~~~~~~~~~dn~~-~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~-~~v~in~~~~~~~~~~~~~~nDi~ 522 (708) .+..+.+++..++..+.++|. .+++.+++.++.+++++++.+.++|++|..... .++.+.+ .|+ T Consensus 437 EV~~~~~e~~~~l~~i~r~l~~e~l~pll~~~~~~~~~~~~~p~i~R~~~~~~~~~~~~~~~p-------------~~L- 502 (641) T protein:vir:94 437 EIQGVRDAGGNRLSSVHTHIEDSSTLPLLNKVFSLLQQFYVTPETIRMYVPEEQMDGFFEVSP-------------EYL- 502 (641) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccchhhhhhhchhhhcccCCCCCc-------------cce- Confidence 999999999999999999999 699999999999999999999999999874221 2333322 233 Q ss_pred eeeEEEEEeecccchhHHHHHHHHHHHHHHhccccCchhHHHHHHHHhhccch-hHHHHHHHHHhhhhhhhcccCcchHH Q lcl|Aclame:pro 523 VGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGE-GLDDFKEYNRNQLLISGIAKPRNEKE 601 (708) Q Consensus 523 ~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~-~~~ei~e~~~~~~~~~~~~~~~~~~~ 601 (708) .|+||| +..+.+....+.+..+.|+++++.+++. |. ++.+.++. .+.++.+.++.-.+....-.++++ + T Consensus 503 ~~~~~i-v~l~~~q~~~~~~~i~~l~~~~~~~a~~-P~-------v~d~~d~~~~~~~~~~~~g~~~p~~~ir~~~~~-~ 572 (641) T protein:vir:94 503 HYPYKF-LALGANYVVERERMVTDLLQLLDISGRV-PQ-------IGQSLDYALILEDLLRQMRFTDPMRYIKKAEAP-P 572 (641) T ss_pred eeeeeE-eecchhHHHHHHHHHHHHHHHHHHhhcC-hh-------hhhcCCHHHHHHHHHHHhCCCCchhhccCccCc-h Confidence 367887 4666666777788888888888776652 21 22222322 234444443321121111111110 0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 602 QQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQKA--TNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVME 679 (708) Q Consensus 602 ~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qae~~k~--~~~~~~~q~e~~~~~~~~~~~~a~~~~~~~q~~~~~~~~~~~ 679 (708) ++..+++ ++ ++ ++. ..++|+-...+ ++.....+ .+++++..++.. +..... T Consensus 573 ~~~~~~~--~~---~q----~~~--~~~a~~~~~~~~~~a~~~~~~-----~~~~~~~~~~~~-----------~~~~~~ 625 (641) T protein:vir:94 573 AAPPIAP--AE---PG----ALP--PEMMNSVGGGLNDQAIAGMTP-----EDVSDLASRIGI-----------DTSDVA 625 (641) T ss_pred hHHHHHH--HH---HH----HHH--HHHHHHHHhhhHHHHHHHhhH-----HHHHHHHHhhcC-----------Cchhhh Confidence 0000000 00 00 000 00111000000 00000000 011111111000 001111 Q ss_pred HHHHHHhhhhhhhhhhhcCCCCCCCC Q lcl|Aclame:pro 680 AIRLLKDVAESQQQQFQSPPQSPADL 705 (708) Q Consensus 680 ~~~~~~~~~~~~~~~~~~~~~~~~e~ 705 (708) .++++...++ -+..-| T Consensus 626 ~~~~~~~~~~----------~~~~~~ 641 (641) T protein:vir:94 626 PEAMAAATQQ----------ITSGAL 641 (641) T ss_pred HHHHhccccc----------ccccCC Confidence 1111111111 111112 No 23 >protein:vir:3139 Length: 599 # NCBI annotation: hypothetical protein # Family: family:all:1548 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640321;genbank:gi:21234402;genbank:GeneID:956054 Probab=100.00 E-value=5.2e-35 Score=208.53 Aligned_cols=562 Identities=14% Similarity=0.081 Sum_probs=305.3 Q ss_pred CCc-------------chHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeec Q lcl|Aclame:pro 1 MAE-------------TLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEIN 67 (708) Q Consensus 1 ma~-------------~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N 67 (708) |.- .......++...|++..++.+...++|.+-.. |.+- | ..+. . .......+-.++.| T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~r~~~~~~w~e~~~--yi~~-~---~tr~-t-~~~~~~w~~s~t~~ 72 (599) T protein:vir:31 1 MSTDIKTLQKMLEGRDDDRAFIDELVVLFTNMENARAQKDREDKELMD--YIDA-T---DTRK-T-SNSKLPFKNSTTIN 72 (599) T ss_pred CccchHHHHHHhhccCchHHHHHHHHHHHHhhhhhhhhhhcccHHHHH--HHhh-h---cccc-c-ccCCCCcccccchH Confidence 321 23334455666666666666555555544333 3221 1 0111 0 01112456678899 Q ss_pred chHHHHHHHHHHHhc----CcceeEEecCCCcch-HHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeec Q lcl|Aclame:pro 68 KVATELNRIIAEYRN----NRITVKFRPGDREAS-EELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSML 142 (708) Q Consensus 68 ~i~~~i~~i~g~~~~----nr~~~~v~pr~~~~d-~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~ 142 (708) ++..++..++.+... |+-=+.+.+-.++++ .+.++++..+++.-..+|++..+++.-+.|.++.|.++..+.+.. T Consensus 73 k~~~~~~~l~a~~~~~~fp~~~w~d~~~~~~~~~~~~~~~~i~~yi~~Kl~e~~~~~~~~~~v~d~i~~G~~vat~~~er 152 (599) T protein:vir:31 73 KLAHLHLMITTSYMEHLLPNRNWVDFVGFDNDSVNAEKREIARSYVRGKVEASNLEGVIERMVDDFAVRGFCVAHTRHVK 152 (599) T ss_pred HHHHHHHHHHHHHHhhhcCCccceEeeecCCchhHHHHHHHHHHHhhhhhhhcchHHHHHHHHhhhcccCceeEeeeEEE Confidence 999999999875543 455567777665433 455888888888889999999999999999999998876665442 Q ss_pred cc----cCCCCCCCcceeeEEeecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCC-----CCccccccccccc Q lcl|Aclame:pro 143 VN----EYDPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYG-----KKPPTSLDVTSMT 213 (708) Q Consensus 143 ~~----~~d~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p-----~~~~~~~d~~~~~ 213 (708) -. ++.-...-..++++++ ++.+|||||.|+ +++|+.||+ +...|+.+|..+-. -..-+.++..... T Consensus 153 ~~~~~~d~~v~~~~~~P~~erv--sP~Di~~Dp~A~--si~d~~fiv-Rs~~Tk~~L~~l~~~~~~~~y~~d~~~~~~~~ 227 (599) T protein:vir:31 153 RMTVTAENQVIKNYSGTVTERL--SPSDVFWDVTAD--SLPKAAKCI-RQLYTLGSLKREIEEGTFPLMSMEDFQKLREE 227 (599) T ss_pred cceeecccccccccccceEEee--cccceeeCCCCC--CCCcceeee-ehhhhHHHHHHHhccCCccccchHHHHHHHhh Confidence 21 1111111224455555 456899999997 566998888 77778999988642 2221111110000 Q ss_pred ccccCCCCCceeEEeeeeeec--ceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEec Q lcl|Aclame:pro 214 SWEYNWFGADVIYIAKYYEVR--KESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDG 291 (708) Q Consensus 214 ~~~~~~~~~~~~~v~e~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~ 291 (708) ..+......+..+-..+.+.. .....++.|+.+...+.+.|+.+-.....+.+.. .+ +..+.| T Consensus 228 ~~~~~~~~~d~~~~~~g~D~~~~d~~~~~~eY~~~~~VevLeywGd~ydee~d~~~~--------------~~-ViTi~g 292 (599) T protein:vir:31 228 RRTIREALADGYNGRRKFDSLHKKGYGSMMNYINEGVVEVLTFMGDFYDEENDELWN--------------NY-EITVID 292 (599) T ss_pred ccCCCccccchhhhhhhccccccccccchhhhcccchhhhhhhhhhhhcccCCcccc--------------ce-EEEEec Confidence 000000000101000111111 1112234444444444444442211111111111 11 334555 Q ss_pred ceee--ecCCCCCCCCcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHH Q lcl|Aclame:pro 292 DGFL--EKPRRIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEK 369 (708) Q Consensus 292 ~~il--~~~~~~p~~~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~ 369 (708) +.+| .+.+|+|++.+||+-+.+.+ ..++.+|+|+...+.+.|..+|.+.+.+++++.....+. +...+.+.+.+- T Consensus 293 ~~~liR~e~np~~~g~~Pyvv~~~~P--~~~~~yG~G~l~~~~gaQ~~lN~~~Ng~iD~~~~~l~p~-l~~~~dl~~eD~ 369 (599) T protein:vir:31 293 RKIIGRKQSKDTWDGSQNLHIAVYEF--QKDTLCPIGPLHRLTGMQYKLDKRENFREDLHDRFLHPS-LKKVGDVREKGM 369 (599) T ss_pred CcEEeecccCCCCCCCCCeEEEEeee--eccccCCCCCchhcchHHHHHHHHHHHhhhhhhhhhccc-ccccccccccCc Confidence 4433 46789999999998544443 566899999999999999999999999999988877542 222222222111 Q ss_pred HHHhhcccCCceeeecccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHccccc--chhHHHHH Q lcl|Aclame:pro 370 HWEARNKKRPAFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPS--NIAQETVN 447 (708) Q Consensus 370 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~--n~sg~ai~ 447 (708) .| . ++.+.... ..+ ..+++.+++-......++++....+.+.||+..++.|..+ +-++..++ T Consensus 370 ~~----~-P~~v~~~~----d~~-------~vq~~~p~s~~~~a~~~is~~e~~mee~sGvp~~~~G~~~ag~~TA~~is 433 (599) T protein:vir:31 370 RG----G-PNHVFEVE----ETG-------DVQYMTPPAEVLQPDNQLSITLQLMEDLSGAPKESIGQRTAGEKTKFEVQ 433 (599) T ss_pred cC----C-CCcceeec----CCC-------ccccccCchhhhhHHHHHHHHHHHHHHhhccchhhcCCcccchhhHHHHH Confidence 11 1 11111100 011 1222333322334445778888889999999999999754 34888999 Q ss_pred HHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCCcEEEEeccC-CCceEEEecccccccCCCceEEeeccceee Q lcl|Aclame:pro 448 NLMNRADMASFIYLDNMAK-SLKRAGEVWLSMAREVYGSEREVRIVNED-GSDDIAVLSAQVVDRQTGAVVALNDLSVGR 525 (708) Q Consensus 448 ~~q~q~~~~~~~~~dn~~~-~~~~~~~~~l~li~~~y~~~r~irI~~~~-~~~~~v~in~~~~~~~~~~~~~~nDi~~g~ 525 (708) ++.++++....++.+.+.+ ..+.+.+.++++.++|+|++-++||++++ |...|+.|.+.. + .+. T Consensus 434 ~l~naa~~~~~~~vr~~e~~~lepll~~l~e~~~~f~D~~~tiri~~~e~~~~~f~~i~red-------------l-~~~ 499 (599) T protein:vir:31 434 LLDQGQNKVFRRKVKKFERELLTPVLNDYLEQGRNHLDASDTIKTFNSELGTATFLDITADD-------------L-NLN 499 (599) T ss_pred HHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccceeeecccccceeeEEeehhh-------------h-hCC Confidence 9999999999999999986 55779999999999999999999999986 777899986543 2 356 Q ss_pred EEEEEeecccchhHHHHHHHHHHHHHHh--ccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhhhhcccCcchHHHH Q lcl|Aclame:pro 526 YDVTVDVGPSYTARRDATVSVLTNVLSS--MLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQ 603 (708) Q Consensus 526 ~Dv~v~~~~~~~~~r~~~~~~l~~llq~--~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q 603 (708) |++ +..|...--.|++..+-|.++++. ..+..|...+ .....+.+.+... .......+.. T Consensus 500 ~~~-v~~Ga~~v~ere~~~q~l~~il~~~~~q~~~P~~~~-----------k~l~~~l~~~~~l-~~~~~~~~~v----- 561 (599) T protein:vir:31 500 GQM-VAQGATLFAEKANTLQNLNAILGGPLGAALAPHMSR-----------TKLFNAVEYLGDL-DAYGIFTFGI----- 561 (599) T ss_pred eee-eechhhHHHHHHHHHHHHHHHhcccCCCccchhhHH-----------HHHHHHHHHHHhc-cccccCCCch----- Confidence 787 566654444578877888887752 1112221111 0111111221111 0011111110 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 604 IVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNID 673 (708) Q Consensus 604 ~~~~~qq~qq~~~~~~~~~aq~~~~~~qae~~k~~~~~~~~q~e~~~~~~~~~~~~a~~~~~~~q~~~~~ 673 (708) .+++||.+ . ..+|++ +||+-.++-.+.. .. . .-++.+| T Consensus 562 -a~~eqq~~-----~--~m~Q~~---lq~~~~~~~~~~~-~~--~------------------~~~~~~~ 599 (599) T protein:vir:31 562 -GVQEDQQL-----A--RMAQKS---TQQTEETALTQEE-VG--G------------------PTTDTGQ 599 (599) T ss_pred -hHHHHHHH-----H--HHHHHH---HHHhHhhhhhhhh-cC--C------------------CCcccCC Confidence 00000000 0 000000 0111000000000 00 0 0000000 No 24 >protein:vir:103765 Length: 549 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024925;genbank:gi:48697195;genbank:GeneID:2846089 Probab=99.87 E-value=3.5e-19 Score=121.75 Aligned_cols=530 Identities=12% Similarity=0.016 Sum_probs=259.6 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhc-CC--CCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARV-PG--GQWEGATAAGTKLDEQFEKYPKFEINKVATELNRII 77 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~-~G--~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~ 77 (708) |.-...++.++++.+|.........|...|+++.+|..= -| .-++..+...-..+ -+.+.-+.-...++.+. T Consensus 1 m~~d~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~-----~~~~~dstg~~a~~~LA 75 (549) T protein:vir:10 1 MTNDDAKILQALNADHGRMKEKRQSYEAVWNDVIDYLMPRLDKFGQLPRPDSEKGRER-----SQKMFDSTAPLALRNFV 75 (549) T ss_pred CCcchHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccccccccCCCCCCccccc-----ccccccchHHHHHHHHH Confidence 998888889999999999999999999999988776331 11 11221111110000 01122233344444444 Q ss_pred HHHhc-----CcceeEEecCCCcch--HHHHHHHHHHHHHHH-----HhcChHHHHHHHHHHHhhcCeeEEEEEeecccc Q lcl|Aclame:pro 78 AEYRN-----NRITVKFRPGDREAS--EELANKLNGLFRADY-----EETDGGEACDNAFDDAATGGFGCFRLTSMLVNE 145 (708) Q Consensus 78 g~~~~-----nr~~~~v~pr~~~~d--~~~A~~l~~~~~~~~-----~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~ 145 (708) +.... +++=+++.+.+...+ ....+-|...-+.+. ..|++..+...++.+.+..|.|++.+.. T Consensus 76 s~l~~~ltpp~~~wF~l~~~~~~~~e~~~v~~~l~~ve~~~~~~~~~~~snf~~~~~~~~~~L~~~Gta~l~~~~----- 150 (549) T protein:vir:10 76 AAMDSMITPATQLWHRLKTGNDALNEIASVKAYLQGVVRTLFAARYRWQGGFVTQMGATYQSIGLFGPGALMIEH----- 150 (549) T ss_pred HHHHhhccCCCCccccccCCccchhhhhHHHHHHHHHHHHHHHHHhhhhcChHHHHHHHHHHHHhhcceeeEEee----- Confidence 33332 344445555432111 122333443333322 3688999999999999999999877632 Q ss_pred CCCCCCCcceeeEEeecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCCc-ccccccccccccccCCCCCce Q lcl|Aclame:pro 146 YDPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKP-PTSLDVTSMTSWEYNWFGADV 224 (708) Q Consensus 146 ~d~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~-~~~~d~~~~~~~~~~~~~~~~ 224 (708) ++ ...+++..+ |+.++++..++. .. ..-+|++..||...+.++||..+ .+.+.. ..+.+ + T Consensus 151 -~~---~~~~~f~~~--pl~~~~v~~d~~-G~---vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~----~~~~~---~-- 211 (549) T protein:vir:10 151 -DV---GKGIVYRNV--PMQRLWFAENNS-GL---IDKTHVQWELTLRQAAQRFGRENLSPSMQS----TLEKD---P-- 211 (549) T ss_pred -cC---CCeeEEEEE--EcCeEEEeeCCC-CC---eEEEEEEeecCHHHHHHhcCcccCCHHHHH----HhhcC---C-- Confidence 11 223455544 567777766543 12 22388999999999999999743 221110 11100 0 Q ss_pred eEEeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCC Q lcl|Aclame:pro 225 IYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGE 304 (708) Q Consensus 225 ~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~ 304 (708) ...+.+|++.+|.......-. ....+.+.++|.-..+..++.+ ..|. T Consensus 212 ----------~~~~~v~~~V~pr~~~~~~~~--------------------~~~~~pf~sv~~e~~~~~il~e---sg~~ 258 (549) T protein:vir:10 212 ----------EKSAIFYHAVEPRADRDPRKL--------------------DGRNMQFASYWLDEGRDRIVQN---SGFR 258 (549) T ss_pred ----------CceEEEEEEeecCCCCCcccc--------------------ccccCceEEEEEEecCCEeecc---CCcc Confidence 112233343333221100000 0011222333333445555544 2345 Q ss_pred CcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeee Q lcl|Aclame:pro 305 HIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPL 384 (708) Q Consensus 305 ~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~ 384 (708) ++||+|+-.. ..+|..+|.|.+....+-.+.+|++....+....+..+++++++.+.+-. ..+..+++...+ T Consensus 259 e~P~~~~Rw~--~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~~~g~~~------~~~l~pgg~~~~ 330 (549) T protein:vir:10 259 TFPFAIGRFY--VGTDDVYGGSPAYDAMPDVRMANDMAKTNIRGAQKLVDPPLLANEDGVLD------GFDLRSGALNWG 330 (549) T ss_pred cCCcceeeee--ecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccc------cceeccCCcccc Confidence 6777765433 36888999999999999999999999999999999999999998764422 122344554332 Q ss_pred cccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhH-cccccchhHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 385 REVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAM-QQMPSNIAQETVNNLMNRADMASFIYLDN 463 (708) Q Consensus 385 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~-~G~~~n~sg~ai~~~q~q~~~~~~~~~dn 463 (708) ..+......+.| +. ..+.+ .....+++...+.|...--+.... +-.....|++-|..+.+.....+...+.+ T Consensus 331 ~~~~~~~~~~~p----l~--~~~~~-~~~~~~i~~~~~rI~~af~~d~~~~~~~~~~~TAtEV~~r~~E~~~~LGpv~~r 403 (549) T protein:vir:10 331 GLNDKGEEMVKP----LL--TGKQA-QIGIEFAQDTRQTINQWFYVTLFQILVDSGDMTATEVLQRAQEKGVLLAPTLGR 403 (549) T ss_pred ccCCCCccceee----ec--cccch-hHHHHHHHHHHHHHHHHHhhhhhhhhcCCCCccHHHHHHHHHHHHHHhhHHHHH Confidence 222111111111 11 11122 233455666666666655333222 22233468888999998888888888888 Q ss_pred HH-HHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHH Q lcl|Aclame:pro 464 MA-KSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDA 542 (708) Q Consensus 464 ~~-~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~ 542 (708) |. .+..-+.+..+.++.+. | . |. ..-+.. .+ ....++|.. +++-...++.. T Consensus 404 l~~E~l~Pli~R~~~il~r~----------g---~-----lP-~~p~~l-------~~-~~~~~~i~y-is~La~aq~~~ 455 (549) T protein:vir:10 404 TQSELLGPMIAREVDILAEA----------G---Q-----LP-DMPQEL-------ID-AGADVDVEY-DSPLNKAMRAG 455 (549) T ss_pred HHHHHHHHHHHHHHHHHHhc----------C---C-----CC-CCChhh-------hc-CCceeEEEe-ecHHHHHHHHH Confidence 76 55555555555555441 1 0 00 000000 00 001133333 22333445666 Q ss_pred HHHHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhhh-hcccCcchHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 543 TVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLIS-GIAKPRNEKEQQIVQQAQMAAQSQPNPEMV 621 (708) Q Consensus 543 ~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~-~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~ 621 (708) ....+.++++.+++.... .+.++...+ .+++...+-...... ....++ ++.+++.++.++++|.++..+.. T Consensus 456 ~~~~i~~~~~~~~~laq~----~Pe~ld~id---~d~~~~~~a~~~Gvp~~~irs~-eev~~~r~~~~~qqq~~~~~~~a 527 (549) T protein:vir:10 456 EGAAILQWLQQLGIVSQF----DPAAAKVPN---GARIARLLADYGGVPVEAMSTD-EELQAQQAAEAQAAQMQQMLAAA 527 (549) T ss_pred HHHHHHHHHHHHHHHhcc----ChhHHhcCC---HHHHHHHHHHhcCCCccccCCH-HHHHHHHHHHHHHHHHHHHHHHH Confidence 666666666655432211 222333333 344444444333221 222221 12122211111111111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 622 LAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAME 657 (708) Q Consensus 622 ~aq~~~~~~qae~~k~~~~~~~~q~e~~~~~~~~~~ 657 (708) .. .+++++...+++.+...+.. T Consensus 528 ~~--------------a~~~a~~~~~~~ta~~~~~~ 549 (549) T protein:vir:10 528 PV--------------AAGAIKDLSDAQTAAQTARV 549 (549) T ss_pred HH--------------HHHHHHhhhhhcCCCcccCC Confidence 11 11111111111111000000 No 25 >protein:vir:7321 Length: 556 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848212;genbank:gi:30387383;genbank:GeneID:2641872 Probab=99.86 E-value=2.5e-19 Score=122.52 Aligned_cols=536 Identities=15% Similarity=0.122 Sum_probs=253.1 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEY 80 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~g~~ 80 (708) ||+.. ..+++++|+......+.|...|+++.+|..=...-+...+...-.. ..+.+.-+.....++.+.+.. T Consensus 1 m~~~~---~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~-----~~~~~~dst~~~a~~~Las~l 72 (556) T protein:vir:73 1 MAETE---KERLLKQLAQLKNERTSFESHWLDLSDFINPRGSRFLTSDVNRDDR-----RNTKIVDPTGSMAQRILSSGM 72 (556) T ss_pred CChhh---HHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCcCCCCCCcchh-----hcCccccchHHHHHHHHHHHH Confidence 99754 3456778888888888899999888775310111122211111000 112334455555666555433 Q ss_pred hc-----CcceeEEecCCCcchHHHHH------HHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCC Q lcl|Aclame:pro 81 RN-----NRITVKFRPGDREASEELAN------KLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPM 149 (708) Q Consensus 81 ~~-----nr~~~~v~pr~~~~d~~~A~------~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~ 149 (708) .. +++=+++.+.+++ ..+.++ .++..+......|++..+...++.+.+..|.|.+.+.. ++ T Consensus 73 ~~~ltpp~~~WF~l~~~d~~-~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~------~~- 144 (556) T protein:vir:73 73 MSGITSPARPWFKLATPDPD-MMDYGPVKIWLEVVQRRMNEVFNKSNLYQSLPVMYASLGTFGTGAMAVME------DD- 144 (556) T ss_pred HHhhcCCCCcccccccCccc-ccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeeeee------cC- Confidence 32 4555666665432 122222 24555666677899999999999999999999876532 11 Q ss_pred CCCcceeeEEeecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCCc-ccccccccccccccCCCCCceeEEe Q lcl|Aclame:pro 150 DDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKP-PTSLDVTSMTSWEYNWFGADVIYIA 228 (708) Q Consensus 150 ~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~-~~~~d~~~~~~~~~~~~~~~~~~v~ 228 (708) ...+++..+ |+.++++..++.- + ..-|++...++..++.++||... .+.+-. ...... ..+ T Consensus 145 --~~~~r~~~~--~l~~~~~~~d~~G-~---vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~----~~~~~~-~~~----- 206 (556) T protein:vir:73 145 --QDVIRTMPF--PIGSYYLANSPRG-S---VDTCIRQFSMTVRQMVQEFGLDNVSTSVKG----MWENGT-YET----- 206 (556) T ss_pred --CceEEEEEe--ecceeEEeeCCCC-C---eEEEEEEEeccHHHHHHHcCcccCCHHHHH----HHhcCC-ccc----- Confidence 223455544 5678888776542 1 22278889999999999998643 222110 000000 001 Q ss_pred eeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeE-EEEEEEEec-ceeeecCCCCCCCCc Q lcl|Aclame:pro 229 KYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRR-RVYVSVVDG-DGFLEKPRRIPGEHI 306 (708) Q Consensus 229 e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~v~~~~~~~-~~il~~~~~~p~~~~ 306 (708) .+.++++.++.... ..+.. ....+.+ .++|...++ ..++.. ..|.++ T Consensus 207 --------~~~v~~~V~pr~~~----~~~~~----------------~~~~~p~~s~~~~~~~~~~~vl~e---sg~~e~ 255 (556) T protein:vir:73 207 --------WVEVNHCITPNVNR----DSGKM----------------DSKNKPYRSVYFESGGDSDKLLRE---SGFDEF 255 (556) T ss_pred --------eEEEEEEEeccccc----ccccc----------------CcccceEEEEEEEecCCCceeccc---CCcccC Confidence 11222222221100 00000 0000111 122221122 334432 345667 Q ss_pred ceeeEEEeeeccCCcccccch-HHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeec Q lcl|Aclame:pro 307 PLIPVYGKRWFIDDIERVEGH-IAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLR 385 (708) Q Consensus 307 p~~p~~~~~~~~d~~~~~~G~-vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~ 385 (708) ||+++-.. ..+|..+|.|. +....+-.+.+|++....+....+..+++++++.+.... .....++++.... T Consensus 256 P~~~~Rw~--~~~ge~YGrg~P~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~~------~~~~~pgg~~~~~ 327 (556) T protein:vir:73 256 PILAPRWE--VNGEDVYASSCPGMLALGQVKALQVEQKRKAQLIDKATNPPMVAPTSLKNQ------RVSLLPGDVTYLD 327 (556) T ss_pred Cceeeeee--ecCCcccccCccHHHhHHHHHHHHHHHHHHHHHHHHHhcCceecccccccc------ceeeccCcccccc Confidence 77765433 26889999995 999999999999999999999999999999998764321 2233444443322 Q ss_pred ccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChh-Hccc--ccchhHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 386 EVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQA-MQQM--PSNIAQETVNNLMNRADMASFIYLD 462 (708) Q Consensus 386 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~-~~G~--~~n~sg~ai~~~q~q~~~~~~~~~d 462 (708) .... . .+ ..|.+...+.+ ..+.++++...+.|....-.+-+ +++. ..+.|++-|..+.+.....+...+. T Consensus 328 ~~~~-~----~~-i~p~~~~~~d~-~~~~~~i~~~~~rI~~af~~d~~~~l~~~~~~r~TAtEv~~r~~E~~~~LG~v~~ 400 (556) T protein:vir:73 328 VISG-Q----DG-FKPAYLVNPNT-ADLLADIQDTRQTINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLE 400 (556) T ss_pred CCCC-c----cc-eeeeccccccH-HHHHHHHHHHHHHHHHHhhcchhhhhccCCCCCccHHHHHHHHHHHHHHhhHHHH Confidence 1111 0 11 12222222222 22334456666666655543321 2333 2346999999999988888888888 Q ss_pred HHH-HHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHH Q lcl|Aclame:pro 463 NMA-KSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRD 541 (708) Q Consensus 463 n~~-~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~ 541 (708) +|. .+..-+.+..+.++.+.-- + .+. -+ .+....+.|..... -...++. T Consensus 401 rl~~E~l~Pli~r~~~il~r~g~----l--------------P~~-P~----------~l~~~~i~v~yis~-La~aqk~ 450 (556) T protein:vir:73 401 RLNDEALNPLIDRVFSIMARKNM----L--------------PEP-PD----------VLQGMPLRIEYISV-MAQAQKS 450 (556) T ss_pred HHHHHHHHHHHHHHHHHHHhcCC----C--------------CCC-ch----------hhcCceeEEEeecH-HHHHHHH Confidence 875 5566666666665555211 0 000 00 00111233333222 2333455 Q ss_pred HHHHHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhhh-hcccCcchHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 542 ATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLIS-GIAKPRNEKEQQIVQQAQMAAQSQPNPEM 620 (708) Q Consensus 542 ~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~-~~~~~~~~~~~q~~~~~qq~qq~~~~~~~ 620 (708) .....+.++++.+++.... .+.++...+ .+++...+-...... ....++ ++.++..++.++++|.+++.+ T Consensus 451 ~~~~~i~~~~~~~~~laq~----~Pe~~d~id---~d~~~~~~a~~~Gvp~~~irs~-eev~~~rq~r~~~qq~~~~~~- 521 (556) T protein:vir:73 451 IGLTSLSQTVGFIGQLAQF----KPEALDKLD---VDQAIDAFSEMSGVSPTVIVPQ-EQVQGIREERAKQAQAAQAMA- 521 (556) T ss_pred HHHHHHHHHHHHHHHHhcc----ChhhHhcCC---HHHHHHHHHHHcCCChhhcCCH-HHHHHHHHHHHHHHHHHHHHH- Confidence 5555555555544332111 122233333 344444444333221 222221 111111111111111111100 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhh Q lcl|Aclame:pro 621 VLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVMEAIRLLKDVAESQQQ 693 (708) Q Consensus 621 ~~aq~~~~~~qae~~k~~~~~~~~q~e~~~~~~~~~~~~a~~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~ 693 (708) +++.+ ++..+..+.. ...- ....++...++- .+ ++ T Consensus 522 ---~~~~a---~~~~~~~~~~---~~~~-~~~l~~~~~~~g-------------------------~~---~~ 556 (556) T protein:vir:73 522 ---MGQAA---AQGAKTLSET---QTSD-PSALTAIANAAG-------------------------AP---QQ 556 (556) T ss_pred ---HHHHH---HHHHHHhhhc---cCCC-HHHHHHHHHhhc-------------------------CC---CC Confidence 00000 0111100000 0000 000000000000 00 00 No 26 >protein:vir:95315 Length: 559 # NCBI annotation: putative head-to-tail-joining protein # Family: family:all:481 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512261;genbank:gi:89152428;genbank:GeneID:3952984 Probab=99.85 E-value=1.2e-18 Score=118.85 Aligned_cols=539 Identities=14% Similarity=0.103 Sum_probs=252.7 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhh-cCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFAR-VPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAE 79 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~-~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~g~ 79 (708) ||++. .+++..+|.......+.|...|+++.+|.. +.+ -+...+...-.. ..+.+.-+.....++.+.+. T Consensus 1 m~~~~---~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~-~~~~~~~~~~~~-----~~~~~~dst~~~a~~~Las~ 71 (559) T protein:vir:95 1 MAETT---KERLNKQFAQLESERQSFEPHWRELSDYINPRGS-RFLTSEVNRNDR-----RNTRIIDSTGTMAARTLASG 71 (559) T ss_pred CChhh---HHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccC-CcCCCCCCcccc-----cccccccchHHHHHHHHHHH Confidence 99764 456678888888888888899988877532 111 121111111000 11233445555666655543 Q ss_pred Hhc-----CcceeEEecCCCcc--hHHHHHHH---HHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCC Q lcl|Aclame:pro 80 YRN-----NRITVKFRPGDREA--SEELANKL---NGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPM 149 (708) Q Consensus 80 ~~~-----nr~~~~v~pr~~~~--d~~~A~~l---~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~ 149 (708) ... +++=+++.+.+.+. ..+..+.| +..+......|++..+...++.+.+..|.|++.+.. |+ T Consensus 72 l~~~ltpp~~~WF~l~~~d~~~~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~Gta~l~~~~------d~- 144 (559) T protein:vir:95 72 MMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLD------DD- 144 (559) T ss_pred HHHhhcCCCCcccccccCCccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeEeec------CC- Confidence 333 44545555544321 12333333 344445566899999999999999999999876632 22 Q ss_pred CCCcceeeEEeecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCCc-ccccccccccccccCCCCCceeEEe Q lcl|Aclame:pro 150 DDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKP-PTSLDVTSMTSWEYNWFGADVIYIA 228 (708) Q Consensus 150 ~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~-~~~~d~~~~~~~~~~~~~~~~~~v~ 228 (708) ...+++..+ |+.++++..++.- ...-|++...|+..++..+||... .+.+......+ ... T Consensus 145 --~~~~r~~~~--~l~~~~v~~d~~G----~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~-----~~~------ 205 (559) T protein:vir:95 145 --EDIIRTMPF--PIGSYYLANSPRG----SVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESG-----TYE------ 205 (559) T ss_pred --CceeEEEEe--ecCeEEEeeCCCC----CeEEEEEeEecCHHHHHHHcCcccCCHHHHHHHhcC-----CCC------ Confidence 223455544 5678888765542 223378889999999999998643 22111110000 001 Q ss_pred eeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeE-EEEEEEEec-ceeeecCCCCCCCCc Q lcl|Aclame:pro 229 KYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRR-RVYVSVVDG-DGFLEKPRRIPGEHI 306 (708) Q Consensus 229 e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~v~~~~~~~-~~il~~~~~~p~~~~ 306 (708) ..+.++++.++.... ..+.. ....+.+ .++|..-+. ..++.. ..|.++ T Consensus 206 -------~~v~v~~~V~pr~~~----~~~~~----------------~~~~~pf~s~~~e~~~~~~~~l~e---sg~~e~ 255 (559) T protein:vir:95 206 -------KWIEVMHSVYPNIDR----DTSKL----------------DSKNKPFKSVYYEVGGDNDKLLRE---SGFDEF 255 (559) T ss_pred -------CeEEEEEEEeccccc----ccccc----------------ccccceEEEEEEEecCCCceeeec---CCcccC Confidence 112233333221100 00000 0000011 122221122 234433 334667 Q ss_pred ceeeEEEeeeccCCcccccch-HHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeec Q lcl|Aclame:pro 307 PLIPVYGKRWFIDDIERVEGH-IAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLR 385 (708) Q Consensus 307 p~~p~~~~~~~~d~~~~~~G~-vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~ 385 (708) ||+++-..+ .+|..+|.|. +....+-.+.+|.+....+..+.+..+++++++.+... ...+..++++..+. T Consensus 256 P~~~~Rw~~--~~ge~YGrg~P~~~al~d~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~------~~~~l~pgg~~~~~ 327 (559) T protein:vir:95 256 PIMAPRWEV--NGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLKN------QRASLLPGDITYID 327 (559) T ss_pred Cccceeeee--cCCccccccchHHHhhHHHHHHHHHHHHHHHHHHHHhcCceeccccccc------cceeeeccceeeeC Confidence 777654332 6888999994 99999999999999999999999999999999866432 12234455554433 Q ss_pred ccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChh-Hccc--ccchhHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 386 EVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQA-MQQM--PSNIAQETVNNLMNRADMASFIYLD 462 (708) Q Consensus 386 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~-~~G~--~~n~sg~ai~~~q~q~~~~~~~~~d 462 (708) .... . . ...|.+...+.+ ..+...++...+.|....-.+-. +++. ..+.|++-|..+.+.....+...+. T Consensus 328 ~~~~---~--~-~i~p~~~~~~~~-~~~~~~i~~~~~rI~~af~~d~~~~l~~r~~~rvTAtEV~~r~~E~~~~LG~v~~ 400 (559) T protein:vir:95 328 QITG---Q--D-GFRPAYLVNPST-ADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLE 400 (559) T ss_pred CCCC---c--c-cceeecccccch-HHHHHHHHHHHHHHHHHhhhhhHHHhhcCCCCCCCHHHHHHHHHHHHHHhhHHHH Confidence 2211 1 0 111222222222 22233455556666555543321 2232 3346999999999988888888888 Q ss_pred HHH-HHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHH Q lcl|Aclame:pro 463 NMA-KSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRD 541 (708) Q Consensus 463 n~~-~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~ 541 (708) +|. .+..-+.+..+.++.+.-- +- + -+... ....+.|...... ...+|. T Consensus 401 rl~~E~l~Pli~r~~~il~r~g~----lP---~---------~p~~l-------------~~~~i~v~~is~L-a~aqk~ 450 (559) T protein:vir:95 401 RLNDECLNPLIDRSFSMMVRKNM----LP---P---------PPDVM-------------EGMPLKVEYISVM-AQAQKS 450 (559) T ss_pred HHHHHHHHHHHHHHHHHHHhcCC----CC---C---------Ccccc-------------cCcceEEEeecHH-HHHHHH Confidence 875 4555555555555555311 00 0 00000 0112333332222 233455 Q ss_pred HHHHHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhh-hhcccCcchHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 542 ATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLI-SGIAKPRNEKEQQIVQQAQMAAQSQPNPEM 620 (708) Q Consensus 542 ~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~-~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~ 620 (708) ...+.+.++++.+++.... .+.++... +.++++..+-..... .....++ ++.++..++.+++++++++ T Consensus 451 ~~~~~i~~~~~~~~~laq~----~Pevld~i---d~d~~~~~~a~~~Gvp~~~irs~-~ev~~~rqqr~~~qq~~q~--- 519 (559) T protein:vir:95 451 IGLSSLASTVNFIGQLAQV----KPEALDKL---NVDQAIDAFADMSGVSPTVIVPQ-EQVEQARQQRAQQQQQQQM--- 519 (559) T ss_pred HHHHHHHHHHHHHHHHhcc----ChhhhhcC---CHHHHHHHHHHHhCCchhhcCCH-HHHHHHHHHHHHHHHHHHH--- Confidence 5555555555544332111 12223333 334444444433322 1222221 1111111111111111000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhh Q lcl|Aclame:pro 621 VLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVMEAIRLLKDVAESQQQ 693 (708) Q Consensus 621 ~~aq~~~~~~qae~~k~~~~~~~~q~e~~~~~~~~~~~~a~~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~ 693 (708) .+++. .. ++..+.-++ +..... -++++....-.....++| T Consensus 520 -~~~~~-~a--a~~~~~~~~-------~~~~~~----------------------~~l~~~~~~~~~~~~~~~ 559 (559) T protein:vir:95 520 -MAMGM-AA--AQGVKTLSE-------AKTSDP----------------------SVLSAMANAVSGQGGQSQ 559 (559) T ss_pred -HHHHH-HH--HHhhhcccc-------ccCCCh----------------------hHHHHHHHhhcCccccCC Confidence 00000 00 000010000 000000 000000000000011111 No 27 >protein:vir:2198 Length: 536 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041995;swissprot:sw:p03728;genbank:gi:9627467;goa:P03728;uniprot:P03728;genbank:GeneID:1261033 Probab=99.83 E-value=8.3e-18 Score=114.24 Aligned_cols=513 Identities=14% Similarity=0.072 Sum_probs=249.6 Q ss_pred CCcc-hHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHHHH Q lcl|Aclame:pro 1 MAET-LEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAE 79 (708) Q Consensus 1 ma~~-~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~g~ 79 (708) |||+ +-...+.++.+|+...+....|...|+++.+|.+=..--.+..... . ....+.-+.-...++.+.+. T Consensus 1 m~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~---~-----~~~~~~dst~~~a~~~Laa~ 72 (536) T protein:vir:21 1 MAEKRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNAS---T-----DYQTPWQAVGARGLNNLASK 72 (536) T ss_pred CcchhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCccc---c-----cccccccccHHHHHHHHHHH Confidence 9983 2224567888888888888888888988876522111111111100 0 01123344555555555543 Q ss_pred HhcC----cceeEEecCCCc------chHHHHHH------HHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeecc Q lcl|Aclame:pro 80 YRNN----RITVKFRPGDRE------ASEELANK------LNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLV 143 (708) Q Consensus 80 ~~~n----r~~~~v~pr~~~------~d~~~A~~------l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~ 143 (708) .... ++=+++.+.+.+ .+.+.+++ .+..+....+.|++..+...++.+.+..|.|+..+.-+ T Consensus 73 l~~~ltP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~e~-- 150 (536) T protein:vir:21 73 LMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEP-- 150 (536) T ss_pred HHHhhcCCCcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEEeeC-- Confidence 3333 332222222111 11122222 34556666778999999999999999999998655211 Q ss_pred ccCCCCCCCcceeeEEeecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCc Q lcl|Aclame:pro 144 NEYDPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGAD 223 (708) Q Consensus 144 ~~~d~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~ 223 (708) ...+...+..+ |+.++++..+.. . ...-+|++..||..++...||+...... .. .. ..+ T Consensus 151 ------~~~~~~~f~~~--pl~~~~v~~d~~---G-~vd~i~r~~~~t~~~l~~~fg~~~~~~~----~~---~~--~~~ 209 (536) T protein:vir:21 151 ------EGSNYNPMKLY--RLSSYVVQRDAF---G-NVLQMVTRDQIAFGALPEDIRKAVEGQG----GE---KK--ADE 209 (536) T ss_pred ------CCCceeeEEEE--EcCeEEEeeCCC---C-CeeEEeeeeeccHHHHHHhhhhhhcccc----cc---cc--ccc Confidence 11111122222 556777655432 1 3344889999999999999986432110 00 00 011 Q ss_pred eeEEeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCC Q lcl|Aclame:pro 224 VIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPG 303 (708) Q Consensus 224 ~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~ 303 (708) .+.+ ..+.+.++.++. +.|++ -..|..++...+.+|+ T Consensus 210 ~v~v-----------~~~v~~~~~~~~-------------------------------~~~~~-e~~g~~v~~~~g~~~f 246 (536) T protein:vir:21 210 TIDV-----------YTHIYLDEDSGE-------------------------------YLRYE-EVEGMEVQGSDGTYPK 246 (536) T ss_pred ceeE-----------EEEEEEecCCCc-------------------------------EEEEe-ccCCeeeccccCcccc Confidence 1111 101111111111 11211 1234445555677888 Q ss_pred CCcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceee Q lcl|Aclame:pro 304 EHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLP 383 (708) Q Consensus 304 ~~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~ 383 (708) ..+||+++.... .+|..+|.|.+....+-.+.+|++....+.......+++++++++.+-...... ...++.++. T Consensus 247 ~~~P~i~~Rw~~--~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~---~~~~g~~v~ 321 (536) T protein:vir:21 247 EACPYIPIRMVR--LDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLT---KAQTGDFVT 321 (536) T ss_pred ccCCeeeeeeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccCcccccchhhhc---cCCCcceec Confidence 999998765443 688899999999999999999999999999999999999999877664332211 112222222 Q ss_pred ecccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcc-ccc-chhHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 384 LREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQ-MPS-NIAQETVNNLMNRADMASFIYL 461 (708) Q Consensus 384 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G-~~~-n~sg~ai~~~q~q~~~~~~~~~ 461 (708) ... +.+. ... .....--+...+.++...+.|....-+. +++ .++ ..|++-|..+.+.....+...+ T Consensus 322 g~~-----~~v~----~~~-~~~~~~~~~~~~~i~~~~~rI~~af~~~--~l~~~~~~r~TAtEV~~r~~E~~~~LG~v~ 389 (536) T protein:vir:21 322 GRP-----EDIS----FLQ-LEKQADFTVAKAVSDAIEARLSFAFMLN--SAVQRTGERVTAEEIRYVASELEDTLGGVY 389 (536) T ss_pred CCc-----ccce----eee-ccccccchHHHHHHHHHHHHHHHHHhhh--hcccCCCCCccHHHHHHHHHHHHHHhhHHH Confidence 111 1110 111 1111222334566777777776665333 233 223 3688889999988888888888 Q ss_pred HHHH-HHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHH Q lcl|Aclame:pro 462 DNMA-KSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARR 540 (708) Q Consensus 462 dn~~-~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r 540 (708) .+|. .+..-+.+..+.++.+. |. |-+ . | .. + +.+.+..+.+ +-.| T Consensus 390 ~rl~~Ell~Pli~r~~~il~r~-------------g~-----lP~--~-p--~~------~----v~~~~vs~l~-~l~r 435 (536) T protein:vir:21 390 SILSQELQLPLVRVLLKQLQAT-------------QQ-----IPE--L-P--KE------A----VEPTISTGLE-AIGR 435 (536) T ss_pred HHHHHHHHHHHHHHHHHHHHhC-------------CC-----CCC--C-C--hh------h----ccceEEecHH-HHHH Confidence 8876 45555555555554321 10 000 0 0 00 0 1223333333 4557 Q ss_pred HHHHHHHHHHHHhccccCchhHHHHHHHHh-hccchhHHHHHHHHHhhhh--hhhcccCcchHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 541 DATVSVLTNVLSSMLPTDPMRPAIQGIILD-NIDGEGLDDFKEYNRNQLL--ISGIAKPRNEKEQQIVQQAQMAAQSQPN 617 (708) Q Consensus 541 ~~~~~~l~~llq~~~~~~p~~~~~~~~~~~-~~d~~~~~ei~e~~~~~~~--~~~~~~~~~~~~~q~~~~~qq~qq~~~~ 617 (708) .+..+.++++++.+....|. ++. .. +.+++...+-.... +.....+ +++.++.+++++++++. T Consensus 436 ~~~~~~l~~~~~~la~~~Pe-------~ld~~i---d~d~~~~~~a~~~Gv~p~~~irt--~eev~~~r~q~~~~~~~-- 501 (536) T protein:vir:21 436 GQDLDKLERCVTAWAALAPM-------RDDPDI---NLAMIKLRIANAIGIDTSGILLT--EEQKQQKMAQQSMQMGM-- 501 (536) T ss_pred HHHHHHHHHHHHHHHhhchh-------hhcccC---CHHHHHHHHHHHcCCChhhhcCC--HHHHHHHHHHHHHHHHH-- Confidence 77888888887766554432 111 12 23344444433222 1222222 12211111111110000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 618 PEMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVM 678 (708) Q Consensus 618 ~~~~~aq~~~~~~qae~~k~~~~~~~~q~e~~~~~~~~~~~~a~~~~~~~q~~~~~~~~~~ 678 (708) ++++. +. .+..+.+++...+...+..+ ++ ..+ .+ + T Consensus 502 ----~~~a~-----~~-~~~~~~~~~~~~~~~~~~~~----~~----------g~~-~~-~ 536 (536) T protein:vir:21 502 ----DNGAA-----AL-AQGMAAQATASPEAMAAAAD----SV----------GLQ-PG-I 536 (536) T ss_pred ----HHHHH-----HH-HHHHHHHHhcChhhHHhhhh----cc----------ccC-CC-C Confidence 00000 00 00000000000000000000 00 000 00 0 No 28 >protein:vir:10447 Length: 536 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848294;genbank:gi:30387485;genbank:GeneID:1733984 Probab=99.82 E-value=1.1e-17 Score=113.55 Aligned_cols=513 Identities=13% Similarity=0.071 Sum_probs=251.2 Q ss_pred CCcc-hHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHHHH Q lcl|Aclame:pro 1 MAET-LEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAE 79 (708) Q Consensus 1 ma~~-~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~g~ 79 (708) |||+ +-...+.++.+|+...+....|...|+++.+|.+=..--.+..... . ....+.-+.-...++.+.+. T Consensus 1 m~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~---~-----~~~~~~dst~~~a~~~Laa~ 72 (536) T protein:vir:10 1 MAEKRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNAS---T-----DYQTPWQAVGARGLNNLASK 72 (536) T ss_pred CcchhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCccc---c-----cccccccccHHHHHHHHHHH Confidence 9983 2224567888888888888888888988876532111111111100 0 01123344555555555543 Q ss_pred HhcC----cceeEEecCCCc------chHHHHHH------HHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeecc Q lcl|Aclame:pro 80 YRNN----RITVKFRPGDRE------ASEELANK------LNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLV 143 (708) Q Consensus 80 ~~~n----r~~~~v~pr~~~------~d~~~A~~------l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~ 143 (708) .... ++=+++.+.+.+ .+.+.+++ .+..+....+.|++..+...++.+.+..|.|+..+.-+ T Consensus 73 l~~~ltP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~e~-- 150 (536) T protein:vir:10 73 LMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEP-- 150 (536) T ss_pred HHhhhcCCCcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEEeeC-- Confidence 3333 332222222111 11122222 34556666778999999999999999999998655211 Q ss_pred ccCCCCCCCcceeeEEeecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCc Q lcl|Aclame:pro 144 NEYDPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGAD 223 (708) Q Consensus 144 ~~~d~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~ 223 (708) ...+...+..+ |+.++++..+.. . ...-+|++..|+..++...||+...... .. .. ..+ T Consensus 151 ------~~~~~~~~~~~--pl~~~~v~~d~~---G-~vd~i~r~~~~t~~~l~~~fg~~~~~~~----~~---~~--~~~ 209 (536) T protein:vir:10 151 ------EGSNYNPMKLY--RLSSYVVQRDAF---G-NVLQMVTRDQIAFGALPEDIRKAVEGQG----GE---KK--ADE 209 (536) T ss_pred ------CCCceeeEEEE--EcCeEEEeeCCC---C-CeeEEeeeeeccHHHHHHhhhhhhcccc----cc---cC--ccc Confidence 11111222222 556777655432 1 2334789999999999999986432110 00 00 001 Q ss_pred eeEEeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCC Q lcl|Aclame:pro 224 VIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPG 303 (708) Q Consensus 224 ~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~ 303 (708) ++.++.+..+..+. ..+.|+ ..+.|..++...+.+|+ T Consensus 210 -------------~v~v~~~V~~~~~~-----------------------------~~~~~~-~e~~g~~v~~~~g~~~f 246 (536) T protein:vir:10 210 -------------TIDVYTHIYLDEAS-----------------------------GEYLRY-EEVEGMEVQGSDGTYPK 246 (536) T ss_pred -------------ceEEEEEEEEecCC-----------------------------CcEEEE-EeecCcccccccccccc Confidence 11222222111000 001121 23445556555677888 Q ss_pred CCcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceee Q lcl|Aclame:pro 304 EHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLP 383 (708) Q Consensus 304 ~~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~ 383 (708) ..+||+++.... .+|..+|.|.+....+-.+.+|++....+.......+++++++++.+-...... ...++.++. T Consensus 247 ~~~P~i~~Rw~~--~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~---~~~~g~~v~ 321 (536) T protein:vir:10 247 EACPYIPIRMVR--LDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLT---KAQTGDFVT 321 (536) T ss_pred ccCCceeeeeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccCcccccchhhhc---cCCCcceec Confidence 999998765443 688899999999999999999999999999999999999999877664332211 112222222 Q ss_pred ecccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcc-ccc-chhHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 384 LREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQ-MPS-NIAQETVNNLMNRADMASFIYL 461 (708) Q Consensus 384 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G-~~~-n~sg~ai~~~q~q~~~~~~~~~ 461 (708) ... +.+. ... .....--+...+.++...+.|....-+. +++ .++ ..|++-|..+.+.....+...+ T Consensus 322 g~~-----~~v~----~~~-~~~~~~~~~~~~~i~~~~~rI~~af~~~--~l~~~~~~r~TAtEV~~r~~E~~~~LG~v~ 389 (536) T protein:vir:10 322 GRP-----EDIS----FLQ-LEKQADFTVAKAVSDAIEARLSFAFMLN--SAVQRTGERVTAEEIRYVASELEDTLGGVY 389 (536) T ss_pred CCc-----ccce----eee-ccccccchHHHHHHHHHHHHHHHHHhhh--hcccCCCCCccHHHHHHHHHHHHHHhhHHH Confidence 111 1110 111 1111222334566777777776665333 233 223 3688889999988888888888 Q ss_pred HHHH-HHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHH Q lcl|Aclame:pro 462 DNMA-KSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARR 540 (708) Q Consensus 462 dn~~-~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r 540 (708) .+|. .+..-+.+..+.++.+. |. |- +-.. ++ +.+.+..+.+ +-.| T Consensus 390 ~rl~~Ell~Pli~r~~~il~r~-------------g~-----lP-----~~p~------~~----v~~~~vs~l~-~l~r 435 (536) T protein:vir:10 390 SILSQELQLPLVRVLLKQLQAT-------------QQ-----IP-----ELPK------EA----VEPTISTGLE-AIGR 435 (536) T ss_pred HHHHHHHHHHHHHHHHHHHHhC-------------CC-----CC-----CCCh------hh----ccceEEecHH-HHHH Confidence 8876 45555555555554321 10 00 0000 00 1223333333 4557 Q ss_pred HHHHHHHHHHHHhccccCchhHHHHHHHHh-hccchhHHHHHHHHHhhhh--hhhcccCcchHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 541 DATVSVLTNVLSSMLPTDPMRPAIQGIILD-NIDGEGLDDFKEYNRNQLL--ISGIAKPRNEKEQQIVQQAQMAAQSQPN 617 (708) Q Consensus 541 ~~~~~~l~~llq~~~~~~p~~~~~~~~~~~-~~d~~~~~ei~e~~~~~~~--~~~~~~~~~~~~~q~~~~~qq~qq~~~~ 617 (708) .+..+.++.+++.+....|. ++. .. +.++++..+-.... +.....+ +++.++.+++++++++. T Consensus 436 ~~~~~~l~~~~~~la~~~P~-------~ld~~i---d~d~~~~~~a~~~Gv~p~~~irt--~eev~~~r~q~~~~~~~-- 501 (536) T protein:vir:10 436 GQDLDKLERCVTAWAALAPM-------RDDPDI---NLAMIKLRIANAIGIDTSGILLT--EEQKQQKMAQQSMQMGM-- 501 (536) T ss_pred HHHHHHHHHHHHHHHhhchh-------hhcccC---CHHHHHHHHHHHcCCCchhhcCC--HHHHHHHHHHHHHHHHH-- Confidence 78888888887776554432 111 12 33344444433322 1222222 12211111111100000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 618 PEMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVM 678 (708) Q Consensus 618 ~~~~~aq~~~~~~qae~~k~~~~~~~~q~e~~~~~~~~~~~~a~~~~~~~q~~~~~~~~~~ 678 (708) ++ ++. +. .+..+.+++.-.+...+..+ ++ ..+ .+ + T Consensus 502 -~~---~a~-----~~-~~~~~~~~~~~~~~~~~~~~----~~----------g~~-~~-~ 536 (536) T protein:vir:10 502 -DN---GAA-----AL-AQGMAAQATASPEAMAAAAD----SV----------GLQ-PG-I 536 (536) T ss_pred -HH---HHH-----HH-HHHHHHHHhcCchhHHhhhh----cc----------ccC-CC-C Confidence 00 000 00 00000000000000000000 00 000 00 0 No 29 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=99.81 E-value=2.2e-17 Score=111.91 Aligned_cols=501 Identities=14% Similarity=0.038 Sum_probs=248.9 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEY 80 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~g~~ 80 (708) ||++.-...+.+..+|+...+....|...|+++.+|..=..--.+..... . .+..+.-+.-...++.+.+.. T Consensus 1 ~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~---~-----~~~~~~dst~~~a~~~Las~l 72 (522) T protein:vir:94 1 MAEREGFAAEGAKAVYDRLKNGRQPYETRAQNCAAVTIPSLFPKESDNSS---T-----EYTTPWQAVGARCLNNLAAKL 72 (522) T ss_pred CcccchhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCccc---c-----cccccccccHHHHHHHHHHHH Confidence 99876666788899999888888888889988876521110011111100 0 011223444555555555443 Q ss_pred hcC----cceeEEecCC---------CcchHHH---HHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccc Q lcl|Aclame:pro 81 RNN----RITVKFRPGD---------REASEEL---ANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVN 144 (708) Q Consensus 81 ~~n----r~~~~v~pr~---------~~~d~~~---A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~ 144 (708) ... ++=.+..+.+ .....++ -+.++..+......|++..+...++.+.+..|.|+..+.-+ T Consensus 73 ~~~ltP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~--- 149 (522) T protein:vir:94 73 MLALFPQSPWMRLTVSEYEAKTLSQDSEAAARVDEGLAMVERVLMAYMETNSFRVPLFEALKQLIVSGNCLLYIPEP--- 149 (522) T ss_pred HhhcCCCCcccccccchhhhhccCcccchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEeeecc--- Confidence 333 3222222221 1111112 22234455556678999999999999999999998765321 Q ss_pred cCCCCCCCcceeeEEeecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCce Q lcl|Aclame:pro 145 EYDPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADV 224 (708) Q Consensus 145 ~~d~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~ 224 (708) +.+ ....+..+ |+.++++..+.. . ...-++++..++.+.+-..++..... . .....+. T Consensus 150 ---~~~--~~~~~~~~--pl~~y~v~~d~~---G-~vd~i~r~~~~~~~~l~~~~~~~~~~---------~--~~~p~~~ 207 (522) T protein:vir:94 150 ---EQG--TYSPMRMY--RLVSYVVQRDAF---G-NILQIVTIDKVAFSALPEDVKSQLNA---------D--DYEPDTE 207 (522) T ss_pred ---CCC--ceeeEEEE--EcceEEEeeCCC---c-CeEEEeeeeeccHHhcchHHHHHHhc---------c--cCCccce Confidence 111 11122222 556666654332 1 23346777888888765555432210 0 0001122 Q ss_pred eEEeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCC Q lcl|Aclame:pro 225 IYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGE 304 (708) Q Consensus 225 ~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~ 304 (708) + .+|.+..+..++..++ .-..|..+....+.+|+. T Consensus 208 v-------------~v~~~v~~~~~~~~~~--------------------------------~~~~g~~~~~~~~~~~~~ 242 (522) T protein:vir:94 208 L-------------EVYTHIYRQDDEYLRY--------------------------------EEVEGIEVTGTDGSYPLT 242 (522) T ss_pred E-------------EEEEEEEeeCCceeEE--------------------------------eeccCceecccCCCCccc Confidence 2 2222222211111111 112233333334568889 Q ss_pred CcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeee Q lcl|Aclame:pro 305 HIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPL 384 (708) Q Consensus 305 ~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~ 384 (708) .+||+++.... .+|..+|.|.+....+-.+.+|++....+.......+++++++++.+-...+.. ...++.++.. T Consensus 243 e~P~~~~Rw~~--~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~~~g~~~~~~~~---~~~~g~~v~g 317 (522) T protein:vir:94 243 ACPYIPVRMVR--LDGEDYGRSYCEEYLGDLNSLETITEAITKMAKVASKVVGLVNPNGITQPRRLN---KAATGEFVAG 317 (522) T ss_pred cCCceeeeeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecccccccchhee---ccCCceeecC Confidence 99998765443 688999999999999999999999999999999999999999877654332211 1112222211 Q ss_pred cccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHccc--ccchhHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 385 REVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM--PSNIAQETVNNLMNRADMASFIYLD 462 (708) Q Consensus 385 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~--~~n~sg~ai~~~q~q~~~~~~~~~d 462 (708) . .+.+. +......+. -+...+.++...+.|....-+. +++. ..+.|++-|..+.+.....+...+. T Consensus 318 ~-----~~~v~----~~~~~~~~~-~~~~~~~i~~~~~rI~~af~~~--~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~ 385 (522) T protein:vir:94 318 R-----VEDIN----FLQLTKGQD-FTIAKSVADAIEQRLGWAFLLN--SAVQRNAERVTAEEIRYVAGELEATLGGVYS 385 (522) T ss_pred C-----cccce----eeecccccc-hhHHHHHHHHHHHHHHHHHhhh--hhccCCCccccHHHHHHHHHHHHHHHhHHHH Confidence 1 11111 111111222 2334566777777777766443 3332 2346889899999988888888888 Q ss_pred HHH-HHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHH Q lcl|Aclame:pro 463 NMA-KSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRD 541 (708) Q Consensus 463 n~~-~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~ 541 (708) +|. .+..-+.+..+.++.+.-- + . .. | . ..+.+.+..+.. ...|. T Consensus 386 rl~~E~l~Pli~r~~~il~r~g~----l--------------P--~~-p--~----------~~v~v~~~s~La-~~qr~ 431 (522) T protein:vir:94 386 VQSQELQLPIVRVLMNQLQSAGM----I--------------P--DL-P--K----------EAVEPTVSTGLE-ALGRG 431 (522) T ss_pred HHHHHHHHHHHHHHHHHHHhcCC----C--------------C--CC-C--c----------ccEEeeEecHHH-HHHHH Confidence 876 4555555555555433211 0 0 00 0 0 012334433333 45677 Q ss_pred HHHHHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhh--hhcccCcchHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 542 ATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLI--SGIAKPRNEKEQQIVQQAQMAAQSQPNPE 619 (708) Q Consensus 542 ~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~--~~~~~~~~~~~~q~~~~~qq~qq~~~~~~ 619 (708) +-.+.+.++++.++...|... .... +.++++..+-..... ...... +++.++.++++++++..++ T Consensus 432 ~~~~~l~~~~~~ia~l~P~~~------~~~i---d~d~~~~~~a~~~Gv~~~~ivr~--~ee~~~~~~q~~~~~~~~~-- 498 (522) T protein:vir:94 432 QDLEKLTQAVNMMTGLQPLSQ------DPDI---NLPTLKLRLLNALGIDTAGLLLT--QDEKIQRMAEQSSQQAVVQ-- 498 (522) T ss_pred HHHHHHHHHHHHHHhccchhh------hhcC---CHHHHHHHHHHHcCCChhhccCC--HHHHHHHHHHHHHHHHHHH-- Confidence 778888888887765544321 1112 234444444433321 222221 2222222211111110000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 620 MVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQA 669 (708) Q Consensus 620 ~~~aq~~~~~~qae~~k~~~~~~~~q~e~~~~~~~~~~~~a~~~~~~~q~ 669 (708) .+.++.+...|. .. .+ +...+ .++ T Consensus 499 --~~~~~~~~~~a~----~~----~~--------------~~~~~--~~~ 522 (522) T protein:vir:94 499 --GASAAGANMGAA----VG----QG--------------AGEDM--AQA 522 (522) T ss_pred --HHHHHHHHhhhh----hh----cc--------------cchhh--hcC Confidence 000000000000 00 00 00000 000 No 30 >protein:vir:107404 Length: 555 # NCBI annotation: Bbp21 # Family: family:all:481 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958690;genbank:gi:41179382;genbank:GeneID:2717198 Probab=99.81 E-value=4.7e-17 Score=110.12 Aligned_cols=536 Identities=11% Similarity=0.033 Sum_probs=254.2 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhc-CCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARV-PGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAE 79 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~-~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~g~ 79 (708) |+... ....++.+|+...+....|...|+++.+|..= .|.=|.. +...-..+ .+.+.-..-...++.+.+. T Consensus 1 M~~~~--~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~-~~~~~~~~-----~~~~~dst~~~a~~~LAa~ 72 (555) T protein:vir:10 1 MAEQT--ERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQ-DRNRGEKR-----HNNILDNTGTRALRVLAAG 72 (555) T ss_pred CCCcc--cHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCC-CCCcchhc-----ccccccccHHHHHHHHHHH Confidence 88764 55789999999999999999999988775311 1111111 11100000 1223344455555555443 Q ss_pred Hhc-----CcceeEEecCCCcch--HHHHHH---HHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCC Q lcl|Aclame:pro 80 YRN-----NRITVKFRPGDREAS--EELANK---LNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPM 149 (708) Q Consensus 80 ~~~-----nr~~~~v~pr~~~~d--~~~A~~---l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~ 149 (708) ... +++=+++.+.+++.+ .+..+. .+..+......|++..+...++.+.+..|.|++.+..+ + T Consensus 73 L~~~ltpp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d------~- 145 (555) T protein:vir:10 73 MMAGMTSPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPD------F- 145 (555) T ss_pred HHHhhcCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecC------C- Confidence 332 455556666543211 122232 34445556678999999999999999999998765322 1 Q ss_pred CCCcceeeEEeecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCCc-ccccccccccccccCCCCCceeEEe Q lcl|Aclame:pro 150 DDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKP-PTSLDVTSMTSWEYNWFGADVIYIA 228 (708) Q Consensus 150 ~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~-~~~~d~~~~~~~~~~~~~~~~~~v~ 228 (708) ...+++..+ |+.++++..++. . ...-+|+...|+..++..+||... .+.+.. ..+.. .... T Consensus 146 --~~~~rf~~~--pl~~~~v~~d~~---G-~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~----~~~~~-~~~~----- 207 (555) T protein:vir:10 146 --DAVVYHHSL--TAGEYAIAADNQ---G-RVNTLYREFQITVAQMVREFGKDKCSTTVQS----LFDRG-ALEQ----- 207 (555) T ss_pred --CceEEEEEe--ecceeEEeeCCC---C-CEEEEEEEEeccHHHHHHhcCcccCCHHHHH----HHhcC-CCCc----- Confidence 223455544 567777755443 2 223467888999999999998644 221110 00000 0011 Q ss_pred eeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEE-EEEEEEe-cceeeecCCCCCCCCc Q lcl|Aclame:pro 229 KYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRR-VYVSVVD-GDGFLEKPRRIPGEHI 306 (708) Q Consensus 229 e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-v~~~~~~-~~~il~~~~~~p~~~~ 306 (708) .+.++++.++....-..-. ....+.+. |+|.--. +..++.. ..|..+ T Consensus 208 --------~v~v~~~V~pr~~~~~~~~--------------------~~~~~p~~s~~~~~~~d~~~vl~e---sgy~e~ 256 (555) T protein:vir:10 208 --------WVTVIHAIEPRADRDPSKR--------------------DDRNMAWKSVYFEPGADETRTLRE---SGYRSF 256 (555) T ss_pred --------eEEEEEEEeeccCcCcCCC--------------------CccccceEEEEEEeccCCcccccc---CCcccC Confidence 1223333333211100000 00001111 2221111 2234432 334667 Q ss_pred ceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeecc Q lcl|Aclame:pro 307 PLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLRE 386 (708) Q Consensus 307 p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~ 386 (708) ||+|+... ..+|..+|.|.+....+-.+.+|++....+..+....+++++++.+.... ..+..++++..+.. T Consensus 257 P~i~~Rw~--~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~~------~~~~~pgg~~~v~~ 328 (555) T protein:vir:10 257 RALCPRWA--LVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQLPVSAKNQ------DISTVPGGLSYVDA 328 (555) T ss_pred Cceeeeee--ecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccc------cceecccccccccc Confidence 77765433 35888999999999999999999999999999999999999998765321 22344554432221 Q ss_pred cccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCCh-hHccc--ccchhHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 387 VRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQ-AMQQM--PSNIAQETVNNLMNRADMASFIYLDN 463 (708) Q Consensus 387 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~-~~~G~--~~n~sg~ai~~~q~q~~~~~~~~~dn 463 (708) +.. .. ...+.+...+.+ +...+.++...+.|....=.+- .+++. ....|++-|..+.+.....+...+.+ T Consensus 329 g~~-----~d-~~~~~~~~~~d~-~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~TAtEV~~r~~E~~~~LG~v~~r 401 (555) T protein:vir:10 329 AAP-----NG-GIRTAFEVNLDL-SHLLADIVDVRERIKASFYADLFLMLANGTNPQMTATEVAERHEEKLLMLGPVLER 401 (555) T ss_pred CCC-----Cc-ceecccccccch-HHHHHHHHHHHHHHHHHhhcchhhhccCCCCCcccHHHHHHHHHHHHHHhhHHHHH Confidence 111 00 112222222222 3445566676777766652221 12322 23368999999988888888888888 Q ss_pred HH-HHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHH Q lcl|Aclame:pro 464 MA-KSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDA 542 (708) Q Consensus 464 ~~-~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~ 542 (708) |. .+..-+.+..+.++.+.--=|. -+..+ ....++|......+ ..+|.. T Consensus 402 l~~E~l~Pli~r~~~il~r~g~lP~----------------~P~~l-------------~~~~i~v~yis~La-~aq~~~ 451 (555) T protein:vir:10 402 MHNEILDPLIELTFQRMVEANILPP----------------PPQEM-------------QGVDLNVEFVSMLA-QAQRAI 451 (555) T ss_pred HHHHHHHHHHHHHHHHHHhcCCCCC----------------Cchhh-------------cCceeEEEeccHHH-HHHHHH Confidence 76 4545555444444444210000 00000 00113333333332 344555 Q ss_pred HHHHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhh-hhcccCcchHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 543 TVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLI-SGIAKPRNEKEQQIVQQAQMAAQSQPNPEMV 621 (708) Q Consensus 543 ~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~-~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~ 621 (708) ....+.++++.+++..+. .+.++...+ .++++..+-..... .....++ ++.++..++.+++++.+++ . T Consensus 452 ~~~~i~~~l~~i~~laq~----~P~vld~id---~d~~~~~~a~~~Gvp~~~irs~-eev~~~r~qr~~~~q~~~~---a 520 (555) T protein:vir:10 452 ATNSVDRFVGNLGAVAGI----KPEVLDKFD---ADRWADTYADMLGIDPELIVPG-NQVALIRKQRADQQQAAQQ---A 520 (555) T ss_pred HHHHHHHHHHHHHHHhcC----ChhhhhcCC---HHHHHHHHHHHhCCCccccCCH-HHHHHHHHHHHHHHHHHHH---H Confidence 555555555554332211 112233333 34444444433321 2222221 1112211111111111111 0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 622 LAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVMEAIRLLK 685 (708) Q Consensus 622 ~aq~~~~~~qae~~k~~~~~~~~q~e~~~~~~~~~~~~a~~~~~~~q~~~~~~~~~~~~~~~~~ 685 (708) .++.+...+.+.+..++..-. .. +...+++...-. T Consensus 521 ~~~~q~~~~~~~~~~~~~~~~----~~-------------------------~~~~~~~~~~~~ 555 (555) T protein:vir:10 521 ALLNQGADTAAKLGSVDTSKQ----NA-------------------------LTDVTRAFSGYT 555 (555) T ss_pred HHHHHHHHHHHHhcccccCcc----hh-------------------------HHHHHhhhccCC Confidence 111111000000000000000 00 000011100000 No 31 >protein:vir:107822 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996631;genbank:gi:45580765;genbank:GeneID:2767898 Probab=99.81 E-value=4.7e-17 Score=110.12 Aligned_cols=536 Identities=11% Similarity=0.033 Sum_probs=254.2 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhc-CCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARV-PGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAE 79 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~-~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~g~ 79 (708) |+... ....++.+|+...+....|...|+++.+|..= .|.=|.. +...-..+ .+.+.-..-...++.+.+. T Consensus 1 M~~~~--~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~-~~~~~~~~-----~~~~~dst~~~a~~~LAa~ 72 (555) T protein:vir:10 1 MAEQT--ERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQ-DRNRGEKR-----HNNILDNTGTRALRVLAAG 72 (555) T ss_pred CCCcc--cHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCC-CCCcchhc-----ccccccccHHHHHHHHHHH Confidence 88764 55789999999999999999999988775311 1111111 11100000 1223344455555555443 Q ss_pred Hhc-----CcceeEEecCCCcch--HHHHHH---HHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCC Q lcl|Aclame:pro 80 YRN-----NRITVKFRPGDREAS--EELANK---LNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPM 149 (708) Q Consensus 80 ~~~-----nr~~~~v~pr~~~~d--~~~A~~---l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~ 149 (708) ... +++=+++.+.+++.+ .+..+. .+..+......|++..+...++.+.+..|.|++.+..+ + T Consensus 73 L~~~ltpp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d------~- 145 (555) T protein:vir:10 73 MMAGMTSPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPD------F- 145 (555) T ss_pred HHHhhcCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecC------C- Confidence 332 455556666543211 122232 34445556678999999999999999999998765322 1 Q ss_pred CCCcceeeEEeecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCCc-ccccccccccccccCCCCCceeEEe Q lcl|Aclame:pro 150 DDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKP-PTSLDVTSMTSWEYNWFGADVIYIA 228 (708) Q Consensus 150 ~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~-~~~~d~~~~~~~~~~~~~~~~~~v~ 228 (708) ...+++..+ |+.++++..++. . ...-+|+...|+..++..+||... .+.+.. ..+.. .... T Consensus 146 --~~~~rf~~~--pl~~~~v~~d~~---G-~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~----~~~~~-~~~~----- 207 (555) T protein:vir:10 146 --DAVVYHHSL--TAGEYAIAADNQ---G-RVNTLYREFQITVAQMVREFGKDKCSTTVQS----LFDRG-ALEQ----- 207 (555) T ss_pred --CceEEEEEe--ecceeEEeeCCC---C-CEEEEEEEEeccHHHHHHhcCcccCCHHHHH----HHhcC-CCCc----- Confidence 223455544 567777755443 2 223467888999999999998644 221110 00000 0011 Q ss_pred eeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEE-EEEEEEe-cceeeecCCCCCCCCc Q lcl|Aclame:pro 229 KYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRR-VYVSVVD-GDGFLEKPRRIPGEHI 306 (708) Q Consensus 229 e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-v~~~~~~-~~~il~~~~~~p~~~~ 306 (708) .+.++++.++....-..-. ....+.+. |+|.--. +..++.. ..|..+ T Consensus 208 --------~v~v~~~V~pr~~~~~~~~--------------------~~~~~p~~s~~~~~~~d~~~vl~e---sgy~e~ 256 (555) T protein:vir:10 208 --------WVTVIHAIEPRADRDPSKR--------------------DDRNMAWKSVYFEPGADETRTLRE---SGYRSF 256 (555) T ss_pred --------eEEEEEEEeeccCcCcCCC--------------------CccccceEEEEEEeccCCcccccc---CCcccC Confidence 1223333333211100000 00001111 2221111 2234432 334667 Q ss_pred ceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeecc Q lcl|Aclame:pro 307 PLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLRE 386 (708) Q Consensus 307 p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~ 386 (708) ||+|+... ..+|..+|.|.+....+-.+.+|++....+..+....+++++++.+.... ..+..++++..+.. T Consensus 257 P~i~~Rw~--~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~~------~~~~~pgg~~~v~~ 328 (555) T protein:vir:10 257 RALCPRWA--LVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQLPVSAKNQ------DISTVPGGLSYVDA 328 (555) T ss_pred Cceeeeee--ecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccc------cceecccccccccc Confidence 77765433 35888999999999999999999999999999999999999998765321 22344554432221 Q ss_pred cccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCCh-hHccc--ccchhHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 387 VRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQ-AMQQM--PSNIAQETVNNLMNRADMASFIYLDN 463 (708) Q Consensus 387 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~-~~~G~--~~n~sg~ai~~~q~q~~~~~~~~~dn 463 (708) +.. .. ...+.+...+.+ +...+.++...+.|....=.+- .+++. ....|++-|..+.+.....+...+.+ T Consensus 329 g~~-----~d-~~~~~~~~~~d~-~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~TAtEV~~r~~E~~~~LG~v~~r 401 (555) T protein:vir:10 329 AAP-----NG-GIRTAFEVNLDL-SHLLADIVDVRERIKASFYADLFLMLANGTNPQMTATEVAERHEEKLLMLGPVLER 401 (555) T ss_pred CCC-----Cc-ceecccccccch-HHHHHHHHHHHHHHHHHhhcchhhhccCCCCCcccHHHHHHHHHHHHHHhhHHHHH Confidence 111 00 112222222222 3445566676777766652221 12322 23368999999988888888888888 Q ss_pred HH-HHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHH Q lcl|Aclame:pro 464 MA-KSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDA 542 (708) Q Consensus 464 ~~-~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~ 542 (708) |. .+..-+.+..+.++.+.--=|. -+..+ ....++|......+ ..+|.. T Consensus 402 l~~E~l~Pli~r~~~il~r~g~lP~----------------~P~~l-------------~~~~i~v~yis~La-~aq~~~ 451 (555) T protein:vir:10 402 MHNEILDPLIELTFQRMVEANILPP----------------PPQEM-------------QGVDLNVEFVSMLA-QAQRAI 451 (555) T ss_pred HHHHHHHHHHHHHHHHHHhcCCCCC----------------Cchhh-------------cCceeEEEeccHHH-HHHHHH Confidence 76 4545555444444444210000 00000 00113333333332 344555 Q ss_pred HHHHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhh-hhcccCcchHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 543 TVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLI-SGIAKPRNEKEQQIVQQAQMAAQSQPNPEMV 621 (708) Q Consensus 543 ~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~-~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~ 621 (708) ....+.++++.+++..+. .+.++...+ .++++..+-..... .....++ ++.++..++.+++++.+++ . T Consensus 452 ~~~~i~~~l~~i~~laq~----~P~vld~id---~d~~~~~~a~~~Gvp~~~irs~-eev~~~r~qr~~~~q~~~~---a 520 (555) T protein:vir:10 452 ATNSVDRFVGNLGAVAGI----KPEVLDKFD---ADRWADTYADMLGIDPELIVPG-NQVALIRKQRADQQQAAQQ---A 520 (555) T ss_pred HHHHHHHHHHHHHHHhcC----ChhhhhcCC---HHHHHHHHHHHhCCCccccCCH-HHHHHHHHHHHHHHHHHHH---H Confidence 555555555554332211 112233333 34444444433321 2222221 1112211111111111111 0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 622 LAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVMEAIRLLK 685 (708) Q Consensus 622 ~aq~~~~~~qae~~k~~~~~~~~q~e~~~~~~~~~~~~a~~~~~~~q~~~~~~~~~~~~~~~~~ 685 (708) .++.+...+.+.+..++..-. .. +...+++...-. T Consensus 521 ~~~~q~~~~~~~~~~~~~~~~----~~-------------------------~~~~~~~~~~~~ 555 (555) T protein:vir:10 521 ALLNQGADTAAKLGSVDTSKQ----NA-------------------------LTDVTRAFSGYT 555 (555) T ss_pred HHHHHHHHHHHHhcccccCcc----hh-------------------------HHHHHhhhccCC Confidence 111111000000000000000 00 000011100000 No 32 >protein:vir:98506 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996583;genbank:gi:45569514;genbank:GeneID:2767834 Probab=99.81 E-value=4.7e-17 Score=110.12 Aligned_cols=536 Identities=11% Similarity=0.033 Sum_probs=254.2 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhc-CCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARV-PGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAE 79 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~-~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~g~ 79 (708) |+... ....++.+|+...+....|...|+++.+|..= .|.=|.. +...-..+ .+.+.-..-...++.+.+. T Consensus 1 M~~~~--~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~-~~~~~~~~-----~~~~~dst~~~a~~~LAa~ 72 (555) T protein:vir:98 1 MAEQT--ERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQ-DRNRGEKR-----HNNILDNTGTRALRVLAAG 72 (555) T ss_pred CCCcc--cHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCC-CCCcchhc-----ccccccccHHHHHHHHHHH Confidence 88764 55789999999999999999999988775311 1111111 11100000 1223344455555555443 Q ss_pred Hhc-----CcceeEEecCCCcch--HHHHHH---HHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCC Q lcl|Aclame:pro 80 YRN-----NRITVKFRPGDREAS--EELANK---LNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPM 149 (708) Q Consensus 80 ~~~-----nr~~~~v~pr~~~~d--~~~A~~---l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~ 149 (708) ... +++=+++.+.+++.+ .+..+. .+..+......|++..+...++.+.+..|.|++.+..+ + T Consensus 73 L~~~ltpp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d------~- 145 (555) T protein:vir:98 73 MMAGMTSPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPD------F- 145 (555) T ss_pred HHHhhcCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecC------C- Confidence 332 455556666543211 122232 34445556678999999999999999999998765322 1 Q ss_pred CCCcceeeEEeecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCCc-ccccccccccccccCCCCCceeEEe Q lcl|Aclame:pro 150 DDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKP-PTSLDVTSMTSWEYNWFGADVIYIA 228 (708) Q Consensus 150 ~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~-~~~~d~~~~~~~~~~~~~~~~~~v~ 228 (708) ...+++..+ |+.++++..++. . ...-+|+...|+..++..+||... .+.+.. ..+.. .... T Consensus 146 --~~~~rf~~~--pl~~~~v~~d~~---G-~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~----~~~~~-~~~~----- 207 (555) T protein:vir:98 146 --DAVVYHHSL--TAGEYAIAADNQ---G-RVNTLYREFQITVAQMVREFGKDKCSTTVQS----LFDRG-ALEQ----- 207 (555) T ss_pred --CceEEEEEe--ecceeEEeeCCC---C-CEEEEEEEEeccHHHHHHhcCcccCCHHHHH----HHhcC-CCCc----- Confidence 223455544 567777755443 2 223467888999999999998644 221110 00000 0011 Q ss_pred eeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEE-EEEEEEe-cceeeecCCCCCCCCc Q lcl|Aclame:pro 229 KYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRR-VYVSVVD-GDGFLEKPRRIPGEHI 306 (708) Q Consensus 229 e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-v~~~~~~-~~~il~~~~~~p~~~~ 306 (708) .+.++++.++....-..-. ....+.+. |+|.--. +..++.. ..|..+ T Consensus 208 --------~v~v~~~V~pr~~~~~~~~--------------------~~~~~p~~s~~~~~~~d~~~vl~e---sgy~e~ 256 (555) T protein:vir:98 208 --------WVTVIHAIEPRADRDPSKR--------------------DDRNMAWKSVYFEPGADETRTLRE---SGYRSF 256 (555) T ss_pred --------eEEEEEEEeeccCcCcCCC--------------------CccccceEEEEEEeccCCcccccc---CCcccC Confidence 1223333333211100000 00001111 2221111 2234432 334667 Q ss_pred ceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeecc Q lcl|Aclame:pro 307 PLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLRE 386 (708) Q Consensus 307 p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~ 386 (708) ||+|+... ..+|..+|.|.+....+-.+.+|++....+..+....+++++++.+.... ..+..++++..+.. T Consensus 257 P~i~~Rw~--~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~~------~~~~~pgg~~~v~~ 328 (555) T protein:vir:98 257 RALCPRWA--LVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQLPVSAKNQ------DISTVPGGLSYVDA 328 (555) T ss_pred Cceeeeee--ecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccc------cceecccccccccc Confidence 77765433 35888999999999999999999999999999999999999998765321 22344554432221 Q ss_pred cccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCCh-hHccc--ccchhHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 387 VRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQ-AMQQM--PSNIAQETVNNLMNRADMASFIYLDN 463 (708) Q Consensus 387 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~-~~~G~--~~n~sg~ai~~~q~q~~~~~~~~~dn 463 (708) +.. .. ...+.+...+.+ +...+.++...+.|....=.+- .+++. ....|++-|..+.+.....+...+.+ T Consensus 329 g~~-----~d-~~~~~~~~~~d~-~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~TAtEV~~r~~E~~~~LG~v~~r 401 (555) T protein:vir:98 329 AAP-----NG-GIRTAFEVNLDL-SHLLADIVDVRERIKASFYADLFLMLANGTNPQMTATEVAERHEEKLLMLGPVLER 401 (555) T ss_pred CCC-----Cc-ceecccccccch-HHHHHHHHHHHHHHHHHhhcchhhhccCCCCCcccHHHHHHHHHHHHHHhhHHHHH Confidence 111 00 112222222222 3445566676777766652221 12322 23368999999988888888888888 Q ss_pred HH-HHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHH Q lcl|Aclame:pro 464 MA-KSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDA 542 (708) Q Consensus 464 ~~-~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~ 542 (708) |. .+..-+.+..+.++.+.--=|. -+..+ ....++|......+ ..+|.. T Consensus 402 l~~E~l~Pli~r~~~il~r~g~lP~----------------~P~~l-------------~~~~i~v~yis~La-~aq~~~ 451 (555) T protein:vir:98 402 MHNEILDPLIELTFQRMVEANILPP----------------PPQEM-------------QGVDLNVEFVSMLA-QAQRAI 451 (555) T ss_pred HHHHHHHHHHHHHHHHHHhcCCCCC----------------Cchhh-------------cCceeEEEeccHHH-HHHHHH Confidence 76 4545555444444444210000 00000 00113333333332 344555 Q ss_pred HHHHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhh-hhcccCcchHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 543 TVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLI-SGIAKPRNEKEQQIVQQAQMAAQSQPNPEMV 621 (708) Q Consensus 543 ~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~-~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~ 621 (708) ....+.++++.+++..+. .+.++...+ .++++..+-..... .....++ ++.++..++.+++++.+++ . T Consensus 452 ~~~~i~~~l~~i~~laq~----~P~vld~id---~d~~~~~~a~~~Gvp~~~irs~-eev~~~r~qr~~~~q~~~~---a 520 (555) T protein:vir:98 452 ATNSVDRFVGNLGAVAGI----KPEVLDKFD---ADRWADTYADMLGIDPELIVPG-NQVALIRKQRADQQQAAQQ---A 520 (555) T ss_pred HHHHHHHHHHHHHHHhcC----ChhhhhcCC---HHHHHHHHHHHhCCCccccCCH-HHHHHHHHHHHHHHHHHHH---H Confidence 555555555554332211 112233333 34444444433321 2222221 1112211111111111111 0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 622 LAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVMEAIRLLK 685 (708) Q Consensus 622 ~aq~~~~~~qae~~k~~~~~~~~q~e~~~~~~~~~~~~a~~~~~~~q~~~~~~~~~~~~~~~~~ 685 (708) .++.+...+.+.+..++..-. .. +...+++...-. T Consensus 521 ~~~~q~~~~~~~~~~~~~~~~----~~-------------------------~~~~~~~~~~~~ 555 (555) T protein:vir:98 521 ALLNQGADTAAKLGSVDTSKQ----NA-------------------------LTDVTRAFSGYT 555 (555) T ss_pred HHHHHHHHHHHHhcccccCcc----hh-------------------------HHHHHhhhccCC Confidence 111111000000000000000 00 000011100000 No 33 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=99.81 E-value=8.1e-19 Score=119.76 Aligned_cols=467 Identities=12% Similarity=0.039 Sum_probs=224.9 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCC--ceeecchHHHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYP--KFEINKVATELNRIIA 78 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp--~~~~N~i~~~i~~i~g 78 (708) --+..+++-..-.+.+.++.+.+...+.... .+-..||.|+++.-..+.... ..++| .++.|..+.+|+..+| T Consensus 29 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~r~-~~~~~yY~g~~~~i~~~~~~~----~~~~~~~ri~~n~~k~Ivd~~~~ 103 (501) T protein:vir:96 29 RADNLEELMVNNWELLKNFINHHKLRQAPRI-QELLDYARGENHDVLKSGRRK----DNEMADKRAVHNYGRMISKFKTG 103 (501) T ss_pred cccccccccCChHHHHHHHHHHHHHHHHHHH-HHHHHHhcCCCCcccCccccC----ccccccceeecchHHHHHHHHhh Confidence 0011111110111122333333332222111 122357889886432221111 11222 4789999999999999 Q ss_pred HHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCcceeeE Q lcl|Aclame:pro 79 EYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAIE 158 (708) Q Consensus 79 ~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~i~ 158 (708) +...+.+.+.+... + -.+.+...+..+++.|+++.....+..+++++|.||..+..+. ++.+++. T Consensus 104 yl~g~p~~~~~~~~---~---~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~de---------dg~~~i~ 168 (501) T protein:vir:96 104 YLAGNPIRVEYDDN---D---DNSQNDDAIKRIGRINDLDSLNRTLIRDLSQTGRAYEVIYRSE---------YDETRIK 168 (501) T ss_pred hhcccCeeEeeCCc---c---chhHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEEEcC---------CCceEEE Confidence 99999888876321 1 1345667777788899999999999999999999998876432 2344554 Q ss_pred Eeecchhhe--ecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeeeecce Q lcl|Aclame:pro 159 PIYDPSRSV--WFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVRKE 236 (708) Q Consensus 159 ~v~~~~~~v--~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~~~ 236 (708) .+ ++..+ +||+.... +.. ++++.|.. ....+.....++|...+ T Consensus 169 ~~--~p~~~~~v~d~~~~~----~~~-~~v~~~~~---------------------------~~~~~~~~~~~vyt~~~- 213 (501) T protein:vir:96 169 RL--SPLETFVIYDNSLED----NSI-AAVRYYNR---------------------------GTLQSAKDVVEIYTDEH- 213 (501) T ss_pred EE--ccceeEEEEcCCCCC----ceE-EEEEEEEe---------------------------ecCCCcEEEEEEEcCCc- Confidence 43 22333 34442210 111 11111100 00001111222222211 Q ss_pred EEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEeee Q lcl|Aclame:pro 237 SVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRW 316 (708) Q Consensus 237 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~~~ 316 (708) ++.+...+...+....+.+++.+|+|+|.- T Consensus 214 -----------------------------------------------i~~~~~~~~~~~~~~~~~~~g~vPvv~~~n--- 243 (501) T protein:vir:96 214 -----------------------------------------------IYTLDASDDFNEISVTTHAFGTVPITEYLN--- 243 (501) T ss_pred -----------------------------------------------EEEEeeCCCceeccccccCCCccceEEecC--- Confidence 111111111222233455667777776531 Q ss_pred ccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeecccccccccccc Q lcl|Aclame:pro 317 FIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKSGNIIA 396 (708) Q Consensus 317 ~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 396 (708) ...|.|.+..+++.++.+|+..|.+...+...+.+.+++.-....+..+... +......+.........+.. T Consensus 244 ----n~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~G~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~-- 315 (501) T protein:vir:96 244 ----NIDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQAS--DMKRTRLMQLKPPKSADGKE-- 315 (501) T ss_pred ----CccCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecccccCcccchh--hhhhcCeeeecccccccccc-- Confidence 2346789999999999999999999999988888777664222221111111 11122222222222111111 Q ss_pred cccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHccc-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 397 GATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM-PSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAGEVW 475 (708) Q Consensus 397 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~-~~n~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~~~~~ 475 (708) ....+.+...+.-...+...+..+...|..+|++.+.+.|. .+|.||.|+...............+.|..+++++.+++ T Consensus 316 ~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li 395 (501) T protein:vir:96 316 GTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNTPDMSDTNFSGNTSGEALKYKLFGLDQDRVDTQSQFTKGLKRRYRLA 395 (501) T ss_pred cCcceeeEeccCCHHHHHHHHHHHHHHHHHHhCCcccCcccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11122233222233566777888899999999998877765 46789999988877777777777788888888888887 Q ss_pred HHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHHHhcc Q lcl|Aclame:pro 476 LSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSML 555 (708) Q Consensus 476 l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq~~~ 555 (708) +.++...... . .. |. .+|.|.=.+..+....+..+.++.+...+ T Consensus 396 ~~~~~~~~~~--------~--~~---------------------d~----~~i~i~f~~~~p~n~~e~ad~~~kl~g~i- 439 (501) T protein:vir:96 396 ARIGSLVNEF--------K--DF---------------------DE----SLLKITFTPNLPKSLNEQVSILTGLGGQV- 439 (501) T ss_pred HHHHHhcccc--------c--cc---------------------cc----ccceEEeCCCCCcCHHHHHHHHHHHhccC- Confidence 7776443210 0 00 00 12333334455554555656666553211 Q ss_pred ccCchhHHHHHHHHhhccc-hhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 556 PTDPMRPAIQGIILDNIDG-EGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEA 634 (708) Q Consensus 556 ~~~p~~~~~~~~~~~~~d~-~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qae~ 634 (708) ...++++++.+ ...+.-.+++.+...... ......+. .-......-+. T Consensus 440 --------S~et~~~~l~~v~D~~~E~~ri~~E~~~~~---------------------~~~~~~~~--~~~~~~~~~~~ 488 (501) T protein:vir:96 440 --------SQETALSLSGLVESPNEELDKINKEMSEID---------------------FKGYSNDF--NEHVGKYTDEV 488 (501) T ss_pred --------chHHHHHhCCCCCCHHHHHHHHHHHHHHhh---------------------ccccccch--hhcccccCCcC Confidence 11233333322 112222233322110000 00000000 00000000000 Q ss_pred HHHHHHHHHHHHH Q lcl|Aclame:pro 635 QKATNETAQTQIK 647 (708) Q Consensus 635 ~k~~~~~~~~q~e 647 (708) ....++......+ T Consensus 489 ~e~~~d~~e~~~~ 501 (501) T protein:vir:96 489 KETHTDDFEREYE 501 (501) T ss_pred CCCCCCccccccC Confidence 0000000000000 No 34 >protein:vir:102668 Length: 547 # NCBI annotation: Hypothetical protein # Family: family:all:481 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024419;genbank:gi:48696640;genbank:GeneID:2948135 Probab=99.80 E-value=1.3e-17 Score=113.22 Aligned_cols=531 Identities=10% Similarity=0.071 Sum_probs=243.1 Q ss_pred HHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhh-cCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHHHHHhc--- Q lcl|Aclame:pro 7 KKHERIMLRFDRAYSPQKEVREKCIEATRFAR-VPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEYRN--- 82 (708) Q Consensus 7 ~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~-~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~g~~~~--- 82 (708) =...++.++|+...+....|...|+++.+|.+ +.+.-+............ ...-+.-+.-...++.+.+.... T Consensus 1 ~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~---~~~~i~dst~~~a~~~Las~L~~~lt 77 (547) T protein:vir:10 1 MENSKIVKRLDFLKTDRKNVEQIWDCIRKYIMPMRSDFFSDLRSEGSINWN---QNREVFDSTAGDGLETLSSSLHGSLT 77 (547) T ss_pred CCHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccccccCCCCCcccccc---cccccccchHHHHHHHHHHHHHHhhc Confidence 12456677788877888888888888776532 011111110000000000 01122334445555555443332 Q ss_pred --CcceeEEecCCCc--chHHHHHHH---HHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCcce Q lcl|Aclame:pro 83 --NRITVKFRPGDRE--ASEELANKL---NGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRI 155 (708) Q Consensus 83 --nr~~~~v~pr~~~--~d~~~A~~l---~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i 155 (708) +++=++..+.+.+ +..+..+-| +..+....+.|++..+...++.+.++.|.|.+.+..+. . ....+ T Consensus 78 Pp~~~WF~l~~~d~~~~~~~~v~~~L~~ve~~i~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~d~------~-~~~~~ 150 (547) T protein:vir:10 78 SPATKWFELAFRDKELNSDDECRKWLENATHDVYSALQDSNFNLEANETYIDLCGYGNAIMVEEEDE------D-EEGSV 150 (547) T ss_pred CCCCcccccccCCccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEeccCC------C-CCCce Confidence 4444555444321 112233333 44555566789999999999999999999988774321 1 22344 Q ss_pred eeEEeecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcc-cccccccccccccCCCCCceeEEeeeeeec Q lcl|Aclame:pro 156 AIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPP-TSLDVTSMTSWEYNWFGADVIYIAKYYEVR 234 (708) Q Consensus 156 ~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~-~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~ 234 (708) ++..+ |+.++++..++.- + ..=|++...|+..++..+||...- +.+......+ .......+. T Consensus 151 r~~~~--pl~~~~v~~d~~G-~---v~~i~r~~~~t~~qi~~~fg~~~l~~~v~~~~~~~---~~~~~~~~~-------- 213 (547) T protein:vir:10 151 VFQSS--PIQDSYFEEDSRG-Q---VVNFYRVFRWTPAQIYDRFGDEGTPEAIIKKAKEA---SNQAALKQE-------- 213 (547) T ss_pred eEEEe--ecceEEEeeCCCc-C---eeeeeeeeeccHHHHHHhcCcccCCHHHHHHHhcC---CCcccceEE-------- Confidence 55544 5678888765532 2 223688899999999999986442 2111110000 000000111 Q ss_pred ceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeE-EEEEEEEec-ceeeecCCCCCCCCcceeeEE Q lcl|Aclame:pro 235 KESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRR-RVYVSVVDG-DGFLEKPRRIPGEHIPLIPVY 312 (708) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~v~~~~~~~-~~il~~~~~~p~~~~p~~p~~ 312 (708) ++.+..+.... ..+... .... ....+.. .||+ -..| ..++.+ + .|.++||+++- T Consensus 214 -----v~~~v~~~~~~----~~~~~~---------~~~~--~~~~~p~~s~~~-e~~~~~~~l~e-s--g~~e~P~~~~R 269 (547) T protein:vir:10 214 -----VVMCVFTRYDK----KQNRNA---------GTVL--APTERPFGKKWI-LKEGAVQLGEE-G--GYYEMPAYAIR 269 (547) T ss_pred -----EEEEEeeccCC----CCCccc---------ccee--eccccceeEEEE-EecCceeeeec-C--CcccCCeeeee Confidence 11111111000 000000 0000 0000111 1222 2233 334433 3 34567777654 Q ss_pred EeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeecccccccc Q lcl|Aclame:pro 313 GKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKSG 392 (708) Q Consensus 313 ~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 392 (708) ... .+|..+|.|.+....+-.+.+|++...++..+.+..+++++++.+.+-+ ..+..+++++.... .. T Consensus 270 w~~--~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~~v~~~g~~~------~~~~~pgg~~~~~~----~~ 337 (547) T protein:vir:10 270 WRK--SAGSQWGFGPSHLALPDVLTANRYVELVLRSSEKVIDPAIMVTERGLIS------DIDLGASGLTVVRD----ME 337 (547) T ss_pred eee--cCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceecccccccc------cceecCCeeeecCC----cc Confidence 333 5888999999999999999999999999999999999999998654432 23345566554321 11 Q ss_pred cccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccchhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHH Q lcl|Aclame:pro 393 NIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMASFIYLDNMA-KSLKRA 471 (708) Q Consensus 393 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n~sg~ai~~~q~q~~~~~~~~~dn~~-~~~~~~ 471 (708) . ..++ +...--......++...+.|...-=+...........|++-|..+.+.....+...+..|. .+..-+ T Consensus 338 ~----v~pl---~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pl 410 (547) T protein:vir:10 338 S----MKPF---ESRARFDVSSIQLTDLRSAVRRIYYVDQLQMKDSPAMTATEVQVRYELMQRLLGPTLGRLENDFLSPM 410 (547) T ss_pred c----ceee---ecccchHHHHHHHHHHHHHHHHHhhhhhhhcCCCccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHH Confidence 1 1122 1111112233455555555555542322233333456899999999988888888887776 455555 Q ss_pred HHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHH Q lcl|Aclame:pro 472 GEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVL 551 (708) Q Consensus 472 ~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~ll 551 (708) ....+.++.+.--=+. + +... -+.....++|..-...+ ..++......+.+++ T Consensus 411 i~r~~~il~r~g~lP~---------------~-p~~l----------~~~~~~~~~v~~is~La-raq~~~~~~~i~~~~ 463 (547) T protein:vir:10 411 IQRTFNIRFRAGKLGE---------------L-PSKL----------LESGKAAMDIVYTGPLS-RAQKIDQAASIERWA 463 (547) T ss_pred HHHHHHHHHhcCCCCC---------------C-chhh----------hccCcceEEEEeccHHH-HHHHHHHHHHHHHHH Confidence 5555555443210000 0 0000 00011223343322222 233444445555555 Q ss_pred HhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhh-hhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 552 SSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLI-SGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAA 630 (708) Q Consensus 552 q~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~-~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~ 630 (708) +.+++.....+ .++...+ .+++...+-..... ...... +++.++..++.++++|.+++....++..+.++ T Consensus 464 ~~v~~laq~~P----~vld~id---~d~~~~~~a~~~Gvp~~~irs-~eev~~~r~qr~~~~q~~~qaa~~~~~g~~m~- 534 (547) T protein:vir:10 464 GSTAQLAEINP----EVLDIPD---WDEMVRMLGSLLGAPQTLMRP-KAKVTSIRKNRSQTQQKAEQAAIAEAEGNAME- 534 (547) T ss_pred HHHHHhhccCh----hhhhcCC---HHHHHHHHHHHhCCChhccCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH- Confidence 54433211111 2223333 33444444333221 112221 11212222221111111111111110011000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 631 QAEAQKATNETAQTQIKAFTAQQD 654 (708) Q Consensus 631 qae~~k~~~~~~~~q~e~~~~~~~ 654 (708) .++ ...+...+.+ T Consensus 535 ------~~~-----~~~a~~~~~~ 547 (547) T protein:vir:10 535 ------AQG-----KGQAALKENQ 547 (547) T ss_pred ------hhc-----CcccchhccC Confidence 000 0000000000 No 35 >protein:vir:1538 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052106;swissprot:trembl:q9t110;genbank:gi:9634032;uniprot:Q9T110;genbank:GeneID:1262384 Probab=99.80 E-value=1.2e-16 Score=107.83 Aligned_cols=510 Identities=13% Similarity=0.073 Sum_probs=252.3 Q ss_pred CCcchHHH--HHHHHHHHHHHHHhhHHHHHHHHHHHHHhh--cCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHH Q lcl|Aclame:pro 1 MAETLEKK--HERIMLRFDRAYSPQKEVREKCIEATRFAR--VPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRI 76 (708) Q Consensus 1 ma~~~~~~--~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~--~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i 76 (708) ||+...+- -+.++.+|+...+....|...|+++.+|.+ ...+.+...- .+ ...+.-..-...++.+ T Consensus 1 m~~~~~~~~~~~~~k~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~-----~~-----~~~~~dst~~~a~~~L 70 (535) T protein:vir:15 1 MADSKRTGLGEDGAKATYDRLTNDRRAYETRAENCAQYTIPSLFPKESDNES-----TD-----YTTPWQAVGARGLNNL 70 (535) T ss_pred CCccchhccchHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCccc-----cc-----ccccccccHHHHHHHH Confidence 99744332 345777888888888888888988876521 1111221100 00 0122233444445544 Q ss_pred HHHHh----cCcceeEEecCCC---------cchHHHHHHH---HHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEe Q lcl|Aclame:pro 77 IAEYR----NNRITVKFRPGDR---------EASEELANKL---NGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTS 140 (708) Q Consensus 77 ~g~~~----~nr~~~~v~pr~~---------~~d~~~A~~l---~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~ 140 (708) .+... -+++=+++.+.+. .+-.++.+.| +..+......|++..+...++.+.+..|.|++.+.. T Consensus 71 aa~l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~ 150 (535) T protein:vir:15 71 ASKLMLALFPMQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALLYLPE 150 (535) T ss_pred HHHHHHhhcCCCcccccccChHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeEEeec Confidence 43332 2333233333210 0111233333 344445567899999999999999999999876632 Q ss_pred eccccCCCCCCCcceeeEEeecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCC Q lcl|Aclame:pro 141 MLVNEYDPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWF 220 (708) Q Consensus 141 ~~~~~~d~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~ 220 (708) ++ ...+++..+ |+.++++..++.- ...-++++..++.+++...|+....... . ... T Consensus 151 ------~~---~~~~~f~~~--pl~~~~v~~d~~G----~vd~i~r~~~~t~~~l~~~~~~~~~~~~----~-----~~~ 206 (535) T protein:vir:15 151 ------PE---GSYNPMKLY--RLSSYVVQRDAYG----NVLQIVTRDQIAFGALPEDVRSAVEKAG----G-----EKK 206 (535) T ss_pred ------CC---CCceeeEEE--EcCeeEEeeCCCC----CeeEEEEeEeecHHHHHHHHhHhhhccc----c-----ccC Confidence 11 223445544 5667777655431 2334889999999999888875422100 0 000 Q ss_pred CCceeEEeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCC Q lcl|Aclame:pro 221 GADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRR 300 (708) Q Consensus 221 ~~~~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~ 300 (708) ..+.+.|... .+.++.++ .+.|+. ...|..+....+. T Consensus 207 ~~~~v~v~~~-----------v~~~~~~~-------------------------------~~~~~~-e~~g~~~~~~~~~ 243 (535) T protein:vir:15 207 MDEMVDVYTH-----------VYLDEESG-------------------------------DYLKYE-EVEDVEIDGSDAT 243 (535) T ss_pred CCCceeEEEE-----------EEEecCCC-------------------------------cEEEEE-EeeCccccccccc Confidence 1111211111 11111111 111211 1223333333466 Q ss_pred CCCCCcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCc Q lcl|Aclame:pro 301 IPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPA 380 (708) Q Consensus 301 ~p~~~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~ 380 (708) +++..+||+++.... .+|..+|.|.+....+-.+.+|++....+.......+++++++.+.+....+... ..++. T Consensus 244 ~~~~~~P~i~~Rw~~--~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~~~g~~~~~~l~~---~~~g~ 318 (535) T protein:vir:15 244 YPTDAMPYIPVRMVR--IDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQPRRLTK---AQTGD 318 (535) T ss_pred cccccCCceeeeeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecccccccchhccc---CCcee Confidence 888999998765443 6889999999999999999999999999999999999999998766543322110 11111 Q ss_pred eeeecccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcc-ccc-chhHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 381 FLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQ-MPS-NIAQETVNNLMNRADMASF 458 (708) Q Consensus 381 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G-~~~-n~sg~ai~~~q~q~~~~~~ 458 (708) ++.. ..+.+. +.+....+. .+...+.++...+.|....= .+ +++ .++ ..|++-|..+.+.....+. T Consensus 319 ~v~g-----~~~~v~----~~~~~~~~~-~~~~~~~i~~~~~~I~~af~-~~-~~~~~~~~r~TAtEV~~r~~E~~~~LG 386 (535) T protein:vir:15 319 FVPG-----RREDID----FLQLEKQAD-FTVAKAVSDQIEARLSYAFM-LN-SAVQRTGERVTAEEIRYVASELEDTLG 386 (535) T ss_pred eecC-----Ccccce----eeecccccc-hhHHHHHHHHHHHHHHHHHh-hh-hcccCCCccccHHHHHHHHHHHHHHHh Confidence 2111 111111 111222222 23345566666777766542 22 333 223 3688889999998888898 Q ss_pred HHHHHHH-HHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccch Q lcl|Aclame:pro 459 IYLDNMA-KSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYT 537 (708) Q Consensus 459 ~~~dn~~-~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~ 537 (708) ..+.+|. .+..-+.+..+.++.+.. . +. +... ..+.+.+..+.. . T Consensus 387 ~v~~rl~~Ell~Pli~r~~~il~r~g-------------~-----lP-----~~p~----------~~v~~~yis~La-~ 432 (535) T protein:vir:15 387 GVYSILSQELQLPLVRVLLKQLQATS-------------Q-----IP-----ELPK----------EAVEPTISTGLE-A 432 (535) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcC-------------C-----CC-----CCCc----------cceeEEEecHHH-H Confidence 8888877 566666666666654411 0 00 0000 113445544443 4 Q ss_pred hHHHHHHHHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhhh--hcccCcchHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 538 ARRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLIS--GIAKPRNEKEQQIVQQAQMAAQSQ 615 (708) Q Consensus 538 ~~r~~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~--~~~~~~~~~~~q~~~~~qq~qq~~ 615 (708) ..|.+..+.|+++++.+....|.. . .... +.+++...+....... ....+ +++.++.++++++++++ T Consensus 433 aqr~~~~~~l~~~~~~la~~~P~~---l---d~~i---d~d~~~~~~a~~~Gvp~~~i~~~--~eev~~~~~q~~~~~~~ 501 (535) T protein:vir:15 433 IGRGQDLDKLERCISAWAALAPMQ---G---DPDI---NLAVIKLRIANAIGIDTSGILLT--DEQKQALMMQDAAQTGI 501 (535) T ss_pred HHHHHHHHHHHHHHHHHHhcChhh---h---hccC---CHHHHHHHHHHHcCCChhhhcCC--HHHHHHHHHHHHHHHHH Confidence 567778888888888776544421 1 1112 3344444444333221 12222 12111111111111000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 616 PNPEMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQAR 670 (708) Q Consensus 616 ~~~~~~~aq~~~~~~qae~~k~~~~~~~~q~e~~~~~~~~~~~~a~~~~~~~q~~ 670 (708) + ++..+..+...++ .+...+..+ +.+++ +..++. T Consensus 502 ~--~~a~~~g~~~~~~---~~~~p~~~~-------~~~~~---------~g~~~~ 535 (535) T protein:vir:15 502 E--NAAATGGAGVGAL---ATSSPEAMQ-------GAAAQ---------AGLDAT 535 (535) T ss_pred H--HHHHHHHhhccch---hccChHHHH-------HHHhc---------cCCCCC Confidence 0 0000000000000 000000000 00000 000000 No 36 >protein:vir:3361 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523332;genbank:gi:17570823;genbank:GeneID:927409 Probab=99.80 E-value=9.9e-17 Score=108.31 Aligned_cols=510 Identities=13% Similarity=0.073 Sum_probs=253.6 Q ss_pred CCcchHH--HHHHHHHHHHHHHHhhHHHHHHHHHHHHHhh--cCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHH Q lcl|Aclame:pro 1 MAETLEK--KHERIMLRFDRAYSPQKEVREKCIEATRFAR--VPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRI 76 (708) Q Consensus 1 ma~~~~~--~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~--~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i 76 (708) ||+...+ --+.++.+|+...+..+.|...|+++.+|.+ .-.+.+...- . ....+.-..-...++.+ T Consensus 1 m~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~-----~-----~~~~~~dst~~~a~~~L 70 (535) T protein:vir:33 1 MADSKRTGLGEDGAKATYDRLTNDRRAYETRAENCAQYTIPSLFPKESDNES-----T-----DYTTPWQAVGARGLNNL 70 (535) T ss_pred CChhhhhccChhHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCccc-----c-----cccccccccHHHHHHHH Confidence 9975433 2355778888888888888899988876521 1011111100 0 01122233444455544 Q ss_pred HHHHhc----CcceeEEecCCC---------cchHHHHHHH---HHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEe Q lcl|Aclame:pro 77 IAEYRN----NRITVKFRPGDR---------EASEELANKL---NGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTS 140 (708) Q Consensus 77 ~g~~~~----nr~~~~v~pr~~---------~~d~~~A~~l---~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~ 140 (708) .+.... +++=+++.+.+. .+-.++.+.| +..+......|++..+...++.+.+..|.|++.+.. T Consensus 71 aa~l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~ 150 (535) T protein:vir:33 71 ASKLMLALFPMQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALLYLPE 150 (535) T ss_pred HHHHHHhhcCCCcccccccChHHHhccccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeEEeec Confidence 433332 333223322221 0011223333 344455567899999999999999999999877632 Q ss_pred eccccCCCCCCCcceeeEEeecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCC Q lcl|Aclame:pro 141 MLVNEYDPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWF 220 (708) Q Consensus 141 ~~~~~~d~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~ 220 (708) + + ...+++..+ |+.++++..+.. . ...-+|++..+|..++.+.|+....... . ... T Consensus 151 ~------~---~~~~~f~~~--pl~~~~v~~d~~---G-~vd~i~r~~~~t~~ql~~~~~~~~~~~~----~---~k~-- 206 (535) T protein:vir:33 151 P------E---GSYNPMKLY--RLSSYVVQRDAY---G-NVLQIVTRDQIAFGALPEDVRSAVEKSG----G---EKK-- 206 (535) T ss_pred C------C---CCceeeEEE--EcCeeEEeeCCC---C-CeeEEEeeEeecHHHHHHHhhhhhcccc----c---ccc-- Confidence 1 1 123445544 566777765543 1 2334899999999999999985432100 0 000 Q ss_pred CCceeEEeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCC Q lcl|Aclame:pro 221 GADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRR 300 (708) Q Consensus 221 ~~~~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~ 300 (708) ..+.+ .+..+.+.++.++. +.|+ +..-|..+....+. T Consensus 207 ~~~~~-----------~v~~~v~~~~~~~~-------------------------------~~~~-~~~~~~~~~~~~~~ 243 (535) T protein:vir:33 207 MDEMV-----------DVYTHVYLDEESGD-------------------------------YLKY-EEVEDVEIDGSDAT 243 (535) T ss_pred cccCC-----------eEEEEEEeeCCCCc-------------------------------EEEE-EEEeCccccccccc Confidence 00111 11111122211111 1122 12234444334466 Q ss_pred CCCCCcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCc Q lcl|Aclame:pro 301 IPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPA 380 (708) Q Consensus 301 ~p~~~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~ 380 (708) +|+..+||+++.... .+|..+|.|.+....+-.+.+|++....+.......+++++++.+.+....+. .+++ T Consensus 244 ~~~~~~P~i~~Rw~~--~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~~~g~~~~~~~------~~~~ 315 (535) T protein:vir:33 244 YPTDAMPYIPVRMVR--IDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQPRRL------TKAQ 315 (535) T ss_pred cccccCCceeeeeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccchhhc------ccCC Confidence 888999998765443 68889999999999999999999999999999999999999987665433221 1111 Q ss_pred eeeecccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcc-ccc-chhHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 381 FLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQ-MPS-NIAQETVNNLMNRADMASF 458 (708) Q Consensus 381 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G-~~~-n~sg~ai~~~q~q~~~~~~ 458 (708) .-.+.++ ..+.+. +.+....+. .+...+.++...+.|....=+ + +++ .++ ..|++-|..+.+.....+. T Consensus 316 ~g~~v~g--~~~~v~----~~~~~~~~~-~~~~~~~i~~~~~~I~~af~~-~-~~~~~~~~r~TAtEV~~r~~E~~~~LG 386 (535) T protein:vir:33 316 TGDFVPG--RREDID----FLQLEKQAD-FTVAKAVSDQIEARLSYAFML-N-SAVQRTGERVTAEEIRYVASELEDTLG 386 (535) T ss_pred ceeeecC--Ccccce----eeecccccc-hhHHHHHHHHHHHHHHHHHhh-h-hcccCCCccccHHHHHHHHHHHHHHHh Confidence 1111111 111111 111222222 233455666667777665422 2 333 223 3688889999999888998 Q ss_pred HHHHHHH-HHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccch Q lcl|Aclame:pro 459 IYLDNMA-KSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYT 537 (708) Q Consensus 459 ~~~dn~~-~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~ 537 (708) ..+.+|. .+..-+.+..+.++.+.. . +. +... ..+.+.+..+.. . T Consensus 387 ~v~~rl~~Ell~Pli~r~~~il~r~g-------------~-----lP-----~~p~----------~~v~~~yis~La-~ 432 (535) T protein:vir:33 387 GVYSILSQELQLPLVRVLLKQLQATS-------------Q-----IP-----ELPK----------EAVEPTISTGLE-A 432 (535) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcC-------------C-----CC-----CCCc----------cceeEEEecHHH-H Confidence 8888877 566666666666654411 0 00 0000 113445544443 4 Q ss_pred hHHHHHHHHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhhh--hcccCcchHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 538 ARRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLIS--GIAKPRNEKEQQIVQQAQMAAQSQ 615 (708) Q Consensus 538 ~~r~~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~--~~~~~~~~~~~q~~~~~qq~qq~~ 615 (708) ..|.+..+.|+++++.+....|.. .+ ... +.+++...+....... ....+ +++.++.++++++++++ T Consensus 433 aqr~~~~~~l~~~~~~la~~~P~~---~d---~~i---d~d~~~~~~a~~~Gvp~~~i~~~--~ee~~~~~~q~~~~~~~ 501 (535) T protein:vir:33 433 IGRGQDLDKLERCISAWAALAPMQ---GD---PDI---NLAVIKLRIANAIGIDTSGILLT--DEQKQALMMQDAAQTGV 501 (535) T ss_pred HHHHHHHHHHHHHHHHHHhhChhh---hh---ccC---CHHHHHHHHHHHcCCCHhHhcCC--HHHHHHHHHHHHHHHHH Confidence 567778888888888766544421 11 112 3344444444332221 12222 12211111111111100 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 616 PNPEMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVMEAIRLLKDV 687 (708) Q Consensus 616 ~~~~~~~aq~~~~~~qae~~k~~~~~~~~q~e~~~~~~~~~~~~a~~~~~~~q~~~~~~~~~~~~~~~~~~~ 687 (708) + ++..+..+.+ +..++...+..++.+ .++.+ .+. T Consensus 502 ~--~~~~~~g~~~----------~~~~~~~~~~~~~~~-------------------------~~~g~-~~~ 535 (535) T protein:vir:33 502 E--NAAAAGGAGV----------GALATSSPEAMQGAA-------------------------AKAGL-NAT 535 (535) T ss_pred H--HHHHhhhhhh----------cchhhcCChhHHHHH-------------------------HhccC-CCC Confidence 0 0000000000 000000000000000 00000 000 No 37 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=99.79 E-value=3e-18 Score=116.66 Aligned_cols=475 Identities=11% Similarity=0.035 Sum_probs=223.9 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEY 80 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~g~~ 80 (708) |. ..+..+..-...+......+...+.... ++-..||.|+|..-......... ....-.++.|..+.+|+..+|+. T Consensus 31 ~~-~~e~~~~~~~~~i~~~i~~~~~~~~~r~-~~l~~YY~g~~~i~~~~~~~~~~--~~~~~ki~~n~~k~Ivd~~~~yl 106 (512) T protein:vir:97 31 YD-GTESDLLQNINEVSKYIEHHMDYQRPRL-KVLSDYYEGKTKNLVELTRRKEE--YMADNRVAHDYASYISDFINGYF 106 (512) T ss_pred cC-chhhhhhhhHHHHHHHHHHHHHhhHHHH-HHHHHHhcccCccccccCccccc--ccCcceeecchHHHHHHHHhhhh Confidence 43 2222222222333333333322222221 22245899987521111111110 11123467899999999999999 Q ss_pred hcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCcceeeEEe Q lcl|Aclame:pro 81 RNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAIEPI 160 (708) Q Consensus 81 ~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~i~~v 160 (708) ..+.+.+.+ . |.+ ..+.+..+++.|+++.....+..+++++|.+|..+..+. ++.+++..+ T Consensus 107 ~g~p~~~~~--~----d~~----~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~de---------d~~~~i~~~ 167 (512) T protein:vir:97 107 LGNPIQCQD--D----DKD----VLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQ---------DDETRLYKS 167 (512) T ss_pred cccCceecc--C----ChH----HHHHHHHHHhhcCHHHHHHHHHHHHHhcCeEEEEEEeCC---------CCceEEEEE Confidence 998888753 1 222 245677778889999999999999999999998876431 234555443 Q ss_pred ecchhhee--cCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeeeecceEE Q lcl|Aclame:pro 161 YDPSRSVW--FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVRKESV 238 (708) Q Consensus 161 ~~~~~~v~--~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~~~~~ 238 (708) ++.++| ||+... .-...+++.|... ..+....+.+...++|..... T Consensus 168 --~p~~~~~iyd~~~~-----~~~~~~vr~~~~~-----------------------~~~~~~~~~~~~~~vyt~~~i-- 215 (512) T protein:vir:97 168 --DAMSTFVIYDNTIE-----RNSIAGVRYLRTK-----------------------PIDKTDEDEVFTVDLFTSHGV-- 215 (512) T ss_pred --cccceEEEEcCCCC-----CceEEEEEEEEee-----------------------eccccccceEEEEEEEeCCcE-- Confidence 334443 665332 1112333333110 000011223344455544432 Q ss_pred EEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEeeecc Q lcl|Aclame:pro 239 DVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRWFI 318 (708) Q Consensus 239 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~~~~~ 318 (708) +.|.....+. .. .......+.+.|++.+|+++|. T Consensus 216 --~~~~~~~~~~-~~-------------------------------------~~~~~~~~~~~~~g~vPvv~~~------ 249 (512) T protein:vir:97 216 --YRYLTSRTNG-LK-------------------------------------LTPRENGFESHSFERMPITEFS------ 249 (512) T ss_pred --EEEEecCCCc-cc-------------------------------------ccccccccccccCcccceEeec------ Confidence 1121110000 00 0001123455667777777643 Q ss_pred CCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeecccccc---ccccc Q lcl|Aclame:pro 319 DDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDK---SGNII 395 (708) Q Consensus 319 d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~ 395 (708) +...+.|.+..+++.++.+|...|.+.+.+...+.+.+++-.....+...... ......+........ .+... T Consensus 250 -nn~~~~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~ 325 (512) T protein:vir:97 250 -NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRK---QKEANVLFLEPTVYENRDTGIET 325 (512) T ss_pred -CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCchhhhh---hhhcccccccccchhhcccccCC Confidence 12346799999999999999999999999988887776653211111111000 011111111110000 00000 Q ss_pred ccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHccc-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 396 AGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM-PSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAGEV 474 (708) Q Consensus 396 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~-~~n~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~~~~ 474 (708) .+.....+...+.-..++...+..+...|..+|++.+.+.|. .+|.||.|+...............+.|..+++++.++ T Consensus 326 ~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~gn~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~l 405 (512) T protein:vir:97 326 EGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKL 405 (512) T ss_pred CCCcceEEEeecCCHHHHHHHHHHHHHHHHHHhCCcccCcccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111122333333234566677888889999999998877664 4678999999888877777777888888888888888 Q ss_pred HHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHHHhc Q lcl|Aclame:pro 475 WLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSM 554 (708) Q Consensus 475 ~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq~~ 554 (708) ++.++...-... ....+ -+|.+.=.+..+....+..+.+..+...+ T Consensus 406 i~~~~~~~~~~~---------~~~d~-------------------------~~i~~~f~~~~p~~~~e~~~~~~kl~gii 451 (512) T protein:vir:97 406 LETILKNTRSID---------ANKDF-------------------------NTVRYVYNRNLPKSLIEELKAYIDSGGKI 451 (512) T ss_pred HHHHHHhcCCcc---------ccccc-------------------------ccceEEeCCCCCcCHHHHHHHHHHHhccC Confidence 777654322100 00000 12233334445554445555555542111 Q ss_pred cccCchhHHHHHHHHhhccc-hhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 555 LPTDPMRPAIQGIILDNIDG-EGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAE 633 (708) Q Consensus 555 ~~~~p~~~~~~~~~~~~~d~-~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qae 633 (708) + ..++++++.+ ...++-.+++..... ....+.+......+. ..+ T Consensus 452 ----S-----~et~~~~l~~v~d~~~E~eri~~E~~-------------~~~~~~~~~~~~~~~-------------~~~ 496 (512) T protein:vir:97 452 ----S-----QTTLMSLFSFFQDPELEVKKIEEDEK-------------ESIKKAQKGIYKDPR-------------DIN 496 (512) T ss_pred ----c-----hHHHHHhCCCCCCHHHHHHHHHHHHH-------------HHHHHHhhcccCCCC-------------CCC Confidence 1 1223333322 111122222221100 000000000000000 000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 634 AQKATNETAQTQIKAFTAQQDAMESQAN 661 (708) Q Consensus 634 ~~k~~~~~~~~q~e~~~~~~~~~~~~a~ 661 (708) .....+ +.+-...+ .+ T Consensus 497 -~~~~~~--~~~~~~~~---------~~ 512 (512) T protein:vir:97 497 -DDEQDD--DTKDTVDK---------KE 512 (512) T ss_pred -CCCCCC--Cccccccc---------cC Confidence 000000 00000000 00 No 38 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=99.78 E-value=3.7e-17 Score=110.66 Aligned_cols=447 Identities=9% Similarity=-0.012 Sum_probs=212.3 Q ss_pred CCcc-----------------hHHH-HHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCC Q lcl|Aclame:pro 1 MAET-----------------LEKK-HERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYP 62 (708) Q Consensus 1 ma~~-----------------~~~~-~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp 62 (708) |+|. .+.+ .+.+.+.+........+ +. ++-..||.|+| + +....+.. ....- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~---~~--~~l~~Yy~g~~-~--i~~~~~~~--~~~~~ 70 (470) T protein:vir:99 1 MKDINYGRDKVTGNSSFIFPKGEKLTSNELLGFIAYNETVLKP---RY--RENMKLYLGKH-K--ILTAPEKE--TGADN 70 (470) T ss_pred CccccCCcccccCCceEEeCCCCCcCHHHHHHHHHHHHHhhHH---HH--HHHHHHhcccc-c--cccCcccc--cCCcc Confidence 3321 1112 23333333332222222 22 22245899976 1 11111110 11112 Q ss_pred ceeecchHHHHHHHHHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeec Q lcl|Aclame:pro 63 KFEINKVATELNRIIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSML 142 (708) Q Consensus 63 ~~~~N~i~~~i~~i~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~ 142 (708) .++.|..+.+|+..+|+...+.+.+.+. +|.+..+ .+..+++.|+++.....++.+++++|.+|..+..+. T Consensus 71 ki~~n~~~~Ivd~~~~~l~g~p~~~~~~-----~d~~~~~----~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~ 141 (470) T protein:vir:99 71 RIVVNSAKYVVDVYNGYFCGIEPKLALL-----NDSSKID----EIARWNRQENFFDTINEISKQCDIFGRSIASIYQGE 141 (470) T ss_pred eeecchHHHHHHHHhhhhccCCeeEeeC-----CchhHHH----HHHHHHHhcCHhHHHHHHHHHHHhcCeeEEEEEeCC Confidence 4678999999999999999998776641 1222222 344566789999999999999999999988775431 Q ss_pred cccCCCCCCCcceeeEEeecchhhe--ecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCC Q lcl|Aclame:pro 143 VNEYDPMDDRQRIAIEPIYDPSRSV--WFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWF 220 (708) Q Consensus 143 ~~~~d~~~~~~~i~i~~v~~~~~~v--~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~ 220 (708) .+.+++..+ ++..+ +||+.... . ..++++.|...+ T Consensus 142 ---------dg~~~i~~~--~p~~~~~i~d~~~~~----~-~~~~vr~~~~~~--------------------------- 178 (470) T protein:vir:99 142 ---------DARPHLMYS--SPNHAFIIYDDTVQR----Q-PLAFVHYQIDNS--------------------------- 178 (470) T ss_pred ---------CCeEEEEEE--ccceeEEEEcCCCCc----c-eEEEEEEEEEec--------------------------- Confidence 223444432 23343 34442211 0 111222221100 Q ss_pred CCceeEEeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCC Q lcl|Aclame:pro 221 GADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRR 300 (708) Q Consensus 221 ~~~~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~ 300 (708) +........+|...+ .+.+.... .+......+..+ T Consensus 179 ~~~~~~~~~~~~~~~---------------~~~~~~~~------------------------------~~~~~~~~~~~~ 213 (470) T protein:vir:99 179 NNWTDAYGVIQYADK---------------FYKFKGYD------------------------------IEEDTNAAGYAI 213 (470) T ss_pred CCeeEEEEEEEecCe---------------EEEEEecc------------------------------cccccccccccc Confidence 000011111111110 00010000 000111222344 Q ss_pred CCCCCcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchH--HHHHhhcccC Q lcl|Aclame:pro 301 IPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLE--KHWEARNKKR 378 (708) Q Consensus 301 ~p~~~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~--~~~~~~~~~~ 378 (708) .|++.+|+++|.- ...|.|.+..+++.++.+|..+|.+...+...+.+.+++......+-+ +... .... T Consensus 214 ~~~g~vPvv~~~n-------~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~g~~~~--~~~~ 284 (470) T protein:vir:99 214 NPYGLVPAVEFFE-------NEERQGIFDSIKTLINALDKVISQKANQVEYFDNAYMYMIGFKLPEDDEGNPKF--DFKN 284 (470) T ss_pred cCCCccceEeecC-------CCCCCcchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccccccchhh--hhhh Confidence 5666677766431 234678999999999999999999999999888887776432222111 0000 0111 Q ss_pred CceeeecccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHccc-ccchhHHHHHHHHHHHHHHH Q lcl|Aclame:pro 379 PAFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM-PSNIAQETVNNLMNRADMAS 457 (708) Q Consensus 379 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~-~~n~sg~ai~~~q~q~~~~~ 457 (708) ...+.+...... ....+.+...+.-...+...+..+...|-..||+.+.+.+. .+|.||.|+........... T Consensus 285 ~~~~~~~~~~~~------~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~ 358 (470) T protein:vir:99 285 NRVLYVSQLDPD------TNPQIGFIAKPDADQMQENLIQHLTDFIFMMAMVPNIQDKNFAGNSSGVALQYKLFAMKNKA 358 (470) T ss_pred cceeeecCCCCC------CCCcceEEeecCChHHHHHHHHHHHHHHHHHhCCccccccccccCchHHHHHHHHHHHHHHH Confidence 111111111100 01123334433334556667888899999999998766664 46789999998888777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccch Q lcl|Aclame:pro 458 FIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYT 537 (708) Q Consensus 458 ~~~~dn~~~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~ 537 (708) ...-+.|..+++++.++++.++....... .. -.+|.|.=.+..+ T Consensus 359 ~~~~~~~~~~l~~~~~li~~~~~~~~~~~-----------~~-------------------------~~~i~v~f~~~~p 402 (470) T protein:vir:99 359 DSKERKFDKSLMQLYRIVLATLFNNKQDQ-----------EL-------------------------WSELDFKFTRNLP 402 (470) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCcc-----------cc-------------------------cccceEEeCCCCC Confidence 77888888888888887777654322110 00 0123333344445 Q ss_pred hHHHHHHHHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 538 ARRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPN 617 (708) Q Consensus 538 ~~r~~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~ 617 (708) ....+..+.+..+...+ + ...+++++.+-..+.-.+++..... ...+. + .+.... T Consensus 403 ~~~~e~a~~~~kl~gii----s-----~et~l~~l~~vd~~~E~eri~~E~~--------------~~~~~-~-~~~~~~ 457 (470) T protein:vir:99 403 EDMASAIDNAKNAEGIV----S-----KKTQLGMIPDIEPDAEMKQIAKEKA--------------DAIKQ-T-QQLSMP 457 (470) T ss_pred cCHHHHHHHHHHHhccC----C-----HHHHHHhCCCCCHHHHHHHHHHHHH--------------HHHHH-H-HhhcCC Confidence 44445555555542211 1 1223333333222222222221100 00000 0 000000 Q ss_pred HHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 618 PEMVLAQAQMVAAQAEAQ 635 (708) Q Consensus 618 ~~~~~aq~~~~~~qae~~ 635 (708) ... .. .....+-+ T Consensus 458 ~d~--~~---~d~~~ee~ 470 (470) T protein:vir:99 458 IDI--LK---RDNNAEEE 470 (470) T ss_pred CCc--CC---CCCCccCC Confidence 000 00 00000000 No 39 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=99.78 E-value=2.5e-17 Score=111.57 Aligned_cols=466 Identities=12% Similarity=0.020 Sum_probs=223.5 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHH-HHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCC--ceeecchHHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEV-REKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYP--KFEINKVATELNRII 77 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~-r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp--~~~~N~i~~~i~~i~ 77 (708) -.+..++....-.+.+.+..+.+... ..+.. +-..||.|++..-......+. .++| .++.|..+.+|+..+ T Consensus 29 ~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~--~l~~yY~g~~~~i~~~~~~~~----~~~~~~ki~~n~~k~Ivd~~~ 102 (501) T protein:vir:27 29 RADNLEELMVNNWELLKNFINHHKLRQAPRIQ--ELLDYARGENHDVLQFGRRKD----REMADKRAVHNYGRMISKFKT 102 (501) T ss_pred ccccccccccccHHHHHHHHHHHHHHHHHHHH--HHHHHhcCCCccccccCccCc----cccccceeccchHHHHHHHHh Confidence 11111111111111222333332222 12222 224689998753221111111 1222 467899999999999 Q ss_pred HHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCcceee Q lcl|Aclame:pro 78 AEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAI 157 (708) Q Consensus 78 g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~i 157 (708) |+...+.+.+.+...+ ..+.+...+..++..|+++.....+..+++++|.+|..+..+. ++.+++ T Consensus 103 ~yl~g~p~~~~~~d~~------~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~de---------d~~~~i 167 (501) T protein:vir:27 103 GYLAGNPIRVEYDDND------NNSQNDDTIKRIGRINDIDSHNRTLIRDLSQTGRAYEVIYRNE---------YDETRI 167 (501) T ss_pred hhhcccCeeEecCCcc------chHHHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEEeCC---------CCceEE Confidence 9999999888753321 2344556677778899999999999999999999998876432 234555 Q ss_pred EEeecchhhee--cCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeeeecc Q lcl|Aclame:pro 158 EPIYDPSRSVW--FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVRK 235 (708) Q Consensus 158 ~~v~~~~~~v~--~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~~ 235 (708) ..+ ++.++| ||+.... +. ..+++.|.. ....+.+...++|...+ T Consensus 168 ~~~--~p~~~~~v~d~~~~~----~~-~~~ir~~~~---------------------------~~~~~~~~~~~vyt~~~ 213 (501) T protein:vir:27 168 KRL--NPLETFVIYDNSLED----NS-IAAVRYYNR---------------------------GTLQNAKDVVEIYTNEH 213 (501) T ss_pred EEE--ccceeEEEecCCCCC----ce-EEEEEEEEe---------------------------eecCCcEEEEEEEeCCe Confidence 433 233333 4442210 11 111211110 00011122233333221 Q ss_pred eEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEee Q lcl|Aclame:pro 236 ESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKR 315 (708) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~~ 315 (708) + +.+...+...+.+..+.+++.+|+++|. T Consensus 214 v------------------------------------------------~~~~~~~~~~~~~~~~~~~g~vPvv~~~--- 242 (501) T protein:vir:27 214 I------------------------------------------------YTLDASDDFNEISVTTHAFGTVPITEFL--- 242 (501) T ss_pred E------------------------------------------------EEEEeCCceeeccccccCCCcccEEEec--- Confidence 1 1111111222223345566777777643 Q ss_pred eccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeeccccccccccc Q lcl|Aclame:pro 316 WFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKSGNII 395 (708) Q Consensus 316 ~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 395 (708) +...+.|.+..+++.++.+|...|.+.+.+...+.+.+++......+..+..... .....+.........|.. T Consensus 243 ----nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~- 315 (501) T protein:vir:27 243 ----NNVDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQASDM--KRTRLMQLKPPKSADGKE- 315 (501) T ss_pred ----CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCcccchhhh--hhcCceeecccccccCCC- Confidence 2234678999999999999999999999999888777765422221111111111 112222222222111111 Q ss_pred ccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHccc-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 396 AGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM-PSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAGEV 474 (708) Q Consensus 396 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~-~~n~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~~~~ 474 (708) ....+.+...+.-...+..++......|..+|++.+.+.+. .+|.||.|+...............+.|..+++++.++ T Consensus 316 -~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~l 394 (501) T protein:vir:27 316 -GTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNIPDMSDTNFSGNTSGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRL 394 (501) T ss_pred -CCcceeeeeccCCHHHHHHHHHHHHHHHHHHhCCcccCccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11123333333333566677888889999999998776664 4678999999887777777777778888888888888 Q ss_pred HHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHHHhc Q lcl|Aclame:pro 475 WLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSM 554 (708) Q Consensus 475 ~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq~~ 554 (708) ++.++...... ....+ .+|.|.=.+..+....+..+.++.+...+ T Consensus 395 i~~~~~~~~~~----------~~~d~-------------------------~~i~v~f~~~~p~n~~e~ad~~~kl~g~i 439 (501) T protein:vir:27 395 AARIGSLVNEF----------KDFDE-------------------------SLLKITFTPNLPKSLNEQVSILTGLGGQV 439 (501) T ss_pred HHHHHhhcccc----------ccccc-------------------------ccceEEeCCCCCcCHHHHHHHHHHHhccC Confidence 77765432210 00000 12333334455554555555555542111 Q ss_pred cccCchhHHHHHHHHhhccc-hhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 555 LPTDPMRPAIQGIILDNIDG-EGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAE 633 (708) Q Consensus 555 ~~~~p~~~~~~~~~~~~~d~-~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qae 633 (708) ...++++++.+ ...++-.+++++...... ...++..-... .....-+..... . T Consensus 440 ---------S~et~l~~l~~v~D~~~E~eri~~E~~e~~----------~~~~~~~~~~~----~~~~~d~~~~~~--~- 493 (501) T protein:vir:27 440 ---------SQETALSLSGLVESPNEELDKINKEVSEID----------FKGYSNDFNEH----VGKYTDEVKETH--T- 493 (501) T ss_pred ---------cHHHHHHhCCCCCCHHHHHHHHHHHHHhhh----------HhhhcCccccc----cccccCCCCCCc--c- Confidence 11233333322 122223334432211000 00000000000 000000000000 0 Q ss_pred HHHHHHHHHHHHHH Q lcl|Aclame:pro 634 AQKATNETAQTQIK 647 (708) Q Consensus 634 ~~k~~~~~~~~q~e 647 (708) +..+.+.| T Consensus 494 ------d~~e~~~~ 501 (501) T protein:vir:27 494 ------DDFERAYE 501 (501) T ss_pred ------ccccccCC Confidence 00000000 No 40 >protein:vir:1785 Length: 555 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570351;genbank:gi:18640510;genbank:GeneID:932723 Probab=99.77 E-value=7.1e-17 Score=109.11 Aligned_cols=531 Identities=12% Similarity=0.059 Sum_probs=241.7 Q ss_pred hHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCC-CCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHHHHHhc- Q lcl|Aclame:pro 5 LEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGG-QWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEYRN- 82 (708) Q Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~-Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~g~~~~- 82 (708) +++. +..+|+...+....|...|+++.+|. .... ..+..... . ..+.+.-+.-...++.+.+.... T Consensus 1 m~~~---~~~r~~~l~~~R~~~e~~w~e~~~y~-lP~~~~~~~~~~~---~-----~~~~~~dst~~~a~~~Laa~l~~~ 68 (555) T protein:vir:17 1 MKHS---AQAKYMMLRADREDYLDSGRQSARLT-LPYILTDEGHVQG---G-----YLPTPWQSVGSKGVNVLASKLMLS 68 (555) T ss_pred ChhH---HHHHHHHHHHHhhHHHHHHHHHHHHh-cccccCCCCCccc---c-----cccccccccHHHHHHHHHHHHHHh Confidence 3333 33445555555566667777666542 1110 00110000 0 01223344455555555443333 Q ss_pred ----CcceeEEecCCCc------chHH---HHHH---HHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccC Q lcl|Aclame:pro 83 ----NRITVKFRPGDRE------ASEE---LANK---LNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEY 146 (708) Q Consensus 83 ----nr~~~~v~pr~~~------~d~~---~A~~---l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~ 146 (708) +++=+++.+.+.. .... +.+. ++.++......|++..+...++.+.+..|.|++.+. . T Consensus 69 ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~------~ 142 (555) T protein:vir:17 69 LFPVNTSFFKLQINDAEIDNLGMDEQARSEIDLSLSRIERIVTQDIAESSDRVHLEMAMKHLIVTGNALLYQG------K 142 (555) T ss_pred hcCCCCcccccccCHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEec------C Confidence 4444555554321 0111 2222 344555666789999999999999999999986441 1 Q ss_pred CCCCCCcceeeEEeecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeE Q lcl|Aclame:pro 147 DPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIY 226 (708) Q Consensus 147 d~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~ 226 (708) ++ .++. |+.++++..+.. -...-++++..|+..++.+.|++...............+ ....+. T Consensus 143 ~~------~~~~----pl~~y~v~~d~~----G~vd~v~rk~~~t~~ql~~~fg~~~l~~~~~~~~~~~~d---~~~~~~ 205 (555) T protein:vir:17 143 KN------LKLY----PLDRFVVSRDGE----GNVMEIVTEEQIDRSLLPEEFQKVGGLEGAPDSNAVGED---GPKMGV 205 (555) T ss_pred Cc------eeEE----EcCeEEEeeCCC----cCeeEEEeeeeecHHHHHHHhhhccccchhhhhhhcccc---chhhhh Confidence 21 2222 455566544332 133448999999999999999864321000000000000 000000 Q ss_pred Ee-eee-eecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeee-cCCCCCC Q lcl|Aclame:pro 227 IA-KYY-EVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLE-KPRRIPG 303 (708) Q Consensus 227 v~-e~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~-~~~~~p~ 303 (708) .. ..+ .+....+.++.+.+... .+++|+.-....++. .-..+|+ T Consensus 206 ~~~~~~~~~~~~~~~v~t~~~~~~---------------------------------~~~~~~~e~~~~~v~~~l~e~g~ 252 (555) T protein:vir:17 206 TAPGGRDKGKSNDALVYTYVCRKD---------------------------------GQVKWHQECDGKVIPGSNSSAPY 252 (555) T ss_pred hhhcccccCCCcceeEeecccccC---------------------------------CeeEEEEecCceeccccccccCc Confidence 00 000 00001111111111000 123333333333332 2356888 Q ss_pred CCcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceee Q lcl|Aclame:pro 304 EHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLP 383 (708) Q Consensus 304 ~~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~ 383 (708) .++||+++.... .+|..+|.|.+....+-.+.+|++....+..+....+++++++.+.+....+ ..+++.-. T Consensus 253 ~e~P~i~~Rw~~--~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~lv~~~g~~~~~~------l~~~~~g~ 324 (555) T protein:vir:17 253 THNPWIPLRFNI--VDGEAYGRGRVEEFMGDLKSLEALSQAMVEGSAASAKVVFMVSPSATTKPQN------LALAANGA 324 (555) T ss_pred ccCCeeeeeeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccCcce------eecCCCce Confidence 899998765443 6888999999999999999999999999999999999999998776543221 11222111 Q ss_pred ecccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccchhHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 384 LREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMASFIYLDN 463 (708) Q Consensus 384 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n~sg~ai~~~q~q~~~~~~~~~dn 463 (708) +.. |. ++...+.....+.--+...+.++...+.|.++..+.. .......|++-|..+.+.....+...+.+ T Consensus 325 v~~-----g~--~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~aFm~~~--~~d~~r~TAtEV~~r~~E~~~~LGpv~~r 395 (555) T protein:vir:17 325 IIQ-----GR--PDDVSVVQANKAADFRTVLEMIQKLEQRISDAFLMLQ--VRQSERTTATEVQATVQELNEQIGGIYSN 395 (555) T ss_pred eec-----CC--cccceeeeccccchhhHHHHHHHHHHHHHHHHHhhcC--CCCcccchHHHHHHHHHHHHHHHhHHHHH Confidence 111 11 1111111111111122334555555666655543321 11123358888999999888899888888 Q ss_pred HH-HHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHH Q lcl|Aclame:pro 464 MA-KSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDA 542 (708) Q Consensus 464 ~~-~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~ 542 (708) |. .+..-+.+..+.++.+.--=+ .+ +.+. ...+++ ++. .+..|++ T Consensus 396 l~~E~L~Pli~R~~~il~r~g~lP---------------~~--------p~~~--------v~~~i~--~~l-~~l~r~~ 441 (555) T protein:vir:17 396 LTTELLQPYLARKLHLLQKQRKLP---------------QL--------PKDL--------VQPTVV--AGL-WGVGRGQ 441 (555) T ss_pred HHHHHHHHHHHHHHHHHHhCCCCC---------------CC--------CHhh--------hcccee--ehH-HHHHHHH Confidence 86 566666666666665532100 00 0000 012333 332 3455778 Q ss_pred HHHHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhh--hhcccCcchHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 543 TVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLI--SGIAKPRNEKEQQIVQQAQMAAQSQPNPEM 620 (708) Q Consensus 543 ~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~--~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~ 620 (708) ..+.++++++++++..+. +.++... +.++++..+-..... ..... .+++.++..++++++++++ +. T Consensus 442 ~~~~l~~~~~~laq~~~~-----p~~~d~i---d~d~~~~~~a~~~Gv~p~~ivr--s~eev~~~rq~~~~~~~q~--~~ 509 (555) T protein:vir:17 442 DKQQLMEFITTLAQTMGP-----EIAMKYI---NPTEFIKRLAAAQGIDTLQLIN--SPETMKQLGDQQKQDMVQA--SL 509 (555) T ss_pred HHHHHHHHHHHHHhhcCc-----hhHhhcC---CHHHHHHHHHHHcCCChhhhcC--CHHHHHHHHHHHHHHHHHH--HH Confidence 888888888776554321 1223333 334454444433221 11222 1222221111111111110 00 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhcCC Q lcl|Aclame:pro 621 VLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVMEAIRLLKDVAESQQQQFQSPP 699 (708) Q Consensus 621 ~~aq~~~~~~qae~~k~~~~~~~~q~e~~~~~~~~~~~~a~~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 699 (708) +++.++++. .+.+++ ..+ +....++++.-....+. ....++..+++ T Consensus 510 ~~qa~~~~~------~~~~~~--------------~~~---------~~~~~~~~a~~~~~a~~----~~~~~~~~~~~ 555 (555) T protein:vir:17 510 INQAGQLAK------TPMAEQ--------------AMQ---------LIQQQQEGAQDAGAAES----ETSSAEAQAGA 555 (555) T ss_pred HHHHHHHHh------hhhhhh--------------HHh---------ccccchhhhhHHHHHHh----hcCCcccccCC Confidence 000000000 000000 000 00000011111111111 11112222222 No 41 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=99.76 E-value=2.8e-17 Score=111.36 Aligned_cols=466 Identities=12% Similarity=0.047 Sum_probs=221.7 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCC--ceeecchHHHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYP--KFEINKVATELNRIIA 78 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp--~~~~N~i~~~i~~i~g 78 (708) --+..+++...-.+.+....+.+...+.... .+-..||.|+++.-......+ ..+++ .++.|..+.+|+..+| T Consensus 30 ~~~~~~~~~~~~~~~i~~~i~~h~~~~~~rl-~~l~~yY~g~~~~i~~~~~~~----~~~~~~~ki~~n~~k~Ivd~~~~ 104 (502) T protein:vir:48 30 RADNLEELMVNNWELLKNFINHHKLRQAPRI-QELLDYARGENHDVLKSGRRK----DNEMADKRAVHNYGRMISKFKTG 104 (502) T ss_pred cccchhhhccccHHHHHHHHHHHHHHHHHHH-HHHHHHhcCCCcccccccccc----ccccccceeecchHHHHHHHHhh Confidence 0011121111111223333333322221111 222357889875322111111 11222 5778999999999999 Q ss_pred HHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCcceeeE Q lcl|Aclame:pro 79 EYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAIE 158 (708) Q Consensus 79 ~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~i~ 158 (708) +...+.+.+.+.-. ++ .+.+..++..++..|+++.....+..+++++|.||+.+..+. .+.+++. T Consensus 105 yl~g~p~~~~~~d~--~~----~~~~~~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~de---------dg~~~i~ 169 (502) T protein:vir:48 105 YLAGNPIRVEYDDN--ED----NSQNDDAIKRIGRINDIDTHNRNLIRDLSQTGRAYEVIYRSE---------YDETRIK 169 (502) T ss_pred hhcccCeeEecCCc--cc----hhHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCC---------CCceEEE Confidence 99999998886321 12 234455566677889999999999999999999998875431 2344444 Q ss_pred Eeecchhhee--cCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeeeecce Q lcl|Aclame:pro 159 PIYDPSRSVW--FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVRKE 236 (708) Q Consensus 159 ~v~~~~~~v~--~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~~~ 236 (708) .+ ++..+| ||+... .+. .++++.|... ...+.....++|...+. T Consensus 170 ~~--~p~~~~~vydd~~~----~~~-~~~ir~~~~~---------------------------~~~~~~~~~~iyt~~~i 215 (502) T protein:vir:48 170 RL--SPLETFVIYDNSLE----DNS-IAAVRYYNRG---------------------------TLQNAKDVVEIYTNQHI 215 (502) T ss_pred EE--cccceEEEEcCCCC----Cce-EEEEEEEEEe---------------------------ecCCcEEEEEEEeCCeE Confidence 33 233332 443211 011 1122211100 00111222333332211 Q ss_pred EEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEeee Q lcl|Aclame:pro 237 SVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRW 316 (708) Q Consensus 237 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~~~ 316 (708) +.+...+...+....+.+++.+|+|+|. T Consensus 216 ------------------------------------------------~~~~~~~~~~~~~~~~~~~g~vPvv~~~---- 243 (502) T protein:vir:48 216 ------------------------------------------------YTLDASDSFNEISVTPHAFGTVPITEFL---- 243 (502) T ss_pred ------------------------------------------------EEEEeCCceeeccceecCCCccceEEec---- Confidence 1111111122233445566677776653 Q ss_pred ccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeecccccccccccc Q lcl|Aclame:pro 317 FIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKSGNIIA 396 (708) Q Consensus 317 ~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 396 (708) +...|.|.+..+++.++.+|...|.+.+.+...+.+.+++......+.... .........+.........|.. T Consensus 244 ---nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~-- 316 (502) T protein:vir:48 244 ---NNADGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPQGMQ--ASDMKRTRLMQLKPPKSADGKE-- 316 (502) T ss_pred ---CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCcccccccc--hhhhhhcceeeccccccccccc-- Confidence 123477899999999999999999999999988887766643221111110 1111112222222211111111 Q ss_pred cccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHccc-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 397 GATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM-PSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAGEVW 475 (708) Q Consensus 397 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~-~~n~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~~~~~ 475 (708) ....+.+...+.-..++...+......|...|++.+.+.+. .+|.||.|+...............+.|..+++++.+++ T Consensus 317 ~~~d~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li 396 (502) T protein:vir:48 317 GTVKAEYLTKSYDVSGAEAYKTRLNKDIHVFTNTPDMSDNHFSGNASGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLA 396 (502) T ss_pred cCcceeEeeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11123333333223566667888899999999998876665 46789999998887777777777778888888888877 Q ss_pred HHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHHHhcc Q lcl|Aclame:pro 476 LSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSML 555 (708) Q Consensus 476 l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq~~~ 555 (708) +.++..... ....++ .+|.|.=.+..+....+..+.+..+...+ T Consensus 397 ~~~~~~~~~----------~~~~d~-------------------------~~i~i~f~~~~p~d~~e~a~~~~kl~g~i- 440 (502) T protein:vir:48 397 ARIGSLVNE----------FKDFDE-------------------------SRLKITFTPNLPKSLYEQVSILNDLGGQV- 440 (502) T ss_pred HHHHhhccc----------cccccc-------------------------ccceEEeCCCCCcCHHHHHHHHHHHhccC- Confidence 776543211 000000 01222333444444445555555542111 Q ss_pred ccCchhHHHHHHHHhhccc-hhHHHHHHHHHhhhhhhhc-ccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 556 PTDPMRPAIQGIILDNIDG-EGLDDFKEYNRNQLLISGI-AKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAE 633 (708) Q Consensus 556 ~~~p~~~~~~~~~~~~~d~-~~~~ei~e~~~~~~~~~~~-~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qae 633 (708) + ..++++++.+ ...++-.+++......... ..+.. ....... . .-+......+ T Consensus 441 ---S-----~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~------------~~~~~~~--~---~d~~~e~~~~ 495 (502) T protein:vir:48 441 ---S-----QETALSLSGLVENPTEELDKINEESSKIDFKGYPSY------------FYDNVGK--Y---TDEVKETHTD 495 (502) T ss_pred ---c-----HHHHHHhCCCCCCHHHHHHHHHHHHHhhhhhccccc------------ccccccc--c---CCCccCCCCc Confidence 1 1233444332 2222223333322110000 00000 0000000 0 0000000000 Q ss_pred HHHHHHHHHH Q lcl|Aclame:pro 634 AQKATNETAQ 643 (708) Q Consensus 634 ~~k~~~~~~~ 643 (708) ...+.-. T Consensus 496 ---~~~~~~~ 502 (502) T protein:vir:48 496 ---DFERVYE 502 (502) T ss_pred ---CcCCCCC Confidence 0000000 No 42 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=99.76 E-value=1.4e-16 Score=107.43 Aligned_cols=437 Identities=10% Similarity=0.040 Sum_probs=212.0 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCC--ceeecchHHHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYP--KFEINKVATELNRIIA 78 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp--~~~~N~i~~~i~~i~g 78 (708) |. +.+.+..+.+..|-.........+ ++-..||.|+|- ++.. .....++| .++.|..+.+|+..+| T Consensus 11 ~~-~~~~~~~~~i~~~i~~~~~~~~r~-----~~~~~yy~g~~~---i~~~---~~~~~~~~~~ki~~n~~~~ivd~~~~ 78 (453) T protein:vir:73 11 YS-RDEEITDKVVNDFMKKHQEEVERY-----EYLGNMYKGIME---ISSQ---KAKDSWKPDNRLTNNFAKYIVDTFVG 78 (453) T ss_pred cc-ccccCCHHHHHHHHHHHHHHHHHH-----HHHHHHhccccc---hhcC---CCCCccCccceeecchHHHHHHHhhh Confidence 44 233343444333333222222211 122358999873 1111 11112222 4678999999999999 Q ss_pred HHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCcceeeE Q lcl|Aclame:pro 79 EYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAIE 158 (708) Q Consensus 79 ~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~i~ 158 (708) +...+.+.+.+ . ++ + ..+.+..+++.|+++.....+..+++++|.||..+..+. .+.+++. T Consensus 79 ~l~g~~~~~~~--~---d~-~----~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~---------~~~~~i~ 139 (453) T protein:vir:73 79 YFNGIPIKKTH--D---DK-S----VLEAMQLFDNLNDMEDEESELAKIACVYGRAYELMYQNE---------STESEVI 139 (453) T ss_pred hhcccCceeec--C---Ch-H----HHHHHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEeCC---------CCceEEE Confidence 99988877642 2 11 1 234577777889999999999999999999998886432 1234444 Q ss_pred Eeecchhhe--ecCCccccCChhccCeEEEee-cCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeeeecc Q lcl|Aclame:pro 159 PIYDPSRSV--WFDPDAKKYDKSDALWAFCMY-SLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVRK 235 (708) Q Consensus 159 ~v~~~~~~v--~~Dp~a~~~D~sDa~~~~~~~-~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~~ 235 (708) .+ .| ..+ +||+... + ..++..+ +.+. +.....+.|...+ T Consensus 140 ~~-~p-~~~~~v~dd~~~-----~-~~~~~i~~~~~~------------------------------~~~~~~~vyt~~~ 181 (453) T protein:vir:73 140 YC-SP-LNVFMVYDDSIK-----Q-KPLFAVYYGFDE------------------------------EGNLSGTVYTLLE 181 (453) T ss_pred EE-cc-cceEEEEeCCCC-----c-eeEEEEEEEEec------------------------------CceEEEEEEeCCe Confidence 32 22 233 3443221 1 1122222 1110 0011123332221 Q ss_pred eEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEee Q lcl|Aclame:pro 236 ESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKR 315 (708) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~~ 315 (708) . +.|.. -++...+.++.+.+++.+|+|+|.- T Consensus 182 i----~~~~~-------------------------------------------~~~~~~~~~~~~~~~g~vPvv~~~n-- 212 (453) T protein:vir:73 182 T----ISITG-------------------------------------------KAGEVKFGESTYNVYSDLPIVEYNF-- 212 (453) T ss_pred E----EEEEe-------------------------------------------cCCceEEccceeccCCceeEEEecC-- Confidence 0 11110 0011112233455666777776431 Q ss_pred eccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeeccccccccccc Q lcl|Aclame:pro 316 WFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKSGNII 395 (708) Q Consensus 316 ~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 395 (708) ...+.|.+..+++.++.+|+..|.+.+.+...+.+.+++--...++ +...... ...............+... T Consensus 213 -----~~~g~s~~~~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~~~--~~~~~~~-~~~~~~~~~~~~~~~~~~~ 284 (453) T protein:vir:73 213 -----NEERQSIFEPVHSLINSYNKVTSEKANDVEYFSDQYLVFLGAEVDE--EDAKNIK-DNRLINFFDKNSNGQGTNA 284 (453) T ss_pred -----CCCCCcchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCc--hhhhccc-ccccccccccccccccccc Confidence 2246788999999999999999999999988888776663211111 1111110 0001111111111111111 Q ss_pred ccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 396 AGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAGEVW 475 (708) Q Consensus 396 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~~~~~ 475 (708) . .....+...+.-...+...++.....|...|++.+.+.+..+|.||.|+.................|..+++++.+++ T Consensus 285 ~-~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li 363 (453) T protein:vir:73 285 A-KVDVKFLDKPDSDVQTENLLNRLERSIFQFTMAANISDENFGNSSGVALAYKLQAMSNLALSFQRKFQSALNRRYSLW 363 (453) T ss_pred c-CceeEEeeecCCHHHHHHHHHHHHHHHHHHhCCcccCcccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1 112233333323455666778888889999998777666667889999988877777777777777888888887777 Q ss_pred HHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHHHhcc Q lcl|Aclame:pro 476 LSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSML 555 (708) Q Consensus 476 l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq~~~ 555 (708) +.+... .|. ...+ -+|.|.=.+..+....+..+.++.+...+ T Consensus 364 ~~~~~~----------~~~--~~~~-------------------------~~i~v~f~~~~p~~~~~~a~~~~k~~gii- 405 (453) T protein:vir:73 364 SSLSTN----------ASN--KDAW-------------------------KDIEYTFTRNEPKDIKEQAETANILKGIT- 405 (453) T ss_pred HHHHhc----------cCC--cccc-------------------------ccceEEeCCCCCCCHHHHHHHHHHHhccC- Confidence 664321 010 0000 12233334455554555555555543111 Q ss_pred ccCchhHHHHHHHHhhccc-hhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 556 PTDPMRPAIQGIILDNIDG-EGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEA 634 (708) Q Consensus 556 ~~~p~~~~~~~~~~~~~d~-~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qae~ 634 (708) + ..++++++.+ ...++-.+++++.. +++..+++. .-..+...+. T Consensus 406 ---s-----~et~~~~~~~~~d~~~E~~ri~~E~------------------~~~~~~~~~---------~~~~~~~~~~ 450 (453) T protein:vir:73 406 ---S-----EETALSVISVIPDVQAEMEKIKKKK------------------LLQLSLTRT---------SNLVRMKQMR 450 (453) T ss_pred ---c-----HHHHHHhCCCCCCHHHHHHHHHHHH------------------HHHHHHHHh---------ccCCcchhhh Confidence 1 1223333322 11111122221100 000000000 0000000000 Q ss_pred HHH Q lcl|Aclame:pro 635 QKA 637 (708) Q Consensus 635 ~k~ 637 (708) ... T Consensus 451 ~~~ 453 (453) T protein:vir:73 451 GNL 453 (453) T ss_pred cCC Confidence 000 No 43 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=99.76 E-value=5.4e-17 Score=109.79 Aligned_cols=472 Identities=11% Similarity=0.040 Sum_probs=222.1 Q ss_pred CC--cchH-HHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHH Q lcl|Aclame:pro 1 MA--ETLE-KKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRII 77 (708) Q Consensus 1 ma--~~~~-~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~ 77 (708) |- +... ...+++...+........+ +.. +-..||.|+|.--......... ......++.|..+.+++..+ T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~~~---r~~--~l~~Yy~g~~~i~~~~~~~~~~--~~~~~ki~~n~~k~Iv~~~~ 103 (511) T protein:vir:96 31 YDGTESDLLQNVNEVSKYIEHHMDYQRP---RLK--VLSDYYEGKTKNLVELTRRKEE--YMADNRVAHDYASYISDFIN 103 (511) T ss_pred cchhhhhhhccHHHHHHHHHHHHHhhHH---HHH--HHHHHhcccCccccccCcCccc--ccCcceeecchHHHHHHHHH Confidence 22 1111 0122222222222222222 222 2235888987522111111111 11223577899999999999 Q ss_pred HHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCcceee Q lcl|Aclame:pro 78 AEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAI 157 (708) Q Consensus 78 g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~i 157 (708) |+...+.+.+.+ . +.+ ....+..+++.|+++.....+..+++++|.+|..+..+. ++.+++ T Consensus 104 ~yl~g~p~~~~~-----~-~~~----~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~de---------d~~~~i 164 (511) T protein:vir:96 104 GYFLGNPIQYQD-----D-DKD----VLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQ---------DDETRL 164 (511) T ss_pred hhhccCCceeec-----C-chH----HHHHHHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEeCC---------CCceEE Confidence 999999988863 1 122 235677778889999999999999999999998876431 234555 Q ss_pred EEeecchhhee--cCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeeeecc Q lcl|Aclame:pro 158 EPIYDPSRSVW--FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVRK 235 (708) Q Consensus 158 ~~v~~~~~~v~--~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~~ 235 (708) ..+ ++..+| ||.... .-...+++.|... ..+....+.+...+.|...+ T Consensus 165 ~~~--~p~~~~~vydd~~~-----~~~~~~vr~~~~~-----------------------~~d~~~~~~~~~~~iyt~~~ 214 (511) T protein:vir:96 165 YKS--DAMSTFVIYDNTIE-----RNSIAGVRYLRTK-----------------------PIDKTDEDEVFTVDLFTSHG 214 (511) T ss_pred EEE--ccceeEEEEcCCCC-----CceEEEEEEEEee-----------------------eccccccceEEEEEEEeCCc Confidence 433 233443 443221 1112233333110 00001112233334444332 Q ss_pred eEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEee Q lcl|Aclame:pro 236 ESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKR 315 (708) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~~ 315 (708) . +.|.....+. .........+.+.|++.+|+++|. T Consensus 215 i----~~~~~~~~~~--------------------------------------~~~~~~~~~~~~~~~~~vPvv~~~--- 249 (511) T protein:vir:96 215 V----YRYLTSRTNG--------------------------------------LKLTPRENGFESHSFERMPITEFS--- 249 (511) T ss_pred E----EEEEecCCCc--------------------------------------ccccccccccccccCCceeeEEec--- Confidence 1 1111110000 000001123345666667776543 Q ss_pred eccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeeccccc--cccc Q lcl|Aclame:pro 316 WFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRD--KSGN 393 (708) Q Consensus 316 ~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~ 393 (708) . ...|.|.+..+++.++.+|...|.+.+.+...+.+.+++......+.... . ............... ..+. T Consensus 250 ---n-n~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~-~--~~~~~~~~~~~~~~~~~~~~~ 322 (511) T protein:vir:96 250 ---N-NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEV-R--KQKEANVLFLEPTVYADSEGR 322 (511) T ss_pred ---C-CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCccCCchhh-c--ccccccceecccccccccccc Confidence 1 23467999999999999999999999999888777666532121111111 0 011111121111111 1111 Q ss_pred ccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHccc-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 394 IIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM-PSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAG 472 (708) Q Consensus 394 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~-~~n~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~~ 472 (708) ...+.....+...+.-..++...+..+...|..+|++.+.+.+. .+|.||.|+.................|..+++++. T Consensus 323 ~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~ 402 (511) T protein:vir:96 323 ETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRA 402 (511) T ss_pred cCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11111123333333334666778888899999999988876664 46789999998888888888888888888888888 Q ss_pred HHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHHH Q lcl|Aclame:pro 473 EVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLS 552 (708) Q Consensus 473 ~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq 552 (708) ++++.++....... ...++ -+|.+.=.+..+....+..+.+..+.. T Consensus 403 ~li~~~~~~~~~~~---------~~~d~-------------------------~~i~~~f~~~~p~n~~e~~~~~~kl~G 448 (511) T protein:vir:96 403 KLLETILKNTWSID---------ANKDF-------------------------NTVRYVYNRNLPKSLIEELKAYIDSGG 448 (511) T ss_pred HHHHHHHHhhcCcc---------ccccc-------------------------ccceEEeCCCCCCCHHHHHHHHHHHhc Confidence 88877654322100 00000 123333344555545555555555421 Q ss_pred hccccCchhHHHHHHHHhhccc-hhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 553 SMLPTDPMRPAIQGIILDNIDG-EGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQ 631 (708) Q Consensus 553 ~~~~~~p~~~~~~~~~~~~~d~-~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~q 631 (708) .+ + ..++++++.+ ...++-.+++...... .....+......+.......+. .+ T Consensus 449 ~i----S-----~et~l~~l~~v~D~~~E~~ri~~E~~~-------------~~~~~~~~~~~~~~~~~~~~~~----~~ 502 (511) T protein:vir:96 449 KI----S-----QTTLMSLFSFFQDPELEVKKIEEDEKE-------------SIKKAQKGIYKDPRDINDDEQD----DD 502 (511) T ss_pred cC----C-----hHHHHHhCCCCCCHHHHHHHHHHHHHH-------------HHHHHhhccccCCCCCCCCCCC----Cc Confidence 11 1 1223333322 1222222333221100 0000000000000000000000 00 Q ss_pred HHHHHHHHHHHH Q lcl|Aclame:pro 632 AEAQKATNETAQ 643 (708) Q Consensus 632 ae~~k~~~~~~~ 643 (708) . +-..+..+ T Consensus 503 ~---~~~~~~~~ 511 (511) T protein:vir:96 503 T---KDTVDKKE 511 (511) T ss_pred c---cccccccC Confidence 0 00000000 No 44 >protein:vir:99672 Length: 532 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249587;genbank:gi:68299738;genbank:GeneID:3799987 Probab=99.75 E-value=4.8e-16 Score=104.56 Aligned_cols=506 Identities=12% Similarity=0.065 Sum_probs=243.8 Q ss_pred CCcchHH--HHHHHHHHHHHHHHhhHHHHHHHHHHHHHhh--cCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHH Q lcl|Aclame:pro 1 MAETLEK--KHERIMLRFDRAYSPQKEVREKCIEATRFAR--VPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRI 76 (708) Q Consensus 1 ma~~~~~--~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~--~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i 76 (708) ||+...+ ..+.+..+|+...+....|...|+++.+|.+ .-...+...... ...+.-..-...++.+ T Consensus 1 m~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~----------~~~~~dst~~~a~~~L 70 (532) T protein:vir:99 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTS----------YTTPWQSIGARGLNNL 70 (532) T ss_pred CcchhhccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhhcccCCCCCcchhh----------ccccccchHHHHHHHH Confidence 9974322 2466778888888888888888888876531 101112111100 0112333444555555 Q ss_pred HHHHhc-----CcceeEEecCCCc------ch---HHHHHHH---HHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEE Q lcl|Aclame:pro 77 IAEYRN-----NRITVKFRPGDRE------AS---EELANKL---NGLFRADYEETDGGEACDNAFDDAATGGFGCFRLT 139 (708) Q Consensus 77 ~g~~~~-----nr~~~~v~pr~~~------~d---~~~A~~l---~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~ 139 (708) .+.... +++=+++.+.+.. .+ .++.+.| +..+......|++..+...++.+.+..|.|+..+. T Consensus 71 Aa~L~~~ltpp~~~WF~l~~~d~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~ 150 (532) T protein:vir:99 71 ASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIP 150 (532) T ss_pred HHHHHHhhcCCCCccccccCCHHHHhccCCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEec Confidence 443333 3444444443211 00 1123222 34445556789999999999999999999987653 Q ss_pred eeccccCCCCCCCcceeeEEeecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCC Q lcl|Aclame:pro 140 SMLVNEYDPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNW 219 (708) Q Consensus 140 ~~~~~~~d~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~ 219 (708) .+.. .......+..+ |+.++++..+.. . ...-++++..++.+.+-+.++...... .+.+ T Consensus 151 ~~~~------~~~~~~~f~~~--pl~~y~v~~d~~---G-~v~~ivrr~~~~~~~l~e~~~~~~~~~---------~~~~ 209 (532) T protein:vir:99 151 STEQ------VEGQSNAPKLY--KLHNFVVERDAY---D-NVLQIVTEDKIARAALPEDVRKSLEDA---------QGDQ 209 (532) T ss_pred cccc------ccCcccceEEE--EcCeEEEeeCCC---C-CeeeEeeeeeecHHhcChHHHHHhhcc---------cccc Confidence 2211 11122233332 456676654332 1 122367777778777744443222110 0111 Q ss_pred CCCceeEEeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCC Q lcl|Aclame:pro 220 FGADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPR 299 (708) Q Consensus 220 ~~~~~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~ 299 (708) ...+.+ .++.+..+.... ..+.|+ ....|..+....+ T Consensus 210 ~p~~~v-------------~v~~~v~~~~~~-----------------------------~~~~~~-~~~~g~~~~~~~~ 246 (532) T protein:vir:99 210 NPSEEV-------------TIYTHVYRDPEA-----------------------------MVFRSY-QEIDGEIVAGTEG 246 (532) T ss_pred CCCcce-------------EEEEEEEecCCC-----------------------------CeeEEE-EeecCceeccccc Confidence 111111 222222111100 001122 1233444444456 Q ss_pred CCCCCCcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCC Q lcl|Aclame:pro 300 RIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRP 379 (708) Q Consensus 300 ~~p~~~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~ 379 (708) .+|+.++||+++.... .+|..+|.|.+....+-.+.+|++....+.......+++++++++.+-...... ...++ T Consensus 247 ~~~~~e~P~~~~Rw~~--~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~---~~~~g 321 (532) T protein:vir:99 247 EYPLDSCPWIPVRLIK--MPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVA---KANTG 321 (532) T ss_pred ccccccCCceeeeeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHcCCCceeccccccchhhhc---cCCCc Confidence 7888889988765443 688899999999999999999999999999999999999999877654332211 11222 Q ss_pred ceeeecccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcc-cc-cchhHHHHHHHHHHHHHHH Q lcl|Aclame:pro 380 AFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQ-MP-SNIAQETVNNLMNRADMAS 457 (708) Q Consensus 380 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G-~~-~n~sg~ai~~~q~q~~~~~ 457 (708) .++... .+.+ .+.......--+.....++...+.|....=+ + ++. .+ ...|++-|..+.+.....+ T Consensus 322 ~~v~g~-----~~~i-----~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~-~-~~~~~d~~r~TAtEV~~r~~E~~~~L 389 (532) T protein:vir:99 322 DFVAGR-----KQDV-----EVFQLEKYNDFQVAKATADDIEKRLSYAFML-N-SAVQRGGDRVTAEEIRYVAGELEDTL 389 (532) T ss_pred ceecCC-----cccc-----eeeecccccchhHHHHHHHHHHHHHHHHHhh-h-hcccCCCCcccHHHHHHHHHHHHHHh Confidence 222211 1111 1111111112233345666666666665422 2 222 22 2368888999998888888 Q ss_pred HHHHHHHH-HHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccc Q lcl|Aclame:pro 458 FIYLDNMA-KSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSY 536 (708) Q Consensus 458 ~~~~dn~~-~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~ 536 (708) ...+.+|. .+..-+.+..+.++.+. | . | + +-.... + ..++++ +.+ T Consensus 390 Gpv~~rl~~E~l~Pli~r~~~il~r~----------g---~-----l-P----~~p~~~-------~-~~~iv~--~is- 435 (532) T protein:vir:99 390 GGVYSLLSQELQLPLVKILLKELQAT----------S---K-----I-P----NLPKEA-------V-EPAIAT--GLE- 435 (532) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhc----------C---C-----C-C----CCChhh-------c-ccceee--cch- Confidence 88888876 45555656655555541 1 0 0 0 000000 0 123322 222 Q ss_pred hhHHHHHHHHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhh--hhcccCcchHHHHHHHHHHHHHHH Q lcl|Aclame:pro 537 TARRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLI--SGIAKPRNEKEQQIVQQAQMAAQS 614 (708) Q Consensus 537 ~~~r~~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~--~~~~~~~~~~~~q~~~~~qq~qq~ 614 (708) +-.|.+..+.++.+++.+.+..|. ++... +.++++..+...... ...... +++.++.++++++++. T Consensus 436 ~Laraq~~~~l~~~~~~laq~~p~-------~~d~i---d~d~~~~~~a~~~GV~~~~i~r~--~ee~~~~~~q~~~~~~ 503 (532) T protein:vir:99 436 ALGRGHDLNKLNVFIDYMIKLAGL-------QDDDI---NLLDVKMRLANSLGMDTTGLILT--QQDKQAKMAEASTAAG 503 (532) T ss_pred HHHHHHHHHHHHHHHHHHHhhcch-------hhhhC---CHHHHHHHHHHHhCCChhhccCC--HHHHHHHHHHHHHHHH Confidence 233666777777777776554432 22222 344455444443321 222222 2222222211111110 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 615 QPNPEMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAM 656 (708) Q Consensus 615 ~~~~~~~~aq~~~~~~qae~~k~~~~~~~~q~e~~~~~~~~~ 656 (708) + + + ++. ++. +.+.++.+.....++.++.+ T Consensus 504 ~-~--~---a~~----~~~---~~~~~~~~~~~~~~~~~~~~ 532 (532) T protein:vir:99 504 M-V--T---AGQ----QMG---AAGGQAAAAMMQQQAGMPTQ 532 (532) T ss_pred H-H--H---HHH----HHH---HHHHHhcchhHHhhcCCCCC Confidence 0 0 0 000 000 00000000001111111111 No 45 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=99.74 E-value=1.3e-16 Score=107.63 Aligned_cols=474 Identities=11% Similarity=0.034 Sum_probs=218.6 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHH-HHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEV-REKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAE 79 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~-r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~g~ 79 (708) |.. .+...-.-..++.+..+.+... +.+. ++-..||.|+|.--......... ....-.++.|..+.+++..+|+ T Consensus 31 ~~~-~~~~~~~~~~~i~~~i~~~~~~~~~r~--~~l~~Yy~g~~~i~~~~~~~~~~--~~~~~ki~~n~~k~Iv~~~~~y 105 (511) T protein:vir:10 31 YDG-TESDLLQNVNEVSKCIEHHMDYQRPRL--KVLSDYYEGKTKNLVELTRRKEE--YMADNRVAHDYASYISDFINGY 105 (511) T ss_pred Cch-hhhhcccCHHHHHHHHHHHHHhhHHHH--HHHHHHhcccCccccccCccccc--ccCcceeecchHHHHHHHHhhh Confidence 221 0111000011222222222211 1122 12235888987521111111110 1112356789999999999999 Q ss_pred HhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCcceeeEE Q lcl|Aclame:pro 80 YRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAIEP 159 (708) Q Consensus 80 ~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~i~~ 159 (708) ...+.+.+.+ ++.+ ....+..+++.|+++.....+..+++++|.+|..+..+. ++.+++.. T Consensus 106 l~g~p~~~~~------~d~~----~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~de---------dg~~~i~~ 166 (511) T protein:vir:10 106 FLGNPIQYQD------DDKD----VLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYEIMIRNQ---------DDETRLYK 166 (511) T ss_pred hcccCceeec------CchH----HHHHHHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEeCC---------CCceEEEE Confidence 9999888753 1222 235677777889999999999999999999988775421 23455544 Q ss_pred eecchhhee--cCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeeeecceE Q lcl|Aclame:pro 160 IYDPSRSVW--FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVRKES 237 (708) Q Consensus 160 v~~~~~~v~--~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~~~~ 237 (708) + ++.++| ||.... .-...+++.|... ..+....+.+...+.|..... T Consensus 167 ~--~p~~~~~vydd~~~-----~~~~~~vr~~~~~-----------------------~~d~~~~~~~~~~~iyt~~~i- 215 (511) T protein:vir:10 167 S--DAMSTFVIYDNTIE-----RNSIAGVRYLRTK-----------------------PIDKTDEDEVFTVDLFTSHGV- 215 (511) T ss_pred E--ccceeEEEEcCCCC-----CceEEEEEEEEee-----------------------ecccCccceEEEEEEEeCCcE- Confidence 3 233443 443221 1112233332110 000011222333344443321 Q ss_pred EEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEeeec Q lcl|Aclame:pro 238 VDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRWF 317 (708) Q Consensus 238 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~~~~ 317 (708) +.|.....+.. ........+.+.|++.+|+++|. T Consensus 216 ---~~~~~~~~~~~--------------------------------------~~~~~~~~~~~~~~~~vPvv~f~----- 249 (511) T protein:vir:10 216 ---YRYLTSRTNGL--------------------------------------KLTPRENGFESHSFERMPITEFS----- 249 (511) T ss_pred ---EEEEecCCCcc--------------------------------------cccccccccccccCcceeEEEec----- Confidence 11111100000 00001113345566666666543 Q ss_pred cCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeeccccccc--cccc Q lcl|Aclame:pro 318 IDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKS--GNII 395 (708) Q Consensus 318 ~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~ 395 (708) . ...+.|.+..+++.++.+|...|.+.+.+...+.+.+++......+.... .. ................ +... T Consensus 250 -n-n~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~-~~--~~~~~~~~~~~~~~~~~~~~~~ 324 (511) T protein:vir:10 250 -N-NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEV-RK--QKEANVLFLEPTVYADSEGRET 324 (511) T ss_pred -C-CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeeccccCCchhh-cc--chhccceecccccccccccccC Confidence 1 22467999999999999999999999999888777665532111111110 00 0111111111111100 1111 Q ss_pred ccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHccc-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 396 AGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM-PSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAGEV 474 (708) Q Consensus 396 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~-~~n~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~~~~ 474 (708) .+.....+...+.-..++...+..+...|..+|++.+.+.+. .+|.||.|+...-..........-..|..++++++++ T Consensus 325 ~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~l 404 (511) T protein:vir:10 325 EGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKL 404 (511) T ss_pred CCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111122333333234666678888889999999988766553 4678999999888777777777777888888888888 Q ss_pred HHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHHHhc Q lcl|Aclame:pro 475 WLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSM 554 (708) Q Consensus 475 ~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq~~ 554 (708) ++.++...-.. +....+ .+|.|.=.+..+....+..+.+..+...+ T Consensus 405 i~~~~~~~~~~---------~~~~d~-------------------------~~i~i~f~~~~p~d~~~~~~~~~kl~G~i 450 (511) T protein:vir:10 405 LETILKNTRSI---------DANKDF-------------------------NTVRYVYNRNLPKSLIEELKAYIDSGGKI 450 (511) T ss_pred HHHHHHhhCCc---------cccccc-------------------------ceeeEEeCCCCCcCHHHHHHHHHHHhccC Confidence 77765432110 000000 13344444555555555666666653211 Q ss_pred cccCchhHHHHHHHHhhccc-hhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 555 LPTDPMRPAIQGIILDNIDG-EGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAE 633 (708) Q Consensus 555 ~~~~p~~~~~~~~~~~~~d~-~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qae 633 (708) + ...+++++.+ ...++-.+++...... .....+............... ..+. T Consensus 451 S---------~et~~~~l~~v~d~~~E~~ri~~E~~~-------------~~~~~~~~~~~~~~~~~~~~~----~~~~- 503 (511) T protein:vir:10 451 S---------QTTLMSLFSFFQDPELEVKKIEEDEKE-------------SIKKAQKGIYKDPRDINDDEQ----DDDT- 503 (511) T ss_pred c---------HHHHHHhCCCCCCHHHHHHHHHHHHHH-------------HHHHHhhhcccCCCCCCCCCC----CCcc- Confidence 1 1223333321 1122222333211000 000000000000000000000 0000 Q ss_pred HHHHHHHHHH Q lcl|Aclame:pro 634 AQKATNETAQ 643 (708) Q Consensus 634 ~~k~~~~~~~ 643 (708) +-..+..+ T Consensus 504 --~~~~~~~~ 511 (511) T protein:vir:10 504 --KDTVDKKE 511 (511) T ss_pred --cCcccccC Confidence 00000000 No 46 >protein:vir:94572 Length: 535 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919010;genbank:gi:119637774;genbank:GeneID:5179332 Probab=99.74 E-value=1.5e-15 Score=101.80 Aligned_cols=508 Identities=13% Similarity=0.084 Sum_probs=246.8 Q ss_pred CCcchHHH---HHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcC--CCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHH Q lcl|Aclame:pro 1 MAETLEKK---HERIMLRFDRAYSPQKEVREKCIEATRFARVP--GGQWEGATAAGTKLDEQFEKYPKFEINKVATELNR 75 (708) Q Consensus 1 ma~~~~~~---~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~--G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~ 75 (708) ||.++... -+.++.+|+...+....|...|+++.+|.+=. ..+.+.. . ...+.+.-..-...++. T Consensus 1 ~~~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~--~--------~~~~~~~dst~~~a~~~ 70 (535) T protein:vir:94 1 MASSQKREGFAENGAKAVYDALKNDRNSYETRAENCAKYTIPSLFPKDSDNA--S--------TDYTTPWQAVGARGLNN 70 (535) T ss_pred CCchhhhhhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCcc--c--------cccCCcccccHHHHHHH Confidence 99865554 45688888888888888999998887753211 1111110 0 01122333444445554 Q ss_pred HHH----HHhcCcceeEEecCCC------cchH---HHHHHHHH---HHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEE Q lcl|Aclame:pro 76 IIA----EYRNNRITVKFRPGDR------EASE---ELANKLNG---LFRADYEETDGGEACDNAFDDAATGGFGCFRLT 139 (708) Q Consensus 76 i~g----~~~~nr~~~~v~pr~~------~~d~---~~A~~l~~---~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~ 139 (708) +.+ ...-+++=+++.+.+. ..+. ++.+.|+. .+......|++..+...++.+.+..|.|.+.+. T Consensus 71 Laa~l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~ 150 (535) T protein:vir:94 71 LASKLMLALFPMQTWMKLTISEFEAKQLVAQPAELAKVEEGLSMVERILMNYIESNSYRVTLFETLKQLVVAGNALLYIP 150 (535) T ss_pred HHHHHHhhhcCCCCccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEeec Confidence 443 3333343333333321 0111 23333333 344445689999999999999999999987663 Q ss_pred eeccccCCCCCCCcceeeEEeecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCC Q lcl|Aclame:pro 140 SMLVNEYDPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNW 219 (708) Q Consensus 140 ~~~~~~~d~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~ 219 (708) .+ +.. .+++..+ |+.++++..+.. . ...-|+++..++.+.+-..|++...... ++ T Consensus 151 ~~------~~~---~~~f~~~--pl~~y~v~~d~~---G-~vd~i~r~~~~~~~~l~~~~~~~~~~~~--------~~-- 205 (535) T protein:vir:94 151 EP------EGT---YNPMKLY--RLSSYVVQRDAF---G-TVLQIVTLDKTAYAALPEDVRNSMDSSQ--------EH-- 205 (535) T ss_pred cC------cCc---ccceEEE--EcCeEEEeeCCC---C-CeEEEEeeeeccHHHhhHHHHHHHHhcc--------cc-- Confidence 22 111 1233333 455666544332 1 2335788889999999887765321100 00 Q ss_pred CCCceeEEeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCC Q lcl|Aclame:pro 220 FGADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPR 299 (708) Q Consensus 220 ~~~~~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~ 299 (708) ...+.+.|.. +.+.++.. ..+.|+ +...|..+....+ T Consensus 206 ~~~~~v~v~~-----------~v~~~~~~-------------------------------~~~~~~-~e~~g~~~~~~~~ 242 (535) T protein:vir:94 206 KGDEMIDVYT-----------HIYLDEES-------------------------------GEYLKY-EEIDGVEVEGTDA 242 (535) T ss_pred CCCceeEEEE-----------EEEeeCCC-------------------------------CcEEEE-EEecCeeeccccc Confidence 0111111111 11111111 111121 2233444433346 Q ss_pred CCCCCCcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCC Q lcl|Aclame:pro 300 RIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRP 379 (708) Q Consensus 300 ~~p~~~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~ 379 (708) .+++..+||+++.... .+|..+|.|.+....+-.+.+|++....+.......+++++++++.+-...... ...++ T Consensus 243 ~~g~~~~P~~~~Rw~~--~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~---~~~~g 317 (535) T protein:vir:94 243 SYPVDACPYIPVRMVR--IDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQVRRLT---KAQTG 317 (535) T ss_pred cCccccCCceeeeeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccccchhhcc---cCCCc Confidence 7888999998765443 688899999999999999999999999999999999999999876553332211 11122 Q ss_pred ceeeecccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcc-cc-cchhHHHHHHHHHHHHHHH Q lcl|Aclame:pro 380 AFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQ-MP-SNIAQETVNNLMNRADMAS 457 (708) Q Consensus 380 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G-~~-~n~sg~ai~~~q~q~~~~~ 457 (708) .++... .+.+. +......+ --+...++++...+.|....=+ .+++ .+ ...|++-|..+.+.....+ T Consensus 318 ~~v~g~-----~~~v~----~~~~~~~~-~~~~~~~~i~~~~~rI~~af~~--~~~~~~d~~rvTAtEV~~r~~E~~~~L 385 (535) T protein:vir:94 318 DFVSGR-----PEDIS----FLQLEKAA-DFSVARAVSEQIEGRLSYAFML--NSAVQRTGERVTAEEIRYVASELEDTL 385 (535) T ss_pred eeecCC-----cccce----eeeccccc-chhHHHHHHHHHHHHHHHHHhH--hhhccCCCCCccHHHHHHHHHHHHHHh Confidence 222111 11111 11111112 2233445666666666655422 2232 22 3368888999998888888 Q ss_pred HHHHHHHH-HHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccc Q lcl|Aclame:pro 458 FIYLDNMA-KSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSY 536 (708) Q Consensus 458 ~~~~dn~~-~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~ 536 (708) ...+.+|. .+..-+.+..+.++.+.- ++ +. . | . ++ +.+.+..+. . T Consensus 386 Gpv~~rl~~ElL~Pli~r~~~il~r~g----~l---------------P~-~-p--~------~~----v~~~~vs~l-a 431 (535) T protein:vir:94 386 GGVYSILSQELQLPMVRVLLKQLQATN----QI---------------PE-L-P--K------EA----VEPTISTGM-E 431 (535) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHhCC----CC---------------CC-C-C--h------hh----ccceEeehH-H Confidence 88888776 455555555555544321 00 00 0 0 0 00 123332333 3 Q ss_pred hhHHHHHHHHHHHHHHhccccCchhHHHHHHHHh-hccchhHHHHHHHHHhhhhhh--hcccCcchHHHHHHHHHHHHHH Q lcl|Aclame:pro 537 TARRDATVSVLTNVLSSMLPTDPMRPAIQGIILD-NIDGEGLDDFKEYNRNQLLIS--GIAKPRNEKEQQIVQQAQMAAQ 613 (708) Q Consensus 537 ~~~r~~~~~~l~~llq~~~~~~p~~~~~~~~~~~-~~d~~~~~ei~e~~~~~~~~~--~~~~~~~~~~~q~~~~~qq~qq 613 (708) +..|.+..+.++++++.+....|. .+. .. +.++++..+....... .... .+++.++.++++++++ T Consensus 432 ~l~r~~~~~~l~~~~~~laq~~P~-------~ld~~i---d~d~~~~~~a~~~Gvp~~~i~r--s~eev~~~~~q~~~~~ 499 (535) T protein:vir:94 432 ALGRGQDLDKLERCIAAWSALAPM-------QGDPDI---NIATIKLRIANAIGIDTSGILK--TPEEKQQEMAEAAQGT 499 (535) T ss_pred HHHHHHHHHHHHHHHHHHHhhChH-------HhhhcC---CHHHHHHHHHHHhCCChhhhcC--CHHHHHHHHHHHHHHH Confidence 455777888888888776554442 121 12 3344444444333221 1222 1222221111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 614 SQPNPEMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVMEAIRLLKD 686 (708) Q Consensus 614 ~~~~~~~~~aq~~~~~~qae~~k~~~~~~~~q~e~~~~~~~~~~~~a~~~~~~~q~~~~~~~~~~~~~~~~~~ 686 (708) +++ .+..+..+.+.. +.+.....+.. ..+ ++ .+.-. T Consensus 500 ~~~--~~~~~~g~~~~~---~~~~~~~~~~~-------~~~-------------~~------------g~~~~ 535 (535) T protein:vir:94 500 AMQ--NAAASAGAGAGT---MATASPENMKA-------AAA-------------QA------------GMAPN 535 (535) T ss_pred HHH--HHHHHHHHhhhc---ccccChHHHHH-------HHH-------------Hh------------ccCCC Confidence 100 000000000000 00000000000 000 00 00000 No 47 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=99.73 E-value=5.9e-16 Score=104.09 Aligned_cols=438 Identities=10% Similarity=0.042 Sum_probs=213.1 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCC--CceeecchHHHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKY--PKFEINKVATELNRIIA 78 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~gr--p~~~~N~i~~~i~~i~g 78 (708) |- +.+.+..+.+..|-. .+...+.+.. +...||.|+| + .... .....++ -.++.|..+.+|+..+| T Consensus 11 ~p-~d~~~~~~~l~~~i~---~~~~~~~r~~--~~~~yy~g~~--~-i~~~---~~~~~~~~~~ki~~n~~~~ivd~~~~ 78 (453) T protein:vir:39 11 FP-KDEPITNEVVTKFME---KHRLEVARYE--YLKNMYRGIM--A-IDAE---PTKDLWKPDNRLTVNFTKYIVDTFTG 78 (453) T ss_pred cC-CCCCCCHHHHHHHHH---HHHHHHHHHH--HHHHHhhccC--c-hhcC---CCccccCccceeecchHHHHHHHHhh Confidence 43 222233333333322 2222222232 2235788976 1 1111 1111122 24678999999999999 Q ss_pred HHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCcceeeE Q lcl|Aclame:pro 79 EYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAIE 158 (708) Q Consensus 79 ~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~i~ 158 (708) +...+.+.+.+ - ++ + ....+..++..|+++.....+..+++++|.||+.+..+. ++.+++. T Consensus 79 ~l~g~~~~~~~--~---d~-~----~~~~l~~i~~~N~~~~~~~~~~~~~~~~G~~~~~v~~d~---------~g~~~i~ 139 (453) T protein:vir:39 79 YFNGIPVKKSH--S---DK-E----TLSKLQEFDNLNDMEDEESELAKMACIYGRAFELLYQNE---------ETQTNVI 139 (453) T ss_pred hhcccCceecc--C---Ch-H----HHHHHHHHHHhcChhHHHHHHHHHHhhcCeEEEEEEecC---------CCceEEE Confidence 99888876653 1 11 1 134577777889999999999999999999998886432 2344444 Q ss_pred Eeecchhhe--ecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeeeecce Q lcl|Aclame:pro 159 PIYDPSRSV--WFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVRKE 236 (708) Q Consensus 159 ~v~~~~~~v--~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~~~ 236 (708) .+ ++.++ +||+.... ...+++ +.+.. .+.....++|...+. T Consensus 140 ~~--~p~~~~~v~d~~~~~----~~~~~i-r~~~~------------------------------~~~~~~~~~yt~~~i 182 (453) T protein:vir:39 140 YN--TPENMFMVYDDTIKQ----EPLFAV-RYGYD------------------------------DDYKLYGEVYTKETT 182 (453) T ss_pred EE--cccceEEEecCCCCC----eEEEEE-EEEEe------------------------------CCeEEEEEEEeCCeE Confidence 33 22333 34432210 111111 11100 011122333332211 Q ss_pred EEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEeee Q lcl|Aclame:pro 237 SVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRW 316 (708) Q Consensus 237 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~~~ 316 (708) +.|.. -++...+.++.+.+++.+|+|||.- T Consensus 183 ----~~~~~-------------------------------------------~~~~~~~~~~~~~~~g~vPvv~~~n--- 212 (453) T protein:vir:39 183 ----YALNG-------------------------------------------TMGFYNMTEQAPNPFDDLPVVEFYF--- 212 (453) T ss_pred ----EEEEe-------------------------------------------cCCceeeecccccCCCceeEEEecC--- Confidence 11110 0011112233455566777776532 Q ss_pred ccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeecccccccccccc Q lcl|Aclame:pro 317 FIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKSGNIIA 396 (708) Q Consensus 317 ~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 396 (708) ...+.|.+..+++.++.+|+.+|.+...+...+.+.+++--...++- ..... .. .+.+..... . +. + T Consensus 213 ----~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~p~~~~~g~~~~~~--~~~~~-~~-~~~~~~~~~-~--~~--~ 279 (453) T protein:vir:39 213 ----NEERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLGAAVEEE--DLKNI-RS-NRVINYYGE-S--SE--A 279 (453) T ss_pred ----CCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCCCch--hhhhh-hh-cceeeecCC-C--CC--C Confidence 22467889999999999999999999999888887766642222211 11111 11 122211111 0 00 0 Q ss_pred cccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 397 GATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAGEVWL 476 (708) Q Consensus 397 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~~~~~l 476 (708) ......+...+.-...+...+..+...|..+|++.+.+.+..+|.||.|+.................|..+++++.++++ T Consensus 280 ~~~~~~~lt~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~ 359 (453) T protein:vir:39 280 KNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVANISDESFGSSSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLYC 359 (453) T ss_pred CCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 01112223222223566667788888888999987776666678899999888777777777777778888888888777 Q ss_pred HHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHHHhccc Q lcl|Aclame:pro 477 SMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLP 556 (708) Q Consensus 477 ~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq~~~~ 556 (708) .+.... |.. ..+ -||.|.=.+..+....+..+.++.+...++ T Consensus 360 ~~~~~~----------~~~--~~~-------------------------~~i~v~f~~~~p~~~~~~a~~~~kl~g~is- 401 (453) T protein:vir:39 360 ELSTNV----------SNK--EAW-------------------------KDIEYTFTRNEPKDIKEQAETANILMGITS- 401 (453) T ss_pred HHHhcc----------CCc--ccc-------------------------ccceEEeCCCCCcCHHHHHHHHHHHhccCC- Confidence 654321 110 000 123333344555545555555555422111 Q ss_pred cCchhHHHHHHHHhhccc-hhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 557 TDPMRPAIQGIILDNIDG-EGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAE 633 (708) Q Consensus 557 ~~p~~~~~~~~~~~~~d~-~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qae 633 (708) ...+++++.+ +..++-.+++.+....... . .+...+.... -+........| T Consensus 402 --------~et~l~~l~~v~D~~~E~~ri~~E~~~~~~-------------~---~~~~~~~~~~--~~~~~~~~~~e 453 (453) T protein:vir:39 402 --------QETALSVISVIPDVQAEMEKIKKEEASTAI-------------F---DKDKQPSEKG--TDTVVPETNEE 453 (453) T ss_pred --------hHHHHHhCCCCCCHHHHHHHHHHHHHHHHH-------------H---HHhccCCCCC--CCCCCCCcCCC Confidence 1223333322 1222222333221110000 0 0000000000 00000000000 No 48 >protein:vir:96988 Length: 516 # NCBI annotation: 29 # Family: family:all:481 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654130;genbank:gi:108862014;genbank:GeneID:5075937 Probab=99.73 E-value=1.3e-15 Score=102.11 Aligned_cols=494 Identities=14% Similarity=0.052 Sum_probs=237.7 Q ss_pred CCcchHHHH----HHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKH----ERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRI 76 (708) Q Consensus 1 ma~~~~~~~----~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i 76 (708) |++...... .+++.+|+...+....|...|+++.+|.. .+ =+++..-. .+...+.-+.-...++.+ T Consensus 1 ~~~~~~~~~~~~~~~l~~r~~~L~~~R~~~e~~w~e~a~~~l-P~-~~~~~~~~--------~~~~~~~dstg~~a~~~L 70 (516) T protein:vir:96 1 MKQSIDLEYGGKRSKIPKLWEKFSNKRSSFLDRAKHYSKLTL-PY-LMNDKGDN--------ETSQNGWQGVGAQATNHL 70 (516) T ss_pred CcchhhhhhhhhHHHHHHHHHHHHHHhhHHHHHHHHHHHhhc-cc-ccCCCCCc--------cccCCcccchHHHHHHHH Confidence 777554443 56777888777778888889988876532 11 11111000 011112233344445544 Q ss_pred HHHHhc-----CcceeEEecCCC------cchH---HHHHH---HHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEE Q lcl|Aclame:pro 77 IAEYRN-----NRITVKFRPGDR------EASE---ELANK---LNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLT 139 (708) Q Consensus 77 ~g~~~~-----nr~~~~v~pr~~------~~d~---~~A~~---l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~ 139 (708) .+.... +++=+++.+.+. +.+. ++.+. ++..+......|++..+...++.+.+..|.|++.+ T Consensus 71 Aa~l~~~ltpp~~~WF~L~~~~~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~- 149 (516) T protein:vir:96 71 ANKLAQVLFPAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSCMLYK- 149 (516) T ss_pred HHHHHhhhcCCCCcccccccChhHHhhccccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEeEEe- Confidence 433332 344444544321 0111 22222 45556666778999999999999999999987654 Q ss_pred eeccccCCCCCCCcceeeEEeecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCC Q lcl|Aclame:pro 140 SMLVNEYDPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNW 219 (708) Q Consensus 140 ~~~~~~~d~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~ 219 (708) + +. . .++.. |+.++++..+.. . ...-++++.+++..+|.+.|+....... . ..+. T Consensus 150 -d---~~---~---~~~~~----pl~~y~v~~d~~---G-~v~~i~rr~~~~~~~l~~~~~~~~~~~~---~--~~~~-- 204 (516) T protein:vir:96 150 -P---SK---G---AISAI----PMHHYVVNRDTN---G-DLLDIILLQEKALRTFDPATRAVVEVGL---K--GKKC-- 204 (516) T ss_pred -c---CC---C---CEEEE----EcCeEEEeeCCC---C-CeeeehhhhHhhHHHHHHhhhhhhhhhh---h--hhhc-- Confidence 1 11 1 12322 455666544332 1 2234788889999999888854221100 0 0000 Q ss_pred CCCceeEEeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCC Q lcl|Aclame:pro 220 FGADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPR 299 (708) Q Consensus 220 ~~~~~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~ 299 (708) ...+.+ ++|. +.++++. . ++.++.-.+.......+ T Consensus 205 ~~~~~v---~v~~--------~v~~~~~--------------------------------~--~~~~~~~~d~~~~~~es 239 (516) T protein:vir:96 205 KEDDSV---KLYT--------HAKYLGD--------------------------------G--FWELKQSADDIPVGKVS 239 (516) T ss_pred CCCCce---EEEE--------eeeeeCC--------------------------------c--eeEEEEEeCceeecccc Confidence 001111 1110 1111110 0 11122222222223346 Q ss_pred CCCCCCcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCC Q lcl|Aclame:pro 300 RIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRP 379 (708) Q Consensus 300 ~~p~~~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~ 379 (708) .||+..+||+++.... .+|..+|.|.+....+--+.+|++...++.......++.++++.+.+-....... ..++ T Consensus 240 ~~~~~e~P~~~~Rw~~--~~ge~YGrgp~~~~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~l~~---~~~g 314 (516) T protein:vir:96 240 KIKSEKLPFIPLTWKR--SYGEDWGRPLAEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGAQTDVDHFVN---SGTG 314 (516) T ss_pred ccccccCCeeeeeeee--cCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCccccCcccccchhhhcc---CCCc Confidence 7888889998765443 6888999999999999999999999999999999999999998766643322111 1111 Q ss_pred ceeeecccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccchhHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 380 AFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMASFI 459 (708) Q Consensus 380 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n~sg~ai~~~q~q~~~~~~~ 459 (708) .++ ++ ..+.+ .+.+...... -+.....++...+.|....=+...........|++-|..+.+--...+.. T Consensus 315 ~i~---~g--~~~~v----~~~q~~~~~d-~~~~~~~i~~~~~rI~~af~~~~l~~r~~~rvTAtEV~~r~~E~~~~LGp 384 (516) T protein:vir:96 315 EVV---TG--VEEDI----HIVQLGKYAD-LTPISAVLEVYTRRIGVVFMMETMTRRDAERVTAVEIQRDALEIEQNMGG 384 (516) T ss_pred eee---cC--Ccccc----eeeecCcccc-hhHHHHHHHHHHHHHHHHHhhhhhccCCCccccHHHHHHHHHHHHHHhhh Confidence 121 11 01111 1111111111 23444566666666666542221111122336888888888877777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhH Q lcl|Aclame:pro 460 YLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTAR 539 (708) Q Consensus 460 ~~dn~~~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~ 539 (708) .+.+|.. +++..||..-+ .+.++. .|+ +.+.+.+..+ -.+-. T Consensus 385 v~~rl~~------Ell~Pli~r~l------~~~~p~-------------lp~------------~~v~~~~vs~-l~~l~ 426 (516) T protein:vir:96 385 VYSLFAT------TMQSPVAMWGL------LEAGES-------------FTS------------DLVDPVIITG-IEALG 426 (516) T ss_pred HHHHHHH------HHHHHHHHHHH------HhcCCC-------------Ccc------------ccccceeech-HHHHH Confidence 7777664 23333333321 111110 010 0122333333 33456 Q ss_pred HHHHHHHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhhh-hcccCcchHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 540 RDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLIS-GIAKPRNEKEQQIVQQAQMAAQSQPNP 618 (708) Q Consensus 540 r~~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~-~~~~~~~~~~~q~~~~~qq~qq~~~~~ 618 (708) |.+..+.+.++++.++...+.. +.++...| .++++..+....... ....+ +++..+.+++++.+++.+.. T Consensus 427 r~~~~~~i~~~~~~i~~~~~~~----p~v~d~id---~d~~~~~~a~~~Gvp~~~irs--~eev~~~~~~~~~~q~~~~~ 497 (516) T protein:vir:96 427 RMAELDKLANFAQYMSLPLQWP----EPVLAAVK---WPDYMDWVRGQISAELPFLKS--AEEMAQEQEAQMQAQQAQML 497 (516) T ss_pred HHHHHHHHHHHHHHHHHHhcCC----hhHHhcCC---HHHHHHHHHHHhCCCccccCC--HHHHHHHHHHHHHHHHHHHH Confidence 8888888888877765432221 22333333 344444443332221 12221 22221111111111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 619 EMVLAQAQMVAAQAEAQKATNETAQTQIKAF 649 (708) Q Consensus 619 ~~~~aq~~~~~~qae~~k~~~~~~~~q~e~~ 649 (708) +....+ +....++.|+.-. T Consensus 498 a~~~~~------------~~~~~~~~~~~~~ 516 (516) T protein:vir:96 498 EEGVAK------------AVPGVIQQELKEA 516 (516) T ss_pred HHHhhh------------hhhHHhhcccccC Confidence 111111 1111111111000 No 49 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=99.73 E-value=6.3e-16 Score=103.92 Aligned_cols=469 Identities=11% Similarity=0.030 Sum_probs=221.0 Q ss_pred CC--cchHH-HHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHH Q lcl|Aclame:pro 1 MA--ETLEK-KHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRII 77 (708) Q Consensus 1 ma--~~~~~-~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~ 77 (708) +. +.... ..+++...+........+.++ +-..||.|+|.--......... ......++.|..+.+++..+ T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~-----~l~~Yy~g~~~il~~~~~~~~~--~~~~~ki~~n~~k~Iv~~~~ 103 (511) T protein:vir:96 31 YDGTESDLLQNVNEVSKYIEHHMDYQRPRLK-----VLSDYYEGKTKNLVELTRRKEE--YMADNRVAHDYASYISDFIN 103 (511) T ss_pred ccchhhhhhcCHHHHHHHHHHHHHhhhHHHH-----HHHHHhhccCccccccCccccc--ccCcceeecchHHHHHHHHh Confidence 22 11110 122233333332222222221 2235899987521111111110 11123577899999999999 Q ss_pred HHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCcceee Q lcl|Aclame:pro 78 AEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAI 157 (708) Q Consensus 78 g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~i 157 (708) |+...+.+.+.+ . +.+ ..+.+..+++.|+++.....+..+++++|.+|..+..+. ++.+++ T Consensus 104 ~yl~g~p~~~~~-----~-d~~----~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~d~---------dg~~~i 164 (511) T protein:vir:96 104 GYFLGNPIQYQD-----D-DKD----VLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQ---------DDETRL 164 (511) T ss_pred hhhcccCceeec-----C-chH----HHHHHHHHHhhcChhHHHHHHHHHHHhcCeeEEEEEeCC---------CCceEE Confidence 999999888763 1 122 234577777889999999999999999999988775431 234555 Q ss_pred EEeecchhhee--cCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeeeecc Q lcl|Aclame:pro 158 EPIYDPSRSVW--FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVRK 235 (708) Q Consensus 158 ~~v~~~~~~v~--~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~~ 235 (708) ..+ ++.++| ||.... .-...+++.|.... .+....+.+...++|...+ T Consensus 165 ~~~--~p~~~~~v~dd~~~-----~~~~~~vr~~~~~~-----------------------~~~~~~~~~~~~~vyt~~~ 214 (511) T protein:vir:96 165 YKS--DAMSTFIIYDNTVE-----RNSIAGVRYLRTKP-----------------------IDKTDEDEVFTVDLFTSHG 214 (511) T ss_pred EEE--cccceEEEEcCCCC-----CceEEEEEEEEeee-----------------------ccccccceEEEEEEEeCCc Confidence 433 333443 554221 11122333331100 0001112233334444332 Q ss_pred eEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEee Q lcl|Aclame:pro 236 ESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKR 315 (708) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~~ 315 (708) . +.|.....+... .........+.|++.+|+++|. T Consensus 215 i----~~~~~~~~~~~~--------------------------------------~~~~~~~~~~~~~g~vPvv~~~--- 249 (511) T protein:vir:96 215 V----YRYLTNRTNGLK--------------------------------------LTPRENSFESHSFERMPITEFS--- 249 (511) T ss_pred E----EEEEecCCCccc--------------------------------------ccccccccccCcCcccceEEec--- Confidence 1 111111110000 0000123345666667776542 Q ss_pred eccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeecccc--ccccc Q lcl|Aclame:pro 316 WFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVR--DKSGN 393 (708) Q Consensus 316 ~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~ 393 (708) +...|.|.+..+++.++.+|...|.+.+.+...+.+.+++-.....+... ... ......+...... ...+. T Consensus 250 ----n~~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~-~~~--~~~~~~~~~~~~~~~~~~~~ 322 (511) T protein:vir:96 250 ----NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVE-VRK--QKEANVLFLEPTVYVDAEGR 322 (511) T ss_pred ----CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCCchh-hcc--cccccceeccccceeccccc Confidence 12346799999999999999999999999988777766553211111110 000 1111111111100 00111 Q ss_pred ccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHccc-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 394 IIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM-PSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAG 472 (708) Q Consensus 394 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~-~~n~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~~ 472 (708) .........+...+.-..++...+..+...|..+|++.+.+.+. .+|.||.|+...............+.|..+++++. T Consensus 323 ~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~ 402 (511) T protein:vir:96 323 ETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRA 402 (511) T ss_pred cCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11111122333322224566677888888999999988876665 46789999998887777777777788888888888 Q ss_pred HHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHHH Q lcl|Aclame:pro 473 EVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLS 552 (708) Q Consensus 473 ~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq 552 (708) ++++.++...-.... ...+ .+|.+.=.+..+....+..+.+..+.. T Consensus 403 ~li~~~~~~~~~~~~---------~~~~-------------------------~~i~~~f~~~~p~n~~e~~d~~~kl~G 448 (511) T protein:vir:96 403 KLLETILKNTRSIDA---------NKDF-------------------------NTVRYVYNRNLPKSLIEELKAYIDSGG 448 (511) T ss_pred HHHHHHHHhcCCCcc---------cccc-------------------------ccceEEeCCCCCcCHHHHHHHHHHHhc Confidence 887776543221000 0000 123333344445544555555555532 Q ss_pred hccccCchhHHHHHHHHhhccc-hhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHH---HHHHHHHHH Q lcl|Aclame:pro 553 SMLPTDPMRPAIQGIILDNIDG-EGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPE---MVLAQAQMV 628 (708) Q Consensus 553 ~~~~~~p~~~~~~~~~~~~~d~-~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~---~~~aq~~~~ 628 (708) .++ ..++++++.+ ...++..+++...... .....+.......... ....+.+.. T Consensus 449 ~iS---------~et~l~~l~~v~d~~~El~ri~~E~~~-------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 506 (511) T protein:vir:96 449 KIS---------QTTLMSLFSFFQDPELEVKKIEEDEKE-------------SIKKAQKGIYKDPRDINDDEQDDDTKDT 506 (511) T ss_pred cCC---------hHHHHHhCCCCCCHHHHHHHHHHHHHH-------------HHHHHhhccccCCCCCCCCCCCCCccCc Confidence 111 1223333322 2223333444322100 0000000000000000 000000000 Q ss_pred HHHHH Q lcl|Aclame:pro 629 AAQAE 633 (708) Q Consensus 629 ~~qae 633 (708) ..|.| T Consensus 507 ~~e~~ 511 (511) T protein:vir:96 507 VDKKE 511 (511) T ss_pred ccccC Confidence 00001 No 50 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=99.73 E-value=6.3e-16 Score=103.92 Aligned_cols=469 Identities=11% Similarity=0.030 Sum_probs=221.0 Q ss_pred CC--cchHH-HHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHH Q lcl|Aclame:pro 1 MA--ETLEK-KHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRII 77 (708) Q Consensus 1 ma--~~~~~-~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~ 77 (708) +. +.... ..+++...+........+.++ +-..||.|+|.--......... ......++.|..+.+++..+ T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~-----~l~~Yy~g~~~il~~~~~~~~~--~~~~~ki~~n~~k~Iv~~~~ 103 (511) T protein:vir:78 31 YDGTESDLLQNVNEVSKYIEHHMDYQRPRLK-----VLSDYYEGKTKNLVELTRRKEE--YMADNRVAHDYASYISDFIN 103 (511) T ss_pred ccchhhhhhcCHHHHHHHHHHHHHhhhHHHH-----HHHHHhhccCccccccCccccc--ccCcceeecchHHHHHHHHh Confidence 22 11110 122233333332222222221 2235899987521111111110 11123577899999999999 Q ss_pred HHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCcceee Q lcl|Aclame:pro 78 AEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAI 157 (708) Q Consensus 78 g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~i 157 (708) |+...+.+.+.+ . +.+ ..+.+..+++.|+++.....+..+++++|.+|..+..+. ++.+++ T Consensus 104 ~yl~g~p~~~~~-----~-d~~----~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~d~---------dg~~~i 164 (511) T protein:vir:78 104 GYFLGNPIQYQD-----D-DKD----VLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQ---------DDETRL 164 (511) T ss_pred hhhcccCceeec-----C-chH----HHHHHHHHHhhcChhHHHHHHHHHHHhcCeeEEEEEeCC---------CCceEE Confidence 999999888763 1 122 234577777889999999999999999999988775431 234555 Q ss_pred EEeecchhhee--cCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeeeecc Q lcl|Aclame:pro 158 EPIYDPSRSVW--FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVRK 235 (708) Q Consensus 158 ~~v~~~~~~v~--~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~~ 235 (708) ..+ ++.++| ||.... .-...+++.|.... .+....+.+...++|...+ T Consensus 165 ~~~--~p~~~~~v~dd~~~-----~~~~~~vr~~~~~~-----------------------~~~~~~~~~~~~~vyt~~~ 214 (511) T protein:vir:78 165 YKS--DAMSTFIIYDNTVE-----RNSIAGVRYLRTKP-----------------------IDKTDEDEVFTVDLFTSHG 214 (511) T ss_pred EEE--cccceEEEEcCCCC-----CceEEEEEEEEeee-----------------------ccccccceEEEEEEEeCCc Confidence 433 333443 554221 11122333331100 0001112233334444332 Q ss_pred eEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEee Q lcl|Aclame:pro 236 ESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKR 315 (708) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~~ 315 (708) . +.|.....+... .........+.|++.+|+++|. T Consensus 215 i----~~~~~~~~~~~~--------------------------------------~~~~~~~~~~~~~g~vPvv~~~--- 249 (511) T protein:vir:78 215 V----YRYLTNRTNGLK--------------------------------------LTPRENSFESHSFERMPITEFS--- 249 (511) T ss_pred E----EEEEecCCCccc--------------------------------------ccccccccccCcCcccceEEec--- Confidence 1 111111110000 0000123345666667776542 Q ss_pred eccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeecccc--ccccc Q lcl|Aclame:pro 316 WFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVR--DKSGN 393 (708) Q Consensus 316 ~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~ 393 (708) +...|.|.+..+++.++.+|...|.+.+.+...+.+.+++-.....+... ... ......+...... ...+. T Consensus 250 ----n~~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~-~~~--~~~~~~~~~~~~~~~~~~~~ 322 (511) T protein:vir:78 250 ----NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVE-VRK--QKEANVLFLEPTVYVDAEGR 322 (511) T ss_pred ----CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCCchh-hcc--cccccceeccccceeccccc Confidence 12346799999999999999999999999988777766553211111110 000 1111111111100 00111 Q ss_pred ccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHccc-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 394 IIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM-PSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAG 472 (708) Q Consensus 394 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~-~~n~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~~ 472 (708) .........+...+.-..++...+..+...|..+|++.+.+.+. .+|.||.|+...............+.|..+++++. T Consensus 323 ~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~ 402 (511) T protein:vir:78 323 ETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRA 402 (511) T ss_pred cCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11111122333322224566677888888999999988876665 46789999998887777777777788888888888 Q ss_pred HHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHHH Q lcl|Aclame:pro 473 EVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLS 552 (708) Q Consensus 473 ~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq 552 (708) ++++.++...-.... ...+ .+|.+.=.+..+....+..+.+..+.. T Consensus 403 ~li~~~~~~~~~~~~---------~~~~-------------------------~~i~~~f~~~~p~n~~e~~d~~~kl~G 448 (511) T protein:vir:78 403 KLLETILKNTRSIDA---------NKDF-------------------------NTVRYVYNRNLPKSLIEELKAYIDSGG 448 (511) T ss_pred HHHHHHHHhcCCCcc---------cccc-------------------------ccceEEeCCCCCcCHHHHHHHHHHHhc Confidence 887776543221000 0000 123333344445544555555555532 Q ss_pred hccccCchhHHHHHHHHhhccc-hhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHH---HHHHHHHHH Q lcl|Aclame:pro 553 SMLPTDPMRPAIQGIILDNIDG-EGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPE---MVLAQAQMV 628 (708) Q Consensus 553 ~~~~~~p~~~~~~~~~~~~~d~-~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~---~~~aq~~~~ 628 (708) .++ ..++++++.+ ...++..+++...... .....+.......... ....+.+.. T Consensus 449 ~iS---------~et~l~~l~~v~d~~~El~ri~~E~~~-------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 506 (511) T protein:vir:78 449 KIS---------QTTLMSLFSFFQDPELEVKKIEEDEKE-------------SIKKAQKGIYKDPRDINDDEQDDDTKDT 506 (511) T ss_pred cCC---------hHHHHHhCCCCCCHHHHHHHHHHHHHH-------------HHHHHhhccccCCCCCCCCCCCCCccCc Confidence 111 1223333322 2223333444322100 0000000000000000 000000000 Q ss_pred HHHHH Q lcl|Aclame:pro 629 AAQAE 633 (708) Q Consensus 629 ~~qae 633 (708) ..|.| T Consensus 507 ~~e~~ 511 (511) T protein:vir:78 507 VDKKE 511 (511) T ss_pred ccccC Confidence 00001 No 51 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=99.73 E-value=1.3e-15 Score=102.21 Aligned_cols=426 Identities=8% Similarity=0.027 Sum_probs=210.8 Q ss_pred chHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHHHHHhcC Q lcl|Aclame:pro 4 TLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEYRNN 83 (708) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~g~~~~n 83 (708) ...+.+.++...+.. ...+. ++-..||.|+| ....+..+.... ..-.+++|..+.+|+..+|+...+ T Consensus 1 l~~~~l~~~i~~~~~-------~~~r~--~~l~~yy~g~~--~il~~~~~~~~~--~~~ki~~n~~~~ivd~~~~~l~g~ 67 (429) T protein:vir:98 1 MTKDLLSELIQKHRS-------FNLSY--SAYKQLYEGDH--AILQQKQKEQYK--PDNRLVVNFAKYIVDTFNGYFIGV 67 (429) T ss_pred CCHHHHHHHHHHHHH-------HHHHH--HHHHHHhcccc--ccccccccccCC--CcceeecchHHHHHHHHhhhhccc Confidence 333445555444331 11112 12235899987 111111111110 111577899999999999999998 Q ss_pred cceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCcceeeEEeecc Q lcl|Aclame:pro 84 RITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAIEPIYDP 163 (708) Q Consensus 84 r~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~i~~v~~~ 163 (708) .+.+.+ . ++ .....+..+++.|+++.....+..+++++|.||..+..+. ++.+++..+ +| T Consensus 68 ~~~~~~--~---~~-----~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~---------~g~~~~~~~-~p 127 (429) T protein:vir:98 68 PVQTSH--E---NK-----QVSNYLELLDGYNDQDDNNAELSKICSIYGHGYELVFNDE---------NAEAGITYL-TP 127 (429) T ss_pred Cceeec--C---Ch-----HHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEecC---------CCcEEEEEE-cc Confidence 877653 1 11 1234566777789999999999999999999998875431 234444432 23 Q ss_pred hhhe--ecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeeeecceEEEEE Q lcl|Aclame:pro 164 SRSV--WFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVRKESVDVI 241 (708) Q Consensus 164 ~~~v--~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~~~~~~~~ 241 (708) ..+ +||.... .....+++.+.. .+.....+||..++.. T Consensus 128 -~~~~~v~dd~~~-----~~~~~~i~~~~~------------------------------~~~~~~~~~~~~~~~~---- 167 (429) T protein:vir:98 128 -LEAFIVYDDSIR-----QKPLFAVRYFYN------------------------------KGGVLEGSYSDASNIT---- 167 (429) T ss_pred -cceEEEEeCCCC-----CceEEEEEEEEe------------------------------cCceEEEEEEeCceEE---- Confidence 233 2332111 001122222210 1112223333322111 Q ss_pred EEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEeeeccCCc Q lcl|Aclame:pro 242 SYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRWFIDDI 321 (708) Q Consensus 242 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~~~~~d~~ 321 (708) .|.+. .+...+.+..+.|++.+|+|+|. +. T Consensus 168 ~~~~~-------------------------------------------~~~~~~~~~~~~~~g~vPvv~~~-------n~ 197 (429) T protein:vir:98 168 YFKDG-------------------------------------------EKGIEIGESEPHPFDGVPMIEYV-------EN 197 (429) T ss_pred EEEec-------------------------------------------CCceEecccccccCCccceEEec-------CC Confidence 01100 01112223345666677776643 12 Q ss_pred ccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeeccccccccccccccccc Q lcl|Aclame:pro 322 ERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKSGNIIAGATPA 401 (708) Q Consensus 322 ~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 401 (708) ..|.|.+..+++.++.+|+..|.+.+.+...+.+.+++- |.- +..+..... .....+..... .|. . ... T Consensus 198 ~~g~sd~e~v~~liD~~d~~~s~~~~~~~~~~~p~~~i~-g~~-~~~~~~~~~--~~~~~~~~~~~---~~~-~---~~~ 266 (429) T protein:vir:98 198 EERQSLLASVVTLINAFNKAISEKANDVEYFADAYLKIL-GAE-LDDETLKSL--RDTRIINLKDT---DAQ-Q---LTV 266 (429) T ss_pred CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeee-cCC-CCcchhhhH--hhCceeeccCC---CCC-C---cce Confidence 346789999999999999999999999998888776653 321 111111111 11222222111 111 0 012 Q ss_pred ccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 402 GYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAGEVWLSMARE 481 (708) Q Consensus 402 ~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~~~~~l~li~~ 481 (708) .+...+.-...+...+......|...|++.+.+.+..+|.||.|+...............+.|..+++++.++++.++.. T Consensus 267 ~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~ 346 (429) T protein:vir:98 267 EFLQKPDADATQEHLLDRLENLIFRTAMVANISDESFGTASGIALRYRLQAMDNLAKTKERKFMSGMNRRYKLIASYPTS 346 (429) T ss_pred eEEeecCCHHHHHHHHHHHHHHHHHHhCccccCccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 22333332355666788889999999998877666667889999988777777777777777778887777777665321 Q ss_pred hcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHHHhccccCchh Q lcl|Aclame:pro 482 VYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLPTDPMR 561 (708) Q Consensus 482 ~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq~~~~~~p~~ 561 (708) .+. ...+ .+|.|.=.+..+....+..+.++.+... .+ T Consensus 347 ----------~~~--~~d~-------------------------~~i~v~f~~~~p~~~~~~a~~~~kl~g~----is-- 383 (429) T protein:vir:98 347 ----------KIG--PKDW-------------------------IGIKYKFTRNLPANLLEESQIAGNLAGI----VS-- 383 (429) T ss_pred ----------CCC--cccc-------------------------ccceEEeCCCCCcCHHHHHHHHHHHhcc----Cc-- Confidence 111 0000 1233333445554444555555554211 11 Q ss_pred HHHHHHHHhhccc-hhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 562 PAIQGIILDNIDG-EGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQKATNE 640 (708) Q Consensus 562 ~~~~~~~~~~~d~-~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qae~~k~~~~ 640 (708) ..++++++.+ +..+.-.++++.. +. + .... +.. .. ..+....+ T Consensus 384 ---~et~~~~l~~v~d~~~E~~ri~~E---------------------~~-~----~~~~---~~~--~~--~~~~~~~~ 427 (429) T protein:vir:98 384 ---EETQVGVLSIVENPQKEIERKNSD---------------------KS-T----LISR---QAG--GL--NGQNTTTI 427 (429) T ss_pred ---hHHHHHhCCCCCCHHHHHHHHHHH---------------------HH-H----HHHH---HHh--hh--cCCCCCCC Confidence 1223333322 1111111222110 00 0 0000 000 00 00000000 Q ss_pred HH Q lcl|Aclame:pro 641 TA 642 (708) Q Consensus 641 ~~ 642 (708) .. T Consensus 428 ~~ 429 (429) T protein:vir:98 428 LE 429 (429) T ss_pred CC Confidence 00 No 52 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=99.73 E-value=1.1e-15 Score=102.54 Aligned_cols=437 Identities=10% Similarity=0.028 Sum_probs=213.1 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCC--ceeecchHHHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYP--KFEINKVATELNRIIA 78 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp--~~~~N~i~~~i~~i~g 78 (708) +- +.+++..+.+..| .+.+.....+.. +-..||.|+| .+.. ......++| .+..|..+.+|+..+| T Consensus 11 ~~-~~~~~~~~~i~~~---i~~~~~~~~r~~--~~~~Yy~g~~---~i~~---~~~~~~~~~~~ki~~n~~~~ivd~~~~ 78 (452) T protein:vir:36 11 FS-KDEPITVEVVTKF---MEKHKLEVARYE--YLKNMYLGIM---AIDD---EPAKDSWKPDNRLAVNFTKYIVDTFTG 78 (452) T ss_pred cC-CccCCCHHHHHHH---HHHHHHHHHHHH--HHHHHhcccc---cccc---CccccccCccceeecchHHHHHHHHhh Confidence 11 1111211221112 222222222222 2245899976 1111 111122333 3677999999999999 Q ss_pred HHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCcceeeE Q lcl|Aclame:pro 79 EYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAIE 158 (708) Q Consensus 79 ~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~i~ 158 (708) +...+.+.+.+ .+ .+ ..+.+..++..|+++.....+..+++++|.||+.+..+. ++.+++. T Consensus 79 ~l~g~~~~~~~--~d----~~----~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~---------~g~~~i~ 139 (452) T protein:vir:36 79 YFNGIPVKKSH--SD----KE----ILTKLQEFDNLNDMEDEESELAKMACIYGRAFEFLYQDE---------DTQTNVV 139 (452) T ss_pred hhcccCceeec--CC----hh----HHHHHHHHHhhcChhHHHHHHHHHHHhcCeEEEEEEecC---------CCeeEEE Confidence 99998877663 21 11 234577777889999999999999999999998876431 2334444 Q ss_pred Eeecchhhe--ecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeeeecce Q lcl|Aclame:pro 159 PIYDPSRSV--WFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVRKE 236 (708) Q Consensus 159 ~v~~~~~~v--~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~~~ 236 (708) .+ ++.++ +||+... +-...+++.|.+. +.....++|...+. T Consensus 140 ~~--~p~~~~~v~d~~~~-----~~~~~~i~~~~~~------------------------------~~~~~~~vyt~~~i 182 (452) T protein:vir:36 140 YN--SPENMFMVYDDTVK-----QEPLFAVRYGVDE------------------------------DKKLQGEVYTLLET 182 (452) T ss_pred EE--cccceEEEEcCCCC-----CceEEEEEEEEec------------------------------CceEEEEEEecCeE Confidence 33 22333 2443211 0011112222100 00111122221111 Q ss_pred EEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEeee Q lcl|Aclame:pro 237 SVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRW 316 (708) Q Consensus 237 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~~~ 316 (708) +.|. .-.+...+.+..+.+++.+|+++|.- T Consensus 183 ----~~~~-------------------------------------------~~~~~~~~~~~~~~~~g~iPvv~~~n--- 212 (452) T protein:vir:36 183 ----IKIS-------------------------------------------GENDEISFGEGTYNPYPDLPVVEFYF--- 212 (452) T ss_pred ----EEEE-------------------------------------------EcCCceEEecceeccCCcccEEEecC--- Confidence 0000 01111123334555666667665432 Q ss_pred ccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeecccccccccccc Q lcl|Aclame:pro 317 FIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKSGNIIA 396 (708) Q Consensus 317 ~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 396 (708) ...+.|.+..+++.++.+|+.+|.+...+...+.+.+++--...++ +... . ......+.+.......+ T Consensus 213 ----~~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~p~~~~~g~~~~~-~~~~-~--~~~~~~~~~~~~~~~~~---- 280 (452) T protein:vir:36 213 ----NEERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLGAAVEE-EDLK-N--IRSNRVINYYADGEGKN---- 280 (452) T ss_pred ----CCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCcCc-hhhh-h--hhhcceEEecCCCCccC---- Confidence 2236688999999999999999999999988888877664222221 1111 1 11122222222111111 Q ss_pred cccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 397 GATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAGEVWL 476 (708) Q Consensus 397 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~~~~~l 476 (708) ..+.+...+.-...+...+..+...|...|++.+.+.+..+|.||.|+..+-...........+.|..+++++.++++ T Consensus 281 --~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~ 358 (452) T protein:vir:36 281 --VDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVANISDESFGSSSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLFC 358 (452) T ss_pred --CcceeEeecCCHHHHHHHHHHHHHHHHHHhCccccCcccccCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 122233333334666777888889999999998777777678999999988877777777788888888888888887 Q ss_pred HHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHHHhccc Q lcl|Aclame:pro 477 SMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLP 556 (708) Q Consensus 477 ~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq~~~~ 556 (708) .+.... |.. ..+ .||.|.=.+..+....+..+.++.+...+ T Consensus 359 ~~~~~~----------~~~--~~~-------------------------~~i~i~f~~~~p~d~~~~a~~~~k~~g~i-- 399 (452) T protein:vir:36 359 ELSTNV----------SNK--DSW-------------------------KDIEYTFTRNEPKDIKEQAETANILMGIT-- 399 (452) T ss_pred HHHhcc----------CCc--ccc-------------------------ccceEEeCCCCCcCHHHHHHHHHHHhccC-- Confidence 765421 110 000 12333333444444444555555542111 Q ss_pred cCchhHHHHHHHHhhccc-hhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 557 TDPMRPAIQGIILDNIDG-EGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQ 635 (708) Q Consensus 557 ~~p~~~~~~~~~~~~~d~-~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qae~~ 635 (708) ...++++++.+ ...++-.+++.+.... .. +. ... .+.... T Consensus 400 -------S~et~~~~~~~~~d~~~E~~ri~~E~~~--------------------~~--~~-----~~~-----~~~~~~ 440 (452) T protein:vir:36 400 -------SQETALSVISVIPDVQAEMEKIKKEEAS--------------------TA--IF-----DKD-----KQPSEK 440 (452) T ss_pred -------ChHHHHHhCCCCCCHHHHHHHHHHHHHH--------------------HH--HH-----Hhh-----ccCCCC Confidence 11233333322 1122222222211000 00 00 000 000000 Q ss_pred HHHHHHHHHHHH Q lcl|Aclame:pro 636 KATNETAQTQIK 647 (708) Q Consensus 636 k~~~~~~~~q~e 647 (708) ..+.+.-...-| T Consensus 441 ~~~~~~~~~~~e 452 (452) T protein:vir:36 441 GTDTVVSETNEE 452 (452) T ss_pred cccccCccccCC Confidence 000000000000 No 53 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=99.72 E-value=1.1e-15 Score=102.55 Aligned_cols=472 Identities=11% Similarity=0.029 Sum_probs=220.6 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEY 80 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~g~~ 80 (708) |. ..+.......+.+....+.+...+.... ++-..||.|+|.--......+.. ....-.++.|..+.+++..+|+. T Consensus 31 ~~-~~e~~~~~~~~~i~~~i~~~~~~~~~r~-~~l~~Yy~g~~~il~~~~~~~~~--~~~~~ki~~n~~k~Iv~~~~~yl 106 (511) T protein:vir:93 31 YD-GTESDLLQNVNEVSKYIEHHMDYQRPRL-KVLSDYYEGKTKNLVELTRRKEE--YMADNRVAHDYASYISDFINGYF 106 (511) T ss_pred cc-chhhhhhccHHHHHHHHHHHHHhhHHHH-HHHHHHhcccCccccccCcCccc--ccCcceeecchHHHHHHHHhhhh Confidence 32 1111111111122222222222111111 22246899987421111111110 01112477899999999999999 Q ss_pred hcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCcceeeEEe Q lcl|Aclame:pro 81 RNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAIEPI 160 (708) Q Consensus 81 ~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~i~~v 160 (708) ..+.+.+.+ +|.+ ..+.+..+++.|+++.....+..+++++|.+|..+..+. .+.+++..+ T Consensus 107 ~g~p~~~~~------~d~~----~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~ay~~vy~de---------~~~~~i~~~ 167 (511) T protein:vir:93 107 LGNPIQYQD------DDKD----VLEVIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQ---------DDETRLYKS 167 (511) T ss_pred cccCeeecc------CChH----HHHHHHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEeCC---------CCceEEEEE Confidence 998888753 1222 245677777889999999999999999999998886431 234454433 Q ss_pred ecchhhee--cCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeeeecceEE Q lcl|Aclame:pro 161 YDPSRSVW--FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVRKESV 238 (708) Q Consensus 161 ~~~~~~v~--~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~~~~~ 238 (708) ++.++| ||.... .-...+++.|.... . ...+.+.+...+.|...... T Consensus 168 --~p~~~~~vydd~~~-----~~~~~~vr~~~~~~-------------~----------~~~~~~~~~~~~iyt~~~i~- 216 (511) T protein:vir:93 168 --DAMSTFVIYDNTIE-----RNSIAGVRYLRTKP-------------I----------DKTDEDEVFTVDLFTSHGVY- 216 (511) T ss_pred --ccceeEEEEcCCCC-----CceEEEEEEEEeee-------------c----------cccccceEEEEEEEeCCcEE- Confidence 233443 554321 11223333332100 0 00011222333444433211 Q ss_pred EEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEeeecc Q lcl|Aclame:pro 239 DVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRWFI 318 (708) Q Consensus 239 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~~~~~ 318 (708) .|..... .. .........+.+.|++.+|+|+|. T Consensus 217 ---~~~~~~~-~~-------------------------------------~~~~~~~~~~~~~~~g~vPvv~~~------ 249 (511) T protein:vir:93 217 ---RYLTSRT-NG-------------------------------------LKLTPRENGFESHSFERMPITEFS------ 249 (511) T ss_pred ---EEEecCC-Cc-------------------------------------cccccccccccccCCCccceEEec------ Confidence 1111000 00 000001112344555666666542 Q ss_pred CCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeecccccc--cccccc Q lcl|Aclame:pro 319 DDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDK--SGNIIA 396 (708) Q Consensus 319 d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~ 396 (708) +...+.|.+..+++.++.+|...|.+.+.+...+.+.+++.-....+... ... ............... .+.... T Consensus 250 -nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~-~~~--~~~~~~~~~~~~~~~~~~~~~~~ 325 (511) T protein:vir:93 250 -NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVE-VRK--QKEANVLFLEPTVYADSEGRETE 325 (511) T ss_pred -CCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCcccCchh-hcc--cccccceecccccccccccccCC Confidence 12346799999999999999999999999988887766553211111111 000 111111111111100 000011 Q ss_pred cccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHccc-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 397 GATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM-PSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAGEVW 475 (708) Q Consensus 397 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~-~~n~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~~~~~ 475 (708) ......+...+.-..++...+..+...|..+|++.+.+.+. .+|.||.|+..............-+.|..+++++.+++ T Consensus 326 ~~~~~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li 405 (511) T protein:vir:93 326 GSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLL 405 (511) T ss_pred CCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11122333333234666778888899999999988766654 46789999998888777777777888888888888888 Q ss_pred HHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHHHhcc Q lcl|Aclame:pro 476 LSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSML 555 (708) Q Consensus 476 l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq~~~ 555 (708) +.++........ ...+ -+|.+.=.+..+....+..+.+..+...+ T Consensus 406 ~~~l~~~~~~~~---------~~d~-------------------------~~i~~~f~~~~p~n~~e~~~~~~kl~g~i- 450 (511) T protein:vir:93 406 ETILKNTWSIDA---------NKDF-------------------------NTVRYVYNRNLPKSLIEELKAYIDSGGKI- 450 (511) T ss_pred HHHHHhccCccc---------cccc-------------------------ccceEEeCCCCCCCHHHHHHHHHHHhccC- Confidence 876543322110 0000 02223334455554555555565552211 Q ss_pred ccCchhHHHHHHHHhhccc-hhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHH Q lcl|Aclame:pro 556 PTDPMRPAIQGIILDNIDG-EGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVL---AQAQMVAAQ 631 (708) Q Consensus 556 ~~~p~~~~~~~~~~~~~d~-~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~---aq~~~~~~q 631 (708) + ..++++++.+ ...++-.+++...... ...+.+............. -+.+....| T Consensus 451 ---S-----~et~~~~l~~v~d~~~E~~ri~~E~~~-------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 509 (511) T protein:vir:93 451 ---S-----QTTLMSLFSFFQDPELEVKKIEEDEKE-------------SIKKAQKGIYKDPRDINDDEQDDDTKDTVDK 509 (511) T ss_pred ---c-----hHHHHHhCCCCCCHHHHHHHHHHHHHH-------------HHHHHhhhcccCCCCCCCCCCCCcccccccc Confidence 1 1223333322 1222233333321100 0000000000000000000 000000000 Q ss_pred HH Q lcl|Aclame:pro 632 AE 633 (708) Q Consensus 632 ae 633 (708) .| T Consensus 510 ~~ 511 (511) T protein:vir:93 510 KE 511 (511) T ss_pred cC Confidence 00 No 54 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=99.72 E-value=5.4e-16 Score=104.30 Aligned_cols=456 Identities=10% Similarity=0.038 Sum_probs=216.8 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhh----------hhcCCC--ceeecc Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDE----------QFEKYP--KFEINK 68 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~----------q~~grp--~~~~N~ 68 (708) |- .+ .+.+++..+.. .....+.+..+. ..||.|.| +-..+....... ...++| .+..|. T Consensus 1 ~~--~e-~~~~~i~~~~~---~~~~~~~~~~~~--~~Yy~g~h-di~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~ 71 (471) T protein:vir:10 1 ME--IE-VIKKIISSQMV---KHGKFVSQAAEA--EKYYRNEN-DIKRKRKPADKKGAENEAKAEDNAFRNADNRISHNW 71 (471) T ss_pred CC--HH-HHHHHHHHHHH---HHHHHHHHHHHH--HHHhcccc-ccccccchhhhhcccccccccccccccccceeccch Confidence 32 22 33343333332 222223333222 45788875 111111000000 000111 377899 Q ss_pred hHHHHHHHHHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCC Q lcl|Aclame:pro 69 VATELNRIIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDP 148 (708) Q Consensus 69 i~~~i~~i~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~ 148 (708) .+.+|+..+|+...+.+.+.+ . +.+..+ .+..+.+ |+++.....+..++.++|.||..+..+.. T Consensus 72 ~~~Ivd~~~~yl~G~p~~~~~--~----~~~~~~----~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~----- 135 (471) T protein:vir:10 72 HQLLLDQKKAYALTYPPTFDV--D----DKKVND----MIVDVLG-DDYERISKQLCVNAGNAGIAWLHVWKDAS----- 135 (471) T ss_pred hHHHHHhhhhhhcccCceecc--C----ChHHHH----HHHHHHh-cCHHHHHHHHHHHHhhCCeEEEEEEeeCC----- Confidence 999999999999988877643 1 223333 3444443 78999999999999999999988865421 Q ss_pred CCCCcceeeEEeecchhhe--ecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeE Q lcl|Aclame:pro 149 MDDRQRIAIEPIYDPSRSV--WFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIY 226 (708) Q Consensus 149 ~~~~~~i~i~~v~~~~~~v--~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~ 226 (708) .+.+++..+ ++..+ +||+... +-...+++.|...+. .+.+... T Consensus 136 ---~g~~~~~~~--~p~~~~~i~d~~~~-----~~~~~~ir~~~~~~~-------------------------~~~~~~~ 180 (471) T protein:vir:10 136 ---DNSFRYACV--DSKEVIPIYSKSLD-----KKSIGVLRVYSSIDE-------------------------TDGKNYT 180 (471) T ss_pred ---CCeeEEEEE--cccceEEEEcCCCC-----CceEEEEEEEEeecc-------------------------CCCceeE Confidence 234555544 23344 3443221 112233333322111 0112223 Q ss_pred EeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCc Q lcl|Aclame:pro 227 IAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHI 306 (708) Q Consensus 227 v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~ 306 (708) ..++|...... .|.....+.......... ........|.....+..+.+++.+ T Consensus 181 ~~~vy~~~~~~----~y~~~~~~~~~~~~~~~~-----------------------~~~~~~~~~~~~~~~~~~~~~g~i 233 (471) T protein:vir:10 181 VYEYWNDKECS----FYRHEKEKPLEELETFQA-----------------------ISLIDTMNGDRSSDNSFKHDFGLV 233 (471) T ss_pred EEEEEeCCcEE----EEEecCCccccccccccc-----------------------ccccccccccccccccccCCCCce Confidence 34444433221 111111111100000000 000001223333334445555666 Q ss_pred ceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeecc Q lcl|Aclame:pro 307 PLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLRE 386 (708) Q Consensus 307 p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~ 386 (708) |+|+|.- ...+.|.+..+++.++.+|...|.+.+.+...+++.+++.-.......+.... ....+.+.... T Consensus 234 Pvv~~~n-------~~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~--~~~~~~i~~~~ 304 (471) T protein:vir:10 234 PFIPFKN-------NEIETNDLKPIKDLVDVYDKVFSGFVNDTDDVQEVIFVLTNYGGQDKQEFLED--LKRYKMIKMDN 304 (471) T ss_pred eEEEecc-------CCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccchhHHH--hhcCCeEEecC Confidence 6665421 23467889999999999999999999999988887666532111212221111 11222222211 Q ss_pred cccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 387 VRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMASFIYLDNMAK 466 (708) Q Consensus 387 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n~sg~ai~~~q~q~~~~~~~~~dn~~~ 466 (708) .+... .....+...+.-..++...+..+...|-..|+..+...+..+|.||.|+..+..............|.. T Consensus 305 ~~~~~------~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~tp~~~~~~~gn~Sg~Alk~~~~~l~~k~~~~~~~~~~ 378 (471) T protein:vir:10 305 DGMGD------QSGVTTIAIDIPTEARNLILERTKKQIFISGQGVNPETDKLGNSSGVALKFLYSLLELKAGNMETQFRS 378 (471) T ss_pred CCCcc------CccceEEeecCChHHHHHHHHHHHHHHHHHhCCcCCCcccccCccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11111 112333333333466777888889999999988776666667899999988887777777777777778 Q ss_pred HHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHH Q lcl|Aclame:pro 467 SLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSV 546 (708) Q Consensus 467 ~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~ 546 (708) +++++.++++.++.. .+ + .||.|.-.+..+..-.+..+. T Consensus 379 ~l~~~~~li~~~~~~----------~d------~-------------------------~~i~i~f~~~~p~n~~e~~~~ 417 (471) T protein:vir:10 379 GYATLVKMILKHLGL----------SD------K-------------------------LKIKQTWTRNSINNDTEMAQV 417 (471) T ss_pred HHHHHHHHHHHHhcc----------CC------C-------------------------ceeEEEeCCCCCCCHHHHHHH Confidence 887777777665421 10 0 122233334444444445555 Q ss_pred HHHHHHhccccCchhHHHHHHHHhhccc-hhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 547 LTNVLSSMLPTDPMRPAIQGIILDNIDG-EGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQA 625 (708) Q Consensus 547 l~~llq~~~~~~p~~~~~~~~~~~~~d~-~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~ 625 (708) ++.+... ....++++++.+ ...+.-.+++........... +.. T Consensus 418 ~~kl~g~---------iS~et~~~~~p~v~D~~~E~eri~~E~~~~~~~~--------------------~~~------- 461 (471) T protein:vir:10 418 VSTLATI---------TSRENVAKSNPIVEDWQDELRLQKAEQEGRSEKL--------------------YDM------- 461 (471) T ss_pred HHHHhcc---------CchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhcc--------------------ccc------- Confidence 5543111 111233333322 112222233322110000000 000 Q ss_pred HHHHHHHHHH Q lcl|Aclame:pro 626 QMVAAQAEAQ 635 (708) Q Consensus 626 ~~~~~qae~~ 635 (708) .....+.|.+ T Consensus 462 ~~~~~~~e~~ 471 (471) T protein:vir:10 462 EEVEHESEVE 471 (471) T ss_pred CCCCCccccC Confidence 0000000000 No 55 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=99.72 E-value=5.1e-16 Score=104.44 Aligned_cols=470 Identities=12% Similarity=0.045 Sum_probs=221.2 Q ss_pred CC--cchH-HHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCC--ceeecchHHHHHH Q lcl|Aclame:pro 1 MA--ETLE-KKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYP--KFEINKVATELNR 75 (708) Q Consensus 1 ma--~~~~-~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp--~~~~N~i~~~i~~ 75 (708) |. ++.. ....++...+........+ +.. +-..||.|+|.--......+ .+.+| .++.|..+.+|+. T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~~~---r~~--~l~~Yy~g~~~i~~~~~~~~----~~~~~~~ki~~n~~k~Iv~~ 101 (511) T protein:vir:99 31 YDGTESDLLQNVNEVSKYIEHHMDYQRP---RLK--VLSDYYEGKTKNLVELTRRK----EEYMADNRVAHDYASYISDF 101 (511) T ss_pred cchhhhhhhccHHHHHHHHHHHHHhhHH---HHH--HHHHHhcccCccccccCccc----ccccCcceeecchHHHHHHH Confidence 32 1111 0122232222222222222 221 22458999875321111111 11222 3778999999999 Q ss_pred HHHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCcce Q lcl|Aclame:pro 76 IIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRI 155 (708) Q Consensus 76 i~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i 155 (708) .+|+...+.+.+.+ . |.+ +.+.+..+++.|+++.....+..++++.|.+|..+..+. .+.+ T Consensus 102 ~~~yl~g~p~~~~~-----~-d~~----~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~de---------d~~~ 162 (511) T protein:vir:99 102 INGYFLGNPIQYQD-----D-DKD----VLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQ---------DDET 162 (511) T ss_pred HHhhhcccCceeec-----C-chH----HHHHHHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEeCC---------CCce Confidence 99999999888763 1 222 245677777889999999999999999999998876531 2344 Q ss_pred eeEEeecchhhee--cCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeeee Q lcl|Aclame:pro 156 AIEPIYDPSRSVW--FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEV 233 (708) Q Consensus 156 ~i~~v~~~~~~v~--~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~ 233 (708) ++..+ ++..+| ||+... .-...+++.|... . ....+.+.+...+.|.. T Consensus 163 ~i~~~--~p~~~~~vyd~~~~-----~~~~~~vr~~~~~----------~-------------~~~~~~~~~~~~~vyt~ 212 (511) T protein:vir:99 163 RLYKS--DAMSTFVIYDNTIE-----RNSIAGVRYLRTK----------P-------------IDKTDEDEVFTVDLFTS 212 (511) T ss_pred EEEEE--ccceeEEEEcCCCC-----CceEEEEEEEEee----------e-------------cccCccceEEEEEEEeC Confidence 55433 334443 554321 1122333333110 0 00001122333344443 Q ss_pred cceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEE Q lcl|Aclame:pro 234 RKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYG 313 (708) Q Consensus 234 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~ 313 (708) ... +.|....++.. ..........+.|++.+|+|+|.- T Consensus 213 ~~i----~~~~~~~~~~~--------------------------------------~~~~~~~~~~~~~~g~vPvv~~~n 250 (511) T protein:vir:99 213 HGV----YRYLTSRTNGL--------------------------------------KLTPRENGFESHSFERMPITEFSN 250 (511) T ss_pred CcE----EEEEecCCccc--------------------------------------cccccccccccCCCCccceEEecC Confidence 321 11111111000 000011233455666777766431 Q ss_pred eeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeecccc--ccc Q lcl|Aclame:pro 314 KRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVR--DKS 391 (708) Q Consensus 314 ~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~--~~~ 391 (708) ...+.|.+..+++.++.+|...|.+.+.+...+.+.+++-.....+.... .. ............. ... T Consensus 251 -------n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~-~~--~~~~~~~~~~~~~~~~~~ 320 (511) T protein:vir:99 251 -------NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEV-RK--QKEANVLFLEPTVYADSE 320 (511) T ss_pred -------CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhchhhhhccCcccCchhh-cc--cccccceecccccccccc Confidence 23467999999999999999999999999877776555432111111100 00 0111111111110 001 Q ss_pred ccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHccc-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 392 GNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM-PSNIAQETVNNLMNRADMASFIYLDNMAKSLKR 470 (708) Q Consensus 392 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~-~~n~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~ 470 (708) +....+.....+...+.-..++...+..+.+.|..+|++.+.+.+. .+|.||.|+..+............+.|..++++ T Consensus 321 ~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~gn~Sg~Alk~~~~~l~~ka~~k~~~~~~~l~~ 400 (511) T protein:vir:99 321 GRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRR 400 (511) T ss_pred cccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1111111122333332224566677888889999999988766654 467899999988887777778888888889999 Q ss_pred HHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHH Q lcl|Aclame:pro 471 AGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNV 550 (708) Q Consensus 471 ~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~l 550 (708) +.++++.++...-...- ...+ .+|.+.=.+..+....+..+.++.+ T Consensus 401 ~~~li~~~~~~~~~~~~---------~~~~-------------------------~~i~i~f~~~~p~n~~e~~~~~~kl 446 (511) T protein:vir:99 401 RAKLLETILKNTRSIDV---------SKDF-------------------------NTVRYVYNRNLPKSLIEELKAYIDS 446 (511) T ss_pred HHHHHHHHHHhcCCccc---------cccc-------------------------ccceEEeCCCCCcCHHHHHHHHHHH Confidence 88888887654221000 0000 1222333344454444555555554 Q ss_pred HHhccccCchhHHHHHHHHhhccc-hhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 551 LSSMLPTDPMRPAIQGIILDNIDG-EGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVA 629 (708) Q Consensus 551 lq~~~~~~p~~~~~~~~~~~~~d~-~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~ 629 (708) ...+ ...++++++.+ ...++-.+++...... ...+.+.... .+.....-....-. T Consensus 447 ~Gii---------S~et~l~~l~~v~D~~~E~~ri~~E~~~-------------~~~~~~~~~~--~~~~~~~~~~~~~~ 502 (511) T protein:vir:99 447 GGKI---------SQTTLMSLFSFFQDPELEVKKIEEDEKE-------------SIKKAQKNMY--QDPRNINDDEQDDS 502 (511) T ss_pred hccC---------CHHHHHHhCCCCCCHHHHHHHHHHHHHH-------------HHHHHhhccc--ccCCCCCCCCCCCC Confidence 2111 11223333322 2233333444322100 0000000000 00000000000000 Q ss_pred HHHHHHHHH Q lcl|Aclame:pro 630 AQAEAQKAT 638 (708) Q Consensus 630 ~qae~~k~~ 638 (708) -+.+..+.+ T Consensus 503 ~~~~~d~~e 511 (511) T protein:vir:99 503 TKDSIDKKE 511 (511) T ss_pred CcCcccccC Confidence 000000000 No 56 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=99.72 E-value=2.2e-16 Score=106.40 Aligned_cols=432 Identities=11% Similarity=0.035 Sum_probs=208.9 Q ss_pred HHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCC--ceeecchHHHHHHHHHHHhcCcceeEE Q lcl|Aclame:pro 12 IMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYP--KFEINKVATELNRIIAEYRNNRITVKF 89 (708) Q Consensus 12 ~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp--~~~~N~i~~~i~~i~g~~~~nr~~~~v 89 (708) ++..|. +.. +.+. ++-..||.|+|..-......+ ..++| .+..|..+.+|+..+|+...+.+.+.+ T Consensus 1 ~~~~~~---~~~---~~r~--~~l~~yy~g~~~~~~~~~~~~----~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~ 68 (440) T protein:vir:95 1 MLAAFL---GSQ---KQRL--AILASYAQGDNFSILSGHRRL----DDEKADYRVRHKWGGYISSFATGYVIGNPVSIGV 68 (440) T ss_pred ChhhHH---HHH---HHHH--HHHHHHhccCCcccccccccc----cccCCcceeecchHHHHHHhhhhheeccCceEee Confidence 111111 111 1222 122357899886322111111 11222 467899999999999999999988765 Q ss_pred ecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCcceeeEEeecchhhee- Q lcl|Aclame:pro 90 RPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAIEPIYDPSRSVW- 168 (708) Q Consensus 90 ~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~i~~v~~~~~~v~- 168 (708) .. .++.+..+ .+..++..|+++.....+..+++++|.+|..+..+. ++.+++..+ ++.+++ T Consensus 69 ~~---~~~~~~~~----~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~---------~~~~~i~~~--~p~~~~~ 130 (440) T protein:vir:95 69 ME---GGSADQLS----TIKDIEWQNDINALNSDLAFDASVYGRAYEYHFRDK---------DKVDRVVLI--SPLEMFV 130 (440) T ss_pred CC---CccHHHHH----HHHHHHHhcCHhHHHHHHHHHHhhcCeEEEEEEecC---------CCceEEEEE--cccceEE Confidence 32 22333222 355667789999999999999999999998886431 123444433 233443 Q ss_pred -cCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeeeecceEEEEEEEecCc Q lcl|Aclame:pro 169 -FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVRKESVDVISYRHPI 247 (708) Q Consensus 169 -~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~~~~~~~~~~~~~~ 247 (708) ||+.... . ...+++.|...+ . ...++|+..+... +.+.... T Consensus 131 ~~d~~~~~----~-~~~~i~~~~~~~------------------------------~-~~~~vyt~~~~~~--~~~~~~~ 172 (440) T protein:vir:95 131 IRDLTVEQ----N-IIAAVHLPIYAD------------------------------K-VNMTVYTKDKVIT--YKPYSNN 172 (440) T ss_pred EEcCCCCC----c-eEEEEEEEEecC------------------------------c-eEEEEEeCCeEEE--EEEecCC Confidence 5553321 1 112222221000 0 0112332221110 0000000 Q ss_pred cCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEeeeccCCcccccch Q lcl|Aclame:pro 248 TGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRWFIDDIERVEGH 327 (708) Q Consensus 248 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~~~~~d~~~~~~G~ 327 (708) .+...+.++.+.|++.+|+|+|.- ...|.|. T Consensus 173 ------------------------------------------~~~~~~~~~~~~~~g~vPvv~~~n-------~~~g~sd 203 (440) T protein:vir:95 173 ------------------------------------------SVRLVVDDVKKHSYNDVPVVEWWN-------NRFRMGD 203 (440) T ss_pred ------------------------------------------ccceeecceeeccCceeeEEEeeC-------CCCCCCc Confidence 001122233445556666665431 2246789 Q ss_pred HHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccc---hHHHHHhhcccCCceeeecccccccccccccccccccc Q lcl|Aclame:pro 328 IAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRG---LEKHWEARNKKRPAFLPLREVRDKSGNIIAGATPAGYT 404 (708) Q Consensus 328 vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 404 (708) +..+++.++.+|..+|.+...+...+.+.+++- |...+ ..+..... .+.+.+.........+. .. .....+. T Consensus 204 ~e~v~~lida~~~~~s~~~~~~~~~~~~~~v~~-g~~~~~~~~~e~~~~~--~~~~~~~~~~~~~~~~~-~~-~~~~~~l 278 (440) T protein:vir:95 204 YESEISLIDAYDAGQSDTANYMSDLNDAMLLVK-GDLDGIKLSPEDAAKM--KDANMLFLKTGISTTGQ-QT-TADASYI 278 (440) T ss_pred hhhhHHHHHHHHHHHHHHHHHHHHhhcceeeee-cccccCCCCccchhhh--hhccceecccccccccC-CC-CcceeEE Confidence 999999999999999999999988887766542 21100 00100000 11111111111111000 00 0112233 Q ss_pred cCccchHHHHHHHHHHHHHHHHHhCCChhHccc-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|Aclame:pro 405 QPAVMNQALAALLQQTSADIQEVTGGSQAMQQM-PSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAGEVWLSMAREVY 483 (708) Q Consensus 405 ~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~-~~n~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~~~~~l~li~~~y 483 (708) ..+.-..++...++.....|...|++.+.+.+. .+|.||.|+.................|..+++++.+++..++.... T Consensus 279 t~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~ 358 (440) T protein:vir:95 279 YKQYDVNGTEAYKNRLANDIHRFSRIPNLDDDRFNSTSSGIALLYKMIGLEQVRKDKETYFTKALRRRYELISNIHKAIN 358 (440) T ss_pred eecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 333234666778888999999999998866665 4678999999887777777777788888888888887776654322 Q ss_pred CCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHHHhccccCchhHH Q lcl|Aclame:pro 484 GSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLPTDPMRPA 563 (708) Q Consensus 484 ~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq~~~~~~p~~~~ 563 (708) . . .++ ..+|.|.=.+..+..-.+..+.+..+.. + .+ T Consensus 359 ~---------~------------~~~---------------~~~v~i~f~~~~p~~~~~~ad~~~kl~g-~---iS---- 394 (440) T protein:vir:95 359 G---------P------------VIE---------------ANKLTFTFHPNIPQDVWTEIKAYIEAGG-E---IS---- 394 (440) T ss_pred C---------c------------ccc---------------cccceEEeCCCCCCCHHHHHHHHHHHhc-c---Cc---- Confidence 1 0 000 1233344444555444455555555421 1 11 Q ss_pred HHHHHHhhccchhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 564 IQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAE 633 (708) Q Consensus 564 ~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qae 633 (708) ...+++++.+-..+.-.+++. .++. ....+..+......-....+| T Consensus 395 -~et~~~~l~~~d~~~E~~ri~---------------------~E~~--~~~~~~~~~~~~~~~~~~~~e 440 (440) T protein:vir:95 395 -QETLMENASFTDYKTEHSRIL---------------------KQGG--SSDLEIGQIVGDADVGQADTE 440 (440) T ss_pred -HHHHHHhCCCCCcHHHHHHHH---------------------HHHH--HhhhhHHhhccCCCCCCcCCC Confidence 122333332211000001110 0000 000000000000000000000 No 57 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=99.71 E-value=3.5e-15 Score=99.83 Aligned_cols=466 Identities=10% Similarity=0.016 Sum_probs=225.8 Q ss_pred CCcchHHHHHHHHHHH--HHH----HHh-----hHHHHHHHHHHHHHhhcCCCC--CCHHHHHHhhhhhhhcCCCceeec Q lcl|Aclame:pro 1 MAETLEKKHERIMLRF--DRA----YSP-----QKEVREKCIEATRFARVPGGQ--WEGATAAGTKLDEQFEKYPKFEIN 67 (708) Q Consensus 1 ma~~~~~~~~~~~~~~--~~~----~~~-----~~~~r~~~~~d~~~~~~~G~Q--w~~~~~~~l~~~~q~~grp~~~~N 67 (708) |=++..+.++.++++. ..+ .+. ..+.+......+ .||.|++ |...... ..+....+..++.| T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~--~yy~g~~~~~~~~~~~---~~~~~~~~~~~~~n 75 (496) T protein:vir:38 1 MINQIIAGVKGVMRRMGLLKALKDVKDHKKVNANDEDYKYIDMWK--RLYQGHYAEWHNLNYE---HNGNPVNRRQLSMN 75 (496) T ss_pred ChhHHHHHHHHHHHHhccchhhHHHHhcCCCcCCHHHHHHHHHHH--HHhcCCCchhhcchhc---cCCCccccceeecc Confidence 8877777777666653 111 111 112223333333 3577743 4221111 01111112346779 Q ss_pred chHHHHHHHHHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCC Q lcl|Aclame:pro 68 KVATELNRIIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYD 147 (708) Q Consensus 68 ~i~~~i~~i~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d 147 (708) .-+-+++...++.....+.+.+ +|.+..+.|++ +.+.+++......++.+++..|.||+++.++. T Consensus 76 ~~k~i~~~~a~~l~~~p~~i~~------~d~~~~e~l~~----~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~----- 140 (496) T protein:vir:38 76 LPKVTAKYMSKLLFNEKVKINI------DDKAAEEFVLN----VLKTNGFTKNMERYIEYGEAMGGFVIKVYHDG----- 140 (496) T ss_pred hHHHHHHHHhhhhhCCcceEee------CChHHHHHHHH----HHhccCHHHHHHHHHHHHhhhCcEEEEEEEcC----- Confidence 9999999999999999998876 23344555544 55578999999999999999999999997652 Q ss_pred CCCCCcceeeEEeecchhheecCCcccc-CChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeE Q lcl|Aclame:pro 148 PMDDRQRIAIEPIYDPSRSVWFDPDAKK-YDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIY 226 (708) Q Consensus 148 ~~~~~~~i~i~~v~~~~~~v~~Dp~a~~-~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~ 226 (708) .+.+++..+ ++..+| |-... .++..+- |+..+ +. +..... T Consensus 141 ----~~~~~i~~v--~~~~~~--P~~~~~~~~~~~~--f~~~~-~~----------------------------~~~~y~ 181 (496) T protein:vir:38 141 ----NKNVKVSFA--TADCMY--PLSNDSENVDECV--IANSF-HK----------------------------NNKYYT 181 (496) T ss_pred ----CCcEEEEEE--cccceE--EEEecCCcEEEEE--EEEEE-Ee----------------------------CCeEEE Confidence 133455543 445555 21111 1222222 22111 00 111223 Q ss_pred EeeeeeecceEEE-EEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCC Q lcl|Aclame:pro 227 IAKYYEVRKESVD-VISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEH 305 (708) Q Consensus 227 v~e~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~ 305 (708) ..|+|+....+.. .+.++ .+.....+-....... + +--+.....+.+ + .. T Consensus 182 ~le~h~~~~~~~~I~~~~y----------~~~~~~~~g~~v~~~~-------------~-~~~~~~~~~~~~---~--~~ 232 (496) T protein:vir:38 182 LLEWNEWQGDVYTVTTELY----------QSDDPNELGTKVSLTL-------------L-FDDIEPVVPLPD---F--TR 232 (496) T ss_pred EEEEEEEeCceEEEEEEEE----------ecCCccccCccccccc-------------c-ccccccceeecC---C--Cc Confidence 3344432221111 11111 1110000000000000 0 000000011111 0 11 Q ss_pred cceeeEE--EeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHh----hcccCC Q lcl|Aclame:pro 306 IPLIPVY--GKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEA----RNKKRP 379 (708) Q Consensus 306 ~p~~p~~--~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~----~~~~~~ 379 (708) .||+++- .......+++.|.|.+.++++.++.+|.+.|.+.+.+.. +..+++++...+......... ...... T Consensus 233 ~~f~~~~~~~~N~~~~~~p~G~Sd~~~~~~lid~ld~~~s~~~~~~~~-~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~ 311 (496) T protein:vir:38 233 PTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKL-GKKKVLVPSSFVKTAVNLDGSTTQYFDSTDE 311 (496) T ss_pred ceEEEecCCcccccccCCcCCCchHhhHHHHHHHHHHHHHHHHHHHhh-cccceecchHHhhccCCCCCccccCCCCccc Confidence 1222111 011112345667889999999999999999999999876 456777776655322110000 000000 Q ss_pred ceeeecccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccc--cchhHHHHHHHHHHHHHHH Q lcl|Aclame:pro 380 AFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMP--SNIAQETVNNLMNRADMAS 457 (708) Q Consensus 380 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~--~n~sg~ai~~~q~q~~~~~ 457 (708) .+... . +....+...+....+.-....+...++.....+...+|+++.+.|.. +..||+++........... T Consensus 312 ~~~~~---~---~~~~~~~~~i~~~~~~i~~e~~~~~l~~~l~~i~~~~g~~~~~f~~~~~g~~tAtei~~~~~~l~~~~ 385 (496) T protein:vir:38 312 AFFLY---Q---GDQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSEKSETYQTK 385 (496) T ss_pred eEEEe---e---cCCCcccccceeeccccCHHHHHHHHHHHHHHHHHhhCCChhhcCCCccccchHHHHHHHHHHHHHHH Confidence 01111 1 11111122233333333346677788888888999999999988854 3358888887777666666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccch Q lcl|Aclame:pro 458 FIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYT 537 (708) Q Consensus 458 ~~~~dn~~~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~ 537 (708) ......+..+++++.+.++.+...+-.- .|... ...+|.|.=..+.+ T Consensus 386 ~~~~~~~~~~l~~l~~~il~~~~~~~~~------~g~~~---------------------------~~~~i~v~f~d~i~ 432 (496) T protein:vir:38 386 NSHSQLIEQGIKEMIVSILEVGKFIEAY------SGEVV---------------------------ELDTITVDFDDSIA 432 (496) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhh------cCCCC---------------------------CccceEEEeCCCCC Confidence 7778888899999999999877643210 01000 01122222222334 Q ss_pred hHHHHHHHHHHHHHHhccccCchhHHHHHHHHhhcc---chhHHHHHHHHHhhhhhhhc----ccCcchHH Q lcl|Aclame:pro 538 ARRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNID---GEGLDDFKEYNRNQLLISGI----AKPRNEKE 601 (708) Q Consensus 538 ~~r~~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d---~~~~~ei~e~~~~~~~~~~~----~~~~~~~~ 601 (708) ....+..+.++++...+. . + ...++.... -+.+++..++++........ ..+..+++ T Consensus 433 ~d~~~~~~~~~~~~~~Gi-i-S-----~et~l~~~~~~~d~ea~~el~ri~~E~~~~~~~~d~~~~~~~~e 496 (496) T protein:vir:38 433 QDEDTTINRYTNAKNQGM-I-P-----LKIALQRAWNITEAEADEWAEMLAKEKQAEMPNNDMNGIFGEEE 496 (496) T ss_pred CCHHHHHHHHHHHHhcCC-C-C-----HHHHHHhcCCCChHHHHHHHHHHHHhhhccCccccccCCCCCCC Confidence 444556666666654321 1 1 112222221 12333444555443321100 00111111 No 58 >protein:vir:8883 Length: 543 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813772;genbank:gi:29366727;genbank:GeneID:1258836 Probab=99.71 E-value=6.7e-15 Score=98.30 Aligned_cols=519 Identities=12% Similarity=0.048 Sum_probs=244.2 Q ss_pred CCcchH-HH-HHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcC--CCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHH Q lcl|Aclame:pro 1 MAETLE-KK-HERIMLRFDRAYSPQKEVREKCIEATRFARVP--GGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRI 76 (708) Q Consensus 1 ma~~~~-~~-~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~--G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i 76 (708) ||+... .+ -+.++.+|....+....|...|+++.+|.+=. .++++..- . + +..+.-..-...++.+ T Consensus 1 ~~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~-~----~-----~~~~~dst~~~a~~~L 70 (543) T protein:vir:88 1 MAETKREGLAEEGAKAVYERLKNDRVPYETRAENCAKVTIPSLFPKDSDNSS-T----D-----YTTPWQAVGARGLNNL 70 (543) T ss_pred CcccccCcchHHHHHHHHHHHHHHHhHHHHHHHHHHHHhccccCCCCCCccc-c----c-----ccccccchHHHHHHHH Confidence 998322 22 34567788888888888888898887753211 11221110 0 0 0112333444455554 Q ss_pred HHHHhc----CcceeEEecCCCc------ch---HHHHHHH---HHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEe Q lcl|Aclame:pro 77 IAEYRN----NRITVKFRPGDRE------AS---EELANKL---NGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTS 140 (708) Q Consensus 77 ~g~~~~----nr~~~~v~pr~~~------~d---~~~A~~l---~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~ 140 (708) .+.... .++=+++.+.+.. +. .++.+.| +..+......|++..+...++.+.+..|.|+..+.- T Consensus 71 aa~l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~ly~~~ 150 (543) T protein:vir:88 71 SAKVMLALFPLQSWMKLKVSEWQAKQLVSDPSQLAVVEQGLGMVERILMSYMEANSYRVTLFELIRQLALAGTALIYLPP 150 (543) T ss_pred HHHHHHhhcCCCcccccccChHHHhcccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeeecc Confidence 443333 2222222222100 00 1222233 344455566899999999999999999999875531 Q ss_pred eccccCCCCCCCcceeeEEeecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCC Q lcl|Aclame:pro 141 MLVNEYDPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWF 220 (708) Q Consensus 141 ~~~~~~d~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~ 220 (708) ++..+.....+.. .|+.++++..+.. . ...-++++..++..++...||+...... ..+ T Consensus 151 ------~~~~~~~~~~~~~--~pl~~y~v~~d~~---G-~v~~i~r~~~~~~~~l~~~~~~~v~~~~--------~~~-- 208 (543) T protein:vir:88 151 ------PDASSNSYNPMKL--YTLHNHVVQRDAF---G-NVLQIVTLDKVAYAALPEDVRNSLSGGQ--------EYK-- 208 (543) T ss_pred ------CccccceecceEE--eEcceEEEeeCCC---C-CeeeeeeeeeccHHHHhHHhhHHHHHHh--------hcC-- Confidence 1111111111111 1444454443222 1 2345788899999999877764321100 000 Q ss_pred CCceeEEeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCC Q lcl|Aclame:pro 221 GADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRR 300 (708) Q Consensus 221 ~~~~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~ 300 (708) ..+.+ .+|.+..+..... .+.|+ .-+-+..+....+. T Consensus 209 p~~~~-------------~v~~~V~pr~~~~-----------------------------~~~~~-~~~~~~~v~~~~~~ 245 (543) T protein:vir:88 209 PEQEL-------------EVYTHIYIDDESG-----------------------------DFLSY-QEIEGVEVDGSDGQ 245 (543) T ss_pred Cccce-------------EEEEEEEeecCCC-----------------------------ccccc-ccccCeeeecCCCc Confidence 01111 1222111111000 00010 01122222223456 Q ss_pred CCCCCcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCc Q lcl|Aclame:pro 301 IPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPA 380 (708) Q Consensus 301 ~p~~~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~ 380 (708) +|+..+||+++... ..+|..+|.|.+....+-.+.+|++....+..+.+..+++++++.+.+....+. .+++ T Consensus 246 ~~~~e~P~i~~Rw~--~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~~v~~~g~~~~~~~------~~~~ 317 (543) T protein:vir:88 246 YPQDALPWIAVRWT--KRDGEHYGRSHVEEYLGDLNSLESLNEAMIKFAMISSKVVGLVNPNGITQVRRL------VKAQ 317 (543) T ss_pred cccccCCceeeeee--ecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccchhhc------ccCC Confidence 77888888765443 368889999999999999999999999999999999999999987765433221 1111 Q ss_pred eeeecccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHccccc-chhHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 381 FLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPS-NIAQETVNNLMNRADMASFI 459 (708) Q Consensus 381 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~-n~sg~ai~~~q~q~~~~~~~ 459 (708) .-.+.+ ...+.+. +......+.+ ....+.++...+.|....-+. .+...++ ..|++-|..+.+.....+.. T Consensus 318 ~g~~v~--g~~~~v~----~~~~~~~~~~-~~~~~~i~~~~~rI~~af~~~-~~~~~~~~r~TAtEV~~r~~E~~~~LG~ 389 (543) T protein:vir:88 318 TGDFVA--GRKADIE----FLQLEKTADF-TVAKSVADAIEARLSYVFMLN-SAVQRSGERVTAEEIRYVASELEDTLGG 389 (543) T ss_pred Cceeec--CCCCcce----eeecccccch-hHHHHHHHHHHHHHHHHHhhh-hhccCCCCcccHHHHHHHHHHHHHHHhH Confidence 111111 1111111 1222222233 335566777777777665332 2222333 36888899999988888988 Q ss_pred HHHHHH-HHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchh Q lcl|Aclame:pro 460 YLDNMA-KSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTA 538 (708) Q Consensus 460 ~~dn~~-~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~ 538 (708) .+.+|. .+..-+.+..+.++.+.--=+ +-..+ .+.+.+..+ -.+- T Consensus 390 v~~rl~~E~l~Pli~r~~~il~r~g~lP-----------------------~~p~~----------~v~~~~vs~-l~~l 435 (543) T protein:vir:88 390 VYSILSQELQLPIVRVLLNQLQATQQIP-----------------------NLPQE----------AVEPTVTTG-AEAL 435 (543) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCC-----------------------CCchh----------ceeeeEEec-HHHH Confidence 888877 455555555555554421100 00000 122333222 2345 Q ss_pred HHHHHHHHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhh--hhcccCcchHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 539 RRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLI--SGIAKPRNEKEQQIVQQAQMAAQSQP 616 (708) Q Consensus 539 ~r~~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~--~~~~~~~~~~~~q~~~~~qq~qq~~~ 616 (708) .|.+..+.|+++++.++...+ +.+++. -+.++++..+...... .....+ +++.++.+++++.++ + T Consensus 436 ~r~~~~~~l~~~~~~v~~~~~------p~vld~---id~d~~~~~~a~~~Gv~~~~i~r~--~~e~~~~~~q~~~q~--~ 502 (543) T protein:vir:88 436 GRGQDLDKLTQFLNAVATVSQ------LNGDPD---LNVNNIKLRLANAIGIDTAGLLLT--EAEKAQAQSQEMLKQ--G 502 (543) T ss_pred HHHHHHHHHHHHHHHHHhccc------hhhhcc---CCHHHHHHHHHHHhCCChhhhcCC--HHHHHHHHHHHHHHH--H Confidence 688888888888887665433 112222 2344455444433322 112222 111111111111000 0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 617 NPEMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVMEAIRL 683 (708) Q Consensus 617 ~~~~~~aq~~~~~~qae~~k~~~~~~~~q~e~~~~~~~~~~~~a~~~~~~~q~~~~~~~~~~~~~~~ 683 (708) . +++. .++.....++..+.-..++ .+...+ .++ ..-...+. T Consensus 503 ~------~~~~---~~~~~~~~~~~~~~~~~~~-----~~~~~~----------~~~--~~p~~~~~ 543 (543) T protein:vir:88 503 G------LNAA---AGIGSGVAAQATASPEAME-----SAMDTA----------GVQ--PGPIATQV 543 (543) T ss_pred H------HHHH---HHHhhchhhhhccChHHHH-----HHhhhc----------CCC--CCCCCCCC Confidence 0 0000 0000000000000000000 000000 000 00000000 No 59 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=99.70 E-value=7.7e-15 Score=97.96 Aligned_cols=435 Identities=9% Similarity=-0.004 Sum_probs=202.8 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEY 80 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~g~~ 80 (708) |-+.-.+.+.++...+..- +.+ -+....||+|+|.-.......... . ..--++.|..+-+|+..++.. T Consensus 1 ~~~~~~~~i~~l~~~~~~~-------~~r--~~~l~~Yy~G~~~i~~~~~~~~~~--~-~~~k~~~n~~~~ivd~~~~~l 68 (441) T protein:vir:80 1 MNSDELALIEGMYDRIQRL-------SSW--HCCIEGYYEGSNRVRDLGVAIPPE--L-QRVQTVVSWPGIAVDALEERL 68 (441) T ss_pred CCccHHHHHHHHHHHHHHH-------HHH--HHHHHHHHhcCCcchhcCcccchh--h-hhhhhhcchHHHHHHHHHhhh Confidence 6665555566655544321 111 122235899988632211111000 0 011356799999999888866 Q ss_pred hcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCcceeeEEe Q lcl|Aclame:pro 81 RNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAIEPI 160 (708) Q Consensus 81 ~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~i~~v 160 (708) .-+- + +.+ ++. -+..+++.|+++.....++.++++.|+||+-|..+ .++.+++..+ T Consensus 69 ~~~g--~----~~~-d~~--------~l~~i~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d---------~~g~~~i~~~ 124 (441) T protein:vir:80 69 DWLG--W----TNG-DGY--------GLDGVYAANRLATASCDVHLDALIFGLSFVAIIPH---------GDGTVSVRPQ 124 (441) T ss_pred cccc--c----cCC-ChH--------HHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEEeC---------CCCceEEEEE Confidence 3111 1 121 221 14556778999999999999999999999877532 1234455433 Q ss_pred ecchhhe--ecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeeeecceEE Q lcl|Aclame:pro 161 YDPSRSV--WFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVRKESV 238 (708) Q Consensus 161 ~~~~~~v--~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~~~~~ 238 (708) ++.++ +||+...+.. .++.+..+- ..+...+ +.|... T Consensus 125 --~p~~~~~i~d~~~~~~~----~~~~~~~~~------------------------------~~~~~~~-~vy~~~---- 163 (441) T protein:vir:80 125 --SPKNCTGKFSADGSRLD----AGLVVQQTC------------------------------DPEVVEA-ELLLPD---- 163 (441) T ss_pred --ccceEEEEEeCCCCcee----EEEEEEEEe------------------------------cCceEEE-EEEecC---- Confidence 33443 4776443211 111111110 0000111 111111 Q ss_pred EEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEeeecc Q lcl|Aclame:pro 239 DVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRWFI 318 (708) Q Consensus 239 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~~~~~ 318 (708) .++.++...++ .....+..+.+++.+|+|||...+. T Consensus 164 ~~~~~~~~~~~------------------------------------------~~~~~~~~~~~~g~vPvv~~~n~~~-- 199 (441) T protein:vir:80 164 VIVQVERRGSR------------------------------------------EWVEVDRIPNVLGAVPLVPIVNRRR-- 199 (441) T ss_pred eEEEEEEcCCc------------------------------------------ceeeccccccCCCceeEEEeecccc-- Confidence 01111111000 0112234556678888888764322 Q ss_pred CCcccccc-hHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhc-cchHHHHHhhcccCCceeeecccccccccccc Q lcl|Aclame:pro 319 DDIERVEG-HIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQI-RGLEKHWEARNKKRPAFLPLREVRDKSGNIIA 396 (708) Q Consensus 319 d~~~~~~G-~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 396 (708) .+...|.| +.+.+++.++.+|+.+|.+...+...+.+..++- |+. +.... .......+++..... +..|.. T Consensus 200 ~~~~~G~s~l~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~-G~~~~~~~~--~~~~~~~~~i~~~~~--~~~~~~-- 272 (441) T protein:vir:80 200 TSRIDGRSEITRSIRAYTDEAVRTLLGQSVNRDFYAYPQRWVT-GVSADEFSQ--PGWVLSMASVWAVDK--DDDGDT-- 272 (441) T ss_pred CCccCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcCceeeee-cCCcccccc--chhhhcccccccCCC--CCCCCc-- Confidence 22333444 4467999999999999999999888877765552 321 11100 011112222221111 111110 Q ss_pred cccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccc--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 397 GATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSN--IAQETVNNLMNRADMASFIYLDNMAKSLKRAGEV 474 (708) Q Consensus 397 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n--~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~~~~ 474 (708) ..+...+.. -...+...+......+-.+|++++...|..++ +||.|+......-........+.|..+++++.++ T Consensus 273 --~~~~~~~~~-~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l 349 (441) T protein:vir:80 273 --PNVGSFPVN-SPTPYSDQMRLLAQLTAGEAAVPERYFGFITSNPPSGEALAAEESRLVKRAERRQTSFGQGWLSVGFL 349 (441) T ss_pred --ceeEecCcc-chHHHHHHHHHHHHHHhcccCCCHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111112211 22344455555566666668888888886543 5999999888877777777777788888887776 Q ss_pred HHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHHHhc Q lcl|Aclame:pro 475 WLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSM 554 (708) Q Consensus 475 ~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq~~ 554 (708) ++.+.-..-.. ... -.+|.+.=.+..+....+..+.+..|.+++ T Consensus 350 ~~~~~~~~~~~-----------~~~-------------------------~~~i~~~f~~~~~~~~~e~ad~~~kl~~~g 393 (441) T protein:vir:80 350 AAKALDSRVDE-----------ADF-------------------------FGDVGLRWRDASTPTRAATADAVTKLVGAG 393 (441) T ss_pred HHHHhcCCCcc-----------ccc-------------------------ceeeeEEeCCCCCcCHHHHHHHHHHHHhcC Confidence 65442111000 000 012333333344444556667777776654 Q ss_pred cccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 555 LPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEA 634 (708) Q Consensus 555 ~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qae~ 634 (708) ..... ...+++++.+.. +++ +++. ...++.+. +.+++ ... T Consensus 394 ~~~~s-----~~~~~~~l~~~~-~e~-~~~~--------------------~e~~e~~~---------~~~~~----~~~ 433 (441) T protein:vir:80 394 ILPAD-----SRTVLEMLGLDD-VQV-EAVM--------------------RHRAESSD---------PLAVL----AGA 433 (441) T ss_pred ccccc-----HHHHHHhCCCCH-HHH-HHHH--------------------HHHHHHHH---------HHHHH----hhh Confidence 32111 112233333211 111 1110 00000000 00000 000 Q ss_pred HHHHHHHH Q lcl|Aclame:pro 635 QKATNETA 642 (708) Q Consensus 635 ~k~~~~~~ 642 (708) .+.+.++. T Consensus 434 ~~~~~~~~ 441 (441) T protein:vir:80 434 ISRQTNEV 441 (441) T ss_pred hhcccccC Confidence 00011100 No 60 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=99.69 E-value=3.7e-15 Score=99.73 Aligned_cols=453 Identities=11% Similarity=0.025 Sum_probs=215.1 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCC--ceeecchHHHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYP--KFEINKVATELNRIIA 78 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp--~~~~N~i~~~i~~i~g 78 (708) |-+..+.+-.+.+..|-... ....+.++ ++-..||.|++-.-..... ......++| .+..|..+.+|+..+| T Consensus 23 ~~~~~~~~~~~~i~~~i~~~--~~~~~~~~--~~~~~yY~g~~~~i~~~~~--~~~~~~~~~~~ki~~n~~~~ivd~~~~ 96 (481) T protein:vir:10 23 VSDLAELLKEENLRNFISRH--QTEQVPRL--EMLESYYLNRNTDILAGER--RLQKYGDKADHRAVHNYAKYVSRFIVG 96 (481) T ss_pred eecchhhcCHHHHHHHHHHH--HHHHHHHH--HHHHHHhcCCCcccccCcc--ccccccccccceeecchHHHHHHHHHh Confidence 33332222222222211111 11222222 2224578887643211110 011112333 3678999999999999 Q ss_pred HHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCcceeeE Q lcl|Aclame:pro 79 EYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAIE 158 (708) Q Consensus 79 ~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~i~ 158 (708) +...+.+.+.+ . |.+. .+.+..+++.|+++.....+..+++++|.||+.+..+. ++.+++. T Consensus 97 ~l~g~~~~~~~--~----d~~~----~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~~~~d~---------dg~~~i~ 157 (481) T protein:vir:10 97 YLTGNPITITH--Q----DNQT----NDKIIELNDLNDADEVNSDLALNLSIYGRAYEIVYRDF---------EDRDTFK 157 (481) T ss_pred hhccCCceEec--C----ChhH----HHHHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEeCC---------CCeEEEE Confidence 99988776654 2 1222 23455667789999999999999999999998885431 2344444 Q ss_pred Eeecchhhee--cCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeeeecce Q lcl|Aclame:pro 159 PIYDPSRSVW--FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVRKE 236 (708) Q Consensus 159 ~v~~~~~~v~--~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~~~ 236 (708) .+ ++..++ ||+... .....+++.|... ..+...+...++|...+. T Consensus 158 ~~--~p~~~~~v~d~~~~-----~~~~~~i~~~~~~--------------------------~~~~~~~~~~~~y~~~~i 204 (481) T protein:vir:10 158 VL--DPKSTFVVYDQTLD-----KKVVAGVRYFEKQ--------------------------DKDKVPVQHVEVYTTDKI 204 (481) T ss_pred EE--cccceEEEEcCCCC-----CceEEEEEEEEEe--------------------------eCCCceEEEEEEEecCeE Confidence 33 233443 443221 1111222221100 000111222333332211 Q ss_pred EEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEeee Q lcl|Aclame:pro 237 SVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRW 316 (708) Q Consensus 237 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~~~ 316 (708) . .|.. -++...+.++.|.+++.+|+|+|.- T Consensus 205 ~----~~~~-------------------------------------------~~~~~~~~~~~~~~~g~vPvv~~~n--- 234 (481) T protein:vir:10 205 Y----YIEI-------------------------------------------KGGTYHRVEEVEHYYNDVPIIEYLN--- 234 (481) T ss_pred E----EEEe-------------------------------------------cCCceeecccccccCCceeEEEeec--- Confidence 0 0100 0011112233455556677766431 Q ss_pred ccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeecccccccccccc Q lcl|Aclame:pro 317 FIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKSGNIIA 396 (708) Q Consensus 317 ~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 396 (708) ...|.|.+..+++.++.+|+..|.+.+.+...+.+.+++......+-+ .. .......... ........+. . T Consensus 235 ----~~~g~~~~~~v~~lida~~~~~s~~~~~~~~~~~~~~~~~g~~~~~~~-~~-~~~~~~~~~~-~~~~~~~~~~--~ 305 (481) T protein:vir:10 235 ----DQFKQGDFENVIALIDLYDSAQSDTANYMTDLNDAMLAIIGNVDLDSE-DA-KAFRDANMIH-LEPGTNANGS--E 305 (481) T ss_pred ----CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCcCCCcc-ch-hhhhhcccee-ccccccccCC--C Confidence 234679999999999999999999999999888887776422111111 00 1111111111 1111010000 0 Q ss_pred cccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHccc-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 397 GATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM-PSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAGEVW 475 (708) Q Consensus 397 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~-~~n~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~~~~~ 475 (708) ......+...+.-...+...+..+...|..+|++.+.+.|. .+|.||.|+.................|..+++++.+++ T Consensus 306 ~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li 385 (481) T protein:vir:10 306 GKAEVKYVYKQYDVAGVEAYKKRLQNDIHKYTNTPDLNDEQFSGVQSGESMKYKLFGLEQVRAIKERLFKKGLMKRYKLL 385 (481) T ss_pred CCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 01112222222223556677888889999999998877775 35789999988777777777777777788888877777 Q ss_pred HHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHHHhcc Q lcl|Aclame:pro 476 LSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSML 555 (708) Q Consensus 476 l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq~~~ 555 (708) +.++...... . . ...+|.|.=.+..+....+..+.++.+...+ T Consensus 386 ~~~~~~~~~~-----------~----------~---------------~~~~i~v~f~~~~~~~~~~~a~~~~kl~g~i- 428 (481) T protein:vir:10 386 LNNVNLTGLK-----------Q----------H---------------NYAELTITFTPNLPKSMMESINAFNALSGGV- 428 (481) T ss_pred HHHHhccCCC-----------c----------c---------------ccceeeEEeCCCCCcCHHHHHHHHHHHhccC- Confidence 7665321100 0 0 0123444445555655566666666552111 Q ss_pred ccCchhHHHHHHHHhhccc-hhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 556 PTDPMRPAIQGIILDNIDG-EGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEA 634 (708) Q Consensus 556 ~~~p~~~~~~~~~~~~~d~-~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qae~ 634 (708) + ..++++++.+ ...++-.++++......... .++. ...+. .. ...+. T Consensus 429 ---s-----~et~~~~l~~i~d~~~E~~ri~~E~~~~~~~-------------~~~~--~~~~~----~~-----~~~~~ 476 (481) T protein:vir:10 429 ---S-----ESTRLSLLDFIDNPKEELEKMQEEEAQREKQ-------------ADKR--GYGEA----FE-----NHLNV 476 (481) T ss_pred ---C-----hHHHHHhCCCCCCHHHHHHHHHHHHHHHHhh-------------hhhc--cCCcc----CC-----CCCCC Confidence 1 1233443332 12222333333221100000 0000 00000 00 00000 Q ss_pred HHHHH Q lcl|Aclame:pro 635 QKATN 639 (708) Q Consensus 635 ~k~~~ 639 (708) ..-+. T Consensus 477 dd~~g 481 (481) T protein:vir:10 477 DDSNG 481 (481) T ss_pred CCCCC Confidence 00000 No 61 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=99.69 E-value=1.9e-15 Score=101.29 Aligned_cols=446 Identities=11% Similarity=0.049 Sum_probs=213.3 Q ss_pred chHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHH-HHhhhhhhhcCCC--ceeecchHHHHHHHHHHH Q lcl|Aclame:pro 4 TLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATA-AGTKLDEQFEKYP--KFEINKVATELNRIIAEY 80 (708) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~-~~l~~~~q~~grp--~~~~N~i~~~i~~i~g~~ 80 (708) ...+.+.++...+. ..+.+.... ..||.|+|.-..-. ...........+| .+++|..+.+|+..+|+. T Consensus 1 l~~~~i~~~i~~~~-------~~~~r~~~~--~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~yl 71 (451) T protein:vir:10 1 MELEKIRAIISADA-------ARRQEILQA--KSYYYNKNDILKKGVVVQNRDENPLRNADNRISHNFHEILVDEKASYM 71 (451) T ss_pred CCHHHHHHHHHHHH-------HHHHHHHHH--HHHhcccCccccccccccccccccccccccccccchHHHHHHhhhhhe Confidence 33333444332222 222223222 45788976311100 0000000011222 466899999999999999 Q ss_pred hcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCcceeeEEe Q lcl|Aclame:pro 81 RNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAIEPI 160 (708) Q Consensus 81 ~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~i~~v 160 (708) ..+.+.+.+ +++.+..+. ++.+. .|+++........+++++|.||..+..+...... ....+.+++..+ T Consensus 72 ~G~p~~~~~-----~~~~~~~~~----~~~~~-~n~~~~~~~~~~~~~~~~G~a~~~~y~de~~~~~-~~~~~~~~~~~i 140 (451) T protein:vir:10 72 FTYPVLFDI-----DNNKELNEK----VTDVL-GNEFTRKAKNLAIEASNCGSAWLHYWIDEEYSGE-QVTNQTFKYGVV 140 (451) T ss_pred ecccceeec-----CCcHHHHHH----HHHHh-ccCHHHHHHHHHHHHhhcCeEEEEEeecCCcccc-cccccceeEEEE Confidence 999887764 123333343 44444 4789999999999999999999888665432211 112234444433 Q ss_pred ecchhhee--cCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeeeecceEE Q lcl|Aclame:pro 161 YDPSRSVW--FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVRKESV 238 (708) Q Consensus 161 ~~~~~~v~--~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~~~~~ 238 (708) ++..+| ||.... .-...+++.|...++-. .-.....+...++|..... T Consensus 141 --~p~~~~~vydd~~~-----~~~~~~ir~~~~~~~~~---------------------~~~~~~~~~~~e~yt~~~~-- 190 (451) T protein:vir:10 141 --NTEEIIPIYRNGIE-----RELEAVIRYYIQLEDVK---------------------GQIQKQAYTYVEFWTDKIL-- 190 (451) T ss_pred --cccceEEEEcCCCC-----CceEEEEEEEEeeeccc---------------------ccccceEEEEEEEEeCCeE-- Confidence 334443 443211 11223333332111100 0000111222233332210 Q ss_pred EEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEeeecc Q lcl|Aclame:pro 239 DVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRWFI 318 (708) Q Consensus 239 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~~~~~ 318 (708) +.|... .. -..+..++..+.+.+++.+|+++|.- T Consensus 191 --~~~~~~---------~~------------------------------~~~~~~~~~~~~~~~~g~vPvv~~~n----- 224 (451) T protein:vir:10 191 --DKYKFF---------GV------------------------------SCCGSQIEHITVQHRFNSVPFVEFSN----- 224 (451) T ss_pred --EEEEec---------cc------------------------------CccccccccccccCCCCeeeEEEecc----- Confidence 111100 00 01112223333444555566655431 Q ss_pred CCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeecccccccccccccc Q lcl|Aclame:pro 319 DDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKSGNIIAGA 398 (708) Q Consensus 319 d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 398 (708) ...+.|.+..+++.++.+|.+.|.+.+.+...+++.+++.--..+...+.... ....+.+.+.......+ T Consensus 225 --n~~~~~d~e~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~--~~~~~~i~~~~~~~~~~------ 294 (451) T protein:vir:10 225 --NIKKQSDLSKYKKILDLYDRVMSGFANDLEDIQQIIYILENFGGEDTSEFLKE--LKRYKTIKTETDSEGDS------ 294 (451) T ss_pred --CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccchhhHHH--HhhCCeEEecCcCCccC------ Confidence 22356899999999999999999999999988888776642111111111111 12223333322211111 Q ss_pred cccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 399 TPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAGEVWLSM 478 (708) Q Consensus 399 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~~~~~l~l 478 (708) ....++..+.-..++...+......|...|++.+.+.+..+|.||.|+..+...........-..|..+++++.++++.+ T Consensus 295 ~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Alk~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~ 374 (451) T protein:vir:10 295 GGLKTMQIEIPTEARKIILEILKKQIYESGQGLQQDTENFGNASGVALKFFYRKLELKSGLLETEFRTSFDKLIKAILYF 374 (451) T ss_pred CcceEEeecCCHHHHHHHHHHHHHHHHHHhCcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 12233333333567777899999999999998766555556899999999888877777777778888888887777766 Q ss_pred HHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHHHhccccC Q lcl|Aclame:pro 479 AREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLPTD 558 (708) Q Consensus 479 i~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq~~~~~~ 558 (708) +-. .+ + .||.|.=.+..+..-.+..+.++.+...+ T Consensus 375 ~~~----------~d------~-------------------------~~i~i~f~~~~p~n~~e~~~~~~kl~g~i---- 409 (451) T protein:vir:10 375 LGV----------TD------Y-------------------------KKIQQTYTRNMMSNDLEDADIATKSVGII---- 409 (451) T ss_pred hCC----------CC------c-------------------------cceeEEecCCCCCCHHHHHHHHHHHhccC---- Confidence 421 00 0 01222223344443334444455442111 Q ss_pred chhHHHHHHHHhhccchhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 559 PMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQKAT 638 (708) Q Consensus 559 p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qae~~k~~ 638 (708) + ..+++.++.+ .+...+.+ +....+++...++. +-..-. - T Consensus 410 S-----~et~~~~~p~--v~d~~~e~-----------------~~~~ee~~~~~~~~---------------~~~~~~-~ 449 (451) T protein:vir:10 410 P-----TKIILRHHPW--VDDVEEAE-----------------KLYLEEKKIQASKV---------------SDDYNN-F 449 (451) T ss_pred c-----hHHHHHhCCC--CCCHHHHH-----------------HHHHHHHHHHHHHH---------------HhhcCC-C Confidence 1 1122222221 11111100 00000000000000 000000 0 Q ss_pred HH Q lcl|Aclame:pro 639 NE 640 (708) Q Consensus 639 ~~ 640 (708) .+ T Consensus 450 ~~ 451 (451) T protein:vir:10 450 TE 451 (451) T ss_pred CC Confidence 00 No 62 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=99.69 E-value=2.9e-15 Score=100.28 Aligned_cols=460 Identities=7% Similarity=-0.002 Sum_probs=211.2 Q ss_pred chHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCC--CCHHHHHH-hhhh--hhhcCC--CceeecchHHHHHHH Q lcl|Aclame:pro 4 TLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQ--WEGATAAG-TKLD--EQFEKY--PKFEINKVATELNRI 76 (708) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Q--w~~~~~~~-l~~~--~q~~gr--p~~~~N~i~~~i~~i 76 (708) ..-+.+.++...+ .........+...- ..||.|+| |....... .... ....++ -.+++|..+.+|+.. T Consensus 1 ~~~~~~~~~i~~~---~~~~~~~~~~~~~~--~~Yy~g~~~I~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~ 75 (470) T protein:vir:10 1 MELDALKKLIQNT---STSRNDLINNYKQA--VNYYENKTDITTRNNGKAKLNKEGKKDPLRSADNRIPSNFYQLLVDQE 75 (470) T ss_pred CchHHHHHHHHHH---HHHHHHHHHHHHHH--HHHhccccchhccccchhcccccccccccccCCcccccchHHHHHHhh Confidence 1112333333333 33333333333322 34788865 11111100 0000 000111 247899999999999 Q ss_pred HHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCccee Q lcl|Aclame:pro 77 IAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIA 156 (708) Q Consensus 77 ~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~ 156 (708) +|+...+.+.+.+ . |.+..+.+..+++ ++++...+....+++++|.+|..+..+. .+.++ T Consensus 76 ~~yl~G~p~~~~~--~----d~~~~~~l~~~~~-----~~~~~~~~~l~~~~~~~G~a~~~~y~d~---------~~~~~ 135 (470) T protein:vir:10 76 AGYVASVFPDIDV--G----KDADNKKIIDVLG-----DDRALTLNGLLVDSSNAGRAWLHYWIDE---------DGNFR 135 (470) T ss_pred hhheeccceeeec--C----chHHHHHHHHHHh-----hhHHHHHHHHHHHHhhcCeeEEEEEecC---------CCceE Confidence 9999999888754 1 2233444444433 3577788888899999999998886432 13444 Q ss_pred eEEeecchhhee--cCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeeeec Q lcl|Aclame:pro 157 IEPIYDPSRSVW--FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVR 234 (708) Q Consensus 157 i~~v~~~~~~v~--~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~ 234 (708) +..+ ++..+| ||+... ... .++++.|...+ +.+...+...+.|... T Consensus 136 ~~~~--~p~~~~~v~d~~~~----~~~-~a~ir~y~~~~-------------------------~~~~~~~~~~e~yt~~ 183 (470) T protein:vir:10 136 YGII--QPDQITPIYATTLD----NKL-LGILRSYKQLD-------------------------PDSGKYFTVHEYWTDK 183 (470) T ss_pred EEEE--cccceEEEEcCCCC----Cce-EEEEEEEEeee-------------------------cCCceEEEEEEEEcCC Confidence 4433 333443 443211 011 12222222110 0011122233444322 Q ss_pred ceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEe Q lcl|Aclame:pro 235 KESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGK 314 (708) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~ 314 (708) ... .|....++. ............. . ...+.....+..+.+++.+|+++ + T Consensus 184 ~~~----~~~~~~~~~-~~~~~~~~~~~~~---------------------~-~~~~~~~~~~~~~~~~g~vPvv~---~ 233 (470) T protein:vir:10 184 EAQ----FFRTNATDS-TVIEPYNIITSYD---------------------L-SAGYETGQSNTLKHNFGRVPFIE---F 233 (470) T ss_pred cEE----EEEeecCcc-eeccccccccccc---------------------c-ccccccccccccccCCCeeeEEE---e Confidence 211 111111100 0000000000000 0 00001111122233334445544 3 Q ss_pred eeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeecccccccccc Q lcl|Aclame:pro 315 RWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKSGNI 394 (708) Q Consensus 315 ~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 394 (708) +- ...+.|.+..+++.++.+|...|.+.+.+...+.+.+++.-...++..+.... ....+.+.....+...+ T Consensus 234 ~n----n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lvl~g~~~~~~~~~~~~--~~~~~~i~~~~~~~~~~-- 305 (470) T protein:vir:10 234 SK----NKYRLPELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGADLHQFMND--LRKYKSIKINNTGNGDN-- 305 (470) T ss_pred ec----CCCCCCchhHHHHHHHHHHHHHHHHHHHHHHhcCcceeeecCCccccchhhhh--hhhcCeEeccCCCCCcC-- Confidence 32 22467899999999999999999999999988888877753222222222221 12222232222111111 Q ss_pred cccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 395 IAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAGEV 474 (708) Q Consensus 395 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~~~~ 474 (708) ....+...+.-..+....+..+...|...|++.+...+..+|.||.|+..+..............|..+++++.++ T Consensus 306 ----~~~~~lt~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~gn~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~ 381 (470) T protein:vir:10 306 ----SGVDKLQIDIPVEARDDALKITRKNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRA 381 (470) T ss_pred ----ceeEEEeecCChHHHHHHHHHHHHHHHHHhCCCCCCccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1223333333346677788889999999999888777766789999999998888888888888888888887777 Q ss_pred HHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHHHhc Q lcl|Aclame:pro 475 WLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSM 554 (708) Q Consensus 475 ~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq~~ 554 (708) ++.++. +.+.+ ..||.|.-.+..+..-.+..+.++.+... T Consensus 382 i~~~l~----------~~~~d-----------------------------~~~i~i~f~~~~p~d~~e~~~~~~~~~g~- 421 (470) T protein:vir:10 382 IMRYLN----------FSDAD-----------------------------KRHISQHWTRTKVEDSLTKAQIVSTVANY- 421 (470) T ss_pred HHHHhc----------ccCcc-----------------------------cceeeEEeccCCCCCHHHHHHHHHHHhcc- Confidence 765431 11100 01233333334444333444444433111 Q ss_pred cccCchhHHHHHHHHhhccc-hhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 555 LPTDPMRPAIQGIILDNIDG-EGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAE 633 (708) Q Consensus 555 ~~~~p~~~~~~~~~~~~~d~-~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qae 633 (708) ....++++++.+ ...++..+++..... ...+..+ +.... + T Consensus 422 --------iS~et~l~~~p~v~D~~~E~eri~~E~~--------------------e~~~~~~---------~~~~~--~ 462 (470) T protein:vir:10 422 --------SSKEAVAKANPIVDDWQQELKDLAKDKE--------------------ENDPYSN---------QADEL--N 462 (470) T ss_pred --------CcHHHHHHhCCCCCCHHHHHHHHHHHHH--------------------HHHHhhc---------ccccc--C Confidence 111223333321 112222222221100 0000000 00000 0 Q ss_pred HHHHHHHHHH Q lcl|Aclame:pro 634 AQKATNETAQ 643 (708) Q Consensus 634 ~~k~~~~~~~ 643 (708) .. ..+-.+ T Consensus 463 ~~--~~dde~ 470 (470) T protein:vir:10 463 GK--GVNDEQ 470 (470) T ss_pred CC--CCCCCC Confidence 00 000000 No 63 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=99.68 E-value=3e-15 Score=100.19 Aligned_cols=447 Identities=11% Similarity=0.050 Sum_probs=210.2 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHH--HHhhhhhhhcCCC--ceeecchHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATA--AGTKLDEQFEKYP--KFEINKVATELNRI 76 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~--~~l~~~~q~~grp--~~~~N~i~~~i~~i 76 (708) +-+...+...+++.+|- +.+.+.+.+.. +...||.|+| +-..+ ...........+| .++.|..+.+|+.. T Consensus 37 ~~~~~~~~~~~~i~~~i---~~~~~~~~r~~--~l~~YY~g~~-~I~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~ 110 (492) T protein:vir:94 37 RTNNKPETLEEMIVRYI---KQHLEKLPEIS--IGQEYYEQRP-DIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQK 110 (492) T ss_pred ccCCchhhHHHHHHHHH---HHHHHHHHHHH--HHHHHhcccc-ccccccccccccccccccccccccccchHHHHHHHH Confidence 33343444444443333 22333333332 3346898975 21110 0000000001122 36789999999999 Q ss_pred HHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCccee Q lcl|Aclame:pro 77 IAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIA 156 (708) Q Consensus 77 ~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~ 156 (708) +|+...+.+.+.+ + |.+..+.+. .++ .|+++.....+..+++++|.||+.+..+. ++.++ T Consensus 111 ~~yl~G~p~~~~~-----~-d~~~~~~l~----~~~-~n~~~~~~~~~~~~a~~~G~a~~~v~~d~---------dg~~~ 170 (492) T protein:vir:94 111 VSYIVGKPIAFKH-----T-DDEVVKRID----EVL-GNRFDDKLHSVLTGASNKGIEWLHPYLDE---------EGEFK 170 (492) T ss_pred HhhhcccCceecc-----C-chHHHHHHH----HHH-hccHHHHHHHHHHHHhhCCeEEEEEEecC---------CCceE Confidence 9999888877643 1 233344443 333 36899999999999999999998886431 23445 Q ss_pred eEEeecchhhe--ecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeeeec Q lcl|Aclame:pro 157 IEPIYDPSRSV--WFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVR 234 (708) Q Consensus 157 i~~v~~~~~~v--~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~ 234 (708) +..+ ++.++ +||+... . +. ..+++.|-..+ . ...++|... T Consensus 171 ~~~~--~p~~~~~v~d~~~~-~---~~-~a~ir~~~~~~----------------------------~---~~~~~y~~~ 212 (492) T protein:vir:94 171 LFRV--PAEQGIPIWTDKEH-E---EL-EAFIRMYKLEN----------------------------E---TKVEYWDKV 212 (492) T ss_pred EEEE--cccceEEEEcCCCC-C---ce-EEEEEEEeecc----------------------------c---eeEEEEecC Confidence 5443 33443 3554221 1 11 22333332100 0 001333322 Q ss_pred ceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEe Q lcl|Aclame:pro 235 KESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGK 314 (708) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~ 314 (708) +... +.+.. +.... ... -..........+.+++.+|+|+|.- T Consensus 213 ~v~~--~~~~~---~~~~~-~~~-------------------------------~~~~~~~~~~~~~~~g~vPvv~~~n- 254 (492) T protein:vir:94 213 TVNY--YVYEN---GSLIP-DYS-------------------------------NNLENSKTHFSTGSWGKIPFIPFKN- 254 (492) T ss_pred eEEE--EEEec---Ceeee-ccc-------------------------------cccccccccccccCCCccceEEecC- Confidence 1111 11110 00000 000 0000111122445566677776532 Q ss_pred eeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhc-cchHHHHHhhcccCCceeeeccccccccc Q lcl|Aclame:pro 315 RWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQI-RGLEKHWEARNKKRPAFLPLREVRDKSGN 393 (708) Q Consensus 315 ~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 393 (708) ...+.|.+..+++.++.+|.+.|.+.+.+...+.+.+++. |.. ++..+ +... ......+... ..+ T Consensus 255 ------n~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~-g~~~~~~~~-~~~~-~~~~~~~~~~----~~~- 320 (492) T protein:vir:94 255 ------NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLK-NYDDQELPE-FKRL-LRYYGAIKVS----DNG- 320 (492) T ss_pred ------CCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeee-cCCcccchh-hHHH-HhhccceecC----CCC- Confidence 2246799999999999999999999999998888776653 221 11111 1111 1111111111 111 Q ss_pred ccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHccc-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 394 IIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM-PSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAG 472 (708) Q Consensus 394 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~-~~n~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~~ 472 (708) ...+...+.-..++...+..+.+.|...|++.+.+.+. ++|.||.|+...-...........+.|..+++++. T Consensus 321 ------~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~ 394 (492) T protein:vir:94 321 ------GVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELL 394 (492) T ss_pred ------cceeEeccCCHHHHHHHHHHHHHHHHHHhCCcCCCccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 12222222233556677788888999999987766554 45789999988877777777777778888888887 Q ss_pred HHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHHH Q lcl|Aclame:pro 473 EVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLS 552 (708) Q Consensus 473 ~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq 552 (708) ++++.++.. .++ .-||.|.=.+..+....+..+.+..+.. T Consensus 395 ~li~~~~~~----------~~~------------------------------~~~i~v~f~~~~p~~~~e~~~~~~kl~g 434 (492) T protein:vir:94 395 WFVFEHFDI----------KGE------------------------------HKDVDISFNYNKVANTELQVQTAQQSMG 434 (492) T ss_pred HHHHHHhcC----------Ccc------------------------------cceeeEEecCCCCCCHHHHHHHHHHHhc Confidence 777665421 110 0122333344555544455555555421 Q ss_pred hccccCchhHHHHHHHHhhccc-hhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 553 SMLPTDPMRPAIQGIILDNIDG-EGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQ 631 (708) Q Consensus 553 ~~~~~~p~~~~~~~~~~~~~d~-~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~q 631 (708) .+ + ..++++++.+ ...+...+++..... ...+..+..............+ T Consensus 435 ii----S-----~et~~~~l~~v~d~~~E~eri~~E~~--------------------~~~~~~~~~~~~~~~~~~~~~~ 485 (492) T protein:vir:94 435 IV----S-----HETVLENHPFVEDLQAELERIEQEQM--------------------EYNKQLPNLDDGGADSAQQQER 485 (492) T ss_pred cC----c-----hHHHHHhCCCCCCHHHHHHHHHHHHH--------------------HHHhhccccccccCCCCccccC Confidence 11 1 1223333322 122222223221100 0000000000000000000000 Q ss_pred HHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 632 AEAQKATNETAQTQIKAF 649 (708) Q Consensus 632 ae~~k~~~~~~~~q~e~~ 649 (708) ++ +.+.+ T Consensus 486 ~~-----------~~e~e 492 (492) T protein:vir:94 486 SN-----------NKESE 492 (492) T ss_pred Cc-----------cccCC Confidence 00 00000 No 64 >protein:vir:105641 Length: 516 # NCBI annotation: putative head-tail connector # Family: family:all:481 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425006;genbank:gi:83571754;uniprot:Q2WC46;genbank:GeneID:3837282 Probab=99.68 E-value=3.5e-14 Score=94.38 Aligned_cols=494 Identities=14% Similarity=0.030 Sum_probs=237.5 Q ss_pred CCcchHHHHH----HHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHE----RIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRI 76 (708) Q Consensus 1 ma~~~~~~~~----~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i 76 (708) |.-+++.... .+..+|+........|...|+++.+|..= .=+++.- .+ .+...+.-+.-...++.+ T Consensus 1 ~~~~~~~~~~~~~~~l~~r~~~L~~~R~~~e~~w~e~a~~~lP--~~~~~~~------~~--~~~~~~~dstg~~a~~~L 70 (516) T protein:vir:10 1 MKQSTDLEYGGKRSKIPKLWEKFSTKRSSFLDRAKHYSKLTLP--YLMNDKG------DN--ETSQNGWQGVGAQATNHL 70 (516) T ss_pred CCchhhHhhhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhcc--cccCCCC------Cc--ccccccccchHHHHHHHH Confidence 7776666654 78888888888888899999888765321 1011100 00 011112233334445544 Q ss_pred HHHHhc-----CcceeEEecCCC---------cchHHHHHH---HHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEE Q lcl|Aclame:pro 77 IAEYRN-----NRITVKFRPGDR---------EASEELANK---LNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLT 139 (708) Q Consensus 77 ~g~~~~-----nr~~~~v~pr~~---------~~d~~~A~~---l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~ 139 (708) .+.... +++=+++.+.+. .+-.++.+. ++..+......|++..+...++.+.+..|.|+..+ T Consensus 71 Aa~l~~~ltpp~~~WF~L~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~- 149 (516) T protein:vir:10 71 ANKLAQVLFPAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSCMLYK- 149 (516) T ss_pred HHHHHhhhcCCCCccccccCChhhHhhhhccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEeEEe- Confidence 433332 333344443321 011123333 34455566778999999999999999999987543 Q ss_pred eeccccCCCCCCCcceeeEEeecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCC Q lcl|Aclame:pro 140 SMLVNEYDPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNW 219 (708) Q Consensus 140 ~~~~~~~d~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~ 219 (708) + +. . .++.. |+.++++..+.. . ...-++++..++..++.+.|++....... ..... T Consensus 150 -d---~~---~---~~~~~----pl~~y~v~~d~~---G-~v~~ivrr~~~~~~~l~e~~~~~~~~~~~-----~~~~~- 205 (516) T protein:vir:10 150 -P---SK---G---AISAI----PMHHYVVNRDTN---G-DLLDIILLQEKSLRTFDPATRAVVEVGLK-----GKKCK- 205 (516) T ss_pred -c---CC---C---CeEEE----EcCeEEEeeCCC---C-CeEEEeeeecccHHHHHHHhhhhhhhhhh-----hhccC- Confidence 1 11 1 12322 455666544332 1 12236888899999999998653221111 00000 Q ss_pred CCCceeEEeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCC Q lcl|Aclame:pro 220 FGADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPR 299 (708) Q Consensus 220 ~~~~~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~ 299 (708) ..+.+ ++|. +.++++. + +|.++.-.+...+...+ T Consensus 206 -~~~~~---~i~t--------~v~~~~~-~---------------------------------~~~~~~~~d~~~~~~~s 239 (516) T protein:vir:10 206 -EDDSI---KLYT--------HAKYLGE-G---------------------------------FWELKQSADDIPVGKVS 239 (516) T ss_pred -CCCce---EEEE--------EEEecCC-C---------------------------------ceEEEEeeCceeecccc Confidence 01111 1111 1111110 0 12222333333333446 Q ss_pred CCCCCCcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCC Q lcl|Aclame:pro 300 RIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRP 379 (708) Q Consensus 300 ~~p~~~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~ 379 (708) .||+..+||+++.... .+|..+|.|.+....+--+.+|++...++.......++.++++++.+-...+... ..++ T Consensus 240 ~~~~~e~P~~~~Rw~~--~~ge~YGrgp~~~~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~l~~---~~~g 314 (516) T protein:vir:10 240 KIKSEKLPFIPLTWKR--SYGEDWGRPLAEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGAQTDVDHFVN---SGTG 314 (516) T ss_pred ccccccCCeeeeeeee--cCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCCcccCcccccchhhhcc---CCCc Confidence 6888889998765443 6888999999999999999999999999999999999999998776643322111 1111 Q ss_pred ceeeecccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccchhHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 380 AFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMASFI 459 (708) Q Consensus 380 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n~sg~ai~~~q~q~~~~~~~ 459 (708) .++ ++ ..+.+ .+.+...... -+.....++...+.|....=++..........|++-|..+.+--...+.. T Consensus 315 ~~~---~g--~~~~v----~~~q~~~~~d-~~~~~~~i~~~~~rI~~af~~~~l~~rd~~rvTAtEV~~r~~E~~~~LGp 384 (516) T protein:vir:10 315 EVV---TG--VEEDI----HIVQLGKYAD-LTPISAVLEVYTRRIGVVFMMETMTRRDAERVTAVEIQRDALEIEQNMGG 384 (516) T ss_pred eee---cC--Ccccc----eeeecCcccc-hHHHHHHHHHHHHHHHHHHhhhhhhccCCccccHHHHHHHHHHHHHHhhh Confidence 221 11 11111 1111111111 23444556666666665543332222223346888888888877777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhH Q lcl|Aclame:pro 460 YLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTAR 539 (708) Q Consensus 460 ~~dn~~~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~ 539 (708) .+.+|.. +++..||..-+ ...... -|. . + +++.+.++. .+-. T Consensus 385 v~~rl~~------Ell~Pli~r~~------~~~~p~-------------~P~--~------l----v~~~~v~~i-~~L~ 426 (516) T protein:vir:10 385 VYSLFAT------TMQSPVAMWGL------LEAGDS-------------FTS--D------L----VDPVIITGI-EALG 426 (516) T ss_pred HHHHHHH------HHHHHHHHHHH------HhhCCC-------------CCh--h------h----cCcceehhH-HHHH Confidence 7776653 23333333221 011110 010 0 0 122222233 3445 Q ss_pred HHHHHHHHHHHHHhccccCchhHHHHHHHHhhccchh-HHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 540 RDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEG-LDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNP 618 (708) Q Consensus 540 r~~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~-~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~ 618 (708) |.+.++.+.++++.+++..+. .+.++...++.. ++.+...+. .+.....++ ++..+..+++++.|+.++. T Consensus 427 raq~~~~i~~~~q~i~~~~q~----~p~v~d~id~d~~~~~~a~~~g---vp~~~irs~-eev~~~r~~~~~~q~~~~~- 497 (516) T protein:vir:10 427 RMAELDKLANFAQYMSLPLQW----PEPVLAAVKWPDYMDWVRGQIS---AELPFLKSA-EEMEQEQEAQMQAQQAQML- 497 (516) T ss_pred HHHHHHHHHHHHHHHHHHhcC----ChHHHhhcCHHHHHHHHHHHhC---CChhccCCH-HHHHHHHHHHHHHHHHHHH- Confidence 777778887777765543221 122344444332 222222221 122222221 1111111111111111110 Q ss_pred HHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 619 EMVLAQAQMVAAQAEAQKA 637 (708) Q Consensus 619 ~~~~aq~~~~~~qae~~k~ 637 (708) +....++.....+-+++++ T Consensus 498 ~~~~~~~~~~~~~~~~~~~ 516 (516) T protein:vir:10 498 EEGVAKAVPGVIQQELKEA 516 (516) T ss_pred HHHhhhcccchhhhhhhcC Confidence 0000000000000011111 No 65 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=99.67 E-value=6.6e-15 Score=98.31 Aligned_cols=448 Identities=12% Similarity=0.045 Sum_probs=210.0 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCC--CCHHHHHHhhhhhhhcCCC--ceeecchHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQ--WEGATAAGTKLDEQFEKYP--KFEINKVATELNRI 76 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Q--w~~~~~~~l~~~~q~~grp--~~~~N~i~~~i~~i 76 (708) -.+...+...+++ .+..+.+...+.+.. +...||.|+| |.... ...........++ .+++|..+.+|+.. T Consensus 17 ~~~~~~~~~~~~i---~~~i~~~~~~~~~~~--~~~~YY~g~~~i~~~~~-~~~~~~~~~~~~~~~ri~~n~~~~ivd~~ 90 (472) T protein:vir:93 17 RTNNKPETLEEMI---VRYIKQHLEKLPEIS--IGQEYYEQRPDIVKEPK-PVDATGAVDPLKPDDRMITNFHANLVDQK 90 (472) T ss_pred eecCchhhHHHHH---HHHHHHHHHHHHHHH--HHHHHhccccccccccc-hhhccccccccccccccccchHHHHHHHH Confidence 1112222333333 333334444344443 3346899975 11110 1000000001122 36689999999999 Q ss_pred HHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCccee Q lcl|Aclame:pro 77 IAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIA 156 (708) Q Consensus 77 ~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~ 156 (708) +|+...+.+.+.+ +|.+..+. ++.++ .|+++.....+..+++++|.||..+..+. ++.++ T Consensus 91 ~~~l~g~~~~~~~------~d~~~~~~----l~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d~---------d~~~~ 150 (472) T protein:vir:93 91 VSYIVGKPIAFKH------TDDEVVKR----IDEVL-GNRFDDKLHSVLTGASNKGIEWLHPYLDE---------EGEFK 150 (472) T ss_pred hhhhcccCeeecc------CChHHHHH----HHHHH-hccHHHHHHHHHHHHhhcCeEEEEEEECC---------CCceE Confidence 9999888877642 12333343 44444 36899999999999999999998876431 23445 Q ss_pred eEEeecchhhee--cCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeeeec Q lcl|Aclame:pro 157 IEPIYDPSRSVW--FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVR 234 (708) Q Consensus 157 i~~v~~~~~~v~--~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~ 234 (708) +..+ ++.+++ ||+... .+. ..+++.|-..++ ...++|... T Consensus 151 i~~~--~p~~~~~i~d~~~~----~~~-~~~ir~~~~~~~-------------------------------~~~~~~~~~ 192 (472) T protein:vir:93 151 LFRV--PAEQGIPIWTDKEH----EEL-EAFIRMYKLENE-------------------------------TKVEYWDKV 192 (472) T ss_pred EEEE--cccceEEEEcCCCC----Cce-EEEEEEEEeecc-------------------------------eeEEEEecC Confidence 5443 333443 553221 111 223333211100 001333322 Q ss_pred ceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEe Q lcl|Aclame:pro 235 KESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGK 314 (708) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~ 314 (708) +... +.+. .+........ ..+ .......+.+++.+|+|+|.- T Consensus 193 ~~~~--~~~~---~~~~~~~~~~-------------------------------~~~-~~~~~~~~~~~~~vPvv~~~n- 234 (472) T protein:vir:93 193 TVNY--YVYE---NGSLIPDYSN-------------------------------NLE-NSKTHFSTGSWGKIPFIPFKN- 234 (472) T ss_pred eEEE--EEEe---cCeeeecccc-------------------------------ccc-ccccccccCCCCCcceEEecC- Confidence 2111 0000 0000000000 000 001122445566677776532 Q ss_pred eeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeecccccccccc Q lcl|Aclame:pro 315 RWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKSGNI 394 (708) Q Consensus 315 ~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 394 (708) ...|.|.+..+++.++.+|..+|.+...+...+.+.+++.-....+..+ +... ....+.+... ..+ T Consensus 235 ------n~~g~s~~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~~~g~~~~~~~~-~~~~-~~~~~~~~~~----~~~-- 300 (472) T protein:vir:93 235 ------NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPE-FKRL-LRYYGAIKVS----DNG-- 300 (472) T ss_pred ------CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCcccchh-hHHH-HhhccccccC----CCC-- Confidence 2246789999999999999999999999998888876663211111111 1111 1111121111 111 Q ss_pred cccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHccc-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 395 IAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM-PSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAGE 473 (708) Q Consensus 395 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~-~~n~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~~~ 473 (708) ...+...+.-..++...+......|...|++.+.+.+. ++|.||.|+...............+.|..+++++.+ T Consensus 301 -----~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~ 375 (472) T protein:vir:93 301 -----GVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLW 375 (472) T ss_pred -----cceeEeecCCHHHHHHHHHHHHHHHHHHhCCCCCCccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11222222223666778888889999999988766654 457899999888777777777777778888888777 Q ss_pred HHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHHHh Q lcl|Aclame:pro 474 VWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSS 553 (708) Q Consensus 474 ~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq~ 553 (708) +++.++-.- ++ ..+|.|.=.+..+....+..+.++.+... T Consensus 376 li~~~~~~~----------~~------------------------------~~~i~v~f~~~~p~~~~~~~~~~~k~~gi 415 (472) T protein:vir:93 376 FVFEHFDIK----------GE------------------------------HKDVDISFNYNKVANTELQVQTAQQSMGI 415 (472) T ss_pred HHHHHhCCC----------cc------------------------------cceeeEEeCCCCCCCHHHHHHHHHHHhcc Confidence 776654210 00 01222333445555444555555554221 Q ss_pred ccccCchhHHHHHHHHhhccc-hhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 554 MLPTDPMRPAIQGIILDNIDG-EGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQA 632 (708) Q Consensus 554 ~~~~~p~~~~~~~~~~~~~d~-~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qa 632 (708) + + ..++++++.+ ...+...++++.......... +.... ..... ..+. T Consensus 416 i----s-----~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~--------------------~~~~~--~~~d~-~~~~ 463 (472) T protein:vir:93 416 V----S-----HETVLENHPFVEDLQAELERIEQEQMEYNKQL--------------------PNLDD--GGADG-AQQQ 463 (472) T ss_pred C----c-----hHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhc--------------------cCcCc--ccCCC-CCCC Confidence 1 1 1223333322 222333333332110000000 00000 00000 0000 Q ss_pred HHHHHHHHHHHHH Q lcl|Aclame:pro 633 EAQKATNETAQTQ 645 (708) Q Consensus 633 e~~k~~~~~~~~q 645 (708) +. ....+.| T Consensus 464 ~~----~~~~~~e 472 (472) T protein:vir:93 464 ER----SNNKESE 472 (472) T ss_pred CC----CCcccCC Confidence 00 0000000 No 66 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=99.67 E-value=1.7e-14 Score=96.13 Aligned_cols=452 Identities=12% Similarity=0.070 Sum_probs=215.8 Q ss_pred CCc-------c----hHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCC--ceeec Q lcl|Aclame:pro 1 MAE-------T----LEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYP--KFEIN 67 (708) Q Consensus 1 ma~-------~----~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp--~~~~N 67 (708) |+. + ..+.+.+++.++. ..... +. ++-..||.|+| +-. ....+.. .++| .++.| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~---~~~~~---r~--~~~~~yy~g~~-~i~-~~~~~~~---~~~~~~ki~~n 67 (489) T protein:vir:99 1 MLQEDFEAIDYESKLWIDQLKNYISRFK---AEQLE---RL--KELKRYYLGDN-NIK-YRPAKTD---KYAADNRIASD 67 (489) T ss_pred CCccceeeeCCCCCCCHHHHHHHHHHHH---HHHHH---HH--HHHHHHhcccC-ccc-ccccccc---ccCCcceeecc Confidence 221 1 1122333333332 11111 11 22345788976 111 1111111 1222 47889 Q ss_pred chHHHHHHHHHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCC Q lcl|Aclame:pro 68 KVATELNRIIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYD 147 (708) Q Consensus 68 ~i~~~i~~i~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d 147 (708) ..+.+|+..+|+...+.+.+.+ - |.+ ....+..++..|+++........+++++|.||..+...... T Consensus 68 ~~~~iv~~~~~~l~g~~~~~~~--~----d~~----~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~~--- 134 (489) T protein:vir:99 68 FAKYITVFEQGYMLGVPVEYKN--E----NKD----LQAAIDLMSVRNNEDYHNVKIKTDLSIYGRAYELLTVEKID--- 134 (489) T ss_pred hHHHHHHHHhhhhccCCceeec--C----Chh----HHHHHHHHHhhcChhHHHHHHHHHHhhCCeEEEEEeeccCc--- Confidence 9999999999999988777653 1 222 35567777888999999999999999999999887654321 Q ss_pred CCCCCcceeeEEeecchhhee--cCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCcee Q lcl|Aclame:pro 148 PMDDRQRIAIEPIYDPSRSVW--FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVI 225 (708) Q Consensus 148 ~~~~~~~i~i~~v~~~~~~v~--~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~ 225 (708) ++.+.+++..+ ++.++| ||+... .+.. .+++.|.. ...+.... T Consensus 135 --d~~~~~~i~~~--~p~~~~~v~dd~~~----~~~~-~~i~~~~~--------------------------~~~~~~~~ 179 (489) T protein:vir:99 135 --DKKTEVKLYQL--PAEQTFVIYDDTYQ----RNSL-MAVHFYDI--------------------------DYGSGKRK 179 (489) T ss_pred --CCCcceEEEEE--cccceEEEEcCCCC----CceE-EEEEEEEE--------------------------ecCCCceE Confidence 23455666544 334443 443221 1111 22222210 00011122 Q ss_pred EEeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCC Q lcl|Aclame:pro 226 YIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEH 305 (708) Q Consensus 226 ~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~ 305 (708) ...++|..... +.|... .. -.+...+.++.+.+++. T Consensus 180 ~~~~~y~~~~i----~~~~~~-----------~~-----------------------------~~~~~~~~~~~~~~~g~ 215 (489) T protein:vir:99 180 QIIKAYTSDTI----YTYEDY-----------NL-----------------------------ETKGMRLKDYEGHFFKG 215 (489) T ss_pred EEEEEEeCCcE----EEEEec-----------CC-----------------------------CcccceecccccccCCc Confidence 33344433221 111100 00 00011122344555666 Q ss_pred cceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchH--HHHHhhcccCCceee Q lcl|Aclame:pro 306 IPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLE--KHWEARNKKRPAFLP 383 (708) Q Consensus 306 ~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~--~~~~~~~~~~~~~~~ 383 (708) +|+|+|.- ...+.|.+..+++.++.+|...|.+.+.+...+.+.+++--....+.+ ...........+... T Consensus 216 vPvv~~~n-------~~~~~s~~~~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~g~~~~~~~~~~~~~~~~~~~~~~~~ 288 (489) T protein:vir:99 216 VPVNEYAN-------NEERTGAYESVLDNIDAYDLSQSELANFQQDSVNALLVIAGNAYTGADENDYLDDGRLNPNGRLA 288 (489) T ss_pred eeEEEeec-------CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhhccCCcccccchhhhhhcccccccccc Confidence 67766531 224668899999999999999999999887776665554211111100 010111111111100 Q ss_pred eccccccccccc--------cc-ccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcc-cccchhHHHHHHHHHHH Q lcl|Aclame:pro 384 LREVRDKSGNII--------AG-ATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQ-MPSNIAQETVNNLMNRA 453 (708) Q Consensus 384 ~~~~~~~~~~~~--------~~-~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G-~~~n~sg~ai~~~q~q~ 453 (708) . ......+.+. .+ .....+...+.-...+...+..+...|...||+.+.+.+ ..+|.||.|+....... T Consensus 289 ~-~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l 367 (489) T protein:vir:99 289 I-SIGFKKAQVLILDDNPNPNGVKPQAYFLKKEYDTAGSEAYKNRLVADILRFTFTPDTQDMKFSGVQSGESMKYKLMAS 367 (489) T ss_pred c-ccccccceeeeeccccCccccccceeeeeecCChHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHH Confidence 0 0000000000 00 001122222222355566778888899999998775544 34678999998877776 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeec Q lcl|Aclame:pro 454 DMASFIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVG 533 (708) Q Consensus 454 ~~~~~~~~dn~~~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~ 533 (708) ..........|..+++++.++++.++...... . +.... --||.|.=. T Consensus 368 ~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~-----~---~~~~~-------------------------~~~i~v~f~ 414 (489) T protein:vir:99 368 DNYREKQERLFKKGLMRRLRLAANIWAIKGNE-----A---TTYSL-------------------------VNDTSIVFT 414 (489) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCc-----c---ccccc-------------------------cccceEEeC Confidence 66677777778888888888877776432210 0 00000 013334445 Q ss_pred ccchhHHHHHHHHHHHHHHhccccCchhHHHHHHHHhhccc---hhHHHHHHHHHhhhhhhhc-cc------CcchHHHH Q lcl|Aclame:pro 534 PSYTARRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDG---EGLDDFKEYNRNQLLISGI-AK------PRNEKEQQ 603 (708) Q Consensus 534 ~~~~~~r~~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~---~~~~ei~e~~~~~~~~~~~-~~------~~~~~~~q 603 (708) +..+....+..+.+..+...+ + ...+++++.+ +..++..++++........ .+ ...+++ . T Consensus 415 ~~~p~d~~~~~~~~~kl~gii----s-----~et~~~~l~~v~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~-~ 484 (489) T protein:vir:99 415 PNLPQNDNEIVTAAQNLYGIV----S-----DQTIFEILNTVTGVDAEAELKRLKEEADKKQSLPEPRLVGDASGQEE-P 484 (489) T ss_pred CCCCcCHHHHHHHHHHHhccC----C-----HHHHHHhcCCCCchhHHHHHHHHHHHHHHHhccccccccCCCCCCcC-C Confidence 556665666666666653211 1 1122332221 2333334444332211100 00 000000 0 Q ss_pred HHHHHHHHHHHHH Q lcl|Aclame:pro 604 IVQQAQMAAQSQP 616 (708) Q Consensus 604 ~~~~~qq~qq~~~ 616 (708) ...+| T Consensus 485 --------~~~~p 489 (489) T protein:vir:99 485 --------TAEKP 489 (489) T ss_pred --------CCCCC Confidence 00000 No 67 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=99.66 E-value=1.3e-15 Score=102.24 Aligned_cols=495 Identities=7% Similarity=-0.007 Sum_probs=237.6 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHH----H----HH----HHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecc Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEV----R----EK----CIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINK 68 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~----r----~~----~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~ 68 (708) |- +...+++.++.++....+- + +. .+.+....+|-+.+|...-...+.. -.+..|+ T Consensus 1 ~~-----~~~~~~~~i~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~-------~~~~~~l 68 (518) T protein:vir:78 1 MG-----VWSVMTRFIKGWLNGKPNGSEPELIPKYLPLVPDNQKEWSKDSYLTSLWAQGYVPTVHD-------KLMNSGT 68 (518) T ss_pred Cc-----chhhHHHHHHHhhcCCCCccchhccHHHhhhcccchhhhhhhhhhhhhcccCCCCcccc-------ccccCCh Confidence 43 5555666666555433220 0 00 0000111123355664432211111 1345566 Q ss_pred hHHHHHHHHHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCC Q lcl|Aclame:pro 69 VATELNRIIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDP 148 (708) Q Consensus 69 i~~~i~~i~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~ 148 (708) -+.+++.......+-.+.+.|..-+..+| +.+++.+..+.+.|++.....+.++.++..|.+|+++.++ T Consensus 69 ~~~i~~~~A~ll~~e~~~i~v~~~~~~d~----e~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d------- 137 (518) T protein:vir:78 69 GNEIVVVAAEYISGKPLSIDVTGVNGSKD----ENLTKQLKEALRIDNFDSKSVKIVELAGGSGVSAVKINIL------- 137 (518) T ss_pred HHHHHHHHHHhhcCCCceEEecCccccCc----HHHHHHHHHHHHhccHHHHHHHHHHHhhccCceEEEEEEE------- Confidence 77888888888888888999876543233 3456667777778999999999999999999999998764 Q ss_pred CCCCcceeeEEeecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCcee-EE Q lcl|Aclame:pro 149 MDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVI-YI 227 (708) Q Consensus 149 ~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~-~v 227 (708) + +.++|..+ +...|++. ...-++. ..+|...... . ++.. .+.-.++. ..+.. .. T Consensus 138 -~--~~~~i~~v--~ad~~~P~--~~~g~~~--~~~f~~~~~~-~-------~k~~------~y~~lE~h--e~~~~~~~ 192 (518) T protein:vir:78 138 -N--GRPSISVH--SSSQFWID--FKNNEPF--RFNFFEEIPT-S-------NKAD------IYYLVESR--EIKQWDKE 192 (518) T ss_pred -C--CeeEEEEE--cCCeeEEE--eecCcEE--EEEEEEEeec-C-------Ccce------eEEEEEee--ccccccce Confidence 1 24555554 44556542 2222333 3333321111 0 0000 00000000 00000 00 Q ss_pred eeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcc Q lcl|Aclame:pro 228 AKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIP 307 (708) Q Consensus 228 ~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p 307 (708) .+-+..-+.+- ..|.. ..+............ ....+....++.+...-..+...| T Consensus 193 ~~~~~~~~I~n--~ly~~-~~~~~v~~~~~~~~~----------------------~l~~~~~~~~~~e~~~~~tg~~~~ 247 (518) T protein:vir:78 193 GKKLSGGFVTY--SVIKI-DGDKTTPISAERLPE----------------------QITSYLHTNDIQLNHSVSIGLKSM 247 (518) T ss_pred eecccceeEEE--EEeee-cCccccccccccccc----------------------ccccccccccCccceeeccCCccc Confidence 00000000000 11111 001000000000000 000000011111111111122234 Q ss_pred eeeEE---EeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHH-----hhcccCC Q lcl|Aclame:pro 308 LIPVY---GKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWE-----ARNKKRP 379 (708) Q Consensus 308 ~~p~~---~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~-----~~~~~~~ 379 (708) +++++ +......+++.|.|++.++++.++.+|...|++.+.+.. +..++.++++.+.....-.. ....... T Consensus 248 ~~~~~~n~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~-g~~~i~v~~~~l~~~~~~~~~~~~~~fd~~~~ 326 (518) T protein:vir:78 248 GAYLINNSPSNTRYPHLNLGESDLSQCTNYLFAVDYFFTVYMREGEK-TKTKIAASERMFRKKVNKSTDKEEWSMNVDED 326 (518) T ss_pred eEEeeccccccccccCCCcCcchHhhhhHHHHHHHHHHHHHHHHHHh-CCceeeechhHhccCCCCCCCccccccCCCCc Confidence 44432 111112356778899999999999999999999999976 67788888776632110000 0000111 Q ss_pred ceeeecccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHccccc-chhHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 380 AFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPS-NIAQETVNNLMNRADMASF 458 (708) Q Consensus 380 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~-n~sg~ai~~~q~q~~~~~~ 458 (708) .+..++...+..+. +...++..++.--...+...++.....+....|++....|..+ ..||++|....+..-.... T Consensus 327 ~y~~i~~~~~~~~~---~~~~i~~~~~~Ir~e~~~~~~~~~l~~~~~~~G~s~~tfg~~~~~~TATei~s~~~~~~~t~~ 403 (518) T protein:vir:78 327 YFMQFKGTLDAGAK---LNDMIQFMQGDFRDGSYRETMEYFAQKAVSKSGYNPATFNLGNREVKATEIWSLQDATVRKIE 403 (518) T ss_pred eEEEecCcCCCCCc---cccceeeeecccChHHHHHHHHHHHHHHHHhhCCChhhcCcccccccHHHHHHHHHHHHHHHH Confidence 12222111111111 1112333444433467778888888888889999998888643 4699999999988777788 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchh Q lcl|Aclame:pro 459 IYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTA 538 (708) Q Consensus 459 ~~~dn~~~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~ 538 (708) .+...+..+++++-+.++.+..-++...... .....++|+|+=..+-.. T Consensus 404 ~~~~~~e~al~~l~~~i~~l~~~~~~~~~~~-------------------------------~~~~~~~v~i~f~D~i~~ 452 (518) T protein:vir:78 404 KKKRLIQNVYEQMLWDFLYLLTGGTNNKEKA-------------------------------IMRDEIRVIIEFPDPMSV 452 (518) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcCccccc-------------------------------cCCCceeEEEEeCCCCCC Confidence 8888888888888888888776554311110 011235566665556666 Q ss_pred HHHHHHHHHHHHHHhccccCchhHHHHH-HHHhhccchhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHH Q lcl|Aclame:pro 539 RRDATVSVLTNVLSSMLPTDPMRPAIQG-IILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQS 614 (708) Q Consensus 539 ~r~~~~~~l~~llq~~~~~~p~~~~~~~-~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~ 614 (708) .+.+..+.+..+...+. .. .-... .+....+-.-+++..++++........++|....-+. ..+. T Consensus 453 D~~~~~~~~~~~v~aGi-mS---~e~~i~~~~~~~~deea~~e~~ri~~E~~~~~~~~p~~~~g~~-------~~~g 518 (518) T protein:vir:78 453 NLNELSSTLNNMNSALA-MS---VEEKVKLIHPKWEDEEIQAEVKRIYLENAIGEVPDPEAIGGME-------TKGG 518 (518) T ss_pred CHHHHHHHHHHHHhcCC-CC---HHHHHHHhCCCCCHHHHHHHHHHHHHHhcccCCCCCccccCCC-------CCCC Confidence 66677777776655431 11 11111 1111111122334445554443322222221100000 0000 No 68 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=99.66 E-value=2.5e-15 Score=100.63 Aligned_cols=451 Identities=13% Similarity=0.055 Sum_probs=215.3 Q ss_pred CCcchHHHHHHH------HHHHHHHHHhhHHHHHHHHHHHHHhhcCCCC--CCHHHHHH---hhh------hhhhcCCC- Q lcl|Aclame:pro 1 MAETLEKKHERI------MLRFDRAYSPQKEVREKCIEATRFARVPGGQ--WEGATAAG---TKL------DEQFEKYP- 62 (708) Q Consensus 1 ma~~~~~~~~~~------~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Q--w~~~~~~~---l~~------~~q~~grp- 62 (708) |- +.+.+..+ .+.+.+..+.....+++.... +.||.|.+ +....+.. +.. .....++| T Consensus 1 ~~--~~~~~~~~~~~~~~~e~i~~~i~~~~~~~~r~~~~--~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (474) T protein:vir:94 1 MT--LYKLIDDIEAQGILPKHIEALIESHKDDRERMVNL--YNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVN 76 (474) T ss_pred Cc--hHHHHhhccccCCCHHHHHHHHHHhhhhhHHHHHH--HHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcc Confidence 11 11111111 011222222222223333222 22333311 00000000 000 00011233 Q ss_pred -ceeecchHHHHHHHHHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEee Q lcl|Aclame:pro 63 -KFEINKVATELNRIIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSM 141 (708) Q Consensus 63 -~~~~N~i~~~i~~i~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~ 141 (708) .+++|..+.+|+..+|+...+.+.+.+.+.. +. .+.+...+..++..|+++.....+..+++++|.+|..+..+ T Consensus 77 ~ki~~n~~~~ivd~~~~yl~g~pv~~~~~~~~-~~----~e~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d 151 (474) T protein:vir:94 77 NKLNNSFDSEIVDTRVGYLHGVPVTYDLDENA-EK----NEKLKKFITNFAIRNSVDDEDSEIGKMAAICGYGARLAYID 151 (474) T ss_pred cccccchHHHHHHhHhhheeccceeEeeCCCC-cc----hHHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeC Confidence 4779999999999999999998888763322 22 34456667777778999999999999999999998877532 Q ss_pred ccccCCCCCCCcceeeEEeecchhhee--cCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCC Q lcl|Aclame:pro 142 LVNEYDPMDDRQRIAIEPIYDPSRSVW--FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNW 219 (708) Q Consensus 142 ~~~~~d~~~~~~~i~i~~v~~~~~~v~--~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~ 219 (708) . .+.+++..+ ++.++| || +.. +.-+ +++.|...+ . T Consensus 152 ~---------~~~~~~~~i--~p~~~~~v~d-~~~-----~~~~-~i~~~~~~~-------------------------~ 188 (474) T protein:vir:94 152 T---------NGDIRIKNI--DPYNVIFVGD-NIL-----EPTY-SLRYFYEKD-------------------------D 188 (474) T ss_pred C---------CCeeEEEEE--cccceEEEEc-CCC-----ceEE-EEEEEEEee-------------------------C Confidence 1 223444432 233332 32 111 1111 222221000 0 Q ss_pred CCCceeEEeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCC Q lcl|Aclame:pro 220 FGADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPR 299 (708) Q Consensus 220 ~~~~~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~ 299 (708) .+...+...++|+... ++.|. ... .+...+.++. T Consensus 189 ~~~~~~~~~~~y~~~~----~~~~~-----------~~~-------------------------------~~~~~~~~~~ 222 (474) T protein:vir:94 189 DNGTDYVYAEFYDNAY----YYVFR-----------GEG-------------------------------IDALQEVGRY 222 (474) T ss_pred CCceEEEEEEEEcCce----EEEEe-----------ecC-------------------------------CCcccccccc Confidence 0011112223332211 01111 000 0011122334 Q ss_pred CCCCCCcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCC Q lcl|Aclame:pro 300 RIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRP 379 (708) Q Consensus 300 ~~p~~~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~ 379 (708) +.+++.+|+|+|. +...|.|.+..+++.++.+|...|.+.+.+...+.+.+++- |. .+.++.... .... T Consensus 223 ~~~~g~vPvv~~~-------n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~-g~-~~~~~~~~~--~~~~ 291 (474) T protein:vir:94 223 EHLFDYNPLFGVP-------NNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLR-GM-GMSEEMIQE--TQKS 291 (474) T ss_pred cCCCCccceEEec-------CCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhc-cC-CCCchhhhh--hhhc Confidence 4555666666543 12346789999999999999999999999998888776653 32 111111111 1122 Q ss_pred ceeeecccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHccc-ccchhHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 380 AFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM-PSNIAQETVNNLMNRADMASF 458 (708) Q Consensus 380 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~-~~n~sg~ai~~~q~q~~~~~~ 458 (708) +.+..... . ....+...+.-..++...+......|...|++.+.+.+. .+|.||.|+..+......... T Consensus 292 ~~i~~~~~---~-------~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~ 361 (474) T protein:vir:94 292 GAFELFDK---D-------MDVKYLTKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGNVPIIGMKLKLMALENKCM 361 (474) T ss_pred ceeEecCC---C-------CceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHH Confidence 33322111 1 122333333334667778888899999999988766553 568899999988887777778 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchh Q lcl|Aclame:pro 459 IYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTA 538 (708) Q Consensus 459 ~~~dn~~~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~ 538 (708) .....|..+++++.++++.++..-.....-. . -.||.+.=.+..+. T Consensus 362 ~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~---------~-------------------------~~~i~~~f~~~~p~ 407 (474) T protein:vir:94 362 TFERKMTAMLRYQFKVILSALKRKGYNLDDD---------S-------------------------YLNLIFKFTRNIPV 407 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhccCCCCcc---------c-------------------------cccceEEeCCCCCC Confidence 7888888888888888888765422110000 0 01333333445555 Q ss_pred HHHHHHHHHHHHHHhccccCchhHHHHHHHHhhccc-hhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 539 RRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDG-EGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPN 617 (708) Q Consensus 539 ~r~~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~-~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~ 617 (708) ...+..+.++.+...+ ...+++.++.+ ...+...++++..... ..+..+. T Consensus 408 d~~e~a~~~~kl~g~i---------S~et~~~~l~~v~d~~~E~eri~~E~~e--------------------~~~~~~~ 458 (474) T protein:vir:94 408 NKLEESQVLINLKGQV---------SERTRLGQSQLVDDVDYELDEMEKESLE--------------------FNDKLPD 458 (474) T ss_pred CHHHHHHHHHHHhccC---------chHHHHHhCCCCCCHHHHHHHHHHHHHH--------------------HHhhccc Confidence 4445555555542111 11233333322 2233333333221100 0000000 Q ss_pred HHHHHHHHHHHHHHHH Q lcl|Aclame:pro 618 PEMVLAQAQMVAAQAE 633 (708) Q Consensus 618 ~~~~~aq~~~~~~qae 633 (708) ........+....+.+ T Consensus 459 ~~~~~~~~~~~~~~s~ 474 (474) T protein:vir:94 459 IDEGDANDKSQNNQSE 474 (474) T ss_pred ccCCCcCCCCccccCC Confidence 0000000000000111 No 69 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=99.66 E-value=2.5e-15 Score=100.63 Aligned_cols=451 Identities=13% Similarity=0.055 Sum_probs=215.3 Q ss_pred CCcchHHHHHHH------HHHHHHHHHhhHHHHHHHHHHHHHhhcCCCC--CCHHHHHH---hhh------hhhhcCCC- Q lcl|Aclame:pro 1 MAETLEKKHERI------MLRFDRAYSPQKEVREKCIEATRFARVPGGQ--WEGATAAG---TKL------DEQFEKYP- 62 (708) Q Consensus 1 ma~~~~~~~~~~------~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Q--w~~~~~~~---l~~------~~q~~grp- 62 (708) |- +.+.+..+ .+.+.+..+.....+++.... +.||.|.+ +....+.. +.. .....++| T Consensus 1 ~~--~~~~~~~~~~~~~~~e~i~~~i~~~~~~~~r~~~~--~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (474) T protein:vir:10 1 MT--LYKLIDDIEAQGILPKHIEALIESHKDDRERMVNL--YNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVN 76 (474) T ss_pred Cc--hHHHHhhccccCCCHHHHHHHHHHhhhhhHHHHHH--HHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcc Confidence 11 11111111 011222222222223333222 22333311 00000000 000 00011233 Q ss_pred -ceeecchHHHHHHHHHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEee Q lcl|Aclame:pro 63 -KFEINKVATELNRIIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSM 141 (708) Q Consensus 63 -~~~~N~i~~~i~~i~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~ 141 (708) .+++|..+.+|+..+|+...+.+.+.+.+.. +. .+.+...+..++..|+++.....+..+++++|.+|..+..+ T Consensus 77 ~ki~~n~~~~ivd~~~~yl~g~pv~~~~~~~~-~~----~e~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d 151 (474) T protein:vir:10 77 NKLNNSFDSEIVDTRVGYLHGVPVTYDLDENA-EK----NEKLKKFITNFAIRNSVDDEDSEIGKMAAICGYGARLAYID 151 (474) T ss_pred cccccchHHHHHHhHhhheeccceeEeeCCCC-cc----hHHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeC Confidence 4779999999999999999998888763322 22 34456667777778999999999999999999998877532 Q ss_pred ccccCCCCCCCcceeeEEeecchhhee--cCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCC Q lcl|Aclame:pro 142 LVNEYDPMDDRQRIAIEPIYDPSRSVW--FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNW 219 (708) Q Consensus 142 ~~~~~d~~~~~~~i~i~~v~~~~~~v~--~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~ 219 (708) . .+.+++..+ ++.++| || +.. +.-+ +++.|...+ . T Consensus 152 ~---------~~~~~~~~i--~p~~~~~v~d-~~~-----~~~~-~i~~~~~~~-------------------------~ 188 (474) T protein:vir:10 152 T---------NGDIRIKNI--DPYNVIFVGD-NIL-----EPTY-SLRYFYEKD-------------------------D 188 (474) T ss_pred C---------CCeeEEEEE--cccceEEEEc-CCC-----ceEE-EEEEEEEee-------------------------C Confidence 1 223444432 233332 32 111 1111 222221000 0 Q ss_pred CCCceeEEeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCC Q lcl|Aclame:pro 220 FGADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPR 299 (708) Q Consensus 220 ~~~~~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~ 299 (708) .+...+...++|+... ++.|. ... .+...+.++. T Consensus 189 ~~~~~~~~~~~y~~~~----~~~~~-----------~~~-------------------------------~~~~~~~~~~ 222 (474) T protein:vir:10 189 DNGTDYVYAEFYDNAY----YYVFR-----------GEG-------------------------------IDALQEVGRY 222 (474) T ss_pred CCceEEEEEEEEcCce----EEEEe-----------ecC-------------------------------CCcccccccc Confidence 0011112223332211 01111 000 0011122334 Q ss_pred CCCCCCcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCC Q lcl|Aclame:pro 300 RIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRP 379 (708) Q Consensus 300 ~~p~~~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~ 379 (708) +.+++.+|+|+|. +...|.|.+..+++.++.+|...|.+.+.+...+.+.+++- |. .+.++.... .... T Consensus 223 ~~~~g~vPvv~~~-------n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~-g~-~~~~~~~~~--~~~~ 291 (474) T protein:vir:10 223 EHLFDYNPLFGVP-------NNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLR-GM-GMSEEMIQE--TQKS 291 (474) T ss_pred cCCCCccceEEec-------CCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhc-cC-CCCchhhhh--hhhc Confidence 4555666666543 12346789999999999999999999999998888776653 32 111111111 1122 Q ss_pred ceeeecccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHccc-ccchhHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 380 AFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM-PSNIAQETVNNLMNRADMASF 458 (708) Q Consensus 380 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~-~~n~sg~ai~~~q~q~~~~~~ 458 (708) +.+..... . ....+...+.-..++...+......|...|++.+.+.+. .+|.||.|+..+......... T Consensus 292 ~~i~~~~~---~-------~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~ 361 (474) T protein:vir:10 292 GAFELFDK---D-------MDVKYLTKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGNVPIIGMKLKLMALENKCM 361 (474) T ss_pred ceeEecCC---C-------CceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHH Confidence 33322111 1 122333333334667778888899999999988766553 568899999988887777778 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchh Q lcl|Aclame:pro 459 IYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTA 538 (708) Q Consensus 459 ~~~dn~~~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~ 538 (708) .....|..+++++.++++.++..-.....-. . -.||.+.=.+..+. T Consensus 362 ~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~---------~-------------------------~~~i~~~f~~~~p~ 407 (474) T protein:vir:10 362 TFERKMTAMLRYQFKVILSALKRKGYNLDDD---------S-------------------------YLNLIFKFTRNIPV 407 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhccCCCCcc---------c-------------------------cccceEEeCCCCCC Confidence 7888888888888888888765422110000 0 01333333445555 Q ss_pred HHHHHHHHHHHHHHhccccCchhHHHHHHHHhhccc-hhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 539 RRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDG-EGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPN 617 (708) Q Consensus 539 ~r~~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~-~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~ 617 (708) ...+..+.++.+...+ ...+++.++.+ ...+...++++..... ..+..+. T Consensus 408 d~~e~a~~~~kl~g~i---------S~et~~~~l~~v~d~~~E~eri~~E~~e--------------------~~~~~~~ 458 (474) T protein:vir:10 408 NKLEESQVLINLKGQV---------SERTRLGQSQLVDDVDYELDEMEKESLE--------------------FNDKLPD 458 (474) T ss_pred CHHHHHHHHHHHhccC---------chHHHHHhCCCCCCHHHHHHHHHHHHHH--------------------HHhhccc Confidence 4445555555542111 11233333322 2233333333221100 0000000 Q ss_pred HHHHHHHHHHHHHHHH Q lcl|Aclame:pro 618 PEMVLAQAQMVAAQAE 633 (708) Q Consensus 618 ~~~~~aq~~~~~~qae 633 (708) ........+....+.+ T Consensus 459 ~~~~~~~~~~~~~~s~ 474 (474) T protein:vir:10 459 IDEGDANDKSQNNQSE 474 (474) T ss_pred ccCCCcCCCCccccCC Confidence 0000000000000111 No 70 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=99.66 E-value=1.5e-14 Score=96.41 Aligned_cols=448 Identities=12% Similarity=0.055 Sum_probs=208.5 Q ss_pred CC--cchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHH--HhhhhhhhcCCC--ceeecchHHHHH Q lcl|Aclame:pro 1 MA--ETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAA--GTKLDEQFEKYP--KFEINKVATELN 74 (708) Q Consensus 1 ma--~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~--~l~~~~q~~grp--~~~~N~i~~~i~ 74 (708) |. +...+...+++..| .+.+...+.+.. +-..||.|+| +-..+. ..........+| .++.|..+.+|+ T Consensus 35 ~~~~~~~~~~~~~~i~~~---i~~~~~~~~r~~--~l~~YY~g~~-~i~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd 108 (492) T protein:vir:97 35 IVRTNNKPETLEEMIVRY---IKQHLEKLPEIS--IGQEYYEQRP-DIVKEPKPVDATGAVDPLKPDDRMITNFHANLVD 108 (492) T ss_pred cccCCCchhhHHHHHHHH---HHHHHHHHHHHH--HHHHHhcccC-ccccccccccccccccccccccccccchHHHHHH Confidence 22 23233333333333 233333333332 3346899975 211000 000000001122 367899999999 Q ss_pred HHHHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCcc Q lcl|Aclame:pro 75 RIIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQR 154 (708) Q Consensus 75 ~i~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~ 154 (708) ..+|+...+.+.+.+ + |.+..+. ++.++ .|+++.....+..+++++|.||..+..+. ++. T Consensus 109 ~~~~yl~g~p~~~~~-----~-d~~~~~~----l~~~~-~n~~~~~~~~~~~~~~~~G~a~~~v~~d~---------dg~ 168 (492) T protein:vir:97 109 QKVSYIVGKPIAFKH-----T-DDEVVKR----IDEVL-GNRFDDKLHSVLTGASNKGIEWLHPYLDE---------EGE 168 (492) T ss_pred HHhhhhcccCceecc-----C-chHHHHH----HHHHH-hccHHHHHHHHHHHHhhcCeEEEEEEecC---------CCc Confidence 999999988877642 1 2233333 44444 37899999999999999999998775431 234 Q ss_pred eeeEEeecchhhee--cCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeee Q lcl|Aclame:pro 155 IAIEPIYDPSRSVW--FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYE 232 (708) Q Consensus 155 i~i~~v~~~~~~v~--~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~ 232 (708) +++..+ ++.++| ||+... . +. ..+++.|-..+ . ...++|. T Consensus 169 ~~~~~~--~p~~~~~i~d~~~~-~---~~-~~~vr~~~~~~----------------------------~---~~~~~y~ 210 (492) T protein:vir:97 169 FKLFRV--PAEQGIPIWTDKEH-E---EL-EAFIRMYKLEN----------------------------E---TKVEYWD 210 (492) T ss_pred eEEEEE--cccceEEEEcCCCC-C---ce-EEEEEEEeecc----------------------------c---eeEEEEe Confidence 454433 333443 553221 1 11 22333332100 0 0123333 Q ss_pred ecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEE Q lcl|Aclame:pro 233 VRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVY 312 (708) Q Consensus 233 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~ 312 (708) ..+... +.+.. +........ ..+. ......+.+++.+|+|+|. T Consensus 211 ~~~v~~--~~~~~---~~~~~~~~~-------------------------------~~~~-~~~~~~~~~~g~vPvv~~~ 253 (492) T protein:vir:97 211 KVTVNY--YVYEN---GSLIPDYSN-------------------------------NLEN-SKTHFSTGSWGKIPFIPFK 253 (492) T ss_pred cCeEEE--EEEec---Ceeeecccc-------------------------------cccc-cccccccCCCCCcceEEec Confidence 222111 11110 000000000 0000 0112234555666666543 Q ss_pred EeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeecccccccc Q lcl|Aclame:pro 313 GKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKSG 392 (708) Q Consensus 313 ~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 392 (708) - ...+.|.+..+++.++.+|...|.+.+.+...+.+.+++.-....+..+. ... ......+.... .+ T Consensus 254 n-------n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~-~~~-~~~~~~~~~~~----~~ 320 (492) T protein:vir:97 254 N-------NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKNYDDQELPEF-KRL-LRYYGAIKVSD----NG 320 (492) T ss_pred C-------CCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccchhH-HHH-HhhccceecCC----CC Confidence 2 22467899999999999999999999999998887766532111111111 111 11111221111 11 Q ss_pred cccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHccc-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 393 NIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM-PSNIAQETVNNLMNRADMASFIYLDNMAKSLKRA 471 (708) Q Consensus 393 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~-~~n~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~ 471 (708) ...+...+.-...+...+..+.+.|...|++.+.+.+. ++|.||.|+...............+.|..+++++ T Consensus 321 -------~~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~ 393 (492) T protein:vir:97 321 -------GVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQEL 393 (492) T ss_pred -------cceeEeccCCHHHHHHHHHHHHHHHHHHhCCCCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11222222223566777888888999999987766554 4578999998887777777777777778888887 Q ss_pred HHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHH Q lcl|Aclame:pro 472 GEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVL 551 (708) Q Consensus 472 ~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~ll 551 (708) .++++.++. +.++ + .+|.|.=.+..+....+..+.++.+. T Consensus 394 ~~li~~~~~----------~~~~-----~-------------------------~~i~v~f~~~~p~~~~e~a~~~~kl~ 433 (492) T protein:vir:97 394 LWFVFEHFD----------IKGE-----H-------------------------KDVDISFNYNKVANTELQVQTAQQSM 433 (492) T ss_pred HHHHHHHhc----------CCcc-----c-------------------------ceeeEEecCCCCCCHHHHHHHHHHHh Confidence 777666432 1110 0 12223334455544445555555542 Q ss_pred HhccccCchhHHHHHHHHhhccc-hhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 552 SSMLPTDPMRPAIQGIILDNIDG-EGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAA 630 (708) Q Consensus 552 q~~~~~~p~~~~~~~~~~~~~d~-~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~ 630 (708) ..+ ...++++++.+ ...++-.+++....... .+..+.... ...... - T Consensus 434 G~i---------S~et~l~~l~~v~d~~~Eleri~~E~~~~--------------------~~~~~~~~~--~~~~~~-~ 481 (492) T protein:vir:97 434 GIV---------SHETVLENHPFVEDLQAELERIEQEQTEY--------------------NKQLPNLDD--GGADSA-Q 481 (492) T ss_pred ccC---------chHHHHHhCCCCCCHHHHHHHHHHHHHHH--------------------HHhhhcccc--CCCCCC-c Confidence 111 11223333322 22222333333211000 000000000 000000 0 Q ss_pred HHHHHHHHHHHHHHH Q lcl|Aclame:pro 631 QAEAQKATNETAQTQ 645 (708) Q Consensus 631 qae~~k~~~~~~~~q 645 (708) +.+. ....+.+ T Consensus 482 ~~~~----~~~~~~e 492 (492) T protein:vir:97 482 QQER----SNNKESE 492 (492) T ss_pred cccc----ccccccC Confidence 0000 0000000 No 71 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=99.66 E-value=1.4e-14 Score=96.49 Aligned_cols=447 Identities=11% Similarity=0.040 Sum_probs=209.9 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHH--HhhhhhhhcCCC--ceeecchHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAA--GTKLDEQFEKYP--KFEINKVATELNRI 76 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~--~l~~~~q~~grp--~~~~N~i~~~i~~i 76 (708) +- ...+.+.+++ .+..+.....+.+.. +-..||.|+| +-..+. ..........+| .++.|..+.+|+.. T Consensus 29 ~~-~~~e~~~~~i---~~~i~~~~~~~~r~~--~l~~YY~g~~-~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~ 101 (483) T protein:vir:12 29 TN-NKPETLEEMI---VRYIKQHLEKLPEIS--IGQEYYEQRP-DIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQK 101 (483) T ss_pred cC-CchhhHHHHH---HHHHHHHHHHHHHHH--HHHHHhcccc-ccccccccccccccccccccccccccchHHHHHHHH Confidence 11 1222233332 333333333333333 3346899976 111110 000000001122 36789999999999 Q ss_pred HHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCccee Q lcl|Aclame:pro 77 IAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIA 156 (708) Q Consensus 77 ~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~ 156 (708) +|+...+.+.+.+ + |.+..+. ++.++. |+++.....+..+++++|.||..+..+. ++.++ T Consensus 102 ~~~l~G~p~~~~~-----~-d~~~~~~----l~~~~~-n~~~~~~~~~~~~~~~~G~~y~~v~~d~---------d~~~~ 161 (483) T protein:vir:12 102 VSYIVGKPIAFKH-----T-DDEVVKR----IDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDE---------EGEFK 161 (483) T ss_pred hhhhcccCceecc-----C-ChHHHHH----HHHHHh-ccHHHHHHHHHHHHhhCCeEEEEEEEcC---------CCceE Confidence 9999988877642 1 2233343 344433 6889999999999999999998886432 23445 Q ss_pred eEEeecchhhee--cCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeeeec Q lcl|Aclame:pro 157 IEPIYDPSRSVW--FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVR 234 (708) Q Consensus 157 i~~v~~~~~~v~--~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~ 234 (708) +..+ ++.++| ||+.... +. ..+++.|-..+ . ...++|... T Consensus 162 i~~~--~p~~~~~v~d~~~~~----~~-~~~ir~~~~~~----------------------------~---~~~~~y~~~ 203 (483) T protein:vir:12 162 LFRV--PAEQGIPIWTDKEHE----EL-EAFIRMYKLEN----------------------------E---TKVEYWDKV 203 (483) T ss_pred EEEE--cccceEEEEcCCCCC----ce-EEEEEEEEeec----------------------------c---eEEEEEecC Confidence 4433 334443 5542211 11 22233321100 0 002333322 Q ss_pred ceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEe Q lcl|Aclame:pro 235 KESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGK 314 (708) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~ 314 (708) +.... .+. .+......... .+. ......+.+++.+|+|+|.- T Consensus 204 ~v~~~--~~~---~~~~~~~~~~~-------------------------------~~~-~~~~~~~~~~g~vPvv~~~n- 245 (483) T protein:vir:12 204 TVNYY--VYE---NGSLIPDYSNN-------------------------------LEN-SKTHFSTGSWGKIPFIPFKN- 245 (483) T ss_pred eEEEE--EEe---CCeeeeccccc-------------------------------ccc-cccccccCCCCccceEEecC- Confidence 21110 000 00000000000 000 01112345556667665531 Q ss_pred eeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeecccccccccc Q lcl|Aclame:pro 315 RWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKSGNI 394 (708) Q Consensus 315 ~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 394 (708) ...+.|.+..+++.++.+|...|.+.+.+...+.+.+++--...++..+. ... ....+.+... ..+ T Consensus 246 ------n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~-~~~-~~~~~~~~~~----~~~-- 311 (483) T protein:vir:12 246 ------NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEF-KRL-LRYYGAIKVS----DNG-- 311 (483) T ss_pred ------CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhH-HHh-hhhccccccC----CCC-- Confidence 23467999999999999999999999999988888776632111211111 111 1111122111 111 Q ss_pred cccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHccc-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 395 IAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM-PSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAGE 473 (708) Q Consensus 395 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~-~~n~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~~~ 473 (708) ...+...+.-...+...+..+.+.|...|++.+.+.+. ++|.||.|+...............+.|..+++++.+ T Consensus 312 -----~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~ 386 (483) T protein:vir:12 312 -----GVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLW 386 (483) T ss_pred -----cceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 12223333234666678888888999999988766654 457899999888777777777777778888888777 Q ss_pred HHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHHHh Q lcl|Aclame:pro 474 VWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSS 553 (708) Q Consensus 474 ~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq~ 553 (708) ++++++. +.++ -.||.|.=.+..+....+..+.++.+... T Consensus 387 li~~~~~----------~~~~------------------------------~~~i~v~f~~~~p~~~~~~a~~~~kl~Gi 426 (483) T protein:vir:12 387 FVFEHFD----------IKGE------------------------------HKDVDISFNYNKVANTELQVQTAQQSMGI 426 (483) T ss_pred HHHHHhc----------CCCc------------------------------cceeeEEeCCCCCCCHHHHHHHHHHHhcc Confidence 7666532 1110 01233333445555455555555554221 Q ss_pred ccccCchhHHHHHHHHhhccc-hhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 554 MLPTDPMRPAIQGIILDNIDG-EGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQA 632 (708) Q Consensus 554 ~~~~~p~~~~~~~~~~~~~d~-~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qa 632 (708) + ...+++.++.+ ...+.-.+++...........+... ...........+. -+. T Consensus 427 i---------S~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~-------------~~~~d~~~~~~~~----~~~ 480 (483) T protein:vir:12 427 V---------SHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLD-------------DGGADGAQQQERS----NNK 480 (483) T ss_pred C---------chHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhccccc-------------ccccCCcccCCCC----Ccc Confidence 1 11223333322 2222333333322110000000000 0000000000000 000 Q ss_pred HHH Q lcl|Aclame:pro 633 EAQ 635 (708) Q Consensus 633 e~~ 635 (708) |-+ T Consensus 481 e~e 483 (483) T protein:vir:12 481 ESE 483 (483) T ss_pred cCC Confidence 000 No 72 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=99.66 E-value=2.3e-15 Score=100.89 Aligned_cols=453 Identities=11% Similarity=0.053 Sum_probs=212.9 Q ss_pred CCcch----HHHHHHH-----------HHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhh---hhhcCCC Q lcl|Aclame:pro 1 MAETL----EKKHERI-----------MLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLD---EQFEKYP 62 (708) Q Consensus 1 ma~~~----~~~~~~~-----------~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~---~q~~grp 62 (708) |+|.. +..+.+. .+.+.+..+.+.....+.... ..||.|+| +-..+ ..+.. ....++| T Consensus 1 ~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~--~~yY~g~~-~i~~~-~~~~~~~~~~~~~~~ 76 (478) T protein:vir:10 1 MISINWPWDKPYHEQVVEQIKPKYETQEEMILRLVREHKENIDNITMG--ERYYNHHP-DILDA-PPKRDVNGDYDETKP 76 (478) T ss_pred CccccCCCCchhHHHHHHHHhhccCCcHHHHHHHHHHHHHHHHHHHHH--HHHhcCCC-chhcc-ccccccccccccccc Confidence 88742 1222322 223333344444333333322 45888976 21111 11100 0011222 Q ss_pred --ceeecchHHHHHHHHHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEe Q lcl|Aclame:pro 63 --KFEINKVATELNRIIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTS 140 (708) Q Consensus 63 --~~~~N~i~~~i~~i~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~ 140 (708) .++.|..+.+|+..+|+...+.+.+.+ ++ .+..+.+..+ . .++++.....+..+++++|.||+.+.. T Consensus 77 ~~ki~~n~~~~ivd~~~~~l~g~~~~~~~-----~~-d~~~~~l~~~----~-~n~~~~~~~~~~~~~~~~G~~~~~~~~ 145 (478) T protein:vir:10 77 DWRMYTNYHQNLVDQKVAYAVANPVTFGV-----DN-DKALKQIQHT----L-NHKWDDKLVDILTAASNKGIEWVQPYV 145 (478) T ss_pred cceeccchHHHHHHHHHhhhccCCeeeec-----CC-hHHHHHHHHH----H-hcCHHHHHHHHHHHHHhcCeEEEEEEe Confidence 267899999999999999988888753 12 2334444433 3 368999999999999999999988754 Q ss_pred eccccCCCCCCCcceeeEEeecchhhee--cCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccC Q lcl|Aclame:pro 141 MLVNEYDPMDDRQRIAIEPIYDPSRSVW--FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYN 218 (708) Q Consensus 141 ~~~~~~d~~~~~~~i~i~~v~~~~~~v~--~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~ 218 (708) +. .+.+++..+ ++..+| ||+... .+.. .+++.|-.. . T Consensus 146 d~---------~g~~~~~~~--~p~~~~~i~d~~~~----~~~~-~~v~~~~~~-----------~-------------- 184 (478) T protein:vir:10 146 DE---------EGEFKTFRV--PAEQAVPIWTNKER----DELQ-AFIRVYELD-----------G-------------- 184 (478) T ss_pred cC---------CCeeEEEEE--cccceEEEEcCCCC----CceE-EEEEEEEec-----------C-------------- Confidence 32 234444433 233443 454322 1222 223332100 0 Q ss_pred CCCCceeEEeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecC Q lcl|Aclame:pro 219 WFGADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKP 298 (708) Q Consensus 219 ~~~~~~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~ 298 (708) ....++|...+.. .|.. .++.......... . -.....+... T Consensus 185 ------~~~~~~y~~~~i~----~~~~-~~~~~~~~~~~~~---------~-------------------~~~~~~~~~~ 225 (478) T protein:vir:10 185 ------AERVEYWTKDDVT----YYEL-KEGQLIPDFYRSD---------D-------------------HIQPHYYQGN 225 (478) T ss_pred ------ceEEEEEeCCeEE----EEEE-cCCeeeccccccc---------c-------------------ccccceeccc Confidence 0012233322211 1110 0111000000000 0 0001112233 Q ss_pred CCCCCCCcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhc-cchHHHHHhhccc Q lcl|Aclame:pro 299 RRIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQI-RGLEKHWEARNKK 377 (708) Q Consensus 299 ~~~p~~~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai-~~~~~~~~~~~~~ 377 (708) .+.+++.+|+++|. +...|.|.+..+++.++.+|...|.+...+...+.+.+++- |.. ++..+. ..+.. T Consensus 226 ~~~~~~~vPvv~~~-------n~~~g~sd~~~v~~liDa~~~~~S~~~~~~~~~~~p~~~~~-g~~~~~~~~~--~~~~~ 295 (478) T protein:vir:10 226 KLMSWGRVPFIPFK-------NNPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILK-GYEGEDMKDF--MHNLK 295 (478) T ss_pred ccccCCccceEEec-------cCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCceeeee-cCCccccchh--hhhhh Confidence 45666777776653 13356788999999999999999999999998888766543 321 111111 11112 Q ss_pred CCceeeecccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHccc-ccchhHHHHHHHHHHHHHH Q lcl|Aclame:pro 378 RPAFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM-PSNIAQETVNNLMNRADMA 456 (708) Q Consensus 378 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~-~~n~sg~ai~~~q~q~~~~ 456 (708) ....+.... ...+ .+.++..+.-...+...+..+...|...|++.+.+.+. .+|.||.|+.......... T Consensus 296 ~~~~~~~~~--~~~~-------~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k 366 (478) T protein:vir:10 296 YYKAISVAG--ESGS-------GVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLK 366 (478) T ss_pred hcceEEecC--CCCC-------cceEEeecCChHHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHHHHHHHHHH Confidence 222222211 1111 22333333234666778888889999999987766554 4678999999887777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccc Q lcl|Aclame:pro 457 SFIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSY 536 (708) Q Consensus 457 ~~~~~dn~~~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~ 536 (708) .......|..+++++.++++.+. . .+ .++ .+|.|.=.+.. T Consensus 367 ~~~~~~~~~~~l~~~~~li~~~~----g---------~~--~~~-------------------------~~i~i~f~~~~ 406 (478) T protein:vir:10 367 ANKLKNKTLTALQELLQYIIDFY----R---------LD--VKV-------------------------QDIEITFNFNV 406 (478) T ss_pred HHHHHHHHHHHHHHHHHHHHHHh----C---------CC--ccc-------------------------ccceEEecCCC Confidence 77777777788887777666543 1 00 000 12223233344 Q ss_pred hhHHHHHHHHHHHHHHhccccCchhHHHHHHHHhhccc-hhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 537 TARRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDG-EGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQ 615 (708) Q Consensus 537 ~~~r~~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~-~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~ 615 (708) +....+..+.++.+ ..+ .+ ..++++++.+ ...+...++++..... ..+.. T Consensus 407 p~d~~e~a~~~~kl-~g~---iS-----~et~~~~l~~v~D~~~E~~ri~~E~~~--------------------~~~~~ 457 (478) T protein:vir:10 407 MVNELENSQIAMNS-TGL---LS-----KETILSNHAWVEDPVAEMERIEQENIE--------------------LNQQL 457 (478) T ss_pred CCCHHHHHHHHHHH-hCC---CC-----hHHHHHhCCCCCCHHHHHHHHHHHHHH--------------------HHhhc Confidence 44344445555544 111 11 1233333322 2222223333221100 00000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 616 PNPEMVLAQAQMVAAQAEAQKATNE 640 (708) Q Consensus 616 ~~~~~~~aq~~~~~~qae~~k~~~~ 640 (708) ..... ......+.+-...+.+ T Consensus 458 ~~~~~----~~~~~~~~~~~~~~~~ 478 (478) T protein:vir:10 458 PDIEE----GLNGEQQRQSENNQPE 478 (478) T ss_pred ccccc----ccCCCCCCCCCCCCCC Confidence 00000 0000000000000000 No 73 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=99.65 E-value=1.1e-14 Score=97.19 Aligned_cols=436 Identities=9% Similarity=0.035 Sum_probs=207.2 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHh--hhhhhhcCCC--ceeecchHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGT--KLDEQFEKYP--KFEINKVATELNRI 76 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l--~~~~q~~grp--~~~~N~i~~~i~~i 76 (708) ..+...+.+.++.. .......+.. +-..||.|.| +-..+... .......++| -++.|..+.+|+.. T Consensus 24 ~~~~~~~~i~~~i~-------~~~~~~~~~~--~~~~Yy~g~~-~i~~r~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~ 93 (474) T protein:vir:95 24 QFETQEEMIIRLID-------DHRKQLDKIT--VGQRYYDKDN-DIVKQMKKVDVYGNIDYDKPDWRITTNFHQNLVDQK 93 (474) T ss_pred ccCChHHHHHHHHH-------HHHHHHHHHH--HHHHHhcccC-chhccccccccccccccccccceeccchHHHHHHHH Confidence 22222222332222 2222222222 2235899976 21111100 0000111223 46789999999999 Q ss_pred HHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCccee Q lcl|Aclame:pro 77 IAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIA 156 (708) Q Consensus 77 ~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~ 156 (708) +++...+.+.+.+ +|.+.. ++++.+.+ ++++.....+..++.++|.||..+..+. .+.++ T Consensus 94 ~~~l~g~p~~~~~------~d~~~~----~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~---------~~~~~ 153 (474) T protein:vir:95 94 VSYVASKPVTYSC------EDESVL----KIIHDVLD-TRWDNKLIDILTATSNKGIDWLQVYINE---------NGEMK 153 (474) T ss_pred HhhhccCCceecc------CchHHH----HHHHHHHh-ccHHHHHHHHHHHHhhcCcEEEEEEecC---------CCceE Confidence 9999998887653 223333 34444444 6899999999999999999998875421 23455 Q ss_pred eEEeecchhhee--cCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeeeec Q lcl|Aclame:pro 157 IEPIYDPSRSVW--FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVR 234 (708) Q Consensus 157 i~~v~~~~~~v~--~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~ 234 (708) +..+ ++.++| ||+... .+.. .+++.|...+ ....++|... T Consensus 154 i~~~--~p~~~~~v~d~~~~----~~~~-~~i~~~~~~~-------------------------------~~~~~~y~~~ 195 (474) T protein:vir:95 154 LFRV--PAEQAIPIWVDKER----EELK-SFIRYYKFNN-------------------------------EEKVEFWTDT 195 (474) T ss_pred EEEE--cccceEEEEcCCCC----CceE-EEEEEEEEcC-------------------------------eeEEEEEeCC Confidence 4433 233444 444221 1222 2222221000 0012334333 Q ss_pred ceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEe Q lcl|Aclame:pro 235 KESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGK 314 (708) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~ 314 (708) +.. .|.... +...... ......+.....+.+++.+|+++|.- T Consensus 196 ~~~----~~~~~~-~~~~~~~--------------------------------~~~~~~~~~~~~~~~~g~iPvv~~~n- 237 (474) T protein:vir:95 196 TVT----YYVLEN-GGLIPDY--------------------------------YYGANHIQSHFSNGNWGRVPFIAFKN- 237 (474) T ss_pred eEE----EEEEcC-Ccccccc--------------------------------ccCcccccccccccCCCccceEeecC- Confidence 211 111100 0000000 00111112223445567777776542 Q ss_pred eeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeecccccccccc Q lcl|Aclame:pro 315 RWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKSGNI 394 (708) Q Consensus 315 ~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 394 (708) ...|.|.+..+++.++.+|...|.+.+.+...+.+.+++.-...++...... .......+... ..| T Consensus 238 ------n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~--~~~~~~~i~~~----~~~-- 303 (474) T protein:vir:95 238 ------NPEEVSDIWMYKSLIDAIDKRLSDAQNMFDESVELIYILKGYEGQDLEEFMR--GLKYYKAINVD----GDG-- 303 (474) T ss_pred ------CCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhh--hhhccceeecc----CCC-- Confidence 2346788999999999999999999999988888877664322222222111 11112222211 111 Q ss_pred cccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHccc-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 395 IAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM-PSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAGE 473 (708) Q Consensus 395 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~-~~n~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~~~ 473 (708) ...+...+.-..++...+..+...|...|++.+.+.+. .+|.||.|+.................|..+++++.+ T Consensus 304 -----~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~ 378 (474) T protein:vir:95 304 -----GVETIQVEVPVSSTKEYIDLMRAYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLDLKANKLKNKATVAIQELIG 378 (474) T ss_pred -----ceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 12222222234666677888889999999988766554 467899999988877777777777778888888777 Q ss_pred HHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHHHh Q lcl|Aclame:pro 474 VWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSS 553 (708) Q Consensus 474 ~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq~ 553 (708) +++++. |.. .++ .+|.|.-.++.+..-.+..+.+.+ T Consensus 379 li~~~~-------------g~~--~d~-------------------------~~i~v~f~~~~p~d~~e~a~~~~~---- 414 (474) T protein:vir:95 379 FIIDFN-------------NLK--MDV-------------------------KDIEISFNFNRMMNDAEQSQIIAQ---- 414 (474) T ss_pred HHHHHh-------------CCC--ccc-------------------------ceeeEEeccCCCcCHHHHHHHHHh---- Confidence 776653 110 000 122222233334333333333332 Q ss_pred ccccCchhHHHHHHHHhhcc-chhHHHHHHHHHhhhhhhhc-------ccCcchHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 554 MLPTDPMRPAIQGIILDNID-GEGLDDFKEYNRNQLLISGI-------AKPRNEKEQQIVQQAQMAAQSQPN 617 (708) Q Consensus 554 ~~~~~p~~~~~~~~~~~~~d-~~~~~ei~e~~~~~~~~~~~-------~~~~~~~~~q~~~~~qq~qq~~~~ 617 (708) .+-. + ...++.++. ....++..+++......... ..+...++.. +.....++ T Consensus 415 ~g~i-S-----~et~i~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~~~~------~~~~~~~~ 474 (474) T protein:vir:95 415 SQYL-S-----RETLVKSSPLVDDYKAELERIEQEQMEYNKQLPNLDDGGADGAQQQE------RSNDKESE 474 (474) T ss_pred cCCC-c-----hHHHHHhCCCCCCHHHHHHHHHHHHHHHHhcccccccccCCCCcCCC------CCccCCCC Confidence 2211 1 122233322 22233333444322110000 0000000000 00000000 No 74 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=99.65 E-value=8.3e-15 Score=97.79 Aligned_cols=471 Identities=8% Similarity=0.003 Sum_probs=220.9 Q ss_pred CCc---chHHHHHHH---------------HHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHH-HHhhhhh---hh Q lcl|Aclame:pro 1 MAE---TLEKKHERI---------------MLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATA-AGTKLDE---QF 58 (708) Q Consensus 1 ma~---~~~~~~~~~---------------~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~-~~l~~~~---q~ 58 (708) ||+ .....+..+ ...+.+..+.+. +.+. .+-..||.|+|.-.... ....... .. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~--~~~~--~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~ 76 (503) T protein:vir:59 1 MADIYPLGKTHTEELNEIIVESAKEIAEPDTTMIQKLIDEHN--PEPL--LKGVRYYMCENDIEKKRRTYYDAAGQQLVD 76 (503) T ss_pred CcccccCChhhHHhHHHhhhhhhhhccchhHHHHHHHHHhhc--HHHH--HHHHHHhccccchhhccchhcccccccccc Confidence 443 111111111 111222222221 1222 23346899987311000 0000000 00 Q ss_pred cCCC--ceeecchHHHHHHHHHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEE Q lcl|Aclame:pro 59 EKYP--KFEINKVATELNRIIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCF 136 (708) Q Consensus 59 ~grp--~~~~N~i~~~i~~i~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~ 136 (708) ..+| .++.|..+.+|+..+|+...+.+.+.+ +|.+..+ +++.+.+ |+++.....+..+++++|.+|+ T Consensus 77 ~~~~~~ri~~n~~~~ivd~~~~yl~g~~~~~~~------~d~~~~~----~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~ 145 (503) T protein:vir:59 77 DTKTNNRTSHAWHKLFVDQKTQYLVGEPVTFTS------DNKTLLE----YVNELAD-DDFDDILNETVKNMSNKGIEYW 145 (503) T ss_pred cccccceeecchHHHHHHHHHhhhhcCCeeecc------CcHHHHH----HHHHHHh-cCHHHHHHHHHHHHhhCCeEEE Confidence 0111 356899999999999999999887642 2233333 4444443 7899999999999999999998 Q ss_pred EEEeeccccCCCCCCCcceeeEEeecchhhee--cCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccc Q lcl|Aclame:pro 137 RLTSMLVNEYDPMDDRQRIAIEPIYDPSRSVW--FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTS 214 (708) Q Consensus 137 ~v~~~~~~~~d~~~~~~~i~i~~v~~~~~~v~--~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~ 214 (708) .+..+. ++.+++..+ ++..+| ||+... . .. .++++.|...+ T Consensus 146 ~v~~d~---------dg~~~i~~~--~p~~~~~i~d~~~~-~---~~-~~~ir~~~~~~--------------------- 188 (503) T protein:vir:59 146 HPFVDE---------EGEFDYVIF--PAEEMIVVYKDNTR-R---DI-LFALRYYSYKG--------------------- 188 (503) T ss_pred EEeecC---------CCceEEEEE--ccceeEEEEeCCCC-C---ce-EEEEEEEEEec--------------------- Confidence 886532 234555543 334443 554321 1 11 12333332110 Q ss_pred cccCCCCCceeEEeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEeccee Q lcl|Aclame:pro 215 WEYNWFGADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGF 294 (708) Q Consensus 215 ~~~~~~~~~~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~i 294 (708) .+.+.+...++|+..+.. .|.....+.......... -....+ T Consensus 189 -----~~~~~~~~~evy~~~~i~----~~~~~~~~~~~~~~~~~~-----------------------------~~~~~~ 230 (503) T protein:vir:59 189 -----IMGEETQKAELYTDTHVY----YYEKIDGVYQMDYSYGEN-----------------------------NPRPHM 230 (503) T ss_pred -----CCCceEEEEEEEeCCcEE----EEEEcCCccccccccccc-----------------------------ccccce Confidence 011223345566554422 111111110000000000 000112 Q ss_pred eecCCCCCCCCcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhh Q lcl|Aclame:pro 295 LEKPRRIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEAR 374 (708) Q Consensus 295 l~~~~~~p~~~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~ 374 (708) .....+.+++.+|+++|.. ...+.|.+..+++.++.+|...|.+.+.+...+.+.+++.-.-.++..+... T Consensus 231 ~~~~~~~~~~~vPiv~~~n-------n~~~~sd~~~~~~liDa~d~~~s~~~~~~~~~~~~~~v~~g~~~~~~~~~~~-- 301 (503) T protein:vir:59 231 TKGGQAIGWGRVPIIPFKN-------NEEMVSDLKFYKDLIDNYDSITSSTMDSFSDFQQIVYVLKNYDGENPKEFTA-- 301 (503) T ss_pred eecceeccCCccceEEecC-------CCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhcCCeeEeecCCccccchhhh-- Confidence 2344566677777776531 2346788999999999999999999999999988877764211121111111 Q ss_pred cccCCceeeecccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHccc-ccchhHHHHHHHHHHH Q lcl|Aclame:pro 375 NKKRPAFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM-PSNIAQETVNNLMNRA 453 (708) Q Consensus 375 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~-~~n~sg~ai~~~q~q~ 453 (708) +.....++.... .| ...+.....-...+...++.....|...+++.+.+.+. .+|.||.|+....... T Consensus 302 ~~~~~~~~~~~~----~~-------~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l 370 (503) T protein:vir:59 302 NLRYHSVIKVSG----DG-------GVDTLRAEIPVDSAAKELERIQDELYKSAQAVDNSPETIGGGATGPALENLYALL 370 (503) T ss_pred hhhcccceeccC----CC-------cceeEeccCCHHHHHHHHHHHHHHHHHHhcccCCCcccccccccHHHHHHHHHHH Confidence 111122221111 11 12222222223566667788888888888887755443 4678999999888777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeec Q lcl|Aclame:pro 454 DMASFIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVG 533 (708) Q Consensus 454 ~~~~~~~~dn~~~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~ 533 (708) ........+.|..+++++.++++.++....... .. ...+|.|.=. T Consensus 371 ~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~--------~~---------------------------~~~~i~i~f~ 415 (503) T protein:vir:59 371 DLKANMAERKIRAGLRLFFWFFAEYLRNTGKGD--------FN---------------------------PDKELTMTFT 415 (503) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcc--------cc---------------------------cccceeEEeC Confidence 777777777788888888877777664332210 00 0013344434 Q ss_pred ccchhHHHHHHHHHHHHHHhccccCchhHHHHHHHHhhccc-hhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHH Q lcl|Aclame:pro 534 PSYTARRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDG-EGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAA 612 (708) Q Consensus 534 ~~~~~~r~~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~-~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~q 612 (708) +..+....+..+.++.+.+++. . + ..++++++.+ +..++..+++...... ..+ T Consensus 416 ~~~p~d~~~~~~~~~kl~~~Gi-i-S-----~et~l~~l~~v~d~~~E~~ri~~E~~~-------------------~~~ 469 (503) T protein:vir:59 416 RTRIQNDSEIVQSLVQGVTGGI-M-S-----KETAVARNPFVQDPEEELARIEEEMNQ-------------------YAE 469 (503) T ss_pred CCCCCCHHHHHHHHHHHHhCCC-C-c-----hHHHHHhCCCCCCHHHHHHHHHHHHHH-------------------HHh Confidence 4555556667777777765431 1 1 1223333322 1222222333211100 000 Q ss_pred HHHHHHHHHHHH-HHHH-HHHH-HHHHHH-HHHH Q lcl|Aclame:pro 613 QSQPNPEMVLAQ-AQMV-AAQA-EAQKAT-NETA 642 (708) Q Consensus 613 q~~~~~~~~~aq-~~~~-~~qa-e~~k~~-~~~~ 642 (708) +........... -+.. .-.. +.++.. .++. T Consensus 470 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 503 (503) T protein:vir:59 470 MQGNLLDDEGGDDDLEEDDPNAGAAESGGAGQVS 503 (503) T ss_pred hhccccCccCCCCCCCcCCCCCCcccCCCCCCcC Confidence 000000000000 0000 0000 000000 0000 No 75 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=99.65 E-value=5.7e-15 Score=98.67 Aligned_cols=462 Identities=12% Similarity=0.049 Sum_probs=214.4 Q ss_pred CCcchHHH-HHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCC--ceeecchHHHHHHHH Q lcl|Aclame:pro 1 MAETLEKK-HERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYP--KFEINKVATELNRII 77 (708) Q Consensus 1 ma~~~~~~-~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp--~~~~N~i~~~i~~i~ 77 (708) |.+ .+.+ .++++..+......+.+ +. ++-..||.|+|..-..+..... ..++| .++.|..+.+|+..+ T Consensus 16 ~~~-~~~l~~~~i~~li~~~~~~~~~---r~--~~l~~YY~g~~~~i~~~~~~~~---~~~~~~~ki~~n~~~~Iv~~~~ 86 (506) T protein:vir:94 16 QES-LENLTPNKIMKFITHHFNYQRP---RL--EMLDDYYQGYNLKILDKQSRRH---EDGKADHRATHSFAKYIADFQT 86 (506) T ss_pred ccc-hhcCCHHHHHHHHHHHHHHHHH---HH--HHHHHHhcCCCccccccccccc---cccCCcceeecchHHHHHHHhh Confidence 432 1222 22333333332222222 12 2223589998753211111111 12233 367899999999999 Q ss_pred HHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCcceee Q lcl|Aclame:pro 78 AEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAI 157 (708) Q Consensus 78 g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~i 157 (708) |+...+.+.+.+ -+ + + ..+.+..+++.|+++.....+..+++++|.+|..+..+. .+.+++ T Consensus 87 ~~l~G~p~~~~~--~d---~-~----~~~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~de---------d~~~~i 147 (506) T protein:vir:94 87 SYSVGNPINVKL--PD---D-G----SNSGFDTFNKANDVDAENYDLFLDMSRYGRAYEYVYRGE---------DNEEHL 147 (506) T ss_pred hhhcccCceeec--Cc---c-h----HHHHHHHHHhccCHhHHHHHHHHHHHhcCeEEEEEEecC---------CCeeEE Confidence 999998777653 21 1 1 135577777889999999999999999999998886531 124444 Q ss_pred EEeecchhhee--cCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeeeecc Q lcl|Aclame:pro 158 EPIYDPSRSVW--FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVRK 235 (708) Q Consensus 158 ~~v~~~~~~v~--~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~~ 235 (708) ..+ ++..++ ||+... .....+++.|.....- . ..........+.|...+ T Consensus 148 ~~~--~p~~~~~v~dd~~~-----~~~~~~v~~~~~~~~~------------------~----~~~~~~~~~~~~yt~~~ 198 (506) T protein:vir:94 148 AKL--DPLDTFVIYSTDVD-----PKPIMAVRYHQIELVD------------------D----NQVSTINYVPETWTADT 198 (506) T ss_pred EEE--cccceEEEecCCCC-----CceEEEEEEEeeeecc------------------C----CceeEEEEEEEEEeCce Confidence 433 233332 333211 1122333333221100 0 00000001111121110 Q ss_pred eEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEee Q lcl|Aclame:pro 236 ESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKR 315 (708) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~~ 315 (708) ...+... .....+....+.+++.+|+++|.- T Consensus 199 ---------------~~~~~~~--------------------------------~~~~~~~~~~~~~~g~vPvv~~~n-- 229 (506) T protein:vir:94 199 ---------------YTLYNPT--------------------------------PIMGKMQVDTTKPITTFPVVEFKN-- 229 (506) T ss_pred ---------------EEEeccc--------------------------------cCccceeccccccCCccceEEecC-- Confidence 0001000 001112223445667777776532 Q ss_pred eccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccc----------------------hHHHHHh Q lcl|Aclame:pro 316 WFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRG----------------------LEKHWEA 373 (708) Q Consensus 316 ~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~----------------------~~~~~~~ 373 (708) ...+.|.+..+++.++.+|...|.+.+.+...+.+.+++-...... ......- T Consensus 230 -----~~~~~sd~e~~~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 304 (506) T protein:vir:94 230 -----SNFRLGDFENVLPLIDLYDAAQSDTANYMTDLNEAMLIIQGDIDTLFEGSDMMNTIDPNDEDAMAKLAKDKLELI 304 (506) T ss_pred -----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhHHHHHhcCccccccchhccccccccccccccccccchhHHH Confidence 1236689999999999999999999998876665544432100000 0000000 Q ss_pred hcccCCceeeecccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHccc-ccchhHHHHHHHHHH Q lcl|Aclame:pro 374 RNKKRPAFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM-PSNIAQETVNNLMNR 452 (708) Q Consensus 374 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~-~~n~sg~ai~~~q~q 452 (708) ........+.+.......+. +....+.+...+.-..++...+..+...|...|++.+.+.+. .+|.||.|+..+... T Consensus 305 ~~~~~~~~~~~~~~~~~~~~--~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Aik~~~~~ 382 (506) T protein:vir:94 305 KEMKDANMLLLKSGMTVNGT--QTSVDAKYINKTYDVVGSEAYKKRVAGDIHKFSHTPDLTDENFASNSSGVAMQYKVLG 382 (506) T ss_pred hhhhhcCeeeecccccccCc--cccccceeeeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccchHHHHHHHHHH Confidence 00111122222221111111 111123334444445777788888999999999988765543 467899999988887 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEee Q lcl|Aclame:pro 453 ADMASFIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDV 532 (708) Q Consensus 453 ~~~~~~~~~dn~~~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~ 532 (708) ...........|..+++++.++++.++...... ...++ .+|.|.= T Consensus 383 l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~----------~~~d~-------------------------~~i~i~f 427 (506) T protein:vir:94 383 TVELASTKRRMFERGLYARYQIISDIENSIHGD----------WTFDP-------------------------QELTFTF 427 (506) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc----------ccccc-------------------------ccceEEe Confidence 777777777788888888888888876543210 00000 1222323 Q ss_pred cccchhHHHHHHHHHHHHHHhccccCchhHHHHHHHHhhccc-hhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHH Q lcl|Aclame:pro 533 GPSYTARRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDG-EGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMA 611 (708) Q Consensus 533 ~~~~~~~r~~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~-~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~ 611 (708) .+..+..-.+..+.+..+...+ ....+++++.+ ...++-.+++.+.... T Consensus 428 ~~~~p~d~~e~a~~~~kl~g~i---------S~et~~~~lp~v~d~~~E~~ri~~E~~~--------------------- 477 (506) T protein:vir:94 428 RDNLPADNISQIKALVQAGATL---------PQKYLYQQLPGVTNPQDIVDMMKEQSAN--------------------- 477 (506) T ss_pred CCCCCcCHHHHHHHHHHHhccC---------ChHHHHHhCCCCCCHHHHHHHHHHHHHH--------------------- Confidence 4444444444555555442111 11223333322 1112222222211100 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 612 AQSQPNPEMVLAQAQMVAAQAEAQKATNETAQ 643 (708) Q Consensus 612 qq~~~~~~~~~aq~~~~~~qae~~k~~~~~~~ 643 (708) ..........-...... -+......+..+ T Consensus 478 --~~~~~~~~~~~~~~~~~-~~~~~~~~~e~~ 506 (506) T protein:vir:94 478 --GDYSFDQNGVISNDGQT-NTTATQTDEEVR 506 (506) T ss_pred --HhhcchhhcCCCcccCc-cccccccccCCC Confidence 00000000000000000 000000000000 No 76 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=99.64 E-value=4.5e-14 Score=93.74 Aligned_cols=438 Identities=11% Similarity=0.028 Sum_probs=205.7 Q ss_pred CCcc----------------------hHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhh-- Q lcl|Aclame:pro 1 MAET----------------------LEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDE-- 56 (708) Q Consensus 1 ma~~----------------------~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~-- 56 (708) |++. ..+.+.++. +.......+.. +-..||.|+| +-..+....... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i-------~~~~~~~~~~~--~~~~yY~g~~-~i~~~~~~~~~~~~ 70 (468) T protein:vir:96 1 MIDIFWPNEKPYHERVVEQIKPQYETQEEMILRLI-------TKHKENVEDIT--VGERYYNHQP-DVLFNAPKRNVKGE 70 (468) T ss_pred CccccCCcCceeehheeecccccccCcHHHHHHHH-------HHHHHHHHHHH--HHHHHhcCCC-cccccccccccccc Confidence 3332 222222222 22222222222 2235789976 111111000000 Q ss_pred hhcCCC--ceeecchHHHHHHHHHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCee Q lcl|Aclame:pro 57 QFEKYP--KFEINKVATELNRIIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFG 134 (708) Q Consensus 57 q~~grp--~~~~N~i~~~i~~i~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G 134 (708) ....+| .+++|..+.+++..+|+...+.+.+.+ +|.+..+.+.. +++ ++++.....+..++.++|.| T Consensus 71 ~~~~~~~~ki~~n~~~~Iv~~~~~~l~g~p~~~~~------~d~~~~~~l~~----~~~-n~~~~~~~~~~~~~~~~G~~ 139 (468) T protein:vir:96 71 IDPFKPDWRMYTNYHQNLVDQKVAYAVANPVTYGT------EDEKSLKTIQE----VLN-HKWDDKLVDILTAASNKGVE 139 (468) T ss_pred ccccccccccccchHHHHHHHHHhhhccCCceecc------CChHHHHHHHH----HHh-cCHHHHHHHHHHHHhhcCeE Confidence 001122 477899999999999999998888753 12233343333 333 68888999999999999999 Q ss_pred EEEEEeeccccCCCCCCCcceeeEEeecchhhee--cCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccc Q lcl|Aclame:pro 135 CFRLTSMLVNEYDPMDDRQRIAIEPIYDPSRSVW--FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSM 212 (708) Q Consensus 135 ~~~v~~~~~~~~d~~~~~~~i~i~~v~~~~~~v~--~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~ 212 (708) |..+..+. ++.+++..+ ++.++| ||+... .+.. .+++.|.. +. T Consensus 140 ~~~v~~d~---------~~~~~i~~~--~p~~~~~v~~~~~~----~~~~-~~ir~~~~-~~------------------ 184 (468) T protein:vir:96 140 WIQPYVDE---------QGEFKTFRV--PAEQAIPIWTNKER----DELK-AFIRLYEL-DG------------------ 184 (468) T ss_pred EEEEEEcC---------CCceEEEEE--cccceEEEEcCCCC----CceE-EEEEEEEe-cC------------------ Confidence 98876532 234555544 334444 443221 1222 22323310 00 Q ss_pred cccccCCCCCceeEEeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecc Q lcl|Aclame:pro 213 TSWEYNWFGADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGD 292 (708) Q Consensus 213 ~~~~~~~~~~~~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~ 292 (708) . .-.++|...+.. .|... ++....-..... ..... T Consensus 185 ---------~---~~~~~~~~~~~~----~~~~~-~~~~~~~~~~~~----------------------------~~~~~ 219 (468) T protein:vir:96 185 ---------G---ERVEYWTANDVT----FYELK-DGQLIPDYYQGE----------------------------EHVQA 219 (468) T ss_pred ---------c---eEEEEEeCCeEE----EEEEc-CCceeecccccc----------------------------ccccc Confidence 0 001223222111 11111 111100000000 00111 Q ss_pred eeeecCCCCCCCCcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHH Q lcl|Aclame:pro 293 GFLEKPRRIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWE 372 (708) Q Consensus 293 ~il~~~~~~p~~~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~ 372 (708) ..+....+.+++.+|+++|.- ...|.|.+..+++.++.+|...|.+.+.+...+.+.+++.-...++...... T Consensus 220 ~~~~~~~~~~~~~iPvv~~~n-------~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~ 292 (468) T protein:vir:96 220 HYYVGNKSMSWNRVPFIPFKN-------NPQEVSDLFMYKTIIDAMDKRLSDTQNTFDEATELIYVLKGYEGEDLEEFMY 292 (468) T ss_pred ceeeccccccCCcccEEEecC-------CCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCccccchhhh Confidence 223344567777788876532 2346788999999999999999999999988888777664222222222111 Q ss_pred hhcccCCceeeecccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHccc-ccchhHHHHHHHHH Q lcl|Aclame:pro 373 ARNKKRPAFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM-PSNIAQETVNNLMN 451 (708) Q Consensus 373 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~-~~n~sg~ai~~~q~ 451 (708) .......+.+.. +..| .+.+...+.-...+...++.....|...|++.+.+.+. .+|.||.|+..... T Consensus 293 --~~~~~~~i~~~~--d~~~-------~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~ 361 (468) T protein:vir:96 293 --NLKYYKAINVDG--DGSG-------GVDTIQIDVPVQSAKEYLDMLRDYVIEFGQGVDFQQDKFGNSPSGIALKFMYS 361 (468) T ss_pred --hhhcCceEEecC--CCCC-------cceEEeecCChHHHHHHHHHHHHHHHHHhCcccccccccccchHHHHHHHHHH Confidence 112222332221 1111 12333333334666777888899999999987765543 45789999988877 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEe Q lcl|Aclame:pro 452 RADMASFIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVD 531 (708) Q Consensus 452 q~~~~~~~~~dn~~~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~ 531 (708) ............|..+++++.++++.++ . .+. +. .+|.|. T Consensus 362 ~l~~k~~~k~~~~~~~l~~~~~li~~~~----g---------~~~------------d~---------------~~i~i~ 401 (468) T protein:vir:96 362 NLDLKANKLKNKTLTALQELLQYIIDFY----K---------LSI------------KV---------------QDVEIT 401 (468) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHh----C---------CCc------------cc---------------ceeeEE Confidence 7777777777777788877777666542 1 100 00 122222 Q ss_pred ecccchhHHHHHHHHHHHHHHhccccCchhHHHHHHHHhhccc-hhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHH Q lcl|Aclame:pro 532 VGPSYTARRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDG-EGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQM 610 (708) Q Consensus 532 ~~~~~~~~r~~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~-~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq 610 (708) =.++.+....+..+.+. ..+-. ...++++++.+ ...++-.+++..... + T Consensus 402 f~~~~p~d~~e~a~~~~----~~g~i------S~et~i~~l~~v~D~~~E~~ri~~E~~--------------------~ 451 (468) T protein:vir:96 402 FNFNVMVNELEQSQIGV----NSQYL------SKETVVTNHPWVDDPVAEMERIDQEEL--------------------A 451 (468) T ss_pred ecCCCCcCHHHHHHHHH----hcCCC------chHHHHHhCCCCCCHHHHHHHHHHHHH--------------------H Confidence 22333333333333322 22211 11223333322 112222222221100 0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 611 AAQSQPNPEMVLAQAQMVAAQAEAQKA 637 (708) Q Consensus 611 ~qq~~~~~~~~~aq~~~~~~qae~~k~ 637 (708) ..+.+. ...- ...-.-. T Consensus 452 ~~~~~~---------~~~~-~~~~~~~ 468 (468) T protein:vir:96 452 LPSIEE---------GLNG-KENNEPT 468 (468) T ss_pred HHHHhh---------ccCC-CCCCCCC Confidence 000000 0000 0000000 No 77 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=99.64 E-value=6.3e-15 Score=98.45 Aligned_cols=453 Identities=11% Similarity=0.032 Sum_probs=211.3 Q ss_pred CCcc----hHHHHHHHH-----------HHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhh---hhhhcCCC Q lcl|Aclame:pro 1 MAET----LEKKHERIM-----------LRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKL---DEQFEKYP 62 (708) Q Consensus 1 ma~~----~~~~~~~~~-----------~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~---~~q~~grp 62 (708) |+|. ....+.+.+ +.+.+..+.+...+.+... -..||.|+| +- ..+..+. .....++| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~--~~~Yy~g~~-~i-~~~~~~~~~~~~~~~~~~ 76 (478) T protein:vir:10 1 MISINWPWDKPYHEQVVEQIKPKYETQEEMILRLVREHKENIDNITM--GERYYNHHP-DI-LDAPFKRDVNGDYDETKP 76 (478) T ss_pred CccccccCCchhhhHHHHHhhhccCChHHHHHHHHHHHHHHHHHHHH--HHHHhcccc-cc-cccchhhhcccccccccc Confidence 7763 111222222 2223333333333333332 245888976 21 1111110 01112233 Q ss_pred --ceeecchHHHHHHHHHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEe Q lcl|Aclame:pro 63 --KFEINKVATELNRIIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTS 140 (708) Q Consensus 63 --~~~~N~i~~~i~~i~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~ 140 (708) .+++|..+.+|+..+|+...+.+.+.+ ++.+..+.|. .+++ |+++.....+..++.++|.||..+.. T Consensus 77 ~~ki~~n~~k~ivd~~~~yl~g~p~~~~~------~~~~~~~~l~----~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~ 145 (478) T protein:vir:10 77 DWRMYTNYHQNLVDQKVAYAVANPVTFGV------DNDKALKQIQ----HTLN-HKWDDKLVDILTAASNKGIEWVQPYV 145 (478) T ss_pred cceeccchHHHHHHHHhhhhcccCceeec------CChHHHHHHH----HHHh-ccHHHHHHHHHHHHhhCCeEEEEEEe Confidence 267899999999999999999887753 1223333333 3333 78999999999999999999988865 Q ss_pred eccccCCCCCCCcceeeEEeecchhhee--cCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccC Q lcl|Aclame:pro 141 MLVNEYDPMDDRQRIAIEPIYDPSRSVW--FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYN 218 (708) Q Consensus 141 ~~~~~~d~~~~~~~i~i~~v~~~~~~v~--~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~ 218 (708) +. ++.+++..+ ++.+++ ||+... .+..+ +++.|-. +. T Consensus 146 d~---------~~~~~~~~~--~p~~~~~v~d~~~~----~~~~~-~ir~~~~-~~------------------------ 184 (478) T protein:vir:10 146 DE---------EGEFKTFRV--PAEQAVPIWTNKER----DELQA-FIRVYEL-DG------------------------ 184 (478) T ss_pred cC---------CCceEEEEE--cccceEEEEcCCCC----CceEE-EEEEEee-eC------------------------ Confidence 32 134444433 233443 443221 12222 2222211 00 Q ss_pred CCCCceeEEeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecC Q lcl|Aclame:pro 219 WFGADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKP 298 (708) Q Consensus 219 ~~~~~~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~ 298 (708) ....++|...+.. .|.. ..+.... ... ..........+... T Consensus 185 ------~~~~~~y~~~~i~----~~~~-~~~~~~~--~~~--------------------------~~~~~~~~~~~~~~ 225 (478) T protein:vir:10 185 ------AERVEYWTKDDVT----FYEL-KEGQLIP--DFY--------------------------RSEDHIQPHYYQGN 225 (478) T ss_pred ------ceEEEEEeCCcEE----EEEe-cCCeeec--ccc--------------------------ccccccccceeccc Confidence 0012333332211 0110 0111000 000 00000011122334 Q ss_pred CCCCCCCcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhc-cchHHHHHhhccc Q lcl|Aclame:pro 299 RRIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQI-RGLEKHWEARNKK 377 (708) Q Consensus 299 ~~~p~~~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai-~~~~~~~~~~~~~ 377 (708) .+.+++.+|+++|.- ...+.|.+..+++.++.+|...|.+.+.+...+.+.+++- |.. ++..+... +.. T Consensus 226 ~~~~~g~vPvv~~~n-------~~~g~sd~e~v~~liDa~~~~~S~~~~~~~~~~~~~~~~~-g~~~~~~~~~~~--~~~ 295 (478) T protein:vir:10 226 KLMSWGRVPFIPFKN-------NPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILK-GYEGEDMKDFMH--NLK 295 (478) T ss_pred ccccCCcceEEEecc-------CCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCcceeee-cCCcccccchhh--hhh Confidence 566777777776542 2346788999999999999999999999988887766553 221 11111111 111 Q ss_pred CCceeeecccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHccc-ccchhHHHHHHHHHHHHHH Q lcl|Aclame:pro 378 RPAFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM-PSNIAQETVNNLMNRADMA 456 (708) Q Consensus 378 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~-~~n~sg~ai~~~q~q~~~~ 456 (708) ..+.+.+... ..+ .+.++..+.-..++...++.+.+.|...|++.+.+.+. .+|.||.|+..+....... T Consensus 296 ~~~~~~~~~~--~~~-------~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k 366 (478) T protein:vir:10 296 YYKAISVAGE--SGS-------GVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLK 366 (478) T ss_pred hCceeEecCC--CCC-------cceEEeecCCHHHHHHHHHHHHHHHHHHhCCcCcCccccccchHHHHHHHHHHHHHHH Confidence 2222222211 111 12333333334666777888889999999987765553 4678999999887777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccc Q lcl|Aclame:pro 457 SFIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSY 536 (708) Q Consensus 457 ~~~~~dn~~~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~ 536 (708) .......|..+++++.++++.+.-.-+ ++ .||.|.-.+.. T Consensus 367 ~~~~~~~~~~~l~~~~~li~~~~~~~~---------------d~-------------------------~~i~i~f~~~~ 406 (478) T protein:vir:10 367 ANKLKNKTLTALQELLQYIIDFYRLDV---------------RV-------------------------QDIEITFNFNV 406 (478) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCc---------------cc-------------------------ccceEEeCCCC Confidence 777777777777777776665431100 00 12223334444 Q ss_pred hhHHHHHHHHHHHHHHhccccCchhHHHHHHHHhhccc-hhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 537 TARRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDG-EGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQ 615 (708) Q Consensus 537 ~~~r~~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~-~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~ 615 (708) +..-.+..+.++.+. + ..+ ..++++++.+ ...+...++++...... .++. T Consensus 407 p~~~~e~~~~~~~~~---g-~iS-----~et~i~~~~~v~d~~~E~~ri~~E~~~~--------------------~~~~ 457 (478) T protein:vir:10 407 MVNELENSQIAMNST---G-LLS-----KETILGNHSWVQDPVAEMERIEQENIEL--------------------NQQL 457 (478) T ss_pred CCCHHHHHHHHHHHh---C-CCC-----hHHHHHhCCCCCCHHHHHHHHHHHHHHH--------------------HHhc Confidence 443334444444331 1 111 1222333221 12222223332211000 0000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 616 PNPEMVLAQAQMVAAQAEAQKAT 638 (708) Q Consensus 616 ~~~~~~~aq~~~~~~qae~~k~~ 638 (708) +... .........+.+-...+ T Consensus 458 ~~~~--~~~~d~~~~~~~d~~~e 478 (478) T protein:vir:10 458 PDIE--EGLNDEQQRQSEDNQSE 478 (478) T ss_pred cccC--CCCcccccccCcCCCCC Confidence 0000 00000000000000000 No 78 >protein:vir:100039 Length: 522 # NCBI annotation: T7-like head-to-tail connector # Family: family:all:481 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214201;genbank:gi:61806424;genbank:GeneID:3294719 Probab=99.63 E-value=1e-13 Score=91.83 Aligned_cols=501 Identities=12% Similarity=0.057 Sum_probs=230.5 Q ss_pred cchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhc-CCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHHHHHh Q lcl|Aclame:pro 3 ETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARV-PGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEYR 81 (708) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~-~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~g~~~ 81 (708) .+ ++.+|....+....|...|+++.+|.+= -+.=......... + ...+.-..-...++.+.+... T Consensus 1 m~-------~~~r~~~L~~~R~~~e~~w~e~~~~tlP~~~~~~~~~~~~~~--~-----~~~~~dstg~~a~~~LAa~l~ 66 (522) T protein:vir:10 1 MK-------ARERYNQLTTARQMFLDKAVECSELTLPYLIDDDISSRPNHK--S-----LTVPWQSVGAKCCVTLAAKLM 66 (522) T ss_pred Cc-------hHHHHHHHHHHhhHHHHHHHHHHHHhhhcccCCCCCCCcccc--c-----ccccccchHHHHHHHHHHHHH Confidence 23 4455555556666677777776665320 0110000000000 0 012233344445555444333 Q ss_pred c-----CcceeEEecCCCc----ch----HHH---HHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeecccc Q lcl|Aclame:pro 82 N-----NRITVKFRPGDRE----AS----EEL---ANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNE 145 (708) Q Consensus 82 ~-----nr~~~~v~pr~~~----~d----~~~---A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~ 145 (708) . +++=++..+.+.+ .+ .++ -+.++..+......|++..+...++.+.+..|.|+..+. T Consensus 67 ~~ltpp~~~WF~l~~~d~~l~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~------ 140 (522) T protein:vir:10 67 LAVLPPQTSFFKLQVRDDKLGEELDPQIRSELDLSFSKMERMIMDYIAASNDRVAVHQALKHLIVGGNALIFMG------ 140 (522) T ss_pred HhhcCCCCccccccCChHHHhhhcChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCceeEEEc------ Confidence 2 3444445443311 01 112 223455566667789999999999999999999986542 Q ss_pred CCCCCCCcceeeEEeecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCcee Q lcl|Aclame:pro 146 YDPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVI 225 (708) Q Consensus 146 ~d~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~ 225 (708) .++ .++. |+.++++..++.- ...-++++.+|+..++...||....... .. ......+.+ T Consensus 141 ~~~------~~~~----pl~~y~v~~d~~G----~vd~i~r~~~~t~~ql~~~fg~~~~~~~----~~---~~~~~~~~v 199 (522) T protein:vir:10 141 KDG------LKTF----PLTRYVINRDGDG----NVLEIVTKELISRKVLDIELPEPKPNTG----ID---ESSTTNDDV 199 (522) T ss_pred CCC------ceEE----EcceEEEeeCCCC----CeeEEEeeeeccHHHHHHhcchhccchh----hh---cccCCCCce Confidence 122 1222 4556666544331 3344899999999999999997542110 00 111111222 Q ss_pred EEeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEe-cceeeecCCCCCCC Q lcl|Aclame:pro 226 YIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVD-GDGFLEKPRRIPGE 304 (708) Q Consensus 226 ~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~-~~~il~~~~~~p~~ 304 (708) .|... .+..+..+. +.|+... +..+....+.+++. T Consensus 200 ~v~~~-----------v~p~~~~~~---------------------------------~~~~~~~~~~~~~~~~s~~g~~ 235 (522) T protein:vir:10 200 TIYTY-----------VKLDKSSGR---------------------------------WVWHQEAFDKIIPDSRSTAPKN 235 (522) T ss_pred EEEEE-----------EEeeccCCc---------------------------------eEEEEccCCccccccccccccc Confidence 22111 111111111 1111111 12222233567889 Q ss_pred CcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeee Q lcl|Aclame:pro 305 HIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPL 384 (708) Q Consensus 305 ~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~ 384 (708) ++||+++.... .+|..+|.|.+....+-.+.+|.+...++.......+++++++.+.+..... ..+++...+ T Consensus 236 ~~P~~~~Rw~~--~~ge~YGrgp~~~~l~D~k~L~~l~~~~~~~~~~a~~p~~lv~~~~~~~~~~------l~~~~~~~~ 307 (522) T protein:vir:10 236 ASPWLPLRFNT--VDGEDYGRGRVEEFLGDLKSLDGLSQSLIEGAAAASKVVFLVSPSSTTKPAT------IAKAGNGAI 307 (522) T ss_pred cCCceeeeeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeecccccccccc------ccCCCCcce Confidence 99998765443 5888999999999999999999999999999999999999998766543222 112222111 Q ss_pred cccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccchhHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 385 REVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMASFIYLDNM 464 (708) Q Consensus 385 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n~sg~ai~~~q~q~~~~~~~~~dn~ 464 (708) ..+ ..+.+ .+.+....+.+ +....+++...+.|....-+. +.-.....|++-|..+.+.....+...+.+| T Consensus 308 v~g--~~~~v----~~~~~~~~~d~-~~~~~~i~~~~~ri~~aFl~~--~~~d~~rvTAtEV~~r~~E~~~~LGpv~~rl 378 (522) T protein:vir:10 308 VQG--RPEDV----AVIQVGKTADF-STAANMATAIEKRLLEAFLVM--NVRNAERVTAEEVRLTQLELEQQLGGIFSLL 378 (522) T ss_pred ecC--CCccc----eeecccccccc-hHHHHHHHHHHHHHHHHHhhc--cCCCCCCCCHHHHHHHHHHHHHHhhHHHHHH Confidence 111 11111 11111111222 233455566666665553211 0111233589999999998888888888887 Q ss_pred H-HHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHH Q lcl|Aclame:pro 465 A-KSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDAT 543 (708) Q Consensus 465 ~-~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~ 543 (708) . .+..-+.+..+.++.+ .|. | + +-.. ++ ++-+++ ++. ++-.|.+. T Consensus 379 ~~E~l~Pli~r~~~il~r-------------~g~-----l-P----~~p~------~~--~~~~~v--~~i-s~Laraq~ 424 (522) T protein:vir:10 379 VIEFLIPYLNRTLLVLQR-------------SNQ-----I-P----KLPK------DI--VRPTIV--AGV-NALGRGQD 424 (522) T ss_pred HHHHHHHHHHHHHHHHHh-------------cCC-----C-C----CCCc------cc--cccccc--cch-hHHHHHHH Confidence 6 4554454444444432 110 0 0 0000 00 011121 122 23346777 Q ss_pred HHHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhh--hhcccCcchHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 544 VSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLI--SGIAKPRNEKEQQIVQQAQMAAQSQPNPEMV 621 (708) Q Consensus 544 ~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~--~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~ 621 (708) .+.++.+++.++...+. +.+++.. +.++++..+-..... ...... +++.++.++++++++++++ +. T Consensus 425 ~~~l~~~~~~i~~~~~p-----~~~~~~i---d~d~~~~~~a~~~Gvp~~~ivrt--~eev~~~~q~~q~~~~~~~--~~ 492 (522) T protein:vir:10 425 RESLTAFVGTIAQTLGP-----EALMQYL---NPLEAIKRLAAAQGIDVLNLVKT--EQQLAEEQQAAQQQAAQQS--LV 492 (522) T ss_pred HHHHHHHHHHHHHhhCc-----hhhhhcC---CHHHHHHHHHHHhCCChhhhcCC--HHHHHHHHHHHHHHHHHHH--HH Confidence 77788777766432211 1122222 334444444333321 112221 1111111111111110000 00 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 622 LAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDK 675 (708) Q Consensus 622 ~aq~~~~~~qae~~k~~~~~~~~q~e~~~~~~~~~~~~a~~~~~~~q~~~~~~~ 675 (708) .+..++.. ...++..+.. + ...+. ++.. ++ T Consensus 493 ~~a~~~~~------~~~~~~~~~~--------~-~~~~~-------~~~~--~~ 522 (522) T protein:vir:10 493 DQAGQMTG------SPLMDPTKNP--------Q-LMDEE-------QPPM--EE 522 (522) T ss_pred HHHHHHhc------ccccCccccH--------H-HHHHh-------CCCC--CC Confidence 00000000 0000000000 0 00000 0000 00 No 79 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=99.62 E-value=1.4e-14 Score=96.47 Aligned_cols=467 Identities=10% Similarity=-0.019 Sum_probs=198.3 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEY 80 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~g~~ 80 (708) =.+-.+.++.++...+. .... +-++...||+|+|+-.......... . ..--++.|..+.+|+..++.. T Consensus 10 ~~~~~~~~~~~l~~~~~----~~~~-----r~~~~~~Yy~G~~~i~~~~~~~~~~--~-~~~~~~~n~~~~ivd~~~~~l 77 (485) T protein:vir:10 10 EIEDPAIARDEMVSAFE----DSTQ-----NLKTNTSYYEAERRPEAIGVTVPIQ--M-QSLLAHVGYPRLYVDSIAERQ 77 (485) T ss_pred CCCCHHHHHHHHHHHHH----HHHH-----HHHHHHHHHhcCCcchhcCCCCChh--h-hhhhhhcCcHHHHHHHHHhhh Confidence 13333333333332222 2211 1123346899998743211111000 0 011234699999999988876 Q ss_pred hcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCcceeeEEe Q lcl|Aclame:pro 81 RNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAIEPI 160 (708) Q Consensus 81 ~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~i~~v 160 (708) .-+- |+. + ++.+..+ .++.++..|+++.....+..++++.|+||+.|..+.... .....++.++|..+ T Consensus 78 ~~~g----~~~--~-~~~~~~~----~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~e~~~-~~~~~~~~~~i~~~ 145 (485) T protein:vir:10 78 AVEG----FRF--G-DADEADE----ELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQI-DLGWDPNTPIIRVE 145 (485) T ss_pred cccc----eec--C-CCchhHH----HHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEeeCCccc-ccccCCCeeEEEEE Confidence 3221 222 1 1223333 345566789999999999999999999998887653321 11122344444433 Q ss_pred ecchhhe--ecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeeeecceEE Q lcl|Aclame:pro 161 YDPSRSV--WFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVRKESV 238 (708) Q Consensus 161 ~~~~~~v--~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~~~~~ 238 (708) ++.++ +|||...+ .. ++.+.+-. . ..+.+...++|..... T Consensus 146 --~p~~~~~~~D~~~~~-----~~-~~~~~~~~---------------------------~-~~~~~~~~~~y~~~~~-- 187 (485) T protein:vir:10 146 --PPTRMYAEIDPRIGR-----VS-KAIRVAYD---------------------------A-EGNEIQAATLYTPNDI-- 187 (485) T ss_pred --ccceeEEEEcCCCCc-----ee-EEEEEEEe---------------------------e-CCCeEEEEEEEeCCeE-- Confidence 23343 47764321 11 11111100 0 0111222233322210 Q ss_pred EEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEeeecc Q lcl|Aclame:pro 239 DVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRWFI 318 (708) Q Consensus 239 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~~~~~ 318 (708) +.|. ...+.....+..|.+++.+|+|+|...+. T Consensus 188 --~~~~-------------------------------------------~~~~~~~~~~~~~~~~g~vPvv~~~n~~~-- 220 (485) T protein:vir:10 188 --FGWY-------------------------------------------RVENEWQEWFNNPHGLGVVPVVPIPNRTR-- 220 (485) T ss_pred --EEEE-------------------------------------------EcCCceEEeccccCCCCcccEEEeccccc-- Confidence 1111 00111112233456667778887653321 Q ss_pred CCcccccchH-HhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchH---HHHHhhcccCCceeeecccccccccc Q lcl|Aclame:pro 319 DDIERVEGHI-AKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLE---KHWEARNKKRPAFLPLREVRDKSGNI 394 (708) Q Consensus 319 d~~~~~~G~v-r~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~ 394 (708) .+...|.|-+ +.+++.++.+|+.+|.+...+...+.+..++-....++.. ..-........+.++.-... T Consensus 221 ~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~------ 294 (485) T protein:vir:10 221 LSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDPETGQTLFDAYLARILAFEDA------ 294 (485) T ss_pred cCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchHHHHhcCCcccccccccccchhhhhcccceeccCCC------ Confidence 1223334434 3689999999999999999887777665443210011000 00000000011112211110 Q ss_pred cccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccc-cc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 395 IAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMP-SN-IAQETVNNLMNRADMASFIYLDNMAKSLKRAG 472 (708) Q Consensus 395 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~-~n-~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~~ 472 (708) . ..+...+... ...+...+......+-.+|++++...|.. .| +||.|+.................|..+++++. T Consensus 295 --d-~k~~q~~~~~-~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~ 370 (485) T protein:vir:10 295 --E-GKIQQFSAAE-LANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNSIFGGAWEEAM 370 (485) T ss_pred --C-ceEEeecccc-hHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0 1222222221 23344555555555555677788877743 34 79999998888777777777777778887777 Q ss_pred HHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHHH Q lcl|Aclame:pro 473 EVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLS 552 (708) Q Consensus 473 ~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq 552 (708) ++++.+...- + ....+ ++|.|.=.+..+....+..+.+..|.+ T Consensus 371 ~l~~~~~~~~----------~--~~~~~-------------------------~~i~v~w~~~~~~~~~~~ada~~kl~~ 413 (485) T protein:vir:10 371 RLAYRMMKGG----------D--VPPDM-------------------------LRMETVWRDPSTPTYAAKADAASKLYN 413 (485) T ss_pred HHHHHHhCCC----------C--Ccccc-------------------------eeeeEEecCCCCCCHHHHHHHHHHHHh Confidence 7665542110 0 00000 122222222333334556667777766 Q ss_pred hccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 553 SMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQA 632 (708) Q Consensus 553 ~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qa 632 (708) .+....+. ..+++++.+. ++-.+.++... +++. .+.......+.......-.+. T Consensus 414 ag~~~~s~-----et~~~~lg~~--~~~~~~~~~~~------------------ee~~-~~~~~~~~~~~~~~~~~~~~~ 467 (485) T protein:vir:10 414 GGTGVIPR-----ERARKDMGYS--IAEREEMRRWD------------------EEEA-AMGLGLIGTMVDPNPTVPGSP 467 (485) T ss_pred ccccCCCH-----HHHHHhCCCC--HhHHHHHHHHH------------------HHHH-HHHHHHHHHhhccCCCCCCCC Confidence 44222111 1223332221 11112221110 0000 000000000000000000000 Q ss_pred HH-HHHHHHHHHHHHHHH Q lcl|Aclame:pro 633 EA-QKATNETAQTQIKAF 649 (708) Q Consensus 633 e~-~k~~~~~~~~q~e~~ 649 (708) +. +....-.....-+.+ T Consensus 468 ~~~~~~~~~~~~~~~~~~ 485 (485) T protein:vir:10 468 SPAPAPKPAALESGGDAA 485 (485) T ss_pred CccccccCcCCCCCCCCC Confidence 00 000000000000000 No 80 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=99.62 E-value=1.7e-14 Score=96.06 Aligned_cols=464 Identities=10% Similarity=-0.041 Sum_probs=200.7 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEY 80 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~g~~ 80 (708) +-......-..++..+-+.+..+..-. ++...||.|+|.-.......... . ..--++.|..+.+|+..++.. T Consensus 6 ~~~~~~~~~~~~~~~L~~~~~~~~~r~-----~~~~~YY~G~~~i~~~~~~~~~~--~-~~~~~~~n~~~~ivd~~~~~l 77 (485) T protein:vir:24 6 PGQEEIADPAIARDEMVSAFEDQNQNL-----RSNTSYYEAERRPEAIGVTVPVQ--M-QSLLAHVGYPRLYVDSIAERQ 77 (485) T ss_pred CCCCcccchHHHHHHHHHHHHHHHHHH-----HHHHHHHhccCchhhcCcccchh--h-hhhhhccchHHHHHHHHhhhh Confidence 222222222233333322222211111 12235899998422111100000 0 011245699999999999877 Q ss_pred hcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCcceeeEEe Q lcl|Aclame:pro 81 RNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAIEPI 160 (708) Q Consensus 81 ~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~i~~v 160 (708) .-+- + ..+. +.+.. ..++.++..|+++.....+..++++.|++|+.|..+..... ...+.+.++|..+ T Consensus 78 ~~~g--~-~~~~----~~~~~----~~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~-~~~~~~~~~i~~~ 145 (485) T protein:vir:24 78 AVEG--F-RLGD----ADEAD----EELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQID-LGWDPNVPLIRVE 145 (485) T ss_pred ccCc--e-ecCC----CchhH----HHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccc-cccCCCcceEEEe Confidence 4332 1 1221 22222 33455667899999999999999999999998876543211 1223345555543 Q ss_pred ecchhhe--ecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeeeecceEE Q lcl|Aclame:pro 161 YDPSRSV--WFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVRKESV 238 (708) Q Consensus 161 ~~~~~~v--~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~~~~~ 238 (708) ++.++ +||+...++ .++.+.+-+. +...+....+|.... T Consensus 146 --~p~~~~~i~D~~~~~~------~~~~~~~~~~----------------------------~~~~~~~~~~y~~~~--- 186 (485) T protein:vir:24 146 --PPTRMYAEIDPRIGRP------AKAIRVAYDA----------------------------EGNEIQAATLYTPNE--- 186 (485) T ss_pred --ccceeEEEeeCCcCce------eEEEEEEEee----------------------------cCCeEEEEEEEcCCc--- Confidence 23344 467643321 1222211100 001111222222211 Q ss_pred EEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEeeecc Q lcl|Aclame:pro 239 DVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRWFI 318 (708) Q Consensus 239 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~~~~~ 318 (708) ++.|. -.+|...+....+.+++.+|+|||...+. T Consensus 187 -~~~~~-------------------------------------------~~~~~~~~~~~~~h~~g~vPvv~f~n~~~-- 220 (485) T protein:vir:24 187 -TFGWF-------------------------------------------RAEGEWVEWFSDPHGLGAVPVVPLPNRTR-- 220 (485) T ss_pred -EEEEE-------------------------------------------ecCCceEeecccccCCCcccEEEeccCcc-- Confidence 01111 01111122333456667788888753321 Q ss_pred CCcccccchH-HhhhHHHHHHHHHHHHHHHHHhhcCCCceeech---hhccchHHHHHhhcccCCceeeecccccccccc Q lcl|Aclame:pro 319 DDIERVEGHI-AKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGM---EQIRGLEKHWEARNKKRPAFLPLREVRDKSGNI 394 (708) Q Consensus 319 d~~~~~~G~v-r~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~---~ai~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 394 (708) .+..+|.|-+ +.+++.++.+|+.+|.+..++...+.+..++-. +.+...............+.++..... T Consensus 221 ~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~------ 294 (485) T protein:vir:24 221 LSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDPETGQTLFDAYLARILAFEDA------ 294 (485) T ss_pred cCCcCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhccCCccccccccccccchhhhcccceeccCCC------ Confidence 1222333333 368999999999999999988877766654421 111000000000001111222211110 Q ss_pred cccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccc-cc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 395 IAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMP-SN-IAQETVNNLMNRADMASFIYLDNMAKSLKRAG 472 (708) Q Consensus 395 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~-~n-~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~~ 472 (708) . ..+....... ...+...+......+-.++++++...|.. .| +||.|+.................|..+++++. T Consensus 295 --~-~~~~q~~~~~-~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~ 370 (485) T protein:vir:24 295 --E-GKIQQFSAAE-LANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNAIFGGAWEEAM 370 (485) T ss_pred --C-ceEEeecccc-hHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0 1121122111 23344444444555555577888887753 34 79999999888888888888888888998888 Q ss_pred HHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHHH Q lcl|Aclame:pro 473 EVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLS 552 (708) Q Consensus 473 ~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq 552 (708) ++++.+... - +. ...+ .+|.|.=.+..+....+..+.+..|.+ T Consensus 371 ~l~~~~~~~-~---------~~--~~d~-------------------------~~i~v~f~~~~~~s~~~~ad~~~kl~~ 413 (485) T protein:vir:24 371 RLAYRLMKG-G---------DV--PPDM-------------------------LRMETVWRDPSTPTYAAKADAATKLYG 413 (485) T ss_pred HHHHHHhcC-C---------CC--cccc-------------------------ceeeEEecCCCCCCHHHHHHHHHHHHh Confidence 887764221 0 00 0000 011111121222224455666677665 Q ss_pred hccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHH Q lcl|Aclame:pro 553 SMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQ---MVA 629 (708) Q Consensus 553 ~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~---~~~ 629 (708) .+....+ ..++++++.+. ++..+.+++..... ..+ ..+. ......... -.. T Consensus 414 ~g~~~~s-----~et~~~~l~~~--~d~~~e~~~~~ee~------------~~~----~~~~---~~~~~~~~~~~~~~~ 467 (485) T protein:vir:24 414 NGQGVIP-----RERARKDMGYS--IAEREEMRRWDEEE------------AAM----GLGL---LGTMVDADPTVPGSP 467 (485) T ss_pred cccccCC-----HHHHHhhCCCC--HhHHHHHHHHHHHH------------hhh----hhhH---HHhhcccCCCCCCCC Confidence 4422111 12233443332 22222222111000 000 0000 000000000 000 Q ss_pred HHHHHHH---H-H-HHHH Q lcl|Aclame:pro 630 AQAEAQK---A-T-NETA 642 (708) Q Consensus 630 ~qae~~k---~-~-~~~~ 642 (708) ...+... + . ++.+ T Consensus 468 ~~~e~~~~~~~~~~~~~a 485 (485) T protein:vir:24 468 NPTPAPKPQPAIEGGDSA 485 (485) T ss_pred CCCCCCCCccCCCCCCCC Confidence 0000000 0 0 0000 No 81 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=99.62 E-value=1.7e-14 Score=96.06 Aligned_cols=470 Identities=12% Similarity=0.005 Sum_probs=205.1 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCC----CCHHHHHHhhhhhhhcCCCceeecchHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQ----WEGATAAGTKLDEQFEKYPKFEINKVATELNRI 76 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Q----w~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i 76 (708) |+....---..++..+...+.....-. ++...||+|+| ++.......+. . -++.|..+-+|++. T Consensus 1 ~~~~~~~d~~~~i~~L~~~~~~~~~r~-----~~~~~Yy~g~~~i~~~~~~~~~~~~~-----~--~~~~n~~~~ivd~~ 68 (488) T protein:vir:23 1 MAETESIDPEKLRDQLLDAFENKQNEL-----KSSKAYYDAERRPDAIGLAVPLDMRK-----Y--LAHVGYPRTYVDAI 68 (488) T ss_pred CCcccCCCHHHHHHHHHHHHHHHHHHH-----HHHHHHHhcccchhhcCcccchhhhh-----h--hhhcchHHHHHHHH Confidence 765222112334444444443332211 22235899987 11111111111 1 24568888888887 Q ss_pred HHHHhcCcce---eEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCc Q lcl|Aclame:pro 77 IAEYRNNRIT---VKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQ 153 (708) Q Consensus 77 ~g~~~~nr~~---~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~ 153 (708) +....-+-.. +.-.+.+..+|.+..+. +..++..|+++.....+..+++++|++|+-|..+.... ...++.+ T Consensus 69 a~~l~~~Gf~~~~~~~~~~~~~~d~~~~~~----l~~i~~~N~~~~~~~~~~~~a~i~G~a~~~v~~~~~~~-~~~~~~~ 143 (488) T protein:vir:23 69 AERQELEGFRIPSANGEEPESGGENDPASE----LWDWWQANNLDIEATLGHTDALIYGTAYITISMPDPEV-DFDVDPE 143 (488) T ss_pred HHhhhccceeccCCcccccccccchhHHHH----HHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCccc-ccCCCCC Confidence 7544322111 11112222234444333 45567899999999999999999999998886543211 1123334 Q ss_pred ceeeEEeecchhhe--ecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeee Q lcl|Aclame:pro 154 RIAIEPIYDPSRSV--WFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYY 231 (708) Q Consensus 154 ~i~i~~v~~~~~~v--~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~ 231 (708) ..+|..+ ++..+ +|||.... ...+.+++.+. +...+....+| T Consensus 144 ~~~i~~~--~p~~~~~~~d~~~~~------~~~~~~~~~~~----------------------------~~~~~~~~~~y 187 (488) T protein:vir:23 144 VPLIRVE--PPTALYAEVDPRTRK------VLYAIRAIYGA----------------------------DGNEIVSATLY 187 (488) T ss_pred cceEEEe--ccceeEEEEecCCCc------eEEEEEEEEec----------------------------CCCcEEEEEEE Confidence 4444432 23333 47764221 12222222100 00111112222 Q ss_pred eecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeE Q lcl|Aclame:pro 232 EVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPV 311 (708) Q Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~ 311 (708) .... +++|.. .+|.-.++...+.+++.+|+||| T Consensus 188 ~~~~----~~~~~~-------------------------------------------~~~~~~~~~~~~h~~g~vPvv~f 220 (488) T protein:vir:23 188 LPDT----TMTWLR-------------------------------------------AEGEWEAPTSTPHGLEMVPVIPI 220 (488) T ss_pred ecCc----EEEEEe-------------------------------------------cCCceEeccccccCCCCcceEEe Confidence 2221 111110 01111223345667778888887 Q ss_pred EEeeeccCCcccccchH-HhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhc-cchHHH---HHhhcccCCceeeecc Q lcl|Aclame:pro 312 YGKRWFIDDIERVEGHI-AKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQI-RGLEKH---WEARNKKRPAFLPLRE 386 (708) Q Consensus 312 ~~~~~~~d~~~~~~G~v-r~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai-~~~~~~---~~~~~~~~~~~~~~~~ 386 (708) ...+. .+..+|.|-+ +.+++.++.+|+..|.+...+...+.++.++- |.. ++.... .........+.++. T Consensus 221 ~n~~~--~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~-G~~~~~~~~~~~~~~~~~~~~~~~v~~-- 295 (488) T protein:vir:23 221 SNRTR--LSDLYGTSEISPELRSVTDAAAQILMNMQGTANLMAIPQRLIF-GAKPEELGINAETGQRMFDAYMARILA-- 295 (488) T ss_pred ccccc--cCCcCCccchhhhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHh-CCCcccccccccccchhhhhhhhhhcc-- Confidence 64322 2233444545 46899999999999999998876666554431 111 000000 00000000011111 Q ss_pred cccccccccccccccccccCccc-hHHHHHHHHHHHHHHHHHhCCChhHcccc-cc-hhHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 387 VRDKSGNIIAGATPAGYTQPAVM-NQALAALLQQTSADIQEVTGGSQAMQQMP-SN-IAQETVNNLMNRADMASFIYLDN 463 (708) Q Consensus 387 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~l~~~~~~~~~~~tGv~~~~~G~~-~n-~sg~ai~~~q~q~~~~~~~~~dn 463 (708) .+....+.+.+.+.. ...+...+......+-.+|++++..+|.. .| +||.|+......-.......... T Consensus 296 --------~~~g~~~~~~q~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~ 367 (488) T protein:vir:23 296 --------FEGGEGAHAEQFSAAELRNFVDALDALDRKAASYSGLPPQYLSSSSDNPASAEAIKAAESRLVKKVERKNKI 367 (488) T ss_pred --------CCCCCCceeEecCCCChHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHH Confidence 011111222232222 24455556666666667788888888753 33 69999998888888778888888 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHH Q lcl|Aclame:pro 464 MAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDAT 543 (708) Q Consensus 464 ~~~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~ 543 (708) |..+++++.++++.+.-.- ++...+ .+|.|.=.+..+....+. T Consensus 368 f~~~l~~~~~l~~~~~~~~------------~~~~~~-------------------------~~i~v~f~~~~~~s~~~~ 410 (488) T protein:vir:23 368 FGGAWEQAMRLAYKMVKGG------------DIPTEY-------------------------YRMETVWRDPSTPTYAAK 410 (488) T ss_pred HHHHHHHHHHHHHHHhcCC------------Ccchhh-------------------------ccceEEecCCCCCCHHHH Confidence 8888888888777542110 000000 011111122223334556 Q ss_pred HHHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 544 VSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLA 623 (708) Q Consensus 544 ~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~a 623 (708) .+.+..|.+.+....+ ...+++++.+ .+.-.+.++.... ++.++...+.. +....... T Consensus 411 ada~~kl~~~g~~~~s-----~et~~~~l~~--~~d~~~~~~~~~~------------~~~~~~~~~~~---~~~~~~~~ 468 (488) T protein:vir:23 411 ADAAAKLFANGAGLIP-----RERGWVDMGY--TIVEREQMRQWLE------------QDQKQGLGLIG---SLYGASTP 468 (488) T ss_pred HHHHHHHHhcccccCC-----HHHHHHhCCC--CchHHHHHHHHHH------------HHHHHHHHHHH---HHhccCCC Confidence 6677777665432211 1233344332 1111122211100 00000000000 00000000 Q ss_pred HHHHHHHHHH-HHHHHHHHH Q lcl|Aclame:pro 624 QAQMVAAQAE-AQKATNETA 642 (708) Q Consensus 624 q~~~~~~qae-~~k~~~~~~ 642 (708) ........+. ...-+..++ T Consensus 469 ~~~~~~~~~~~~~~~e~~~a 488 (488) T protein:vir:23 469 EGKPGEAPVGEPPAPEPDAA 488 (488) T ss_pred cccCCCCCCCCCCCCCCCCC Confidence 0000000000 000000000 No 82 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=99.62 E-value=2.4e-15 Score=100.69 Aligned_cols=482 Identities=12% Similarity=0.078 Sum_probs=212.3 Q ss_pred CCcchHHHH-HHH----HHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHH Q lcl|Aclame:pro 1 MAETLEKKH-ERI----MLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNR 75 (708) Q Consensus 1 ma~~~~~~~-~~~----~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~ 75 (708) ||=..++.+ +.. .+.+.+..+.+.....+. ++-..||.|+| .+.. .+........-.++.|..+.+|+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~--~~l~~Yy~g~~---~i~~-~~~~~~~~~~~ki~~n~~~~Iv~~ 74 (499) T protein:vir:10 1 MAVVIDKDLLDDVNEPNIEAINYAIRELQNRKKRL--DKLSDYYNGKQ---EIEK-HEFDNATVEAANVMVNHAKYITDM 74 (499) T ss_pred CccchhhhHHhhhhcCCHHHHHHHHHHHHHHHHHH--HHHHHHhcccc---chhc-CCcCcCCCCcceeecchHHHHHHH Confidence 765444333 111 233344444443322223 22346899976 1211 111111111224567999999999 Q ss_pred HHHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCC-------- Q lcl|Aclame:pro 76 IIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYD-------- 147 (708) Q Consensus 76 i~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d-------- 147 (708) .+|+...+.+.+.+ . ++ +..+. +..+++.|+++.....+..+++++|.+|..+..+...... T Consensus 75 ~~~~l~g~p~~~~~--~---~~-~~~~~----l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~g~~~~~~~~~~~ 144 (499) T protein:vir:10 75 NVGFMTGNPVKYVA--E---KG-KNIDD----ILEVFNQIDIHKHDIELEKDLSVFGYGYELLYLKKTDPISVRDELGNE 144 (499) T ss_pred HhhhhcccCceeec--C---Ch-hHHHH----HHHHHhhcCHhHHHHHHHHHHHhcCceEEEEEeccccccccccccccc Confidence 99999998877664 1 12 22232 4445678899999999999999999998877544211000 Q ss_pred CCCCCcceeeEEeecchhhee--cCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCcee Q lcl|Aclame:pro 148 PMDDRQRIAIEPIYDPSRSVW--FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVI 225 (708) Q Consensus 148 ~~~~~~~i~i~~v~~~~~~v~--~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~ 225 (708) .......+++..+ ++..+| ||.... .-...+++.+.. .+..+...+ T Consensus 145 ~~~~~~~~~~~~v--~p~~~~~v~~d~~~-----~~~~~~i~~~~~-------------------------~~~~~~~~~ 192 (499) T protein:vir:10 145 KLTPNTELKIEVI--DPRATVVVCDDTVE-----HDPLFAVFTQEK-------------------------KDLEGNTNG 192 (499) T ss_pred ccccccceEEEEE--cccceEEEecCCCC-----cceEEEEEEEEE-------------------------eecCCCceE Confidence 0000111111111 111111 110000 000011111000 000011122 Q ss_pred EEeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCC Q lcl|Aclame:pro 226 YIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEH 305 (708) Q Consensus 226 ~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~ 305 (708) ...+.|...+.. .|.....+ ...+...+....+.+++. T Consensus 193 ~~~~iyt~~~i~----~~~~~~~~--------------------------------------~~~~~~~~~~~~~~~~g~ 230 (499) T protein:vir:10 193 YSITVYMPQRIV----EYRTKTTM--------------------------------------EVSANDPIVYDGENLFGA 230 (499) T ss_pred EEEEEEeCCeEE----EEEecCCc--------------------------------------cccCcceecccccCCCCc Confidence 223333322211 11100000 000111122334455666 Q ss_pred cceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeec Q lcl|Aclame:pro 306 IPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLR 385 (708) Q Consensus 306 ~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~ 385 (708) +|+|+|. +...+.|.+..+++.++.+|...|.+.+.+...+.+.+++.-..++...... .....+.+... T Consensus 231 vPvv~~~-------n~~~~~~d~e~v~~liD~~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~--~~~~~~~~~~~- 300 (499) T protein:vir:10 231 VPIIEFR-------NNEERQGDFEQLISLIDAYNLLQTDRISDKEAFVDALLVTFGFGLGDDKDDI--QRLKRGAIEAP- 300 (499) T ss_pred cceEEec-------CCCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccccccchh--hhhhhcceecc- Confidence 7776643 1234679999999999999999999999998888887776432222111100 01111111111 Q ss_pred ccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHccc-ccchhHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 386 EVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM-PSNIAQETVNNLMNRADMASFIYLDNM 464 (708) Q Consensus 386 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~-~~n~sg~ai~~~q~q~~~~~~~~~dn~ 464 (708) .... ...+.+...+.-..++...+....+.|...|++.+.+.+. .+|.||.|+..+............+.| T Consensus 301 -~~~~-------~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~gn~Sg~Al~~~~~~l~~k~~~k~~~~ 372 (499) T protein:vir:10 301 -PREE-------GADIEWLTKSFDETQVNLLSQSIENDIHKISYVPNMNDEKFMGNVSGEAMKFKLFGLENLLSIKQRYF 372 (499) T ss_pred -CCCC-------CCcceEEeccCCHHHHHHHHHHHHHHHHHHhCcccCCchhhcccchHHHHHHHHHHHHHHHHHHHHHH Confidence 0111 1123333333334666777888889999999987655443 457899999988887777777777778 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHH Q lcl|Aclame:pro 465 AKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATV 544 (708) Q Consensus 465 ~~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~ 544 (708) ..+++++.++++.++. +.|.+ .++ .+|.|.=.+..+....+.. T Consensus 373 ~~~l~~~~~li~~~~~----------~~~~~--~d~-------------------------~~i~i~f~~~~p~n~~e~~ 415 (499) T protein:vir:10 373 FDGLRRRLKLIQTIVN----------IKGAN--DDA-------------------------SGCKISLVANIPSNLSDVV 415 (499) T ss_pred HHHHHHHHHHHHHHHh----------ccCCc--ccc-------------------------ccceEEeCCCCCCCHHHHH Confidence 8888877777776543 11210 000 1233333444555445555 Q ss_pred HHHHHHHHhccccCchhHHHHHHHHhhccc-hhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 545 SVLTNVLSSMLPTDPMRPAIQGIILDNIDG-EGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLA 623 (708) Q Consensus 545 ~~l~~llq~~~~~~p~~~~~~~~~~~~~d~-~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~a 623 (708) +.++.+...+ ...+++.++.+ ...++..+++....... ....+. ......+..- ... T Consensus 416 ~~~~kl~g~i---------S~et~~~~l~~v~d~~~E~~ri~~E~~~~----------~~~~~~--~~~~~~~~~~-~~~ 473 (499) T protein:vir:10 416 NNVKNADGII---------PRKYTYSWLPDVDNPQDVIDEMNQQDAET----------IKKNQE--ALRGQDPDRL-ELE 473 (499) T ss_pred HHHHHHhccC---------ChHHHHHhCCCCCCHHHHHHHHHHHHHHH----------HHHHHh--hhccCCCCCC-CCC Confidence 5555542111 11233333322 11222233332211000 000000 0000000000 000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 624 QAQMVAAQAEAQKATNETAQTQIKAFTAQQDAME 657 (708) Q Consensus 624 q~~~~~~qae~~k~~~~~~~~q~e~~~~~~~~~~ 657 (708) ..+ ...+.......++..+ ...-.+. T Consensus 474 ~~~-~~~~~~~~~~~~~~~~-------~~~~~~~ 499 (499) T protein:vir:10 474 DKQ-DDSSENDKEAGSNHNQ-------SHRTRAV 499 (499) T ss_pred CCC-cccCCCCCCCcccccc-------CCCCCCC Confidence 000 0000000000000000 0000000 No 83 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=99.61 E-value=2.1e-14 Score=95.60 Aligned_cols=450 Identities=9% Similarity=0.026 Sum_probs=203.8 Q ss_pred CCcchHHH----HHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHh--hhhhhhcCCC--ceeecchHHH Q lcl|Aclame:pro 1 MAETLEKK----HERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGT--KLDEQFEKYP--KFEINKVATE 72 (708) Q Consensus 1 ma~~~~~~----~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l--~~~~q~~grp--~~~~N~i~~~ 72 (708) |+++.-+. ++...+.+.+..+.......+. ++-..||.|+|. -..+... .......++| -+++|..+.+ T Consensus 13 ~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~--~~~~~YY~g~~~-i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~I 89 (474) T protein:vir:94 13 YGEEVVEQLKPQFETQEEMIVRLIDDHRKQLDKI--TVGQRYYDKDND-IVKQMKKVDVHGNIDYDKPDWRITTNFHQNL 89 (474) T ss_pred hhhHHHHhhhhcccCHHHHHHHHHHHHHHHHHHH--HHHHHHhccccc-hhcccchhccccccccccCcceeecchHHHH Confidence 33211111 1111222233333333222222 233458999873 1111000 0000111233 3678999999 Q ss_pred HHHHHHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCC Q lcl|Aclame:pro 73 LNRIIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDR 152 (708) Q Consensus 73 i~~i~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~ 152 (708) |+..+|+...+.+.+.+ +|.+..+ +++.+.+ |+++.....+..+++++|.||..+..+. . T Consensus 90 vd~~~~~l~g~p~~~~~------~d~~~~~----~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~~~~d~---------~ 149 (474) T protein:vir:94 90 VDQKVSYVASKPVTYSC------EDENVLK----VIHDVLD-TRWDNKLIDILTATSNKGIDWLQVYINE---------N 149 (474) T ss_pred HHHHHhhhhcCCceecc------CcHHHHH----HHHHHHh-ccHHHHHHHHHHHHhhcCceEEEEEecC---------C Confidence 99999999998887653 1233333 4444444 7899999999999999999998775421 2 Q ss_pred cceeeEEeecchhhee--cCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeee Q lcl|Aclame:pro 153 QRIAIEPIYDPSRSVW--FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKY 230 (708) Q Consensus 153 ~~i~i~~v~~~~~~v~--~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~ 230 (708) +.+++..+ ++..+| ||+... .+.. .+++.|...+ ....+. T Consensus 150 ~~~~i~~~--~p~~~~~v~d~~~~----~~~~-~~ir~~~~~~-------------------------------~~~~~~ 191 (474) T protein:vir:94 150 GEMKLFRV--PAEQAIPIWVDKER----EELK-SFIRYYKFNN-------------------------------EEKVEF 191 (474) T ss_pred CeeEEEEE--cccceEEEEcCCCC----CceE-EEEEEEEecC-------------------------------eEEEEE Confidence 33444433 334444 444221 1222 2233321000 001233 Q ss_pred eeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceee Q lcl|Aclame:pro 231 YEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIP 310 (708) Q Consensus 231 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p 310 (708) |...+.. .|... .+...... ......+.....+.+++.+|+++ T Consensus 192 yt~~~~~----~y~~~-~~~~~~~~--------------------------------~~~~~~~~~~~~~~~~g~vPvv~ 234 (474) T protein:vir:94 192 WTDTTVT----YYVLE-NGGLIPDY--------------------------------YYGANHVQSHFSNGNWGRVPFIA 234 (474) T ss_pred EeCCeEE----EEEEc-CCcccccc--------------------------------ccCcCcccccccccCCCccceEE Confidence 4332211 11110 00000000 00011112223344556666665 Q ss_pred EEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeecccccc Q lcl|Aclame:pro 311 VYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDK 390 (708) Q Consensus 311 ~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~ 390 (708) |.- ...|.|.+..+++.++.+|+..|.+.+.+...+.+.+++.-...++..+.... ......+.... T Consensus 235 ~~n-------n~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~--~~~~~~i~~~~---- 301 (474) T protein:vir:94 235 FKN-------NPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYEGEDLEEFMRG--LKYYKAINVDG---- 301 (474) T ss_pred ecC-------CcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhhh--hhccceeeccC---- Confidence 431 23467889999999999999999999999888888777643222222222111 11112221111 Q ss_pred cccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHccc-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 391 SGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM-PSNIAQETVNNLMNRADMASFIYLDNMAKSLK 469 (708) Q Consensus 391 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~-~~n~sg~ai~~~q~q~~~~~~~~~dn~~~~~~ 469 (708) .+ ...+...+.-..++...+......|...|++.+.+.+. .+|.||.|+..+..............|..+++ T Consensus 302 ~~-------~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~ 374 (474) T protein:vir:94 302 DG-------GVETIQVEVPVSSTKEYIDLMRVYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLDLKANKLKNKATVAIQ 374 (474) T ss_pred CC-------ceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11 12333333334566667888888999999987765543 46789999988777776666666667777777 Q ss_pred HHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHH Q lcl|Aclame:pro 470 RAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTN 549 (708) Q Consensus 470 ~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~ 549 (708) ++.++++.+ .. +.. ++ -+|.|.=.+..+..-.+..+.+.. T Consensus 375 ~~~~li~~~----~~------~~~-----d~-------------------------~~i~v~f~~~~p~~~~e~a~~~~~ 414 (474) T protein:vir:94 375 ELISFIIDF----NN------LKT-----DV-------------------------KDIEISFNFNRMMNDAEQSQIIAQ 414 (474) T ss_pred HHHHHHHHH----hC------CCc-----cc-------------------------ceeeEEeccCcccCHHHHHHHHHH Confidence 776666554 22 000 00 011121223333322333333332 Q ss_pred HHHhccccCchhHHHHHHHHhhccc-hhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 550 VLSSMLPTDPMRPAIQGIILDNIDG-EGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMV 628 (708) Q Consensus 550 llq~~~~~~p~~~~~~~~~~~~~d~-~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~ 628 (708) .+-. + ...++.++.+ ...+...+++....... .+..+............ T Consensus 415 ----~g~i-S-----~et~l~~l~~v~D~~~E~eri~~E~~~~--------------------~~~~~~~~~~~~~~~~~ 464 (474) T protein:vir:94 415 ----SQYL-S-----RETLVKSSPLVDDYKAELERIEQEQMEY--------------------NKQLPNLDDGGADGAQQ 464 (474) T ss_pred ----cCCC-C-----HHHHHHhCCCCCCHHHHHHHHHHHHHHH--------------------HhhccccCCCCCCCccc Confidence 1111 1 1223333221 11222222222111000 00000000000000000 Q ss_pred HHHHHHHHHH Q lcl|Aclame:pro 629 AAQAEAQKAT 638 (708) Q Consensus 629 ~~qae~~k~~ 638 (708) ..+.+..+.+ T Consensus 465 ~~~~~~~~~e 474 (474) T protein:vir:94 465 QEGSNNKESE 474 (474) T ss_pred CCCCcccccC Confidence 0000000000 No 84 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=99.61 E-value=2.1e-14 Score=95.60 Aligned_cols=450 Identities=9% Similarity=0.026 Sum_probs=203.8 Q ss_pred CCcchHHH----HHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHh--hhhhhhcCCC--ceeecchHHH Q lcl|Aclame:pro 1 MAETLEKK----HERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGT--KLDEQFEKYP--KFEINKVATE 72 (708) Q Consensus 1 ma~~~~~~----~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l--~~~~q~~grp--~~~~N~i~~~ 72 (708) |+++.-+. ++...+.+.+..+.......+. ++-..||.|+|. -..+... .......++| -+++|..+.+ T Consensus 13 ~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~--~~~~~YY~g~~~-i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~I 89 (474) T protein:vir:97 13 YGEEVVEQLKPQFETQEEMIVRLIDDHRKQLDKI--TVGQRYYDKDND-IVKQMKKVDVHGNIDYDKPDWRITTNFHQNL 89 (474) T ss_pred hhhHHHHhhhhcccCHHHHHHHHHHHHHHHHHHH--HHHHHHhccccc-hhcccchhccccccccccCcceeecchHHHH Confidence 33211111 1111222233333333222222 233458999873 1111000 0000111233 3678999999 Q ss_pred HHHHHHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCC Q lcl|Aclame:pro 73 LNRIIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDR 152 (708) Q Consensus 73 i~~i~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~ 152 (708) |+..+|+...+.+.+.+ +|.+..+ +++.+.+ |+++.....+..+++++|.||..+..+. . T Consensus 90 vd~~~~~l~g~p~~~~~------~d~~~~~----~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~~~~d~---------~ 149 (474) T protein:vir:97 90 VDQKVSYVASKPVTYSC------EDENVLK----VIHDVLD-TRWDNKLIDILTATSNKGIDWLQVYINE---------N 149 (474) T ss_pred HHHHHhhhhcCCceecc------CcHHHHH----HHHHHHh-ccHHHHHHHHHHHHhhcCceEEEEEecC---------C Confidence 99999999998887653 1233333 4444444 7899999999999999999998775421 2 Q ss_pred cceeeEEeecchhhee--cCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeee Q lcl|Aclame:pro 153 QRIAIEPIYDPSRSVW--FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKY 230 (708) Q Consensus 153 ~~i~i~~v~~~~~~v~--~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~ 230 (708) +.+++..+ ++..+| ||+... .+.. .+++.|...+ ....+. T Consensus 150 ~~~~i~~~--~p~~~~~v~d~~~~----~~~~-~~ir~~~~~~-------------------------------~~~~~~ 191 (474) T protein:vir:97 150 GEMKLFRV--PAEQAIPIWVDKER----EELK-SFIRYYKFNN-------------------------------EEKVEF 191 (474) T ss_pred CeeEEEEE--cccceEEEEcCCCC----CceE-EEEEEEEecC-------------------------------eEEEEE Confidence 33444433 334444 444221 1222 2233321000 001233 Q ss_pred eeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceee Q lcl|Aclame:pro 231 YEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIP 310 (708) Q Consensus 231 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p 310 (708) |...+.. .|... .+...... ......+.....+.+++.+|+++ T Consensus 192 yt~~~~~----~y~~~-~~~~~~~~--------------------------------~~~~~~~~~~~~~~~~g~vPvv~ 234 (474) T protein:vir:97 192 WTDTTVT----YYVLE-NGGLIPDY--------------------------------YYGANHVQSHFSNGNWGRVPFIA 234 (474) T ss_pred EeCCeEE----EEEEc-CCcccccc--------------------------------ccCcCcccccccccCCCccceEE Confidence 4332211 11110 00000000 00011112223344556666665 Q ss_pred EEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeecccccc Q lcl|Aclame:pro 311 VYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDK 390 (708) Q Consensus 311 ~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~ 390 (708) |.- ...|.|.+..+++.++.+|+..|.+.+.+...+.+.+++.-...++..+.... ......+.... T Consensus 235 ~~n-------n~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~--~~~~~~i~~~~---- 301 (474) T protein:vir:97 235 FKN-------NPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYEGEDLEEFMRG--LKYYKAINVDG---- 301 (474) T ss_pred ecC-------CcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhhh--hhccceeeccC---- Confidence 431 23467889999999999999999999999888888777643222222222111 11112221111 Q ss_pred cccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHccc-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 391 SGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM-PSNIAQETVNNLMNRADMASFIYLDNMAKSLK 469 (708) Q Consensus 391 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~-~~n~sg~ai~~~q~q~~~~~~~~~dn~~~~~~ 469 (708) .+ ...+...+.-..++...+......|...|++.+.+.+. .+|.||.|+..+..............|..+++ T Consensus 302 ~~-------~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~ 374 (474) T protein:vir:97 302 DG-------GVETIQVEVPVSSTKEYIDLMRVYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLDLKANKLKNKATVAIQ 374 (474) T ss_pred CC-------ceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11 12333333334566667888888999999987765543 46789999988777776666666667777777 Q ss_pred HHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHH Q lcl|Aclame:pro 470 RAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTN 549 (708) Q Consensus 470 ~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~ 549 (708) ++.++++.+ .. +.. ++ -+|.|.=.+..+..-.+..+.+.. T Consensus 375 ~~~~li~~~----~~------~~~-----d~-------------------------~~i~v~f~~~~p~~~~e~a~~~~~ 414 (474) T protein:vir:97 375 ELISFIIDF----NN------LKT-----DV-------------------------KDIEISFNFNRMMNDAEQSQIIAQ 414 (474) T ss_pred HHHHHHHHH----hC------CCc-----cc-------------------------ceeeEEeccCcccCHHHHHHHHHH Confidence 776666554 22 000 00 011121223333322333333332 Q ss_pred HHHhccccCchhHHHHHHHHhhccc-hhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 550 VLSSMLPTDPMRPAIQGIILDNIDG-EGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMV 628 (708) Q Consensus 550 llq~~~~~~p~~~~~~~~~~~~~d~-~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~ 628 (708) .+-. + ...++.++.+ ...+...+++....... .+..+............ T Consensus 415 ----~g~i-S-----~et~l~~l~~v~D~~~E~eri~~E~~~~--------------------~~~~~~~~~~~~~~~~~ 464 (474) T protein:vir:97 415 ----SQYL-S-----RETLVKSSPLVDDYKAELERIEQEQMEY--------------------NKQLPNLDDGGADGAQQ 464 (474) T ss_pred ----cCCC-C-----HHHHHHhCCCCCCHHHHHHHHHHHHHHH--------------------HhhccccCCCCCCCccc Confidence 1111 1 1223333221 11222222222111000 00000000000000000 Q ss_pred HHHHHHHHHH Q lcl|Aclame:pro 629 AAQAEAQKAT 638 (708) Q Consensus 629 ~~qae~~k~~ 638 (708) ..+.+..+.+ T Consensus 465 ~~~~~~~~~e 474 (474) T protein:vir:97 465 QEGSNNKESE 474 (474) T ss_pred CCCCcccccC Confidence 0000000000 No 85 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=99.61 E-value=5.3e-14 Score=93.38 Aligned_cols=457 Identities=12% Similarity=0.083 Sum_probs=211.8 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHH--hhhhhhh--cCCC--ceeecchHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAG--TKLDEQF--EKYP--KFEINKVATELN 74 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~--l~~~~q~--~grp--~~~~N~i~~~i~ 74 (708) +. ........+.+.+.......+ ..+..+- ..||.|+| +-..+.. ....+.. ..+| .++.|..+.+|+ T Consensus 14 ~~-~~~~~~~~~~~~i~~~~~~~~--~~~~~~~--~~yy~g~~-~i~~~~~~~~~~~~~~~~~~~~~~ki~~~~~~~Ivd 87 (479) T protein:vir:79 14 VQ-LKKESTINLVKVIEHYILKHR--PEKYKQG--EEYYYGNT-DVNNKRRYYLLDGAKVDDFTKVNNKAINNYHKLLVD 87 (479) T ss_pred ec-cccCChhHHHHHHHHHHhhhh--HHHHHHH--HHHhccCC-cccccccccccccccccccccCcceeecchHHHHHH Confidence 11 111111222222232222221 1122222 35788876 1100000 0000000 0122 477899999999 Q ss_pred HHHHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCcc Q lcl|Aclame:pro 75 RIIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQR 154 (708) Q Consensus 75 ~i~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~ 154 (708) ..+|+...+.+.+.+ + +.+. ..+++.+. .|+++...+.+..+++++|.||..+..+. .+. T Consensus 88 ~~~~~l~g~p~~~~~-----~-~~~~----~~~~~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d~---------~~~ 147 (479) T protein:vir:79 88 QKVGYSVGNPIVFNA-----D-DDNL----TKLLNDLL-GEEFDDTITELYLNASNKGVEWLHPYINR---------KGE 147 (479) T ss_pred HHHhhhhcCCceecc-----C-CHHH----HHHHHHHH-hcCHHHHHHHHHHHHHhcCeEEEEEEeCC---------CCc Confidence 999999998877743 1 2222 23444444 47999999999999999999998886431 234 Q ss_pred eeeEEeecchhhee--cCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeee Q lcl|Aclame:pro 155 IAIEPIYDPSRSVW--FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYE 232 (708) Q Consensus 155 i~i~~v~~~~~~v~--~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~ 232 (708) +++..+ ++..+| ||+... +..-++++.|...+ .+.+.....++|. T Consensus 148 ~~i~~~--~p~~~~~v~d~~~~-----~~~~~~ir~y~~~~--------------------------~~~~~~~~~e~y~ 194 (479) T protein:vir:79 148 FKYVII--PAEEAIPIWDSKRQ-----RELVAFIRFYYIED--------------------------IDGNKIKRVEYYT 194 (479) T ss_pred eEEEEE--ccceeEEEEeCCCC-----CceEEEEEEEEEee--------------------------cCCceEEEEEEEe Confidence 455443 334443 444221 11112222222110 0011223345554 Q ss_pred ecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEE Q lcl|Aclame:pro 233 VRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVY 312 (708) Q Consensus 233 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~ 312 (708) ...... |.....+......... ............+....+.+++.+|+++|. T Consensus 195 ~~~i~~----~~~~~~~~~~~~~~~~------------------------~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~ 246 (479) T protein:vir:79 195 ENDITY----FIERGNSFIQEFLYDE------------------------YGKMTDIQEGHFRINNKEQGWGKVPFIPFK 246 (479) T ss_pred CCcEEE----EEecCCcccccccccc------------------------cccccccccccccccccccCCCcccEEEec Confidence 443221 1111110000000000 000000111112233445556666666543 Q ss_pred EeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeecccccccc Q lcl|Aclame:pro 313 GKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKSG 392 (708) Q Consensus 313 ~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 392 (708) - ...+.|.+..+++.++.+|...|.+.+.+...+++.+++.........+... +......+... T Consensus 247 n-------n~~g~sd~~~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~~~~--~~~~~~~i~~~------- 310 (479) T protein:vir:79 247 N-------NEKCVSDLTFYKSLIDIYDNNISTLADNLDEIQEVIYVLKEYPGTSLQEFID--NIRYYKSIKVD------- 310 (479) T ss_pred C-------CCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccccchh--hhhhccceecC------- Confidence 1 2346788999999999999999999999998888776653211111111111 11111122111 Q ss_pred cccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 393 NIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAG 472 (708) Q Consensus 393 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~~ 472 (708) +.....+...+.-..++...+......|...|++.+...+..+|.||.|+..............-..|..+++++. T Consensus 311 ----~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~gn~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~ 386 (479) T protein:vir:79 311 ----GGGGVDKLEINIPVEAKKELLDRLEKNIIIFGQGVNPESQNTGDKSGVALKFLYSLLDLKCSKTEKKFKKAIRELL 386 (479) T ss_pred ----CCCcceEEeccCCHHHHHHHHHHHHHHHHHHhCccccccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1112333333333466677788888899999998888777667889999988877777777777777777887777 Q ss_pred HHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHHH Q lcl|Aclame:pro 473 EVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLS 552 (708) Q Consensus 473 ~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq 552 (708) ++++.++... +. . .++ ..+|.|.=.+..+....+..+.+..+.. T Consensus 387 ~li~~~~~~~----------~~--~---------~~~---------------~~~i~i~f~~~~p~~~~~~a~~~~kl~g 430 (479) T protein:vir:79 387 WFVCEYLKIS----------GN--K---------SYD---------------YKTVQITFNHSMIINEAEKIDMAAKSTG 430 (479) T ss_pred HHHHHHHhcc----------CC--C---------ccc---------------cccceEEeCCCCCcCHHHHHHHHHHHhc Confidence 7766654321 10 0 000 1233444444555444445555554421 Q ss_pred hccccCchhHHHHHHHHhhccc-hhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 553 SMLPTDPMRPAIQGIILDNIDG-EGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQ 631 (708) Q Consensus 553 ~~~~~~p~~~~~~~~~~~~~d~-~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~q 631 (708) .+ ....+++++.+ ...++-.++++... ........ ... ...+ T Consensus 431 ~i---------S~et~l~~l~~v~d~~~E~~ri~~E~---------------------~~~~~~~~----~~~---~~~~ 473 (479) T protein:vir:79 431 IV---------SDETIVSNHPWVEDVNDELERLKKQE---------------------DTQKEYDD----LIP---NNQD 473 (479) T ss_pred cC---------cHHHHHHhCCCCCCHHHHHHHHHHHH---------------------HHHHHHHh----ccC---cccC Confidence 11 11223333321 11111122221110 00000000 000 0000 Q ss_pred HHHHHH Q lcl|Aclame:pro 632 AEAQKA 637 (708) Q Consensus 632 ae~~k~ 637 (708) .....+ T Consensus 474 ~~~~e~ 479 (479) T protein:vir:79 474 GVIDET 479 (479) T ss_pred CCcCcC Confidence 000000 No 86 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=99.61 E-value=2.1e-13 Score=90.08 Aligned_cols=483 Identities=11% Similarity=0.008 Sum_probs=234.7 Q ss_pred CC--cchHHHHHHHHHHH------HHH-----HHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeec Q lcl|Aclame:pro 1 MA--ETLEKKHERIMLRF------DRA-----YSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEIN 67 (708) Q Consensus 1 ma--~~~~~~~~~~~~~~------~~~-----~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N 67 (708) |. +++++.+++...+. ... ..-..+...+..... .||.|+ |+.. ......+....|...+.| T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~ri~~~~--~~y~g~-~~~~--~~~~~~~~~~~~~~~sln 75 (508) T protein:vir:15 1 MGLIQRIKDLFWKGAAATGVTGSLSKITDDPRISIDPDEYVRIQTDL--DYYSDK-LQYI--HYQASDGIKKKRLKNTIN 75 (508) T ss_pred CChHHHHHHHHHHHHHHhccccchHHhhcccccccCHHHHHHHHHHH--HHhcCC-Cccc--ccccCCCCccccceeecc Confidence 43 23333333322111 001 111122333333333 357775 2211 111111111234456779 Q ss_pred chHHHHHHHHHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCC Q lcl|Aclame:pro 68 KVATELNRIIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYD 147 (708) Q Consensus 68 ~i~~~i~~i~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d 147 (708) +-+.+++...+....-.+.+.|.. ++ .....|..+.+.|++......++++++..|.||+++.++. T Consensus 76 ~~~~i~~~~A~lv~~e~~~i~v~~----~~-----~~~e~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~----- 141 (508) T protein:vir:15 76 MAKTAARRIASVVFNEKAEIHVKD----NN-----EADKFLNDVLEDNDFKNKFEEALEKGVALGGFAMRPYIDG----- 141 (508) T ss_pred hHHHHHHHHHhhhhCCCceEEeCC----ch-----HHHHHHHHHHHhccHHHHHHHHHHHHhhcCceEEEEEEeC----- Confidence 999999999999888888888732 11 1234455566689999999999999999999999998752 Q ss_pred CCCCCcceeeEEeecchhheecCCccc-cCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeE Q lcl|Aclame:pro 148 PMDDRQRIAIEPIYDPSRSVWFDPDAK-KYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIY 226 (708) Q Consensus 148 ~~~~~~~i~i~~v~~~~~~v~~Dp~a~-~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~ 226 (708) +.++|..+ +...|| |-.. ..++..+-|+......+ . ....-.. T Consensus 142 -----~~~~i~~v--~ad~~~--P~~~d~~~~~~~af~~~~~~~~--~-------------------------~~~~~yt 185 (508) T protein:vir:15 142 -----NHIKIAWV--RADQFY--PLQSNTNDISEAAIASRTQRTE--S-------------------------NQTKYYT 185 (508) T ss_pred -----CeeEEEEE--cCCeeE--EEEEcCCCeEEEEEEEEEEeec--C-------------------------CCceEEE Confidence 23455544 445555 3111 12333443332221100 0 0001111 Q ss_pred Eeeeeeec-ceEEE-EEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCC Q lcl|Aclame:pro 227 IAKYYEVR-KESVD-VISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGE 304 (708) Q Consensus 227 v~e~~~~~-~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~ 304 (708) ..|+++.. ..... .+. .|.+.....+-......-+.. |.-+-....+.+....+|. T Consensus 186 ~lE~h~~~~~~~~~I~n~----------ly~~~~~~~lG~~v~l~~~~e------------~~~l~~~~~~~g~~~p~f~ 243 (508) T protein:vir:15 186 LLEFHQWQDNGSYQITNE----------LYKSDSPDIVGNQVPLSTLPV------------YKELAPQVTISGLQRPLFA 243 (508) T ss_pred EEEEEEEecCcceEEEEE----------EEecCCchhcCcccchhhccc------------ccCCCcceEecCCCcceeE Confidence 22232210 00000 111 111111000000000000000 0000000001110111122 Q ss_pred CcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeee Q lcl|Aclame:pro 305 HIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPL 384 (708) Q Consensus 305 ~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~ 384 (708) +|+. | .......+++.|.|++.++++.++.+|...|++.+.+ ..+..++.++++.+..-.+......... ..+ T Consensus 244 y~~~-~--~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~-~~~~~~i~v~~~~l~~d~~~~~~~~~~~---~~~ 316 (508) T protein:vir:15 244 YFKT-P--GANNINIESPLGLGVVDNAKHVLDDINDTHDQFIWEI-RLGQKHIAVQPGMLRFDDEHKPTFDTEQ---NVY 316 (508) T ss_pred EecC-C--ccccccCCCCcCCchHhhhHHHHHHHHHHHHHHHHHH-HhcccceeechHHhcCCCCCccccCCCC---eeE Confidence 2211 1 1111133567788999999999999999999999999 5667788888877642111000000111 111 Q ss_pred cccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccc--hhHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 385 REVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSN--IAQETVNNLMNRADMASFIYLD 462 (708) Q Consensus 385 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n--~sg~ai~~~q~q~~~~~~~~~d 462 (708) ..... . ......+..+++.--...+...++.....+....|++....|..++ .||++|....+..-.....+.. T Consensus 317 ~~~~~---~-~~~~~~i~~~~~~ir~e~~~~~~~~~l~~~~~~~gls~~~f~~~~~~~~TAtei~s~~~~~~~t~~~~~~ 392 (508) T protein:vir:15 317 VGVLS---D-DNNGLGVKDMTTPIRTVQYKDAIDHFIKEFEVQIGLSTGTFSYSNDGVKTATEVVSNNSMTYQTRSSYLT 392 (508) T ss_pred EeccC---C-CCCCCceeEeecccChHHHHHHHHHHHHHHHHHhCCCchhcccccCccccHHHHHHHHHHHHHHHHHHHH Confidence 11110 1 1111234445544444667888888888999999999998886543 5899999888888888888999 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHH Q lcl|Aclame:pro 463 NMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDA 542 (708) Q Consensus 463 n~~~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~ 542 (708) .+..+++++.+.++.+..-++-..- |. ..... ++-...++|+|+=+.+-...+++ T Consensus 393 ~~~~al~~lv~~il~l~~~~~~~~~--------g~-~~~~~----------------~~~~~~~~v~v~f~D~i~~d~~~ 447 (508) T protein:vir:15 393 MVEKAIDELCQSIFELANAGALFDD--------GK-PLFTL----------------DSASQPLDIECHFDDGVFVNKDK 447 (508) T ss_pred HHHHHHHHHHHHHHHHHHHhccccc--------cc-ccccc----------------ccccCCcceEEEeCCCCCCCHHH Confidence 9999999999999998755432100 00 00000 00111345555555555555667 Q ss_pred HHHHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhhhhcccCc-------chH Q lcl|Aclame:pro 543 TVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPR-------NEK 600 (708) Q Consensus 543 ~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~-------~~~ 600 (708) ..+.++++...+. . + .-.+.+-....+=+-+++..++++...+......+. ..+ T Consensus 448 ~~~~~~~~v~aGi-~-s--~e~~i~~~~g~~deea~~el~ri~~E~~~~~~~~~~~~~~~g~~ge 508 (508) T protein:vir:15 448 QLEEDAKVLAIGA-L-S--KQTFLQRNYGMTDEQAAEELAKIQSEAPTDTFEGGRSAILNGGDGE 508 (508) T ss_pred HHHHHHHHHhcCC-C-C--HHHHHHhcCCCChHHHHHHHHHHHHhccccCccccccccCCCCCCC Confidence 7777777765432 1 1 111111111111133445555555543322111110 011 No 87 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=99.59 E-value=8.6e-14 Score=92.22 Aligned_cols=451 Identities=11% Similarity=0.027 Sum_probs=209.4 Q ss_pred CCcc----hHHHHHHHHHH-----------HHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHH--HhhhhhhhcCCC- Q lcl|Aclame:pro 1 MAET----LEKKHERIMLR-----------FDRAYSPQKEVREKCIEATRFARVPGGQWEGATAA--GTKLDEQFEKYP- 62 (708) Q Consensus 1 ma~~----~~~~~~~~~~~-----------~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~--~l~~~~q~~grp- 62 (708) |||. .+..++++.++ +.+..+.......+.. +-..||.|+| +-..+. ..........+| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~--~~~~Yy~g~~-~i~~~~~~~~~~~~~~~~~~~ 77 (474) T protein:vir:96 1 MIVIFWPNEKPYHERVVEQIKPKYETQEEMIIRLINDHKPKIDDIT--VGERYYNHDP-DVLRLAPKLDNKGEIDPLKPD 77 (474) T ss_pred CeeeccCCCchhhhhHHHHhhhccCChHHHHHHHHHHHHHHHHHHH--HHHHHhccCC-cchhccchhcccccccccccc Confidence 7763 23334444322 2223333222222222 2245899976 211111 111111111122 Q ss_pred -ceeecchHHHHHHHHHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEee Q lcl|Aclame:pro 63 -KFEINKVATELNRIIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSM 141 (708) Q Consensus 63 -~~~~N~i~~~i~~i~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~ 141 (708) .++.|..+.+|+..+|+...+.+.+.+ +|.+..+.+..++ + ++++.....+..++.++|.||..+..+ T Consensus 78 ~ki~~n~~~~Ivd~~~~~l~g~p~~~~~------~d~~~~~~l~~~~----~-n~~~~~~~~~~~~~~~~G~~~~~~y~d 146 (474) T protein:vir:96 78 WRMFTNYHQNLVDQKVAYAVANPVTFSS------DDDKSLKTIQEVL----N-HKWDDKLVDILTAASNKGIEWLQPYID 146 (474) T ss_pred hhcccchHHHHHHhhhhhhcccCceeec------CchHHHHHHHHHH----h-cCHHHHHHHHHHHHHhcCeeEEEEEec Confidence 267899999999999999998888753 1233444444433 2 578888899999999999999887543 Q ss_pred ccccCCCCCCCcceeeEEeecchhhee--cCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCC Q lcl|Aclame:pro 142 LVNEYDPMDDRQRIAIEPIYDPSRSVW--FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNW 219 (708) Q Consensus 142 ~~~~~d~~~~~~~i~i~~v~~~~~~v~--~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~ 219 (708) . .+.+++..+ ++..+| ||+... .+. .++++.|-.. . T Consensus 147 ~---------~~~~~i~~~--~p~~~~~v~d~~~~----~~~-~~~vr~~~~~-----------~--------------- 184 (474) T protein:vir:96 147 E---------NGEFKTFRV--PAEQAIPIWTNKER----DTL-KAFIRYYRLD-----------G--------------- 184 (474) T ss_pred C---------CCceEEEEE--cccceEEEEcCCCC----Cce-EEEEEEEeec-----------C--------------- Confidence 2 234555543 334454 444221 122 2333333100 0 Q ss_pred CCCceeEEeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCC Q lcl|Aclame:pro 220 FGADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPR 299 (708) Q Consensus 220 ~~~~~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~ 299 (708) . .-.++|...++.. +.+.+ +.......... .......+.... T Consensus 185 --~---~~~~~yt~~~v~~--~~~~~---~~~~~~~~~~~----------------------------~~~~~~~~~~~~ 226 (474) T protein:vir:96 185 --A---ERVEYWTDSDVTY--YEYQD---GILIPDYYHGE----------------------------EHIQSHYYVGNK 226 (474) T ss_pred --c---eEEEEEeCCeEEE--EEecC---Cceeecccccc----------------------------cccccccccccc Confidence 0 0012232221110 11110 00000000000 000111222335 Q ss_pred CCCCCCcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCC Q lcl|Aclame:pro 300 RIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRP 379 (708) Q Consensus 300 ~~p~~~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~ 379 (708) +.+++.+|+++|.. ...|.|.+..+++.++.+|...|.+.+.+...+.+.+++.-...++..+... +.... T Consensus 227 ~~~~g~iPvv~~~n-------n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~--~~~~~ 297 (474) T protein:vir:96 227 RVSWGRVPFIPFKN-------NPQEMSDLFMYKTIIDAMDKRLSDTQNTFDESTELIYILKGYEGQDLDEFMR--NLKYY 297 (474) T ss_pred ccCCCceeEEEecc-------CCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccccchhh--hhhcC Confidence 56677777776532 2346799999999999999999999999999888776653211111111111 11112 Q ss_pred ceeeecccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHccc-ccchhHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 380 AFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM-PSNIAQETVNNLMNRADMASF 458 (708) Q Consensus 380 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~-~~n~sg~ai~~~q~q~~~~~~ 458 (708) ..+.... ..+ ...+...+.-..+....++...+.|-..|++.+.+.+. .+|.||.|+..+......... T Consensus 298 ~~i~~~~---~~~-------~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~ 367 (474) T protein:vir:96 298 KAINVDG---DGS-------GVDTIQIEVPVQSSKEYLDMLRDYVIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLKAN 367 (474) T ss_pred ceEEecC---CCC-------ceeEEeecCChHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHH Confidence 2222110 011 12333333334666778888899999999987766554 467899999988777777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchh Q lcl|Aclame:pro 459 IYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTA 538 (708) Q Consensus 459 ~~~dn~~~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~ 538 (708) .....|..+++++.++++.+.-.-++ . .+|.|.=.+..+. T Consensus 368 ~k~~~~~~~l~~~~~~i~~~~~~~~~-------------~---------------------------~~i~i~f~~~~p~ 407 (474) T protein:vir:96 368 KLKNKTLTALQELLQYIIDFYKLNIK-------------V---------------------------QDVEITFNFNVMV 407 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHhCCCcc-------------c---------------------------ceeeEEeccCCCc Confidence 77777888888877776665311110 0 0112222223333 Q ss_pred HHHHHHHHHHHHHHhccccCchhHHHHHHHHhhccc-hhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 539 RRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDG-EGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPN 617 (708) Q Consensus 539 ~r~~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~-~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~ 617 (708) .-.+..+.+ ...+- ....+++.++.+ ...+...+++.+...... ++... T Consensus 408 ~~~e~~~~~----~~ag~------iS~et~~~~~~~v~d~~~E~~ri~~E~~e~~-------------------~~~~~- 457 (474) T protein:vir:96 408 NELEQSQIG----VQSQY------LSKETVVTNHPWVDDPVAELERIEQDNIDFN-------------------KQLPP- 457 (474) T ss_pred CHHHHHHHH----HhcCC------CchHHHHHhCCCCCCHHHHHHHHHHHHHHHH-------------------hcccc- Confidence 222232222 22121 111223333221 122222222221110000 00000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 618 PEMVLAQAQMVAAQAEAQKATNE 640 (708) Q Consensus 618 ~~~~~aq~~~~~~qae~~k~~~~ 640 (708) . ..+. ... ..+-.+ +.+ T Consensus 458 ~---~~~~-~~~-~~d~~~-e~~ 474 (474) T protein:vir:96 458 L---EGDA-NGR-AQDNES-ETN 474 (474) T ss_pred c---cccc-ccc-cCCCcc-cCC Confidence 0 0000 000 000000 000 No 88 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=99.59 E-value=1.6e-13 Score=90.76 Aligned_cols=473 Identities=9% Similarity=0.007 Sum_probs=227.1 Q ss_pred CCcchHHHHHHHHHH-----HHHHHH-----hhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchH Q lcl|Aclame:pro 1 MAETLEKKHERIMLR-----FDRAYS-----PQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVA 70 (708) Q Consensus 1 ma~~~~~~~~~~~~~-----~~~~~~-----~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~ 70 (708) |-++.+..+++.... +..-.+ .-.+.+.+..... .||.|+.+.-.... ..+....+..++.|+-+ T Consensus 3 ~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~--~~Y~g~~~~~~~~~---~~~~~~~~~~~slnl~~ 77 (500) T protein:vir:30 3 VIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNL--KYYKSDWDSVLYLN---TDGETKKRDLNHLPIAR 77 (500) T ss_pred hHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHH--HHhcCCCCCccccc---CCCCcccCceeecchHH Confidence 333444444432211 111111 1122333333333 36778643221111 01111234567789999 Q ss_pred HHHHHHHHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCC Q lcl|Aclame:pro 71 TELNRIIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMD 150 (708) Q Consensus 71 ~~i~~i~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~ 150 (708) .+++...+...+-.+.+.+. |....+. +..+.+.|++.....++++.++..|.|++++.++. T Consensus 78 ~i~~~~A~lv~~e~~~i~~~------d~~~~~~----l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~-------- 139 (500) T protein:vir:30 78 TAAKKIASLVFNEQAEIKVD------DDAANEF----ISETLKNDRFNKNFERYLESCLALGGLAMRPYVDG-------- 139 (500) T ss_pred HHHHHHhhhhcCCcceEecC------ChHHHHH----HHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-------- Confidence 99999999888888888771 2334444 45555589999999999999999999999998752 Q ss_pred CCcceeeEEeecchhheecCCccc-cCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCc-eeEEe Q lcl|Aclame:pro 151 DRQRIAIEPIYDPSRSVWFDPDAK-KYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGAD-VIYIA 228 (708) Q Consensus 151 ~~~~i~i~~v~~~~~~v~~Dp~a~-~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~-~~~v~ 228 (708) +.++|..+ +...|| |-.. .-+...+-++++.. ... .+.+ -.... T Consensus 140 --~~~~I~~v--~ad~~~--P~~~d~~~~~~~a~~~~~~---~~~-------------------------~~~~~~yt~l 185 (500) T protein:vir:30 140 --DKVRVAFV--QAPVFL--PLQSNTQDVSSAAVVIKSV---KTI-------------------------NGKEVYYTLI 185 (500) T ss_pred --CceEEEEE--cCCeeE--EEEEcCCCeEEEEEEEEEe---eee-------------------------cCCceEEEEE Confidence 13455444 345555 3111 11122222222110 000 0001 11222 Q ss_pred eeeeecce-EE-EEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCc Q lcl|Aclame:pro 229 KYYEVRKE-SV-DVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHI 306 (708) Q Consensus 229 e~~~~~~~-~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~ 306 (708) |+++-... .. ..+. .|.+.....+ |.. +....+ |.-+-+...+.+ ++...| T Consensus 186 E~h~~~~~~~~~I~n~----------ly~~~~~~~l-------G~~------v~l~~~-~~~l~~~~~~~~---~~~p~f 238 (500) T protein:vir:30 186 EFHEWQSSDDYVISNE----------LYRSDDKAKV-------GSR------VPLSEV-YKDLKDEAKVTD---VTRPIF 238 (500) T ss_pred EEEEEeCCceeEEEEE----------EEeccccccc-------Ccc------cccccc-cCCcCcceEecc---CCCccE Confidence 33321100 00 0011 1111100000 000 000000 000000111111 111111 Q ss_pred ceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhc-------ccCC Q lcl|Aclame:pro 307 PLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARN-------KKRP 379 (708) Q Consensus 307 p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~-------~~~~ 379 (708) .|+++-.......+++.|.|++.++++..+.+|...|++.+.+.. +..++.++.+.+.....-..... .... T Consensus 239 ~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~-g~~~i~v~~~~l~~~~~~~~g~~~~~~~~d~~~~ 317 (500) T protein:vir:30 239 TYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKM-GQRRVAVPESLTALTVRTTDGDVVPRPRFESDQN 317 (500) T ss_pred EEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHh-CcceeeechHHhcccCCCCCccccCCcccCCCcc Confidence 111110111123466778899999999999999999999999976 55678888776632211000000 0111 Q ss_pred ceeeecccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccc--hhHHHHHHHHHHHHHHH Q lcl|Aclame:pro 380 AFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSN--IAQETVNNLMNRADMAS 457 (708) Q Consensus 380 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n--~sg~ai~~~q~q~~~~~ 457 (708) .+.... +. ..+...++..++.-....+...++.....+....|++....|..++ .||++|....+..-... T Consensus 318 ~~~~~~------~~-~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAtei~s~~~~~~~t~ 390 (500) T protein:vir:30 318 VYIRMG------GR-DLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTATEIVSENSDTYQMR 390 (500) T ss_pred eEEEcC------CC-CCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHHHHHHHHHHHHHHH Confidence 111111 11 1111234444443334667788888888888899999988886443 48999998888888888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH--hcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeeccc Q lcl|Aclame:pro 458 FIYLDNMAKSLKRAGEVWLSMARE--VYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPS 535 (708) Q Consensus 458 ~~~~dn~~~~~~~~~~~~l~li~~--~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~ 535 (708) ..+...+..+++++.+.++.+..- +|... ....++|+|+=..+ T Consensus 391 ~~~~~~~~~al~~lv~~il~~~~~~~~~~~~-----------------------------------~~~~~~v~v~f~d~ 435 (500) T protein:vir:30 391 NSIVALVEQSLKELVISIFEIAKAYDLYQSE-----------------------------------VPSMDNISISLDDG 435 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC-----------------------------------CCCCcceEEEeCCC Confidence 889999999999999999987643 33210 00124455554445 Q ss_pred chhHHHHHHHHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhhhhcccCcchHHHHH Q lcl|Aclame:pro 536 YTARRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQI 604 (708) Q Consensus 536 ~~~~r~~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~ 604 (708) -...+++.++.++++.+.+. . + .-.+..-.-..+-.-++++.++++....+......+....--. T Consensus 436 i~~d~~~~~~~~~~~v~aGi-~-s--~~~~i~~~~g~~eeea~~~l~~i~~E~~~~~~~~~~~~~~~g~ 500 (500) T protein:vir:30 436 VFTDRDAELDYWIKVVNAGF-G-T--REMAIQKVLNVTEEKAQEIAAEINTGIVDEINQQRTDTHLYGE 500 (500) T ss_pred CCCCHHHHHHHHHHHHHcCC-C-C--HHHHHHhcCCCCHHHHHHHHHHHHHhccccCCCCCccccccCC Confidence 55556667777777766532 1 1 1111111101111223444445544322211111000000000 No 89 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=99.59 E-value=1.6e-13 Score=90.76 Aligned_cols=473 Identities=9% Similarity=0.007 Sum_probs=227.1 Q ss_pred CCcchHHHHHHHHHH-----HHHHHH-----hhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchH Q lcl|Aclame:pro 1 MAETLEKKHERIMLR-----FDRAYS-----PQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVA 70 (708) Q Consensus 1 ma~~~~~~~~~~~~~-----~~~~~~-----~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~ 70 (708) |-++.+..+++.... +..-.+ .-.+.+.+..... .||.|+.+.-.... ..+....+..++.|+-+ T Consensus 3 ~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~--~~Y~g~~~~~~~~~---~~~~~~~~~~~slnl~~ 77 (500) T protein:vir:98 3 VIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNL--KYYKSDWDSVLYLN---TDGETKKRDLNHLPIAR 77 (500) T ss_pred hHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHH--HHhcCCCCCccccc---CCCCcccCceeecchHH Confidence 333444444432211 111111 1122333333333 36778643221111 01111234567789999 Q ss_pred HHHHHHHHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCC Q lcl|Aclame:pro 71 TELNRIIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMD 150 (708) Q Consensus 71 ~~i~~i~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~ 150 (708) .+++...+...+-.+.+.+. |....+. +..+.+.|++.....++++.++..|.|++++.++. T Consensus 78 ~i~~~~A~lv~~e~~~i~~~------d~~~~~~----l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~-------- 139 (500) T protein:vir:98 78 TAAKKIASLVFNEQAEIKVD------DDAANEF----ISETLKNDRFNKNFERYLESCLALGGLAMRPYVDG-------- 139 (500) T ss_pred HHHHHHhhhhcCCcceEecC------ChHHHHH----HHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-------- Confidence 99999999888888888771 2334444 45555589999999999999999999999998752 Q ss_pred CCcceeeEEeecchhheecCCccc-cCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCc-eeEEe Q lcl|Aclame:pro 151 DRQRIAIEPIYDPSRSVWFDPDAK-KYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGAD-VIYIA 228 (708) Q Consensus 151 ~~~~i~i~~v~~~~~~v~~Dp~a~-~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~-~~~v~ 228 (708) +.++|..+ +...|| |-.. .-+...+-++++.. ... .+.+ -.... T Consensus 140 --~~~~I~~v--~ad~~~--P~~~d~~~~~~~a~~~~~~---~~~-------------------------~~~~~~yt~l 185 (500) T protein:vir:98 140 --DKVRVAFV--QAPVFL--PLQSNTQDVSSAAVVIKSV---KTI-------------------------NGKEVYYTLI 185 (500) T ss_pred --CceEEEEE--cCCeeE--EEEEcCCCeEEEEEEEEEe---eee-------------------------cCCceEEEEE Confidence 13455444 345555 3111 11122222222110 000 0001 11222 Q ss_pred eeeeecce-EE-EEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCc Q lcl|Aclame:pro 229 KYYEVRKE-SV-DVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHI 306 (708) Q Consensus 229 e~~~~~~~-~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~ 306 (708) |+++-... .. ..+. .|.+.....+ |.. +....+ |.-+-+...+.+ ++...| T Consensus 186 E~h~~~~~~~~~I~n~----------ly~~~~~~~l-------G~~------v~l~~~-~~~l~~~~~~~~---~~~p~f 238 (500) T protein:vir:98 186 EFHEWQSSDDYVISNE----------LYRSDDKAKV-------GSR------VPLSEV-YKDLKDEAKVTD---VTRPIF 238 (500) T ss_pred EEEEEeCCceeEEEEE----------EEeccccccc-------Ccc------cccccc-cCCcCcceEecc---CCCccE Confidence 33321100 00 0011 1111100000 000 000000 000000111111 111111 Q ss_pred ceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhc-------ccCC Q lcl|Aclame:pro 307 PLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARN-------KKRP 379 (708) Q Consensus 307 p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~-------~~~~ 379 (708) .|+++-.......+++.|.|++.++++..+.+|...|++.+.+.. +..++.++.+.+.....-..... .... T Consensus 239 ~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~-g~~~i~v~~~~l~~~~~~~~g~~~~~~~~d~~~~ 317 (500) T protein:vir:98 239 TYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKM-GQRRVAVPESLTALTVRTTDGDVVPRPRFESDQN 317 (500) T ss_pred EEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHh-CcceeeechHHhcccCCCCCccccCCcccCCCcc Confidence 111110111123466778899999999999999999999999976 55678888776632211000000 0111 Q ss_pred ceeeecccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccc--hhHHHHHHHHHHHHHHH Q lcl|Aclame:pro 380 AFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSN--IAQETVNNLMNRADMAS 457 (708) Q Consensus 380 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n--~sg~ai~~~q~q~~~~~ 457 (708) .+.... +. ..+...++..++.-....+...++.....+....|++....|..++ .||++|....+..-... T Consensus 318 ~~~~~~------~~-~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAtei~s~~~~~~~t~ 390 (500) T protein:vir:98 318 VYIRMG------GR-DLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTATEIVSENSDTYQMR 390 (500) T ss_pred eEEEcC------CC-CCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHHHHHHHHHHHHHHH Confidence 111111 11 1111234444443334667788888888888899999988886443 48999998888888888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH--hcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeeccc Q lcl|Aclame:pro 458 FIYLDNMAKSLKRAGEVWLSMARE--VYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPS 535 (708) Q Consensus 458 ~~~~dn~~~~~~~~~~~~l~li~~--~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~ 535 (708) ..+...+..+++++.+.++.+..- +|... ....++|+|+=..+ T Consensus 391 ~~~~~~~~~al~~lv~~il~~~~~~~~~~~~-----------------------------------~~~~~~v~v~f~d~ 435 (500) T protein:vir:98 391 NSIVALVEQSLKELVISIFEIAKAYDLYQSE-----------------------------------VPSMDNISISLDDG 435 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC-----------------------------------CCCCcceEEEeCCC Confidence 889999999999999999987643 33210 00124455554445 Q ss_pred chhHHHHHHHHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhhhhcccCcchHHHHH Q lcl|Aclame:pro 536 YTARRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQI 604 (708) Q Consensus 536 ~~~~r~~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~ 604 (708) -...+++.++.++++.+.+. . + .-.+..-.-..+-.-++++.++++....+......+....--. T Consensus 436 i~~d~~~~~~~~~~~v~aGi-~-s--~~~~i~~~~g~~eeea~~~l~~i~~E~~~~~~~~~~~~~~~g~ 500 (500) T protein:vir:98 436 VFTDRDAELDYWIKVVNAGF-G-T--REMAIQKVLNVTEEKAQEIAAEINTGIVDEINQQRTDTHLYGE 500 (500) T ss_pred CCCCHHHHHHHHHHHHHcCC-C-C--HHHHHHhcCCCCHHHHHHHHHHHHHhccccCCCCCccccccCC Confidence 55556667777777766532 1 1 1111111101111223444445544322211111000000000 No 90 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=99.59 E-value=5.9e-13 Score=87.61 Aligned_cols=462 Identities=12% Similarity=-0.011 Sum_probs=203.1 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHH----HHHHhhhhhhhcCCCceeecchHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGA----TAAGTKLDEQFEKYPKFEINKVATELNRI 76 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~----~~~~l~~~~q~~grp~~~~N~i~~~i~~i 76 (708) |-+.-.+.+.+++..+.... .+-++...||+|+|.-.. .-..++.. -++.|..+.+|+.+ T Consensus 18 l~~~e~~~i~~L~~~~~~~~---------~r~~~l~~YY~G~~~i~~~~~~~p~~~~~~-------~~v~n~~~~iVd~~ 81 (504) T protein:vir:99 18 LNDDVVDKVNGLYQQLVDRT---------PRNLLRASFYDGKYAIRQIGNLIPPEYLRT-------ATVLGWSAKAVDTL 81 (504) T ss_pred CCHHHHHHHHHHHHHHHHHh---------HHHHHHHHHHhccccchhccccccHHHHHH-------hhccCcHHHHHHHH Confidence 55443344444444333211 112233468999885321 11112111 24568888888887 Q ss_pred HHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCccee Q lcl|Aclame:pro 77 IAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIA 156 (708) Q Consensus 77 ~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~ 156 (708) .....-+- -..|. ++.. +..+..+++.|+++...+.+..++++.|++|+-|..+. ++...+. T Consensus 82 a~rl~~~G---f~~~d---~~~~-----~~~l~~i~~~N~ld~~~~~~~~~a~iyG~af~~v~~~~-------d~~~~~~ 143 (504) T protein:vir:99 82 ARRCNLES---FVWPD---GDYG-----SIGGPDVWDENFFATKANNAMVSSLIHGPAFLINTEGG-------AGEPDSL 143 (504) T ss_pred Hhhhccce---eeCCC---CChh-----hHHHHHHHHhcChhhHHHHHHHHHHhhCceeEEEecCC-------CCCceeE Confidence 75433221 12222 2211 23355678899999999999999999999998774321 2233334 Q ss_pred eEEeecchhhe--ecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeeeec Q lcl|Aclame:pro 157 IEPIYDPSRSV--WFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVR 234 (708) Q Consensus 157 i~~v~~~~~~v--~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~ 234 (708) |..+ ++.++ +|||...++. +..+.+ . .+ .........+|... T Consensus 144 I~~~--sP~~~~~iyD~~~~~~~------~a~~~~-~--------------------------~d-~~g~~~~~~~y~~~ 187 (504) T protein:vir:99 144 IHVK--SAMQATGEWNSRRNAMD------SLLSIT-S--------------------------RD-AEGHPTGIALYEDG 187 (504) T ss_pred EEEe--ccceeEEEEeCCCCcee------EEEEEE-E--------------------------ec-CCCeEEEEEEEcCC Confidence 4433 33344 4787433211 111111 0 00 00011112222211 Q ss_pred ceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEe Q lcl|Aclame:pro 235 KESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGK 314 (708) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~ 314 (708) .. +.+. .-++.....+..+.|++ .|+|||+-. T Consensus 188 ~~---------------~~~~--------------------------------~~~~~~~~~~~~~~~~g-vPvV~~~n~ 219 (504) T protein:vir:99 188 VT---------------VTAD--------------------------------MDDDGDWHADVRTHKLG-VPVEVLPYK 219 (504) T ss_pred cE---------------EEEE--------------------------------EcCCceeeeccccCCCC-cceEEeccc Confidence 10 0000 00000011123345554 788887643 Q ss_pred eeccCCcccccc---hHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccc-h-------HHHHHhhcccCCceee Q lcl|Aclame:pro 315 RWFIDDIERVEG---HIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRG-L-------EKHWEARNKKRPAFLP 383 (708) Q Consensus 315 ~~~~d~~~~~~G---~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~-~-------~~~~~~~~~~~~~~~~ 383 (708) +. ...+|| +.+.+++.++.+|+.++.++......+.++..+- |+... . ...|... .+.++ T Consensus 220 ~~----~~~~~G~sei~~~v~~l~Da~~~~~~~~~~~~e~~a~p~r~i~-G~~~~~~~~~d~~~~~~~~~~----~~~i~ 290 (504) T protein:vir:99 220 PR----EDRPLGSSRITRPVMSLQQRALKGCIRMDGHADVYSFPQLILL-GADAKNFRNKDGSMKPAWQIA----LARVF 290 (504) T ss_pred cc----CccccCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhc-cCCccccccccccccchhhhh----hhhhh Confidence 22 123444 4468999999999999999887776666554441 11100 0 0011100 00011 Q ss_pred ecccccccccccccc-cccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccc---cchhHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 384 LREVRDKSGNIIAGA-TPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMP---SNIAQETVNNLMNRADMASFI 459 (708) Q Consensus 384 ~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~---~n~sg~ai~~~q~q~~~~~~~ 459 (708) . ...+..+.+..+. ..++.++...+ ..+...+......+-.+||+.+..+|.. +|+||.|+......-...... T Consensus 291 ~-~~~~~~~~~~~~~~~~~~q~~~~~l-~~~~~~l~~~i~~~a~~t~~P~~~lG~~~~~n~sSa~Ai~~~~~~L~~ka~~ 368 (504) T protein:vir:99 291 A-LPDDEDEPDAARARADVKQFPASSP-QPHIEMLEQIAMMFSGETSIPVESLGFSNRANPTSADAYIASREDLIAEAEG 368 (504) T ss_pred c-CCCccccccccCccceeeecCCCCh-HHHHHHHHHHHHHHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHH Confidence 0 0011111111111 12222332222 3455666666777777799999999853 457999999888887777888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcC-CCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchh Q lcl|Aclame:pro 460 YLDNMAKSLKRAGEVWLSMAREVYG-SEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTA 538 (708) Q Consensus 460 ~~dn~~~~~~~~~~~~l~li~~~y~-~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~ 538 (708) ..+-|..+.++++++.+.+...+-. .....+| +.+ + .-+.+.+ T Consensus 369 k~~~f~~~l~~~~rla~~~~~~~~~~~~~~~~~-------~v~-w----------------------------~d~~~~s 412 (504) T protein:vir:99 369 ATDDWSPAFRRSMIRALAIKNGLDRIPPEWKTI-------DSK-F----------------------------RSPLYLS 412 (504) T ss_pred HHHHHHHHHHHHHHHHHHHhcCCCccccccccc-------eeE-e----------------------------cCCCccC Confidence 8888889999999988766543211 0000100 000 0 0112223 Q ss_pred HHHHHHHHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 539 RRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNP 618 (708) Q Consensus 539 ~r~~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~ 618 (708) ..+..+.+..|.+.+....... ..+++++.+. .+++ T Consensus 413 -~a~~aDa~~Kl~~ag~~l~~~~----~~l~~~lg~~-~~ei-------------------------------------- 448 (504) T protein:vir:99 413 -KAAQADAGAKMLGAGPEWLKET----EVGLELLGLT-PQQA-------------------------------------- 448 (504) T ss_pred -HHHHHHHHHHHHhhccccccch----HHHHhhcCCC-HHHH-------------------------------------- Confidence 3456666777766543211110 1122222221 0000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhcC Q lcl|Aclame:pro 619 EMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVMEAIRLLKDVAESQQQQFQSP 698 (708) Q Consensus 619 ~~~~aq~~~~~~qae~~k~~~~~~~~q~e~~~~~~~~~~~~a~~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 698 (708) + +...+.++++++. -++.. . .+.. .. ..-...........+.+ T Consensus 449 ~---------r~~~e~~~~~~~~---~~~~l-------~---------------~~~~--~~-~~~~~~~~~~~~e~a~~ 491 (504) T protein:vir:99 449 K---------RALAERRRASSVS---IIEAL-------N---------------RRQQ--EA-ATAGEDQDQGAGEPPAN 491 (504) T ss_pred H---------HHHHHHHHHhhHH---HHHHH-------h---------------cccC--CC-CCCCCCCCcCCCCCCCC Confidence 0 0000000000000 00000 0 0000 00 00000000001111222 Q ss_pred CCCCCCCCCC Q lcl|Aclame:pro 699 PQSPADLMPS 708 (708) Q Consensus 699 ~~~~~e~~~~ 708 (708) ..+.+.-+|+ T Consensus 492 ~~~~~~~~p~ 501 (504) T protein:vir:99 492 EPPAALGRPT 501 (504) T ss_pred CCCccCCCcc Confidence 3334455555 No 91 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=99.59 E-value=6.6e-13 Score=87.36 Aligned_cols=473 Identities=10% Similarity=0.017 Sum_probs=229.2 Q ss_pred CCcchHHHHHHHHHH------HHHHHHh-----hHHHHHHHHHHHHHhhcCCC--CCCHHHHHHhhhhhhhcCCCceeec Q lcl|Aclame:pro 1 MAETLEKKHERIMLR------FDRAYSP-----QKEVREKCIEATRFARVPGG--QWEGATAAGTKLDEQFEKYPKFEIN 67 (708) Q Consensus 1 ma~~~~~~~~~~~~~------~~~~~~~-----~~~~r~~~~~d~~~~~~~G~--Qw~~~~~~~l~~~~q~~grp~~~~N 67 (708) |=|++...++.++++ ++...+. ..+.+......++ ||.|+ .|....-. ..+....+..++.| T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~--~Y~g~~~~~~~~~~~---~~~~~~~~~~~s~n 75 (499) T protein:vir:80 1 MINQIIAGVKGVMRRMGLLKSLKDVTDHKKVNANDEDYKYIDMWKR--LYQGNYAEWHNLNYE---HNGNPVNRRQLSMN 75 (499) T ss_pred ChhHHHHHHHHHHHHhccccchhhhhcCCCCcCCHHHHHHHHHHHH--HhcCCcchhhccccc---cCCCccccceeecc Confidence 877766666666654 2223221 2233444444444 46664 55332111 01111123467789 Q ss_pred chHHHHHHHHHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCC Q lcl|Aclame:pro 68 KVATELNRIIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYD 147 (708) Q Consensus 68 ~i~~~i~~i~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d 147 (708) +-+-+++...++.....+.+.+ +|...++.|++ +.+.|++......+.++++..|.||+++.++. T Consensus 76 ~~~~iv~~~a~~l~~ep~~i~~------~d~~~~e~l~~----~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~----- 140 (499) T protein:vir:80 76 LPKVTAKYMSKLLFNEKVKINI------DDETAEEFVLN----VLKTNGFTKNMERYIEYGEAMGGFVIKVYHDG----- 140 (499) T ss_pred hHHHHHHHHHHhhhCCcceEee------CCHHHHHHHHH----HHhhccHHHHHHHHHHHHhhcCcEEEEEEECC----- Confidence 9999999999999999888877 13344555555 55578999999999999999999999998753 Q ss_pred CCCCCcceeeEEeecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEE Q lcl|Aclame:pro 148 PMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYI 227 (708) Q Consensus 148 ~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v 227 (708) .+.+++..+ +...+|+=... ..++..+-|+-... .+ ...... T Consensus 141 ----~~~~~i~~v--~a~~~~Pi~~d-~~~~~~~~f~~~~~---~~----------------------------~~~y~~ 182 (499) T protein:vir:80 141 ----NKNVKVSFA--TADCMYPLSND-SENVDECLIANSFH---KN----------------------------NKYYKL 182 (499) T ss_pred ----CCcEEEEEE--cCCceEEEEec-CCCeEEEEEEEEEe---ec----------------------------CeEEEE Confidence 134555544 44565521111 12344444332111 10 001111 Q ss_pred eeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcc Q lcl|Aclame:pro 228 AKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIP 307 (708) Q Consensus 228 ~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p 307 (708) .||++........+... ...|.+.....+ |..+ ....+ + .-+.....+++ ++ T Consensus 183 lE~h~~~~~~~~~y~I~------n~~~~~~~~~~l-------G~~v------~l~~~-~------~~~~~~~~~~~--~~ 234 (499) T protein:vir:80 183 LEWNEWKGEKEEVYTVT------TELYQSDDPNEL-------GGKV------SLKLL-F------NDIEPVVPLPS--LT 234 (499) T ss_pred EEEEEecccceeeEEEE------EEEEeccCcccc-------Cccc------chhhh-c------cCcCCceeecC--CC Confidence 12211110000000000 001111110000 0000 00000 0 00000001111 11 Q ss_pred eeeEEEeee-----ccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHh---hcccCC Q lcl|Aclame:pro 308 LIPVYGKRW-----FIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEA---RNKKRP 379 (708) Q Consensus 308 ~~p~~~~~~-----~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~---~~~~~~ 379 (708) .+||..++. ...+++.|.|++.++++..+.+|...|++.+.+... ..++.++.+.+....+.... ....+. T Consensus 235 ~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~~-~~~i~v~~~~l~~~~~~~g~~~~~~~~~~ 313 (499) T protein:vir:80 235 RPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKLG-KKKVLVPSSFVKTAVNLDGSTTQYFDSTD 313 (499) T ss_pred ccceEeecCCccccccCCCccCCchHhhHHHHHHHHHHHHHHHHHHHHhc-ccceecchhhhhccCCCCCCcccCCCccc Confidence 112221111 124566778899999999999999999999999764 55677776665322110000 000000 Q ss_pred ceeeecccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHccccc--chhHHHHHHHHHHHHHHH Q lcl|Aclame:pro 380 AFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPS--NIAQETVNNLMNRADMAS 457 (708) Q Consensus 380 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~--n~sg~ai~~~q~q~~~~~ 457 (708) . .+.... +....+...+...++.-...++...++.....+....|++....|... ..||++|....+...... T Consensus 314 ~--~~~~~~---~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~fg~~~~g~~TAtei~s~~~~l~~~~ 388 (499) T protein:vir:80 314 E--AFFLYQ---GEQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSEKSETYQTK 388 (499) T ss_pred c--eeeEee---ccCCCCcCceeEecCcCChHHHHHHHHHHHHHHHHhcCCChhhcCCCcccchhHHHHHHHHHHHHHHH Confidence 0 111111 111111112344444434466778888888899999999998888643 347888887777777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccch Q lcl|Aclame:pro 458 FIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYT 537 (708) Q Consensus 458 ~~~~dn~~~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~ 537 (708) ..+...+..+++++.+.++.+..-+--. .+.. ....+|.|+=..+-+ T Consensus 389 ~~~~~~~~~~l~~l~~~il~~~~~~~~~------~~~~---------------------------~~~~~v~v~f~d~i~ 435 (499) T protein:vir:80 389 NSHSQLIEQGIKEMIVSILEVGKLIKAY------DGDT---------------------------VELDTITVDFDDSIA 435 (499) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccc------cCCC---------------------------CCccceEEEeCCCCC Confidence 7888888899999998888876543210 0100 001233333333444 Q ss_pred hHHHHHHHHHHHHHHhccccCchhHHHHHHHHhhccc---hhHHHHHHHHHhhhhhhhcccCcchHHHHHHH Q lcl|Aclame:pro 538 ARRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDG---EGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQ 606 (708) Q Consensus 538 ~~r~~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~---~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~ 606 (708) ....+..+.++++...+. .. .. .++....+ +-+++..++++...... ...++.....-... T Consensus 436 ~d~~~~~~~~~~~~~~Gi-~S---~e---t~l~~~~~~~d~ea~~el~~i~~E~~~~-~~~~d~~g~~ge~e 499 (499) T protein:vir:80 436 QDEDTTINRYTTAKNQGM-IP---LK---IALQRAWNITEAEADEWAEMLAKEKQAE-IPNNDMTGIFGEEE 499 (499) T ss_pred CCHHHHHHHHHHHHHcCC-CC---HH---HHHhhcCCCChHHHHHHHHHHHHHhhcC-CCCCCccccCCCCC Confidence 445566666776654431 11 11 12222111 22334444444332111 01111000000000 No 92 >protein:vir:7017 Length: 515 # NCBI annotation: head portal protein # Family: family:all:481 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853590;genbank:gi:31711672;genbank:GeneID:1481798 Probab=99.58 E-value=7.7e-13 Score=86.98 Aligned_cols=494 Identities=12% Similarity=0.033 Sum_probs=235.1 Q ss_pred CCcchHHH---HHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHH Q lcl|Aclame:pro 1 MAETLEKK---HERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRII 77 (708) Q Consensus 1 ma~~~~~~---~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~ 77 (708) |-|-.-.. ...+..+|+...+....|...|+++.+|.. .+ -+++.... + ....+.-+.-...++.+. T Consensus 1 ~~~~~~~~~~~~~~l~~r~~~Lk~~R~~~e~~w~e~~~~tl-P~-~~~~~~~~------~--~~~~~~dstg~~a~~~LA 70 (515) T protein:vir:70 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTL-PY-LMNNKGDN------E--TSQNGWQGVGAQATNHLA 70 (515) T ss_pred CcchhhhhcCCHHHHHHHHHHHHHhhhHHHHHHHHHHHHhc-cc-ccCCCCCc------c--cccccccchHHHHHHHHH Confidence 54422111 356788888888888888889988876532 21 12111000 0 111122334444555544 Q ss_pred HHHhc-----CcceeEEecCCC------cchHH---HHHH---HHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEe Q lcl|Aclame:pro 78 AEYRN-----NRITVKFRPGDR------EASEE---LANK---LNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTS 140 (708) Q Consensus 78 g~~~~-----nr~~~~v~pr~~------~~d~~---~A~~---l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~ 140 (708) +.... +++=+++.+.+. +.+.+ +.+. ++..+......|++..+...++.+.+..|.|.+.+ T Consensus 71 a~l~~~ltpp~~~WF~l~~~d~~~~~l~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~-- 148 (515) T protein:vir:70 71 NKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYK-- 148 (515) T ss_pred HHHHHhhcCCCCcccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHHhHCeEEEEE-- Confidence 43332 333344443321 11122 2222 34556666778999999999999999999987654 Q ss_pred eccccCCCCCCCcceeeEEeecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCC Q lcl|Aclame:pro 141 MLVNEYDPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWF 220 (708) Q Consensus 141 ~~~~~~d~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~ 220 (708) + + .. ..++. |+.++++..+.. . ...-++++..++..++...|+........ .. ... T Consensus 149 d---~---~~---~~~~~----pl~~y~v~~d~~---G-~v~~i~rr~~~t~~~l~~~f~~~~~~~~~-----~~--~~~ 204 (515) T protein:vir:70 149 P---S---KG---AMSAV----PMHHYVVNRDTN---G-DLMDVILLQEKALRTFDPATRMAIEVGMK-----GK--KCK 204 (515) T ss_pred e---C---CC---CeEEE----EcCeEEEeeCCC---c-CeeEEEeeeeccHHHHHHhhhhhhhhhhh-----hh--hcC Confidence 1 1 11 12332 455666544332 1 22237889999999999999853211000 00 000 Q ss_pred CCceeEEeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCC Q lcl|Aclame:pro 221 GADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRR 300 (708) Q Consensus 221 ~~~~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~ 300 (708) ..+.+. +|.+..+. .+ .++.|+.-.+...+...+- T Consensus 205 ~~~~v~-------------i~~~v~~~--------~~------------------------~~~~~~~e~d~~~~~~es~ 239 (515) T protein:vir:70 205 EDDNVK-------------LYTHAQYA--------GE------------------------GFWKINQSADDIPVGKESR 239 (515) T ss_pred CCCceE-------------EEEEEEec--------CC------------------------CceEEEEecCceeeccccc Confidence 111111 11111110 00 0111222222233344577 Q ss_pred CCCCCcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCc Q lcl|Aclame:pro 301 IPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPA 380 (708) Q Consensus 301 ~p~~~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~ 380 (708) ||+.++||+++.... .+|..+|.|.+....+--+.+|++...++.....+.+++++++.+.+-...... ...++. T Consensus 240 y~~~e~P~~~~Rw~~--~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~l~---~~~~g~ 314 (515) T protein:vir:70 240 IKSEKLPFIPLTWKR--SYGEDWGRPLAEDYSGDLFVIQFLSEAMARGAALMADIKYLIRPGSQTDVDHFV---NSGTGE 314 (515) T ss_pred cccccCCceeeeeee--cCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCCeeeCcccccchhhcc---ccCCce Confidence 888999998765443 688899999999999999999999999999999999999999887664322111 111111 Q ss_pred eeeecccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccchhHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 381 FLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMASFIY 460 (708) Q Consensus 381 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n~sg~ai~~~q~q~~~~~~~~ 460 (708) ++ ++ ..+.+ .+.+ ..+..--+.....++...+.|....=+.....+.+.+.|++-|..+.+.-...+... T Consensus 315 iv---~g--~~~~v----~~~~-~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~rd~~rvTAtEV~~r~~E~~~~LGpv 384 (515) T protein:vir:70 315 VI---TG--VAEDI----HIVQ-LGKYADLTPISAVLEVYTRRIGVIFMMETMTRRDAERVTAVEIQRDALEIEQNMGGV 384 (515) T ss_pred ee---cC--Ccccc----eeee-cCcccchhHHHHHHHHHHHHHHHHHhhhhhhccCCccccHHHHHHHHHHHHHHhhHH Confidence 11 11 11111 1111 111111244455666666666665534333333333468888888888777777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHH Q lcl|Aclame:pro 461 LDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARR 540 (708) Q Consensus 461 ~dn~~~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r 540 (708) +.+|.. +++..||.... +..++. .|. + + .++.+..+. .+-.| T Consensus 385 ~srL~~------Ell~Pli~r~~------~~~~p~-------------~P~-~------~-----v~~~~vs~l-~~L~r 426 (515) T protein:vir:70 385 YSLFAM------TMQTPIAMWGL------QEAGDS-------------FTS-E------L-----VDPVIVTGI-EALGR 426 (515) T ss_pred HHHHHH------HHHHHHHHHHH------HhhCCC-------------CCh-h------h-----cccceehhH-HHHHH Confidence 777653 22333332211 001110 010 0 0 122232332 34457 Q ss_pred HHHHHHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHH-hhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 541 DATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNR-NQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPE 619 (708) Q Consensus 541 ~~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~-~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~ 619 (708) .+..+.+..+++.++..... .+.++...|+ +++.+.+. ....+.....+ +++.++.. +|+++++++ + T Consensus 427 ~q~~~~i~~~~q~i~~~~~~----~p~~~~~id~---d~~~~~~a~~~g~p~~~~rs--~eev~~~r--~q~~~~~~~-~ 494 (515) T protein:vir:70 427 MAELDKLANFAQYMSLPQTW----PEPAQRAIRW---GDYMDWVRGQISAELPFLKS--EEEMQQEM--AQQAQAQQE-A 494 (515) T ss_pred HHHHHHHHHHHHHHHHHhcc----ChhHHhhCCH---HHHHHHHHHHhCCCccccCC--HHHHHHHH--HHHHHHHHH-H Confidence 77777787777765432211 1223333333 23322221 11222222222 12111111 111111100 0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 620 MVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQ 653 (708) Q Consensus 620 ~~~aq~~~~~~qae~~k~~~~~~~~q~e~~~~~~ 653 (708) ...+ .+.++.....+..+ ++. T Consensus 495 ~~~~---------~~~~a~~~~~~~~~----~~~ 515 (515) T protein:vir:70 495 MLNE---------GVAKAVPGVIQQEM----KEG 515 (515) T ss_pred HHHH---------hhhhhcccchhhhh----ccC Confidence 0000 00111111000000 000 No 93 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=99.58 E-value=1.4e-13 Score=91.13 Aligned_cols=408 Identities=11% Similarity=0.009 Sum_probs=203.8 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHH----HHHHhhhhhhhcCCCceeecchHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGA----TAAGTKLDEQFEKYPKFEINKVATELNRI 76 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~----~~~~l~~~~q~~grp~~~~N~i~~~i~~i 76 (708) |= ...+++++..+.. . ..+-++...||+|+|.-.. ....++.. .+ ++.|..+.+|+++ T Consensus 1 m~---~~~i~~L~~~~~~----~-----~~r~~~~~~yy~g~~~~~~~~~~~p~~~~~~----~~--~v~nw~~~~Vd~~ 62 (422) T protein:vir:97 1 MN---YMGMGYLRRKLAL----F-----KTGVDKRYRYYAMDDRDDTRSIVMPNNVREM----YR--SVLEWTAKGVDSL 62 (422) T ss_pred CC---hHHHHHHHHHHHH----H-----HHHHHHHHHHHhcCCChhhcCccccHHHHHH----HH--hhcchhHHHHHHH Confidence 32 1234444333332 1 1222344568999885322 11222211 11 2348888888877 Q ss_pred HHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCccee Q lcl|Aclame:pro 77 IAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIA 156 (708) Q Consensus 77 ~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~ 156 (708) .+-..-+- |+ . +|.+ +..+++.|+++...+.+..++++.|++|+-|..+.. .+.++ T Consensus 63 a~rl~~~G----f~--~--~d~~--------l~~~w~~N~ld~~~~~~~~~al~~G~sf~~v~~~~~--------~~~p~ 118 (422) T protein:vir:97 63 ADRIIFRE----FT--N--DDFN--------AWEIFKANNPDIFFDTAIQSALIASCCFVYIMPGAE--------DGLPK 118 (422) T ss_pred Hhccccce----ee--C--Cchh--------HHHHHHhcChHHHHHHHHHHHHHhcceeEEEeeCCC--------CCeeE Confidence 65221111 11 1 2222 346778899999999999999999999998854311 12333 Q ss_pred eEEeecchhhe--ecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeeeec Q lcl|Aclame:pro 157 IEPIYDPSRSV--WFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVR 234 (708) Q Consensus 157 i~~v~~~~~~v--~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~ 234 (708) |..+ ++.++ +|||....+. +....| +. ..........|| T Consensus 119 i~~~--sp~~~~~i~D~~~~~~~------~a~~~~-~~---------------------------~~~~~~~~~~~~--- 159 (422) T protein:vir:97 119 MQVI--EASKATGILDPTTFLLT------EGYAIL-ES---------------------------DSNGNPTLEAYF--- 159 (422) T ss_pred EEEe--chhhEEEEEeCCCCcce------eeEEEE-Ee---------------------------cCCCcEEEEEEE--- Confidence 4332 23333 4676433211 111000 00 000000001111 Q ss_pred ceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEe Q lcl|Aclame:pro 235 KESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGK 314 (708) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~ 314 (708) .+. .+..+.. .+.. ...+.|++++|+|||+-. T Consensus 160 ---------~~~---~~~~~~~-----------------------------------~~~~-~~~~~~~g~vPvv~~~n~ 191 (422) T protein:vir:97 160 ---------TDK---DIWYYPK-----------------------------------KGKP-YNIKNPTGHPLLVPIIHR 191 (422) T ss_pred ---------cCc---eEEEEcC-----------------------------------CCcc-ccccCCCCCcceEEeccc Confidence 100 0000000 0000 012345577899998754 Q ss_pred eeccCCcccccchH-HhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhcc-ch-HHHHHhhcccCCceeeeccccccc Q lcl|Aclame:pro 315 RWFIDDIERVEGHI-AKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIR-GL-EKHWEARNKKRPAFLPLREVRDKS 391 (708) Q Consensus 315 ~~~~d~~~~~~G~v-r~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~-~~-~~~~~~~~~~~~~~~~~~~~~~~~ 391 (708) +. -+...|.|-| +.+++.|+.+|+.++.++......+.++..+- |.-. +. .+.|. ...+.++.-+ .+.. T Consensus 192 ~~--~~~~~G~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~-G~d~d~~~~~~~~----~~~~~i~~~~-~de~ 263 (422) T protein:vir:97 192 PD--AVRPFGRSRITKAGMYHQKAAKRTLERAEVTAEFYSFPQKYVL-GMDPDAKPMEKWR----ATVSTLLEIS-KDED 263 (422) T ss_pred CC--CccccCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhhhhc-ccCcccccCchhh----hhhhhhhccC-CCCC Confidence 42 2223344434 78999999999999999988888877775551 2111 00 01111 1111111111 1111 Q ss_pred ccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHccccc-c-hhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 392 GNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPS-N-IAQETVNNLMNRADMASFIYLDNMAKSLK 469 (708) Q Consensus 392 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~-n-~sg~ai~~~q~q~~~~~~~~~dn~~~~~~ 469 (708) |. ...++.++...+ ..+...+......+-.+||+.+..+|..+ | +||.||.+....-........+.|..+.+ T Consensus 264 ~~----~~~v~q~~~~~l-~~~~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~ 338 (422) T protein:vir:97 264 GD----KPTVGQFTTASM-APFMEHLKMYASLFAGGSGLTLDDLGFPSDNPSSVESIKAAHENLRAAGRKAQRSFSSGFL 338 (422) T ss_pred CC----cceeeecCCCCh-hHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11 112333333333 34566777777788888999999999644 4 79999998888777777888888888888 Q ss_pred HHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeec---ccchhHHHHHHHH Q lcl|Aclame:pro 470 RAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVG---PSYTARRDATVSV 546 (708) Q Consensus 470 ~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~---~~~~~~r~~~~~~ 546 (708) +++++++.+.-..-+.+ ..+ +++.+.=. |.......+.... T Consensus 339 ~~~rla~~~~~~~~~~~-----------~~~-------------------------~~~~~~w~p~~~~~~~s~a~~aDa 382 (422) T protein:vir:97 339 NVAYIAVCLRDEFPYLR-----------NQF-------------------------MDTVIKWEPLFEADANMLTLVGDG 382 (422) T ss_pred HHHHHHHHHhcCCcccc-----------hhh-------------------------ccceEEEccCCCCChHHHHHHHHH Confidence 88888776643221100 000 11111111 1222234556677 Q ss_pred HHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhhh Q lcl|Aclame:pro 547 LTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLIS 591 (708) Q Consensus 547 l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~ 591 (708) +..|.+.++... ....+++++.+...++-..++.+..... T Consensus 383 ~~Kl~~a~~~~~-----~~~~~~~~lg~~~~~~~~~~~~~~~~d~ 422 (422) T protein:vir:97 383 AIKLNQAIPGFM-----DADVIRDLTGVKGADKPIPAITEVTTDG 422 (422) T ss_pred HHHHHhhccccc-----cHHHHHHHcCCCchhHHHHHHHhhhccC Confidence 887777643322 1234455555544444444443322111 No 94 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=99.57 E-value=6.7e-14 Score=92.82 Aligned_cols=462 Identities=12% Similarity=-0.001 Sum_probs=199.9 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCC----CHHHHHHhhhhhhhcCCCceeecchHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQW----EGATAAGTKLDEQFEKYPKFEINKVATELNRI 76 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw----~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i 76 (708) |... ++++.++...+.. +. .+-++...||+|+|= +......++ .--++.|..+.+|+.. T Consensus 1 ~~t~-~d~i~~L~~~~~~----~~-----~r~~~~~~Yy~G~~~i~~~~~~~~~~~~-------~~~~~~n~~~~ivd~~ 63 (480) T protein:vir:78 1 MTTY-HEHVERLQGLLAR----DL-----PNLLEAEAYRNGTRRLKTIGIGAPPELA-------YLDVQPGWVATYLRTL 63 (480) T ss_pred CCCH-HHHHHHHHHHHHH----HH-----HHHHHHHHHHhccccchhcccccchhhh-------hhhhhcchHHHHHHHH Confidence 7733 3345554443322 11 111233458999861 111001111 0125679999999999 Q ss_pred HHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCccee Q lcl|Aclame:pro 77 IAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIA 156 (708) Q Consensus 77 ~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~ 156 (708) ++...-+ --+.+. |.+. +..+..+++.|+++.+...++.++++.|++|+-|...-..+ .+..+.++ T Consensus 64 ~~~l~~~---g~~~~~----d~~~----~~~l~~i~~~N~~~~~~~~~~~~a~~~G~ay~~v~~~~~~~---~d~~~~~~ 129 (480) T protein:vir:78 64 SDRLDIE---GFRISE----DSEG----LEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVES---GDPAGIPL 129 (480) T ss_pred HhhhccC---ceecCC----Cchh----HHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEeecCcccc---CCCCCeeE Confidence 9976422 112222 2222 34456677889999999999999999999998875321111 12334455 Q ss_pred eEEeecchhh--eecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeeeec Q lcl|Aclame:pro 157 IEPIYDPSRS--VWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVR 234 (708) Q Consensus 157 i~~v~~~~~~--v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~ 234 (708) +..+ ++.. ++|||.... ...+ +++.+...+ +.+.....++|... T Consensus 130 i~~~--~p~~~~~i~D~~~~~----~~~~-~i~~~~~~d---------------------------~~~~~~~~~~y~~~ 175 (480) T protein:vir:78 130 IRVE--SPLYMYAELDPRNTR----RVTR-AVRLYTTRD---------------------------DVAVPDRATLYLPD 175 (480) T ss_pred EEEE--cccceEEEEcCCCcc----ceEE-EEEEEEeec---------------------------CCcceEEEEEEeCC Confidence 5543 2334 357775331 1111 222221110 01111122233221 Q ss_pred ceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEe Q lcl|Aclame:pro 235 KESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGK 314 (708) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~ 314 (708) . ++.|... +.... .+ ....++.+.+++.+|++||... T Consensus 176 ~----~~~~~~~---------~~~~~----------------------~~--------~~~~~~~~~~~g~vPvv~f~n~ 212 (480) T protein:vir:78 176 E----TVPLRRN---------GGLND----------------------QW--------VVDGDVIKHGLGVVPVVPLTND 212 (480) T ss_pred e----EEEEEec---------CCCcc----------------------cc--------cccccccccCCCCcceEEeecc Confidence 1 1111100 00000 00 0011233455667788776533 Q ss_pred eeccCCcccccchHH-hhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHh----hcccCCceeeeccccc Q lcl|Aclame:pro 315 RWFIDDIERVEGHIA-KAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEA----RNKKRPAFLPLREVRD 389 (708) Q Consensus 315 ~~~~d~~~~~~G~vr-~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~----~~~~~~~~~~~~~~~~ 389 (708) + ..+..+|.|-+. .+++.++.+|+.+|.+...+...+.+..++- |...+ ...++ ......+.++.. T Consensus 213 ~--~~~~~~G~sdi~~~i~~l~Da~~~~~s~~~~~~~~~a~p~~~i~-G~~~~--~~~~~~~~~~~~~~~~~~~~~---- 283 (480) T protein:vir:78 213 P--RLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS-GVTTD--ELTNDGENTTLDIYYGRILTL---- 283 (480) T ss_pred c--ccCCccCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchhhhhh-CCCcc--ccccccccchhhhhhhhhccC---- Confidence 2 123334455554 6899999999999999998887776665442 22110 00000 000000111100 Q ss_pred ccccccccccccccccCcc-chHHHHHHHHHHHHHHHHHhCCChhHcccc-cc-hhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 390 KSGNIIAGATPAGYTQPAV-MNQALAALLQQTSADIQEVTGGSQAMQQMP-SN-IAQETVNNLMNRADMASFIYLDNMAK 466 (708) Q Consensus 390 ~~~~~~~~~~~~~~~~~~~-~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~-~n-~sg~ai~~~q~q~~~~~~~~~dn~~~ 466 (708) ++ .-+.+.+.+. -...+...+......+-.++++++...|.. .| +||.|+......-........+.|.. T Consensus 284 ------~~-~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~f~~ 356 (480) T protein:vir:78 284 ------AS-EAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGG 356 (480) T ss_pred ------CC-CCceEEecCccCHHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 00 0112222222 224455666666667777788888887743 34 69999998887777777777777788 Q ss_pred HHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHH Q lcl|Aclame:pro 467 SLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSV 546 (708) Q Consensus 467 ~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~ 546 (708) ++++++++++.+ .... ....+ ++|.|.=.+..+-...+..+. T Consensus 357 ~l~~~~rl~~~~----~~~~---------~~~~~-------------------------~~i~v~w~~~~~~s~~~~ad~ 398 (480) T protein:vir:78 357 AWERAMRIAMQI----MGRE---------VTEEY-------------------------TRLETVWRDPSTPTVAAKADA 398 (480) T ss_pred HHHHHHHHHHHH----cCCC---------ccccc-------------------------eeeeEEecCCCCCCHHHHHHH Confidence 888877765543 2210 00010 111111111111113355666 Q ss_pred HHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 547 LTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQ 626 (708) Q Consensus 547 l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~ 626 (708) +.+|.+.+..... ..++++++.+. ++-.+.+...... +.+ ....+.. ........++ T Consensus 399 ~~kl~~~g~~~~s-----~et~~~~lg~~--~d~~~e~~~~~~~----------~~~-~~~~~~~-----~~~~~~~~~~ 455 (480) T protein:vir:78 399 VSKLYANGQGPIP-----KEQARIDLGYT--ATQREQMRDWDKQ----------ETE-DMIDTLY-----STTKAQADAT 455 (480) T ss_pred HHHHHHhcccCCC-----HHHHHhcCCCC--HhHHHHHHHHHHH----------HHH-HHHHHhh-----ccccCCCccc Confidence 7777665432211 12233333331 1111111110000 000 0000000 0000000000 Q ss_pred HH----HHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 627 MV----AAQAEAQKATNETAQTQIK 647 (708) Q Consensus 627 ~~----~~qae~~k~~~~~~~~q~e 647 (708) .. ....+.+.+....-++... T Consensus 456 ~~~~~~~~~~~~~~~~~~~~~~~~~ 480 (480) T protein:vir:78 456 PKPTVTETKTETQTSPSGFNRTKTR 480 (480) T ss_pred cCCCCCCCCCccCCCcccCCCcCCC Confidence 00 0000000000000000000 No 95 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=99.57 E-value=6e-14 Score=93.06 Aligned_cols=466 Identities=11% Similarity=-0.006 Sum_probs=198.8 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCC----CHHHHHHhhhhhhhcCCCceeecchHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQW----EGATAAGTKLDEQFEKYPKFEINKVATELNRI 76 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw----~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i 76 (708) |. ..++++.++...+.. + +.+ -++.-.||+|+|= +...-..++. --++.|..+.+|+.. T Consensus 1 ~~-t~~~~i~~L~~~~~~----~---~~r--~~~l~~Yy~G~~~i~~~~~~~~~~~~~-------~~~~~n~~~~ivd~~ 63 (480) T protein:vir:78 1 MT-TYHEHVERLQGLLAR----D---LPN--LLEAEAYRNGTRRLKTIGIGAPPELAY-------LDVQPGWVATYLRTL 63 (480) T ss_pred CC-CHHHHHHHHHHHHHH----H---HHH--HHHHHHHHhccccccccccccchhHhh-------hhhhcchHHHHHHHH Confidence 77 334455555444321 1 111 1233468999761 1100001110 125679999999999 Q ss_pred HHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCccee Q lcl|Aclame:pro 77 IAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIA 156 (708) Q Consensus 77 ~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~ 156 (708) ++...-+ -...+ +|.+. ...+..+++.|+++.....++.++++.|++|.-|...-..+. +..+.++ T Consensus 64 ~~~l~~~---g~~~~----~d~~~----~~~l~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~~~~~~---d~~g~~~ 129 (480) T protein:vir:78 64 SDRLDIE---GFRIS----EDSEG----LEELWNWWQANDLDEESVLGHDDSLTFGRSYITVSHPDVESG---DPAGIPL 129 (480) T ss_pred HhhhccC---ceecC----CCchh----HHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEEecCccccC---CCCCeeE Confidence 9876422 22222 22232 234456678899999999999999999999887753321111 2234455 Q ss_pred eEEeecchhhe--ecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeeeec Q lcl|Aclame:pro 157 IEPIYDPSRSV--WFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVR 234 (708) Q Consensus 157 i~~v~~~~~~v--~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~ 234 (708) +..+. +..+ +|||.... . ..+ +++.+.+.+ +...+...++|... T Consensus 130 i~~~~--p~~~~~~~D~~~~~-~---~~~-~i~~~~~~~---------------------------~~~~~~~~~~y~~~ 175 (480) T protein:vir:78 130 IRVES--PLYMYAELDPRNTR-R---VTR-AVRLYTTRD---------------------------DVAVPDRATLYLPD 175 (480) T ss_pred EEEEc--ccceEEEEcCCCcc-c---eEE-EEEEEEeec---------------------------CCCceEEEEEEeCC Confidence 55432 2333 47774321 1 111 122221110 00111122233221 Q ss_pred ceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEe Q lcl|Aclame:pro 235 KESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGK 314 (708) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~ 314 (708) . ++.|... +.... .+ ....++.+.+++.+|++||... T Consensus 176 ~----~~~~~~~---------~~~~~----------------------~~--------~~~~~~~~~~~g~vPvv~f~n~ 212 (480) T protein:vir:78 176 E----TVPLRRN---------GGLND----------------------QW--------VVDGDVIKHGLGVVPVVPLTND 212 (480) T ss_pred e----EEEEEec---------CCCcc----------------------cc--------ccccccccCCCCCcceEEeecc Confidence 1 1111100 00000 00 0001223455667788776533 Q ss_pred eeccCCcccccchHH-hhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhh----cccCCceeeeccccc Q lcl|Aclame:pro 315 RWFIDDIERVEGHIA-KAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEAR----NKKRPAFLPLREVRD 389 (708) Q Consensus 315 ~~~~d~~~~~~G~vr-~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~----~~~~~~~~~~~~~~~ 389 (708) + ..+..+|.|-+. .+++.++.+|+.+|.+...+...+.+..++- |...+ ...++. .....+.++... T Consensus 213 ~--~~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~-G~~~~--~~~~~~~~~~~~~~~~~~~~~~--- 284 (480) T protein:vir:78 213 P--RLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS-GVTTD--ELTNDGENTTLDIYYGRILTLA--- 284 (480) T ss_pred c--ccCCccCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhh-cCCcc--ccccccccchhhhhhhhhccCC--- Confidence 2 123334455554 5899999999999999998877666654442 22110 000000 000011111000 Q ss_pred ccccccccccccccccCcc-chHHHHHHHHHHHHHHHHHhCCChhHccccc-c-hhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 390 KSGNIIAGATPAGYTQPAV-MNQALAALLQQTSADIQEVTGGSQAMQQMPS-N-IAQETVNNLMNRADMASFIYLDNMAK 466 (708) Q Consensus 390 ~~~~~~~~~~~~~~~~~~~-~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~-n-~sg~ai~~~q~q~~~~~~~~~dn~~~ 466 (708) | . -+.+.+.+. -...+...+......+-.++|+++...|..+ | +||.|+......-........+-|.. T Consensus 285 --~----~--~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Alk~~~~~l~~ka~~~~~~f~~ 356 (480) T protein:vir:78 285 --S----E--AAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGG 356 (480) T ss_pred --C----C--CceEEecCccCHHHHHHHHHHHHHHHhcccCCChHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0 0 112222222 1344555666666667677889888888543 3 69999988877766666777777777 Q ss_pred HHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHH Q lcl|Aclame:pro 467 SLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSV 546 (708) Q Consensus 467 ~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~ 546 (708) ++++++++++.+. .. .+..++. +|.|.=.+..+-...+..+. T Consensus 357 ~l~~~~~l~~~~~----g~---------~~~~~~~-------------------------~i~v~f~~~~~~s~~~~ad~ 398 (480) T protein:vir:78 357 AWERAMRIAMQIM----GR---------EVTEEYT-------------------------RLETVWRDPSTPTVAAKADA 398 (480) T ss_pred HHHHHHHHHHHHc----CC---------Cccccce-------------------------eeeEEecCCCCCCHHHHHHH Confidence 7777777655432 21 1111111 11111111111123456667 Q ss_pred HHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 547 LTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQ 626 (708) Q Consensus 547 l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~ 626 (708) +.+|.+.+....+ ...+++++.+. ++-.+++..... ++......+ ........+... T Consensus 399 ~~kl~~~g~~~~s-----~et~~~~lg~~--~d~~~~~~~~~~---------------e~~~~~~~~-~~~~~~~~~~~~ 455 (480) T protein:vir:78 399 VSKLYANGQGPIP-----KEQARIDLGYT--ATQREQMRDWDK---------------QETEDMIDT-LYSTTKAQADAT 455 (480) T ss_pred HHHHHHhccccCC-----HHHHHhcCCCC--HhHHHHHHHHHH---------------HHHHHHHHH-hhccccccCCCC Confidence 7777665432211 12233333221 111222211000 000000000 000000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 627 MVAAQAEAQKATNETAQTQIKAFTAQQD 654 (708) Q Consensus 627 ~~~~qae~~k~~~~~~~~q~e~~~~~~~ 654 (708) ..-...+. ..+.+.+ ....=++++. T Consensus 456 ~~~~~~~~-~~~~~~~--~~~~~~~~~~ 480 (480) T protein:vir:78 456 PKPTVTET-KTETQTS--PSGFNRTKTR 480 (480) T ss_pred CCCCCCCC-CCccccc--cCCCCcccCC Confidence 00000000 0000000 0000000000 No 96 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=99.57 E-value=1e-12 Score=86.37 Aligned_cols=478 Identities=10% Similarity=0.010 Sum_probs=233.5 Q ss_pred CC--cchHHHHHHHHHHH--HHHH---------HhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeec Q lcl|Aclame:pro 1 MA--ETLEKKHERIMLRF--DRAY---------SPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEIN 67 (708) Q Consensus 1 ma--~~~~~~~~~~~~~~--~~~~---------~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N 67 (708) |. +..+..+++++.+. .... ..-.+.+......+. +|.|+.. .. ......+....+..++.| T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~i~d~~~i~~~~~~~~~i~~~~~--~Y~g~~~-~l--~~~~~~~~~~~~~~~sln 75 (505) T protein:vir:79 1 MAFWDTLKNLFRKGSAAVGMTKSLGQIIDDPRINLPADEVERIARDKR--YYMDDFK-QV--THKNSYGDTQKHELQSVN 75 (505) T ss_pred CchHHHHHHHHHHhhhhhcchhhhhhhhcccCCCCCHHHHHHHHHHHH--HhcCCCc-cc--cccccCCCccccceeecc Confidence 43 34444444432221 1111 111234444433443 4667532 11 111011111123456678 Q ss_pred chHHHHHHHHHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCC Q lcl|Aclame:pro 68 KVATELNRIIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYD 147 (708) Q Consensus 68 ~i~~~i~~i~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d 147 (708) +-+.+++...+...+..+.+.+ + |....+ .+..+.+.|++......+++.++..|.||+++.++. T Consensus 76 l~~~i~~~~A~ll~~e~~~i~~---~---d~~~~e----~l~~i~~~n~f~~~~~~~~e~a~a~G~~~~k~~~D~----- 140 (505) T protein:vir:79 76 VTKLASAKLASLIFNEQCQVTV---S---DETAND----FLDDVFQQNDFYTTFEEKLEEWIALGSGCVRPYVDS----- 140 (505) T ss_pred hHHHHHHHHHhhhcCCCceeec---C---ChHHHH----HHHHHHHhccHHHHHHHHHHHHhhcCCeEEEEEEeC----- Confidence 8899999999999888888876 1 233344 455555689999999999999999999999998751 Q ss_pred CCCCCcceeeEEeecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEE Q lcl|Aclame:pro 148 PMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYI 227 (708) Q Consensus 148 ~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v 227 (708) +.++|..+ +...||+=. ....+..++-|+.+....+.. ...-.+. T Consensus 141 -----~~~~i~~v--~ad~~~P~~-~d~~~~~~~a~~~~~~~~~~~---------------------------~~~~yt~ 185 (505) T protein:vir:79 141 -----GKIKLAWA--TADQVYPLQ-ADTNQVNELAIASRTTEVENH---------------------------RTIYYTL 185 (505) T ss_pred -----CceEEEEE--cCCeeEEEE-EcCCCeEEEEEEEEEEEecCC---------------------------cceEEEE Confidence 23445444 345555200 011134444433322111100 0011223 Q ss_pred eeeeeecceEEE-EEEEecCcc----CceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCC Q lcl|Aclame:pro 228 AKYYEVRKESVD-VISYRHPIT----GEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIP 302 (708) Q Consensus 228 ~e~~~~~~~~~~-~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p 302 (708) .|+|+....... .+.++...+ |..+.+. .+.. |..+.+...+.+....+ T Consensus 186 lE~h~~~~~~~~I~n~ly~~~~~~~lG~~v~l~--~~~~------------------------~~~l~~~~~~~g~~~p~ 239 (505) T protein:vir:79 186 LEFHQWDHGDYVITNELYRSEAAETVGINVPLN--SLEQ------------------------YEGLEPQVKITGLKHPL 239 (505) T ss_pred EEEEEecCceEEEEEEEEecCCCCccCcccchh--hccc------------------------ccccCcceeecCCCcce Confidence 344432211111 111111101 1100000 0000 00000000111101111 Q ss_pred CCCcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHh---hcc--c Q lcl|Aclame:pro 303 GEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEA---RNK--K 377 (708) Q Consensus 303 ~~~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~---~~~--~ 377 (708) +.+||. |. ......+++.|.|++.++++..+.+|.+.|++.+.+-+. +.++.++...+.....--.. ... - T Consensus 240 f~~~~~-~~--~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g-~~~i~v~~~~l~~~~~~~~~~~~~~~~~f 315 (505) T protein:vir:79 240 FAFYRN-KG--ANNKNFTSPMGMSLIDNSYTVIDAINRTHDQFVDEVKKG-QRRLIVPAEWLKTGSSYGGQASETHPPMF 315 (505) T ss_pred EEEecC-Cc--ccccccCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhc-ccceeechHHhcccCCCCcccccccccCC Confidence 222221 11 111134566788999999999999999999999999764 55677776665321110000 000 0 Q ss_pred CCceeeecccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccc--hhHHHHHHHHHHHHH Q lcl|Aclame:pro 378 RPAFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSN--IAQETVNNLMNRADM 455 (708) Q Consensus 378 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n--~sg~ai~~~q~q~~~ 455 (708) ......+....... +...+...++.-...++...++.....+....|++....|..++ .||++|....+.... T Consensus 316 d~~~~~y~~~~~~~-----~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~TAtei~s~~~~l~~ 390 (505) T protein:vir:79 316 DPDETVYQAMYGDA-----SEVGFHDATSPIRVADYQATMDFFLREFENQTGLSQGTFTTSPSGIQTATEVVTNNSQTYQ 390 (505) T ss_pred CccceeeeeccCCC-----CCCceEEecccCCHHHHHHHHHHHHHHHHHHhCCChhhcCCCccccchHHHHHHHHhHHHH Confidence 00001111111111 11234455554444667888888888899999999998886543 488888888888888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCC--CcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeec Q lcl|Aclame:pro 456 ASFIYLDNMAKSLKRAGEVWLSMAREVYGS--EREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVG 533 (708) Q Consensus 456 ~~~~~~dn~~~~~~~~~~~~l~li~~~y~~--~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~ 533 (708) ....+...+..+++++.+.++.+..-|.-. ..+.. .+ -...++|+|+=+ T Consensus 391 t~~~~~~~~~~al~~li~~i~~~~~~~~~~~~g~~~~-~~----------------------------~~~~~~i~v~f~ 441 (505) T protein:vir:79 391 TRSSYITQVEKTIKALTYAILELASVPSFYADGQARW-TG----------------------------DVDSLDITINFN 441 (505) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccc-cC----------------------------CCCceeEEEEeC Confidence 888888899999999999999887666521 11110 00 011356666666 Q ss_pred ccchhHHHHHHHHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhhhhcccCcchHHHHH Q lcl|Aclame:pro 534 PSYTARRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQI 604 (708) Q Consensus 534 ~~~~~~r~~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~ 604 (708) .+-...+++..+.++++.+.+. . + .-.+.+.....+=.-+++..++++.... ...|+...--.. T Consensus 442 d~i~~d~~~~~~~~~~~v~~Gi-~-s--~e~~l~~~~~~~eeea~~el~ri~~E~~---~~~p~~~~~gg~ 505 (505) T protein:vir:79 442 DGVFVDQESKRAADLQAVQAQV-M-P--KKQFLMRNYGLDEEEADEWLAQIDAENS---TAEPEFNQFGGD 505 (505) T ss_pred CCCCCCHHHHHHHHHHHHHcCC-C-C--HHHHHHhcCCCChHHHHHHHHHHHHhcc---ccCCCchhccCC Confidence 5666666677777777765432 1 1 1111111111111223344445543321 111111000000 No 97 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=99.56 E-value=8.2e-14 Score=92.32 Aligned_cols=446 Identities=10% Similarity=0.025 Sum_probs=201.9 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhh--hhhhcCCC--ceeecchHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKL--DEQFEKYP--KFEINKVATELNRI 76 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~--~~q~~grp--~~~~N~i~~~i~~i 76 (708) -++..-.+..+++..| .+.+...+.+..+ -..||.|+| +-.-+..... ......+| .++.|..+.+|+.. T Consensus 20 ~~~~~~~~~~~~i~~~---i~~~~~~~~~~~~--l~~Yy~g~~-~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~ 93 (474) T protein:vir:96 20 QMKPKVETQEEMIIRL---INNHKQKLKDINV--GQKYYDKDN-DINYQAYKQDLHGNIDYTKPDWRITTNFHQNLVDQK 93 (474) T ss_pred hccccccchHHHHHHH---HHHHHHHHHHHHH--HHHHhcccC-ccccccchhhhcccccccccccccccchHHHHHHhh Confidence 2222222233333333 2333332223322 245888976 1111100000 00001122 46789999999999 Q ss_pred HHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCccee Q lcl|Aclame:pro 77 IAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIA 156 (708) Q Consensus 77 ~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~ 156 (708) +|+...+.+.+.+ + +.+..+.+ ..+. .++++.....+..+++++|.||..+..+. .+.++ T Consensus 94 ~~yl~g~p~~~~~-----~-~~~~~~~l----~~~~-~n~~~~~~~~l~~~~~~~G~~~~~~~~d~---------~~~~~ 153 (474) T protein:vir:96 94 VSYVAGKPVTYAH-----D-DDKVLDVI----HQVL-DTRWDNKLIDILTAASNKGIDWLQVYINE---------DGELK 153 (474) T ss_pred hhhhcccCceecc-----C-ChHHHHHH----HHHH-hccHHHHHHHHHHHHhhCCeEEEEeeeCC---------CCceE Confidence 9999998877653 1 22223333 3333 37899999999999999999998876431 12344 Q ss_pred eEEeecchhhee--cCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeeeec Q lcl|Aclame:pro 157 IEPIYDPSRSVW--FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVR 234 (708) Q Consensus 157 i~~v~~~~~~v~--~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~ 234 (708) +..+ ++.++| ||+... .+. ..+++.|... .....++|... T Consensus 154 i~~~--~p~~~~~v~d~~~~----~~~-~a~ir~~~~~-------------------------------~~~~~~vy~~~ 195 (474) T protein:vir:96 154 LFRV--PAEQAIPIWTDKER----EQL-NAFIRIFTFN-------------------------------GETKVEYWTAE 195 (474) T ss_pred EEEE--cccceEEEEcCCCC----Cce-EEEEEEEeec-------------------------------CeeEEEEEeCC Confidence 4433 334454 443221 122 2333333210 00112344332 Q ss_pred ceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEe Q lcl|Aclame:pro 235 KESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGK 314 (708) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~ 314 (708) +.. .|.....+..... ............+.+++.+|+++|.- T Consensus 196 ~i~----~~~~~~~~~~~~~---------------------------------~~~~~~~~~~~~~~~~~~vPvv~~~n- 237 (474) T protein:vir:96 196 TVT----YYVYENGGLIPDF---------------------------------YYGDEHIQTHFSTGSWERVPFIAFKN- 237 (474) T ss_pred eEE----EEEEcCCceeecc---------------------------------ccccccccCcccccCCCccceEEecC- Confidence 221 1111100000000 00111111223345556666665531 Q ss_pred eeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhc-cchHHHHHhhcccCCceeeeccccccccc Q lcl|Aclame:pro 315 RWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQI-RGLEKHWEARNKKRPAFLPLREVRDKSGN 393 (708) Q Consensus 315 ~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 393 (708) ...+.|.+..+++.++.+|...|.+.+.+...+.+.+++ .|.. ++..+... ......++.... . T Consensus 238 ------n~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~-~g~~~~~~~~~~~--~~~~~~~i~~~~----~-- 302 (474) T protein:vir:96 238 ------NPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYIL-RGYEGEDLSEFME--GLKYYKAINVSS----D-- 302 (474) T ss_pred ------CCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhh-cCCCcccccchhh--hhhccceeeccC----C-- Confidence 234678999999999999999999999999888876654 2321 11111111 011112222111 1 Q ss_pred ccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHccc-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 394 IIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM-PSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAG 472 (708) Q Consensus 394 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~-~~n~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~~ 472 (708) ....+...+.-..+....+......|-..|++.+.+.+. .+|.||.|+..+..............|..+++++. T Consensus 303 -----~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~ 377 (474) T protein:vir:96 303 -----GGVETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTDKFGSATSGIALKFLYTNLNLKANKLKNKANVALQELM 377 (474) T ss_pred -----CceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 112333333334677778888899999999987665443 46789999998877777777777777777777777 Q ss_pred HHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHHH Q lcl|Aclame:pro 473 EVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLS 552 (708) Q Consensus 473 ~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq 552 (708) ++++.+ .. ... ++ .+|.|.=.+..+..-.+..+.+.+ T Consensus 378 ~~i~~~----~g---------~~~--d~-------------------------~~i~i~f~~~~p~~~~e~a~~~~~--- 414 (474) T protein:vir:96 378 QFILDF----NK---------IKL--DA-------------------------KEIEITFNFNVMVNDLEQSQIGAQ--- 414 (474) T ss_pred HHHHHH----hC---------CCc--cc-------------------------ceeeEEecCCCccCHHHHHHHHHH--- Confidence 766554 21 100 00 011122123333322233332222 Q ss_pred hccccCchhHHHHHHHHhhccc-hhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 553 SMLPTDPMRPAIQGIILDNIDG-EGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQ 631 (708) Q Consensus 553 ~~~~~~p~~~~~~~~~~~~~d~-~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~q 631 (708) .+-. ....++..+.+ ...++..++++... ....+..+..............+ T Consensus 415 -~gii------S~et~~~~lp~v~D~~~E~eri~~E~--------------------~~~~~~~~~~~~~~~~~~~~~~~ 467 (474) T protein:vir:96 415 -SQYL------SKETLVRHHPWVDDPKAELERLDEEQ--------------------LELNKQLPNLDDGGADGAQQQQQ 467 (474) T ss_pred -cCCC------ChHHHHHhCCCCCCHHHHHHHHHHHH--------------------HHHHhhccccccccCCCCCCcCC Confidence 1111 11222333221 11111112221100 00000000000000000000000 Q ss_pred HHHHHHH Q lcl|Aclame:pro 632 AEAQKAT 638 (708) Q Consensus 632 ae~~k~~ 638 (708) .+..+.+ T Consensus 468 ~~~~e~~ 474 (474) T protein:vir:96 468 SENNQSK 474 (474) T ss_pred CCccccC Confidence 0000000 No 98 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=99.56 E-value=8.2e-14 Score=92.32 Aligned_cols=446 Identities=10% Similarity=0.025 Sum_probs=201.9 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhh--hhhhcCCC--ceeecchHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKL--DEQFEKYP--KFEINKVATELNRI 76 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~--~~q~~grp--~~~~N~i~~~i~~i 76 (708) -++..-.+..+++..| .+.+...+.+..+ -..||.|+| +-.-+..... ......+| .++.|..+.+|+.. T Consensus 20 ~~~~~~~~~~~~i~~~---i~~~~~~~~~~~~--l~~Yy~g~~-~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~ 93 (474) T protein:vir:95 20 QMKPKVETQEEMIIRL---INNHKQKLKDINV--GQKYYDKDN-DINYQAYKQDLHGNIDYTKPDWRITTNFHQNLVDQK 93 (474) T ss_pred hccccccchHHHHHHH---HHHHHHHHHHHHH--HHHHhcccC-ccccccchhhhcccccccccccccccchHHHHHHhh Confidence 2222222233333333 2333332223322 245888976 1111100000 00001122 46789999999999 Q ss_pred HHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCccee Q lcl|Aclame:pro 77 IAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIA 156 (708) Q Consensus 77 ~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~ 156 (708) +|+...+.+.+.+ + +.+..+.+ ..+. .++++.....+..+++++|.||..+..+. .+.++ T Consensus 94 ~~yl~g~p~~~~~-----~-~~~~~~~l----~~~~-~n~~~~~~~~l~~~~~~~G~~~~~~~~d~---------~~~~~ 153 (474) T protein:vir:95 94 VSYVAGKPVTYAH-----D-DDKVLDVI----HQVL-DTRWDNKLIDILTAASNKGIDWLQVYINE---------DGELK 153 (474) T ss_pred hhhhcccCceecc-----C-ChHHHHHH----HHHH-hccHHHHHHHHHHHHhhCCeEEEEeeeCC---------CCceE Confidence 9999998877653 1 22223333 3333 37899999999999999999998876431 12344 Q ss_pred eEEeecchhhee--cCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeeeec Q lcl|Aclame:pro 157 IEPIYDPSRSVW--FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVR 234 (708) Q Consensus 157 i~~v~~~~~~v~--~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~ 234 (708) +..+ ++.++| ||+... .+. ..+++.|... .....++|... T Consensus 154 i~~~--~p~~~~~v~d~~~~----~~~-~a~ir~~~~~-------------------------------~~~~~~vy~~~ 195 (474) T protein:vir:95 154 LFRV--PAEQAIPIWTDKER----EQL-NAFIRIFTFN-------------------------------GETKVEYWTAE 195 (474) T ss_pred EEEE--cccceEEEEcCCCC----Cce-EEEEEEEeec-------------------------------CeeEEEEEeCC Confidence 4433 334454 443221 122 2333333210 00112344332 Q ss_pred ceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEe Q lcl|Aclame:pro 235 KESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGK 314 (708) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~ 314 (708) +.. .|.....+..... ............+.+++.+|+++|.- T Consensus 196 ~i~----~~~~~~~~~~~~~---------------------------------~~~~~~~~~~~~~~~~~~vPvv~~~n- 237 (474) T protein:vir:95 196 TVT----YYVYENGGLIPDF---------------------------------YYGDEHIQTHFSTGSWERVPFIAFKN- 237 (474) T ss_pred eEE----EEEEcCCceeecc---------------------------------ccccccccCcccccCCCccceEEecC- Confidence 221 1111100000000 00111111223345556666665531 Q ss_pred eeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhc-cchHHHHHhhcccCCceeeeccccccccc Q lcl|Aclame:pro 315 RWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQI-RGLEKHWEARNKKRPAFLPLREVRDKSGN 393 (708) Q Consensus 315 ~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 393 (708) ...+.|.+..+++.++.+|...|.+.+.+...+.+.+++ .|.. ++..+... ......++.... . T Consensus 238 ------n~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~-~g~~~~~~~~~~~--~~~~~~~i~~~~----~-- 302 (474) T protein:vir:95 238 ------NPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYIL-RGYEGEDLSEFME--GLKYYKAINVSS----D-- 302 (474) T ss_pred ------CCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhh-cCCCcccccchhh--hhhccceeeccC----C-- Confidence 234678999999999999999999999999888876654 2321 11111111 011112222111 1 Q ss_pred ccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHccc-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 394 IIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM-PSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAG 472 (708) Q Consensus 394 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~-~~n~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~~ 472 (708) ....+...+.-..+....+......|-..|++.+.+.+. .+|.||.|+..+..............|..+++++. T Consensus 303 -----~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~ 377 (474) T protein:vir:95 303 -----GGVETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTDKFGSATSGIALKFLYTNLNLKANKLKNKANVALQELM 377 (474) T ss_pred -----CceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 112333333334677778888899999999987665443 46789999998877777777777777777777777 Q ss_pred HHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHHH Q lcl|Aclame:pro 473 EVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLS 552 (708) Q Consensus 473 ~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq 552 (708) ++++.+ .. ... ++ .+|.|.=.+..+..-.+..+.+.+ T Consensus 378 ~~i~~~----~g---------~~~--d~-------------------------~~i~i~f~~~~p~~~~e~a~~~~~--- 414 (474) T protein:vir:95 378 QFILDF----NK---------IKL--DA-------------------------KEIEITFNFNVMVNDLEQSQIGAQ--- 414 (474) T ss_pred HHHHHH----hC---------CCc--cc-------------------------ceeeEEecCCCccCHHHHHHHHHH--- Confidence 766554 21 100 00 011122123333322233332222 Q ss_pred hccccCchhHHHHHHHHhhccc-hhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 553 SMLPTDPMRPAIQGIILDNIDG-EGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQ 631 (708) Q Consensus 553 ~~~~~~p~~~~~~~~~~~~~d~-~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~q 631 (708) .+-. ....++..+.+ ...++..++++... ....+..+..............+ T Consensus 415 -~gii------S~et~~~~lp~v~D~~~E~eri~~E~--------------------~~~~~~~~~~~~~~~~~~~~~~~ 467 (474) T protein:vir:95 415 -SQYL------SKETLVRHHPWVDDPKAELERLDEEQ--------------------LELNKQLPNLDDGGADGAQQQQQ 467 (474) T ss_pred -cCCC------ChHHHHHhCCCCCCHHHHHHHHHHHH--------------------HHHHhhccccccccCCCCCCcCC Confidence 1111 11222333221 11111112221100 00000000000000000000000 Q ss_pred HHHHHHH Q lcl|Aclame:pro 632 AEAQKAT 638 (708) Q Consensus 632 ae~~k~~ 638 (708) .+..+.+ T Consensus 468 ~~~~e~~ 474 (474) T protein:vir:95 468 SENNQSK 474 (474) T ss_pred CCccccC Confidence 0000000 No 99 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=99.56 E-value=1.3e-13 Score=91.30 Aligned_cols=399 Identities=11% Similarity=0.002 Sum_probs=203.4 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHH----HHHHhhhhhhhcCCCceeecchHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGA----TAAGTKLDEQFEKYPKFEINKVATELNRI 76 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~----~~~~l~~~~q~~grp~~~~N~i~~~i~~i 76 (708) |= .+.+.++...+.. . ..+-++...||+|+|.-.. .-..++.. .+ ++.|..+.+|+++ T Consensus 1 ~~---~~~i~~L~~~~~~----~-----~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~----~~--~v~nw~~~iVds~ 62 (409) T protein:vir:94 1 MT---EKGIGYLRFKLSV----H-----KRRAEMRYDQYAMKYVDRFKGITIPQALSQQ----YR--SILGWCAKGVDSL 62 (409) T ss_pred CC---HHHHHHHHHHHHH----H-----hHHHHHHHHHhcccCchhhcChhhhHHHHHH----Hh--hhcchhHHHHHHh Confidence 32 2345555444332 1 1222344568999985322 11222111 11 3458999999988 Q ss_pred HHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCccee Q lcl|Aclame:pro 77 IAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIA 156 (708) Q Consensus 77 ~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~ 156 (708) .+-..-+- | +. +|.+ +..+++.|+++...+.+..++++.|++|+-|.-+ .++.++ T Consensus 63 a~rl~~~G----f--~~--~d~~--------l~~i~~~N~ld~~~~~~~~~aliyG~sf~~v~~~---------~dg~~~ 117 (409) T protein:vir:94 63 ADRLVFRE----F--EN--DDFT--------VNEIFEENNPDIFFDSAVLSSLIASCSFTYISKG---------ENDAVR 117 (409) T ss_pred HhhcccCc----c--cC--CchH--------HHHHHHhcChhHHHHHHHHHHHHhcceeEEEecC---------CCCceE Confidence 66332221 1 11 2322 4567889999999999999999999999888532 123344 Q ss_pred eEEeecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeeeecce Q lcl|Aclame:pro 157 IEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVRKE 236 (708) Q Consensus 157 i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~~~ 236 (708) |..+....--++|||..+++- .+.+.|-+ +. ...... ..+|... T Consensus 118 i~~~sp~~~~~i~D~~~~~~~------~a~~~~~~--------------------------d~-~~~~~~-~~~~~~~-- 161 (409) T protein:vir:94 118 LQVIEAVNATGIIDPITGLLT------EGYAVLER--------------------------DE-NNNVVL-EAHFLPD-- 161 (409) T ss_pred EEEeccceEEEEEecCCCcee------eeEEEEEe--------------------------cC-CCceEE-EEEEecC-- Confidence 443321112235777433211 11111100 00 000001 1111100 Q ss_pred EEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEeee Q lcl|Aclame:pro 237 SVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRW 316 (708) Q Consensus 237 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~~~ 316 (708) +++.+.. ..+.. ...+.|++++|+|||+-.+. T Consensus 162 -------------~~~~~~~----------------------------------~~~~~-~~~~n~~g~vPvV~f~n~~~ 193 (409) T protein:vir:94 162 -------------RTDYYYR----------------------------------DSRNN-ISIANPTGHPLLVPIIHRPD 193 (409) T ss_pred -------------cEEEEEe----------------------------------cCcee-EeeeCCCCCcceEEeccccc Confidence 0000000 00000 11245667889999875432 Q ss_pred ccCCcccccch-HHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccch--HHHHHhhcccCCceeeeccccccccc Q lcl|Aclame:pro 317 FIDDIERVEGH-IAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGL--EKHWEARNKKRPAFLPLREVRDKSGN 393 (708) Q Consensus 317 ~~d~~~~~~G~-vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~ 393 (708) -+...|.|- .+.+++.|+.+|+.++.++...-..+.++..+- |.-++. .+.|... .+.++.-+. +..|. T Consensus 194 --~~~~~G~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~-G~d~d~~~~~~~~~~----~~~i~~~~~-d~dg~ 265 (409) T protein:vir:94 194 --AVRPFGRSRITRSGMYWQSNAKRTLERADVTAEFYSFPQKYVT-GLSDDAEPMETWKAT----VSSMLQFTK-DEDGD 265 (409) T ss_pred --cccccCccccchhHHHHHHHHHHHHHHHHHHHHHhcChhheeE-ecCCCCcccchhhhh----HHHhhcCCC-CCCCC Confidence 222334443 378999999999999999988888777765551 211110 0112110 011111110 11111 Q ss_pred ccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccc-cc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 394 IIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMP-SN-IAQETVNNLMNRADMASFIYLDNMAKSLKRA 471 (708) Q Consensus 394 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~-~n-~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~ 471 (708) ...++.++...+ +.+...+......+-.+||+++..+|.. .| +||.||.+....-........+.|..+.+++ T Consensus 266 ----~~~v~q~~~~~l-~~~~~~l~~~~~~~a~~t~lP~~~lg~~~~NpsSa~Al~a~~~~L~~~a~~k~~~fg~~~~~~ 340 (409) T protein:vir:94 266 ----KPTLGQFTQPSM-SPFTEQLRTAAAGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNV 340 (409) T ss_pred ----CceEEecCCCCh-hHHHHHHHHHHHHHhhhcCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 112333333333 3456777777788888899999999954 44 7999999888777777777777788888888 Q ss_pred HHHHHHHHHHhcCCC-cEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHH Q lcl|Aclame:pro 472 GEVWLSMAREVYGSE-REVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNV 550 (708) Q Consensus 472 ~~~~l~li~~~y~~~-r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~l 550 (708) +++.+.+.-.+-..+ .+.++. ..|-. .-+.......+..+.+..| T Consensus 341 ~rla~~i~~~~~~~~~~~~~~~-----v~W~p-----------------------------~~~~~~~~~a~~aDa~~Kl 386 (409) T protein:vir:94 341 AYLAACLRDDAPYLREQFRKTK-----PKWEP-----------------------------LFEADASMLSLIGDGAIKL 386 (409) T ss_pred HHHHHHHhCCCCccccccccce-----EEecc-----------------------------CCCcchHHHHHHHHHHHHH Confidence 888777644332100 111110 01110 0012222345667888888 Q ss_pred HHhccccCchhHHHHHHHHhhccchhHH Q lcl|Aclame:pro 551 LSSMLPTDPMRPAIQGIILDNIDGEGLD 578 (708) Q Consensus 551 lq~~~~~~p~~~~~~~~~~~~~d~~~~~ 578 (708) .+.+++..+ ....++++-+...+ T Consensus 387 ~~ag~~~~~-----~~~~~~~lG~~~~d 409 (409) T protein:vir:94 387 NQAIPEFIN-----KDTIRDLTGIEGGE 409 (409) T ss_pred HHhcccccc-----hhHHHHHcCCCCCC Confidence 887654322 12334444443333 No 100 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=99.56 E-value=2.3e-13 Score=89.90 Aligned_cols=466 Identities=11% Similarity=-0.028 Sum_probs=196.4 Q ss_pred CCcch--HHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHHH Q lcl|Aclame:pro 1 MAETL--EKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIA 78 (708) Q Consensus 1 ma~~~--~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~g 78 (708) |.|.. .+++.++...+.. .. .+.++...||+|+|= -..+...... .. .+--++.|..+-+|+..++ T Consensus 8 ~~e~~~~~~~~~~l~~~~~~----~~-----~r~~~l~~YY~G~~~-i~~~~~~~~~-~~-~~~~~v~n~~~~iVd~~~~ 75 (486) T protein:vir:42 8 MEEIEDPAVVREEMISAFED----AS-----KDLASNTSYYDAERR-PEAIGVTVPR-EM-QQLLAHVGYPRLYVDSVAE 75 (486) T ss_pred CCCcccHHHHHHHHHHHHHH----HH-----HHHHHHHHHhcccCc-chhcccccch-hH-hhhhhccchHHHHHHHHHh Confidence 44322 2234444333322 21 111222358999871 0000000000 00 0112356999999998888 Q ss_pred HHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCcceeeE Q lcl|Aclame:pro 79 EYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAIE 158 (708) Q Consensus 79 ~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~i~ 158 (708) ...-+- +. .|. +.+.. ..+..++..|+++.....+..++++.|++|.-|..+.... .-....+.+++. T Consensus 76 ~l~~~g--~~-~~~----~~~~~----~~~~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~e~~~-~~~~~~~~~~i~ 143 (486) T protein:vir:42 76 RQAVEG--FR-LGD----ADEAD----EELWQWWQANNLDIEAPLGYTDAYVHGRSFITISKPDPQL-DLGWDQNVPIIR 143 (486) T ss_pred hhcccc--ee-cCC----CchhH----HHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCccc-ccccCCCeeEEE Confidence 663221 11 121 12222 2345566789999999999999999999998886543211 112233444444 Q ss_pred Eeecchhhe--ecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeeeecce Q lcl|Aclame:pro 159 PIYDPSRSV--WFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVRKE 236 (708) Q Consensus 159 ~v~~~~~~v--~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~~~ 236 (708) .+ ++.++ +|||...+ ..++.+.+-+. +.+.+...++|.... T Consensus 144 ~~--~p~~~~~i~d~~~~~------~~~~~~~~~~~----------------------------~~~~~~~~~~y~~~~- 186 (486) T protein:vir:42 144 VE--PPTRMHAEIDPRINR------VSKAIRVAYDK----------------------------EGNEIQAATLYTPME- 186 (486) T ss_pred Ee--cccceEEEEeCCCCC------eEEEEEEEEec----------------------------CCCeEEEEEEEcCCc- Confidence 33 23333 47764321 11222222100 011122233333211 Q ss_pred EEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEeee Q lcl|Aclame:pro 237 SVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRW 316 (708) Q Consensus 237 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~~~ 316 (708) +++|.. .+|.-..++..+.+++.+|+|||.-.+ T Consensus 187 ---~~~~~~-------------------------------------------~~~~~~~~~~~~h~~g~vPvv~~~n~~- 219 (486) T protein:vir:42 187 ---TIGWFR-------------------------------------------ADGEWAEWFNVPHGLGVVPVVPLPNRT- 219 (486) T ss_pred ---EEEEEe-------------------------------------------cCCcEEeecceecCCCCceEEEecccc- Confidence 111110 001111223344556677777765322 Q ss_pred ccCCcccccchHH-hhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHH----hhcccCCceeeeccccccc Q lcl|Aclame:pro 317 FIDDIERVEGHIA-KAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWE----ARNKKRPAFLPLREVRDKS 391 (708) Q Consensus 317 ~~d~~~~~~G~vr-~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~----~~~~~~~~~~~~~~~~~~~ 391 (708) ..+...|.|-+. .+++.++.+|+.+|.+...+...+.+..++- |.........+ .......+.++.... T Consensus 220 -~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~e~~a~p~~~i~-G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---- 293 (486) T protein:vir:42 220 -RLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIF-GIKPEEIGVDSETGQTLFDAYLARILAFED---- 293 (486) T ss_pred -ccCCCCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchHHHhh-cCCccccccccccccchhhhhhchhcccCC---- Confidence 123334556565 6889999999999999888776666554432 11100000000 000011111111110 Q ss_pred ccccccccccccccCcc-chHHHHHHHHHHHHHHHHHhCCChhHccc-ccc-hhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 392 GNIIAGATPAGYTQPAV-MNQALAALLQQTSADIQEVTGGSQAMQQM-PSN-IAQETVNNLMNRADMASFIYLDNMAKSL 468 (708) Q Consensus 392 ~~~~~~~~~~~~~~~~~-~~~~~~~l~~~~~~~~~~~tGv~~~~~G~-~~n-~sg~ai~~~q~q~~~~~~~~~dn~~~~~ 468 (708) +. ..+ .+.+. -...+...+......+-.++++++...|. ..| +||.|+.................|..++ T Consensus 294 ----~~-~~~--~q~~~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l 366 (486) T protein:vir:42 294 ----AE-GKI--QQFSAAELANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNLMFGGAW 366 (486) T ss_pred ----CC-ceE--EeecccCHHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 00 112 22222 12334444455455555557788877774 334 7999999988888777778888888888 Q ss_pred HHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHH Q lcl|Aclame:pro 469 KRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLT 548 (708) Q Consensus 469 ~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~ 548 (708) +++.++++.+.-. - + ....+ .+|.|.=.+..+....+..+.+. T Consensus 367 ~~~~~l~~~~~~~-~---------~--~~~d~-------------------------~~i~v~w~~~~~~s~~~~ad~~~ 409 (486) T protein:vir:42 367 EEAMRIAYRIMKG-G---------D--VPPDM-------------------------LRMETVWRDPSTPTYAAKADAAT 409 (486) T ss_pred HHHHHHHHHHhcC-C---------C--ccccc-------------------------eeeeEEecCCCCCCHHHHHHHHH Confidence 8888877664310 0 0 00000 11222222222333445667777 Q ss_pred HHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhhhhcccCcchHHHHHHHH-HHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 549 NVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQ-AQMAAQSQPNPEMVLAQAQM 627 (708) Q Consensus 549 ~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~-~qq~qq~~~~~~~~~aq~~~ 627 (708) .|.+.+....+ ..++++++.+ .++..+.+++... ++....+. ..+............+..+. T Consensus 410 kl~~~~~g~~s-----~et~~~~lg~--~~d~~~e~~~~~~----------e~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 472 (486) T protein:vir:42 410 KLYGNGQGVIP-----RERARIDMGY--SVKEREEMRRWDE----------EEAAMGLGLLGTMVDADPTVPGSPSPTAP 472 (486) T ss_pred HHHhcccCCCC-----HHHHHhcCCC--ChhHHHHHHHHHH----------HHHHHHHHHHHHhhcCCCCCCCCCCCCCC Confidence 77665322211 1122333332 1111111211100 00000000 00000000000000000000 Q ss_pred HHHHHHHHHHHHHH Q lcl|Aclame:pro 628 VAAQAEAQKATNET 641 (708) Q Consensus 628 ~~~qae~~k~~~~~ 641 (708) ...+..+..+..+. T Consensus 473 ~~~~~~~~~~~~~~ 486 (486) T protein:vir:42 473 PKPQPAIESSGGDA 486 (486) T ss_pred CCCCcccCCCCCCC Confidence 00000000000000 No 101 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=99.55 E-value=4.8e-14 Score=93.58 Aligned_cols=464 Identities=11% Similarity=0.033 Sum_probs=198.5 Q ss_pred CCcchHHHH--HHHHHHHH-HHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhh-hhhcCCCceeecchHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKH--ERIMLRFD-RAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLD-EQFEKYPKFEINKVATELNRI 76 (708) Q Consensus 1 ma~~~~~~~--~~~~~~~~-~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~-~q~~grp~~~~N~i~~~i~~i 76 (708) |-+.+++.+ +.+.+.+. +....+.... -+-++...||.|++.-.......... ... ..-.++.|..+.+|+.. T Consensus 1 ~~~~p~~~l~~~~~~~~~~~~l~~~~~~~~--~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~-~~~~~~~n~~~~iVd~~ 77 (479) T protein:vir:99 1 MIDLPDEDLSSEGLAKYLETKVFPKMNTEC--ERLDDFEAWTKNGQEVPDLATRHKNKEREV-LQQLSRKPWMGLMVNSF 77 (479) T ss_pred CccCCcccCChhHHHHHHHHHHHHHHHHHh--HHHHHHHHHHhcCCcccccccccCChhHHH-HHHHhhcCcHHHHHHHH Confidence 666554432 11222111 1111111111 11133346899987421100000000 000 00013568899999988 Q ss_pred HHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCccee Q lcl|Aclame:pro 77 IAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIA 156 (708) Q Consensus 77 ~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~ 156 (708) ++...-+. |+..+ + +..+ .+..++..|+++.....+..++++.|++|+-|.... +..++.+.++ T Consensus 78 ~~~l~~~g----f~~~d--~--~~~~----~~~~i~~~N~~d~~~~~~~~~a~~~G~af~~v~~~~----~~~d~~g~~~ 141 (479) T protein:vir:99 78 AQQLIVDG----YRKTG--T--NENA----KGWDTWRLNQMDKQQFWLNRAVLTFGYAFIKVTSGI----SPLDGTTVAR 141 (479) T ss_pred Hhhccccc----ccCCC--c--hhhH----HHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCC----CCcCCCCceE Confidence 77543111 22211 1 1122 234566789999999999999999999987765321 1223445555 Q ss_pred eEEeecchhhee--cCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeeeec Q lcl|Aclame:pro 157 IEPIYDPSRSVW--FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVR 234 (708) Q Consensus 157 i~~v~~~~~~v~--~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~ 234 (708) +..+ ++.+++ ||...+. . ..++. +. . +......+|... T Consensus 142 i~~~--~p~~~~~iydd~~~~----~-~~~~~---~~----------------------~--------~~~~~~~~~~~~ 181 (479) T protein:vir:99 142 IKCI--DPRDAFAIWEDPYWD----E-WPKYL---LE----------------------R--------QPNGQYWWWTEE 181 (479) T ss_pred EEEe--chhheEEEecCCccc----c-eeeEE---Ee----------------------e--------cCceeEEEEecc Confidence 5433 233432 4321110 0 00000 00 0 000001111111 Q ss_pred ceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEe Q lcl|Aclame:pro 235 KESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGK 314 (708) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~ 314 (708) . +.++ ...+|.-.+.++.|.+++.+|++||... T Consensus 182 ~-----~~~~------------------------------------------~~~~~~~~~~~~~~h~~g~vPvv~f~n~ 214 (479) T protein:vir:99 182 D-----YSIF------------------------------------------EFKQGKFIYRETVSHDYGHIPFVRYVNV 214 (479) T ss_pred e-----EEEE------------------------------------------EecCCceeeccccccCCCCcceEEeecC Confidence 0 0000 0011111222334445567777776644 Q ss_pred eeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhc--cchHHHHHhhcccCCceeeecccccccc Q lcl|Aclame:pro 315 RWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQI--RGLEKHWEARNKKRPAFLPLREVRDKSG 392 (708) Q Consensus 315 ~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai--~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 392 (708) +. ....|.|.+..+++.++.+|+.+|.+...+...+.+..++- |.. ++..............++... +. T Consensus 215 ~~---~~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~-G~~~~~~~~~~~~~~~~~~~~i~~~~---~~-- 285 (479) T protein:vir:99 215 MD---LRGVCYGDVEPLVTVAKAIDKTGLDILLVQHHQSFQIRWAT-GLMLPEGANADQEKMRFAQESMLISQ---NE-- 285 (479) T ss_pred CC---cCcCCcchhHHHHHHHHHHHHHHHHHHHHHHHhhchhhhhc-CCCcccccccchhccccccccceeec---CC-- Confidence 32 12357889999999999999999999988887777664442 211 000000000001111111111 00 Q ss_pred cccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 393 NIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAG 472 (708) Q Consensus 393 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~~ 472 (708) . ..+...+... ...+...+......+-.+||+++...|..+|.||.|+......-........+.|..++++++ T Consensus 286 ----~-~~~~q~~~~~-~~~~~~~l~~~i~~i~~~t~~p~~~~g~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~al~~~~ 359 (479) T protein:vir:99 286 ----K-ASFGAIPAAP-LDGLLNAYKESLLEFLALAQLPPHIAGQIVNVAADALAAGTRQTMQKLFEKQATWKASHNQTM 359 (479) T ss_pred ----C-ceEEEecccc-hHHHHHHHHHHHHHHhccCCCCHHHcccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0 1122122111 244555566666666677888889999888999999998888877777778888888888888 Q ss_pred HHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeec-ccchhHHHHHHHHHHHHH Q lcl|Aclame:pro 473 EVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVG-PSYTARRDATVSVLTNVL 551 (708) Q Consensus 473 ~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~-~~~~~~r~~~~~~l~~ll 551 (708) ++++.+. + ...... .++|.+.=. +.+++ ..+..+.+..|. T Consensus 360 ~l~~~~~----~---------~~~~~~-------------------------~~~i~~~w~~~~~~s-~~~~ad~~~kl~ 400 (479) T protein:vir:99 360 RLVNKIE----G---------RTEEAT-------------------------DLDFTITWQDVTIQS-LAQFADAWAKMV 400 (479) T ss_pred HHHHHHc----C---------CCcccc-------------------------ceeeeEEecCCCCCC-HHHHHHHHHHHH Confidence 7765532 1 100000 012222111 12223 345667777776 Q ss_pred HhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 552 SSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQ 631 (708) Q Consensus 552 q~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~q 631 (708) +++.- +. ..+++++.+-.-.+ +++++.... ++....+. ..+......... +....... T Consensus 401 ~ag~i--s~-----et~l~~l~gv~~~~-~e~~~~~~~----------~~~~~~~~---~~~~~~~~~~~~-~~~~~~~~ 458 (479) T protein:vir:99 401 ESLKI--PA-----EGVWDMIPNLDQST-VNGWKEIYD----------REGDFGKY---MRKLQNGPDPAE-QRGGPNGA 458 (479) T ss_pred hcCCC--CH-----HHHHHhcCCCCHHH-HHHHHHHHH----------HHHHHHHH---HHHHhcccCccc-ccCCCCCC Confidence 65321 11 22333331100111 122211100 00000000 000000000000 00000000 Q ss_pred HHHHHHHH---HH---HHHHH Q lcl|Aclame:pro 632 AEAQKATN---ET---AQTQI 646 (708) Q Consensus 632 ae~~k~~~---~~---~~~q~ 646 (708) .+...+.. +. -++.. T Consensus 459 ~~~~~~~~~~~~~~~~~~~~~ 479 (479) T protein:vir:99 459 TNMQQANNKTGEPASLNKSGA 479 (479) T ss_pred CCCCCCCCCCcchhccCCCCC Confidence 00000000 00 01111 No 102 >protein:vir:80211 Length: 514 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522882;genbank:gi:158345175;genbank:GeneID:5687474 Probab=99.54 E-value=1.8e-12 Score=84.93 Aligned_cols=494 Identities=9% Similarity=-0.010 Sum_probs=221.8 Q ss_pred hHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCC-CCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHHHHHhc- Q lcl|Aclame:pro 5 LEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGG-QWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEYRN- 82 (708) Q Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~-Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~g~~~~- 82 (708) ++.....+...++ ...|...|+++.+|. +... ..+...... ..+ ++...-+.-...++.+.+.... T Consensus 1 m~~~~~~l~~k~~-----R~~~e~~w~e~a~~~-lP~~~~~~~~~~~~-~~~-----~~~~~dstg~~a~~~LAa~l~~~ 68 (514) T protein:vir:80 1 MRQQASAMWAEYR-----DSTAIRKAEDFAKFT-IASLMVDPLDKTHQ-AEV-----VEYDFQSAGAFLVNNLTAKLALT 68 (514) T ss_pred CccchHHHHHHhh-----cchHHHHHHHHHHHh-cccccCCCCCCccc-ccc-----cccccchhHHHHHHHHHHHHHhh Confidence 3444444433332 334556666665542 2211 111111000 000 1111223334445544433332 Q ss_pred ----CcceeEEecCCC------cchHHHHHH------HHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccC Q lcl|Aclame:pro 83 ----NRITVKFRPGDR------EASEELANK------LNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEY 146 (708) Q Consensus 83 ----nr~~~~v~pr~~------~~d~~~A~~------l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~ 146 (708) +++=+++.+.+. ..+.+.+++ ++..+......|++..+...++.+.+..|.|.+.+. . T Consensus 69 ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~------~ 142 (514) T protein:vir:80 69 LFPPGRPSFQIELDDTLQELAAANGIDQSELHSRTADLERRATRRLFVNASLSKLHRILKLLVVTGNALFYRE------P 142 (514) T ss_pred hcCCCCcccccccCchhhhhccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEEe------c Confidence 344445544321 122222333 344455566789999999999999999999876552 1 Q ss_pred CCCCCCcceeeEEeecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeE Q lcl|Aclame:pro 147 DPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIY 226 (708) Q Consensus 147 d~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~ 226 (708) ++. .++.. |+.++++..+.. . ...-++++.+|+.++|-..|+...... .......+.+. T Consensus 143 ~~~----~~~~~----pl~~y~v~~d~~---G-~v~~i~rr~~~~~~~l~~~~~~~~~~~---------~~~~~~~~~v~ 201 (514) T protein:vir:80 143 GTG----KMLVW----TMQSYTVRRTSH---G-DPAVVVLRQQMPFRELTPEIQADAQAK---------QIAKRDSDKCD 201 (514) T ss_pred CCC----cEEEE----EcCeEEEeeCCC---c-CeEEEEeeeeecHHHhhhhhhhhhhhh---------hccCCCCCceE Confidence 111 22222 455666544332 1 122378889999998876664322110 00001111221 Q ss_pred EeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCc Q lcl|Aclame:pro 227 IAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHI 306 (708) Q Consensus 227 v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~ 306 (708) |. .+.++.+.. .... ..++.-..|..++ ..+-||+.++ T Consensus 202 v~-----------~~v~~~~~~-----------------------------~~~~-~sv~~e~~g~~i~-~es~y~~~e~ 239 (514) T protein:vir:80 202 LY-----------TVIEWQPTP-----------------------------NGKR-CAVWHELEGKRVG-PESSYPAHLC 239 (514) T ss_pred EE-----------EEEEeecCC-----------------------------CCeE-EEEEEeccceeec-ccCccccccC Confidence 11 111111110 0001 1111122333343 3466888888 Q ss_pred ceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeecc Q lcl|Aclame:pro 307 PLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLRE 386 (708) Q Consensus 307 p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~ 386 (708) ||+++.... .+|..+|.|.+....+--+.+|++....+.....+.++.++++.+.+-....... ..++.++. T Consensus 240 P~i~~Rw~~--~~ge~YGrgp~~~al~D~k~L~~l~~~~l~~~~~a~~~~~~v~~~g~~~~~~l~~---~~~g~~v~--- 311 (514) T protein:vir:80 240 PYVPVAWNV--PDGEHYGRGYVEEYSGDFARLSILSERLGLYEFEALSLLNLVDEAKGGAVDDYRD---AETGDFVP--- 311 (514) T ss_pred CeeeeeeEe--cCCCCcccchHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceeCcccccchhhhcc---cCCceeec--- Confidence 888764333 6888999999999999999999999999999999999999998765533222111 11111111 Q ss_pred cccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 387 VRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMASFIYLDNMAK 466 (708) Q Consensus 387 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n~sg~ai~~~q~q~~~~~~~~~dn~~~ 466 (708) +. .+.+ .+.+ .....--+.....++...+.|....=+.. ......+.|++-|..+.+--...+...+.+|.. T Consensus 312 g~--~~~v----~~~~-~~~~~d~~~~~~~i~~~~~rI~~aFml~~-~~rd~~rvTAtEV~~r~~E~~~~LGpv~~rl~~ 383 (514) T protein:vir:80 312 GQ--VGSV----ASYE-RGDYNKIAQASASVESIVMRLNRAFMYTG-QVRDAERVTVEEIRTVAEEAENLLGGVYSLLAE 383 (514) T ss_pred CC--Cccc----eeee-cCcccchHHHHHHHHHHHHHHHHHHhhhc-cCCCCCCCCHHHHHHHHHHHHHHhhHHHHHHHH Confidence 10 1111 1111 11111223334566666666655432211 112223468888988888877788887777763 Q ss_pred -HHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHH Q lcl|Aclame:pro 467 -SLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVS 545 (708) Q Consensus 467 -~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~ 545 (708) +..-+.+..+.++.+.- .| . . + .. | . + .+.+.+..+ -.+-.|.+..+ T Consensus 384 Ell~Pli~r~~~il~r~~--------~g---~--l----P-~~-p--~------~----l~~~~~vs~-la~l~r~~~~~ 431 (514) T protein:vir:80 384 TLQAPLAYLTMYEASRGN--------GG---M--L----L-GI-A--Q------G----VYRPSIITG-IPALTRNIETA 431 (514) T ss_pred HHHHHHHHHHHHHHhhhc--------cC---C--C----C-CC-C--c------h----hhcceeeec-HHHHHHHHHHH Confidence 44433333333332110 00 0 0 0 00 0 0 0 112222222 23445677777 Q ss_pred HHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhhhh-cccCcchHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 546 VLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISG-IAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQ 624 (708) Q Consensus 546 ~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~-~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq 624 (708) .+.++++.++...+..+ .++ +.-+.++++..+-....... ..-..+++.+..+++++++++++++.++..+. T Consensus 432 ~l~~~~~~i~~l~~~~p----~v~---d~id~d~~~~~~a~~~Gvp~~~i~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 504 (514) T protein:vir:80 432 NILRATQEASAIVPALV----QLS---KRFDPEKLVERIFANNSVDLSTLSKDPDVVAAEAEQEAALAQQQLDVASGALA 504 (514) T ss_pred HHHHHHHHHHHHhccch----hhh---hcCCHHHHHHHHHHHhCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 77776665543222211 122 33334445544443332221 11121222221111111111111111111100 Q ss_pred HHHHHHHHHHHHH Q lcl|Aclame:pro 625 AQMVAAQAEAQKA 637 (708) Q Consensus 625 ~~~~~~qae~~k~ 637 (708) +. .++...-. T Consensus 505 ~~---~~~~~~~~ 514 (514) T protein:vir:80 505 AE---TSAGVLTS 514 (514) T ss_pred Hh---hhccccCC Confidence 00 01000000 No 103 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=99.54 E-value=1.8e-12 Score=85.01 Aligned_cols=504 Identities=11% Similarity=0.062 Sum_probs=218.2 Q ss_pred CCc-chHHHHHHHHHHHHHHHHh-hH-HHHHHHHHHHHHhhcCCCCCCHHHHHH-hhhhh------hhcCCC--ceeecc Q lcl|Aclame:pro 1 MAE-TLEKKHERIMLRFDRAYSP-QK-EVREKCIEATRFARVPGGQWEGATAAG-TKLDE------QFEKYP--KFEINK 68 (708) Q Consensus 1 ma~-~~~~~~~~~~~~~~~~~~~-~~-~~r~~~~~d~~~~~~~G~Qw~~~~~~~-l~~~~------q~~grp--~~~~N~ 68 (708) |-- .+-..+..+...|.+.... .. ..+.+.... ..||.|+| .++.. .+..+ ....+| .+.+|. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~--~~YY~g~h---~Il~r~~~~~~~~~~~~~d~~~~nnki~~nf 75 (537) T protein:vir:78 1 MTSPLLNKPIDQLGGLLNTEITTYMASNHIKWAHIG--ENYYNQEN---DIEKSRIFYMNDKGQLREDNYASNVKISHGF 75 (537) T ss_pred CCcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHhcccc---hhhhcccccccccccccccccccccccccch Confidence 221 1111122333333333222 21 122333322 35788986 11111 00000 001122 478899 Q ss_pred hHHHHHHHHHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCC Q lcl|Aclame:pro 69 VATELNRIIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDP 148 (708) Q Consensus 69 i~~~i~~i~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~ 148 (708) .+.+|+..+|+...+.+.+.+ -+ +++.+..+. ++.+. .++++........++.++|.+|..+.++. T Consensus 76 ~k~Ivd~~~~yl~G~Pv~~~~--~d-~~~~e~~~~----l~~~~-~~~~~~~~~el~~~~s~~G~ay~~~y~de------ 141 (537) T protein:vir:78 76 FTELVDQLAQYLLSNGVEVKV--KD-EDNTQLDEI----LQEYF-DEDFQATIDTLVTNASKKGFEGIFARTTS------ 141 (537) T ss_pred HHHHHHHHhhhhcccCceeec--Cc-chhHHHHHH----HHHHh-hccHHHHHHHHHHHHhhcCeeEEEeeecC------ Confidence 999999999999999887763 22 233343333 33333 36888888999999999999988775432 Q ss_pred CCCCcceeeEEeecchhhe--ecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeE Q lcl|Aclame:pro 149 MDDRQRIAIEPIYDPSRSV--WFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIY 226 (708) Q Consensus 149 ~~~~~~i~i~~v~~~~~~v--~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~ 226 (708) .+.+++..+ ++..+ +||.. . +.. .+.+.+....... ...+.+.+. T Consensus 142 ---~~~~~~~~i--~p~~~~pv~d~~-~-----~~~-~~~~~y~~~~~~~---------------------~~~~~~~~~ 188 (537) T protein:vir:78 142 ---EGKLKFQTV--DGLTLIPVFDDY-G-----VLK-MIIRWYSEIRYST---------------------KQQSTETIW 188 (537) T ss_pred ---CCceEEEEE--ccceeEEEEcCC-C-----Cce-eEEEEEeeeeccc---------------------cccCcceEE Confidence 234455433 33443 34431 1 111 1222222111000 001223344 Q ss_pred EeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEE--------ecceeeecC Q lcl|Aclame:pro 227 IAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVV--------DGDGFLEKP 298 (708) Q Consensus 227 v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~--------~~~~il~~~ 298 (708) ..++|....... +.+..........+ .... ....+.+.+. .+....... T Consensus 189 ~~evyt~~~i~~--y~~~~~~~~~~~~~--~~~~-------------------~~~~i~~~~~~~~~~~~~~~~~~~~~~ 245 (537) T protein:vir:78 189 HADVWNEEAVCY--YIQDDEGVSTTYKL--DEAY-------------------NPNPAPHVLAIEESTDADFEDTDGYQV 245 (537) T ss_pred EEEEEcCCcEEE--EEecCCcccccccc--cccc-------------------cccccceeeeccccccccccccccccc Confidence 456665443221 11111110000000 0000 0000000000 011112233 Q ss_pred CCCCCCCcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccC Q lcl|Aclame:pro 299 RRIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKR 378 (708) Q Consensus 299 ~~~p~~~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~ 378 (708) .+.+++.+|+++|.- ...+.|.+..+++.++.+|.+.|.+.+.+...+.+.+++.-...++..+..... .. T Consensus 246 ~~~~~g~iPvv~f~n-------n~~~~sd~e~v~~LiDayd~~~S~~an~~~~~~~~ilvi~g~~~~~~~~~~~~l--~~ 316 (537) T protein:vir:78 246 LGRSYSKFPFQLLYN-------NKDGMSDVKRVKSIIDDYDVMNCFLSNNLQDFSEAIYVVKGFSGDSTDKLRQNI--KA 316 (537) T ss_pred cccCCcceeEEEecc-------CccCCCchhhhHHHHHHHHHHHHhhhhHHHHhcCceeeeecCCCccchhHHHHH--hh Confidence 445556666655432 224679999999999999999999999999998887776432222222221111 11 Q ss_pred CceeeecccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccchhHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 379 PAFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMASF 458 (708) Q Consensus 379 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n~sg~ai~~~q~q~~~~~~ 458 (708) .+.+.... ..+ ...++..+.-..+....+..+.+.|...|.+-+......+|.||+|+..+......... T Consensus 317 ~~~i~v~~---d~~-------~v~~l~~~~~~~~~e~~ld~L~~~I~~~s~~~~~~~~~~gn~SGvAlk~~~~~l~~ka~ 386 (537) T protein:vir:78 317 KKMIGVNG---DNA-------GMEIQTVSIPYEARKAKMDIDVENIYRSGMGFNSTAVGDGNVTNVVIKSRYTLLAMKAR 386 (537) T ss_pred cCceeecC---CCC-------ceeEEEecCCHHHHHHHHHHHHHHHHHhcCCCCCccccccCCcHHHHHHHHhhHHHHHH Confidence 12221110 011 12333333334667778888889998887655554445578899999998877777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchh Q lcl|Aclame:pro 459 IYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTA 538 (708) Q Consensus 459 ~~~dn~~~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~ 538 (708) ..-+-|..+++++.++++.++...... .+ ++ .||.+.-.+..+. T Consensus 387 ~ke~~f~~~l~~~~~~i~~~~~~~~~~--~~---------d~-------------------------~~i~i~f~~~~P~ 430 (537) T protein:vir:78 387 KMETSLRKVLRWCADMVVSDIALRGLG--EY---------DS-------------------------NDICFEIEPHVLA 430 (537) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcCCc--cc---------cc-------------------------ceeeEEeccCCCC Confidence 777777778777777777765432100 00 00 1233333344454 Q ss_pred HHHHHHHHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 539 RRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNP 618 (708) Q Consensus 539 ~r~~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~ 618 (708) .-.+..+.+..+.+. +- ....+++.++.+ .+. ++....... T Consensus 431 n~~e~a~~~~~l~~~-gi------iS~eT~l~~~p~-------------------vdd--~e~ek~~~e----------- 471 (537) T protein:vir:78 431 NELDIATTRKTEAET-EA------LKIGNIMTVAPR-------------------IGD--DETLKLIAE----------- 471 (537) T ss_pred CHHHHHHHHHHHHhc-Cc------chHHHHHHhCCC-------------------CCC--HHHHHHHHH----------- Confidence 333444444443322 11 111222322221 110 000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--hhhhhhhhhh Q lcl|Aclame:pro 619 EMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVMEAIRLLKD--VAESQQQQFQ 696 (708) Q Consensus 619 ~~~~aq~~~~~~qae~~k~~~~~~~~q~e~~~~~~~~~~~~a~~~~~~~q~~~~~~~~~~~~~~~~~~--~~~~~~~~~~ 696 (708) +. .....+..++-.+ .++ +...+.... + ++... ....+++..+ T Consensus 472 -e~------~~~~~~~~~~~~~------------~~~------------~~~~~~~~~--~--~~~~~~~~~~~~~~~d~ 516 (537) T protein:vir:78 472 -EL------DLDYNELKDALAE------------QDA------------QSLDVSPDV--Q--AMLDGLPVNANQPPVDP 516 (537) T ss_pred -HH------Hhhhhhhhhhhhh------------hcc------------cccCcCcch--h--hhcCCCCCCCCCCCCCc Confidence 00 0000000000000 000 000000000 0 00000 1111111111 Q ss_pred cCCCCCCCCCCC Q lcl|Aclame:pro 697 SPPQSPADLMPS 708 (708) Q Consensus 697 ~~~~~~~e~~~~ 708 (708) ..|...+..-|| T Consensus 517 ~~~~~~~~~~~~ 528 (537) T protein:vir:78 517 NQPVADPNVVPP 528 (537) T ss_pred cCCCCCCCCCCC Confidence 122222222222 No 104 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=99.54 E-value=1.2e-12 Score=85.91 Aligned_cols=481 Identities=9% Similarity=0.020 Sum_probs=238.8 Q ss_pred CC--cchHHHHHHHHHHH-----HHHHH-----hhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecc Q lcl|Aclame:pro 1 MA--ETLEKKHERIMLRF-----DRAYS-----PQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINK 68 (708) Q Consensus 1 ma--~~~~~~~~~~~~~~-----~~~~~-----~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~ 68 (708) |. ++.+..++++...+ ....+ --.+.+....+.+. +|.|++|.-..+ ...+....+..++.|+ T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~I~~w~~--~Y~g~~~~~~~~---~~~~~~~~~~~~sl~~ 75 (517) T protein:vir:98 1 MKVIQRIKNFFKRGGYALSGQTLKSINDHEKINIDPNELARIERNLR--QYEGDYPQVEYI---NSQGKIQERDYMTLNL 75 (517) T ss_pred CchHHHHHHHHHHHHHHhcccchhHhhcCCceecCHHHHHHHHHHHH--HhcCCCcccccc---cccccccccceeecCc Confidence 33 33333343332211 11111 11233444444444 477876632111 1112223455778899 Q ss_pred hHHHHHHHHHHHhcCcceeEEecCCCc-chHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCC Q lcl|Aclame:pro 69 VATELNRIIAEYRNNRITVKFRPGDRE-ASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYD 147 (708) Q Consensus 69 i~~~i~~i~g~~~~nr~~~~v~pr~~~-~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d 147 (708) -+.++....+...+-.+.+.|...... .+.......++.+.-+.+.|++.....+++++++..|-|++++.++. T Consensus 76 ~~~i~~~~A~Ll~~e~~~i~v~d~~~~~~~~~~~~~~~e~l~~i~~~n~f~~~~~~~~e~a~a~G~~a~k~~~d~----- 150 (517) T protein:vir:98 76 RKLSADVLSGLVFNEQCEVYVSDAKDEEKKDNSFKTAHEFIQHVFQHNKFIKNLSDYLEPTFALGGLTVRPYVDN----- 150 (517) T ss_pred HHHHHHHhhhhhcCCcceEEecccccccccccchhHHHHHHHHHHHhccHHHHHHHHHHHHhhhCCEEEEEEEeC----- Confidence 998998888888888889988643211 11123444566777777799999999999999999999999998761 Q ss_pred CCCCCcceeeEEeecchhheecCCcccc-CChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeE Q lcl|Aclame:pro 148 PMDDRQRIAIEPIYDPSRSVWFDPDAKK-YDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIY 226 (708) Q Consensus 148 ~~~~~~~i~i~~v~~~~~~v~~Dp~a~~-~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~ 226 (708) +.++|..+ +...|| |-... .+...|-++++..... +. .. .-.+ T Consensus 151 -----~~~~I~~v--~ad~~~--Pl~~~~~~v~~~ai~~~~~~~~-~~--------~~------------------~~Yt 194 (517) T protein:vir:98 151 -----GEIEFSWA--LANAFY--PLRSNSNGISEGVMKSVTTKVI-GN--------KT------------------VYYT 194 (517) T ss_pred -----CeeEEEEE--cCCeeE--EEEecCCCeEEEEEEEEEEEee-cC--------Cc------------------eEEE Confidence 23455544 344555 32211 1222233333221110 00 00 0000 Q ss_pred Eeeeeeecce-------EEEEEEEec---CccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeee Q lcl|Aclame:pro 227 IAKYYEVRKE-------SVDVISYRH---PITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLE 296 (708) Q Consensus 227 v~e~~~~~~~-------~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~ 296 (708) ..|+++.... +.+-..|.. ...|..+...+ + |.-+.....+ T Consensus 195 ~lE~H~~~~~~~~~~~y~I~n~ly~s~~~~~lG~~v~L~~---------------------------~-~e~l~~~~~~- 245 (517) T protein:vir:98 195 LLEFHEWEKTEEGESLYVITNELYKSDNEGEIGKRIPLEE---------------------------L-YEGMQEKTYI- 245 (517) T ss_pred EEEEEecCceeccCCcEEEEEEEEecCCCccccccccccc---------------------------c-ccCCCcceeE- Confidence 1111110000 000000110 00111111000 0 0000000011 Q ss_pred cCCCCCCCCcceeeEE---EeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHH- Q lcl|Aclame:pro 297 KPRRIPGEHIPLIPVY---GKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWE- 372 (708) Q Consensus 297 ~~~~~p~~~~p~~p~~---~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~- 372 (708) ++-..|.+-++ +......+++.|.|++.+++|..+.+|...+++.|.+.+ ++.++.++.+.+....+.-. T Consensus 246 -----~g~~~Plf~y~~~p~~N~~~~~splG~S~~~~a~~~~d~lD~~~s~~~~e~~~-g~~~i~vp~~~l~~~~~~~g~ 319 (517) T protein:vir:98 246 -----QGLSRPLFNYLKPSGFNNINPHSPLGLGITDNSVSTLKKINDTYDQFWWEIKM-GQRTVFVSDVMLRTVPDESGM 319 (517) T ss_pred -----CCCCcceEEEecCCcccccccCCCCCCchhhhhHHHHHHHHHHHHHHHHHHHh-CCcceecChhhhccccCCCCc Confidence 11111221111 111123367788999999999999999999999999987 45578888877632111000 Q ss_pred hh----cccCCceeeecccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccc--hhHHHH Q lcl|Aclame:pro 373 AR----NKKRPAFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSN--IAQETV 446 (708) Q Consensus 373 ~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n--~sg~ai 446 (708) .. ..+...+..+. ... +...++..++.-...++++.++.....+....|++....|..+. .||++| T Consensus 320 ~~~~~~d~~~~~y~~~~---~~~-----~~~~i~~~~~~iR~e~~~~~~~~~L~~i~~~~Gls~~t~~~~~~~~kTATEi 391 (517) T protein:vir:98 320 PPPQVFDPDVNVYKSIR---MGT-----DEEFVKDVTHDIRTEQYKEAINQALRTLEMELKLSVGTFSFDGRSMKTATEI 391 (517) T ss_pred ccCCCCCcccceeeecc---CCC-----CCCceeeeccccchHHHHHHHHHHHHHHHHHhCCCcccccccccccccHHHH Confidence 00 00111111111 111 11123334444344678888888889999999999999997654 478888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--cCCCcEEEEeccCCCceEEEecccccccCCCceEEeecccee Q lcl|Aclame:pro 447 NNLMNRADMASFIYLDNMAKSLKRAGEVWLSMAREV--YGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVG 524 (708) Q Consensus 447 ~~~q~q~~~~~~~~~dn~~~~~~~~~~~~l~li~~~--y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g 524 (708) ....+..-.....+...+..+++++.+.++.+..-+ |+.. + .. T Consensus 392 ~s~~~~~~~t~~~~~~~~~~aL~~lv~~i~~l~~~~~~~~~~---------------------~--------------~~ 436 (517) T protein:vir:98 392 VSENDLTYRTRNDHVYEVEQFIKGLVISVLELAKTYKLFGGE---------------------I--------------PS 436 (517) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC---------------------C--------------CC Confidence 888888777788899999999999999998776543 3210 0 01 Q ss_pred eEEEEEeecccchhHHHHHHHHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhhhhcccCcchHHHHH Q lcl|Aclame:pro 525 RYDVTVDVGPSYTARRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQI 604 (708) Q Consensus 525 ~~Dv~v~~~~~~~~~r~~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~ 604 (708) .++|+|+=+.+....+++.++.++++.+.+. . + ...+...+-..+-.-+++...+++...... .|....+. T Consensus 437 ~~~v~v~f~D~i~~D~~~~~~~~~~~v~aG~-m-s--~~~~i~~~~g~~eeeA~~e~~~i~~E~~~~---~~~~~~~~-- 507 (517) T protein:vir:98 437 AEHIGVDFDDGVFQDRSALLRFYGQAKTFGF-I-P--TVEAIQRIFKVPKKTAEQWLEEIRKDQIEL---DPVTISQR-- 507 (517) T ss_pred CcceEEEcCCCCCCCHHHHHHHHHHHHhcCC-C-C--HHHHHHHhCCCChHHHHHHHHHHHHhcccc---CCCCcccc-- Confidence 2455666555656667777778888766542 1 1 111111111112222344444554333211 11100000 Q ss_pred HHHHHHHHHHHHHHH Q lcl|Aclame:pro 605 VQQAQMAAQSQPNPE 619 (708) Q Consensus 605 ~~~~qq~qq~~~~~~ 619 (708) .... ..-+.+ T Consensus 508 ~~~~-----~~gd~e 517 (517) T protein:vir:98 508 AQKR-----MFGDEE 517 (517) T ss_pred ccCC-----CCCCCC Confidence 0000 000000 No 105 >protein:vir:103330 Length: 517 # NCBI annotation: head portal-like protein # Family: family:all:481 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039666;genbank:gi:125999995;genbank:GeneID:4818406 Probab=99.54 E-value=2e-12 Score=84.71 Aligned_cols=498 Identities=11% Similarity=0.031 Sum_probs=234.5 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEY 80 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~g~~ 80 (708) |-.+.-....++..+|+...+....|...|+++.+|.+ .. =.+... . +....+ +.-+.-...++.+.+.. T Consensus 1 ~~~~~~~e~~~l~~r~~~Lk~~R~~~e~~w~e~~~~~l-P~-~~~~~~-~-----~~~~~~--~~dstg~~a~~~LAa~l 70 (517) T protein:vir:10 1 MDMRFAGNKSKIPKLYEQLVGKRSPFLSRAENYSRFTL-PY-LMADVN-D-----DLSSQN--AWQDDGASATNFLSNKL 70 (517) T ss_pred CcccccccHHHHHHHHHHHHHhhhHHHHHHHHHHHHhc-cc-cccCCC-C-----Cccccc--cccchHHHHHHHHHHHH Confidence 44443344567888888888888888888888776532 11 000000 0 000011 22333444555544433 Q ss_pred hc-----CcceeEEecCCCc------ch---HHHHH---HHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeecc Q lcl|Aclame:pro 81 RN-----NRITVKFRPGDRE------AS---EELAN---KLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLV 143 (708) Q Consensus 81 ~~-----nr~~~~v~pr~~~------~d---~~~A~---~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~ 143 (708) .. +++=+++.+.+.. +. .++.+ .++..+......|++..+...++.+.+..|.|+..+ + T Consensus 71 ~~~ltpp~~~WF~l~~~~~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~--~-- 146 (517) T protein:vir:10 71 SQVLFPAQRSFFRIDLTPEGIKQLDNEAMTQSTAQKLLSDVEKAAMLYGESLQFRPAVVEAFKHLIVTGNVMMYH--P-- 146 (517) T ss_pred HHhhcCCCCccccccCCHHHHHhhccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEE--e-- Confidence 33 3333444443210 00 11222 234555556778999999999999999999986532 1 Q ss_pred ccCCCCCCCcceeeEEeecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCc Q lcl|Aclame:pro 144 NEYDPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGAD 223 (708) Q Consensus 144 ~~~d~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~ 223 (708) + +...++.. |+.++++..+.. . ...-++++.+++..++.+.|+........ . ..+...+ T Consensus 147 -~-----~~~~~~~~----pl~~y~v~~d~~---G-~v~~ivrr~~~~~~~l~~~~~~~~~~~~~-----~--~~~~~~~ 205 (517) T protein:vir:10 147 -D-----KTSPIQAV----PLHHYCVRRDNN---G-TVLDIVFLQEKALETFEPSIRMAIQASRK-----G--KQYKDKD 205 (517) T ss_pred -C-----CCCcEEEE----EcCeEEEeeCCC---c-CeEEEEeeeeccHHHHHHHhhhhcchhhh-----h--hccCCcC Confidence 1 11123333 455666644332 1 12237888999999999999864321000 0 0111111 Q ss_pred eeEEeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCC Q lcl|Aclame:pro 224 VIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPG 303 (708) Q Consensus 224 ~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~ 303 (708) .+ .++.+.++... + +.+++.-+.+..+ ...+.||+ T Consensus 206 ~v-------------~v~~~v~~~~~--------------------~-----------~~~~~~~~d~~~~-~~~s~y~~ 240 (517) T protein:vir:10 206 NV-------------KLYTHAKRTKD--------------------G-----------KYLIRQSADDVPV-GKESTVTE 240 (517) T ss_pred ce-------------EEEEEEEEeCC--------------------C-----------ceEEEEEeCceee-cccccccc Confidence 11 11111111000 0 0111112223333 33466888 Q ss_pred CCcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceee Q lcl|Aclame:pro 304 EHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLP 383 (708) Q Consensus 304 ~~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~ 383 (708) .++||+++.... .+|..+|.|.+....+--+.+|++...++.......+++++++.+.+..... ..+++.-. T Consensus 241 ~e~P~~~~Rw~~--~~ge~YGrgp~~~~L~D~k~L~~l~~~~~~~~~~a~~~~~lv~~~~~~~~~~------l~~~~~g~ 312 (517) T protein:vir:10 241 DKSPFLILTWKR--SYGEDYGRGMAEDHAGAFFVIQFLSEALARGMALMADVKYLVKPGSYTDINQ------FVEGGSGA 312 (517) T ss_pred ccCCeeeeeeee--cCCCCcccchHHHhHHHHHHHHHHHHHHHHHHHHhccCCcccCcccccchhh------ccCCCccc Confidence 889988765443 5888999999999999999999999999999999999999998776533221 11111111 Q ss_pred ecccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccchhHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 384 LREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMASFIYLDN 463 (708) Q Consensus 384 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n~sg~ai~~~q~q~~~~~~~~~dn 463 (708) +.++ ..+.+ .+++ .....-.+...+.++...+.|....=++....-.....|++-|..+.+--...+...+.+ T Consensus 313 ~~~g--~~~~v----~~~~-~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~r 385 (517) T protein:vir:10 313 VLHG--VEGDI----HIVQ-LGKYADYTPIQAVLNDYRQRIGRVFMMEAMTRRDAERVTAYEIQRDAMLVEQSLGGVYSL 385 (517) T ss_pred cccC--Ccccc----eeee-cccccchhHHHHHHHHHHHHHHHHHhhhhhhccCCccccHHHHHHHHHHHHHHhhhHHHH Confidence 1111 11111 1111 112212344456677777777766533222222223468888888888777778777777 Q ss_pred HHH-HHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHH Q lcl|Aclame:pro 464 MAK-SLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDA 542 (708) Q Consensus 464 ~~~-~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~ 542 (708) |.. +..-+.+.++..+..-.. ++ + ..+.+..+.+ +-.|.+ T Consensus 386 l~~Ell~Pli~r~~~~l~~~l~--------~~-------------------------~-----v~~~~~s~la-~l~r~~ 426 (517) T protein:vir:10 386 FATTFQGPLARWFMNGISSILT--------SK-------------------------N-----VSPTILTGIE-ALGRMA 426 (517) T ss_pred HHHHHHHHHHHHHHHHhhhhcC--------CC-------------------------C-----ccceeeccHH-HHHHHH Confidence 663 333333333322211000 00 0 1222223333 445777 Q ss_pred HHHHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhh-hhhcccCcchHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 543 TVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLL-ISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMV 621 (708) Q Consensus 543 ~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~-~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~ 621 (708) ..+.+.++++.+++..+..+ .+.... +.++++..+-.... +.....+ +++.++.+++++.+++.+. .. T Consensus 427 ~~~~i~~~~~~i~~~a~~~~----~~~~~i---d~d~~~~~~a~~~Gvp~~~irs--~~ev~~~~~~~~~~~~~~~--~~ 495 (517) T protein:vir:10 427 ELDKLGTFNGYVSMTAQWPE----PLQQAI---KWPDFTDWVQGQISANFPFFKT--QDELNAEAQAQQEQEATKY--AA 495 (517) T ss_pred HHHHHHHHHHHHHHhhcCCh----HHHhcC---CHHHHHHHHHHHhCCChhhcCC--HHHHHHHHHHHHHHHHHHH--HH Confidence 77788877776654322111 122222 33444444433321 1112221 1111111111111110000 00 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 622 LAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAV 677 (708) Q Consensus 622 ~aq~~~~~~qae~~k~~~~~~~~q~e~~~~~~~~~~~~a~~~~~~~q~~~~~~~~~ 677 (708) ++-. ++....++. .+...+-. + T Consensus 496 ~~ag----------~~~~~~~~~------------------~~~~~~~~------~ 517 (517) T protein:vir:10 496 EQAG----------KAIPDMVKN------------------GQINPQGG------Q 517 (517) T ss_pred HHHH----------HHHHHHHhC------------------CCCCCCCC------C Confidence 0000 000000000 00000000 0 No 106 >protein:vir:78696 Length: 542 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285446;genbank:gi:148724480;genbank:GeneID:5220167 Probab=99.53 E-value=2.2e-12 Score=84.44 Aligned_cols=518 Identities=11% Similarity=0.036 Sum_probs=243.0 Q ss_pred hHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHHHHHhc-- Q lcl|Aclame:pro 5 LEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEYRN-- 82 (708) Q Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~g~~~~-- 82 (708) ++.. ++.+|+...+....|...|+++.+|.+=..-.-+..... .+ ...+.-+.-...++.+.+.... T Consensus 1 mk~~---a~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~---~~-----~~~~~dstg~~a~~~Laa~l~~~l 69 (542) T protein:vir:78 1 MKGL---AQARYSAMRADREDFLDMARRCAALTLPYLLTEDGHASG---GR-----LQQPYQSLGSKGVNALSSKLMLSL 69 (542) T ss_pred ChhH---HHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCccc---cc-----ccccccchHHHHHHHHHHHHHHhh Confidence 2222 344555555666677777777766532110000000000 00 0112234445555555443333 Q ss_pred ---CcceeEEecCCC------c-chH---HHHHH---HHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccC Q lcl|Aclame:pro 83 ---NRITVKFRPGDR------E-ASE---ELANK---LNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEY 146 (708) Q Consensus 83 ---nr~~~~v~pr~~------~-~d~---~~A~~---l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~ 146 (708) +++=++..+.+. + +++ ++... ++..+......|++..+...++.+.+..|.|++.+. . T Consensus 70 tpp~~~WF~l~~~d~~l~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~------~ 143 (542) T protein:vir:78 70 FPIQTSFFKLQINDAEIASVPELTPEVRSEIDMNLSKMEKMVMQQIAESSDRVQLTAAMKHLIVTGNVLVFAG------K 143 (542) T ss_pred cCCCCccccccCCHHHHHhhccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEec------C Confidence 344444544321 0 111 12222 244555667789999999999999999999976542 1 Q ss_pred CCCCCCcceeeEEeecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcc-cccccccccccccCCCCCcee Q lcl|Aclame:pro 147 DPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPP-TSLDVTSMTSWEYNWFGADVI 225 (708) Q Consensus 147 d~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~-~~~d~~~~~~~~~~~~~~~~~ 225 (708) ++ .+.. |+.++++..++.- ...-+|++..|+..++.+.||+..- +.+.. .... .....+ T Consensus 144 ~~------~~~~----pl~~y~v~~d~~G----~vd~v~r~~~~t~~ql~~~fg~~~l~~~~~~-~~~~-----~~~~~~ 203 (542) T protein:vir:78 144 KT------LKVY----PLDRYVIERDGDG----NVIEIITRELVDRSLLPAEFQKQSLLEGKDS-NAVG-----EDGPKF 203 (542) T ss_pred CC------ceEE----ecceeEEeeCCCC----CeEEEeeeeecCHHHHHHhhccccCchHHHh-hccc-----cCCCeE Confidence 22 1222 4456665444321 1223899999999999999985432 11110 0000 001111 Q ss_pred EEeeeeeecceEEEEEEEecCcc-CceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEe-cceeeecCCCCCC Q lcl|Aclame:pro 226 YIAKYYEVRKESVDVISYRHPIT-GEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVD-GDGFLEKPRRIPG 303 (708) Q Consensus 226 ~v~e~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~-~~~il~~~~~~p~ 303 (708) .| ++...+.. ++.+... .+... .+.|+.-. |..+-...+.+++ T Consensus 204 ~v-------------~~~v~pr~~~~~~~~~--------------------~~~~~--~~s~~~e~~g~~v~~~~~e~g~ 248 (542) T protein:vir:78 204 GV-------------AQGKGGRNDAEVFTCC--------------------KLVDG--QHRWHQECDGKEIKGSRSSSPL 248 (542) T ss_pred EE-------------EEEeecccCCcccccc--------------------ccCCC--eEEEEEEecccccccccccccc Confidence 11 11111110 0000000 00111 22233322 2322122356788 Q ss_pred CCcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceee Q lcl|Aclame:pro 304 EHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLP 383 (708) Q Consensus 304 ~~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~ 383 (708) ..+||+++.... .+|..+|.|.+....+-.+.+|.+....+.......+++++++.+.+-...+.. ...++.++. T Consensus 249 ~~~P~i~~Rw~~--~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~pp~lv~~~g~~~~~~~~---~~~~g~iv~ 323 (542) T protein:vir:78 249 KHSPWLPLRFNV--VDGESYGRGRVEEFFGDLSSLDALTRSLIEGSAAAAKVVFMVSPSATTKPQSLA---RAGTGAIIQ 323 (542) T ss_pred ccCCceeeeeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccchhhcc---cCCCceeec Confidence 999998765443 688899999999999999999999999999999999999999876654332211 112222221 Q ss_pred ecccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccchhHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 384 LREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMASFIYLDN 463 (708) Q Consensus 384 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n~sg~ai~~~q~q~~~~~~~~~dn 463 (708) .. .+.+. +......+. -....+.++...+.|....-+.. .. .....|++-|..+.+.....+...+.+ T Consensus 324 g~-----~~~v~----~~~~~~~~~-~~~~~~~i~~~~~rI~~aFl~~~-~~-d~~rvTAtEV~~r~~E~~~~LG~v~~r 391 (542) T protein:vir:78 324 GR-----AEDVS----VVQANKGAD-FRTVQEMIRDLSQRISDAFLILN-VR-QSERTTATEVREVQMELDRQLSGIYGS 391 (542) T ss_pred CC-----cccee----eeecccccc-hhHHHHHHHHHHHHHHHHhcccc-cC-CcccccHHHHHHHHHHHHHHhhHHHHH Confidence 11 11111 111111222 23345666777777766654321 11 122358888888888888888888888 Q ss_pred HH-HHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHH Q lcl|Aclame:pro 464 MA-KSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDA 542 (708) Q Consensus 464 ~~-~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~ 542 (708) |. .+..-+.+..+.++.+.--=+. + | . +-+.+.+..+.. ...|.+ T Consensus 392 l~~E~L~Pli~R~~~il~r~g~lP~---------------~------p--~----------~lv~~~~~s~La-~~~r~~ 437 (542) T protein:vir:78 392 LTVELLTPYLNRKLHLMQRSKQLPS---------------L------P--K----------GLVMPTVVAGLG-GVGRGE 437 (542) T ss_pred HHHHHHHHHHHHHHHHHHhcCCCCC---------------C------c--h----------hceeeeeechHH-HHHHHH Confidence 75 5555555555555544221000 0 0 0 012444444443 455777 Q ss_pred HHHHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhhh--hcccCcchHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 543 TVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLIS--GIAKPRNEKEQQIVQQAQMAAQSQPNPEM 620 (708) Q Consensus 543 ~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~--~~~~~~~~~~~q~~~~~qq~qq~~~~~~~ 620 (708) ..+.|.++++.+++..+. +.+.... +.++++..+-...... ..... +++.++.++++|++ T Consensus 438 ~~~~l~~~~~~i~~~~~p-----~~l~~~i---d~d~~~~~~a~~~Gvp~~~i~~s--~e~~~~~~~q~q~~-------- 499 (542) T protein:vir:78 438 DRAALIEFMQTVGQAMGP-----EALQQFI---DPTEFLKRLAAASGIDTLNLVKS--PETMANEAQQAQQQ-------- 499 (542) T ss_pred HHHHHHHHHHHHHHhcCC-----hhHHhcC---CHHHHHHHHHHHcCCCHhhccCC--HHHHHHHHHHHHHH-------- Confidence 778888777766442111 1122222 3344444443332211 11111 11111100000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhcCCC Q lcl|Aclame:pro 621 VLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVMEAIRLLKDVAESQQQQFQSPPQ 700 (708) Q Consensus 621 ~~aq~~~~~~qae~~k~~~~~~~~q~e~~~~~~~~~~~~a~~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 700 (708) +.++.++ . + +.. .+.... ....+.+..++ -+..++.|+ T Consensus 500 --------~~~~al~----~---------~----a~~-~a~~~~--------------~~~~~~~~~a~--~~~~~~~~~ 537 (542) T protein:vir:78 500 --------QMTASLM----G---------Q----AGQ-LAKSPI--------------GEKMMQQINAP--GQEAPAGPQ 537 (542) T ss_pred --------HHHHHHH----H---------h----hhh-cccccc--------------ccchhhhcCCC--CcCCCCCCc Confidence 0000000 0 0 000 000000 00000001111 123344566 Q ss_pred CCCCC Q lcl|Aclame:pro 701 SPADL 705 (708) Q Consensus 701 ~~~e~ 705 (708) +..++ T Consensus 538 ~~~~~ 542 (542) T protein:vir:78 538 TGEDL 542 (542) T ss_pred ccccC Confidence 66777 No 107 >protein:vir:78942 Length: 510 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522822;genbank:gi:158345057;genbank:GeneID:5687432 Probab=99.53 E-value=2.6e-12 Score=84.10 Aligned_cols=489 Identities=12% Similarity=0.048 Sum_probs=224.2 Q ss_pred hHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHHHHHhc-- Q lcl|Aclame:pro 5 LEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEYRN-- 82 (708) Q Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~g~~~~-- 82 (708) ++.++.....+++ .+.|...|+++.+|.. . .=... +... .++.-.+| .-+.-...++.+.+.... T Consensus 1 mk~~~~~~~~~lk-----r~~~e~~w~e~a~~tl-P-~~~~~-~~~~---~~~~~~~~--~dstg~~a~~~LAa~l~~~l 67 (510) T protein:vir:78 1 MKSTAAMLWEKLR-----DGSVEQRAIEFAKTTL-P-YLMVD-PMSG---SRGVVEHD--FQSAGALLVNNLAAKLARSL 67 (510) T ss_pred ChhHHHHHHHHHh-----ccchHHHHHHHHHhhc-c-ccccC-CCCc---ccccccCc--ccchHHHHHHHHHHHHHHhh Confidence 4555555555554 3346677777665422 1 00000 0000 00000122 233444455555443333 Q ss_pred ---CcceeEEecCCCc------chH---HHHHH---HHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCC Q lcl|Aclame:pro 83 ---NRITVKFRPGDRE------ASE---ELANK---LNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYD 147 (708) Q Consensus 83 ---nr~~~~v~pr~~~------~d~---~~A~~---l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d 147 (708) +++=+++.+.+.. .+. ++.+. ++..+......|++..+...++.+.+..|.+.+.+. .+ T Consensus 68 tpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~------~~ 141 (510) T protein:vir:78 68 FPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRN------SD 141 (510) T ss_pred cCCCCcccccCCChHHhhhcccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEEe------CC Confidence 2333334333210 011 12222 334555566789999999999999998898765431 11 Q ss_pred CCCCCcceeeEEeecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEE Q lcl|Aclame:pro 148 PMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYI 227 (708) Q Consensus 148 ~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v 227 (708) + ..++.. |+.++++..+.. . ...-++++..++..++.+.||....... ... ...+.+ T Consensus 142 ~----~~~~~~----pl~~y~v~~d~~---G-~vd~i~rr~~~t~~~l~~~~~~~~~~~~----~~~-----~~~~~v-- 198 (510) T protein:vir:78 142 E----ATVVAW----SLRSYAVRRDAT---G-RWMDIVLKQRYKSKDLDDVYKQDLMRAG----RNL-----SGSGSV-- 198 (510) T ss_pred C----CeEEEE----EcceeEEeeCCC---c-CeeEEEeeeeccHHHHHHHhhHHhhhhh----hcc-----CCCceE-- Confidence 1 122222 455666544332 1 2223888999999999999986432111 000 011111 Q ss_pred eeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEE-EecceeeecCCCCCCCCc Q lcl|Aclame:pro 228 AKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSV-VDGDGFLEKPRRIPGEHI 306 (708) Q Consensus 228 ~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~-~~~~~il~~~~~~p~~~~ 306 (708) .++.+.++..+. .+..+.+++ +.|..++ ..+.||+.++ T Consensus 199 -----------~v~~~V~~~~~~-----------------------------~~~~~sv~~e~dg~~i~-~~~~~~~~e~ 237 (510) T protein:vir:78 199 -----------DLYTHVQRRKGT-----------------------------AMDYAEMYHEIDGVRVG-ETGRWPIHLC 237 (510) T ss_pred -----------EEEEEEEeecCC-----------------------------CCcEEEEEEEecCeeec-cccccccccC Confidence 122222111000 000111222 2344443 3467889999 Q ss_pred ceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeecc Q lcl|Aclame:pro 307 PLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLRE 386 (708) Q Consensus 307 p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~ 386 (708) ||+|+-... .+|..+|.|.+....+--+.+|++....+.......++.++++.+.+-...... ...++.++... T Consensus 238 P~~~~Rw~~--~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~p~g~~~~~~l~---~~~~g~~v~g~- 311 (510) T protein:vir:78 238 PYIVPTWNL--APGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQ---DAEMGDYVPGG- 311 (510) T ss_pred Ceeeeeeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccCCccccchhhhc---cCCCceeecCC- Confidence 998765443 588899999999999999999999999999999999999999876553322111 11111221110 Q ss_pred cccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccchhHHHHHHHHHHHHHHHHHHHHHHH- Q lcl|Aclame:pro 387 VRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMASFIYLDNMA- 465 (708) Q Consensus 387 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n~sg~ai~~~q~q~~~~~~~~~dn~~- 465 (708) .+.+ .+.+ .....--+...+.++...+.|....=+. ...-.....|++-|..+.+.....+...+.+|. T Consensus 312 ----~~~v----~~~~-~~~~~d~~~~~~~i~~~~~rI~~aF~~~-l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~ 381 (510) T protein:vir:78 312 ----AEAV----RAYE-RGDYNKMAAIQQSLQAVVVRLNQAFMYG-ANQRDAERVTAEEVRITAEEAENTLGGTYSLLAE 381 (510) T ss_pred ----cccc----cccc-cCcccchHHHHHHHHHHHHHHHHHHhhc-cccCCCCCcCHHHHHHHHHHHHHHhhHHHHHHHH Confidence 1111 1111 1111112333456666666666654221 111112235888888888888888888777776 Q ss_pred HHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHH Q lcl|Aclame:pro 466 KSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVS 545 (708) Q Consensus 466 ~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~ 545 (708) .+..-+.+..+.++.... ++-+. +..+ .-.++ ++ -++-.|.+..+ T Consensus 382 E~l~Pli~r~~~il~r~g----l~p~p-----------------~~~~-----------~~~~v--~~-is~Laraq~~~ 426 (510) T protein:vir:78 382 NLQSPLAYVCLSEVDDAL----LQGLI-----------------TKQH-----------KPAIE--TG-LPALSRSAAVQ 426 (510) T ss_pred HHHHHHHHHHHHHHHhcc----CCCCC-----------------cccc-----------cceee--ec-ccHHHHHHHHH Confidence 455555555555544321 11100 0000 01111 11 23444666666 Q ss_pred HHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhh--hhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 546 VLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLL--ISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLA 623 (708) Q Consensus 546 ~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~--~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~a 623 (708) .+..+++.+....+. ..+.+ .. +.++++..+..... +...... +++.++.++++++++++++.++ T Consensus 427 ~l~~~~q~l~~~~~~-~q~~~----~i---d~d~~~~~~a~~~Gv~p~~ivrs--~eev~a~~~~~~~q~~~~~~~~--- 493 (510) T protein:vir:78 427 SMLNASQVIAGLAPI-AQLDP----RI---SLPKMMDTIWAAFSVDTSQFYKS--ADELQAEAEEQRRQAAQAQAAQ--- 493 (510) T ss_pred HHHHHHHHHHHhcCh-hhhhh----cC---CHHHHHHHHHHHhCCChhhhcCC--HHHHHHHHHHHHHHHHHHHHHH--- Confidence 666555554432221 11111 11 34445544443332 1222222 2222111111111110000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 624 QAQMVAAQAEAQKATNETAQTQIKA 648 (708) Q Consensus 624 q~~~~~~qae~~k~~~~~~~~q~e~ 648 (708) +....++ ++...+..-+ T Consensus 494 --~a~~~~~------~~~~~~~~g~ 510 (510) T protein:vir:78 494 --ETLLEGA------SDMTNALAGV 510 (510) T ss_pred --HHHHHhh------hhhcccCCCC Confidence 0000000 0111111111 No 108 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=99.52 E-value=1.3e-12 Score=85.70 Aligned_cols=449 Identities=12% Similarity=0.014 Sum_probs=193.0 Q ss_pred CCc-chHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCC---ceeecchHHHHHHH Q lcl|Aclame:pro 1 MAE-TLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYP---KFEINKVATELNRI 76 (708) Q Consensus 1 ma~-~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp---~~~~N~i~~~i~~i 76 (708) |-- .+++++.++...+.. + +. +-++...||+|+|.-...-.... .+.+. .++.|..+.+|+.. T Consensus 1 ~~~~t~~~~~~~l~~~~~~----~---~~--r~~~l~~Yy~g~~~i~~~~~~~~----~~~~~~~~k~~~n~~~~ivd~~ 67 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDD----G---MS--RVRLLARYSNGDAPLPELTRNTS----AAWRSFQREARTNWGLMVRDSV 67 (456) T ss_pred CCCCCHHHHHHHHHHHHHH----H---HH--HHHHHHHHHhcCCCchhcCcccC----hhhhhhhhhhhcchHHHHHHHH Confidence 543 444555555443322 1 11 11233468999884211100000 01111 36789999999999 Q ss_pred HHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCccee Q lcl|Aclame:pro 77 IAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIA 156 (708) Q Consensus 77 ~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~ 156 (708) +|+...+...+. .. +|.+..+. +..+++.|+++...+.+..++++.|++|.-+..+ ..+.++ T Consensus 68 ~~~l~~~~~~~~---~~--~d~~~~~~----~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d---------~~g~~~ 129 (456) T protein:vir:10 68 ADRIIPNGITVG---GS--ADSDLALR----ARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRR---------DDGTAT 129 (456) T ss_pred HhhhccCCeecC---CC--CCcchHHH----HHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeC---------CCCceE Confidence 999877755432 21 22232222 4445678999999999999999999998766432 123344 Q ss_pred eEEeecchhh-eecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeeeecc Q lcl|Aclame:pro 157 IEPIYDPSRS-VWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVRK 235 (708) Q Consensus 157 i~~v~~~~~~-v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~~ 235 (708) +..+ +|... +.|||.... ...++++ .|-+.+ ........|+...- T Consensus 130 i~~~-~p~~~~~i~d~~~~~----~~~~~i~-~~~~~d----------------------------~~~~~~~~~~~~~~ 175 (456) T protein:vir:10 130 ITAD-SPETMVVSVDPLQPW----RIRAAMR-WWRDLD----------------------------AESDFAIVWSGDGW 175 (456) T ss_pred EEEE-ccceeEEEEcCCCCc----ceEEEEE-EEEecC----------------------------CceeEEEEEeccce Confidence 4433 23222 347764432 1122221 111100 00001111111100 Q ss_pred eEE-EEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEe Q lcl|Aclame:pro 236 ESV-DVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGK 314 (708) Q Consensus 236 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~ 314 (708) ... +.++..+..+..... +..+.....+..+..++..|++|+ T Consensus 176 ~~~~~~~~~~~~~~~~~~~----------------------------------~~~~~~~~~~~~~~~~~~~pvv~~--- 218 (456) T protein:vir:10 176 QKFARPCFVQSSSRRRLVT----------------------------------RISDSWVPVGDAVVTGSPPPVVVY--- 218 (456) T ss_pred eEEEEEEEEeecccceeee----------------------------------ecCCceeeccccCCCCCceeEEEe--- Confidence 000 000000000000000 011111111112222233333332 Q ss_pred eeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeecccccccccc Q lcl|Aclame:pro 315 RWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKSGNI 394 (708) Q Consensus 315 ~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 394 (708) . + ..+-|-+..+++.++.+|+..|.++..+...+.+...+- |...+.. ..++....-.....++......-. T Consensus 219 ~----N-~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~-G~~~~~~-~~d~~g~~~~~~~~~~~~~~~~~~- 290 (456) T protein:vir:10 219 Q----N-PDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALK-STEHGLP-NVDENGNAIDYASIFEAAPGALWE- 290 (456) T ss_pred c----C-CCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhh-ccCcccc-cccccccccchhhhhhhhcccccc- Confidence 1 1 235588999999999999999998776655544333221 1100000 000000000000000000000000 Q ss_pred cccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHccc-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 395 IAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM-PSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAGE 473 (708) Q Consensus 395 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~-~~n~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~~~ 473 (708) .+....+...+... ...+...+......+-.+||+++...|. .+|.||.||......-........+.|..+++++.+ T Consensus 291 ~~~~~~~~q~~~~~-~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~N~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~r 369 (456) T protein:vir:10 291 LPPGVDIWESQAND-FTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILV 369 (456) T ss_pred CCCCcceEEecccC-hhHHHHHHHHHHHHHHhccCCChHHhcccccChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 01111233333222 3456667777777888888999988875 568899999998888888888888888889988888 Q ss_pred HHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHHHh Q lcl|Aclame:pro 474 VWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSS 553 (708) Q Consensus 474 ~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq~ 553 (708) +++.+- |. ... +++.|.=.+..+....+..+.++.|.+. T Consensus 370 l~~~~~-------------g~---~~~-------------------------~~~~v~w~~~~~~~~~~~ada~~kl~~~ 408 (456) T protein:vir:10 370 KALQIE-------------GE---SVE-------------------------DTVDVSFESPDRVTLGEKYSAASLAKAA 408 (456) T ss_pred HHHHhc-------------CC---Ccc-------------------------cceeEEecCCCCcCHHHHHHHHHHHHHc Confidence 876431 11 000 0011110111111234455666665543 Q ss_pred ccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 554 MLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAE 633 (708) Q Consensus 554 ~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qae 633 (708) +-+. . ..+.++..+. .+++ +++ .++....+... .+..-.+..+-+ T Consensus 409 gi~~----~---~~~~~~lg~~-~~~i-~~~------------------e~er~~~e~~~--------~~~~~~~~~~~~ 453 (456) T protein:vir:10 409 GESW----A---SIRRNILNYN-ADQI-KQD------------------DLDRAREQITL--------FAGNPVQRPQED 453 (456) T ss_pred CCCh----H---HHHHhhCCCC-HHHH-HHH------------------HHHHHHHHHHH--------HhhhhhhcCCCC Confidence 2111 1 1112221111 0101 000 00000000000 000000000101 Q ss_pred HHH Q lcl|Aclame:pro 634 AQK 636 (708) Q Consensus 634 ~~k 636 (708) .-+ T Consensus 454 ~~~ 456 (456) T protein:vir:10 454 GSR 456 (456) T ss_pred CCC Confidence 111 No 109 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=99.52 E-value=1.3e-12 Score=85.70 Aligned_cols=449 Identities=12% Similarity=0.014 Sum_probs=193.0 Q ss_pred CCc-chHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCC---ceeecchHHHHHHH Q lcl|Aclame:pro 1 MAE-TLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYP---KFEINKVATELNRI 76 (708) Q Consensus 1 ma~-~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp---~~~~N~i~~~i~~i 76 (708) |-- .+++++.++...+.. + +. +-++...||+|+|.-...-.... .+.+. .++.|..+.+|+.. T Consensus 1 ~~~~t~~~~~~~l~~~~~~----~---~~--r~~~l~~Yy~g~~~i~~~~~~~~----~~~~~~~~k~~~n~~~~ivd~~ 67 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDD----G---MS--RVRLLARYSNGDAPLPELTRNTS----AAWRSFQREARTNWGLMVRDSV 67 (456) T ss_pred CCCCCHHHHHHHHHHHHHH----H---HH--HHHHHHHHHhcCCCchhcCcccC----hhhhhhhhhhhcchHHHHHHHH Confidence 543 444555555443322 1 11 11233468999884211100000 01111 36789999999999 Q ss_pred HHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCccee Q lcl|Aclame:pro 77 IAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIA 156 (708) Q Consensus 77 ~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~ 156 (708) +|+...+...+. .. +|.+..+. +..+++.|+++...+.+..++++.|++|.-+..+ ..+.++ T Consensus 68 ~~~l~~~~~~~~---~~--~d~~~~~~----~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d---------~~g~~~ 129 (456) T protein:vir:10 68 ADRIIPNGITVG---GS--ADSDLALR----ARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRR---------DDGTAT 129 (456) T ss_pred HhhhccCCeecC---CC--CCcchHHH----HHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeC---------CCCceE Confidence 999877755432 21 22232222 4445678999999999999999999998766432 123344 Q ss_pred eEEeecchhh-eecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeeeecc Q lcl|Aclame:pro 157 IEPIYDPSRS-VWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVRK 235 (708) Q Consensus 157 i~~v~~~~~~-v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~~ 235 (708) +..+ +|... +.|||.... ...++++ .|-+.+ ........|+...- T Consensus 130 i~~~-~p~~~~~i~d~~~~~----~~~~~i~-~~~~~d----------------------------~~~~~~~~~~~~~~ 175 (456) T protein:vir:10 130 ITAD-SPETMVVSVDPLQPW----RIRAAMR-WWRDLD----------------------------AESDFAIVWSGDGW 175 (456) T ss_pred EEEE-ccceeEEEEcCCCCc----ceEEEEE-EEEecC----------------------------CceeEEEEEeccce Confidence 4433 23222 347764432 1122221 111100 00001111111100 Q ss_pred eEE-EEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEe Q lcl|Aclame:pro 236 ESV-DVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGK 314 (708) Q Consensus 236 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~ 314 (708) ... +.++..+..+..... +..+.....+..+..++..|++|+ T Consensus 176 ~~~~~~~~~~~~~~~~~~~----------------------------------~~~~~~~~~~~~~~~~~~~pvv~~--- 218 (456) T protein:vir:10 176 QKFARPCFVQSSSRRRLVT----------------------------------RISDSWVPVGDAVVTGSPPPVVVY--- 218 (456) T ss_pred eEEEEEEEEeecccceeee----------------------------------ecCCceeeccccCCCCCceeEEEe--- Confidence 000 000000000000000 011111111112222233333332 Q ss_pred eeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeecccccccccc Q lcl|Aclame:pro 315 RWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKSGNI 394 (708) Q Consensus 315 ~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 394 (708) . + ..+-|-+..+++.++.+|+..|.++..+...+.+...+- |...+.. ..++....-.....++......-. T Consensus 219 ~----N-~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~-G~~~~~~-~~d~~g~~~~~~~~~~~~~~~~~~- 290 (456) T protein:vir:10 219 Q----N-PDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALK-STEHGLP-NVDENGNAIDYASIFEAAPGALWE- 290 (456) T ss_pred c----C-CCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhh-ccCcccc-cccccccccchhhhhhhhcccccc- Confidence 1 1 235588999999999999999998776655544333221 1100000 000000000000000000000000 Q ss_pred cccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHccc-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 395 IAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM-PSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAGE 473 (708) Q Consensus 395 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~-~~n~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~~~ 473 (708) .+....+...+... ...+...+......+-.+||+++...|. .+|.||.||......-........+.|..+++++.+ T Consensus 291 ~~~~~~~~q~~~~~-~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~N~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~r 369 (456) T protein:vir:10 291 LPPGVDIWESQAND-FTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILV 369 (456) T ss_pred CCCCcceEEecccC-hhHHHHHHHHHHHHHHhccCCChHHhcccccChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 01111233333222 3456667777777888888999988875 568899999998888888888888888889988888 Q ss_pred HHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHHHh Q lcl|Aclame:pro 474 VWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSS 553 (708) Q Consensus 474 ~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq~ 553 (708) +++.+- |. ... +++.|.=.+..+....+..+.++.|.+. T Consensus 370 l~~~~~-------------g~---~~~-------------------------~~~~v~w~~~~~~~~~~~ada~~kl~~~ 408 (456) T protein:vir:10 370 KALQIE-------------GE---SVE-------------------------DTVDVSFESPDRVTLGEKYSAASLAKAA 408 (456) T ss_pred HHHHhc-------------CC---Ccc-------------------------cceeEEecCCCCcCHHHHHHHHHHHHHc Confidence 876431 11 000 0011110111111234455666665543 Q ss_pred ccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 554 MLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAE 633 (708) Q Consensus 554 ~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qae 633 (708) +-+. . ..+.++..+. .+++ +++ .++....+... .+..-.+..+-+ T Consensus 409 gi~~----~---~~~~~~lg~~-~~~i-~~~------------------e~er~~~e~~~--------~~~~~~~~~~~~ 453 (456) T protein:vir:10 409 GESW----A---SIRRNILNYN-ADQI-KQD------------------DLDRAREQITL--------FAGNPVQRPQED 453 (456) T ss_pred CCCh----H---HHHHhhCCCC-HHHH-HHH------------------HHHHHHHHHHH--------HhhhhhhcCCCC Confidence 2111 1 1112221111 0101 000 00000000000 000000000101 Q ss_pred HHH Q lcl|Aclame:pro 634 AQK 636 (708) Q Consensus 634 ~~k 636 (708) .-+ T Consensus 454 ~~~ 456 (456) T protein:vir:10 454 GSR 456 (456) T ss_pred CCC Confidence 111 No 110 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=99.52 E-value=1.2e-13 Score=91.44 Aligned_cols=466 Identities=12% Similarity=0.009 Sum_probs=194.6 Q ss_pred CCc--------chHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHH Q lcl|Aclame:pro 1 MAE--------TLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATE 72 (708) Q Consensus 1 ma~--------~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~ 72 (708) |+. -.+++++.+...+.. +.. +- ++.-.||.|+|.-.......... . .+--++.|..+-+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~l~~~~~~----~~~---rl--~~l~~Yy~G~~~i~~~~~~~~~~--~-~~~~~~~n~~~~i 68 (484) T protein:vir:77 1 MTSPLQKQENVDPEKAREEMLNLFTE----RTQ---DL--GDNTAYYESERRPDAVGVTVPQQ--M-QKLLAHVGYPRLY 68 (484) T ss_pred CCCcccccCCCCHHHHHHHHHHHHHH----HHH---HH--HHHHHHHhccccchhcccccchh--H-HhhhhhcCcHHHH Confidence 543 123334444444432 211 11 12235899988532110000000 0 0112467999999 Q ss_pred HHHHHHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCC Q lcl|Aclame:pro 73 LNRIIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDR 152 (708) Q Consensus 73 i~~i~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~ 152 (708) |+..++...-+- +. .| ++.+. ...+..+++.|+++.....+..++++.|++|+-|..+.... ...... T Consensus 69 vd~~~~~l~~~g--~~-~~----~~~~~----~~~l~~i~~~N~~d~~~~~~~~~a~~~G~a~~~v~~~~~~~-~~~~~~ 136 (484) T protein:vir:77 69 IDAIAARQELEG--FR-LG----GADKA----DEQLWDWWQANDLDIESTLGHTDSLVHGRSYITISKPDPNI-DPGVDP 136 (484) T ss_pred HHHHHhhhccCc--ee-cC----Ccchh----HHHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEecCCCCc-cccccc Confidence 999888664332 11 12 12222 23455667889999999999999999999998887653221 111222 Q ss_pred cceeeEEeecchhhe--ecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeee Q lcl|Aclame:pro 153 QRIAIEPIYDPSRSV--WFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKY 230 (708) Q Consensus 153 ~~i~i~~v~~~~~~v--~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~ 230 (708) ...+|..+ ++..+ +|||..++ . ..+++.+.+. +...+..... T Consensus 137 ~~~~i~~~--~p~~~~~~~D~~~~~-----~-~~a~~~~~~~----------------------------~~~~~~~~~~ 180 (484) T protein:vir:77 137 EVPIIRVE--PPTNLYAQIDPRTRQ-----V-MRAIRAIEDE----------------------------EGNEVIGATL 180 (484) T ss_pred ccceEEEe--ccceeEEEecCCCCc-----e-EEEEEEEEee----------------------------cCCcEEEEEE Confidence 23334322 23344 46764321 1 1122111110 0001111112 Q ss_pred eeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceee Q lcl|Aclame:pro 231 YEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIP 310 (708) Q Consensus 231 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p 310 (708) |.... +++|.. .+|...+.+..+.+++.+|+|| T Consensus 181 y~~~~----~~~~~~-------------------------------------------~~~~~~~~~~~~~~~g~vPvv~ 213 (484) T protein:vir:77 181 YLPNN----TVIWNR-------------------------------------------EDGQWVQVANVAHNLEMVPVIP 213 (484) T ss_pred EecCe----EEEEEe-------------------------------------------cCCceEeeccccCCCCCcceEE Confidence 21110 111110 0111112233455667788888 Q ss_pred EEEeeeccCCcccccchH-HhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHh----hcccCCceeeec Q lcl|Aclame:pro 311 VYGKRWFIDDIERVEGHI-AKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEA----RNKKRPAFLPLR 385 (708) Q Consensus 311 ~~~~~~~~d~~~~~~G~v-r~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~----~~~~~~~~~~~~ 385 (708) |.-.+ ..+...|.|-+ +.+++.++.+|+..|.+...+...+.++.++- |...+.....+. ......+.++.. T Consensus 214 f~N~~--~~~~~~G~s~i~~~v~~L~Da~~~~~s~~~~~~~~~a~p~~~i~-G~~~~~~~~~~~~~~~~~~~~~~~~~~~ 290 (484) T protein:vir:77 214 IPNRT--RLSDLYGTTEITPELRSVTDAAARTLMLMQATAELMGVPQRLLF-GVKGEELGVDPETGQTLFDAYLARILAF 290 (484) T ss_pred ecccc--ccCccCCcccchHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHh-CCCcchhcccccccchhhhhhhhhhccc Confidence 75322 23334455555 46899999999999999988876666554432 211100000000 000001111110 Q ss_pred ccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHccc-ccc-hhHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 386 EVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM-PSN-IAQETVNNLMNRADMASFIYLDN 463 (708) Q Consensus 386 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~-~~n-~sg~ai~~~q~q~~~~~~~~~dn 463 (708) .. +. ..+...+... ...+...+......+-.++++++...|. ..| .||.|+......-.........- T Consensus 291 ~~--------~~-~~~~q~~~~~-~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~k~~~ 360 (484) T protein:vir:77 291 ED--------HE-SKAQQFSAAE-LRNFVDALDALDRKAAAYTGLPPYYLSFSSENPASAEAIRSSESRLVKTVERKNKI 360 (484) T ss_pred CC--------CC-ceeEeecCCC-hHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHH Confidence 00 00 1222222221 1334445555555555567788887774 344 69999998777666666666677 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHH Q lcl|Aclame:pro 464 MAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDAT 543 (708) Q Consensus 464 ~~~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~ 543 (708) |..+++++.++++.+. .. .+....+ ++|.|.=.+..+....+. T Consensus 361 f~~~l~~~~~l~~~~~----~~--------~~~~~~~-------------------------~~i~v~w~~~~~~s~~~~ 403 (484) T protein:vir:77 361 FGGAWEQAMRVAYKVM----NG--------GDIPPEY-------------------------YRMESIWRDPSTPTYAAK 403 (484) T ss_pred HHHHHHHHHHHHHHHh----CC--------CCccccc-------------------------ccceEEecCCCCCCHHHH Confidence 7777777777665542 10 0000000 112222122222224556 Q ss_pred HHHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 544 VSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLA 623 (708) Q Consensus 544 ~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~a 623 (708) .+.+..|.+.+....+ ..++++++.+- +.-.+.++.... ++..+.+ +....+.. T Consensus 404 ad~~~kl~~~g~gi~s-----~et~~~~l~~~--~~~~~e~~~~~~----------------ee~~~~~---~~~~~~~~ 457 (484) T protein:vir:77 404 ADAATKLYNNGQGVIP-----KERARIDMGYS--ITEREEMRKWDE----------------EEQAQGL---GLMGTMFG 457 (484) T ss_pred HHHHHHHHhccCCCCC-----HHHHHhcCCCC--hhHHHHHHHHHH----------------HHHHHHH---HHHhhhcc Confidence 6677776654322111 12233333221 111111111000 0000000 00000000 Q ss_pred HHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 624 QAQMVAAQAEAQKAT-NETAQTQIKAFTAQQDAMESQAN 661 (708) Q Consensus 624 q~~~~~~qae~~k~~-~~~~~~q~e~~~~~~~~~~~~a~ 661 (708) ...+.. .... .+....+.+.. ..++- T Consensus 458 ----~~~~~~-~~~~~~~~~~~~~~~~-------~~~~~ 484 (484) T protein:vir:77 458 ----TDPSGG-GNPDNPETPEPQPNPA-------EEAAA 484 (484) T ss_pred ----ccccCC-CCCCCCCcccccCCCc-------cccCC Confidence 000000 0000 00000000000 00000 No 111 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=99.52 E-value=2.5e-12 Score=84.20 Aligned_cols=443 Identities=11% Similarity=-0.027 Sum_probs=205.3 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCC----HHHHHHhhhhhhhcCCCceeecchHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWE----GATAAGTKLDEQFEKYPKFEINKVATELNRI 76 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~----~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i 76 (708) |-+...+.+.+++..+..-. .+-++...||+|+|=. ...-..++.. -.+.|..+-+|+++ T Consensus 12 l~~~~~~~~~~L~~~~~~~~---------~~~~~~~~Yy~G~~~~~~~~~~~p~~~r~~-------~~v~nw~~~~Vd~~ 75 (474) T protein:vir:81 12 LSNDENALINGLLAQIENLR---------WKNLLRTSYYENKRTIQYVGTLIPPQYFNL-------GLVLGWTGKAVDAL 75 (474) T ss_pred CChhHHHHHHHHHHHHHHHh---------hHHHHHHHHhccCCChhhccccccHHHHHH-------HhhcChHHHHHHHH Confidence 77665555666555444321 2223445689997621 1111122111 13568888888887 Q ss_pred HHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCccee Q lcl|Aclame:pro 77 IAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIA 156 (708) Q Consensus 77 ~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~ 156 (708) .....-+- + +.|.+..++ ..+..+++.|+++...+.+..++++.|++|+-|..+. ++...++ T Consensus 76 a~rl~~~G--f-~~~d~~~~~--------~~l~~iw~~N~ld~~~~~~~~~al~~G~sf~~V~~~~-------d~~~~~~ 137 (474) T protein:vir:81 76 ARRCNLEG--F-VWPDGDLDS--------LGGTEVVDDNHLLSEIDSAIVAAMQHGPAFLINTVGE-------DDEPEAL 137 (474) T ss_pred Hhhhcccc--e-ECCCCCccc--------hHHHHHHHhcChhHHHHHHHHHHHhhCceeEEEecCC-------CCCceeE Confidence 65333221 2 223321111 1246678899999999999999999999998886431 2333444 Q ss_pred eEEeecchhhe--ecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeeeec Q lcl|Aclame:pro 157 IEPIYDPSRSV--WFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVR 234 (708) Q Consensus 157 i~~v~~~~~~v--~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~ 234 (708) |..+ ++.++ +|||....+. ..+.+...+ .+.+ .....+|... T Consensus 138 i~~~--sp~~~~~~~D~~~~~~~-----~al~~~~~~----------------------------~~g~-~~~~~ly~~~ 181 (474) T protein:vir:81 138 IHVK--DASEATGEWNRRRRGLN-----NLLSIIDKD----------------------------KEGK-VLSLALYLDN 181 (474) T ss_pred EEEe--ccceEEEEEeCCCCcce-----eeeEEEEEc----------------------------CCCc-EEEEEEEeCC Confidence 4433 33344 3777433211 111110000 0000 0001111100 Q ss_pred ceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEe Q lcl|Aclame:pro 235 KESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGK 314 (708) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~ 314 (708) .++.+..+.. .+.| ..+..+.|++ .|+|||+-. T Consensus 182 ---------------~~~~~~~~~~-----------------------~~~w--------~~~~~~~~~g-vPvV~~~n~ 214 (474) T protein:vir:81 182 ---------------ETVTAQRDKA-----------------------TLKW--------QVDRDEHVYG-VPAQVLPYK 214 (474) T ss_pred ---------------cEEEEEEcCc-----------------------ccee--------eeccCCCCCC-cceEEeccc Confidence 0111100000 0001 0122344444 688887654 Q ss_pred eeccCCcccccc-hHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccc-h-------HHHHHhhcccCCceeeec Q lcl|Aclame:pro 315 RWFIDDIERVEG-HIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRG-L-------EKHWEARNKKRPAFLPLR 385 (708) Q Consensus 315 ~~~~d~~~~~~G-~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~-~-------~~~~~~~~~~~~~~~~~~ 385 (708) +... ...|.| +.+.+++.|+.+|+.++.++...-..+.++..+- |.... . ...|... .+.++. T Consensus 215 ~~~~--~~~G~s~i~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~-G~~~~~~~d~d~~~~~~~~~~----~~~i~~- 286 (474) T protein:vir:81 215 PAPK--RPFGQSRITKPMMGLQDAGVRELARREGHMDVFSYPEFWLL-GADESALKNADGTIKSVWEAR----LGRIKG- 286 (474) T ss_pred cccc--CcCCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhheee-cCChhhcccccccccchhhhh----HHHHhc- Confidence 4321 122323 4578999999999999999988888877776552 21110 0 0011100 000110 Q ss_pred cccccccccccc-ccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccc--cc-hhHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 386 EVRDKSGNIIAG-ATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMP--SN-IAQETVNNLMNRADMASFIYL 461 (708) Q Consensus 386 ~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~--~n-~sg~ai~~~q~q~~~~~~~~~ 461 (708) ...+..+.+... ...++.++... ...+...+......+-.+||+....+|.. .| +||.||.+....-........ T Consensus 287 ~~~d~d~~~~~~~~~~~~q~~~a~-l~~~~~~l~~~~~~~a~~t~iP~~~lG~~~~~np~SaeAi~a~~~~l~~kae~k~ 365 (474) T protein:vir:81 287 LPDDADADIPQLARADVKQFPAAS-PDAHWSDINGLAKLFAREASLPDTAVAISGLSNPTSAESYDASQYELIAEAEGAV 365 (474) T ss_pred CCCcccccccccccccccccCCCC-hhHHHHHHHHHHHHHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHH Confidence 011111111100 01233333333 23455667777777888899999999842 44 799999999888888888888 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEe-ecccchhHH Q lcl|Aclame:pro 462 DNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVD-VGPSYTARR 540 (708) Q Consensus 462 dn~~~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~-~~~~~~~~r 540 (708) +.|..+.++++++.+.+--.+--+ ++..+ + +.+.|. --|.+.| . T Consensus 366 ~~fg~~l~~~~rla~~i~~~~~~~----~~~~~-----~-------------------------~~~~v~W~d~~~~s-~ 410 (474) T protein:vir:81 366 DDFTPALRKAFIRALAMKNKVAID----EIPDE-----W-------------------------KSIDAKWRDPRYLS-K 410 (474) T ss_pred HHHHHHHHHHHHHHHHHhCCCCcc----ccchh-----h-------------------------ccceeEecCCCccC-H Confidence 888999999999888765332110 00000 0 011111 1123344 3 Q ss_pred HHHHHHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 541 DATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEM 620 (708) Q Consensus 541 ~~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~ 620 (708) .++...+..|.+.+....+- ..+.+++.+. .+++ ++++ .....++....+.. T Consensus 411 a~~aDa~~Kl~~a~~~~~~~-----~~~~~~lg~t-~~~i-~~~~---------------------~~~~~~~~~~~~~~ 462 (474) T protein:vir:81 411 SAQADAGMKQLAAVPWLAET-----EVGLELIGLT-PQQA-RRAM---------------------ADKRRVQGRGTLQA 462 (474) T ss_pred HHHHHHHHHHHhcccCCCcH-----HHHHhhcCCC-HHHH-HHHH---------------------HHHHHHhHHHHHHH Confidence 55677788877765332211 1122222221 0111 1110 00000000000000 Q ss_pred HHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 621 VLAQAQMVAAQAEAQKAT 638 (708) Q Consensus 621 ~~aq~~~~~~qae~~k~~ 638 (708) + .....+...+| T Consensus 463 l------~~~~~~~~~aq 474 (474) T protein:vir:81 463 L------IDRSNNGATAQ 474 (474) T ss_pred H------HhcCCCCCCCC Confidence 0 00011111112 No 112 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=99.51 E-value=1.8e-12 Score=84.91 Aligned_cols=440 Identities=12% Similarity=0.008 Sum_probs=192.3 Q ss_pred CCc-chHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCC---CceeecchHHHHHHH Q lcl|Aclame:pro 1 MAE-TLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKY---PKFEINKVATELNRI 76 (708) Q Consensus 1 ma~-~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~gr---p~~~~N~i~~~i~~i 76 (708) |-- .+++++.++...+.. ... +- ++.-.||.|++= +..+......+.+ ..++.|..+.+|+.. T Consensus 1 ~~~~t~~~~~~~l~~~~~~----~~~---r~--~~l~~Yy~g~~~----i~~~~~~~~~~~~~~~~~~~~n~~~~ivd~~ 67 (456) T protein:vir:79 1 MTASTPAEWLPVLTKRIDD----GMS---RV--RLLARYSNGDAP----LPELTRNTSAAWRSFQREARTNWGLMVRDSV 67 (456) T ss_pred CCCCCHHHHHHHHHHHHHH----HHH---HH--HHHHHHHhccCC----hhhcCcccChhhchhhhhhhcchHHHHHHHH Confidence 443 333455444433221 111 11 222358999761 1000000000111 135679999999999 Q ss_pred HHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCccee Q lcl|Aclame:pro 77 IAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIA 156 (708) Q Consensus 77 ~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~ 156 (708) +|+...+...+.. .+|.+..+. +..+++.|+++...+.+..+++++|++|.-+..+ .++.++ T Consensus 68 ~~~l~~~g~~~~~-----~~d~~~~~~----~~~~~~~n~~d~~~~~~~~~a~~~G~a~~~~~~~---------edg~~~ 129 (456) T protein:vir:79 68 ADRIIPNGITVGG-----SADSDLALR----ARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRR---------DDGTAT 129 (456) T ss_pred HhhhccCCeecCC-----CCCccHHHH----HHHHHHhcChhHHHHHHHHHHhhcCeeEEEEeeC---------CCCceE Confidence 9999888654321 123333333 3445668999999999999999999998766432 122344 Q ss_pred eEEeecchhh-eecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeeeecc Q lcl|Aclame:pro 157 IEPIYDPSRS-VWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVRK 235 (708) Q Consensus 157 i~~v~~~~~~-v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~~ 235 (708) +..+ +|... +.|||.... ....+ .+.+-+.++ .......|+.... T Consensus 130 i~~~-~p~~~~~i~d~~~~~----~~~~~-~~~~~~~d~----------------------------~~~~~~~~~~~~~ 175 (456) T protein:vir:79 130 ITAD-SPETMVVSVDPLQPW----RIRSA-MRWWRDLDA----------------------------ESDFAIVWSGDGW 175 (456) T ss_pred EEEe-ccceeEEEEcCCCCC----ceEEE-EEEEEecCC----------------------------ceeEEEEEcCCce Confidence 4432 23221 346664432 11111 222211100 0000011111000 Q ss_pred eEEEEEE-EecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEe Q lcl|Aclame:pro 236 ESVDVIS-YRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGK 314 (708) Q Consensus 236 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~ 314 (708) ....... .....+..... ...+........+.+++.+|++|+. T Consensus 176 ~~~~~~~~~~~~~~~~~~~----------------------------------~~~~~~~~~~~~~~~~~~~pvv~~~-- 219 (456) T protein:vir:79 176 QKFARPCFVQSSSRRRLVT----------------------------------RISDSWVPVGDAVVTGSPPPVVVYQ-- 219 (456) T ss_pred EEEEEEEEeeccccceeee----------------------------------ccCCceeecccccCCCCceeEEEec-- Confidence 0000000 00000000000 0001111111122333455555431 Q ss_pred eeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhc---------ccCCceeeec Q lcl|Aclame:pro 315 RWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARN---------KKRPAFLPLR 385 (708) Q Consensus 315 ~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~---------~~~~~~~~~~ 385 (708) ...+.|-+..+++.++.+|+..|.+...+...+.+...+- |...+ ....++.. ....+.++.. T Consensus 220 ------N~~~~gd~e~v~~liD~~~~~~s~~~~~~~~~a~~~~~~~-G~~~~-~~~~d~~g~~i~~~~~~~~~~~~~~~~ 291 (456) T protein:vir:79 220 ------NPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALK-SSEHR-LPKVDENGNAIDYASIFEAAPGALWEL 291 (456) T ss_pred ------CCCCCchhhhhHHHHHHHHHHHHHHHHHHHHHhhHHHHHh-cCCcc-cccccccccccchhhhhhhhccccccC Confidence 1235688999999999999999998776665554443331 11100 00000000 0001111110 Q ss_pred ccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHccc-ccchhHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 386 EVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM-PSNIAQETVNNLMNRADMASFIYLDNM 464 (708) Q Consensus 386 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~-~~n~sg~ai~~~q~q~~~~~~~~~dn~ 464 (708) +....+...+... ...+...+......+-..||+++...|. .+|.||.|+......-........+.| T Consensus 292 ----------~~~~~~~q~~~~~-~~~~~~~l~~~i~~i~~~t~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f 360 (456) T protein:vir:79 292 ----------PPGVDIWESQTND-FTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIA 360 (456) T ss_pred ----------CCCcceeeecccC-hHHHHHHHHHHHHHHHhhcCCChhHhcccccCcHHHHHHHHHHHHHHHHHHHHHHH Confidence 1111222222222 2556677888888888899999888774 568999999998888777777778888 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHH Q lcl|Aclame:pro 465 AKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATV 544 (708) Q Consensus 465 ~~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~ 544 (708) ..+++++.++++.+ .+.. +... +.|+-.. +...+ ..+.. T Consensus 361 ~~~l~~~~~l~~~~----~g~~------------~~~~-----------------------i~v~w~~-~~~~s-~~~~a 399 (456) T protein:vir:79 361 KIGLEAILVKALQI----EGES------------VEDT-----------------------VDVSFES-PDRVT-LGEKY 399 (456) T ss_pred HHHHHHHHHHHHHh----cCCC------------cccc-----------------------ceEEeCC-CCCcC-HHHHH Confidence 88888888776543 2210 0000 0111000 11122 34455 Q ss_pred HHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 545 SVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQ 624 (708) Q Consensus 545 ~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq 624 (708) +.++.|.+.+-+. . ..+++.+.+.. +++ ++ .++.....+ ..+. T Consensus 400 da~~kl~~~G~~~----~---~~~~~~lg~~~-~~i-~~------------------~e~~r~~~e----------~~~~ 442 (456) T protein:vir:79 400 SAASLAKAAGESW----A---SIRRNILNYNA-DQI-KQ------------------DDLDRAREQ----------ITLF 442 (456) T ss_pred HHHHHHHhcCCCh----H---HHHHhcCCCCH-HHH-HH------------------HHHHHHHHH----------HHHH Confidence 6666654432111 1 11112111100 000 00 000000000 0000 Q ss_pred HHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 625 AQMVAAQAEAQKATNETAQ 643 (708) Q Consensus 625 ~~~~~~qae~~k~~~~~~~ 643 (708) +... ++..+.++.+ T Consensus 443 ~~~~-----~~~~~~~~~~ 456 (456) T protein:vir:79 443 AGNP-----VQRPQEDGSR 456 (456) T ss_pred hhhH-----hhcCCCCCCC Confidence 0000 0111111111 No 113 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=99.50 E-value=6.7e-13 Score=87.34 Aligned_cols=399 Identities=11% Similarity=0.018 Sum_probs=204.0 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHH----HHHHhhhhhhhcCCCceeecchHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGA----TAAGTKLDEQFEKYPKFEINKVATELNRI 76 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~----~~~~l~~~~q~~grp~~~~N~i~~~i~~i 76 (708) |= .+.+.++...+.. ...+-++...||+|+|.... ....++.+ .+ ++.|..+.+|+++ T Consensus 1 ~~---~~~i~~L~~~~~~---------~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~----~~--~v~nw~~~iVds~ 62 (409) T protein:vir:16 1 MT---EKGIGYLRFKLSV---------HKRRAEMRYEQYAMKHVDRFKGITIPQALSQQ----YR--SILGWCAKGVDSL 62 (409) T ss_pred CC---HHHHHHHHHHHHH---------HhHHHHHHHHHHhccCchhhcchhhhHHHHHH----Hh--hhcChhHHHHHHh Confidence 32 2345555444432 11222344568999986422 22222211 12 3459999999988 Q ss_pred HHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCccee Q lcl|Aclame:pro 77 IAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIA 156 (708) Q Consensus 77 ~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~ 156 (708) .+-..-+- | + .+|.+ +..+++.|+++...+.+..++++.|++|+-|.-+ .++.++ T Consensus 63 a~rl~~~G----f--~--~~d~~--------l~~i~~~N~ld~~~~~~~~~al~yG~sf~~v~~~---------~dg~~~ 117 (409) T protein:vir:16 63 ADRLVFRE----F--E--NDDFT--------VNEIFEENNPDIFFDSTVLSALIASCSFTYISKG---------ENDAVR 117 (409) T ss_pred Hhhccccc----c--c--CcchH--------HHHHHHhcChhHHHHHHHHHHHHhCceeEEEecC---------CCCceE Confidence 66333221 1 1 12322 4567889999999999999999999999887532 123344 Q ss_pred eEEeecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeeeecce Q lcl|Aclame:pro 157 IEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVRKE 236 (708) Q Consensus 157 i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~~~ 236 (708) |..+....-.++|||..+++. ...+.|-+ + .+..... ..+|... T Consensus 118 i~~~sP~~~~~i~D~~~~~~~------~a~~~~~~--------------------------d-~~~~~~~-~~~~~~~-- 161 (409) T protein:vir:16 118 LQVIEATNATGIIDPITGLLT------EGYAVLER--------------------------D-ENNNVVL-EAHFLPD-- 161 (409) T ss_pred EEEEcccceEEEeecccccce------eeeEEEEe--------------------------c-CCCceEE-EEEEecC-- Confidence 443322222235777544321 11111100 0 0000000 0011000 Q ss_pred EEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEeee Q lcl|Aclame:pro 237 SVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRW 316 (708) Q Consensus 237 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~~~ 316 (708) ....+.. +.+. ....|.|++.+|+|||+..+. T Consensus 162 -------------~~~~~~~----------------------------------~~~~-~~~~~~~~g~vPvV~f~n~~~ 193 (409) T protein:vir:16 162 -------------RTDYYYR----------------------------------DSRN-NISIANPTGNPLLVPIIHRPD 193 (409) T ss_pred -------------cEEEEEe----------------------------------cCcc-ccceecCCCCcceEEeccccc Confidence 0000000 0000 011345667889999874432 Q ss_pred ccCCcccccc-hHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccch--HHHHHhhcccCCceeeeccccccccc Q lcl|Aclame:pro 317 FIDDIERVEG-HIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGL--EKHWEARNKKRPAFLPLREVRDKSGN 393 (708) Q Consensus 317 ~~d~~~~~~G-~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~ 393 (708) . +...|.| |.+.+++.|+.+|+.++.++......+.++..+- |.-++. .+.|.. . .+.++.-+. +..|. T Consensus 194 ~--~~~~G~seI~~~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~-G~d~d~~~~~~~~~---~-~~~i~~~~~-d~~g~ 265 (409) T protein:vir:16 194 A--VRPFGRSRITRSGMYWQSNAKRTLERADVTAEFYSFPQKYVT-GLSDDAEPMETWKA---T-VSSMLQFTK-DEDGD 265 (409) T ss_pred c--cccCCccccchhHHHHHHHHHHHHHHHHHHHHHhcChhheeE-ecCCCCCccchhhh---h-hhHhhccCC-CCCCC Confidence 1 2222333 3478999999999999999988877777766552 221110 011111 0 111111110 11111 Q ss_pred ccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccc-cc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 394 IIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMP-SN-IAQETVNNLMNRADMASFIYLDNMAKSLKRA 471 (708) Q Consensus 394 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~-~n-~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~ 471 (708) ...++.++...+ +.+...+......+-.+||+++..+|.. .| +||.||.+....-........+.|..+.+++ T Consensus 266 ----~~~v~q~~~~~l-~~~~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~~~ 340 (409) T protein:vir:16 266 ----KPTLGQFTQPSM-SPFTEQLRTAAAGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNV 340 (409) T ss_pred ----CceEEecCCCCh-hHHHHHHHHHHHHHhhhcCCCHHHcccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 112333333333 3557777777888888899999999964 45 7999999888777777777777888888888 Q ss_pred HHHHHHHHHHhcC-CCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHH Q lcl|Aclame:pro 472 GEVWLSMAREVYG-SEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNV 550 (708) Q Consensus 472 ~~~~l~li~~~y~-~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~l 550 (708) +++.+.+.-.+=. .....++ ...|- |+ .-+++++ ..+..+.+..| T Consensus 341 ~rla~~~~~~~~~~~~~~~~~-----~v~W~------------------~~----------~~~~~~s-~a~~aDa~~Kl 386 (409) T protein:vir:16 341 AYLAACLRDDVPYLREQFSKT-----KPKWE------------------PL----------FEADASM-LSLIGDGAIKL 386 (409) T ss_pred HHHHHHHhcCCCccchhhccc-----eEEec------------------CC----------CCcchhh-HHHHHHHHHHH Confidence 8888776433210 0000000 00111 00 0113333 45677888888 Q ss_pred HHhccccCchhHHHHHHHHhhccchhHH Q lcl|Aclame:pro 551 LSSMLPTDPMRPAIQGIILDNIDGEGLD 578 (708) Q Consensus 551 lq~~~~~~p~~~~~~~~~~~~~d~~~~~ 578 (708) .+.++...+ ....++++-+...+ T Consensus 387 ~~a~~~~~~-----~~v~~~~~g~~~~d 409 (409) T protein:vir:16 387 NQAIPEFIN-----KDTIRDLTGIKGAE 409 (409) T ss_pred Hhhcccccc-----hhHHHHhccCCCCC Confidence 877644322 12233444443333 No 114 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=99.49 E-value=2.1e-12 Score=84.61 Aligned_cols=400 Identities=11% Similarity=-0.019 Sum_probs=197.3 Q ss_pred HHhhHHHHHHHHHHHHHhhcCCCCCC----HHHHHHhhhhhhhcCCCceeecchHHHHHHHHHHHhcCcceeEEecCCCc Q lcl|Aclame:pro 20 YSPQKEVREKCIEATRFARVPGGQWE----GATAAGTKLDEQFEKYPKFEINKVATELNRIIAEYRNNRITVKFRPGDRE 95 (708) Q Consensus 20 ~~~~~~~r~~~~~d~~~~~~~G~Qw~----~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~g~~~~nr~~~~v~pr~~~ 95 (708) .+.. +.+-+....||+|+|=- ......++.. . -++.|..+.+|+++.+-..-+- | +. T Consensus 1 l~~~-----~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~----~--~~v~nw~~~~Vds~a~rl~~~G----f--~~-- 61 (410) T protein:vir:95 1 MNLY-----QSRVNLRYKHYAMQHYEAPTGITIPAHIRAK----Y--QAVLGWAAKGVDSLADRLIFRA----F--AN-- 61 (410) T ss_pred CCcc-----hhhHHHHHHHhcCCCCccccchhccHHHHhH----H--HhhcchhHHHHHHhHhhhcccc----c--cC-- Confidence 1111 12223445689998722 1111222211 1 1345889999998866333221 1 11 Q ss_pred chHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCcceeeEEeecchhheecCCcccc Q lcl|Aclame:pro 96 ASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAIEPIYDPSRSVWFDPDAKK 175 (708) Q Consensus 96 ~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~i~~v~~~~~~v~~Dp~a~~ 175 (708) +|.+ +..+++.|+++...+.+..++++.|++|+-|.-+. ++.++|..+....-.++|||..++ T Consensus 62 ~d~~--------l~~i~~~N~ld~~~~~~~~~al~~G~sf~~v~~~~---------d~~~~i~~~sP~~~~~i~Dp~~~~ 124 (410) T protein:vir:95 62 DDFN--------VTEIFDRNNPDIFFDSAILSALIGSCSFVYISKGE---------DDEVRLQVIESSNATGVIDPITGL 124 (410) T ss_pred CCch--------HHHHHhhcChHHHHHHHHHHHHHhCceeEEEecCC---------CCceEEEEEcccceEEEEeCCCCc Confidence 2222 45678899999999999999999999998885321 223444433211122347774322 Q ss_pred CChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeeeecceEEEEEEEecCccCceeEec Q lcl|Aclame:pro 176 YDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVRKESVDVISYRHPITGEIATYD 255 (708) Q Consensus 176 ~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~ 255 (708) + .+..+.+- .+.+ .......+|... .+..+. T Consensus 125 ~------~~al~~~~---------------------------~~~~-~~~~~~~~~~~~---------------~~~~~~ 155 (410) T protein:vir:95 125 L------VEGYAVLA---------------------------RDDY-NRPTLEAYFEPN---------------ATHFIP 155 (410) T ss_pred e------EEEEEEEE---------------------------ecCC-CeEEEEEEEeCC---------------cEEEEe Confidence 1 11111100 0000 011111111111 011110 Q ss_pred CCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEeeeccCCcccccc-hHHhhhHH Q lcl|Aclame:pro 256 SDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRWFIDDIERVEG-HIAKAMDP 334 (708) Q Consensus 256 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~~~~~d~~~~~~G-~vr~~~d~ 334 (708) .+ +.- ...+.|.+..|+|||+-.+.. +.+.|.| |.+.+++. T Consensus 156 ~~-----------------------------------~~~-~~~~~~~g~vPvV~f~n~~~l--~~~~G~s~I~~~v~~l 197 (410) T protein:vir:95 156 KD-----------------------------------GEP-YSVTNETGIPLLVPVIHRPDA--VRPFGRSRITRAGMYY 197 (410) T ss_pred eC-----------------------------------Ccc-ccccCCCCCcceEEecccccC--CccCCccccchhHHHH Confidence 00 000 012345577888887744321 2223333 55789999 Q ss_pred HHHHHHHHHHHHHHHhhcCCCceeechhhccc-h-HHHHHhhcccCCceeeecccccccccccccccccccccCccchHH Q lcl|Aclame:pro 335 QRLYNLQVSMLADTAAQDPGQIPIVGMEQIRG-L-EKHWEARNKKRPAFLPLREVRDKSGNIIAGATPAGYTQPAVMNQA 412 (708) Q Consensus 335 Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~-~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 412 (708) |+.+|+.++.++......+.++..+- |.-.+ . .+.|. ...+.++.-+.. ..| + ...++.++...+ +. T Consensus 198 ~da~~r~~~~~~~~~e~~a~pqr~i~-G~d~d~~~~~~~~----~~~~~i~~~~~~-~~~---~-~~~v~q~~~~~l-~~ 266 (410) T protein:vir:95 198 QKYAKRTLERADITAEFYSWPQKYIL-GLDPDAEPMEKWK----ATVSSLLTISSS-DKG---V-KPSVGQFTTASM-SP 266 (410) T ss_pred HHHHHHHHHHHHHHHHHhcchhheee-ccCCCCCcCchhh----hhhhhheeccCC-CCC---C-cceEEecCCCCh-HH Confidence 99999999999988888877765552 21110 0 01111 111112211111 111 1 113333444444 34 Q ss_pred HHHHHHHHHHHHHHHhCCChhHcccc-cc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-CcEE Q lcl|Aclame:pro 413 LAALLQQTSADIQEVTGGSQAMQQMP-SN-IAQETVNNLMNRADMASFIYLDNMAKSLKRAGEVWLSMAREVYGS-EREV 489 (708) Q Consensus 413 ~~~l~~~~~~~~~~~tGv~~~~~G~~-~n-~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~~~~~l~li~~~y~~-~r~i 489 (708) +...+......+-.+||+++..+|.. .| +||.||.+....-........+.|..+.++++++.+.+.-.+=.. .... T Consensus 267 ~~~~l~~l~~~~a~~s~lP~~~lg~~~~NpsSa~Al~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~i~~~~~~~~~~~~ 346 (410) T protein:vir:95 267 FTEQLRTAAAGFAGEMGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYVAACLRDEFRYTRSQFV 346 (410) T ss_pred HHHHHHHHHHHHhhhcCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccccc Confidence 56777778888888899999999954 45 799999988887777777778888889999999888775433111 1111 Q ss_pred EEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHHHhccccCchhHHHHHHHH Q lcl|Aclame:pro 490 RIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLPTDPMRPAIQGIIL 569 (708) Q Consensus 490 rI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq~~~~~~p~~~~~~~~~~ 569 (708) ++. ..|-.+. -+++.| ..+..+.+..|.+.++...+ ...++ T Consensus 347 ~~~-----v~W~p~~----------------------------d~~~~s-~a~~aDa~~Kl~~a~~g~~~-----~~~~~ 387 (410) T protein:vir:95 347 RTA-----VKWEPLF----------------------------EADANT-MTMIGDGVVKLNQALPGYIN-----AETIR 387 (410) T ss_pred eee-----EEeeecC----------------------------Ccchhh-HHHHHHHHHHHHHhccCCcc-----HHHHH Confidence 110 0111110 113333 35566777777765432211 12334 Q ss_pred hhccchhHHHHHHHHHhhhhhhhc Q lcl|Aclame:pro 570 DNIDGEGLDDFKEYNRNQLLISGI 593 (708) Q Consensus 570 ~~~d~~~~~ei~e~~~~~~~~~~~ 593 (708) +++.+..-+ +...........+. T Consensus 388 ~~lg~~~~~-~~~~~~~e~~~~g~ 410 (410) T protein:vir:95 388 DLTGIAGDM-SAKPVVSEGGSNGE 410 (410) T ss_pred HhcCCChHH-HHHHHHHHHHhCCC Confidence 444442211 11111100000000 No 115 >protein:vir:3520 Length: 720 # NCBI annotation: P19 # Family: family:all:487 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050980;genbank:gi:9633566;genbank:GeneID:1262313 Probab=99.48 E-value=6.5e-12 Score=81.91 Aligned_cols=650 Identities=9% Similarity=0.006 Sum_probs=254.3 Q ss_pred hHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHH--HHHHHhc Q lcl|Aclame:pro 5 LEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNR--IIAEYRN 82 (708) Q Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~--i~g~~~~ 82 (708) +-+.+++++.++...+++..++..+|+....-++ .... .. -|...+.+.+ -++..++ T Consensus 1 ma~~~~~~l~~~~~~~~~~~~~~~~~r~~~~~d~-----------~f~~----~~------G~QW~~~~~~~~~~~l~~~ 59 (720) T protein:vir:35 1 MAETLQKRHEQIMRKFDRAHSPQEAVREKCLEAT-----------RFAR----VP------GGQWEGATAAGSELGKHFE 59 (720) T ss_pred CchHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHH-----------hhhc----cC------CCCCCHHHHHHHHHHHhhC Confidence 8888888888888877777777777765533222 1110 00 1222333332 1344555 Q ss_pred CcceeEE-------------------ecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEe-ec Q lcl|Aclame:pro 83 NRITVKF-------------------RPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTS-ML 142 (708) Q Consensus 83 nr~~~~v-------------------~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~-~~ 142 (708) ++|-+.+ .++--..+.+.-+.+.+++..+....--......+.-++..+++.+ .++| +. T Consensus 60 ~~P~~~~N~i~~~v~~v~g~~~~nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~-G~G~~~v 138 (720) T protein:vir:35 60 KYPKFEINKISTELNRIISEYRHNRITVKFRPGDKTASEALANKLNGLFRADYEETDGGEACDNAFDDGSTG-GFGCFRL 138 (720) T ss_pred CCCeEEEccHHHHHHHHHhHHHhCCCceEEEcCCCcchHHHHHHHHHHHHHHHHhcCchHHHhHHHHHhhhc-cceeEEe Confidence 6664332 1111112233234556666666555444445555556666655432 1111 11 Q ss_pred cccCCCCCCCcceeeEEeecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCCc--ccccccccccccccCCC Q lcl|Aclame:pro 143 VNEYDPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKP--PTSLDVTSMTSWEYNWF 220 (708) Q Consensus 143 ~~~~d~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~--~~~~d~~~~~~~~~~~~ 220 (708) ..+.+ .+.+.......+. ...|+-|+.+.-+| -..+..+.++.+-.|-..- .+.+...-.......+. T Consensus 139 ~~d~~-~~~d~~~~~~~i~--i~~v~~~~~~v~~D-------p~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~~~~ 208 (720) T protein:vir:35 139 TTNLV-NALDPMDERQRIC--LEPIYDPARSVWFD-------PDAKKYDKSDAEWAFCMYSLSAEKYKAEYNKDPATLMS 208 (720) T ss_pred eeccc-ccCCCCcccceee--EecccCchhheeec-------ccccccChhhhhhhhhhcCCCHHHHHHhCCCccccccc Confidence 10000 0000000000010 00111111111111 1122334444332211000 00000000001111122 Q ss_pred CCceeEEeeeeeecceEEEEEEEecCccCceeEecCC---cccchHHH-hhccchhhhhheee--eeEEEEEEEEeccee Q lcl|Aclame:pro 221 GADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSD---QVEDIEDE-LAIAGFHEVARRSV--KRRRVYVSVVDGDGF 294 (708) Q Consensus 221 ~~~~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~-~~~~~~~~~~~~~~--~~~~v~~~~~~~~~i 294 (708) +...-....|++..++++..||++.+.....+.+.+. +...+... +............+ .++.+. ++.....+ T Consensus 209 ~~~~~~~~d~~~~~~v~i~E~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~-~~~v~~~~ 287 (720) T protein:vir:35 209 GIERSWDYDWYDVDVVYIAKYYEVKKESVDVVSFQNPLTSETVTYDSDQLELVEDELADIGFIEAARRTIK-RRRVYVSV 287 (720) T ss_pred cccccccccccCCCceEEEEeeEEEEEEEEEEEeecCCCCCeeecCCccHHHHHHHHhhhcccccccccee-EEEEEEEe Confidence 2222233457776666666666665555444444332 22222211 11000111111111 122222 22222222 Q ss_pred eecCCCC--CC-CCcceeeEEEeeeccCCcccccchHHh-hhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHH Q lcl|Aclame:pro 295 LEKPRRI--PG-EHIPLIPVYGKRWFIDDIERVEGHIAK-AMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKH 370 (708) Q Consensus 295 l~~~~~~--p~-~~~p~~p~~~~~~~~d~~~~~~G~vr~-~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~ 370 (708) +.+.... |. .-+.++||+.+.-+... ..+...... +++.-+.=.-.=.....++...+..+.+++.+++++++.. T Consensus 288 ~~g~~~l~~~~~~p~~~fP~vP~~g~r~~-~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~a~~~~~~~ 366 (720) T protein:vir:35 288 VDGEGFLEKAQRIPGEHIPLIPVYGKRWF-IDDIERVEGHIAKAMDAQRLYNLQVSMLADSATQDTGSIPIVGKSQIKTL 366 (720) T ss_pred eccchhcccCCCCCCCccceEEEEeeeec-cCCCcccceeeecchhHHHHHHHHHHHHHHHHHcCCccccccCcchHHHH Confidence 2222211 11 12335677755432221 122222223 3445555544455666677777888999999999999888 Q ss_pred HHhhcccCCceeeeccccccccccccccc--ccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccchhHHHHHH Q lcl|Aclame:pro 371 WEARNKKRPAFLPLREVRDKSGNIIAGAT--PAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNN 448 (708) Q Consensus 371 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n~sg~ai~~ 448 (708) ..+..+.+.....+-. .+..+. .+|.. ++..+...+-++-....++........+-.++......-+..|+++-.+ T Consensus 367 ~~~~a~~~~~~~~~l~-~~~~~~-~~G~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~vsGi~~~~lG~~sn~SG~A 444 (720) T protein:vir:35 367 EKYWANRNKNRPAFLP-LNEIVD-KQGNIIAPPTPVGYTQPQPLNQAMAALLQQTGADIQEVTGSSQAMQPMPSNIAKET 444 (720) T ss_pred HHHhhccccccccccc-cccccc-cCcccccCCCcccccCCCCCchHHHHHHHHHHHHHHHHhCCChHHcCcccchHHHH Confidence 7776655544333222 111111 12222 2223333332333344455555555555333222222223345543333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEee----cccee Q lcl|Aclame:pro 449 LMNRADMASFIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALN----DLSVG 524 (708) Q Consensus 449 ~q~q~~~~~~~~~dn~~~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~n----Di~~g 524 (708) ++..-..+...+..-|.. ++..-+.+-+++..+.. .+- +.++.+.|-.. |+ +...+.+| |-.-| T Consensus 445 i~~rq~qg~~~~~~~~Dn-l~~~~~~~g~~lL~lI~-----~~y---~~er~~RI~~e--d~-~~~~v~~n~~~~d~~~g 512 (720) T protein:vir:35 445 VNHLMHRSDMSSFIYLDN-MAKSLKRAGEVWLSMAR-----EVY---GSDRQVRIVNA--DG-TDDIALMSVVINDNQTG 512 (720) T ss_pred HHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHH-----HHc---CCCcEEEEecC--CC-CcceEeechhhhccCCC Confidence 333344444444433333 33333455555544432 111 33455655433 12 22333333 32222 Q ss_pred ----eEEEEE---eecccchhHHHHH-HHHHHHHHHhccccCchhHHHHHHHHh-hccchhHHHHHHHHHhhhhhhhccc Q lcl|Aclame:pro 525 ----RYDVTV---DVGPSYTARRDAT-VSVLTNVLSSMLPTDPMRPAIQGIILD-NIDGEGLDDFKEYNRNQLLISGIAK 595 (708) Q Consensus 525 ----~~Dv~v---~~~~~~~~~r~~~-~~~l~~llq~~~~~~p~~~~~~~~~~~-~~d~~~~~ei~e~~~~~~~~~~~~~ 595 (708) .-||.+ ++..+..-..... -+.+..|++.++...|... +...++. .++........+.+...... . T Consensus 513 ~~v~~NDi~~g~yDv~v~~~p~~~s~req~~~~m~qll~~~~p~~~-~~~~~~~~ile~~d~p~~~e~~erirk~----~ 587 (720) T protein:vir:35 513 QVVAMNDLSSGRYDVTVDVGPSYTARRDATVSVLTNLLAGMLPQDP-MRQVLQGIILDNMEGEGLDEFKEYNRKQ----L 587 (720) T ss_pred ceeeeecceeeeeEEEEecccCcccHHHHHHHHHHHHHHhcCCCch-hHHHHHHHHHHhcCchhHHHHHHHHHhh----c Confidence 246653 4555554444444 4445555555444433322 2222222 22332233222222221111 1 Q ss_pred CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 596 PRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDK 675 (708) Q Consensus 596 ~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qae~~k~~~~~~~~q~e~~~~~~~~~~~~a~~~~~~~q~~~~~~~ 675 (708) +.+...++...+.++. .+.++.+.++.++++.++|+++++++++.++++.+....++.+..+++.+....+++..+..+ T Consensus 588 ~~~~~~~~~~~e~qq~-~a~~qq~~qq~~~e~~~aqa~l~qaqae~~kaqa~~~~~qa~a~~aqa~a~~~~a~~~~~~aq 666 (720) T protein:vir:35 588 LTQGVVKPRNTEEEQM-VAQMIQQAQQPNAELVAAQGVLMQGQAEVQKAKNEELAIQVKAFQAQTEARVAEAKMVQILAS 666 (720) T ss_pred chhcccCccChhHHHH-HHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1111001111111111 111112223445555566666667777666666666665555555544444333232222111 Q ss_pred HHHHHHHHHHhhhhhh---hhhhhcCCCCCCCCCCC Q lcl|Aclame:pro 676 AVMEAIRLLKDVAESQ---QQQFQSPPQSPADLMPS 708 (708) Q Consensus 676 ~~~~~~~~~~~~~~~~---~~~~~~~~~~~~e~~~~ 708 (708) ..........+..+.. +..+....+..++++.. T Consensus 667 ~~~~~q~~i~qalq~~~~~q~~q~~~eqa~~el~~~ 702 (720) T protein:vir:35 667 ADSAKRAEIREALKMLHQFQKEQGDASRADAELILK 702 (720) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcchHHHHHHHHhhc Confidence 1111111111111111 12223334555777666 No 116 >protein:vir:6322 Length: 510 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877469;genbank:gi:33300841;uniprot:Q7Y2D5;genbank:GeneID:1482611 Probab=99.48 E-value=6.5e-12 Score=81.90 Aligned_cols=487 Identities=13% Similarity=0.051 Sum_probs=220.0 Q ss_pred hHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCC--CHHHHHHhhhhhhhcCCCceeecchHHHHHHHHHHHhc Q lcl|Aclame:pro 5 LEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQW--EGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEYRN 82 (708) Q Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw--~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~g~~~~ 82 (708) ++..+.....+++ .+.|...|+++.+|.. . .=. +.... +.. -.++ .-+.-...++.+.+.... T Consensus 1 mk~~~~~~~~~lk-----R~~~e~~w~e~a~~tl-P-~~~~~~~~~~-----~~~-~~~~--~dstg~~a~~~LAa~l~~ 65 (510) T protein:vir:63 1 MKTTAAMLWEKLR-----DGSVEQRAIEFAKTTL-P-YLMVDPMSGS-----RGV-VEHD--FQSAGALLVNNLAAKLAR 65 (510) T ss_pred ChhHHHHHHHHHh-----ccchHHHHHHHHHhhc-c-ccCCCCCCcc-----ccc-cCCC--ccchHHHHHHHHHHHHHh Confidence 4444444444443 3346666766655421 1 000 00000 000 0122 233444455555443333 Q ss_pred -----CcceeEEecCCC------cchH---HHHH---HHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeecccc Q lcl|Aclame:pro 83 -----NRITVKFRPGDR------EASE---ELAN---KLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNE 145 (708) Q Consensus 83 -----nr~~~~v~pr~~------~~d~---~~A~---~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~ 145 (708) +++=+++.+.+. +.+. ++.+ .++..+......|++..+...++.+.+..|.+.+.+. T Consensus 66 ~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Li~~G~a~l~~~------ 139 (510) T protein:vir:63 66 SLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRD------ 139 (510) T ss_pred hhcCCCCcccccCCChHHhhcccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEEc------ Confidence 233333433321 0111 1222 2344555667789999999999999999998866542 Q ss_pred CCCCCCCcceeeEEeecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCcee Q lcl|Aclame:pro 146 YDPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVI 225 (708) Q Consensus 146 ~d~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~ 225 (708) .++. ++... |+.++++..+.. . ...-++++..++..++-+.|+....... .+ ....+.+ T Consensus 140 ~~~~------~~~~~--pl~~y~v~~d~~---G-~vd~i~rr~~~t~~~l~e~~~~~~~~~~-------~~--~~~~~~v 198 (510) T protein:vir:63 140 SDAA------TVVAW--SLRSYAVRRDAT---G-RWMDIVLKQRYKSKDLDEEYKQDLMRAG-------RN--LSGSGSV 198 (510) T ss_pred CCCc------EEEEE--EcceeEEeeCCC---c-CeeEEEeeeeccHHHHhHHhhhhhhccc-------cc--cCCCcce Confidence 1211 22222 455666544332 1 1223789999999999877764322110 00 0011111 Q ss_pred EEeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEE-EecceeeecCCCCCCC Q lcl|Aclame:pro 226 YIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSV-VDGDGFLEKPRRIPGE 304 (708) Q Consensus 226 ~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~-~~~~~il~~~~~~p~~ 304 (708) .++.+..+..++ .+..+.+++ ..|..+. ..+.||+. T Consensus 199 -------------~v~~~V~~~~~~-----------------------------~~~~~sv~~e~dg~~~~-~~~~~~~~ 235 (510) T protein:vir:63 199 -------------DLYTHVQRKKGT-----------------------------AMEYAELYHEIDGVRVG-KEGRWPIH 235 (510) T ss_pred -------------EEEEEEEeecCC-----------------------------CceEEEEEEEecCceec-cccccccc Confidence 111111110000 001111222 2333333 34678899 Q ss_pred CcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeee Q lcl|Aclame:pro 305 HIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPL 384 (708) Q Consensus 305 ~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~ 384 (708) ++||+|+-... .+|..+|.|.+....+--+.+|++....+.......++.++++++.+-...... ...++.++.. T Consensus 236 e~P~~~~Rw~~--~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~p~g~~~~~~~~---~~~~g~~v~g 310 (510) T protein:vir:63 236 LCPYIVPTWNL--APGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQ---DAEMGDYVPG 310 (510) T ss_pred cCceeeeeeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccCcccccchhhhc---cCCCceeecC Confidence 99998765443 588899999999999999999999999999999999999999876553322111 1111222111 Q ss_pred cccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccchhHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 385 REVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMASFIYLDNM 464 (708) Q Consensus 385 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n~sg~ai~~~q~q~~~~~~~~~dn~ 464 (708) . .+.+ .+....+..--+.....++...+.|....=+. ...-.....|++-|..+.+-....+...+.+| T Consensus 311 ~-----~~~v-----~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~-l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl 379 (510) T protein:vir:63 311 G-----AEAV-----RAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG-ANQRDAERVTAEEVRITAEEAENTLGGTYSLL 379 (510) T ss_pred C-----cccc-----eeeecCcccchHHHHHHHHHHHHHHHHHHHhh-cccCCCCCcCHHHHHHHHHHHHHHhhHHHHHH Confidence 0 0111 11111111122333456666666666653221 11111223588888888888888888877777 Q ss_pred H-HHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHH Q lcl|Aclame:pro 465 A-KSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDAT 543 (708) Q Consensus 465 ~-~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~ 543 (708) . .+..-+.+..+.++.... ++-+. +..+ .-+++ ++ -++-.|.+. T Consensus 380 ~~E~l~Pli~r~~~il~r~g----l~p~p------------~~~~----------------~~~~v--~~-is~Laraq~ 424 (510) T protein:vir:63 380 AENLQSPLAYVCLSEVDDAL----LQGLI------------TKQH----------------KPAIE--TG-LPALSRSAA 424 (510) T ss_pred HHHHHHHHHHHHHHHHHhcc----CCCCC------------chhc----------------cccee--cc-hhHHHHHHH Confidence 6 455555555555443311 11110 0000 00111 11 223345555 Q ss_pred HHHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhh--hhhcccCcchHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 544 VSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLL--ISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMV 621 (708) Q Consensus 544 ~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~--~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~ 621 (708) .+.+..+++.+....+. ..+.+ .. +.++++..+..... +...... +++.++.+ +++.++.+++. T Consensus 425 ~~~l~~~~q~l~~~~~~-aq~~~----~i---d~d~~~~~~a~~~Gv~p~~ivrs--~eev~a~~-~~~~qq~~~~~--- 490 (510) T protein:vir:63 425 VQSMLNASQVIAGLAPI-AQLDP----RI---SLPKMMDTIWAAFSVDTSQFYKS--ADELQAEA-EQQRQQAAQAQ--- 490 (510) T ss_pred HHHHHHHHHHHHHhcCc-hhhhc----cC---CHHHHHHHHHHHhCCChhHhcCC--HHHHHHHH-HHHHHHHHHHH--- Confidence 55555555444332221 11111 11 34445544443332 1222222 12111111 11111111000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 622 LAQAQMVAAQAEAQKATNETAQTQIKA 648 (708) Q Consensus 622 ~aq~~~~~~qae~~k~~~~~~~~q~e~ 648 (708) ++++.+..+|. +...+-+-+ T Consensus 491 -~~~~~~~~~a~------~~~~~~~g~ 510 (510) T protein:vir:63 491 -AAQETLLEGAS------DMTNALAGV 510 (510) T ss_pred -HHHHHHHHHHH------hhcccccCC Confidence 00000000100 000000000 No 117 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=99.45 E-value=2.3e-12 Score=84.40 Aligned_cols=487 Identities=11% Similarity=0.012 Sum_probs=229.2 Q ss_pred CCcchHHHHHHHHHHHHHH-H---------HhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRA-Y---------SPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVA 70 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~-~---------~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~ 70 (708) |-++.+..+++...++..- . ....+.+.+....+. +|.|+.+.-. .....+....+...+.|+-+ T Consensus 3 ~~~~~k~~~~k~~~~~~~~~~~~i~~~~~i~~~~~~~~~i~~~~~--~y~g~~~~~~---~~~~~~~~~~~~~~slnl~~ 77 (522) T protein:vir:47 3 LFQKVKDFFSRGRYYMQTSNLNSILEHPKIAVTQEEYDRIKRNLV--YYQSKWDDVQ---YKNTDGDIKSRPMNHLPIAR 77 (522) T ss_pred hHHHHHHHHHHHHHHhhcccchhccccCCCCCCHHHHHHHHHHHH--HhcCCccccc---ccccCcchhcccceecchHH Confidence 4445555555544332110 0 012233344433343 5777644211 11111111123356678889 Q ss_pred HHHHHHHHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCC Q lcl|Aclame:pro 71 TELNRIIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMD 150 (708) Q Consensus 71 ~~i~~i~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~ 150 (708) .+++...+...+-.+.+.|. |....+.| ..+.+.|++......+++.++..|-|++++.++. T Consensus 78 ~i~~~~A~lv~~e~~~i~v~------d~~~~~~l----~~~l~~n~f~~~~~~~~e~a~a~G~~a~k~~~d~-------- 139 (522) T protein:vir:47 78 TASKKIASLVYNEQATITTK------NEILQKFL----DDMLTNDRFNKNFERYLESCLALGGLAMRPYIDG-------- 139 (522) T ss_pred HHHHHHhhhhcCCcceeecC------ChHHHHHH----HHHHhhcchHHHHHHHHHHhhccCCEEEEEEEcC-------- Confidence 99998888888878888761 33444544 4445589999999999999999999999998752 Q ss_pred CCcceeeEEeecchhheecCCcccc-CChhccCeEEEeecCCHHHHHHhCCCCccccccccccccccc-CCCCCc--eeE Q lcl|Aclame:pro 151 DRQRIAIEPIYDPSRSVWFDPDAKK-YDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEY-NWFGAD--VIY 226 (708) Q Consensus 151 ~~~~i~i~~v~~~~~~v~~Dp~a~~-~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~-~~~~~~--~~~ 226 (708) +.+++..+ +...++ |-... -+...|-++++........- ..| +-.++ .|.+.+ ... T Consensus 140 --~~~~i~~v--~ad~~~--P~~~~~~~~~e~a~~~~~~~~~~~~~-~~y-------------t~lE~he~~~~~~~~~~ 199 (522) T protein:vir:47 140 --DKVRVAFI--QAPVFF--PLESNTQDVSSAAILTKTIKSEGRKN-VYY-------------TLVEFHEWVTADGQETG 199 (522) T ss_pred --CceEEEEE--cCCceE--EEEEcCCceEEEEEEEEEEeecccce-eEE-------------EEEEEeeeccccccccc Confidence 23455444 344454 21111 11223323322221111100 000 00000 000000 000 Q ss_pred EeeeeeecceEEEEEEEecCc----cCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCC Q lcl|Aclame:pro 227 IAKYYEVRKESVDVISYRHPI----TGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIP 302 (708) Q Consensus 227 v~e~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p 302 (708) ....+. .-...+.++... .|..+.+..- .....|.....++ T Consensus 200 ~~~~~~---~~~I~n~ly~~~~~~~lG~~v~l~~~--------------------------------~e~~~l~~~~~~~ 244 (522) T protein:vir:47 200 STNDKK---YYRITNELYRSDVNDVLGQRVNLSEL--------------------------------DKYKNLEPVTVFE 244 (522) T ss_pred ccccCC---ceEEEEEEeecCCCcccCcccccccc--------------------------------ccccCCCCceEeC Confidence 000000 000000111000 0111110000 0000000000111 Q ss_pred CCCcceeeEE---EeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHH----H--- Q lcl|Aclame:pro 303 GEHIPLIPVY---GKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHW----E--- 372 (708) Q Consensus 303 ~~~~p~~p~~---~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~----~--- 372 (708) +..-|.+.++ ......++++.|.|++.++++..+.+|...+++.+-+.++. .+++++...+.....-. . T Consensus 245 ~~~~Plf~y~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~-~~i~v~~~~l~~~~~~~~g~~~~~~ 323 (522) T protein:vir:47 245 NLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRSYDEFMWEVRMGQ-RRVIVPEHLTQRQYQRPDGTIDFRP 323 (522) T ss_pred CCCcceEEEecCCcccccccCCCcCCchhhhhHHHHHHHHHHHHHHHHHHHhcc-ceeecchHHhccCCCCCCccccccc Confidence 1111221111 11112346778889999999999999999999999997765 46778776653211100 0 Q ss_pred hhcccCCceeeecccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccc--hhHHHHHHHH Q lcl|Aclame:pro 373 ARNKKRPAFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSN--IAQETVNNLM 450 (708) Q Consensus 373 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n--~sg~ai~~~q 450 (708) ........+. ..... ......+...++.--...+...++.....+....|++....|..+. .||++|.... T Consensus 324 ~fd~~~~~f~---~~~~~----~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~gls~~tf~~~~~~~kTAtEi~s~~ 396 (522) T protein:vir:47 324 RFDVEQNVYM---QIGGS----SMDAGGITDLTSPIRANDYILAISEGLKLFEMQIGVSSGMFTFDGQGMKTATEIVSEN 396 (522) T ss_pred ccCcccceEe---ecCCC----CCCCCcceeeccccChHHHHHHHHHHHHHHHHHhCCCccccCccccccccHHHHHHHH Confidence 0000111111 11111 1111234444444444667778888888888889999988886543 4888998888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--cCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEE Q lcl|Aclame:pro 451 NRADMASFIYLDNMAKSLKRAGEVWLSMAREV--YGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDV 528 (708) Q Consensus 451 ~q~~~~~~~~~dn~~~~~~~~~~~~l~li~~~--y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv 528 (708) +..-.....+...+..+++++.+.++.+...+ |.. . ....++| T Consensus 397 ~~~~~t~~~~~~~~~~al~~lv~~i~~l~~~~~~~~~--------~---------------------------~~~~~~i 441 (522) T protein:vir:47 397 SDTYQMRSSIVALVEQSIKELCVSMCELGKAVGVYSG--------E---------------------------IPELDDI 441 (522) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccC--------C---------------------------CCCccee Confidence 88888888899999999999999999887432 211 0 0112445 Q ss_pred EEeecccchhHHHHHHHHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhhhhcccCcch-HHHHHHHH Q lcl|Aclame:pro 529 TVDVGPSYTARRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNE-KEQQIVQQ 607 (708) Q Consensus 529 ~v~~~~~~~~~r~~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~-~~~q~~~~ 607 (708) +|+=+.+-...+++.++.++++.+.+. .. .-.+.+-....+-.-+++..++++....+....++..- ...+... T Consensus 442 ~v~f~D~i~~D~~~~~~~~~~~v~aG~-~s---~e~~i~~~~g~~eeea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~- 516 (522) T protein:vir:47 442 SVNLDDGVFTDRHAELDYWAKMVAAGF-ST---KKRAIGKTLNISGVEAEKELNAINSELLPMNDAELAIYGMHDQNEE- 516 (522) T ss_pred EEEcCCCCCCCHHHHHHHHHHHHhcCC-CC---HHHHHHhcCCCChHHHHHHHHHHHHhhccCCCCCCCCCCCCCcccc- Confidence 555555555556677777777765432 11 11111111111112244455555544322111111000 0000000 Q ss_pred HHHHHHHHHHHHH Q lcl|Aclame:pro 608 AQMAAQSQPNPEM 620 (708) Q Consensus 608 ~qq~qq~~~~~~~ 620 (708) . -..+- T Consensus 517 -----~--~d~~~ 522 (522) T protein:vir:47 517 -----K--ADDKG 522 (522) T ss_pred -----c--CCCCC Confidence 0 00000 No 118 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=99.45 E-value=5.1e-13 Score=87.98 Aligned_cols=471 Identities=11% Similarity=0.029 Sum_probs=197.4 Q ss_pred CCcchHHH----------------HHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCC-- Q lcl|Aclame:pro 1 MAETLEKK----------------HERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYP-- 62 (708) Q Consensus 1 ma~~~~~~----------------~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp-- 62 (708) |-...+.| .+.+.......+..+.. +..+-++...||.|+|.....-..... +.++ T Consensus 1 ~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~l~~~l~~~~~~--~~~rl~~l~~YY~G~~~~~~~~~~~~~----~~~~~~ 74 (501) T protein:vir:25 1 MTVPVDVIADAPAADVEFPEDSMSREQLGALVADMWRLHIS--ERQWLDRIYEYTKGLRGRPEVPEGASD----EVKELA 74 (501) T ss_pred CcccchhhhccCcccccCCcccCChHHHHHHHHHHHHHHHH--HHHHHHHHHHHHhcCCCchhccccCCh----hhhhhH Confidence 11111111 11122222222222221 112223445689998853221111100 0111 Q ss_pred -ceeecchHHHHHHHHHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEee Q lcl|Aclame:pro 63 -KFEINKVATELNRIIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSM 141 (708) Q Consensus 63 -~~~~N~i~~~i~~i~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~ 141 (708) -++.|..+.+|+..++...-+- + ..|. .+.+ ..+..+++.|+++.....++.++++.|++|+.|..+ T Consensus 75 ~~~v~n~~~~ivd~~a~~l~~~g--f-~~~d-~~~~--------~~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d 142 (501) T protein:vir:25 75 KLSVKNVLSLVRDSFAQNLSVVG--Y-RNAL-AKEN--------DPAWEMWQRNRMDARQAEVHRPALTYGASYVTVTPT 142 (501) T ss_pred hhhhcChHHHHHHHHHhhhcccc--e-ecCC-ccch--------HHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecC Confidence 2356888999998887553221 1 1222 1111 123456789999999999999999999999877543 Q ss_pred ccccCCCCCCCcceeeEEeecchhhe-ec-CCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCC Q lcl|Aclame:pro 142 LVNEYDPMDDRQRIAIEPIYDPSRSV-WF-DPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNW 219 (708) Q Consensus 142 ~~~~~d~~~~~~~i~i~~v~~~~~~v-~~-Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~ 219 (708) + ++ .+|. +.+|...+ +| ||...+. ..+ +++.+....+ T Consensus 143 ---e----~~---~~i~-~~sp~~~~~iy~D~~~~~~----~~~-ai~~~~~~~~------------------------- 181 (501) T protein:vir:25 143 ---D----EG---PVFR-TRSPRQILAVYADPSVDAW----PQY-ALETWVAQKD------------------------- 181 (501) T ss_pred ---C----CC---CeEE-EeccccEEEEEecCCCCcc----eeE-EEEEEeeccc------------------------- Confidence 1 11 1233 23343332 24 5543321 111 2222221110 Q ss_pred CCCceeEEeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCC Q lcl|Aclame:pro 220 FGADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPR 299 (708) Q Consensus 220 ~~~~~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~ 299 (708) .+......+|.... ++.+... +.. ......... ....+. .+..+........ T Consensus 182 --~~~~~~~~~y~~~~----~~~~~~~--~~~-~~~~~~~~~------------------~~~~~~-~~~~~~~~~~~~~ 233 (501) T protein:vir:25 182 --AKPHRRGVLYDDTY----MYELDLG--EVV-LGDAGGGQA------------------TQQPVN-VREVTDVIEHGAT 233 (501) T ss_pred --cCcceeEEEecCee----EEEEecC--cee-eeecccccc------------------cccccc-ccccccccccccc Confidence 11111223332211 1111110 000 000000000 000000 0000111111223 Q ss_pred CCCCCCcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCC Q lcl|Aclame:pro 300 RIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRP 379 (708) Q Consensus 300 ~~p~~~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~ 379 (708) +.|++.+|+++|.-.+ +....+.|.+..+++.++.+|+.+|.+...+...+.++..+ .|...+....|. ... T Consensus 234 ~~~~~~vPiv~f~N~~---~~~~~g~sdie~v~~l~Da~~~~~s~~~~~~e~~a~p~~~i-~G~~~~~~~~~~----~~~ 305 (501) T protein:vir:25 234 FEGKPVCPVVRFVNGR---DADDMIVGEVAPLILLQQAINSVNFDRLIVSRFGANPQRVI-SGWTGSKAEVLK----ASA 305 (501) T ss_pred cCCccceeeEeccCcc---ccCccccchhhhhHHHHHHHHHHHHHHHHHHHhhccHHHHH-hCCCCCccchhh----hcc Confidence 4455566666654322 22345778899999999999999999988887766654333 122111111111 111 Q ss_pred ceeeecccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcc-cccchhHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 380 AFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQ-MPSNIAQETVNNLMNRADMASF 458 (708) Q Consensus 380 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G-~~~n~sg~ai~~~q~q~~~~~~ 458 (708) +.++.-.+.. ..+...+... ...+...+......|-..|++.+...| ..+|.||.|+......-..... T Consensus 306 ~~i~~~~~~~---------~~~~q~~~~~-~~~~~~~l~~~i~~i~~~s~~P~~~~~~~~~N~Sg~Al~~~~~~l~~ka~ 375 (501) T protein:vir:25 306 LRVWTFEDPE---------VKAQAFPPAS-VEPYNLILEEMLQHVAMVAQISPAQVTGKMINVSAEALAAAEANQQRKLA 375 (501) T ss_pred cceeccCCCC---------ceEEEecccC-hHHHHHHHHHHHHHHHhhcCCChhhhccccCChHHHHHHHHHHHHHHHHH Confidence 2222111110 1122222222 244556677777777778889988777 4568899999988888777778 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchh Q lcl|Aclame:pro 459 IYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTA 538 (708) Q Consensus 459 ~~~dn~~~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~ 538 (708) ...+.|..+++++.++++.+- +.. .... .+++.|.=.+..+. T Consensus 376 ~k~~~f~~~l~~~~rl~~~~~----~~~---------~~~~-------------------------~~~i~v~w~~~~~~ 417 (501) T protein:vir:25 376 AKRESFGESWEQLLRLAAEMD----DDP---------DTAA-------------------------DSGAEVLWRDTEAR 417 (501) T ss_pred HHHHHHHHHHHHHHHHHHHHh----CCC---------cccc-------------------------ceeeeEEecCCCCC Confidence 888888888888877665432 211 0000 12223322223233 Q ss_pred HHHHHHHHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 539 RRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNP 618 (708) Q Consensus 539 ~r~~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~ 618 (708) ...+..+.+..|.+.+ . +.. .++.++..-.-+++ +++.... +.+ ...... T Consensus 418 s~~~~ada~~kl~~~g--i-s~e-----t~~~~~~g~~~~~i-e~~~~~~------------------~e~---~~~~~~ 467 (501) T protein:vir:25 418 SFGAVVDGITKLASAG--I-PIE-----HLLSMVPGMTQQTI-QAIKDSL------------------RGG---EVKSLV 467 (501) T ss_pred CHHHHHHHHHHHHhcC--C-CHH-----HHHHHcCCCCHHHH-HHHHHHH------------------HHH---hHHHHH Confidence 3456667777766532 1 111 11221111001111 1111100 000 000000 Q ss_pred HHHHHHHH--HHH--HHHHH---HHHHHH-HHHH Q lcl|Aclame:pro 619 EMVLAQAQ--MVA--AQAEA---QKATNE-TAQT 644 (708) Q Consensus 619 ~~~~aq~~--~~~--~qae~---~k~~~~-~~~~ 644 (708) .+...+.. ... .+... ...... ..-+ T Consensus 468 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 501 (501) T protein:vir:25 468 DKLLSNEPAPVPPPPPQAAAQALNEGGVNGNGGA 501 (501) T ss_pred HHhhccCcCCCCCCCCCCCccccccccCCCCCCC Confidence 00000000 000 00000 000000 0000 No 119 >protein:vir:7430 Length: 563 # NCBI annotation: gp7 # Family: family:all:6920 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818545;genbank:gi:29566982;genbank:GeneID:1260216 Probab=99.33 E-value=5.8e-12 Score=82.18 Aligned_cols=536 Identities=9% Similarity=-0.009 Sum_probs=241.7 Q ss_pred CCcchHHHHHHHHHHHHHHHHhh----HHHHHHHHHHHHHhhcCCCCCCHH-HHHHhhhhhhhcCCCceeecchHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQ----KEVREKCIEATRFARVPGGQWEGA-TAAGTKLDEQFEKYPKFEINKVATELNR 75 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~----~~~r~~~~~d~~~~~~~G~Qw~~~-~~~~l~~~~q~~grp~~~~N~i~~~i~~ 75 (708) |+- ..+...-...-|-+...-| ...|-.+-+. --+||.|+||+-. ++.... +-++.++--+.+|++ T Consensus 1 m~~-~~~q~~p~~~~fp~~~a~wV~~~D~~RlaaY~l-y~d~y~n~~~el~~il~G~d-------r~~~~~ps~r~~V~~ 71 (563) T protein:vir:74 1 MPY-NHKQYDPAKPFLRGGDDNIVDENDKNRVRAYDL-YENIYLNSAETLKLVLRGDD-------SVPILMPSGRKIVEA 71 (563) T ss_pred CCc-cccccCCCcccccccccccCCHHHHHHHHHHHH-HHHhhcCchhhhhhhcCCCc-------eeeeccchHHHHHHH Confidence 441 1111111111122111111 1112222222 2378999999743 333221 123444456788888 Q ss_pred HHHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCcce Q lcl|Aclame:pro 76 IIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRI 155 (708) Q Consensus 76 i~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i 155 (708) +. .....-..+.|-|.+ +|+...+.++.+++.+.+.++.......+-.++++-|-|++++.||... ....++ T Consensus 72 ~~-~~Lg~~~~~~Ve~~~--~de~~~~avq~~Lr~~~~~e~l~~~~~~~~r~a~vlGDgvf~l~wDp~K-----~~g~R~ 143 (563) T protein:vir:74 72 VH-RFLGVGFDYLVEPDM--GDEGIRQSLNAYFRTTFKREAIKAKFTSNKRWGLIRGDAHFYIHADPNK-----KAGERI 143 (563) T ss_pred HH-HhcCCCcEEecCccc--cCcchHHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeecccc-----ccCCCc Confidence 44 556778888887775 3566667799999999999999999999999999999999999986532 122355 Q ss_pred eeEEeecchhheecCCccccCChhccCeEEEe---ecCCHHHHHH-hCCCCcccccccccccccccCCCCCceeEEeeee Q lcl|Aclame:pro 156 AIEPIYDPSRSVWFDPDAKKYDKSDALWAFCM---YSLSPEKYEA-EYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYY 231 (708) Q Consensus 156 ~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~---~~~~~~e~~~-~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~ 231 (708) ++..+ +|.. + | | +..+| ++..+..+. .|-..++.++ .+. -..+-+.|.+......+--| T Consensus 144 rv~~v-DP~~-~-f-p-~~dpd-~v~g~~~v~v~~~~~~pdd~~~~~~r-----------~~~~~~~lndeg~~~~~~~~ 206 (563) T protein:vir:74 144 SVDEV-DPRQ-I-F-L-IEDGS-TVVGFHMVDIVQDFRSPDDPSKKLAR-----------RRTFRRVRNDEGMFTGRISS 206 (563) T ss_pred eEeec-CCce-e-e-e-ccCCC-CcccceeeecccCCCCCcchhcccee-----------eeeeeeeeCCCCCccceeee Confidence 55543 2322 2 2 2 22233 122221111 2222222221 110 00001111111111111000 Q ss_pred eecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEec-ceeeecCCCCCCCCcceee Q lcl|Aclame:pro 232 EVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDG-DGFLEKPRRIPGEHIPLIP 310 (708) Q Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~-~~il~~~~~~p~~~~p~~p 310 (708) +-+. ..+.-|+..+.....+ . ....+ .+.. ...-++.=|-|.+.+||+- T Consensus 207 dae~---w~lg~wd~r~~~~~~~-----~-----~~~~~-----------------~~~~~~d~e~~~LP~pi~~iPiv~ 256 (563) T protein:vir:74 207 ELTH---WTLGNWDDRGAISDEQ-----A-----RRKEQ-----------------VRSAQHDEEEEELPEPISQLPLYR 256 (563) T ss_pred ccch---hccccccccCccchhh-----h-----cccch-----------------hhhhhhhchhhhccccccCccEEE Confidence 0000 0000011111000000 0 00000 0000 0111122255666667654 Q ss_pred EEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhh-ccchHHHHHhhcccCCceeeeccccc Q lcl|Aclame:pro 311 VYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQ-IRGLEKHWEARNKKRPAFLPLREVRD 389 (708) Q Consensus 311 ~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~a-i~~~~~~~~~~~~~~~~~~~~~~~~~ 389 (708) |-..+ ..++..|.|-...+..+.+.+|..+|-...++..+.++.++.+..+ .++........+..++.+ +.-.... T Consensus 257 ~~tip--~~~s~WG~S~La~ll~~~~eLn~~~Td~s~i~~~tG~pi~vl~~~~p~d~~~g~~~~w~vgpG~i-~El~~~~ 333 (563) T protein:vir:74 257 WRNKP--PQNSSWGTSQLEGMETLAYALNQSLTDEDATIVFQGLGMYVTNASAPVDPNTGELTDWNIGPMQI-VEIAGNR 333 (563) T ss_pred cCCCC--CcccccchhhHHHHHHHHHHHhhhhhHHHHHHHhcCCCeEEeccccccccccccccccccCCcee-EeccCCc Confidence 32222 3456667888999999999999999999999999998887776322 121111111112223222 2222111 Q ss_pred cccccccccccccccc-CccchHHHHHHHHHHHHHHHHHhCCChhHcc--cccc-hhHHHHHHHHHHHHHHH----HHHH Q lcl|Aclame:pro 390 KSGNIIAGATPAGYTQ-PAVMNQALAALLQQTSADIQEVTGGSQAMQQ--MPSN-IAQETVNNLMNRADMAS----FIYL 461 (708) Q Consensus 390 ~~~~~~~~~~~~~~~~-~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G--~~~n-~sg~ai~~~q~q~~~~~----~~~~ 461 (708) +.|. +..+. .+++..-..-|-......|.+++|+.....| ..+. -||.|...-.+--.... ..+. T Consensus 334 ~~g~-------l~~v~g~~~l~~~q~Hm~~l~eral~~~s~tPavA~G~vD~~~~~SGiALeL~L~PL~a~~~ek~l~l~ 406 (563) T protein:vir:74 334 NDNY-------FERVSGVQDVSPFQDHMKWIDEKGIAEGSGTPEVAIGRVDVTSAESGISLELQLKPLLAANEEKELEMI 406 (563) T ss_pred cccc-------eeeecchhhhHHHHHHHHHHHHHHHHhhccCcceeecccccccccchhhhhhhhhHHHHhhhhhHHHHH Confidence 1111 11111 1222222222223334467888999999999 5554 49998775444322211 2366 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHH Q lcl|Aclame:pro 462 DNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRD 541 (708) Q Consensus 462 dn~~~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~ 541 (708) ..++.++-+..+++|.+.+..|-...-- .|+-+. + .-++.-|+|.=+|-+|+.+. T Consensus 407 ~~mr~~r~~~~~~lL~~~erl~~~g~~~---------~~~g~~-----~-----------~~~~~~v~ivf~p~~P~d~~ 461 (563) T protein:vir:74 407 VVMDQFLHDWMTMWLPAYESDFQEQDGS---------RPFASA-----D-----------LLNECSVVCIFADPMPVNKT 461 (563) T ss_pred HHHHHHHHHHHHHHHHHHHhHhhhhccc---------cccccc-----c-----------cCCceEEEEEeCCCCCccHH Confidence 6777888888899998888866322211 122111 1 11234567778899999999 Q ss_pred HHHHHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 542 ATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMV 621 (708) Q Consensus 542 ~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~ 621 (708) +.++....|.+.+-- ...+++.++ . .+ +..-|+.+ ++.+.. + ..+. T Consensus 462 ~vv~~~~tl~~aGii-------SretAv~~L--------~-~~-------g~~~pdae--~e~~~i----e--~~~i--- 507 (563) T protein:vir:74 462 QVTQDTLLLQQAHLI-------LRKMAVAKL--------R-SI-------GWEYPEVD--DQGNAL----T--DDDI--- 507 (563) T ss_pred HHHHHHHHHHHcCch-------hHHHHHHHH--------H-hC-------CCCCCcHH--HHHhhc----C--HHHH--- Confidence 999998888775421 111221111 0 00 00111111 110000 0 0000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhcCCCC Q lcl|Aclame:pro 622 LAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVMEAIRLLKDVAESQQQQFQSPPQS 701 (708) Q Consensus 622 ~aq~~~~~~qae~~k~~~~~~~~q~e~~~~~~~~~~~~a~~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 701 (708) .++.+ ++|+++. . -++++...+.....+-..+.+.--+=.+ T Consensus 508 -~~~~~-------a~a~ad~-------------------~------------~~~~a~~~~g~~~~~~dd~g~p~~~~~~ 548 (563) T protein:vir:74 508 -ADMLL-------AEAEADA-------------------S------------LGLSAMDNGGAGEQQFDDQGNPIDQFGN 548 (563) T ss_pred -HHHHH-------HHhhccC-------------------c------------ccceecccCCCCcccccccCCchhHcCC Confidence 00000 1111110 0 0000000000000000000111123345 Q ss_pred CCCCCCC Q lcl|Aclame:pro 702 PADLMPS 708 (708) Q Consensus 702 ~~e~~~~ 708 (708) |-|+||. T Consensus 549 ~~~~~~~ 555 (563) T protein:vir:74 549 PVEIPPD 555 (563) T ss_pred cccCCcc Confidence 6666666 No 120 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=99.32 E-value=4.1e-11 Score=77.56 Aligned_cols=422 Identities=11% Similarity=-0.001 Sum_probs=189.3 Q ss_pred hhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHHHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcC Q lcl|Aclame:pro 37 ARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETD 116 (708) Q Consensus 37 ~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~ 116 (708) +...+.. + ..+...+ .++.|..+.+|+.+++...-+- +. .|. .+. +..+..+++.|+ T Consensus 1 ~l~~~~~--~----~~~~~~~-----~~v~n~~~~ivd~~~~~l~~~g--f~-~~d-~~~--------~~~~~~i~~~N~ 57 (434) T protein:vir:98 1 MLPKNAE--Q----AFLDFQR-----KARTNFCGLIANASVHRLLALG--VT-GPD-GEP--------DTRASRWWQANR 57 (434) T ss_pred CCCCCcc--H----HHHHhhh-----hhhccchHHHHHHHHhhhccCc--ee-cCC-Cch--------HHHHHHHHHhcC Confidence 1111211 1 1111110 2456999999999998554332 22 121 111 122345678899 Q ss_pred hHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCcceeeEEeecchhh--eecCCccccCChhccCeEEEeecCCHHH Q lcl|Aclame:pro 117 GGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAIEPIYDPSRS--VWFDPDAKKYDKSDALWAFCMYSLSPEK 194 (708) Q Consensus 117 ~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~i~~v~~~~~~--v~~Dp~a~~~D~sDa~~~~~~~~~~~~e 194 (708) ++...+.+..++++.|+||+.|..+... ...++...+.|..+ ++.. ++||+....+ .+.+ +.|-... T Consensus 58 ~d~~~~~~~~~a~i~G~ay~~v~~~~~~--~~~~~~~~~~I~~~--~p~~~~~i~D~~~~~~-----~~ai-~~~~~~~- 126 (434) T protein:vir:98 58 LDSRQKLVWRMAMAQSAGYMLVGAHPTR--TEDNGRPSPLITME--HPSECIVEYDPETGEP-----LVGL-KVWHNDI- 126 (434) T ss_pred hhHHHHHHHHHHhhcCceEEEEecCCCc--ccccCCceeEEEEe--ccceeEEEEeCCCCce-----EEEE-EEEEecc- Confidence 9999999999999999999988654321 11233344444433 2333 3577654321 1122 2211000 Q ss_pred HHHhCCCCcccccccccccccccCCCCCceeEEeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhh Q lcl|Aclame:pro 195 YEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEV 274 (708) Q Consensus 195 ~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 274 (708) ........||.-.. ..+.+....++....... T Consensus 127 ---------------------------~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~------------------ 158 (434) T protein:vir:98 127 ---------------------------DGFGYARVFFDDTS---FPYRTRERTGARLPWGPD------------------ 158 (434) T ss_pred ---------------------------CCceEEEEEEeCcE---EEEEEeeccccccccccc------------------ Confidence 00001111111000 001111111110000000 Q ss_pred hheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCC Q lcl|Aclame:pro 275 ARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPG 354 (708) Q Consensus 275 ~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~ 354 (708) .+. + ........+.|++.+|++||+-.+. .+ ..|.|.++.+++.++.+|+.+|.++..+...+. T Consensus 159 --------~~~---~--~~~~~~~~~h~~g~vPvv~f~N~~~-~~--~~g~sd~e~vi~liDa~~~~~s~~~~~~~~~a~ 222 (434) T protein:vir:98 159 --------SWV---Y--TGTADSGDVHDLGGMQLVEFARMPD-LG--EDPEPEFAGVLDIQDRVNLGILNRMAASRFSGF 222 (434) T ss_pred --------cce---e--cccccccccCCCCccceEEeccCCC-cC--cCCcchhhhHHHHHHHHHHHHHHHHHHHHHhcc Confidence 000 0 1112223445667788888753332 11 246788999999999999999999998877776 Q ss_pred Cceeechhhc-cchH-------HHHHhhcccCCceeeecccccccccccccccccccccCcc-chHHHHHHHHHHHHHHH Q lcl|Aclame:pro 355 QIPIVGMEQI-RGLE-------KHWEARNKKRPAFLPLREVRDKSGNIIAGATPAGYTQPAV-MNQALAALLQQTSADIQ 425 (708) Q Consensus 355 ~~~i~~~~ai-~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~l~~~~~~~~~ 425 (708) ++.++- |.. .... ..+. ......+.++.... ..+.+.+.+. -...+...+......+- T Consensus 223 p~~~i~-G~~~~~~~~~~~~~~~~~~-~~~~~~~~i~~~~~-----------~~~~~~q~~~~~~~~~~~~l~~~i~~~~ 289 (434) T protein:vir:98 223 RQKWIK-GHKFAKRTDPATGMTVVDQ-PFVPSPSAVWASEG-----------ENTQFGQLDATDLSGFLKEHASDVRDML 289 (434) T ss_pred hhhhhc-CCCcccccccccccchhhh-hhhccccccccCCC-----------CCceEEEecCcchHHHHHHHHHHHHHHh Confidence 654442 111 0000 0000 00011111111110 0111122211 22445556666667777 Q ss_pred HHhCCChhHccc-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEec Q lcl|Aclame:pro 426 EVTGGSQAMQQM-PSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLS 504 (708) Q Consensus 426 ~~tGv~~~~~G~-~~n~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in 504 (708) .+|++++...|. .+|.||.|+......-........+.|..+.+++.++++.+. |.. ... T Consensus 290 ~~~~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~rl~~~~~-------------g~~--~~~---- 350 (434) T protein:vir:98 290 TISQTPTYLYATDLVNISADTIGALDILHVAKVREHIASFSEGLESVLALAAAQA-------------GVP--EDY---- 350 (434) T ss_pred cccCCCHHHhccccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-------------CCC--hhh---- Confidence 778888887774 467899999988887777778888888888888888766541 110 000 Q ss_pred ccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHH Q lcl|Aclame:pro 505 AQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYN 584 (708) Q Consensus 505 ~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~ 584 (708) +++.|.=.+..+....+..+.+..|.+.+.+ . ..+++++.++- +++ +++ T Consensus 351 ---------------------~~~~v~w~~~~~~s~~~~ada~~kl~~~g~~---~-----e~~~~~lg~~~-~e~-~r~ 399 (434) T protein:vir:98 351 ---------------------TEAEVRWANPAHVTMAVKADAATKLKSIGYP---L-----DVIAEELDESP-ARV-RRI 399 (434) T ss_pred ---------------------eeeeEEecCCCCCCHHHHHHHHHHHHhcCCc---H-----HHHHHhCCCCH-HHH-HHH Confidence 1222222223333355566677776553321 1 12334444321 122 222 Q ss_pred HhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 585 RNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQKATN 639 (708) Q Consensus 585 ~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qae~~k~~~ 639 (708) .+.. .++....++...+. .++ .+... -..+. -... T Consensus 400 ~~e~------------~~~~~~~~~~~~~~-~~~---~~g~~-~~~~~---~~dg 434 (434) T protein:vir:98 400 VAGA------------ASQALLAASLLPAP-GAP---SAGNV-PDSGG---AVDG 434 (434) T ss_pred HHHH------------HHHHHHHHhhhccC-CCC---CCCCC-CcccC---CCCC Confidence 1100 00000000000000 000 00000 00000 0000 No 121 >protein:vir:101494 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655388;genbank:gi:109522576;genbank:GeneID:4157566 Probab=99.00 E-value=6.2e-09 Score=65.58 Aligned_cols=506 Identities=9% Similarity=0.016 Sum_probs=214.1 Q ss_pred CCcchHHHHHHHHHHHHHH-------HHhhHHHHHHHHHHHHHhhcCCC--CCCHHHHHHhhhhhhhcCCCceeecchHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRA-------YSPQKEVREKCIEATRFARVPGG--QWEGATAAGTKLDEQFEKYPKFEINKVAT 71 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~-------~~~~~~~r~~~~~d~~~~~~~G~--Qw~~~~~~~l~~~~q~~grp~~~~N~i~~ 71 (708) |+-- .+-..-. .-|... +....+.|-.+- ++--+||.|+ ||.. .+.....+ .-| ++.++ T Consensus 1 ~~~~-~~~~~~~-~~~~~g~~~~p~~v~~~d~~Rl~aY-~l~~~~y~n~~~~~~~-~lrg~~~~---~~r-~~~~p---- 68 (527) T protein:vir:10 1 MGQD-KRQYGST-QQLRAGEANFPNAVTDFDKARLASY-RLYEDMYLTNTSDYQV-ILRGGDEG---DQR-PIYVP---- 68 (527) T ss_pred CCcc-ccccCCC-cCcCCccccCcccCCHHHHHHHHHH-HHHHHHhcCchhheee-ecCCcccc---ccc-eeeeh---- Confidence 3310 0000000 000000 111111111111 1223577775 5532 11111111 123 34333 Q ss_pred HHHHHHHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCC Q lcl|Aclame:pro 72 ELNRIIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDD 151 (708) Q Consensus 72 ~i~~i~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~ 151 (708) ....++| ....+.+.+-+. .+...++-...+++...+.++.+.....+-.++++-|-|++++.||.... . T Consensus 69 s~~~~~~----~~~~~~~~g~~~-~~~~~~e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~-----~ 138 (527) T protein:vir:10 69 NGEKLIE----AKMRFLGQGLKW-EFSKKDAKVDDAIKVLFDRENWEQKFESLKRWTEIRGDYVLLLIGDDEKD-----E 138 (527) T ss_pred hhHHhhC----CcceeeccCccc-cccchhHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccCCC-----c Confidence 3344444 334455545442 23344666788889999999999999999999999999999999875431 2 Q ss_pred CcceeeEEeecchhheecCCccccCChhccCeEEEeec----CCHHHHHHhCCCCc-ccccccccccccccCCCCCceeE Q lcl|Aclame:pro 152 RQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYS----LSPEKYEAEYGKKP-PTSLDVTSMTSWEYNWFGADVIY 226 (708) Q Consensus 152 ~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~----~~~~e~~~~~p~~~-~~~~d~~~~~~~~~~~~~~~~~~ 226 (708) .+.+++..+ +| . +|| |- ..+| +.+++...+. -..++-++-+-... ....... ++..........++ T Consensus 139 ~~R~~v~~~-DP-~-~~f-~~-ed~d--~~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l--~~~g~~~~~G~~~y 209 (527) T protein:vir:10 139 GSRLSLHEV-DP-S-TYF-PY-EDPR--YPGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTL--DDDGKPVPGGAIKY 209 (527) T ss_pred CCCceEeec-Cc-c-eee-ee-ecCC--CCCceeeEEEeeeccCCccccccceehhhhhhhhhc--CcccccccCcceee Confidence 245555543 23 2 222 32 3333 6666666643 22222222110000 0000000 00000011111122 Q ss_pred EeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCc Q lcl|Aclame:pro 227 IAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHI 306 (708) Q Consensus 227 v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~ 306 (708) ....|. .|++. +..+... ... .+ . ...++..+...|.|.+.+ T Consensus 210 t~~~w~---------------lg~w~----d~~e~p~----~~~--~~------------~-~~~~~~~l~~lp~pi~fi 251 (527) T protein:vir:10 210 TEELYE---------------PGKWD----DRPESPL----EPD--DI------------K-KLSTLTEEEPLPEQITTL 251 (527) T ss_pred eeceee---------------ccccc----ccccccc----chh--hh------------h-hhcCceeeecccCCCCcc Confidence 111221 11110 0000000 000 00 0 112233334567777888 Q ss_pred ceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeecc Q lcl|Aclame:pro 307 PLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLRE 386 (708) Q Consensus 307 p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~ 386 (708) |+|.|-..+ ..++..|+|-..+++++++.+|+.+|....++..++.+.+....-+..+...........+ +.++--+ T Consensus 252 PvV~~~t~p--~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi~~~tg~~~vd~~G~~~~~~VgP-G~iweL~ 328 (527) T protein:vir:10 252 PVFHFRGHP--IMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGFYATDSAPPRDSRGNMVPWTISP-LGMVEHG 328 (527) T ss_pred ceEeecCCC--ccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCceeeecccccccccCCcCccccCC-ceeEecC Confidence 888763333 3556678899999999999999999999999999988776663222111100000011111 1221111 Q ss_pred cccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcc--cccc-hhHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 387 VRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQ--MPSN-IAQETVNNLMNRADMASFIYLDN 463 (708) Q Consensus 387 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G--~~~n-~sg~ai~~~q~q~~~~~~~~~dn 463 (708) ...++.......-...+...+....+.|.+++|+.....| ..++ -||.|.....+.--. ++ T Consensus 329 ----------e~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s~~~SG~ALeL~L~PLla------r~ 392 (527) T protein:vir:10 329 ----------QNNKIYRVNGVASLEPSQTHMTKAEEAMQQTKGIPDIAVGVVDAAVAESGIALDLKLSAILS------SC 392 (527) T ss_pred ----------CCcceeeccchhhhHHHHHHHHHHHHHHHHhhcCCeeeeccccCCcCcHHHHHHHHHHHHHH------HH Confidence 1113333332222334556677778899999999999999 3444 499987765554311 11 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHH Q lcl|Aclame:pro 464 MAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDAT 543 (708) Q Consensus 464 ~~~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~ 543 (708) -+. ++++..+++=|-...+.|-.-- .+-+.+ .|. ...++|.|.=+|-.|+.+.+. T Consensus 393 ~rk------~L~~~~vqrq~~~~~~~~~L~a---ye~v~~---------------~d~-~~~~~v~ivf~p~lP~D~~av 447 (527) T protein:vir:10 393 AEQ------ELELKSVLKQFFYNLVTQWLPA---YEGVGI---------------DDA-DKKLTVTITFRDPKPVNSEKR 447 (527) T ss_pred HHH------HHHHHHHHHHhhhhhHHHHHHH---hhhccc---------------CCC-ccccceEEEecccCCCCHHHH Confidence 111 2222222211100011000000 000111 110 123577888899999999999 Q ss_pred HHHHHHHHHhccccCchhHHHHHHHHhh-ccchhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 544 VSVLTNVLSSMLPTDPMRPAIQGIILDN-IDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVL 622 (708) Q Consensus 544 ~~~l~~llq~~~~~~p~~~~~~~~~~~~-~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~ 622 (708) ++++..|.+.+.- ...+++.+ .+..+. .++ +++-+... . T Consensus 448 ie~v~tL~~aGi~-------S~~tAv~~L~~~~g~----------------eD~--E~E~~~I~---------------~ 487 (527) T protein:vir:10 448 FNQLLQLWEAGLI-------PAKKLTEELSKIMGF----------------ELT--EEDFKQAT---------------E 487 (527) T ss_pred HHHHHHHHHcCch-------hHHHHHHHHHhccCC----------------CCh--HHHHHHHH---------------H Confidence 9999988775421 11112111 111110 111 11111000 0 Q ss_pred HHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 623 AQAQMVAAQAEAQ---KATNETAQTQIKAFTAQQDAMESQANT 662 (708) Q Consensus 623 aq~~~~~~qae~~---k~~~~~~~~q~e~~~~~~~~~~~~a~~ 662 (708) ..+++++++|++. .+++-.. ..++- -+.+.+.+-... T Consensus 488 era~~a~a~a~A~~~~~a~~~~~-~g~~~--~~~d~~~~~~~~ 527 (527) T protein:vir:10 488 DKKTQGIAQAEAADPFGAQMAAE-QGIPD--EEDDQALNGQPL 527 (527) T ss_pred HHHHHhHHhhhhcCchhhhhccc-cCCCC--CCcccccCCCCC Confidence 1111112222211 0110000 00000 000000000001 No 122 >protein:vir:102239 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655205;genbank:gi:109522785;genbank:GeneID:4157478 Probab=99.00 E-value=6.5e-09 Score=65.45 Aligned_cols=506 Identities=10% Similarity=0.019 Sum_probs=214.3 Q ss_pred CCcchHHHHHHHHHHHHHH-------HHhhHHHHHHHHHHHHHhhcCCC--CCCHHHHHHhhhhhhhcCCCceeecchHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRA-------YSPQKEVREKCIEATRFARVPGG--QWEGATAAGTKLDEQFEKYPKFEINKVAT 71 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~-------~~~~~~~r~~~~~d~~~~~~~G~--Qw~~~~~~~l~~~~q~~grp~~~~N~i~~ 71 (708) |+-- .+-..-. .-|... +....+.|-.+- ++--+||.|+ ||.. .+.....+ .-| ++.++ T Consensus 1 ~~~~-~~~~~~~-~~~~~g~~~~p~~v~~~d~~Rl~aY-~l~~~~y~n~~~~~~~-~lrg~~~~---~~r-~~~~p---- 68 (527) T protein:vir:10 1 MGQD-KRQYGST-QQLRAGEANFPNAVTDFDKARLASY-RLYEDMYLTNTSDYQV-ILRGGDEG---DQR-PIYVP---- 68 (527) T ss_pred CCcc-ccccCCC-cCcCCccccCcccCCHHHHHHHHHH-HHHHHHhcCchhheee-ecCCcccc---ccc-eeeeh---- Confidence 3310 0000000 000000 111111111111 1223577775 5632 11111111 123 34333 Q ss_pred HHHHHHHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCC Q lcl|Aclame:pro 72 ELNRIIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDD 151 (708) Q Consensus 72 ~i~~i~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~ 151 (708) ....++| ....+.+.+-+. .+...++-...+++...+.++.+.....+-.++++-|-|++++.||.... . T Consensus 69 s~~~~~~----~~~~~~~~g~~~-~~~~~~e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~-----~ 138 (527) T protein:vir:10 69 NGEKLIE----AKMRFLGQGLKW-EFSKKDAKVDDAIRVLFDRENWEQKFESLKRWTEIRGDYVLLLIGDDEKD-----E 138 (527) T ss_pred hhHHhhC----CcceeeccCccc-cccchhHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccCCC-----c Confidence 3344444 334455545442 23344666788889999999999999999999999999999999875431 2 Q ss_pred CcceeeEEeecchhheecCCccccCChhccCeEEEeec----CCHHHHHHhCCCCc-ccccccccccccccCCCCCceeE Q lcl|Aclame:pro 152 RQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYS----LSPEKYEAEYGKKP-PTSLDVTSMTSWEYNWFGADVIY 226 (708) Q Consensus 152 ~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~----~~~~e~~~~~p~~~-~~~~d~~~~~~~~~~~~~~~~~~ 226 (708) .+.+++..+ +| . +|| |- ..+| +.+++...+. -..++-++-+-... ....... ++..........++ T Consensus 139 ~~R~~v~~~-DP-~-~~f-~~-ed~d--~~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l--~~~g~~~~~G~~~y 209 (527) T protein:vir:10 139 GSRLSLHEV-DP-S-TYF-PY-EDPR--YPGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTL--DDDGKPVPGGAIKY 209 (527) T ss_pred CCCceEeec-Cc-c-eee-ee-ecCC--CCCceeeEEEeeeccCCccccccceehhhhhhhhhc--CcccccccCcceee Confidence 245555543 23 2 222 32 3333 6666666643 22222222110000 0000000 00000011111122 Q ss_pred EeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCc Q lcl|Aclame:pro 227 IAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHI 306 (708) Q Consensus 227 v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~ 306 (708) ....|. .|++. +..+... ... .+ . ...++..+...|.|.+.+ T Consensus 210 t~~~w~---------------lg~w~----d~~e~p~----~~~--~~------------~-~~~~~~~l~~lp~pi~fi 251 (527) T protein:vir:10 210 TEELYE---------------PGKWD----DRPESPL----EPD--DI------------K-KLSTLTEEEPLPEQITTL 251 (527) T ss_pred eeceee---------------ccccc----ccccccc----chh--hh------------h-hhcCceeeecccCCCCcc Confidence 111221 11110 0000000 000 00 0 112233334567777888 Q ss_pred ceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeecc Q lcl|Aclame:pro 307 PLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLRE 386 (708) Q Consensus 307 p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~ 386 (708) |+|.|-..+ ..++..|+|-..+++++++.+|+.+|....++..++.+.+....-+..+...........+ +.++--+ T Consensus 252 PvV~~~t~p--~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi~~~tg~~~vd~~G~~~~~~VgP-G~iweL~ 328 (527) T protein:vir:10 252 PVFHFRGHP--IMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGFYATDSAPPRDSRGNMVPWTISP-LGMVEHG 328 (527) T ss_pred ceEeecCCC--ccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCceeeecccccccccCCcCccccCC-ceeEecC Confidence 888763333 3556678899999999999999999999999999988776663222111100000011111 1221111 Q ss_pred cccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcc--cccc-hhHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 387 VRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQ--MPSN-IAQETVNNLMNRADMASFIYLDN 463 (708) Q Consensus 387 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G--~~~n-~sg~ai~~~q~q~~~~~~~~~dn 463 (708) ...++.......-...+...+....+.|.+++|+.....| ..++ -||.|.....+.--. ++ T Consensus 329 ----------e~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s~~~SG~ALeL~L~PLla------r~ 392 (527) T protein:vir:10 329 ----------QNNKIYRVNGVASLEPSQTHMNKAEEAMQQTKGIPDIAVGVVDAAVAESGIALDLKLSAILS------SC 392 (527) T ss_pred ----------CCcceeeccchhhhHHHHHHHHHHHHHHHHhhcCCeeeeccccCCcCcHHHHHHHHHHHHHH------HH Confidence 1113333332222334566677788899999999999999 3444 499987765554311 11 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHH Q lcl|Aclame:pro 464 MAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDAT 543 (708) Q Consensus 464 ~~~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~ 543 (708) -+. ++++..+++=|-...+.|-.-- .+-+.+ .|. ...++|.|.=+|-.|+.+.+. T Consensus 393 ~rk------~L~~~~Vqrq~~~~~~~~~L~a---ye~v~~---------------~d~-~~~~~v~ivf~p~lP~D~~av 447 (527) T protein:vir:10 393 AEQ------ELELKSVLKQFFYNLVTQWLPA---YEGVGI---------------DDA-DKKLTVTITFRDPKPVNNEKR 447 (527) T ss_pred HHH------HHHHHHHHHHhhhhhHHHHHHH---hhhccc---------------CCC-ccccceEEEecccCCCCHHHH Confidence 111 2222222211100011000000 000111 110 123577888899999999999 Q ss_pred HHHHHHHHHhccccCchhHHHHHHHHhh-ccchhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 544 VSVLTNVLSSMLPTDPMRPAIQGIILDN-IDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVL 622 (708) Q Consensus 544 ~~~l~~llq~~~~~~p~~~~~~~~~~~~-~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~ 622 (708) ++++..|.+.+.- ...+++.+ .+..+. .++ +++-+.. .. T Consensus 448 ie~v~tL~~aGii-------S~etAv~~L~~~~g~----------------eD~--E~E~~~I---------------~~ 487 (527) T protein:vir:10 448 FAQLLELWEAGLI-------PAKKLTEELSKIMGF----------------ELT--EEDFRQA---------------TE 487 (527) T ss_pred HHHHHHHHHcCch-------hHHHHHHHHHhccCC----------------Cch--HHHHHHH---------------HH Confidence 9999988775421 11112111 111111 111 1111000 01 Q ss_pred HHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 623 AQAQMVAAQAEAQ---KATNETAQTQIKAFTAQQDAMESQANT 662 (708) Q Consensus 623 aq~~~~~~qae~~---k~~~~~~~~q~e~~~~~~~~~~~~a~~ 662 (708) ..+++++++|++. .+++-.. ..++ --+.+.+.+-... T Consensus 488 era~~a~a~a~a~~~~~a~~~~~-~g~~--~~~~d~~~~~~~~ 527 (527) T protein:vir:10 488 DKKTQGIAQAEAADPFGAQMAAE-QGIP--DEEDDQALNGQPL 527 (527) T ss_pred HHHHHhHHhhhhcCchhhhhccc-cCCC--CCCcccccCCCCC Confidence 1111122222211 0110000 0000 0000000000001 No 123 >protein:vir:105520 Length: 706 # NCBI annotation: phage portal protein # Family: family:all:487 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516189;genbank:gi:89885992;genbank:GeneID:3964380 Probab=98.96 E-value=9.7e-09 Score=64.53 Aligned_cols=637 Identities=10% Similarity=0.018 Sum_probs=226.0 Q ss_pred hHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHHHHH--hc Q lcl|Aclame:pro 5 LEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEY--RN 82 (708) Q Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~g~~--~~ 82 (708) +-+..+++++++...+++..++..+++....-++ ....- .| |...+-+..++-.. .. T Consensus 1 m~e~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~-----------~f~~~----~G------~QW~~~~~~~l~~~~q~~ 59 (706) T protein:vir:10 1 MAESRQKQHERVMLRFDRAWSPQQVVREKCIEAT-----------RFVRV----PG------GQWEGATVAGTKLDEQFE 59 (706) T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----------Hhhcc----CC------ccCCHHHHHHHHhhhhhc Confidence 7777778888888777777777666665432222 11110 01 12222233333211 12 Q ss_pred CcceeE-------------------EecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeecc Q lcl|Aclame:pro 83 NRITVK-------------------FRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLV 143 (708) Q Consensus 83 nr~~~~-------------------v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~ 143 (708) +||-+. ..+.-...+.+.-.-+.+++..+...---......+..++...++.+ ++.| T Consensus 60 grP~~~~N~i~~~v~~v~g~~~~nr~~~~v~P~~~~~d~~~Ae~l~~l~~~~~~~~~~~~a~s~Af~d~i~~---G~G~- 135 (706) T protein:vir:10 60 KYPKFEINKVATELNRIISEYRNNRISVKFRPGDNAASEELANKLNGLFRADYEETDGGEACDNAFDDAATG---GFGC- 135 (706) T ss_pred CCCceEecchHHHHHHHhhHHHhCCCceEEecCCCCchHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHhhc---Ccce- Confidence 333222 11111111222223345555555444434444455555555555432 1111 Q ss_pred ccCCCCCCCcceeeEEeecchhheecCCccccCChh-----c-cCeEEEe---ecCCHHHHHHhCCCCcccccccccccc Q lcl|Aclame:pro 144 NEYDPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKS-----D-ALWAFCM---YSLSPEKYEAEYGKKPPTSLDVTSMTS 214 (708) Q Consensus 144 ~~~d~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~s-----D-a~~~~~~---~~~~~~e~~~~~p~~~~~~~d~~~~~~ 214 (708) +++..-+++- .||...+.++. | .+-|++- +..+.++..-.|= ...-+++..-... T Consensus 136 -----------~ev~~d~~~~----~d~~~~~~~i~i~~v~~p~~~v~~Dp~a~~~D~sDar~~~~-~~~~~~d~~~~~f 199 (706) T protein:vir:10 136 -----------FRLTTSFVNE----YDPMDERQRIAVEPIYDPARSVWFDPDAKKYDKSDALWAFC-MYSVSLEKYQSEY 199 (706) T ss_pred -----------EEeeeccccc----cCCCCCCccceeeeeccchhceecCchhcccChhhcceEee-eecCCHHHHHHhc Confidence 1111000000 01111110100 0 0001100 1122222211110 0000000000000 Q ss_pred cccCCCCCceeEEeeeeee---cceEEEEEEEecCccCceeEe--cC---CcccchHHHhhccchhhhhhee---eeeEE Q lcl|Aclame:pro 215 WEYNWFGADVIYIAKYYEV---RKESVDVISYRHPITGEIATY--DS---DQVEDIEDELAIAGFHEVARRS---VKRRR 283 (708) Q Consensus 215 ~~~~~~~~~~~~v~e~~~~---~~~~~~~~~~~~~~~~~~~~~--~~---~~~~~~~~~~~~~~~~~~~~~~---~~~~~ 283 (708) .+..+.-... ..-+|.. ...++.+.+|+........++ .. .++..+........+....... ..+++ T Consensus 200 p~~~~~~~~~--~~~~~~~d~~~~d~~~~~eyy~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~ 277 (706) T protein:vir:10 200 DKAPTSLDRV--GSVSWQYDWFTPDVVYIAKYYEVRKESVDVISYRQPLTQEIATYDSEQIADIQDELEQAGFEEIGRRS 277 (706) T ss_pred CCChhhhhhh--ccccccccccCCCcceecccccccceeEEEEEeeccccCCceeeccchhhhhHHHHhhCCchhhhhcc Confidence 0000000000 0011111 122333344443221111111 10 0000000000000000000000 00011 Q ss_pred EEEEEEecceeeecCCCCCC-CCc--ceeeEEEeeeccCCcccccchHHhhh-HHHHHHHHHHHHHHHHHhhcCCCceee Q lcl|Aclame:pro 284 VYVSVVDGDGFLEKPRRIPG-EHI--PLIPVYGKRWFIDDIERVEGHIAKAM-DPQRLYNLQVSMLADTAAQDPGQIPIV 359 (708) Q Consensus 284 v~~~~~~~~~il~~~~~~p~-~~~--p~~p~~~~~~~~d~~~~~~G~vr~~~-d~Q~~~N~~~s~~~~~l~~~~~~~~i~ 359 (708) +..+-+--. ++.+..-.-. ..| ..+||+.+..+... ..+-|....++ +.-+.=...-..+.-++...+..+..+ T Consensus 278 ~~~~~v~~~-~~~g~~~l~~~~p~~~~~~P~vP~~g~r~~-~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~~~~~~~~~ 355 (706) T protein:vir:10 278 VKRRRIYVA-VVDGDGFLEKPRRIPGEHIPLIPVYGKRWF-IDDVERVEGHIAKAMDPQRLYNLQVSMLADAAAQDPGQT 355 (706) T ss_pred cceeeEEEE-eeccccccccCCCCCCCccceEEEeecccc-ccccCcccceeccchhhHHHHHHHHHHHHHHHHhcCCcc Confidence 110000000 0111111100 112 23566544322110 11223333333 333333333334444555556677777 Q ss_pred chhhccchHHHHHhhcccCCceeeeccccccccccccccc--ccccccCccchHHHHHHHHHHHHHHHHHhCCChhHccc Q lcl|Aclame:pro 360 GMEQIRGLEKHWEARNKKRPAFLPLREVRDKSGNIIAGAT--PAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM 437 (708) Q Consensus 360 ~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~ 437 (708) +.+++++++.......+.+.....+-.. .+.|.. +|.. +.+.......+.-....++........+ ....|. T Consensus 356 ~~~~~~~i~~~~~~~~~~~~~~~~~l~~-~~~~~~-~g~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~~i----~~vsGi 429 (706) T protein:vir:10 356 PIVDMEQIRGLEQHWEGRNRKRPAFLPL-RTVTDK-TGNVVAPANVAGYTQAPVLNQALAALLQQTSADI----QEVTGS 429 (706) T ss_pred cccchhHHHHHHHHhhhcccccccchhc-ccccCC-CCcccccccccccCCCcchHHHHHHHHHHHHHHH----HHHhCC Confidence 7777777777666655544433333222 122221 2221 1111111222222233444444444444 234454 Q ss_pred c----cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCC Q lcl|Aclame:pro 438 P----SNIAQETVNNLMNRADMASFIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTG 513 (708) Q Consensus 438 ~----~n~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~ 513 (708) . +..|+++-.+++..-......+.. |-..++..-+.+-+++..+.. .+. +.++.+.|-... + +. T Consensus 430 ~~~~lG~~sn~SG~Ai~~rq~qg~~~~~~-~~Dnl~~~~~~~g~~lL~li~-----~~y---~~~R~~RI~~ed--~-~~ 497 (706) T protein:vir:10 430 SQAMQQMPSNVARETVNSLLNRSDMASFI-YLDNMAKSLKRAGEIWLSMAR-----EIY---GSDREVRIVHED--G-TD 497 (706) T ss_pred CHHHcCCccchHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHH-----HHc---CCCcEEEEecCC--C-Cc Confidence 3 223553333333333333333332 223334444444455554432 111 233555554322 1 23 Q ss_pred ceEEeeccc----ee----eEEEEE---eecccchhHHHHH-HHHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHH Q lcl|Aclame:pro 514 AVVALNDLS----VG----RYDVTV---DVGPSYTARRDAT-VSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFK 581 (708) Q Consensus 514 ~~~~~nDi~----~g----~~Dv~v---~~~~~~~~~r~~~-~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~ 581 (708) .++++|... .| ..||++ ++..+........ .+.+..|++.++...|..+....++.-+.+........ T Consensus 498 ~~v~in~~~~d~~~G~~~~~nDi~~g~yDv~i~~~p~~~t~r~~~~~~m~el~~~~~p~~~~~~~l~~~~~~~~d~p~~~ 577 (706) T protein:vir:10 498 DIALMNAAVLDNQTGRVVALNDLSTGRYDVSVDVGPSYSARRDATVNALTQLLQGMLPQDPMRPALMGIIIDNMEGEGLD 577 (706) T ss_pred cceeeccceeccccCceeeeecceeeeEEEEEecccCcchHHHHHHHHHHHHHHhcCCcchhhHHHHHHHHhhcCccchH Confidence 455665421 12 246654 5555544444444 44444555544444443333333332233333333333 Q ss_pred HHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 582 EYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQAN 661 (708) Q Consensus 582 e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qae~~k~~~~~~~~q~e~~~~~~~~~~~~a~ 661 (708) +.............+..++.+ +.++..+++++.++.++++++.++++++.++++++++++.++.+.+.++..++.. T Consensus 578 e~~e~irk~~~~q~~~~~~~~----~eq~~~~q~qq~q~~q~~~~~~~~~aq~~~~qA~~~k~~a~~~q~~~~a~~a~~q 653 (706) T protein:vir:10 578 DFKAFNRRQLLTQGIVKPRNQ----QEQAIVQQAQQAQATQPDPNMLLAQAQMVVAQAEAQKSQNETVQTQIKAFTAQQD 653 (706) T ss_pred HHHHHHHHhhcccCCccccch----hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 333333222222222222222 1222222223344455566666677777777887777777777766666555544 Q ss_pred HHHHHHHHHHHHHHH-HHHHHHHHHhhhhhhhhhhhcCCCCCCCCCCC Q lcl|Aclame:pro 662 TVYKLAQARNIDDKA-VMEAIRLLKDVAESQQQQFQSPPQSPADLMPS 708 (708) Q Consensus 662 ~~~~~~q~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~ 708 (708) +.+...++......+ +++..+..+..+..++....++-.+|++..|+ T Consensus 654 a~~~~~~~~~~~~~a~~~~~~~~~q~~q~l~~~~a~q~~~~~~~~~~~ 701 (706) T protein:vir:10 654 AMESQANTVYKLAQARNIDDKAVMETLRLLKEVAASQQQTIPSPPSPA 701 (706) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCCCc Confidence 444333332222222 12222222222222222333444455666666 No 124 >protein:vir:105429 Length: 708 # NCBI annotation: gene 3 protein # Family: family:all:487 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958179;genbank:gi:41057281;genbank:GeneID:2716676 Probab=98.93 E-value=1.4e-08 Score=63.71 Aligned_cols=643 Identities=10% Similarity=-0.012 Sum_probs=216.1 Q ss_pred hHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHH------- Q lcl|Aclame:pro 5 LEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRII------- 77 (708) Q Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~------- 77 (708) +.++++++++++...++...++...|++...-++ .... ..| |...+-+.+++ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~D~-----------~f~~----~~G------~QW~~~~~~~l~~~~q~~ 59 (708) T protein:vir:10 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEAT-----------RFAR----VPG------GQWEGATAAGTKLDEQFE 59 (708) T ss_pred CchhHHHHHHHHHHHHHHHHHhhHHHHHHHHHHH-----------Hhhc----CCC------CCCCHHHHHHHHHhhhhc Confidence 9999999999999988888888777775433222 1110 001 12222222222 Q ss_pred --------------HHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeecc Q lcl|Aclame:pro 78 --------------AEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLV 143 (708) Q Consensus 78 --------------g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~ 143 (708) ....+....-...++--..+.+.-..+.+++..+...---......+..++..+++.+ .++ |. T Consensus 60 grP~~~~N~i~~~v~~v~g~~~~nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~d~i~~-G~G--w~ 136 (708) T protein:vir:10 60 KYPKFEINKVATELNRIIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATG-GFG--CF 136 (708) T ss_pred CCCceEEcchHHHHHHHHHHHHhCCcceEEEcCCCCchHHHHHHHHHHHHHHHHhcCchHHHHHHHHhhhhc-ccc--ee Confidence 2222222222222221123333224456666666555555555666777777777542 111 11 Q ss_pred cc-CCCCCCCcceeeEEeecchhheecCC-ccccCChhccCeEEEeecCCHHHHHHhCCCC------ccccccccccccc Q lcl|Aclame:pro 144 NE-YDPMDDRQRIAIEPIYDPSRSVWFDP-DAKKYDKSDALWAFCMYSLSPEKYEAEYGKK------PPTSLDVTSMTSW 215 (708) Q Consensus 144 ~~-~d~~~~~~~i~i~~v~~~~~~v~~Dp-~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~------~~~~~d~~~~~~~ 215 (708) .- .|. ..+.++....-.++... .+|| .+.-+|. ..+..+.++.+-.|=.+ ....+. ..... T Consensus 137 ~~~~d~-~~e~d~~~~~~~i~i~~-~~~p~~~v~~Dp-------~a~~~D~sDar~~~~~~~~~~d~~~~~~p--~~a~~ 205 (708) T protein:vir:10 137 RLTSML-VNEYDPMDDRQRIAIEP-IYDPSRSVWFDP-------DAKKYDKSDALWAFCMYSLSPEKYEAEYG--KKPPT 205 (708) T ss_pred eeeecc-ccccCCCCCccccceEE-eecchhhcccCc-------cccccChhhhhhhhhccCCCHHHHHHhCC--CCccc Confidence 00 000 00000000000000000 1111 1111111 01112333332221000 000000 00011 Q ss_pred ccCCCCCceeEEeeeeeecceEEEEEEEecCccCceeEecC---CcccchHHH-hhccchhhhhhe--eeeeEEEEEEEE Q lcl|Aclame:pro 216 EYNWFGADVIYIAKYYEVRKESVDVISYRHPITGEIATYDS---DQVEDIEDE-LAIAGFHEVARR--SVKRRRVYVSVV 289 (708) Q Consensus 216 ~~~~~~~~~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~-~~~~~~~~~~~~--~~~~~~v~~~~~ 289 (708) .++|..... ....|...+.+++..|+++......++.+.+ +.+..+... ............ .+.++++.++.+ T Consensus 206 ~~d~~~~~~-~~~~~~~~d~v~v~ey~~r~~~~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v 284 (708) T protein:vir:10 206 SLDVTSMTS-WEYNWFGADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRV 284 (708) T ss_pred ccccccCCC-ccccccCCCceEEEEeeeEEEEEEEEEEEecCCCCceeeecchhhhhHHHHHHhcccchhheeeeeeEEE Confidence 112211111 1123444444444444444333333333322 223222221 111111111111 122333333322 Q ss_pred ecceeeecCC--CCCC-CCcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcC-CCceeechhhcc Q lcl|Aclame:pro 290 DGDGFLEKPR--RIPG-EHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDP-GQIPIVGMEQIR 365 (708) Q Consensus 290 ~~~~il~~~~--~~p~-~~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~-~~~~i~~~~ai~ 365 (708) .-..+. ++. ..|. --+.++|++.+.-+.... .+......++..=+..=...++..-.+.... ..+-...-.... T Consensus 285 ~~~~~~-g~~~le~~~~~p~~~fP~vP~~g~r~~~-d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~ 362 (708) T protein:vir:10 285 YVSVVD-GDGFLEKPRRIPGEHIPLIPVYGKRWFI-DDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGME 362 (708) T ss_pred EEEeec-chhhhccCCCCCCCceeeEEEeeeeecc-CCCcccceeecccchhHHHHHHHHHHHHHHHHhcCCcccccChh Confidence 222222 211 1222 223345666443222110 1111111211111111111111100000000 000000000000 Q ss_pred chHHHHHhhcccCCceeeecccccccccc-cccccccccccC--ccchHHHHHHHHHHHHHHHHHhCCChhHcccccchh Q lcl|Aclame:pro 366 GLEKHWEARNKKRPAFLPLREVRDKSGNI-IAGATPAGYTQP--AVMNQALAALLQQTSADIQEVTGGSQAMQQMPSNIA 442 (708) Q Consensus 366 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~--~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n~s 442 (708) .+.....+....+...- ......... ..|........+ .+.+.-...+++.+...+..+.-++..+.+..+..| T Consensus 363 ~i~~~~~~~~~~~~~~~---~~~~~~~~~~~~G~~~~~~~~~~~~q~~~~~~~~~~l~q~~~~~i~~vsG~~~~~lG~~s 439 (708) T protein:vir:10 363 QIRGLEKHWEARNKKRP---AFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPS 439 (708) T ss_pred hhhhHHHHHhhccccch---hhhccccccccccccccccCCccccCCccchHHHHHHHHHHHHHHHHHhCcChhHccCcc Confidence 00000000000000000 000000000 000101010011 122233344667777777777555433333334445 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEee--- Q lcl|Aclame:pro 443 QETVNNLMNRADMASFIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALN--- 519 (708) Q Consensus 443 g~ai~~~q~q~~~~~~~~~dn~~~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~n--- 519 (708) +++-.++.+.-......+..-|. .++..-+.+-+++..+.. .+. +.++.+.|-... + ....+.+| T Consensus 440 n~SG~aI~~rq~qg~~~l~~~~D-nl~~~~~~~g~~lL~li~-----~~y---~~er~~RI~~ed--g-~~~~v~in~~~ 507 (708) T protein:vir:10 440 NIAQETVNNLMNRADMASFIYLD-NMAKSLKRAGEVWLSMAR-----EVY---GSEREVRIVNED--G-SDDIAVLSAQV 507 (708) T ss_pred chHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHH-----HHc---CCCcEEEEecCC--C-CcceEEeccee Confidence 44333333333333333333222 233333444444444432 112 233555554321 1 12334444 Q ss_pred -cccee----eEEEEE---eecccch-hHHHHHHHHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhh Q lcl|Aclame:pro 520 -DLSVG----RYDVTV---DVGPSYT-ARRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLI 590 (708) Q Consensus 520 -Di~~g----~~Dv~v---~~~~~~~-~~r~~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~ 590 (708) |...| ..||++ ++..+.. +.-....+.+..|++.++...|..+....++.-+.+........+.+...... T Consensus 508 ~d~~~g~~~~~nDi~~g~yDv~i~~~p~~~s~r~~~~~~l~qll~~~~p~~~~~~~~~~~~l~~~D~p~~~ei~erir~~ 587 (708) T protein:vir:10 508 VDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQ 587 (708) T ss_pred ccCCCcceeeeeccceeeEEEEEecccCchhHHHHHHHHHHHHHHhcCCCchhhHHHHHHHHHhcCCcChHHHHHHHHHh Confidence 44444 356654 5555544 44444545555555544444443333333322222222222222333322222 Q ss_pred hhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 591 SGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQAR 670 (708) Q Consensus 591 ~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qae~~k~~~~~~~~q~e~~~~~~~~~~~~a~~~~~~~q~~ 670 (708) .....+..+ ..++.++..++.++.++.+++.++.+++++..+.++++++++.++.+.++++...+..+.+...++. T Consensus 588 ~~~~~~~~~----~~~ee~q~~~~~q~~~q~q~~~~~~e~qa~~~~~qAe~~ka~a~a~~~~~~a~q~~~~~~~a~~~a~ 663 (708) T protein:vir:10 588 LLISGIAKP----RNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTV 663 (708) T ss_pred hcccccccc----cchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 212222122 2222222223333344445555555666666677777777777666665555444433333322222 Q ss_pred HHHHHH---HHHHHHHHHhhhhhhhhhhhcCCCCCCCCCCC Q lcl|Aclame:pro 671 NIDDKA---VMEAIRLLKDVAESQQQQFQSPPQSPADLMPS 708 (708) Q Consensus 671 ~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~ 708 (708) ++-..+ ++.......+..+..+.. .+...+.+|. T Consensus 664 q~~~~a~~~~~~~~~~~~q~l~~~q~~----q~~~~~~~p~ 700 (708) T protein:vir:10 664 YKLAQARNIDDKAVMEAIRLLKDVAES----QQQQFQSPPQ 700 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhhhhh----HHHHHhcccc Confidence 211111 111111111111111111 1233446666 No 125 >protein:vir:105619 Length: 772 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164304;genbank:gi:56692922;genbank:GeneID:3197230 Probab=98.92 E-value=1.5e-08 Score=63.55 Aligned_cols=615 Identities=11% Similarity=0.067 Sum_probs=195.9 Q ss_pred CCcchHHHHHHHHHHHHH--HHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDR--AYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIA 78 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~--~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~g 78 (708) |- .++.+-..+.. +.. ........+.....|.+ + | +++-...+++..=+.|. .....+..++- T Consensus 1 ~~-~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~-----~-q-~~~r~~a~~d~~fy~G~------QW~~~~~~~l~ 65 (772) T protein:vir:10 1 MQ-ITENDRQYLNG-LPPAGDTPLTVDEYADINYEIE-----D-Q-PAWRAVADKEMDYADGN------QLDTELLRRQQ 65 (772) T ss_pred CC-cchhhHHhhcc-CCcccccccCHHHHHHHHHHHh-----c-c-HHHHHHHHHHHHhhcCC------CCCHHHHHHHH Confidence 32 22221111110 000 00000111222221211 1 1 11111222222222332 22233333332 Q ss_pred HHhcCcceeEE-------------------ecC-CCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEE Q lcl|Aclame:pro 79 EYRNNRITVKF-------------------RPG-DREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRL 138 (708) Q Consensus 79 ~~~~nr~~~~v-------------------~pr-~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v 138 (708) . +.+|-+.+ .++ .+.++.+ -+.+.+++..+...-........+..++..+++.+ .+ T Consensus 66 ~--~g~p~~~~N~i~~~v~~v~g~~~~nr~d~~v~Pr~~~~-d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~-G~ 141 (772) T protein:vir:10 66 A--LGIPPAVEDLIGPALLSLQGYEAVTRTDWRVTPNGDVG-GQEVADALNYRLNTAERQSGADRACSEAFRPQIAC-GI 141 (772) T ss_pred h--cCCCcEEEcchHHHHHHHHHHHHhcCcceEEecCCCch-HHHHHHHHHHHHHHHHHhcChHHHHHHHHHHhhhc-Cc Confidence 2 22222221 111 1222223 34556666666666555666667777887777653 22 Q ss_pred EeeccccCCCCCCCcceeeEEeecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCC---cc---ccc-cc-- Q lcl|Aclame:pro 139 TSMLVNEYDPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKK---PP---TSL-DV-- 209 (708) Q Consensus 139 ~~~~~~~~d~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~---~~---~~~-d~-- 209 (708) + |... ....+..-..+.+ -..||...-+|.+ | +. |.++.+-.|=.+ .+ ..+ +. T Consensus 142 G--w~e~----~~~~d~~~~~i~i----~~v~p~~v~~Dp~-a------~~-D~sDar~~~~~~~~~~d~~~~~fp~~a~ 203 (772) T protein:vir:10 142 G--WVEV----SRESDPFKFPYRC----RPIRRDEIHWDMK-C------GD-DWEACRFLRRQRWLSPDRIALVFPEHAE 203 (772) T ss_pred e--eEEe----ccccCCCCCCeEE----EeeCcccceecCC-C------CC-CHHHhhhhhhhccCCHHHHHHhCCCchh Confidence 2 2211 1111111111110 1124444434421 1 11 444443222100 00 000 00 Q ss_pred --ccccccccCCCC-----------Cc------------eeEEeeeeeecceEEEEEEEecCccCceeEecCCcc--cch Q lcl|Aclame:pro 210 --TSMTSWEYNWFG-----------AD------------VIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQV--EDI 262 (708) Q Consensus 210 --~~~~~~~~~~~~-----------~~------------~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~ 262 (708) .....+..+|.+ .+ ++....||...++++++++||.+.......+..... ..+ T Consensus 204 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rVrv~E~w~r~~~~~~~~~~~~g~~~~~ 283 (772) T protein:vir:10 204 LIGMVGKYGSTWWGQPDLGMMEGGTSTGLHNAWNEARAWTVQEDHWYNPTSKEICLVELWYRRWVQVHVLKSPDGRVVEY 283 (772) T ss_pred HHHhhhhhcccccCcccccccccccccccccccchhhccccccccccccCCceEEEEEEeeeeeeeeeeeccCCCceEee Confidence 001111222211 11 112345677777788777777655444444443333 222 Q ss_pred HHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCC--cc--eeeEEEeeeccCCcccccchHHhhhHHHHHH Q lcl|Aclame:pro 263 EDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEH--IP--LIPVYGKRWFIDDIERVEGHIAKAMDPQRLY 338 (708) Q Consensus 263 ~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~--~p--~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~ 338 (708) ...-..... .+....+..++.+...+-.. ++.++.-.-.+. || ++||+++..+.+ ...|....++..=+.. T Consensus 284 ~~~~~~~~~-~l~~g~~~~~~~~~~rv~~~-~~~g~~~L~~~~~p~~~~~fP~vP~~g~r~---~~~g~~~G~vr~~kd~ 358 (772) T protein:vir:10 284 DPNNLAHNI-ALASGRISPKKVTVSRVRRS-YWLGPHCLHDGPTPYTHRHFPYVPFFGFRE---DATGIPYGYVRGMKYA 358 (772) T ss_pred CcccHHHHH-HHhhcccchheeeeeEEEEE-EEecceeeccCCCCCCCCccceEEEeeeEe---ccCCcccchhhhhhhH Confidence 222111222 22233333333333333222 333433332222 34 478887655433 2366655544322222 Q ss_pred HHHHHHHHHHHhhcCCCceeechhhc---cchHHHHHhhcccCCceeeecccccccccccccccccccccCccchHHHHH Q lcl|Aclame:pro 339 NLQVSMLADTAAQDPGQIPIVGMEQI---RGLEKHWEARNKKRPAFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAA 415 (708) Q Consensus 339 N~~~s~~~~~l~~~~~~~~i~~~~ai---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 415 (708) =...++..- +-.+++..-++ .|..+..+...... +.+++.....+......+...+....-+.-... T Consensus 359 Qr~~N~~~S------~~~~~l~~~~~~~~~gav~~~d~~~~e~----~arp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~ 428 (772) T protein:vir:10 359 QDSLNSGVS------KLRWGMSVARVERTKGAVAMTDAQFRRQ----IARPDADIVLDENHMAKPGARFDVKRDYTLTDQ 428 (772) T ss_pred HHHHHHHHH------HHHHHHhcccccccCCCccchhHHHHHh----ccCCCCeEEeCCccccCCCCCccccCCccccHH Confidence 222222111 11222222221 11111111100000 001111000111111122333334344444566 Q ss_pred HHHHHHHHHHHHhCCChhHcccccc---hhHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEE Q lcl|Aclame:pro 416 LLQQTSADIQEVTGGSQAMQQMPSN---IAQETVNNLMNR--ADMASFIYLDNMAKSLKRAGEVWLSMAREVYGSEREVR 490 (708) Q Consensus 416 l~~~~~~~~~~~tGv~~~~~G~~~n---~sg~ai~~~q~q--~~~~~~~~~dn~~~~~~~~~~~~l~li~~~y~~~r~ir 490 (708) .++.+...+..+. ...|.... ..+.+.+.+.-+ -......+. .+-.-+++.-+.+-+++..+.- . T Consensus 429 ~~~llq~~~~~i~----~vsGv~~~~lG~~~na~SGvAi~~rq~qg~~~l~-~~~Dnl~~~~~~~g~~lL~li~-----~ 498 (772) T protein:vir:10 429 HFQMLQDNRATIE----RVSNITAGFQGRKGTATSGIQEQQQIEQSNQSIG-RIMDNFRAGRTLVGELLLAMIV-----E 498 (772) T ss_pred HHHHHHHHHHHHH----HHhCCCHHHcCCCcchhhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHH-----H Confidence 7777777776664 34454332 233443332211 111111111 2222233333333344443332 1 Q ss_pred EeccCCCceEEEeccccccc-CCCceEEeeccc----eeeE----EEEEe---ecc-cchhHHHHHHHHHHHHHHhcccc Q lcl|Aclame:pro 491 IVNEDGSDDIAVLSAQVVDR-QTGAVVALNDLS----VGRY----DVTVD---VGP-SYTARRDATVSVLTNVLSSMLPT 557 (708) Q Consensus 491 I~~~~~~~~~v~in~~~~~~-~~~~~~~~nDi~----~g~~----Dv~v~---~~~-~~~~~r~~~~~~l~~llq~~~~~ 557 (708) +. +.++.+.|-.. |+ ....++.+|... .|+. ||++. +.. ..++.-....+.+..|++.++. T Consensus 499 ~y---~~er~~RI~~~--d~~~~~~~v~in~~~~d~~tg~~~~~NDi~~g~yDv~i~~~p~~~t~r~~~~~~m~ql~~~- 572 (772) T protein:vir:10 499 DI---GQERTEVVIEG--DAVTADRVVVLNEPQRDPQTGAAYLSNDLLRTRIKVALEDVPSTNSYRGQQLNAMSEAVKS- 572 (772) T ss_pred Hc---CCCcEEEEecC--CCCCCCceEEeccceecccccccceeccceeeeEEEEeeccccchHHHHHHHHHHHHHHhc- Confidence 11 23345555332 22 224555666432 2332 54432 222 3444444445555555554321 Q ss_pred CchhHHHHHHHHhhccchhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 558 DPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQ---MAAQSQPNPEMVLAQAQMVAAQAEA 634 (708) Q Consensus 558 ~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~q---q~qq~~~~~~~~~aq~~~~~~qae~ 634 (708) +.+.+...++. -+.+... -|..++.....++.+ ..++.+++.+ .+.+.+++++++++ T Consensus 573 --~~P~~~~~~~~--------~~le~~D---------~p~~~ei~~~ir~~~~~~~peq~~~~~~-q~~qq~~~~~~~el 632 (772) T protein:vir:10 573 --MPPQYQAAVLP--------FLVSLMD---------VPFKRDVVEAIRAVDQQQTPEQIQQQID-QAVQDALAKAGNDI 632 (772) T ss_pred --cChhHHHHHHH--------HHHhhcC---------CCChHHHHHHHHHHhccCChHHHHHHHH-HHHHHHHHHHHHHH Confidence 12222111111 0111110 111222222222111 1111111111 11111222223332 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHhhhhhhhhhhh-----------cCCCC Q lcl|Aclame:pro 635 QKATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVMEAIR--LLKDVAESQQQQFQ-----------SPPQS 701 (708) Q Consensus 635 ~k~~~~~~~~q~e~~~~~~~~~~~~a~~~~~~~q~~~~~~~~~~~~~~--~~~~~~~~~~~~~~-----------~~~~~ 701 (708) + ..+.+++..++.++++..++++.+...++.....++.....+ .+...+....+... ..|.+ T Consensus 633 ~-----~~q~~a~~~~~~A~a~~~~aqa~~~~~~a~~~a~~aa~~~~q~~q~a~~ad~~l~~~g~~~~~~~~~~~~~p~~ 707 (772) T protein:vir:10 633 K-----LRELEIKERKADSEISGLNAKAVQIGVQAAFSAMQAGAQIAQMPMIAPIADAVMQSAGYQRPNPAGDDPNYPIA 707 (772) T ss_pred H-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHhhhhhhHHHHHHHHhcccccccccccCCCCCCC Confidence 2 222233333333444444444433333222211222111111 11111111111111 11111 Q ss_pred CCCCCCC Q lcl|Aclame:pro 702 PADLMPS 708 (708) Q Consensus 702 ~~e~~~~ 708 (708) +..-+|+ T Consensus 708 ~~~a~~~ 714 (772) T protein:vir:10 708 DQTAAMN 714 (772) T ss_pred CCccCCC Confidence 1111111 No 126 >protein:vir:100920 Length: 725 # NCBI annotation: Gp1 # Family: family:all:487 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006406;genbank:gi:46358698;genbank:GeneID:2777070 Probab=98.86 E-value=2.6e-08 Score=62.18 Aligned_cols=629 Identities=8% Similarity=-0.073 Sum_probs=218.9 Q ss_pred hHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHHHHHhcCc Q lcl|Aclame:pro 5 LEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEYRNNR 84 (708) Q Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~g~~~~nr 84 (708) +-+ .+..+.++...+...-++ ++++-...+++..=+.| |...+-+..++ ..+.| T Consensus 1 m~d-~~~~~~~~~~~~~~~~~~-----------------~~~~R~~a~~d~~fy~G------~QW~~~~~~~l--~~q~r 54 (725) T protein:vir:10 1 MAD-NENRLESILSRFDADWTA-----------------SDEARREAKNDLFFSRV------SQWDDWLSQYT--TLQYR 54 (725) T ss_pred CCc-hHHHHHHHHHHHHHHHHh-----------------hHHHHHHHHHHHHhhcC------CCCCHHHHHHH--HhcCC Confidence 111 122222222222222121 12222222232222223 22233333333 33445 Q ss_pred ceeEEecCC-----------------CcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccC- Q lcl|Aclame:pro 85 ITVKFRPGD-----------------REASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEY- 146 (708) Q Consensus 85 ~~~~v~pr~-----------------~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~- 146 (708) |.++.++.. -..+ .....+.+++..+....-...-...+.-++..+++. +++.|..-. T Consensus 55 p~~N~i~~~v~~v~g~e~~nr~d~~v~p~~-~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~---~G~G~~ev~~ 130 (725) T protein:vir:10 55 GQFDVVRPVVRKLVSEMRQNPIDVLYRPKD-GASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIE---AGVGAWRLVT 130 (725) T ss_pred CcccchHHHHHHHHhhHHhCCcceEEecCC-cchHHHHHHHHHHHHHHHHhcCcchHHhHHHHHHhh---cCcceeeeec Confidence 532221110 0111 112223333333333322222223333333333321 111111000 Q ss_pred C-CCCCCcceeeEEeecchhheecCCccccCChhccCeEEEeecCCHHHHH----HhCCCCcc-ccc---cccccccccc Q lcl|Aclame:pro 147 D-PMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYE----AEYGKKPP-TSL---DVTSMTSWEY 217 (708) Q Consensus 147 d-~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~----~~~p~~~~-~~~---d~~~~~~~~~ 217 (708) | ......+..+.. ....+++||...-+|.. .+..+.++.+ .+|-.... +.+ ....+ .... T Consensus 131 d~~~~d~~~~~~~i---~~~~i~~~~~~v~~Dp~-------a~~~D~sDar~~~~~~~~~~~~~~~~~~~~~~~a-~~~~ 199 (725) T protein:vir:10 131 DYEDQSPTSNNQVI---RREPIHSACSHVIWDSN-------SKLMDKSDARHCTVIHSMSQNGWDDFAEKYDLDA-DNIP 199 (725) T ss_pred cccCCCCCCCceee---eeeecccCHhHcccCch-------hhccChhhhhhhhhhccCCHHHHHHHHHhCCCcc-cccc Confidence 0 000001111110 11122333332222321 1222333332 11111100 000 00000 0111 Q ss_pred CCCCCceeEEeeeeeecceEEEEEEEecCccCceeEecC---CcccchHHHhhcc-chhhhhheee--eeEEEEEEEEec Q lcl|Aclame:pro 218 NWFGADVIYIAKYYEVRKESVDVISYRHPITGEIATYDS---DQVEDIEDELAIA-GFHEVARRSV--KRRRVYVSVVDG 291 (708) Q Consensus 218 ~~~~~~~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~-~~~~~~~~~~--~~~~v~~~~~~~ 291 (708) +|.+... ....|+..+++++..||+..+....++.+.+ ++...+...-... ....+..... ..+++..+.+.- T Consensus 200 ~~~~~~~-~~~~~~~~~~vrv~E~~~r~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~ 278 (725) T protein:vir:10 200 SFQNPND-WVFPWLTQDTIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYK 278 (725) T ss_pred ccccccc-ccccccCCCeEEEEEEEEEEEEeeEEEEeccCCCCceeecchhhhHHHHHHhhcccchhhhhccceeeEEEE Confidence 2222221 1235666667666666666555544544433 2222222211111 1112222221 122222221111 Q ss_pred ceeeecCCCC---CCCCcceeeEEEeeeccCCcccccch--HHhhhHHHHHHHHHHHHHHHHHhhcC-CCceeechhhcc Q lcl|Aclame:pro 292 DGFLEKPRRI---PGEHIPLIPVYGKRWFIDDIERVEGH--IAKAMDPQRLYNLQVSMLADTAAQDP-GQIPIVGMEQIR 365 (708) Q Consensus 292 ~~il~~~~~~---p~~~~p~~p~~~~~~~~d~~~~~~G~--vr~~~d~Q~~~N~~~s~~~~~l~~~~-~~~~i~~~~ai~ 365 (708) .. +.++... +.....++||+.+.-+... ..|. +..++..=+..=...+...-.+.... ...-....+..+ T Consensus 279 ~~-~~g~~~l~~~~~~~~~~fP~vP~~g~r~~---~~g~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~ 354 (725) T protein:vir:10 279 SI-ITCTAVLKDKQLIAGEHIPIVPVFGEWGF---VEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPE 354 (725) T ss_pred Ee-ecchhhhcCCCCCCCCceeEEEEEeeeec---cCCcceeeeeeccchhHHHHHHHHHHHHHHHHHhcCCccccccHh Confidence 11 1222211 1223345788765433221 1232 22444333333333333322222221 111111111111 Q ss_pred chHHHHHhhcccCCceeeeccccccccc-----ccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccc Q lcl|Aclame:pro 366 GLEKHWEARNKKRPAFLPLREVRDKSGN-----IIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSN 440 (708) Q Consensus 366 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n 440 (708) .++.... . +.++..+.... ..+|+.+-..+..-..++-...+++.+......+ ....|.... T Consensus 355 ~i~~~e~--~-------~~~~~~~~~~~~~~~~~~~g~~~~~~i~~~~~~~~p~~~~~ll~~~~~~i----~~~tGi~~~ 421 (725) T protein:vir:10 355 QIAGFEH--M-------YDGNDDYPYYLLNRTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAV----KEVATLGVD 421 (725) T ss_pred hhhHHHH--H-------HhccCCceeeecccccccCcccccccCcccCCCCchHHHHHHHHHHHHHH----HHHhCCCHH Confidence 1221111 0 01111110000 0111111122222222233334555555555444 345554332 Q ss_pred ---hhHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCce Q lcl|Aclame:pro 441 ---IAQETVNNLMNRADMA--SFIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAV 515 (708) Q Consensus 441 ---~sg~ai~~~q~q~~~~--~~~~~dn~~~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~ 515 (708) ..|.+.+.+.-++-.. ...+.. |-..++.--+.+-+++..+.. .+. +.++.+.|-... . .... T Consensus 422 ~lG~~~n~~SG~ai~~rq~qg~~~l~~-~~Dnl~~~~~~~g~~lL~lI~-----~~~---~~er~~RI~~ed--g-~~~~ 489 (725) T protein:vir:10 422 AEAVNGGQVAYDTVNQLNMRADLETYV-FQDNLATAMRRDGEIYQSIVN-----DIY---DVPRNVTITLED--G-SEKE 489 (725) T ss_pred HhCcCchhhHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHH-----HHc---CCCcEEEEecCC--C-Ccce Confidence 3444444433222222 222222 223333333334444443332 111 233555553321 1 2244 Q ss_pred EEeecc----ceeeE----EEE--Eeecccch-hHHHHHHHHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHH Q lcl|Aclame:pro 516 VALNDL----SVGRY----DVT--VDVGPSYT-ARRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYN 584 (708) Q Consensus 516 ~~~nDi----~~g~~----Dv~--v~~~~~~~-~~r~~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~ 584 (708) +++|.- ..|+. |++ .++..+.+ +.-...-+.+..|++.+....|..+.....++..++++..+.+.+.. T Consensus 490 v~in~~~~d~~~G~~v~~Ndi~g~~Dv~v~~~p~~~s~r~~~~~~l~qll~~~~~~~~~~~~~l~~~~~~~d~~~~~e~~ 569 (725) T protein:vir:10 490 VQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRSEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMR 569 (725) T ss_pred eEeccccccccccchhhhhccccceeEEEeeccCcHHHHHHHHHHHHHHHHhccccchhHHHHHHHHhhcCCchhHHHHH Confidence 555542 22331 231 45555544 44334445555666666666666666667777788888777777766 Q ss_pred HhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 585 RNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVY 664 (708) Q Consensus 585 ~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qae~~k~~~~~~~~q~e~~~~~~~~~~~~a~~~~ 664 (708) ...........+..+...+.++ ..+++++.++.++++++.++++++.+++++.++++.+..+++.++...+..+.. T Consensus 570 erirkq~~~~~~~~~~~~e~~q----~~~e~qq~~~~q~~~e~~q~~~~~~~~qae~~ka~aE~~k~~~~a~~~~~~a~~ 645 (725) T protein:vir:10 570 DYANKQLIQMGVKKPETPEEQQ----WLVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQL 645 (725) T ss_pred HHHHhhhhhhccCCccccchhH----HHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 6655433333333333222222 222223334445566666777777777777778888777777776666655554 Q ss_pred HHHHHHHHHHHHHHHHH-------HHHHhhhhhhhhhhh-----------c---CCCCCCCCCCC Q lcl|Aclame:pro 665 KLAQARNIDDKAVMEAI-------RLLKDVAESQQQQFQ-----------S---PPQSPADLMPS 708 (708) Q Consensus 665 ~~~q~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~-----------~---~~~~~~e~~~~ 708 (708) .+++......++...+. ++....++....... . +-.-.....-| T Consensus 646 ~a~~~~~~~~q~~~~q~~~~~~~~~~~~~~q~~~~~~~~~~ae~~~~~~~~~~~~~~~~~~~~~~ 710 (725) T protein:vir:10 646 NAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDARANAELLLKGNEQTHKQRMDIANILQS 710 (725) T ss_pred HHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHhhhhhcccc Confidence 44443332222222211 111111111100000 0 00001111122 No 127 >protein:vir:9263 Length: 725 # NCBI annotation: 1 # Family: family:all:487 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720327;genbank:gi:24371585;genbank:GeneID:955785 Probab=98.72 E-value=8.5e-08 Score=59.35 Aligned_cols=629 Identities=9% Similarity=-0.064 Sum_probs=214.8 Q ss_pred hHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHHHHHhcCc Q lcl|Aclame:pro 5 LEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEYRNNR 84 (708) Q Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~g~~~~nr 84 (708) +-+ .+..+.++...+...-++. +++-...+++..=+.|. ...+-+..++ ..+.| T Consensus 1 m~d-~~~~~~~~~~~~~~~~~~~-----------------~~~r~~a~~d~~fy~G~------Qw~~~~~~~l--~~q~r 54 (725) T protein:vir:92 1 MAD-NENRLESILSRFDADWTAS-----------------DEARREAKNDLFFSRIS------QWDDWLSQYT--TLQYR 54 (725) T ss_pred CCc-hHHHHHHHHHHHHHHHHhh-----------------HHHHHHHHHHHHhhcCC------CCCHHHHHHH--HhcCC Confidence 111 1222222222222222222 22222223332222232 2222333332 22445 Q ss_pred ceeEEecCC-----------------CcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccC- Q lcl|Aclame:pro 85 ITVKFRPGD-----------------REASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEY- 146 (708) Q Consensus 85 ~~~~v~pr~-----------------~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~- 146 (708) |.++.++.. -..+ ...+.+.+++..+....-...-...+.-++..+++. +++.|..-. T Consensus 55 p~~N~i~~~i~~v~g~e~~nr~d~~v~P~~-~~d~~~Ae~l~~~~~~~~~~~~~~~a~s~Af~~~i~---~G~G~~ev~~ 130 (725) T protein:vir:92 55 GQFDVVRPVVRKLVSEMRQNPIDVLYRPKD-GASPDAADVLMGMYRTDMRHNTAKIAVNVAVREQIE---SGVGAWRLVT 130 (725) T ss_pred CcccchHHHHHHHHhhHHhCCcceEEecCC-ccHHHHHHHHHHHHHHHHHhhCchHHHHHHHHHHhh---cCcceeeeee Confidence 532221110 0111 112233333333333332233333333344333322 111111000 Q ss_pred C-CCCCCcceeeEEeecchhheecCCccccCChhccCeEEEeecCCHHHHHHhC-----CCCcccccc--cccccccccC Q lcl|Aclame:pro 147 D-PMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEY-----GKKPPTSLD--VTSMTSWEYN 218 (708) Q Consensus 147 d-~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~-----p~~~~~~~d--~~~~~~~~~~ 218 (708) | ......+..+.... ..|+.++...-+|. ..+..+.++..-.| +......+. .......... T Consensus 131 d~~~~d~~~~~~~i~~---~~i~~~~~~V~~Dp-------~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 200 (725) T protein:vir:92 131 DYEDQSPTSNNQVIRR---EPIHSACSHVIWDS-------NSKLMDKSDSRHCTVIHSMSQNGWEDFAEKYDLDADDIPS 200 (725) T ss_pred cccCCCCCCCceeeEE---eeccCChhhcccCc-------hhhccChhhHHHHHHHhcCCHHHHHHHHhhcCcchhhhhh Confidence 0 00000001111000 01111111111121 12223333333211 100000000 0000000111 Q ss_pred CCCCceeEEeeeeeecceEEEEEEEecCccCceeEecC---CcccchHHHhhccc-hhhhhheee--eeEEEE-EEEEec Q lcl|Aclame:pro 219 WFGADVIYIAKYYEVRKESVDVISYRHPITGEIATYDS---DQVEDIEDELAIAG-FHEVARRSV--KRRRVY-VSVVDG 291 (708) Q Consensus 219 ~~~~~~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~-~~~~~~~~~--~~~~v~-~~~~~~ 291 (708) |.+... ....|+..+++++..||++.+....++.+.+ ++...+...-.... ...+..... ..+++. +.+.. T Consensus 201 ~~~~~~-~~~~~~~~d~vrv~e~~~r~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~- 278 (725) T protein:vir:92 201 FQNPND-WVFPWLTQDTIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYK- 278 (725) T ss_pred cccCCc-ccccccCCCeEEEEEEEEEEEEeeeEEeecCCCCCceeecChhhHHHHHHHHhccCchhhhhccceeeeEee- Confidence 111111 1234555566555555555454444443332 23332222211111 111222211 122222 22222 Q ss_pred ceeeecCCCC--C-CCCcceeeEEEeeeccCCcccccch--HHhhhHHHHHHHHHHHHHHHHHhhcC-CCceeechhhcc Q lcl|Aclame:pro 292 DGFLEKPRRI--P-GEHIPLIPVYGKRWFIDDIERVEGH--IAKAMDPQRLYNLQVSMLADTAAQDP-GQIPIVGMEQIR 365 (708) Q Consensus 292 ~~il~~~~~~--p-~~~~p~~p~~~~~~~~d~~~~~~G~--vr~~~d~Q~~~N~~~s~~~~~l~~~~-~~~~i~~~~ai~ 365 (708) . ++.++... | .....++||+.+.-+.+. ..|. +..++..=+..=...++..-.+.... ...-....+..+ T Consensus 279 ~-~~~g~~~l~~~~~~~~~~~P~vP~~g~r~~---~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~ 354 (725) T protein:vir:92 279 S-IITCTAVLKDKQLIAGEHIPIVPVFGEWGF---VEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPE 354 (725) T ss_pred e-eecchhhhcCCCCCCCCceeeEEEEeeeec---cCCcccccceeccchhHHHHHHHHHHHHHHHHHhccCcccccchh Confidence 1 11222211 1 122335788765433222 1232 22444333333333333222221111 111001111111 Q ss_pred chHHHHHhhcccCCceeeeccccccccc-----ccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccc Q lcl|Aclame:pro 366 GLEKHWEARNKKRPAFLPLREVRDKSGN-----IIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSN 440 (708) Q Consensus 366 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n 440 (708) .++..... +-++....... ..+|..+-..+..-..++-....++.+......+ ....|.... T Consensus 355 ~i~~~~~~---------~~~~~~~~~~~~~~~~~~~g~~~~~~i~~~~~~~~p~~~~~ll~~~~~~i----~~~tGi~~~ 421 (725) T protein:vir:92 355 QIAGFEHM---------YDGNDDYPYYLLNRTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAV----KEVATLGVD 421 (725) T ss_pred hhhHHHHH---------HhccCccceeeccccccccccccccCCcccCCCCchHHHHHHHHHHHHHH----HHHhCCCHH Confidence 11111110 00111110000 0111111122222222233334555555555544 344554332 Q ss_pred h---hHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCce Q lcl|Aclame:pro 441 I---AQETVNNLMNRADM--ASFIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAV 515 (708) Q Consensus 441 ~---sg~ai~~~q~q~~~--~~~~~~dn~~~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~ 515 (708) . .+.+.+.+.-+... ....+. .|-..++.--+.+-+++..+.. .+. +.++.+.|-.. |. .... T Consensus 422 ~lG~~~n~~SG~ai~~rq~qg~~~l~-~~~Dnl~~~~~~~g~~lL~lI~-----~~~---~~~r~~RI~~e--dg-~~~~ 489 (725) T protein:vir:92 422 AEAVNGGQVAYDTVNQLNMRADLETY-VFQDNLATAMRRDGEIYQSIVN-----DIY---DVPRNVTITLE--DG-SEKE 489 (725) T ss_pred HhccCchhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHH-----Hhc---CCCcEEEEecC--CC-Ccce Confidence 2 33334333322211 122222 2223333333444444443332 111 23355555332 11 2244 Q ss_pred EEeecc----ceeeE----EEE--Eeecccch-hHHHHHHHHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHH Q lcl|Aclame:pro 516 VALNDL----SVGRY----DVT--VDVGPSYT-ARRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYN 584 (708) Q Consensus 516 ~~~nDi----~~g~~----Dv~--v~~~~~~~-~~r~~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~ 584 (708) +.+|.- ..|+. ||+ .++..+.+ +.-...-+.+..|++.+....|..+.....+...++.+..+.+.+.. T Consensus 490 v~in~~~~~~~~G~~~~~Ndi~g~~Dv~v~~~p~~~s~r~~~~~~l~ql~~~~~~~~~~~~~~l~~~~~~~d~~~~~e~~ 569 (725) T protein:vir:92 490 VQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMR 569 (725) T ss_pred EEeccccccccccchhhhhccccceeeEEeeccChHHHHHHHHHHHHHHHHhcccchhHHHHHHHHHhhcccchHHHHHH Confidence 555542 22221 332 44444444 44344445555556555555555565566677777777777777766 Q ss_pred HhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 585 RNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVY 664 (708) Q Consensus 585 ~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qae~~k~~~~~~~~q~e~~~~~~~~~~~~a~~~~ 664 (708) ...........+..+.. ++.++..+++++.++.++++++.++++.+.+++++.++++.+..+.+.++.+.++.+.+ T Consensus 570 erirkq~~~~~~~~~~~----~e~~q~~~~~qqa~~~q~~~e~~~~qa~~~~~qae~~kaqaE~~k~q~~a~~~~~~a~~ 645 (725) T protein:vir:92 570 DYANKQLIQMGVKKPET----PEEQQWLVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQL 645 (725) T ss_pred HHHHhhhchhccCCccc----hhhhHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 55543333322222222 22222233333444555666777778888888888888899888888888777777666 Q ss_pred HHHHHHHHHHHHHHHHH-------HHHHhhhhhhhhhhhcCC---------------------CCCCCCCCC Q lcl|Aclame:pro 665 KLAQARNIDDKAVMEAI-------RLLKDVAESQQQQFQSPP---------------------QSPADLMPS 708 (708) Q Consensus 665 ~~~q~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~---------------------~~~~e~~~~ 708 (708) .+++...+..++.+... +.....++.++......+ +++..-.|| T Consensus 646 ~aa~~~~~~~q~~~~q~~~~~~~~~~~~~~q~~~~~~a~~~ae~~l~~~~~~~~~~~d~~~~~~~~~~~~~~ 717 (725) T protein:vir:92 646 NAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDARANAELLLKGNEQTHKQRMDIANILQSQRQNQPS 717 (725) T ss_pred HHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHhchHHHHHHHHHHHHHHHHHHHhcchhccCCc Confidence 65555443333322221 111111111111110000 111111222 No 128 >protein:vir:93630 Length: 776 # NCBI annotation: Bcep22gp51 # Family: family:all:487 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944280;genbank:gi:38640357;genbank:GeneID:2658279 Probab=98.67 E-value=1.3e-07 Score=58.37 Aligned_cols=609 Identities=10% Similarity=0.006 Sum_probs=166.4 Q ss_pred CCcchHHHH-------------HHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeec Q lcl|Aclame:pro 1 MAETLEKKH-------------ERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEIN 67 (708) Q Consensus 1 ma~~~~~~~-------------~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N 67 (708) ||++.+..- .+++.++...+....+ .+.++-....+..+=+.|.. T Consensus 22 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~-----------------~~~~~r~~a~~d~~fy~G~Q----- 79 (776) T protein:vir:93 22 LSPGEDAAQREKPANPLDSEQAVELHSRLLSYYRQELS-----------------RQQDNRAEMAVDEDYYDNIQ----- 79 (776) T ss_pred CCCCCcccchhcccCCCCCHHHHHHHHHHHHHHHHHHh-----------------hchHHHHHHHHHHHHhCCCC----- Confidence 655433221 2222222222211111 12222222222222223331 Q ss_pred chHHHHHHHHHHHhcCcceeEE-------------------ecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHH Q lcl|Aclame:pro 68 KVATELNRIIAEYRNNRITVKF-------------------RPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDA 128 (708) Q Consensus 68 ~i~~~i~~i~g~~~~nr~~~~v-------------------~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~ 128 (708) .......++ ..+.++-+.+ .++--.. ..--..+.+++..+...--.......++.++ T Consensus 80 -w~~~~~~~l--~~~g~p~~~~N~i~~~i~~v~g~~~~nr~~~~~~p~-~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a 155 (776) T protein:vir:93 80 -WSQDEIDEL--KERGQAPTVYNVISQSVNWIIGSEKRGRSDFKVLPR-RKDGGKAAERKTALLKYLSDVNHTPFERSMA 155 (776) T ss_pred -CCHHHHHHH--HhcCCceEEecchHHHHHHHHHHHHhCCcceEEecC-ChhHHHHHHHHHHHHHHHHHhhcHHHHHHHH Confidence 112222222 1123332222 1111111 1112222333333333222222233334444 Q ss_pred hhcCeeEEEEEeeccccCCCCCCCcceeeEEeecchhheecCCccccCCh-----hccCeEEE---eecCCHHHHHHhCC Q lcl|Aclame:pro 129 ATGGFGCFRLTSMLVNEYDPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDK-----SDALWAFC---MYSLSPEKYEAEYG 200 (708) Q Consensus 129 ~~~G~G~~~v~~~~~~~~d~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~-----sDa~~~~~---~~~~~~~e~~~~~p 200 (708) ...++.+ |.| |..|+||.+.. .+. -+..-+++ .+..+.+++.-.|= T Consensus 156 f~d~~~~---------------G~G----------~~~v~~d~~~~-~~~~~~~~~~p~~i~~Dp~a~~~D~sDar~~~~ 209 (776) T protein:vir:93 156 FEETTKA---------------GIG----------WLESQVQDEND-GEPIYAGAESWRNILWDSTYRRLDMDDCRYIFR 209 (776) T ss_pred HHHhhhc---------------Ccc----------eEEEEeeccCC-CCceEeeccChhheeeccccccCCHHHHhhhhh Confidence 4433221 111 12233343211 010 00000111 11122333322221 Q ss_pred CC------cccccccc---c--ccccccCCCC---CceeE--------------EeeeeeecceEEEEEEEe--cCccCc Q lcl|Aclame:pro 201 KK------PPTSLDVT---S--MTSWEYNWFG---ADVIY--------------IAKYYEVRKESVDVISYR--HPITGE 250 (708) Q Consensus 201 ~~------~~~~~d~~---~--~~~~~~~~~~---~~~~~--------------v~e~~~~~~~~~~~~~~~--~~~~~~ 250 (708) .. ....+... . ....++.+.+ .+... ...|+...++++++++|| .+.+.. T Consensus 210 ~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~r~~~~~~ 289 (776) T protein:vir:93 210 VKWVDLDVMLAIFPERAAQLRAAAVDNFETWGTDDIDGDDAMDSPEYERSMNSVTAGAVAYARKRVRMIEAWFRMPVRVQ 289 (776) T ss_pred hccCCHHHHHHhcCCchHHHHHhhhhcccccchhcccccccccccccccccccccccccccCCCeEEEEEEEEeeeeehh Confidence 00 00000000 0 0011111111 00000 112222334455544444 444444 Q ss_pred eeEecCCcccchHHHhhccchhh-hhheeeeeEEEEEEEEecceeeecCCCCCCCCcc----eeeEEEeeeccCCccccc Q lcl|Aclame:pro 251 IATYDSDQVEDIEDELAIAGFHE-VARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIP----LIPVYGKRWFIDDIERVE 325 (708) Q Consensus 251 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p----~~p~~~~~~~~d~~~~~~ 325 (708) ++.....++...........+.. +.......++...+.+ ...++.+....-.+..| .+||++.+.+ ....- T Consensus 290 ~~~~~~~~~~~~~~d~~~~~~~~~~~~g~~~~~~~~~~~v-~~~~~~g~~~l~~~~~p~~~~~~Pfv~~~~~---~~~~~ 365 (776) T protein:vir:93 290 RLKGRNSDFRGEVFDPNDERHVLEVESGRAVLAVSPMMRM-HCAIMTTRDLMWAGPSPYRHNRYPFTPIWGF---RRARD 365 (776) T ss_pred hcccccccccceeecccchHHHHHhhcCceeehheeeeee-EEEEEecchhhhccCCCCCCCccceEEecCc---eeccc Confidence 44443333332221111111211 1112222211111111 11222332322222223 4577766432 12234 Q ss_pred chHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeeccc---ccccccccc-ccccc Q lcl|Aclame:pro 326 GHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREV---RDKSGNIIA-GATPA 401 (708) Q Consensus 326 G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~-~~~~~ 401 (708) |+...+.+.=+..-...+...-.+ .+++....+-. ..+++...... ....|.++. ..... T Consensus 366 ~~~~G~v~~~~d~Q~~~N~~~s~~------~~~l~~~~~~~----------~~gav~~~d~~~~~~~rp~~vi~~~~~~~ 429 (776) T protein:vir:93 366 GMPYGVIRFMRGMQDDVNKRLSKA------LYILSTNKVLM----------EEGAVDDIDEFRREAARPDAVMTVKNGKL 429 (776) T ss_pred ccccchHHhhhHHHHHHHHHHHHH------HHhhcCCceee----------ccccccchHHHHHhcccCCceeeeCCccc Confidence 454555554444444444332222 12332222110 00111000000 001111111 11122 Q ss_pred ccccCccchHHHHHHHHHHHHHHHHHhCCChhHccccc---chhHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 402 GYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPS---NIAQETVNNLMN--RADMASFIYLDNMAKSLKRAGEVWL 476 (708) Q Consensus 402 ~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~---n~sg~ai~~~q~--q~~~~~~~~~dn~~~~~~~~~~~~l 476 (708) ........++-...++++.......+..+ .|... +.++.+.+...- ........+. .+.+.+++..+.+. T Consensus 430 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~----tGi~~~~~G~~~n~~Sg~ai~~~~~~~~~~~~-~~~dn~~~~~~~~~ 504 (776) T protein:vir:93 430 GAVKMDVDRDLAPAHLELASRSIQMIQQV----GGVTDEMLGRTTNAVSGVAIQARQEQGSVATN-KLFDNLRLAFQQHG 504 (776) T ss_pred cccccccCcCccHHHHHHHHHHHHHHHHh----hCcChHHhCCCcchhhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHH Confidence 22333233333444555555555454433 34332 234444433221 1122122222 22222233333344 Q ss_pred HHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEE---eecccchhHHHH-HHHHHHHHHH Q lcl|Aclame:pro 477 SMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTV---DVGPSYTARRDA-TVSVLTNVLS 552 (708) Q Consensus 477 ~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v---~~~~~~~~~r~~-~~~~l~~llq 552 (708) .++..+.- .+. +..+.+.|.... ...+++.+|+ ....-||.+ ++..+....... ..+.+..|++ T Consensus 505 ~~~l~li~-----~~~---~~~r~~ri~~~~---~~~~~v~in~-~~~~nd~~~~~~dv~v~~~~~~~s~r~~~~~~l~q 572 (776) T protein:vir:93 505 EKELSLIE-----QYM---TEEKQFRITNSR---GNPEYVTVND-GLPENDITRTKADFIIDEAEWRATMRQAAVAELME 572 (776) T ss_pred HHHHHHHH-----Hhc---CcceEEEEeecC---CCcceEEecc-cchhhhhccceeeEEEeecccchhHHHHHHHHHHH Confidence 43333321 111 233555553321 1234556654 222345543 555555544333 3333334443 Q ss_pred hccccCchhHHHHHHHHh-hccchhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 553 SMLPTDPMRPAIQGIILD-NIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQ 631 (708) Q Consensus 553 ~~~~~~p~~~~~~~~~~~-~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~q 631 (708) .++. +.+.+...++. .+.........+..+.. .....+..+.+.+..+++++++ +.++..++++...++ T Consensus 573 l~~~---~~p~~~~~~~~~~~e~~d~p~~~e~~~~l---~~~~~~~~p~q~~~~~e~~~~q----q~q~~~~q~q~~~~~ 642 (776) T protein:vir:93 573 VIGK---MPPEIALTMLDLLVENMDIPNRDELVKRI---RAVNGQKDPDQDEPTPEEIARE----QAQQQQQQYNDALAI 642 (776) T ss_pred HHhh---cChhhHHHHHHHHHHhcCccchHHHHHHH---HHhhcccccchhhcchhHHHHH----HHhhHHHHHHHHHhh Confidence 2211 11111111110 00000011000111000 0000011111111111111111 111111222222222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhcCCCCCCCC------ Q lcl|Aclame:pro 632 AEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVMEAIRLLKDVAESQQQQFQSPPQSPADL------ 705 (708) Q Consensus 632 ae~~k~~~~~~~~q~e~~~~~~~~~~~~a~~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~------ 705 (708) +++.+.+++..+.+++++++++++.+.++++.....++.....++..+..++....+...+...++++..|..+ T Consensus 643 a~~~~~qa~a~~~~aea~~~~aqa~~~~~~a~~~~~~a~q~a~qa~~~~~~~~~~a~~a~~~~~~a~~~~p~~p~~~~~~ 722 (776) T protein:vir:93 643 ATLEEQQAKARKAAAEAQVAEAKAKHISRMAIREGVGAVKDATDAATAIAFMPELAGLSDGILRESGWDDPNTPQPASAA 722 (776) T ss_pred hhhhHhhHHHHHHHHHHHHHhhhhhhhhhcchhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhccccccccccccccccc Confidence 23333333333444444444444333333332222221111111111111111111111111112222222111 Q ss_pred ---CCC Q lcl|Aclame:pro 706 ---MPS 708 (708) Q Consensus 706 ---~~~ 708 (708) +|+ T Consensus 723 ~~~~~~ 728 (776) T protein:vir:93 723 SGMPPA 728 (776) T ss_pred cCCCCC Confidence 111 No 129 >protein:vir:77597 Length: 725 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:YP_063735;genbank:gi:51236726;genbank:GeneID:2944241 Probab=98.62 E-value=1.9e-07 Score=57.45 Aligned_cols=629 Identities=9% Similarity=-0.073 Sum_probs=216.1 Q ss_pred hHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHHHHHhcCc Q lcl|Aclame:pro 5 LEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEYRNNR 84 (708) Q Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~g~~~~nr 84 (708) +- -.+..+.++...+...-++. +++-...+++..=+.|. ...+-+..++ ..+.| T Consensus 1 m~-d~~~~~~~~~~~~~~~~~~~-----------------~~~r~~a~~d~~fy~G~------Qw~~~~~~~l--~~q~r 54 (725) T protein:vir:77 1 MA-DNENRLESILSRFDADWTAS-----------------DEARREAKNDLFFSRVS------QWDDWLSQYT--TLQYR 54 (725) T ss_pred CC-chHHHHHHHHHHHHHHHHhh-----------------HHHHHHHHHHHHhhCCC------CCCHHHHHHH--HhcCC Confidence 11 11222222222222222222 22222222222222232 2223333332 23445 Q ss_pred ceeEEecCC-----------------CcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccC- Q lcl|Aclame:pro 85 ITVKFRPGD-----------------REASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEY- 146 (708) Q Consensus 85 ~~~~v~pr~-----------------~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~- 146 (708) |.++.++.. -..+ +-.+.+.+++..+....-...-...+.-++..+++. +++.|..-. T Consensus 55 p~~N~i~~~i~~v~g~~~~nr~d~~v~P~~-~~d~~~Ae~l~~~~~~~~~~~~~~~a~s~Af~~~i~---~G~G~~ev~~ 130 (725) T protein:vir:77 55 GQFDVVRPVVRKLVSEMRQNPIDVLYRPKD-GARPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIE---AGVGAWRLVT 130 (725) T ss_pred CccccHHHHHHHHHhhHHhCCcceEEecCC-ccHHHHHHHHHHHHHHHHHhhCchhHHHHHHHHHhh---cCcceeeeee Confidence 432221110 0111 112223333333333332233333333344333322 111111000 Q ss_pred C-CCCCCcceeeEEeecchhheecCCccccCChhccCeEEEeecCCHHHHHHhC-----CCC-ccccccc-ccccccccC Q lcl|Aclame:pro 147 D-PMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEY-----GKK-PPTSLDV-TSMTSWEYN 218 (708) Q Consensus 147 d-~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~-----p~~-~~~~~d~-~~~~~~~~~ 218 (708) | ......+..+... ...+++||...-+|. ..+..+.++..-.| +.. ....+.. ......... T Consensus 131 d~~~~d~~~~~~~i~---~~~~~~~~~~v~~Dp-------~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 200 (725) T protein:vir:77 131 DYEDQSPTSNNQVIR---REPIHSACSHVIWDS-------NSKLMDKSDARHCTVIHSMSQNGWEDFAEKYDLDADDIPS 200 (725) T ss_pred cccCCCCCCCceeeE---EeecccChhhceeCc-------hhhccChhhHHHHHHHhcCCHHHHHHHHhhCCcchhhccc Confidence 0 0000001111100 011222222221221 12223333333221 100 0000000 000001111 Q ss_pred CCCCceeEEeeeeeecceEEEEEEEecCccCceeEecCC---cccchHHHhhccc-hhhhhheee--eeEEEE-EEEEec Q lcl|Aclame:pro 219 WFGADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSD---QVEDIEDELAIAG-FHEVARRSV--KRRRVY-VSVVDG 291 (708) Q Consensus 219 ~~~~~~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~-~~~~~~~~~--~~~~v~-~~~~~~ 291 (708) |.+... ....|+..+++++..||++.+....+..+.+. +...+...-.... ......... ..+++. +..... T Consensus 201 ~~~~~~-~~~~~~~~d~vrv~E~~~r~~~~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~rv~~~ 279 (725) T protein:vir:77 201 FQNPND-WVFPWLTQDTIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKS 279 (725) T ss_pred cccccc-ccccccCCCeeEEEEEEEEEEEeeEEEEecCCCCcceeecChhhHHHHHHHhhhcCchhhhhcccceeeeeEe Confidence 111111 12346666666666666655555444444332 2222221111111 111111111 111221 111111 Q ss_pred ceeeecCCCC--C-CCCcceeeEEEeeeccCCcccccchH--HhhhHHHHHHHHHHHHHHHHHhhcCC-Cceeechhhcc Q lcl|Aclame:pro 292 DGFLEKPRRI--P-GEHIPLIPVYGKRWFIDDIERVEGHI--AKAMDPQRLYNLQVSMLADTAAQDPG-QIPIVGMEQIR 365 (708) Q Consensus 292 ~~il~~~~~~--p-~~~~p~~p~~~~~~~~d~~~~~~G~v--r~~~d~Q~~~N~~~s~~~~~l~~~~~-~~~i~~~~ai~ 365 (708) . +.+.... | .....++||+.+.-+.+. ..|.. ..++..=+..=...++..-.+....+ .......+..+ T Consensus 280 ~--~~g~~~l~~~~~~~~~~~P~vP~~g~r~~---~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~ 354 (725) T protein:vir:77 280 I--ITCTAVLKDKQLIAGEHIPIVPVFGEWGF---VEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPE 354 (725) T ss_pred e--ecCceeeccCCcCCCCccceEEEeeeeec---cCCcccccchhhhhhhHHHHHHHHHHHHHHHHHhccccccccchh Confidence 1 1222211 1 112234677754332221 12322 24443333333333332222221111 11111111111 Q ss_pred chHHHHHhhcccCCceeeecccccc--cccc---cccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccc Q lcl|Aclame:pro 366 GLEKHWEARNKKRPAFLPLREVRDK--SGNI---IAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSN 440 (708) Q Consensus 366 ~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~---~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n 440 (708) .++..... +.+++... .++. .+|..+-+.+.....+.-....++.+......+ ....|.... T Consensus 355 ~i~~~~~~---------~~~~~~~~~~~~~~~~~~~g~~~~~~i~~~~~~~lp~~~~~ll~~~~~~i----~~~tGi~~~ 421 (725) T protein:vir:77 355 QIAGFEHM---------YDGNDDYPYYLLNRTDENSGDLPTQPLAYYENPEVPQANAYMLEAATSAV----KEVATLGVD 421 (725) T ss_pred hhhHHHHH---------HHhccCCceecccccccCCCcccccCccccCCCCchHHHHHHHHHHHHHH----HHHhCCCHH Confidence 11111110 00111110 0000 111111122222222222234444444444444 344565432 Q ss_pred ---hhHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCce Q lcl|Aclame:pro 441 ---IAQETVNNLMNRADM--ASFIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAV 515 (708) Q Consensus 441 ---~sg~ai~~~q~q~~~--~~~~~~dn~~~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~ 515 (708) ..+.+.+.+.-+.-. ....+. .|-..++.--+.+-+++..+.. .+. +.++.+.|-... . .... T Consensus 422 ~lG~~~n~~SG~ai~~rq~qg~~~~~-~~~Dnl~~~~~~~g~~lL~lI~-----~~~---~~~rv~RI~~ed--~-~~~~ 489 (725) T protein:vir:77 422 TEAVNGGQVAFDTVNQLNMRADLETY-VFQDNLATAMRRDGEIYQSIVN-----DIY---DVPRNVTITLED--G-SEKD 489 (725) T ss_pred HhCCCchhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHH-----HHc---CCCcEEEEecCC--C-Ccce Confidence 233334333322222 222222 2333344444444444444432 111 233555553321 1 2245 Q ss_pred EEeeccc----eeeE----EE--EEeecccch-hHHHHHHHHHHHHHHhccccCchhHHHHHHHHhhccchhHHHHHHHH Q lcl|Aclame:pro 516 VALNDLS----VGRY----DV--TVDVGPSYT-ARRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYN 584 (708) Q Consensus 516 ~~~nDi~----~g~~----Dv--~v~~~~~~~-~~r~~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~ 584 (708) +++|... .|+. || ..++..+.. +.-...-+.+..|++.+....|..+.....+...++.+..+.+.+.. T Consensus 490 v~in~~~~~~~~G~~~~~NDi~g~~Dv~v~~~p~~~s~r~~~~~~l~qll~~~~~~~~~~~~~l~~~~~l~d~~~~~e~~ 569 (725) T protein:vir:77 490 VQLMAEVVDLATGEKQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMR 569 (725) T ss_pred eeecccccccccchhHhhhhhccceeeEEeeccchHHHHHHHHHHHHHHHHhccccchhHHHHHHHhhccccchHHHHHH Confidence 5555332 2221 11 133444443 44344445555666666656556566666777778888888887766 Q ss_pred HhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 585 RNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVY 664 (708) Q Consensus 585 ~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qae~~k~~~~~~~~q~e~~~~~~~~~~~~a~~~~ 664 (708) ...........+.++.... .++..++.++.++.++++++.++|+.+.+++++.++++.+..+++.++...++.+.. T Consensus 570 erirkq~~~~~~~q~~~~~----e~q~~~~~qq~~~~q~~~e~~q~q~~~~~~qa~~~kaq~e~~k~q~~a~~~~~~a~~ 645 (725) T protein:vir:77 570 DYANKQLIQMGVKKPETPE----EQQWLVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQL 645 (725) T ss_pred HHHHhhhhhhhccCCCChh----hHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 6554333332222222222 222223333344555666677777777888888888888888888877777766665 Q ss_pred HHHHHHHHHHHHHHHHH----HHHHhhhhhhhhh---hh--cCCCC--------------------CCCCCCC Q lcl|Aclame:pro 665 KLAQARNIDDKAVMEAI----RLLKDVAESQQQQ---FQ--SPPQS--------------------PADLMPS 708 (708) Q Consensus 665 ~~~q~~~~~~~~~~~~~----~~~~~~~~~~~~~---~~--~~~~~--------------------~~e~~~~ 708 (708) .+++...+..++..+.. +.+....+.+.+. .. +.... ..+.+|- T Consensus 646 ~aa~~~~~~~q~~~~q~a~~~~~~~~~~~~q~~~~~~~~~~ae~~~~~~~~~~~q~~~~~~~~~~~~~~~~~~ 718 (725) T protein:vir:77 646 NAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDARANAELLLKGDEQTHKQRMDIANILQSQRQNQPSG 718 (725) T ss_pred HHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHhhhHHHhhHHHHHHHHHHHHhcCCCc Confidence 55555444444433332 2121111111110 00 00000 1111111 No 130 >protein:vir:108295 Length: 711 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552284;genbank:gi:160700609;genbank:GeneID:5758811 Probab=98.61 E-value=1.9e-07 Score=57.39 Aligned_cols=621 Identities=14% Similarity=0.049 Sum_probs=182.0 Q ss_pred CCcchHHH-HHHHHH-HHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHHH Q lcl|Aclame:pro 1 MAETLEKK-HERIML-RFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIA 78 (708) Q Consensus 1 ma~~~~~~-~~~~~~-~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~g 78 (708) ||-|..+- +.++-. ..+.......+.++...+.+.....+-..|+++-...+++..=+.|. ...+-+.+++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~------Qw~~~~~~~l- 73 (711) T protein:vir:10 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGE------QWPSQVRTER- 73 (711) T ss_pred CCcccccccccchhHHHHHhcccCcchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHhCCC------CCCHHHHHHH- Confidence 88543321 111110 00111111112222222222211111223333333334333333343 2223333322 Q ss_pred HHhcCcceeEE--ecCC-----C------------cch---------------------HHHHHHHHHHHHHHHHhcChH Q lcl|Aclame:pro 79 EYRNNRITVKF--RPGD-----R------------EAS---------------------EELANKLNGLFRADYEETDGG 118 (708) Q Consensus 79 ~~~~nr~~~~v--~pr~-----~------------~~d---------------------~~~A~~l~~~~~~~~~~~~~~ 118 (708) +.+++|-+.+ ++.. + .-+ .+--.-+.+++..+....-.. T Consensus 74 -~~~g~p~~~~N~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~ 152 (711) T protein:vir:10 74 -ELEQRPCLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYN 152 (711) T ss_pred -HhcCCCcEEEcchHHHHHHHhhhHhhCCcceEEecccccchhhhhhhhccccccccCCChhHHHHHHHHHHHHHHHHHh Confidence 2333443332 1110 0 000 000112222222222221111 Q ss_pred HHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCcceeeEEeecchhheecCCcccc---CCh-----hccCeEEE---e Q lcl|Aclame:pro 119 EACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAIEPIYDPSRSVWFDPDAKK---YDK-----SDALWAFC---M 187 (708) Q Consensus 119 ~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~i~~v~~~~~~v~~Dp~a~~---~D~-----sDa~~~~~---~ 187 (708) .....++.++...++. .|.| |.+|++|+.+.. .++ .|..-+++ . T Consensus 153 ~~~~~~~s~af~d~~~---------------~G~G----------~~ev~~d~~~~d~~~~e~~i~~v~~p~~v~~Dp~a 207 (711) T protein:vir:10 153 CDAETEYDIAFQGAVE---------------SGMG----------YLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDA 207 (711) T ss_pred cChhHHHHHHHHHhhh---------------cCcc----------eEEEEecccCCCCCCCCeEEeeecChhheeeCccc Confidence 1222222233333221 1111 112333332110 000 00110111 1 Q ss_pred ecCCHHHHHHhCCCC-----c-ccccccccccccccCCCCCceeEEeeeeeecceEEEEEEEecCccCceeEecCCcccc Q lcl|Aclame:pro 188 YSLSPEKYEAEYGKK-----P-PTSLDVTSMTSWEYNWFGADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVED 261 (708) Q Consensus 188 ~~~~~~e~~~~~p~~-----~-~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 261 (708) +..+.+++.-.|=.+ . ...+...-.......|.. .-..|++.+++++..||+..+..-..+.+.++.... T Consensus 208 ~~~D~sDar~~~~~~~~~~~~~~~~yp~~a~~~~~~~~~~----~~~~~~~~~~vrv~E~~~r~~~~~~~~~~~~~~~~~ 283 (711) T protein:vir:10 208 KKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSVA----DYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFW 283 (711) T ss_pred cccChhhhcceeeeecCCHHHHHHhCCchhhhhhhccccc----ccCcccCcceeeEEEEEeeeeeeeEEEeecCCceec Confidence 112223332111000 0 000000000011111111 112366666666666666665555555544443332 Q ss_pred hHHHhhccchhhhhhe--e-eeeEEEEEEEEecceeeecCCCCCC-CCcce--eeEEEeeeccCCcccccchHHhhhHHH Q lcl|Aclame:pro 262 IEDELAIAGFHEVARR--S-VKRRRVYVSVVDGDGFLEKPRRIPG-EHIPL--IPVYGKRWFIDDIERVEGHIAKAMDPQ 335 (708) Q Consensus 262 ~~~~~~~~~~~~~~~~--~-~~~~~v~~~~~~~~~il~~~~~~p~-~~~p~--~p~~~~~~~~d~~~~~~G~vr~~~d~Q 335 (708) ..... ..+...... . +..+.+..+.+. ..++.++.-.-. ..+|+ +||+++.-+.. ...+.|....++..= T Consensus 284 ~~~~~--~~~~~~~~~g~~~~~~~~~~~~~v~-~~~~~G~~~L~~~~p~~~~~~P~vp~~g~r~-~~d~~~~~~G~vr~~ 359 (711) T protein:vir:10 284 LDALE--DIVDELLEAGISIVRTRKVKTFKTY-WRKITGANVLEGPVEIPSTTIPVIPVWGKSL-IIKKKEIFRSIIRHS 359 (711) T ss_pred cCcch--hHHHHHHhcCchhhhhhhhceeeEE-EEEEecceeecCCCCCCCCcccEEEEeeeee-ccccccccchhhhhh Confidence 22211 111111111 1 111111111111 112233332211 12333 67775543221 112344444443322 Q ss_pred HHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceee----ecccccccccc---cccccccccccCcc Q lcl|Aclame:pro 336 RLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLP----LREVRDKSGNI---IAGATPAGYTQPAV 408 (708) Q Consensus 336 ~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~---~~~~~~~~~~~~~~ 408 (708) +..=...+...-.+... +...+-..+. ...+++.. +.......+.+ .|+...-..+.+-. T Consensus 360 ~d~Qr~~N~~~s~~~~~------l~~~~~~~~~-------~~~gai~~~~~~~~e~~~~~~~vi~~~~~~~~~~~~~~~~ 426 (711) T protein:vir:10 360 KDAQRMANYWDSAATET------VALAPKAPFI-------GSEGNVEGREDEWEQANTKNFSLLTYIPQYQGDPGPRRQP 426 (711) T ss_pred hhhHHHHHHHHHHHHHH------HHhcCCCcee-------ecCcccCChHHHHHhccccCCCeeEecccccCcCCccccC Confidence 22223333322222111 1100000000 00011100 00000000111 11111111222222 Q ss_pred chHHHHHHHHHHHHHHHHHhCCChhHcccccch---hHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|Aclame:pro 409 MNQALAALLQQTSADIQEVTGGSQAMQQMPSNI---AQETVNNLMN--RADMASFIYLDNMAKSLKRAGEVWLSMAREVY 483 (708) Q Consensus 409 ~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n~---sg~ai~~~q~--q~~~~~~~~~dn~~~~~~~~~~~~l~li~~~y 483 (708) .++-....++.+......+. ...|..... .+.+.+...- .-......+. .+-+.+++..+.+.+++..+. T Consensus 427 ~~~~~~~~~~ll~~~~~~i~----~~tGi~~~~~G~~~n~~Sg~ai~~~q~qg~~~l~-~~~dn~~~~~~~~g~~ll~li 501 (711) T protein:vir:10 427 PAAVPAAELTLGQNSVEKIK----STMGMYDASLGAMGNETSGRAIIARQRQGDRGSF-AFIDNLTKSIRRVGKILVEMI 501 (711) T ss_pred CCCCCHHHHHHHHHHHHHHH----HHhCCChHHcCCCccchHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHH Confidence 23333445555555554443 445543322 2223222211 1111112222 222333344444444444433 Q ss_pred CCCcEEEEeccCCCceEEEecccccccCCCceEEeecc----cee----eEEEEE---eec-ccchhHHHHHHHHHHHHH Q lcl|Aclame:pro 484 GSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDL----SVG----RYDVTV---DVG-PSYTARRDATVSVLTNVL 551 (708) Q Consensus 484 ~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi----~~g----~~Dv~v---~~~-~~~~~~r~~~~~~l~~ll 551 (708) - .+. ..++.+.|-.... +-.++.+|.- ..| ..||++ ++. ...++.-....+.+..|+ T Consensus 502 ~-----~~~---~~er~~rI~ged~---~~~~v~ln~~~~~~~~G~~~~~nDi~~g~~Dv~i~~~p~~~s~r~~~~~~l~ 570 (711) T protein:vir:10 502 P-----HIY---DTERVVRLKFPDE---TEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMI 570 (711) T ss_pred H-----HHc---CCCeEEEEecCCC---CcceEEecccccccccccceeeeccceeeeEEEEeeccCchhHHHHHHHHHH Confidence 2 111 2335566643321 2234445432 112 246654 334 333444444444444444 Q ss_pred HhccccCchhHHHHHHHHhh-ccchhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 552 SSMLPTDPMRPAIQGIILDN-IDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAA 630 (708) Q Consensus 552 q~~~~~~p~~~~~~~~~~~~-~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~ 630 (708) + +.+ ..+.....++.. ++.......-+.+........... .....+.+.++.+ .+++.+..+++.+++++ T Consensus 571 q-l~~---~~p~~~~~~~~~il~~~d~p~~~el~e~lr~~~~~~~--~~~~~~~~~qq~~---~e~qq~~~~~q~~~~~~ 641 (711) T protein:vir:10 571 Q-FAQ---AVPSAAAVMADLIAQNMDWPGADVIAERLKKIVPPNV--LSKDEREAIEEDM---PEQTEPTPEQQVEMAKS 641 (711) T ss_pred H-HHh---hcchhhhHHHHHHHHhcCCCCHHHHHHHHHhhcCccc--CcchhhhHHHHHH---HHHHHHHHHHHHHHHHH Confidence 3 221 222222222211 111111111111211111111111 1111111111111 11222223334445555 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhcCCCCCCC Q lcl|Aclame:pro 631 QAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVMEAIRLLKDVAESQQQQFQSPPQSPAD 704 (708) Q Consensus 631 qae~~k~~~~~~~~q~e~~~~~~~~~~~~a~~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 704 (708) |++..+++++.++++++..+++.++...++...+...++. .+..+.++....++..+.+.+..+..-... T Consensus 642 q~~~~qa~ae~~~Aqae~~qa~~e~~~~q~q~~~~~~~aq----~~~~~~qq~~~~l~~~qaelq~~q~~~~q~ 711 (711) T protein:vir:10 642 QADMAQAEADTAQAQADMLKAQLETEEAQKQLAMIEDMAQ----GGDVVYQQVRELVAQALAEITASQANVTEQ 711 (711) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 6666666666666666666655555444433333222221 122222222122222222222111111111 No 131 >protein:vir:172 Length: 708 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112077;genbank:gi:13559867;genbank:GeneID:920970 Probab=98.56 E-value=2.8e-07 Score=56.54 Aligned_cols=621 Identities=10% Similarity=0.000 Sum_probs=203.0 Q ss_pred hHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHHHH--Hhc Q lcl|Aclame:pro 5 LEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAE--YRN 82 (708) Q Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~g~--~~~ 82 (708) +-+.+++.+.++..-++++.++-..++....- +.+. .-+.| |...+-+..++-. +.. T Consensus 1 ma~~~~~~~~~~~~r~~~~~~~~~~~r~~~~~-----------d~~f----~~y~G------~Qw~~~~~~~l~~~~q~~ 59 (708) T protein:vir:17 1 MAETLEKKHERIMLRFDRAYSPQQEVREKCIE-----------ATRF----ARVPG------GQWEGATAAGTKLDEQFE 59 (708) T ss_pred CchhHHHHHHHHHHHHHHHHhhhHHHHHHHHH-----------HHHh----hccCC------CCCCHHHHHHHHhhhhhc Confidence 77777877777666555555444444322110 0000 00111 2333444444322 222 Q ss_pred CcceeEE-------------------ecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeecc Q lcl|Aclame:pro 83 NRITVKF-------------------RPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLV 143 (708) Q Consensus 83 nr~~~~v-------------------~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~ 143 (708) +||-+.+ .++--..+.+.-..+.+++..+....-.......+.-++..+++.+ ++.| T Consensus 60 ~rP~~~~N~i~~~i~~v~g~e~~nr~d~~v~p~~~~~d~~~Ae~l~~l~~~~~~~~~~~~~~s~Af~~~i~~---G~G~- 135 (708) T protein:vir:17 60 KYPKFEINKVATELNRIIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATG---GFGC- 135 (708) T ss_pred CCCceEEcchHHHHHHHHhhHhhCCcceEEecCCCcchHHHHHHHHHHHHHHHHhcCchhHHhHHHHHhhhc---ccce- Confidence 3443322 1111112233223345566665555444444555555666555432 1111 Q ss_pred ccCCCCCCCcceeeEEeecchhheecCCccccCChhcc---------CeEE---EeecCCHHHHHHhCCCCccccccccc Q lcl|Aclame:pro 144 NEYDPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDA---------LWAF---CMYSLSPEKYEAEYGKKPPTSLDVTS 211 (708) Q Consensus 144 ~~~d~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDa---------~~~~---~~~~~~~~e~~~~~p~~~~~~~d~~~ 211 (708) +++..-+ .+|++..+ |..+. +-++ ..+..+.++..-.|=. ..-+.+..- T Consensus 136 -----------~~~~~d~------~~e~d~~~-~~~~i~i~~~~~~~~~v~~Dp~a~~~D~sDar~~~~~-~~~~~d~~~ 196 (708) T protein:vir:17 136 -----------FRLTSML------VNEYDPMD-DRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCM-YSLSPEKYE 196 (708) T ss_pred -----------eeeeecc------cccCCCCC-CccccceEeeccchhheecCccccccChhhhhhhhhh-ccCCHHHHH Confidence 1111100 11221111 00111 1111 1111233333211100 000000000 Q ss_pred cccccc--CCCCCc--eeEEeeeeeecceEEEEEEEecCccCceeEecC---CcccchHHHh-hccchhhhhhe--eeee Q lcl|Aclame:pro 212 MTSWEY--NWFGAD--VIYIAKYYEVRKESVDVISYRHPITGEIATYDS---DQVEDIEDEL-AIAGFHEVARR--SVKR 281 (708) Q Consensus 212 ~~~~~~--~~~~~~--~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~-~~~~~~~~~~~--~~~~ 281 (708) ....+. .+.+.. .....+|+..+++++..||+..+....++.+.+ +++..+.... ........... .+.+ T Consensus 197 ~~yp~~a~~~~~~~~~~~~~~~~~~~d~vrv~e~~~r~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~ 276 (708) T protein:vir:17 197 AEYGKKPPASLDVTSMTSWEYDWFDADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFQEVAR 276 (708) T ss_pred HhCccccchhhhhhhhccccccccCCCeEEEEEEEEEeeeeeEEEEEecCccCceeeeCccchhhHHHHHHhccccccee Confidence 000000 000001 111235666777777777776666666655533 3333333221 11111111121 1223 Q ss_pred EEEEEEEEecceeeecCC--CCCC-CCcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCcee Q lcl|Aclame:pro 282 RRVYVSVVDGDGFLEKPR--RIPG-EHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPI 358 (708) Q Consensus 282 ~~v~~~~~~~~~il~~~~--~~p~-~~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i 358 (708) +++..+.+.-. .+.+.. ..|. --+.++||+.+.-+... ..+.+....++..=+..=...+...-.+ T Consensus 277 r~~~r~~v~~~-~~~g~~~l~~~~~~p~~~fP~vP~~g~r~~-~d~~~~~yG~vr~~kd~Q~~~N~~~S~~--------- 345 (708) T protein:vir:17 277 RSVKRRRVYVS-VVDGDGFLEKPRRIPGEHIPLIPVYGKRWF-IDDIERVEGHIAKAMDPQRLYNLQVSML--------- 345 (708) T ss_pred eeeeEEEEEEE-eecccccccCCCCCCCCccceEEEeccccc-ccCCCcccchhhhchhHHHHHHHHHHHH--------- Confidence 33333222212 222222 2222 22345676654432221 1112121222221111111111111000 Q ss_pred echhhccchHHHHHhhcccCCceeeeccc-----------ccccccc--ccccccccc-----ccC--ccchHHHHHHHH Q lcl|Aclame:pro 359 VGMEQIRGLEKHWEARNKKRPAFLPLREV-----------RDKSGNI--IAGATPAGY-----TQP--AVMNQALAALLQ 418 (708) Q Consensus 359 ~~~~ai~~~~~~~~~~~~~~~~~~~~~~~-----------~~~~~~~--~~~~~~~~~-----~~~--~~~~~~~~~l~~ 418 (708) .+......... .++....+ ....... .+.+.+.+. ..+ .+.+.-....++ T Consensus 346 ---------~~~~a~~~~~~-~i~~~~a~~g~~~~~~~~~~~~~~~~~~~~~~~~~g~v~~~a~~~~~~~~~~~~~~~~~ 415 (708) T protein:vir:17 346 ---------ADTAAQDPGQI-PIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKYGNIIAGATPAGYTQPAVMNQALAA 415 (708) T ss_pred ---------HHHHHhcCCcc-eeechhhhhhhHHhhhhcccchhhhhhhhccCCcccccccccCCcccCCCccccHHHHH Confidence 00000000000 00000000 0000000 000011111 111 122222233334 Q ss_pred HHHHHHHHHhCCChhHcccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCc Q lcl|Aclame:pro 419 QTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSD 498 (708) Q Consensus 419 ~~~~~~~~~tGv~~~~~G~~~n~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~ 498 (708) ........+.-++..+....+..|.++-.++...-......+.. |-..++.--+.+-+++..+.- .+- +.+ T Consensus 416 llq~~~~~i~~~tGi~d~~~G~~sn~SG~Ai~~rq~qg~~~~~~-~~Dnl~~~~~~~g~~lL~lI~-----~~y---~~~ 486 (708) T protein:vir:17 416 LLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMASFI-YLDNMAKSLKRAGEVWLSMAR-----EVY---GSE 486 (708) T ss_pred HHHHHHHHHHHhcCCChHHccCccchHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHH-----HHc---CCC Confidence 44444444432222222222222332222222222222233332 223333333444444444332 111 233 Q ss_pred eEEEecccccccCCCceEEee----ccceee----EEEEE---eecccchhHHHHHHHHHH-HHHHhccccCchhHHHHH Q lcl|Aclame:pro 499 DIAVLSAQVVDRQTGAVVALN----DLSVGR----YDVTV---DVGPSYTARRDATVSVLT-NVLSSMLPTDPMRPAIQG 566 (708) Q Consensus 499 ~~v~in~~~~~~~~~~~~~~n----Di~~g~----~Dv~v---~~~~~~~~~r~~~~~~l~-~llq~~~~~~p~~~~~~~ 566 (708) +.+.|-... . +...+.+| |...|. -||++ ++..+.........++.. .|++.++...|..+.... T Consensus 487 R~~RI~~ed--g-~~~~v~in~~~~d~~~g~~~~~nDi~~g~~Dv~v~~~p~~~t~r~~~~~~l~qll~~~~~~~~~~~~ 563 (708) T protein:vir:17 487 REVRIVNED--G-SDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLPADPMRPA 563 (708) T ss_pred cEEEEecCC--C-CcceeeecceeccCCCccceeeccceeeeeeEEEecccCchhHHHHHHHHHHHHHHhcCCccchhHH Confidence 555554321 1 12333333 334443 46654 455555544555554444 444444433333222222 Q ss_pred HHHhhccchhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 567 IILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQKATNETAQTQI 646 (708) Q Consensus 567 ~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qae~~k~~~~~~~~q~ 646 (708) .+.-+.+.......-+.+....... +.....++..+..+++.+++++.++.++++++..++++..+++++++++++ T Consensus 564 ~~~l~l~~~D~p~~~ei~e~ir~~~----~~~~~~~~~~~e~~q~~~q~qq~~q~q~~~~~~eaqa~~~~~qAe~~ka~a 639 (708) T protein:vir:17 564 IQGIILDNIDGEGLDDFKEYNRNQL----LISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQKATN 639 (708) T ss_pred HHHHHHHhcCCCChHHHHHHHHHHh----hccccccCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2211211111111112222111111 111111112222222223333344445555555666666777777777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhhhhhhhhhhhcCCCCCCCCCCC Q lcl|Aclame:pro 647 KAFTAQQDAMESQANTVYKLAQARNIDDKAV-MEAIRLLKDVAESQQQQFQSPPQSPADLMPS 708 (708) Q Consensus 647 e~~~~~~~~~~~~a~~~~~~~q~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~ 708 (708) ++.+.++++..++....+...++.+.-..+. ++........+..+ ..+...+...+.+|. T Consensus 640 ea~~~q~~a~q~~~~~~~a~~~a~q~~~q~~~~~~~~~~~~~~~l~--~~q~~q~q~~~a~p~ 700 (708) T protein:vir:17 640 ETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVMEAIRLLK--DVAESQQQQFQSPPQ 700 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh--hhhhhHHHHHhcccc Confidence 7766666655544444433333222111111 11111111111111 111111233445565 No 132 >protein:vir:103385 Length: 666 # NCBI annotation: hypothetical protein # Family: family:all:11276 # MgeID: mge:1621 # MgeName: PaP2 # Cross-refs: genbank:acc:YP_024736;genbank:gi:48697078;genbank:GeneID:2846053 Probab=98.44 E-value=5.6e-08 Score=60.33 Aligned_cols=579 Identities=15% Similarity=0.107 Sum_probs=224.8 Q ss_pred CCcchH----------HHHHH-HHHHHHHHHHhh----HHHHHHH-HHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCce Q lcl|Aclame:pro 1 MAETLE----------KKHER-IMLRFDRAYSPQ----KEVREKC-IEATRFARVPGGQWEGATAAGTKLDEQFEKYPKF 64 (708) Q Consensus 1 ma~~~~----------~~~~~-~~~~~~~~~~~~----~~~r~~~-~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~ 64 (708) ||-++. .+.++ ++..+++..+.. ..+..+. .-|+.|+.|.--|=..+.+. .. ++.+...||- T Consensus 1 maispsepninsfvytqrvdellkahlkkildfsktnkanyiqKMD~ID~AYARY~~~~~N~~LlG-~~-~~~~A~V~C~ 78 (666) T protein:vir:10 1 MAISPSEPNINSFVYTQRVDELLKAHLKKILDFSKTNKANYIQKMDLIDKAYARYITAQENNELLG-YN-QNIAAKVRCQ 78 (666) T ss_pred CCcCCCCCcchhhhhHHHHHHHHHHHHHHHhhhhccchhhHHHHhhhHHHhHHhhhhccCCCceee-ec-ccccccCcce Confidence 443221 11122 222233333322 2222222 22455555554443333321 11 1122345676 Q ss_pred eecc--hH----HHHHHHHHHHhc----CcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCee Q lcl|Aclame:pro 65 EINK--VA----TELNRIIAEYRN----NRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFG 134 (708) Q Consensus 65 ~~N~--i~----~~i~~i~g~~~~----nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G 134 (708) +||. +. .-|.+++|+... ..|-.-|... .+..+-||.|+++|..-.....+-...--.+.|+++.-+- T Consensus 79 V~~~~~V~PIViSQV~S~~~YLT~VF~SG~Pi~PVVS~--P~~K~~AE~LE~ii~DH~t~~~~~~~LiL~L~D~~KYN~~ 156 (666) T protein:vir:10 79 VVNKATVNPIVISQVQSMTAYLTEVFASGYPILPVVST--PDKKEQAEALEGIIQDHMTMTSSIPELILCLQDAAKYNLV 156 (666) T ss_pred eeccccCCchhhhhHHHHHHHHHHHHhcCCccceeecC--CchhHHHHHHHHHHHhhhhhhhhHHHHHHHHhhhhhccee Confidence 6654 33 445666776544 3333333222 2446779999999998777777777777788899998877 Q ss_pred EEEEEeeccccCCCC----CCC-cce----------eeEEeecchhheecCCccccCChh-ccCeEEEeecCCHHHHHHh Q lcl|Aclame:pro 135 CFRLTSMLVNEYDPM----DDR-QRI----------AIEPIYDPSRSVWFDPDAKKYDKS-DALWAFCMYSLSPEKYEAE 198 (708) Q Consensus 135 ~~~v~~~~~~~~d~~----~~~-~~i----------~i~~v~~~~~~v~~Dp~a~~~D~s-Da~~~~~~~~~~~~e~~~~ 198 (708) .|+..|....--++- +-. +.. +|++. +++++||||..--+|.. .-.|+.....+++-.+++. T Consensus 157 ~~ET~Ws~IE~~~~~~~i~~~~~~K~TlrR~~r~~~KIrRL--N~RN~~~D~~~~~~~VA~~G~~~G~~~L~~R~~LKK~ 234 (666) T protein:vir:10 157 GWETEWSHIETYDPQKEITDLEPGKTTLRRNYRHVNKIRRL--NLRNVHWDPIPDIPNVATEGSFLGETTLLNRIQLKKY 234 (666) T ss_pred eeeeccccccccchhhhhhcCCCceeecccchhhhhhhhcc--ccccccccCCCCCCchhhhhhhhhHHHHHHHHHHHHH Confidence 677655432211111 101 111 12222 46799999955444533 3567888888887777654 Q ss_pred CC----CC-------ccccccc-cccccc-----------------ccCCC--CCceeEEeeeeeecceEEEEEEEecCc Q lcl|Aclame:pro 199 YG----KK-------PPTSLDV-TSMTSW-----------------EYNWF--GADVIYIAKYYEVRKESVDVISYRHPI 247 (708) Q Consensus 199 ~p----~~-------~~~~~d~-~~~~~~-----------------~~~~~--~~~~~~v~e~~~~~~~~~~~~~~~~~~ 247 (708) .- ++ ....+.. ....++ +.+|. +..+...+.+ .+|+.+..--.+++. T Consensus 235 LN~LT~EKkltykkvV~~Al~~s~~~sD~T~~P~IS~vY~~~~~~SDi~WD~~G~~~T~~sS~--~~rvpvneqg~Y~k~ 312 (666) T protein:vir:10 235 LNYLTNEKKLTYKKVVNEALKSSFQGSDWTDNPQISPVYQEMEMASDINWDRFGGFETETSST--NRRVPVNEQGVYCKH 312 (666) T ss_pred HhhhhcchhhhHHHHHHHHHhhhccccccccCCccCccccccchhhccchhhcCccccccccc--ccccccccccceeee Confidence 21 11 0011000 000010 01110 0000000000 111111111111111 Q ss_pred cCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeee-cCCCCCCCCcceeeEEEeeeccCCccc-cc Q lcl|Aclame:pro 248 TGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLE-KPRRIPGEHIPLIPVYGKRWFIDDIER-VE 325 (708) Q Consensus 248 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~-~~~~~p~~~~p~~p~~~~~~~~d~~~~-~~ 325 (708) +--.....+ +++. ..+. +-.---|....+.++.++. ++--..+++||+- ++++. .||.+. -. T Consensus 313 ~~Y~RI~PS-DF~~---~~P~---------~N~~QIWK~v~IN~~~iIS~~~~I~AY~~~~~~--~~~~L-EDG~G~QTQ 376 (666) T protein:vir:10 313 TMYLRIIPS-DFEM---NVPN---------RNQVQIWKAVMINRDAIISFEPYIGAYGSFGMG--LAFAL-EDGMGLQTQ 376 (666) T ss_pred eeeeeeccc-ccee---cCCC---------CCcceeeeeeeeccceeEeeehhhhccchhhhh--hhhhh-hhccccccc Confidence 100011111 1100 0000 0011124444566777775 3333345556553 33332 355433 25 Q ss_pred chHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhccc-CCceeeecccccccccccc--cccccc Q lcl|Aclame:pro 326 GHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKK-RPAFLPLREVRDKSGNIIA--GATPAG 402 (708) Q Consensus 326 G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~--~~~~~~ 402 (708) |+.+..++.|+.-.++++.-.-........+.++++..+.. ...|.+ +..-++.++-...+|.... ...+|. T Consensus 377 ~~~E~~~P~Q~A~t~L~N~~~~~aRRAV~DRAl~~~S~i~a-----~~iNSP~~~~KIP~~~~sL~N~~~~~~Y~~IPFD 451 (666) T protein:vir:10 377 GYGEMAAPLQSATTELWNAYIQGARRAVMDRALYNPSMIRA-----NDINSPIPQIKIPVVPQSLVNGTMDQAYRQIPFD 451 (666) T ss_pred cccccccchhhhhhHHhhhhhhhhhhhhhhhhccChhhhhh-----hcccCCCCCcccceeehhhcccchhhhhccCCcc Confidence 67778889998777655544333333333344444443322 111111 1122222222222222111 011111 Q ss_pred cccCccchHHHHHHHHHHHHHHHHHhCCChhHccc--ccchhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHH Q lcl|Aclame:pro 403 YTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM--PSNIAQETVNNLMNRADMASFIYLDNMA-KSLKRAGEVWLSMA 479 (708) Q Consensus 403 ~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~--~~n~sg~ai~~~q~q~~~~~~~~~dn~~-~~~~~~~~~~l~li 479 (708) .-.. ....+=.+....--++++|.+...+|+ .+|.|-.--...+..++.++....--|. +.+..+-++|.--| T Consensus 452 ~RG~----E~~~Q~A~~l~~~~r~L~GMN~~~~GQFQKGNKt~~E~~~~MG~a~NR~RLPALiLEH~~F~~iK~~L~LNl 527 (666) T protein:vir:10 452 SRGM----ETVMQNALMLTDWQRELSGMNSATRGQFQKGNKTRAEFDTIMGNAENRMRLPALILEHRMFTKIKEQLKLNL 527 (666) T ss_pred ccch----hHHHhhhHHHHhhHHHhhccCCcccccccccCcceeehhhhcCCcccceehhhHHhhhhhhhhHHHHHhhhh Confidence 1111 112222333445567899999999996 4565433333344444444433222222 22223333333333 Q ss_pred HHhcCCCcEEEEeccCCCceEEEecccccccCCCceEE--eeccceeeEEEEEeecccchhHHHHHHHHHHHHHHhcccc Q lcl|Aclame:pro 480 REVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVA--LNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLPT 557 (708) Q Consensus 480 ~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~--~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq~~~~~ 557 (708) -+|=++..||.- .+|+++. +..++..-..+.+.+|..- ..+.+.-..+..+||.+.-. T Consensus 528 ~~YG~DT~ViS~-------------------RtG~~~~vDi~~L~~~~L~F~~~DG~TP-~SK~ASs~~lT~~LQMI~sS 587 (666) T protein:vir:10 528 LMYGEDTEVISP-------------------RTGKGVRVDIKELQDLGLKFELGDGLTP-ASKLASSDFLTALLQMIMSS 587 (666) T ss_pred hhccccchhccc-------------------ccCceeeeeHHHHhhhhheeeeccCCCc-hhhhhhhHHHHHHHHHHhhh Confidence 444444444432 2233221 1112222234555666543 33555555555555533211 Q ss_pred Cchh----HHHHHHHHhhccchhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 558 DPMR----PAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAE 633 (708) Q Consensus 558 ~p~~----~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qae 633 (708) .... +.+-.++.-++.+-++.-+.++....++.-......++..|++-.|.++ | -..+. |+. |. T Consensus 588 ~~~~~A~G~~~P~M~AH~~QLGGVRG~E~Y~daalP~~~~~~~~~Q~LQ~~~LQ~~~--Q---SA~Q~--~A~----Q~- 655 (666) T protein:vir:10 588 ETTLQAFGTQVPGMIAHLAQLGGVRGFEKYADAALPQWQITYGMQQQLQQMLLQLQQ--Q---SAMQL--QAR----QG- 655 (666) T ss_pred hhhHhhhcccchHHHHHHHHhccccchhhhhhccCCccccccchhHHHHHHHHHHhh--h---hhccc--ccc----cc- Confidence 1110 1111233333444444444444433222222222111111111111100 0 00000 000 00 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 634 AQKATNETAQTQIKAFTAQQDA 655 (708) Q Consensus 634 ~~k~~~~~~~~q~e~~~~~~~~ 655 (708) ++..++.+-.. T Consensus 656 -----------~L~~~Q~~PSq 666 (666) T protein:vir:10 656 -----------ELSNDQSQPSQ 666 (666) T ss_pred -----------cCcccccCCCC Confidence 00000000000 No 133 >protein:vir:3296 Length: 714 # NCBI annotation: putative portal protein # Family: family:all:487 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049512;genbank:gi:9632518;genbank:GeneID:1262005 Probab=98.43 E-value=6.9e-07 Score=54.38 Aligned_cols=610 Identities=13% Similarity=0.051 Sum_probs=197.8 Q ss_pred CCcchHHH--------HHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHH Q lcl|Aclame:pro 1 MAETLEKK--------HERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATE 72 (708) Q Consensus 1 ma~~~~~~--------~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~ 72 (708) |.|..--. ..+++.++..++....+...+|+.+. ..+.... .|. ...+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a-----------~~d~~fy------~G~------Qw~~~ 57 (714) T protein:vir:32 1 MKNETNTMATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAA-----------NKACAYY------DGD------QLPPE 57 (714) T ss_pred CCcccccccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHH-----------HHHHHhh------cCC------CCCHH Confidence 88743222 13355555555555444444443221 1122222 221 22223 Q ss_pred HHHHHHHHhcCcceeEE-------------------ecCCCcchHHHH-HHHHHHHHHHHHhcChHHHHHHHHHHHhhcC Q lcl|Aclame:pro 73 LNRIIAEYRNNRITVKF-------------------RPGDREASEELA-NKLNGLFRADYEETDGGEACDNAFDDAATGG 132 (708) Q Consensus 73 i~~i~g~~~~nr~~~~v-------------------~pr~~~~d~~~A-~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G 132 (708) +..++-. +.+|-+.+ .++--..+.+-+ ..+.+++..+....-.......++.++..++ T Consensus 58 ~~~~l~~--~g~p~~~~N~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~ 135 (714) T protein:vir:32 58 VLQVLKD--RGQPMTIHNLIAPTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQ 135 (714) T ss_pred HHHHHHh--cCCCcEEeccHHHHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHh Confidence 3333322 22222221 111111121112 1244444444333333333444555555555 Q ss_pred eeEEEEEeeccccCCCCCCCcceeeEEeecchhheecCCccccCCh--h--ccCeEEE---eecCCHHHHHHhCCCCc-- Q lcl|Aclame:pro 133 FGCFRLTSMLVNEYDPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDK--S--DALWAFC---MYSLSPEKYEAEYGKKP-- 203 (708) Q Consensus 133 ~G~~~v~~~~~~~~d~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~--s--Da~~~~~---~~~~~~~e~~~~~p~~~-- 203 (708) +.+ .++ |..+++|++....++ . +..-+++ .+..|.++..-.|=... T Consensus 136 ~~~-G~G------------------------~~~~~~~~d~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~ 190 (714) T protein:vir:32 136 IKA-GLS------------------------WVEVRRNSDPFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMD 190 (714) T ss_pred hhc-Ccc------------------------eEEeccccCCCCCCeEEEecchhheeeccccccCChhhccceeeeecCC Confidence 421 111 112233332221110 0 0000111 11122233221110000 Q ss_pred ccccc------cccccccccCCCCC---------c-------------eeEEeeeeeecceEEEEEEEecCccCceeEec Q lcl|Aclame:pro 204 PTSLD------VTSMTSWEYNWFGA---------D-------------VIYIAKYYEVRKESVDVISYRHPITGEIATYD 255 (708) Q Consensus 204 ~~~~d------~~~~~~~~~~~~~~---------~-------------~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~ 255 (708) -+.+. .........+|.+. . ...-..|+.++++++++++||.+.-.....+. T Consensus 191 ~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~ 270 (714) T protein:vir:32 191 TDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIE 270 (714) T ss_pred HHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeec Confidence 00000 00011111122110 0 11123466677788887777764333333333 Q ss_pred C--CcccchHHHhhccchhhhhheee-eeEEEEEEEEecceeeecCCCCCCC--Ccc--eeeEEEeeeccCCcccccchH Q lcl|Aclame:pro 256 S--DQVEDIEDELAIAGFHEVARRSV-KRRRVYVSVVDGDGFLEKPRRIPGE--HIP--LIPVYGKRWFIDDIERVEGHI 328 (708) Q Consensus 256 ~--~~~~~~~~~~~~~~~~~~~~~~~-~~~~v~~~~~~~~~il~~~~~~p~~--~~p--~~p~~~~~~~~d~~~~~~G~v 328 (708) . +....+...-......+...... ..+++... ...++.++.-.-.+ .|| ++|++.+..+.+ ...|.. T Consensus 271 ~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv---~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~---~~~g~~ 344 (714) T protein:vir:32 271 LSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRI---REAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRK---DKTGEP 344 (714) T ss_pred cCCCceEEeCccCHHHHHHHhhcchhhhccccceE---EEEEEecCcccccCCCCCCCCceeEEEEeeeee---eccCce Confidence 2 22222222211111112111111 12222221 11122222222111 133 366665543322 124444 Q ss_pred HhhhHHHHHHHHHHHHHHHHHhhcCC----CceeechhhccchHHHHHhhcccCCceeee-----cccccccccc----- Q lcl|Aclame:pro 329 AKAMDPQRLYNLQVSMLADTAAQDPG----QIPIVGMEQIRGLEKHWEARNKKRPAFLPL-----REVRDKSGNI----- 394 (708) Q Consensus 329 r~~~d~Q~~~N~~~s~~~~~l~~~~~----~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~----- 394 (708) ..++ +.+.++-..-++ -.++.... ..+ ..++++... ......++-+ T Consensus 345 ~G~v----------r~~~d~Qr~~N~~~s~~~~~l~~~--~~~--------~~~~a~~~~d~~~~e~~arp~~vi~~~p~ 404 (714) T protein:vir:32 345 YGLI----------SRAIPAQDEVNFRRIKLTWLLQAK--RVI--------MDEDATQLSDNDLMEQIERPDGIIKLNPV 404 (714) T ss_pred eehh----------hhchhHHHHHHHHHHHHHHhhcCC--cee--------eecCcccccHHHHHHhccCCCCceeeccc Confidence 4332 222222111100 00111100 000 001111000 0000011111 Q ss_pred -cccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccc---hhHHHHHHHHHHHHHHH-HHHHHHHHHHHH Q lcl|Aclame:pro 395 -IAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSN---IAQETVNNLMNRADMAS-FIYLDNMAKSLK 469 (708) Q Consensus 395 -~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n---~sg~ai~~~q~q~~~~~-~~~~dn~~~~~~ 469 (708) ..++.++..+++..-++-....++.+......+ ....|.... ..+.+.+...-+..... ...+-.+-..++ T Consensus 405 ~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i----~~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~ 480 (714) T protein:vir:32 405 RKNQKSVADVFRVEQDFQVASQQFQVMQESEKLI----QDTMGVYSAFLGQDSGATSGVAISNLVEQGATTLAEINDNYQ 480 (714) T ss_pred ccccCCCCccccccCCCCccHHHHHHHHHHHHHH----HHhhCCChHHcCCCccchhHHHHHHHHHHHHHHHHHHHHHHH Confidence 122333444555444455555666655555555 344554432 23444443322111111 111112222333 Q ss_pred HHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccc-ee--eEEE---EEeecccchhHHHHH Q lcl|Aclame:pro 470 RAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLS-VG--RYDV---TVDVGPSYTARRDAT 543 (708) Q Consensus 470 ~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~-~g--~~Dv---~v~~~~~~~~~r~~~ 543 (708) ...+.+-+++..+.. .+. +.++.+.|-...........+.+|.-+ .| .-|| ..++..+........ T Consensus 481 ~~~~~~g~~lL~li~-----~~~---~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~ 552 (714) T protein:vir:32 481 FACQQVGRLLLAYLL-----DDL---KKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAF 552 (714) T ss_pred HHHHHHHHHHHHHHH-----HHc---CCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchHH Confidence 444444444443332 112 233444443221111111234444222 11 1233 345555555444444 Q ss_pred HHHHH-HHHHhccccCchhHHHHHHHHhh-ccchhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 544 VSVLT-NVLSSMLPTDPMRPAIQGIILDN-IDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMV 621 (708) Q Consensus 544 ~~~l~-~llq~~~~~~p~~~~~~~~~~~~-~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~ 621 (708) .++.. .|++.++. +.+.....++.. ++.-......+-++..........+ +.+++.+++++ +++++++++. T Consensus 553 r~~~~~~l~~l~~~---~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~--~~~~~~e~q~~--~~~~q~~~~~ 625 (714) T protein:vir:32 553 KAQLAQRMSEVIQG---LPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKS--PDEMTPEEQEV--AAQQQALQQQ 625 (714) T ss_pred HHHHHHHHHHHHhh---cCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCC--ccccchhhHHH--HHHHHHHHHH Confidence 44444 44333211 112212122221 1111122222333322222222222 22222222222 2222333344 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHH-HHhhhhhhhhhhhc- Q lcl|Aclame:pro 622 LAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAV--MEAIRL-LKDVAESQQQQFQS- 697 (708) Q Consensus 622 ~aq~~~~~~qae~~k~~~~~~~~q~e~~~~~~~~~~~~a~~~~~~~q~~~~~~~~~--~~~~~~-~~~~~~~~~~~~~~- 697 (708) ++++++.++++++++.+++++++++.+.+...+++...+.+..........++.+. ++..+. .+...-.+++..+. T Consensus 626 q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~q~~q~~ 705 (714) T protein:vir:32 626 QAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQNMEQEQDVLQQQMLYTL 705 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhhhhHHHHHHHHHHH Confidence 45556667778888888888777776666555555444443332222222222211 111111 11122222222222 Q ss_pred -CCCCCCCC Q lcl|Aclame:pro 698 -PPQSPADL 705 (708) Q Consensus 698 -~~~~~~e~ 705 (708) +-....+| T Consensus 706 ~~~~~~~~~ 714 (714) T protein:vir:32 706 QQRMNEMSL 714 (714) T ss_pred HHHHHhcCC Confidence 22223333 No 134 >protein:vir:10117 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859247;genbank:gi:32171003;genbank:GeneID:2653343 Probab=98.43 E-value=6.9e-07 Score=54.38 Aligned_cols=610 Identities=13% Similarity=0.051 Sum_probs=197.8 Q ss_pred CCcchHHH--------HHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHH Q lcl|Aclame:pro 1 MAETLEKK--------HERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATE 72 (708) Q Consensus 1 ma~~~~~~--------~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~ 72 (708) |.|..--. ..+++.++..++....+...+|+.+. ..+.... .|. ...+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a-----------~~d~~fy------~G~------Qw~~~ 57 (714) T protein:vir:10 1 MKNETNTMATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAA-----------NKACAYY------DGD------QLPPE 57 (714) T ss_pred CCcccccccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHH-----------HHHHHhh------cCC------CCCHH Confidence 88743222 13355555555555444444443221 1122222 221 22223 Q ss_pred HHHHHHHHhcCcceeEE-------------------ecCCCcchHHHH-HHHHHHHHHHHHhcChHHHHHHHHHHHhhcC Q lcl|Aclame:pro 73 LNRIIAEYRNNRITVKF-------------------RPGDREASEELA-NKLNGLFRADYEETDGGEACDNAFDDAATGG 132 (708) Q Consensus 73 i~~i~g~~~~nr~~~~v-------------------~pr~~~~d~~~A-~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G 132 (708) +..++-. +.+|-+.+ .++--..+.+-+ ..+.+++..+....-.......++.++..++ T Consensus 58 ~~~~l~~--~g~p~~~~N~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~ 135 (714) T protein:vir:10 58 VLQVLKD--RGQPMTIHNLIAPTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQ 135 (714) T ss_pred HHHHHHh--cCCCcEEeccHHHHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHh Confidence 3333322 22222221 111111121112 1244444444333333333444555555555 Q ss_pred eeEEEEEeeccccCCCCCCCcceeeEEeecchhheecCCccccCCh--h--ccCeEEE---eecCCHHHHHHhCCCCc-- Q lcl|Aclame:pro 133 FGCFRLTSMLVNEYDPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDK--S--DALWAFC---MYSLSPEKYEAEYGKKP-- 203 (708) Q Consensus 133 ~G~~~v~~~~~~~~d~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~--s--Da~~~~~---~~~~~~~e~~~~~p~~~-- 203 (708) +.+ .++ |..+++|++....++ . +..-+++ .+..|.++..-.|=... T Consensus 136 ~~~-G~G------------------------~~~~~~~~d~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~ 190 (714) T protein:vir:10 136 IKA-GLS------------------------WVEVRRNSDPFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMD 190 (714) T ss_pred hhc-Ccc------------------------eEEeccccCCCCCCeEEEecchhheeeccccccCChhhccceeeeecCC Confidence 421 111 112233332221110 0 0000111 11122233221110000 Q ss_pred ccccc------cccccccccCCCCC---------c-------------eeEEeeeeeecceEEEEEEEecCccCceeEec Q lcl|Aclame:pro 204 PTSLD------VTSMTSWEYNWFGA---------D-------------VIYIAKYYEVRKESVDVISYRHPITGEIATYD 255 (708) Q Consensus 204 ~~~~d------~~~~~~~~~~~~~~---------~-------------~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~ 255 (708) -+.+. .........+|.+. . ...-..|+.++++++++++||.+.-.....+. T Consensus 191 ~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~ 270 (714) T protein:vir:10 191 TDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIE 270 (714) T ss_pred HHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeec Confidence 00000 00011111122110 0 11123466677788887777764333333333 Q ss_pred C--CcccchHHHhhccchhhhhheee-eeEEEEEEEEecceeeecCCCCCCC--Ccc--eeeEEEeeeccCCcccccchH Q lcl|Aclame:pro 256 S--DQVEDIEDELAIAGFHEVARRSV-KRRRVYVSVVDGDGFLEKPRRIPGE--HIP--LIPVYGKRWFIDDIERVEGHI 328 (708) Q Consensus 256 ~--~~~~~~~~~~~~~~~~~~~~~~~-~~~~v~~~~~~~~~il~~~~~~p~~--~~p--~~p~~~~~~~~d~~~~~~G~v 328 (708) . +....+...-......+...... ..+++... ...++.++.-.-.+ .|| ++|++.+..+.+ ...|.. T Consensus 271 ~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv---~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~---~~~g~~ 344 (714) T protein:vir:10 271 LSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRI---REAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRK---DKTGEP 344 (714) T ss_pred cCCCceEEeCccCHHHHHHHhhcchhhhccccceE---EEEEEecCcccccCCCCCCCCceeEEEEeeeee---eccCce Confidence 2 22222222211111112111111 12222221 11122222222111 133 366665543322 124444 Q ss_pred HhhhHHHHHHHHHHHHHHHHHhhcCC----CceeechhhccchHHHHHhhcccCCceeee-----cccccccccc----- Q lcl|Aclame:pro 329 AKAMDPQRLYNLQVSMLADTAAQDPG----QIPIVGMEQIRGLEKHWEARNKKRPAFLPL-----REVRDKSGNI----- 394 (708) Q Consensus 329 r~~~d~Q~~~N~~~s~~~~~l~~~~~----~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~----- 394 (708) ..++ +.+.++-..-++ -.++.... ..+ ..++++... ......++-+ T Consensus 345 ~G~v----------r~~~d~Qr~~N~~~s~~~~~l~~~--~~~--------~~~~a~~~~d~~~~e~~arp~~vi~~~p~ 404 (714) T protein:vir:10 345 YGLI----------SRAIPAQDEVNFRRIKLTWLLQAK--RVI--------MDEDATQLSDNDLMEQIERPDGIIKLNPV 404 (714) T ss_pred eehh----------hhchhHHHHHHHHHHHHHHhhcCC--cee--------eecCcccccHHHHHHhccCCCCceeeccc Confidence 4332 222222111100 00111100 000 001111000 0000011111 Q ss_pred -cccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccc---hhHHHHHHHHHHHHHHH-HHHHHHHHHHHH Q lcl|Aclame:pro 395 -IAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSN---IAQETVNNLMNRADMAS-FIYLDNMAKSLK 469 (708) Q Consensus 395 -~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n---~sg~ai~~~q~q~~~~~-~~~~dn~~~~~~ 469 (708) ..++.++..+++..-++-....++.+......+ ....|.... ..+.+.+...-+..... ...+-.+-..++ T Consensus 405 ~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i----~~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~ 480 (714) T protein:vir:10 405 RKNQKSVADVFRVEQDFQVASQQFQVMQESEKLI----QDTMGVYSAFLGQDSGATSGVAISNLVEQGATTLAEINDNYQ 480 (714) T ss_pred ccccCCCCccccccCCCCccHHHHHHHHHHHHHH----HHhhCCChHHcCCCccchhHHHHHHHHHHHHHHHHHHHHHHH Confidence 122333444555444455555666655555555 344554432 23444443322111111 111112222333 Q ss_pred HHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccc-ee--eEEE---EEeecccchhHHHHH Q lcl|Aclame:pro 470 RAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLS-VG--RYDV---TVDVGPSYTARRDAT 543 (708) Q Consensus 470 ~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~-~g--~~Dv---~v~~~~~~~~~r~~~ 543 (708) ...+.+-+++..+.. .+. +.++.+.|-...........+.+|.-+ .| .-|| ..++..+........ T Consensus 481 ~~~~~~g~~lL~li~-----~~~---~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~ 552 (714) T protein:vir:10 481 FACQQVGRLLLAYLL-----DDL---KKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAF 552 (714) T ss_pred HHHHHHHHHHHHHHH-----HHc---CCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchHH Confidence 444444444443332 112 233444443221111111234444222 11 1233 345555555444444 Q ss_pred HHHHH-HHHHhccccCchhHHHHHHHHhh-ccchhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 544 VSVLT-NVLSSMLPTDPMRPAIQGIILDN-IDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMV 621 (708) Q Consensus 544 ~~~l~-~llq~~~~~~p~~~~~~~~~~~~-~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~ 621 (708) .++.. .|++.++. +.+.....++.. ++.-......+-++..........+ +.+++.+++++ +++++++++. T Consensus 553 r~~~~~~l~~l~~~---~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~--~~~~~~e~q~~--~~~~q~~~~~ 625 (714) T protein:vir:10 553 KAQLAQRMSEVIQG---LPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKS--PDEMTPEEQEV--AAQQQALQQQ 625 (714) T ss_pred HHHHHHHHHHHHhh---cCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCC--ccccchhhHHH--HHHHHHHHHH Confidence 44444 44333211 112212122221 1111122222333322222222222 22222222222 2222333344 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHH-HHhhhhhhhhhhhc- Q lcl|Aclame:pro 622 LAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAV--MEAIRL-LKDVAESQQQQFQS- 697 (708) Q Consensus 622 ~aq~~~~~~qae~~k~~~~~~~~q~e~~~~~~~~~~~~a~~~~~~~q~~~~~~~~~--~~~~~~-~~~~~~~~~~~~~~- 697 (708) ++++++.++++++++.+++++++++.+.+...+++...+.+..........++.+. ++..+. .+...-.+++..+. T Consensus 626 q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~q~~q~~ 705 (714) T protein:vir:10 626 QAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQNMEQEQDVLQQQMLYTL 705 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhhhhHHHHHHHHHHH Confidence 45556667778888888888777776666555555444443332222222222211 111111 11122222222222 Q ss_pred -CCCCCCCC Q lcl|Aclame:pro 698 -PPQSPADL 705 (708) Q Consensus 698 -~~~~~~e~ 705 (708) +-....+| T Consensus 706 ~~~~~~~~~ 714 (714) T protein:vir:10 706 QQRMNEMSL 714 (714) T ss_pred HHHHHhcCC Confidence 22223333 No 135 >protein:vir:9950 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859080;genbank:gi:32170835;genbank:GeneID:2653184 Probab=98.43 E-value=6.9e-07 Score=54.38 Aligned_cols=610 Identities=13% Similarity=0.051 Sum_probs=197.8 Q ss_pred CCcchHHH--------HHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHH Q lcl|Aclame:pro 1 MAETLEKK--------HERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATE 72 (708) Q Consensus 1 ma~~~~~~--------~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~ 72 (708) |.|..--. ..+++.++..++....+...+|+.+. ..+.... .|. ...+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a-----------~~d~~fy------~G~------Qw~~~ 57 (714) T protein:vir:99 1 MKNETNTMATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAA-----------NKACAYY------DGD------QLPPE 57 (714) T ss_pred CCcccccccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHH-----------HHHHHhh------cCC------CCCHH Confidence 88743222 13355555555555444444443221 1122222 221 22223 Q ss_pred HHHHHHHHhcCcceeEE-------------------ecCCCcchHHHH-HHHHHHHHHHHHhcChHHHHHHHHHHHhhcC Q lcl|Aclame:pro 73 LNRIIAEYRNNRITVKF-------------------RPGDREASEELA-NKLNGLFRADYEETDGGEACDNAFDDAATGG 132 (708) Q Consensus 73 i~~i~g~~~~nr~~~~v-------------------~pr~~~~d~~~A-~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G 132 (708) +..++-. +.+|-+.+ .++--..+.+-+ ..+.+++..+....-.......++.++..++ T Consensus 58 ~~~~l~~--~g~p~~~~N~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~ 135 (714) T protein:vir:99 58 VLQVLKD--RGQPMTIHNLIAPTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQ 135 (714) T ss_pred HHHHHHh--cCCCcEEeccHHHHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHh Confidence 3333322 22222221 111111121112 1244444444333333333444555555555 Q ss_pred eeEEEEEeeccccCCCCCCCcceeeEEeecchhheecCCccccCCh--h--ccCeEEE---eecCCHHHHHHhCCCCc-- Q lcl|Aclame:pro 133 FGCFRLTSMLVNEYDPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDK--S--DALWAFC---MYSLSPEKYEAEYGKKP-- 203 (708) Q Consensus 133 ~G~~~v~~~~~~~~d~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~--s--Da~~~~~---~~~~~~~e~~~~~p~~~-- 203 (708) +.+ .++ |..+++|++....++ . +..-+++ .+..|.++..-.|=... T Consensus 136 ~~~-G~G------------------------~~~~~~~~d~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~ 190 (714) T protein:vir:99 136 IKA-GLS------------------------WVEVRRNSDPFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMD 190 (714) T ss_pred hhc-Ccc------------------------eEEeccccCCCCCCeEEEecchhheeeccccccCChhhccceeeeecCC Confidence 421 111 112233332221110 0 0000111 11122233221110000 Q ss_pred ccccc------cccccccccCCCCC---------c-------------eeEEeeeeeecceEEEEEEEecCccCceeEec Q lcl|Aclame:pro 204 PTSLD------VTSMTSWEYNWFGA---------D-------------VIYIAKYYEVRKESVDVISYRHPITGEIATYD 255 (708) Q Consensus 204 ~~~~d------~~~~~~~~~~~~~~---------~-------------~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~ 255 (708) -+.+. .........+|.+. . ...-..|+.++++++++++||.+.-.....+. T Consensus 191 ~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~ 270 (714) T protein:vir:99 191 TDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIE 270 (714) T ss_pred HHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeec Confidence 00000 00011111122110 0 11123466677788887777764333333333 Q ss_pred C--CcccchHHHhhccchhhhhheee-eeEEEEEEEEecceeeecCCCCCCC--Ccc--eeeEEEeeeccCCcccccchH Q lcl|Aclame:pro 256 S--DQVEDIEDELAIAGFHEVARRSV-KRRRVYVSVVDGDGFLEKPRRIPGE--HIP--LIPVYGKRWFIDDIERVEGHI 328 (708) Q Consensus 256 ~--~~~~~~~~~~~~~~~~~~~~~~~-~~~~v~~~~~~~~~il~~~~~~p~~--~~p--~~p~~~~~~~~d~~~~~~G~v 328 (708) . +....+...-......+...... ..+++... ...++.++.-.-.+ .|| ++|++.+..+.+ ...|.. T Consensus 271 ~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv---~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~---~~~g~~ 344 (714) T protein:vir:99 271 LSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRI---REAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRK---DKTGEP 344 (714) T ss_pred cCCCceEEeCccCHHHHHHHhhcchhhhccccceE---EEEEEecCcccccCCCCCCCCceeEEEEeeeee---eccCce Confidence 2 22222222211111112111111 12222221 11122222222111 133 366665543322 124444 Q ss_pred HhhhHHHHHHHHHHHHHHHHHhhcCC----CceeechhhccchHHHHHhhcccCCceeee-----cccccccccc----- Q lcl|Aclame:pro 329 AKAMDPQRLYNLQVSMLADTAAQDPG----QIPIVGMEQIRGLEKHWEARNKKRPAFLPL-----REVRDKSGNI----- 394 (708) Q Consensus 329 r~~~d~Q~~~N~~~s~~~~~l~~~~~----~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~----- 394 (708) ..++ +.+.++-..-++ -.++.... ..+ ..++++... ......++-+ T Consensus 345 ~G~v----------r~~~d~Qr~~N~~~s~~~~~l~~~--~~~--------~~~~a~~~~d~~~~e~~arp~~vi~~~p~ 404 (714) T protein:vir:99 345 YGLI----------SRAIPAQDEVNFRRIKLTWLLQAK--RVI--------MDEDATQLSDNDLMEQIERPDGIIKLNPV 404 (714) T ss_pred eehh----------hhchhHHHHHHHHHHHHHHhhcCC--cee--------eecCcccccHHHHHHhccCCCCceeeccc Confidence 4332 222222111100 00111100 000 001111000 0000011111 Q ss_pred -cccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccc---hhHHHHHHHHHHHHHHH-HHHHHHHHHHHH Q lcl|Aclame:pro 395 -IAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSN---IAQETVNNLMNRADMAS-FIYLDNMAKSLK 469 (708) Q Consensus 395 -~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n---~sg~ai~~~q~q~~~~~-~~~~dn~~~~~~ 469 (708) ..++.++..+++..-++-....++.+......+ ....|.... ..+.+.+...-+..... ...+-.+-..++ T Consensus 405 ~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i----~~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~ 480 (714) T protein:vir:99 405 RKNQKSVADVFRVEQDFQVASQQFQVMQESEKLI----QDTMGVYSAFLGQDSGATSGVAISNLVEQGATTLAEINDNYQ 480 (714) T ss_pred ccccCCCCccccccCCCCccHHHHHHHHHHHHHH----HHhhCCChHHcCCCccchhHHHHHHHHHHHHHHHHHHHHHHH Confidence 122333444555444455555666655555555 344554432 23444443322111111 111112222333 Q ss_pred HHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccc-ee--eEEE---EEeecccchhHHHHH Q lcl|Aclame:pro 470 RAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLS-VG--RYDV---TVDVGPSYTARRDAT 543 (708) Q Consensus 470 ~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~-~g--~~Dv---~v~~~~~~~~~r~~~ 543 (708) ...+.+-+++..+.. .+. +.++.+.|-...........+.+|.-+ .| .-|| ..++..+........ T Consensus 481 ~~~~~~g~~lL~li~-----~~~---~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~ 552 (714) T protein:vir:99 481 FACQQVGRLLLAYLL-----DDL---KKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAF 552 (714) T ss_pred HHHHHHHHHHHHHHH-----HHc---CCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchHH Confidence 444444444443332 112 233444443221111111234444222 11 1233 345555555444444 Q ss_pred HHHHH-HHHHhccccCchhHHHHHHHHhh-ccchhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 544 VSVLT-NVLSSMLPTDPMRPAIQGIILDN-IDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMV 621 (708) Q Consensus 544 ~~~l~-~llq~~~~~~p~~~~~~~~~~~~-~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~ 621 (708) .++.. .|++.++. +.+.....++.. ++.-......+-++..........+ +.+++.+++++ +++++++++. T Consensus 553 r~~~~~~l~~l~~~---~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~--~~~~~~e~q~~--~~~~q~~~~~ 625 (714) T protein:vir:99 553 KAQLAQRMSEVIQG---LPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKS--PDEMTPEEQEV--AAQQQALQQQ 625 (714) T ss_pred HHHHHHHHHHHHhh---cCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCC--ccccchhhHHH--HHHHHHHHHH Confidence 44444 44333211 112212122221 1111122222333322222222222 22222222222 2222333344 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHH-HHhhhhhhhhhhhc- Q lcl|Aclame:pro 622 LAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAV--MEAIRL-LKDVAESQQQQFQS- 697 (708) Q Consensus 622 ~aq~~~~~~qae~~k~~~~~~~~q~e~~~~~~~~~~~~a~~~~~~~q~~~~~~~~~--~~~~~~-~~~~~~~~~~~~~~- 697 (708) ++++++.++++++++.+++++++++.+.+...+++...+.+..........++.+. ++..+. .+...-.+++..+. T Consensus 626 q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~q~~q~~ 705 (714) T protein:vir:99 626 QAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQNMEQEQDVLQQQMLYTL 705 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhhhhHHHHHHHHHHH Confidence 45556667778888888888777776666555555444443332222222222211 111111 11122222222222 Q ss_pred -CCCCCCCC Q lcl|Aclame:pro 698 -PPQSPADL 705 (708) Q Consensus 698 -~~~~~~e~ 705 (708) +-....+| T Consensus 706 ~~~~~~~~~ 714 (714) T protein:vir:99 706 QQRMNEMSL 714 (714) T ss_pred HHHHHhcCC Confidence 22223333 No 136 >protein:vir:2764 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612881;genbank:gi:20065798;genbank:GeneID:935623 Probab=98.43 E-value=6.9e-07 Score=54.38 Aligned_cols=610 Identities=13% Similarity=0.051 Sum_probs=197.8 Q ss_pred CCcchHHH--------HHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHH Q lcl|Aclame:pro 1 MAETLEKK--------HERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATE 72 (708) Q Consensus 1 ma~~~~~~--------~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~ 72 (708) |.|..--. ..+++.++..++....+...+|+.+. ..+.... .|. ...+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a-----------~~d~~fy------~G~------Qw~~~ 57 (714) T protein:vir:27 1 MKNETNTMATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAA-----------NKACAYY------DGD------QLPPE 57 (714) T ss_pred CCcccccccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHH-----------HHHHHhh------cCC------CCCHH Confidence 88743222 13355555555555444444443221 1122222 221 22223 Q ss_pred HHHHHHHHhcCcceeEE-------------------ecCCCcchHHHH-HHHHHHHHHHHHhcChHHHHHHHHHHHhhcC Q lcl|Aclame:pro 73 LNRIIAEYRNNRITVKF-------------------RPGDREASEELA-NKLNGLFRADYEETDGGEACDNAFDDAATGG 132 (708) Q Consensus 73 i~~i~g~~~~nr~~~~v-------------------~pr~~~~d~~~A-~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G 132 (708) +..++-. +.+|-+.+ .++--..+.+-+ ..+.+++..+....-.......++.++..++ T Consensus 58 ~~~~l~~--~g~p~~~~N~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~ 135 (714) T protein:vir:27 58 VLQVLKD--RGQPMTIHNLIAPTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQ 135 (714) T ss_pred HHHHHHh--cCCCcEEeccHHHHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHh Confidence 3333322 22222221 111111121112 1244444444333333333444555555555 Q ss_pred eeEEEEEeeccccCCCCCCCcceeeEEeecchhheecCCccccCCh--h--ccCeEEE---eecCCHHHHHHhCCCCc-- Q lcl|Aclame:pro 133 FGCFRLTSMLVNEYDPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDK--S--DALWAFC---MYSLSPEKYEAEYGKKP-- 203 (708) Q Consensus 133 ~G~~~v~~~~~~~~d~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~--s--Da~~~~~---~~~~~~~e~~~~~p~~~-- 203 (708) +.+ .++ |..+++|++....++ . +..-+++ .+..|.++..-.|=... T Consensus 136 ~~~-G~G------------------------~~~~~~~~d~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~ 190 (714) T protein:vir:27 136 IKA-GLS------------------------WVEVRRNSDPFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMD 190 (714) T ss_pred hhc-Ccc------------------------eEEeccccCCCCCCeEEEecchhheeeccccccCChhhccceeeeecCC Confidence 421 111 112233332221110 0 0000111 11122233221110000 Q ss_pred ccccc------cccccccccCCCCC---------c-------------eeEEeeeeeecceEEEEEEEecCccCceeEec Q lcl|Aclame:pro 204 PTSLD------VTSMTSWEYNWFGA---------D-------------VIYIAKYYEVRKESVDVISYRHPITGEIATYD 255 (708) Q Consensus 204 ~~~~d------~~~~~~~~~~~~~~---------~-------------~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~ 255 (708) -+.+. .........+|.+. . ...-..|+.++++++++++||.+.-.....+. T Consensus 191 ~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~ 270 (714) T protein:vir:27 191 TDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIE 270 (714) T ss_pred HHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeec Confidence 00000 00011111122110 0 11123466677788887777764333333333 Q ss_pred C--CcccchHHHhhccchhhhhheee-eeEEEEEEEEecceeeecCCCCCCC--Ccc--eeeEEEeeeccCCcccccchH Q lcl|Aclame:pro 256 S--DQVEDIEDELAIAGFHEVARRSV-KRRRVYVSVVDGDGFLEKPRRIPGE--HIP--LIPVYGKRWFIDDIERVEGHI 328 (708) Q Consensus 256 ~--~~~~~~~~~~~~~~~~~~~~~~~-~~~~v~~~~~~~~~il~~~~~~p~~--~~p--~~p~~~~~~~~d~~~~~~G~v 328 (708) . +....+...-......+...... ..+++... ...++.++.-.-.+ .|| ++|++.+..+.+ ...|.. T Consensus 271 ~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv---~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~---~~~g~~ 344 (714) T protein:vir:27 271 LSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRI---REAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRK---DKTGEP 344 (714) T ss_pred cCCCceEEeCccCHHHHHHHhhcchhhhccccceE---EEEEEecCcccccCCCCCCCCceeEEEEeeeee---eccCce Confidence 2 22222222211111112111111 12222221 11122222222111 133 366665543322 124444 Q ss_pred HhhhHHHHHHHHHHHHHHHHHhhcCC----CceeechhhccchHHHHHhhcccCCceeee-----cccccccccc----- Q lcl|Aclame:pro 329 AKAMDPQRLYNLQVSMLADTAAQDPG----QIPIVGMEQIRGLEKHWEARNKKRPAFLPL-----REVRDKSGNI----- 394 (708) Q Consensus 329 r~~~d~Q~~~N~~~s~~~~~l~~~~~----~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~----- 394 (708) ..++ +.+.++-..-++ -.++.... ..+ ..++++... ......++-+ T Consensus 345 ~G~v----------r~~~d~Qr~~N~~~s~~~~~l~~~--~~~--------~~~~a~~~~d~~~~e~~arp~~vi~~~p~ 404 (714) T protein:vir:27 345 YGLI----------SRAIPAQDEVNFRRIKLTWLLQAK--RVI--------MDEDATQLSDNDLMEQIERPDGIIKLNPV 404 (714) T ss_pred eehh----------hhchhHHHHHHHHHHHHHHhhcCC--cee--------eecCcccccHHHHHHhccCCCCceeeccc Confidence 4332 222222111100 00111100 000 001111000 0000011111 Q ss_pred -cccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccc---hhHHHHHHHHHHHHHHH-HHHHHHHHHHHH Q lcl|Aclame:pro 395 -IAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSN---IAQETVNNLMNRADMAS-FIYLDNMAKSLK 469 (708) Q Consensus 395 -~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n---~sg~ai~~~q~q~~~~~-~~~~dn~~~~~~ 469 (708) ..++.++..+++..-++-....++.+......+ ....|.... ..+.+.+...-+..... ...+-.+-..++ T Consensus 405 ~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i----~~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~ 480 (714) T protein:vir:27 405 RKNQKSVADVFRVEQDFQVASQQFQVMQESEKLI----QDTMGVYSAFLGQDSGATSGVAISNLVEQGATTLAEINDNYQ 480 (714) T ss_pred ccccCCCCccccccCCCCccHHHHHHHHHHHHHH----HHhhCCChHHcCCCccchhHHHHHHHHHHHHHHHHHHHHHHH Confidence 122333444555444455555666655555555 344554432 23444443322111111 111112222333 Q ss_pred HHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccc-ee--eEEE---EEeecccchhHHHHH Q lcl|Aclame:pro 470 RAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLS-VG--RYDV---TVDVGPSYTARRDAT 543 (708) Q Consensus 470 ~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~-~g--~~Dv---~v~~~~~~~~~r~~~ 543 (708) ...+.+-+++..+.. .+. +.++.+.|-...........+.+|.-+ .| .-|| ..++..+........ T Consensus 481 ~~~~~~g~~lL~li~-----~~~---~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~ 552 (714) T protein:vir:27 481 FACQQVGRLLLAYLL-----DDL---KKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAF 552 (714) T ss_pred HHHHHHHHHHHHHHH-----HHc---CCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchHH Confidence 444444444443332 112 233444443221111111234444222 11 1233 345555555444444 Q ss_pred HHHHH-HHHHhccccCchhHHHHHHHHhh-ccchhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 544 VSVLT-NVLSSMLPTDPMRPAIQGIILDN-IDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMV 621 (708) Q Consensus 544 ~~~l~-~llq~~~~~~p~~~~~~~~~~~~-~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~ 621 (708) .++.. .|++.++. +.+.....++.. ++.-......+-++..........+ +.+++.+++++ +++++++++. T Consensus 553 r~~~~~~l~~l~~~---~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~--~~~~~~e~q~~--~~~~q~~~~~ 625 (714) T protein:vir:27 553 KAQLAQRMSEVIQG---LPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKS--PDEMTPEEQEV--AAQQQALQQQ 625 (714) T ss_pred HHHHHHHHHHHHhh---cCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCC--ccccchhhHHH--HHHHHHHHHH Confidence 44444 44333211 112212122221 1111122222333322222222222 22222222222 2222333344 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHH-HHhhhhhhhhhhhc- Q lcl|Aclame:pro 622 LAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAV--MEAIRL-LKDVAESQQQQFQS- 697 (708) Q Consensus 622 ~aq~~~~~~qae~~k~~~~~~~~q~e~~~~~~~~~~~~a~~~~~~~q~~~~~~~~~--~~~~~~-~~~~~~~~~~~~~~- 697 (708) ++++++.++++++++.+++++++++.+.+...+++...+.+..........++.+. ++..+. .+...-.+++..+. T Consensus 626 q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~q~~q~~ 705 (714) T protein:vir:27 626 QAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQNMEQEQDVLQQQMLYTL 705 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhhhhHHHHHHHHHHH Confidence 45556667778888888888777776666555555444443332222222222211 111111 11122222222222 Q ss_pred -CCCCCCCC Q lcl|Aclame:pro 698 -PPQSPADL 705 (708) Q Consensus 698 -~~~~~~e~ 705 (708) +-....+| T Consensus 706 ~~~~~~~~~ 714 (714) T protein:vir:27 706 QQRMNEMSL 714 (714) T ss_pred HHHHHhcCC Confidence 22223333 No 137 >protein:vir:817 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050550;genbank:gi:9633447;genbank:GeneID:1262279 Probab=98.43 E-value=6.9e-07 Score=54.38 Aligned_cols=610 Identities=13% Similarity=0.051 Sum_probs=197.8 Q ss_pred CCcchHHH--------HHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHH Q lcl|Aclame:pro 1 MAETLEKK--------HERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATE 72 (708) Q Consensus 1 ma~~~~~~--------~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~ 72 (708) |.|..--. ..+++.++..++....+...+|+.+. ..+.... .|. ...+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a-----------~~d~~fy------~G~------Qw~~~ 57 (714) T protein:vir:81 1 MKNETNTMATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAA-----------NKACAYY------DGD------QLPPE 57 (714) T ss_pred CCcccccccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHH-----------HHHHHhh------cCC------CCCHH Confidence 88743222 13355555555555444444443221 1122222 221 22223 Q ss_pred HHHHHHHHhcCcceeEE-------------------ecCCCcchHHHH-HHHHHHHHHHHHhcChHHHHHHHHHHHhhcC Q lcl|Aclame:pro 73 LNRIIAEYRNNRITVKF-------------------RPGDREASEELA-NKLNGLFRADYEETDGGEACDNAFDDAATGG 132 (708) Q Consensus 73 i~~i~g~~~~nr~~~~v-------------------~pr~~~~d~~~A-~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G 132 (708) +..++-. +.+|-+.+ .++--..+.+-+ ..+.+++..+....-.......++.++..++ T Consensus 58 ~~~~l~~--~g~p~~~~N~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~ 135 (714) T protein:vir:81 58 VLQVLKD--RGQPMTIHNLIAPTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQ 135 (714) T ss_pred HHHHHHh--cCCCcEEeccHHHHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHh Confidence 3333322 22222221 111111121112 1244444444333333333444555555555 Q ss_pred eeEEEEEeeccccCCCCCCCcceeeEEeecchhheecCCccccCCh--h--ccCeEEE---eecCCHHHHHHhCCCCc-- Q lcl|Aclame:pro 133 FGCFRLTSMLVNEYDPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDK--S--DALWAFC---MYSLSPEKYEAEYGKKP-- 203 (708) Q Consensus 133 ~G~~~v~~~~~~~~d~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~--s--Da~~~~~---~~~~~~~e~~~~~p~~~-- 203 (708) +.+ .++ |..+++|++....++ . +..-+++ .+..|.++..-.|=... T Consensus 136 ~~~-G~G------------------------~~~~~~~~d~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~ 190 (714) T protein:vir:81 136 IKA-GLS------------------------WVEVRRNSDPFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMD 190 (714) T ss_pred hhc-Ccc------------------------eEEeccccCCCCCCeEEEecchhheeeccccccCChhhccceeeeecCC Confidence 421 111 112233332221110 0 0000111 11122233221110000 Q ss_pred ccccc------cccccccccCCCCC---------c-------------eeEEeeeeeecceEEEEEEEecCccCceeEec Q lcl|Aclame:pro 204 PTSLD------VTSMTSWEYNWFGA---------D-------------VIYIAKYYEVRKESVDVISYRHPITGEIATYD 255 (708) Q Consensus 204 ~~~~d------~~~~~~~~~~~~~~---------~-------------~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~ 255 (708) -+.+. .........+|.+. . ...-..|+.++++++++++||.+.-.....+. T Consensus 191 ~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~ 270 (714) T protein:vir:81 191 TDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIE 270 (714) T ss_pred HHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeec Confidence 00000 00011111122110 0 11123466677788887777764333333333 Q ss_pred C--CcccchHHHhhccchhhhhheee-eeEEEEEEEEecceeeecCCCCCCC--Ccc--eeeEEEeeeccCCcccccchH Q lcl|Aclame:pro 256 S--DQVEDIEDELAIAGFHEVARRSV-KRRRVYVSVVDGDGFLEKPRRIPGE--HIP--LIPVYGKRWFIDDIERVEGHI 328 (708) Q Consensus 256 ~--~~~~~~~~~~~~~~~~~~~~~~~-~~~~v~~~~~~~~~il~~~~~~p~~--~~p--~~p~~~~~~~~d~~~~~~G~v 328 (708) . +....+...-......+...... ..+++... ...++.++.-.-.+ .|| ++|++.+..+.+ ...|.. T Consensus 271 ~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv---~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~---~~~g~~ 344 (714) T protein:vir:81 271 LSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRI---REAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRK---DKTGEP 344 (714) T ss_pred cCCCceEEeCccCHHHHHHHhhcchhhhccccceE---EEEEEecCcccccCCCCCCCCceeEEEEeeeee---eccCce Confidence 2 22222222211111112111111 12222221 11122222222111 133 366665543322 124444 Q ss_pred HhhhHHHHHHHHHHHHHHHHHhhcCC----CceeechhhccchHHHHHhhcccCCceeee-----cccccccccc----- Q lcl|Aclame:pro 329 AKAMDPQRLYNLQVSMLADTAAQDPG----QIPIVGMEQIRGLEKHWEARNKKRPAFLPL-----REVRDKSGNI----- 394 (708) Q Consensus 329 r~~~d~Q~~~N~~~s~~~~~l~~~~~----~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~----- 394 (708) ..++ +.+.++-..-++ -.++.... ..+ ..++++... ......++-+ T Consensus 345 ~G~v----------r~~~d~Qr~~N~~~s~~~~~l~~~--~~~--------~~~~a~~~~d~~~~e~~arp~~vi~~~p~ 404 (714) T protein:vir:81 345 YGLI----------SRAIPAQDEVNFRRIKLTWLLQAK--RVI--------MDEDATQLSDNDLMEQIERPDGIIKLNPV 404 (714) T ss_pred eehh----------hhchhHHHHHHHHHHHHHHhhcCC--cee--------eecCcccccHHHHHHhccCCCCceeeccc Confidence 4332 222222111100 00111100 000 001111000 0000011111 Q ss_pred -cccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccc---hhHHHHHHHHHHHHHHH-HHHHHHHHHHHH Q lcl|Aclame:pro 395 -IAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSN---IAQETVNNLMNRADMAS-FIYLDNMAKSLK 469 (708) Q Consensus 395 -~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n---~sg~ai~~~q~q~~~~~-~~~~dn~~~~~~ 469 (708) ..++.++..+++..-++-....++.+......+ ....|.... ..+.+.+...-+..... ...+-.+-..++ T Consensus 405 ~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i----~~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~ 480 (714) T protein:vir:81 405 RKNQKSVADVFRVEQDFQVASQQFQVMQESEKLI----QDTMGVYSAFLGQDSGATSGVAISNLVEQGATTLAEINDNYQ 480 (714) T ss_pred ccccCCCCccccccCCCCccHHHHHHHHHHHHHH----HHhhCCChHHcCCCccchhHHHHHHHHHHHHHHHHHHHHHHH Confidence 122333444555444455555666655555555 344554432 23444443322111111 111112222333 Q ss_pred HHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccc-ee--eEEE---EEeecccchhHHHHH Q lcl|Aclame:pro 470 RAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLS-VG--RYDV---TVDVGPSYTARRDAT 543 (708) Q Consensus 470 ~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~-~g--~~Dv---~v~~~~~~~~~r~~~ 543 (708) ...+.+-+++..+.. .+. +.++.+.|-...........+.+|.-+ .| .-|| ..++..+........ T Consensus 481 ~~~~~~g~~lL~li~-----~~~---~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~ 552 (714) T protein:vir:81 481 FACQQVGRLLLAYLL-----DDL---KKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAF 552 (714) T ss_pred HHHHHHHHHHHHHHH-----HHc---CCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchHH Confidence 444444444443332 112 233444443221111111234444222 11 1233 345555555444444 Q ss_pred HHHHH-HHHHhccccCchhHHHHHHHHhh-ccchhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 544 VSVLT-NVLSSMLPTDPMRPAIQGIILDN-IDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMV 621 (708) Q Consensus 544 ~~~l~-~llq~~~~~~p~~~~~~~~~~~~-~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~ 621 (708) .++.. .|++.++. +.+.....++.. ++.-......+-++..........+ +.+++.+++++ +++++++++. T Consensus 553 r~~~~~~l~~l~~~---~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~--~~~~~~e~q~~--~~~~q~~~~~ 625 (714) T protein:vir:81 553 KAQLAQRMSEVIQG---LPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKS--PDEMTPEEQEV--AAQQQALQQQ 625 (714) T ss_pred HHHHHHHHHHHHhh---cCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCC--ccccchhhHHH--HHHHHHHHHH Confidence 44444 44333211 112212122221 1111122222333322222222222 22222222222 2222333344 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHH-HHhhhhhhhhhhhc- Q lcl|Aclame:pro 622 LAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAV--MEAIRL-LKDVAESQQQQFQS- 697 (708) Q Consensus 622 ~aq~~~~~~qae~~k~~~~~~~~q~e~~~~~~~~~~~~a~~~~~~~q~~~~~~~~~--~~~~~~-~~~~~~~~~~~~~~- 697 (708) ++++++.++++++++.+++++++++.+.+...+++...+.+..........++.+. ++..+. .+...-.+++..+. T Consensus 626 q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~q~~q~~ 705 (714) T protein:vir:81 626 QAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQNMEQEQDVLQQQMLYTL 705 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhhhhHHHHHHHHHHH Confidence 45556667778888888888777776666555555444443332222222222211 111111 11122222222222 Q ss_pred -CCCCCCCC Q lcl|Aclame:pro 698 -PPQSPADL 705 (708) Q Consensus 698 -~~~~~~e~ 705 (708) +-....+| T Consensus 706 ~~~~~~~~~ 714 (714) T protein:vir:81 706 QQRMNEMSL 714 (714) T ss_pred HHHHHhcCC Confidence 22223333 No 138 >protein:vir:96403 Length: 666 # NCBI annotation: hypothetical protein # Family: family:all:11276 # MgeID: mge:1616 # MgeName: 119X # Cross-refs: genbank:acc:YP_001218810;genbank:gi:147917327;genbank:GeneID:5142606 Probab=98.29 E-value=1.5e-07 Score=58.07 Aligned_cols=579 Identities=15% Similarity=0.103 Sum_probs=223.2 Q ss_pred CCcchH----------HHHHH-HHHHHHHHHHhh----HHHHHHH-HHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCce Q lcl|Aclame:pro 1 MAETLE----------KKHER-IMLRFDRAYSPQ----KEVREKC-IEATRFARVPGGQWEGATAAGTKLDEQFEKYPKF 64 (708) Q Consensus 1 ma~~~~----------~~~~~-~~~~~~~~~~~~----~~~r~~~-~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~ 64 (708) ||-++. .+.++ ++..+++..+.. ..+..+. .-|+.|+.|.--|=..+.+. .. ++.+...||- T Consensus 1 maispsepninsfvytqrvdellkahlkkildfsktnkanyiqKMD~ID~AYARY~~~~~N~~LlG-~~-~~~~A~V~C~ 78 (666) T protein:vir:96 1 MAISPSEPNINSFVYTQRVDELLKAHLKKILDFSKTNKANYIQKMDLIDKAYARYITAQENNELLG-YN-QNIAAKVRCQ 78 (666) T ss_pred CccCCCCCcchhhhhHHHHHHHHHHHHHHHhhhhccchhhHHHHhhHHHHhHHhhhhccCCCceee-ec-ccccccccce Confidence 443211 11122 222233333322 2222222 23455555554443333321 11 1122345676 Q ss_pred eecc--h----HHHHHHHHHHHhc----CcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCee Q lcl|Aclame:pro 65 EINK--V----ATELNRIIAEYRN----NRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFG 134 (708) Q Consensus 65 ~~N~--i----~~~i~~i~g~~~~----nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G 134 (708) +||. + -.-|.+++|+... ..|-.-|... .+..+-||.|+++|..-.....+-...--.+.|+++.-+- T Consensus 79 V~~~~~V~PIViSQV~S~~~YLT~VF~SG~Pi~PVVS~--P~~K~~AE~LE~ii~DH~t~~~~~~~LiL~L~D~~KYN~~ 156 (666) T protein:vir:96 79 VVNKATVNPIVISQVQSMTAYLTEVFASGYPILPVVST--PDKKEQAEALEGIIQDHMTMTSSIPELILCLQDAAKYNLV 156 (666) T ss_pred eeccccCCchhhhhHHHHHHHHHHHHhcCCccceeecC--CchhHHHHHHHHHHHhhhhhhhhHHHHHHHHhhhhhccee Confidence 6654 3 3445666776544 3333333222 2446779999999998777767777777788899988876 Q ss_pred EEEEEeeccccCCC----CCCC-cce----------eeEEeecchhheecCCccccCChh-ccCeEEEeecCCHHHHHHh Q lcl|Aclame:pro 135 CFRLTSMLVNEYDP----MDDR-QRI----------AIEPIYDPSRSVWFDPDAKKYDKS-DALWAFCMYSLSPEKYEAE 198 (708) Q Consensus 135 ~~~v~~~~~~~~d~----~~~~-~~i----------~i~~v~~~~~~v~~Dp~a~~~D~s-Da~~~~~~~~~~~~e~~~~ 198 (708) .|+..|.-..--++ ++-. +.. +|++. +++++||||..--+|.. .-.|+.....+++-.+++. T Consensus 157 ~~ET~Ws~IE~~~~~~~i~~~~~~K~TlrR~~r~~~KIrRL--N~RN~~~D~~~~~~~VA~~G~~~G~~~L~~R~~LKK~ 234 (666) T protein:vir:96 157 GWETEWSNIETYDPQKEITDLEPGKTTLRRNYRHVNKIRRL--NLRNVHWDPIPDIPNVATEGSFLGETTLLNRIQLKKY 234 (666) T ss_pred eeeeccccccccchhhhhhcCCCceeeeccchhhhhhhhcc--ccccccccCCCCCCchhhhhhhhhhHHHHHHHHHHHH Confidence 66654432111111 0111 111 12222 46799999955444533 3567888888887777654 Q ss_pred CC----CC-------ccccccc-cccccc-----------------ccCCC--CCceeEEeeeeeecceEEEEEEEecCc Q lcl|Aclame:pro 199 YG----KK-------PPTSLDV-TSMTSW-----------------EYNWF--GADVIYIAKYYEVRKESVDVISYRHPI 247 (708) Q Consensus 199 ~p----~~-------~~~~~d~-~~~~~~-----------------~~~~~--~~~~~~v~e~~~~~~~~~~~~~~~~~~ 247 (708) .- ++ ....+.. ....++ +.+|. +..+...+.+ .+|+.+..--.+++. T Consensus 235 LN~LT~EKkltykkvV~~Al~~s~~~sD~T~~P~IS~vY~~~~~~SDi~WD~~G~~~T~~sS~--~~rvpvneqg~Y~k~ 312 (666) T protein:vir:96 235 LNYLTNEKKLTYKKVVNEALKSSFQGSDWTDNPQISPVYQEMEMASDINWDRFGGFETETSST--NRRVPVNEQGVYCKH 312 (666) T ss_pred HhhhhcchhhhHHHHHHHHHhhhccccccccCCcccccccccchhhccchhhcCccccccccc--ccccccccccceeee Confidence 21 11 0011000 000010 01110 0000000000 111111111111111 Q ss_pred cCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeee-cCCCCCCCCcceeeEEEeeeccCCccc-cc Q lcl|Aclame:pro 248 TGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLE-KPRRIPGEHIPLIPVYGKRWFIDDIER-VE 325 (708) Q Consensus 248 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~-~~~~~p~~~~p~~p~~~~~~~~d~~~~-~~ 325 (708) +--.....+ +++. ..+. +-.---|....+.++.++. ++--..+++||+- ++++. .||.+. -. T Consensus 313 ~mY~RI~PS-DF~~---~~P~---------~N~~QIWK~v~IN~~~iIS~~~~I~AY~~~~~~--~~~~L-EDGmG~QTQ 376 (666) T protein:vir:96 313 TMYLRIIPS-DFEM---NVPN---------RNQVQIWKAVMINRDAIISFEPYIGAYGSFGMG--LAFAL-EDGMGLQTQ 376 (666) T ss_pred eeeeeeccc-ccee---cCCC---------CCcceeeeeeeeccceeEeeehhhcccchhhhh--hhhhh-hhccccccc Confidence 100001111 1110 0000 0011124444566777775 3333345555553 33332 355433 25 Q ss_pred chHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhccc-CCceeeecccccccccccc--cccccc Q lcl|Aclame:pro 326 GHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKK-RPAFLPLREVRDKSGNIIA--GATPAG 402 (708) Q Consensus 326 G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~--~~~~~~ 402 (708) |+.+..++.|+.-.++++.-.-........+.++++..+.. ...|.+ +..-++.++-...+|.... ...+|. T Consensus 377 ~~~E~~~P~Q~A~t~L~N~~~~~aRRAV~DRAl~~~S~i~a-----~~iNSP~~~~KIP~~~~sL~N~~m~~~Y~~IPFD 451 (666) T protein:vir:96 377 GYGEMAAPLQSATTELWNAYIQGARRAVMDRALYNPSMIRA-----NDINSPIPQIKIPVVPQSLVNGTMDQAYRQIPFD 451 (666) T ss_pred cccccccchhhhhhHHhhhhhhhhhhhhhhhhhcchhhhhh-----hcccCCCCCcccceeehhhhccchhhhhccCCcc Confidence 66778889998777655544433333333344444443321 111111 1112222222222222111 011111 Q ss_pred cccCccchHHHHHHHHHHHHHHHHHhCCChhHccc--ccchhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHH Q lcl|Aclame:pro 403 YTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM--PSNIAQETVNNLMNRADMASFIYLDNMA-KSLKRAGEVWLSMA 479 (708) Q Consensus 403 ~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~--~~n~sg~ai~~~q~q~~~~~~~~~dn~~-~~~~~~~~~~l~li 479 (708) .-.. ....+-.+....--++++|.+...+|+ .+|.|-.--...+..++.++....--+. +.+..+-++|.--| T Consensus 452 ~RG~----E~~~Q~A~~l~~~~r~L~GMN~~~~GQFQKGNKt~~E~~~~MG~a~NRmRLPALiLEH~~F~~iK~~L~LNl 527 (666) T protein:vir:96 452 SRGM----ETVMQNALMLTDWQRELSGMNSATRGQFQKGNKTRAEFDTIMGNAENRMRLPALILEHRMFTKIKEQLKLNL 527 (666) T ss_pred ccch----hHHHhhhHHHhhhHHHhhccCCcccccccccCcceeehhhhcCCcccceehhhHHHhhhhhhhHHHHHhhhh Confidence 1111 112222233445567899999999996 4565433333344444444433222222 22223333333333 Q ss_pred HHhcCCCcEEEEeccCCCceEEEecccccccCCCceEE--eeccceeeEEEEEeecccchhHHHHHHHHHHHHHHhcccc Q lcl|Aclame:pro 480 REVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVA--LNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLPT 557 (708) Q Consensus 480 ~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~--~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq~~~~~ 557 (708) -+|=++..||.- .+|+++. +..++..-..+.+.+|..- ..+.+.-..+..+||.+.-. T Consensus 528 ~~YG~DT~ViS~-------------------RtG~~~~vDi~~L~~~~L~F~~~DGlTP-~SKlASs~~lT~~LQMI~sS 587 (666) T protein:vir:96 528 LMYGEDTEVISP-------------------RTGKGVRVDIKELQDLGLKFELGDGLTP-ASKLASSDFLTALLQMIMSS 587 (666) T ss_pred hhccccchhccc-------------------ccCceeeeeHHHHhhhhheeeeccCCCc-hhhhhhhHHHHHHHHHHhcc Confidence 444444444432 2233321 1112222234556666543 33555555555555543221 Q ss_pred Cchh----HHHHHHHHhhccchhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 558 DPMR----PAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAE 633 (708) Q Consensus 558 ~p~~----~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qae 633 (708) .... +.+-.++.-++.+-++.-+.++....++.=...- ..+|..|+.- .|.+++ ...|-+ T Consensus 588 ~~~~~A~G~~~P~M~AHl~QLGGVRG~E~Y~~~ALPqwqity----gm~Q~LQ~~~-LQ~~~Q-----------SA~Q~~ 651 (666) T protein:vir:96 588 ETTLQAFGTQVPGMIAHLAQLGGVRGFEKYANAALPQWQITY----GMQQQLQQML-LQLQQQ-----------SAMQLQ 651 (666) T ss_pred hhhHhhhcccchHHHHHHHHhccccchhhcccccCcchhhhh----hhhHHHHHHH-HHHhhh-----------hccccc Confidence 1111 1112233344444444444444333222111100 0011111100 000000 000111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 634 AQKATNETAQTQIKAFTAQQDA 655 (708) Q Consensus 634 ~~k~~~~~~~~q~e~~~~~~~~ 655 (708) +. +.++..++.+-.. T Consensus 652 A~-------Q~~L~~~Q~~PSq 666 (666) T protein:vir:96 652 AR-------QGELSNDQSQPSQ 666 (666) T ss_pred cc-------cccCcccccCCCC Confidence 00 0000000000000 No 139 >protein:vir:95821 Length: 763 # NCBI annotation: 94 kDa protein # Family: family:all:1548 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950537;genbank:gi:119952228;genbank:GeneID:5075648 Probab=98.20 E-value=2.7e-06 Score=51.11 Aligned_cols=602 Identities=11% Similarity=0.007 Sum_probs=166.4 Q ss_pred CCcchHHHHHHHHHHHHHHHHhh-------HHHHHHHHHHH---HHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQ-------KEVREKCIEAT---RFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVA 70 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~-------~~~r~~~~~d~---~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~ 70 (708) .++..+-++-.+.+-|-..-+.. .+. +.|...- .|.|...++ |. -++.|.++ T Consensus 79 v~~~ve~~~~~l~~~f~~~~~~~~~~P~~~~D~-~~A~q~t~~~n~~~~~~~~----------------~~-~~~~~~~~ 140 (763) T protein:vir:95 79 VRRQAEWRYSALTEPFLGSNKLFKVTPVTWEDV-QGARQNELVLNYQFRTKLN----------------RV-SFIDNYVR 140 (763) T ss_pred HHHHHHHHHHHHHHhhcCCCcEEEEecCCcchH-HHHHHHHHHHHHHHhhcCc----------------hh-hHHHHHHH Confidence 22122222222222222111110 000 1111000 000100000 00 00111112 Q ss_pred HHHHHHHH-----HHhcCcceeEEecCC---CcchHHHHHHHHHHHHH-HHHh----cChHHHHHHHHHHHhhcCeeE-- Q lcl|Aclame:pro 71 TELNRIIA-----EYRNNRITVKFRPGD---REASEELANKLNGLFRA-DYEE----TDGGEACDNAFDDAATGGFGC-- 135 (708) Q Consensus 71 ~~i~~i~g-----~~~~nr~~~~v~pr~---~~~d~~~A~~l~~~~~~-~~~~----~~~~~~~~~a~~d~~~~G~G~-- 135 (708) ..+..-+| -.+..+.+....... +....+.+..+...+.. ..+- -+.+.....+.......|.++ T Consensus 141 ~~l~~~~gv~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 220 (763) T protein:vir:95 141 SVVDDGTGIVRVGWNREIRKEKQEVPVFSLFPIQTQEQADALQQALQLRTDNPRGYEENVDEAIKESVRFFDETGQATYA 220 (763) T ss_pred HHhhcCcceEEEeeeeeeeeeeeeehhhhhccccchhHHHHHHHHHHhhhhhhccccccccchhhhhhhhccccCcceee Confidence 11111112 111111111111110 00001111111111111 1111 122333333344444444333 Q ss_pred -------EEEEeecccc-----CCCCCCCcceeeEEeecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCCc Q lcl|Aclame:pro 136 -------FRLTSMLVNE-----YDPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKP 203 (708) Q Consensus 136 -------~~v~~~~~~~-----~d~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~ 203 (708) +.+....... .+|.++-.+.......++..-+++.-.-+.-|+-+..|.. .++ +++........ T Consensus 221 ~~~~~~~~~~~~~~k~~p~ie~V~p~d~~iDp~a~sD~~Da~~~~~~~~~t~~dL~~~~~~y--~~~--~~~~~~~~~~~ 296 (763) T protein:vir:95 221 VQTGTTTTEVEVPLANHPTVEMLNPENIIIDPSCQGDINKAMFAIVSFETCKADLLKEKDRY--HNL--NKIDWQSSAPV 296 (763) T ss_pred ecccceeEEEEEEecCceEEEeecHHHheecCCCCCchhhCceEeeEEeccHHHHHhccCCc--ccc--chhcchhcccc Confidence 2222111100 1222221111111100111112221111112332222221 222 11111000000 Q ss_pred c-cccccccccccccCCCCCceeEEeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeE Q lcl|Aclame:pro 204 P-TSLDVTSMTSWEYNWFGADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRR 282 (708) Q Consensus 204 ~-~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 282 (708) . ............+.....+.+.|.|+|.+-.+...-+.+++. +.+.++.+.... T Consensus 297 ~~~~~~~~~~~~~~~~d~~~~~V~v~E~y~~~d~~gdg~~~~~~-----v~~~g~~iL~~~------------------- 352 (763) T protein:vir:95 297 NEPDHATTTPQEFQISDPMRKRVVAYEYWGFWDIEGNGVLEPIV-----ATWIGSTLIRLE------------------- 352 (763) T ss_pred ccccccccchhhccCCCcccceEEEEEeeeeeccCCcceeEEEE-----EEEEcCeeeecc------------------- Confidence 0 000000111112222234678889999864332211111110 011111000000 Q ss_pred EEEEEEEecceeeecCCCCCCCCcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHH---hhcCCCceee Q lcl|Aclame:pro 283 RVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTA---AQDPGQIPIV 359 (708) Q Consensus 283 ~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l---~~~~~~~~i~ 359 (708) . .-+.++.+||-.+|++|..... .|.++..-+.+.-+..=.+.|...-.+.-.. .....+. +. T Consensus 353 --------~--~p~~~~~~PFv~~~~~p~~~~~---~G~gi~~~~~d~Qr~~N~~~~~~~d~l~~~~~~~~~v~~ga-v~ 418 (763) T protein:vir:95 353 --------K--NPYPDGKLPFVLIPYMPVKRDM---YGEPDAELLGDNQAVLGAVMRGMIDLLGRSANGQRGMPKGM-LD 418 (763) T ss_pred --------c--ccccCCCcCEEEecceeecCcc---cCCchHHHhhHHHHHHHHHHHHHHHHHHhhcCCcEEeeccc-cc Confidence 0 0012234666666666654332 1222222232222222222222221111000 0001111 11 Q ss_pred chhhccchHHHHHhhcccCCceeeecccccccccccccccccccccCccchHHHHHHHHHHHHHHHHH----hCCChhHc Q lcl|Aclame:pro 360 GMEQIRGLEKHWEARNKKRPAFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEV----TGGSQAMQ 435 (708) Q Consensus 360 ~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~----tGv~~~~~ 435 (708) ..+.... .........++..+-.... .. .+ .+.+.-+...+++++...+.+..+ .|++...+ T Consensus 419 ~~d~~~~---~pg~v~~v~~g~~~~~~~~----~~-----~~--p~~~~~~~~~l~~~~~~~e~~TGv~~~~~G~~~~~~ 484 (763) T protein:vir:95 419 ALNSRRY---REGEDYEYNPTQNPAQMII----EH-----KF--PELPQSALTMATLQNQEAESLTGVKAFAGGVTGESY 484 (763) T ss_pred chhhhcc---cCCceEEeeCCCChhhhcc----cc-----cC--CCCcchHHHHHHHHHHHHHHhhCcchhhcCcCcccc Confidence 1111110 0000011111111111110 00 00 122344566666666666655444 47777777 Q ss_pred ccccc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----cCCCcEEEEeccCCCceE-EEecccccc Q lcl|Aclame:pro 436 QMPSN-IAQETVNNLMNRADMASFIYLDNMAKSLKRAGEVWLSMAREV----YGSEREVRIVNEDGSDDI-AVLSAQVVD 509 (708) Q Consensus 436 G~~~n-~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~~~~~l~li~~~----y~~~r~irI~~~~~~~~~-v~in~~~~~ 509 (708) |.+.+ .++. .++.+.+....+..+.+.++...+.+..++..+.-.- .+.+..+.|..++-..+| |.+. T Consensus 485 ~~tat~v~~l-~qa~~~~~~~~~r~~~~~~k~l~~~~l~Li~q~~d~~rviRI~g~e~v~v~~~~~~~~~DV~V~----- 558 (763) T protein:vir:95 485 GDVAAGIRGV-LDAASKREMAILRRLAKGMSEIGNKIIAMNAVFLAEHEVVRITNEEFVTIKREDLKGNFDLEVD----- 558 (763) T ss_pred cchhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEeCCccccccHHHhcCCcceEEe----- Confidence 76543 5665 4445555666677888888888888888888763321 112234445433321111 1111 Q ss_pred cCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHHH-hccccCchhHHHHHHHHhhccchhHHHHHHHHHhhh Q lcl|Aclame:pro 510 RQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLS-SMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQL 588 (708) Q Consensus 510 ~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq-~~~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~ 588 (708) +..... ...+.+++..|.+++. .+++.... .. +..+.+..+.+...+-.+...... T Consensus 559 ------------------~~~as~---~~q~~~~l~~ll~~l~~~~~~~~~~-~i-l~~~~d~~~~~~~~~~lr~~q~~~ 615 (763) T protein:vir:95 559 ------------------ISTAEV---DNQKSQDLGFMLQTIGPNVDQQITL-NI-LAEIADLKRMPKLAHDLRTWQPQP 615 (763) T ss_pred ------------------cccchH---HHHHHHHHHHHHHHhccccChHHHH-HH-HHHHHhhhchhhhHHHHHhcCCCc Confidence 000011 1335566666666553 23332222 11 222333333333222222221111 Q ss_pred hhhhcccCcchHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 589 LISGIAKPRNEKEQQIVQQAQMAAQS-----QPNPEMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTV 663 (708) Q Consensus 589 ~~~~~~~~~~~~~~q~~~~~qq~qq~-----~~~~~~~~aq~~~~~~qae~~k~~~~~~~~q~e~~~~~~~~~~~~a~~~ 663 (708) .+..... .+.+++..+..++..+. +.+.+...++++.++++++++..+.+ +..+++..+++.+++.+.+.+. T Consensus 616 d~~~q~q--aqle~~~~q~e~~~~~akaq~~qaqa~~~~aq~e~~~~d~~~~e~~~Q-~~~e~~~~~~~~eaq~~l~~~~ 692 (763) T protein:vir:95 616 DPVQEQL--KQLAVEKAQLENEELRSKIRLNDAQAQKAMAERDNKNLDYLEQESGTK-HARDLEKMKAQSQGNQQLEITK 692 (763) T ss_pred cchhhhH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHH Confidence 1111000 00011111111111111 11111112222222222222111111 1112222222222221111111 Q ss_pred HHHHHHHHHHHHHHHHHHHH---HHh-----hhh---------hhhhhhhcCCCCCCCCCCC Q lcl|Aclame:pro 664 YKLAQARNIDDKAVMEAIRL---LKD-----VAE---------SQQQQFQSPPQSPADLMPS 708 (708) Q Consensus 664 ~~~~q~~~~~~~~~~~~~~~---~~~-----~~~---------~~~~~~~~~~~~~~e~~~~ 708 (708) ....+.+.......+++.-. +.+ .+. ..+-....-|..-+.=.|| T Consensus 693 a~~~~~~ea~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 754 (763) T protein:vir:95 693 ALTKPRKEGELPPNLSAAIGYNALTNGEDTGIQSVSERDIAAEANPAYSLGSSQFDPTRDPA 754 (763) T ss_pred HHHHHHHHhccChhHHHhhhhcccccccCCCccchhhcccCccccccccCCCCCCCCCCccc Confidence 11111111000001111000 000 000 0011222344555555666 No 140 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=97.88 E-value=1.3e-05 Score=47.35 Aligned_cols=618 Identities=12% Similarity=0.011 Sum_probs=133.9 Q ss_pred HHHHHHH--HHHH-hhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHHHHHhcCccee Q lcl|Aclame:pro 11 RIMLRFD--RAYS-PQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEYRNNRITV 87 (708) Q Consensus 11 ~~~~~~~--~~~~-~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~g~~~~nr~~~ 87 (708) .+++.=. ...+ ..+-...+...++ .|++| ++...-+..+ .-+.|.|. .. T Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~~~~a~--~~~~~-~~~~~~~~~~---~~y~g~~~----------------------~~ 52 (705) T protein:vir:88 1 MAKRRKIKPMDDEQVLRHLDQLVNDAL--DFNSS-ELSKQRSEAL---KYYFGEPF----------------------GN 52 (705) T ss_pred CCcccccccCCHHHHHHHHHHHHHHHH--hhhhh-HHHHHHHHHH---HHHhCCCC----------------------Cc Confidence 2211000 0000 0011111222222 23433 2222211111 11223221 11 Q ss_pred EEecCCCcchHHHHHHHHHHHHHHHHh-------------cChHHH----HHHHHHHHhhcCeeEEEEEeeccccCCCCC Q lcl|Aclame:pro 88 KFRPGDREASEELANKLNGLFRADYEE-------------TDGGEA----CDNAFDDAATGGFGCFRLTSMLVNEYDPMD 150 (708) Q Consensus 88 ~v~pr~~~~d~~~A~~l~~~~~~~~~~-------------~~~~~~----~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~ 150 (708) ....+..-=+..+.+..+.++.++.+. +..+.. ........+......+++..++..+. ... T Consensus 53 ~~~~~s~~~~~~v~~~v~~~~~~l~~~~~~~~~~~~~~p~~~~D~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~da-l~~ 131 (705) T protein:vir:88 53 ERPGKSGIVSRDVQETVDWIMPSLMKVFTSGGQVVKYEPDTAEDVEQAEQETEYVNYLFMRKNEGFKVMFDWFQDT-LMM 131 (705) T ss_pred ccCCCCccccHHHHHHHHHHHHHHHHhhcCCCceEEEeeCChhHHHHHHHHHHHHhHHHhhccchhHHHHHHHHHH-hhc Confidence 111111111123333333333332211 001111 11111111111111111111111000 001 Q ss_pred CCcceeeE---------Eee-----cchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccc Q lcl|Aclame:pro 151 DRQRIAIE---------PIY-----DPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWE 216 (708) Q Consensus 151 ~~~~i~i~---------~v~-----~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~ 216 (708) |.+.+++. ..+ +.+-.++.||.+.-.+-++-.+..+..+++....+....-..-...++....+. T Consensus 132 g~gi~kv~we~~~~~~~e~~~~~~~~~l~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i~~V~p~d~~~dp~a- 210 (705) T protein:vir:88 132 KTGVVKVYVEEVLKPTFERFSGLSEDMVADILSDPDTSILAQSVDDDGTYTIKIRKDKKKREIKVLCVKPENFLVDRLA- 210 (705) T ss_pred CCeEEEeccccccchhhhhhccCChhhhhhhhhhhhhhcccccccccceeeeEEeeeeecCceeeeeccHHHceecCCC- Confidence 11111100 000 001111223332211111111111111111111000000000000000000000 Q ss_pred cCCCCCceeE----------EeeeeeecceEEEEE-EEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEE Q lcl|Aclame:pro 217 YNWFGADVIY----------IAKYYEVRKESVDVI-SYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVY 285 (708) Q Consensus 217 ~~~~~~~~~~----------v~e~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~ 285 (708) .+|.+..-+. ...+|+++....... +.....+ ....+..+............+......+.+..+.+| T Consensus 211 ~~~~d~~~~~~~~~~t~~dl~~~g~~~~~~~~~~~~~~~~~~~-~~e~~~~~~~d~~~~~~~~~~~~~~~~r~v~~~E~y 289 (705) T protein:vir:88 211 TCIDDARFLCHREKYTVSDLRLLGVPEDVIEELPYDEYEFSDS-QPERLVRDNFDMTGQLQYNSGDDAEANREVWASECY 289 (705) T ss_pred CCcccCcEEEEEEeccHHHHHhhcCChhHhhhhhcccccchhh-hhhhccccccccccccccccccccCCceeEEEEEee Confidence 0111110000 011111111000000 0000000 000000000000000000000000001111111111 Q ss_pred EEE-Eecceeee-cCCCCCCCCcceeeEEEeeeccCCccc--ccc-hHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeec Q lcl|Aclame:pro 286 VSV-VDGDGFLE-KPRRIPGEHIPLIPVYGKRWFIDDIER--VEG-HIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVG 360 (708) Q Consensus 286 ~~~-~~~~~il~-~~~~~p~~~~p~~p~~~~~~~~d~~~~--~~G-~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~ 360 (708) ..+ ..|+++.. ....+.+.++..++.++.++|+..... +.+ +-..+.+.-.-+-..++.+...+..+. T Consensus 290 ~~~d~~~d~~~~~~~~~~~g~~il~~~~~~~~PF~~~~~~p~~~~~~G~g~~~~~~d~Q~~~n~~~~~~~d~~------- 362 (705) T protein:vir:88 290 TLLDVDGDGISELRRILYVGDYIISNEPWDCRPFADLNAYRIAHKFHGMSVYDKIRDIQEIRSVLMRNIMDNI------- 362 (705) T ss_pred eEecccCCcceeeEEEEEeCccccccccCCCCCEEEecceeecCccccCChHHHHhHHHHHHHHHHHHHHHHH------- Confidence 111 11222211 011123333333333333333221111 111 113334444444444444433332211 Q ss_pred hhhccchHHHHHhhcccCCceeeecc-c-cccccccccccc----ccccccCccchHHHHHHHHHHHHHHHHHhCCChhH Q lcl|Aclame:pro 361 MEQIRGLEKHWEARNKKRPAFLPLRE-V-RDKSGNIIAGAT----PAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAM 434 (708) Q Consensus 361 ~~ai~~~~~~~~~~~~~~~~~~~~~~-~-~~~~~~~~~~~~----~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~ 434 (708) ...+. +.++..+. + ....-...|+.. ....+.+.+.+.-.-.+.+ .++.+...-... T Consensus 363 -----------~~~~~--~~~~~~~g~v~~~d~~~~~pg~vv~~~~~~~i~~~~~~~~~~~~~~----ll~~~~~~~~~~ 425 (705) T protein:vir:88 363 -----------YRTNQ--GRSVVLDGQVNLEDLLTNEAAGIVRVKSMNSITPLETPQLSGEVYG----MLDRLEADRGKR 425 (705) T ss_pred -----------HhccC--CceeccccccCcccccccCCCeeEEecCCCccccccCCcCcHHHHH----HHHHHHHHHHHh Confidence 11110 11111000 0 000011122221 1122222222222222222 222222222455 Q ss_pred cccccchhH---HHHHHHHHHHHHHH-HHHHH-HHHHHHHHHHH-HHHHHHHHhcCCCcEEEEeccC-CCceEEEecccc Q lcl|Aclame:pro 435 QQMPSNIAQ---ETVNNLMNRADMAS-FIYLD-NMAKSLKRAGE-VWLSMAREVYGSEREVRIVNED-GSDDIAVLSAQV 507 (708) Q Consensus 435 ~G~~~n~sg---~ai~~~q~q~~~~~-~~~~d-n~~~~~~~~~~-~~l~li~~~y~~~r~irI~~~~-~~~~~v~in~~~ 507 (708) .|.+.-..| .+......++..++ ..-.. .+....+.+.+ .+-.++...+ .+.-.- ..++.+.|. T Consensus 426 tGi~~~~~G~~~~~~~~~~Ta~~i~~~~~~~~~r~~~~~r~~a~~~~~~l~~~~~------~li~~~~~~~~~~ri~--- 496 (705) T protein:vir:88 426 TGITDRTRGLDQNTLHSNQAAMSVNQLMTAAEQQIDLIARMFAETGVKRLFQLLH------DHAIKYQNQEEVFQLR--- 496 (705) T ss_pred hCCchHHcCCCcccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHhCCCceEEeec--- Confidence 665433333 33332222222111 11111 11122222221 1111222222 111111 122333332 Q ss_pred cccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHHHhccccCchhHHH--HHHHHhhccchhHHHHHHHHH Q lcl|Aclame:pro 508 VDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLPTDPMRPAI--QGIILDNIDGEGLDDFKEYNR 585 (708) Q Consensus 508 ~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq~~~~~~p~~~~~--~~~~~~~~d~~~~~ei~e~~~ 585 (708) |..+.+ ..-..--..++.+..+..-..+-+.+..+...+. +...+ .+.+..........++...+ T Consensus 497 -----g~~v~v---~~~~~~~~~~v~v~v~~~~~~~eq~~a~l~~ll~----~~q~l~~~~~~~~~~~~~~~~~~~~el- 563 (705) T protein:vir:88 497 -----GKWVAV---NPANWRERSDLTVTVGIGNMNKDQQMLHLMRIWE----MAQAVVGGGGLGVLVSEQNLYNILKEV- 563 (705) T ss_pred -----cchhcc---chHhhccCCceEEeeccccchHHHHHHHHHHHHH----HHHHhhcccchhhhcChHHHHHHHHHH- Confidence 111100 0000000111222222221222222222211110 00000 01111122222222222222 Q ss_pred hhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 586 NQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYK 665 (708) Q Consensus 586 ~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qae~~k~~~~~~~~q~e~~~~~~~~~~~~a~~~~~ 665 (708) ........+..-......+++.+.+++.. ....+++.+.+++|++++++++++++.+.++++.+.+++..+++.... T Consensus 564 --~e~~~~k~~~~~~~~~~~~e~~~~~~~~~-q~e~~~~~~~~~~q~e~~k~q~e~~~~q~e~q~~q~E~q~~q~e~e~~ 640 (705) T protein:vir:88 564 --TENAGYKDPDRFWTNPNSPEALQAKAIRE-QKEAQPKPEDIKAQADAQRAQSDALAKQAEAQMKQVEAQIRLAEIELK 640 (705) T ss_pred --HHhhhhhhHHHHhhhhhhHHHHHHHHhhh-hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 21122222211111111122222222222 122233445556677888888887777777766666655544433322 Q ss_pred HHHHHHHHHHHHHHHHHHHHhh----hhhhhhhh---------hcCCCCCCCCCCC Q lcl|Aclame:pro 666 LAQARNIDDKAVMEAIRLLKDV----AESQQQQF---------QSPPQSPADLMPS 708 (708) Q Consensus 666 ~~q~~~~~~~~~~~~~~~~~~~----~~~~~~~~---------~~~~~~~~e~~~~ 708 (708) ..+....+++...+..++.... .+.+.+.. .+.+...+.++=+ T Consensus 641 ~~~~~~~~~e~~~~~a~~~~~~~~~e~e~~~~e~e~~~e~~q~~~~~~~~~~~~~~ 696 (705) T protein:vir:88 641 KQEAVLQQREMALKEAELQLERDRFTWERARNEAEYHLEATQARAAYIGDGKVPET 696 (705) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHH Confidence 2222222221111111111000 00000000 0011111122111 No 141 >protein:vir:104437 Length: 714 # NCBI annotation: putative phage portal protein # Family: family:all:487 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794061;genbank:gi:116222006;genbank:GeneID:4397502 Probab=97.62 E-value=3.6e-05 Score=44.95 Aligned_cols=618 Identities=11% Similarity=0.041 Sum_probs=192.7 Q ss_pred chHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHHHHHhcC Q lcl|Aclame:pro 4 TLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEYRNN 83 (708) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~g~~~~n 83 (708) -..++-..++. -.. .+.+....+.+...+++=.+=+++-...++...=+.|. ...+.+..++-. +. T Consensus 1 ~~~~~~~~~~~-~~~-----~~~~~~~~~~l~~~~~~~~~~~~~r~~a~~d~~fy~G~------Qw~~~~~~~l~~--~g 66 (714) T protein:vir:10 1 MKNEINTTAMK-NDH-----GSTPRFSQRQLLSLCSDIDSQPLWRDAANKACAYYDGD------QLAPEVIQVLKD--RG 66 (714) T ss_pred CCcCcCcccCC-Ccc-----hhhhhhhHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCC------CCCHHHHHHHHh--cC Confidence 11111000000 000 00000000001100111010012212222222222232 222233333222 22 Q ss_pred cceeEE-------------------ecCCCcchH-HHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeecc Q lcl|Aclame:pro 84 RITVKF-------------------RPGDREASE-ELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLV 143 (708) Q Consensus 84 r~~~~v-------------------~pr~~~~d~-~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~ 143 (708) +|-+.+ .+.-...+. +-.+.+.+++..+....-.......++.++..+++.+= + .| T Consensus 67 ~p~~~~N~i~~~v~~v~g~~~~nr~~~~v~pr~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G-~--G~- 142 (714) T protein:vir:10 67 QPMTIHNLIAPTVDGVLGMEAKTRTDLIVMSDDPNDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAG-L--SW- 142 (714) T ss_pred CCcEEeccHHHHHHHHHHHHHhCCcceEEecCCCChhhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcc-c--ce- Confidence 222221 111111122 22223455555554444444445556667776665421 1 11 Q ss_pred ccCCCCCCCcceeeEEeecchhheecCCcc-------ccCChhccCeEEE---eecCCHHHHHHhCCCCc---c---ccc Q lcl|Aclame:pro 144 NEYDPMDDRQRIAIEPIYDPSRSVWFDPDA-------KKYDKSDALWAFC---MYSLSPEKYEAEYGKKP---P---TSL 207 (708) Q Consensus 144 ~~~d~~~~~~~i~i~~v~~~~~~v~~Dp~a-------~~~D~sDa~~~~~---~~~~~~~e~~~~~p~~~---~---~~~ 207 (708) .++++|++. +..|..+ +++ .+..|.++..-.|=.+- + ..+ T Consensus 143 ---------------------~~~~~d~d~~~~~i~i~~v~p~~---v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~f 198 (714) T protein:vir:10 143 ---------------------VEVRRNSEPFGPEFKVSTVSRNE---VFWDWLSREADLSDCRWLMRRRWMDTDEAKATF 198 (714) T ss_pred ---------------------EEeeeccCCCCCCeEEEecChhh---eeeccccccCChhhhhhhhhhccCCHHHHHHhc Confidence 122333321 1111111 111 11223333322210000 0 000 Q ss_pred c--cccccccccCCCCC---------cee-------------EEeeeeeecceEEEEEEEecCccCceeEec--CCcccc Q lcl|Aclame:pro 208 D--VTSMTSWEYNWFGA---------DVI-------------YIAKYYEVRKESVDVISYRHPITGEIATYD--SDQVED 261 (708) Q Consensus 208 d--~~~~~~~~~~~~~~---------~~~-------------~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~ 261 (708) . .........+|.+. ++. .-..|+...++++++++||.+.-.....+. +++... T Consensus 199 p~~a~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rV~v~E~w~k~~~~~~~~~~~~g~~~~ 278 (714) T protein:vir:10 199 PGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVA 278 (714) T ss_pred CCchhhhhccchhhcCcccchhhhhhcccccccchhhcccccccccccccCcceEEEEEEEEeEEEEEEeecCCCCCeee Confidence 0 00011111122110 010 112355566777777777755433333332 233333 Q ss_pred hHHHhhccchhhhhheee-eeEEEEEEEEecceeeecCCCCCCC--Ccc--eeeEEEeeeccCCcccccchHHhhhHHHH Q lcl|Aclame:pro 262 IEDELAIAGFHEVARRSV-KRRRVYVSVVDGDGFLEKPRRIPGE--HIP--LIPVYGKRWFIDDIERVEGHIAKAMDPQR 336 (708) Q Consensus 262 ~~~~~~~~~~~~~~~~~~-~~~~v~~~~~~~~~il~~~~~~p~~--~~p--~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~ 336 (708) +...-............. .++++.. + ...+..++.-.-.+ .|| ++|++.++.+.+ ..+|....++..=+ T Consensus 279 ~d~~~~~~~~~~~~g~~~~~~~~~~r--v-~~~~~~g~~~L~~~~~p~p~~~fp~vP~~g~~~---~~~g~~~G~vr~~~ 352 (714) T protein:vir:10 279 FDKNNLMQAVAVASGRVQVKVGRVSR--I-REAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRK---DKTGEPYGLISRAI 352 (714) T ss_pred eCccCHHHHHHHHhccceecccceee--E-EEEEEecchhhhcCCCCCCCCceeeEEecceee---eccCccceehhhhh Confidence 322211111222111111 1222222 1 11222222211111 244 367765543322 23555554443222 Q ss_pred HHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeec-----ccccccccc------ccccccccccc Q lcl|Aclame:pro 337 LYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLR-----EVRDKSGNI------IAGATPAGYTQ 405 (708) Q Consensus 337 ~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~------~~~~~~~~~~~ 405 (708) ..-...++..-.+ .++..... .+ ..++++.... .....++-+ .+++.+...++ T Consensus 353 d~Qr~~N~~~s~~------~~~l~~~~--~~--------~~~gav~~~d~~~~e~~~rp~~vi~~~~~~~~~~~~~~~~~ 416 (714) T protein:vir:10 353 PAQDEVNFRRIKL------TWLLQAKR--VI--------MDEDATQLSDNDLMEQLERPDGIIKLNPVRKNQKSVADVFR 416 (714) T ss_pred hHHHHHHHHHHHH------HHHHhCCc--ee--------eccccccccHHHHHHhccCCCCeEEecccccccCCcccccc Confidence 2222222211110 11111100 00 0011110000 000001111 11222233344 Q ss_pred CccchHHHHHHHHHHHHHHHHHhCCChhHccccc---chhHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 406 PAVMNQALAALLQQTSADIQEVTGGSQAMQQMPS---NIAQETVNNLMNR--ADMASFIYLDNMAKSLKRAGEVWLSMAR 480 (708) Q Consensus 406 ~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~---n~sg~ai~~~q~q--~~~~~~~~~dn~~~~~~~~~~~~l~li~ 480 (708) +..-++-....++.+......+- ...|... +..+.+.+...-+ -......+.. +-..+++..+.+-+++. T Consensus 417 ~~~~~~~~~~~~~llq~~~~~i~----~~tGv~~~~lG~~~na~SGvAI~~r~~qg~~~l~~-~~dnl~~~~~~~g~~ll 491 (714) T protein:vir:10 417 VEQDFQVASQQFQVMQESEKLIQ----DTMGVYSAFLGQDSGATSGVAISNLVEQGATTLAE-INDNYQFACQQVGRLLL 491 (714) T ss_pred ccCCCCCcHHHHHHHHHHHHHHH----HhhCCCHHHcCCCcchhHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHH Confidence 44333444455555555555553 4455443 2344444433211 1122222222 23333344444444554 Q ss_pred HhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccce---eeEEEE---Eeecccch-hHHHHHHHHHHHHHHh Q lcl|Aclame:pro 481 EVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSV---GRYDVT---VDVGPSYT-ARRDATVSVLTNVLSS 553 (708) Q Consensus 481 ~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~---g~~Dv~---v~~~~~~~-~~r~~~~~~l~~llq~ 553 (708) .+.- .+.+ .++.+.|-........-..+.+|.... .--||. .++..+.. +.-....+.+..|++. T Consensus 492 ~li~-----~~~~---~~rv~RI~~e~~~~~~~~~~~~n~~~~~~~~~nDi~~~~~dv~i~~~p~~~s~r~~~~~~l~ql 563 (714) T protein:vir:10 492 AYLL-----DDLK---KRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEV 563 (714) T ss_pred HHHH-----HHcC---CCcEEEEeccCCCcccceeEeeccccCCccccccceeeeEEEEEeeccCcHHHHHHHHHHHHHH Confidence 4432 2222 234444422211111123334443211 112443 34443333 4444444455555554 Q ss_pred ccccCchhHHHHHHHHh-hccchhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 554 MLPTDPMRPAIQGIILD-NIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQA 632 (708) Q Consensus 554 ~~~~~p~~~~~~~~~~~-~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qa 632 (708) ++. ..+.....++. .++........+-++........ +.++..++..++.++++ ++++++.++++++.++++ T Consensus 564 ~~~---~~p~~~~~~~~~~le~~d~p~~~ei~~~ir~~~~~--~~~~~~~~~e~q~~q~~--~~~~~~~q~~l~~~e~~a 636 (714) T protein:vir:10 564 IQG---LPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGT--PKSPDEMTPEEQEVAAQ--QQALQQQQAELQMREMAG 636 (714) T ss_pred Hhh---cCchhhhhHHHHHHHhcCCcCHHHHHHHHHHHcCC--CCCccccCcchhHHHHH--HHHHHHHHHHHHHHHHHH Confidence 321 11221111111 11222222222333332222222 22222222222222222 222333445556666777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHH-HHHHhhhhhhhhhh--hcCCCCCCCC Q lcl|Aclame:pro 633 EAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAV--MEAI-RLLKDVAESQQQQF--QSPPQSPADL 705 (708) Q Consensus 633 e~~k~~~~~~~~q~e~~~~~~~~~~~~a~~~~~~~q~~~~~~~~~--~~~~-~~~~~~~~~~~~~~--~~~~~~~~e~ 705 (708) ++++.+++++++++.+.+...+++...+.+..........++++. ++.. .+.+..+..+++.. .++-....+| T Consensus 637 ~~~k~eaea~~~~aqa~~~~~~a~~~~~~~~~q~~~~~~~~a~~a~~l~~~~~~~q~~~~~~q~~~q~~~~~~~~~~~ 714 (714) T protein:vir:10 637 RVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQNMEQEQDVLQQQMLYTLQQRMNEMSL 714 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHhcCC Confidence 777777777766665555444443333322222212222222211 1111 12222222222222 2333334445 No 142 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=96.55 E-value=0.00052 Score=38.58 Aligned_cols=442 Identities=11% Similarity=0.026 Sum_probs=178.7 Q ss_pred CC-cchHHHHHHHHHHHHHHHHhh---HHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHH Q lcl|Aclame:pro 1 MA-ETLEKKHERIMLRFDRAYSPQ---KEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRI 76 (708) Q Consensus 1 ma-~~~~~~~~~~~~~~~~~~~~~---~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i 76 (708) |. +.....+...+..++...+.. ..+|... .-|.. +|+.+.....+.+.+ | ..-+|..+.+++.+ T Consensus 1 m~V~~~hp~y~a~~~~W~~~rd~~~G~~~~r~~g-----~~YLp--k~~~E~~~~Y~~rl~---r-A~~~n~~~~t~~~~ 69 (452) T protein:vir:94 1 MPIETKHPEYLAYENDWIDCRVASLGQREVKKKG-----VRFLP--KLSGQTDDMYNAYKQ---R-ALFYSITSKTLSAL 69 (452) T ss_pred CCCCCcCHHHHHHHHHHHHHHHHhcChHHHHcCC-----cccCC--CCCCCCHHHHHHHHh---h-ccCCchHHHHHHHH Confidence 54 232233444443333333332 2222111 11222 344443333333322 2 44579999999999 Q ss_pred HHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCCccee Q lcl|Aclame:pro 77 IAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIA 156 (708) Q Consensus 77 ~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~~~i~ 156 (708) +|..-...|.+.+ | ..+.+ + ..-.+-++.+.....++..++..|.+++-|.+... ...+- T Consensus 70 ~G~vf~k~p~~~~-p------~~l~~----~-~~D~~G~~L~~~~~~~~~~~l~~G~~~ilVD~p~~--------g~rPy 129 (452) T protein:vir:94 70 SGMVLDQPPVITH-P------DAMSK----Y-FEDQSGIQFYEVFTRAVEETLLMGRVGVFIDRPLT--------GGDPY 129 (452) T ss_pred hchhhcCCceecc-c------HHHHH----H-HhcccCCCHHHHHHHHHHHHHhcCeEEEEEeeccC--------CCceE Confidence 9999888876643 1 12222 2 11245678999999999999999999988855321 11222 Q ss_pred eEEeecchhheecCCccccCChhccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeeeecce Q lcl|Aclame:pro 157 IEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVRKE 236 (708) Q Consensus 157 i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~~~ 236 (708) +..+ ++.+|+ |+.... +.. ..++..+. .. ...+.. +.++.+.+.....|.-... T Consensus 130 ~~~~--~~~~Ii-~W~~~~-~g~-l~~v~lre------~~--------------~~~d~~-d~f~~~~~~~yRvL~l~~g 183 (452) T protein:vir:94 130 ISVY--TTENIL-NWEEDE-DGR-LLMVVLRE------FY--------------TVRDTA-DRYVQNIRVRYRCLELVDG 183 (452) T ss_pred EEEe--chhhhc-Cccccc-cCC-eeEEEEEE------EE--------------EEecCC-CcccceeEEEEEEEEEeCC Confidence 2222 344443 332211 110 01111110 00 000000 0011111111111110000 Q ss_pred EEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEeee Q lcl|Aclame:pro 237 SVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRW 316 (708) Q Consensus 237 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~~~ 316 (708) ... ++.+....+.. +..+...+.+.+..+.+.+|||++...+. T Consensus 184 ~~~-v~~~~~~~~~~------------------------------------~~~~~~~~~~~~~~~l~~IP~v~~~~~~~ 226 (452) T protein:vir:94 184 LLQ-ITVHETQDGKV------------------------------------WELAKTSTIQNVGVTMDYIPFFCITPSGL 226 (452) T ss_pred eEE-EEEEEccCCce------------------------------------eeeccceeecCCCcccceeEEEEEcCCCC Confidence 000 00000000000 00011112223445667788876654432 Q ss_pred ccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeecccccccccccc Q lcl|Aclame:pro 317 FIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKSGNIIA 396 (708) Q Consensus 317 ~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 396 (708) +...+..-.-++-+.+.......|-.-+++..++.+...+. |.++.+ +..+ .+. ..-..| T Consensus 227 ---~~~~~~pPLl~LA~ln~~hy~~~sd~~~~l~~~~~P~l~~~-----g~~~~~-------~i~i--G~~---~~~~lp 286 (452) T protein:vir:94 227 ---SMTPAKPPMIDIVDINYSHYRTSADLEHGRHFTGLPTPWIT-----GAESQS-------TMHI--GST---KAWVIP 286 (452) T ss_pred ---CCCCCccchHHHHHHHHHHhcchhHHHHHHHHcccceeEee-----cCcCCC-------ceEe--ccc---ccccCC Confidence 11222333557777787777778878888888888776663 222111 1111 100 000112 Q ss_pred c-ccccccccCccch-HHHHHHHHHHHHHHHHHhCCChhHcccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 397 G-ATPAGYTQPAVMN-QALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAGEV 474 (708) Q Consensus 397 ~-~~~~~~~~~~~~~-~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~~~~ 474 (708) . ...+.+.++..-+ .....-|....+.|..+ |..-...+..+++||.|......+....+..+..++..+. .. T Consensus 287 e~~~~~~yie~~g~~i~~~~~~l~~le~~m~~~-Ga~ll~~~~~~~~s~ea~~~~~~~~~s~L~~~a~~~e~al----~~ 361 (452) T protein:vir:94 287 EVAAKVGFLEFTGQGLQSLEKALSEKQAQLASL-SARLIDNSTRGSEATETVKLRYMSETASLKSVTRAVEALL----NK 361 (452) T ss_pred CCCCcceEEccCchhHHHHHHHHHHHHHHHHHH-HHHhhccCCCcchHHHHHHHHHHHhhHHHHHHHHHHHHHH----HH Confidence 1 2245565543211 22233344444444333 4432222223345676665444444555666777776665 56 Q ss_pred HHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHHHhc Q lcl|Aclame:pro 475 WLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSM 554 (708) Q Consensus 475 ~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq~~ 554 (708) +|.++..|.... .. --|.+|. +|+. .....+..++|.++++++ T Consensus 362 ~l~~~a~w~g~~---------~~-~~v~~n~-------------------dF~~--------~~~~~~~~~al~~~~~~G 404 (452) T protein:vir:94 362 AYSCIMDMESMG---------GT-LNIKLNS-------------------AFLD--------SKLTAAELKAWVEAYLSG 404 (452) T ss_pred HHHHHHHHcCCC---------Cc-eEEEecc-------------------cccc--------ccCCHHHHHHHHHHHhcC Confidence 667777777532 11 1233332 1111 011123445566665543 Q ss_pred cccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhhh----hcccCcchHHH Q lcl|Aclame:pro 555 LPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLIS----GIAKPRNEKEQ 602 (708) Q Consensus 555 ~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~----~~~~~~~~~~~ 602 (708) .- ....-.....-....+.+. -.+.+....+.+ ....+.+..+. T Consensus 405 ~i-s~~t~~~~L~~~gvl~~~~---e~~~i~~E~~~~~~~~~~~~~~~~~~~ 452 (452) T protein:vir:94 405 GI-SKEIYIHALKVGKVLPPPG---ESMGVIPDPPAPEPSPSNTPPNPSSKA 452 (452) T ss_pred CC-cHHHHHHHHHhCCCCCCcc---CHHHHHHHhhccCcccCCCCCCCccCC Confidence 21 1101000000111122211 111222111111 11111111111 No 143 >protein:vir:95014 Length: 491 # NCBI annotation: structural protein # Family: family:all:584 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224035;genbank:gi:62327322;genbank:GeneID:5176842 Probab=96.07 E-value=0.001 Score=36.91 Aligned_cols=473 Identities=8% Similarity=-0.029 Sum_probs=179.5 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcC-CCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVP-GGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAE 79 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~-G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~g~ 79 (708) .+.++ ..+..++..++...+...--+..+... .|.. ...++.+. ..+.+.+ | ..-+|..+.+++.++|. T Consensus 11 V~~~h-p~y~a~~~~W~~ird~~~G~~~~~~r~---~yl~~~~~~~~e~--~Y~~rl~---r-A~~~n~~~~tl~~l~G~ 80 (491) T protein:vir:95 11 VKTKH-REWLHYAPKWQKVRHALAGDLVGYLRN---VGLNEPDKAYGEA--RQAEYEA---G-GIVYNFTRRTLSGMVGS 80 (491) T ss_pred CCccC-HHHHHHHHHHHHHHHHhcCcchhhccc---CCCcCCCCCCCHH--HHHHHHh---c-ccCCChHHHHHHHHhch Confidence 33222 222333322222222221111111111 1111 12333332 1222222 2 45579999999999999 Q ss_pred HhcCcceeEEecCCCcchHHHHHHHHHHHHHH-HHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCCC---cce Q lcl|Aclame:pro 80 YRNNRITVKFRPGDREASEELANKLNGLFRAD-YEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDR---QRI 155 (708) Q Consensus 80 ~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~-~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~~---~~i 155 (708) .-...|.+.+ | +.|..++..+ .+-++.+.....++..++.+|.+++-|.+... ...+..+ ... T Consensus 81 vfrk~p~~~~-p----------~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~--~~~T~Ade~~~~~ 147 (491) T protein:vir:95 81 VMRKEPEINI-P----------KELEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPET--AAATAAEQNAGLL 147 (491) T ss_pred hhcCCceeec-c----------HHHHHHHhccCCCCCCHHHHHHHHHHHHHHcCeEEEEEecCCC--cccCHHHHHHhcC Confidence 8888777642 1 1245566665 34567999999999999999999988865321 1100000 011 Q ss_pred eeEEeecchhheecCCccccCChh-ccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeeeec Q lcl|Aclame:pro 156 AIEPIYDPSRSVWFDPDAKKYDKS-DALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVR 234 (708) Q Consensus 156 ~i~~v~~~~~~v~~Dp~a~~~D~s-Da~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~ 234 (708) +-..+..++.+| +++.....+.. ...++..+...... .....+....-..+||-+-..-. T Consensus 148 rPy~~~~~~~~I-inW~~~~v~g~~~L~~v~l~E~~~~~------------------d~~~~f~~~~~~qyRvL~l~~~g 208 (491) T protein:vir:95 148 NPTIAFYTTENI-VNWRLTRVGSVNRVTMVVLRETWEYH------------------EPGNEFETKYGEQYRVLDIDTDG 208 (491) T ss_pred CcEEEEechhhh-cCceeeeeCCceeeeEEEEEEeEEee------------------cCCCCcccceEEEEEEEeecCCC Confidence 111111133444 23433222211 11122222110000 00011111111112221110000 Q ss_pred ceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEe Q lcl|Aclame:pro 235 KESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGK 314 (708) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~ 314 (708) +-+++++ +....|... .....+....+..+.+.+|||++... T Consensus 209 ~~~~~v~--r~~~~g~~~------------------------------------~~~~~~~~~~g~~~l~~IPfv~~~~~ 250 (491) T protein:vir:95 209 NYRQRLF--RFDAEGGAQ------------------------------------EEVVEIYPDLGESLRGVIPFTFIGAT 250 (491) T ss_pred ceEEEEE--EEcCCCcce------------------------------------eeeeeeeecCCCcccCeeEEEEEecC Confidence 0000000 000000000 00000111122334566777766543 Q ss_pred eec-cCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeeccccccccc Q lcl|Aclame:pro 315 RWF-IDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKSGN 393 (708) Q Consensus 315 ~~~-~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 393 (708) ... ..++...+ ++-...--.=...|-.-+++..++.+.+.+. |. ++....+.......+..+.- ..+- T Consensus 251 ~~~~~~~~pPLl----~LA~lni~Hy~~ssd~~~~l~~~~~P~l~~~-G~-d~~~~~~~~~~~~~~i~~g~-----~~~~ 319 (491) T protein:vir:95 251 NNDATIDDAPLL----PLAELNIGHYRNSADNEESSFVVGQPTLFIY-PG-DNLTPQSFKEANPNGIKFGS-----RCGH 319 (491) T ss_pred CCCCCCCcCchH----HHHHHHHHHhhhhhHHHHHHHHcccceeeee-cC-cccCcchhhccCcceeEecC-----cCCc Confidence 221 11122122 3333322111122223445555555555442 11 11111222222222222111 1122 Q ss_pred ccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 394 IIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAGE 473 (708) Q Consensus 394 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~~~ 473 (708) ..|....+.++++....- ..+.|......+.. .|.. ....+++.||+++...+.+....|..+..|+..+.. T Consensus 320 ~lP~~~~~~~ie~~~~~~-~~~~l~~~e~qm~~-~Ga~--l~~~~~~~Ta~~~~~~~~~~~S~L~~~a~~~e~al~---- 391 (491) T protein:vir:95 320 NLGYGGSAQLIQAGENNL-ARQNMLDKEQQAIQ-IGAQ--LITPSQQITAESARIQRGADTSVMATIARNVSQAYT---- 391 (491) T ss_pred CCCCCCccceeecCcchH-HHHHHHHHHHHHHH-HHHH--hccCCcchhHHHHHHHHHHhhHHHHHHHHHHHHHHH---- Confidence 222233444554432221 12333333333322 2332 222334568888888887777777777777776654 Q ss_pred HHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHHHh Q lcl|Aclame:pro 474 VWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSS 553 (708) Q Consensus 474 ~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq~ 553 (708) .+|.++..|.... ++..--+.+|. +|++. ....+..++|..++++ T Consensus 392 ~~l~~~a~w~G~~--------~~~~v~i~~n~-------------------dF~~~--------~~~~~~~~all~~~~~ 436 (491) T protein:vir:95 392 DALRWVAMMLGKP--------EDSEVEFQLNM-------------------DFFLQ--------PMTAQDRAAWMADINA 436 (491) T ss_pred HHHHHHHHHcCCC--------CCCceEEEeec-------------------ccccc--------cCCHHHHHHHHHHHhc Confidence 4567777776421 11111222332 12111 1112345566666664 Q ss_pred ccccCchhHHHH-HHHHhhccchhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHH Q lcl|Aclame:pro 554 MLPTDPMRPAIQ-GIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQM 610 (708) Q Consensus 554 ~~~~~p~~~~~~-~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq 610 (708) +.- +...... ..-....+ ...+++.+.+....+.....-+...+..+.+++.+. T Consensus 437 G~i--s~~t~~~~L~~~~vl~-~~~e~~~~~ie~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (491) T protein:vir:95 437 GLL--PATAYYAALRKAGVTD-WTDEDILNAIEDAPLPSGAVTQVAGEIPQAAQQQQE 491 (491) T ss_pred CCC--CHHHHHHHHHhCCCCC-ccHHHHHHHHHhcCCCCCccccccccchhhhhhccC Confidence 321 1111110 11111222 234566666665544333333322222222221111 No 144 >protein:vir:78393 Length: 489 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110831;genbank:gi:134288592;genbank:GeneID:5179656 Probab=96.00 E-value=0.0011 Score=36.71 Aligned_cols=471 Identities=7% Similarity=-0.055 Sum_probs=177.6 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHHHHHHH Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEY 80 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~i~g~~ 80 (708) .+.++- .+..++..++...+...--+..+.... +.--...|+.+.. .+.+. .| ..-+|..+.+++.++|.. T Consensus 11 V~~~hp-~y~a~~~~W~~ird~~~G~~~~~~r~~--yl~~~~~~~~e~~--Y~~rl---~r-A~~~n~~~~tl~~l~G~v 81 (489) T protein:vir:78 11 VKTKHR-EWLHYAPKWQKVRHALAGELVSYLRNV--GLNEPDKAYGEAR--QAEYE---AG-GIVYNFTRRTLSGMVGSV 81 (489) T ss_pred CCccCH-HHHHHHHHHHHHHHHhcCcccccccCC--CCCCCCCCCChHH--HHHHH---hc-cccCChHHHHHHHHhchh Confidence 332221 222222222222222211111011111 0111235544432 12221 12 456799999999999999 Q ss_pred hcCcceeEEecCCCcchHHHHHHHHHHHHHH-HHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCC---Cccee Q lcl|Aclame:pro 81 RNNRITVKFRPGDREASEELANKLNGLFRAD-YEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDD---RQRIA 156 (708) Q Consensus 81 ~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~-~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~---~~~i~ 156 (708) -...|.+.+ -+.|..++..+ .+-++.+.....++..++.+|.+++-|.+... +..+.. ....+ T Consensus 82 frk~p~~~~-----------p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~--~~~T~ade~~~~~r 148 (489) T protein:vir:78 82 MRKEPEINI-----------PKELEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPET--GAATAAEQNAGLLN 148 (489) T ss_pred hcCCcceec-----------cHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeeCCC--CCcCHHHHHHhcCC Confidence 888877642 12245566665 44567999999999999999999988865321 110000 01111 Q ss_pred eEEeecchhheecCCccccCChh-ccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEeeeeeecc Q lcl|Aclame:pro 157 IEPIYDPSRSVWFDPDAKKYDKS-DALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVRK 235 (708) Q Consensus 157 i~~v~~~~~~v~~Dp~a~~~D~s-Da~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e~~~~~~ 235 (708) -..+..++.+| +++.....+.. ...++..+.....+ .....+.+..-..+||-+ ... T Consensus 149 Py~~~~~~~~I-inW~~~~v~G~~~Lt~v~lrE~~~~~------------------d~~~~f~~~~~~q~RvL~---~~~ 206 (489) T protein:vir:78 149 PTIAFYTTENI-VNWRLTRVGSVNRVTMVVLRETWEYN------------------EPGNEFETKYGEQYRVLD---IDS 206 (489) T ss_pred cEEEEechhhh-cCceeeeeCCccceeEEEEEEeEEee------------------cCCCCccceeEEEEEEEe---cCC Confidence 11111133444 24433322211 11222222210000 001111111111122211 000 Q ss_pred eEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEee Q lcl|Aclame:pro 236 ESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKR 315 (708) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~~ 315 (708) -...-++.|.... ++. .......++...+..+.+.+|||++.... T Consensus 207 ~g~~~~~~~r~~~-------~g~----------------------------~~~~~~~~~~~~g~~~l~~IPfv~~~~~~ 251 (489) T protein:vir:78 207 DGNYRQRLFRFDA-------EGG----------------------------AQEDVVEIYPDLGESLRGVIPFTFIGATN 251 (489) T ss_pred CcceEEEEEEeec-------CCc----------------------------ccceeeEEeccCCCCccCeeeEEEEecCC Confidence 0000000000000 000 00000111112233445677777765432 Q ss_pred ec-cCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeecccccccccc Q lcl|Aclame:pro 316 WF-IDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKSGNI 394 (708) Q Consensus 316 ~~-~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 394 (708) .. ..++.. .-++-...--.=...|-.-+++..++.+.+.+. |. ++....+.....+.+.++.. ..+-. T Consensus 252 ~~~~~~~pP----Ll~LA~lni~Hy~~ssd~~~~l~~~~~P~l~i~-G~-d~~~~~~~~~~~~~~i~~g~-----~~~~~ 320 (489) T protein:vir:78 252 NDATIDDAP----LLPLAELNIGHYRNSADNEESSFVVGQPTLFIY-PG-ENLTPQAFKEANPNGIKFGS-----RRGHN 320 (489) T ss_pred CCCCCCcCc----hHHHHHHHHHHhhhhhHHHHHHHHcccceeeee-cC-ccCCcccccccCccceeeCC-----ccccc Confidence 21 111211 223333322111122334455555666555542 21 11112222222222222211 11222 Q ss_pred cccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 395 IAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAGEV 474 (708) Q Consensus 395 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n~sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~~~~ 474 (708) .|....+.++++....-. -+.|......+.. .|.. ....+++.|++++.....+....|..+..|+..+. .. T Consensus 321 lp~~~~~~~ie~~~~~~~-r~~l~~le~qm~~-lGa~--l~~~~~~~Ta~~~~~~~~~~~S~L~~~a~~~e~al----~~ 392 (489) T protein:vir:78 321 LGYGGSAQLIQAGENNLA-RQNMLDKEQQAIQ-IGAQ--LITPTQQITAQSARIQRGADTSVMATIARNVSQAY----TD 392 (489) T ss_pred CCCCCCcceeccCcchHH-HHHHHHHHHHHHH-Hhhh--hccCCcchhHHHHHHHHHHhhHHHHHHHHHHHHHH----HH Confidence 222233344444332222 2233333333322 2332 22233456888888777777666777777766654 55 Q ss_pred HHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHHHhc Q lcl|Aclame:pro 475 WLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSM 554 (708) Q Consensus 475 ~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq~~ 554 (708) +|.++..|...+ ++..--+.+|. +|++ .....+..++|..+++++ T Consensus 393 ~l~~~a~w~G~~--------~~~~~~i~~n~-------------------dF~~--------~~~d~~~~~al~~~~~~G 437 (489) T protein:vir:78 393 ALRWVAVMLGKP--------EDTEVEFRLNM-------------------DFFL--------EPMTAQDRAAWMADINAG 437 (489) T ss_pred HHHHHHHHcCCC--------CCCceEEEeec-------------------ccCc--------ccCCHHHHHHHHHHHhcC Confidence 667777776421 11111222332 1221 111123455666666644 Q ss_pred cccCchhHHHHHHHHhhccchhHHHHHHHHHhhhhhhh--cccCcchHHHHHHH Q lcl|Aclame:pro 555 LPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISG--IAKPRNEKEQQIVQ 606 (708) Q Consensus 555 ~~~~p~~~~~~~~~~~~~d~~~~~ei~e~~~~~~~~~~--~~~~~~~~~~q~~~ 606 (708) .- ....-.....-....+. ..+++.+.+.....+.. ..-+.+++.|+..+ T Consensus 438 ~i-s~~t~~~~L~~~gv~d~-~~e~~~~ei~~~~~~~~~~~~g~~~~~~q~~~~ 489 (489) T protein:vir:78 438 LL-PATAYYAALRKAGVTDW-TDADIKDAVADQPLPVATEVQGEIPQSAQQQEK 489 (489) T ss_pred CC-CHHHHHHHHHhCCCCCc-cHHHHHHHHhhcCCCcccCCcccCCCCcccccC Confidence 21 10100000111112222 34555566654322111 11111112122111 No 145 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=95.77 E-value=0.0015 Score=36.08 Aligned_cols=479 Identities=9% Similarity=-0.021 Sum_probs=185.3 Q ss_pred CCc--chHHHHHHHHHH---HHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchHHHHHH Q lcl|Aclame:pro 1 MAE--TLEKKHERIMLR---FDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNR 75 (708) Q Consensus 1 ma~--~~~~~~~~~~~~---~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~i~~ 75 (708) |.| .....+..+... ++.+......+|.....- .-..++.+++.+.++..+.+.+ | .+-+|..+.+++. T Consensus 1 m~~V~~~hp~y~~~~~~W~~ird~~~G~~~~r~~g~~Y--LP~~~~e~~~~e~~~~Y~~rl~---r-A~~~n~~~~t~~~ 74 (501) T protein:vir:95 1 MPNVSFIRPELGKLLPLYYLIRDAIAGEPTVKGARTTY--LPMPNAEDQSKENKARYEAYLK---R-AVFYNVARRTLFG 74 (501) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHhcChHHHHhccccc--CcCCCCCCCcccchHHHHHHhh---c-cccCchHHHHHHH Confidence 886 222233333333 333333344444322110 0113345677665555554433 2 5668999999999 Q ss_pred HHHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHH-HHhcChHHHHHHHHHHHhhcCeeEEEEEeeccc-cCCCCC--- Q lcl|Aclame:pro 76 IIAEYRNNRITVKFRPGDREASEELANKLNGLFRAD-YEETDGGEACDNAFDDAATGGFGCFRLTSMLVN-EYDPMD--- 150 (708) Q Consensus 76 i~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~-~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~-~~d~~~--- 150 (708) ++|..-...|.+.+ -..|..++..+ .+-++.+.....++..++.+|.+++-|.+.... ++.-+- T Consensus 75 l~G~vf~k~p~~~~-----------p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~t~a~~ 143 (501) T protein:vir:95 75 LVGQVFMRDPVVKV-----------PALLNPLVANATGSGINLTQLAKRAVSLNLAYSRAGLLVDYPTTEAEGGASIADL 143 (501) T ss_pred HhhhhhcCCcceeC-----------cHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccccHHHH Confidence 99998877665531 23355566655 345679999999999999999999888553210 000000 Q ss_pred CCcceeeEEeecchhheecCCccccCChh-ccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeEEee Q lcl|Aclame:pro 151 DRQRIAIEPIYDPSRSVWFDPDAKKYDKS-DALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAK 229 (708) Q Consensus 151 ~~~~i~i~~v~~~~~~v~~Dp~a~~~D~s-Da~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~v~e 229 (708) -...++-..+..++.+| +++.....+.. ...++..+.....+ +..|....-..+||.+ T Consensus 144 ~~~~~rPy~~~~~~~~I-inW~~~~v~g~~~l~~v~l~E~~~~~--------------------d~~f~~~~~~q~RvL~ 202 (501) T protein:vir:95 144 EAGRIRPTLYVYSPTEI-INWRTTDRGAEEVLSLVVLFETWCAA--------------------DDGFEMKTSGQFRVLR 202 (501) T ss_pred HhccCCcEEEEecHhhh-cCcceeccCCceeeeEEEEEEEEeec--------------------CCCcccceeEEEEEEe Confidence 00111111111234444 24433222211 11122222111100 0011111111111111 Q ss_pred eeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCccee Q lcl|Aclame:pro 230 YYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLI 309 (708) Q Consensus 230 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~ 309 (708) .-..-..+++++....+..........+.. .......+.+.+..+.+.+||| T Consensus 203 ~~~~g~~~~~v~r~~~~~~~~~~~~~~~~~----------------------------~~~~~~~~~~~g~~~l~~IPfv 254 (501) T protein:vir:95 203 LDEEGYYVHEIWREPQPTKADGSKIPKGNY----------------------------QQYVVYKPTDAQGKRLTEIPFM 254 (501) T ss_pred eCCCceEEEEEEEecCCcccCcceecCCcc----------------------------cccceeeeeccCCCcCCeeeEE Confidence 000000000000000000000000000000 0000111222233455667777 Q ss_pred eEEEeeec-cCCcccccchHHhhhHHHHHHHHHH--HHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeeecc Q lcl|Aclame:pro 310 PVYGKRWF-IDDIERVEGHIAKAMDPQRLYNLQV--SMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLRE 386 (708) Q Consensus 310 p~~~~~~~-~d~~~~~~G~vr~~~d~Q~~~N~~~--s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~ 386 (708) ++...+.. ..++...+ ++-+.. +..+. |-..+++..++.+...+ .|.+..|.......+..+ T Consensus 255 ~~~~~~~~~~~~~pPLl----~lA~ln--i~hy~~ssd~~~~l~~~~~P~l~i-----~G~~~~~~~~~~~~~i~~---- 319 (501) T protein:vir:95 255 FIGSENNDSNPDNPNFY----DLASLN--MAHYRNSADYEESCYIVGQPTPVL-----IGLTEEWVTNVLKGSVNF---- 319 (501) T ss_pred EEecCCCCCCCCccchH----HHHHHH--HHHHhhhhHHHHHHHHcccceeee-----eCCcccccccCCCCceee---- Confidence 54433221 11122222 233332 22222 22445555555555444 233333332222222111 Q ss_pred cccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 387 VRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMASFIYLDNMAK 466 (708) Q Consensus 387 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n~sg~ai~~~q~q~~~~~~~~~dn~~~ 466 (708) +...+-..|....+.+.++..- +-..+.|....+.|..+ |.. ...+...+.||++......+....|..+..|+.. T Consensus 320 -G~~~~~~lP~~~~~~~ie~~~~-~i~~~~l~~l~~~m~~~-Ga~-ll~~~~~~~Ta~~~~~~~~~~~S~L~~~a~~le~ 395 (501) T protein:vir:95 320 -GSRGGIPLPVGADAKLLQASEN-TMLKEAMDTKERQMVAL-GAK-LVEQKEVQRTATEAELEAASEGSTLSSATKNVSA 395 (501) T ss_pred -cccccccCCCCCceeEEecChh-hHHHHHHHHHHHHHHHH-HHh-hccCCccchhHHHHHHHHHHHhHHHHHHHHHHHH Confidence 1111222333334555554221 11134455555555443 532 2333345678888887777777777778888777 Q ss_pred HHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHH Q lcl|Aclame:pro 467 SLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSV 546 (708) Q Consensus 467 ~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~ 546 (708) +.. .+|.++..|.. .++..-.|.+|+. |+.. ....+..++ T Consensus 396 al~----~~l~~~a~w~g---------~~~~~~~v~i~~d-------------------f~~~--------~~~~~~~~a 435 (501) T protein:vir:95 396 AFE----WALKWAARWVG---------QADSGVKFELNTD-------------------FDIA--------RMTPDERRS 435 (501) T ss_pred HHH----HHHHHHHHHcC---------CCCCceEEEEecc-------------------cccc--------cCCHHHHHH Confidence 654 46666777753 1111112334321 1111 111233455 Q ss_pred HHHHHHhccccCchhHHHHHHHHhhccc--hhHHHHHHHHHhhhhhhhcc-----cCcch-HHHHHHHHH Q lcl|Aclame:pro 547 LTNVLSSMLPTDPMRPAIQGIILDNIDG--EGLDDFKEYNRNQLLISGIA-----KPRNE-KEQQIVQQA 608 (708) Q Consensus 547 l~~llq~~~~~~p~~~~~~~~~~~~~d~--~~~~ei~e~~~~~~~~~~~~-----~~~~~-~~~q~~~~~ 608 (708) |..+++++.- . ...+-..+....+ +..+...+++.......... .+... ...--.-+. T Consensus 436 l~~~~~~G~i-s---~~t~~~~L~~~~v~~~~~~~e~e~i~~~~~~~~~~~~~~~~~~~~~gg~~~~~~~ 501 (501) T protein:vir:95 436 LVEEWQKGAI-T---FEEMRTGLRKAGVATEDDSKAKEKIAKDTAEAMALATPANVPGDGSGGDNVGNSE 501 (501) T ss_pred HHHHHhCCCC-c---HHHHHHHHHhCCCCChhHHHHHHHHHhhhcCcccccccCCCCCCCcccccccCCC Confidence 6666554321 1 1111111111112 23333344443322111100 00000 000000000 No 146 >protein:vir:80165 Length: 651 # NCBI annotation: portal protein # Family: family:all:1548 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285799;genbank:gi:148747833;genbank:GeneID:5220441 Probab=94.53 E-value=0.0042 Score=33.63 Aligned_cols=550 Identities=11% Similarity=0.007 Sum_probs=115.8 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHH------HHHHHHHHHhhcCC------CCCCH-----HHHHHhhhhhhhcCCCc Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVR------EKCIEATRFARVPG------GQWEG-----ATAAGTKLDEQFEKYPK 63 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r------~~~~~d~~~~~~~G------~Qw~~-----~~~~~l~~~~q~~grp~ 63 (708) .|...-.++.++..........|.+.+ .++. |..+.++.+ ..|.+ .++... T Consensus 21 ~~~~~~~~~~~~~~~r~~~~~~w~~~~~~~~~~~~~~-~y~~~~~~~~~~~~~~~~rs~~~~~~v~~~v----------- 88 (651) T protein:vir:80 21 VSSYVKKEYKRFCDARQVCEETWLEAWGMYLSTPEAQ-DYLRDQVLRSVGDVNADWRHKITTGKAFEAI----------- 88 (651) T ss_pred HHHHHHHHHHHHHHHhhhhhhhHHHHHHhhcccHHHH-HhhccccccccCCCCCCCCccccChhHHHHH----------- Confidence 443333444444333222222221111 1222 222222221 11211 111110 Q ss_pred eeecchHHHHHHHHHHHhcCcceeEEecCCCcchHHHH-HHHHH------------------------HHHHHHHhcChH Q lcl|Aclame:pro 64 FEINKVATELNRIIAEYRNNRITVKFRPGDREASEELA-NKLNG------------------------LFRADYEETDGG 118 (708) Q Consensus 64 ~~~N~i~~~i~~i~g~~~~nr~~~~v~pr~~~~d~~~A-~~l~~------------------------~~~~~~~~~~~~ 118 (708) .+++..++....+...--+..+.--........+.+ ..+.. +++..|+..--. T Consensus 89 --e~~~~~l~~~~~~~~~~~~~~p~~~~d~a~~~~~~~~~~~~~~l~~~~~~~~~~~~~~d~l~~G~~i~kv~we~~~~~ 166 (651) T protein:vir:80 89 --ETIHAYLMSATFPNKNWFDVVPAKPGQDNLLVSRLIKRYVQDKLTEGKFRAAYANFLRQLLITGNSVLALPWRVETAE 166 (651) T ss_pred --HHHHHHHHHhhcCCCceeEeccCCchhHHHHHHHHHHHHHHHHhhccCcHHHHHHHHHhhcccCceEEEEeecceeee Confidence 111111111111111101100000000000001111 11111 111111110000 Q ss_pred HHHHHHHHHHhhcCeeEEEEEeecccc--------CCCCCCCcceeeEEeecchhheecCCccccCChhccCeEEEeecC Q lcl|Aclame:pro 119 EACDNAFDDAATGGFGCFRLTSMLVNE--------YDPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSL 190 (708) Q Consensus 119 ~~~~~a~~d~~~~G~G~~~v~~~~~~~--------~d~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDa~~~~~~~~~ 190 (708) ......++..+..|.+.+.+..+.... .+|.++-.+.....+. +..-++. -.-+..|+.+.-.-.+..+. T Consensus 167 ~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~i~~v~p~~~~~dp~a~~~~-d~~~v~~-~~~t~~~l~~l~~~g~~~~~ 244 (651) T protein:vir:80 167 VKKKVQVRTPLFEDEPTFEVVSEEREVKSSPDFEVLDMFDCFYDPNVTDPN-RGAFIRK-LTKTKADILNLLSEGYYYGV 244 (651) T ss_pred eehheeccccccccccceeeeccceeeeceeEEEEecHHHeeecCCCcCcc-ccceeee-eeeeHHHHHHHHhcccccch Confidence 000011112222232222221110000 0111111111000000 0000000 00000011110000111222 Q ss_pred CHHHHHHhCCCCc-cccccccc-cc-ccccCCCCCceeEEeeeeeecceEEE-EEEEecCccCceeEecCCcccchHHHh Q lcl|Aclame:pro 191 SPEKYEAEYGKKP-PTSLDVTS-MT-SWEYNWFGADVIYIAKYYEVRKESVD-VISYRHPITGEIATYDSDQVEDIEDEL 266 (708) Q Consensus 191 ~~~e~~~~~p~~~-~~~~d~~~-~~-~~~~~~~~~~~~~v~e~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 266 (708) ...+.....+... ....+... .. .+...+.....+.|.|+|.+-..... +.+++-...|+.+... . T Consensus 245 ~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~v~v~E~~~~~d~e~~~~~~~~v~~~g~~il~~-~--------- 314 (651) T protein:vir:80 245 DPLDVVEHKCKDTSDTKQDMLSTFQGVTTSLWSPHQNVELLEYWGDIHLENKTYHDVVVTIMGNEVLRF-E--------- 314 (651) T ss_pred hhHHHHhhhccccccCCccccccccCCCccccccccceEEEEEEEEeeccCCceEEEEEEEcCcEEecc-c--------- Confidence 2222222221111 10000000 00 00111222345566666643221111 1111100000000000 0 Q ss_pred hccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHH Q lcl|Aclame:pro 267 AIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLA 346 (708) Q Consensus 267 ~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~ 346 (708) ++.. +...||-.+++.|.-+..+ |.+...-+....+-...+.|...-.+. T Consensus 315 ------------------------~~~~---~~~~Pf~~~~~~~~~~~~y---G~g~~~~~~~~q~~ln~l~~~~ld~~~ 364 (651) T protein:vir:80 315 ------------------------QNPY---WCGRPFVIGTYIPTARQPY---AMGALQPNLGMLHELNIITNQRLDNLE 364 (651) T ss_pred ------------------------ccCC---CCCCCeeeecceecCcccc---CCChHHHHhHHHHHHHHHHHHHHHHHH Confidence 0000 0111222222222222111 111111222222222222222111110 Q ss_pred ---HHHhhcCCCceeechhhccchHHHHHhhcccCCceeeecccccccccccccccccccccCccchHHHHHHHHHHHHH Q lcl|Aclame:pro 347 ---DTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSAD 423 (708) Q Consensus 347 ---~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~ 423 (708) .-........ +.+.+.+.....-.-.. ...+.+.++.+.. + .+.......+++...... T Consensus 365 ~~~~~~~~v~~d~-~~~~~~l~~~pg~vi~~-~~~~~~~~l~~~~------------~----~~~~~~~~l~~l~~~~~~ 426 (651) T protein:vir:80 365 LAIDQMYTLRSDG-LLQPEDVYTEPGKVFLV-SDHGDLQPLANQS------------S----NFSITYQESSFLESTIDK 426 (651) T ss_pred HHhCCcEEecCCc-cccHHHhhcCCCceEEe-cCCCCceeeccCc------------c----cchhHHHHHHHHHHHHHH Confidence 0000000000 11111110000000000 0111122221110 0 011122233333333333 Q ss_pred H----HHHhCCChhHcccccc--hhHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHH-----HHHhcCCC----c Q lcl|Aclame:pro 424 I----QEVTGGSQAMQQMPSN--IAQETVNNLMNRADMASFIYLDNM-AKSLKRAGEVWLSM-----AREVYGSE----R 487 (708) Q Consensus 424 ~----~~~tGv~~~~~G~~~n--~sg~ai~~~q~q~~~~~~~~~dn~-~~~~~~~~~~~l~l-----i~~~y~~~----r 487 (708) + ....|++..+.|..+. .+..+-...+.-+ .....+-+.+ ...++++..++... +.++...+ - T Consensus 427 ~~gv~~~~~g~~~~~~~~~TAteI~~~~~~~~~~l~-~v~~~l~~e~l~pl~~r~l~l~~~~~~~~~~~ri~~~~~~~~~ 505 (651) T protein:vir:80 427 NFGTGNYVGANAARSGERVTAAEVAAVREAGGNRLS-GIHKHIEETSLLVLLEKVMHLVQQFTDQPGMVRVAGDEAGAYE 505 (651) T ss_pred HhcCChHHhCCCccchhhccHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhcCcccceeeccccccccc Confidence 3 2335565555555432 2333322222211 1112222222 23334445554443 22222222 1 Q ss_pred EEEEeccCCCce--EEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHHHHHHHHHHHhccccCchhHH-- Q lcl|Aclame:pro 488 EVRIVNEDGSDD--IAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLPTDPMRPA-- 563 (708) Q Consensus 488 ~irI~~~~~~~~--~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~~~~l~~llq~~~~~~p~~~~-- 563 (708) .+.|+.++.+.. .+.+.. .. ...-..+.+.+..+.+++...++....... T Consensus 506 ~~~i~~~dl~~~~~iv~~g~-------------------------~~-~~~r~~~~~~l~~~~q~~~~~p~~~~~~~~~~ 559 (651) T protein:vir:80 506 YYELDVEDLQKEVRLVPIGS-------------------------DH-VIERKQYIEDRLTFIQAVAQVPEMGQLVDYKR 559 (651) T ss_pred ccccCccceeeeeeeeeccH-------------------------HH-HHHHHHHHHHHHHHHHhhccCCccchhhhHHH Confidence 122222221111 111100 00 011122344455555555555544443221 Q ss_pred HHHHHHhhccchhHHHHHHHHHhhhhhhhcccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 564 IQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQKATNETAQ 643 (708) Q Consensus 564 ~~~~~~~~~d~~~~~ei~e~~~~~~~~~~~~~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~qae~~k~~~~~~~ 643 (708) ....+++.+.++....+.... ....+..+++..+.+ ++....+.+ ...++.+..+.+ ..+.++++.+ T Consensus 560 ~~~~l~~~~g~~~~~~~l~~~-------~q~~~~~~~~~~~~q--~~~~~~~a~--~~~~~~~~~~~~--~~~~~~~~~~ 626 (651) T protein:vir:80 560 ILVDLLQHWGFEEPEAYLKQQ-------DQQAPANPQEALLSQ--AKDVGGQAM--SNMLQNQLQADG--GTQMMSEMYG 626 (651) T ss_pred HHHHHHHHcCCCCcHHhcCCC-------ccchhhhhhHHHHhh--HHHHHHHHH--HHHHHHHHHHHH--HHHHHHHHHH Confidence 222344444444443332110 011111111111000 000000000 001111111110 0111122211 Q ss_pred H----HHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 644 T----QIKAFTAQQDAMESQANTVY 664 (708) Q Consensus 644 ~----q~e~~~~~~~~~~~~a~~~~ 664 (708) . +.+.++..++..+.+++..+ T Consensus 627 ~~~~~~~~~~~~~~~~~l~~~~~~~ 651 (651) T protein:vir:80 627 TPNADQMQQELMATTPNVSEQQLTQ 651 (651) T ss_pred HHHHHHHHHHHHHHHHHHHHhhccC Confidence 1 22222222222222111111 No 147 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=91.00 E-value=0.018 Score=30.15 Aligned_cols=481 Identities=8% Similarity=-0.043 Sum_probs=169.4 Q ss_pred CCcc--hHHHHHHHHHH---HHHHHHhhHHHHHHHHHHHHHhhcC---CCCCCHHHHHHhhhhhhhcCCCceeecchHHH Q lcl|Aclame:pro 1 MAET--LEKKHERIMLR---FDRAYSPQKEVREKCIEATRFARVP---GGQWEGATAAGTKLDEQFEKYPKFEINKVATE 72 (708) Q Consensus 1 ma~~--~~~~~~~~~~~---~~~~~~~~~~~r~~~~~d~~~~~~~---G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~~~ 72 (708) |.+- ....+..+... ++.+......+|.... .|.. +.+-+.+-.+..+.+.+ | ..-+|.++.+ T Consensus 32 m~dV~~~hp~y~a~~~~W~~ird~~~G~~~~r~~g~-----~YLP~~~~~~~~~E~~~~Y~~rl~---r-A~~~n~~~~t 102 (535) T protein:vir:80 32 LPNVGYQRVEFGEMLPKWRKIMDCLSGQEAIKAKRE-----EYLPMPSVDSRDEEQRRRYETYLQ---R-AIFYNVTART 102 (535) T ss_pred CCCCCcCCHHHHHHHHHHHHHHHHhcChHHHHhccc-----ccCCCCCcccCCcCCHHHHHHHHh---h-ccCCChhHHH Confidence 7751 12223333322 2333333333332221 1221 22222222222333322 2 4568999999 Q ss_pred HHHHHHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHH-HHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCCCCC Q lcl|Aclame:pro 73 LNRIIAEYRNNRITVKFRPGDREASEELANKLNGLFRAD-YEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDD 151 (708) Q Consensus 73 i~~i~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~-~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~~~~ 151 (708) ++.++|..-...|.+.+ -..|..++..+ .+-++.+.....++..++.+|.+++-|.+... ...... T Consensus 103 l~~l~G~vfrk~p~~~~-----------p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~iLVD~P~~--~~~~t~ 169 (535) T protein:vir:80 103 LDGMMGQVFSRDPIRQL-----------PPALEAIVEDIDGEGVSLDQQAKKALGYTMGFGRAAIFTDYPNV--GRPVTV 169 (535) T ss_pred HHHHhchhhcCCcceec-----------cHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCC--CCcccH Confidence 99999987766554432 13356666655 34567999999999999999999988865321 111000 Q ss_pred ----CcceeeEEeecchhheecCCccccCChh-ccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCCCceeE Q lcl|Aclame:pro 152 ----RQRIAIEPIYDPSRSVWFDPDAKKYDKS-DALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIY 226 (708) Q Consensus 152 ----~~~i~i~~v~~~~~~v~~Dp~a~~~D~s-Da~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~ 226 (708) ....+-..+..+..+| +++.....+.. ...++..+.-...+ +..|....-..+| T Consensus 170 ade~~~~~rPy~~~y~ae~I-inW~~~~v~G~~~Lt~v~lrE~~~~~--------------------dd~f~~~~~~q~R 228 (535) T protein:vir:80 170 LEQKLGLYRPTITLVHPTSI-INWRTKLVGGKSVISLVVIQENVLAQ--------------------DDGFETTYVQQWR 228 (535) T ss_pred HHHHhcCCCcEEEEechhhc-cCccccccCCccceeEEEEEEEEEec--------------------CCCcccceeEEEE Confidence 0011111112234444 24433322211 12222222211000 0111111111112 Q ss_pred EeeeeeecceEEEEEEEe-c-CccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCC Q lcl|Aclame:pro 227 IAKYYEVRKESVDVISYR-H-PITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGE 304 (708) Q Consensus 227 v~e~~~~~~~~~~~~~~~-~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~ 304 (708) |.+ ........++.| . ...+.... .+..+..+.+..+.+ T Consensus 229 vL~---~~~~G~y~v~~~~~~~~~~~~~~------------------------------------~~~~~~~~~g~~~l~ 269 (535) T protein:vir:80 229 VLQ---LNAEGNYQVERWRRETQEEMYYS------------------------------------YSKHVPTDGNGNPFK 269 (535) T ss_pred EEE---ecCCceEEEEEEEeecCCccccc------------------------------------cceeecccCCCcccC Confidence 211 000000000101 0 00000000 000011122334456 Q ss_pred CcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCceeee Q lcl|Aclame:pro 305 HIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPL 384 (708) Q Consensus 305 ~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~~~~~ 384 (708) .+|||++...+. +...+..-...+...+..+=...|-..+++..++.+...+ .|.+..|........++... T Consensus 270 ~IPfv~~~~~~~---~~~~~~pPLl~LA~lni~Hy~~ssd~~~il~~~~~P~l~i-----~G~~~~~~~~~~~~~~i~iG 341 (535) T protein:vir:80 270 EIPFQFIGPLDN---NADIDHPPLLDLCEVNIGHYRNSADYEEMAFVAGQPTAFF-----TGLTKDWVEDVFKDFKVHLG 341 (535) T ss_pred eeEEEEeecCCC---CCCCCccchHHHHHHHHHHhhchhHHHHHHHHhcCceeee-----ecCchhhhhcCCCCcceEec Confidence 666665432221 1111111234555555444334444556666666665554 34444443333332222211 Q ss_pred ccccccccccccccc--ccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccchhHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 385 REVRDKSGNIIAGAT--PAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMASFIYLD 462 (708) Q Consensus 385 ~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n~sg~ai~~~q~q~~~~~~~~~d 462 (708) .. ..-..|... .+-.+.++.++.. .+....+.|..+ |......+ ..+.+++++.....+....|..+.. T Consensus 342 ~~----~~~~lP~~~~~~~~e~~~~~~a~~---~l~~~e~qM~~l-Ga~ll~~~-~~~~Ta~~a~~~~~~~~S~L~~~a~ 412 (535) T protein:vir:80 342 SR----AIIPLPQGATAGILQITPNSVPFE---AMTHKESQMIAM-GANLLVKS-GGNRTFGEAQQEEASEQSILSACTK 412 (535) T ss_pred Cc----ccccCCCCCCcceeeeccchhHHH---HHHHHHHHHHHH-HHHhhccC-cccccHHHHHHHHHHHhHHHHHHHH Confidence 10 011112111 2233444444432 344444444443 43322222 2344444444333333444566666 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHH Q lcl|Aclame:pro 463 NMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDA 542 (708) Q Consensus 463 n~~~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~ 542 (708) |++.+. +.+|.++..|... . -++..--|.+|. +|. ......+ T Consensus 413 ~le~al----~~aL~~~A~w~G~-----~--~~~~~~~i~~n~-------------------dF~--------~~~ld~~ 454 (535) T protein:vir:80 413 NVSMAF----RKALRWANQFQTG-----I--VNDETVEYNLNT-------------------DFP--------AARLTPN 454 (535) T ss_pred HHHHHH----HHHHHHHHHHcCC-----c--cCCCceEEEecc-------------------ccc--------cccCCHH Confidence 666654 5566677777531 0 011111122321 111 1111223 Q ss_pred HHHHHHHHHHhccccCchhHHHHHHHHhhccch-hHHHHHHHHHhhhh----hhhcccCcchHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 543 TVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGE-GLDDFKEYNRNQLL----ISGIAKPRNEKEQQIVQQAQMAAQSQPN 617 (708) Q Consensus 543 ~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~~-~~~ei~e~~~~~~~----~~~~~~~~~~~~~q~~~~~qq~qq~~~~ 617 (708) ..++|..+++++. +.-..-.....-...++.. ..++...++..... ..+.........+.....- T Consensus 455 ~~~all~~~~~G~-Is~et~~~~L~r~gvl~~~~~~eee~~ri~~E~~~~~~~~g~~~d~~~~g~~~~~~~--------- 524 (535) T protein:vir:80 455 ERAELILEWQQGA-ITFKEMRAGLRRAGVASEDDAKAETEGKATVEFIAKTAAAGKVGDAASGGTNKAKLN--------- 524 (535) T ss_pred HHHHHHHHHhcCC-CCHHHHHHHHHhCCCCCcccchHHHHHHHHhhhhhccccCCCCCCCCCCCCCcCccc--------- Confidence 4455666655432 1110000000000111100 11222222221100 0000000000000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 618 PEMVLAQAQMVAAQAEAQKATN 639 (708) Q Consensus 618 ~~~~~aq~~~~~~qae~~k~~~ 639 (708) . .+..+-++-. T Consensus 525 --~---------~~~~~~~~~~ 535 (535) T protein:vir:80 525 --N---------GNGGGNQAGN 535 (535) T ss_pred --C---------CccccccCCC Confidence 0 0000000000 No 148 >protein:vir:96783 Length: 488 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224240;genbank:gi:62362375;genbank:GeneID:3345722 Probab=87.90 E-value=0.036 Score=28.52 Aligned_cols=451 Identities=8% Similarity=-0.016 Sum_probs=165.9 Q ss_pred CC-cchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCCCCCHHHH--------HHhhhhh--hhc---CCCceee Q lcl|Aclame:pro 1 MA-ETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATA--------AGTKLDE--QFE---KYPKFEI 66 (708) Q Consensus 1 ma-~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~--------~~l~~~~--q~~---grp~~~~ 66 (708) |. +.....+..+...++...++-.+..+.+ .-.|.. +|+..+. +..+.+. .++ .+=.+-+ T Consensus 14 m~V~~~hp~y~a~~~~W~~~~d~g~~~~k~~----g~~YLP--k~~~~~~~~~~d~~y~~~~~~~~~~y~~~~~~rA~~~ 87 (488) T protein:vir:96 14 MLTPIYHPDYLVNAPQWLRNLDCVMDNIKRK----KQTYLP--NLGAIPPEAKTDPKVTALAAKIEKDWEDLTWRLANYV 87 (488) T ss_pred ecccccCHHHHHHhhhhhHhhhhhhHHHHHh----hhhcCC--CCCCccccccCcchhhhhhccchhhhHhhhhhccccC Confidence 44 3333333333333333333222222211 112332 2321100 0000000 000 0114457 Q ss_pred cchHHHHHHHHHHHhcCcceeEEecCCCcchHHHHHHHHHHHHHH-HHhcChHHHHHHHHHHHhhcCeeEEEEEeecccc Q lcl|Aclame:pro 67 NKVATELNRIIAEYRNNRITVKFRPGDREASEELANKLNGLFRAD-YEETDGGEACDNAFDDAATGGFGCFRLTSMLVNE 145 (708) Q Consensus 67 N~i~~~i~~i~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~~-~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~ 145 (708) |..+.+++.++|..-...|.+.. | +.. .|..++..+ .+-++.+.-...++..++.+|.+++-|.+... T Consensus 88 n~~~~tl~~l~G~vfrk~p~~~~-~----~~~----~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~-- 156 (488) T protein:vir:96 88 NIVNPTMNAITGAVMRREPEFDT-M----DNP----VLIGLRDNIDGKGNGIDQECKQALNALQWGSRCGWLVRSHPE-- 156 (488) T ss_pred chhHHHHHHhcchhhccCceecc-C----CcH----HHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCC-- Confidence 99999999999998877777652 1 111 245566665 44567999999999999999999988865311 Q ss_pred CCCCCCC---cceeeEEeecchhheecCCccccCChh-ccCeEEEeecCCHHHHHHhCCCCcccccccccccccccCCCC Q lcl|Aclame:pro 146 YDPMDDR---QRIAIEPIYDPSRSVWFDPDAKKYDKS-DALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFG 221 (708) Q Consensus 146 ~d~~~~~---~~i~i~~v~~~~~~v~~Dp~a~~~D~s-Da~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~~~~ 221 (708) . .+-.+ ...+-..+..++.+| +++.....+.. ...++..+.-++. .+..++.+ T Consensus 157 ~-~T~ade~~~~~rPy~~~~~a~~I-inW~~~~v~G~~~L~~v~lrE~~~~---------------------~D~~~~~~ 213 (488) T protein:vir:96 157 S-ATMADWNKGKKLPTAAFYDALHI-IDWEVEYIDGEEKLTYLSLLEDYQE---------------------RDGGTYVS 213 (488) T ss_pred c-CCHHHHHHhcCCcEEEEechhhh-cCcceeccCCceeeEEEEEEEEEEe---------------------ccCCCccc Confidence 0 00000 111111112234444 34433332211 1122222221100 00001111 Q ss_pred CceeEEeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecCCCC Q lcl|Aclame:pro 222 ADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRI 301 (708) Q Consensus 222 ~~~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~ 301 (708) ...+++..+ ...... ...+..+. .++..+....+.. T Consensus 214 ~~~~~~~~l---~~g~~~-----------v~~~~~~~------------------------------~~~e~~~~~~g~~ 249 (488) T protein:vir:96 214 KQRLINHRL---VDGLCE-----------FQEVTDDE------------------------------YSDEWTPVLINSK 249 (488) T ss_pred ceEEEEEEE---ECcEEE-----------EEEEecCC------------------------------cccceEeecCCCc Confidence 111111110 000000 00110000 0011112222333 Q ss_pred CCCCcceeeEEEeeec-cCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccCCc Q lcl|Aclame:pro 302 PGEHIPLIPVYGKRWF-IDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPA 380 (708) Q Consensus 302 p~~~~p~~p~~~~~~~-~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~~~ 380 (708) +.+.+|||++...+.. ..++.. .-++-..+.-.=...|-..+++-.+.-+.++... .+....+...... .+ T Consensus 250 ~l~~IP~v~~~~~~~~~~~~~pP----LldLA~lnl~Hy~~ssd~~~il~~~~~p~lv~~~---~~~~~~~~~~~~~-~g 321 (488) T protein:vir:96 250 QSDTIPFFLASSQSNEWCIDSTP----LTSLAEISLSIYVMNAYSNKAMILANEAKWMVDM---GDMNKTMASEMNP-LG 321 (488) T ss_pred ccCeeEEEEEecCCCCCCCCCCc----hHHHHHHHHHHHhhhhHHHHHHHhcCCceeeecc---CCCCccccccccc-ce Confidence 5566777765433221 111111 2233333322222223334555555555554421 1111111111111 11 Q ss_pred eeeecccccccccccccccccccccCccchHHHHHHHHHHHHHHHHHhCCChhHcccccchhHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 381 FLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMASFIY 460 (708) Q Consensus 381 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n~sg~ai~~~q~q~~~~~~~~ 460 (708) +.... ......+.+ .+.+.+. ..++-..+.|....+.|.. .|..-. ..+++.||+++...+.+....+..+ T Consensus 322 ~~~~~----~~~~~~~~g-~~~~~e~-~~~~l~~~~l~~l~~qm~~-~Ga~l~--~~~~~~Ta~~~~~~~~~~~S~L~~~ 392 (488) T protein:vir:96 322 FTLAG----RMPYYVKNG-DVKVIQA-QFSPETENKVEKLFEQAVK-VGASLF--TQQSNETATGAAIRSGSSTASMATL 392 (488) T ss_pred eeecc----cccccccCC-ceeecCC-chhHHHHHHHHHHHHHHHH-HhHhhc--cCCCcchHHHHHHHHHHhhHHHHHH Confidence 11100 000001111 1222221 1111113334444444433 343222 2234458888887777777777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHH Q lcl|Aclame:pro 461 LDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARR 540 (708) Q Consensus 461 ~dn~~~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r 540 (708) ..|+..+. +.+|.++.+|....-- +.+...-.|.+|.. | ...... T Consensus 393 a~~le~al----~~~l~~~A~w~g~~~~----~~~~~~~~~~in~d-------------------F--------~~~~ld 437 (488) T protein:vir:96 393 GNNVEDTV----RNMLRFIMRYFEGTNL----YVNPDELVFKLNRD-------------------Y--------FDVEVN 437 (488) T ss_pred HHHHHHHH----HHHHHHHHHHcCCCCC----CcCccceEEEeccC-------------------C--------CCccCC Confidence 77777665 4556667777642100 00000011222210 1 111112 Q ss_pred HHHHHHHHHHHHhccccCchhHHHHHHHHhhccc--h--hHHHHHHHHHhhhhhh Q lcl|Aclame:pro 541 DATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDG--E--GLDDFKEYNRNQLLIS 591 (708) Q Consensus 541 ~~~~~~l~~llq~~~~~~p~~~~~~~~~~~~~d~--~--~~~ei~e~~~~~~~~~ 591 (708) .+..++|..+++++.-. ...+...+..... | ..+++.+++....... T Consensus 438 ~~~~~al~~~~~~G~Is----~~t~~~~L~~~gvl~~d~~~e~~~~~ie~~g~~~ 488 (488) T protein:vir:96 438 PQMLQVAYAAMMEGNLP----QVSWFELLKRARVVRGDMSKEEFDEHIAELGFGM 488 (488) T ss_pred HHHHHHHHHHHhcCCCC----HHHHHHHHHhCCcCCccCCHHHHHHHHhhcCCCC Confidence 23455666666543211 0111111111100 1 1233444443221111 No 149 >protein:vir:97265 Length: 513 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:584 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294521;genbank:gi:149408242;genbank:GeneID:5237130 Probab=86.71 E-value=0.044 Score=28.04 Aligned_cols=471 Identities=10% Similarity=0.037 Sum_probs=181.4 Q ss_pred CCcch-HH------HHHHHHHHHHH---HHHhhHHHHHHHHHHHHHhhcCCCCCCHHHHHHhhhhhhhcCCCceeecchH Q lcl|Aclame:pro 1 MAETL-EK------KHERIMLRFDR---AYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVA 70 (708) Q Consensus 1 ma~~~-~~------~~~~~~~~~~~---~~~~~~~~r~~~~~d~~~~~~~G~Qw~~~~~~~l~~~~q~~grp~~~~N~i~ 70 (708) |+|+. +. .+..++..++. +.......|... . .|.. +|+.+.....+.+.+ | ..-+|..+ T Consensus 1 m~~~~~~~v~~~h~~y~a~~~~W~~ird~~~G~~~~r~~g---~--~YLP--k~~~E~~~~Y~~rl~---r-A~~~n~~~ 69 (513) T protein:vir:97 1 MADKDPKSPATTSGAYDQMLPRWHVIETLLGGTEAMREAG---E--TYLP--RHQEETDKGYQERLA---S-AVLLNMVE 69 (513) T ss_pred CCCCCCCCCCcCCHHHHHHHHHHHHHHHHhcChHHHHhhc---c--cCCC--CCCCCCHHHHHHHHh---c-ccCCChHH Confidence 98863 32 12222222222 222222222111 1 1222 455554444444432 2 45689999 Q ss_pred HHHHHHHHHHhcCcceeEEecCCCcchHHHHHHHH-HHHHHH-HHhcChHHHHHHHHHHHhhcCeeEEEEEeeccccCCC Q lcl|Aclame:pro 71 TELNRIIAEYRNNRITVKFRPGDREASEELANKLN-GLFRAD-YEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDP 148 (708) Q Consensus 71 ~~i~~i~g~~~~nr~~~~v~pr~~~~d~~~A~~l~-~~~~~~-~~~~~~~~~~~~a~~d~~~~G~G~~~v~~~~~~~~d~ 148 (708) .+++.++|..-...|.+. . ++...+. .++..+ .+-++.+.....+|..++.+|.+++-|.+.... .+ T Consensus 70 ~tl~~l~G~vf~k~p~~~--~-------~~p~~~~~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~--~~ 138 (513) T protein:vir:97 70 QTLDTLSGKPFSEPIKLN--E-------DVPKAIEETILPDVDLQGNNLDVFARQWFREGMAKALCHVLIDMPRPA--PR 138 (513) T ss_pred HHHHHHhhhhhhcCcccC--c-------CchHHHHHHHhhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCCC--Cc Confidence 999999998877544321 1 1222233 344444 345679999999999999999999888653221 11 Q ss_pred CCC---------CcceeeEEeecchhheecCCccccCChh-ccCeEEEeecCCHHHHHHhCCCCcccccccccccccccC Q lcl|Aclame:pro 149 MDD---------RQRIAIEPIYDPSRSVWFDPDAKKYDKS-DALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYN 218 (708) Q Consensus 149 ~~~---------~~~i~i~~v~~~~~~v~~Dp~a~~~D~s-Da~~~~~~~~~~~~e~~~~~p~~~~~~~d~~~~~~~~~~ 218 (708) ..+ ...++-..+..++.+| +++.....+.. ...++..+.-.. ..+. T Consensus 139 ~~~~~~T~Ade~~~~~rPy~~~~~~e~I-inW~~~~v~G~~~L~~v~l~E~~~---------------------~~Dg-- 194 (513) T protein:vir:97 139 EDGQPRTLADDRREGLRPYWVMIKPECL-LFARSEVINGVEVLQHVRIIEHYM---------------------EQDG-- 194 (513) T ss_pred cchhHHhHHHHHhhccCceEEEecHhhh-cCcceeccCcceeeeeEEEEEEEe---------------------ecCC-- Confidence 000 0111111111133444 24433322211 111111111000 0000 Q ss_pred CCCCceeEEeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHhhccchhhhhheeeeeEEEEEEEEecceeeecC Q lcl|Aclame:pro 219 WFGADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKP 298 (708) Q Consensus 219 ~~~~~~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~ 298 (708) ++.+.+.....|+ + |...++..... +. ...+...+... T Consensus 195 -f~~~~~~q~rvL~-------------~--g~~~v~r~~~~----------~~----------------~~~~e~~~~~~ 232 (513) T protein:vir:97 195 -FAEVCKRRIRVLE-------------P--GLVQLWEPVKK----------SN----------------AQKEEWALADE 232 (513) T ss_pred -CcceEEEEEEEEe-------------C--ceEEEEEeecC----------CC----------------ccccceEEecC Confidence 1111110000010 0 11111100000 00 00000112222 Q ss_pred CCCCCCCcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHHHHHhhcCCCceeechhhccchHHHHHhhcccC Q lcl|Aclame:pro 299 RRIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKR 378 (708) Q Consensus 299 ~~~p~~~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~l~~~~~~~~i~~~~ai~~~~~~~~~~~~~~ 378 (708) ...+.+.+||||++..+.- ...+..-.-.+-....-.=...|-..+++..+..+.+.+. |....|. + T Consensus 233 g~~~l~~IP~v~~~~~~~~---~~~~~pPLl~LA~ln~~hy~~~Sd~~~il~~~~~P~l~~~-----G~~~~~~-----~ 299 (513) T protein:vir:97 233 WATGLNYVPLVTFYADRQG---FMMGKPPLLDLAHLNVAHWQSASDQRHILTVSRFPILACS-----GASGEDS-----D 299 (513) T ss_pred CCCcCCceeEEEEecCCCC---CCCCccchHHHHHHHHHHHhhhhhHHHHHHhcccceeeee-----cCCcCCC-----C Confidence 3345577888877654321 1112222334444554443445555667777776666663 2221111 1 Q ss_pred Cceeeeccccccccccccc-ccccccccCccch-HHHHHHHHHHHHHHHHHhCCChhHcccccchhHHHHHHHHHHHHHH Q lcl|Aclame:pro 379 PAFLPLREVRDKSGNIIAG-ATPAGYTQPAVMN-QALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMA 456 (708) Q Consensus 379 ~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~-~~~~~l~~~~~~~~~~~tGv~~~~~G~~~n~sg~ai~~~q~q~~~~ 456 (708) +..+-.+. .-..|. ...+.+.++..-+ .....-+....+.| ...|.. .....+.+.||+++...+.+.... T Consensus 300 ~i~iG~~~-----~~~lpe~~~~~~yie~~g~~i~~~~~~l~~le~qm-~~~Ga~-ll~~~~~~~Ta~a~~~~~~~~~S~ 372 (513) T protein:vir:97 300 PVVVGPNK-----VLYNPDPAGRFYYVEHTGQAIAAGRTDLKDLEEQM-AGYGAE-FLKRKTGGQTATARALDSAEATSD 372 (513) T ss_pred ceEeeccc-----cccCCCCCCcceeeccCchhHHHHHHHHHHHHHHH-HHHHHH-hhccCCccccHHHHHHHHHHHHHH Confidence 11111111 011221 2245665554211 22233444445555 344543 233344567999998888877777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccc Q lcl|Aclame:pro 457 SFIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSY 536 (708) Q Consensus 457 ~~~~~dn~~~~~~~~~~~~l~li~~~y~~~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~ 536 (708) |..+..|+..+.. .+|.++..|... +...-.|.||+ +|++. T Consensus 373 L~~~a~~le~al~----~~l~~~a~wlg~---------~~~~~~v~in~-------------------dF~~~------- 413 (513) T protein:vir:97 373 LSAMTGLFEDALA----QALDITADWLRL---------GPNGGTVELVK-------------------DYDLE------- 413 (513) T ss_pred HHHHHHHHHHHHH----HHHHHHHHHhCC---------CCCccEEEecc-------------------ccCcc------- Confidence 8888777776655 455566666541 11111244442 12211 Q ss_pred hhHHHHHHHHHHHHHHhccccCchhHHHHHH---HH-hhcc-chhHHHHHHHHHhhhhhhhc-----ccCcch--H---- Q lcl|Aclame:pro 537 TARRDATVSVLTNVLSSMLPTDPMRPAIQGI---IL-DNID-GEGLDDFKEYNRNQLLISGI-----AKPRNE--K---- 600 (708) Q Consensus 537 ~~~r~~~~~~l~~llq~~~~~~p~~~~~~~~---~~-~~~d-~~~~~ei~e~~~~~~~~~~~-----~~~~~~--~---- 600 (708) ....+..++|.++++.+.-. ...-.-... ++ ...+ -...+++++++......... .+.... + T Consensus 414 -~~~~~~~~al~~a~~~G~is-~~t~~~~L~r~gvl~~d~d~~~~~e~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~ 491 (513) T protein:vir:97 414 -EMDAPGLQALQVAREKRDIS-RKTYLNGLRLRGVLPEDFDEDEDWEELMEEISEAMGRAGLDLDPAQKNPPEGGEGEGE 491 (513) T ss_pred -cCCHHHHHHHHHHHhCCCCC-HHHHHHHHHhccCCCccCCHHHHHHHHHHhhhhccCCCCccccccCCCCCCCCCCCCC Confidence 11123445566665543211 000000000 11 1011 11134455554433211000 000000 0 Q ss_pred HHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 601 EQQIVQQAQMAAQ--SQPNPEMVLAQAQMVAAQA 632 (708) Q Consensus 601 ~~q~~~~~qq~qq--~~~~~~~~~aq~~~~~~qa 632 (708) -...--+-+..-. .-+.- +. T Consensus 492 ~~~~~~~~~~~~~~~~~~~~------------~~ 513 (513) T protein:vir:97 492 GEGEGGEGGEGGEGGGNPGG------------ES 513 (513) T ss_pred CCCCCCCCCCccccCCCCCC------------CC Confidence 0000000000000 00000 00 No 150 >protein:vir:345 Length: 663 # NCBI annotation: virion structural protein # Family: family:all:3199 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203459;genbank:gi:15320615;genbank:GeneID:921720 Probab=31.04 E-value=1.5 Score=19.57 Aligned_cols=559 Identities=11% Similarity=0.016 Sum_probs=146.8 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcCCC------CCCHHHHHHhhhhhhhcCCCcee--------- Q lcl|Aclame:pro 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGG------QWEGATAAGTKLDEQFEKYPKFE--------- 65 (708) Q Consensus 1 ma~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~~~~~~G~------Qw~~~~~~~l~~~~q~~grp~~~--------- 65 (708) .|-+=-+.+..+.+.|..+.......++... |.+. --++. =|+ ....+.-- -+ .|+|.. T Consensus 16 la~~W~~~I~~a~~~~~~~h~r~~~~~k~y~-~~~~-~~~~~~~r~nl~~s--ni~~i~P~-iY-ar~P~p~V~~rf~d~ 89 (663) T protein:vir:34 16 WAQRWQEEMSAAREPLEKWHTQGKEIVKRYR-DERD-SAHDAETRWNLFST--NIQTQMAS-LY-GQTPKVSVSRRFADA 89 (663) T ss_pred HHHHHHHHHHHHHhccchHHHHHHHHHHHhh-cccc-CCCccccccchhhh--hHHHHhhh-hh-cCCCcceeeecccCc Confidence 5544334444444444444444444444432 2211 11111 111 11111100 01 111110 Q ss_pred ----ecchHHHHHHHHHHHhcCcceeEEecCCCcchHHHHHHHHHHHHH----------HHHhcChHHHHH------HHH Q lcl|Aclame:pro 66 ----INKVATELNRIIAEYRNNRITVKFRPGDREASEELANKLNGLFRA----------DYEETDGGEACD------NAF 125 (708) Q Consensus 66 ----~N~i~~~i~~i~g~~~~nr~~~~v~pr~~~~d~~~A~~l~~~~~~----------~~~~~~~~~~~~------~a~ 125 (708) -+.+.-+++..++.... .+|..+=.+|...++. +.++-+++.... .+- T Consensus 90 d~~~~r~ase~leR~~~~~~~------------~D~~~l~~~~~~~v~d~ll~~rG~~~v~Ye~~~~~~~~~~~~~D~~~ 157 (663) T protein:vir:34 90 DDDVARVASELLERLLNTDIE------------KDSDTFQQALEYALQDRLLPGFGLCRIRYEVEWEEVAGVDAILDEAT 157 (663) T ss_pred ccchhhhHHHHHHHHHHHHHH------------hhHHHHHHHHHHHHHhhhccccceEEEEeecccchhccccccCCCcc Confidence 11112222222221110 0122232333333222 111111110000 000 Q ss_pred HHH---------hhcCeeEEEEEeeccccCCCCCCCcceeeEEeecchhheecCCcccc--CChhccCeEEEeecCCHHH Q lcl|Aclame:pro 126 DDA---------ATGGFGCFRLTSMLVNEYDPMDDRQRIAIEPIYDPSRSVWFDPDAKK--YDKSDALWAFCMYSLSPEK 194 (708) Q Consensus 126 ~d~---------~~~G~G~~~v~~~~~~~~d~~~~~~~i~i~~v~~~~~~v~~Dp~a~~--~D~sDa~~~~~~~~~~~~e 194 (708) ..- ++. +-+|+.++..- .+|..++. + .|.+|=| -++. +..++++-.| ..+.+. T Consensus 158 ~~~~a~~~~~~e~~a---~E~v~id~v~~---~dfl~~pA-r----~W~ev~w--va~r~~mtk~e~~~rf---~~~~~~ 221 (663) T protein:vir:34 158 GAELAAAVPPTQRKA---YECVETDYLHW---QDVLWSPA-R----VWHEVRW--LAFRNLLDMREFNARF---DADGSR 221 (663) T ss_pred ccchhcccccchhhc---ccceeeeeech---hhcccchh-h----ccccccc--eeeeccCCHHHHHHhh---cCChhh Confidence 000 011 11233333211 12211111 0 1222211 1111 2334443333 233321 Q ss_pred -HHHhCCC--Ccccccccccc-----cccccCCCCCceeEEeeeeeecceEEEEEEEecCccCceeEecCCcccchHHHh Q lcl|Aclame:pro 195 -YEAEYGK--KPPTSLDVTSM-----TSWEYNWFGADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDEL 266 (708) Q Consensus 195 -~~~~~p~--~~~~~~d~~~~-----~~~~~~~~~~~~~~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 266 (708) ...-.|. +....-+.... ..-.-.|+-... +| ||..+...+ .+.+..+- + T Consensus 222 ~~~a~~~~~~~~~~~~~~~~~~~~~~a~VwEIWdK~~~-~V--~w~~eg~~~-~L~~~~p~------------------l 279 (663) T protein:vir:34 222 NLWASVPKVGKPKDGKDGQSCHPWDRAEVWEIWDKGGR-KV--DWYVEGYSA-VLDTQPDP------------------L 279 (663) T ss_pred hhhhhccCcCCccccCCCCCcchhcCcceeEEEecCCc-EE--EEEEcCcce-ecccCCCC------------------C Confidence 1222332 11111111111 111224543222 22 332222111 11111111 1 Q ss_pred hccchhhhhheeeeeEEEEEEEEecceeeecCCCCCCCCcceeeEEEeeeccCCcccccchHHhhhHHHHHHHHHHHHHH Q lcl|Aclame:pro 267 AIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLA 346 (708) Q Consensus 267 ~~~~~~~~~~~~~~~~~v~~~~~~~~~il~~~~~~p~~~~p~~p~~~~~~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~ 346 (708) ... +|.++....-+...+.+.||.++|+.|... -+==+.+..+.+.+. T Consensus 280 gl~------------------~ffPcPrpl~~~~~~ds~ipvpd~~~y~~~--------------~~E~n~~t~Rin~l~ 327 (663) T protein:vir:34 280 GLE------------------SFFPCPKPLLANWTTDKVVPRPDFVLAQDL--------------YKEIDLVSTRITLLE 327 (663) T ss_pred CCC------------------CCCCCcccccceecCCCeecCCcHHHHHHH--------------HHHHHHHHHHHHHHH Confidence 111 122222222234455677777776644322 112233333333444 Q ss_pred HHHhhcCCCc-----eeec--hhhccch---HHHHHhhcccCCce---eeecccccccccccccccccccccCccchHHH Q lcl|Aclame:pro 347 DTAAQDPGQI-----PIVG--MEQIRGL---EKHWEARNKKRPAF---LPLREVRDKSGNIIAGATPAGYTQPAVMNQAL 413 (708) Q Consensus 347 ~~l~~~~~~~-----~i~~--~~ai~~~---~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 413 (708) +.+.....-. -+.. .++.++. ...|... ...+|. +.+-+.....-.+ ..-+..-.++-... T Consensus 328 d~ikv~gvy~~~~g~~i~~~l~~a~~n~lvpV~~~~~~-~~~gg~~k~I~~~pi~~~~~aI-----~~l~~~r~qir~d~ 401 (663) T protein:vir:34 328 RAIRVVGVYDKSSGLTIGRLLSEAAQNDLIPVENWLTF-ADKGGLRGVVDWFPLEPVVAAL-----TSLRDYRRELVDAL 401 (663) T ss_pred hhhhhceeeccccchhHHHHHHHhhCCCceecchhhhh-hhhcCccchhhcccchhHHHHH-----HHHHHHHHHHHHHH Confidence 4333222210 0000 0000000 0111111 111221 1111111110000 00111222344444 Q ss_pred HHHHHHHHHHHHHHhCCChhHccccc--chh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHhcCC--- Q lcl|Aclame:pro 414 AALLQQTSADIQEVTGGSQAMQQMPS--NIA-QETVNNLMNRADMASFIYLDNMAKSLKRAGE--VWLSMAREVYGS--- 485 (708) Q Consensus 414 ~~l~~~~~~~~~~~tGv~~~~~G~~~--n~s-g~ai~~~q~q~~~~~~~~~dn~~~~~~~~~~--~~l~li~~~y~~--- 485 (708) .++-..+ +|-.-+-+.....|..+ +.+ +..|+.++.+-......+++-.+.-+-.-+. -+.+|...=.+. T Consensus 402 ~qITGia--Di~Rga~~a~ETatAQ~IKsq~gS~RIqe~qdevqR~arDi~ql~AEIl~~~~~~etl~~m~~~elp~~~e 479 (663) T protein:vir:34 402 HQVTGMA--DIMRGASDPRETAMAQGVKAKFGSIRLQRLQDEVARFASDIQRLKAEVIAEHYDVASILAQANAEFTFDKE 479 (663) T ss_pred HHHHhHH--HHhhcccCcchhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCHHHHHHHhcCCCCcccc Confidence 4444443 22222223334444321 122 2345555554333333333322211110000 011111100000 Q ss_pred --CcEEEEeccCCCceEEEecccccccCCCceEEeeccceeeEEEEEeecccchhHHHHH----------HHHHHHHHHh Q lcl|Aclame:pro 486 --EREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDAT----------VSVLTNVLSS 553 (708) Q Consensus 486 --~r~irI~~~~~~~~~v~in~~~~~~~~~~~~~~nDi~~g~~Dv~v~~~~~~~~~r~~~----------~~~l~~llq~ 553 (708) +.+-++.+..- +-+.|+ .+++.. |. +.....++.. ++++.-|+++ T Consensus 480 i~~~~~~L~n~~~--r~~~ld-----Ie~dsT--------~~--------~D~~~eK~~~~E~l~~i~~~~qq~~pl~~q 536 (663) T protein:vir:34 480 LAPKAAELIKSRF--SMYRVE-----VKPEAV--------SL--------QDFAALRNEKMEVLSGIASFMQGVAPLAQQ 536 (663) T ss_pred hhHHHHHHhcCCC--cceeee-----eccCCC--------Cc--------CChHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 00000111111 112221 111111 11 1112222222 1222233455 Q ss_pred ccccCchhHHHHHH-HHhhccchhHHHHHHHHHhhhhhhhcc-cCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 554 MLPTDPMRPAIQGI-ILDNIDGEGLDDFKEYNRNQLLISGIA-KPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQ 631 (708) Q Consensus 554 ~~~~~p~~~~~~~~-~~~~~d~~~~~ei~e~~~~~~~~~~~~-~~~~~~~~q~~~~~qq~qq~~~~~~~~~aq~~~~~~q 631 (708) ++...|....++.. +...-....++...+.+.......... .++.+ .+++++.+. ..+++++|..++++| T Consensus 537 ~p~~~p~l~Ellk~~~~~f~~~~qie~ai~~~~~~~e~aa~~~~~~~p-a~~~~~~k~-------~~~q~k~q~~~aeAq 608 (663) T protein:vir:34 537 VPGSAPFLLQMLKWSVSGLRGSSTIEGVLDKAIAAAEEAQKQAAQQSP-APQQPDPKV-------VAQAMKGQQEMAKVQ 608 (663) T ss_pred hhhhHHHHHHHHHHHhhcCChhhhHHHHHHHHHhhhHHHhhccCCCCc-ccchhhHHH-------HHHHHHHHHHHHHHH Confidence 54444433322221 223333444444444444333322211 11111 111122222 222223333333344 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhcC Q lcl|Aclame:pro 632 AEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVMEAIRLLKDVAESQQQQFQSP 698 (708) Q Consensus 632 ae~~k~~~~~~~~q~e~~~~~~~~~~~~a~~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 698 (708) ++++ .+..++|++++--+...+.++......+.+...+++++..+..+.. +...| T Consensus 609 ~e~q---~~~~~~ql~~~~~~~k~~~~a~~~~~~a~q~~~~~~~~r~~~~~a~---------~~~~~ 663 (663) T protein:vir:34 609 AEVQ---GDLLRIQAETQANETKERQQAEWNVREAAQKNLISQAARAMNPQAR---------NGGMP 663 (663) T ss_pred HHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHhhchhhh---------cCCCC Confidence 3332 2222222222222222222222222233333333333322222211 11111 Done!