Query lcl|NC_019423.1_cdsid_YP_006990612.1 [gene=IME11_7] [protein=portal protein] [protein_id=YP_006990612.1] [location=3569..5839] Match_columns 756 No_of_seqs 179 out of 214 Neff 8.3 Searched_HMMs 1612 Date Thu Nov 7 17:12:51 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_7 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_7_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:95821 Length: 763 100.0 6E-189 4E-192 1052.4 77.3 755 1-756 2-761 (763) 2 protein:vir:8846 Length: 705 # 100.0 1E-127 6E-131 716.9 66.5 652 13-709 1-705 (705) 3 protein:vir:93630 Length: 776 100.0 2E-102 1E-105 578.3 57.7 658 1-754 4-776 (776) 4 protein:vir:80165 Length: 651 100.0 6.1E-99 4E-102 559.0 58.8 615 1-702 1-651 (651) 5 protein:vir:105619 Length: 772 100.0 3.5E-97 2E-100 549.4 59.3 650 1-756 5-771 (772) 6 protein:vir:108295 Length: 711 100.0 1.4E-95 8.4E-99 540.7 61.1 599 1-698 1-711 (711) 7 protein:vir:817 Length: 714 # 100.0 8.8E-93 5.5E-96 525.3 59.2 618 1-713 1-714 (714) 8 protein:vir:10117 Length: 714 100.0 8.8E-93 5.5E-96 525.3 59.2 618 1-713 1-714 (714) 9 protein:vir:9950 Length: 714 # 100.0 8.8E-93 5.5E-96 525.3 59.2 618 1-713 1-714 (714) 10 protein:vir:2764 Length: 714 # 100.0 8.8E-93 5.5E-96 525.3 59.2 618 1-713 1-714 (714) 11 protein:vir:3296 Length: 714 # 100.0 8.8E-93 5.5E-96 525.3 59.2 618 1-713 1-714 (714) 12 protein:vir:104437 Length: 714 100.0 1.2E-91 7.6E-95 519.0 57.7 613 1-713 1-714 (714) 13 protein:vir:77597 Length: 725 100.0 1.7E-85 1.1E-88 485.3 56.5 592 22-723 1-725 (725) 14 protein:vir:9263 Length: 725 # 100.0 7.2E-85 4.5E-88 481.9 54.3 608 22-734 1-725 (725) 15 protein:vir:100920 Length: 725 100.0 2.2E-84 1.4E-87 479.2 54.9 606 22-745 1-725 (725) 16 protein:vir:105520 Length: 706 100.0 1E-83 6.4E-87 475.5 57.1 609 22-723 1-706 (706) 17 protein:vir:172 Length: 708 # 100.0 7.6E-83 4.7E-86 470.8 55.7 602 22-728 1-708 
(708) 18 protein:vir:105429 Length: 708 100.0 2.6E-82 1.6E-85 467.9 54.7 602 22-728 1-708 (708) 19 protein:vir:3520 Length: 720 # 100.0 7E-81 4.3E-84 460.0 54.5 612 22-733 1-720 (720) 20 protein:vir:94599 Length: 641 100.0 3.3E-80 2E-83 456.3 44.6 609 6-711 1-641 (641) 21 protein:vir:95449 Length: 584 100.0 5.2E-79 3.2E-82 449.8 44.1 552 1-647 1-584 (584) 22 protein:vir:3139 Length: 599 # 100.0 3.9E-70 2.4E-73 401.1 41.2 565 1-676 1-599 (599) 23 protein:vir:345 Length: 663 # 100.0 1E-39 6.4E-43 234.3 39.8 610 1-712 1-663 (663) 24 protein:vir:108295 Length: 711 100.0 1.5E-28 9.1E-32 173.2 61.0 580 13-724 1-711 (711) 25 protein:vir:7321 Length: 556 # 100.0 8.9E-29 5.5E-32 174.4 48.5 520 22-705 1-556 (556) 26 protein:vir:3520 Length: 720 # 100.0 2.7E-27 1.7E-30 166.3 54.5 599 27-756 1-720 (720) 27 protein:vir:9263 Length: 725 # 100.0 6.3E-27 3.9E-30 164.2 54.4 606 28-745 1-725 (725) 28 protein:vir:98506 Length: 555 100.0 3.6E-28 2.2E-31 171.1 45.9 521 22-703 1-555 (555) 29 protein:vir:107822 Length: 555 100.0 3.6E-28 2.2E-31 171.1 45.9 521 22-703 1-555 (555) 30 protein:vir:107404 Length: 555 100.0 3.6E-28 2.2E-31 171.1 45.9 521 22-703 1-555 (555) 31 protein:vir:102668 Length: 547 100.0 5E-28 3.1E-31 170.3 44.9 506 22-673 1-547 (547) 32 protein:vir:95315 Length: 559 100.0 1.3E-27 7.8E-31 168.1 44.6 523 22-710 1-559 (559) 33 protein:vir:103765 Length: 549 100.0 5.5E-26 3.4E-29 159.1 47.7 512 22-702 1-549 (549) 34 protein:vir:1785 Length: 555 # 100.0 2.4E-26 1.5E-29 161.0 44.6 525 22-710 1-555 (555) 35 protein:vir:1538 Length: 535 # 100.0 5.3E-26 3.3E-29 159.2 46.3 510 1-703 1-535 (535) 36 protein:vir:3361 Length: 535 # 100.0 6.6E-26 4.1E-29 158.6 46.8 510 1-703 1-535 (535) 37 protein:vir:10447 Length: 536 100.0 2.5E-25 1.5E-28 155.5 46.9 508 22-707 1-536 (536) 38 protein:vir:99672 Length: 532 99.9 3.8E-25 2.4E-28 154.5 45.9 511 1-672 1-532 (532) 39 protein:vir:2198 Length: 536 # 99.9 6.6E-25 4.1E-28 153.2 46.9 508 22-707 1-536 (536) 40 protein:vir:94572 Length: 535 99.9 
1.4E-24 8.6E-28 151.4 46.5 510 10-705 1-535 (535) 41 protein:vir:94709 Length: 522 99.9 7.1E-25 4.4E-28 153.0 43.6 497 1-678 1-522 (522) 42 protein:vir:100039 Length: 522 99.9 3E-24 1.9E-27 149.5 42.4 496 22-700 1-522 (522) 43 protein:vir:8883 Length: 543 # 99.9 2.9E-24 1.8E-27 149.6 42.0 523 1-709 1-543 (543) 44 protein:vir:78696 Length: 542 99.9 5.7E-24 3.5E-27 148.0 37.7 505 22-713 1-542 (542) 45 protein:vir:96988 Length: 516 99.9 4.6E-23 2.9E-26 143.0 40.9 490 1-672 1-516 (516) 46 protein:vir:103330 Length: 517 99.9 4.2E-22 2.6E-25 137.8 44.8 492 20-701 1-517 (517) 47 protein:vir:6322 Length: 510 # 99.9 5.4E-22 3.3E-25 137.2 44.9 482 27-669 1-510 (510) 48 protein:vir:7017 Length: 515 # 99.9 9.7E-22 6E-25 135.8 45.8 495 10-672 1-515 (515) 49 protein:vir:78942 Length: 510 99.9 1.1E-21 6.5E-25 135.6 45.6 483 22-669 1-510 (510) 50 protein:vir:80211 Length: 514 99.9 5.8E-22 3.6E-25 137.0 44.2 482 31-672 1-514 (514) 51 protein:vir:105641 Length: 516 99.9 7.1E-21 4.4E-24 131.1 41.5 491 7-672 1-516 (516) 52 protein:vir:3964 Length: 453 # 99.7 2.4E-15 1.5E-18 100.7 37.8 443 1-666 3-453 (453) 53 protein:vir:3609 Length: 452 # 99.7 1.1E-14 6.9E-18 97.1 35.6 443 5-666 1-452 (452) 54 protein:vir:733 Length: 453 # 99.7 2E-14 1.2E-17 95.7 36.8 437 1-675 3-453 (453) 55 protein:vir:106639 Length: 481 99.6 1.8E-14 1.1E-17 96.0 33.8 456 1-668 6-481 (481) 56 protein:vir:93747 Length: 472 99.6 7.3E-15 4.5E-18 98.1 31.7 452 8-674 1-472 (472) 57 protein:vir:102950 Length: 471 99.6 1.3E-14 7.8E-18 96.8 32.3 434 23-656 1-471 (471) 58 protein:vir:96494 Length: 501 99.6 1.2E-14 7.5E-18 96.9 32.0 468 1-676 1-501 (501) 59 protein:vir:95806 Length: 440 99.6 3.7E-14 2.3E-17 94.2 34.7 421 34-674 1-440 (440) 60 protein:vir:1236 Length: 483 # 99.6 1.7E-13 1E-16 90.6 38.0 459 1-656 1-483 (483) 61 protein:vir:9922 Length: 489 # 99.6 1.5E-13 9.4E-17 90.9 37.1 443 1-625 1-489 (489) 62 protein:vir:38 Length: 496 # N 99.6 3.4E-13 2.1E-16 89.0 38.3 463 10-626 1-496 (496) 63 protein:vir:105292 Length: 
478 99.6 8.2E-15 5.1E-18 97.8 29.3 460 4-656 1-478 (478) 64 protein:vir:80959 Length: 499 99.6 1.2E-13 7.7E-17 91.3 35.7 453 22-626 1-499 (499) 65 protein:vir:9871 Length: 429 # 99.6 7.2E-14 4.5E-17 92.6 34.1 422 22-661 1-429 (429) 66 protein:vir:107112 Length: 478 99.6 1.2E-14 7.4E-18 96.9 29.6 461 4-674 1-478 (478) 67 protein:vir:2732 Length: 501 # 99.6 1.7E-13 1.1E-16 90.6 35.7 468 1-670 1-501 (501) 68 protein:vir:96179 Length: 468 99.6 2.9E-13 1.8E-16 89.3 36.9 448 1-678 1-468 (468) 69 protein:vir:98883 Length: 517 99.6 7.3E-14 4.5E-17 92.6 33.4 484 1-635 1-517 (517) 70 protein:vir:79703 Length: 505 99.6 6.3E-13 3.9E-16 87.5 39.9 461 1-656 1-505 (505) 71 protein:vir:97336 Length: 492 99.6 3.6E-14 2.3E-17 94.3 30.6 461 1-674 15-492 (492) 72 protein:vir:96240 Length: 511 99.6 8E-14 5E-17 92.4 32.1 470 1-678 1-511 (511) 73 protein:vir:9306 Length: 511 # 99.6 7.5E-14 4.7E-17 92.5 31.7 470 1-681 1-511 (511) 74 protein:vir:1587 Length: 508 # 99.6 1E-12 6.2E-16 86.4 42.0 454 1-633 1-508 (508) 75 protein:vir:96266 Length: 474 99.6 6.5E-14 4E-17 92.9 30.6 452 1-674 1-474 (474) 76 protein:vir:95899 Length: 474 99.6 6.5E-14 4E-17 92.9 30.6 452 1-674 1-474 (474) 77 protein:vir:105461 Length: 470 99.6 2.8E-13 1.7E-16 89.4 33.8 438 26-664 1-470 (470) 78 protein:vir:80680 Length: 441 99.6 1.1E-12 7E-16 86.1 36.9 426 22-651 1-441 (441) 79 protein:vir:99522 Length: 470 99.6 1.2E-12 7.6E-16 85.9 41.0 444 1-656 11-470 (470) 80 protein:vir:103951 Length: 511 99.6 1.3E-13 8E-17 91.2 31.7 469 1-678 1-511 (511) 81 protein:vir:79043 Length: 479 99.5 4.6E-13 2.9E-16 88.2 33.4 446 1-651 6-479 (479) 82 protein:vir:94805 Length: 492 99.5 2.2E-13 1.4E-16 90.0 31.1 457 1-674 21-492 (492) 83 protein:vir:3028 Length: 500 # 99.5 2.4E-12 1.5E-15 84.3 36.4 467 1-633 1-500 (500) 84 protein:vir:9815 Length: 500 # 99.5 2.4E-12 1.5E-15 84.3 36.4 467 1-633 1-500 (500) 85 protein:vir:106571 Length: 499 99.5 2.2E-13 1.4E-16 90.0 30.5 487 1-675 1-499 (499) 86 protein:vir:99781 Length: 511 99.5 1.9E-12 
1.2E-15 84.8 34.6 471 1-674 1-511 (511) 87 protein:vir:94101 Length: 474 99.5 8.5E-13 5.3E-16 86.7 32.4 447 1-656 1-474 (474) 88 protein:vir:105889 Length: 474 99.5 8.5E-13 5.3E-16 86.7 32.4 447 1-656 1-474 (474) 89 protein:vir:5961 Length: 503 # 99.5 1.5E-13 9.4E-17 90.9 28.0 469 4-673 1-503 (503) 90 protein:vir:97171 Length: 512 99.5 9.9E-13 6.2E-16 86.4 32.1 472 1-680 13-512 (512) 91 protein:vir:102330 Length: 451 99.5 1.8E-12 1.1E-15 84.9 33.4 434 23-662 1-451 (451) 92 protein:vir:94546 Length: 506 99.5 1.4E-12 8.4E-16 85.6 32.4 457 1-676 7-506 (506) 93 protein:vir:4898 Length: 502 # 99.5 5.4E-12 3.4E-15 82.3 37.5 468 1-664 1-502 (502) 94 protein:vir:78805 Length: 511 99.5 7.1E-13 4.4E-16 87.2 30.5 473 1-678 1-511 (511) 95 protein:vir:96366 Length: 511 99.5 7.1E-13 4.4E-16 87.2 30.5 473 1-678 1-511 (511) 96 protein:vir:94498 Length: 474 99.5 1.4E-12 8.7E-16 85.6 31.2 451 1-676 1-474 (474) 97 protein:vir:97447 Length: 474 99.5 1.4E-12 8.7E-16 85.6 31.2 451 1-676 1-474 (474) 98 protein:vir:96839 Length: 474 99.5 7.6E-12 4.7E-15 81.5 37.1 455 1-656 1-474 (474) 99 protein:vir:95113 Length: 474 99.5 9.6E-12 5.9E-15 81.0 37.8 457 1-654 1-474 (474) 100 protein:vir:9751 Length: 422 # 99.4 7.3E-12 4.5E-15 81.6 33.1 406 23-632 1-422 (422) 101 protein:vir:2341 Length: 488 # 99.4 5.3E-12 3.3E-15 82.4 31.6 468 1-669 1-488 (488) 102 protein:vir:78227 Length: 480 99.4 8.3E-13 5.1E-16 86.8 27.1 458 22-679 1-480 (480) 103 protein:vir:2427 Length: 485 # 99.4 1.3E-11 8E-15 80.3 31.6 460 6-658 1-485 (485) 104 protein:vir:99072 Length: 479 99.4 1.8E-11 1.1E-14 79.5 32.3 451 6-669 1-479 (479) 105 protein:vir:78537 Length: 480 99.4 1.7E-12 1.1E-15 85.1 26.5 461 22-679 1-480 (480) 106 protein:vir:94742 Length: 409 99.4 5.4E-11 3.3E-14 76.9 34.2 391 23-605 1-409 (409) 107 protein:vir:7768 Length: 484 # 99.4 2E-11 1.2E-14 79.3 30.1 466 4-703 1-484 (484) 108 protein:vir:4223 Length: 486 # 99.4 2.4E-11 1.5E-14 78.8 30.5 470 4-710 1-486 (486) 109 protein:vir:103385 Length: 666 99.4 7.6E-13 
4.7E-16 87.0 21.9 581 1-668 1-666 (666) 110 protein:vir:7430 Length: 563 # 99.3 4.1E-11 2.5E-14 77.5 30.9 527 1-664 1-563 (563) 111 protein:vir:9568 Length: 410 # 99.3 7.9E-11 4.9E-14 76.0 31.9 398 39-633 1-410 (410) 112 protein:vir:78907 Length: 518 99.3 5.7E-11 3.5E-14 76.7 30.7 476 1-628 1-518 (518) 113 protein:vir:104082 Length: 485 99.3 1.1E-10 6.7E-14 75.2 39.2 468 6-731 1-485 (485) 114 protein:vir:1634 Length: 409 # 99.3 1.2E-10 7.7E-14 74.9 34.6 393 22-605 1-409 (409) 115 protein:vir:96403 Length: 666 99.3 2.2E-12 1.4E-15 84.5 21.5 581 1-668 1-666 (666) 116 protein:vir:4782 Length: 522 # 99.3 1.8E-10 1.1E-13 74.0 35.4 468 25-634 1-522 (522) 117 protein:vir:102239 Length: 527 99.2 6.3E-10 3.9E-13 71.0 29.3 502 1-657 1-527 (527) 118 protein:vir:101494 Length: 527 99.2 4.8E-10 3E-13 71.7 28.4 502 1-657 1-527 (527) 119 protein:vir:2500 Length: 501 # 99.0 4.8E-09 3E-12 66.2 30.9 472 6-669 1-501 (501) 120 protein:vir:8184 Length: 474 # 98.9 1.5E-08 9.4E-12 63.4 34.0 450 1-671 1-474 (474) 121 protein:vir:78083 Length: 537 98.9 2E-08 1.2E-11 62.8 43.3 493 7-754 1-537 (537) 122 protein:vir:7987 Length: 456 # 98.9 2.1E-08 1.3E-11 62.7 35.8 436 19-659 1-456 (456) 123 protein:vir:99916 Length: 504 98.9 2.3E-08 1.5E-11 62.4 40.4 481 1-748 1-504 (504) 124 protein:vir:105819 Length: 456 98.8 5.6E-08 3.4E-11 60.4 35.4 434 19-670 1-456 (456) 125 protein:vir:102602 Length: 456 98.8 5.6E-08 3.4E-11 60.4 35.4 434 19-670 1-456 (456) 126 protein:vir:98444 Length: 434 98.7 9.9E-08 6.2E-11 59.0 28.5 420 102-668 1-434 (434) 127 protein:vir:105520 Length: 706 98.6 1.7E-07 1.1E-10 57.6 32.0 603 74-736 1-706 (706) 128 protein:vir:105429 Length: 708 98.5 5.2E-07 3.2E-10 55.0 32.7 591 42-734 1-708 (708) 129 protein:vir:93630 Length: 776 97.9 1.1E-05 7E-09 47.7 39.0 649 10-756 1-760 (776) 130 protein:vir:3296 Length: 714 # 97.9 1.2E-05 7.3E-09 47.6 31.8 607 22-737 1-714 (714) 131 protein:vir:817 Length: 714 # 97.9 1.2E-05 7.3E-09 47.6 31.8 607 22-737 1-714 (714) 132 protein:vir:2764 
Length: 714 # 97.9 1.2E-05 7.3E-09 47.6 31.8 607 22-737 1-714 (714) 133 protein:vir:9950 Length: 714 # 97.9 1.2E-05 7.3E-09 47.6 31.8 607 22-737 1-714 (714) 134 protein:vir:10117 Length: 714 97.9 1.2E-05 7.3E-09 47.6 31.8 607 22-737 1-714 (714) 135 protein:vir:95821 Length: 763 97.3 9.8E-05 6.1E-08 42.6 39.9 666 3-756 1-757 (763) 136 protein:vir:172 Length: 708 # 96.4 0.00065 4.1E-07 38.0 34.3 601 42-749 1-708 (708) 137 protein:vir:105619 Length: 772 96.4 0.00069 4.2E-07 37.9 31.4 620 10-756 1-760 (772) 138 protein:vir:95014 Length: 491 88.7 0.031 1.9E-05 28.9 30.2 455 60-631 1-491 (491) 139 protein:vir:78393 Length: 489 88.3 0.033 2.1E-05 28.7 27.9 453 60-627 1-489 (489) 140 protein:vir:80453 Length: 535 84.0 0.064 4E-05 27.1 27.3 483 1-655 1-535 (535) 141 protein:vir:104437 Length: 714 72.1 0.19 0.00012 24.6 33.8 597 22-737 1-714 (714) 142 protein:vir:100920 Length: 725 67.0 0.26 0.00016 23.8 31.9 615 28-753 1-725 (725) 143 protein:vir:8846 Length: 705 # 61.0 0.36 0.00022 23.0 30.3 604 61-739 1-705 (705) 144 protein:vir:95149 Length: 501 57.2 0.44 0.00027 22.5 27.0 457 64-647 1-501 (501) 145 protein:vir:97265 Length: 513 55.7 0.47 0.00029 22.4 27.9 468 57-634 1-513 (513) 146 protein:vir:77597 Length: 725 53.8 0.52 0.00032 22.1 36.4 605 28-741 1-725 (725) 147 protein:vir:78641 Length: 278 39.9 1 0.00062 20.6 19.7 263 217-549 1-278 (278) 148 protein:vir:96783 Length: 488 27.8 1.8 0.0011 19.2 29.2 443 60-618 1-488 (488) 149 protein:vir:94956 Length: 452 25.5 2.1 0.0013 18.9 33.5 410 22-621 1-452 (452) No 1 >protein:vir:95821 Length: 763 # NCBI annotation: 94 kDa protein # Family: family:all:1548 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950537;genbank:gi:119952228;genbank:GeneID:5075648 Probab=100.00 E-value=6.2e-189 Score=1052.42 Aligned_cols=755 Identities=71% Similarity=1.162 Sum_probs=701.4 Q ss_pred CCcccCCCCCCCccccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCCCCCCcccCHHHHH Q lcl|NC_019423. 
1 MEHQDTFKPLPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAKPPKIKGRSQVQPRLVRR 80 (756) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~grS~~v~~~v~~ 80 (756) -+++|++.||+||+++.||++|+|++++++|+++++.++++++++++++.+|++||+++++.++|+++|||+|||++|++ T Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~grs~vv~~~v~~ 81 (763) T protein:vir:95 2 EQNTDSMVPLPDPSQATKLTSWKNELSLQALKADLDAAKPSHTAMMIKVKEWNDLMRIEGKAKPPKVKGRSQVQPKLVRR 81 (763) T ss_pred CcCccCcCCCccccchhcCCCCCChHHHHHHHHHHHhhhcchhHHHHHHHHHHHhhhccccCcccccCCCccccCHHHHH Confidence 35789999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeee Q lcl|NC_019423. 81 QAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKI 160 (756) Q Consensus 81 ~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~ 160 (756) +|||++|+|+++|||+++||+|.|+++||+++|+|+|+||||+|+++|+||+++++|||+||++|+||+||||++++++. T Consensus 82 ~ve~~~~~l~~~f~~~~~~~~~~P~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~gv~k~~W~~~~~~~ 161 (763) T protein:vir:95 82 QAEWRYSALTEPFLGSNKLFKVTPVTWEDVQGARQNELVLNYQFRTKLNRVSFIDNYVRSVVDDGTGIVRVGWNREIRKE 161 (763) T ss_pred HHHHHHHHHHHhhcCCCcEEEEecCCcchHHHHHHHHHHHHHHHhhcCchhhHHHHHHHHHhhcCcceEEEeeeeeeeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEE Q lcl|NC_019423. 
161 KTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVE 240 (756) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie 240 (756) ++..+++..+++.+.+..++.+.+++...+.+..+.+...+....+...+.+.|.+++.++.+.....+++..+++|+|+ T Consensus 162 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~p~ie 241 (763) T protein:vir:95 162 KQEVPVFSLFPIQTQEQADALQQALQLRTDNPRGYEENVDEAIKESVRFFDETGQATYAVQTGTTTTEVEVPLANHPTVE 241 (763) T ss_pred eeeehhhhhccccchhHHHHHHHHHHhhhhhhccccccccchhhhhhhhccccCcceeeecccceeEEEEEEecCceEEE Confidence 99999999999999999999999999999999888777777777778888899999999999988888888899999999 Q ss_pred EechhheEeCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceEEE Q lcl|NC_019423. 241 MLNPNNVVIDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVA 320 (756) Q Consensus 241 ~V~p~~~~~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v 320 (756) +|+|++|||||+|++|++||+|++|++++|+++|.++++.+++++.+++.........++.......+.+.|.++++|+| T Consensus 242 ~V~p~d~~iDp~a~sD~~Da~~~~~~~~~t~~dL~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~V~v 321 (763) T protein:vir:95 242 MLNPENIIIDPSCQGDINKAMFAIVSFETCKADLLKEKDRYHNLNKIDWQSSAPVNEPDHATTTPQEFQISDPMRKRVVA 321 (763) T ss_pred eecHHHheecCCCCCchhhCceEeeEEeccHHHHHhccCCccccchhcchhccccccccccccchhhccCCCcccceEEE Confidence 99999999999999899999999999999999999999889888888777655555555556666777788888999999 Q ss_pred EEEEEEeeccCCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 
321 YEYWGFYDINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGATMRGMIDL 400 (756) Q Consensus 321 ~E~w~k~d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~ 400 (756) +|||+++|++++|++++++++|+|+++|+++++||+|++|||++++++|++|++||+|+++.++|+|+++|+++|+++|+ T Consensus 322 ~E~y~~~d~~gdg~~~~~~v~~~g~~iL~~~~~p~~~~~~PFv~~~~~p~~~~~~G~gi~~~~~d~Qr~~N~~~~~~~d~ 401 (763) T protein:vir:95 322 YEYWGFWDIEGNGVLEPIVATWIGSTLIRLEKNPYPDGKLPFVLIPYMPVKRDMYGEPDAELLGDNQAVLGAVMRGMIDL 401 (763) T ss_pred EEeeeeeccCCcceeEEEEEEEEcCeeeecccccccCCCcCEEEecceeecCcccCCchHHHhhHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHhhcCCceEeeccccCccchhhhhccccccccccccccccccccccCCCcchHHHHHHHHHHHHHHHHhchhHHhcCCC Q lcl|NC_019423. 401 LGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQGNPSQSIMEHKFPELPQSAIVMTQMQNQEAESLTGVKAFSGGVT 480 (756) Q Consensus 401 l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~ 480 (756) ++++++|+|++++|++++.+......+....+.+ +..+...+.+..+|++++.++.+++++...++++|||+++++|++ T Consensus 402 l~~~~~~~~~v~~gav~~~d~~~~~pg~v~~v~~-g~~~~~~~~~~~~p~~~~~~~~~l~~~~~~~e~~TGv~~~~~G~~ 480 (763) T protein:vir:95 402 LGRSANGQRGMPKGMLDALNSRRYREGEDYEYNP-TQNPAQMIIEHKFPELPQSALTMATLQNQEAESLTGVKAFAGGVT 480 (763) T ss_pred HHhhcCCcEEeecccccchhhhcccCCceEEeeC-CCChhhhcccccCCCCcchHHHHHHHHHHHHHHhhCcchhhcCcC Confidence 9999999999999999887766655555554443 334556788888999999999999999999999999999999999 Q ss_pred ccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHhcCcceEEEecc Q lcl|NC_019423. 
481 GSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDLKGNFDIEVDIN 560 (756) Q Consensus 481 ~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~~~~~Dv~V~~g 560 (756) ++++|+||++++++++++++++..++|||+++++++|+++++||++||+++++|||+|++|++|++++|+++|||+|+++ T Consensus 481 ~~~~~~tat~v~~l~qa~~~~~~~~~r~~~~~~k~l~~~~l~Li~q~~d~~rviRI~g~e~v~v~~~~~~~~~DV~V~~~ 560 (763) T protein:vir:95 481 GESYGDVAAGIRGVLDAASKREMAILRRLAKGMSEIGNKIIAMNAVFLAEHEVVRITNEEFVTIKREDLKGNFDLEVDIS 560 (763) T ss_pred cccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEeCCccccccHHHhcCCcceEEecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccHHHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCChhHHHHhhhccCCCChhhhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 561 TAEIDNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMPDLAHELRTWQPQPDPMEEQLKQLAIQKAQLENEELQSK 640 (756) Q Consensus 561 ~a~~~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~~~~~~l~~~~~q~~p~~~~~~q~~~~~aq~e~~~~qa~ 640 (756) +++..+++.+++++|++++++.+++.+...++..++++.++++..+.++..+++++|+++++.++++.+++++++.++++ T Consensus 561 ~as~~~q~~~~l~~ll~~l~~~~~~~~~~~il~~~~d~~~~~~~~~~lr~~q~~~d~~~q~qaqle~~~~q~e~~~~~ak 640 (763) T protein:vir:95 561 TAEVDNQKSQDLGFMLQTIGPNVDQQITLNILAEIADLKRMPKLAHDLRTWQPQPDPVQEQLKQLAVEKAQLENEELRSK 640 (763) T ss_pred cchHHHHHHHHHHHHHHHhccccChHHHHHHHHHHHhhhchhhhHHHHHhcCCCccchhhhHHHHHHHHHHHHHHHHHHH Confidence 98888888888999999999999999988999999999999999999999999999998888899999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCchhhhccCCCCCCCcc- Q lcl|NC_019423. 
641 IALNNAKAKEAASSGDLKDLDYLEQESGTKHARDMEKQKAQSQGNQNLQITKALTTPTKEGETTPNISAAVGYNTLTNG- 719 (756) Q Consensus 641 a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~~~~~~q~~~~~~~~~~~a~~~~~~~~~~~~~~~~a~~~~~~~~~- 719 (756) +++.++++.+.+++++.+++++.++++++++++++++.+++++++++++++++..+...+++.+..+++++|+..+.-+ T Consensus 641 aq~~qaqa~~~~aq~e~~~~d~~~~e~~~Q~~~e~~~~~~~~eaq~~l~~~~a~~~~~~ea~~~~~~~~~~~~~~~~~~~ 720 (763) T protein:vir:95 641 IRLNDAQAQKAMAERDNKNLDYLEQESGTKHARDLEKMKAQSQGNQQLEITKALTKPRKEGELPPNLSAAIGYNALTNGE 720 (763) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccChhHHHhhhhccccccc Confidence 9999999999999999999999999999999999999999999999999999999999999999999999888664442 Q ss_pred ----cCchhcCCCCCCCCccccccccccCCCCCCCCCCCcC Q lcl|NC_019423. 720 ----NSPQERDLAAQQDPAYSLGSQYYDPSQDPASALGMNL 756 (756) Q Consensus 720 ----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 756 (756) -...+++-++...|+++++++.++|+..|+..||++| T Consensus 721 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 761 (763) T protein:vir:95 721 DTGIQSVSERDIAAEANPAYSLGSSQFDPTRDPALNPGIRL 761 (763) T ss_pred CCCccchhhcccCccccccccCCCCCCCCCCccccCCcccc Confidence 2335566778889999999999999999999999999 No 2 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=100.00 E-value=9.9e-128 Score=716.87 Aligned_cols=652 Identities=17% Similarity=0.185 Sum_probs=460.2 Q ss_pred ccccccccCCCchHHHHHHHHHHHHHHHHhhHHHH-HHHHHHHHhccccCCCCCCCCCCCcccCHHHHHHHHHHHHHHHH Q lcl|NC_019423. 
13 PAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMS-QIREWNDLMEVKGKAKPPKIKGRSQVQPRLVRRQAEWRYAPLSE 91 (756) Q Consensus 13 ~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~-~~~~~~~~y~~~~~~~~~~~~grS~~v~~~v~~~~e~~~~~L~~ 91 (756) -.|+.|--+++++++++.|...+++|++++++.++ ++.+|++||+|+++++ ..+|||+||+++|+++|||++|+|++ T Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~y~g~~~~~--~~~~~s~~~~~~v~~~v~~~~~~l~~ 78 (705) T protein:vir:88 1 MAKRRKIKPMDDEQVLRHLDQLVNDALDFNSSELSKQRSEALKYYFGEPFGN--ERPGKSGIVSRDVQETVDWIMPSLMK 78 (705) T ss_pred CCcccccccCCHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHhCCCCCc--ccCCCCccccHHHHHHHHHHHHHHHH Confidence 24444556789999999999999999999999996 6899999999987644 67899999999999999999999999 Q ss_pred hhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeeeeeeeeeecCC Q lcl|NC_019423. 92 PFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIKTETPVFQLYP 171 (756) Q Consensus 92 ~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~ 171 (756) +||+++++|+|.|++++|+++|++.|+||||+|+++|+|++++++||++||++|+||+||||+.+++...+ .+. T Consensus 79 ~~~~~~~~~~~~p~~~~D~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~dal~~g~gi~kv~we~~~~~~~e------~~~ 152 (705) T protein:vir:88 79 VFTSGGQVVKYEPDTAEDVEQAEQETEYVNYLFMRKNEGFKVMFDWFQDTLMMKTGVVKVYVEEVLKPTFE------RFS 152 (705) T ss_pred hhcCCCceEEEeeCChhHHHHHHHHHHHHhHHHhhccchhHHHHHHHHHHhhcCCeEEEeccccccchhhh------hhc Confidence 99999999999999999999999999999999999999999999999999999999999999866554433 233 Q ss_pred CCCHH-HHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeee-ecCceeEEEechhheEe Q lcl|NC_019423. 172 IENQE-QADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKA-LVNRPTVEMLNPNNVVI 249 (756) Q Consensus 172 ~~~~~-~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~-~~g~~~ie~V~p~~~~~ 249 (756) ..++. +++...+ ......+. ... ..|.+.+.+.+. 
.+|+|+|++|+|++|+| T Consensus 153 ~~~~~~l~~~~~d-------~~~~~~~~------------~~~-------~~~~~~~~~~~~~~~~~i~i~~V~p~d~~~ 206 (705) T protein:vir:88 153 GLSEDMVADILSD-------PDTSILAQ------------SVD-------DDGTYTIKIRKDKKKREIKVLCVKPENFLV 206 (705) T ss_pred cCChhhhhhhhhh-------hhhhcccc------------ccc-------ccceeeeEEeeeeecCceeeeeccHHHcee Confidence 33332 2221111 00000000 011 223344444433 47999999999999999 Q ss_pred CCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchh---hh--hhhchhhhcc---ccccc-cccccccceEEE Q lcl|NC_019423. 250 DPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWES---SS--PITDPDHESK---TPSDF-QFKDALRKKVVA 320 (756) Q Consensus 250 Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~---~~--~~~~~~~~~~---~~~~~-~~~d~s~~~V~v 320 (756) ||+|++ ++||+|++|++++|+++|+++++..+.++.+.... .. .........+ ..... .+.+...++|+| T Consensus 207 dp~a~~-~~d~~~~~~~~~~t~~dl~~~g~~~~~~~~~~~~~~~~~~~~~e~~~~~~~d~~~~~~~~~~~~~~~~r~v~~ 285 (705) T protein:vir:88 207 DRLATC-IDDARFLCHREKYTVSDLRLLGVPEDVIEELPYDEYEFSDSQPERLVRDNFDMTGQLQYNSGDDAEANREVWA 285 (705) T ss_pred cCCCCC-cccCcEEEEEEeccHHHHHhhcCChhHhhhhhcccccchhhhhhhccccccccccccccccccccCCceeEEE Confidence 999975 88999999999999999999876654443322211 10 0000111011 11111 122334567999 Q ss_pred EEEEEEeeccCCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 
321 YEYWGFYDINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGATMRGMIDL 400 (756) Q Consensus 321 ~E~w~k~d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~ 400 (756) ||||++++++++|+.++++++|+|+++|+.++ .+++||++++++|+++++||+|+++.++|+|+.+|+++|+++|+ T Consensus 286 ~E~y~~~d~~~d~~~~~~~~~~~g~~il~~~~----~~~~PF~~~~~~p~~~~~~G~g~~~~~~d~Q~~~n~~~~~~~d~ 361 (705) T protein:vir:88 286 SECYTLLDVDGDGISELRRILYVGDYIISNEP----WDCRPFADLNAYRIAHKFHGMSVYDKIRDIQEIRSVLMRNIMDN 361 (705) T ss_pred EEeeeEecccCCcceeeEEEEEeCcccccccc----CCCCCEEEecceeecCccccCChHHHHhHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999998653 47899999999999999999999999999999999999999999 Q ss_pred HHhhcCCceEeeccccCccchhhhhccccccccccccccccccccccCCCcchHHHHHHHHHHHHHHHHhchhHHhcCCC Q lcl|NC_019423. 401 LGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQGNPSQSIMEHKFPELPQSAIVMTQMQNQEAESLTGVKAFSGGVT 480 (756) Q Consensus 401 l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~ 480 (756) ++++++|++++++|+++..+.... .+++++..++.+++.++++|+++++++.|++++.+.++++|||+++++|++ T Consensus 362 ~~~~~~~~~~~~~g~v~~~d~~~~-----~pg~vv~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGi~~~~~G~~ 436 (705) T protein:vir:88 362 IYRTNQGRSVVLDGQVNLEDLLTN-----EAAGIVRVKSMNSITPLETPQLSGEVYGMLDRLEADRGKRTGITDRTRGLD 436 (705) T ss_pred HHhccCCceeccccccCccccccc-----CCCeeEEecCCCccccccCCcCcHHHHHHHHHHHHHHHHhhCCchHHcCCC Confidence 999999999999999876544332 334445556667899999999999999999999999999999999999998 Q ss_pred cccc--chhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHhcCcceEEE Q lcl|NC_019423. 
481 GSAY--GDVAAGIRGALDAASKREMAILRRLAK-GMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDLKGNFDIEV 557 (756) Q Consensus 481 ~~a~--~~tA~~i~~~~~aa~~~l~~~~~n~~~-~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~~~~~Dv~V 557 (756) ++++ +.||++++++++++++++..++++|++ +++++|++++.||++||++++++||+| +|++|+|+++.++|||+| T Consensus 437 ~~~~~~~~Ta~~i~~~~~~~~~r~~~~~r~~a~~~~~~l~~~~~~li~~~~~~~~~~ri~g-~~v~v~~~~~~~~~~v~v 515 (705) T protein:vir:88 437 QNTLHSNQAAMSVNQLMTAAEQQIDLIARMFAETGVKRLFQLLHDHAIKYQNQEEVFQLRG-KWVAVNPANWRERSDLTV 515 (705) T ss_pred cccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCceEEeecc-chhccchHhhccCCceEE Confidence 8775 579999999999999999999999986 789999999999999999999999998 699999999999999999 Q ss_pred ecccccHHHH-HHHHHHHHH---HHh------hccCCHhHHHHHHHHHHhhcCChhHHHHhhhccCCCCh---hh----h Q lcl|NC_019423. 558 DINTAEIDNQ-KSQDLGFMV---QTL------GNTVDQSITLSLVAKIAELKRMPDLAHELRTWQPQPDP---ME----E 620 (756) Q Consensus 558 ~~g~a~~~~~-~~q~l~~ll---q~~------~~~~~~~~~~~~l~~l~e~~~~~~~~~~l~~~~~q~~p---~~----~ 620 (756) ++++++.++. +.+++..++ +.+ ++.+++....+++.++++.+++.+..+++......... ++ . T Consensus 516 ~v~~~~~~~eq~~a~l~~ll~~~q~l~~~~~~~~~~~~~~~~~~~~el~e~~~~k~~~~~~~~~~~~e~~~~~~~~~q~e 595 (705) T protein:vir:88 516 TVGIGNMNKDQQMLHLMRIWEMAQAVVGGGGLGVLVSEQNLYNILKEVTENAGYKDPDRFWTNPNSPEALQAKAIREQKE 595 (705) T ss_pred eeccccchHHHHHHHHHHHHHHHHHhhcccchhhhcChHHHHHHHHHHHHhhhhhhHHHHhhhhhhHHHHHHHHhhhhhh Confidence 9887765432 233333333 222 23455677778889999999888776665332111100 00 0 Q ss_pred hH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHH------HHHHHHHHHHHHHH Q lcl|NC_019423. 621 QL-------KQLAIQKAQLENEELQSKIALNNAKAKEAASSGDLKDLDY--LEQESGTKH------ARDMEKQKAQSQGN 685 (756) Q Consensus 621 ~~-------~q~~~~~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~--~~q~~~~k~------~~~~~~~~~q~~~~ 685 (756) .. .|.++++++++.+..++.++..+.+++..+++.+..+.+. 
.+++...++ .+..++.+++.+++ T Consensus 596 ~~~~~~~~~~q~e~~k~q~e~~~~q~e~q~~q~E~q~~q~e~e~~~~~~~~~~~e~~~~~a~~~~~~~~~e~e~~~~e~e 675 (705) T protein:vir:88 596 AQPKPEDIKAQADAQRAQSDALAKQAEAQMKQVEAQIRLAEIELKKQEAVLQQREMALKEAELQLERDRFTWERARNEAE 675 (705) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 00 0111112222222222222221111111111111111100 000000000 01111122222233 Q ss_pred HHHHHHHHHHHHhhccCCc------hhhhc Q lcl|NC_019423. 686 QNLQITKALTTPTKEGETT------PNISA 709 (756) Q Consensus 686 ~~~~~~~a~~~~~~~~~~~------~~~~~ 709 (756) .++++.++.....+...-+ .++.+ T Consensus 676 ~~~e~~q~~~~~~~~~~~~~~~k~~~~~rr 705 (705) T protein:vir:88 676 YHLEATQARAAYIGDGKVPETKKPTKAVRR 705 (705) T ss_pred HHHHHHHHHHHHHHHHhHHHHHHHHHHhcC Confidence 3333333332222222111 11112 No 3 >protein:vir:93630 Length: 776 # NCBI annotation: Bcep22gp51 # Family: family:all:487 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944280;genbank:gi:38640357;genbank:GeneID:2658279 Probab=100.00 E-value=1.9e-102 Score=578.26 Aligned_cols=658 Identities=15% Similarity=0.156 Sum_probs=415.2 Q ss_pred CCcccCC--------C-CC-CCcc--cccc-ccCCCchH---HHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCC Q lcl|NC_019423. 1 MEHQDTF--------K-PL-PDPA--QSEK-LTDWKKEP---SIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAKP 64 (756) Q Consensus 1 ~~~~~~~--------~-~~-~~~~--~~~~-~~~~~~~~---~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~ 64 (756) +...|+. - |+ ++.+ .+++ -.+..++. +.+.|...++.+...+....+...++++||+|+.+.+. T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~ 83 (776) T protein:vir:93 4 LNDKDSTQLVPARTDEGELSPGEDAAQREKPANPLDSEQAVELHSRLLSYYRQELSRQQDNRAEMAVDEDYYDNIQWSQD 83 (776) T ss_pred ccccccccccccccccccCCCCCcccchhcccCCCCCHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHH Confidence 1221111 1 11 1111 1111 11344444 44455555555555555555667799999999977653 Q ss_pred CC----CCCCCcccCHHHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHH Q lcl|NC_019423. 
65 PK----IKGRSQVQPRLVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHS 140 (756) Q Consensus 65 ~~----~~grS~~v~~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~ 140 (756) .. ..||+.+|-+.|+.+|+|++.... .+..-+.|.|++++|++.|+..|.++||+ ...++......+++++ T Consensus 84 ~~~~l~~~g~p~~~~N~i~~~i~~v~g~~~----~nr~~~~~~p~~~~d~~~Ae~l~~~~~~~-~~~~~~~~~~~~af~d 158 (776) T protein:vir:93 84 EIDELKERGQAPTVYNVISQSVNWIIGSEK----RGRSDFKVLPRRKDGGKAAERKTALLKYL-SDVNHTPFERSMAFEE 158 (776) T ss_pred HHHHHHhcCCceEEecchHHHHHHHHHHHH----hCCcceEEecCChhHHHHHHHHHHHHHHH-HHhhcHHHHHHHHHHH Confidence 32 489999999999999999998884 45566999999999999999999999997 4788899999999999 Q ss_pred HhhcCceEEEEeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceec Q lcl|NC_019423. 141 IVDDGTGIARIGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAI 220 (756) Q Consensus 141 al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~ 220 (756) +|++|+||++++|+++.. T Consensus 159 ~~~~G~G~~~v~~d~~~~-------------------------------------------------------------- 176 (776) T protein:vir:93 159 TTKAGIGWLESQVQDEND-------------------------------------------------------------- 176 (776) T ss_pred hhhcCcceEEEEeeccCC-------------------------------------------------------------- Confidence 999999999999974310 Q ss_pred cCceeEEEeeeeecCceeEEEechhheEeCCCCcC-ccccCceEEEEeecCHHHHHhhccchhh-h-----cccCchhhh Q lcl|NC_019423. 221 QTGVTEVEVEKALVNRPTVEMLNPNNVVIDPSCNG-DLDKALYAVISFETCKADLMKNKDRYHN-L-----DKIDWESSS 293 (756) Q Consensus 221 ~~g~~~~~~~~~~~g~~~ie~V~p~~~~~Dp~a~~-d~~da~~v~~~~~~t~~el~~~~~~~~~-l-----~~~~~~~~~ 293 (756) .+.+++++|+|++|||||+|++ |++||+|++|++|+|+++++.+++.... + +.+.+.... 
T Consensus 177 -------------~~~~~~~~~~p~~i~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~ 243 (776) T protein:vir:93 177 -------------GEPIYAGAESWRNILWDSTYRRLDMDDCRYIFRVKWVDLDVMLAIFPERAAQLRAAAVDNFETWGTD 243 (776) T ss_pred -------------CCceEeeccChhheeeccccccCCHHHHhhhhhhccCCHHHHHHhcCCchHHHHHhhhhcccccchh Confidence 0124467899999999999986 8999999999999999999998654321 1 111110000 Q ss_pred -hhhc------hhhhccccccccccccccceEEEEEEEEEeecc-----------------------------------C Q lcl|NC_019423. 294 -PITD------PDHESKTPSDFQFKDALRKKVVAYEYWGFYDIN-----------------------------------D 331 (756) Q Consensus 294 -~~~~------~~~~~~~~~~~~~~d~s~~~V~v~E~w~k~d~~-----------------------------------~ 331 (756) .... .........+..+.+.++++|+|+|||+|..+. . T Consensus 244 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~r~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~g~~~~~~ 323 (776) T protein:vir:93 244 DIDGDDAMDSPEYERSMNSVTAGAVAYARKRVRMIEAWFRMPVRVQRLKGRNSDFRGEVFDPNDERHVLEVESGRAVLAV 323 (776) T ss_pred cccccccccccccccccccccccccccCCCeEEEEEEEEeeeeehhhcccccccccceeecccchHHHHHhhcCceeehh Confidence 0000 001111222334556677899999999974210 0 Q ss_pred CceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEe Q lcl|NC_019423. 332 DGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGATMRGMIDLLGRSANGQRGY 411 (756) Q Consensus 332 ~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~ 411 (756) .+..+.++++|+|+++|+.+++||+|++||||+++++++++++||+|++++|+|+||++|+++|+++|+| ++.++++ T Consensus 324 ~~~~~v~~~~~~g~~~l~~~~~p~~~~~~Pfv~~~~~~~~~~~~~~G~v~~~~d~Q~~~N~~~s~~~~~l---~~~~~~~ 400 (776) T protein:vir:93 324 SPMMRMHCAIMTTRDLMWAGPSPYRHNRYPFTPIWGFRRARDGMPYGVIRFMRGMQDDVNKRLSKALYIL---STNKVLM 400 (776) T ss_pred eeeeeeEEEEEecchhhhccCCCCCCCccceEEecCceecccccccchHHhhhHHHHHHHHHHHHHHHhh---cCCceee Confidence 1224567889999999999999999999999999999999999999999999999999999999999887 4668999 Q ss_pred eccccCccchhhhhccccccccccccccc--cccccccCCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHH Q lcl|NC_019423. 
412 PKGMLDTLNRRRYDDGQDYEYNPMQGNPS--QSIMEHKFPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAA 489 (756) Q Consensus 412 ~~gav~~~~~~~~~~~~~~~~~~~~~~~~--~~i~~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~ 489 (756) ++|++++.+..+..... +...+..+++ ..+.+.+.+++++++++++++..+.++++|||+++++|..+++.+++| T Consensus 401 ~~gav~~~d~~~~~~~r--p~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~tGi~~~~~G~~~n~~Sg~a- 477 (776) T protein:vir:93 401 EEGAVDDIDEFRREAAR--PDAVMTVKNGKLGAVKMDVDRDLAPAHLELASRSIQMIQQVGGVTDEMLGRTTNAVSGVA- 477 (776) T ss_pred ccccccchHHHHHhccc--CCceeeeCCccccccccccCcCccHHHHHHHHHHHHHHHHhhCcChHHhCCCcchhhHHH- Confidence 99999988776654432 2222333332 356667778899999999999999999999999999999888765554 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecC----ceeecC-----HhHhcCcceEEEecc Q lcl|NC_019423. 490 GIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNE----QYVEIK-----REDLKGNFDIEVDIN 560 (756) Q Consensus 490 ~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~----~~v~i~-----~d~~~~~~Dv~V~~g 560 (756) +++++++|++++..++|||+++++++|+++|+||++||+++|+|||+|+ +||.|| +|..+|+|||+|.+| T Consensus 478 -i~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~~~~~~~~~v~in~~~~~nd~~~~~~dv~v~~~ 556 (776) T protein:vir:93 478 -IQARQEQGSVATNKLFDNLRLAFQQHGEKELSLIEQYMTEEKQFRITNSRGNPEYVTVNDGLPENDITRTKADFIIDEA 556 (776) T ss_pred -HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcceEEEEeecCCCcceEEecccchhhhhccceeeEEEeec Confidence 8889999999999999999999999999999999999999999999986 599986 455568999999999 Q ss_pred cccHHHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCCh---hHHHHhhhccCCCChhhhhHH-------HHHHHHH Q lcl|NC_019423. 561 TAEIDNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMP---DLAHELRTWQPQPDPMEEQLK-------QLAIQKA 630 (756) Q Consensus 561 ~a~~~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~---~~~~~l~~~~~q~~p~~~~~~-------q~~~~~a 630 (756) +++.++ +.+++..|+++++ .++|.+...++..+++.++++ ++.+.++...++++|.+.++. 
+.+++++ T Consensus 557 ~~~~s~-r~~~~~~l~ql~~-~~~p~~~~~~~~~~~e~~d~p~~~e~~~~l~~~~~~~~p~q~~~~~e~~~~qq~q~~~~ 634 (776) T protein:vir:93 557 EWRATM-RQAAVAELMEVIG-KMPPEIALTMLDLLVENMDIPNRDELVKRIRAVNGQKDPDQDEPTPEEIAREQAQQQQQ 634 (776) T ss_pred ccchhH-HHHHHHHHHHHHh-hcChhhHHHHHHHHHHhcCccchHHHHHHHHHhhcccccchhhcchhHHHHHHHhhHHH Confidence 887654 3334444444442 345666666666666666665 455556655554443332222 2222222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHHh--hccC Q lcl|NC_019423. 631 QLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHA------RDMEKQKAQSQGNQNLQITKALTTPT--KEGE 702 (756) Q Consensus 631 q~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~------~~~~~~~~q~~~~~~~~~~~a~~~~~--~~~~ 702 (756) +++.++.++++...++++.+.++++..++++..+........ ..++.......+.......+...... ..+. T Consensus 635 q~q~~~~~a~~~~~qa~a~~~~aea~~~~aqa~~~~~~a~~~~~~a~q~a~qa~~~~~~~~~~a~~a~~~~~~a~~~~p~ 714 (776) T protein:vir:93 635 QYNDALAIATLEEQQAKARKAAAEAQVAEAKAKHISRMAIREGVGAVKDATDAATAIAFMPELAGLSDGILRESGWDDPN 714 (776) T ss_pred HHHHHHhhhhhhHhhHHHHHHHHHHHHHhhhhhhhhhcchhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhccccccccc Confidence 222222222333333333332233222222211111111000 00111000000000000011100000 0011 Q ss_pred CchhhhccCCCCCCCcccCchhcCCCCCCCC----ccccccccccCC-------------CCCCCCCCC Q lcl|NC_019423. 703 TTPNISAAVGYNTLTNGNSPQERDLAAQQDP----AYSLGSQYYDPS-------------QDPASALGM 754 (756) Q Consensus 703 ~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~-------------~~~~~~~~~ 754 (756) .|+..+.+.| ..|....|+.+..| .+.-.+.-..|. 
..|.|+==| T Consensus 715 ~p~~~~~~~~-------~~~~~~~p~~p~~p~~p~~p~~~~~~~~p~~p~~~p~~p~~~~~~~~pqqP~ 776 (776) T protein:vir:93 715 TPQPASAASG-------MPPAPAQPAQPANPAQPPAPGQAASEAQPALPANPPQPPGVVPDGAAPQQPM 776 (776) T ss_pred cccccccccC-------CCCCCCCCCCCCCcCCCCCCCCCCCCCCCcccCCCCCCCCCCCCCCCCCCCC Confidence 1111100000 01111112111111 122222111221 111111111 No 4 >protein:vir:80165 Length: 651 # NCBI annotation: portal protein # Family: family:all:1548 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285799;genbank:gi:148747833;genbank:GeneID:5220441 Probab=100.00 E-value=6.1e-99 Score=559.02 Aligned_cols=615 Identities=15% Similarity=0.163 Sum_probs=420.8 Q ss_pred CCcccCCCCCCCccccccccCCCc-hHHHHHHHHHHHHHHHHhhHHHHHHH----------HHHHHhccccCC--CCCCC Q lcl|NC_019423. 1 MEHQDTFKPLPDPAQSEKLTDWKK-EPSIQLLKGDLESAKPAHDAIMSQIR----------EWNDLMEVKGKA--KPPKI 67 (756) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~l~~~~~~a~~~~~~~~~~~~----------~~~~~y~~~~~~--~~~~~ 67 (756) |+.--| +-+.|+..|.+ +.+.+.|..+++.++++++...++|. ++++||++.... .+++. T Consensus 1 ~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~w~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~ 73 (651) T protein:vir:80 1 MKLATT-------TTDKNRQTYDETHDVSSYVKKEYKRFCDARQVCEETWLEAWGMYLSTPEAQDYLRDQVLRSVGDVNA 73 (651) T ss_pred Cccccc-------ccchhhhhhhhhHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHhhcccHHHHHhhccccccccCCCCC Confidence 333222 22334445544 45577888899999999998887774 567888876543 34556 Q ss_pred CCCCcccCHHHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhh--cCCcch-HHHHHHHHhhc Q lcl|NC_019423. 68 KGRSQVQPRLVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQ--LNKVKL-VDDYVHSIVDD 144 (756) Q Consensus 68 ~grS~~v~~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~--~~~~~~-~~~~v~~al~~ 144 (756) .|||+||+++|+++|+|++|+|+++||++++||+|.|. +|++.|++.+++|||++..+ ..+|.. ++.+++++|+. 
T Consensus 74 ~~rs~~~~~~v~~~ve~~~~~l~~~~~~~~~~~~~~p~--~~~d~a~~~~~~~~~~~~~~l~~~~~~~~~~~~~~d~l~~ 151 (651) T protein:vir:80 74 DWRHKITTGKAFEAIETIHAYLMSATFPNKNWFDVVPA--KPGQDNLLVSRLIKRYVQDKLTEGKFRAAYANFLRQLLIT 151 (651) T ss_pred CCCccccChhHHHHHHHHHHHHHHhhcCCCceeEeccC--CchhHHHHHHHHHHHHHHHHhhccCcHHHHHHHHHhhccc Confidence 79999999999999999999999999999999999995 55567999999999999865 234554 44678999999 Q ss_pred CceEEEEeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCce Q lcl|NC_019423. 145 GTGIARIGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGV 224 (756) Q Consensus 145 g~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~ 224 (756) |+||+||+|++++++.++.+.. + .+.. -| +.. T Consensus 152 G~~i~kv~we~~~~~~~~~~~~---------~--------~~~~------------------------~~-------~~~ 183 (651) T protein:vir:80 152 GNSVLALPWRVETAEVKKKVQV---------R--------TPLF------------------------ED-------EPT 183 (651) T ss_pred CceEEEEeecceeeeeehheec---------c--------cccc------------------------cc-------ccc Confidence 9999999999887666553210 0 0000 00 011 Q ss_pred eEEEee-eeecCceeEEEechhheEeCCCCcCccccCceEEEEeecCHHHHHhhc--cchhhhcccCchhhhhhhchh-- Q lcl|NC_019423. 225 TEVEVE-KALVNRPTVEMLNPNNVVIDPSCNGDLDKALYAVISFETCKADLMKNK--DRYHNLDKIDWESSSPITDPD-- 299 (756) Q Consensus 225 ~~~~~~-~~~~g~~~ie~V~p~~~~~Dp~a~~d~~da~~v~~~~~~t~~el~~~~--~~~~~l~~~~~~~~~~~~~~~-- 299 (756) +.+.+. ...+|+|+|++|+|++|||||+|+ +++||.|++|++ +|+.++..+. +.+.+++.............+ T Consensus 184 ~~v~~~~~~~~~~~~i~~v~p~~~~~dp~a~-~~~d~~~v~~~~-~t~~~l~~l~~~g~~~~~~~~~~~~~~~~~~~~~~ 261 (651) T protein:vir:80 184 FEVVSEEREVKSSPDFEVLDMFDCFYDPNVT-DPNRGAFIRKLT-KTKADILNLLSEGYYYGVDPLDVVEHKCKDTSDTK 261 (651) T ss_pred eeeeccceeeeceeEEEEecHHHeeecCCCc-Cccccceeeeee-eeHHHHHHHHhcccccchhhHHHHhhhccccccCC Confidence 222332 335689999999999999999996 588999998875 4666665442 222222221111111100011 Q ss_pred -hhcccccccc-ccccccceEEEEEEEEEeeccCCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCC Q lcl|NC_019423. 
300 -HESKTPSDFQ-FKDALRKKVVAYEYWGFYDINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGE 377 (756) Q Consensus 300 -~~~~~~~~~~-~~d~s~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~ 377 (756) .......... ....+.++|.|||||++++.+++++ +.+++++.|+.+|+.+++||+++ +||++++|++++|++||+ T Consensus 262 ~~~~~~~~~~d~~~~~~~~~v~v~E~~~~~d~e~~~~-~~~~v~~~g~~il~~~~~~~~~~-~Pf~~~~~~~~~~~~yG~ 339 (651) T protein:vir:80 262 QDMLSTFQGVTTSLWSPHQNVELLEYWGDIHLENKTY-HDVVVTIMGNEVLRFEQNPYWCG-RPFVIGTYIPTARQPYAM 339 (651) T ss_pred ccccccccCCCccccccccceEEEEEEEEeeccCCce-EEEEEEEcCcEEecccccCCCCC-CCeeeecceecCccccCC Confidence 1111111111 1123456899999999999998887 55688899999999999999875 599999999999999999 Q ss_pred chHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhhhhccccccccccccccccccccccC-CCcchHHH Q lcl|NC_019423. 378 ADAELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQGNPSQSIMEHKF-PELPQSAI 456 (756) Q Consensus 378 g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~-~~~~~~~~ 456 (756) |+++.+.|.|+.+|+++|+++++++++++|++++++|++.+.++.... +.+++..+....+.++++ +..++..+ T Consensus 340 g~~~~~~~~q~~ln~l~~~~ld~~~~~~~~~~~v~~d~~~~~~~l~~~-----pg~vi~~~~~~~~~~l~~~~~~~~~~~ 414 (651) T protein:vir:80 340 GALQPNLGMLHELNIITNQRLDNLELAIDQMYTLRSDGLLQPEDVYTE-----PGKVFLVSDHGDLQPLANQSSNFSITY 414 (651) T ss_pred ChHHHHhHHHHHHHHHHHHHHHHHHHHhCCcEEecCCccccHHHhhcC-----CCceEEecCCCCceeeccCcccchhHH Confidence 999999999999999999999999999999999998877654433222 233333444455666554 33567788 Q ss_pred HHHHHHHHHHHHHhchhHHhcCCCcccc-chhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhCCCCcEE Q lcl|NC_019423. 457 VMTQMQNQEAESLTGVKAFSGGVTGSAY-GDVAAGIRGALDAASKREMAILRRLAK-GMADIGTKICAMNAVFLSEKEVV 534 (756) Q Consensus 457 ~~l~~~~~~~e~~tGv~~~~~G~~~~a~-~~tA~~i~~~~~aa~~~l~~~~~n~~~-~~~~l~~~~l~li~q~~~~~r~i 534 (756) ++++++.+.++++|||+++++|..+... 
+.||++|+++++++++++..++++|++ ++++++++++.++++|++.++++ T Consensus 415 ~~l~~l~~~~~~~~gv~~~~~g~~~~~~~~~TAteI~~~~~~~~~~l~~v~~~l~~e~l~pl~~r~l~l~~~~~~~~~~~ 494 (651) T protein:vir:80 415 QESSFLESTIDKNFGTGNYVGANAARSGERVTAAEVAAVREAGGNRLSGIHKHIEETSLLVLLEKVMHLVQQFTDQPGMV 494 (651) T ss_pred HHHHHHHHHHHHHhcCChHHhCCCccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcccce Confidence 9999999999999999999999877654 569999999999999999999999997 89999999999999999999999 Q ss_pred EEecC-----ceeecCHhHhcCcceEEEecccccH--HHHHHHHHHHHHHHhhcc--CCH-hHHHHHHHHHHhhcCChhH Q lcl|NC_019423. 535 RITNE-----QYVEIKREDLKGNFDIEVDINTAEI--DNQKSQDLGFMVQTLGNT--VDQ-SITLSLVAKIAELKRMPDL 604 (756) Q Consensus 535 RI~g~-----~~v~i~~d~~~~~~Dv~V~~g~a~~--~~~~~q~l~~llq~~~~~--~~~-~~~~~~l~~l~e~~~~~~~ 604 (756) ||+|+ .++.++++++.++++++ ..|+... +....+++..+++.+++. +.. .+...++..+++.+|+++. T Consensus 495 ri~~~~~~~~~~~~i~~~dl~~~~~iv-~~g~~~~~~r~~~~~~l~~~~q~~~~~p~~~~~~~~~~~~~~l~~~~g~~~~ 573 (651) T protein:vir:80 495 RVAGDEAGAYEYYELDVEDLQKEVRLV-PIGSDHVIERKQYIEDRLTFIQAVAQVPEMGQLVDYKRILVDLLQHWGFEEP 573 (651) T ss_pred eecccccccccccccCccceeeeeeee-eccHHHHHHHHHHHHHHHHHHHhhccCCccchhhhHHHHHHHHHHHcCCCCc Confidence 99986 37788899999999984 4454322 334456667777776643 222 3456778899999999999 Q ss_pred HHHhhhccCCCChhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 605 AHELRTWQPQPDPMEEQLKQLAIQKAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARDMEKQKAQSQG 684 (756) Q Consensus 605 ~~~l~~~~~q~~p~~~~~~q~~~~~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~~~~~~q~~~ 684 (756) ..++....+++++.++++. +. |++....++..+ ++++++ ++.++.++.+++.+.+++.+. 
T Consensus 574 ~~~l~~~~q~~~~~~~~~~---~~--q~~~~~~~a~~~-------~~~~~~--------~~~~~~~~~~~~~~~~~~~~~ 633 (651) T protein:vir:80 574 EAYLKQQDQQAPANPQEAL---LS--QAKDVGGQAMSN-------MLQNQL--------QADGGTQMMSEMYGTPNADQM 633 (651) T ss_pred HHhcCCCccchhhhhhHHH---Hh--hHHHHHHHHHHH-------HHHHHH--------HHHHHHHHHHHHHHHHHHHHH Confidence 8887544433322211111 11 111111111111 111110 111122222333333333344 Q ss_pred HHHHHHHHHHHHHhhccC Q lcl|NC_019423. 685 NQNLQITKALTTPTKEGE 702 (756) Q Consensus 685 ~~~~~~~~a~~~~~~~~~ 702 (756) ++++.+.++.+.++.-.- T Consensus 634 ~~~~~~~~~~l~~~~~~~ 651 (651) T protein:vir:80 634 QQELMATTPNVSEQQLTQ 651 (651) T ss_pred HHHHHHHHHHHHHhhccC Confidence 444444443333332221 No 5 >protein:vir:105619 Length: 772 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164304;genbank:gi:56692922;genbank:GeneID:3197230 Probab=100.00 E-value=3.5e-97 Score=549.36 Aligned_cols=650 Identities=12% Similarity=0.089 Sum_probs=435.5 Q ss_pred CCcccCCCCCCCccccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCC----CCCCCcccCH Q lcl|NC_019423. 1 MEHQDTFKPLPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAKPPK----IKGRSQVQPR 76 (756) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~----~~grS~~v~~ 76 (756) =+.++..+.+++.. -.+|+ ...+..+..+.+ .+........+..+||+|+-+.+..+ ..|+.-++-+ T Consensus 5 ~~~~~~~~~~~~~~-~~~~~----~~~~~~~~~~~~----~q~~~r~~a~~d~~fy~G~QW~~~~~~~l~~~g~p~~~~N 75 (772) T protein:vir:10 5 ENDRQYLNGLPPAG-DTPLT----VDEYADINYEIE----DQPAWRAVADKEMDYADGNQLDTELLRRQQALGIPPAVED 75 (772) T ss_pred hhhHHhhccCCccc-ccccC----HHHHHHHHHHHh----ccHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEEc Confidence 23344555444333 22222 112222333322 23333345668899999997765332 5799999999 Q ss_pred HHHHHHHHHHHHHHHhhcCCCCEEEEecCC-cchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeee Q lcl|NC_019423. 
77 LVRRQAEWRYAPLSEPFLSSSKLFKLTPVT-FEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWER 155 (756) Q Consensus 77 ~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~-~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~ 155 (756) .|+.+|+|++..- -.+-.=+.|.|.+ .+|++.|+..|.+++|+.. .++.-....++|+++|++|.|++.++++. T Consensus 76 ~i~~~v~~v~g~~----~~nr~d~~v~Pr~~~~d~~~Ae~l~~~~~~~~~-~~~~~~~~s~Af~~~i~~G~Gw~e~~~~~ 150 (772) T protein:vir:10 76 LIGPALLSLQGYE----AVTRTDWRVTPNGDVGGQEVADALNYRLNTAER-QSGADRACSEAFRPQIACGIGWVEVSRES 150 (772) T ss_pred chHHHHHHHHHHH----HhcCcceEEecCCCchHHHHHHHHHHHHHHHHH-hcChHHHHHHHHHHhhhcCceeEEecccc Confidence 9999999998887 5556669999985 6999999999999999866 34444447789999999999976653321 Q ss_pred eeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecC Q lcl|NC_019423. 156 KTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVN 235 (756) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g 235 (756) . ...+ T Consensus 151 d---------------------------------------------------------------------------~~~~ 155 (772) T protein:vir:10 151 D---------------------------------------------------------------------------PFKF 155 (772) T ss_pred C---------------------------------------------------------------------------CCCC Confidence 1 0113 Q ss_pred ceeEEEechhheEeCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccC-----------------chhhhhhhc- Q lcl|NC_019423. 236 RPTVEMLNPNNVVIDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKID-----------------WESSSPITD- 297 (756) Q Consensus 236 ~~~ie~V~p~~~~~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~-----------------~~~~~~~~~- 297 (756) .|+|++|+|++|||||+|+.|++||+|+++++|||+++++.+++....+.... +..++.... 
T Consensus 156 ~i~i~~v~p~~v~~Dp~a~~D~sDar~~~~~~~~~~d~~~~~fp~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 235 (772) T protein:vir:10 156 PYRCRPIRRDEIHWDMKCGDDWEACRFLRRQRWLSPDRIALVFPEHAELIGMVGKYGSTWWGQPDLGMMEGGTSTGLHNA 235 (772) T ss_pred CeEEEeeCcccceecCCCCCCHHHhhhhhhhccCCHHHHHHhCCCchhHHHhhhhhcccccCcccccccccccccccccc Confidence 57899999999999999988999999999999999999999877543332211 111111111 Q ss_pred -hhhhccccccccccccccceEEEEEEEEEeec--------cCC-------------------------ceeEEEEEEEE Q lcl|NC_019423. 298 -PDHESKTPSDFQFKDALRKKVVAYEYWGFYDI--------NDD-------------------------GSLEPIVATWI 343 (756) Q Consensus 298 -~~~~~~~~~~~~~~d~s~~~V~v~E~w~k~d~--------~~~-------------------------g~~~~~~~~~~ 343 (756) ............|.+.++++|+|+|||+|... +|. ...+.++++|+ T Consensus 236 ~~~~~~~~~~~~~~~~~~~~rVrv~E~w~r~~~~~~~~~~~~g~~~~~~~~~~~~~~~l~~g~~~~~~~~~~rv~~~~~~ 315 (772) T protein:vir:10 236 WNEARAWTVQEDHWYNPTSKEICLVELWYRRWVQVHVLKSPDGRVVEYDPNNLAHNIALASGRISPKKVTVSRVRRSYWL 315 (772) T ss_pred cchhhccccccccccccCCceEEEEEEeeeeeeeeeeeccCCCceEeeCcccHHHHHHHhhcccchheeeeeEEEEEEEe Confidence 11111222234455677889999999988421 111 11355678899 Q ss_pred CCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchh- Q lcl|NC_019423. 344 GSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRR- 422 (756) Q Consensus 344 g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~- 422 (756) |.++|+.+++||+|++|||||+++++++.++.++|+||+|+|+||++|+++|+++++++.+ ++++++|++++.+.. 
T Consensus 316 g~~~L~~~~~p~~~~~fP~vP~~g~r~~~~g~~~G~vr~~kd~Qr~~N~~~S~~~~~l~~~---~~~~~~gav~~~d~~~ 392 (772) T protein:vir:10 316 GPHCLHDGPTPYTHRHFPYVPFFGFREDATGIPYGYVRGMKYAQDSLNSGVSKLRWGMSVA---RVERTKGAVAMTDAQF 392 (772) T ss_pred cceeeccCCCCCCCCccceEEEeeeEeccCCcccchhhhhhhHHHHHHHHHHHHHHHHhcc---cccccCCCccchhHHH Confidence 9999999999999999999999999998888888999999999999999999999988554 588999999987642 Q ss_pred hhhcccc---ccccccc-cccccccccccCCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHH Q lcl|NC_019423. 423 RYDDGQD---YEYNPMQ-GNPSQSIMEHKFPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAA 498 (756) Q Consensus 423 ~~~~~~~---~~~~~~~-~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa 498 (756) .+..... ..+++.. ..++..+.+.+.++++++.+++++...+.++++|||+++++|..+++.|++| |+++++++ T Consensus 393 ~e~~arp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~vsGv~~~~lG~~~na~SGvA--i~~rq~qg 470 (772) T protein:vir:10 393 RRQIARPDADIVLDENHMAKPGARFDVKRDYTLTDQHFQMLQDNRATIERVSNITAGFQGRKGTATSGIQ--EQQQIEQS 470 (772) T ss_pred HHhccCCCCeEEeCCccccCCCCCccccCCccccHHHHHHHHHHHHHHHHHhCCCHHHcCCCcchhhHHH--HHHHHHHH Confidence 2222211 1222111 1234567788889999999999999999999999999999999888866666 88899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecCc------eeecC--------------HhHhcCcceEEEe Q lcl|NC_019423. 
499 SKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNEQ------YVEIK--------------REDLKGNFDIEVD 558 (756) Q Consensus 499 ~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~------~v~i~--------------~d~~~~~~Dv~V~ 558 (756) ++.+..++|||+.+++++|+++|+||++||+++|++||+|++ |+.|| +|..+++|||+|+ T Consensus 471 ~~~l~~~~Dnl~~~~~~~g~~lL~li~~~y~~er~~RI~~~d~~~~~~~v~in~~~~d~~tg~~~~~NDi~~g~yDv~i~ 550 (772) T protein:vir:10 471 NQSIGRIMDNFRAGRTLVGELLLAMIVEDIGQERTEVVIEGDAVTADRVVVLNEPQRDPQTGAAYLSNDLLRTRIKVALE 550 (772) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCCCCceEEeccceecccccccceeccceeeeEEEEee Confidence 999999999999999999999999999999999999999753 45554 4667889999999 Q ss_pred cccccHHHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCCh---hHHHHhhhccCCCChhhhhHHH-------HHHH Q lcl|NC_019423. 559 INTAEIDNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMP---DLAHELRTWQPQPDPMEEQLKQ-------LAIQ 628 (756) Q Consensus 559 ~g~a~~~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~---~~~~~l~~~~~q~~p~~~~~~q-------~~~~ 628 (756) +++++.+. +.+++..|++++++ ++|.+...++..+++++++| ++.+.+++..++++|.+.++++ ++++ T Consensus 551 ~~p~~~t~-r~~~~~~m~ql~~~-~~P~~~~~~~~~~le~~D~p~~~ei~~~ir~~~~~~~peq~~~~~~q~~qq~~~~~ 628 (772) T protein:vir:10 551 DVPSTNSY-RGQQLNAMSEAVKS-MPPQYQAAVLPFLVSLMDVPFKRDVVEAIRAVDQQQTPEQIQQQIDQAVQDALAKA 628 (772) T ss_pred ccccchHH-HHHHHHHHHHHHhc-cChhHHHHHHHHHHhhcCCCChHHHHHHHHHHhccCChHHHHHHHHHHHHHHHHHH Confidence 99887654 44556666666644 67888777777777777776 6777777776666654322221 2223 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccC------ Q lcl|NC_019423. 629 KAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARDMEKQKAQSQGNQNLQITKALTTPTKEGE------ 702 (756) Q Consensus 629 ~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~~~~~~q~~~~~~~~~~~a~~~~~~~~~------ 702 (756) +++++..++++++++..|+++++++++....+++. .++++.++......+.+ ...++++....... 
T Consensus 629 ~~el~~~q~~a~~~~~~A~a~~~~aqa~~~~~~a~--~~a~~aa~~~~q~~q~a------~~ad~~l~~~g~~~~~~~~~ 700 (772) T protein:vir:10 629 GNDIKLRELEIKERKADSEISGLNAKAVQIGVQAA--FSAMQAGAQIAQMPMIA------PIADAVMQSAGYQRPNPAGD 700 (772) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHhhhhhhHHhhhhhh------HHHHHHHHhccccccccccc Confidence 34444455666666666666666666555444322 12222111111111111 11111111111100 Q ss_pred ---Cchhhh-cc-----CCC------CCC--CcccCchhcCCCCCCCC--ccccccccccCCCCCCCCCCCcC Q lcl|NC_019423. 703 ---TTPNIS-AA-----VGY------NTL--TNGNSPQERDLAAQQDP--AYSLGSQYYDPSQDPASALGMNL 756 (756) Q Consensus 703 ---~~~~~~-~a-----~~~------~~~--~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~ 756 (756) ++.... ++ +|. ... .+...|+++.|+.+.+| +++...|+++|+.. -++-|+- T Consensus 701 ~~~~p~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~p~~~q~~~~~~~g~~~~~~~--~~~~~~~ 771 (772) T protein:vir:10 701 DPNYPIADQTAAMNIRSPYIQGQGPAAEAEAESVSVRRNTSPTYPPVPEEAPTGLRGIETPSTA--DNLSVRG 771 (772) T ss_pred CCCCCCCCCccCCCCCccCCCCCCCCCccccCCCCCccCCCCCCCCCCcccCCCCCCCCCCCCC--ccceecC Confidence 000000 00 011 000 11234566666666666 77778888888432 1222222 No 6 >protein:vir:108295 Length: 711 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552284;genbank:gi:160700609;genbank:GeneID:5758811 Probab=100.00 E-value=1.4e-95 Score=540.68 Aligned_cols=599 Identities=13% Similarity=0.088 Sum_probs=413.8 Q ss_pred CCcccCCCCCCC-ccccccccC---CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCC----CCCCCc Q lcl|NC_019423. 1 MEHQDTFKPLPD-PAQSEKLTD---WKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAKPPK----IKGRSQ 72 (756) Q Consensus 1 ~~~~~~~~~~~~-~~~~~~~~~---~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~----~~grS~ 72 (756) |.|-..--|.+. +.+|.|+.. 
-.++.++..+..-|+.+..++....+...++.+||+|+.+.+..+ ..|++- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~ 80 (711) T protein:vir:10 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRPC 80 (711) T ss_pred CCcccccccccchhHHHHHhcccCcchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHhCCCCCCHHHHHHHHhcCCCc Confidence 766655555443 348888874 455667778888888888888877777789999999987754322 579999 Q ss_pred ccCHHHHHHHHHHHHHHHHhhcCCCCEEEEecCC----------------------cchHHHHHHHHHHHHHHHhhhcCC Q lcl|NC_019423. 73 VQPRLVRRQAEWRYAPLSEPFLSSSKLFKLTPVT----------------------FEDELAARQNELVLNYQFRTQLNK 130 (756) Q Consensus 73 ~v~~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~----------------------~~D~~~A~q~t~~~n~~~~~~~~~ 130 (756) ++-+.|+..|+|+++.- -.+-.-+.|.|+. .+|++.|++.|.+++|+.. .++. T Consensus 81 ~~~N~i~~~v~~v~g~~----~~nr~~~~v~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~-~~~~ 155 (711) T protein:vir:10 81 LVNNVLPTFVDQVLGDQ----RQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEY-NCDA 155 (711) T ss_pred EEEcchHHHHHHHhhhH----hhCCcceEEecccccchhhhhhhhccccccccCCChhHHHHHHHHHHHHHHHHH-hcCh Confidence 99999999999998777 4455568888875 7899999999999999654 5566 Q ss_pred cchHHHHHHHHhhcCceEEEEeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHH Q lcl|NC_019423. 131 VKLVDDYVHSIVDDGTGIARIGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYF 210 (756) Q Consensus 131 ~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~ 210 (756) -....++++++|++|.|+++++|+++.+. T Consensus 156 ~~~~s~af~d~~~~G~G~~ev~~d~~~~d--------------------------------------------------- 184 (711) T protein:vir:10 156 ETEYDIAFQGAVESGMGYLRVRSDYLADD--------------------------------------------------- 184 (711) T ss_pred hHHHHHHHHHhhhcCcceEEEEecccCCC--------------------------------------------------- Confidence 66788999999999999999988754211 Q ss_pred HhcCCcceeccCceeEEEeeeeecCceeEEEe-chhheEeCCCCcC-ccccCceEEEEeecCHHHHHhhccchhhhcccC Q lcl|NC_019423. 
211 NETGEATYAIQTGVTEVEVEKALVNRPTVEML-NPNNVVIDPSCNG-DLDKALYAVISFETCKADLMKNKDRYHNLDKID 288 (756) Q Consensus 211 ~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie~V-~p~~~~~Dp~a~~-d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~ 288 (756) ...|+|+|.+| +|.+|||||.++. |++||+|+++++|||+++++.+++.....+ T Consensus 185 ---------------------~~~~e~~i~~v~~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~yp~~a~~~--- 240 (711) T protein:vir:10 185 ---------------------SFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEP--- 240 (711) T ss_pred ---------------------CCCCCeEEeeecChhheeeCccccccChhhhcceeeeecCCHHHHHHhCCchhhhh--- Confidence 11256888888 6999999998875 999999999999999999999865432110 Q ss_pred chhhhhhhchhhhccccccccccccccceEEEEEEEEEeecc------CC-------------------c---------- Q lcl|NC_019423. 289 WESSSPITDPDHESKTPSDFQFKDALRKKVVAYEYWGFYDIN------DD-------------------G---------- 333 (756) Q Consensus 289 ~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~E~w~k~d~~------~~-------------------g---------- 333 (756) .... .. .........++|+|.|||+|.... ++ | T Consensus 241 ~~~~---------~~---~~~~~~~~~~~vrv~E~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 308 (711) T protein:vir:10 241 VYED---------SV---ADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVK 308 (711) T ss_pred hhcc---------cc---cccCcccCcceeeEEEEEeeeeeeeEEEeecCCceeccCcchhHHHHHHhcCchhhhhhhhc Confidence 0000 00 000011234689999999873210 00 1 Q ss_pred eeEEEEEEEECCEEEEecccccCCCccceEEeeeeee--cCcccCCchHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEe Q lcl|NC_019423. 334 SLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPR--KRELFGEADAELLGDNQAILGATMRGMIDLLGRSANGQRGY 411 (756) Q Consensus 334 ~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~--~~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~ 411 (756) ..+.++++|.|+++| .+++||+|++|||||+++++. 
+++++++|++++|+|+||++|+++|+++|++++++++++++ T Consensus 309 ~~~v~~~~~~G~~~L-~~~~p~~~~~~P~vp~~g~r~~~d~~~~~~G~vr~~~d~Qr~~N~~~s~~~~~l~~~~~~~~~~ 387 (711) T protein:vir:10 309 TFKTYWRKITGANVL-EGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIG 387 (711) T ss_pred eeeEEEEEEecceee-cCCCCCCCCcccEEEEeeeeeccccccccchhhhhhhhhHHHHHHHHHHHHHHHHhcCCCceee Confidence 123455678999999 688999999999999998865 78888999999999999999999999999999999999999 Q ss_pred eccccCccchhhhhcc----ccccccccccccccccccccCCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchh Q lcl|NC_019423. 412 PKGMLDTLNRRRYDDG----QDYEYNPMQGNPSQSIMEHKFPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDV 487 (756) Q Consensus 412 ~~gav~~~~~~~~~~~----~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~t 487 (756) ++|++++.++.+.+.+ ....+++ +..++..+.+++.|++|+++++|+++..+.++++|||+++++|..+++. + T Consensus 388 ~~gai~~~~~~~~e~~~~~~~vi~~~~-~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGi~~~~~G~~~n~~--S 464 (711) T protein:vir:10 388 SEGNVEGREDEWEQANTKNFSLLTYIP-QYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNET--S 464 (711) T ss_pred cCcccCChHHHHHhccccCCCeeEecc-cccCcCCccccCCCCCCHHHHHHHHHHHHHHHHHhCCChHHcCCCccch--H Confidence 9999998776544332 2222222 2234457889999999999999999999999999999999999988865 4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecC----ceeecCH--------------hHh Q lcl|NC_019423. 488 AAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNE----QYVEIKR--------------EDL 549 (756) Q Consensus 488 A~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~----~~v~i~~--------------d~~ 549 (756) +.+|++++++|.+++..++|||+++++++|+++|+||++||+++|+|||+|+ +||.||. |.. 
T Consensus 465 g~ai~~~q~qg~~~l~~~~dn~~~~~~~~g~~ll~li~~~~~~er~~rI~ged~~~~~v~ln~~~~~~~~G~~~~~nDi~ 544 (711) T protein:vir:10 465 GRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLN 544 (711) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEEecCCCCcceEEecccccccccccceeeeccc Confidence 5559999999999999999999999999999999999999999999999987 5888763 556 Q ss_pred cCcceEEEecccccHHHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCCh---hHHHHhhhccCCCChhh---hh-- Q lcl|NC_019423. 550 KGNFDIEVDINTAEIDNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMP---DLAHELRTWQPQPDPME---EQ-- 621 (756) Q Consensus 550 ~~~~Dv~V~~g~a~~~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~---~~~~~l~~~~~q~~p~~---~~-- 621 (756) +|+|||+|++++++.+.+. +.+..|++++ +.. |.....++..+++++++| ++.+.++...+++++.. .+ T Consensus 545 ~g~~Dv~i~~~p~~~s~r~-~~~~~l~ql~-~~~-p~~~~~~~~~il~~~d~p~~~el~e~lr~~~~~~~~~~~~~~~~q 621 (711) T protein:vir:10 545 VQKYDVVVTTGPAFATQRI-EAAEAMIQFA-QAV-PSAAAVMADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIE 621 (711) T ss_pred eeeeEEEEeeccCchhHHH-HHHHHHHHHH-hhc-chhhhHHHHHHHHhcCCCCHHHHHHHHHhhcCcccCcchhhhHHH Confidence 7899999999988765433 3334444433 222 333334444555555555 56667776665543211 11 Q ss_pred --HHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHH--HHHHH-HHHHHHHHHH Q lcl|NC_019423. 622 --LKQLAIQKAQLENEELQSK-------IALNNAKAKEAASSGDLKDLDYLEQESG-TKHAR--DMEKQ-KAQSQGNQNL 688 (756) Q Consensus 622 --~~q~~~~~aq~e~~~~qa~-------a~~~~a~a~~~~aq~~~~~~~~~~q~~~-~k~~~--~~~~~-~~q~~~~~~~ 688 (756) +++.+++.++++.+..+++ ++..+|++++++++++..+++....... ..++. .++.. .+.+..+.++ T Consensus 622 q~~~e~qq~~~~~q~~~~~~q~~~~qa~ae~~~Aqae~~qa~~e~~~~q~q~~~~~~~aq~~~~~~qq~~~~l~~~qael 701 (711) T protein:vir:10 622 EDMPEQTEPTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQLETEEAQKQLAMIEDMAQGGDVVYQQVRELVAQALAEI 701 (711) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1111111112222222222 3333333333333333332221111100 00011 11111 1111223333 Q ss_pred HHHHHHHHHh Q lcl|NC_019423. 
689 QITKALTTPT 698 (756) Q Consensus 689 ~~~~a~~~~~ 698 (756) .+.++.+..+ T Consensus 702 q~~q~~~~q~ 711 (711) T protein:vir:10 702 TASQANVTEQ 711 (711) T ss_pred HHHHHHhhcC Confidence 3334333333 No 7 >protein:vir:817 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050550;genbank:gi:9633447;genbank:GeneID:1262279 Probab=100.00 E-value=8.8e-93 Score=525.25 Aligned_cols=618 Identities=12% Similarity=0.025 Sum_probs=396.8 Q ss_pred CCcccCCCCCCCccccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCC----CCCCCcccCH Q lcl|NC_019423. 1 MEHQDTFKPLPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAKPPK----IKGRSQVQPR 76 (756) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~----~~grS~~v~~ 76 (756) |++-...-+ ++--+++.++ +...+...+..+...+...-....++.+||+|+-+.+... ..||+-++-+ T Consensus 1 ~~~~~~~~~--~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N 73 (714) T protein:vir:81 1 MKNETNTMA--TKNDNGATPR-----FSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHN 73 (714) T ss_pred CCccccccc--CCCCcchhHH-----HHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEec Confidence 554444322 2221222222 1222222233332222222344568999999987754333 4799999999 Q ss_pred HHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchH--HHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeee Q lcl|NC_019423. 77 LVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDE--LAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWE 154 (756) Q Consensus 77 ~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~--~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~ 154 (756) .|+.+|+|++..- -.+-.=+.|.|++++|+ +.|+..|.+++|+.. 
.++.-....++++++|++|.|++.++|+ T Consensus 74 ~i~~~v~~v~g~~----~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~-~~~~~~~~s~af~~~~~~G~G~~~~~~~ 148 (714) T protein:vir:81 74 LIAPTVDGVLGME----AKTRTDLVVMSDEPDDETEKLAEAINAEFADACR-LGNMNKARSDAYAEQIKAGLSWVEVRRN 148 (714) T ss_pred cHHHHHHHHHhHH----HhCCcceEEecCCCCchhHHHHHHHHHHHHHHHH-hhchhHHHHHHHHHhhhcCcceEEeccc Confidence 9999999998877 45556699999987665 689999999999977 4444445778999999999998777654 Q ss_pred eeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeec Q lcl|NC_019423. 155 RKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALV 234 (756) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~ 234 (756) ++ ... T Consensus 149 ~d---------------------------------------------------------------------------~~~ 153 (714) T protein:vir:81 149 SD---------------------------------------------------------------------------PFG 153 (714) T ss_pred cC---------------------------------------------------------------------------CCC Confidence 21 012 Q ss_pred CceeEEEechhheEeCCCCcC-ccccCceEEEEeecCHHHHHhhccchhhhcccCc---hhh----------hhhh--ch Q lcl|NC_019423. 235 NRPTVEMLNPNNVVIDPSCNG-DLDKALYAVISFETCKADLMKNKDRYHNLDKIDW---ESS----------SPIT--DP 298 (756) Q Consensus 235 g~~~ie~V~p~~~~~Dp~a~~-d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~---~~~----------~~~~--~~ 298 (756) ++|+|++|||++|||||++++ |++||+|++|++|+|+++++.+++....+..... ... .... .. T Consensus 154 ~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~ 233 (714) T protein:vir:81 154 PEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWE 233 (714) T ss_pred CCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchh Confidence 458899999999999998765 9999999999999999999999765332211111 000 0000 00 Q ss_pred hhhccccccccccccccceEEEEEEEEEee---------------ccC------------------CceeEEEEEEEECC Q lcl|NC_019423. 
299 DHESKTPSDFQFKDALRKKVVAYEYWGFYD---------------IND------------------DGSLEPIVATWIGS 345 (756) Q Consensus 299 ~~~~~~~~~~~~~d~s~~~V~v~E~w~k~d---------------~~~------------------~g~~~~~~~~~~g~ 345 (756) ...........|.+..+++|+|+|||+|.. +++ ......++++|+|+ T Consensus 234 ~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~ 313 (714) T protein:vir:81 234 EYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGP 313 (714) T ss_pred hhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecC Confidence 111112233445566778999999998732 111 12245677889999 Q ss_pred EEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccch-hhh Q lcl|NC_019423. 346 TLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNR-RRY 424 (756) Q Consensus 346 ~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~-~~~ 424 (756) ++|+.+++||+|++|||||+++++.+.....+|++|.++|+||++|++.+++++++ +++ ++++.+|++++.+. ..+ T Consensus 314 ~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~-~~~~~~~a~~~~d~~~~e 390 (714) T protein:vir:81 314 HFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QAK-RVIMDEDATQLSDNDLME 390 (714) T ss_pred cccccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh--cCC-ceeeecCcccccHHHHHH Confidence 99999999999999999999999987777778999999999999999999988865 455 56688888877643 222 Q ss_pred hc---ccccccccc---ccccccccccccCCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHH Q lcl|NC_019423. 425 DD---GQDYEYNPM---QGNPSQSIMEHKFPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAA 498 (756) Q Consensus 425 ~~---~~~~~~~~~---~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa 498 (756) .. +....+++. 
+..+...|++.+.+++|++.++++++..+.++++|||+++++|..+++.|++| +++++++| T Consensus 391 ~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvA--i~~rq~qg 468 (714) T protein:vir:81 391 QIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVA--ISNLVEQG 468 (714) T ss_pred hccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHH--HHHHHHHH Confidence 22 222223222 11223457888889999999999999999999999999999999998877766 88999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecC-------ceeecC---------HhHhcCcceEEEecccc Q lcl|NC_019423. 499 SKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNE-------QYVEIK---------REDLKGNFDIEVDINTA 562 (756) Q Consensus 499 ~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~-------~~v~i~---------~d~~~~~~Dv~V~~g~a 562 (756) .+.+..++|||+.+++.+|+++|+||++||+++|++||+|+ .++.+| ||..+++|||+|+++++ T Consensus 469 ~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~ 548 (714) T protein:vir:81 469 ATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQ 548 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccC Confidence 99999999999999999999999999999999999999975 277776 45567899999999998 Q ss_pred cHHHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCCh---hHHHHhhhccCCCCh---hh-------hhHHHHHHHH Q lcl|NC_019423. 563 EIDNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMP---DLAHELRTWQPQPDP---ME-------EQLKQLAIQK 629 (756) Q Consensus 563 ~~~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~---~~~~~l~~~~~q~~p---~~-------~~~~q~~~~~ 629 (756) +.+++. +.+..|++++. .++|.....++..+++++++| ++.+.|+...+++++ +. 
.+++++++++ T Consensus 549 ~~t~r~-~~~~~l~~l~~-~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q 626 (714) T protein:vir:81 549 TPAFKA-QLAQRMSEVIQ-GLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQ 626 (714) T ss_pred chHHHH-HHHHHHHHHHh-hcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHH Confidence 765433 33334444332 234444333444445555554 677777666554332 11 1111222233 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHhhccCCc Q lcl|NC_019423. 630 AQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARDMEKQKAQSQGN-----QNLQITKALTTPTKEGETT 704 (756) Q Consensus 630 aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~~~~~~q~~~~-----~~~~~~~a~~~~~~~~~~~ 704 (756) ++++.++++++.++.+|++.++++++.....++....+....+...+ ...++++. .+..+.+..+..+.-..+. T Consensus 627 ~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~-~~~~a~~a~~~~~~~~~~~~~~~~~~q~~q~~ 705 (714) T protein:vir:81 627 AELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVD-ALNQAHTAEIITGVQNMEQEQDVLQQQMLYTL 705 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhHhhhhhhhHHHHHHHHHHH Confidence 33444455556666666555555554443333222211111110000 00000000 0000111111111111111 Q ss_pred hhhhccCCC Q lcl|NC_019423. 705 PNISAAVGY 713 (756) Q Consensus 705 ~~~~~a~~~ 713 (756) +-+..++.. T Consensus 706 ~~~~~~~~~ 714 (714) T protein:vir:81 706 QQRMNEMSL 714 (714) T ss_pred HHHHHhcCC Confidence 111122211 No 8 >protein:vir:10117 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859247;genbank:gi:32171003;genbank:GeneID:2653343 Probab=100.00 E-value=8.8e-93 Score=525.25 Aligned_cols=618 Identities=12% Similarity=0.025 Sum_probs=396.8 Q ss_pred CCcccCCCCCCCccccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCC----CCCCCcccCH Q lcl|NC_019423. 
1 MEHQDTFKPLPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAKPPK----IKGRSQVQPR 76 (756) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~----~~grS~~v~~ 76 (756) |++-...-+ ++--+++.++ +...+...+..+...+...-....++.+||+|+-+.+... ..||+-++-+ T Consensus 1 ~~~~~~~~~--~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N 73 (714) T protein:vir:10 1 MKNETNTMA--TKNDNGATPR-----FSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHN 73 (714) T ss_pred CCccccccc--CCCCcchhHH-----HHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEec Confidence 554444322 2221222222 1222222233332222222344568999999987754333 4799999999 Q ss_pred HHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchH--HHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeee Q lcl|NC_019423. 77 LVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDE--LAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWE 154 (756) Q Consensus 77 ~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~--~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~ 154 (756) .|+.+|+|++..- -.+-.=+.|.|++++|+ +.|+..|.+++|+.. .++.-....++++++|++|.|++.++|+ T Consensus 74 ~i~~~v~~v~g~~----~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~-~~~~~~~~s~af~~~~~~G~G~~~~~~~ 148 (714) T protein:vir:10 74 LIAPTVDGVLGME----AKTRTDLVVMSDEPDDETEKLAEAINAEFADACR-LGNMNKARSDAYAEQIKAGLSWVEVRRN 148 (714) T ss_pred cHHHHHHHHHhHH----HhCCcceEEecCCCCchhHHHHHHHHHHHHHHHH-hhchhHHHHHHHHHhhhcCcceEEeccc Confidence 9999999998877 45556699999987665 689999999999977 4444445778999999999998777654 Q ss_pred eeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeec Q lcl|NC_019423. 155 RKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALV 234 (756) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~ 234 (756) ++ ... 
T Consensus 149 ~d---------------------------------------------------------------------------~~~ 153 (714) T protein:vir:10 149 SD---------------------------------------------------------------------------PFG 153 (714) T ss_pred cC---------------------------------------------------------------------------CCC Confidence 21 012 Q ss_pred CceeEEEechhheEeCCCCcC-ccccCceEEEEeecCHHHHHhhccchhhhcccCc---hhh----------hhhh--ch Q lcl|NC_019423. 235 NRPTVEMLNPNNVVIDPSCNG-DLDKALYAVISFETCKADLMKNKDRYHNLDKIDW---ESS----------SPIT--DP 298 (756) Q Consensus 235 g~~~ie~V~p~~~~~Dp~a~~-d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~---~~~----------~~~~--~~ 298 (756) ++|+|++|||++|||||++++ |++||+|++|++|+|+++++.+++....+..... ... .... .. T Consensus 154 ~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~ 233 (714) T protein:vir:10 154 PEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWE 233 (714) T ss_pred CCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchh Confidence 458899999999999998765 9999999999999999999999765332211111 000 0000 00 Q ss_pred hhhccccccccccccccceEEEEEEEEEee---------------ccC------------------CceeEEEEEEEECC Q lcl|NC_019423. 299 DHESKTPSDFQFKDALRKKVVAYEYWGFYD---------------IND------------------DGSLEPIVATWIGS 345 (756) Q Consensus 299 ~~~~~~~~~~~~~d~s~~~V~v~E~w~k~d---------------~~~------------------~g~~~~~~~~~~g~ 345 (756) ...........|.+..+++|+|+|||+|.. +++ ......++++|+|+ T Consensus 234 ~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~ 313 (714) T protein:vir:10 234 EYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGP 313 (714) T ss_pred hhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecC Confidence 111112233445566778999999998732 111 12245677889999 Q ss_pred EEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccch-hhh Q lcl|NC_019423. 
346 TLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNR-RRY 424 (756) Q Consensus 346 ~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~-~~~ 424 (756) ++|+.+++||+|++|||||+++++.+.....+|++|.++|+||++|++.+++++++ +++ ++++.+|++++.+. ..+ T Consensus 314 ~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~-~~~~~~~a~~~~d~~~~e 390 (714) T protein:vir:10 314 HFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QAK-RVIMDEDATQLSDNDLME 390 (714) T ss_pred cccccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh--cCC-ceeeecCcccccHHHHHH Confidence 99999999999999999999999987777778999999999999999999988865 455 56688888877643 222 Q ss_pred hc---ccccccccc---ccccccccccccCCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHH Q lcl|NC_019423. 425 DD---GQDYEYNPM---QGNPSQSIMEHKFPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAA 498 (756) Q Consensus 425 ~~---~~~~~~~~~---~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa 498 (756) .. +....+++. +..+...|++.+.+++|++.++++++..+.++++|||+++++|..+++.|++| +++++++| T Consensus 391 ~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvA--i~~rq~qg 468 (714) T protein:vir:10 391 QIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVA--ISNLVEQG 468 (714) T ss_pred hccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHH--HHHHHHHH Confidence 22 222223222 11223457888889999999999999999999999999999999998877766 88999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecC-------ceeecC---------HhHhcCcceEEEecccc Q lcl|NC_019423. 
499 SKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNE-------QYVEIK---------REDLKGNFDIEVDINTA 562 (756) Q Consensus 499 ~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~-------~~v~i~---------~d~~~~~~Dv~V~~g~a 562 (756) .+.+..++|||+.+++.+|+++|+||++||+++|++||+|+ .++.+| ||..+++|||+|+++++ T Consensus 469 ~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~ 548 (714) T protein:vir:10 469 ATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQ 548 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccC Confidence 99999999999999999999999999999999999999975 277776 45567899999999998 Q ss_pred cHHHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCCh---hHHHHhhhccCCCCh---hh-------hhHHHHHHHH Q lcl|NC_019423. 563 EIDNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMP---DLAHELRTWQPQPDP---ME-------EQLKQLAIQK 629 (756) Q Consensus 563 ~~~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~---~~~~~l~~~~~q~~p---~~-------~~~~q~~~~~ 629 (756) +.+++. +.+..|++++. .++|.....++..+++++++| ++.+.|+...+++++ +. .+++++++++ T Consensus 549 ~~t~r~-~~~~~l~~l~~-~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q 626 (714) T protein:vir:10 549 TPAFKA-QLAQRMSEVIQ-GLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQ 626 (714) T ss_pred chHHHH-HHHHHHHHHHh-hcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHH Confidence 765433 33334444332 234444333444445555554 677777666554332 11 1111222233 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHhhccCCc Q lcl|NC_019423. 630 AQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARDMEKQKAQSQGN-----QNLQITKALTTPTKEGETT 704 (756) Q Consensus 630 aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~~~~~~q~~~~-----~~~~~~~a~~~~~~~~~~~ 704 (756) ++++.++++++.++.+|++.++++++.....++....+....+...+ ...++++. .+..+.+..+..+.-..+. 
T Consensus 627 ~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~-~~~~a~~a~~~~~~~~~~~~~~~~~~q~~q~~ 705 (714) T protein:vir:10 627 AELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVD-ALNQAHTAEIITGVQNMEQEQDVLQQQMLYTL 705 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhHhhhhhhhHHHHHHHHHHH Confidence 33444455556666666555555554443333222211111110000 00000000 0000111111111111111 Q ss_pred hhhhccCCC Q lcl|NC_019423. 705 PNISAAVGY 713 (756) Q Consensus 705 ~~~~~a~~~ 713 (756) +-+..++.. T Consensus 706 ~~~~~~~~~ 714 (714) T protein:vir:10 706 QQRMNEMSL 714 (714) T ss_pred HHHHHhcCC Confidence 111122211 No 9 >protein:vir:9950 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859080;genbank:gi:32170835;genbank:GeneID:2653184 Probab=100.00 E-value=8.8e-93 Score=525.25 Aligned_cols=618 Identities=12% Similarity=0.025 Sum_probs=396.8 Q ss_pred CCcccCCCCCCCccccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCC----CCCCCcccCH Q lcl|NC_019423. 1 MEHQDTFKPLPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAKPPK----IKGRSQVQPR 76 (756) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~----~~grS~~v~~ 76 (756) |++-...-+ ++--+++.++ +...+...+..+...+...-....++.+||+|+-+.+... ..||+-++-+ T Consensus 1 ~~~~~~~~~--~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N 73 (714) T protein:vir:99 1 MKNETNTMA--TKNDNGATPR-----FSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHN 73 (714) T ss_pred CCccccccc--CCCCcchhHH-----HHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEec Confidence 554444322 2221222222 1222222233332222222344568999999987754333 4799999999 Q ss_pred HHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchH--HHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeee Q lcl|NC_019423. 
77 LVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDE--LAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWE 154 (756) Q Consensus 77 ~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~--~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~ 154 (756) .|+.+|+|++..- -.+-.=+.|.|++++|+ +.|+..|.+++|+.. .++.-....++++++|++|.|++.++|+ T Consensus 74 ~i~~~v~~v~g~~----~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~-~~~~~~~~s~af~~~~~~G~G~~~~~~~ 148 (714) T protein:vir:99 74 LIAPTVDGVLGME----AKTRTDLVVMSDEPDDETEKLAEAINAEFADACR-LGNMNKARSDAYAEQIKAGLSWVEVRRN 148 (714) T ss_pred cHHHHHHHHHhHH----HhCCcceEEecCCCCchhHHHHHHHHHHHHHHHH-hhchhHHHHHHHHHhhhcCcceEEeccc Confidence 9999999998877 45556699999987665 689999999999977 4444445778999999999998777654 Q ss_pred eeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeec Q lcl|NC_019423. 155 RKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALV 234 (756) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~ 234 (756) ++ ... T Consensus 149 ~d---------------------------------------------------------------------------~~~ 153 (714) T protein:vir:99 149 SD---------------------------------------------------------------------------PFG 153 (714) T ss_pred cC---------------------------------------------------------------------------CCC Confidence 21 012 Q ss_pred CceeEEEechhheEeCCCCcC-ccccCceEEEEeecCHHHHHhhccchhhhcccCc---hhh----------hhhh--ch Q lcl|NC_019423. 235 NRPTVEMLNPNNVVIDPSCNG-DLDKALYAVISFETCKADLMKNKDRYHNLDKIDW---ESS----------SPIT--DP 298 (756) Q Consensus 235 g~~~ie~V~p~~~~~Dp~a~~-d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~---~~~----------~~~~--~~ 298 (756) ++|+|++|||++|||||++++ |++||+|++|++|+|+++++.+++....+..... ... .... .. 
T Consensus 154 ~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~ 233 (714) T protein:vir:99 154 PEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWE 233 (714) T ss_pred CCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchh Confidence 458899999999999998765 9999999999999999999999765332211111 000 0000 00 Q ss_pred hhhccccccccccccccceEEEEEEEEEee---------------ccC------------------CceeEEEEEEEECC Q lcl|NC_019423. 299 DHESKTPSDFQFKDALRKKVVAYEYWGFYD---------------IND------------------DGSLEPIVATWIGS 345 (756) Q Consensus 299 ~~~~~~~~~~~~~d~s~~~V~v~E~w~k~d---------------~~~------------------~g~~~~~~~~~~g~ 345 (756) ...........|.+..+++|+|+|||+|.. +++ ......++++|+|+ T Consensus 234 ~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~ 313 (714) T protein:vir:99 234 EYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGP 313 (714) T ss_pred hhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecC Confidence 111112233445566778999999998732 111 12245677889999 Q ss_pred EEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccch-hhh Q lcl|NC_019423. 346 TLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNR-RRY 424 (756) Q Consensus 346 ~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~-~~~ 424 (756) ++|+.+++||+|++|||||+++++.+.....+|++|.++|+||++|++.+++++++ +++ ++++.+|++++.+. 
..+ T Consensus 314 ~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~-~~~~~~~a~~~~d~~~~e 390 (714) T protein:vir:99 314 HFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QAK-RVIMDEDATQLSDNDLME 390 (714) T ss_pred cccccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh--cCC-ceeeecCcccccHHHHHH Confidence 99999999999999999999999987777778999999999999999999988865 455 56688888877643 222 Q ss_pred hc---ccccccccc---ccccccccccccCCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHH Q lcl|NC_019423. 425 DD---GQDYEYNPM---QGNPSQSIMEHKFPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAA 498 (756) Q Consensus 425 ~~---~~~~~~~~~---~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa 498 (756) .. +....+++. +..+...|++.+.+++|++.++++++..+.++++|||+++++|..+++.|++| +++++++| T Consensus 391 ~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvA--i~~rq~qg 468 (714) T protein:vir:99 391 QIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVA--ISNLVEQG 468 (714) T ss_pred hccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHH--HHHHHHHH Confidence 22 222223222 11223457888889999999999999999999999999999999998877766 88999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecC-------ceeecC---------HhHhcCcceEEEecccc Q lcl|NC_019423. 
499 SKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNE-------QYVEIK---------REDLKGNFDIEVDINTA 562 (756) Q Consensus 499 ~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~-------~~v~i~---------~d~~~~~~Dv~V~~g~a 562 (756) .+.+..++|||+.+++.+|+++|+||++||+++|++||+|+ .++.+| ||..+++|||+|+++++ T Consensus 469 ~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~ 548 (714) T protein:vir:99 469 ATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQ 548 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccC Confidence 99999999999999999999999999999999999999975 277776 45567899999999998 Q ss_pred cHHHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCCh---hHHHHhhhccCCCCh---hh-------hhHHHHHHHH Q lcl|NC_019423. 563 EIDNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMP---DLAHELRTWQPQPDP---ME-------EQLKQLAIQK 629 (756) Q Consensus 563 ~~~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~---~~~~~l~~~~~q~~p---~~-------~~~~q~~~~~ 629 (756) +.+++. +.+..|++++. .++|.....++..+++++++| ++.+.|+...+++++ +. .+++++++++ T Consensus 549 ~~t~r~-~~~~~l~~l~~-~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q 626 (714) T protein:vir:99 549 TPAFKA-QLAQRMSEVIQ-GLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQ 626 (714) T ss_pred chHHHH-HHHHHHHHHHh-hcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHH Confidence 765433 33334444332 234444333444445555554 677777666554332 11 1111222233 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHhhccCCc Q lcl|NC_019423. 630 AQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARDMEKQKAQSQGN-----QNLQITKALTTPTKEGETT 704 (756) Q Consensus 630 aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~~~~~~q~~~~-----~~~~~~~a~~~~~~~~~~~ 704 (756) ++++.++++++.++.+|++.++++++.....++....+....+...+ ...++++. .+..+.+..+..+.-..+. 
T Consensus 627 ~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~-~~~~a~~a~~~~~~~~~~~~~~~~~~q~~q~~ 705 (714) T protein:vir:99 627 AELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVD-ALNQAHTAEIITGVQNMEQEQDVLQQQMLYTL 705 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhHhhhhhhhHHHHHHHHHHH Confidence 33444455556666666555555554443333222211111110000 00000000 0000111111111111111 Q ss_pred hhhhccCCC Q lcl|NC_019423. 705 PNISAAVGY 713 (756) Q Consensus 705 ~~~~~a~~~ 713 (756) +-+..++.. T Consensus 706 ~~~~~~~~~ 714 (714) T protein:vir:99 706 QQRMNEMSL 714 (714) T ss_pred HHHHHhcCC Confidence 111122211 No 10 >protein:vir:2764 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612881;genbank:gi:20065798;genbank:GeneID:935623 Probab=100.00 E-value=8.8e-93 Score=525.25 Aligned_cols=618 Identities=12% Similarity=0.025 Sum_probs=396.8 Q ss_pred CCcccCCCCCCCccccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCC----CCCCCcccCH Q lcl|NC_019423. 1 MEHQDTFKPLPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAKPPK----IKGRSQVQPR 76 (756) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~----~~grS~~v~~ 76 (756) |++-...-+ ++--+++.++ +...+...+..+...+...-....++.+||+|+-+.+... ..||+-++-+ T Consensus 1 ~~~~~~~~~--~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N 73 (714) T protein:vir:27 1 MKNETNTMA--TKNDNGATPR-----FSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHN 73 (714) T ss_pred CCccccccc--CCCCcchhHH-----HHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEec Confidence 554444322 2221222222 1222222233332222222344568999999987754333 4799999999 Q ss_pred HHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchH--HHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeee Q lcl|NC_019423. 
77 LVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDE--LAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWE 154 (756) Q Consensus 77 ~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~--~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~ 154 (756) .|+.+|+|++..- -.+-.=+.|.|++++|+ +.|+..|.+++|+.. .++.-....++++++|++|.|++.++|+ T Consensus 74 ~i~~~v~~v~g~~----~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~-~~~~~~~~s~af~~~~~~G~G~~~~~~~ 148 (714) T protein:vir:27 74 LIAPTVDGVLGME----AKTRTDLVVMSDEPDDETEKLAEAINAEFADACR-LGNMNKARSDAYAEQIKAGLSWVEVRRN 148 (714) T ss_pred cHHHHHHHHHhHH----HhCCcceEEecCCCCchhHHHHHHHHHHHHHHHH-hhchhHHHHHHHHHhhhcCcceEEeccc Confidence 9999999998877 45556699999987665 689999999999977 4444445778999999999998777654 Q ss_pred eeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeec Q lcl|NC_019423. 155 RKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALV 234 (756) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~ 234 (756) ++ ... T Consensus 149 ~d---------------------------------------------------------------------------~~~ 153 (714) T protein:vir:27 149 SD---------------------------------------------------------------------------PFG 153 (714) T ss_pred cC---------------------------------------------------------------------------CCC Confidence 21 012 Q ss_pred CceeEEEechhheEeCCCCcC-ccccCceEEEEeecCHHHHHhhccchhhhcccCc---hhh----------hhhh--ch Q lcl|NC_019423. 235 NRPTVEMLNPNNVVIDPSCNG-DLDKALYAVISFETCKADLMKNKDRYHNLDKIDW---ESS----------SPIT--DP 298 (756) Q Consensus 235 g~~~ie~V~p~~~~~Dp~a~~-d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~---~~~----------~~~~--~~ 298 (756) ++|+|++|||++|||||++++ |++||+|++|++|+|+++++.+++....+..... ... .... .. 
T Consensus 154 ~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~ 233 (714) T protein:vir:27 154 PEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWE 233 (714) T ss_pred CCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchh Confidence 458899999999999998765 9999999999999999999999765332211111 000 0000 00 Q ss_pred hhhccccccccccccccceEEEEEEEEEee---------------ccC------------------CceeEEEEEEEECC Q lcl|NC_019423. 299 DHESKTPSDFQFKDALRKKVVAYEYWGFYD---------------IND------------------DGSLEPIVATWIGS 345 (756) Q Consensus 299 ~~~~~~~~~~~~~d~s~~~V~v~E~w~k~d---------------~~~------------------~g~~~~~~~~~~g~ 345 (756) ...........|.+..+++|+|+|||+|.. +++ ......++++|+|+ T Consensus 234 ~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~ 313 (714) T protein:vir:27 234 EYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGP 313 (714) T ss_pred hhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecC Confidence 111112233445566778999999998732 111 12245677889999 Q ss_pred EEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccch-hhh Q lcl|NC_019423. 346 TLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNR-RRY 424 (756) Q Consensus 346 ~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~-~~~ 424 (756) ++|+.+++||+|++|||||+++++.+.....+|++|.++|+||++|++.+++++++ +++ ++++.+|++++.+. 
..+ T Consensus 314 ~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~-~~~~~~~a~~~~d~~~~e 390 (714) T protein:vir:27 314 HFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QAK-RVIMDEDATQLSDNDLME 390 (714) T ss_pred cccccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh--cCC-ceeeecCcccccHHHHHH Confidence 99999999999999999999999987777778999999999999999999988865 455 56688888877643 222 Q ss_pred hc---ccccccccc---ccccccccccccCCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHH Q lcl|NC_019423. 425 DD---GQDYEYNPM---QGNPSQSIMEHKFPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAA 498 (756) Q Consensus 425 ~~---~~~~~~~~~---~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa 498 (756) .. +....+++. +..+...|++.+.+++|++.++++++..+.++++|||+++++|..+++.|++| +++++++| T Consensus 391 ~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvA--i~~rq~qg 468 (714) T protein:vir:27 391 QIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVA--ISNLVEQG 468 (714) T ss_pred hccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHH--HHHHHHHH Confidence 22 222223222 11223457888889999999999999999999999999999999998877766 88999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecC-------ceeecC---------HhHhcCcceEEEecccc Q lcl|NC_019423. 
499 SKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNE-------QYVEIK---------REDLKGNFDIEVDINTA 562 (756) Q Consensus 499 ~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~-------~~v~i~---------~d~~~~~~Dv~V~~g~a 562 (756) .+.+..++|||+.+++.+|+++|+||++||+++|++||+|+ .++.+| ||..+++|||+|+++++ T Consensus 469 ~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~ 548 (714) T protein:vir:27 469 ATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQ 548 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccC Confidence 99999999999999999999999999999999999999975 277776 45567899999999998 Q ss_pred cHHHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCCh---hHHHHhhhccCCCCh---hh-------hhHHHHHHHH Q lcl|NC_019423. 563 EIDNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMP---DLAHELRTWQPQPDP---ME-------EQLKQLAIQK 629 (756) Q Consensus 563 ~~~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~---~~~~~l~~~~~q~~p---~~-------~~~~q~~~~~ 629 (756) +.+++. +.+..|++++. .++|.....++..+++++++| ++.+.|+...+++++ +. .+++++++++ T Consensus 549 ~~t~r~-~~~~~l~~l~~-~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q 626 (714) T protein:vir:27 549 TPAFKA-QLAQRMSEVIQ-GLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQ 626 (714) T ss_pred chHHHH-HHHHHHHHHHh-hcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHH Confidence 765433 33334444332 234444333444445555554 677777666554332 11 1111222233 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHhhccCCc Q lcl|NC_019423. 630 AQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARDMEKQKAQSQGN-----QNLQITKALTTPTKEGETT 704 (756) Q Consensus 630 aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~~~~~~q~~~~-----~~~~~~~a~~~~~~~~~~~ 704 (756) ++++.++++++.++.+|++.++++++.....++....+....+...+ ...++++. .+..+.+..+..+.-..+. 
T Consensus 627 ~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~-~~~~a~~a~~~~~~~~~~~~~~~~~~q~~q~~ 705 (714) T protein:vir:27 627 AELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVD-ALNQAHTAEIITGVQNMEQEQDVLQQQMLYTL 705 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhHhhhhhhhHHHHHHHHHHH Confidence 33444455556666666555555554443333222211111110000 00000000 0000111111111111111 Q ss_pred hhhhccCCC Q lcl|NC_019423. 705 PNISAAVGY 713 (756) Q Consensus 705 ~~~~~a~~~ 713 (756) +-+..++.. T Consensus 706 ~~~~~~~~~ 714 (714) T protein:vir:27 706 QQRMNEMSL 714 (714) T ss_pred HHHHHhcCC Confidence 111122211 No 11 >protein:vir:3296 Length: 714 # NCBI annotation: putative portal protein # Family: family:all:487 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049512;genbank:gi:9632518;genbank:GeneID:1262005 Probab=100.00 E-value=8.8e-93 Score=525.25 Aligned_cols=618 Identities=12% Similarity=0.025 Sum_probs=396.8 Q ss_pred CCcccCCCCCCCccccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCC----CCCCCcccCH Q lcl|NC_019423. 1 MEHQDTFKPLPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAKPPK----IKGRSQVQPR 76 (756) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~----~~grS~~v~~ 76 (756) |++-...-+ ++--+++.++ +...+...+..+...+...-....++.+||+|+-+.+... ..||+-++-+ T Consensus 1 ~~~~~~~~~--~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N 73 (714) T protein:vir:32 1 MKNETNTMA--TKNDNGATPR-----FSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHN 73 (714) T ss_pred CCccccccc--CCCCcchhHH-----HHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEec Confidence 554444322 2221222222 1222222233332222222344568999999987754333 4799999999 Q ss_pred HHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchH--HHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeee Q lcl|NC_019423. 
77 LVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDE--LAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWE 154 (756) Q Consensus 77 ~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~--~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~ 154 (756) .|+.+|+|++..- -.+-.=+.|.|++++|+ +.|+..|.+++|+.. .++.-....++++++|++|.|++.++|+ T Consensus 74 ~i~~~v~~v~g~~----~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~-~~~~~~~~s~af~~~~~~G~G~~~~~~~ 148 (714) T protein:vir:32 74 LIAPTVDGVLGME----AKTRTDLVVMSDEPDDETEKLAEAINAEFADACR-LGNMNKARSDAYAEQIKAGLSWVEVRRN 148 (714) T ss_pred cHHHHHHHHHhHH----HhCCcceEEecCCCCchhHHHHHHHHHHHHHHHH-hhchhHHHHHHHHHhhhcCcceEEeccc Confidence 9999999998877 45556699999987665 689999999999977 4444445778999999999998777654 Q ss_pred eeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeec Q lcl|NC_019423. 155 RKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALV 234 (756) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~ 234 (756) ++ ... T Consensus 149 ~d---------------------------------------------------------------------------~~~ 153 (714) T protein:vir:32 149 SD---------------------------------------------------------------------------PFG 153 (714) T ss_pred cC---------------------------------------------------------------------------CCC Confidence 21 012 Q ss_pred CceeEEEechhheEeCCCCcC-ccccCceEEEEeecCHHHHHhhccchhhhcccCc---hhh----------hhhh--ch Q lcl|NC_019423. 235 NRPTVEMLNPNNVVIDPSCNG-DLDKALYAVISFETCKADLMKNKDRYHNLDKIDW---ESS----------SPIT--DP 298 (756) Q Consensus 235 g~~~ie~V~p~~~~~Dp~a~~-d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~---~~~----------~~~~--~~ 298 (756) ++|+|++|||++|||||++++ |++||+|++|++|+|+++++.+++....+..... ... .... .. 
T Consensus 154 ~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~ 233 (714) T protein:vir:32 154 PEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWE 233 (714) T ss_pred CCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchh Confidence 458899999999999998765 9999999999999999999999765332211111 000 0000 00 Q ss_pred hhhccccccccccccccceEEEEEEEEEee---------------ccC------------------CceeEEEEEEEECC Q lcl|NC_019423. 299 DHESKTPSDFQFKDALRKKVVAYEYWGFYD---------------IND------------------DGSLEPIVATWIGS 345 (756) Q Consensus 299 ~~~~~~~~~~~~~d~s~~~V~v~E~w~k~d---------------~~~------------------~g~~~~~~~~~~g~ 345 (756) ...........|.+..+++|+|+|||+|.. +++ ......++++|+|+ T Consensus 234 ~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~ 313 (714) T protein:vir:32 234 EYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGP 313 (714) T ss_pred hhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecC Confidence 111112233445566778999999998732 111 12245677889999 Q ss_pred EEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccch-hhh Q lcl|NC_019423. 346 TLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNR-RRY 424 (756) Q Consensus 346 ~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~-~~~ 424 (756) ++|+.+++||+|++|||||+++++.+.....+|++|.++|+||++|++.+++++++ +++ ++++.+|++++.+. 
..+ T Consensus 314 ~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~-~~~~~~~a~~~~d~~~~e 390 (714) T protein:vir:32 314 HFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QAK-RVIMDEDATQLSDNDLME 390 (714) T ss_pred cccccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh--cCC-ceeeecCcccccHHHHHH Confidence 99999999999999999999999987777778999999999999999999988865 455 56688888877643 222 Q ss_pred hc---ccccccccc---ccccccccccccCCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHH Q lcl|NC_019423. 425 DD---GQDYEYNPM---QGNPSQSIMEHKFPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAA 498 (756) Q Consensus 425 ~~---~~~~~~~~~---~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa 498 (756) .. +....+++. +..+...|++.+.+++|++.++++++..+.++++|||+++++|..+++.|++| +++++++| T Consensus 391 ~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvA--i~~rq~qg 468 (714) T protein:vir:32 391 QIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVA--ISNLVEQG 468 (714) T ss_pred hccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHH--HHHHHHHH Confidence 22 222223222 11223457888889999999999999999999999999999999998877766 88999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecC-------ceeecC---------HhHhcCcceEEEecccc Q lcl|NC_019423. 
499 SKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNE-------QYVEIK---------REDLKGNFDIEVDINTA 562 (756) Q Consensus 499 ~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~-------~~v~i~---------~d~~~~~~Dv~V~~g~a 562 (756) .+.+..++|||+.+++.+|+++|+||++||+++|++||+|+ .++.+| ||..+++|||+|+++++ T Consensus 469 ~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~ 548 (714) T protein:vir:32 469 ATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQ 548 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccC Confidence 99999999999999999999999999999999999999975 277776 45567899999999998 Q ss_pred cHHHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCCh---hHHHHhhhccCCCCh---hh-------hhHHHHHHHH Q lcl|NC_019423. 563 EIDNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMP---DLAHELRTWQPQPDP---ME-------EQLKQLAIQK 629 (756) Q Consensus 563 ~~~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~---~~~~~l~~~~~q~~p---~~-------~~~~q~~~~~ 629 (756) +.+++. +.+..|++++. .++|.....++..+++++++| ++.+.|+...+++++ +. .+++++++++ T Consensus 549 ~~t~r~-~~~~~l~~l~~-~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q 626 (714) T protein:vir:32 549 TPAFKA-QLAQRMSEVIQ-GLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQ 626 (714) T ss_pred chHHHH-HHHHHHHHHHh-hcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHH Confidence 765433 33334444332 234444333444445555554 677777666554332 11 1111222233 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHhhccCCc Q lcl|NC_019423. 630 AQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARDMEKQKAQSQGN-----QNLQITKALTTPTKEGETT 704 (756) Q Consensus 630 aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~~~~~~q~~~~-----~~~~~~~a~~~~~~~~~~~ 704 (756) ++++.++++++.++.+|++.++++++.....++....+....+...+ ...++++. .+..+.+..+..+.-..+. 
T Consensus 627 ~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~-~~~~a~~a~~~~~~~~~~~~~~~~~~q~~q~~ 705 (714) T protein:vir:32 627 AELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVD-ALNQAHTAEIITGVQNMEQEQDVLQQQMLYTL 705 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhHhhhhhhhHHHHHHHHHHH Confidence 33444455556666666555555554443333222211111110000 00000000 0000111111111111111 Q ss_pred hhhhccCCC Q lcl|NC_019423. 705 PNISAAVGY 713 (756) Q Consensus 705 ~~~~~a~~~ 713 (756) +-+..++.. T Consensus 706 ~~~~~~~~~ 714 (714) T protein:vir:32 706 QQRMNEMSL 714 (714) T ss_pred HHHHHhcCC Confidence 111122211 No 12 >protein:vir:104437 Length: 714 # NCBI annotation: putative phage portal protein # Family: family:all:487 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794061;genbank:gi:116222006;genbank:GeneID:4397502 Probab=100.00 E-value=1.2e-91 Score=518.98 Aligned_cols=613 Identities=12% Similarity=0.045 Sum_probs=400.3 Q ss_pred CCcccCC-CCCCCccccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCC----CCCCCcccC Q lcl|NC_019423. 1 MEHQDTF-KPLPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAKPPK----IKGRSQVQP 75 (756) Q Consensus 1 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~----~~grS~~v~ 75 (756) |...-+. ...++...-.. +.. ..+..+..+.++ +...-+...++.+||+|+-+.+... ..||+-++- T Consensus 1 ~~~~~~~~~~~~~~~~~~~---~~~-~~l~~~~~~~~~----~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~ 72 (714) T protein:vir:10 1 MKNEINTTAMKNDHGSTPR---FSQ-RQLLSLCSDIDS----QPLWRDAANKACAYYDGDQLAPEVIQVLKDRGQPMTIH 72 (714) T ss_pred CCcCcCcccCCCcchhhhh---hhH-HHHHHHHHHHhh----hHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEe Confidence 6543222 12222221111 111 223333333332 2222245668999999987754222 479999999 Q ss_pred HHHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchH--HHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEee Q lcl|NC_019423. 
76 RLVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDE--LAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGW 153 (756) Q Consensus 76 ~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~--~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w 153 (756) +.|+.+|+|++..- -.+-.=+.|.|++++|+ +.|+..|.+++|+.... +.-....++|+++|++|.|+++++| T Consensus 73 N~i~~~v~~v~g~~----~~nr~~~~v~pr~~~~~~~~~Ae~l~~~~~~~~~~~-~~~~~~s~af~~~~~~G~G~~~~~~ 147 (714) T protein:vir:10 73 NLIAPTVDGVLGME----AKTRTDLIVMSDDPNDETEKLAEAINAEFADACRLG-NMNKARSDAYAEQIKAGLSWVEVRR 147 (714) T ss_pred ccHHHHHHHHHHHH----HhCCcceEEecCCCChhhHHHHHHHHHHHHHHHHhh-chhHHHHHHHHHhhhcccceEEeee Confidence 99999999998887 45555689999987765 68999999999997744 3344577899999999999888766 Q ss_pred eeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeee Q lcl|NC_019423. 154 ERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKAL 233 (756) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~ 233 (756) +++. . T Consensus 148 d~d~---------------------------------------------------------------------------~ 152 (714) T protein:vir:10 148 NSEP---------------------------------------------------------------------------F 152 (714) T ss_pred ccCC---------------------------------------------------------------------------C Confidence 5321 1 Q ss_pred cCceeEEEechhheEeCCCCcC-ccccCceEEEEeecCHHHHHhhccchhhhcccCchhh------h--------hh-hc Q lcl|NC_019423. 234 VNRPTVEMLNPNNVVIDPSCNG-DLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESS------S--------PI-TD 297 (756) Q Consensus 234 ~g~~~ie~V~p~~~~~Dp~a~~-d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~------~--------~~-~~ 297 (756) .++|+|++|+|++|||||++++ |++||+|++|++|||+++++.+++...++........ . .. .. 
T Consensus 153 ~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fp~~a~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 232 (714) T protein:vir:10 153 GPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAW 232 (714) T ss_pred CCCeEEEecChhheeeccccccCChhhhhhhhhhccCCHHHHHHhcCCchhhhhccchhhcCcccchhhhhhcccccccc Confidence 2458899999999999998764 9999999999999999999998765322211111000 0 00 00 Q ss_pred hhhhccccccccccccccceEEEEEEEEEee---------------ccC------------------CceeEEEEEEEEC Q lcl|NC_019423. 298 PDHESKTPSDFQFKDALRKKVVAYEYWGFYD---------------IND------------------DGSLEPIVATWIG 344 (756) Q Consensus 298 ~~~~~~~~~~~~~~d~s~~~V~v~E~w~k~d---------------~~~------------------~g~~~~~~~~~~g 344 (756) ..+.........|.+..+++|+|+|||+|.. .++ ....+.++++|.| T Consensus 233 ~~~~~~~~~~~~~~~~~~~rV~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g 312 (714) T protein:vir:10 233 EEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVG 312 (714) T ss_pred hhhcccccccccccccCcceEEEEEEEEeEEEEEEeecCCCCCeeeeCccCHHHHHHHHhccceecccceeeEEEEEEec Confidence 1111222233445666778999999998731 111 1223566788999 Q ss_pred CEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccch-hh Q lcl|NC_019423. 345 STLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNR-RR 423 (756) Q Consensus 345 ~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~-~~ 423 (756) .++|+.+++||+|++|||+|+++++.+....++|++|.++|+||.+|++.+++++.| +++ ++++.+|++++.++ .. 
T Consensus 313 ~~~L~~~~~p~p~~~fp~vP~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~-~~~~~~gav~~~d~~~~ 389 (714) T protein:vir:10 313 PHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QAK-RVIMDEDATQLSDNDLM 389 (714) T ss_pred chhhhcCCCCCCCCceeeEEecceeeeccCccceehhhhhhHHHHHHHHHHHHHHHH--hCC-ceeeccccccccHHHHH Confidence 999999999999999999999999988777788999999999999999999988865 344 67889999987654 32 Q ss_pred hhc---ccccccccc---ccccccccccccCCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHH Q lcl|NC_019423. 424 YDD---GQDYEYNPM---QGNPSQSIMEHKFPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDA 497 (756) Q Consensus 424 ~~~---~~~~~~~~~---~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~a 497 (756) +.. +....+++. +..++..+++.+.+++|+++++++++..+.++++|||+++++|..+++.|++| |++++++ T Consensus 390 e~~~rp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvA--I~~r~~q 467 (714) T protein:vir:10 390 EQLERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVA--ISNLVEQ 467 (714) T ss_pred HhccCCCCeEEecccccccCCccccccccCCCCCcHHHHHHHHHHHHHHHHhhCCCHHHcCCCcchhHHHH--HHHHHHH Confidence 222 122223221 12234568888899999999999999999999999999999999998876666 8899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecC-------ceeecC---------HhHhcCcceEEEeccc Q lcl|NC_019423. 
498 ASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNE-------QYVEIK---------REDLKGNFDIEVDINT 561 (756) Q Consensus 498 a~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~-------~~v~i~---------~d~~~~~~Dv~V~~g~ 561 (756) |.+.+..++|||+.+++.+|+++|+||++||+++|++||+|+ .++.+| ||..+++|||+|++++ T Consensus 468 g~~~l~~~~dnl~~~~~~~g~~ll~li~~~~~~~rv~RI~~e~~~~~~~~~~~~n~~~~~~~~~nDi~~~~~dv~i~~~p 547 (714) T protein:vir:10 468 GATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQ 547 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcccceeEeeccccCCccccccceeeeEEEEEeecc Confidence 999999999999999999999999999999999999999975 256665 4566789999999999 Q ss_pred ccHHHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCCh---hHHHHhhhccCCCCh---h-------hhhHHHHHHH Q lcl|NC_019423. 562 AEIDNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMP---DLAHELRTWQPQPDP---M-------EEQLKQLAIQ 628 (756) Q Consensus 562 a~~~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~---~~~~~l~~~~~q~~p---~-------~~~~~q~~~~ 628 (756) ++.+++ .+++..|++++. .++|.....++..+++++++| ++.+.+++..+++++ . +.++++++++ T Consensus 548 ~~~s~r-~~~~~~l~ql~~-~~~p~~~~~~~~~~le~~d~p~~~ei~~~ir~~~~~~~~~~~~~~e~q~~q~~~~~~~~~ 625 (714) T protein:vir:10 548 QTPAFK-AQLAQRMSEVIQ-GLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQ 625 (714) T ss_pred CcHHHH-HHHHHHHHHHHh-hcCchhhhhHHHHHHHhcCCcCHHHHHHHHHHHcCCCCCccccCcchhHHHHHHHHHHHH Confidence 876543 333444444442 244555555555566666666 466666555443321 1 1111222333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---------HHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019423. 629 KAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARDME---------KQKAQSQGNQNLQITKALTTPTK 699 (756) Q Consensus 629 ~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~~---------~~~~q~~~~~~~~~~~a~~~~~~ 699 (756) +++++.+++++++++.++++.++++++.....+++.+.+....+...+ +......++++...+ ..+. 
T Consensus 626 q~~l~~~e~~a~~~k~eaea~~~~aqa~~~~~~a~~~~~~~~~q~~~~~~~~a~~a~~l~~~~~~~q~~~~~----~q~~ 701 (714) T protein:vir:10 626 QAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQNMEQEQDVL----QQQM 701 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHH----HHHH Confidence 444555566666666666666655555443333222211111110000 000000011111111 0111 Q ss_pred ccCCchhhhccCCC Q lcl|NC_019423. 700 EGETTPNISAAVGY 713 (756) Q Consensus 700 ~~~~~~~~~~a~~~ 713 (756) .....+-++ ++.. T Consensus 702 ~q~~~~~~~-~~~~ 714 (714) T protein:vir:10 702 LYTLQQRMN-EMSL 714 (714) T ss_pred HHHHHHHHH-hcCC Confidence 111111111 1211 No 13 >protein:vir:77597 Length: 725 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:YP_063735;genbank:gi:51236726;genbank:GeneID:2944241 Probab=100.00 E-value=1.7e-85 Score=485.26 Aligned_cols=592 Identities=13% Similarity=0.092 Sum_probs=378.6 Q ss_pred CCch-HHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCC----CCCCCcccCHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_019423. 22 WKKE-PSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAKPPK----IKGRSQVQPRLVRRQAEWRYAPLSEPFLSS 96 (756) Q Consensus 22 ~~~~-~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~----~~grS~~v~~~v~~~~e~~~~~L~~~f~~~ 96 (756) |.+. ..+..+...|+.+..+....-....++.+||+|+.+.+... ..||. +-+.|+.+|+|++..-. .+ T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~q~rp--~~N~i~~~i~~v~g~~~----~n 74 (725) T protein:vir:77 1 MADNENRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRG--QFDVVRPVVRKLVSEMR----QN 74 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCC--ccccHHHHHHHHHhhHH----hC Confidence 5433 35666666777777766665566778999999987754222 45666 55999999999977663 36 Q ss_pred CCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeeeeeeeeeecCCCCCHH Q lcl|NC_019423. 
97 SKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIKTETPVFQLYPIENQE 176 (756) Q Consensus 97 ~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~ 176 (756) -.-+.|.|+.++|++.|+..|.+++|+.. .++.-....++++++|++|.|++.+.|+++.+... T Consensus 75 r~d~~v~P~~~~d~~~Ae~l~~~~~~~~~-~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~~--------------- 138 (725) T protein:vir:77 75 PIDVLYRPKDGARPDAADVLMGMYRTDMR-HNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPT--------------- 138 (725) T ss_pred CcceEEecCCccHHHHHHHHHHHHHHHHH-hhCchhHHHHHHHHHhhcCcceeeeeecccCCCCC--------------- Confidence 77799999999999999999999999955 56666667789999999999988887765421000 Q ss_pred HHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEE----EechhheEeCCC Q lcl|NC_019423. 177 QADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVE----MLNPNNVVIDPS 252 (756) Q Consensus 177 ~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie----~V~p~~~~~Dp~ 252 (756) .++++|. ..+|.+|||||. T Consensus 139 ---------------------------------------------------------~~~~~i~~~~~~~~~~~v~~Dp~ 161 (725) T protein:vir:77 139 ---------------------------------------------------------SNNQVIRREPIHSACSHVIWDSN 161 (725) T ss_pred ---------------------------------------------------------CCceeeEEeecccChhhceeCch Confidence 0112222 236888999999 Q ss_pred CcC-ccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceEEEEEEEEEeec-- Q lcl|NC_019423. 253 CNG-DLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVAYEYWGFYDI-- 329 (756) Q Consensus 253 a~~-d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~E~w~k~d~-- 329 (756) ++. |++||+|+++++|||++++..++..|. ....+..+..+. 
....+.| .+.++|+|+|||+|..+ T Consensus 162 a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~------~~~~~~~~~~~~---~~~~~~~--~~~d~vrv~E~~~r~~~~~ 230 (725) T protein:vir:77 162 SKLMDKSDARHCTVIHSMSQNGWEDFAEKYD------LDADDIPSFQNP---NDWVFPW--LTQDTIQIAEFYEVVEKKE 230 (725) T ss_pred hhccChhhHHHHHHHhcCCHHHHHHHHhhCC------cchhhccccccc---ccccccc--cCCCeeEEEEEEEEEEEee Confidence 875 999999999999999998766554332 111111111110 0011122 23468999999997531 Q ss_pred ------c----------------------CCcee----------EEEEEEEECCEEEEecccccCCCccceEEeeeeee- Q lcl|NC_019423. 330 ------N----------------------DDGSL----------EPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPR- 370 (756) Q Consensus 330 ------~----------------------~~g~~----------~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~- 370 (756) + +.|.. +.+.+++.|.++| .+++||+|++|||||+++++. T Consensus 231 ~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~rv~~~~~~g~~~l-~~~~~~~~~~~P~vP~~g~r~~ 309 (725) T protein:vir:77 231 TAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVL-KDKQLIAGEHIPIVPVFGEWGF 309 (725) T ss_pred EEEEecCCCCcceeecChhhHHHHHHHhhhcCchhhhhcccceeeeeEeeecCceee-ccCCcCCCCccceEEEeeeeec Confidence 0 11221 1122334566555 478899999999999999864 Q ss_pred -cCcccCCchHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhhhhccccc--cccccc----ccccccc Q lcl|NC_019423. 371 -KRELFGEADAELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDY--EYNPMQ----GNPSQSI 443 (756) Q Consensus 371 -~~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~--~~~~~~----~~~~~~i 443 (756) ++..|++|+||+|+|+||++|+++|+++++++++++.++++.+|+++.....+....... ..+.+. 
..+.+.+ T Consensus 310 ~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~i 389 (725) T protein:vir:77 310 VEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENSGDLPTQPL 389 (725) T ss_pred cCCcccccchhhhhhhHHHHHHHHHHHHHHHHHhccccccccchhhhhHHHHHHHhccCCceecccccccCCCcccccCc Confidence 677888899999999999999999999999999999999999999987766555443221 111111 1234567 Q ss_pred ccccCCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 444 MEHKFPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAM 523 (756) Q Consensus 444 ~~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~l 523 (756) .+.+.|++|+++++|++.....++++|||+++++|..+++.|+.| ++++++++.+.+..++|||+.+++++|+++|+| T Consensus 390 ~~~~~~~lp~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~a--i~~rq~qg~~~~~~~~Dnl~~~~~~~g~~lL~l 467 (725) T protein:vir:77 390 AYYENPEVPQANAYMLEAATSAVKEVATLGVDTEAVNGGQVAFDT--VNQLNMRADLETYVFQDNLATAMRRDGEIYQSI 467 (725) T ss_pred cccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhCCCchhhHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 788899999999999999999999999999999999988765555 889999999999999999999999999999999 Q ss_pred HHhhCCCCcEEEEecCc----eeecCH-------------hHhcCcceEEEecccccHHHH--HHHHHHHHHHHhhccCC Q lcl|NC_019423. 524 NAVFLSEKEVVRITNEQ----YVEIKR-------------EDLKGNFDIEVDINTAEIDNQ--KSQDLGFMVQTLGNTVD 584 (756) Q Consensus 524 i~q~~~~~r~iRI~g~~----~v~i~~-------------d~~~~~~Dv~V~~g~a~~~~~--~~q~l~~llq~~~~~~~ 584 (756) |.+||+++|++||+|++ |+.+|. .+++|+|||+|++++++.+.+ ....|+++++.+.+ .. 
T Consensus 468 I~~~~~~~rv~RI~~ed~~~~~v~in~~~~~~~~G~~~~~NDi~g~~Dv~v~~~p~~~s~r~~~~~~l~qll~~~~~-~~ 546 (725) T protein:vir:77 468 VNDIYDVPRNVTITLEDGSEKDVQLMAEVVDLATGEKQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQ-GT 546 (725) T ss_pred HHHHcCCCcEEEEecCCCCcceeeecccccccccchhHhhhhhccceeeEEeeccchHHHHHHHHHHHHHHHHhccc-cc Confidence 99999999999999874 777762 245789999999998876543 33445556555533 33 Q ss_pred HhHHHHHHHHHHhhcCChhHH---HHhhhccCC------CChh----hhhHHHHHHHHHHHHHHHHHHHHHH-----HHH Q lcl|NC_019423. 585 QSITLSLVAKIAELKRMPDLA---HELRTWQPQ------PDPM----EEQLKQLAIQKAQLENEELQSKIAL-----NNA 646 (756) Q Consensus 585 ~~~~~~~l~~l~e~~~~~~~~---~~l~~~~~q------~~p~----~~~~~q~~~~~aq~e~~~~qa~a~~-----~~a 646 (756) + ....++..++++++.+... +.+++..++ .+|. .+++++.++++++++..++++++.. .++ T Consensus 547 ~-~~~~~l~~~~~l~d~~~~~e~~erirkq~~~~~~~q~~~~~e~q~~~~~qq~~~~q~~~e~~q~q~~~~~~qa~~~ka 625 (725) T protein:vir:77 547 P-EYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKA 625 (725) T ss_pred h-hHHHHHHHhhccccchHHHHHHHHHHhhhhhhhccCCCChhhHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHH Confidence 3 3445566677777776444 444432221 1111 1122222222333333333322222 222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH-----------H----------HHHHH--------------HHHHHHHHHHHHHH Q lcl|NC_019423. 647 KAKEAASSGDLKDLDYLEQESGTKH-----------A----------RDMEK--------------QKAQSQGNQNLQIT 691 (756) Q Consensus 647 ~a~~~~aq~~~~~~~~~~q~~~~k~-----------~----------~~~~~--------------~~~q~~~~~~~~~~ 691 (756) +++...++.++.+.+...+.++.+. . +.++. +.....++++.++. T Consensus 626 q~e~~k~q~~a~~~~~~a~~~aa~~~~~~~q~~~~q~a~~~~~~~~~~~~q~~~~~~~~~~ae~~~~~~~~~~~q~~~~~ 705 (725) T protein:vir:77 626 QNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDARANAELLLKGDEQTHKQRMDIA 705 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHhhhHHHhhHHHHH Confidence 2222222222111111111111100 0 00000 00000001111111 Q ss_pred HHHHHHhhccCCchhhhccCCCCCCCcc---cCch Q lcl|NC_019423. 
692 KALTTPTKEGETTPNISAAVGYNTLTNG---NSPQ 723 (756) Q Consensus 692 ~a~~~~~~~~~~~~~~~~a~~~~~~~~~---~~~~ 723 (756) +++...+.. -+|+ ++|+ T Consensus 706 ~~~~~~~~~---------------~~~~~~~~~~~ 725 (725) T protein:vir:77 706 NILQSQRQN---------------QPSGSVAETPQ 725 (725) T ss_pred HHHHHHHhc---------------CCCcCcccCCC Confidence 111111110 0121 1111 No 14 >protein:vir:9263 Length: 725 # NCBI annotation: 1 # Family: family:all:487 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720327;genbank:gi:24371585;genbank:GeneID:955785 Probab=100.00 E-value=7.2e-85 Score=481.87 Aligned_cols=608 Identities=12% Similarity=0.082 Sum_probs=374.9 Q ss_pred CCch-HHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCC----CCCCCcccCHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_019423. 22 WKKE-PSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAKPPK----IKGRSQVQPRLVRRQAEWRYAPLSEPFLSS 96 (756) Q Consensus 22 ~~~~-~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~----~~grS~~v~~~v~~~~e~~~~~L~~~f~~~ 96 (756) |.+. ..+..+...|+.+..+....-....++.+||+|+.+.+... ..||. +-+.|+.+|+|++..- -.+ T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~q~rp--~~N~i~~~i~~v~g~e----~~n 74 (725) T protein:vir:92 1 MADNENRLESILSRFDADWTASDEARREAKNDLFFSRISQWDDWLSQYTTLQYRG--QFDVVRPVVRKLVSEM----RQN 74 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCC--cccchHHHHHHHHhhH----HhC Confidence 4332 45666666677776666665566778999999988754222 45666 5599999999997766 336 Q ss_pred CCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeeeeeeeeeecCCCCCHH Q lcl|NC_019423. 97 SKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIKTETPVFQLYPIENQE 176 (756) Q Consensus 97 ~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~ 176 (756) -.-+.|.|+.++|++.|+..|.+++|+.. .++.-....++++++|++|.|++.+.|+++.+... 
T Consensus 75 r~d~~v~P~~~~d~~~Ae~l~~~~~~~~~-~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~~--------------- 138 (725) T protein:vir:92 75 PIDVLYRPKDGASPDAADVLMGMYRTDMR-HNTAKIAVNVAVREQIESGVGAWRLVTDYEDQSPT--------------- 138 (725) T ss_pred CcceEEecCCccHHHHHHHHHHHHHHHHH-hhCchHHHHHHHHHHhhcCcceeeeeecccCCCCC--------------- Confidence 67799999999999999999999999955 66666667799999999999988876665421000 Q ss_pred HHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCc--eeEEEe--chhheEeCCC Q lcl|NC_019423. 177 QADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNR--PTVEML--NPNNVVIDPS 252 (756) Q Consensus 177 ~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~--~~ie~V--~p~~~~~Dp~ 252 (756) .++ |++..| |+.+|||||. T Consensus 139 ---------------------------------------------------------~~~~~i~~~~i~~~~~~V~~Dp~ 161 (725) T protein:vir:92 139 ---------------------------------------------------------SNNQVIRREPIHSACSHVIWDSN 161 (725) T ss_pred ---------------------------------------------------------CCceeeEEeeccCChhhcccCch Confidence 011 223332 3557999999 Q ss_pred CcC-ccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceEEEEEEEEEeec-- Q lcl|NC_019423. 253 CNG-DLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVAYEYWGFYDI-- 329 (756) Q Consensus 253 a~~-d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~E~w~k~d~-- 329 (756) ++. |++||+|+|+++||+++++..+...|. ....+.....+. ....+.| .++++|+|+|||+|..+ T Consensus 162 a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~------~~~~~~~~~~~~---~~~~~~~--~~~d~vrv~e~~~r~~~~~ 230 (725) T protein:vir:92 162 SKLMDKSDSRHCTVIHSMSQNGWEDFAEKYD------LDADDIPSFQNP---NDWVFPW--LTQDTIQIAEFYEVVEKKE 230 (725) T ss_pred hhccChhhHHHHHHHhcCCHHHHHHHHhhcC------cchhhhhhcccC---Ccccccc--cCCCeEEEEEEEEEEEEee Confidence 875 999999999999999987765543332 111111111110 0111112 24568999999997432 Q ss_pred ------c---C-------------------Ccee----------EEEEEEEECCEEEEecccccCCCccceEEeeeeee- Q lcl|NC_019423. 
330 ------N---D-------------------DGSL----------EPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPR- 370 (756) Q Consensus 330 ------~---~-------------------~g~~----------~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~- 370 (756) + | .|.. +.+.+++.|.++|+ +++||+|++|||||+++++. T Consensus 231 ~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~-~~~~~~~~~~P~vP~~g~r~~ 309 (725) T protein:vir:92 231 TAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLK-DKQLIAGEHIPIVPVFGEWGF 309 (725) T ss_pred eEEeecCCCCCceeecChhhHHHHHHHHhccCchhhhhccceeeeEeeeeecchhhhc-CCCCCCCCceeeEEEEeeeec Confidence 0 1 1211 12223456776664 57899999999999998875 Q ss_pred -cCcccCCchHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhhhhccccc--ccccc----cccccccc Q lcl|NC_019423. 371 -KRELFGEADAELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDY--EYNPM----QGNPSQSI 443 (756) Q Consensus 371 -~~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~--~~~~~----~~~~~~~i 443 (756) ++..|++|+||+|+|+||++|+++|+++++++++++.+++++.|++++....+....... ..+.+ +..+...+ T Consensus 310 ~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~i 389 (725) T protein:vir:92 310 VEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEMPTQPL 389 (725) T ss_pred cCCcccccceeccchhHHHHHHHHHHHHHHHHHhccCcccccchhhhhHHHHHHhccCccceeeccccccccccccccCC Confidence 677888899999999999999999999999999999999999999987654443332221 11111 11234567 Q ss_pred ccccCCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 
444 MEHKFPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAM 523 (756) Q Consensus 444 ~~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~l 523 (756) .+.+.+++|+++++|++...+.++++|||++.++|..+++.++.| +++++++|.+.+..++|||+.+++++|+++|+| T Consensus 390 ~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~a--i~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~l 467 (725) T protein:vir:92 390 AYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDT--VNQLNMRADLETYVFQDNLATAMRRDGEIYQSI 467 (725) T ss_pred cccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhccCchhhHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 888899999999999999999999999999999999888765555 888999999999999999999999999999999 Q ss_pred HHhhCCCCcEEEEecC----ceeecCH-------------hHhcCcceEEEecccccHHHHH--HHHHHHHHHHhhccCC Q lcl|NC_019423. 524 NAVFLSEKEVVRITNE----QYVEIKR-------------EDLKGNFDIEVDINTAEIDNQK--SQDLGFMVQTLGNTVD 584 (756) Q Consensus 524 i~q~~~~~r~iRI~g~----~~v~i~~-------------d~~~~~~Dv~V~~g~a~~~~~~--~q~l~~llq~~~~~~~ 584 (756) |++||+++|++||+|+ +|+.||. .+++|+|||+|++++++.+++. ...+++|++.+.+ .. T Consensus 468 I~~~~~~~r~~RI~~edg~~~~v~in~~~~~~~~G~~~~~Ndi~g~~Dv~v~~~p~~~s~r~~~~~~l~ql~~~~~~-~~ 546 (725) T protein:vir:92 468 VNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQ-GT 546 (725) T ss_pred HHHhcCCCcEEEEecCCCCcceEEeccccccccccchhhhhccccceeeEEeeccChHHHHHHHHHHHHHHHHhccc-ch Confidence 9999999999999986 4777763 3467899999999998765432 2344555554432 33 Q ss_pred HhHHHHHHHHHHhhcCChhH---HHHhhhccCC-----CChhh-----hhHHHHHHHHHHHHHHHHHH-----HHHHHHH Q lcl|NC_019423. 585 QSITLSLVAKIAELKRMPDL---AHELRTWQPQ-----PDPME-----EQLKQLAIQKAQLENEELQS-----KIALNNA 646 (756) Q Consensus 585 ~~~~~~~l~~l~e~~~~~~~---~~~l~~~~~q-----~~p~~-----~~~~q~~~~~aq~e~~~~qa-----~a~~~~a 646 (756) +. ...++..++++++.+.. 
.+.++...++ +.+.+ .++++.++++++++..++++ ++.+.++ T Consensus 547 ~~-~~~~l~~~~~~~d~~~~~e~~erirkq~~~~~~~~~~~~e~~q~~~~~qqa~~~q~~~e~~~~qa~~~~~qae~~ka 625 (725) T protein:vir:92 547 PE-YQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKA 625 (725) T ss_pred hH-HHHHHHHHhhcccchHHHHHHHHHHhhhchhccCCccchhhhHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHH Confidence 33 33445566777776644 4444433221 11111 11112222222222222222 2222222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH-----HH---------HHHHHHHHHHHHHHHHH-------HHHHHHH-hhccCCc Q lcl|NC_019423. 647 KAKEAASSGDLKDLDYLEQESGTKH-----AR---------DMEKQKAQSQGNQNLQI-------TKALTTP-TKEGETT 704 (756) Q Consensus 647 ~a~~~~aq~~~~~~~~~~q~~~~k~-----~~---------~~~~~~~q~~~~~~~~~-------~~a~~~~-~~~~~~~ 704 (756) +++..+++.++.+.+...+.++.+- .+ ++.+..+..+.+.+..+ +++.... +...+.- T Consensus 626 qaE~~k~q~~a~~~~~~a~~~aa~~~~~~~q~~~~q~~~~~~~~~~~~~~q~~~~~~a~~~ae~~l~~~~~~~~~~~d~~ 705 (725) T protein:vir:92 626 QNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDARANAELLLKGNEQTHKQRMDIA 705 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHhchHHHHHHHHHHHHHHHHH Confidence 2222222222111111111111000 00 00000000000000000 1110000 0000011 Q ss_pred hhhhccCCCCCCCcccCchhcCCCCCCCCc Q lcl|NC_019423. 705 PNISAAVGYNTLTNGNSPQERDLAAQQDPA 734 (756) Q Consensus 705 ~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~ 734 (756) .+++.+ .+.++|..- ..+|- T Consensus 706 ~~~~~~------~~~~~~~~~----~~~~~ 725 (725) T protein:vir:92 706 NILQSQ------RQNQPSGSV----AETPQ 725 (725) T ss_pred HHhcch------hccCCcccc----ccCCC Confidence 111111 111111111 11111 No 15 >protein:vir:100920 Length: 725 # NCBI annotation: Gp1 # Family: family:all:487 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006406;genbank:gi:46358698;genbank:GeneID:2777070 Probab=100.00 E-value=2.2e-84 Score=479.19 Aligned_cols=606 Identities=13% Similarity=0.083 Sum_probs=372.2 Q ss_pred CCch-HHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCC----CCCCCcccCHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_019423. 
22 WKKE-PSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAKPPK----IKGRSQVQPRLVRRQAEWRYAPLSEPFLSS 96 (756) Q Consensus 22 ~~~~-~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~----~~grS~~v~~~v~~~~e~~~~~L~~~f~~~ 96 (756) |.+. ..+..+...|+.+.......-....+..+||+|+-+.+... ..||. +-+.|+.+|+|++..-. .+ T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~QW~~~~~~~l~~q~rp--~~N~i~~~v~~v~g~e~----~n 74 (725) T protein:vir:10 1 MADNENRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRG--QFDVVRPVVRKLVSEMR----QN 74 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCC--cccchHHHHHHHHhhHH----hC Confidence 4332 24556666666666655554456678999999987754222 45666 55999999999977763 35 Q ss_pred CCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeeeeeeeeeecCCCCCHH Q lcl|NC_019423. 97 SKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIKTETPVFQLYPIENQE 176 (756) Q Consensus 97 ~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~ 176 (756) -.=+.|.|+.++|++.|+..|..++|+.. .++.-....++++++|++|.|++.+.|+++.+.. T Consensus 75 r~d~~v~p~~~~d~~~Ae~l~~~~~~~~~-~~~~~~~~s~Af~~~i~~G~G~~ev~~d~~~~d~---------------- 137 (725) T protein:vir:10 75 PIDVLYRPKDGASPDAADVLMGMYRTDMR-HNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSP---------------- 137 (725) T ss_pred CcceEEecCCcchHHHHHHHHHHHHHHHH-hcCcchHHhHHHHHHhhcCcceeeeeccccCCCC---------------- Confidence 55599999999999999999999999844 4444555778999999999999888776542000 Q ss_pred HHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCc--eeEE--EechhheEeCCC Q lcl|NC_019423. 177 QADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNR--PTVE--MLNPNNVVIDPS 252 (756) Q Consensus 177 ~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~--~~ie--~V~p~~~~~Dp~ 252 (756) . .++ |++. ..++.+|||||. 
T Consensus 138 ----------------~----------------------------------------~~~~~i~~~~i~~~~~~v~~Dp~ 161 (725) T protein:vir:10 138 ----------------T----------------------------------------SNNQVIRREPIHSACSHVIWDSN 161 (725) T ss_pred ----------------C----------------------------------------CCceeeeeeecccCHhHcccCch Confidence 0 011 2222 346788999998 Q ss_pred CcC-ccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceEEEEEEEEEeec-- Q lcl|NC_019423. 253 CNG-DLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVAYEYWGFYDI-- 329 (756) Q Consensus 253 a~~-d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~E~w~k~d~-- 329 (756) ++. |++||+|+++++||+++.+......|. .+...+.+ +.+ ..++.+...+.++|+|+|||+|..+ T Consensus 162 a~~~D~sDar~~~~~~~~~~~~~~~~~~~~~-~~a~~~~~-----~~~-----~~~~~~~~~~~~~vrv~E~~~r~~~~~ 230 (725) T protein:vir:10 162 SKLMDKSDARHCTVIHSMSQNGWDDFAEKYD-LDADNIPS-----FQN-----PNDWVFPWLTQDTIQIAEFYEVVEKKE 230 (725) T ss_pred hhccChhhhhhhhhhccCCHHHHHHHHHhCC-Cccccccc-----ccc-----cccccccccCCCeEEEEEEEEEEEEee Confidence 875 999999999999999875533211111 11111110 000 1111111223567999999998532 Q ss_pred ------c---C-------------------Ccee----------EEEEEEEECCEEEEecccccCCCccceEEeeeeee- Q lcl|NC_019423. 330 ------N---D-------------------DGSL----------EPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPR- 370 (756) Q Consensus 330 ------~---~-------------------~g~~----------~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~- 370 (756) + | .|.. +.+.+++.|.++|+ +++||+|++|||||+++++. T Consensus 231 ~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~-~~~~~~~~~fP~vP~~g~r~~ 309 (725) T protein:vir:10 231 TAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLK-DKQLIAGEHIPIVPVFGEWGF 309 (725) T ss_pred EEEEeccCCCCceeecchhhhHHHHHHhhcccchhhhhccceeeEEEEEeecchhhhc-CCCCCCCCceeEEEEEeeeec Confidence 0 1 1111 22233456777664 57899999999999998865 Q ss_pred -cCcccCCchHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhhhhcccccc--cccc----cccccccc Q lcl|NC_019423. 
371 -KRELFGEADAELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYE--YNPM----QGNPSQSI 443 (756) Q Consensus 371 -~~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~--~~~~----~~~~~~~i 443 (756) ++..|++|+||+|+|+||++|+++|+++++++++++.+++++.+++++....+.....+.. .+.+ +..+...+ T Consensus 310 ~~g~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~e~~~~~~~~~~~~~~~~~~~~~g~~~~~~i 389 (725) T protein:vir:10 310 VEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEMPTQPL 389 (725) T ss_pred cCCcceeeeeeccchhHHHHHHHHHHHHHHHHHhcCCccccccHhhhhHHHHHHhccCCceeeecccccccCcccccccC Confidence 6778888999999999999999999999999999999999999999875544433322211 1111 11234567 Q ss_pred ccccCCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 444 MEHKFPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAM 523 (756) Q Consensus 444 ~~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~l 523 (756) .+.+.|++|+++++|++...+.++++|||++.++|..+++.|+.| +++++++|.+.+..++|||+.+++++|+++|+| T Consensus 390 ~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~a--i~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~l 467 (725) T protein:vir:10 390 AYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDT--VNQLNMRADLETYVFQDNLATAMRRDGEIYQSI 467 (725) T ss_pred cccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhCcCchhhHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 888899999999999999999999999999999999988765555 889999999999999999999999999999999 Q ss_pred HHhhCCCCcEEEEecCc----eeecCH-------------hHhcCcceEEEecccccHHHHHH--HHHHHHHHHhhccCC Q lcl|NC_019423. 524 NAVFLSEKEVVRITNEQ----YVEIKR-------------EDLKGNFDIEVDINTAEIDNQKS--QDLGFMVQTLGNTVD 584 (756) Q Consensus 524 i~q~~~~~r~iRI~g~~----~v~i~~-------------d~~~~~~Dv~V~~g~a~~~~~~~--q~l~~llq~~~~~~~ 584 (756) |++||+++|+|||+|++ ||.||. .+++|+|||+|++++++.+.+.. ..|++|++++.+ .. 
T Consensus 468 I~~~~~~er~~RI~~edg~~~~v~in~~~~d~~~G~~v~~Ndi~g~~Dv~v~~~p~~~s~r~~~~~~l~qll~~~~~-~~ 546 (725) T protein:vir:10 468 VNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRSEILELLGKTPQ-GT 546 (725) T ss_pred HHHHcCCCcEEEEecCCCCcceeEeccccccccccchhhhhccccceeEEEeeccCcHHHHHHHHHHHHHHHHhccc-cc Confidence 99999999999999874 777763 34578999999999987654332 345555555533 33 Q ss_pred HhHHHHHHHHHHhhcCChhH---HHHhhhccCCC---Chh-------hhhHHHHHHHHHHHHHHHHHHHHHHHH-----H Q lcl|NC_019423. 585 QSITLSLVAKIAELKRMPDL---AHELRTWQPQP---DPM-------EEQLKQLAIQKAQLENEELQSKIALNN-----A 646 (756) Q Consensus 585 ~~~~~~~l~~l~e~~~~~~~---~~~l~~~~~q~---~p~-------~~~~~q~~~~~aq~e~~~~qa~a~~~~-----a 646 (756) | ....++..++++++++.. .+.+++..++. +|+ .+++++.++++++++..++++++.+.+ + T Consensus 547 ~-~~~~~l~~~~~~~d~~~~~e~~erirkq~~~~~~~~~~~~e~~q~~~e~qq~~~~q~~~e~~q~~~~~~~~qae~~ka 625 (725) T protein:vir:10 547 P-EYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKA 625 (725) T ss_pred h-hHHHHHHHHhhcCCchhHHHHHHHHHhhhhhhccCCccccchhHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHH Confidence 3 344555667777777654 44444332211 111 112222222233333333322222222 2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH---------------------HHHHHHHHH---HHHHHHHHHHHHHHHHHhhccC Q lcl|NC_019423. 647 KAKEAASSGDLKDLDYLEQESGTKH---------------------ARDMEKQKA---QSQGNQNLQITKALTTPTKEGE 702 (756) Q Consensus 647 ~a~~~~aq~~~~~~~~~~q~~~~k~---------------------~~~~~~~~~---q~~~~~~~~~~~a~~~~~~~~~ 702 (756) +++..+++.++.+.+..++.++.+. .+.+++... +.+++..++.... ..+..-+ T Consensus 626 ~aE~~k~~~~a~~~~~~a~~~a~~~~~~~~q~~~~q~~~~~~~~~~~~~~q~~~~~~~~~~ae~~~~~~~~--~~~~~~~ 703 (725) T protein:vir:10 626 QNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDARANAELLLKGNEQ--THKQRMD 703 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHH--HHHHHhh Confidence 2222222212111111111111000 000111000 0000000000000 0001111 Q ss_pred CchhhhccCCCCCCCcccCchhcCCCCCCCCccccccccccCC Q lcl|NC_019423. 
703 TTPNISAAVGYNTLTNGNSPQERDLAAQQDPAYSLGSQYYDPS 745 (756) Q Consensus 703 ~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 745 (756) ..++++.+ . ..++|+-. =.+|. T Consensus 704 ~~~~~~~q------~-----------~~~~~~~~----~~~~~ 725 (725) T protein:vir:10 704 IANILQSQ------R-----------QNQPSGSV----AETPQ 725 (725) T ss_pred hhhccccc------c-----------ccCCCccc----ccCCC Confidence 11222211 0 11111111 11111 No 16 >protein:vir:105520 Length: 706 # NCBI annotation: phage portal protein # Family: family:all:487 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516189;genbank:gi:89885992;genbank:GeneID:3964380 Probab=100.00 E-value=1e-83 Score=475.55 Aligned_cols=609 Identities=14% Similarity=0.122 Sum_probs=369.5 Q ss_pred C--CchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccc--CCCCC--------CCCCCCcccCHHHHHHHHHHHHHH Q lcl|NC_019423. 22 W--KKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKG--KAKPP--------KIKGRSQVQPRLVRRQAEWRYAPL 89 (756) Q Consensus 22 ~--~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~--~~~~~--------~~~grS~~v~~~v~~~~e~~~~~L 89 (756) | ++.+++..+...|+.+..+....-....+..+||+++| +.+.. ...||+.++.+.|+.+|+|+++.. T Consensus 1 m~e~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~i~~~v~~v~g~~ 80 (706) T protein:vir:10 1 MAESRQKQHERVMLRFDRAWSPQQVVREKCIEATRFVRVPGGQWEGATVAGTKLDEQFEKYPKFEINKVATELNRIISEY 80 (706) T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccCCHHHHHHHHhhhhhcCCCceEecchHHHHHHHhhHH Confidence 4 34567788888888888888777777778889997554 32211 234899999999999999999988 Q ss_pred HHhhcCCCCEEEEecCC-cchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeeeeeeeeee Q lcl|NC_019423. 90 SEPFLSSSKLFKLTPVT-FEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIKTETPVFQ 168 (756) Q Consensus 90 ~~~f~~~~~~~~~~p~~-~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~ 168 (756) . .+-.=+.|.|.. .+|++.|+..|.+++|+.. .++.-....++|+++|++|.|+++++-+++.+. 
T Consensus 81 ~----~nr~~~~v~P~~~~~d~~~Ae~l~~l~~~~~~-~~~~~~a~s~Af~d~i~~G~G~~ev~~d~~~~~--------- 146 (706) T protein:vir:10 81 R----NNRISVKFRPGDNAASEELANKLNGLFRADYE-ETDGGEACDNAFDDAATGGFGCFRLTTSFVNEY--------- 146 (706) T ss_pred H----hCCCceEEecCCCCchHHHHHHHHHHHHHHHH-hcCchHHHHHHHHHHhhcCcceEEeeecccccc--------- Confidence 4 444449999975 5589999999999999854 666666688999999999999777643322100 Q ss_pred cCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEEEe-chh-h Q lcl|NC_019423. 169 LYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVEML-NPN-N 246 (756) Q Consensus 169 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie~V-~p~-~ 246 (756) ++ ....++|.|++| +|+ + T Consensus 147 ----------------------d~--------------------------------------~~~~~~i~i~~v~~p~~~ 166 (706) T protein:vir:10 147 ----------------------DP--------------------------------------MDERQRIAVEPIYDPARS 166 (706) T ss_pred ----------------------CC--------------------------------------CCCCccceeeeeccchhc Confidence 00 001234667765 455 7 Q ss_pred eEeCCCCcC-ccccCceEEEEeecCHHHHHhhccchhh-hcccCchhhhhhhc-hhhhccccccccccccccceEEEEEE Q lcl|NC_019423. 247 VVIDPSCNG-DLDKALYAVISFETCKADLMKNKDRYHN-LDKIDWESSSPITD-PDHESKTPSDFQFKDALRKKVVAYEY 323 (756) Q Consensus 247 ~~~Dp~a~~-d~~da~~v~~~~~~t~~el~~~~~~~~~-l~~~~~~~~~~~~~-~~~~~~~~~~~~~~d~s~~~V~v~E~ 323 (756) |||||.|+. |++||+|+++++|||+++++.+++.... ++.. . +.....+ ....+....++ ..++.+.++.| T Consensus 167 v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~fp~~~~~~~~~-~-~~~~~~d~~~~d~~~~~ey----y~~~~~~~~~~ 240 (706) T protein:vir:10 167 VWFDPDAKKYDKSDALWAFCMYSVSLEKYQSEYDKAPTSLDRV-G-SVSWQYDWFTPDVVYIAKY----YEVRKESVDVI 240 (706) T ss_pred eecCchhcccChhhcceEeeeecCCHHHHHHhcCCChhhhhhh-c-cccccccccCCCcceeccc----ccccceeEEEE Confidence 999999875 9999999999999999999999875431 1100 0 0000000 00000001110 01223334445 Q ss_pred EEEeecc-------------------CCcee----------EEEEEEEECCEEEEecccccCCCccceEEeeeeee--cC Q lcl|NC_019423. 
324 WGFYDIN-------------------DDGSL----------EPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPR--KR 372 (756) Q Consensus 324 w~k~d~~-------------------~~g~~----------~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~--~~ 372 (756) |++.... +.|.. ..+..++.|..+| .+++||+|++|||||+++++. ++ T Consensus 241 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~v~~~~~~g~~~l-~~~~p~~~~~~P~vP~~g~r~~~d~ 319 (706) T protein:vir:10 241 SYRQPLTQEIATYDSEQIADIQDELEQAGFEEIGRRSVKRRRIYVAVVDGDGFL-EKPRRIPGEHIPLIPVYGKRWFIDD 319 (706) T ss_pred EeeccccCCceeeccchhhhhHHHHhhCCchhhhhcccceeeEEEEeecccccc-ccCCCCCCCccceEEEeeccccccc Confidence 6553211 11221 1233456677777 579999999999999999876 77 Q ss_pred cccCCchHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhhhhccccc----ccc------ccccccccc Q lcl|NC_019423. 373 ELFGEADAELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDY----EYN------PMQGNPSQS 442 (756) Q Consensus 373 ~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~----~~~------~~~~~~~~~ 442 (756) +..++|+||+|+|+||++|+++|+++++++++.+...++..+.++.....+....... ..+ +....+... T Consensus 320 ~~~~~G~vr~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~l~~~~~~~~~g~i~~~~~~ 399 (706) T protein:vir:10 320 VERVEGHIAKAMDPQRLYNLQVSMLADAAAQDPGQTPIVDMEQIRGLEQHWEGRNRKRPAFLPLRTVTDKTGNVVAPANV 399 (706) T ss_pred cCcccceeccchhhHHHHHHHHHHHHHHHHhcCCcccccchhHHHHHHHHhhhcccccccchhcccccCCCCcccccccc Confidence 8888999999999999999999999999988766544443333222222222211110 011 111223344 Q ss_pred cccccCCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 443 IMEHKFPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICA 522 (756) Q Consensus 443 i~~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~ 522 (756) +..+..|.+++++++++++....++++|||+++++|..++ . 
++.+|+++++++.+.+..++|||+.+++++|+++|+ T Consensus 400 ~~~~~~~~~~~~~~~l~~~~~~~i~~vsGi~~~~lG~~sn-~--SG~Ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~~lL~ 476 (706) T protein:vir:10 400 AGYTQAPVLNQALAALLQQTSADIQEVTGSSQAMQQMPSN-V--ARETVNSLLNRSDMASFIYLDNMAKSLKRAGEIWLS 476 (706) T ss_pred cccCCCcchHHHHHHHHHHHHHHHHHHhCCCHHHcCCccc-h--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 5566778899999999999999999999999999998765 3 444589999999999999999999999999999999 Q ss_pred HHHhhCCCCcEEEEecC----ceeecC--------------HhHhcCcceEEEecccccHHHHH--HHHHHHHHHHhhcc Q lcl|NC_019423. 523 MNAVFLSEKEVVRITNE----QYVEIK--------------REDLKGNFDIEVDINTAEIDNQK--SQDLGFMVQTLGNT 582 (756) Q Consensus 523 li~q~~~~~r~iRI~g~----~~v~i~--------------~d~~~~~~Dv~V~~g~a~~~~~~--~q~l~~llq~~~~~ 582 (756) ||++|||++|+|||+|+ +|+.+| +|..+|+|||+|++++++.+.+. .+.+++|++.+.|. T Consensus 477 li~~~y~~~R~~RI~~ed~~~~~v~in~~~~d~~~G~~~~~nDi~~g~yDv~i~~~p~~~t~r~~~~~~m~el~~~~~p~ 556 (706) T protein:vir:10 477 MAREIYGSDREVRIVHEDGTDDIALMNAAVLDNQTGRVVALNDLSTGRYDVSVDVGPSYSARRDATVNALTQLLQGMLPQ 556 (706) T ss_pred HHHHHcCCCcEEEEecCCCCccceeeccceeccccCceeeeecceeeeEEEEEecccCcchHHHHHHHHHHHHHHhcCCc Confidence 99999999999999986 466665 46678999999999988765433 33455566555443 Q ss_pred CCHhHHHHHHHHHHhhcCCh---hHHHHhhhccCCC---ChhhhhHH-------HHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 583 VDQSITLSLVAKIAELKRMP---DLAHELRTWQPQP---DPMEEQLK-------QLAIQKAQLENEELQSKIALNNAKAK 649 (756) Q Consensus 583 ~~~~~~~~~l~~l~e~~~~~---~~~~~l~~~~~q~---~p~~~~~~-------q~~~~~aq~e~~~~qa~a~~~~a~a~ 649 (756) . 
+ ....++..+++++++| ++.+.+++..+++ .|..++.+ |++++++++++.++++++.+.+|+++ T Consensus 557 ~-~-~~~~l~~~~~~~~d~p~~~e~~e~irk~~~~q~~~~~~~~~eq~~~~q~qq~q~~q~~~~~~~~~aq~~~~qA~~~ 634 (706) T protein:vir:10 557 D-P-MRPALMGIIIDNMEGEGLDDFKAFNRRQLLTQGIVKPRNQQEQAIVQQAQQAQATQPDPNMLLAQAQMVVAQAEAQ 634 (706) T ss_pred c-h-hhHHHHHHHHhhcCccchHHHHHHHHHhhcccCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2 2 3334445555666555 5666666544332 11111111 12222222233333333333333322 Q ss_pred HHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHhhc-cCCchhhhccCCCCCCCcccCch Q lcl|NC_019423. 650 EAASSGDLKDLDYLEQES-GTKHARDMEKQKA---QSQGNQNLQITKALTTPTKE-GETTPNISAAVGYNTLTNGNSPQ 723 (756) Q Consensus 650 ~~~aq~~~~~~~~~~q~~-~~k~~~~~~~~~~---q~~~~~~~~~~~a~~~~~~~-~~~~~~~~~a~~~~~~~~~~~~~ 723 (756) ++++++...++++.+... +.++...+....+ ++...+..++++++.+.+.. +.++|+... +++.+|+ T Consensus 635 k~~a~~~q~~~~a~~a~~qa~~~~~~~~~~~~~a~~~~~~~~~q~~q~l~~~~a~q~~~~~~~~~-------~~~~~~~ 706 (706) T protein:vir:10 635 KSQNETVQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVMETLRLLKEVAASQQQTIPSPPS-------PADIVPS 706 (706) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCC-------CcccCCC Confidence 222222211111111100 0001111111111 11111111222222211111 111121111 1222222 No 17 >protein:vir:172 Length: 708 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112077;genbank:gi:13559867;genbank:GeneID:920970 Probab=100.00 E-value=7.6e-83 Score=470.78 Aligned_cols=602 Identities=12% Similarity=0.089 Sum_probs=380.8 Q ss_pred CCc--hHHHHHHHHHHHHHHHHhhHHHHHHHHHH--HHhccccCCCCCC--------CCCCCcccCHHHHHHHHHHHHHH Q lcl|NC_019423. 22 WKK--EPSIQLLKGDLESAKPAHDAIMSQIREWN--DLMEVKGKAKPPK--------IKGRSQVQPRLVRRQAEWRYAPL 89 (756) Q Consensus 22 ~~~--~~~~~~l~~~~~~a~~~~~~~~~~~~~~~--~~y~~~~~~~~~~--------~~grS~~v~~~v~~~~e~~~~~L 89 (756) |.+ +.++..+...|+.+..+......++.+.. +||.|.-+.+... 
..||+.++-+.|+.+|+|++..= T Consensus 1 ma~~~~~~~~~~~~r~~~~~~~~~~~r~~~~~d~~f~~y~G~Qw~~~~~~~l~~~~q~~~rP~~~~N~i~~~i~~v~g~e 80 (708) T protein:vir:17 1 MAETLEKKHERIMLRFDRAYSPQQEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEY 80 (708) T ss_pred CchhHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhccCCCCCCHHHHHHHHhhhhhcCCCceEEcchHHHHHHHHhhH Confidence 532 35667777778888887777777776665 5688876654222 25789999999999999996654 Q ss_pred HHhhcCCCCEEEEecCCc-chHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeeeeeeeeee Q lcl|NC_019423. 90 SEPFLSSSKLFKLTPVTF-EDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIKTETPVFQ 168 (756) Q Consensus 90 ~~~f~~~~~~~~~~p~~~-~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~ 168 (756) -.+-.=+.|.|+++ +|.+.|+..|..++|+.. .++.-....++|+++|++|.|+++++=+++.+. T Consensus 81 ----~~nr~d~~v~p~~~~~d~~~Ae~l~~l~~~~~~-~~~~~~~~s~Af~~~i~~G~G~~~~~~d~~~e~--------- 146 (708) T protein:vir:17 81 ----RNNRITVKFRPGDREASEELANKLNGLFRADYE-ETDGGEACDNAFDDAATGGFGCFRLTSMLVNEY--------- 146 (708) T ss_pred ----hhCCcceEEecCCCcchHHHHHHHHHHHHHHHH-hcCchhHHhHHHHHhhhcccceeeeeecccccC--------- Confidence 23344499999975 499999999999999866 445555577899999999999776533322100 Q ss_pred cCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeE--EEechhh Q lcl|NC_019423. 169 LYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTV--EMLNPNN 246 (756) Q Consensus 169 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~i--e~V~p~~ 246 (756) + +.. + ..+++| ..+|+.+ T Consensus 147 -----d-----------------~~~--------~------------------------------~~~i~i~~~~~~~~~ 166 (708) T protein:vir:17 147 -----D-----------------PMD--------D------------------------------RQRIAIEPIYDPSRS 166 (708) T ss_pred -----C-----------------CCC--------C------------------------------ccccceEeeccchhh Confidence 0 000 0 011333 3446789 Q ss_pred eEeCCCCcC-ccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceEEEEEEEE Q lcl|NC_019423. 
247 VVIDPSCNG-DLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVAYEYWG 325 (756) Q Consensus 247 ~~~Dp~a~~-d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~E~w~ 325 (756) |||||.++. |++||+|+++++|||+++++.+++.... ...+..... ..++.| ...++|+|+|||+ T Consensus 167 v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~------~~~~~~~~~------~~~~~~--~~~d~vrv~e~~~ 232 (708) T protein:vir:17 167 VWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPP------ASLDVTSMT------SWEYDW--FDADVIYIAKYYE 232 (708) T ss_pred eecCccccccChhhhhhhhhhccCCHHHHHHhCccccc------hhhhhhhhc------cccccc--cCCCeEEEEEEEE Confidence 999999976 9999999999999999999998764321 100000000 011112 2346899999998 Q ss_pred Eeec------------------cC------------Cce----------eEEEEEEEECCEEEEecccccCCCccceEEe Q lcl|NC_019423. 326 FYDI------------------ND------------DGS----------LEPIVATWIGSTLIRMENNPFPDGKLPLVVV 365 (756) Q Consensus 326 k~d~------------------~~------------~g~----------~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~ 365 (756) |... +. .|. .+.+.+++.|..+| .+++||||++|||||+ T Consensus 233 r~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~l-~~~~~~p~~~fP~vP~ 311 (708) T protein:vir:17 233 VRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFQEVARRSVKRRRVYVSVVDGDGFL-EKPRRIPGEHIPLIPV 311 (708) T ss_pred EeeeeeEEEEEecCccCceeeeCccchhhHHHHHHhcccccceeeeeeEEEEEEEeecccccc-cCCCCCCCCccceEEE Confidence 7421 00 011 01223445666666 5789999999999999 Q ss_pred eeeee--cCcccCCchHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhhhhccccc----cc----ccc Q lcl|NC_019423. 366 PYMPR--KRELFGEADAELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDY----EY----NPM 435 (756) Q Consensus 366 ~~~~~--~~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~----~~----~~~ 435 (756) ++++. ++....+|+||+|+|+||++|+++|+++++++++++.+++++.+++.+....+.....+. .. 
+++ T Consensus 312 ~g~r~~~d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~a~~g~~~~~~~~~~~~~~~~~~~~~~~~~ 391 (708) T protein:vir:17 312 YGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKY 391 (708) T ss_pred ecccccccCCCcccchhhhchhHHHHHHHHHHHHHHHHHhcCCcceeechhhhhhhHHhhhhcccchhhhhhhhccCCcc Confidence 99866 566655899999999999999999999999999999999999988865543332222110 01 111 Q ss_pred c--cccccccccccCCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 436 Q--GNPSQSIMEHKFPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGM 513 (756) Q Consensus 436 ~--~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~ 513 (756) + ......+..+++|+++++++++++.....++++|||+++++|..++ .| +.++++++++|.+.+..++|||+.++ T Consensus 392 g~v~~~a~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGi~d~~~G~~sn-~S--G~Ai~~rq~qg~~~~~~~~Dnl~~~~ 468 (708) T protein:vir:17 392 GNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSN-IA--QETVNNLMNRADMASFIYLDNMAKSL 468 (708) T ss_pred cccccccCCcccCCCccccHHHHHHHHHHHHHHHHhcCCChHHccCccc-hH--HHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1 1222334556788999999999999999999999999999997654 33 44488999999999999999999999 Q ss_pred HHHHHHHHHHHHhhCCCCcEEEEecC----ceeecC--------------HhHhcCcceEEEecccccHHHHH--HHHHH Q lcl|NC_019423. 514 ADIGTKICAMNAVFLSEKEVVRITNE----QYVEIK--------------REDLKGNFDIEVDINTAEIDNQK--SQDLG 573 (756) Q Consensus 514 ~~l~~~~l~li~q~~~~~r~iRI~g~----~~v~i~--------------~d~~~~~~Dv~V~~g~a~~~~~~--~q~l~ 573 (756) +++|+++|+||.+||+++|+|||+|+ +|+.+| +|..+|+|||+|++++++.+.+. 
.+.|+ T Consensus 469 ~~~g~~lL~lI~~~y~~~R~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~~Dv~v~~~p~~~t~r~~~~~~l~ 548 (708) T protein:vir:17 469 KRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLT 548 (708) T ss_pred HHHHHHHHHHHHHHcCCCcEEEEecCCCCcceeeecceeccCCCccceeeccceeeeeeEEEecccCchhHHHHHHHHHH Confidence 99999999999999999999999986 355554 46667899999999988765433 23455 Q ss_pred HHHHHhhccCCHhHHHHHHHHHHhhcCCh---hHHHHhhhccCCC------Chhh----hhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 574 FMVQTLGNTVDQSITLSLVAKIAELKRMP---DLAHELRTWQPQP------DPME----EQLKQLAIQKAQLENEELQSK 640 (756) Q Consensus 574 ~llq~~~~~~~~~~~~~~l~~l~e~~~~~---~~~~~l~~~~~q~------~p~~----~~~~q~~~~~aq~e~~~~qa~ 640 (756) ++++.+.+..+ ....++..+++.+++| ++.+.++...+++ .+.. +++++.+++++++++.+++++ T Consensus 549 qll~~~~~~~~--~~~~~~~l~l~~~D~p~~~ei~e~ir~~~~~~~~~~~~~~e~~q~~~q~qq~~q~q~~~~~~eaqa~ 626 (708) T protein:vir:17 549 NVLSSMLPADP--MRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQ 626 (708) T ss_pred HHHHhcCCccc--hhHHHHHHHHHhcCCCChHHHHHHHHHHhhccccccCcchhhHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 66665544332 2222334445555554 5666665433321 1111 112222223333334344444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHhhccCCchhhhccCCCCC Q lcl|NC_019423. 641 IALNNAKAKEAASSGDLKDLDYLEQESGTKHA-----RDMEKQKAQSQGNQNLQITKALTTPTKEGETTPNISAAVGYNT 715 (756) Q Consensus 641 a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~-----~~~~~~~~q~~~~~~~~~~~a~~~~~~~~~~~~~~~~a~~~~~ 715 (756) ..+.+|+++++++++..++++..++.....++ +.++. ...++...+.+.++.+...+..+ +-..+++ T Consensus 627 ~~~~qAe~~ka~aea~~~q~~a~q~~~~~~~a~~~a~q~~~q-~~~~~~~~~~~~~~~l~~~q~~q---~q~~~a~---- 698 (708) T protein:vir:17 627 MVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQ-ARNIDDKAVMEAIRLLKDVAESQ---QQQFQSP---- 698 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhhhhhhhH---HHHHhcc---- Confidence 44444444444444333333322222211111 11111 11222333344445444444332 1222221 Q ss_pred CCcccCchhcCCC Q lcl|NC_019423. 
716 LTNGNSPQERDLA 728 (756) Q Consensus 716 ~~~~~~~~~~~~~ 728 (756) |. -|-..+|. T Consensus 699 --p~-~~~~~~~~ 708 (708) T protein:vir:17 699 --PQ-SPADLMPS 708 (708) T ss_pred --cc-CchhccCC Confidence 11 11111111 No 18 >protein:vir:105429 Length: 708 # NCBI annotation: gene 3 protein # Family: family:all:487 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958179;genbank:gi:41057281;genbank:GeneID:2716676 Probab=100.00 E-value=2.6e-82 Score=467.88 Aligned_cols=602 Identities=13% Similarity=0.108 Sum_probs=381.0 Q ss_pred CCc--hHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccc--CCCCC--------CCCCCCcccCHHHHHHHHHHHHHH Q lcl|NC_019423. 22 WKK--EPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKG--KAKPP--------KIKGRSQVQPRLVRRQAEWRYAPL 89 (756) Q Consensus 22 ~~~--~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~--~~~~~--------~~~grS~~v~~~v~~~~e~~~~~L 89 (756) |.+ +.++..+...|+.+..+.........+..+||+++| +.+.. ...||+-++-+.|+.+|+|++..- T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~D~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~i~~~v~~v~g~~ 80 (708) T protein:vir:10 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEY 80 (708) T ss_pred CchhHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCCCHHHHHHHHHhhhhcCCCceEEcchHHHHHHHHHHH Confidence 432 356677777788887777777777777888887544 32211 135889999999999999998876 Q ss_pred HHhhcCCCCEEEEecCCcc-hHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeeeeeeeeee Q lcl|NC_019423. 90 SEPFLSSSKLFKLTPVTFE-DELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIKTETPVFQ 168 (756) Q Consensus 90 ~~~f~~~~~~~~~~p~~~~-D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~ 168 (756) -.+-.=+.|.|.+++ |.+.|+..|.+++|+.. .++.-....++|+++|++|.|++++.=+++.+. 
T Consensus 81 ----~~nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~-~~~~~~~~s~Af~d~i~~G~Gw~~~~~d~~~e~--------- 146 (708) T protein:vir:10 81 ----RNNRITVKFRPGDREASEELANKLNGLFRADYE-ETDGGEACDNAFDDAATGGFGCFRLTSMLVNEY--------- 146 (708) T ss_pred ----HhCCcceEEEcCCCCchHHHHHHHHHHHHHHHH-hcCchHHHHHHHHhhhhcccceeeeeecccccc--------- Confidence 345566999999765 89999999999999866 444444577899999999999776533322100 Q ss_pred cCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCcee--EEEechhh Q lcl|NC_019423. 169 LYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPT--VEMLNPNN 246 (756) Q Consensus 169 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~--ie~V~p~~ 246 (756) ++... ..+++ ....|+.+ T Consensus 147 ----------------------d~~~~--------------------------------------~~~i~i~~~~~p~~~ 166 (708) T protein:vir:10 147 ----------------------DPMDD--------------------------------------RQRIAIEPIYDPSRS 166 (708) T ss_pred ----------------------CCCCC--------------------------------------ccccceEEeecchhh Confidence 00000 00122 33345679 Q ss_pred eEeCCCCcC-ccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceEEEEEEEE Q lcl|NC_019423. 247 VVIDPSCNG-DLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVAYEYWG 325 (756) Q Consensus 247 ~~~Dp~a~~-d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~E~w~ 325 (756) |||||.|++ |++||+|+++++|||+++++.+++.... ...++.. .. ...+.| ...+.|+|.|||. T Consensus 167 v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~p~~a~-~~~d~~~-----~~------~~~~~~--~~~d~v~v~ey~~ 232 (708) T protein:vir:10 167 VWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPP-TSLDVTS-----MT------SWEYNW--FGADVIYIAKYYE 232 (708) T ss_pred cccCccccccChhhhhhhhhccCCCHHHHHHhCCCCcc-ccccccc-----CC------Cccccc--cCCCceEEEEeee Confidence 999999985 9999999999999999999998765321 0011110 00 011112 2234588888887 Q ss_pred Eeec------------------cCC------------ce----------eEEEEEEEECCEEEEecccccCCCccceEEe Q lcl|NC_019423. 
326 FYDI------------------NDD------------GS----------LEPIVATWIGSTLIRMENNPFPDGKLPLVVV 365 (756) Q Consensus 326 k~d~------------------~~~------------g~----------~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~ 365 (756) +.-. +++ |. .+.+++++.|..+| .+++||||++|||||+ T Consensus 233 r~~~~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~l-e~~~~~p~~~fP~vP~ 311 (708) T protein:vir:10 233 VRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFL-EKPRRIPGEHIPLIPV 311 (708) T ss_pred EEEEEEEEEEEecCCCCceeeecchhhhhHHHHHHhcccchhheeeeeeEEEEEEeecchhhh-ccCCCCCCCceeeEEE Confidence 6311 000 10 01233455666666 6789999999999999 Q ss_pred eeeee--cCcccCCchHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhhhhccccc----cccccc--- Q lcl|NC_019423. 366 PYMPR--KRELFGEADAELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDY----EYNPMQ--- 436 (756) Q Consensus 366 ~~~~~--~~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~----~~~~~~--- 436 (756) ++++. ++...++|+||+++|+||++|+++|++.++++++.+..++++.+++.+....+.....+. +.+.+. T Consensus 312 ~g~r~~~d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 391 (708) T protein:vir:10 312 YGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKS 391 (708) T ss_pred eeeeeccCCCcccceeecccchhHHHHHHHHHHHHHHHHhcCCcccccChhhhhhHHHHHhhccccchhhhccccccccc Confidence 99876 566667899999999999999999999999999999999999888865533332222221 111111 Q ss_pred ---cccccccccccCCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 437 ---GNPSQSIMEHKFPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGM 513 (756) Q Consensus 437 ---~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~ 513 (756) ......+..++++++|++++++++.....++++||++++++|..++ . 
++.+|++++++|.+.+..++|||+.++ T Consensus 392 G~~~~~~~~~~~~q~~~~~~~~~~l~q~~~~~i~~vsG~~~~~lG~~sn-~--SG~aI~~rq~qg~~~l~~~~Dnl~~~~ 468 (708) T protein:vir:10 392 GNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSN-I--AQETVNNLMNRADMASFIYLDNMAKSL 468 (708) T ss_pred cccccccCCccccCCccchHHHHHHHHHHHHHHHHHhCcChhHccCccc-h--HHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1112234455678899999999999999999999999999997554 3 445599999999999999999999999 Q ss_pred HHHHHHHHHHHHhhCCCCcEEEEecCc----eeecC--------------HhHhcCcceEEEecccccHHH--HHHHHHH Q lcl|NC_019423. 514 ADIGTKICAMNAVFLSEKEVVRITNEQ----YVEIK--------------REDLKGNFDIEVDINTAEIDN--QKSQDLG 573 (756) Q Consensus 514 ~~l~~~~l~li~q~~~~~r~iRI~g~~----~v~i~--------------~d~~~~~~Dv~V~~g~a~~~~--~~~q~l~ 573 (756) +++|+++|+||++||+++|++||+|++ ++.+| +|..+|+|||+|++++++.+. +..+.|+ T Consensus 469 ~~~g~~lL~li~~~y~~er~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~yDv~i~~~p~~~s~r~~~~~~l~ 548 (708) T protein:vir:10 469 KRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLT 548 (708) T ss_pred HHHHHHHHHHHHHHcCCCcEEEEecCCCCcceEEecceeccCCCcceeeeeccceeeEEEEEecccCchhHHHHHHHHHH Confidence 999999999999999999999999863 44443 677789999999999877543 3334556 Q ss_pred HHHHHhhccCCHhHHHHHHHHHHhhcCCh---hHHHHhhhccCCCC---h---h----hhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 574 FMVQTLGNTVDQSITLSLVAKIAELKRMP---DLAHELRTWQPQPD---P---M----EEQLKQLAIQKAQLENEELQSK 640 (756) Q Consensus 574 ~llq~~~~~~~~~~~~~~l~~l~e~~~~~---~~~~~l~~~~~q~~---p---~----~~~~~q~~~~~aq~e~~~~qa~ 640 (756) ++++.+.|..+ ....++..+++++++| ++.+.+++..+++. | . 
.+++++++++++++++.+++++ T Consensus 549 qll~~~~p~~~--~~~~~~~~~l~~~D~p~~~ei~erir~~~~~~~~~~~~~~ee~q~~~~~q~~~q~q~~~~~~e~qa~ 626 (708) T protein:vir:10 549 NVLSSMLPTDP--MRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQ 626 (708) T ss_pred HHHHhcCCCch--hhHHHHHHHHHhcCCcChHHHHHHHHHhhcccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 66666644322 2223344445555555 56666665443321 1 1 1112222233333344444444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCchhhhccCCCCC Q lcl|NC_019423. 641 IALNNAKAKEAASSGDLKDLDYLEQESGTKH-----ARDMEKQKAQSQGNQNLQITKALTTPTKEGETTPNISAAVGYNT 715 (756) Q Consensus 641 a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~-----~~~~~~~~~q~~~~~~~~~~~a~~~~~~~~~~~~~~~~a~~~~~ 715 (756) ..+.+|+++++++++...+++..+++....+ .+.++. ....+...+.++++.+.+.+..+ +-..++ T Consensus 627 ~~~~qAe~~ka~a~a~~~~~~a~q~~~~~~~a~~~a~q~~~~-a~~~~~~~~~~~~q~l~~~q~~q---~~~~~~----- 697 (708) T protein:vir:10 627 MVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQ-ARNIDDKAVMEAIRLLKDVAESQ---QQQFQS----- 697 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhhhhhhhH---HHHHhc----- Confidence 4444444444444333332222222211111 111111 11122222333444444433322 111222 Q ss_pred CCcccCchhcCCC Q lcl|NC_019423. 716 LTNGNSPQERDLA 728 (756) Q Consensus 716 ~~~~~~~~~~~~~ 728 (756) +| +.|...+|+ T Consensus 698 -~p-~~~~~~~p~ 708 (708) T protein:vir:10 698 -PP-QSPADLMPS 708 (708) T ss_pred -cc-cCchhccCC Confidence 22 223223222 No 19 >protein:vir:3520 Length: 720 # NCBI annotation: P19 # Family: family:all:487 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050980;genbank:gi:9633566;genbank:GeneID:1262313 Probab=100.00 E-value=7e-81 Score=460.03 Aligned_cols=612 Identities=14% Similarity=0.121 Sum_probs=373.5 Q ss_pred CCch--HHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccc--CCCCC--------CCCCCCcccCHHHHHHHHHHHHHH Q lcl|NC_019423. 
22 WKKE--PSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKG--KAKPP--------KIKGRSQVQPRLVRRQAEWRYAPL 89 (756) Q Consensus 22 ~~~~--~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~--~~~~~--------~~~grS~~v~~~v~~~~e~~~~~L 89 (756) |++. .++..+...++.+..+....-.+..+..+||+++| +.+.. ...||..++-+.|+.+|+|++..- T Consensus 1 ma~~~~~~l~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~~~~~l~~~~~P~~~~N~i~~~v~~v~g~~ 80 (720) T protein:vir:35 1 MAETLQKRHEQIMRKFDRAHSPQEAVREKCLEATRFARVPGGQWEGATAAGSELGKHFEKYPKFEINKISTELNRIISEY 80 (720) T ss_pred CchHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhhccCCCCCCHHHHHHHHHHHhhCCCCeEEEccHHHHHHHHHhHH Confidence 7663 55666666777776666555555556788887544 32211 135899999999999999997776 Q ss_pred HHhhcCCCCEEEEecCCcc-hHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeeeeeeeeee Q lcl|NC_019423. 90 SEPFLSSSKLFKLTPVTFE-DELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIKTETPVFQ 168 (756) Q Consensus 90 ~~~f~~~~~~~~~~p~~~~-D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~ 168 (756) -.+-.=+.|.|++.+ |++.|+..|..++|+.. .++.-....++|+++|++|.|++++.|+++.+. T Consensus 81 ----~~nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~-~~~~~~~~s~Af~~~i~~G~G~~~v~~d~~~~~--------- 146 (720) T protein:vir:35 81 ----RHNRITVKFRPGDKTASEALANKLNGLFRADYE-ETDGGEACDNAFDDGSTGGFGCFRLTTNLVNAL--------- 146 (720) T ss_pred ----HhCCCceEEEcCCCcchHHHHHHHHHHHHHHHH-hcCchHHHhHHHHHhhhccceeEEeeecccccC--------- Confidence 344455999999665 99999999999999865 444445567899999999999999988765210 Q ss_pred cCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEEE--echhh Q lcl|NC_019423. 169 LYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVEM--LNPNN 246 (756) Q Consensus 169 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie~--V~p~~ 246 (756) ++. 
...+++++++ +|+.+ T Consensus 147 ----------------------d~~--------------------------------------~~~~~i~i~~v~~~~~~ 166 (720) T protein:vir:35 147 ----------------------DPM--------------------------------------DERQRICLEPIYDPARS 166 (720) T ss_pred ----------------------CCC--------------------------------------cccceeeEecccCchhh Confidence 000 0012355655 46789 Q ss_pred eEeCCCCcC-ccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceEEEEEEEE Q lcl|NC_019423. 247 VVIDPSCNG-DLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVAYEYWG 325 (756) Q Consensus 247 ~~~Dp~a~~-d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~E~w~ 325 (756) |||||.|++ |++||+|+++++|||+++++.+++........+ ......+.+.+ .+.|+|+|||. T Consensus 167 v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~~~~~-------------~~~~~~~d~~~--~~~v~i~E~~~ 231 (720) T protein:vir:35 167 VWFDPDAKKYDKSDAEWAFCMYSLSAEKYKAEYNKDPATLMSG-------------IERSWDYDWYD--VDVVYIAKYYE 231 (720) T ss_pred eeecccccccChhhhhhhhhhcCCCHHHHHHhCCCcccccccc-------------ccccccccccC--CCceEEEEeeE Confidence 999999985 999999999999999999999876532211100 00011112222 35699999987 Q ss_pred Eeec------------------cCC------------cee-------EEE--EEEEECCEEEEecccccCCCccceEEee Q lcl|NC_019423. 326 FYDI------------------NDD------------GSL-------EPI--VATWIGSTLIRMENNPFPDGKLPLVVVP 366 (756) Q Consensus 326 k~d~------------------~~~------------g~~-------~~~--~~~~~g~~~L~~~~~P~~~~~~Pfv~~~ 366 (756) +..+ +++ |.. +.+ ++..++++++-.+++|+||++|||||++ T Consensus 232 ~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~v~~~~~~g~~~l~~~~~~p~~~fP~vP~~ 311 (720) T protein:vir:35 232 VKKESVDVVSFQNPLTSETVTYDSDQLELVEDELADIGFIEAARRTIKRRRVYVSVVDGEGFLEKAQRIPGEHIPLIPVY 311 (720) T ss_pred EEEEEEEEEEeecCCCCCeeecCCccHHHHHHHHhhhccccccccceeEEEEEEEeeccchhcccCCCCCCCccceEEEE Confidence 6321 111 110 111 2223466666678899999999999999 Q ss_pred eeee--cCcccCCchHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhhhhccccc-------ccccccc Q lcl|NC_019423. 
367 YMPR--KRELFGEADAELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDY-------EYNPMQG 437 (756) Q Consensus 367 ~~~~--~~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~-------~~~~~~~ 437 (756) +++. ++..+++|+||+++|+||++|+++|.+++++++ .+.+++.|++++.+.......... +++.+.. T Consensus 312 g~r~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~---~~~~~~~~a~~~~~~~~~~~a~~~~~~~~~l~~~~~~~ 388 (720) T protein:vir:35 312 GKRWFIDDIERVEGHIAKAMDAQRLYNLQVSMLADSATQ---DTGSIPIVGKSQIKTLEKYWANRNKNRPAFLPLNEIVD 388 (720) T ss_pred eeeeccCCCcccceeeecchhHHHHHHHHHHHHHHHHHc---CCccccccCcchHHHHHHHhhccccccccccccccccc Confidence 9876 666767899999999999999999999999854 467788888887665544332211 1121111 Q ss_pred c------cccccccccCCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 438 N------PSQSIMEHKFPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAK 511 (756) Q Consensus 438 ~------~~~~i~~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~ 511 (756) . +...+.+.+.++++++.+.+++.....++++|||+++++|..+| . ++.+|++++++|.+.+..++|||+. T Consensus 389 ~~G~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~vsGi~~~~lG~~sn-~--SG~Ai~~rq~qg~~~~~~~~Dnl~~ 465 (720) T protein:vir:35 389 KQGNIIAPPTPVGYTQPQPLNQAMAALLQQTGADIQEVTGSSQAMQPMPSN-I--AKETVNHLMHRSDMSSFIYLDNMAK 465 (720) T ss_pred cCcccccCCCcccccCCCCCchHHHHHHHHHHHHHHHHhCCChHHcCcccc-h--HHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1 23356777888999999999999999999999999999998765 3 4445899999999999999999999 Q ss_pred HHHHHHHHHHHHHHhhCCCCcEEEEecC----ceeecC--------------HhHhcCcceEEEecccccHHHH--HHHH Q lcl|NC_019423. 512 GMADIGTKICAMNAVFLSEKEVVRITNE----QYVEIK--------------REDLKGNFDIEVDINTAEIDNQ--KSQD 571 (756) Q Consensus 512 ~~~~l~~~~l~li~q~~~~~r~iRI~g~----~~v~i~--------------~d~~~~~~Dv~V~~g~a~~~~~--~~q~ 571 (756) +++++|+++|+||++||+++|+|||+|+ +++.+| +|..+|+|||+|++++++.+.+ ..+. 
T Consensus 466 ~~~~~g~~lL~lI~~~y~~er~~RI~~ed~~~~~v~~n~~~~d~~~g~~v~~NDi~~g~yDv~v~~~p~~~s~req~~~~ 545 (720) T protein:vir:35 466 SLKRAGEVWLSMAREVYGSDRQVRIVNADGTDDIALMSVVINDNQTGQVVAMNDLSSGRYDVTVDVGPSYTARRDATVSV 545 (720) T ss_pred HHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceEeechhhhccCCCceeeeecceeeeeEEEEecccCcccHHHHHHHH Confidence 9999999999999999999999999985 355443 6777899999999998775432 2333 Q ss_pred HHHHHHHhhccCCHhHHHHHHHHHHhhcCChh---HHHHhhhccCCCC---h---hhhhH-HHH--HHHHHHHHHHHHHH Q lcl|NC_019423. 572 LGFMVQTLGNTVDQSITLSLVAKIAELKRMPD---LAHELRTWQPQPD---P---MEEQL-KQL--AIQKAQLENEELQS 639 (756) Q Consensus 572 l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~~---~~~~l~~~~~q~~---p---~~~~~-~q~--~~~~aq~e~~~~qa 639 (756) +++++..+.|..+ ....++..+++++++|. +.+.+++..++.. | ..++. +++ ++++++++.++++ T Consensus 546 m~qll~~~~p~~~--~~~~~~~~ile~~d~p~~~e~~erirk~~~~~~~~~~~~~e~qq~~a~~qq~~qq~~~e~~~aq- 622 (720) T protein:vir:35 546 LTNLLAGMLPQDP--MRQVLQGIILDNMEGEGLDEFKEYNRKQLLTQGVVKPRNTEEEQMVAQMIQQAQQPNAELVAAQ- 622 (720) T ss_pred HHHHHHhcCCCch--hHHHHHHHHHHhcCchhHHHHHHHHHhhcchhcccCccChhHHHHHHHHHHHHHhHhHHHHHHH- Confidence 4444444433222 23345556677777764 5555554432211 1 11110 111 1112222222222 Q ss_pred HHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHH---HHhhccCCchhhhccCCC Q lcl|NC_019423. 640 KIALNNAKAKEAASSGDLK--DLDYLEQESGTKH-ARDMEKQKAQSQGNQNLQITKALT---TPTKEGETTPNISAAVGY 713 (756) Q Consensus 640 ~a~~~~a~a~~~~aq~~~~--~~~~~~q~~~~k~-~~~~~~~~~q~~~~~~~~~~~a~~---~~~~~~~~~~~~~~a~~~ 713 (756) +.+.+++++..+++++.. +++..++++.... ++.+....+++++.++..+.+++. ..+.+....+-..++..- T Consensus 623 -a~l~qaqae~~kaqa~~~~~qa~a~~aqa~a~~~~a~~~~~~aq~~~~~q~~i~qalq~~~~~q~~q~~~eqa~~el~~ 701 (720) T protein:vir:35 623 -GVLMQGQAEVQKAKNEELAIQVKAFQAQTEARVAEAKMVQILASADSAKRAEIREALKMLHQFQKEQGDASRADAELIL 701 (720) T ss_pred -HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcchHHHHHHHHhh Confidence 222233333222222222 2222221111111 111222222223333333333322 111121111111111100 Q ss_pred CCCCcccCchhcCCCCCCCC Q lcl|NC_019423. 
714 NTLTNGNSPQERDLAAQQDP 733 (756) Q Consensus 714 ~~~~~~~~~~~~~~~~~~~~ 733 (756) ..+.+. --.+++.+.-.+- T Consensus 702 ~~~~~~-~~~~~~~~~~~~~ 720 (720) T protein:vir:35 702 KATDTQ-HKQNRDAAKNHSI 720 (720) T ss_pred cccchh-hhhhHHHhhccCC Confidence 110110 0111111111000 No 20 >protein:vir:94599 Length: 641 # NCBI annotation: PfWMP4_39 # Family: family:all:1548 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762669;genbank:gi:115304377;genbank:GeneID:5142299 Probab=100.00 E-value=3.3e-80 Score=456.34 Aligned_cols=609 Identities=16% Similarity=0.152 Sum_probs=400.9 Q ss_pred CCCCCCCcc---ccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCC------------CCCCCCCC Q lcl|NC_019423. 6 TFKPLPDPA---QSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKA------------KPPKIKGR 70 (756) Q Consensus 6 ~~~~~~~~~---~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~------------~~~~~~gr 70 (756) -+.|+|-+. +-++-.+++++.+.+.|.+.++.+++.++..+++|++.++||..++.+ ......+| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~R~~~e~~W~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r 80 (641) T protein:vir:94 1 MTIEMPTPIIEDKESAKRKLSTDRIGGVVISKWQESRDKRNTVENNWDETYELYRASAIDRQNTRARNFQTTGADDADWR 80 (641) T ss_pred CccCCCcccccCCcchhhcCCchhHHHHHHHHHHHHHHhhcchHHHHHHHHHHhhcchhhhhhcccccccccccchhccc Confidence 334444333 112223567778999999999999999999999999999998754332 11234569 Q ss_pred CcccCHHHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEE Q lcl|NC_019423. 
71 SQVQPRLVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIAR 150 (756) Q Consensus 71 S~~v~~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k 150 (756) |+++++++.+.++|++++|++.||++++||+|.|.++||+++|++.+.|+|+++ .+++++.++++|+++++..|+||+| T Consensus 81 ~ki~~~~~~~~~~~l~s~Lm~~~~p~~~wf~~~p~~~ed~~~A~~~~~~~~~~l-~~~~~~~~~~~~~~d~~~~g~~iv~ 159 (641) T protein:vir:94 81 HRINTGHTFEVVETLVAYFKGATFPSDDWFDLKGMVPELADAARVVKQLTKTKL-EAASIRDIFETYVRNLVLYGVSTYR 159 (641) T ss_pred ccccchhHHHHHHHHhhHHhhhhcCCCceEEEecCCCChHHHHHHHHHHHHHHH-hhcchHHHHHHHHHHHhhcCceEEE Confidence 999999999999999999999999999999999999999999999999999998 6788999999999999999999999 Q ss_pred EeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEee Q lcl|NC_019423. 151 IGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVE 230 (756) Q Consensus 151 ~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~ 230 (756) ++|++++++..+++.. ..|.-+ .. +..+.+ T Consensus 160 ~~w~~~~~~~~~~~~~---------------------------------------------~~~~~~--~~--~~~~~v- 189 (641) T protein:vir:94 160 LGWDTSMERQFKRTFV---------------------------------------------ETGDIF--GG--WEDVAV- 189 (641) T ss_pred eehhhHHHHhhhhhcc---------------------------------------------cchhhc--cc--ccccce- Confidence 9998776543332110 000000 00 000011 Q ss_pred eeecCceeEEEechhheEeCCCCcCccccCceEEEE-eecCHHHHHhhccchhhhcccCchhhhhhhchhhhcccccccc Q lcl|NC_019423. 231 KALVNRPTVEMLNPNNVVIDPSCNGDLDKALYAVIS-FETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQ 309 (756) Q Consensus 231 ~~~~g~~~ie~V~p~~~~~Dp~a~~d~~da~~v~~~-~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 309 (756) ....+.++++.|+|++|||||+|+. +++.|+++| +.+|+.+|...++ .+++.+++..... 
......+...+.+ T Consensus 190 ~~~~~~~r~~~v~~~di~~dps~~~--~~~~f~~~r~t~~t~~~l~~eg~--~~~d~v~~~~~~~--~~~~~~d~~~d~~ 263 (641) T protein:vir:94 190 NRQRSELRIEPLSPYDVWLDTSGGK--NTGTFVRLRHTREELHELVTSGY--YDLDLTQVEQYVD--YKFADPDTPKDVN 263 (641) T ss_pred ecccceeeEEecchhheeecCCCCc--ccccceehhhhHHHHHHHHhcCC--CChhhcchhhccc--ccccccccccccc Confidence 1124568999999999999999864 456666554 3445555544432 2333332221111 1111122223333 Q ss_pred ccccccceEEEEEEEEEeeccCCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHH Q lcl|NC_019423. 310 FKDALRKKVVAYEYWGFYDINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAI 389 (756) Q Consensus 310 ~~d~s~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~ 389 (756) +.+.+ +.++||+|+.++.++.. .+.+++++.|+++|+.+.++|+. .+||++++|.+.+|++||.|+++.+.+.|+. T Consensus 264 ~~~~~--~~~~~e~~gd~~~d~~~-~~~~~~~~~g~~il~~~~~~~~d-~~Pf~~~r~~~~~~~~YG~gp~~~~l~dqk~ 339 (641) T protein:vir:94 264 GTDTS--GWDIIEYYGPLLVEGVQ-FWCVHAVFYGKQLIRLSDSKYWC-GSPFVTTTLLPDRDSVYGMSVLHPNLGALHV 339 (641) T ss_pred ccccc--ccceeeeeeeeccCCCc-eeeEEEEEeCCEEeecccccccC-cCCeEEecceecCCcccCCChHHHHHHHHHH Confidence 33333 45788999855544332 23466888999999999998754 5699999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhhcCCceEeeccccCccchhhhhccccccccccccccccccccccCCCc-chHHHHHHHHHHHHHHH Q lcl|NC_019423. 390 LGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQGNPSQSIMEHKFPEL-PQSAIVMTQMQNQEAES 468 (756) Q Consensus 390 iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~-~~~~~~~l~~~~~~~e~ 468 (756) +|++.|.+++++.++++|++++..+++.........++. .+..+....++++..+.. 
....+.++++....+++ T Consensus 340 ln~l~r~~ld~~~~~~~p~~~~~~~~~~~~~~l~~~PG~-----ii~~~~~~~v~pl~~~~~~~~~~~~~~~~~~~~i~~ 414 (641) T protein:vir:94 340 LNVLTNGRLDNLVLHINKMWTLVEDGILKREDVKAKPGA-----VFKVAQHGSLQPIDMGRQDFVVTYQEAQVQESSVYR 414 (641) T ss_pred HHHHHHHHHHHHHHHhCCeeeeccccccccceeeccCCc-----ceeeCCCCcceeecCCccccchhHHHHHHHHHHHHH Confidence 999999999999999999999888776443222222322 222233334566544332 23455678888889999 Q ss_pred HhchhHHhcCCCcccc-chhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhCCCCcEEEEecC-----ce Q lcl|NC_019423. 469 LTGVKAFSGGVTGSAY-GDVAAGIRGALDAASKREMAILRRLAK-GMADIGTKICAMNAVFLSEKEVVRITNE-----QY 541 (756) Q Consensus 469 ~tGv~~~~~G~~~~a~-~~tA~~i~~~~~aa~~~l~~~~~n~~~-~~~~l~~~~l~li~q~~~~~r~iRI~g~-----~~ 541 (756) .+++...++|..+... ..||++++++++++++++..++++|++ +++++++.++.+++++++.+.++|+.|. .| T Consensus 415 ~~~~~~~~~~~~~~~~~~~TAtEV~~~~~e~~~~l~~i~r~l~~e~l~pll~~~~~~~~~~~~~p~i~R~~~~~~~~~~~ 494 (641) T protein:vir:94 415 NTSTGPLIGNAAPRGGERVTAAEIQGVRDAGGNRLSSVHTHIEDSSTLPLLNKVFSLLQQFYVTPETIRMYVPEEQMDGF 494 (641) T ss_pred hhhhhhhhcccccccchhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccchhhhhhhchhhhcccC Confidence 9999998888765432 469999999999999999999999996 9999999999999999999999999986 48 Q ss_pred eecCHhHhcCcceEEEeccccc--HHHHHHHHHHHHHHHhhcc--CCHhH-HHHHHHHHHhhcCChhHHHHhhhccCCCC Q lcl|NC_019423. 542 VEIKREDLKGNFDIEVDINTAE--IDNQKSQDLGFMVQTLGNT--VDQSI-TLSLVAKIAELKRMPDLAHELRTWQPQPD 616 (756) Q Consensus 542 v~i~~d~~~~~~Dv~V~~g~a~--~~~~~~q~l~~llq~~~~~--~~~~~-~~~~l~~l~e~~~~~~~~~~l~~~~~q~~ 616 (756) +++.|+++++++++ |..|.+. ......++|+++++.++.. +...+ ...++..+++.+|++....+++...+++. 
T Consensus 495 ~~~~p~~L~~~~~i-v~l~~~q~~~~~~~i~~l~~~~~~~a~~P~v~d~~d~~~~~~~~~~~~g~~~p~~~ir~~~~~~~ 573 (641) T protein:vir:94 495 FEVSPEYLHYPYKF-LALGANYVVERERMVTDLLQLLDISGRVPQIGQSLDYALILEDLLRQMRFTDPMRYIKKAEAPPA 573 (641) T ss_pred CCCCccceeeeeeE-eecchhHHHHHHHHHHHHHHHHHHhhcChhhhhcCCHHHHHHHHHHHhCCCCchhhccCccCchh Confidence 88999999999998 4444332 2334455677777776632 11111 22357888899999888888875443322 Q ss_pred hhhhhHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 617 PMEEQLKQLAIQKAQLENEELQSKI--ALNNAKAKEAASSGDLKDLDYLEQESGTKHARDMEKQKAQSQGNQNLQITKAL 694 (756) Q Consensus 617 p~~~~~~q~~~~~aq~e~~~~qa~a--~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~~~~~~q~~~~~~~~~~~a~ 694 (756) +.+ .+++++| ++...+|+. +....++ ....+. .+|+....++... ..+.+ T Consensus 574 ~~~-----~~~~~~q-~~~~~~a~~~~~~~~~~a-------~~~~~~-----------~~~~~~~~~~~~~-~~~~~--- 625 (641) T protein:vir:94 574 APP-----IAPAEPG-ALPPEMMNSVGGGLNDQA-------IAGMTP-----------EDVSDLASRIGID-TSDVA--- 625 (641) T ss_pred HHH-----HHHHHHH-HHHHHHHHHHHhhhHHHH-------HHHhhH-----------HHHHHHHHhhcCC-chhhh--- Confidence 211 1111111 000111110 1111111 000000 0111111100000 00111 Q ss_pred HHHhhc-cCCchhhhccC Q lcl|NC_019423. 695 TTPTKE-GETTPNISAAV 711 (756) Q Consensus 695 ~~~~~~-~~~~~~~~~a~ 711 (756) .|.- ++||...+++. T Consensus 626 --~~~~~~~~~~~~~~~~ 641 (641) T protein:vir:94 626 --PEAMAAATQQITSGAL 641 (641) T ss_pred --HHHHhcccccccccCC Confidence 1111 23333334543 No 21 >protein:vir:95449 Length: 584 # NCBI annotation: hypothetical protein ORF047 # Family: family:all:1548 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294640;genbank:gi:149408206;genbank:GeneID:5237016 Probab=100.00 E-value=5.2e-79 Score=449.78 Aligned_cols=552 Identities=14% Similarity=0.116 Sum_probs=379.5 Q ss_pred CCcccCCCCCCCccccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCC--CCCCCCCCcccCHHH Q lcl|NC_019423. 
1 MEHQDTFKPLPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAK--PPKIKGRSQVQPRLV 78 (756) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~--~~~~~grS~~v~~~v 78 (756) |.-. . -..+|+-+ -+.+-+-|..+++++.+.++..+-.|.|.++||++.-..+ .-+..+|||++-+++ T Consensus 1 ~~~~-------~-~~~~~~~~--~~~~~~~v~~~~~~~~~~r~~~~~~w~el~~y~~a~~~~~~~~~~~~~r~~~~~~k~ 70 (584) T protein:vir:95 1 MSVK-------V-AELNSLLV--RDSSAQWVAYLWDRFNNQRRQKIEEWKELRNYVFATDTTTTSNQGLPWKNSTTLPKL 70 (584) T ss_pred CCcc-------h-hhhhhhcc--ccchHHHHHHHHHHHHhhhchhhccCHHHHHHHHhhhhhhhhhcccccccccchhHH Confidence 2111 0 11222222 2345678889999999999999999999999998643222 222457999999999 Q ss_pred HHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHH--HHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeee Q lcl|NC_019423. 79 RRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELA--ARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERK 156 (756) Q Consensus 79 ~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~--A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~ 156 (756) +..++|++++|+++||++++||+|.|..++|..+ |+....|+.-++. +.+=..++..+|+++++.|+||+|++|..+ T Consensus 71 ~~~~~~i~~~l~~~~Fp~~~w~~~v~~~~~~~~~~~~~ai~~~i~dkl~-e~~~~~~~~~~i~d~~~~G~~~~k~~~~~~ 149 (584) T protein:vir:95 71 CQIRDNLHSNYFSSLFPNDDWLRWVGYGKGDSTKTKAKAIQAYMSNKCR-ESHFRTEVSKLIYDYIDYGNAFATVSFEAK 149 (584) T ss_pred HHHHHHHHHHHHHhhcCccceeeeecCCCchhhHHHHHHHHHHHhhhhh-hccHHHHHHHHHHhhccCCceEEEEeEeec Confidence 9999999999999999999999999999999887 5555555533322 234456688999999999999999999876 Q ss_pred eeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCc Q lcl|NC_019423. 157 TVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNR 236 (756) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~ 236 (756) +.+..+. 
+ .+ ..+.+ T Consensus 150 ~~e~~e~----------------------------------------------------------~-----~v--~~~~~ 164 (584) T protein:vir:95 150 YKEMTDG----------------------------------------------------------T-----LV--PDYIG 164 (584) T ss_pred ceeeecc----------------------------------------------------------c-----cc--ccccc Confidence 5332210 0 00 01347 Q ss_pred eeEEEechhheEeCCCCcCccccCceEEEEeecCHHHHHhhc----cchhhhcccCchhhhhhhchh---hhcccccc-- Q lcl|NC_019423. 237 PTVEMLNPNNVVIDPSCNGDLDKALYAVISFETCKADLMKNK----DRYHNLDKIDWESSSPITDPD---HESKTPSD-- 307 (756) Q Consensus 237 ~~ie~V~p~~~~~Dp~a~~d~~da~~v~~~~~~t~~el~~~~----~~~~~l~~~~~~~~~~~~~~~---~~~~~~~~-- 307 (756) ++|++|+|++|||||+|+ +++|+.||+ |..+|+++|+.+. ..+-.++.+.+.......... +....+.. T Consensus 165 prieriSP~d~~~Dpsa~-~i~d~~fiv-rs~~T~~~L~~l~~~~~~~~y~~d~v~~~~~~~~~~~~~~~~~~~~~~~~~ 242 (584) T protein:vir:95 165 PRLVRISPLDIVFNPLAT-SISDTFKIV-RSVKTKGELMRLAQDEPEQSYWLEALKRREEICRHLGGYSVEDFDKAAGFD 242 (584) T ss_pred ceEEeeChhheeecCCCC-Cccchhhhh-hhhhhHHHHHHHHhhcCccccchHHHHHHHHhccCCCCCcccccccccccc Confidence 999999999999999995 589999998 6678999998763 112222222222111101111 01111111 Q ss_pred -----ccccccccceEEEEEEEEEe-ecc-CCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchH Q lcl|NC_019423. 308 -----FQFKDALRKKVVAYEYWGFY-DIN-DDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADA 380 (756) Q Consensus 308 -----~~~~d~s~~~V~v~E~w~k~-d~~-~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v 380 (756) ..+......+|+|+|+|+.+ +.. +++.....+.++.|+++|+.+.||||++++||++.+++|+++++||+|+. 
T Consensus 243 ~d~~~~~~ey~~~~~V~vl~~~g~~~~~~~~e~~~~~iv~v~~g~~iIR~~~np~~~~~~PF~~~~~~p~~~s~yG~gi~ 322 (584) T protein:vir:95 243 VDGFGNLYEYYMSDWVEILEFYGDYHDKETGELQTNRIITVVDRSTEVRNESIPTWFGSAPIYHVGWRFRPDNLWAMGPL 322 (584) T ss_pred cccccccccccCCceeEEEeecccccccccCCCcccceEEEEeccEEEEeeecCCCCCCCCEEEEcceeeeccccCCCch Confidence 11222234479999999853 533 44445555667889999999999999999999999999999999999999 Q ss_pred HHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhhhhccccccccccccccccccccccCCCc-chHHHHHH Q lcl|NC_019423. 381 ELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQGNPSQSIMEHKFPEL-PQSAIVMT 459 (756) Q Consensus 381 ~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~-~~~~~~~l 459 (756) +.+.|+|+++|.++|+++||++++++|. .+..++..+ ....+ ..++..+-.++++++++|.. -...++.+ T Consensus 323 ~ll~d~Q~~lna~~r~~iDnl~l~~~pv---~k~~~~~~~-~~~~p-----g~~~~~~~~~~~q~~~p~a~~~~s~~~~l 393 (584) T protein:vir:95 323 DNLVGMQYRIDHLENAKADAVDLIIQPP---LKIIGEVEE-FVWGP-----GAEIHLDQGGDVQEIAKNVNYIINADNQI 393 (584) T ss_pred hhhhhHHHHHhHHHHHHHHHHHHhcCcc---eeeccccch-hcccC-----CceeecCCCCCcceecCchhhhhHHHHHH Confidence 9999999999999999999999999983 333333211 12222 22222233334555554421 12455679 Q ss_pred HHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhhCCCCcEEEEec Q lcl|NC_019423. 460 QMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGM-ADIGTKICAMNAVFLSEKEVVRITN 538 (756) Q Consensus 460 ~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~-~~l~~~~l~li~q~~~~~r~iRI~g 538 (756) ++++..+++.|||+.+++|.++. 
.++||+++++++++++..+++++++|.+.+ ++++..++++..+|++...++|++| T Consensus 394 q~~e~~me~~sGvp~~~~G~~~~-~~~TAtg~s~l~naa~~~~r~~~~~f~~~ll~~l~~ll~~~~~~nmd~~~~vr~~n 472 (584) T protein:vir:95 394 QMLEDRMELYAGAPREAMGIRTP-GEKTAFEVQQLGNAAGRIFQEKVTTFEVELLEPVLNAMLETATRNMDGSDVIRVMD 472 (584) T ss_pred HHHHHHHHhhhCCChhhcccccc-hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCceeeec Confidence 99999999999999999998744 479999999999999999999999999865 8999999999999999999999998 Q ss_pred Cc-----eeecCHhHhcCcceEEEecccccHHH-HHHHHHHHHHH-HhhccCCHhHHHHHHHH-HHhhcCChhHHHHhhh Q lcl|NC_019423. 539 EQ-----YVEIKREDLKGNFDIEVDINTAEIDN-QKSQDLGFMVQ-TLGNTVDQSITLSLVAK-IAELKRMPDLAHELRT 610 (756) Q Consensus 539 ~~-----~v~i~~d~~~~~~Dv~V~~g~a~~~~-~~~q~l~~llq-~~~~~~~~~~~~~~l~~-l~e~~~~~~~~~~l~~ 610 (756) ++ |++|+|++++|+|+++...+++...+ ++.+++.++++ ++++.+.|......+.. +.++.++|.-. +.. T Consensus 473 ~e~~~~~f~~i~r~Dl~g~~~~va~Ga~~~~~keq~~q~l~~ilq~~~~~~i~p~~~~~~l~~~ladl~~~p~~~--~~~ 550 (584) T protein:vir:95 473 TDLGVKEFMSVTREDITANGKIRPIGARHFGKQAQDLQNLVGIFNSQIGQMILPHTSGKALATFVDDVTGLQGYE--IFR 550 (584) T ss_pred cccccccccccChhhhccCeeEEeehhhHHHHHHHHHHHHHHHHHhhhhhhccccchHHHHHHHHHHHhCCCccc--ccC Confidence 75 89999999999999998887665543 55677888877 66666666654444444 55666666321 111 Q ss_pred ccCCCChhhhhH--HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 
611 WQPQPDPMEEQL--KQLAIQKAQLENEELQSKIALNNAK 647 (756) Q Consensus 611 ~~~q~~p~~~~~--~q~~~~~aq~e~~~~qa~a~~~~a~ 647 (756) ++...+++ .|..+.++| +..+++++.....|- T Consensus 551 ----~~~~~~~Q~~~q~~~~~~q-~~~~~~~~~~~~~~~ 584 (584) T protein:vir:95 551 ----PNVAVAEQAETQSLVAQAQ-EDLQLQAQMPAEGAI 584 (584) T ss_pred ----CCcccchhHHHHhhhHHHH-HHHHHHHhhhhccCC Confidence 11111111 111111111 111111110000000 No 22 >protein:vir:3139 Length: 599 # NCBI annotation: hypothetical protein # Family: family:all:1548 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640321;genbank:gi:21234402;genbank:GeneID:956054 Probab=100.00 E-value=3.9e-70 Score=401.13 Aligned_cols=565 Identities=14% Similarity=0.092 Sum_probs=390.7 Q ss_pred CCcccCCCCCCCccccccccC-C-CchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCC--CCCCCCCCCcccCH Q lcl|NC_019423. 1 MEHQDTFKPLPDPAQSEKLTD-W-KKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKA--KPPKIKGRSQVQPR 76 (756) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~-~-~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~--~~~~~~grS~~v~~ 76 (756) |.- +...-.|+-. + .....+..|...++...+.++..+..|.|.++|-+..--. ...+...|+|+..+ T Consensus 1 m~~--------~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~r~~~~~~w~e~~~yi~~~~tr~t~~~~~~w~~s~t~~ 72 (599) T protein:vir:31 1 MST--------DIKTLQKMLEGRDDDRAFIDELVVLFTNMENARAQKDREDKELMDYIDATDTRKTSNSKLPFKNSTTIN 72 (599) T ss_pred Ccc--------chHHHHHHhhccCchHHHHHHHHHHHHhhhhhhhhhhcccHHHHHHHhhhcccccccCCCCcccccchH Confidence 210 2223333333 3 2334566788889998888988888899999998754222 24456799999999 Q ss_pred HHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchH--HHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeee Q lcl|NC_019423. 77 LVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDE--LAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWE 154 (756) Q Consensus 77 ~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~--~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~ 154 (756) .....++.++++++..+|.+++||+|+|..++|. ++++....|+.-++. 
+.+-+.++.++|++.++.|+.|.++-|+ T Consensus 73 k~~~~~~~l~a~~~~~~fp~~~w~d~~~~~~~~~~~~~~~~i~~yi~~Kl~-e~~~~~~~~~~v~d~i~~G~~vat~~~e 151 (599) T protein:vir:31 73 KLAHLHLMITTSYMEHLLPNRNWVDFVGFDNDSVNAEKREIARSYVRGKVE-ASNLEGVIERMVDDFAVRGFCVAHTRHV 151 (599) T ss_pred HHHHHHHHHHHHHHhhhcCCccceEeeecCCchhHHHHHHHHHHHhhhhhh-hcchHHHHHHHHhhhcccCceeEeeeEE Confidence 9999999999999999999999999999999963 455555556554432 4455677889999999999999999886 Q ss_pred eeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeec Q lcl|NC_019423. 155 RKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALV 234 (756) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~ 234 (756) ..+.... + | . ++ ..+ T Consensus 152 r~~~~~~----------------------------d-----------------------~-~-----------v~--~~~ 166 (599) T protein:vir:31 152 KRMTVTA----------------------------E-----------------------N-Q-----------VI--KNY 166 (599) T ss_pred Ecceeec----------------------------c-----------------------c-c-----------cc--ccc Confidence 4431110 0 0 0 00 113 Q ss_pred CceeEEEechhheEeCCCCcCccccCceEEEEeecCHHHHHhhc----cchhhhcccCchhhhhhhchhhhccccccccc Q lcl|NC_019423. 235 NRPTVEMLNPNNVVIDPSCNGDLDKALYAVISFETCKADLMKNK----DRYHNLDKIDWESSSPITDPDHESKTPSDFQF 310 (756) Q Consensus 235 g~~~ie~V~p~~~~~Dp~a~~d~~da~~v~~~~~~t~~el~~~~----~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 310 (756) -+|++++|+|+||||||+|. +++||.|++ |...|+.+|..+- +.+..++.+.+..+..........+.+....+ T Consensus 167 ~~P~~ervsP~Di~~Dp~A~-si~d~~fiv-Rs~~Tk~~L~~l~~~~~~~~y~~d~~~~~~~~~~~~~~~~~d~~~~~~g 244 (599) T protein:vir:31 167 SGTVTERLSPSDVFWDVTAD-SLPKAAKCI-RQLYTLGSLKREIEEGTFPLMSMEDFQKLREERRTIREALADGYNGRRK 244 (599) T ss_pred ccceEEeecccceeeCCCCC-CCCcceeee-ehhhhHHHHHHHhccCCccccchHHHHHHHhhccCCCccccchhhhhhh Confidence 46999999999999999995 589998877 8899999998763 23344444444333322223333344444455 Q ss_pred cccccce-------------EEEEEEEE-EeeccCCceeEEEEEEEECC-EEEEecccccCCCccceEEeeeeeecCccc Q lcl|NC_019423. 
311 KDALRKK-------------VVAYEYWG-FYDINDDGSLEPIVATWIGS-TLIRMENNPFPDGKLPLVVVPYMPRKRELF 375 (756) Q Consensus 311 ~d~s~~~-------------V~v~E~w~-k~d~~~~g~~~~~~~~~~g~-~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~ 375 (756) .|.++.+ |.++|+|+ .++.++++....++++|+|+ ++++.+.||||+|++||++.+|.|+++++| T Consensus 245 ~D~~~~d~~~~~~eY~~~~~VevLeywGd~ydee~d~~~~~~ViTi~g~~~liR~e~np~~~g~~Pyvv~~~~P~~~~~y 324 (599) T protein:vir:31 245 FDSLHKKGYGSMMNYINEGVVEVLTFMGDFYDEENDELWNNYEITVIDRKIIGRKQSKDTWDGSQNLHIAVYEFQKDTLC 324 (599) T ss_pred ccccccccccchhhhcccchhhhhhhhhhhhcccCCccccceEEEEecCcEEeecccCCCCCCCCCeEEEEeeeeccccC Confidence 5554444 88999997 88999999999999999995 777999999999999999999999999999 Q ss_pred CCchHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhhhhccccccccccccccccccccccCCCcchHH Q lcl|NC_019423. 376 GEADAELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQGNPSQSIMEHKFPELPQSA 455 (756) Q Consensus 376 G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~ 455 (756) |+|+...+.++|..+|.++|+++|++.+...+ .+...+.+.+.+.. ..+...+...-.+.++++++|.-...+ T Consensus 325 G~G~l~~~~gaQ~~lN~~~Ng~iD~~~~~l~p-~l~~~~dl~~eD~~------~~P~~v~~~~d~~~vq~~~p~s~~~~a 397 (599) T protein:vir:31 325 PIGPLHRLTGMQYKLDKRENFREDLHDRFLHP-SLKKVGDVREKGMR------GGPNHVFEVEETGDVQYMTPPAEVLQP 397 (599) T ss_pred CCCCchhcchHHHHHHHHHHHhhhhhhhhhcc-cccccccccccCcc------CCCCcceeecCCCccccccCchhhhhH Confidence 99999999999999999999999999998876 33333333322111 111111211223345555555545566 Q ss_pred HHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhCCCCcEE Q lcl|NC_019423. 
456 IVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAK-GMADIGTKICAMNAVFLSEKEVV 534 (756) Q Consensus 456 ~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~-~~~~l~~~~l~li~q~~~~~r~i 534 (756) ..++++++..+++.||++.+++|..+.+ +.||+++++++++|+.+.+.+++.|.+ .+++|+++++++.++|++++.++ T Consensus 398 ~~~is~~e~~mee~sGvp~~~~G~~~ag-~~TA~~is~l~naa~~~~~~~vr~~e~~~lepll~~l~e~~~~f~D~~~ti 476 (599) T protein:vir:31 398 DNQLSITLQLMEDLSGAPKESIGQRTAG-EKTKFEVQLLDQGQNKVFRRKVKKFERELLTPVLNDYLEQGRNHLDASDTI 476 (599) T ss_pred HHHHHHHHHHHHHhhccchhhcCCcccc-hhhHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccce Confidence 7789999999999999999999988766 689999999999999999999999998 56899999999999999999999 Q ss_pred EEecCc-----eeecCHhHhcCcceEEEecccccH-H-HHHHHHHHHHHH-HhhccCCHhHHHHHHHHHHhhcCChhHHH Q lcl|NC_019423. 535 RITNEQ-----YVEIKREDLKGNFDIEVDINTAEI-D-NQKSQDLGFMVQ-TLGNTVDQSITLSLVAKIAELKRMPDLAH 606 (756) Q Consensus 535 RI~g~~-----~v~i~~d~~~~~~Dv~V~~g~a~~-~-~~~~q~l~~llq-~~~~~~~~~~~~~~l~~l~e~~~~~~~~~ 606 (756) ||+|++ |++|+++++++++++ +..|.... . .+..|.+.++++ .++..+.|.+...-+..+++ -.. T Consensus 477 ri~~~e~~~~~f~~i~redl~~~~~~-v~~Ga~~v~ere~~~q~l~~il~~~~~q~~~P~~~~k~l~~~l~------~~~ 549 (599) T protein:vir:31 477 KTFNSELGTATFLDITADDLNLNGQM-VAQGATLFAEKANTLQNLNAILGGPLGAALAPHMSRTKLFNAVE------YLG 549 (599) T ss_pred eeecccccceeeEEeehhhhhCCeee-eechhhHHHHHHHHHHHHHHHhcccCCCccchhhHHHHHHHHHH------HHH Confidence 999986 999999999999999 55664432 2 222344444443 23344555544433333332 244 Q ss_pred HhhhccCCCChhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 607 ELRTWQPQPDPMEEQLKQLAIQKAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARDME 676 (756) Q Consensus 607 ~l~~~~~q~~p~~~~~~q~~~~~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~~ 676 (756) .|+.++..+++...+.+|.+..++|.++ ++....+.. 
..+-+.+-+-.+| T Consensus 550 ~l~~~~~~~~~va~~eqq~~~~m~Q~~l-----q~~~~~~~~---------------~~~~~~~~~~~~~ 599 (599) T protein:vir:31 550 DLDAYGIFTFGIGVQEDQQLARMAQKST-----QQTEETALT---------------QEEVGGPTTDTGQ 599 (599) T ss_pred hccccccCCCchhHHHHHHHHHHHHHHH-----HHhHhhhhh---------------hhhcCCCCcccCC Confidence 5666776666654433322222211111 000000000 0011111111111 No 23 >protein:vir:345 Length: 663 # NCBI annotation: virion structural protein # Family: family:all:3199 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203459;genbank:gi:15320615;genbank:GeneID:921720 Probab=100.00 E-value=1e-39 Score=234.32 Aligned_cols=610 Identities=13% Similarity=0.120 Sum_probs=325.7 Q ss_pred CCcccCCCCCCCccccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCCCCCCcccCHHHHH Q lcl|NC_019423. 1 MEHQDTFKPLPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAKPPKIKGRSQVQPRLVRR 80 (756) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~grS~~v~~~v~~ 80 (756) |..+...-|-.-|+- --++|+. .+.+|+...+..-.+.+.--+.|-+..+...++ ..+ -+.++. T Consensus 1 m~~~~~~~~~~tpe~--la~~W~~---------~I~~a~~~~~~~h~r~~~~~k~y~~~~~~~~~~---~~r--~nl~~s 64 (663) T protein:vir:34 1 MNESQPTDFADTPQG--WAQRWQE---------EMSAAREPLEKWHTQGKEIVKRYRDERDSAHDA---ETR--WNLFST 64 (663) T ss_pred CCccccccchhcchh--HHHHHHH---------HHHHHHhccchHHHHHHHHHHHhhccccCCCcc---ccc--cchhhh Confidence 877666644333322 1113443 456666554444444444456665544433222 233 489999 Q ss_pred HHHHHHHHHHHhhcCCCCEEEEecCCcc-hHHHHHHHHHHHHHHHhhhcCC----cc-hHHHHHHHHhhcCceEEEEeee Q lcl|NC_019423. 
81 QAEWRYAPLSEPFLSSSKLFKLTPVTFE-DELAARQNELVLNYQFRTQLNK----VK-LVDDYVHSIVDDGTGIARIGWE 154 (756) Q Consensus 81 ~~e~~~~~L~~~f~~~~~~~~~~p~~~~-D~~~A~q~t~~~n~~~~~~~~~----~~-~~~~~v~~al~~g~gi~k~~w~ 154 (756) .|+.++|++ .+..+++.|.|...+ |.+.++-+.+.|+..+++-..+ ++ .+.-.++++|+||.||+++-++ T Consensus 65 ni~~i~P~i----Yar~P~p~V~~rf~d~d~~~~r~ase~leR~~~~~~~~D~~~l~~~~~~~v~d~ll~~rG~~~v~Ye 140 (663) T protein:vir:34 65 NIQTQMASL----YGQTPKVSVSRRFADADDDVARVASELLERLLNTDIEKDSDTFQQALEYALQDRLLPGFGLCRIRYE 140 (663) T ss_pred hHHHHhhhh----hcCCCcceeeecccCcccchhhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhhccccceEEEEee Confidence 999999999 999999999998776 5456777777777766544433 22 2445689999999999999776 Q ss_pred eeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeec Q lcl|NC_019423. 155 RKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALV 234 (756) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~ 234 (756) .+.+.+ .+.+..-++ + ..++. ...|- ..+.... T Consensus 141 ~~~~~~-------~~~~~~~D~---------------~------~~~~~-------a~~~~------------~~e~~a~ 173 (663) T protein:vir:34 141 VEWEEV-------AGVDAILDE---------------A------TGAEL-------AAAVP------------PTQRKAY 173 (663) T ss_pred cccchh-------ccccccCCC---------------c------cccch-------hcccc------------cchhhcc Confidence 554211 111111000 0 00000 00010 1122234 Q ss_pred CceeEEEechhheEeCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccc Q lcl|NC_019423. 235 NRPTVEMLNPNNVVIDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDAL 314 (756) Q Consensus 235 g~~~ie~V~p~~~~~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s 314 (756) .+++|++|+..||++||. ..|+++.||+.+.|||+.++...++ .+++...+.+....+...+. 
..+-.+.+ T Consensus 174 E~v~id~v~~~dfl~~pA--r~W~ev~wva~r~~mtk~e~~~rf~--~~~~~~~~a~~~~~~~~~~~-----~~~~~~~~ 244 (663) T protein:vir:34 174 ECVETDYLHWQDVLWSPA--RVWHEVRWLAFRNLLDMREFNARFD--ADGSRNLWASVPKVGKPKDG-----KDGQSCHP 244 (663) T ss_pred cceeeeeechhhcccchh--hccccccceeeeccCCHHHHHHhhc--CChhhhhhhhccCcCCcccc-----CCCCCcch Confidence 568999999999999994 4699999999999999999987753 12222111111111111111 11111223 Q ss_pred cceEEEEEEEEEeeccCCceeEEEEEEEEC--CEEEEecccccCCCcc-----ceEEeeeeeecCcccCCchHHHhHHHH Q lcl|NC_019423. 315 RKKVVAYEYWGFYDINDDGSLEPIVATWIG--STLIRMENNPFPDGKL-----PLVVVPYMPRKRELFGEADAELLGDNQ 387 (756) Q Consensus 315 ~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g--~~~L~~~~~P~~~~~~-----Pfv~~~~~~~~~~~~G~g~v~~~~d~Q 387 (756) -++.+|||.|.|-+ .++++++. +.+|+. +|-+.|.- ||..++.. ..++.++...+-...+.| T Consensus 245 ~~~a~VwEIWdK~~--------~~V~w~~eg~~~~L~~--~~p~lgl~~ffPcPrpl~~~~-~~ds~ipvpd~~~y~~~~ 313 (663) T protein:vir:34 245 WDRAEVWEIWDKGG--------RKVDWYVEGYSAVLDT--QPDPLGLESFFPCPKPLLANW-TTDKVVPRPDFVLAQDLY 313 (663) T ss_pred hcCcceeEEEecCC--------cEEEEEEcCcceeccc--CCCCCCCCCCCCCccccccee-cCCCeecCCcHHHHHHHH Confidence 34789999998753 23444444 345554 44444543 44444433 345777666777999999 Q ss_pred HHHHHHHHHHHHHHHhhcCCceEeeccccCccchhhhhcccccccccc--------ccccccccccccCCCcchHHHHHH Q lcl|NC_019423. 388 AILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNPM--------QGNPSQSIMEHKFPELPQSAIVMT 459 (756) Q Consensus 388 ~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~--------~~~~~~~i~~~~~~~~~~~~~~~l 459 (756) +++|.++.+ +..+....+++++++.|+-......-.+...+ ...++ ..+..+.|..++.+.+.+.+..+. 
T Consensus 314 ~E~n~~t~R-in~l~d~ikv~gvy~~~~g~~i~~~l~~a~~n-~lvpV~~~~~~~~~gg~~k~I~~~pi~~~~~aI~~l~ 391 (663) T protein:vir:34 314 KEIDLVSTR-ITLLERAIRVVGVYDKSSGLTIGRLLSEAAQN-DLIPVENWLTFADKGGLRGVVDWFPLEPVVAALTSLR 391 (663) T ss_pred HHHHHHHHH-HHHHHhhhhhceeeccccchhHHHHHHHhhCC-CceecchhhhhhhhcCccchhhcccchhHHHHHHHHH Confidence 999987765 56667778999999977764433322222221 22222 123345678888888777777665 Q ss_pred HH---HHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEE Q lcl|NC_019423. 460 QM---QNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRI 536 (756) Q Consensus 460 ~~---~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI 536 (756) +. +...+.++||+.|++.|... .+.||++.+..++.++.|+..+.+.+.++.++++++.-+.|.+.++-+.+-+| T Consensus 392 ~~r~qir~d~~qITGiaDi~Rga~~--a~ETatAQ~IKsq~gS~RIqe~qdevqR~arDi~ql~AEIl~~~~~~etl~~m 469 (663) T protein:vir:34 392 DYRRELVDALHQVTGMADIMRGASD--PRETAMAQGVKAKFGSIRLQRLQDEVARFASDIQRLKAEVIAEHYDVASILAQ 469 (663) T ss_pred HHHHHHHHHHHHHHhHHHHhhcccC--cchhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCHHHHHHH Confidence 54 55677899999999999653 36899999999999999999999999999999999999999999988888788 Q ss_pred ecCcee---ec-------CHhHhcCcceEEEeccc-ccHHH--HH--H----HHHHHHHHHhhccC--CHhHHHHHHHHH Q lcl|NC_019423. 537 TNEQYV---EI-------KREDLKGNFDIEVDINT-AEIDN--QK--S----QDLGFMVQTLGNTV--DQSITLSLVAKI 595 (756) Q Consensus 537 ~g~~~v---~i-------~~d~~~~~~Dv~V~~g~-a~~~~--~~--~----q~l~~llq~~~~~~--~~~~~~~~l~~l 595 (756) +|.+.. +| .++.+ ..|.|.|..++ ...+. .+ . +.+..+++.++|.+ .|+.. 
.++.++ T Consensus 470 ~~~elp~~~ei~~~~~~L~n~~~-r~~~ldIe~dsT~~~D~~~eK~~~~E~l~~i~~~~qq~~pl~~q~p~~~-p~l~El 547 (663) T protein:vir:34 470 ANAEFTFDKELAPKAAELIKSRF-SMYRVEVKPEAVSLQDFAALRNEKMEVLSGIASFMQGVAPLAQQVPGSA-PFLLQM 547 (663) T ss_pred hcCCCCcccchhHHHHHHhcCCC-cceeeeeccCCCCcCChHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhH-HHHHHH Confidence 875422 22 23333 34566665443 22222 11 1 11222333333221 12222 233333 Q ss_pred Hhh--cCCh---hHHHHhhhccCCCChhhh--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 596 AEL--KRMP---DLAHELRTWQPQPDPMEE--QLKQLAIQKAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESG 668 (756) Q Consensus 596 ~e~--~~~~---~~~~~l~~~~~q~~p~~~--~~~q~~~~~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~ 668 (756) +.. .++. ++.-.+..+.......+. -+++++++++++++..-+.++++..|+++ ..+|.+..+.+...++.+ T Consensus 548 lk~~~~~f~~~~qie~ai~~~~~~~e~aa~~~~~~~pa~~~~~~k~~~~q~k~q~~~aeAq-~e~q~~~~~~ql~~~~~~ 626 (663) T protein:vir:34 548 LKWSVSGLRGSSTIEGVLDKAIAAAEEAQKQAAQQSPAPQQPDPKVVAQAMKGQQEMAKVQ-AEVQGDLLRIQAETQANE 626 (663) T ss_pred HHHHhhcCChhhhHHHHHHHHHhhhHHHhhccCCCCcccchhhHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHH Confidence 332 2332 232222222111100000 00111111111111111111111111111 122333333333333333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH-HhhccCCchhhhccCC Q lcl|NC_019423. 669 TKHARDMEKQKAQSQGNQNLQITKALTT-PTKEGETTPNISAAVG 712 (756) Q Consensus 669 ~k~~~~~~~~~~q~~~~~~~~~~~a~~~-~~~~~~~~~~~~~a~~ 712 (756) .|+.. ++.++++++-+..+. 
.+++..-+++.-+.++ T Consensus 627 ~k~~~--------~a~~~~~~a~q~~~~~~~~r~~~~~a~~~~~~ 663 (663) T protein:vir:34 627 TKERQ--------QAEWNVREAAQKNLISQAARAMNPQARNGGMP 663 (663) T ss_pred HHHHH--------HHHHHHHHHHHhhHHHHHHHhhchhhhcCCCC Confidence 33211 111222222221111 1111112222222221 No 24 >protein:vir:108295 Length: 711 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552284;genbank:gi:160700609;genbank:GeneID:5758811 Probab=100.00 E-value=1.5e-28 Score=173.18 Aligned_cols=580 Identities=12% Similarity=0.077 Sum_probs=159.3 Q ss_pred ccccccccCCCchHH---------HHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCCCCCCcccCHHHHHHHH Q lcl|NC_019423. 13 PAQSEKLTDWKKEPS---------IQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAKPPKIKGRSQVQPRLVRRQAE 83 (756) Q Consensus 13 ~~~~~~~~~~~~~~~---------~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~grS~~v~~~v~~~~e 83 (756) -.+|.|-..-..-++ -...+..++.++++...-+..+.+|.+-..-+.+ --.| .+|= ..+....+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~----fy~G-~Qw~-~~~~~~l~ 74 (711) T protein:vir:10 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLK----FLGG-EQWP-SQVRTERE 74 (711) T ss_pred CCcccccccccchhHHHHHhcccCcchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHH----HhCC-CCCC-HHHHHHHH Confidence 233333321111000 1112223555555544333322233222221111 2245 4663 33332222 Q ss_pred HHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcC-Ccch--H-------------------------- Q lcl|NC_019423. 84 WRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLN-KVKL--V-------------------------- 134 (756) Q Consensus 84 ~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~-~~~~--~-------------------------- 134 (756) ..+-+.+.| + ..+- .||+++..+.+ ...+ . 
T Consensus 75 ----------~~g~p~~~~-----N--~i~~----~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~~~~~~~~~~~~~~~~ 133 (711) T protein:vir:10 75 ----------LEQRPCLVN-----N--VLPT----FVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGK 133 (711) T ss_pred ----------hcCCCcEEE-----c--chHH----HHHHHhhhHhhCCcceEEecccccchhhhhhhhccccccccCCCh Confidence 112222222 2 1122 44444443322 1111 0 Q ss_pred -HHHHHHHhhcCceEEEEeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhc Q lcl|NC_019423. 135 -DDYVHSIVDDGTGIARIGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNET 213 (756) Q Consensus 135 -~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~ 213 (756) +.-+-++| +|++++.++.. ........++.+.+.+ T Consensus 134 ~d~~~Ae~l---~~~~~~~~~~~-----------------------------------------~~~~~~s~af~d~~~~ 169 (711) T protein:vir:10 134 NDYELAEVF---TGLIKNIEYNC-----------------------------------------DAETEYDIAFQGAVES 169 (711) T ss_pred hHHHHHHHH---HHHHHHHHHhc-----------------------------------------ChhHHHHHHHHHhhhc Confidence 00122222 44444322100 0001123344555566 Q ss_pred CCcceeccCceeEEEeeeeecCceeEEEechhheEe----CCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCc Q lcl|NC_019423. 214 GEATYAIQTGVTEVEVEKALVNRPTVEMLNPNNVVI----DPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDW 289 (756) Q Consensus 214 G~~~~~~~~g~~~~~~~~~~~g~~~ie~V~p~~~~~----Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~ 289 (756) |.|+ .++..+-...+.+. -++.+ ||..- .+| -..+..++++..... ....++ . T Consensus 170 G~G~-------~ev~~d~~~~d~~~------~e~~i~~v~~p~~v-~~D-----p~a~~~D~sDar~~~-~~~~~~---~ 226 (711) T protein:vir:10 170 GMGY-------LRVRSDYLADDSFE------QDLIIEAIQNQFSV-TID-----PDAKKRDRSDMNWCL-IDDTMS---K 226 (711) T ss_pred Ccce-------EEEEecccCCCCCC------CCeEEeeecChhhe-eeC-----ccccccChhhhccee-eeecCC---H Confidence 6544 33222211111111 11111 22110 000 012233444443321 111111 1 Q ss_pred hhhhhhhchhhhccccccccccccccceEEEEEEEEEeeccCCceeEE-------EEEEE-ECCEEEEeccc-cc----- Q lcl|NC_019423. 
290 ESSSPITDPDHESKTPSDFQFKDALRKKVVAYEYWGFYDINDDGSLEP-------IVATW-IGSTLIRMENN-PF----- 355 (756) Q Consensus 290 ~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~E~w~k~d~~~~g~~~~-------~~~~~-~g~~~L~~~~~-P~----- 355 (756) +... ..+++...... +.. .+.-+..|+..+ .-.+.++ +.++. ..+.......+ ++ T Consensus 227 ~~~~-~~yp~~a~~~~-~~~-------~~~~~~~~~~~~--~vrv~E~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 295 (711) T protein:vir:10 227 EKFK-ALYPDATAEPV-YED-------SVADYDTWFTEK--SVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELL 295 (711) T ss_pred HHHH-HhCCchhhhhh-hcc-------cccccCcccCcc--eeeEEEEEeeeeeeeEEEeecCCceeccCcchhHHHHHH Confidence 1100 01111110000 100 011122233211 0111111 11111 11211111110 00 Q ss_pred C---------------------------CCc--cceEEeeeeeecCccc---CCchHHHh-HHHHHHHHHHHHHHHH-HH Q lcl|NC_019423. 356 P---------------------------DGK--LPLVVVPYMPRKRELF---GEADAELL-GDNQAILGATMRGMID-LL 401 (756) Q Consensus 356 ~---------------------------~~~--~Pfv~~~~~~~~~~~~---G~g~v~~~-~d~Q~~iN~~~~~~~d-~l 401 (756) . ++. +|+-.+++.|.-+... +.+....+ .++-+.-. +.+.+.- .+ T Consensus 296 ~~g~~~~~~~~~~~~~v~~~~~~G~~~L~~~~p~~~~~~P~vp~~g~r~~~d~~~~~~G~vr~~~d~Qr-~~N~~~s~~~ 374 (711) T protein:vir:10 296 EAGISIVRTRKVKTFKTYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQR-MANYWDSAAT 374 (711) T ss_pred hcCchhhhhhhhceeeEEEEEEecceeecCCCCCCCCcccEEEEeeeeeccccccccchhhhhhhhhHH-HHHHHHHHHH Confidence 0 011 2222233333322211 22222222 22222211 1111111 11 Q ss_pred HhhcCCceEeecccc-Cccchhh-hh-cccc-cc--cccccccccccc-ccccCCCcchHHHHHHHHHHHHHHHHhchhH Q lcl|NC_019423. 402 GRSANGQRGYPKGML-DTLNRRR-YD-DGQD-YE--YNPMQGNPSQSI-MEHKFPELPQSAIVMTQMQNQEAESLTGVKA 474 (756) Q Consensus 402 ~~~~~~~~~~~~gav-~~~~~~~-~~-~~~~-~~--~~~~~~~~~~~i-~~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~ 474 (756) ...+. ...+.+ -..+... .+ .... .. ...+..+++... ..+..-..++-....++++...... -. 
T Consensus 375 ~~l~~----~~~~~~~~~~gai~~~~~~~~e~~~~~~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~----i~ 446 (711) T protein:vir:10 375 ETVAL----APKAPFIGSEGNVEGREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEK----IK 446 (711) T ss_pred HHHHh----cCCCceeecCcccCChHHHHHhccccCCCeeEecccccCcCCccccCCCCCCHHHHHHHHHHHHH----HH Confidence 11111 111111 1000000 00 0000 00 111122222111 1122222233334445554444444 34 Q ss_pred HhcCCCccccchhHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHhcCcc Q lcl|NC_019423. 475 FSGGVTGSAYGDVAAGIRGALDAASKREMAI-LRRLAKGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDLKGNF 553 (756) Q Consensus 475 ~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~-~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~~~~~ 553 (756) ..+|++..++|...+++|++..++.+..+.. +..|-+.++.-.+.+.+++..+...- .+.+..+.|.+++-..++ T Consensus 447 ~~tGi~~~~~G~~~n~~Sg~ai~~~q~qg~~~l~~~~dn~~~~~~~~g~~ll~li~~~----~~~er~~rI~ged~~~~~ 522 (711) T protein:vir:10 447 STMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHI----YDTERVVRLKFPDETEDF 522 (711) T ss_pred HHhCCChHHcCCCccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----cCCCeEEEEecCCCCcce Confidence 5679988888888888999877777655443 34455666667777777777653321 122345566554422221 Q ss_pred eEEEeccc---ccHHHHHHHHHHHH-HH---HhhccCCHhHHHHHHHHHHhhcCC-hhHH----HHhhhccCCCC----- Q lcl|NC_019423. 554 DIEVDINT---AEIDNQKSQDLGFM-VQ---TLGNTVDQSITLSLVAKIAELKRM-PDLA----HELRTWQPQPD----- 616 (756) Q Consensus 554 Dv~V~~g~---a~~~~~~~q~l~~l-lq---~~~~~~~~~~~~~~l~~l~e~~~~-~~~~----~~l~~~~~q~~----- 616 (756) +.++... .++.......+.-. .. ..+|.. +......+..|+++... |+.. ..+-....-+. 
T Consensus 523 -v~ln~~~~~~~~G~~~~~nDi~~g~~Dv~i~~~p~~-~s~r~~~~~~l~ql~~~~p~~~~~~~~~il~~~d~p~~~el~ 600 (711) T protein:vir:10 523 -VKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAF-ATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDWPGADVIA 600 (711) T ss_pred -EEecccccccccccceeeeccceeeeEEEEeeccCc-hhHHHHHHHHHHHHHhhcchhhhHHHHHHHHhcCCCCHHHHH Confidence 2232210 00000000000000 00 001111 11112222233332221 2211 11111111111 Q ss_pred --------hhhhhHHHHH-HHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHH Q lcl|NC_019423. 617 --------PMEEQLKQLA-IQKAQLE--NEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTK---HARDMEKQKAQS 682 (756) Q Consensus 617 --------p~~~~~~q~~-~~~aq~e--~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k---~~~~~~~~~~q~ 682 (756) +........+ .++.+++ .+..+.+.++.++++...+++++.++++..+..++.. ..+.+....+++ T Consensus 601 e~lr~~~~~~~~~~~~~~~~qq~~~e~qq~~~~~q~~~~~~q~~~~qa~ae~~~Aqae~~qa~~e~~~~q~q~~~~~~~a 680 (711) T protein:vir:10 601 ERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQLETEEAQKQLAMIEDMA 680 (711) T ss_pred HHHHhhcCcccCcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1111111111 1111111 1112222333333333334444433333321111111 011111111111 Q ss_pred HH-HHHHHHHHHHHHHhhccCCchhhhccCCCCCCCcccCchh Q lcl|NC_019423. 683 QG-NQNLQITKALTTPTKEGETTPNISAAVGYNTLTNGNSPQE 724 (756) Q Consensus 683 ~~-~~~~~~~~a~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~ 724 (756) ++ ..+++..++.++ ....--.+.++.+ -+. 
T Consensus 681 q~~~~~~qq~~~~l~--~~qaelq~~q~~~----------~q~ 711 (711) T protein:vir:10 681 QGGDVVYQQVRELVA--QALAEITASQANV----------TEQ 711 (711) T ss_pred HHHHHHHHHHHHHHH--HHHHHHHHHHHHh----------hcC Confidence 11 111111111110 0000011111111 000 No 25 >protein:vir:7321 Length: 556 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848212;genbank:gi:30387383;genbank:GeneID:2641872 Probab=100.00 E-value=8.9e-29 Score=174.36 Aligned_cols=520 Identities=11% Similarity=0.073 Sum_probs=284.4 Q ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccc--CCCCCCCCC---CCcccCHHHHHHHHHHHHHHHHhhcC- Q lcl|NC_019423. 22 WKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKG--KAKPPKIKG---RSQVQPRLVRRQAEWRYAPLSEPFLS- 95 (756) Q Consensus 22 ~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~--~~~~~~~~g---rS~~v~~~v~~~~e~~~~~L~~~f~~- 95 (756) |.+ ..-+.|++.++.+++.++..++.|++-.+|..... ..+.....| .++++++...+.++.+.+.|+..+|+ T Consensus 1 m~~-~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~Las~l~~~ltpp 79 (556) T protein:vir:73 1 MAE-TEKERLLKQLAQLKNERTSFESHWLDLSDFINPRGSRFLTSDVNRDDRRNTKIVDPTGSMAQRILSSGMMSGITSP 79 (556) T ss_pred CCh-hhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCcCCCCCCcchhhcCccccchHHHHHHHHHHHHHHhhcCC Confidence 655 45667888999999999999999999999974321 112122222 36789999999999999999999998 Q ss_pred CCCEEEEecCCcchHHHHH------HHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeeeeeeeeeec Q lcl|NC_019423. 96 SSKLFKLTPVTFEDELAAR------QNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIKTETPVFQL 169 (756) Q Consensus 96 ~~~~~~~~p~~~~D~~~A~------q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~ 169 (756) +.+||++.+..++..+.+. ..+..+.-.|. 
.++-+..++.++++++..|+|++-+-++ T Consensus 80 ~~~WF~l~~~d~~~~~~~~v~~~L~~ve~~~~~~l~-~snf~~~~~~~~~~L~~~G~a~l~~~~~--------------- 143 (556) T protein:vir:73 80 ARPWFKLATPDPDMMDYGPVKIWLEVVQRRMNEVFN-KSNLYQSLPVMYASLGTFGTGAMAVMED--------------- 143 (556) T ss_pred CCcccccccCcccccchHHHHHHHHHHHHHHHHHHH-hcCcHHHHHHHHHHHHhhCceeeeeeec--------------- Confidence 8999999986554333222 24444444443 4555666788888888888887643111 Q ss_pred CCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEEEechhheEe Q lcl|NC_019423. 170 YPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVEMLNPNNVVI 249 (756) Q Consensus 170 ~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie~V~p~~~~~ 249 (756) ..+.+++..++..+|++ T Consensus 144 ---------------------------------------------------------------~~~~~r~~~~~l~~~~~ 160 (556) T protein:vir:73 144 ---------------------------------------------------------------DQDVIRTMPFPIGSYYL 160 (556) T ss_pred ---------------------------------------------------------------CCceEEEEEeecceeEE Confidence 01225678899999999 Q ss_pred CCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceEEEEEE-EEEee Q lcl|NC_019423. 250 DPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVAYEY-WGFYD 328 (756) Q Consensus 250 Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~E~-w~k~d 328 (756) ..++...++. ++++..+|..++.+.++. ++|. ....... . .+....+|.|+.+ |-+.+ T Consensus 161 ~~d~~G~vd~---i~r~~~~t~~ql~~~fg~-~~l~---~~v~~~~---~-----------~~~~~~~~~v~~~V~pr~~ 219 (556) T protein:vir:73 161 ANSPRGSVDT---CIRQFSMTVRQMVQEFGL-DNVS---TSVKGMW---E-----------NGTYETWVEVNHCITPNVN 219 (556) T ss_pred eeCCCCCeEE---EEEEEeccHHHHHHHcCc-ccCC---HHHHHHH---h-----------cCCccceEEEEEEEecccc Confidence 9998776553 678899999998877653 2221 1100000 0 0111135666654 32332 Q ss_pred ccCCc----eeEEEEEEEE----CCEEEEecccccCCCccceEEeeeeeecCcccCCc-hHHHhHHHHHHHHHHHHHHHH Q lcl|NC_019423. 
329 INDDG----SLEPIVATWI----GSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEA-DAELLGDNQAILGATMRGMID 399 (756) Q Consensus 329 ~~~~g----~~~~~~~~~~----g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g-~v~~~~d~Q~~iN~~~~~~~d 399 (756) .+.++ .+.+.-+.|- ++++++ ++.| .++||++..|...++..||+| .+....+-.+.+|.+.+..+. T Consensus 220 ~~~~~~~~~~~p~~s~~~~~~~~~~~vl~--esg~--~e~P~~~~Rw~~~~ge~YGrg~P~~~~lgD~k~L~~l~~~~l~ 295 (556) T protein:vir:73 220 RDSGKMDSKNKPYRSVYFESGGDSDKLLR--ESGF--DEFPILAPRWEVNGEDVYASSCPGMLALGQVKALQVEQKRKAQ 295 (556) T ss_pred ccccccCcccceEEEEEEEecCCCceecc--cCCc--ccCCceeeeeeecCCcccccCccHHHhHHHHHHHHHHHHHHHH Confidence 22111 1111112221 235554 4555 569999999999999999999 599999999999999999999 Q ss_pred HHHhhcCCceEeeccccCccchhhhhccccccccccccccccccccccCCCc-chHHHHHHHHHHHHHHHHhchhHH-hc Q lcl|NC_019423. 400 LLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQGNPSQSIMEHKFPEL-PQSAIVMTQMQNQEAESLTGVKAF-SG 477 (756) Q Consensus 400 ~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~-~~~~~~~l~~~~~~~e~~tGv~~~-~~ 477 (756) ...++++|.++++.+.... .....++...... .......++++....- -..+...++.+.+.+....-..-+ ++ T Consensus 296 ~~~~~~~pp~~v~~~~~~~--~~~~~pgg~~~~~--~~~~~~~i~p~~~~~~d~~~~~~~i~~~~~rI~~af~~d~~~~l 371 (556) T protein:vir:73 296 LIDKATNPPMVAPTSLKNQ--RVSLLPGDVTYLD--VISGQDGFKPAYLVNPNTADLLADIQDTRQTINSAYFVDLFMML 371 (556) T ss_pred HHHHHhcCceecccccccc--ceeeccCcccccc--CCCCccceeeeccccccHHHHHHHHHHHHHHHHHHhhcchhhhh Confidence 9999999999998775321 1112222111110 1112234555432211 122333445554444443321111 12 Q ss_pred CCCccccchhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHhcCcceEE Q lcl|NC_019423. 478 GVTGSAYGDVAAGIRGALDAASKREMAILRRLA-KGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDLKGNFDIE 556 (756) Q Consensus 478 G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~-~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~~~~~Dv~ 556 (756) +.. ++...||++|..+.+.....|..++.+|. +++.++..+.+.++.+.. .++--|+.+.+ -+|. 
T Consensus 372 ~~~-~~~r~TAtEv~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g------------~lP~~P~~l~~-~~i~ 437 (556) T protein:vir:73 372 QNI-NTRSMPVEAVIEMKEEKLLMLGPVLERLNDEALNPLIDRVFSIMARKN------------MLPEPPDVLQG-MPLR 437 (556) T ss_pred ccC-CCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcC------------CCCCCchhhcC-ceeE Confidence 222 33347999999999999999999999996 488999999999998742 23333444443 2344 Q ss_pred EecccccHHHHHH---H---HHHHHHHHhhccCCHhH-----HHHHHHHHHhhcCChhHHHHhhhccCCCChhhhhHHHH Q lcl|NC_019423. 557 VDINTAEIDNQKS---Q---DLGFMVQTLGNTVDQSI-----TLSLVAKIAELKRMPDLAHELRTWQPQPDPMEEQLKQL 625 (756) Q Consensus 557 V~~g~a~~~~~~~---q---~l~~llq~~~~~~~~~~-----~~~~l~~l~e~~~~~~~~~~l~~~~~q~~p~~~~~~q~ 625 (756) |..-++....++. + ++++.+..++ .+.|++ .-.++..+++..|+|. ..++. +. +.+++ T Consensus 438 v~yis~La~aqk~~~~~~i~~~~~~~~~la-q~~Pe~~d~id~d~~~~~~a~~~Gvp~--~~irs------~e--ev~~~ 506 (556) T protein:vir:73 438 IEYISVMAQAQKSIGLTSLSQTVGFIGQLA-QFKPEALDKLDVDQAIDAFSEMSGVSP--TVIVP------QE--QVQGI 506 (556) T ss_pred EEeecHHHHHHHHHHHHHHHHHHHHHHHHh-ccChhhHhcCCHHHHHHHHHHHcCCCh--hhcCC------HH--HHHHH Confidence 4333333222222 2 2333333332 234432 2244555666666652 22221 11 11111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCch Q lcl|NC_019423. 626 AIQKAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARDMEKQKAQSQGNQNLQITKALTTPTKEGETTP 705 (756) Q Consensus 626 ~~~~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~~~~~~q~~~~~~~~~~~a~~~~~~~~~~~~ 705 (756) .+++++++. .+++ .++++ ++ ++. +..|..... ..-.+++++.. 
..+.+++ T Consensus 507 rq~r~~~qq--~~~~----~~~~~--~a------~~~---------~~~~~~~~~-----~~~~~l~~~~~--~~g~~~~ 556 (556) T protein:vir:73 507 REERAKQAQ--AAQA----MAMGQ--AA------AQG---------AKTLSETQT-----SDPSALTAIAN--AAGAPQQ 556 (556) T ss_pred HHHHHHHHH--HHHH----HHHHH--HH------HHH---------HHHhhhccC-----CCHHHHHHHHH--hhcCCCC Confidence 111111000 0000 00000 00 000 000000000 00011111111 1112222 No 26 >protein:vir:3520 Length: 720 # NCBI annotation: P19 # Family: family:all:487 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050980;genbank:gi:9633566;genbank:GeneID:1262313 Probab=100.00 E-value=2.7e-27 Score=166.26 Aligned_cols=599 Identities=10% Similarity=0.009 Sum_probs=163.5 Q ss_pred HHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCCCCCCcccCHHHHHHHHHHHHHHHHhhcCCCCEEEEecCC Q lcl|NC_019423. 27 SIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAKPPKIKGRSQVQPRLVRRQAEWRYAPLSEPFLSSSKLFKLTPVT 106 (756) Q Consensus 27 ~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~grS~~v~~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~ 106 (756) -...+...+..+.++.+..+....+|.+-+..+.+. -...| .+| +..++..-+..+. .++-+.+.|-= T Consensus 1 ma~~~~~~l~~~~~~~~~~~~~~~~~r~~~~~d~~f--~~~~G-~QW-~~~~~~~~~~~l~------~~~~P~~~~N~-- 68 (720) T protein:vir:35 1 MAETLQKRHEQIMRKFDRAHSPQEAVREKCLEATRF--ARVPG-GQW-EGATAAGSELGKH------FEKYPKFEINK-- 68 (720) T ss_pred CchHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhh--hccCC-CCC-CHHHHHHHHHHHh------hCCCCeEEEcc-- Confidence 333345556666666665554444444433322211 12335 366 3333333333321 12222222221 Q ss_pred cchHHHHHHHHHHHHHHHhhh-cCC--cchH---HHHHHHHhhcCceEEEEeeeeeeeeeeeeeeeeecCCCCCHHHHHH Q lcl|NC_019423. 107 FEDELAARQNELVLNYQFRTQ-LNK--VKLV---DDYVHSIVDDGTGIARIGWERKTVKIKTETPVFQLYPIENQEQADV 180 (756) Q Consensus 107 ~~D~~~A~q~t~~~n~~~~~~-~~~--~~~~---~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~ 180 (756) .+- .||+++..+ ++. ++++ .+.=++...--+|++|+.++ +... 
T Consensus 69 -----i~~----~v~~v~g~~~~nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~-~~~~--------------------- 117 (720) T protein:vir:35 69 -----IST----ELNRIISEYRHNRITVKFRPGDKTASEALANKLNGLFRADYE-ETDG--------------------- 117 (720) T ss_pred -----HHH----HHHHHHhHHHhCCCceEEEcCCCcchHHHHHHHHHHHHHHHH-hcCc--------------------- Confidence 122 344444322 111 1111 01111112222344554332 1100 Q ss_pred HHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCce--eEEEechhheEeCCCCcCccc Q lcl|NC_019423. 181 LQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRP--TVEMLNPNNVVIDPSCNGDLD 258 (756) Q Consensus 181 ~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~--~ie~V~p~~~~~Dp~a~~d~~ 258 (756) ......++.+.+.+|.|+. ++..+-....++ ....|....++.++.+- + T Consensus 118 -------------------~~~~s~Af~~~i~~G~G~~-------~v~~d~~~~~d~~~~~~~i~i~~v~~~~~~v--~- 168 (720) T protein:vir:35 118 -------------------GEACDNAFDDGSTGGFGCF-------RLTTNLVNALDPMDERQRICLEPIYDPARSV--W- 168 (720) T ss_pred -------------------hHHHhHHHHHhhhccceeE-------EeeecccccCCCCcccceeeEecccCchhhe--e- Confidence 0011334445556665443 433322111111 11112111111111110 0 Q ss_pred cCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceEEEEEEEEEeeccCCceeEEE Q lcl|NC_019423. 259 KALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVAYEYWGFYDINDDGSLEPI 338 (756) Q Consensus 259 da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~E~w~k~d~~~~g~~~~~ 338 (756) |=...+..++++..... ....++ .+... ..++....... .+.. .-.+++++ +.+...+.+++ T Consensus 169 ---~Dp~a~~~D~sDar~~~-~~~~~~---~d~~~-~~yp~~a~~~~-----~~~~--~~~~~d~~---~~~~v~i~E~~ 230 (720) T protein:vir:35 169 ---FDPDAKKYDKSDAEWAF-CMYSLS---AEKYK-AEYNKDPATLM-----SGIE--RSWDYDWY---DVDVVYIAKYY 230 (720) T ss_pred ---ecccccccChhhhhhhh-hhcCCC---HHHHH-HhCCCcccccc-----cccc--cccccccc---CCCceEEEEee Confidence 11113344555544321 111111 11110 11111110000 0000 01112221 11111222221 Q ss_pred E-------EEE-----ECCEEEEecccc-------------------------cC---C--------CccceEEeeeeee Q lcl|NC_019423. 
339 V-------ATW-----IGSTLIRMENNP-------------------------FP---D--------GKLPLVVVPYMPR 370 (756) Q Consensus 339 ~-------~~~-----~g~~~L~~~~~P-------------------------~~---~--------~~~Pfv~~~~~~~ 370 (756) . +++ .|..+...+.++ |+ + +.+||-.|++.|. T Consensus 231 ~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~v~~~~~~g~~~l~~~~~~p~~~fP~vP~ 310 (720) T protein:vir:35 231 EVKKESVDVVSFQNPLTSETVTYDSDQLELVEDELADIGFIEAARRTIKRRRVYVSVVDGEGFLEKAQRIPGEHIPLIPV 310 (720) T ss_pred EEEEEEEEEEEeecCCCCCeeecCCccHHHHHHHHhhhccccccccceeEEEEEEEeeccchhcccCCCCCCCccceEEE Confidence 1 111 122222222211 00 0 1123333333333 Q ss_pred cCcccCCchHHHhHHHHHHHHHHHHHHHHHHHhhcC----CceEeeccccCccchhhhhcccccccccccc-ccc-cccc Q lcl|NC_019423. 371 KRELFGEADAELLGDNQAILGATMRGMIDLLGRSAN----GQRGYPKGMLDTLNRRRYDDGQDYEYNPMQG-NPS-QSIM 444 (756) Q Consensus 371 ~~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~----~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~-~~~-~~i~ 444 (756) -+..+ .++=++ ..--+.|.+.|.-.+.|+ -.+++....+... ...........-.+ .+. ..+. T Consensus 311 ~g~r~----~~d~~~---~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~----~~a~~~~~~~~~~~a~~~~~~~~ 379 (720) T protein:vir:35 311 YGKRW----FIDDIE---RVEGHIAKAMDAQRLYNLQVSMLADSATQDTGSIP----IVGKSQIKTLEKYWANRNKNRPA 379 (720) T ss_pred Eeeee----ccCCCc---ccceeeecchhHHHHHHHHHHHHHHHHHcCCcccc----ccCcchHHHHHHHhhcccccccc Confidence 22211 111000 000111122222111111 0111111111100 00000000000000 000 0000 Q ss_pred cc---------------cCCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHH-HHH Q lcl|NC_019423. 445 EH---------------KFPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAI-LRR 508 (756) Q Consensus 445 ~~---------------~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~-~~n 508 (756) .+ +.+.......++.+...++++.....-...+|+++..+|.+++ +|++..++.+.-+.. .-. 
T Consensus 380 ~l~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~vsGi~~~~lG~~sn-~SG~Ai~~rq~qg~~~~~~ 458 (720) T protein:vir:35 380 FLPLNEIVDKQGNIIAPPTPVGYTQPQPLNQAMAALLQQTGADIQEVTGSSQAMQPMPSN-IAKETVNHLMHRSDMSSFI 458 (720) T ss_pred ccccccccccCcccccCCCcccccCCCCCchHHHHHHHHHHHHHHHHhCCChHHcCcccc-hHHHHHHHHHHHHHHHHHH Confidence 00 0010011112233333344443333334567998888887765 888877776654443 344 Q ss_pred HHHHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHhcCcc-------------------eEEE---ecccccHHH Q lcl|NC_019423. 509 LAKGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDLKGNF-------------------DIEV---DINTAEIDN 566 (756) Q Consensus 509 ~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~~~~~-------------------Dv~V---~~g~a~~~~ 566 (756) |-+.++.-.+.+.+++..+...- .+.+..+.|...+-..++ ||++ ++....... T Consensus 459 ~~Dnl~~~~~~~g~~lL~lI~~~----y~~er~~RI~~ed~~~~~v~~n~~~~d~~~g~~v~~NDi~~g~yDv~v~~~p~ 534 (720) T protein:vir:35 459 YLDNMAKSLKRAGEVWLSMAREV----YGSDRQVRIVNADGTDDIALMSVVINDNQTGQVVAMNDLSSGRYDVTVDVGPS 534 (720) T ss_pred HHHHHHHHHHHHHHHHHHHHHHH----cCCCcEEEEecCCCCcceEeechhhhccCCCceeeeecceeeeeEEEEecccC Confidence 55566666677777776553321 122234445433211111 1110 011111111 Q ss_pred HHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcC----Chh-HHHHhhhccCCCC-------------hhhhhH---HHH Q lcl|NC_019423. 567 QKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKR----MPD-LAHELRTWQPQPD-------------PMEEQL---KQL 625 (756) Q Consensus 567 ~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~----~~~-~~~~l~~~~~q~~-------------p~~~~~---~q~ 625 (756) ...+. +.....+.+++..+. +.. +...+......|. ++.... .+. T Consensus 535 ~~s~r--------------eq~~~~m~qll~~~~p~~~~~~~~~~~ile~~d~p~~~e~~erirk~~~~~~~~~~~~~e~ 600 (720) T protein:vir:35 535 YTARR--------------DATVSVLTNLLAGMLPQDPMRQVLQGIILDNMEGEGLDEFKEYNRKQLLTQGVVKPRNTEE 600 (720) T ss_pred cccHH--------------HHHHHHHHHHHHhcCCCchhHHHHHHHHHHhcCchhHHHHHHHHHhhcchhcccCccChhH Confidence 11110 111111222222111 111 1111111111111 111000 011 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCch Q lcl|NC_019423. 
626 AIQKAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARDMEKQKAQSQGNQNLQITKALTTPTKEGETTP 705 (756) Q Consensus 626 ~~~~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~~~~~~q~~~~~~~~~~~a~~~~~~~~~~~~ 705 (756) +++.++++.+..+++++..+++++..+++++..+++..+...+++..+. +.....+++++..+..++..+.+ +.-.+ T Consensus 601 qq~~a~~qq~~qq~~~e~~~aqa~l~qaqae~~kaqa~~~~~qa~a~~a-qa~a~~~~a~~~~~~aq~~~~~q--~~i~q 677 (720) T protein:vir:35 601 EQMVAQMIQQAQQPNAELVAAQGVLMQGQAEVQKAKNEELAIQVKAFQA-QTEARVAEAKMVQILASADSAKR--AEIRE 677 (720) T ss_pred HHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHH--HHHHH Confidence 1111111112222333333444444444444444332222221111100 00000001111111111111111 11112 Q ss_pred hhhccCCCCCCCcccCchhcCCCCCCCCccccccccccCCCCCCCCCCCcC Q lcl|NC_019423. 706 NISAAVGYNTLTNGNSPQERDLAAQQDPAYSLGSQYYDPSQDPASALGMNL 756 (756) Q Consensus 706 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 756 (756) +++...|+..... ..+...-.=+.-+.-.-|+.-.+-... +++ T Consensus 678 alq~~~~~q~~q~--~~eqa~~el~~~~~~~~~~~~~~~~~~------~~~ 720 (720) T protein:vir:35 678 ALKMLHQFQKEQG--DASRADAELILKATDTQHKQNRDAAKN------HSI 720 (720) T ss_pred HHHHHHHHHHhcc--hHHHHHHHHhhcccchhhhhhHHHhhc------cCC Confidence 2332223333211 111110000111111111111111111 011 No 27 >protein:vir:9263 Length: 725 # NCBI annotation: 1 # Family: family:all:487 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720327;genbank:gi:24371585;genbank:GeneID:955785 Probab=100.00 E-value=6.3e-27 Score=164.23 Aligned_cols=606 Identities=11% Similarity=-0.008 Sum_probs=181.6 Q ss_pred HHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCCCCCCcccCHHHHHHHHHHHHHHHHhhcCCCCEEEEecCCc Q lcl|NC_019423. 28 IQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAKPPKIKGRSQVQPRLVRRQAEWRYAPLSEPFLSSSKLFKLTPVTF 107 (756) Q Consensus 28 ~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~grS~~v~~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~ 107 (756) ++.-+.+++.++++....+....+|..-..-+.+ --.| .+|= ..+... |-. - + .|+ . 
T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~----fy~G-~Qw~-~~~~~~-------l~~--q-~------rp~-~ 57 (725) T protein:vir:92 1 MADNENRLESILSRFDADWTASDEARREAKNDLF----FSRI-SQWD-DWLSQY-------TTL--Q-Y------RGQ-F 57 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHH----hhcC-CCCC-HHHHHH-------HHh--c-C------CCc-c Confidence 7777888888888877666544444433332222 2246 4673 322222 211 1 1 221 1 Q ss_pred chHHHHHHHHHHHHHHHhhhcC-C--cch--HHHHHHHHhhcCceEEEEeeeeeeeeeeeeeeeeecCCCCCHHHHHHHH Q lcl|NC_019423. 108 EDELAARQNELVLNYQFRTQLN-K--VKL--VDDYVHSIVDDGTGIARIGWERKTVKIKTETPVFQLYPIENQEQADVLQ 182 (756) Q Consensus 108 ~D~~~A~q~t~~~n~~~~~~~~-~--~~~--~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 182 (756) + -.+- +||+++..+.+ . +++ .+..=.++..--+|++|+..+ .. T Consensus 58 N--~i~~----~i~~v~g~e~~nr~d~~v~P~~~~d~~~Ae~l~~~~~~~~~-~~------------------------- 105 (725) T protein:vir:92 58 D--VVRP----VVRKLVSEMRQNPIDVLYRPKDGASPDAADVLMGMYRTDMR-HN------------------------- 105 (725) T ss_pred c--chHH----HHHHHHhhHHhCCcceEEecCCccHHHHHHHHHHHHHHHHH-hh------------------------- Confidence 1 2222 44444442211 1 111 110001111112344442111 00 Q ss_pred HhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEEEechhh--eEe--CCCCcCccc Q lcl|NC_019423. 183 QALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVEMLNPNN--VVI--DPSCNGDLD 258 (756) Q Consensus 183 ~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie~V~p~~--~~~--Dp~a~~d~~ 258 (756) ........++.+.+.+|.|+ .++..+-. .-+|++ +.+ .|-. .++. T Consensus 106 ---------------~~~~a~s~Af~~~i~~G~G~-------~ev~~d~~--------~~d~~~~~~~i~~~~i~-~~~~ 154 (725) T protein:vir:92 106 ---------------TAKIAVNVAVREQIESGVGA-------WRLVTDYE--------DQSPTSNNQVIRREPIH-SACS 154 (725) T ss_pred ---------------CchHHHHHHHHHHhhcCcce-------eeeeeccc--------CCCCCCCceeeEEeecc-CChh Confidence 00111234455566666543 33322211 111211 111 1100 0011 Q ss_pred cCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceEEEEEEEEEeeccCCceeEEE Q lcl|NC_019423. 
259 KALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVAYEYWGFYDINDDGSLEPI 338 (756) Q Consensus 259 da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~E~w~k~d~~~~g~~~~~ 338 (756) ..-|=...+..++++...++ +.+.++...... ..+.+.. ...+ +.+...-.- ...-|.. .+.-.+.+++ T Consensus 155 ~V~~Dp~a~~~D~sDar~~~-~~~~~~~d~~~~----~~~~~~~-~~~~--~~~~~~~~~-~~~~~~~--~d~vrv~e~~ 223 (725) T protein:vir:92 155 HVIWDSNSKLMDKSDSRHCT-VIHSMSQNGWED----FAEKYDL-DADD--IPSFQNPND-WVFPWLT--QDTIQIAEFY 223 (725) T ss_pred hcccCchhhccChhhHHHHH-HHhcCCHHHHHH----HHhhcCc-chhh--hhhcccCCc-ccccccC--CCeEEEEEEE Confidence 01111123444555554432 122222100000 0111110 0000 000000000 0011221 1111222222 Q ss_pred EEEEE-----------CCEEEEeccccc--------------------------C---------CC--ccceEEeeeeee Q lcl|NC_019423. 339 VATWI-----------GSTLIRMENNPF--------------------------P---------DG--KLPLVVVPYMPR 370 (756) Q Consensus 339 ~~~~~-----------g~~~L~~~~~P~--------------------------~---------~~--~~Pfv~~~~~~~ 370 (756) +..+. ++.++...++-+ + ++ .+|.-.|++.|. T Consensus 224 ~r~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~~~~~~~~~~~P~vP~ 303 (725) T protein:vir:92 224 EVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPV 303 (725) T ss_pred EEEEEeeeEEeecCCCCCceeecChhhHHHHHHHHhccCchhhhhccceeeeEeeeeecchhhhcCCCCCCCCceeeEEE Confidence 22111 122222111100 0 00 112222333333 Q ss_pred cCccc---CCchH-HHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhhhhcccccccccccccc----ccc Q lcl|NC_019423. 371 KRELF---GEADA-ELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQGNP----SQS 442 (756) Q Consensus 371 ~~~~~---G~g~v-~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~~----~~~ 442 (756) -+... |.... -.++++-+.-......+.-.+...+...-....+..+..+..............+..++ ++. 
T Consensus 304 ~g~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~ 383 (725) T protein:vir:92 304 FGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGE 383 (725) T ss_pred EeeeeccCCcccccceeccchhHHHHHHHHHHHHHHHHHhccCcccccchhhhhHHHHHHhccCccceeecccccccccc Confidence 22221 11111 12222232222222222233333344333333333333333322221111111111111 011 Q ss_pred cc--cccCCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHH Q lcl|NC_019423. 443 IM--EHKFPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAI-LRRLAKGMADIGTK 519 (756) Q Consensus 443 i~--~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~-~~n~~~~~~~l~~~ 519 (756) +. ++.....++-....++++....+ .-...+|+++.++|...+++|++..++.+..+.. +-.|-+.++.-.+. T Consensus 384 ~~~~~i~~~~~~~~p~~~~~ll~~~~~----~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~l~~~~Dnl~~~~~~ 459 (725) T protein:vir:92 384 MPTQPLAYYENPEVPQANAYMLEAATA----AVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRR 459 (725) T ss_pred ccccCCcccCCCCchHHHHHHHHHHHH----HHHHHhCCCHHHhccCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11 11112223333344444444444 4456679988888888888999988877765553 35555666766677 Q ss_pred HHHHHHhhCCCCcEEEEecCceeecCHhHhcCcceEEEeccc---ccHHHHHHHHHHHHHHH---hhccCCH--hHHHHH Q lcl|NC_019423. 520 ICAMNAVFLSEKEVVRITNEQYVEIKREDLKGNFDIEVDINT---AEIDNQKSQDLGFMVQT---LGNTVDQ--SITLSL 591 (756) Q Consensus 520 ~l~li~q~~~~~r~iRI~g~~~v~i~~d~~~~~~Dv~V~~g~---a~~~~~~~q~l~~llq~---~~~~~~~--~~~~~~ 591 (756) +.+++..+...- .+.+..+.|...+- ...-+.++... .++.......+..-+.. .+|..+- +..... 
T Consensus 460 ~g~~lL~lI~~~----~~~~r~~RI~~edg-~~~~v~in~~~~~~~~G~~~~~Ndi~g~~Dv~v~~~p~~~s~r~~~~~~ 534 (725) T protein:vir:92 460 DGEIYQSIVNDI----YDVPRNVTITLEDG-SEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAE 534 (725) T ss_pred HHHHHHHHHHHh----cCCCcEEEEecCCC-CcceEEeccccccccccchhhhhccccceeeEEeeccChHHHHHHHHHH Confidence 677776553321 01223444543321 12223333221 11110000111000000 1111110 011122 Q ss_pred HHHHHhhcC--ChhHHHHhhhccCCC----------------Chhhh--hH-HHHHH-HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 592 VAKIAELKR--MPDLAHELRTWQPQP----------------DPMEE--QL-KQLAI-QKAQLENEELQSKIALNNAKAK 649 (756) Q Consensus 592 l~~l~e~~~--~~~~~~~l~~~~~q~----------------~p~~~--~~-~q~~~-~~aq~e~~~~qa~a~~~~a~a~ 649 (756) +.+++.... .+.....+..+..-+ .+... +. ++.++ ..++++.+..+..+...+++++ T Consensus 535 l~ql~~~~~~~~~~~~~~l~~~~~~~d~~~~~e~~erirkq~~~~~~~~~~~~e~~q~~~~~qqa~~~q~~~e~~~~qa~ 614 (725) T protein:vir:92 535 ILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQGV 614 (725) T ss_pred HHHHHHhcccchhHHHHHHHHHhhcccchHHHHHHHHHHhhhchhccCCccchhhhHHHHHHHHHHHhhhHHHHHHHHHH Confidence 222222211 111111111111111 11100 00 00011 1111112222233333334443 Q ss_pred HHHHHHHHHHHHHHHHHHH---HHHHHHHHHH-------HHHHHHHHHHH---HHHHHHH------HhhccCCchhhhcc Q lcl|NC_019423. 650 EAASSGDLKDLDYLEQESG---TKHARDMEKQ-------KAQSQGNQNLQ---ITKALTT------PTKEGETTPNISAA 710 (756) Q Consensus 650 ~~~aq~~~~~~~~~~q~~~---~k~~~~~~~~-------~~q~~~~~~~~---~~~a~~~------~~~~~~~~~~~~~a 710 (756) .+++++++++++......+ .+...+-++. ..++.+.++.+ +++.... ......+...+.+. T Consensus 615 ~~~~qae~~kaqaE~~k~q~~a~~~~~~a~~~aa~~~~~~~q~~~~q~~~~~~~~~~~~~~q~~~~~~a~~~ae~~l~~~ 694 (725) T protein:vir:92 615 LLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDARANAELLLKGN 694 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHhchHHHHHH Confidence 3444444433322211111 1111111111 11111111111 1111111 11111111111111 Q ss_pred C--CCCCCCcccCchhcCCCCCCCCccccccccccCC Q lcl|NC_019423. 
711 V--GYNTLTNGNSPQERDLAAQQDPAYSLGSQYYDPS 745 (756) Q Consensus 711 ~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 745 (756) . +..-+.-.+.. +-..+..+|+-.-. +|. T Consensus 695 ~~~~~~~~d~~~~~--~~~~~~~~~~~~~~----~~~ 725 (725) T protein:vir:92 695 EQTHKQRMDIANIL--QSQRQNQPSGSVAE----TPQ 725 (725) T ss_pred HHHHHHHHHHHHHh--cchhccCCcccccc----CCC Confidence 0 01000001111 11224444443333 344 No 28 >protein:vir:98506 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996583;genbank:gi:45569514;genbank:GeneID:2767834 Probab=100.00 E-value=3.6e-28 Score=171.06 Aligned_cols=521 Identities=12% Similarity=0.081 Sum_probs=291.0 Q ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccc--CCCCCCCCCC---CcccCHHHHHHHHHHHHHHHHhhcC- Q lcl|NC_019423. 22 WKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKG--KAKPPKIKGR---SQVQPRLVRRQAEWRYAPLSEPFLS- 95 (756) Q Consensus 22 ~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~--~~~~~~~~gr---S~~v~~~v~~~~e~~~~~L~~~f~~- 95 (756) |.+....+.|+..++..++.+++.++.|++-.+|..... ..+.....|+ .++++....+.++.+.+.|+.-+|+ T Consensus 1 M~~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~ltpp 80 (555) T protein:vir:98 1 MAEQTERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILDNTGTRALRVLAAGMMAGMTSP 80 (555) T ss_pred CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhcccccccccHHHHHHHHHHHHHHhhcCC Confidence 888888888999999999999999999999999985331 1122223333 5589999999999999999999998 Q ss_pred CCCEEEEecCCcchHHHHH------HHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeeeeeeeeeec Q lcl|NC_019423. 96 SSKLFKLTPVTFEDELAAR------QNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIKTETPVFQL 169 (756) Q Consensus 96 ~~~~~~~~p~~~~D~~~A~------q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~ 169 (756) +.+||++.+..++..+.+. ..+..+.-.|. 
.++-+..++.++++++..|+|++=+-++ T Consensus 81 ~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~-~snf~~~~~~~~~~Lv~~G~a~l~~~~d--------------- 144 (555) T protein:vir:98 81 ARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFA-KSNTYRALHSMYEELGAFGTASSIVLPD--------------- 144 (555) T ss_pred CCcccccccCcccccchHHHHHHHHHHHHHHHHHHH-hcCcHHHHHHHHHHHHhhCceEEEEecC--------------- Confidence 8999999997665433222 23344433333 3444555888888888888887542110 Q ss_pred CCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEEEechhheEe Q lcl|NC_019423. 170 YPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVEMLNPNNVVI 249 (756) Q Consensus 170 ~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie~V~p~~~~~ 249 (756) ..+.+++..++..+|++ T Consensus 145 ---------------------------------------------------------------~~~~~rf~~~pl~~~~v 161 (555) T protein:vir:98 145 ---------------------------------------------------------------FDAVVYHHSLTAGEYAI 161 (555) T ss_pred ---------------------------------------------------------------CCceEEEEEeecceeEE Confidence 01235678889999999 Q ss_pred CCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceEEEEEEEEEe-e Q lcl|NC_019423. 250 DPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVAYEYWGFY-D 328 (756) Q Consensus 250 Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~E~w~k~-d 328 (756) ..++...++ -++++..+|..++...++. ++|.. .... ..+ .+....+|.|+.+.+.. + T Consensus 162 ~~d~~G~vd---~i~r~~~~t~~ql~~~fg~-~~l~~---~~~~---~~~-----------~~~~~~~v~v~~~V~pr~~ 220 (555) T protein:vir:98 162 AADNQGRVN---TLYREFQITVAQMVREFGK-DKCST---TVQS---LFD-----------RGALEQWVTVIHAIEPRAD 220 (555) T ss_pred eeCCCCCEE---EEEEEEeccHHHHHHhcCc-ccCCH---HHHH---HHh-----------cCCCCceEEEEEEEeeccC Confidence 888776554 4678899999999877653 22211 1000 000 01112358888876532 2 Q ss_pred ccCCc---e-eEEEEEEE---E-CCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 
329 INDDG---S-LEPIVATW---I-GSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGATMRGMIDL 400 (756) Q Consensus 329 ~~~~g---~-~~~~~~~~---~-g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~ 400 (756) .+..+ . +.+.-+.| + |.+++ .++.| .++||++..|...++..||+|++....+-.+.+|++.+..+.+ T Consensus 221 ~~~~~~~~~~~p~~s~~~~~~~d~~~vl--~esgy--~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~ 296 (555) T protein:vir:98 221 RDPSKRDDRNMAWKSVYFEPGADETRTL--RESGY--RSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQA 296 (555) T ss_pred cCcCCCCccccceEEEEEEeccCCcccc--ccCCc--ccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHH Confidence 21111 1 11111222 1 33455 45555 4699999999999999999999999999999999999999999 Q ss_pred HHhhcCCceEeeccccCccchhhhhccccccccccccccccccccc-cCCCcchHHHHHHHHHHHHHHHHhchhH-HhcC Q lcl|NC_019423. 401 LGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQGNPSQSIMEH-KFPELPQSAIVMTQMQNQEAESLTGVKA-FSGG 478 (756) Q Consensus 401 l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-~~~~~~~~~~~~l~~~~~~~e~~tGv~~-~~~G 478 (756) +.+..+|.+.++.+.... .....++....+. ...+...+.+. ....--+.+...++...+.+.... ..+ +.+. T Consensus 297 ~~~~~~pp~~v~~~~~~~--~~~~~pgg~~~v~--~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af-~~dlf~~l 371 (555) T protein:vir:98 297 IDYKSNPPLQLPVSAKNQ--DISTVPGGLSYVD--AAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASF-YADLFLML 371 (555) T ss_pred HHHHhcCceeeccccccc--cceeccccccccc--cCCCCcceecccccccchHHHHHHHHHHHHHHHHHh-hcchhhhc Confidence 999999999998775421 1222222211111 11122223222 111112334455666666665443 222 1122 Q ss_pred CCccccchhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHhcCcceEEE Q lcl|NC_019423. 479 VTGSAYGDVAAGIRGALDAASKREMAILRRLA-KGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDLKGNFDIEV 557 (756) Q Consensus 479 ~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~-~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~~~~~Dv~V 557 (756) ...++...||++|..+.+.....|..++-+|. +++.++.++.+.++.+.. 
.++.-|+.+.+ .+|.| T Consensus 372 ~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g------------~lP~~P~~l~~-~~i~v 438 (555) T protein:vir:98 372 ANGTNPQMTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEAN------------ILPPPPQEMQG-VDLNV 438 (555) T ss_pred cCCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcC------------CCCCCchhhcC-ceeEE Confidence 22344458999999999999999999999986 588999999999988742 23333455544 34544 Q ss_pred ecccccHHHHHHHH---HHHHHHHhhc--cCCHhH-----HHHHHHHHHhhcCChhHHHHhhhccCCCChhhhhHHHHHH Q lcl|NC_019423. 558 DINTAEIDNQKSQD---LGFMVQTLGN--TVDQSI-----TLSLVAKIAELKRMPDLAHELRTWQPQPDPMEEQLKQLAI 627 (756) Q Consensus 558 ~~g~a~~~~~~~q~---l~~llq~~~~--~~~~~~-----~~~~l~~l~e~~~~~~~~~~l~~~~~q~~p~~~~~~q~~~ 627 (756) ..-++....++... +..+++.+++ .++|+. .-+++..+++..|+|. ..++. ++ +.+++.+ T Consensus 439 ~yis~La~aq~~~~~~~i~~~l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~--~~irs------~e--ev~~~r~ 508 (555) T protein:vir:98 439 EFVSMLAQAQRAIATNSVDRFVGNLGAVAGIKPEVLDKFDADRWADTYADMLGIDP--ELIVP------GN--QVALIRK 508 (555) T ss_pred EeccHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCc--cccCC------HH--HHHHHHH Confidence 44334333333332 2233333321 233432 2234455555556551 22221 11 1111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCC Q lcl|NC_019423. 628 QKAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARDMEKQKAQSQGNQNLQITKALTTPTKEGET 703 (756) Q Consensus 628 ~~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~~~~~~q~~~~~~~~~~~a~~~~~~~~~~ 703 (756) ++++++.+ ++++++ +.++. 
+..+..+.+ ++ ..+ ......+.| ..+.+ T Consensus 509 qr~~~~q~--~~~a~~------~~q~~------~~~~~~~~~----~~--~~~----~~~~~~~~~-----~~~~~ 555 (555) T protein:vir:98 509 QRADQQQA--AQQAAL------LNQGA------DTAAKLGSV----DT--SKQ----NALTDVTRA-----FSGYT 555 (555) T ss_pred HHHHHHHH--HHHHHH------HHHHH------HHHHHhccc----cc--Ccc----hhHHHHHhh-----hccCC Confidence 11111100 000000 00000 000000000 00 000 000111111 11222 No 29 >protein:vir:107822 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996631;genbank:gi:45580765;genbank:GeneID:2767898 Probab=100.00 E-value=3.6e-28 Score=171.06 Aligned_cols=521 Identities=12% Similarity=0.081 Sum_probs=291.0 Q ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccc--CCCCCCCCCC---CcccCHHHHHHHHHHHHHHHHhhcC- Q lcl|NC_019423. 22 WKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKG--KAKPPKIKGR---SQVQPRLVRRQAEWRYAPLSEPFLS- 95 (756) Q Consensus 22 ~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~--~~~~~~~~gr---S~~v~~~v~~~~e~~~~~L~~~f~~- 95 (756) |.+....+.|+..++..++.+++.++.|++-.+|..... ..+.....|+ .++++....+.++.+.+.|+.-+|+ T Consensus 1 M~~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~ltpp 80 (555) T protein:vir:10 1 MAEQTERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILDNTGTRALRVLAAGMMAGMTSP 80 (555) T ss_pred CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhcccccccccHHHHHHHHHHHHHHhhcCC Confidence 888888888999999999999999999999999985331 1122223333 5589999999999999999999998 Q ss_pred CCCEEEEecCCcchHHHHH------HHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeeeeeeeeeec Q lcl|NC_019423. 96 SSKLFKLTPVTFEDELAAR------QNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIKTETPVFQL 169 (756) Q Consensus 96 ~~~~~~~~p~~~~D~~~A~------q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~ 169 (756) +.+||++.+..++..+.+. ..+..+.-.|. 
.++-+..++.++++++..|+|++=+-++ T Consensus 81 ~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~-~snf~~~~~~~~~~Lv~~G~a~l~~~~d--------------- 144 (555) T protein:vir:10 81 ARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFA-KSNTYRALHSMYEELGAFGTASSIVLPD--------------- 144 (555) T ss_pred CCcccccccCcccccchHHHHHHHHHHHHHHHHHHH-hcCcHHHHHHHHHHHHhhCceEEEEecC--------------- Confidence 8999999997665433222 23344433333 3444555888888888888887542110 Q ss_pred CCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEEEechhheEe Q lcl|NC_019423. 170 YPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVEMLNPNNVVI 249 (756) Q Consensus 170 ~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie~V~p~~~~~ 249 (756) ..+.+++..++..+|++ T Consensus 145 ---------------------------------------------------------------~~~~~rf~~~pl~~~~v 161 (555) T protein:vir:10 145 ---------------------------------------------------------------FDAVVYHHSLTAGEYAI 161 (555) T ss_pred ---------------------------------------------------------------CCceEEEEEeecceeEE Confidence 01235678889999999 Q ss_pred CCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceEEEEEEEEEe-e Q lcl|NC_019423. 250 DPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVAYEYWGFY-D 328 (756) Q Consensus 250 Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~E~w~k~-d 328 (756) ..++...++ -++++..+|..++...++. ++|.. .... ..+ .+....+|.|+.+.+.. + T Consensus 162 ~~d~~G~vd---~i~r~~~~t~~ql~~~fg~-~~l~~---~~~~---~~~-----------~~~~~~~v~v~~~V~pr~~ 220 (555) T protein:vir:10 162 AADNQGRVN---TLYREFQITVAQMVREFGK-DKCST---TVQS---LFD-----------RGALEQWVTVIHAIEPRAD 220 (555) T ss_pred eeCCCCCEE---EEEEEEeccHHHHHHhcCc-ccCCH---HHHH---HHh-----------cCCCCceEEEEEEEeeccC Confidence 888776554 4678899999999877653 22211 1000 000 01112358888876532 2 Q ss_pred ccCCc---e-eEEEEEEE---E-CCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 
329 INDDG---S-LEPIVATW---I-GSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGATMRGMIDL 400 (756) Q Consensus 329 ~~~~g---~-~~~~~~~~---~-g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~ 400 (756) .+..+ . +.+.-+.| + |.+++ .++.| .++||++..|...++..||+|++....+-.+.+|++.+..+.+ T Consensus 221 ~~~~~~~~~~~p~~s~~~~~~~d~~~vl--~esgy--~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~ 296 (555) T protein:vir:10 221 RDPSKRDDRNMAWKSVYFEPGADETRTL--RESGY--RSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQA 296 (555) T ss_pred cCcCCCCccccceEEEEEEeccCCcccc--ccCCc--ccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHH Confidence 21111 1 11111222 1 33455 45555 4699999999999999999999999999999999999999999 Q ss_pred HHhhcCCceEeeccccCccchhhhhccccccccccccccccccccc-cCCCcchHHHHHHHHHHHHHHHHhchhH-HhcC Q lcl|NC_019423. 401 LGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQGNPSQSIMEH-KFPELPQSAIVMTQMQNQEAESLTGVKA-FSGG 478 (756) Q Consensus 401 l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-~~~~~~~~~~~~l~~~~~~~e~~tGv~~-~~~G 478 (756) +.+..+|.+.++.+.... .....++....+. ...+...+.+. ....--+.+...++...+.+.... ..+ +.+. T Consensus 297 ~~~~~~pp~~v~~~~~~~--~~~~~pgg~~~v~--~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af-~~dlf~~l 371 (555) T protein:vir:10 297 IDYKSNPPLQLPVSAKNQ--DISTVPGGLSYVD--AAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASF-YADLFLML 371 (555) T ss_pred HHHHhcCceeeccccccc--cceeccccccccc--cCCCCcceecccccccchHHHHHHHHHHHHHHHHHh-hcchhhhc Confidence 999999999998775421 1222222211111 11122223222 111112334455666666665443 222 1122 Q ss_pred CCccccchhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHhcCcceEEE Q lcl|NC_019423. 479 VTGSAYGDVAAGIRGALDAASKREMAILRRLA-KGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDLKGNFDIEV 557 (756) Q Consensus 479 ~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~-~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~~~~~Dv~V 557 (756) ...++...||++|..+.+.....|..++-+|. +++.++.++.+.++.+.. 
.++.-|+.+.+ .+|.| T Consensus 372 ~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g------------~lP~~P~~l~~-~~i~v 438 (555) T protein:vir:10 372 ANGTNPQMTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEAN------------ILPPPPQEMQG-VDLNV 438 (555) T ss_pred cCCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcC------------CCCCCchhhcC-ceeEE Confidence 22344458999999999999999999999986 588999999999988742 23333455544 34544 Q ss_pred ecccccHHHHHHHH---HHHHHHHhhc--cCCHhH-----HHHHHHHHHhhcCChhHHHHhhhccCCCChhhhhHHHHHH Q lcl|NC_019423. 558 DINTAEIDNQKSQD---LGFMVQTLGN--TVDQSI-----TLSLVAKIAELKRMPDLAHELRTWQPQPDPMEEQLKQLAI 627 (756) Q Consensus 558 ~~g~a~~~~~~~q~---l~~llq~~~~--~~~~~~-----~~~~l~~l~e~~~~~~~~~~l~~~~~q~~p~~~~~~q~~~ 627 (756) ..-++....++... +..+++.+++ .++|+. .-+++..+++..|+|. ..++. ++ +.+++.+ T Consensus 439 ~yis~La~aq~~~~~~~i~~~l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~--~~irs------~e--ev~~~r~ 508 (555) T protein:vir:10 439 EFVSMLAQAQRAIATNSVDRFVGNLGAVAGIKPEVLDKFDADRWADTYADMLGIDP--ELIVP------GN--QVALIRK 508 (555) T ss_pred EeccHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCc--cccCC------HH--HHHHHHH Confidence 44334333333332 2233333321 233432 2234455555556551 22221 11 1111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCC Q lcl|NC_019423. 628 QKAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARDMEKQKAQSQGNQNLQITKALTTPTKEGET 703 (756) Q Consensus 628 ~~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~~~~~~q~~~~~~~~~~~a~~~~~~~~~~ 703 (756) ++++++.+ ++++++ +.++. 
+..+..+.+ ++ ..+ ......+.| ..+.+ T Consensus 509 qr~~~~q~--~~~a~~------~~q~~------~~~~~~~~~----~~--~~~----~~~~~~~~~-----~~~~~ 555 (555) T protein:vir:10 509 QRADQQQA--AQQAAL------LNQGA------DTAAKLGSV----DT--SKQ----NALTDVTRA-----FSGYT 555 (555) T ss_pred HHHHHHHH--HHHHHH------HHHHH------HHHHHhccc----cc--Ccc----hhHHHHHhh-----hccCC Confidence 11111100 000000 00000 000000000 00 000 000111111 11222 No 30 >protein:vir:107404 Length: 555 # NCBI annotation: Bbp21 # Family: family:all:481 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958690;genbank:gi:41179382;genbank:GeneID:2717198 Probab=100.00 E-value=3.6e-28 Score=171.06 Aligned_cols=521 Identities=12% Similarity=0.081 Sum_probs=291.0 Q ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccc--CCCCCCCCCC---CcccCHHHHHHHHHHHHHHHHhhcC- Q lcl|NC_019423. 22 WKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKG--KAKPPKIKGR---SQVQPRLVRRQAEWRYAPLSEPFLS- 95 (756) Q Consensus 22 ~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~--~~~~~~~~gr---S~~v~~~v~~~~e~~~~~L~~~f~~- 95 (756) |.+....+.|+..++..++.+++.++.|++-.+|..... ..+.....|+ .++++....+.++.+.+.|+.-+|+ T Consensus 1 M~~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~ltpp 80 (555) T protein:vir:10 1 MAEQTERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILDNTGTRALRVLAAGMMAGMTSP 80 (555) T ss_pred CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhcccccccccHHHHHHHHHHHHHHhhcCC Confidence 888888888999999999999999999999999985331 1122223333 5589999999999999999999998 Q ss_pred CCCEEEEecCCcchHHHHH------HHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeeeeeeeeeec Q lcl|NC_019423. 96 SSKLFKLTPVTFEDELAAR------QNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIKTETPVFQL 169 (756) Q Consensus 96 ~~~~~~~~p~~~~D~~~A~------q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~ 169 (756) +.+||++.+..++..+.+. ..+..+.-.|. 
.++-+..++.++++++..|+|++=+-++ T Consensus 81 ~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~-~snf~~~~~~~~~~Lv~~G~a~l~~~~d--------------- 144 (555) T protein:vir:10 81 ARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFA-KSNTYRALHSMYEELGAFGTASSIVLPD--------------- 144 (555) T ss_pred CCcccccccCcccccchHHHHHHHHHHHHHHHHHHH-hcCcHHHHHHHHHHHHhhCceEEEEecC--------------- Confidence 8999999997665433222 23344433333 3444555888888888888887542110 Q ss_pred CCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEEEechhheEe Q lcl|NC_019423. 170 YPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVEMLNPNNVVI 249 (756) Q Consensus 170 ~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie~V~p~~~~~ 249 (756) ..+.+++..++..+|++ T Consensus 145 ---------------------------------------------------------------~~~~~rf~~~pl~~~~v 161 (555) T protein:vir:10 145 ---------------------------------------------------------------FDAVVYHHSLTAGEYAI 161 (555) T ss_pred ---------------------------------------------------------------CCceEEEEEeecceeEE Confidence 01235678889999999 Q ss_pred CCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceEEEEEEEEEe-e Q lcl|NC_019423. 250 DPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVAYEYWGFY-D 328 (756) Q Consensus 250 Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~E~w~k~-d 328 (756) ..++...++ -++++..+|..++...++. ++|.. .... ..+ .+....+|.|+.+.+.. + T Consensus 162 ~~d~~G~vd---~i~r~~~~t~~ql~~~fg~-~~l~~---~~~~---~~~-----------~~~~~~~v~v~~~V~pr~~ 220 (555) T protein:vir:10 162 AADNQGRVN---TLYREFQITVAQMVREFGK-DKCST---TVQS---LFD-----------RGALEQWVTVIHAIEPRAD 220 (555) T ss_pred eeCCCCCEE---EEEEEEeccHHHHHHhcCc-ccCCH---HHHH---HHh-----------cCCCCceEEEEEEEeeccC Confidence 888776554 4678899999999877653 22211 1000 000 01112358888876532 2 Q ss_pred ccCCc---e-eEEEEEEE---E-CCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 
329 INDDG---S-LEPIVATW---I-GSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGATMRGMIDL 400 (756) Q Consensus 329 ~~~~g---~-~~~~~~~~---~-g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~ 400 (756) .+..+ . +.+.-+.| + |.+++ .++.| .++||++..|...++..||+|++....+-.+.+|++.+..+.+ T Consensus 221 ~~~~~~~~~~~p~~s~~~~~~~d~~~vl--~esgy--~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~ 296 (555) T protein:vir:10 221 RDPSKRDDRNMAWKSVYFEPGADETRTL--RESGY--RSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQA 296 (555) T ss_pred cCcCCCCccccceEEEEEEeccCCcccc--ccCCc--ccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHH Confidence 21111 1 11111222 1 33455 45555 4699999999999999999999999999999999999999999 Q ss_pred HHhhcCCceEeeccccCccchhhhhccccccccccccccccccccc-cCCCcchHHHHHHHHHHHHHHHHhchhH-HhcC Q lcl|NC_019423. 401 LGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQGNPSQSIMEH-KFPELPQSAIVMTQMQNQEAESLTGVKA-FSGG 478 (756) Q Consensus 401 l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-~~~~~~~~~~~~l~~~~~~~e~~tGv~~-~~~G 478 (756) +.+..+|.+.++.+.... .....++....+. ...+...+.+. ....--+.+...++...+.+.... ..+ +.+. T Consensus 297 ~~~~~~pp~~v~~~~~~~--~~~~~pgg~~~v~--~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af-~~dlf~~l 371 (555) T protein:vir:10 297 IDYKSNPPLQLPVSAKNQ--DISTVPGGLSYVD--AAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASF-YADLFLML 371 (555) T ss_pred HHHHhcCceeeccccccc--cceeccccccccc--cCCCCcceecccccccchHHHHHHHHHHHHHHHHHh-hcchhhhc Confidence 999999999998775421 1222222211111 11122223222 111112334455666666665443 222 1122 Q ss_pred CCccccchhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHhcCcceEEE Q lcl|NC_019423. 479 VTGSAYGDVAAGIRGALDAASKREMAILRRLA-KGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDLKGNFDIEV 557 (756) Q Consensus 479 ~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~-~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~~~~~Dv~V 557 (756) ...++...||++|..+.+.....|..++-+|. +++.++.++.+.++.+.. 
.++.-|+.+.+ .+|.| T Consensus 372 ~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g------------~lP~~P~~l~~-~~i~v 438 (555) T protein:vir:10 372 ANGTNPQMTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEAN------------ILPPPPQEMQG-VDLNV 438 (555) T ss_pred cCCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcC------------CCCCCchhhcC-ceeEE Confidence 22344458999999999999999999999986 588999999999988742 23333455544 34544 Q ss_pred ecccccHHHHHHHH---HHHHHHHhhc--cCCHhH-----HHHHHHHHHhhcCChhHHHHhhhccCCCChhhhhHHHHHH Q lcl|NC_019423. 558 DINTAEIDNQKSQD---LGFMVQTLGN--TVDQSI-----TLSLVAKIAELKRMPDLAHELRTWQPQPDPMEEQLKQLAI 627 (756) Q Consensus 558 ~~g~a~~~~~~~q~---l~~llq~~~~--~~~~~~-----~~~~l~~l~e~~~~~~~~~~l~~~~~q~~p~~~~~~q~~~ 627 (756) ..-++....++... +..+++.+++ .++|+. .-+++..+++..|+|. ..++. ++ +.+++.+ T Consensus 439 ~yis~La~aq~~~~~~~i~~~l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~--~~irs------~e--ev~~~r~ 508 (555) T protein:vir:10 439 EFVSMLAQAQRAIATNSVDRFVGNLGAVAGIKPEVLDKFDADRWADTYADMLGIDP--ELIVP------GN--QVALIRK 508 (555) T ss_pred EeccHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCc--cccCC------HH--HHHHHHH Confidence 44334333333332 2233333321 233432 2234455555556551 22221 11 1111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCC Q lcl|NC_019423. 628 QKAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARDMEKQKAQSQGNQNLQITKALTTPTKEGET 703 (756) Q Consensus 628 ~~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~~~~~~q~~~~~~~~~~~a~~~~~~~~~~ 703 (756) ++++++.+ ++++++ +.++. 
+..+..+.+ ++ ..+ ......+.| ..+.+ T Consensus 509 qr~~~~q~--~~~a~~------~~q~~------~~~~~~~~~----~~--~~~----~~~~~~~~~-----~~~~~ 555 (555) T protein:vir:10 509 QRADQQQA--AQQAAL------LNQGA------DTAAKLGSV----DT--SKQ----NALTDVTRA-----FSGYT 555 (555) T ss_pred HHHHHHHH--HHHHHH------HHHHH------HHHHHhccc----cc--Ccc----hhHHHHHhh-----hccCC Confidence 11111100 000000 00000 000000000 00 000 000111111 11222 No 31 >protein:vir:102668 Length: 547 # NCBI annotation: Hypothetical protein # Family: family:all:481 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024419;genbank:gi:48696640;genbank:GeneID:2948135 Probab=100.00 E-value=5e-28 Score=170.28 Aligned_cols=506 Identities=9% Similarity=0.020 Sum_probs=284.6 Q ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccC--CCCCC------CCCCCcccCHHHHHHHHHHHHHHHHhh Q lcl|NC_019423. 22 WKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGK--AKPPK------IKGRSQVQPRLVRRQAEWRYAPLSEPF 93 (756) Q Consensus 22 ~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~--~~~~~------~~grS~~v~~~v~~~~e~~~~~L~~~f 93 (756) |. .+.|++.++..++.+++.++.|++-.+|...... .+... .+..+++++....+.++.+.+.|+..+ T Consensus 1 ~~----~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~~i~dst~~~a~~~Las~L~~~l 76 (547) T protein:vir:10 1 ME----NSKIVKRLDFLKTDRKNVEQIWDCIRKYIMPMRSDFFSDLRSEGSINWNQNREVFDSTAGDGLETLSSSLHGSL 76 (547) T ss_pred CC----HHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccccccCCCCCcccccccccccccchHHHHHHHHHHHHHHhh Confidence 33 3556678899999999999999999999853211 11111 123467899999999999999999999 Q ss_pred cC-CCCEEEEecCCcchHHHH------HHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeeeeeeee Q lcl|NC_019423. 94 LS-SSKLFKLTPVTFEDELAA------RQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIKTETPV 166 (756) Q Consensus 94 ~~-~~~~~~~~p~~~~D~~~A------~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~ 166 (756) |+ +.+||++.+...+..+.+ ++.+..|.-.|. 
.++-+..++.++++++..|+|++.+..+ T Consensus 77 tPp~~~WF~l~~~d~~~~~~~~v~~~L~~ve~~i~~~l~-~snf~~~~~~~~~~L~~~G~a~l~~~~d------------ 143 (547) T protein:vir:10 77 TSPATKWFELAFRDKELNSDDECRKWLENATHDVYSALQ-DSNFNLEANETYIDLCGYGNAIMVEEED------------ 143 (547) T ss_pred cCCCCcccccccCCccccchHHHHHHHHHHHHHHHHHHH-hcCcHHHHHHHHHHHHhHCcEeEEeccC------------ Confidence 98 799999987544322222 233334433333 3444555777888888888887664211 Q ss_pred eecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEEEechhh Q lcl|NC_019423. 167 FQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVEMLNPNN 246 (756) Q Consensus 167 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie~V~p~~ 246 (756) ....+.+++..++..+ T Consensus 144 ----------------------------------------------------------------~~~~~~~r~~~~pl~~ 159 (547) T protein:vir:10 144 ----------------------------------------------------------------EDEEGSVVFQSSPIQD 159 (547) T ss_pred ----------------------------------------------------------------CCCCCceeEEEeecce Confidence 0012457789999999 Q ss_pred eEeCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceEEEEEEEEE Q lcl|NC_019423. 247 VVIDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVAYEYWGF 326 (756) Q Consensus 247 ~~~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~E~w~k 326 (756) |++..++...++. ++++..+|..++.+.++. +.|+ .+.... ..... +....++.|+.|.+. T Consensus 160 ~~v~~d~~G~v~~---i~r~~~~t~~qi~~~fg~-~~l~---~~v~~~------~~~~~------~~~~~~~~v~~~v~~ 220 (547) T protein:vir:10 160 SYFEEDSRGQVVN---FYRVFRWTPAQIYDRFGD-EGTP---EAIIKK------AKEAS------NQAALKQEVVMCVFT 220 (547) T ss_pred EEEeeCCCcCeee---eeeeeeccHHHHHHhcCc-ccCC---HHHHHH------HhcCC------CcccceEEEEEEEee Confidence 9999988766553 578899999999887653 2221 110000 00000 111235666665443 Q ss_pred -eeccCCc---e----eE-EEEEE--EEC--CEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHHH Q lcl|NC_019423. 
327 -YDINDDG---S----LE-PIVAT--WIG--STLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGAT 393 (756) Q Consensus 327 -~d~~~~g---~----~~-~~~~~--~~g--~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~~ 393 (756) .+.+.+. . .. ....+ ..+ .+++ .++.| .++||++..|...++..||.|.+....+-.+.+|.+ T Consensus 221 ~~~~~~~~~~~~~~~~~~~p~~s~~~e~~~~~~~l--~esg~--~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l 296 (547) T protein:vir:10 221 RYDKKQNRNAGTVLAPTERPFGKKWILKEGAVQLG--EEGGY--YEMPAYAIRWRKSAGSQWGFGPSHLALPDVLTANRY 296 (547) T ss_pred ccCCCCCccccceeeccccceeEEEEEecCceeee--ecCCc--ccCCeeeeeeeecCCcccccchHHHHHHHHHHHHHH Confidence 2221110 0 00 01111 123 3444 44555 469999999999999999999999999999999999 Q ss_pred HHHHHHHHHhhcCCceEeeccccCccchhhhhccccccccccccccccccccccCCCcchHHHHHHHHHHHHHHHHhchh Q lcl|NC_019423. 394 MRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQGNPSQSIMEHKFPELPQSAIVMTQMQNQEAESLTGVK 473 (756) Q Consensus 394 ~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~e~~tGv~ 473 (756) .+.+++...++.+|.++++.+.+... ..... .+.+..+....++++....--......++.+.+.|...-=+. T Consensus 297 ~~~~l~~~~~~~~pp~~v~~~g~~~~--~~~~p-----gg~~~~~~~~~v~pl~~~~~~~~~~~~i~~~~~rI~~af~~d 369 (547) T protein:vir:10 297 VELVLRSSEKVIDPAIMVTERGLISD--IDLGA-----SGLTVVRDMESMKPFESRARFDVSSIQLTDLRSAVRRIYYVD 369 (547) T ss_pred HHHHHHHHHHHhcCceeccccccccc--ceecC-----CeeeecCCcccceeeecccchHHHHHHHHHHHHHHHHHhhhh Confidence 99999999999999999886554321 11111 222222344456666555433444555666666555543111 Q ss_pred HHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHh-c- Q lcl|NC_019423. 474 AFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLA-KGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDL-K- 550 (756) Q Consensus 474 ~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~-~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~-~- 550 (756) .+ +.. ++...||++|..+.+.....|..++.+|. +++.++..+.+.++.+.. .++--|+.+ . 
T Consensus 370 ~~--~~~-~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g------------~lP~~p~~l~~~ 434 (547) T protein:vir:10 370 QL--QMK-DSPAMTATEVQVRYELMQRLLGPTLGRLENDFLSPMIQRTFNIRFRAG------------KLGELPSKLLES 434 (547) T ss_pred hh--hcC-CCccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcC------------CCCCCchhhhcc Confidence 11 222 23458999999999999999999999997 588999999999988642 222223333 2 Q ss_pred CcceEEEecccccHHHHHHHH---HHHHHHHhhc--cCCHhH-----HHHHHHHHHhhcCChhHHHHhhhccCCCChhhh Q lcl|NC_019423. 551 GNFDIEVDINTAEIDNQKSQD---LGFMVQTLGN--TVDQSI-----TLSLVAKIAELKRMPDLAHELRTWQPQPDPMEE 620 (756) Q Consensus 551 ~~~Dv~V~~g~a~~~~~~~q~---l~~llq~~~~--~~~~~~-----~~~~l~~l~e~~~~~~~~~~l~~~~~q~~p~~~ 620 (756) +..++.|..-.+....++... +...++.+++ .+.|++ .-.++..+++..|+|. ..++. +.+ T Consensus 435 ~~~~~~v~~is~Laraq~~~~~~~i~~~~~~v~~laq~~P~vld~id~d~~~~~~a~~~Gvp~--~~irs------~ee- 505 (547) T protein:vir:10 435 GKAAMDIVYTGPLSRAQKIDQAASIERWAGSTAQLAEINPEVLDIPDWDEMVRMLGSLLGAPQ--TLMRP------KAK- 505 (547) T ss_pred CcceEEEEeccHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhhcCCHHHHHHHHHHHhCCCh--hccCC------HHH- Confidence 223455554334443333332 2333333322 233432 2244555666666651 22221 111 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 621 QLKQLAIQKAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHAR 673 (756) Q Consensus 621 ~~~q~~~~~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~ 673 (756) .+++.+++++ +++.++++....+..+.+.++.. 
..+..|+-+ T Consensus 506 -v~~~r~qr~~--~~q~~~qaa~~~~~g~~m~~~~~--------~~a~~~~~~ 547 (547) T protein:vir:10 506 -VTSIRKNRSQ--TQQKAEQAAIAEAEGNAMEAQGK--------GQAALKENQ 547 (547) T ss_pred -HHHHHHHHHH--HHHHHHHHHHHHHHHHHHHhhcC--------cccchhccC Confidence 1111111111 11111111111111111111000 000000000 No 32 >protein:vir:95315 Length: 559 # NCBI annotation: putative head-to-tail-joining protein # Family: family:all:481 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512261;genbank:gi:89152428;genbank:GeneID:3952984 Probab=99.96 E-value=1.3e-27 Score=168.08 Aligned_cols=523 Identities=12% Similarity=0.078 Sum_probs=281.7 Q ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccc--CCCCCCCC---CCCcccCHHHHHHHHHHHHHHHHhhcC- Q lcl|NC_019423. 22 WKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKG--KAKPPKIK---GRSQVQPRLVRRQAEWRYAPLSEPFLS- 95 (756) Q Consensus 22 ~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~--~~~~~~~~---grS~~v~~~v~~~~e~~~~~L~~~f~~- 95 (756) |.+ .+...|++.++.+++.++..++.|++-.+|..... ..+..... ..+++++....+.++.+.+.|+..+|+ T Consensus 1 m~~-~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~Las~l~~~ltpp 79 (559) T protein:vir:95 1 MAE-TTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITSP 79 (559) T ss_pred CCh-hhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCcCCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCC Confidence 544 66778899999999999999999999999974221 11112222 346789999999999999999999998 Q ss_pred CCCEEEEecCCcchHHHHH------HHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeeeeeeeeeec Q lcl|NC_019423. 96 SSKLFKLTPVTFEDELAAR------QNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIKTETPVFQL 169 (756) Q Consensus 96 ~~~~~~~~p~~~~D~~~A~------q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~ 169 (756) +.+||++.+..++..+.+. ..+..+.-.|. 
.++-+..++.++++++..|+|++-+-++ T Consensus 80 ~~~WF~l~~~d~~~~e~~~v~~~L~~ve~~~~~~l~-~snf~~~~~~~~~~L~~~Gta~l~~~~d--------------- 143 (559) T protein:vir:95 80 ARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFN-KSNLYQSLPQLYGSLGTYSTGAMAVLDD--------------- 143 (559) T ss_pred CCcccccccCCccccchHHHHHHHHHHHHHHHHHHH-hcCcHHHHHHHHHHHHhhCceeeEeecC--------------- Confidence 8999999886544333221 22223333333 4455555777888888888886543111 Q ss_pred CCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEEEechhheEe Q lcl|NC_019423. 170 YPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVEMLNPNNVVI 249 (756) Q Consensus 170 ~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie~V~p~~~~~ 249 (756) ..+.+++..++..+|++ T Consensus 144 ---------------------------------------------------------------~~~~~r~~~~~l~~~~v 160 (559) T protein:vir:95 144 ---------------------------------------------------------------DEDIIRTMPFPIGSYYL 160 (559) T ss_pred ---------------------------------------------------------------CCceeEEEEeecCeEEE Confidence 01235688899999999 Q ss_pred CCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceEEEEEEEE-Eee Q lcl|NC_019423. 250 DPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVAYEYWG-FYD 328 (756) Q Consensus 250 Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~E~w~-k~d 328 (756) ..++...++. ++++..+|..++...++. +++. ....... + .+...+.|.|+++-+ +.+ T Consensus 161 ~~d~~G~vd~---i~r~~~~t~~ql~~~fg~-~~l~---~~~~~~~---~-----------~~~~~~~v~v~~~V~pr~~ 219 (559) T protein:vir:95 161 ANSPRGSVDT---CFRKFSMTVRQLVQEFGL-NNVS---ESVKSMW---E-----------SGTYEKWIEVMHSVYPNID 219 (559) T ss_pred eeCCCCCeEE---EEEeEecCHHHHHHHcCc-ccCC---HHHHHHH---h-----------cCCCCCeEEEEEEEecccc Confidence 9888765553 678899999999877653 1221 1100000 0 011123577777633 333 Q ss_pred ccCCce----eEEEEEEEE---C-CEEEEecccccCCCccceEEeeeeeecCcccCCc-hHHHhHHHHHHHHHHHHHHHH Q lcl|NC_019423. 
329 INDDGS----LEPIVATWI---G-STLIRMENNPFPDGKLPLVVVPYMPRKRELFGEA-DAELLGDNQAILGATMRGMID 399 (756) Q Consensus 329 ~~~~g~----~~~~~~~~~---g-~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g-~v~~~~d~Q~~iN~~~~~~~d 399 (756) .+.++. ..+.-+.|. + .++++ ++.| .++||++..|...++..||+| .+....+-.+.+|.+.+..+. T Consensus 220 ~~~~~~~~~~~pf~s~~~e~~~~~~~~l~--esg~--~e~P~~~~Rw~~~~ge~YGrg~P~~~al~d~k~L~~l~~~~l~ 295 (559) T protein:vir:95 220 RDTSKLDSKNKPFKSVYYEVGGDNDKLLR--ESGF--DEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQ 295 (559) T ss_pred ccccccccccceEEEEEEEecCCCceeee--cCCc--ccCCccceeeeecCCccccccchHHHhhHHHHHHHHHHHHHHH Confidence 322111 011112221 2 35554 4455 569999999999999999999 699999999999999999999 Q ss_pred HHHhhcCCceEeeccccCccchhhhhccccccccccccccccccccccCCC-cchHHHHHHHHHHHHHHHHhchhHH-hc Q lcl|NC_019423. 400 LLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQGNPSQSIMEHKFPE-LPQSAIVMTQMQNQEAESLTGVKAF-SG 477 (756) Q Consensus 400 ~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~-~~~~~~~~l~~~~~~~e~~tGv~~~-~~ 477 (756) ...++.+|.++++.+.... .....++....+. ...+...+.+..... --..+...++.+.+.+....-..-+ +. T Consensus 296 ~~~~~~~pp~~v~~~~~~~--~~~l~pgg~~~~~--~~~~~~~i~p~~~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~l 371 (559) T protein:vir:95 296 LIDKATNPPMVAPTSLKNQ--RASLLPGDITYID--QITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMML 371 (559) T ss_pred HHHHHhcCceecccccccc--ceeeeccceeeeC--CCCCcccceeecccccchHHHHHHHHHHHHHHHHHhhhhhHHHh Confidence 9999999999998765421 1122222221111 111223444432211 1112223344444444433321111 11 Q ss_pred CCCccccchhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHhcCcceEE Q lcl|NC_019423. 478 GVTGSAYGDVAAGIRGALDAASKREMAILRRLA-KGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDLKGNFDIE 556 (756) Q Consensus 478 G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~-~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~~~~~Dv~ 556 (756) +. .++...||++|..+.+.....|..++.+|. +++.++..+.+.++.+.. .++.-|+.+.+ -+|. 
T Consensus 372 ~~-r~~~rvTAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g------------~lP~~p~~l~~-~~i~ 437 (559) T protein:vir:95 372 QN-INTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRSFSMMVRKN------------MLPPPPDVMEG-MPLK 437 (559) T ss_pred hc-CCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcC------------CCCCCcccccC-cceE Confidence 11 123346999999999999999999999996 488999999999998752 12223344432 1333 Q ss_pred EecccccHHHHH---H---HHHHHHHHHhhccCCHhH-----HHHHHHHHHhhcCChhHHHHhhhccCCCChhhhhHHHH Q lcl|NC_019423. 557 VDINTAEIDNQK---S---QDLGFMVQTLGNTVDQSI-----TLSLVAKIAELKRMPDLAHELRTWQPQPDPMEEQLKQL 625 (756) Q Consensus 557 V~~g~a~~~~~~---~---q~l~~llq~~~~~~~~~~-----~~~~l~~l~e~~~~~~~~~~l~~~~~q~~p~~~~~~q~ 625 (756) |..-++....++ . .++++.+..++. +.|++ .-.++..+++..|+|. ..++. + ++.+++ T Consensus 438 v~~is~La~aqk~~~~~~i~~~~~~~~~laq-~~Pevld~id~d~~~~~~a~~~Gvp~--~~irs------~--~ev~~~ 506 (559) T protein:vir:95 438 VEYISVMAQAQKSIGLSSLASTVNFIGQLAQ-VKPEALDKLNVDQAIDAFADMSGVSP--TVIVP------Q--EQVEQA 506 (559) T ss_pred EEeecHHHHHHHHHHHHHHHHHHHHHHHHhc-cChhhhhcCCHHHHHHHHHHHhCCch--hhcCC------H--HHHHHH Confidence 333323222222 2 233333333322 34432 2344556666666652 22321 1 111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCch Q lcl|NC_019423. 626 AIQKAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARDMEKQKAQSQGNQNLQITKALTTPTKEGETTP 705 (756) Q Consensus 626 ~~~~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~~~~~~q~~~~~~~~~~~a~~~~~~~~~~~~ 705 (756) .+++++++.+..+++.....++..+..+++.... -.+++++..+-. + . T Consensus 507 rqqr~~~qq~~q~~~~~~~aa~~~~~~~~~~~~~----------------------------~~~l~~~~~~~~-~---~ 554 (559) T protein:vir:95 507 RQQRAQQQQQQQMMAMGMAAAQGVKTLSEAKTSD----------------------------PSVLSAMANAVS-G---Q 554 (559) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhccccccCCC----------------------------hhHHHHHHHhhc-C---c Confidence 1111111100000000000000000000000000 011111111000 0 0 Q ss_pred hhhcc Q lcl|NC_019423. 
706 NISAA 710 (756) Q Consensus 706 ~~~~a 710 (756) +-+.+ T Consensus 555 ~~~~~ 559 (559) T protein:vir:95 555 GGQSQ 559 (559) T ss_pred cccCC Confidence 00000 No 33 >protein:vir:103765 Length: 549 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024925;genbank:gi:48697195;genbank:GeneID:2846089 Probab=99.96 E-value=5.5e-26 Score=159.08 Aligned_cols=512 Identities=14% Similarity=0.042 Sum_probs=287.4 Q ss_pred CCc--hHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccc-----CCCCCCCCCC---CcccCHHHHHHHHHHHHHHHH Q lcl|NC_019423. 22 WKK--EPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKG-----KAKPPKIKGR---SQVQPRLVRRQAEWRYAPLSE 91 (756) Q Consensus 22 ~~~--~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~-----~~~~~~~~gr---S~~v~~~v~~~~e~~~~~L~~ 91 (756) |++ +.+...|+..++..++.+++.++.|++-.+|-.... ........|+ +++++..-.+.++.+.+.|+. T Consensus 1 m~~d~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~~~~dstg~~a~~~LAs~l~~ 80 (549) T protein:vir:10 1 MTNDDAKILQALNADHGRMKEKRQSYEAVWNDVIDYLMPRLDKFGQLPRPDSEKGRERSQKMFDSTAPLALRNFVAAMDS 80 (549) T ss_pred CCcchHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccccccccCCCCCCcccccccccccchHHHHHHHHHHHHHh Confidence 543 456678888999999999999999999999965321 1111233443 357888999999999999999 Q ss_pred hhcC-CCCEEEEecCCcchHHHH------HHHHHHHHHHHh-hhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeeeee Q lcl|NC_019423. 92 PFLS-SSKLFKLTPVTFEDELAA------RQNELVLNYQFR-TQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIKTE 163 (756) Q Consensus 92 ~f~~-~~~~~~~~p~~~~D~~~A------~q~t~~~n~~~~-~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~ 163 (756) -+|+ +.+||++.+-..+..+.+ ++.+..+.-++. 
..++-+..++.++++++..|+|++-+-.+ T Consensus 81 ~ltpp~~~wF~l~~~~~~~~e~~~v~~~l~~ve~~~~~~~~~~~snf~~~~~~~~~~L~~~Gta~l~~~~~--------- 151 (549) T protein:vir:10 81 MITPATQLWHRLKTGNDALNEIASVKAYLQGVVRTLFAARYRWQGGFVTQMGATYQSIGLFGPGALMIEHD--------- 151 (549) T ss_pred hccCCCCccccccCCccchhhhhHHHHHHHHHHHHHHHHHhhhhcChHHHHHHHHHHHHhhcceeeEEeec--------- Confidence 9998 789999988654443322 233334433333 24445566788888888888887664110 Q ss_pred eeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEEEec Q lcl|NC_019423. 164 TPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVEMLN 243 (756) Q Consensus 164 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie~V~ 243 (756) ..+.+++..++ T Consensus 152 ---------------------------------------------------------------------~~~~~~f~~~p 162 (549) T protein:vir:10 152 ---------------------------------------------------------------------VGKGIVYRNVP 162 (549) T ss_pred ---------------------------------------------------------------------CCCeeEEEEEE Confidence 01225678888 Q ss_pred hhheEeCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceEEEEEE Q lcl|NC_019423. 244 PNNVVIDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVAYEY 323 (756) Q Consensus 244 p~~~~~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~E~ 323 (756) ..+|++..++...++ -++++..+|...+.+.++. ++|. .... .... .+ +.++|.||.+ T Consensus 163 l~~~~v~~d~~G~vd---~i~r~~~~t~~ql~~~fg~-~~l~---~~v~---~~~~-----------~~-~~~~~~v~~~ 220 (549) T protein:vir:10 163 MQRLWFAENNSGLID---KTHVQWELTLRQAAQRFGR-ENLS---PSMQ---STLE-----------KD-PEKSAIFYHA 220 (549) T ss_pred cCeEEEeeCCCCCeE---EEEEEeecCHHHHHHhcCc-ccCC---HHHH---HHhh-----------cC-CCceEEEEEE Confidence 899999888876554 3788999999999887654 2221 1100 0000 01 1246777765 Q ss_pred EEE-eeccC---CceeE---EEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHHHHHH Q lcl|NC_019423. 
324 WGF-YDIND---DGSLE---PIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGATMRG 396 (756) Q Consensus 324 w~k-~d~~~---~g~~~---~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~~~~~ 396 (756) =+. .+.+. ++.-. .++....++++++ ++.| .++||++..|...++..||.|++....+-.+.+|.+.+. T Consensus 221 V~pr~~~~~~~~~~~~~pf~sv~~e~~~~~il~--esg~--~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~ 296 (549) T protein:vir:10 221 VEPRADRDPRKLDGRNMQFASYWLDEGRDRIVQ--NSGF--RTFPFAIGRFYVGTDDVYGGSPAYDAMPDVRMANDMAKT 296 (549) T ss_pred eecCCCCCccccccccCceEEEEEEecCCEeec--cCCc--ccCCcceeeeeecCCCccccchHHHHHHHHHHHHHHHHH Confidence 322 11110 11111 1122234566665 4445 469999999999999999999999999999999999999 Q ss_pred HHHHHHhhcCCceEeeccccCccchhhhhccccccccccccccccccccccCCCcchHHHHHHHHHHHHHHHHhchhHHh Q lcl|NC_019423. 397 MIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQGNPSQSIMEHKFPELPQSAIVMTQMQNQEAESLTGVKAFS 476 (756) Q Consensus 397 ~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~ 476 (756) .+....++.+|.++++.+.+... .....+....+ ..+......+.++..+.-.+....+++.+.+.+...-=+..+. T Consensus 297 ~l~~~~~~~~p~~~v~~~g~~~~--~~l~pgg~~~~-~~~~~~~~~~~pl~~~~~~~~~~~~i~~~~~rI~~af~~d~~~ 373 (549) T protein:vir:10 297 NIRGAQKLVDPPLLANEDGVLDG--FDLRSGALNWG-GLNDKGEEMVKPLLTGKQAQIGIEFAQDTRQTINQWFYVTLFQ 373 (549) T ss_pred HHHHHHHHhcCceeecccccccc--ceeccCCcccc-ccCCCCccceeeeccccchhHHHHHHHHHHHHHHHHHhhhhhh Confidence 99999999999999987644321 12222221111 1111222334444333333445555666666665543222222 Q ss_pred cCCCccccchhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHhc-Ccce Q lcl|NC_019423. 477 GGVTGSAYGDVAAGIRGALDAASKREMAILRRLA-KGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDLK-GNFD 554 (756) Q Consensus 477 ~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~-~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~~-~~~D 554 (756) +-. ++...||++|..+.+.....|..+.-+|. +++.++..+.+.++.+.. .++--|+++. 
...+ T Consensus 374 ~~~--~~~~~TAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~R~~~il~r~g------------~lP~~p~~l~~~~~~ 439 (549) T protein:vir:10 374 ILV--DSGDMTATEVLQRAQEKGVLLAPTLGRTQSELLGPMIAREVDILAEAG------------QLPDMPQELIDAGAD 439 (549) T ss_pred hhc--CCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcC------------CCCCCChhhhcCCce Confidence 212 33458999999999999999999999997 588999999999988631 2222344432 2334 Q ss_pred EEEecccccHHHHHHHHH---HHHHHHhhc--cCCHhHH-----HHHHHHHHhhcCChhHHHHhhhccCCCChhhhhHHH Q lcl|NC_019423. 555 IEVDINTAEIDNQKSQDL---GFMVQTLGN--TVDQSIT-----LSLVAKIAELKRMPDLAHELRTWQPQPDPMEEQLKQ 624 (756) Q Consensus 555 v~V~~g~a~~~~~~~q~l---~~llq~~~~--~~~~~~~-----~~~l~~l~e~~~~~~~~~~l~~~~~q~~p~~~~~~q 624 (756) +.|..-++....++...+ ...++.+++ .++|+.. -.++..+++..|.|. ..++. +++ .++ T Consensus 440 ~~i~yis~La~aq~~~~~~~i~~~~~~~~~laq~~Pe~ld~id~d~~~~~~a~~~Gvp~--~~irs------~ee--v~~ 509 (549) T protein:vir:10 440 VDVEYDSPLNKAMRAGEGAAILQWLQQLGIVSQFDPAAAKVPNGARIARLLADYGGVPV--EAMST------DEE--LQA 509 (549) T ss_pred eEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhHHhcCCHHHHHHHHHHhcCCCc--cccCC------HHH--HHH Confidence 445443333333333333 333333322 2344322 234455555555552 22221 100 011 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_019423. 625 LAIQKAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARDMEKQKAQSQGNQNLQITKALTTPTKEGE 702 (756) Q Consensus 625 ~~~~~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~~~~~~q~~~~~~~~~~~a~~~~~~~~~ 702 (756) +.++.++++.+...+++.. ...++-.++ .+++++.+.... 
T Consensus 510 ~r~~~~~qqq~~~~~~~a~-----~a~~~a~~~---------------------------------~~~~ta~~~~~~ 549 (549) T protein:vir:10 510 QQAAEAQAAQMQQMLAAAP-----VAAGAIKDL---------------------------------SDAQTAAQTARV 549 (549) T ss_pred HHHHHHHHHHHHHHHHHHH-----HHHHHHHhh---------------------------------hhhcCCCcccCC Confidence 1100000000000000000 000000001 111111111111 No 34 >protein:vir:1785 Length: 555 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570351;genbank:gi:18640510;genbank:GeneID:932723 Probab=99.95 E-value=2.4e-26 Score=161.03 Aligned_cols=525 Identities=11% Similarity=0.063 Sum_probs=279.2 Q ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCC--CCCCCCCcccCHHHHHHHHHHHHHHHHhhcC-CCC Q lcl|NC_019423. 22 WKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAKP--PKIKGRSQVQPRLVRRQAEWRYAPLSEPFLS-SSK 98 (756) Q Consensus 22 ~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~--~~~~grS~~v~~~v~~~~e~~~~~L~~~f~~-~~~ 98 (756) |.. .+++.++..++.+++.++.|++-.+|....-..+. +...-+.++++....+.++.+.+.|+.-+|+ +.+ T Consensus 1 m~~-----~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltpp~~~ 75 (555) T protein:vir:17 1 MKH-----SAQAKYMMLRADREDYLDSGRQSARLTLPYILTDEGHVQGGYLPTPWQSVGSKGVNVLASKLMLSLFPVNTS 75 (555) T ss_pred Chh-----HHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHHhhcCCCCc Confidence 333 47778999999999999999999999864311111 1111235688899999999999999999998 789 Q ss_pred EEEEecCCcchHH-------HHH------HHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeeeeeee Q lcl|NC_019423. 99 LFKLTPVTFEDEL-------AAR------QNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIKTETP 165 (756) Q Consensus 99 ~~~~~p~~~~D~~-------~A~------q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~ 165 (756) ||++.+..++..+ .+. ..+..+...| ..++-+..++.++++++..|++++-+ + + . 
T Consensus 76 WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~l-~~snf~~~~~~~~~~L~~~G~a~ly~--~-~------~-- 143 (555) T protein:vir:17 76 FFKLQINDAEIDNLGMDEQARSEIDLSLSRIERIVTQDI-AESSDRVHLEMAMKHLIVTGNALLYQ--G-K------K-- 143 (555) T ss_pred ccccccCHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHhHCeEEEEe--c-C------C-- Confidence 9999986433111 111 1333333333 35567777888999999999987521 0 0 0 Q ss_pred eeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEEEechh Q lcl|NC_019423. 166 VFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVEMLNPN 245 (756) Q Consensus 166 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie~V~p~ 245 (756) .++.++.. T Consensus 144 ------------------------------------------------------------------------~~~~~pl~ 151 (555) T protein:vir:17 144 ------------------------------------------------------------------------NLKLYPLD 151 (555) T ss_pred ------------------------------------------------------------------------ceeEEEcC Confidence 01123445 Q ss_pred heEeCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccc-cccccccceEEEEEEE Q lcl|NC_019423. 246 NVVIDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDF-QFKDALRKKVVAYEYW 324 (756) Q Consensus 246 ~~~~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~-~~~d~s~~~V~v~E~w 324 (756) +|++..++...++ -++++.++|..+|.+.++...-.+....... ...+....-.... .........+.||.++ T Consensus 152 ~y~v~~d~~G~vd---~v~rk~~~t~~ql~~~fg~~~l~~~~~~~~~---~~~d~~~~~~~~~~~~~~~~~~~~~v~t~~ 225 (555) T protein:vir:17 152 RFVVSRDGEGNVM---EIVTEEQIDRSLLPEEFQKVGGLEGAPDSNA---VGEDGPKMGVTAPGGRDKGKSNDALVYTYV 225 (555) T ss_pred eEEEeeCCCcCee---EEEeeeeecHHHHHHHhhhccccchhhhhhh---ccccchhhhhhhhcccccCCCcceeEeecc Confidence 6777777655544 4788999999999887654321111111110 0001000000000 0111122346666655 Q ss_pred EEeeccCCceeEEEEEEEECCEEEE--ecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 
325 GFYDINDDGSLEPIVATWIGSTLIR--MENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGATMRGMIDLLG 402 (756) Q Consensus 325 ~k~d~~~~g~~~~~~~~~~g~~~L~--~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~ 402 (756) .+. ++... +..-+++..+. ..++|| .+|||++..|...++..||.|++....+-.+.+|.+.+..+.... T Consensus 226 ~~~----~~~~~--~~~e~~~~~v~~~l~e~g~--~e~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~ 297 (555) T protein:vir:17 226 CRK----DGQVK--WHQECDGKVIPGSNSSAPY--THNPWIPLRFNIVDGEAYGRGRVEEFMGDLKSLEALSQAMVEGSA 297 (555) T ss_pred ccc----CCeeE--EEEecCceeccccccccCc--ccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHH Confidence 432 22211 22223444432 356666 479999999999999999999999999999999999999999999 Q ss_pred hhcCCceEeeccccCccchhhhhccccccccccccccccccccccCCC--cchHHHHHHHHHHHHHHHHhchhHHhcCCC Q lcl|NC_019423. 403 RSANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQGNPSQSIMEHKFPE--LPQSAIVMTQMQNQEAESLTGVKAFSGGVT 480 (756) Q Consensus 403 ~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~--~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~ 480 (756) ++.+|.++++.+.+..... ...+.. +.+..+....+.+++... --+.....++.+.+.+.+.. +... T Consensus 298 ~~~~pp~lv~~~g~~~~~~--l~~~~~---g~v~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~aF------m~~~ 366 (555) T protein:vir:17 298 ASAKVVFMVSPSATTKPQN--LALAAN---GAIIQGRPDDVSVVQANKAADFRTVLEMIQKLEQRISDAF------LMLQ 366 (555) T ss_pred HHhCCceeeccccccCcce--eecCCC---ceeecCCcccceeeeccccchhhHHHHHHHHHHHHHHHHH------hhcC Confidence 9999999997765543221 122221 112222233455554332 12333445555555554432 2211 Q ss_pred -ccccchhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHhcCcceEEEe Q lcl|NC_019423. 481 -GSAYGDVAAGIRGALDAASKREMAILRRLA-KGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDLKGNFDIEVD 558 (756) Q Consensus 481 -~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~-~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~~~~~Dv~V~ 558 (756) .++...||++|..+.+.....|..++.+|. +++.++.++.+.++.+..- ++--|++.. .+.+. 
T Consensus 367 ~~d~~r~TAtEV~~r~~E~~~~LGpv~~rl~~E~L~Pli~R~~~il~r~g~------------lP~~p~~~v---~~~i~ 431 (555) T protein:vir:17 367 VRQSERTTATEVQATVQELNEQIGGIYSNLTTELLQPYLARKLHLLQKQRK------------LPQLPKDLV---QPTVV 431 (555) T ss_pred CCCcccchHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhCCC------------CCCCCHhhh---cccee Confidence 233357999999999999999999999997 5889999999999987532 211122222 13333 Q ss_pred ccccc-HHHHHHHHHHHHHHHhhccCC-HhHH-----HHHHHHHHhhcCChhHHHHhhhccCCCChhhhhHHHHHHHHHH Q lcl|NC_019423. 559 INTAE-IDNQKSQDLGFMVQTLGNTVD-QSIT-----LSLVAKIAELKRMPDLAHELRTWQPQPDPMEEQLKQLAIQKAQ 631 (756) Q Consensus 559 ~g~a~-~~~~~~q~l~~llq~~~~~~~-~~~~-----~~~l~~l~e~~~~~~~~~~l~~~~~q~~p~~~~~~q~~~~~aq 631 (756) ++... ..+...+.++..++.++...+ |... -.++..+++..|++ ....++ .+.+. +++.+++.+ T Consensus 432 ~~l~~l~r~~~~~~l~~~~~~laq~~~~p~~~d~id~d~~~~~~a~~~Gv~-p~~ivr------s~eev--~~~rq~~~~ 502 (555) T protein:vir:17 432 AGLWGVGRGQDKQQLMEFITTLAQTMGPEIAMKYINPTEFIKRLAAAQGID-TLQLIN------SPETM--KQLGDQQKQ 502 (555) T ss_pred ehHHHHHHHHHHHHHHHHHHHHHhhcCchhHhhcCCHHHHHHHHHHHcCCC-hhhhcC------CHHHH--HHHHHHHHH Confidence 44332 233344445555555443322 2211 12333444444441 011111 11111 111111000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCchhhhcc Q lcl|NC_019423. 
632 LENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARDMEKQKAQSQGNQNLQITKALTTPTKEGETTPNISAA 710 (756) Q Consensus 632 ~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~~~~~~q~~~~~~~~~~~a~~~~~~~~~~~~~~~~a 710 (756) + +++.+...+.++.+.+ ...+ +.+ ..+...++ .+.....+ -.++.+|+++..| T Consensus 503 ~-----~~q~~~~~qa~~~~~~----~~~~-----~~~---~~~~~~~~--~a~~~~~a-------~~~~~~~~~~~~~ 555 (555) T protein:vir:17 503 D-----MVQASLINQAGQLAKT----PMAE-----QAM---QLIQQQQE--GAQDAGAA-------ESETSSAEAQAGA 555 (555) T ss_pred H-----HHHHHHHHHHHHHHhh----hhhh-----hHH---hccccchh--hhhHHHHH-------HhhcCCcccccCC Confidence 0 0000000000000000 0000 000 01111111 11111111 1234556666555 No 35 >protein:vir:1538 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052106;swissprot:trembl:q9t110;genbank:gi:9634032;uniprot:Q9T110;genbank:GeneID:1262384 Probab=99.95 E-value=5.3e-26 Score=159.17 Aligned_cols=510 Identities=10% Similarity=0.019 Sum_probs=284.8 Q ss_pred CCcccCCCCCCCccccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCCC--CCCcccCHHH Q lcl|NC_019423. 1 MEHQDTFKPLPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAKPPKIK--GRSQVQPRLV 78 (756) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~--grS~~v~~~v 78 (756) |.+ +++++|- -..+++.++..++.+++.++.|++-.+|....-..+.-... -..++++... T Consensus 1 m~~----------~~~~~~~-------~~~~k~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~ 63 (535) T protein:vir:15 1 MAD----------SKRTGLG-------EDGAKATYDRLTNDRRAYETRAENCAQYTIPSLFPKESDNESTDYTTPWQAVG 63 (535) T ss_pred CCc----------cchhccc-------hHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccH Confidence 221 2233332 23456789999999999999999999998643221111111 2245788889 Q ss_pred HHHHHHHHHHHHHhhcCCCCEEEEecCCc-------chHH------HHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcC Q lcl|NC_019423. 
79 RRQAEWRYAPLSEPFLSSSKLFKLTPVTF-------EDEL------AARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDG 145 (756) Q Consensus 79 ~~~~e~~~~~L~~~f~~~~~~~~~~p~~~-------~D~~------~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g 145 (756) .+.++.+.+.|+.-+|.+.+||++.+-.. ++.+ --+..+..+.-.| ..++-+..++.++++++..| T Consensus 64 ~~a~~~Laa~l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l-~~snf~~~~~~~~~~L~~~G 142 (535) T protein:vir:15 64 ARGLNNLASKLMLALFPMQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYI-ESNSYRVTLFECLKQLIVAG 142 (535) T ss_pred HHHHHHHHHHHHHhhcCCCcccccccChHHHhccCCCcchHHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHhhC Confidence 99999999999999999899999987432 1111 1123334444334 35667777899999999999 Q ss_pred ceEEEEeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCcee Q lcl|NC_019423. 146 TGIARIGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVT 225 (756) Q Consensus 146 ~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~ 225 (756) +|++.+-++. T Consensus 143 ~a~l~~~~~~---------------------------------------------------------------------- 152 (535) T protein:vir:15 143 NALLYLPEPE---------------------------------------------------------------------- 152 (535) T ss_pred ceeEEeecCC---------------------------------------------------------------------- Confidence 9987752210 Q ss_pred EEEeeeeecCceeEEEechhheEeCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhcccc Q lcl|NC_019423. 226 EVEVEKALVNRPTVEMLNPNNVVIDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTP 305 (756) Q Consensus 226 ~~~~~~~~~g~~~ie~V~p~~~~~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~ 305 (756) .+.+++..++..+|++..++...++ -++++..+|..+|.+..... +. . . 
T Consensus 153 --------~~~~~f~~~pl~~~~v~~d~~G~vd---~i~r~~~~t~~~l~~~~~~~--~~----------~--~------ 201 (535) T protein:vir:15 153 --------GSYNPMKLYRLSSYVVQRDAYGNVL---QIVTRDQIAFGALPEDVRSA--VE----------K--A------ 201 (535) T ss_pred --------CCceeeEEEEcCeeEEeeCCCCCee---EEEEeEeecHHHHHHHHhHh--hh----------c--c------ Confidence 0123456677788998888766555 47889999988875432210 00 0 0 Q ss_pred ccccccccccceEEEEEEEEEeeccCCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHH Q lcl|NC_019423. 306 SDFQFKDALRKKVVAYEYWGFYDINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGD 385 (756) Q Consensus 306 ~~~~~~d~s~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d 385 (756) .......++|.||++.++.. +++...++ ..+.+..+...++-|+.+.+||++..|...++..||+|.+....+ T Consensus 202 ---~~~~~~~~~v~v~~~v~~~~--~~~~~~~~--~e~~g~~~~~~~~~~~~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~ 274 (535) T protein:vir:15 202 ---GGEKKMDEMVDVYTHVYLDE--ESGDYLKY--EEVEDVEIDGSDATYPTDAMPYIPVRMVRIDGESYGRSYCEEYLG 274 (535) T ss_pred ---ccccCCCCceeEEEEEEEec--CCCcEEEE--EEeeCccccccccccccccCCceeeeeeecCCCccccchHHHHHH Confidence 00011224688888765421 22322222 233344444444556778899999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhhhhccccccccccccccccccccccCC--CcchHHHHHHHHHH Q lcl|NC_019423. 386 NQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQGNPSQSIMEHKFP--ELPQSAIVMTQMQN 463 (756) Q Consensus 386 ~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~--~~~~~~~~~l~~~~ 463 (756) -.+.+|.+.+..+....++.+|.++++.+.+..... ...+.... +..+..+.+.+++.. .-.+.....++... 
T Consensus 275 D~k~L~~l~~~~l~~~~~~~~p~~lv~~~g~~~~~~--l~~~~~g~---~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~ 349 (535) T protein:vir:15 275 DLRSLENLQEAIVKMSMISAKVIGLVNPAGITQPRR--LTKAQTGD---FVPGRREDIDFLQLEKQADFTVAKAVSDQIE 349 (535) T ss_pred HHHHHHHHHHHHHHHHHHHhcCceeecccccccchh--cccCCcee---eecCCcccceeeecccccchhHHHHHHHHHH Confidence 999999999999999999999999997666543221 12222111 111222334444322 22344556666666 Q ss_pred HHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhCCCCcEEEEecCcee Q lcl|NC_019423. 464 QEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAK-GMADIGTKICAMNAVFLSEKEVVRITNEQYV 542 (756) Q Consensus 464 ~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~-~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v 542 (756) +.+.... ..+. ++.. ++...||++|..+.+.....|..++.+|.. ++.++.++.+.++.+..-=+ T Consensus 350 ~~I~~af-~~~~-~~~~-~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP----------- 415 (535) T protein:vir:15 350 ARLSYAF-MLNS-AVQR-TGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATSQIP----------- 415 (535) T ss_pred HHHHHHH-hhhh-cccC-CCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCC----------- Confidence 6665543 2222 2212 223479999999999999999999999985 88999999999997642111 Q ss_pred ecCHhHhcCcceEEEecccccH-HHHHHHHHHHHHHHhhccCCHhH------HHHHHHHHHhhcCChhHHHHhhhccCCC Q lcl|NC_019423. 543 EIKREDLKGNFDIEVDINTAEI-DNQKSQDLGFMVQTLGNTVDQSI------TLSLVAKIAELKRMPDLAHELRTWQPQP 615 (756) Q Consensus 543 ~i~~d~~~~~~Dv~V~~g~a~~-~~~~~q~l~~llq~~~~~~~~~~------~~~~l~~l~e~~~~~~~~~~l~~~~~q~ 615 (756) ++. + ..+.+.+..+.+.. .....+.+...++.++. +.|.. .-.++..+++..|.|-. 
..++ T Consensus 416 ~~p-~---~~v~~~yis~La~aqr~~~~~~l~~~~~~la~-~~P~~ld~~id~d~~~~~~a~~~Gvp~~-~i~~------ 483 (535) T protein:vir:15 416 ELP-K---EAVEPTISTGLEAIGRGQDLDKLERCISAWAA-LAPMQGDPDINLAVIKLRIANAIGIDTS-GILL------ 483 (535) T ss_pred CCC-c---cceeEEEecHHHHHHHHHHHHHHHHHHHHHHh-cChhhhhccCCHHHHHHHHHHHcCCChh-hhcC------ Confidence 111 1 22445554444432 22233344444444433 22221 22344555555555411 0111 Q ss_pred ChhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 616 DPMEEQLKQLAIQKAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARDMEKQKAQSQGNQNLQITKALT 695 (756) Q Consensus 616 ~p~~~~~~q~~~~~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~~~~~~q~~~~~~~~~~~a~~ 695 (756) +++ +.++.++++. ++++..++.+ + + -+..+ ++. .. .-+.+++++ T Consensus 484 ~~e-----ev~~~~~q~~-----~~~~~~~~a~-----~-----~---g~~~~-~~~-~~-----------~p~~~~~~~ 527 (535) T protein:vir:15 484 TDE-----QKQALMMQDA-----AQTGIENAAA-----T-----G---GAGVG-ALA-TS-----------SPEAMQGAA 527 (535) T ss_pred CHH-----HHHHHHHHHH-----HHHHHHHHHH-----H-----H---Hhhcc-chh-cc-----------ChHHHHHHH Confidence 111 1111110000 0000000000 0 0 00000 000 00 011122223 Q ss_pred HHhhccCC Q lcl|NC_019423. 696 TPTKEGET 703 (756) Q Consensus 696 ~~~~~~~~ 703 (756) +.-+..++ T Consensus 528 ~~~g~~~~ 535 (535) T protein:vir:15 528 AQAGLDAT 535 (535) T ss_pred hccCCCCC Confidence 32222222 No 36 >protein:vir:3361 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523332;genbank:gi:17570823;genbank:GeneID:927409 Probab=99.95 E-value=6.6e-26 Score=158.64 Aligned_cols=510 Identities=10% Similarity=0.019 Sum_probs=282.7 Q ss_pred CCcccCCCCCCCccccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCC--CCCCcccCHHH Q lcl|NC_019423. 1 MEHQDTFKPLPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAKPPKI--KGRSQVQPRLV 78 (756) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~--~grS~~v~~~v 78 (756) |. 
.+++++|- -..+++.++..++.+++.++.|++-.+|....-..+.-.. ..+.++++... T Consensus 1 m~----------~~~~~~~~-------~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~ 63 (535) T protein:vir:33 1 MA----------DSKRTGLG-------EDGAKATYDRLTNDRRAYETRAENCAQYTIPSLFPKESDNESTDYTTPWQAVG 63 (535) T ss_pred CC----------hhhhhccC-------hhHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccH Confidence 11 12233331 2345678999999999999999999999865322111111 22245788888 Q ss_pred HHHHHHHHHHHHHhhcCCCCEEEEecCCcc-------hHHHH------HHHHHHHHHHHhhhcCCcchHHHHHHHHhhcC Q lcl|NC_019423. 79 RRQAEWRYAPLSEPFLSSSKLFKLTPVTFE-------DELAA------RQNELVLNYQFRTQLNKVKLVDDYVHSIVDDG 145 (756) Q Consensus 79 ~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~-------D~~~A------~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g 145 (756) .+.++.+.+.|+.-+|.+.+||++.+-.++ +.+.+ +..+..+.-.| ..++-+..++.++++++..| T Consensus 64 ~~a~~~Laa~l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~~-~~snf~~~~~~~~~~L~~~G 142 (535) T protein:vir:33 64 ARGLNNLASKLMLALFPMQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYI-ESNSYRVTLFECLKQLIVAG 142 (535) T ss_pred HHHHHHHHHHHHHhhcCCCcccccccChHHHhccccCcchHHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHhhC Confidence 999999999999999988999999875321 11111 23333443333 45667777889999999999 Q ss_pred ceEEEEeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCcee Q lcl|NC_019423. 146 TGIARIGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVT 225 (756) Q Consensus 146 ~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~ 225 (756) +|++.+-++. T Consensus 143 ~a~l~~~~~~---------------------------------------------------------------------- 152 (535) T protein:vir:33 143 NALLYLPEPE---------------------------------------------------------------------- 152 (535) T ss_pred ceeEEeecCC---------------------------------------------------------------------- Confidence 9987752210 Q ss_pred EEEeeeeecCceeEEEechhheEeCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhcccc Q lcl|NC_019423. 
226 EVEVEKALVNRPTVEMLNPNNVVIDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTP 305 (756) Q Consensus 226 ~~~~~~~~~g~~~ie~V~p~~~~~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~ 305 (756) .+.+++..++..+|++..++...++ -++++..+|..+|.+.++.. .+. .... T Consensus 153 --------~~~~~f~~~pl~~~~v~~d~~G~vd---~i~r~~~~t~~ql~~~~~~~-~~~-------------~~~~--- 204 (535) T protein:vir:33 153 --------GSYNPMKLYRLSSYVVQRDAYGNVL---QIVTRDQIAFGALPEDVRSA-VEK-------------SGGE--- 204 (535) T ss_pred --------CCceeeEEEEcCeeEEeeCCCCCee---EEEeeEeecHHHHHHHhhhh-hcc-------------cccc--- Confidence 0124466677788999888766555 37889999999886554321 000 0000 Q ss_pred ccccccccccceEEEEEEEEEeeccCCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHH Q lcl|NC_019423. 306 SDFQFKDALRKKVVAYEYWGFYDINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGD 385 (756) Q Consensus 306 ~~~~~~d~s~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d 385 (756) ....+.+.||.|.++ +. +++...++ ..+.+..+....+-|+.+.+||++..|...++..||+|.+....+ T Consensus 205 ------k~~~~~~~v~~~v~~-~~-~~~~~~~~--~~~~~~~~~~~~~~~~~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~ 274 (535) T protein:vir:33 205 ------KKMDEMVDVYTHVYL-DE-ESGDYLKY--EEVEDVEIDGSDATYPTDAMPYIPVRMVRIDGESYGRSYCEEYLG 274 (535) T ss_pred ------cccccCCeEEEEEEe-eC-CCCcEEEE--EEEeCccccccccccccccCCceeeeeeecCCCccccchHHHHHH Confidence 011124666766543 22 22322222 344454554455667778899999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhhhhccccccccccccccccccccccCC--CcchHHHHHHHHHH Q lcl|NC_019423. 386 NQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQGNPSQSIMEHKFP--ELPQSAIVMTQMQN 463 (756) Q Consensus 386 ~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~--~~~~~~~~~l~~~~ 463 (756) -.+.+|.+.+..+....++.+|.++++.+.+..... ...+.... +..+..+.+.+++.. .-.+.....++... 
T Consensus 275 D~k~L~~l~~~~l~~~~~~~~p~~lv~~~g~~~~~~--~~~~~~g~---~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~ 349 (535) T protein:vir:33 275 DLRSLENLQEAIVKMSMISAKVIGLVNPAGITQPRR--LTKAQTGD---FVPGRREDIDFLQLEKQADFTVAKAVSDQIE 349 (535) T ss_pred HHHHHHHHHHHHHHHHHHHhcCceeeccccccchhh--cccCCcee---eecCCcccceeeecccccchhHHHHHHHHHH Confidence 999999999999999999999999998766543221 12222211 111222334444322 22344556666666 Q ss_pred HHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhCCCCcEEEEecCcee Q lcl|NC_019423. 464 QEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAK-GMADIGTKICAMNAVFLSEKEVVRITNEQYV 542 (756) Q Consensus 464 ~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~-~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v 542 (756) +.+.... ..+. ++.. ++...||++|..+.+.....|..++.+|.. ++.++.++++.++.+..-=+ T Consensus 350 ~~I~~af-~~~~-~~~~-~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP----------- 415 (535) T protein:vir:33 350 ARLSYAF-MLNS-AVQR-TGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATSQIP----------- 415 (535) T ss_pred HHHHHHH-hhhh-cccC-CCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCC----------- Confidence 6665543 2222 2212 223479999999999999999999999985 88999999999997642111 Q ss_pred ecCHhHhcCcceEEEecccccH-HHHHHHHHHHHHHHhhccCCHhH------HHHHHHHHHhhcCChhHHHHhhhccCCC Q lcl|NC_019423. 543 EIKREDLKGNFDIEVDINTAEI-DNQKSQDLGFMVQTLGNTVDQSI------TLSLVAKIAELKRMPDLAHELRTWQPQP 615 (756) Q Consensus 543 ~i~~d~~~~~~Dv~V~~g~a~~-~~~~~q~l~~llq~~~~~~~~~~------~~~~l~~l~e~~~~~~~~~~l~~~~~q~ 615 (756) ++.. ..+.+.+..+.+.. .....+.+...++.++. +.|.. .-.++..+++..|.|-. 
..++ T Consensus 416 ~~p~----~~v~~~yis~La~aqr~~~~~~l~~~~~~la~-~~P~~~d~~id~d~~~~~~a~~~Gvp~~-~i~~------ 483 (535) T protein:vir:33 416 ELPK----EAVEPTISTGLEAIGRGQDLDKLERCISAWAA-LAPMQGDPDINLAVIKLRIANAIGIDTS-GILL------ 483 (535) T ss_pred CCCc----cceeEEEecHHHHHHHHHHHHHHHHHHHHHHh-hChhhhhccCCHHHHHHHHHHHcCCCHh-HhcC------ Confidence 1111 22445554444432 22233344444444433 22221 22344555555555411 0111 Q ss_pred ChhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 616 DPMEEQLKQLAIQKAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARDMEKQKAQSQGNQNLQITKALT 695 (756) Q Consensus 616 ~p~~~~~~q~~~~~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~~~~~~q~~~~~~~~~~~a~~ 695 (756) ++++. ++..++ ++++.+++++.. +....+.+ ++ ... .+..++++ T Consensus 484 ~~ee~--~~~~~q-~~~~~~~~~~~~----~~g~~~~~-----------------~~--~~~----------~~~~~~~~ 527 (535) T protein:vir:33 484 TDEQK--QALMMQ-DAAQTGVENAAA----AGGAGVGA-----------------LA--TSS----------PEAMQGAA 527 (535) T ss_pred CHHHH--HHHHHH-HHHHHHHHHHHH----hhhhhhcc-----------------hh--hcC----------ChhHHHHH Confidence 11110 011000 000000000000 00000000 00 000 01111111 Q ss_pred HHhhccCC Q lcl|NC_019423. 696 TPTKEGET 703 (756) Q Consensus 696 ~~~~~~~~ 703 (756) +.-+-..+ T Consensus 528 ~~~g~~~~ 535 (535) T protein:vir:33 528 AKAGLNAT 535 (535) T ss_pred HhccCCCC Confidence 11111111 No 37 >protein:vir:10447 Length: 536 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848294;genbank:gi:30387485;genbank:GeneID:1733984 Probab=99.95 E-value=2.5e-25 Score=155.50 Aligned_cols=508 Identities=11% Similarity=0.020 Sum_probs=282.5 Q ss_pred CCchH---HHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCC--CCCCCCCCcccCHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_019423. 22 WKKEP---SIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAK--PPKIKGRSQVQPRLVRRQAEWRYAPLSEPFLSS 96 (756) Q Consensus 22 ~~~~~---~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~--~~~~~grS~~v~~~v~~~~e~~~~~L~~~f~~~ 96 (756) |+++. 
....|+..++..++.+++.++.|++-.+|.......+ .+...-+.++++....+.++.+.+.|+.-+|.+ T Consensus 1 m~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~ 80 (536) T protein:vir:10 1 MAEKRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYQTPWQAVGARGLNNLASKLMLALFPM 80 (536) T ss_pred CcchhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHhhhcCC Confidence 66644 4567888899999999999999999999986432211 111222346888899999999999999999988 Q ss_pred CCEEEEecCCcch-------HHH------HHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeeeee Q lcl|NC_019423. 97 SKLFKLTPVTFED-------ELA------ARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIKTE 163 (756) Q Consensus 97 ~~~~~~~p~~~~D-------~~~------A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~ 163 (756) .+||++.+..++- ... -+..+..+.-.| ..++-+..++.++++++..|+|++-+ + + . T Consensus 81 ~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l-~~snf~~~~~~~~~~L~~~G~a~ly~--~-e------~ 150 (536) T protein:vir:10 81 QTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYI-ESNSYRVTLFEALKQLVVAGNVLLYL--P-E------P 150 (536) T ss_pred CcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHhHCcEeEEE--e-e------C Confidence 8999998754331 111 222344443333 35666777888999998889887653 1 0 0 Q ss_pred eeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEEEec Q lcl|NC_019423. 164 TPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVEMLN 243 (756) Q Consensus 164 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie~V~ 243 (756) ...+...++.++ T Consensus 151 --------------------------------------------------------------------~~~~~~~~~~~p 162 (536) T protein:vir:10 151 --------------------------------------------------------------------EGSNYNPMKLYR 162 (536) T ss_pred --------------------------------------------------------------------CCCceeeEEEEE Confidence 000112356677 Q ss_pred hhheEeCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceEEEEEE Q lcl|NC_019423. 
244 PNNVVIDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVAYEY 323 (756) Q Consensus 244 p~~~~~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~E~ 323 (756) ..+|++..++...++ -++++..+|..+|.+.++.. .+... .. ....++|.||++ T Consensus 163 l~~~~v~~d~~G~vd---~i~r~~~~t~~~l~~~fg~~-~~~~~-------------~~---------~~~~~~v~v~~~ 216 (536) T protein:vir:10 163 LSSYVVQRDAFGNVL---QMVTRDQIAFGALPEDIRKA-VEGQG-------------GE---------KKADETIDVYTH 216 (536) T ss_pred cCeEEEeeCCCCCee---EEeeeeeccHHHHHHhhhhh-hcccc-------------cc---------cCcccceEEEEE Confidence 788888888766555 36889999999887665421 11000 00 011236888887 Q ss_pred EEEeeccCCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019423. 324 WGFYDINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGATMRGMIDLLGR 403 (756) Q Consensus 324 w~k~d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~ 403 (756) -++.+ +++...+ +. .+.+..+.-+...|+...+||++..|...++..||.|++....+-.+.+|.+.+..+..... T Consensus 217 V~~~~--~~~~~~~-~~-e~~g~~v~~~~g~~~f~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~ 292 (536) T protein:vir:10 217 IYLDE--ASGEYLR-YE-EVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMI 292 (536) T ss_pred EEEec--CCCcEEE-EE-eecCccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 55432 2222222 22 33444333344556668899999999999999999999999999999999999999999999 Q ss_pred hcCCceEeeccccCccchhhhhcccccccccccccccccccccc--CCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCc Q lcl|NC_019423. 404 SANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQGNPSQSIMEHK--FPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTG 481 (756) Q Consensus 404 ~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~--~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~ 481 (756) +.++.++++.+.+..... ...+....+ ..+....+.+.+ ...--+.....++.+.+.+....=+. +++.. 
T Consensus 293 a~~~~~lv~p~g~~~~~~--~~~~~~g~~---v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~--~l~~~- 364 (536) T protein:vir:10 293 SSKVIGLVNPAGITQPRR--LTKAQTGDF---VTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLN--SAVQR- 364 (536) T ss_pred HhcCCcccCcccccchhh--hccCCCcce---ecCCcccceeeeccccccchHHHHHHHHHHHHHHHHHhhh--hcccC- Confidence 999999998766633222 111111111 111112233332 22233445666777777776654222 12211 Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHhcCcceEEEecc Q lcl|NC_019423. 482 SAYGDVAAGIRGALDAASKREMAILRRLAK-GMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDLKGNFDIEVDIN 560 (756) Q Consensus 482 ~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~-~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~~~~~Dv~V~~g 560 (756) ++...||++|..+.+.....|..++.+|.. ++.++.++++.++.+.. .+-++..+. +.+.+..+ T Consensus 365 ~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g-----------~lP~~p~~~----v~~~~vs~ 429 (536) T protein:vir:10 365 TGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQ-----------QIPELPKEA----VEPTISTG 429 (536) T ss_pred CCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhCC-----------CCCCCChhh----ccceEEec Confidence 233479999999999999999999999975 88999999999986531 111222222 22333334 Q ss_pred cccH-HHHHHHHHHHHHHHhhccCCHh------HHHHHHHHHHhhcCChhHHHHhhhccCCCChhhhhHHHHHHHHHHHH Q lcl|NC_019423. 561 TAEI-DNQKSQDLGFMVQTLGNTVDQS------ITLSLVAKIAELKRMPDLAHELRTWQPQPDPMEEQLKQLAIQKAQLE 633 (756) Q Consensus 561 ~a~~-~~~~~q~l~~llq~~~~~~~~~------~~~~~l~~l~e~~~~~~~~~~l~~~~~q~~p~~~~~~q~~~~~aq~e 633 (756) .+.. +....+.++..++.++. +.|. +...++..+++..|+ +....++. +. 
+.+++.+++++ + T Consensus 430 l~~l~r~~~~~~l~~~~~~la~-~~P~~ld~~id~d~~~~~~a~~~Gv-~p~~~irt------~e--ev~~~r~q~~~-~ 498 (536) T protein:vir:10 430 LEAIGRGQDLDKLERCVTAWAA-LAPMRDDPDINLAMIKLRIANAIGI-DTSGILLT------EE--QKQQKMAQQSM-Q 498 (536) T ss_pred HHHHHHHHHHHHHHHHHHHHHh-hchhhhcccCCHHHHHHHHHHHcCC-CchhhcCC------HH--HHHHHHHHHHH-H Confidence 4332 33333444444444332 2221 223344555555555 11122221 11 01111100000 0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCchhh Q lcl|NC_019423. 634 NEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARDMEKQKAQSQGNQNLQITKALTTPTKEGETTPNI 707 (756) Q Consensus 634 ~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~~~~~~q~~~~~~~~~~~a~~~~~~~~~~~~~~ 707 (756) .+..++.+++.++ +++++. .-.+.+.+..+ .+-..|++ T Consensus 499 ~~~~~~a~~~~~~----~~~~~~-----------------------------~~~~~~~~~~~---~~g~~~~~ 536 (536) T protein:vir:10 499 MGMDNGAAALAQG----MAAQAT-----------------------------ASPEAMAAAAD---SVGLQPGI 536 (536) T ss_pred HHHHHHHHHHHHH----HHHHHh-----------------------------cCchhHHhhhh---ccccCCCC Confidence 0000000000000 000000 00000110010 01111122 No 38 >protein:vir:99672 Length: 532 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249587;genbank:gi:68299738;genbank:GeneID:3799987 Probab=99.95 E-value=3.8e-25 Score=154.45 Aligned_cols=511 Identities=10% Similarity=0.012 Sum_probs=277.2 Q ss_pred CCcccCCCCCCCccccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCCCCC---CcccCHH Q lcl|NC_019423. 1 MEHQDTFKPLPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAKPPKIKGR---SQVQPRL 77 (756) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~gr---S~~v~~~ 77 (756) |.. +++..+ ....+++.++..++.+++.++.|++-.+|..... .......|+ .++++.. 
T Consensus 1 m~~----------~~~~~~-------~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~-~~~~~~~~~~~~~~~~dst 62 (532) T protein:vir:99 1 MAE----------VEKTGF-------AADGAAAAYNRLKNDRGAYETRAEDCATYTIPSV-FPSATADGSTSYTTPWQSI 62 (532) T ss_pred Ccc----------hhhccc-------cHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhhcc-cCCCCCcchhhccccccch Confidence 211 122222 1345777899999999999999999999987532 222233333 4688889 Q ss_pred HHHHHHHHHHHHHHhhcC-CCCEEEEecCCcch-------HHHH------HHHHHHHHHHHhhhcCCcchHHHHHHHHhh Q lcl|NC_019423. 78 VRRQAEWRYAPLSEPFLS-SSKLFKLTPVTFED-------ELAA------RQNELVLNYQFRTQLNKVKLVDDYVHSIVD 143 (756) Q Consensus 78 v~~~~e~~~~~L~~~f~~-~~~~~~~~p~~~~D-------~~~A------~q~t~~~n~~~~~~~~~~~~~~~~v~~al~ 143 (756) ..+.++.+.+.|+.-+|+ +.+||++.+-.++. ++.+ +..+..+.-.| ..++-+..++.++++++. T Consensus 63 ~~~a~~~LAa~L~~~ltpp~~~WF~l~~~d~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~~-~~snf~~~~~~~~~~L~~ 141 (532) T protein:vir:99 63 GARGLNNLASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYM-ESNSFRPTLHAAIKQLLV 141 (532) T ss_pred HHHHHHHHHHHHHHhhcCCCCccccccCCHHHHhccCCChhhHHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHh Confidence 999999999999999998 69999999853321 1111 12333333333 456677779999999999 Q ss_pred cCceEEEEeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCc Q lcl|NC_019423. 144 DGTGIARIGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTG 223 (756) Q Consensus 144 ~g~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g 223 (756) .|+|++=+.++.+. T Consensus 142 ~G~a~l~~~~~~~~------------------------------------------------------------------ 155 (532) T protein:vir:99 142 AGNVLLYIPSTEQV------------------------------------------------------------------ 155 (532) T ss_pred HCcEeEEecccccc------------------------------------------------------------------ Confidence 99998765332110 Q ss_pred eeEEEeeeeecCceeEEEechhheEeCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhcc Q lcl|NC_019423. 
224 VTEVEVEKALVNRPTVEMLNPNNVVIDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESK 303 (756) Q Consensus 224 ~~~~~~~~~~~g~~~ie~V~p~~~~~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 303 (756) ..+...+..++..+|++..++...+++ ++++..++...|-+. +. .. . .+ T Consensus 156 ---------~~~~~~f~~~pl~~y~v~~d~~G~v~~---ivrr~~~~~~~l~e~------~~-----~~-----~---~~ 204 (532) T protein:vir:99 156 ---------EGQSNAPKLYKLHNFVVERDAYDNVLQ---IVTEDKIARAALPED------VR-----KS-----L---ED 204 (532) T ss_pred ---------cCcccceEEEEcCeEEEeeCCCCCeee---EeeeeeecHHhcChH------HH-----HH-----h---hc Confidence 001133556666778887777654442 566677766554211 00 00 0 00 Q ss_pred ccccccccccccceEEEEEEEEEeeccCCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHh Q lcl|NC_019423. 304 TPSDFQFKDALRKKVVAYEYWGFYDINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELL 383 (756) Q Consensus 304 ~~~~~~~~d~s~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~ 383 (756) .. .....-.+|.||++.++.+ ++..+. ...++.+..+...++-|+..++||++..|...++..||.|.+... T Consensus 205 ~~----~~~~p~~~v~v~~~v~~~~-~~~~~~---~~~~~~g~~~~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~ 276 (532) T protein:vir:99 205 AQ----GDQNPSEEVTIYTHVYRDP-EAMVFR---SYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEY 276 (532) T ss_pred cc----cccCCCcceEEEEEEEecC-CCCeeE---EEEeecCceecccccccccccCCceeeeeeecCCCccccchHHHH Confidence 00 0011224688888776532 222222 223444444433445555677999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhhhhccccccccccccccccccccccCC--CcchHHHHHHHH Q lcl|NC_019423. 384 GDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQGNPSQSIMEHKFP--ELPQSAIVMTQM 461 (756) Q Consensus 384 ~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~--~~~~~~~~~l~~ 461 (756) .+-.+.+|.+.+..+.....+.++.++++.+.+..... ...+....+ ..+....+.+.+.. .--+.....++. 
T Consensus 277 l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~--~~~~~~g~~---v~g~~~~i~~~~~~~~~~~~~~~~~i~~ 351 (532) T protein:vir:99 277 LGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRR--VAKANTGDF---VAGRKQDVEVFQLEKYNDFQVAKATADD 351 (532) T ss_pred HHHHHHHHHHHHHHHHHHHHHcCCCceeccccccchhh--hccCCCcce---ecCCcccceeeecccccchhHHHHHHHH Confidence 99999999999999999999999999998766543322 111111111 11222334444322 222344556666 Q ss_pred HHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhCCCCcEEEEecCc Q lcl|NC_019423. 462 QNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAK-GMADIGTKICAMNAVFLSEKEVVRITNEQ 540 (756) Q Consensus 462 ~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~-~~~~l~~~~l~li~q~~~~~r~iRI~g~~ 540 (756) ..+.+.... ..+. +.. -++...||++|..+.+.....|..++.+|.. ++.++.++.+.++.+. | T Consensus 352 ~~~rI~~af-~~~~-~~~-~d~~r~TAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~----------g-- 416 (532) T protein:vir:99 352 IEKRLSYAF-MLNS-AVQ-RGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQAT----------S-- 416 (532) T ss_pred HHHHHHHHH-hhhh-ccc-CCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhc----------C-- Confidence 666665543 2221 111 1233469999999999999999999999975 8899999999999863 1 Q ss_pred eeecCHhHhcCcceEEEecccccHHHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCChhHHHHhhhccCCCCh-hh Q lcl|NC_019423. 541 YVEIKREDLKGNFDIEVDINTAEIDNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMPDLAHELRTWQPQPDP-ME 619 (756) Q Consensus 541 ~v~i~~d~~~~~~Dv~V~~g~a~~~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~~~~~~l~~~~~q~~p-~~ 619 (756) -++.-|++..+. 
++++.++ +....++.+.++..++.++...|+ +++..+...+.+.+-....-+.+ .- T Consensus 417 ~lP~~p~~~~~~-~iv~~is-~Laraq~~~~l~~~~~~laq~~p~---------~~d~id~d~~~~~~a~~~GV~~~~i~ 485 (532) T protein:vir:99 417 KIPNLPKEAVEP-AIATGLE-ALGRGHDLNKLNVFIDYMIKLAGL---------QDDDINLLDVKMRLANSLGMDTTGLI 485 (532) T ss_pred CCCCCChhhccc-ceeecch-HHHHHHHHHHHHHHHHHHHhhcch---------hhhhCCHHHHHHHHHHHhCCChhhcc Confidence 233334444332 3433222 334444555566666665443332 12223333333333222221111 10 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 620 EQLKQLAIQKAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHA 672 (756) Q Consensus 620 ~~~~q~~~~~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~ 672 (756) ...++.++.+++.+.++ +++.+..++.+ ...+ +.+....+++++..+ T Consensus 486 r~~ee~~~~~~q~~~~~-~~~~a~~~~~~--~~~~---~~~~~~~~~~~~~~~ 532 (532) T protein:vir:99 486 LTQQDKQAKMAEASTAA-GMVTAGQQMGA--AGGQ---AAAAMMQQQAGMPTQ 532 (532) T ss_pred CCHHHHHHHHHHHHHHH-HHHHHHHHHHH--HHHH---hcchhHHhhcCCCCC Confidence 00011111110000000 00000000000 0000 000000111111100 No 39 >protein:vir:2198 Length: 536 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041995;swissprot:sw:p03728;genbank:gi:9627467;goa:P03728;uniprot:P03728;genbank:GeneID:1261033 Probab=99.95 E-value=6.6e-25 Score=153.17 Aligned_cols=508 Identities=11% Similarity=0.031 Sum_probs=281.9 Q ss_pred CCchH---HHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCC--CCCCCCCCcccCHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_019423. 22 WKKEP---SIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAK--PPKIKGRSQVQPRLVRRQAEWRYAPLSEPFLSS 96 (756) Q Consensus 22 ~~~~~---~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~--~~~~~grS~~v~~~v~~~~e~~~~~L~~~f~~~ 96 (756) |+++. 
....|+..++..++.+++.++.|++-.+|.......+ .+...-+.++++....+.++.+.+.|+.-+|.+ T Consensus 1 m~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~ 80 (536) T protein:vir:21 1 MAEKRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYQTPWQAVGARGLNNLASKLMLALFPM 80 (536) T ss_pred CcchhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHHhhcCC Confidence 66644 4567888899999999999999999999986532211 111222346889999999999999999999988 Q ss_pred CCEEEEecCCcch-------HHH------HHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeeeee Q lcl|NC_019423. 97 SKLFKLTPVTFED-------ELA------ARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIKTE 163 (756) Q Consensus 97 ~~~~~~~p~~~~D-------~~~------A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~ 163 (756) .+||++.+..++- ... -+..+..+.-.| ..++-+..++.++++++..|+|++-+ + + . T Consensus 81 ~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l-~~snf~~~~~~~~~~L~~~G~a~ly~--~-e------~ 150 (536) T protein:vir:21 81 QTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYI-ESNSYRVTLFEALKQLVVAGNVLLYL--P-E------P 150 (536) T ss_pred CcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHhHCcEeEEE--e-e------C Confidence 8999998754331 111 222344443333 35666777888999998889887653 1 0 0 Q ss_pred eeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEEEec Q lcl|NC_019423. 164 TPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVEMLN 243 (756) Q Consensus 164 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie~V~ 243 (756) ...+...++.++ T Consensus 151 --------------------------------------------------------------------~~~~~~~f~~~p 162 (536) T protein:vir:21 151 --------------------------------------------------------------------EGSNYNPMKLYR 162 (536) T ss_pred --------------------------------------------------------------------CCCceeeEEEEE Confidence 000112356677 Q ss_pred hhheEeCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceEEEEEE Q lcl|NC_019423. 
244 PNNVVIDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVAYEY 323 (756) Q Consensus 244 p~~~~~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~E~ 323 (756) ..+|++..++...++ -++++..+|..+|.+.++.. .+.. ... ....++|.||.+ T Consensus 163 l~~~~v~~d~~G~vd---~i~r~~~~t~~~l~~~fg~~-~~~~-------------~~~---------~~~~~~v~v~~~ 216 (536) T protein:vir:21 163 LSSYVVQRDAFGNVL---QMVTRDQIAFGALPEDIRKA-VEGQ-------------GGE---------KKADETIDVYTH 216 (536) T ss_pred cCeEEEeeCCCCCee---EEeeeeeccHHHHHHhhhhh-hccc-------------ccc---------cccccceeEEEE Confidence 788888888766555 37889999999887765421 1110 000 011236778766 Q ss_pred EEEeeccCCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019423. 324 WGFYDINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGATMRGMIDLLGR 403 (756) Q Consensus 324 w~k~d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~ 403 (756) -++. .+ .+...+ +.- +.+..+.-+...|+...+||++..|...++..||.|++....+-.+.+|.+.+..+..... T Consensus 217 v~~~-~~-~~~~~~-~~e-~~g~~v~~~~g~~~f~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~ 292 (536) T protein:vir:21 217 IYLD-ED-SGEYLR-YEE-VEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMI 292 (536) T ss_pred EEEe-cC-CCcEEE-Eec-cCCeeeccccCccccccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 4432 22 222222 222 2343343344556678899999999999999999999999999999999999999999999 Q ss_pred hcCCceEeeccccCccchhhhhcccccccccccccccccccccc--CCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCc Q lcl|NC_019423. 404 SANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQGNPSQSIMEHK--FPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTG 481 (756) Q Consensus 404 ~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~--~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~ 481 (756) +.++.++++.+.+..... ...+....+ ..+....+.+.+ ...--+.....++.+.+.+....=+. +++.. 
T Consensus 293 a~~~~~lv~p~g~~~~~~--~~~~~~g~~---v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~--~l~~~- 364 (536) T protein:vir:21 293 SSKVIGLVNPAGITQPRR--LTKAQTGDF---VTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLN--SAVQR- 364 (536) T ss_pred HhcCCcccCcccccchhh--hccCCCcce---ecCCcccceeeeccccccchHHHHHHHHHHHHHHHHHhhh--hcccC- Confidence 999999998766633222 111111111 111112233332 22233445666777777776654222 12211 Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHhcCcceEEEecc Q lcl|NC_019423. 482 SAYGDVAAGIRGALDAASKREMAILRRLAK-GMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDLKGNFDIEVDIN 560 (756) Q Consensus 482 ~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~-~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~~~~~Dv~V~~g 560 (756) ++...||++|..+.+.....|..++.+|.. ++.++.++++.++.+.. .+-++..+. +.+.+..+ T Consensus 365 ~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g-----------~lP~~p~~~----v~~~~vs~ 429 (536) T protein:vir:21 365 TGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQ-----------QIPELPKEA----VEPTISTG 429 (536) T ss_pred CCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhCC-----------CCCCCChhh----ccceEEec Confidence 233479999999999999999999999975 88999999999986531 111222222 22333334 Q ss_pred cccH-HHHHHHHHHHHHHHhhccCCHh------HHHHHHHHHHhhcCChhHHHHhhhccCCCChhhhhHHHHHHHHHHHH Q lcl|NC_019423. 561 TAEI-DNQKSQDLGFMVQTLGNTVDQS------ITLSLVAKIAELKRMPDLAHELRTWQPQPDPMEEQLKQLAIQKAQLE 633 (756) Q Consensus 561 ~a~~-~~~~~q~l~~llq~~~~~~~~~------~~~~~l~~l~e~~~~~~~~~~l~~~~~q~~p~~~~~~q~~~~~aq~e 633 (756) .+.. +....+.++..++.++. +.|. +...++..+++..|+ +....++. +. 
+.+++.+++++ T Consensus 430 l~~l~r~~~~~~l~~~~~~la~-~~Pe~ld~~id~d~~~~~~a~~~Gv-~p~~~irt------~e--ev~~~r~q~~~-- 497 (536) T protein:vir:21 430 LEAIGRGQDLDKLERCVTAWAA-LAPMRDDPDINLAMIKLRIANAIGI-DTSGILLT------EE--QKQQKMAQQSM-- 497 (536) T ss_pred HHHHHHHHHHHHHHHHHHHHHh-hchhhhcccCCHHHHHHHHHHHcCC-ChhhhcCC------HH--HHHHHHHHHHH-- Confidence 4332 33333444444444332 2221 223445555555555 11222221 11 11111110000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCchhh Q lcl|NC_019423. 634 NEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARDMEKQKAQSQGNQNLQITKALTTPTKEGETTPNI 707 (756) Q Consensus 634 ~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~~~~~~q~~~~~~~~~~~a~~~~~~~~~~~~~~ 707 (756) .+.++++ +. .+. ..|+. ++..-.+.+.+..+ .+-..|++ T Consensus 498 ~~~~~~~------------a~----~~~-----------~~~~~-----~~~~~~~~~~~~~~---~~g~~~~~ 536 (536) T protein:vir:21 498 QMGMDNG------------AA----ALA-----------QGMAA-----QATASPEAMAAAAD---SVGLQPGI 536 (536) T ss_pred HHHHHHH------------HH----HHH-----------HHHHH-----HHhcChhhHHhhhh---ccccCCCC Confidence 0000000 00 000 00000 00000001110110 01111222 No 40 >protein:vir:94572 Length: 535 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919010;genbank:gi:119637774;genbank:GeneID:5179332 Probab=99.94 E-value=1.4e-24 Score=151.40 Aligned_cols=510 Identities=12% Similarity=0.061 Sum_probs=273.2 Q ss_pred CCCccccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCC--CCCCCCCCcccCHHHHHHHHHHHH Q lcl|NC_019423. 
10 LPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAK--PPKIKGRSQVQPRLVRRQAEWRYA 87 (756) Q Consensus 10 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~--~~~~~grS~~v~~~v~~~~e~~~~ 87 (756) |.+..+++.+ ....+++.++..++.+++.++.|++-.+|.......+ ......+.++++....+.++.+.+ T Consensus 1 ~~~~~~~~~~-------~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa 73 (535) T protein:vir:94 1 MASSQKREGF-------AENGAKAVYDALKNDRNSYETRAENCAKYTIPSLFPKDSDNASTDYTTPWQAVGARGLNNLAS 73 (535) T ss_pred CCchhhhhhH-------HHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCccccccCCcccccHHHHHHHHHH Confidence 3333333333 2344667799999999999999999999986432211 112223355788999999999999 Q ss_pred HHHHhhcCCCCEEEEecCCcc-------hHHHH------HHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeee Q lcl|NC_019423. 88 PLSEPFLSSSKLFKLTPVTFE-------DELAA------RQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWE 154 (756) Q Consensus 88 ~L~~~f~~~~~~~~~~p~~~~-------D~~~A------~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~ 154 (756) .|+.-+|.+.+||++.+...+ +.+.+ +..+..+. ..+..++-+..++.++++++..|+|++-+.++ T Consensus 74 ~l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~-~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~ 152 (535) T protein:vir:94 74 KLMLALFPMQTWMKLTISEFEAKQLVAQPAELAKVEEGLSMVERILM-NYIESNSYRVTLFETLKQLVVAGNALLYIPEP 152 (535) T ss_pred HHHhhhcCCCCccccccChhhhhccccchhHHHHHHHHHHHHHHHHH-HHHHhcCcHHHHHHHHHHHHhhCcEeEeeccC Confidence 999999988899999774211 11111 12222222 22346667777889999999999998875332 Q ss_pred eeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeec Q lcl|NC_019423. 155 RKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALV 234 (756) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~ 234 (756) ... 
T Consensus 153 ~~~----------------------------------------------------------------------------- 155 (535) T protein:vir:94 153 EGT----------------------------------------------------------------------------- 155 (535) T ss_pred cCc----------------------------------------------------------------------------- Confidence 110 Q ss_pred CceeEEEechhheEeCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccc Q lcl|NC_019423. 235 NRPTVEMLNPNNVVIDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDAL 314 (756) Q Consensus 235 g~~~ie~V~p~~~~~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s 314 (756) .+++..++..+|++..++...++ -++++..++...|-..... .+. ... . ... T Consensus 156 -~~~f~~~pl~~y~v~~d~~G~vd---~i~r~~~~~~~~l~~~~~~--~~~----------~~~--------~----~~~ 207 (535) T protein:vir:94 156 -YNPMKLYRLSSYVVQRDAFGTVL---QIVTLDKTAYAALPEDVRN--SMD----------SSQ--------E----HKG 207 (535) T ss_pred -ccceEEEEcCeEEEeeCCCCCeE---EEEeeeeccHHHhhHHHHH--HHH----------hcc--------c----cCC Confidence 01234455556777666655444 3567788887776432210 000 000 0 111 Q ss_pred cceEEEEEEEEEeeccCCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHHHH Q lcl|NC_019423. 315 RKKVVAYEYWGFYDINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGATM 394 (756) Q Consensus 315 ~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~~~ 394 (756) ...|.||++.++. .+ ++... ...++.+..+...++-++...+||++..|...++..||+|.+....+-.+.+|.+. T Consensus 208 ~~~v~v~~~v~~~-~~-~~~~~--~~~e~~g~~~~~~~~~~g~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~ 283 (535) T protein:vir:94 208 DEMIDVYTHIYLD-EE-SGEYL--KYEEIDGVEVEGTDASYPVDACPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQ 283 (535) T ss_pred CceeEEEEEEEee-CC-CCcEE--EEEEecCeeeccccccCccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHH Confidence 2468888775432 22 22222 22355555554334445567899999999999999999999999999999999999 Q ss_pred HHHHHHHHhhcCCceEeeccccCccchhhhhccccccccccccccccccccccCC--CcchHHHHHHHHHHHHHHHHhch Q lcl|NC_019423. 
395 RGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQGNPSQSIMEHKFP--ELPQSAIVMTQMQNQEAESLTGV 472 (756) Q Consensus 395 ~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~--~~~~~~~~~l~~~~~~~e~~tGv 472 (756) +..+....++.++.++++.+.+...... ..+....+ ..+....+.+.+.. .--+....+++...+.+.... . T Consensus 284 ~~~l~~~~~a~~~~~lv~p~g~~~~~~~--~~~~~g~~---v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af-~ 357 (535) T protein:vir:94 284 EAIVKMSMISAKVIGLVNPAGITQVRRL--TKAQTGDF---VSGRPEDISFLQLEKAADFSVARAVSEQIEGRLSYAF-M 357 (535) T ss_pred HHHHHHHHHhccCCcccccccccchhhc--ccCCCcee---ecCCcccceeeecccccchhHHHHHHHHHHHHHHHHH-h Confidence 9999999999999999886655332221 11111111 11222333333322 222344556666666665543 1 Q ss_pred hHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHhcC Q lcl|NC_019423. 473 KAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAK-GMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDLKG 551 (756) Q Consensus 473 ~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~-~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~~~ 551 (756) .. +++.. ++...||++|..+.+.....|..++-+|.. ++.++.++.+.++.+.. .++--|++. T Consensus 358 ~~-~~~~~-d~~rvTAtEV~~r~~E~~~~LGpv~~rl~~ElL~Pli~r~~~il~r~g------------~lP~~p~~~-- 421 (535) T protein:vir:94 358 LN-SAVQR-TGERVTAEEIRYVASELEDTLGGVYSILSQELQLPMVRVLLKQLQATN------------QIPELPKEA-- 421 (535) T ss_pred Hh-hhccC-CCCCccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhCC------------CCCCCChhh-- Confidence 11 12211 233479999999999999999999999975 88999999999997641 122122222 Q ss_pred cceEEEecccccH-HHHHHHHHHHHHHHhhccCCHhHH------HHHHHHHHhhcCChhHHHHhhhccCCCChhhhhHHH Q lcl|NC_019423. 552 NFDIEVDINTAEI-DNQKSQDLGFMVQTLGNTVDQSIT------LSLVAKIAELKRMPDLAHELRTWQPQPDPMEEQLKQ 624 (756) Q Consensus 552 ~~Dv~V~~g~a~~-~~~~~q~l~~llq~~~~~~~~~~~------~~~l~~l~e~~~~~~~~~~l~~~~~q~~p~~~~~~q 624 (756) +++.+..+.+.. +....+.++..++.++. +.|... -.++..+.+..|.|- ...++ ++. 
+ T Consensus 422 -v~~~~vs~la~l~r~~~~~~l~~~~~~laq-~~P~~ld~~id~d~~~~~~a~~~Gvp~-~~i~r------s~e-----e 487 (535) T protein:vir:94 422 -VEPTISTGMEALGRGQDLDKLERCIAAWSA-LAPMQGDPDINIATIKLRIANAIGIDT-SGILK------TPE-----E 487 (535) T ss_pred -ccceEeehHHHHHHHHHHHHHHHHHHHHHh-hChHHhhhcCCHHHHHHHHHHHhCCCh-hhhcC------CHH-----H Confidence 223333343332 22333444444554433 333221 223444444445431 00111 111 1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCc Q lcl|NC_019423. 625 LAIQKAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARDMEKQKAQSQGNQNLQITKALTTPTKEGETT 704 (756) Q Consensus 625 ~~~~~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~~~~~~q~~~~~~~~~~~a~~~~~~~~~~~ 704 (756) .++..+|++. +++...+.++..++...++. +....+++. ....+-+| T Consensus 488 v~~~~~q~~~-----~~~~~~~~~~~g~~~~~~~~---------------~~~~~~~~~-------------~~~~g~~~ 534 (535) T protein:vir:94 488 KQQEMAEAAQ-----GTAMQNAAASAGAGAGTMAT---------------ASPENMKAA-------------AAQAGMAP 534 (535) T ss_pred HHHHHHHHHH-----HHHHHHHHHHHHHhhhcccc---------------cChHHHHHH-------------HHHhccCC Confidence 1111100000 00000000000000000000 000000000 00111111 Q ss_pred h Q lcl|NC_019423. 705 P 705 (756) Q Consensus 705 ~ 705 (756) . T Consensus 535 ~ 535 (535) T protein:vir:94 535 N 535 (535) T ss_pred C Confidence 1 No 41 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=99.94 E-value=7.1e-25 Score=152.97 Aligned_cols=497 Identities=12% Similarity=0.027 Sum_probs=273.6 Q ss_pred CCcccCCCCCCCccccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCC--CCCCCCCCcccCHHH Q lcl|NC_019423. 1 MEHQDTFKPLPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAK--PPKIKGRSQVQPRLV 78 (756) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~--~~~~~grS~~v~~~v 78 (756) |+..+. .....++..++..++.+++.++.|++-.+|.......+ .....-+.+.++... 
T Consensus 1 ~~~~~~-------------------~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~ 61 (522) T protein:vir:94 1 MAEREG-------------------FAAEGAKAVYDRLKNGRQPYETRAQNCAAVTIPSLFPKESDNSSTEYTTPWQAVG 61 (522) T ss_pred Ccccch-------------------hhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccH Confidence 322211 12455777899999999999999999999986432211 111222345888889 Q ss_pred HHHHHHHHHHHHHhhcCCCCEEEEecCCcc-------hHHHH------HHHHHHHHHHHhhhcCCcchHHHHHHHHhhcC Q lcl|NC_019423. 79 RRQAEWRYAPLSEPFLSSSKLFKLTPVTFE-------DELAA------RQNELVLNYQFRTQLNKVKLVDDYVHSIVDDG 145 (756) Q Consensus 79 ~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~-------D~~~A------~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g 145 (756) .+.++.+.+.|+.-+|.+.+||++.+..++ +...+ +..+..+.-. ...++-+..++.++++++..| T Consensus 62 ~~a~~~Las~l~~~ltP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~-~~~snf~~~~~~~~~~L~~~G 140 (522) T protein:vir:94 62 ARCLNNLAAKLMLALFPQSPWMRLTVSEYEAKTLSQDSEAAARVDEGLAMVERVLMAY-METNSFRVPLFEALKQLIVSG 140 (522) T ss_pred HHHHHHHHHHHHhhcCCCCcccccccchhhhhccCcccchhHHHHHHHHHHHHHHHHH-HHhcCcHHHHHHHHHHHHhhC Confidence 999999999999999988899999875322 22222 2333333322 245666777888899988889 Q ss_pred ceEEEEeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCcee Q lcl|NC_019423. 146 TGIARIGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVT 225 (756) Q Consensus 146 ~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~ 225 (756) +|++-+ +... T Consensus 141 ~a~l~~--~~~~-------------------------------------------------------------------- 150 (522) T protein:vir:94 141 NCLLYI--PEPE-------------------------------------------------------------------- 150 (522) T ss_pred cEeEee--eccC-------------------------------------------------------------------- Confidence 987642 1000 Q ss_pred EEEeeeeecCceeEEEechhheEeCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhcccc Q lcl|NC_019423. 
226 EVEVEKALVNRPTVEMLNPNNVVIDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTP 305 (756) Q Consensus 226 ~~~~~~~~~g~~~ie~V~p~~~~~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~ 305 (756) ..+...+..++..++++..++...++ -++++..++...|-..... .+ . T Consensus 151 -------~~~~~~~~~~pl~~y~v~~d~~G~vd---~i~r~~~~~~~~l~~~~~~--~~--------------~------ 198 (522) T protein:vir:94 151 -------QGTYSPMRMYRLVSYVVQRDAFGNIL---QIVTIDKVAFSALPEDVKS--QL--------------N------ 198 (522) T ss_pred -------CCceeeEEEEEcceEEEeeCCCcCeE---EEeeeeeccHHhcchHHHH--HH--------------h------ Confidence 00012345566677777777655444 3566777776654221100 00 0 Q ss_pred ccccccccccceEEEEEEEEEeeccCCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHH Q lcl|NC_019423. 306 SDFQFKDALRKKVVAYEYWGFYDINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGD 385 (756) Q Consensus 306 ~~~~~~d~s~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d 385 (756) .+ . . ...++|.||++.++. +++..++ . -+.+..+.-.++-|+..++||++..|...++..||.|++....+ T Consensus 199 ~~-~-~-~p~~~v~v~~~v~~~---~~~~~~~-~--~~~g~~~~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~ 269 (522) T protein:vir:94 199 AD-D-Y-EPDTELEVYTHIYRQ---DDEYLRY-E--EVEGIEVTGTDGSYPLTACPYIPVRMVRLDGEDYGRSYCEEYLG 269 (522) T ss_pred cc-c-C-CccceEEEEEEEEee---CCceeEE-e--eccCceecccCCCCccccCCceeeeeeecCCCccccchHHHHHH Confidence 00 0 0 112478888887663 2333222 1 22233443344445667899999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhhhhccccccccccccccccccccccC--CCcchHHHHHHHHHH Q lcl|NC_019423. 386 NQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQGNPSQSIMEHKF--PELPQSAIVMTQMQN 463 (756) Q Consensus 386 ~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~--~~~~~~~~~~l~~~~ 463 (756) -.+.+|.+.+..+....++.+|.++++.+.+..... ...+.... +..+....+.+.+. +.--+.....++.+. 
T Consensus 270 D~k~L~~l~~~~l~~~~~~~~p~~~v~~~g~~~~~~--~~~~~~g~---~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~ 344 (522) T protein:vir:94 270 DLNSLETITEAITKMAKVASKVVGLVNPNGITQPRR--LNKAATGE---FVAGRVEDINFLQLTKGQDFTIAKSVADAIE 344 (522) T ss_pred HHHHHHHHHHHHHHHHHHHhCCceeecccccccchh--eeccCCce---eecCCcccceeeecccccchhHHHHHHHHHH Confidence 999999999999999999999999998766543322 11211111 11122223443332 222344566777777 Q ss_pred HHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhCCCCcEEEEecCcee Q lcl|NC_019423. 464 QEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAK-GMADIGTKICAMNAVFLSEKEVVRITNEQYV 542 (756) Q Consensus 464 ~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~-~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v 542 (756) +.+....-+. +++.. ++...||++|..+.+.....|..++.+|.. ++.++.++.+.++.+..-=+ T Consensus 345 ~rI~~af~~~--~~~~~-~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP----------- 410 (522) T protein:vir:94 345 QRLGWAFLLN--SAVQR-NAERVTAEEIRYVAGELEATLGGVYSVQSQELQLPIVRVLMNQLQSAGMIP----------- 410 (522) T ss_pred HHHHHHHhhh--hhccC-CCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCC----------- Confidence 7777655333 22322 223479999999999999999999999975 88999999999997653211 Q ss_pred ecCHhHhcCcceEEEecccccH-HHHHHHHHHHHHHHhhccCCHhH------HHHHHHHHHhhcCChhHHHHhhhccCCC Q lcl|NC_019423. 543 EIKREDLKGNFDIEVDINTAEI-DNQKSQDLGFMVQTLGNTVDQSI------TLSLVAKIAELKRMPDLAHELRTWQPQP 615 (756) Q Consensus 543 ~i~~d~~~~~~Dv~V~~g~a~~-~~~~~q~l~~llq~~~~~~~~~~------~~~~l~~l~e~~~~~~~~~~l~~~~~q~ 615 (756) ++ |++ .+.+.+..+.+.. +....+.+...++.++. +.|.. 
.-.++..+++..|++ ....++ T Consensus 411 ~~-p~~---~v~v~~~s~La~~qr~~~~~~l~~~~~~ia~-l~P~~~~~~id~d~~~~~~a~~~Gv~-~~~ivr------ 478 (522) T protein:vir:94 411 DL-PKE---AVEPTVSTGLEALGRGQDLEKLTQAVNMMTG-LQPLSQDPDINLPTLKLRLLNALGID-TAGLLL------ 478 (522) T ss_pred CC-Ccc---cEEeeEecHHHHHHHHHHHHHHHHHHHHHHh-ccchhhhhcCCHHHHHHHHHHHcCCC-hhhccC------ Confidence 11 111 1334444444332 22223334444444432 23322 123444555555552 111111 Q ss_pred ChhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 616 DPMEEQLKQLAIQKAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARDMEKQ 678 (756) Q Consensus 616 ~p~~~~~~q~~~~~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~~~~ 678 (756) ++.+. +++.+++++++. + ++.+....++ ..+. .+.++-.+|... T Consensus 479 ~~ee~--~~~~~q~~~~~~--~-------~~~~~~~~~~-~~a~-------~~~~~~~~~~~~ 522 (522) T protein:vir:94 479 TQDEK--IQRMAEQSSQQA--V-------VQGASAAGAN-MGAA-------VGQGAGEDMAQA 522 (522) T ss_pred CHHHH--HHHHHHHHHHHH--H-------HHHHHHHHHH-hhhh-------hhcccchhhhcC Confidence 11111 111110000000 0 0000000000 0000 000000111111 No 42 >protein:vir:100039 Length: 522 # NCBI annotation: T7-like head-to-tail connector # Family: family:all:481 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214201;genbank:gi:61806424;genbank:GeneID:3294719 Probab=99.93 E-value=3e-24 Score=149.51 Aligned_cols=496 Identities=13% Similarity=0.055 Sum_probs=275.8 Q ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCC-CCCCC---CcccCHHHHHHHHHHHHHHHHhhcC-C Q lcl|NC_019423. 22 WKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAKPP-KIKGR---SQVQPRLVRRQAEWRYAPLSEPFLS-S 96 (756) Q Consensus 22 ~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~-~~~gr---S~~v~~~v~~~~e~~~~~L~~~f~~-~ 96 (756) |+ +++.++..++.+++.++.|++-.+|.......... 
...|+ .++++....+.++.+.+.|+..+|+ + T Consensus 1 m~-------~~~r~~~L~~~R~~~e~~w~e~~~~tlP~~~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~ 73 (522) T protein:vir:10 1 MK-------ARERYNQLTTARQMFLDKAVECSELTLPYLIDDDISSRPNHKSLTVPWQSVGAKCCVTLAAKLMLAVLPPQ 73 (522) T ss_pred Cc-------hHHHHHHHHHHhhHHHHHHHHHHHHhhhcccCCCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCCC Confidence 22 66789999999999999999999998643111111 11222 3478889999999999999999998 5 Q ss_pred CCEEEEecCCcchHH------------HHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeeeeee Q lcl|NC_019423. 97 SKLFKLTPVTFEDEL------------AARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIKTET 164 (756) Q Consensus 97 ~~~~~~~p~~~~D~~------------~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~ 164 (756) .+||++.+...+..+ .-+..+..+.-.+ ..++-+..++.++++++..|+|++-+ + + . T Consensus 74 ~~WF~l~~~d~~l~~~~~~~~~~~v~~~l~~ve~~~~~~l-~~snf~~~~~~~~~~L~~~G~a~ly~--~-~------~- 142 (522) T protein:vir:10 74 TSFFKLQVRDDKLGEELDPQIRSELDLSFSKMERMIMDYI-AASNDRVAVHQALKHLIVGGNALIFM--G-K------D- 142 (522) T ss_pred CccccccCChHHHhhhcChhhHHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHhHCceeEEE--c-C------C- Confidence 899999985432111 1122333343333 35667777899999999999997531 0 0 0 Q ss_pred eeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEEEech Q lcl|NC_019423. 165 PVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVEMLNP 244 (756) Q Consensus 165 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie~V~p 244 (756) .+..++. T Consensus 143 -------------------------------------------------------------------------~~~~~pl 149 (522) T protein:vir:10 143 -------------------------------------------------------------------------GLKTFPL 149 (522) T ss_pred -------------------------------------------------------------------------CceEEEc Confidence 0123445 Q ss_pred hheEeCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceEEEEEEE Q lcl|NC_019423. 
245 NNVVIDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVAYEYW 324 (756) Q Consensus 245 ~~~~~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~E~w 324 (756) .+|++..++...++ -++++.++|...+...++. +.+.. . .. . . .....+|.||.|. T Consensus 150 ~~y~v~~d~~G~vd---~i~r~~~~t~~ql~~~fg~-~~~~~---~----~~---~-----~-----~~~~~~v~v~~~v 205 (522) T protein:vir:10 150 TRYVINRDGDGNVL---EIVTKELISRKVLDIELPE-PKPNT---G----ID---E-----S-----STTNDDVTIYTYV 205 (522) T ss_pred ceEEEeeCCCCCee---EEEeeeeccHHHHHHhcch-hccch---h----hh---c-----c-----cCCCCceEEEEEE Confidence 67888777766554 3788999999998876543 11110 0 00 0 0 0112358888876 Q ss_pred EEeeccCCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019423. 325 GFYDINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGATMRGMIDLLGRS 404 (756) Q Consensus 325 ~k~d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~ 404 (756) +..+ + .+...++ ....+.++...++-++...+||++..|...++..||+|.+....+-.+.+|.+.+..+.....+ T Consensus 206 ~p~~-~-~~~~~~~--~~~~~~~~~~~~s~~g~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~~~~~~~a 281 (522) T protein:vir:10 206 KLDK-S-SGRWVWH--QEAFDKIIPDSRSTAPKNASPWLPLRFNTVDGEDYGRGRVEEFLGDLKSLDGLSQSLIEGAAAA 281 (522) T ss_pred Eeec-c-CCceEEE--EccCCccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 5431 1 2222221 2234444443444445577999999999999999999999999999999999999999999999 Q ss_pred cCCceEeeccccCccchhhhhccccccccccccccccccccccCC--CcchHHHHHHHHHHHHHHHHhchhHHhcCCCcc Q lcl|NC_019423. 405 ANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQGNPSQSIMEHKFP--ELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGS 482 (756) Q Consensus 405 ~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~--~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~ 482 (756) .+|.++++.+.+..... ...+... .+..+....+.+.+.. .--+.+..+++.+.+.+.+. 
++++...+ T Consensus 282 ~~p~~lv~~~~~~~~~~--l~~~~~~---~~v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~ri~~a-----Fl~~~~~d 351 (522) T protein:vir:10 282 SKVVFLVSPSSTTKPAT--IAKAGNG---AIVQGRPEDVAVIQVGKTADFSTAANMATAIEKRLLEA-----FLVMNVRN 351 (522) T ss_pred cCCceeecccccccccc--ccCCCCc---ceecCCCccceeecccccccchHHHHHHHHHHHHHHHH-----HhhccCCC Confidence 99999997665533222 1112111 1112222334444322 12233445566666665543 22333334 Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHhcCcceEEEeccc Q lcl|NC_019423. 483 AYGDVAAGIRGALDAASKREMAILRRLA-KGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDLKGNFDIEVDINT 561 (756) Q Consensus 483 a~~~tA~~i~~~~~aa~~~l~~~~~n~~-~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~~~~~Dv~V~~g~ 561 (756) +...||++|..+.+.....|..++.+|. +++.++.++.+.++.+- +.++--|+++... ++ |..-. T Consensus 352 ~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~------------g~lP~~p~~~~~~-~~-v~~is 417 (522) T protein:vir:10 352 AERVTAEEVRLTQLELEQQLGGIFSLLVIEFLIPYLNRTLLVLQRS------------NQIPKLPKDIVRP-TI-VAGVN 417 (522) T ss_pred CCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhc------------CCCCCCCcccccc-cc-ccchh Confidence 4457999999999999999999999996 48899999999988753 1222223333221 22 22222 Q ss_pred ccHHHHHHHHHHHHHHHhhccCCHhHH------HHHHHHHHhhcCChhHHHHhhhccCCCChhhhhHHHHHHHHHHHHHH Q lcl|NC_019423. 562 AEIDNQKSQDLGFMVQTLGNTVDQSIT------LSLVAKIAELKRMPDLAHELRTWQPQPDPMEEQLKQLAIQKAQLENE 635 (756) Q Consensus 562 a~~~~~~~q~l~~llq~~~~~~~~~~~------~~~l~~l~e~~~~~~~~~~l~~~~~q~~p~~~~~~q~~~~~aq~e~~ 635 (756) +....++.+.++..++.++..++|..+ -+++..+.+..|++-. 
..++ .+.+ .+++++++++.+ T Consensus 418 ~Laraq~~~~l~~~~~~i~~~~~p~~~~~~id~d~~~~~~a~~~Gvp~~-~ivr------t~ee--v~~~~q~~q~~~-- 486 (522) T protein:vir:10 418 ALGRGQDRESLTAFVGTIAQTLGPEALMQYLNPLEAIKRLAAAQGIDVL-NLVK------TEQQ--LAEEQQAAQQQA-- 486 (522) T ss_pred HHHHHHHHHHHHHHHHHHHHhhCchhhhhcCCHHHHHHHHHHHhCCChh-hhcC------CHHH--HHHHHHHHHHHH-- Confidence 344555556666666665443322211 1234445555554310 1111 1110 000110000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_019423. 636 ELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARDMEKQKAQSQGNQNLQITKALTTPTKE 700 (756) Q Consensus 636 ~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~~~~~~q~~~~~~~~~~~a~~~~~~~ 700 (756) ++++... + +.+.++.. .+ . .....+.++++++.+.+ T Consensus 487 ---~~~~~~~-~---------------a~~~~~~~-----~~---~--~~~~~~~~~~~~~~~~~ 522 (522) T protein:vir:10 487 ---AQQSLVD-Q---------------AGQMTGSP-----LM---D--PTKNPQLMDEEQPPMEE 522 (522) T ss_pred ---HHHHHHH-H---------------HHHHhccc-----cc---C--ccccHHHHHHhCCCCCC Confidence 0000000 0 00000000 00 0 00000112222222222 No 43 >protein:vir:8883 Length: 543 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813772;genbank:gi:29366727;genbank:GeneID:1258836 Probab=99.93 E-value=2.9e-24 Score=149.61 Aligned_cols=523 Identities=11% Similarity=0.020 Sum_probs=281.4 Q ss_pred CCcccCCCCCCCccccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCC--CCCCcccCHHH Q lcl|NC_019423. 1 MEHQDTFKPLPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAKPPKI--KGRSQVQPRLV 78 (756) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~--~grS~~v~~~v 78 (756) |..++ ++|+ .-..++..++..++.+++.++.|++-.+|.......+.... ..+.++++... 
T Consensus 1 ~~~~~----------~~~~-------~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~ 63 (543) T protein:vir:88 1 MAETK----------REGL-------AEEGAKAVYERLKNDRVPYETRAENCAKVTIPSLFPKDSDNSSTDYTTPWQAVG 63 (543) T ss_pred Ccccc----------cCcc-------hHHHHHHHHHHHHHHHhHHHHHHHHHHHHhccccCCCCCCcccccccccccchH Confidence 43332 2233 23446677999999999999999999999975422111111 11235788899 Q ss_pred HHHHHHHHHHHHHhhcCCCCEEEEecCCcc-------hHHH--H----HHHHHHHHHHHhhhcCCcchHHHHHHHHhhcC Q lcl|NC_019423. 79 RRQAEWRYAPLSEPFLSSSKLFKLTPVTFE-------DELA--A----RQNELVLNYQFRTQLNKVKLVDDYVHSIVDDG 145 (756) Q Consensus 79 ~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~-------D~~~--A----~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g 145 (756) .+.++.+.+.|+.-+|.+.+||++.+-..+ +.+. + +..+..+.-.| ..++-+..++.++++++..| T Consensus 64 ~~a~~~Laa~l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~-~~snf~~~~~~~~~~L~~~G 142 (543) T protein:vir:88 64 ARGLNNLSAKVMLALFPLQSWMKLKVSEWQAKQLVSDPSQLAVVEQGLGMVERILMSYM-EANSYRVTLFELIRQLALAG 142 (543) T ss_pred HHHHHHHHHHHHHhhcCCCcccccccChHHHhcccCChhhHHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHhhC Confidence 999999999999999988999999885321 1111 1 22233343333 45667777999999999999 Q ss_pred ceEEEEeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCcee Q lcl|NC_019423. 146 TGIARIGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVT 225 (756) Q Consensus 146 ~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~ 225 (756) +|++-+ +... .+ + T Consensus 143 ~a~ly~--~~~~----~~----------------------------------------------------~--------- 155 (543) T protein:vir:88 143 TALIYL--PPPD----AS----------------------------------------------------S--------- 155 (543) T ss_pred ceeeee--ccCc----cc----------------------------------------------------c--------- Confidence 998642 1000 00 0 Q ss_pred EEEeeeeecCceeEEEechhheEeCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhcccc Q lcl|NC_019423. 
226 EVEVEKALVNRPTVEMLNPNNVVIDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTP 305 (756) Q Consensus 226 ~~~~~~~~~g~~~ie~V~p~~~~~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~ 305 (756) .+.-.+..++..++++..++...++ -++++..++..+|-.... +.+. .. . T Consensus 156 --------~~~~~~~~~pl~~y~v~~d~~G~v~---~i~r~~~~~~~~l~~~~~--~~v~----------~~----~--- 205 (543) T protein:vir:88 156 --------NSYNPMKLYTLHNHVVQRDAFGNVL---QIVTLDKVAYAALPEDVR--NSLS----------GG----Q--- 205 (543) T ss_pred --------ceecceEEeEcceEEEeeCCCCCee---eeeeeeeccHHHHhHHhh--HHHH----------HH----h--- Confidence 0001134455567777666655443 467788888887643211 0000 00 0 Q ss_pred ccccccccccceEEEEEEEEEeeccCCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHH Q lcl|NC_019423. 306 SDFQFKDALRKKVVAYEYWGFYDINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGD 385 (756) Q Consensus 306 ~~~~~~d~s~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d 385 (756) ..+. -++|.||++-++.+ +.+.+. + + ..+.+..+....+-|+.+++||++..|...++..||+|.+....+ T Consensus 206 ----~~~p-~~~~~v~~~V~pr~-~~~~~~-~-~-~~~~~~~v~~~~~~~~~~e~P~i~~Rw~~~~ge~YGrgp~~~~l~ 276 (543) T protein:vir:88 206 ----EYKP-EQELEVYTHIYIDD-ESGDFL-S-Y-QEIEGVEVDGSDGQYPQDALPWIAVRWTKRDGEHYGRSHVEEYLG 276 (543) T ss_pred ----hcCC-ccceEEEEEEEeec-CCCccc-c-c-ccccCeeeecCCCccccccCCceeeeeeecCCCccccchHHHHHH Confidence 0111 13688887643321 112221 1 1 123455555556667778899999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhhhhccccccccccccccccccccccCC--CcchHHHHHHHHHH Q lcl|NC_019423. 386 NQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQGNPSQSIMEHKFP--ELPQSAIVMTQMQN 463 (756) Q Consensus 386 ~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~--~~~~~~~~~l~~~~ 463 (756) -.+.+|.+.+..+....++.+|.++++.+.+..... ...+.... +..+....+.+.+.. .--+.....++.+. 
T Consensus 277 D~k~L~~l~~~~l~~~~~~~~pp~~v~~~g~~~~~~--~~~~~~g~---~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~ 351 (543) T protein:vir:88 277 DLNSLESLNEAMIKFAMISSKVVGLVNPNGITQVRR--LVKAQTGD---FVAGRKADIEFLQLEKTADFTVAKSVADAIE 351 (543) T ss_pred HHHHHHHHHHHHHHHHHHHhcCceeeccccccchhh--cccCCCce---eecCCCCcceeeecccccchhHHHHHHHHHH Confidence 999999999999999999999999997765533221 12222211 111222334433322 22344666777777 Q ss_pred HHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhCCCCcEEEEecCcee Q lcl|NC_019423. 464 QEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAK-GMADIGTKICAMNAVFLSEKEVVRITNEQYV 542 (756) Q Consensus 464 ~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~-~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v 542 (756) +.+.+..= .+.+...+ +...||++|..+.+.....|..++.+|.. ++.++.++.+.++.+..-=+ T Consensus 352 ~rI~~af~-~~~~~~~~--~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP----------- 417 (543) T protein:vir:88 352 ARLSYVFM-LNSAVQRS--GERVTAEEIRYVASELEDTLGGVYSILSQELQLPIVRVLLNQLQATQQIP----------- 417 (543) T ss_pred HHHHHHHh-hhhhccCC--CCcccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCC----------- Confidence 77776542 22222222 23479999999999999999999999975 88999999999998752211 Q ss_pred ecCHhHhcCcceEEEeccccc-HHHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCChhHHHHhhhccCCCC-hhhh Q lcl|NC_019423. 543 EIKREDLKGNFDIEVDINTAE-IDNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMPDLAHELRTWQPQPD-PMEE 620 (756) Q Consensus 543 ~i~~d~~~~~~Dv~V~~g~a~-~~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~~~~~~l~~~~~q~~-p~~~ 620 (756) ++..+ .+.+.+..+.+. .+....+.+..+++.++...+|.. ++..+...+.+.+-.....++ .... 
T Consensus 418 ~~p~~----~v~~~~vs~l~~l~r~~~~~~l~~~~~~v~~~~~p~v--------ld~id~d~~~~~~a~~~Gv~~~~i~r 485 (543) T protein:vir:88 418 NLPQE----AVEPTVTTGAEALGRGQDLDKLTQFLNAVATVSQLNG--------DPDLNVNNIKLRLANAIGIDTAGLLL 485 (543) T ss_pred CCchh----ceeeeEEecHHHHHHHHHHHHHHHHHHHHHhccchhh--------hccCCHHHHHHHHHHHhCCChhhhcC Confidence 11111 223444344433 344444556666666654433322 222333333333322222211 1111 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_019423. 621 QLKQLAIQKAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARDMEKQKAQSQGNQNLQITKALTTPTKE 700 (756) Q Consensus 621 ~~~q~~~~~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~~~~~~q~~~~~~~~~~~a~~~~~~~ 700 (756) ..++.++.++|.+ .+++.++++.+ +..+ .+..+... .+++.+.+..-.- T Consensus 486 ~~~e~~~~~~q~~-----------~q~~~~~~~~~---------~~~~--~~~~~~~~---------~~~~~~~~~~~~~ 534 (543) T protein:vir:88 486 TEAEKAQAQSQEM-----------LKQGGLNAAAG---------IGSG--VAAQATAS---------PEAMESAMDTAGV 534 (543) T ss_pred CHHHHHHHHHHHH-----------HHHHHHHHHHH---------Hhhc--hhhhhccC---------hHHHHHHhhhcCC Confidence 1111111110000 00000000000 0000 00000000 0111111111111 Q ss_pred cCCchhhhc Q lcl|NC_019423. 701 GETTPNISA 709 (756) Q Consensus 701 ~~~~~~~~~ 709 (756) +.+|.+-+. T Consensus 535 ~~~p~~~~~ 543 (543) T protein:vir:88 535 QPGPIATQV 543 (543) T ss_pred CCCCCCCCC Confidence 112211122 No 44 >protein:vir:78696 Length: 542 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285446;genbank:gi:148724480;genbank:GeneID:5220167 Probab=99.92 E-value=5.7e-24 Score=148.03 Aligned_cols=505 Identities=11% Similarity=0.044 Sum_probs=271.6 Q ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCC--CCCCCCCcccCHHHHHHHHHHHHHHHHhhcC-CCC Q lcl|NC_019423. 22 WKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAKP--PKIKGRSQVQPRLVRRQAEWRYAPLSEPFLS-SSK 98 (756) Q Consensus 22 ~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~--~~~~grS~~v~~~v~~~~e~~~~~L~~~f~~-~~~ 98 (756) |+. 
+.++.++..++.+++.++.|++-.+|........+ ...+...+.++....+.++.+.+.|+.-+|+ +.+ T Consensus 1 mk~-----~a~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~ 75 (542) T protein:vir:78 1 MKG-----LAQARYSAMRADREDFLDMARRCAALTLPYLLTEDGHASGGRLQQPYQSLGSKGVNALSSKLMLSLFPIQTS 75 (542) T ss_pred Chh-----HHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCCCCc Confidence 322 34667999999999999999999999864322111 1112224678888999999999999999998 799 Q ss_pred EEEEecCCcc-------hHHHH-------HHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeeeeee Q lcl|NC_019423. 99 LFKLTPVTFE-------DELAA-------RQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIKTET 164 (756) Q Consensus 99 ~~~~~p~~~~-------D~~~A-------~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~ 164 (756) ||++.+-..+ |.+.. +..+..+.-.| ..++-+..++.++++++..|++++-+ + + T Consensus 76 WF~l~~~d~~l~~~~~~~~~~~~~v~~~L~~ve~~~~~~l-~~snf~~~~~~~~~~L~~~G~a~l~~--~-~-------- 143 (542) T protein:vir:78 76 FFKLQINDAEIASVPELTPEVRSEIDMNLSKMEKMVMQQI-AESSDRVQLTAAMKHLIVTGNVLVFA--G-K-------- 143 (542) T ss_pred cccccCCHHHHHhhccCChhhHHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHhhCeEEEEe--c-C-------- Confidence 9999985322 21111 11333443333 35667777889999999999986531 0 0 Q ss_pred eeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEEEech Q lcl|NC_019423. 165 PVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVEMLNP 244 (756) Q Consensus 165 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie~V~p 244 (756) + .++.++. T Consensus 144 --------------------------------~----------------------------------------~~~~~pl 151 (542) T protein:vir:78 144 --------------------------------K----------------------------------------TLKVYPL 151 (542) T ss_pred --------------------------------C----------------------------------------CceEEec Confidence 0 0223445 Q ss_pred hheEeCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceEEEEEEE Q lcl|NC_019423. 
245 NNVVIDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVAYEYW 324 (756) Q Consensus 245 ~~~~~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~E~w 324 (756) .+|++..++...++. ++++..+|..+|.+.++. +.+.. .. ... ..+....++.|++.. T Consensus 152 ~~y~v~~d~~G~vd~---v~r~~~~t~~ql~~~fg~-~~l~~---~~----~~~-----------~~~~~~~~~~v~~~v 209 (542) T protein:vir:78 152 DRYVIERDGDGNVIE---IITRELVDRSLLPAEFQK-QSLLE---GK----DSN-----------AVGEDGPKFGVAQGK 209 (542) T ss_pred ceeEEeeCCCCCeEE---EeeeeecCHHHHHHhhcc-ccCch---HH----Hhh-----------ccccCCCeEEEEEEe Confidence 677777777665553 788999999999877642 11110 00 000 001112344444443 Q ss_pred EEe-ecc-------CCceeEEEEEEEECCEEE--EecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHHHH Q lcl|NC_019423. 325 GFY-DIN-------DDGSLEPIVATWIGSTLI--RMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGATM 394 (756) Q Consensus 325 ~k~-d~~-------~~g~~~~~~~~~~g~~~L--~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~~~ 394 (756) +.. +.+ ..+...++ ..+.+..+ ...+++| ..+||++..|...++..||+|.+....+-.+.+|.+. T Consensus 210 ~pr~~~~~~~~~~~~~~~~s~~--~e~~g~~v~~~~~e~g~--~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~ 285 (542) T protein:vir:78 210 GGRNDAEVFTCCKLVDGQHRWH--QECDGKEIKGSRSSSPL--KHSPWLPLRFNVVDGESYGRGRVEEFFGDLSSLDALT 285 (542) T ss_pred ecccCCccccccccCCCeEEEE--EEecccccccccccccc--ccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHH Confidence 321 110 11222221 22333333 2345554 6699999999999999999999999999999999999 Q ss_pred HHHHHHHHhhcCCceEeeccccCccchhhhhccccccccccccccccccccccCC--CcchHHHHHHHHHHHHHHHHhch Q lcl|NC_019423. 395 RGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQGNPSQSIMEHKFP--ELPQSAIVMTQMQNQEAESLTGV 472 (756) Q Consensus 395 ~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~--~~~~~~~~~l~~~~~~~e~~tGv 472 (756) +..+....++.+|.++++.+.+..... ...+....+ ..+....+.+.+.. 
.--+.....++...+.+....-+ T Consensus 286 ~~~l~~~~~a~~pp~lv~~~g~~~~~~--~~~~~~g~i---v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~aFl~ 360 (542) T protein:vir:78 286 RSLIEGSAAAAKVVFMVSPSATTKPQS--LARAGTGAI---IQGRAEDVSVVQANKGADFRTVQEMIRDLSQRISDAFLI 360 (542) T ss_pred HHHHHHHHHHhcCceeeccccccchhh--cccCCCcee---ecCCccceeeeecccccchhHHHHHHHHHHHHHHHHhcc Confidence 999999999999999997765533221 112221111 11222334444322 12234556666666666654321 Q ss_pred hHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHhcC Q lcl|NC_019423. 473 KAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLA-KGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDLKG 551 (756) Q Consensus 473 ~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~-~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~~~ 551 (756) ...-++...||++|..+.+.....|..++.+|. +++.++.++.+.++.+..- ++--|++. T Consensus 361 -----~~~~d~~rvTAtEV~~r~~E~~~~LG~v~~rl~~E~L~Pli~R~~~il~r~g~------------lP~~p~~l-- 421 (542) T protein:vir:78 361 -----LNVRQSERTTATEVREVQMELDRQLSGIYGSLTVELLTPYLNRKLHLMQRSKQ------------LPSLPKGL-- 421 (542) T ss_pred -----cccCCcccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCC------------CCCCchhc-- Confidence 111233346999999999999999999999996 5889999999999887532 11122222 Q ss_pred cceEEEecccccH-HHHHHHHHHHHHHHhhccCCH-hH-----HHHHHHHHHhhcCChhHHHHhhhccCCCChhhhhHHH Q lcl|NC_019423. 552 NFDIEVDINTAEI-DNQKSQDLGFMVQTLGNTVDQ-SI-----TLSLVAKIAELKRMPDLAHELRTWQPQPDPMEEQLKQ 624 (756) Q Consensus 552 ~~Dv~V~~g~a~~-~~~~~q~l~~llq~~~~~~~~-~~-----~~~~l~~l~e~~~~~~~~~~l~~~~~q~~p~~~~~~q 624 (756) +++.+..+.+.. +....+.+...++.++..++| .+ .-.++..+++..|.|-. 
..++ .++ + T Consensus 422 -v~~~~~s~La~~~r~~~~~~l~~~~~~i~~~~~p~~l~~~id~d~~~~~~a~~~Gvp~~-~i~~------s~e-----~ 488 (542) T protein:vir:78 422 -VMPTVVAGLGGVGRGEDRAALIEFMQTVGQAMGPEALQQFIDPTEFLKRLAAASGIDTL-NLVK------SPE-----T 488 (542) T ss_pred -eeeeeechHHHHHHHHHHHHHHHHHHHHHHhcCChhHHhcCCHHHHHHHHHHHcCCCHh-hccC------CHH-----H Confidence 345554554433 223334444444544433322 11 12334445555555411 0111 111 1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCc Q lcl|NC_019423. 625 LAIQKAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARDMEKQKAQSQGNQNLQITKALTTPTKEGETT 704 (756) Q Consensus 625 ~~~~~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~~~~~~q~~~~~~~~~~~a~~~~~~~~~~~ 704 (756) .++++++ ++.+++++. .+.++ .+.+..+.. +. ..++. ....+..++.| T Consensus 489 ~~~~~~q---------~q~~~~~~a-l~~~a--------~~~a~~~~~-~~--~~~~~-----------~a~~~~~~~~~ 536 (542) T protein:vir:78 489 MANEAQQ---------AQQQQMTAS-LMGQA--------GQLAKSPIG-EK--MMQQI-----------NAPGQEAPAGP 536 (542) T ss_pred HHHHHHH---------HHHHHHHHH-HHHhh--------hhccccccc-cc--hhhhc-----------CCCCcCCCCCC Confidence 1111100 000000000 00000 000000000 00 01110 01112222222 Q ss_pred hhhhccCCC Q lcl|NC_019423. 705 PNISAAVGY 713 (756) Q Consensus 705 ~~~~~a~~~ 713 (756) . .+. -. T Consensus 537 ~--~~~-~~ 542 (542) T protein:vir:78 537 Q--TGE-DL 542 (542) T ss_pred c--ccc-cC Confidence 1 111 11 No 45 >protein:vir:96988 Length: 516 # NCBI annotation: 29 # Family: family:all:481 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654130;genbank:gi:108862014;genbank:GeneID:5075937 Probab=99.92 E-value=4.6e-23 Score=143.03 Aligned_cols=490 Identities=10% Similarity=0.027 Sum_probs=265.8 Q ss_pred CCcccCCCCCCCccccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCCCCCCcccCHHHHH Q lcl|NC_019423. 1 MEHQDTFKPLPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAKPPKIKGRSQVQPRLVRR 80 (756) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~grS~~v~~~v~~ 80 (756) |..+.+. 
...-.-..|++.++..++.+++.++.|++-.+|.......+....++..+.++..-.+ T Consensus 1 ~~~~~~~---------------~~~~~~~~l~~r~~~L~~~R~~~e~~w~e~a~~~lP~~~~~~~~~~~~~~~~dstg~~ 65 (516) T protein:vir:96 1 MKQSIDL---------------EYGGKRSKIPKLWEKFSNKRSSFLDRAKHYSKLTLPYLMNDKGDNETSQNGWQGVGAQ 65 (516) T ss_pred Ccchhhh---------------hhhhhHHHHHHHHHHHHHHhhHHHHHHHHHHHhhcccccCCCCCccccCCcccchHHH Confidence 3332222 1112235677889999999999999999999999865433333344555789999999 Q ss_pred HHHHHHHHHHHhhcC-CCCEEEEecCCcch-------HHH------HHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCc Q lcl|NC_019423. 81 QAEWRYAPLSEPFLS-SSKLFKLTPVTFED-------ELA------ARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGT 146 (756) Q Consensus 81 ~~e~~~~~L~~~f~~-~~~~~~~~p~~~~D-------~~~------A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~ 146 (756) .++.+.+.|+.-+|+ +.+||++.+-...+ .+. -++.+..+.-.| ..++-+..++.++++++..|+ T Consensus 66 a~~~LAa~l~~~ltpp~~~WF~L~~~~~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l-~~snf~~~~~~~~~~L~~~G~ 144 (516) T protein:vir:96 66 ATNHLANKLAQVLFPAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKEL-EQRQFRPAVVEAFKHLIVAGS 144 (516) T ss_pred HHHHHHHHHHhhhcCCCCcccccccChhHHhhccccCchhHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHhHCe Confidence 999999999999998 57999998742211 111 112333333333 346677778888999888888 Q ss_pred eEEEEeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeE Q lcl|NC_019423. 147 GIARIGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTE 226 (756) Q Consensus 147 gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~ 226 (756) +++-+ + + + T Consensus 145 a~l~~--d-~------~--------------------------------------------------------------- 152 (516) T protein:vir:96 145 CMLYK--P-S------K--------------------------------------------------------------- 152 (516) T ss_pred EeEEe--c-C------C--------------------------------------------------------------- Confidence 86542 0 0 0 Q ss_pred EEeeeeecCceeEEEechhheEeCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccc Q lcl|NC_019423. 
227 VEVEKALVNRPTVEMLNPNNVVIDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPS 306 (756) Q Consensus 227 ~~~~~~~~g~~~ie~V~p~~~~~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 306 (756) + .+..++..+|++..++.+.+++ ++++.+++..+|.+..... .+. .. .... T Consensus 153 --------~--~~~~~pl~~y~v~~d~~G~v~~---i~rr~~~~~~~l~~~~~~~--~~~--------~~--~~~~---- 203 (516) T protein:vir:96 153 --------G--AISAIPMHHYVVNRDTNGDLLD---IILLQEKALRTFDPATRAV--VEV--------GL--KGKK---- 203 (516) T ss_pred --------C--CEEEEEcCeEEEeeCCCCCeee---ehhhhHhhHHHHHHhhhhh--hhh--------hh--hhhh---- Confidence 0 0223445567776666555443 5667788888775543110 000 00 0000 Q ss_pred cccccccccceEEEEEEEEEeeccCCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHH Q lcl|NC_019423. 307 DFQFKDALRKKVVAYEYWGFYDINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDN 386 (756) Q Consensus 307 ~~~~~d~s~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~ 386 (756) ......|.||.|-.+ +.++...+ +.-.-|.+++..+ -|+...|||++..|...++..||.|.+....+- T Consensus 204 -----~~~~~~v~v~~~v~~---~~~~~~~~-~~~~d~~~~~~es--~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D 272 (516) T protein:vir:96 204 -----CKEDDSVKLYTHAKY---LGDGFWEL-KQSADDIPVGKVS--KIKSEKLPFIPLTWKRSYGEDWGRPLAEDYSGD 272 (516) T ss_pred -----cCCCCceEEEEeeee---eCCceeEE-EEEeCceeecccc--ccccccCCeeeeeeeecCCCCcccchHHHhhHH Confidence 011124656554333 33444322 2223344554444 444467999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhhhhccccccccccccccccccccccCCCc--chHHHHHHHHHHH Q lcl|NC_019423. 387 QAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQGNPSQSIMEHKFPEL--PQSAIVMTQMQNQ 464 (756) Q Consensus 387 Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~--~~~~~~~l~~~~~ 464 (756) .+.+|.+.+..+.+...+.++.++++.+.+..... ...+.... 
+..+....+.+.+...- -+.+...++...+ T Consensus 273 ~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~--l~~~~~g~---i~~g~~~~v~~~q~~~~~d~~~~~~~i~~~~~ 347 (516) T protein:vir:96 273 LFVIQFLSEAVARGAALMADIKYLIRPGAQTDVDH--FVNSGTGE---VVTGVEEDIHIVQLGKYADLTPISAVLEVYTR 347 (516) T ss_pred HHHHHHHHHHHHHHHHHhcCCccccCcccccchhh--hccCCCce---eecCCcccceeeecCcccchhHHHHHHHHHHH Confidence 99999999999999999999999997766533221 22222211 11122233455443321 2344555666666 Q ss_pred HHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhCCCCcEEEEecCceee Q lcl|NC_019423. 465 EAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAK-GMADIGTKICAMNAVFLSEKEVVRITNEQYVE 543 (756) Q Consensus 465 ~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~-~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~ 543 (756) .+.... ..+ +...-++...||++|..+.+-....|..++-+|.. ++.++..+++..+. +.. T Consensus 348 rI~~af-~~~--~l~~r~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~l~~~~-----p~l---------- 409 (516) T protein:vir:96 348 RIGVVF-MME--TMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFATTMQSPVAMWGLLEAG-----ESF---------- 409 (516) T ss_pred HHHHHH-hhh--hhccCCCccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhcC-----CCC---------- Confidence 665542 111 11112333479999999988888889999999875 77888777654432 110 Q ss_pred cCHhHhcCcceEEEecccccH-HHHHHHHHHHHHHHhhcc--CCHhHH-----HHHHHHHHhhcCChhHHHHhhhccCCC Q lcl|NC_019423. 544 IKREDLKGNFDIEVDINTAEI-DNQKSQDLGFMVQTLGNT--VDQSIT-----LSLVAKIAELKRMPDLAHELRTWQPQP 615 (756) Q Consensus 544 i~~d~~~~~~Dv~V~~g~a~~-~~~~~q~l~~llq~~~~~--~~~~~~-----~~~l~~l~e~~~~~~~~~~l~~~~~q~ 615 (756) | .+..++.+..+.+.. +......+...++.++.. ++|.+. 
-.++..+++..|.|- ..++ T Consensus 410 --p---~~~v~~~~vs~l~~l~r~~~~~~i~~~~~~i~~~~~~~p~v~d~id~d~~~~~~a~~~Gvp~--~~ir------ 476 (516) T protein:vir:96 410 --T---SDLVDPVIITGIEALGRMAELDKLANFAQYMSLPLQWPEPVLAAVKWPDYMDWVRGQISAEL--PFLK------ 476 (516) T ss_pred --c---cccccceeechHHHHHHHHHHHHHHHHHHHHHHHhcCChhHHhcCCHHHHHHHHHHHhCCCc--cccC------ Confidence 0 112233333343332 333333344444444322 333321 234445555555551 1222 Q ss_pred ChhhhhHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 616 DPMEEQLKQLAIQKAQLENEELQSK-IALNNAKAKEAASSGDLKDLDYLEQESGTKHA 672 (756) Q Consensus 616 ~p~~~~~~q~~~~~aq~e~~~~qa~-a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~ 672 (756) ++.+. +++ ++++++.+..++. .+..++.. .+++++ .|.+ T Consensus 477 s~eev--~~~--~~~~~~~q~~~~~a~~~~~~~~--~~~~~~------------~~~~ 516 (516) T protein:vir:96 477 SAEEM--AQE--QEAQMQAQQAQMLEEGVAKAVP--GVIQQE------------LKEA 516 (516) T ss_pred CHHHH--HHH--HHHHHHHHHHHHHHHHhhhhhh--HHhhcc------------cccC Confidence 11111 111 1111111000000 00000110 000000 0100 No 46 >protein:vir:103330 Length: 517 # NCBI annotation: head portal-like protein # Family: family:all:481 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039666;genbank:gi:125999995;genbank:GeneID:4818406 Probab=99.91 E-value=4.2e-22 Score=137.78 Aligned_cols=492 Identities=10% Similarity=0.019 Sum_probs=275.6 Q ss_pred cCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCCCCCCcccCHHHHHHHHHHHHHHHHhhcC-CCC Q lcl|NC_019423. 
20 TDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAKPPKIKGRSQVQPRLVRRQAEWRYAPLSEPFLS-SSK 98 (756) Q Consensus 20 ~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~grS~~v~~~v~~~~e~~~~~L~~~f~~-~~~ 98 (756) -+|.=....+.|++.++..++.+++.++.|++-.+|.......+.......+++++....+.++.+.+.|+.-+|+ +.+ T Consensus 1 ~~~~~~~e~~~l~~r~~~Lk~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~ 80 (517) T protein:vir:10 1 MDMRFAGNKSKIPKLYEQLVGKRSPFLSRAENYSRFTLPYLMADVNDDLSSQNAWQDDGASATNFLSNKLSQVLFPAQRS 80 (517) T ss_pred CcccccccHHHHHHHHHHHHHhhhHHHHHHHHHHHHhccccccCCCCCccccccccchHHHHHHHHHHHHHHhhcCCCCc Confidence 3444334456788889999999999999999999999754221222223345788999999999999999999998 579 Q ss_pred EEEEecCCcchHH---------HHH----HHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeeeeeee Q lcl|NC_019423. 99 LFKLTPVTFEDEL---------AAR----QNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIKTETP 165 (756) Q Consensus 99 ~~~~~p~~~~D~~---------~A~----q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~ 165 (756) ||++.+...+..+ .++ +.+..+.- ....++-+..++.++++++..|++++-+ + + T Consensus 81 WF~l~~~~~~l~~~~~~~~~~~~v~~~L~~ve~~~~~-~l~~snf~~~~~~~~~~L~~~G~a~ly~--~-~--------- 147 (517) T protein:vir:10 81 FFRIDLTPEGIKQLDNEAMTQSTAQKLLSDVEKAAML-YGESLQFRPAVVEAFKHLIVTGNVMMYH--P-D--------- 147 (517) T ss_pred cccccCCHHHHHhhccCcchHHHHHHHHHHHHHHHHH-HHHhcCcHHHHHHHHHHHHhHCeEEEEE--e-C--------- Confidence 9999985432111 111 12222222 2356677777888888888888886431 0 0 Q ss_pred eeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEEEechh Q lcl|NC_019423. 166 VFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVEMLNPN 245 (756) Q Consensus 166 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie~V~p~ 245 (756) +...+..++.. 
T Consensus 148 ---------------------------------------------------------------------~~~~~~~~pl~ 158 (517) T protein:vir:10 148 ---------------------------------------------------------------------KTSPIQAVPLH 158 (517) T ss_pred ---------------------------------------------------------------------CCCcEEEEEcC Confidence 00123345556 Q ss_pred heEeCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceEEEEEEEE Q lcl|NC_019423. 246 NVVIDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVAYEYWG 325 (756) Q Consensus 246 ~~~~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~E~w~ 325 (756) +|++..++...+++ +++|.++|..+|.+.++.. + .+. ... ... ....+|.||.+-. T Consensus 159 ~y~v~~d~~G~v~~---ivrr~~~~~~~l~~~~~~~--~-----~~~-----~~~--------~~~-~~~~~v~v~~~v~ 214 (517) T protein:vir:10 159 HYCVRRDNNGTVLD---IVFLQEKALETFEPSIRMA--I-----QAS-----RKG--------KQY-KDKDNVKLYTHAK 214 (517) T ss_pred eEEEeeCCCcCeEE---EEeeeeccHHHHHHHhhhh--c-----chh-----hhh--------hcc-CCcCceEEEEEEE Confidence 78887777665553 5778899998887654421 0 000 000 000 1123577777644 Q ss_pred EeeccCCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_019423. 326 FYDINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGATMRGMIDLLGRSA 405 (756) Q Consensus 326 k~d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~ 405 (756) +. .+|...+ ...+++..+ ..++-|+...+||++..|...++..||.|.+....+-.+.+|++.+..+.....+. T Consensus 215 ~~---~~~~~~~--~~~~d~~~~-~~~s~y~~~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~~~~~~~a~ 288 (517) T protein:vir:10 215 RT---KDGKYLI--RQSADDVPV-GKESTVTEDKSPFLILTWKRSYGEDYGRGMAEDHAGAFFVIQFLSEALARGMALMA 288 (517) T ss_pred Ee---CCCceEE--EEEeCceee-ccccccccccCCeeeeeeeecCCCCcccchHHHhHHHHHHHHHHHHHHHHHHHHhc Confidence 32 2343222 222344443 34556777889999999999999999999999999999999999999999999999 Q ss_pred CCceEeeccccCccchhhhhccccccccccccccccccccccCCC--cchHHHHHHHHHHHHHHHHhchhHHhcCCCccc Q lcl|NC_019423. 
406 NGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQGNPSQSIMEHKFPE--LPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSA 483 (756) Q Consensus 406 ~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~--~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a 483 (756) ++.++++.+.+..... ...+....+ ..+....+.+.+... -.+.....++...+.+....=+. . ++.. ++ T Consensus 289 ~~~~lv~~~~~~~~~~--l~~~~~g~~---~~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~-~-l~~~-~~ 360 (517) T protein:vir:10 289 DVKYLVKPGSYTDINQ--FVEGGSGAV---LHGVEGDIHIVQLGKYADYTPIQAVLNDYRQRIGRVFMME-A-MTRR-DA 360 (517) T ss_pred cCCcccCcccccchhh--ccCCCcccc---ccCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhh-h-hhcc-CC Confidence 9999998766543221 122222111 112223344443222 23445566777777776654221 1 2222 22 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHhcCcceEEEecccc Q lcl|NC_019423. 484 YGDVAAGIRGALDAASKREMAILRRLAK-GMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDLKGNFDIEVDINTA 562 (756) Q Consensus 484 ~~~tA~~i~~~~~aa~~~l~~~~~n~~~-~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~~~~~Dv~V~~g~a 562 (756) ...||++|..+.+-....|..++-+|.. ++.++.++++..+..-+..+ ...+.+..+.+ T Consensus 361 ~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~~l~~~l~~~--------------------~v~~~~~s~la 420 (517) T protein:vir:10 361 ERVTAYEIQRDAMLVEQSLGGVYSLFATTFQGPLARWFMNGISSILTSK--------------------NVSPTILTGIE 420 (517) T ss_pred ccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHhhhhcCCC--------------------CccceeeccHH Confidence 2479999999988888889999999875 88888888887665433221 11222333333 Q ss_pred cH-HHHHHHHHHHHHHHhhc--cCCHhH-----HHHHHHHHHhhcCChhHHHHhhhccCCCChhhhhHHHHHHHHHHHHH Q lcl|NC_019423. 563 EI-DNQKSQDLGFMVQTLGN--TVDQSI-----TLSLVAKIAELKRMPDLAHELRTWQPQPDPMEEQLKQLAIQKAQLEN 634 (756) Q Consensus 563 ~~-~~~~~q~l~~llq~~~~--~~~~~~-----~~~~l~~l~e~~~~~~~~~~l~~~~~q~~p~~~~~~q~~~~~aq~e~ 634 (756) .. +....+.+..+++.++. .+++.+ .-.++..+++..|.|. ..++. +.+.. +..+++.+++. 
T Consensus 421 ~l~r~~~~~~i~~~~~~i~~~a~~~~~~~~~id~d~~~~~~a~~~Gvp~--~~irs------~~ev~--~~~~~~~~~~~ 490 (517) T protein:vir:10 421 ALGRMAELDKLGTFNGYVSMTAQWPEPLQQAIKWPDFTDWVQGQISANF--PFFKT------QDELN--AEAQAQQEQEA 490 (517) T ss_pred HHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHHhCCCh--hhcCC------HHHHH--HHHHHHHHHHH Confidence 32 22333334444444432 133332 2244556666666652 23331 11110 00000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_019423. 635 EELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARDMEKQKAQSQGNQNLQITKALTTPTKEG 701 (756) Q Consensus 635 ~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~~~~~~q~~~~~~~~~~~a~~~~~~~~ 701 (756) .+..+++. ...+.+ .+..-+. +++... T Consensus 491 ~~~~~~~a-----g~~~~~--------------------~~~~~~~---------------~~~~~~ 517 (517) T protein:vir:10 491 TKYAAEQA-----GKAIPD--------------------MVKNGQI---------------NPQGGQ 517 (517) T ss_pred HHHHHHHH-----HHHHHH--------------------HHhCCCC---------------CCCCCC Confidence 00000000 000000 0000000 000000 No 47 >protein:vir:6322 Length: 510 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877469;genbank:gi:33300841;uniprot:Q7Y2D5;genbank:GeneID:1482611 Probab=99.91 E-value=5.4e-22 Score=137.19 Aligned_cols=482 Identities=11% Similarity=-0.012 Sum_probs=260.9 Q ss_pred HHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCCCCC---CcccCHHHHHHHHHHHHHHHHhhcC-CCCEEEE Q lcl|NC_019423. 27 SIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAKPPKIKGR---SQVQPRLVRRQAEWRYAPLSEPFLS-SSKLFKL 102 (756) Q Consensus 27 ~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~gr---S~~v~~~v~~~~e~~~~~L~~~f~~-~~~~~~~ 102 (756) .=+.+++.++..+ +++.++.|++-.+|...... 
.++...++ .+.++....+.++.+.+.|+.-+|+ +.+||++ T Consensus 1 mk~~~~~~~~~lk--R~~~e~~w~e~a~~tlP~~~-~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l 77 (510) T protein:vir:63 1 MKTTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLM-VDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRS 77 (510) T ss_pred ChhHHHHHHHHHh--ccchHHHHHHHHHhhccccC-CCCCCccccccCCCccchHHHHHHHHHHHHHhhhcCCCCccccc Confidence 2223445555443 67788888888888874321 11111111 3478888999999999999999998 5789999 Q ss_pred ecCCcch-------HHHH--H----HHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeeeeeeeeeec Q lcl|NC_019423. 103 TPVTFED-------ELAA--R----QNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIKTETPVFQL 169 (756) Q Consensus 103 ~p~~~~D-------~~~A--~----q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~ 169 (756) .+-...+ ...+ + +.+..+.-.| ..++-+..++.++++++..|++++-+ + + T Consensus 78 ~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l-~~snf~~~~~~~~~~Li~~G~a~l~~--~-~------------- 140 (510) T protein:vir:63 78 ELTDAIRREADSRDTDITEVTAALARVDRKATQRL-FQNASLAVLTQVIKLLIVTGNALLYR--D-S------------- 140 (510) T ss_pred CCChHHhhcccccchhHHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHhhCeEEEEE--c-C------------- Confidence 8753221 1111 1 1223333232 45667777888888888888875552 0 0 Q ss_pred CCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEEEechhheEe Q lcl|NC_019423. 170 YPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVEMLNPNNVVI 249 (756) Q Consensus 170 ~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie~V~p~~~~~ 249 (756) +..++..++..+|++ T Consensus 141 -----------------------------------------------------------------~~~~~~~~pl~~y~v 155 (510) T protein:vir:63 141 -----------------------------------------------------------------DAATVVAWSLRSYAV 155 (510) T ss_pred -----------------------------------------------------------------CCcEEEEEEcceeEE Confidence 001234556677888 Q ss_pred CCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceEEEEEEEEEeec Q lcl|NC_019423. 
250 DPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVAYEYWGFYDI 329 (756) Q Consensus 250 Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~E~w~k~d~ 329 (756) ..++...++. ++++.++|..+|.+..... .+. .. .....-+.|.||.+.++.+- T Consensus 156 ~~d~~G~vd~---i~rr~~~t~~~l~e~~~~~-~~~-----------~~-----------~~~~~~~~v~v~~~V~~~~~ 209 (510) T protein:vir:63 156 RRDATGRWMD---IVLKQRYKSKDLDEEYKQD-LMR-----------AG-----------RNLSGSGSVDLYTHVQRKKG 209 (510) T ss_pred eeCCCcCeeE---EEeeeeccHHHHhHHhhhh-hhc-----------cc-----------cccCCCcceEEEEEEEeecC Confidence 7777665553 6788999988875432210 000 00 00111235778877765432 Q ss_pred cCCceeEEEEEEE-EC-CEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_019423. 330 NDDGSLEPIVATW-IG-STLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGATMRGMIDLLGRSANG 407 (756) Q Consensus 330 ~~~g~~~~~~~~~-~g-~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~ 407 (756) +-...+.+.+ ++ .++.. ++-|+..++||++..|...++..||.|++....+-.+.+|++.+..+.....+.++ T Consensus 210 ---~~~~~~sv~~e~dg~~~~~--~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a~~~ 284 (510) T protein:vir:63 210 ---TAMEYAELYHEIDGVRVGK--EGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEV 284 (510) T ss_pred ---CCceEEEEEEEecCceecc--ccccccccCceeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccC Confidence 1122222222 34 44443 44566688999999999999999999999999999999999999999999999999 Q ss_pred ceEeeccccCccchhhhhccccccccccccccccccccccCCC--cchHHHHHHHHHHHHHHHHhchhHHhcCCCccccc Q lcl|NC_019423. 408 QRGYPKGMLDTLNRRRYDDGQDYEYNPMQGNPSQSIMEHKFPE--LPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYG 485 (756) Q Consensus 408 ~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~--~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~ 485 (756) .++++.+.+...+. ...+.... +..+....+.+.+... --+.+...++...+.+.... -.. ....++.. 
T Consensus 285 ~~lv~p~g~~~~~~--~~~~~~g~---~v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af---~~~-l~~~~~~r 355 (510) T protein:vir:63 285 LNLVDEAKGAVVDD--YQDAEMGD---YVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAF---MYG-ANQRDAER 355 (510) T ss_pred CcccCcccccchhh--hccCCCce---eecCCcccceeeecCcccchHHHHHHHHHHHHHHHHHH---Hhh-cccCCCCC Confidence 99998766533222 12222111 1112223355544322 22334556666666665542 111 11223334 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHhcCcceEEEecccccH Q lcl|NC_019423. 486 DVAAGIRGALDAASKREMAILRRLAK-GMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDLKGNFDIEVDINTAEI 564 (756) Q Consensus 486 ~tA~~i~~~~~aa~~~l~~~~~n~~~-~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~~~~~Dv~V~~g~a~~ 564 (756) .||++|..+.+.....|..++-+|.. ++.++.++.+.++.... ..++.++..+.. + |..-.+.. T Consensus 356 vTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g------------l~p~p~~~~~~~--~-v~~is~La 420 (510) T protein:vir:63 356 VTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDAL------------LQGLITKQHKPA--I-ETGLPALS 420 (510) T ss_pred cCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcc------------CCCCCchhcccc--e-ecchhHHH Confidence 69999999998888999999999875 88999999999886532 123333433321 1 22222334 Q ss_pred HHHHHHHHHHHHHHhhccCCH-----hH-HHHHHHHHHhhcCChhHHHHhhhccCCCChhhhhHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 565 DNQKSQDLGFMVQTLGNTVDQ-----SI-TLSLVAKIAELKRMPDLAHELRTWQPQPDPMEEQLKQLAIQKAQLENEELQ 638 (756) Q Consensus 565 ~~~~~q~l~~llq~~~~~~~~-----~~-~~~~l~~l~e~~~~~~~~~~l~~~~~q~~p~~~~~~q~~~~~aq~e~~~~q 638 (756) ..++.+.+..+++.++...++ .+ .-+++..+++..|.+ ....++ ++.+ .+++.+++.++.+++.+ T Consensus 421 raq~~~~l~~~~q~l~~~~~~aq~~~~id~d~~~~~~a~~~Gv~-p~~ivr------s~ee--v~a~~~~~~qq~~~~~~ 491 (510) T protein:vir:63 421 RSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVD-TSQFYK------SADE--LQAEAEQQRQQAAQAQA 491 (510) T ss_pred HHHHHHHHHHHHHHHHHhcCchhhhccCCHHHHHHHHHHHhCCC-hhHhcC------CHHH--HHHHHHHHHHHHHHHHH Confidence 444555555555544322211 11 122334444444441 011111 1111 11111111111100000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 
639 SKIALNNAKAKEAASSGDLKDLDYLEQESGT 669 (756) Q Consensus 639 a~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~ 669 (756) +++.+++ . +. +.....+++ T Consensus 492 ~~~~~~~-~---------a~--~~~~~~~g~ 510 (510) T protein:vir:63 492 AQETLLE-G---------AS--DMTNALAGV 510 (510) T ss_pred HHHHHHH-H---------HH--hhcccccCC Confidence 0000000 0 00 001111222 No 48 >protein:vir:7017 Length: 515 # NCBI annotation: head portal protein # Family: family:all:481 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853590;genbank:gi:31711672;genbank:GeneID:1481798 Probab=99.91 E-value=9.7e-22 Score=135.78 Aligned_cols=495 Identities=10% Similarity=0.020 Sum_probs=265.6 Q ss_pred CCCccccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCCCCCCcccCHHHHHHHHHHHHHH Q lcl|NC_019423. 10 LPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAKPPKIKGRSQVQPRLVRRQAEWRYAPL 89 (756) Q Consensus 10 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~grS~~v~~~v~~~~e~~~~~L 89 (756) ++|+. |---=..+.|++.++..++.+++.++.|++-.+|.......+.....+..+.++....+.++.+.+.| T Consensus 1 ~~~~~-------~~~~~~~~~l~~r~~~Lk~~R~~~e~~w~e~~~~tlP~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l 73 (515) T protein:vir:70 1 MQDTI-------LEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGDNETSQNGWQGVGAQATNHLANKL 73 (515) T ss_pred Ccchh-------hhhcCCHHHHHHHHHHHHHhhhHHHHHHHHHHHHhcccccCCCCCcccccccccchHHHHHHHHHHHH Confidence 22222 21111245677889999999999999999999999864332233333444588999999999999999 Q ss_pred HHhhcC-CCCEEEEecCCcch-------HHHHH------HHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeee Q lcl|NC_019423. 90 SEPFLS-SSKLFKLTPVTFED-------ELAAR------QNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWER 155 (756) Q Consensus 90 ~~~f~~-~~~~~~~~p~~~~D-------~~~A~------q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~ 155 (756) +.-+|+ +.+||++.+...++ .+.+. 
..+..+.-.| ..++-+..++.++++++..|++++-+ + T Consensus 74 ~~~ltpp~~~WF~l~~~d~~~~~l~~~~~~~~~v~~~l~~ve~~~~~~l-~~snf~~~~~~~~~~L~~~G~a~l~~--d- 149 (515) T protein:vir:70 74 AQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKAL-EQRQFRPAIVEVFKHLIVAGNCLLYK--P- 149 (515) T ss_pred HHhhcCCCCcccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHH-HhcCchHHHHHHHHHHHhHCeEEEEE--e- Confidence 999998 57999998743322 12221 1223333222 35667777888888888888886542 1 Q ss_pred eeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecC Q lcl|NC_019423. 156 KTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVN 235 (756) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g 235 (756) + + + T Consensus 150 ~------~-----------------------------------------------------------------------~ 152 (515) T protein:vir:70 150 S------K-----------------------------------------------------------------------G 152 (515) T ss_pred C------C-----------------------------------------------------------------------C Confidence 0 0 0 Q ss_pred ceeEEEechhheEeCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhcccccccccccccc Q lcl|NC_019423. 236 RPTVEMLNPNNVVIDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALR 315 (756) Q Consensus 236 ~~~ie~V~p~~~~~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~ 315 (756) . +..++..+|++..++...++. ++++..+|..+|.+.++... +. .... . ..+. . T Consensus 153 ~--~~~~pl~~y~v~~d~~G~v~~---i~rr~~~t~~~l~~~f~~~~-~~-------~~~~--~----------~~~~-~ 206 (515) T protein:vir:70 153 A--MSAVPMHHYVVNRDTNGDLMD---VILLQEKALRTFDPATRMAI-EV-------GMKG--K----------KCKE-D 206 (515) T ss_pred C--eEEEEcCeEEEeeCCCcCeeE---EEeeeeccHHHHHHhhhhhh-hh-------hhhh--h----------hcCC-C Confidence 0 233455677777777665553 67889999999877654210 00 0000 0 0011 1 Q ss_pred ceEEEEEEEEEeeccCCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHHHHH Q lcl|NC_019423. 
316 KKVVAYEYWGFYDINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGATMR 395 (756) Q Consensus 316 ~~V~v~E~w~k~d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~~~~ 395 (756) +.|.||.+- ...+++...++.. +++..+ ..++-|+..+|||++..|...++..||.|.+....+-.+.+|.+.+ T Consensus 207 ~~v~i~~~v---~~~~~~~~~~~~e--~d~~~~-~~es~y~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~ 280 (515) T protein:vir:70 207 DNVKLYTHA---QYAGEGFWKINQS--ADDIPV-GKESRIKSEKLPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSE 280 (515) T ss_pred CceEEEEEE---EecCCCceEEEEe--cCceee-ccccccccccCCceeeeeeecCCCCcccchHHHhhHHHHHHHHHHH Confidence 246565443 2334554433222 234333 3455666788999999999999999999999999999999999999 Q ss_pred HHHHHHHhhcCCceEeeccccCccchhhhhccccccccccccccccccccccCCC--cchHHHHHHHHHHHHHHHHhchh Q lcl|NC_019423. 396 GMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQGNPSQSIMEHKFPE--LPQSAIVMTQMQNQEAESLTGVK 473 (756) Q Consensus 396 ~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~--~~~~~~~~l~~~~~~~e~~tGv~ 473 (756) ..+.....+.+|.++++.+.+..... ...+... .+..+....+.+++... --+.+...++...+.+....=+. T Consensus 281 ~~l~~~~~a~~p~~lv~~~g~~~~~~--l~~~~~g---~iv~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~ 355 (515) T protein:vir:70 281 AMARGAALMADIKYLIRPGSQTDVDH--FVNSGTG---EVITGVAEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMME 355 (515) T ss_pred HHHHHHHHhcCCCeeeCcccccchhh--ccccCCc---eeecCCcccceeeecCcccchhHHHHHHHHHHHHHHHHHhhh Confidence 99999999999999998776643222 2222221 11222233455544332 12344455666666665543121 Q ss_pred HHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHhcCc Q lcl|NC_019423. 474 AFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAK-GMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDLKGN 552 (756) Q Consensus 474 ~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~-~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~~~~ 552 (756) ... ..++...||++|..+.+-....|..++-+|.. ++.++..+++.-. .+.-|... 
T Consensus 356 ~l~---~rd~~rvTAtEV~~r~~E~~~~LGpv~srL~~Ell~Pli~r~~~~~-----------------~p~~P~~~--- 412 (515) T protein:vir:70 356 TMT---RRDAERVTAVEIQRDALEIEQNMGGVYSLFAMTMQTPIAMWGLQEA-----------------GDSFTSEL--- 412 (515) T ss_pred hhh---ccCCccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHhh-----------------CCCCChhh--- Confidence 111 11223479999999988888889999998875 7777755432111 11122222 Q ss_pred ceEEEeccccc-HHHHHHHHHHHHHHHhhc--cCCHhHHHHHHHHHHhhcCChhHHHHhhhccCCCChhhhhHHHHHHHH Q lcl|NC_019423. 553 FDIEVDINTAE-IDNQKSQDLGFMVQTLGN--TVDQSITLSLVAKIAELKRMPDLAHELRTWQPQPDPMEEQLKQLAIQK 629 (756) Q Consensus 553 ~Dv~V~~g~a~-~~~~~~q~l~~llq~~~~--~~~~~~~~~~l~~l~e~~~~~~~~~~l~~~~~q~~p~~~~~~q~~~~~ 629 (756) .++.+..+.+. ...+..+.+..+++.++. .++|. +++..+.....+.+-.....|.......++.++.. T Consensus 413 v~~~~vs~l~~L~r~q~~~~i~~~~q~i~~~~~~~p~--------~~~~id~d~~~~~~a~~~g~p~~~~rs~eev~~~r 484 (515) T protein:vir:70 413 VDPVIVTGIEALGRMAELDKLANFAQYMSLPQTWPEP--------AQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEM 484 (515) T ss_pred cccceehhHHHHHHHHHHHHHHHHHHHHHHHhccChh--------HHhhCCHHHHHHHHHHHhCCCccccCCHHHHHHHH Confidence 22333334333 233334444444444431 22332 22222333322222222222211111111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 630 AQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHA 672 (756) Q Consensus 630 aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~ 672 (756) ++++.+++++.....-.++...- ....+||. 
T Consensus 485 ~q~~~~~~~~~~~~~~~~a~~~~------------~~~~~~~~ 515 (515) T protein:vir:70 485 AQQAQAQQEAMLNEGVAKAVPGV------------IQQEMKEG 515 (515) T ss_pred HHHHHHHHHHHHHHhhhhhcccc------------hhhhhccC Confidence 11000000000000000000000 00011111 No 49 >protein:vir:78942 Length: 510 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522822;genbank:gi:158345057;genbank:GeneID:5687432 Probab=99.91 E-value=1.1e-21 Score=135.59 Aligned_cols=483 Identities=11% Similarity=-0.019 Sum_probs=260.1 Q ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCCCCC---CcccCHHHHHHHHHHHHHHHHhhcC-CC Q lcl|NC_019423. 22 WKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAKPPKIKGR---SQVQPRLVRRQAEWRYAPLSEPFLS-SS 97 (756) Q Consensus 22 ~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~gr---S~~v~~~v~~~~e~~~~~L~~~f~~-~~ 97 (756) |+ +.+++.++..+ +++.++.|++-.+|...... .++...++ -+.++....+.++.+.+.|+.-+|+ +. T Consensus 1 mk-----~~~~~~~~~lk--r~~~e~~w~e~a~~tlP~~~-~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~ 72 (510) T protein:vir:78 1 MK-----STAAMLWEKLR--DGSVEQRAIEFAKTTLPYLM-VDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGI 72 (510) T ss_pred Ch-----hHHHHHHHHHh--ccchHHHHHHHHHhhccccc-cCCCCcccccccCcccchHHHHHHHHHHHHHHhhcCCCC Confidence 22 22334444443 67788888888888875322 11111111 2378888899999999999999998 57 Q ss_pred CEEEEecCCcchH-------HHH--H----HHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeeeeee Q lcl|NC_019423. 98 KLFKLTPVTFEDE-------LAA--R----QNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIKTET 164 (756) Q Consensus 98 ~~~~~~p~~~~D~-------~~A--~----q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~ 164 (756) +||++.+-..... +.+ + ..+..+.-. 
...++-+..++.++++++..|++++-+ + + T Consensus 73 ~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~-l~~snf~~~~~~~~~~L~~~G~a~l~~--~-~-------- 140 (510) T protein:vir:78 73 PFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQR-LFQNASLAVLTQVIKLLIVTGNALLYR--N-S-------- 140 (510) T ss_pred cccccCCChHHhhhcccCcchHHHHHHHHHHHHHHHHHH-HHhcCcHHHHHHHHHHHHhhCeEEEEE--e-C-------- Confidence 8999987533221 111 1 122222222 245566677778888877777765421 0 0 Q ss_pred eeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEEEech Q lcl|NC_019423. 165 PVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVEMLNP 244 (756) Q Consensus 165 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie~V~p 244 (756) .+ -++..++. T Consensus 141 ---------------------------------------------------------------------~~-~~~~~~pl 150 (510) T protein:vir:78 141 ---------------------------------------------------------------------DE-ATVVAWSL 150 (510) T ss_pred ---------------------------------------------------------------------CC-CeEEEEEc Confidence 00 01344556 Q ss_pred hheEeCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceEEEEEEE Q lcl|NC_019423. 245 NNVVIDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVAYEYW 324 (756) Q Consensus 245 ~~~~~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~E~w 324 (756) .+|++..++...++. ++++..+|..+|.+.++.. .+.. . ......++|.||.+. T Consensus 151 ~~y~v~~d~~G~vd~---i~rr~~~t~~~l~~~~~~~-~~~~-----------~-----------~~~~~~~~v~v~~~V 204 (510) T protein:vir:78 151 RSYAVRRDATGRWMD---IVLKQRYKSKDLDDVYKQD-LMRA-----------G-----------RNLSGSGSVDLYTHV 204 (510) T ss_pred ceeEEeeCCCcCeeE---EEeeeeccHHHHHHHhhHH-hhhh-----------h-----------hccCCCceEEEEEEE Confidence 678887777665553 6788999999887655321 0000 0 001122368888888 Q ss_pred EEeeccCCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019423. 
325 GFYDINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGATMRGMIDLLGRS 404 (756) Q Consensus 325 ~k~d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~ 404 (756) ++.+...-.+..+ +.-.-|.+++ .++-|+..++||++..|...++..||.|++....+-.+.+|++.+..+.....+ T Consensus 205 ~~~~~~~~~~~sv-~~e~dg~~i~--~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a 281 (510) T protein:vir:78 205 QRRKGTAMDYAEM-YHEIDGVRVG--ETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELES 281 (510) T ss_pred EeecCCCCcEEEE-EEEecCeeec--cccccccccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 7654211111111 1112244443 445566788999999999999999999999999999999999999999999999 Q ss_pred cCCceEeeccccCccchhhhhccccccccccccccccccccccCCC--cchHHHHHHHHHHHHHHHHhchhHHhcCCCcc Q lcl|NC_019423. 405 ANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQGNPSQSIMEHKFPE--LPQSAIVMTQMQNQEAESLTGVKAFSGGVTGS 482 (756) Q Consensus 405 ~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~--~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~ 482 (756) .++.++++.+.+...+. ...+....+ ..+....+.+.+... --+.....++...+.+.... -.. ....+ T Consensus 282 ~~~~~lv~p~g~~~~~~--l~~~~~g~~---v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~aF---~~~-l~~~~ 352 (510) T protein:vir:78 282 LEVLNLVDEAKGAVVDD--YQDAEMGDY---VPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAF---MYG-ANQRD 352 (510) T ss_pred hcCCcccCCccccchhh--hccCCCcee---ecCCcccccccccCcccchHHHHHHHHHHHHHHHHHH---hhc-cccCC Confidence 99999998766533222 122221111 112223355544322 22344556666666666542 111 11123 Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHhcCcceEEEeccc Q lcl|NC_019423. 483 AYGDVAAGIRGALDAASKREMAILRRLAK-GMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDLKGNFDIEVDINT 561 (756) Q Consensus 483 a~~~tA~~i~~~~~aa~~~l~~~~~n~~~-~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~~~~~Dv~V~~g~ 561 (756) +...||++|..+.+.....|..++-+|.. ++.++.++.+.++.... ..++.++..+. ..|..-. 
T Consensus 353 ~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g------------l~p~p~~~~~~---~~v~~is 417 (510) T protein:vir:78 353 AERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDAL------------LQGLITKQHKP---AIETGLP 417 (510) T ss_pred CCCcCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcc------------CCCCCcccccc---eeeeccc Confidence 33469999999998889999999999875 88999999999886542 12222333221 1222222 Q ss_pred ccHHHHHHHHHHHHHHHhhccCC-----HhH-HHHHHHHHHhhcCC-hhHHHHhhhccCCCChhhhhHHHHHHHHHHHHH Q lcl|NC_019423. 562 AEIDNQKSQDLGFMVQTLGNTVD-----QSI-TLSLVAKIAELKRM-PDLAHELRTWQPQPDPMEEQLKQLAIQKAQLEN 634 (756) Q Consensus 562 a~~~~~~~q~l~~llq~~~~~~~-----~~~-~~~~l~~l~e~~~~-~~~~~~l~~~~~q~~p~~~~~~q~~~~~aq~e~ 634 (756) +....++.+.+..+++.++...+ |.+ .-+++..+++..|. |. ..++ ++++ .+++.+++.+++ T Consensus 418 ~Laraq~~~~l~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gv~p~--~ivr------s~ee--v~a~~~~~~~q~- 486 (510) T protein:vir:78 418 ALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTS--QFYK------SADE--LQAEAEEQRRQA- 486 (510) T ss_pred HHHHHHHHHHHHHHHHHHHHhcChhhhhhcCCHHHHHHHHHHHhCCChh--hhcC------CHHH--HHHHHHHHHHHH- Confidence 34455555556555555432222 111 12334444444454 21 1111 1111 111111110000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 635 EELQSKIALNNAKAKEAASSGDLKDLDYLEQESGT 669 (756) Q Consensus 635 ~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~ 669 (756) +..++..+.+..+ +. 
+......++ T Consensus 487 -------~~~~~~~~a~~~~--~~--~~~~~~~g~ 510 (510) T protein:vir:78 487 -------AQAQAAQETLLEG--AS--DMTNALAGV 510 (510) T ss_pred -------HHHHHHHHHHHHh--hh--hhcccCCCC Confidence 0000000000000 00 000111112 No 50 >protein:vir:80211 Length: 514 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522882;genbank:gi:158345175;genbank:GeneID:5687474 Probab=99.91 E-value=5.8e-22 Score=137.00 Aligned_cols=482 Identities=11% Similarity=0.015 Sum_probs=255.3 Q ss_pred HHHHHHHH--HHHhhHHHHHHHHHHHHhccccCCCCCC--CC-CC-CcccCHHHHHHHHHHHHHHHHhhcC-CCCEEEEe Q lcl|NC_019423. 31 LKGDLESA--KPAHDAIMSQIREWNDLMEVKGKAKPPK--IK-GR-SQVQPRLVRRQAEWRYAPLSEPFLS-SSKLFKLT 103 (756) Q Consensus 31 l~~~~~~a--~~~~~~~~~~~~~~~~~y~~~~~~~~~~--~~-gr-S~~v~~~v~~~~e~~~~~L~~~f~~-~~~~~~~~ 103 (756) +++..... +..+++.++.|++-.+|........+.. .. ++ -+.++..-.+.++.+.+.|+.-+|+ +.+||++. T Consensus 1 m~~~~~~l~~k~~R~~~e~~w~e~a~~~lP~~~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 80 (514) T protein:vir:80 1 MRQQASAMWAEYRDSTAIRKAEDFAKFTIASLMVDPLDKTHQAEVVEYDFQSAGAFLVNNLTAKLALTLFPPGRPSFQIE 80 (514) T ss_pred CccchHHHHHHhhcchHHHHHHHHHHHhcccccCCCCCCcccccccccccchhHHHHHHHHHHHHHhhhcCCCCcccccc Confidence 33333332 4457788889999999987532211111 11 11 2346778888999999999999998 57999998 Q ss_pred cCC-------cchHHHHHH------HHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeeeeeeeeeecC Q lcl|NC_019423. 104 PVT-------FEDELAARQ------NELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIKTETPVFQLY 170 (756) Q Consensus 104 p~~-------~~D~~~A~q------~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~ 170 (756) +-. .+|.+.++. .+..+.-.| ..++-+..++..+++++..|++++-+ +.. 
T Consensus 81 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l-~~snf~~~~~~~~~~L~~~G~a~l~~--~~~-------------- 143 (514) T protein:vir:80 81 LDDTLQELAAANGIDQSELHSRTADLERRATRRL-FVNASLSKLHRILKLLVVTGNALFYR--EPG-------------- 143 (514) T ss_pred cCchhhhhccccchhHHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHhHCeEEEEE--ecC-------------- Confidence 731 222222222 222233223 45677777888899988888887653 100 Q ss_pred CCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEEEechhheEeC Q lcl|NC_019423. 171 PIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVEMLNPNNVVID 250 (756) Q Consensus 171 ~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie~V~p~~~~~D 250 (756) .+ .+..++..+|++. T Consensus 144 ---------------------------------------------------------------~~--~~~~~pl~~y~v~ 158 (514) T protein:vir:80 144 ---------------------------------------------------------------TG--KMLVWTMQSYTVR 158 (514) T ss_pred ---------------------------------------------------------------CC--cEEEEEcCeEEEe Confidence 00 1233455667777 Q ss_pred CCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceEEEEEEEEEeecc Q lcl|NC_019423. 251 PSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVAYEYWGFYDIN 330 (756) Q Consensus 251 p~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~E~w~k~d~~ 330 (756) .++.+.+++ ++++.++|..+|....... .. .. ........+|.||.|.++.+.. T Consensus 159 ~d~~G~v~~---i~rr~~~~~~~l~~~~~~~---------~~-----~~---------~~~~~~~~~v~v~~~v~~~~~~ 212 (514) T protein:vir:80 159 RTSHGDPAV---VVLRQQMPFRELTPEIQAD---------AQ-----AK---------QIAKRDSDKCDLYTVIEWQPTP 212 (514) T ss_pred eCCCcCeEE---EEeeeeecHHHhhhhhhhh---------hh-----hh---------hccCCCCCceEEEEEEEeecCC Confidence 776655553 6778899988775432110 00 00 0001122368888887665432 Q ss_pred CCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHHHHHHHHHHHHhhcCCceE Q lcl|NC_019423. 
331 DDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGATMRGMIDLLGRSANGQRG 410 (756) Q Consensus 331 ~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~ 410 (756) ++..+.++.-.-|.+++ .++-|+..++||++..|...++..||.|.+....+-.+.+|++.+..+.....+.++.++ T Consensus 213 -~~~~~sv~~e~~g~~i~--~es~y~~~e~P~i~~Rw~~~~ge~YGrgp~~~al~D~k~L~~l~~~~l~~~~~a~~~~~~ 289 (514) T protein:vir:80 213 -NGKRCAVWHELEGKRVG--PESSYPAHLCPYVPVAWNVPDGEHYGRGYVEEYSGDFARLSILSERLGLYEFEALSLLNL 289 (514) T ss_pred -CCeEEEEEEeccceeec--ccCccccccCCeeeeeeEecCCCCcccchHHHHHHHHHHHHHHHHHHHHHHHHhcCCCce Confidence 22212222222234443 455677788999999999999999999999999999999999999999999999999999 Q ss_pred eeccccCccchhhhhccccccccccccccccccccccCCC--cchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhH Q lcl|NC_019423. 411 YPKGMLDTLNRRRYDDGQDYEYNPMQGNPSQSIMEHKFPE--LPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVA 488 (756) Q Consensus 411 ~~~gav~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~--~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA 488 (756) ++.+.+..... ...+... .+..+....+.+.+... --+.+...++...+.+.... . +.+...++...|| T Consensus 290 v~~~g~~~~~~--l~~~~~g---~~v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~aF-m---l~~~~rd~~rvTA 360 (514) T protein:vir:80 290 VDEAKGGAVDD--YRDAETG---DFVPGQVGSVASYERGDYNKIAQASASVESIVMRLNRAF-M---YTGQVRDAERVTV 360 (514) T ss_pred eCcccccchhh--hcccCCc---eeecCCCccceeeecCcccchHHHHHHHHHHHHHHHHHH-h---hhccCCCCCCCCH Confidence 98766543322 1222211 11112223455544322 12333455666665555431 1 1111123334699 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHhcCcceEEEeccccc-HHH Q lcl|NC_019423. 489 AGIRGALDAASKREMAILRRLAK-GMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDLKGNFDIEVDINTAE-IDN 566 (756) Q Consensus 489 ~~i~~~~~aa~~~l~~~~~n~~~-~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~~~~~Dv~V~~g~a~-~~~ 566 (756) ++|..+.+-....|..++.+|.. ++.++.++.+.++..... ...+--|++. +.+.+..+.+. .+. 
T Consensus 361 tEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~il~r~~~----------g~lP~~p~~l---~~~~~vs~la~l~r~ 427 (514) T protein:vir:80 361 EEIRTVAEEAENLLGGVYSLLAETLQAPLAYLTMYEASRGNG----------GMLLGIAQGV---YRPSIITGIPALTRN 427 (514) T ss_pred HHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhcc----------CCCCCCCchh---hcceeeecHHHHHHH Confidence 99999988888899999999875 888999998888864311 0111112221 22333333332 233 Q ss_pred HHHHHHHH---HHHHhhccCCHhH-----HHHHHHHHHhhcCChhHHHHhhhccCCCChhhhhHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 567 QKSQDLGF---MVQTLGNTVDQSI-----TLSLVAKIAELKRMPDLAHELRTWQPQPDPMEEQLKQLAIQKAQLENEELQ 638 (756) Q Consensus 567 ~~~q~l~~---llq~~~~~~~~~~-----~~~~l~~l~e~~~~~~~~~~l~~~~~q~~p~~~~~~q~~~~~aq~e~~~~q 638 (756) ...+.+.. .++.+++.. |++ .-+++..+++..|.|-.. +. .+++ +.++.+.+.+++..+ T Consensus 428 ~~~~~l~~~~~~i~~l~~~~-p~v~d~id~d~~~~~~a~~~Gvp~~~--i~-----~~~e-----~~~~~~~~~~~~~~~ 494 (514) T protein:vir:80 428 IETANILRATQEASAIVPAL-VQLSKRFDPEKLVERIFANNSVDLST--LS-----KDPD-----VVAAEAEQEAALAQQ 494 (514) T ss_pred HHHHHHHHHHHHHHHHhccc-hhhhhcCCHHHHHHHHHHHhCCCHhh--cc-----CCHH-----HHHHHHHHHHHHHHH Confidence 33333333 333333322 221 223344555555554110 11 1111 011100000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 639 SKIALNNAKAKEAASSGDLKDLDYLEQESGTKHA 672 (756) Q Consensus 639 a~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~ 672 (756) +++. ++.. .+. .+.+++--. 
T Consensus 495 ~~~~---~~~~---------~~~--~~~~~~~~~ 514 (514) T protein:vir:80 495 QLDV---ASGA---------LAA--ETSAGVLTS 514 (514) T ss_pred HHHH---HHHH---------HHH--hhhccccCC Confidence 0000 0000 000 000000000 No 51 >protein:vir:105641 Length: 516 # NCBI annotation: putative head-tail connector # Family: family:all:481 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425006;genbank:gi:83571754;uniprot:Q2WC46;genbank:GeneID:3837282 Probab=99.89 E-value=7.1e-21 Score=131.06 Aligned_cols=491 Identities=10% Similarity=0.033 Sum_probs=263.6 Q ss_pred CCCCCCccccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCCCCCCcccCHHHHHHHHHHH Q lcl|NC_019423. 7 FKPLPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAKPPKIKGRSQVQPRLVRRQAEWRY 86 (756) Q Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~grS~~v~~~v~~~~e~~~ 86 (756) +++ +. ++.-.-....|++.++..++.+++.++.|++-.+|.......+....++..++++..-.+.++.+. T Consensus 1 ~~~--~~-------~~~~~~~~~~l~~r~~~L~~~R~~~e~~w~e~a~~~lP~~~~~~~~~~~~~~~~dstg~~a~~~LA 71 (516) T protein:vir:10 1 MKQ--ST-------DLEYGGKRSKIPKLWEKFSTKRSSFLDRAKHYSKLTLPYLMNDKGDNETSQNGWQGVGAQATNHLA 71 (516) T ss_pred CCc--hh-------hHhhhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhcccccCCCCCcccccccccchHHHHHHHHH Confidence 221 11 111122346788899999999999999999999999865433333334445789999999999999 Q ss_pred HHHHHhhcC-CCCEEEEecCCcchH-------HH------HHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEe Q lcl|NC_019423. 87 APLSEPFLS-SSKLFKLTPVTFEDE-------LA------ARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIG 152 (756) Q Consensus 87 ~~L~~~f~~-~~~~~~~~p~~~~D~-------~~------A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~ 152 (756) +.|+.-+|+ +.+||++.+-...+. +. 
.+..+..+.- ....++-+..++.++++++..|++++-+ T Consensus 72 a~l~~~ltpp~~~WF~L~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~-~l~~snf~~~~~~~~~~L~~~G~a~l~~- 149 (516) T protein:vir:10 72 NKLAQVLFPAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMK-ELEQRQFRPAVVEAFKHLIVAGSCMLYK- 149 (516) T ss_pred HHHHhhhcCCCCccccccCChhhHhhhhccCchhHHHHHHHHHHHHHHHH-HHHhcCcHHHHHHHHHHHHhHCeEeEEe- Confidence 999999998 579999987432211 11 1123333322 2345667777888888888888886431 Q ss_pred eeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeee Q lcl|NC_019423. 153 WERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKA 232 (756) Q Consensus 153 w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~ 232 (756) + + + T Consensus 150 -d-~------~--------------------------------------------------------------------- 152 (516) T protein:vir:10 150 -P-S------K--------------------------------------------------------------------- 152 (516) T ss_pred -c-C------C--------------------------------------------------------------------- Confidence 1 0 0 Q ss_pred ecCceeEEEechhheEeCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccc Q lcl|NC_019423. 233 LVNRPTVEMLNPNNVVIDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKD 312 (756) Q Consensus 233 ~~g~~~ie~V~p~~~~~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d 312 (756) + .+..++..+|++..++.+.+++ ++++..++..+|.+.+. ++. ....... .. T Consensus 153 --~--~~~~~pl~~y~v~~d~~G~v~~---ivrr~~~~~~~l~e~~~---~~~------~~~~~~~------------~~ 204 (516) T protein:vir:10 153 --G--AISAIPMHHYVVNRDTNGDLLD---IILLQEKSLRTFDPATR---AVV------EVGLKGK------------KC 204 (516) T ss_pred --C--CeEEEEcCeEEEeeCCCCCeEE---EeeeecccHHHHHHHhh---hhh------hhhhhhh------------cc Confidence 0 0223445567776666555543 56778888877755431 110 0000000 00 Q ss_pred cccceEEEEEEEEEeeccCCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHH Q lcl|NC_019423. 
313 ALRKKVVAYEYWGFYDINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGA 392 (756) Q Consensus 313 ~s~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~ 392 (756) .....|.||.+=.+ +.++... +..-.++..+ ..++-|+...|||++..|...++..||.|.+....+-.+.+|. T Consensus 205 ~~~~~~~i~t~v~~---~~~~~~~--~~~~~d~~~~-~~~s~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~ 278 (516) T protein:vir:10 205 KEDDSIKLYTHAKY---LGEGFWE--LKQSADDIPV-GKVSKIKSEKLPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQF 278 (516) T ss_pred CCCCceEEEEEEEe---cCCCceE--EEEeeCceee-ccccccccccCCeeeeeeeecCCCCcccchHHHhhHHHHHHHH Confidence 11124555543222 2344322 2222344433 2344455568999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhhcCCceEeeccccCccchhhhhccccccccccccccccccccccCCCc--chHHHHHHHHHHHHHHHHh Q lcl|NC_019423. 393 TMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQGNPSQSIMEHKFPEL--PQSAIVMTQMQNQEAESLT 470 (756) Q Consensus 393 ~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~--~~~~~~~l~~~~~~~e~~t 470 (756) +.+..+.....+.++.++++.+.+.... ....+.... +..+....+.+.+...- -+.+...++...+.+.... T Consensus 279 l~~~~l~~~~~a~~~~~lv~p~g~~~~~--~l~~~~~g~---~~~g~~~~v~~~q~~~~~d~~~~~~~i~~~~~rI~~af 353 (516) T protein:vir:10 279 LSEAVARGAALMADIKYLIRPGAQTDVD--HFVNSGTGE---VVTGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVVF 353 (516) T ss_pred HHHHHHHHHHHhcCCCcccCcccccchh--hhccCCCce---eecCCcccceeeecCcccchHHHHHHHHHHHHHHHHHH Confidence 9999999999999999999776653322 122222211 11222333455443321 2344455666666555442 Q ss_pred chhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHh Q lcl|NC_019423. 471 GVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAK-GMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDL 549 (756) Q Consensus 471 Gv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~-~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~ 549 (756) =+.. ...-++...||++|..+.+-....|..++-+|.. ++.++..+.+..+. . 
++ |+.+ T Consensus 354 ~~~~---l~~rd~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~~~~-------------p---~~-P~~l 413 (516) T protein:vir:10 354 MMET---MTRRDAERVTAVEIQRDALEIEQNMGGVYSLFATTMQSPVAMWGLLEAG-------------D---SF-TSDL 413 (516) T ss_pred hhhh---hhccCCccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhhC-------------C---CC-Chhh Confidence 1111 1111223479999999988888889999998875 77777766542211 0 11 2222 Q ss_pred cCcceEEEeccccc-HHHHHHHHHHHHHHHhhc--cCCHhHHH-----HHHHHHHhhcCChhHHHHhhhccCCCChhhhh Q lcl|NC_019423. 550 KGNFDIEVDINTAE-IDNQKSQDLGFMVQTLGN--TVDQSITL-----SLVAKIAELKRMPDLAHELRTWQPQPDPMEEQ 621 (756) Q Consensus 550 ~~~~Dv~V~~g~a~-~~~~~~q~l~~llq~~~~--~~~~~~~~-----~~l~~l~e~~~~~~~~~~l~~~~~q~~p~~~~ 621 (756) . ++.+..+.+. ...+..+.+..+++.++. .++|.+.. ..+..+++..|.|- ..++ ++.+. T Consensus 414 v---~~~~v~~i~~L~raq~~~~i~~~~q~i~~~~q~~p~v~d~id~d~~~~~~a~~~gvp~--~~ir------s~eev- 481 (516) T protein:vir:10 414 V---DPVIITGIEALGRMAELDKLANFAQYMSLPLQWPEPVLAAVKWPDYMDWVRGQISAEL--PFLK------SAEEM- 481 (516) T ss_pred c---CcceehhHHHHHHHHHHHHHHHHHHHHHHHhcCChHHHhhcCHHHHHHHHHHHhCCCh--hccC------CHHHH- Confidence 2 2223334333 333444445555555442 23343221 23444555555541 1221 11111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 622 LKQLAIQKAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHA 672 (756) Q Consensus 622 ~~q~~~~~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~ 672 (756) +++.+++.+.+..+. 
++...+++.......+ +|++ T Consensus 482 -~~~r~~~~~~q~~~~---~~~~~~~~~~~~~~~~------------~~~~ 516 (516) T protein:vir:10 482 -EQEQEAQMQAQQAQM---LEEGVAKAVPGVIQQE------------LKEA 516 (516) T ss_pred -HHHHHHHHHHHHHHH---HHHHhhhcccchhhhh------------hhcC Confidence 111111111110000 0000000000000000 0111 No 52 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=99.71 E-value=2.4e-15 Score=100.70 Aligned_cols=443 Identities=11% Similarity=0.048 Sum_probs=213.0 Q ss_pred CCcccCCCCCCCccccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCC--CCCCCCCC--CcccCH Q lcl|NC_019423. 1 MEHQDTFKPLPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKA--KPPKIKGR--SQVQPR 76 (756) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~--~~~~~~gr--S~~v~~ 76 (756) .|++++|.=|.|. +++.+ .+....+.|+..+.+.++..+||.|.-.- .+++.+++ .+++.+ T Consensus 3 ~~~~~~~~~p~d~-------~~~~~--------~l~~~i~~~~~~~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~ki~~n 67 (453) T protein:vir:39 3 YKPPKLMTFPKDE-------PITNE--------VVTKFMEKHRLEVARYEYLKNMYRGIMAIDAEPTKDLWKPDNRLTVN 67 (453) T ss_pred ecCCcceEcCCCC-------CCCHH--------HHHHHHHHHHHHHHHHHHHHHHhhccCchhcCCCccccCccceeecc Confidence 5777777543333 23332 24444556777777788899999975211 22233343 467777 Q ss_pred HHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeee Q lcl|NC_019423. 77 LVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERK 156 (756) Q Consensus 77 ~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~ 156 (756) ..+..|+.....| +|.+..+ .+ +|.+. .+.++.++. .|+--.....+.++++.+|.+++.+|++. 
T Consensus 68 ~~~~ivd~~~~~l----~g~~~~~--~~---~d~~~----~~~l~~i~~-~N~~~~~~~~~~~~~~~~G~~~~~v~~d~- 132 (453) T protein:vir:39 68 FTKYIVDTFTGYF----NGIPVKK--SH---SDKET----LSKLQEFDN-LNDMEDEESELAKMACIYGRAFELLYQNE- 132 (453) T ss_pred hHHHHHHHHhhhh----cccCcee--cc---CChHH----HHHHHHHHH-hcChhHHHHHHHHHHhhcCeEEEEEEecC- Confidence 7777777776655 5554333 22 23332 335555544 34334456678999999999988876531 Q ss_pred eeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCc Q lcl|NC_019423. 157 TVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNR 236 (756) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~ 236 (756) .|. T Consensus 133 -----------------------------------------------------------------------------~g~ 135 (453) T protein:vir:39 133 -----------------------------------------------------------------------------ETQ 135 (453) T ss_pred -----------------------------------------------------------------------------CCc Confidence 133 Q ss_pred eeEEEechhheEe--CCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccc Q lcl|NC_019423. 237 PTVEMLNPNNVVI--DPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDAL 314 (756) Q Consensus 237 ~~ie~V~p~~~~~--Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s 314 (756) +++..++|.+++. |+... ....+.++ .+.. T Consensus 136 ~~i~~~~p~~~~~v~d~~~~---~~~~~~ir-~~~~-------------------------------------------- 167 (453) T protein:vir:39 136 TNVIYNTPENMFMVYDDTIK---QEPLFAVR-YGYD-------------------------------------------- 167 (453) T ss_pred eEEEEEcccceEEEecCCCC---CeEEEEEE-EEEe-------------------------------------------- Confidence 6677788887644 43322 11222222 2110 Q ss_pred cceEEEEEEEEEeeccCCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHHHH Q lcl|NC_019423. 
315 RKKVVAYEYWGFYDINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGATM 394 (756) Q Consensus 315 ~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~~~ 394 (756) ...+.++|+|.. +.+ +++...++.....+..|.+.|.+|++.++. +.+|.|.+..++++++.+|..+ T Consensus 168 ~~~~~~~~~yt~-----~~i---~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~sd~e~v~~liDa~~~~~ 234 (453) T protein:vir:39 168 DDYKLYGEVYTK-----ETT---YALNGTMGFYNMTEQAPNPFDDLPVVEFYF-----NEERMSIFESVISLVNAFNKAI 234 (453) T ss_pred CCeEEEEEEEeC-----CeE---EEEEecCCceeeecccccCCCceeEEEecC-----CCCCCcchhhhHHHHHHHHHHH Confidence 001234455532 111 111222222222233333446778776643 4568999999999999999999 Q ss_pred HHHHHHHHhhcCCceEeeccccCccchhhhhccccc-cccccccccccccccccCCCcchHHHHHHHHHHHHHHHHhchh Q lcl|NC_019423. 395 RGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDY-EYNPMQGNPSQSIMEHKFPELPQSAIVMTQMQNQEAESLTGVK 473 (756) Q Consensus 395 ~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~-~~~~~~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~e~~tGv~ 473 (756) +.+.+.+...+.|.+++.-..++............. ..+.......+.+.++..+.-.......+..+.+.+-..|+++ T Consensus 235 s~~~~~~~~~~~p~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~I~~~s~~p 314 (453) T protein:vir:39 235 SEKANDVDYFSDQYLTFLGAAVEEEDLKNIRSNRVINYYGESSEAKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVA 314 (453) T ss_pred HHHHHHHHHhhCceeeeecCCCCchhhhhhhhcceeeecCCCCCCCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCc Confidence 999999988888877765333332222111111111 1111122233445555544334566667888888888899988 Q ss_pred HHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHhcCcc Q lcl|NC_019423. 474 AFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDLKGNF 553 (756) Q Consensus 474 ~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~~~~~ 553 (756) +.+.+..++ .++.++...............+.|..++++++++++.+.... |.. .+. . 
T Consensus 315 ~~~~~~~gn---~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~----------~~~---~~~------~ 372 (453) T protein:vir:39 315 NISDESFGS---SSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLYCELSTNV----------SNK---EAW------K 372 (453) T ss_pred ccccccccC---ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc----------CCc---ccc------c Confidence 776554333 233335444444555556666777777777777777665422 110 000 1 Q ss_pred eEEEecccccHHHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhc-CChhHHHHhhhccCCCChhhhhHHHHHHHHHHH Q lcl|NC_019423. 554 DIEVDINTAEIDNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELK-RMPDLAHELRTWQPQPDPMEEQLKQLAIQKAQL 632 (756) Q Consensus 554 Dv~V~~g~a~~~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~-~~~~~~~~l~~~~~q~~p~~~~~~q~~~~~aq~ 632 (756) +|.|.-........ .+.+..+..++..++.+... +.. ...+....++++ ..++.+. T Consensus 373 ~i~v~f~~~~p~~~--~~~a~~~~kl~g~is~et~l-------~~l~~v~D~~~E~~ri--------------~~E~~~~ 429 (453) T protein:vir:39 373 DIEYTFTRNEPKDI--KEQAETANILMGITSQETAL-------SVISVIPDVQAEMEKI--------------KKEEAST 429 (453) T ss_pred cceEEeCCCCCcCH--HHHHHHHHHHhccCChHHHH-------HhCCCCCCHHHHHHHH--------------HHHHHHH Confidence 23333333222111 11122222222333332221 111 122221111111 1100000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 633 ENEELQSKIALNNAKAKEAASSGDLKDLDYLEQE 666 (756) Q Consensus 633 e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~ 666 (756) . ...+......+. .+.+... -.++ T Consensus 430 ~---~~~~~~~~~~~~----~~~~~~~---~~~e 453 (453) T protein:vir:39 430 A---IFDKDKQPSEKG----TDTVVPE---TNEE 453 (453) T ss_pred H---HHHHhccCCCCC----CCCCCCC---cCCC Confidence 0 000000000000 0000000 0000 No 53 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=99.66 E-value=1.1e-14 Score=97.09 Aligned_cols=443 Identities=12% Similarity=0.080 Sum_probs=208.4 Q ss_pred cCCCCCCCccccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCC--CCCCCCCCC--cccCHHHHH Q lcl|NC_019423. 
5 DTFKPLPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKA--KPPKIKGRS--QVQPRLVRR 80 (756) Q Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~--~~~~~~grS--~~v~~~v~~ 80 (756) -+.+| =|+-.+..++-+. ...+....+.|...+.+.++-.+||.|.-.- .+++..+|+ +++.+..+. T Consensus 1 ~~~~~-------~~~~~~~~~~~~~--~~~i~~~i~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~ki~~n~~~~ 71 (452) T protein:vir:36 1 MKYKP-------PKLMTFSKDEPIT--VEVVTKFMEKHKLEVARYEYLKNMYLGIMAIDDEPAKDSWKPDNRLAVNFTKY 71 (452) T ss_pred CcccC-------ceeEEcCCccCCC--HHHHHHHHHHHHHHHHHHHHHHHHhccccccccCccccccCccceeecchHHH Confidence 11111 1222333222111 1345556667777777888899999986322 222333443 566666666 Q ss_pred HHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeee Q lcl|NC_019423. 81 QAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKI 160 (756) Q Consensus 81 ~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~ 160 (756) .|+....-| +|.+.. |.+ +|.+ ..+.++.++. .|+--.....+.++++.+|.+++.+||+. T Consensus 72 ivd~~~~~l----~g~~~~--~~~---~d~~----~~~~l~~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d~----- 132 (452) T protein:vir:36 72 IVDTFTGYF----NGIPVK--KSH---SDKE----ILTKLQEFDN-LNDMEDEESELAKMACIYGRAFEFLYQDE----- 132 (452) T ss_pred HHHHHhhhh----cccCce--eec---CChh----HHHHHHHHHh-hcChhHHHHHHHHHHHhcCeEEEEEEecC----- Confidence 666665554 665544 333 2322 3345665543 34434446678899999999988776531 Q ss_pred eeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEE Q lcl|NC_019423. 161 KTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVE 240 (756) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie 240 (756) .|.+++. 
T Consensus 133 -------------------------------------------------------------------------~g~~~i~ 139 (452) T protein:vir:36 133 -------------------------------------------------------------------------DTQTNVV 139 (452) T ss_pred -------------------------------------------------------------------------CCeeEEE Confidence 1346677 Q ss_pred EechhheEe--CCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceE Q lcl|NC_019423. 241 MLNPNNVVI--DPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKV 318 (756) Q Consensus 241 ~V~p~~~~~--Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V 318 (756) .++|.++++ |+.... ..-+. .+.+... ..+ T Consensus 140 ~~~p~~~~~v~d~~~~~---~~~~~-i~~~~~~--------------------------------------------~~~ 171 (452) T protein:vir:36 140 YNSPENMFMVYDDTVKQ---EPLFA-VRYGVDE--------------------------------------------DKK 171 (452) T ss_pred EEcccceEEEEcCCCCC---ceEEE-EEEEEec--------------------------------------------Cce Confidence 778887643 433211 11122 2222110 011 Q ss_pred EEEEEEEEeeccCCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHHHHHHHH Q lcl|NC_019423. 319 VAYEYWGFYDINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGATMRGMI 398 (756) Q Consensus 319 ~v~E~w~k~d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~~~~~~~ 398 (756) ..+|+|... .+ +++...++........|.+.|.+|++.++. +..|.|.+..++++++.+|..++.+. T Consensus 172 ~~~~vyt~~-----~i---~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----~~~g~sd~e~v~~liDa~d~~~s~~~ 238 (452) T protein:vir:36 172 LQGEVYTLL-----ET---IKISGENDEISFGEGTYNPYPDLPVVEFYF-----NEERMSIFESVISLVNAFNKAISEKA 238 (452) T ss_pred EEEEEEecC-----eE---EEEEEcCCceEEecceeccCCcccEEEecC-----CCCCCcchHHHHHHHHHHHHHHHHHH Confidence 223444321 11 112222222222233344457788876643 34688999999999999999999999 Q ss_pred HHHHhhcCCceEeeccccCccchhhhhccccccccccccccccccccccCCCcchHHHHHHHHHHHHHHHHhchhHHhcC Q lcl|NC_019423. 
399 DLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQGNPSQSIMEHKFPELPQSAIVMTQMQNQEAESLTGVKAFSGG 478 (756) Q Consensus 399 d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G 478 (756) +.+...++|.+.+.-..++..................+......+.++..+.-.......+..+.+.+-..|++++.+.+ T Consensus 239 ~~~~~~~~p~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~ 318 (452) T protein:vir:36 239 NDVDYFSDQYLTFLGAAVEEEDLKNIRSNRVINYYADGEGKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVANISDE 318 (452) T ss_pred HHHHHhcCceeEeecCCcCchhhhhhhhcceEEecCCCCccCCcceeEeecCCHHHHHHHHHHHHHHHHHHhCccccCcc Confidence 99988899877765333322211111111101010001111223455444444556667788888999999999887666 Q ss_pred CCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHhcCcceEEEe Q lcl|NC_019423. 479 VTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDLKGNFDIEVD 558 (756) Q Consensus 479 ~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~~~~~Dv~V~ 558 (756) ..++. ++.++...............+.|..+++.++++++.+....-... ++ .+|.|. T Consensus 319 ~~gn~---Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~--------~~-----------~~i~i~ 376 (452) T protein:vir:36 319 SFGSS---SGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLFCELSTNVSNKD--------SW-----------KDIEYT 376 (452) T ss_pred cccCC---cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcc--------cc-----------ccceEE Confidence 44432 333355455555555666667777788888887777665321100 11 123333 Q ss_pred cccccHH--HHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcC-ChhHHHHhhhccCCCChhhhhHHHHHHHHHHHHHH Q lcl|NC_019423. 559 INTAEID--NQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKR-MPDLAHELRTWQPQPDPMEEQLKQLAIQKAQLENE 635 (756) Q Consensus 559 ~g~a~~~--~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~-~~~~~~~l~~~~~q~~p~~~~~~q~~~~~aq~e~~ 635 (756) -...... ...++.+ ......++.+. +++..+ ..+....++ ++..++.+. +. 
T Consensus 377 f~~~~p~d~~~~a~~~----~k~~g~iS~et-------~~~~~~~~~d~~~E~~--------------ri~~E~~~~-~~ 430 (452) T protein:vir:36 377 FTRNEPKDIKEQAETA----NILMGITSQET-------ALSVISVIPDVQAEME--------------KIKKEEAST-AI 430 (452) T ss_pred eCCCCCcCHHHHHHHH----HHHhccCChHH-------HHHhCCCCCCHHHHHH--------------HHHHHHHHH-HH Confidence 3322221 1122222 22222233211 122222 122211111 111111000 00 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 636 ELQSKIALNNAKAKEAASSGDLKDLDYLEQE 666 (756) Q Consensus 636 ~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~ 666 (756) ..+. ..... .........-.++ T Consensus 431 ~~~~--~~~~~-------~~~~~~~~~~~~e 452 (452) T protein:vir:36 431 FDKD--KQPSE-------KGTDTVVSETNEE 452 (452) T ss_pred HHhh--ccCCC-------CcccccCccccCC Confidence 0000 00000 0000000000011 No 54 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=99.66 E-value=2e-14 Score=95.71 Aligned_cols=437 Identities=9% Similarity=0.033 Sum_probs=196.9 Q ss_pred CCcccCCCCCCCccccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCC--CCCCCCCC--CcccCH Q lcl|NC_019423. 1 MEHQDTFKPLPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKA--KPPKIKGR--SQVQPR 76 (756) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~--~~~~~~gr--S~~v~~ 76 (756) |+...-|. ..+..+++. .++....+.|+....+.++..+||.|.-.- ..++.+++ .+++.+ T Consensus 3 ~~~~~~~~-------~~~~~~~~~--------~~i~~~i~~~~~~~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~ki~~n 67 (453) T protein:vir:73 3 LKPIKLMT-------YSRDEEITD--------KVVNDFMKKHQEEVERYEYLGNMYKGIMEISSQKAKDSWKPDNRLTNN 67 (453) T ss_pred cccceeee-------ccccccCCH--------HHHHHHHHHHHHHHHHHHHHHHHhccccchhcCCCCCccCccceeecc Confidence 33333331 111223333 234445556666667777889999986321 12223343 467777 Q ss_pred HHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeee Q lcl|NC_019423. 
77 LVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERK 156 (756) Q Consensus 77 ~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~ 156 (756) ..+..|+....-| +|.+ +.|.+ +|.. ..+.++.++. .|+--.....+.++++++|.+.+.+|++.+ T Consensus 68 ~~~~ivd~~~~~l----~g~~--~~~~~---~d~~----~~~~l~~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~ 133 (453) T protein:vir:73 68 FAKYIVDTFVGYF----NGIP--IKKTH---DDKS----VLEAMQLFDN-LNDMEDEESELAKIACVYGRAYELMYQNES 133 (453) T ss_pred hHHHHHHHhhhhh----cccC--ceeec---CChH----HHHHHHHHHH-hcChhHHHHHHHHHHHhcCeEEEEEEeCCC Confidence 7777777665544 6654 33433 2332 2234544432 344445566789999999999888766311 Q ss_pred eeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCc Q lcl|NC_019423. 157 TVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNR 236 (756) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~ 236 (756) |. T Consensus 134 ------------------------------------------------------------------------------~~ 135 (453) T protein:vir:73 134 ------------------------------------------------------------------------------TE 135 (453) T ss_pred ------------------------------------------------------------------------------Cc Confidence 23 Q ss_pred eeEEEechhheEe--CCCCcCccccCceEEE-EeecCHHHHHhhccchhhhcccCchhhhhhhchhhhcccccccccccc Q lcl|NC_019423. 237 PTVEMLNPNNVVI--DPSCNGDLDKALYAVI-SFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDA 313 (756) Q Consensus 237 ~~ie~V~p~~~~~--Dp~a~~d~~da~~v~~-~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 313 (756) +++..++|.++++ |+.... ..++. +.+.+. 
+ T Consensus 136 ~~i~~~~p~~~~~v~dd~~~~-----~~~~~i~~~~~~-------------~---------------------------- 169 (453) T protein:vir:73 136 SEVIYCSPLNVFMVYDDSIKQ-----KPLFAVYYGFDE-------------E---------------------------- 169 (453) T ss_pred eEEEEEcccceEEEEeCCCCc-----eeEEEEEEEEec-------------C---------------------------- Confidence 5566677777643 322211 11222 222110 0 Q ss_pred ccceEEEEEEEEEeeccCCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHHH Q lcl|NC_019423. 314 LRKKVVAYEYWGFYDINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGAT 393 (756) Q Consensus 314 s~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~~ 393 (756) .....++|.. +++ +++..-++.....+..|.+.|.+|++.++. +.+|.|.+..++++++.+|.. T Consensus 170 ---~~~~~~vyt~-----~~i---~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~s~~~~v~~liDa~~~~ 233 (453) T protein:vir:73 170 ---GNLSGTVYTL-----LET---ISITGKAGEVKFGESTYNVYSDLPIVEYNF-----NEERQSIFEPVHSLINSYNKV 233 (453) T ss_pred ---ceEEEEEEeC-----CeE---EEEEecCCceEEccceeccCCceeEEEecC-----CCCCCcchhhHHHHHHHHHHH Confidence 0112233321 110 111111121111222333447788876643 446889999999999999999 Q ss_pred HHHHHHHHHhhcCCceEeeccccCccchhhhhccccccc-----ccc-ccccccccccccCCCcchHHHHHHHHHHHHHH Q lcl|NC_019423. 394 MRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEY-----NPM-QGNPSQSIMEHKFPELPQSAIVMTQMQNQEAE 467 (756) Q Consensus 394 ~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~-----~~~-~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~e 467 (756) ++.+.+.+...++|.+++.-..++............... ... 
.......++++..+.-...+...++.+.+.+- T Consensus 234 ~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~ 313 (453) T protein:vir:73 234 TSEKANDVEYFSDQYLVFLGAEVDEEDAKNIKDNRLINFFDKNSNGQGTNAAKVDVKFLDKPDSDVQTENLLNRLERSIF 313 (453) T ss_pred HHHHHHHHHHhccceeeeecCCCCchhhhcccccccccccccccccccccccCceeEEeeecCCHHHHHHHHHHHHHHHH Confidence 999999998888887766422222211111111100000 000 01112224555444334556667788888888 Q ss_pred HHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHh Q lcl|NC_019423. 468 SLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKRE 547 (756) Q Consensus 468 ~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d 547 (756) ..|++++.+.+..++ -++.++...............+.|..++++++++++.+.... |.. . T Consensus 314 ~~s~~p~~~~~~~gn---~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~----------~~~------~ 374 (453) T protein:vir:73 314 QFTMAANISDENFGN---SSGVALAYKLQAMSNLALSFQRKFQSALNRRYSLWSSLSTNA----------SNK------D 374 (453) T ss_pred HHhCCcccCcccccC---ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc----------CCc------c Confidence 899998876554333 233334444444445555555666677777766666544211 110 0 Q ss_pred HhcCcceEEEecccccHHHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhc-CChhHHHHhhhccCCCChhhhhHHHHH Q lcl|NC_019423. 548 DLKGNFDIEVDINTAEIDNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELK-RMPDLAHELRTWQPQPDPMEEQLKQLA 626 (756) Q Consensus 548 ~~~~~~Dv~V~~g~a~~~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~-~~~~~~~~l~~~~~q~~p~~~~~~q~~ 626 (756) ++ .+|.|.-.+...... .+.+..+..+...++-+. +++.. 
+..+...-+++ ++ T Consensus 375 ~~---~~i~v~f~~~~p~~~--~~~a~~~~k~~giis~et-------~~~~~~~~~d~~~E~~r--------------i~ 428 (453) T protein:vir:73 375 AW---KDIEYTFTRNEPKDI--KEQAETANILKGITSEET-------ALSVISVIPDVQAEMEK--------------IK 428 (453) T ss_pred cc---ccceEEeCCCCCCCH--HHHHHHHHHHhccCcHHH-------HHHhCCCCCCHHHHHHH--------------HH Confidence 01 123333232222111 111112222112222211 11211 22221111111 11 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 627 IQKAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARDM 675 (756) Q Consensus 627 ~~~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~ 675 (756) .++.+ ...++... ...+..+...+ | T Consensus 429 ~E~~~--~~~~~~~~----~~~~~~~~~~~------------------~ 453 (453) T protein:vir:73 429 KKKLL--QLSLTRTS----NLVRMKQMRGN------------------L 453 (453) T ss_pred HHHHH--HHHHHHhc----cCCcchhhhcC------------------C Confidence 10000 00000000 00000000001 1 No 55 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=99.63 E-value=1.8e-14 Score=95.95 Aligned_cols=456 Identities=10% Similarity=0.079 Sum_probs=200.6 Q ss_pred CCccc-CCCCCCCcc-ccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCC---CCCC---CCCCC- Q lcl|NC_019423. 1 MEHQD-TFKPLPDPA-QSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKA---KPPK---IKGRS- 71 (756) Q Consensus 1 ~~~~~-~~~~~~~~~-~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~---~~~~---~~grS- 71 (756) |-+-+ .++++.... .-.++..+.+++.+..+-.+ +....+.++++..+||.|.-.. 
..++ .++++ T Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~------~~~~~~~~~~~~~~yY~g~~~~i~~~~~~~~~~~~~~~ 79 (481) T protein:vir:10 6 INNINTKFSPLANDDFVVSDLAELLKEENLRNFISR------HQTEQVPRLEMLESYYLNRNTDILAGERRLQKYGDKAD 79 (481) T ss_pred eehhchhcccccCceeeeecchhhcCHHHHHHHHHH------HHHHHHHHHHHHHHHhcCCCcccccCcccccccccccc Confidence 22111 111211111 12233444444443333222 1234566788999999875211 1111 12332 Q ss_pred -cccCHHHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEE Q lcl|NC_019423. 72 -QVQPRLVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIAR 150 (756) Q Consensus 72 -~~v~~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k 150 (756) +++.+.....|+....-| +|.+. .|.+ +|.+..+ +++-++. .|+--.....+.+++++.|.+.+. T Consensus 80 ~ki~~n~~~~ivd~~~~~l----~g~~~--~~~~---~d~~~~~----~l~~~~~-~n~~~~~~~~~~~~~~~~G~~~~~ 145 (481) T protein:vir:10 80 HRAVHNYAKYVSRFIVGYL----TGNPI--TITH---QDNQTND----KIIELND-LNDADEVNSDLALNLSIYGRAYEI 145 (481) T ss_pred ceeecchHHHHHHHHHhhh----ccCCc--eEec---CChhHHH----HHHHHHH-hcChhHHHHHHHHHHHhcCeEEEE Confidence 466666666666655444 55333 3443 3444433 3333322 233333456788999999988777 Q ss_pred EeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEee Q lcl|NC_019423. 151 IGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVE 230 (756) Q Consensus 151 ~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~ 230 (756) +|++. T Consensus 146 ~~~d~--------------------------------------------------------------------------- 150 (481) T protein:vir:10 146 VYRDF--------------------------------------------------------------------------- 150 (481) T ss_pred EEeCC--------------------------------------------------------------------------- Confidence 65420 Q ss_pred eeecCceeEEEechhheE--eCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccc Q lcl|NC_019423. 
231 KALVNRPTVEMLNPNNVV--IDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDF 308 (756) Q Consensus 231 ~~~~g~~~ie~V~p~~~~--~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 308 (756) .|+|++..++|.+++ ||+.... ...+ +.+.|...+ T Consensus 151 ---dg~~~i~~~~p~~~~~v~d~~~~~---~~~~-~i~~~~~~~------------------------------------ 187 (481) T protein:vir:10 151 ---EDRDTFKVLDPKSTFVVYDQTLDK---KVVA-GVRYFEKQD------------------------------------ 187 (481) T ss_pred ---CCeEEEEEEcccceEEEEcCCCCC---ceEE-EEEEEEEee------------------------------------ Confidence 134667778888876 3433221 1112 222222100 Q ss_pred cccccccceEEEEEEEEEeeccCCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHH Q lcl|NC_019423. 309 QFKDALRKKVVAYEYWGFYDINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQA 388 (756) Q Consensus 309 ~~~d~s~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~ 388 (756) .....+..+|+|..- . .+++...++..-..++.|.+.|.+|++.++ ++.+|.|.+..++++++ T Consensus 188 ----~~~~~~~~~~~y~~~-----~---i~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~~~~~~v~~lid 250 (481) T protein:vir:10 188 ----KDKVPVQHVEVYTTD-----K---IYYIEIKGGTYHRVEEVEHYYNDVPIIEYL-----NDQFKQGDFENVIALID 250 (481) T ss_pred ----CCCceEEEEEEEecC-----e---EEEEEecCCceeecccccccCCceeEEEee-----cCCCCCCchhhHHHHHH Confidence 001134455665421 1 122223333332223334444677877654 34568999999999999 Q ss_pred HHHHHHHHHHHHHHhhcCCceEeeccccCcc-chhhhhccccccc--c--ccccccccccccccCCCcchHHHHHHHHHH Q lcl|NC_019423. 389 ILGATMRGMIDLLGRSANGQRGYPKGMLDTL-NRRRYDDGQDYEY--N--PMQGNPSQSIMEHKFPELPQSAIVMTQMQN 463 (756) Q Consensus 389 ~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~-~~~~~~~~~~~~~--~--~~~~~~~~~i~~~~~~~~~~~~~~~l~~~~ 463 (756) .+|..++.+.+.+...+++.+.+......+. +............ . .......+.+.++..+.-.......+..+. 
T Consensus 251 a~~~~~s~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~ 330 (481) T protein:vir:10 251 LYDSAQSDTANYMTDLNDAMLAIIGNVDLDSEDAKAFRDANMIHLEPGTNANGSEGKAEVKYVYKQYDVAGVEAYKKRLQ 330 (481) T ss_pred HHHHHHHHHHHHHHHhcCceeEeecCcCCCccchhhhhhccceeccccccccCCCCCcceeEEeecCCHHHHHHHHHHHH Confidence 9999999999999888888776643222111 1111111110000 0 000111223444443333455667788888 Q ss_pred HHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecCceee Q lcl|NC_019423. 464 QEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNEQYVE 543 (756) Q Consensus 464 ~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~ 543 (756) +.+-..|++++.+.|..+... ++.++...............+.|..+++.++++++.++...... + T Consensus 331 ~~i~~~s~~p~~~~~~~~~n~--Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~---------~--- 396 (481) T protein:vir:10 331 NDIHKYTNTPDLNDEQFSGVQ--SGESMKYKLFGLEQVRAIKERLFKKGLMKRYKLLLNNVNLTGLK---------Q--- 396 (481) T ss_pred HHHHHHhCCcccccccccccc--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCC---------c--- Confidence 999999999988776433222 33334333334444455555666677777777666665322110 0 Q ss_pred cCHhHhcCcceEEEecccccH--HHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcC-ChhHHHHhhhccCCCChhhh Q lcl|NC_019423. 544 IKREDLKGNFDIEVDINTAEI--DNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKR-MPDLAHELRTWQPQPDPMEE 620 (756) Q Consensus 544 i~~d~~~~~~Dv~V~~g~a~~--~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~-~~~~~~~l~~~~~q~~p~~~ 620 (756) -...+|.|.-.++.. ....++.+. .+...++.+.. ++..+ ..+..+-++ T Consensus 397 ------~~~~~i~v~f~~~~~~~~~~~a~~~~----kl~g~is~et~-------~~~l~~i~d~~~E~~----------- 448 (481) T protein:vir:10 397 ------HNYAELTITFTPNLPKSMMESINAFN----ALSGGVSESTR-------LSLLDFIDNPKEELE----------- 448 (481) T ss_pred ------cccceeeEEeCCCCCcCHHHHHHHHH----HHhccCChHHH-------HHhCCCCCCHHHHHH----------- Confidence 001233343333322 112222222 12222332221 12111 111111111 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 
621 QLKQLAIQKAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESG 668 (756) Q Consensus 621 ~~~q~~~~~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~ 668 (756) .+..++.+. .....+....++. ... ...+ -.++ T Consensus 449 ---ri~~E~~~~--~~~~~~~~~~~~~----~~~---~~~d---d~~g 481 (481) T protein:vir:10 449 ---KMQEEEAQR--EKQADKRGYGEAF----ENH---LNVD---DSNG 481 (481) T ss_pred ---HHHHHHHHH--HhhhhhccCCccC----CCC---CCCC---CCCC Confidence 111110000 0000000000000 000 0000 0000 No 56 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=99.63 E-value=7.3e-15 Score=98.09 Aligned_cols=452 Identities=9% Similarity=0.034 Sum_probs=205.2 Q ss_pred CCCCCccccccccC---CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccC---CCC--------CCCCCCCcc Q lcl|NC_019423. 8 KPLPDPAQSEKLTD---WKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGK---AKP--------PKIKGRSQV 73 (756) Q Consensus 8 ~~~~~~~~~~~~~~---~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~---~~~--------~~~~grS~~ 73 (756) .-|.=++.++-+.+ +.+ -...+...+......|...+.+..+..+||.|.-. ... ...+-..++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~--~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri 78 (472) T protein:vir:93 1 MYPSQPTQTEIFDAIVRTNN--KPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRM 78 (472) T ss_pred CCCCCCcchhhhhceeeecC--chhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhcccccccccccccc Confidence 22222333333332 221 11123344666666777778888899999998521 010 111222367 Q ss_pred cCHHHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEee Q lcl|NC_019423. 74 QPRLVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGW 153 (756) Q Consensus 74 v~~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w 153 (756) +.+..+..|+.....| +|.+ +.|.. 
+|.+..+ +++-.+ .|+-......+.++++++|.+.+.+|+ T Consensus 79 ~~n~~~~ivd~~~~~l----~g~~--~~~~~---~d~~~~~----~l~~~~--~n~~~~~~~~~~~~~~~~G~~~~~v~~ 143 (472) T protein:vir:93 79 ITNFHANLVDQKVSYI----VGKP--IAFKH---TDDEVVK----RIDEVL--GNRFDDKLHSVLTGASNKGIEWLHPYL 143 (472) T ss_pred ccchHHHHHHHHhhhh----cccC--eeecc---CChHHHH----HHHHHH--hccHHHHHHHHHHHHhhcCeEEEEEEE Confidence 7788888787776655 5544 33332 3444433 343332 244445556788999999988777654 Q ss_pred eeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeee Q lcl|NC_019423. 154 ERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKAL 233 (756) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~ 233 (756) +. T Consensus 144 d~------------------------------------------------------------------------------ 145 (472) T protein:vir:93 144 DE------------------------------------------------------------------------------ 145 (472) T ss_pred CC------------------------------------------------------------------------------ Confidence 21 Q ss_pred cCceeEEEechhheEe--CCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhcccccccccc Q lcl|NC_019423. 234 VNRPTVEMLNPNNVVI--DPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFK 311 (756) Q Consensus 234 ~g~~~ie~V~p~~~~~--Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 311 (756) .|+|++..++|.++++ |+.... +..+. .+.|.+..+ T Consensus 146 d~~~~i~~~~p~~~~~i~d~~~~~---~~~~~-ir~~~~~~~-------------------------------------- 183 (472) T protein:vir:93 146 EGEFKLFRVPAEQGIPIWTDKEHE---ELEAF-IRMYKLENE-------------------------------------- 183 (472) T ss_pred CCceEEEEEcccceEEEEcCCCCC---ceEEE-EEEEEeecc-------------------------------------- Confidence 1346778888888765 433322 22232 233321100 Q ss_pred ccccceEEEE---EEEEEeeccCCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHH Q lcl|NC_019423. 
312 DALRKKVVAY---EYWGFYDINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQA 388 (756) Q Consensus 312 d~s~~~V~v~---E~w~k~d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~ 388 (756) ..+++| ++|. +..++...... .-...+...+. ..|.+.+.+|++.+.. +.+|.|.+..++++++ T Consensus 184 ----~~~~~~~~~~~~~-~~~~~~~~~~~-~~~~~~~~~~~--~~~~~~~~vPvv~~~n-----n~~g~s~~e~v~~liD 250 (472) T protein:vir:93 184 ----TKVEYWDKVTVNY-YVYENGSLIPD-YSNNLENSKTH--FSTGSWGKIPFIPFKN-----NDLEISDIFMYKTLID 250 (472) T ss_pred ----eeEEEEecCeEEE-EEEecCeeeec-ccccccccccc--cccCCCCCcceEEecC-----CCCCCCchhhhHHHHH Confidence 001111 0000 00111110000 00001111222 2233446778876653 4579999999999999 Q ss_pred HHHHHHHHHHHHHHhhcCCceEeeccccCccchhhhhccccccccccccccccccccccCCCcchHHHHHHHHHHHHHHH Q lcl|NC_019423. 389 ILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQGNPSQSIMEHKFPELPQSAIVMTQMQNQEAES 468 (756) Q Consensus 389 ~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~e~ 468 (756) .+|..++.+.+.+...+.+.+++.-...+......... .....+....++.+.++..+.-.......++.+.+.+-. T Consensus 251 a~~~~~s~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~ 327 (472) T protein:vir:93 251 AYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLL---RYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIML 327 (472) T ss_pred HHHHHHHHHHHHHHHhcCceeEeecCCcccchhhHHHH---hhccccccCCCCcceeEeecCCHHHHHHHHHHHHHHHHH Confidence 99999999999999888887665422111111111111 111223333445566665444456667788888999999 Q ss_pred HhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhH Q lcl|NC_019423. 469 LTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKRED 548 (756) Q Consensus 469 ~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~ 548 (756) .+++++.+.+.-++.. 
+|.++...............+.|..+++.++++++.++-.- + ++ T Consensus 328 ~s~~p~~~~~~~~~n~--Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~----------~-~~------- 387 (472) T protein:vir:93 328 FGQAVDFSSDKFGSAP--SGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIK----------G-EH------- 387 (472) T ss_pred HhCCCCCCccccccCc--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC----------c-cc------- Confidence 9999887665433222 33334444444555556666667777777776666654211 0 11 Q ss_pred hcCcceEEEecccccHHHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhh-cCChhHHHHhhhccCCCChhhhhHHHHHH Q lcl|NC_019423. 549 LKGNFDIEVDINTAEIDNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAEL-KRMPDLAHELRTWQPQPDPMEEQLKQLAI 627 (756) Q Consensus 549 ~~~~~Dv~V~~g~a~~~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~-~~~~~~~~~l~~~~~q~~p~~~~~~q~~~ 627 (756) .+|.|.-+........+ .+..+..+...++.+. +++. .+..+...-++++ +. T Consensus 388 ----~~i~v~f~~~~p~~~~~--~~~~~~k~~giis~et-------~l~~l~~~~d~~~E~~ri--------------~~ 440 (472) T protein:vir:93 388 ----KDVDISFNYNKVANTEL--QVQTAQQSMGIVSHET-------VLENHPFVEDLQAELERI--------------EQ 440 (472) T ss_pred ----ceeeEEeCCCCCCCHHH--HHHHHHHHhccCchHH-------HHHhCCCCCCHHHHHHHH--------------HH Confidence 12333323222211111 1111111222233221 1111 1122221111111 00 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 628 QKAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARD 674 (756) Q Consensus 628 ~~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~ 674 (756) .+.+. ...++.. .. ...+. ....++. 
-..+.+ T Consensus 441 E~~~~-~~~~~~~---~~-------~~~d~--~~~~~~~--~~~~~e 472 (472) T protein:vir:93 441 EQMEY-NKQLPNL---DD-------GGADG--AQQQERS--NNKESE 472 (472) T ss_pred HHHHH-HHhccCc---Cc-------ccCCC--CCCCCCC--CcccCC Confidence 00000 0000000 00 00000 0000000 000000 No 57 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=99.63 E-value=1.3e-14 Score=96.80 Aligned_cols=434 Identities=11% Similarity=0.072 Sum_probs=198.5 Q ss_pred CchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCC---------------CCCCCCC----CcccCHHHHHHHH Q lcl|NC_019423. 23 KKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAK---------------PPKIKGR----SQVQPRLVRRQAE 83 (756) Q Consensus 23 ~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~---------------~~~~~gr----S~~v~~~v~~~~e 83 (756) .+ +..|...+......|...+.+..+..+||.|.-.-. ....+++ .+++.+..+..|+ T Consensus 1 ~~---~e~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd 77 (471) T protein:vir:10 1 ME---IEVIKKIISSQMVKHGKFVSQAAEAEKYYRNENDIKRKRKPADKKGAENEAKAEDNAFRNADNRISHNWHQLLLD 77 (471) T ss_pred CC---HHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhhhcccccccccccccccccceeccchhHHHHH Confidence 22 334455566666777777778889999999752100 0001111 2466666666666 Q ss_pred HHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeeeee Q lcl|NC_019423. 84 WRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIKTE 163 (756) Q Consensus 84 ~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~ 163 (756) ....-| ||.+.- |.+ +|.+.- ++++..+. 
|+-......+.++++..|.+.+.+||+.+ T Consensus 78 ~~~~yl----~G~p~~--~~~---~~~~~~----~~l~~~~~--n~~~~~~~~~~~~~~~~G~~~~~v~~d~~------- 135 (471) T protein:vir:10 78 QKKAYA----LTYPPT--FDV---DDKKVN----DMIVDVLG--DDYERISKQLCVNAGNAGIAWLHVWKDAS------- 135 (471) T ss_pred hhhhhh----cccCce--ecc---CChHHH----HHHHHHHh--cCHHHHHHHHHHHHhhCCeEEEEEEeeCC------- Confidence 655444 665533 332 343333 34444432 33333345678999999988887766411 Q ss_pred eeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEEEec Q lcl|NC_019423. 164 TPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVEMLN 243 (756) Q Consensus 164 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie~V~ 243 (756) .|++++..++ T Consensus 136 ----------------------------------------------------------------------~g~~~~~~~~ 145 (471) T protein:vir:10 136 ----------------------------------------------------------------------DNSFRYACVD 145 (471) T ss_pred ----------------------------------------------------------------------CCeeEEEEEc Confidence 1346788888 Q ss_pred hhheEe--CCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceEEEE Q lcl|NC_019423. 244 PNNVVI--DPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVAY 321 (756) Q Consensus 244 p~~~~~--Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~ 321 (756) |.++++ |..... -...+.|.|.+..+. ....+..+ T Consensus 146 p~~~~~i~d~~~~~----~~~~~ir~~~~~~~~---------------------------------------~~~~~~~~ 182 (471) T protein:vir:10 146 SKEVIPIYSKSLDK----KSIGVLRVYSSIDET---------------------------------------DGKNYTVY 182 (471) T ss_pred ccceEEEEcCCCCC----ceEEEEEEEEeeccC---------------------------------------CCceeEEE Confidence 888643 433211 122223333321100 00122233 Q ss_pred EEEEE-----eeccCCceeEE------EE-EEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHH Q lcl|NC_019423. 
322 EYWGF-----YDINDDGSLEP------IV-ATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAI 389 (756) Q Consensus 322 E~w~k-----~d~~~~g~~~~------~~-~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~ 389 (756) |+|.. +...+.+.... .. .....+........|...|.+|++.+.. +..|.|.+..++++++. T Consensus 183 ~vy~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----~~~~~sd~e~v~~liDa 257 (471) T protein:vir:10 183 EYWNDKECSFYRHEKEKPLEELETFQAISLIDTMNGDRSSDNSFKHDFGLVPFIPFKN-----NEIETNDLKPIKDLVDV 257 (471) T ss_pred EEEeCCcEEEEEecCCcccccccccccccccccccccccccccccCCCCceeEEEecc-----CCCCCCchHHHHHHHHH Confidence 33321 00000000000 00 0001122222233334446677776644 45688999999999999 Q ss_pred HHHHHHHHHHHHHhhcCCceEeeccccCccchhhhhccccccccc--cccccccccccccCCCcchHHHHHHHHHHHHHH Q lcl|NC_019423. 390 LGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNP--MQGNPSQSIMEHKFPELPQSAIVMTQMQNQEAE 467 (756) Q Consensus 390 iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~--~~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~e 467 (756) +|..+|.+.+.+...++|.+++.-...+...+............. .+......+.++..+.-.......++.+.+.+- T Consensus 258 ~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~ 337 (471) T protein:vir:10 258 YDKVFSGFVNDTDDVQEVIFVLTNYGGQDKQEFLEDLKRYKMIKMDNDGMGDQSGVTTIAIDIPTEARNLILERTKKQIF 337 (471) T ss_pred HHHHHHHHHHHHHHhhCceeeeecCCccccchhHHHhhcCCeEEecCCCCccCccceEEeecCChHHHHHHHHHHHHHHH Confidence 999999999999999988665442111111111111111111111 011122345565544445667778888899999 Q ss_pred HHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHh Q lcl|NC_019423. 468 SLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKRE 547 (756) Q Consensus 468 ~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d 547 (756) ..|++++.+.+..|+. |+. 
++..+...........-+.|..++++++++++.++..+ ++ T Consensus 338 ~~s~tp~~~~~~~gn~-Sg~--Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~------------d~------ 396 (471) T protein:vir:10 338 ISGQGVNPETDKLGNS-SGV--ALKFLYSLLELKAGNMETQFRSGYATLVKMILKHLGLS------------DK------ 396 (471) T ss_pred HHhCCcCCCcccccCc-cHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC------------CC------ Confidence 9998887765543332 333 35555555555555566667777776666666554211 11 Q ss_pred HhcCcceEEEecccccHHHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhh-cCChhHHHHhhhccCCCChhhhhHHHHH Q lcl|NC_019423. 548 DLKGNFDIEVDINTAEIDNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAEL-KRMPDLAHELRTWQPQPDPMEEQLKQLA 626 (756) Q Consensus 548 ~~~~~~Dv~V~~g~a~~~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~-~~~~~~~~~l~~~~~q~~p~~~~~~q~~ 626 (756) .+|.|.-......... ...+++..+...++.+. +++. ++..+...-+++ ++ T Consensus 397 -----~~i~i~f~~~~p~n~~--e~~~~~~kl~g~iS~et-------~~~~~p~v~D~~~E~er--------------i~ 448 (471) T protein:vir:10 397 -----LKIKQTWTRNSINNDT--EMAQVVSTLATITSREN-------VAKSNPIVEDWQDELRL--------------QK 448 (471) T ss_pred -----ceeEEEeCCCCCCCHH--HHHHHHHHHhccCchHH-------HHHhCCCCCCHHHHHHH--------------HH Confidence 1233333322221111 11122222222222211 1111 112111111111 10 Q ss_pred HHHHHHHHHHHH-HHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 627 IQKAQLENEELQ-SKIALNNAKAKEAASSGD 656 (756) Q Consensus 627 ~~~aq~e~~~~q-a~a~~~~a~a~~~~aq~~ 656 (756) ..+.+ +..+.. ..-...+.+. + T Consensus 449 ~E~~~-~~~~~~~~~~~~~~~e~-------~ 471 (471) T protein:vir:10 449 AEQEG-RSEKLYDMEEVEHESEV-------E 471 (471) T ss_pred HHHHH-HHhcccccCCCCCcccc-------C Confidence 00000 000000 0000000000 0 No 58 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=99.63 E-value=1.2e-14 Score=96.88 Aligned_cols=468 Identities=12% Similarity=0.055 Sum_probs=210.1 Q ss_pred CCc---------ccCCCCCCCcc--------ccccccCCCchHHHHHHHHHHHHHHHHhhHHH-HHHHHHHHHhccccC- Q lcl|NC_019423. 
1 MEH---------QDTFKPLPDPA--------QSEKLTDWKKEPSIQLLKGDLESAKPAHDAIM-SQIREWNDLMEVKGK- 61 (756) Q Consensus 1 ~~~---------~~~~~~~~~~~--------~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~-~~~~~~~~~y~~~~~- 61 (756) |++ ++++.+--.++ .-+.++.+.- ..+..+...|.... .+.++..+||.|.-. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~i~~~i~~~~~~~~~r~~~~~~yY~g~~~~ 72 (501) T protein:vir:96 1 MEQTLFTDSTGQERVLNLRFHRESRIRYRADNLEELMVNNW--------ELLKNFINHHKLRQAPRIQELLDYARGENHD 72 (501) T ss_pred CceeeeeecccceeccccccchhHHhhhcccccccccCChH--------HHHHHHHHHHHHHHHHHHHHHHHHhcCCCCc Confidence 443 22222211111 1112211111 11334444555443 467788999998521 Q ss_pred C-CCC--CCCCCC--cccCHHHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHH Q lcl|NC_019423. 62 A-KPP--KIKGRS--QVQPRLVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDD 136 (756) Q Consensus 62 ~-~~~--~~~grS--~~v~~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~ 136 (756) . ... ..++++ +++.+-....|+....-| +|.+. .|.....+ .-+...++++.++. .|+--..... T Consensus 73 i~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl----~g~p~--~~~~~~~~---~~~~~~~~l~~~~~-~n~~~~~~~~ 142 (501) T protein:vir:96 73 VLKSGRRKDNEMADKRAVHNYGRMISKFKTGYL----AGNPI--RVEYDDND---DNSQNDDAIKRIGR-INDLDSLNRT 142 (501) T ss_pred ccCccccCccccccceeecchHHHHHHHHhhhh----cccCe--eEeeCCcc---chhHHHHHHHHHHH-hcCHHHHHHH Confidence 1 111 122333 577777777777665544 55543 33333222 23445566766544 4444445668 Q ss_pred HHHHHhhcCceEEEEeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCc Q lcl|NC_019423. 137 YVHSIVDDGTGIARIGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEA 216 (756) Q Consensus 137 ~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~ 216 (756) +++++++.|.+.+.+||+.. 
T Consensus 143 ~~~~~~~~G~a~~~v~~ded------------------------------------------------------------ 162 (501) T protein:vir:96 143 LIRDLSQTGRAYEVIYRSEY------------------------------------------------------------ 162 (501) T ss_pred HHHHHhhcCeEEEEEEEcCC------------------------------------------------------------ Confidence 99999999999887765311 Q ss_pred ceeccCceeEEEeeeeecCceeEEEechhheE--eCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhh Q lcl|NC_019423. 217 TYAIQTGVTEVEVEKALVNRPTVEMLNPNNVV--IDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSP 294 (756) Q Consensus 217 ~~~~~~g~~~~~~~~~~~g~~~ie~V~p~~~~--~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~ 294 (756) |.+++..++|.+++ ||+.... ...+.+ +.|.... T Consensus 163 ------------------g~~~i~~~~p~~~~~v~d~~~~~---~~~~~v-~~~~~~~---------------------- 198 (501) T protein:vir:96 163 ------------------DETRIKRLSPLETFVIYDNSLED---NSIAAV-RYYNRGT---------------------- 198 (501) T ss_pred ------------------CceEEEEEccceeEEEEcCCCCC---ceEEEE-EEEEeec---------------------- Confidence 23567778888765 3443321 122222 2221100 Q ss_pred hhchhhhccccccccccccccceEEEEEEEEEeeccCCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcc Q lcl|NC_019423. 295 ITDPDHESKTPSDFQFKDALRKKVVAYEYWGFYDINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKREL 374 (756) Q Consensus 295 ~~~~~~~~~~~~~~~~~d~s~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~ 374 (756) . ...+.++++|.. +.+ +++. .++........|.+.|.+|++.+. ++. T Consensus 199 ------------------~-~~~~~~~~vyt~-----~~i---~~~~-~~~~~~~~~~~~~~~g~vPvv~~~-----nn~ 245 (501) T protein:vir:96 199 ------------------L-QSAKDVVEIYTD-----EHI---YTLD-ASDDFNEISVTTHAFGTVPITEYL-----NNI 245 (501) T ss_pred ------------------C-CCcEEEEEEEcC-----CcE---EEEe-eCCCceeccccccCCCccceEEec-----CCc Confidence 0 001334555532 111 1111 111122223334445778887664 345 Q ss_pred cCCchHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhhhhcccccccc------ccccccccccccccC Q lcl|NC_019423. 
375 FGEADAELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYN------PMQGNPSQSIMEHKF 448 (756) Q Consensus 375 ~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~------~~~~~~~~~i~~~~~ 448 (756) +|.|.+..++++++.+|..++.+.+.+...+++.+.+.-.................... ..+......+.++.. T Consensus 246 ~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 325 (501) T protein:vir:96 246 DGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQASDMKRTRLMQLKPPKSADGKEGTVKAEYLTK 325 (501) T ss_pred cCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecccccCcccchhhhhhcCeeeecccccccccccCcceeeEec Confidence 79999999999999999999999999998888876654322222211111111111100 001111223444444 Q ss_pred CCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhC Q lcl|NC_019423. 449 PELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFL 528 (756) Q Consensus 449 ~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~ 528 (756) +.-.......+..+...+-..|++++.+.|..++.. ++.++...............+.|..++++++++++.++.... T Consensus 326 ~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~--Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~ 403 (501) T protein:vir:96 326 SYDVSGAEAYKTRLNRDIHIFTNTPDMSDTNFSGNT--SGEALKYKLFGLDQDRVDTQSQFTKGLKRRYRLAARIGSLVN 403 (501) T ss_pred cCCHHHHHHHHHHHHHHHHHHhCCcccCcccccccc--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 444456677788899999999999988776433322 333354444555555666667788888888888777765322 Q ss_pred CCCcEEEEecCceeecCHhHhcCcceEEEecccccHHHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhc-CChhHHHH Q lcl|NC_019423. 529 SEKEVVRITNEQYVEIKREDLKGNFDIEVDINTAEIDNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELK-RMPDLAHE 607 (756) Q Consensus 529 ~~~r~iRI~g~~~v~i~~d~~~~~~Dv~V~~g~a~~~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~-~~~~~~~~ 607 (756) .... ++ -.+|.|.-.+...... ...+..+..+...++.+. +++.. 
...+...- T Consensus 404 ~~~~-----------~d------~~~i~i~f~~~~p~n~--~e~ad~~~kl~g~iS~et-------~~~~l~~v~D~~~E 457 (501) T protein:vir:96 404 EFKD-----------FD------ESLLKITFTPNLPKSL--NEQVSILTGLGGQVSQET-------ALSLSGLVESPNEE 457 (501) T ss_pred cccc-----------cc------cccceEEeCCCCCcCH--HHHHHHHHHHhccCchHH-------HHHhCCCCCCHHHH Confidence 1100 00 0123333333222111 111122222222233221 11111 12111111 Q ss_pred hhhccCCCChhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 608 LRTWQPQPDPMEEQLKQLAIQKAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARDME 676 (756) Q Consensus 608 l~~~~~q~~p~~~~~~q~~~~~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~~ 676 (756) ++ + .+.|.+++.......+..-.......+.... .+. ..+.+.+ T Consensus 458 ~~--------------r-----i~~E~~~~~~~~~~~~~~~~~~~~~~~~~e~-----~~d-~~e~~~~ 501 (501) T protein:vir:96 458 LD--------------K-----INKEMSEIDFKGYSNDFNEHVGKYTDEVKET-----HTD-DFEREYE 501 (501) T ss_pred HH--------------H-----HHHHHHHhhccccccchhhcccccCCcCCCC-----CCC-ccccccC Confidence 11 1 1111111000000000000000000000000 000 0000111 No 59 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=99.63 E-value=3.7e-14 Score=94.20 Aligned_cols=421 Identities=8% Similarity=-0.012 Sum_probs=191.0 Q ss_pred HHHHHHHHhhHHHHHHHHHHHHhccccCC---C-CCCCCCCC--cccCHHHHHHHHHHHHHHHHhhcCCCCEEEEecCCc Q lcl|NC_019423. 34 DLESAKPAHDAIMSQIREWNDLMEVKGKA---K-PPKIKGRS--QVQPRLVRRQAEWRYAPLSEPFLSSSKLFKLTPVTF 107 (756) Q Consensus 34 ~~~~a~~~~~~~~~~~~~~~~~y~~~~~~---~-~~~~~grS--~~v~~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~ 107 (756) .+.. .+.....+.++-.+||.|.-.- . ....++++ +++.+..+..|+....-| ||.+.-+.+ -.. 
T Consensus 1 ~~~~---~~~~~~~r~~~l~~yy~g~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l----~g~~~~~~~--~~~ 71 (440) T protein:vir:95 1 MLAA---FLGSQKQRLAILASYAQGDNFSILSGHRRLDDEKADYRVRHKWGGYISSFATGYV----IGNPVSIGV--MEG 71 (440) T ss_pred Chhh---HHHHHHHHHHHHHHHhccCCcccccccccccccCCcceeecchHHHHHHhhhhhe----eccCceEee--CCC Confidence 1222 2223344566777899876321 1 11123443 567777777777655444 776644433 233 Q ss_pred chHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHH Q lcl|NC_019423. 108 EDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQL 187 (756) Q Consensus 108 ~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 187 (756) +|.+... .+..+ ...|+--.....+.+++++.|.+.+.+|++. T Consensus 72 ~~~~~~~----~l~~~-~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~-------------------------------- 114 (440) T protein:vir:95 72 GSADQLS----TIKDI-EWQNDINALNSDLAFDASVYGRAYEYHFRDK-------------------------------- 114 (440) T ss_pred ccHHHHH----HHHHH-HHhcCHhHHHHHHHHHHhhcCeEEEEEEecC-------------------------------- Confidence 3333322 33222 2344444445678899999999988875420 Q ss_pred hhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEEEechhheEe--CCCCcCccccCceEEE Q lcl|NC_019423. 188 QAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVEMLNPNNVVI--DPSCNGDLDKALYAVI 265 (756) Q Consensus 188 ~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie~V~p~~~~~--Dp~a~~d~~da~~v~~ 265 (756) .|.|++..++|.++++ |+.... ...+.+ T Consensus 115 ----------------------------------------------~~~~~i~~~~p~~~~~~~d~~~~~---~~~~~i- 144 (440) T protein:vir:95 115 ----------------------------------------------DKVDRVVLISPLEMFVIRDLTVEQ---NIIAAV- 144 (440) T ss_pred ----------------------------------------------CCceEEEEEcccceEEEEcCCCCC---ceEEEE- Confidence 1346677788888765 443321 122222 Q ss_pred EeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceEEEEEEEEEeeccCCceeEEEEEEEECC Q lcl|NC_019423. 
266 SFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVAYEYWGFYDINDDGSLEPIVATWIGS 345 (756) Q Consensus 266 ~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g~ 345 (756) ++|...+ ...+++|.. +++..++...-..+ T Consensus 145 ~~~~~~~---------------------------------------------~~~~~vyt~-----~~~~~~~~~~~~~~ 174 (440) T protein:vir:95 145 HLPIYAD---------------------------------------------KVNMTVYTK-----DKVITYKPYSNNSV 174 (440) T ss_pred EEEEecC---------------------------------------------ceEEEEEeC-----CeEEEEEEecCCcc Confidence 2221100 001122211 11100000000001 Q ss_pred EEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCc---cch- Q lcl|NC_019423. 346 TLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDT---LNR- 421 (756) Q Consensus 346 ~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~---~~~- 421 (756) .....+..|.+.|.+|+|.++ ++.+|.|.++.++++++.+|..++.+.+.+...+.+.+++. |.... ..+ T Consensus 175 ~~~~~~~~~~~~g~vPvv~~~-----n~~~g~sd~e~v~~lida~~~~~s~~~~~~~~~~~~~~v~~-g~~~~~~~~~e~ 248 (440) T protein:vir:95 175 RLVVDDVKKHSYNDVPVVEWW-----NNRFRMGDYESEISLIDAYDAGQSDTANYMSDLNDAMLLVK-GDLDGIKLSPED 248 (440) T ss_pred ceeecceeeccCceeeEEEee-----CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcceeeee-cccccCCCCccc Confidence 111122223334667777654 34578999999999999999999999999998888766543 32111 111 Q ss_pred -hhhhcccc-cc-c--cccccccccccccccCCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHH Q lcl|NC_019423. 422 -RRYDDGQD-YE-Y--NPMQGNPSQSIMEHKFPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALD 496 (756) Q Consensus 422 -~~~~~~~~-~~-~--~~~~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~ 496 (756) ........ .. . ........+.+.++..+.-.......++.+...+-..|++++.+.+.-++.. +|.++..... 
T Consensus 249 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~--Sg~Al~~~~~ 326 (440) T protein:vir:95 249 AAKMKDANMLFLKTGISTTGQQTTADASYIYKQYDVNGTEAYKNRLANDIHRFSRIPNLDDDRFNSTS--SGIALLYKMI 326 (440) T ss_pred hhhhhhccceecccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccc--hHHHHHHHHH Confidence 11111000 00 0 0001112233455544443456667888999999999999987766433222 3444555555 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHhcCcceEEEecccccH--HHHHHHHHHH Q lcl|NC_019423. 497 AASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDLKGNFDIEVDINTAEI--DNQKSQDLGF 574 (756) Q Consensus 497 aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~~~~~Dv~V~~g~a~~--~~~~~q~l~~ 574 (756) ..........+.|..++++++++++.++..... .+ ++ ..++.|.-..... ....++. T Consensus 327 ~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~---------~~---~~------~~~v~i~f~~~~p~~~~~~ad~--- 385 (440) T protein:vir:95 327 GLEQVRKDKETYFTKALRRRYELISNIHKAING---------PV---IE------ANKLTFTFHPNIPQDVWTEIKA--- 385 (440) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCC---------cc---cc------cccceEEeCCCCCCCHHHHHHH--- Confidence 555666666677778888777776666543211 10 00 1233333333222 1122222 Q ss_pred HHHHhhccCCHhHHHHHHHHHHhhcCChhHHHHhhhccCCCChhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 575 MVQTLGNTVDQSITLSLVAKIAELKRMPDLAHELRTWQPQPDPMEEQLKQLAIQKAQLENEELQSKIALNNAKAKEAASS 654 (756) Q Consensus 575 llq~~~~~~~~~~~~~~l~~l~e~~~~~~~~~~l~~~~~q~~p~~~~~~q~~~~~aq~e~~~~qa~a~~~~a~a~~~~aq 654 (756) +..+...++.+.. ++..+.- ++. ++.+..+.+.+....+.......... .+.. T Consensus 386 -~~kl~g~iS~et~-------~~~l~~~-------------d~~----~E~~ri~~E~~~~~~~~~~~~~~~~~--~~~~ 438 (440) T protein:vir:95 386 -YIEAGGEISQETL-------MENASFT-------------DYK----TEHSRILKQGGSSDLEIGQIVGDADV--GQAD 438 (440) T ss_pred -HHHHhccCcHHHH-------HHhCCCC-------------CcH----HHHHHHHHHHHHhhhhHHhhccCCCC--CCcC Confidence 2222222332221 1111111 110 00111111000000000000000000 0000 Q ss_pred HHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 
655 GDLKDLDYLEQESGTKHARD 674 (756) Q Consensus 655 ~~~~~~~~~~q~~~~k~~~~ 674 (756) .| T Consensus 439 ------------------~e 440 (440) T protein:vir:95 439 ------------------TE 440 (440) T ss_pred ------------------CC Confidence 00 No 60 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=99.62 E-value=1.7e-13 Score=90.64 Aligned_cols=459 Identities=10% Similarity=0.049 Sum_probs=205.8 Q ss_pred CC----cccCCCCCCCccccccc---c--CCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCC---C----- Q lcl|NC_019423. 1 ME----HQDTFKPLPDPAQSEKL---T--DWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKA---K----- 63 (756) Q Consensus 1 ~~----~~~~~~~~~~~~~~~~~---~--~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~---~----- 63 (756) |. |--++.=|..|+.++-+ - +...+. +...+....+.|...+.+..+..+||.|.-.- . T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~----~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~ 76 (483) T protein:vir:12 1 MAQALIKGGNILYPSQPTQTEIFDAIVRTNNKPET----LEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDA 76 (483) T ss_pred CccchhcCCceeecCcchhhhhhhcccccCCchhh----HHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccc Confidence 11 11111111122221111 1 122222 23345555566777777888999999986211 0 Q ss_pred ---CCCCCCCCcccCHHHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHH Q lcl|NC_019423. 64 ---PPKIKGRSQVQPRLVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHS 140 (756) Q Consensus 64 ---~~~~~grS~~v~~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~ 140 (756) ....+-..+++.+..+..|+.....| +|.+ +.|. .+|.+..+ +++..+. 
++-......+.++ T Consensus 77 ~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l----~G~p--~~~~---~~d~~~~~----~l~~~~~--n~~~~~~~~~~~~ 141 (483) T protein:vir:12 77 TGAVDPLKPDDRMITNFHANLVDQKVSYI----VGKP--IAFK---HTDDEVVK----RIDEVLG--NRFDDKLHSVLTG 141 (483) T ss_pred cccccccccccccccchHHHHHHHHhhhh----cccC--ceec---cCChHHHH----HHHHHHh--ccHHHHHHHHHHH Confidence 01112224578888888888776655 6644 3332 23444333 4443332 3444555667899 Q ss_pred HhhcCceEEEEeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceec Q lcl|NC_019423. 141 IVDDGTGIARIGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAI 220 (756) Q Consensus 141 al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~ 220 (756) ++++|.+.+.+||+. T Consensus 142 ~~~~G~~y~~v~~d~----------------------------------------------------------------- 156 (483) T protein:vir:12 142 ASNKGIEWLHPYLDE----------------------------------------------------------------- 156 (483) T ss_pred HhhCCeEEEEEEEcC----------------------------------------------------------------- Confidence 999999877775531 Q ss_pred cCceeEEEeeeeecCceeEEEechhheEe--CCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhch Q lcl|NC_019423. 221 QTGVTEVEVEKALVNRPTVEMLNPNNVVI--DPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDP 298 (756) Q Consensus 221 ~~g~~~~~~~~~~~g~~~ie~V~p~~~~~--Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~ 298 (756) .|+|++..++|.++++ |++.... ..+. .|.|..... . T Consensus 157 -------------d~~~~i~~~~p~~~~~v~d~~~~~~---~~~~-ir~~~~~~~-----------~------------- 195 (483) T protein:vir:12 157 -------------EGEFKLFRVPAEQGIPIWTDKEHEE---LEAF-IRMYKLENE-----------T------------- 195 (483) T ss_pred -------------CCceEEEEEcccceEEEEcCCCCCc---eEEE-EEEEEeecc-----------e------------- Confidence 1346788889998754 5443222 2222 233321000 0 Q ss_pred hhhccccccccccccccceEEEEEEEEEeeccCCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCc Q lcl|NC_019423. 
299 DHESKTPSDFQFKDALRKKVVAYEYWGFYDINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEA 378 (756) Q Consensus 299 ~~~~~~~~~~~~~d~s~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g 378 (756) .... + +. .+|..+.+ ++...... .-...+...+... |.+.+.+|++.++. +.+|.| T Consensus 196 ---~~~~----y-~~--~~v~~~~~------~~~~~~~~-~~~~~~~~~~~~~--~~~~g~vPvv~~~n-----n~~g~s 251 (483) T protein:vir:12 196 ---KVEY----W-DK--VTVNYYVY------ENGSLIPD-YSNNLENSKTHFS--TGSWGKIPFIPFKN-----NDLEIS 251 (483) T ss_pred ---EEEE----E-ec--CeEEEEEE------eCCeeeec-ccccccccccccc--cCCCCccceEEecC-----CCCCCC Confidence 0000 0 00 01111111 11110000 0000111122222 33346677776543 457899 Q ss_pred hHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhhhhccccccccccccccccccccccCCCcchHHHHH Q lcl|NC_019423. 379 DAELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQGNPSQSIMEHKFPELPQSAIVM 458 (756) Q Consensus 379 ~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 458 (756) .+..++++++.+|..++.+.+.+...+.+.+.+.-...+...+..... .....+....++.+.++..+.-....... T Consensus 252 d~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~l~~~~~~~~~~~~ 328 (483) T protein:vir:12 252 DIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLL---RYYGAIKVSDNGGVDTIQVEVPVENSKKY 328 (483) T ss_pred chhhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhHHHhh---hhccccccCCCCcceEEeecCCHHHHHHH Confidence 999999999999999999999999888887665422222111211111 11122333344556666655445666778 Q ss_pred HHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEec Q lcl|NC_019423. 459 TQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITN 538 (756) Q Consensus 459 l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g 538 (756) ++.+.+.+-..|++++.+.+.-++. 
.++.++...............+.|..+++++++++++++-.- + T Consensus 329 ~~~l~~~I~~~s~~p~~~~~~~~~n--~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~~~~~----------~ 396 (483) T protein:vir:12 329 LDELYQKIMLFGQAVDFSSDKFGSA--PSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIK----------G 396 (483) T ss_pred HHHHHHHHHHHhCCCCCCccccccC--cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC----------C Confidence 8888999999999988765533322 233335545555555566666777777777777666654210 1 Q ss_pred CceeecCHhHhcCcceEEEecccccHH--HHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCChhHHHHhhhccCCCC Q lcl|NC_019423. 539 EQYVEIKREDLKGNFDIEVDINTAEID--NQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMPDLAHELRTWQPQPD 616 (756) Q Consensus 539 ~~~v~i~~d~~~~~~Dv~V~~g~a~~~--~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~~~~~~l~~~~~q~~ 616 (756) ++ .++.|.-...... ...++.+ ..+...++.+.... .+....+...-++++....+ T Consensus 397 -~~-----------~~i~v~f~~~~p~~~~~~a~~~----~kl~GiiS~et~~~------~~~~v~d~~~E~~ri~~E~~ 454 (483) T protein:vir:12 397 -EH-----------KDVDISFNYNKVANTELQVQTA----QQSMGIVSHETVLE------NHPFVEDLQAELERIEQEQM 454 (483) T ss_pred -cc-----------ceeeEEeCCCCCCCHHHHHHHH----HHHhccCchHHHHH------hCCCCCCHHHHHHHHHHHHH Confidence 11 1233322222221 1122211 12222232222111 11122222222222111000 Q ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 617 PMEEQLKQLAIQKAQLENEELQSKIALNNAKAKEAASSGD 656 (756) Q Consensus 617 p~~~~~~q~~~~~aq~e~~~~qa~a~~~~a~a~~~~aq~~ 656 (756) .. ++..........-......+... 
.+.+ T Consensus 455 ~~---------~~~~~~~~~~~~d~~~~~~~~~~--~e~e 483 (483) T protein:vir:12 455 EY---------NKQLPNLDDGGADGAQQQERSNN--KESE 483 (483) T ss_pred HH---------HhhcccccccccCCcccCCCCCc--ccCC Confidence 00 00000000000000000000000 0000 No 61 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=99.62 E-value=1.5e-13 Score=90.86 Aligned_cols=443 Identities=11% Similarity=-0.002 Sum_probs=200.6 Q ss_pred CCcccCCCCCCCccccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCC---CCCCCCCC--cccC Q lcl|NC_019423. 1 MEHQDTFKPLPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAK---PPKIKGRS--QVQP 75 (756) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~---~~~~~grS--~~v~ 75 (756) |-+.|.+ .-.+-.+++.+.+...|... +.....+.++..+||.|.-... ....++++ +++. T Consensus 1 ~~~~~~~-------~~~~~~~~~~~~~~~~i~~~-------~~~~~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~~ki~~ 66 (489) T protein:vir:99 1 MLQEDFE-------AIDYESKLWIDQLKNYISRF-------KAEQLERLKELKRYYLGDNNIKYRPAKTDKYAADNRIAS 66 (489) T ss_pred CCcccee-------eeCCCCCCCHHHHHHHHHHH-------HHHHHHHHHHHHHHhcccCccccccccccccCCcceeec Confidence 5555544 33333344444444443332 1233455678889999763211 11122333 5777 Q ss_pred HHHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcc-hHHHHHHHHhhcCceEEEEeee Q lcl|NC_019423. 76 RLVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVK-LVDDYVHSIVDDGTGIARIGWE 154 (756) Q Consensus 76 ~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~-~~~~~v~~al~~g~gi~k~~w~ 154 (756) +..+..|+....-| ||.+. .|.+ +|.. ..++++.++.. +.+. ....+.+++++.|.+++.+|+. 
T Consensus 67 n~~~~iv~~~~~~l----~g~~~--~~~~---~d~~----~~~~l~~~~~~--n~~~~~~~~~~~~~~~~G~~~~~v~~~ 131 (489) T protein:vir:99 67 DFAKYITVFEQGYM----LGVPV--EYKN---ENKD----LQAAIDLMSVR--NNEDYHNVKIKTDLSIYGRAYELLTVE 131 (489) T ss_pred chHHHHHHHHhhhh----ccCCc--eeec---CChh----HHHHHHHHHhh--cChhHHHHHHHHHHhhCCeEEEEEeec Confidence 77777777776555 66443 3333 3333 34466655442 3554 4557889999999998887653 Q ss_pred eeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeec Q lcl|NC_019423. 155 RKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALV 234 (756) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~ 234 (756) ... ... T Consensus 132 ~~~--------------------------------------------------------------------------d~~ 137 (489) T protein:vir:99 132 KID--------------------------------------------------------------------------DKK 137 (489) T ss_pred cCc--------------------------------------------------------------------------CCC Confidence 110 012 Q ss_pred CceeEEEechhheEe--CCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccc Q lcl|NC_019423. 235 NRPTVEMLNPNNVVI--DPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKD 312 (756) Q Consensus 235 g~~~ie~V~p~~~~~--Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d 312 (756) ++++|..|+|.++++ |+... ....+.++ +|... + T Consensus 138 ~~~~i~~~~p~~~~~v~dd~~~---~~~~~~i~-~~~~~----------------------------------------~ 173 (489) T protein:vir:99 138 TEVKLYQLPAEQTFVIYDDTYQ---RNSLMAVH-FYDID----------------------------------------Y 173 (489) T ss_pred cceEEEEEcccceEEEEcCCCC---CceEEEEE-EEEEe----------------------------------------c Confidence 458889999999754 33322 11222222 22110 0 Q ss_pred cccceEEEEEEEEEeeccCCceeEEEEEEEECCEEE-EecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHH Q lcl|NC_019423. 
313 ALRKKVVAYEYWGFYDINDDGSLEPIVATWIGSTLI-RMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILG 391 (756) Q Consensus 313 ~s~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g~~~L-~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN 391 (756) .....+.++++|.. +.+.. +.+...+...+ .....|...|.+|++++.. +.+|.|.+..++++++.+| T Consensus 174 ~~~~~~~~~~~y~~-----~~i~~-~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~~~s~~~~v~~liDa~d 242 (489) T protein:vir:99 174 GSGKRKQIIKAYTS-----DTIYT-YEDYNLETKGMRLKDYEGHFFKGVPVNEYAN-----NEERTGAYESVLDNIDAYD 242 (489) T ss_pred CCCceEEEEEEEeC-----CcEEE-EEecCCCcccceecccccccCCceeEEEeec-----CCCCCCchhhhHHHHHHHH Confidence 00012344555531 11111 11111111111 1122233346778877653 4468899999999999999 Q ss_pred HHHHHHHHHHHhhcCCceEeecccc-Cccchhhhhccccc-cc----cc--------ccc-------ccccccccccCCC Q lcl|NC_019423. 392 ATMRGMIDLLGRSANGQRGYPKGML-DTLNRRRYDDGQDY-EY----NP--------MQG-------NPSQSIMEHKFPE 450 (756) Q Consensus 392 ~~~~~~~d~l~~~~~~~~~~~~gav-~~~~~~~~~~~~~~-~~----~~--------~~~-------~~~~~i~~~~~~~ 450 (756) ..++.+.+.+...+.+.+.+- |.. ...+.......... .. .. +.. .....+.++..+. T Consensus 243 ~~~s~~~~~~~~~~~~~l~i~-g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~ 321 (489) T protein:vir:99 243 LSQSELANFQQDSVNALLVIA-GNAYTGADENDYLDDGRLNPNGRLAISIGFKKAQVLILDDNPNPNGVKPQAYFLKKEY 321 (489) T ss_pred HHHHHHHHHHHHhhhhhhhhc-cCCcccccchhhhhhcccccccccccccccccceeeeeccccCccccccceeeeeecC Confidence 999999999988877665542 221 11110000000000 00 00 000 0011233333333 Q ss_pred cchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCC Q lcl|NC_019423. 451 LPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSE 530 (756) Q Consensus 451 ~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~ 530 (756) -.......+..+.+.+-..||+++.+.+.-+. +.++.++...............+.|..+++.+.++++.++...... 
T Consensus 322 ~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~--n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~ 399 (489) T protein:vir:99 322 DTAGSEAYKNRLVADILRFTFTPDTQDMKFSG--VQSGESMKYKLMASDNYREKQERLFKKGLMRRLRLAANIWAIKGNE 399 (489) T ss_pred ChHHHHHHHHHHHHHHHHHhCCcccccccccc--cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCc Confidence 34455566788888888999988765432111 1233334444445555556666777778888888777776432111 Q ss_pred CcEEEEecCceeecCHhHhcCcceEEEecccccHHHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCChhHHHHhhh Q lcl|NC_019423. 531 KEVVRITNEQYVEIKREDLKGNFDIEVDINTAEIDNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMPDLAHELRT 610 (756) Q Consensus 531 ~r~iRI~g~~~v~i~~d~~~~~~Dv~V~~g~a~~~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~~~~~~l~~ 610 (756) .+...-..+..++.+...+.-....++.+.. +...++.+.+...+ ....-++...-+++ T Consensus 400 -------------~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~k----l~giis~et~~~~l----~~v~~~d~~~E~~r 458 (489) T protein:vir:99 400 -------------ATTYSLVNDTSIVFTPNLPQNDNEIVTAAQN----LYGIVSDQTIFEIL----NTVTGVDAEAELKR 458 (489) T ss_pred -------------cccccccccceEEeCCCCCcCHHHHHHHHHH----HhccCCHHHHHHhc----CCCCchhHHHHHHH Confidence 0000000111222222222111112222221 11223332221111 10000111111111 Q ss_pred ccCCC----------------ChhhhhHHHH Q lcl|NC_019423. 611 WQPQP----------------DPMEEQLKQL 625 (756) Q Consensus 611 ~~~q~----------------~p~~~~~~q~ 625 (756) ++.+. ++.+....++ T Consensus 459 i~~E~~~~~~~~~~~~~~~~~~~~~~~~~~p 489 (489) T protein:vir:99 459 LKEEADKKQSLPEPRLVGDASGQEEPTAEKP 489 (489) T ss_pred HHHHHHHHhccccccccCCCCCCcCCCCCCC Confidence 11000 0000000011 No 62 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=99.61 E-value=3.4e-13 Score=88.96 Aligned_cols=463 Identities=12% Similarity=0.072 Sum_probs=203.8 Q ss_pred CCCccccccccCCCch-HHHHHHHHHHHHH-HHHhhHHHHHHHHHHHHhccccCC-CCC--CCCCC----CcccCHHHHH Q lcl|NC_019423. 
10 LPDPAQSEKLTDWKKE-PSIQLLKGDLESA-KPAHDAIMSQIREWNDLMEVKGKA-KPP--KIKGR----SQVQPRLVRR 80 (756) Q Consensus 10 ~~~~~~~~~~~~~~~~-~~~~~l~~~~~~a-~~~~~~~~~~~~~~~~~y~~~~~~-~~~--~~~gr----S~~v~~~v~~ 80 (756) |-+.- +.++.+|-+. -..+.|+..+++. ...++..+.+.++|.+||.|.... ..+ ...|+ -+++.+.... T Consensus 1 m~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~yy~g~~~~~~~~~~~~~~~~~~~~~~~~n~~k~ 79 (496) T protein:vir:38 1 MINQI-IAGVKGVMRRMGLLKALKDVKDHKKVNANDEDYKYIDMWKRLYQGHYAEWHNLNYEHNGNPVNRRQLSMNLPKV 79 (496) T ss_pred ChhHH-HHHHHHHHHHhccchhhHHHHhcCCCcCCHHHHHHHHHHHHHhcCCCchhhcchhccCCCccccceeecchHHH Confidence 11100 1111111110 0112222222222 122556667788999999875210 000 11122 2233333333 Q ss_pred HHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeee Q lcl|NC_019423. 81 QAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKI 160 (756) Q Consensus 81 ~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~ 160 (756) .++ .+...+||-..-+.+ +|.+.++ +|+-++. .++-...+..++.+|+..|.+++++||+.. T Consensus 80 i~~----~~a~~l~~~p~~i~~-----~d~~~~e----~l~~~~~-~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~---- 141 (496) T protein:vir:38 80 TAK----YMSKLLFNEKVKINI-----DDKAAEE----FVLNVLK-TNGFTKNMERYIEYGEAMGGFVIKVYHDGN---- 141 (496) T ss_pred HHH----HHhhhhhCCcceEee-----CChHHHH----HHHHHHh-ccCHHHHHHHHHHHHhhhCcEEEEEEEcCC---- Confidence 333 223334655544444 4544444 5655543 345566677899999999999999988621 Q ss_pred eeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEE Q lcl|NC_019423. 
161 KTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVE 240 (756) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie 240 (756) |+|+++ T Consensus 142 --------------------------------------------------------------------------~~~~i~ 147 (496) T protein:vir:38 142 --------------------------------------------------------------------------KNVKVS 147 (496) T ss_pred --------------------------------------------------------------------------CcEEEE Confidence 336788 Q ss_pred EechhheEeCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceEEE Q lcl|NC_019423. 241 MLNPNNVVIDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVA 320 (756) Q Consensus 241 ~V~p~~~~~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v 320 (756) .|+|+.||+=..-..++..+-|+-+ + +.+ +-.+-.++.+.+. +.. -.| . T Consensus 148 ~v~~~~~~P~~~~~~~~~~~~f~~~--~-~~~-----~~~y~~le~h~~~---------------------~~~-~~I-~ 196 (496) T protein:vir:38 148 FATADCMYPLSNDSENVDECVIANS--F-HKN-----NKYYTLLEWNEWQ---------------------GDV-YTV-T 196 (496) T ss_pred EEcccceEEEEecCCcEEEEEEEEE--E-EeC-----CeEEEEEEEEEEe---------------------Cce-EEE-E Confidence 8888887731111123333333311 1 100 0000011110000 000 000 1 Q ss_pred EEEEEEeeccCCceeEEEEEEEECCEEEEecccccCCCccceEEeee----eeecCcccCCchHHHhHHHHHHHHHHHHH Q lcl|NC_019423. 321 YEYWGFYDINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPY----MPRKRELFGEADAELLGDNQAILGATMRG 396 (756) Q Consensus 321 ~E~w~k~d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~----~~~~~~~~G~g~v~~~~d~Q~~iN~~~~~ 396 (756) +++|...+.+.-|......-++. -+..........+.||+.+.. ....++.+|.|.+.+++++++.+|..++. 
T Consensus 197 ~~~y~~~~~~~~g~~v~~~~~~~---~~~~~~~~~~~~~~~f~~~~~~~~N~~~~~~p~G~Sd~~~~~~lid~ld~~~s~ 273 (496) T protein:vir:38 197 TELYQSDDPNELGTKVSLTLLFD---DIEPVVPLPDFTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDS 273 (496) T ss_pred EEEEecCCccccCcccccccccc---ccccceeecCCCcceEEEecCCcccccccCCcCCCchHhhHHHHHHHHHHHHHH Confidence 11111000000000000000000 000000001123455665533 23557789999999999999999999999 Q ss_pred HHHHHHhhcCCceEeeccccCccchh------hhhccccccccccccc---cccccccccCCCc-chHHHHHHHHHHHHH Q lcl|NC_019423. 397 MIDLLGRSANGQRGYPKGMLDTLNRR------RYDDGQDYEYNPMQGN---PSQSIMEHKFPEL-PQSAIVMTQMQNQEA 466 (756) Q Consensus 397 ~~d~l~~~~~~~~~~~~gav~~~~~~------~~~~~~~~~~~~~~~~---~~~~i~~~~~~~~-~~~~~~~l~~~~~~~ 466 (756) ..+.+.. +..++.++...+...... .+... ...+..+... ....++... +.+ .......++.+...+ T Consensus 274 ~~~~~~~-~~~~i~v~~~~l~~~~~~~g~~~~~~~~~-~~~~~~~~~~~~~~~~~i~~~~-~~i~~e~~~~~l~~~l~~i 350 (496) T protein:vir:38 274 YYQEFKL-GKKKVLVPSSFVKTAVNLDGSTTQYFDST-DEAFFLYQGDQDDNGKAIKDIS-VEIRSTEFIESINAMLRIY 350 (496) T ss_pred HHHHHhh-cccceecchHHhhccCCCCCccccCCCCc-cceEEEeecCCCcccccceeec-cccCHHHHHHHHHHHHHHH Confidence 9998865 677788876665321110 01100 0011111111 112344333 333 345566777788888 Q ss_pred HHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCH Q lcl|NC_019423. 467 ESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKR 546 (756) Q Consensus 467 e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~ 546 (756) ...+|++....|.++.. ..||+++....+..-.....+.+.|..+++++++.++.+...+..-. |.. 
T Consensus 351 ~~~~g~~~~~f~~~~~g-~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~------g~~------ 417 (496) T protein:vir:38 351 AMQVGLSAGTFTFDENG-LKTATEVVSEKSETYQTKNSHSQLIEQGIKEMIVSILEVGKFIEAYS------GEV------ 417 (496) T ss_pred HHhhCCChhhcCCCccc-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc------CCC------ Confidence 88889999888865433 35777776555555555566777788899999999988775432100 000 Q ss_pred hHhcCcceEEE--ecccccHHHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCCh--hHHHHhhhccCC---CChhh Q lcl|NC_019423. 547 EDLKGNFDIEV--DINTAEIDNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMP--DLAHELRTWQPQ---PDPME 619 (756) Q Consensus 547 d~~~~~~Dv~V--~~g~a~~~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~--~~~~~l~~~~~q---~~p~~ 619 (756) ....+++| +.+...-.....+.++.+.. . ..++...+ +....+.. ++.+.+.+++.. ..|+. T Consensus 418 ---~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~-~-GiiS~et~------l~~~~~~~d~ea~~el~ri~~E~~~~~~~~ 486 (496) T protein:vir:38 418 ---VELDTITVDFDDSIAQDEDTTINRYTNAKN-Q-GMIPLKIA------LQRAWNITEAEADEWAEMLAKEKQAEMPNN 486 (496) T ss_pred ---CCccceEEEeCCCCCCCHHHHHHHHHHHHh-c-CCCCHHHH------HHhcCCCChHHHHHHHHHHHHhhhccCccc Confidence 01122333 23322212222232333221 1 22332211 12233332 122222222110 00000 Q ss_pred ---hhHHHHH Q lcl|NC_019423. 620 ---EQLKQLA 626 (756) Q Consensus 620 ---~~~~q~~ 626 (756) ...-..+ T Consensus 487 d~~~~~~~~e 496 (496) T protein:vir:38 487 DMNGIFGEEE 496 (496) T ss_pred cccCCCCCCC Confidence 0000000 No 63 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=99.61 E-value=8.2e-15 Score=97.81 Aligned_cols=460 Identities=9% Similarity=0.046 Sum_probs=202.2 Q ss_pred ccCC-CCCCCccccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCC--CC-------CCCCC--C Q lcl|NC_019423. 
4 QDTF-KPLPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAK--PP-------KIKGR--S 71 (756) Q Consensus 4 ~~~~-~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~--~~-------~~~gr--S 71 (756) +-.+ -|.+-|--++-+....+ --......+......|...+.+.++..+||+|..... .. ..+++ . T Consensus 1 ~~~~~~~~~~~~~~e~~~~~~~--~~~~~~~~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~~~ 78 (478) T protein:vir:10 1 MISINWPWDKPYHEQVVEQIKP--KYETQEEMILRLVREHKENIDNITMGERYYNHHPDILDAPPKRDVNGDYDETKPDW 78 (478) T ss_pred CccccCCCCchhHHHHHHHHhh--ccCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhccccccccccccccccccc Confidence 4444 33322222221110000 0001122355556667777888889999999753210 01 11223 2 Q ss_pred cccCHHHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEE Q lcl|NC_019423. 72 QVQPRLVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARI 151 (756) Q Consensus 72 ~~v~~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~ 151 (756) +++.+..+..|+....-| ||.+.-+. .+|.+..+ .+..++. ++-......+++++++.|.+++.+ T Consensus 79 ki~~n~~~~ivd~~~~~l----~g~~~~~~-----~~~d~~~~----~l~~~~~--n~~~~~~~~~~~~~~~~G~~~~~~ 143 (478) T protein:vir:10 79 RMYTNYHQNLVDQKVAYA----VANPVTFG-----VDNDKALK----QIQHTLN--HKWDDKLVDILTAASNKGIEWVQP 143 (478) T ss_pred eeccchHHHHHHHHHhhh----ccCCeeee-----cCChHHHH----HHHHHHh--cCHHHHHHHHHHHHHhcCeEEEEE Confidence 577777777777665544 66554442 23444333 3333332 344555667889999999998887 Q ss_pred eeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeee Q lcl|NC_019423. 152 GWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEK 231 (756) Q Consensus 152 ~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~ 231 (756) |++. 
T Consensus 144 ~~d~---------------------------------------------------------------------------- 147 (478) T protein:vir:10 144 YVDE---------------------------------------------------------------------------- 147 (478) T ss_pred EecC---------------------------------------------------------------------------- Confidence 6531 Q ss_pred eecCceeEEEechhheEe--CCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhcccccccc Q lcl|NC_019423. 232 ALVNRPTVEMLNPNNVVI--DPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQ 309 (756) Q Consensus 232 ~~~g~~~ie~V~p~~~~~--Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 309 (756) .|++++..++|.++++ |+.... +..+.+ +.|.... . . .... T Consensus 148 --~g~~~~~~~~p~~~~~i~d~~~~~---~~~~~v-~~~~~~~-------~------------------~--~~~~---- 190 (478) T protein:vir:10 148 --EGEFKTFRVPAEQAVPIWTNKERD---ELQAFI-RVYELDG-------A------------------E--RVEY---- 190 (478) T ss_pred --CCeeEEEEEcccceEEEEcCCCCC---ceEEEE-EEEEecC-------c------------------e--EEEE---- Confidence 1235677788888764 443322 222222 2221100 0 0 0000 Q ss_pred ccccccceEEEEEEEEEeeccCCceeEEE-EEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHH Q lcl|NC_019423. 310 FKDALRKKVVAYEYWGFYDINDDGSLEPI-VATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQA 388 (756) Q Consensus 310 ~~d~s~~~V~v~E~w~k~d~~~~g~~~~~-~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~ 388 (756) +. ..+|..|++-. +....... ...-... .......|.+.+.+|++.+. ++.+|.|.+..++++++ T Consensus 191 y~---~~~i~~~~~~~-----~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~vPvv~~~-----n~~~g~sd~~~v~~liD 256 (478) T protein:vir:10 191 WT---KDDVTYYELKE-----GQLIPDFYRSDDHIQP-HYYQGNKLMSWGRVPFIPFK-----NNPQEVSDLFMYKTIID 256 (478) T ss_pred Ee---CCeEEEEEEcC-----Ceeecccccccccccc-ceecccccccCCccceEEec-----cCCCCCCcHHHHHHHHH Confidence 00 01222221100 00000000 0000001 11122334455778877664 35679999999999999 Q ss_pred HHHHHHHHHHHHHHhhcCCceEeecccc-CccchhhhhccccccccccccccccccccccCCCcchHHHHHHHHHHHHHH Q lcl|NC_019423. 
389 ILGATMRGMIDLLGRSANGQRGYPKGML-DTLNRRRYDDGQDYEYNPMQGNPSQSIMEHKFPELPQSAIVMTQMQNQEAE 467 (756) Q Consensus 389 ~iN~~~~~~~d~l~~~~~~~~~~~~gav-~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~e 467 (756) .+|..++.+.+.+...+.|.+.+. |.- +................ +....++.+.++..+.-.......++.+.+.+- T Consensus 257 a~~~~~S~~~~~~~~~~~p~~~~~-g~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~ 334 (478) T protein:vir:10 257 ALDKRLSDTQNTFDESVELIYILK-GYEGEDMKDFMHNLKYYKAIS-VAGESGSGVDTIKVEVPIDSVKEYTKMLRDYII 334 (478) T ss_pred HHHHHHHHHHHHHHHhhCceeeee-cCCccccchhhhhhhhcceEE-ecCCCCCcceEEeecCChHHHHHHHHHHHHHHH Confidence 999999999999998888866543 321 11111111111111111 111223445555444334566678888899999 Q ss_pred HHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHh Q lcl|NC_019423. 468 SLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKRE 547 (756) Q Consensus 468 ~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d 547 (756) ..|++++.+.+..++. .++.++...............+.|..+++++++++++++ .. + ++.. T Consensus 335 ~~s~~p~~~~~~~~~n--~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~----g~---------~---~~~~ 396 (478) T protein:vir:10 335 EFGQGVDFQQDKFGNS--PSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFY----RL---------D---VKVQ 396 (478) T ss_pred HHhCccccCccccccc--cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----CC---------C---cccc Confidence 9999988765533222 233335545555555556666667777777666665543 11 0 0000 Q ss_pred HhcCcceEEEecccccHHHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcC-ChhHHHHhhhccCCCChhhhhHHHHH Q lcl|NC_019423. 548 DLKGNFDIEVDINTAEIDNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKR-MPDLAHELRTWQPQPDPMEEQLKQLA 626 (756) Q Consensus 548 ~~~~~~Dv~V~~g~a~~~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~-~~~~~~~l~~~~~q~~p~~~~~~q~~ 626 (756) +..++.+.+.+.-....++ ++..++..++.+. +++..+ ..+...-++++. 
T Consensus 397 ----~i~i~f~~~~p~d~~e~a~----~~~kl~g~iS~et-------~~~~l~~v~D~~~E~~ri~-------------- 447 (478) T protein:vir:10 397 ----DIEITFNFNVMVNELENSQ----IAMNSTGLLSKET-------ILSNHAWVEDPVAEMERIE-------------- 447 (478) T ss_pred ----cceEEecCCCCCCHHHHHH----HHHHHhCCCChHH-------HHHhCCCCCCHHHHHHHHH-------------- Confidence 1122222222211112222 2222223333211 222222 122221111111 Q ss_pred HHHHHHHHHHHHHHHH-HHHHHHHHHHHHHH Q lcl|NC_019423. 627 IQKAQLENEELQSKIA-LNNAKAKEAASSGD 656 (756) Q Consensus 627 ~~~aq~e~~~~qa~a~-~~~a~a~~~~aq~~ 656 (756) .++............. ....+.+.--.+.+ T Consensus 448 ~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 478 (478) T protein:vir:10 448 QENIELNQQLPDIEEGLNGEQQRQSENNQPE 478 (478) T ss_pred HHHHHHHhhccccccccCCCCCCCCCCCCCC Confidence 0000000000000000 00000000000000 No 64 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=99.61 E-value=1.2e-13 Score=91.33 Aligned_cols=453 Identities=13% Similarity=0.087 Sum_probs=205.8 Q ss_pred CCchHHHHHHHHHHH---------HHH-----HHhhHHHHHHHHHHHHhccccC--CC-----CCCCCCCCcccCHHHHH Q lcl|NC_019423. 22 WKKEPSIQLLKGDLE---------SAK-----PAHDAIMSQIREWNDLMEVKGK--AK-----PPKIKGRSQVQPRLVRR 80 (756) Q Consensus 22 ~~~~~~~~~l~~~~~---------~a~-----~~~~~~~~~~~~~~~~y~~~~~--~~-----~~~~~grS~~v~~~v~~ 80 (756) |-+ -+.+.|+..++ .+. ..+++++.+..+|.+||.|... .. ......+.+++.+.... T Consensus 1 m~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~s~n~~~~ 79 (499) T protein:vir:80 1 MIN-QIIAGVKGVMRRMGLLKSLKDVTDHKKVNANDEDYKYIDMWKRLYQGNYAEWHNLNYEHNGNPVNRRQLSMNLPKV 79 (499) T ss_pred Chh-HHHHHHHHHHHHhccccchhhhhcCCCCcCCHHHHHHHHHHHHHhcCCcchhhccccccCCCccccceeecchHHH Confidence 221 22222222221 111 2355666778889999987521 01 01111233344444444 Q ss_pred HHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeee Q lcl|NC_019423. 
81 QAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKI 160 (756) Q Consensus 81 ~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~ 160 (756) .++.. .+.+|+-..-+.+ +|. +.+++|+-++. .++-...+..++..|+..|.+++|+||+.. T Consensus 80 iv~~~----a~~l~~ep~~i~~-----~d~----~~~e~l~~~~~-~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~---- 141 (499) T protein:vir:80 80 TAKYM----SKLLFNEKVKINI-----DDE----TAEEFVLNVLK-TNGFTKNMERYIEYGEAMGGFVIKVYHDGN---- 141 (499) T ss_pred HHHHH----HHhhhCCcceEee-----CCH----HHHHHHHHHHh-hccHHHHHHHHHHHHhhcCcEEEEEEECCC---- Confidence 44333 3334555444444 344 44456666544 344455677899999999999999999621 Q ss_pred eeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEE Q lcl|NC_019423. 161 KTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVE 240 (756) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie 240 (756) |+|+|+ T Consensus 142 --------------------------------------------------------------------------~~~~i~ 147 (499) T protein:vir:80 142 --------------------------------------------------------------------------KNVKVS 147 (499) T ss_pred --------------------------------------------------------------------------CcEEEE Confidence 346788 Q ss_pred EechhheEeCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceEEE Q lcl|NC_019423. 241 MLNPNNVVIDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVA 320 (756) Q Consensus 241 ~V~p~~~~~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v 320 (756) .|+|+.||+=..-..++..|-|+-.. +++ +-.+..|+.+.+.... 
...-.|+ T Consensus 148 ~v~a~~~~Pi~~d~~~~~~~~f~~~~---~~~-----~~~y~~lE~h~~~~~~-------------------~~~y~I~- 199 (499) T protein:vir:80 148 FATADCMYPLSNDSENVDECLIANSF---HKN-----NKYYKLLEWNEWKGEK-------------------EEVYTVT- 199 (499) T ss_pred EEcCCceEEEEecCCCeEEEEEEEEE---eec-----CeEEEEEEEEEecccc-------------------eeeEEEE- Confidence 88888877411111234444443111 110 0001111111000000 0000000 Q ss_pred EEEEEEeeccCCceeEEEEEEEECCEEEEecccccC-CCccceEEeee----eeecCcccCCchHHHhHHHHHHHHHHHH Q lcl|NC_019423. 321 YEYWGFYDINDDGSLEPIVATWIGSTLIRMENNPFP-DGKLPLVVVPY----MPRKRELFGEADAELLGDNQAILGATMR 395 (756) Q Consensus 321 ~E~w~k~d~~~~g~~~~~~~~~~g~~~L~~~~~P~~-~~~~Pfv~~~~----~~~~~~~~G~g~v~~~~d~Q~~iN~~~~ 395 (756) ++.|...+.+.-|.......++.+ +. ...++. .++.||+.+.. ....++.+|.|++.++++.++.+|..++ T Consensus 200 n~~~~~~~~~~lG~~v~l~~~~~~---~~-~~~~~~~~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s 275 (499) T protein:vir:80 200 TELYQSDDPNELGGKVSLKLLFND---IE-PVVPLPSLTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFD 275 (499) T ss_pred EEEEeccCccccCcccchhhhccC---cC-CceeecCCCccceEeecCCccccccCCCccCCchHhhHHHHHHHHHHHHH Confidence 011100000000000000000000 00 000111 24456666543 2245778899999999999999999999 Q ss_pred HHHHHHHhhcCCceEeeccccCccchh------hhhcccccccccccc-c--cccccccccCCCcchHHHHHHHHHHHHH Q lcl|NC_019423. 396 GMIDLLGRSANGQRGYPKGMLDTLNRR------RYDDGQDYEYNPMQG-N--PSQSIMEHKFPELPQSAIVMTQMQNQEA 466 (756) Q Consensus 396 ~~~d~l~~~~~~~~~~~~gav~~~~~~------~~~~~~~~~~~~~~~-~--~~~~i~~~~~~~~~~~~~~~l~~~~~~~ 466 (756) +..+.+.. +..++.++...+...... .+... ...+..+.. . 
....++..++.-........++.+...+ T Consensus 276 ~~~~e~~~-~~~~i~v~~~~l~~~~~~~g~~~~~~~~~-~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i 353 (499) T protein:vir:80 276 SYYQEFKL-GKKKVLVPSSFVKTAVNLDGSTTQYFDST-DEAFFLYQGEQDDNGKAIKDISVEIRSTEFIESINAMLRIY 353 (499) T ss_pred HHHHHHHh-cccceecchhhhhccCCCCCCcccCCCcc-cceeeEeeccCCCCcCceeEecCcCChHHHHHHHHHHHHHH Confidence 99998865 566777776665321100 01111 111111111 1 1123444443333345567788888888 Q ss_pred HHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCH Q lcl|NC_019423. 467 ESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKR 546 (756) Q Consensus 467 e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~ 546 (756) ....|++....|.+++. ..||+++....+..-.....+.+.|..+++++.+.++.+..-+.--. |.. T Consensus 354 ~~~~g~s~~~fg~~~~g-~~TAtei~s~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~------~~~------ 420 (499) T protein:vir:80 354 AMQVGLSAGTFTFDENG-LKTATEVVSEKSETYQTKNSHSQLIEQGIKEMIVSILEVGKLIKAYD------GDT------ 420 (499) T ss_pred HHhcCCChhhcCCCccc-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccc------CCC------ Confidence 88889998888865443 36788877655555556666778888888888888888765442110 100 Q ss_pred hHhcCcceEEEec--ccccHHHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCChh--HHHHhhhccCC-------C Q lcl|NC_019423. 547 EDLKGNFDIEVDI--NTAEIDNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMPD--LAHELRTWQPQ-------P 615 (756) Q Consensus 547 d~~~~~~Dv~V~~--g~a~~~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~~--~~~~l~~~~~q-------~ 615 (756) ....+++|.- +...-.....+..+.+.. . ..++.... +++..+..+ +.+.+.+++.. + T Consensus 421 ---~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~-~-Gi~S~et~------l~~~~~~~d~ea~~el~~i~~E~~~~~~~~ 489 (499) T protein:vir:80 421 ---VELDTITVDFDDSIAQDEDTTINRYTTAKN-Q-GMIPLKIA------LQRAWNITEAEADEWAEMLAKEKQAEIPNN 489 (499) T ss_pred ---CCccceEEEeCCCCCCCHHHHHHHHHHHHH-c-CCCCHHHH------HhhcCCCChHHHHHHHHHHHHHhhcCCCCC Confidence 1122344333 222212222222222221 1 22222111 223333322 11112111100 0 Q ss_pred ChhhhhHHHHH Q lcl|NC_019423. 
616 DPMEEQLKQLA 626 (756) Q Consensus 616 ~p~~~~~~q~~ 626 (756) ++...... .+ T Consensus 490 d~~g~~ge-~e 499 (499) T protein:vir:80 490 DMTGIFGE-EE 499 (499) T ss_pred CccccCCC-CC Confidence 00000000 00 No 65 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=99.60 E-value=7.2e-14 Score=92.63 Aligned_cols=422 Identities=11% Similarity=0.029 Sum_probs=193.7 Q ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCC--CCCCCCCCC--cccCHHHHHHHHHHHHHHHHhhcCCC Q lcl|NC_019423. 22 WKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKA--KPPKIKGRS--QVQPRLVRRQAEWRYAPLSEPFLSSS 97 (756) Q Consensus 22 ~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~--~~~~~~grS--~~v~~~v~~~~e~~~~~L~~~f~~~~ 97 (756) |+.+.+ ....+.|.....+.++-.+||.|.-.- ..++.++++ +++.+..+..|+.....| +|.+ T Consensus 1 l~~~~l--------~~~i~~~~~~~~r~~~l~~yy~g~~~il~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l----~g~~ 68 (429) T protein:vir:98 1 MTKDLL--------SELIQKHRSFNLSYSAYKQLYEGDHAILQQKQKEQYKPDNRLVVNFAKYIVDTFNGYF----IGVP 68 (429) T ss_pred CCHHHH--------HHHHHHHHHHHHHHHHHHHHhccccccccccccccCCCcceeecchHHHHHHHHhhhh----cccC Confidence 333332 233334556666777888999986311 222333333 577777777777766555 6655 Q ss_pred CEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeeeeeeeeeecCCCCCHHH Q lcl|NC_019423. 98 KLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIKTETPVFQLYPIENQEQ 177 (756) Q Consensus 98 ~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~~ 177 (756) . .|.+ +|+ ...+.++.++. .|+--.....+.+++++.|.+++.+|++. 
T Consensus 69 ~--~~~~---~~~----~~~~~l~~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d~---------------------- 116 (429) T protein:vir:98 69 V--QTSH---ENK----QVSNYLELLDG-YNDQDDNNAELSKICSIYGHGYELVFNDE---------------------- 116 (429) T ss_pred c--eeec---CCh----HHHHHHHHHHh-hcCHhHHHHHHHHHHhhcCeEEEEEEecC---------------------- Confidence 3 3433 222 33345555533 34333446678899999999877765421 Q ss_pred HHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEEEechhheEe--CCCCcC Q lcl|NC_019423. 178 ADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVEMLNPNNVVI--DPSCNG 255 (756) Q Consensus 178 ~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie~V~p~~~~~--Dp~a~~ 255 (756) .|.+++..++|.+++. |..... T Consensus 117 --------------------------------------------------------~g~~~~~~~~p~~~~~v~dd~~~~ 140 (429) T protein:vir:98 117 --------------------------------------------------------NAEAGITYLTPLEAFIVYDDSIRQ 140 (429) T ss_pred --------------------------------------------------------CCcEEEEEEcccceEEEEeCCCCC Confidence 1346677778887643 322211 Q ss_pred ccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceEEEEEEEEEeeccCCcee Q lcl|NC_019423. 256 DLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVAYEYWGFYDINDDGSL 335 (756) Q Consensus 256 d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~E~w~k~d~~~~g~~ 335 (756) .. ..+.+.+.+. ..+..+++|.. ... T Consensus 141 ---~~-~~~i~~~~~~--------------------------------------------~~~~~~~~~~~------~~~ 166 (429) T protein:vir:98 141 ---KP-LFAVRYFYNK--------------------------------------------GGVLEGSYSDA------SNI 166 (429) T ss_pred ---ce-EEEEEEEEec--------------------------------------------CceEEEEEEeC------ceE Confidence 11 1122222110 01222233321 000 Q ss_pred EEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccc Q lcl|NC_019423. 
336 EPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGM 415 (756) Q Consensus 336 ~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~ga 415 (756) ..+.+ -.++..+ .+..|.+.|.+|++.++ ++.+|.|.+..++++++.+|..++.+.+.+...+.|.+.+.-.. T Consensus 167 ~~~~~-~~~~~~~-~~~~~~~~g~vPvv~~~-----n~~~g~sd~e~v~~liD~~d~~~s~~~~~~~~~~~p~~~i~g~~ 239 (429) T protein:vir:98 167 TYFKD-GEKGIEI-GESEPHPFDGVPMIEYV-----ENEERQSLLASVVTLINAFNKAISEKANDVEYFADAYLKILGAE 239 (429) T ss_pred EEEEe-cCCceEe-cccccccCCccceEEec-----CCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCC Confidence 00000 0111111 12233444677877653 35579999999999999999999999999999998876654222 Q ss_pred cCccchhhhhcccccccccccc-ccccccccccCCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHH Q lcl|NC_019423. 416 LDTLNRRRYDDGQDYEYNPMQG-NPSQSIMEHKFPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGA 494 (756) Q Consensus 416 v~~~~~~~~~~~~~~~~~~~~~-~~~~~i~~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~ 494 (756) .+. +................ .....+.++..+.-.......++.+.+.+-..|++++.+.+..++ .++.++... T Consensus 240 ~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~gn---~Sg~Al~~~ 314 (429) T protein:vir:98 240 LDD--ETLKSLRDTRIINLKDTDAQQLTVEFLQKPDADATQEHLLDRLENLIFRTAMVANISDESFGT---ASGIALRYR 314 (429) T ss_pred CCc--chhhhHhhCceeeccCCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCcccccc---chHHHHHHH Confidence 221 11111111111110000 011224444444334556667888999999999998776553332 233334444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHhcCcceEEEecccccHHHHHHHHHHH Q lcl|NC_019423. 495 LDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDLKGNFDIEVDINTAEIDNQKSQDLGF 574 (756) Q Consensus 495 ~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~~~~~Dv~V~~g~a~~~~~~~q~l~~ 574 (756) ............+.|..++++++++++.++... +.. .+ -.+|.|.-+....... .+.+. 
T Consensus 315 ~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~----------~~~---~d------~~~i~v~f~~~~p~~~--~~~a~ 373 (429) T protein:vir:98 315 LQAMDNLAKTKERKFMSGMNRRYKLIASYPTSK----------IGP---KD------WIGIKYKFTRNLPANL--LEESQ 373 (429) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC----------CCc---cc------cccceEEeCCCCCcCH--HHHHH Confidence 445555556666777777777777776654311 111 00 0123333333222111 11112 Q ss_pred HHHHhhccCCHhHHHHHHHHHHhhcCChhHHHHhhhccCCCChhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 575 MVQTLGNTVDQSITLSLVAKIAELKRMPDLAHELRTWQPQPDPMEEQLKQLAIQKAQLENEELQSKIALNNAKAKEAASS 654 (756) Q Consensus 575 llq~~~~~~~~~~~~~~l~~l~e~~~~~~~~~~l~~~~~q~~p~~~~~~q~~~~~aq~e~~~~qa~a~~~~a~a~~~~aq 654 (756) .+..++..++-+. +++..+. ..+|. +++++.+.+.+. .++.+...... + T Consensus 374 ~~~kl~g~is~et-------~~~~l~~------------v~d~~----~E~~ri~~E~~~-~~~~~~~~~~~-------~ 422 (429) T protein:vir:98 374 IAGNLAGIVSEET-------QVGVLSI------------VENPQ----KEIERKNSDKST-LISRQAGGLNG-------Q 422 (429) T ss_pred HHHHHhccCchHH-------HHHhCCC------------CCCHH----HHHHHHHHHHHH-HHHHHHhhhcC-------C Confidence 2222222233221 1222221 11221 111111111000 00000000000 0 Q ss_pred HHHHHHH Q lcl|NC_019423. 655 GDLKDLD 661 (756) Q Consensus 655 ~~~~~~~ 661 (756) ..-..++ T Consensus 423 ~~~~~~~ 429 (429) T protein:vir:98 423 NTTTILE 429 (429) T ss_pred CCCCCCC Confidence 0000000 No 66 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=99.60 E-value=1.2e-14 Score=96.92 Aligned_cols=461 Identities=9% Similarity=0.038 Sum_probs=200.3 Q ss_pred ccCC-CCCCCccccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCC--CC-------CCCCCC-- Q lcl|NC_019423. 4 QDTF-KPLPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAK--PP-------KIKGRS-- 71 (756) Q Consensus 4 ~~~~-~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~--~~-------~~~grS-- 71 (756) +-.+ .|.+-|--.+-+.-..+.. 
......+......|...+.+.++..+||.|...-. .. ..++++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~i~~~i~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ 78 (478) T protein:vir:10 1 MISINWPWDKPYHEQVVEQIKPKY--ETQEEMILRLVREHKENIDNITMGERYYNHHPDILDAPFKRDVNGDYDETKPDW 78 (478) T ss_pred CccccccCCchhhhHHHHHhhhcc--CChHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccchhhhcccccccccccc Confidence 3333 2222222221121111100 01112245555567777777888999999863210 00 012222 Q ss_pred cccCHHHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEE Q lcl|NC_019423. 72 QVQPRLVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARI 151 (756) Q Consensus 72 ~~v~~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~ 151 (756) +++.+..+..|+....-| ||.+.- |.+ +|.+..+ .++..+. |+-......+.+++++.|.+++.+ T Consensus 79 ki~~n~~k~ivd~~~~yl----~g~p~~--~~~---~~~~~~~----~l~~~~~--n~~~~~~~~~~~~~~~~G~~~~~v 143 (478) T protein:vir:10 79 RMYTNYHQNLVDQKVAYA----VANPVT--FGV---DNDKALK----QIQHTLN--HKWDDKLVDILTAASNKGIEWVQP 143 (478) T ss_pred eeccchHHHHHHHHhhhh----cccCce--eec---CChHHHH----HHHHHHh--ccHHHHHHHHHHHHhhCCeEEEEE Confidence 466666666666665544 665533 332 3444333 3444332 344445567889999999998887 Q ss_pred eeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeee Q lcl|NC_019423. 152 GWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEK 231 (756) Q Consensus 152 ~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~ 231 (756) ||+.+ T Consensus 144 ~~d~~--------------------------------------------------------------------------- 148 (478) T protein:vir:10 144 YVDEE--------------------------------------------------------------------------- 148 (478) T ss_pred EecCC--------------------------------------------------------------------------- Confidence 65311 Q ss_pred eecCceeEEEechhheE--eCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhcccccccc Q lcl|NC_019423. 
232 ALVNRPTVEMLNPNNVV--IDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQ 309 (756) Q Consensus 232 ~~~g~~~ie~V~p~~~~--~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 309 (756) |++++..++|.+++ ||+..... ..+.+ +.+.+... . .... T Consensus 149 ---~~~~~~~~~p~~~~~v~d~~~~~~---~~~~i-r~~~~~~~-----------~----------------~~~~---- 190 (478) T protein:vir:10 149 ---GEFKTFRVPAEQAVPIWTNKERDE---LQAFI-RVYELDGA-----------E----------------RVEY---- 190 (478) T ss_pred ---CceEEEEEcccceEEEEcCCCCCc---eEEEE-EEEeeeCc-----------e----------------EEEE---- Confidence 23667778888865 45443222 22322 22221000 0 0000 Q ss_pred ccccccceEEEEEEEEEeeccCCceeEEEEEEEEC-CEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHH Q lcl|NC_019423. 310 FKDALRKKVVAYEYWGFYDINDDGSLEPIVATWIG-STLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQA 388 (756) Q Consensus 310 ~~d~s~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g-~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~ 388 (756) +. ..+|..|.+.. .+..+.......+ ......+..|...|.+|++.+.. +..|.|.+..++++++ T Consensus 191 y~---~~~i~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~sd~e~v~~liD 256 (478) T protein:vir:10 191 WT---KDDVTFYELKE------GQLIPDFYRSEDHIQPHYYQGNKLMSWGRVPFIPFKN-----NPQEVSDLFMYKTIID 256 (478) T ss_pred Ee---CCcEEEEEecC------CeeeccccccccccccceecccccccCCcceEEEecc-----CCCCCCcHHHHHHHHH Confidence 00 01222222111 0000000000111 11112233455567788877654 4468999999999999 Q ss_pred HHHHHHHHHHHHHHhhcCCceEeeccc-cCccchhhhhccccccccccccccccccccccCCCcchHHHHHHHHHHHHHH Q lcl|NC_019423. 389 ILGATMRGMIDLLGRSANGQRGYPKGM-LDTLNRRRYDDGQDYEYNPMQGNPSQSIMEHKFPELPQSAIVMTQMQNQEAE 467 (756) Q Consensus 389 ~iN~~~~~~~d~l~~~~~~~~~~~~ga-v~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~e 467 (756) .+|..++.+.+.+...+.+.+++. |. .+................ 
+....++.++++..+.-...+...++.+.+.+- T Consensus 257 a~~~~~S~~~~~~~~~~~~~~~~~-g~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~ 334 (478) T protein:vir:10 257 ALDKRLSDTQNTFDESVELIYILK-GYEGEDMKDFMHNLKYYKAIS-VAGESGSGVDTIKVEVPIDSVKEYTKMLRDYII 334 (478) T ss_pred HHHHHHHHHHHHHHHhhCcceeee-cCCcccccchhhhhhhCceeE-ecCCCCCcceEEeecCCHHHHHHHHHHHHHHHH Confidence 999999999999988888866543 32 111111111111111111 111123445666554445666777888889999 Q ss_pred HHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHh Q lcl|NC_019423. 468 SLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKRE 547 (756) Q Consensus 468 ~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d 547 (756) ..|++++.+.+..+.. .++.++...............+.|..+++++++++++++-.-+ ++ T Consensus 335 ~~s~~p~~~~~~~~~n--~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~-----------d~------ 395 (478) T protein:vir:10 335 EFGQGVDFQQDKFGNS--PSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYRLDV-----------RV------ 395 (478) T ss_pred HHhCCcCcCccccccc--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCc-----------cc------ Confidence 9999887765533222 2333344444555555566666676777766666655442100 00 Q ss_pred HhcCcceEEEecccccHHHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhc-CChhHHHHhhhccCCCChhhhhHHHHH Q lcl|NC_019423. 548 DLKGNFDIEVDINTAEIDNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELK-RMPDLAHELRTWQPQPDPMEEQLKQLA 626 (756) Q Consensus 548 ~~~~~~Dv~V~~g~a~~~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~-~~~~~~~~l~~~~~q~~p~~~~~~q~~ 626 (756) .+|.|.-.........+ .+.++......++-+. +++.. 
...+...-++ +++ T Consensus 396 -----~~i~i~f~~~~p~~~~e--~~~~~~~~~g~iS~et-------~i~~~~~v~d~~~E~~--------------ri~ 447 (478) T protein:vir:10 396 -----QDIEITFNFNVMVNELE--NSQIAMNSTGLLSKET-------ILGNHSWVQDPVAEME--------------RIE 447 (478) T ss_pred -----ccceEEeCCCCCCCHHH--HHHHHHHHhCCCChHH-------HHHhCCCCCCHHHHHH--------------HHH Confidence 12333333322211111 1122222222222211 11111 1212111111 111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 627 IQKAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARD 674 (756) Q Consensus 627 ~~~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~ 674 (756) .++.+.......-.-.... .+..+. +..+. | T Consensus 448 ~E~~~~~~~~~~~~~~~~d--~~~~~~-------~d~~~--------e 478 (478) T protein:vir:10 448 QENIELNQQLPDIEEGLND--EQQRQS-------EDNQS--------E 478 (478) T ss_pred HHHHHHHHhccccCCCCcc--cccccC-------cCCCC--------C Confidence 1110000000000000000 000000 00000 0 No 67 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=99.60 E-value=1.7e-13 Score=90.55 Aligned_cols=468 Identities=10% Similarity=0.027 Sum_probs=208.8 Q ss_pred CCcc---------cCCCCCCCcc--------ccccccCCCchHHHHHHHHHHHHHHHHhhHH-HHHHHHHHHHhcccc-C Q lcl|NC_019423. 1 MEHQ---------DTFKPLPDPA--------QSEKLTDWKKEPSIQLLKGDLESAKPAHDAI-MSQIREWNDLMEVKG-K 61 (756) Q Consensus 1 ~~~~---------~~~~~~~~~~--------~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~-~~~~~~~~~~y~~~~-~ 61 (756) |+.. ++++..-.++ ..+.++.+.= . .+......|... ..+.++..+||.|.- . 
T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~----~l~~~i~~~~~~~~~r~~~l~~yY~g~~~~ 72 (501) T protein:vir:27 1 MEQTLFTDSTGQDLVLNLRFHRESRIRYRADNLEELMVNNW----E----LLKNFINHHKLRQAPRIQELLDYARGENHD 72 (501) T ss_pred CCceeEEeccchhhhhhcccChhHHHhhccccccccccccH----H----HHHHHHHHHHHHHHHHHHHHHHHhcCCCcc Confidence 3322 1111111111 1222222211 1 133444455544 356788999999852 1 Q ss_pred C-C--CCCCCCCC--cccCHHHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHH Q lcl|NC_019423. 62 A-K--PPKIKGRS--QVQPRLVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDD 136 (756) Q Consensus 62 ~-~--~~~~~grS--~~v~~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~ 136 (756) . . ....++++ +++.+-.+..|+....-| +|.+.-+... |...-+...++++-++ ..|+--..... T Consensus 73 i~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl----~g~p~~~~~~-----d~~~~~~~~~~l~~~~-~~n~~~~~~~~ 142 (501) T protein:vir:27 73 VLQFGRRKDREMADKRAVHNYGRMISKFKTGYL----AGNPIRVEYD-----DNDNNSQNDDTIKRIG-RINDIDSHNRT 142 (501) T ss_pred ccccCccCccccccceeccchHHHHHHHHhhhh----cccCeeEecC-----CccchHHHHHHHHHHH-HhcChhHHHHH Confidence 1 1 11223444 566666666666665544 6665434332 2222234445665543 34554555667 Q ss_pred HHHHHhhcCceEEEEeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCc Q lcl|NC_019423. 137 YVHSIVDDGTGIARIGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEA 216 (756) Q Consensus 137 ~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~ 216 (756) +.+++++.|.+.+.+|++.. T Consensus 143 ~~~~~~~~G~a~~~vy~ded------------------------------------------------------------ 162 (501) T protein:vir:27 143 LIRDLSQTGRAYEVIYRNEY------------------------------------------------------------ 162 (501) T ss_pred HHHHHhhCCeEEEEEEeCCC------------------------------------------------------------ Confidence 89999999999887765311 Q ss_pred ceeccCceeEEEeeeeecCceeEEEechhheEe--CCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhh Q lcl|NC_019423. 
217 TYAIQTGVTEVEVEKALVNRPTVEMLNPNNVVI--DPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSP 294 (756) Q Consensus 217 ~~~~~~g~~~~~~~~~~~g~~~ie~V~p~~~~~--Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~ 294 (756) |+|++..++|.++++ |+.... ...+.+ +.|....+ T Consensus 163 ------------------~~~~i~~~~p~~~~~v~d~~~~~---~~~~~i-r~~~~~~~--------------------- 199 (501) T protein:vir:27 163 ------------------DETRIKRLNPLETFVIYDNSLED---NSIAAV-RYYNRGTL--------------------- 199 (501) T ss_pred ------------------CceEEEEEccceeEEEecCCCCC---ceEEEE-EEEEeeec--------------------- Confidence 336677788888754 544322 122222 22211000 Q ss_pred hhchhhhccccccccccccccceEEEEEEEEEeeccCCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcc Q lcl|NC_019423. 295 ITDPDHESKTPSDFQFKDALRKKVVAYEYWGFYDINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKREL 374 (756) Q Consensus 295 ~~~~~~~~~~~~~~~~~d~s~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~ 374 (756) ...+.++|+|.. +.+ +++...|+ .......|.+.|.+|++.+. ++. T Consensus 200 --------------------~~~~~~~~vyt~-----~~v---~~~~~~~~-~~~~~~~~~~~g~vPvv~~~-----nn~ 245 (501) T protein:vir:27 200 --------------------QNAKDVVEIYTN-----EHI---YTLDASDD-FNEISVTTHAFGTVPITEFL-----NNV 245 (501) T ss_pred --------------------CCcEEEEEEEeC-----CeE---EEEEeCCc-eeeccccccCCCcccEEEec-----CCC Confidence 001334555532 111 11111122 22223333344778887654 355 Q ss_pred cCCchHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhhhhcccccccc------ccccccccccccccC Q lcl|NC_019423. 375 FGEADAELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYN------PMQGNPSQSIMEHKF 448 (756) Q Consensus 375 ~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~------~~~~~~~~~i~~~~~ 448 (756) +|.|.+..++++++.+|..++.+.+.+...+++.+.+.-...+...+............ +.+......+.++.. 
T Consensus 246 ~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 325 (501) T protein:vir:27 246 DGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQASDMKRTRLMQLKPPKSADGKEGTVKAEYLTK 325 (501) T ss_pred CCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCcccchhhhhhcCceeecccccccCCCCCcceeeeec Confidence 79999999999999999999999999988888776654322222211111111111000 011112223455544 Q ss_pred CCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhC Q lcl|NC_019423. 449 PELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFL 528 (756) Q Consensus 449 ~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~ 528 (756) +.-.......+..+.+.+-..|++++.+.|.-+... ++.++...............+.|..+++++.++++.++.... T Consensus 326 ~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~--Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~ 403 (501) T protein:vir:27 326 SYDVSGAEAYKTRLNRDIHIFTNIPDMSDTNFSGNT--SGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVN 403 (501) T ss_pred cCCHHHHHHHHHHHHHHHHHHhCCcccCccccccCc--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 444456667789999999999999987766432222 333344444445555666667787888888887777654322 Q ss_pred CCCcEEEEecCceeecCHhHhcCcceEEEecccccHHHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhh-cCChhHHHH Q lcl|NC_019423. 529 SEKEVVRITNEQYVEIKREDLKGNFDIEVDINTAEIDNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAEL-KRMPDLAHE 607 (756) Q Consensus 529 ~~~r~iRI~g~~~v~i~~d~~~~~~Dv~V~~g~a~~~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~-~~~~~~~~~ 607 (756) ... .++ -.+|.|.-.+....... +.+..+..+...++.+. +++. 
++..+...- T Consensus 404 ~~~-----------~~d------~~~i~v~f~~~~p~n~~--e~ad~~~kl~g~iS~et-------~l~~l~~v~D~~~E 457 (501) T protein:vir:27 404 EFK-----------DFD------ESLLKITFTPNLPKSLN--EQVSILTGLGGQVSQET-------ALSLSGLVESPNEE 457 (501) T ss_pred ccc-----------ccc------cccceEEeCCCCCcCHH--HHHHHHHHHhccCcHHH-------HHHhCCCCCCHHHH Confidence 110 000 01233333332221111 11122222222233221 1111 122222211 Q ss_pred hhhccCCCChhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 608 LRTWQPQPDPMEEQLKQLAIQKAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTK 670 (756) Q Consensus 608 l~~~~~q~~p~~~~~~q~~~~~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k 670 (756) ++++ ..++.+.+... +............-+.+...-+ ..+-+.. T Consensus 458 ~eri--------------~~E~~e~~~~~---~~~~~~~~~~~~~d~~~~~~~d--~~e~~~~ 501 (501) T protein:vir:27 458 LDKI--------------NKEVSEIDFKG---YSNDFNEHVGKYTDEVKETHTD--DFERAYE 501 (501) T ss_pred HHHH--------------HHHHHhhhHhh---hcCccccccccccCCCCCCccc--cccccCC Confidence 1111 11111000000 0000000000000000000000 0000000 No 68 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=99.60 E-value=2.9e-13 Score=89.33 Aligned_cols=448 Identities=9% Similarity=0.023 Sum_probs=201.9 Q ss_pred CCcccCCCCCCCcc------ccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCC--CC------- Q lcl|NC_019423. 1 MEHQDTFKPLPDPA------QSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAK--PP------- 65 (756) Q Consensus 1 ~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~--~~------- 65 (756) |. .+.-+.+|. +.-+...-.+ ...+....+.|...+.+..++.+||.|.-.-. 
.+ T Consensus 1 ~~---~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~ 70 (468) T protein:vir:96 1 MI---DIFWPNEKPYHERVVEQIKPQYETQ-------EEMILRLITKHKENVEDITVGERYYNHQPDVLFNAPKRNVKGE 70 (468) T ss_pred Cc---cccCCcCceeehheeecccccccCc-------HHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccccccccccc Confidence 33 222222222 1111111111 22344455666666777889999999863110 00 Q ss_pred --CCCCCCcccCHHHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhh Q lcl|NC_019423. 66 --KIKGRSQVQPRLVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVD 143 (756) Q Consensus 66 --~~~grS~~v~~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~ 143 (756) ..+..-+++.+..+..|+....-| ||.+.-+. .+|.+.-+ .+.-++ +++-......+.++++. T Consensus 71 ~~~~~~~~ki~~n~~~~Iv~~~~~~l----~g~p~~~~-----~~d~~~~~----~l~~~~--~n~~~~~~~~~~~~~~~ 135 (468) T protein:vir:96 71 IDPFKPDWRMYTNYHQNLVDQKVAYA----VANPVTYG-----TEDEKSLK----TIQEVL--NHKWDDKLVDILTAASN 135 (468) T ss_pred ccccccccccccchHHHHHHHHHhhh----ccCCceec-----cCChHHHH----HHHHHH--hcCHHHHHHHHHHHHhh Confidence 111223577777777777666555 66544432 23444333 333333 24455556678899999 Q ss_pred cCceEEEEeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCc Q lcl|NC_019423. 144 DGTGIARIGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTG 223 (756) Q Consensus 144 ~g~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g 223 (756) .|.+++.+||+.+ T Consensus 136 ~G~~~~~v~~d~~------------------------------------------------------------------- 148 (468) T protein:vir:96 136 KGVEWIQPYVDEQ------------------------------------------------------------------- 148 (468) T ss_pred cCeEEEEEEEcCC------------------------------------------------------------------- Confidence 9999888765311 Q ss_pred eeEEEeeeeecCceeEEEechhheEe--CCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhh Q lcl|NC_019423. 
224 VTEVEVEKALVNRPTVEMLNPNNVVI--DPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHE 301 (756) Q Consensus 224 ~~~~~~~~~~~g~~~ie~V~p~~~~~--Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~ 301 (756) |.+++..++|.++++ |+... .+..+.+ +.|.... ... T Consensus 149 -----------~~~~i~~~~p~~~~~v~~~~~~---~~~~~~i-r~~~~~~-----------~~~--------------- 187 (468) T protein:vir:96 149 -----------GEFKTFRVPAEQAIPIWTNKER---DELKAFI-RLYELDG-----------GER--------------- 187 (468) T ss_pred -----------CceEEEEEcccceEEEEcCCCC---CceEEEE-EEEEecC-----------ceE--------------- Confidence 236677788888763 43322 2232332 2231100 000 Q ss_pred ccccccccccccccceEEEEEEEEEeeccCCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHH Q lcl|NC_019423. 302 SKTPSDFQFKDALRKKVVAYEYWGFYDINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAE 381 (756) Q Consensus 302 ~~~~~~~~~~d~s~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~ 381 (756) ... +. ..++..|.++. +........-.............|.+.+++|++.+.. +.+|.|.+. T Consensus 188 -~~~----~~---~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n-----~~~g~sd~e 249 (468) T protein:vir:96 188 -VEY----WT---ANDVTFYELKD-----GQLIPDYYQGEEHVQAHYYVGNKSMSWNRVPFIPFKN-----NPQEVSDLF 249 (468) T ss_pred -EEE----Ee---CCeEEEEEEcC-----CceeecccccccccccceeeccccccCCcccEEEecC-----CCCCCCchH Confidence 000 00 01222222211 0000000000000011112233355567788887653 456899999 Q ss_pred HhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhhhhccccccccccccccccccccccCCCcchHHHHHHHH Q lcl|NC_019423. 382 LLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQGNPSQSIMEHKFPELPQSAIVMTQM 461 (756) Q Consensus 382 ~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l~~ 461 (756) .++++++.+|..++.+.+.+...++|.+++.-...++............... +..+..+.++++..+.-.......++. 
T Consensus 250 ~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~~~~i~-~~~d~~~~~~~l~~~~~~~~~~~~~~~ 328 (468) T protein:vir:96 250 MYKTIIDAMDKRLSDTQNTFDEATELIYVLKGYEGEDLEEFMYNLKYYKAIN-VDGDGSGGVDTIQIDVPVQSAKEYLDM 328 (468) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCccccchhhhhhhcCceEE-ecCCCCCcceEEeecCChHHHHHHHHH Confidence 9999999999999999999988888877665322222222211111111111 111223446666655445666777889 Q ss_pred HHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecCce Q lcl|NC_019423. 462 QNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNEQY 541 (756) Q Consensus 462 ~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~ 541 (756) +.+.+-..|++++.+.+..++. .++.++...............+.|..++++++++++.++ .. . T Consensus 329 l~~~I~~~s~~p~~~~~~~~~n--~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~----g~---------~- 392 (468) T protein:vir:96 329 LRDYVIEFGQGVDFQQDKFGNS--PSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFY----KL---------S- 392 (468) T ss_pred HHHHHHHHhCcccccccccccc--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----CC---------C- Confidence 9999999999988765433222 233335444455555556666667777777666665543 11 0 Q ss_pred eecCHhHhcCcceEEEecccccHHHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhh-cCChhHHHHhhhccCCCChhhh Q lcl|NC_019423. 542 VEIKREDLKGNFDIEVDINTAEIDNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAEL-KRMPDLAHELRTWQPQPDPMEE 620 (756) Q Consensus 542 v~i~~d~~~~~~Dv~V~~g~a~~~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~-~~~~~~~~~l~~~~~q~~p~~~ 620 (756) ++.. +..|+.+.+.+.-.... ++++... ..+.-+. +++. ++..+...-++ T Consensus 393 --~d~~----~i~i~f~~~~p~d~~e~----a~~~~~~-g~iS~et-------~i~~l~~v~D~~~E~~----------- 443 (468) T protein:vir:96 393 --IKVQ----DVEITFNFNVMVNELEQ----SQIGVNS-QYLSKET-------VVTNHPWVDDPVAEME----------- 443 (468) T ss_pred --cccc----eeeEEecCCCCcCHHHH----HHHHHhc-CCCchHH-------HHHhCCCCCCHHHHHH----------- Confidence 1111 11222222222111111 1222221 1222111 1111 11111111111 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 
621 QLKQLAIQKAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARDMEKQ 678 (756) Q Consensus 621 ~~~q~~~~~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~~~~ 678 (756) +++..+.+.. +.+....... +-+.. T Consensus 444 ---ri~~E~~~~~------------------~~~~~~~~~~------------~~~~~ 468 (468) T protein:vir:96 444 ---RIDQEELALP------------------SIEEGLNGKE------------NNEPT 468 (468) T ss_pred ---HHHHHHHHHH------------------HHhhccCCCC------------CCCCC Confidence 1111000000 0000000000 00000 No 69 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=99.60 E-value=7.3e-14 Score=92.60 Aligned_cols=484 Identities=11% Similarity=0.048 Sum_probs=211.5 Q ss_pred CCcccCCCCCCCccccccccCCCchHH----HHHHHHHHHH-HHHHhhHHHHHHHHHHHHhccccCC-CCCCCCCCCc-- Q lcl|NC_019423. 1 MEHQDTFKPLPDPAQSEKLTDWKKEPS----IQLLKGDLES-AKPAHDAIMSQIREWNDLMEVKGKA-KPPKIKGRSQ-- 72 (756) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~l~~~~~~-a~~~~~~~~~~~~~~~~~y~~~~~~-~~~~~~grS~-- 72 (756) |.= -+++-+|-..=. ...|++.+++ -..--.++..+..+|..||.|...- ......|+.+ T Consensus 1 m~~------------~~~ik~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~I~~w~~~Y~g~~~~~~~~~~~~~~~~~ 68 (517) T protein:vir:98 1 MKV------------IQRIKNFFKRGGYALSGQTLKSINDHEKINIDPNELARIERNLRQYEGDYPQVEYINSQGKIQER 68 (517) T ss_pred Cch------------HHHHHHHHHHHHHHhcccchhHhhcCCceecCHHHHHHHHHHHHHhcCCCccccccccccccccc Confidence 100 011111110000 0001111111 0111334556677889999876321 1111222221 Q ss_pred -ccCHHHHHHHHHHHHHHHHhhcCCCCEEEEecCC-cchH-HHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEE Q lcl|NC_019423. 73 -VQPRLVRRQAEWRYAPLSEPFLSSSKLFKLTPVT-FEDE-LAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIA 149 (756) Q Consensus 73 -~v~~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~-~~D~-~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~ 149 (756) ..+--+...|-.-+++| .|+-..-+.+.... .+.. ....-++++||-++. 
.|+-...+..++..++-.|.|++ T Consensus 69 ~~~sl~~~~~i~~~~A~L---l~~e~~~i~v~d~~~~~~~~~~~~~~~e~l~~i~~-~n~f~~~~~~~~e~a~a~G~~a~ 144 (517) T protein:vir:98 69 DYMTLNLRKLSADVLSGL---VFNEQCEVYVSDAKDEEKKDNSFKTAHEFIQHVFQ-HNKFIKNLSDYLEPTFALGGLTV 144 (517) T ss_pred ceeecCcHHHHHHHhhhh---hcCCcceEEecccccccccccchhHHHHHHHHHHH-hccHHHHHHHHHHHHhhhCCEEE Confidence 11112233344444555 24443444443211 1111 122235568887654 44556667889999999999999 Q ss_pred EEeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEe Q lcl|NC_019423. 150 RIGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEV 229 (756) Q Consensus 150 k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~ 229 (756) |+||+. T Consensus 145 k~~~d~-------------------------------------------------------------------------- 150 (517) T protein:vir:98 145 RPYVDN-------------------------------------------------------------------------- 150 (517) T ss_pred EEEEeC-------------------------------------------------------------------------- Confidence 999961 Q ss_pred eeeecCceeEEEechhheEe-CCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccc Q lcl|NC_019423. 230 EKALVNRPTVEMLNPNNVVI-DPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDF 308 (756) Q Consensus 230 ~~~~~g~~~ie~V~p~~~~~-Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 308 (756) ++++|+.|+++.||+ ..+.. ....|-++.. .+.+... ....|-.|+.+.+.........-.-+...... T Consensus 151 -----~~~~I~~v~ad~~~Pl~~~~~-~v~~~ai~~~-~~~~~~~---~~~~Yt~lE~H~~~~~~~~~~~y~I~n~ly~s 220 (517) T protein:vir:98 151 -----GEIEFSWALANAFYPLRSNSN-GISEGVMKSV-TTKVIGN---KTVYYTLLEFHEWEKTEEGESLYVITNELYKS 220 (517) T ss_pred -----CeeEEEEEcCCeeEEEEecCC-CeEEEEEEEE-EEEeecC---CceEEEEEEEEecCceeccCCcEEEEEEEEec Confidence 124456666666653 11221 1222222211 1111100 00011112211111100000000000000000 Q ss_pred cccccccceEEEEEEEEEeeccCCceeEEEEEEEECCEEEEecccccCCCccceE-Eee----eeeecCcccCCchHHHh Q lcl|NC_019423. 
309 QFKDALRKKVVAYEYWGFYDINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLV-VVP----YMPRKRELFGEADAELL 383 (756) Q Consensus 309 ~~~d~s~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv-~~~----~~~~~~~~~G~g~v~~~ 383 (756) ...+..+.+|-+-++|. + ..+. ++ +.+-..|.+ .+. .....++.+|.|++.++ T Consensus 221 ~~~~~lG~~v~L~~~~e--~-----l~~~---~~------------~~g~~~Plf~y~~~p~~N~~~~~splG~S~~~~a 278 (517) T protein:vir:98 221 DNEGEIGKRIPLEELYE--G-----MQEK---TY------------IQGLSRPLFNYLKPSGFNNINPHSPLGLGITDNS 278 (517) T ss_pred CCCcccccccccccccc--C-----CCcc---ee------------ECCCCcceEEEecCCcccccccCCCCCCchhhhh Confidence 00011122232222221 0 0000 00 011112332 221 22334788999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCcc---chhhhhc---ccccccccccccc-ccccccccCCCcchHHH Q lcl|NC_019423. 384 GDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTL---NRRRYDD---GQDYEYNPMQGNP-SQSIMEHKFPELPQSAI 456 (756) Q Consensus 384 ~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~---~~~~~~~---~~~~~~~~~~~~~-~~~i~~~~~~~~~~~~~ 456 (756) ++..+.+|..++++++-+.+ +..++.++++.+... +...... .....+..+.... ...++..++.-....+. T Consensus 279 ~~~~d~lD~~~s~~~~e~~~-g~~~i~vp~~~l~~~~~~~g~~~~~~~d~~~~~y~~~~~~~~~~~i~~~~~~iR~e~~~ 357 (517) T protein:vir:98 279 VSTLKKINDTYDQFWWEIKM-GQRTVFVSDVMLRTVPDESGMPPPQVFDPDVNVYKSIRMGTDEEFVKDVTHDIRTEQYK 357 (517) T ss_pred HHHHHHHHHHHHHHHHHHHh-CCcceecChhhhccccCCCCcccCCCCCcccceeeeccCCCCCCceeeeccccchHHHH Confidence 99999999999999998776 666888888887321 1110000 1111111122211 22344433333345677 Q ss_pred HHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh--CCCCcEE Q lcl|NC_019423. 457 VMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVF--LSEKEVV 534 (756) Q Consensus 457 ~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~--~~~~r~i 534 (756) ..++.+.+.+....|++....|.++... +||++|....+..-+....+.+.+..+++++.+.++.+..-+ +... 
T Consensus 358 ~~~~~~L~~i~~~~Gls~~t~~~~~~~~-kTATEi~s~~~~~~~t~~~~~~~~~~aL~~lv~~i~~l~~~~~~~~~~--- 433 (517) T protein:vir:98 358 EAINQALRTLEMELKLSVGTFSFDGRSM-KTATEIVSENDLTYRTRNDHVYEVEQFIKGLVISVLELAKTYKLFGGE--- 433 (517) T ss_pred HHHHHHHHHHHHHhCCCccccccccccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC--- Confidence 7888888889999999999999776554 789998877666667777788888899999999988776543 2211 Q ss_pred EEecCceeecCHhHhcCcceEEEecccc--cHHHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCChh--HHHHhhh Q lcl|NC_019423. 535 RITNEQYVEIKREDLKGNFDIEVDINTA--EIDNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMPD--LAHELRT 610 (756) Q Consensus 535 RI~g~~~v~i~~d~~~~~~Dv~V~~g~a--~~~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~~--~~~~l~~ 610 (756) ....++++|+=+.+ .-.....+..+++... ..++.... +++.-|..+ +.+.+.+ T Consensus 434 --------------~~~~~~v~v~f~D~i~~D~~~~~~~~~~~v~a--G~ms~~~~------i~~~~g~~eeeA~~e~~~ 491 (517) T protein:vir:98 434 --------------IPSAEHIGVDFDDGVFQDRSALLRFYGQAKTF--GFIPTVEA------IQRIFKVPKKTAEQWLEE 491 (517) T ss_pred --------------CCCCcceEEEcCCCCCCCHHHHHHHHHHHHhc--CCCCHHHH------HHHhCCCChHHHHHHHHH Confidence 11244565553333 2222333333333221 12332211 222223321 1222221 Q ss_pred cc---CCCChhhhhHHHHHHHHHHHHHH Q lcl|NC_019423. 611 WQ---PQPDPMEEQLKQLAIQKAQLENE 635 (756) Q Consensus 611 ~~---~q~~p~~~~~~q~~~~~aq~e~~ 635 (756) ++ ...+|....+.+ .....-+.+ T Consensus 492 i~~E~~~~~~~~~~~~~--~~~~~gd~e 517 (517) T protein:vir:98 492 IRKDQIELDPVTISQRA--QKRMFGDEE 517 (517) T ss_pred HHHhccccCCCCccccc--cCCCCCCCC Confidence 11 111111100000 000000000 No 70 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=99.59 E-value=6.3e-13 Score=87.46 Aligned_cols=461 Identities=13% Similarity=0.069 Sum_probs=210.1 Q ss_pred CCcccCCCCCCCccccccccCCCchH-HHHHHHHHHHHHH-HHhhHHHHHHHHHHHHhccccC-CCCCCCCCCCc---cc Q lcl|NC_019423. 
1 MEHQDTFKPLPDPAQSEKLTDWKKEP-SIQLLKGDLESAK-PAHDAIMSQIREWNDLMEVKGK-AKPPKIKGRSQ---VQ 74 (756) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~l~~~~~~a~-~~~~~~~~~~~~~~~~y~~~~~-~~~~~~~grS~---~v 74 (756) |.=-+.+ |.-..+|.+-. +...|+.-.++-+ ...+.++.+.++|..||.|+.. .....+.|+.+ .. T Consensus 1 m~~~~~i--------k~~~~~~~~~~~~~~~~~~i~d~~~i~~~~~~~~~i~~~~~~Y~g~~~~l~~~~~~~~~~~~~~~ 72 (505) T protein:vir:79 1 MAFWDTL--------KNLFRKGSAAVGMTKSLGQIIDDPRINLPADEVERIARDKRYYMDDFKQVTHKNSYGDTQKHELQ 72 (505) T ss_pred CchHHHH--------HHHHHHhhhhhcchhhhhhhhcccCCCCCHHHHHHHHHHHHHhcCCCccccccccCCCcccccee Confidence 1111111 00000111100 0111111111111 1123455667889999987532 11122233222 12 Q ss_pred CHHHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeee Q lcl|NC_019423. 75 PRLVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWE 154 (756) Q Consensus 75 ~~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~ 154 (756) +--+...|- -.+.+.+|+-..-|.+ +|. +.+++|+-++. .|+-...++.++..|+..|.+++|+||+ T Consensus 73 slnl~~~i~---~~~A~ll~~e~~~i~~-----~d~----~~~e~l~~i~~-~n~f~~~~~~~~e~a~a~G~~~~k~~~D 139 (505) T protein:vir:79 73 SVNVTKLAS---AKLASLIFNEQCQVTV-----SDE----TANDFLDDVFQ-QNDFYTTFEEKLEEWIALGSGCVRPYVD 139 (505) T ss_pred ecchHHHHH---HHHHhhhcCCCceeec-----CCh----HHHHHHHHHHH-hccHHHHHHHHHHHHhhcCCeEEEEEEe Confidence 211112222 2222333444434443 343 34556766644 3344556778999999999999999885 Q ss_pred eeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeec Q lcl|NC_019423. 155 RKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALV 234 (756) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~ 234 (756) . 
T Consensus 140 ~------------------------------------------------------------------------------- 140 (505) T protein:vir:79 140 S------------------------------------------------------------------------------- 140 (505) T ss_pred C------------------------------------------------------------------------------- Confidence 1 Q ss_pred CceeEEEechhheEeC-CCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhcccccccccccc Q lcl|NC_019423. 235 NRPTVEMLNPNNVVID-PSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDA 313 (756) Q Consensus 235 g~~~ie~V~p~~~~~D-p~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 313 (756) |+++|+.|+++.|++= .+. .++..|-|+.+.+..... T Consensus 141 ~~~~i~~v~ad~~~P~~~d~-~~~~~~a~~~~~~~~~~~----------------------------------------- 178 (505) T protein:vir:79 141 GKIKLAWATADQVYPLQADT-NQVNELAIASRTTEVENH----------------------------------------- 178 (505) T ss_pred CceEEEEEcCCeeEEEEEcC-CCeEEEEEEEEEEEecCC----------------------------------------- Confidence 2256777888777641 111 234445444222111100 Q ss_pred ccceEEEEEEEEEeeccCCceeEEEEEEEEC------CEE-----------EEecccccCCCccceEEee----eeeecC Q lcl|NC_019423. 314 LRKKVVAYEYWGFYDINDDGSLEPIVATWIG------STL-----------IRMENNPFPDGKLPLVVVP----YMPRKR 372 (756) Q Consensus 314 s~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g------~~~-----------L~~~~~P~~~~~~Pfv~~~----~~~~~~ 372 (756) ...-.+++|+|...+ +.+.... -.|.+ +.. |..+.......+.+|+.++ .....+ T Consensus 179 ~~~~yt~lE~h~~~~--~~~~I~n--~ly~~~~~~~lG~~v~l~~~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~ 254 (505) T protein:vir:79 179 RTIYYTLLEFHQWDH--GDYVITN--ELYRSEAAETVGINVPLNSLEQYEGLEPQVKITGLKHPLFAFYRNKGANNKNFT 254 (505) T ss_pred cceEEEEEEEEEecC--ceEEEEE--EEEecCCCCccCcccchhhcccccccCcceeecCCCcceEEEecCCcccccccC Confidence 000122333332110 1111111 00100 000 0000000112233455443 233457 Q ss_pred cccCCchHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhh----------hhccccccccccccc-ccc Q lcl|NC_019423. 
373 ELFGEADAELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRR----------YDDGQDYEYNPMQGN-PSQ 441 (756) Q Consensus 373 ~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~----------~~~~~~~~~~~~~~~-~~~ 441 (756) +++|.|++.++++..+.+|..+++..+.+.+ ++.++.+++..+....... ++. ....+..+..+ ... T Consensus 255 splG~S~~~~~~~~id~lD~~~s~~~~e~~~-g~~~i~v~~~~l~~~~~~~~~~~~~~~~~fd~-~~~~y~~~~~~~~~~ 332 (505) T protein:vir:79 255 SPMGMSLIDNSYTVIDAINRTHDQFVDEVKK-GQRRLIVPAEWLKTGSSYGGQASETHPPMFDP-DETVYQAMYGDASEV 332 (505) T ss_pred CccCCchhhhhHHHHHHHHHHHHHHHHHHHh-cccceeechHHhcccCCCCcccccccccCCCc-cceeeeeccCCCCCC Confidence 7899999999999999999999999988754 5667788776653211100 000 01111111111 223 Q ss_pred ccccccCCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 442 SIMEHKFPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKIC 521 (756) Q Consensus 442 ~i~~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l 521 (756) .+..+++.-....+...++.+...+...+|++....|.++.. .+||+++....+..-.....+.+.|..+++.+.+.++ T Consensus 333 ~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~-~~TAtei~s~~~~l~~t~~~~~~~~~~al~~li~~i~ 411 (505) T protein:vir:79 333 GFHDATSPIRVADYQATMDFFLREFENQTGLSQGTFTTSPSG-IQTATEVVTNNSQTYQTRSSYITQVEKTIKALTYAIL 411 (505) T ss_pred ceEEecccCCHHHHHHHHHHHHHHHHHHhCCChhhcCCCccc-cchHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 455555333345566778888888888899998888876554 3688888776666666677778888889999999999 Q ss_pred HHHHhhCCCCc-EEEEecCceeecCHhHhcCcceEEEecccccH--HHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhh Q lcl|NC_019423. 522 AMNAVFLSEKE-VVRITNEQYVEIKREDLKGNFDIEVDINTAEI--DNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAEL 598 (756) Q Consensus 522 ~li~q~~~~~r-~iRI~g~~~v~i~~d~~~~~~Dv~V~~g~a~~--~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~ 598 (756) .+..-|.-..- ..+-.+ -..+++++|+-+.+.. .....++.+.+.+. ..++... -+++. 
T Consensus 412 ~~~~~~~~~~~g~~~~~~----------~~~~~~i~v~f~d~i~~d~~~~~~~~~~~v~~--Gi~s~e~------~l~~~ 473 (505) T protein:vir:79 412 ELASVPSFYADGQARWTG----------DVDSLDITINFNDGVFVDQESKRAADLQAVQA--QVMPKKQ------FLMRN 473 (505) T ss_pred HHHHHhcccccccccccC----------CCCceeEEEEeCCCCCCCHHHHHHHHHHHHHc--CCCCHHH------HHHhc Confidence 88776542110 000000 0124455554443322 22222323332221 1223221 12334 Q ss_pred cCChh--HHHHhhhccCCCChhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 599 KRMPD--LAHELRTWQPQPDPMEEQLKQLAIQKAQLENEELQSKIALNNAKAKEAASSGD 656 (756) Q Consensus 599 ~~~~~--~~~~l~~~~~q~~p~~~~~~q~~~~~aq~e~~~~qa~a~~~~a~a~~~~aq~~ 656 (756) .+..+ +.+.+.+++........ .. ...-.+ T Consensus 474 ~~~~eeea~~el~ri~~E~~~~~p---~~-------------------------~~~gg~ 505 (505) T protein:vir:79 474 YGLDEEEADEWLAQIDAENSTAEP---EF-------------------------NQFGGD 505 (505) T ss_pred CCCChHHHHHHHHHHHHhccccCC---Cc-------------------------hhccCC Confidence 44432 22222222111000000 00 000000 No 71 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=99.58 E-value=3.6e-14 Score=94.25 Aligned_cols=461 Identities=10% Similarity=0.065 Sum_probs=202.4 Q ss_pred CCcccCCCCCCCcc--ccccccCCC-chHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCC--CCC---------C Q lcl|NC_019423. 1 MEHQDTFKPLPDPA--QSEKLTDWK-KEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKA--KPP---------K 66 (756) Q Consensus 1 ~~~~~~~~~~~~~~--~~~~~~~~~-~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~--~~~---------~ 66 (756) ++.-.-+.|..-+. ....+-... +.++ +...+....+.|...+.+..+..+||.|.-.- .+. . 
T Consensus 15 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~ 91 (492) T protein:vir:97 15 IKGGNILYPSQPTQTEIFDAIVRTNNKPET---LEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDP 91 (492) T ss_pred hcCCceeeccchhhhhHhhhcccCCCchhh---HHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccccccccccccc Confidence 11111122211111 111111111 2222 23345555556777777888899999986311 011 1 Q ss_pred CCCCCcccCHHHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCc Q lcl|NC_019423. 67 IKGRSQVQPRLVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGT 146 (756) Q Consensus 67 ~~grS~~v~~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~ 146 (756) .+-..+++.+..+..|+.....| +|.+ +.|.+ +|.+..+ +++..+ +|+-......+.++++++|. T Consensus 92 ~~~~~ri~~n~~k~Ivd~~~~yl----~g~p--~~~~~---~d~~~~~----~l~~~~--~n~~~~~~~~~~~~~~~~G~ 156 (492) T protein:vir:97 92 LKPDDRMITNFHANLVDQKVSYI----VGKP--IAFKH---TDDEVVK----RIDEVL--GNRFDDKLHSVLTGASNKGI 156 (492) T ss_pred cccccccccchHHHHHHHHhhhh----cccC--ceecc---CchHHHH----HHHHHH--hccHHHHHHHHHHHHhhcCe Confidence 12223677888887777776655 5544 33432 3444333 444332 24444556678899999988 Q ss_pred eEEEEeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeE Q lcl|NC_019423. 147 GIARIGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTE 226 (756) Q Consensus 147 gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~ 226 (756) +.+.+|++. T Consensus 157 a~~~v~~d~----------------------------------------------------------------------- 165 (492) T protein:vir:97 157 EWLHPYLDE----------------------------------------------------------------------- 165 (492) T ss_pred EEEEEEecC----------------------------------------------------------------------- Confidence 876654320 Q ss_pred EEeeeeecCceeEEEechhheEe--CCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccc Q lcl|NC_019423. 
227 VEVEKALVNRPTVEMLNPNNVVI--DPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKT 304 (756) Q Consensus 227 ~~~~~~~~g~~~ie~V~p~~~~~--Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 304 (756) .|++++..++|.++++ |++.... ..+. .+.|..... . ... T Consensus 166 -------dg~~~~~~~~p~~~~~i~d~~~~~~---~~~~-vr~~~~~~~-------------------------~--~~~ 207 (492) T protein:vir:97 166 -------EGEFKLFRVPAEQGIPIWTDKEHEE---LEAF-IRMYKLENE-------------------------T--KVE 207 (492) T ss_pred -------CCceEEEEEcccceEEEEcCCCCCc---eEEE-EEEEeeccc-------------------------e--eEE Confidence 1346778889988765 4333222 2222 233321000 0 000 Q ss_pred cccccccccccceEEEEEEEEEeeccCCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhH Q lcl|NC_019423. 305 PSDFQFKDALRKKVVAYEYWGFYDINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLG 384 (756) Q Consensus 305 ~~~~~~~d~s~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~ 384 (756) . + +. .+|..+++. +++. ....-...+...+...++ ..+.+|++.+.. +.+|.|.+..++ T Consensus 208 ~----y-~~--~~v~~~~~~------~~~~-~~~~~~~~~~~~~~~~~~--~~g~vPvv~~~n-----n~~g~sd~e~v~ 266 (492) T protein:vir:97 208 Y----W-DK--VTVNYYVYE------NGSL-IPDYSNNLENSKTHFSTG--SWGKIPFIPFKN-----NDLEISDIFMYK 266 (492) T ss_pred E----E-ec--CeEEEEEEe------cCee-eecccccccccccccccC--CCCCcceEEecC-----CCCCCCchHhHH Confidence 0 0 00 011111111 1111 000000111122222333 346677776643 446899999999 Q ss_pred HHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhhhhccccccccccccccccccccccCCCcchHHHHHHHHHHH Q lcl|NC_019423. 385 DNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQGNPSQSIMEHKFPELPQSAIVMTQMQNQ 464 (756) Q Consensus 385 d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l~~~~~ 464 (756) ++++.+|..++.+.+.+...+.+.+.+.-...+...+..... 
.....+....++.+.++..+.-.......++.+.+ T Consensus 267 ~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~L~~ 343 (492) T protein:vir:97 267 TLIDAYNRRLSDLSNTFKDSNELTYVLKNYDDQELPEFKRLL---RYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQ 343 (492) T ss_pred HHHHHHHHHHHHHHHHHHHhccceeeeecCCcccchhHHHHH---hhccceecCCCCcceeEeccCCHHHHHHHHHHHHH Confidence 999999999999999999888886655321111111111111 11122233344456665544444566777888899 Q ss_pred HHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecCceeec Q lcl|NC_019423. 465 EAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNEQYVEI 544 (756) Q Consensus 465 ~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i 544 (756) .+-..|++++.+.+.-++. .++.++...............+.|..++++++++++.++... + ++ T Consensus 344 ~I~~~s~~p~~~~~~~~~n--~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~----------~-~~--- 407 (492) T protein:vir:97 344 KIMLFGQAVDFSSDKFGSA--PSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIK----------G-EH--- 407 (492) T ss_pred HHHHHhCCCCCCccccccC--cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC----------c-cc--- Confidence 9999999887765533222 233335444455555566666677777777777666554211 1 11 Q ss_pred CHhHhcCcceEEEecccccHHHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhc-CChhHHHHhhhccCCCChhhhhHH Q lcl|NC_019423. 545 KREDLKGNFDIEVDINTAEIDNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELK-RMPDLAHELRTWQPQPDPMEEQLK 623 (756) Q Consensus 545 ~~d~~~~~~Dv~V~~g~a~~~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~-~~~~~~~~l~~~~~q~~p~~~~~~ 623 (756) .++.|.-.+.......+ .+..+..+...++.+. +++.. 
...+...-++ T Consensus 408 --------~~i~v~f~~~~p~~~~e--~a~~~~kl~G~iS~et-------~l~~l~~v~d~~~Ele-------------- 456 (492) T protein:vir:97 408 --------KDVDISFNYNKVANTEL--QVQTAQQSMGIVSHET-------VLENHPFVEDLQAELE-------------- 456 (492) T ss_pred --------ceeeEEecCCCCCCHHH--HHHHHHHHhccCchHH-------HHHhCCCCCCHHHHHH-------------- Confidence 12333323222211111 1122222222233211 11111 1222111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 624 QLAIQKAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARD 674 (756) Q Consensus 624 q~~~~~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~ 674 (756) ++..++.+ ....++. ... ...+ ..+..+.. ..+. .+ T Consensus 457 ri~~E~~~-~~~~~~~--------~~~--~~~~--~~~~~~~~-~~~~-~e 492 (492) T protein:vir:97 457 RIEQEQTE-YNKQLPN--------LDD--GGAD--SAQQQERS-NNKE-SE 492 (492) T ss_pred HHHHHHHH-HHHhhhc--------ccc--CCCC--CCcccccc-cccc-cC Confidence 11110000 0000000 000 0000 00000000 0000 00 No 72 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=99.58 E-value=8e-14 Score=92.38 Aligned_cols=470 Identities=10% Similarity=0.026 Sum_probs=208.4 Q ss_pred CC--------------cccCCCCCCCccccccccCCCchHHHH-HHHHHHHHHHHHhhHHH-HHHHHHHHHhccccCC-- Q lcl|NC_019423. 1 ME--------------HQDTFKPLPDPAQSEKLTDWKKEPSIQ-LLKGDLESAKPAHDAIM-SQIREWNDLMEVKGKA-- 62 (756) Q Consensus 1 ~~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~~-~l~~~~~~a~~~~~~~~-~~~~~~~~~y~~~~~~-- 62 (756) |- .+..|++ +...-..|...+... ....++......|.... 
.+.++..+||.|.-.- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~-----~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~ 75 (511) T protein:vir:96 1 MLKVNEFETDTDLRGNINYLFND-----EANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLV 75 (511) T ss_pred Cccccchhhhhhhhhhhhhhhhh-----hhCCccccchhhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCcccc Confidence 11 1112211 011111243322211 11233555555555444 3577888999876221 Q ss_pred --C--CCCCCCCCcccCHHHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHH Q lcl|NC_019423. 63 --K--PPKIKGRSQVQPRLVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYV 138 (756) Q Consensus 63 --~--~~~~~grS~~v~~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v 138 (756) . +.+.+...+++.+.....|+....-| +|.+.-+ . .+|.+.. ++++-++. .|+--.....+. T Consensus 76 ~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl----~g~p~~~--~---~~~~~~~----~~l~~~~~-~n~~~~~~~~~~ 141 (511) T protein:vir:96 76 ELTRRKEEYMADNRVAHDYASYISDFINGYF----LGNPIQY--Q---DDDKDVL----EAIEAFND-LNDVESHNRSLG 141 (511) T ss_pred ccCcCcccccCcceeecchHHHHHHHHHhhh----ccCCcee--e---cCchHHH----HHHHHHHh-hcCHHHHHHHHH Confidence 1 11122334677777777776665444 6644433 2 2343332 34544433 344444456788 Q ss_pred HHHhhcCceEEEEeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcce Q lcl|NC_019423. 139 HSIVDDGTGIARIGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATY 218 (756) Q Consensus 139 ~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~ 218 (756) +++++.|.+.+.+|++. T Consensus 142 ~~~~i~G~a~~~vy~de--------------------------------------------------------------- 158 (511) T protein:vir:96 142 LDLSIYGKAYELMIRNQ--------------------------------------------------------------- 158 (511) T ss_pred HHHHhcCeeEEEEEeCC--------------------------------------------------------------- Confidence 99999998877765530 Q ss_pred eccCceeEEEeeeeecCceeEEEechhheEe--CCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhh Q lcl|NC_019423. 
219 AIQTGVTEVEVEKALVNRPTVEMLNPNNVVI--DPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPIT 296 (756) Q Consensus 219 ~~~~g~~~~~~~~~~~g~~~ie~V~p~~~~~--Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~ 296 (756) .|.+++..++|.++++ |..... .. ..+.+.+.+.. .+ T Consensus 159 ---------------d~~~~i~~~~p~~~~~vydd~~~~---~~-~~~vr~~~~~~-----------~d----------- 197 (511) T protein:vir:96 159 ---------------DDETRLYKSDAMSTFVIYDNTIER---NS-IAGVRYLRTKP-----------ID----------- 197 (511) T ss_pred ---------------CCceEEEEEccceeEEEEcCCCCC---ce-EEEEEEEEeee-----------cc----------- Confidence 1346777888888764 433211 12 22223332210 00 Q ss_pred chhhhccccccccccccccceEEEEEEEEEeeccCCceeEEEEEEEECCEEE-----EecccccCCCccceEEeeeeeec Q lcl|NC_019423. 297 DPDHESKTPSDFQFKDALRKKVVAYEYWGFYDINDDGSLEPIVATWIGSTLI-----RMENNPFPDGKLPLVVVPYMPRK 371 (756) Q Consensus 297 ~~~~~~~~~~~~~~~d~s~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g~~~L-----~~~~~P~~~~~~Pfv~~~~~~~~ 371 (756) +.....+..+|+|.. +++ ++++..++..+ .....|.+.+.+|++.++ T Consensus 198 ---------------~~~~~~~~~~~iyt~-----~~i---~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~----- 249 (511) T protein:vir:96 198 ---------------KTDEDEVFTVDLFTS-----HGV---YRYLTSRTNGLKLTPRENGFESHSFERMPITEFS----- 249 (511) T ss_pred ---------------ccccceEEEEEEEeC-----CcE---EEEEecCCCcccccccccccccccCCceeeEEec----- Confidence 000112334455532 111 12222221111 112223344566666554 Q ss_pred CcccCCchHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhhhh-ccc--------cccccccccccccc Q lcl|NC_019423. 372 RELFGEADAELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYD-DGQ--------DYEYNPMQGNPSQS 442 (756) Q Consensus 372 ~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~-~~~--------~~~~~~~~~~~~~~ 442 (756) ++.+|.|.+..++++++.+|..++.+.+.+...+++.+++.-............ ... ....+......... 
T Consensus 250 nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 329 (511) T protein:vir:96 250 NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVD 329 (511) T ss_pred CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCccCCchhhcccccccceecccccccccccccCCCCcc Confidence 345789999999999999999999999999888887665443222211111100 000 00011111122334 Q ss_pred cccccCCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 443 IMEHKFPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICA 522 (756) Q Consensus 443 i~~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~ 522 (756) +.++..+.-.......+..+.+.+...|++++.+.+.-+.. .++.++...............+.|..++++++++++. T Consensus 330 ~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~~n--~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~ 407 (511) T protein:vir:96 330 GGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGT--QSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLET 407 (511) T ss_pred eeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccccc--chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 55555544456677788889999999999998876543222 2444465566666677777778888888888888877 Q ss_pred HHHhhCCCCcEEEEecCceeecCHhHhcCcceEEEecccccHH--HHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhc- Q lcl|NC_019423. 523 MNAVFLSEKEVVRITNEQYVEIKREDLKGNFDIEVDINTAEID--NQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELK- 599 (756) Q Consensus 523 li~q~~~~~r~iRI~g~~~v~i~~d~~~~~~Dv~V~~g~a~~~--~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~- 599 (756) ++........ . .++ .+|.+.-...... ...++.+ ..+...++.+.. ++.. 
T Consensus 408 ~~~~~~~~~~-----~--------~d~---~~i~~~f~~~~p~n~~e~~~~~----~kl~G~iS~et~-------l~~l~ 460 (511) T protein:vir:96 408 ILKNTWSIDA-----N--------KDF---NTVRYVYNRNLPKSLIEELKAY----IDSGGKISQTTL-------MSLFS 460 (511) T ss_pred HHHhhcCccc-----c--------ccc---ccceEEeCCCCCCCHHHHHHHH----HHHhccCChHHH-------HHhCC Confidence 7643221100 0 011 1233333322221 1112211 111222332221 1111 Q ss_pred CChhHHHHhhhccCCCChhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 600 RMPDLAHELRTWQPQPDPMEEQLKQLAIQKAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARDMEKQ 678 (756) Q Consensus 600 ~~~~~~~~l~~~~~q~~p~~~~~~q~~~~~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~~~~ 678 (756) ...+...-+++ +..++.. .++.. +.....+.......+.....+.. .-+.. T Consensus 461 ~v~D~~~E~~r--------------i~~E~~~----~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~-~~~~~ 511 (511) T protein:vir:96 461 FFQDPELEVKK--------------IEEDEKE----SIKKA---------QKGIYKDPRDINDDEQDDDTKDT-VDKKE 511 (511) T ss_pred CCCCHHHHHHH--------------HHHHHHH----HHHHH---------hhccccCCCCCCCCCCCCccccc-ccccC Confidence 12221111111 1110000 00000 00000000000000000000000 00000 No 73 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=99.57 E-value=7.5e-14 Score=92.54 Aligned_cols=470 Identities=10% Similarity=0.023 Sum_probs=206.6 Q ss_pred CC--------------cccCCCCCCCccccccccCCCchHHH-HHHHHHHHHHHHHhhHHH-HHHHHHHHHhccccCC-- Q lcl|NC_019423. 1 ME--------------HQDTFKPLPDPAQSEKLTDWKKEPSI-QLLKGDLESAKPAHDAIM-SQIREWNDLMEVKGKA-- 62 (756) Q Consensus 1 ~~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~-~~l~~~~~~a~~~~~~~~-~~~~~~~~~y~~~~~~-- 62 (756) |- .+..|++ . ...-..|...+-. .....++....+.|.... 
.+.++..+||.|.-.- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~----~-~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~ 75 (511) T protein:vir:93 1 MLKVNEFETDTDLRGNINYLFND----E-ANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLV 75 (511) T ss_pred Cccccchhhhhhhhhhhhhhhhh----h-hCCcccccchhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCcccc Confidence 11 1112211 0 1111134322221 112234555566665444 4567788999876321 Q ss_pred --CCCCCCC--CCcccCHHHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHH Q lcl|NC_019423. 63 --KPPKIKG--RSQVQPRLVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYV 138 (756) Q Consensus 63 --~~~~~~g--rS~~v~~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v 138 (756) .....++ .-+++.+.....|+....-| +|.+.- |. .+|.+.-+ +++-++ ..|+--.....+. T Consensus 76 ~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl----~g~p~~--~~---~~d~~~~~----~l~~~~-~~n~~~~~~~~~~ 141 (511) T protein:vir:93 76 ELTRRKEEYMADNRVAHDYASYISDFINGYF----LGNPIQ--YQ---DDDKDVLE----VIEAFN-DLNDVESHNRSLG 141 (511) T ss_pred ccCcCcccccCcceeecchHHHHHHHHhhhh----cccCee--ec---cCChHHHH----HHHHHH-hhcCHhHHHHHHH Confidence 1111122 24567776666666655444 554433 32 23443322 343332 2343344456788 Q ss_pred HHHhhcCceEEEEeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcce Q lcl|NC_019423. 139 HSIVDDGTGIARIGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATY 218 (756) Q Consensus 139 ~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~ 218 (756) +++++.|.+.+.+|++. T Consensus 142 ~~~~~~G~ay~~vy~de--------------------------------------------------------------- 158 (511) T protein:vir:93 142 LDLSIYGKAYELMIRNQ--------------------------------------------------------------- 158 (511) T ss_pred HHHHhcCeeEEEEEeCC--------------------------------------------------------------- Confidence 99999998877765530 Q ss_pred eccCceeEEEeeeeecCceeEEEechhheE--eCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhh Q lcl|NC_019423. 
219 AIQTGVTEVEVEKALVNRPTVEMLNPNNVV--IDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPIT 296 (756) Q Consensus 219 ~~~~g~~~~~~~~~~~g~~~ie~V~p~~~~--~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~ 296 (756) .|.|++..++|.+++ ||..... . ...+.+.|.+.. .+ T Consensus 159 ---------------~~~~~i~~~~p~~~~~vydd~~~~---~-~~~~vr~~~~~~-----------~~----------- 197 (511) T protein:vir:93 159 ---------------DDETRLYKSDAMSTFVIYDNTIER---N-SIAGVRYLRTKP-----------ID----------- 197 (511) T ss_pred ---------------CCceEEEEEccceeEEEEcCCCCC---c-eEEEEEEEEeee-----------cc----------- Confidence 133667888888876 4544321 1 122333332210 00 Q ss_pred chhhhccccccccccccccceEEEEEEEEEeeccCCceeEEEEEEEECCEEE-----EecccccCCCccceEEeeeeeec Q lcl|NC_019423. 297 DPDHESKTPSDFQFKDALRKKVVAYEYWGFYDINDDGSLEPIVATWIGSTLI-----RMENNPFPDGKLPLVVVPYMPRK 371 (756) Q Consensus 297 ~~~~~~~~~~~~~~~d~s~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g~~~L-----~~~~~P~~~~~~Pfv~~~~~~~~ 371 (756) +.....+..+|+|.. +++ +++...++..+ ...+.|.+.+.+|++.++ T Consensus 198 ---------------~~~~~~~~~~~iyt~-----~~i---~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~----- 249 (511) T protein:vir:93 198 ---------------KTDEDEVFTVDLFTS-----HGV---YRYLTSRTNGLKLTPRENGFESHSFERMPITEFS----- 249 (511) T ss_pred ---------------ccccceEEEEEEEeC-----CcE---EEEEecCCCccccccccccccccCCCccceEEec----- Confidence 001112344555532 111 11222221111 111223334567776554 Q ss_pred CcccCCchHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhhhhc-cc------cc--cccccccccccc Q lcl|NC_019423. 372 RELFGEADAELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDD-GQ------DY--EYNPMQGNPSQS 442 (756) Q Consensus 372 ~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~-~~------~~--~~~~~~~~~~~~ 442 (756) ++.+|.|.++.++++++.+|..+|.+.+.+...+++.+++.-............. .. .. .........+.. 
T Consensus 250 nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 329 (511) T protein:vir:93 250 NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVD 329 (511) T ss_pred CCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCcccCchhhcccccccceecccccccccccccCCCCcc Confidence 3456899999999999999999999999998888876654421211111111000 00 00 001111222344 Q ss_pred cccccCCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 443 IMEHKFPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICA 522 (756) Q Consensus 443 i~~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~ 522 (756) +.++..+.-.......+..+.+.+-..|++++.+.+.-+... ++.++...............+.|..++++++++++. T Consensus 330 ~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~P~~~~~~~~~n~--Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~ 407 (511) T protein:vir:93 330 GGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQ--SGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLET 407 (511) T ss_pred eeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccc--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 555554444566677888889999999999987765332222 344455555666666677777888888888888887 Q ss_pred HHHhhCCCCcEEEEecCceeecCHhHhcCcceEEEecccccHH--HHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhh-c Q lcl|NC_019423. 523 MNAVFLSEKEVVRITNEQYVEIKREDLKGNFDIEVDINTAEID--NQKSQDLGFMVQTLGNTVDQSITLSLVAKIAEL-K 599 (756) Q Consensus 523 li~q~~~~~r~iRI~g~~~v~i~~d~~~~~~Dv~V~~g~a~~~--~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~-~ 599 (756) ++........ +.++ .+|.+.-...... ...++.+ ..+...++.+.+ ++. 
+ T Consensus 408 ~l~~~~~~~~-------------~~d~---~~i~~~f~~~~p~n~~e~~~~~----~kl~g~iS~et~-------~~~l~ 460 (511) T protein:vir:93 408 ILKNTWSIDA-------------NKDF---NTVRYVYNRNLPKSLIEELKAY----IDSGGKISQTTL-------MSLFS 460 (511) T ss_pred HHHhccCccc-------------cccc---ccceEEeCCCCCCCHHHHHHHH----HHHhccCchHHH-------HHhCC Confidence 7643322110 0011 1233333322221 1122222 222222333221 111 1 Q ss_pred CChhHHHHhhhccCCCChhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 600 RMPDLAHELRTWQPQPDPMEEQLKQLAIQKAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARDMEKQK 679 (756) Q Consensus 600 ~~~~~~~~l~~~~~q~~p~~~~~~q~~~~~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~~~~~ 679 (756) ...+...-+++ +..++.. .++. . +.....+.......+.. .+..+-.. + T Consensus 461 ~v~d~~~E~~r--------------i~~E~~~----~~~~--~-------~~~~~~~~~~~~~~~~~---~~~~~~~~-~ 509 (511) T protein:vir:93 461 FFQDPELEVKK--------------IEEDEKE----SIKK--A-------QKGIYKDPRDINDDEQD---DDTKDTVD-K 509 (511) T ss_pred CCCCHHHHHHH--------------HHHHHHH----HHHH--H-------hhhcccCCCCCCCCCCC---Cccccccc-c Confidence 22222111111 1110000 0000 0 00000000000000000 00000000 0 Q ss_pred HH Q lcl|NC_019423. 680 AQ 681 (756) Q Consensus 680 ~q 681 (756) +. T Consensus 510 ~~ 511 (511) T protein:vir:93 510 KE 511 (511) T ss_pred cC Confidence 00 No 74 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=99.57 E-value=1e-12 Score=86.38 Aligned_cols=454 Identities=12% Similarity=0.075 Sum_probs=214.1 Q ss_pred CCcccCCCCCCCccccccccCCCchHHHHHHHHHHHH-----------------H-HHHhhHHHHHHHHHHHHhccccCC Q lcl|NC_019423. 1 MEHQDTFKPLPDPAQSEKLTDWKKEPSIQLLKGDLES-----------------A-KPAHDAIMSQIREWNDLMEVKGKA 62 (756) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~-----------------a-~~~~~~~~~~~~~~~~~y~~~~~~ 62 (756) |. ++..|+.-|+. 
- ..--..++.+.++|.+||.|.... T Consensus 1 m~------------------------~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~ri~~~~~~y~g~~~~ 56 (508) T protein:vir:15 1 MG------------------------LIQRIKDLFWKGAAATGVTGSLSKITDDPRISIDPDEYVRIQTDLDYYSDKLQY 56 (508) T ss_pred CC------------------------hHHHHHHHHHHHHHHhccccchHHhhcccccccCHHHHHHHHHHHHHhcCCCcc Confidence 11 11122222111 0 111234456688899999976321 Q ss_pred -C----CCCCCCCCcccCHHHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHH Q lcl|NC_019423. 63 -K----PPKIKGRSQVQPRLVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDY 137 (756) Q Consensus 63 -~----~~~~~grS~~v~~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~ 137 (756) . ....+.|-....+..+..++ -+++| +|+-..-+.+. +|. .+.++|+-++. .|+-...++.+ T Consensus 57 ~~~~~~~~~~~~~~~~sln~~~~i~~-~~A~l---v~~e~~~i~v~----~~~----~~~e~l~~il~-~n~f~~~~~~~ 123 (508) T protein:vir:15 57 IHYQASDGIKKKRLKNTINMAKTAAR-RIASV---VFNEKAEIHVK----DNN----EADKFLNDVLE-DNDFKNKFEEA 123 (508) T ss_pred cccccCCCCccccceeecchHHHHHH-HHHhh---hhCCCceEEeC----Cch----HHHHHHHHHHH-hccHHHHHHHH Confidence 1 11112222233344433333 23333 34443334432 122 22346665544 34445557789 Q ss_pred HHHHhhcCceEEEEeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcc Q lcl|NC_019423. 138 VHSIVDDGTGIARIGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEAT 217 (756) Q Consensus 138 v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~ 217 (756) +.+|+..|.+++|+||+. T Consensus 124 ~e~a~a~G~~~~k~~~d~-------------------------------------------------------------- 141 (508) T protein:vir:15 124 LEKGVALGGFAMRPYIDG-------------------------------------------------------------- 141 (508) T ss_pred HHHHhhcCceEEEEEEeC-------------------------------------------------------------- Confidence 999999999999998851 Q ss_pred eeccCceeEEEeeeeecCceeEEEechhheEe-CCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhh Q lcl|NC_019423. 
218 YAIQTGVTEVEVEKALVNRPTVEMLNPNNVVI-DPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPIT 296 (756) Q Consensus 218 ~~~~~g~~~~~~~~~~~g~~~ie~V~p~~~~~-Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~ 296 (756) ++++|+.|+++.||+ ..+. .++..|-|+.+.....- T Consensus 142 -----------------~~~~i~~v~ad~~~P~~~d~-~~~~~~af~~~~~~~~~------------------------- 178 (508) T protein:vir:15 142 -----------------NHIKIAWVRADQFYPLQSNT-NDISEAAIASRTQRTES------------------------- 178 (508) T ss_pred -----------------CeeEEEEEcCCeeEEEEEcC-CCeEEEEEEEEEEeecC------------------------- Confidence 125577777777763 1122 22333433322211100 Q ss_pred chhhhccccccccccccccceEEEEEEEEEeeccCCceeEEEEEE-----EECCEE-EE-------ecccc-cC-CCccc Q lcl|NC_019423. 297 DPDHESKTPSDFQFKDALRKKVVAYEYWGFYDINDDGSLEPIVAT-----WIGSTL-IR-------MENNP-FP-DGKLP 361 (756) Q Consensus 297 ~~~~~~~~~~~~~~~d~s~~~V~v~E~w~k~d~~~~g~~~~~~~~-----~~g~~~-L~-------~~~~P-~~-~~~~P 361 (756) .....++.+|+|...+ ++.|...+...- -.|..+ |. ..+.. +. ..+.| T Consensus 179 ----------------~~~~~yt~lE~h~~~~-~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~~l~~~~~~~g~~~p~ 241 (508) T protein:vir:15 179 ----------------NQTKYYTLLEFHQWQD-NGSYQITNELYKSDSPDIVGNQVPLSTLPVYKELAPQVTISGLQRPL 241 (508) T ss_pred ----------------CCceEEEEEEEEEEec-CcceEEEEEEEecCCchhcCcccchhhcccccCCCcceEecCCCcce Confidence 0000122223222110 011111110000 000000 00 00000 01 12234 Q ss_pred eEEeee----eeecCcccCCchHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccch--hhhhcccccccccc Q lcl|NC_019423. 362 LVVVPY----MPRKRELFGEADAELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNR--RRYDDGQDYEYNPM 435 (756) Q Consensus 362 fv~~~~----~~~~~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~--~~~~~~~~~~~~~~ 435 (756) |+.+.. ....++.+|.|++.++++.++.+|..+++..+.+ ..+.+++.++++.+..... ..+.. 
....+..+ T Consensus 242 f~y~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~-~~~~~~i~v~~~~l~~d~~~~~~~~~-~~~~~~~~ 319 (508) T protein:vir:15 242 FAYFKTPGANNINIESPLGLGVVDNAKHVLDDINDTHDQFIWEI-RLGQKHIAVQPGMLRFDDEHKPTFDT-EQNVYVGV 319 (508) T ss_pred eEEecCCccccccCCCCcCCchHhhhHHHHHHHHHHHHHHHHHH-HhcccceeechHHhcCCCCCccccCC-CCeeEEec Confidence 544432 2344688999999999999999999999999988 6778889998888753221 11111 11222222 Q ss_pred cccc--ccccccccCCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 436 QGNP--SQSIMEHKFPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGM 513 (756) Q Consensus 436 ~~~~--~~~i~~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~ 513 (756) .... +..++.+++.-....+...++.+...+....|++....|.++.. ..||+++....+..-+....+.+.|..++ T Consensus 320 ~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~~~~~~gls~~~f~~~~~~-~~TAtei~s~~~~~~~t~~~~~~~~~~al 398 (508) T protein:vir:15 320 LSDDNNGLGVKDMTTPIRTVQYKDAIDHFIKEFEVQIGLSTGTFSYSNDG-VKTATEVVSNNSMTYQTRSSYLTMVEKAI 398 (508) T ss_pred cCCCCCCCceeEeecccChHHHHHHHHHHHHHHHHHhCCCchhcccccCc-cccHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2222 23355544433345567778888889999999998888876554 36898888776666677777888899999 Q ss_pred HHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHhcCcceEEEecccc--cHHHHHHHHHHHHHHHhhccCCHhHHHHH Q lcl|NC_019423. 514 ADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDLKGNFDIEVDINTA--EIDNQKSQDLGFMVQTLGNTVDQSITLSL 591 (756) Q Consensus 514 ~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~~~~~Dv~V~~g~a--~~~~~~~q~l~~llq~~~~~~~~~~~~~~ 591 (756) +++.+.++.+..-+.--.- .....+.+....+++|+|+=+.+ .-.....+..+.+. +.| .++... 
T Consensus 399 ~~lv~~il~l~~~~~~~~~-------g~~~~~~~~~~~~~~v~v~f~D~i~~d~~~~~~~~~~~v-~aG-i~s~e~---- 465 (508) T protein:vir:15 399 DELCQSIFELANAGALFDD-------GKPLFTLDSASQPLDIECHFDDGVFVNKDKQLEEDAKVL-AIG-ALSKQT---- 465 (508) T ss_pred HHHHHHHHHHHHHhccccc-------cccccccccccCCcceEEEeCCCCCCCHHHHHHHHHHHH-hcC-CCCHHH---- Confidence 9999999988764432110 00111112223345565544333 22222223233322 112 223221 Q ss_pred HHHHHhhcCCh--hHHHHhhhcc---CCCChhhhhHHHHHHHHHHHH Q lcl|NC_019423. 592 VAKIAELKRMP--DLAHELRTWQ---PQPDPMEEQLKQLAIQKAQLE 633 (756) Q Consensus 592 l~~l~e~~~~~--~~~~~l~~~~---~q~~p~~~~~~q~~~~~aq~e 633 (756) -+++..|.. ++.+.+.++. ...++... ..-......-| T Consensus 466 --~i~~~~g~~deea~~el~ri~~E~~~~~~~~~--~~~~~~g~~ge 508 (508) T protein:vir:15 466 --FLQRNYGMTDEQAAEELAKIQSEAPTDTFEGG--RSAILNGGDGE 508 (508) T ss_pred --HHHhcCCCChHHHHHHHHHHHHhccccCcccc--ccccCCCCCCC Confidence 123344442 2222222221 11110000 00000000000 No 75 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=99.57 E-value=6.5e-14 Score=92.88 Aligned_cols=452 Identities=11% Similarity=0.093 Sum_probs=203.2 Q ss_pred CCcccCCCCCCCccccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCC--CCCC---------CC Q lcl|NC_019423. 1 MEHQDTFKPLPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAK--PPKI---------KG 69 (756) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~--~~~~---------~g 69 (756) |-+-=+ -|++.+-..+-+..+ +.........+....+.|...+.+..+..+||.|.-.-. +.+. 
+- T Consensus 1 ~~~~~~-~~~~~~~~~~~~~~~--~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~ 77 (474) T protein:vir:96 1 MINIIR-MPWDKPYGEEVVEQM--KPKVETQEEMIIRLINNHKQKLKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKP 77 (474) T ss_pred Cccccc-CCCCCCCCcchhhhc--cccccchHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccchhhhccccccccc Confidence 322211 133333222222111 122223333466666667777777888999999863111 0111 11 Q ss_pred CCcccCHHHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEE Q lcl|NC_019423. 70 RSQVQPRLVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIA 149 (756) Q Consensus 70 rS~~v~~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~ 149 (756) ..+++.+..+..|+....-| ||.+. .|.+ +|.+.. +.++..+ .++-......+++++++.|.+.+ T Consensus 78 ~~ki~~n~~k~Iv~~~~~yl----~g~p~--~~~~---~~~~~~----~~l~~~~--~n~~~~~~~~l~~~~~~~G~~~~ 142 (474) T protein:vir:96 78 DWRITTNFHQNLVDQKVSYV----AGKPV--TYAH---DDDKVL----DVIHQVL--DTRWDNKLIDILTAASNKGIDWL 142 (474) T ss_pred ccccccchHHHHHHhhhhhh----cccCc--eecc---CChHHH----HHHHHHH--hccHHHHHHHHHHHHhhCCeEEE Confidence 22467777677666665554 66553 3333 333333 3444443 24455556678899999999987 Q ss_pred EEeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEe Q lcl|NC_019423. 150 RIGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEV 229 (756) Q Consensus 150 k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~ 229 (756) .+|++. T Consensus 143 ~~~~d~-------------------------------------------------------------------------- 148 (474) T protein:vir:96 143 QVYINE-------------------------------------------------------------------------- 148 (474) T ss_pred EeeeCC-------------------------------------------------------------------------- Confidence 765531 Q ss_pred eeeecCceeEEEechhheEe--CCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhcccccc Q lcl|NC_019423. 
230 EKALVNRPTVEMLNPNNVVI--DPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSD 307 (756) Q Consensus 230 ~~~~~g~~~ie~V~p~~~~~--Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 307 (756) .|.+++..++|.++|+ |+... .+.-+ +.|.|.... T Consensus 149 ----~~~~~i~~~~p~~~~~v~d~~~~---~~~~a-~ir~~~~~~----------------------------------- 185 (474) T protein:vir:96 149 ----DGELKLFRVPAEQAIPIWTDKER---EQLNA-FIRIFTFNG----------------------------------- 185 (474) T ss_pred ----CCceEEEEEcccceEEEEcCCCC---CceEE-EEEEEeecC----------------------------------- Confidence 1336677788888764 33322 22222 223332100 Q ss_pred ccccccccceEEEEEEEEEe-----eccCCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHHH Q lcl|NC_019423. 308 FQFKDALRKKVVAYEYWGFY-----DINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAEL 382 (756) Q Consensus 308 ~~~~d~s~~~V~v~E~w~k~-----d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~ 382 (756) +..+|+|... ...+.+... ....++.....+..|...+.+|++.++. +..|.|.+.. T Consensus 186 ----------~~~~~vy~~~~i~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~vPvv~~~n-----n~~~~~d~e~ 247 (474) T protein:vir:96 186 ----------ETKVEYWTAETVTYYVYENGGLIP---DFYYGDEHIQTHFSTGSWERVPFIAFKN-----NPEEVSDIWM 247 (474) T ss_pred ----------eeEEEEEeCCeEEEEEEcCCceee---ccccccccccCcccccCCCccceEEecC-----CCCCCCchHH Confidence 0012222110 001111000 0011111111122233446677776643 4568899999 Q ss_pred hHHHHHHHHHHHHHHHHHHHhhcCCceEeeccc-cCccchhhhhccccccccccccccccccccccCCCcchHHHHHHHH Q lcl|NC_019423. 383 LGDNQAILGATMRGMIDLLGRSANGQRGYPKGM-LDTLNRRRYDDGQDYEYNPMQGNPSQSIMEHKFPELPQSAIVMTQM 461 (756) Q Consensus 383 ~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~ga-v~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l~~ 461 (756) ++++++.+|.+++.+.+.+...++|.+++ .|. .+......... .....+....++.+.++..+.-..+....+.. 
T Consensus 248 v~~liDa~d~~~S~~~~~~~~~~~p~lv~-~g~~~~~~~~~~~~~---~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~ 323 (474) T protein:vir:96 248 YKSFVDAIDKRLSDVQNMFDESVELIYIL-RGYEGEDLSEFMEGL---KYYKAINVSSDGGVETIQVEVPVASTKEYLDM 323 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcchhhh-cCCCcccccchhhhh---hccceeeccCCCceeEEeccCCHHHHHHHHHH Confidence 99999999999999999999998887654 332 11111111111 11122333445566776655556677788899 Q ss_pred HHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecCce Q lcl|NC_019423. 462 QNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNEQY 541 (756) Q Consensus 462 ~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~ 541 (756) +.+.+-..|++++.+.+..++.. ++.++..+............+.|..++++++++++.+.-.-+ ++ T Consensus 324 l~~~I~~~s~~p~~~~~~~~~n~--Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g~~~-----------d~ 390 (474) T protein:vir:96 324 MRAYIVEFGQGVDFQTDKFGSAT--SGIALKFLYTNLNLKANKLKNKANVALQELMQFILDFNKIKL-----------DA 390 (474) T ss_pred HHHHHHHHhCCcCcccccccccc--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCc-----------cc Confidence 99999999999877654332222 333344444555555666666677777776666655431100 01 Q ss_pred eecCHhHhcCcceEEEecccccHHHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcC-ChhHHHHhhhccCCCChhhh Q lcl|NC_019423. 542 VEIKREDLKGNFDIEVDINTAEIDNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKR-MPDLAHELRTWQPQPDPMEE 620 (756) Q Consensus 542 v~i~~d~~~~~~Dv~V~~g~a~~~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~-~~~~~~~l~~~~~q~~p~~~ 620 (756) . +.+++.+...+.-....++ ++... ..++.+. +++..+ ..+...-+ T Consensus 391 ~---------~i~i~f~~~~p~~~~e~a~----~~~~~-giiS~et-------~~~~lp~v~D~~~E~------------ 437 (474) T protein:vir:96 391 K---------EIEITFNFNVMVNDLEQSQ----IGAQS-QYLSKET-------LVRHHPWVDDPKAEL------------ 437 (474) T ss_pred c---------eeeEEecCCCccCHHHHHH----HHHHc-CCCChHH-------HHHhCCCCCCHHHHH------------ Confidence 0 1122222221111111111 11111 2222211 111111 11111111 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHH Q lcl|NC_019423. 
621 QLKQLAIQKAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLE--QESGTKHARD 674 (756) Q Consensus 621 ~~~q~~~~~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~--q~~~~k~~~~ 674 (756) ++++.++.+. +++++...+ ....... .+....+. + T Consensus 438 --eri~~E~~~~---------------~~~~~~~~~-~~~~~~~~~~~~~~~e~-~ 474 (474) T protein:vir:96 438 --ERLDEEQLEL---------------NKQLPNLDD-GGADGAQQQQQSENNQS-K 474 (474) T ss_pred --HHHHHHHHHH---------------Hhhcccccc-ccCCCCCCcCCCCcccc-C Confidence 1111110000 000000000 0000000 00000000 0 No 76 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=99.57 E-value=6.5e-14 Score=92.88 Aligned_cols=452 Identities=11% Similarity=0.093 Sum_probs=203.2 Q ss_pred CCcccCCCCCCCccccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCC--CCCC---------CC Q lcl|NC_019423. 1 MEHQDTFKPLPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAK--PPKI---------KG 69 (756) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~--~~~~---------~g 69 (756) |-+-=+ -|++.+-..+-+..+ +.........+....+.|...+.+..+..+||.|.-.-. +.+. +- T Consensus 1 ~~~~~~-~~~~~~~~~~~~~~~--~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~ 77 (474) T protein:vir:95 1 MINIIR-MPWDKPYGEEVVEQM--KPKVETQEEMIIRLINNHKQKLKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKP 77 (474) T ss_pred Cccccc-CCCCCCCCcchhhhc--cccccchHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccchhhhccccccccc Confidence 322211 133333222222111 122223333466666667777777888999999863111 0111 11 Q ss_pred CCcccCHHHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEE Q lcl|NC_019423. 70 RSQVQPRLVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIA 149 (756) Q Consensus 70 rS~~v~~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~ 149 (756) ..+++.+..+..|+....-| ||.+. .|.+ +|.+.. 
+.++..+ .++-......+++++++.|.+.+ T Consensus 78 ~~ki~~n~~k~Iv~~~~~yl----~g~p~--~~~~---~~~~~~----~~l~~~~--~n~~~~~~~~l~~~~~~~G~~~~ 142 (474) T protein:vir:95 78 DWRITTNFHQNLVDQKVSYV----AGKPV--TYAH---DDDKVL----DVIHQVL--DTRWDNKLIDILTAASNKGIDWL 142 (474) T ss_pred ccccccchHHHHHHhhhhhh----cccCc--eecc---CChHHH----HHHHHHH--hccHHHHHHHHHHHHhhCCeEEE Confidence 22467777677666665554 66553 3333 333333 3444443 24455556678899999999987 Q ss_pred EEeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEe Q lcl|NC_019423. 150 RIGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEV 229 (756) Q Consensus 150 k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~ 229 (756) .+|++. T Consensus 143 ~~~~d~-------------------------------------------------------------------------- 148 (474) T protein:vir:95 143 QVYINE-------------------------------------------------------------------------- 148 (474) T ss_pred EeeeCC-------------------------------------------------------------------------- Confidence 765531 Q ss_pred eeeecCceeEEEechhheEe--CCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhcccccc Q lcl|NC_019423. 230 EKALVNRPTVEMLNPNNVVI--DPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSD 307 (756) Q Consensus 230 ~~~~~g~~~ie~V~p~~~~~--Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 307 (756) .|.+++..++|.++|+ |+... .+.-+ +.|.|.... T Consensus 149 ----~~~~~i~~~~p~~~~~v~d~~~~---~~~~a-~ir~~~~~~----------------------------------- 185 (474) T protein:vir:95 149 ----DGELKLFRVPAEQAIPIWTDKER---EQLNA-FIRIFTFNG----------------------------------- 185 (474) T ss_pred ----CCceEEEEEcccceEEEEcCCCC---CceEE-EEEEEeecC----------------------------------- Confidence 1336677788888764 33322 22222 223332100 Q ss_pred ccccccccceEEEEEEEEEe-----eccCCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHHH Q lcl|NC_019423. 
308 FQFKDALRKKVVAYEYWGFY-----DINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAEL 382 (756) Q Consensus 308 ~~~~d~s~~~V~v~E~w~k~-----d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~ 382 (756) +..+|+|... ...+.+... ....++.....+..|...+.+|++.++. +..|.|.+.. T Consensus 186 ----------~~~~~vy~~~~i~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~vPvv~~~n-----n~~~~~d~e~ 247 (474) T protein:vir:95 186 ----------ETKVEYWTAETVTYYVYENGGLIP---DFYYGDEHIQTHFSTGSWERVPFIAFKN-----NPEEVSDIWM 247 (474) T ss_pred ----------eeEEEEEeCCeEEEEEEcCCceee---ccccccccccCcccccCCCccceEEecC-----CCCCCCchHH Confidence 0012222110 001111000 0011111111122233446677776643 4568899999 Q ss_pred hHHHHHHHHHHHHHHHHHHHhhcCCceEeeccc-cCccchhhhhccccccccccccccccccccccCCCcchHHHHHHHH Q lcl|NC_019423. 383 LGDNQAILGATMRGMIDLLGRSANGQRGYPKGM-LDTLNRRRYDDGQDYEYNPMQGNPSQSIMEHKFPELPQSAIVMTQM 461 (756) Q Consensus 383 ~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~ga-v~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l~~ 461 (756) ++++++.+|.+++.+.+.+...++|.+++ .|. .+......... .....+....++.+.++..+.-..+....+.. T Consensus 248 v~~liDa~d~~~S~~~~~~~~~~~p~lv~-~g~~~~~~~~~~~~~---~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~ 323 (474) T protein:vir:95 248 YKSFVDAIDKRLSDVQNMFDESVELIYIL-RGYEGEDLSEFMEGL---KYYKAINVSSDGGVETIQVEVPVASTKEYLDM 323 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcchhhh-cCCCcccccchhhhh---hccceeeccCCCceeEEeccCCHHHHHHHHHH Confidence 99999999999999999999998887654 332 11111111111 11122333445566776655556677788899 Q ss_pred HHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecCce Q lcl|NC_019423. 462 QNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNEQY 541 (756) Q Consensus 462 ~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~ 541 (756) +.+.+-..|++++.+.+..++.. 
++.++..+............+.|..++++++++++.+.-.-+ ++ T Consensus 324 l~~~I~~~s~~p~~~~~~~~~n~--Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g~~~-----------d~ 390 (474) T protein:vir:95 324 MRAYIVEFGQGVDFQTDKFGSAT--SGIALKFLYTNLNLKANKLKNKANVALQELMQFILDFNKIKL-----------DA 390 (474) T ss_pred HHHHHHHHhCCcCcccccccccc--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCc-----------cc Confidence 99999999999877654332222 333344444555555666666677777776666655431100 01 Q ss_pred eecCHhHhcCcceEEEecccccHHHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcC-ChhHHHHhhhccCCCChhhh Q lcl|NC_019423. 542 VEIKREDLKGNFDIEVDINTAEIDNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKR-MPDLAHELRTWQPQPDPMEE 620 (756) Q Consensus 542 v~i~~d~~~~~~Dv~V~~g~a~~~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~-~~~~~~~l~~~~~q~~p~~~ 620 (756) . +.+++.+...+.-....++ ++... ..++.+. +++..+ ..+...-+ T Consensus 391 ~---------~i~i~f~~~~p~~~~e~a~----~~~~~-giiS~et-------~~~~lp~v~D~~~E~------------ 437 (474) T protein:vir:95 391 K---------EIEITFNFNVMVNDLEQSQ----IGAQS-QYLSKET-------LVRHHPWVDDPKAEL------------ 437 (474) T ss_pred c---------eeeEEecCCCccCHHHHHH----HHHHc-CCCChHH-------HHHhCCCCCCHHHHH------------ Confidence 0 1122222221111111111 11111 2222211 111111 11111111 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHH Q lcl|NC_019423. 621 QLKQLAIQKAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLE--QESGTKHARD 674 (756) Q Consensus 621 ~~~q~~~~~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~--q~~~~k~~~~ 674 (756) ++++.++.+. +++++...+ ....... .+....+. 
+ T Consensus 438 --eri~~E~~~~---------------~~~~~~~~~-~~~~~~~~~~~~~~~e~-~ 474 (474) T protein:vir:95 438 --ERLDEEQLEL---------------NKQLPNLDD-GGADGAQQQQQSENNQS-K 474 (474) T ss_pred --HHHHHHHHHH---------------Hhhcccccc-ccCCCCCCcCCCCcccc-C Confidence 1111110000 000000000 0000000 00000000 0 No 77 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=99.56 E-value=2.8e-13 Score=89.43 Aligned_cols=438 Identities=9% Similarity=0.043 Sum_probs=198.3 Q ss_pred HHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccC---CCCC--------CC--CCC--CcccCHHHHHHHHHHHHHHH Q lcl|NC_019423. 26 PSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGK---AKPP--------KI--KGR--SQVQPRLVRRQAEWRYAPLS 90 (756) Q Consensus 26 ~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~---~~~~--------~~--~gr--S~~v~~~v~~~~e~~~~~L~ 90 (756) =.+..|+..++.....|...+.+..+-.+||.|.-. .... +. .++ .+++.+.....|+... T Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~I~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~---- 76 (470) T protein:vir:10 1 MELDALKKLIQNTSTSRNDLINNYKQAVNYYENKTDITTRNNGKAKLNKEGKKDPLRSADNRIPSNFYQLLVDQEA---- 76 (470) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhccccchhcccccccccccccCCcccccchHHHHHHhhh---- Confidence 122334444566666677777777888999997521 0000 01 111 2355555555554444 Q ss_pred HhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeeeeeeeeeecC Q lcl|NC_019423. 91 EPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIKTETPVFQLY 170 (756) Q Consensus 91 ~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~ 170 (756) .-|||.+.- |. .+|.+..+...++++. +-...+..+.++++++|.+...+||+. 
T Consensus 77 ~yl~G~p~~--~~---~~d~~~~~~l~~~~~~------~~~~~~~~l~~~~~~~G~a~~~~y~d~--------------- 130 (470) T protein:vir:10 77 GYVASVFPD--ID---VGKDADNKKIIDVLGD------DRALTLNGLLVDSSNAGRAWLHYWIDE--------------- 130 (470) T ss_pred hheecccee--ee---cCchHHHHHHHHHHhh------hHHHHHHHHHHHHhhcCeeEEEEEecC--------------- Confidence 444776533 32 3455555555555432 223445567789999999888776531 Q ss_pred CCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEEEechhheEe- Q lcl|NC_019423. 171 PIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVEMLNPNNVVI- 249 (756) Q Consensus 171 ~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie~V~p~~~~~- 249 (756) .|.+++..++|.++++ T Consensus 131 ---------------------------------------------------------------~~~~~~~~~~p~~~~~v 147 (470) T protein:vir:10 131 ---------------------------------------------------------------DGNFRYGIIQPDQITPI 147 (470) T ss_pred ---------------------------------------------------------------CCceEEEEEcccceEEE Confidence 1336677788888764 Q ss_pred -CCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceEEEEEEEEE-- Q lcl|NC_019423. 250 -DPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVAYEYWGF-- 326 (756) Q Consensus 250 -Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~E~w~k-- 326 (756) |++.. .+..++ .|.|.+.+. .....+..+|+|.. T Consensus 148 ~d~~~~---~~~~a~-ir~y~~~~~---------------------------------------~~~~~~~~~e~yt~~~ 184 (470) T protein:vir:10 148 YATTLD---NKLLGI-LRSYKQLDP---------------------------------------DSGKYFTVHEYWTDKE 184 (470) T ss_pred EcCCCC---CceEEE-EEEEEeeec---------------------------------------CCceEEEEEEEEcCCc Confidence 33221 112222 233322110 00011223333321 Q ss_pred -----eeccCCceeEEEEEE-----EECCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHHHHHH Q lcl|NC_019423. 
327 -----YDINDDGSLEPIVAT-----WIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGATMRG 396 (756) Q Consensus 327 -----~d~~~~g~~~~~~~~-----~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~~~~~ 396 (756) ....+....+..... ..+...-..+..+...|.+|++.++. +.+|.|.+..++++++.+|..+|. T Consensus 185 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----n~~g~sd~e~v~~liDa~d~~~S~ 259 (470) T protein:vir:10 185 AQFFRTNATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFSK-----NKYRLPELNKYKGLIDAYDDIYNG 259 (470) T ss_pred EEEEEeecCcceeccccccccccccccccccccccccccCCCeeeEEEeec-----CCCCCCchhHHHHHHHHHHHHHHH Confidence 000000000000000 00000111112222335566665553 446899999999999999999999 Q ss_pred HHHHHHhhcCCceEeeccccCccchhhhhcccccccccc--ccccccccccccCCCcchHHHHHHHHHHHHHHHHhchhH Q lcl|NC_019423. 397 MIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNPM--QGNPSQSIMEHKFPELPQSAIVMTQMQNQEAESLTGVKA 474 (756) Q Consensus 397 ~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~--~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~ 474 (756) +.+.+...++|.+++.-...++..+.............. .......+.++..+.-.......++.+.+.+-..+++++ T Consensus 260 ~~~~~~~~~~~~lvl~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~L~~~I~~~s~~p~ 339 (470) T protein:vir:10 260 FINDLDDVQTVILVLTNYGGADLHQFMNDLRKYKSIKINNTGNGDNSGVDKLQIDIPVEARDDALKITRKNIFLFGQGID 339 (470) T ss_pred HHHHHHHhcCcceeeecCCccccchhhhhhhhcCeEeccCCCCCcCceeEEEeecCChHHHHHHHHHHHHHHHHHhCCCC Confidence 999999999988776543332222221111111111110 111133456666555556677788999999999999888 Q ss_pred HhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHhcCcce Q lcl|NC_019423. 475 FSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDLKGNFD 554 (756) Q Consensus 475 ~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~~~~~D 554 (756) .+.+..|+ .++.++..+...........-+.|..+++.++++++.++.. 
.+.++ .+ T Consensus 340 ~~~~~~gn---~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~l~~----------~~~d~-----------~~ 395 (470) T protein:vir:10 340 PANFESSN---ASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNF----------SDADK-----------RH 395 (470) T ss_pred CCcccccc---chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc----------cCccc-----------ce Confidence 76543322 23333555555666666666677777777777666654421 11111 12 Q ss_pred EEEecccccHHHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcC-ChhHHHHhhhccCCCChhhhhHHHHHHHHHHHH Q lcl|NC_019423. 555 IEVDINTAEIDNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKR-MPDLAHELRTWQPQPDPMEEQLKQLAIQKAQLE 633 (756) Q Consensus 555 v~V~~g~a~~~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~-~~~~~~~l~~~~~q~~p~~~~~~q~~~~~aq~e 633 (756) |.|.-......... ..++++...+..+.-+. +++..+ ..+...-++ +.+ .| T Consensus 396 i~i~f~~~~p~d~~--e~~~~~~~~~g~iS~et-------~l~~~p~v~D~~~E~e-----------------ri~--~E 447 (470) T protein:vir:10 396 ISQHWTRTKVEDSL--TKAQIVSTVANYSSKEA-------VAKANPIVDDWQQELK-----------------DLA--KD 447 (470) T ss_pred eeEEeccCCCCCHH--HHHHHHHHHhccCcHHH-------HHHhCCCCCCHHHHHH-----------------HHH--HH Confidence 33332222221111 11222222222222211 111111 111111111 111 01 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 634 NEELQSKIALNNAKAKEAASSGDLKDLDYLE 664 (756) Q Consensus 634 ~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~ 664 (756) ..+...... ++. ..+....+..+ T Consensus 448 ~~e~~~~~~--~~~------~~~~~~~dde~ 470 (470) T protein:vir:10 448 KEENDPYSN--QAD------ELNGKGVNDEQ 470 (470) T ss_pred HHHHHHhhc--ccc------ccCCCCCCCCC Confidence 000000000 000 00000000000 No 78 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=99.56 E-value=1.1e-12 Score=86.09 Aligned_cols=426 Identities=15% Similarity=0.068 Sum_probs=177.1 Q ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCC-CCCC----CCCcccCHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_019423. 
22 WKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAKP-PKIK----GRSQVQPRLVRRQAEWRYAPLSEPFLSS 96 (756) Q Consensus 22 ~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~-~~~~----grS~~v~~~v~~~~e~~~~~L~~~f~~~ 96 (756) +++++. ..|+..++ .+.....+.++-.+||.|....+. ++.. ..=++|.+-.+..|+.....|. T Consensus 1 ~~~~~~-~~i~~l~~----~~~~~~~r~~~l~~Yy~G~~~i~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~------ 69 (441) T protein:vir:80 1 MNSDEL-ALIEGMYD----RIQRLSSWHCCIEGYYEGSNRVRDLGVAIPPELQRVQTVVSWPGIAVDALEERLD------ 69 (441) T ss_pred CCccHH-HHHHHHHH----HHHHHHHHHHHHHHHHhcCCcchhcCcccchhhhhhhhhcchHHHHHHHHHhhhc------ Confidence 444332 22222222 244444555667799998633210 0000 1223455555555553333220 Q ss_pred CCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeeeeeeeeeecCCCCCHH Q lcl|NC_019423. 97 SKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIKTETPVFQLYPIENQE 176 (756) Q Consensus 97 ~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~ 176 (756) +.+.+-+|.+. +.-++ ..|+-......+++++++.|.+.+.+| T Consensus 70 -----~~g~~~~d~~~-------l~~i~-~~n~~~~~~~~~~~~~~~~G~a~~~v~------------------------ 112 (441) T protein:vir:80 70 -----WLGWTNGDGYG-------LDGVY-AANRLATASCDVHLDALIFGLSFVAII------------------------ 112 (441) T ss_pred -----cccccCCChHH-------HHHHH-HhcCHHHHHHHHHHHHhhcCeeEEEEE------------------------ Confidence 11111222111 22221 233434445556666666666654431 Q ss_pred HHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEEEechhheE--eCCCCc Q lcl|NC_019423. 177 QADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVEMLNPNNVV--IDPSCN 254 (756) Q Consensus 177 ~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie~V~p~~~~--~Dp~a~ 254 (756) ....|.|++..++|.+++ ||+... 
T Consensus 113 ------------------------------------------------------~d~~g~~~i~~~~p~~~~~i~d~~~~ 138 (441) T protein:vir:80 113 ------------------------------------------------------PHGDGTVSVRPQSPKNCTGKFSADGS 138 (441) T ss_pred ------------------------------------------------------eCCCCceEEEEEccceEEEEEeCCCC Confidence 111255778889999865 566432 Q ss_pred CccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceEEEEEEEEEeeccCCce Q lcl|NC_019423. 255 GDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVAYEYWGFYDINDDGS 334 (756) Q Consensus 255 ~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~E~w~k~d~~~~g~ 334 (756) . .. + ++ ++..... + .+...+.|.. +.+ T Consensus 139 ~-~~-~-~~-~~~~~~~-------------~-------------------------------~~~~~~vy~~-----~~~ 165 (441) T protein:vir:80 139 R-LD-A-GL-VVQQTCD-------------P-------------------------------EVVEAELLLP-----DVI 165 (441) T ss_pred c-ee-E-EE-EEEEEec-------------C-------------------------------ceEEEEEEec-----CeE Confidence 1 11 1 11 1111000 0 0011122211 100 Q ss_pred eEEEEEEEEC-CEEEEecccccCCCccceEEeeeeeecCcccCCchH-HHhHHHHHHHHHHHHHHHHHHHhhcCCceEee Q lcl|NC_019423. 335 LEPIVATWIG-STLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADA-ELLGDNQAILGATMRGMIDLLGRSANGQRGYP 412 (756) Q Consensus 335 ~~~~~~~~~g-~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v-~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~ 412 (756) +.....| +.....+..|.+.|++|++++...+..+.+||.|-+ +.++++++.+|..++.+.+.+...+.+...+. T Consensus 166 ---~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~l~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~ 242 (441) T protein:vir:80 166 ---VQVERRGSREWVEVDRIPNVLGAVPLVPIVNRRRTSRIDGRSEITRSIRAYTDEAVRTLLGQSVNRDFYAYPQRWVT 242 (441) T ss_pred ---EEEEEcCCcceeeccccccCCCceeEEEeeccccCCccCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcCceeeee Confidence 1111111 122233444555688999999988888999999965 56999999999999999999998888876553 Q ss_pred cccc-CccchhhhhccccccccccccccccccccccCCCcchHHHHHHHHHHHHHH---HHhchhHHhcCCCccccchhH Q lcl|NC_019423. 
413 KGML-DTLNRRRYDDGQDYEYNPMQGNPSQSIMEHKFPELPQSAIVMTQMQNQEAE---SLTGVKAFSGGVTGSAYGDVA 488 (756) Q Consensus 413 ~gav-~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~e---~~tGv~~~~~G~~~~a~~~tA 488 (756) |+- +......................+..+.+.+.+.- .....+..+...+. ..+++++...|..++.. .+| T Consensus 243 -G~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~--~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~~~-~Sg 318 (441) T protein:vir:80 243 -GVSADEFSQPGWVLSMASVWAVDKDDDGDTPNVGSFPVN--SPTPYSDQMRLLAQLTAGEAAVPERYFGFITSNP-PSG 318 (441) T ss_pred -cCCccccccchhhhcccccccCCCCCCCCcceeEecCcc--chHHHHHHHHHHHHHHhcccCCCHHHhccCCCcc-hHH Confidence 421 11111011111001000000011122333333322 22334444544444 45778777777655421 234 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHhcCcceEEEecccccH--HH Q lcl|NC_019423. 489 AGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDLKGNFDIEVDINTAEI--DN 566 (756) Q Consensus 489 ~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~~~~~Dv~V~~g~a~~--~~ 566 (756) .++...............+.|..+++.++++++.+.-..-... +. -.++.|.=..... .. T Consensus 319 ~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~---------------~~---~~~i~~~f~~~~~~~~~ 380 (441) T protein:vir:80 319 EALAAEESRLVKRAERRQTSFGQGWLSVGFLAAKALDSRVDEA---------------DF---FGDVGLRWRDASTPTRA 380 (441) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccc---------------cc---ceeeeEEeCCCCCcCHH Confidence 4455444444555555566677777777766555432211100 00 1123333232221 11 Q ss_pred HHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCChhHHHHhhhccCCCChhhhhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 567 QKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMPDLAHELRTWQPQPDPMEEQLKQLAIQKAQLENEELQSKIALNNA 646 (756) Q Consensus 567 ~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~~~~~~l~~~~~q~~p~~~~~~q~~~~~aq~e~~~~qa~a~~~~a 646 (756) +.++.+..+.+. +...... . .+++..|+.+ ++ .++++..+.+.+ ..+.+....... 
T Consensus 381 e~ad~~~kl~~~-g~~~~s~---~---~~~~~l~~~~------------~e----~~~~~~e~~e~~-~~~~~~~~~~~~ 436 (441) T protein:vir:80 381 ATADAVTKLVGA-GILPADS---R---TVLEMLGLDD------------VQ----VEAVMRHRAESS-DPLAVLAGAISR 436 (441) T ss_pred HHHHHHHHHHhc-CcccccH---H---HHHHhCCCCH------------HH----HHHHHHHHHHHH-HHHHHHhhhhhc Confidence 222222222222 1111010 1 1222333311 01 011111111100 000000000001 Q ss_pred HHHHH Q lcl|NC_019423. 647 KAKEA 651 (756) Q Consensus 647 ~a~~~ 651 (756) +..+. T Consensus 437 ~~~~~ 441 (441) T protein:vir:80 437 QTNEV 441 (441) T ss_pred ccccC Confidence 11111 No 79 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=99.56 E-value=1.2e-12 Score=85.88 Aligned_cols=444 Identities=11% Similarity=0.020 Sum_probs=197.2 Q ss_pred CCcccCCCCCCCccccccccCCCchHHHHHHHHHHHHHHHHhhHHH-HHHHHHHHHhccccCC-CCCCCCCC--CcccCH Q lcl|NC_019423. 1 MEHQDTFKPLPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIM-SQIREWNDLMEVKGKA-KPPKIKGR--SQVQPR 76 (756) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~-~~~~~~~~~y~~~~~~-~~~~~~gr--S~~v~~ 76 (756) .+.++.|.=|.+ ++|+.+.+ ......|.... .+.++-.+||.|.-.- ..+..+++ -+++.+ T Consensus 11 ~~~~~~~~~~~~-------~~~~~~~i--------~~~i~~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~ki~~n 75 (470) T protein:vir:99 11 VTGNSSFIFPKG-------EKLTSNEL--------LGFIAYNETVLKPRYRENMKLYLGKHKILTAPEKETGADNRIVVN 75 (470) T ss_pred ccCCceEEeCCC-------CCcCHHHH--------HHHHHHHHHhhHHHHHHHHHHhccccccccCcccccCCcceeecc Confidence 333333322111 13333332 23333343333 4567788999975221 11122233 346666 Q ss_pred HHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeee Q lcl|NC_019423. 77 LVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERK 156 (756) Q Consensus 77 ~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~ 156 (756) .....|+....-| +|.+--|.. .+|....+. 
+.-+ ...|+-......+++++++.|.+.+.+|++. T Consensus 76 ~~~~Ivd~~~~~l----~g~p~~~~~----~~d~~~~~~----l~~~-~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~- 141 (470) T protein:vir:99 76 SAKYVVDVYNGYF----CGIEPKLAL----LNDSSKIDE----IARW-NRQENFFDTINEISKQCDIFGRSIASIYQGE- 141 (470) T ss_pred hHHHHHHHHhhhh----ccCCeeEee----CCchhHHHH----HHHH-HHhcCHhHHHHHHHHHHHhcCeeEEEEEeCC- Confidence 6666666554444 666533332 234333332 3222 2344445566789999999998877765421 Q ss_pred eeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCc Q lcl|NC_019423. 157 TVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNR 236 (756) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~ 236 (756) .|. T Consensus 142 -----------------------------------------------------------------------------dg~ 144 (470) T protein:vir:99 142 -----------------------------------------------------------------------------DAR 144 (470) T ss_pred -----------------------------------------------------------------------------CCe Confidence 133 Q ss_pred eeEEEechhheEe--CCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccc Q lcl|NC_019423. 237 PTVEMLNPNNVVI--DPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDAL 314 (756) Q Consensus 237 ~~ie~V~p~~~~~--Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s 314 (756) |++..++|+++++ |+..... ..+ +.+.+.... . T Consensus 145 ~~i~~~~p~~~~~i~d~~~~~~---~~~-~vr~~~~~~--------------------------~--------------- 179 (470) T protein:vir:99 145 PHLMYSSPNHAFIIYDDTVQRQ---PLA-FVHYQIDNS--------------------------N--------------- 179 (470) T ss_pred EEEEEEccceeEEEEcCCCCcc---eEE-EEEEEEEec--------------------------C--------------- Confidence 6677788888654 4332211 111 122222100 0 Q ss_pred cceEEEEEEEEEeeccCCceeEEEEEEEE--CCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHH Q lcl|NC_019423. 
315 RKKVVAYEYWGFYDINDDGSLEPIVATWI--GSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGA 392 (756) Q Consensus 315 ~~~V~v~E~w~k~d~~~~g~~~~~~~~~~--g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~ 392 (756) ......+++|.. +.+ +.+... +......+..|.+.+.+|++.+. ++.+|.|.+..++++++.+|. T Consensus 180 ~~~~~~~~~~~~-----~~~---~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~sd~e~v~~liDa~~~ 246 (470) T protein:vir:99 180 NWTDAYGVIQYA-----DKF---YKFKGYDIEEDTNAAGYAINPYGLVPAVEFF-----ENEERQGIFDSIKTLINALDK 246 (470) T ss_pred CeeEEEEEEEec-----CeE---EEEEecccccccccccccccCCCccceEeec-----CCCCCCcchHhHHHHHHHHHH Confidence 001112222211 000 001100 11111112223334677877654 355789999999999999999 Q ss_pred HHHHHHHHHHhhcCCceEeeccccCccch--hhhhccccccc--cccccccccccccccCCCcchHHHHHHHHHHHHHHH Q lcl|NC_019423. 393 TMRGMIDLLGRSANGQRGYPKGMLDTLNR--RRYDDGQDYEY--NPMQGNPSQSIMEHKFPELPQSAIVMTQMQNQEAES 468 (756) Q Consensus 393 ~~~~~~d~l~~~~~~~~~~~~gav~~~~~--~~~~~~~~~~~--~~~~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~e~ 468 (756) .++.+.+.+...+++.+.+.-...+..+. ........... .....+.++.+.++..+.....+...++.+.+.+-. T Consensus 247 ~~s~~~~~~~~~~~~~~~i~g~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~ 326 (470) T protein:vir:99 247 VISQKANQVEYFDNAYMYMIGFKLPEDDEGNPKFDFKNNRVLYVSQLDPDTNPQIGFIAKPDADQMQENLIQHLTDFIFM 326 (470) T ss_pred HHHHHHHHHHHhcCceeeeecCCcccccccchhhhhhhcceeeecCCCCCCCCcceEEeecCChHHHHHHHHHHHHHHHH Confidence 99999999999999887765433322111 01111111111 111112233455555444445566678889999999 Q ss_pred HhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhH Q lcl|NC_019423. 469 LTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKRED 548 (756) Q Consensus 469 ~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~ 548 (756) .||+++.+.+..++.. ++.++..............-+.|..++++++++++.++....... 
.+ T Consensus 327 ~s~~p~~~~~~~~~n~--Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~---------------~~ 389 (470) T protein:vir:99 327 MAMVPNIQDKNFAGNS--SGVALQYKLFAMKNKADSKERKFDKSLMQLYRIVLATLFNNKQDQ---------------EL 389 (470) T ss_pred HhCCccccccccccCc--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcc---------------cc Confidence 9999987655432222 333344444455555566666777777777777776654322110 00 Q ss_pred hcCcceEEEecccccH--HHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCChhHHHHhhhccCCCChhhhhHHHHH Q lcl|NC_019423. 549 LKGNFDIEVDINTAEI--DNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMPDLAHELRTWQPQPDPMEEQLKQLA 626 (756) Q Consensus 549 ~~~~~Dv~V~~g~a~~--~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~~~~~~l~~~~~q~~p~~~~~~q~~ 626 (756) -.+|.|.-.++.. ..+.++. +..+...++.+.. ++..+.-+...-++++ . T Consensus 390 ---~~~i~v~f~~~~p~~~~e~a~~----~~kl~giis~et~-------l~~l~~vd~~~E~eri--------------~ 441 (470) T protein:vir:99 390 ---WSELDFKFTRNLPEDMASAIDN----AKNAEGIVSKKTQ-------LGMIPDIEPDAEMKQI--------------A 441 (470) T ss_pred ---cccceEEeCCCCCcCHHHHHHH----HHHHhccCCHHHH-------HHhCCCCCHHHHHHHH--------------H Confidence 0123333332222 1122222 2222222332221 1111111111111111 1 Q ss_pred HHHHHHHHHHHHHHHH--HHHHHHHHHHHHHH Q lcl|NC_019423. 627 IQKAQLENEELQSKIA--LNNAKAKEAASSGD 656 (756) Q Consensus 627 ~~~aq~e~~~~qa~a~--~~~a~a~~~~aq~~ 656 (756) .++.. ......+.. ...+.... ..+-+ T Consensus 442 ~E~~~--~~~~~~~~~~~~d~~~~d~-~~ee~ 470 (470) T protein:vir:99 442 KEKAD--AIKQTQQLSMPIDILKRDN-NAEEE 470 (470) T ss_pred HHHHH--HHHHHHhhcCCCCcCCCCC-CccCC Confidence 10000 000000000 00000000 00000 No 80 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=99.56 E-value=1.3e-13 Score=91.25 Aligned_cols=469 Identities=10% Similarity=0.031 Sum_probs=205.8 Q ss_pred CCc--------------ccCCCCCCCcccccccc-CCCchHHHHH-HHHHHHHHHHHhhHHH-HHHHHHHHHhccccCC- Q lcl|NC_019423. 
1 MEH--------------QDTFKPLPDPAQSEKLT-DWKKEPSIQL-LKGDLESAKPAHDAIM-SQIREWNDLMEVKGKA- 62 (756) Q Consensus 1 ~~~--------------~~~~~~~~~~~~~~~~~-~~~~~~~~~~-l~~~~~~a~~~~~~~~-~~~~~~~~~y~~~~~~- 62 (756) |-| +..|.+ +.... .|.+.+.... -...+..+.+.|.... .+.++..+||.|.-.- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~------~~n~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~ 74 (511) T protein:vir:10 1 MLKVNEFETDTDLRGNINYLFND------EANVVYTYDGTESDLLQNVNEVSKCIEHHMDYQRPRLKVLSDYYEGKTKNL 74 (511) T ss_pred Cccccchhhhhhhhhhhhhhhhh------hhcCCccCchhhhhcccCHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccc Confidence 111 111111 00110 2332222111 1123555555565544 4567889999976221 Q ss_pred ---CCCCCC--CCCcccCHHHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHH Q lcl|NC_019423. 63 ---KPPKIK--GRSQVQPRLVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDY 137 (756) Q Consensus 63 ---~~~~~~--grS~~v~~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~ 137 (756) .....+ ...+++.+..+-.|+....-| +|.+.-+ .. +|.+.- ++++-++. .|+--.....+ T Consensus 75 ~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl----~g~p~~~--~~---~d~~~~----~~l~~~~~-~n~~~~~~~~~ 140 (511) T protein:vir:10 75 VELTRRKEEYMADNRVAHDYASYISDFINGYF----LGNPIQY--QD---DDKDVL----EAIEAFND-LNDVESHNRSL 140 (511) T ss_pred cccCcccccccCcceeecchHHHHHHHHhhhh----cccCcee--ec---CchHHH----HHHHHHHh-hcCHHHHHHHH Confidence 111112 334677777777777665544 5544333 22 343332 34444432 34434445578 Q ss_pred HHHHhhcCceEEEEeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcc Q lcl|NC_019423. 138 VHSIVDDGTGIARIGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEAT 217 (756) Q Consensus 138 v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~ 217 (756) .+++++.|.+.+.+|++. 
T Consensus 141 ~~~~~i~G~ay~~vy~de-------------------------------------------------------------- 158 (511) T protein:vir:10 141 GLDLSIYGKAYEIMIRNQ-------------------------------------------------------------- 158 (511) T ss_pred HHHHHhcCeeEEEEEeCC-------------------------------------------------------------- Confidence 899999888877664420 Q ss_pred eeccCceeEEEeeeeecCceeEEEechhheEe--CCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhh Q lcl|NC_019423. 218 YAIQTGVTEVEVEKALVNRPTVEMLNPNNVVI--DPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPI 295 (756) Q Consensus 218 ~~~~~g~~~~~~~~~~~g~~~ie~V~p~~~~~--Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~ 295 (756) .|.+++..++|.++++ |..... .. ..+.|.+.+.. .+ T Consensus 159 ----------------dg~~~i~~~~p~~~~~vydd~~~~---~~-~~~vr~~~~~~-----------~d---------- 197 (511) T protein:vir:10 159 ----------------DDETRLYKSDAMSTFVIYDNTIER---NS-IAGVRYLRTKP-----------ID---------- 197 (511) T ss_pred ----------------CCceEEEEEccceeEEEEcCCCCC---ce-EEEEEEEEeee-----------cc---------- Confidence 1346777888888764 433321 11 22233332210 00 Q ss_pred hchhhhccccccccccccccceEEEEEEEEEeeccCCceeEEEEEEEECCEEE-----EecccccCCCccceEEeeeeee Q lcl|NC_019423. 296 TDPDHESKTPSDFQFKDALRKKVVAYEYWGFYDINDDGSLEPIVATWIGSTLI-----RMENNPFPDGKLPLVVVPYMPR 370 (756) Q Consensus 296 ~~~~~~~~~~~~~~~~d~s~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g~~~L-----~~~~~P~~~~~~Pfv~~~~~~~ 370 (756) +.....+..+|+|.. +++ +++...++..+ .....|.+.+.+|++.++ T Consensus 198 ----------------~~~~~~~~~~~iyt~-----~~i---~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~f~---- 249 (511) T protein:vir:10 198 ----------------KTDEDEVFTVDLFTS-----HGV---YRYLTSRTNGLKLTPRENGFESHSFERMPITEFS---- 249 (511) T ss_pred ----------------cCccceEEEEEEEeC-----CcE---EEEEecCCCcccccccccccccccCcceeEEEec---- Confidence 000123444555542 111 22222221111 112223344566666553 Q ss_pred cCcccCCchHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhhh-hccc--------ccccccccccccc Q lcl|NC_019423. 
371 KRELFGEADAELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRY-DDGQ--------DYEYNPMQGNPSQ 441 (756) Q Consensus 371 ~~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~-~~~~--------~~~~~~~~~~~~~ 441 (756) ++.+|.|.+..++++++.+|..++.+.+.+...+++.+++.-........... .... ....+......+. T Consensus 250 -nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 328 (511) T protein:vir:10 250 -NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSV 328 (511) T ss_pred -CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeeccccCCchhhccchhccceecccccccccccccCCCCc Confidence 34578999999999999999999999999988888766543212111111110 0000 0011111122233 Q ss_pred ccccccCCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 442 SIMEHKFPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKIC 521 (756) Q Consensus 442 ~i~~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l 521 (756) .+.++..+.-.......+..+.+.+...|++++.+.+.-+... ++.++..............-+.|..++++++++++ T Consensus 329 d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~--Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~ 406 (511) T protein:vir:10 329 DGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQ--SGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLE 406 (511) T ss_pred ceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccc--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 4555554444566677888899999999999987765332222 34445555556666666677778888888888877 Q ss_pred HHHHhhCCCCcEEEEecCceeecCHhHhcCcceEEEecccccHH--HHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhh- Q lcl|NC_019423. 522 AMNAVFLSEKEVVRITNEQYVEIKREDLKGNFDIEVDINTAEID--NQKSQDLGFMVQTLGNTVDQSITLSLVAKIAEL- 598 (756) Q Consensus 522 ~li~q~~~~~r~iRI~g~~~v~i~~d~~~~~~Dv~V~~g~a~~~--~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~- 598 (756) .++...-.... +.++ .+|.|.-...... ...++.+..+ ...++.+.+ ++. 
T Consensus 407 ~~~~~~~~~~~-------------~~d~---~~i~i~f~~~~p~d~~~~~~~~~kl----~G~iS~et~-------~~~l 459 (511) T protein:vir:10 407 TILKNTRSIDA-------------NKDF---NTVRYVYNRNLPKSLIEELKAYIDS----GGKISQTTL-------MSLF 459 (511) T ss_pred HHHHhhCCccc-------------cccc---ceeeEEeCCCCCcCHHHHHHHHHHH----hccCcHHHH-------HHhC Confidence 77643211100 0111 1334433332221 1122222211 122332211 111 Q ss_pred cCChhHHHHhhhccCCCChhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 599 KRMPDLAHELRTWQPQPDPMEEQLKQLAIQKAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARDMEKQ 678 (756) Q Consensus 599 ~~~~~~~~~l~~~~~q~~p~~~~~~q~~~~~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~~~~ 678 (756) +...+...-++ ++..++.+ .++.. .... ..+.......+.....+.. .-+.. T Consensus 460 ~~v~d~~~E~~--------------ri~~E~~~----~~~~~--~~~~-------~~~~~~~~~~~~~~~~~~~-~~~~~ 511 (511) T protein:vir:10 460 SFFQDPELEVK--------------KIEEDEKE----SIKKA--QKGI-------YKDPRDINDDEQDDDTKDT-VDKKE 511 (511) T ss_pred CCCCCHHHHHH--------------HHHHHHHH----HHHHH--hhhc-------ccCCCCCCCCCCCCcccCc-ccccC Confidence 11222111111 11110000 00000 0000 0000000000000000000 00000 No 81 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=99.54 E-value=4.6e-13 Score=88.20 Aligned_cols=446 Identities=11% Similarity=0.031 Sum_probs=198.1 Q ss_pred CCcccCCCCCCCccccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCC--C---------CCCCC Q lcl|NC_019423. 1 MEHQDTFKPLPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAK--P---------PKIKG 69 (756) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~--~---------~~~~g 69 (756) +-+.|.+ +.+-+...+. .+ ...+.+....| ..++.++..+||.|.-.-. + ..... 
T Consensus 6 ~~~~~~~----~~~~~~~~~~----~~----~~~i~~~~~~~--~~~~~~~~~~yy~g~~~i~~~~~~~~~~~~~~~~~~ 71 (479) T protein:vir:79 6 ISETDLI----KVQLKKESTI----NL----VKVIEHYILKH--RPEKYKQGEEYYYGNTDVNNKRRYYLLDGAKVDDFT 71 (479) T ss_pred ecccceE----eeccccCChh----HH----HHHHHHHHhhh--hHHHHHHHHHHhccCCcccccccccccccccccccc Confidence 3333333 1112222221 12 22333333333 3456778899998753210 0 01112 Q ss_pred CC--cccCHHHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCce Q lcl|NC_019423. 70 RS--QVQPRLVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTG 147 (756) Q Consensus 70 rS--~~v~~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~g 147 (756) |+ +++.+-.+..|+....-| ||.+.- |.+ +|.. ..++++..+ .|+-......++++++..|.+ T Consensus 72 ~~~~ki~~~~~~~Ivd~~~~~l----~g~p~~--~~~---~~~~----~~~~~~~~~--~n~~~~~~~~~~~~~~~~G~~ 136 (479) T protein:vir:79 72 KVNNKAINNYHKLLVDQKVGYS----VGNPIV--FNA---DDDN----LTKLLNDLL--GEEFDDTITELYLNASNKGVE 136 (479) T ss_pred cCcceeecchHHHHHHHHHhhh----hcCCce--ecc---CCHH----HHHHHHHHH--hcCHHHHHHHHHHHHHhcCeE Confidence 22 466666666666554444 665433 332 2322 223455443 244444456788999999999 Q ss_pred EEEEeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEE Q lcl|NC_019423. 148 IARIGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEV 227 (756) Q Consensus 148 i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~ 227 (756) .+.+||+. T Consensus 137 ~~~v~~d~------------------------------------------------------------------------ 144 (479) T protein:vir:79 137 WLHPYINR------------------------------------------------------------------------ 144 (479) T ss_pred EEEEEeCC------------------------------------------------------------------------ Confidence 88776531 Q ss_pred EeeeeecCceeEEEechhheEe--CCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhcccc Q lcl|NC_019423. 
228 EVEKALVNRPTVEMLNPNNVVI--DPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTP 305 (756) Q Consensus 228 ~~~~~~~g~~~ie~V~p~~~~~--Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~ 305 (756) .|++++..++|.++++ |+.... ...+. .+.|..... T Consensus 145 ------~~~~~i~~~~p~~~~~v~d~~~~~---~~~~~-ir~y~~~~~-------------------------------- 182 (479) T protein:vir:79 145 ------KGEFKYVIIPAEEAIPIWDSKRQR---ELVAF-IRFYYIEDI-------------------------------- 182 (479) T ss_pred ------CCceEEEEEccceeEEEEeCCCCC---ceEEE-EEEEEEeec-------------------------------- Confidence 1346778888888754 433221 12222 233322100 Q ss_pred ccccccccccceEEEEEEEEEe-----eccCCceeEE------EEEEEECCEEEEecccccCCCccceEEeeeeeecCcc Q lcl|NC_019423. 306 SDFQFKDALRKKVVAYEYWGFY-----DINDDGSLEP------IVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKREL 374 (756) Q Consensus 306 ~~~~~~d~s~~~V~v~E~w~k~-----d~~~~g~~~~------~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~ 374 (756) ....+..+|+|..- ...+.+.... ................|.+.+.+||+.+. ++. T Consensus 183 --------~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~-----nn~ 249 (479) T protein:vir:79 183 --------DGNKIKRVEYYTENDITYFIERGNSFIQEFLYDEYGKMTDIQEGHFRINNKEQGWGKVPFIPFK-----NNE 249 (479) T ss_pred --------CCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccccccccccccCCCcccEEEec-----CCC Confidence 00112222333210 0001111000 00000111111222333344667777654 355 Q ss_pred cCCchHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhhhhccccccccccccccccccccccCCCcchH Q lcl|NC_019423. 375 FGEADAELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQGNPSQSIMEHKFPELPQS 454 (756) Q Consensus 375 ~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~ 454 (756) +|.|.+..++++++.+|..++.+.+.+...++|.+++.-...+...+.. ........+....++.+.++..+.-... 
T Consensus 250 ~g~sd~~~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~~~---~~~~~~~~i~~~~~~~~~~l~~~~~~~~ 326 (479) T protein:vir:79 250 KCVSDLTFYKSLIDIYDNNISTLADNLDEIQEVIYVLKEYPGTSLQEFI---DNIRYYKSIKVDGGGGVDKLEINIPVEA 326 (479) T ss_pred CCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccccch---hhhhhccceecCCCCcceEEeccCCHHH Confidence 7899999999999999999999999999988887665421111111111 1111122233344555666665544556 Q ss_pred HHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEE Q lcl|NC_019423. 455 AIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVV 534 (756) Q Consensus 455 ~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~i 534 (756) ....++.+.+.+-..|++++.+.+..|+ .++.++..............-+.|..+++.++++++.++.... T Consensus 327 ~~~~~~~l~~~i~~~s~~p~~~~~~~gn---~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~------ 397 (479) T protein:vir:79 327 KKELLDRLEKNIIIFGQGVNPESQNTGD---KSGVALKFLYSLLDLKCSKTEKKFKKAIRELLWFVCEYLKISG------ 397 (479) T ss_pred HHHHHHHHHHHHHHHhCccccccccccc---hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC------ Confidence 6677888888999999998887664433 2333354444445555555556677777777776666653211 Q ss_pred EEecCceeecCHhHhcCcceEEEecccccHHHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhc-CChhHHHHhhhccC Q lcl|NC_019423. 535 RITNEQYVEIKREDLKGNFDIEVDINTAEIDNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELK-RMPDLAHELRTWQP 613 (756) Q Consensus 535 RI~g~~~v~i~~d~~~~~~Dv~V~~g~a~~~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~-~~~~~~~~l~~~~~ 613 (756) +. .++ ..++.|.-......... ..+..+..+...++.+. +++.. ...++..-++ T Consensus 398 ---~~---~~~------~~~i~i~f~~~~p~~~~--~~a~~~~kl~g~iS~et-------~l~~l~~v~d~~~E~~---- 452 (479) T protein:vir:79 398 ---NK---SYD------YKTVQITFNHSMIINEA--EKIDMAAKSTGIVSDET-------IVSNHPWVEDVNDELE---- 452 (479) T ss_pred ---CC---ccc------cccceEEeCCCCCcCHH--HHHHHHHHHhccCcHHH-------HHHhCCCCCCHHHHHH---- Confidence 00 011 12333333322221111 11122222222232211 11211 1222111111 Q ss_pred CCChhhhhHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHH Q lcl|NC_019423. 
614 QPDPMEEQLKQLAIQKAQLENEELQSKI-ALNNAKAKEA 651 (756) Q Consensus 614 q~~p~~~~~~q~~~~~aq~e~~~~qa~a-~~~~a~a~~~ 651 (756) ++..++.+. .+..... .......+.+ T Consensus 453 ----------ri~~E~~~~--~~~~~~~~~~~~~~~~e~ 479 (479) T protein:vir:79 453 ----------RLKKQEDTQ--KEYDDLIPNNQDGVIDET 479 (479) T ss_pred ----------HHHHHHHHH--HHHHhccCcccCCCcCcC Confidence 111100000 0000000 0000000000 No 82 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=99.54 E-value=2.2e-13 Score=89.97 Aligned_cols=457 Identities=9% Similarity=0.047 Sum_probs=200.1 Q ss_pred CCcccCCCCCCCccccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCC--CCCC-------CCCC- Q lcl|NC_019423. 1 MEHQDTFKPLPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKA--KPPK-------IKGR- 70 (756) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~--~~~~-------~~gr- 70 (756) |++-..+. ++....+-...+++ ..+...+......|.+.+.+.++..+||.|.-.- .+.+ .+.| T Consensus 21 ~~~~~~~~----~~~~~~~~~~~~~~--~~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~I~~~~~~~~~~~~~~~~~~ 94 (492) T protein:vir:94 21 LYPSQPTQ----TEIFDAIVRTNNKP--ETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKP 94 (492) T ss_pred eecCccch----hhhhhcccccCCch--hhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccc Confidence 22211110 01111111111111 1122234444455666677788899999986211 0111 1122 Q ss_pred -CcccCHHHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEE Q lcl|NC_019423. 71 -SQVQPRLVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIA 149 (756) Q Consensus 71 -S~~v~~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~ 149 (756) .+++.+..+..|+....-| +|.+.-+ . 
.+|.+..+ +++..+ .|+-......+.++++++|.+.+ T Consensus 95 ~~ri~~n~~k~Ivd~~~~yl----~G~p~~~--~---~~d~~~~~----~l~~~~--~n~~~~~~~~~~~~a~~~G~a~~ 159 (492) T protein:vir:94 95 DDRMITNFHANLVDQKVSYI----VGKPIAF--K---HTDDEVVK----RIDEVL--GNRFDDKLHSVLTGASNKGIEWL 159 (492) T ss_pred ccccccchHHHHHHHHHhhh----cccCcee--c---cCchHHHH----HHHHHH--hccHHHHHHHHHHHHhhCCeEEE Confidence 3567777777777666544 6655333 2 24444433 343333 23434445568899999998877 Q ss_pred EEeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEe Q lcl|NC_019423. 150 RIGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEV 229 (756) Q Consensus 150 k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~ 229 (756) .+|++. T Consensus 160 ~v~~d~-------------------------------------------------------------------------- 165 (492) T protein:vir:94 160 HPYLDE-------------------------------------------------------------------------- 165 (492) T ss_pred EEEecC-------------------------------------------------------------------------- Confidence 765421 Q ss_pred eeeecCceeEEEechhheE--eCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhcccccc Q lcl|NC_019423. 230 EKALVNRPTVEMLNPNNVV--IDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSD 307 (756) Q Consensus 230 ~~~~~g~~~ie~V~p~~~~--~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 307 (756) .|++++..++|.+++ ||++.... ..+. .|.|.... . . .... T Consensus 166 ----dg~~~~~~~~p~~~~~v~d~~~~~~---~~a~-ir~~~~~~-----------~--------------~--~~~~-- 208 (492) T protein:vir:94 166 ----EGEFKLFRVPAEQGIPIWTDKEHEE---LEAF-IRMYKLEN-----------E--------------T--KVEY-- 208 (492) T ss_pred ----CCceEEEEEcccceEEEEcCCCCCc---eEEE-EEEEeecc-----------c--------------e--eEEE-- Confidence 134667788888864 45443222 2222 23332100 0 0 0000 Q ss_pred ccccccccceEEEEEEEEEeeccCCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHH Q lcl|NC_019423. 
308 FQFKDALRKKVVAYEYWGFYDINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQ 387 (756) Q Consensus 308 ~~~~d~s~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q 387 (756) +.+ .+|..+++ ++.+.. ...-...+...+...++ +.|.+|++.+.. +-+|.|.+..+++++ T Consensus 209 --y~~---~~v~~~~~------~~~~~~-~~~~~~~~~~~~~~~~~--~~g~vPvv~~~n-----n~~~~sd~e~v~~li 269 (492) T protein:vir:94 209 --WDK---VTVNYYVY------ENGSLI-PDYSNNLENSKTHFSTG--SWGKIPFIPFKN-----NDLEISDIFMYKTLI 269 (492) T ss_pred --Eec---CeEEEEEE------ecCeee-ecccccccccccccccc--CCCccceEEecC-----CCCCCCchHHHHHHH Confidence 000 01212211 111110 00000111222222333 346778776643 446899999999999 Q ss_pred HHHHHHHHHHHHHHHhhcCCceEeecccc-CccchhhhhccccccccccccccccccccccCCCcchHHHHHHHHHHHHH Q lcl|NC_019423. 388 AILGATMRGMIDLLGRSANGQRGYPKGML-DTLNRRRYDDGQDYEYNPMQGNPSQSIMEHKFPELPQSAIVMTQMQNQEA 466 (756) Q Consensus 388 ~~iN~~~~~~~d~l~~~~~~~~~~~~gav-~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~ 466 (756) +.+|..++.+.+.+...+++.+.+. |.- +......... .....+....++.+.++..+.-.......++.+.+.+ T Consensus 270 Da~d~~~S~~~~~~~~~~~p~lv~~-g~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I 345 (492) T protein:vir:94 270 DAYNRRLSDLSNTFKDSNELTYVLK-NYDDQELPEFKRLL---RYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKI 345 (492) T ss_pred HHHHHHHHHHHHHHHHhcCceeeee-cCCcccchhhHHHH---hhccceecCCCCcceeEeccCCHHHHHHHHHHHHHHH Confidence 9999999999999998888876653 321 1111111111 1112222334445666554444456667788899999 Q ss_pred HHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCH Q lcl|NC_019423. 467 ESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKR 546 (756) Q Consensus 467 e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~ 546 (756) -..|++++.+.+.-++.. 
++.++...............+.|..+++++++++++++..- + ++ T Consensus 346 ~~~s~~p~~~~~~~~~n~--Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~----------~-~~----- 407 (492) T protein:vir:94 346 MLFGQAVDFSSDKFGSAP--SGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIK----------G-EH----- 407 (492) T ss_pred HHHhCCcCCCccccccCc--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC----------c-cc----- Confidence 999999877655332222 33334444445555566666777777777777766654211 1 11 Q ss_pred hHhcCcceEEEecccccHHHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhh-cCChhHHHHhhhccCCCChhhhhHHHH Q lcl|NC_019423. 547 EDLKGNFDIEVDINTAEIDNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAEL-KRMPDLAHELRTWQPQPDPMEEQLKQL 625 (756) Q Consensus 547 d~~~~~~Dv~V~~g~a~~~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~-~~~~~~~~~l~~~~~q~~p~~~~~~q~ 625 (756) .+|.|.-.........+ .+..+..+...++.+.. ++. ....+...-+++ + T Consensus 408 ------~~i~v~f~~~~p~~~~e--~~~~~~kl~giiS~et~-------~~~l~~v~d~~~E~er--------------i 458 (492) T protein:vir:94 408 ------KDVDISFNYNKVANTEL--QVQTAQQSMGIVSHETV-------LENHPFVEDLQAELER--------------I 458 (492) T ss_pred ------ceeeEEecCCCCCCHHH--HHHHHHHHhccCchHHH-------HHhCCCCCCHHHHHHH--------------H Confidence 12333333222211111 11122222222332221 111 112222111111 1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 626 AIQKAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARD 674 (756) Q Consensus 626 ~~~~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~ 674 (756) +.++.+ ....++.... .... ...+.......+. 
| T Consensus 459 ~~E~~~-~~~~~~~~~~---~~~~---~~~~~~~~~~~e~--------e 492 (492) T protein:vir:94 459 EQEQME-YNKQLPNLDD---GGAD---SAQQQERSNNKES--------E 492 (492) T ss_pred HHHHHH-HHhhcccccc---ccCC---CCccccCCccccC--------C Confidence 100000 0000000000 0000 0000000000000 0 No 83 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=99.53 E-value=2.4e-12 Score=84.34 Aligned_cols=467 Identities=12% Similarity=0.077 Sum_probs=205.0 Q ss_pred CCcccCCCCCCCccccccccCCCch----HHHHHHHHHHHHHHH-HhhHHHHHHHHHHHHhccccCC-----CCCCCCCC Q lcl|NC_019423. 1 MEHQDTFKPLPDPAQSEKLTDWKKE----PSIQLLKGDLESAKP-AHDAIMSQIREWNDLMEVKGKA-----KPPKIKGR 70 (756) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~l~~~~~~a~~-~~~~~~~~~~~~~~~y~~~~~~-----~~~~~~gr 70 (756) |.=-+ ++-+|--. -....|....++.+= -.+.++.+.++|..||.|.... ..+..+.| T Consensus 1 m~~~~------------~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~ 68 (500) T protein:vir:30 1 MGVIQ------------KIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKR 68 (500) T ss_pred CchHH------------HHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccC Confidence 11111 11111000 000011111111111 1234556788899999876221 11122222 Q ss_pred CcccCHHHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEE Q lcl|NC_019423. 71 SQVQPRLVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIAR 150 (756) Q Consensus 71 S~~v~~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k 150 (756) .....+.-...++ -+++ .+|+-..-+.+ +|. ..+++++-++. 
.|+-...+..++..|+..|.+++| T Consensus 69 ~~~slnl~~~i~~-~~A~---lv~~e~~~i~~-----~d~----~~~~~l~~il~-~n~f~~~~~~~~e~a~a~G~~~~k 134 (500) T protein:vir:30 69 DLNHLPIARTAAK-KIAS---LVFNEQAEIKV-----DDD----AANEFISETLK-NDRFNKNFERYLESCLALGGLAMR 134 (500) T ss_pred ceeecchHHHHHH-HHhh---hhcCCcceEec-----CCh----HHHHHHHHHHh-hccHHHHHHHHHHHHhhcCCEEEE Confidence 2233333333332 2233 34554444444 343 44556766554 444566688899999999999999 Q ss_pred EeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEee Q lcl|NC_019423. 151 IGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVE 230 (756) Q Consensus 151 ~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~ 230 (756) +||+. T Consensus 135 ~~~d~--------------------------------------------------------------------------- 139 (500) T protein:vir:30 135 PYVDG--------------------------------------------------------------------------- 139 (500) T ss_pred EEEeC--------------------------------------------------------------------------- Confidence 99951 Q ss_pred eeecCceeEEEechhheEe-CCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhcccccccc Q lcl|NC_019423. 231 KALVNRPTVEMLNPNNVVI-DPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQ 309 (756) Q Consensus 231 ~~~~g~~~ie~V~p~~~~~-Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 309 (756) ++|+|+.|+++.|++ -.+.. ....|-++++.. .+... ..-.|-.++.+.+.......... ...... T Consensus 140 ----~~~~I~~v~ad~~~P~~~d~~-~~~~~a~~~~~~-~~~~~---~~~~yt~lE~h~~~~~~~~~I~n----~ly~~~ 206 (500) T protein:vir:30 140 ----DKVRVAFVQAPVFLPLQSNTQ-DVSSAAVVIKSV-KTING---KEVYYTLIEFHEWQSSDDYVISN----ELYRSD 206 (500) T ss_pred ----CceEEEEEcCCeeEEEEEcCC-CeEEEEEEEEEe-eeecC---CceEEEEEEEEEEeCCceeEEEE----EEEecc Confidence 124566666666653 11111 122222222111 11000 00001111111111000000000 000000 Q ss_pred ccccccceEEEEEEEEEeeccCCceeEEEEEEEECCEEEEecccccCCCccceEEee----eeeecCcccCCchHHHhHH Q lcl|NC_019423. 
310 FKDALRKKVVAYEYWGFYDINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVP----YMPRKRELFGEADAELLGD 385 (756) Q Consensus 310 ~~d~s~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~----~~~~~~~~~G~g~v~~~~d 385 (756) ..+.-+..|-+.++|.- + .+.+.+ ....+.||+.+. .....++.+|.|++.++++ T Consensus 207 ~~~~lG~~v~l~~~~~~--l-------------~~~~~~------~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~ 265 (500) T protein:vir:30 207 DKAKVGSRVPLSEVYKD--L-------------KDEAKV------TDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKT 265 (500) T ss_pred cccccCcccccccccCC--c-------------CcceEe------ccCCCccEEEecCCccccccCCCccCCchhhhhHH Confidence 00000112222222210 0 000000 111223344432 2334578899999999999 Q ss_pred HHHHHHHHHHHHHHHHHhhcCCceEeeccccCccch---------hhhhcccccccccccccc--ccccccccCCCc-ch Q lcl|NC_019423. 386 NQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNR---------RRYDDGQDYEYNPMQGNP--SQSIMEHKFPEL-PQ 453 (756) Q Consensus 386 ~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~---------~~~~~~~~~~~~~~~~~~--~~~i~~~~~~~~-~~ 453 (756) ..+.+|..++++.+.+.. +..++.++++.+..... ..++. ....+..+.... ...++..+ |.+ .. T Consensus 266 lid~lD~~~s~~~~e~~~-g~~~i~v~~~~l~~~~~~~~g~~~~~~~~d~-~~~~~~~~~~~~~~~~~i~~~~-~~ir~e 342 (500) T protein:vir:30 266 TIDFINTTYDEFMWEVKM-GQRRVAVPESLTALTVRTTDGDVVPRPRFES-DQNVYIRMGGRDLDSSAIQDLT-TPIRAD 342 (500) T ss_pred HHHHHHHHHHHHHHHHHh-CcceeeechHHhcccCCCCCccccCCcccCC-CcceEEEcCCCCCcCcceeEec-cccChH Confidence 999999999999998865 67788888777632210 00110 111111122222 23355444 344 34 Q ss_pred HHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--hCCCC Q lcl|NC_019423. 454 SAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAV--FLSEK 531 (756) Q Consensus 454 ~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q--~~~~~ 531 (756) .....++.+...+....|++....|.++.. ..||+++....+..-+....+.+.|..+++++.+.++.+..- ++... 
T Consensus 343 ~~~~~l~~~l~~i~~~~gls~~~~~~~~~g-~~TAtei~s~~~~~~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~~~~ 421 (500) T protein:vir:30 343 DYIKAINEGLSLFEMQIGVSAGLFSFDGKS-MKTATEIVSENSDTYQMRNSIVALVEQSLKELVISIFEIAKAYDLYQSE 421 (500) T ss_pred HHHHHHHHHHHHHHHHhCCCccccccCcCc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC Confidence 566677777788888888888877766543 368988877766677777778888988999999999887653 22211 Q ss_pred cEEEEecCceeecCHhHhcCcceEEEecccc--cHHHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCChh--HHHH Q lcl|NC_019423. 532 EVVRITNEQYVEIKREDLKGNFDIEVDINTA--EIDNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMPD--LAHE 607 (756) Q Consensus 532 r~iRI~g~~~v~i~~d~~~~~~Dv~V~~g~a--~~~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~~--~~~~ 607 (756) ....++|+|+-+.+ .-.....+..+.+... ..++.... +++..|..+ +.+. T Consensus 422 -----------------~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~a--Gi~s~~~~------i~~~~g~~eeea~~~ 476 (500) T protein:vir:30 422 -----------------VPSMDNISISLDDGVFTDRDAELDYWIKVVNA--GFGTREMA------IQKVLNVTEEKAQEI 476 (500) T ss_pred -----------------CCCCcceEEEeCCCCCCCHHHHHHHHHHHHHc--CCCCHHHH------HHhcCCCCHHHHHHH Confidence 12234455544333 2222222333333221 22332211 223333321 1122 Q ss_pred hhhccCCCChhhhhHHHHHHHHHHHH Q lcl|NC_019423. 608 LRTWQPQPDPMEEQLKQLAIQKAQLE 633 (756) Q Consensus 608 l~~~~~q~~p~~~~~~q~~~~~aq~e 633 (756) +.+++....+.--... .....--| T Consensus 477 l~~i~~E~~~~~~~~~--~~~~~~g~ 500 (500) T protein:vir:30 477 AAEINTGIVDEINQQR--TDTHLYGE 500 (500) T ss_pred HHHHHHhccccCCCCC--ccccccCC Confidence 2221111000000000 00000000 No 84 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=99.53 E-value=2.4e-12 Score=84.34 Aligned_cols=467 Identities=12% Similarity=0.077 Sum_probs=205.0 Q ss_pred CCcccCCCCCCCccccccccCCCch----HHHHHHHHHHHHHHH-HhhHHHHHHHHHHHHhccccCC-----CCCCCCCC Q lcl|NC_019423. 
1 MEHQDTFKPLPDPAQSEKLTDWKKE----PSIQLLKGDLESAKP-AHDAIMSQIREWNDLMEVKGKA-----KPPKIKGR 70 (756) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~l~~~~~~a~~-~~~~~~~~~~~~~~~y~~~~~~-----~~~~~~gr 70 (756) |.=-+ ++-+|--. -....|....++.+= -.+.++.+.++|..||.|.... ..+..+.| T Consensus 1 m~~~~------------~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~ 68 (500) T protein:vir:98 1 MGVIQ------------KIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKR 68 (500) T ss_pred CchHH------------HHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccC Confidence 11111 11111000 000011111111111 1234556788899999876221 11122222 Q ss_pred CcccCHHHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEE Q lcl|NC_019423. 71 SQVQPRLVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIAR 150 (756) Q Consensus 71 S~~v~~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k 150 (756) .....+.-...++ -+++ .+|+-..-+.+ +|. ..+++++-++. .|+-...+..++..|+..|.+++| T Consensus 69 ~~~slnl~~~i~~-~~A~---lv~~e~~~i~~-----~d~----~~~~~l~~il~-~n~f~~~~~~~~e~a~a~G~~~~k 134 (500) T protein:vir:98 69 DLNHLPIARTAAK-KIAS---LVFNEQAEIKV-----DDD----AANEFISETLK-NDRFNKNFERYLESCLALGGLAMR 134 (500) T ss_pred ceeecchHHHHHH-HHhh---hhcCCcceEec-----CCh----HHHHHHHHHHh-hccHHHHHHHHHHHHhhcCCEEEE Confidence 2233333333332 2233 34554444444 343 44556766554 444566688899999999999999 Q ss_pred EeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEee Q lcl|NC_019423. 151 IGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVE 230 (756) Q Consensus 151 ~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~ 230 (756) +||+. 
T Consensus 135 ~~~d~--------------------------------------------------------------------------- 139 (500) T protein:vir:98 135 PYVDG--------------------------------------------------------------------------- 139 (500) T ss_pred EEEeC--------------------------------------------------------------------------- Confidence 99951 Q ss_pred eeecCceeEEEechhheEe-CCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhcccccccc Q lcl|NC_019423. 231 KALVNRPTVEMLNPNNVVI-DPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQ 309 (756) Q Consensus 231 ~~~~g~~~ie~V~p~~~~~-Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 309 (756) ++|+|+.|+++.|++ -.+.. ....|-++++.. .+... ..-.|-.++.+.+.......... ...... T Consensus 140 ----~~~~I~~v~ad~~~P~~~d~~-~~~~~a~~~~~~-~~~~~---~~~~yt~lE~h~~~~~~~~~I~n----~ly~~~ 206 (500) T protein:vir:98 140 ----DKVRVAFVQAPVFLPLQSNTQ-DVSSAAVVIKSV-KTING---KEVYYTLIEFHEWQSSDDYVISN----ELYRSD 206 (500) T ss_pred ----CceEEEEEcCCeeEEEEEcCC-CeEEEEEEEEEe-eeecC---CceEEEEEEEEEEeCCceeEEEE----EEEecc Confidence 124566666666653 11111 122222222111 11000 00001111111111000000000 000000 Q ss_pred ccccccceEEEEEEEEEeeccCCceeEEEEEEEECCEEEEecccccCCCccceEEee----eeeecCcccCCchHHHhHH Q lcl|NC_019423. 310 FKDALRKKVVAYEYWGFYDINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVP----YMPRKRELFGEADAELLGD 385 (756) Q Consensus 310 ~~d~s~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~----~~~~~~~~~G~g~v~~~~d 385 (756) ..+.-+..|-+.++|.- + .+.+.+ ....+.||+.+. .....++.+|.|++.++++ T Consensus 207 ~~~~lG~~v~l~~~~~~--l-------------~~~~~~------~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~ 265 (500) T protein:vir:98 207 DKAKVGSRVPLSEVYKD--L-------------KDEAKV------TDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKT 265 (500) T ss_pred cccccCcccccccccCC--c-------------CcceEe------ccCCCccEEEecCCccccccCCCccCCchhhhhHH Confidence 00000112222222210 0 000000 111223344432 2334578899999999999 Q ss_pred HHHHHHHHHHHHHHHHHhhcCCceEeeccccCccch---------hhhhcccccccccccccc--ccccccccCCCc-ch Q lcl|NC_019423. 
386 NQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNR---------RRYDDGQDYEYNPMQGNP--SQSIMEHKFPEL-PQ 453 (756) Q Consensus 386 ~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~---------~~~~~~~~~~~~~~~~~~--~~~i~~~~~~~~-~~ 453 (756) ..+.+|..++++.+.+.. +..++.++++.+..... ..++. ....+..+.... ...++..+ |.+ .. T Consensus 266 lid~lD~~~s~~~~e~~~-g~~~i~v~~~~l~~~~~~~~g~~~~~~~~d~-~~~~~~~~~~~~~~~~~i~~~~-~~ir~e 342 (500) T protein:vir:98 266 TIDFINTTYDEFMWEVKM-GQRRVAVPESLTALTVRTTDGDVVPRPRFES-DQNVYIRMGGRDLDSSAIQDLT-TPIRAD 342 (500) T ss_pred HHHHHHHHHHHHHHHHHh-CcceeeechHHhcccCCCCCccccCCcccCC-CcceEEEcCCCCCcCcceeEec-cccChH Confidence 999999999999998865 67788888777632210 00110 111111122222 23355444 344 34 Q ss_pred HHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--hCCCC Q lcl|NC_019423. 454 SAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAV--FLSEK 531 (756) Q Consensus 454 ~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q--~~~~~ 531 (756) .....++.+...+....|++....|.++.. ..||+++....+..-+....+.+.|..+++++.+.++.+..- ++... T Consensus 343 ~~~~~l~~~l~~i~~~~gls~~~~~~~~~g-~~TAtei~s~~~~~~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~~~~ 421 (500) T protein:vir:98 343 DYIKAINEGLSLFEMQIGVSAGLFSFDGKS-MKTATEIVSENSDTYQMRNSIVALVEQSLKELVISIFEIAKAYDLYQSE 421 (500) T ss_pred HHHHHHHHHHHHHHHHhCCCccccccCcCc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC Confidence 566677777788888888888877766543 368988877766677777778888988999999999887653 22211 Q ss_pred cEEEEecCceeecCHhHhcCcceEEEecccc--cHHHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCChh--HHHH Q lcl|NC_019423. 532 EVVRITNEQYVEIKREDLKGNFDIEVDINTA--EIDNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMPD--LAHE 607 (756) Q Consensus 532 r~iRI~g~~~v~i~~d~~~~~~Dv~V~~g~a--~~~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~~--~~~~ 607 (756) ....++|+|+-+.+ .-.....+..+.+... ..++.... +++..|..+ +.+. 
T Consensus 422 -----------------~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~a--Gi~s~~~~------i~~~~g~~eeea~~~ 476 (500) T protein:vir:98 422 -----------------VPSMDNISISLDDGVFTDRDAELDYWIKVVNA--GFGTREMA------IQKVLNVTEEKAQEI 476 (500) T ss_pred -----------------CCCCcceEEEeCCCCCCCHHHHHHHHHHHHHc--CCCCHHHH------HHhcCCCCHHHHHHH Confidence 12234455544333 2222222333333221 22332211 223333321 1122 Q ss_pred hhhccCCCChhhhhHHHHHHHHHHHH Q lcl|NC_019423. 608 LRTWQPQPDPMEEQLKQLAIQKAQLE 633 (756) Q Consensus 608 l~~~~~q~~p~~~~~~q~~~~~aq~e 633 (756) +.+++....+.--... .....--| T Consensus 477 l~~i~~E~~~~~~~~~--~~~~~~g~ 500 (500) T protein:vir:98 477 AAEINTGIVDEINQQR--TDTHLYGE 500 (500) T ss_pred HHHHHHhccccCCCCC--ccccccCC Confidence 2221111000000000 00000000 No 85 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=99.53 E-value=2.2e-13 Score=89.99 Aligned_cols=487 Identities=10% Similarity=-0.018 Sum_probs=207.3 Q ss_pred CCcccCCCCCCCccccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCC--CCCC--CCCCCcccCH Q lcl|NC_019423. 1 MEHQDTFKPLPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKA--KPPK--IKGRSQVQPR 76 (756) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~--~~~~--~~grS~~v~~ 76 (756) |+ =--+++-|++..+ ..... +....+.|.....+.++..+||.|.-.- .+++ .+..-+++.+ T Consensus 1 ~~---------~~~~~~~~~~~~~-~~~~~----i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~ki~~n 66 (499) T protein:vir:10 1 MA---------VVIDKDLLDDVNE-PNIEA----INYAIRELQNRKKRLDKLSDYYNGKQEIEKHEFDNATVEAANVMVN 66 (499) T ss_pred Cc---------cchhhhHHhhhhc-CCHHH----HHHHHHHHHHHHHHHHHHHHHhccccchhcCCcCcCCCCcceeecc Confidence 11 0012222222211 11122 3334445666677788899999986211 1222 2234456666 Q ss_pred HHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeee Q lcl|NC_019423. 
77 LVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERK 156 (756) Q Consensus 77 ~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~ 156 (756) ..+..|+....-| ||.+.- |.+ +|.+..+...+++ ..|+--.....+.++++++|.+...+|++.. T Consensus 67 ~~~~Iv~~~~~~l----~g~p~~--~~~---~~~~~~~~l~~~~-----~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~ 132 (499) T protein:vir:10 67 HAKYITDMNVGFM----TGNPVK--YVA---EKGKNIDDILEVF-----NQIDIHKHDIELEKDLSVFGYGYELLYLKKT 132 (499) T ss_pred hHHHHHHHHhhhh----cccCce--eec---CChhHHHHHHHHH-----hhcCHhHHHHHHHHHHHhcCceEEEEEeccc Confidence 6666666665444 665533 333 3444444333332 2233234466899999999999888866422 Q ss_pred eeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCc Q lcl|NC_019423. 157 TVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNR 236 (756) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~ 236 (756) - ..... .+..........+ T Consensus 133 g-~~~~~------------------------------------------------------------~~~~~~~~~~~~~ 151 (499) T protein:vir:10 133 D-PISVR------------------------------------------------------------DELGNEKLTPNTE 151 (499) T ss_pred c-ccccc------------------------------------------------------------ccccccccccccc Confidence 1 00000 0000000112245 Q ss_pred eeEEEechhheEeCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccc Q lcl|NC_019423. 237 PTVEMLNPNNVVIDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRK 316 (756) Q Consensus 237 ~~ie~V~p~~~~~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~ 316 (756) +++..|+|.++++=.+.. ...-...+.+.+.+.+. .... 
T Consensus 152 ~~~~~v~p~~~~~v~~d~--~~~~~~~~i~~~~~~~~---------------------------------------~~~~ 190 (499) T protein:vir:10 152 LKIEVIDPRATVVVCDDT--VEHDPLFAVFTQEKKDL---------------------------------------EGNT 190 (499) T ss_pred eEEEEEcccceEEEecCC--CCcceEEEEEEEEEeec---------------------------------------CCCc Confidence 788999999865422211 11111122222221100 0011 Q ss_pred eEEEEEEEEEeeccCCceeEEEEEEEE-------CCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHH Q lcl|NC_019423. 317 KVVAYEYWGFYDINDDGSLEPIVATWI-------GSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAI 389 (756) Q Consensus 317 ~V~v~E~w~k~d~~~~g~~~~~~~~~~-------g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~ 389 (756) .+..+|+|..- .+ ++++.. +..++...+++ .|.+|++.+.. +.+|.|.+..++++++. T Consensus 191 ~~~~~~iyt~~-----~i---~~~~~~~~~~~~~~~~~~~~~~~~--~g~vPvv~~~n-----~~~~~~d~e~v~~liD~ 255 (499) T protein:vir:10 191 NGYSITVYMPQ-----RI---VEYRTKTTMEVSANDPIVYDGENL--FGAVPIIEFRN-----NEERQGDFEQLISLIDA 255 (499) T ss_pred eEEEEEEEeCC-----eE---EEEEecCCccccCcceecccccCC--CCccceEEecC-----CCCCCCchHhHHHHHHH Confidence 23344444321 11 111111 12233333444 36677776543 55789999999999999 Q ss_pred HHHHHHHHHHHHHhhcCCceEeeccccCccchhhhhccccccccccccccccccccccCCCcchHHHHHHHHHHHHHHHH Q lcl|NC_019423. 390 LGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQGNPSQSIMEHKFPELPQSAIVMTQMQNQEAESL 469 (756) Q Consensus 390 iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~e~~ 469 (756) +|..++.+.+.+...+.|.+++.-..++......... .......+....++.++++..+.-...+...+..+.+.|-.. 
T Consensus 256 ~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~-~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~ 334 (499) T protein:vir:10 256 YNLLQTDRISDKEAFVDALLVTFGFGLGDDKDDIQRL-KRGAIEAPPREEGADIEWLTKSFDETQVNLLSQSIENDIHKI 334 (499) T ss_pred HHHHHHHHHHHHHHhcCceeeeecCccccccchhhhh-hhcceeccCCCCCCcceEEeccCCHHHHHHHHHHHHHHHHHH Confidence 9999999999999999888776532222211111100 000111111122333555555444566677888899999999 Q ss_pred hchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHh Q lcl|NC_019423. 470 TGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDL 549 (756) Q Consensus 470 tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~ 549 (756) |++++.+.+.-+.. .++.++..+............+.|..++++++++++.++... |.. .+ + T Consensus 335 s~~p~~~~~~~~gn--~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~----------~~~---~d---~ 396 (499) T protein:vir:10 335 SYVPNMNDEKFMGN--VSGEAMKFKLFGLENLLSIKQRYFFDGLRRRLKLIQTIVNIK----------GAN---DD---A 396 (499) T ss_pred hCcccCCchhhccc--chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc----------CCc---cc---c Confidence 99887654422222 233335555555566666666777777777777777664311 111 00 0 Q ss_pred cCcceEEEecccccHHHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCChhHHHHhhhccCCCChhhhhHHHHHHHH Q lcl|NC_019423. 550 KGNFDIEVDINTAEIDNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMPDLAHELRTWQPQPDPMEEQLKQLAIQK 629 (756) Q Consensus 550 ~~~~Dv~V~~g~a~~~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~~~~~~l~~~~~q~~p~~~~~~q~~~~~ 629 (756) .+|.|.-........ ...+.+++.++..++.+.+... ++...+....++++..+.... 
.+ T Consensus 397 ---~~i~i~f~~~~p~n~--~e~~~~~~kl~g~iS~et~~~~------l~~v~d~~~E~~ri~~E~~~~---------~~ 456 (499) T protein:vir:10 397 ---SGCKISLVANIPSNL--SDVVNNVKNADGIIPRKYTYSW------LPDVDNPQDVIDEMNQQDAET---------IK 456 (499) T ss_pred ---ccceEEeCCCCCCCH--HHHHHHHHHHhccCChHHHHHh------CCCCCCHHHHHHHHHHHHHHH---------HH Confidence 123332222221111 1111112222222333222111 112222222222211100000 00 Q ss_pred HHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 630 AQLENEELQSKIALNNAKAK-EAASSGDLKDLDYLEQESGTKHARDM 675 (756) Q Consensus 630 aq~e~~~~qa~a~~~~a~a~-~~~aq~~~~~~~~~~q~~~~k~~~~~ 675 (756) ...+...............+ ..+........+ ..+. .+.+.+ T Consensus 457 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~---~~~~~~ 499 (499) T protein:vir:10 457 KNQEALRGQDPDRLELEDKQDDSSENDKEAGSN-HNQS---HRTRAV 499 (499) T ss_pred HHHhhhccCCCCCCCCCCCCcccCCCCCCCccc-cccC---CCCCCC Confidence 00000000000000000000 000000000000 0000 000000 No 86 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=99.51 E-value=1.9e-12 Score=84.82 Aligned_cols=471 Identities=9% Similarity=0.014 Sum_probs=206.4 Q ss_pred CCc--------------ccCCCCCCCccccccccCCCchHHHH-HHHHHHHHHHHHhhHHH-HHHHHHHHHhccccCC-- Q lcl|NC_019423. 1 MEH--------------QDTFKPLPDPAQSEKLTDWKKEPSIQ-LLKGDLESAKPAHDAIM-SQIREWNDLMEVKGKA-- 62 (756) Q Consensus 1 ~~~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~-~l~~~~~~a~~~~~~~~-~~~~~~~~~y~~~~~~-- 62 (756) |-| +..|++ .... --.|...+-.. ....++....+.|.+.. 
.+.++..+||.|.-.- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~----~~n~-~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~ 75 (511) T protein:vir:99 1 MLKVNEFETDTDLRGNINYLFND----EANV-VYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLV 75 (511) T ss_pred Cccccchhhhhhhhhhhhhhhhh----hhCC-ccccchhhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCcccc Confidence 111 111211 0000 00243222211 11233555555555444 4567788999875221 Q ss_pred --CCCCC--CCCCcccCHHHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHH Q lcl|NC_019423. 63 --KPPKI--KGRSQVQPRLVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYV 138 (756) Q Consensus 63 --~~~~~--~grS~~v~~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v 138 (756) ..... +...+++.+.....|+....-| ||.+.-+ .. +|.+.- ++++-++. .|+--.....+. T Consensus 76 ~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl----~g~p~~~--~~---~d~~~~----~~l~~~~~-~n~~~~~~~~~~ 141 (511) T protein:vir:99 76 ELTRRKEEYMADNRVAHDYASYISDFINGYF----LGNPIQY--QD---DDKDVL----EAIEAFND-LNDVESHNRSLG 141 (511) T ss_pred ccCcccccccCcceeecchHHHHHHHHHhhh----cccCcee--ec---CchHHH----HHHHHHHh-hcCHhHHHHHHH Confidence 11111 2234577777777776665544 6644333 22 343332 34444433 344445566788 Q ss_pred HHHhhcCceEEEEeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcce Q lcl|NC_019423. 139 HSIVDDGTGIARIGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATY 218 (756) Q Consensus 139 ~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~ 218 (756) +++++.|.+.+.+||+. T Consensus 142 ~~~~i~G~a~~~vy~de--------------------------------------------------------------- 158 (511) T protein:vir:99 142 LDLSIYGKAYELMIRNQ--------------------------------------------------------------- 158 (511) T ss_pred HHHHhcCeeEEEEEeCC--------------------------------------------------------------- Confidence 99999999887776531 Q ss_pred eccCceeEEEeeeeecCceeEEEechhheE--eCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhh Q lcl|NC_019423. 
219 AIQTGVTEVEVEKALVNRPTVEMLNPNNVV--IDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPIT 296 (756) Q Consensus 219 ~~~~g~~~~~~~~~~~g~~~ie~V~p~~~~--~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~ 296 (756) .|+|++..++|.++| ||+.... ...+. .+.|.+.. .+ T Consensus 159 ---------------d~~~~i~~~~p~~~~~vyd~~~~~---~~~~~-vr~~~~~~-----------~~----------- 197 (511) T protein:vir:99 159 ---------------DDETRLYKSDAMSTFVIYDNTIER---NSIAG-VRYLRTKP-----------ID----------- 197 (511) T ss_pred ---------------CCceEEEEEccceeEEEEcCCCCC---ceEEE-EEEEEeee-----------cc----------- Confidence 133677888888876 4544321 12222 23332210 00 Q ss_pred chhhhccccccccccccccceEEEEEEEEEeeccCCceeEEEEEEEECCEEE-----EecccccCCCccceEEeeeeeec Q lcl|NC_019423. 297 DPDHESKTPSDFQFKDALRKKVVAYEYWGFYDINDDGSLEPIVATWIGSTLI-----RMENNPFPDGKLPLVVVPYMPRK 371 (756) Q Consensus 297 ~~~~~~~~~~~~~~~d~s~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g~~~L-----~~~~~P~~~~~~Pfv~~~~~~~~ 371 (756) +.....+..+|+|.. +++ +++...++..+ .....|.+.+.+|++.++. T Consensus 198 ---------------~~~~~~~~~~~vyt~-----~~i---~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n---- 250 (511) T protein:vir:99 198 ---------------KTDEDEVFTVDLFTS-----HGV---YRYLTSRTNGLKLTPRENGFESHSFERMPITEFSN---- 250 (511) T ss_pred ---------------cCccceEEEEEEEeC-----CcE---EEEEecCCccccccccccccccCCCCccceEEecC---- Confidence 000112334455532 111 11222111111 1122233445677766543 Q ss_pred CcccCCchHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhh-hhcccc--------ccccccccccccc Q lcl|NC_019423. 372 RELFGEADAELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRR-YDDGQD--------YEYNPMQGNPSQS 442 (756) Q Consensus 372 ~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~-~~~~~~--------~~~~~~~~~~~~~ 442 (756) +.+|.|.+..++++++.+|..++.+.+.+...+++.+++.-....+..... ...... ...+......+.. 
T Consensus 251 -n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 329 (511) T protein:vir:99 251 -NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVD 329 (511) T ss_pred -CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhchhhhhccCcccCchhhcccccccceecccccccccccccCCCCcc Confidence 457899999999999999999999999998888776554321111111111 000000 0001111122334 Q ss_pred cccccCCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 443 IMEHKFPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICA 522 (756) Q Consensus 443 i~~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~ 522 (756) ++++..+.-.......+..+.+.+-..|++++.+.+.-+.. .++.++..+...........-+.|..++++++++++. T Consensus 330 ~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~gn--~Sg~Alk~~~~~l~~ka~~k~~~~~~~l~~~~~li~~ 407 (511) T protein:vir:99 330 GGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGT--QSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLET 407 (511) T ss_pred eeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccccc--chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 55555444455666778888999999999998766533222 2344455555566666677777888888888888888 Q ss_pred HHHhhCCCCcEEEEecCceeecCHhHhcCcceEEEecccccH--HHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcC Q lcl|NC_019423. 523 MNAVFLSEKEVVRITNEQYVEIKREDLKGNFDIEVDINTAEI--DNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKR 600 (756) Q Consensus 523 li~q~~~~~r~iRI~g~~~v~i~~d~~~~~~Dv~V~~g~a~~--~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~ 600 (756) ++........ +.++. +|.|.-..... ....++.+ ..+...++.+.+.. + ++. 
T Consensus 408 ~~~~~~~~~~-------------~~~~~---~i~i~f~~~~p~n~~e~~~~~----~kl~GiiS~et~l~----~--l~~ 461 (511) T protein:vir:99 408 ILKNTRSIDV-------------SKDFN---TVRYVYNRNLPKSLIEELKAY----IDSGGKISQTTLMS----L--FSF 461 (511) T ss_pred HHHhcCCccc-------------ccccc---cceEEeCCCCCcCHHHHHHHH----HHHhccCCHHHHHH----h--CCC Confidence 7754321100 01111 12232222221 11122211 12222233322211 1 112 Q ss_pred ChhHHHHhhhccCCCChhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 601 MPDLAHELRTWQPQPDPMEEQLKQLAIQKAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARD 674 (756) Q Consensus 601 ~~~~~~~l~~~~~q~~p~~~~~~q~~~~~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~ 674 (756) ..+...-++++. .++... + +.............. ........+...-+ .| T Consensus 462 v~D~~~E~~ri~--------------~E~~~~----~--~~~~~~~~~~~~~~~--~~~~~~~~~~~~d~--~e 511 (511) T protein:vir:99 462 FQDPELEVKKIE--------------EDEKES----I--KKAQKNMYQDPRNIN--DDEQDDSTKDSIDK--KE 511 (511) T ss_pred CCCHHHHHHHHH--------------HHHHHH----H--HHHhhcccccCCCCC--CCCCCCCCcCcccc--cC Confidence 222222121111 100000 0 000000000000000 00000000000000 00 No 87 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=99.51 E-value=8.5e-13 Score=86.75 Aligned_cols=447 Identities=11% Similarity=0.024 Sum_probs=201.8 Q ss_pred CCcccCCCCCCCccccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCC------------------ Q lcl|NC_019423. 1 MEHQDTFKPLPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKA------------------ 62 (756) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~------------------ 62 (756) |-=++++- ..+..+.+-+.+...| ..|.....+..+.++||.+.-.. 
T Consensus 1 ~~~~~~~~-------~~~~~~~~~e~i~~~i--------~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~ 65 (474) T protein:vir:94 1 MTLYKLID-------DIEAQGILPKHIEALI--------ESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETG 65 (474) T ss_pred CchHHHHh-------hccccCCCHHHHHHHH--------HHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhc Confidence 44444441 1222223332222222 23333334445555666542110 Q ss_pred -CCCCCCCCC--cccCHHHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHH Q lcl|NC_019423. 63 -KPPKIKGRS--QVQPRLVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVH 139 (756) Q Consensus 63 -~~~~~~grS--~~v~~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~ 139 (756) .....++|+ +++.+-.+..|+....-| ||.+.-+.+.+-+..|++ ..++++-++ ..|+--.....+.+ T Consensus 66 ~~~~~~~~~~~~ki~~n~~~~ivd~~~~yl----~g~pv~~~~~~~~~~~e~----~~~~l~~~~-~~n~~~~~~~~~~~ 136 (474) T protein:vir:94 66 GNVRRLDVSVNNKLNNSFDSEIVDTRVGYL----HGVPVTYDLDENAEKNEK----LKKFITNFA-IRNSVDDEDSEIGK 136 (474) T ss_pred ccccccccCcccccccchHHHHHHhHhhhe----eccceeEeeCCCCcchHH----HHHHHHHHH-hhcCHhHHHHHHHH Confidence 011233444 566666666666554443 676665665443333333 333444332 23444445667889 Q ss_pred HHhhcCceEEEEeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCccee Q lcl|NC_019423. 140 SIVDDGTGIARIGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYA 219 (756) Q Consensus 140 ~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~ 219 (756) ++++.|.+++.+|.+ T Consensus 137 ~~~~~G~a~~~~~~d----------------------------------------------------------------- 151 (474) T protein:vir:94 137 MAAICGYGARLAYID----------------------------------------------------------------- 151 (474) T ss_pred HHhhcCeEEEEEEeC----------------------------------------------------------------- Confidence 999999886654321 Q ss_pred ccCceeEEEeeeeecCceeEEEechhheEeCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchh Q lcl|NC_019423. 
220 IQTGVTEVEVEKALVNRPTVEMLNPNNVVIDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPD 299 (756) Q Consensus 220 ~~~g~~~~~~~~~~~g~~~ie~V~p~~~~~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~ 299 (756) ..|.+++..++|.++++=.+-. . +..+.+ +.|....+ T Consensus 152 -------------~~~~~~~~~i~p~~~~~v~d~~--~-~~~~~i-~~~~~~~~-------------------------- 188 (474) T protein:vir:94 152 -------------TNGDIRIKNIDPYNVIFVGDNI--L-EPTYSL-RYFYEKDD-------------------------- 188 (474) T ss_pred -------------CCCeeEEEEEcccceEEEEcCC--C-ceEEEE-EEEEEeeC-------------------------- Confidence 0134677888888865422211 1 122222 22211000 Q ss_pred hhccccccccccccccceEEEEEEEEEeeccCCceeEEEEEEEEC---CEEEEecccccCCCccceEEeeeeeecCcccC Q lcl|NC_019423. 300 HESKTPSDFQFKDALRKKVVAYEYWGFYDINDDGSLEPIVATWIG---STLIRMENNPFPDGKLPLVVVPYMPRKRELFG 376 (756) Q Consensus 300 ~~~~~~~~~~~~d~s~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g---~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G 376 (756) ..+..+..+|+|.+. . +..|.+ +.....++.|.+.|.+|++.++ ++.+| T Consensus 189 -------------~~~~~~~~~~~y~~~-----~-----~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g 240 (474) T protein:vir:94 189 -------------DNGTDYVYAEFYDNA-----Y-----YYVFRGEGIDALQEVGRYEHLFDYNPLFGVP-----NNKEM 240 (474) T ss_pred -------------CCceEEEEEEEEcCc-----e-----EEEEeecCCCcccccccccCCCCccceEEec-----CCCCC Confidence 000112234444321 1 111111 1112222223334667777654 35679 Q ss_pred CchHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhhhhccccccccccc-cccccccccccCCCcchHH Q lcl|NC_019423. 377 EADAELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQ-GNPSQSIMEHKFPELPQSA 455 (756) Q Consensus 377 ~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~-~~~~~~i~~~~~~~~~~~~ 455 (756) .|.+..++++++.+|..++.+.+.+...++|.+.+. |. ...++..... ...+.+. ......+.++..+.-.... 
T Consensus 241 ~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~-g~-~~~~~~~~~~---~~~~~i~~~~~~~~~~~l~~~~~~~~~ 315 (474) T protein:vir:94 241 IGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLR-GM-GMSEEMIQET---QKSGAFELFDKDMDVKYLTKDVNDTMI 315 (474) T ss_pred CCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhc-cC-CCCchhhhhh---hhcceeEecCCCCceeEEeccCCHHHH Confidence 999999999999999999999999998888876653 32 1111111111 1111111 1234456666555445667 Q ss_pred HHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEE Q lcl|NC_019423. 456 IVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVR 535 (756) Q Consensus 456 ~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iR 535 (756) ...+..+.+.+-..|++++.+.+.-++. .++.++...............+.|..++++++++++.++..-... T Consensus 316 ~~~~~~l~~~I~~~s~~p~~~~~~~~~n--~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~----- 388 (474) T protein:vir:94 316 ENHLDRIEKNIMRFAKSVNFNSDEFNGN--VPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKGYN----- 388 (474) T ss_pred HHHHHHHHHHHHHHhCCccccccccccc--chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCC----- Confidence 7788889999999999998776533222 244445555555666666677778888888888888876532111 Q ss_pred EecCceeecCHhHhcCcceEEEecccccH--HHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCChhHHHHhhhccC Q lcl|NC_019423. 536 ITNEQYVEIKREDLKGNFDIEVDINTAEI--DNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMPDLAHELRTWQP 613 (756) Q Consensus 536 I~g~~~v~i~~d~~~~~~Dv~V~~g~a~~--~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~~~~~~l~~~~~ 613 (756) ... .++ .+|.+.-..... ....++.+ ..+...++.+.+.. 
+ ++...++..-+++ T Consensus 389 -----~~~---~~~---~~i~~~f~~~~p~d~~e~a~~~----~kl~g~iS~et~~~----~--l~~v~d~~~E~er--- 444 (474) T protein:vir:94 389 -----LDD---DSY---LNLIFKFTRNIPVNKLEESQVL----INLKGQVSERTRLG----Q--SQLVDDVDYELDE--- 444 (474) T ss_pred -----CCc---ccc---ccceEEeCCCCCCCHHHHHHHH----HHHhccCchHHHHH----h--CCCCCCHHHHHHH--- Confidence 000 011 123333332222 11222222 22222233222111 1 1112222111111 Q ss_pred CCChhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 614 QPDPMEEQLKQLAIQKAQLENEELQSKIALNNAKAKEAASSGD 656 (756) Q Consensus 614 q~~p~~~~~~q~~~~~aq~e~~~~qa~a~~~~a~a~~~~aq~~ 656 (756) ++.++.+...... .........+....+-+ T Consensus 445 -----------i~~E~~e~~~~~~--~~~~~~~~~~~~~~~s~ 474 (474) T protein:vir:94 445 -----------MEKESLEFNDKLP--DIDEGDANDKSQNNQSE 474 (474) T ss_pred -----------HHHHHHHHHhhcc--cccCCCcCCCCccccCC Confidence 1111100000000 00000000000000000 No 88 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=99.51 E-value=8.5e-13 Score=86.75 Aligned_cols=447 Identities=11% Similarity=0.024 Sum_probs=201.8 Q ss_pred CCcccCCCCCCCccccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCC------------------ Q lcl|NC_019423. 1 MEHQDTFKPLPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKA------------------ 62 (756) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~------------------ 62 (756) |-=++++- ..+..+.+-+.+...| ..|.....+..+.++||.+.-.. 
T Consensus 1 ~~~~~~~~-------~~~~~~~~~e~i~~~i--------~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~ 65 (474) T protein:vir:10 1 MTLYKLID-------DIEAQGILPKHIEALI--------ESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETG 65 (474) T ss_pred CchHHHHh-------hccccCCCHHHHHHHH--------HHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhc Confidence 44444441 1222223332222222 23333334445555666542110 Q ss_pred -CCCCCCCCC--cccCHHHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHH Q lcl|NC_019423. 63 -KPPKIKGRS--QVQPRLVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVH 139 (756) Q Consensus 63 -~~~~~~grS--~~v~~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~ 139 (756) .....++|+ +++.+-.+..|+....-| ||.+.-+.+.+-+..|++ ..++++-++ ..|+--.....+.+ T Consensus 66 ~~~~~~~~~~~~ki~~n~~~~ivd~~~~yl----~g~pv~~~~~~~~~~~e~----~~~~l~~~~-~~n~~~~~~~~~~~ 136 (474) T protein:vir:10 66 GNVRRLDVSVNNKLNNSFDSEIVDTRVGYL----HGVPVTYDLDENAEKNEK----LKKFITNFA-IRNSVDDEDSEIGK 136 (474) T ss_pred ccccccccCcccccccchHHHHHHhHhhhe----eccceeEeeCCCCcchHH----HHHHHHHHH-hhcCHhHHHHHHHH Confidence 011233444 566666666666554443 676665665443333333 333444332 23444445667889 Q ss_pred HHhhcCceEEEEeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCccee Q lcl|NC_019423. 140 SIVDDGTGIARIGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYA 219 (756) Q Consensus 140 ~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~ 219 (756) ++++.|.+++.+|.+ T Consensus 137 ~~~~~G~a~~~~~~d----------------------------------------------------------------- 151 (474) T protein:vir:10 137 MAAICGYGARLAYID----------------------------------------------------------------- 151 (474) T ss_pred HHhhcCeEEEEEEeC----------------------------------------------------------------- Confidence 999999886654321 Q ss_pred ccCceeEEEeeeeecCceeEEEechhheEeCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchh Q lcl|NC_019423. 
220 IQTGVTEVEVEKALVNRPTVEMLNPNNVVIDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPD 299 (756) Q Consensus 220 ~~~g~~~~~~~~~~~g~~~ie~V~p~~~~~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~ 299 (756) ..|.+++..++|.++++=.+-. . +..+.+ +.|....+ T Consensus 152 -------------~~~~~~~~~i~p~~~~~v~d~~--~-~~~~~i-~~~~~~~~-------------------------- 188 (474) T protein:vir:10 152 -------------TNGDIRIKNIDPYNVIFVGDNI--L-EPTYSL-RYFYEKDD-------------------------- 188 (474) T ss_pred -------------CCCeeEEEEEcccceEEEEcCC--C-ceEEEE-EEEEEeeC-------------------------- Confidence 0134677888888865422211 1 122222 22211000 Q ss_pred hhccccccccccccccceEEEEEEEEEeeccCCceeEEEEEEEEC---CEEEEecccccCCCccceEEeeeeeecCcccC Q lcl|NC_019423. 300 HESKTPSDFQFKDALRKKVVAYEYWGFYDINDDGSLEPIVATWIG---STLIRMENNPFPDGKLPLVVVPYMPRKRELFG 376 (756) Q Consensus 300 ~~~~~~~~~~~~d~s~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g---~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G 376 (756) ..+..+..+|+|.+. . +..|.+ +.....++.|.+.|.+|++.++ ++.+| T Consensus 189 -------------~~~~~~~~~~~y~~~-----~-----~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g 240 (474) T protein:vir:10 189 -------------DNGTDYVYAEFYDNA-----Y-----YYVFRGEGIDALQEVGRYEHLFDYNPLFGVP-----NNKEM 240 (474) T ss_pred -------------CCceEEEEEEEEcCc-----e-----EEEEeecCCCcccccccccCCCCccceEEec-----CCCCC Confidence 000112234444321 1 111111 1112222223334667777654 35679 Q ss_pred CchHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhhhhccccccccccc-cccccccccccCCCcchHH Q lcl|NC_019423. 377 EADAELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQ-GNPSQSIMEHKFPELPQSA 455 (756) Q Consensus 377 ~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~-~~~~~~i~~~~~~~~~~~~ 455 (756) .|.+..++++++.+|..++.+.+.+...++|.+.+. |. ...++..... ...+.+. ......+.++..+.-.... 
T Consensus 241 ~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~-g~-~~~~~~~~~~---~~~~~i~~~~~~~~~~~l~~~~~~~~~ 315 (474) T protein:vir:10 241 IGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLR-GM-GMSEEMIQET---QKSGAFELFDKDMDVKYLTKDVNDTMI 315 (474) T ss_pred CCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhc-cC-CCCchhhhhh---hhcceeEecCCCCceeEEeccCCHHHH Confidence 999999999999999999999999998888876653 32 1111111111 1111111 1234456666555445667 Q ss_pred HHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEE Q lcl|NC_019423. 456 IVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVR 535 (756) Q Consensus 456 ~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iR 535 (756) ...+..+.+.+-..|++++.+.+.-++. .++.++...............+.|..++++++++++.++..-... T Consensus 316 ~~~~~~l~~~I~~~s~~p~~~~~~~~~n--~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~----- 388 (474) T protein:vir:10 316 ENHLDRIEKNIMRFAKSVNFNSDEFNGN--VPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKGYN----- 388 (474) T ss_pred HHHHHHHHHHHHHHhCCccccccccccc--chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCC----- Confidence 7788889999999999998776533222 244445555555666666677778888888888888876532111 Q ss_pred EecCceeecCHhHhcCcceEEEecccccH--HHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCChhHHHHhhhccC Q lcl|NC_019423. 536 ITNEQYVEIKREDLKGNFDIEVDINTAEI--DNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMPDLAHELRTWQP 613 (756) Q Consensus 536 I~g~~~v~i~~d~~~~~~Dv~V~~g~a~~--~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~~~~~~l~~~~~ 613 (756) ... .++ .+|.+.-..... ....++.+ ..+...++.+.+.. 
+ ++...++..-+++ T Consensus 389 -----~~~---~~~---~~i~~~f~~~~p~d~~e~a~~~----~kl~g~iS~et~~~----~--l~~v~d~~~E~er--- 444 (474) T protein:vir:10 389 -----LDD---DSY---LNLIFKFTRNIPVNKLEESQVL----INLKGQVSERTRLG----Q--SQLVDDVDYELDE--- 444 (474) T ss_pred -----CCc---ccc---ccceEEeCCCCCCCHHHHHHHH----HHHhccCchHHHHH----h--CCCCCCHHHHHHH--- Confidence 000 011 123333332222 11222222 22222233222111 1 1112222111111 Q ss_pred CCChhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 614 QPDPMEEQLKQLAIQKAQLENEELQSKIALNNAKAKEAASSGD 656 (756) Q Consensus 614 q~~p~~~~~~q~~~~~aq~e~~~~qa~a~~~~a~a~~~~aq~~ 656 (756) ++.++.+...... .........+....+-+ T Consensus 445 -----------i~~E~~e~~~~~~--~~~~~~~~~~~~~~~s~ 474 (474) T protein:vir:10 445 -----------MEKESLEFNDKLP--DIDEGDANDKSQNNQSE 474 (474) T ss_pred -----------HHHHHHHHHhhcc--cccCCCcCCCCccccCC Confidence 1111100000000 00000000000000000 No 89 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=99.51 E-value=1.5e-13 Score=90.86 Aligned_cols=469 Identities=8% Similarity=0.018 Sum_probs=198.9 Q ss_pred ccCCCCCCCcccccccc-------CCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCC--C---------- Q lcl|NC_019423. 4 QDTFKPLPDPAQSEKLT-------DWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAK--P---------- 64 (756) Q Consensus 4 ~~~~~~~~~~~~~~~~~-------~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~--~---------- 64 (756) +-.+-|.+-.. ..++- +-..+..+..|. .....| ...+.++..+||.|.-.-. 
+ T Consensus 1 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~i~----~~i~~~--~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~ 73 (503) T protein:vir:59 1 MADIYPLGKTH-TEELNEIIVESAKEIAEPDTTMIQ----KLIDEH--NPEPLLKGVRYYMCENDIEKKRRTYYDAAGQQ 73 (503) T ss_pred CcccccCChhh-HHhHHHhhhhhhhhccchhHHHHH----HHHHhh--cHHHHHHHHHHhccccchhhccchhccccccc Confidence 22233322111 11100 001111222222 221222 2345678899998763110 0 Q ss_pred CCCCCCC--cccCHHHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHh Q lcl|NC_019423. 65 PKIKGRS--QVQPRLVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIV 142 (756) Q Consensus 65 ~~~~grS--~~v~~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al 142 (756) ...++|+ +++.+..+..|+....-| ||.+. .|. .+|.+..+ +++..+ .|+-......++++++ T Consensus 74 ~~~~~~~~~ri~~n~~~~ivd~~~~yl----~g~~~--~~~---~~d~~~~~----~l~~~~--~n~~~~~~~~~~~~~~ 138 (503) T protein:vir:59 74 LVDDTKTNNRTSHAWHKLFVDQKTQYL----VGEPV--TFT---SDNKTLLE----YVNELA--DDDFDDILNETVKNMS 138 (503) T ss_pred ccccccccceeecchHHHHHHHHHhhh----hcCCe--eec---cCcHHHHH----HHHHHH--hcCHHHHHHHHHHHHh Confidence 0112232 456666666666665544 55543 332 23443333 555443 3455555677899999 Q ss_pred hcCceEEEEeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccC Q lcl|NC_019423. 143 DDGTGIARIGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQT 222 (756) Q Consensus 143 ~~g~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~ 222 (756) +.|.+++.+||+. T Consensus 139 ~~G~~~~~v~~d~------------------------------------------------------------------- 151 (503) T protein:vir:59 139 NKGIEYWHPFVDE------------------------------------------------------------------- 151 (503) T ss_pred hCCeEEEEEeecC------------------------------------------------------------------- Confidence 9999988876631 Q ss_pred ceeEEEeeeeecCceeEEEechhheEe--CCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhh Q lcl|NC_019423. 
223 GVTEVEVEKALVNRPTVEMLNPNNVVI--DPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDH 300 (756) Q Consensus 223 g~~~~~~~~~~~g~~~ie~V~p~~~~~--Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~ 300 (756) .|++++..++|.+++. |+... ....++ .+.|.+... T Consensus 152 -----------dg~~~i~~~~p~~~~~i~d~~~~---~~~~~~-ir~~~~~~~--------------------------- 189 (503) T protein:vir:59 152 -----------EGEFDYVIFPAEEMIVVYKDNTR---RDILFA-LRYYSYKGI--------------------------- 189 (503) T ss_pred -----------CCceEEEEEccceeEEEEeCCCC---CceEEE-EEEEEEecC--------------------------- Confidence 1346778888888764 44321 122222 233322100 Q ss_pred hccccccccccccccceEEEEEEEEEe-----eccCCceeEEEEEE-EECCEEEEecccccCCCccceEEeeeeeecCcc Q lcl|NC_019423. 301 ESKTPSDFQFKDALRKKVVAYEYWGFY-----DINDDGSLEPIVAT-WIGSTLIRMENNPFPDGKLPLVVVPYMPRKREL 374 (756) Q Consensus 301 ~~~~~~~~~~~d~s~~~V~v~E~w~k~-----d~~~~g~~~~~~~~-~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~ 374 (756) ....+..+|+|... ...+.+........ ......+.....|...+.+||+.+. ++. T Consensus 190 -------------~~~~~~~~evy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPiv~~~-----nn~ 251 (503) T protein:vir:59 190 -------------MGEETQKAELYTDTHVYYYEKIDGVYQMDYSYGENNPRPHMTKGGQAIGWGRVPIIPFK-----NNE 251 (503) T ss_pred -------------CCceEEEEEEEeCCcEEEEEEcCCcccccccccccccccceeecceeccCCccceEEec-----CCC Confidence 00012233333211 11111100000000 0000011122234445677777664 345 Q ss_pred cCCchHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhhhhccccccccccccccccccccccCCCcchH Q lcl|NC_019423. 375 FGEADAELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQGNPSQSIMEHKFPELPQS 454 (756) Q Consensus 375 ~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~ 454 (756) +|.|.+..++++++.+|..++.+.+.+...+++.+.+.-.-.+...+....... ...+.....+.+.++....-... 
T Consensus 252 ~~~sd~~~~~~liDa~d~~~s~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~l~~~~~~~~ 328 (503) T protein:vir:59 252 EMVSDLKFYKDLIDNYDSITSSTMDSFSDFQQIVYVLKNYDGENPKEFTANLRY---HSVIKVSGDGGVDTLRAEIPVDS 328 (503) T ss_pred CCCcchhhhHHHHHHHHHHHHHHHHHHHHhcCCeeEeecCCccccchhhhhhhc---ccceeccCCCcceeEeccCCHHH Confidence 789999999999999999999999999999988776542111111111111111 11122233444566554444456 Q ss_pred HHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEE Q lcl|NC_019423. 455 AIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVV 534 (756) Q Consensus 455 ~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~i 534 (756) ....++.+.+.+...+++++.+.+.-++.. ++.++...............+.|..+++++.++++.++....... T Consensus 329 ~~~~~~~l~~~i~~~s~~p~~~~~~~~~~~--Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~--- 403 (503) T protein:vir:59 329 AAKELERIQDELYKSAQAVDNSPETIGGGA--TGPALENLYALLDLKANMAERKIRAGLRLFFWFFAEYLRNTGKGD--- 403 (503) T ss_pred HHHHHHHHHHHHHHHhcccCCCcccccccc--cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcc--- Confidence 667788888889888888876544322222 333344444455555555666677777777777777665332210 Q ss_pred EEecCceeecCHhHhcCcceEEEecccccH--HHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCChhHHHHhhhcc Q lcl|NC_019423. 535 RITNEQYVEIKREDLKGNFDIEVDINTAEI--DNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMPDLAHELRTWQ 612 (756) Q Consensus 535 RI~g~~~v~i~~d~~~~~~Dv~V~~g~a~~--~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~~~~~~l~~~~ 612 (756) + ....+|.|.-..... ....++.+..+.+. ..++.+.+. 
.+ ++..++...-+++ T Consensus 404 ------~--------~~~~~i~i~f~~~~p~d~~~~~~~~~kl~~~--GiiS~et~l----~~--l~~v~d~~~E~~r-- 459 (503) T protein:vir:59 404 ------F--------NPDKELTMTFTRTRIQNDSEIVQSLVQGVTG--GIMSKETAV----AR--NPFVQDPEEELAR-- 459 (503) T ss_pred ------c--------ccccceeEEeCCCCCCCHHHHHHHHHHHHhC--CCCchHHHH----Hh--CCCCCCHHHHHHH-- Confidence 0 011123333222221 22222222222211 122322211 11 1112111111111 Q ss_pred CCCChhhhhHHHHHHHHHHHHHHHHHHHHHHHHHH-H--HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 613 PQPDPMEEQLKQLAIQKAQLENEELQSKIALNNAK-A--KEAASSGDLKDLDYLEQESGTKHAR 673 (756) Q Consensus 613 ~q~~p~~~~~~q~~~~~aq~e~~~~qa~a~~~~a~-a--~~~~aq~~~~~~~~~~q~~~~k~~~ 673 (756) + +.+.+. ..+......... . ...+........+ -.+..++. T Consensus 460 ------------i---~~E~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~g~~~ 503 (503) T protein:vir:59 460 ------------I---EEEMNQ-YAEMQGNLLDDEGGDDDLEEDDPNAGAAE----SGGAGQVS 503 (503) T ss_pred ------------H---HHHHHH-HHhhhccccCccCCCCCCCcCCCCCCccc----CCCCCCcC Confidence 1 000000 000000000000 0 0000000000000 00000000 No 90 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=99.50 E-value=9.9e-13 Score=86.39 Aligned_cols=472 Identities=9% Similarity=0.028 Sum_probs=206.0 Q ss_pred CCcccCCCCCCCcccccccc-CCCc-hHHHHHHHHHHHHHHHHhhHHH-HHHHHHHHHhccccCC----CCCCCCCC--C Q lcl|NC_019423. 1 MEHQDTFKPLPDPAQSEKLT-DWKK-EPSIQLLKGDLESAKPAHDAIM-SQIREWNDLMEVKGKA----KPPKIKGR--S 71 (756) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~-~~~~-~~~~~~l~~~~~~a~~~~~~~~-~~~~~~~~~y~~~~~~----~~~~~~gr--S 71 (756) ..++.+|.= .+...++ .++. +..+..-..++......|.... .+.++..+||.|.-.- .....+++ . 
T Consensus 13 ~~~~~~~~~----~~~~~~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~ 88 (512) T protein:vir:97 13 LRENRNYLF----NDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADN 88 (512) T ss_pred eeeCceeee----ccccccccccCchhhhhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcccccccCcc Confidence 333444321 1111121 1111 0111111133444444554443 3466788999876321 11112233 4 Q ss_pred cccCHHHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEE Q lcl|NC_019423. 72 QVQPRLVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARI 151 (756) Q Consensus 72 ~~v~~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~ 151 (756) +++.+.....|+....-| +|.+.- |.+ +|.+. .++++-++. .|+--.....+.+++++.|.+.+.+ T Consensus 89 ki~~n~~k~Ivd~~~~yl----~g~p~~--~~~---~d~~~----~~~l~~~~~-~n~~~~~~~~~~~~~~i~G~ay~~v 154 (512) T protein:vir:97 89 RVAHDYASYISDFINGYF----LGNPIQ--CQD---DDKDV----LEAIEAFND-LNDVESHNRSLGLDLSIYGKAYELM 154 (512) T ss_pred eeecchHHHHHHHHhhhh----cccCce--ecc---CChHH----HHHHHHHHh-hcCHHHHHHHHHHHHHhcCeEEEEE Confidence 577777777776665444 554333 322 33332 234554433 3444445667889999999887776 Q ss_pred eeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeee Q lcl|NC_019423. 152 GWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEK 231 (756) Q Consensus 152 ~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~ 231 (756) |++. T Consensus 155 y~de---------------------------------------------------------------------------- 158 (512) T protein:vir:97 155 IRNQ---------------------------------------------------------------------------- 158 (512) T ss_pred EeCC---------------------------------------------------------------------------- Confidence 5420 Q ss_pred eecCceeEEEechhheE--eCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhcccccccc Q lcl|NC_019423. 
232 ALVNRPTVEMLNPNNVV--IDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQ 309 (756) Q Consensus 232 ~~~g~~~ie~V~p~~~~--~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 309 (756) .|++++..++|.+++ ||+.... ..-+ +.|.|.+.. .+ T Consensus 159 --d~~~~i~~~~p~~~~~iyd~~~~~---~~~~-~vr~~~~~~-----------~~------------------------ 197 (512) T protein:vir:97 159 --DDETRLYKSDAMSTFVIYDNTIER---NSIA-GVRYLRTKP-----------ID------------------------ 197 (512) T ss_pred --CCceEEEEEcccceEEEEcCCCCC---ceEE-EEEEEEeee-----------cc------------------------ Confidence 134677888998876 4544422 1222 233332210 00 Q ss_pred ccccccceEEEEEEEEEeeccCCceeEEEEEEEECCEE-----EEecccccCCCccceEEeeeeeecCcccCCchHHHhH Q lcl|NC_019423. 310 FKDALRKKVVAYEYWGFYDINDDGSLEPIVATWIGSTL-----IRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLG 384 (756) Q Consensus 310 ~~d~s~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g~~~-----L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~ 384 (756) +.....+..+|+|.. +++ +++...++.. ....+.|.+.+.+|++.+. ++.+|.|.+..++ T Consensus 198 --~~~~~~~~~~~vyt~-----~~i---~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----nn~~~~gd~e~v~ 262 (512) T protein:vir:97 198 --KTDEDEVFTVDLFTS-----HGV---YRYLTSRTNGLKLTPRENGFESHSFERMPITEFS-----NNERRKGDYEKVI 262 (512) T ss_pred --ccccceEEEEEEEeC-----CcE---EEEEecCCCcccccccccccccccCcccceEeec-----CCCCCCCchhhhH Confidence 000112344555532 111 1222222111 1112234445667777654 3456889999999 Q ss_pred HHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhhhhc-cccc---------cccccccccccccccccCCCcchH Q lcl|NC_019423. 385 DNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDD-GQDY---------EYNPMQGNPSQSIMEHKFPELPQS 454 (756) Q Consensus 385 d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~-~~~~---------~~~~~~~~~~~~i~~~~~~~~~~~ 454 (756) ++++.+|..+|.+.+.+...+++.+++.-............. .... .........+..+.++..+.-... 
T Consensus 263 ~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~ 342 (512) T protein:vir:97 263 TLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQG 342 (512) T ss_pred HHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCchhhhhhhhcccccccccchhhcccccCCCCCcceEEEeecCCHHH Confidence 999999999999999998888877654321111111111000 0000 000001112233445544444456 Q ss_pred HHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEE Q lcl|NC_019423. 455 AIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVV 534 (756) Q Consensus 455 ~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~i 534 (756) ....+..+...+-..|++++.+.|.-++.. ++.++...............+.|..++++++++++.++....... T Consensus 343 ~e~~~~~L~~~I~~~s~~p~~~~~~~~gn~--Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~li~~~~~~~~~~~--- 417 (512) T protein:vir:97 343 TEAYKDRLNSDIHMFTNTPNMKDDNFSGTQ--SGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSID--- 417 (512) T ss_pred HHHHHHHHHHHHHHHhCCcccCcccccccc--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcc--- Confidence 667788899999999999988766432222 333455555566666666777788888888888777764332110 Q ss_pred EEecCceeecCHhHhcCcceEEEecccccHH--HHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCChhHHHHhhhcc Q lcl|NC_019423. 535 RITNEQYVEIKREDLKGNFDIEVDINTAEID--NQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMPDLAHELRTWQ 612 (756) Q Consensus 535 RI~g~~~v~i~~d~~~~~~Dv~V~~g~a~~~--~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~~~~~~l~~~~ 612 (756) . +.++. +|.+.-.++... ...++. +..+...++.+.+. 
.+ ++...+...-++ T Consensus 418 -------~---~~d~~---~i~~~f~~~~p~~~~e~~~~----~~kl~giiS~et~~----~~--l~~v~d~~~E~e--- 471 (512) T protein:vir:97 418 -------A---NKDFN---TVRYVYNRNLPKSLIEELKA----YIDSGGKISQTTLM----SL--FSFFQDPELEVK--- 471 (512) T ss_pred -------c---ccccc---cceEEeCCCCCcCHHHHHHH----HHHHhccCchHHHH----Hh--CCCCCCHHHHHH--- Confidence 0 01111 233333322221 112221 22222223332211 11 111222111111 Q ss_pred CCCChhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 613 PQPDPMEEQLKQLAIQKAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARDMEKQKA 680 (756) Q Consensus 613 ~q~~p~~~~~~q~~~~~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~~~~~~ 680 (756) +++.++.+ .+ +.. + .....+.......+..-+.+.. ..+.. T Consensus 472 -----------ri~~E~~~----~~--~~~----~---~~~~~~~~~~~~~~~~~~~~~~---~~~~~ 512 (512) T protein:vir:97 472 -----------KIEEDEKE----SI--KKA----Q---KGIYKDPRDINDDEQDDDTKDT---VDKKE 512 (512) T ss_pred -----------HHHHHHHH----HH--HHH----h---hcccCCCCCCCCCCCCCCcccc---ccccC Confidence 11110000 00 000 0 0000000000000000000000 00000 No 91 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=99.50 E-value=1.8e-12 Score=84.92 Aligned_cols=434 Identities=8% Similarity=0.003 Sum_probs=198.5 Q ss_pred CchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCC---C------CCCCCCCC--cccCHHHHHHHHHHHHHHHH Q lcl|NC_019423. 23 KKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKA---K------PPKIKGRS--QVQPRLVRRQAEWRYAPLSE 91 (756) Q Consensus 23 ~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~---~------~~~~~grS--~~v~~~v~~~~e~~~~~L~~ 91 (756) .+.+. +....+.|.....+..+..+||.|.-.- . 
.....+++ +++.+..+..|+....-| T Consensus 1 l~~~~-------i~~~i~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~yl-- 71 (451) T protein:vir:10 1 MELEK-------IRAIISADAARRQEILQAKSYYYNKNDILKKGVVVQNRDENPLRNADNRISHNFHEILVDEKASYM-- 71 (451) T ss_pred CCHHH-------HHHHHHHHHHHHHHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHhhhhhe-- Confidence 23222 3334445666666678889999985210 0 00111222 566777777777665544 Q ss_pred hhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeeeeeeeeeecCC Q lcl|NC_019423. 92 PFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIKTETPVFQLYP 171 (756) Q Consensus 92 ~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~ 171 (756) ||.+.-+.. .+|.+..+ .+++.+ .|+--.....+.++++.+|.|...+|++.+.. T Consensus 72 --~G~p~~~~~----~~~~~~~~----~~~~~~--~n~~~~~~~~~~~~~~~~G~a~~~~y~de~~~------------- 126 (451) T protein:vir:10 72 --FTYPVLFDI----DNNKELNE----KVTDVL--GNEFTRKAKNLAIEASNCGSAWLHYWIDEEYS------------- 126 (451) T ss_pred --ecccceeec----CCcHHHHH----HHHHHh--ccCHHHHHHHHHHHHhhcCeEEEEEeecCCcc------------- Confidence 666643332 23333333 444432 23333444567899999999988877642210 Q ss_pred CCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEEEechhheEe-- Q lcl|NC_019423. 172 IENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVEMLNPNNVVI-- 249 (756) Q Consensus 172 ~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie~V~p~~~~~-- 249 (756) ......|.+++..++|.++++ T Consensus 127 ---------------------------------------------------------~~~~~~~~~~~~~i~p~~~~~vy 149 (451) T protein:vir:10 127 ---------------------------------------------------------GEQVTNQTFKYGVVNTEEIIPIY 149 (451) T ss_pred ---------------------------------------------------------cccccccceeEEEEcccceEEEE Confidence 001123567888999999864 Q ss_pred CCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceEEEEEEEEEeec Q lcl|NC_019423. 
250 DPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVAYEYWGFYDI 329 (756) Q Consensus 250 Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~E~w~k~d~ 329 (756) |..... +..+.+ |.|....+. . .......+..+|+|.. T Consensus 150 dd~~~~---~~~~~i-r~~~~~~~~-------------~----------------------~~~~~~~~~~~e~yt~--- 187 (451) T protein:vir:10 150 RNGIER---ELEAVI-RYYIQLEDV-------------K----------------------GQIQKQAYTYVEFWTD--- 187 (451) T ss_pred cCCCCC---ceEEEE-EEEEeeecc-------------c----------------------ccccceEEEEEEEEeC--- Confidence 433221 222332 333221100 0 0001112334455532 Q ss_pred cCCceeEEEEE--EEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_019423. 330 NDDGSLEPIVA--TWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGATMRGMIDLLGRSANG 407 (756) Q Consensus 330 ~~~g~~~~~~~--~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~ 407 (756) +++..+... -..++.++ ...-|...|.+|++.++. +..|.|.+..++++++.+|.++|...+.+.-.+++ T Consensus 188 --~~~~~~~~~~~~~~~~~~~-~~~~~~~~g~vPvv~~~n-----n~~~~~d~e~v~~liDa~~~~~S~~~~~~~~~~~~ 259 (451) T protein:vir:10 188 --KILDKYKFFGVSCCGSQIE-HITVQHRFNSVPFVEFSN-----NIKKQSDLSKYKKILDLYDRVMSGFANDLEDIQQI 259 (451) T ss_pred --CeEEEEEecccCccccccc-cccccCCCCeeeEEEecc-----CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhccc Confidence 111111110 01122222 122233345667665543 44678999999999999999999999999999988 Q ss_pred ceEeeccccCccchhhhhcccccccccc--ccccccccccccCCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccc Q lcl|NC_019423. 408 QRGYPKGMLDTLNRRRYDDGQDYEYNPM--QGNPSQSIMEHKFPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYG 485 (756) Q Consensus 408 ~~~~~~gav~~~~~~~~~~~~~~~~~~~--~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~ 485 (756) .+++.-...+...+.............. 
.....+.+.++..+.-..+....+..+.+.+-..|++++.+.+..|+ T Consensus 260 ~l~~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn--- 336 (451) T protein:vir:10 260 IYILENFGGEDTSEFLKELKRYKTIKTETDSEGDSGGLKTMQIEIPTEARKIILEILKKQIYESGQGLQQDTENFGN--- 336 (451) T ss_pred eeeeecCCcccchhhHHHHhhCCeEEecCcCCccCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccc--- Confidence 7655421111111111111111111100 01122345666555445677778999999999999998765543332 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHhcCcceEEEecccccHH Q lcl|NC_019423. 486 DVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDLKGNFDIEVDINTAEID 565 (756) Q Consensus 486 ~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~~~~~Dv~V~~g~a~~~ 565 (756) .++.++..+...........-+.|..++++++++++.++..+ ++ .+|.|.-...... T Consensus 337 ~Sg~Alk~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~------------d~-----------~~i~i~f~~~~p~ 393 (451) T protein:vir:10 337 ASGVALKFFYRKLELKSGLLETEFRTSFDKLIKAILYFLGVT------------DY-----------KKIQQTYTRNMMS 393 (451) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC------------Cc-----------cceeEEecCCCCC Confidence 233334444445555555566667777776666666554210 11 1222222222211 Q ss_pred HHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCChhHHHHhhhccCCCChhhhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 566 NQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMPDLAHELRTWQPQPDPMEEQLKQLAIQKAQLENEELQSKIALNN 645 (756) Q Consensus 566 ~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~~~~~~l~~~~~q~~p~~~~~~q~~~~~aq~e~~~~qa~a~~~~ 645 (756) .. .+.+.++..+...++.+.+. . .++.. .++.+. .+....++.. +. + T Consensus 394 n~--~e~~~~~~kl~g~iS~et~~----~--~~p~v-------------~d~~~e-~~~~~ee~~~-~~---~------- 440 (451) T protein:vir:10 394 ND--LEDADIATKSVGIIPTKIIL----R--HHPWV-------------DDVEEA-EKLYLEEKKI-QA---S------- 440 (451) T ss_pred CH--HHHHHHHHHHhccCchHHHH----H--hCCCC-------------CCHHHH-HHHHHHHHHH-HH---H------- Confidence 11 11112222221222221111 1 11111 122110 0001000000 00 0 Q ss_pred HHHHHHHHHHHHHHHHH Q lcl|NC_019423. 
646 AKAKEAASSGDLKDLDY 662 (756) Q Consensus 646 a~a~~~~aq~~~~~~~~ 662 (756) +.+...-.+.. T Consensus 441 ------~~~~~~~~~~~ 451 (451) T protein:vir:10 441 ------KVSDDYNNFTE 451 (451) T ss_pred ------HHHhhcCCCCC Confidence 00000000100 No 92 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=99.50 E-value=1.4e-12 Score=85.64 Aligned_cols=457 Identities=8% Similarity=0.037 Sum_probs=201.6 Q ss_pred CCcccCCCCCCCccccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCC---C--CCCCCCC--Ccc Q lcl|NC_019423. 1 MEHQDTFKPLPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKA---K--PPKIKGR--SQV 73 (756) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~---~--~~~~~gr--S~~ 73 (756) =.++-++ -.++.+.+++-+.+...|..... ....+.++..+||.|.-.. . ....+++ .++ T Consensus 7 ~~~~~~~------~~~~~~~~l~~~~i~~li~~~~~-------~~~~r~~~l~~YY~g~~~~i~~~~~~~~~~~~~~~ki 73 (506) T protein:vir:94 7 EHKQANL------IYQESLENLTPNKIMKFITHHFN-------YQRPRLEMLDDYYQGYNLKILDKQSRRHEDGKADHRA 73 (506) T ss_pred hhhccee------ecccchhcCCHHHHHHHHHHHHH-------HHHHHHHHHHHHhcCCCccccccccccccccCCccee Confidence 1111111 11122333444555544433221 1234566788999986321 1 1112344 356 Q ss_pred cCHHHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEee Q lcl|NC_019423. 74 QPRLVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGW 153 (756) Q Consensus 74 v~~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w 153 (756) +.+..+..|+....-| ||.+ +.|.+ +|.. ..+.++.++. 
.|+--.....+.++++++|.+.+.+|| T Consensus 74 ~~n~~~~Iv~~~~~~l----~G~p--~~~~~---~d~~----~~~~l~~~~~-~N~~~~~~~~~~~~~~~~G~a~~~v~~ 139 (506) T protein:vir:94 74 THSFAKYIADFQTSYS----VGNP--INVKL---PDDG----SNSGFDTFNK-ANDVDAENYDLFLDMSRYGRAYEYVYR 139 (506) T ss_pred ecchHHHHHHHhhhhh----cccC--ceeec---Ccch----HHHHHHHHHh-ccCHhHHHHHHHHHHHhcCeEEEEEEe Confidence 7777777777766555 6654 33433 2322 2345665443 444444466788999999999888765 Q ss_pred eeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeee Q lcl|NC_019423. 154 ERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKAL 233 (756) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~ 233 (756) +. T Consensus 140 de------------------------------------------------------------------------------ 141 (506) T protein:vir:94 140 GE------------------------------------------------------------------------------ 141 (506) T ss_pred cC------------------------------------------------------------------------------ Confidence 31 Q ss_pred cCceeEEEechhheEe--CCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhcccccccccc Q lcl|NC_019423. 234 VNRPTVEMLNPNNVVI--DPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFK 311 (756) Q Consensus 234 ~g~~~ie~V~p~~~~~--Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 311 (756) .|.+++..++|.++++ |.... ....+ +.+.|..... + . T Consensus 142 d~~~~i~~~~p~~~~~v~dd~~~---~~~~~-~v~~~~~~~~-----------~-------------------------~ 181 (506) T protein:vir:94 142 DNEEHLAKLDPLDTFVIYSTDVD---PKPIM-AVRYHQIELV-----------D-------------------------D 181 (506) T ss_pred CCeeEEEEEcccceEEEecCCCC---CceEE-EEEEEeeeec-----------c-------------------------C Confidence 1336677788888754 43322 11222 3333322100 0 0 Q ss_pred ccccceEEEEEEEEEeeccCCceeEEEEEEEEC----CEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHH Q lcl|NC_019423. 
312 DALRKKVVAYEYWGFYDINDDGSLEPIVATWIG----STLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQ 387 (756) Q Consensus 312 d~s~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g----~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q 387 (756) ......+.++|+|.. .++.++.+ ..+....++| .+.+|++.++. +..|.|.+..+++++ T Consensus 182 ~~~~~~~~~~~~yt~----------~~~~~~~~~~~~~~~~~~~~~~--~g~vPvv~~~n-----~~~~~sd~e~~~~li 244 (506) T protein:vir:94 182 NQVSTINYVPETWTA----------DTYTLYNPTPIMGKMQVDTTKP--ITTFPVVEFKN-----SNFRLGDFENVLPLI 244 (506) T ss_pred CceeEEEEEEEEEeC----------ceEEEeccccCccceecccccc--CCccceEEecC-----CCCCCCchhhhHHHH Confidence 000011223333321 11222221 2222223333 46678776644 335789999999999 Q ss_pred HHHHHHHHHHHHHHHhhcCCceEeeccccCccc---------------------hhhh---hcccccccc------cccc Q lcl|NC_019423. 388 AILGATMRGMIDLLGRSANGQRGYPKGMLDTLN---------------------RRRY---DDGQDYEYN------PMQG 437 (756) Q Consensus 388 ~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~---------------------~~~~---~~~~~~~~~------~~~~ 437 (756) +.+|..+|.+.+.+.-.+++.+++.-....... .... .......+. .... T Consensus 245 Da~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 324 (506) T protein:vir:94 245 DLYDAAQSDTANYMTDLNEAMLIIQGDIDTLFEGSDMMNTIDPNDEDAMAKLAKDKLELIKEMKDANMLLLKSGMTVNGT 324 (506) T ss_pred HHHHHHHHHHHHHHHHhhhHHHHHhcCccccccchhccccccccccccccccccchhHHHhhhhhcCeeeecccccccCc Confidence 999999999999887666655443211100000 0000 000000000 0000 Q ss_pred ccccccccccCCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 438 NPSQSIMEHKFPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIG 517 (756) Q Consensus 438 ~~~~~i~~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~ 517 (756) .....++++..+.-.......+..+...+-..|++++.+.+.-++. 
.++.++..+...........-+.|..+++.++ T Consensus 325 ~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n--~Sg~Aik~~~~~l~~k~~~k~~~~~~~l~~~~ 402 (506) T protein:vir:94 325 QTSVDAKYINKTYDVVGSEAYKKRVAGDIHKFSHTPDLTDENFASN--SSGVAMQYKVLGTVELASTKRRMFERGLYARY 402 (506) T ss_pred cccccceeeeecCCHHHHHHHHHHHHHHHHHHhCcccccccccccc--chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1122344444444456777788889999999999998764432222 23444555555566666666777888888888 Q ss_pred HHHHHHHHhhCCCCcEEEEecCceeecCHhHhcCcceEEEecccccHHHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHh Q lcl|NC_019423. 518 TKICAMNAVFLSEKEVVRITNEQYVEIKREDLKGNFDIEVDINTAEIDNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAE 597 (756) Q Consensus 518 ~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~~~~~Dv~V~~g~a~~~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e 597 (756) ++++.++....... .++ + .+|.|.-........ ...+.++..+...++.+... . . T Consensus 403 ~li~~~~~~~~~~~-----------~~d---~---~~i~i~f~~~~p~d~--~e~a~~~~kl~g~iS~et~~----~--~ 457 (506) T protein:vir:94 403 QIISDIENSIHGDW-----------TFD---P---QELTFTFRDNLPADN--ISQIKALVQAGATLPQKYLY----Q--Q 457 (506) T ss_pred HHHHHHHHhcCCcc-----------ccc---c---ccceEEeCCCCCcCH--HHHHHHHHHHhccCChHHHH----H--h Confidence 88888775432110 000 0 123333333222211 11122222222333332221 1 1 Q ss_pred hcCChhHHHHhhhccCCCChhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 598 LKRMPDLAHELRTWQPQPDPMEEQLKQLAIQKAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARDME 676 (756) Q Consensus 598 ~~~~~~~~~~l~~~~~q~~p~~~~~~q~~~~~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~~ 676 (756) +++..+...-++ ++..++.+... . ........+..... +..-...+++. 
T Consensus 458 lp~v~d~~~E~~--------------ri~~E~~~~~~--~----------~~~~~~~~~~~~~~----~~~~~~~~e~~ 506 (506) T protein:vir:94 458 LPGVTNPQDIVD--------------MMKEQSANGDY--S----------FDQNGVISNDGQTN----TTATQTDEEVR 506 (506) T ss_pred CCCCCCHHHHHH--------------HHHHHHHHHhh--c----------chhhcCCCcccCcc----ccccccccCCC Confidence 122222111111 11100000000 0 00000000000000 00000000111 No 93 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=99.49 E-value=5.4e-12 Score=82.34 Aligned_cols=468 Identities=11% Similarity=0.053 Sum_probs=206.6 Q ss_pred CCcccCCCCCC----------Ccc-----ccccccCCCchHHHHHHHHHHHHHHHHhhHHH-HHHHHHHHHhccccC-C- Q lcl|NC_019423. 1 MEHQDTFKPLP----------DPA-----QSEKLTDWKKEPSIQLLKGDLESAKPAHDAIM-SQIREWNDLMEVKGK-A- 62 (756) Q Consensus 1 ~~~~~~~~~~~----------~~~-----~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~-~~~~~~~~~y~~~~~-~- 62 (756) |-.+-+|.=-. .++ ..+.++++... ... .+......|.... .+.++..+||.|.-. . T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~----~i~~~i~~h~~~~~~rl~~l~~yY~g~~~~i~ 75 (502) T protein:vir:48 1 MMEQTLFTDSTGQDLVLNLRFHRESRIRYRADNLEELMVN-NWE----LLKNFINHHKLRQAPRIQELLDYARGENHDVL 75 (502) T ss_pred CceeEEEEecchhHHHhhcccChhHHhhhcccchhhhccc-cHH----HHHHHHHHHHHHHHHHHHHHHHHhcCCCcccc Confidence 33333321100 000 11111111110 111 1333444455443 456789999998521 1 Q ss_pred --CCCCCCCCC--cccCHHHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHH Q lcl|NC_019423. 63 --KPPKIKGRS--QVQPRLVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYV 138 (756) Q Consensus 63 --~~~~~~grS--~~v~~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v 138 (756) ...+.++++ +++.+-....|+....-| +|.+.-+. .... +.-+...++++.++. 
.|+--.....++ T Consensus 76 ~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl----~g~p~~~~--~~d~---~~~~~~~~~l~~~~~-~N~~~~~~~~~~ 145 (502) T protein:vir:48 76 KSGRRKDNEMADKRAVHNYGRMISKFKTGYL----AGNPIRVE--YDDN---EDNSQNDDAIKRIGR-INDIDTHNRNLI 145 (502) T ss_pred ccccccccccccceeecchHHHHHHHHhhhh----cccCeeEe--cCCc---cchhHHHHHHHHHHh-hcCHhHHHHHHH Confidence 111223443 677777777777666555 55554333 3222 222345556665543 344344456789 Q ss_pred HHHhhcCceEEEEeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcce Q lcl|NC_019423. 139 HSIVDDGTGIARIGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATY 218 (756) Q Consensus 139 ~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~ 218 (756) +++++.|.+.+.+|++. T Consensus 146 ~~~~~~G~a~~~v~~de--------------------------------------------------------------- 162 (502) T protein:vir:48 146 RDLSQTGRAYEVIYRSE--------------------------------------------------------------- 162 (502) T ss_pred HHHhhcCeEEEEEEeCC--------------------------------------------------------------- Confidence 99999999877765421 Q ss_pred eccCceeEEEeeeeecCceeEEEechhheEe--CCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhh Q lcl|NC_019423. 219 AIQTGVTEVEVEKALVNRPTVEMLNPNNVVI--DPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPIT 296 (756) Q Consensus 219 ~~~~g~~~~~~~~~~~g~~~ie~V~p~~~~~--Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~ 296 (756) .|.+++..++|.++++ |+.... +..+. .+.|.... T Consensus 163 ---------------dg~~~i~~~~p~~~~~vydd~~~~---~~~~~-ir~~~~~~------------------------ 199 (502) T protein:vir:48 163 ---------------YDETRIKRLSPLETFVIYDNSLED---NSIAA-VRYYNRGT------------------------ 199 (502) T ss_pred ---------------CCceEEEEEcccceEEEEcCCCCC---ceEEE-EEEEEEee------------------------ Confidence 1336677788888754 433221 12222 22221100 Q ss_pred chhhhccccccccccccccceEEEEEEEEEeeccCCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccC Q lcl|NC_019423. 
297 DPDHESKTPSDFQFKDALRKKVVAYEYWGFYDINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFG 376 (756) Q Consensus 297 ~~~~~~~~~~~~~~~d~s~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G 376 (756) . ...+.++|+|..- . .+++...|+.. .....|...|.+|++.++ ++.+| T Consensus 200 ----------------~-~~~~~~~~iyt~~-----~---i~~~~~~~~~~-~~~~~~~~~g~vPvv~~~-----nn~~g 248 (502) T protein:vir:48 200 ----------------L-QNAKDVVEIYTNQ-----H---IYTLDASDSFN-EISVTPHAFGTVPITEFL-----NNADG 248 (502) T ss_pred ----------------c-CCcEEEEEEEeCC-----e---EEEEEeCCcee-eccceecCCCccceEEec-----CCCCC Confidence 0 0013345555421 1 11222222222 223334444678887664 34578 Q ss_pred CchHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhhhhcccccccc------ccccccccccccccCCC Q lcl|NC_019423. 377 EADAELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYN------PMQGNPSQSIMEHKFPE 450 (756) Q Consensus 377 ~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~------~~~~~~~~~i~~~~~~~ 450 (756) .|.+..++++++.+|..++.+.+.+...+.+.+.+.-......+............. ..+......+.++..+. T Consensus 249 ~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~ 328 (502) T protein:vir:48 249 IGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPQGMQASDMKRTRLMQLKPPKSADGKEGTVKAEYLTKSY 328 (502) T ss_pred CCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCcccccccchhhhhhcceeeccccccccccccCcceeEeeecC Confidence 999999999999999999999999998888876654322222111111110111100 00111223345554444 Q ss_pred cchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCC Q lcl|NC_019423. 451 LPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSE 530 (756) Q Consensus 451 ~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~ 530 (756) -.......+..+.+.+-..|++++.+.+.-++.. ++.++...............+.|..++++++++++.++...... 
T Consensus 329 ~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~~n~--Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~ 406 (502) T protein:vir:48 329 DVSGAEAYKTRLNKDIHVFTNTPDMSDNHFSGNA--SGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEF 406 (502) T ss_pred CHHHHHHHHHHHHHHHHHHhCCCCcCccccccCc--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc Confidence 3456666788899999999999987765432222 33334444445555566666777778877777777766432111 Q ss_pred CcEEEEecCceeecCHhHhcCcceEEEecccccHHHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcC-ChhHHHHhh Q lcl|NC_019423. 531 KEVVRITNEQYVEIKREDLKGNFDIEVDINTAEIDNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKR-MPDLAHELR 609 (756) Q Consensus 531 ~r~iRI~g~~~v~i~~d~~~~~~Dv~V~~g~a~~~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~-~~~~~~~l~ 609 (756) . .+ ++ .+|.|.-.+...... ...++++..+...++.+.+ ++..+ ..+...-++ T Consensus 407 ~-----------~~---d~---~~i~i~f~~~~p~d~--~e~a~~~~kl~g~iS~et~-------l~~l~~v~D~~~E~~ 460 (502) T protein:vir:48 407 K-----------DF---DE---SRLKITFTPNLPKSL--YEQVSILNDLGGQVSQETA-------LSLSGLVENPTEELD 460 (502) T ss_pred c-----------cc---cc---ccceEEeCCCCCcCH--HHHHHHHHHHhccCcHHHH-------HHhCCCCCCHHHHHH Confidence 0 00 00 112232222222111 1111222222222332221 12111 112111111 Q ss_pred hccCCCChhhhhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHH-HHHHH-HHHHHHHHH Q lcl|NC_019423. 610 TWQPQPDPMEEQLKQLAIQKAQLENEELQSKIALN-NAKAKE-AASSG-DLKDLDYLE 664 (756) Q Consensus 610 ~~~~q~~p~~~~~~q~~~~~aq~e~~~~qa~a~~~-~a~a~~-~~aq~-~~~~~~~~~ 664 (756) ++ +.++.+++........... ...... ..... 
+...+ -+ T Consensus 461 ri--------------~~E~~~~~~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~--~~ 502 (502) T protein:vir:48 461 KI--------------NEESSKIDFKGYPSYFYDNVGKYTDEVKETHTDDFERV--YE 502 (502) T ss_pred HH--------------HHHHHhhhhhcccccccccccccCCCccCCCCcCcCCC--CC Confidence 11 1111000000000000000 000000 00000 00000 00 No 94 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=99.49 E-value=7.1e-13 Score=87.18 Aligned_cols=473 Identities=10% Similarity=0.021 Sum_probs=203.6 Q ss_pred CCc--------------ccCCCCCCCccccccccCCCchHHHHH-HHHHHHHHHHHhhHHH-HHHHHHHHHhccccCC-- Q lcl|NC_019423. 1 MEH--------------QDTFKPLPDPAQSEKLTDWKKEPSIQL-LKGDLESAKPAHDAIM-SQIREWNDLMEVKGKA-- 62 (756) Q Consensus 1 ~~~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~~-l~~~~~~a~~~~~~~~-~~~~~~~~~y~~~~~~-- 62 (756) |-| +..|++ . ...--.|...+.+.. ....+....+.|.+.. .+.++..+||.|.-.- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~----~-~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~ 75 (511) T protein:vir:78 1 MLKVNEFETDTDLRGNINYLFND----E-ANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLV 75 (511) T ss_pred Cccccchhhhhhhhhhhhhhhhh----h-hCCcccccchhhhhhcCHHHHHHHHHHHHHhhhHHHHHHHHHhhccCcccc Confidence 111 111211 0 000012332222111 1223444444554443 3566788999876321 Q ss_pred --CCCC--CCCCCcccCHHHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHH Q lcl|NC_019423. 63 --KPPK--IKGRSQVQPRLVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYV 138 (756) Q Consensus 63 --~~~~--~~grS~~v~~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v 138 (756) .... .+...+++.+-....|+....-| +|.+.-+ . .+|.+.. ++++-++. .|+--.....+. 
T Consensus 76 ~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl----~g~p~~~--~---~~d~~~~----~~l~~~~~-~n~~~~~~~~~~ 141 (511) T protein:vir:78 76 ELTRRKEEYMADNRVAHDYASYISDFINGYF----LGNPIQY--Q---DDDKDVL----EAIEAFND-LNDVESHNRSLG 141 (511) T ss_pred ccCcccccccCcceeecchHHHHHHHHhhhh----cccCcee--e---cCchHHH----HHHHHHHh-hcChhHHHHHHH Confidence 1111 12234577777777776665544 5544333 2 2333332 34444432 343344455788 Q ss_pred HHHhhcCceEEEEeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcce Q lcl|NC_019423. 139 HSIVDDGTGIARIGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATY 218 (756) Q Consensus 139 ~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~ 218 (756) +++++.|.+.+.+|++. T Consensus 142 ~~~~~~G~a~~~vy~d~--------------------------------------------------------------- 158 (511) T protein:vir:78 142 LDLSIYGKAYELMIRNQ--------------------------------------------------------------- 158 (511) T ss_pred HHHHhcCeeEEEEEeCC--------------------------------------------------------------- Confidence 89999888877665420 Q ss_pred eccCceeEEEeeeeecCceeEEEechhheE--eCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhh Q lcl|NC_019423. 219 AIQTGVTEVEVEKALVNRPTVEMLNPNNVV--IDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPIT 296 (756) Q Consensus 219 ~~~~g~~~~~~~~~~~g~~~ie~V~p~~~~--~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~ 296 (756) .|.+++..++|.+++ ||..... .. ..+.+.|.+.. .+ T Consensus 159 ---------------dg~~~i~~~~p~~~~~v~dd~~~~---~~-~~~vr~~~~~~-----------~~----------- 197 (511) T protein:vir:78 159 ---------------DDETRLYKSDAMSTFIIYDNTVER---NS-IAGVRYLRTKP-----------ID----------- 197 (511) T ss_pred ---------------CCceEEEEEcccceEEEEcCCCCC---ce-EEEEEEEEeee-----------cc----------- Confidence 134678888898876 4544321 12 22223332210 00 Q ss_pred chhhhccccccccccccccceEEEEEEEEEeeccCCceeEEEEEEEECCEEE-----EecccccCCCccceEEeeeeeec Q lcl|NC_019423. 
297 DPDHESKTPSDFQFKDALRKKVVAYEYWGFYDINDDGSLEPIVATWIGSTLI-----RMENNPFPDGKLPLVVVPYMPRK 371 (756) Q Consensus 297 ~~~~~~~~~~~~~~~d~s~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g~~~L-----~~~~~P~~~~~~Pfv~~~~~~~~ 371 (756) +.....+..+|+|.. +++ ++++..++..+ .....|.+.+.+|++.+. T Consensus 198 ---------------~~~~~~~~~~~vyt~-----~~i---~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~----- 249 (511) T protein:vir:78 198 ---------------KTDEDEVFTVDLFTS-----HGV---YRYLTNRTNGLKLTPRENSFESHSFERMPITEFS----- 249 (511) T ss_pred ---------------ccccceEEEEEEEeC-----CcE---EEEEecCCCcccccccccccccCcCcccceEEec----- Confidence 001123444555532 111 22222221111 112234445666776553 Q ss_pred CcccCCchHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhh-hhccccc------cccc--cccccccc Q lcl|NC_019423. 372 RELFGEADAELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRR-YDDGQDY------EYNP--MQGNPSQS 442 (756) Q Consensus 372 ~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~-~~~~~~~------~~~~--~~~~~~~~ 442 (756) ++.+|.|.+..++++++.+|..++.+.+.+...+++.+++.-......+... ....... ...+ ........ T Consensus 250 n~~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 329 (511) T protein:vir:78 250 NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYVDAEGRETEGSVD 329 (511) T ss_pred CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCCchhhcccccccceeccccceeccccccCCCCcc Confidence 3456899999999999999999999999998888876654321111111110 0000000 0000 11122333 Q ss_pred cccccCCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 443 IMEHKFPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICA 522 (756) Q Consensus 443 i~~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~ 522 (756) +.++..+.-.......+..+.+.+-..|++++.+.+.-+.. .++.++...............+.|..++++++++++. 
T Consensus 330 ~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n--~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~ 407 (511) T protein:vir:78 330 GGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGT--QSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLET 407 (511) T ss_pred eeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccccc--cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 45555444445666778888899999999998876543322 2343455555556666666677788888888888877 Q ss_pred HHHhhCCCCcEEEEecCceeecCHhHhcCcceEEEecccccHHHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCCh Q lcl|NC_019423. 523 MNAVFLSEKEVVRITNEQYVEIKREDLKGNFDIEVDINTAEIDNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMP 602 (756) Q Consensus 523 li~q~~~~~r~iRI~g~~~v~i~~d~~~~~~Dv~V~~g~a~~~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~ 602 (756) ++........ +.++ .+|.|.-..+...... + .+..+..+...++.+... .+ ++... T Consensus 408 ~~~~~~~~~~-------------~~~~---~~i~~~f~~~~p~n~~-e-~~d~~~kl~G~iS~et~l----~~--l~~v~ 463 (511) T protein:vir:78 408 ILKNTRSIDA-------------NKDF---NTVRYVYNRNLPKSLI-E-ELKAYIDSGGKISQTTLM----SL--FSFFQ 463 (511) T ss_pred HHHhcCCCcc-------------cccc---ccceEEeCCCCCcCHH-H-HHHHHHHHhccCChHHHH----Hh--CCCCC Confidence 7643321100 0011 1233333332221111 1 111122221223322211 11 11222 Q ss_pred hHHHHhhhccCCCChhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 603 DLAHELRTWQPQPDPMEEQLKQLAIQKAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARDMEKQ 678 (756) Q Consensus 603 ~~~~~l~~~~~q~~p~~~~~~q~~~~~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~~~~ 678 (756) +...-++++ ..++.. .++.. ......+.......++..+.+... .+.. 
T Consensus 464 d~~~El~ri--------------~~E~~~----~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~-~e~~ 511 (511) T protein:vir:78 464 DPELEVKKI--------------EEDEKE----SIKKA---------QKGIYKDPRDINDDEQDDDTKDTV-DKKE 511 (511) T ss_pred CHHHHHHHH--------------HHHHHH----HHHHH---------hhccccCCCCCCCCCCCCCccCcc-cccC Confidence 221111111 000000 00000 000000000000000000000000 0000 No 95 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=99.49 E-value=7.1e-13 Score=87.18 Aligned_cols=473 Identities=10% Similarity=0.021 Sum_probs=203.6 Q ss_pred CCc--------------ccCCCCCCCccccccccCCCchHHHHH-HHHHHHHHHHHhhHHH-HHHHHHHHHhccccCC-- Q lcl|NC_019423. 1 MEH--------------QDTFKPLPDPAQSEKLTDWKKEPSIQL-LKGDLESAKPAHDAIM-SQIREWNDLMEVKGKA-- 62 (756) Q Consensus 1 ~~~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~~-l~~~~~~a~~~~~~~~-~~~~~~~~~y~~~~~~-- 62 (756) |-| +..|++ . ...--.|...+.+.. ....+....+.|.+.. .+.++..+||.|.-.- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~----~-~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~ 75 (511) T protein:vir:96 1 MLKVNEFETDTDLRGNINYLFND----E-ANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLV 75 (511) T ss_pred Cccccchhhhhhhhhhhhhhhhh----h-hCCcccccchhhhhhcCHHHHHHHHHHHHHhhhHHHHHHHHHhhccCcccc Confidence 111 111211 0 000012332222111 1223444444554443 3566788999876321 Q ss_pred --CCCC--CCCCCcccCHHHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHH Q lcl|NC_019423. 63 --KPPK--IKGRSQVQPRLVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYV 138 (756) Q Consensus 63 --~~~~--~~grS~~v~~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v 138 (756) .... .+...+++.+-....|+....-| +|.+.-+ . .+|.+.. ++++-++. .|+--.....+. 
T Consensus 76 ~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl----~g~p~~~--~---~~d~~~~----~~l~~~~~-~n~~~~~~~~~~ 141 (511) T protein:vir:96 76 ELTRRKEEYMADNRVAHDYASYISDFINGYF----LGNPIQY--Q---DDDKDVL----EAIEAFND-LNDVESHNRSLG 141 (511) T ss_pred ccCcccccccCcceeecchHHHHHHHHhhhh----cccCcee--e---cCchHHH----HHHHHHHh-hcChhHHHHHHH Confidence 1111 12234577777777776665544 5544333 2 2333332 34444432 343344455788 Q ss_pred HHHhhcCceEEEEeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcce Q lcl|NC_019423. 139 HSIVDDGTGIARIGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATY 218 (756) Q Consensus 139 ~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~ 218 (756) +++++.|.+.+.+|++. T Consensus 142 ~~~~~~G~a~~~vy~d~--------------------------------------------------------------- 158 (511) T protein:vir:96 142 LDLSIYGKAYELMIRNQ--------------------------------------------------------------- 158 (511) T ss_pred HHHHhcCeeEEEEEeCC--------------------------------------------------------------- Confidence 89999888877665420 Q ss_pred eccCceeEEEeeeeecCceeEEEechhheE--eCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhh Q lcl|NC_019423. 219 AIQTGVTEVEVEKALVNRPTVEMLNPNNVV--IDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPIT 296 (756) Q Consensus 219 ~~~~g~~~~~~~~~~~g~~~ie~V~p~~~~--~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~ 296 (756) .|.+++..++|.+++ ||..... .. ..+.+.|.+.. .+ T Consensus 159 ---------------dg~~~i~~~~p~~~~~v~dd~~~~---~~-~~~vr~~~~~~-----------~~----------- 197 (511) T protein:vir:96 159 ---------------DDETRLYKSDAMSTFIIYDNTVER---NS-IAGVRYLRTKP-----------ID----------- 197 (511) T ss_pred ---------------CCceEEEEEcccceEEEEcCCCCC---ce-EEEEEEEEeee-----------cc----------- Confidence 134678888898876 4544321 12 22223332210 00 Q ss_pred chhhhccccccccccccccceEEEEEEEEEeeccCCceeEEEEEEEECCEEE-----EecccccCCCccceEEeeeeeec Q lcl|NC_019423. 
297 DPDHESKTPSDFQFKDALRKKVVAYEYWGFYDINDDGSLEPIVATWIGSTLI-----RMENNPFPDGKLPLVVVPYMPRK 371 (756) Q Consensus 297 ~~~~~~~~~~~~~~~d~s~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g~~~L-----~~~~~P~~~~~~Pfv~~~~~~~~ 371 (756) +.....+..+|+|.. +++ ++++..++..+ .....|.+.+.+|++.+. T Consensus 198 ---------------~~~~~~~~~~~vyt~-----~~i---~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~----- 249 (511) T protein:vir:96 198 ---------------KTDEDEVFTVDLFTS-----HGV---YRYLTNRTNGLKLTPRENSFESHSFERMPITEFS----- 249 (511) T ss_pred ---------------ccccceEEEEEEEeC-----CcE---EEEEecCCCcccccccccccccCcCcccceEEec----- Confidence 001123444555532 111 22222221111 112234445666776553 Q ss_pred CcccCCchHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhh-hhccccc------cccc--cccccccc Q lcl|NC_019423. 372 RELFGEADAELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRR-YDDGQDY------EYNP--MQGNPSQS 442 (756) Q Consensus 372 ~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~-~~~~~~~------~~~~--~~~~~~~~ 442 (756) ++.+|.|.+..++++++.+|..++.+.+.+...+++.+++.-......+... ....... ...+ ........ T Consensus 250 n~~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 329 (511) T protein:vir:96 250 NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYVDAEGRETEGSVD 329 (511) T ss_pred CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCCchhhcccccccceeccccceeccccccCCCCcc Confidence 3456899999999999999999999999998888876654321111111110 0000000 0000 11122333 Q ss_pred cccccCCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 443 IMEHKFPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICA 522 (756) Q Consensus 443 i~~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~ 522 (756) +.++..+.-.......+..+.+.+-..|++++.+.+.-+.. .++.++...............+.|..++++++++++. 
T Consensus 330 ~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n--~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~ 407 (511) T protein:vir:96 330 GGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGT--QSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLET 407 (511) T ss_pred eeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccccc--cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 45555444445666778888899999999998876543322 2343455555556666666677788888888888877 Q ss_pred HHHhhCCCCcEEEEecCceeecCHhHhcCcceEEEecccccHHHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCCh Q lcl|NC_019423. 523 MNAVFLSEKEVVRITNEQYVEIKREDLKGNFDIEVDINTAEIDNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMP 602 (756) Q Consensus 523 li~q~~~~~r~iRI~g~~~v~i~~d~~~~~~Dv~V~~g~a~~~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~ 602 (756) ++........ +.++ .+|.|.-..+...... + .+..+..+...++.+... .+ ++... T Consensus 408 ~~~~~~~~~~-------------~~~~---~~i~~~f~~~~p~n~~-e-~~d~~~kl~G~iS~et~l----~~--l~~v~ 463 (511) T protein:vir:96 408 ILKNTRSIDA-------------NKDF---NTVRYVYNRNLPKSLI-E-ELKAYIDSGGKISQTTLM----SL--FSFFQ 463 (511) T ss_pred HHHhcCCCcc-------------cccc---ccceEEeCCCCCcCHH-H-HHHHHHHHhccCChHHHH----Hh--CCCCC Confidence 7643321100 0011 1233333332221111 1 111122221223322211 11 11222 Q ss_pred hHHHHhhhccCCCChhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 603 DLAHELRTWQPQPDPMEEQLKQLAIQKAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARDMEKQ 678 (756) Q Consensus 603 ~~~~~l~~~~~q~~p~~~~~~q~~~~~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~~~~ 678 (756) +...-++++ ..++.. .++.. ......+.......++..+.+... .+.. 
T Consensus 464 d~~~El~ri--------------~~E~~~----~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~-~e~~ 511 (511) T protein:vir:96 464 DPELEVKKI--------------EEDEKE----SIKKA---------QKGIYKDPRDINDDEQDDDTKDTV-DKKE 511 (511) T ss_pred CHHHHHHHH--------------HHHHHH----HHHHH---------hhccccCCCCCCCCCCCCCccCcc-cccC Confidence 221111111 000000 00000 000000000000000000000000 0000 No 96 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=99.48 E-value=1.4e-12 Score=85.58 Aligned_cols=451 Identities=11% Similarity=0.077 Sum_probs=198.1 Q ss_pred CCc---ccCCCCCCCccccccccCCCchHHHHHHH-------HHHHHHHHHhhHHHHHHHHHHHHhccccCCC------- Q lcl|NC_019423. 1 MEH---QDTFKPLPDPAQSEKLTDWKKEPSIQLLK-------GDLESAKPAHDAIMSQIREWNDLMEVKGKAK------- 63 (756) Q Consensus 1 ~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~l~-------~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~------- 63 (756) |=| ++..+|. +++++.+|. ..+....+.|...+.+.++..+||.|.-.-. T Consensus 1 ~~~~~~~~~~~~~-------------~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~ 67 (474) T protein:vir:94 1 MFNIIRMPWDKPY-------------GEEVVEQLKPQFETQEEMIVRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVD 67 (474) T ss_pred CcccccccCCCch-------------hhHHHHhhhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccchhc Confidence 332 2222221 112222222 2355555667777777889999999752110 Q ss_pred --CCCCCCCC--cccCHHHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHH Q lcl|NC_019423. 64 --PPKIKGRS--QVQPRLVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVH 139 (756) Q Consensus 64 --~~~~~grS--~~v~~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~ 139 (756) -+...+++ +++.+.....|+.....| ||.+.- |.. +|.+.. 
+.++..+ .++-......+++ T Consensus 68 ~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l----~g~p~~--~~~---~d~~~~----~~l~~~~--~n~~~~~~~e~~~ 132 (474) T protein:vir:94 68 VHGNIDYDKPDWRITTNFHQNLVDQKVSYV----ASKPVT--YSC---EDENVL----KVIHDVL--DTRWDNKLIDILT 132 (474) T ss_pred cccccccccCcceeecchHHHHHHHHHhhh----hcCCce--ecc---CcHHHH----HHHHHHH--hccHHHHHHHHHH Confidence 01123333 467777777777666555 665533 322 343332 3444433 3454555667889 Q ss_pred HHhhcCceEEEEeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCccee Q lcl|NC_019423. 140 SIVDDGTGIARIGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYA 219 (756) Q Consensus 140 ~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~ 219 (756) ++++.|.+.+.+|++. T Consensus 133 ~~~~~G~~~~~~~~d~---------------------------------------------------------------- 148 (474) T protein:vir:94 133 ATSNKGIDWLQVYINE---------------------------------------------------------------- 148 (474) T ss_pred HHhhcCceEEEEEecC---------------------------------------------------------------- Confidence 9999998877765421 Q ss_pred ccCceeEEEeeeeecCceeEEEechhheEe--CCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhc Q lcl|NC_019423. 220 IQTGVTEVEVEKALVNRPTVEMLNPNNVVI--DPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITD 297 (756) Q Consensus 220 ~~~g~~~~~~~~~~~g~~~ie~V~p~~~~~--Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~ 297 (756) .|.+++..++|.++++ |+... .+..+++ |.|.... . T Consensus 149 --------------~~~~~i~~~~p~~~~~v~d~~~~---~~~~~~i-r~~~~~~-------~----------------- 186 (474) T protein:vir:94 149 --------------NGEMKLFRVPAEQAIPIWVDKER---EELKSFI-RYYKFNN-------E----------------- 186 (474) T ss_pred --------------CCeeEEEEEcccceEEEEcCCCC---CceEEEE-EEEEecC-------e----------------- Confidence 1346677788888764 33322 2222332 2221100 0 Q ss_pred hhhhccccccccccccccceEEEEEEEEEeeccCCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCC Q lcl|NC_019423. 
298 PDHESKTPSDFQFKDALRKKVVAYEYWGFYDINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGE 377 (756) Q Consensus 298 ~~~~~~~~~~~~~~d~s~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~ 377 (756) . ... .|.+ .+|..|.+ ++.+.. .. ...+...+.....|...+.+|++.+. ++.+|. T Consensus 187 -~--~~~----~yt~---~~~~~y~~------~~~~~~-~~--~~~~~~~~~~~~~~~~~g~vPvv~~~-----nn~~g~ 242 (474) T protein:vir:94 187 -E--KVE----FWTD---TTVTYYVL------ENGGLI-PD--YYYGANHVQSHFSNGNWGRVPFIAFK-----NNPEEV 242 (474) T ss_pred -E--EEE----EEeC---CeEEEEEE------cCCccc-cc--cccCcCcccccccccCCCccceEEec-----CCcCCC Confidence 0 000 0000 01111111 111110 00 00000011111222334667777654 345799 Q ss_pred chHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhhhhccccccccccccccccccccccCCCcchHHHH Q lcl|NC_019423. 378 ADAELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQGNPSQSIMEHKFPELPQSAIV 457 (756) Q Consensus 378 g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~ 457 (756) |.+..++++++.+|...+.+.+.+...+.|.+++.-...+......... .....+....++.+.++..+.-...+.. T Consensus 243 sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~---~~~~~i~~~~~~~~~~l~~~~~~~~~~~ 319 (474) T protein:vir:94 243 SDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYEGEDLEEFMRGL---KYYKAINVDGDGGVETIQVEVPVSSTKE 319 (474) T ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhhhh---hccceeeccCCCceeEEeecCCHHHHHH Confidence 9999999999999999999999998888887766533222222211111 1122233444556666665544566667 Q ss_pred HHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEe Q lcl|NC_019423. 458 MTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRIT 537 (756) Q Consensus 458 ~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~ 537 (756) .++.+...+-..|++++.+.+.-++. .++.++..+............+.|..++++++++++.+. ... 
T Consensus 320 ~~~~l~~~I~~~s~~p~~~~~~~~~n--~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~----~~~------ 387 (474) T protein:vir:94 320 YIDLMRVYIMEFGQGVDFQTDKFGSA--PSGIALKFLYGNLDLKANKLKNKATVAIQELISFIIDFN----NLK------ 387 (474) T ss_pred HHHHHHHHHHHHhCccccCccccccc--cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----CCC------ Confidence 78888899999999987664432222 233334444444444455555566666666665555443 210 Q ss_pred cCceeecCHhHhcCcceEEEecccccHHHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCChhHHHHhhhccCCCCh Q lcl|NC_019423. 538 NEQYVEIKREDLKGNFDIEVDINTAEIDNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMPDLAHELRTWQPQPDP 617 (756) Q Consensus 538 g~~~v~i~~d~~~~~~Dv~V~~g~a~~~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~~~~~~l~~~~~q~~p 617 (756) . ++. +|.|.-........ .+.+..+... ..++.+.... .+++..+...-+++ T Consensus 388 ~-d~~-----------~i~v~f~~~~p~~~--~e~a~~~~~~-g~iS~et~l~------~l~~v~D~~~E~er------- 439 (474) T protein:vir:94 388 T-DVK-----------DIEISFNFNRMMND--AEQSQIIAQS-QYLSRETLVK------SSPLVDDYKAELER------- 439 (474) T ss_pred c-ccc-----------eeeEEeccCcccCH--HHHHHHHHHc-CCCCHHHHHH------hCCCCCCHHHHHHH------- Confidence 0 111 12222222211111 1112222222 2233221111 11122221111111 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 618 MEEQLKQLAIQKAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARDME 676 (756) Q Consensus 618 ~~~~~~q~~~~~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~~ 676 (756) +...+.. .++ ....... .......+..+.... 
+-+ T Consensus 440 -------i~~E~~~--------~~~-------~~~~~~~-~~~~~~~~~~~~~~~-~~e 474 (474) T protein:vir:94 440 -------IEQEQME--------YNK-------QLPNLDD-GGADGAQQQEGSNNK-ESE 474 (474) T ss_pred -------HHHHHHH--------HHh-------hccccCC-CCCCCcccCCCCccc-ccC Confidence 1000000 000 0000000 000000000000000 000 No 97 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=99.48 E-value=1.4e-12 Score=85.58 Aligned_cols=451 Identities=11% Similarity=0.077 Sum_probs=198.1 Q ss_pred CCc---ccCCCCCCCccccccccCCCchHHHHHHH-------HHHHHHHHHhhHHHHHHHHHHHHhccccCCC------- Q lcl|NC_019423. 1 MEH---QDTFKPLPDPAQSEKLTDWKKEPSIQLLK-------GDLESAKPAHDAIMSQIREWNDLMEVKGKAK------- 63 (756) Q Consensus 1 ~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~l~-------~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~------- 63 (756) |=| ++..+|. +++++.+|. ..+....+.|...+.+.++..+||.|.-.-. T Consensus 1 ~~~~~~~~~~~~~-------------~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~ 67 (474) T protein:vir:97 1 MFNIIRMPWDKPY-------------GEEVVEQLKPQFETQEEMIVRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVD 67 (474) T ss_pred CcccccccCCCch-------------hhHHHHhhhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccchhc Confidence 332 2222221 112222222 2355555667777777889999999752110 Q ss_pred --CCCCCCCC--cccCHHHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHH Q lcl|NC_019423. 64 --PPKIKGRS--QVQPRLVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVH 139 (756) Q Consensus 64 --~~~~~grS--~~v~~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~ 139 (756) -+...+++ +++.+.....|+.....| ||.+.- |.. +|.+.. 
+.++..+ .++-......+++ T Consensus 68 ~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l----~g~p~~--~~~---~d~~~~----~~l~~~~--~n~~~~~~~e~~~ 132 (474) T protein:vir:97 68 VHGNIDYDKPDWRITTNFHQNLVDQKVSYV----ASKPVT--YSC---EDENVL----KVIHDVL--DTRWDNKLIDILT 132 (474) T ss_pred cccccccccCcceeecchHHHHHHHHHhhh----hcCCce--ecc---CcHHHH----HHHHHHH--hccHHHHHHHHHH Confidence 01123333 467777777777666555 665533 322 343332 3444433 3454555667889 Q ss_pred HHhhcCceEEEEeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCccee Q lcl|NC_019423. 140 SIVDDGTGIARIGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYA 219 (756) Q Consensus 140 ~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~ 219 (756) ++++.|.+.+.+|++. T Consensus 133 ~~~~~G~~~~~~~~d~---------------------------------------------------------------- 148 (474) T protein:vir:97 133 ATSNKGIDWLQVYINE---------------------------------------------------------------- 148 (474) T ss_pred HHhhcCceEEEEEecC---------------------------------------------------------------- Confidence 9999998877765421 Q ss_pred ccCceeEEEeeeeecCceeEEEechhheEe--CCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhc Q lcl|NC_019423. 220 IQTGVTEVEVEKALVNRPTVEMLNPNNVVI--DPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITD 297 (756) Q Consensus 220 ~~~g~~~~~~~~~~~g~~~ie~V~p~~~~~--Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~ 297 (756) .|.+++..++|.++++ |+... .+..+++ |.|.... . T Consensus 149 --------------~~~~~i~~~~p~~~~~v~d~~~~---~~~~~~i-r~~~~~~-------~----------------- 186 (474) T protein:vir:97 149 --------------NGEMKLFRVPAEQAIPIWVDKER---EELKSFI-RYYKFNN-------E----------------- 186 (474) T ss_pred --------------CCeeEEEEEcccceEEEEcCCCC---CceEEEE-EEEEecC-------e----------------- Confidence 1346677788888764 33322 2222332 2221100 0 Q ss_pred hhhhccccccccccccccceEEEEEEEEEeeccCCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCC Q lcl|NC_019423. 
298 PDHESKTPSDFQFKDALRKKVVAYEYWGFYDINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGE 377 (756) Q Consensus 298 ~~~~~~~~~~~~~~d~s~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~ 377 (756) . ... .|.+ .+|..|.+ ++.+.. .. ...+...+.....|...+.+|++.+. ++.+|. T Consensus 187 -~--~~~----~yt~---~~~~~y~~------~~~~~~-~~--~~~~~~~~~~~~~~~~~g~vPvv~~~-----nn~~g~ 242 (474) T protein:vir:97 187 -E--KVE----FWTD---TTVTYYVL------ENGGLI-PD--YYYGANHVQSHFSNGNWGRVPFIAFK-----NNPEEV 242 (474) T ss_pred -E--EEE----EEeC---CeEEEEEE------cCCccc-cc--cccCcCcccccccccCCCccceEEec-----CCcCCC Confidence 0 000 0000 01111111 111110 00 00000011111222334667777654 345799 Q ss_pred chHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhhhhccccccccccccccccccccccCCCcchHHHH Q lcl|NC_019423. 378 ADAELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQGNPSQSIMEHKFPELPQSAIV 457 (756) Q Consensus 378 g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~ 457 (756) |.+..++++++.+|...+.+.+.+...+.|.+++.-...+......... .....+....++.+.++..+.-...+.. T Consensus 243 sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~---~~~~~i~~~~~~~~~~l~~~~~~~~~~~ 319 (474) T protein:vir:97 243 SDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYEGEDLEEFMRGL---KYYKAINVDGDGGVETIQVEVPVSSTKE 319 (474) T ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhhhh---hccceeeccCCCceeEEeecCCHHHHHH Confidence 9999999999999999999999998888887766533222222211111 1122233444556666665544566667 Q ss_pred HHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEe Q lcl|NC_019423. 458 MTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRIT 537 (756) Q Consensus 458 ~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~ 537 (756) .++.+...+-..|++++.+.+.-++. .++.++..+............+.|..++++++++++.+. ... 
T Consensus 320 ~~~~l~~~I~~~s~~p~~~~~~~~~n--~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~----~~~------ 387 (474) T protein:vir:97 320 YIDLMRVYIMEFGQGVDFQTDKFGSA--PSGIALKFLYGNLDLKANKLKNKATVAIQELISFIIDFN----NLK------ 387 (474) T ss_pred HHHHHHHHHHHHhCccccCccccccc--cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----CCC------ Confidence 78888899999999987664432222 233334444444444455555566666666665555443 210 Q ss_pred cCceeecCHhHhcCcceEEEecccccHHHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCChhHHHHhhhccCCCCh Q lcl|NC_019423. 538 NEQYVEIKREDLKGNFDIEVDINTAEIDNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMPDLAHELRTWQPQPDP 617 (756) Q Consensus 538 g~~~v~i~~d~~~~~~Dv~V~~g~a~~~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~~~~~~l~~~~~q~~p 617 (756) . ++. +|.|.-........ .+.+..+... ..++.+.... .+++..+...-+++ T Consensus 388 ~-d~~-----------~i~v~f~~~~p~~~--~e~a~~~~~~-g~iS~et~l~------~l~~v~D~~~E~er------- 439 (474) T protein:vir:97 388 T-DVK-----------DIEISFNFNRMMND--AEQSQIIAQS-QYLSRETLVK------SSPLVDDYKAELER------- 439 (474) T ss_pred c-ccc-----------eeeEEeccCcccCH--HHHHHHHHHc-CCCCHHHHHH------hCCCCCCHHHHHHH------- Confidence 0 111 12222222211111 1112222222 2233221111 11122221111111 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 618 MEEQLKQLAIQKAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARDME 676 (756) Q Consensus 618 ~~~~~~q~~~~~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~~ 676 (756) +...+.. .++ ....... .......+..+.... 
+-+ T Consensus 440 -------i~~E~~~--------~~~-------~~~~~~~-~~~~~~~~~~~~~~~-~~e 474 (474) T protein:vir:97 440 -------IEQEQME--------YNK-------QLPNLDD-GGADGAQQQEGSNNK-ESE 474 (474) T ss_pred -------HHHHHHH--------HHh-------hccccCC-CCCCCcccCCCCccc-ccC Confidence 1000000 000 0000000 000000000000000 000 No 98 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=99.48 E-value=7.6e-12 Score=81.53 Aligned_cols=455 Identities=9% Similarity=0.050 Sum_probs=202.2 Q ss_pred CCcccCCCCCCCccccccccC---CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCC--CCCCC-------C Q lcl|NC_019423. 1 MEHQDTFKPLPDPAQSEKLTD---WKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKA--KPPKI-------K 68 (756) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~--~~~~~-------~ 68 (756) |. +-.-| =+++.-+++-+ -..+.+.. .+....+.|...+.+.++..+||.|.-.- .+.+. + T Consensus 1 ~~--~~~~~-~~~~~~~~~~~~~~~~~~~~~~----~i~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~ 73 (474) T protein:vir:96 1 MI--VIFWP-NEKPYHERVVEQIKPKYETQEE----MIIRLINDHKPKIDDITVGERYYNHDPDVLRLAPKLDNKGEIDP 73 (474) T ss_pred Ce--eeccC-CCchhhhhHHHHhhhccCChHH----HHHHHHHHHHHHHHHHHHHHHHhccCCcchhccchhcccccccc Confidence 21 11122 12233333321 11111222 23444445666677778889999986311 01111 1 Q ss_pred CC--CcccCHHHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCc Q lcl|NC_019423. 69 GR--SQVQPRLVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGT 146 (756) Q Consensus 69 gr--S~~v~~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~ 146 (756) .| .+++.+..+..|+....-| ||.+.-+ .+ +|.+..+. ++.++. ++-........++++..|. 
T Consensus 74 ~~~~~ki~~n~~~~Ivd~~~~~l----~g~p~~~--~~---~d~~~~~~----l~~~~~--n~~~~~~~~~~~~~~~~G~ 138 (474) T protein:vir:96 74 LKPDWRMFTNYHQNLVDQKVAYA----VANPVTF--SS---DDDKSLKT----IQEVLN--HKWDDKLVDILTAASNKGI 138 (474) T ss_pred cccchhcccchHHHHHHhhhhhh----cccCcee--ec---CchHHHHH----HHHHHh--cCHHHHHHHHHHHHHhcCe Confidence 12 2466666666666665544 7765443 32 34444443 333332 3445556667888999988 Q ss_pred eEEEEeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeE Q lcl|NC_019423. 147 GIARIGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTE 226 (756) Q Consensus 147 gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~ 226 (756) +.+.+||+. T Consensus 139 ~~~~~y~d~----------------------------------------------------------------------- 147 (474) T protein:vir:96 139 EWLQPYIDE----------------------------------------------------------------------- 147 (474) T ss_pred eEEEEEecC----------------------------------------------------------------------- Confidence 877765531 Q ss_pred EEeeeeecCceeEEEechhheEe--CCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccc Q lcl|NC_019423. 227 VEVEKALVNRPTVEMLNPNNVVI--DPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKT 304 (756) Q Consensus 227 ~~~~~~~~g~~~ie~V~p~~~~~--Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 304 (756) .|++++..++|.++++ |++.. .+..+. .+.|.... ... .. T Consensus 148 -------~~~~~i~~~~p~~~~~v~d~~~~---~~~~~~-vr~~~~~~-----------~~~----------------~~ 189 (474) T protein:vir:96 148 -------NGEFKTFRVPAEQAIPIWTNKER---DTLKAF-IRYYRLDG-----------AER----------------VE 189 (474) T ss_pred -------CCceEEEEEcccceEEEEcCCCC---CceEEE-EEEEeecC-----------ceE----------------EE Confidence 1346778888888774 44332 223233 23331100 000 00 Q ss_pred cccccccccccceEEEEEEEEEeeccCCceeEEEE-EEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHh Q lcl|NC_019423. 
305 PSDFQFKDALRKKVVAYEYWGFYDINDDGSLEPIV-ATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELL 383 (756) Q Consensus 305 ~~~~~~~d~s~~~V~v~E~w~k~d~~~~g~~~~~~-~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~ 383 (756) . +.+ .+|..|. . .+.+...... ..............|.+.|++|++.+.. +.+|.|.+..+ T Consensus 190 ~----yt~---~~v~~~~---~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----n~~g~sd~e~v 251 (474) T protein:vir:96 190 Y----WTD---SDVTYYE---Y---QDGILIPDYYHGEEHIQSHYYVGNKRVSWGRVPFIPFKN-----NPQEMSDLFMY 251 (474) T ss_pred E----EeC---CeEEEEE---e---cCCceeeccccccccccccccccccccCCCceeEEEecc-----CCCCCCcHHHH Confidence 0 000 0111111 0 0111000000 0000000111123344567788887754 45689999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhhhhccccccccccccc-cccccccccCCCcchHHHHHHHHH Q lcl|NC_019423. 384 GDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQGN-PSQSIMEHKFPELPQSAIVMTQMQ 462 (756) Q Consensus 384 ~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~-~~~~i~~~~~~~~~~~~~~~l~~~ 462 (756) +++++.+|..++.+.+.+...++|.+++.-...+........... ...+... .++.++++..+.-.......++.+ T Consensus 252 ~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~---~~~i~~~~~~~~~~~l~~~~~~~~~~~~~~~l 328 (474) T protein:vir:96 252 KTIIDAMDKRLSDTQNTFDESTELIYILKGYEGQDLDEFMRNLKY---YKAINVDGDGSGVDTIQIEVPVQSSKEYLDML 328 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccccchhhhhhc---CceEEecCCCCceeEEeecCChHHHHHHHHHH Confidence 999999999999999999999988766542222211111111111 1122222 233466666554456667788899 Q ss_pred HHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecCcee Q lcl|NC_019423. 463 NQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNEQYV 542 (756) Q Consensus 463 ~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v 542 (756) .+.+-..|++++.+.+..++.. ++.++...............+.|..+++++++.++.+.-..++. . 
T Consensus 329 ~~~i~~~s~~p~~~~~~~~~n~--Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~-----------~ 395 (474) T protein:vir:96 329 RDYVIEFGQGVDFQQDKFGNSP--SGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYKLNIKV-----------Q 395 (474) T ss_pred HHHHHHHhCCcccccccccccc--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccc-----------c Confidence 9999999999887755433222 33335444455555556666677777777777666654211111 0 Q ss_pred ecCHhHhcCcceEEEecccccHHHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCChhHHHHhhhccCCCChhhhhH Q lcl|NC_019423. 543 EIKREDLKGNFDIEVDINTAEIDNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMPDLAHELRTWQPQPDPMEEQL 622 (756) Q Consensus 543 ~i~~d~~~~~~Dv~V~~g~a~~~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~~~~~~l~~~~~q~~p~~~~~ 622 (756) +.+|+.+.+.+.-.... ++++... ..++.+... .+ +....+...-++++. T Consensus 396 ---------~i~i~f~~~~p~~~~e~----~~~~~~a-g~iS~et~~----~~--~~~v~d~~~E~~ri~---------- 445 (474) T protein:vir:96 396 ---------DVEITFNFNVMVNELEQ----SQIGVQS-QYLSKETVV----TN--HPWVDDPVAELERIE---------- 445 (474) T ss_pred ---------eeeEEeccCCCcCHHHH----HHHHHhc-CCCchHHHH----Hh--CCCCCCHHHHHHHHH---------- Confidence 11222222222111111 1222221 222221111 11 122222222222211 Q ss_pred HHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 623 KQLAIQKAQLENEELQ-SKIALNNAKAKEAASSGD 656 (756) Q Consensus 623 ~q~~~~~aq~e~~~~q-a~a~~~~a~a~~~~aq~~ 656 (756) .++.+.. .... ...+....... -..+.+ T Consensus 446 ----~E~~e~~-~~~~~~~~~~~~~~~d-~~~e~~ 474 (474) T protein:vir:96 446 ----QDNIDFN-KQLPPLEGDANGRAQD-NESETN 474 (474) T ss_pred ----HHHHHHH-hcccccccccccccCC-CcccCC Confidence 0000000 0000 00000000000 000001 No 99 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=99.46 E-value=9.6e-12 Score=80.99 Aligned_cols=457 Identities=10% Similarity=0.054 Sum_probs=199.6 Q ss_pred CC---cccCCCCCCCccc-cccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCC---------CCCCC Q lcl|NC_019423. 
1 ME---HQDTFKPLPDPAQ-SEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKA---------KPPKI 67 (756) Q Consensus 1 ~~---~~~~~~~~~~~~~-~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~---------~~~~~ 67 (756) |- +..+.+| -.++ =+.+.+.. ......+......|...+.+.++..+||.|.-.- ..... T Consensus 1 ~~~~~~~~~~~~--~~~~~~~~~~~~~-----~~~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~r~~~~~~~~~~~ 73 (474) T protein:vir:95 1 MFNIIRMPWDKP--YGEEVVEQLKPQF-----ETQEEMIIRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVYGNID 73 (474) T ss_pred CcceeecCCCCc--hhhHHHHhhhhcc-----CChHHHHHHHHHHHHHHHHHHHHHHHHhcccCchhccccccccccccc Confidence 22 2222222 1110 00111110 1111234555556667777788899999875210 00112 Q ss_pred CCCC--cccCHHHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcC Q lcl|NC_019423. 68 KGRS--QVQPRLVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDG 145 (756) Q Consensus 68 ~grS--~~v~~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g 145 (756) .+++ +++.+..+..|+....-| ||.+. .|. .+|.+.- +.+...+. ++-...+..++++++.+| T Consensus 74 ~~~~~~ki~~n~~~~Ivd~~~~~l----~g~p~--~~~---~~d~~~~----~~l~~~~~--n~~~~~~~e~~~~~~~~G 138 (474) T protein:vir:95 74 YDKPDWRITTNFHQNLVDQKVSYV----ASKPV--TYS---CEDESVL----KIIHDVLD--TRWDNKLIDILTATSNKG 138 (474) T ss_pred cccccceeccchHHHHHHHHHhhh----ccCCc--eec---cCchHHH----HHHHHHHh--ccHHHHHHHHHHHHhhcC Confidence 2333 567777777777665544 66543 333 2343332 34444432 444445667889999999 Q ss_pred ceEEEEeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCcee Q lcl|NC_019423. 146 TGIARIGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVT 225 (756) Q Consensus 146 ~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~ 225 (756) .+.+.+||+. 
T Consensus 139 ~~~~~v~~d~---------------------------------------------------------------------- 148 (474) T protein:vir:95 139 IDWLQVYINE---------------------------------------------------------------------- 148 (474) T ss_pred cEEEEEEecC---------------------------------------------------------------------- Confidence 8877765531 Q ss_pred EEEeeeeecCceeEEEechhheE--eCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhcc Q lcl|NC_019423. 226 EVEVEKALVNRPTVEMLNPNNVV--IDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESK 303 (756) Q Consensus 226 ~~~~~~~~~g~~~ie~V~p~~~~--~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 303 (756) .|++++..++|.+++ ||+.... +..+++ +.+..... . .. T Consensus 149 --------~~~~~i~~~~p~~~~~v~d~~~~~---~~~~~i-~~~~~~~~-------------------------~--~~ 189 (474) T protein:vir:95 149 --------NGEMKLFRVPAEQAIPIWVDKERE---ELKSFI-RYYKFNNE-------------------------E--KV 189 (474) T ss_pred --------CCceEEEEEcccceEEEEcCCCCC---ceEEEE-EEEEEcCe-------------------------e--EE Confidence 134667778888876 3443322 222222 22211000 0 00 Q ss_pred ccccccccccccceEEEEEEEEEeeccCCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHh Q lcl|NC_019423. 304 TPSDFQFKDALRKKVVAYEYWGFYDINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELL 383 (756) Q Consensus 304 ~~~~~~~~d~s~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~ 383 (756) .. |.+ .+|..|.+ ++.+.. . ....+.........+...+.+|++.++. +.+|.|.+..+ T Consensus 190 ~~----y~~---~~~~~~~~------~~~~~~-~--~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----n~~g~sd~e~v 248 (474) T protein:vir:95 190 EF----WTD---TTVTYYVL------ENGGLI-P--DYYYGANHIQSHFSNGNWGRVPFIAFKN-----NPEEVSDIWMY 248 (474) T ss_pred EE----EeC---CeEEEEEE------cCCccc-c--ccccCcccccccccccCCCccceEeecC-----CCCCCCcHHHH Confidence 00 000 01111111 111100 0 0000111111122233456778877643 45689999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhhhhccccccccccccccccccccccCCCcchHHHHHHHHHH Q lcl|NC_019423. 
384 GDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQGNPSQSIMEHKFPELPQSAIVMTQMQN 463 (756) Q Consensus 384 ~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l~~~~ 463 (756) +++++.+|..++.+.+.+...+.|.+++.-...+......... .....+....++.++++..+.-.......+..+. T Consensus 249 ~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~---~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~ 325 (474) T protein:vir:95 249 KSLIDAIDKRLSDAQNMFDESVELIYILKGYEGQDLEEFMRGL---KYYKAINVDGDGGVETIQVEVPVSSTKEYIDLMR 325 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhhhh---hccceeeccCCCceeEEeecCCHHHHHHHHHHHH Confidence 9999999999999999998888887765532222212211111 1122233344555666665544456667788888 Q ss_pred HHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecCceee Q lcl|NC_019423. 464 QEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNEQYVE 543 (756) Q Consensus 464 ~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~ 543 (756) ..+-..+++++.+.+.-++.. ++.++...............+.|..+++++++++++++-.-+ ++. T Consensus 326 ~~i~~~s~~p~~~~~~~~~n~--Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g~~~-----------d~~- 391 (474) T protein:vir:95 326 AYIMEFGQGVDFQTDKFGSAP--SGIALKFLYGNLDLKANKLKNKATVAIQELIGFIIDFNNLKM-----------DVK- 391 (474) T ss_pred HHHHHHhCCcccccccccccc--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCc-----------ccc- Confidence 999999999887655332222 333355445555555566666677777777766665531110 111 Q ss_pred cCHhHhcCcceEEEecccccHHHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCChhHHHHhhhccCCCChhhhhHH Q lcl|NC_019423. 544 IKREDLKGNFDIEVDINTAEIDNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMPDLAHELRTWQPQPDPMEEQLK 623 (756) Q Consensus 544 i~~d~~~~~~Dv~V~~g~a~~~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~~~~~~l~~~~~q~~p~~~~~~ 623 (756) +.+|+.+.+.+.-....++ .+... ..++.+... . .+.+..+...-++++...... 
T Consensus 392 --------~i~v~f~~~~p~d~~e~a~----~~~~~-g~iS~et~i----~--~l~~v~d~~~E~~ri~~E~~~------ 446 (474) T protein:vir:95 392 --------DIEISFNFNRMMNDAEQSQ----IIAQS-QYLSRETLV----K--SSPLVDDYKAELERIEQEQME------ 446 (474) T ss_pred --------eeeEEeccCCCcCHHHHHH----HHHhc-CCCchHHHH----H--hCCCCCCHHHHHHHHHHHHHH------ Confidence 1112222222211111111 12221 122221111 0 111222222112111100000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 624 QLAIQKAQLENEELQSKIALNNAKAKEAASS 654 (756) Q Consensus 624 q~~~~~aq~e~~~~qa~a~~~~a~a~~~~aq 654 (756) ..+.........+......++..-.... T Consensus 447 ---~~~~~~~~~~~~~d~~~~~~~~~~~~~~ 474 (474) T protein:vir:95 447 ---YNKQLPNLDDGGADGAQQQERSNDKESE 474 (474) T ss_pred ---HHhcccccccccCCCCcCCCCCccCCCC Confidence 0000000000000000000000000000 No 100 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=99.45 E-value=7.3e-12 Score=81.64 Aligned_cols=406 Identities=13% Similarity=0.082 Sum_probs=174.9 Q ss_pred CchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCCCCCCcccCHHHHH----HHHHHHHHHHHhhcCCCC Q lcl|NC_019423. 23 KKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAKPPKIKGRSQVQPRLVRR----QAEWRYAPLSEPFLSSSK 98 (756) Q Consensus 23 ~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~grS~~v~~~v~~----~~e~~~~~L~~~f~~~~~ 98 (756) -+...+..|...+.. ...+.++-.+||.|....+ .-+.-+++..+. .++|. ...++.|...- T Consensus 1 m~~~~i~~L~~~~~~-------~~~r~~~~~~yy~g~~~~~-----~~~~~~p~~~~~~~~~v~nw~-~~~Vd~~a~rl- 66 (422) T protein:vir:97 1 MNYMGMGYLRRKLAL-------FKTGVDKRYRYYAMDDRDD-----TRSIVMPNNVREMYRSVLEWT-AKGVDSLADRI- 66 (422) T ss_pred CChHHHHHHHHHHHH-------HHHHHHHHHHHHhcCCChh-----hcCccccHHHHHHHHhhcchh-HHHHHHHHhcc- Confidence 233444444444433 3345566789999764321 011112222211 11221 22222221111 Q ss_pred EEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcch-HHHHHHHHhhcCceEEEEeeeeeeeeeeeeeeeeecCCCCCHHH Q lcl|NC_019423. 
99 LFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKL-VDDYVHSIVDDGTGIARIGWERKTVKIKTETPVFQLYPIENQEQ 177 (756) Q Consensus 99 ~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~-~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~~ 177 (756) .|...+-+|.+ +.-++ ..|+ +.. ...++++||+.|.+++.|+.+. T Consensus 67 --~~~Gf~~~d~~--------l~~~w-~~N~-ld~~~~~~~~~al~~G~sf~~v~~~~---------------------- 112 (422) T protein:vir:97 67 --IFREFTNDDFN--------AWEIF-KANN-PDIFFDTAIQSALIASCCFVYIMPGA---------------------- 112 (422) T ss_pred --ccceeeCCchh--------HHHHH-HhcC-hHHHHHHHHHHHHHhcceeEEEeeCC---------------------- Confidence 11222233332 11122 2233 332 3345666666666666653210 Q ss_pred HHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEEEechhhe--EeCCCCcC Q lcl|NC_019423. 178 ADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVEMLNPNNV--VIDPSCNG 255 (756) Q Consensus 178 ~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie~V~p~~~--~~Dp~a~~ 255 (756) ..|.|.|..++|.++ +|||..+. T Consensus 113 -------------------------------------------------------~~~~p~i~~~sp~~~~~i~D~~~~~ 137 (422) T protein:vir:97 113 -------------------------------------------------------EDGLPKMQVIEASKATGILDPTTFL 137 (422) T ss_pred -------------------------------------------------------CCCeeEEEEechhhEEEEEeCCCCc Confidence 013456677777764 44664321 Q ss_pred ccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceEEEEEEEEEeeccCCcee Q lcl|NC_019423. 256 DLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVAYEYWGFYDINDDGSL 335 (756) Q Consensus 256 d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~E~w~k~d~~~~g~~ 335 (756) +. + ..+++ ..+ . .. ..+..-+|. ++. 
T Consensus 138 -~~-~---a~~~~-~~~------------------~----------------------~~-~~~~~~~~~------~~~- 163 (422) T protein:vir:97 138 -LT-E---GYAIL-ESD------------------S----------------------NG-NPTLEAYFT------DKD- 163 (422) T ss_pred -ce-e---eEEEE-Eec------------------C----------------------CC-cEEEEEEEc------Cce- Confidence 11 1 11111 000 0 00 000000110 000 Q ss_pred EEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchH-HHhHHHHHHHHHHHHHHHHHHHhhcCCceEeecc Q lcl|NC_019423. 336 EPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADA-ELLGDNQAILGATMRGMIDLLGRSANGQRGYPKG 414 (756) Q Consensus 336 ~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v-~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~g 414 (756) +..+.++......++|+ |..|+|+++..+..++.||.|-+ +.++++|+.+|+.+..+.......+.|+..+ .| T Consensus 164 ---~~~~~~~~~~~~~~~~~--g~vPvv~~~n~~~~~~~~G~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i-~G 237 (422) T protein:vir:97 164 ---IWYYPKKGKPYNIKNPT--GHPLLVPIIHRPDAVRPFGRSRITKAGMYHQKAAKRTLERAEVTAEFYSFPQKYV-LG 237 (422) T ss_pred ---EEEEcCCCccccccCCC--CCcceEEecccCCCccccCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhhhh-cc Confidence 00111111111235665 67899999999999999999977 8899999999999999999999998887544 22 Q ss_pred ccCccc-hhhhhccccccccccccc-cccccccccCCCcc-hHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHH Q lcl|NC_019423. 415 MLDTLN-RRRYDDGQDYEYNPMQGN-PSQSIMEHKFPELP-QSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGI 491 (756) Q Consensus 415 av~~~~-~~~~~~~~~~~~~~~~~~-~~~~i~~~~~~~~~-~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i 491 (756) +-.+.+ ...++..... ...+..+ .+..++..+++.-. +.+...+..+...+-.+||++....|...+. 
..+|.++ T Consensus 238 ~d~d~~~~~~~~~~~~~-i~~~~~de~~~~~~v~q~~~~~l~~~~~~l~~~~~~~a~~s~lP~~~lg~~~~N-psSa~Ai 315 (422) T protein:vir:97 238 MDPDAKPMEKWRATVST-LLEISKDEDGDKPTVGQFTTASMAPFMEHLKMYASLFAGGSGLTLDDLGFPSDN-PSSVESI 315 (422) T ss_pred cCcccccCchhhhhhhh-hhccCCCCCCCcceeeecCCCChhHHHHHHHHHHHHHhcccCCCHHHhccccCc-hhHHHHH Confidence 211000 0011111111 1111111 12234444443322 2344555566666666689999999865431 1344445 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHhcCcceEEEeccc---ccH-H-H Q lcl|NC_019423. 492 RGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDLKGNFDIEVDINT---AEI-D-N 566 (756) Q Consensus 492 ~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~~~~~Dv~V~~g~---a~~-~-~ 566 (756) ......-........+.|..+++.++++++.+.-..-+ .++.+. ++.+.=.+ ... + . T Consensus 316 ~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~~~~---------------~~~~~~---~~~~~w~p~~~~~~~s~a 377 (422) T protein:vir:97 316 KAAHENLRAAGRKAQRSFSSGFLNVAYIAVCLRDEFPY---------------LRNQFM---DTVIKWEPLFEADANMLT 377 (422) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcc---------------cchhhc---cceEEEccCCCCChHHHH Confidence 54444444445555677777888888777766532211 012221 22222221 111 1 1 Q ss_pred HHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCChhHHHHhhhccCCCChhhhhHHHHHHHHHHH Q lcl|NC_019423. 567 QKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMPDLAHELRTWQPQPDPMEEQLKQLAIQKAQL 632 (756) Q Consensus 567 ~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~~~~~~l~~~~~q~~p~~~~~~q~~~~~aq~ 632 (756) +.+-.+.-+.+......+. ..+++..|+.+....+..+ ++.+++. 
T Consensus 378 ~~aDa~~Kl~~a~~~~~~~-------~~~~~~lg~~~~~~~~~~~--------------~~~~~d~ 422 (422) T protein:vir:97 378 LVGDGAIKLNQAIPGFMDA-------DVIRDLTGVKGADKPIPAI--------------TEVTTDG 422 (422) T ss_pred HHHHHHHHHHhhccccccH-------HHHHHHcCCCchhHHHHHH--------------HhhhccC Confidence 1111122222221111111 1233444554332222111 1111111 No 101 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=99.44 E-value=5.3e-12 Score=82.40 Aligned_cols=468 Identities=11% Similarity=0.029 Sum_probs=193.4 Q ss_pred CCcccCCCCCCCccccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCC--CCC-C-C-CCCcccC Q lcl|NC_019423. 1 MEHQDTFKPLPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAK--PPK-I-K-GRSQVQP 75 (756) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~--~~~-~-~-grS~~v~ 75 (756) |...+.+ .++.++..| ...+.....+.++-.+||.|....+ +.. . + ..-++|. T Consensus 1 ~~~~~~~---------------d~~~~i~~L-------~~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~ 58 (488) T protein:vir:23 1 MAETESI---------------DPEKLRDQL-------LDAFENKQNELKSSKAYYDAERRPDAIGLAVPLDMRKYLAHV 58 (488) T ss_pred CCcccCC---------------CHHHHHHHH-------HHHHHHHHHHHHHHHHHHhcccchhhcCcccchhhhhhhhhc Confidence 3333333 122233333 3444555566677789999764221 001 1 1 1112455 Q ss_pred HHHHHHHHHHHHHHH-HhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeee Q lcl|NC_019423. 76 RLVRRQAEWRYAPLS-EPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWE 154 (756) Q Consensus 76 ~~v~~~~e~~~~~L~-~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~ 154 (756) +-.+..|+.....|. .-|+.+.. +.+....-+|.+..+ .++.++ ..|+--.....+.+++++.|.+++.+++. 
T Consensus 59 n~~~~ivd~~a~~l~~~Gf~~~~~-~~~~~~~~~d~~~~~----~l~~i~-~~N~~~~~~~~~~~~a~i~G~a~~~v~~~ 132 (488) T protein:vir:23 59 GYPRTYVDAIAERQELEGFRIPSA-NGEEPESGGENDPAS----ELWDWW-QANNLDIEATLGHTDALIYGTAYITISMP 132 (488) T ss_pred chHHHHHHHHHHhhhccceeccCC-cccccccccchhHHH----HHHHHH-HhcChhHHHHHHHHHHhhcCceEEEEecC Confidence 555555555544441 11211111 111112233444444 344333 35554445667899999999998888553 Q ss_pred eeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeec Q lcl|NC_019423. 155 RKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALV 234 (756) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~ 234 (756) ..... ..... T Consensus 133 ~~~~~----------------------------------------------------------------------~~~~~ 142 (488) T protein:vir:23 133 DPEVD----------------------------------------------------------------------FDVDP 142 (488) T ss_pred Ccccc----------------------------------------------------------------------cCCCC Confidence 11000 00112 Q ss_pred CceeEEEechhheE--eCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccc Q lcl|NC_019423. 235 NRPTVEMLNPNNVV--IDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKD 312 (756) Q Consensus 235 g~~~ie~V~p~~~~--~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d 312 (756) +.++|..++|.+++ |||... ....+.+++-+. + T Consensus 143 ~~~~i~~~~p~~~~~~~d~~~~-----~~~~~~~~~~~~-------------~--------------------------- 177 (488) T protein:vir:23 143 EVPLIRVEPPTALYAEVDPRTR-----KVLYAIRAIYGA-------------D--------------------------- 177 (488) T ss_pred CcceEEEeccceeEEEEecCCC-----ceEEEEEEEEec-------------C--------------------------- Confidence 34667788898865 454322 112222222100 0 Q ss_pred cccceEEEEEEEEEeeccCCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHH-HhHHHHHHHH Q lcl|NC_019423. 
313 ALRKKVVAYEYWGFYDINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAE-LLGDNQAILG 391 (756) Q Consensus 313 ~s~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~-~~~d~Q~~iN 391 (756) ...+..+++|.. +.+ +.++-.++........|.+.|.+|++++...+..+..+|.|-+. .++++++.+| T Consensus 178 --~~~~~~~~~y~~-----~~~---~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~l~Da~~ 247 (488) T protein:vir:23 178 --GNEIVSATLYLP-----DTT---MTWLRAEGEWEAPTSTPHGLEMVPVIPISNRTRLSDLYGTSEISPELRSVTDAAA 247 (488) T ss_pred --CCcEEEEEEEec-----CcE---EEEEecCCceEeccccccCCCCcceEEeccccccCCcCCccchhhhHHHHHHHHH Confidence 001222333321 111 11111222222223445566889999999888888999999885 6899999999 Q ss_pred HHHHHHHHHHHhhcCCceEeecccc-C---ccchhhhhccccccccccc-cccccccccccCCCcchHHHHHHHHHHHHH Q lcl|NC_019423. 392 ATMRGMIDLLGRSANGQRGYPKGML-D---TLNRRRYDDGQDYEYNPMQ-GNPSQSIMEHKFPELPQSAIVMTQMQNQEA 466 (756) Q Consensus 392 ~~~~~~~d~l~~~~~~~~~~~~gav-~---~~~~~~~~~~~~~~~~~~~-~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~ 466 (756) +.++.+.+.+...+.++..+- |.- + ..+......... ..+.+. ...+....+.+.+..+ ....+..+...+ T Consensus 248 ~~~s~~~~~~~~~a~p~~~i~-G~~~~~~~~~~~~~~~~~~~-~~~~v~~~~~g~~~~~~q~~~~~--~~~~~~~l~~~i 323 (488) T protein:vir:23 248 QILMNMQGTANLMAIPQRLIF-GAKPEELGINAETGQRMFDA-YMARILAFEGGEGAHAEQFSAAE--LRNFVDALDALD 323 (488) T ss_pred HHHHHHHHHHHHhhhHHHHHh-CCCcccccccccccchhhhh-hhhhhccCCCCCCceeEecCCCC--hHHHHHHHHHHH Confidence 999999999888887765432 211 1 000100000000 001111 1112234444444332 233445555554 Q ss_pred H---HHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecCceee Q lcl|NC_019423. 467 E---SLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNEQYVE 543 (756) Q Consensus 467 e---~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~ 543 (756) + ..|++++...|..... +.+|.++...............+.|..++++++++++.+. .... 
T Consensus 324 ~~~~~~~~~p~~~~g~~~~n-~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~----~~~~----------- 387 (488) T protein:vir:23 324 RKAASYSGLPPQYLSSSSDN-PASAEAIKAAESRLVKKVERKNKIFGGAWEQAMRLAYKMV----KGGD----------- 387 (488) T ss_pred HHHhcccCCCHHHhccccCc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----cCCC----------- Confidence 4 4567777777754321 1234345444444555555555666667777776665543 2110 Q ss_pred cCHhHhcCcceEEEecccccH--HHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCC-hhHHHHhhhccCCCChhhh Q lcl|NC_019423. 544 IKREDLKGNFDIEVDINTAEI--DNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRM-PDLAHELRTWQPQPDPMEE 620 (756) Q Consensus 544 i~~d~~~~~~Dv~V~~g~a~~--~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~-~~~~~~l~~~~~q~~p~~~ 620 (756) . +.++ .++.|.-..... ..+.+..+..+.+.....++... +++..++ ++..+.++.. T Consensus 388 ~-~~~~---~~i~v~f~~~~~~s~~~~ada~~kl~~~g~~~~s~et-------~~~~l~~~~d~~~~~~~~--------- 447 (488) T protein:vir:23 388 I-PTEY---YRMETVWRDPSTPTYAAKADAAAKLFANGAGLIPRER-------GWVDMGYTIVEREQMRQW--------- 447 (488) T ss_pred c-chhh---ccceEEecCCCCCCHHHHHHHHHHHHhcccccCCHHH-------HHHhCCCCchHHHHHHHH--------- Confidence 0 0011 122222222211 11222222333222222233322 1222222 1111111110 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 621 QLKQLAIQKAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGT 669 (756) Q Consensus 621 ~~~q~~~~~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~ 669 (756) .++++.+... .+.+.............+... ... 
......+ T Consensus 448 ----~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~--~~~-~~e~~~a 488 (488) T protein:vir:23 448 ----LEQDQKQGLG-LIGSLYGASTPEGKPGEAPVG--EPP-APEPDAA 488 (488) T ss_pred ----HHHHHHHHHH-HHHHHhccCCCcccCCCCCCC--CCC-CCCCCCC Confidence 0000000000 000000000000000000000 000 0000000 No 102 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=99.44 E-value=8.3e-13 Score=86.83 Aligned_cols=458 Identities=12% Similarity=0.049 Sum_probs=189.4 Q ss_pred C-CchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCCCC--------CCcccCHHHHHHHHHHHHHHHHh Q lcl|NC_019423. 22 W-KKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAKPPKIKG--------RSQVQPRLVRRQAEWRYAPLSEP 92 (756) Q Consensus 22 ~-~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~g--------rS~~v~~~v~~~~e~~~~~L~~~ 92 (756) | +.+++|..| ...|.....+..+..+||.|.-.. +..| .-++|.+..+..|+.....| . T Consensus 1 ~~t~~~~i~~L-------~~~~~~~~~r~~~l~~Yy~G~~~i---~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l--~ 68 (480) T protein:vir:78 1 MTTYHEHVERL-------QGLLARDLPNLLEAEAYRNGTRRL---KTIGIGAPPELAYLDVQPGWVATYLRTLSDRL--D 68 (480) T ss_pred CCCHHHHHHHH-------HHHHHHHHHHHHHHHHHHhccccc---cccccccchhHhhhhhhcchHHHHHHHHHhhh--c Confidence 3 333444434 334445555667888999986421 1111 11355555555555444433 0 Q ss_pred hcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeeeeeeeeeecCCC Q lcl|NC_019423. 93 FLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIKTETPVFQLYPI 172 (756) Q Consensus 93 f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~~ 172 (756) | +. | ..++|.+..+. +..++ ..|+--.....+++++++.|.+++.+| .-+. 
T Consensus 69 ~---~g---~--~~~~d~~~~~~----l~~i~-~~N~~d~~~~~~~~~a~~~G~ay~~v~-~~~~--------------- 119 (480) T protein:vir:78 69 I---EG---F--RISEDSEGLEE----LWNWW-QANDLDEESVLGHDDSLTFGRSYITVS-HPDV--------------- 119 (480) T ss_pred c---Cc---e--ecCCCchhHHH----HHHHH-HhcCHHHHHHHHHHHHhhcCceEEEEe-cCcc--------------- Confidence 1 11 2 13345444443 33333 344444445678899999999877762 1000 Q ss_pred CCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEEEechhheE--eC Q lcl|NC_019423. 173 ENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVEMLNPNNVV--ID 250 (756) Q Consensus 173 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie~V~p~~~~--~D 250 (756) ......|.++|..++|.+++ || T Consensus 120 --------------------------------------------------------~~~d~~g~~~i~~~~p~~~~~~~D 143 (480) T protein:vir:78 120 --------------------------------------------------------ESGDPAGIPLIRVESPLYMYAELD 143 (480) T ss_pred --------------------------------------------------------ccCCCCCeeEEEEEcccceEEEEc Confidence 00011256788899999865 45 Q ss_pred CCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceEEEEEEEEEeecc Q lcl|NC_019423. 251 PSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVAYEYWGFYDIN 330 (756) Q Consensus 251 p~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~E~w~k~d~~ 330 (756) |.....+ .+.+ +++.+.++ . ..+..+++|.. T Consensus 144 ~~~~~~~---~~~i-~~~~~~~~----------------------------------------~-~~~~~~~~y~~---- 174 (480) T protein:vir:78 144 PRNTRRV---TRAV-RLYTTRDD----------------------------------------V-AVPDRATLYLP---- 174 (480) T ss_pred CCCccce---EEEE-EEEEeecC----------------------------------------C-CceEEEEEEeC---- Confidence 5443222 2222 22211000 0 01223333431 Q ss_pred CCceeEEEEEEEECC----EEEEecccccCCCccceEEeeeeeecCcccCCchHHH-hHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_019423. 
331 DDGSLEPIVATWIGS----TLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAEL-LGDNQAILGATMRGMIDLLGRSA 405 (756) Q Consensus 331 ~~g~~~~~~~~~~g~----~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~-~~d~Q~~iN~~~~~~~d~l~~~~ 405 (756) +.. +++.+.++ .+...+..|...|.+|++++...++.+.+||.|-+.. ++++++.+|+.++.+.+.+...+ T Consensus 175 --~~~--~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a 250 (480) T protein:vir:78 175 --DET--VPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILG 250 (480) T ss_pred --CeE--EEEEecCCCccccccccccccCCCCCcceEEeecccccCCccCcccchhhHHHHHHHHHHHHHHHHHHHHhhc Confidence 111 11111111 1111223344457899999998888899999998875 89999999999999999998888 Q ss_pred CCceEeeccccCccchhhhhccc--cccccccccccccccccccCCCcc-hHHHHHHHHHHHHHHHHhchhHHhcCCCcc Q lcl|NC_019423. 406 NGQRGYPKGMLDTLNRRRYDDGQ--DYEYNPMQGNPSQSIMEHKFPELP-QSAIVMTQMQNQEAESLTGVKAFSGGVTGS 482 (756) Q Consensus 406 ~~~~~~~~gav~~~~~~~~~~~~--~~~~~~~~~~~~~~i~~~~~~~~~-~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~ 482 (756) .|+..+- |.-... ........ ....+.+....+...++.+.+... ..+...+......+-..++++....|..+. T Consensus 251 ~p~~~i~-G~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~ 328 (480) T protein:vir:78 251 TPLRVIS-GVTTDE-LTNDGENTTLDIYYGRILTLASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSE 328 (480) T ss_pred chhhhhh-cCCccc-cccccccchhhhhhhhhccCCCCCceEEecCccCHHHHHHHHHHHHHHHhcccCCChHHhccccC Confidence 8765442 321110 00000000 001111112223334445544432 223333444444444557788777775432 Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHhcCcceEEEecccc Q lcl|NC_019423. 483 AYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDLKGNFDIEVDINTA 562 (756) Q Consensus 483 a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~~~~~Dv~V~~g~a 562 (756) . +.+|.++......-........+.|..+++.++++++.+. ..... .+ ..++.|.=... 
T Consensus 329 n-~~Sg~Alk~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~----g~~~~-----~~-----------~~~i~v~f~~~ 387 (480) T protein:vir:78 329 N-PASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIM----GREVT-----EE-----------YTRLETVWRDP 387 (480) T ss_pred c-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHc----CCCcc-----cc-----------ceeeeEEecCC Confidence 1 1233334443333344444555566666666666555433 21100 01 11222322211 Q ss_pred cH--HHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCCh-hHHHHhhhccCCCChhhhhHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 563 EI--DNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMP-DLAHELRTWQPQPDPMEEQLKQLAIQKAQLENEELQS 639 (756) Q Consensus 563 ~~--~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~-~~~~~l~~~~~q~~p~~~~~~q~~~~~aq~e~~~~qa 639 (756) .. ....+..+..+.+.....++... +++..|+. +..+.++ +...++.+.-...+.+ T Consensus 388 ~~~s~~~~ad~~~kl~~~g~~~~s~et-------~~~~lg~~~d~~~~~~--------------~~~~e~~~~~~~~~~~ 446 (480) T protein:vir:78 388 STPTVAAKADAVSKLYANGQGPIPKEQ-------ARIDLGYTATQREQMR--------------DWDKQETEDMIDTLYS 446 (480) T ss_pred CCCCHHHHHHHHHHHHHhccccCCHHH-------HHhcCCCCHhHHHHHH--------------HHHHHHHHHHHHHhhc Confidence 11 11222233333333222233222 22223331 1111111 0000000000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 640 KIALNNAKAKEAASSGDLKDLDYLEQESGTKHARDMEKQK 679 (756) Q Consensus 640 ~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~~~~~ 679 (756) .. ...+.........+... +......+..+.. 
.+ T Consensus 447 ~~-~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~----~~ 480 (480) T protein:vir:78 447 TT-KAQADATPKPTVTETKT-ETQTSPSGFNRTK----TR 480 (480) T ss_pred cc-cccCCCCCCCCCCCCCC-ccccccCCCCccc----CC Confidence 00 00000000000000000 0000000000000 00 No 103 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=99.40 E-value=1.3e-11 Score=80.29 Aligned_cols=460 Identities=12% Similarity=0.027 Sum_probs=191.6 Q ss_pred CCCCCCCccccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCC-----CCCCCCCCcccCHHHHH Q lcl|NC_019423. 6 TFKPLPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAK-----PPKIKGRSQVQPRLVRR 80 (756) Q Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~-----~~~~~grS~~v~~~v~~ 80 (756) -..|.+. +.+...-..+ ++.....|...+.+.++..+||.|....+ .|....+=++|.+-... T Consensus 1 ~~~~i~~---------~~~~~~~~~~---~~~L~~~~~~~~~r~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ 68 (485) T protein:vir:24 1 MTAPLPG---------QEEIADPAIA---RDEMVSAFEDQNQNLRSNTSYYEAERRPEAIGVTVPVQMQSLLAHVGYPRL 68 (485) T ss_pred CCCCCCC---------CCcccchHHH---HHHHHHHHHHHHHHHHHHHHHHhccCchhhcCcccchhhhhhhhccchHHH Confidence 2222222 2221111111 22223445556666777889999874321 01110112244455555 Q ss_pred HHHHHHHHHHHhhcCCCCEEEEecCCc-chHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeee Q lcl|NC_019423. 81 QAEWRYAPLSEPFLSSSKLFKLTPVTF-EDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVK 159 (756) Q Consensus 81 ~~e~~~~~L~~~f~~~~~~~~~~p~~~-~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~ 159 (756) .|+.....| + .+ +.+. +|....+ .++-++ ..|+--.....+++++++.|.+.+.||++..... 
T Consensus 69 ivd~~~~~l---~--~~------g~~~~~~~~~~~----~l~~i~-~~N~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~ 132 (485) T protein:vir:24 69 YVDSIAERQ---A--VE------GFRLGDADEADE----ELWQWW-QANNLDIEAPLGYTDAYVHGRSYITISRPDPQID 132 (485) T ss_pred HHHHHhhhh---c--cC------ceecCCCchhHH----HHHHHH-HhcChhHHHHHHHHHHhhcCceEEEEecCCcccc Confidence 555444433 1 11 2222 2233222 233333 3443334456789999999999999866422100 Q ss_pred eeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeE Q lcl|NC_019423. 160 IKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTV 239 (756) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~i 239 (756) .....+.++| T Consensus 133 ----------------------------------------------------------------------~~~~~~~~~i 142 (485) T protein:vir:24 133 ----------------------------------------------------------------------LGWDPNVPLI 142 (485) T ss_pred ----------------------------------------------------------------------cccCCCcceE Confidence 0011245778 Q ss_pred EEechhhe--EeCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccce Q lcl|NC_019423. 240 EMLNPNNV--VIDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKK 317 (756) Q Consensus 240 e~V~p~~~--~~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~ 317 (756) ..++|.++ +||+.... + ..+.+++-+. .... T Consensus 143 ~~~~p~~~~~i~D~~~~~-~----~~~~~~~~~~------------------------------------------~~~~ 175 (485) T protein:vir:24 143 RVEPPTRMYAEIDPRIGR-P----AKAIRVAYDA------------------------------------------EGNE 175 (485) T ss_pred EEeccceeEEEeeCCcCc-e----eEEEEEEEee------------------------------------------cCCe Confidence 88999987 44554321 1 1111211100 0001 Q ss_pred EEEEEEEEEeeccCCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHH-HhHHHHHHHHHHHHH Q lcl|NC_019423. 
318 VVAYEYWGFYDINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAE-LLGDNQAILGATMRG 396 (756) Q Consensus 318 V~v~E~w~k~d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~-~~~d~Q~~iN~~~~~ 396 (756) +..+++|.. +. .++++..++........|.+.|.+|+|++...+..+..||.|-+. .++++++.+|+.++. T Consensus 176 ~~~~~~y~~-----~~---~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~liDa~~~~~s~ 247 (485) T protein:vir:24 176 IQAATLYTP-----NE---TFGWFRAEGEWVEWFSDPHGLGAVPVVPLPNRTRLSDLYGTSEITPELRSMTDAAARILML 247 (485) T ss_pred EEEEEEEcC-----Cc---EEEEEecCCceEeecccccCCCcccEEEeccCcccCCcCCcccchhhHHHHHHHHHHHHHH Confidence 223333321 11 111222233332223334445789999999888888899999886 689999999999999 Q ss_pred HHHHHHhhcCCceEeeccc----cCccchhhhhccccccccccccccccccccccCCCcchHHHHHHHHHHHHHHHH--- Q lcl|NC_019423. 397 MIDLLGRSANGQRGYPKGM----LDTLNRRRYDDGQDYEYNPMQGNPSQSIMEHKFPELPQSAIVMTQMQNQEAESL--- 469 (756) Q Consensus 397 ~~d~l~~~~~~~~~~~~ga----v~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~e~~--- 469 (756) +..++...+.|+..+- |. +...+........ ...+.+-..++...+..+.+.. .....+..+...+..+ T Consensus 248 ~~~~~~~~a~p~~~i~-G~~~~~~~~~~~~~~~~~~-~~~~~i~~~~~~~~~~~q~~~~--~~e~~~~~l~~~i~~~s~~ 323 (485) T protein:vir:24 248 MQATAELMGVPQRLIF-GIKPEEIGVDPETGQTLFD-AYLARILAFEDAEGKIQQFSAA--ELANFTNALDQIAKQVAAY 323 (485) T ss_pred HHHHHHhhcchhhhhc-cCCccccccccccccchhh-hcccceeccCCCCceEEeeccc--chHHHHHHHHHHHHHHhcc Confidence 9999988888776543 21 1000010000000 0011111112223333343322 2334556666666655 Q ss_pred hchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHh Q lcl|NC_019423. 470 TGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDL 549 (756) Q Consensus 470 tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~ 549 (756) +++++...|..+.. +.++.++...............+.|..+++.++++++.+...-.... 
+ T Consensus 324 ~~~p~~~fg~~~~n-~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~~~~~~~----------------d- 385 (485) T protein:vir:24 324 TGLPPQYLSTAADN-PASAEAIRAAESRLIKKVERKNAIFGGAWEEAMRLAYRLMKGGDVPP----------------D- 385 (485) T ss_pred cCCCHHHhccccCc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcc----------------c- Confidence 56777777754321 12333455455555555666667777788888777766432110000 0 Q ss_pred cCcceEEEecccccH--HHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCCh-hHHHHhhhccCCCChhhhhHHHHH Q lcl|NC_019423. 550 KGNFDIEVDINTAEI--DNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMP-DLAHELRTWQPQPDPMEEQLKQLA 626 (756) Q Consensus 550 ~~~~Dv~V~~g~a~~--~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~-~~~~~l~~~~~q~~p~~~~~~q~~ 626 (756) ..++.|.=..+.. ....+.....+.+.....++...+ ++..++. +..+.+++...... + T Consensus 386 --~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~-------~~~l~~~~d~~~e~~~~~ee~~---------~ 447 (485) T protein:vir:24 386 --MLRMETVWRDPSTPTYAAKADAATKLYGNGQGVIPRERA-------RKDMGYSIAEREEMRRWDEEEA---------A 447 (485) T ss_pred --cceeeEEecCCCCCCHHHHHHHHHHHHhcccccCCHHHH-------HhhCCCCHhHHHHHHHHHHHHh---------h Confidence 0122222221111 111122222222211112222111 1222221 11111111000000 0 Q ss_pred HHHHHHHHHHH-----HHHHHHHHHHHH-HHHHHHHHH Q lcl|NC_019423. 627 IQKAQLENEEL-----QSKIALNNAKAK-EAASSGDLK 658 (756) Q Consensus 627 ~~~aq~e~~~~-----qa~a~~~~a~a~-~~~aq~~~~ 658 (756) +....++.... +.+.....+... 
.+..-.+.+ T Consensus 448 ~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~a 485 (485) T protein:vir:24 448 MGLGLLGTMVDADPTVPGSPNPTPAPKPQPAIEGGDSA 485 (485) T ss_pred hhhhHHHhhcccCCCCCCCCCCCCCCCCccCCCCCCCC Confidence 00000000000 000000000000 000000000 No 104 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=99.40 E-value=1.8e-11 Score=79.51 Aligned_cols=451 Identities=9% Similarity=0.056 Sum_probs=181.8 Q ss_pred CCCCCCCccccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCC--CCCCCCCC--c----ccCHH Q lcl|NC_019423. 6 TFKPLPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAK--PPKIKGRS--Q----VQPRL 77 (756) Q Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~--~~~~~grS--~----~v~~~ 77 (756) -| +-+. ++|+.+++...|. .+....|.....+.++-.+||.|..... +++...+. + .|.+- T Consensus 1 ~~----~~p~----~~l~~~~~~~~~~---~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~n~ 69 (479) T protein:vir:99 1 MI----DLPD----EDLSSEGLAKYLE---TKVFPKMNTECERLDDFEAWTKNGQEVPDLATRHKNKEREVLQQLSRKPW 69 (479) T ss_pred Cc----cCCc----ccCChhHHHHHHH---HHHHHHHHHHhHHHHHHHHHHhcCCcccccccccCChhHHHHHHHhhcCc Confidence 11 1110 1344444433332 2334455666667778889999864321 01111110 0 12233 Q ss_pred HHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcch-HHHHHHHHhhcCceEEEEeeeee Q lcl|NC_019423. 78 VRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKL-VDDYVHSIVDDGTGIARIGWERK 156 (756) Q Consensus 78 v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~-~~~~v~~al~~g~gi~k~~w~~~ 156 (756) .+..|+.....| .|.+.+..|.+..+.....+ ..| .+.. ...+++++++.|.+++.+|+... 
T Consensus 70 ~~~iVd~~~~~l-----------~~~gf~~~d~~~~~~~~~i~-----~~N-~~d~~~~~~~~~a~~~G~af~~v~~~~~ 132 (479) T protein:vir:99 70 MGLMVNSFAQQL-----------IVDGYRKTGTNENAKGWDTW-----RLN-QMDKQQFWLNRAVLTFGYAFIKVTSGIS 132 (479) T ss_pred HHHHHHHHHhhc-----------ccccccCCCchhhHHHHHHH-----Hhc-ChhHHHHHHHHHHhhcCceEEEEecCCC Confidence 333333222211 13333444555444433322 233 3444 44578999999988776643111 Q ss_pred eeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCc Q lcl|NC_019423. 157 TVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNR 236 (756) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~ 236 (756) . ..-.|. T Consensus 133 ~-------------------------------------------------------------------------~d~~g~ 139 (479) T protein:vir:99 133 P-------------------------------------------------------------------------LDGTTV 139 (479) T ss_pred C-------------------------------------------------------------------------cCCCCc Confidence 0 001244 Q ss_pred eeEEEechhheEe--CCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccc Q lcl|NC_019423. 237 PTVEMLNPNNVVI--DPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDAL 314 (756) Q Consensus 237 ~~ie~V~p~~~~~--Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s 314 (756) ++|..++|.+++. |...+. .-+.|.++. + . T Consensus 140 ~~i~~~~p~~~~~iydd~~~~--~~~~~~~~~-~--------------------------------------------~- 171 (479) T protein:vir:99 140 ARIKCIDPRDAFAIWEDPYWD--EWPKYLLER-Q--------------------------------------------P- 171 (479) T ss_pred eEEEEechhheEEEecCCccc--ceeeEEEee-c--------------------------------------------C- Confidence 6778888888753 322111 001111000 0 0 Q ss_pred cceEEEEEEEEEeeccCCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHHHH Q lcl|NC_019423. 
315 RKKVVAYEYWGFYDINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGATM 394 (756) Q Consensus 315 ~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~~~ 394 (756) ...+ .+|...+ .+.+...++........|-..|.+|++++...+..+. +|.|.++.++++++.+|+.+ T Consensus 172 ~~~~---~~~~~~~--------~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~~~-~g~sd~e~v~~liDa~~~~~ 239 (479) T protein:vir:99 172 NGQY---WWWTEED--------YSIFEFKQGKFIYRETVSHDYGHIPFVRYVNVMDLRG-VCYGDVEPLVTVAKAIDKTG 239 (479) T ss_pred ceeE---EEEecce--------EEEEEecCCceeeccccccCCCCcceEEeecCCCcCc-CCcchhHHHHHHHHHHHHHH Confidence 0001 1111100 0111111121111122233347899999888777644 79999999999999999999 Q ss_pred HHHHHHHHhhcCCceEeeccccC-ccchhhhhccccccccccccccccccccccCCCcc-hHHHHHHHHHHHHHHHHhch Q lcl|NC_019423. 395 RGMIDLLGRSANGQRGYPKGMLD-TLNRRRYDDGQDYEYNPMQGNPSQSIMEHKFPELP-QSAIVMTQMQNQEAESLTGV 472 (756) Q Consensus 395 ~~~~d~l~~~~~~~~~~~~gav~-~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~-~~~~~~l~~~~~~~e~~tGv 472 (756) +.+...+...+.++..+. |... ......... .....+.+....+...+..+.+... ..+...++.....+-..|++ T Consensus 240 s~~~~~~~~~a~p~~~i~-G~~~~~~~~~~~~~-~~~~~~~i~~~~~~~~~~~q~~~~~~~~~~~~l~~~i~~i~~~t~~ 317 (479) T protein:vir:99 240 LDILLVQHHQSFQIRWAT-GLMLPEGANADQEK-MRFAQESMLISQNEKASFGAIPAAPLDGLLNAYKESLLEFLALAQL 317 (479) T ss_pred HHHHHHHHHhhchhhhhc-CCCcccccccchhc-cccccccceeecCCCceEEEecccchHHHHHHHHHHHHHHhccCCC Confidence 999998888888875443 2211 111101000 0111111212223344555554332 22333444444455555678 Q ss_pred hHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHhcCc Q lcl|NC_019423. 473 KAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDLKGN 552 (756) Q Consensus 473 ~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~~~~ 552 (756) +....|..++ .++.++...............+.|..+++.++++++.+. +...- ... 
T Consensus 318 p~~~~g~~~n---~Sg~Al~~~~~~l~~ka~~~~~~f~~al~~~~~l~~~~~----~~~~~----------------~~~ 374 (479) T protein:vir:99 318 PPHIAGQIVN---VAADALAAGTRQTMQKLFEKQATWKASHNQTMRLVNKIE----GRTEE----------------ATD 374 (479) T ss_pred CHHHcccccc---hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHc----CCCcc----------------ccc Confidence 8888885544 233334444444444455555666677777766665433 21100 001 Q ss_pred ceEEEecc---cccHHHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCChhH-HHHhhhccCCCChhhhhHHHHHHH Q lcl|NC_019423. 553 FDIEVDIN---TAEIDNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMPDL-AHELRTWQPQPDPMEEQLKQLAIQ 628 (756) Q Consensus 553 ~Dv~V~~g---~a~~~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~~~-~~~l~~~~~q~~p~~~~~~q~~~~ 628 (756) +++.+.=. +.+.. +.++.+..+.+. ..++.+.+.. .++++.+. .+.++..... +. T Consensus 375 ~~i~~~w~~~~~~s~~-~~ad~~~kl~~a--g~is~et~l~------~l~gv~~~~~e~~~~~~~~------------~~ 433 (479) T protein:vir:99 375 LDFTITWQDVTIQSLA-QFADAWAKMVES--LKIPAEGVWD------MIPNLDQSTVNGWKEIYDR------------EG 433 (479) T ss_pred eeeeEEecCCCCCCHH-HHHHHHHHHHhc--CCCCHHHHHH------hcCCCCHHHHHHHHHHHHH------------HH Confidence 22333211 11111 222222222222 1233322211 11233211 0111100000 00 Q ss_pred HHHHHHHHHHHHHHHHHHHH--------HHHHHH---HHHHHHHHHHHHHHH Q lcl|NC_019423. 629 KAQLENEELQSKIALNNAKA--------KEAASS---GDLKDLDYLEQESGT 669 (756) Q Consensus 629 ~aq~e~~~~qa~a~~~~a~a--------~~~~aq---~~~~~~~~~~q~~~~ 669 (756) .....+..+... ...++. ...++. .+-+++. .. +. 
T Consensus 434 ~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~--~~ 479 (479) T protein:vir:99 434 DFGKYMRKLQNG--PDPAEQRGGPNGATNMQQANNKTGEPASLN--KS--GA 479 (479) T ss_pred HHHHHHHHHhcc--cCcccccCCCCCCCCCCCCCCCCcchhccC--CC--CC Confidence 000000000000 000000 000000 0000000 00 00 No 105 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=99.40 E-value=1.7e-12 Score=85.07 Aligned_cols=461 Identities=13% Similarity=0.050 Sum_probs=186.4 Q ss_pred CC-chHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCC------CCCCCCCCcccCHHHHHHHHHHHHHHHHhhc Q lcl|NC_019423. 22 WK-KEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAK------PPKIKGRSQVQPRLVRRQAEWRYAPLSEPFL 94 (756) Q Consensus 22 ~~-~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~------~~~~~grS~~v~~~v~~~~e~~~~~L~~~f~ 94 (756) |+ ..++|..|. ..|.....+.++-.+||+|.-..+ ++..+ +-++|.+-....|+.....| + T Consensus 1 ~~t~~d~i~~L~-------~~~~~~~~r~~~~~~Yy~G~~~i~~~~~~~~~~~~-~~~~~~n~~~~ivd~~~~~l----~ 68 (480) T protein:vir:78 1 MTTYHEHVERLQ-------GLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELA-YLDVQPGWVATYLRTLSDRL----D 68 (480) T ss_pred CCCHHHHHHHHH-------HHHHHHHHHHHHHHHHHhccccchhcccccchhhh-hhhhhcchHHHHHHHHHhhh----c Confidence 42 233333333 334445555667789999863211 11111 11244555555555444333 1 Q ss_pred CCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeeeeeeeeeecCCCCC Q lcl|NC_019423. 95 SSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIKTETPVFQLYPIEN 174 (756) Q Consensus 95 ~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~~~~ 174 (756) .+. | ..++|.+. .+.++-++ ..|+--.....+++++++.|.+++.+| .-+. 
T Consensus 69 -~~g---~--~~~~d~~~----~~~l~~i~-~~N~~~~~~~~~~~~a~~~G~ay~~v~-~~~~----------------- 119 (480) T protein:vir:78 69 -IEG---F--RISEDSEG----LEELWNWW-QANDLDEESVLGHDDSLTFGRAYITVS-HPDV----------------- 119 (480) T ss_pred -cCc---e--ecCCCchh----HHHHHHHH-HhcCHHHHHHHHHHHHhhcCceEEEee-cCcc----------------- Confidence 111 1 12344433 23444433 345444556678899999999977762 1000 Q ss_pred HHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEEEechhheE--eCCC Q lcl|NC_019423. 175 QEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVEMLNPNNVV--IDPS 252 (756) Q Consensus 175 ~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie~V~p~~~~--~Dp~ 252 (756) . .....|.++|..++|.+++ |||. T Consensus 120 ------------------~------------------------------------~~d~~~~~~i~~~~p~~~~~i~D~~ 145 (480) T protein:vir:78 120 ------------------E------------------------------------SGDPAGIPLIRVESPLYMYAELDPR 145 (480) T ss_pred ------------------c------------------------------------cCCCCCeeEEEEEcccceEEEEcCC Confidence 0 0011255788999999865 4555 Q ss_pred CcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceEEEEEEEEEeeccCC Q lcl|NC_019423. 253 CNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVAYEYWGFYDINDD 332 (756) Q Consensus 253 a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~E~w~k~d~~~~ 332 (756) .... ..+.+ +.+.+.++ ...+..+++|.. + T Consensus 146 ~~~~---~~~~i-~~~~~~d~-----------------------------------------~~~~~~~~~y~~-----~ 175 (480) T protein:vir:78 146 NTRR---VTRAV-RLYTTRDD-----------------------------------------VAVPDRATLYLP-----D 175 (480) T ss_pred Cccc---eEEEE-EEEEeecC-----------------------------------------CcceEEEEEEeC-----C Confidence 4322 22222 22211100 001223333321 1 Q ss_pred ceeEEEEEEEECC----EEEEecccccCCCccceEEeeeeeecCcccCCchHH-HhHHHHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_019423. 
333 GSLEPIVATWIGS----TLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAE-LLGDNQAILGATMRGMIDLLGRSANG 407 (756) Q Consensus 333 g~~~~~~~~~~g~----~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~-~~~d~Q~~iN~~~~~~~d~l~~~~~~ 407 (756) .+ +.+...++ .....+..|...|.+|++++...++.+..||.|-+. .++++++.+|+.++.+...+...+.| T Consensus 176 ~~---~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~sdi~~~i~~l~Da~~~~~s~~~~~~~~~a~p 252 (480) T protein:vir:78 176 ET---VPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTP 252 (480) T ss_pred eE---EEEEecCCCcccccccccccccCCCCcceEEeecccccCCccCccchhHHHHHHHHHHHHHHHHHHHHHHhhcch Confidence 11 11111111 112223334445789999999888889999999886 58999999999999999999888888 Q ss_pred ceEeeccccCccchhhhhccccc--cccccccccccccccccCCCcc-hHHHHHHHHHHHHHHHHhchhHHhcCCCcccc Q lcl|NC_019423. 408 QRGYPKGMLDTLNRRRYDDGQDY--EYNPMQGNPSQSIMEHKFPELP-QSAIVMTQMQNQEAESLTGVKAFSGGVTGSAY 484 (756) Q Consensus 408 ~~~~~~gav~~~~~~~~~~~~~~--~~~~~~~~~~~~i~~~~~~~~~-~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~ 484 (756) +..+. |.-.. ........... ..+.+-...+...++.+.+... ..+...+......+-.++++++...|..+.. T Consensus 253 ~~~i~-G~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n- 329 (480) T protein:vir:78 253 LRVIS-GVTTD-ELTNDGENTTLDIYYGRILTLASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSEN- 329 (480) T ss_pred hhhhh-CCCcc-ccccccccchhhhhhhhhccCCCCCceEEecCccCHHHHHHHHHHHHHHHhcccCCCHHHhccccCc- Confidence 75442 32110 00000000000 0111112223334444444332 2233334444444444567777777743321 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHhcCcceEEEecccccH Q lcl|NC_019423. 485 GDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDLKGNFDIEVDINTAEI 564 (756) Q Consensus 485 ~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~~~~~Dv~V~~g~a~~ 564 (756) ..++.++......-........+.|..+++.++++++.+. .... . .+ ..++.|.=..... 
T Consensus 330 ~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~----~~~~--~---~~-----------~~~i~v~w~~~~~ 389 (480) T protein:vir:78 330 PASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIM----GREV--T---EE-----------YTRLETVWRDPST 389 (480) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHc----CCCc--c---cc-----------ceeeeEEecCCCC Confidence 1233334444444444455555666667766666555432 2110 0 00 1123332221111 Q ss_pred --HHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCChhHHHHhhhccCCCChhhhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 565 --DNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMPDLAHELRTWQPQPDPMEEQLKQLAIQKAQLENEELQSKIA 642 (756) Q Consensus 565 --~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~~~~~~l~~~~~q~~p~~~~~~q~~~~~aq~e~~~~qa~a~ 642 (756) ..+.+..+..+.+.....+.... +++..|+. +++... ..+...++.+.-...+.+.. T Consensus 390 ~s~~~~ad~~~kl~~~g~~~~s~et-------~~~~lg~~------------~d~~~e-~~~~~~~~~~~~~~~~~~~~- 448 (480) T protein:vir:78 390 PTVAAKADAVSKLYANGQGPIPKEQ-------ARIDLGYT------------ATQREQ-MRDWDKQETEDMIDTLYSTT- 448 (480) T ss_pred CCHHHHHHHHHHHHHhcccCCCHHH-------HHhcCCCC------------HhHHHH-HHHHHHHHHHHHHHHhhccc- Confidence 11222223333332222222211 12222331 011000 00000000000000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 643 LNNAKAKEAASSGDLKDLDYLEQESGTKHARDMEKQK 679 (756) Q Consensus 643 ~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~~~~~ 679 (756) ...+.++......+.. ...+.+.-.+-+. 
..+ T Consensus 449 ~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~--~~~ 480 (480) T protein:vir:78 449 KAQADATPKPTVTETK---TETQTSPSGFNRT--KTR 480 (480) T ss_pred cCCCccccCCCCCCCC---CccCCCcccCCCc--CCC Confidence 0000000000000000 0000000000000 000 No 106 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=99.37 E-value=5.4e-11 Score=76.89 Aligned_cols=391 Identities=13% Similarity=0.011 Sum_probs=177.5 Q ss_pred CchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCC------CCCCCCCCcccCHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_019423. 23 KKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAK------PPKIKGRSQVQPRLVRRQAEWRYAPLSEPFLSS 96 (756) Q Consensus 23 ~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~------~~~~~grS~~v~~~v~~~~e~~~~~L~~~f~~~ 96 (756) -+..+|..|.+.+.. ...+.++-.+||.|....+ ++..+.+.+.|.+-.+..|+.+...| . T Consensus 1 ~~~~~i~~L~~~~~~-------~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~iVds~a~rl--~---- 67 (409) T protein:vir:94 1 MTEKGIGYLRFKLSV-------HKRRAEMRYDQYAMKYVDRFKGITIPQALSQQYRSILGWCAKGVDSLADRL--V---- 67 (409) T ss_pred CCHHHHHHHHHHHHH-------HhHHHHHHHHHhcccCchhhcChhhhHHHHHHHhhhcchhHHHHHHhHhhc--c---- Confidence 333455555444332 3344556678999864321 11122233344454455454433322 1 Q ss_pred CCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeeeeeeeeeecCCCCCHH Q lcl|NC_019423. 97 SKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIKTETPVFQLYPIENQE 176 (756) Q Consensus 97 ~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~ 176 (756) |...+..|.. 
+.-+ +..|+--.....++++|++.|.+++.+ | T Consensus 68 -----~~Gf~~~d~~--------l~~i-~~~N~ld~~~~~~~~~aliyG~sf~~v-~----------------------- 109 (409) T protein:vir:94 68 -----FREFENDDFT--------VNEI-FEENNPDIFFDSAVLSSLIASCSFTYI-S----------------------- 109 (409) T ss_pred -----cCcccCCchH--------HHHH-HHhcChhHHHHHHHHHHHHhcceeEEE-e----------------------- Confidence 1111223321 1112 222222222334556666666665554 1 Q ss_pred HHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEEEechhhe--EeCCCCc Q lcl|NC_019423. 177 QADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVEMLNPNNV--VIDPSCN 254 (756) Q Consensus 177 ~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie~V~p~~~--~~Dp~a~ 254 (756) . ...|.|+|..++|.++ +|||..+ T Consensus 110 ---------------------------------------------~---------~~dg~~~i~~~sp~~~~~i~D~~~~ 135 (409) T protein:vir:94 110 ---------------------------------------------K---------GENDAVRLQVIEAVNATGIIDPITG 135 (409) T ss_pred ---------------------------------------------c---------CCCCceEEEEeccceEEEEEecCCC Confidence 0 0124577788888875 4566432 Q ss_pred CccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceEEEEEEEEEeeccCCce Q lcl|NC_019423. 255 GDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVAYEYWGFYDINDDGS 334 (756) Q Consensus 255 ~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~E~w~k~d~~~~g~ 334 (756) . +. ...+.+-. + . .. ......+|.. +. T Consensus 136 ~-~~----~a~~~~~~-d-------------~---------------------------~~-~~~~~~~~~~-----~~- 162 (409) T protein:vir:94 136 L-LT----EGYAVLER-D-------------E---------------------------NN-NVVLEAHFLP-----DR- 162 (409) T ss_pred c-ee----eeEEEEEe-c-------------C---------------------------CC-ceEEEEEEec-----Cc- Confidence 1 11 11121100 0 0 00 0001111110 00 Q ss_pred eEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchH-HHhHHHHHHHHHHHHHHHHHHHhhcCCceEe-- Q lcl|NC_019423. 
335 LEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADA-ELLGDNQAILGATMRGMIDLLGRSANGQRGY-- 411 (756) Q Consensus 335 ~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v-~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~-- 411 (756) .+.++.++......+||+ |.+|+|++...++.++.+|.|-+ +.++++|+.+|+.+..+.......+.|+..+ T Consensus 163 ---~~~~~~~~~~~~~~~n~~--g~vPvV~f~n~~~~~~~~G~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G 237 (409) T protein:vir:94 163 ---TDYYYRDSRNNISIANPT--GHPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNAKRTLERADVTAEFYSFPQKYVTG 237 (409) T ss_pred ---EEEEEecCceeEeeeCCC--CCcceEEeccccccccccCccccchhHHHHHHHHHHHHHHHHHHHHHhcChhheeEe Confidence 000111111222345665 78999999999999999999977 7899999999999999999999999887544 Q ss_pred -eccccCccchhhhhccccccccccccccccccccccCCCcc-hHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHH Q lcl|NC_019423. 412 -PKGMLDTLNRRRYDDGQDYEYNPMQGNPSQSIMEHKFPELP-QSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAA 489 (756) Q Consensus 412 -~~gav~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~-~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~ 489 (756) ++++. +.+. ++..............+..++..+.+.-. +.+...+..+...+-.+||++....|.... ...+|. T Consensus 238 ~d~d~~-~~~~--~~~~~~~i~~~~~d~dg~~~~v~q~~~~~l~~~~~~l~~~~~~~a~~t~lP~~~lg~~~~-NpsSa~ 313 (409) T protein:vir:94 238 LSDDAE-PMET--WKATVSSMLQFTKDEDGDKPTLGQFTQPSMSPFTEQLRTAAAGFAGETGLTLDDLGFVSD-NPSSVE 313 (409) T ss_pred cCCCCc-ccch--hhhhHHHhhcCCCCCCCCCceEEecCCCChhHHHHHHHHHHHHHhhhcCCCHHHhccccC-chhHHH Confidence 22211 1111 11111111000000112234443433322 233455555566666677899888886542 113444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHhcCcceEEEecc---cccHH- Q lcl|NC_019423. 490 GIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDLKGNFDIEVDIN---TAEID- 565 (756) Q Consensus 490 ~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~~~~~Dv~V~~g---~a~~~- 565 (756) ++......-........+.|..+++.++++++.+.-..-.. ++++ +++.|.=. ++... 
T Consensus 314 Al~a~~~~L~~~a~~k~~~fg~~~~~~~rla~~i~~~~~~~---------------~~~~---~~~~v~W~p~~~~~~~~ 375 (409) T protein:vir:94 314 AIKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDDAPYL---------------REQF---RKTKPKWEPLFEADASM 375 (409) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcc---------------cccc---ccceEEeccCCCcchHH Confidence 45544444444445555667778888888777765432111 0111 11221111 11111 Q ss_pred -HHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCChhHH Q lcl|NC_019423. 566 -NQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMPDLA 605 (756) Q Consensus 566 -~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~~~~ 605 (756) .+.+..+.-+.+. ++.+... ..+++..|+.+-+ T Consensus 376 ~a~~aDa~~Kl~~a-g~~~~~~------~~~~~~lG~~~~d 409 (409) T protein:vir:94 376 LSLIGDGAIKLNQA-IPEFINK------DTIRDLTGIEGGE 409 (409) T ss_pred HHHHHHHHHHHHHh-cccccch------hHHHHHcCCCCCC Confidence 1111122222222 2211110 1244566665433 No 107 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=99.36 E-value=2e-11 Score=79.29 Aligned_cols=466 Identities=10% Similarity=0.042 Sum_probs=189.4 Q ss_pred ccCCCCCCCccccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCC------CCCCCCCCcccCHH Q lcl|NC_019423. 4 QDTFKPLPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAK------PPKIKGRSQVQPRL 77 (756) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~------~~~~~grS~~v~~~ 77 (756) +.+. -..++++..++++..|...+.. 
+..+.++-.+||.|....+ ++..+ +=++|.+- T Consensus 1 ~~~~--------~~~~~~~~~~~~~~~l~~~~~~-------~~~rl~~l~~Yy~G~~~i~~~~~~~~~~~~-~~~~~~n~ 64 (484) T protein:vir:77 1 MTSP--------LQKQENVDPEKAREEMLNLFTE-------RTQDLGDNTAYYESERRPDAVGVTVPQQMQ-KLLAHVGY 64 (484) T ss_pred CCCc--------ccccCCCCHHHHHHHHHHHHHH-------HHHHHHHHHHHHhccccchhcccccchhHH-hhhhhcCc Confidence 2222 2244667777777777665543 1233456678999863311 01111 11234444 Q ss_pred HHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeee Q lcl|NC_019423. 78 VRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKT 157 (756) Q Consensus 78 v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~ 157 (756) .+..|+.....|. |. -|. .++|.+... .++-++ ..|+--.....+++++++.|.+.+.||++... T Consensus 65 ~~~ivd~~~~~l~--~~------g~~--~~~~~~~~~----~l~~i~-~~N~~d~~~~~~~~~a~~~G~a~~~v~~~~~~ 129 (484) T protein:vir:77 65 PRLYIDAIAARQE--LE------GFR--LGGADKADE----QLWDWW-QANDLDIESTLGHTDSLVHGRSYITISKPDPN 129 (484) T ss_pred HHHHHHHHHhhhc--cC------cee--cCCcchhHH----HHHHHH-HhcCHhHHHHHHHHHHhhcCceEEEEecCCCC Confidence 4444554443331 11 122 123333322 344433 34443344557899999999999998764221 Q ss_pred eeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCce Q lcl|NC_019423. 158 VKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRP 237 (756) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~ 237 (756) .. .....+.+ T Consensus 130 ~~----------------------------------------------------------------------~~~~~~~~ 139 (484) T protein:vir:77 130 ID----------------------------------------------------------------------PGVDPEVP 139 (484) T ss_pred cc----------------------------------------------------------------------cccccccc Confidence 00 00112346 Q ss_pred eEEEechhheE--eCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhcccccccccccccc Q lcl|NC_019423. 
238 TVEMLNPNNVV--IDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALR 315 (756) Q Consensus 238 ~ie~V~p~~~~--~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~ 315 (756) +|..++|.+++ ||+..+ . ..+. .+.+.+. + . T Consensus 140 ~i~~~~p~~~~~~~D~~~~-~---~~~a-~~~~~~~---------------------------~---------------~ 172 (484) T protein:vir:77 140 IIRVEPPTNLYAQIDPRTR-Q---VMRA-IRAIEDE---------------------------E---------------G 172 (484) T ss_pred eEEEeccceeEEEecCCCC-c---eEEE-EEEEEee---------------------------c---------------C Confidence 77888899875 465422 1 1121 1222110 0 0 Q ss_pred ceEEEEEEEEEeeccCCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHH-HhHHHHHHHHHHH Q lcl|NC_019423. 316 KKVVAYEYWGFYDINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAE-LLGDNQAILGATM 394 (756) Q Consensus 316 ~~V~v~E~w~k~d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~-~~~d~Q~~iN~~~ 394 (756) ..+..+++|.. +.. +.+.-.++.....+..|-+.|.+|++++...+..+.++|.|.+. .++++++.+|+.+ T Consensus 173 ~~~~~~~~y~~-----~~~---~~~~~~~~~~~~~~~~~~~~g~vPvv~f~N~~~~~~~~G~s~i~~~v~~L~Da~~~~~ 244 (484) T protein:vir:77 173 NEVIGATLYLP-----NNT---VIWNREDGQWVQVANVAHNLEMVPVIPIPNRTRLSDLYGTTEITPELRSVTDAAARTL 244 (484) T ss_pred CcEEEEEEEec-----CeE---EEEEecCCceEeeccccCCCCCcceEEeccccccCccCCcccchHHHHHHHHHHHHHH Confidence 01222233321 000 01111111111122233445789999999888889999999886 6899999999999 Q ss_pred HHHHHHHHhhcCCceEeecccc-Cc---cchhhhhccccccccccccccccccccccCCCcchHHHHHHHHHHHHHHHH- Q lcl|NC_019423. 395 RGMIDLLGRSANGQRGYPKGML-DT---LNRRRYDDGQDYEYNPMQGNPSQSIMEHKFPELPQSAIVMTQMQNQEAESL- 469 (756) Q Consensus 395 ~~~~d~l~~~~~~~~~~~~gav-~~---~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~e~~- 469 (756) +.+.......+.|+..+- |.- +. .+........ 
...+.+-..++...+..+.+..+ ....+..+...+..+ T Consensus 245 s~~~~~~~~~a~p~~~i~-G~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~q~~~~~--~e~~~~~l~~~i~~~s 320 (484) T protein:vir:77 245 MLMQATAELMGVPQRLLF-GVKGEELGVDPETGQTLFD-AYLARILAFEDHESKAQQFSAAE--LRNFVDALDALDRKAA 320 (484) T ss_pred HHHHHHHHhhhhhHHHHh-CCCcchhcccccccchhhh-hhhhhhcccCCCCceeEeecCCC--hHHHHHHHHHHHHHHh Confidence 999999988887765442 221 10 0000000000 00111111122334444444332 223455555555555 Q ss_pred --hchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHh Q lcl|NC_019423. 470 --TGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKRE 547 (756) Q Consensus 470 --tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d 547 (756) +++++...|..+.. +.+|.++......-........+.|..++++++++++.+. ... .+. . T Consensus 321 ~~~~~p~~~fg~~~~n-~~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~l~~~~~----~~~-----------~~~-~ 383 (484) T protein:vir:77 321 AYTGLPPYYLSFSSEN-PASAEAIRSSESRLVKTVERKNKIFGGAWEQAMRVAYKVM----NGG-----------DIP-P 383 (484) T ss_pred cccCCCHHHhccccCc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----CCC-----------Ccc-c Confidence 56777777754321 1233334433333334444455566667766666555442 110 000 0 Q ss_pred HhcCcceEEEecccccH--HHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCChhHHHHhhhccCCCChhhhhHHHH Q lcl|NC_019423. 548 DLKGNFDIEVDINTAEI--DNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMPDLAHELRTWQPQPDPMEEQLKQL 625 (756) Q Consensus 548 ~~~~~~Dv~V~~g~a~~--~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~~~~~~l~~~~~q~~p~~~~~~q~ 625 (756) + .+++.|.=..... ..+.+..+..+.+.....+.- ..+++..|+- +++. 
+.+ T Consensus 384 ~---~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~gi~s~-------et~~~~l~~~------------~~~~----~e~ 437 (484) T protein:vir:77 384 E---YYRMESIWRDPSTPTYAAKADAATKLYNNGQGVIPK-------ERARIDMGYS------------ITER----EEM 437 (484) T ss_pred c---cccceEEecCCCCCCHHHHHHHHHHHHhccCCCCCH-------HHHHhcCCCC------------hhHH----HHH Confidence 0 1123333222221 111222222222211111221 1122222321 1110 001 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCC Q lcl|NC_019423. 626 AIQKAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARDMEKQKAQSQGNQNLQITKALTTPTKEGET 703 (756) Q Consensus 626 ~~~~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~~~~~~q~~~~~~~~~~~a~~~~~~~~~~ 703 (756) +..+.+. ..+++.... +.. ... .+..+.....+ .-++..+.-..+++ T Consensus 438 ~~~~~ee---~~~~~~~~~-~~~----------~~~--~~~~~~~~~~~---------------~~~~~~~~~~~~~~ 484 (484) T protein:vir:77 438 RKWDEEE---QAQGLGLMG-TMF----------GTD--PSGGGNPDNPE---------------TPEPQPNPAEEAAA 484 (484) T ss_pred HHHHHHH---HHHHHHHHh-hhc----------ccc--ccCCCCCCCCC---------------cccccCCCccccCC Confidence 1000000 000000000 000 000 00000000000 00000000000001 No 108 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=99.36 E-value=2.4e-11 Score=78.85 Aligned_cols=470 Identities=12% Similarity=-0.004 Sum_probs=189.5 Q ss_pred ccCCCCCCCccccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCC-----CCCCCCCcccCHHH Q lcl|NC_019423. 4 QDTFKPLPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAKP-----PKIKGRSQVQPRLV 78 (756) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~-----~~~~grS~~v~~~v 78 (756) ..+..| - +.+.+....+ ++.....|.....+.++-.+||+|....+. |....+=++|.+-. 
T Consensus 1 ~~~~~~--------~---~~e~~~~~~~---~~~l~~~~~~~~~r~~~l~~YY~G~~~i~~~~~~~~~~~~~~~~v~n~~ 66 (486) T protein:vir:42 1 MTAPLP--------G---MEEIEDPAVV---REEMISAFEDASKDLASNTSYYDAERRPEAIGVTVPREMQQLLAHVGYP 66 (486) T ss_pred CCCCCC--------C---CCCcccHHHH---HHHHHHHHHHHHHHHHHHHHHhcccCcchhcccccchhHhhhhhccchH Confidence 222211 1 2222211111 223334445556667777899998642210 00000112333444 Q ss_pred HHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeee Q lcl|NC_019423. 79 RRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTV 158 (756) Q Consensus 79 ~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~ 158 (756) +..|+.....| .| .-|. .+++..... .++-++ ..|+--.....+++++++.|.+.+.||.+.... T Consensus 67 ~~iVd~~~~~l--~~------~g~~--~~~~~~~~~----~~~~i~-~~N~~d~~~~~~~~~a~~~G~ay~~v~~~e~~~ 131 (486) T protein:vir:42 67 RLYVDSVAERQ--AV------EGFR--LGDADEADE----ELWQWW-QANNLDIEAPLGYTDAYVHGRSFITISKPDPQL 131 (486) T ss_pred HHHHHHHHhhh--cc------ccee--cCCCchhHH----HHHHHH-HhcChhHHHHHHHHHHhhcCceEEEEecCCccc Confidence 44444333222 11 1122 123333322 233332 344444445578999999999988874431100 Q ss_pred eeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCcee Q lcl|NC_019423. 159 KIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPT 238 (756) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ 238 (756) . .....+.++ T Consensus 132 ~----------------------------------------------------------------------~~~~~~~~~ 141 (486) T protein:vir:42 132 D----------------------------------------------------------------------LGWDQNVPI 141 (486) T ss_pred c----------------------------------------------------------------------cccCCCeeE Confidence 0 001124578 Q ss_pred EEEechhheE--eCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccc Q lcl|NC_019423. 
239 VEMLNPNNVV--IDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRK 316 (756) Q Consensus 239 ie~V~p~~~~--~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~ 316 (756) |..++|.+++ |||... . ..+ +.+++.+. ... T Consensus 142 i~~~~p~~~~~i~d~~~~-~---~~~-~~~~~~~~------------------------------------------~~~ 174 (486) T protein:vir:42 142 IRVEPPTRMHAEIDPRIN-R---VSK-AIRVAYDK------------------------------------------EGN 174 (486) T ss_pred EEEecccceEEEEeCCCC-C---eEE-EEEEEEec------------------------------------------CCC Confidence 8889999865 565422 1 111 11222100 001 Q ss_pred eEEEEEEEEEeeccCCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHH-HhHHHHHHHHHHHH Q lcl|NC_019423. 317 KVVAYEYWGFYDINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAE-LLGDNQAILGATMR 395 (756) Q Consensus 317 ~V~v~E~w~k~d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~-~~~d~Q~~iN~~~~ 395 (756) .+..+++|.. +. .++++..++........|...|.+|++++...+..+..+|.|-+. .++++++.+|+.++ T Consensus 175 ~~~~~~~y~~-----~~---~~~~~~~~~~~~~~~~~~h~~g~vPvv~~~n~~~~~~~~G~s~i~~~v~~liDa~~~~~s 246 (486) T protein:vir:42 175 EIQAATLYTP-----ME---TIGWFRADGEWAEWFNVPHGLGVVPVVPLPNRTRLSDLYGTSEITPELRSMTDAAARILM 246 (486) T ss_pred eEEEEEEEcC-----Cc---EEEEEecCCcEEeecceecCCCCceEEEeccccccCCCCCcccchhhHHHHHHHHHHHHH Confidence 2333444431 11 111122222222223334455789999999888889999999987 58899999999999 Q ss_pred HHHHHHHhhcCCceEee---ccccCccchhhhhccccccccccccccccccccccCCCcchHHHHHHHHHHHHHHHH--- Q lcl|NC_019423. 396 GMIDLLGRSANGQRGYP---KGMLDTLNRRRYDDGQDYEYNPMQGNPSQSIMEHKFPELPQSAIVMTQMQNQEAESL--- 469 (756) Q Consensus 396 ~~~d~l~~~~~~~~~~~---~gav~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~e~~--- 469 (756) .+.......+.++..+. ...+...+........ ...+.+-..+....++.+.+.. 
.....+..+...+..+ T Consensus 247 ~~~~~~e~~a~p~~~i~G~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~q~~~~--~~e~~~~~l~~~i~~~s~~ 323 (486) T protein:vir:42 247 LMQATAELMGVPQRLIFGIKPEEIGVDSETGQTLFD-AYLARILAFEDAEGKIQQFSAA--ELANFTNALDQIAKQVAAY 323 (486) T ss_pred HHHHHHHhhcchHHHhhcCCccccccccccccchhh-hhhchhcccCCCCceEEeeccc--CHHHHHHHHHHHHHHHhcc Confidence 99998888887765442 1111111110000000 0011111112223444444433 2334556666666655 Q ss_pred hchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHh Q lcl|NC_019423. 470 TGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDL 549 (756) Q Consensus 470 tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~ 549 (756) +++++...|..+.. ..++.++...............+.|..+++.++++++.+... .. +..+ T Consensus 324 ~~~p~~~fg~~~~n-~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~~--~~-------------~~~d-- 385 (486) T protein:vir:42 324 TGLPPQYLSTAADN-PASAEAIRAAESRLIKKVERKNLMFGGAWEEAMRIAYRIMKG--GD-------------VPPD-- 385 (486) T ss_pred cCCCHHHhccccCc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC--CC-------------cccc-- Confidence 56777676644321 123333444444444555556667777777777766554311 00 0000 Q ss_pred cCcceEEEecccccH--HHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCChhHHHHhhhccCCCChhhhhHHHHHH Q lcl|NC_019423. 550 KGNFDIEVDINTAEI--DNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMPDLAHELRTWQPQPDPMEEQLKQLAI 627 (756) Q Consensus 550 ~~~~Dv~V~~g~a~~--~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~~~~~~l~~~~~q~~p~~~~~~q~~~ 627 (756) ..++.|.=..... ..+.++....+.+.....+.... +++..++- +++.. +++. 
T Consensus 386 --~~~i~v~w~~~~~~s~~~~ad~~~kl~~~~~g~~s~et-------~~~~lg~~------------~d~~~----e~~~ 440 (486) T protein:vir:42 386 --MLRMETVWRDPSTPTYAAKADAATKLYGNGQGVIPRER-------ARIDMGYS------------VKERE----EMRR 440 (486) T ss_pred --ceeeeEEecCCCCCCHHHHHHHHHHHHhcccCCCCHHH-------HHhcCCCC------------hhHHH----HHHH Confidence 1123332222211 11122222222222112222211 12223321 11110 0010 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCchhh Q lcl|NC_019423. 628 QKAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARDMEKQKAQSQGNQNLQITKALTTPTKEGETTPNI 707 (756) Q Consensus 628 ~~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~~~~~~q~~~~~~~~~~~a~~~~~~~~~~~~~~ 707 (756) .+.+ +..+++... .+.. . ..... ..+.+.. + .....+.+-+.- T Consensus 441 ~~~e---~~~~~~~~~-~~~~----~---~~~~~-~~~~~~~----------~---------------~~~~~~~~~~~~ 483 (486) T protein:vir:42 441 WDEE---EAAMGLGLL-GTMV----D---ADPTV-PGSPSPT----------A---------------PPKPQPAIESSG 483 (486) T ss_pred HHHH---HHHHHHHHH-HHhh----c---CCCCC-CCCCCCC----------C---------------CCCCCcccCCCC Confidence 0000 000000000 0000 0 00000 0000000 0 000000000000 Q ss_pred hcc Q lcl|NC_019423. 708 SAA 710 (756) Q Consensus 708 ~~a 710 (756) ..+ T Consensus 484 ~~~ 486 (486) T protein:vir:42 484 GDA 486 (486) T ss_pred CCC Confidence 000 No 109 >protein:vir:103385 Length: 666 # NCBI annotation: hypothetical protein # Family: family:all:11276 # MgeID: mge:1621 # MgeName: PaP2 # Cross-refs: genbank:acc:YP_024736;genbank:gi:48697078;genbank:GeneID:2846053 Probab=99.35 E-value=7.6e-13 Score=87.02 Aligned_cols=581 Identities=17% Similarity=0.144 Sum_probs=264.0 Q ss_pred CCcccCCCCCCCccccccc-cCCCchHHHHHHHHHHHHHHHHhhHHHHH---HHHHHHHhcccc------------CCCC Q lcl|NC_019423. 1 MEHQDTFKPLPDPAQSEKL-TDWKKEPSIQLLKGDLESAKPAHDAIMSQ---IREWNDLMEVKG------------KAKP 64 (756) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~l~~~~~~a~~~~~~~~~~---~~~~~~~y~~~~------------~~~~ 64 (756) |.-+ |+.|.--+-+ |.-.||-+-+.|+.-++.++--...++++ .++++--|.... .++. 
T Consensus 1 mais-----psepninsfvytqrvdellkahlkkildfsktnkanyiqKMD~ID~AYARY~~~~~N~~LlG~~~~~~A~V 75 (666) T protein:vir:10 1 MAIS-----PSEPNINSFVYTQRVDELLKAHLKKILDFSKTNKANYIQKMDLIDKAYARYITAQENNELLGYNQNIAAKV 75 (666) T ss_pred CCcC-----CCCCcchhhhhHHHHHHHHHHHHHHHhhhhccchhhHHHHhhhHHHhHHhhhhccCCCceeeecccccccC Confidence 3322 2222211111 22345555667777777776666666664 668888775321 1111 Q ss_pred CCCCCCCcccCHHHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHH-HHhhhcCCcchHHHHHHHHhh Q lcl|NC_019423. 65 PKIKGRSQVQPRLVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNY-QFRTQLNKVKLVDDYVHSIVD 143 (756) Q Consensus 65 ~~~~grS~~v~~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~-~~~~~~~~~~~~~~~v~~al~ 143 (756) |-.-=.-.+|++.|-.+|+++.+.|.++|+||-++|-+.- +|.--+-|+|..-.+.- .... +-.+-+-=++||++. T Consensus 76 ~C~V~~~~~V~PIViSQV~S~~~YLT~VF~SG~Pi~PVVS-~P~~K~~AE~LE~ii~DH~t~~--~~~~~LiL~L~D~~K 152 (666) T protein:vir:10 76 RCQVVNKATVNPIVISQVQSMTAYLTEVFASGYPILPVVS-TPDKKEQAEALEGIIQDHMTMT--SSIPELILCLQDAAK 152 (666) T ss_pred cceeeccccCCchhhhhHHHHHHHHHHHHhcCCccceeec-CCchhHHHHHHHHHHHhhhhhh--hhHHHHHHHHhhhhh Confidence 1111135688999999999999999999999999988876 77777888887765532 2111 001111123455544 Q ss_pred cCceEEEEeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCc Q lcl|NC_019423. 144 DGTGIARIGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTG 223 (756) Q Consensus 144 ~g~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g 223 (756) .-. +.|+++.-.+ +++. ..- .+++ . ..| T Consensus 153 YN~----~~~ET~Ws~I----E~~~------~~~----------------~i~~--------------~--------~~~ 180 (666) T protein:vir:10 153 YNL----VGWETEWSHI----ETYD------PQK----------------EITD--------------L--------EPG 180 (666) T ss_pred cce----eeeeeccccc----cccc------hhh----------------hhhc--------------C--------CCc Confidence 322 3465443111 1111 100 0000 0 001 Q ss_pred eeEEEeeeeecCceeEEEechhheEeCCCCc-Cc-cccCceEEEEeecCHHHHHhh--------ccchhhh-----cc-- Q lcl|NC_019423. 
224 VTEVEVEKALVNRPTVEMLNPNNVVIDPSCN-GD-LDKALYAVISFETCKADLMKN--------KDRYHNL-----DK-- 286 (756) Q Consensus 224 ~~~~~~~~~~~g~~~ie~V~p~~~~~Dp~a~-~d-~~da~~v~~~~~~t~~el~~~--------~~~~~~l-----~~-- 286 (756) . +..+|..+.--+|++++|.+.+|||... .+ .....|.++...+++-.|+.. .-.|+.+ +. T Consensus 181 K--~TlrR~~r~~~KIrRLN~RN~~~D~~~~~~~VA~~G~~~G~~~L~~R~~LKK~LN~LT~EKkltykkvV~~Al~~s~ 258 (666) T protein:vir:10 181 K--TTLRRNYRHVNKIRRLNLRNVHWDPIPDIPNVATEGSFLGETTLLNRIQLKKYLNYLTNEKKLTYKKVVNEALKSSF 258 (666) T ss_pred e--eecccchhhhhhhhccccccccccCCCCCCchhhhhhhhhHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHHhhhc Confidence 1 1222333333479999999999999753 22 333567777766665554321 0011111 10 Q ss_pred --cCchhhhhhhchhhhcccccccccc---------ccccceEEEEE--------EEEEee-------ccCCceeEEEE- Q lcl|NC_019423. 287 --IDWESSSPITDPDHESKTPSDFQFK---------DALRKKVVAYE--------YWGFYD-------INDDGSLEPIV- 339 (756) Q Consensus 287 --~~~~~~~~~~~~~~~~~~~~~~~~~---------d~s~~~V~v~E--------~w~k~d-------~~~~g~~~~~~- 339 (756) .+|...-..+..-......++.+|. .-.++||-|-| .|-|+- .........++ T Consensus 259 ~~sD~T~~P~IS~vY~~~~~~SDi~WD~~G~~~T~~sS~~~rvpvneqg~Y~k~~~Y~RI~PSDF~~~~P~~N~~QIWK~ 338 (666) T protein:vir:10 259 QGSDWTDNPQISPVYQEMEMASDINWDRFGGFETETSSTNRRVPVNEQGVYCKHTMYLRIIPSDFEMNVPNRNQVQIWKA 338 (666) T ss_pred cccccccCCccCccccccchhhccchhhcCcccccccccccccccccccceeeeeeeeeeccccceecCCCCCcceeeee Confidence 0010000000000000000111110 01123443322 233321 11111122233 Q ss_pred EEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCcc Q lcl|NC_019423. 
340 ATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTL 419 (756) Q Consensus 340 ~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~ 419 (756) +++.|+.++..++-.-..|.||.-..-...+.-..--.|+.+..++.|+...++++..+....+....+.++++..+ T Consensus 339 v~IN~~~iIS~~~~I~AY~~~~~~~~~~LEDG~G~QTQ~~~E~~~P~Q~A~t~L~N~~~~~aRRAV~DRAl~~~S~i--- 415 (666) T protein:vir:10 339 VMINRDAIISFEPYIGAYGSFGMGLAFALEDGMGLQTQGYGEMAAPLQSATTELWNAYIQGARRAVMDRALYNPSMI--- 415 (666) T ss_pred eeeccceeEeeehhhhccchhhhhhhhhhhhccccccccccccccchhhhhhHHhhhhhhhhhhhhhhhhccChhhh--- Confidence 44557788877764434555555432222222222245677789999999998777666555555555555555443 Q ss_pred chhhhhcccccccc--------ccccccccccccccCCCcchHHHHH---HHHHHHHHHHHhchhHHhcCCCccccchhH Q lcl|NC_019423. 420 NRRRYDDGQDYEYN--------PMQGNPSQSIMEHKFPELPQSAIVM---TQMQNQEAESLTGVKAFSGGVTGSAYGDVA 488 (756) Q Consensus 420 ~~~~~~~~~~~~~~--------~~~~~~~~~i~~~~~~~~~~~~~~~---l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA 488 (756) ...+.....+-. .++...+..-.++++ -..+.... .+.+.+.-++++|++...+|+-.-. +++- T Consensus 416 --~a~~iNSP~~~~KIP~~~~sL~N~~~~~~Y~~IPF--D~RG~E~~~Q~A~~l~~~~r~L~GMN~~~~GQFQKG-NKt~ 490 (666) T protein:vir:10 416 --RANDINSPIPQIKIPVVPQSLVNGTMDQAYRQIPF--DSRGMETVMQNALMLTDWQRELSGMNSATRGQFQKG-NKTR 490 (666) T ss_pred --hhhcccCCCCCcccceeehhhcccchhhhhccCCc--cccchhHHHhhhHHHHhhHHHhhccCCccccccccc-Ccce Confidence 222222211111 112222333333333 23333333 3445566678889988888853221 3444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHhcC-cceEEEecccccHHH Q lcl|NC_019423. 489 AGIRGALDAASKREMAILRRLAK-GMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDLKG-NFDIEVDINTAEIDN 566 (756) Q Consensus 489 ~~i~~~~~aa~~~l~~~~~n~~~-~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~~~-~~Dv~V~~g~a~~~~ 566 (756) .+-.-.|-.+..|++..+=-+.. .+..+-+++.-.+.+|.++..+|-=.-.+.+.|+-+.++. -..+.+.+|....+. 
T Consensus 491 ~E~~~~MG~a~NR~RLPALiLEH~~F~~iK~~L~LNl~~YG~DT~ViS~RtG~~~~vDi~~L~~~~L~F~~~DG~TP~SK 570 (666) T protein:vir:10 491 AEFDTIMGNAENRMRLPALILEHRMFTKIKEQLKLNLLMYGEDTEVISPRTGKGVRVDIKELQDLGLKFELGDGLTPASK 570 (666) T ss_pred eehhhhcCCcccceehhhHHhhhhhhhhHHHHHhhhhhhccccchhcccccCceeeeeHHHHhhhhheeeeccCCCchhh Confidence 44445555666666665555554 3344555555556778777776654323467777776664 223334454321111 Q ss_pred -HHHHHHHHHH----------HHhhccCCHhHHHHHHHHHHhhcCChhHHHHhhhccCCCChhhhhHHHHHHHHHHHHHH Q lcl|NC_019423. 567 -QKSQDLGFMV----------QTLGNTVDQSITLSLVAKIAELKRMPDLAHELRTWQPQPDPMEEQLKQLAIQKAQLENE 635 (756) Q Consensus 567 -~~~q~l~~ll----------q~~~~~~~~~~~~~~l~~l~e~~~~~~~~~~l~~~~~q~~p~~~~~~q~~~~~aq~e~~ 635 (756) .....+..+| +.+|+.+|. +++-++.+.|.....++-...-++=.+--..++++++.-.| T Consensus 571 ~ASs~~lT~~LQMI~sS~~~~~A~G~~~P~-----M~AH~~QLGGVRG~E~Y~daalP~~~~~~~~~Q~LQ~~~LQ---- 641 (666) T protein:vir:10 571 LASSDFLTALLQMIMSSETTLQAFGTQVPG-----MIAHLAQLGGVRGFEKYADAALPQWQITYGMQQQLQQMLLQ---- 641 (666) T ss_pred hhhhHHHHHHHHHHhhhhhhHhhhcccchH-----HHHHHHHhccccchhhhhhccCCccccccchhHHHHHHHHH---- Confidence 1111222222 234455544 33456677777666555543333221111111222221111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 636 ELQSKIALNNAKAKEAASSGDLKDLDYLEQESG 668 (756) Q Consensus 636 ~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~ 668 (756) ++.|.+ .+-++. |.++...+. +-++ T Consensus 642 -~~~QSA-~Q~~A~----Q~~L~~~Q~--~PSq 666 (666) T protein:vir:10 642 -LQQQSA-MQLQAR----QGELSNDQS--QPSQ 666 (666) T ss_pred -Hhhhhh-cccccc----cccCccccc--CCCC Confidence 111100 000000 111110000 0000 No 110 >protein:vir:7430 Length: 563 # NCBI annotation: gp7 # Family: family:all:6920 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818545;genbank:gi:29566982;genbank:GeneID:1260216 Probab=99.34 E-value=4.1e-11 Score=77.54 Aligned_cols=527 Identities=12% Similarity=0.066 Sum_probs=210.3 Q ss_pred CCcccCCCCCCCccccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCCCCCCc--ccCHHH Q lcl|NC_019423. 
1 MEHQDTFKPLPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAKPPKIKGRSQ--VQPRLV 78 (756) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~grS~--~v~~~v 78 (756) |.-+..--=+..+.=.+-.-+|..+.-.+.|. .-+.=.+||++....-.+...|+-+ +-.+.- T Consensus 1 m~~~~~q~~p~~~~fp~~~a~wV~~~D~~Rla---------------aY~ly~d~y~n~~~el~~il~G~dr~~~~~ps~ 65 (563) T protein:vir:74 1 MPYNHKQYDPAKPFLRGGDDNIVDENDKNRVR---------------AYDLYENIYLNSAETLKLVLRGDDSVPILMPSG 65 (563) T ss_pred CCccccccCCCcccccccccccCCHHHHHHHH---------------HHHHHHHhhcCchhhhhhhcCCCceeeeccchH Confidence 32221111111111123333454432222221 1112236777654332222344433 333345 Q ss_pred HHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeee Q lcl|NC_019423. 79 RRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTV 158 (756) Q Consensus 79 ~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~ 158 (756) +..|+. ++ -|||.+-.+-+.|.+ +|+...+....|++-.+.+++=..+.. ..-++|++.|-||+++.||.+.+ T Consensus 66 r~~V~~----~~-~~Lg~~~~~~Ve~~~-~de~~~~avq~~Lr~~~~~e~l~~~~~-~~~r~a~vlGDgvf~l~wDp~K~ 138 (563) T protein:vir:74 66 RKIVEA----VH-RFLGVGFDYLVEPDM-GDEGIRQSLNAYFRTTFKREAIKAKFT-SNKRWGLIRGDAHFYIHADPNKK 138 (563) T ss_pred HHHHHH----HH-HhcCCCcEEecCccc-cCcchHHHHHHHHHHHHHHhhhHHHHH-HHHHhhhhhcceeEEEeeccccc Confidence 666666 33 345777778888876 577777778889999988887666544 35678899999999999987643 Q ss_pred eeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCcee Q lcl|NC_019423. 159 KIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPT 238 (756) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ 238 (756) .- ++....+ .+ |..+ .++. .. 
....|-.- T Consensus 139 ~g-~R~rv~~----vD-----------------P~~~----------------------fp~~--dp-----d~v~g~~~ 167 (563) T protein:vir:74 139 AG-ERISVDE----VD-----------------PRQI----------------------FLIE--DG-----STVVGFHM 167 (563) T ss_pred cC-CCceEee----cC-----------------Ccee----------------------eecc--CC-----CCccccee Confidence 11 1111000 00 0000 0000 00 00001011 Q ss_pred EEEechhheEeCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceE Q lcl|NC_019423. 239 VEMLNPNNVVIDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKV 318 (756) Q Consensus 239 ie~V~p~~~~~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V 318 (756) ++.+.. |+.|. +. .+.+++++..++. +....+- -..-. T Consensus 168 v~v~~~---~~~pd---d~--~~~~~r~~~~~~~-----------lndeg~~-----------------------~~~~~ 205 (563) T protein:vir:74 168 VDIVQD---FRSPD---DP--SKKLARRRTFRRV-----------RNDEGMF-----------------------TGRIS 205 (563) T ss_pred eecccC---CCCCc---ch--hccceeeeeeeee-----------eCCCCCc-----------------------cceee Confidence 222211 11222 11 2344554433221 0000000 00000 Q ss_pred EEEEEE-----EEeeccCCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHHH Q lcl|NC_019423. 319 VAYEYW-----GFYDINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGAT 393 (756) Q Consensus 319 ~v~E~w-----~k~d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~~ 393 (756) .-.|.| .....+.........-++...+..+...-|-+.+.+||++++..|.+++.||.|-...+..+++++|.. T Consensus 206 ~dae~w~lg~wd~r~~~~~~~~~~~~~~~~~~~d~e~~~LP~pi~~iPiv~~~tip~~~s~WG~S~La~ll~~~~eLn~~ 285 (563) T protein:vir:74 206 SELTHWTLGNWDDRGAISDEQARRKEQVRSAQHDEEEEELPEPISQLPLYRWRNKPPQNSSWGTSQLEGMETLAYALNQS 285 (563) T ss_pred eccchhccccccccCccchhhhcccchhhhhhhhchhhhccccccCccEEEcCCCCCcccccchhhHHHHHHHHHHHhhh Confidence 011222 111111111111122234433444444445566899999999999999999999999999999999999 Q ss_pred HHHHHHHHHhhcCCceEeeccccCccchhhhhccccccccc--c-ccc---ccccccccc-CCCcchHHHHHHH-HHHHH Q lcl|NC_019423. 
394 MRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNP--M-QGN---PSQSIMEHK-FPELPQSAIVMTQ-MQNQE 465 (756) Q Consensus 394 ~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~--~-~~~---~~~~i~~~~-~~~~~~~~~~~l~-~~~~~ 465 (756) ++-...++..+++|.++.+..+ +.+....+. .++.+++ + ... ...-+..+. .+++. .+..=+. ..... T Consensus 286 ~Td~s~i~~~tG~pi~vl~~~~--p~d~~~g~~-~~w~vgpG~i~El~~~~~~g~l~~v~g~~~l~-~~q~Hm~~l~era 361 (563) T protein:vir:74 286 LTDEDATIVFQGLGMYVTNASA--PVDPNTGEL-TDWNIGPMQIVEIAGNRNDNYFERVSGVQDVS-PFQDHMKWIDEKG 361 (563) T ss_pred hhHHHHHHHhcCCCeEEecccc--ccccccccc-cccccCCceeEeccCCccccceeeecchhhhH-HHHHHHHHHHHHH Confidence 9999889999999887776432 211111110 0111111 0 000 001111111 12221 1111123 33446 Q ss_pred HHHHhchhHHhcCC--CccccchhHHHHHHH-HHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecCce Q lcl|NC_019423. 466 AESLTGVKAFSGGV--TGSAYGDVAAGIRGA-LDAASKREMA-ILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNEQY 541 (756) Q Consensus 466 ~e~~tGv~~~~~G~--~~~a~~~tA~~i~~~-~~aa~~~l~~-~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~ 541 (756) +.+++|+++...|. .+.+.|+.|-.++.- .-++..+-.. +..-++.++.+...++|.+.+.-+-... |.+| T Consensus 362 l~~~s~tPavA~G~vD~~~~~SGiALeL~L~PL~a~~~ek~l~l~~~mr~~r~~~~~~lL~~~erl~~~g~-----~~~~ 436 (563) T protein:vir:74 362 IAEGSGTPEVAIGRVDVTSAESGISLELQLKPLLAANEEKELEMIVVMDQFLHDWMTMWLPAYESDFQEQD-----GSRP 436 (563) T ss_pred HHhhccCcceeecccccccccchhhhhhhhhHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhc-----cccc Confidence 77889999999993 344456666554431 1111111111 3333444556667777777766432211 1122 Q ss_pred eecCHhHhcCcceEEEecccccH--HHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCC--hhHHHHhhhccCCCCh Q lcl|NC_019423. 542 VEIKREDLKGNFDIEVDINTAEI--DNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRM--PDLAHELRTWQPQPDP 617 (756) Q Consensus 542 v~i~~d~~~~~~Dv~V~~g~a~~--~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~--~~~~~~l~~~~~q~~p 617 (756) +-+ ++.-++.-|+|.-++... +.+..++...+.+. ..+.......+|.++ |+ |++...++.+... 
T Consensus 437 ~g~--~~~~~~~~v~ivf~p~~P~d~~~vv~~~~tl~~a--GiiSretAv~~L~~~----g~~~pdae~e~~~ie~~--- 505 (563) T protein:vir:74 437 FAS--ADLLNECSVVCIFADPMPVNKTQVTQDTLLLQQA--HLILRKMAVAKLRSI----GWEYPEVDDQGNALTDD--- 505 (563) T ss_pred ccc--cccCCceEEEEEeCCCCCccHHHHHHHHHHHHHc--CchhHHHHHHHHHhC----CCCCCcHHHHHhhcCHH--- Confidence 222 112222233444444322 11111222111110 111111111111111 11 1111111111000 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHH------HHHHHH---HH----HHHHHHHHHHHHHH Q lcl|NC_019423. 618 MEEQLKQLAIQKAQLENEELQSKIA------LNNAKA---KE----AASSGDLKDLDYLE 664 (756) Q Consensus 618 ~~~~~~q~~~~~aq~e~~~~qa~a~------~~~a~a---~~----~~aq~~~~~~~~~~ 664 (756) .....+++++++.+- --++|-.. ...-+. .+ ...--+.++.-.-. T Consensus 506 -~i~~~~~a~a~ad~~-~~~~a~~~~g~~~~~~dd~g~p~~~~~~~~~~~~~~~~~~~~~ 563 (563) T protein:vir:74 506 -DIADMLLAEAEADAS-LGLSAMDNGGAGEQQFDDQGNPIDQFGNPVEIPPDVTQVPLSP 563 (563) T ss_pred -HHHHHHHHHhhccCc-ccceecccCCCCcccccccCCchhHcCCcccCCccccccCCCC Confidence 000000000000000 00000000 000000 00 00000000000000 No 111 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=99.34 E-value=7.9e-11 Score=75.97 Aligned_cols=398 Identities=11% Similarity=0.025 Sum_probs=172.2 Q ss_pred HHHhhHHHHHHHHHHHHhccccCCC------CCCCCCCCcccCHHHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHH Q lcl|NC_019423. 39 KPAHDAIMSQIREWNDLMEVKGKAK------PPKIKGRSQVQPRLVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELA 112 (756) Q Consensus 39 ~~~~~~~~~~~~~~~~~y~~~~~~~------~~~~~grS~~v~~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~ 112 (756) -+.|. .+.+.-.+||.|....+ ++..+.+-+.|.+-.+..|+.+...|. 
|...+..|.+ T Consensus 1 l~~~~---~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~~Vds~a~rl~-----------~~Gf~~~d~~- 65 (410) T protein:vir:95 1 MNLYQ---SRVNLRYKHYAMQHYEAPTGITIPAHIRAKYQAVLGWAAKGVDSLADRLI-----------FRAFANDDFN- 65 (410) T ss_pred CCcch---hhHHHHHHHhcCCCCccccchhccHHHHhHHHhhcchhHHHHHHhHhhhc-----------cccccCCCch- Confidence 33443 33445679998764321 112223334455555555555433331 1122223321 Q ss_pred HHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhcc Q lcl|NC_019423. 113 ARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENP 192 (756) Q Consensus 113 A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 192 (756) +.-+ ...|+--.....++++||+.|.+++.| | T Consensus 66 -------l~~i-~~~N~ld~~~~~~~~~al~~G~sf~~v-~--------------------------------------- 97 (410) T protein:vir:95 66 -------VTEI-FDRNNPDIFFDSAILSALIGSCSFVYI-S--------------------------------------- 97 (410) T ss_pred -------HHHH-HhhcChHHHHHHHHHHHHHhCceeEEE-e--------------------------------------- Confidence 1122 222222222334556666666665554 1 Q ss_pred chhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEEEechhhe--EeCCCCcCccccCceEEEEeecC Q lcl|NC_019423. 193 REYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVEMLNPNNV--VIDPSCNGDLDKALYAVISFETC 270 (756) Q Consensus 193 ~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie~V~p~~~--~~Dp~a~~d~~da~~v~~~~~~t 270 (756) . ...|.|+|..++|.++ +|||..+. +. + ..+.+-+ T Consensus 98 -----------------------------~---------~~d~~~~i~~~sP~~~~~i~Dp~~~~-~~---~-al~~~~~ 134 (410) T protein:vir:95 98 -----------------------------K---------GEDDEVRLQVIESSNATGVIDPITGL-LV---E-GYAVLAR 134 (410) T ss_pred -----------------------------c---------CCCCceEEEEEcccceEEEEeCCCCc-eE---E-EEEEEEe Confidence 0 0124467788888875 45664221 11 1 1111100 Q ss_pred HHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceEEEEEEEEEeeccCCceeEEEEEEEECCEEEEe Q lcl|NC_019423. 
271 KADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVAYEYWGFYDINDDGSLEPIVATWIGSTLIRM 350 (756) Q Consensus 271 ~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g~~~L~~ 350 (756) + + ..+.....+|.. + .+..+.++...+. T Consensus 135 ------------------~---------~---------------~~~~~~~~~~~~-----~-----~~~~~~~~~~~~~ 162 (410) T protein:vir:95 135 ------------------D---------D---------------YNRPTLEAYFEP-----N-----ATHFIPKDGEPYS 162 (410) T ss_pred ------------------c---------C---------------CCeEEEEEEEeC-----C-----cEEEEeeCCcccc Confidence 0 0 000111111210 0 0111111111223 Q ss_pred cccccCCCccceEEeeeeeecCcccCCch-HHHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccc-hhhhhccc Q lcl|NC_019423. 351 ENNPFPDGKLPLVVVPYMPRKRELFGEAD-AELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLN-RRRYDDGQ 428 (756) Q Consensus 351 ~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~-v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~-~~~~~~~~ 428 (756) .++|+ |.+|+|+++..++.++.+|.|- .+.++++|+.+|+.+..+.......+.|+..+- |+-++.+ ...++... T Consensus 163 ~~~~~--g~vPvV~f~n~~~l~~~~G~s~I~~~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~-G~d~d~~~~~~~~~~~ 239 (410) T protein:vir:95 163 VTNET--GIPLLVPVIHRPDAVRPFGRSRITRAGMYYQKYAKRTLERADITAEFYSWPQKYIL-GLDPDAEPMEKWKATV 239 (410) T ss_pred ccCCC--CCcceEEecccccCCccCCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhheee-ccCCCCCcCchhhhhh Confidence 35554 7899999999999899999995 488999999999999999999999888875442 2211000 00111111 Q ss_pred cccccccccccccccccccCCCcc-hHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 429 DYEYNPMQGNPSQSIMEHKFPELP-QSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILR 507 (756) Q Consensus 429 ~~~~~~~~~~~~~~i~~~~~~~~~-~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~ 507 (756) ...........+..++..+++.-. +.+...+..+...+-..||++....|.... 
...+|.++......-........+ T Consensus 240 ~~i~~~~~~~~~~~~~v~q~~~~~l~~~~~~l~~l~~~~a~~s~lP~~~lg~~~~-NpsSa~Al~a~~~~L~~ka~~k~~ 318 (410) T protein:vir:95 240 SSLLTISSSDKGVKPSVGQFTTASMSPFTEQLRTAAAGFAGEMGLTLDDLGFVSD-NPSSVEAIKASHENLRLAGRKAQR 318 (410) T ss_pred hhheeccCCCCCCcceEEecCCCChHHHHHHHHHHHHHHhhhcCCCHHHhccccC-chhHHHHHHHHHHHHHHHHHHHHH Confidence 110000000111233443333322 234455566666666778899998885432 113444455444444444555567 Q ss_pred HHHHHHHHHHHHHHHHHHhhCCC-CcEEEEecCceeecCHhHhcCcceEEEecccccHHHHHHHHHHHHHHHhhccCCHh Q lcl|NC_019423. 508 RLAKGMADIGTKICAMNAVFLSE-KEVVRITNEQYVEIKREDLKGNFDIEVDINTAEIDNQKSQDLGFMVQTLGNTVDQS 586 (756) Q Consensus 508 n~~~~~~~l~~~~l~li~q~~~~-~r~iRI~g~~~v~i~~d~~~~~~Dv~V~~g~a~~~~~~~q~l~~llq~~~~~~~~~ 586 (756) .|..+++.++++++.+.-..-.. ....++.= .|-++ .|. ...+. .+.+-...-+.+. ++-+.+. T Consensus 319 ~fg~~l~~~~rla~~i~~~~~~~~~~~~~~~v-~W~p~--------~d~----~~~s~-a~~aDa~~Kl~~a-~~g~~~~ 383 (410) T protein:vir:95 319 SLGAGLLNVAYVAACLRDEFRYTRSQFVRTAV-KWEPL--------FEA----DANTM-TMIGDGVVKLNQA-LPGYINA 383 (410) T ss_pred HHHHHHHHHHHHHHHHhcCCCCcccccceeeE-Eeeec--------CCc----chhhH-HHHHHHHHHHHHh-ccCCccH Confidence 77788888888887776433211 11111100 11111 011 00111 1111111112221 1111111 Q ss_pred HHHHHHHHHHhhcCChhHHHHhhhccCCCChhhhhHHHHHHHHHHHH Q lcl|NC_019423. 587 ITLSLVAKIAELKRMPDLAHELRTWQPQPDPMEEQLKQLAIQKAQLE 633 (756) Q Consensus 587 ~~~~~l~~l~e~~~~~~~~~~l~~~~~q~~p~~~~~~q~~~~~aq~e 633 (756) .-+++..|+.+-. ... 
...+.++++-| T Consensus 384 ------~~~~~~lg~~~~~------------~~~--~~~~e~~~~g~ 410 (410) T protein:vir:95 384 ------ETIRDLTGIAGDM------------SAK--PVVSEGGSNGE 410 (410) T ss_pred ------HHHHHhcCCChHH------------HHH--HHHHHHHhCCC Confidence 1133444543110 000 00000111101 No 112 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=99.33 E-value=5.7e-11 Score=76.74 Aligned_cols=476 Identities=12% Similarity=0.015 Sum_probs=201.7 Q ss_pred CCcccCCCCCCCccccccccCCCc----hHHHHHHHHHHHHHHHHhhHHHHHHHHH-HHHhccccCCCCCCC-CCCCccc Q lcl|NC_019423. 1 MEHQDTFKPLPDPAQSEKLTDWKK----EPSIQLLKGDLESAKPAHDAIMSQIREW-NDLMEVKGKAKPPKI-KGRSQVQ 74 (756) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~l~~~~~~a~~~~~~~~~~~~~~-~~~y~~~~~~~~~~~-~grS~~v 74 (756) |-=-.+ -|+-.+.|-+ ..-...+ ...++. .-....+| ..+|.+.-+..++.. ..+.++. T Consensus 1 ~~~~~~--------~~~~i~~w~~~~~~~~~~~~~----~~~~~~---~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~ 65 (518) T protein:vir:78 1 MGVWSV--------MTRFIKGWLNGKPNGSEPELI----PKYLPL---VPDNQKEWSKDSYLTSLWAQGYVPTVHDKLMN 65 (518) T ss_pred Ccchhh--------HHHHHHHhhcCCCCccchhcc----HHHhhh---cccchhhhhhhhhhhhhcccCCCCcccccccc Confidence 100000 1111222211 0000000 000000 00011111 123332222222211 1122233 Q ss_pred CHHHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeee Q lcl|NC_019423. 75 PRLVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWE 154 (756) Q Consensus 75 ~~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~ 154 (756) .+.-+. |-.-+++| .|+-..-+.|......|. ++++++++.++. 
.|+-...++.++..++-.|.+++|++|+ T Consensus 66 ~~l~~~-i~~~~A~l---l~~e~~~i~v~~~~~~d~---e~~~~~l~~il~-~n~f~~~~~~~~e~a~a~G~~~~k~~~d 137 (518) T protein:vir:78 66 SGTGNE-IVVVAAEY---ISGKPLSIDVTGVNGSKD---ENLTKQLKEALR-IDNFDSKSVKIVELAGGSGVSAVKINIL 137 (518) T ss_pred CChHHH-HHHHHHHh---hcCCCceEEecCccccCc---HHHHHHHHHHHH-hccHHHHHHHHHHHhhccCceEEEEEEE Confidence 333232 22233333 355555677765444443 355777877654 3444555778999999999999999884 Q ss_pred eeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeec Q lcl|NC_019423. 155 RKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALV 234 (756) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~ 234 (756) . T Consensus 138 ~------------------------------------------------------------------------------- 138 (518) T protein:vir:78 138 N------------------------------------------------------------------------------- 138 (518) T ss_pred C------------------------------------------------------------------------------- Confidence 1 Q ss_pred CceeEEEechhheEeCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccc Q lcl|NC_019423. 235 NRPTVEMLNPNNVVIDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDAL 314 (756) Q Consensus 235 g~~~ie~V~p~~~~~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s 314 (756) |+++|+.|+++.|++...- +++-.|-|+-+...-++ ...|..|+.+.+....... + .. T Consensus 139 ~~~~i~~v~ad~~~P~~~~-g~~~~~~f~~~~~~~~k------~~~y~~lE~he~~~~~~~~-------------~--~~ 196 (518) T protein:vir:78 139 GRPSISVHSSSQFWIDFKN-NEPFRFNFFEEIPTSNK------ADIYYLVESREIKQWDKEG-------------K--KL 196 (518) T ss_pred CeeEEEEEcCCeeEEEeec-CcEEEEEEEEEeecCCc------ceeEEEEEeecccccccee-------------e--cc Confidence 2245566666666653221 12222322211111000 0011112221111100000 0 00 Q ss_pred cceEEEEEEEEE-----eeccCCceeEEEEEE--EECCEEEEecccccC-CCccceEEeeee-----eecCcccCCchHH Q lcl|NC_019423. 
315 RKKVVAYEYWGF-----YDINDDGSLEPIVAT--WIGSTLIRMENNPFP-DGKLPLVVVPYM-----PRKRELFGEADAE 381 (756) Q Consensus 315 ~~~V~v~E~w~k-----~d~~~~g~~~~~~~~--~~g~~~L~~~~~P~~-~~~~Pfv~~~~~-----~~~~~~~G~g~v~ 381 (756) +.-...++.|.. +........+....+ +.|.. .+ .... ....||+++... ...++.+|.|++. T Consensus 197 ~~~~I~n~ly~~~~~~~v~~~~~~~~~~l~~~~~~~~~~---e~-~~~~tg~~~~~~~~~~n~~~N~~~~~splG~S~~~ 272 (518) T protein:vir:78 197 SGGFVTYSVIKIDGDKTTPISAERLPEQITSYLHTNDIQ---LN-HSVSIGLKSMGAYLINNSPSNTRYPHLNLGESDLS 272 (518) T ss_pred cceeEEEEEeeecCcccccccccccccccccccccccCc---cc-eeeccCCccceEEeeccccccccccCCCcCcchHh Confidence 000001111100 000000000000000 11100 00 0011 233566665433 3457788999999 Q ss_pred HhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchh-------hhhcccccccccccccccc------ccccccC Q lcl|NC_019423. 382 LLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRR-------RYDDGQDYEYNPMQGNPSQ------SIMEHKF 448 (756) Q Consensus 382 ~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~-------~~~~~~~~~~~~~~~~~~~------~i~~~~~ 448 (756) ++++.++.+|..+++..+.+.+ +..++.+++..+...... .++.. ...+..+....+. .|+..++ T Consensus 273 ~~~~~id~lD~~~s~~~~e~~~-g~~~i~v~~~~l~~~~~~~~~~~~~~fd~~-~~~y~~i~~~~~~~~~~~~~i~~~~~ 350 (518) T protein:vir:78 273 QCTNYLFAVDYFFTVYMREGEK-TKTKIAASERMFRKKVNKSTDKEEWSMNVD-EDYFMQFKGTLDAGAKLNDMIQFMQG 350 (518) T ss_pred hhhHHHHHHHHHHHHHHHHHHh-CCceeeechhHhccCCCCCCCccccccCCC-CceEEEecCcCCCCCccccceeeeec Confidence 9999999999999999999865 788899988776421110 01111 1112122211111 1333332 Q ss_pred CCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhC Q lcl|NC_019423. 449 PELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFL 528 (756) Q Consensus 449 ~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~ 528 (756) .-....+...++.+...+....|++....|.++. 
..||+++....+..-+.+..+...+..+++.+.+.++.+..-++ T Consensus 351 ~Ir~e~~~~~~~~~l~~~~~~~G~s~~tfg~~~~--~~TATei~s~~~~~~~t~~~~~~~~e~al~~l~~~i~~l~~~~~ 428 (518) T protein:vir:78 351 DFRDGSYRETMEYFAQKAVSKSGYNPATFNLGNR--EVKATEIWSLQDATVRKIEKKKRLIQNVYEQMLWDFLYLLTGGT 428 (518) T ss_pred ccChHHHHHHHHHHHHHHHHhhCCChhhcCcccc--cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 2223456677788888888889999888886432 47888888776666667777778888888888888888776553 Q ss_pred CCCcEEEEecCceeecCHhHhcCcceEEEecccc--cHHHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCCh--hH Q lcl|NC_019423. 529 SEKEVVRITNEQYVEIKREDLKGNFDIEVDINTA--EIDNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMP--DL 604 (756) Q Consensus 529 ~~~r~iRI~g~~~v~i~~d~~~~~~Dv~V~~g~a--~~~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~--~~ 604 (756) ..... .....+++++|+-+.+ .-.....+.+..+.. . ..+..... +.++ ..+.. ++ T Consensus 429 ~~~~~-------------~~~~~~~~v~i~f~D~i~~D~~~~~~~~~~~v~-a-GimS~e~~---i~~~--~~~~~deea 488 (518) T protein:vir:78 429 NNKEK-------------AIMRDEIRVIIEFPDPMSVNLNELSSTLNNMNS-A-LAMSVEEK---VKLI--HPKWEDEEI 488 (518) T ss_pred Ccccc-------------ccCCCceeEEEEeCCCCCCCHHHHHHHHHHHHh-c-CCCCHHHH---HHHh--CCCCCHHHH Confidence 32110 1112344455543332 222223332222221 1 22332221 1111 11221 11 Q ss_pred HHHhhhcc------CCCChhhhhHHHHHHH Q lcl|NC_019423. 605 AHELRTWQ------PQPDPMEEQLKQLAIQ 628 (756) Q Consensus 605 ~~~l~~~~------~q~~p~~~~~~q~~~~ 628 (756) .+-+.++. 
..++|...-..+.++= T Consensus 489 ~~e~~ri~~E~~~~~~~~p~~~~g~~~~~g 518 (518) T protein:vir:78 489 QAEVKRIYLENAIGEVPDPEAIGGMETKGG 518 (518) T ss_pred HHHHHHHHHHhcccCCCCCccccCCCCCCC Confidence 11121111 1111110000000000 No 113 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=99.32 E-value=1.1e-10 Score=75.21 Aligned_cols=468 Identities=10% Similarity=-0.010 Sum_probs=194.5 Q ss_pred CCCCCCCccccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCC-CCC--C-C-CCCcccCHHHHH Q lcl|NC_019423. 6 TFKPLPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAK-PPK--I-K-GRSQVQPRLVRR 80 (756) Q Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~-~~~--~-~-grS~~v~~~v~~ 80 (756) -+-|. ++ +.+.+....+ ++.....+.....+.++-.+||.|..... .+. . + .+=+++.+-.+. T Consensus 1 ~~~~i-------~~--~~~~~~~~~~---~~~l~~~~~~~~~r~~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ 68 (485) T protein:vir:10 1 MTAPL-------PG--QEEIEDPAIA---RDEMVSAFEDSTQNLKTNTSYYEAERRPEAIGVTVPIQMQSLLAHVGYPRL 68 (485) T ss_pred CCCCC-------CC--CCCCCCHHHH---HHHHHHHHHHHHHHHHHHHHHHhcCCcchhcCCCCChhhhhhhhhcCcHHH Confidence 11221 11 1111111222 33344555566667778899999864321 011 0 0 111234455566 Q ss_pred HHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeee Q lcl|NC_019423. 81 QAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKI 160 (756) Q Consensus 81 ~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~ 160 (756) .|+.....| .| .-|. .++|.+..+ .++.++ ..|+--.....+.+++++.|.+.+.+|.+..... 
T Consensus 69 ivd~~~~~l--~~------~g~~--~~~~~~~~~----~~~~i~-~~N~~d~~~~~~~~~a~i~G~ay~~v~~~e~~~~- 132 (485) T protein:vir:10 69 YVDSIAERQ--AV------EGFR--FGDADEADE----ELWQWW-QANNLDIEAPLGYTDAYVHGRSYITISRPDPQID- 132 (485) T ss_pred HHHHHHhhh--cc------ccee--cCCCchhHH----HHHHHH-HhcCHhHHHHHHHHHHhhcCceEEEEeeCCcccc- Confidence 666554444 11 1122 133444333 333333 3343334455688999999999888744311000 Q ss_pred eeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEE Q lcl|NC_019423. 161 KTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVE 240 (756) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie 240 (756) .....+.++|. T Consensus 133 ---------------------------------------------------------------------~~~~~~~~~i~ 143 (485) T protein:vir:10 133 ---------------------------------------------------------------------LGWDPNTPIIR 143 (485) T ss_pred ---------------------------------------------------------------------cccCCCeeEEE Confidence 00112457888 Q ss_pred EechhheE--eCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceE Q lcl|NC_019423. 241 MLNPNNVV--IDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKV 318 (756) Q Consensus 241 ~V~p~~~~--~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V 318 (756) .++|.+++ |||.... + ...+ +.+-+ . ....+ T Consensus 144 ~~~p~~~~~~~D~~~~~-~---~~~~-~~~~~------------------~------------------------~~~~~ 176 (485) T protein:vir:10 144 VEPPTRMYAEIDPRIGR-V---SKAI-RVAYD------------------A------------------------EGNEI 176 (485) T ss_pred EEccceeEEEEcCCCCc-e---eEEE-EEEEe------------------e------------------------CCCeE Confidence 89999864 5654321 1 1111 11100 0 00112 Q ss_pred EEEEEEEEeeccCCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHH-HhHHHHHHHHHHHHHH Q lcl|NC_019423. 
319 VAYEYWGFYDINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAE-LLGDNQAILGATMRGM 397 (756) Q Consensus 319 ~v~E~w~k~d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~-~~~d~Q~~iN~~~~~~ 397 (756) ..+++|.. +.+ +++...++........|.+.|.+|+++++..++.+..||.|-+. .++++++.+|+.++.+ T Consensus 177 ~~~~~y~~-----~~~---~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~i~~~v~~liDa~~~~~s~~ 248 (485) T protein:vir:10 177 QAATLYTP-----NDI---FGWYRVENEWQEWFNNPHGLGVVPVVPIPNRTRLSDLYGTSEITPELRSMTDAAARILMLM 248 (485) T ss_pred EEEEEEeC-----CeE---EEEEEcCCceEEeccccCCCCcccEEEeccccccCCCCCccchhHHHHHHHHHHHHHHHHH Confidence 23334431 111 11112222232233445556889999999998889999999886 5899999999999999 Q ss_pred HHHHHhhcCCceEeeccccCc---cchhhhhccccccccccccccccccccccCCCcchHHHHHHHHHHHHHHHH---hc Q lcl|NC_019423. 398 IDLLGRSANGQRGYPKGMLDT---LNRRRYDDGQDYEYNPMQGNPSQSIMEHKFPELPQSAIVMTQMQNQEAESL---TG 471 (756) Q Consensus 398 ~d~l~~~~~~~~~~~~gav~~---~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~e~~---tG 471 (756) .......+.|+..+.-...+. .+........ ...+.+-..++...+..+.+..+ ....++.+...+..+ |+ T Consensus 249 ~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~-~~~~~i~~~~~~d~k~~q~~~~~--~~~~~~~l~~~i~~~~~~~~ 325 (485) T protein:vir:10 249 QATAELMGVPQRLIFGIKPEEIGVDPETGQTLFD-AYLARILAFEDAEGKIQQFSAAE--LANFTNALDQIAKQVAAYTG 325 (485) T ss_pred HHHHHhhcchHHHHhcCCcccccccccccchhhh-hcccceeccCCCCceEEeecccc--hHHHHHHHHHHHHHHhcccC Confidence 999988888765442111111 0110000000 00111111222334444444322 334455566666655 66 Q ss_pred hhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHhcC Q lcl|NC_019423. 472 VKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDLKG 551 (756) Q Consensus 472 v~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~~~ 551 (756) +++...|..+.. +.++.++...............+.|..++++++++++.+.. .. 
+ ...+ T Consensus 326 ~p~~~fg~~~~n-~~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~l~~~~~~----~~------~-----~~~~---- 385 (485) T protein:vir:10 326 LPPQYLSTAADN-PASAEAIRAAESRLIKKVERKNSIFGGAWEEAMRLAYRMMK----GG------D-----VPPD---- 385 (485) T ss_pred CCHHHhccccCc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC----CC------C-----Cccc---- Confidence 677777754321 12333344444444444555556666677777666555432 10 0 0000 Q ss_pred cceEEEecccccH--HHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCChhHHHHhhhccCCCChhhhhHHHHHHHH Q lcl|NC_019423. 552 NFDIEVDINTAEI--DNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMPDLAHELRTWQPQPDPMEEQLKQLAIQK 629 (756) Q Consensus 552 ~~Dv~V~~g~a~~--~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~~~~~~l~~~~~q~~p~~~~~~q~~~~~ 629 (756) .+++.|.=..... ..+.++.+..+.+.....++... +++..|+. +++ .+..+ T Consensus 386 ~~~i~v~w~~~~~~~~~~~ada~~kl~~ag~~~~s~et-------~~~~lg~~------------~~~-------~~~~~ 439 (485) T protein:vir:10 386 MLRMETVWRDPSTPTYAAKADAASKLYNGGTGVIPRER-------ARKDMGYS------------IAE-------REEMR 439 (485) T ss_pred ceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHH-------HHHhCCCC------------HhH-------HHHHH Confidence 1123333222221 11122222222221111122211 12222321 000 00000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCchhhhc Q lcl|NC_019423. 630 AQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARDMEKQKAQSQGNQNLQITKALTTPTKEGETTPNISA 709 (756) Q Consensus 630 aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~~~~~~q~~~~~~~~~~~a~~~~~~~~~~~~~~~~ 709 (756) ... .+..+ +++ ..+.++... .+...+ T Consensus 440 ~~~---------ee~~~-----~~~----------------------------------~~~~~~~~~------~~~~~~ 465 (485) T protein:vir:10 440 RWD---------EEEAA-----MGL----------------------------------GLIGTMVDP------NPTVPG 465 (485) T ss_pred HHH---------HHHHH-----HHH----------------------------------HHHHHhhcc------CCCCCC Confidence 000 00000 000 000001110 011111 Q ss_pred cCCCCCCCc-ccCchhcCCCCCC Q lcl|NC_019423. 710 AVGYNTLTN-GNSPQERDLAAQQ 731 (756) Q Consensus 710 a~~~~~~~~-~~~~~~~~~~~~~ 731 (756) +. 
...| ++.|.++.|-.++ T Consensus 466 ~~---~~~~~~~~~~~~~~~~~~ 485 (485) T protein:vir:10 466 SP---SPAPAPKPAALESGGDAA 485 (485) T ss_pred CC---CccccccCcCCCCCCCCC Confidence 10 0001 0112222222222 No 114 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=99.32 E-value=1.2e-10 Score=74.91 Aligned_cols=393 Identities=13% Similarity=0.033 Sum_probs=171.2 Q ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCC------CCCCCCCCcccCHHHHHHHHHHHHHHHHhhcC Q lcl|NC_019423. 22 WKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAK------PPKIKGRSQVQPRLVRRQAEWRYAPLSEPFLS 95 (756) Q Consensus 22 ~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~------~~~~~grS~~v~~~v~~~~e~~~~~L~~~f~~ 95 (756) |.. ..|..|.+.+.. ...+.++-.+||.|....+ ++..+.+-+.|.+-.+..|+.+...| T Consensus 1 ~~~-~~i~~L~~~~~~-------~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~iVds~a~rl------ 66 (409) T protein:vir:16 1 MTE-KGIGYLRFKLSV-------HKRRAEMRYEQYAMKHVDRFKGITIPQALSQQYRSILGWCAKGVDSLADRL------ 66 (409) T ss_pred CCH-HHHHHHHHHHHH-------HhHHHHHHHHHHhccCchhhcchhhhHHHHHHHhhhcChhHHHHHHhHhhc------ Confidence 433 444555444433 3345556778999864321 11111112233344444444332222 Q ss_pred CCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeeeeeeeeeecCCCCCH Q lcl|NC_019423. 
96 SSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIKTETPVFQLYPIENQ 175 (756) Q Consensus 96 ~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~ 175 (756) .|...+..|.+ +.-+ +..|+--.....++++|++.|.+++.| T Consensus 67 -----~~~Gf~~~d~~--------l~~i-~~~N~ld~~~~~~~~~al~yG~sf~~v------------------------ 108 (409) T protein:vir:16 67 -----VFREFENDDFT--------VNEI-FEENNPDIFFDSTVLSALIASCSFTYI------------------------ 108 (409) T ss_pred -----ccccccCcchH--------HHHH-HHhcChhHHHHHHHHHHHHhCceeEEE------------------------ Confidence 01111222321 1111 222222222334555666555555543 Q ss_pred HHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEEEechhhe--EeCCCC Q lcl|NC_019423. 176 EQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVEMLNPNNV--VIDPSC 253 (756) Q Consensus 176 ~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie~V~p~~~--~~Dp~a 253 (756) ..+ ..|.|+|..++|.++ +|||.. T Consensus 109 ---------------------------------------------~~~---------~dg~~~i~~~sP~~~~~i~D~~~ 134 (409) T protein:vir:16 109 ---------------------------------------------SKG---------ENDAVRLQVIEATNATGIIDPIT 134 (409) T ss_pred ---------------------------------------------ecC---------CCCceEEEEEcccceEEEeeccc Confidence 111 124567888888875 446643 Q ss_pred cCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceEEEEEEEEEeeccCCc Q lcl|NC_019423. 254 NGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVAYEYWGFYDINDDG 333 (756) Q Consensus 254 ~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~E~w~k~d~~~~g 333 (756) +. +. +-+ +.|-. + ..+..+ ...+|.. +. 
T Consensus 135 ~~-~~-~a~---~~~~~-------------------d----------------------~~~~~~-~~~~~~~-----~~ 162 (409) T protein:vir:16 135 GL-LT-EGY---AVLER-------------------D----------------------ENNNVV-LEAHFLP-----DR 162 (409) T ss_pred cc-ce-eee---EEEEe-------------------c----------------------CCCceE-EEEEEec-----Cc Confidence 32 11 111 11100 0 000000 0111110 00 Q ss_pred eeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchH-HHhHHHHHHHHHHHHHHHHHHHhhcCCceEee Q lcl|NC_019423. 334 SLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADA-ELLGDNQAILGATMRGMIDLLGRSANGQRGYP 412 (756) Q Consensus 334 ~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v-~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~ 412 (756) .+.++-++..-...++|+ |.+|+|++...++.++.+|.|-+ +.++++|+.+|+.+..+.-.....+.|+..+- T Consensus 163 ----~~~~~~~~~~~~~~~~~~--g~vPvV~f~n~~~~~~~~G~seI~~~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~ 236 (409) T protein:vir:16 163 ----TDYYYRDSRNNISIANPT--GNPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNAKRTLERADVTAEFYSFPQKYVT 236 (409) T ss_pred ----EEEEEecCccccceecCC--CCcceEEecccccccccCCccccchhHHHHHHHHHHHHHHHHHHHHHhcChhheeE Confidence 000011111111234554 78999999999999999999965 78999999999999999999888888875542 Q ss_pred ccccCcc-chhhhhccccccccccccccccccccccCCCcc-hHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHH Q lcl|NC_019423. 413 KGMLDTL-NRRRYDDGQDYEYNPMQGNPSQSIMEHKFPELP-QSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAG 490 (756) Q Consensus 413 ~gav~~~-~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~-~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~ 490 (756) |+-++. ....++..............+..++..+++.-. +.+...+..+...+-.+||++....|..... 
..+|.+ T Consensus 237 -G~d~d~~~~~~~~~~~~~i~~~~~d~~g~~~~v~q~~~~~l~~~~~~l~~~~~~~a~~s~lP~~~lg~~~~N-psSa~A 314 (409) T protein:vir:16 237 -GLSDDAEPMETWKATVSSMLQFTKDEDGDKPTLGQFTQPSMSPFTEQLRTAAAGFAGETGLTLDDLGFVSDN-PSSVEA 314 (409) T ss_pred -ecCCCCCccchhhhhhhHhhccCCCCCCCCceEEecCCCChhHHHHHHHHHHHHHhhhcCCCHHHcccccCc-hhHHHH Confidence 221110 000111111110000000112233443333322 2344556666666667788998888865321 134444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHhcCcceEEEecc---cccH-H- Q lcl|NC_019423. 491 IRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDLKGNFDIEVDIN---TAEI-D- 565 (756) Q Consensus 491 i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~~~~~Dv~V~~g---~a~~-~- 565 (756) +......-........+.|..+++.++++++.+.-..-.. ++.+ +++.|.=. .+.. + T Consensus 315 i~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~~~~~---------------~~~~---~~~~v~W~~~~~~~~~s~ 376 (409) T protein:vir:16 315 IKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDDVPYL---------------REQF---SKTKPKWEPLFEADASML 376 (409) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc---------------chhh---ccceEEecCCCCcchhhH Confidence 4443333334444455667777777777777765332110 1111 11111111 1111 1 Q ss_pred HHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCChhHH Q lcl|NC_019423. 566 NQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMPDLA 605 (756) Q Consensus 566 ~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~~~~ 605 (756) .+.+..+.-+.+. ++.+.. 
- .-+.+..|+.+-+ T Consensus 377 a~~aDa~~Kl~~a-~~~~~~---~---~v~~~~~g~~~~d 409 (409) T protein:vir:16 377 SLIGDGAIKLNQA-IPEFIN---K---DTIRDLTGIKGAE 409 (409) T ss_pred HHHHHHHHHHHhh-cccccc---h---hHHHHhccCCCCC Confidence 1111112222222 121111 0 1134455654333 No 115 >protein:vir:96403 Length: 666 # NCBI annotation: hypothetical protein # Family: family:all:11276 # MgeID: mge:1616 # MgeName: 119X # Cross-refs: genbank:acc:YP_001218810;genbank:gi:147917327;genbank:GeneID:5142606 Probab=99.30 E-value=2.2e-12 Score=84.50 Aligned_cols=581 Identities=16% Similarity=0.141 Sum_probs=261.0 Q ss_pred CCcccCCCCCCCccccccc-cCCCchHHHHHHHHHHHHHHHHhhHHHHH---HHHHHHHhcccc------------CCCC Q lcl|NC_019423. 1 MEHQDTFKPLPDPAQSEKL-TDWKKEPSIQLLKGDLESAKPAHDAIMSQ---IREWNDLMEVKG------------KAKP 64 (756) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~l~~~~~~a~~~~~~~~~~---~~~~~~~y~~~~------------~~~~ 64 (756) |.-+ |+.|.--+-+ |.-.||-+-+.|+.-++.++--...++++ .++++--|.... .++. T Consensus 1 mais-----psepninsfvytqrvdellkahlkkildfsktnkanyiqKMD~ID~AYARY~~~~~N~~LlG~~~~~~A~V 75 (666) T protein:vir:96 1 MAIS-----PSEPNINSFVYTQRVDELLKAHLKKILDFSKTNKANYIQKMDLIDKAYARYITAQENNELLGYNQNIAAKV 75 (666) T ss_pred CccC-----CCCCcchhhhhHHHHHHHHHHHHHHHhhhhccchhhHHHHhhHHHHhHHhhhhccCCCceeeecccccccc Confidence 3322 2222211111 22345555667777777776666666664 668888775321 1111 Q ss_pred CCCCCCCcccCHHHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHH-HHhhhcCCcchHHHHHHHHhh Q lcl|NC_019423. 65 PKIKGRSQVQPRLVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNY-QFRTQLNKVKLVDDYVHSIVD 143 (756) Q Consensus 65 ~~~~grS~~v~~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~-~~~~~~~~~~~~~~~v~~al~ 143 (756) |-.-=.-.+|++.|-.+|+++.+.|.++|+||-++|-+.- +|.--+-|+|..-.+.- .... +-.+-+-=++||++. 
T Consensus 76 ~C~V~~~~~V~PIViSQV~S~~~YLT~VF~SG~Pi~PVVS-~P~~K~~AE~LE~ii~DH~t~~--~~~~~LiL~L~D~~K 152 (666) T protein:vir:96 76 RCQVVNKATVNPIVISQVQSMTAYLTEVFASGYPILPVVS-TPDKKEQAEALEGIIQDHMTMT--SSIPELILCLQDAAK 152 (666) T ss_pred cceeeccccCCchhhhhHHHHHHHHHHHHhcCCccceeec-CCchhHHHHHHHHHHHhhhhhh--hhHHHHHHHHhhhhh Confidence 1111135688999999999999999999999999998876 77777888887765532 2111 001111123455443 Q ss_pred cCceEEEEeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCc Q lcl|NC_019423. 144 DGTGIARIGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTG 223 (756) Q Consensus 144 ~g~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g 223 (756) .-. +.|+++.-.++ ..+.. ..+ .. ...| T Consensus 153 YN~----~~~ET~Ws~IE----------~~~~~----------------------------~~i----~~------~~~~ 180 (666) T protein:vir:96 153 YNL----VGWETEWSNIE----------TYDPQ----------------------------KEI----TD------LEPG 180 (666) T ss_pred cce----eeeeecccccc----------ccchh----------------------------hhh----hc------CCCc Confidence 322 34654432211 11110 000 00 0011 Q ss_pred eeEEEeeeeecCceeEEEechhheEeCCCCc-Cc-cccCceEEEEeecCHHHHHhh--------ccchhhh-----cc-- Q lcl|NC_019423. 224 VTEVEVEKALVNRPTVEMLNPNNVVIDPSCN-GD-LDKALYAVISFETCKADLMKN--------KDRYHNL-----DK-- 286 (756) Q Consensus 224 ~~~~~~~~~~~g~~~ie~V~p~~~~~Dp~a~-~d-~~da~~v~~~~~~t~~el~~~--------~~~~~~l-----~~-- 286 (756) . +..+|..+.--+|++++|.+.+|||... .+ .....|.++...+++-.|+.. .-.|+.+ +. T Consensus 181 K--~TlrR~~r~~~KIrRLN~RN~~~D~~~~~~~VA~~G~~~G~~~L~~R~~LKK~LN~LT~EKkltykkvV~~Al~~s~ 258 (666) T protein:vir:96 181 K--TTLRRNYRHVNKIRRLNLRNVHWDPIPDIPNVATEGSFLGETTLLNRIQLKKYLNYLTNEKKLTYKKVVNEALKSSF 258 (666) T ss_pred e--eeeccchhhhhhhhccccccccccCCCCCCchhhhhhhhhhHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHHhhhc Confidence 1 1222333333479999999999999753 22 334567777766665554321 0011111 10 Q ss_pred --cCchhhhhhhchhhhcccccccccc---------ccccceEEEEE--------EEEEee-------ccCCceeEEEE- Q lcl|NC_019423. 
287 --IDWESSSPITDPDHESKTPSDFQFK---------DALRKKVVAYE--------YWGFYD-------INDDGSLEPIV- 339 (756) Q Consensus 287 --~~~~~~~~~~~~~~~~~~~~~~~~~---------d~s~~~V~v~E--------~w~k~d-------~~~~g~~~~~~- 339 (756) .+|...-..+..-......++.+|. .-.++||-|-| .|-|+- .........++ T Consensus 259 ~~sD~T~~P~IS~vY~~~~~~SDi~WD~~G~~~T~~sS~~~rvpvneqg~Y~k~~mY~RI~PSDF~~~~P~~N~~QIWK~ 338 (666) T protein:vir:96 259 QGSDWTDNPQISPVYQEMEMASDINWDRFGGFETETSSTNRRVPVNEQGVYCKHTMYLRIIPSDFEMNVPNRNQVQIWKA 338 (666) T ss_pred cccccccCCcccccccccchhhccchhhcCcccccccccccccccccccceeeeeeeeeeccccceecCCCCCcceeeee Confidence 0010000000000000000111110 01123443322 233321 11111122233 Q ss_pred EEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCcc Q lcl|NC_019423. 340 ATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTL 419 (756) Q Consensus 340 ~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~ 419 (756) +++.|+.++..++-.-..|.||.-..-...+.-..--.|+.+..++.|+...++++..+....+....+.++++..+. T Consensus 339 v~IN~~~iIS~~~~I~AY~~~~~~~~~~LEDGmG~QTQ~~~E~~~P~Q~A~t~L~N~~~~~aRRAV~DRAl~~~S~i~-- 416 (666) T protein:vir:96 339 VMINRDAIISFEPYIGAYGSFGMGLAFALEDGMGLQTQGYGEMAAPLQSATTELWNAYIQGARRAVMDRALYNPSMIR-- 416 (666) T ss_pred eeeccceeEeeehhhcccchhhhhhhhhhhhccccccccccccccchhhhhhHHhhhhhhhhhhhhhhhhhcchhhhh-- Confidence 445577888777644345555554322222222222456777899999999988776666555555555555554432 Q ss_pred chhhhhcccccccc--------ccccccccccccccCCCcchHHHHH---HHHHHHHHHHHhchhHHhcCCCccccchhH Q lcl|NC_019423. 420 NRRRYDDGQDYEYN--------PMQGNPSQSIMEHKFPELPQSAIVM---TQMQNQEAESLTGVKAFSGGVTGSAYGDVA 488 (756) Q Consensus 420 ~~~~~~~~~~~~~~--------~~~~~~~~~i~~~~~~~~~~~~~~~---l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA 488 (756) ..+.....+-. .++...+..-.++++. ..+.... .+.+.+.-++++|++...+|+-.-. 
+++- T Consensus 417 ---a~~iNSP~~~~KIP~~~~sL~N~~m~~~Y~~IPFD--~RG~E~~~Q~A~~l~~~~r~L~GMN~~~~GQFQKG-NKt~ 490 (666) T protein:vir:96 417 ---ANDINSPIPQIKIPVVPQSLVNGTMDQAYRQIPFD--SRGMETVMQNALMLTDWQRELSGMNSATRGQFQKG-NKTR 490 (666) T ss_pred ---hhcccCCCCCcccceeehhhhccchhhhhccCCcc--ccchhHHHhhhHHHhhhHHHhhccCCccccccccc-Ccce Confidence 22222211111 1122223333333332 3333333 3445566678889988888853221 3444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHhcC-cceEEEecccccHHH Q lcl|NC_019423. 489 AGIRGALDAASKREMAILRRLAK-GMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDLKG-NFDIEVDINTAEIDN 566 (756) Q Consensus 489 ~~i~~~~~aa~~~l~~~~~n~~~-~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~~~-~~Dv~V~~g~a~~~~ 566 (756) .+-.-.|-.+..|++..+=-+.. .+..+-+++.-.+.+|.++..+|-=.-.+.+.|+-+.++. -..+.+.+|....+. T Consensus 491 ~E~~~~MG~a~NRmRLPALiLEH~~F~~iK~~L~LNl~~YG~DT~ViS~RtG~~~~vDi~~L~~~~L~F~~~DGlTP~SK 570 (666) T protein:vir:96 491 AEFDTIMGNAENRMRLPALILEHRMFTKIKEQLKLNLLMYGEDTEVISPRTGKGVRVDIKELQDLGLKFELGDGLTPASK 570 (666) T ss_pred eehhhhcCCcccceehhhHHHhhhhhhhHHHHHhhhhhhccccchhcccccCceeeeeHHHHhhhhheeeeccCCCchhh Confidence 44455555666666665555554 3344555555556678777776654323467777776664 223334454321111 Q ss_pred -HHHHHHHHHHH----------HhhccCCHhHHHHHHHHHHhhcCChhHHHHhhhccCCCChhhhhHHHHHHHHHHHHHH Q lcl|NC_019423. 567 -QKSQDLGFMVQ----------TLGNTVDQSITLSLVAKIAELKRMPDLAHELRTWQPQPDPMEEQLKQLAIQKAQLENE 635 (756) Q Consensus 567 -~~~q~l~~llq----------~~~~~~~~~~~~~~l~~l~e~~~~~~~~~~l~~~~~q~~p~~~~~~q~~~~~aq~e~~ 635 (756) .....+..+|| .+|+.+|. +++-++.+.|.....++--.. -|+=+ ..=-.+++.|+-.. 
T Consensus 571 lASs~~lT~~LQMI~sS~~~~~A~G~~~P~-----M~AHl~QLGGVRG~E~Y~~~A----LPqwq-itygm~Q~LQ~~~L 640 (666) T protein:vir:96 571 LASSDFLTALLQMIMSSETTLQAFGTQVPG-----MIAHLAQLGGVRGFEKYANAA----LPQWQ-ITYGMQQQLQQMLL 640 (666) T ss_pred hhhhHHHHHHHHHHhcchhhHhhhcccchH-----HHHHHHHhccccchhhccccc----Ccchh-hhhhhhHHHHHHHH Confidence 11112222222 34455543 334566777766555442111 11000 00000111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 636 ELQSKIALNNAKAKEAASSGDLKDLDYLEQESG 668 (756) Q Consensus 636 ~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~ 668 (756) +++.|.+. +-++. |.++...+. +-++ T Consensus 641 Q~~~QSA~-Q~~A~----Q~~L~~~Q~--~PSq 666 (666) T protein:vir:96 641 QLQQQSAM-QLQAR----QGELSNDQS--QPSQ 666 (666) T ss_pred HHhhhhcc-ccccc----cccCccccc--CCCC Confidence 11111000 00000 111110000 0000 No 116 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=99.29 E-value=1.8e-10 Score=74.01 Aligned_cols=468 Identities=11% Similarity=0.091 Sum_probs=203.2 Q ss_pred hHHHHHHHHHHHHH-----------------HHHhhHHHHHHHHHHHHhccccCCCCCC--CCC----CCcccCHHHHHH Q lcl|NC_019423. 25 EPSIQLLKGDLESA-----------------KPAHDAIMSQIREWNDLMEVKGKAKPPK--IKG----RSQVQPRLVRRQ 81 (756) Q Consensus 25 ~~~~~~l~~~~~~a-----------------~~~~~~~~~~~~~~~~~y~~~~~~~~~~--~~g----rS~~v~~~v~~~ 81 (756) =-++..|+.-++.- ..-+..++.+..+|..||.|... .... ..| |.....+.-... 
T Consensus 1 m~~~~~~k~~~~k~~~~~~~~~~~~i~~~~~i~~~~~~~~~i~~~~~~y~g~~~-~~~~~~~~~~~~~~~~~slnl~~~i 79 (522) T protein:vir:47 1 MSLFQKVKDFFSRGRYYMQTSNLNSILEHPKIAVTQEEYDRIKRNLVYYQSKWD-DVQYKNTDGDIKSRPMNHLPIARTA 79 (522) T ss_pred CchHHHHHHHHHHHHHHhhcccchhccccCCCCCCHHHHHHHHHHHHHhcCCcc-cccccccCcchhcccceecchHHHH Confidence 00111111111110 11145566678889999987522 1000 111 112222333333 Q ss_pred HHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeee Q lcl|NC_019423. 82 AEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIK 161 (756) Q Consensus 82 ~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~ 161 (756) ++ -+++| .|+-..-+.+ +|. +.+++++.++. .|+-...++.++..++-.|.+++|+||+.. T Consensus 80 ~~-~~A~l---v~~e~~~i~v-----~d~----~~~~~l~~~l~-~n~f~~~~~~~~e~a~a~G~~a~k~~~d~~----- 140 (522) T protein:vir:47 80 SK-KIASL---VYNEQATITT-----KNE----ILQKFLDDMLT-NDRFNKNFERYLESCLALGGLAMRPYIDGD----- 140 (522) T ss_pred HH-HHhhh---hcCCcceeec-----CCh----HHHHHHHHHHh-hcchHHHHHHHHHHhhccCCEEEEEEEcCC----- Confidence 32 23333 2443333333 343 45557766654 444456688899999999999999999621 Q ss_pred eeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEEE Q lcl|NC_019423. 162 TETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVEM 241 (756) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie~ 241 (756) +++|+. T Consensus 141 --------------------------------------------------------------------------~~~i~~ 146 (522) T protein:vir:47 141 --------------------------------------------------------------------------KVRVAF 146 (522) T ss_pred --------------------------------------------------------------------------ceEEEE Confidence 134455 Q ss_pred echhheEe-CCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceEEE Q lcl|NC_019423. 
242 LNPNNVVI-DPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVA 320 (756) Q Consensus 242 V~p~~~~~-Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v 320 (756) |+++.|++ ..+... ...|-++.+..+..... .-.|..|+.+.+.......... . .+...-.| . T Consensus 147 v~ad~~~P~~~~~~~-~~e~a~~~~~~~~~~~~----~~~yt~lE~he~~~~~~~~~~~------~----~~~~~~~I-~ 210 (522) T protein:vir:47 147 IQAPVFFPLESNTQD-VSSAAILTKTIKSEGRK----NVYYTLVEFHEWVTADGQETGS------T----NDKKYYRI-T 210 (522) T ss_pred EcCCceEEEEEcCCc-eEEEEEEEEEEeecccc----eeEEEEEEEeeecccccccccc------c----ccCCceEE-E Confidence 55555553 111111 22233332222211000 0001112222221110000000 0 00000001 1 Q ss_pred EEEEEEeeccCCceeEEEEEE--EECCEEEEecccccCC-CccceEEe----eeeeecCcccCCchHHHhHHHHHHHHHH Q lcl|NC_019423. 321 YEYWGFYDINDDGSLEPIVAT--WIGSTLIRMENNPFPD-GKLPLVVV----PYMPRKRELFGEADAELLGDNQAILGAT 393 (756) Q Consensus 321 ~E~w~k~d~~~~g~~~~~~~~--~~g~~~L~~~~~P~~~-~~~Pfv~~----~~~~~~~~~~G~g~v~~~~d~Q~~iN~~ 393 (756) +++|.-.+-+.-|......-+ |.+ |.. .-.+.+ .+.+|+.+ +.....++.+|.|++.++++..+.+|.. T Consensus 211 n~ly~~~~~~~lG~~v~l~~~~e~~~---l~~-~~~~~~~~~Plf~y~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~ 286 (522) T protein:vir:47 211 NELYRSDVNDVLGQRVNLSELDKYKN---LEP-VTVFENLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRS 286 (522) T ss_pred EEEeecCCCcccCccccccccccccC---CCC-ceEeCCCCcceEEEecCCcccccccCCCcCCchhhhhHHHHHHHHHH Confidence 122211000000110000000 100 000 000111 22234333 2233457889999999999999999999 Q ss_pred HHHHHHHHHhhcCCceEeeccccCccchh---------hhhcccccccccccccc--ccccccccCCCcchHHHHHHHHH Q lcl|NC_019423. 394 MRGMIDLLGRSANGQRGYPKGMLDTLNRR---------RYDDGQDYEYNPMQGNP--SQSIMEHKFPELPQSAIVMTQMQ 462 (756) Q Consensus 394 ~~~~~d~l~~~~~~~~~~~~gav~~~~~~---------~~~~~~~~~~~~~~~~~--~~~i~~~~~~~~~~~~~~~l~~~ 462 (756) +++.++-+.++ ..++.+++..+...... .++. ....+..+.... 
...+...++.-....+...++.+ T Consensus 287 ~s~~~~e~~~g-~~~i~v~~~~l~~~~~~~~g~~~~~~~fd~-~~~~f~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~ 364 (522) T protein:vir:47 287 YDEFMWEVRMG-QRRVIVPEHLTQRQYQRPDGTIDFRPRFDV-EQNVYMQIGGSSMDAGGITDLTSPIRANDYILAISEG 364 (522) T ss_pred HHHHHHHHHhc-cceeecchHHhccCCCCCCcccccccccCc-ccceEeecCCCCCCCCcceeeccccChHHHHHHHHHH Confidence 99999887654 44788877665321110 0111 111122222221 23455544333344566677888 Q ss_pred HHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh--CCCCcEEEEecCc Q lcl|NC_019423. 463 NQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVF--LSEKEVVRITNEQ 540 (756) Q Consensus 463 ~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~--~~~~r~iRI~g~~ 540 (756) ...++...|++....|.++.. .+||+++....+..-+....+.+.|..+++++.+.++.+...+ +... T Consensus 365 l~~i~~~~gls~~tf~~~~~~-~kTAtEi~s~~~~~~~t~~~~~~~~~~al~~lv~~i~~l~~~~~~~~~~--------- 434 (522) T protein:vir:47 365 LKLFEMQIGVSSGMFTFDGQG-MKTATEIVSENSDTYQMRSSIVALVEQSIKELCVSMCELGKAVGVYSGE--------- 434 (522) T ss_pred HHHHHHHhCCCccccCccccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCC--------- Confidence 888888889988777766553 4789988877777777778888889999999999999887532 1110 Q ss_pred eeecCHhHhcCcceEEEecccc--cHHHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCChh--HHHHhhhccCCC- Q lcl|NC_019423. 541 YVEIKREDLKGNFDIEVDINTA--EIDNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMPD--LAHELRTWQPQP- 615 (756) Q Consensus 541 ~v~i~~d~~~~~~Dv~V~~g~a--~~~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~~--~~~~l~~~~~q~- 615 (756) ....++|+|+-+.+ .-.....++.+++.. . ..++... .+++..|..+ +.+.+.+++... 
T Consensus 435 --------~~~~~~i~v~f~D~i~~D~~~~~~~~~~~v~-a-G~~s~e~------~i~~~~g~~eeea~~el~ri~~E~~ 498 (522) T protein:vir:47 435 --------IPELDDISVNLDDGVFTDRHAELDYWAKMVA-A-GFSTKKR------AIGKTLNISGVEAEKELNAINSELL 498 (522) T ss_pred --------CCCcceeEEEcCCCCCCCHHHHHHHHHHHHh-c-CCCCHHH------HHHhcCCCChHHHHHHHHHHHHhhc Confidence 01234444443332 222222233333221 1 1222211 1223333321 112222111100 Q ss_pred --Chh---hhhHHHHHHHHHHHHH Q lcl|NC_019423. 616 --DPM---EEQLKQLAIQKAQLEN 634 (756) Q Consensus 616 --~p~---~~~~~q~~~~~aq~e~ 634 (756) .|. ...+.-.++.....+- T Consensus 499 ~~~~~~~~~~~~~~~~~~~~d~~~ 522 (522) T protein:vir:47 499 PMNDAELAIYGMHDQNEEKADDKG 522 (522) T ss_pred cCCCCCCCCCCCCCcccccCCCCC Confidence 000 0000000000000010 No 117 >protein:vir:102239 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655205;genbank:gi:109522785;genbank:GeneID:4157478 Probab=99.18 E-value=6.3e-10 Score=71.03 Aligned_cols=502 Identities=12% Similarity=0.069 Sum_probs=205.2 Q ss_pred CCcccCCCCCCCccccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccc-cCCCCCCCCCCCcccCHHHH Q lcl|NC_019423. 1 MEHQDTFKPLPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVK-GKAKPPKIKGRSQVQPRLVR 79 (756) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~-~~~~~~~~~grS~~v~~~v~ 79 (756) |.-+..---+..|=+-... .+.|- ..+.+.++- ..-+.=.+||++. +....+ .+|+ +-+ T Consensus 1 ~~~~~~~~~~~~~~~~g~~-~~p~~------v~~~d~~Rl------~aY~l~~~~y~n~~~~~~~~-lrg~------~~~ 60 (527) T protein:vir:10 1 MGQDKRQYGSTQQLRAGEA-NFPNA------VTDFDKARL------ASYRLYEDMYLTNTSDYQVI-LRGG------DEG 60 (527) T ss_pred CCccccccCCCcCcCCccc-cCccc------CCHHHHHHH------HHHHHHHHHhcCchhheeee-cCCc------ccc Confidence 4322211100011111111 23331 112222111 1111223566543 111111 1111 122 Q ss_pred HHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeee Q lcl|NC_019423. 
80 RQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVK 159 (756) Q Consensus 80 ~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~ 159 (756) ++.+-..||. +..++...-|-+.+....+...++.....++-.+.++|-..+ ...--+++++.|-|++++.|+.+.++ T Consensus 61 ~~r~~~~ps~-~~~~~~~~~~~~~g~~~~~~~~~e~v~~~lr~~~~~e~l~~~-~~~~~r~~~vlGDg~f~l~wD~~k~~ 138 (527) T protein:vir:10 61 DQRPIYVPNG-EKLIEAKMRFLGQGLKWEFSKKDAKVDDAIRVLFDRENWEQK-FESLKRWTEIRGDYVLLLIGDDEKDE 138 (527) T ss_pred ccceeeehhh-HHhhCCcceeeccCccccccchhHHHHHHHHHHHHHhhhHHH-HHHHHHhhhhhcceeEEEeeccCCCc Confidence 3334444554 333566666667777666766777777777766665444443 44566889999999999999976431 Q ss_pred ---eeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCc Q lcl|NC_019423. 160 ---IKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNR 236 (756) Q Consensus 160 ---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~ 236 (756) ...++ .|+.+ + +.. +.. .+. T Consensus 139 ~~R~~v~~--------~DP~~-----------------~------------------------------f~~-ed~-d~~ 161 (527) T protein:vir:10 139 GSRLSLHE--------VDPST-----------------Y------------------------------FPY-EDP-RYP 161 (527) T ss_pred CCCceEee--------cCcce-----------------e------------------------------eee-ecC-CCC Confidence 21111 11100 0 000 000 000 Q ss_pred eeEEEechhheEeCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccc Q lcl|NC_019423. 237 PTVEMLNPNNVVIDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRK 316 (756) Q Consensus 237 ~~ie~V~p~~~~~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~ 316 (756) =.+..|+.-+-|..|... ...-+|.++++.++. ++.. +... 
..+ T Consensus 162 ~~v~~v~~~~~~~~P~d~---~~~~~~ar~~~~~~~-----------l~~~--------g~~~-------------~~G- 205 (527) T protein:vir:10 162 GQVLGVYLVDEYPHPDSE---KKNEKCARVQKYMKT-----------LDDD--------GKPV-------------PGG- 205 (527) T ss_pred CceeeEEEeeeccCCccc---cccceehhhhhhhhh-----------cCcc--------cccc-------------cCc- Confidence 012222211123333321 111133333333321 1000 0000 001 Q ss_pred eEEEEE-EEE--Eee-ccCCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHH Q lcl|NC_019423. 317 KVVAYE-YWG--FYD-INDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGA 392 (756) Q Consensus 317 ~V~v~E-~w~--k~d-~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~ 392 (756) ++++.+ .|. +++ .+.-..-.-.+-+..++.+++..++|+ +.+|+|+++-.+.+++.||+|-+.+++.+++++|+ T Consensus 206 ~~~yt~~~w~lg~w~d~~e~p~~~~~~~~~~~~~~l~~lp~pi--~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~ 283 (527) T protein:vir:10 206 AIKYTEELYEPGKWDDRPESPLEPDDIKKLSTLTEEEPLPEQI--TTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQ 283 (527) T ss_pred ceeeeeceeeccccccccccccchhhhhhhcCceeeecccCCC--CccceEeecCCCccccccChhhHhHHHHHHHHHhh Confidence 233333 343 121 110000011234566788887777776 67999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhhcCCceEeeccccCccchhhhhccccccccc-cccccccccccccCCCc--chHHHHHHHHHHHHHHHH Q lcl|NC_019423. 393 TMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNP-MQGNPSQSIMEHKFPEL--PQSAIVMTQMQNQEAESL 469 (756) Q Consensus 393 ~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~-~~~~~~~~i~~~~~~~~--~~~~~~~l~~~~~~~e~~ 469 (756) .++....++..+++|.+... |+ ...+... +. .++.+++ ..+.-+..-+....... 
-..+...+..+...+.++ T Consensus 284 ~~Td~s~is~~sG~Pi~~~t-g~-~~vd~~G-~~-~~~~VgPG~iweL~e~ak~~~v~~~~~la~~~~h~~~L~~~l~~v 359 (527) T protein:vir:10 284 TMTDEDLIMVFGGLGFYATD-SA-PPRDSRG-NM-VPWTISPLGMVEHGQNNKIYRVNGVASLEPSQTHMNKAEEAMQQT 359 (527) T ss_pred hhhHHHHHHHHhCCceeeec-cc-ccccccC-Cc-CccccCCceeEecCCCcceeeccchhhhHHHHHHHHHHHHHHHHh Confidence 99999999999988877653 32 2221111 10 1111111 11111111222222222 233556678888899999 Q ss_pred hchhHHhcCCCc--cccchhHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHhhCCCCcEEEEecCcee Q lcl|NC_019423. 470 TGVKAFSGGVTG--SAYGDVAAGIRGALDAASKREMAILRRLA-----KGMADIGTKICAMNAVFLSEKEVVRITNEQYV 542 (756) Q Consensus 470 tGv~~~~~G~~~--~a~~~tA~~i~~~~~aa~~~l~~~~~n~~-----~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v 542 (756) +|++....|.-+ +..|+.|-.++. +-. +.++-+ .++.+.+ ..+++..++.--+-+-+ T Consensus 360 A~~PavA~G~vD~s~~~SG~ALeL~L----~PL----lar~~rk~L~~~~Vqrq~--~~~~~~~~L~aye~v~~------ 423 (527) T protein:vir:10 360 KGIPDIAVGVVDAAVAESGIALDLKL----SAI----LSSCAEQELELKSVLKQF--FYNLVTQWLPAYEGVGI------ 423 (527) T ss_pred hcCCeeeeccccCCcCcHHHHHHHHH----HHH----HHHHHHHHHHHHHHHHHh--hhhhHHHHHHHhhhccc------ Confidence 999999999433 333444433221 100 111111 1111111 11122222111111111 Q ss_pred ecCHhHhcCcceEEEecccccHHH--HHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCChhHHHHhhhccCCCChhhh Q lcl|NC_019423. 543 EIKREDLKGNFDIEVDINTAEIDN--QKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMPDLAHELRTWQPQPDPMEE 620 (756) Q Consensus 543 ~i~~d~~~~~~Dv~V~~g~a~~~~--~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~~~~~~l~~~~~q~~p~~~ 620 (756) .+.-...++.|.-++..... +..+++..+.+. ..+... -.+.++.+.+++.+....++.+. 
T Consensus 424 ----~d~~~~~~v~ivf~p~lP~D~~avie~v~tL~~a--GiiS~e---tAv~~L~~~~g~eD~E~E~~~I~-------- 486 (527) T protein:vir:10 424 ----DDADKKLTVTITFRDPKPVNNEKRFAQLLELWEA--GLIPAK---KLTEELSKIMGFELTEEDFRQAT-------- 486 (527) T ss_pred ----CCCccccceEEEecccCCCCHHHHHHHHHHHHHc--CchhHH---HHHHHHHhccCCCchHHHHHHHH-------- Confidence 11112334555556554322 222222222211 112211 22334444555554443333211 Q ss_pred hHHHHHHHHHHHHHHH-HHHHHHHHH----HHHHHHHHHHHH Q lcl|NC_019423. 621 QLKQLAIQKAQLENEE-LQSKIALNN----AKAKEAASSGDL 657 (756) Q Consensus 621 ~~~q~~~~~aq~e~~~-~qa~a~~~~----a~a~~~~aq~~~ 657 (756) ....++.++++++.- ..+++.-.. .+....-.-.-+ T Consensus 487 -~era~~a~a~a~a~~~~~a~~~~~~g~~~~~~d~~~~~~~~ 527 (527) T protein:vir:10 487 -EDKKTQGIAQAEAADPFGAQMAAEQGIPDEEDDQALNGQPL 527 (527) T ss_pred -HHHHHHhHHhhhhcCchhhhhccccCCCCCCcccccCCCCC Confidence 111112222221110 001100000 000000000000 No 118 >protein:vir:101494 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655388;genbank:gi:109522576;genbank:GeneID:4157566 Probab=99.18 E-value=4.8e-10 Score=71.69 Aligned_cols=502 Identities=12% Similarity=0.069 Sum_probs=205.6 Q ss_pred CCcccCCCCCCCccccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccc-cCCCCCCCCCCCcccCHHHH Q lcl|NC_019423. 1 MEHQDTFKPLPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVK-GKAKPPKIKGRSQVQPRLVR 79 (756) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~-~~~~~~~~~grS~~v~~~v~ 79 (756) |.-+..---+..|=+-... ++.|- ..+.+.++- ..-+.=.+||++. 
+....+ .+|+ +-+ T Consensus 1 ~~~~~~~~~~~~~~~~g~~-~~p~~------v~~~d~~Rl------~aY~l~~~~y~n~~~~~~~~-lrg~------~~~ 60 (527) T protein:vir:10 1 MGQDKRQYGSTQQLRAGEA-NFPNA------VTDFDKARL------ASYRLYEDMYLTNTSDYQVI-LRGG------DEG 60 (527) T ss_pred CCccccccCCCcCcCCccc-cCccc------CCHHHHHHH------HHHHHHHHHhcCchhheeee-cCCc------ccc Confidence 4322211100011111111 23331 112222111 1111223566543 111111 1111 122 Q ss_pred HHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeee Q lcl|NC_019423. 80 RQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVK 159 (756) Q Consensus 80 ~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~ 159 (756) ++.+-..||. +..++...-|-+.+....+...++.....++-.+.++|-..+ ...--+++++.|-|++++.|+.+.++ T Consensus 61 ~~r~~~~ps~-~~~~~~~~~~~~~g~~~~~~~~~e~v~~~lr~~~~~e~l~~~-~~~~~r~~~vlGDg~f~l~wD~~k~~ 138 (527) T protein:vir:10 61 DQRPIYVPNG-EKLIEAKMRFLGQGLKWEFSKKDAKVDDAIKVLFDRENWEQK-FESLKRWTEIRGDYVLLLIGDDEKDE 138 (527) T ss_pred ccceeeehhh-HHhhCCcceeeccCccccccchhHHHHHHHHHHHHHhhhHHH-HHHHHHhhhhhcceeEEEeeccCCCc Confidence 3334444554 333566666667777666766677777777766665444443 44566889999999999999976431 Q ss_pred ---eeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCc Q lcl|NC_019423. 160 ---IKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNR 236 (756) Q Consensus 160 ---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~ 236 (756) ...++ .|+.+ + +.. +.. .+. T Consensus 139 ~~R~~v~~--------~DP~~-----------------~------------------------------f~~-ed~-d~~ 161 (527) T protein:vir:10 139 GSRLSLHE--------VDPST-----------------Y------------------------------FPY-EDP-RYP 161 (527) T ss_pred CCCceEee--------cCcce-----------------e------------------------------eee-ecC-CCC Confidence 21111 11100 0 000 000 000 Q ss_pred eeEEEechhheEeCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccc Q lcl|NC_019423. 
237 PTVEMLNPNNVVIDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRK 316 (756) Q Consensus 237 ~~ie~V~p~~~~~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~ 316 (756) =.+..|+.-+-|..|... ...-+|.++++.++. ++.. +... ..+ T Consensus 162 ~~v~~v~~~~~~~~P~d~---~~~~~~ar~~~~~~~-----------l~~~--------g~~~-------------~~G- 205 (527) T protein:vir:10 162 GQVLGVYLVDEYPHPDSE---KKNEKCARVQKYMKT-----------LDDD--------GKPV-------------PGG- 205 (527) T ss_pred CceeeEEEeeeccCCccc---cccceehhhhhhhhh-----------cCcc--------cccc-------------cCc- Confidence 012222211123333321 111133333333321 1000 0000 001 Q ss_pred eEEEEE-EEE--Eee-ccCCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHH Q lcl|NC_019423. 317 KVVAYE-YWG--FYD-INDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGA 392 (756) Q Consensus 317 ~V~v~E-~w~--k~d-~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~ 392 (756) ++++.+ .|. +++ .+.-..-.-.+-+..++.+++..++|+ +.+|+|+++-.+.+++.||+|-+.+++.+++++|+ T Consensus 206 ~~~yt~~~w~lg~w~d~~e~p~~~~~~~~~~~~~~l~~lp~pi--~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~ 283 (527) T protein:vir:10 206 AIKYTEELYEPGKWDDRPESPLEPDDIKKLSTLTEEEPLPEQI--TTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQ 283 (527) T ss_pred ceeeeeceeeccccccccccccchhhhhhhcCceeeecccCCC--CccceEeecCCCccccccChhhHhHHHHHHHHHhh Confidence 233333 343 121 110000011234566788887777776 67999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhhcCCceEeeccccCccchhhhhccccccccc-cccccccccccccCCCc--chHHHHHHHHHHHHHHHH Q lcl|NC_019423. 393 TMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNP-MQGNPSQSIMEHKFPEL--PQSAIVMTQMQNQEAESL 469 (756) Q Consensus 393 ~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~-~~~~~~~~i~~~~~~~~--~~~~~~~l~~~~~~~e~~ 469 (756) .++....++..+++|.+... |+ ...+... +. .++.+++ ..+.-+..-+....... 
-..+...+..+...+.++ T Consensus 284 ~~Td~s~is~~sG~Pi~~~t-g~-~~vd~~G-~~-~~~~VgPG~iweL~e~ak~~~v~~~~~la~~~~h~~~L~~~l~~v 359 (527) T protein:vir:10 284 TMTDEDLIMVFGGLGFYATD-SA-PPRDSRG-NM-VPWTISPLGMVEHGQNNKIYRVNGVASLEPSQTHMTKAEEAMQQT 359 (527) T ss_pred hhhHHHHHHHHhCCceeeec-cc-ccccccC-Cc-CccccCCceeEecCCCcceeeccchhhhHHHHHHHHHHHHHHHHh Confidence 99999999999988877653 32 2221111 10 1111111 11111111222222222 233556677888899999 Q ss_pred hchhHHhcCCCc--cccchhHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHhhCCCCcEEEEecCcee Q lcl|NC_019423. 470 TGVKAFSGGVTG--SAYGDVAAGIRGALDAASKREMAILRRLA-----KGMADIGTKICAMNAVFLSEKEVVRITNEQYV 542 (756) Q Consensus 470 tGv~~~~~G~~~--~a~~~tA~~i~~~~~aa~~~l~~~~~n~~-----~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v 542 (756) +|++....|.-+ +..|+.|-.++.. -. +.++-+ .++.+.+ ..+++..++.--+-+-+ T Consensus 360 A~~PavA~G~vD~s~~~SG~ALeL~L~----PL----lar~~rk~L~~~~vqrq~--~~~~~~~~L~aye~v~~------ 423 (527) T protein:vir:10 360 KGIPDIAVGVVDAAVAESGIALDLKLS----AI----LSSCAEQELELKSVLKQF--FYNLVTQWLPAYEGVGI------ 423 (527) T ss_pred hcCCeeeeccccCCcCcHHHHHHHHHH----HH----HHHHHHHHHHHHHHHHHh--hhhhHHHHHHHhhhccc------ Confidence 999999999433 3334444332211 00 111111 1111111 11122222111111111 Q ss_pred ecCHhHhcCcceEEEecccccHHH--HHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCChhHHHHhhhccCCCChhhh Q lcl|NC_019423. 543 EIKREDLKGNFDIEVDINTAEIDN--QKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMPDLAHELRTWQPQPDPMEE 620 (756) Q Consensus 543 ~i~~d~~~~~~Dv~V~~g~a~~~~--~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~~~~~~l~~~~~q~~p~~~ 620 (756) .+.-...++.|.-++..... +..+++..+.+. ..+... -.+.++.+.+++.+....++.+. 
T Consensus 424 ----~d~~~~~~v~ivf~p~lP~D~~avie~v~tL~~a--Gi~S~~---tAv~~L~~~~g~eD~E~E~~~I~-------- 486 (527) T protein:vir:10 424 ----DDADKKLTVTITFRDPKPVNSEKRFNQLLQLWEA--GLIPAK---KLTEELSKIMGFELTEEDFKQAT-------- 486 (527) T ss_pred ----CCCccccceEEEecccCCCCHHHHHHHHHHHHHc--CchhHH---HHHHHHHhccCCCChHHHHHHHH-------- Confidence 11112334555556554322 222222222111 112221 22334445555554443333221 Q ss_pred hHHHHHHHHHHHHHHH-HHHHHHHHH----HHHHHHHHHHHH Q lcl|NC_019423. 621 QLKQLAIQKAQLENEE-LQSKIALNN----AKAKEAASSGDL 657 (756) Q Consensus 621 ~~~q~~~~~aq~e~~~-~qa~a~~~~----a~a~~~~aq~~~ 657 (756) ....++.++++++.- ..+++.-.. .+....-.-.-+ T Consensus 487 -~era~~a~a~a~A~~~~~a~~~~~~g~~~~~~d~~~~~~~~ 527 (527) T protein:vir:10 487 -EDKKTQGIAQAEAADPFGAQMAAEQGIPDEEDDQALNGQPL 527 (527) T ss_pred -HHHHHHhHHhhhhcCchhhhhccccCCCCCCcccccCCCCC Confidence 111112222221110 011100000 000000000000 No 119 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=99.03 E-value=4.8e-09 Score=66.19 Aligned_cols=472 Identities=10% Similarity=0.042 Sum_probs=184.9 Q ss_pred CCCCCCCcc-ccccccCCCchH-HHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCC--CCCCCCC-----CcccCH Q lcl|NC_019423. 6 TFKPLPDPA-QSEKLTDWKKEP-SIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAK--PPKIKGR-----SQVQPR 76 (756) Q Consensus 6 ~~~~~~~~~-~~~~~~~~~~~~-~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~--~~~~~gr-----S~~v~~ 76 (756) -+.|++.-. +..|..++.++. +...+...+......|.....+.++-.+||.|..... 
+.....+ -..|.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~l~~~l~~~~~~~~~rl~~l~~YY~G~~~~~~~~~~~~~~~~~~~~~~v~n 80 (501) T protein:vir:25 1 MTVPVDVIADAPAADVEFPEDSMSREQLGALVADMWRLHISERQWLDRIYEYTKGLRGRPEVPEGASDEVKELAKLSVKN 80 (501) T ss_pred CcccchhhhccCcccccCCcccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhccccCChhhhhhHhhhhcC Confidence 333433211 344555554433 2222333344444556666667778889999864211 1111111 123444 Q ss_pred HHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeee Q lcl|NC_019423. 77 LVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERK 156 (756) Q Consensus 77 ~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~ 156 (756) -.+..|+.....| + |.+.+-+|....+ .+. .+...|+--.....+++++++.|.+++.+|++.. T Consensus 81 ~~~~ivd~~a~~l---~--------~~gf~~~d~~~~~----~l~-~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~de~ 144 (501) T protein:vir:25 81 VLSLVRDSFAQNL---S--------VVGYRNALAKEND----PAW-EMWQRNRMDARQAEVHRPALTYGASYVTVTPTDE 144 (501) T ss_pred hHHHHHHHHHhhh---c--------ccceecCCccchH----HHH-HHHHhcChhHHHHHHHHHHhhcCceEEEEecCCC Confidence 4444555332222 1 1122222322222 222 2233444333345688999999998777643200 Q ss_pred eeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCc Q lcl|NC_019423. 157 TVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNR 236 (756) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~ 236 (756) . T Consensus 145 -------------------------------------------------------------------------------~ 145 (501) T protein:vir:25 145 -------------------------------------------------------------------------------G 145 (501) T ss_pred -------------------------------------------------------------------------------C Confidence 1 Q ss_pred eeEEEechhheE--e-CCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhcccccccccccc Q lcl|NC_019423. 
237 PTVEMLNPNNVV--I-DPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDA 313 (756) Q Consensus 237 ~~ie~V~p~~~~--~-Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 313 (756) ++|..++|.+++ | ||..... ..++++ .+....+- + T Consensus 146 ~~i~~~sp~~~~~iy~D~~~~~~---~~~ai~-~~~~~~~~----~---------------------------------- 183 (501) T protein:vir:25 146 PVFRTRSPRQILAVYADPSVDAW---PQYALE-TWVAQKDA----K---------------------------------- 183 (501) T ss_pred CeEEEeccccEEEEEecCCCCcc---eeEEEE-EEeecccc----C---------------------------------- Confidence 235556777763 3 5554321 223222 22211100 0 Q ss_pred ccceEEEE--EEEEEeeccC-------Cce--eEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHHH Q lcl|NC_019423. 314 LRKKVVAY--EYWGFYDIND-------DGS--LEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAEL 382 (756) Q Consensus 314 s~~~V~v~--E~w~k~d~~~-------~g~--~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~ 382 (756) ...++.+| .+++.+...+ .+. .........++. ......|-+.+.+||+.++..+.. +.+|.|.++. T Consensus 184 ~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~vPiv~f~N~~~~-~~~g~sdie~ 261 (501) T protein:vir:25 184 PHRRGVLYDDTYMYELDLGEVVLGDAGGGQATQQPVNVREVTDV-IEHGATFEGKPVCPVVRFVNGRDA-DDMIVGEVAP 261 (501) T ss_pred cceeEEEecCeeEEEEecCceeeeeccccccccccccccccccc-cccccccCCccceeeEeccCcccc-Cccccchhhh Confidence 00001111 0001110000 000 000001111111 111222334567888888776654 4568999999 Q ss_pred hHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhhhhccccccccccccccccccccccCCCcc-hHHHHHHHH Q lcl|NC_019423. 383 LGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQGNPSQSIMEHKFPELP-QSAIVMTQM 461 (756) Q Consensus 383 ~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~-~~~~~~l~~ 461 (756) ++++++.+|+.++.+.......+.|+..+ .|.-.. +...++... +.+-..++...+..+.+... +.+...+.. 
T Consensus 262 v~~l~Da~~~~~s~~~~~~e~~a~p~~~i-~G~~~~-~~~~~~~~~----~~i~~~~~~~~~~~q~~~~~~~~~~~~l~~ 335 (501) T protein:vir:25 262 LILLQQAINSVNFDRLIVSRFGANPQRVI-SGWTGS-KAEVLKASA----LRVWTFEDPEVKAQAFPPASVEPYNLILEE 335 (501) T ss_pred hHHHHHHHHHHHHHHHHHHHhhccHHHHH-hCCCCC-ccchhhhcc----cceeccCCCCceEEEecccChHHHHHHHHH Confidence 99999999999999998888888775433 232111 111111111 11111122334444444322 233344555 Q ss_pred HHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecCce Q lcl|NC_019423. 462 QNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNEQY 541 (756) Q Consensus 462 ~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~ 541 (756) ....+-..|++++...|.... |.+|.++......-........+.|..+++.++++++.+. ....- T Consensus 336 ~i~~i~~~s~~P~~~~~~~~~--N~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~rl~~~~~----~~~~~-------- 401 (501) T protein:vir:25 336 MLQHVAMVAQISPAQVTGKMI--NVSAEALAAAEANQQRKLAAKRESFGESWEQLLRLAAEMD----DDPDT-------- 401 (501) T ss_pred HHHHHHhhcCCChhhhccccC--ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----CCCcc-------- Confidence 555555667788887773322 2244344444444444555556667777777666554332 21100 Q ss_pred eecCHhHhcCcceEEEecccccH--HHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCChhHH-HHhhhccCCCChh Q lcl|NC_019423. 542 VEIKREDLKGNFDIEVDINTAEI--DNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMPDLA-HELRTWQPQPDPM 618 (756) Q Consensus 542 v~i~~d~~~~~~Dv~V~~g~a~~--~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~~~~-~~l~~~~~q~~p~ 618 (756) ...+++.|.=..... ..+.+..+..+.+. + ++...+ +.+++|+.+.. +.++. T Consensus 402 --------~~~~~i~v~w~~~~~~s~~~~ada~~kl~~~-g--is~et~------~~~~~g~~~~~ie~~~~-------- 456 (501) T protein:vir:25 402 --------AADSGAEVLWRDTEARSFGAVVDGITKLASA-G--IPIEHL------LSMVPGMTQQTIQAIKD-------- 456 (501) T ss_pred --------ccceeeeEEecCCCCCCHHHHHHHHHHHHhc-C--CCHHHH------HHHcCCCCHHHHHHHHH-------- Confidence 011234333222211 11122222222211 1 232211 12223332100 00000 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 
619 EEQLKQLAIQKAQLENEELQSKI--ALNNAKAKEAASSGDLKDLDYLEQESGT 669 (756) Q Consensus 619 ~~~~~q~~~~~aq~e~~~~qa~a--~~~~a~a~~~~aq~~~~~~~~~~q~~~~ 669 (756) +.+.+.++.....+.+.. +......+.. ++.+-.. . ..=..+. T Consensus 457 -----~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~-~-~~~~~g~ 501 (501) T protein:vir:25 457 -----SLRGGEVKSLVDKLLSNEPAPVPPPPPQAA-AQALNEG-G-VNGNGGA 501 (501) T ss_pred -----HHHHHhHHHHHHHhhccCcCCCCCCCCCCC-ccccccc-c-CCCCCCC Confidence 000000000000000000 0000000000 0000000 0 0000000 No 120 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=98.91 E-value=1.5e-08 Score=63.44 Aligned_cols=450 Identities=10% Similarity=0.057 Sum_probs=180.9 Q ss_pred CCcccCCCCCCCccccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCCCCCCcccCHHH-- Q lcl|NC_019423. 1 MEHQDTFKPLPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAKPPKIKGRSQVQPRLV-- 78 (756) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~grS~~v~~~v-- 78 (756) |--..+. .+--+++++. .. ++.....+.....+.+.-.+||.|....+ . -+.-+++.. T Consensus 1 ~~~~~~~----------~~~gl~~~~~-~~----~~~L~~~~~~~~~~~~~~~~Yy~G~~~~~--~---~~~~~p~~~r~ 60 (474) T protein:vir:81 1 MIQQQTV----------RIPSLSNDEN-AL----INGLLAQIENLRWKNLLRTSYYENKRTIQ--Y---VGTLIPPQYFN 60 (474) T ss_pred CcCCCcC----------cCCCCChhHH-HH----HHHHHHHHHHHhhHHHHHHHHhccCCChh--h---ccccccHHHHH Confidence 3333322 2334444432 11 22222233334445556679999763321 1 111122222 Q ss_pred -HHHHHHHHHHHHHhhcCCCCEEEEe-cCCcchHHHHHHHHHHHHHHHhhhcCCcch-HHHHHHHHhhcCceEEEEeeee Q lcl|NC_019423. 79 -RRQAEWRYAPLSEPFLSSSKLFKLT-PVTFEDELAARQNELVLNYQFRTQLNKVKL-VDDYVHSIVDDGTGIARIGWER 155 (756) Q Consensus 79 -~~~~e~~~~~L~~~f~~~~~~~~~~-p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~-~~~~v~~al~~g~gi~k~~w~~ 155 (756) +..++|. +..++.|..--.+--|. |-+.+|.. .+. .+... |.+.. ...++++|++.|.+++.|+... 
T Consensus 61 ~~~v~nw~-~~~Vd~~a~rl~~~Gf~~~d~~~~~~-------~l~-~iw~~-N~ld~~~~~~~~~al~~G~sf~~V~~~~ 130 (474) T protein:vir:81 61 LGLVLGWT-GKAVDALARRCNLEGFVWPDGDLDSL-------GGT-EVVDD-NHLLSEIDSAIVAAMQHGPAFLINTVGE 130 (474) T ss_pred HHhhcChH-HHHHHHHHhhhcccceECCCCCccch-------HHH-HHHHh-cChhHHHHHHHHHHHhhCceeEEEecCC Confidence 1222221 22222222111112222 21111111 122 22223 33444 4457788888888877763210 Q ss_pred eeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecC Q lcl|NC_019423. 156 KTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVN 235 (756) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g 235 (756) ...+ T Consensus 131 ----------------------------------------------------------------------------d~~~ 134 (474) T protein:vir:81 131 ----------------------------------------------------------------------------DDEP 134 (474) T ss_pred ----------------------------------------------------------------------------CCCc Confidence 0112 Q ss_pred ceeEEEechhheE--eCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhcccccccccccc Q lcl|NC_019423. 236 RPTVEMLNPNNVV--IDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDA 313 (756) Q Consensus 236 ~~~ie~V~p~~~~--~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 313 (756) .+.|..++|.+++ |||..+. +. +-+. +...+. T Consensus 135 ~~~i~~~sp~~~~~~~D~~~~~-~~-~al~--~~~~~~------------------------------------------ 168 (474) T protein:vir:81 135 EALIHVKDASEATGEWNRRRRG-LN-NLLS--IIDKDK------------------------------------------ 168 (474) T ss_pred eeEEEEeccceEEEEEeCCCCc-ce-eeeE--EEEEcC------------------------------------------ Confidence 3667888888876 6775421 11 1111 111000 Q ss_pred ccceEEEEEEEEEeeccCCceeEEEEEEEECCE-EEEecccccCCCccceEEeeeeeecCcccCCchH-HHhHHHHHHHH Q lcl|NC_019423. 
314 LRKKVVAYEYWGFYDINDDGSLEPIVATWIGST-LIRMENNPFPDGKLPLVVVPYMPRKRELFGEADA-ELLGDNQAILG 391 (756) Q Consensus 314 s~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g~~-~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v-~~~~d~Q~~iN 391 (756) .+ +.+...+|.. +....+..-..+.. .....++|+ | .|+|+++..++-...+|.|-+ +.++++|+.+| T Consensus 169 ~g-~~~~~~ly~~------~~~~~~~~~~~~~~w~~~~~~~~~--g-vPvV~~~n~~~~~~~~G~s~i~e~v~~l~da~~ 238 (474) T protein:vir:81 169 EG-KVLSLALYLD------NETVTAQRDKATLKWQVDRDEHVY--G-VPAQVLPYKPAPKRPFGQSRITKPMMGLQDAGV 238 (474) T ss_pred CC-cEEEEEEEeC------CcEEEEEEcCccceeeeccCCCCC--C-cceEEecccccccCcCCccccchhHHHHHHHHH Confidence 00 0001111210 00000000001111 112234444 5 699999999888888998855 79999999999 Q ss_pred HHHHHHHHHHHhhcCCceEeeccccC----ccchhhhhccccc--ccccccccccc------ccccccCCCcc-hHHHHH Q lcl|NC_019423. 392 ATMRGMIDLLGRSANGQRGYPKGMLD----TLNRRRYDDGQDY--EYNPMQGNPSQ------SIMEHKFPELP-QSAIVM 458 (756) Q Consensus 392 ~~~~~~~d~l~~~~~~~~~~~~gav~----~~~~~~~~~~~~~--~~~~~~~~~~~------~i~~~~~~~~~-~~~~~~ 458 (756) +.+..+.......+.|+..+- |+-. +.+......+... ....+..+.+. ..+..+++... +.+... T Consensus 239 r~~~~~~~~~e~~a~pqr~i~-G~~~~~~~d~d~~~~~~~~~~~~~i~~~~~d~d~~~~~~~~~~~~q~~~a~l~~~~~~ 317 (474) T protein:vir:81 239 RELARREGHMDVFSYPEFWLL-GADESALKNADGTIKSVWEARLGRIKGLPDDADADIPQLARADVKQFPAASPDAHWSD 317 (474) T ss_pred HHHHHHHHHHHHhcchhheee-cCChhhcccccccccchhhhhHHHHhcCCCcccccccccccccccccCCCChhHHHHH Confidence 999999999999888875442 2211 1111000000000 00111111111 12333333322 223344 Q ss_pred HHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEec Q lcl|NC_019423. 
459 TQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITN 538 (756) Q Consensus 459 l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g 538 (756) +..+...+-..||++....|........+|.++......-........+.|..+++.++++++.+.-.+--++ T Consensus 318 l~~~~~~~a~~t~iP~~~lG~~~~~np~SaeAi~a~~~~l~~kae~k~~~fg~~l~~~~rla~~i~~~~~~~~------- 390 (474) T protein:vir:81 318 INGLAKLFAREASLPDTAVAISGLSNPTSAESYDASQYELIAEAEGAVDDFTPALRKAFIRALAMKNKVAIDE------- 390 (474) T ss_pred HHHHHHHHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccc------- Confidence 5555555666689999888854322224555555544444455556667788888888888876653321110 Q ss_pred CceeecCHhHhcCcceEEEecccc-cHH-HHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCChhHHHHhhhccCCCC Q lcl|NC_019423. 539 EQYVEIKREDLKGNFDIEVDINTA-EID-NQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMPDLAHELRTWQPQPD 616 (756) Q Consensus 539 ~~~v~i~~d~~~~~~Dv~V~~g~a-~~~-~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~~~~~~l~~~~~q~~ 616 (756) +.. ..+++.|.=... +.+ .+++.....+.+ .++.++.... +.+..|+.. T Consensus 391 -----~~~----~~~~~~v~W~d~~~~s~a~~aDa~~Kl~~-a~~~~~~~~~------~~~~lg~t~------------- 441 (474) T protein:vir:81 391 -----IPD----EWKSIDAKWRDPRYLSKSAQADAGMKQLA-AVPWLAETEV------GLELIGLTP------------- 441 (474) T ss_pred -----cch----hhccceeEecCCCccCHHHHHHHHHHHHh-cccCCCcHHH------HHhhcCCCH------------- Confidence 000 012232221111 111 111111222222 1222222111 223334320 Q ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 617 PMEEQLKQLAIQKAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKH 671 (756) Q Consensus 617 p~~~~~~q~~~~~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~ 671 (756) . +.+..+.+. .+.+++.... +.. .. 
...++..| T Consensus 442 -~-----~i~~~~~~~--~~~~~~~~~~-~l~---~~----------~~~~~~aq 474 (474) T protein:vir:81 442 -Q-----QARRAMADK--RRVQGRGTLQ-ALI---DR----------SNNGATAQ 474 (474) T ss_pred -H-----HHHHHHHHH--HHHhHHHHHH-HHH---hc----------CCCCCCCC Confidence 0 111111110 0001110000 000 00 00000000 No 121 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=98.88 E-value=2e-08 Score=62.79 Aligned_cols=493 Identities=11% Similarity=0.014 Sum_probs=192.7 Q ss_pred CCCCCCccccccccCCCchHHHHHHHHHHHHHHHHh-hHHHHHHHHHHHHhccccCC--CCCC-C-------C--CCC-- Q lcl|NC_019423. 7 FKPLPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAH-DAIMSQIREWNDLMEVKGKA--KPPK-I-------K--GRS-- 71 (756) Q Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~-~~~~~~~~~~~~~y~~~~~~--~~~~-~-------~--grS-- 71 (756) +-| +|-.|.=+.+-..|..++.. ++ +....+..+..+||.|.-.- .+.. . + .++ T Consensus 1 ~~~--------~~~~~~~~~~~~~~~~~i~~---~~~~~~~~~~~~~~~YY~g~h~Il~r~~~~~~~~~~~~~d~~~~nn 69 (537) T protein:vir:78 1 MTS--------PLLNKPIDQLGGLLNTEITT---YMASNHIKWAHIGENYYNQENDIEKSRIFYMNDKGQLREDNYASNV 69 (537) T ss_pred CCc--------ccccccHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHhcccchhhhccccccccccccccccccccc Confidence 111 11122222222333333222 22 33445566788999976210 0000 0 1 111 Q ss_pred cccCHHHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEE Q lcl|NC_019423. 72 QVQPRLVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARI 151 (756) Q Consensus 72 ~~v~~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~ 151 (756) +++.+-....|+... .-|||.+.-+ .+... ......+.++..+. 
++-.+......+++.++|.+...+ T Consensus 70 ki~~nf~k~Ivd~~~----~yl~G~Pv~~--~~~d~----~~~e~~~~l~~~~~--~~~~~~~~el~~~~s~~G~ay~~~ 137 (537) T protein:vir:78 70 KISHGFFTELVDQLA----QYLLSNGVEV--KVKDE----DNTQLDEILQEYFD--EDFQATIDTLVTNASKKGFEGIFA 137 (537) T ss_pred ccccchHHHHHHHHh----hhhcccCcee--ecCcc----hhHHHHHHHHHHhh--ccHHHHHHHHHHHHhhcCeeEEEe Confidence 344444444444443 3447766444 33222 22233445555432 344455667889999999887766 Q ss_pred eeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeee Q lcl|NC_019423. 152 GWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEK 231 (756) Q Consensus 152 ~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~ 231 (756) ||+.. T Consensus 138 y~de~--------------------------------------------------------------------------- 142 (537) T protein:vir:78 138 RTTSE--------------------------------------------------------------------------- 142 (537) T ss_pred eecCC--------------------------------------------------------------------------- Confidence 55311 Q ss_pred eecCceeEEEechhheEe--CCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhcccccccc Q lcl|NC_019423. 232 ALVNRPTVEMLNPNNVVI--DPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQ 309 (756) Q Consensus 232 ~~~g~~~ie~V~p~~~~~--Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 309 (756) |.+++..|+|.++|+ |.. .. ...+++.......+.. T Consensus 143 ---~~~~~~~i~p~~~~pv~d~~--~~---~~~~~~~y~~~~~~~~---------------------------------- 180 (537) T protein:vir:78 143 ---GKLKFQTVDGLTLIPVFDDY--GV---LKMIIRWYSEIRYSTK---------------------------------- 180 (537) T ss_pred ---CceEEEEEccceeEEEEcCC--CC---ceeEEEEEeeeecccc---------------------------------- Confidence 236677788888643 422 11 1122222211100000 Q ss_pred ccccccceEEEEEEEEE-----eeccCCceeEE------------EEEEEEC-----CEEEEe--cccccCCCccceEEe Q lcl|NC_019423. 
310 FKDALRKKVVAYEYWGF-----YDINDDGSLEP------------IVATWIG-----STLIRM--ENNPFPDGKLPLVVV 365 (756) Q Consensus 310 ~~d~s~~~V~v~E~w~k-----~d~~~~g~~~~------------~~~~~~g-----~~~L~~--~~~P~~~~~~Pfv~~ 365 (756) +.....+..+|+|.. +...+.+.... -+++..+ +..-.. ...|.+.|++||+.+ T Consensus 181 --~~~~~~~~~~evyt~~~i~~y~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~f 258 (537) T protein:vir:78 181 --QQSTETIWHADVWNEEAVCYYIQDDEGVSTTYKLDEAYNPNPAPHVLAIEESTDADFEDTDGYQVLGRSYSKFPFQLL 258 (537) T ss_pred --ccCcceEEEEEEEcCCcEEEEEecCCcccccccccccccccccceeeeccccccccccccccccccccCCcceeEEEe Confidence 000011222222211 00011111000 0001000 000011 112233466676655 Q ss_pred eeeeecCcccCCchHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhhhhccccccccccccc-cccccc Q lcl|NC_019423. 366 PYMPRKRELFGEADAELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQGN-PSQSIM 444 (756) Q Consensus 366 ~~~~~~~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~-~~~~i~ 444 (756) .. +-+|.|.+..++++++.+|..+|.+.+.+...+++.+++.-..++...+........ ..+... ..+.+. T Consensus 259 ~n-----n~~~~sd~e~v~~LiDayd~~~S~~an~~~~~~~~ilvi~g~~~~~~~~~~~~l~~~---~~i~v~~d~~~v~ 330 (537) T protein:vir:78 259 YN-----NKDGMSDVKRVKSIIDDYDVMNCFLSNNLQDFSEAIYVVKGFSGDSTDKLRQNIKAK---KMIGVNGDNAGME 330 (537) T ss_pred cc-----CccCCCchhhhHHHHHHHHHHHHhhhhHHHHhcCceeeeecCCCccchhHHHHHhhc---CceeecCCCCcee Confidence 44 456889999999999999999999999999999887766532333222222211111 112222 234466 Q ss_pred cccCCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 
445 EHKFPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMN 524 (756) Q Consensus 445 ~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li 524 (756) ++..+.-.......+..+.+.+-..|.+.+......|+ .++.++..+...........-+-|..++++++++++.++ T Consensus 331 ~l~~~~~~~~~e~~ld~L~~~I~~~s~~~~~~~~~~gn---~SGvAlk~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~~ 407 (537) T protein:vir:78 331 IQTVSIPYEARKAKMDIDVENIYRSGMGFNSTAVGDGN---VTNVVIKSRYTLLAMKARKMETSLRKVLRWCADMVVSDI 407 (537) T ss_pred EEEecCCHHHHHHHHHHHHHHHHHhcCCCCCccccccC---CcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 66655555666777888888888877655543222222 222334444555555555555667777777777777765 Q ss_pred HhhCCCCcEEEEecCceeecCHhHhcCcceEEEecccccHHH--HHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCCh Q lcl|NC_019423. 525 AVFLSEKEVVRITNEQYVEIKREDLKGNFDIEVDINTAEIDN--QKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMP 602 (756) Q Consensus 525 ~q~~~~~r~iRI~g~~~v~i~~d~~~~~~Dv~V~~g~a~~~~--~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~ 602 (756) ..... .++ + -.+|.+.-....... ..++.+.. +... ..+..+ .++...++ T Consensus 408 ~~~~~---------~~~---d------~~~i~i~f~~~~P~n~~e~a~~~~~-l~~~-giiS~e-------T~l~~~p~- 459 (537) T protein:vir:78 408 ALRGL---------GEY---D------SNDICFEIEPHVLANELDIATTRKT-EAET-EALKIG-------NIMTVAPR- 459 (537) T ss_pred hhcCC---------ccc---c------cceeeEEeccCCCCCHHHHHHHHHH-HHhc-CcchHH-------HHHHhCCC- Confidence 43211 000 0 012333333222211 12221111 1111 111111 11111111 Q ss_pred hHHHHhhhccCCCChhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 603 DLAHELRTWQPQPDPMEEQLKQLAIQKAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARDMEKQKAQS 682 (756) Q Consensus 603 ~~~~~l~~~~~q~~p~~~~~~q~~~~~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~~~~~~q~ 682 (756) ..+++ +.+...+ +.. .. ..+..... 
.+++ .+..++ T Consensus 460 -----------vdd~e--------~ek~~~e--e~~-------~~------~~~~~~~~-~~~~---~~~~~~------- 494 (537) T protein:vir:78 460 -----------IGDDE--------TLKLIAE--ELD-------LD------YNELKDAL-AEQD---AQSLDV------- 494 (537) T ss_pred -----------CCCHH--------HHHHHHH--HHH-------hh------hhhhhhhh-hhhc---ccccCc------- Confidence 11110 0000000 000 00 00000000 0000 000000 Q ss_pred HHHHHHHHHHHHHHHhhccCCchhhhccCCCCCCCcccCchhcCCCCCCCCccccccccccCCCCCCCCCCC Q lcl|NC_019423. 683 QGNQNLQITKALTTPTKEGETTPNISAAVGYNTLTNGNSPQERDLAAQQDPAYSLGSQYYDPSQDPASALGM 754 (756) Q Consensus 683 ~~~~~~~~~~a~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 754 (756) .....+.+.+...=+...| .+|.=+.|..=--|+-+|.--|-- T Consensus 495 -----------------~~~~~~~~~~~~~~~~~~~------------~d~~~~~~~~~~~~~~~~~~~~~~ 537 (537) T protein:vir:78 495 -----------------SPDVQAMLDGLPVNANQPP------------VDPNQPVADPNVVPPTDPNAVPQT 537 (537) T ss_pred -----------------CcchhhhcCCCCCCCCCCC------------CCccCCCCCCCCCCCCCCccCCCC Confidence 0000011111100000001 111111111111122222111111 No 122 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=98.88 E-value=2.1e-08 Score=62.66 Aligned_cols=436 Identities=11% Similarity=0.010 Sum_probs=175.2 Q ss_pred ccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCC--CC--CCCCCC---cccCHHHHHHHHHHHHHHHH Q lcl|NC_019423. 19 LTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAK--PP--KIKGRS---QVQPRLVRRQAEWRYAPLSE 91 (756) Q Consensus 19 ~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~--~~--~~~grS---~~v~~~v~~~~e~~~~~L~~ 91 (756) +++.+-+.++..|... |+....+.++-.+||.|..... .. 
..+.|+ ++|.+-....|+.....| T Consensus 1 ~~~~t~~~~~~~l~~~-------~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l-- 71 (456) T protein:vir:79 1 MTASTPAEWLPVLTKR-------IDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRI-- 71 (456) T ss_pred CCCCCHHHHHHHHHHH-------HHHHHHHHHHHHHHHhccCChhhcCcccChhhchhhhhhhcchHHHHHHHHHhhh-- Confidence 6666666665544433 4445556677889999763211 00 112222 234455555555444333 Q ss_pred hhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeeeeeeeeeecCC Q lcl|NC_019423. 92 PFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIKTETPVFQLYP 171 (756) Q Consensus 92 ~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~ 171 (756) ++ +.+ ......|.+..+...+++ ..|+--.....+++++++.|.+.+.+ |. T Consensus 72 --~~-~g~---~~~~~~d~~~~~~~~~~~-----~~n~~d~~~~~~~~~a~~~G~a~~~~-~~----------------- 122 (456) T protein:vir:79 72 --IP-NGI---TVGGSADSDLALRARRIW-----RDNRMDSVCKQWVKYGLDFGESYLTC-WR----------------- 122 (456) T ss_pred --cc-CCe---ecCCCCCccHHHHHHHHH-----HhcChhHHHHHHHHHHhhcCeeEEEE-ee----------------- Confidence 22 222 222334444444433332 23433333446788888888875543 20 Q ss_pred CCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEEEechhheE--e Q lcl|NC_019423. 172 IENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVEMLNPNNVV--I 249 (756) Q Consensus 172 ~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie~V~p~~~~--~ 249 (756) ...|.+++..++|++++ | T Consensus 123 ------------------------------------------------------------~edg~~~i~~~~p~~~~~i~ 142 (456) T protein:vir:79 123 ------------------------------------------------------------RDDGTATITADSPETMVVSV 142 (456) T ss_pred ------------------------------------------------------------CCCCceEEEEeccceeEEEE Confidence 01144677888888864 4 Q ss_pred CCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceEEEEEEEEEeec Q lcl|NC_019423. 
250 DPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVAYEYWGFYDI 329 (756) Q Consensus 250 Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~E~w~k~d~ 329 (756) ||..... ....+ +.+.+.++ .. . ... .+.+. ..+.++.+|..... T Consensus 143 d~~~~~~---~~~~~-~~~~~~d~-----~~------------------~--~~~----~~~~~--~~~~~~~~~~~~~~ 187 (456) T protein:vir:79 143 DPLQPWR---IRSAM-RWWRDLDA-----ES------------------D--FAI----VWSGD--GWQKFARPCFVQSS 187 (456) T ss_pred cCCCCCc---eEEEE-EEEEecCC-----ce------------------e--EEE----EEcCC--ceEEEEEEEEeecc Confidence 5443221 11222 22211100 00 0 000 00000 01222222211110 Q ss_pred cCCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHHHHHHHHHHHHhhcCCce Q lcl|NC_019423. 330 NDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGATMRGMIDLLGRSANGQR 409 (756) Q Consensus 330 ~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~ 409 (756) ......+..++.......-|...+.+|++++ .+.+|.|.+..++++++.+|+.++.+...+...+.++. T Consensus 188 -----~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~------~N~~~~gd~e~v~~liD~~~~~~s~~~~~~~~~a~~~~ 256 (456) T protein:vir:79 188 -----SRRRLVTRISDSWVPVGDAVVTGSPPPVVVY------QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQR 256 (456) T ss_pred -----ccceeeeccCCceeecccccCCCCceeEEEe------cCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHHhhHHH Confidence 0111112222322222333334466777654 34678899999999999999999988877777666554 Q ss_pred EeeccccC---ccchhhh-----hccccccccccccccccccccccCCCc-chHHHHHHHHHHHHHHHHhchhHHhcCCC Q lcl|NC_019423. 410 GYPKGMLD---TLNRRRY-----DDGQDYEYNPMQGNPSQSIMEHKFPEL-PQSAIVMTQMQNQEAESLTGVKAFSGGVT 480 (756) Q Consensus 410 ~~~~gav~---~~~~~~~-----~~~~~~~~~~~~~~~~~~i~~~~~~~~-~~~~~~~l~~~~~~~e~~tGv~~~~~G~~ 480 (756) .+. |.-. ..+.... ..... ..+.+ +.........+.+.. ...+...+......+-..||++....|.. 
T Consensus 257 ~~~-G~~~~~~~~d~~g~~i~~~~~~~~-~~~~~-~~~~~~~~~~q~~~~~~~~~~~~l~~~i~~i~~~t~~p~~~~~~~ 333 (456) T protein:vir:79 257 ALK-SSEHRLPKVDENGNAIDYASIFEA-APGAL-WELPPGVDIWESQTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPD 333 (456) T ss_pred HHh-cCCcccccccccccccchhhhhhh-hcccc-ccCCCCcceeeecccChHHHHHHHHHHHHHHHhhcCCChhHhccc Confidence 432 2110 0010000 00000 00111 111122222232222 13344556666666667788888887743 Q ss_pred ccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHhcCcceEEEecc Q lcl|NC_019423. 481 GSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDLKGNFDIEVDIN 560 (756) Q Consensus 481 ~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~~~~~Dv~V~~g 560 (756) .+ |.++.++......-........+.|..++++++++++.+ ..... ...+.|.=. T Consensus 334 ~~--N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~----~g~~~-------------------~~~i~v~w~ 388 (456) T protein:vir:79 334 SA--NQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQI----EGESV-------------------EDTVDVSFE 388 (456) T ss_pred cc--CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----cCCCc-------------------cccceEEeC Confidence 22 223333444444444455555566777777666655443 22210 011222111 Q ss_pred cccH--HHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCChhHHHHhhhccCCCChhhhhHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 561 TAEI--DNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMPDLAHELRTWQPQPDPMEEQLKQLAIQKAQLENEELQ 638 (756) Q Consensus 561 ~a~~--~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~~~~~~l~~~~~q~~p~~~~~~q~~~~~aq~e~~~~q 638 (756) +... ..+.++....+.+ .| ++... .+++..|+. +... ++.++++.. .+.. T Consensus 389 ~~~~~s~~~~ada~~kl~~-~G--~~~~~------~~~~~lg~~--------------~~~i--~~~e~~r~~---~e~~ 440 (456) T protein:vir:79 389 SPDRVTLGEKYSAASLAKA-AG--ESWAS------IRRNILNYN--------------ADQI--KQDDLDRAR---EQIT 440 (456) T ss_pred CCCCcCHHHHHHHHHHHHh-cC--CChHH------HHHhcCCCC--------------HHHH--HHHHHHHHH---HHHH Confidence 1111 1111121122111 11 11111 111222321 1000 011111111 0111 Q ss_pred HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 
639 SKIALNNAKAKEAASSGDLKD 659 (756) Q Consensus 639 a~a~~~~a~a~~~~aq~~~~~ 659 (756) +.+...-+. .+.+.++ T Consensus 441 ~~~~~~~~~-----~~~~~~~ 456 (456) T protein:vir:79 441 LFAGNPVQR-----PQEDGSR 456 (456) T ss_pred HHhhhHhhc-----CCCCCCC Confidence 110000000 0000111 No 123 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=98.87 E-value=2.3e-08 Score=62.42 Aligned_cols=481 Identities=11% Similarity=0.016 Sum_probs=188.9 Q ss_pred CCcccCCCCCCCccccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCCCCCCcccCHHH-- Q lcl|NC_019423. 1 MEHQDTFKPLPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAKPPKIKGRSQVQPRLV-- 78 (756) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~grS~~v~~~v-- 78 (756) |-.-+|-.. .---.++.+++++. .. ++.....+.....+.++-.+||.|....+ ..+.-+++.. T Consensus 1 ~~~~~~~~~----~~~~~~~~l~~~e~-~~----i~~L~~~~~~~~~r~~~l~~YY~G~~~i~-----~~~~~~p~~~~~ 66 (504) T protein:vir:99 1 MTEETTSAS----KFTFRIPELNDDVV-DK----VNGLYQQLVDRTPRNLLRASFYDGKYAIR-----QIGNLIPPEYLR 66 (504) T ss_pred CCccCCccc----ccccccCCCCHHHH-HH----HHHHHHHHHHHhHHHHHHHHHHhccccch-----hccccccHHHHH Confidence 433333221 11122334555552 11 23333334444455667779999764211 1111222221 Q ss_pred -HHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcch-HHHHHHHHhhcCceEEEEeeeee Q lcl|NC_019423. 79 -RRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKL-VDDYVHSIVDDGTGIARIGWERK 156 (756) Q Consensus 79 -~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~-~~~~v~~al~~g~gi~k~~w~~~ 156 (756) +..++| ...+++.|..--.+--|. .+++.+... .+.-+ ...|+ +.. 
...+++++++.|.+++.||=+ + T Consensus 67 ~~~v~n~-~~~iVd~~a~rl~~~Gf~--~~d~~~~~~----~l~~i-~~~N~-ld~~~~~~~~~a~iyG~af~~v~~~-~ 136 (504) T protein:vir:99 67 TATVLGW-SAKAVDTLARRCNLESFV--WPDGDYGSI----GGPDV-WDENF-FATKANNAMVSSLIHGPAFLINTEG-G 136 (504) T ss_pred HhhccCc-HHHHHHHHHhhhccceee--CCCCChhhH----HHHHH-HHhcC-hhhHHHHHHHHHHhhCceeEEEecC-C Confidence 112333 112222222110111121 112222222 23222 22333 433 445777777777776665200 0 Q ss_pred eeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCc Q lcl|NC_019423. 157 TVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNR 236 (756) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~ 236 (756) ....+ T Consensus 137 ---------------------------------------------------------------------------d~~~~ 141 (504) T protein:vir:99 137 ---------------------------------------------------------------------------AGEPD 141 (504) T ss_pred ---------------------------------------------------------------------------CCCce Confidence 00123 Q ss_pred eeEEEechhheE--eCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccc Q lcl|NC_019423. 237 PTVEMLNPNNVV--IDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDAL 314 (756) Q Consensus 237 ~~ie~V~p~~~~--~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s 314 (756) +.|..++|.+++ |||.... +. ..+++...+. . T Consensus 142 ~~I~~~sP~~~~~iyD~~~~~-~~---~a~~~~~~d~------------------------------------------~ 175 (504) T protein:vir:99 142 SLIHVKSAMQATGEWNSRRNA-MD---SLLSITSRDA------------------------------------------E 175 (504) T ss_pred eEEEEeccceeEEEEeCCCCc-ee---EEEEEEEecC------------------------------------------C Confidence 568888999874 6765321 11 1111110000 0 Q ss_pred cceEEEEEEEEEeeccCCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchH-HHhHHHHHHHHHH Q lcl|NC_019423. 
315 RKKVVAYEYWGFYDINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADA-ELLGDNQAILGAT 393 (756) Q Consensus 315 ~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v-~~~~d~Q~~iN~~ 393 (756) + .....++|.. +.. +....-++.....+..|.+.| +|+|++...++.++.+|.|-+ +.++++++.+|+. T Consensus 176 g-~~~~~~~y~~------~~~--~~~~~~~~~~~~~~~~~~~~g-vPvV~~~n~~~~~~~~G~sei~~~v~~l~Da~~~~ 245 (504) T protein:vir:99 176 G-HPTGIALYED------GVT--VTADMDDDGDWHADVRTHKLG-VPVEVLPYKPREDRPLGSSRITRPVMSLQQRALKG 245 (504) T ss_pred C-eEEEEEEEcC------CcE--EEEEEcCCceeeeccccCCCC-cceEEecccccCccccCcccchhhHHHHHHHHHHH Confidence 0 0111222221 000 000111111111222333445 799999988888889998855 6899999999999 Q ss_pred HHHHHHHHHhhcCCceEeeccccC----ccchhh---hhcccccccccccccc------ccccccccCCCcc-hHHHHHH Q lcl|NC_019423. 394 MRGMIDLLGRSANGQRGYPKGMLD----TLNRRR---YDDGQDYEYNPMQGNP------SQSIMEHKFPELP-QSAIVMT 459 (756) Q Consensus 394 ~~~~~d~l~~~~~~~~~~~~gav~----~~~~~~---~~~~~~~~~~~~~~~~------~~~i~~~~~~~~~-~~~~~~l 459 (756) ++.+.......+.|+..+- |+-. ..+... ++..... ...+..+. ....+..+.+.-. +.+...+ T Consensus 246 ~~~~~~~~e~~a~p~r~i~-G~~~~~~~~~d~~~~~~~~~~~~~-i~~~~~~~~~~~~~~~~~~~~q~~~~~l~~~~~~l 323 (504) T protein:vir:99 246 CIRMDGHADVYSFPQLILL-GADAKNFRNKDGSMKPAWQIALAR-VFALPDDEDEPDAARARADVKQFPASSPQPHIEML 323 (504) T ss_pred HHHHHHHHHHhcchhhhhc-cCCccccccccccccchhhhhhhh-hhcCCCccccccccCccceeeecCCCChHHHHHHH Confidence 9999988888887764431 2211 001000 0100000 00000000 1122333333221 1233344 Q ss_pred HHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecC Q lcl|NC_019423. 460 QMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNE 539 (756) Q Consensus 460 ~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~ 539 (756) ..+...+-..|+++....|..+.+.+.+|.++......-........+.|..+++.++++++.+....-... 
T Consensus 324 ~~~i~~~a~~t~~P~~~lG~~~~~n~sSa~Ai~~~~~~L~~ka~~k~~~f~~~l~~~~rla~~~~~~~~~~~-------- 395 (504) T protein:vir:99 324 EQIAMMFSGETSIPVESLGFSNRANPTSADAYIASREDLIAEAEGATDDWSPAFRRSMIRALAIKNGLDRIP-------- 395 (504) T ss_pred HHHHHHHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccc-------- Confidence 444555555589999999976554344555565555455555666667788888888888776654321100 Q ss_pred ceeecCHhHhcCcceEEEecccccH-H-HHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCChhHHHHhhhccCCCCh Q lcl|NC_019423. 540 QYVEIKREDLKGNFDIEVDINTAEI-D-NQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMPDLAHELRTWQPQPDP 617 (756) Q Consensus 540 ~~v~i~~d~~~~~~Dv~V~~g~a~~-~-~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~~~~~~l~~~~~q~~p 617 (756) +. .+++.|.=..... + .+.+..+..+.+.....+.+. . .+++..|+. + T Consensus 396 -------~~---~~~~~v~w~d~~~~s~a~~aDa~~Kl~~ag~~l~~~~---~---~l~~~lg~~--------------~ 445 (504) T protein:vir:99 396 -------PE---WKTIDSKFRSPLYLSKAAQADAGAKMLGAGPEWLKET---E---VGLELLGLT--------------P 445 (504) T ss_pred -------cc---cccceeEecCCCccCHHHHHHHHHHHHhhccccccch---H---HHHhhcCCC--------------H Confidence 00 0112221111111 1 111111222222111111110 0 112222321 0 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 618 MEEQLKQLAIQKAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARDMEKQKAQSQGNQNLQITKALTTP 697 (756) Q Consensus 618 ~~~~~~q~~~~~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~~~~~~q~~~~~~~~~~~a~~~~ 697 (756) . +.+..+.+ . ++.+++ ..+.++.. T Consensus 446 ~-----ei~r~~~e---------~-------~~~~~~----------------------------------~~~~~l~~- 469 (504) T protein:vir:99 446 Q-----QAKRALAE---------R-------RRASSV----------------------------------SIIEALNR- 469 (504) T ss_pred H-----HHHHHHHH---------H-------HHHhhH----------------------------------HHHHHHhc- Confidence 0 00000000 0 000000 00000100 Q ss_pred hhccCCchhhhccCCCCCCCcccCchhcCCCCCCCCccccccccccCCCCC Q lcl|NC_019423. 
698 TKEGETTPNISAAVGYNTLTNGNSPQERDLAAQQDPAYSLGSQYYDPSQDP 748 (756) Q Consensus 698 ~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 748 (756) +. +..+.+. .....| +..+|+...++....|..++ T Consensus 470 ~~-----~~~~~~~-----~~~~~~------~~e~a~~~~~~~~~~p~~~~ 504 (504) T protein:vir:99 470 RQ-----QEAATAG-----EDQDQG------AGEPPANEPPAALGRPTLVG 504 (504) T ss_pred cc-----CCCCCCC-----CCCCcC------CCCCCCCCCCccCCCcccCC Confidence 00 0000100 000001 12222333334455565554 No 124 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=98.77 E-value=5.6e-08 Score=60.36 Aligned_cols=434 Identities=12% Similarity=0.023 Sum_probs=177.9 Q ss_pred ccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCC--CCC--CCCCC---cccCHHHHHHHHHHHHHHHH Q lcl|NC_019423. 19 LTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAK--PPK--IKGRS---QVQPRLVRRQAEWRYAPLSE 91 (756) Q Consensus 19 ~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~--~~~--~~grS---~~v~~~v~~~~e~~~~~L~~ 91 (756) ||+-+.++++..|.. .|+....+.++-.+||.|..... ++. .+.|+ ++|.+-.+..|+.....| T Consensus 1 ~~~~t~~~~~~~l~~-------~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l-- 71 (456) T protein:vir:10 1 MTASTPAEWLPVLTK-------RIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRI-- 71 (456) T ss_pred CCCCCHHHHHHHHHH-------HHHHHHHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhh-- Confidence 666666666655533 34455666678889999864211 111 12333 366666666666555544 Q ss_pred hhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHH-HHHHHHhhcCceEEEEeeeeeeeeeeeeeeeeecC Q lcl|NC_019423. 92 PFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVD-DYVHSIVDDGTGIARIGWERKTVKIKTETPVFQLY 170 (756) Q Consensus 92 ~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~-~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~ 170 (756) + ++.+. + + ...|.+........+ ..| .++... .+++++++.|.+.+.+ |. 
T Consensus 72 --~-~~~~~-~-~-~~~d~~~~~~~~~i~-----~~N-~~d~~~~~~~~~a~i~G~ay~~v-~~---------------- 122 (456) T protein:vir:10 72 --I-PNGIT-V-G-GSADSDLALRARRIW-----RDN-RMDSVCKQWVKYGLDFGESYLTC-WR---------------- 122 (456) T ss_pred --c-cCCee-c-C-CCCCcchHHHHHHHH-----Hhc-ChhhHHHHHHHHHhhcCeeEEEE-ee---------------- Confidence 1 22222 1 1 223333333333322 233 344433 5678888888875543 21 Q ss_pred CCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEEEechhheE-- Q lcl|NC_019423. 171 PIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVEMLNPNNVV-- 248 (756) Q Consensus 171 ~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie~V~p~~~~-- 248 (756) ...|.++|..++|.+++ T Consensus 123 -------------------------------------------------------------d~~g~~~i~~~~p~~~~~i 141 (456) T protein:vir:10 123 -------------------------------------------------------------RDDGTATITADSPETMVVS 141 (456) T ss_pred -------------------------------------------------------------CCCCceEEEEEccceeEEE Confidence 01144678888898854 Q ss_pred eCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceEEEEE-EEEEe Q lcl|NC_019423. 249 IDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVAYE-YWGFY 327 (756) Q Consensus 249 ~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~E-~w~k~ 327 (756) |||..... ..+++ +++.+.++ . . . +.. ....+. -+..+. +|+.. T Consensus 142 ~d~~~~~~---~~~~i-~~~~~~d~-----~----------~--------~--~~~---~~~~~~---~~~~~~~~~~~~ 186 (456) T protein:vir:10 142 VDPLQPWR---IRAAM-RWWRDLDA-----E----------S--------D--FAI---VWSGDG---WQKFARPCFVQS 186 (456) T ss_pred EcCCCCcc---eEEEE-EEEEecCC-----c----------e--------e--EEE---EEeccc---eeEEEEEEEEee Confidence 46544321 22222 22221100 0 0 0 000 000000 011111 11111 Q ss_pred eccCCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_019423. 
328 DINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGATMRGMIDLLGRSANG 407 (756) Q Consensus 328 d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~ 407 (756) +. ......+.++........|...+..|++++ .+.+|.|.+..++++++.+|+.++.++......+.+ T Consensus 187 ~~------~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~------~N~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~ 254 (456) T protein:vir:10 187 SS------RRRLVTRISDSWVPVGDAVVTGSPPPVVVY------QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFR 254 (456) T ss_pred cc------cceeeeecCCceeeccccCCCCCceeEEEe------cCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhH Confidence 10 111223333433333333333455666544 345789999999999999999999888777666665 Q ss_pred ceEeecccc---Cccchhh--h---hccccccccccccccccccccccCCCc-chHHHHHHHHHHHHHHHHhchhHHhcC Q lcl|NC_019423. 408 QRGYPKGML---DTLNRRR--Y---DDGQDYEYNPMQGNPSQSIMEHKFPEL-PQSAIVMTQMQNQEAESLTGVKAFSGG 478 (756) Q Consensus 408 ~~~~~~gav---~~~~~~~--~---~~~~~~~~~~~~~~~~~~i~~~~~~~~-~~~~~~~l~~~~~~~e~~tGv~~~~~G 478 (756) +..+. |.- ...+... . ...... .+. .+......+..+.+.. ...+...+......+-..||++....| T Consensus 255 ~~~i~-G~~~~~~~~d~~g~~~~~~~~~~~~-~~~-~~~~~~~~~~~q~~~~~~~~~~~~l~~~i~~~~~~s~~p~~~~~ 331 (456) T protein:vir:10 255 QRALK-STEHGLPNVDENGNAIDYASIFEAA-PGA-LWELPPGVDIWESQANDFTPMLSAIKEHIRQLSSATKTPLPMLM 331 (456) T ss_pred hHhhh-ccCcccccccccccccchhhhhhhh-ccc-cccCCCCcceEEecccChhHHHHHHHHHHHHHHhccCCChHHhc Confidence 54331 211 0000000 0 000000 010 1111122222233222 133444455555556666788888877 Q ss_pred CCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHhcCcceEEEe Q lcl|NC_019423. 479 VTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDLKGNFDIEVD 558 (756) Q Consensus 479 ~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~~~~~Dv~V~ 558 (756) ...+ |.+|.++......-........+.|..+++.++++++.+- ... + ..++.|. 
T Consensus 332 ~~~~--N~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~----g~~--------~-----------~~~~~v~ 386 (456) T protein:vir:10 332 PDSA--NQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIE----GES--------V-----------EDTVDVS 386 (456) T ss_pred cccc--ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc----CCC--------c-----------ccceeEE Confidence 4322 2344445555555555566666777778877777665431 111 0 1122222 Q ss_pred cccccH--HHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCChhHHHHhhhccCCCChhhhhHHHHHHHHHHHHHHH Q lcl|NC_019423. 559 INTAEI--DNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMPDLAHELRTWQPQPDPMEEQLKQLAIQKAQLENEE 636 (756) Q Consensus 559 ~g~a~~--~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~~~~~~l~~~~~q~~p~~~~~~q~~~~~aq~e~~~ 636 (756) =..... ..+.++.+..+.+. -++... .+.+..|+. +... ++.++++...+ T Consensus 387 w~~~~~~~~~~~ada~~kl~~~---gi~~~~------~~~~~lg~~--------------~~~i--~~~e~er~~~e--- 438 (456) T protein:vir:10 387 FESPDRVTLGEKYSAASLAKAA---GESWAS------IRRNILNYN--------------ADQI--KQDDLDRAREQ--- 438 (456) T ss_pred ecCCCCcCHHHHHHHHHHHHHc---CCChHH------HHHhhCCCC--------------HHHH--HHHHHHHHHHH--- Confidence 111111 11122222222111 111111 111222321 0000 01111110000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 637 LQSKIALNNAKAKEAASSGDLKDLDYLEQESGTK 670 (756) Q Consensus 637 ~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k 670 (756) .. +.+.. +.+. .+.-+.+ T Consensus 439 ~~-------~~~~~------~~~~---~~~~~~~ 456 (456) T protein:vir:10 439 IT-------LFAGN------PVQR---PQEDGSR 456 (456) T ss_pred HH-------HHhhh------hhhc---CCCCCCC Confidence 00 00000 0000 0000000 No 125 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=98.77 E-value=5.6e-08 Score=60.36 Aligned_cols=434 Identities=12% Similarity=0.023 Sum_probs=177.9 Q ss_pred ccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCC--CCC--CCCCC---cccCHHHHHHHHHHHHHHHH Q lcl|NC_019423. 
19 LTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAK--PPK--IKGRS---QVQPRLVRRQAEWRYAPLSE 91 (756) Q Consensus 19 ~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~--~~~--~~grS---~~v~~~v~~~~e~~~~~L~~ 91 (756) ||+-+.++++..|.. .|+....+.++-.+||.|..... ++. .+.|+ ++|.+-.+..|+.....| T Consensus 1 ~~~~t~~~~~~~l~~-------~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l-- 71 (456) T protein:vir:10 1 MTASTPAEWLPVLTK-------RIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRI-- 71 (456) T ss_pred CCCCCHHHHHHHHHH-------HHHHHHHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhh-- Confidence 666666666655533 34455666678889999864211 111 12333 366666666666555544 Q ss_pred hhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHH-HHHHHHhhcCceEEEEeeeeeeeeeeeeeeeeecC Q lcl|NC_019423. 92 PFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVD-DYVHSIVDDGTGIARIGWERKTVKIKTETPVFQLY 170 (756) Q Consensus 92 ~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~-~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~ 170 (756) + ++.+. + + ...|.+........+ ..| .++... .+++++++.|.+.+.+ |. T Consensus 72 --~-~~~~~-~-~-~~~d~~~~~~~~~i~-----~~N-~~d~~~~~~~~~a~i~G~ay~~v-~~---------------- 122 (456) T protein:vir:10 72 --I-PNGIT-V-G-GSADSDLALRARRIW-----RDN-RMDSVCKQWVKYGLDFGESYLTC-WR---------------- 122 (456) T ss_pred --c-cCCee-c-C-CCCCcchHHHHHHHH-----Hhc-ChhhHHHHHHHHHhhcCeeEEEE-ee---------------- Confidence 1 22222 1 1 223333333333322 233 344433 5678888888875543 21 Q ss_pred CCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEEEechhheE-- Q lcl|NC_019423. 
171 PIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVEMLNPNNVV-- 248 (756) Q Consensus 171 ~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie~V~p~~~~-- 248 (756) ...|.++|..++|.+++ T Consensus 123 -------------------------------------------------------------d~~g~~~i~~~~p~~~~~i 141 (456) T protein:vir:10 123 -------------------------------------------------------------RDDGTATITADSPETMVVS 141 (456) T ss_pred -------------------------------------------------------------CCCCceEEEEEccceeEEE Confidence 01144678888898854 Q ss_pred eCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceEEEEE-EEEEe Q lcl|NC_019423. 249 IDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVAYE-YWGFY 327 (756) Q Consensus 249 ~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~E-~w~k~ 327 (756) |||..... ..+++ +++.+.++ . . . +.. ....+. -+..+. +|+.. T Consensus 142 ~d~~~~~~---~~~~i-~~~~~~d~-----~----------~--------~--~~~---~~~~~~---~~~~~~~~~~~~ 186 (456) T protein:vir:10 142 VDPLQPWR---IRAAM-RWWRDLDA-----E----------S--------D--FAI---VWSGDG---WQKFARPCFVQS 186 (456) T ss_pred EcCCCCcc---eEEEE-EEEEecCC-----c----------e--------e--EEE---EEeccc---eeEEEEEEEEee Confidence 46544321 22222 22221100 0 0 0 000 000000 011111 11111 Q ss_pred eccCCceeEEEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_019423. 328 DINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGATMRGMIDLLGRSANG 407 (756) Q Consensus 328 d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~ 407 (756) +. 
......+.++........|...+..|++++ .+.+|.|.+..++++++.+|+.++.++......+.+ T Consensus 187 ~~------~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~------~N~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~ 254 (456) T protein:vir:10 187 SS------RRRLVTRISDSWVPVGDAVVTGSPPPVVVY------QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFR 254 (456) T ss_pred cc------cceeeeecCCceeeccccCCCCCceeEEEe------cCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhH Confidence 10 111223333433333333333455666544 345789999999999999999999888777666665 Q ss_pred ceEeecccc---Cccchhh--h---hccccccccccccccccccccccCCCc-chHHHHHHHHHHHHHHHHhchhHHhcC Q lcl|NC_019423. 408 QRGYPKGML---DTLNRRR--Y---DDGQDYEYNPMQGNPSQSIMEHKFPEL-PQSAIVMTQMQNQEAESLTGVKAFSGG 478 (756) Q Consensus 408 ~~~~~~gav---~~~~~~~--~---~~~~~~~~~~~~~~~~~~i~~~~~~~~-~~~~~~~l~~~~~~~e~~tGv~~~~~G 478 (756) +..+. |.- ...+... . ...... .+. .+......+..+.+.. ...+...+......+-..||++....| T Consensus 255 ~~~i~-G~~~~~~~~d~~g~~~~~~~~~~~~-~~~-~~~~~~~~~~~q~~~~~~~~~~~~l~~~i~~~~~~s~~p~~~~~ 331 (456) T protein:vir:10 255 QRALK-STEHGLPNVDENGNAIDYASIFEAA-PGA-LWELPPGVDIWESQANDFTPMLSAIKEHIRQLSSATKTPLPMLM 331 (456) T ss_pred hHhhh-ccCcccccccccccccchhhhhhhh-ccc-cccCCCCcceEEecccChhHHHHHHHHHHHHHHhccCCChHHhc Confidence 54331 211 0000000 0 000000 010 1111122222233222 133444455555556666788888877 Q ss_pred CCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHhcCcceEEEe Q lcl|NC_019423. 479 VTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDLKGNFDIEVD 558 (756) Q Consensus 479 ~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~~~~~Dv~V~ 558 (756) ...+ |.+|.++......-........+.|..+++.++++++.+- ... + ..++.|. 
T Consensus 332 ~~~~--N~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~----g~~--------~-----------~~~~~v~ 386 (456) T protein:vir:10 332 PDSA--NQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIE----GES--------V-----------EDTVDVS 386 (456) T ss_pred cccc--ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc----CCC--------c-----------ccceeEE Confidence 4322 2344445555555555566666777778877777665431 111 0 1122222 Q ss_pred cccccH--HHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCChhHHHHhhhccCCCChhhhhHHHHHHHHHHHHHHH Q lcl|NC_019423. 559 INTAEI--DNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMPDLAHELRTWQPQPDPMEEQLKQLAIQKAQLENEE 636 (756) Q Consensus 559 ~g~a~~--~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~~~~~~l~~~~~q~~p~~~~~~q~~~~~aq~e~~~ 636 (756) =..... ..+.++.+..+.+. -++... .+.+..|+. +... ++.++++...+ T Consensus 387 w~~~~~~~~~~~ada~~kl~~~---gi~~~~------~~~~~lg~~--------------~~~i--~~~e~er~~~e--- 438 (456) T protein:vir:10 387 FESPDRVTLGEKYSAASLAKAA---GESWAS------IRRNILNYN--------------ADQI--KQDDLDRAREQ--- 438 (456) T ss_pred ecCCCCcCHHHHHHHHHHHHHc---CCChHH------HHHhhCCCC--------------HHHH--HHHHHHHHHHH--- Confidence 111111 11122222222111 111111 111222321 0000 01111110000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 637 LQSKIALNNAKAKEAASSGDLKDLDYLEQESGTK 670 (756) Q Consensus 637 ~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k 670 (756) .. +.+.. +.+. .+.-+.+ T Consensus 439 ~~-------~~~~~------~~~~---~~~~~~~ 456 (456) T protein:vir:10 439 IT-------LFAGN------PVQR---PQEDGSR 456 (456) T ss_pred HH-------HHhhh------hhhc---CCCCCCC Confidence 00 00000 0000 0000000 No 126 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=98.70 E-value=9.9e-08 Score=58.98 Aligned_cols=420 Identities=11% Similarity=0.024 Sum_probs=166.8 Q ss_pred EecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeeeeeeeeeecCCCCCHHHHHHH Q lcl|NC_019423. 
102 LTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIKTETPVFQLYPIENQEQADVL 181 (756) Q Consensus 102 ~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 181 (756) |.|.+-.++-++.....++||- -.+++.++..+. ++||. + -+....+ .+ T Consensus 1 ~l~~~~~~~~~~~~~~~v~n~~-------~~ivd~~~~~l~--~~gf~---~----------------~d~~~~~---~~ 49 (434) T protein:vir:98 1 MLPKNAEQAFLDFQRKARTNFC-------GLIANASVHRLL--ALGVT---G----------------PDGEPDT---RA 49 (434) T ss_pred CCCCCccHHHHHhhhhhhccch-------HHHHHHHHhhhc--cCcee---c----------------CCCchHH---HH Confidence 8888887777765443344442 233443443322 33421 0 0111111 11 Q ss_pred HHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEEEechhhe--EeCCCCcCcccc Q lcl|NC_019423. 182 QQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVEMLNPNNV--VIDPSCNGDLDK 259 (756) Q Consensus 182 ~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie~V~p~~~--~~Dp~a~~d~~d 259 (756) .+.. . .+.+.........+...+|.++..+..+... . .......+.|..++|.++ +||+... . T Consensus 50 ~~i~---~------~N~~d~~~~~~~~~a~i~G~ay~~v~~~~~~-~-~~~~~~~~~I~~~~p~~~~~i~D~~~~----~ 114 (434) T protein:vir:98 50 SRWW---Q------ANRLDSRQKLVWRMAMAQSAGYMLVGAHPTR-T-EDNGRPSPLITMEHPSECIVEYDPETG----E 114 (434) T ss_pred HHHH---H------hcChhHHHHHHHHHHhhcCceEEEEecCCCc-c-cccCCceeEEEEeccceeEEEEeCCCC----c Confidence 1111 0 1223334455566677888877655432111 0 112234678999999985 4565432 1 Q ss_pred CceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceEEEE---EEEEEeeccCCceeE Q lcl|NC_019423. 260 ALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVAY---EYWGFYDINDDGSLE 336 (756) Q Consensus 260 a~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~---E~w~k~d~~~~g~~~ 336 (756) ..+.+++...+.. + . ....+.++ .+|+..... .+... 
T Consensus 115 ~~~ai~~~~~~~~------~--------------------~-------------~~~~~~~~~~~~~~~~~~~~-~~~~~ 154 (434) T protein:vir:98 115 PLVGLKVWHNDID------G--------------------F-------------GYARVFFDDTSFPYRTRERT-GARLP 154 (434) T ss_pred eEEEEEEEEeccC------C--------------------c-------------eEEEEEEeCcEEEEEEeecc-ccccc Confidence 2233333221110 0 0 00001110 011111100 00000 Q ss_pred EEEEEEECCEEEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEeecccc Q lcl|NC_019423. 337 PIVATWIGSTLIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGML 416 (756) Q Consensus 337 ~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav 416 (756) ..-..+.....+ ....|-+.|.+|++++...+..+. +|.|.++.+++.++.+|+.++.+.......+.|+..+. |.- T Consensus 155 ~~~~~~~~~~~~-~~~~~h~~g~vPvv~f~N~~~~~~-~g~sd~e~vi~liDa~~~~~s~~~~~~~~~a~p~~~i~-G~~ 231 (434) T protein:vir:98 155 WGPDSWVYTGTA-DSGDVHDLGGMQLVEFARMPDLGE-DPEPEFAGVLDIQDRVNLGILNRMAASRFSGFRQKWIK-GHK 231 (434) T ss_pred cccccceecccc-cccccCCCCccceEEeccCCCcCc-CCcchhhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhc-CCC Confidence 000011111111 011223447889999887776655 69999999999999999999999999888888765543 211 Q ss_pred Cc--cchhhhhc--ccc--ccccccccccccccccccCCCcc-hHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHH Q lcl|NC_019423. 417 DT--LNRRRYDD--GQD--YEYNPMQGNPSQSIMEHKFPELP-QSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAA 489 (756) Q Consensus 417 ~~--~~~~~~~~--~~~--~~~~~~~~~~~~~i~~~~~~~~~-~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~ 489 (756) .. .+...... ... ...+.+-..++...+..+.+... ..+...+......+-.+|++++...|... .+.++. 
T Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~q~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~~~~~--~n~Sg~ 309 (434) T protein:vir:98 232 FAKRTDPATGMTVVDQPFVPSPSAVWASEGENTQFGQLDATDLSGFLKEHASDVRDMLTISQTPTYLYATDL--VNISAD 309 (434) T ss_pred cccccccccccchhhhhhhccccccccCCCCCceEEEecCcchHHHHHHHHHHHHHHhcccCCCHHHhcccc--CChHHH Confidence 10 00000000 000 00111111122334444433321 22333344444445555777777777321 223444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHhcCcceEEEecccccH--HHH Q lcl|NC_019423. 490 GIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDLKGNFDIEVDINTAEI--DNQ 567 (756) Q Consensus 490 ~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~~~~~Dv~V~~g~a~~--~~~ 567 (756) ++......-........+.|..++++++++++.+. |.. .+ .+++.|.=..... ... T Consensus 310 Al~~~~~~l~~k~~~k~~~f~~~l~~~~rl~~~~~-------------g~~-----~~----~~~~~v~w~~~~~~s~~~ 367 (434) T protein:vir:98 310 TIGALDILHVAKVREHIASFSEGLESVLALAAAQA-------------GVP-----ED----YTEAEVRWANPAHVTMAV 367 (434) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-------------CCC-----hh----heeeeEEecCCCCCCHHH Confidence 45544444455555566777778877777665442 110 01 1123332222211 112 Q ss_pred HHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCChhHHHHhhhccCCCChhhhhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 568 KSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMPDLAHELRTWQPQPDPMEEQLKQLAIQKAQLENEELQSKIALNNAK 647 (756) Q Consensus 568 ~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~~~~~~l~~~~~q~~p~~~~~~q~~~~~aq~e~~~~qa~a~~~~a~ 647 (756) .++.+..+.+ .+ ++... +++..|+.. ..+ ..+++.+ +.+.+.+. ....+ T Consensus 368 ~ada~~kl~~-~g--~~~e~-------~~~~lg~~~--~e~--------------~r~~~e~---~~~~~~~~--~~~~~ 416 (434) T protein:vir:98 368 KADAATKLKS-IG--YPLDV-------IAEELDESP--ARV--------------RRIVAGA---ASQALLAA--SLLPA 416 (434) T ss_pred HHHHHHHHHh-cC--CcHHH-------HHHhCCCCH--HHH--------------HHHHHHH---HHHHHHHH--hhhcc Confidence 2222222222 11 23221 223333321 000 0011000 00000000 00000 Q ss_pred HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 
648 AKEAASSGDLKDLDYLEQESG 668 (756) Q Consensus 648 a~~~~aq~~~~~~~~~~q~~~ 668 (756) +....+..+ .-+ ...-.+ T Consensus 417 ~~~~~~g~~--~~~-~~~~dg 434 (434) T protein:vir:98 417 PGAPSAGNV--PDS-GGAVDG 434 (434) T ss_pred CCCCCCCCC--Ccc-cCCCCC Confidence 000000000 000 000000 No 127 >protein:vir:105520 Length: 706 # NCBI annotation: phage portal protein # Family: family:all:487 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516189;genbank:gi:89885992;genbank:GeneID:3964380 Probab=98.63 E-value=1.7e-07 Score=57.64 Aligned_cols=603 Identities=10% Similarity=0.014 Sum_probs=206.5 Q ss_pred cCHHHHHHHHHHHHHHHHhhcCC-------------CCEEEE-ecCCcchHHHHHHHHHHHHHHHhhhcCCc-chHHHHH Q lcl|NC_019423. 74 QPRLVRRQAEWRYAPLSEPFLSS-------------SKLFKL-TPVTFEDELAARQNELVLNYQFRTQLNKV-KLVDDYV 138 (756) Q Consensus 74 v~~~v~~~~e~~~~~L~~~f~~~-------------~~~~~~-~p~~~~D~~~A~q~t~~~n~~~~~~~~~~-~~~~~~v 138 (756) -+-+-++ ++-.|++-|.-- |.-|.+ -.-.|.++..+.... ..+..|. .+.+|.| T Consensus 1 m~e~~~~----~~~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~l~~-------~~q~~grP~~~~N~i 69 (706) T protein:vir:10 1 MAESRQK----QHERVMLRFDRAWSPQQVVREKCIEATRFVRVPGGQWEGATVAGTKL-------DEQFEKYPKFEINKV 69 (706) T ss_pred CCcchHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccCCHHHHHHHHh-------hhhhcCCCceEecch Confidence 2222222 222333333110 101111 234455443332100 1122233 3455666 Q ss_pred HHHhhcCceEEEEeeeeeeeeeeeeeeeeec-CCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcc Q lcl|NC_019423. 139 HSIVDDGTGIARIGWERKTVKIKTETPVFQL-YPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEAT 217 (756) Q Consensus 139 ~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~ 217 (756) +-.+..-.| +...++.. ..+.. 
.+..+.++++.+...+..+.+.- ........++.+.+.+|.|+ T Consensus 70 ~~~v~~v~g-----~~~~nr~~----~~v~P~~~~~d~~~Ae~l~~l~~~~~~~~-----~~~~a~s~Af~d~i~~G~G~ 135 (706) T protein:vir:10 70 ATELNRIIS-----EYRNNRIS----VKFRPGDNAASEELANKLNGLFRADYEET-----DGGEACDNAFDDAATGGFGC 135 (706) T ss_pred HHHHHHHhh-----HHHhCCCc----eEEecCCCCchHHHHHHHHHHHHHHHHhc-----CchHHHHHHHHHHhhcCcce Confidence 666666555 32233222 12222 34556778888888877764432 23334456677777777554 Q ss_pred eeccCceeEEEeeeeecCce-eEEEechhheEeCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhh Q lcl|NC_019423. 218 YAIQTGVTEVEVEKALVNRP-TVEMLNPNNVVIDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPIT 296 (756) Q Consensus 218 ~~~~~g~~~~~~~~~~~g~~-~ie~V~p~~~~~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~ 296 (756) . ++..+.....+| .++....-+.++||.- ..+-|. ..+..++++.+... ..+.++.......-... T Consensus 136 ~-------ev~~d~~~~~d~~~~~~~i~i~~v~~p~~-~v~~Dp----~a~~~D~sDar~~~-~~~~~~~d~~~~~fp~~ 202 (706) T protein:vir:10 136 F-------RLTTSFVNEYDPMDERQRIAVEPIYDPAR-SVWFDP----DAKKYDKSDALWAF-CMYSVSLEKYQSEYDKA 202 (706) T ss_pred E-------EeeeccccccCCCCCCccceeeeeccchh-ceecCc----hhcccChhhcceEe-eeecCCHHHHHHhcCCC Confidence 3 333332211111 1111001111222220 111111 12233444433221 11111110000000000 Q ss_pred chhhhcccccccccc-c-cccceEEEEEEEEEeeccCCceeEEEEEEEECCEEEEeccc-------------------c- Q lcl|NC_019423. 297 DPDHESKTPSDFQFK-D-ALRKKVVAYEYWGFYDINDDGSLEPIVATWIGSTLIRMENN-------------------P- 354 (756) Q Consensus 297 ~~~~~~~~~~~~~~~-d-~s~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g~~~L~~~~~-------------------P- 354 (756) ..+ .....+.++. + ...+.|++.|||.+....- ...+.+. -..++........ 
+ T Consensus 203 ~~~--~~~~~~~~~~~d~~~~d~~~~~eyy~~~~~~~-~~~~~~~-~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~ 278 (706) T protein:vir:10 203 PTS--LDRVGSVSWQYDWFTPDVVYIAKYYEVRKESV-DVISYRQ-PLTQEIATYDSEQIADIQDELEQAGFEEIGRRSV 278 (706) T ss_pred hhh--hhhhccccccccccCCCcceecccccccceeE-EEEEeec-cccCCceeeccchhhhhHHHHhhCCchhhhhccc Confidence 000 0000111111 1 1234688899887542210 0011111 0111111111100 0 Q ss_pred -----cC---------CC--ccceEEeeeeeecCcc---cCCchHH-HhHHHHHHHHHHHHHHHHHHHhhcCCceEeecc Q lcl|NC_019423. 355 -----FP---------DG--KLPLVVVPYMPRKREL---FGEADAE-LLGDNQAILGATMRGMIDLLGRSANGQRGYPKG 414 (756) Q Consensus 355 -----~~---------~~--~~Pfv~~~~~~~~~~~---~G~g~v~-~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~g 414 (756) |+ .+ -||.-.|++.|.-+.. .|.+... .++++-+.-..+...+...+.+.+..+...+.+ T Consensus 279 ~~~~v~~~~~~g~~~l~~~~p~~~~~~P~vP~~g~r~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~ 358 (706) T protein:vir:10 279 KRRRIYVAVVDGDGFLEKPRRIPGEHIPLIPVYGKRWFIDDVERVEGHIAKAMDPQRLYNLQVSMLADAAAQDPGQTPIV 358 (706) T ss_pred ceeeEEEEeeccccccccCCCCCCCccceEEEeeccccccccCcccceeccchhhHHHHHHHHHHHHHHHHhcCCccccc Confidence 00 01 1222333444332221 1222222 234566666666667777777788888889999 Q ss_pred ccCccchhhhhcccccccccccc--cc----cccc-ccccCCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchh Q lcl|NC_019423. 415 MLDTLNRRRYDDGQDYEYNPMQG--NP----SQSI-MEHKFPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDV 487 (756) Q Consensus 415 av~~~~~~~~~~~~~~~~~~~~~--~~----~~~i-~~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~t 487 (756) ++++.+................. .+ .+.+ .+.+.+... ....+-+...++++.....-...+|++..++|.. 
T Consensus 359 ~~~~i~~~~~~~~~~~~~~~~~l~~~~~~~~~g~i~~~~~~~~~~-~~~~~~~~~~~l~~~~~~~i~~vsGi~~~~lG~~ 437 (706) T protein:vir:10 359 DMEQIRGLEQHWEGRNRKRPAFLPLRTVTDKTGNVVAPANVAGYT-QAPVLNQALAALLQQTSADIQEVTGSSQAMQQMP 437 (706) T ss_pred chhHHHHHHHHhhhcccccccchhcccccCCCCcccccccccccC-CCcchHHHHHHHHHHHHHHHHHHhCCCHHHcCCc Confidence 88766665444433222222111 11 1112 111111110 0112223333334443334455679988888765 Q ss_pred HHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHhcCcceEEEeccc---cc Q lcl|NC_019423. 488 AAGIRGALDAASKREMAI-LRRLAKGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDLKGNFDIEVDINT---AE 563 (756) Q Consensus 488 A~~i~~~~~aa~~~l~~~-~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~~~~~Dv~V~~g~---a~ 563 (756) . .+|++..++.+..... .-.|-+.++...+.+-+++..+...- .+.+..+.|..++-. ..-+.++... .+ T Consensus 438 s-n~SG~Ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~~lL~li~~~----y~~~R~~RI~~ed~~-~~~v~in~~~~d~~~ 511 (706) T protein:vir:10 438 S-NVARETVNSLLNRSDMASFIYLDNMAKSLKRAGEIWLSMAREI----YGSDREVRIVHEDGT-DDIALMNAAVLDNQT 511 (706) T ss_pred c-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----cCCCcEEEEecCCCC-ccceeeccceecccc Confidence 5 4788877666654443 34455566777777777777654331 122334555444321 1112222110 00 Q ss_pred HHHHHHHHHHHH-HHH---hhccCC--HhHHHHHHHHHHhhcC-----ChhHHHHhhhccCCC--Chhhhh--------- Q lcl|NC_019423. 564 IDNQKSQDLGFM-VQT---LGNTVD--QSITLSLVAKIAELKR-----MPDLAHELRTWQPQP--DPMEEQ--------- 621 (756) Q Consensus 564 ~~~~~~q~l~~l-lq~---~~~~~~--~~~~~~~l~~l~e~~~-----~~~~~~~l~~~~~q~--~p~~~~--------- 621 (756) +.......+.-. ... -+|..+ .......+.+++.... .+.+...+-....-| +..... 
T Consensus 512 G~~~~~nDi~~g~yDv~i~~~p~~~t~r~~~~~~m~el~~~~~p~~~~~~~l~~~~~~~~d~p~~~e~~e~irk~~~~q~ 591 (706) T protein:vir:10 512 GRVVALNDLSTGRYDVSVDVGPSYSARRDATVNALTQLLQGMLPQDPMRPALMGIIIDNMEGEGLDDFKAFNRRQLLTQG 591 (706) T ss_pred CceeeeecceeeeEEEEEecccCcchHHHHHHHHHHHHHHhcCCcchhhHHHHHHHHhhcCccchHHHHHHHHHhhcccC Confidence 000000000000 000 001111 1112222333333221 112222222211111 111100 Q ss_pred -H--HHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 622 -L--KQLAIQKAQL---ENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHA-RDMEKQKAQSQGNQNLQITKAL 694 (756) Q Consensus 622 -~--~q~~~~~aq~---e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~-~~~~~~~~q~~~~~~~~~~~a~ 694 (756) . .+.++++..+ +++..++++++.+++++..++++++.+++....+.+++.. ..++...+++++...+...... T Consensus 592 ~~~~~~~~eq~~~~q~qq~q~~q~~~~~~~~~aq~~~~qA~~~k~~a~~~q~~~~a~~a~~qa~~~~~~~~~~~~~a~~~ 671 (706) T protein:vir:10 592 IVKPRNQQEQAIVQQAQQAQATQPDPNMLLAQAQMVVAQAEAQKSQNETVQTQIKAFTAQQDAMESQANTVYKLAQARNI 671 (706) T ss_pred CccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0 0111111111 1112222333333333333344443333332222222211 1222222222222222222111 Q ss_pred HHHh-hcc-CCchhhhccCCCCCCCcccCchhcCCCCC--CCCccc Q lcl|NC_019423. 695 TTPT-KEG-ETTPNISAAVGYNTLTNGNSPQERDLAAQ--QDPAYS 736 (756) Q Consensus 695 ~~~~-~~~-~~~~~~~~a~~~~~~~~~~~~~~~~~~~~--~~~~~~ 736 (756) .... ..+ ...+.+++. .+. 
+.|+.+ ++-+|+ T Consensus 672 ~~~~~~q~~q~l~~~~a~------q~~-----~~~~~~~~~~~~~~ 706 (706) T protein:vir:10 672 DDKAVMETLRLLKEVAAS------QQQ-----TIPSPPSPADIVPS 706 (706) T ss_pred HHHHHHHHHHHHHHHHHh------ccC-----CCCCCCCCcccCCC Confidence 1111 000 011222222 111 111111 111222 No 128 >protein:vir:105429 Length: 708 # NCBI annotation: gene 3 protein # Family: family:all:487 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958179;genbank:gi:41057281;genbank:GeneID:2716676 Probab=98.47 E-value=5.2e-07 Score=55.03 Aligned_cols=591 Identities=11% Similarity=0.013 Sum_probs=172.1 Q ss_pred hhHHH-HHHHHHHHHhccccCCCCCCCCCCCcccCHHHHHHHHHHHHHHHHhhcCCCCEEEE-ecCCcchHHHHHHHHHH Q lcl|NC_019423. 42 HDAIM-SQIREWNDLMEVKGKAKPPKIKGRSQVQPRLVRRQAEWRYAPLSEPFLSSSKLFKL-TPVTFEDELAARQNELV 119 (756) Q Consensus 42 ~~~~~-~~~~~~~~~y~~~~~~~~~~~~grS~~v~~~v~~~~e~~~~~L~~~f~~~~~~~~~-~p~~~~D~~~A~q~t~~ 119 (756) -.... ...++..+.|..+ ...+-+|+...+-... |.| -+..|.++..+.... T Consensus 1 m~~~~~~~~~~~~~~~~~~------------------~~~~~~~r~~~~~D~~------f~~~~G~QW~~~~~~~l~~-- 54 (708) T protein:vir:10 1 MAETLEKKHERIMLRFDRA------------------YSPQKEVREKCIEATR------FARVPGGQWEGATAAGTKL-- 54 (708) T ss_pred CchhHHHHHHHHHHHHHHH------------------HHhhHHHHHHHHHHHH------hhcCCCCCCCHHHHHHHHH-- Confidence 11111 1233344433321 1112233322222211 222 255666665443111 Q ss_pred HHHHHhhhcCCcc-hHHHHHHHHhhcCceEEEEeeeeeeeeeeeeeeeeecC-CCCCHHHHHHHHHhHHHhhhccchhcc Q lcl|NC_019423. 120 LNYQFRTQLNKVK-LVDDYVHSIVDDGTGIARIGWERKTVKIKTETPVFQLY-PIENQEQADVLQQALQLQAENPREYDE 197 (756) Q Consensus 120 ~n~~~~~~~~~~~-~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~e 197 (756) ..+..|.+ +.+|.|+-.+..-.|.-+ .+ .. ...+... 
...+.+.++.+...+..+.+.- T Consensus 55 -----~~q~~grP~~~~N~i~~~v~~v~g~~~-----~n---r~-d~~v~P~~~~~d~~~Ae~l~~~~~~~~~~~----- 115 (708) T protein:vir:10 55 -----DEQFEKYPKFEINKVATELNRIIAEYR-----NN---RI-TVKFRPGDREASEELANKLNGLFRADYEET----- 115 (708) T ss_pred -----hhhhcCCCceEEcchHHHHHHHHHHHH-----hC---Cc-ceEEEcCCCCchHHHHHHHHHHHHHHHHhc----- Confidence 11223333 344666666666555222 22 22 1233333 3446778888888777655532 Q ss_pred ccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCcee-EEEechhheEeCCCCcCccccCceEEEEeecCHHHHHh Q lcl|NC_019423. 198 TMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPT-VEMLNPNNVVIDPSCNGDLDKALYAVISFETCKADLMK 276 (756) Q Consensus 198 ~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~-ie~V~p~~~~~Dp~a~~d~~da~~v~~~~~~t~~el~~ 276 (756) ........++.+.+.+|.|+ .++..+-....+|. ..-..+.....||.- +.+-| -..+..++++.+. T Consensus 116 ~~~~~~s~Af~d~i~~G~Gw-------~~~~~d~~~e~d~~~~~~~i~i~~~~~p~~-~v~~D----p~a~~~D~sDar~ 183 (708) T protein:vir:10 116 DGGEACDNAFDDAATGGFGC-------FRLTSMLVNEYDPMDDRQRIAIEPIYDPSR-SVWFD----PDAKKYDKSDALW 183 (708) T ss_pred CchHHHHHHHHhhhhcccce-------eeeeeccccccCCCCCccccceEEeecchh-hcccC----ccccccChhhhhh Confidence 22233445666666766543 33333211111111 000000111122210 00000 0122234444433 Q ss_pred hccchhhhcccCchhhhhhhchhhhccccccccccccccceEEEEEEEEEeeccCCceeEEEE-------EEE-----EC Q lcl|NC_019423. 277 NKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVAYEYWGFYDINDDGSLEPIV-------ATW-----IG 344 (756) Q Consensus 277 ~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~E~w~k~d~~~~g~~~~~~-------~~~-----~g 344 (756) .. ....++ .+.. ...++.... ..+++..... ...+ |. 
..+...+.++++ +++ .| T Consensus 184 ~~-~~~~~~---~d~~-~~~~p~~a~---~~~d~~~~~~---~~~~-~~--~~d~v~v~ey~~r~~~~~~~~~~~~~~tg 249 (708) T protein:vir:10 184 AF-CMYSLS---PEKY-EAEYGKKPP---TSLDVTSMTS---WEYN-WF--GADVIYIAKYYEVRKESVDVISYRHPITG 249 (708) T ss_pred hh-hccCCC---HHHH-HHhCCCCcc---cccccccCCC---cccc-cc--CCCceEEEEeeeEEEEEEEEEEEecCCCC Confidence 21 111111 1100 011111100 0111111100 0111 21 111122223221 111 12 Q ss_pred CEEEEecccc-------------------------c-----------CCCccceEEeeeeeecCccc---CCchHHH-hH Q lcl|NC_019423. 345 STLIRMENNP-------------------------F-----------PDGKLPLVVVPYMPRKRELF---GEADAEL-LG 384 (756) Q Consensus 345 ~~~L~~~~~P-------------------------~-----------~~~~~Pfv~~~~~~~~~~~~---G~g~v~~-~~ 384 (756) ..+...+... + ..+.+||-.|++.|.-+..+ |...... ++ T Consensus 250 ~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~le~~~~~p~~~fP~vP~~g~r~~~d~~~~~yG~vr 329 (708) T protein:vir:10 250 EIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRWFIDDIERVEGHIA 329 (708) T ss_pred ceeeecchhhhhHHHHHHhcccchhheeeeeeEEEEEEeecchhhhccCCCCCCCceeeEEEeeeeeccCCCcccceeec Confidence 2222211110 0 11225555555555433322 1111011 12 Q ss_pred HHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhhhhccccccccccc--ccc-cc---ccccccCC----CcchH Q lcl|NC_019423. 385 DNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQ--GNP-SQ---SIMEHKFP----ELPQS 454 (756) Q Consensus 385 d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~--~~~-~~---~i~~~~~~----~~~~~ 454 (756) ++-+.-......+.-.+...+..+....-...................+... +.+ .. 
.+.....+ +.++- T Consensus 330 ~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~~~~~~~~~~~q~~~~ 409 (708) T protein:vir:10 330 KAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKSGNIIAGATPAGYTQPAVM 409 (708) T ss_pred ccchhHHHHHHHHHHHHHHHHhcCCcccccChhhhhhHHHHHhhccccchhhhccccccccccccccccCCccccCCccc Confidence 2221111111111111111111111111111011111101111100011000 000 00 11111111 12233 Q ss_pred HHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhCCCCcE Q lcl|NC_019423. 455 AIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAI-LRRLAKGMADIGTKICAMNAVFLSEKEV 533 (756) Q Consensus 455 ~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~-~~n~~~~~~~l~~~~l~li~q~~~~~r~ 533 (756) ....++++.....++ ...+|.++.++|.. +.+|++..++.+..+.. .-.|-+.++.-.+.+.+++..+..+- T Consensus 410 ~~~~~~l~q~~~~~i----~~vsG~~~~~lG~~-sn~SG~aI~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~-- 482 (708) T protein:vir:10 410 NQALAALLQQTSADI----QEVTGGSQAMQQMP-SNIAQETVNNLMNRADMASFIYLDNMAKSLKRAGEVWLSMAREV-- 482 (708) T ss_pred hHHHHHHHHHHHHHH----HHHhCcChhHccCc-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-- Confidence 333555555555554 33468888777754 45888876666554443 34444566666677777776654331 Q ss_pred EEEecCceeecCHhHhcCcceEEEe-----------------------cccccHHHHHHHHHHHHHHHhhccCCHhHHHH Q lcl|NC_019423. 534 VRITNEQYVEIKREDLKGNFDIEVD-----------------------INTAEIDNQKSQDLGFMVQTLGNTVDQSITLS 590 (756) Q Consensus 534 iRI~g~~~v~i~~d~~~~~~Dv~V~-----------------------~g~a~~~~~~~q~l~~llq~~~~~~~~~~~~~ 590 (756) .+.+..+.|..++=. ...+.++ +..........+. +.... 
T Consensus 483 --y~~er~~RI~~edg~-~~~v~in~~~~d~~~g~~~~~nDi~~g~yDv~i~~~p~~~s~r--------------~~~~~ 545 (708) T protein:vir:10 483 --YGSEREVRIVNEDGS-DDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARR--------------DATVS 545 (708) T ss_pred --cCCCcEEEEecCCCC-cceEEecceeccCCCcceeeeeccceeeEEEEEecccCchhHH--------------HHHHH Confidence 112234555544311 1112221 1111111111110 11112 Q ss_pred HHHHHHhhcCCh-----hHHHHhhhccCCC--ChhhhhH----------HH---HHHHH--HHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 591 LVAKIAELKRMP-----DLAHELRTWQPQP--DPMEEQL----------KQ---LAIQK--AQLENEELQSKIALNNAKA 648 (756) Q Consensus 591 ~l~~l~e~~~~~-----~~~~~l~~~~~q~--~p~~~~~----------~q---~~~~~--aq~e~~~~qa~a~~~~a~a 648 (756) .|.+++...... .+...+-....-| +.....+ ++ .++++ +++.+++.+++.....+++ T Consensus 546 ~l~qll~~~~p~~~~~~~~~~~~l~~~D~p~~~ei~erir~~~~~~~~~~~~~~ee~q~~~~~q~~~q~q~~~~~~e~qa 625 (708) T protein:vir:10 546 VLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQA 625 (708) T ss_pred HHHHHHHhcCCCchhhHHHHHHHHHhcCCcChHHHHHHHHHhhcccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 222222222211 1111111111111 1111110 00 00110 1111111112222222233 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCchhhhccCCCCCCCc-ccCchhcC Q lcl|NC_019423. 649 KEAASSGDLKDLDYLEQESGTKH-ARDMEKQKAQSQGNQNLQITKALTTPTKEGETTPNISAAVGYNTLTN-GNSPQERD 726 (756) Q Consensus 649 ~~~~aq~~~~~~~~~~q~~~~k~-~~~~~~~~~q~~~~~~~~~~~a~~~~~~~~~~~~~~~~a~~~~~~~~-~~~~~~~~ 726 (756) +.+++|++++++++.++..+++. .+.++-..++.+..+.++....+.. .. ...++.++....+ .-.-+... T Consensus 626 ~~~~~qAe~~ka~a~a~~~~~~a~q~~~~~~~a~~~a~q~~~~a~~~~~-~~------~~~~~q~l~~~q~~q~~~~~~~ 698 (708) T protein:vir:10 626 QMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDD-KA------VMEAIRLLKDVAESQQQQFQSP 698 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HH------HHHHHHHhhhhhhhHHHHHhcc Confidence 33333333333332222211111 0111111111111111111111110 00 0111112221111 00011111 Q ss_pred CCC--CCCCc Q lcl|NC_019423. 
727 LAA--QQDPA 734 (756) Q Consensus 727 ~~~--~~~~~ 734 (756) |-- ..+|+ T Consensus 699 p~~~~~~~p~ 708 (708) T protein:vir:10 699 PQSPADLMPS 708 (708) T ss_pred ccCchhccCC Confidence 111 22233 No 129 >protein:vir:93630 Length: 776 # NCBI annotation: Bcep22gp51 # Family: family:all:487 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944280;genbank:gi:38640357;genbank:GeneID:2658279 Probab=97.91 E-value=1.1e-05 Score=47.71 Aligned_cols=649 Identities=8% Similarity=-0.006 Sum_probs=182.0 Q ss_pred CCCccccccc----c-----C-CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCCCCCCcccCHHHH Q lcl|NC_019423. 10 LPDPAQSEKL----T-----D-WKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAKPPKIKGRSQVQPRLVR 79 (756) Q Consensus 10 ~~~~~~~~~~----~-----~-~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~grS~~v~~~v~ 79 (756) |.|-..+.+. + . +++.+.. ..+...+...........++-+.+|..+-. . ..+.|+ ..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~-~--~~~~r~-----~a~ 69 (776) T protein:vir:93 1 MFDLNDKDSTQLVPARTDEGELSPGEDAA---QREKPANPLDSEQAVELHSRLLSYYRQELS-R--QQDNRA-----EMA 69 (776) T ss_pred CCCccccccccccccccccccCCCCCccc---chhcccCCCCCHHHHHHHHHHHHHHHHHHh-h--chHHHH-----HHH Confidence 3343333222 0 1 2332221 111111111111111122333444432111 0 111121 111 Q ss_pred HHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeee Q lcl|NC_019423. 80 RQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVK 159 (756) Q Consensus 80 ~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~ 159 (756) .-.-|..|+ .|.++..+. ....+.-.+.+|.|+-.+..-.|..+ ..+. 
T Consensus 70 ---------~d~~fy~G~--------Qw~~~~~~~----------l~~~g~p~~~~N~i~~~i~~v~g~~~-----~nr~ 117 (776) T protein:vir:93 70 ---------VDEDYYDNI--------QWSQDEIDE----------LKERGQAPTVYNVISQSVNWIIGSEK-----RGRS 117 (776) T ss_pred ---------HHHHHhCCC--------CCCHHHHHH----------HHhcCCceEEecchHHHHHHHHHHHH-----hCCc Confidence 111344444 444433332 12233344566666666655555222 1111 Q ss_pred eeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeE Q lcl|NC_019423. 160 IKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTV 239 (756) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~i 239 (756) ...+...+..+.+.++.+..++..+... +........++.+.+.+|.|+..+ ..+ +. . T Consensus 118 ----~~~~~p~~~~d~~~Ae~l~~~~~~~~~~-----~~~~~~~~~af~d~~~~G~G~~~v-------~~d---~~-~-- 175 (776) T protein:vir:93 118 ----DFKVLPRRKDGGKAAERKTALLKYLSDV-----NHTPFERSMAFEETTKAGIGWLES-------QVQ---DE-N-- 175 (776) T ss_pred ----ceEEecCChhHHHHHHHHHHHHHHHHHh-----hcHHHHHHHHHHHhhhcCcceEEE-------Eee---cc-C-- Confidence 1223334567778888888877765432 223334456677777777665432 221 00 0 Q ss_pred EEechhheEeCCCCcCccccCceEEE--EeecCHHHHHhhccc----hhhhcccCchhhhhhhchhhhcccccccccccc Q lcl|NC_019423. 240 EMLNPNNVVIDPSCNGDLDKALYAVI--SFETCKADLMKNKDR----YHNLDKIDWESSSPITDPDHESKTPSDFQFKDA 313 (756) Q Consensus 240 e~V~p~~~~~Dp~a~~d~~da~~v~~--~~~~t~~el~~~~~~----~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 313 (756) .-+.++. ... ++. .+++- .+..++++....+-. .+.+...-.+..+.... ..........+.+. 
T Consensus 176 ---~~~~~~~-~~~--~p~--~i~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~p~~~~~~~~--~~~~~~~~~~~~~~ 245 (776) T protein:vir:93 176 ---DGEPIYA-GAE--SWR--NILWDSTYRRLDMDDCRYIFRVKWVDLDVMLAIFPERAAQLRA--AAVDNFETWGTDDI 245 (776) T ss_pred ---CCCceEe-ecc--Chh--heeeccccccCCHHHHhhhhhhccCCHHHHHHhcCCchHHHHH--hhhhcccccchhcc Confidence 0011110 000 011 12221 233456666544211 11111110000000000 00000001111111 Q ss_pred ccce----EEEEEEEEEeec-----cCC--ceeEEEEEEEE---------C--CEE------------------------ Q lcl|NC_019423. 314 LRKK----VVAYEYWGFYDI-----NDD--GSLEPIVATWI---------G--STL------------------------ 347 (756) Q Consensus 314 s~~~----V~v~E~w~k~d~-----~~~--g~~~~~~~~~~---------g--~~~------------------------ 347 (756) .... ..+..+|...+. ..+ .+.++++-.+. | ..+ T Consensus 246 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~r~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~g~~~~~~~~ 325 (776) T protein:vir:93 246 DGDDAMDSPEYERSMNSVTAGAVAYARKRVRMIEAWFRMPVRVQRLKGRNSDFRGEVFDPNDERHVLEVESGRAVLAVSP 325 (776) T ss_pred cccccccccccccccccccccccccCCCeEEEEEEEEeeeeehhhcccccccccceeecccchHHHHHhhcCceeehhee Confidence 0000 001111111100 001 11222221110 0 011 Q ss_pred -------EEecccccCCCccc--eEEeeeeeecCccc-CCchHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccC Q lcl|NC_019423. 348 -------IRMENNPFPDGKLP--LVVVPYMPRKRELF-GEADAELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLD 417 (756) Q Consensus 348 -------L~~~~~P~~~~~~P--fv~~~~~~~~~~~~-G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~ 417 (756) +..+......+..| +-.|++.+..+... ..|+...+...=+-.-.+.+...--+ .+++..+.+- T Consensus 326 ~~~v~~~~~~g~~~l~~~~~p~~~~~~Pfv~~~~~~~~~~~~~~G~v~~~~d~Q~~~N~~~s~~------~~~l~~~~~~ 399 (776) T protein:vir:93 326 MMRMHCAIMTTRDLMWAGPSPYRHNRYPFTPIWGFRRARDGMPYGVIRFMRGMQDDVNKRLSKA------LYILSTNKVL 399 (776) T ss_pred eeeeEEEEEecchhhhccCCCCCCCccceEEecCceecccccccchHHhhhHHHHHHHHHHHHH------HHhhcCCcee Confidence 11111112222233 34556665555433 23343344433333333333221111 2333322221 Q ss_pred ccchhhhhccccccccccccccccccccccC--C-----CcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHH Q lcl|NC_019423. 
418 TLNRRRYDDGQDYEYNPMQGNPSQSIMEHKF--P-----ELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAG 490 (756) Q Consensus 418 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~--~-----~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~ 490 (756) .....-... ..... ....++..+...+. + ..++....+++++....+.+ ....|++..++|...++ T Consensus 400 ~~~gav~~~-d~~~~--~~~rp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i----~~~tGi~~~~~G~~~n~ 472 (776) T protein:vir:93 400 MEEGAVDDI-DEFRR--EAARPDAVMTVKNGKLGAVKMDVDRDLAPAHLELASRSIQMI----QQVGGVTDEMLGRTTNA 472 (776) T ss_pred eccccccch-HHHHH--hcccCCceeeeCCccccccccccCcCccHHHHHHHHHHHHHH----HHhhCcChHHhCCCcch Confidence 110000000 00000 01123333332221 1 11223334444444444443 44568888777777778 Q ss_pred HHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEe-cCceeecCHhHhcCcceEEEecccccHHHHH Q lcl|NC_019423. 491 IRGALDAASKREMA-ILRRLAKGMADIGTKICAMNAVFLSEKEVVRIT-NEQYVEIKREDLKGNFDIEVDINTAEIDNQK 568 (756) Q Consensus 491 i~~~~~aa~~~l~~-~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~-g~~~v~i~~d~~~~~~Dv~V~~g~a~~~~~~ 568 (756) +|++..++.+.-+. .+..|.+.+.+..+.+.+++....-. +. .+..+.|...+-..+| |.|+.+. ..+.. T Consensus 473 ~Sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~-----~~~~~r~~ri~~~~~~~~~-v~in~~~--~~nd~ 544 (776) T protein:vir:93 473 VSGVAIQARQEQGSVATNKLFDNLRLAFQQHGEKELSLIEQ-----YMTEEKQFRITNSRGNPEY-VTVNDGL--PENDI 544 (776) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----hcCcceEEEEeecCCCcce-EEecccc--hhhhh Confidence 88876555443322 33445555566666666666654322 21 2234445443322121 3333221 11000 Q ss_pred HHHHHHHHHHhhccCCH--hHHHHHHHHHHhhcCChhHH----HHhhhccCCCC--hhhhhHHHH------HH-HHHHHH Q lcl|NC_019423. 569 SQDLGFMVQTLGNTVDQ--SITLSLVAKIAELKRMPDLA----HELRTWQPQPD--PMEEQLKQL------AI-QKAQLE 633 (756) Q Consensus 569 ~q~l~~llq~~~~~~~~--~~~~~~l~~l~e~~~~~~~~----~~l~~~~~q~~--p~~~~~~q~------~~-~~aq~e 633 (756) ..--..+.-..++..+- ......+..++..++ +++. ..+......+. ......++. 
.+ +..+.+ T Consensus 545 ~~~~~dv~v~~~~~~~s~r~~~~~~l~ql~~~~~-p~~~~~~~~~~~e~~d~p~~~e~~~~l~~~~~~~~p~q~~~~~e~ 623 (776) T protein:vir:93 545 TRTKADFIIDEAEWRATMRQAAVAELMEVIGKMP-PEIALTMLDLLVENMDIPNRDELVKRIRAVNGQKDPDQDEPTPEE 623 (776) T ss_pred ccceeeEEEeecccchhHHHHHHHHHHHHHhhcC-hhhHHHHHHHHHHhcCccchHHHHHHHHHhhcccccchhhcchhH Confidence 00000000000011000 001111222222111 2211 12222222111 111111100 00 011111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHH---HHHHHHHH-HH---HHhhccCCch Q lcl|NC_019423. 634 NEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKH-ARDMEKQKAQSQGN---QNLQITKA-LT---TPTKEGETTP 705 (756) Q Consensus 634 ~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~-~~~~~~~~~q~~~~---~~~~~~~a-~~---~~~~~~~~~~ 705 (756) .+..+++.+..+++.+..+++.+..+++..+..+.... ..+.++...++... .+..++++ .. ..+....+.+ T Consensus 624 ~~~qq~q~~~~q~q~~~~~a~~~~~qa~a~~~~aea~~~~aqa~~~~~~a~~~~~~a~q~a~qa~~~~~~~~~~a~~a~~ 703 (776) T protein:vir:93 624 IAREQAQQQQQQYNDALAIATLEEQQAKARKAAAEAQVAEAKAKHISRMAIREGVGAVKDATDAATAIAFMPELAGLSDG 703 (776) T ss_pred HHHHHHhhHHHHHHHHHhhhhhhHhhHHHHHHHHHHHHHhhhhhhhhhcchhhhhhhhhhhhhhhhhhhhhhhhhhhhhh Confidence 11111122211222111111111111111111110000 00011111000000 00001110 00 0111111122 Q ss_pred hhhccCCCCCCCcccCchhcCCCCCCCCccccccccccCCC---CCCCC----CCCcC Q lcl|NC_019423. 706 NISAAVGYNTLTNGNSPQERDLAAQQDPAYSLGSQYYDPSQ---DPASA----LGMNL 756 (756) Q Consensus 706 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~----~~~~~ 756 (756) .++.+....-..|...+....+ +++++.+.--+.-..|.. 
...++ |+-.- T Consensus 704 ~~~~a~~~~p~~p~~~~~~~~~-~~~~~~p~~p~~p~~p~~p~~~~~~~~p~~p~~~p 760 (776) T protein:vir:93 704 ILRESGWDDPNTPQPASAASGM-PPAPAQPAQPANPAQPPAPGQAASEAQPALPANPP 760 (776) T ss_pred hhccccccccccccccccccCC-CCCCCCCCCCCCcCCCCCCCCCCCCCCCcccCCCC Confidence 2222211111112111111100 111111111111111110 00000 00000 No 130 >protein:vir:3296 Length: 714 # NCBI annotation: putative portal protein # Family: family:all:487 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049512;genbank:gi:9632518;genbank:GeneID:1262005 Probab=97.90 E-value=1.2e-05 Score=47.61 Aligned_cols=607 Identities=12% Similarity=0.056 Sum_probs=171.8 Q ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCCCCCCcccCHHHHHHHHHHHHHHHH-hhcCCCCEE Q lcl|NC_019423. 22 WKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAKPPKIKGRSQVQPRLVRRQAEWRYAPLSE-PFLSSSKLF 100 (756) Q Consensus 22 ~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~grS~~v~~~v~~~~e~~~~~L~~-~f~~~~~~~ 100 (756) |.||+-.-.=+.+=......| ++.+. ++..++..+-+|+--..-. -|..|+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~-------~~~~~------------------~~~~~~~~~~~~R~~a~~d~~fy~G~--- 52 (714) T protein:vir:32 1 MKNETNTMATKNDNGATPRFS-------QRQLQ------------------ALCSDIDSQPKWRDAANKACAYYDGD--- 52 (714) T ss_pred CCcccccccCCCCcchhHHHH-------HHHHH------------------HHHHHHHhhHHHHHHHHHHHHhhcCC--- Confidence 333221100000000000001 11111 2222333334444332222 233444 Q ss_pred EEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeeeeeeeeeecCC--CCCHHHH Q lcl|NC_019423. 101 KLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIKTETPVFQLYP--IENQEQA 178 (756) Q Consensus 101 ~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~--~~~~~~~ 178 (756) .|.++..+. + ...+.-++.+|.|+-.+..-.|.-+ ..+. 
...+...+ ..+.+.+ T Consensus 53 -----Qw~~~~~~~--------l--~~~g~p~~~~N~i~~~v~~v~g~~~-----~nr~----~~~v~p~~~~~~~~~~A 108 (714) T protein:vir:32 53 -----QLPPEVLQV--------L--KDRGQPMTIHNLIAPTVDGVLGMEA-----KTRT----DLVVMSDEPDDETEKLA 108 (714) T ss_pred -----CCCHHHHHH--------H--HhcCCCcEEeccHHHHHHHHHhHHH-----hCCc----ceEEecCCCCchhHHHH Confidence 333322222 1 2233445556677666666566222 2222 12233322 2333567 Q ss_pred HHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEEEechhheEeCCCCcCccc Q lcl|NC_019423. 179 DVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVEMLNPNNVVIDPSCNGDLD 258 (756) Q Consensus 179 ~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie~V~p~~~~~Dp~a~~d~~ 258 (756) +.+...+..+...- ........++.+.+.+|.|+. ++ . ++++.|--+..++. ++ T Consensus 109 e~l~~~~~~~~~~~-----~~~~~~s~af~~~~~~G~G~~-------~~--~-----------~~~d~~~~~i~i~~-v~ 162 (714) T protein:vir:32 109 EAINAEFADACRLG-----NMNKARSDAYAEQIKAGLSWV-------EV--R-----------RNSDPFGPEFKVST-VS 162 (714) T ss_pred HHHHHHHHHHHHhh-----chhHHHHHHHHHhhhcCcceE-------Ee--c-----------cccCCCCCCeEEEe-cc Confidence 77776666544421 222334456666666665431 11 0 11111111111111 00 Q ss_pred cCceEEE--EeecCHHHHHhhcc----chhhhcccCchhhhhhhchhhhccccccccccc-cccceEEEEEEEEEeeccC Q lcl|NC_019423. 259 KALYAVI--SFETCKADLMKNKD----RYHNLDKIDWESSSPITDPDHESKTPSDFQFKD-ALRKKVVAYEYWGFYDIND 331 (756) Q Consensus 259 da~~v~~--~~~~t~~el~~~~~----~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d-~s~~~V~v~E~w~k~d~~~ 331 (756) --.+++- .+..++++.....- ..+.+...-.+..+.............+....+ .....+.-++.+..++... 
T Consensus 163 p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~ 242 (714) T protein:vir:32 163 RNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQ 242 (714) T ss_pred hhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhccccccc Confidence 0011111 22344555543321 111111100000000000000000000000000 0000011111111111111 Q ss_pred ------Cc--e--eEEE-EE-----EE---ECCEEEEeccccc---------------CCCccceEEee--------eee Q lcl|NC_019423. 332 ------DG--S--LEPI-VA-----TW---IGSTLIRMENNPF---------------PDGKLPLVVVP--------YMP 369 (756) Q Consensus 332 ------~g--~--~~~~-~~-----~~---~g~~~L~~~~~P~---------------~~~~~Pfv~~~--------~~~ 369 (756) +. + .+++ +. ++ .|+.+...+.+|- .+.++....|. ..| T Consensus 243 ~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p 322 (714) T protein:vir:32 243 NEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPCS 322 (714) T ss_pred cccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCcccccCCCC Confidence 00 0 0111 10 00 1222333322220 01111111111 112 Q ss_pred ecCcccC----CchHHHhHHHHHHHHHHHHHHHHHHHhhcCCce----Ee-eccccCccchhhh---hcccc-ccc-ccc Q lcl|NC_019423. 370 RKRELFG----EADAELLGDNQAILGATMRGMIDLLGRSANGQR----GY-PKGMLDTLNRRRY---DDGQD-YEY-NPM 435 (756) Q Consensus 370 ~~~~~~G----~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~----~~-~~gav~~~~~~~~---~~~~~-~~~-~~~ 435 (756) .+|..|+ +|+.+ +.....--++|.++|+-...|+..- ++ .++.+-....... ..... ... 
+.+ T Consensus 323 ~p~~~fp~vp~~g~~~---~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l~~~~~~~~~~a~~~~d~~~~e~~arp~~vi 399 (714) T protein:vir:32 323 APQGMFPLVPFWGYRK---DKTGEPYGLISRAIPAQDEVNFRRIKLTWLLQAKRVIMDEDATQLSDNDLMEQIERPDGII 399 (714) T ss_pred CCCCceeEEEEeeeee---eccCceeehhhhchhHHHHHHHHHHHHHHhhcCCceeeecCcccccHHHHHHhccCCCCce Confidence 2232211 11211 1111222355566665443332111 11 1111100000000 00000 001 111 Q ss_pred ccccccc-----cccccCCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHH-HHHH Q lcl|NC_019423. 436 QGNPSQS-----IMEHKFPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAI-LRRL 509 (756) Q Consensus 436 ~~~~~~~-----i~~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~-~~n~ 509 (756) ..++... ..++.+.+.++-....++++.... +.-...+|.++.++|...+++|++..++.+..+.. +-.| T Consensus 400 ~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~----~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~l~~~ 475 (714) T protein:vir:32 400 KLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESE----KLIQDTMGVYSAFLGQDSGATSGVAISNLVEQGATTLAEI 475 (714) T ss_pred eecccccccCCCCccccccCCCCccHHHHHHHHHHH----HHHHHhhCCChHHcCCCccchhHHHHHHHHHHHHHHHHHH Confidence 2222111 111222223334444444444444 44455679988888888888999887777765442 3444 Q ss_pred HHHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHh-HhcCcce-EEEec--ccccHHHHHHHHHHHHH-H---Hhhc Q lcl|NC_019423. 510 AKGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKRE-DLKGNFD-IEVDI--NTAEIDNQKSQDLGFMV-Q---TLGN 581 (756) Q Consensus 510 ~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d-~~~~~~D-v~V~~--g~a~~~~~~~q~l~~ll-q---~~~~ 581 (756) -+.++...+.+.+++..+...- .+.+..+.|... +-.+... ++++. +.+...+. +.-.- . 
..+| T Consensus 476 ~Dnl~~~~~~~g~~lL~li~~~----~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nD----i~~~~~Dv~i~~~p 547 (714) T protein:vir:32 476 NDNYQFACQQVGRLLLAYLLDD----LKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTND----ISRLNTHIALAPVQ 547 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHH----cCCCcEEEEeccCCCcCcceEEeeccccCcceeccc----ceeeeEEEEEeecc Confidence 5566767777777776553321 112233444321 1111001 22221 11111000 00000 0 0011 Q ss_pred cCCHhHHHHHHHHHHhhcCC--hhH----HHHhhhccCCCCh--hh----------hhH--HHHHHHHHHHHHHHH---H Q lcl|NC_019423. 582 TVDQSITLSLVAKIAELKRM--PDL----AHELRTWQPQPDP--ME----------EQL--KQLAIQKAQLENEEL---Q 638 (756) Q Consensus 582 ~~~~~~~~~~l~~l~e~~~~--~~~----~~~l~~~~~q~~p--~~----------~~~--~q~~~~~aq~e~~~~---q 638 (756) . .+..-...+..|+++... |+. ..++-....-|.. .. ... .+++++++++..+.+ + T Consensus 548 ~-~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q 626 (714) T protein:vir:32 548 Q-TPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQ 626 (714) T ss_pred C-chHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHH Confidence 1 112222233333333221 221 1222122221211 10 000 111222222222111 1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHhhccCCchhh-hccCCCC Q lcl|NC_019423. 639 SKIALNNAKAKEAASSGDLKDLDYLEQESGTKHAR---DMEKQKAQSQGNQNLQITKALTTPTKEGETTPNI-SAAVGYN 714 (756) Q Consensus 639 a~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~---~~~~~~~q~~~~~~~~~~~a~~~~~~~~~~~~~~-~~a~~~~ 714 (756) ++.+..+++++..+.++++++++++++..+.+.+. ..+..+. .++..+.++.+.+...+.... ...+ +.++ -. T Consensus 627 ~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~-~~~~~~a~~a~~~~~~~~~~~-~~~~~~~q~-~q 703 (714) T protein:vir:32 627 AELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRY-VDALNQAHTAEIITGVQNMEQ-EQDVLQQQM-LY 703 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhHhhhhh-hhHHHHHHH-HH Confidence 22222222222222333333332222111111111 1111111 111111122111111111000 0000 0110 00 Q ss_pred CCCcccCchhcCCCCCCCCcccc Q lcl|NC_019423. 
715 TLTNGNSPQERDLAAQQDPAYSL 737 (756) Q Consensus 715 ~~~~~~~~~~~~~~~~~~~~~~~ 737 (756) +..+. -.|-.| T Consensus 704 ~~~~~------------~~~~~~ 714 (714) T protein:vir:32 704 TLQQR------------MNEMSL 714 (714) T ss_pred HHHHH------------HHhcCC Confidence 00010 001111 No 131 >protein:vir:817 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050550;genbank:gi:9633447;genbank:GeneID:1262279 Probab=97.90 E-value=1.2e-05 Score=47.61 Aligned_cols=607 Identities=12% Similarity=0.056 Sum_probs=171.8 Q ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCCCCCCcccCHHHHHHHHHHHHHHHH-hhcCCCCEE Q lcl|NC_019423. 22 WKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAKPPKIKGRSQVQPRLVRRQAEWRYAPLSE-PFLSSSKLF 100 (756) Q Consensus 22 ~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~grS~~v~~~v~~~~e~~~~~L~~-~f~~~~~~~ 100 (756) |.||+-.-.=+.+=......| ++.+. ++..++..+-+|+--..-. -|..|+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~-------~~~~~------------------~~~~~~~~~~~~R~~a~~d~~fy~G~--- 52 (714) T protein:vir:81 1 MKNETNTMATKNDNGATPRFS-------QRQLQ------------------ALCSDIDSQPKWRDAANKACAYYDGD--- 52 (714) T ss_pred CCcccccccCCCCcchhHHHH-------HHHHH------------------HHHHHHHhhHHHHHHHHHHHHhhcCC--- Confidence 333221100000000000001 11111 2222333334444332222 233444 Q ss_pred EEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeeeeeeeeeecCC--CCCHHHH Q lcl|NC_019423. 101 KLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIKTETPVFQLYP--IENQEQA 178 (756) Q Consensus 101 ~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~--~~~~~~~ 178 (756) .|.++..+. + ...+.-++.+|.|+-.+..-.|.-+ ..+. 
...+...+ ..+.+.+ T Consensus 53 -----Qw~~~~~~~--------l--~~~g~p~~~~N~i~~~v~~v~g~~~-----~nr~----~~~v~p~~~~~~~~~~A 108 (714) T protein:vir:81 53 -----QLPPEVLQV--------L--KDRGQPMTIHNLIAPTVDGVLGMEA-----KTRT----DLVVMSDEPDDETEKLA 108 (714) T ss_pred -----CCCHHHHHH--------H--HhcCCCcEEeccHHHHHHHHHhHHH-----hCCc----ceEEecCCCCchhHHHH Confidence 333322222 1 2233445556677666666566222 2222 12233322 2333567 Q ss_pred HHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEEEechhheEeCCCCcCccc Q lcl|NC_019423. 179 DVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVEMLNPNNVVIDPSCNGDLD 258 (756) Q Consensus 179 ~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie~V~p~~~~~Dp~a~~d~~ 258 (756) +.+...+..+...- ........++.+.+.+|.|+. ++ . ++++.|--+..++. ++ T Consensus 109 e~l~~~~~~~~~~~-----~~~~~~s~af~~~~~~G~G~~-------~~--~-----------~~~d~~~~~i~i~~-v~ 162 (714) T protein:vir:81 109 EAINAEFADACRLG-----NMNKARSDAYAEQIKAGLSWV-------EV--R-----------RNSDPFGPEFKVST-VS 162 (714) T ss_pred HHHHHHHHHHHHhh-----chhHHHHHHHHHhhhcCcceE-------Ee--c-----------cccCCCCCCeEEEe-cc Confidence 77776666544421 222334456666666665431 11 0 11111111111111 00 Q ss_pred cCceEEE--EeecCHHHHHhhcc----chhhhcccCchhhhhhhchhhhccccccccccc-cccceEEEEEEEEEeeccC Q lcl|NC_019423. 259 KALYAVI--SFETCKADLMKNKD----RYHNLDKIDWESSSPITDPDHESKTPSDFQFKD-ALRKKVVAYEYWGFYDIND 331 (756) Q Consensus 259 da~~v~~--~~~~t~~el~~~~~----~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d-~s~~~V~v~E~w~k~d~~~ 331 (756) --.+++- .+..++++.....- ..+.+...-.+..+.............+....+ .....+.-++.+..++... 
T Consensus 163 p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~ 242 (714) T protein:vir:81 163 RNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQ 242 (714) T ss_pred hhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhccccccc Confidence 0011111 22344555543321 111111100000000000000000000000000 0000011111111111111 Q ss_pred ------Cc--e--eEEE-EE-----EE---ECCEEEEeccccc---------------CCCccceEEee--------eee Q lcl|NC_019423. 332 ------DG--S--LEPI-VA-----TW---IGSTLIRMENNPF---------------PDGKLPLVVVP--------YMP 369 (756) Q Consensus 332 ------~g--~--~~~~-~~-----~~---~g~~~L~~~~~P~---------------~~~~~Pfv~~~--------~~~ 369 (756) +. + .+++ +. ++ .|+.+...+.+|- .+.++....|. ..| T Consensus 243 ~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p 322 (714) T protein:vir:81 243 NEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPCS 322 (714) T ss_pred cccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCcccccCCCC Confidence 00 0 0111 10 00 1222333322220 01111111111 112 Q ss_pred ecCcccC----CchHHHhHHHHHHHHHHHHHHHHHHHhhcCCce----Ee-eccccCccchhhh---hcccc-ccc-ccc Q lcl|NC_019423. 370 RKRELFG----EADAELLGDNQAILGATMRGMIDLLGRSANGQR----GY-PKGMLDTLNRRRY---DDGQD-YEY-NPM 435 (756) Q Consensus 370 ~~~~~~G----~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~----~~-~~gav~~~~~~~~---~~~~~-~~~-~~~ 435 (756) .+|..|+ +|+.+ +.....--++|.++|+-...|+..- ++ .++.+-....... ..... ... 
+.+ T Consensus 323 ~p~~~fp~vp~~g~~~---~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l~~~~~~~~~~a~~~~d~~~~e~~arp~~vi 399 (714) T protein:vir:81 323 APQGMFPLVPFWGYRK---DKTGEPYGLISRAIPAQDEVNFRRIKLTWLLQAKRVIMDEDATQLSDNDLMEQIERPDGII 399 (714) T ss_pred CCCCceeEEEEeeeee---eccCceeehhhhchhHHHHHHHHHHHHHHhhcCCceeeecCcccccHHHHHHhccCCCCce Confidence 2232211 11211 1111222355566665443332111 11 1111100000000 00000 001 111 Q ss_pred ccccccc-----cccccCCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHH-HHHH Q lcl|NC_019423. 436 QGNPSQS-----IMEHKFPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAI-LRRL 509 (756) Q Consensus 436 ~~~~~~~-----i~~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~-~~n~ 509 (756) ..++... ..++.+.+.++-....++++.... +.-...+|.++.++|...+++|++..++.+..+.. +-.| T Consensus 400 ~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~----~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~l~~~ 475 (714) T protein:vir:81 400 KLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESE----KLIQDTMGVYSAFLGQDSGATSGVAISNLVEQGATTLAEI 475 (714) T ss_pred eecccccccCCCCccccccCCCCccHHHHHHHHHHH----HHHHHhhCCChHHcCCCccchhHHHHHHHHHHHHHHHHHH Confidence 2222111 111222223334444444444444 44455679988888888888999887777765442 3444 Q ss_pred HHHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHh-HhcCcce-EEEec--ccccHHHHHHHHHHHHH-H---Hhhc Q lcl|NC_019423. 510 AKGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKRE-DLKGNFD-IEVDI--NTAEIDNQKSQDLGFMV-Q---TLGN 581 (756) Q Consensus 510 ~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d-~~~~~~D-v~V~~--g~a~~~~~~~q~l~~ll-q---~~~~ 581 (756) -+.++...+.+.+++..+...- .+.+..+.|... +-.+... ++++. +.+...+. +.-.- . 
..+| T Consensus 476 ~Dnl~~~~~~~g~~lL~li~~~----~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nD----i~~~~~Dv~i~~~p 547 (714) T protein:vir:81 476 NDNYQFACQQVGRLLLAYLLDD----LKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTND----ISRLNTHIALAPVQ 547 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHH----cCCCcEEEEeccCCCcCcceEEeeccccCcceeccc----ceeeeEEEEEeecc Confidence 5566767777777776553321 112233444321 1111001 22221 11111000 00000 0 0011 Q ss_pred cCCHhHHHHHHHHHHhhcCC--hhH----HHHhhhccCCCCh--hh----------hhH--HHHHHHHHHHHHHHH---H Q lcl|NC_019423. 582 TVDQSITLSLVAKIAELKRM--PDL----AHELRTWQPQPDP--ME----------EQL--KQLAIQKAQLENEEL---Q 638 (756) Q Consensus 582 ~~~~~~~~~~l~~l~e~~~~--~~~----~~~l~~~~~q~~p--~~----------~~~--~q~~~~~aq~e~~~~---q 638 (756) . .+..-...+..|+++... |+. ..++-....-|.. .. ... .+++++++++..+.+ + T Consensus 548 ~-~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q 626 (714) T protein:vir:81 548 Q-TPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQ 626 (714) T ss_pred C-chHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHH Confidence 1 112222233333333221 221 1222122221211 10 000 111222222222111 1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHhhccCCchhh-hccCCCC Q lcl|NC_019423. 639 SKIALNNAKAKEAASSGDLKDLDYLEQESGTKHAR---DMEKQKAQSQGNQNLQITKALTTPTKEGETTPNI-SAAVGYN 714 (756) Q Consensus 639 a~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~---~~~~~~~q~~~~~~~~~~~a~~~~~~~~~~~~~~-~~a~~~~ 714 (756) ++.+..+++++..+.++++++++++++..+.+.+. ..+..+. .++..+.++.+.+...+.... ...+ +.++ -. T Consensus 627 ~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~-~~~~~~a~~a~~~~~~~~~~~-~~~~~~~q~-~q 703 (714) T protein:vir:81 627 AELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRY-VDALNQAHTAEIITGVQNMEQ-EQDVLQQQM-LY 703 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhHhhhhh-hhHHHHHHH-HH Confidence 22222222222222333333332222111111111 1111111 111111122111111111000 0000 0110 00 Q ss_pred CCCcccCchhcCCCCCCCCcccc Q lcl|NC_019423. 
715 TLTNGNSPQERDLAAQQDPAYSL 737 (756) Q Consensus 715 ~~~~~~~~~~~~~~~~~~~~~~~ 737 (756) +..+. -.|-.| T Consensus 704 ~~~~~------------~~~~~~ 714 (714) T protein:vir:81 704 TLQQR------------MNEMSL 714 (714) T ss_pred HHHHH------------HHhcCC Confidence 00010 001111 No 132 >protein:vir:2764 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612881;genbank:gi:20065798;genbank:GeneID:935623 Probab=97.90 E-value=1.2e-05 Score=47.61 Aligned_cols=607 Identities=12% Similarity=0.056 Sum_probs=171.8 Q ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCCCCCCcccCHHHHHHHHHHHHHHHH-hhcCCCCEE Q lcl|NC_019423. 22 WKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAKPPKIKGRSQVQPRLVRRQAEWRYAPLSE-PFLSSSKLF 100 (756) Q Consensus 22 ~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~grS~~v~~~v~~~~e~~~~~L~~-~f~~~~~~~ 100 (756) |.||+-.-.=+.+=......| ++.+. ++..++..+-+|+--..-. -|..|+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~-------~~~~~------------------~~~~~~~~~~~~R~~a~~d~~fy~G~--- 52 (714) T protein:vir:27 1 MKNETNTMATKNDNGATPRFS-------QRQLQ------------------ALCSDIDSQPKWRDAANKACAYYDGD--- 52 (714) T ss_pred CCcccccccCCCCcchhHHHH-------HHHHH------------------HHHHHHHhhHHHHHHHHHHHHhhcCC--- Confidence 333221100000000000001 11111 2222333334444332222 233444 Q ss_pred EEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeeeeeeeeeecCC--CCCHHHH Q lcl|NC_019423. 101 KLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIKTETPVFQLYP--IENQEQA 178 (756) Q Consensus 101 ~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~--~~~~~~~ 178 (756) .|.++..+. + ...+.-++.+|.|+-.+..-.|.-+ ..+. 
...+...+ ..+.+.+ T Consensus 53 -----Qw~~~~~~~--------l--~~~g~p~~~~N~i~~~v~~v~g~~~-----~nr~----~~~v~p~~~~~~~~~~A 108 (714) T protein:vir:27 53 -----QLPPEVLQV--------L--KDRGQPMTIHNLIAPTVDGVLGMEA-----KTRT----DLVVMSDEPDDETEKLA 108 (714) T ss_pred -----CCCHHHHHH--------H--HhcCCCcEEeccHHHHHHHHHhHHH-----hCCc----ceEEecCCCCchhHHHH Confidence 333322222 1 2233445556677666666566222 2222 12233322 2333567 Q ss_pred HHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEEEechhheEeCCCCcCccc Q lcl|NC_019423. 179 DVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVEMLNPNNVVIDPSCNGDLD 258 (756) Q Consensus 179 ~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie~V~p~~~~~Dp~a~~d~~ 258 (756) +.+...+..+...- ........++.+.+.+|.|+. ++ . ++++.|--+..++. ++ T Consensus 109 e~l~~~~~~~~~~~-----~~~~~~s~af~~~~~~G~G~~-------~~--~-----------~~~d~~~~~i~i~~-v~ 162 (714) T protein:vir:27 109 EAINAEFADACRLG-----NMNKARSDAYAEQIKAGLSWV-------EV--R-----------RNSDPFGPEFKVST-VS 162 (714) T ss_pred HHHHHHHHHHHHhh-----chhHHHHHHHHHhhhcCcceE-------Ee--c-----------cccCCCCCCeEEEe-cc Confidence 77776666544421 222334456666666665431 11 0 11111111111111 00 Q ss_pred cCceEEE--EeecCHHHHHhhcc----chhhhcccCchhhhhhhchhhhccccccccccc-cccceEEEEEEEEEeeccC Q lcl|NC_019423. 259 KALYAVI--SFETCKADLMKNKD----RYHNLDKIDWESSSPITDPDHESKTPSDFQFKD-ALRKKVVAYEYWGFYDIND 331 (756) Q Consensus 259 da~~v~~--~~~~t~~el~~~~~----~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d-~s~~~V~v~E~w~k~d~~~ 331 (756) --.+++- .+..++++.....- ..+.+...-.+..+.............+....+ .....+.-++.+..++... 
T Consensus 163 p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~ 242 (714) T protein:vir:27 163 RNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQ 242 (714) T ss_pred hhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhccccccc Confidence 0011111 22344555543321 111111100000000000000000000000000 0000011111111111111 Q ss_pred ------Cc--e--eEEE-EE-----EE---ECCEEEEeccccc---------------CCCccceEEee--------eee Q lcl|NC_019423. 332 ------DG--S--LEPI-VA-----TW---IGSTLIRMENNPF---------------PDGKLPLVVVP--------YMP 369 (756) Q Consensus 332 ------~g--~--~~~~-~~-----~~---~g~~~L~~~~~P~---------------~~~~~Pfv~~~--------~~~ 369 (756) +. + .+++ +. ++ .|+.+...+.+|- .+.++....|. ..| T Consensus 243 ~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p 322 (714) T protein:vir:27 243 NEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPCS 322 (714) T ss_pred cccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCcccccCCCC Confidence 00 0 0111 10 00 1222333322220 01111111111 112 Q ss_pred ecCcccC----CchHHHhHHHHHHHHHHHHHHHHHHHhhcCCce----Ee-eccccCccchhhh---hcccc-ccc-ccc Q lcl|NC_019423. 370 RKRELFG----EADAELLGDNQAILGATMRGMIDLLGRSANGQR----GY-PKGMLDTLNRRRY---DDGQD-YEY-NPM 435 (756) Q Consensus 370 ~~~~~~G----~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~----~~-~~gav~~~~~~~~---~~~~~-~~~-~~~ 435 (756) .+|..|+ +|+.+ +.....--++|.++|+-...|+..- ++ .++.+-....... ..... ... 
+.+ T Consensus 323 ~p~~~fp~vp~~g~~~---~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l~~~~~~~~~~a~~~~d~~~~e~~arp~~vi 399 (714) T protein:vir:27 323 APQGMFPLVPFWGYRK---DKTGEPYGLISRAIPAQDEVNFRRIKLTWLLQAKRVIMDEDATQLSDNDLMEQIERPDGII 399 (714) T ss_pred CCCCceeEEEEeeeee---eccCceeehhhhchhHHHHHHHHHHHHHHhhcCCceeeecCcccccHHHHHHhccCCCCce Confidence 2232211 11211 1111222355566665443332111 11 1111100000000 00000 001 111 Q ss_pred ccccccc-----cccccCCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHH-HHHH Q lcl|NC_019423. 436 QGNPSQS-----IMEHKFPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAI-LRRL 509 (756) Q Consensus 436 ~~~~~~~-----i~~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~-~~n~ 509 (756) ..++... ..++.+.+.++-....++++.... +.-...+|.++.++|...+++|++..++.+..+.. +-.| T Consensus 400 ~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~----~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~l~~~ 475 (714) T protein:vir:27 400 KLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESE----KLIQDTMGVYSAFLGQDSGATSGVAISNLVEQGATTLAEI 475 (714) T ss_pred eecccccccCCCCccccccCCCCccHHHHHHHHHHH----HHHHHhhCCChHHcCCCccchhHHHHHHHHHHHHHHHHHH Confidence 2222111 111222223334444444444444 44455679988888888888999887777765442 3444 Q ss_pred HHHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHh-HhcCcce-EEEec--ccccHHHHHHHHHHHHH-H---Hhhc Q lcl|NC_019423. 510 AKGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKRE-DLKGNFD-IEVDI--NTAEIDNQKSQDLGFMV-Q---TLGN 581 (756) Q Consensus 510 ~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d-~~~~~~D-v~V~~--g~a~~~~~~~q~l~~ll-q---~~~~ 581 (756) -+.++...+.+.+++..+...- .+.+..+.|... +-.+... ++++. +.+...+. +.-.- . 
..+| T Consensus 476 ~Dnl~~~~~~~g~~lL~li~~~----~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nD----i~~~~~Dv~i~~~p 547 (714) T protein:vir:27 476 NDNYQFACQQVGRLLLAYLLDD----LKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTND----ISRLNTHIALAPVQ 547 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHH----cCCCcEEEEeccCCCcCcceEEeeccccCcceeccc----ceeeeEEEEEeecc Confidence 5566767777777776553321 112233444321 1111001 22221 11111000 00000 0 0011 Q ss_pred cCCHhHHHHHHHHHHhhcCC--hhH----HHHhhhccCCCCh--hh----------hhH--HHHHHHHHHHHHHHH---H Q lcl|NC_019423. 582 TVDQSITLSLVAKIAELKRM--PDL----AHELRTWQPQPDP--ME----------EQL--KQLAIQKAQLENEEL---Q 638 (756) Q Consensus 582 ~~~~~~~~~~l~~l~e~~~~--~~~----~~~l~~~~~q~~p--~~----------~~~--~q~~~~~aq~e~~~~---q 638 (756) . .+..-...+..|+++... |+. ..++-....-|.. .. ... .+++++++++..+.+ + T Consensus 548 ~-~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q 626 (714) T protein:vir:27 548 Q-TPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQ 626 (714) T ss_pred C-chHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHH Confidence 1 112222233333333221 221 1222122221211 10 000 111222222222111 1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHhhccCCchhh-hccCCCC Q lcl|NC_019423. 639 SKIALNNAKAKEAASSGDLKDLDYLEQESGTKHAR---DMEKQKAQSQGNQNLQITKALTTPTKEGETTPNI-SAAVGYN 714 (756) Q Consensus 639 a~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~---~~~~~~~q~~~~~~~~~~~a~~~~~~~~~~~~~~-~~a~~~~ 714 (756) ++.+..+++++..+.++++++++++++..+.+.+. ..+..+. .++..+.++.+.+...+.... ...+ +.++ -. T Consensus 627 ~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~-~~~~~~a~~a~~~~~~~~~~~-~~~~~~~q~-~q 703 (714) T protein:vir:27 627 AELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRY-VDALNQAHTAEIITGVQNMEQ-EQDVLQQQM-LY 703 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhHhhhhh-hhHHHHHHH-HH Confidence 22222222222222333333332222111111111 1111111 111111122111111111000 0000 0110 00 Q ss_pred CCCcccCchhcCCCCCCCCcccc Q lcl|NC_019423. 
715 TLTNGNSPQERDLAAQQDPAYSL 737 (756) Q Consensus 715 ~~~~~~~~~~~~~~~~~~~~~~~ 737 (756) +..+. -.|-.| T Consensus 704 ~~~~~------------~~~~~~ 714 (714) T protein:vir:27 704 TLQQR------------MNEMSL 714 (714) T ss_pred HHHHH------------HHhcCC Confidence 00010 001111 No 133 >protein:vir:9950 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859080;genbank:gi:32170835;genbank:GeneID:2653184 Probab=97.90 E-value=1.2e-05 Score=47.61 Aligned_cols=607 Identities=12% Similarity=0.056 Sum_probs=171.8 Q ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCCCCCCcccCHHHHHHHHHHHHHHHH-hhcCCCCEE Q lcl|NC_019423. 22 WKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAKPPKIKGRSQVQPRLVRRQAEWRYAPLSE-PFLSSSKLF 100 (756) Q Consensus 22 ~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~grS~~v~~~v~~~~e~~~~~L~~-~f~~~~~~~ 100 (756) |.||+-.-.=+.+=......| ++.+. ++..++..+-+|+--..-. -|..|+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~-------~~~~~------------------~~~~~~~~~~~~R~~a~~d~~fy~G~--- 52 (714) T protein:vir:99 1 MKNETNTMATKNDNGATPRFS-------QRQLQ------------------ALCSDIDSQPKWRDAANKACAYYDGD--- 52 (714) T ss_pred CCcccccccCCCCcchhHHHH-------HHHHH------------------HHHHHHHhhHHHHHHHHHHHHhhcCC--- Confidence 333221100000000000001 11111 2222333334444332222 233444 Q ss_pred EEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeeeeeeeeeecCC--CCCHHHH Q lcl|NC_019423. 101 KLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIKTETPVFQLYP--IENQEQA 178 (756) Q Consensus 101 ~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~--~~~~~~~ 178 (756) .|.++..+. + ...+.-++.+|.|+-.+..-.|.-+ ..+. 
...+...+ ..+.+.+ T Consensus 53 -----Qw~~~~~~~--------l--~~~g~p~~~~N~i~~~v~~v~g~~~-----~nr~----~~~v~p~~~~~~~~~~A 108 (714) T protein:vir:99 53 -----QLPPEVLQV--------L--KDRGQPMTIHNLIAPTVDGVLGMEA-----KTRT----DLVVMSDEPDDETEKLA 108 (714) T ss_pred -----CCCHHHHHH--------H--HhcCCCcEEeccHHHHHHHHHhHHH-----hCCc----ceEEecCCCCchhHHHH Confidence 333322222 1 2233445556677666666566222 2222 12233322 2333567 Q ss_pred HHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEEEechhheEeCCCCcCccc Q lcl|NC_019423. 179 DVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVEMLNPNNVVIDPSCNGDLD 258 (756) Q Consensus 179 ~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie~V~p~~~~~Dp~a~~d~~ 258 (756) +.+...+..+...- ........++.+.+.+|.|+. ++ . ++++.|--+..++. ++ T Consensus 109 e~l~~~~~~~~~~~-----~~~~~~s~af~~~~~~G~G~~-------~~--~-----------~~~d~~~~~i~i~~-v~ 162 (714) T protein:vir:99 109 EAINAEFADACRLG-----NMNKARSDAYAEQIKAGLSWV-------EV--R-----------RNSDPFGPEFKVST-VS 162 (714) T ss_pred HHHHHHHHHHHHhh-----chhHHHHHHHHHhhhcCcceE-------Ee--c-----------cccCCCCCCeEEEe-cc Confidence 77776666544421 222334456666666665431 11 0 11111111111111 00 Q ss_pred cCceEEE--EeecCHHHHHhhcc----chhhhcccCchhhhhhhchhhhccccccccccc-cccceEEEEEEEEEeeccC Q lcl|NC_019423. 259 KALYAVI--SFETCKADLMKNKD----RYHNLDKIDWESSSPITDPDHESKTPSDFQFKD-ALRKKVVAYEYWGFYDIND 331 (756) Q Consensus 259 da~~v~~--~~~~t~~el~~~~~----~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d-~s~~~V~v~E~w~k~d~~~ 331 (756) --.+++- .+..++++.....- ..+.+...-.+..+.............+....+ .....+.-++.+..++... 
T Consensus 163 p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~ 242 (714) T protein:vir:99 163 RNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQ 242 (714) T ss_pred hhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhccccccc Confidence 0011111 22344555543321 111111100000000000000000000000000 0000011111111111111 Q ss_pred ------Cc--e--eEEE-EE-----EE---ECCEEEEeccccc---------------CCCccceEEee--------eee Q lcl|NC_019423. 332 ------DG--S--LEPI-VA-----TW---IGSTLIRMENNPF---------------PDGKLPLVVVP--------YMP 369 (756) Q Consensus 332 ------~g--~--~~~~-~~-----~~---~g~~~L~~~~~P~---------------~~~~~Pfv~~~--------~~~ 369 (756) +. + .+++ +. ++ .|+.+...+.+|- .+.++....|. ..| T Consensus 243 ~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p 322 (714) T protein:vir:99 243 NEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPCS 322 (714) T ss_pred cccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCcccccCCCC Confidence 00 0 0111 10 00 1222333322220 01111111111 112 Q ss_pred ecCcccC----CchHHHhHHHHHHHHHHHHHHHHHHHhhcCCce----Ee-eccccCccchhhh---hcccc-ccc-ccc Q lcl|NC_019423. 370 RKRELFG----EADAELLGDNQAILGATMRGMIDLLGRSANGQR----GY-PKGMLDTLNRRRY---DDGQD-YEY-NPM 435 (756) Q Consensus 370 ~~~~~~G----~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~----~~-~~gav~~~~~~~~---~~~~~-~~~-~~~ 435 (756) .+|..|+ +|+.+ +.....--++|.++|+-...|+..- ++ .++.+-....... ..... ... 
+.+ T Consensus 323 ~p~~~fp~vp~~g~~~---~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l~~~~~~~~~~a~~~~d~~~~e~~arp~~vi 399 (714) T protein:vir:99 323 APQGMFPLVPFWGYRK---DKTGEPYGLISRAIPAQDEVNFRRIKLTWLLQAKRVIMDEDATQLSDNDLMEQIERPDGII 399 (714) T ss_pred CCCCceeEEEEeeeee---eccCceeehhhhchhHHHHHHHHHHHHHHhhcCCceeeecCcccccHHHHHHhccCCCCce Confidence 2232211 11211 1111222355566665443332111 11 1111100000000 00000 001 111 Q ss_pred ccccccc-----cccccCCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHH-HHHH Q lcl|NC_019423. 436 QGNPSQS-----IMEHKFPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAI-LRRL 509 (756) Q Consensus 436 ~~~~~~~-----i~~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~-~~n~ 509 (756) ..++... ..++.+.+.++-....++++.... +.-...+|.++.++|...+++|++..++.+..+.. +-.| T Consensus 400 ~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~----~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~l~~~ 475 (714) T protein:vir:99 400 KLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESE----KLIQDTMGVYSAFLGQDSGATSGVAISNLVEQGATTLAEI 475 (714) T ss_pred eecccccccCCCCccccccCCCCccHHHHHHHHHHH----HHHHHhhCCChHHcCCCccchhHHHHHHHHHHHHHHHHHH Confidence 2222111 111222223334444444444444 44455679988888888888999887777765442 3444 Q ss_pred HHHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHh-HhcCcce-EEEec--ccccHHHHHHHHHHHHH-H---Hhhc Q lcl|NC_019423. 510 AKGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKRE-DLKGNFD-IEVDI--NTAEIDNQKSQDLGFMV-Q---TLGN 581 (756) Q Consensus 510 ~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d-~~~~~~D-v~V~~--g~a~~~~~~~q~l~~ll-q---~~~~ 581 (756) -+.++...+.+.+++..+...- .+.+..+.|... +-.+... ++++. +.+...+. +.-.- . 
..+| T Consensus 476 ~Dnl~~~~~~~g~~lL~li~~~----~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nD----i~~~~~Dv~i~~~p 547 (714) T protein:vir:99 476 NDNYQFACQQVGRLLLAYLLDD----LKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTND----ISRLNTHIALAPVQ 547 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHH----cCCCcEEEEeccCCCcCcceEEeeccccCcceeccc----ceeeeEEEEEeecc Confidence 5566767777777776553321 112233444321 1111001 22221 11111000 00000 0 0011 Q ss_pred cCCHhHHHHHHHHHHhhcCC--hhH----HHHhhhccCCCCh--hh----------hhH--HHHHHHHHHHHHHHH---H Q lcl|NC_019423. 582 TVDQSITLSLVAKIAELKRM--PDL----AHELRTWQPQPDP--ME----------EQL--KQLAIQKAQLENEEL---Q 638 (756) Q Consensus 582 ~~~~~~~~~~l~~l~e~~~~--~~~----~~~l~~~~~q~~p--~~----------~~~--~q~~~~~aq~e~~~~---q 638 (756) . .+..-...+..|+++... |+. ..++-....-|.. .. ... .+++++++++..+.+ + T Consensus 548 ~-~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q 626 (714) T protein:vir:99 548 Q-TPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQ 626 (714) T ss_pred C-chHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHH Confidence 1 112222233333333221 221 1222122221211 10 000 111222222222111 1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHhhccCCchhh-hccCCCC Q lcl|NC_019423. 639 SKIALNNAKAKEAASSGDLKDLDYLEQESGTKHAR---DMEKQKAQSQGNQNLQITKALTTPTKEGETTPNI-SAAVGYN 714 (756) Q Consensus 639 a~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~---~~~~~~~q~~~~~~~~~~~a~~~~~~~~~~~~~~-~~a~~~~ 714 (756) ++.+..+++++..+.++++++++++++..+.+.+. ..+..+. .++..+.++.+.+...+.... ...+ +.++ -. T Consensus 627 ~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~-~~~~~~a~~a~~~~~~~~~~~-~~~~~~~q~-~q 703 (714) T protein:vir:99 627 AELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRY-VDALNQAHTAEIITGVQNMEQ-EQDVLQQQM-LY 703 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhHhhhhh-hhHHHHHHH-HH Confidence 22222222222222333333332222111111111 1111111 111111122111111111000 0000 0110 00 Q ss_pred CCCcccCchhcCCCCCCCCcccc Q lcl|NC_019423. 
715 TLTNGNSPQERDLAAQQDPAYSL 737 (756) Q Consensus 715 ~~~~~~~~~~~~~~~~~~~~~~~ 737 (756) +..+. -.|-.| T Consensus 704 ~~~~~------------~~~~~~ 714 (714) T protein:vir:99 704 TLQQR------------MNEMSL 714 (714) T ss_pred HHHHH------------HHhcCC Confidence 00010 001111 No 134 >protein:vir:10117 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859247;genbank:gi:32171003;genbank:GeneID:2653343 Probab=97.90 E-value=1.2e-05 Score=47.61 Aligned_cols=607 Identities=12% Similarity=0.056 Sum_probs=171.8 Q ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCCCCCCcccCHHHHHHHHHHHHHHHH-hhcCCCCEE Q lcl|NC_019423. 22 WKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAKPPKIKGRSQVQPRLVRRQAEWRYAPLSE-PFLSSSKLF 100 (756) Q Consensus 22 ~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~grS~~v~~~v~~~~e~~~~~L~~-~f~~~~~~~ 100 (756) |.||+-.-.=+.+=......| ++.+. ++..++..+-+|+--..-. -|..|+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~-------~~~~~------------------~~~~~~~~~~~~R~~a~~d~~fy~G~--- 52 (714) T protein:vir:10 1 MKNETNTMATKNDNGATPRFS-------QRQLQ------------------ALCSDIDSQPKWRDAANKACAYYDGD--- 52 (714) T ss_pred CCcccccccCCCCcchhHHHH-------HHHHH------------------HHHHHHHhhHHHHHHHHHHHHhhcCC--- Confidence 333221100000000000001 11111 2222333334444332222 233444 Q ss_pred EEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeeeeeeeeeecCC--CCCHHHH Q lcl|NC_019423. 101 KLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIKTETPVFQLYP--IENQEQA 178 (756) Q Consensus 101 ~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~--~~~~~~~ 178 (756) .|.++..+. + ...+.-++.+|.|+-.+..-.|.-+ ..+. 
...+...+ ..+.+.+ T Consensus 53 -----Qw~~~~~~~--------l--~~~g~p~~~~N~i~~~v~~v~g~~~-----~nr~----~~~v~p~~~~~~~~~~A 108 (714) T protein:vir:10 53 -----QLPPEVLQV--------L--KDRGQPMTIHNLIAPTVDGVLGMEA-----KTRT----DLVVMSDEPDDETEKLA 108 (714) T ss_pred -----CCCHHHHHH--------H--HhcCCCcEEeccHHHHHHHHHhHHH-----hCCc----ceEEecCCCCchhHHHH Confidence 333322222 1 2233445556677666666566222 2222 12233322 2333567 Q ss_pred HHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEEEechhheEeCCCCcCccc Q lcl|NC_019423. 179 DVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVEMLNPNNVVIDPSCNGDLD 258 (756) Q Consensus 179 ~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie~V~p~~~~~Dp~a~~d~~ 258 (756) +.+...+..+...- ........++.+.+.+|.|+. ++ . ++++.|--+..++. ++ T Consensus 109 e~l~~~~~~~~~~~-----~~~~~~s~af~~~~~~G~G~~-------~~--~-----------~~~d~~~~~i~i~~-v~ 162 (714) T protein:vir:10 109 EAINAEFADACRLG-----NMNKARSDAYAEQIKAGLSWV-------EV--R-----------RNSDPFGPEFKVST-VS 162 (714) T ss_pred HHHHHHHHHHHHhh-----chhHHHHHHHHHhhhcCcceE-------Ee--c-----------cccCCCCCCeEEEe-cc Confidence 77776666544421 222334456666666665431 11 0 11111111111111 00 Q ss_pred cCceEEE--EeecCHHHHHhhcc----chhhhcccCchhhhhhhchhhhccccccccccc-cccceEEEEEEEEEeeccC Q lcl|NC_019423. 259 KALYAVI--SFETCKADLMKNKD----RYHNLDKIDWESSSPITDPDHESKTPSDFQFKD-ALRKKVVAYEYWGFYDIND 331 (756) Q Consensus 259 da~~v~~--~~~~t~~el~~~~~----~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d-~s~~~V~v~E~w~k~d~~~ 331 (756) --.+++- .+..++++.....- ..+.+...-.+..+.............+....+ .....+.-++.+..++... 
T Consensus 163 p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~ 242 (714) T protein:vir:10 163 RNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQ 242 (714) T ss_pred hhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhccccccc Confidence 0011111 22344555543321 111111100000000000000000000000000 0000011111111111111 Q ss_pred ------Cc--e--eEEE-EE-----EE---ECCEEEEeccccc---------------CCCccceEEee--------eee Q lcl|NC_019423. 332 ------DG--S--LEPI-VA-----TW---IGSTLIRMENNPF---------------PDGKLPLVVVP--------YMP 369 (756) Q Consensus 332 ------~g--~--~~~~-~~-----~~---~g~~~L~~~~~P~---------------~~~~~Pfv~~~--------~~~ 369 (756) +. + .+++ +. ++ .|+.+...+.+|- .+.++....|. ..| T Consensus 243 ~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p 322 (714) T protein:vir:10 243 NEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPCS 322 (714) T ss_pred cccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCcccccCCCC Confidence 00 0 0111 10 00 1222333322220 01111111111 112 Q ss_pred ecCcccC----CchHHHhHHHHHHHHHHHHHHHHHHHhhcCCce----Ee-eccccCccchhhh---hcccc-ccc-ccc Q lcl|NC_019423. 370 RKRELFG----EADAELLGDNQAILGATMRGMIDLLGRSANGQR----GY-PKGMLDTLNRRRY---DDGQD-YEY-NPM 435 (756) Q Consensus 370 ~~~~~~G----~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~----~~-~~gav~~~~~~~~---~~~~~-~~~-~~~ 435 (756) .+|..|+ +|+.+ +.....--++|.++|+-...|+..- ++ .++.+-....... ..... ... 
+.+ T Consensus 323 ~p~~~fp~vp~~g~~~---~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l~~~~~~~~~~a~~~~d~~~~e~~arp~~vi 399 (714) T protein:vir:10 323 APQGMFPLVPFWGYRK---DKTGEPYGLISRAIPAQDEVNFRRIKLTWLLQAKRVIMDEDATQLSDNDLMEQIERPDGII 399 (714) T ss_pred CCCCceeEEEEeeeee---eccCceeehhhhchhHHHHHHHHHHHHHHhhcCCceeeecCcccccHHHHHHhccCCCCce Confidence 2232211 11211 1111222355566665443332111 11 1111100000000 00000 001 111 Q ss_pred ccccccc-----cccccCCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHH-HHHH Q lcl|NC_019423. 436 QGNPSQS-----IMEHKFPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAI-LRRL 509 (756) Q Consensus 436 ~~~~~~~-----i~~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~-~~n~ 509 (756) ..++... ..++.+.+.++-....++++.... +.-...+|.++.++|...+++|++..++.+..+.. +-.| T Consensus 400 ~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~----~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~l~~~ 475 (714) T protein:vir:10 400 KLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESE----KLIQDTMGVYSAFLGQDSGATSGVAISNLVEQGATTLAEI 475 (714) T ss_pred eecccccccCCCCccccccCCCCccHHHHHHHHHHH----HHHHHhhCCChHHcCCCccchhHHHHHHHHHHHHHHHHHH Confidence 2222111 111222223334444444444444 44455679988888888888999887777765442 3444 Q ss_pred HHHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHh-HhcCcce-EEEec--ccccHHHHHHHHHHHHH-H---Hhhc Q lcl|NC_019423. 510 AKGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKRE-DLKGNFD-IEVDI--NTAEIDNQKSQDLGFMV-Q---TLGN 581 (756) Q Consensus 510 ~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d-~~~~~~D-v~V~~--g~a~~~~~~~q~l~~ll-q---~~~~ 581 (756) -+.++...+.+.+++..+...- .+.+..+.|... +-.+... ++++. +.+...+. +.-.- . 
..+| T Consensus 476 ~Dnl~~~~~~~g~~lL~li~~~----~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nD----i~~~~~Dv~i~~~p 547 (714) T protein:vir:10 476 NDNYQFACQQVGRLLLAYLLDD----LKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTND----ISRLNTHIALAPVQ 547 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHH----cCCCcEEEEeccCCCcCcceEEeeccccCcceeccc----ceeeeEEEEEeecc Confidence 5566767777777776553321 112233444321 1111001 22221 11111000 00000 0 0011 Q ss_pred cCCHhHHHHHHHHHHhhcCC--hhH----HHHhhhccCCCCh--hh----------hhH--HHHHHHHHHHHHHHH---H Q lcl|NC_019423. 582 TVDQSITLSLVAKIAELKRM--PDL----AHELRTWQPQPDP--ME----------EQL--KQLAIQKAQLENEEL---Q 638 (756) Q Consensus 582 ~~~~~~~~~~l~~l~e~~~~--~~~----~~~l~~~~~q~~p--~~----------~~~--~q~~~~~aq~e~~~~---q 638 (756) . .+..-...+..|+++... |+. ..++-....-|.. .. ... .+++++++++..+.+ + T Consensus 548 ~-~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q 626 (714) T protein:vir:10 548 Q-TPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQ 626 (714) T ss_pred C-chHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHH Confidence 1 112222233333333221 221 1222122221211 10 000 111222222222111 1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHhhccCCchhh-hccCCCC Q lcl|NC_019423. 639 SKIALNNAKAKEAASSGDLKDLDYLEQESGTKHAR---DMEKQKAQSQGNQNLQITKALTTPTKEGETTPNI-SAAVGYN 714 (756) Q Consensus 639 a~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~---~~~~~~~q~~~~~~~~~~~a~~~~~~~~~~~~~~-~~a~~~~ 714 (756) ++.+..+++++..+.++++++++++++..+.+.+. ..+..+. .++..+.++.+.+...+.... ...+ +.++ -. T Consensus 627 ~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~-~~~~~~a~~a~~~~~~~~~~~-~~~~~~~q~-~q 703 (714) T protein:vir:10 627 AELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRY-VDALNQAHTAEIITGVQNMEQ-EQDVLQQQM-LY 703 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhHhhhhh-hhHHHHHHH-HH Confidence 22222222222222333333332222111111111 1111111 111111122111111111000 0000 0110 00 Q ss_pred CCCcccCchhcCCCCCCCCcccc Q lcl|NC_019423. 
715 TLTNGNSPQERDLAAQQDPAYSL 737 (756) Q Consensus 715 ~~~~~~~~~~~~~~~~~~~~~~~ 737 (756) +..+. -.|-.| T Consensus 704 ~~~~~------------~~~~~~ 714 (714) T protein:vir:10 704 TLQQR------------MNEMSL 714 (714) T ss_pred HHHHH------------HHhcCC Confidence 00010 001111 No 135 >protein:vir:95821 Length: 763 # NCBI annotation: 94 kDa protein # Family: family:all:1548 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950537;genbank:gi:119952228;genbank:GeneID:5075648 Probab=97.31 E-value=9.8e-05 Score=42.55 Aligned_cols=666 Identities=9% Similarity=-0.008 Sum_probs=200.9 Q ss_pred cccCCCCCCCccccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCCCCCCcccCHHHHHHH Q lcl|NC_019423. 3 HQDTFKPLPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAKPPKIKGRSQVQPRLVRRQA 82 (756) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~grS~~v~~~v~~~~ 82 (756) --.+.-..--+...++.++.++=.-...|.+..+++..+.+..-+.+.+...+|.-... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------------- 59 (763) T protein:vir:95 1 MEQNTDSMVPLPDPSQATKLTSWKNELSLQALKADLDAAKPSHTAMMIKVKEWNDLMRI--------------------- 59 (763) T ss_pred CCcCccCcCCCccccchhcCCCCCChHHHHHHHHHHHhhhcchhHHHHHHHHHHHhhhc--------------------- Confidence 11111111111122233333332333333333334333333333333344333332111 Q ss_pred HHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeeee Q lcl|NC_019423. 83 EWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIKT 162 (756) Q Consensus 83 e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~ 162 (756) .|-.+ -|...| .. .++.-.+.. 
.++-+...+-+.+..|.=|++ T Consensus 60 -----------~~~~~----~~~~~g---rs----~vv~~~v~~---~ve~~~~~l~~~f~~~~~~~~------------ 102 (763) T protein:vir:95 60 -----------EGKAK----PPKVKG---RS----QVQPKLVRR---QAEWRYSALTEPFLGSNKLFK------------ 102 (763) T ss_pred -----------cccCc----ccccCC---Cc----cccCHHHHH---HHHHHHHHHHHhhcCCCcEEE------------ Confidence 11110 111111 00 010000000 011122223333333444443 Q ss_pred eeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEe-eee-ecCceeEE Q lcl|NC_019423. 163 ETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEV-EKA-LVNRPTVE 240 (756) Q Consensus 163 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~-~~~-~~g~~~ie 240 (756) +....-.|.+.++.....+..+..... ....-+...+.+...+|.|+.+++|-.+--+. +.. ......+. T Consensus 103 ----~~P~~~~D~~~A~q~t~~~n~~~~~~~----~~~~~~~~~~~~~l~~~~gv~k~~W~~~~~~~~~~~~~~~~~~~~ 174 (763) T protein:vir:95 103 ----VTPVTWEDVQGARQNELVLNYQFRTKL----NRVSFIDNYVRSVVDDGTGIVRVGWNREIRKEKQEVPVFSLFPIQ 174 (763) T ss_pred ----EecCCcchHHHHHHHHHHHHHHHhhcC----chhhHHHHHHHHHhhcCcceEEEeeeeeeeeeeeeehhhhhcccc Confidence 233444566666666666665443321 12233456677788899999887543211111 110 00000111 Q ss_pred EechhheEe-------------------CC--CCcCccccC-----------ceEEEEee--------cCHHHHHhhccc Q lcl|NC_019423. 241 MLNPNNVVI-------------------DP--SCNGDLDKA-----------LYAVISFE--------TCKADLMKNKDR 280 (756) Q Consensus 241 ~V~p~~~~~-------------------Dp--~a~~d~~da-----------~~v~~~~~--------~t~~el~~~~~~ 280 (756) .....+++. ++ +.....++. ++..++.. ++..++.-.. . T Consensus 175 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~p~ie~V~p~d~~iDp-~ 253 (763) T protein:vir:95 175 TQEQADALQQALQLRTDNPRGYEENVDEAIKESVRFFDETGQATYAVQTGTTTTEVEVPLANHPTVEMLNPENIIIDP-S 253 (763) T ss_pred chhHHHHHHHHHHhhhhhhccccccccchhhhhhhhccccCcceeeecccceeEEEEEEecCceEEEeecHHHheecC-C Confidence 111000000 00 000001111 11111111 2222221100 0 Q ss_pred hh-hhcccCc-------hhhhhhhchhhhccccccccccccccceEEEEEEEEEe---eccCCcee-----EEEEEE-EE Q lcl|NC_019423. 
281 YH-NLDKIDW-------ESSSPITDPDHESKTPSDFQFKDALRKKVVAYEYWGFY---DINDDGSL-----EPIVAT-WI 343 (756) Q Consensus 281 ~~-~l~~~~~-------~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~E~w~k~---d~~~~g~~-----~~~~~~-~~ 343 (756) .. .++.... ...+..... ..+.......+...+..... .+.|.++ +..+.... +++.-+ .- T Consensus 254 a~sD~~Da~~~~~~~~~t~~dL~~~~-~~y~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~d~~~~~V~v~E~y~~~d~~ 331 (763) T protein:vir:95 254 CQGDINKAMFAIVSFETCKADLLKEK-DRYHNLNKIDWQSSAPVNEP-DHATTTPQEFQISDPMRKRVVAYEYWGFWDIE 331 (763) T ss_pred CCCchhhCceEeeEEeccHHHHHhcc-CCccccchhcchhccccccc-cccccchhhccCCCcccceEEEEEeeeeeccC Confidence 00 0000000 000000000 00000000000000000000 0011111 11111100 111110 11 Q ss_pred CCE------EEEecccccCCCccceEE--eeeeeecCcccCCchH-HHhHHHHHHHHHHHHHHHHHHHhhcCCceEeecc Q lcl|NC_019423. 344 GST------LIRMENNPFPDGKLPLVV--VPYMPRKRELFGEADA-ELLGDNQAILGATMRGMIDLLGRSANGQRGYPKG 414 (756) Q Consensus 344 g~~------~L~~~~~P~~~~~~Pfv~--~~~~~~~~~~~G~g~v-~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~g 414 (756) |+. ++..+..-...+..||-+ +++...+.-....++. ..+.+.=.-+-...+.+...+.-..+ ....+ T Consensus 332 gdg~~~~~~v~~~g~~iL~~~~~p~~~~~~PFv~~~~~p~~~~~~G~gi~~~~~d~Qr~~N~~~~~~~d~l~---~~~~~ 408 (763) T protein:vir:95 332 GNGVLEPIVATWIGSTLIRLEKNPYPDGKLPFVLIPYMPVKRDMYGEPDAELLGDNQAVLGAVMRGMIDLLG---RSANG 408 (763) T ss_pred CcceeEEEEEEEEcCeeeecccccccCCCcCEEEecceeecCcccCCchHHHhhHHHHHHHHHHHHHHHHHH---hhcCC Confidence 221 222222223334445532 2222222111111222 22333333333333322222211111 11111 Q ss_pred ccC-ccchhh-hhcccccccccccccccccccccc-CCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHH Q lcl|NC_019423. 415 MLD-TLNRRR-YDDGQDYEYNPMQGNPSQSIMEHK-FPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGI 491 (756) Q Consensus 415 av~-~~~~~~-~~~~~~~~~~~~~~~~~~~i~~~~-~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i 491 (756) .+- ...... .+.....+..++.++++..+.... .-..+. +..-...+.+..+. ......|+.....|-...+. 
T Consensus 409 ~~~v~~gav~~~d~~~~~pg~v~~v~~g~~~~~~~~~~~~p~-~~~~~~~~l~~~~~---~~e~~TGv~~~~~G~~~~~~ 484 (763) T protein:vir:95 409 QRGMPKGMLDALNSRRYREGEDYEYNPTQNPAQMIIEHKFPE-LPQSALTMATLQNQ---EAESLTGVKAFAGGVTGESY 484 (763) T ss_pred cEEeecccccchhhhcccCCceEEeeCCCChhhhcccccCCC-CcchHHHHHHHHHH---HHHHhhCcchhhcCcCcccc Confidence 110 001111 111111222222233322222111 112222 22222223333332 34566677654332221112 Q ss_pred HHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHhcC-c-ceEE--EecccccH Q lcl|NC_019423. 492 RGAL---DAASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDLKG-N-FDIE--VDINTAEI 564 (756) Q Consensus 492 ~~~~---~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~~~-~-~Dv~--V~~g~a~~ 564 (756) +... .....+-...+..+.+.+.+..+.+...+....-.- ...+..+.|.+..+.. . -++. .++..... T Consensus 485 ~~tat~v~~l~qa~~~~~~~~~r~~~~~~k~l~~~~l~Li~q~----~d~~rviRI~g~e~v~v~~~~~~~~~DV~V~~~ 560 (763) T protein:vir:95 485 GDVAAGIRGVLDAASKREMAILRRLAKGMSEIGNKIIAMNAVF----LAEHEVVRITNEEFVTIKREDLKGNFDLEVDIS 560 (763) T ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh----CCCCcEEEEeCCccccccHHHhcCCcceEEecc Confidence 2111 111111111122222222222233333333331110 0122345554432211 0 0111 11122232 Q ss_pred H-HHHHHHHHHHHHHhhccCCHhHHHHHHHHHH-hhcCChhHHHHhhhccCCCChhhhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 565 D-NQKSQDLGFMVQTLGNTVDQSITLSLVAKIA-ELKRMPDLAHELRTWQPQPDPMEEQLKQLAIQKAQLENEELQSKIA 642 (756) Q Consensus 565 ~-~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~-e~~~~~~~~~~l~~~~~q~~p~~~~~~q~~~~~aq~e~~~~qa~a~ 642 (756) . ....+++..+.+ +...+++.+.+.+...++ +.+++.+..+.........++. 
.+.++++++++..+++++++ T Consensus 561 ~as~~~q~~~~l~~-ll~~l~~~~~~~~~~~il~~~~d~~~~~~~~~~lr~~q~~~----d~~~q~qaqle~~~~q~e~~ 635 (763) T protein:vir:95 561 TAEVDNQKSQDLGF-MLQTIGPNVDQQITLNILAEIADLKRMPKLAHDLRTWQPQP----DPVQEQLKQLAVEKAQLENE 635 (763) T ss_pred cchHHHHHHHHHHH-HHHHhccccChHHHHHHHHHHHhhhchhhhHHHHHhcCCCc----cchhhhHHHHHHHHHHHHHH Confidence 3 333444444444 445566666665555544 4456666665555555444332 23444556677777777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCchhhhccCCCCCCC--cc Q lcl|NC_019423. 643 LNNAKAKEAASSGDLKDLDYLEQESGT-KHARDMEKQKAQSQGNQNLQITKALTTPTKEGETTPNISAAVGYNTLT--NG 719 (756) Q Consensus 643 ~~~a~a~~~~aq~~~~~~~~~~q~~~~-k~~~~~~~~~~q~~~~~~~~~~~a~~~~~~~~~~~~~~~~a~~~~~~~--~~ 719 (756) ..+++++..++++.....+...+.... .++.+++..+.....+.+.++...+...+... ...-++.- ..-.. .+ T Consensus 636 ~~~akaq~~qaqa~~~~aq~e~~~~d~~~~e~~~Q~~~e~~~~~~~~eaq~~l~~~~a~~--~~~~ea~~-~~~~~~~~~ 712 (763) T protein:vir:95 636 ELRSKIRLNDAQAQKAMAERDNKNLDYLEQESGTKHARDLEKMKAQSQGNQQLEITKALT--KPRKEGEL-PPNLSAAIG 712 (763) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHhcc-ChhHHHhhh Confidence 777777666655544433322221111 11122222222111122222111111111111 11111221 00000 01 Q ss_pred cCchhc-----CCCCCCCCcccc---ccccccCCCCCCCCCCCcC Q lcl|NC_019423. 720 NSPQER-----DLAAQQDPAYSL---GSQYYDPSQDPASALGMNL 756 (756) Q Consensus 720 ~~~~~~-----~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~ 756 (756) ..|... .++...+-.+.. 
.|+-..|.-+|+.+|.++- T Consensus 713 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 757 (763) T protein:vir:95 713 YNALTNGEDTGIQSVSERDIAAEANPAYSLGSSQFDPTRDPALNP 757 (763) T ss_pred hcccccccCCCccchhhcccCccccccccCCCCCCCCCCccccCC Confidence 111111 111111112223 4555566777777777777 No 136 >protein:vir:172 Length: 708 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112077;genbank:gi:13559867;genbank:GeneID:920970 Probab=96.41 E-value=0.00065 Score=38.04 Aligned_cols=601 Identities=12% Similarity=0.033 Sum_probs=166.8 Q ss_pred hhHHH-HHHHHHHHHhccccCCCCCCCCCCCcccCHHHHHHHHHHHHHHHHhhcCCCCEEEEe-cCCcchHHHHHHHHHH Q lcl|NC_019423. 42 HDAIM-SQIREWNDLMEVKGKAKPPKIKGRSQVQPRLVRRQAEWRYAPLSEPFLSSSKLFKLT-PVTFEDELAARQNELV 119 (756) Q Consensus 42 ~~~~~-~~~~~~~~~y~~~~~~~~~~~~grS~~v~~~v~~~~e~~~~~L~~~f~~~~~~~~~~-p~~~~D~~~A~q~t~~ 119 (756) -.+.. ..+.++++.|...-. ...+|+--... +.=|.|+ ...|.++..+. T Consensus 1 ma~~~~~~~~~~~~r~~~~~~------------------~~~~~r~~~~~------d~~f~~y~G~Qw~~~~~~~----- 51 (708) T protein:vir:17 1 MAETLEKKHERIMLRFDRAYS------------------PQQEVREKCIE------ATRFARVPGGQWEGATAAG----- 51 (708) T ss_pred CchhHHHHHHHHHHHHHHHHh------------------hhHHHHHHHHH------HHHhhccCCCCCCHHHHHH----- Confidence 11111 112233333322100 00111111000 1112333 33555554442 Q ss_pred HHHHHhhhcCCcc-hHHHHHHHHhhcCceEEEEeeeeeeeeeeeeeeeeecC-CCCCHHHHHHHHHhHHHhhhccchhcc Q lcl|NC_019423. 120 LNYQFRTQLNKVK-LVDDYVHSIVDDGTGIARIGWERKTVKIKTETPVFQLY-PIENQEQADVLQQALQLQAENPREYDE 197 (756) Q Consensus 120 ~n~~~~~~~~~~~-~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~e 197 (756) |.. ..+-.|.+ +.+|.|+-.+..-.|.-+ .+ ..+ +.+... 
...+.+.++.+...+..+.+.- T Consensus 52 l~~--~~q~~~rP~~~~N~i~~~i~~v~g~e~-----~n---r~d-~~v~p~~~~~d~~~Ae~l~~l~~~~~~~~----- 115 (708) T protein:vir:17 52 TKL--DEQFEKYPKFEINKVATELNRIIAEYR-----NN---RIT-VKFRPGDREASEELANKLNGLFRADYEET----- 115 (708) T ss_pred HHh--hhhhcCCCceEEcchHHHHHHHHhhHh-----hC---Ccc-eEEecCCCcchHHHHHHHHHHHHHHHHhc----- Confidence 100 01222333 445666666665555211 22 221 223333 3456778888888777665532 Q ss_pred ccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCce--eEEEechhheEeCCCCcCccccCceEEEEeecCHHHHH Q lcl|NC_019423. 198 TMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRP--TVEMLNPNNVVIDPSCNGDLDKALYAVISFETCKADLM 275 (756) Q Consensus 198 ~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~--~ie~V~p~~~~~Dp~a~~d~~da~~v~~~~~~t~~el~ 275 (756) ........++.+.+.+|.|+. ++..+.....++ .-..+....++.+|..- .+| -..+..+++|.+ T Consensus 116 ~~~~~~s~Af~~~i~~G~G~~-------~~~~d~~~e~d~~~~~~~i~i~~~~~~~~~v-~~D-----p~a~~~D~sDar 182 (708) T protein:vir:17 116 DGGEACDNAFDDAATGGFGCF-------RLTSMLVNEYDPMDDRQRIAIEPIYDPSRSV-WFD-----PDAKKYDKSDAL 182 (708) T ss_pred CchhHHhHHHHHhhhccccee-------eeeecccccCCCCCCccccceEeeccchhhe-ecC-----ccccccChhhhh Confidence 222334556667777766543 222211100000 00111111111111110 011 112233444443 Q ss_pred hhccchhhhcccCchhhhhhhchhhhccccccccccccccceEEEEEEEEEeeccCCceeEEEEEE--------E----E Q lcl|NC_019423. 276 KNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVAYEYWGFYDINDDGSLEPIVAT--------W----I 343 (756) Q Consensus 276 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~E~w~k~d~~~~g~~~~~~~~--------~----~ 343 (756) ..+ ....++. +.. ...++.... ......... ...++++ +.+.-.+.++++-. 
+ + T Consensus 183 ~~~-~~~~~~~---d~~-~~~yp~~a~---~~~~~~~~~---~~~~~~~---~~d~vrv~e~~~r~~~~~~~~~~~~~~~ 248 (708) T protein:vir:17 183 WAF-CMYSLSP---EKY-EAEYGKKPP---ASLDVTSMT---SWEYDWF---DADVIYIAKYYEVRKESVDVISYRHPIT 248 (708) T ss_pred hhh-hhccCCH---HHH-HHhCccccc---hhhhhhhhc---ccccccc---CCCeEEEEEEEEEeeeeeEEEEEecCcc Confidence 321 1111111 110 001111100 000000000 0112222 11111122222111 1 1 Q ss_pred CCEEEEecccc-------------------------c-----------CCCccceEEeeeeeecCccc---CCchHHHhH Q lcl|NC_019423. 344 GSTLIRMENNP-------------------------F-----------PDGKLPLVVVPYMPRKRELF---GEADAELLG 384 (756) Q Consensus 344 g~~~L~~~~~P-------------------------~-----------~~~~~Pfv~~~~~~~~~~~~---G~g~v~~~~ 384 (756) |+.+...+.+. | ..+.+||-.|++.|.-+..+ |.+.... T Consensus 249 g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~l~~~~~~p~~~fP~vP~~g~r~~~d~~~~~yG-- 326 (708) T protein:vir:17 249 GEIATYDSDQVEDIEDELAIAGFQEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRWFIDDIERVEG-- 326 (708) T ss_pred CceeeeCccchhhHHHHHHhcccccceeeeeeEEEEEEEeecccccccCCCCCCCCccceEEEecccccccCCCcccc-- Confidence 22222221110 0 00124454455554433321 1111111 Q ss_pred HHHHHHHHHHHHHHHHHHhhcCCceE-eeccccCccchh--hhhcccc----c----ccccc--cccc----cccccccc Q lcl|NC_019423. 385 DNQAILGATMRGMIDLLGRSANGQRG-YPKGMLDTLNRR--RYDDGQD----Y----EYNPM--QGNP----SQSIMEHK 447 (756) Q Consensus 385 d~Q~~iN~~~~~~~d~l~~~~~~~~~-~~~gav~~~~~~--~~~~~~~----~----~~~~~--~~~~----~~~i~~~~ 447 (756) +.|.+.|.-...|+-.-. .+-.++...... +...... + ..... ...+ ...+.... T Consensus 327 --------~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~a~~g~~~~~~~~~~~~~~~~~~~~~~~~~g~v~~~a 398 (708) T protein:vir:17 327 --------HIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKYGNIIAGA 398 (708) T ss_pred --------hhhhchhHHHHHHHHHHHHHHHHHhcCCcceeechhhhhhhHHhhhhcccchhhhhhhhccCCccccccccc Confidence 122222222221110000 000001000000 0000000 0 00000 0000 01111111 Q ss_pred CCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019423. 
448 FPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAI-LRRLAKGMADIGTKICAMNAV 526 (756) Q Consensus 448 ~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~-~~n~~~~~~~l~~~~l~li~q 526 (756) .+..-.....+-+...++++...+--...+|.+..++|.+. .+|++..++.+..+.. .-.|-+.++.-.+.+.+++.. T Consensus 399 ~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGi~d~~~G~~s-n~SG~Ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~~lL~ 477 (708) T protein:vir:17 399 TPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPS-NIAQETVNNLMNRADMASFIYLDNMAKSLKRAGEVWLS 477 (708) T ss_pred CCcccCCCccccHHHHHHHHHHHHHHHHhcCCChHHccCcc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11110011123334444444444444445788887777654 4788877666654443 344555666677777777776 Q ss_pred hCCCCcEEEEecCceeecCHhHhcCcceEEEeccc---ccHHHHHHHHHHHH-HH---HhhccCCH--hHHHHHHHHHHh Q lcl|NC_019423. 527 FLSEKEVVRITNEQYVEIKREDLKGNFDIEVDINT---AEIDNQKSQDLGFM-VQ---TLGNTVDQ--SITLSLVAKIAE 597 (756) Q Consensus 527 ~~~~~r~iRI~g~~~v~i~~d~~~~~~Dv~V~~g~---a~~~~~~~q~l~~l-lq---~~~~~~~~--~~~~~~l~~l~e 597 (756) +..+- .+.+..+.|...+-. ...+.++... ..........+.-. .. ..+|..+- +.....|.+++. T Consensus 478 lI~~~----y~~~R~~RI~~edg~-~~~v~in~~~~d~~~g~~~~~nDi~~g~~Dv~v~~~p~~~t~r~~~~~~l~qll~ 552 (708) T protein:vir:17 478 MAREV----YGSEREVRIVNEDGS-DDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLS 552 (708) T ss_pred HHHHH----cCCCcEEEEecCCCC-cceeeecceeccCCCccceeeccceeeeeeEEEecccCchhHHHHHHHHHHHHHH Confidence 54331 112234455443311 1112222110 00000000000000 00 00011110 011122223333 Q ss_pred hcCC-----hhHHHHhhhccCCC--ChhhhhH-------------HHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 598 LKRM-----PDLAHELRTWQPQP--DPMEEQL-------------KQLAIQK--AQLENEELQSKIALNNAKAKEAASSG 655 (756) Q Consensus 598 ~~~~-----~~~~~~l~~~~~q~--~p~~~~~-------------~q~~~~~--aq~e~~~~qa~a~~~~a~a~~~~aq~ 655 (756) .... 
+.+...+-.....| +.....+ .+.++++ .++..++.+++.....++++..++|+ T Consensus 553 ~~~~~~~~~~~~~~l~l~~~D~p~~~ei~e~ir~~~~~~~~~~~~~~e~~q~~~q~qq~~q~q~~~~~~eaqa~~~~~qA 632 (708) T protein:vir:17 553 SMLPADPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQA 632 (708) T ss_pred hcCCccchhHHHHHHHHHhcCCCChHHHHHHHHHHhhccccccCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2221 11222221111111 1111000 0001111 11111122222233333444444444 Q ss_pred HHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhhccCCchhhhccCCCCCCCcccCchhcCCCCCCCCc Q lcl|NC_019423. 656 DLKDLDYLEQESGTKHA-RDMEKQKAQSQGNQNLQITKALTTPTKEGETTPNISAAVGYNTLTNGNSPQERDLAAQQDPA 734 (756) Q Consensus 656 ~~~~~~~~~q~~~~k~~-~~~~~~~~q~~~~~~~~~~~a~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~ 734 (756) ++.+++..+...+++.. +..+...++.+..+.++....+. ........+.++.. ...-+.+--+++.-|+ T Consensus 633 e~~ka~aea~~~q~~a~q~~~~~~~a~~~a~q~~~q~~~~~-~~~~~~~~~~l~~~--------q~~q~q~~~a~p~~~~ 703 (708) T protein:vir:17 633 EAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNID-DKAVMEAIRLLKDV--------AESQQQQFQSPPQSPA 703 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhhhh--------hhhHHHHHhccccCch Confidence 44444332222211110 01111111111111111110000 00001111222211 1111111111111110 Q ss_pred cccccccccCCCCCC Q lcl|NC_019423. 735 YSLGSQYYDPSQDPA 749 (756) Q Consensus 735 ~~~~~~~~~~~~~~~ 749 (756) ..+|+ T Consensus 704 ----------~~~~~ 708 (708) T protein:vir:17 704 ----------DLMPS 708 (708) T ss_pred ----------hccCC Confidence 00000 No 137 >protein:vir:105619 Length: 772 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164304;genbank:gi:56692922;genbank:GeneID:3197230 Probab=96.38 E-value=0.00069 Score=37.93 Aligned_cols=620 Identities=13% Similarity=0.054 Sum_probs=188.6 Q ss_pred CCCccccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCC-CCCCCCCCCcccCHHHHHHHHHHHHH Q lcl|NC_019423. 
10 LPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKA-KPPKIKGRSQVQPRLVRRQAEWRYAP 88 (756) Q Consensus 10 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~-~~~~~~grS~~v~~~v~~~~e~~~~~ 88 (756) +.-++ +-..|-++.... ..+.+-..=..+..++..+-+|+--. T Consensus 1 ~~~~~------------------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~r~~a 44 (772) T protein:vir:10 1 MQITE------------------------------------NDRQYLNGLPPAGDTPLTVDEYADINYEIEDQPAWRAVA 44 (772) T ss_pred CCcch------------------------------------hhHHhhccCCcccccccCHHHHHHHHHHHhccHHHHHHH Confidence 11111 112233321111 11111111112333455555666444 Q ss_pred HHHh-hcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeeeeeeeee Q lcl|NC_019423. 89 LSEP-FLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIKTETPVF 167 (756) Q Consensus 89 L~~~-f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~ 167 (756) .-.. |..|+ .|.++..+.. +..+.-.+.+|.|+-.+..-.|..+ ..+. + ..+ T Consensus 45 ~~d~~fy~G~--------QW~~~~~~~l----------~~~g~p~~~~N~i~~~v~~v~g~~~-----~nr~---d-~~v 97 (772) T protein:vir:10 45 DKEMDYADGN--------QLDTELLRRQ----------QALGIPPAVEDLIGPALLSLQGYEA-----VTRT---D-WRV 97 (772) T ss_pred HHHHHhhcCC--------CCCHHHHHHH----------HhcCCCcEEEcchHHHHHHHHHHHH-----hcCc---c-eEE Confidence 3322 33443 4444333331 2334445556666666666556222 2222 1 223 Q ss_pred ecC-CCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEEEechh- Q lcl|NC_019423. 168 QLY-PIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVEMLNPN- 245 (756) Q Consensus 168 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie~V~p~- 245 (756) ... +..+.+.++.+..++..+.+.- ........++.+.+.+|.|+ .++ .. . 
-+|+ T Consensus 98 ~Pr~~~~d~~~Ae~l~~~~~~~~~~~-----~~~~~~s~Af~~~i~~G~Gw-------~e~--~~--~-------~d~~~ 154 (772) T protein:vir:10 98 TPNGDVGGQEVADALNYRLNTAERQS-----GADRACSEAFRPQIACGIGW-------VEV--SR--E-------SDPFK 154 (772) T ss_pred ecCCCchHHHHHHHHHHHHHHHHHhc-----ChHHHHHHHHHHhhhcCcee-------EEe--cc--c-------cCCCC Confidence 332 3466777888888777655432 12223345566666666543 211 10 0 0111 Q ss_pred -heEe---CCCCcCccccCceEEEEeecCHHHHHhhccchhhhcc------cCchhhhhhhch--hhhcccccccccccc Q lcl|NC_019423. 246 -NVVI---DPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDK------IDWESSSPITDP--DHESKTPSDFQFKDA 313 (756) Q Consensus 246 -~~~~---Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~------~~~~~~~~~~~~--~~~~~~~~~~~~~d~ 313 (756) ++++ ||..- .+| .. ...++++.+... ....++. +........... ........+.++.+. T Consensus 155 ~~i~i~~v~p~~v-~~D-p~-----a~~D~sDar~~~-~~~~~~~d~~~~~fp~~a~~~~~~~~~~~~~~~~~~~~~~~~ 226 (772) T protein:vir:10 155 FPYRCRPIRRDEI-HWD-MK-----CGDDWEACRFLR-RQRWLSPDRIALVFPEHAELIGMVGKYGSTWWGQPDLGMMEG 226 (772) T ss_pred CCeEEEeeCcccc-eec-CC-----CCCCHHHhhhhh-hhccCCHHHHHHhCCCchhHHHhhhhhcccccCccccccccc Confidence 1211 22221 111 10 123566654432 1111111 111110000000 000111111111110 Q ss_pred ccceEEEE------EEEEEe-----eccCCc--eeE-EEE------EEEE--CCEEEEecccc----------------- Q lcl|NC_019423. 314 LRKKVVAY------EYWGFY-----DINDDG--SLE-PIV------ATWI--GSTLIRMENNP----------------- 354 (756) Q Consensus 314 s~~~V~v~------E~w~k~-----d~~~~g--~~~-~~~------~~~~--g~~~L~~~~~P----------------- 354 (756) .. ...++ ..|... +.+.+. +.+ +++ +++. |..+.....+| T Consensus 227 ~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~rVrv~E~w~r~~~~~~~~~~~~g~~~~~~~~~~~~~~~l~~g~~~~~~~~ 305 (772) T protein:vir:10 227 GT-STGLHNAWNEARAWTVQEDHWYNPTSKEICLVELWYRRWVQVHVLKSPDGRVVEYDPNNLAHNIALASGRISPKKVT 305 (772) T ss_pred cc-ccccccccchhhccccccccccccCCceEEEEEEeeeeeeeeeeeccCCCceEeeCcccHHHHHHHhhcccchheee Confidence 00 00011 112110 000010 111 111 1111 22222222221 Q ss_pred --------------cCCCccce--EEeeeeeecCccc-CCchHHHhHHHHHHHHHHHHHHHHHHHhhcC----CceEeec Q lcl|NC_019423. 
355 --------------FPDGKLPL--VVVPYMPRKRELF-GEADAELLGDNQAILGATMRGMIDLLGRSAN----GQRGYPK 413 (756) Q Consensus 355 --------------~~~~~~Pf--v~~~~~~~~~~~~-G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~----~~~~~~~ 413 (756) ...+..|| -.|++.|.-+... -.|... -++|.++|.-...|+ -.+++.. T Consensus 306 ~~rv~~~~~~g~~~L~~~~~p~~~~~fP~vP~~g~r~~~~g~~~----------G~vr~~kd~Qr~~N~~~S~~~~~l~~ 375 (772) T protein:vir:10 306 VSRVRRSYWLGPHCLHDGPTPYTHRHFPYVPFFGFREDATGIPY----------GYVRGMKYAQDSLNSGVSKLRWGMSV 375 (772) T ss_pred eeEEEEEEEecceeeccCCCCCCCCccceEEEeeeEeccCCccc----------chhhhhhhHHHHHHHHHHHHHHHHhc Confidence 12222233 3344454322221 112222 233444443322222 1223332 Q ss_pred cccCccchhhhhcc----cc-ccccc-cccccccccc---cccCCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCcccc Q lcl|NC_019423. 414 GMLDTLNRRRYDDG----QD-YEYNP-MQGNPSQSIM---EHKFPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAY 484 (756) Q Consensus 414 gav~~~~~~~~~~~----~~-~~~~~-~~~~~~~~i~---~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~ 484 (756) -++-.....-.... .. ...+. +.++++..-. .+.....+.-....++++.....++ ...+|.+...+ T Consensus 376 ~~~~~~~gav~~~d~~~~e~~arp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i----~~vsGv~~~~l 451 (772) T protein:vir:10 376 ARVERTKGAVAMTDAQFRRQIARPDADIVLDENHMAKPGARFDVKRDYTLTDQHFQMLQDNRATI----ERVSNITAGFQ 451 (772) T ss_pred ccccccCCCccchhHHHHHhccCCCCeEEeCCccccCCCCCccccCCccccHHHHHHHHHHHHHH----HHHhCCCHHHc Confidence 22211111000000 00 01111 1122221111 1111122333445555555555554 34559988888 Q ss_pred chhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEe-cCceeecCHhH-hcCcceEEEecc- Q lcl|NC_019423. 485 GDVAAGIRGALDAASKREMA-ILRRLAKGMADIGTKICAMNAVFLSEKEVVRIT-NEQYVEIKRED-LKGNFDIEVDIN- 560 (756) Q Consensus 485 ~~tA~~i~~~~~aa~~~l~~-~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~-g~~~v~i~~d~-~~~~~Dv~V~~g- 560 (756) |.+.+++|++..++.+.... ..-.|-+.++.-.+.+.+++..+... +. .+..+.|...+ ...+.=+.++.. 
T Consensus 452 G~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~-----~y~~er~~RI~~~d~~~~~~~v~in~~~ 526 (772) T protein:vir:10 452 GRKGTATSGIQEQQQIEQSNQSIGRIMDNFRAGRTLVGELLLAMIVE-----DIGQERTEVVIEGDAVTADRVVVLNEPQ 526 (772) T ss_pred CCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HcCCCcEEEEecCCCCCCCceEEeccce Confidence 88888899988777765543 23444456666667777777665432 22 22344454332 211111222110 Q ss_pred --cccHHHHHHHHHHHH-HH---HhhccCCHhHHHHHHHHHHhhcCC--hhHHH----HhhhccCCCChh--hhhHHH-- Q lcl|NC_019423. 561 --TAEIDNQKSQDLGFM-VQ---TLGNTVDQSITLSLVAKIAELKRM--PDLAH----ELRTWQPQPDPM--EEQLKQ-- 624 (756) Q Consensus 561 --~a~~~~~~~q~l~~l-lq---~~~~~~~~~~~~~~l~~l~e~~~~--~~~~~----~l~~~~~q~~p~--~~~~~q-- 624 (756) +.+........+.-. .. .-+| ..+......+..++++.+. |.... .+-.....|... .....+ T Consensus 527 ~d~~tg~~~~~NDi~~g~yDv~i~~~p-~~~t~r~~~~~~m~ql~~~~~P~~~~~~~~~~le~~D~p~~~ei~~~ir~~~ 605 (772) T protein:vir:10 527 RDPQTGAAYLSNDLLRTRIKVALEDVP-STNSYRGQQLNAMSEAVKSMPPQYQAAVLPFLVSLMDVPFKRDVVEAIRAVD 605 (772) T ss_pred ecccccccceeccceeeeEEEEeeccc-cchHHHHHHHHHHHHHHhccChhHHHHHHHHHHhhcCCCChHHHHHHHHHHh Confidence 000000000000000 00 0001 0111112233333333221 22211 111111111111 110111 Q ss_pred ----HHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----H Q lcl|NC_019423. 625 ----LAIQKAQLENEELQSKIALNNAKAK--EAASSGDLKDLDYLEQESGTKHARDMEKQKAQSQGNQNLQITKA----L 694 (756) Q Consensus 625 ----~~~~~aq~e~~~~qa~a~~~~a~a~--~~~aq~~~~~~~~~~q~~~~k~~~~~~~~~~q~~~~~~~~~~~a----~ 694 (756) +++++.+.+ +..+.+.+..+++.+ +++++...+.++.++.++ +..+...+.+..++++ . 
T Consensus 606 ~~~~peq~~~~~~-q~~qq~~~~~~~el~~~q~~a~~~~~~A~a~~~~a----------qa~~~~~~a~~~a~~aa~~~~ 674 (772) T protein:vir:10 606 QQQTPEQIQQQID-QAVQDALAKAGNDIKLRELEIKERKADSEISGLNA----------KAVQIGVQAAFSAMQAGAQIA 674 (772) T ss_pred ccCChHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----------HHHHHHHHHHHHHhhhhhhHH Confidence 111111100 001111111111111 111111111111111111 1111111111111111 0 Q ss_pred HHHhhccCCchhhhccCCCCCCCccc----CchhcCCCCC----------CCCccccccccccC-----------CCCCC Q lcl|NC_019423. 695 TTPTKEGETTPNISAAVGYNTLTNGN----SPQERDLAAQ----------QDPAYSLGSQYYDP-----------SQDPA 749 (756) Q Consensus 695 ~~~~~~~~~~~~~~~a~~~~~~~~~~----~~~~~~~~~~----------~~~~~~~~~~~~~~-----------~~~~~ 749 (756) ...+..+.+.+.+..+ ||..++|.. .|....++++ .+++..++.|...| -..|+ T Consensus 675 q~~q~a~~ad~~l~~~-g~~~~~~~~~~~~~p~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~p~~~q~~~~ 753 (772) T protein:vir:10 675 QMPMIAPIADAVMQSA-GYQRPNPAGDDPNYPIADQTAAMNIRSPYIQGQGPAAEAEAESVSVRRNTSPTYPPVPEEAPT 753 (772) T ss_pred hhhhhhHHHHHHHHhc-ccccccccccCCCCCCCCCccCCCCCccCCCCCCCCCccccCCCCCccCCCCCCCCCCcccCC Confidence 1112222234445443 665443321 1111111111 11222222222211 11223 Q ss_pred CCCCCcC Q lcl|NC_019423. 750 SALGMNL 756 (756) Q Consensus 750 ~~~~~~~ 756 (756) ...||.- T Consensus 754 ~~~g~~~ 760 (772) T protein:vir:10 754 GLRGIET 760 (772) T ss_pred CCCCCCC Confidence 3334333 No 138 >protein:vir:95014 Length: 491 # NCBI annotation: structural protein # Family: family:all:584 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224035;genbank:gi:62327322;genbank:GeneID:5176842 Probab=88.69 E-value=0.031 Score=28.87 Aligned_cols=455 Identities=11% Similarity=0.062 Sum_probs=166.9 Q ss_pred cCCCCCCCCCCCc----ccCHHHHHHHHHHHHHHHHhhcCCCCEE----EEecCC--cchHHHHHHHHHHHHHHHhhhcC Q lcl|NC_019423. 60 GKAKPPKIKGRSQ----VQPRLVRRQAEWRYAPLSEPFLSSSKLF----KLTPVT--FEDELAARQNELVLNYQFRTQLN 129 (756) Q Consensus 60 ~~~~~~~~~grS~----~v~~~v~~~~e~~~~~L~~~f~~~~~~~----~~~p~~--~~D~~~A~q~t~~~n~~~~~~~~ 129 (756) |. 
...|+-+ ..+-.....-.|- +++..++|+... .|.|.- ++++ +.|-+|+ T Consensus 1 ~~----~~~~~~~~V~~~hp~y~a~~~~W~---~ird~~~G~~~~~~r~~yl~~~~~~~~e------~~Y~~rl------ 61 (491) T protein:vir:95 1 ML----TANGQGSGVKTKHREWLHYAPKWQ---KVRHALAGDLVGYLRNVGLNEPDKAYGE------ARQAEYE------ 61 (491) T ss_pred Cc----ccCCccCCCCccCHHHHHHHHHHH---HHHHHhcCcchhhcccCCCcCCCCCCCH------HHHHHHH------ Confidence 11 1223333 2233444455554 466667775432 233321 2222 2355554 Q ss_pred CcchHHHHHHHHhhcCceEEEEeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHH Q lcl|NC_019423. 130 KVKLVDDYVHSIVDDGTGIARIGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNY 209 (756) Q Consensus 130 ~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~ 209 (756) ..-++.|++++.|..=.|.+-. +.++.+ ....+ +...++-+.....+.+-.+..+.. T Consensus 62 ~rA~~~n~~~~tl~~l~G~vfr-----------k~p~~~----~p~~l--------~~l~~d~D~~G~~L~~f~~~~~~~ 118 (491) T protein:vir:95 62 AGGIVYNFTRRTLSGMVGSVMR-----------KEPEIN----IPKEL--------EYLLKNADGSGVGLIQHAQDTLME 118 (491) T ss_pred hcccCCChHHHHHHHHhchhhc-----------CCceee----ccHHH--------HHHHhccCCCCCCHHHHHHHHHHH Confidence 1233556777766554443321 011111 11221 112222222333344445556666 Q ss_pred HHhcCCcceeccCceeE--EEee-eeecCceeEEEechhheEeCCCCcCcc--ccCceEEEEeecCHHHHHhhccchhhh Q lcl|NC_019423. 210 FNETGEATYAIQTGVTE--VEVE-KALVNRPTVEMLNPNNVVIDPSCNGDL--DKALYAVISFETCKADLMKNKDRYHNL 284 (756) Q Consensus 210 ~~~~G~~~~~~~~g~~~--~~~~-~~~~g~~~ie~V~p~~~~~Dp~a~~d~--~da~~v~~~~~~t~~el~~~~~~~~~l 284 (756) ...+|.+..-+-.-... ..-+ +-..-+|.+..++|++|+ ++.....- ....++..+.... 
T Consensus 119 ~l~~G~~~ilVD~P~~~~~T~Ade~~~~~rPy~~~~~~~~Ii-nW~~~~v~g~~~L~~v~l~E~~~-------------- 183 (491) T protein:vir:95 119 IDSVGRGGLLVDAPETAAATAAEQNAGLLNPTIAFYTTENIV-NWRLTRVGSVNRVTMVVLRETWE-------------- 183 (491) T ss_pred HHHcCeEEEEEecCCCcccCHHHHHHhcCCcEEEEechhhhc-CceeeeeCCceeeeEEEEEEeEE-------------- Confidence 66777655433210000 0000 001126888889988886 45432210 0111221111000 Q ss_pred cccCchhhhhhhchhhhccccccccccccccceEEEEEEEEEeeccCCceeEEEEEEEE--CCEEEE-------eccccc Q lcl|NC_019423. 285 DKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVAYEYWGFYDINDDGSLEPIVATWI--GSTLIR-------MENNPF 355 (756) Q Consensus 285 ~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~E~w~k~d~~~~g~~~~~~~~~~--g~~~L~-------~~~~P~ 355 (756) .......+......++||++. +.+|....+++-.. |+.... .+.++ T Consensus 184 ------------------~~d~~~~f~~~~~~qyRvL~l------~~~g~~~~~v~r~~~~g~~~~~~~~~~~~~g~~~- 238 (491) T protein:vir:95 184 ------------------YHEPGNEFETKYGEQYRVLDI------DTDGNYRQRLFRFDAEGGAQEEVVEIYPDLGESL- 238 (491) T ss_pred ------------------eecCCCCcccceEEEEEEEee------cCCCceEEEEEEEcCCCcceeeeeeeeecCCCcc- Confidence 000011222333334555543 11222222222111 111111 11122 Q ss_pred CCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhhhhccccc--ccc Q lcl|NC_019423. 356 PDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDY--EYN 433 (756) Q Consensus 356 ~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~--~~~ 433 (756) .+.+||+++..... +-..+.+..-++..+...+=...+-.-+.++.++.|...+. |.-+...+ +...+... 
.++ T Consensus 239 -l~~IPfv~~~~~~~-~~~~~~pPLl~LA~lni~Hy~~ssd~~~~l~~~~~P~l~~~-G~d~~~~~-~~~~~~~~~i~~g 314 (491) T protein:vir:95 239 -RGVIPFTFIGATNN-DATIDDAPLLPLAELNIGHYRNSADNEESSFVVGQPTLFIY-PGDNLTPQ-SFKEANPNGIKFG 314 (491) T ss_pred -cCeeEEEEEecCCC-CCCCCcCchHHHHHHHHHHhhhhhHHHHHHHHcccceeeee-cCcccCcc-hhhccCcceeEec Confidence 24566666554321 22224555666666654443334445566777777765542 21111111 11111100 000 Q ss_pred ---ccccccccccccccCCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 434 ---PMQGNPSQSIMEHKFPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLA 510 (756) Q Consensus 434 ---~~~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~ 510 (756) +.....+....++++....- ....|.-....|.. .|.. +...+ .+.||++.+.-..+....|..++.|++ T Consensus 315 ~~~~~~lP~~~~~~~ie~~~~~~-~~~~l~~~e~qm~~-~Ga~---l~~~~--~~~Ta~~~~~~~~~~~S~L~~~a~~~e 387 (491) T protein:vir:95 315 SRCGHNLGYGGSAQLIQAGENNL-ARQNMLDKEQQAIQ-IGAQ---LITPS--QQITAESARIQRGADTSVMATIARNVS 387 (491) T ss_pred CcCCcCCCCCCccceeecCcchH-HHHHHHHHHHHHHH-HHHH---hccCC--cchhHHHHHHHHHHhhHHHHHHHHHHH Confidence 01111122233333332211 12223333333322 2332 11111 247887777766677777888888887 Q ss_pred HHHHHHHHHHHHHHHhhCCCC--cEEEEecCceeecCHhHhcCcceEEEecccccHHHHHHHHHHHHHHHhhccCCHhHH Q lcl|NC_019423. 511 KGMADIGTKICAMNAVFLSEK--EVVRITNEQYVEIKREDLKGNFDIEVDINTAEIDNQKSQDLGFMVQTLGNTVDQSIT 588 (756) Q Consensus 511 ~~~~~l~~~~l~li~q~~~~~--r~iRI~g~~~v~i~~d~~~~~~Dv~V~~g~a~~~~~~~q~l~~llq~~~~~~~~~~~ 588 (756) +++... |.++..|.... .-+.| .+|+| |++ ...+.+..+.++.+.+. ..++.... 
T Consensus 388 ~al~~~----l~~~a~w~G~~~~~~v~i------~~n~d-----F~~------~~~~~~~~~all~~~~~--G~is~~t~ 444 (491) T protein:vir:95 388 QAYTDA----LRWVAMMLGKPEDSEVEF------QLNMD-----FFL------QPMTAQDRAAWMADINA--GLLPATAY 444 (491) T ss_pred HHHHHH----HHHHHHHcCCCCCCceEE------Eeecc-----ccc------ccCCHHHHHHHHHHHhc--CCCCHHHH Confidence 766554 55555554421 11111 11111 111 11111222333333322 33444433 Q ss_pred HHHHHH--HHhhcCChhHHHHhhhccCCC---ChhhhhHHHHHHHHHH Q lcl|NC_019423. 589 LSLVAK--IAELKRMPDLAHELRTWQPQP---DPMEEQLKQLAIQKAQ 631 (756) Q Consensus 589 ~~~l~~--l~e~~~~~~~~~~l~~~~~q~---~p~~~~~~q~~~~~aq 631 (756) ..-|.+ +++ ...+++.+.|+...+.. .+..-...+.+++..+ T Consensus 445 ~~~L~~~~vl~-~~~e~~~~~ie~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (491) T protein:vir:95 445 YAALRKAGVTD-WTDEDILNAIEDAPLPSGAVTQVAGEIPQAAQQQQE 491 (491) T ss_pred HHHHHhCCCCC-ccHHHHHHHHHhcCCCCCccccccccchhhhhhccC Confidence 333321 111 11222333333222110 0111111111111111 No 139 >protein:vir:78393 Length: 489 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110831;genbank:gi:134288592;genbank:GeneID:5179656 Probab=88.31 E-value=0.033 Score=28.70 Aligned_cols=453 Identities=10% Similarity=0.076 Sum_probs=163.4 Q ss_pred cCCCCCCCCCCCc----ccCHHHHHHHHHHHHHHHHhhcCCCCEE-E---EecCCcchHHHHHHHHHHHHHHHhhhcCCc Q lcl|NC_019423. 60 GKAKPPKIKGRSQ----VQPRLVRRQAEWRYAPLSEPFLSSSKLF-K---LTPVTFEDELAARQNELVLNYQFRTQLNKV 131 (756) Q Consensus 60 ~~~~~~~~~grS~----~v~~~v~~~~e~~~~~L~~~f~~~~~~~-~---~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~ 131 (756) |. ...|+-+ ..+-.....-.|- +++..++|+..+ . +.|..+. + ...+.|-+|+- . 
T Consensus 1 ~~----~~~~~~~~V~~~hp~y~a~~~~W~---~ird~~~G~~~~~~r~~yl~~~~~-~---~~e~~Y~~rl~------r 63 (489) T protein:vir:78 1 ML----TENGQGSGVKTKHREWLHYAPKWQ---KVRHALAGELVSYLRNVGLNEPDK-A---YGEARQAEYEA------G 63 (489) T ss_pred Cc----cCCCccCCCCccCHHHHHHHHHHH---HHHHHhcCcccccccCCCCCCCCC-C---CChHHHHHHHh------c Confidence 21 1223333 3333455555665 366777776531 1 3332211 0 01123555542 2 Q ss_pred chHHHHHHHHhhcCceEEEEeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHH Q lcl|NC_019423. 132 KLVDDYVHSIVDDGTGIARIGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFN 211 (756) Q Consensus 132 ~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~ 211 (756) -++.|++++.|..=.|.+-. +.++.. ....+ +...++-......+.+-.+..+.... T Consensus 64 A~~~n~~~~tl~~l~G~vfr-----------k~p~~~----~p~~l--------~~l~~d~D~~G~~L~~f~~~~~~~~l 120 (489) T protein:vir:78 64 GIVYNFTRRTLSGMVGSVMR-----------KEPEIN----IPKEL--------EYLLKNADGSGVGLIQHAQDTLMEID 120 (489) T ss_pred cccCChHHHHHHHHhchhhc-----------CCccee----ccHHH--------HHHHhccCCCCCCHHHHHHHHHHHHH Confidence 23556777776654443321 011111 11221 12222222233334444555666666 Q ss_pred hcCCcceeccCceeEE--Eee-eeecCceeEEEechhheEeCCCCcCc--cccCceEEEEeecCHHHHHhhccchhhhcc Q lcl|NC_019423. 212 ETGEATYAIQTGVTEV--EVE-KALVNRPTVEMLNPNNVVIDPSCNGD--LDKALYAVISFETCKADLMKNKDRYHNLDK 286 (756) Q Consensus 212 ~~G~~~~~~~~g~~~~--~~~-~~~~g~~~ie~V~p~~~~~Dp~a~~d--~~da~~v~~~~~~t~~el~~~~~~~~~l~~ 286 (756) .+|.+..-+-.-.... .-+ +...-+|.+..++|++|+ ++..... .....++..+..... T Consensus 121 ~~G~~~ilVD~P~~~~~T~ade~~~~~rPy~~~~~~~~Ii-nW~~~~v~G~~~Lt~v~lrE~~~~--------------- 184 (489) T protein:vir:78 121 SVGRGGLLVDAPETGAATAAEQNAGLLNPTIAFYTTENIV-NWRLTRVGSVNRVTMVVLRETWEY--------------- 184 (489) T ss_pred hcCeEEEEEeeCCCCCcCHHHHHHhcCCcEEEEechhhhc-CceeeeeCCccceeEEEEEEeEEe--------------- Confidence 6776554321100000 000 011126888889988886 4543221 001122211111000 Q ss_pred cCchhhhhhhchhhhccccccccccccccceEEEEEEEEEeeccCCceeEEEEEEE--ECCE------EE-EecccccCC Q lcl|NC_019423. 
287 IDWESSSPITDPDHESKTPSDFQFKDALRKKVVAYEYWGFYDINDDGSLEPIVATW--IGST------LI-RMENNPFPD 357 (756) Q Consensus 287 ~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~E~w~k~d~~~~g~~~~~~~~~--~g~~------~L-~~~~~P~~~ 357 (756) ......+......++||++. +.+|....++... -|+. ++ ..+.+ .. T Consensus 185 -----------------~d~~~~f~~~~~~q~RvL~~------~~~g~~~~~~~r~~~~g~~~~~~~~~~~~~g~~--~l 239 (489) T protein:vir:78 185 -----------------NEPGNEFETKYGEQYRVLDI------DSDGNYRQRLFRFDAEGGAQEDVVEIYPDLGES--LR 239 (489) T ss_pred -----------------ecCCCCccceeEEEEEEEec------CCCcceEEEEEEeecCCcccceeeEEeccCCCC--cc Confidence 00001122222233444431 1122111111110 1111 11 11112 23 Q ss_pred CccceEEeeeeeecCcccCCchHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhhhhcccccc--cc-- Q lcl|NC_019423. 358 GKLPLVVVPYMPRKRELFGEADAELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYE--YN-- 433 (756) Q Consensus 358 ~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~--~~-- 433 (756) +.+||+++..... +-..+.+..-++..+...+=...+-.-++++.++.|...+. |. +.........+.... ++ T Consensus 240 ~~IPfv~~~~~~~-~~~~~~pPLl~LA~lni~Hy~~ssd~~~~l~~~~~P~l~i~-G~-d~~~~~~~~~~~~~~i~~g~~ 316 (489) T protein:vir:78 240 GVIPFTFIGATNN-DATIDDAPLLPLAELNIGHYRNSADNEESSFVVGQPTLFIY-PG-ENLTPQAFKEANPNGIKFGSR 316 (489) T ss_pred CeeeEEEEecCCC-CCCCCcCchHHHHHHHHHHhhhhhHHHHHHHHcccceeeee-cC-ccCCcccccccCccceeeCCc Confidence 5567766554322 12224555666666654444444555667777777766543 22 111111111111100 00 Q ss_pred -ccccccccccccccCCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 434 -PMQGNPSQSIMEHKFPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKG 512 (756) Q Consensus 434 -~~~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~ 512 (756) +.....++...++++....-. -..|.-....|.. .|..-. .. 
+ .+.||++.+.-..+....|..++.+++++ T Consensus 317 ~~~~lp~~~~~~~ie~~~~~~~-r~~l~~le~qm~~-lGa~l~---~~-~-~~~Ta~~~~~~~~~~~S~L~~~a~~~e~a 389 (489) T protein:vir:78 317 RGHNLGYGGSAQLIQAGENNLA-RQNMLDKEQQAIQ-IGAQLI---TP-T-QQITAQSARIQRGADTSVMATIARNVSQA 389 (489) T ss_pred ccccCCCCCCcceeccCcchHH-HHHHHHHHHHHHH-Hhhhhc---cC-C-cchhHHHHHHHHHHhhHHHHHHHHHHHHH Confidence 011111222334433322211 2222222222222 232211 11 1 24788777776666777788888887766 Q ss_pred HHHHHHHHHHHHHhhCCC--CcEEEEecCceeecCHhHhcCcceEEEecccccHHHHHHHHHHHHHHHhhccCCHhHHHH Q lcl|NC_019423. 513 MADIGTKICAMNAVFLSE--KEVVRITNEQYVEIKREDLKGNFDIEVDINTAEIDNQKSQDLGFMVQTLGNTVDQSITLS 590 (756) Q Consensus 513 ~~~l~~~~l~li~q~~~~--~r~iRI~g~~~v~i~~d~~~~~~Dv~V~~g~a~~~~~~~q~l~~llq~~~~~~~~~~~~~ 590 (756) +.. +|.++..|... +.-+.| .+|+ +|++ ...+.+..+.++.+.+ +..+....... T Consensus 390 l~~----~l~~~a~w~G~~~~~~~~i------~~n~-----dF~~------~~~d~~~~~al~~~~~--~G~is~~t~~~ 446 (489) T protein:vir:78 390 YTD----ALRWVAVMLGKPEDTEVEF------RLNM-----DFFL------EPMTAQDRAAWMADIN--AGLLPATAYYA 446 (489) T ss_pred HHH----HHHHHHHHcCCCCCCceEE------Eeec-----ccCc------ccCCHHHHHHHHHHHh--cCCCCHHHHHH Confidence 654 45555555442 211222 1111 1221 1111122222333322 12344333332 Q ss_pred HHHH--HHhhcCChhHHHHhhhccCC-----CChhhhhHHHHHH Q lcl|NC_019423. 591 LVAK--IAELKRMPDLAHELRTWQPQ-----PDPMEEQLKQLAI 627 (756) Q Consensus 591 ~l~~--l~e~~~~~~~~~~l~~~~~q-----~~p~~~~~~q~~~ 627 (756) -|.+ +++. ...++...|....+. 
+.+.++..||.++ T Consensus 447 ~L~~~gv~d~-~~e~~~~ei~~~~~~~~~~~~g~~~~~~q~~~~ 489 (489) T protein:vir:78 447 ALRKAGVTDW-TDADIKDAVADQPLPVATEVQGEIPQSAQQQEK 489 (489) T ss_pred HHHhCCCCCc-cHHHHHHHHhhcCCCcccCCcccCCCCcccccC Confidence 2222 1110 111222233221100 0000011111111 No 140 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=83.95 E-value=0.064 Score=27.12 Aligned_cols=483 Identities=12% Similarity=0.090 Sum_probs=170.9 Q ss_pred CCc----------ccCCCCCCCccccccccCCCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCCCCC Q lcl|NC_019423. 1 MEH----------QDTFKPLPDPAQSEKLTDWKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAKPPKIKGR 70 (756) Q Consensus 1 ~~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~gr 70 (756) |.+ +.++-||+.|+...-.++|.| ++.-..++. .....|..--+.|-|+ ...+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~d---V~~~hp~y~-------a~~~~W~~ird~~~G~-----~~~r-- 63 (535) T protein:vir:80 1 MARKRTTIRRDVQSKVLIPPQAPPTSGLGPSLPN---VGYQRVEFG-------EMLPKWRKIMDCLSGQ-----EAIK-- 63 (535) T ss_pred CCcchhhhhhhhhhhcccCCCCcCCCCCCCCCCC---CCcCCHHHH-------HHHHHHHHHHHHhcCh-----HHHH-- Confidence 433 455667788877777777776 111111111 1112222222333322 1111 Q ss_pred CcccCHHHHHHHHHHHHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEE Q lcl|NC_019423. 
71 SQVQPRLVRRQAEWRYAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIAR 150 (756) Q Consensus 71 S~~v~~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k 150 (756) +.=+ .|-|.-+......+-.+.|-+|+- .-++.|++++.|..=.|.+- T Consensus 64 ---------~~g~-----------------~YLP~~~~~~~~~E~~~~Y~~rl~------rA~~~n~~~~tl~~l~G~vf 111 (535) T protein:vir:80 64 ---------AKRE-----------------EYLPMPSVDSRDEEQRRRYETYLQ------RAIFYNVTARTLDGMMGQVF 111 (535) T ss_pred ---------hccc-----------------ccCCCCCcccCCcCCHHHHHHHHh------hccCCChhHHHHHHHhchhh Confidence 1001 122322211111122334665552 23455667666655444322 Q ss_pred EeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccC-ceeE--E Q lcl|NC_019423. 151 IGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQT-GVTE--V 227 (756) Q Consensus 151 ~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~-g~~~--~ 227 (756) . + .++.. ..+.+ +...++-......+.+-.+..+.....+|.+..-+-+ ..+. . T Consensus 112 r------k-----~p~~~----~p~~l--------~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~iLVD~P~~~~~~t 168 (535) T protein:vir:80 112 S------R-----DPIRQ----LPPAL--------EAIVEDIDGEGVSLDQQAKKALGYTMGFGRAAIFTDYPNVGRPVT 168 (535) T ss_pred c------C-----Cccee----ccHHH--------HHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCccc Confidence 1 0 11110 11211 1222222222333444455556666667765543311 0000 0 Q ss_pred Eee-eeecCceeEEEechhheEeCCCCcCc--cccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccc Q lcl|NC_019423. 228 EVE-KALVNRPTVEMLNPNNVVIDPSCNGD--LDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKT 304 (756) Q Consensus 228 ~~~-~~~~g~~~ie~V~p~~~~~Dp~a~~d--~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 304 (756) ..+ +...-+|.+..++|++|+ ++..... 
.....++..+...+ T Consensus 169 ~ade~~~~~rPy~~~y~ae~Ii-nW~~~~v~G~~~Lt~v~lrE~~~---------------------------------- 213 (535) T protein:vir:80 169 VLEQKLGLYRPTITLVHPTSII-NWRTKLVGGKSVISLVVIQENVL---------------------------------- 213 (535) T ss_pred HHHHHhcCCCcEEEEechhhcc-CccccccCCccceeEEEEEEEEE---------------------------------- Confidence 000 111235889999999986 5543221 11122221111110 Q ss_pred cccccccccccceEEEEEEEEEeeccCCceeEEEEEEEEC-C--------EE-EEecccccCCCccceEEeeeeeecCcc Q lcl|NC_019423. 305 PSDFQFKDALRKKVVAYEYWGFYDINDDGSLEPIVATWIG-S--------TL-IRMENNPFPDGKLPLVVVPYMPRKREL 374 (756) Q Consensus 305 ~~~~~~~d~s~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g-~--------~~-L~~~~~P~~~~~~Pfv~~~~~~~~~~~ 374 (756) ..+..+.......++|++ .+.+|....++...-+ + ++ .....+ ..+.+||+++.... -+.. T Consensus 214 ~~dd~f~~~~~~q~RvL~------~~~~G~y~v~~~~~~~~~~~~~~~~~~~~~~~g~~--~l~~IPfv~~~~~~-~~~~ 284 (535) T protein:vir:80 214 AQDDGFETTYVQQWRVLQ------LNAEGNYQVERWRRETQEEMYYSYSKHVPTDGNGN--PFKEIPFQFIGPLD-NNAD 284 (535) T ss_pred ecCCCcccceeEEEEEEE------ecCCceEEEEEEEeecCCccccccceeecccCCCc--ccCeeEEEEeecCC-CCCC Confidence 000111111112233332 2222222211111101 0 01 111122 23556776543222 2333 Q ss_pred cCCchHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhh-----hhcccccccccccccccccccccc-- Q lcl|NC_019423. 375 FGEADAELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRR-----YDDGQDYEYNPMQGNPSQSIMEHK-- 447 (756) Q Consensus 375 ~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~-----~~~~~~~~~~~~~~~~~~~i~~~~-- 447 (756) .+...+..+..++..+=...+-..+.++.++.|...+. |..+...... ...+... .+....+...++.. 
T Consensus 285 ~~~pPLl~LA~lni~Hy~~ssd~~~il~~~~~P~l~i~-G~~~~~~~~~~~~~~i~iG~~~---~~~lP~~~~~~~~e~~ 360 (535) T protein:vir:80 285 IDHPPLLDLCEVNIGHYRNSADYEEMAFVAGQPTAFFT-GLTKDWVEDVFKDFKVHLGSRA---IIPLPQGATAGILQIT 360 (535) T ss_pred CCccchHHHHHHHHHHhhchhHHHHHHHHhcCceeeee-cCchhhhhcCCCCcceEecCcc---cccCCCCCCcceeeec Confidence 46667778888887766655666677888888765543 2221110000 0111111 11111122233333 Q ss_pred CCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019423. 448 FPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAKGMADIGTKICAMNAVF 527 (756) Q Consensus 448 ~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~ 527 (756) ...++. ..++...+.|.. .|..-...+ ..+.||++.+.-..+....|..++.|+++++..+ |.++..| T Consensus 361 ~~~~a~---~~l~~~e~qM~~-lGa~ll~~~----~~~~Ta~~a~~~~~~~~S~L~~~a~~le~al~~a----L~~~A~w 428 (535) T protein:vir:80 361 PNSVPF---EAMTHKESQMIA-MGANLLVKS----GGNRTFGEAQQEEASEQSILSACTKNVSMAFRKA----LRWANQF 428 (535) T ss_pred cchhHH---HHHHHHHHHHHH-HHHHhhccC----cccccHHHHHHHHHHHhHHHHHHHHHHHHHHHHH----HHHHHHH Confidence 233332 223434444433 233222221 2246777776555555666777777777776554 4555555 Q ss_pred CCC---CcEEEEe-cCceeecCHhHhcCcceEEEecccccHHHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCCh- Q lcl|NC_019423. 528 LSE---KEVVRIT-NEQYVEIKREDLKGNFDIEVDINTAEIDNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMP- 602 (756) Q Consensus 528 ~~~---~r~iRI~-g~~~v~i~~d~~~~~~Dv~V~~g~a~~~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~- 602 (756) +.. +.-+.|+ +.+|+. ...+....+.++.+.+. ..+.......-| ...++. 
T Consensus 429 ~G~~~~~~~~~i~~n~dF~~------------------~~ld~~~~~all~~~~~--G~Is~et~~~~L----~r~gvl~ 484 (535) T protein:vir:80 429 QTGIVNDETVEYNLNTDFPA------------------ARLTPNERAELILEWQQ--GAITFKEMRAGL----RRAGVAS 484 (535) T ss_pred cCCccCCCceEEEecccccc------------------ccCCHHHHHHHHHHHhc--CCCCHHHHHHHH----HhCCCCC Confidence 431 1112221 222211 00111122222222221 122222221111 111110 Q ss_pred ------hHHHHhhhc----c---CC-CChhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 603 ------DLAHELRTW----Q---PQ-PDPMEEQLKQLAIQKAQLENEELQSKIALNNAKAKEAASSG 655 (756) Q Consensus 603 ------~~~~~l~~~----~---~q-~~p~~~~~~q~~~~~aq~e~~~~qa~a~~~~a~a~~~~aq~ 655 (756) +....++.- . +. -+.... .....-+.. .+..+.+ +-. T Consensus 485 ~~~~~eee~~ri~~E~~~~~~~~g~~~d~~~~-----g~~~~~~~~---------~~~~~~~--~~~ 535 (535) T protein:vir:80 485 EDDAKAETEGKATVEFIAKTAAAGKVGDAASG-----GTNKAKLNN---------GNGGGNQ--AGN 535 (535) T ss_pred cccchHHHHHHHHhhhhhccccCCCCCCCCCC-----CCCcCcccC---------Ccccccc--CCC Confidence 000011000 0 00 000000 000000000 0000000 000 No 141 >protein:vir:104437 Length: 714 # NCBI annotation: putative phage portal protein # Family: family:all:487 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794061;genbank:gi:116222006;genbank:GeneID:4397502 Probab=72.11 E-value=0.19 Score=24.58 Aligned_cols=597 Identities=12% Similarity=0.060 Sum_probs=173.2 Q ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCCCCCCcccCHHHHHHHHHHHHHHH-HhhcCCCCEE Q lcl|NC_019423. 22 WKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAKPPKIKGRSQVQPRLVRRQAEWRYAPLS-EPFLSSSKLF 100 (756) Q Consensus 22 ~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~grS~~v~~~v~~~~e~~~~~L~-~~f~~~~~~~ 100 (756) |.++-....+..+-.+.... ..+.+.+|. .++..+-+|+--..- .-|..|+. 
T Consensus 1 ~~~~~~~~~~~~~~~~~~~~-------~~~~l~~~~------------------~~~~~~~~~r~~a~~d~~fy~G~Q-- 53 (714) T protein:vir:10 1 MKNEINTTAMKNDHGSTPRF-------SQRQLLSLC------------------SDIDSQPLWRDAANKACAYYDGDQ-- 53 (714) T ss_pred CCcCcCcccCCCcchhhhhh-------hHHHHHHHH------------------HHHhhhHHHHHHHHHHHHhhcCCC-- Confidence 22211111111111100000 001111111 122222233321111 12334443 Q ss_pred EEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeeeeeeeeeecCCC--CCHHHH Q lcl|NC_019423. 101 KLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIKTETPVFQLYPI--ENQEQA 178 (756) Q Consensus 101 ~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~~--~~~~~~ 178 (756) |.++..+. + ...+.-.+.+|.|+-.+..-.|..+ ..+. . ..+...+. .+.+.+ T Consensus 54 ------w~~~~~~~--------l--~~~g~p~~~~N~i~~~v~~v~g~~~-----~nr~---~-~~v~pr~~~~~~~~~A 108 (714) T protein:vir:10 54 ------LAPEVIQV--------L--KDRGQPMTIHNLIAPTVDGVLGMEA-----KTRT---D-LIVMSDDPNDETEKLA 108 (714) T ss_pred ------CCHHHHHH--------H--HhcCCCcEEeccHHHHHHHHHHHHH-----hCCc---c-eEEecCCCChhhHHHH Confidence 33322222 1 2233444556666666666556222 2221 1 22333222 233566 Q ss_pred HHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEEEechhheEeCCCCcCccc Q lcl|NC_019423. 179 DVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVEMLNPNNVVIDPSCNGDLD 258 (756) Q Consensus 179 ~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie~V~p~~~~~Dp~a~~d~~ 258 (756) +.+...+..+.+.- ........++.+.+.+|.|+ .++. ++++.+--++.++. ++ T Consensus 109 e~l~~~~~~~~~~~-----~~~~~~s~af~~~~~~G~G~-------~~~~-------------~d~d~~~~~i~i~~-v~ 162 (714) T protein:vir:10 109 EAINAEFADACRLG-----NMNKARSDAYAEQIKAGLSW-------VEVR-------------RNSEPFGPEFKVST-VS 162 (714) T ss_pred HHHHHHHHHHHHhh-----chhHHHHHHHHHhhhcccce-------EEee-------------eccCCCCCCeEEEe-cC Confidence 77766665543321 12223344556666665443 2211 11111111111111 11 Q ss_pred cCceEEE--EeecCHHHHHhhcc----chhhhcc-cCchhhhhhhchhhhccccccccc-cccccceEEEEEEEEEeecc Q lcl|NC_019423. 
259 KALYAVI--SFETCKADLMKNKD----RYHNLDK-IDWESSSPITDPDHESKTPSDFQF-KDALRKKVVAYEYWGFYDIN 330 (756) Q Consensus 259 da~~v~~--~~~~t~~el~~~~~----~~~~l~~-~~~~~~~~~~~~~~~~~~~~~~~~-~d~s~~~V~v~E~w~k~d~~ 330 (756) --.+++- .+..++++.....- ..+.+.. +...... ............+... .......+.-++.+..++.. T Consensus 163 p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fp~~a~~-i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 241 (714) T protein:vir:10 163 RNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQV-IDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQ 241 (714) T ss_pred hhheeeccccccCChhhhhhhhhhccCCHHHHHHhcCCchhh-hhccchhhcCcccchhhhhhcccccccchhhcccccc Confidence 0111221 23445555544321 1111111 1111100 0000000000000000 00001111122222222221 Q ss_pred CCc--------e--eEEE-E------EE--EECCEEEEecccc-------------------------c------CCCc- Q lcl|NC_019423. 331 DDG--------S--LEPI-V------AT--WIGSTLIRMENNP-------------------------F------PDGK- 359 (756) Q Consensus 331 ~~g--------~--~~~~-~------~~--~~g~~~L~~~~~P-------------------------~------~~~~- 359 (756) .+. + .+++ + |. ..|..+.....++ | ..+. T Consensus 242 ~~~~~~~~~~rV~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~ 321 (714) T protein:vir:10 242 QNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPC 321 (714) T ss_pred cccccccCcceEEEEEEEEeEEEEEEeecCCCCCeeeeCccCHHHHHHHHhccceecccceeeEEEEEEecchhhhcCCC Confidence 111 1 1111 1 00 0122222222221 0 1111 Q ss_pred -cceEEeeeeeecCcccCCchHHHhHHHHHHHHHHHHHHHHHHHhhcCCc----eEee-ccccCccchhhh---hcccc- Q lcl|NC_019423. 360 -LPLVVVPYMPRKRELFGEADAELLGDNQAILGATMRGMIDLLGRSANGQ----RGYP-KGMLDTLNRRRY---DDGQD- 429 (756) Q Consensus 360 -~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~----~~~~-~gav~~~~~~~~---~~~~~- 429 (756) ||+..|++.|+.+.+. +.....--++|.++|+-...|+.. +++. ++.+-....... ..... 
T Consensus 322 p~p~~~fp~vP~~g~~~---------~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l~~~~~~~~~gav~~~d~~~~e~~ 392 (714) T protein:vir:10 322 SAPQGMFPLVPFWGYRK---------DKTGEPYGLISRAIPAQDEVNFRRIKLTWLLQAKRVIMDEDATQLSDNDLMEQL 392 (714) T ss_pred CCCCCceeeEEecceee---------eccCccceehhhhhhHHHHHHHHHHHHHHHHhCCceeeccccccccHHHHHHhc Confidence 3334455555433322 111112234455555543333211 1111 111110000000 00000 Q ss_pred cc-ccccccccccc-----cccccCCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHH Q lcl|NC_019423. 430 YE-YNPMQGNPSQS-----IMEHKFPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREM 503 (756) Q Consensus 430 ~~-~~~~~~~~~~~-----i~~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~ 503 (756) .. .+.+..++... ...+.+.+.++-....++++...... -...+|.+..++|...+++|++..++.+..+ T Consensus 393 ~rp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~----i~~~tGv~~~~lG~~~na~SGvAI~~r~~qg 468 (714) T protein:vir:10 393 ERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKL----IQDTMGVYSAFLGQDSGATSGVAISNLVEQG 468 (714) T ss_pred cCCCCeEEecccccccCCccccccccCCCCCcHHHHHHHHHHHHH----HHHhhCCCHHHcCCCcchhHHHHHHHHHHHH Confidence 00 11122222110 11122222233344445544444444 4455799888888888889998777666554 Q ss_pred H-HHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHh-HhcCcc-eEEEec--ccccHHHHHHHHHHHH-HH Q lcl|NC_019423. 504 A-ILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKRE-DLKGNF-DIEVDI--NTAEIDNQKSQDLGFM-VQ 577 (756) Q Consensus 504 ~-~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d-~~~~~~-Dv~V~~--g~a~~~~~~~q~l~~l-lq 577 (756) . .+..|-+.++.-.+.+.+++..+...- .+.+..+.|... +-.+.. -+.++. +.+...+. +.-. .. 
T Consensus 469 ~~~l~~~~dnl~~~~~~~g~~ll~li~~~----~~~~rv~RI~~e~~~~~~~~~~~~n~~~~~~~~~nD----i~~~~~d 540 (714) T protein:vir:10 469 ATTLAEINDNYQFACQQVGRLLLAYLLDD----LKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTND----ISRLNTH 540 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH----cCCCcEEEEeccCCCcccceeEeeccccCCcccccc----ceeeeEE Confidence 3 234455666667777777777654321 112233444221 111111 122221 11110000 0000 00 Q ss_pred H---hhccCCHhHHHHHHHHHHhhcCC--hhH----HHHhhhccCCCCh--hh----------hhHH--HHHHHHHHHH- Q lcl|NC_019423. 578 T---LGNTVDQSITLSLVAKIAELKRM--PDL----AHELRTWQPQPDP--ME----------EQLK--QLAIQKAQLE- 633 (756) Q Consensus 578 ~---~~~~~~~~~~~~~l~~l~e~~~~--~~~----~~~l~~~~~q~~p--~~----------~~~~--q~~~~~aq~e- 633 (756) . .+|. .+......+..|+++... |+. ..++-....-|.. .. ...+ +++++++++. T Consensus 541 v~i~~~p~-~~s~r~~~~~~l~ql~~~~~p~~~~~~~~~~le~~d~p~~~ei~~~ir~~~~~~~~~~~~~~e~q~~q~~~ 619 (714) T protein:vir:10 541 IALAPVQQ-TPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQ 619 (714) T ss_pred EEEeeccC-cHHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCcCHHHHHHHHHHHcCCCCCccccCcchhHHHHHH Confidence 0 0011 111122222333332221 211 1111111111111 10 0001 1111111111 Q ss_pred --HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHH----hhccCC-c Q lcl|NC_019423. 634 --NEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKHARDMEKQK--AQSQGNQNLQITKALTTP----TKEGET-T 704 (756) Q Consensus 634 --~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~~~~~~~~~--~q~~~~~~~~~~~a~~~~----~~~~~~-~ 704 (756) +++.+++.+..+.+++..+.++++++++.+++..+.+.+..+.... ...++..+.++.+.+... +..+.. . T Consensus 620 ~~~~~~q~~l~~~e~~a~~~k~eaea~~~~aqa~~~~~~a~~~~~~~~~q~~~~~~~~a~~a~~l~~~~~~~q~~~~~~q 699 (714) T protein:vir:10 620 QALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQNMEQEQDVLQQ 699 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHH Confidence 1122222222222333333333333333222222221111111111 111222222221111110 111101 0 Q ss_pred hhhhccCCCCCCCcccCchhcCCCCCCCCcccc Q lcl|NC_019423. 
705 PNISAAVGYNTLTNGNSPQERDLAAQQDPAYSL 737 (756) Q Consensus 705 ~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~ 737 (756) ..++.. .+-+. +-.| T Consensus 700 ~~~q~~------~~~~~------------~~~~ 714 (714) T protein:vir:10 700 QMLYTL------QQRMN------------EMSL 714 (714) T ss_pred HHHHHH------HHHHH------------hcCC Confidence 111110 01000 1111 No 142 >protein:vir:100920 Length: 725 # NCBI annotation: Gp1 # Family: family:all:487 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006406;genbank:gi:46358698;genbank:GeneID:2777070 Probab=67.04 E-value=0.26 Score=23.81 Aligned_cols=615 Identities=12% Similarity=0.004 Sum_probs=194.3 Q ss_pred HHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCCCCCCcccCHHHHHHHHHHHHHHHHh-hcCCCCEEEEecCC Q lcl|NC_019423. 28 IQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAKPPKIKGRSQVQPRLVRRQAEWRYAPLSEP-FLSSSKLFKLTPVT 106 (756) Q Consensus 28 ~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~grS~~v~~~v~~~~e~~~~~L~~~-f~~~~~~~~~~p~~ 106 (756) ++.-+.+++.++++....+.. +-+|+--.+-.. |..|+ . T Consensus 1 m~d~~~~~~~~~~~~~~~~~~--------------------------------~~~~R~~a~~d~~fy~G~--------Q 40 (725) T protein:vir:10 1 MADNENRLESILSRFDADWTA--------------------------------SDEARREAKNDLFFSRVS--------Q 40 (725) T ss_pred CCchHHHHHHHHHHHHHHHHh--------------------------------hHHHHHHHHHHHHhhcCC--------C Confidence 333333344433333322221 112222222211 22222 4 Q ss_pred cchHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHH Q lcl|NC_019423. 107 FEDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQ 186 (756) Q Consensus 107 ~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 186 (756) |.++..+. + +..|.+ +.|.|+-.+..-.|. ...+++ .+.+...+..+.+.++.+...+. 
T Consensus 41 W~~~~~~~-----l------~~q~rp-~~N~i~~~v~~v~g~-----e~~nr~----d~~v~p~~~~d~~~Ae~l~~~~~ 99 (725) T protein:vir:10 41 WDDWLSQY-----T------TLQYRG-QFDVVRPVVRKLVSE-----MRQNPI----DVLYRPKDGASPDAADVLMGMYR 99 (725) T ss_pred CCHHHHHH-----H------HhcCCC-cccchHHHHHHHHhh-----HHhCCc----ceEEecCCcchHHHHHHHHHHHH Confidence 44433331 1 222333 335555555444441 112222 22344456778888888888887 Q ss_pred HhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEEEe-chhheEeCCCCcCccccCceEEE Q lcl|NC_019423. 187 LQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVEML-NPNNVVIDPSCNGDLDKALYAVI 265 (756) Q Consensus 187 ~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie~V-~p~~~~~Dp~a~~d~~da~~v~~ 265 (756) .+.+.- ........++.+.+.+|.|+. ++..+-........+.+ ....+++||..- .+| .. T Consensus 100 ~~~~~~-----~~~~~~s~Af~~~i~~G~G~~-------ev~~d~~~~d~~~~~~~i~~~~i~~~~~~v-~~D-----p~ 161 (725) T protein:vir:10 100 TDMRHN-----TAKIAVNIAVREQIEAGVGAW-------RLVTDYEDQSPTSNNQVIRREPIHSACSHV-IWD-----SN 161 (725) T ss_pred HHHHhc-----CcchHHhHHHHHHhhcCccee-------eeeccccCCCCCCCceeeeeeecccCHhHc-ccC-----ch Confidence 764432 222334456666777766543 32222111111111111 111112222211 011 11 Q ss_pred EeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceEEEEEEEEEeeccCCceeEEEEEEEEC- Q lcl|NC_019423. 266 SFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVAYEYWGFYDINDDGSLEPIVATWIG- 344 (756) Q Consensus 266 ~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g- 344 (756) .+..++++.+... ....++....+. -........ ... ..+.+ .+. ..++ |. +.+.-.+.++++..+.. 
T Consensus 162 a~~~D~sDar~~~-~~~~~~~~~~~~-~~~~~~~~a-~~~--~~~~~-~~~--~~~~-~~--~~~~vrv~E~~~r~~~~~ 230 (725) T protein:vir:10 162 SKLMDKSDARHCT-VIHSMSQNGWDD-FAEKYDLDA-DNI--PSFQN-PND--WVFP-WL--TQDTIQIAEFYEVVEKKE 230 (725) T ss_pred hhccChhhhhhhh-hhccCCHHHHHH-HHHhCCCcc-ccc--ccccc-ccc--cccc-cc--CCCeEEEEEEEEEEEEee Confidence 3344445443321 222222100000 000000000 000 00000 000 0111 21 11111222332222111 Q ss_pred ----------CEEEEecccccC-----------------------------------CC--ccceEEeeeeeecCcccC- Q lcl|NC_019423. 345 ----------STLIRMENNPFP-----------------------------------DG--KLPLVVVPYMPRKRELFG- 376 (756) Q Consensus 345 ----------~~~L~~~~~P~~-----------------------------------~~--~~Pfv~~~~~~~~~~~~G- 376 (756) +.++...++.+. ++ .+|.-.|++.|.-+..++ T Consensus 231 ~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~~~~~~~~~~fP~vP~~g~r~~~ 310 (725) T protein:vir:10 231 TAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFV 310 (725) T ss_pred EEEEeccCCCCceeecchhhhHHHHHHhhcccchhhhhccceeeEEEEEeecchhhhcCCCCCCCCceeEEEEEeeeecc Confidence 122221111100 00 112222333333222210 Q ss_pred CchH---HHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhhhhcccccccccccccc----ccccc--ccc Q lcl|NC_019423. 377 EADA---ELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQGNP----SQSIM--EHK 447 (756) Q Consensus 377 ~g~v---~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~~----~~~i~--~~~ 447 (756) .|.. -.++++-+.-......+.-.+...+..+-....+..+..+..............+..++ ++.+. .+. T Consensus 311 ~g~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~e~~~~~~~~~~~~~~~~~~~~~g~~~~~~i~ 390 (725) T protein:vir:10 311 EDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEMPTQPLA 390 (725) T ss_pred CCcceeeeeeccchhHHHHHHHHHHHHHHHHHhcCCccccccHhhhhHHHHHHhccCCceeeecccccccCcccccccCc Confidence 1111 12233333333333333334444444444444544444444332221111111111111 01111 111 Q ss_pred CCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019423. 
448 FPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAI-LRRLAKGMADIGTKICAMNAV 526 (756) Q Consensus 448 ~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~-~~n~~~~~~~l~~~~l~li~q 526 (756) ....++-....++++..... .-...+|.++.++|...+++|++..++.+..+.. .-.|-+.++.-.+.+.+++.. T Consensus 391 ~~~~~~~p~~~~~ll~~~~~----~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~ 466 (725) T protein:vir:10 391 YYENPEVPQANAYMLEAATA----AVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQS 466 (725) T ss_pred ccCCCCchHHHHHHHHHHHH----HHHHHhCCCHHHhCcCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11223333344444444444 4456679988888888888999887777665543 344555666666666666665 Q ss_pred hCCCCcEEEEecCceeecCHhHhcCcceEEEecc---cccHHHHHHHHHHHHHHH---hhccCCH--hHHHHHHHHHHhh Q lcl|NC_019423. 527 FLSEKEVVRITNEQYVEIKREDLKGNFDIEVDIN---TAEIDNQKSQDLGFMVQT---LGNTVDQ--SITLSLVAKIAEL 598 (756) Q Consensus 527 ~~~~~r~iRI~g~~~v~i~~d~~~~~~Dv~V~~g---~a~~~~~~~q~l~~llq~---~~~~~~~--~~~~~~l~~l~e~ 598 (756) +...- .+.+..+.|...+-. .--+.++.. +.++.......+..-+.. .+|..+- +.....|.+++.. T Consensus 467 lI~~~----~~~er~~RI~~edg~-~~~v~in~~~~d~~~G~~v~~Ndi~g~~Dv~v~~~p~~~s~r~~~~~~l~qll~~ 541 (725) T protein:vir:10 467 IVNDI----YDVPRNVTITLEDGS-EKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRSEILELLGK 541 (725) T ss_pred HHHHH----cCCCcEEEEecCCCC-cceeEeccccccccccchhhhhccccceeEEEeeccCcHHHHHHHHHHHHHHHHh Confidence 43321 122234455443211 111223321 111111000001000000 1111110 1111222223322 Q ss_pred cC--ChhHHHHhhhccCCCC---------------hhhhhHH---HHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 599 KR--MPDLAHELRTWQPQPD---------------PMEEQLK---QLAIQ--KAQLENEELQSKIALNNAKAKEAASSGD 656 (756) Q Consensus 599 ~~--~~~~~~~l~~~~~q~~---------------p~~~~~~---q~~~~--~aq~e~~~~qa~a~~~~a~a~~~~aq~~ 656 (756) .. 
.+.....+..+.+-++ +.+...+ +.+++ ..+++.++.++.+.+.+++++.++++++ T Consensus 542 ~~~~~~~~~~~l~~~~~~~d~~~~~e~~erirkq~~~~~~~~~~~~e~~q~~~e~qq~~~~q~~~e~~q~~~~~~~~qae 621 (725) T protein:vir:10 542 TPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQGVLLQGQAE 621 (725) T ss_pred ccccchhHHHHHHHHhhcCCchhHHHHHHHHHhhhhhhccCCccccchhHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHH Confidence 21 1111111111111111 0000000 01110 1111222233333344444444444444 Q ss_pred HHHHHHHHHHHH---HHHHHHHHHH-------HHHHHHHHHHHHH---HHHHHHhh------ccCCchhhhccCCCCCCC Q lcl|NC_019423. 657 LKDLDYLEQESG---TKHARDMEKQ-------KAQSQGNQNLQIT---KALTTPTK------EGETTPNISAAVGYNTLT 717 (756) Q Consensus 657 ~~~~~~~~q~~~---~k~~~~~~~~-------~~q~~~~~~~~~~---~a~~~~~~------~~~~~~~~~~a~~~~~~~ 717 (756) +++++......+ .+...+-++. ..++.+.++.+.. ++....+. ..+....+++.. T Consensus 622 ~~ka~aE~~k~~~~a~~~~~~a~~~a~~~~~~~~q~~~~q~~~~~~~~~~~~~~q~~~~~~~~~~ae~~~~~~~------ 695 (725) T protein:vir:10 622 LAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDARANAELLLKGNE------ 695 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHH------ Confidence 433322211111 1111111111 1122212221111 11111111 111111222210 Q ss_pred cccCchhcCCCCCCCCccccccccccCCCCCCCCCC Q lcl|NC_019423. 718 NGNSPQERDLAAQQDPAYSLGSQYYDPSQDPASALG 753 (756) Q Consensus 718 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 753 (756) - -+++ -.....+.........|.+. 
+--|- T Consensus 696 ~---~~~~--~~~~~~~~~~q~~~~~~~~~-~~~~~ 725 (725) T protein:vir:10 696 Q---THKQ--RMDIANILQSQRQNQPSGSV-AETPQ 725 (725) T ss_pred H---HHHH--HhhhhhccccccccCCCccc-ccCCC Confidence 0 0011 01122333444455555333 33333 No 143 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=60.97 E-value=0.36 Score=23.00 Aligned_cols=604 Identities=10% Similarity=0.036 Sum_probs=146.9 Q ss_pred CCCCCCCCC-----CCcccCHHHHHHHHHHHHHHH-------HhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhc Q lcl|NC_019423. 61 KAKPPKIKG-----RSQVQPRLVRRQAEWRYAPLS-------EPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQL 128 (756) Q Consensus 61 ~~~~~~~~g-----rS~~v~~~v~~~~e~~~~~L~-------~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~ 128 (756) -++..+.+. .-+.|..+|....+|.=..|. +.++|...-... ++ -.+++...-. T Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~y~g~~~~~~~----~~----------~s~~~~~~v~ 66 (705) T protein:vir:88 1 MAKRRKIKPMDDEQVLRHLDQLVNDALDFNSSELSKQRSEALKYYFGEPFGNER----PG----------KSGIVSRDVQ 66 (705) T ss_pred CCcccccccCCHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHhCCCCCccc----CC----------CCccccHHHH Confidence 112111111 113455555555554332222 222221110000 00 0000000000 Q ss_pred CCcchHHHHHHHHhhc----CceEEEEeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHH Q lcl|NC_019423. 129 NKVKLVDDYVHSIVDD----GTGIARIGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIK 204 (756) Q Consensus 129 ~~~~~~~~~v~~al~~----g~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~ 204 (756) +.+ +|+...|+. +.=+++ +..+.-.|.+.+.....++..++.+... ....+. 
T Consensus 67 ~~v----~~~~~~l~~~~~~~~~~~~----------------~~p~~~~D~~~a~~~~~~~~~~~~~~~~----~~~~~~ 122 (705) T protein:vir:88 67 ETV----DWIMPSLMKVFTSGGQVVK----------------YEPDTAEDVEQAEQETEYVNYLFMRKNE----GFKVMF 122 (705) T ss_pred HHH----HHHHHHHHHhhcCCCceEE----------------EeeCChhHHHHHHHHHHHHhHHHhhccc----hhHHHH Confidence 011 344444432 333333 2223345666777777777654433221 112334 Q ss_pred HHHHHHHhcCCcceeccCceeEEEeeeeecCceeEEEechhheEeCCCCc----CccccCceE-----------EEEeec Q lcl|NC_019423. 205 EAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVEMLNPNNVVIDPSCN----GDLDKALYA-----------VISFET 269 (756) Q Consensus 205 ~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie~V~p~~~~~Dp~a~----~d~~da~~v-----------~~~~~~ 269 (756) ..+.+...+|.|+.+++|-.+.-.......+-+ .+..-.++.||.+. ++.++..|- ++...+ T Consensus 123 ~~~~dal~~g~gi~kv~we~~~~~~~e~~~~~~---~~~l~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i~~V 199 (705) T protein:vir:88 123 DWFQDTLMMKTGVVKVYVEEVLKPTFERFSGLS---EDMVADILSDPDTSILAQSVDDDGTYTIKIRKDKKKREIKVLCV 199 (705) T ss_pred HHHHHHhhcCCeEEEeccccccchhhhhhccCC---hhhhhhhhhhhhhhcccccccccceeeeEEeeeeecCceeeeec Confidence 556677899999998865221111001111111 11122344455431 111111110 011122 Q ss_pred CHHHHHhhcc----------------chhhhcc--cCchhhhhhhchhhhcc--cc--ccccccccccceEEEEEEEEEe Q lcl|NC_019423. 270 CKADLMKNKD----------------RYHNLDK--IDWESSSPITDPDHESK--TP--SDFQFKDALRKKVVAYEYWGFY 327 (756) Q Consensus 270 t~~el~~~~~----------------~~~~l~~--~~~~~~~~~~~~~~~~~--~~--~~~~~~d~s~~~V~v~E~w~k~ 327 (756) +..++.-... .+..|.. ++.+........+.... .. ...+..+.+. ...+.+.|... 
T Consensus 200 ~p~d~~~dp~a~~~~d~~~~~~~~~~t~~dl~~~g~~~~~~~~~~~~~~~~~~~~~e~~~~~~~d~~~-~~~~~~~~~~~ 278 (705) T protein:vir:88 200 KPENFLVDRLATCIDDARFLCHREKYTVSDLRLLGVPEDVIEELPYDEYEFSDSQPERLVRDNFDMTG-QLQYNSGDDAE 278 (705) T ss_pred cHHHceecCCCCCcccCcEEEEEEeccHHHHHhhcCChhHhhhhhcccccchhhhhhhcccccccccc-ccccccccccC Confidence 2222211100 0001100 00000000000000000 00 0000001111 11122222111 Q ss_pred eccCCceeEEEEEEE----ECCEEEEecccccCCCc----cceEEeeeeeecCccc-CCchHHHhHHHHHHHHHHHHHHH Q lcl|NC_019423. 328 DINDDGSLEPIVATW----IGSTLIRMENNPFPDGK----LPLVVVPYMPRKRELF-GEADAELLGDNQAILGATMRGMI 398 (756) Q Consensus 328 d~~~~g~~~~~~~~~----~g~~~L~~~~~P~~~~~----~Pfv~~~~~~~~~~~~-G~g~v~~~~d~Q~~iN~~~~~~~ 398 (756) . ....+++-|.+ -|+.+.+.....|..+. .|+-.+++...+--.. +..+...+.+.-.-+-...+.+. T Consensus 279 ~---~r~v~~~E~y~~~d~~~d~~~~~~~~~~~g~~il~~~~~~~~PF~~~~~~p~~~~~~G~g~~~~~~d~Q~~~n~~~ 355 (705) T protein:vir:88 279 A---NREVWASECYTLLDVDGDGISELRRILYVGDYIISNEPWDCRPFADLNAYRIAHKFHGMSVYDKIRDIQEIRSVLM 355 (705) T ss_pred C---ceeEEEEEeeeEecccCCcceeeEEEEEeCccccccccCCCCCEEEecceeecCccccCChHHHHhHHHHHHHHHH Confidence 0 00001111110 11111110000111111 1111111111111111 11222333444444444444333 Q ss_pred HHHHhhcCCceEeeccccCccchhhhhccccccccccccccccccccccCCCcc-hHHHHHHHHHHHHHHHHhchhHHhc Q lcl|NC_019423. 399 DLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQGNPSQSIMEHKFPELP-QSAIVMTQMQNQEAESLTGVKAFSG 477 (756) Q Consensus 399 d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~-~~~~~~l~~~~~~~e~~tGv~~~~~ 477 (756) ..+.-..+ ........+.. +...........+++.+.+.....+. -....+-+....+++.+...-+... 
T Consensus 356 ~~~~d~~~-~~~~~~~~~~~--------g~v~~~d~~~~~pg~vv~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~t 426 (705) T protein:vir:88 356 RNIMDNIY-RTNQGRSVVLD--------GQVNLEDLLTNEAAGIVRVKSMNSITPLETPQLSGEVYGMLDRLEADRGKRT 426 (705) T ss_pred HHHHHHHH-hccCCceeccc--------cccCcccccccCCCeeEEecCCCccccccCCcCcHHHHHHHHHHHHHHHHhh Confidence 32211111 11111111111 00001111222344444332222110 0011122223333444444556677 Q ss_pred CCCccccchhHHHHHHHHHHHH-----HHHHHHHHHHHHHH-HHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHhcC Q lcl|NC_019423. 478 GVTGSAYGDVAAGIRGALDAAS-----KREMAILRRLAKGM-ADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDLKG 551 (756) Q Consensus 478 G~~~~a~~~tA~~i~~~~~aa~-----~~l~~~~~n~~~~~-~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~~~ 551 (756) |++....|..+..+..-..++. ..-...++.+.+-+ +...+.++.++....-+- ...+..+.|....... T Consensus 427 Gi~~~~~G~~~~~~~~~~Ta~~i~~~~~~~~~r~~~~~r~~a~~~~~~l~~~~~~li~~~----~~~~~~~ri~g~~v~v 502 (705) T protein:vir:88 427 GITDRTRGLDQNTLHSNQAAMSVNQLMTAAEQQIDLIARMFAETGVKRLFQLLHDHAIKY----QNQEEVFQLRGKWVAV 502 (705) T ss_pred CCchHHcCCCcccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----CCCceEEeeccchhcc Confidence 8876554422211211111111 11111122222222 222334444444332211 0112234443321111 Q ss_pred cc-eEEEe--cccccHHHH-HHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCChhHHH----HhhhccC------CCCh Q lcl|NC_019423. 552 NF-DIEVD--INTAEIDNQ-KSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMPDLAH----ELRTWQP------QPDP 617 (756) Q Consensus 552 ~~-Dv~V~--~g~a~~~~~-~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~~~~~----~l~~~~~------q~~p 617 (756) +- +...+ +........ ...+.++.+..+..... .+.. ...+....+.....+ ..+.... 
..++ T Consensus 503 ~~~~~~~~~~v~v~v~~~~~~~eq~~a~l~~ll~~~q-~l~~--~~~~~~~~~~~~~~~~~~el~e~~~~k~~~~~~~~~ 579 (705) T protein:vir:88 503 NPANWRERSDLTVTVGIGNMNKDQQMLHLMRIWEMAQ-AVVG--GGGLGVLVSEQNLYNILKEVTENAGYKDPDRFWTNP 579 (705) T ss_pred chHhhccCCceEEeeccccchHHHHHHHHHHHHHHHH-Hhhc--ccchhhhcChHHHHHHHHHHHHhhhhhhHHHHhhhh Confidence 00 00000 000111111 11122222222211000 0000 001111111111111 1111110 1122 Q ss_pred hhhhHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------------HHHHHHHHH Q lcl|NC_019423. 618 MEEQLKQLAIQ----KAQLENEELQSKIALNNAKAKEAASSGDLKDLDYLEQESGTKH-------------ARDMEKQKA 680 (756) Q Consensus 618 ~~~~~~q~~~~----~aq~e~~~~qa~a~~~~a~a~~~~aq~~~~~~~~~~q~~~~k~-------------~~~~~~~~~ 680 (756) ...++.+.+++ +.+++..++++++...+++++...+++++.. .+++.+.++ .++++...+ T Consensus 580 ~~~e~~~~~~~~~q~e~~~~~~~~~~q~e~~k~q~e~~~~q~e~q~---~q~E~q~~q~e~e~~~~~~~~~~~e~~~~~a 656 (705) T protein:vir:88 580 NSPEALQAKAIREQKEAQPKPEDIKAQADAQRAQSDALAKQAEAQM---KQVEAQIRLAEIELKKQEAVLQQREMALKEA 656 (705) T ss_pred hhHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 21111121111 1122222223333333333322222111111 111111111 111111111 Q ss_pred HHHHHHH-HHHHHHHHHHhhccCCchhhhccCCCCCCCcccCchhcCCCCCCCCcccccc Q lcl|NC_019423. 681 QSQGNQN-LQITKALTTPTKEGETTPNISAAVGYNTLTNGNSPQERDLAAQQDPAYSLGS 739 (756) Q Consensus 681 q~~~~~~-~~~~~a~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 739 (756) +.+.+++ .+..++........+.-...+... 
...+-|..+- |.-.+.- T Consensus 657 ~~~~~~~~~e~e~~~~e~e~~~e~~q~~~~~~-----~~~~~~~~~k------~~~~~rr 705 (705) T protein:vir:88 657 ELQLERDRFTWERARNEAEYHLEATQARAAYI-----GDGKVPETKK------PTKAVRR 705 (705) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHhHHHHHH------HHHHhcC Confidence 1111111 111111111111111111111110 0111122221 1111111 No 144 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=57.22 E-value=0.44 Score=22.54 Aligned_cols=457 Identities=10% Similarity=0.021 Sum_probs=160.9 Q ss_pred CCCCCCCCcccCHHHHHHHHHHHHHHHHhhcCCCCEE-----EEecCCcchHHHHHHHHHHHHHHHhhhcCCcchHHHHH Q lcl|NC_019423. 64 PPKIKGRSQVQPRLVRRQAEWRYAPLSEPFLSSSKLF-----KLTPVTFEDELAARQNELVLNYQFRTQLNKVKLVDDYV 138 (756) Q Consensus 64 ~~~~~grS~~v~~~v~~~~e~~~~~L~~~f~~~~~~~-----~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v 138 (756) .|. -|...+-.....-.|- +++..|+|..-+ .|-|.-....+..+....|-+|+ ..-++.|++ T Consensus 1 m~~---V~~~hp~y~~~~~~W~---~ird~~~G~~~~r~~g~~YLP~~~~e~~~~e~~~~Y~~rl------~rA~~~n~~ 68 (501) T protein:vir:95 1 MPN---VSFIRPELGKLLPLYY---LIRDAIAGEPTVKGARTTYLPMPNAEDQSKENKARYEAYL------KRAVFYNVA 68 (501) T ss_pred CCC---CCCCCHHHHHHHHHHH---HHHHHhcChHHHHhcccccCcCCCCCCCcccchHHHHHHh------hccccCchH Confidence 221 2223333445555565 466667766543 57775322111112223355554 123455666 Q ss_pred HHHhhcCceEEEEeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcce Q lcl|NC_019423. 139 HSIVDDGTGIARIGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATY 218 (756) Q Consensus 139 ~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~ 218 (756) ++.|..=+|.+- +. .|+.. ....+ +...++-......+.+-.+..+.....+|.+.. 
T Consensus 69 ~~t~~~l~G~vf--~k---------~p~~~----~p~~l--------~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~i 125 (501) T protein:vir:95 69 RRTLFGLVGQVF--MR---------DPVVK----VPALL--------NPLVANATGSGINLTQLAKRAVSLNLAYSRAGL 125 (501) T ss_pred HHHHHHHhhhhh--cC---------Cccee----CcHHH--------HHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEE Confidence 666654444222 21 11111 11221 122222222233344445555566666776554 Q ss_pred eccC---ceeEEE---eeeeecCceeEEEechhheEeCCCCcCc--cccCceEEEEeecCHHHHHhhccchhhhcccCch Q lcl|NC_019423. 219 AIQT---GVTEVE---VEKALVNRPTVEMLNPNNVVIDPSCNGD--LDKALYAVISFETCKADLMKNKDRYHNLDKIDWE 290 (756) Q Consensus 219 ~~~~---g~~~~~---~~~~~~g~~~ie~V~p~~~~~Dp~a~~d--~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~ 290 (756) -+-+ +.+... .++...-+|.+..++|++|+ ++..... .....++..+...+. T Consensus 126 lVD~P~~~~~~~~t~a~~~~~~~rPy~~~~~~~~Ii-nW~~~~v~g~~~l~~v~l~E~~~~------------------- 185 (501) T protein:vir:95 126 LVDYPTTEAEGGASIADLEAGRIRPTLYVYSPTEII-NWRTTDRGAEEVLSLVVLFETWCA------------------- 185 (501) T ss_pred EEeecCCCCcccccHHHHHhccCCcEEEEecHhhhc-CcceeccCCceeeeEEEEEEEEee------------------- Confidence 3311 000000 00111125889999999986 4543211 111222211111100 Q ss_pred hhhhhhchhhhccccccccccccccceEEE----------EEEEEEeeccCCceeEEEEEEEECCE-------EEEeccc Q lcl|NC_019423. 291 SSSPITDPDHESKTPSDFQFKDALRKKVVA----------YEYWGFYDINDDGSLEPIVATWIGST-------LIRMENN 353 (756) Q Consensus 291 ~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v----------~E~w~k~d~~~~g~~~~~~~~~~g~~-------~L~~~~~ 353 (756) .+..+.......+|+ +++|.+-. .+...... +.-|.. ....+.+ T Consensus 186 ---------------~d~~f~~~~~~q~RvL~~~~~g~~~~~v~r~~~---~~~~~~~~-~~~~~~~~~~~~~~~~~g~~ 246 (501) T protein:vir:95 186 ---------------ADDGFEMKTSGQFRVLRLDEEGYYVHEIWREPQ---PTKADGSK-IPKGNYQQYVVYKPTDAQGK 246 (501) T ss_pred ---------------cCCCcccceeEEEEEEeeCCCceEEEEEEEecC---CcccCcce-ecCCcccccceeeeeccCCC Confidence 000011111112222 23332211 00000000 000111 1111112 Q ss_pred ccCCCccceEEeeeeeecCccc--CCchHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccch----hhhhcc Q lcl|NC_019423. 
354 PFPDGKLPLVVVPYMPRKRELF--GEADAELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNR----RRYDDG 427 (756) Q Consensus 354 P~~~~~~Pfv~~~~~~~~~~~~--G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~----~~~~~~ 427 (756) ..+.+||+++ -..++.+ +.+..-.+..+...+=...+-..++++.++.|...+. |.-+.... .....+ T Consensus 247 --~l~~IPfv~~---~~~~~~~~~~~pPLl~lA~lni~hy~~ssd~~~~l~~~~~P~l~i~-G~~~~~~~~~~~~~i~~G 320 (501) T protein:vir:95 247 --RLTEIPFMFI---GSENNDSNPDNPNFYDLASLNMAHYRNSADYEESCYIVGQPTPVLI-GLTEEWVTNVLKGSVNFG 320 (501) T ss_pred --cCCeeeEEEE---ecCCCCCCCCccchHHHHHHHHHHHhhhhHHHHHHHHcccceeeee-CCcccccccCCCCceeec Confidence 2345666644 2233322 3444556655554432222335566777777765542 32211100 000111 Q ss_pred ccccccccccccccccccccCCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 428 QDYEYNPMQGNPSQSIMEHKFPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILR 507 (756) Q Consensus 428 ~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~ 507 (756) .. .......++.+.++++.... -....|+...+.|..+ |. +... .+..+.||++.+.-..+....|..++. T Consensus 321 ~~---~~~~lP~~~~~~~ie~~~~~-i~~~~l~~l~~~m~~~-Ga-~ll~---~~~~~~Ta~~~~~~~~~~~S~L~~~a~ 391 (501) T protein:vir:95 321 SR---GGIPLPVGADAKLLQASENT-MLKEAMDTKERQMVAL-GA-KLVE---QKEVQRTATEAELEAASEGSTLSSATK 391 (501) T ss_pred cc---ccccCCCCCceeEEecChhh-HHHHHHHHHHHHHHHH-HH-hhcc---CCccchhHHHHHHHHHHHhHHHHHHHH Confidence 11 11112223334444432211 1123344444444332 32 2222 222347887777766677777888888 Q ss_pred HHHHHHHHHHHHHHHHHHhhCCC--Cc-EEEEecCceeecCHhHhcCcceEEEecccccHHHHHHHHHHHHHHHhhccCC Q lcl|NC_019423. 508 RLAKGMADIGTKICAMNAVFLSE--KE-VVRITNEQYVEIKREDLKGNFDIEVDINTAEIDNQKSQDLGFMVQTLGNTVD 584 (756) Q Consensus 508 n~~~~~~~l~~~~l~li~q~~~~--~r-~iRI~g~~~v~i~~d~~~~~~Dv~V~~g~a~~~~~~~q~l~~llq~~~~~~~ 584 (756) |+++++.. +|.++.+|... .. .|.| +.+|. . ........+.+..+.+ +..+. 
T Consensus 392 ~le~al~~----~l~~~a~w~g~~~~~~~v~i-~~df~------------~------~~~~~~~~~al~~~~~--~G~is 446 (501) T protein:vir:95 392 NVSAAFEW----ALKWAARWVGQADSGVKFEL-NTDFD------------I------ARMTPDERRSLVEEWQ--KGAIT 446 (501) T ss_pred HHHHHHHH----HHHHHHHHcCCCCCceEEEE-ecccc------------c------ccCCHHHHHHHHHHHh--CCCCc Confidence 88777655 45555555532 21 1222 22221 1 0111112222333322 12233 Q ss_pred HhHHHHHHHHHHhhcCChh-----HHHHhhhccCCCChhhhhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 585 QSITLSLVAKIAELKRMPD-----LAHELRTWQPQPDPMEEQLKQLAIQKAQLENEELQSKIALNNAK 647 (756) Q Consensus 585 ~~~~~~~l~~l~e~~~~~~-----~~~~l~~~~~q~~p~~~~~~q~~~~~aq~e~~~~qa~a~~~~a~ 647 (756) ..... ..+...++.+ ..+.+......++........... ..-.+. ...++ T Consensus 447 ~~t~~----~~L~~~~v~~~~~~~e~e~i~~~~~~~~~~~~~~~~~~~-~~gg~~--------~~~~~ 501 (501) T protein:vir:95 447 FEEMR----TGLRKAGVATEDDSKAKEKIAKDTAEAMALATPANVPGD-GSGGDN--------VGNSE 501 (501) T ss_pred HHHHH----HHHHhCCCCChhHHHHHHHHHhhhcCcccccccCCCCCC-Cccccc--------ccCCC Confidence 22222 2222223321 111111111110000000000000 000000 00000 No 145 >protein:vir:97265 Length: 513 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:584 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294521;genbank:gi:149408242;genbank:GeneID:5237130 Probab=55.67 E-value=0.47 Score=22.36 Aligned_cols=468 Identities=12% Similarity=0.050 Sum_probs=174.1 Q ss_pred ccccCCCCCCCCCCCcccCHHHHHHHHHHHHHHHHhhcCCCCEE-----EEecCCcchHHHHHHHHHHHHHHHhhhcCCc Q lcl|NC_019423. 57 EVKGKAKPPKIKGRSQVQPRLVRRQAEWRYAPLSEPFLSSSKLF-----KLTPVTFEDELAARQNELVLNYQFRTQLNKV 131 (756) Q Consensus 57 ~~~~~~~~~~~~grS~~v~~~v~~~~e~~~~~L~~~f~~~~~~~-----~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~ 131 (756) +.+ . .++--|...+-.....-.|- +++..|+|...+ .|-|..+. + ....|-+|+ .. 
T Consensus 1 m~~---~--~~~~v~~~h~~y~a~~~~W~---~ird~~~G~~~~r~~g~~YLPk~~~-E----~~~~Y~~rl------~r 61 (513) T protein:vir:97 1 MAD---K--DPKSPATTSGAYDQMLPRWH---VIETLLGGTEAMREAGETYLPRHQE-E----TDKGYQERL------AS 61 (513) T ss_pred CCC---C--CCCCCCcCCHHHHHHHHHHH---HHHHHhcChHHHHhhcccCCCCCCC-C----CHHHHHHHH------hc Confidence 211 1 12222223333344555665 466667665432 46666543 1 222355554 23 Q ss_pred chHHHHHHHHhhcCceEEEEeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHH Q lcl|NC_019423. 132 KLVDDYVHSIVDDGTGIARIGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFN 211 (756) Q Consensus 132 ~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~ 211 (756) -++.|+++++|..=+|.+-. +.++.+ ....+.+.+.+ .++-......+..-.+..+.... T Consensus 62 A~~~n~~~~tl~~l~G~vf~-----------k~p~~~--~~~p~~~~~~l-------~~d~D~~G~~L~~f~~~~~~~~l 121 (513) T protein:vir:97 62 AVLLNMVEQTLDTLSGKPFS-----------EPIKLN--EDVPKAIEETI-------LPDVDLQGNNLDVFARQWFREGM 121 (513) T ss_pred ccCCChHHHHHHHHhhhhhh-----------cCcccC--cCchHHHHHHH-------hhccCCCCCCHHHHHHHHHHHHH Confidence 33556666666554443321 011111 11122222211 11112222333344455566666 Q ss_pred hcCCcceeccCc----eeE----EEee-eeecCceeEEEechhheEeCCCCcCccc---cCceEEEEeecCHHHHHhhcc Q lcl|NC_019423. 212 ETGEATYAIQTG----VTE----VEVE-KALVNRPTVEMLNPNNVVIDPSCNGDLD---KALYAVISFETCKADLMKNKD 279 (756) Q Consensus 212 ~~G~~~~~~~~g----~~~----~~~~-~~~~g~~~ie~V~p~~~~~Dp~a~~d~~---da~~v~~~~~~t~~el~~~~~ 279 (756) .+|.+..-+-+- .++ ..-+ +...-+|.+..++|++|+ ++.... .+ ...++..+.... 
+ T Consensus 122 ~~G~~~ilVD~P~~~~~~~~~~~T~Ade~~~~~rPy~~~~~~e~Ii-nW~~~~-v~G~~~L~~v~l~E~~~--~------ 191 (513) T protein:vir:97 122 AKALCHVLIDMPRPAPREDGQPRTLADDRREGLRPYWVMIKPECLL-FARSEV-INGVEVLQHVRIIEHYM--E------ 191 (513) T ss_pred hcCeEEEEEecCCCCCccchhHHhHHHHHhhccCceEEEecHhhhc-Ccceec-cCcceeeeeEEEEEEEe--e------ Confidence 777655433110 000 0000 011115888888888886 554322 11 111111111000 0 Q ss_pred chhhhcccCchhhhhhhchhhhccccccccccccccceEEEE-----EEEEEeeccCCceeEEEEEEEECC-EEEEeccc Q lcl|NC_019423. 280 RYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVAY-----EYWGFYDINDDGSLEPIVATWIGS-TLIRMENN 353 (756) Q Consensus 280 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~-----E~w~k~d~~~~g~~~~~~~~~~g~-~~L~~~~~ 353 (756) .+.+......+++|+ +.|.+.... ... .+. .+...... T Consensus 192 ---------------------------~Dgf~~~~~~q~rvL~~g~~~v~r~~~~~-~~~--------~~e~~~~~~g~~ 235 (513) T protein:vir:97 192 ---------------------------QDGFAEVCKRRIRVLEPGLVQLWEPVKKS-NAQ--------KEEWALADEWAT 235 (513) T ss_pred ---------------------------cCCCcceEEEEEEEEeCceEEEEEeecCC-Ccc--------ccceEEecCCCC Confidence 000111111123332 222111100 000 011 12222222 Q ss_pred ccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhhhhcccccccc Q lcl|NC_019423. 354 PFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYN 433 (756) Q Consensus 354 P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~ 433 (756) + .+.+||+++..... +...+.+.+..+..+...+=...+-.-++++.+..|...+. |. +.........+.+. T Consensus 236 ~--l~~IP~v~~~~~~~-~~~~~~pPLl~LA~ln~~hy~~~Sd~~~il~~~~~P~l~~~-G~-~~~~~~~i~iG~~~--- 307 (513) T protein:vir:97 236 G--LNYVPLVTFYADRQ-GFMMGKPPLLDLAHLNVAHWQSASDQRHILTVSRFPILACS-GA-SGEDSDPVVVGPNK--- 307 (513) T ss_pred c--CCceeEEEEecCCC-CCCCCccchHHHHHHHHHHHhhhhhHHHHHHhcccceeeee-cC-CcCCCCceEeeccc--- Confidence 2 36678887754432 23346677778888877766666777778888888876663 32 11100011111111 Q ss_pred cccccc--ccccccccCCCcc-hHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 
434 PMQGNP--SQSIMEHKFPELP-QSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLA 510 (756) Q Consensus 434 ~~~~~~--~~~i~~~~~~~~~-~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~ 510 (756) ....| +..+.++++..-+ .....-+..+.+.|.. .|.. .... +..+.||++.+.-..+....|..++.++. T Consensus 308 -~~~lpe~~~~~~yie~~g~~i~~~~~~l~~le~qm~~-~Ga~-ll~~---~~~~~Ta~a~~~~~~~~~S~L~~~a~~le 381 (513) T protein:vir:97 308 -VLYNPDPAGRFYYVEHTGQAIAAGRTDLKDLEEQMAG-YGAE-FLKR---KTGGQTATARALDSAEATSDLSAMTGLFE 381 (513) T ss_pred -cccCCCCCCcceeeccCchhHHHHHHHHHHHHHHHHH-HHHH-hhcc---CCccccHHHHHHHHHHHHHHHHHHHHHHH Confidence 11122 2335555544322 2223344555555533 3432 2222 22247887777777777778888888887 Q ss_pred HHHHHHHHHHHHHHHhhCCC-Cc--EEEEecCceeec--CHhHhcCcceEEEecccccHHHHHHHHHHHHHH---Hhhcc Q lcl|NC_019423. 511 KGMADIGTKICAMNAVFLSE-KE--VVRITNEQYVEI--KREDLKGNFDIEVDINTAEIDNQKSQDLGFMVQ---TLGNT 582 (756) Q Consensus 511 ~~~~~l~~~~l~li~q~~~~-~r--~iRI~g~~~v~i--~~d~~~~~~Dv~V~~g~a~~~~~~~q~l~~llq---~~~~~ 582 (756) +++...+ .++.+|+.. .. .|.| +.+|... ++..+..-+.. +..| ..+ + +.+...|+ .+.+. T Consensus 382 ~al~~~l----~~~a~wlg~~~~~~~v~i-n~dF~~~~~~~~~~~al~~a-~~~G--~is--~-~t~~~~L~r~gvl~~d 450 (513) T protein:vir:97 382 DALAQAL----DITADWLRLGPNGGTVEL-VKDYDLEEMDAPGLQALQVA-REKR--DIS--R-KTYLNGLRLRGVLPED 450 (513) T ss_pred HHHHHHH----HHHHHHhCCCCCccEEEe-ccccCcccCCHHHHHHHHHH-HhCC--CCC--H-HHHHHHHHhccCCCcc Confidence 7665554 455555432 11 1222 2222210 01111000000 0000 000 0 01111111 22233 Q ss_pred CCHhHHHH-HHHHHHhhcCChhHHH-HhhhccCCCCh----hhhhHHHHHHH-----HHHHHH Q lcl|NC_019423. 583 VDQSITLS-LVAKIAELKRMPDLAH-ELRTWQPQPDP----MEEQLKQLAIQ-----KAQLEN 634 (756) Q Consensus 583 ~~~~~~~~-~l~~l~e~~~~~~~~~-~l~~~~~q~~p----~~~~~~q~~~~-----~aq~e~ 634 (756) +++....+ +..++-+..+..+... .+....+.... ...+..+-+.. ..--+. 
T Consensus 451 ~d~~~~~e~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) T protein:vir:97 451 FDEDEDWEELMEEISEAMGRAGLDLDPAQKNPPEGGEGEGEGEGEGGEGGEGGEGGGNPGGES 513 (513) T ss_pred CCHHHHHHHHHHhhhhccCCCCccccccCCCCCCCCCCCCCCCCCCCCCCCccccCCCCCCCC Confidence 33333222 2222333322211110 00000000000 00000000000 000000 No 146 >protein:vir:77597 Length: 725 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:YP_063735;genbank:gi:51236726;genbank:GeneID:2944241 Probab=53.80 E-value=0.52 Score=22.14 Aligned_cols=605 Identities=11% Similarity=-0.018 Sum_probs=185.5 Q ss_pred HHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCCCCCCcccCHHHHHHHHHHHHHHHHhhcCCCCEEEEecCCc Q lcl|NC_019423. 28 IQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGKAKPPKIKGRSQVQPRLVRRQAEWRYAPLSEPFLSSSKLFKLTPVTF 107 (756) Q Consensus 28 ~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~grS~~v~~~v~~~~e~~~~~L~~~f~~~~~~~~~~p~~~ 107 (756) ++..++.++.++++....+....+|..-+.-+. -|..|+ .| T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~-------------------------------~fy~G~--------Qw 41 (725) T protein:vir:77 1 MADNENRLESILSRFDADWTASDEARREAKNDL-------------------------------FFSRVS--------QW 41 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHH-------------------------------HhhCCC--------CC Confidence 555555555555554443333222222222111 122222 33 Q ss_pred chHHHHHHHHHHHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeeeeeeeeeecCCCCCHHHHHHHHHhHHH Q lcl|NC_019423. 108 EDELAARQNELVLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIKTETPVFQLYPIENQEQADVLQQALQL 187 (756) Q Consensus 108 ~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 187 (756) .++..+. + +..|.+ +.|.|+-.+..-.|.-+ .+++ .+.+...+..+.+.++.+...+.. 
T Consensus 42 ~~~~~~~-----l------~~q~rp-~~N~i~~~i~~v~g~~~-----~nr~----d~~v~P~~~~d~~~Ae~l~~~~~~ 100 (725) T protein:vir:77 42 DDWLSQY-----T------TLQYRG-QFDVVRPVVRKLVSEMR-----QNPI----DVLYRPKDGARPDAADVLMGMYRT 100 (725) T ss_pred CHHHHHH-----H------HhcCCC-ccccHHHHHHHHHhhHH-----hCCc----ceEEecCCccHHHHHHHHHHHHHH Confidence 3333321 1 112323 23455444444334111 1211 223444567788888888888877 Q ss_pred hhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEEE-echhheEeCCCCcCccccCceEEEE Q lcl|NC_019423. 188 QAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVEM-LNPNNVVIDPSCNGDLDKALYAVIS 266 (756) Q Consensus 188 ~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie~-V~p~~~~~Dp~a~~d~~da~~v~~~ 266 (756) +...- ........++.+.+.+|.|+ .++..+-........+. |....++.||.. .-|=... T Consensus 101 ~~~~~-----~~~~a~s~Af~~~i~~G~G~-------~ev~~d~~~~d~~~~~~~i~~~~~~~~~~~------v~~Dp~a 162 (725) T protein:vir:77 101 DMRHN-----TAKIAVNIAVREQIEAGVGA-------WRLVTDYEDQSPTSNNQVIRREPIHSACSH------VIWDSNS 162 (725) T ss_pred HHHhh-----CchhHHHHHHHHHhhcCcce-------eeeeecccCCCCCCCceeeEEeecccChhh------ceeCchh Confidence 65422 22233445666777776654 33322211111011100 011111111211 0011113 Q ss_pred eecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceEEEEEEEEEeeccCCceeEEEEEEEEC-- Q lcl|NC_019423. 267 FETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVAYEYWGFYDINDDGSLEPIVATWIG-- 344 (756) Q Consensus 267 ~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g-- 344 (756) +..++++...++ +...++. +.. ....+.+...-.....+.+. . - .+.-|+. .+.-.+.++++..+.. 
T Consensus 163 ~~~D~sDar~~~-~~~~~~~---d~~-~~~~~~~~~~~~~~~~~~~~-~--~-~~~~~~~--~d~vrv~E~~~r~~~~~~ 231 (725) T protein:vir:77 163 KLMDKSDARHCT-VIHSMSQ---NGW-EDFAEKYDLDADDIPSFQNP-N--D-WVFPWLT--QDTIQIAEFYEVVEKKET 231 (725) T ss_pred hccChhhHHHHH-HHhcCCH---HHH-HHHHhhCCcchhhccccccc-c--c-ccccccC--CCeeEEEEEEEEEEEeeE Confidence 344555554332 1122211 100 00111111000000000000 0 0 0111221 1111122222211111 Q ss_pred ---------CEEEEeccccc--------CCC-----------------------------ccceEEeeeeeecCccc--- Q lcl|NC_019423. 345 ---------STLIRMENNPF--------PDG-----------------------------KLPLVVVPYMPRKRELF--- 375 (756) Q Consensus 345 ---------~~~L~~~~~P~--------~~~-----------------------------~~Pfv~~~~~~~~~~~~--- 375 (756) +.++....+-+ ..| .+|.-.|++.|.-+... T Consensus 232 ~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~rv~~~~~~g~~~l~~~~~~~~~~~P~vP~~g~r~~~~ 311 (725) T protein:vir:77 232 AFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVE 311 (725) T ss_pred EEEecCCCCcceeecChhhHHHHHHHhhhcCchhhhhcccceeeeeEeeecCceeeccCCcCCCCccceEEEeeeeeccC Confidence 11111111100 000 01111223333222111 Q ss_pred CCchHHHh-HHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhhhhcccccccccccccccccccc--------- Q lcl|NC_019423. 376 GEADAELL-GDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQGNPSQSIME--------- 445 (756) Q Consensus 376 G~g~v~~~-~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~~~~~i~~--------- 445 (756) |......+ +++-+.-......+.-.+...+........+..+..+........ .+.+.+.....+.. T Consensus 312 g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~---~~~~~~~~~~~~~~~~g~~~~~~ 388 (725) T protein:vir:77 312 DKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDG---NDDYPYYLLNRTDENSGDLPTQP 388 (725) T ss_pred CcccccchhhhhhhHHHHHHHHHHHHHHHHHhccccccccchhhhhHHHHHHHh---ccCCceecccccccCCCcccccC Confidence 11111122 222222222222222223333333333333333333333222211 11111111111111 Q ss_pred ccCCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 
446 HKFPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMA-ILRRLAKGMADIGTKICAMN 524 (756) Q Consensus 446 ~~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~-~~~n~~~~~~~l~~~~l~li 524 (756) +.....++-....++++...... -...+|+++..+|...+++|+++.++.+.... ..-.|-+.++.-.+.+.+++ T Consensus 389 i~~~~~~~lp~~~~~ll~~~~~~----i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~~l 464 (725) T protein:vir:77 389 LAYYENPEVPQANAYMLEAATSA----VKEVATLGVDTEAVNGGQVAFDTVNQLNMRADLETYVFQDNLATAMRRDGEIY 464 (725) T ss_pred ccccCCCCchHHHHHHHHHHHHH----HHHHhCCCHHHhCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11112233333444444444444 35556999888888888899988777765443 33455566777777777777 Q ss_pred HhhCCCCcEEEEecCceeecCHhHhcCcceEEEecc---cccHHHHHHHHHHHHHHH---hhccCCH--hHHHHHHHHHH Q lcl|NC_019423. 525 AVFLSEKEVVRITNEQYVEIKREDLKGNFDIEVDIN---TAEIDNQKSQDLGFMVQT---LGNTVDQ--SITLSLVAKIA 596 (756) Q Consensus 525 ~q~~~~~r~iRI~g~~~v~i~~d~~~~~~Dv~V~~g---~a~~~~~~~q~l~~llq~---~~~~~~~--~~~~~~l~~l~ 596 (756) ..+...- .+.+..+.|...+- ....++++.. +..+.......+..-... .+|..+- +.....+.+++ T Consensus 465 L~lI~~~----~~~~rv~RI~~ed~-~~~~v~in~~~~~~~~G~~~~~NDi~g~~Dv~v~~~p~~~s~r~~~~~~l~qll 539 (725) T protein:vir:77 465 QSIVNDI----YDVPRNVTITLEDG-SEKDVQLMAEVVDLATGEKQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELL 539 (725) T ss_pred HHHHHHH----cCCCcEEEEecCCC-CcceeeecccccccccchhHhhhhhccceeeEEeeccchHHHHHHHHHHHHHHH Confidence 6553321 11223455544432 1223333321 111111111111000000 0111110 01112222222 Q ss_pred hhcC--ChhHHHHhhhccCCCC----------------hhhh-hHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 597 ELKR--MPDLAHELRTWQPQPD----------------PMEE-QLKQLAI---QKAQLENEELQSKIALNNAKAKEAASS 654 (756) Q Consensus 597 e~~~--~~~~~~~l~~~~~q~~----------------p~~~-~~~q~~~---~~aq~e~~~~qa~a~~~~a~a~~~~aq 654 (756) .... .+.....+..+..-++ +... 
++.++++ .+.+++.++.++++.+.++++..++++ T Consensus 540 ~~~~~~~~~~~~~l~~~~~l~d~~~~~e~~erirkq~~~~~~~q~~~~~e~q~~~~~qq~~~~q~~~e~~q~q~~~~~~q 619 (725) T protein:vir:77 540 GKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQGVLLQGQ 619 (725) T ss_pred HhccccchhHHHHHHHhhccccchHHHHHHHHHHhhhhhhhccCCCChhhHHHHHHHHHHHHHhHHHHHHHHHHHHHHHH Confidence 2211 1111111211111111 1100 0000111 111112222233333333444334444 Q ss_pred HHHHHHHHHHHHHH---HHHH-------HHHHHHHHHHHHHHHHHH---HHHHHHHhhc------cCCchhhhccC-C-- Q lcl|NC_019423. 655 GDLKDLDYLEQESG---TKHA-------RDMEKQKAQSQGNQNLQI---TKALTTPTKE------GETTPNISAAV-G-- 712 (756) Q Consensus 655 ~~~~~~~~~~q~~~---~k~~-------~~~~~~~~q~~~~~~~~~---~~a~~~~~~~------~~~~~~~~~a~-~-- 712 (756) +++++++......+ .+.. ++......++.+.++.+. ++++...+.. +.+...+++.. + T Consensus 620 a~~~kaq~e~~k~q~~a~~~~~~a~~~aa~~~~~~~q~~~~q~a~~~~~~~~~~~~q~~~~~~~~~~ae~~~~~~~~~~~ 699 (725) T protein:vir:77 620 AELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDARANAELLLKGDEQTHK 699 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHhhhHHHh Confidence 44433322211111 1111 111111122222222111 1111111111 11111112210 0 Q ss_pred -----CCCCCcccCchhcCCCCCCCCcccccccc Q lcl|NC_019423. 713 -----YNTLTNGNSPQERDLAAQQDPAYSLGSQY 741 (756) Q Consensus 713 -----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 741 (756) ++++. +.. ..++|+---.|+. 
T Consensus 700 q~~~~~~~~~---~~~-----~~~~~~~~~~~~~ 725 (725) T protein:vir:77 700 QRMDIANILQ---SQR-----QNQPSGSVAETPQ 725 (725) T ss_pred hHHHHHHHHH---HHH-----hcCCCcCcccCCC Confidence 01111 010 1112211111111 No 147 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=39.90 E-value=1 Score=20.59 Aligned_cols=263 Identities=5% Similarity=-0.028 Sum_probs=90.9 Q ss_pred ceeccCceeEEEeeeeecCceeEEEechhheEe--CCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhh Q lcl|NC_019423. 217 TYAIQTGVTEVEVEKALVNRPTVEMLNPNNVVI--DPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSP 294 (756) Q Consensus 217 ~~~~~~g~~~~~~~~~~~g~~~ie~V~p~~~~~--Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~ 294 (756) +. ...+++-+..+ ..+ ++...++ .|+ .++|-.++...- ...+...+ T Consensus 1 ia-----~l~~~~~~~~~---~~~--~~l~~lL~~~PN--------------~~~t~~~f~~~~--~~~ll~~G------ 48 (278) T protein:vir:78 1 MA-----SLPLKMYEDYK---VVN--TEVSDLLTVSPN--------------NSLSSFDFINQI--ETIRNEKG------ 48 (278) T ss_pred Cc-----cceeEEEecCc---ccc--cHHHHHHHhcCC--------------CCCCHHHHHHHH--HHHHhhcC------ Confidence 11 11111110000 000 0000000 011 112222221110 00000000 Q ss_pred hhchhhhccccccccccccccceEEEEEEE------EEeeccCCceeEEEEEEEECCEEEEecccccCCCccceEEeeee Q lcl|NC_019423. 295 ITDPDHESKTPSDFQFKDALRKKVVAYEYW------GFYDINDDGSLEPIVATWIGSTLIRMENNPFPDGKLPLVVVPYM 368 (756) Q Consensus 295 ~~~~~~~~~~~~~~~~~d~s~~~V~v~E~w------~k~d~~~~g~~~~~~~~~~g~~~L~~~~~P~~~~~~Pfv~~~~~ 368 (756) +...... .+..+. +.++| +.+..+.+|....+.+...++..+. |+.+. .+++... 
T Consensus 49 -----na~~~i~----r~~~G~---~~~l~~l~~~~v~v~~~~~~~~~~y~~~~~~g~~~~-----~~~~e--vih~~~~ 109 (278) T protein:vir:78 49 -----NAYVLIE----RDIYHQ---PSKLFLLNPDVVEMLIENQSRELYYSIHAATGNKLI-----VHNMD--MLHFKHI 109 (278) T ss_pred -----CEEEEEE----ECCCCc---EEEEEEECCceeEEEEcCCCceEEEEEEcCCceEEE-----Ecccc--EEEECCC Confidence 0000000 001111 12222 2222233333223333333332221 11111 3344443 Q ss_pred eecCcccCCchHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEee-ccccCccchhhh----hcccccccccccccccccc Q lcl|NC_019423. 369 PRKRELFGEADAELLGDNQAILGATMRGMIDLLGRSANGQRGYP-KGMLDTLNRRRY----DDGQDYEYNPMQGNPSQSI 443 (756) Q Consensus 369 ~~~~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~-~gav~~~~~~~~----~~~~~~~~~~~~~~~~~~i 443 (756) ...+.++|.|.+..+...-...+...+...... .+.+..++. .+.+++...... +........++....+..+ T Consensus 110 ~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~--~~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~ 187 (278) T protein:vir:78 110 VASNMVQGISPIDVLKNTTDFDNAVRTFNLTEM--QKPDSFMLKYGSNVGKEKRQQVLEDFKQYYEENGGILFQEPGVEI 187 (278) T ss_pred CCCCCeeeccHHHHHHHHHHHHHHHHHHHHHHh--cCCCcEEEEeCCCCCHHHHHHHHHHHHHHhccCCCceecCCCceE Confidence 345667899998888877766665444433222 223444443 333433211111 1111111112222222223 Q ss_pred ccccCCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHH Q lcl|NC_019423. 444 MEHKFPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMAILRRLAK-GMADIGTKICA 522 (756) Q Consensus 444 ~~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~~~~n~~~-~~~~l~~~~l~ 522 (756) .++.............++....+-..-||+....|...++...++ .+ ..++|.. .++++.+.+-+ T Consensus 188 ~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~---~~-----------~~~~~~~~~l~P~~~~i~~ 253 (278) T protein:vir:78 188 EPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKN---EE-----------LNRFYLQHTLLPIVKQYEE 253 (278) T ss_pred EEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccH---HH-----------HHHHHHHHHHHHHHHHHHH Confidence 333322222333455566777777888999999986544321111 11 1123332 45555555544 Q ss_pred HHH-hhCCCCcEEEEecCceeecCHhHh Q lcl|NC_019423. 
523 MNA-VFLSEKEVVRITNEQYVEIKREDL 549 (756) Q Consensus 523 li~-q~~~~~r~iRI~g~~~v~i~~d~~ 549 (756) -+- +.+++...- ...|+.+|.+.+ T Consensus 254 ~ln~~L~~~~e~~---~g~~~~f~~~~l 278 (278) T protein:vir:78 254 EFNRKLLTKTDRE---KIGILNLTLNLI 278 (278) T ss_pred HHHhhcCChhHhc---CCceEEEecccC Confidence 433 334432211 012455554444 No 148 >protein:vir:96783 Length: 488 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224240;genbank:gi:62362375;genbank:GeneID:3345722 Probab=27.76 E-value=1.8 Score=19.17 Aligned_cols=443 Identities=11% Similarity=0.057 Sum_probs=156.2 Q ss_pred cCCCCCCCCCC------CcccCHHHHHHHHHHHHHHHHhhcCCCCEE----EEecCCcc---h--------HHHHHHHHH Q lcl|NC_019423. 60 GKAKPPKIKGR------SQVQPRLVRRQAEWRYAPLSEPFLSSSKLF----KLTPVTFE---D--------ELAARQNEL 118 (756) Q Consensus 60 ~~~~~~~~~gr------S~~v~~~v~~~~e~~~~~L~~~f~~~~~~~----~~~p~~~~---D--------~~~A~q~t~ 118 (756) +. +--.+|-| +...+-.....-.|.+ ++. |+++.+- .|-|.-+. + .-+++-... T Consensus 1 ~~-~~~~~~~~~~~m~V~~~hp~y~a~~~~W~~---~~d-~g~~~~k~~g~~YLPk~~~~~~~~~~d~~y~~~~~~~~~~ 75 (488) T protein:vir:96 1 ML-KCLYIKHRGFFMLTPIYHPDYLVNAPQWLR---NLD-CVMDNIKRKKQTYLPNLGAIPPEAKTDPKVTALAAKIEKD 75 (488) T ss_pred Cc-eeEEEeecceeecccccCHHHHHHhhhhhH---hhh-hhhHHHHHhhhhcCCCCCCccccccCcchhhhhhccchhh Confidence 11 11112222 2223333444444532 221 4444332 35554221 1 112223334 Q ss_pred HHHHHHhhhcCCcchHHHHHHHHhhcCceEEEEeeeeeeeeeeeeeeeeecCCCCC-HHHHHHHHHhHHHhhhccchhcc Q lcl|NC_019423. 119 VLNYQFRTQLNKVKLVDDYVHSIVDDGTGIARIGWERKTVKIKTETPVFQLYPIEN-QEQADVLQQALQLQAENPREYDE 197 (756) Q Consensus 119 ~~n~~~~~~~~~~~~~~~~v~~al~~g~gi~k~~w~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~e 197 (756) |-+|.+. .-++.|+++++|..=+|.+-. +.+++. ..+ +.+.. ..++-+.... 
T Consensus 76 y~~~~~~-----rA~~~n~~~~tl~~l~G~vfr-----------k~p~~~---~~~~~~l~~--------l~~d~D~~G~ 128 (488) T protein:vir:96 76 WEDLTWR-----LANYVNIVNPTMNAITGAVMR-----------REPEFD---TMDNPVLIG--------LRDNIDGKGN 128 (488) T ss_pred hHhhhhh-----ccccCchhHHHHHHhcchhhc-----------cCceec---cCCcHHHHH--------HHhccCCCCC Confidence 4444321 223557777777664553321 112211 111 22211 1222222233 Q ss_pred ccchHHHHHHHHHHhcCCcceeccCceeEEE--ee-eeecCceeEEEechhheEeCCCCcCc--cccCceEEEEeecCHH Q lcl|NC_019423. 198 TMPEDIKEAVNYFNETGEATYAIQTGVTEVE--VE-KALVNRPTVEMLNPNNVVIDPSCNGD--LDKALYAVISFETCKA 272 (756) Q Consensus 198 ~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~--~~-~~~~g~~~ie~V~p~~~~~Dp~a~~d--~~da~~v~~~~~~t~~ 272 (756) .+.+-.+..+.....+|.+..-+-.= +... -+ +-..-+|.+..++|++|+ ++..... .....++..+...+ T Consensus 129 ~L~~f~~~~~~~~l~~G~~~ilVD~P-~~~~T~ade~~~~~rPy~~~~~a~~Ii-nW~~~~v~G~~~L~~v~lrE~~~-- 204 (488) T protein:vir:96 129 GIDQECKQALNALQWGSRCGWLVRSH-PESATMADWNKGKKLPTAAFYDALHII-DWEVEYIDGEEKLTYLSLLEDYQ-- 204 (488) T ss_pred CHHHHHHHHHHHHHhcCeEEEEEecC-CCcCCHHHHHHhcCCcEEEEechhhhc-CcceeccCCceeeEEEEEEEEEE-- Confidence 34444455556666666555422110 0000 00 001225888888888886 5543221 00111221111100 Q ss_pred HHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceEEEEEEEEEeeccCCceeEEEEEEEECC---EEEE Q lcl|NC_019423. 273 DLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVAYEYWGFYDINDDGSLEPIVATWIGS---TLIR 349 (756) Q Consensus 273 el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~E~w~k~d~~~~g~~~~~~~~~~g~---~~L~ 349 (756) .- ...++ .+...++++.+ .+|..+.++....+. .... T Consensus 205 ----------~~---------------------D~~~~--~~~~~~~~~~l-------~~g~~~v~~~~~~~~~~e~~~~ 244 (488) T protein:vir:96 205 ----------ER---------------------DGGTY--VSKQRLINHRL-------VDGLCEFQEVTDDEYSDEWTPV 244 (488) T ss_pred ----------ec---------------------cCCCc--ccceEEEEEEE-------ECcEEEEEEEecCCcccceEee Confidence 00 00001 11112222211 023222222211111 1111 Q ss_pred e-cccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHHHHHHHHHHHHhhcCCceEeeccccCccchhhhhccc Q lcl|NC_019423. 
350 M-ENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGATMRGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQ 428 (756) Q Consensus 350 ~-~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~~~~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~ 428 (756) . +.. ..+.+||+++..... +-..+.+..-++..++..+=...+-.-++++.+.-|.++..-+-.+. ... .... T Consensus 245 ~~g~~--~l~~IP~v~~~~~~~-~~~~~~pPLldLA~lnl~Hy~~ssd~~~il~~~~~p~lv~~~~~~~~--~~~-~~~~ 318 (488) T protein:vir:96 245 LINSK--QSDTIPFFLASSQSN-EWCIDSTPLTSLAEISLSIYVMNAYSNKAMILANEAKWMVDMGDMNK--TMA-SEMN 318 (488) T ss_pred cCCCc--ccCeeEEEEEecCCC-CCCCCCCchHHHHHHHHHHHhhhhHHHHHHHhcCCceeeeccCCCCc--ccc-cccc Confidence 1 111 234566665543321 22235556667777665544444555556666666666542111111 100 1000 Q ss_pred cccccc----cccccccccccccCCCcchHHHHHHHHHHHHHHHHhchhHHhcCCCccccchhHHHHHHHHHHHHHHHHH Q lcl|NC_019423. 429 DYEYNP----MQGNPSQSIMEHKFPELPQSAIVMTQMQNQEAESLTGVKAFSGGVTGSAYGDVAAGIRGALDAASKREMA 504 (756) Q Consensus 429 ~~~~~~----~~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~e~~tGv~~~~~G~~~~a~~~tA~~i~~~~~aa~~~l~~ 504 (756) ...... ....+.+...+.+.. .+.-....|+.+.+.|.. .|..-.. .+ .+.||++.+.-..+....|.. T Consensus 319 ~~g~~~~~~~~~~~~~g~~~~~e~~-~~~l~~~~l~~l~~qm~~-~Ga~l~~---~~--~~~Ta~~~~~~~~~~~S~L~~ 391 (488) T protein:vir:96 319 PLGFTLAGRMPYYVKNGDVKVIQAQ-FSPETENKVEKLFEQAVK-VGASLFT---QQ--SNETATGAAIRSGSSTASMAT 391 (488) T ss_pred cceeeecccccccccCCceeecCCc-hhHHHHHHHHHHHHHHHH-HhHhhcc---CC--CcchHHHHHHHHHHhhHHHHH Confidence 000000 001122223333221 111122334444444432 2332221 11 246787777767777777888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhCCCC------cEEEEecCceeecCHhHhcCcceEEEecccccHHHHHHHHHHHHHHH Q lcl|NC_019423. 505 ILRRLAKGMADIGTKICAMNAVFLSEK------EVVRITNEQYVEIKREDLKGNFDIEVDINTAEIDNQKSQDLGFMVQT 578 (756) Q Consensus 505 ~~~n~~~~~~~l~~~~l~li~q~~~~~------r~iRI~g~~~v~i~~d~~~~~~Dv~V~~g~a~~~~~~~q~l~~llq~ 578 (756) ++.++++++... |.++.+|+... .-+.| .||+ +|. ......+..+++..+.+. 
T Consensus 392 ~a~~le~al~~~----l~~~A~w~g~~~~~~~~~~~~~------~in~-----dF~------~~~ld~~~~~al~~~~~~ 450 (488) T protein:vir:96 392 LGNNVEDTVRNM----LRFIMRYFEGTNLYVNPDELVF------KLNR-----DYF------DVEVNPQMLQVAYAAMME 450 (488) T ss_pred HHHHHHHHHHHH----HHHHHHHcCCCCCCcCccceEE------Eecc-----CCC------CccCCHHHHHHHHHHHhc Confidence 888887776554 55555554321 00011 1111 111 111112222223333221 Q ss_pred hhccCCHhHHHHHHHHHHhhcCC--hh--HHHHhhhccCCCChh Q lcl|NC_019423. 579 LGNTVDQSITLSLVAKIAELKRM--PD--LAHELRTWQPQPDPM 618 (756) Q Consensus 579 ~~~~~~~~~~~~~l~~l~e~~~~--~~--~~~~l~~~~~q~~p~ 618 (756) ..+.......-|. ..|+ ++ ..+....+....-.+ T Consensus 451 --G~Is~~t~~~~L~----~~gvl~~d~~~e~~~~~ie~~g~~~ 488 (488) T protein:vir:96 451 --GNLPQVSWFELLK----RARVVRGDMSKEEFDEHIAELGFGM 488 (488) T ss_pred --CCCCHHHHHHHHH----hCCcCCccCCHHHHHHHHhhcCCCC Confidence 2233222222222 1222 11 111111111111111 No 149 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=25.48 E-value=2.1 Score=18.87 Aligned_cols=410 Identities=10% Similarity=0.029 Sum_probs=164.2 Q ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccC------CCCCCCCCC----------CcccCHHHHHHHHHH Q lcl|NC_019423. 22 WKKEPSIQLLKGDLESAKPAHDAIMSQIREWNDLMEVKGK------AKPPKIKGR----------SQVQPRLVRRQAEWR 85 (756) Q Consensus 22 ~~~~~~~~~l~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~------~~~~~~~gr----------S~~v~~~v~~~~e~~ 85 (756) |. ++....++.+ ...+|..--+.|.|+.. .=-|+..|. 
..+..+-++++++.+ T Consensus 1 m~----V~~~hp~y~a-------~~~~W~~~rd~~~G~~~~r~~g~~YLpk~~~E~~~~Y~~rl~rA~~~n~~~~t~~~~ 69 (452) T protein:vir:94 1 MP----IETKHPEYLA-------YENDWIDCRVASLGQREVKKKGVRFLPKLSGQTDDMYNAYKQRALFYSITSKTLSAL 69 (452) T ss_pred CC----CCCcCHHHHH-------HHHHHHHHHHHhcChHHHHcCCcccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHH Confidence 11 0000001111 11112222222222200 001222222 246778888888888 Q ss_pred HHHHHHhhcCCCCEEEEecCCcchHHHHHHHHHHHHHHHhhhcCCcch---HHHHHHHHhhcCceEEEEeeeeeeeeeee Q lcl|NC_019423. 86 YAPLSEPFLSSSKLFKLTPVTFEDELAARQNELVLNYQFRTQLNKVKL---VDDYVHSIVDDGTGIARIGWERKTVKIKT 162 (756) Q Consensus 86 ~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~---~~~~v~~al~~g~gi~k~~w~~~~~~~~~ 162 (756) ...+ |.-++.+++ | + ... ++|. ...|.++ +..+++.+|..|.+-+-|-|.. T Consensus 70 ~G~v----f~k~p~~~~-p---~------~l~-~~~~----D~~G~~L~~~~~~~~~~~l~~G~~~ilVD~p~------- 123 (452) T protein:vir:94 70 SGMV----LDQPPVITH-P---D------AMS-KYFE----DQSGIQFYEVFTRAVEETLLMGRVGVFIDRPL------- 123 (452) T ss_pred hchh----hcCCceecc-c---H------HHH-HHHh----cccCCCHHHHHHHHHHHHHhcCeEEEEEeecc------- Confidence 7777 555555543 1 1 111 1222 2345443 5577778887777666652210 Q ss_pred eeeeeecCCCCCHHHHHHHHHhHHHhhhccchhccccchHHHHHHHHHHhcCCcceeccCceeEEEeeeeecCceeEEEe Q lcl|NC_019423. 163 ETPVFQLYPIENQEQADVLQQALQLQAENPREYDETMPEDIKEAVNYFNETGEATYAIQTGVTEVEVEKALVNRPTVEML 242 (756) Q Consensus 163 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~g~~~ie~V 242 (756) ..++|.+..+ T Consensus 124 ----------------------------------------------------------------------~g~rPy~~~~ 133 (452) T protein:vir:94 124 ----------------------------------------------------------------------TGGDPYISVY 133 (452) T ss_pred ----------------------------------------------------------------------CCCceEEEEe Confidence 0135889999 Q ss_pred chhheEeCCCCcCccccCceEEEEeecCHHHHHhhccchhhhcccCchhhhhhhchhhhccccccccccccccceEEEEE Q lcl|NC_019423. 
243 NPNNVVIDPSCNGDLDKALYAVISFETCKADLMKNKDRYHNLDKIDWESSSPITDPDHESKTPSDFQFKDALRKKVVAYE 322 (756) Q Consensus 243 ~p~~~~~Dp~a~~d~~da~~v~~~~~~t~~el~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~s~~~V~v~E 322 (756) +|++|+ ++....+ ..-.++.-|..... ....+.+.......+||++ T Consensus 134 ~~~~Ii-~W~~~~~-g~l~~v~lre~~~~--------------------------------~d~~d~f~~~~~~~yRvL~ 179 (452) T protein:vir:94 134 TTENIL-NWEEDED-GRLLMVVLREFYTV--------------------------------RDTADRYVQNIRVRYRCLE 179 (452) T ss_pred chhhhc-Ccccccc-CCeeEEEEEEEEEE--------------------------------ecCCCcccceeEEEEEEEE Confidence 999987 5553221 11111111110000 0000111112112233332 Q ss_pred EEEEeeccCCceeEEEEEEEECCE--------EEEecccccCCCccceEEeeeeeecCcccCCchHHHhHHHHHHHHHHH Q lcl|NC_019423. 323 YWGFYDINDDGSLEPIVATWIGST--------LIRMENNPFPDGKLPLVVVPYMPRKRELFGEADAELLGDNQAILGATM 394 (756) Q Consensus 323 ~w~k~d~~~~g~~~~~~~~~~g~~--------~L~~~~~P~~~~~~Pfv~~~~~~~~~~~~G~g~v~~~~d~Q~~iN~~~ 394 (756) . + +|....++....++. ....+.++ .+.+||+++...... -..+.+...++..++..+.... T Consensus 180 l------~-~g~~~v~~~~~~~~~~~~~~~~~~~~~~~~~--l~~IP~v~~~~~~~~-~~~~~pPLl~LA~ln~~hy~~~ 249 (452) T protein:vir:94 180 L------V-DGLLQITVHETQDGKVWELAKTSTIQNVGVT--MDYIPFFCITPSGLS-MTPAKPPMIDIVDINYSHYRTS 249 (452) T ss_pred E------e-CCeEEEEEEEccCCceeeeccceeecCCCcc--cceeEEEEEcCCCCC-CCCCccchHHHHHHHHHHhcch Confidence 1 1 222111111111111 22222232 366788766544332 2357788889999999999888 Q ss_pred HHHHHHHHhhcCCceEeeccccCccchhhhhccccccccccccc-cccccccccCCCcc-hHHHHHHHHHHHHHHHHhch Q lcl|NC_019423. 395 RGMIDLLGRSANGQRGYPKGMLDTLNRRRYDDGQDYEYNPMQGN-PSQSIMEHKFPELP-QSAIVMTQMQNQEAESLTGV 472 (756) Q Consensus 395 ~~~~d~l~~~~~~~~~~~~gav~~~~~~~~~~~~~~~~~~~~~~-~~~~i~~~~~~~~~-~~~~~~l~~~~~~~e~~tGv 472 (756) +-..++++.++.|...+. |. +.. .....+... ++... ++..+.++.+..-+ .....-|+.+.+.|..+ |. 
T Consensus 250 sd~~~~l~~~~~P~l~~~-g~-~~~--~~i~iG~~~---~~~lpe~~~~~~yie~~g~~i~~~~~~l~~le~~m~~~-Ga 321 (452) T protein:vir:94 250 ADLEHGRHFTGLPTPWIT-GA-ESQ--STMHIGSTK---AWVIPEVAAKVGFLEFTGQGLQSLEKALSEKQAQLASL-SA 321 (452) T ss_pred hHHHHHHHHcccceeEee-cC-cCC--CceEecccc---cccCCCCCCcceEEccCchhHHHHHHHHHHHHHHHHHH-HH Confidence 888899999999876554 22 111 111112111 11111 23335555543222 12233344444444332 33 Q ss_pred hHHhcCCCccccchhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEecCceeecCHhHhcC Q lcl|NC_019423. 473 KAFSGGVTGSAYGDVAAGIRGALD-AASKREMAILRRLAKGMADIGTKICAMNAVFLSEKEVVRITNEQYVEIKREDLKG 551 (756) Q Consensus 473 ~~~~~G~~~~a~~~tA~~i~~~~~-aa~~~l~~~~~n~~~~~~~l~~~~l~li~q~~~~~r~iRI~g~~~v~i~~d~~~~ 551 (756) . ...+.. .+.+++....... +....|..++.++++++ ..+|.++..|...+--+.| .++++ T Consensus 322 ~-ll~~~~---~~~~s~ea~~~~~~~~~s~L~~~a~~~e~al----~~~l~~~a~w~g~~~~~~v------~~n~d---- 383 (452) T protein:vir:94 322 R-LIDNST---RGSEATETVKLRYMSETASLKSVTRAVEALL----NKAYSCIMDMESMGGTLNI------KLNSA---- 383 (452) T ss_pred H-hhccCC---CcchHHHHHHHHHHHhhHHHHHHHHHHHHHH----HHHHHHHHHHcCCCCceEE------Eeccc---- Confidence 2 222211 1233332222222 23456777777776665 5566677777654322222 12111 Q ss_pred cceEEEecccccHHHHHHHHHHHHHHHhhccCCHhHHHHHHHHHHhhcCChh-------HHHHhhhccC-----CCChhh Q lcl|NC_019423. 552 NFDIEVDINTAEIDNQKSQDLGFMVQTLGNTVDQSITLSLVAKIAELKRMPD-------LAHELRTWQP-----QPDPME 619 (756) Q Consensus 552 ~~Dv~V~~g~a~~~~~~~q~l~~llq~~~~~~~~~~~~~~l~~l~e~~~~~~-------~~~~l~~~~~-----q~~p~~ 619 (756) |.. ...+.+..+.+.++.+. ..+....... .+...|+.+ +..-+....+ +++|.. T Consensus 384 -F~~------~~~~~~~~~al~~~~~~--G~is~~t~~~----~L~~~gvl~~~~e~~~i~~E~~~~~~~~~~~~~~~~~ 450 (452) T protein:vir:94 384 -FLD------SKLTAAELKAWVEAYLS--GGISKEIYIH----ALKVGKVLPPPGESMGVIPDPPAPEPSPSNTPPNPSS 450 (452) T ss_pred -ccc------ccCCHHHHHHHHHHHhc--CCCcHHHHHH----HHHhCCCCCCccCHHHHHHHhhccCcccCCCCCCCcc Confidence 111 11111222223333221 2233322222 222222221 1111111110 011111 Q ss_pred hh Q lcl|NC_019423. 620 EQ 621 (756) Q Consensus 620 ~~ 621 (756) .. 
T Consensus       451 ~~  452 (452)
T protein:vir:94  451 KA  452 (452)
T ss_pred             CC
Confidence            11

Done!